LOCUS NC_000962 4411532 bp DNA circular BCT 29-MAR-2010 DEFINITION Mycobacterium tuberculosis H37Rv, complete genome. ACCESSION NC_000962 VERSION NC_000962.2 GI:57116681 DBLINK Project:224 KEYWORDS complete genome. SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales; Corynebacterineae; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Camus,J.C., Pryor,M.J., Medigue,C. and Cole,S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148 (PT 10), 2967-2973 (2002) PUBMED 12368430 REFERENCE 2 (bases 1 to 4411532) AUTHORS Cole,S.T., Brosch,R., Parkhill,J., Garnier,T., Churcher,C., Harris,D., Gordon,S.V., Eiglmeier,K., Gas,S., Barry,C.E. III, Tekaia,F., Badcock,K., Basham,D., Brown,D., Chillingworth,T., Connor,R., Davies,R., Devlin,K., Feltwell,T., Gentles,S., Hamlin,N., Holroyd,S., Hornsby,T., Jagels,K., Krogh,A., McLean,J., Moule,S., Murphy,L., Oliver,K., Osborne,J., Quail,M.A., Rajandream,M.A., Rogers,J., Rutter,S., Seeger,K., Skelton,J., Squares,R., Squares,S., Sulston,J.E., Taylor,K., Whitehead,S. and Barrell,B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393 (6685), 537-544 (1998) PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 3 (bases 1 to 4411532) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (13-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill,J. TITLE Direct Submission JOURNAL Submitted (11-JUN-1998) Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from AL123456. On Jan 5, 2005 this sequence version replaced gi:15607142. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..4411532 /organism="Mycobacterium tuberculosis H37Rv" /mol_type="genomic DNA" /strain="H37Rv" /db_xref="taxon:83332" gene 1..1524 /gene="dnaA" /locus_tag="Rv0001" /db_xref="GeneID:885041" CDS 1..1524 /gene="dnaA" /locus_tag="Rv0001" /function="PLAYS AN IMPORTANT ROLE IN THE INITIATION AND REGULATION OF CHROMOSOMAL REPLICATION. BINDS TO THE ORIGIN OF REPLICATION; IT BINDS SPECIFICALLY DOUBLE-STRANDED DNA AT A 9 BP CONSENSUS (DNAA BOX): 5'-TTATC(C/A)A(C/A)A-3'. DNAA BINDS TO ATP AND TO ACIDIC PHOSPHOLIPIDS. DNAA PROTEIN BINDS THE ORIGIN OF REPLICATION (oriC), ATP AND ADP, AND EXHIBITED WEAK ATPase ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="binds to the dnaA-box as an ATP-bound complex at the origin of replication during the initiation of chromosomal replication; can also affect transcription of multiple genes including itself." /codon_start=1 /transl_table=11 /product="chromosomal replication initiation protein" /protein_id="NP_214515.1" /db_xref="GI:15607143" /db_xref="GeneID:885041" /translation="MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQ RAWLNLVQPLTIVEGFALLSVPSSFVQNEIERHLRAPITDALSRRLGHQIQLGVRIAP PATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSYFTERPHNTDS ATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARAYNPLFIWGESGLGKTHLL HAAGNYAQRLFPGMRVKYVSTEEFTNDFINSLRDDRKVAFKRSYRDVDVLLVDDIQFI EGKEGIQEEFFHTFNTLHNANKQIVISSDRPPKQLATLEDRLRTRFEWGLITDVQPPE LETRIAILRKKAQMERLAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDK ALAEIVLRDLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYL CRELTDLSLPKIGQAFGRDHTTVMYAQRKILSEMAERREVFDHVKELTTRIRQRSKR" misc_feature 622..645 /gene="dnaA" /locus_tag="Rv0001" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1384..1440 /gene="dnaA" /locus_tag="Rv0001" /note="PS01008 DnaA protein signature" gene 2052..3260 /gene="dnaN" /locus_tag="Rv0002" /db_xref="GeneID:887092" CDS 2052..3260 /gene="dnaN" /locus_tag="Rv0002" /EC_number="2.7.7.7" /function="DNA POLYMERASE III IS A COMPLEX, MULTICHAIN ENZYME RESPONSIBLE FOR MOST OF THE REPLICATIVE SYNTHESIS IN BACTERIA. THIS DNA POLYMERASE ALSO EXHIBITS 3' TO 5' EXONUCLEASE ACTIVITY. THE BETA CHAIN IS REQUIRED FOR INITIATION OF REPLICATION ONCE IT IS CLAMPED ONTO DNA, IT SLIDES FREELY (BIDIRECTIONAL AND ATP-INDEPENDENT) ALONG DUPLEX DNA [CATALYTIC ACTIVITY: N deoxynucleoside triphosphate = N diphosphate + {DNA}N]." /experiment="experimental evidence, no additional details recorded" /note="binds the polymerase to DNA and acts as a sliding clamp" /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit beta" /protein_id="NP_214516.1" /db_xref="GI:15607144" /db_xref="GeneID:887092" /translation="MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSG VLLTGSDNGLTISGFDYEVSAEAQVGAEIVSPGSVLVSGRLLSDITRALPNKPVDVHV EGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQVAIAAGRDDT LPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDIEAAVLVPAKTLAEAAKAG IGGSDVRLSLGTGPGVGKDGLLGISGNGKRSTTRLLDAEFPKFRQLLPTEHTAVATMD VAELIEAIKLVALVADRGAQVRMEFADGSVRLSAGADDVGRAEEDLVVDYAGEPLTIA FNPTYLTDGLSSLRSERVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYV YLLMPVRLPG" gene 3280..4437 /gene="recF" /locus_tag="Rv0003" /db_xref="GeneID:887089" CDS 3280..4437 /gene="recF" /locus_tag="Rv0003" /function="THE RECF PROTEIN IS INVOLVED IN DNA METABOLISM AND RECOMBINATION; IT IS REQUIRED FOR DNA REPLICATION AND NORMAL SOS INDUCIBILITY. RECF BINDS PREFERENTIALLY TO SINGLE-STRANDED, LINEAR DNA. IT ALSO SEEMS TO BIND ATP." /experiment="experimental evidence, no additional details recorded" /note="Required for DNA replication; binds preferentially to single-stranded, linear DNA" /codon_start=1 /transl_table=11 /product="recombination protein F" /protein_id="NP_214517.1" /db_xref="GI:15607145" /db_xref="GeneID:887089" /translation="MYVRHLGLRDFRSWACVDLELHPGRTVFVGPNGYGKTNLIEALW YSTTLGSHRVSADLPLIRVGTDRAVISTIVVNDGRECAVDLEIATGRVNKARLNRSSV RSTRDVVGVLRAVLFAPEDLGLVRGDPADRRRYLDDLAIVRRPAIAAVRAEYERVLRQ RTALLKSVPGARYRGDRGVFDTLEVWDSRLAEHGAELVAARIDLVNQLAPEVKKAYQL LAPESRSASIGYRASMDVTGPSEQSDIDRQLLAARLLAALAARRDAELERGVCLVGPH RDDLILRLGDQPAKGFASHGEAWSLAVALRLAAYQLLRVDGGEPVLLLDDVFAELDVM RRRALATAAESAEQVLVTAAVLEDIPAGWDARRVHIDVRADDTGSMSVVLP" misc_feature 3367..3390 /gene="recF" /locus_tag="Rv0003" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 3634..3690 /gene="recF" /locus_tag="Rv0003" /note="PS00617 RecF protein signature 1" misc_feature 4243..4296 /gene="recF" /locus_tag="Rv0003" /note="PS00618 RecF protein signature 2" gene 4434..4997 /locus_tag="Rv0004" /db_xref="GeneID:887088" CDS 4434..4997 /locus_tag="Rv0004" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0004, (MTCY10H4.02), len: 187 aa. Conserved hypothetical protein (see Salazar et al., 1996), highly similar, but longer 21 aa in N-terminus, to AAF33696.1|AF222789 unknown protein from Mycobacterium avium subsp. paratuberculosis (166 aa); and highly similar to NP_301132.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (189 aa); S70990 hypothetical protein from Mycobacterium smegmatis (194 aa). Also highly similar, except in N-terminal part, to NP_599256.1|NC_003450 conserved hypothetical protein from Corynebacterium glutamicum (178 aa), 47.0% identity. Also very weakly similar in C-terminus to C-terminal part of P35925|YREG_STRCO HYPOTHETICAL 19.8 KDA PROTEIN (IN RECF-GYRB INTERGENIC REGION) from Streptomyces coelicolor (190 aa), FASTA scores: opt: 404, E(): 3.9e-18, (40.7% identity in 189 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214518.1" /db_xref="GI:15607146" /db_xref="GeneID:887088" /translation="MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAG RGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQW SAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSL KITGPAAPSWRKGPRHIAGRGPRDTYG" gene 5123..7267 /gene="gyrB" /locus_tag="Rv0005" /db_xref="GeneID:887081" CDS 5123..7267 /gene="gyrB" /locus_tag="Rv0005" /EC_number="5.99.1.3" /function="DNA GYRASE NEGATIVELY SUPERCOILS CLOSED CIRCULAR DOUBLE-STRANDED DNA IN AN ATP-DEPENDENT MANNER AND ALSO CATALYZES THE INTERCONVERSION OF OTHER TOPOLOGICAL ISOMERS OF DOUBLE-STRANDED DNA RINGS, INCLUDING CATENANES AND KNOTTED RINGS [CATALYTIC ACTIVITY: ATP-dependent breakage, passage and rejoining of double-stranded DNA]." /experiment="experimental evidence, no additional details recorded" /note="negatively supercoils closed circular double-stranded DNA" /codon_start=1 /transl_table=11 /product="DNA gyrase subunit B" /protein_id="NP_214519.1" /db_xref="GI:15607147" /db_xref="GeneID:887081" /translation="MGKNEARRSALAPDHGTVVCDPLRRLNRMHATPEESIRIVAAQK KKAQDEYGAASITILEGLEAVRKRPGMYIGSTGERGLHHLIWEVVDNAVDEAMAGYAT TVNVVLLEDGGVEVADDGRGIPVATHASGIPTVDVVMTQLHAGGKFDSDAYAISGGLH GVGVSVVNALSTRLEVEIKRDGYEWSQVYEKSEPLGLKQGAPTKKTGSTVRFWADPAV FETTEYDFETVARRLQEMAFLNKGLTINLTDERVTQDEVVDEVVSDVAEAPKSASERA AESTAPHKVKSRTFHYPGGLVDFVKHINRTKNAIHSSIVDFSGKGTGHEVEIAMQWNA GYSESVHTFANTINTHEGGTHEEGFRSALTSVVNKYAKDRKLLKDKDPNLTGDDIREG LAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQKVCNEQLTHWFEANPTDAKVVVNKAV SSAQARIAARKARELVRRKSATDIGGLPGKLADCRSTDPRKSELYVVEGDSAGGSAKS GRDSMFQAILPLRGKIINVEKARIDRVLKNTEVQAIITALGTGIHDEFDIGKLRYHKI VLMADADVDGQHISTLLLTLLFRFMRPLIENGHVFLAQPPLYKLKWQRSDPEFAYSDR ERDGLLEAGLKAGKKINKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLDDAAA ADELFSILMGEDVDARRSFITRNAKDVRFLDV" misc_feature 6608..6634 /gene="gyrB" /locus_tag="Rv0005" /note="PS00177 DNA topoisomerase II signature" gene 7302..9818 /gene="gyrA" /locus_tag="Rv0006" /db_xref="GeneID:887105" CDS 7302..9818 /gene="gyrA" /locus_tag="Rv0006" /EC_number="5.99.1.3" /function="DNA GYRASE NEGATIVELY SUPERCOILS CLOSED CIRCULAR DOUBLE-STRANDED DNA IN AN ATP-DEPENDENT MANNER AND ALSO CATALYZES THE INTERCONVERSION OF OTHER TOPOLOGICAL ISOMERS OF DOUBLE-STRANDED DNA RINGS, INCLUDING CATENANES AND KNOTTED RINGS [CATALYTIC ACTIVITY: ATP-dependent breakage, passage and rejoining of double-stranded DNA]." /experiment="experimental evidence, no additional details recorded" /note="negatively supercoils closed circular double-stranded DNA" /codon_start=1 /transl_table=11 /product="DNA gyrase subunit A" /protein_id="NP_214520.1" /db_xref="GI:15607148" /db_xref="GeneID:887105" /translation="MTDTTLPPDDSLDRIEPVDIEQEMQRSYIDYAMSVIVGRALPEV RDGLKPVHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDSLVRMAQP WSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRV QEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADAVFWALENHDADEEETLAA VMGRVKGPDFPTAGLIVGSQGTADAYKTGRGSIRMRGVVEVEEDSRGRTSLVITELPY QVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKH TQLQTSFGANMLAIVDGVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILR GLVKALDALDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQR IIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIAADGDVSDE DLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLKQDDIVAHFFVCSTHDL ILFFTTQGRVYRAKAYDLPEASRTARGQHVANLLAFQPEERIAQVIQIRGYTDAPYLV LATRNGLVKKSKLTDFDSNRSGGIVAVNLRDNDELVGAVLCSAGDDLLLVSANGQSIR FSATDEALRPMGRATSGVQGMRFNIDDRLLSLNVVREGTYLLVATSGGYAKRTAIEEY PVQGRGGKGVLTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQT KGVRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN" misc_feature 8811..8849 /gene="gyrA" /locus_tag="Rv0006" /note="PS00018 EF-hand calcium-binding domain" gene 9914..10828 /locus_tag="Rv0007" /db_xref="GeneID:885982" CDS 9914..10828 /locus_tag="Rv0007" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0007, (MTCY10H4.05), len: 304 aa. Possible conserved membrane protein, highly similar to Z70722|MLCB1770_7 from Mycobacterium leprae (303 aa), FASTA scores: opt: 812, E(): 1.6e-25, (54.2% identity in 319 aa overlap). C-terminal part highly similar to C-terminus of CAB92992.1|AL357152 putative integral membrane protein from Streptomyces coelicolor (185 aa); and N-terminal part highly similar to C-terminus of NP_302684.1|NC_002677 hypothetical protein from Mycobacterium leprae (123 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214521.1" /db_xref="GI:15607149" /db_xref="GeneID:885982" /translation="MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPP PWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRT PQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAG SSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMI TVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMT ALATIGAFVYNLITDLIGGIEVTLADRD" gene 10887..10960 /locus_tag="Rvnt01" /note="tRNA-Ile(GAT)" /db_xref="GeneID:2700464" tRNA 10887..10960 /locus_tag="Rvnt01" /product="tRNA-Ile" /note="codon recognized: AUC" /anticodon=(pos:10921..10923,aa:Ile) /db_xref="GeneID:2700464" gene 11112..11184 /locus_tag="Rvnt02" /note="tRNA-Ala(TGC)" /db_xref="GeneID:2700469" tRNA 11112..11184 /locus_tag="Rvnt02" /product="tRNA-Ala" /note="codon recognized: GCA" /anticodon=(pos:11145..11147,aa:Ala) /db_xref="GeneID:2700469" gene complement(11874..12311) /locus_tag="Rv0008c" /db_xref="GeneID:887085" CDS complement(11874..12311) /locus_tag="Rv0008c" /function="UNKNOWN" /note="Rv0008c, (MTCY10H4.07c), len: 145 aa. Possible membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214522.1" /db_xref="GI:15607150" /db_xref="GeneID:887085" /translation="MSEQVETRLTPRERLTRGLAYSAVGPVDVTRGLLELGVGLGLQS ARSTAAGLRRRYREGRLAREVAAAQETLAQELTAAQDVVANLPQALQDARTQRRSKHH LWIFAGIAAAILAGGAVAFSIVRRSSRPEPSPRPPSVEVQPRS" gene 12468..13016 /gene="ppiA" /locus_tag="Rv0009" /db_xref="GeneID:887087" CDS 12468..13016 /gene="ppiA" /locus_tag="Rv0009" /EC_number="5.2.1.8" /function="PPIASES ACCELERATE THE FOLDING OF PROTEINS [CATALYTIC ACTIVITY: CIS-TRANS ISOMERIZATION OF PROLINE IMIDIC PEPTIDE BONDS IN OLIGOPEPTIDES]." /experiment="experimental evidence, no additional details recorded" /note="Rv0009, (MTCY10H4.08), len: 182. Probable ppiA (alternate gene name: cfp22), iron-regulated peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8), equivalent to NP_301138.1|NC_002677 putative peptidyl-prolyl cis-trans isomerase from Mycobacterium leprae (182 aa), FASTA score: (90.1% identity in 182 aa overlap). Also highly similar to others e.g. T36725 from Streptomyces coelicolor (177 aa); T43805 from Halobacterium salinarum (180 aa); NP_219383.1|NC_000919 from Treponema pallidum (215 aa); etc. BELONGS TO THE CYCLOPHILIN-TYPE PPIASE FAMILY. Alternative start codon has been suggested.; cfp22" /codon_start=1 /transl_table=11 /product="iron-regulated peptidyl-prolyl cis-trans isomerase A" /protein_id="NP_214523.1" /db_xref="GI:15607151" /db_xref="GeneID:887087" /translation="MADCDSVTNSPLATATATLHTNRGDIKIALFGNHAPKTVANFVG LAQGTKDYSTQNASGGPSGPFYDGAVFHRVIQGFMIQGGDPTGTGRGGPGYKFADEFH PELQFDKPYLLAMANAGPGTNGSQFFITVGKTPHLNRRHTIFGEVIDAESQRVVEAIS KTATDGNDRPTDPVVIESITIS" gene complement(13133..13558) /locus_tag="Rv0010c" /db_xref="GeneID:887082" CDS complement(13133..13558) /locus_tag="Rv0010c" /function="UNKNOWN" /note="Rv0010c, (MTCY10H4.10c), len: 141 aa. Probable conserved membrane protein, equivalent to NP_301139.1|NC_002677 putative membrane protein from Mycobacterium leprae (137 aa); and similar to Rv1417|P71686|YE17_MYCTU HYPOTHETICAL 16.4 kDa PROTEIN from Mycobacterium tuberculosis (154 aa), FASTA scores: opt: 121, E(): 0.097, (29.6% identity in 81 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214524.1" /db_xref="GI:15607152" /db_xref="GeneID:887082" /translation="MQQTAWAPRTSGIAGCGAGGVVMAIASVTLVTDTPGRVLTGVAA LGLILFASATWRARPRLAITPDGLAIRGWFRTQLLRHSNIKIIRIDEFRRYGRLVRLL EIETVSGGLLILSRWDLGTDPVEVLDALTAAGYAGRGQR" gene complement(13714..13995) /locus_tag="Rv0011c" /db_xref="GeneID:887074" CDS complement(13714..13995) /locus_tag="Rv0011c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="integral membrane protein involved in inhibition of the Z-ring formation" /codon_start=1 /transl_table=11 /product="putative septation inhibitor protein" /protein_id="NP_214525.1" /db_xref="GI:15607153" /db_xref="GeneID:887074" /translation="MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIG LIWLMVFQLAAIGSQAPTALNWMAQLGPWNYAIAFAFMITGLLLTMRWH" gene 14089..14877 /locus_tag="Rv0012" /db_xref="GeneID:887083" CDS 14089..14877 /locus_tag="Rv0012" /function="UNKNOWN" /note="Rv0012, (MTCY10H4.12), len: 262 aa. Probable conserved membrane protein, similar to AL079308|SCH69_23|T36722 hypothetical protein from Streptomyces coelicolor (237 aa), FASTA scores: opt: 506, E(): 1.9e-25, (39.8% identity in 236 aa overlap). Some similarity to BLU0|1958_35A2 DIVIB (fragment) (188 aa), FASTA scores: opt: 204, E(): 8.9e-07, (35.6% identity in 90 aa overlap); and G1129091|DDS cell division and sporulation protein from Bacillus subtilis (231 aa), FASTA scores: opt: 180, E(): 3.8e-05, (30.7% identity in 101 aa overlap). Also similar to Rv1823|MTCY1A11_20 from Mycobacterium tuberculosis FASTA score: (30.1% identity in 246 aa overlap); and MTCY1A11_18 FASTA score: (25.5% identity in 235 aa overlap). Contains probable N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214526.1" /db_xref="GI:15607154" /db_xref="GeneID:887083" /translation="MRLTHPTPCPENGETMIDRRRSAWRFSVPLVCLLAGLLLAATHG VSGGTEIRRSDAPRLVDLVRRAQASVNRLATEREALTTRIDSVHGRSVDTALAAMQRR SAKLAGVAAMNPVHGPGLVVTLQDAQRDANGRFPRDASPDDLVVHQQDIEAVLNALWN AGAEAIQMQDQRIIAMSIARCVGNTLLLNGRTYSPPYTIAAIGDAAAMQAALAAAPLV TLYKQYVVRFGLGYCEEVHPDLQIVGYADPVRMHFAQPAGPLDY" gene 14914..15612 /gene="trpG" /locus_tag="Rv0013" /db_xref="GeneID:885955" CDS 14914..15612 /gene="trpG" /locus_tag="Rv0013" /function="POSSIBLY INVOLVED IN BIOSYNTHESIS OF TRYPTOPHAN (AT THE FIRST STEP). SUPPOSED TETRAMER OF TWO COMPONENTS I AND TWO COMPONENTS II: COMPONENT I (Rv1609|trpE) CATALYZES THE FORMATION OF ANTHRANILATE USING AMMONIA RATHER THAN GLUTAMINE, WHEREAS COMPONENT II (Rv0013|trpG) PROVIDES GLUTAMINE AMIDOTRANSFERASE ACTIVITY. POSSIBLY PARTICIPATES IN THE TRYPTOPHAN-DEPENDENT INDOLE-3-ACETIC ACID PRODUCTION [CATALYTIC ACTIVITY: CHORISMATE + L-GLUTAMINE = ANTHRANILATE + PYRUVATE + L-GLUTAMATE]." /experiment="experimental evidence, no additional details recorded" /note="aminodeoxychorismate synthase subunit PabA; with PabB catalyzes the formation of 4-amino-4-deoxychorismate from chorismate and glutamine in para-aminobenzoate synthesis; PabA provides the glutamine amidotransferase activity" /codon_start=1 /transl_table=11 /product="para-aminobenzoate synthase component II" /protein_id="YP_177615.1" /db_xref="GI:57116682" /db_xref="GeneID:885955" /translation="MRILVVDNYDSFVFNLVQYLGQLGIEAEVWRNDDHRLSDEAAVA GQFDGVLLSPGPGTPERAGASVSIVHACAAAHTPLLGVCLGHQAIGVAFGATVDRAPE LLHGKTSSVFHTNVGVLQGLPDPFTATRYHSLTILPKSLPAVLRVTARTSSGVIMAVQ HTGLPIHGVQFHPESILTEGGHRILANWLTCCGWTQDDTLVRRLENEVLTAISPHFPT STASAGEATGRTSA" misc_feature 15100..15150 /gene="trpG" /locus_tag="Rv0013" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 15145..15180 /gene="trpG" /locus_tag="Rv0013" /note="PS00442 Glutamine amidotransferases class-I active site" gene complement(15590..17470) /gene="pknB" /locus_tag="Rv0014c" /db_xref="GeneID:887072" CDS complement(15590..17470) /gene="pknB" /locus_tag="Rv0014c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO REGULATE CELL DIVISION/DIFFERENTIATION. CAN PHOSPHORYLATE THE PEPTIDE SUBSTRATE MYELIN BASIC PROTEIN (MBP) [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv0014c, (MTCY10H4.14c), len: 626 aa. pknB, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citations below), equivalent to MLCB1770_9|T10009 probable serine/threonine-specific protein kinase from Mycobacterium leprae (622 aa), FASTA scores: opt: 3600, E(): 0, (86.4% identity in 626 aa overlap). Also similar (highly similar in N-terminus) to others e.g. T36717 from Streptomyces coelicolor (673 aa); NP_389459.1|NC_000964 from Bacillus subtilis (648 aa); NP_465345.1|NC_003210 from Listeria monocytogenes (655 aa); E235741 protein kinase pknB (315 aa), FASTA scores: opt: 1839, E(): 0, (90.8 identity in 305 aa overlap); etc. Contains PS00107 Protein kinases ATP-binding region signature, and PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation on serine/threonine residues." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase B PKNB (protein kinase B) (STPK B)" /protein_id="NP_214528.1" /db_xref="GI:15607156" /db_xref="GeneID:887072" /translation="MTTPSHLSDRYELGEILGFGGMSEVHLARDLRLHRDVAVKVLRA DLARDPSFYLRFRREAQNAAALNHPAIVAVYDTGEAETPAGPLPYIVMEYVDGVTLRD IVHTEGPMTPKRAIEVIADACQALNFSHQNGIIHRDVKPANIMISATNAVKVMDFGIA RAIADSGNSVTQTAAVIGTAQYLSPEQARGDSVDARSDVYSLGCVLYEVLTGEPPFTG DSPVSVAYQHVREDPIPPSARHEGLSADLDAVVLKALAKNPENRYQTAAEMRADLVRV HNGEPPEAPKVLTDAERTSLLSSAAGNLSGPRTDPLPRQDLDDTDRDRSIGSVGRWVA VVAVLAVLTVVVTIAINTFGGITRDVQVPDVRGQSSADAIATLQNRGFKIRTLQKPDS TIPPDHVIGTDPAANTSVSAGDEITVNVSTGPEQREIPDVSTLTYAEAVKKLTAAGFG RFKQANSPSTPELVGKVIGTNPPANQTSAITNVVIIIVGSGPATKDIPDVAGQTVDVA QKNLNVYGFTKFSQASVDSPRPAGEVTGTNPPAGTTVPVDSVIELQVSKGNQFVMPDL SGMFWVDAEPRLRALGWTGMLDKGADVDAGGSQHNRVVYQNPPAGTGVNRDGIITLRF GQ" misc_feature complement(17033..17071) /gene="pknB" /locus_tag="Rv0014c" /note="PS00108 Serine/Threonine protein kinases active-site signature" misc_feature complement(17351..17422) /gene="pknB" /locus_tag="Rv0014c" /note="PS00107 Protein kinases ATP-binding region signature" gene complement(17467..18762) /gene="pknA" /locus_tag="Rv0015c" /db_xref="GeneID:885953" CDS complement(17467..18762) /gene="pknA" /locus_tag="Rv0015c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO REGULATE MORPHOLOGICAL CHANGES ASSOCIATED WITH CELL DIVISION/DIFFERENTIATION PROCESS. PHOSPHORYLATES AT SERINE AND THREONINE RESIDUES [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv0015c, (MTCY10H4.15c), len: 431 aa. pknA, transmembrane serine/threonine-protein kinase (EC 2.7.1.-), magnesium/manganese dependent (see citations below), equivalent to MLCB1770_10|NP_301143.1|NC_002677 putative serine/threonine protein kinase from Mycobacterium leprae (437 aa), FASTA scores: opt: 1883, E(): 0, (72.1% identity in 434 aa overlap). And also highly similar to other kinases from Mycobacterium leprae e.g. MLCB1770_10 from Mycobacterium leprae (437 aa). Also similar to PKNA_MYCLE protein kinase (253 aa), FASTA scores: opt: 1525, E(): 0, (95.0% identity in 242 aa overlap); etc. Also highly similar in part to others e.g. N-terminus of NP_243370.1|NC_002570 from Bacillus halodurans (664 aa); N-terminus of T36717 from Streptomyces coelicolor (673 aa); etc. Also similar to others from Mycobacterium tuberculosis: MTCY10H4_15, MTV021_9, MTCY28_5, MTCY4C12_28, MTCY50_16, MTCY8D9_8, MTCY49_28, MTCY4C12_30, MTCY28_9, etc. Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. It has been shown that sodium orthovanadate inhibits the activity of the enzyme in vitro." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase A PKNA (protein kinase A) (STPK A)" /protein_id="NP_214529.1" /db_xref="GI:15607157" /db_xref="GeneID:885953" /translation="MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVL KSEFSSDPEFIERFRAEARTTAMLNHPGIASVHDYGESQMNGEGRTAYLVMELVNGEP LNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILITPTGQVKITDF GIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDVYSLGVVGYEAVSGKRPFA GDGALTVAMKHIKEPPPPLPPDLPPNVRELIEITLVKNPAMRYRSGGPFADAVAAVRA GRRPPRPSQTPPPGRAAPAAIPSGTTARVAANSAGRTAASRRSRPATGGHRPPRRTFS SGQRALLWAAGVLGALAIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDA SPRLNWTERGETRHSGLQSWVVPPTPHSRASLARYEIAQ" misc_feature complement(18316..18354) /gene="pknA" /locus_tag="Rv0015c" /note="PS00108 Serine/Threonine protein kinases active-site signature" gene complement(18759..20234) /gene="pbpA" /locus_tag="Rv0016c" /db_xref="GeneID:887078" CDS complement(18759..20234) /gene="pbpA" /locus_tag="Rv0016c" /function="INVOLVED IN PEPTIDOGLYCAN SYNTHESIS (AT THE FINAL STAGES). CELL WALL FORMATION; PBPA IS SUPPOSED TO BE RESPONSIBLE FOR THE DETERMINATION OF THE ROD SHAPE OF THE CELL. ITS SYNTHESIZES CROSS-LINKED PEPTIDOGLYCAN FROM LIPID INTERMEDIATES." /experiment="experimental evidence, no additional details recorded" /note="Rv0016c, (MTCY10H4.16c), len: 491 aa. Probable pbpA, penicillin-binding protein, equivalent to NP_301144.1|NC_002677 putative penicillin-binding protein from Mycobacterium leprae (492 aa); and highly similar to MLCB1770_1 penicillin binding protein from Mycobacterium leprae (474 aa), FASTA scores: opt: 2516, E(): 0, (82.4% identity in 472 aa overlap). Also similar to others e.g. T36716 from Streptomyces coelicolor (490 aa); AAF61246.1|AF241575|PbpA from Streptomyces griseus (485 aa); NP_347146.1|NC_003030 from Clostridium acetobutylicum (482 aa); E235825|pbpA penicillin binding protein (325 aa), FASTA scores: opt: 1618, E(): 0, (78.3% identity in 323 aa overlap); etc. And also similar to MTCY270_5 and MTV003_8 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="penicillin-binding protein PbpA" /protein_id="NP_214530.1" /db_xref="GI:15607158" /db_xref="GeneID:887078" /translation="MNASLRRISVTVMALIVLLLLNATMTQVFTADGLRADPRNQRVL LDEYSRQRGQITAGGQLLAYSVATDGRFRFLRVYPNPEVYAPVTGFYSLRYSSTALER AEDPILNGSDRRLFGRRLADFFTGRDPRGGNVDTTINPRIQQAGWDAMQQGCYGPCKG AVVALEPSTGKILALVSSPSYDPNLLASHNPEVQAQAWQRLGDNPASPLTNRAISETY PPGSTFKVITTAAALAAGATETEQLTAAPTIPLPGSTAQLENYGGAPCGDEPTVSLRE AFVKSCNTAFVQLGIRTGADALRSMARAFGLDSPPRPTPLQVAESTVGPIPDSAALGM TSIGQKDVALTPLANAEIAATIANGGITMRPYLVGSLKGPDLANISTTVGYQQRRAVS PQVAAKLTELMVGAEKVAQQKGAIPGVQIASKTGTAEHGTDPRHTPPHAWYIAFAPAQ APKVAVAVLVENGADRLSATGGALAAPIGRAVIEAALQGEP" gene complement(20231..21640) /gene="rodA" /locus_tag="Rv0017c" /db_xref="GeneID:887075" CDS complement(20231..21640) /gene="rodA" /locus_tag="Rv0017c" /function="THIS IS A SEPTUM-PEPTIDOGLYCAN BIOSYNTHETIC PROTEIN, INVOLVED IN CELL WALL FORMATION. PLAYS A ROLE IN THE STABILIZATION OF THE FTSZ RING DURING CELL DIVISION." /note="Rv0017c, (MTCY10H4.17c), len: 469 aa. Probable rodA (alternate gene name: ftsW), cell division protein, integral membrane protein, equivalent to MLCB1770_12|T10012 probable cell division protein from Mycobacterium leprae (465 aa), FASTA scores: opt: 2475, E(): 0, (81.9% identity in 469 aa overlap). Also highly similar to others e.g. T36715|SCH69.16 from Streptomyces coelicolor (479 aa); NP_243432.1|NC_002570 from Bacillus halodurans (366 aa); NP_347145.1|NC_003030 from Clostridium acetobutylicum (400 aa); etc. Also similar to MTCY270_14 from Mycobacterium tuberculosis FASTA score: (32.2% identity in 369 aa overlap). BELONGS TO THE FTSW/RODA/SPOVE FAMILY.; ftsW" /codon_start=1 /transl_table=11 /product="cell division protein RodA" /protein_id="NP_214531.1" /db_xref="GI:15607159" /db_xref="GeneID:887075" /translation="MTTRLQAPVAVTPPLPTRRNAELLLLCFAAVITFAALLVVQANQ DQGVPWDLTSYGLAFLTLFGSAHLAIRRFAPYTDPLLLPVVALLNGLGLVMIHRLDLV DNEIGEHRHPSANQQMLWTLVGVAAFALVVTFLKDHRQLARYGYICGLAGLVFLAVPA LLPAALSEQNGAKIWIRLPGFSIQPAEFSKILLLIFFSAVLVAKRGLFTSAGKHLLGM TLPRPRDLAPLLAAWVISVGVMVFEKDLGASLLLYTSFLVVVYLATQRFSWVVIGLTL FAAGTLVAYFIFEHVRLRVQTWLDPFADPDGTGYQIVQSLFSFATGGIFGTGLGNGQP DTVPAASTDFIIAAFGEELGLVGLTAILMLYTIVIIRGLRTAIATRDSFGKLLAAGLS STLAIQLFIVVGGVTRLIPLTGLTTPWMSYGGSSLLANYILLAILARISHGARRPLRT RPRNKSPITAAGTEVIERV" gene complement(21637..23181) /gene="ppp" /locus_tag="Rv0018c" /db_xref="GeneID:887070" CDS complement(21637..23181) /gene="ppp" /locus_tag="Rv0018c" /EC_number="3.1.3.16" /function="INVOLVED IN REGULATION (USING DEPHOSPHORYLATION OF A SPECIFIC PHOSPHORYLATED SUBSTRATE)." /experiment="experimental evidence, no additional details recorded" /note="Rv0018c, (MTCY10H4.18c), len: 514 aa. Possible ppp, serine/threonine phosphatase (EC 3.1.3.16), equivalent to MLCB1770_13|T10013 PUTATIVE PHOSPHOPROTEIN PHOSPHATASE from Mycobacterium leprae (509 aa), FASTA scores: opt: 2517, E(): 0. Also highly similar to others e.g. T36714 probable protein phosphatase from Streptomyces coelicolor (515 aa); CAA10712.1|AJ132604 pppL protein from Lactococcus lactis (258 aa); NP_248765.1|NC_002516 probable phosphoprotein phosphatase from Pseudomonas aeruginosa (242 aa); etc. Also similar to BSUB0009_46 YLOO PROTEIN from Bacillus subtilis (254 aa), FASTA score: (34.0% identity in 250 aa overlap)." /codon_start=1 /transl_table=11 /product="serine/threonine phosphatase" /protein_id="NP_214532.1" /db_xref="GI:15607160" /db_xref="GeneID:887070" /translation="MARVTLVLRYAARSDRGLVRANNEDSVYAGARLLALADGMGGHA AGEVASQLVIAALAHLDDDEPGGDLLAKLDAAVRAGNSAIAAQVEMEPDLEGMGTTLT AILFAGNRLGLVHIGDSRGYLLRDGELTQITKDDTFVQTLVDEGRITPEEAHSHPQRS LIMRALTGHEVEPTLTMREARAGDRYLLCSDGLSDPVSDETILEALQIPEVAESAHRL IELALRGGGPDNVTVVVADVVDYDYGQTQPILAGAVSGDDDQLTLPNTAAGRASAISQ RKEIVKRVPPQADTFSRPRWSGRRLAFVVALVTVLMTAGLLIGRAIIRSNYYVADYAG SVSIMRGIQGSLLGMSLHQPYLMGCLSPRNELSQISYGQSGGPLDCHLMKLEDLRPPE RAQVRAGLPAGTLDDAIGQLRELAANSLLPPCPAPRATSPPGRPAPPTTSETTEPNVT SSPASPSPTTSAPAPTGTTPAIPTSASPAAPASPPTPWPVTSSPTMAALPPPPPQPGI DCRAAA" repeat_region complement(23173..23273) /note="101 bp Mycobacterial Interspersed Repetitive Unit, Class I. See Supply et al. (1997) Molecular Microbiology 26, 991-1003" gene complement(23270..23737) /locus_tag="Rv0019c" /db_xref="GeneID:887079" CDS complement(23270..23737) /locus_tag="Rv0019c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0019c, (MTCY10H4.19c), len: 155 aa. Conserved hypothetical protein, equivalent to MLCB1770_14|NP_301147.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (155 aa), FASTA scores: opt: 902, E(): 0, (91.0% identity in 155 aa overlap). Also highly similar to T36713|AL079308|SCH69_14 from Streptomyces coelicolor (172 aa), FASTA scores: opt: 389, E(): 6e-21, (46.2% identity in 171 aa overlap); and similar in C-terminus to others e.g. NP_342559.1|NC_002754 Conserved hypothetical protein from Sulfolobus solfataricus (209 aa); etc. C-terminus also highly similar to C-terminal part of AAF07901.1|AF173844_2|AF173844 putative signal transduction protein GarA from Mycobacterium smegmatis (158 aa). Also similar to Rv1827|MTCY 1A11.16c from Mycobacterium tuberculosis ( 162 aa), FASTA score: (41.2% identity in 85 aa overlap); MTMOAIS_3; MAU66560_1 and MLCB1788_15." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214533.1" /db_xref="GI:15607161" /db_xref="GeneID:887079" /translation="MQGLVLQLTRAGFLMLLWVFIWSVLRILKTDIYAPTGAVMMRRG LALRGTLLGARQRRHAARYLVVTEGALTGARITLSEQPVLIGRADDSTLVLTDDYAST RHARLSMRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGTPVRIGKTAIELRP" gene complement(23861..25444) /gene="TB39.8" /locus_tag="Rv0020c" /db_xref="GeneID:887067" CDS complement(23861..25444) /gene="TB39.8" /locus_tag="Rv0020c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0020c, (MTCY10H4.20c), len: 527 aa. TB39.8, conserved hypothetical protein, identified by proteomic study by the Statens Serum Institute, Denmark (spot TB39.8) (see citation below). Highly similar to NP_301148.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (488 aa); and Z70722|MLCB1770_15|T10015 hypothetical protein from Mycobacterium leprae (463 aa), FASTA scores: opt: 1213, E(): 2.2e-32, (72.3% identity in 506 aa overlap). Alternative start codon in position 24979 has been suggested (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214534.1" /db_xref="GI:15607162" /db_xref="GeneID:887067" /translation="MGSQKRLVQRVERKLEQTVGDAFARIFGGSIVPQEVEALLRREA ADGIQSLQGNRLLAPNEYIITLGVHDFEKLGADPELKSTGFARDLADYIQEQGWQTYG DVVVRFEQSSNLHTGQFRARGTVNPDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNS SYRGGQGQGRPDEYYDDRYARPQEDPRGGPDPQGGSDPRGGYPPETGGYPPQPGYPRP RHPDQGDYPEQIGYPDQGGYPEQRGYPEQRGYPDQRGYQDQGRGYPDQGQGGYPPPYE QRPPVSPGPAAGYGAPGYDQGYRQSGGYGPSPGGGQPGYGGYGEYGRGPARHEEGSYV PSGPPGPPEQRPAYPDQGGYDQGYQQGATTYGRQDYGGGADYTRYTESPRVPGYAPQG GGYAEPAGRDYDYGQSGAPDYGQPAPGGYSGYGQGGYGSAGTSVTLQLDDGSGRTYQL REGSNIIGRGQDAQFRLPDTGVSRRHLEIRWDGQVALLADLNSTNGTTVNNAPVQEWQ LADGDVIRLGHSEIIVRMH" gene 25644..25726 /locus_tag="Rvnt03" /note="tRNA-Leu(CAG)" /db_xref="GeneID:2700444" tRNA 25644..25726 /locus_tag="Rvnt03" /product="tRNA-Leu" /note="codon recognized: CUG" /anticodon=(pos:25677..25679,aa:Leu) /db_xref="GeneID:2700444" gene complement(25913..26881) /locus_tag="Rv0021c" /db_xref="GeneID:887066" CDS complement(25913..26881) /locus_tag="Rv0021c" /function="UNKNOWN" /note="Rv0021c, (MTCY10H4.21c), len: 322 aa. Conserved hypothetical protein, similar to various proteins e.g. NP_464341.1|NC_003210 protein similar to oxidoreductases from Listeria monocytogenes (309 aa); NP_357973.1|NC_003098 Enoyl-acyl carrier protein(ACP) reductase from Streptococcus pneumoniae (324 aa); 2NPD_NEUCR|G726338 2-nitropropane dioxygenase precursor from Neurospora crassa (378 aa), FASTA scores: opt: 383, E(): 1.1e-16, (32.2% identity in 348 aa overlap); etc. Also similar to AE001747_25 from Thermotoga maritima section 59 (314 aa), FASTA scores: opt: 442, E(): 1.5e-19, (30.5% identity in 325 aa overlap). Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv3553 (355 aa), FASTA scores: E(): 6.8e-15, (35.3 identity in 235 aa overlap); and Rv1533 (375 aa), FASTA scores: E(): 4.7e-12, (34.4% identity in 262 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214535.1" /db_xref="GI:15607163" /db_xref="GeneID:887066" /translation="MVLSTAFSQMFGIDYPIVSAPMDLIAGGELAAAVSGAGGLGLIG GGYGDRDWLARQFDLAAGAPVGCGFITWSLARQPQLLDLALQYEPVAVMLSFGDPAVF ADAIKSAGTRLVCQIQNRTQAERALQVGADVLVAQGTEAGGHGHGPRSTLTLVPEIVD LVTARGTDIPVIAAGGIADGRGLAAALMLGAAGVLVGTRFYATVEALSTPQARDPLLA ATGDDMCRTTIYDQLRRYPWPQGHTMSVLSNALTDQFEDTELDILHREEAMARYWRAV AARDYSIANVTAGQAAGLVNAVLPAADVITGMAQQAARTLTAMRAV" gene complement(27023..27442) /gene="whiB5" /locus_tag="Rv0022c" /db_xref="GeneID:887071" CDS complement(27023..27442) /gene="whiB5" /locus_tag="Rv0022c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0022c, (MTCY10H4.22c), len: 139 aa. Probable whiB5 (alternate gene name: whmG), WhiB-like regulatory protein (see citations below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Shows some similarity to O88103|AJ239086|SCO239086_1|WHID|SC6G4.45c|WBLB WHID PROTEIN from Streptomyces coelicolor (112 aa), FASTA scores: opt: 125, E(): 0.055, (37.1% identity in 97 aa overlap); and slight similarity to G466960|WHIB WHIB PROTEIN (102 aa), FASTA scores: opt: 112, E(): 0.14, (34.3 identity in 67 aa overlap).; whmG" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB5" /protein_id="NP_214536.1" /db_xref="GI:15607164" /db_xref="GeneID:887071" /translation="MAHPCATDPELWFGYPDDDGSDGAAKARAYERSATQARIQCLRR CPLLQQRRCAQHAVEHRVEYGVWAGIKLPGGQYRKREQLAAAHDVLRRIAGGEINSRQ LPDNAALLARNEGLEVTPVPGVVVHLPIAQVGPQPAA" gene 27595..28365 /locus_tag="Rv0023" /db_xref="GeneID:887062" CDS 27595..28365 /locus_tag="Rv0023" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0023, (MTCY10H4.23),len: 256 aa. Possible transcriptional regulator, equivalent to CAB96432.1|AJ251434 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (146 aa). N-terminus showing similarity with other transcriptional regulators e.g. AE0002|ECAE000240_9 from Escherichia coli strain K12 (178 aa), FASTA scores: opt: 149, E(): 0.0048, (33.3% identity in 84 aa overlap); etc. Contains probable helix-turn helix motif from aa 19 to 40 (Score 1615, +4.69 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214537.1" /db_xref="GI:15607165" /db_xref="GeneID:887062" /translation="MSRESAGAAIRALRESRDWSLADLAAATGVSTMGLSYLERGARK PHKSTVQKVENGLGLPPGTYSRLLVAADPDAELARLIAAQPSNPTAVRRAGAVVVDRH SDTDVLEGYAEAQLDAIKSVIDRLPATTSNEYETYILSVIAQCVKAEMLAASSWRVAV NAGADSTGRLMEHLRALEATRGALLERMPTSLSARFDRACAQSSLPEAVVAALIGVGA DEMWDIRNRGVIPAGALPRVRAFVDAIEASHDADEGQQ" gene 28362..29207 /locus_tag="Rv0024" /db_xref="GeneID:887061" CDS 28362..29207 /locus_tag="Rv0024" /function="UNKNOWN. THE P60 PROTEIN IS A MAJOR EXTRACELLULAR PROTEIN MAY BE INVOLVED IN THE INVASION OF HOST CELLS." /note="Rv0024, (MTCY10H4.24), len: 281 aa. Putative secreted protein, p60 homologue, similar in part to others and relatives proteins e.g. P60_LISIV|Q01837 protein p60 precursor (invasion-associated protein) from Listeria ivanovii (524 aa), FASTA scores: opt: 245, E(): 1.5e-08, (37.0% identity in 100 aa overlap); CAB92656.1|AL356832 putative NPL/P60 family secreted protein from Streptomyces coelicolor (347 aa) ; etc. Similar to Mycobacterium tuberculosis proteins Rv1477, Rv1478, Rv1566c, Rv2190c. And several homologues in Streptomyces coelicolor e.g. AL049497|SC6G10_8|T35517 probable secreted protein (338 aa), FASTA scores: opt: 399, E(): 9.8e-18, (34.9% identity in 292 aa overlap). COULD BELONG TO THE E. COLI NLPC / LISTERIA P60 FAMILY." /codon_start=1 /transl_table=11 /product="putative secreted protein P60-related protein" /protein_id="NP_214538.1" /db_xref="GI:15607166" /db_xref="GeneID:887061" /translation="MNYSEVELLSRAHQLFAGDSRRPGLDAGTTPYGDLLSRAADLNV GAGQRRYQLAVDHSRAALLSAARTDAAAGAVITGAQRDRAWARRSTGTVLDEARSDTT VTAVMPIAQREAIRRRVARLRAQRAHVLTARRRARRHLAALRALRYRVAHGPGVALAK LRLPSPSGRAGIAVHAALSRLGRPYVWGATGPNQFDCSGLVQWAYAQAGVHLDRTTYQ QINEGIPVPRSQVRPGDLVFPHPGHVQLAIGNNLVVEAPHAGASVRVSSLGNNVQIRR PLSGR" gene 29245..29607 /locus_tag="Rv0025" /db_xref="GeneID:887060" CDS 29245..29607 /locus_tag="Rv0025" /function="UNKNOWN" /note="Rv0025, (MTCY10H4.25), len: 120 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. Rv0739 (268 aa), FASTA score: (37.6% identity in 101 aa overlap), and Rv0026 FASTA score: (35.4% identity in 113 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214539.1" /db_xref="GI:15607167" /db_xref="GeneID:887060" /translation="MSEQAGSSVAVIQERQALLARQHDAVAEADRELADVLASAHAAM RESVRRLDAIAAELDRAVPDQDQLAVDTPMGAREFQTFLVAKQREIVAVVAAAHELDR AKSAVLKRLRAQYTEPAR" gene 29722..31068 /locus_tag="Rv0026" /db_xref="GeneID:887057" CDS 29722..31068 /locus_tag="Rv0026" /function="UNKNOWN" /note="Rv0026, (MTCY10H4.26), len: 448 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis: Rv0025 FASTA score: (35.4% identity in 113 aa overlap) and Rv0739 (268 aa), FASTA score: (32.4% identity in 142 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214540.1" /db_xref="GI:15607168" /db_xref="GeneID:887057" /translation="MAFDAAMSTHEDLLATIRYVRDRTGDPNAWQTGLTPTEVTAVVT STTRSEQLDAILRKIRQRHSNLYYPAPPDREQGDAARAIADAEAALAHQNSATAQLDL QVVSAILNAHLKTVEGGESLHELQQEIEAAVRIRSDLDTPAGARDFQRFLIGKLKDIR EVVATASLDAASKSALMAAWTSLYDASKGDRGDADDRGPASVGSGGAPARGAGQQPEL PTRAEPDCLLDSLLLEDPGLLADDLQVPGGTSAAIPSASSTPSLPNLGGATMPGGGAT PALVPGVSAPGGLPLSGLLRGVGDEPELTDFDERGQEVRDPADYEHSNEPDERRADDR EGADEDAGLGKSESPPQAPTTVTLPNGETVTAASPQLAAAIKAAASGTPIADAFQQQG IAIPLPGTAVANPVDPARISAGDVGVFTATPLPLALAKLFWTARFNTSQPCEGQTF" gene 31189..31506 /locus_tag="Rv0027" /db_xref="GeneID:887054" CDS 31189..31506 /locus_tag="Rv0027" /function="UNKNOWN" /note="Rv0027, (MTCY10H4.27), len: 105 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214541.1" /db_xref="GI:15607169" /db_xref="GeneID:887054" /translation="MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLG PIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIID GSR" gene 31514..31819 /locus_tag="Rv0028" /db_xref="GeneID:885812" CDS 31514..31819 /locus_tag="Rv0028" /function="UNKNOWN" /note="Rv0028, (MTCY10H4.28), len: 101 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214542.1" /db_xref="GI:15607170" /db_xref="GeneID:885812" /translation="MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETL AEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR" gene 32057..33154 /locus_tag="Rv0029" /db_xref="GeneID:887053" CDS 32057..33154 /locus_tag="Rv0029" /function="UNKNOWN" /note="Rv0029, (MTCY10H4.29), len: 365 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. C-terminal region of Rv2082|MTCY49_21|E247006 hypothetical 73.6 kDa protein (721 aa), FASTA scores: opt: 453, E(): 1.2e-22, (38.5% identity in 265 aa overlap); Rv3899c|MTY15F10_12 HYPOTHETICAL 40.8 kDa PROTEIN (410 aa), FASTA score: (33.7% identity in 252 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214543.1" /db_xref="GI:15607171" /db_xref="GeneID:887053" /translation="MAIFGRWSARQRLRRATRESLTIPTFSSSLDCTTRVIGGLWPAE LSSNTAETATLAEHLKADLHRIVGSANDELMVIWRAGMADSTRRAEEDRVIDRARASA MRRVESAMRELRQITGRVPVEIPRMRGAGGSDLDTTRLMPAVTVVQPADQACTDWPVA AAEDDEARLQRLLAFVARQEPRLNWAVGVHADGTTVLVTDVAHGWIPPGIALPEGVRL LAPARRAGRAPELVGITTCCKTYTPGDSLRRAVDSTAPTSSVQPRALPAIAGLSVELG IATQRHDGLPKIVHAMATAAGNGAAAEEVDLLRVHVDTALHHVLAQYPRVDPALLLNC MLLAATERSVTGDPIAANYHFAWFRELDSRR" gene 33224..33553 /locus_tag="Rv0030" /db_xref="GeneID:887051" CDS 33224..33553 /locus_tag="Rv0030" /function="UNKNOWN" /note="Rv0030, (MTCY10H4.30), len: 109 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214544.1" /db_xref="GI:15607172" /db_xref="GeneID:887051" /translation="MVSGSDSRSEPSQLSDRDLVESVLRDLSEAADKWEALVTQAETV TYSVDLGDVRAVANSDGRLLELTLHPGVMTGYAHGELADRVNLAITALRDEVEAENRA RYGGRLQ" gene 33582..33794 /locus_tag="Rv0031" /db_xref="GeneID:887049" CDS 33582..33794 /locus_tag="Rv0031" /function="NORMALY, REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv0031, (MTCY10H4.31), len: 70 aa. Possible remnant of a transposase, showing partial similarity to mycobacterial transposases in a short overlap, e.g. Rv2791c|MTV002_57 (459 aa), FASTA score: (72.2% identity in 36 aa overlap); Rv2885c, Rv2978c, Rv3827c, etc." /codon_start=1 /transl_table=11 /product="remnant of A transposase" /protein_id="NP_214545.1" /db_xref="GI:15607173" /db_xref="GeneID:887049" /translation="MLARHFGAGRKAHSRAVATLKADIQAWHPAGIQTPKPRCESDVF ARIGHTSHPSTRKSRVGPGASEAPLA" gene 34295..36610 /gene="bioF2" /locus_tag="Rv0032" /db_xref="GeneID:887050" CDS 34295..36610 /gene="bioF2" /locus_tag="Rv0032" /EC_number="2.3.1.47" /function="COULD BE INVOLVED IN BIOTIN BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 6-CARBOXYHEXANOYL-COA + L-ALANINE = 8-AMINO-7-OXONONANOATE + CoA + CO2]." /note="Rv0032, (MTCY10H4.32), len: 771 aa. Probable bioF2, 8-amino-7-oxononanoate synthase (EC 2.3.1.47), with its C-terminal similar to others e.g. BIOF_BACSU|P53556 8-amino-7-oxononanoate synthase from Bacillus subtilis (389 aa), FASTA scores: opt: 775, E(): 0, (37.9% identity in 346 aa overlap); P22806|BIOF_BACSH from Bacillus sphaericus (389 aa); etc. Also similar to BIOF1|Rv1569|MTCY336_35 from Mycobacterium tuberculosis (386 aa), AF041819_4 from Mycobacterium bovis, and BIOF_MYCLE|P45487 from Mycobacterium leprae (385 aa). Contains PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site. BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES." /codon_start=1 /transl_table=11 /product="8-amino-7-oxononanoate synthase BioF2" /protein_id="NP_214546.1" /db_xref="GI:15607174" /db_xref="GeneID:887050" /translation="MPTGLGYDFLRPVEDSGINDLKHYYFMADLADGQPLGRANLYSV CFDLATTDRKLTPAWRTTIKRWFPGFMTFRFLECGLLTMVSNPLALRSDTDLERVLPV LAGQMDQLAHDDGSDFLMIRDVDPEHYQRYLDILRPLGFRPALGFSRVDTTISWSSVE EALGCLSHKRRLPLKTSLEFRERFGIEVEELDEYAEHAPVLARLWRNVKTEAKDYQRE DLNPEFFAACSRHLHGRSRLWLFRYQGTPIAFFLNVWGADENYILLEWGIDRDFEHYR KANLYRAALMLSLKDAISRDKRRMEMGITNYFTKLRIPGARVIPTIYFLRHSTDPVHT ATLARMMMHNIQRPTLPDDMSEEFCRWEERIRLDQDGLPEHDIFRKIDRQHKYTGLKL GGVYGFYPRFTGPQRSTVKAAELGEIVLLGTNSYLGLATHPEVVEASAEATRRYGTGC SGSPLLNGTLDLHVSLEQELACFLGKPAAVLCSTGYQSNLAAISALCESGDMIIQDAL NHRSLFDAARLSGADFTLYRHNDMDHLARVLRRTEGRRRIIVVDAVFSMEGTVADLAT IAELADRHGCRVYVDESHALGVLGPDGRGASAALGVLARMDVVMGTFSKSFASVGGFI AGDRPVVDYIRHNGSGHVFSASLPPAAAAATHAALRVSRREPDRRARVLAAAEYMATG LARQGYQAEYHGTAIVPVILGNPTVAHAGYLRLMRSGVYVNPVAPPAVPEERSGFRTS YLADHRQSDLDRALHVFAGLAEDLTPQGAAL" misc_feature 36128..36157 /gene="bioF2" /locus_tag="Rv0032" /note="PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site" gene 36607..36870 /gene="acpA" /locus_tag="Rv0033" /db_xref="GeneID:887052" CDS 36607..36870 /gene="acpA" /locus_tag="Rv0033" /function="KEY COMPONENT IN DE NOVO FATTY ACID BIOSYNTHESIS. THIS PROTEIN IS SUPPOSED TO BE THE CARRIER OF THE GROWING FATTY ACID CHAIN IN FATTY ACID BIOSYNTHESIS." /note="Rv0033, (MTCY10H4.33), len: 87 aa. Probable acpA (alternate gene name: acpP), acyl carrier protein, similar to others e.g. ACP_BACSU|P80643 acyl carrier protein (acp) from Bacillus subtilis (77 aa), FASTA scores: opt: 149, E(): 0.00026, (41.4% identity in 70 aa overlap); NP_224500.1|NC_000922 Acyl Carrier Protein from Chlamydophila pneumoniae (79 aa); NP_228471.1|NC_000853 acyl carrier protein from Thermotoga maritima (81 aa); etc. Also similar to proteins of Mycobacterium tuberculosis Rv1344 and Rv2244 (31.5% identity in 73 aa overlap).; acpP" /codon_start=1 /transl_table=11 /product="acyl carrier protein AcpA" /protein_id="NP_214547.1" /db_xref="GI:15607175" /db_xref="GeneID:887052" /translation="MKEAINATIQRILRTDRGITANQVLVDDLGFDSLKLFQLITELE DEFDIAISFRDAQNIKTVGDVYTSVAVWFPETAKPAPLGKGTA" gene 36867..37262 /locus_tag="Rv0034" /db_xref="GeneID:887046" CDS 36867..37262 /locus_tag="Rv0034" /function="UNKNOWN" /note="Rv0034, (MTCY10H4.34), len: 131 aa. Conserved hypothetical protein, showing weak similarity to AE001980|AE001980_7 hypothetical protein from Deinococcus radiodurans (120 aa), FASTA scores: opt: 141, E(): 0.0028, (29.3% identity in 123 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214548.1" /db_xref="GI:15607176" /db_xref="GeneID:887046" /translation="MTDDADLDLVRRTFAAFARGDLAELTQCFAPDVEQFVPGKHALA GVFRGVDNVVACLGDTAAAADGTMTVTLEDVLSNTDGQVIAVYRLRASRAGKVLDQRE AILVTVAGGRITRLSEFYADPAATESFWA" gene 37259..38947 /gene="fadD34" /locus_tag="Rv0035" /db_xref="GeneID:887048" CDS 37259..38947 /gene="fadD34" /locus_tag="Rv0035" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0035, (MTCY10H4.35), len: 562 aa. Probable fadD34, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to many e.g. MBU75685_1 acyl-CoA synthase from Mycobacterium bovis (582 aa), FASTA scores: opt: 408, E(): 8.2e-20; etc. Also similar to G1171128 SAFRAMYCIN MX1 SYNTHETASE B (1770 aa), FASTA scores: opt: 445, E(): 1.3e-21, (28.1% identity in 573 aa overlap). Also similar to other proteins from Mycobacterium tuberculosis e.g. MTCY02B10.09, FASTA score: (32.3% identity in 468 aa overlap), MTCY349_40, MTCY4D9_17, MTCY338_18, MTV045_3, MTCY409_4, MTCI237_30, MTCY24G1_8, MASC_MYCLE MASC PROTEIN, U00010_6, MTV005_21, MTCY19G5_7, MTCY9F9_39, etc." /codon_start=1 /transl_table=11 /product="fatty-acid-CoA ligase" /protein_id="YP_177686.1" /db_xref="GI:57116683" /db_xref="GeneID:887048" /translation="MTAALLSPAIAWQQISACTDRTLTITCEDSEVISYQDLIARAAA CIPPLRRLDLKRGEPVLITAHTNLEFLSCFLGLMLHGAVPVPIPPREALKTTERFMTR LGPLLRHHRVLICTPAEHDEIRAAASTDCQISRFTALAEAGDEQFGRATAQQLADTAT ADWPLCTLDDDAYVQYTSGSTAAPRGVVITYRNLLSNMRAMAVGSQFQHGDVMGSWLP LHHDMGLVGSLFAALFNSVSAVFTTPHRFLYDPLGFLRLLTSSGATHTFMPNFALEWL INAYHRRGADIEGIDLHKMRRLIIASEPVHAEGMRRFAATFAGVGLAPTALGSGYGLA EATVAVSMSAPNTGFRTETHAAAEVVTGGRVLPGYEVRIDAAPGARAGTIKLRGDSVA AKAYVGGKKLDALDEEGFCDTHDLGFLVDDEIVILGRQDEVFIVHGENRFPYDIEFII RGESEQHRTKVACFGVNERVVVVLESPLDSIIDKAEADRLRCQVVAATGLQLDELITV RRGAIPTTTSGKLKRRAVAQAYRDGTLPRLATHAWTADPDSAPKTTRSSLEGAH" gene complement(39056..39829) /locus_tag="Rv0036c" /db_xref="GeneID:887043" CDS complement(39056..39829) /locus_tag="Rv0036c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0036c, (MTCY10H4.36c), len: 257 aa. Conserved hypothetical protein, highly similar to CAB95889.1|AL359988 conserved hypothetical protein from Streptomyces (276 aa). Also some similarity to Rv3099c|MTCY164_10 (283 aa), FASTA scores: E(): 3.3e-05, (25.9% identity in 205 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214550.1" /db_xref="GI:15607178" /db_xref="GeneID:887043" /translation="MADPGPFVADLRAESDDLDALVAHLPADRWADPTPAPGWTIAHQ IGHLLWTDRVALTAVTDEAGFAELMTAAAANPAGFVDDAATELAAVSPAELLTDWRVT RGRLHEELLAVPDGRKLAWFGPPMSAASMATARLMETWAHGLDVADALGVIRPATQRL RSIAHLGVRTRDYAFIVNNLTPPAEPFLVELRGPSGDTWSWGPSDAAQRVTGSAEDFC FLVTQRRALSTLDVNAVGEDAQRWLTIAQAFAGPPGRGR" gene complement(39877..41202) /locus_tag="Rv0037c" /db_xref="GeneID:887042" CDS complement(39877..41202) /locus_tag="Rv0037c" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF MACROLIDE ACROSS THE MEMBRANE." /note="Rv0037c, (MTCY10H4.37c), len: 441 aa. Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of macrolide, showing some similarity to Rv1258c|MTCY50_24 (419 aa), FASTA score: (25.2% identity in 408 aa overlap); and to AL049826|SCH24_20 from Streptomyces coelicolor (425 aa), FASTA scores: opt: 725, E(): 0, (36.1% identity in 418 aa overlap). Also similarity with several MACROLIDE-EFFLUX PROTEINS e.g. from S. pyogenes (405 aa), FASTA scores: E(): 1.3e-06, (22.8% identity in 416 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214551.1" /db_xref="GI:15607179" /db_xref="GeneID:887042" /translation="MPRVEVGLVIHSRMHARAPVDVWRSVRSLPDFWRLLQVRVASQF GDGLFQAGLAGALLFNPDRAADPMAIAGAFAVLFLPYSLLGPFAGALMDRWDRRWVLV GANTGRLALIAGVGTILAVGAGDVPLLVGALVANGLARFVASGLSAALPHVVPREQVV TMNSVAIASGAVSAFLGANFMLLPRWLLGSGDEGASAIVFLVAIPVSIALLWSLRFGP RVLGPDDTERAIHGSAVYAVVTGWLHGARTVVQLPTVAAGLSGLAAHRMVVGINSLLI LLLVRHVTARAVGGLGTALLFFAATGLGAFLANVLTPTAIRRWGRYATANGALAAAAT IQVAAAGLLVPVMVVCGFLLGVAGQVVKLCADSAMQMDVDDALRGHVFAVQDALFWVS YILSITVAAALIPEHGHAPVFVLFGSAIYLAGLVVHTIVGRRGQPVIGR" gene 41304..41912 /locus_tag="Rv0038" /db_xref="GeneID:887045" CDS 41304..41912 /locus_tag="Rv0038" /function="UNKNOWN" /note="Rv0038, (MTCY10H4.38), len: 202 aa. Conserved hypothetical protein, equivalent to MLCB1770_16|Q50191|Y038_MYCLE hypothetical 22.0 kDa from Mycobacterium leprae (202 aa), FASTA scores: opt: 1194, E(): 0, (88.6% identity in 202 aa overlap). Also highly similar or similar to other hypothetical proteins e.g. CAB72194.1|AL138851|SCE59.07c from Streptomyces coelicolor (193 aa); AAC06288.1|AF050466 from Mycobacterium bovis (82 aa) (similarity in N-terminus); NP_224347.1|NC_000922|YqgE from Chlamydophila pneumoniae (188 aa); YQGE_ECOLI HYPOTHETICAL 20.7 kDa PROTEIN (187 aa), FASTA score: (29.5% identity in 166 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214552.1" /db_xref="GI:15607180" /db_xref="GeneID:887045" /translation="MVAPHEDPEDHVAPAAQRVRAGTLLLANTDLLEPTFRRSVIYIV EHNDGGTLGVVLNRPSETAVYNVLPQWAKLAAKPKTMFIGGPVKRDAALCLAVLRVGA DPEGVPGLRHVAGRLVMVDLDADPEVLAAAVEGVRIYAGYSGWTIGQLEGEIERDDWI VLSALPSDVLVGPRADLWGQVLRRQPLPLSLLATHPIDLSRN" gene complement(42004..42351) /locus_tag="Rv0039c" /db_xref="GeneID:887038" CDS complement(42004..42351) /locus_tag="Rv0039c" /function="UNKNOWN" /note="Rv0039c, (MTCY21D4.02c, MTCY10H4.39c), len: 115 aa. Possible conserved transmembrane protein, highly similar to NP_301154.1|NC_002677|Z70722|MLCB1770_18 hypothetical protein from Mycobacterium leprae (113 aa), FASTA scores: opt: 492, E(): 7.8e-27, (64.9% identity in 114 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214553.1" /db_xref="GI:15607181" /db_xref="GeneID:887038" /translation="MFLAGVLCMCAAAASALFGSWSLCHTPTADPTALALRAMAPTQL AAAVMLAAGGVVAVAAPGHTALMVVIVCIAGAVGTLAAGSWQSAQYALRRETASPTAN CVGSCAVCTQACH" gene complement(42433..43365) /gene="mtc28" /locus_tag="Rv0040c" /db_xref="GeneID:887037" CDS complement(42433..43365) /gene="mtc28" /locus_tag="Rv0040c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0040c, (MTCY21D4.03c), len: 310 aa. mtc28, secreted proline rich 28 kDa antigen protein (has hydrophobic stretch at N-terminus) (see citation below). Highly similar to O33075|PR28_MYCLE|MT10019 Proline rich 28 kDa antigen from Mycobacterium leprae (278 aa), FASTA scores: opt: 1007, E(): 0, (65.0% identity in 257 aa overlap); and Q9CD47|LPQT_MYCLE|NP_301305.1|NC_002677 putative lipoprotein from Mycobacterium leprae (218 aa). C-terminal part very similar to lipoprotein Rv1016c from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="secreted proline rich protein MTC28 (proline rich 28 kDa antigen)" /protein_id="NP_214554.2" /db_xref="GI:57116684" /db_xref="GeneID:887037" /translation="MIQIARTWRVFAGGMATGFIGVVLVTAGKASADPLLPPPPIPAP VSAPATVPPVQNLTALPGGSSNRFSPAPAPAPIASPIPVGAPGSTAVPPLPPPVTPAI SGTLRDHLREKGVKLEAQRPHGFKALDITLPMPPRWTQVPDPNVPDAFVVIADRLGNS VYTSNAQLVVYRLIGDFDPAEAITHGYIDSQKLLAWQTTNASMANFDGFPSSIIEGTY RENDMTLNTSRRHVIATSGADKYLVSLSVTTALSQAVTDGPATDAIVNGFQVVAHAAP AQAPAPAPGSAPVGLPGQAPGYPPAGTLTPVPPR" gene 43562..46471 /gene="leuS" /locus_tag="Rv0041" /db_xref="GeneID:887040" CDS 43562..46471 /gene="leuS" /locus_tag="Rv0041" /EC_number="6.1.1.4" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-leucine + tRNA(Leu) = AMP + diphosphate + L-leucyl-tRNA(Leu)]." /note="leucine--tRNA ligase; LeuRS; class-I aminoacyl-tRNA synthetase; charges leucine by linking carboxyl group to alpha-phosphate of ATP and then transfers aminoacyl-adenylate to its tRNA; due to the large number of codons that tRNA(Leu) recognizes, the leucyl-tRNA synthetase does not recognize the anticodon loop of the tRNA, but instead recognition is dependent on a conserved discriminator base A37 and a long arm; an editing domain hydrolyzes misformed products; in Methanothermobacter thermautotrophicus this enzyme associates with prolyl-tRNA synthetase" /codon_start=1 /transl_table=11 /product="leucyl-tRNA synthetase" /protein_id="NP_214555.1" /db_xref="GI:15607183" /db_xref="GeneID:887040" /translation="MTESPTAGPGGVPRADDADSDVPRYRYTAELAARLERTWQENWA RLGTFNVPNPVGSLAPPDGAAVPDDKLFVQDMFPYPSGEGLHVGHPLGYIATDVYARY FRMVGRNVLHALGFDAFGLPAEQYAVQTGTHPRTRTEANVVNFRRQLGRLGFGHDSRR SFSTTDVDFYRWTQWIFLQIYNAWFDTTANKARPISELVAEFESGARCLDGGRDWAKL TAGERADVIDEYRLVYRADSLVNWCPGLGTVLANEEVTADGRSDRGNFPVFRKRLRQW MMRITAYADRLLDDLDVLDWPEQVKTMQRNWIGRSTGAVALFSARAASDDGFEVDIEV FTTRPDTLFGATYLVLAPEHDLVDELVAASWPAGVNPLWTYGGGTPGEAIAAYRRAIA AKSDLERQESREKTGVFLGSYAINPANGEPVPIFIADYVLAGYGTGAIMAVPGHDQRD WDFARAFGLPIVEVIAGGNISESAYTGDGILVNSDYLNGMSVPAAKRAIVDRLESAGR GRARIEFKLRDWLFARQRYWGEPFPIVYDSDGRPHALDEAALPVELPDVPDYSPVLFD PDDADSEPSPPLAKATEWVHVDLDLGDGLKPYSRDTNVMPQWAGSSWYELRYTDPHNS ERFCAKENEAYWMGPRPAEHGPDDPGGVDLYVGGAEHAVLHLLYSRFWHKVLYDLGHV SSREPYRRLVNQGYIQAYAYTDARGSYVPAEQVIERGDRFVYPGPDGEVEVFQEFGKI GKSLKNSVSPDEICDAYGADTLRVYEMSMGPLEASRPWATKDVVGAYRFLQRVWRLVV DEHTGETRVADGVELDIDTLRALHRTIVGVSEDFAALRNNTATAKLIEYTNHLTKKHR DAVPRAAVEPLVQMLAPLAPHIAEELWLRLGNTTSLAHGPFPKADAAYLVDETVEYPV QVNGKVRGRVVVAADTDEETLKAAVLTDEKVQAFLAGATPRKVIVVAGRLVNLVI" misc_feature 43799..43831 /gene="leuS" /locus_tag="Rv0041" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene complement(46581..47207) /locus_tag="Rv0042c" /db_xref="GeneID:887034" CDS complement(46581..47207) /locus_tag="Rv0042c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0042c, (MTCY21D4.05c), len: 208 aa. Possible transcriptional regulatory protein, MarR-family, highly similar except in N-terminus to CAC32228.1|AL583926 putative MarR-family regulatory protein from Mycobacterium leprae (243 aa). Also similar in part to others e.g. AB76343.1|AL158061 putative MarR-family transcriptional regulator from Streptomyces coelicolor (163 aa); NP_384406.1|NC_003047 PUTATIVE TRANSCRIPTION REGULATOR PROTEIN from Sinorhizobium meliloti (164 aa); NP_531782.1|NC_003304 transcriptional regulator, MarR family from Agrobacterium tumefaciens (151 aa); etc. Also some similarity to Mycobacterium tuberculosis proteins Rv2327, Rv0880, and Rv1404." /codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="NP_214556.1" /db_xref="GI:15607184" /db_xref="GeneID:887034" /translation="MSVVRSIGKKMQRISGPNALAVKGRPTQVYGHTHVRLDCRFMAD SEFTAPEVTQLAEGLHRALSKLISMLRRGDPNGAAAGDLTLAQLSILVTLLDQGPIRM TDLAAHERVRTPTTTVAIRRLEKIGLVKRSRDPSDLRAVLVDITPQGRAVHGESLANR RAALAALLSQLPRSDLETLRKALAPLERLASGEPASGPASNSPARKRA" gene complement(47366..48100) /locus_tag="Rv0043c" /db_xref="GeneID:887032" CDS complement(47366..48100) /locus_tag="Rv0043c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0043c, (MTCY21D4.06c), len: 244 aa. Probable transcriptional regulator, GntR family, similar to others e.g. NP_420584.1|NC_002696 transcriptional regulator GntR family from Caulobacter crescentus (221 aa); NP_294539.1|NC_001263 transcriptional regulator GntR family from Deinococcus radiodurans (267 aa); YIN1_STRAM|P32425 hypothetical transcriptional regulatory protein from Streptomyces ambofaciens (236 aa), FASTA scores: opt: 170, E(): 9.8e-05, (27.6% identity in 127 aa overlap); etc. Similar also to SC9B10_7 from Streptomyces coelicolor FASTA score: E():0.00038; and Rv0165c|MTCI28_5 from Mycobacterium tuberculosis (264 aa), FASTA score: (27.7% identity in 130 aa overlap)." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="NP_214557.1" /db_xref="GI:15607185" /db_xref="GeneID:887032" /translation="MPKKYGVKEKDQVVAHILNLLLTGKLRSGDRVDRNEIAHGLGVS RVPIQEALVQLEHDGIVSTRYHRGAFIERFDVATILEHHELDGLLNGIASARAAANPT PRILGQLDAVMRSLRNSKESRAFAECVWEYRRTVNDEYAGPRLHATIRASQNLIPRVF WMTYQNSRDDVLPFYEEENAAIHRREPEAARAACIGRSELMAQTMLAELFRRRVLVPP EGACPGPFGAPIPGFARSYQPSSPVP" gene complement(48233..49027) /locus_tag="Rv0044c" /db_xref="GeneID:887030" CDS complement(48233..49027) /locus_tag="Rv0044c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0044c, (MTCY21D4.07c), len: 264 aa. Possible oxidoreductase (EC 1.-.-.-), highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214558.1" /db_xref="GI:15607186" /db_xref="GeneID:887030" /translation="MTSLVRPDLPVRIGVQLQPQHAPHYRAVRDAVRRCEDIGVDIAF TWDHFFPLYGDPDGPHFECWTVLGAWAEQTSHIEIGALVTCNSYRNPELLADMARTVD HISGGRLILGIGSGWKQKDYDEYGYRFGTAGSRLDDLAAALPRIKARLGKLNPPPTRD IPVLIGGGGERKTLRLVAEYADIWHSFTAGDSYLAKSAVLSTHCSTVGRNPATIERSA AVDGGGLIASAEALAGLGVTLLTVGCDGPDYDLSAAAALCRWRDGR" gene complement(49043..49939) /locus_tag="Rv0045c" /db_xref="GeneID:887029" CDS complement(49043..49939) /locus_tag="Rv0045c" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN LIPID BIOSYNTHESIS." /note="Rv0045c, (MTCY21D4.08c), len 298 aa. Possible hydrolase (EC 3.-.-.-), showing similarity with others eg NP_107230.1|NC_002678 putative hydrolase from Mesorhizobium loti (278 aa); CAB56730.1|AL121600 putative hydrolase from Streptomyces coelicolor (302 aa); NP_438361.1|NC_000907 putative esterase/lipase from Haemophilus influenzae Rd (287 aa); etc. Also similar to Mycobacterium tuberculosis proteins Rv3473c, Rv1123c, Rv1938, Rv3617, Rv3670, etc." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_214559.1" /db_xref="GI:15607187" /db_xref="GeneID:887029" /translation="MLSDDELTGLDEFALLAENAEQAGVNGPLPEVERVQAGAISALR WGGSAPRVIFLHGGGQNAHTWDTVIVGLGEPALAVDLPGHGHSAWREDGNYSPQLNSE TLAPVLRELAPGAEFVVGMSLGGLTAIRLAAMAPDLVGELVLVDVTPSALQRHAELTA EQRGTVALMHGEREFPSFQAMLDLTIAAAPHRDVKSLRRGVFHNSRRLDNGNWVWRYD AIRTFGDFAGLWDDVDALSAPITLVRGGSSGFVTDQDTAELHRRATHFRGVHIVEKSG HSVQSDQPRALIEIVRGVLDTR" gene complement(50021..51124) /gene="ino1" /locus_tag="Rv0046c" /db_xref="GeneID:887028" CDS complement(50021..51124) /gene="ino1" /locus_tag="Rv0046c" /EC_number="5.5.1.4" /function="INVOLVED IN PHOSPHATIDYLINOSITOL (PI) BIOSYNTHETIC PATHWAY [CATALYTIC ACTIVITY: D-GLUCOSE 6-PHOSPHATE = 1L-MYO-INOSITOL 1-PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv0046c, (MTCY21D4.09c), len: 367 aa. ino1 (alternate gene name: tbINO), myo-inositol-1-phosphate synthase (EC 5.5.1.4) (see citations below), equivalent to Q57240|Y046_MYCLE|U00015_14|G466956|B1620_F3_113 HYPOTHETICAL 40.3 KDA PROTEIN from Mycobacterium leprae (369 aa), FASTA scores: opt: 2221, E(): 0, (91.8% identity in 366 aa overlap). N-terminus similar to N-terminus of myo-inositol-1-phosphate synthases e.g. INO1_SPIPO|P42803 myo-inositol-1-phosphate synthase (510 aa), FASTA scores: opt: 144, E(): 0.021, (25.2% identity in 365 aa overlap); CAC21218.1|AJ401007 myo-inositol 1P synthase from Thermotoga sp. SG1 (335 aa); etc. Also highly similar to other hypothetical proteins e.g. AL049826|SCH24_21c hypothetical protein from Streptomyces coelicolor (360 aa), FASTA scores: opt: 1790, E(): 0, (77.8% identity in 360 aa overlap); AE000881_1 conserved protein from M. thermoautotrophicus (368 aa); etc.; tbINO" /codon_start=1 /transl_table=11 /product="myo-inositol-1-phosphate synthase INO1 (inositol 1-phosphate synthetase) (D-glucose 6-phosphate cycloaldolase) (glucose 6-phosphate cyclase) (glucocycloaldolase)" /protein_id="NP_214560.1" /db_xref="GI:15607188" /db_xref="GeneID:887028" /translation="MSEHQSLPAPEASTEVRVAIVGVGNCASSLVQGVEYYYNADDTS TVPGLMHVRFGPYHVRDVKFVAAFDVDAKKVGFDLSDAIFASENNTIKIADVAPTNVI VQRGPTLDGIGKYYADTIELSDAEPVDVVQALKEAKVDVLVSYLPVGSEEADKFYAQC AIDAGVAFVNALPVFIASDPVWAKKFTDARVPIVGDDIKSQVGATITHRVLAKLFEDR GVQLDRTMQLNVGGNMDFLNMLERERLESKKISKTQAVTSNLKREFKTKDVHIGPSDH VGWLDDRKWAYVRLEGRAFGDVPLNLEYKLEVWDSPNSAGVIIDAVRAAKIAKDRGIG GPVIPASAYLMKSPPEQLPDDIARAQLEEFIIG" gene complement(51185..51727) /locus_tag="Rv0047c" /db_xref="GeneID:887031" CDS complement(51185..51727) /locus_tag="Rv0047c" /function="UNKNOWN" /note="Rv0047c, (MTCY21D4.10c), len: 180 aa. Conserved hypothetical protein, equivalent to NP_302717.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (180 aa). Also showing strong similarity to other hypothetical proteins e.g. AL049826|SCH24_22|T36587 from Streptomyces coelicolor (225 aa), FASTA scores: opt: 583, E(): 9e-31, (51.4% identity in 177 aa overlap); etc. Some similarity to Rv1176c from Mycobacterium tuberculosis and to P94443|YFIO from Bacillus subtilis (182 aa), FASTA scores: E(): 0.00066, (24.9% identity in 177 aa overlap). Also some similarity to G1163121 MITHRAMYCIN RESISTANCE DETERMINANT, ATP-BINDING PROTEIN (219 aa), FASTA scores: opt: 143, E(): 0.0091, (29.4% identity in 180 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214561.1" /db_xref="GI:15607189" /db_xref="GeneID:887031" /translation="MLELAILGLLIESPMHGYELRKRLTGLLGAFRAFSYGSLYPALR RMQADGLIAENAAPAGTPVRRARRVYQLTDKGRRRFGELVADTGPHNYTDDGFGVHLA FFNRTPAEARMRILEGRRRQVEERREGLREAVARASSSFDRYTRQLHQLGLESSEREV KWLNELIAAERAAPNPAEQT" gene complement(51828..52697) /locus_tag="Rv0048c" /db_xref="GeneID:887027" CDS complement(51828..52697) /locus_tag="Rv0048c" /function="UNKNOWN" /note="Rv0048c, MTCY21D4.11c, len: 289 aa. Possible membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214562.1" /db_xref="GI:15607190" /db_xref="GeneID:887027" /translation="MAKWLGAPLARGVSTATRAKDSDRQDACRILDDALRDGELSMEE HRERVSAATKAVTLGDLQRLVADLQVESAPAQMPALKSRAKRTELGLLAAAFVASVLL GVGIGWGVYGNTRSPLDFTSDPGAKPDGIAPVVLTPPRQLHSLGGLTGLLEQTRKRFG DTMGYRLVIYPEYASLDRVDPADDRRVLAYTYRGGWGDATSSAKSIADVSVVDLSKFD AKTAVGIMRGAPETLGLKQSDVKSMYLIVEPVKDPTTPAALSLSLYVSSDYGGGYLVF AGDGTIKHVSYPS" gene 52831..53244 /locus_tag="Rv0049" /db_xref="GeneID:887024" CDS 52831..53244 /locus_tag="Rv0049" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0049, (MTCY21D4.12), len: 137 aa. Conserved hypothetical protein, only equivalent to AL022118|MLCB1913_20 hypothetical protein from Mycobacterium leprae (138 aa), FASTA scores: opt: 768, E(): 0, (83.9% identity in 137 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214563.1" /db_xref="GI:15607191" /db_xref="GeneID:887024" /translation="MDYTLRRRSLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSRV ICPICRKEQLTLVSWVFGEHLGAVSGSARTAEELILLATRFSEFAVHVVEVCRTCSWN HLVKSYVLGAARPARPPRGSGGTRTARNGARTASE" gene 53663..55699 /gene="ponA1" /locus_tag="Rv0050" /db_xref="GeneID:887065" CDS 53663..55699 /gene="ponA1" /locus_tag="Rv0050" /EC_number="2.4.2.-" /EC_number="3.4.-.-" /function="INVOLVED IN PEPTIDOGLYCAN SYNTHESIS (AT THE FINAL STAGES), CELL WALL FORMATION. SYNTHESIS OF CROSS-LINKED PEPTIDOGLYCAN FROM THE LIPID INTERMEDIATES. THE ENZYME HAS A PENICILLIN-INSENSITIVE TRANSGLYCOSYLASE N-TERMINAL DOMAIN (FORMATION OF LINEAR GLYCAN STRANDS) AND A PENICILLIN-SENSITIVE TRANSPEPTIDASE C-TERMINAL DOMAIN (CROSS-LINKING OF THE PEPTIDE SUBUNITS)." /experiment="experimental evidence, no additional details recorded" /note="Rv0050, (MTCY21D4.13), len: 678 aa. Probable ponA1, penicillin-binding protein (class A), bienzymatic protein with transglycosylase (EC 2.4.2.-) and transpeptidase (EC 3.4.-.-) activities (see Graham & Clark-Curtiss 1999), highly similar to many e.g. NP_302715.1|NC_002677 penicillin-binding protein from Mycobacterium leprae (708 aa); AAB53123.1|L39923 penicillin binding protein from Mycobacterium leprae (686 aa), FASTA scores: (82.3% identity in 679 aa overlap); Q9F9V7|PONA|AAG13121.1|AF165523_1|AF165523 penicillin-binding protein 1 from Mycobacterium smegmatis (715 aa) (see Billman-Jacobe et al., 1999); CAB88838.1|AL353832 probable penicillin-binding protein from Streptomyces coelicolor (756 aa); etc. Also similar to ponA2|Rv3682|MTV025.030 BIFUNCTIONAL MEMBRANE-ASSOCIATED PENICILLIN-BINDING PROTEIN 1A/1B from Mycobacterium tuberculosis (810 aa). BELONGS TO THE TRANSGLYCOSYLASE FAMILY IN THE N-TERMINAL SECTION, AND TO THE TRANSPEPTIDASE FAMILY IN THE C-TERMINAL SECTION." /codon_start=1 /transl_table=11 /product="bifunctional penicillin-binding protein 1A/1B" /protein_id="YP_177687.1" /db_xref="GI:57116685" /db_xref="GeneID:887065" /translation="MVILLPMVTFTMAYLIVDVPKPGDIRTNQVSTILASDGSEIAKI VPPEGNRVDVNLSQVPMHVRQAVIAAEDRNFYSNPGFSFTGFARAVKNNLFGGDLQGG STITQQYVKNALVGSAQHGWSGLMRKAKELVIATKMSGEWSKDDVLQAYLNIIYFGRG AYGISAASKAYFDKPVEQLTVAEGALLAALIRRPSTLDPAVDPEGAHARWNWVLDGMV ETKALSPNDRAAQVFPETVPPDLARAENQTKGPNGLIERQVTRELLELFNIDEQTLNT QGLVVTTTIDPQAQRAAEKAVAKYLDGQDPDMRAAVVSIDPHNGAVRAYYGGDNANGF DFAQAGLQTGSSFKVFALVAALEQGIGLGYQVDSSPLTVDGIKITNVEGEGCGTCNIA EALKMSLNTSYYRLMLKLNGGPQAVADAAHQAGIASSFPGVAHTLSEDGKGGPPNNGI VLGQYQTRVIDMASAYATLAASGIYHPPHFVQKVVSANGQVLFDASTADNTGDQRIPK AVADNVTAAMEPIAGYSRGHNLAGGRDSAAKTGTTQFGDTTANKDAWMVGYTPSLSTA VWVGTVKGDEPLVTASGAAIYGSGLPSDIWKATMDGALKGTSNETFPKPTEVGGYAGV PPPPPPPEVPPSETVIQPTVEIAPGITIPIGPPTTITLAPPPPAPPAATPTPPP" gene 55696..57378 /locus_tag="Rv0051" /db_xref="GeneID:887018" CDS 55696..57378 /locus_tag="Rv0051" /function="UNKNOWN" /note="Rv0051, (MTCY21D4.14), len:560 aa. Probable conserved transmembrane protein, equivalent to NP_302714.1|NC_002677 conserved membrane protein from Mycobacterium leprae (564 aa); and highly similar to C-terminus of AAF25828.1|AF187306_1|AF187306 putative transmembrane protein from Mycobacterium smegmatis (692 aa). Also highly similar to MSGDNAB_5|G886306|L222-ORF5 (418 aa), FASTA scores: opt: 2163, E(): 0, (78.4% identity in 412 aa overlap). Also similar to AL049826|SCH24_24|T36589 probable transmembrane protein from Streptomyces coelicolor (502 aa), FASTA scores: opt: 492, E(): 1.4e-23, (35.8% identity in 522 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214565.1" /db_xref="GI:15607193" /db_xref="GeneID:887018" /translation="MTGALSQSSNISPLPLAADLRSADNRDCPSRTDVLGAALANVVG GPVGRHALIGRTRLMTPLRVMFAIALVFLALGWSTKAACLQSTGTGPGDQRVANWDNQ RAYYQLCYSDTVPLYGAELLSQGKFPYKSSWIETDSNGTPQLRYDGQIAVRYMEYPVL TGIYQYLSMAIAKTYTALSKVAPLPVVAEVVMFFNVAAFGLALAWLTTVWATSGLAGR RIWDAALVAASPLVIFQIFTNFDALATGLATSGLLAWARRRPVLAGVLIGLGSAAKLY PLLFLYPLLLLGIRAGRLNALARTMAAAAATWLLVNLPVMLLFPRGWSEFFRLNTRRG DDMDSLYNVVKSFTGWRGFDPTLGFWEPPLVLNTVVTLLFVLCCAAIAYIALTAPHRP RVAQLTFLTVASFLLVNKVWSPQFSLWLVPLAVLALPHRRILLAWMTIDALVWVPRMY YLYGNPSRSLPEQWFTTTVLLRDIAVMVLCGLVVWQIYRPGRDLVRTGGPGALPACGG VDDPVGGVFANAADAPPGRLPSWLRPRLGDEHARERTPDAGRDRTFSGQHRA" gene 57410..57973 /locus_tag="Rv0052" /db_xref="GeneID:887015" CDS 57410..57973 /locus_tag="Rv0052" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0052, (MTCY21D4.15), len: 187 aa. Conserved hypothetical protein, similar to others e.g. AL049587|SC5F2A_30S|T35272 hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 531, E(): 3.4e-29, (49.5% identity in 182 aa overlap); NP_420588.1|NC_002696 ThiJ/PfpI family protein from Caulobacter crescentus (267 aa); etc. Some similarity to Escherichia coli G1100872|thiJ (198 aa), FASTA scores: opt: 178, E(): 6.1e-06, (29.9% identity in 137 aa overlap). Also similar to Rv1930c from Mycobacterium tuberculosis (174 aa). May be a membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214566.1" /db_xref="GI:15607194" /db_xref="GeneID:887015" /translation="MPSFDVVFVGHRRGEVRSDNAMLGLLCDAAFDELTRPDVVIFPG GIGTRTLIHDQTVLDWVREAHRHTLLTTSVCTGGLVLAAAGLLNGLTATTHWRVQDLF NSLGARYVPQRVVEHLPERVITAAGVSSGIDMGLRLVELLVSREAAEASQLMIEYDPQ PPVDAGSLAKASPATHRLALEFYQHRL" gene 58192..58482 /gene="rpsF" /locus_tag="Rv0053" /db_xref="GeneID:887020" CDS 58192..58482 /gene="rpsF" /locus_tag="Rv0053" /function="BINDS TOGETHER WITH S18 TO 16S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="binds cooperatively with S18 to the S15-16S complex, allowing platform assembly to continue with S11 and S21" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S6" /protein_id="NP_214567.1" /db_xref="GI:15607195" /db_xref="GeneID:887020" /translation="MRPYEIMVILDPTLDERTVAPSLETFLNVVRKDGGKVEKVDIWG KRRLAYEIAKHAEGIYVVIDVKAAPATVSELDRQLSLNESVLRTKVMRTDKH" misc_feature 58318..58347 /gene="rpsF" /locus_tag="Rv0053" /note="PS01048 Ribosomal protein S6 signature" gene 58586..59080 /gene="ssb" /locus_tag="Rv0054" /db_xref="GeneID:887013" CDS 58586..59080 /gene="ssb" /locus_tag="Rv0054" /function="THIS PROTEIN IS ESSENTIAL FOR REPLICATION OF THE CHROMOSOME. IT IS ALSO INVOLVED IN DNA RECOMBINATION AND REPAIR." /experiment="experimental evidence, no additional details recorded" /note="binds to single stranded DNA and may facilitate the binding and interaction of other proteins to DNA" /codon_start=1 /transl_table=11 /product="single-stranded DNA-binding protein" /protein_id="NP_214568.1" /db_xref="GI:15607196" /db_xref="GeneID:887013" /translation="MAGDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQT GEWKDGEALFLRCNIWREAAENVAESLTRGARVIVSGRLKQRSFETREGEKRTVIEVE VDEIGPSLRYATAKVNKASRSGGFGSGSRPAPAQTSSASGDDPWGSAPASGSFGGGDD EPPF" gene 59122..59376 /gene="rpsR" /locus_tag="Rv0055" /db_xref="GeneID:887022" CDS 59122..59376 /gene="rpsR" /locus_tag="Rv0055" /function="THIS PROTEIN HAS BEEN IMPLICATED IN AMINOACYL-TRANSFER RNA BINDING. IT APPEARS TO BE SITUATED AT THE DECODING SITE OF MESSENGER RNA." /note="binds as a heterodimer with protein S6 to the central domain of the 16S rRNA; helps stabilize the platform of the 30S subunit" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S18" /protein_id="YP_177688.1" /db_xref="GI:57116686" /db_xref="GeneID:887022" /translation="MAKSSKRRPAPEKPVKTRKCVFCAKKDQAIDYKDTALLRTYISE RGKIRARRVTGNCVQHQRDIALAVKNAREVALLPFTSSVR" gene 59409..59867 /gene="rplI" /locus_tag="Rv0056" /db_xref="GeneID:887010" CDS 59409..59867 /gene="rplI" /locus_tag="Rv0056" /function="BINDS TO THE 23S RRNA." /experiment="experimental evidence, no additional details recorded" /note="in Escherichia coli this protein is wrapped around the base of the L1 stalk" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L9" /protein_id="NP_214570.1" /db_xref="GI:15607198" /db_xref="GeneID:887010" /translation="MKLILTADVDHLGSIGDTVEVKDGYGRNFLLPRGLAIVASRGAQ KQADEIRRARETKSVRDLEHANEIKAAIEALGPIALPVKTSADSGKLFGSVTAADVVA AIKKAGGPNLDKRIVRLPKTHIKAVGTHFVSVHLHPEIDVEVSLDVVAQS" misc_feature 59445..59528 /gene="rplI" /locus_tag="Rv0056" /note="PS00651 Ribosomal protein L9 signature" gene 59896..60417 /locus_tag="Rv0057" /db_xref="GeneID:887008" CDS 59896..60417 /locus_tag="Rv0057" /function="UNKNOWN" /note="Rv0057, (MTCY21D4.20), len: 173 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214571.1" /db_xref="GI:15607199" /db_xref="GeneID:887008" /translation="MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGL NVRKMCLKANTPGAVTWLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTD VDGYAHAMHSSINSGPLEYLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGAC VGGGESPWRSLMT" gene 60396..63020 /gene="dnaB" /locus_tag="Rv0058" /db_xref="GeneID:887009" CDS 60396..63020 /gene="dnaB" /locus_tag="Rv0058" /EC_number="3.6.1.-" /function="PARTICIPATES IN INITIATION AND ELONGATION DURING CHROMOSOME REPLICATION; IT EXHIBITS DNA-DEPENDENT ATPASE ACTIVITY. THE INTEIN IS AN ENDONUCLEASE (POTENTIAL)." /note="unwinds double stranded DNA; these Mycobacterial enzymes appear to contain inteins" /codon_start=1 /transl_table=11 /product="replicative DNA helicase" /protein_id="NP_214572.1" /db_xref="GI:15607200" /db_xref="GeneID:887009" /translation="MAVVDDLAPGMDSSPPSEDYGRQPPQDLAAEQSVLGGMLLSKDA IADVLERLRPGDFYRPAHQNVYDAILDLYGRGEPADAVTVAAELDRRGLLRRIGGAPY LHTLISTVPTAANAGYYASIVAEKALLRRLVEAGTRVVQYGYAGAEGADVAEVVDRAQ AEIYDVADRRLSEDFVALEDLLQPTMDEIDAIASSGGLARGVATGFTELDEVTNGLHP GQMVIVAARPGVGKSTLGLDFMRSCSIRHRMASVIFSLEMSKSEIVMRLLSAEAKIKL SDMRSGRMSDDDWTRLARRMSEISEAPLFIDDSPNLTMMEIRAKARRLRQKANLKLIV VDYLQLMTSGKKYESRQVEVSEFSRHLKLLAKELEVPVVAISQLNRGPEQRTDKKPML ADLRESGCLTASTRILRADTGAEVAFGELMRSGERPMVWSLDERLRMVARPMINVFPS GRKEVFRLRLASGREVEATGSHPFMKFEGWTPLAQLKVGDRIAAPRRVPEPIDTQRMP ESELISLARMIGDGSCLKNQPIRYEPVDEANLAAVTVSAAHSDRAAIRDDYLAARVPS LRPARQRLPRGRCTPIAAWLAGLGLFTKRSHEKCVPEAVFRAPNDQVALFLRHLWSAG GSVRWDPTNGQGRVYYGSTSRRLIDDVAQLLLRVGIFSWITHAPKLGGHDSWRLHIHG AKDQVRFLRHVGVHGAEAVAAQEMLRQLKGPVRNPNLDSAPKKVWAQVRNRLSAKQMM DIQLHEPTMWKHSPSRSRPHRAEARIEDRAIHELARGDAYWDTVVEITSIGDQHVFDG TVSGTHNFVANGISLHNSLEQDADVVILLHRPDAFDRDDPRGGEADFILAKHRNGPTK TVTVAHQLHLSRFANMAR" misc_feature 61071..61094 /gene="dnaB" /locus_tag="Rv0058" /note="PS00017 ATP/GTP-binding site motif A" gene 63200..63892 /locus_tag="Rv0059" /db_xref="GeneID:887006" CDS 63200..63892 /locus_tag="Rv0059" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0059, (MTV030.02), len: 230 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214573.1" /db_xref="GI:15607201" /db_xref="GeneID:887006" /translation="MITRYKPESGFVARSGGPDRKRPHDWIVWHFTHADNLPGIITAG RLLADSAVTPTTEVAYNPVKELRRHKVVAPDSRYPASMASDHVPFYIAARSPMLYVVC KGHSGYSGGAGPLVHLGVALGDIIDADLTWCASDGNAAASYTKFSRQVDTLGTFVDFD LLCQRQWHNTDDDPNRQSRRAAEILVYGHVPFELVSYVCCYNTETMTRVRTLLDPVGG VRKYVIKPGMYY" gene 63909..64967 /locus_tag="Rv0060" /db_xref="GeneID:887004" CDS 63909..64967 /locus_tag="Rv0060" /function="UNKNOWN" /note="Rv0060, (MTV030.03), len: 352 aa. Conserved hypothetical protein, showing weak similarity to NP_104623.1|NC_002678 hypothetical protein from Mesorhizobium loti (155 aa); and AP000062|AP000062_92 hypothetical protein from Aeropyrum pernix (194 aa), FASTA scores: opt: 186, E(): 4.2e-05, (30.9% identity in 165 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214574.1" /db_xref="GI:15607202" /db_xref="GeneID:887004" /translation="MITYGSGDLLRADTEALVNTVNCVGVMGKGIALQFKRRYPEMFT AYEKACKRGEVTIGKMFVVDTGQLDGPKHIINFPTKKHWRAPSKLAYIDAGLIDLIRV IRELNIASVAVPPLGVGNGGLDWEDVEQRLVSAFQQLPDVDAVIYPPSGGSRAIEGVE GLRMTWGRAVILEAMRRYLQQRRAMEPWEDPAGISHLEIQKLMYFANEADPDLALDFT PGRYGPYSERVRHLLQGMEGAFTVGLGDGTARVLANQPISLTTKGTDAITDYLATDAA ADRVSAAVDTVLRVIEGFEGPYGVELLASTHWVATREGAKEPATAAAAVRKWTKRKGR IYSDDRIGVALDRILMTA" gene 64991..65416 /locus_tag="Rv0061" /db_xref="GeneID:887003" CDS 64991..65416 /locus_tag="Rv0061" /function="UNKNOWN" /note="Rv0061, (MTV030.04), len: 141 aa (questionable ORF). Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214575.1" /db_xref="GI:15607203" /db_xref="GeneID:887003" /translation="MCADAQPSGSVGLLGRNCPTATTRWRRAGEGLTAADTIEVKLWA GKPRLHPLVPKRAVGVLLAVAHGQVAKTPSATRAIAFRHVRLMRVRWICAGNRGRKHK RRCTTQYRSTQASKLQLHFKLRQTLNRLGGLQAMVSACG" gene 65552..66694 /gene="celA1" /locus_tag="Rv0062" /db_xref="GeneID:887007" CDS 65552..66694 /gene="celA1" /locus_tag="Rv0062" /EC_number="3.2.1.4" /function="THE BIOLOGICAL CONVERSION OF CELLULOSE TO GLUCOSE GENERALLY REQUIRES THREE TYPES OF HYDROLYTIC ENZYMES: (1) ENDOGLUCANASES WHICH CUT INTERNAL BETA-1,4-GLUCOSIDIC BONDS; (2) EXOCELLOBIOHYDROLASES THAT CUT THE DISSACCHARIDE CELLOBIOSE FROM THE NONREDUCING END OF THE CELLULOSE POLYMER CHAIN; (3) BETA-1,4-GLUCOSIDASES WHICH HYDROLYZE THE CELLOBIOSE AND OTHER SHORT CELLO-OLIGOSACCHARIDES TO GLUCOSE [CATALYTIC ACTIVITY:Endohydrolysis of 1,4-beta-D-glucosidic linkages in cellulose]." /note="Rv0062, (MTV030.05), len: 380 aa. Possible celA1, cellulase (EC 3.2.1.4), similar to many e.g. AB65568.1|AL136058 putative secreted endoglucanase (cellulase) from Streptomyces coelicolor (332 aa); P07984|GUNA_CELFI ENDOGLUCANASE A PRECURSOR from Cellulomonas fimi (449 aa); GUN1_STRHA|P33682 endoglucanase 1 precursor (cellulase) from STREPTOMYCES HALSTEDII (321 aa), FASTA scores: opt: 702, E(): 1. 2e-27, (38.9% identity in 319 aa overlap); etc. SEEMS TO BELONG TO CELLULASE FAMILY B (FAMILY 6 OF GLYCOSYL HYDROLASES). Note that previously known as celA.; celA" /codon_start=1 /transl_table=11 /product="endo-1,4-beta-glucanase" /protein_id="YP_177689.1" /db_xref="GI:57116687" /db_xref="GeneID:887007" /translation="MTRRTGQRWRGTLPGRRPWTRPAPATCRRHLAFVELRHYFARVM SSAIGSVARWIVPLLGVAAVASIGVIADPVRVVRAPALILVDAANPLAGKPFYVDPAS AAMVAARNANPPNAELTSVANTPQSYWLDQAFPPATVGGTVARYTGAAQAAGAMPVLT LYGIPHRDCGSYASGGFATGTDYRGWIDAVASGLGSSPATIIVEPDALAMADCLSPDQ RQERFDLVRYAVDTLTRDPAAAVYVDAGHSRWLSAEAMAARLNDVGVGRARGFSLNVS NFYTTDEEIGYGEAISGLTNGSHYVIDTSRNGAGPAPDAPLNWCNPSGRALGAPPTTA TAGAHADAYLWIKRPGESDGTCGRGEPQAGRFVSQYAIDLAHNAGQ" gene 66923..68362 /locus_tag="Rv0063" /db_xref="GeneID:886999" CDS 66923..68362 /locus_tag="Rv0063" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0063, (MTV030.06), len: 479 aa. Possible oxidoreductase (EC 1.-.-.-), similar to many e.g. HDNO_ARTOX|P08159 6-hydroxy-d-nicotine oxidase from Arthrobacter oxidans (458 aa), FASTA scores: opt: 343, E(): 3.4e-13, (27.4% identity in 467 aa overlap); AAD28454.1|AF127374_9|AF127374|MitR oxidase from Streptomyces lavendulae (514 aa); AAF81732.1|AF254925|EncM putative FAD-dependent oxygenase from Streptomycesmaritimus (464 aa); etc. Also similar to Mycobacterium tuberculosis proteins e.g. Rv3107c, Rv1257c, etc. Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214577.1" /db_xref="GI:15607205" /db_xref="GeneID:886999" /translation="MAREISRQTFLRGAAGALAAGAVFGSVRATADPAASGWEALSSA LGGKVLQPDDGPQFATAKQVFNTNYNGYTPAVIVTPTSQLDVQKAMAFAAANNLKVAP RGGGHSYVGASTANGAMVLDLRQLPGDINYDATTGRVTVTPATGLYAMHQVLAAAGRG IPTGTCPTVGVAGHALGGGLGANSRHAGLLCDQLTSASVVLPSGQAVTASATDHPDLF WALRGGGGGNFGVTTSLTFATFPSGDLDVVNLNFPPQSFAQVLVGWQNWLRTADRGSW ALADATVDPLGTHCRILATCPAGSGGSVAAAIVSAVGTQPTGTENHTFNYLDLVRYLA VGNLNPSPLGYVGGSDVFTTITPATAQGIASAVDAFPRGAGRMLAIMHALDGALATVS PGATAFPWRRQSALVQWYVETSGSPSEATSWLNTAHQAVRAYSVGGYVNYLEVNQPPA RYFGPNLSRLSAVRQKYDPSRVMFSGLNF" misc_feature 67142..67243 /locus_tag="Rv0063" /note="PS00862 Oxygen oxidoreductases covalent FAD-binding site" gene 68620..71559 /locus_tag="Rv0064" /db_xref="GeneID:886996" CDS 68620..71559 /locus_tag="Rv0064" /function="UNKNOWN" /note="Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, highly similar to NP_301532.1| (NC_002677) putative integral membrane protein from Mycobacterium leprae (983 aa). Also similar to other hypothetical proteins from ARCHAEOGLOBUS FULGIDUS and Synecocystis sp. e.g. P72637|D90899 HYPOTHETICAL 117.2 kDa PROTEIN from SYNECHOCYSTIS SP. (1032 aa), FASTA scores: opt: 1004, E(): 3.6e-32, (31.0 % identity in 848 aa overlap); and CAC01334.1|AL390968 putative integral membrane protein (fragment) from Streptomyces coelicolor (815 aa); etc. Also similar to Rv3193c from Mycobacterium tuberculosis (992 aa), FASTA score: (50.3% identity in 985 aa overlap). Contains probable coiled-coil domain from aa 948 to 976." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214578.1" /db_xref="GI:15607206" /db_xref="GeneID:886996" /translation="METGSPGKRPVLPKRARLLVTAGMGMLALLLFGPRLVDIYVDWL WFGEVGFRSVWITVLLTRLAIVAAVALVVAGIVLAALLLAYRSRPFFVPDEPQRDPVA PLRSAVMRRPRLFGWGIAVTLGVVCGLIASFDWVKVQLFVHGGTFGIVDPEFGYDIGF FVFDLPFYRSVLNWLFVAVVLAFLASLLTHYLFGGLRLTTGRGMLTQAARVQLAVFAG AVVLLKAVAYWLDRYELLSSGRKEPTFTGAGYTDIHAELPAKLVLVAIAVLCAVSFFT AIFLRDLRIPAMAAALLVLSAILVGGLWPLLMEQFSVRPNAADVERPYIQRNIEATRE AYRIGGDWVQYRSYPGIGTKQPRDVPVDVTTIAKVRLLDPHILSRTFTQQQQLKNFFS FAEILDIDRYRIDGELQDYIVGVRELSPKSLTGNQTDWINKHTVYTHGNGFVAAPANR VNAAARGAENISDSNSGYPIYAVSDIASLGSGRQVIPVEQPRVYYGEVIAQADPDYAI VGGAPGSAPREYDTDTSKYTYTGAGGVSIGNWFNRTVFATKVAQHKFLFSREIGSESK VLIHRDPKERVQRVAPWLTTDDNPYPVVVNGRIVWIVDAYTTLDTYPYAQRSSLEGPV TSPTGIVRQGKQVSYVRNSVKATVDAYDGTVTLFQFDRDDPVLRTWMRAFPGTVKSED QIPDELRAHFRYPEDLFEVQRSLLAKYHVDEPREFFTTNAFWSVPSDPTNNANATQPP FYVLVGDQQSAQPSFRLASAMVGYNREFLSAYISAHSDPANYGKLTVLELPTDTLTQG PQQIQNSMISDTRVASERTLLERSNRIHYGNLLSLPIADGGVLYVEPLYTERISTSPS SSTFPQLSRVLVSVREPRTEGGVRVGYAPTLAESLDQVFGPGTGRVATARGGDAASAP PPGAGGPAPPQAVPPPRTTQPPAAPPRGPDVPPATVAELRETLADLRAVLDRLEKAID AAETPGG" gene 71821..72222 /locus_tag="Rv0065" /db_xref="GeneID:886993" CDS 71821..72222 /locus_tag="Rv0065" /function="UNKNOWN" /note="Rv0065, (MTV030.08), len: 133 aa. Conserved hypothetical protein, similar to several hypothetical proteins from Mycobacterium tuberculosis: Rv0960 (127 aa), Rv1720c (129 aa), and Rv0549c (137 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214579.1" /db_xref="GI:15607207" /db_xref="GeneID:886993" /translation="MDECVVDAAAVVDALAGKGASAIVLRGLLKESISNAPHLLDAEV GHALRRAVLSDEISEEQARAALDALPYLIDNRYPHSPRLIEYTWQLRHNVTFYDALYV ALATALDVPLLTGDSRLAAAPGLPCEIKLVR" gene complement(72274..74511) /gene="icd2" /locus_tag="Rv0066c" /db_xref="GeneID:887016" CDS complement(72274..74511) /gene="icd2" /locus_tag="Rv0066c" /EC_number="1.1.1.42" /function="INVOLVED IN THE KREBS CYCLE [CATALYTIC ACTIVITY: Isocitrate + NADP+ = 2-OXOGLUTARATE + CO(2) + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="NADP; Rv0066c, (MTV030.09c), len: 745 aa. Probable icd2, isocitrate dehydrogenase NADP-dependent (EC 1.1.1.42), equivalent to NP_302705.1|NC_002677 isocitrate dehydrogenase [NADP] from Mycobacterium leprae (746 aa). Also highly similar to many members of the monomeric-type family of IDH e.g. NP_251314.1|NC_002516 isocitrate dehydrogenase from Pseudomonas aeruginosa (741 aa); IDH_AZOVI|P16100 isocitrate dehydrogenase (nadp) from Azotobacter vinelandii (741 aa), FASTA scores: opt: 3106, E(): 0, (61.4% identity in 735 aa overlap); NP_230786.1|NC_002505 isocitrate dehydrogenase from Vibrio cholerae (741 aa); etc. BELONGS TO THE MONOMERIC-TYPE FAMILY OF IDH" /codon_start=1 /transl_table=11 /product="isocitrate dehydrogenase" /protein_id="NP_214580.1" /db_xref="GI:15607208" /db_xref="GeneID:887016" /translation="MSAEQPTIIYTLTDEAPLLATYAFLPIVRAFAEPAGIKIEASDI SVAARILAEFPDYLTEEQRVPDNLAELGRLTQLPDTNIIKLPNISASVPQLVAAIKEL QDKGYAVPDYPADPKTDQEKAIKERYARCLGSAVNPVLRQGNSDRRAPKAVKEYARKH PHSMGEWSMASRTHVAHMRHGDFYAGEKSMTLDRARNVRMELLAKSGKTIVLKPEVPL DDGDVIDSMFMSKKALCDFYEEQMQDAFETGVMFSLHVKATMMKVSHPIVFGHAVRIF YKDAFAKHQELFDDLGVNVNNGLSDLYSKIESLPASQRDEIIEDLHRCHEHRPELAMV DSARGISNFHSPSDVIVDASMPAMIRAGGKMYGADGKLKDTKAVNPESTFSRIYQEII NFCKTNGQFDPTTMGTVPNVGLMAQQAEEYGSHDKTFEIPEDGVANIVDVATGEVLLT ENVEAGDIWRMCIVKDAPIRDWVKLAVTRARISGMPVLFWLDPYRPHENELIKKVKTY LKDHDTEGLDIQIMSQVRSMRYTCERLVRGLDTIAATGNILRDYLTDLFPILELGTSA KMLSVVPLMAGGGMYETGAGGSAPKHVKQLVEENHLRWDSLGEFLALGAGFEDIGIKT GNERAKLLGKTLDAAIGKLLDNDKSPSRKTGELDNRGSQFYLAMYWAQELAAQTDDQQ LAEHFASLADVLTKNEDVIVRELTEVQGEPVDIGGYYAPDSDMTTAVMRPSKTFNAAL EAVQG" gene complement(74629..75198) /locus_tag="Rv0067c" /db_xref="GeneID:886991" CDS complement(74629..75198) /locus_tag="Rv0067c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0067c, (MTV030.10c), len: 189 aa. Possible transcriptional regulator, highly similar except in N-terminus to T44726 probable transcription regulator from Mycobacterium leprae (189 aa), FASTA scores: opt: 829, E(): 0, (68.3% identity in 189 aa overlap). And similar to others, often many members of the tetR family, e.g. T36918 probable transcription regulator from Streptomyces coelicolor (202 aa); NP_535866.1|NC_003306 transcriptional regulator TetR family from Agrobacterium tumefaciens strain C58 (Dupont) (194 aa); UIDR_ECOLI|Q59431 uid operon repressor (gus operon repressor) from Escherichia coli (196 aa), FASTA scores: opt: 200, E(): 7.2e-06, (24.7% identity in 186 aa overlap); etc. Also similar to MTCY8D5_28 from Mycobacterium tuberculosis cosmid (229 aa), FASTA score: (32.7% identity in 168 aa overlap). Contains probable helix-turn-helix motif from aa 34 to 55 (Score 1523, +4.37 SD)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_214581.1" /db_xref="GI:15607209" /db_xref="GeneID:886991" /translation="MAPTDRRVRADAARNRARVLEVAYQTFAADGLSVPVDEIARRAG VGAGTVYRHFPTKEALFQAVIADRMHRIIDKGHALLKSKHPGDALFAFLRSMVLQWGA TDRGLVEALAGVGIEISSAAPEAEADFLDLLTDLLRAAQRAGTVRPDVDVLEVKTLLV GCQAMQSYNAELAAKVTDVALDGLRANRK" gene 75301..76212 /locus_tag="Rv0068" /db_xref="GeneID:886989" CDS 75301..76212 /locus_tag="Rv0068" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0068, (MTV030.11), len: 303 aa. Probable oxidoreductase (EC 1.-.-.-), equivalent to NP_301343.1|NC_002677 putative oxidoreductase from Mycobacterium leprae (304 aa). Also highly similar to many e.g. NP_485762.1|NC_003272 probable oxidoreductase from Nostoc sp. PCC 7120 (311 aa); NP_279536.1|NC_002607|YajO1 probable oxidoreductase from Halobacterium sp. NRC-1 (316 aa); OXIR_STRAT|Q03326 probable oxidoreductase from Streptomyces antibioticus (298 aa), FASTA scores: opt: 430, E(): 1.3e-16, (34.9% identity in 295 aa overlap); etc. Also highly similar to MTV037_3 and MTV022_13 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_214582.1" /db_xref="GI:15607210" /db_xref="GeneID:886989" /translation="MTKWTAADIPDQTGRTAVITGANTGLGFETAAALAAHGAHVVLA VRNLDKGKQAAARITEATPGAEVELQELDLTSLASVRAAAAQLKSDHQRIDLLINNAG VMYTPRQTTADGFEMQFGTNHLGHFALTGLLIDRLLPVAGSRVVTISSVGHRIRAAIH FDDLQWERRYRRVAAYGQAKLANLLFTYELQRRLAPGGTTIAVASHPGVSNTEVVRNM PRPLVAVAAILAPLMQDAELGALPTLRAATDPAVRGGQYFGPDGFGEIRGYPKVVASS AQSHDEQLQRRLWAVSEELTGVVYPVG" gene complement(76237..77622) /gene="sdaA" /locus_tag="Rv0069c" /db_xref="GeneID:886986" CDS complement(76237..77622) /gene="sdaA" /locus_tag="Rv0069c" /EC_number="4.3.1.17" /function="INVOLVED IN GLUCONEOGENESIS FROM SERINE [CATALYTIC ACTIVITY: L-serine + H2O = pyruvate + NH3 + H2O]." /note="Rv0069c, (MTV030.12c), len: 461 aa. Probable sdaA, L-serine dehydratase (EC 4.2.1.13), equivalent to NP_302203.1| NC_002677 L-serine dehydratase from Mycobacterium leprae (458 aa). Also highly similar to many e.g. NP_251133.1|NC_002516 L-serine dehydratase from Pseudomonas aeruginosa (458 aa); O86564|SDHL_STRCO L-SERINE DEHYDRATASE from Streptomyces coelicolor (455 aa); SDHL_ECOLI|P16095 L-serine dehydratase 1 from Escherichia coli (454 aa), FASTA scores: opt: 1381, E(): 0, (51.1% identity in 460 aa overlap); etc. BELONGS TO THE IRON-SULFUR DEPENDENT L-SERINE DEHYDRATASE FAMILY. COFACTOR: IRON-SULFUR (4FE-4S) (PROBABLE)." /codon_start=1 /transl_table=11 /product="L-serine dehydratase SdaA" /protein_id="NP_214583.1" /db_xref="GI:15607211" /db_xref="GeneID:886986" /translation="MTISVFDLFTIGIGPSSSHTVGPMRAANQFVVALRRRGHLDDLE AMRVDLFGSLAATGAGHGTMSAILLGLEGCQPETITTEHKERRLAEIAASGVTRIGGV IPVPLTERDIDLHPDIVLPTHPNGMTFTAAGPHGRVLATETYFSVGGGFIVTEQTSGN SGQHPCSVALPYVSAQELLDICDRLDVSISEAALRNETCCRTENEVRAALLHLRDVMV ECEQRSIAREGLLPGGLRVRRRAKVWYDRLNAEDPTRKPEFAEDWVNLVALAVNEENA SGGRVVTAPTNGAAGIVPAVLHYAIHYTSAGAGDPDDVTVRFLLTAGAIGSLFKERAS ISGAEVGCQGEVGSAAAMAAAGLAEILGGTPRQVENAAEIAMEHSLGLTCDPIAGLVQ IPCIERNAISAGKAINAARMALRGDGIHRVTLDQVIDTMRATGADMHTKYKETSAGGL AINVAVNIVEC" gene complement(77619..78896) /gene="glyA" /locus_tag="Rv0070c" /db_xref="GeneID:886983" CDS complement(77619..78896) /gene="glyA" /locus_tag="Rv0070c" /EC_number="2.1.2.1" /function="KEY ENZYME IN THE BIOSYNTHESIS OF PURINES, LIPIDS, OTHER COMPONENTS. INTERCONVERSION OF SERINE AND GLYCINE [CATALYTIC ACTIVITY: 5,10-methylenetetrahydrofolate + glycine + H2O = tetrahydrofolate + L-serine]." /note="catalyzes the reaction of glycine with 5,10-methylenetetrahydrofolate to form L-serine and tetrahydrofolate" /codon_start=1 /transl_table=11 /product="serine hydroxymethyltransferase" /protein_id="NP_214584.1" /db_xref="GI:15607212" /db_xref="GeneID:886983" /translation="MNTLNDSLTAFDPDIAALIDGELRRQESGLEMIASENYAPLAVM QAQGSVLTNKYAEGYPGRRYYGGCEFVDGVEQLAIDRVKALFGAEYANVQPHSGATAN AATMHALLNPGDTILGLSLAHGGHLTHGMRINFSGKLYHATAYEVSKEDYLVDMDAVA EAARTHRPKMIIAGWSAYPRQLDFARFRAIADEVDAVLMVDMAHFAGLVAAGVHPSPV PHAHVVTSTTHKTLGGPRGGIILCNDPAIAKKINSAVFPGQQGGPLEHVIAAKATAFK MAAQPEFAQRQQRCLDGARILAGRLTQPDVAERGIAVLTGGTDVHLVLVDLRDAELDG QQAEDRLAAVDITVNRNAVPFDPRPPMITSGLRIGTPALAARGFSHNDFRAVADLIAA ALTATNDDQLGPLRAQVQRLAARYPLYPELHRT" misc_feature complement(78183..78233) /gene="glyA" /locus_tag="Rv0070c" /note="PS00096 Serine hydroxymethyltransferase pyridoxal-phosphate attachment site" gene 79486..80193 /locus_tag="Rv0071" /db_xref="GeneID:886988" CDS 79486..80193 /locus_tag="Rv0071" /function="UNKNOWN" /note="Rv0071, (MTV030.14), len: 235 aa. Possible maturase, similar to many proteins of the group II intron maturase family e.g. P95451|U77945 MATURASE-RELATED PROTEIN from PSEUDOMONAS ALCALIGENES (297 aa), FASTA scores: opt: 395, E(): 1.7e-20, (43.5% identity in 147 aa overlap); N-terminus of AAD16434.1|AF101076 maturase-related protein from Pseudomonas putida (473 aa); N-terminus of NP_437373.1|NC_003078 putative reverse transcriptasematurase protein from Sinorhizobium meliloti (453 aa); etc. Also similar to MLCL581_1 from Mycobacterium leprae. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468." /codon_start=1 /transl_table=11 /product="maturase" /protein_id="NP_214585.1" /db_xref="GI:15607213" /db_xref="GeneID:886988" /translation="MSSITVSVDPVDPVDPVDPVDPVDAVVAAGSDGLTVARIESEIG ALEFLNELRTELKSGQFRPQPVRERKIPKPGGLGKVRRLGIPTVADRVVQAALKLVLE PIFETDFEPVSYGFRPARRAHDTIAEIHLFGTQEYRWVLDADIKACFDRIDHADLMDR VRHRIKDKRVLRLVNWQRIRHRWNWTDVRRWLTDPTGRWHPISADGITLFNPAAVPIR RYRYRGNTIPTPWTQAV" repeat_region 79507..79551 /note="5 x 9 bp GTGGACCCG repeats" repeat_region 80236..80550 /note="(MTV030.15), len: 315 bp. Probable REP'-1 pseudogene fragment, similar to many Mycobacterium tuberculosis proteins inside REP13E12 elements e.g. Q50655|Z95390|MTCY13E12.20 (317 aa), FASTA scores; opt: 324 E(): 6.8e-17, 43.4% identity in 99 aa overlap, but no possible startsite.; REP'-1" /rpt_type=DIRECT gene 80624..81673 /locus_tag="Rv0072" /db_xref="GeneID:886984" CDS 80624..81673 /locus_tag="Rv0072" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF GLUTAMINE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0072, (MTV030.16), len: 349 aa. Probable glutamine-transport transmembrane protein ABC-transporter (see citation below), showing weak similarity to NP_465894.1|NC_003210 protein similar to putative ABC-transporter transmembrane subunit from Listeria monocytogenes EGD-e (367 aa); NP_471800.1|NC_003212 protein similar to putative ABC-transporter transmembrane subunit from Listeria innocua (367 aa); E1204111|AJ003195 MEMBRANE SPANNING SUBUNIT DEVC from ANABAENA VARIABILIS (385 aa), FASTA scores: opt: 155, E(): 8.1e-07, (22.0% identity in 381 aa overlap). Also highly similar to Rv2563|Y0A5_MYCTU|Q50735|MTCY9C4.05c from Mycobacterium tuberculosis (388 aa), FASTA scores: E(): 0, (76.2% identity in 349 aa overlap). Note that supposed act with near ORF Rv0073|MTV030.17 ATP-binding protein ABC-transporter." /codon_start=1 /transl_table=11 /product="glutamine-transport transmembrane protein ABC transporter" /protein_id="NP_214586.1" /db_xref="GI:15607214" /db_xref="GeneID:886984" /translation="MLFAALRDMQWRKRRLVITIISTGLIFGMTLVLTGLANGFRVEA RHTVDSMGVDVFVVRSGAAGPFLGSIPFPDVDLARVAAEPGVMAAAPLGSVGTIMKEG TSTRNVTVFGAPEHGPGMPRVSEGRSPSKPDEVAASSTMGRHLGDTVEVGARRLRVVG IVPNSTALAKIPNVFLTTEGLQKLAYNGQPNITSIGIIGMPRQLPEGYQTFDRVGAVN DLVRPLKVAVNSISIVAVLLWIVAVLIVGSVVYLSALERLRDFAVFKAIGTPTRSIMA GLALQALVIALLAAVVGVVLAQVLAPLFPMIVAVPVGAYLALPVAAIVIGLFASVAGL KRVVTVDPAQAFGGP" gene 81676..82668 /locus_tag="Rv0073" /db_xref="GeneID:886977" CDS 81676..82668 /locus_tag="Rv0073" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF GLUTAMINE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0073, (MTV030.17), len: 330 aa. Probable glutamine-transport ATP-binding protein ABC-transporter (see citation below), similar to many ATP-binding proteins e.g. NP_070646.1|NC_000917 ABC transporter ATP-binding protein from Archaeoglobus fulgidus (231 aa); T34822 ABC-transporter ATP binding protein from Streptomyces coelicolor (230 aa); YBJZ_ECOLI|P75831 hypothetical ABC transporter ATP-binding protein from Escherichia coli (648 aa), FASTA scores: opt: 531, E(): 6.8e-30, (38.6% identity in 233 aa overlap); etc. Also highly similar to Y0A4_MYCT|Q50734|MTCY9C4.04c hypothetical ABC transporter ATP-binding protein from Mycobacterium tuberculosis (330 aa), FASTA scores: E(): 0, (83.3% identity in 330 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature, and PS00889 Cyclic nucleotide-binding domain signature 2. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that supposed act with near ORF Rv0072|MTV030.16 transmembrane ABC-transporter." /codon_start=1 /transl_table=11 /product="glutamine-transport ATP-binding protein ABC transporter" /protein_id="NP_214587.1" /db_xref="GI:15607215" /db_xref="GeneID:886977" /translation="MGDLSIQNLVVEYYSGGYALRPINGLNLDVAAGSLVMLLGPSGC GKTTLLSCLGGILRPKSGAIKFDEVDITTLQGAELANYRRNKVGIVFQAFNLVPSLTA VENVMVPLRSAGMSRRASRRRAEELLARVNLAERMNHRPGDLSGGQQQRVAVARAIAL DPPLILADEPTAHLDFIQVEEVLRLIRELADGERVVVVATHDSRMLPMADRVVELTPD FAETNRPPETVHLQAGEVLFEQSTMGDLIYVVSEGEFEIVHELADGGEELVKVAGPGD YFGEIGVLFHLPRSATVRARSDATAVGYTVQAFRERLGVGGLRDLIEHRALAND" misc_feature 81793..81816 /locus_tag="Rv0073" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 82105..82149 /locus_tag="Rv0073" /note="PS00211 ABC transporters family signature" misc_feature 82507..82560 /locus_tag="Rv0073" /note="PS00889 Cyclic nucleotide-binding domain signature 2" gene 82748..83983 /locus_tag="Rv0074" /db_xref="GeneID:886976" CDS 82748..83983 /locus_tag="Rv0074" /function="UNKNOWN" /note="Rv0074, (MTV030.18), len: 411 aa. Conserved hypothetical protein, similar to Rv2915c|MTCY338.03c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis, and showing some simlarity to various enzymes or hypothetical proteins from other organisms, eg NP_243801.1|NC_002570 aryldialkylphosphatase from Bacillus halodurans (394 aa); NP_421471.1|NC_002696 putativ Xaa-Pro dipeptidase from Caulobacter crescentus (429 aa); NP_343436.1|NC_002754 Prolidase (Xaa-Pro dipeptidase) (pepQ-like2) from Sulfolobus solfataricus (408 aa); Q50432|M91040 ORGANO PHOSPHATE ACID ANHYDRASE OPAB from MYCOBACTERIUM SP. (409 aa), FASTA scores: opt: 166, E(): 3.9e-11, (31.2% identity in 430 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214588.1" /db_xref="GI:15607216" /db_xref="GeneID:886976" /translation="MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRIS AVDFAGSACPDMNLVDLGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRARR HAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLG GVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQ VGLPVTAHAHATAGIAAAVAAGVDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTI PRVYPEMPENLVAVVQDGWRNIRRLIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSR HGFTSTEVLTGATAAAAASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVW RSGTQVPLQASAVGYNTPS" gene 83996..85168 /locus_tag="Rv0075" /db_xref="GeneID:886982" CDS 83996..85168 /locus_tag="Rv0075" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0075, (MTV030.19), len: 390 aa. Probable aminotransferase (EC 2.6.1.-), similar to many CLASS-II PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES (MALY/PATB SUBFAMILY) e.g. NP_302217.1|NC_002677 aminotransferase from Mycobacterium leprae (402 aa); PATB_BACSU|Q08432 putative aminotransferase b from Bacillus subtilis (387 aa), FASTA scores: opt: 684, E(): 5.4e-33, (31.3% identity in 384 aa overlap); etc. Also similar to several cystathionine beta-lyase (beta C-S lyase) e.g. AAK69425.1|AF276227_1|AF276227 from Corynebacterium glutamicum (368 aa); etc. Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv2294, Rv0858c, etc." /codon_start=1 /transl_table=11 /product="aminotransferase" /protein_id="NP_214589.1" /db_xref="GI:15607217" /db_xref="GeneID:886982" /translation="MQDSIFNLLTEEQLRGRNTLKWNYFGPDVVPLWLAEMDFPTAPA VLDGVRACVDNEEFGYPPLGEDSLPRATADWCRQRYGWCPRPDWVRVVPDVLKGMEVV VEFLTRPESPVALPVPAYMPFFDVLHVTGRQRVEVPMVQQDSGRYLLDLDALQAAFVR GAGSVIICNPNNPLGTAFTEAELRAIVDIAARHGARVIADEIWAPVVYGSRHVAAASV SEAAAEVVVTLVSASKGWNLPGLMCAQVILSNRRDAHDWDRINMLHRMGASTVGIRAN IAAYHHGESWLDELLPYLRANRDHLARALPELAPGVEVNAPDGTYLSWVDFRALALPS EPAEYLLSKAKVALSPGIPFGAAVGSGFARLNFATTRAILDRAIEAIAAALRDIID" gene complement(85183..85572) /locus_tag="Rv0076c" /db_xref="GeneID:886992" CDS complement(85183..85572) /locus_tag="Rv0076c" /function="UNKNOWN" /note="Rv0076c, (MTV030.20c), len: 129 aa. Probable membrane protein, with membrane-spanning domain at C-terminus." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214590.1" /db_xref="GI:15607218" /db_xref="GeneID:886992" /translation="MPAVTTPSNHWGDERRKLSHQPPVRGQILGRRQARRLSQHFARV GVEAPPKRLQEMLLGAPAADEEWTDVKFALIVTQLNHEKRVAKFHRLQRRATHSLICL GLVLVALNFLICLAYIFFSLTQHAAAL" gene complement(85636..86466) /locus_tag="Rv0077c" /db_xref="GeneID:886969" CDS complement(85636..86466) /locus_tag="Rv0077c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0077c, (MTV030.21c), len: 276 aa. Possible oxidoreductase (EC 1.-.-.-), weakly similar to others e.g. CAC44600.1|AL596162 putative oxidoreductase from Streptomyces coelicolor (275 aa); P33912|BPA1_STRAU NON-HAEM BROMOPEROXIDASE BPO-A1 (BROMIDE PEROXIDASE) (EC 1.11.1.-) from Streptomyces aureofaciens (275 aa); BPA1_STRAU|P33912 non-haem bromoperoxidase bpo-a1 from Streptomyces aureofaciens (274 aa), FASTA scores: opt: 230, E(): 1.5e-07, (26.1% identity in 249 aa overlap); etc. Also similar to MTCY05A6_35 and MTCY1A11_10 from Mycobacterium tuberculosis. And shows some similarity in part with AAL17935.1|AY054120 putative epoxide hydrolase from Mycobacterium smegmatis (203 aa)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214591.1" /db_xref="GI:15607219" /db_xref="GeneID:886969" /translation="MSTIDISAGTIHYEATGPETGRPVVFVHGYMMGGQLWRRVSERL AGRGLRCIAPTWPLGAHPKPLRPGADQTIGGVAGIVADVLAALELKDVVLVGNDTGGV VTQLVAVHYPERLGALVLTSCDAFEHFPPPILKPVILAAKSATLFRAAIQVMRAPAAR NRAYAGLSHHNIDHLTRAWVRPALSNPAIAEDLRQLSLSLRTEVTTAVAARLPEFDKP ALIAWSADDVFFALENGQRLAATIPRARFEVIEGARTFSMVDSPDRLADQLSTVAVRT" gene 86528..87133 /locus_tag="Rv0078" /db_xref="GeneID:886990" CDS 86528..87133 /locus_tag="Rv0078" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0078, (MTV030.22), len: 201 aa. Probable transcriptional regulator, equivalent to NP_302706.1|NC_002677 putative TetR-family transcriptional regulator from Mycobacterium leprae (236 aa), FASTA scores: opt: 755, E(): 0, (71.4% identity in 175 aa overlap). Also similar to others e.g. NP_103770.1|NC_002678 probable transcriptional regulator from Mesorhizobium loti (208 aa); NP_384275.1|NC_003047 PUTATIVE TRANSCRIPTION REGULATOR PROTEIN from Sinorhizobium meliloti (197 aa); NP_250960.1|NC_002516 probable transcriptional regulator from Pseudomonas aeruginosa (196 aa); etc. Also similar to TETC_ECOLI|P28815 transposon tn10 tetc protein from Escherichia coli (197 aa), FASTA scores: opt: 181, E(): 9.7e-05, (24.8% identity in 165 aa overlap). Contains probable helix-turn-helix motif from aa 35 to 56 (Score 1348, +3.78 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214592.1" /db_xref="GI:15607220" /db_xref="GeneID:886990" /translation="MEIKRRTQEERSAATREALITGARKLWGLRGYAEVGTPEIATEA GVTRGAMYHQFADKAALFRDVVEVVEQDVMARMATLVAASGAATPADAIRAAVDAWLE VSGDPEVRQLILLDAPVVLGWAGFRDVAQRYSLGMTEQLITEAIRAGQLARQPVRPLA QVLIGALDEAAMFIATADDPKRARRETRQVLRRLIDGMLNG" gene complement(87208..87801) /locus_tag="Rv0078A" /db_xref="GeneID:3205053" CDS complement(87208..87801) /locus_tag="Rv0078A" /function="UNKNOWN" /note="Rv0078A, len: 197 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177616.1" /db_xref="GI:57116688" /db_xref="GeneID:3205053" /translation="MNAVESTLRRVAKDLTGLRQRWALVGGFAVSARSEPRFTRDVDI VVAVANDDAAESLVRQLLTQQYHLLASVEQDAARRLAAVRLGATADTAANVVVDLLFA SCGIEPEIAEAAEEIEILPDLVAPVATTAHLIAMKLLARDDDRRPQDRSDLRALVDAA SPQDIQDARKAIELITLRGFHRDRDLAAEWTRLAAKW" gene 88204..89025 /locus_tag="Rv0079" /db_xref="GeneID:886995" CDS 88204..89025 /locus_tag="Rv0079" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0079, (MTV030.23), len: 273 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214593.1" /db_xref="GI:15607221" /db_xref="GeneID:886995" /translation="MEPKRSRLVVCAPEPSHAREFPDVAVFSGGRANASQAERLARAV GRVLADRGVTGGARVRLTMANCADGPTLVQINLQVGDTPLRAQAATAGIDDLRPALIR LDRQIVRASAQWCPRPWPDRPRRRLTTPAEALVTRRKPVVLRRATPLQAIAAMDAMDY DVHLFTDAETGEDAVVYRAGPSGLRLARQHHVFPPGWSRCRAPAGPPVPLIVNSRPTP VLTEAAAVDRAREHGLPFLFFTDQATGRGQLLYSRYDGNLGLITPTGDGVADGLA" gene 89022..89480 /locus_tag="Rv0080" /db_xref="GeneID:886966" CDS 89022..89480 /locus_tag="Rv0080" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0080, (MTV030.24), len: 152 aa. Conserved hypothetical protein, similar to several hypothetical proteins from Streptomyces coelicolor e.g. SCJ12.26|AL109989|SCJ12_24 from Streptomyces coelicolor cosmid J1 (137 aa), FASTA scores: opt: 291, E(): 4e-13, (46.5% identity in 129 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214594.1" /db_xref="GI:15607222" /db_xref="GeneID:886966" /translation="MSPGSRRASPQSAREVVELDRDEAMRLLASVDHGRVVFTRAALP AIRPVNHLVVDGRVIGRTRLTAKVSVAVRSSADAGVVVAYEADDLDPRRRTGWSVVVT GLATEVSDPEQVARYQRLLHPWVNMAMDTVVAIEPEIVTGIRIVADSRTP" gene 89575..89919 /locus_tag="Rv0081" /db_xref="GeneID:887012" CDS 89575..89919 /locus_tag="Rv0081" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0081, (MTV030.25), len: 114 aa. Probable transcriptional regulator, highly similar to others e.g. AL078610|SCH35_52|T36657 probable transcription regulator from Streptomyces coelicolor (117 aa), FASTA scores: opt: 404, E(): 4.8e-22, (58.2% identity in 110 aa overlap); AAG02351.1|AF210249_10|AF210249 metal-dependent regulatory protein from Streptomyces verticillus (113 aa); NP_435817.1|NC_003037 Putative transcriptional regulator from Sinorhizobium meliloti (115 aa); etc." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214595.1" /db_xref="GI:15607223" /db_xref="GeneID:887012" /translation="MESEPLYKLKAEFFKTLAHPARIRILELLVERDRSVGELLSSDV GLESSNLSQQLGVLRRAGVVAARRDGNAMIYSIAAPDIAELLAVARKVLARVLSDRVA VLEDLRAGGSAT" gene 89924..90403 /locus_tag="Rv0082" /db_xref="GeneID:886968" CDS 89924..90403 /locus_tag="Rv0082" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0082, (MTV030.26), len: 159 aa. Probable oxidoreductase (EC 1.-.-.-), highly highly similar or similar to other various oxidoreductases e.g. NP_143304.1|NC_000961 NADH-ubiquinone oxidoreductase subunit from Pyrococcus horikoshii (173 aa); NP_126406.1|NC_000868 CO-induced hydrogenase related, subunit L from Pyrococcus abyssi (170 aa); HYCG_ECOLI|P16433 formate hydrogenlyase subunit 7 from Escherichia coli (255 aa), FASTA scores: opt: 442, E(): 8e-29, (43.2% identity in 148 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214596.1" /db_xref="GI:15607224" /db_xref="GeneID:886968" /translation="MGWVAKIFRVGRVVEPAAPLPAAIAEPPAGVRGSLQIRHVDAGS CNGCEVEISGAFGPVYDAERFGARLVASPQHADALLVTGVVTHNMAGPLRKTLEATPR PRVVIACGDCALNRGVFADAYGVVGAVGEVVPVDVEIAGCPPTPAAIMAALRSVTGK" gene 90400..92322 /locus_tag="Rv0083" /db_xref="GeneID:886965" CDS 90400..92322 /locus_tag="Rv0083" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0083, (MTV030.27, MTCY251.01), len: 640 aa. Probable oxidoreductase (EC 1.-.-.-), showing some similarity to other various oxidoreductases e.g. AAK06855.1|AF335723_1|AF335723 hydrogenase-4 component B from Burkholderia pseudomallei (668 aa); HYFB_ ECOLI|P23482 hydrogenase-4 component b from Escherichia coli strain K12 (672 aa), FASTA scores: opt: 995, E(): 0, (32.2% identity in 571 aa overlap); AAF13041.1|AF157639_1|AF157639 putative formate hydrogenlyase integral membrane subunit from Desulfitobacterium dehalogenans (637 aa); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214597.1" /db_xref="GI:15607225" /db_xref="GeneID:886965" /translation="MTAAPTAGGVVTSGVGVAGVGVGLLGMFGPVRVVHVGWLLPLSG VHIELDRLGGFFMALTGAVAAPVGCYLIGYVRREHLGRVPMAVVPLFVAAMLLVPAAG SVTTFLLAWELMAIASLILVLSEHARPQVRSAGLWYAVMTQLGFIAILVGLVVLAAAG GSDRFAGLGAVCDGVRAAVFMLTLVGFGSKAGLVPLHAWLPRAHPEAPSPVSALMSAA MVNLGIYGIVRFDLQLLGPGPRWWGLALLAVGGTSALYGVLQASVAADLKRLLAYSTT ENMGLITLALGAATLFADTGAYGPASIAAAAAMLHMIAHAAFKSLAFMAAGSVLAATG LRDLDLLGGLARRMPATTVFFGVAALGACGLPLGAGFVSEWLLVQSLIHAAPGHDPIV ALTTPLAVGVVALATGLSVAAMTKAFGIGFLARPRSTQAEAAREAPASMRAGMAIAAG ACLVLAVAPLLVAPMVRRAAATLPAAQAVKFTGLGAVVRLPAMSGSIAPGVIAAAVLA AALAVAVLARWRFRRRPAPARLPLWACGAADLTVRMQYTATSFAEPLQRVFGDVLRPD TDIEVTHTAESRYMAERITYRTAVADAIEQRLYTPVVGAVAAMAELLRRAHTGSVHRY LAYGALGVLIVLVVAR" gene 92328..93278 /gene="hycD" /locus_tag="Rv0084" /db_xref="GeneID:886959" CDS 92328..93278 /gene="hycD" /locus_tag="Rv0084" /function="INVOLVED IN HYDROGEN METABOLISM; FHL PATHWAY." /note="Rv0084, (MTCY251.02), len: 316 aa. Possible hycD (alternate gene name: hevD), formate hydrogenlyase (EC 1.-.-.-), integral membrane protein, similar to others e.g. HYCD_ECOLI|P16430 formate hydrogenlyase subunit 4 from Escherichia coli (307 aa), FASTA scores: opt: 570, E(): 2.1e-26, (33.8% identity in 305 aa overlap); AAK06856.1|AF335723_2|AF335723 formate hydrogenlyase subunit 4 from Burkholderia pseudomallei (316 aa); NP_457244.1|NC_003198 formate hydrogenlyase subunit 4 from Salmonella enterica subsp. enterica serovar Typhi (307 aa); etc. Also similar to NUOH_ECOLI|P33603 NADH dehydrogenase I chain H from Escherichia coli (325 aa), FASTA scores: opt: 207, E(): 9.5e-06, (26.5% identity in 260 aa overlap). BELONGS TO THE COMPLEX I SUBUNIT 1 FAMILY.; hevD" /codon_start=1 /transl_table=11 /product="formate hydrogenlyase HYCD" /protein_id="NP_214598.1" /db_xref="GI:15607226" /db_xref="GeneID:886959" /translation="MSYLAGAAQIGGVMVGAPLVIGMTRQVRARWEGRAGAGLLQPWR DLLKQLGKQQITPAGTTIVFAAAPVIVAGTTLLIAAIAPLVATGSPLDPSADLFAVVG LLFLGTVALTLAGIDTGTSFGGMGASREITIAALVEPTILLAVFALSIPAGSANLGAL VASTIDHPGHVVSLAGVLAFVALVIVIVAETGRLPVDNPATHLELTMVHEAMVLEYAG PRLALVEWAAGMRLTVLLALLANLFLPWGIAGAAPTALDVLTGVVAVAAKVAILAVLL ATFEVFLAKLRLFRVPELLAGSFLLALLAVTAANFFTVGA" gene 93289..93951 /gene="hycP" /locus_tag="Rv0085" /db_xref="GeneID:886973" CDS 93289..93951 /gene="hycP" /locus_tag="Rv0085" /function="INVOLVED IN HYDROGEN METABOLISM." /note="Rv0085, (MTCY251.03), len: 220 aa. Possible hycP, hydrogenase (EC 1.-.-.-), integral membrane protein, weakly similar to P77524|HYFE_ECOLI HYDROGENASE-4 COMPONENT E from Escherichia coli (216 aa), FASTA scores: opt: 204, E():1.2e-07, (25.5% identity in 216 aa overlap)." /codon_start=1 /transl_table=11 /product="hydrogenase HycP" /protein_id="NP_214599.1" /db_xref="GI:15607227" /db_xref="GeneID:886973" /translation="MSNANFSILVDFAAGGLVLASVLIVWRRDLRAIVRLLAWQGAAL AAIPLLRGIRDNDRALIAVGIAVLALRALVLPWLLARAVGAEAAAQREATPLVNTASS LLITAGLTLTAFAITQPVVNLEPGVTINAVPAAFAVVLIALFVMTTRLHAVSQAAGFL MLDNGIAATAFLLTAGVPLIVELGASLDVLFAVIVIGVLTGRLRRIFGDADLDKLREL RD" gene 93951..95417 /gene="hycQ" /locus_tag="Rv0086" /db_xref="GeneID:886963" CDS 93951..95417 /gene="hycQ" /locus_tag="Rv0086" /function="POSSIBLY INVOLVED IN HYDROGEN METABOLISM." /note="Rv0086, (MTCY251.04), len: 488 aa. Possible hycQ, hydrogenase (EC 1.-.-.-), integral membrane protein, weakly similar to P77437|HYFF_ECOLI HYDROGENASE-4 COMPONENT F from Escherichia coli (526 aa), FASTA scores: opt: 948, E(): 0, (35.9% identity in 493 aa overlap); and AAK06855.1|AF335723_1|AF335723 hydrogenase-4 component B from Burkholderia pseudomallei (668 aa). Also similar to d9087711 & NUOL_ECOLI|P33607 NADH dehydrogenase I chain L from Escherichia coli (613 aa), FASTA scores: opt: 360, E():3.2e-13, (27.9% identity in 488 aa overlap); and to NUON_ECOLI|P33608 NADH dehydrogenase I chain N from Escherichia coli (425 aa), FASTA scores: opt: 375, E(): 3.9e-14, (25.0% identity in 432 aa overlap)." /codon_start=1 /transl_table=11 /product="hydrogenase HycQ" /protein_id="NP_214600.1" /db_xref="GI:15607228" /db_xref="GeneID:886963" /translation="MTGLLLAAILAPLAASIASLITGWRRTTATLTALSATTVLACAV AMGFWMGSGAQFGLGGLLRADALTVVMLVVIGIVGTLATAASIGYIDTELAHGHIDGR SARLYGVLTPAFLCAMVLAVCANNIGVIWVAIEATTVITAFLVGHRRTRTALEATWKY VVICSVGIAVAFLGTVLLYFAARDSGAAAAGALNLDILAEHAAGLDPGVARLAGGLLL IGYGAKAGLFPFHTWLADAHSQAPAPVSALMSGVLLAVAFSVLIRLRPILDAVSGPAY LRNGLLVVGLATLLVAVLMLTVTGDVKRMLAYSSMEHMGLIAIAAAAGTTLAIAALLL HVLAHGIGKTVLFLAGGQLQAAHDSTAIADITGVMRRSRLIGVSFAVGLIVLLGLPPF AMFASELAIARSLANERLAWVLGAALLLIAIGFTALARNSGRMLLGTPAAGAPAITVP ATAAAALMVGIVVSAALGITAGPLADLLGIAASNVGLP" gene 95414..96892 /gene="hycE" /locus_tag="Rv0087" /db_xref="GeneID:886956" CDS 95414..96892 /gene="hycE" /locus_tag="Rv0087" /function="INVOLVED IN HYDROGEN METABOLISM; FHL PATHWAY." /note="Rv0087, (MTCY251.05), len: 492 aa. Possible hycE (alternate gene name: hevE), formate hydrogenlyase (EC 1.-.-.-), similar to others e.g. HYCE_ECOLI|P16431 formate hydrogenlyase subunit 5 from Escherichia coli (569 aa), FASTA scores: opt: 680, E(): 1.8e-38, (31.2% identity in 449 aa overlap); NP_457243.1|NC_003198 formate hydrogenlyase subunit 5 from Salmonella enterica subsp. enterica serovar Typhi (569 aa); NP_275541.1|NC_000916 formate hydrogenlyase subunit 5 from Methanothermobacter thermautotrophicus (370 aa); etc. Also some similarity with NUOD_ECOLI|P33600 NADH dehydrogenase I chain D from Escherichia coli (407 aa), FASTA scores: opt: 245, E(): 8.9e-10, (24.5% identity in 368 aa overlap). BELONGS TO THE COMPLEX I 49 kDa SUBUNIT FAMILY.; hevE" /codon_start=1 /transl_table=11 /product="formate hydrogenase HycE" /protein_id="NP_214601.1" /db_xref="GI:15607229" /db_xref="GeneID:886956" /translation="MMSASWLRHRVSERGLIATAEQLWADSFRLALVAAHDDGDSLRV VYLFLAGYPDRRVELEYVVPADNPEIRSLAYLSFPAGRFEREMADLYGIRPVGHPKPR RLVRHAHWPDWHPMRTDAGPAPEFTDTGAFPFLAVEGPGVYEIPVGPVHAGLIEPGHF RFSVAGETIVRLKARLWFVHRGIEKLFHGRPATAAVDLAERISGDTSAAHALAHSLAI EDALGIELPHEVHRLRALIVELERLYNHAADLGALANDVGYSLANAHAQRIRENLLRR NAAVTGHRLLRGAIRAGGVALRALPDTDELAALAVDLAEVATLTLANSVVYDRFAGTA VLHPDDASALGCLGYVARASGLRSDARVEHPTIVLPITEIGAPDGDVLARYTVRRDEF AASAALAQHIVESHTGPIEYAATLHPVGAPSSGIGIVEGWRGTIVHRVEIDVDGRITR AKVVDPSWFNWPALPVAMADTIVPDFPLANKSFNQSYAGNDL" gene 96927..97601 /locus_tag="Rv0088" /db_xref="GeneID:886954" CDS 96927..97601 /locus_tag="Rv0088" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0088, (MTCY251.06), len: 224 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214602.1" /db_xref="GI:15607230" /db_xref="GeneID:886954" /translation="MSVYKHAPSRVRLRQTRSTVVKGRSGSLSWRRVRTGDLGLAVWG GREEYRAVKPGTPGIQPKGDMMTVTVVDAGPGRVSRSVEVAAPAAELFAIVADPRRHR ELDGSGTVRGNIKVPAKLVVGSKFSTKMKLFGLPYRITSRVTALKPNELVEWSHPLGH RWRWEFESLSPTLTRVTETFDYHAAGAIKNGLKFYEMTGFAKSNAAGIEATLAKLSDQ YARGRA" gene 97758..98351 /locus_tag="Rv0089" /db_xref="GeneID:886949" CDS 97758..98351 /locus_tag="Rv0089" /EC_number="2.1.1.-" /function="THOUGHT TO CAUSE METHYLATION." /note="Rv0089, (MTCY251.07), len: 197 aa. Possible methyltransferase (EC 2.1.1.-), showing some weak similarity to others e.g. NP_299749.1|NC_002488 3-demethylubiquinone-9 3-methyltransferase from Xylella fastidiosa 9a5c (246 aa); CAC44277.1| (AL596030) putative methyltransferase from Streptomyces coelicolor (285 aa); NP_111415.1|NC_002689 Predicted SAM-dependent methyltransferase from Thermoplasma volcanium (245 aa); etc. Also some similarity with many biotin biosynthesis proteins e.g. P12999|BIOC_ECOLI|B0777 BIOTIN SYNTHESIS PROTEIN from Escherichia coli (251 aa), FASTA scores: opt: 202, E(): 4.5e-07, (39.0% identity in 118 aa overlap); etc. BELONGS TO THE METHYLTRANSFERASE SUPERFAMILY." /codon_start=1 /transl_table=11 /product="methyltransferase/methylase" /protein_id="NP_214603.1" /db_xref="GI:15607231" /db_xref="GeneID:886949" /translation="MDQPWNANIHYDALLDAMVPLGTQCVLDVGCGDGLLAARLARRI PYVTAVDIDAPVLRRAQTRFANAPIRWLHADIMTAELPNAGFDAVVSNAALHHIEDTR TALSRLGGLVTPGGTLAVVTFVTPSLRNGLWHLTSWVACGMANRVKGKWEHSAPIKWP PPQTLHELRSHVRALLPGACIRRLLYGRVLVTWRAPV" gene 98480..99250 /locus_tag="Rv0090" /db_xref="GeneID:886961" CDS 98480..99250 /locus_tag="Rv0090" /function="UNKNOWN" /note="Rv0090, (MTCY251.08), len: 256 aa. Possible membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214604.1" /db_xref="GI:15607232" /db_xref="GeneID:886961" /translation="MAKNQNRIRNRWELITCGLGGHVTYAPDDAALAARLRASTGLGE VWRCLRCGDFALGGPQGRGAPEDAPLIMRGKALRQAIIIRALGVERLVRALVLALAAW AVWEFRGARGAIQATLDRDLPVLRAAGFKVDQMTVIHALEKALAAKPSTLALITGMLA AYAVLQAVEGVGLWLLKRWGEYFAVVATSIFLPLEVHDLAKGITTTRVVTFSINVAAV VYLLISKRLFGVRGGRKAYDVERRGEQLLDLERAAMLT" gene 99684..100451 /gene="mtn" /locus_tag="Rv0091" /db_xref="GeneID:886953" CDS 99684..100451 /gene="mtn" /locus_tag="Rv0091" /EC_number="3.2.2.16" /EC_number="3.2.2.9" /function="RESPONSIBLE FOR CLEAVAGE OF THE GLYCOSIDIC BOND IN BOTH 5'-METHYLTHIOADENOSINE (MTA) AND S-ADENOSYLHOMOCYSTEINE (SAH) [CATALYTIC ACTIVITY 1: Methylthioadenosine + H2O = adenine + 5-methylthio-D-ribose] [CATALYTIC ACTIVITY 2: S-adenosyl-L-homocysteine + H2O = adenine + S-D-ribosyl-L-homocysteine]." /note="Rv0091, (MTCY251.10), len: 255 aa. Probable mtn (alternate gene name: pfs), methylthioadenosine/S-Adenosylhomocysteine nucleosidase (MTA/SAH nucleosidase), including 5'-methylthioadenosine nucleosidase (EC 3.2.2.16) and S-adenosylhomocysteine nucleosidase (EC 3.2.2.9), similar to others e.g. NP_521493.1|NC_003295 PROBABLE BIFUNCTIONAL PROTEIN (MTA/SAH NUCLEOSIDASE) (P46): 5'-METHYLTHIOADENOSINE NUCLEOSIDASE AND S-ADENOSYLHOMOCYSTEINE NUCLEOSIDASE from Ralstonia solanacearum (261 aa); AAC45731.1|U55214 Pfs from Treponema pallidum (249 aa); P96122|MTN_TREPA MTA/SAH NUCLEOSIDASE from Treponema pallidum (269 aa); PFS_ECOLI|P24247 pfs protein (p46) from Escherichia coli (232 aa), FASTA scores: opt: 214, E(): 3.8e-08, (30.5% identity in 246 aa overlap); etc. BELONGS TO THE MTN FAMILY.; pfs" /codon_start=1 /transl_table=11 /product="bifunctional 5'-methylthioadenosine nucleosidase/S-adenosylhomocysteine nucleosidase" /protein_id="NP_214605.1" /db_xref="GI:15607233" /db_xref="GeneID:886953" /translation="MAVTVGVICAIPQELAYLRGVLVDAKRQQVAQILFDSGQLDAHR VVLAAAGMGKVNTGLTATLLADRFGCRTIVFTGVAGGLDPELCIGDIVIADRVVQHDF GLLTDERLRPYQPGHIPFIEPTERLGYPVDPAVIDRVKHRLDGFTLAPLSTAAGGGGR QPRIYYGTILTGDQYLHCERTRNRLHHELGGMAVEMEGGAVAQICASFDIPWLVIRAL SDLAGADSGVDFNRFVGEVAASSARVLLRLLPVLTAC" gene 100583..102868 /gene="ctpA" /locus_tag="Rv0092" /db_xref="GeneID:886946" CDS 100583..102868 /gene="ctpA" /locus_tag="Rv0092" /EC_number="3.6.3.-" /function="CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A CATION (POSSIBLY COPPER) WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + CATION(IN) = ADP + PHOSPHATE + CATION(OUT)]." /note="Rv0092, (MTCY251.11), len: 761 aa. Probable ctpA, cation-transporting P-type ATPase A (transmembrane protein) (EC 3.6.3.-), highly similar to others e.g. CTPA_MYCLE|P46839 cation-transporting P-type ATPase A from Mycobacterium leprae (780 aa), FASTA scores: opt: 3454, E(): 0, (74.4% identity in 741 aa overlap); CAB66270.1|AL136519 probable cation-transporting P-type ATPase from Streptomyces coelicolor (760 aa); NP_391230.1|NC_000964 protein similar to heavy metal-transporting ATPase from Bacillus subtilis (803 aa); etc. Also highly similar to MTCY251.22c from Mycobacterium tuberculosis, FASTA score: (68.3% identity in 742 aa overlap). Contains PS01047 Heavy-metal-associated domain, and PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB." /codon_start=1 /transl_table=11 /product="cation transporter P-type ATPase A" /protein_id="NP_214606.1" /db_xref="GI:15607234" /db_xref="GeneID:886946" /translation="MTTAVTGEHHASVQRIQLRISGMSCSACAHRVESTLNKLPGVRA AVNFGTRVATIDTSEAVDAAALCQAVRRAGYQADLCTDDGRSASDPDADHARQLLIRL AIAAVLFVPVADLSVMFGVVPATRFTGWQWVLSALALPVVTWAAWPFHRVAMRNARHH AASMETLISVGITAATIWSLYTVFGNHSPIERSGIWQALLGSDAIYFEVAAGVTVFVL VGRYFEARAKSQAGSALRALAALSAKEVAVLLPDGSEMVIPADELKEQQRFVVRPGQI VAADGLAVDGSAAVDMSAMTGEAKPTRVRPGGQVIGGTTVLDGRLIVEAAAVGADTQF AGMVRLVEQAQAQKADAQRLADRISSVFVPAVLVIAALTAAGWLIAGGQPDRAVSAAL AVLVIACPCALGLATPTAMMVASGRGAQLGIFLKGYKSLEATRAVDTVVFDKTGTLTT GRLQVSAVTAAPGWEADQVLALAATVEAASEHSVALAIAAATTRRDAVTDFRAIPGRG VSGTVSGRAVRVGKPSWIGSSSCHPNMRAARRHAESLGETAVFVEVDGEPCGVIAVAD AVKDSARDAVAALADRGLRTMLLTGDNPESAAAVATRVGIDEVIADILPEGKVDVIEQ LRDRGHVVAMVGDGINDGPALARADLGMAIGRGTDVAIGAADIILVRDHLDVVPLALD LARATMRTVKLNMVWAFGYNIAAIPVAAAGLLNPLVAGAAMAFSSFFVVSNSLRLRKF GRYPLGCGTVGGPQMTAPSSA" misc_feature 100640..100726 /gene="ctpA" /locus_tag="Rv0092" /note="PS01047 Heavy-metal-associated domain" misc_feature 101909..101929 /gene="ctpA" /locus_tag="Rv0092" /note="PS00154 E1-E2 ATPases phosphorylation site" gene complement(102815..103663) /locus_tag="Rv0093c" /db_xref="GeneID:886945" CDS complement(102815..103663) /locus_tag="Rv0093c" /function="UNKNOWN" /note="Rv0093c, (MTCY251.12c), len: 282 aa. Probable conserved membrane protein, equivalent only to CAC30943.1|AL583924 probable integral membrane protein from Mycobacterium leprae (237 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214607.1" /db_xref="GI:15607235" /db_xref="GeneID:886945" /translation="MLAQATTAGSFNHHASTVLQGCRGVPAAMWSEPAGAIRRHCATI DGMDCEVAREALSARLDGERAPVPSARVDEHLGECSACRAWFTQVASQAGDLRRLAES RPVVPPVGRLGIRRAPRRQHSPMTWRRWALLCVGIAQIALGTVQGFGLDVGLTHQHPT GAGTHLLNESTSWSIALGVIMVGAALWPSAAAGLAGVLTAFVAILTGYVIVDALSGAV STTRILTHLPVVIGAVLAIMVWRSASGPRPRPDAVAAEPDIVLPDNASRGRRRGHLWP TDGSAA" gene complement(103710..104663) /locus_tag="Rv0094c" /db_xref="GeneID:886943" CDS complement(103710..104663) /locus_tag="Rv0094c" /function="UNKNOWN" /note="Rv0094c, (MTCY251.13c), len: 317 aa. Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214608.1" /db_xref="GI:15607236" /db_xref="GeneID:886943" /translation="MSTRQAAEADLAGKAAQYRPDELARYAQRVMDWLHPDGDLTDTE RARKRGITLSNQQYDGMSRLSGYLTPQARATFEAVLAKLAAPGATNPDDHTPVIDTTP DAAAIDRDTRSQAQRNHDGLLAGLRALIASGKLGQHNGLPVSIVVTTTLTDLQTGAGK GFTGGGTLLPMADVIRMTSHAHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIM LFANDRGCTKPGCDAPAYHSQAHHVTAWTSTGRTDITELTLACGPDNRLAEKGWTTHN NTHGHTEWLPPPHLDHGQPRTNTFHHPERFLHNQDDDDKPD" repeat_region complement(103713..105215) /note="REP-2, len: 1503 bp. REP251, member of REP13E12 family.; REP-2" /rpt_type=DIRECT gene complement(104805..105215) /locus_tag="Rv0095c" /db_xref="GeneID:886940" CDS complement(104805..105215) /locus_tag="Rv0095c" /function="UNKNOWN" /note="Rv0095c, (MTCY251.14c), len: 136 aa. Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214609.1" /db_xref="GI:15607237" /db_xref="GeneID:886940" /translation="MRYLPVSTRRIWVNPLCHFSFTVISGALFVSARRYDSNMLANSR EELVEVFDALDADLDRLDEVSFEVLSTPERLRSLERLECLARRLPAAQHTLINQLDTQ ASEEELGGTLCCALANRLRITKPEAGRRSAEAKP" gene 105324..106715 /gene="PPE1" /locus_tag="Rv0096" /db_xref="GeneID:886938" CDS 105324..106715 /gene="PPE1" /locus_tag="Rv0096" /function="UNKNOWN" /note="Rv0096, (MTCY251.15), len: 463 aa. Member of the Mycobacterium tuberculosis PPE family, similar to many e.g. Z46257|MLACEA_3 aceA gene for isocitrate L from Mycobacterium leprae (438 aa), FASTA scores: opt: 1207, E(): 0, (55.3% identity in 380 aa overlap). Also similar to Z97559|MTCY261_19 from Mycobacterium tuberculosis (473 aa), FASTA score: (40.2% identity in 478 aa overlap); YHS6_MYCTU|P42611 hypothetical 50.6 kDa protein (517 aa), FASTA scores: opt: 365, E(): 4.6e-12, (37.6% identity in 178 aa overlap). Also similar to MTCY274.23c from Mycobacterium tuberculosis, FASTA score: (31.1% identity in 383 overlap). Some similarity also to MTCY31.06c and MTCY48.17 and other mycobacterial PPE family proteins." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177690.1" /db_xref="GI:57116689" /db_xref="GeneID:886938" /translation="MAIPPEVHSGLLSAGCGPGSLLVAAQQWQELSDQYALACAELGQ LLGEVQASSWQGTAATQYVAAHGPYLAWLEQTAINSAVTAAQHVAAAAAYCSALAAMP TPAELAANHAIHGVLIATNFFGINTVPIALNEADYVRMWLQAADTMAAYQAVADAATV AVPSTQPAPPIRAPGGDAADTRLDVLSSIGQLIRDILDFIANPYKYFLEFFEQFGFSP AVTVVLALVALQLYDFLWYPYYASYGLLLLPFFTPTLSALTALSALIHLLNLPPAGLL PIAAALGPGDQWGANLAVAVTPATAAVPGGSPPTSNPAPAAPSSNSVGSASAAPGISY AVPGLAPPGVSSGPKAGTKSPDTAADTLATAGAARPGLARAHRRKRSESGVGIRGYRD EFLDATATVDAATDVPAPANAAGSQGAGTLGFAGTAPTTSGAAAGMVQLSSHSTSTTV PLLPTTWTTDAEQ" gene 106734..107603 /locus_tag="Rv0097" /db_xref="GeneID:886942" CDS 106734..107603 /locus_tag="Rv0097" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0097, (MTCY251.16), len: 289 aa. Possible oxidoreductase (EC 1.-.-.-), equivalent to NP_302343.1|NC_002677 putative oxidoreductase from Mycobacterium leprae (289 aa). Also highly similar to BAB69377.1|AB070955 putative oxidoreductase from Streptomyces avermitilis (296 aa); and weakly similar to others e.g. NP_518867.1|NC_003295 PUTATIVE ALPHA-KETOGLUTARATE-DEPENDENT TAURINE DIOXYGENASE OXIDOREDUCTASE PROTEIN from Ralstonia solanacearum (301 aa); NP_286110.1|NC_002655 taurine dioxygenase (2-oxoglutarate-dependent) from Escherichia coli strain O157:H7 (283 aa); NP_252624.1|NC_002516 taurine dioxygenase from Pseudomonas aeruginosa (277 aa); ECAE00014310 (283 aa), FASTA scores: opt: 304, E(): 2.6e-13, (27.8% identity in 288 aa overlap); TFDA_ALCEU|P10088 2,4-dichlorophenoxyacetate monooxygenase from A. eutropha (287 aa), FASTA scores: opt: 188, E(): 3.5e-06, (26.6% identity in 188 aa overlap); etc. Contains PS00077 Cytochrome c oxidase subunit I, copper B binding region signature." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214611.1" /db_xref="GI:15607239" /db_xref="GeneID:886942" /translation="MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKD VHPSPREFIKLGRIIGQIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDYMF MPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIR PSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDP EVLQELMAATGQLDPEYQSPFIHTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRL TMLDGLKTPGYAA" misc_feature 107474..107488 /locus_tag="Rv0097" /note="PS00077 Cytochrome c oxidase subunit I, copper B binding region signature" gene 107600..108151 /locus_tag="Rv0098" /db_xref="GeneID:886935" CDS 107600..108151 /locus_tag="Rv0098" /function="UNKNOWN" /note="Rv0098, (MTCY251.17), len: 183 aa. Conserved hypothetical protein, equivalent to CAC30948.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (183 aa). Also some similarity with BAB69378.1|AB070955 hypothetical protein from Streptomyces avermitilis (172 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214612.1" /db_xref="GI:15607240" /db_xref="GeneID:886935" /translation="MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDA QYSATEDSVLAYGNFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVL RGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCV IEFWDENGGAASGEIELAALNIP" gene 108156..109778 /gene="fadD10" /locus_tag="Rv0099" /db_xref="GeneID:886933" CDS 108156..109778 /gene="fadD10" /locus_tag="Rv0099" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_214613.1" /db_xref="GI:15607241" /db_xref="GeneID:886933" /translation="MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRY RELVAEVGGLAADLRAQSVSRGSRVLVISDNGPETYLSVLACAKLGAIAVMADGNLPI AAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRESEHSLDAASL AGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDILQKEGLNWVTWVVGETTY SPLPATHIGGLWWILTCLMHGGLCVTGGENTTSLLEILTTNAVATTCLVPTLLSKLVS ELKSANATVPSLRLVGYGGSRAIAADVRFIEATGVRTAQVYGLSETGCTALCLPTDDG SIVKIEAGAVGRPYPGVDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNP ERTAEVLIDGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGV REAACYEIPDEEFGALVGLAVVASAELDESAARALKHTIAARFRRESEPMARPSTIVI VTDIPRTQSGKVMRASLAAAATADKARVVVRG" misc_feature 108675..108710 /gene="fadD10" /locus_tag="Rv0099" /note="PS00455 Putative AMP-binding domain signature" gene 109783..110019 /locus_tag="Rv0100" /db_xref="GeneID:886931" CDS 109783..110019 /locus_tag="Rv0100" /function="UNKNOWN" /note="Rv0100, (MTCY251.19), len: 78 aa. Conserved hypothetical protein, equivalent only to CAC30950.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (78 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214614.1" /db_xref="GI:15607242" /db_xref="GeneID:886931" /translation="MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQ LGVNRQSELPSRLAANPSIAGWLRELEAVCTEFG" gene 110001..117539 /gene="nrp" /locus_tag="Rv0101" /db_xref="GeneID:886951" CDS 110001..117539 /gene="nrp" /locus_tag="Rv0101" /EC_number="6.-.-.-" /function="INVOLVED IN LIPID METABOLISM." /note="Rv0101, (MTCY251.20), len: 2512 aa. Probable nrp, peptide synthetase (EC 6.-.-.-), similar to others e.g. AAD44234.1|AF143772_40|PstB peptide synthetase from Mycobacterium avium (2552 aa); 7476034|S77657 cyclic peptide synthetase from Mycobacterium leprae (1401 aa), FASTA scores: opt: 4268, E(): 0, (65.7% identity in 1091 aa overlap); part of CAB55600.1|AJ238027 peptide synthetase from Mycobacterium smegmatis (5990). Also similar to e.g. AAD56240.1|AF184977_1|AF184977 DhbF protein from Bacillus subtilis (2378 aa); SRF1_BACSU|P27206 surfactin synthetase subunit 1 (3587 aa), FASTA scores: opt: 1708, E(): 0, (30.6% identity in 1633 aa overlap): etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), 2 x PS00455 Putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. THOUGHT TO BE NOT INVOLVED IN MYCOBACTIN BIOSYNTHESIS (see citation below)." /codon_start=1 /transl_table=11 /product="peptide synthetase" /protein_id="NP_214615.1" /db_xref="GI:15607243" /db_xref="GeneID:886951" /translation="MHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARFLA ALHATVLDNPVQLCVLENSGADYPDLVPRLRFGDIVRVGSADEHLQSTWCSGILGKPL VRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARYLTTDPAGETPSVGAGLAK LREAHRRETAKVEESRGRLSAVVQRELADEAYHGGHGHSVSDAPGTAAKGVLHESATI CGNAFDAILTLSEAQRVPLNVLVAAAAVAVDASLRQNTETLLVHTVDNRFGDSDLNVA TCLVNSVAQTVRFPPFASVSDVVRTLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVE ALTLNFIREPCAPGLRPFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLP ACKTHPKVAERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPARQPSTPAW FLDSARGVHQFLGRRRFVYPWVAWLVQRGAAPGDVLVFTDDDTDKTIDLLIACHLAGC GYSVCDTADEISVRTNAITEHGDGILVTVVDVAATQLAVVGHDELRKVVDERVTQVTH DALLATKTAYIMPTSGTTGQPKLVRISHGSLAVFCDAISRAYGWGAHDTVLQCAPLTS DISVEEIFGGAACGARLVRSAAMKTGDLAALVDDLVARETTIVDLPTAVWQLLCADGD AIDAIGRSRLRQIVIGGEAIRCSAVDKWLESAASQGISLLSSYGPTEATVVATFLPIV CDQTTMDGALLRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVTAADGSR RRAFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRIAEDPAVSDVAVELH SGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVSSFFVVGVPNIPRKPNGKIDSDN LPRLPQWSAAGLNTAETGQRAAGLSQIWSRQLGRAIGPDSSLLGEGIGSLDLIRILPE TRRYLGWRLSLLDLIGADTAANLADYAPTPDAPTGEDRFRPLVAAQRPAAIPLSFAQR RLWFLDQLQRPAPVYNMAVALRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPR QLVIEARRADLGCDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFRIADDEH VLVAVAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYVDYTLWQREILGD LDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVADQRGASLVVDWPASVQQQVR RIARQHNATSFMVVAAGLAVLLSKLSGSPDVAVGFPIAGRSDPALDNLVGFFVNTLVL RVNLAGDPSFAELLGQVRARSLAAYENQDVPFEVLVDRLKPTRALTHHPLIQVMLAWQ DNPVGQLNLGDLQATPMPIDTRTARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEA QAIDVLIERLRKVLVAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPVSI PQMLAAQVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPGECVALLFERC APAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVAAVTTAGLRSRLAGHDLPI IDVVDALAAYPGTPPPMPAAVNLAYILYTSGTTGEPKGVGITHRNVTRLFASLPARLS AAQVWSQCHSYGFDASAWEIWGALLGGGRLVIVPESVAASPNDFHGLLVAEHVSVLTQ TPAAVAMLPTQGLESVALVVAGEACPAALVDRWAPGRVMLNAYGPTETTICAAISAPL RPGSGMPPIGVPVSGAALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRF VACPFGGSGARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVATALAELAG VGQAVVIAREDRPGDKRLVGYATEIAPGAVDPAGLRAQLAQRLPGYLVPAAVVVIDAL PLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKTVAGIFARVLGLERVGVDDSFFELG GDSLAAMRVIAAINTTLNADLPVRALLHASSTRGLSQLLGRDARPTSDPRLVSVHGDN PTEVHASDLTLDRFIDADTLATAVNLPGPSPELRTVLLTGATGFLGRYLVLELLRRLD VDGRLICLVRAESDEDARRRLEKTFDSGDPELLRHFKELAADRLEVVAGDKSEPDLGL DQPMWRRLAETVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKLKPFTYVST ADVGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEVLLREANDLCALPVA VFRCGMILADTSYAGQLNMSDWVTRMVLSLMATGIAPRSFYEPDSEGNRQRAHFDGLP VTFVAEAIAVLGARVAGSSLAGFATYHVMNPHDDGIGLDEYVDWLIEAGYPIRRIDDF AEWLQRFEASLGALPDRQRRHSVLPMLLASNSQRLQPLKPTRGCSAPTDRFRAAVRAA KVGSDKDNPDIPHVSAPTIINYVTNLQLLGLL" misc_feature 110070..110093 /gene="nrp" /locus_tag="Rv0101" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 111729..111764 /gene="nrp" /locus_tag="Rv0101" /note="PS00455 Putative AMP-binding domain signature" misc_feature 114906..114941 /gene="nrp" /locus_tag="Rv0101" /note="PS00455 Putative AMP-binding domain signature" misc_feature 116040..116087 /gene="nrp" /locus_tag="Rv0101" /note="PS00012 Phosphopantetheine attachment site" gene 117714..119699 /locus_tag="Rv0102" /db_xref="GeneID:886926" CDS 117714..119699 /locus_tag="Rv0102" /function="UNKNOWN" /note="Rv0102, (MTCY251.21), len: 661 aa. Probable conserved integral membrane protein, highly similar to P53525|Y102_MYCLE|ML1998|NP_302349.1|NC_002677 possible membrane protein from Mycobacterium leprae (659 aa), FASTA scores: opt: 3107, E(): 0, (70.2% identity in 662 aa overlap). Also similar to others e.g. CAC01497.1|AL391017 putative integral membrane protein from Streptomyces coelicolor (316 aa); etc. Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214616.1" /db_xref="GI:15607244" /db_xref="GeneID:886926" /translation="MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLV SGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALCLGALIHVVMTAKPEPDGLIDAA AFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVF AVAFATLTGLKIAAALAGTTPSRAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFA RLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMAAMASIAAMAVMTAP RFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRR GNSWPVGRLIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGP VTLALRVLPVTGDGRPPGAREWLTWLLHSRVTTFLSHPITAFVLFVASPYIVYFTPLF DTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMPFHAFFG IALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWAR QDRRVASREDRHADSDYADDELEAYNAMLRELSRMRR" misc_feature 119085..119102 /locus_tag="Rv0102" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene complement(119915..122173) /gene="ctpB" /locus_tag="Rv0103c" /db_xref="GeneID:886928" CDS complement(119915..122173) /gene="ctpB" /locus_tag="Rv0103c" /EC_number="3.6.3.-" /function="CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A CATION (POSSIBLY COOPPER) WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + CATION(IN) = ADP + ORTHOPHOSPHATE + CATION(OUT)]." /note="Rv0103c, (MTCY251.22c), len: 752 aa. Probable ctpB, cation-transporting P-type ATPase B (transmembrane protein) (EC 3.6.3.-), equivalent to CTPB_MYCLE|P46840 cation-transporting P-type ATPase B from Mycobacterium leprae (750 aa), FASTA scores: opt: 3615, E(): 0, (76.5% identity in 752 aa overlap). Also highly similar to others e.g. CAB96031.1|AL360055 putative metal transporter ATPase from Streptomyces coelicolor (753 aa); NP_241423.1|NC_002570 copper-transporting ATPase from Bacillus halodurans (806 aa); etc. Also highly similar to Z46257|MLACEA_7 aceA gene for isocitrate L from Mycobacterium leprae (750 aa), FASTA scores: opt: 3615, E():0, (76.5% identity in 752 aa overlap). And similar to MTCY251.11 from Mycobacterium tuberculosis, FASTA score: (68.3% identity in 742 aa overlap). Contains PS01047 Heavy-metal-associated domain, PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB." /codon_start=1 /transl_table=11 /product="cation-transporter P-type ATPase B" /protein_id="NP_214617.1" /db_xref="GI:15607245" /db_xref="GeneID:886928" /translation="MAAPVVGDADLQSVRRIRLDVLGMSCAACASRVETKLNKIPGVR ASVNFATRVATIDAVGMAADELCGVVEKAGYHAAPHTETTVLDKRTKDPDGAHARRLL RRLLVAAVLFVPLADLSTLFAIVPSARVPGWGYILTALAAPVVTWAAWPFHSVALRNA RHRTTSMETLISVGIVAATAWSLSSVFGDQPPREGSGIWRAILNSDSIYLEVAAGVTV FVLAGRYFEARAKSKAGSALRALAELGAKNVAVLLPDGAELVIPASELKKRQRFVTRP GETIAADGVVVDGSAAIDMSAMTGEAKPVRAYPAASVVGGTVVMDGRLVIEATAVGAD TQFAAMVRLVEQAQTQKARAQRLADHIAGVFVPVVFVIAGLAGAAWLVSGAGADRAFS VTLGVLVIACPCALGLATPTAMMVASGRGAQLGIFIKGYRALETIRSIDTVVFDKTGT LTVGQLAVSTVTMAGSGTSERDREEVLGLAAAVESASEHAMAAAIVAASPDPGPVNGF VAVAGCGVSGEVGGHHVEVGKPSWITRTTPCHDAALVSARLDGESRGETVVFVSVDGV VRAALTIADTLKDSAAAAVAALRSRGLRTILLTGDNRAAADAVAAQVGIDSAVADMLP EGKVDVIQRLREEGHTVAMVGDGINDGPALVGADLGLAIGRGTDVALGAADIILVRDD LNTVPQALDLARATMRTIRMNMIWAFGYNVAAIPIAAAGLLNPLIAGAAMAFSSFFVV SNSLRLRNFGAQ" misc_feature complement(120818..120838) /gene="ctpB" /locus_tag="Rv0103c" /note="PS00154 E1-E2 ATPases phosphorylation site" misc_feature complement(122027..122113) /gene="ctpB" /locus_tag="Rv0103c" /note="PS01047 Heavy-metal-associated domain" gene 122317..123831 /locus_tag="Rv0104" /db_xref="GeneID:886923" CDS 122317..123831 /locus_tag="Rv0104" /function="UNKNOWN" /note="Rv0104, (MTCY251.23), len: 504 aa. Conserved hypothetical protein, showing weak similarity with other cAMP-dependent protein kinases e.g. AAC37564.1|M65066 cAMP-dependent protein kinase RI-beta regulatory subunit from Homo sapiens (380 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214618.1" /db_xref="GI:15607246" /db_xref="GeneID:886923" /translation="MTPVTTFPLVDAILAGRDRNLDGVILIAAQHLLQTTHAMLRSLF RVGLDPRNVAVIGKCYSTHPGVVDAMRADGIYVDDCSDAYAPHESFDTQYTRHVERFF AESWARLTAGRTARVVLLDDGGSLLAVAGAMLDASADVIGIEQTSAGYAKIVGCALGF PVINIARSSAKLLYESPIIAARVTQTAFERTAGIDSSAAILITGAGAIGTALADVLRP LHDRVDVYDTRSGCMTPIDLPNAIGGYDVIIGATGATSVPASMHELLRPGVLLMSASS SDREFDAVALRRRTTPNPDCHADLRVADGSVDATLLNSGFPVNFDGSPMCGDASMALT MALLAAAVLYASVAVADEMSSDHPHLGLIDQGDIVASFLNIDVPLQALSRLPLLSIDG YRRLQVRSGYTLFRQGERADHFFVIESGELEALVDGKVILRLGAGDHFGEACLLGGMR RIATVRACEPSVLWELDGKAFGDALHGDAAMREIAYGVARTRLMHAGASESLMV" gene complement(123980..124264) /gene="rpmB" /locus_tag="Rv0105c" /db_xref="GeneID:886920" CDS complement(123980..124264) /gene="rpmB" /locus_tag="Rv0105c" /function="POSSIBLY INVOLVED IN A TRANSLATION MECHANISM." /note="required for 70S ribosome assembly" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L28" /protein_id="YP_177691.1" /db_xref="GI:57116690" /db_xref="GeneID:886920" /translation="MSARCQITGRTVGFGKAVSHSHRRTRRRWPPNIQLKAYYLPSED RRIKVRVSAQGIKVIDRDGHRGRRRAARAGSAPAHFARQAGSSLRTAAIL" gene 124374..125570 /locus_tag="Rv0106" /db_xref="GeneID:886919" CDS 124374..125570 /locus_tag="Rv0106" /function="UNKNOWN" /note="Rv0106, (MTCY251.25), len: 398 aa. Conserved hypothetical protein, similar to others e.g. AL049841|SCE9_33 from Streptomyces coelicolor (370 aa), FASTA scores: opt: 282, E(): 2.5e-11, (32.0% identity in 381 aa overlap); etc. Some similarity to P94400 HOMOLOGUE TO NITRILE HYDRATASE REGION from Bacillus subtilis (397 aa), FASTA scores: opt: 226, E(): 5.4e-08, (26.4% identity in 405 aa overlap). Also similar to COBW_PSEDE|P29937 FASTA score: (25.3% identity in 186 aa overlap); and P47K_PSECL|P31521 47 kDa protein (p47k) (419 aa), FASTA score: (25.9% identity in 401 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214620.1" /db_xref="GI:15607248" /db_xref="GeneID:886919" /translation="MRTPVILVAGQDHTDEVTGALLRRTGTVVVEHRFDGHVVRRMTA TLSRGELITTEDALEFAHGCVSCTIRDDLLVLLRRLHRRDNVGRIVVHLAPWLEPQPI CWAIDHVRVCVGHGYPDGPAALDVRVAAVVTCVDCVRWLPQSLGEDELPDGRTVAQVT VGQAEFADLLVLTHPEPVAVAVLRRLAPRARITGGVDRVELALAHLDDNSRRGRTDTP HTPLLAGLPPLAADGEVAIVEFSARRPFHPQRLHAAVDLLLDGVVRTRGRLWLANRPD QVMWLESAGGGLRVASAGKWLAAMAASEVAYVDLERRLFADLMWVYPFGDRHTAMTVL VCGADPTDIVNALNAALLSDDEMASPQRWQSYVDPFGDWHDDPCHEMPDAAGEFSAHR NSGESR" gene complement(125643..130541) /gene="ctpI" /locus_tag="Rv0107c" /db_xref="GeneID:886915" CDS complement(125643..130541) /gene="ctpI" /locus_tag="Rv0107c" /EC_number="3.6.3.-" /function="CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A CATION (POSSIBLY MAGNESIUM) WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + CATION(IN) = ADP + PHOSPHATE + CATION(OUT)]." /note="Rv0107c, (MTCY251.26c, MTV031.01c), len: 1632 aa. Probable ctpI, cation-transporting ATPase I P-type (EC 3.6.3.-), highly similar to NP_302704.1|NC_002677 probable cation transport ATPase from Mycobacterium leprae (1609 aa); and similar to others e.g. CAB69720.1|AL137166 putative transport ATPase from Streptomyces coelicolor (1472 aa); ATA1_SYNY|P37367 cation-transporting ATPase pma1 from Synechocystis sp. (915 aa), FASTA scores: opt: 603, E(): 6.6e-29, (32.4% identity in 710 aa overlap); etc. Also similar to MTCY39.21c and MTCY22G10.22c from Mycobacterium tuberculosis, FASTA score: (34.4% identity in 796 aa overlap). Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES)." /codon_start=1 /transl_table=11 /product="cation-transporter ATPase I" /protein_id="NP_214621.1" /db_xref="GI:15607249" /db_xref="GeneID:886915" /translation="MKIPGVATVLGGVTNGVAQTVRAGARLPGSAAAAVQTLASPVLE LTGPVVQSVVQTTGRAIGVRGSHNESPDGMTPPVRWRSGRRVHFDLDPLLPFPRWHEH AAMVEEPVRRIPGVAEAHVEGSLGRLVVELEPDADSDIAVDEVRDVVSAVAADIFLAG SVSSPNSAPFADPGNPLAILVPLTAAAMDLVAMGATVTGWVARLPAAPQTTRALAALI NHQPRMVSLMESRLGRVGTDIALAATTAAANGLTQSLGTPLLDLVQRSLQISEAAAHR RVWRDREPALASPRRPQAPVVPIISSAGAKSQEPRHSWAAAAAGEASHVVVGGSIDAA IDTAKGSRAGPVEQYVNQAANGSLIAAASALVAGGGTEDAAGAILAGVPRAAHMGRQA FAAVLGRGLANTGQLVLDPGALRRLDRVRVVVIDGAALRGDNRAVLHAQGDEPGWDDD RVYEVADALLHGEQAPEPDPDELPATGARLRWAPAQGPSATPAQGLEHADLVVDGQCV GSVDVGWEVDPYAIPLLQTAHRTGARVVLRHVAGTEDLSASVGSTHPPGTPLLKLVRE LRADRGPVLLITAVHRDFASTDTLAALAIADVGVALDDPRGATPWTADLITGTDLAAA VRILSALPVARAASESAVHLAQGGTTLAGLLLVTGEQDKTTNPASFRRWLNPVNAAAA TALVSGMWSAAKVLRMPDPTPQPLTAWHALDPEIVYSRLAGGSRPLAVEPGIPAWRRI LDDLSYEPVMAPLRGPARTLAQLAVATRHELADPLTPILAVGAAASAIVGSNIDALLV AGVMTVNAITGGVQRLRAEAAAAELFAEQDQLVRRVVVPAVATTRRRLEAARHATRTA TVSAKSLRVGDVIDLAAPEVVPADARLLVAEDLEVDESFLTGESLPVDKQVDPVAVND PDRASMLFEGSTIVAGHARAIVVATGVGTAAHRAISAVADVETAAGVQARLRELTSKV LPMTLAGGAAVTALALLRRASLRQAVADGVAIAVAAVPEGLPLVATLSQLAAAQRLTA RGALVRSPRTIEALGRVDTICFDKTGTLTENRLRVVCALPSSTAAERDPLPQTTDAPS AEVLRAAARASTQPHNGEGHAHATDEAILAAASALAGSLSSQGDSEWVVLAEVPFESS RGYAAAIGRVGTDGIPMLMLKGAPETILPRCRLADPGVDHEHAESVVRHLAEQGLRVL AVAQRTWDNGTTHDDETDADAVDAVAHDLELIGYVGLADTARSSSRPLIEALLDAERN VVLITGDHPITARAIARQLGLPADARVVTGAELAVLDEEAHAKLAADMQVFARVSPEQ KVQIVAALQRCGRVTAMVGDGANDAAAIRMADVGIGVSGRGSSAARGAADIVLTDDDL GVLLDALVEGRSMWAGVRDAVTILVGGNVGEVLFTVIGTAFGAGRAPVGTRQLLLVNL LTDMFPALAVAVTSQFAEPDDAEYPTDDAAERAQREHRRAVLIGPTPSLDAPLLRQIV NRGVVTAAGATAAWAIGRWTPGTERRTATMGLTALVMTQLAQTLLTRRHSPLVIATAL GSAGVLVGIIQTPVISHFSGVPRWDRSPGRASSAPRQEPPQSQRWHRSGWQAQSVSCN LMNALTTRKTLTRVDRTYRRPR" misc_feature complement(127365..127385) /gene="ctpI" /locus_tag="Rv0107c" /note="PS00154 E1-E2 ATPases phosphorylation site" gene complement(130895..131104) /locus_tag="Rv0108c" /db_xref="GeneID:886918" CDS complement(130895..131104) /locus_tag="Rv0108c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0108c, (MTV031.02c), len: 69 aa. Hypothetical unknown protein. TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214622.1" /db_xref="GI:15607250" /db_xref="GeneID:886918" /translation="MVPVETLHSGDPITDVNGGGQRYIVLESKTVGDSCVVLELESRV NHQLQVIEKSFPAGYHVGRAHHRIL" gene 131382..132872 /gene="PE_PGRS1" /locus_tag="Rv0109" /db_xref="GeneID:886912" CDS 131382..132872 /gene="PE_PGRS1" /locus_tag="Rv0109" /function="UNKNOWN" /note="Rv0109, (MTV031.03c), len: 496 aa. Member of the M. tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many e.g. Q50615|Y0DP_MYCTU HYPOTHETICAL GLYCINE-RICH 40.8 kDa PROTEIN from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1772, E(): 0, (57.3% identity in 513 aa overlap); etc. TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177692.1" /db_xref="GI:57116691" /db_xref="GeneID:886912" /translation="MSLLITSPATVAAAATHLAGIGSALSTANAAAAAPTTALSVAGA DEVSVLIAALFEAYAQEYQALSAQALAFHDQFVQALNMGAVCYAAAETANATPLQALQ TVQQNVLTVVNAPTQALLGRPIIGNGANGLPNTGQDGGPGGLLFGNGGNGGSGGVDQA GGNGGAAGLIGNGGSGGVGGPGIAGSAGGAGGAGGLLFGNGGPGGAGGIGTTGDGGPG GAGGNAIGLFGSGGTGGMGGVGGMGGVGNGGNAGNGGTAGLFGHGGAGGAGGIGSADG GLGGGGGNGRFMGNGGVGGAGGYGASGDGGNAGNGGLGGVFGDGGAGGTGGLGDVNGG LAGIGGNAGFVRNGGAGGNGQLGSGAVSSAGGMGGNGGLVFGNGGPGGLGGPGTSAGN GGMGGNAVGLFGQGGAGGAGGSGFGAGIPGGRGGDGGSGGLIGDGGTGGGAGAGDAAA SAGGNGGNARLIGNGGDGGPGMFGGPGGAGGSGGTIFGFAGTPGPS" gene 133020..133769 /locus_tag="Rv0110" /db_xref="GeneID:886917" CDS 133020..133769 /locus_tag="Rv0110" /function="UNKNOWN" /note="Rv0110, (MTV031.04), len 249 aa. Probable conserved integral membrane protein, similar to many e.g. AL079308|SCH69_25 from Streptomyces coelicolor (297 aa), FASTA scores: opt: 552, E(): 6.1e-29, (45.4% identity in 251 aa overlap); P54493|YQGP_BACSU HYPOTHETICAL 56.4 KD PROTEIN from Bacillus subtilis (507 aa), FASTA scores: opt: 320, E(): 4e-15, (32.4% identity in 210 aa overlap); etc. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214624.1" /db_xref="GI:15607252" /db_xref="GeneID:886917" /translation="MRVGPVGHQCAECVREGARAVRQPRTPFGGRQRSATPVVTYTLI SLNALVFVMQVTVMGLERQLALWPPAVASGQTYRLVTSAFLHYGAMHLLLNMWALYVV GPPLEMWLGRLRFGALYAVSALGGSVLVYLIAPLNTATAGASGAVFGLFGATFMVARR LHLDVRWVVALIVINLAFTFLAPAISWQGHVGGLVTGALVAATYVYAPRERRNLIQAT VTITVLVAFVVLIGWRTVDLLALFGGRLNLS" gene 133950..136007 /locus_tag="Rv0111" /db_xref="GeneID:886909" CDS 133950..136007 /locus_tag="Rv0111" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0111, (MTV031.05), len: 685 aa. Possible transmembrane acyltransferase (EC 2.3.1.-), equivalent to AA22904.1|AL035300 putative acyltransferase from Mycobacterium leprae (696 aa). Also similar to others e.g. C69975 acyltransferase homolog yrhL from Bacillus subtilis (634 aa), FASTA scores: opt: 520, E(): 4e-22, (36.4% identity in 382 aa overlap). Very similar to Mycobacterium tuberculosis proteins Rv0228, Rv1254, Rv1565c, etc." /codon_start=1 /transl_table=11 /product="transmembrane acyltransferase" /protein_id="NP_214625.1" /db_xref="GI:15607253" /db_xref="GeneID:886909" /translation="MPARSVPRPRWVAPVRRVGRLAVWDRPERRSGIPALDGLRAIAV ALVLASHGGIPGMGGGFIGVDAFFVLSGFLITSLLLDELGRTGRIDLSGFWIRRARRL LPALVLMVLTVSAARALFPDQALTGLRSDAIAAFLWTANWRFVAQNTDYFTQGAPPSP LQHTWSLGVEEQYYVVWPLLLIGATLLLAARARRRCRRATVGGVRFAAFLIASLGTMA SATAAVAFTSAATRDRIYFGTDTRAQALLIGSAAAALLVRDWPSLNRGWCLIRTRWGR RIARLLPFVGLAGLAVTTHVATGSVGEFRHGLLIVVAGAAVIVVASVAMEQRGAVARI LAWRPLVWLGTISYGVYLWHWPIFLALNGQRTGWSGPALFAARCAATVVLAGASWWLI EQPIRRWRPARVPLLPLAAATVASAAAVTMLVVPVGAGPGLREIGLPPGVSAVAAVSP SPPEASQPAPGPRDPNRPFTVSVFGDSIGWTLMHYLPPTPGFRFIDHTVIGCSLVRGT PYRYIGQTLEQRAECDGWPARWSAQVNRDQPDVALLIVGRWETVDRVNEGRWTHIGDP TFDAYLNAELQRALSIVGSTGVRVMVTTVPYSRGGEKPDGRLYPEDQPERVNKWNAML HNAISQHSNVGMIDLNKKLCPDGVYTAKVDGIKVRSDGVHLTQEGVKWLIPWLEDSVR VAS" gene 136289..137245 /gene="gca" /locus_tag="Rv0112" /db_xref="GeneID:886907" CDS 136289..137245 /gene="gca" /locus_tag="Rv0112" /EC_number="4.2.1.47" /function="POSSIBLY INVOLVED IN SYNTHESIS OF A-BAND COMMON ANTIGEN LIPOPOLYSACCHARIDE. FIRST OF THE THREE STEPS IN THE BIOSYNTHESIS OF GDP-FUCOSE FROM GDP-MANNOSE [CATALYTIC ACTIVITY: GDP-mannose = GDP-4-dehydro-6-deoxy-D-mannose + H2O]." /note="Rv0112, (MTV031.06), len: 318 aa. Possible gca, GDP-mannose 4,6-dehydratase (EC 4.2.1.47), similar to others e g. U18320|PAU18320_1 GDP-D-mann from Pseudomonas aeruginosa (323 aa), FASTA scores: opt: 415, E(): 4.4e-21, (27.0% identity in 318 aa overlap). Similar to Rv3634c, Rv3784, etc from Mycobacterium tuberculosis. Contains PS00061 Short-chain dehydrogenases/reductases family signature. SEEMS TO BELONG TO THE GDP-MANNOSE 4,6-DEHYDRATASE FAMILY. COFACTOR: NAD(+)." /codon_start=1 /transl_table=11 /product="GDP-mannose 4,6-dehydratase" /protein_id="NP_214626.1" /db_xref="GI:15607254" /db_xref="GeneID:886907" /translation="MKVWITGAGGMMGSHLAEMLLAAGHDVYATYCRPTIDPSDLQFN GAEVDITDWCSVYDSIATFRPDAVFHLAAQSYPAVSWARPVETLTTNMVGTAIVFEAL RRVRPHAKIIVAGSSAEYGFVDPSEVPINERRELRPLHPYGVSKAATDMLAYQYHKSY GMHTVVARIFNCTGPRKVGDALSDFVRRCTWLEHHPEQSAIRVGNLKTKRTIVDVRDL NRALMLMLDKGEAGADYNVGGSIAYEMGDVLKQVIAACKRDDIVPEVDPALLRPTDEK IIYGDCSKLAAITGWQQEICLTQTIADMFDYWRSKSESALMV" misc_feature 136673..136759 /gene="gca" /locus_tag="Rv0112" /note="PS00061 Short-chain dehydrogenases/reductases family signature" gene 137319..137909 /gene="gmhA" /locus_tag="Rv0113" /db_xref="GeneID:886905" CDS 137319..137909 /gene="gmhA" /locus_tag="Rv0113" /EC_number="5.-.-.-" /function="INVOLVED IN BIOSYNTHESIS OF NUCLEOTIDE-ACTIVATED GLYCERO-MANNO-HEPTOSE: SYNTHESIS OF GLYCEROMANNOHEPTOSE 7-PHOSPHATE (INNER CORE LIPOPOLYSACCHARIDE BIOSYNTHESIS) [CATALYTIC ACTIVITY: D-SEDOHEPTULOSE 7-PHOSPHATE = D-GLYCERO-ALPHA,BETA-D-MANNO-HEPTOSE 7-PHOSPHATE]." /note="catalyzes the isomerization of sedoheptulose 7-phosphate to D-glycero-D-manno-heptose 7-phosphate" /codon_start=1 /transl_table=11 /product="phosphoheptose isomerase" /protein_id="NP_214627.1" /db_xref="GI:15607255" /db_xref="GeneID:886905" /translation="MCTARTAEEIFVETIAVKTRILNDRVLLEAARAIGDRLIAGYRA GARVFMCGNGGSAADAQHFAAELTGHLIFDRPPLGAEALHANSSHLTAVANDYDYDTV FARALEGSARPGDTLFAISTSGNSMSVLRAAKTARELGVTVVAMTGESGGQLAEFADF LINVPSRDTGRIQESHIVFIHAISEHVEHALFAPRQ" gene 137941..138513 /gene="gmhB" /locus_tag="Rv0114" /db_xref="GeneID:886903" CDS 137941..138513 /gene="gmhB" /locus_tag="Rv0114" /EC_number="2.-.-.-" /function="INVOLVED IN BIOSYNTHESIS OF NUCLEOTIDE-ACTIVATED GLYCERO-MANNO-HEPTOSE. INVOLVED IN TWO PATHWAYS, D-ALPHA-D PATHWAY [CATALYTIC ACTIVITY: D-GLYCERO-ALPHA-D-MANNO-HEPTOSE 1,7-BIPHOSPHATE = D-GLYCERO-ALPHA-D-MANNO-HEPTOSE 1-PHOSPHATE] AND L-BETA-D PATHWAY [CATALYTIC ACTIVITY: D-GLYCERO-BETA-D-MANNO-HEPTOSE 1,7-BIPHOSPHATE = D-GLYCERO-BETA-D-MANNO-HEPTOSE 1-PHOSPHATE]." /note="Rv0114, (MTV031.08), len: 190 aa. Possible gmhB, D-alpha,beta-D-heptose-1,7-biphosphate phosphatase (EC 2.-.-.-) (see citation below), similar to several hypothetical proteins and phosphatases e.g. HIS7_ECOLI|P06987 imidazoleglycerol-phosphate dehydratase (355 aa), FASTA scores: opt: 250, E(): 3.6e-11, (34.0 % identity in 141 aa overlap)." /codon_start=1 /transl_table=11 /product="D-alpha,beta-D-heptose-1,7-biphosphate phosphatase" /protein_id="NP_214628.1" /db_xref="GI:15607256" /db_xref="GeneID:886903" /translation="MVAERAGHQWCLFLDRDGVINRQVVGDYVRNWRQFEWLPGAARA LKKLRAWAPYIVVVTNQQGVGAGLMSAVDVMVIHRHLQMQLASDGVLIDGFQVCPHHR SQRCGCRKPRPGLVLDWLGRHPDSEPLLSIVVGDSLSDLELAHNVAAAAGACASVQIG GASSGGVADASFDSLWEFAVAVGHARGERG" gene 138513..139673 /gene="hddA" /locus_tag="Rv0115" /db_xref="GeneID:886902" CDS 138513..139673 /gene="hddA" /locus_tag="Rv0115" /EC_number="2.-.-.-" /function="INVOLVED IN BIOSYNTHESIS OF NUCLEOTIDE-ACTIVATED GLYCERO-MANNO-HEPTOSE (D-ALPHA-D PATHWAY) [CATALYTIC ACTIVITY: D-GLYCERO-ALPHA,BETA-D-MANNO-HEPTOSE 7-PHOSPHATE + ATP = D-GLYCERO-ALPHA-D-MANNO-HEPTOSE 1,7-BIPHOSPHATE]." /note="Rv0115, (MTV031.09), len: 386 aa. Possible hddA, D-alpha-D-heptose-7-phosphate kinase (EC 2.-.-.-) (see citation below), similar to several hypothetical proteins and sugar kinases e.g. AAK27850.1|AF324836_3 D-glycero-D-manno-heptose 7-phosphate kinase from Aneurinibacillus thermoaerophilus (341 aa); AAK80995.1|AE007802_11 Sugar kinase from Clostridium acetobutylicum (364 aa). TBparse score is 0.951." /codon_start=1 /transl_table=11 /product="D-alpha-D-heptose-7-phosphate kinase" /protein_id="NP_214629.1" /db_xref="GI:15607257" /db_xref="GeneID:886902" /translation="MAILRGRAPLRLGLGGGGTDVEPYSSQFGGRILSVTIDKYAYAF AERGTGDEIAFRSPDRDRAGQASIDDLASLEEDFPLHVAVYRRVIAEFNGGTPFPLQL ATQVDAPPGSGLGSSSALVVAMLLTTCALIGSSPGPYELARLAWEIERVDLGMAGGWQ DHYAAAFGGFNFMESRPNGEVVVNPLRIRREVIAELEASLLLYFGGVSRLSSEVIADQ QRNVVERDADALAATHSICAEALEMKDLLVVGDIPGFADSLLRGWQAKKRTSTRISNP AIEHAYQVAQSSGMVAGKVSGAGGGGFLMMIVDPRRRIEVARSLERECGGSVAPCLFT KGGAVTWHIPESTAPVRRGVADAVASALGNAGILLCAGCVLATSHSTWRVPV" misc_feature 139188..139220 /gene="hddA" /locus_tag="Rv0115" /note="PS00435 Peroxidases proximal heme-ligand signature" gene complement(140267..141022) /locus_tag="Rv0116c" /db_xref="GeneID:886900" CDS complement(140267..141022) /locus_tag="Rv0116c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0116c, (MTV031.10c), len: 251 aa. Possible conserved membrane protein, showing similarity to several hypothetical mycobacterial proteins e.g. Rv1433 from Mycobacterium tuberculosis (271 aa); and Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa); to the C-terminal regions of others like Rv0192 from Mycobacterium tuberculosis (366 aa), FASTA scores: opt: 451, E(): 1.7e-21, (46.7% identity in 270 aa overlap); and Rv0192|Z97050|MTCI28_32 from Mycobacterium tuberculosis cosmid (366 aa), FASTA scores: opt: 699, E(): 0, (45.7% identity in 221 aa overlap). TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214630.1" /db_xref="GI:15607258" /db_xref="GeneID:886900" /translation="MRRVVRYLSVVVAITLMLTAESVSIATAAVPPLQPIPGVASVSP ANGAVVGVAHPVVVTFTTPVTDRRAVERSIRISTPHNTTGHFEWVASNVVRWVPHRYW PPHTRVSVGVQELTEGFETGDALIGVASISAHTFTVSRNGEVLRTMPASLGKPSRPTP IGSFHAMSKERTVVMDSRTIGIPLNSSDGYLLTAHYAVRVTWSGVYVHSAPWSVNSQG YANVSHGCINLSPDNAAWYFDAVTVGDPIEVVG" gene 141200..142144 /gene="oxyS" /locus_tag="Rv0117" /db_xref="GeneID:886914" CDS 141200..142144 /gene="oxyS" /locus_tag="Rv0117" /function="COULD EFFECT FUNCTIONS OF OXYR DURING EVOLUTION." /experiment="experimental evidence, no additional details recorded" /note="Rv0117, (MTV031.11), len: 314 aa. OxyS, oxidative stress response protein regulatory protein, LysR family (see citation below). Similar to many transcription regulators and OxyR, the oxidative stress response protein of many bacteria. Contains LysR family signature at N-terminus. Also contains helix-turn-helix motif at aa 16-37 (Score 1543, +4.44 SD). BELONGS TO THE LYSR FAMILY OF TRANSCRIPTIONAL REGULATORS. OXYR IS REQUIRED FOR THE INDUCTION OF A REGULON OF HYDROGEN PEROXIDE INDUCIBLE GENES SUCH AS CATALASE, GLUTATHIONE-REDUCTASE, ETC." /codon_start=1 /transl_table=11 /product="oxidative stress response regulatory protein OXYS" /protein_id="NP_214631.1" /db_xref="GI:15607259" /db_xref="GeneID:886914" /translation="MLFRQLEYFVAVAQERHFARAAEKCYVSQPALSSAIAKLERELN VTLINRGHSFEGLTREGERLVVWAKRILAEHAAFKAEVDAVRSGITGTLRLGTVPTAS TTASLVLSAFCSAHPLAKVQVCSRLAATELYRRLREFELDAVIVHPETQDSDDVDLVP LYEEQYVLLSPADMLPPGTSTLVWRDAAQLPLALLTADMRDRQVIDAAFADHAVSAIP QVETDSVASLFAQVATGNWASIVPHTWLWAMPMSGPTGGEIRAVELVDPVLKAQIALA TNALGPGSPVARALITCAQALALNEFFDTQLRGITRRR" misc_feature 141248..141340 /gene="oxyS" /locus_tag="Rv0117" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene complement(142128..143876) /gene="oxcA" /locus_tag="Rv0118c" /db_xref="GeneID:886898" CDS complement(142128..143876) /gene="oxcA" /locus_tag="Rv0118c" /EC_number="4.1.1.8" /function="INVOLVED IN CATABOLISM OF OXALIC ACID [CATALYTIC ACTIVITY: Oxalyl-CoA = formyl-CoA + CO2]." /note="catalyzes the formation of formyl-CoA from oxalyl-CoA" /codon_start=1 /transl_table=11 /product="putative oxalyl-CoA decarboxylase" /protein_id="NP_214632.1" /db_xref="GI:15607260" /db_xref="GeneID:886898" /translation="MTTRSASPCTVLTDGCHLVVDALKANDVDTIYGVVGIPITDLAR AAQASGIRYIGFRHEASAGNAAAAAGFLTARPGVCLTTSGPGFLNGLPALANATTNCF PMIQISGSSSRPMVDLQRGDYQDLDQLNAARPFVKAAYRIGQVQDIGRGVARAIRTAT SGRPGGVYLDIPGDVLGQAVEASAASGAIWRPVDPAPRLLPAPEAIDRALDVLAQAQR PLLVLSKGAAYAQADNVIREFVEHTGIPFLPMSMAKGLLPDSHPQSAAAARSLAMARA DVVLLVGARLNWLLGNGESPQWSADAKFIQVDIEASEFDSNRPIVAPLTGDIGSVMSA LLEAAADRSSVASAAWTGELADRKARNSAKMRRRLADDHHPMRFYNALGAIRSVLQRN PDVYVVNEGANALDLARNIIDMHLPRHRLDSGTWGVMGIGMGYAIAAAVETGRPVVAI EGDSAFGFSGMEFETICRYRLPVTVVILNNGGVYRGDEATIFRSAAPVWRHDPAPTVL NAHARHELIAEAFGGKGYHVSTPTELESALTDALASNGPSLIDCELDPADGVESGHLA KLNTTSAATPAISGDG" gene 144049..145626 /gene="fadD7" /locus_tag="Rv0119" /db_xref="GeneID:886896" CDS 144049..145626 /gene="fadD7" /locus_tag="Rv0119" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_214633.1" /db_xref="GI:15607261" /db_xref="GeneID:886896" /translation="MASDFGPRIADLVEVAATRLPEAPALVVTADRIAISHRDLARLV DELAGQLTRSGLLPGDRVALRMGSNAEFVVALLAASRADLVVVPLDPALPITEQRVRS QAAGARVVLIDADGPHDRAEPTTRWWPLTVNVGGDSGPSGGTLSVHLDAATEPNPATS TPEGLRPDDAMIMFTGGTTGLPKMVPWTHANIASSVRAIITGYRLSPRDATVAVMPLY HGHGLIASLLATLASGGAVSLPARGRFSAHTFWDDIKAVGATWYTAVPTIHQILLERS ATEPSGRKPAALRFIRSCSAPLTAQAALALQTEFAAPVVCAFGMTEATHQVTTTQIEG IDQTETPVVSTGLVGRSTGAQIRIVGSDGLPLPAGAVGEIWLRGTTVVRGYLGDPTIT AANFTDGWLRTGDLGSLSAAGDLSIRGRIKELINRGGEKISPERVEGVLASHPNVMEA AVFGVPHQLYGEAVAAVIVPRESAPPTREELVQFCRERLAAFEIPASFQEASGLPHTA KGSLDRRAVAERFGHSV" misc_feature 144562..144597 /gene="fadD7" /locus_tag="Rv0119" /note="PS00455 Putative AMP-binding domain signature, [LIVM FY].{2}[STG][STAG]G[ST][STEI][SG].[PASLIVM][KR], info count = 22.0" gene complement(145627..147771) /gene="fusA2" /locus_tag="Rv0120c" /gene_synonym="fusA" /db_xref="GeneID:886894" CDS complement(145627..147771) /gene="fusA2" /locus_tag="Rv0120c" /gene_synonym="fusA" /function="INVOLVED IN TRANSLATION MECHANISM. THIS PROTEIN MAY PROMOTE THE GTP-DEPENDENT TRANSLOCATION OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE RIBOSOME." /experiment="experimental evidence, no additional details recorded" /note="EF-G; promotes GTP-dependent translocation of the ribosome during translation; many organisms have multiple copies of this gene" /codon_start=1 /transl_table=11 /product="elongation factor G" /protein_id="NP_214634.1" /db_xref="GI:15607262" /db_xref="GeneID:886894" /translation="MADRVNASQGAAAAPTANGPGGVRNVVLVGPSGGGKTTLIEALL VAAKVLSRPGSVTEGTTVCDFDEAEIRQQRSVGLAVASLAYDGIKVNLVDTPGYADFV GELRAGLRAADCALFVIAANEGVDEPTKSLWQECSQVGMPRAVVITKLDHARANYREA LTAAQDAFGDKVLPLYLPSGDGLIGLLSQALYEYADGKRTTRTPAESDTERIEEARGA LIEGIIEESEDESLMERYLGGETIDESVLIQDLEKAVARGSFFPVIPVCSSTGVGTLE LLEVATRGFPSPMEHPLPEVFTPQGVPHAELACDNDAPLLAEVVKTTSDPYVGRVSLV RVFSGTIRPDTTVHVSGHFSSFFGGGTSNTHPDHDEDERIGVLSFPLGKQQRPAAAVV AGDICAIGKLSRAETGDTLSDKAEPLVLKPWTMPEPLLPIAIAAHAKTDEDKLSVGLG RLAAEDPTLRIEQNQETHQVVLWCMGEAHAGVVLDTLANRYGVSVDTIELRVPLRETF AGNAKGHGRHIKQSGGHGQYGVCDIEVEPLPEGSGFEFLDKVVGGAVPRQFIPNVEKG VRAQMDKGVHAGYPVVDIRVTLLDGKAHSVDSSDFAFQMAGALALREAAAATKVILLE PIDEISVLVPDDFVGAVLGDLSSRRGRVLGTETAGHDRTVIKAEVPQVELTRYAIDLR SLAHGAASFTRSFARYEPMPESAAARVKAGAG" misc_feature complement(147661..147684) /gene="fusA2" /locus_tag="Rv0120c" /gene_synonym="fusA" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(147908..148342) /locus_tag="Rv0121c" /db_xref="GeneID:886892" CDS complement(147908..148342) /locus_tag="Rv0121c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0121c, (MTCI418B.03c), len: 144 aa. Conserved hypothetical protein, showing some similarity with others proteins from Mycobacterium tuberculosis e.g. Rv1155, Rv1875, Rv2074, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214635.1" /db_xref="GI:15607263" /db_xref="GeneID:886892" /translation="MGEFDPKLRFAQSPVARLATSTPDGTPHLVPVVFALGARRPAEA TGADVIYTAVDAKRKTTQRLRRLANLEHNPRASVLVDSYADDWTQLWWVRADGVAAIH RDGEVMRAAYRLLRAKYAQYQSVPLNGPVIAIAVQRWASWHA" gene 148491..148859 /locus_tag="Rv0122" /db_xref="GeneID:886888" CDS 148491..148859 /locus_tag="Rv0122" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0122, (MTCI418B.04), len: 133 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214636.1" /db_xref="GI:15607264" /db_xref="GeneID:886888" /translation="MAGSVSAAAGIGWVGLNVTETNRDQCYRVERTTVDALTHPEYRV HTRGVQRVRVTRNARKHRVSKHRIVAAMRHCGVPVIQEDGSLYYQGRDTSGRLTEVVA VEADDGDLIITHAMPKEWKR" gene 148856..149224 /locus_tag="Rv0123" /db_xref="GeneID:886887" CDS 148856..149224 /locus_tag="Rv0123" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0123, (MTCI418B.05), len: 133 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214637.1" /db_xref="GI:15607265" /db_xref="GeneID:886887" /translation="MTKKPRNPADYVIGDDVEVSDVDLKQEEVYVDGERLTDERVEQM ASESLRLAREREANLIPGGKSLSGGSAHSPAVQVVVSKATHAKLKELARSRKMSVSKL LRPVLDEFVQRETGRILPRR" gene 149533..150996 /gene="PE_PGRS2" /locus_tag="Rv0124" /db_xref="GeneID:886883" CDS 149533..150996 /gene="PE_PGRS2" /locus_tag="Rv0124" /function="UNKNOWN" /note="Rv0124, (MTCI418B.06), len: 487 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many e.g. Y0DP_MYCTU|Q50615 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1730, E(): 0, (60.7% identity in 504 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177693.1" /db_xref="GI:57116692" /db_xref="GeneID:886883" /translation="MSFVSVAPEIVVAAATDLAGIGSAISAANAAAAAPTTAVLAAGA DEVSAAIAALFSGHAQAYQALSAQAAAFHQQFVQTLAGGAGAYAAAEAQVEQQLLAAI NAPTQALLGRPLIGNGADGAPGTGQAGGAGGILYGNGGNGGSGAAGQAGGAGGPAGLI GHGGSGGAGGSGAAGGAGGHGGWLWGNGGVGGSGGAGVGAGVAGGHGGAGGAAGLWGA GGGGGNGGNGADANIVSGGDGGLGGAGGGGGWLYGDGGAGGHGGQGAIGLGGGAGGDG GQGGAGRGLWGTGGAGGHGGQGGGTGGPPLPGQAGMGAAGGAGGLIGNGGAGGDGGVG ASGGVAGVGGAGGNAMLIGHGGAGGAGGDSSFANGAAGGAGGAGGHLFGNGGSGGHGG AVTAGNTGIGGAGGVGGDARLIGHGGAGGAGGDRAGALVGRDGGPGGNGGAGGQLYGN GGDGAPGTGGTLQAAVSGLVTALFGAPGQPGDTGQPG" gene 151148..152215 /gene="pepA" /locus_tag="Rv0125" /db_xref="GeneID:886924" CDS 151148..152215 /gene="pepA" /locus_tag="Rv0125" /EC_number="3.4.21.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS (SEEMS TO CLEAVE PREFERENTIALLY AFTER SERINE RESIDUES)." /experiment="experimental evidence, no additional details recorded" /note="Rv0125, (MTCI418B.07, MTB32A), len: 355 aa. Probable pepA (alternate gene name: mtb32a), serine protease (EC 3.4.21.-) (see Skeiky et al., 1999), highly similar to other proteases e.g. HHOB_ECOLI|P31137 protease hhob precursor (355 aa), FASTA scores: opt: 400, E(): 3.8e-14, (32.4% identity in 346 aa overlap). Also similar to Q50320 34 kDa PROTEIN PRECURSOR from Mycobacterium tuberculosis (361 aa), FASTA scores: opt: 1689, E(): 0, (70.7% identity in 362 aa overlap). Contains PS00135 Serine proteases, trypsin family, serine active site. Has a putative signal sequence at the N-terminus. BELONGS TO THE SERINE PROTEASE FAMILY.; mtb32a" /codon_start=1 /transl_table=11 /product="serine protease PepA" /protein_id="NP_214639.1" /db_xref="GI:15607267" /db_xref="GeneID:886924" /translation="MSNSRRRSLRWSWLLSVLAAVGLGLATAPAQAAPPALSQDRFAD FPALPLDPSAMVAQVGPQVVNINTKLGYNNAVGAGTGIVIDPNGVVLTNNHVIAGATD INAFSVGSGQTYGVDVVGYDRTQDVAVLQLRGAGGLPSAAIGGGVAVGEPVVAMGNSG GQGGTPRAVPGRVVALGQTVQASDSLTGAEETLNGLIQFDAAIQPGDSGGPVVNGLGQ VVGMNTAASDNFQLSQGGQGFAIPIGQAMAIAGQIRSGGGSPTVHIGPTAFLGLGVVD NNGNGARVQRVVGSAPAASLGISTGDVITAVDGAPINSATAMADALNGHHPGDVISVT WQTKSGGTRTGNVTLAEGPPA" misc_feature 151751..151786 /gene="pepA" /locus_tag="Rv0125" /note="PS00135 Serine proteases, trypsin family, serine active site" gene 152324..154129 /gene="treS" /locus_tag="Rv0126" /db_xref="GeneID:886881" CDS 152324..154129 /gene="treS" /locus_tag="Rv0126" /EC_number="5.4.99.-" /function="INVOLVED IN TREHALOSE BIOSYNTHESIS (PROTECTIVE EFFECT). CONVERTS MALTOSE TO TREHALOSE. Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway)." /experiment="experimental evidence, no additional details recorded" /note="Rv0126, (MTCI418B.08), len: 601 aa. treS, trehalose synthase (EC 5.4.99.-) (see citation below), highly similar to others e.g. CAA04601.2|AJ001205 putative trehalose synthase from Streptomyces coelicolor (566 aa); S71450|1536814|BAA11303.1|D78198 trehalose synthase maltose-specific from Pimelobacter sp. strain R48 (573 aa). Also similar to MAL1_DROME|P07191 possible maltase precursor (508 aa), FASTA scores: opt: 807, E(): 0, (33.7% identity in 504 aa overlap); and similar to proteins associated with amino-acid transport e.g. Q64319 rat protein which stimulates transport of cystine and dibasic and neutral amino acids (683 aa), FASTA scores: opt: 839, E(): 0, (32.0% identity in 531 aa overlap). Also similar to several other Mycobacterium tuberculosis proteins e.g. Rv2471 FASTA score: (31.7% identity in 164 aa overlap)." /codon_start=1 /transl_table=11 /product="trehalose synthase TRES" /protein_id="NP_214640.1" /db_xref="GI:15607268" /db_xref="GeneID:886881" /translation="MNEAEHSVEHPPVQGSHVEGGVVEHPDAKDFGSAAALPADPTWF KHAVFYEVLVRAFFDASADGSGDLRGLIDRLDYLQWLGIDCIWLPPFYDSPLRDGGYD IRDFYKVLPEFGTVDDFVALVDAAHRRGIRIITDLVMNHTSESHPWFQESRRDPDGPY GDYYVWSDTSERYTDARIIFVDTEESNWSFDPVRRQFYWHRFFSHQPDLNYDNPAVQE AMIDVIRFWLGLGIDGFRLDAVPYLFEREGTNCENLPETHAFLKRVRKVVDDEFPGRV LLAEANQWPGDVVEYFGDPNTGGDECHMAFHFPLMPRIFMAVRRESRFPISEIIAQTP PIPDMAQWGIFLRNHDELTLEMVTDEERDYMYAEYAKDPRMKANVGIRRRLAPLLDND RNQIELFTALLLSLPGSPVLYYGDEIGMGDVIWLGDRDGVRIPMQWTPDRNAGFSTAN PGRLYLPPSQDPVYGYQAVNVEAQRDTSTSLLNFTRTMLAVRRRHPAFAVGAFQELGG SNPSVLAYVRQVAGDDGDTVLCVNNLSRFPQPIELDLQQWTNYTPVELTGHVEFPRIG QVPYLLTLPGHGFYWFQLTTHEVGAPPTCGGERRL" repeat_region 154073..154125 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class III. See citation below." repeat_region 154126..154178 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region 154179..154231 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 154232..155599 /locus_tag="Rv0127" /db_xref="GeneID:886880" CDS 154232..155599 /locus_tag="Rv0127" /function="UNKNOWN" /note="Rv0127, (MTCI418B.09, MTCI5.01), len: 455 aa. Conserved hypothetical protein, highly similar to various proteins e.g. AJ0012|SCJ001205_4 hypothetical protein from Streptomyces coelicolor A3(2) (464 aa), FASTA scores: opt: 412, E(): 1.1e-19, (40.6% identity in 485 aa overlap); AJ0012|SCJ001206_5 hypothetical protein from Streptomyces coelicolor A3(2) (453 aa), FASTA scores: opt: 403, E(): 4.3 e-19, (36.5% identity in 455 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214641.1" /db_xref="GI:15607269" /db_xref="GeneID:886880" /translation="MTRSDTLATKLPWSDWLSRQRWYAGRNRELATVKPGVVVALRHN LDLVLVDVTYTDGATERYQVLVGWDFEPASEYGTKAAIGVADDRTGFDALYDVAGPQF LLSLIVSSAVCGTSTGEVTFTREPDVELPFAAQPRVCDAEQSNTSVIFDRRAILKVFR RVSSGINPDIELNRVLTRAGNPHVARLLGAYQFGRPNRSPTDALAYALGMVTEYEANA AEGWAMATASVRDLFAEGDLYAHEVGGDFAGESYRLGEAVASVHATLADSLGTAQATF PVDRMLARLSSTVAVVPELREYAPTIEQQFQKLAAEAITVQRVHGDLHLGQVLRTPES WLLIDFEGEPGQPLDERRAPDSPLRDVAGVLRSFEYAAYGPLVDQATDKQLAARAREW VERNRAAFCDGYAVASGIDPRDSALLLGAYELDKAVYETGYETRHRPGWLPIPLRSIA RLTAS" gene 155667..156446 /locus_tag="Rv0128" /db_xref="GeneID:886878" CDS 155667..156446 /locus_tag="Rv0128" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0128, (MTCI5.02), len: 259 aa. Probable conserved transmembrane protein, with some similarity to Rv3064c and other bacterial proteins e.g. AAK85977.1|AE007957|AGR_C_254p from Agrobacterium tumefaciens (206 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214642.1" /db_xref="GI:15607270" /db_xref="GeneID:886878" /translation="MQREIYDGEARLSWVLAALAGILGATAFTHSAGYFVTFMTGNSQ RAVLGLFGDDAWMSVTASLLILFFVAGVVIASVCRRHFWAAHPHGPTVLTTFSLIFAA GVDIMLGGWHESMLDFVPILFVVFGIGALNTSFVKDGEVSVPLSYVTGTLVKMGQGIE RHLAGGKVEDWLGYFLLHASFVLGAAAGGAISMVVTGPQMLAVAAVVCAATTGYTYLH ADRRGLVNQKRPQPGKRLFRALRRGELDSGTSTPATNYGSS" gene complement(156578..157600) /gene="fbpC" /locus_tag="Rv0129c" /db_xref="GeneID:886885" CDS complement(156578..157600) /gene="fbpC" /locus_tag="Rv0129c" /EC_number="2.3.1.-" /function="PROTEINS OF THE ANTIGEN 85 COMPLEX ARE RESPONSIBLE FOR THE HIGH AFFINITY OF MYCOBACTERIA TO FIBRONECTIN. POSSESSES A MYCOLYLTRANSFERASE ACTIVITY REQUIRED FOR THE BIOGENESIS OF TREHALOSE DIMYCOLATE (CORD FACTOR), A DOMINANT STRUCTURE NECESSARY FOR MAINTAINING CELL WALL INTEGRITY." /experiment="experimental evidence, no additional details recorded" /note="Rv0129c, (MT0137, MTCI5.03c), len: 340 aa. fbpC (alternate gene names: mpt45, 85C, fbpC2), secreted antigen 85c (fibronectin-binding protein C) (mycolyl transferase 85C) (EC 2.3.1.-) (see citations below), also highly similar to other Mycobacterial antigen precursors e.g. A85C_MYCLE|Q05862 antigen 85-c precursor (85c) from Mycobacterium leprae (333 aa), FASTA scores: opt: 1937, E(): 0, (81.4% identity in 333 aa overlap); etc.; mpt45; 85C; fbpC2" /codon_start=1 /transl_table=11 /product="secreted antigen 85-C FBPC (85C) (antigen 85 complex C) (AG58C) (Mycolyl transferase 85C) (fibronectin-binding protein C)" /protein_id="YP_177694.1" /db_xref="GI:57116693" /db_xref="GeneID:886885" /translation="MTFFEQVRRLRSAATTLPRRLAIAAMGAVLVYGLVGTFGGPATA GAFSRPGLPVEYLQVPSASMGRDIKVQFQGGGPHAVYLLDGLRAQDDYNGWDINTPAF EEYYQSGLSVIMPVGGQSSFYTDWYQPSQSNGQNYTYKWETFLTREMPAWLQANKGVS PTGNAAVGLSMSGGSALILAAYYPQQFPYAASLSGFLNPSEGWWPTLIGLAMNDSGGY NANSMWGPSSDPAWKRNDPMVQIPRLVANNTRIWVYCGNGTPSDLGGDNIPAKFLEGL TLRTNQTFRDTYAADGGRNGVFNFPPNGTHSWPYWNEQLVAMKADIQHVLNGATPPAA PAAPAA" gene 157847..158302 /locus_tag="Rv0130" /db_xref="GeneID:886876" CDS 157847..158302 /locus_tag="Rv0130" /function="UNKNOWN" /note="Rv0130, (MTCI5.04), len: 151 aa. Conserved hypothetical protein, most similar to AL096811|SCI30A_19 from Streptomyces coelicolor (153 aa), FASTA scores: opt: 639, E(): 0, (60.8% identity in 148 aa overlap). Also similar to NODN_RHILV|P08634 nodulation protein from Rhizobium leguminosarum bv. viciae plasmid pRL1JI (161 aa), FASTA scores: opt: 406, E(): 1e-21, (43.9% identity in 148 aa overlap; and to O30041 MONOAMINE OXIDASE REGULATORY PROTEIN (146 aa), FASTA scores: opt: 219, E(): 1.1e-08, (30.8% identity in 133 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214644.1" /db_xref="GI:15607272" /db_xref="GeneID:886876" /translation="MRTFESVADLAAAAGEKVGQSDWVTITQEEVNLFADATGDHQWI HVDPERAAAGPFGTTIAHGFMTLALLPRLQHQMYTVKGVKLAINYGLNKVRFPAPVPV GSRVRATSSLVGVEDLGNGTVQATVSTTVEVEGSAKPACVAESIVRYVA" gene complement(158315..159658) /gene="fadE1" /locus_tag="Rv0131c" /db_xref="GeneID:886874" CDS complement(158315..159658) /gene="fadE1" /locus_tag="Rv0131c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0131c, (MTCI5.05c), len: 447 aa. Probable fadE1, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. ACDS_HUMAN|P16219 acyl-CoA dehydrogenase short-chain specific precursor (412 aa), FASTA scores: opt: 522, E(): 1.4e-23, (30.1% identity in 425 aa overlap). Also highly similar to MTCI5_28 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE1" /protein_id="NP_214645.1" /db_xref="GI:15607273" /db_xref="GeneID:886874" /translation="MPVRRRAGERLPTVWDFETDPQYQSKLDWVEKFMAEELEPLDLV ALDPYDKKNADTMAILRPLQRQVKDQGLWAAHLRPELGGQGFGQVKLALLNEIIGRSR WAPSAFGCQAPDSGNAEILALFGTDEQKARYLRPLLDGEITSCYSMTEPQGGSDPGLF VTAATRDAAGNGDWIINGEKWFSTNAKHASFFIVMAVTKPEARTYEKMSLFIVPADTP GIEIVRNVGVGAESTRHASHGYIRYHDVRVPADHVLGGEGQAFMIAQTRLGGGRIHHA MRTIALARRAFDMMCERALSRQTRHGRLADLQMTQEKIADSWIQIEQFRLLVLRTAWL IDKHHDYQKVRRDIAAVKVAMPQVLHDVVQRAMHLHGALGVSDEMPFVKMMLAAESLG IADGATELHKMTVARRTLREYQPVTTLFPSQHIPTRRAHAEAWLAQRLEHAIAEF" gene complement(159700..160782) /gene="fgd2" /locus_tag="Rv0132c" /db_xref="GeneID:886877" CDS complement(159700..160782) /gene="fgd2" /locus_tag="Rv0132c" /function="CATALYZES OXIDATION OF GLUCOSE-6-PHOSPHATE TO 6-PHOSPHOGLUCONOLACTONE USING COENZYME F420 (AN *-HYDROXY-5-DEAZAFLAVIN DERIVATIVE) AS THE ELECTRON ACCEPTOR." /note="Rv0132c, (MTCI5.06c), len: 360 aa. Putative fgd2, F420-dependent glucose-6-phosphate dehydrogenase (EC 1.-.-.-), highly similar to many from Mycobacteria e.g. AAD38167|g5031431 from Mycobacterium chelonae. Also similar to MJ1534|Q58929 N5,N10-METHYLENE TETRAHYDROMETHANOPTERIN REDUCTASE from METHANOCOCCUS JANNASCHII (342 aa), FASTA scores: opt: 285, E(): 7.9e-11, (28.4% identity in 292 aa overlap). And also similar to Rv0953c, Rv0791c, etc from Mycobacterium tuberculosis. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="putative f420-dependent glucose-6-phosphate dehydrogenase Fgd2" /protein_id="NP_214646.1" /db_xref="GI:15607274" /db_xref="GeneID:886877" /translation="MTGISRRTFGLAAGFGAIGAGGLGGGCSTRSGPTPTPEPASRGV GVVLSHEQFRTDRLVAHAQAAEQAGFRYVWASDHLQPWQDNEGHSMFPWLTLALVGNS TSSILFGTGVTCPIYRYHPATVAQAFASLAILNPGRVFLGLGTGERLNEQAATDTFGN YRERHDRLIEAIVLIRQLWSGERISFTGHYFRTDELKLYDTPAMPPPIFVAASGPQSA TLAGRYGDGWIAQARDINDAKLLAAFAAGAQAAGRDPTTLGKRAELFAVVGDDKAAAR AADLWRFTAGAVDQPNPVEIQRAAESNPIEKVLANWAVGTDPGVHIGAVQAVLDAGAV PFLHFPQDDPITAIDFYRTNVLPELR" misc_feature complement(160702..160734) /gene="fgd2" /locus_tag="Rv0132c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 160869..161474 /locus_tag="Rv0133" /db_xref="GeneID:886873" CDS 160869..161474 /locus_tag="Rv0133" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0133, (MTCI5.07), len: 201 aa. Probable acetyltransferase (EC 2.3.1.-), highly similar to others e.g. PUAC_STRLP|P13249 puromycyn N-acetyltransferase (199 aa), FASTA scores: opt: 341, E(): 1.8e-16, (33.3% identity in 201 aa overlap)." /codon_start=1 /transl_table=11 /product="acetyltransferase" /protein_id="NP_214647.1" /db_xref="GI:15607275" /db_xref="GeneID:886873" /translation="MTPQARPARRADVRELSRTMARAFYDDPVMSWLLSNDNARTARL TRLFATIVRHQHLAGGGVEVARGAAGIGGAALWDPPDRWRESRRQQLAMTPGFLRVFG FRTAKARAALDVMMRVHPEEPHWYLAAIGSDPTVRGQGFGQVLMRSRLDRCDAEHCPA YLESTKPENVPYYQRFGFRVTREIALPDAGPPLWAMWREPR" gene 161771..162673 /gene="ephF" /locus_tag="Rv0134" /db_xref="GeneID:886871" CDS 161771..162673 /gene="ephF" /locus_tag="Rv0134" /function="THOUGHT TO BE INVOLVED IN DETOXIFICATION REACTIONS FOLLOWING OXIDATIVE DAMAGE TO LIPIDS [CATALYTIC ACTIVITY: AN EPOXIDE + H(2)O = A GLYCOL]." /note="Rv0134, (MTCI5.08), len: 300 aa. Possible ephE, epoxide hydrolase (EC 3.3.2.3) (see citation below), similar to others e.g. Q39856 epoxide hydrolase (341 aa), FASTA scores: opt: 369, E(): 4.6e-17, (27.2% identity in 335 aa overlap); ETC. Also similar to MTCY09F9.26c from Mycobacterium tuberculosis (29.5% identity in 346 aa overlap)." /codon_start=1 /transl_table=11 /product="epoxide hydrolase EphF" /protein_id="NP_214648.1" /db_xref="GI:15607276" /db_xref="GeneID:886871" /translation="MIALPALEGVEHRHVDVAEGVRIHVADAGPADGPAVMLVHGFPQ NWWEWRDLIGPLAADGNRVLCPDLRGAGWSSAPRSRYTKTEMADDLAAVLDGLGVAKV KLVAHDWGGPVAFIMMLRHPEKVTGFFGVNTVAPWVKRDLGMLRNMWRFWYQIPMSLP VIGPRVISDPKGRYFRLLTGWVGGGFRVPDDDVRLYLDCMREPGHAEAGSRWYRTFQT REMLRWLRGEYNDARVDVPVRWLHGTGDPVITPDLLDGYAERASDFEVELVDGVGHWI VEQRPELVLDRVRAFLAAGTEQRD" gene complement(162644..163249) /locus_tag="Rv0135c" /db_xref="GeneID:886869" CDS complement(162644..163249) /locus_tag="Rv0135c" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0135c, (MTCI5.09c), len: 201 aa. Possible transcriptional regulator, weakly similar to others e.g. P32398|YHGD_BACSU HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Bacillus subtilis (191 aa), FASTA scores: opt: 145, E(): 0.0012, (21.0% identity in 162 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214649.1" /db_xref="GI:15607277" /db_xref="GeneID:886869" /translation="MTAVAAGALVVETDSFRLRLLDGLVASIGERGYRATTVSDIVRH ARTSKRTFYDRFTSKEQCFLELLLADNETLGNSIRAAVDPNADWHDQIRQAVEAYVTH IESRPAVTLSWIREFPSLGAAAYPVQRRGMEQLTSLLIELSASPGFRRANLPPLNVPL AVILLGGLRELTALTVEDGQPIRNIVEPAVDASIALLGPRS" gene 163366..164691 /gene="cyp138" /locus_tag="Rv0136" /db_xref="GeneID:886868" CDS 163366..164691 /gene="cyp138" /locus_tag="Rv0136" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /experiment="experimental evidence, no additional details recorded" /note="Rv0136, (MT0144, MTCI5.10), len: 441 aa. Probable cyp138, cytochrome P450 138 (EC 1.14.-.-), similar to others e.g. SLR0574|Q59990 from SYNECHOCYSTIS SP. (444 aa), FASTA scores: opt: 315, E(): 1e-13, (25.7% identity in 416 aa overlap); etc. Also similar to MTV039_6 from Mycobacterium tuberculosis (472 aa), FASTA score: (38.2% identity in 442 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop); and PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 138" /protein_id="NP_214650.1" /db_xref="GI:15607278" /db_xref="GeneID:886868" /translation="MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVR RYGKAFTANILMYGRVVVVADPQLARQVFTSSPEELGNIQPNLSRMFGSGSVFALDGD DHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSMMHITLNAILR AIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRLSPWGRLAEWRRQYDTVID KLIEAERADPNFADRTDVLALMLRSTYDDGSIMSRKDIGDELLTLLAAGHETTAATLG WAFERLSRHPDVLAALVEEVDNGGHELRQAAILEVQRARTVIDFAARRVNPPVYQLGE WVIPRGYSIIINIAQIHGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAA FANMEMDVVLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR" misc_feature 163699..163722 /gene="cyp138" /locus_tag="Rv0136" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 164506..164535 /gene="cyp138" /locus_tag="Rv0136" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(164712..165260) /gene="msrA" /locus_tag="Rv0137c" /db_xref="GeneID:886865" CDS complement(164712..165260) /gene="msrA" /locus_tag="Rv0137c" /EC_number="1.8.4.11" /function="HAS AN IMPORTANT FUNCTION AS A REPAIR ENZYME FOR PROTEINS THAT HAVE BEEN INACTIVATED BY OXIDATION. CATALYZES THE REVERSIBLE OXIDATION-REDUCTION OF METHIONINE SULFOXIDE IN PROTEINS TO METHIONINE [CATALYTIC ACTIVITY: Protein L-methionine + oxidized thioredoxin = protein L-methionine S-oxide + reduced thioredoxin]." /note="this stereospecific enzymes reduces the S isomer of methionine sulfoxide while MsrB reduces the R form; provides protection against oxidative stress" /codon_start=1 /transl_table=11 /product="methionine sulfoxide reductase A" /protein_id="NP_214651.1" /db_xref="GI:15607279" /db_xref="GeneID:886865" /translation="MTSNQKAILAGGCFWGLQDLIRNQPGVVSTRVGYSGGNIPNATY RNHGTHAEAVEIIFDPTVTDYRTLLEFFFQIHDPTTKDRQGNDRGTSYRSAIFYFDEQ QKRIALDTIADVEASGLWPGKVVTEVSPAGDFWEAEPEHQDYLQRYPNGYTCHFVRPG WRLPRRTAESALRASLSPELGT" gene 165323..165826 /locus_tag="Rv0138" /db_xref="GeneID:886863" CDS 165323..165826 /locus_tag="Rv0138" /function="UNKNOWN" /note="Rv0138, (MTCI5.12), len: 167 aa. Conserved hypothetical protein, showing weak similarity to Q10827|YT10_MYCTU HYPOTHETICAL 17.0 KDA PROTEIN from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 131, E(): 0.047, (31.15% identity in 106 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214652.1" /db_xref="GI:15607280" /db_xref="GeneID:886863" /translation="MSASEFSRAELAAAFEKFEKTVARAAATRDWDCWVQHYTPDVEY IEHAAGIMRGRQRVRAWIQETMTTFPGSHMVAFPSLWSVIDESTGRIICELDNPMLDP GDGSVISATNISIITYAGNGQWCRQEDIYNPLRFLRAAMKWCRKAQELGTLDEDAARW MRRHGGP" gene 165827..166849 /locus_tag="Rv0139" /db_xref="GeneID:886860" CDS 165827..166849 /locus_tag="Rv0139" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0139, (MTCI5.13), len: 340 aa. Possible oxidoreductase (EC 1.-.-.-), similar to others e.g. O34285|HPNA HPNA PROTEIN from Zymomonas mobilis (337 aa), FASTA scores: opt: 507, E (): 5.8e-27, (31.1% identity in 328 aa overlap); TRE_STRGR|P29782 dtdp-glucose 4,6-dehydratase (328 aa), FASTA scores: opt: 254, E(): 2.6e-10, (29.0% identity in 307 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214653.1" /db_xref="GI:15607281" /db_xref="GeneID:886860" /translation="MNAPKLVIGANGFLGSHVTRQLVADCAPQKGEVRAMVRPAANTR SIDDLPLTRFHGDVFDTATVAEAMAGCDDVYYCVVDTRAWLRDPSPLFRTNVAGLRNV LDVATDASLRRFVFTSSYATVGRRRGHVATEEDRVDTRKVTPYVRSRVAAEDLVLQYA HDAGLPAVAMCVSTTYGGGDWGRTPHGAFIAGAVFGRLPFTMRGIRLEAVGVDDAARA LILAAERGRNGERYLISERMMPLQEVVRIAADEAGVPPPRWSISVPVLYALGALGSLR ARLTGKDTELSLASVRMMRSEADVDHGKAVRELGWQPRPVEESIREAARFWAAMRTVG KDPAAS" gene 166910..167290 /locus_tag="Rv0140" /db_xref="GeneID:886859" CDS 166910..167290 /locus_tag="Rv0140" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0140, (MTCI5.14), len: 126 aa. Conserved hypothetical protein, similar to others e.g. P74567|D90916_48 HYPOTHETICAL 20.8 KDP PROTEIN from Synechocystis sp. (180 aa), FASTA scores: opt: 229, E(): 4.7e-10, (36.1% identity in 108 aa overlap). Also similar to Rv1056 and Rv1670 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214654.1" /db_xref="GI:15607282" /db_xref="GeneID:886859" /translation="MSNRIVLEPSADHPITIEPTNRRVQVRVNGEVVADTAAALCLQE ASYPAVQYIPLADVVQDRLIRTETSTYCPFKGEASYYSVTTDAGDIVDDVMWTYENPY PAVAAIAGHVACYPDKAEISIFPG" gene complement(167271..167681) /locus_tag="Rv0141c" /db_xref="GeneID:886872" CDS complement(167271..167681) /locus_tag="Rv0141c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0141c, (MTCI5.15c), len: 136 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214655.1" /db_xref="GI:15607283" /db_xref="GeneID:886872" /translation="MTPFDDPQAELAWMFLQSLCEGGDLDEGFALLSNDFTYWSIVTR TELDKKTFRRAVERRKQVFEVNIELIRCVNEGETVVVEGHCDGVSADRTRYDSPFVCI FETRDGMIISLREYSDTQSLAEVYPVACATPGRC" gene 167711..168637 /locus_tag="Rv0142" /db_xref="GeneID:886858" CDS 167711..168637 /locus_tag="Rv0142" /function="UNKNOWN" /note="Rv0142, (MTCI5.16), len: 208 aa. Conserved hypothetical protein, similar, except in N-terminus, to AB88922.1|AL353862 hypothetical protein SCE34.20 from Streptomyces coelicolor (326 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214656.1" /db_xref="GI:15607284" /db_xref="GeneID:886858" /translation="MRSIDVVVEAVVTFAGAAGFAHTLAPLRRGQQDPCFRVPGDGTI WRTSLLPTGPVTARISRAGRDAARCVAWGSGAEEFVDMAPAMLGAADDASDFVPLHPA VAAAHRRLPNLRLGRTGQVLEALIPAVIEQRVPGADAFRSWRLLVSKYGTQAPGPAPP GMRVPPSAEVWRHIPSWEFHRANVDPGRARAVVGCAQRAASLERLVSLPAARAAEALT SLPGVGVWTAAETTQRVFGDADAVSVGDYHIPKMIGWTLVGRPVDDAGMLELLEPMRP HRHRVVRLLEASGLAREPRRGPRLPVQNIRAL" gene complement(168704..170182) /locus_tag="Rv0143c" /db_xref="GeneID:886856" CDS complement(168704..170182) /locus_tag="Rv0143c" /function="UNKNOWN; POSSIBLY ION CHANNEL INVOLVED IN TRANSPORT OF CHLORIDE ACROSS THE MEMBRANE." /note="Rv0143c, (MTCI5.17c), len: 492 aa. Probable conserved transmembrane protein, CIC family possibly involved in transport of chloride, similar to others and hypothetical proteins e.g. O28857 PUTATIVE CHLORIDE CHANNEL from Archaeoglobus fulgidus (589 aa), FASTA scores: opt: 966, E(): 0, (37.7% identity in 453 aa overlap); YADQ_ECOLI|P37019 hypothetical 46.0 kDa protein (436 aa), FASTA scores: opt: 452, E(): 2.4e-20, (28.0% identity in 460 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214657.1" /db_xref="GI:15607285" /db_xref="GeneID:886856" /translation="MAPGDWSVFAWHAANLPTMPEAEDIGNEAAGGRFGVSIRSAGYL RKWFLLGITIGVIAGLGAVVFYLALKYTSEFLLGYLADYQIPTPVGEGGHRGSTGFAR PWAIPLVTTGGAVLSALIVAKLAPEATGHGTDEAIESVHGDPRAIRGRAVLVKMVASA LTIGSGGSGGREGPTAQISAGFCSLLTRRLNLSNEDGRTAVALGIGAGIGAIFAAPLG GAALGASIPYRDDFDYRNLLPGFIASGTAYAVLGAFLGFDPLFGYIDAEYRFEKAWPL LWFVVIGLIAAAVGYLYARVFHASVAITRRLPGGPVLKPAIGGLLVGLLGLPIPQILS SGYGWAQLAADRGTLLSIPLWIVIVLPIAKILATSLSIGTGGSGGLFGPGIVIGAFVG AAIWRLGELTELPGVPHEPGIFVVVAMMACFGSVSRAPLAVMIMVAEMTGSFSVVPGA IIAVGIAALLLSRTNVTIYETQRLNRQTAEAERGGSDRPTTA" gene 170284..171126 /locus_tag="Rv0144" /db_xref="GeneID:886854" CDS 170284..171126 /locus_tag="Rv0144" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0144, (MTCI5.18), len: 280 aa. Probable transcriptional regulator, possibly tetR family. Has region similar to others e.g. Q59431|UIDR_ECOLI|GUSR|B1618|Z2623|ECS2326 UID OPERON REPRESSOR (GUS OPERON) from Escherichia coli strains K12 and O157:H7 (196 aa), FASTA scores: opt: 214, E(): 1.1e-06, (26.0% identity in 196 aa overlap). Contains probable helix-turn helix motif from aa 109-130 (Score 1463, +4.17 SD). COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_214658.1" /db_xref="GI:15607286" /db_xref="GeneID:886854" /translation="MPHSWTPTSVMTPPLVVAAFRPVGHYRLATDRAGGPCSPPATGA KLTSSVASRPTVGTKPQWWHTLVMSMSLTAGRGPGRPPAAKADETRKRILHAARQVFS ERGYDGATFQEIAVRADLTRPAINHYFANKRVLYQEVVEQTHELVIVAGIERARREPT LMGRLAVVVDFAMEADAQYPASTAFLATTVLESQRHPELSRTENDAVRATREFLVWAV NDAIERGELAADVDVSSLAETLLVVLCGVGFYIGFVGSYQRMATITDSFQQLLAGTLW RPPT" gene 171215..172168 /locus_tag="Rv0145" /db_xref="GeneID:886851" CDS 171215..172168 /locus_tag="Rv0145" /function="UNKNOWN" /note="Rv0145, (MTCI5.19), len: 317 aa. Conserved hypothetical protein, highly similar to many e.g. CAC32172.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (310 aa); and several Mycobacterium tuberculosis proteins e.g. Rv0726c, Rv0731c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214659.1" /db_xref="GI:15607287" /db_xref="GeneID:886851" /translation="MTELDDVSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAA TNPLIRDEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVHRVACDYQAVRTHFFDE YFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKAGILQSHGAVP TARRHAVAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPYLPGDAADRLFDMVTALSA PGSQVAVEAFTMNTKGNTQRWNRMRERLGLDIDVQALTYHEPDRSDAAQWLATHGWQV HSVSNREEMARLGRAIPQDLVDETVRTTLLRGRLVTPAQPA" gene 172211..173143 /locus_tag="Rv0146" /db_xref="GeneID:886849" CDS 172211..173143 /locus_tag="Rv0146" /function="UNKNOWN" /note="Rv0146, (MTCI5.20), len: 310 aa. Conserved hypothetical protein, highly similar to others e.g. AC30975.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (304 aa); and several Mycobacterium tuberculosis proteins e.g. Rv0726c, Rv0731c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214660.1" /db_xref="GI:15607288" /db_xref="GeneID:886849" /translation="MRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLL VTNAGAGAIWEAMLDPTLVAKAAAIDAETAAIVAYLRSYQAVRTNFFDTYFASAVAAG IRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAENGVTPSAGRREVPA DLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQDRLFTQVGAVSVAGSRIAAET APVHGEERRAEMRARFKKVADVLGIEQTIDVQELVYHDQDRASVADWLTDHGWRARSQ RAPDEMRRVGRWVEGVPMADDPTAFAEFVTAERL" gene 173238..174758 /locus_tag="Rv0147" /db_xref="GeneID:886847" CDS 173238..174758 /locus_tag="Rv0147" /EC_number="1.2.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: An aldehyde + NAD+ + H2O = an acid + NADH]." /note="Rv0147, (MTCI5.21), len: 506 aa. Probable aldehyde dehydrogenase (NAD+) dependent (EC 1.2.1.-), similar to others e.g. DHAP_RAT|P11883 aldehyde dehydrogenase (dimeric NADP-preferring) (452 aa), FASTA scores: opt: 1291, E(): 0, (43.9% identity in 453 aa overlap). Also similar to several Mycobacterium tuberculosis aledehyde dehydrogenases e.g. Rv0768, Rv2858c, etc. Contains PS00687 aldehyde dehydrogenases glutamic acid active site, and PS00070 aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase" /protein_id="NP_214661.1" /db_xref="GI:15607289" /db_xref="GeneID:886847" /translation="MSDRVKAVAPPDGRTMMTTESVARKTQKSETEAPREPAPVSDEK QTDVAKTVARLRKTFASGRTRSVEWRKQQLRALQKLMDENEDAIAAALAEDLDRNPFE AYLADIATTSAEAKYAAKRVRRWMRRRYLLLEVPQLPGRGWVEYEPYGTVLIIGAWNY PFYLTLGPAVGAIAAGNAVVLKPSEIAAASAHLMTELVYRYLDTEAIAVVQGDGAVSQ ELIAQGFDRVMFTGGTEIGRKVYEGAAPHLTPVTLELGGKSPVIVAADADVDVAAKRI AWIKLLNAGQTCVAPDYVLADATVRDELVSKITAALTKFRSGAPQGMRIVNQRQFDRL SGYLAAAKTDAAADGGGVVVGGDCDASNLRIQPTVVVDPDPDGPLMSNEIFGPILPVV TVKSLDDAIRFVNSRPKPLSAYLFTKSRAVRERVIREVPAGGMMVNHLAFQVSTAKLP FGGVGASGMGAYHGRWGFEEFSHRKSVLTKPTRPDLSSFIYPPYTERAIKVARRLF" misc_feature 173994..174017 /locus_tag="Rv0147" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" misc_feature 174078..174113 /locus_tag="Rv0147" /note="PS00070 Aldehyde dehydrogenases cysteine active site" gene 174833..175693 /locus_tag="Rv0148" /db_xref="GeneID:886845" CDS 174833..175693 /locus_tag="Rv0148" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0148, (MTCI5.22), len: 286 aa. Probable short-chain dehydrogenase (EC 1.-.-.-), similar to others, in particular Estradiol 17 beta-dehydrogenases (EC 1.1.1.62), e.g. DHB4_MOUSE|P51660 estradiol 17 beta-dehydrogenase 4 (735 aa), FASTA scores: opt: 952, E(): 0, (52.5% identity in 276 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short-chain type dehydrogenase/reductase" /protein_id="NP_214662.1" /db_xref="GI:15607290" /db_xref="GeneID:886845" /translation="MPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARDG TGAGSAMADEVVAEIRDKGGRAVANYDSVATEDGAANIIKTALDEFGAVHGVVSNAGI LRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSYGRVVVATSTSGLFGNFG QTNYGAAKLGLVGLINTLALEGAKYNIHANALAPIAATRMTQDILPPEVLEKLTPEFV APVVAYLCTEECADNASVYVVGGGKVQRVALFGNDGANFDKPPSVQDVAARWAEITDL SGAKIAGFKL" misc_feature 175283..175369 /locus_tag="Rv0148" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 175700..176668 /locus_tag="Rv0149" /db_xref="GeneID:886843" CDS 175700..176668 /locus_tag="Rv0149" /EC_number="1.6.5.-" /function="POSSIBLY BINDS NADP AND ACTS THROUGH A ONE-ELECTRON TRANSFER PROCESS. QUINONES ARE SUPPOSED TO BE THE BEST SUBSTRATES. MAY ACT IN THE DETOXIFICATION OF XENOBIOTICS [CATALYTIC ACTIVITY: NADPH + quinone = NADP+ + semiquinone]" /note="Rv0149, (MTCI5.23), len: 322 aa. Possible quinone oxidoreductase (EC 1.6.5.-), similar to others oxidoreductases e.g. Q08257 quinone oxidoreductase (EC 1.6.5.5) (329 aa), FASTA scores: opt: 397, E(): 3.2e-18, (28.4% identity in 328 aa overlap); SCHCOADH_4 from Streptomyces coelicolor. Also similar to many proteins from Mycobacterium tuberculosis. Contains PS01162 Quinone oxidoreductase / zeta-crystallin signature. BELONG TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY, QUINONE OXIDOREDUCTASE SUBFAMILY." /codon_start=1 /transl_table=11 /product="quinone oxidoreductase" /protein_id="NP_214663.1" /db_xref="GI:15607291" /db_xref="GeneID:886843" /translation="MKACVVKELSGPSGMVYTDIDEVSGDGGKVVIDVRAAGVCFPDL LLTKGEYQLKLTPPFVPGMETAGVVRSAPSDAGFHVGERVSAFGVLGGYAEQIAVPVA NVVRSPVELDDAGAVSLLVNYNTMYFALARRAALRPGDTVLVLGAAGGVGTAAVQIAK AMQAGKVIAMVHREGAIDYVASLGADVVLPLTEGWAQQVRDHTYGQGVDIVVDPIGGP TFDDALGVLAIDGKLLLIGFAAGAVPTLKVNRLLVRNISVVGVGWGEYLNAVPGSAAL FAWGLNQLVFLGLRPPPPQRYPLSEAQAALQSLDDGGVLGKVVLEP" misc_feature 176117..176170 /locus_tag="Rv0149" /note="PS01162 Quinone oxidoreductase / zeta-crystallin signature" gene complement(176665..176952) /locus_tag="Rv0150c" /db_xref="GeneID:886840" CDS complement(176665..176952) /locus_tag="Rv0150c" /function="UNKNOWN" /note="Rv0150c, (MTCI5.24c), len: 95 aa. Conserved hypothetical protein, showing some similarity with C-terminus of O53949|Rv1800|MTV049.22 PPE-FAMILY PROTEIN from Mycobacterium tuberculosis (655 aa), FASTA score: (36.5% identity in 104 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214664.1" /db_xref="GI:15607292" /db_xref="GeneID:886840" /translation="MLTLPDDRAPTGLPDPGIEALAHTKIASTISTVVADGYAVVLST ADIANSLLANAIGYPIAASVALVTPAAGANSSCWPADPSQHHRIAESRACA" gene complement(177543..179309) /gene="PE1" /locus_tag="Rv0151c" /db_xref="GeneID:886857" CDS complement(177543..179309) /gene="PE1" /locus_tag="Rv0151c" /function="UNKNOWN" /note="Rv0151c, (MTCI5.25c), len: 588 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), with N-terminal region similar to others e.g. MTV032_2 PE_PGRS family from Mycobacterium tuberculosis (468 aa), FASTA scores: opt: 1125, E(): 0, (46.3% identity in 456 aa overlap); MTCY493_24 from Mycobacterium tuberculosis FASTA score: (42.5% identity in 558 aa overlap). Also similar to upstream ORF MTCI5.26c FASTA score: (54.7% identity in 464 aa overlap). Also shows similarity to C-terminal part of some PPE family proteins e.g. MTV049_21 from Mycobacterium tuberculosis FASTA score: (41.5% identity in 591 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177695.1" /db_xref="GI:57116694" /db_xref="GeneID:886857" /translation="MAPFGFTPKARHNRGVALRSTYRLDGWVMGPVDKEGWGLSYVFA QPSVLAAAATDLAGIGSAINQATAAVAAPTTGLAAAAADEVSTALATLFGAYGQQFQA ISAQVAAFHNEFTQRLAAAANAFVNAEATNTSALVQEATAGLFKPTSPPVLPPMFNQN TAIIMGGTGSPIPTPSYVNAITTLFIDPVVSNPVVKALVTPEELYPITGVKSLPFQTS VQLGLQILDGAIWEQINAGNHVTVFGYSQSAVIASLEMQHLISLGPNAPSPSQLNFIL IGNEMNPNGGILARIPGLNVTTLGLPFYGATPDNPYPTTTYTLEYDGFADFPRYPLNV LSDINAVFGILTVHTTYADLTPAQIASATQLPTQGTTSNTYYIIETEHLPLLAPLRAI PVIGPPLAALVEPNLEVIVNLGYGDPRFGYSTSPANVPTPFGLFPDVPASVVADALVA GTQQGVNDFMVELPAALNTLPQTPMPAFPPYVPTLLPPPPPPQPATLINIADTFASVV STGYSILLPTADLGLAFVTILPAYDLTLFVNQLAAGNLRAAIELPLAATIGLAALGGM IEFIAIVVTLADITQQLQSFSI" gene complement(179319..180896) /gene="PE2" /locus_tag="Rv0152c" /db_xref="GeneID:886838" CDS complement(179319..180896) /gene="PE2" /locus_tag="Rv0152c" /function="UNKNOWN" /note="Rv0152c, (MTCI5.26c), len: 525 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), similar to ORF downstream Z92770|MTCI5_25 (588 aa), FASTA scores: opt: 1492, E(): 0, (54.7% identity in 464 aa overlap); and to many other PE family type members." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177696.1" /db_xref="GI:57116695" /db_xref="GeneID:886838" /translation="MRCRPPSRNRSAHTARNTRPCSLKSRRFTVRFHQTLAAAANSYA DAEAAIASTRQNQLAVPAAAPTPAAAAMIPPFPANLTTLFFGPTGIPLPPPSMLTPPI RCRSVRRALQAVFTPEELYPLTGVRSLVLNTSVEEGLTILHDAIMVELATTGNAVTVF GWSQSAIIASLEMQRFTAMGGAAPSASDLNFVLVGNEMNPNGGMLARFPDLTLPTLDL TFYGATPSDTIYPTAIYTLEYDGFADFSRYPLNFISDLNAVAGITFVHTKYLDLTPAQ VEGATKLPTSPGYTGVTDYYIIRTENRPLLQPLRAVPVIGDPLADLIQPNLKVIVNLG YGDPNYGYSTSYADVRTPFGLWPNVPPQVIADALAAGTQEGILDFTADLQALSAQPLT LPQIQLPQPADLVAAVAAAPTPAEVVNTLARIISTNYAVLLPTVDIALALVTTLPLYT TQLFVRQLAAGNLINAIGYPLAATVGLGTIDSGRRGIAHPPRGGLGHRSKHRGPRHLT DSRRHRRPPTTVYRPRQ" gene complement(181155..181985) /gene="ptbB" /locus_tag="Rv0153c" /db_xref="GeneID:886842" CDS complement(181155..181985) /gene="ptbB" /locus_tag="Rv0153c" /EC_number="3.1.3.48" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA DEPHOSPHORYLATION). CAN DEPHOSPHORYLATED IN VITRO THE PHOSPHOTYROSINE RESIDUE OF MYELIN BASIC PROTEIN (MBP) AT pH 7.0. COULD BE INVOLVED IN VIRULENCE BY INTERFERING WITH PHOSPHOTYROSINE-MEDIATED SIGNALS IN MACROPHAGES [CATALYTIC ACTIVITY: Protein tyrosine phosphate + H(2)O = protein tyrosine + phosphate]." /experiment="experimental evidence, no additional details recorded" /note="Rv0153c, (MTCI5.27c), len: 276 aa. ptbB (alternate gene name: MPtpB), protein-tyrosine-phosphatase (see citation below) (EC 3.1.3.48), showing some similarity to several protein-tyrosine phosphatases, polyketide synthase and aminotransferase e.g. Q05918|IPHP_NOSCO|IPH PROTEIN-TYROSINE-PHOSPHATASE PRECURSOR from Nostoc commune (EC 3.1.3.48) (294 aa), FASTA scores: opt: 150, E(): 0.0096, (26.8% identity in 269 aa overlap); etc. Supposed a secreted protein.; MPtpB" /codon_start=1 /transl_table=11 /product="phosphotyrosine protein phosphatase PTPB (protein-tyrosine-phosphatase) (PTPase)" /protein_id="NP_214667.1" /db_xref="GI:15607295" /db_xref="GeneID:886842" /translation="MAVRELPGAWNFRDVADTATALRPGRLFRSSELSRLDDAGRATL RRLGITDVADLRSSREVARRGPGRVPDGIDVHLLPFPDLADDDADDSAPHETAFKRLL TNDGSNGESGESSQSINDAATRYMTDEYRQFPTRNGAQRALHRVVTLLAAGRPVLTHC FAGKDRTGFVVALVLEAVGLDRDVIVADYLRSNDSVPQLRARISEMIQQRFDTELAPE VVTFTKARLSDGVLGVRAEYLAAARQTIDETYGSLGGYLRDAGISQATVNRMRGVLLG" gene complement(181987..183198) /gene="fadE2" /locus_tag="Rv0154c" /db_xref="GeneID:886836" CDS complement(181987..183198) /gene="fadE2" /locus_tag="Rv0154c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0154c, (MTCI5.28c), len: 403 aa. Probable fadE2, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. C-terminal region of O01590 ACYL-CoA DEHYDROGENASE (974 aa), FASTA scores: opt: 1150, E(): 0, (50.0% identity in 402 aa overlap); ACDS_MEGEL|Q06319 acyl-CoA dehydrogenase (short-chain) (383 aa), FASTA score: (35.0% identity in 306 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE2" /protein_id="NP_214668.1" /db_xref="GI:15607296" /db_xref="GeneID:886836" /translation="MSAKAIDYRTRLSDFMTEHVFGAEADYDDYRRAAGPADHTAPPI IEELKTKAKDRGLWNLFLSAESGLTNLEYAPLAEMTGWSMEIAPEALNCAAPDTGNME ILHMFGTEQQRAQWLRPLLDGKIRSAFSMTEPAVASSDARNIETTISRDGADYVINGR KWWTSGAADPRCKILIVMGRTNPDAAAHQQQSMVLVPIDTPGVTIVRSTPVFGWQDRH GHCEIDYHNVRVPATNLLGEEGSGFAIAQARLGPGRIHHCMRALGAAERALALMVNRV RNRVAFGRPLAEQGVVQQAIAQSRNEIDQARLLCEKAAWTIDQHGNKEARHLVAMIKA VAPRVACDVIDRAIQVHGAAGVSDDTPLARLYGWHRAMRIFDGPDEVHLRSIARAELS REKSTFAAAVT" gene 183622..184722 /gene="pntAa" /locus_tag="Rv0155" /db_xref="GeneID:886832" CDS 183622..184722 /gene="pntAa" /locus_tag="Rv0155" /EC_number="1.6.1.2" /function="THE TRANSHYDROGENATION BETWEEN NADH AND NADP IS COUPLED TO RESPIRATION AND ATP HYDROLYSIS AND FUNCTIONS AS A PROTON PUMP ACROSS THE MEMBRANE [CATALYTIC ACTIVITY: NADPH + NAD+ = NADP+ + NADH]." /experiment="experimental evidence, no additional details recorded" /note="Rv0155, (MTCI5.29), len: 366 aa. Probable pntAa, first part of NAD(P) transhydrogenase subunit alpha (EC 1.6.1.2), similar to N-terminus of others e.g. PNTA_ECOLI|P07001|P76888|B1603 NAD (P) transhydrogenase subunit alpha from Escherichia coli strain K12 (510 aa), FASTA scores: opt: 921, E(): 0, (42.1% identity in 361 aa overlap); PROTON-TRANSLOCATING NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT PNTAA (EC 1.6.1.1).; pntAA" /codon_start=1 /transl_table=11 /product="NAD(P) transhydrogenase subunit alpha" /protein_id="NP_214669.1" /db_xref="GI:15607297" /db_xref="GeneID:886832" /translation="MTDPQTQSTRVGVVAESGPDERRVALVPKAVASLVNRGVAVVVE AGAGERALLPDELYTAVGASIGDAWAADVVVKVAPPTAAEVGRLRGGQTLIGFLAPRN ADNSIGALTQAGVQAFALEAIPRISRAQVMDALSSQANVSGYKAVLLAASESTRFFPM LTTAAGTVKPATVLVLGVGVAGLQALATAKRLGARTTGYDVRPEVADQVRSVGAQWLD LGISASGEGGYARELTDDERAQQQKALEEAISGFDVVITTALVPGRPAPTLVTAAAVE AMKPGSVVVDLAGETGGNCELTEPGRTVVKHDVTIAAPLNLPATMPEHASELYSKNIT ALLDLLIKDGRLAPDFDDEVIAQSCVTRGKDS" gene 184723..185055 /gene="pntAb" /locus_tag="Rv0156" /db_xref="GeneID:886890" CDS 184723..185055 /gene="pntAb" /locus_tag="Rv0156" /function="THE TRANSHYDROGENATION BETWEEN NADH AND NADP IS COUPLED TO RESPIRATION AND ATP HYDROLYSIS AND FUNCTIONS AS A PROTON PUMP ACROSS THE MEMBRANE [CATALYTIC ACTIVITY: NADPH + NAD+ = NADP+ + NADH]." /note="Rv0156, (MTCI5.30), len: 110 aa. Probable pntAb, second part of NAD(P) transhydrogenase subunit alpha, integral membrane protein, similar to C-terminus of others e.g. Q59764 NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT PNTAB (139 aa), FASTA scores: opt: 247, E(): 1.9e-11, (45.5% identity in 88 aa overlap).; pntAB" /codon_start=1 /transl_table=11 /product="NAD(P) transhydrogenase subunit alpha" /protein_id="NP_214670.1" /db_xref="GI:15607298" /db_xref="GeneID:886890" /translation="MYNELLENLAILVLSGFVGFAVISKVPNTLHTPLMSGTNAIHGI VVLGALVVFGEIEHPSLVLQVILFVAVVFGTLNVIGGFIVTDRMLGMFKAKKPAVPAK PDRDEALR" gene 185052..186479 /gene="pntB" /locus_tag="Rv0157" /db_xref="GeneID:886830" CDS 185052..186479 /gene="pntB" /locus_tag="Rv0157" /EC_number="1.6.1.1" /function="THE TRANSHYDROGENATION BETWEEN NADH AND NADP IS COUPLED TO RESPIRATION AND ATP HYDROLYSIS AND FUNCTIONS AS A PROTON PUMP ACROSS THE MEMBRANE [CATALYTIC ACTIVITY: NADPH + NAD+ = NADP+ + NADH]." /note="Rv0157, (MTCI5.31), len: 475 aa. Probable pntB, pyridine nucleotide transhydrogenase (nicotinamide nucleotide transhydrogenase) subunit beta (EC 1.6.1.1), integral membrane protein, similar to others e.g. Q59763 PROTON-TRANSLOCATING NICOTINAMIDE NUCLEOTIDE TRANSHYDROGENASE SUBUNIT BETA from HODOSPIRILLUM RUBRUM (464 aa), FASTA scores: opt: 1344, E(): 0, (46.4% identity in 472 aa overlap); P07002|PNTB_ECOLI|P76890|PNTB|B1602|Z2597|ECS2308 NAD(P) TRANSHYDROGENASE SUBUNIT BETA from Escherichia coli strains K12 and O157:H7 (462 aa)." /codon_start=1 /transl_table=11 /product="NAD(P) transhydrogenase subunit beta" /protein_id="NP_214671.1" /db_xref="GI:15607299" /db_xref="GeneID:886830" /translation="MNLHYLVEILYIISFSLFIYGLMGLTGPKTAVRGNLIAAAGMTI AVAATLVMIRHTSQWPLIIAGLVVGVVLGVPPARLTKMTAMPQLVAFFNGVGGGTVAL IALSEFIDTTGFSAFQHGESPTVHIVVASLFAAIIGSISFWGSIVAFGKLQEIISGRP IGLGKAQQPINLLLLAVAVAAAVVIGLHAHPGSGGVALWWMIGLLVAAGVLGLMVVLP IGGADMPVVISMLNAMTGLSAAAAGLALNNTAMIVAGMIVGASGSILTNLMAKAMNRS IPAIVAGGFGGGGVAPSGGGDDKHVKATSAADAAIQMAYANQVIVVPGYGLAVAQAQH AVKDLATLLEDRGVPVKYAIHPVAGRMPGHMNVLLAEAEVDYDAMKDMDDINDEFART DVTIVIGANDVTNPAARNETSSPIYGMPILNVDKSRSVIVLKRSMNSGFAGIDNPLFY ADGTTMLFGDAKKSVTEVSEELKAL" gene complement(186495..186623) /locus_tag="Rv0157A" /pseudo /db_xref="GeneID:3205086" misc_feature complement(186495..186623) /locus_tag="Rv0157A" /note="Rv0157A, len: 42 aa. Hypothetical protein (probably pseudogene), showing similarity to C-terminal part (aa 186-220) of O53976|Rv1975|MTV051.13 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 173, E(): 3e-06, (62.5% identity in 40 aa overlap).;HYPOTHETICAL PROTEIN (FRAGMENT)" /pseudo /db_xref="PSEUDO:CAE55247.1" gene 186785..187429 /locus_tag="Rv0158" /db_xref="GeneID:886828" CDS 186785..187429 /locus_tag="Rv0158" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0158, (MTV032.01), len: 214 aa. Probable transcriptional regulator, possibly TetR family, showing weak similarity to various transcriptional activators and repressors e.g. P32398|YIXD_BACSU|YHGD HYPOTHETICAL TRANSCRIPTIONAL REGULATORY PROTEIN from Bacillus subtilis (191 aa), FASTA scores: opt:172, E(): 2.4e-05, (23.0% identity in 191 aa overlap). Contains helix-turn-helix motif at aa 32-53 (Score 1296, +3.60 SD). COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_214672.1" /db_xref="GI:15607300" /db_xref="GeneID:886828" /translation="MPSDTSPNGLSRREELLAVATKLFAARGYHGTRMDDVADVIGLN KATVYHYYASKSLILFDIYRQAAEGTLAAVHDDPSWTAREALYQYTVRLLTAIASNPE RAAVYFQEQPYITEWFTSEQVAEVREKEQQVYEHVHGLIDRGIASGEFYECDSHVVAL GYIGMTLGSYRWLRPSGRRTAKEIAAEFSTALLRGLIRDESIRNQSPLGTRKET" gene complement(187433..188839) /gene="PE3" /locus_tag="Rv0159c" /db_xref="GeneID:886826" CDS complement(187433..188839) /gene="PE3" /locus_tag="Rv0159c" /function="UNKNOWN" /note="Rv0159c, (MTV032.02c), len: 468 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), similar to many other PE proteins e.g. O06828 from Mycobacterium tuberculosis (528 aa), FASTA scores: opt: 1163, E(): 0, (45.8% identity in 467 aa overlap). Also highly similar to upstream MTV032_3, and to MTCI5_25, MTCI5_26, MTV049_ 21, MTCY1A10_26, etc." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177697.1" /db_xref="GI:57116696" /db_xref="GeneID:886826" /translation="MSYVIAAPEMLATTAADVDGIGSAIRAASASAAGPTTGLLAAAA DEVSSAAAALFSEYARECQEVLKQAAAFHGEFTRALAAAGAAYAQAEASNTAAMSGTA GSSGALGSVGMLSGNPLTALMMGGTGEPILSDRVLAIIDSAYIRPIFGPNNPVAQYTP EQWWPFIGNLSLDQSIAQGVTLLNNGINAELQNGHDVVVFGYSQSAAVATNEIRALMA LPPGQAPDPSRLAFTLIGNINNPNGGVLERYVGLYLPFLDMSFNGATPPDSPYQTYMY TGQYDGYAHNPQYPLNILSDLNAFMGIRWVHNAYPFTAAEVANAVPLPTSPGYTGNTH YYMFLTQDLPLLQPIRAIPFVGTPIAELIQPDLRVLVDLGYGYGYADVPTPASLFAPI NPIAVASALATGTVQGPQAALVSIGLLPQSALPNTYPYLPSANPGLMFNFGQSSVTEL SVLSGALGSVARLIPPIA" gene complement(188931..190439) /gene="PE4" /locus_tag="Rv0160c" /db_xref="GeneID:886825" CDS complement(188931..190439) /gene="PE4" /locus_tag="Rv0160c" /function="UNKNOWN" /note="Rv0160c, (MTV032.03c), len: 502 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), similar to many other PE proteins e.g. Z92770|MTCI5_26c from Mycobacterium tuberculosis (525 aa), FASTA scores: opt: 816, E(): 0, (41.4% identity in 367 aa overlap); C-terminal region of O06801|RV1768|MTCY28.34 from Mycobacterium tuberculosis (618 aa), FASTA scores: opt: 417, E(): 6.7e-18, (53.5% identity in 142 aa overlap). Also highly similar to downstream ORF MTV032_2." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177698.1" /db_xref="GI:57116697" /db_xref="GeneID:886825" /translation="MSHLVTAPDMLATAAAHVDEIASTLRAANAAAAGPTCNLLAAAG DEVSAATAALFSAYGREYQAVVKQAAAFHSEFTRTLEAAGNAYAHAEAANAARVSHAL DTINAPIRTLLGRAPLSPNGSSGAGGLPAIAQLAAESPITALIMGGTNNPLPDPEYVT DINKAFIQTLFPGAVSQGLFTPEQFWPVTPDLGNLTFNQSVTEGVALLNTAVNNQLAL DNKVVAFGYSQSATIINNYINSLMAMGSPNPDDISFVMIGSGNNPVGGLLARFPGFYI PFLDVPFNGATPANSPYPTHIYTAQYDGIAHAPQFPLRILSDINAFMGYFYVHNTYPE LMATQVDNAVPLPTSPGYTGNTQYYMFLTQDLPLLQPIRDIPYAGPPIADLFQPQLRV LVDLGYADYGPGGNYADIPTPAGLFSIPNPFAVTYYLIKGSLQAPYGAIVEIGVEAGL IGPEWFPDSYPWVPSINPGLNFYFGQPQVTLLSLMSGGLGNILHLIPPPVFT" gene 190607..191956 /locus_tag="Rv0161" /db_xref="GeneID:886835" CDS 190607..191956 /locus_tag="Rv0161" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0161, (MTCI28.01, MTV032.04), len 449 aa. Possible oxidoreductase (EC 1.-.-.-), similar to hypothetical proteins and various oxidoreductases e.g. AIP2_YEAST|P46681 actin interacting protein 2 (530 aa), FASTA scores: opt: 356, E (): 0, (33.3% identity in 357 aa overlap); DLD1_YEAST|P32891 d-lactate dehydrogenase (cytochrome) (587 aa), FASTA scores: opt: 311, E(): 2.5e-20, (27.9% identity in 366 aa overlap). Also similar to other Mycobacteria proteins e.g. MTCY339.30c from Mycobacterium tuberculosis FASTA score: (29.4% identity in 357 aa overlap); MLCL622.30c from Mycobacterium tuberculosis (449 aa)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214675.1" /db_xref="GI:15607303" /db_xref="GeneID:886835" /translation="MLTSLVSAVGSHHVTTDPDVLAGRSVDHTGRYRGRASALVRPGS AEEVAEVLRVCRDAGAYVTVQGGRTSLVAGTVPEHDDVLLSTERLCVVSDVDTVERRI EIGAGVTLAAVQHAASTAGLVFGVDLSARDTATVGGMASTNAGGLRTVRYGNMGEQVV GLDVALPDGTVLRRHSRVRRDNTGYDLPALFVGAEGTLGVITALDLRLHPTPSHRVTA VCGFAELAALVDAGRMFRDVEGIAALELIDGRAAALTREHLGVRPPVEADWLLLVELA ADHDQTDRLADLLGGARMCGEPAVGVDAAAQQRLWRTRESLAEVLGVYGPPLKFDVSL PLSAISGFARDAVALVHRHVPDSPEALPLLFGHIGEGNLHLNVLRCPPDREPALYAKM MGLIAECGGNVSSEHGVGSRKRAYLGMSRQANDVAAMRRVKAALDPTGYLNAAVLFD" gene complement(191984..193135) /gene="adhE1" /locus_tag="Rv0162c" /db_xref="GeneID:886824" CDS complement(191984..193135) /gene="adhE1" /locus_tag="Rv0162c" /EC_number="1.1.1.1" /function="DEHYDROGENESES A ALCOHOL (OXIDO-REDUCTION) [CATALYTIC ACTIVITY: An alcohol + NAD+ = an aldehyde or ketone + NADH]." /note="Rv0162c, (MTCI28.02c), len: 383 aa. Probable adhE1, zinc-type alcohol dehydrogenase (EC 1.1.1.1), similar to others e.g. ADH_MACMU|P28469 alcohol dehydrogenase alpha chain (374 aa), FASTA scores: opt: 619, E(): 0, (34.7% identity in 363 aa overlap). Also similar to other alcohol dehydrogenases from Mycobacterium tuberculosis e.g. MTCY369.06c FASTA score: (34.0% identity in 365 aa overlap), MTV022_9 FASTA score: (35.0% identity in 371 aa overlap). Contains PS00059 Zinc-contain ingalcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY, CLASS-I SUBFAMILY. COFACTOR: ZINC." /codon_start=1 /transl_table=11 /product="zinc-type alcohol dehydrogenase E subunit" /protein_id="YP_177699.1" /db_xref="GI:57116698" /db_xref="GeneID:886824" /translation="MPAVQPWLYSNMPAIRGAVLDQIGVPRPYWRSKPISVVELHLDP PDRGEVLVRIEAAGVCHSDLSVVDGTRVRPVPILLGHEAAGIVEQVGDGVDGVAVGQR VVLVFLPRCGQCAACATDGRTPCEPGSAANKAGTLLGGGIRLSRGGRPVYHHLGVSGF ATHVVVNRASVVPVPHEVPPTVAALLGCAVLTGGGAVLNVGDPQPGQSVAVVGLGGVG MAAVLTALTYTDVRVVAVDQLPEKLSAAKALGAHEIYTPQQATAGGVKAAVVVEAVGH PAALHTAIGLTAPGGRTITVGLPPPDVRISLSPLDFVTEGRSLIGSYLGSAVPSHDIP RFVSLWQSGRLPVESLVTSTIRLDDINEAMDHLADGIAVRQLISFTGDL" misc_feature complement(192854..192898) /gene="adhE1" /locus_tag="Rv0162c" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene 193117..193572 /locus_tag="Rv0163" /db_xref="GeneID:886821" CDS 193117..193572 /locus_tag="Rv0163" /function="UNKNOWN" /note="Rv0163, (MTCI28.03), len: 151 aa. Conserved hypothetical protein, similar to others e.g. Q44017 HYPOTHETICAL 16.6 KDA PROTEIN IN GBD 5'REGION (ORF6)from Alcaligenes eutrophus (145 aa), FASTA scores: opt: 155, E(): 0.0002, (26.6% identity in 139 aa overlap). Also weak similary with MTV008.31c|Rv2475c|B70867 from Mycobacterium tuberculosis (138 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214677.1" /db_xref="GI:15607305" /db_xref="GeneID:886821" /translation="MAALPAPEKLLRSDFPVLWPVGTRWADNDMFGHLNNAVYYQLFD TAINAWINTSTGVDPLAMPVLGIVAESGCRYFSELRFPESLMVGLAVTRLGRSSVTYR LGVFKEPDDAGVITALGHWVHVYVDRTSRRPVPIPEAIRSLLSTACVSG" gene 193626..194111 /gene="TB18.5" /locus_tag="Rv0164" /db_xref="GeneID:886267" CDS 193626..194111 /gene="TB18.5" /locus_tag="Rv0164" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0164, (MTCI28.04), len: 161 aa. TB18.5, conserved hypothetical protein, equivalent to CAB08818.1|Z95398 HYPOTHETICAL PROTEIN from Mycobacterium leprae (156 aa) FASTA scores: opt: 762, E(): 0, (76.3% identity in 152 aa overlap). Some similarity to Rv2185c, Rv0854, Rv0857 from Mycobacterium tuberculosis. Alternative start codon has been suggested. 3' part corrected since first submission (-24 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177617.1" /db_xref="GI:57116699" /db_xref="GeneID:886267" /translation="MTAISCSPRPRYASRMPVLSKTVEVTADAASIMAIVADIERYPE WNEGVKGAWVLARYDDGRPSQVRLDTAVQGIEGTYIHAVYYPGENQIQTVMQQGELFA KQEQLFSVVATGAASLLTVDMDVQVTMPVPEPMVKMLLNNVLEHLAENLKQRAEQLAA S" gene complement(194144..194938) /locus_tag="Rv0165c" /db_xref="GeneID:886818" CDS complement(194144..194938) /locus_tag="Rv0165c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0165c, (MTCI28.05c), len: 264 aa. Possible transcriptional regulator, GntR family, showing some similarity to several e.g. NTRA_CHELE|P54988 nta operon transcriptional regulator (231 aa), FASTA scores: opt: 154, E(): 0.00058, (32.0% identity in 125 aa overlap); P46833|GNTR_BACLI GLUCONATE OPERON TRANSCRIPTIONAL REPRESSOR from Bacillus licheniformis (243 aa); GNTR_BACSU GLUCONATE OPERON REPRESSOR from Bacillus subtilis (243 aa). Also similar to Rv0043c from Mycobacterium tuberculosis. SEEMS TO BELONG TO THE GNTR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="YP_177700.1" /db_xref="GI:57116700" /db_xref="GeneID:886818" /translation="MIKHDVVWVTLWPERPNNKPPPSPRQVPGNPGPTLKVLASHVNA PLSAKPRSQLPLRRAQLSDEVAGHLRAAIMSGALRSGTFIRLDETAAELGVSVTPVRE ALLKLRGEGMVGLEPHRGHVVLPLTRQDIDDIFWLQATIAQELATSATAHITDVEIDE LDRINNALAGAIGSGDAKTIASIEFAFHRVFNKASRRIKLAWFLLNAARYMGAGVRGR PAMGRGRGEQSSAADRRAAPPRHSRRNRAHRLAVHRWGTQADGGPG" gene 194993..196657 /gene="fadD5" /locus_tag="Rv0166" /db_xref="GeneID:886822" CDS 194993..196657 /gene="fadD5" /locus_tag="Rv0166" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--CoA ligase" /protein_id="NP_214680.1" /db_xref="GI:15607307" /db_xref="GeneID:886822" /translation="MTAQLASHLTRALTLAQQQPYLARRQNWVNQLERHAMMQPDAPA LRFVGNTMTWADLRRRVAALAGALSGRGVGFGDRVMILMLNRTEFVESVLAANMIGAI AVPLNFRLTPTEIAVLVEDCVAHVMLTEAALAPVAIGVRNIQPLLSVIVVAGGSSQDS VFGYEDLLNEAGDVHEPVDIPNDSPALIMYTSGTTGRPKGAVLTHANLTGQAMTALYT SGANINSDVGFVGVPLFHIAGIGNMLTGLLLGLPTVIYPLGAFDPGQLLDVLEAEKVT GIFLVPAQWQAVCTEQQARPRDLRLRVLSWGAAPAPDALLRQMSATFPETQILAAFGQ TEMSPVTCMLLGEDAIAKRGSVGRVIPTVAARVVDQNMNDVPVGEVGEIVYRAPTLMS CYWNNPEATAEAFAGGWFHSGDLVRMDSDGYVWVVDRKKDMIISGGENIYCAELENVL ASHPDIAEVAVIGRADEKWGEVPIAVAAVTNDDLRIEDLGEFLTDRLARYKHPKALEI VDALPRNPAGKVLKTELRLRYGACVNVERRSASAGFTERRENRQKL" misc_feature 195554..195589 /gene="fadD5" /locus_tag="Rv0166" /note="PS00455 Putative AMP-binding domain signature" gene 196861..197658 /gene="yrbE1A" /locus_tag="Rv0167" /db_xref="GeneID:886816" CDS 196861..197658 /gene="yrbE1A" /locus_tag="Rv0167" /function="UNKNOWN" /note="Rv0167, (MTCI28.07), len: 265 aa. yrbE1A, hypothetical unknown integral membrane protein, part of mce1 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07791|Rv0587|MTCY19H5.35|yrbE2A (265 aa); O53965|Rv1964|MTV051.02|yrbE3A (265 aa); etc. Also highly similar or similar to conserved hypothetical integral membrane proteins of yrbEA type, e.g. NP_302654.1|NC_002677 conserved membrane protein from Mycobacterium leprae (267 aa); P45030|YRBE_HAEIN|HI1086 hypothetical protein from Haemophilus influenzae (261 aa), FASTA scores: opt: 328, E(): 1.8e-15, (26.6% identity in 244 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein YRBE1A" /protein_id="NP_214681.1" /db_xref="GI:15607308" /db_xref="GeneID:886816" /translation="MTTSTTLGGYVRDQLQTPLTLVGGFFRMCVLTGKALFRWPFQWR EFILQCWFIMRVGFLPTIMVSIPLTVLLIFTLNILLAQFGAADISGSGAAIGAVTQLG PLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHRLVVPRVLASMLVATLL NGLVITVGLVGGFLFGVYLQNVSGGAYLATLTLITGLPEVVIATIKAATFGLIAGLVG CYRGLTVRGGSKGLGTAVNETVVLCVIALFAVNVILTTIGVRFGTGR" gene 197660..198529 /gene="yrbE1B" /locus_tag="Rv0168" /db_xref="GeneID:886812" CDS 197660..198529 /gene="yrbE1B" /locus_tag="Rv0168" /function="UNKNOWN" /note="Rv0168, (MTCI28.08), len: 289 aa. yrbE1B, hypothetical unknown integral membrane protein, part of mce1 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07790|Rv0588|MTCY19H5.34|yrbE2B (295 aa); O53966|Rv1965|MTV051.03|yrbE3B (271 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEB type, e.g. NP_302655.1|NC_002677 conserved membrane protein from Mycobacterium leprae (289 aa); P45030|YRBE_HAEIN|HI1086 hypothetical protein from Haemophilus influenzae (261 aa), FASTA scores: opt: 223, E(): 7.6e-07, (23.7% identity in 257 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein YRBE1B" /protein_id="NP_214682.1" /db_xref="GI:15607309" /db_xref="GeneID:886812" /translation="MSTAAVLRARFPRAVANLRQYGGAAARGLDEAGQLTWFALTSIG QIAHALRYYRKETLRLIAQIGMGTGAMAVVGGTVAIVGFVTLSGSSLVAIQGFASLGN IGVEAFTGFFAALINVRIAGPVVTGVALAATVGAGATAELGAMRISEEIDALEVMGIK SISFLASTRIMAGLVVIIPLYALAMIMSFLSPQITTTVLYGQSNGTYEHYFQTFLRPD DVFWSFLEALIITAIVMVSHCYYGYAAGGGPVGVGEAVGRSMRFSLVSVQVVVLFAAL ALYGVDPNFNLTV" gene 198534..199898 /gene="mce1A" /locus_tag="Rv0169" /db_xref="GeneID:886823" CDS 198534..199898 /gene="mce1A" /locus_tag="Rv0169" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION (ENTRY AND SURVIVAL INSIDE MACROPHAGES)." /experiment="experimental evidence, no additional details recorded" /note="Rv0169, (MTCI28.09), len: 454 aa. mce1A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07789|MCE2|Rv0589|MTCY19H5.33c|mce2A (404 aa); O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa); etc. Also highly similar to others e.g. AAD52105.1|AF113402_1|AF113402 mycobacterial cell entry protein from Mycobacterium bovis BCG (454 aa); NP_302656.1|NC_002677 putative cell invasion protein from Mycobacterium leprae (441 aa); AAA92845.1|U26018 mce gene product from Mycobacterium avium (88 aa) (similarity on C-terminus); CAC12798.1|AL445327 putative secreted protein from Streptomyces coelicolor (418 aa); etc. Note that equivalent, but longer 22 aa, to P72013|CAA50257.1|X70901 Mcep protein from Mycobacterium tuberculosis (432 aa). Contains a very hydrophobic region around residues 20-35. Note that previously known as mce1.; mce1" /codon_start=1 /transl_table=11 /product="MCE-family protein MCE1A" /protein_id="YP_177701.1" /db_xref="GI:57116701" /db_xref="GeneID:886823" /translation="MTTPGKLNKARVPPYKTAGLGLVLVFALVVALVYLQFRGEFTPK TQLTMLSARAGLVMDPGSKVTYNGVEIGRVDTISEVTRDGESAAKFILDVDPRYIHLI PANVNADIKATTVFGGKYVSLTTPKNPTKRRITPKDVIDVRSVTTEINTLFQTLTSIA EKVDPVKLNLTLSAAAEALTGLGDKFGESIVNANTVLDDLNSRMPQSRHDIQQLAALG DVYADAAPDLFDFLDSSVTTARTINAQQAELDSALLAAAGFGNTTADVFDRGGPYLQR GVADLVPTATLLDTYSPELFCTIRNFYDADPLAKAASGGGNGYSLRTNSEILSGIGIS LLSPLALATNGAAIGIGLVAGLIAPPLAVAANLAGALPGIVGGAPNPYTYPENLPRVN ARGGPGGAPGCWQPITRDLWPAPYLVMDTGASLAPYNHMEVGSPYAVEYVWGRQVGDN TINP" gene 199895..200935 /gene="mce1B" /locus_tag="Rv0170" /db_xref="GeneID:886810" CDS 199895..200935 /gene="mce1B" /locus_tag="Rv0170" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv0170, (MTCI28.10), len: 346 aa. mce1B (alternate gene name: mceD); belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07788|Rv0590|MTCY19H5.32c|mce2B (275 aa); O53968|Rv1967|MTV051.05|mce3B (342 aa); etc. Also highly similar to others e.g. NP_302657.1|NC_002677 putative secreted protein from Mycobacterium leprae (346 aa); CAC12797.1|AL445327 putative secreted protein from Streptomyces coelicolor (354 aa); etc. Contains hydrophobic region in N-terminal 30 residues. In Escherichia coli, N-terminal part is functional and directs export of a leaderless beta-lactamase into the periplasm (see Chubb et al., 1998).; mceD" /codon_start=1 /transl_table=11 /product="MCE-family protein MCE1B" /protein_id="NP_214684.1" /db_xref="GI:15607311" /db_xref="GeneID:886810" /translation="MKITGTVVKLGIVSVVLLFFTVMIIVIFGQMRFDRTNGYTAEFS NVSGLRQGQFVRASGVEIGKVKALHLVDGGRRVRVEFNIDRSVPLYQSTTAQIRYSDL IGNRYVELKRGEGKGANDLLPPGGLIPLSRTSPALDLDALIGGFKPVFRALDPAKVNN IANALITVFQGQGGTINDILDQTAQLTSQIAERDQAIGEVVKNLNIVLDTTVKHRKEF DETVNNLENLITGLRNHSDQLAGGLAHISNGAGTVADLLAENRTLVRKAVSYLDAIQQ PVIDQRVELDDLLHKTPTALTALGRANGTYGDFQNFYLCDLQIKWNGFQAGGPVRTVK LFSQPTGRCTPQ" gene 200932..202479 /gene="mce1C" /locus_tag="Rv0171" /db_xref="GeneID:886808" CDS 200932..202479 /gene="mce1C" /locus_tag="Rv0171" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv0171, (MTCI28.11), len: 515 aa. mce1C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07787|Rv0591|MTCY19H5.31|mce2C (481 aa); O53969|Rv1968|MTV051.06|mce3C (410 aa); etc. Also highly similar to others e.g. NP_302658.1|NC_002677 putative secreted protein from Mycobacterium leprae (519 aa); CAC12796.1|AL445327 putative secreted protein from Streptomyces coelicolor (351 aa); etc. Weakly similar to downstream ORF Rv0172|MTCI28.12|mce1D (530 aa), FASTA score: (24.6% identity in 552 aa overlap). Contains possible signal sequence and highly proline-rich C-terminus." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE1C" /protein_id="NP_214685.1" /db_xref="GI:15607312" /db_xref="GeneID:886808" /translation="MRTLEPPNRMRIGLMGIVVALLVVAVGQSFTSVPMLFAKPSYYG QFTDSGGLHKGDRVRIAGLGVGTVEGLKIDGDHIVVKFSIGTNTIGTESRLAIRTDTI LGRKVLEIEPRGAQALPPGGVLPVGQSTTPYQIYDAFFDVTKAASGWDIETVKRSLNV LSETVDQTYPHLSAALDGVAKFSDTIGKRDEQITHLLAQANQVASILGDRSEQVDRLL VNAKTLIAAFNERGRAVDALLGNISAFSAQVQNLINDNPNLNHVLEQLRILTDLLVDR KEDLAETLTILGRFSASFGETFASGPYFKVLLANLVPGQILQPFVDAAFKKRGISPED FWRSAGLPAYRWPDPNGTRFPNGAPPPPPPVLEGTPEHPGPAVPPGSPCSYTPPADGL PRPWDPLPCANLTQGPFGGPDFPAPLDVATSPPNPDGPPPAPGLPIAGRPGEVPPNVP GTPVPIPQEAPPGARTLPLGPAPGPAPPPAAPGPPAPPGPGPQLPAPFINPGGTGGSG VTGGSEN" gene 202476..204068 /gene="mce1D" /locus_tag="Rv0172" /db_xref="GeneID:886807" CDS 202476..204068 /gene="mce1D" /locus_tag="Rv0172" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv0172, mce1D (MTCI28.12), len: 530 aa. mce1D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07786|Rv0592|MTCY19H5.30c|mce2D (508 aa); O53970|Rv1969|MTV051.07|mce3D (423 aa); etc. Also highly similar to others e.g. NP_302659.1|NC_002677 putative secreted protein from Mycobacterium leprae (531 aa); CAC12795.1|AL445327 putative secreted protein from Streptomyces coelicolor (337 aa); etc. Hydrophobic region at N-terminus." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE1D" /protein_id="NP_214686.1" /db_xref="GI:15607313" /db_xref="GeneID:886807" /translation="MSTIFDIRNLRLPQLSRASVVIGSLVVVLALAAGIVGVRLYQKL TNNTVVAYFTQANALYVGDKVQIMGLPVGSIDKIEPAGDKMKVTFHYQNKYKVPANAS AVILNPTLVASRNIQLEPPYRGGPVLADNAVIPVERTQVPTEWDELRDSVSHIIDELG PTPEQPKGPFGEVIEAFADGLAGKGKQINTTLNSLSQALNALNEGRGDFFAVVRSLAL FVNALHQDDQQFVALNKNLAEFTDRLTHSDADLSNAIQQFDSLLAVARPFFAKNREVL THDVNNLATVTTTLLQPDPLDGLETVLHIFPTLAANINQLYHPTHGGVVSLSAFTNFA NPMEFICSSIQAGSRLGYQESAELCAQYLAPVLDAIKFNYFPFGLNVASTASTLPKEI AYSEPRLQPPNGYKDTTVPGIWVPDTPLSHRNTQPGWVVAPGMQGVQVGPITQGLLTP ESLAELMGGPDIAPPSSGLQTPPGPPNAYDEYPVLPPIGLQAPQVPIPPPPPGPDVIP GPVPPTPAPVGAPLPAEAGGGQ" gene 204065..205237 /gene="lprK" /locus_tag="Rv0173" /db_xref="GeneID:886804" CDS 204065..205237 /gene="lprK" /locus_tag="Rv0173" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv0173, (MTCI28.13), len: 390 aa. Possible lprK (alternate gene name: mce1E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07785|LPRL|Rv0593|MTCY19H5.29|mce2E (402 aa); O53971|LPRM|Rv1970|MTV051.08|mce3E (377 aa); etc. Also highly similar to others e.g. NP_302660.1|NC_002677 putative lipoprotein from Mycobacterium leprae (392 aa); CAC12794.1|AL445327 putative secreted protein from Streptomyces coelicolor (413 aa); etc. Contains PS00013 prokaryotic membrane lipoprotein lipid attachment site.; mce1E" /codon_start=1 /transl_table=11 /product="MCE-family lipoprotein LprK" /protein_id="NP_214687.1" /db_xref="GI:15607314" /db_xref="GeneID:886804" /translation="MMSVLARMRVMRHRAWQGLVLLVLALLLSSCGWRGISNVAIPGG PGTGPGSYTIYVQMPDTLAINGNSRVMVADVWVGSIRAIKLKNWVATLTLSLKKDVTL PKNATAKIGQTSLLGSQHVELAAPPDPSPVPLKDGDTIPLKRSSAYPTTEQTLASIAT LLRGGGLVNLEGIQQEINAIVTGRADQIRAFLGKLDTFTDELNQQRDDITRAIDSTNR LLAYVGGRSEVLNRVLTDLPPLIKHFADKQELLINASDAVGRLSQSADQYLSAARGDL HQDLQALQCPLKELRRAAPYLVGALKLILTQPFDVDTVPQLVRGDYMNLSLTLDLTYS AIDNAFLTGTGFSGALRALEQSFGRDPETMIPDIRYTPNPNDAPGGPLVERGNRQC" misc_feature 204125..204157 /gene="lprK" /locus_tag="Rv0173" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 205231..206778 /gene="mce1F" /locus_tag="Rv0174" /db_xref="GeneID:886820" CDS 205231..206778 /gene="mce1F" /locus_tag="Rv0174" /function="UNKNOWN, BUT THOUGHT INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv0174, (MTCI28.14), len: 515 aa. mce1F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), similar to Mycobacterium tuberculosis proteins O07784|Rv0594|MTCY19H5.28c|mce2F (516 aa); O53972|Rv1971|MTV051.09|mce3F (437 aa); etc. Also highly similar to others e.g. NP_302661.1|NC_002677 putative secreted protein from Mycobacterium leprae (516 aa); AAF74993.1|AF143400_1|AF143400|996A027a protein from Mycobacterium avium (80 aa) (similarity on C-terminus); CAC12793.1|AL445327 putative secreted protein from Streptomyces coelicolor (433 aa); etc. Has hydrophobic stretch, possibly a signal peptide at the N-terminus." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE1F" /protein_id="NP_214688.1" /db_xref="GI:15607315" /db_xref="GeneID:886820" /translation="MLTRFIRRQLILFAIVSVVAIVVLGWYYLRIPSLVGIGQYTLKA DLPASGGLYPTANVTYRGITIGKVTAVEPTDQGARVTMSIASNYKIPVDASANVHSVS AVGEQYIDLVSTGAPGKYFSSGQTITKGTVPSEIGPALDNSNRGLAALPTEKIGLLLD ETAQAVGGLGPALQRLVDSTQAIVGDFKTNIGDVNDIIENSGPILDSQVNTGDQIERW ARKLNNLAAQTATRDQNVRSILSQAAPTADEVNAVFSGVRDSLPQTLANLEVVFDMLK RYHAGVEQLLVFLPQGAAIAQTVLTPTPGAAQLPLAPAINYPPPCLTGFLPASEWRSP ADTSPRPLPSGTYCKIPQDAQLQVRGARNIPCVDVLGKRAATPKECRSKDPYVPLGTN PWFGDPNQILTCPAPGARCDQPVKPGLVIPAPSINTGLNPAPADQVQGTPPPVSDPLQ RPGSGTVQCNGQQPNPCVYTPTSGPSAVYSPASGELVGPDGVKYAVANSSTTGDDGWK EMLAPAS" repeat_region 206812..206850 /note="39 bp direct repeat 1, AGGTGAAGGCGGCGGATTCGGCGGAATCTGACGCCGGAG" gene 206814..207455 /locus_tag="Rv0175" /db_xref="GeneID:886801" CDS 206814..207455 /locus_tag="Rv0175" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0175, (MTCI28.15), len: 213 aa. Probable conserved Mce-associated membrane protein, equivalent, but longer in N-terminus, to CAC32127.1|AL583926 possible membrane protein from Mycobacterium leprae (182 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv0177, Rv1973, etc. Contains two 12 residue direct repeats at N-terminus." /codon_start=1 /transl_table=11 /product="mce associated membrane protein" /protein_id="NP_214689.1" /db_xref="GI:15607316" /db_xref="GeneID:886801" /translation="MKAADSAESDAGADQTGPQVKAADSAESDAGELGEDACPEQALV ERRPSRLRRGWLVGIAATLLALAGGLGAAGYFALRSHQESQSIAREDLAAIEAAKDCV AATQAPDAGAMSASMQKIIECGTGDFGAQASLYTSMLVEAYQAASVHVQVTDMRAAVE RNNNDGSVDVLVALRVKVSNTDSDAHEVGYRLRVRMALDEGRYKIAKLDQVTK" repeat_region 206869..206907 /note="39 bp direct repeat 2, AGGTGAAGGCGGCGGATTCGGCGGAATCTGACGCCGGAG" gene 207452..208420 /locus_tag="Rv0176" /db_xref="GeneID:886799" CDS 207452..208420 /locus_tag="Rv0176" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0176, (MTCI28.16), len: 322 aa. Probable conserved Mce-associated transmembrane protein. Contains short region of similarity to PRA_MYCLE|P41484 proline-rich antigen (36 kDa antigen) from Mycobacterium leprae (249 aa) (outside the proline-rich region), FASTA scores: opt: 165, E(): 2.9e-05, (40.0% identity in 65 aa overlap). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv0177, Rv3493c, etc." /codon_start=1 /transl_table=11 /product="mce associated transmembrane protein" /protein_id="NP_214690.1" /db_xref="GI:15607317" /db_xref="GeneID:886799" /translation="MTVVVEKTPTTLPQATPNGAAPWHVRAGAFAIDVLPGLAVAATM ALTALTVPPGSAWRWLCACLLGLTILLLAVNRLLLPTITGWSLGRALTGIRVVRRDGS AIGPWRLLVRDLAHLVDTLSLFVGWLWPLWDSRRRTFADLLLRTEVRRVEPVQRPAVI RRLTAAVALAAAGACASATAVGAAVVYVNEWQTDHTRAQLATRGPKLVVDVLSYDPET VQRDFERARSLATDRYRPQLSIQQDSVRESGPVRNQYWVTDSAVLSATPAQATMLLFM QGERGTPPNQRYIQSTVRAIFQKSRGQWRLDDLAVVMKPRQPTGEK" gene 208417..208971 /locus_tag="Rv0177" /db_xref="GeneID:886795" CDS 208417..208971 /locus_tag="Rv0177" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0177, (MTCI28.17), len: 184 aa. Probable conserved Mce-associated protein, equivalent to CAC32129.1|AL583926 conserved membrane protein from Mycobacterium leprae (184 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv1973, Rv3493c, etc." /codon_start=1 /transl_table=11 /product="MCE associated protein" /protein_id="NP_214691.1" /db_xref="GI:15607318" /db_xref="GeneID:886795" /translation="MSPRRKFEPGEGALLAPQSIEPSRRWGLPLALTASAVVMAAAIS ACALMRISHESHQRAAHKDIVMLSDVRSFMTMFTSPDPFHANEYAERVLSHATGDFAK QYHERANDILIRISGVEPTTGTVLDAGVQRWNEDGSANVLVVTQITSKSADGKRVVSN ANRWLVTAKQEGNEWKISSLLPVI" gene 208938..209672 /locus_tag="Rv0178" /db_xref="GeneID:886814" CDS 208938..209672 /locus_tag="Rv0178" /function="UNKNOWN" /note="Rv0178, (MTCI28.18), len: 244 aa. Probable conserved Mce-associated membrane protein, highly similar in C-terminus to CAC32130.1|AL583926 putative secreted protein from Mycobacterium leprae (184 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv0177, Rv1973, etc. Note that there is a 10 aa overlap with the upstream ORF." /codon_start=1 /transl_table=11 /product="mce associated membrane protein" /protein_id="NP_214692.1" /db_xref="GI:15607319" /db_xref="GeneID:886814" /translation="MEDQQSASGDLTQKSVANGESTDTASAATEGHRGEIDAAGEPDE RGAAVADSQADEDDSAATAARGGKTRARRSRGRRLAITVGVAAALFVGSAAFAGATVE PYLSERAVVATKLMVARTAANAITTLWTYTPENMDTLADRAANYLSGDFAAQYRRFVD QIAAANKQAKITNDTEVTGAAVESLSGRDAVAIVYTNTTTTSPVTKNIPALKYLSYRL FMKRYDARWLVTRMTTITSLDLTPQV" gene complement(209703..210812) /gene="lprO" /locus_tag="Rv0179c" /db_xref="GeneID:886796" CDS complement(209703..210812) /gene="lprO" /locus_tag="Rv0179c" /function="UNKNOWN" /note="Rv0179c, (MTCI28.19c), len: 369 aa. Possible lprO, lipoprotein (visibly not conserved). Contains possible N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LprO" /protein_id="NP_214693.1" /db_xref="GI:15607320" /db_xref="GeneID:886796" /translation="MWIRAERVAVLTPTASLRRLTACYAALAVCAALACTTGQPAARA ADGREMLAQAIATTRGSYLVYNFGGGHPMPLLNAGGHWYEMNNGGHLMIIKNASQRLS PHLLVDTHTGDQARCEHNPGARTGEGLWQASEIYPPLKAWQRMGRPTIAVNANFFDVR GQKGGSWRSTGCSSPLGAYVDNTRGQGRANQAVTGTVAYAGKQGLSGGNELWSSLTTM ILPVGGAPYVLRPKSRQDYDLATPVIEDLLNKNARFVAVAGIGLLSPGNTGQLHDGGP SAARTALAYAKQKDEMYIFQGGNYTPDNIQDLFRGLGSDTAILLDGGGSSAIVLRRDT GGMWAGAGSPKGSCDTRQVLCDSHERALPSWLAFN" misc_feature complement(210708..210740) /gene="lprO" /locus_tag="Rv0179c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(210892..212250) /locus_tag="Rv0180c" /db_xref="GeneID:886792" CDS complement(210892..212250) /locus_tag="Rv0180c" /function="UNKNOWN" /note="Rv0180c, (MTCI28.20c), len: 452 aa. Probable conserved transmembrane protein, equivalent to CAC32132.1|AL583926 probable conserved membrane protein from Mycobacterium leprae (465 aa). Shows some similarity with others membrane proteins e.g. AL096849|SCI11_29 from Streptomyces coelicolor (354 aa), FASTA scores: opt: 190, E(): 0.00067, (25.9% identity in 409 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214694.1" /db_xref="GI:15607321" /db_xref="GeneID:886792" /translation="MSQAQPRPAAPNPKRNVKAIRTVRFWMAPIATTLALMSALAALY LGGILNPMTNLRHFPIALVNEDAGPAGQQIVDGLVSGLDKNKFDIRVVSPDEARRLLD TAAVYGSALIPPTFSSQLRDFGASAVTPTRTDRPAITISTNPRAGTLAASIAGQTLTR ALTVVNGKVGERLTAEVAAQTGGVALAGAAAAGLASPIDVKSTAYNPLPNGTGNGLSA FYYALLLLLAGFTGSIVVSTLVDSMLGYVPAEFGPVYRFAEQVNISRFRTLLVKWAVM VVLALLTSGVYLAIAHGLGMPIPLGWQVWLYGVFAIIAVGVTSSSLIAVLGSMGLLVS MLIFVILGLPSAGATVPLEAVPAFFRWLAQFEPMHQVFLGVRSLLYLNGNADAGLSQA LTMTSIGLIIGLLLGGFITHLYDRSSFHRIPGAVEMAIAVEHQAQYQARQSARESSSE QP" gene complement(212277..213011) /locus_tag="Rv0181c" /db_xref="GeneID:886788" CDS complement(212277..213011) /locus_tag="Rv0181c" /function="UNKNOWN" /note="Rv0181c, (MTCI28.21c), len: 244 aa. Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. YHHW_ECOLI|P46852 hypothetical 26.3 kd protein from Escherichia coli (231 aa), FASTA scores: opt: 479, E(): 1.2e-29, (37.3% identity in 233 aa overlap); P73623|SLL1773 HYPOTHETICAL 25.7 kDa PROTEIN from Synechocystis sp. strain PCC 6803 (232 aa), FASTA score: (39.1% identity in 233 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214695.1" /db_xref="GI:15607322" /db_xref="GeneID:886788" /translation="MTATVEIRRAADRAVTTTSWLKSRHSFSFGDHYDPDNTHHGLLL VNNDDQMEPASGFDPHPHRDMEIVTWVLRGALRHQDSAGNSGVIYPGLAQRMSAGTGI LHSEMNDSATEPVHFVQMWVIPDATGITASYQQQEIDDELLRAGLVTIASGIPGQDAA LTLHNSSASLHGARLRPGATVSLPCAPFLHLFVAYGRLTLEGGGELADGDAVRFTDAD ARGLTANEPSEVLIWEMHAKLGDSAT" gene complement(213028..214140) /gene="sigG" /locus_tag="Rv0182c" /db_xref="GeneID:886786" CDS complement(213028..214140) /gene="sigG" /locus_tag="Rv0182c" /EC_number="2.7.7.6" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED." /experiment="experimental evidence, no additional details recorded" /note="DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates" /codon_start=1 /transl_table=11 /product="RNA polymerase factor sigma-70" /protein_id="NP_214696.1" /db_xref="GI:15607323" /db_xref="GeneID:886786" /translation="MRTSPMPAKFRSVRVVVITGSVTAAPVRVSETLRRLIDVSVLAE NSGREPADERRGDFSAHTEPYRRELLAHCYRMTGSLHDAEDLVQETLLRAWKAYEGFA GKSSLRTWLHRIATNTCLTALEGRRRRPLPTGLGRPSADPSGELVERREVSWLEPLPD VTDDPADPSTIVGNRESVRLAFVAALQHLSPRQRAVLLLRDVLQWKSAEVADAIGTST VAVNSLLQRARSQLQTVRPSAADRLSAPDSPEAQDLLARYIAAFEAYDIDRLVELFTA EAIWEMPPYTGWYQGAQAIVTLIHQQCPAYSPGDMRLISLIANGQPAAAMYMRAGDVH LPFQLHVLDMAADRVSHVVAFLDTTLFPKFGLPDSL" misc_feature complement(213856..213894) /gene="sigG" /locus_tag="Rv0182c" /note="PS01063 Sigma-70 factors ECF subfamily signature" gene 214088..214927 /locus_tag="Rv0183" /db_xref="GeneID:886785" CDS 214088..214927 /locus_tag="Rv0183" /EC_number="3.1.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0183, (MTCI28.23), len: 279 aa. Possible lysophospholipase (EC 3.1.-.-), similar to several (especially eukaryotic enzymes, weaker with Escherichia coli), e.g. U67963|HSU67963_1 Human lysophospholipase homolog from Homo sapiens (313 aa), FASTA scores: opt: 569, E(): 2.6e-29, (37.1% identity in 259 aa overlap); P07000|PLDB_ECOLI LYSOPHOSPHOLIPASE L2 from Escherichia coli (165 aa), FASTA scores: opt: 219, E(): 0.00012. Start changed based on similarity to AE001997_8 from Deinococcus radiodurans (282 aa), FASTA scores: opt: 510, E(): 1.4e-25, (34.8% identity in 282 aa overlap). Also shows some similarity to epoxide hydrolases from Mycobacterium tuberculosis e.g. Rv1938 FASTA score: (30.7% identity in 114 aa overlap); and O07214|YR15_MYCTU|Rv2715|MT2788|MTCY05A6.36 (341 aa)." /codon_start=1 /transl_table=11 /product="lysophospholipase" /protein_id="NP_214697.2" /db_xref="GI:57116702" /db_xref="GeneID:886785" /translation="MTTTRTERNFAGIGDVRIVYDVWTPDTAPQAVVVLAHGLGEHAR RYDHVAQRLGAAGLVTYALDHRGHGRSGGKRVLVRDISEYTADFDTLVGIATREYPGC KRIVLGHSMGGGIVFAYGVERPDNYDLMVLSAPAVAAQDLVSPVVAVAAKLLGVVVPG LPVQELDFTAISRDPEVVQAYNTDPLVHHGRVPAGIGRALLQVGETMPRRAPALTAPL LVLHGTDDRLIPIEGSRRLVECVGSADVQLKEYPGLYHEVFNEPERNQVLDDVVAWLT ERL" gene 214969..215718 /locus_tag="Rv0184" /db_xref="GeneID:886806" CDS 214969..215718 /locus_tag="Rv0184" /function="UNKNOWN" /note="Rv0184, (MTCI28.24), len: 249 aa. Conserved hypothetical protein, equivalent to CAC32136.1|AL583926 conserved hypothetical protein from Mycobacterium lepra (249 aa); and C-terminus highly similar to CAB08793.1|Z95398 conserved hypothetical protein from Mycobacterium leprae (145 aa), FASTA scores: E(): 0, (75.2 identity in 145 aa overlap). Also similar to 049841|SCE9_39|T36358 hypothetical protein from Streptomyces coelicolor (418 aa), FASTA scores: opt: 231, E(): 8.1e-08, (30.4% identity in 270 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214698.1" /db_xref="GI:15607325" /db_xref="GeneID:886806" /translation="MTNDKMLARIAALLRQAEGTDNPHEADAFMSTAQRLATAASIDL AVARSHAGNRSPAQAPTQRTITIGAAGTRGLRTYVQLFVLIAAANDVRCDVASNSTFV YAYGFAEDIDTSHALYASLVVQMVRASDAYLASGAHRPTPTITARLNFQLAFGARVGQ RLADAREQTRQEATKDRDRPPGTAIALRDKDIELHEYYRRSSKARGAWRASRATAGYS SAARRAGDRAGRQARLGNNPELPGARAALGR" gene 215715..216224 /locus_tag="Rv0185" /db_xref="GeneID:886782" CDS 215715..216224 /locus_tag="Rv0185" /function="UNKNOWN; PROBABLY INVOLVED IN A CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0185, (MTCI28.25a), len: 169 aa. Conserved hypothetical protein, equivalent to CAB08794.1|Z95398|MLCL622_2 from Mycobacterium leprae (168 aa), FASTA scores: opt: 861, E(): 0, (76.4% identity in 165 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214699.1" /db_xref="GI:15607326" /db_xref="GeneID:886782" /translation="MIGADVPRDSQRARVYAAEAFVRTLFDRVTAHGSPTVEFFGTQL TLPPEGRFGSVASVQRYVDDVLALPAVGQNWPTVSPVRVRARRAATAAHYENHGGTGT IAVPDRHTAGWAMRELVVLHEVAHHLCQVPPPHGPEFVATVCTLTELVMGPEVGHVFR VVYAQEGVR" misc_feature 216069..216098 /locus_tag="Rv0185" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene 216269..218344 /gene="bglS" /locus_tag="Rv0186" /db_xref="GeneID:886780" CDS 216269..218344 /gene="bglS" /locus_tag="Rv0186" /EC_number="3.2.1.21" /function="POSSIBLY INVOLVED IN DEGRADATION [CATALYTIC ACTIVITY: Hydrolysis of terminal, non-reducing beta-D-glucose residues with release of beta-D-glucose]." /note="Rv0186, (MTCI28.25b), len: 691 aa. Probable bglS, beta-glucosidase (EC 3.2.1.21), highly similar to many e.g. BGLS_AGRTU|P27034 beta-glucosidase from Agrobacterium tumefaciens (818 aa), FASTA scores: opt: 643, E(): 0, (32.5% identity in 842 aa overlap). SEEMS TO BELONG TO FAMILY 3 OF GLYCOSYL HYDROLASES." /codon_start=1 /transl_table=11 /product="beta-glucosidase" /protein_id="NP_214700.1" /db_xref="GI:15607327" /db_xref="GeneID:886780" /translation="MTDDERFSLLVGLTGASDLWPVRDERIPQGVPMCAGYVPGIPRL GVPALLMSDAGLGVTNPGYRPGDTATALPAGLALAASFNPVLARSSGKAIGREARSRG FNVQLAGAINLARDPRNGRNFEYLSEDPLLSATMAAESIIGIQQQGVIATTKHFSLNC NETNRHWLDAVIDPDAHRESDLLAFEIVIERSQPGAVMAAYNKVNGDYAAGNDHLLND VLKGAWGYRGWVMSDWGGTPSWECALAGLDQECGAQIDAVLWQSEAFTDRLRAAYADG NLPKGRLSDMVRRILRSMFAVGIDRWKPAPAPDMNAHNEIAAQMARQGIVLLQNRGLL PLAPESAGRIAVIGGYAHLGVPAGYGSSAVTPPGGYAGVIPIGGSGLAAGLRNLYLLP SSPLSELRKRLPNAQFEFDPGINPAEAVLAARRADIAIVFAIRAEGEGFDSADLSLPW GQDALIAAVASANANTVVVLETGNPVTMPWRDSVNAIMQAWYPGQAGGQAVAEIVTGQ VNPSGRLPITFPVDLGQTPRSQPPELGAPWGTSTTIHYTEGADVGYRWFASTNQTPMF AFGHGLSYTSFEYRDLVVTGGHTVHASFSVTNTGDRSGADVPQLYMIAAPGESRLRLL GFERVELEPGQTRRVRIEADPRLLARYDGEARSWRIEPGGYTVAVGASAVALKLAAKV KLAGRGFGR" gene 218705..219367 /locus_tag="Rv0187" /db_xref="GeneID:886779" CDS 218705..219367 /locus_tag="Rv0187" /EC_number="2.1.1.-" /function="THOUGHT TO BE INVOLVED IN TRANSFER OF METHYL GROUP." /note="Rv0187, (MTCI28.26), len: 220 aa. Probable O-methyltransferase (EC 2.1.1.-), similar to many e.g. AB93458.1|AL357591 putative O-methyltransferase from Streptomyces coelicolor (223 aa); MDMC_STRMY|Q00719 O-methyltransferase from Streptomyces mycarofaciens (221 aa), FASTA scores: opt: 327, E(): 2.4e-17, (35.9% identity in 192 aa overlap). Also similar to Rv1703c, Rv1220c from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="O-methyltransferase" /protein_id="NP_214701.1" /db_xref="GI:15607328" /db_xref="GeneID:886779" /translation="MGMDQQPNPPDVDAFLDSTLVGDDPALAAALAASDAAELPRIAV SAQQGKFLCLLAGAIQARRVLEIGTLGGFSTIWLARGAGPQGRVVTLEYQPKHAEVAR VNLQRAGVADRVEVVVGPALDTLPTLAGGPFDLVFIDADKENNVAYIQWAIRLARRGA VIVVDNVIRGGGILAESDDADAVAARRTLQMMGEHPGLDATAIQTVGRKGWDGFALAL VR" gene 219486..219917 /locus_tag="Rv0188" /db_xref="GeneID:886776" CDS 219486..219917 /locus_tag="Rv0188" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0188, (MTCI28.27), len: 143 aa. Probable conserved transmembrane protein, similar to T35347|4835334|CAB42956.1|AL049863|SC5H1_31 probable membrane protein from Streptomyces coelicolor (147 aa), FASTA scores: opt: 326, E(): 6.5e-15, (36.2% identity in 141 aa overlap); N-terminus of P80185|MTRC_METTH TETRAHYDROMETHANOPTERIN S-METHYLTRANSFERASE SUBUNIT C (EC 2.1.1.86) from Methanobacterium thermoautotrophicum strain Marburg/DSM 2133 (266 aa), FASTA scores: opt: 125, E(): 0.033, (31.6% identity in 98 aa overlap). Also similar to Rv3635 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214702.1" /db_xref="GI:15607329" /db_xref="GeneID:886776" /translation="MSTVHSSIDQHPDLLALRASFDRAAESTIAHFTFGLALLAGLYV AASPWIVGFSATRGLPTCDLIVGIAVAYLAYGFASALDRTHGMTWTLPVLGVWVIFSP WVLPGVAVTAGMMWSHIIAGAVVAVLGFYFGMRTRAAANQG" gene complement(219996..221723) /gene="ilvD" /locus_tag="Rv0189c" /db_xref="GeneID:886774" CDS complement(219996..221723) /gene="ilvD" /locus_tag="Rv0189c" /EC_number="4.2.1.9" /function="INVOLVED IN VALINE AND ISOLEUCINE BIOSYNTHESIS (AT THE FOURTH STEP) [CATALYTIC ACTIVITY: 2,3-DIHYDROXY-3-METHYLBUTANOATE = 3-METHYL-2- OXOBUTANOATE + H(2)O]." /note="catalyzes the dehydration of 2,3-dihydroxy-3-methylbutanoate to 3-methyl-2-oxobutanoate in valine and isoleucine biosynthesis" /codon_start=1 /transl_table=11 /product="dihydroxy-acid dehydratase" /protein_id="NP_214703.1" /db_xref="GI:15607330" /db_xref="GeneID:886774" /translation="MPQTTDEAASVSTVADIKPRSRDVTDGLEKAAARGMLRAVGMDD EDFAKPQIGVASSWNEITPCNLSLDRLANAVKEGVFSAGGYPLEFGTISVSDGISMGH EGMHFSLVSREVIADSVEVVMQAERLDGSVLLAGCDKSLPGMLMAAARLDLAAVFLYA GSILPGRAKLSDGSERDVTIIDAFEAVGACSRGLMSRADVDAIERAICPGEGACGGMY TANTMASAAEALGMSLPGSAAPPATDRRRDGFARRSGQAVVELLRRGITARDILTKEA FENAIAVVMAFGGSTNAVLHLLAIAHEANVALSLQDFSRIGSGVPHLADVKPFGRHVM SDVDHIGGVPVVMKALLDAGLLHGDCLTVTGHTMAENLAAITPPDPDGKVLRALANPI HPSGGITILHGSLAPEGAVVKTAGFDSDVFEGTARVFDGERAALDALEDGTITVGDAV VIRYEGPKGGPGMREMLAITGAIKGAGLGKDVLLLTDGRFSGGTTGLCVGHIAPEAVD GGPIALLRNGDRIRLDVAGRVLDVLADPAEFASRQQDFSPPPPRYTTGVLSKYVKLVS SAAVGAVCG" misc_feature complement(221283..221315) /gene="ilvD" /locus_tag="Rv0189c" /note="PS00886 Dihydroxy-acid and 6-phosphogluconate dehydratases signature 1" gene 221871..222161 /locus_tag="Rv0190" /db_xref="GeneID:886772" CDS 221871..222161 /locus_tag="Rv0190" /function="UNKNOWN" /note="Rv0190, (MTCI28.29), len: 96 aa. Conserved hypothetical protein, highly similar to several hypothetical proteins e.g. SYCSLRA_35|Q55554|SLL0176 hypothetical 18.9 kDa protein from Synechocystis (167 aa), FASTA scores: opt: 237, E(): 5.8e-16, (39.4% identity in 94 aa overlap). Also highly similar to Z95398|MLCL622_7|O06070 from Mycobacterium leprae (135 aa), FASTA score: (82.6% identity in 92 aa overlap). Also similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0967, Rv0030, Rv1766 (42.5% identity in 80 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214704.1" /db_xref="GI:15607331" /db_xref="GeneID:886772" /translation="MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQ ISAVTSALRSVALNLLDEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS" gene 222289..223530 /locus_tag="Rv0191" /db_xref="GeneID:886770" CDS 222289..223530 /locus_tag="Rv0191" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF DRUG ACROSS THE MEMBRANE." /note="Rv0191, (MTCI28.30), len: 413 aa. Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug, similar to several hypothetical proteins e.g. YDEA_ECOLI|P31122 hypothetical 42.5 kd protein from Escherichia coli (396 aa), FASTA scores: opt: 475, E(): 4.2e-33, (29.7% identity in 381 aa overlap); and to several chloramphenicol resistance proteins e.g. CMLR_STRLI|P31141 chloramphenicol resistance protein from stremtomyces lividans (392 aa), FASTA scores: opt: 394, E(): 6.7e-12, (28.2% identity in 383 aa overlap). Also similar to SVU09991_1 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214705.1" /db_xref="GI:15607332" /db_xref="GeneID:886770" /translation="MTAPTGTSATTTRPWTPRIATQLSVLACAAFIYVTAEILPVGAL SAIARNLRVSVVLVGTLLSWYALVAAVTTVPLVRWTAHWPRRRALVVSLVCLTVSQLV SALAPNFAVLAAGRVLCAVTHGLLWAVIAPIATRLVPPSHAGRATTSIYIGTSLALVV GSPLTAAMSLMWGWRLAAVCVTGAAAAVALAARLALPEMVLRADQLEHVGRRARHHRN PRLVKVSVLTMIAVTGHFVSYTYIVVIIRDVVGVRGPNLAWLLAAYGVAGLVSVPLVA RPLDRWPKGAVIVGMTGLTAAFTLLTALAFGERHTAATALLGTGAIVLWGALATAVSP MLQSAAMRSGGDDPDGASGLYVTAFQIGIMAGALLGGLLYERSLAMMLTASAGLMGVA LFGMTVSQHLFENPTLSPGDG" gene 223564..224664 /locus_tag="Rv0192" /db_xref="GeneID:886768" CDS 223564..224664 /locus_tag="Rv0192" /function="UNKNOWN" /note="Rv0192, (MTCI28.31), len: 366 aa. Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375, E(): 3.2e-24, (36.1% identity in 255 aa overlap); YV09_MYCTU|Q11149|cY20G9.09 hypothetical 47.9 kDa protein from Mycobacterium tuberculosis (451 aa), FASTA scores: opt: 330, E(): 3.2e-13, (35.1% identity in 271 aa overlap). Also similar to Rv0116c, Rv1433, Rv2518c, Rv0483 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214706.1" /db_xref="GI:15607333" /db_xref="GeneID:886768" /translation="MPHWAEERHRRESNYVALEAGLDEGESIRRSEHSRSGCGADAGC WRCRGGPGRGSRRSRRSRGPGGTAGPVDPPAVDLLAPPPDPLALPPALDPLAPPPPDP LAPPPPDPLAVPVAAGPVAGQDPTSFVGPPPFRPPTFNPVDGAMVGVAKPIVINFAVP IADRAMAESAIHISSIPPVPGKFYWMSPTQVRWRPFEFWPANTAVNIDAAGTKSSFRT GDSLVATADDATHQMTITRNGVVQKTFPMSMGMVSGGHQTPNGTYYVLEKFATVVMDS STYGVPVNSAQGYKLTVSDAVRIDNSGNFVHSAPWSVADQGKRNVTHGCINLSPANAK WFYDNFGSGDPVVVKNSVGTYNKNDGAQDWQI" gene 223607..223909 /locus_tag="Rv0192A" /db_xref="GeneID:3205105" CDS 223607..223909 /locus_tag="Rv0192A" /function="UNKNOWN" /note="Rv0192A, len: 100 aa. Probable N-terminal part of Rv0192, which is member of family P5.17 with Rv0116c, Rv1433, Rv2518c, Rv0483. These are all predicted to be exported/membrane proteins. Rv0192A has typical N-terminal signal peptide which is functional and was identified by PhoA fusion screens: O52054 PGB14T-O1 PRECURSOR (FRAGMENT 45 AA) (see citation below). Since Rv0192 misses a signal peptide this suggests that there is a frameshift in the region of the overlap with Rv0192 but none found on reinspection of sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177618.1" /db_xref="GI:57116703" /db_xref="GeneID:3205105" /translation="MSRWKQGWTRGSLFAALNIAAVVAVLMLGAGVAVADPDAAPGDP GGPGAPGAQRDPSTRRQLTCWRRHPTRWRCRRHLTRWRRRHLTRSRRPRLTRWQCR" gene complement(224724..226571) /locus_tag="Rv0193c" /db_xref="GeneID:886764" CDS complement(224724..226571) /locus_tag="Rv0193c" /function="UNKNOWN" /note="Rv0193c, (MTV033.01c-MTCI28.32), len: 615 aa. Hypothetical unknown protein. TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214707.1" /db_xref="GI:15607334" /db_xref="GeneID:886764" /translation="MIQISRDMSSLGQTATTQALPDNSDGIQLTKFAADDILPLEYAP PIGPELVSQDQLPAAWAYKRFRDLDDKESYRRKLLQELTDALAAQGSEAAEIATAALR DLIDQMAEQGAVVLADIVESDDFLELVKRYDELMAREGSRSFIHRFLDLRRSPGMLTD PAVNGALVHPLMIALISYAVGGPIRMIDARGKDAEPLSVLAQDNMLHIDNTPFNDEYK ILITWRRGTAQGPAGQNFTFLPGTHKLARTCFVNEDGVPWSSENASIFTTPDSIRKVF DAQRQLGGQDHPTVIEVTDSERPLSGVFAAGSLVHHRFRTASGSARSCIILVFHRVAD NPGRMVSDVEDSSDVSLSELLTRGVPDESYQQRFIATLCAAADEIAELLLKWKKTPQR PVSLPLQTKQIDGARFEEWISAATKAPEVREIRNRELTIPYGEVLSAEEFFDLIWRLM RFDKHGPLDLILYHDNREEPRKWARNLIREMSADRLYERLLGWLADIQQPRPADCLRP LQIHALISEVLKTLPLDEDQDPPADWHFDLLGMSHAEAARSVKHLLEDVAEALLRCED MAAYLSTSLFAFWAVDAAYSLDGRRNLVVKDCARRLLRHYTMLSLTCFQ" gene 226878..230462 /locus_tag="Rv0194" /db_xref="GeneID:886790" CDS 226878..230462 /locus_tag="Rv0194" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DRUGS ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0194, (MTV033.02), len: 1194 aa. Probable drugs-transport transmembrane protein ATP binding protein ABC transporter (see citation below), highly similar to many e.g. U62129|STU62129_2|T30293 ABC transport protein homolog from Salmonella typhi (1218 aa), FASTA scores: opt: 1116, E(): 0, (36.3% identity in 1209 aa overlap); CAB66302.1|AL136519 ABC transporter protein ATP-binding component from Streptomyces coelicolor (1243 aa); I84547 mdl protein from Escherichia coli (1143 aa); etc. Also similar to MTCY50_9 and MTCY50_10 from Mycobacterium tuberculosis, FASTA score: (33.8% identity in 574 aa overlap). Contains two PS00017 ATP/GTP-binding site motif A (P-loop) and one PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Alternative start possible at 1823 but no RBS." /codon_start=1 /transl_table=11 /product="drugs-transport transmembrane ATP-binding protein ABC transporter" /protein_id="NP_214708.1" /db_xref="GI:15607335" /db_xref="GeneID:886790" /translation="MRTNCWWRLSGYVMRHRRDLLLGFGAALAGTVIAVLVPLVTKRV IDDAIAADHRPLAPWAVVLVAAAGATYLLMYVRRYYGGRIAHLVQHDLRMDAFQALLR WDGRQQDRWSSGQLIVRTTNDLQLVQALLFDVPNVLRHVLTLLLGVAVMTWLSVPLAL LAVLLVPVIGLIAHRSRRLLAAATHCAQEHKAAVTGVVDAAVCGIRVVKAFGQEERET VKLVTASRALYAAQLRVARLNAHFGPLLQTLPALGQMAVFALGGWMAAQGSITVGTFV AFWACLTLLARPACDLAGMLTIAQQARAGAVRVLELIDSRPTLVDGTKPLSPEARLSL EFQRVSFGYVADRPVLREISLSVRAGETLAVVGAPGSGKSTLASLATRCYDVTQGAVR IGGQDVRELTLDSLRSAIGLVPEDAVLFSGTIGANIAYGRPDATPEQIATAARAAHIE EFVNTLPDGYQTAVGARGLTLSGGQRQRIALARALLHQPRLLIMDDPTSAVDAVIECG IQEVLREAIADRTAVIFTRRRSMLTLADRVAVLDSGRLLDVGTPDEVWERCPRYRELL SPAPDLADDLVVAERSPVCRPVAGLGTKAAQHTNVHNPGPHDHPPGPDPLRRLLREFR GPLALSLLLVAVQTCAGLLPPLLIRHGIDVGIRRHVLSALWWAALAGTATVVIRWVVQ WGSAMVAGYTGEQVLFRLRSVVFAHAQRLGLDAFEDDGDAQIVTAVTADVEAIVAFLR TGLVVAVISVVTLVGILVALLAIRARLVLLIFTTMPVLALATWQFRRASNWTYRRARH RLGTVTATLREYAAGLRIAQAFRAEYRGLQSYFAHSDDYRRLGVRGQRLLALYYPFVA LLCSLATTLVLLDGAREVRAGVISVGALVTYLLYIELLYTPIGELAQMFDDYQRAAVA AGRIRSLLSTRTPSSPAARPVGTLRGEVVFDAVHYSYRTREVPALAGINLRIPAGQTV VFVGSTGSGKSTLIKLVARFYDPTHGTVRVDGCDLREFDVDGYRNRLGIVTQEQYVFA GTVRDAIAYGRPDATDAQVERAAREVGAHPMITALDNGYLHQVTAGGRNLSAGQLQLL ALARARLVDPDILLLDEATVALDPATEAVVQRATLTLAARRTTLIVAHGLAIAEHADR IVVLEHGTVVEDGAHTELLAAGGHYSRLWAAHTRLCSPEITQLQCIDA" misc_feature 227976..227999 /locus_tag="Rv0194" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 228288..228332 /locus_tag="Rv0194" /note="PS00211 ABC transporters family signature" misc_feature 229803..229826 /locus_tag="Rv0194" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 230899..231534 /locus_tag="Rv0195" /db_xref="GeneID:886762" CDS 230899..231534 /locus_tag="Rv0195" /function="POSSIBLY SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv0195, (MTV033.03), len: 211 aa. Possible two-component response regulator, luxR family, similar to many e.g. U00008|ECOHU49_15 regulatory protein narP from Escherichia coli strain K12 (225 aa), FASTA scores: opt: 232, E(): 7.3e-09, (29.2% identity in 219 aa overlap). Start chosen by similarity. Contains probable helix-turn-helix motif at aa 166-187 (Score 1164, +3.15 SD). TBparse score is 0.931." /codon_start=1 /transl_table=11 /product="two component transcriptional regulatory protein" /protein_id="NP_214709.1" /db_xref="GI:15607336" /db_xref="GeneID:886762" /translation="MAPVNVISVAVVASDPLTRDGALARLSSHRELDVRAWQAGCETS VLLVLATTITAPLLCQIEDVQKDGPSHAPKLVVVADEFSAEQVFRMIKLGLTGLLYRS QSTFDCIVETIRLSAEGRLRLPERVQRYLVGRIKSTPTAEPDTPCAAALAEREVAVLR LLADGLSTHQVAVQLNYCERTIKNIVHDIVTRLKLRNRTHAVAHALRAGLI" gene 231647..232231 /locus_tag="Rv0196" /db_xref="GeneID:886760" CDS 231647..232231 /locus_tag="Rv0196" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0196, (MTV033.04), len: 194 aa. Possible transcriptional regulatory protein, similar to two Bacillus subtilis regulators: P42105|YXAF_BACSU HYPOTHETICAL 21.0 kDa PROTEIN (191 aa), FASTA scores: opt: 323, E(): 2.1e-15, (30.9% identity in 181 aa overlap); and Z99105|BSUB0002_9 negative regulator of the lincomycin operon (188 aa), FASTA scores: opt: 255, E(): 1e-10, (25.9 identity in 185 aa overlap). TBparse score is 0.885." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214710.1" /db_xref="GI:15607337" /db_xref="GeneID:886760" /translation="MQGPRERMVVSAALLIRERGAHATAISDVLQHSGAPRGSAYHYF PGGRTQLLCEAVDYAGEHVAAMINEAEGGLELLDALIDKYRQQLLSTDFRAGCPIAAV SVEAGDEQDRERMAPVIARAAAVFDRWSDLTAQRFIADGIPPDRAHELAVLATSTLEG AILLARVRRDLTPLDLVHRQLRNLLLAELPERSR" gene 232231..234519 /locus_tag="Rv0197" /db_xref="GeneID:886758" CDS 232231..234519 /locus_tag="Rv0197" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0197, (MTV033.05), len: 762 aa. Possible oxidoreductase (EC 1.-.-.-), similar to others e.g. 9948789|AAG06102.1|AE004699_7|B83307 probable molybdopterin oxidoreductase from Pseudomonas aeruginosa strain PAO1 (769 aa); 5441785|CAB46809.1|AL096811|T36812 probable dehydrogenase from Streptomyces coelicolor (747 aa), FASTA scores: opt: 617, E(): 9.8e-30, (29.9% identity in 762 aa overlap); BAB04334.1|AP001509 assimilatory nitrate reductase (catalytic subunit) from Bacillus halodurans (743 aa); etc. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214711.1" /db_xref="GI:15607338" /db_xref="GeneID:886758" /translation="MTSSDWLPTACILCECNCGIVVQVDDRRLARIRGDKAHPGSAGY TCNKALRLDHYQNNRARLSSPMRRRADGTYEEIDWDTAIVEIAEGFKQIRDTHGGDKI FYYGGGGQGNHLGGAYSGAFLKALGSRYRSNALAQEKTGEAWVDFQLYGGHTRGEFEN AEVSVFVGKNPWMSQSFPRARVVLNEIAKDPGRSMIVIDPVVTDTAKMADFHLRVQPG CDAWCLAALAAVLVQENLCNEAFLAAHVHGVDTVRAALQEVPVADYAQRCGVDEELLR AAARRIGTAASVSVFEDLGIQQAPNSTVCSYLNKLLWILTGNFAKKGGQHLHSSFAPL FSQVSGRTPVTGAPIIAGLIPGNVVPEEILTEHPDRFRAMIVERGNPAHSLADSAACR AAFQALELMVVVDVAMTETARLAHYVLPAASQFEKPEATFFNFEFPRNGFQLRRPLFP PLPGTLPEPEIWARLVRALGVVDEADLRPLREAAAQGRQAYTEAFLAAAATNPTVAKL TAYVLYETLGPTLPDGLAGAAALWGLAQKTAMAYPDAVRRAGHADGNALFDAILERPS GVTFTVHNYEDDFALISHPDHKIALEIPEMLAEIRSLTQTPSRLTTPQLPIVLSVGER RAYTANDIFRDPSWRKRDANGALRVSVEDAQALGLADGCLARITTAAGSAEATVEVTE TMLAGHAALPNGFGLDYTGDDGRTVVAGVAPNALTSTRWRDPYAGTPWHKHVPAAIRR ADAESPIWYPKWAILPARGVLA" gene complement(234516..236507) /locus_tag="Rv0198c" /db_xref="GeneID:886755" CDS complement(234516..236507) /locus_tag="Rv0198c" /EC_number="3.4.24.-" /function="UNKNOWN; HYDROLYZES PEPTIDES AND/OR PROTEINS." /note="Rv0198c, (MTV033.06c), len: 663 aa. Probable zinc metalloprotease (EC 3.4.24.-), equivalent to Z95398|MLCL622.12c from Mycobacterium leprae (667 aa), FASTA scores: opt: 3710, E(): 0, (80.8 % identity in 667 aa overlap). Also similar to many other metalloproteases e.g. members of the eukaryotic neprilysin family: P08473|NEP_HUMAN NEPRILYSIN (EC 3.4.24.11) (749 aa), FASTA scores: opt: 872, E(): 0, (31.1% identity in 692 aa overlap); Q07744|PEPO_LACLA NEUTRAL ENDOPEPTIDASE from Lactococcus lactis (626 aa), FASTA scores: opt: 862, E(): 0, (30.0% identity in 654 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. BELONGS TO PEPTIDASE FAMILY M13 (ZINC METALLOPROTEASE); ALSO KNOWN AS THE NEPRILYSIN SUBFAMILY. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="zinc metalloprotease" /protein_id="NP_214712.1" /db_xref="GI:15607339" /db_xref="GeneID:886755" /translation="MTLAIPSGIDLSHIDADARPQDDLFGHVNGRWLAEHEIPADRAT DGAFRSLFDRAETQVRDLIIQASQAGAAVGTDAQRIGDLYASFLDEEAVERAGVQPLH DELATIDSAADATELAAALGTLQRAGVGGGIGVYVDTDSKDSTRYLVHFTQSGIGLPD ESYYRDEQHAAVLAAYPGHIARMFGLVYGGESRDHAKTADRIVALETKLADAHWDVVK RRDADLGYNLRTFAQLQTEGAGFDWVSWVTALGSAPDAMTELVVRQPDYLVTFASLWA SVNVEDWKCWARWRLIRARAPWLTRALVAEDFEFYGRTLTGAQQLRDRWKRGVSLVEN LMGDAVGKLYVQRHFPPDAKSRIDTLVDNLQEAYRISISELDWMTPQTRQRALAKLNK FTAKVGYPIKWRDYSKLAIDRDDLYGNVQRGYAVNHDRELAKLFGPVDRDEWFMTPQT VNAYYNPGMNEIVFPAAILQPPFFDPQADEAANYGGIGAVIGHEIGHGFDDQGAKYDG DGNLVDWWTDDDRTEFAARTKALIEQYHAYTPRDLVDHPGPPHVQGAFTIGENIGDLG GLSIALLAYQLSLNGNPAPVIDGLTGMQRVFFGWAQIWRTKSRAAEAIRRLAVDPHSP PEFRCNGVVRNVDAFYQAFDVTEDDALFLDPQRRVRIWN" misc_feature complement(235011..235040) /locus_tag="Rv0198c" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene 236550..237209 /locus_tag="Rv0199" /db_xref="GeneID:886753" CDS 236550..237209 /locus_tag="Rv0199" /function="UNKNOWN" /note="Rv0199, (MTV033.07), len: 219 aa. Probable conserved membrane protein, equivalent to Z95398|MLCL622.13 from Mycobacterium leprae (224 aa), FASTA scores: opt: 920, E(): 0, (67.7% identity in 220 aa overlap). Also some similarity to Mce-associated membrane proteins from Mycobacterium tuberculosis e.g. Rv0178, Rv0175, etc. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214713.1" /db_xref="GI:15607340" /db_xref="GeneID:886753" /translation="MPDGEQSQPPAQEDAEDDSRPDAAEAAAAEPKSSAGPMFSTYGI ASTLLGVLSVAAVVLGAMIWSAHRDDSGERTYLTRVMLTAAEWTAVLINMNADNIDAS LQRLHDGTVGQLNTDFDAVVQPYRQVVEKLRTHSSGRIEAVAIDTVHRELDTQSGAAR PVVTTKLPPFATRTDSVLLVATSVSENAGAKPQTVHWNLRLDVSDVDGKLMISRLESI R" gene 237206..237895 /locus_tag="Rv0200" /db_xref="GeneID:886803" CDS 237206..237895 /locus_tag="Rv0200" /function="UNKNOWN" /note="Rv0200, (MTV033.08), len: 229 aa. Possible conserved transmembrane protein, equivalent to Z95398|MLCL622.14 from Mycobacterium leprae (229 aa), FASTA scores: opt: 1147, E(): 0, (74.7% identity in 229 aa overlap). Also some similarity to Rv1973 from Mycobacterium tuberculosis (160 aa); and Rv1362c|Z75555|MTCY02B10_26 (220 aa), FASTA scores: opt: 134, E(): 0.063, (25.8% identity in 159 aa overlap). TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214714.1" /db_xref="GI:15607341" /db_xref="GeneID:886803" /translation="MRNAWRLVVFDVLAPLATIAALAAIGVLLGWPLWWVSTCSVLVL LVVEGVAINFWLLRRDSVTVGTDDDAPGLRLAVVFLCAAAISAAVVTGYLRWTTPDRD FNRDSREVVHLATGMAETVASFSPSAPAAAVDRAAAMMVPEHAGGFKEQYAKSSADLA RRGVTAQAATLAAGVEAIGPSAASVAVILRVSQSIPGQPTSQAARALRVTLTKRGSGW LVLDVTPINAR" gene complement(237892..238395) /locus_tag="Rv0201c" /db_xref="GeneID:886756" CDS complement(237892..238395) /locus_tag="Rv0201c" /function="UNKNOWN" /note="Rv0201c, (MTV033.09c), len: 167 aa. Conserved hypothetical protein, equivalent to Z95398|MLCL622.15c from Mycobacterium leprae (170 aa), FASTA scores: opt: 646, E(): 0, (63.9% identity in 158 aa overlap). TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214715.1" /db_xref="GI:15607342" /db_xref="GeneID:886756" /translation="MTLAAEPHPAPPQQPTVAWSEPDVDRRVEFWPTVAIRSALESGD IATWQRIAAALKRDPYGRTARQVEEVLEGIPATGIANAFWEVLDRARTHLDANERAEV ARQVGLLLDRSGLQRQEFASRIGVTAQDLTAYLDGIVSPSASLMIRMRRLSDRFVRAK SVRAADS" gene complement(238392..241292) /gene="mmpL11" /locus_tag="Rv0202c" /db_xref="GeneID:886750" CDS complement(238392..241292) /gene="mmpL11" /locus_tag="Rv0202c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /note="Rv0202c, (MTV033.10c), len: 966 aa. Probable mmpL11, conserved transmembrane transport protein (see citation below), equivalent to Z95398|MLCL622.16c from Mycobacterium leprae (1014 aa), FASTA scores: opt: 4076, E(): 0, (72.8% identity in 1017 aa overlap). Member of RND superfamily, similar to several putative transport proteins e.g. P96687 from Bacillus subtilis (724 aa), FASTA scores: opt: 594, E(): 9.1e-29, (26.9% identity in 717 aa overlap); etc. BELONGS TO THE MMPL FAMILY. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL11" /protein_id="NP_214716.1" /db_xref="GI:15607343" /db_xref="GeneID:886750" /translation="MMRLSRNLRRCRWLVFTGWLLALVPAVYLAMTQSGNLTGGGFEV AGSQSLLVHDQLDAHYPDRGAPALALVAAPRPDASYQDIDNAVALLRQIASELPGVTE APNPTQRPPQPDRPYVVSLRLDARNAGTSDVAKKLRDRIGVKGDQSGQTANGKVRLYV IGQGALSAAAAANTKHDIANAERWNLPIILMVLVAVFGSLAAAAIPLALAVCTVVITM GLVFVLSMHTTMSVFVTSTVSMFGIALAVDYSLFILMRYREELRCGRRPPDAVDAAMA TSGLAVVLSGMTVIASLTGIYLINTPALRSMATGAILAVAVAMLTSATLTPAVLATFA RAAAKRSALVHWSRRPASTQSWFWSRWVGWVMRRPWITALAASTVLLVMAAPATLMVL GNSLLRQFDSSHEIRTGAAAAAQALGPGALGPVQVLVRFDAGGASAPEHSQTIAAIRH RIAQAPNVVSVAPPRFADDNGSALLSAVLSVDPEDLGARDTITWMRTQLPRVAGAAQV DVGGPTALIKDFDDRVSATQPLVLVFVAVIAFLMLLISIRSVFLAFKGVLMTLLSVAA AYGSLVMVFQWGWARGLGFPALHSIDSTVPPLVLAMTFGLSMDYEIFLLTRIRERFLQ TGQTRDAVAYGVRTSARTITSAALIMIAVFCGFAFAGMPLVAEIGVACAVAIAVDATV VRLVLVPALMAMFDRWNWWLPRWLAHILPSVDFDRPLPKVDLGDVVVIPDDFAAAIPP SADVRMVLKSAAKLKRLAPDAICVTDPLAFTGCGCDGKALDQVQLAYRNGIARAISWG QRPVHPVTVWRKRLAVALDALQTTTWECGGVQTHRAGPGYRRRSPVETTNVALPTGDR LQIPTGAETLRFKGYLIMSRNSSHDYADFADLVDTMAPETAAAVLAGMDRYYSCQAPG RQWMATQLVGRLADPQPSDLGDQSPGADAQAKWEEVRRRCLSVAVAMLEEAR" gene 241514..241924 /locus_tag="Rv0203" /db_xref="GeneID:886748" CDS 241514..241924 /locus_tag="Rv0203" /function="UNKNOWN" /note="Rv0203, (MTV033.11), len: 136 aa. Possible exported protein (has hydrophobic stretch near N-terminus). Some similarity to part of U02459|LDU02459_1 hypothetical protein from Leishmania donovani (741 aa), FASTA score: opt: 111, E(): 9.1, (30.0% identity in 90 aa overlap). TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214717.1" /db_xref="GI:15607344" /db_xref="GeneID:886748" /translation="MKTGTATTRRRLLAVLIALALPGAAVALLAEPSATGASDPCAAS EVARTVGSVAKSMGDYLDSHPETNQVMTAVLQQQVGPGSVASLKAHFEANPKVASDLH ALSQPLTDLSTRCSLPISGLQAIGLMQAVQGARR" gene complement(241976..243214) /locus_tag="Rv0204c" /db_xref="GeneID:886747" CDS complement(241976..243214) /locus_tag="Rv0204c" /function="UNKNOWN" /note="Rv0204c, (MTV033.12c), len: 412 aa. Probable conserved transmembrane protein (see citation below), equivalent, but has C-terminal extension, to Z95398|MLCL622.17c from Mycobacterium leprae (367 aa), FASTA scores: opt: 2002, E(): 0, (82.4% identity in 374 aa overlap). Some similarity to Rv0585c from Mycobacterium tuberculosis. TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214718.1" /db_xref="GI:15607345" /db_xref="GeneID:886747" /translation="MSHDAPARNLRQRVGALPRTRVGAPPAEGVPPRGKYWWLRWAVL AIVAIVLAIEVALGWDQLAKAWVSLYRAKWWWLLAAVAAAGASMHSFAQIQRTLLKSA GVHVKQWRSEAAFYAANSLSTTLPGGPVLSATFLLRQQRIWGASTVVASWQLVMSGVL QAVGLALLGLGGAFFLGAKNNPFSLLFTLGGFVTLLLLAQAVASRPELIEGIGRRVLS WANSVRGRPADAGLPKWRETLMQLESVSLGRRDLGVAFGWSLFNWIADVACLGFAAYA AGDHASVGGLAVAYAAARAVGTIPLMPGGVLVVEAVLVPGLVSSGMPLPSAISAMLIY RLISWLLIAAIGWVVFFFMFRTESTADSDNDRDPPTDPNLRLVIQPQGTPCDDPVETT PQGPAPTPDLRPEGGETPPR" gene 243384..244487 /locus_tag="Rv0205" /db_xref="GeneID:886766" CDS 243384..244487 /locus_tag="Rv0205" /function="UNKNOWN" /note="Rv0205, (MTV033.13), len: 367 aa. Possible conserved transmembrane protein, similar to hypothetical proteins from many bacteria e.g. AL0209|SC4H8_6 from Streptomyces coelicolor (402 aa), FASTA scores: opt: 436, E(): 1.7e-21, (27.2% identity in 349 aa overlap); Z99117|BSUB0014_221 from Bacillus subtilis (353 aa), FASTA scores: opt: 394, E(): 8.6e-19, (28.7% identity in 324 aa overla). TBparse score is 0.885." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214719.1" /db_xref="GI:15607346" /db_xref="GeneID:886766" /translation="MSASLDDASVAPLVRKTAAWAWRFLVILAAMVALLWVLNKFEVI VVPVLLALMLSALLVPPVDWLDSRGLPHAVAVTLVLLSGFAVLGGILTFVVSQFIAGL PHLVTEVERSIDSARRWLIEGPAHLRGEQIDNAGNAAIEALRNNQAKLTSGALSTAAT ITELVTAAVLVLFTLIFFLYGGRSIWQYVTKAFPASVRDRVRAAGRAGYASLIGYARA TFLVALTDAAGVGAGLAVMGVPLALPLASLVFFGAFIPLIGAVVAGFLAVVVALLAKG IGYALITVGLLIAVNQLEAHLLQPLVMGRAVSIHPLAVVLAIAAGGVLAGVVGALLAV PTVAFFNNAVQVLLGGNPFADVADVSSDHLTEV" gene complement(244484..247318) /gene="mmpL3" /locus_tag="Rv0206c" /db_xref="GeneID:886752" CDS complement(244484..247318) /gene="mmpL3" /locus_tag="Rv0206c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0206c, (MTV033.14c, MTCY08D5.01c), len: 944 aa. Possible mmpL3, conserved transmembrane transport protein (see Tekaia et al., 1999), equivalent to Z95398|MLCL622.18c from Mycobacterium leprae (955 aa), FASTA scores: opt: 806, E(): 1.8e-21, (57.2% identity in 243 aa overlap). Member of RND superfamily, similar to others. BELONGS TO THE MMPL FAMILY. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL3" /protein_id="NP_214720.1" /db_xref="GI:15607347" /db_xref="GeneID:886752" /translation="MFAWWGRTVYRYRFIVIGVMVALCLGGGVFGLSLGKHVTQSGFY DDGSQSVQASVLGDQVYGRDRSGHIVAIFQAPAGKTVDDPAWSKKVVDELNRFQQDHP DQVLGWAGYLRASQATGMATADKKYTFVSIPLKGDDDDTILNNYKAIAPDLQRLDGGT VKLAGLQPVAEALTGTIATDQRRMEVLALPLVAVVLFFVFGGVIAAGLPVMVGGLCIA GALGIMRFLAIFGPVHYFAQPVVSLIGLGIAIDYGLFIVSRFREEIAEGYDTETAVRR TVITAGRTVTFSAVLIVASAIGLLLFPQGFLKSLTYATIASVMLSAILSITVLPACLG ILGKHVDALGVRTLFRVPFLANWKISAAYLNWLADRLQRTKTREEVEAGFWGKLVNRV MKRPVLFAAPIVIIMILLIIPVGKLSLGGISEKYLPPTNSVRQAQEEFDKLFPGYRTN PLTLVIQTSNHQPVTDAQIADIRSKAMAIGGFIEPDNDPANMWQERAYAVGASKDPSV RVLQNGLINPADASKKLTELRAITPPKGITVLVGGTPALELDSIHGLFAKMPLMVVIL LTTTIVLMFLAFGSVVLPIKATLMSALTLGSTMGILTWIFVDGHFSKWLNFTPTPLTA PVIGLIIALVFGLSTDYEVFLVSRMVEARERGMSTQEAIRIGTAATGRIITAAALIVA VVAGAFVFSDLVMMKYLAFGLMAALLLDATVVRMFLVPSVMKLLGDDCWWAPRWARRL QTRIGLGEIHLPDERKRPVSNGRPARPPVTAGLVAARAAGDPRPPHDPTHPLAESPRP ARSSPASSPELTPALEATAAPAAPSGASTTRMQIGSSTEPPTTRLAAAGRSVQSPAST PPPTPTPPSAPSAGQTRAMPLAANRSTDAAGDPAEPTAALPIIRSDGDDSEAATEQLN ARGTSDKTRQRRRGGGALSAQDLLRREGRL" gene complement(247384..248112) /locus_tag="Rv0207c" /db_xref="GeneID:886742" CDS complement(247384..248112) /locus_tag="Rv0207c" /function="UNKNOWN" /note="Rv0207c, (MTCY08D5.02c), len: 242 aa. Conserved hypothetical protein, equivalent to Z95398|MLCL622_19 from Mycobacterium leprae (261 aa), FASTA scores: E(): 0, (60.8 identity in 199 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214721.1" /db_xref="GI:15607348" /db_xref="GeneID:886742" /translation="MSLTEDVTSQTSESLARHSVLAEDLSQDGLTSLGAPGARVLLVW DAPNLDMGLGSILGRRPTALERPRFDALGRWLLARTAEIVAGRPGISTEPEATVFTNI APGSAEVVRPWVDALRNVGFAVFAKPKVDEDSDVDRDMLAHIDERYREGLAALVVASA DGQAFRQPLEAVARSGTPVQVLGFREHASWALASDTLEFVDLEDIAGVFREPLPRIGL DSLPEQGAWLQPFRPLSSLLTSRV" gene complement(248115..248906) /gene="trmB" /locus_tag="Rv0208c" /gene_synonym="yggH" /db_xref="GeneID:886740" CDS complement(248115..248906) /gene="trmB" /locus_tag="Rv0208c" /gene_synonym="yggH" /EC_number="2.1.1.33" /function="CAUSES METHYLATION." /note="tRNA (guanine-N(7)-)-methyltransferase; catalyzes the formation of N(7)-methylguanine at position 46 (m7G46) in tRNA by transferring the methyl residue from S-adenosyl-L-methionine" /codon_start=1 /transl_table=11 /product="tRNA (guanine-N(7)-)-methyltransferase" /protein_id="NP_214722.1" /db_xref="GI:15607349" /db_xref="GeneID:886740" /translation="MVHHGQMHAQPGVGLRPDTPVASGQLPSTSIRSRRSGISKAQRE TWERLWPELGLLALPQSPRGTPVDTRAWFGRDAPVVLEIGSGSGTSTLAMAKAEPHVD VIAVDVYRRGLAQLLCAIDKVGSDGINIRLILGNAVDVLQHLIAPDSLCGVRVFFPDP WPKARHHKRRLLQPATMALIADRLVPSGVLHAATDHPGYAEHIAAAGDAEPRLVRVDP DTELLPISVVRPATKYERKAQLGGGAVIELLWKKHGCSERDLKIR" gene 249038..250123 /locus_tag="Rv0209" /db_xref="GeneID:886739" CDS 249038..250123 /locus_tag="Rv0209" /function="UNKNOWN" /note="Rv0209, (MTCY08D5.04), len: 361 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214723.1" /db_xref="GI:15607350" /db_xref="GeneID:886739" /translation="MRGQGHQIFVDELARFATSSADQRVVAIAQRAAEPLRVAVRGRP GVGCRTVARALQGAGSSSGMTVTPQARAADSDVDLVVYVTVEVVKPEDREAIAATRRP VVAVLNKADLAGPLSGAGPIVMAQARCAQFSTLLGVPMESMIGLLAVAALDDLDDTLR AVLRALAAHPDGFDALDRAVAGFLAAALPVPTEVRLRLLDTLDLFGIALGMAAFRPGR PSRTPAQLRTLLRRVSGVDAVIDKVTAAGSEVRYRRLLDAVAELEALAAQAKEIGGPI GEFLRDDDTVLARMAAAVDVALAVGLDVGPLDDPAAHLPRAVRWHRYSLDNGDMHRTC GADIARGSLRLWSLAGGMPLHRYRKSS" gene 250120..251598 /locus_tag="Rv0210" /db_xref="GeneID:886735" CDS 250120..251598 /locus_tag="Rv0210" /function="UNKNOWN" /note="Rv0210, (MTCY08D5.05), len: 492 aa. Hypothetical unknown protein. Possibly membrane protein; has hydrophobic stretches around aa 333 - 381." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214724.1" /db_xref="GI:15607351" /db_xref="GeneID:886735" /translation="MIRAASDDPAGVDELVAAIAPGLAGLGLPVINRREVVLVTGPWL AGVSGVRAALAERLPQRRFVETAELGPGDAPVAVVFVVSAATALTESDCVLLDTAAEH TDAVVAVVSKIDVHRGWRDVLTSNRDRLAARASRYARVPWVGAAAAPELGEPYLDDLV AAIQKQLADPAVARRNMLRAWESRLLMVARRFDGDAQSAGRRARVDALRQQRRTVLRQ GRQSKSEHTIALRAQIQHARVKLSYFARNRCSLLRVELQEHVAGLSRKDIARFAAYTR GRVQEVVAEVGEGAVAHLADVAQLLGVPVQPPVLENLPAVLPTVVAPPLTSRRLEIRL TTLLGAGFGLGIALTLSRLVAGLTPGLAASGMVAGVAIGLAVTAWVVNARALLHDRVV VDRWTGEVTASLRSVVEQLVATRVVAVETLLSTAISERDDAENARVADQVSIIDGELR EHAVAAARAAALRDREMPAVRAALEAVRAELGEPGAPTTGLF" gene 251782..253602 /gene="pckA" /locus_tag="Rv0211" /db_xref="GeneID:886744" CDS 251782..253602 /gene="pckA" /locus_tag="Rv0211" /EC_number="4.1.1.32" /function="RATE-LIMITING GLUCONEOGENIC ENZYME [CATALYTIC ACTIVITY: GTP + OXALOACETATE = GDP + PHOSPHOENOLPYRUVATE + CO2]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the phosphorylation and decarboxylation of oxaloacetate to form phosphoenolpyruvate using GTP" /codon_start=1 /transl_table=11 /product="phosphoenolpyruvate carboxykinase" /protein_id="NP_214725.1" /db_xref="GI:15607352" /db_xref="GeneID:886744" /translation="MTSATIPGLDTAPTNHQGLLSWVEEVAELTQPDRVVFTDGSEEE FQRLCDQLVEAGTFIRLNPEKHKNSYLALSDPSDVARVESRTYICSAKEIDAGPTNNW MDPGEMRSIMKDLYRGCMRGRTMYVVPFCMGPLGAEDPKLGVEITDSEYVVVSMRTMT RMGKAALEKMGDDGFFVKALHSVGAPLEPGQKDVAWPCSETKYITHFPETREIWSYGS GYGGNALLGKKCYSLRIASAMAHDEGWLAEHMLILKLISPENKAYYFAAAFPSACGKT NLAMLQPTIPGWRAETLGDDIAWMRFGKDGRLYAVNPEFGFFGVAPGTNWKSNPNAMR TIAAGNTVFTNVALTDDGDVWWEGLEGDPQHLIDWKGNDWYFRETETNAAHPNSRYCT PMSQCPILAPEWDDPQGVPISGILFGGRRKTTVPLVTEARDWQHGVFIGATLGSEQTA AAEGKVGNVRRDPMAMLPFLGYNVGDYFQHWINLGKHADESKLPKVFFVNWFRRGDDG RFLWPGFGENSRVLKWIVDRIEHKAGGATTPIGTVPAVEDLDLDGLDVDAADVAAALA VDADEWRQELPLIEEWLQFVGEKLPTGVKDEFDALKERLG" misc_feature 252586..252612 /gene="pckA" /locus_tag="Rv0211" /note="PS00505 Phosphoenolpyruvate carboxykinase (GTP) signature" gene complement(253669..254640) /gene="nadR" /locus_tag="Rv0212c" /db_xref="GeneID:886734" CDS complement(253669..254640) /gene="nadR" /locus_tag="Rv0212c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0212c, (MTCY08D5.07c), len: 323 aa. Possible nadR (alternate gene name: nadI), transcriptional regulator, similar to others e.g. NADR_ECOLI|P27278 transcriptional regulator from Escherichia coli (410 aa), FASTA scores: opt: 377, E (): 1e-17, (31.1% identity in 347 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop).; nadI" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein NadR" /protein_id="NP_214726.1" /db_xref="GI:15607353" /db_xref="GeneID:886734" /translation="MTHGMVLGKFMPPHAGHVYLCEFARRWVDELTIVVGSTAAEPIP GAQRVAWMRELFPFDRVVHLANENPQRPWEHPDFWDIWKASLQGVLATRPDFVFGAEP YNADFAQVLGARFVAVDHGRTVVPVTATDIRADPLGHWQHIPRCVRPAFVKRVSIIGP ESTGKTTLAQAVAEKLRTKWVPERAKMLRELNGGSLIGLEWAEIVRGQIASEEALARD ADRVLICDTDPLATTVWAEFLAGGCPQELRDLARRPYDLTLLTTPDVPWDADDGRCVP GARGTFFARCEQALRAAGRSFVVITGGWEERLSVSLRAVEELVRARR" misc_feature complement(254143..254166) /gene="nadR" /locus_tag="Rv0212c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(254637..255950) /locus_tag="Rv0213c" /db_xref="GeneID:886746" CDS complement(254637..255950) /locus_tag="Rv0213c" /EC_number="2.1.1.-" /function="CAUSES METHYLATION." /note="Rv0213c, (MTCY08D5.08c), len: 437 aa. Possible methyltransferase (EC 2.1.1.-), weakly similar to others methyltransferases e.g. AF127374_30|LINA from Streptomyces lavendulae (611 aa), FASTA scores: opt: 400, E(): 8.1e-19, (27.3% identity in 388 aa overlap); Q50258 fortimicin kl1 methyltransferase (553 aa), FASTA scores: opt: 267, E(): 1.2e-13, (29.3% identity in 351 aa overlap)." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_214727.1" /db_xref="GI:15607354" /db_xref="GeneID:886746" /translation="MSIKAYAKTQGIAVTSVNGLVAGHGSVQETWLAMQSAAALSGTP RLVGFSCIDTFPEVLWLAQRARQAWDGVRIVIGNAMATLNYERILRQHDCFDYVVVGD GEVAFTKLALALANDAAVDDVPGLARRSEQGQILRTPSSLVDLDELPRPARDELPTVL ADGFAASVFSTRGCPYRCTFCGTGAMSAMLGKDSYRAKSVDAVVDEIDYLVSDYDVNF LSITDDLFISKHPGSQQRAADFANAVLRRGISVNFMVDIRLDSVVDLDLFKHLHRAGL RRVFIGVETGSYEQLRAYRKQILTRGQDAADTINALQQLGIDVIPGTIMFHPTVQPDE LRETVRLLRATKYTVGFKFMSRIVPYPGTPLYQAYSDAGYLTAKWPLGQWEFVDPEAS RVYADVVAKVAPDVGISFDEAEAYFLSRLDEWENVIAGRIAEATS" gene 256064..257677 /gene="fadD4" /locus_tag="Rv0214" /db_xref="GeneID:886737" CDS 256064..257677 /gene="fadD4" /locus_tag="Rv0214" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--CoA ligase" /protein_id="NP_214728.1" /db_xref="GI:15607355" /db_xref="GeneID:886737" /translation="MPRGELYKRFRLVMGGIAPCGSGRRAATYPRRMQIRPYIGADKP AVILYPSGTVISFDELEARANRLAHWFRQAGLREDDVVAILMENNEHVHAVMWAARRS GLYYVPINTHLTASEAAYIVDNSGAKAIVGSAALRETCHGLAEHLPGGLPDLLMLAGG GLVGWMTYPECVADQPDTPIEDEREGDLLQYSSGTTGRPKGIKRELPHVSPDAAPGMM PALLDFWMDADSVYLSPAPMYHTAPSVWTMSALAAGVTTVVMEKFDAEGALDAIQRYR VTHAQFVPAMFVRMLKLPEAVRNSYDMSSLRRVIHAAAPCPVQIKEQMIHWWGPIIDE YYASSEASGSTLITAEDWLTHPGSVGKPIQGGVHIVGADGSELPPNQPGEIYFEGGYP FEYLNDPAKTAASRNKHGWVTVGDVGYLDDDGYLFLTGRRHHMIISGGVNIYPQEAEN LLVAHPKVLDAAVFGVPDDEMGQRVMAAVQTVDSADANDQFAGELLAWLRDRLSHFKC PRSIAFEPQLPRTDTGKLYKSGLVEKYSV" misc_feature 256628..256663 /gene="fadD4" /locus_tag="Rv0214" /note="PS00455 Putative AMP-binding domain signature" gene complement(257783..258856) /gene="fadE3" /locus_tag="Rv0215c" /db_xref="GeneID:886730" CDS complement(257783..258856) /gene="fadE3" /locus_tag="Rv0215c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0215c, (MTCY08D5.10c), len: 357 aa. Probable fadE3, acyl- dehydrogenase (EC 1.3.99.-), similar to many e.g. ACDB_BACSU|P45857 acyl-CoA dehydrogenase from B. subtilis (EC 1.3.99.-) (379 aa), FASTA scores: opt: 812, E(): 0, (39.5% identity in 354 aa overlap)." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE3" /protein_id="NP_214729.1" /db_xref="GI:15607356" /db_xref="GeneID:886730" /translation="MRNELNDDEAMLVATVRAFIDRDVKPTVREVEHANSYPEAWIEQ MKHIGIYGLAIDEQYGGSPVSMPCYVQVTQELARGWMSLAGAMGGHTVVAKLLTLFGT EEQRRTYLPPMASGELRATMALTEPGGGSDLQNMSTTALADGPEGSAGLLINGCKTWI SNARRSGLFAVLCKTDPNATPRHQGMSIVLVEPGPGLTVSRDLPKLGYKGVESCELSF DNLRVPVSAILGGAMGQGFSQMMKGLETGRIQVAARALGVATAALEDSLAYAQQRESF GRPIWQHQAVGNYLADMATKLTAARQLTRYAAERYDSGQRCDMEAGMAKLFASEVAME IALNAVRIHGGYGYSTEYDVERR" gene 258913..259926 /locus_tag="Rv0216" /db_xref="GeneID:886729" CDS 258913..259926 /locus_tag="Rv0216" /function="UNKNOWN" /note="Rv0216, (MTCY08D5.11), len: 337 aa. Conserved hypothetical protein, equivalent to Z95398|MLCL622_22 from Mycobacterium leprae (339 aa), FASTA scores: E(): 0, (73.7 identity in 338 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214730.1" /db_xref="GI:15607357" /db_xref="GeneID:886729" /translation="MASGYGGIRVGGPYFDDLSKGQVFDWAPGVTLSLGLAAAHQSIV GNRLRLALDSDLCAAVTGMPGPLAHPGLVCDVAIGQSTLATQRVKANLFYRGLRFHRF PAVGDTLYTRTEVVGLRANSPKPGRAPTGLAGLRMTTIDRTDRLVLDFYRCAMLPASP DWKPGAVPGDDLSRIGADAPAPAADPTAHWDGAVFRKRVPGPHFDAGIAGAVLHSTAD LVSGAPELARLTLNIAATHHDWRVSGRRLVYGGHTIGLALAQATRLLPNLATVLDWES CDHTAPVHEGDTLYSELHIESAQAHADGGVLGLRSLVYAVSDSASEPDRQVLDWRFSA LQF" gene complement(259923..260831) /gene="lipW" /locus_tag="Rv0217c" /db_xref="GeneID:886726" CDS complement(259923..260831) /gene="lipW" /locus_tag="Rv0217c" /EC_number="3.1.1.-" /function="UNKNOWN; LIPOLYTIC ENZYME PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0217c, (MTCY08D5.12c), len: 302 aa. Possible esterase (EC 3.1.1.-), showing similarity with others e.g. EST_ACICA|P18773 esterase (303 aa), FASTA scores: opt: 320, E(): 3.2e-13, (29.2% identity in 274 aa overlap)." /codon_start=1 /transl_table=11 /product="esterase LipW" /protein_id="NP_214731.1" /db_xref="GI:15607358" /db_xref="GeneID:886726" /translation="MSGNEVHPDLRRIAVVTPRQLVGPRTLPVMRALIVVAGLRMSRT PPDIEVLTLESGVGVRLYRPAGSNEPAPALLWIHAGGYVMGTAQQDDRLCLRFSSRLG ITVASVDYRLAPENPYPAALGDCYSALTWLASLPAVDPARVAIGGASAGGGLAAALAL LARDRGGITPAFQLLVYPMLDDRPSIAPANPHYRLWNGRANRFGWRAYLGDADARVAV PGRRDDLGGLAPAWIGVGTHDLLHDEDLAYAERLTAAGVPCQVEVVEGAFHGFDRVAP NVGVSQRFFTSQCNSLRAALALSNRT" gene 260924..262252 /locus_tag="Rv0218" /db_xref="GeneID:886727" CDS 260924..262252 /locus_tag="Rv0218" /function="UNKNOWN" /note="Rv0218, (MTCY08D5.13), len: 442 aa. Probable conserved transmembrane protein, some similarity with sulfite oxidases (EC 1.8.3.1) e.g. SUOX_HUMAN|P51687 sulfite oxidase precursor (488 aa), FASTA scores: opt: 153, E(): 0.0087, (28.6% identity in 161 aa overlap); and with some nitrate reductases (EC 1.6.6.3) e.g. NIA_FUSOX|P39863 nitrate reductase (NADPH) (905 aa), FASTA scores: opt: 143, E(): 0.06, (29.3% identity in 92 aa overlap). Also similar to BSUB0017_86 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214732.1" /db_xref="GI:15607359" /db_xref="GeneID:886727" /translation="MSDPARGAEAEDAYGFPAGLWRWLQRHPPPALHRLTRFRSPLRG PWLTSVFGLVLLVALPFVIITGLLSYIAYAPQLGQAIPGDVGWLRLPAFTWPTRPSWL YRLTQGLHVGLGLVIIPVVLAKLWSVIPRLFVWPPARSIAQVLERLSVLMLVGGILFQ IVTGVLNIQYDYIFGFSFYTGHYFGAWVFIAGFLLHIVVKIPHMVTGLRSIPMREVLG TNVADTRAQPCDPDGLVSVNPGEATLSRRGALGLVGAGVLLIGVLTVGQTLGGFTRKA ALLLPRGRVVSPGDFPVNKTAAAAGITAEAIGPDWRLVLCGGPAEVVLDRATLAGLPQ RTARLPLACVEGWSAVRTWSGVPLAELALLAGVPAARSARVTSLQRGGAFGEAKLAAN QIADPDALLALRVDGADLSLDHGYPARIIVPALPGVHNTKWVAGIEFHKR" gene 262254..262802 /locus_tag="Rv0219" /db_xref="GeneID:886725" CDS 262254..262802 /locus_tag="Rv0219" /function="UNKNOWN" /note="Rv0219, (MTCY08D5.14), len: 182 aa. Probable conserved transmembrane protein, showing similarity with CAB76992.1|AL159178 putative lipoprotein from Streptomyces coelicolor (163 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214733.1" /db_xref="GI:15607360" /db_xref="GeneID:886725" /translation="MFDIATRFKNSYGSGPLHLLAMVSGFALLGYIVATARPSALWNQ ATWWQSIAVWFVAAVVAHDLLLYPLYALADRILARLVGRRDVSAPRRRPELPVRNYIR IPALAAGLTLLVFLPGIIRQGAPTYLDATGQTQEPFLGRWLLLTAVAFGISAAAYAIR LVVAHVRRRRAGCSRVDAIDEE" gene 262812..264023 /gene="lipC" /locus_tag="Rv0220" /db_xref="GeneID:886722" CDS 262812..264023 /gene="lipC" /locus_tag="Rv0220" /EC_number="3.1.1.-" /function="UNKNOWN; LIPOLYTIC ENZYME PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0220, (MTCY08D5.15), len: 403 aa. Probable esterase (EC 3.1.1.-), similar to others proteins and esterases from various organisms and Mycobacterium tuberculosis e.g. Q50681 (431 aa), FASTA scores: opt: 841, E(): 0, (38.2% identity in 408 aa overlap); Rv1426c, Rv1399c, etc. Contains PS00122 Carboxylesterases type-B serine active site." /codon_start=1 /transl_table=11 /product="esterase LipC" /protein_id="NP_214734.1" /db_xref="GI:15607361" /db_xref="GeneID:886722" /translation="MNQRRAAGSTGVAYIRWLLRARPADYMLALSVAGGSLPVVGKHL KPLGGVTAIGVWGARHASDFLSATAKDLLTPGINEVRRRDRASTQEVSVAALRGIVSP DDLAVEWPAPERTPPVCGALRHRRYVHRRRVLYGDDPAQLLDVWRRKDMPTKPAPVLI FVPGGAWVHGSRAIQGYAVLSRLAAQGWVCLSIDYRVAPHHRWPRHILDVKTAIAWAR ANVDKFGGDRNFIAVAGCSAGGHLSALAGLTANDPQYQAELPEGSDTSVDAVVGIYGR YDWEDRSTPERARFVDFLERVVVQRTIDRHPEVFRDASPIQRVTRNAPPFLVIHGSRD CVIPVEQARSFVERLRAVSRSQVGYLELPGAGHGFDLLDGARTGPTAHAIALFLNQVH RSRAQFAKEVI" misc_feature 263481..263528 /gene="lipC" /locus_tag="Rv0220" /note="PS00122 Carboxylesterases type-B serine active site" gene 264067..265476 /locus_tag="Rv0221" /db_xref="GeneID:886719" CDS 264067..265476 /locus_tag="Rv0221" /function="UNKNOWN" /note="Rv0221, (MTCY08D5.16), len: 469 aa. Conserved hypothetical protein, similar to others proteins from Mycobacterium tuberculosis e.g. Q50680|Rv2285|MT2343|MTCY339.25c hypothetical 47.7 kDa protein (445 aa), FASTA scores: opt: 455, E(): 8.1e-23, (26.7% identity in 461 aa overlap); Rv3740c, Rv3734c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214735.1" /db_xref="GI:15607362" /db_xref="GeneID:886719" /translation="MKRLSGWDAVLLYSETPNVHMHTLKVAVIELDSDRQEFGVDAFR EVIAGRLHKLEPLGYQLVDVPLKFHHPMWREHCQVDLNYHIRPWRLRAPGGRRELDEA VGEIASTPLNRDHPLWEMYFVEGLANHRIAVVAKIHHALADGVASANMMARGMDLLPG PEVGRYVPDPAPTKRQLLSAAFIDHLRHLGRIPATIRYTTQGLGRVRRSSRKLSPALT MPFTPPPTFMNHRLTPERRFATATLALIDVKATAKLLGATINDMVLAMSTGALRTLLL RYDGKAEPLLASVPVSYDFSPERISGNRFTGMLVALPADSDDPLQRVRVCHENAVSAK ESHQLLGPELISRWAAYWPPAGAEALFRWLSERDGQNKVLNLNISNVPGPRERGRVGA ALVTEIYSVGPLTAGSGLNITVWSYVDQLNISVLTDGSTVQDPHEVTAGMIADFIEIR RAAGLSVELTVVESAMAQA" gene 265507..266295 /gene="echA1" /locus_tag="Rv0222" /db_xref="GeneID:886723" CDS 265507..266295 /gene="echA1" /locus_tag="Rv0222" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_214736.1" /db_xref="GI:15607363" /db_xref="GeneID:886723" /translation="MSSESDAANTEPEVLVEQRDRILIITINRPKAKNAVNAAVSRGL ADAMDQLDGDAGLSVAILTGGGGSFCAGMDLKAFARGENVVVEGRGLGFTERPPTKPL IAAVEGYALAGGTELALAADLIVAARDSAFGIPEVKRGLVAGGGGLLRLPERIPYAIA MELALTGDNLPAERAHELGLVNVLAEPGTALDAAIALAEKITANGPLAVVATKRIITE SRGWSPDTMFAEQMKILVPVFTSNDAKEGAIAFAERRRPRWTGT" gene complement(266301..267764) /locus_tag="Rv0223c" /db_xref="GeneID:886718" CDS complement(266301..267764) /locus_tag="Rv0223c" /EC_number="1.2.1.-" /function="THOUGHT TO OXIDIZE A WIDE VARIETY OF ALIPHATIC AND AROMATIC ALDEHYDES." /note="Rv0223c, (MTCY08D5.18), len: 487 aa. Probable aldehyde dehydrogenase (EC 1.2.1.-), similar to others e.g. A75608|6460525|AAF12231.1|AE001862_57 aldehyde dehydrogenase from Deinococcus radiodurans strain R1 (495 aa); Q47943 L-sorbosone dehydrogenase NAD(P) dependent from Gluconobacter oxydans (498 aa), FASTA scores: opt: 1157, E (): 0, (42.1% identity in 482 aa overlap); etc. Also similar to Rv0768, Rv2858c, etc from Mycobacterium tuberculosis. Contains PS00687 Aldehyde dehydrogenases glutamic acid active site; and PS00070 Aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase" /protein_id="NP_214737.1" /db_xref="GI:15607364" /db_xref="GeneID:886718" /translation="MSDSATEYDKLFIGGKWTKPSTSDVIEVRCPATGEYVGKVPMAA AADVDAAVAAARAAFDNGPWPSTPPHERAAVIAAAVKMLAERKDLFTKLLAAETGQPP TIIETMHWMGSMGAMNYFAGAADKVTWTETRTGSYGQSIVSREPVGVVGAIVAWNVPL FLAVNKIAPALLAGCTIVLKPAAETPLTANALAEVFAEVGLPEGVLSVVPGGIETGQA LTSNPDIDMFTFTGSSAVGREVGRRAAEMLKPCTLELGGKSAAIILEDVDLAAAIPMM VFSGVMNAGQGCVNQTRILAPRSRYDEIVAAVTNFVTALPVGPPSDPAAQIGPLISEK QRTRVEGYIAKGIEEGARLVCGGGRPEGLDNGFFIQPTVFADVDNKMTIAQEEIFGPV LAIIPYDTEEDAIAIANDSVYGLAGSVWTTDVPKGIKISQQIRTGTYGINWYAFDPGS PFGGYKNSGIGRENGPEGVEHFTQQKSVLLPMGYTVA" misc_feature complement(266889..266924) /locus_tag="Rv0223c" /note="PS00070 Aldehyde dehydrogenases cysteine active site" misc_feature complement(266985..267008) /locus_tag="Rv0223c" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" gene complement(267863..268627) /locus_tag="Rv0224c" /db_xref="GeneID:886715" CDS complement(267863..268627) /locus_tag="Rv0224c" /EC_number="2.1.1.-" /function="CAUSES METHYLATION" /note="Rv0224c, (MTCY08D5.19c), len: 254 aa. Possible methyltransferase (EC 2.1.1.-), showing weak similarity with other methyltransferases e.g. P74388 STEROL-C-METHYLTRANSFERASE (318 aa), FASTA scores: opt: 190, E(): 3.6e-05, (33.3% identity in 114 aa overlap). Equivalent to AL022486|MLCB1883_1 from Mycobacterium leprae (269 aa), FASTA scores: opt: 1456, E(): 0, (82.9% identity in 252 aa overlap). Also some similarity with MTCY21B4.22c from Mycobacterium tuberculosis FASTA score: (30.1% identity in 136 aa overlap)." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_214738.1" /db_xref="GI:15607365" /db_xref="GeneID:886715" /translation="MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAM IGDLWLATHSEPPVGRTLLDVGGGPGYFATAFSDAGVGYIGVEPDPDEMHAAGPAFTG RPGMFVRASGMALPFADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKPGGLVVLSYTV WLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSSLFAVSAAEGLRWAAGTGA ALAVFPRYHPRWAWWLTSVPVLREFLVSNLVLVLTP" gene 268663..269817 /locus_tag="Rv0225" /db_xref="GeneID:886713" CDS 268663..269817 /locus_tag="Rv0225" /function="POSSIBLY INVOLVED IN LPS BIOSYNTHESIS." /note="Rv0225, (MTCY08D5.20), len: 384 aa. Possible conserved protein involved in LPS biosynthesis, similar to O26275 LPS BIOSYNTHESIS RFBU RELATED PROTEIN (382 aa), FASTA scores: opt: 426, E(): 1.2e-20, (28.2% identity in 394 aa overlap). Some similarity with Rv3032 from Mycobacterium tuberculosis FASTA score: (31.6% identity in 228 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214739.1" /db_xref="GI:15607366" /db_xref="GeneID:886713" /translation="MSALRSVLLLCWRDIGHPQGGGSEAYLQRIGAQLAASGIAVTLR TARYPGAPRHELVDGVRISRAGGRYSVYLWALLAMAAARCGLGPLRRVRPDVVVDTQN GWPFVARLLYGRRSLVLVHHCHREQWPVAGRMMGRLGWYVESMLSPRLHRRNQYVTVS LPSARDLIALGVDSERIAVVRNGLDEAPSPTLSGPRAPTPRVVVLSRLVPHKQIEDAL AAVAELQPRIPGLHLDIVGGGWWRQRLVDHVHRLDIADAVTFHGHVDDVTKHHVLQSS WVHLLPSRKEGWGLAVIEAAQHGVPTIGYRSSGGLADSIVDGVTGILVDDRAELVAWL EQLLSDSVLRDQLGAKAQARSGEFSWRQSAEALRSVLEAVQASRFVSGVV" gene complement(269834..271564) /locus_tag="Rv0226c" /db_xref="GeneID:886711" CDS complement(269834..271564) /locus_tag="Rv0226c" /function="UNKNOWN" /note="Rv0226c, (MTCY08D5.21c), len: 576 aa. Probable conserved transmembrane protein, equivalent, except in N-terminal part, to AC32114.1|AL583926 conserved membrane protein from Mycobacterium leprae (600 aa), FASTA scores: opt: 2086, E(): 0, (70.3% identity in 579 aa overlap). Also similar to AL021411|SC7H1_20 from Streptomyces coelicolor (483 aa), FASTA scores: opt: 180, E(): 0.00028, (26.5 identity in 388 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214740.1" /db_xref="GI:15607367" /db_xref="GeneID:886711" /translation="MRWFRPGYALVLVLLLAAPLLRPGYLLLRDAVSTPRSYVSANAL GLTSAPRATPQDFAVALASHLVDGGVVVKALLLLGLWLAGWGAARLVATALPAAGAAG QFVAITLAIWNPYVAERLLQGHWSLLVGYGCLPWVATAMLTMRTTVGAGWFGLFGLAF WVALAGLTPSGLLLAATVAVVCVAMPGAGRPRWQCGVAALGSALVGALPWLTASALGS SLTSHTAANQLGVTAFAPRAEPGLGTLGSLASLGGIWNGEAVPSSRTTLFAVASAVVL LAMVAIGLPTVARRPVAVPLLTLAAVSVMVPAVLATGPGLHALRVVVDAAPGLGVLRD GQKWVALAVPGYTLSGAGTVLTLRRWLRPATAAVVCCLALVLTLPDLAWGVWGKVAPV HYPSGWAAVAAAINADPRTVAVLPAGTMRRFSWSGSAPVLDPLPRWVRADVLTTGDLV ISGVTVPGEDAHARAVQELLLTGPHPSTLAAAGVGWLVVESDSAGDMGAAARTLGRLA AAHRDDELALYRVGGQTSGASSARLKATMLAHWAWLSMLLVGGAGAAGYWVRRHLHHC EDTPASRAQD" gene complement(271574..272839) /locus_tag="Rv0227c" /db_xref="GeneID:886710" CDS complement(271574..272839) /locus_tag="Rv0227c" /function="UNKNOWN" /note="Rv0227c, (MTCY08D5.22c), len: 421 aa. Possible conserved membrane protein, equivalent to AL022486|MLCB1883_4 from Mycobacterium leprae (448 aa), FASTA scores: opt: 2148, E(): 0, (76.6% identity in 423 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214741.1" /db_xref="GI:15607368" /db_xref="GeneID:886710" /translation="MLRFAACGAIGLGAALLIAALLLSTYTTSRIAEIPLDIDATLIS DGTGTALDSASLATEHIVVNQDVPLVSQQQVTVESPANADVVTLQVGSSLRRTDKQKD SGLLLAIVDTVTLNRKTAMAVSDDTHTGGAVQKPRGLNDENPPTAIPLRHDGLSYRFP FHTEKKTYPYFDPIAQKAFDANYEGEEDVNGLTTYRFTQNVGYTPEGKLVAPLKYPSL YAGDEDGKVTTSAAMWGLPGDPNEQITMTRYYAAQRTFWVDPVSGTIVKETERANHYF ARDPLKPEVTFADYQVTSTEETVESQVNAARDERDRLALWSRVLPITFTAAGLVALVG GGLFASFSLRTEGALMAASGDRDDHDYRRGGFEEPVPGAEAETEKLPTQRPDFPREPS GSDPPRLGSAQPPPPPDAGHPDPGPPERR" repeat_region complement(272855..272955) /note="101 bp Mycobacterial Interspersed Repetitive Unit, class III" gene 273055..274278 /locus_tag="Rv0228" /db_xref="GeneID:886708" CDS 273055..274278 /locus_tag="Rv0228" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0228, (MTCY08D5.23), len: 407 aa. Probable integral membrane acyltransferase (EC 2.3.1.-), equivalent to 3063875|CAA18555.1|AL022486|T44870 ACYLTRANSFERASE from Mycobacterium leprae (384 aa), FASTA scores: opt: 2004, E(): 0, (79.3% identity in 381 aa overlap). Also similar to others e.g. Q11064 PROBABLE ACYLTRANSFERASE CY50.28C (383 aa), FASTA scores: opt: 372, E(): 2.6e-16, (35.9% identity in 359 aa overlap); Q00718|MDMB_STRMY ACYLTRANSFERASE. Very similar to Rv0111, Rv1254, etc from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="integral membrane acyltransferase" /protein_id="NP_214742.1" /db_xref="GI:15607369" /db_xref="GeneID:886708" /translation="MGPADESGAPIRPQTPHRHTVLVTNGQVVGGTRGFLPAVEGMRA CAAVGVVVTHVAFQTGHSSGVGGRLFGRFDLAVAVFFAVSGFLLWRGHAAAARDLRSH PRTGPYLRSRVARIMPAYVVAVVVILSLLPDADHASLTVWLANLTLTQIYVPLTLTGG LTQMWSLSVEVAFYAALPVLALLGRRIPVGARVPAIAALAALSWAWGWLPLDAGSGIN PLTWPPAFFSWFAAGMLLAEWAYSPVGLPHRWARRRVAMAVTALLGYLVAASPLAGPE GLVPGTAAQFAVKTAMGSLVAFALVAPLVLDRPDTSHRLLGSPAMVTLGRWSYGLFIW HLAALAMVFPVIGAFPFTGRMPTVLVLTLIFGFAIAAVSYALVESPCREALRRWERRN EPISVGELQADAIAP" gene complement(274306..274986) /locus_tag="Rv0229c" /db_xref="GeneID:886724" CDS complement(274306..274986) /locus_tag="Rv0229c" /function="UNKNOWN" /note="Rv0229c, (MTCY08D5.24c), len: 226 aa. Possible conserved membrane protein, similar to several proteins from Mycobacterium tuberculosis. Other possible start sites and could be shorter as C-terminal region has some similarity with Rv2757c|D70880 from Mycobacterium tuberculosis (138 aa), FASTA scores: E(): 1e-15, (45.3% identity in 137 aa overlap), and Rv0301, Rv2546, etc. Also some similarity with Q48177 virulence associated protein C (132 aa), FASTA scores: opt: 101, E(): 0.6, (24.3% identity in 136 aa overlap). Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214743.1" /db_xref="GI:15607370" /db_xref="GeneID:886724" /translation="MRQPRRANAMGLALCIYIGSLLIYTPIHGETSRRHRRAGFKHGS YRIGHDDDQRHRQRGPAASHVSASSTRRRRSRHAGRRTARGPRRSMALKYLLDTSVIK RLSRPAVRRAVEPLAEAGAVARTQITDLEVGYSARNETEWQRLMVALSAFDLIESTAS HHRRALGIQRLLAARSQRGRKIPDLLIAAAGEEHGLVVLHYDADFDLIAAVTGQPCQW IVPAGTID" misc_feature complement(274393..274425) /locus_tag="Rv0229c" /note="PS00626 Regulator of chromosome condensation (RCC1) signature 2" gene complement(274983..275963) /gene="php" /locus_tag="Rv0230c" /db_xref="GeneID:886705" CDS complement(274983..275963) /gene="php" /locus_tag="Rv0230c" /EC_number="3.1.8.1" /function="ENZYMATIC ACTIVITY IS NOT YET KNOWN [CATALYTIC ACTIVITY: Aryl dialkyl phosphate + H2O = dialkyl phosphate + an aryl alcohol]." /note="Rv0230c, (MTCY08D5.26c), len: 326 aa. Probable php, phosphotriesterase (EC 3.1.8.1), similar to others e.g. AAK42653.1|AE006849 putative aryldialkylphosphatase (phosphotriesterase) (paraoxonase) from Sulfolobus solfataricus (314 aa); PHP_ECOLI|P45548 PHOSPHOTRIESTERASE HOMOLOGY PROTEIN from Escherichia coli (292 aa), FASTA scores: opt: 408, E(): 7.1e-20, (31.1% identity in 305 aa overlap ); OPD_FLASP|P16648 parathion hydrolase precursor (365 aa), FASTA scores: opt: 319, E(): 5.1e-14, (34.5% identity in 333 aa overlap); etc. BELONGS TO THE PHOSPHOTRIESTERASE FAMILY. COFACTOR: CONTAINS 2 MOLES OF ZINC PER SUBUNIT." /codon_start=1 /transl_table=11 /product="phosphotriesterase" /protein_id="NP_214744.1" /db_xref="GI:15607371" /db_xref="GeneID:886705" /translation="MPELNTARGPIDTADLGVTLMHEHVFIMTTEIAQNYPEAWGDED KRVAGAIARLGELKARGVDTIVDLTVIGLGRYIPRIARVAAATELNIVVATGLYTYND VPFYFHYLGPGAQLDGPEIMTDMFVRDIEHGIADTGIKAGILKCATDEPGLTPGVERV LRAVAQAHKRTGAPISTHTHAGLRRGLDQQRIFAEEGVDLSRVVIGHCGDSTDVGYLE ELIAAGSYLGMDRFGVDVISPFQDRVNIVARMCERGHADKMVLSHDACCYFDALPEEL VPVAMPNWHYLHIHNDVIPALKQHGVTDEQLHTMLVDNPRRIFERQGGYQ" gene 276058..277764 /gene="fadE4" /locus_tag="Rv0231" /db_xref="GeneID:886703" CDS 276058..277764 /gene="fadE4" /locus_tag="Rv0231" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0231, (MTCY08D5.27), len: 568 aa. Probable fadE4, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. O29752 ACYL-CoA DEHYDROGENASE (ACD-3) from Archaeoglobus fulgidus (576 aa), FASTA scores: opt: 1788, E(): 0, (51.0% identity in 577 aa overlap); ACDB_BACSU|P45857 acyl-coa dehydrogenase from Bacillus subtilis (379 aa), FASTA scores: opt: 232, E(): 2.2e- 08, (21.6% identity in 291 aa overlap)." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE4" /protein_id="NP_214745.1" /db_xref="GI:15607372" /db_xref="GeneID:886703" /translation="MLLNPNHLTRKYPDRRSGEIMAATVDFFESRGKARLKHDDHERI WYSDFLDFVGRERIFASLLTPASYGADDCRWDTYRISEFAEIMGFYGLSYWYPFQVTA LGLGPIWMSANEDAKRKAAAGLEAGEVFAFGLSEQTHGADVYQTDMILTPSDGGWTAN GEKYYIGNANVARMVSTFGKIAGTPESQEYVFFVADSQHERYDLIKNVVNSQNYVANY ALRDYPVTEADILHRGAEAFHAALNTVNVCKYNLGWGAIGMCTHALYESVTHAANRHL YGTVVTDFSHVRRLLTDAYVRLIAMKLVASRASDYMRSASAADRRYLLYSPLTKAKVT SEGERVITALWDVIAAKGVEKDTFFETVAREIGLLPRLEGTVHINIGLLGKFMPNYLF APDSTLPVIPRRDDAADDAFLFAQGPTGGLGKVRFHDWRASFDTCAHLPNVALLREQV DVFAELLASATPDAAQQKDIDFAFGVGQLFANVPYAQLILEEARLSGVDEALIDEIFG VLVRDFNTHAVELHGRSATTAEQARFAMRMVRRPVHDPARYDQIWKDHVLALNGAYQM AP" gene 277899..278588 /locus_tag="Rv0232" /db_xref="GeneID:886701" CDS 277899..278588 /locus_tag="Rv0232" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0232, (MTCY08D5.28), len: 229 aa. Probable transcriptional regulatory protein, tetR/AcrR family, similar to others e.g. YIXD_BACSU|P32398 hypothetical transcriptional regulator (191 aa), FASTA scores: opt: 149, E(): 0.0014, (21.5% identity in 158 aa overlap). Also similar to MTV030_11 from Mycobacterium tuberculosis. Contains PS01081 Bacterial regulatory proteins, tetR family signature, and probable helix-turn helix motif from aa 33-54 (Score 1142, +3.08 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="TetR/ACRR family transcriptional regulator" /protein_id="NP_214746.1" /db_xref="GI:15607373" /db_xref="GeneID:886701" /translation="MPTVTWARVDPARRAAVVEAAEAEFGAHGFSRGSLNVIARRAGV AKGSLFQYFADKRDLYAFIADIASQRVRSYMEDLIRELDPNRPFFEFLTDLLDGWVAY FAEHPRERALHAAATLEVDTDARISVRSVLHRHYLDVLRPLVRDAHARGDLRADSDTG ALMSLLLLIFPHLALAPYMRGLDPILGLDEPTPEQPALAVRRLVAVLAAAFDAQHPAT NSAQTRSEEIT" misc_feature 277983..278075 /locus_tag="Rv0232" /note="PS01081 Bacterial regulatory proteins, tetR family signature" gene 278585..279529 /gene="nrdB" /locus_tag="Rv0233" /db_xref="GeneID:886699" CDS 278585..279529 /gene="nrdB" /locus_tag="Rv0233" /EC_number="1.17.4.1" /function="INVOLVED IN THE DNA REPLICATION PATHWAY (FIRST REACTION). PROVIDES THE PRECURSORS NECESSARY FOR DNA SYNTHESIS [CATALYTIC ACTIVITY: 2'-deoxyribonucleoside diphosphate + oxidized thioredoxin + H2O = ribonucleoside diphosphate + reduced thioredoxin]." /note="Catalyzes the rate-limiting step in dNTP synthesis" /codon_start=1 /transl_table=11 /product="ribonucleotide-diphosphate reductase subunit beta" /protein_id="NP_214747.1" /db_xref="GI:15607374" /db_xref="GeneID:886699" /translation="MTRTRSGSLAAGGLNWASLPLKLFAGGNAKFWHPADIDFTRDRA DWEKLSDDERDYATRLCTQFIAGEEAVTEDIQPFMSAMRAEGRLADEMYLTQFAFEEA KHTQVFRMWLDAVGISEDLHRYLDDLPAYRQIFYAELPECLNALSADPSPAAQVRASV TYNHIVEGMLALTGYYAWHKICVERAILPGMQELVRRIGDDERRHMAWGTFTCRRHVA ADDANWTVFETRMNELIPLALRLIEEGFALYGDQPPFDLSKDDFLQYSTDKGMRRFGT ISNARGRPVAEIDVDYSPAQLEDTFADEDRRTLAAASA" gene complement(279605..281140) /gene="gabD1" /locus_tag="Rv0234c" /db_xref="GeneID:886732" CDS complement(279605..281140) /gene="gabD1" /locus_tag="Rv0234c" /EC_number="1.2.1.16" /function="INVOLVED IN 4-AMINOBUTYRATE (GABA) DEGRADATION PATHWAY [CATALYTIC ACTIVITY: SUCCINATE SEMIALDEHYDE + NAD(P)(+) + H(2)O = SUCCINATE + NAD(P)H]." /experiment="experimental evidence, no additional details recorded" /note="NADP-dependent semialdehyde dehydrogenase; part of alternative pathway from alpha-ketoglutarate to succinate" /codon_start=1 /transl_table=11 /product="succinic semialdehyde dehydrogenase" /protein_id="NP_216247.2" /db_xref="GI:57116704" /db_xref="GeneID:886732" /translation="MRSVTCSATLVLPVIEPTPADRRPRHLLLGSAGHVSGRLDTGRF VQTHPAKDVSVPIATINPATGETVKTFTAATDDEVDAAIARAHRRFADYRQTSFAQRA RWANATADLLEAEADQAAAMMTLEMGKTLAAAKAEALKCAKGFRYYAENAEALLADEP ADAAKVGASAAYGRYQPLGVILAVMPWNFPLWQAVRFAAPALMAGNVGLLKHASNVPQ CALYLADVIARGGFPDGCFQTLLVSSGAVEAILRDPRVAAATLTGSEPAGQSVGAIAG NEIKPTVLELGGSDPFIVMPSADLDAAVSTAVTGRVQNNGQSCIAAKRFIVHADIYDD FVDKFVARMAALRVGDPTDPDTDVGPLATEQGRNEVAKQVEDAAAAGAVIRCGGKRLD RPGWFYPPTVITDISKDMALYTEEVFGPVASVFRAANIDEAVEIANATTFGLGSNAWT RDETEQRRFIDDIVAGQVFINGMTVSYPELPFGGVKRSGYGRELSAHGIREFCNIKTV WIA" misc_feature complement(280172..280207) /gene="gabD1" /locus_tag="Rv0234c" /note="PS00070 Aldehyde dehydrogenases cysteine active site" gene complement(281166..282614) /locus_tag="Rv0235c" /db_xref="GeneID:886695" CDS complement(281166..282614) /locus_tag="Rv0235c" /function="UNKNOWN" /note="Rv0235c, (MTCY08D5.31c), len: 482 aa. Probable conserved transmembrane protein, highly similar to AL133278|CAB61913.1|SCM11_2 putative integral membrane protein from Streptomyces coelicolor (470 aa), FASTA scores: opt: 2116, E(): 0, (61.8% identity in 474 aa overlap); and similar to hypothetical proteins from other organisms e.g. Q13392|384D8_7 hypothetical protein (579 aa), FASTA scores: opt: 355, E(): 6.9e-17, (28.5% identity in 569 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214749.1" /db_xref="GI:15607376" /db_xref="GeneID:886695" /translation="MGWFSAPEYWLGRLALERGTAIIYLIAFVAAAQQFRPLIGEHGM LPVPRYLAGQSFWRTPSIFHFRYSDRVFAGVCWLGAVLSAAVVAGAASFVPLWATMLI WLTLWVLYLSIVNVGQAWYSFGWESLLLETGFLMIFLGNERTAPPILTLLLARWLLFR VEFGAGLIKMRGDSCWRSLTCLYYHHETQPMPGPLSWFFHHLPKPLHRIEVAGNHFAQ LVVPFGLFTPQPAASIAAAIIVVTQLWLVASGNFSWLNWLTILLACSAIDTSSAAALL PMPAQPALSAPPQWFAGLVVVFTAAVLLLSYWPARNLLSSHQRMNMSFNPFHLVNTYG AFGSICRTRREVVIEGTDESPITEQTVWKAYEFKGKPGDPRRLPRQWAPYHLRLDWLM WFAAISPGYALPWMTPFLNRLLRNDPATLKLLRHNPFPQSPPRYVRAQLYQYRFTTVA ELRRDRAWWHRTLIGRYVPPMSLRKVASPPAD" gene complement(282649..286851) /locus_tag="Rv0236c" /db_xref="GeneID:886707" CDS complement(282649..286851) /locus_tag="Rv0236c" /function="UNKNOWN" /note="Rv0236c, (MTV034.01c, MTV034.02c, MTCY08D5.32c), len: 1400 aa. Probable conserved transmembrane protein, equivalent to AL022486|CAC32102.1|MLCB1883_7 possible integral membrane protein from Mycobacterium leprae (1440 aa), FASTA scores: opt: 7491, E(): 0, (78.8% identity in 1397 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214750.1" /db_xref="GI:15607377" /db_xref="GeneID:886707" /translation="MAPLSRKWLPVVGAVALALTFAQSPGQVSPDTKLDLTANPLRFL ARATNLWNSDLPFGQAQNQAYGYLFPHGTFFVIGHLLGVPGWVTQRLWWAVLLTVGFW GLLRVAEALGVGGPSSRVVGAVAFALSPRVLTTLGSISSETLPMMLAPWVLLPTILAL RGTSGRSVRALAAQAGLAVALMGAVNAIATLAGCLPAVIWWACHRPNRLWWRYTAWWL LAMALATLWWVMALTQLHGVSPPFLDFIESSGVTTQWSSLVEVLRGTDSWTPFVAPNA TAGAPLVTGSAAILGTCLVAAAGLAGLTSPAMPARGRLVTMLLVGVVLLAVGHRGGLA SPVAHPVQAFLDAAGTPLRNVHKVGPVIRLPLVLGLAQLLSRVPLPGSAPRPAWLRAF AHPERDKRVAVAVVALTALMVSTSLAWTGRVAPPGTFGALPQYWQEAADWLRTHHAAT PTPGRVLVVPGAPFATQVWGTSHDEPLQVLGDGPWGVRDSIPLTPPQTIRALDSVQRL FAAGRPSAGLADTLARQGISYVLVRNDLDPETSRSARPILLHRSIAGSPGLAKLAEFG APVGPDPLAGFVNDSGLRPRYPAIEIYRVSAPANPGAPYFAATDQLARVDGGPEVLLR LDERRRLQGQPPLGPVLMTADARAAGLPVPQVAVTDTPVARETDYGRVDHHSSAIRAP GDARHTYNRVPDYPVPGAEPVVGGWTGGRITVSSSSADATAMPDVAPASAPAAAVDGD PATAWVSNALQAAVGQWLQVDFDRPVTNAVVTLTPSATAVGAQVRRILIETVNGSTTL RFDEAGKPLTAALPYGETPWVRFTAAATDDGSAGVQFGITDLAITQYDASGFAHPVQL RHTVLVPGPPPGSAIAGWDLGSELLGRPGCAPGPDGVRCAASMALAPEEPANLSRTLT VPRPVSVTPMVWVRPRQGPKLADLIAAPSTTRASGDSDLVDILGSAYAAADGDPATAW TAPQRVVQHKTPPTLTLTLPRPTVVTGLRLAASRSMLPAHPTVVAINLGDGPQVRQLQ VGELTTLWLHPRVTDTVSVSLLDWDDVIDRNALGFDQLKPPGLAEVVVLSAGGAPIAP ADAARNRARALTVDCDHGPVVAVAGRFVHTSIRTTVGALLDGEPVAALPCEREPIALP AGQQELLISPGAAFVVDGAQLSTPGAGLSSATVTSAETGAWGPTHREVRVPESATSRV LVVPESINSGWVARTSTGARLTPIAVNGWQQAWVVPAGNPGTITLTFAPNSLYRASLA IGLALLPLLALLAFWRTGRRQLADRPTPPWRPGAWAAAGVLAAGAVIASIAGVMVMGT ALGVRYALRRRERLRDRVTVGLAAGGLILAGAALSRHPWRSVDGYAGNWASVQLLALI SVSVVAASVVATSESRGQDRMQ" gene complement(286898..287071) /locus_tag="Rv0236A" /db_xref="GeneID:3205106" CDS complement(286898..287071) /locus_tag="Rv0236A" /function="UNKNOWN" /note="Rv0236A, len: 57 aa. Small secreted protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177619.1" /db_xref="GI:57116705" /db_xref="GeneID:3205106" /translation="MNRIVAPAAASVVVGLLLGAAAIFGVTLMVQQDKKPPLPGGDPS SSVLNRVEYGNRS" gene 287186..288352 /gene="lpqI" /locus_tag="Rv0237" /db_xref="GeneID:886693" CDS 287186..288352 /gene="lpqI" /locus_tag="Rv0237" /function="UNKNOWN" /note="Rv0237, (MTV034.03), len: 388 aa. Probable lpQI, conserved lipoprotein, equivalent to AL022486|MLCB1883_8|T44873 probable secreted hydrolase from Mycobacterium leprae (387 aa), FASTA scores: opt: 1831, E(): 0, (73.3% identity in 390 aa overlap). Also similar to other lipoproteins and various hydrolases e.g. P40406|2126897|YBBD_BACSU|I39839 HYPOTHETICAL 70.6 KDA LIPOPROTEIN from Bacillus subtilis (642 aa); P48823|HEXA_ALTSO BETA-HEXOSAMINIDASE A PRECURSOR from ALTEROMONAS SP. (598 aa), FASTA scores: opt: 415, E(): 5.8e-17, (31.2% identity in 343 aa overlap); PCC6803|P74340 BETA-GLUCOSIDASE from Synechocystis sp. (538 aa), FASTA scores: opt: 414, E(): 6.1e-17, (30.6 identity in 320 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqI" /protein_id="YP_177702.1" /db_xref="GI:57116706" /db_xref="GeneID:886693" /translation="MAFPRTLAILAAAAALVVACSHGGTPTGSSTTSGASPATPVAVP VPRSCAEPAGIPALLSPRDKLAQLLVVGVRDAADAQAVVTNYHVGGILIGSDTDLTIF DGALAEIVAGGGPLPLAVSVDEEGGRVSRLRSLIGGTGPSARELAQTRTVQQVRDLAR DRGRQMRKLGITIDFAPVVDVTDAPDDTVIGDRSFGSDPATVTAYAGAYAQGLRDAGV LPVLKHFPGHGRGSGDSHNGGVTTPPLDDLVGDDLVPYRTLVTQAPVGVMVGHLQVPG LTGSEPASLSKAAVNLLRTGTGYGAPPFDGPVFSDDLSGMAAISDRFGVSEAVLRTLQ AGADIALWVTTKEVPAVLDRLEQALRAGELPMSAVDRSVVRVATMKGPNPGCGR" misc_feature 287213..287245 /gene="lpqI" /locus_tag="Rv0237" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 288428..289042 /locus_tag="Rv0238" /db_xref="GeneID:886691" CDS 288428..289042 /locus_tag="Rv0238" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0238, (MTV034.04), len: 204 aa. Possible transcriptional regulatory protein, TetR family, equivalent to AL022486|MLCB1883_9|T44874 probable transcription regulator from Mycobacterium leprae (208 aa), FASTA scores: opt: 1029, E(): 0, (80.9% identity in 199 aa overlap). Also similar to others e.g. CAB77290.1|AL160312 putative tetR-family regulatory protein from Streptomyces coelicolor (240 aa). Also similar to Mycobacterium tuberculosis proteins Z95120|Rv3208 (228 aa), FASTA scores: opt: 266, E(): 8.3e-12, (28.1% identity in 196 aa overlap); and Rv1019 (197 aa)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_214752.1" /db_xref="GI:15607379" /db_xref="GeneID:886691" /translation="MAGGTKRLPRAVREQQMLDAAVQMFSVNGYHETSMDAIAAEAQI SKPMLYLYYGSKEDLFGACLNREMSRFIDALRSSINFDQSPKDLLRNTIVSFLRYIDA NRASWIVMYTQATSSQAFAHTVREGREQIVQLVAELVRAGTRGPLTDAEIEMMAVALV GAGEAVATRLGIGDTDVDEAAEMMINLFWLGLKGAPVDRLETGH" gene 289104..289337 /locus_tag="Rv0239" /db_xref="GeneID:886689" CDS 289104..289337 /locus_tag="Rv0239" /function="UNKNOWN" /note="Rv0239, (MTV034.05), len: 77 aa. Conserved hypothetical protein, weakly similar to Rv1839c|Z83859|MTCY359_34 from Mycobacterium tuberculosis (87 aa), FASTA scores: opt: 88, E(): 5, (40.0% identity in 45 aa overlap). TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214753.1" /db_xref="GI:15607380" /db_xref="GeneID:886689" /translation="MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPR RDAASDTWQPPTPRRLGPFRASEETWRELANEA" gene 289345..289782 /locus_tag="Rv0240" /db_xref="GeneID:886688" CDS 289345..289782 /locus_tag="Rv0240" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0240, (MTV034.06), len: 145 aa. Conserved hypothetical protein, weak similarity with Rv3697c from Mycobacterium tuberculosis (145 aa), FASTA scores: opt: 145, E(): 7.6e-05, (28.0% identity in 143 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214754.1" /db_xref="GI:15607381" /db_xref="GeneID:886688" /translation="MLSIDTNILLYAQNRDCPEHDAAAAFLVECAGRADVAVCELVLM ELYQLLRNPTVVTRPLEGPEAAEVCQTFRRNRRWALLENAPVMNEVWVLAATPRIARR RLFDARLALTLRHHGVDEFATRNINGFTDFGFSRVWDPITSDG" gene complement(289812..290654) /locus_tag="Rv0241c" /db_xref="GeneID:886686" CDS complement(289812..290654) /locus_tag="Rv0241c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0241c, (MTV034.07c), len: 280 aa. Conserved hypothetical protein, highly similar to MLCB1883.17c|T44876063881|CAA18566.1|AL022486 hypothetical protein from Mycobacterium leprae (280 aa), FASTA scores: opt: 1564, E(): 0, (81.8% identity in 280 aa overlap); and CAC32097.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (300 aa). Also similar to proteins from other organisms e.g. CAB77291.1|AL160312 putative dehydratase from Streptomyces coelicolor (291 aa); part of BAA92930.1|AB032743 fatty acid synthetase beta subunit from Pichia angusta (2060 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214755.1" /db_xref="GI:15607382" /db_xref="GeneID:886686" /translation="MTQPSGLKNLLRAAAGALPVVPRTDQLPNRTVTVEELPIDPANV AAYAAVTGLRYGNQVPLTYPFALTFPSVMSLVTGFDFPFAAMGAIHTENHITQYRPIA VTDAVGVRVRAENLREHRRGLLVDLVTNVSVGNDVAWHQVTTFLHQQRTSLSGEPKPP PQKKPKLPPPAAVLRITPAKIRRYAAVGGDHNPIHTNPIAAKLFGFPTVIAHGMFTAA AVLANIEARFPDAVRYSVRFAKPVLLPATAGLYVAEGDGGWDLTLRNMAKGYPHLTAT VRGL" gene complement(290665..292029) /gene="fabG" /locus_tag="Rv0242c" /db_xref="GeneID:886697" CDS complement(290665..292029) /gene="fabG" /locus_tag="Rv0242c" /EC_number="1.1.1.100" /function="INVOLVED IN THE FATTY ACID BIOSYNTHESIS PATHWAY (FIRST REDUCTION STEP) [CATALYTIC ACTIVITY: (3R)-3-hydroxyacyl-[acyl-carrier protein] + NADP+ = 3-oxoacyl-[acyl-carrier protein] + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the first of the two reduction steps in the elongation cycle of fatty acid synthesis" /codon_start=1 /transl_table=11 /product="3-ketoacyl-(acyl-carrier-protein) reductase" /protein_id="NP_214756.1" /db_xref="GI:15607383" /db_xref="GeneID:886697" /translation="MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPL TGSLLIGGAGRVVEPLRAALEKDYDLVGNNLGGRWADSFGGLVFDATGITEPAGLKGL HEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGKELRRGATTAL VYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADDSTPPADWEKPLDGKVAIV TGAARGIGATIAEVFARDGAHVVAIDVESAAENLAETASKVGGTALWLDVTADDAVDK ISEHLRDHHGGKADILVNNAGITRDKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNG SIGEGGRVIGLSSIAGIAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGF IETQMTAAIPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQA MIGA" misc_feature complement(290905..290991) /gene="fabG" /locus_tag="Rv0242c" /note="PS00061 Short-chain dehydrogenases/reductases family signature" gene 292171..293493 /gene="fadA2" /locus_tag="Rv0243" /db_xref="GeneID:886682" CDS 292171..293493 /gene="fadA2" /locus_tag="Rv0243" /EC_number="2.3.1.9" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION [CATALYTIC ACTIVITY: Acyl-CoA + acetyl-CoA = CoA + 3-oxoacyl-CoA]." /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_214757.1" /db_xref="GI:15607384" /db_xref="GeneID:886682" /translation="MAPAAKNTSQTRRRVAVLGGNRIPFARSDGAYADASNQDMFTAA LSGLVDRFGLAGERLDMVVGGAVLKHSRDFNLMRECVLGSELSPYTPAFDLQQACGTG LQAAIAAADGIAAGRYEVAAAGGVDTTSDPPIGLGDDLRRTLLKLRRSRSNVQRLKLV GTLPASLGVEIPANSEPRTGLSMGEHAAVTAKQMGIKRVDQDELAAASHRNMADAYDR GFFDDLVSPFLGLYRDDNLRPNSSVEKLATLRPVFGVKAGDATMTAGNSTPLTDGASV ALLASEQWAEAHSLAPLAYLVDAETAAVDYVNGNDGLLMAPTYAVPRLLARNGLSLQD FDFYEIHEAFASVVLAHLAAWESEEYCKRRLGLDAALGSIDRSKLNVNGSSLAAGHPF AATGGRILAQTAKQLAEKKAAKKGGGPLRGLISICAAGGQGVAAILEA" misc_feature 293434..293475 /gene="fadA2" /locus_tag="Rv0243" /note="PS00099 Thiolases active site" gene complement(293798..295633) /gene="fadE5" /locus_tag="Rv0244c" /db_xref="GeneID:886698" CDS complement(293798..295633) /gene="fadE5" /locus_tag="Rv0244c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0244c, (MTV034.10c), len: 611 aa. Probable fadE5, acyl-CoA dehydrogenase (EC 1.3.99.-), equivalent to AL022486|MLCB1883_15 from Mycobacterium leprae (611 aa), FASTA scores: opt: 3598, E(): 0, (89.4% identity in 611 aa overlap). Also highly similar to AL0211|MTV007.14 from Mycobacterium tuberculosis (609 aa), FASTA scores: opt: 2576, E(): 0, (64.6% identity in 611 aa overlap); and to various other bacterial proteins described as putative acyl-CoA dehydrogenases e.g. AE0010|AE001025_6 from Archaeoglobus fulgidus (387 aa), FASTA scores: opt: 229, E(): 6.8e-08, (29.8% identity in 409 aa overlap); etc." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE5" /protein_id="NP_214758.1" /db_xref="GI:15607385" /db_xref="GeneID:886698" /translation="MSHYRSNVRDQVFNLFEVLGVDKALGHGEFSDVDVDTARDMLAE VSRLAEGPVAESFVEGDRNPPVFDPKTHSVMLPESFKKSVNAMLEAGWDKVGIDEALG GMPMPKAVVWALHEHILGANPAVWMYAGGAGFAQILYHLGTEEQKKWAVLAAERGWGS TMVLTEPDAGSDVGAARTKAVQQADGSWHIDGVKRFITSGDSGDLFENIFHLVLARPE GAGPGTKGLSLYFVPKFLFDVETGEPGERNGVFVTNVEHKMGLKVSATCELAFGQHGV PAKGWLVGEVHNGIAQMFEVIEQARMMVGTKAIATLSTGYLNALQYAKSRVQGADLTQ MTDKTAPRVTITHHPDVRRSLMTQKAYAEGLRALYLYTATFQDAAVAEVVHGVDAKLA VKVNDLMLPVVKGVGSEQAYAKLTESLQTLGGSGFLQDYPIEQYIRDAKIDSLYEGTT AIQAQDFFFRKIVRDKGVALAHVSGQIQEFVDSGAGNGRLKTERALLAKALTDVQGMA AALTGYLMAAQQDVTSLYKVGLGSVRFLMSVGDLIIGWLLQRQAAVAVAALDAGATGD ERSFYEGKVAVASFFAKNFLPLLTSTREVIETLDNDIMELDEAAF" gene 296005..296493 /locus_tag="Rv0245" /db_xref="GeneID:886680" CDS 296005..296493 /locus_tag="Rv0245" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0245, (MTV034.11), len: 162 aa. Possible oxidoreductase (EC 1.-.-.-), equivalent to AL022486|MLCB1883_17|T44882 probable oxidoreductase from Mycobacterium leprae (162 aa), FASTA scores: opt: 860, E(): 0, (83.4% identity in 157 aa overlap). Also similar to several hypothetical proteins and various oxidoreductases e.g. AAK24246.1|AE005898 NADH:riboflavin 5'-phosphate oxidoreductase from Caulobacter crescentus (174 aa); Q02058|DIM6_STRCO|CAA45048.1 ACTINORHODIN POLYKETIDE DIMERASE from STREPTOMYCES COELICOLOR (177 aa), FASTA scores: opt: 308, E(): 3. 2e-15, (37.8% identity in 143 aa overlap). Also similar to Z84498|Rv1939|MTCY09F9.25c from Mycobacterium tuberculosis (171 aa), FASTA scores: opt: 517, E(): 3.5e-30, (49.4% identity in 158 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214759.1" /db_xref="GI:15607386" /db_xref="GeneID:886680" /translation="MNSTNNLTPSSLREAFGHFPTGVVAIAAEVDGVRQGLAASTFVP VSLEPPLVSFCVQNTSTTWPKLTGVPMLGISVLGEAHDAAVRTLAAKTGDRFAGLETV SNDAGAVFIKGTSVWLESAIEQLVPAGDHTIVVLRVNQVKVDPNVAPIVFHRSVLRRL GV" gene 296809..298119 /locus_tag="Rv0246" /db_xref="GeneID:886678" CDS 296809..298119 /locus_tag="Rv0246" /function="UNKNOWN" /note="Rv0246, (MTV034.12), len: 436 aa (start uncertain). Probable conserved integral membrane protein, similar to Rv2209|1237062|CAA94252.1|Z70283|Q10398|YM09_MYCTU from Mycobacterium tuberculosis (512 aa), FASTA scores: opt: 712, E(): 0, (33.2% identity in 422 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214760.1" /db_xref="GI:15607387" /db_xref="GeneID:886678" /translation="MAKTSHRVSSADGMSKRILRLIIAQSGFYSAALQLGNVSIVLPF VVAELDAELWIAALIFPAFTAGGAIGNVVAPPAVAAVPRRHRLFIIVSCLAVLAGVNA LCATIGKGSVAGILLVVNVTLIGVVSAISFVAFADLVAAMPSGTARARILLTEVGVGA ALTAVVAATLSFVPDQHPLSRNIHLLWTAAVAMAISAAICRALPHRIVPRVHAAPGLH KLVYVGWTAIRTNGWYRRYLLVQVLFGSVVLGSSFHSIRVAAVPGDQPDEVVAVVLFV CVGLLGGIALWNRVRERFGLVGLFVGSALVSIAAAVLSIAFDLAGAWPNVVAIGLVIA LVSIANQSVFTAGQLWIARDAEPGLRTSLISFGQLVINAGLVGMGLALGLIAQDHDAV WPVMIVLLLNLTAAYSATRFAPAKSVDVRGLPQVSRTSRPKTGG" gene complement(298116..298862) /locus_tag="Rv0247c" /db_xref="GeneID:886677" CDS complement(298116..298862) /locus_tag="Rv0247c" /EC_number="1.3.99.1" /function="INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (AEROBIC RESPIRATION) [CATALYTIC ACTIVITY: SUCCINATE + ACCEPTOR = FUMARATE + REDUCED ACCEPTOR]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the fumarate and succinate interconversion; fumarate reductase is used under anaerobic conditions with glucose or glycerol as carbon source" /codon_start=1 /transl_table=11 /product="fumarate reductase iron-sulfur subunit" /protein_id="NP_214761.1" /db_xref="GI:15607388" /db_xref="GeneID:886677" /translation="MTYSASMRVWRGDESCGELREFTVEVNEGEVVLDVILRLQQTQT PDLAVRWNCKAGKCGSCSAEINGKPRLMCMTRMSTFDEDEIVTVTPMRTFPVIRDLVT DVSFNYQKAREIPSFAPPKELQPSEYRMAQVDVARSQEFRKCIECFLCQNVCHVVRDH EENKDAFAGPRFLMRIAELEMHPLDTRDRRSQAQEEHGLGYCNITKCCTEVCPENIKI TDNALIPMKERVADRKYDPVVWLGSKLFRR" misc_feature complement(298680..298706) /locus_tag="Rv0247c" /note="PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature" gene complement(298863..300803) /gene="sdhA" /locus_tag="Rv0248c" /db_xref="GeneID:886675" CDS complement(298863..300803) /gene="sdhA" /locus_tag="Rv0248c" /EC_number="1.3.5.1" /function="INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (AEROBIC RESPIRATION) [CATALYTIC ACTIVITY: SUCCINATE + ACCEPTOR = FUMARATE + REDUCED ACCEPTOR]." /experiment="experimental evidence, no additional details recorded" /note="part of four member succinate dehydrogenase enzyme complex that forms a trimeric complex (trimer of tetramers); SdhA/B are the catalytic subcomplex and can exhibit succinate dehydrogenase activity in the absence of SdhC/D which are the membrane components and form cytochrome b556; SdhC binds ubiquinone; oxidizes succinate to fumarate while reducing ubiquinone to ubiquinol" /codon_start=1 /transl_table=11 /product="succinate dehydrogenase flavoprotein subunit" /protein_id="NP_214762.1" /db_xref="GI:15607389" /db_xref="GeneID:886675" /translation="MVEVERHSYDVVVIGAGGAGLRAVIEARERGLKVAVVCKSLFGK AHTVMAEGGCAAAMGNANPKDNWKTHFGDTMRGGKFLNNWRMAELHAKEAPDRVWELE TYGALFDRTDDGRISQRNFGGHTYPRLAHVGDRTGLELIRTLQQKVVSLQQEDHAELG DYEARIKVFAECTITELLKDQGAIAGAFGYWRESGRFIVFEAPAVVLATGGIGKSFKV TSNSWEYTGDGHALALRAGATLINMEFVQFHPTGMVWPPSVKGILVTEGVRGDGGVLK NSENSRFMFDYIPPVFKGQYAETEEEADQWLKDNDSARRTPDLLPRDEVARAINSEVK AGRGTPHGGVYLDIASRLTPAEIKRRLPSMYHQFKELAEVDITTQAMEVGPTCHYVMG GVEVDADTGAATVPGLFAAGECAGGMHGSNRLGGNSLSDLLVFGRRAGLGAADYVRAL SSRPAVSAEAIDAAAQQALSPFEGPKDGSAPENPYALHMDLQYVMNDLVGIIRNADEI SRALTLLAELWSRYHNVLVEGHRQYNPGWNLSIDLRNMLLVSECVARAALQRTESRGG HTRDDHPGMDPNWRRILLVCRATETMGTGGSGSGDSNCHINVTQQLQTPMRPDLLELF EISELEKYYTDEELAEHPGRRG" misc_feature complement(299583..299618) /gene="sdhA" /locus_tag="Rv0248c" /note="PS00141 Eukaryotic and viral aspartyl proteases active site" misc_feature complement(300159..300182) /gene="sdhA" /locus_tag="Rv0248c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(300834..301655) /locus_tag="Rv0249c" /db_xref="GeneID:886671" CDS complement(300834..301655) /locus_tag="Rv0249c" /function="COULD BE INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (AEROBIC RESPIRATION). THIS HYDROPHOBIC COMPONENT MAY BE REQUIRED TO ANCHOR THE CATALYTIC COMPONENTS OF THE SUCCINATE DEHYDROGENASE COMPLEX TO THE CYTOPLASMIC MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv0249c, (MTV034.15c), len: 273 aa. Probable succinate dehydrogenase, membrane-anchor subunit for succinate dehydrogenase encoded by Rv0247c and Rv0248c. Highly similar to AC44315.1|AL596043 putative integral membrane protein from Streptomyces coelicolor (278 aa). NOTE THAT SUCCINATE DEHYDROGENASE FORMS GENERALLY PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv0248c ?), AN IRON-SULFUR (Rv0247c ?), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv0249c ?)." /codon_start=1 /transl_table=11 /product="succinate dehydrogenase membrane anchor subunit" /protein_id="NP_214763.1" /db_xref="GI:15607390" /db_xref="GeneID:886671" /translation="MSAPTANRPAIGVFTPTRAQIPERTLRTDLWWLPPLLTNLGLLA FICYATTRAFWGSQYWVEKYHYLTPFYSPCVSASCQPGASHLGVWFGHFPGWIPLGAM VLPFLLGFRLTCYYYRKAYYRSVWQSPTSCAVPEPRAHYTGETRLPLIVQNTHRYFFY IAVVVSLINTYDAIAAFHSPSGFGFGLGNVILTINVVLLWAYTISCHSCRHATGGRLK HFSKHPVRYWIWTQVSKLNTRHMQFAWITLGTLALTDFYIMLVASGSITDLRFIG" gene complement(301735..302028) /locus_tag="Rv0250c" /db_xref="GeneID:886669" CDS complement(301735..302028) /locus_tag="Rv0250c" /function="UNKNOWN. POSSIBLY DOWN-REGULATED BY HSPR|Rv0353." /experiment="experimental evidence, no additional details recorded" /note="Rv0250c, (MTV034.16c), len: 97 aa. Conserved hypothetical protein, equivalent to MLCB1883.27c|T44883|3063888|CAA18576.1|AL022486 hypothetical protein from Mycobacterium leprae (98 aa), FASTA scores: opt: 478, E(): 4.4e-28, (72.6% identity in 95 aa overlap). Also similar to C-terminus of AC44316.1|AL596043|SCBAC31E11.05c hypothetical protein from Streptomyces coelicolor (146 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214764.1" /db_xref="GI:15607391" /db_xref="GeneID:886669" /translation="MSTTAELAELHDLVGGLRRCVTALKARFGDNPATRRIVIDADRI LTDIELLDTDVSELDLERAAVPQPSEKIAIPDTEYDREFWRDVDDEGVGGHRY" gene complement(302173..302652) /gene="hsp" /locus_tag="Rv0251c" /db_xref="GeneID:886667" CDS complement(302173..302652) /gene="hsp" /locus_tag="Rv0251c" /function="THOUGHT TO BE INVOLVED IN THE INITIATION STEP OF TRANSLATION AT HIGH TEMPERATURE. BOUNDED TO 30S RIBOSOMAL SUBUNIT. POSSIBLY A MOLECULAR CHAPERONE. SEEMS TO BE REGULATED POSITIVELY BY SIGE|Rv1221 AND NEGATIVELY BY HSPR|Rv0353." /experiment="experimental evidence, no additional details recorded" /note="Rv0251c, (MTV034.17c), len: 159 aa. hsp (alternate gene name: hsp20, hrpA, acr2), heat-stress-induced ribosome-binding protein A (see citations below). Highly similar to AAD39038.1|AF072875_1|AF072875 putative HSP20 from Mycobacterium smegmatis (145 aa), FASTA scores: opt: 479, E(): 2.3e-24, (59.9% identity in 157 aa overlap); and similar to many bacterial and eukaryotic hsp proteins e.g. P12811|HS2C_CHLRE CHLOROPLAST HEAT SHOCK 22KD PROTEIN from CHLAMYDOMONAS REINHARDTII (157 aa), FASTA scores: opt: 184, E(): 1.2e-05, (32.4% identity in 142 aa overlap). Also similar to PCC6803 Spore protein sp21 from Synechocystis sp. (146 aa), FASTA scores: opt: 213, E(): 1.2e-07, (30.3 identity in 145 aa overlap). Also similar to P30223|14KD_MYCTU 14 KDA ANTIGEN (16 KDA ANTIGEN) 19K major membrane protein (HSP 16.3) from Mycobacterium tuberculosis (144 aa). BELONG TO THE SMALL HEAT SHOCK PROTEIN (HSP20) FAMILY.; hsp 20; hrpA; acr2" /codon_start=1 /transl_table=11 /product="heat shock protein hsp (heat-stress-induced ribosome-binding protein A)" /protein_id="NP_214765.1" /db_xref="GI:15607392" /db_xref="GeneID:886667" /translation="MNNLALWSRPVWDVEPWDRWLRDFFGPAATTDWYRPVAGDFTPA AEIVKDGDDAVVRLELPGIDVDKDVNVELDPGQPVSRLVIRGEHRDEHTQDAGDKDGR TLREIRYGSFRRSFRLPAHVTSEAIAASYDAGVLTVRVAGAYKAPAETQAQRIAITK" gene 302866..305427 /gene="nirB" /locus_tag="Rv0252" /db_xref="GeneID:886665" CDS 302866..305427 /gene="nirB" /locus_tag="Rv0252" /function="INVOLVED IN NITRATE ASSIMILATION (DENITRIFICATION) (AT THE SECOND STEP). THIS ENZYME IS A FAD FLAVOPROTEIN THAT ALSO CONTAINS A SIROHEME AND ONE 2FE-2S IRON-SULFUR CENTER [CATALYTIC ACTIVITY: 3 NAD(P)H + nitrite = 3 NAD(P)+ + NH4OH + H2O.]" /note="Rv0252, (MTV034.18), len: 853 aa. Probable nirB (alternate gene name: nasB), nitrite reductase [NAD(P)H] large subunit (EC 1.6.6.4), flavoprotein containing siroheme and a 2FE-2S iron-sulfur centre. Highly similar to many others bacterial enzymes e.g. P08201|NIRB_ECOLI NITRITE REDUCTASE (NAD(P)H) LARGE SUBUNIT from Escherichia coli strain K12 (847 aa), FASTA scores: opt: 2775, E(): 0, (49.8% identity in 840 aa overlap); Q06458|NIRB_KLEPN NITRITE REDUCTASE (NAD(P)H) LARGE SUBUNIT (957 aa), FASTA scores: opt: 2902, E(): 0, (54.2% identity in 827 aa overlap). Contains PS00365 Nitrite and sulfite reductases iron-sulfur/siroheme-binding site. HOMODIMER WHICH ASSOCIATES WITH NIRD|Rv0253. COFACTORS: FAD; Iron; Siroheme. TBparse score is 0.903.; nasB" /codon_start=1 /transl_table=11 /product="nitrite reductase large subunit" /protein_id="NP_214766.1" /db_xref="GI:15607393" /db_xref="GeneID:886665" /translation="MPTAGSSRAPAAAREIVVVGHGMVGHRLVEAVRARDADGSLRIT VLAEEGDAAYDRVGLTSYTESWDRALLALPGNDYAGDQRVRLLLNTRVTQIDRATKSV VTAAGQRHRYDTLVLATGSYAFVPPVPGHDLPACHVYRTFDDLDAIRAGAQRTLDGGH TDGGVVIGGGLLGLEAANALRQFGLQTHVVEMMPRLMAQQIDEAGGALLARMIADLGI AVHVGTGTESIESVKHSDGSVWARVRLSDGEVIDAGVVIFAAGIRPRDELARAAGLAI GDRGGVLTDLSCRTSDPDIYAVGEVAAIDGRCYGLVGPGYTSAEVVADRLLDGSAEFP EADLSTKLKLLGVDVASFGDAMGATENCLEVVINDAVKRTYAKLVLSDDATTLLGGVL VGDASSYGVLRPMVGAELPGDPLALIAPAGSGAGAGALGVGALPDSAQICSCNNVTKG ELKCAIADGCGDVPALKSCTAAGTSCGSCVPLLKQLLEAEGVEQSKALCEHFSQSRAE LFEIITATEVRTFSGLLDRFGRGKGCDICKPVVASILASTGSDHILDGEQASLQDSND HFLANIQKNGSYSVVPRVPGGDIKPEHLILIGQIAQDFGLYTKITGGQRIDLFGARVD QLPLIWQRLVDGGMESGHAYGKAVRTVKSCVGSDWCRYGQQDSVQLAIDLELRYRGLR APHKIKLGVSGCARECAEARGKDVGVIATEKGWNLYVAGNGGMTPKHAQLLASDLDKE TLIRYIDRFLIYYIRTADRLQRTAPWVESLGLDHVREVVCEDSLGLAEEFEAAMQRHV ANYKCEWKGVLEDPDKLSRFVSFVNAPDAVDSTVTFTERAGRKVPVSIGIPRVRS" misc_feature 304939..304989 /gene="nirB" /locus_tag="Rv0252" /note="PS00365 Nitrite and sulfite reductases iron-sulfur/siroheme-binding site" gene 305453..305809 /gene="nirD" /locus_tag="Rv0253" /db_xref="GeneID:886664" CDS 305453..305809 /gene="nirD" /locus_tag="Rv0253" /function="INVOLVED IN NITRATE ASSIMILATION (DENITRIFICATION); REQUIRED FOR ACTIVITY OF THE REDUCTASE [CATALYTIC ACTIVITY: 3 NAD(P)H + nitrite = 3 NAD(P)+ + NH4OH + H2O.]" /note="Rv0253, (MTV034.19), len: 118 aa. Probable nirD, nitrite reductase [NAD(P)H] small subunit (EC 1.6.6.4), similar to others e.g. P23675|NIRD_ECOLI|B3366|Z4727|ECS4217 from Escherichia coli strains K12 and O157:H7 (108 aa), FASTA scores: opt: 271, E():1.7e-12, (41.9% identity in 105 aa overlap). ASSOCIATES WITH NIRB|Rv0252. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="nitrite reductase [NAD(P)H] small subunit NirD" /protein_id="NP_214767.1" /db_xref="GI:15607394" /db_xref="GeneID:886664" /translation="MTLLNDIQVWTTACAYDHLIPGRGVGVLLDDGSQVALFRLDDGS VHAVGNVDPFSGAAVMSRGIVGDRGGRAMVQSPILKQAFALDDGSCLDDPRVSVPVYP ARVTPEGRIQVARVAV" gene complement(305825..306349) /gene="cobU" /locus_tag="Rv0254c" /db_xref="GeneID:886661" CDS complement(305825..306349) /gene="cobU" /locus_tag="Rv0254c" /EC_number="2.-.-.-" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS." /note="Rv0254c, (MTV034.20), len: 174 aa. Probable cobU, cobalamin biosynthesis protein including a cobinamide kinase (EC 2.-.-.-) and cobinamide phosphate guanylyltransferase (EC 2.-.-.-). Highly similar to many e.g. Q05599|COBU_SALTY COBINAMIDE KINASE / COBINAMIDE PHOSPHATE GUANYLYLTRANSFERASE from Salmonella typhimurium (181 aa), FASTA scores: opt: 308, E(): 1.1e-14, (38.7% identity in 181 aa overlap); P46886|COBU_ECOLI|B1993|Z3153|ECS2788 Bifunctional cobalamin biosynthesis protein cobU from Escherichia coli strains K12 and O157:H7 (181 aa); part of AL096872|SC5F7_10 from Streptomyces coelicolor (397 aa), FASTA scores: opt: 445, E(): 3.6e-23, (46.0% identity in 176 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="bifunctional cobinamide kinase/cobinamide phosphate guanylyltransferase" /protein_id="NP_214768.1" /db_xref="GI:15607395" /db_xref="GeneID:886661" /translation="MRILVTGGVRSGKSTHAEALLGDAADVVYVAPGRPAAGSDPDWD ARVALHRARRPPTWLTVETADVATALSEARSPVLVDCLGTWLTAIMDGEALWSAATAD VYAVLEARLDGLCAALTGLPTAIVVTNEVGLGVVPSHSSGVLFRDLLGTINRRVAAVC DEVHLVIAGRVLKL" misc_feature complement(306308..306331) /gene="cobU" /locus_tag="Rv0254c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(306374..307858) /gene="cobQ1" /locus_tag="Rv0255c" /db_xref="GeneID:886673" CDS complement(306374..307858) /gene="cobQ1" /locus_tag="Rv0255c" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS" /note="catalyzes amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide. NH(2) groups are provided by glutamine, and one molecule of ATP is hydrogenolyzed for each amidation" /codon_start=1 /transl_table=11 /product="cobyric acid synthase" /protein_id="YP_177703.1" /db_xref="GI:57116707" /db_xref="GeneID:886673" /translation="MSGLLVAGTTSDAGKSAVTAGLCRALARRGVRVAPFKAQNMSNN SMVCRGPDGTGVEIGRAQWVQALAARTTPEAAMNPVLLKPASDHRSHVVLMGKPWGEV ASSSWCAGRRALAEAACRAFDALAARYDVVVAEGAGSPAEINLRAGDYVNMGLARHAG LPTIVVGDIDRGGVFAAFLGTVALLAAEDQALVAGFVVNKFRGDSDLLAPGLRDLERV TGRRVYGTLPWHPDLWLDSEDALDLQGRRAAGTGARRVAVVRLPRISNFTDVDALGLE PDLDVVFASDPRALDDADLIVLPGTRATIADLAWLRARDLDRALLVHVAAGKPLLGIC GGFQMLGRVIRDPYGIEGPGGQVTEVEGLGLLDVETAFSPHKVLRLPRGEGLGVPASG YEIHHGRITRGDTAEEFLGGARDGPVFGTMWHGSLEGDALREAFLRETLGLAPSGSCF LAARERRLDLLGDLVERHLDVDALLNLARHGCPPTLPFLAPGAP" gene complement(307877..309547) /gene="PPE2" /locus_tag="Rv0256c" /db_xref="GeneID:886684" CDS complement(307877..309547) /gene="PPE2" /locus_tag="Rv0256c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0256c, (MTV034.22c), len: 556 aa. Member of the M. tuberculosis PPE family, similar to many e.g. Rv0280, Rv0286, etc. Equivalent to Z98756|MLCB2492.30 from Mycobacterium leprae (572 aa), FASTA scores: opt: 1837, E(): 0, (62.9% identity in 461 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177704.1" /db_xref="GI:57116708" /db_xref="GeneID:886684" /translation="MTAPIWMASPPEVHSALLSSGPGPGPLLVSAEGWHSLSIAYAET ADELAALLAAVQAGTWDGPTAAVYVAAHTPYLAWLVQASANSAAMATRQETAATAYGT ALAAMPTLAELGANHALHGVLMATNFFGINTIPIALNESDYARMWIQAATTMASYQAV STAAVAAAPQTTPAPQIVKANAPTAASDEPNQVQEWLQWLQKIGYTDFYNNVIQPFIN WLTNLPFLQAMFSGFDPWLPSLGNPLTFLSPANIAFALGYPMDIGSYVAFLSQTFAFI GADLAAAFASGNPATIAFTLMFTTVEAIGTIITDTIALVKTLLEQTLALLPAALPLLA APLAPLTLAPASAAGGFAGLSGLAGLVGIPPSAPPVIPPVAAIAPSIPTPTPTPAPAP APTAVTAPTPPPGPPPPPVTAPPPVTGAGIQSFGYLVGDLNSAAQARKAVGTGVRKKT PEPDSAEAPAAAAAPEEQVQPQRRRRPKIKQLGRGYEYLDLDPETGHDPTGSPQGAGT LGFAGTTHKASPGQVAGLITLPNDAFGGSPRTPMMPGTWDTDSATRVE" gene 309699..310073 /locus_tag="Rv0257" /db_xref="GeneID:3205110" CDS 309699..310073 /locus_tag="Rv0257" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0257, len: 124 aa. Hypothetical protein, orthologue of ML1828A conserved hypothetical protein from Mycobacterium leprae. Replaced Rv0257c (older annotation)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177620.1" /db_xref="GI:57116709" /db_xref="GeneID:3205110" /translation="MTRVSWLPDRCLPRLPACGRGLRGSLPGDSGGTAPDSHRLPASS SPDGKNIGMQSVDLHVERHLPSRGRSHRTVATVTCVTALGDIRSAQLSATGAWPAVLF PSWSWLCGIGGGVDLQKPSCRA" gene complement(310294..310749) /locus_tag="Rv0258c" /db_xref="GeneID:886654" CDS complement(310294..310749) /locus_tag="Rv0258c" /function="UNKNOWN" /note="Rv0258c, (MTCY06A4.02c), len: 151 aa (alternative start possible). Conserved hypothetical protein, showing some similarity to Rv1685c|MTCI125_6 from Mycobacterium tuberculosis (207 aa), FASTA scores: E(): 9.3e-07, (32.1% identity in 140 aa overlap). Also some similarity with AL049819|SCE7_13|T36295 probable transcription regulator from Streptomyces coelicolor (204 aa), FASTA scores: opt: 158, E(): 0.00052, (27.0% identity in 111 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214772.1" /db_xref="GI:15607399" /db_xref="GeneID:886654" /translation="MARSQEPSRGLLDPVAKMLRLPFGTPDFIEKIVTGSVNQVGRRT LYVLITTWDAAGGGPFAASAIATTGLAKTAEIVQSMFIGPVFNPLLKMLGADKIAIRA SLCAAQLVGLGIMRYGVRSEPLHSMSVEMLVDAIGPTMQRYLVGDIGRG" gene complement(310774..311517) /locus_tag="Rv0259c" /db_xref="GeneID:886657" CDS complement(310774..311517) /locus_tag="Rv0259c" /function="UNKNOWN" /note="Rv0259c, (MTCY06A4.03c), len: 247 aa. Conserved hypothetical protein, showing some similarity to Rv2393|Z81368|MTCY253_28 from Mycobacterium tuberculosis (281 aa), FASTA scores: E(): 9.5e-16, (33.6 % identity in 235 aa overlap). Also some similarity with CAC33938.1|AL589708 putative secreted protein from Streptomyces coelicolor (248 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214773.1" /db_xref="GI:15607400" /db_xref="GeneID:886657" /translation="MNLILTAHGTRRPSGVAMIADIAAQVSALVDRTVQVAFVDVLGP SPSEVLSALSCRPAIVVPAFLSRGYHVRTDLPAHVAASAHPHVTVTPALGPCREIAQI VTQQLVESGWRPGDSVILAAAGASDRRARADLHTTRTLVSELTGSWVDMGFAGTGGPD VRTAVQRARDRAEANRGARRVAVASFLLAEGLFQERLRASGADVVTRPLGTHPGLAQL VANRFRSAVARQQRLHRWHGTPTPVTLDL" gene complement(311514..312659) /locus_tag="Rv0260c" /db_xref="GeneID:886651" CDS complement(311514..312659) /locus_tag="Rv0260c" /EC_number="4.2.1.75" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="catalyzes the formation of uroporphyrinogen-III from hydroxymethylbilane" /codon_start=1 /transl_table=11 /product="bifunctional uroporphyrinogen-III synthetase/response regulator domain protein" /protein_id="NP_214774.1" /db_xref="GI:15607401" /db_xref="GeneID:886651" /translation="MAQAHSAPLTGYRIAVTSARRAEELCALLRRQGAEVCSAPAIKM IALPDDDELQNNTEALIADPPDILVAHTGIGFRGWLAAAEGWGLANELLESLSSARII SRGPKATGALRAAGLREEWSPDSESSHEVLEYLLESGVSRTRIAVQLHGAADSWDPFP EFLGGLRFAGAQVVPIRVYRWKPAPLGGVFDHLVTGIARRQFDAVTFTSAPAAAAVLE RSRELDIEDQLLAALRTDVHAMCVGPVTSRPLIRKGVPTSAPERMRLGALARHIAEEL PLLGSCTFKAAGHVIEIRGTSVLVDDSVKPLSPSGMAILRALVHRPGGVVSRGDLLRV LPGDGSDTHAVDTAVLRLRTALGDKNIVATVVKRGYRLAVDSRHDDV" gene complement(312759..314168) /gene="narK3" /locus_tag="Rv0261c" /db_xref="GeneID:886663" CDS complement(312759..314168) /gene="narK3" /locus_tag="Rv0261c" /function="INVOLVED IN EXCRETION OF NITRITE PRODUCED BY THE DISSIMILATORY REDUCTION OF NITRATE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0261c, (MTCY06A4.05c), len: 469 aa. Probable nirK3, nitrite extrusion protein, integral membrane protein possibly member of major facilitator superfamily (MFS), equivalent to AAB41700.1|U72744 nitrite extrusion protein from Mycobacterium fortuitum (471 aa); and 2342627|CAB11406.1|Z98741|T44908 nitrite extrusion protein homolog from Mycobacterium leprae (517 aa; longer in N-terminus). Also similar to other nitrite extrusion proteins e.g. NARK_ECOLI|P10903|B1223 nitrite extrusion protein 1 from Escherichia coli strain K12 (463 aa), FASTA scores: opt: 755, E(): 0, (35.0% identity in 466 aa overlap). BELONGS TO THE NARK/NASA FAMILY OF TRANSPORTERS." /codon_start=1 /transl_table=11 /product="integral membrane nitrite extrusion protein NarK3" /protein_id="NP_214775.1" /db_xref="GI:15607402" /db_xref="GeneID:886663" /translation="MGRSHQISDWDPEDSVAWEAGNKFIARRNLIWSVAAEHVGFSVW SLWSVMVLFMPTSVYGFSAGDKFLLGATATLVGACLRFPYTFATAKFGGRNWTIFSAL VLLIPTVGSILLLANPGLPLWPYLVCGALAGLGGGNFAASMTNINAFFPQRLKGAALA LNAGGGNLGVPMVQLVGLLVIATAGDREPYWVCAIYLVLLAVAGLGAALYMDNLTEYR IELNTMRAVVSEPHTWVISLLYIGTFGSFIGFSFAFGQVLQINFIASGQSTAQASLHA AQIAFLGPLLGSLSRIYGGKLADRIGGGRVTLAAFCAMLLATGILISASTFGDHLAGP MPTATMVGYVIGFTALFILSGIGNGSVYKMIPSIFEARSHSLQISEAERRQWSRSMSG ALIGLAGAVGALGGVGVNLALRESYLTSGTATSAFWAFGVFYLVASVLTWAIYVRRGL KSAGELVPATTAPAGLAYV" gene complement(314309..314854) /gene="aac" /locus_tag="Rv0262c" /db_xref="GeneID:886648" CDS complement(314309..314854) /gene="aac" /locus_tag="Rv0262c" /EC_number="2.3.1.-" /function="CONFERS RESISTANCE TO AMINOGLYCOSIDES (GENTAMICIN, TOBRAMYCIN, DIBEKACIN, NETILMICIN, AND 6'-N-ETHYLNETILMICIN)." /experiment="experimental evidence, no additional details recorded" /note="Rv0262c, (MTCY06A4.06c), len: 181 aa. aac, aminoglycoside 2'-N-acetyltransferase (aac(2')-IC) (EC 2.3.1.-) (see citation below), highly similar to NP_302635.1|NC_002677 aminoglycoside 2'-N-acetyltransferase from Mycobacterium leprae (182 aa); Q49157|AAC2_MYCFO|AAC aminoglycoside 2'-N-acetyltransferase from Mycobacterium fortuitum (195 aa), FASTA scores: opt: 884, E(): 0, (69.1% identity in 181 aa overlap); and P94968|AAC2_MYCSM|AAC aminoglycoside 2'-N-acetyltransferase from Mycobacterium smegmatis (210 aa) (see also citation below). Also similar to Q52424|AAC2_PROST AMINOGLYCOSIDE 2'-N-ACETYLTRANSFERASE from Providencia stuartii (178 aa). BELONGS TO THE AAC(2')-I FAMILY OF ACETYLTRANSFERASES. Note that previously known as aac(2')-IC.; aac(2')-IC" /codon_start=1 /transl_table=11 /product="aminoglycoside 2'-N-acetyltransferase AAC (AAC(2')-IC)" /protein_id="NP_214776.1" /db_xref="GI:15607403" /db_xref="GeneID:886648" /translation="MHTQVHTARLVHTADLDSETRQDIRQMVTGAFAGDFTETDWEHT LGGMHALIWHHGAIIAHAAVIQRRLIYRGNALRCGYVEGVAVRADWRGQRLVSALLDA VEQVMRGAYQLGALSSSARARRLYASRGWLPWHGPTSVLAPTGPVRTPDDDGTVFVLP IDISLDTSAELMCDWRAGDVW" gene complement(314864..315766) /locus_tag="Rv0263c" /db_xref="GeneID:886659" CDS complement(314864..315766) /locus_tag="Rv0263c" /function="UNKNOWN" /note="Rv0263c, (MTCY06A4.07c), len: 300 aa. Conserved hypothetical protein, equivalent to NP_302634.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (305 aa). Also similar to others e.g. AL121596|SC51A_21 hypothetical protein from Streptomyces coelicolor (285 aa), FASTA scores: opt: 714, E(): 0, (45.3% identity in 289 aa overlap); NP_233164.1|NC_002506 conserved hypothetical protein from Vibrio cholerae (309 aa); NP_406216.1|NC_003143 conserved hypothetical protein from Yersinia pestis (316 aa); YH30_HAEIN|P44298|hi1730 hypothetical protein from Haemophilus influenzae (309 aa), FASTA scores: opt: 430, E(): 3e-20, (29.6% identity in 284 aa overlap); etc. Also similar to carboxylases eg NP_415240.1|NC_000913|P75745|YBGK_ECOLI putative carboxylase from Escherichia coli strain K12 (310 aa), FASTA score: (34.6% identity in 286 aa overlap); NP_459698.1|NC_003197 putative carboxylase from Salmonella typhimurium (310 aa); and to middle part of NP_420636.1|NC_002696 urea amidolyase-related protein from Caulobacter crescentus (1207 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214777.1" /db_xref="GI:15607404" /db_xref="GeneID:886659" /translation="MTTLEILRSGPLALVEDLGRAGLAHLGVGRSGAADRRSHTLANR LVANPDDWATVEVTFGGFSARVRGGDVDIAVTGADTDPTVNGIMVGTNSIHHVRDGQV ISLGTPRAGLRTYLAVRGGVCVEPVLGSRSYDVMSAIGPSPLRAGDVLPVGEHTDDYP ELDQAPVAAIEEHLVELRVVPGPRDDWLVDPDALVHTIWMASNRSDRVGMRLQGRPLQ HRWPDRQLPGEGVTRGAIQVPPNGLPVILGPDHPITGSYPVVGVITDEDIDKVAQIRP GQYVRLHWARPRSRLPGQGVTQAW" gene complement(315783..316415) /locus_tag="Rv0264c" /db_xref="GeneID:886646" CDS complement(315783..316415) /locus_tag="Rv0264c" /function="UNKNOWN" /note="Rv0264c, (MTCY06A4.08c), len: 320 aa. Conserved hypothetical protein, equivalent to CAC32080.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (222 aa). Also similar to others hypothetical proteins e.g. AL121596|SC51A_20 from Streptomyces coelicolor (252 aa), FASTA scores: opt: 420, E(): 2.7e-20, (41.7% identity in 204 aa overlap); P75744|YBGJ_ECOLI HYPOTHETICAL 23.9 KD PROTEIN from Escherichia coli (218 aa), FASTA scores: E(): 2.1e-14, (35.7% identity in 182 aa overlap); YH31_HAEIN|P44299|hi173 hypothetical protein from Haemophilus influenzae (213 aa), FASTA scores: opt: 252, E(): 8.3e-10, (31.1% identity in 183 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214778.1" /db_xref="GI:15607405" /db_xref="GeneID:886646" /translation="MDAALACTVLDYGDHALMLQCDSTADAMAWTDALRAAALPGVVD IVAASRTVLVKLDAPRYQGVTRQRLRRLRVTPEAVAAADHRCDLVIDVVYDGPDLAEV ARCTGLTTAAVINAHTATGWRAGFSGSAPGFAYLIDGDPSLRVPRRPERRTSMPPGSV ALADGFSAIYPSQAPSDWQIIGHTDAVLWDVDRPQPALLTPGMWVQFRAA" gene complement(316511..317503) /locus_tag="Rv0265c" /db_xref="GeneID:886650" CDS complement(316511..317503) /locus_tag="Rv0265c" /function="THOUGHT TO BE INVOLVED IN IRON TRANSPORT ACROSS THE MEMBRANE (IMPORT)." /note="Rv0265c, (MTCY06A4.09c), len: 330 aa. Probable iron-transport lipoprotein, most similar to T36412|5763945|CAB53324.1|AL109974 probable iron-siderophore binding lipoprotein from Streptomyces coelicolor (350 aa); and (N-terminus may be incorrect) to T14166|3560508|AAC82551.1|AF027770 fxuD protein from Mycobacterium smegmatis (420 aa), FASTA scores: opt: 385, E(): 1.5e-16, (32.3% identity in 232 aa overlap). Also similar to AAB97475.1|U02617 DtxR/iron regulated lipoprotein precursor from Corynebacterium diphtheriae (355 aa); FECB_ECOLI|P15028 iron(III) dicitrate-binding periplasmic protein (300 aa), FASTA scores: opt: 191, E(): 2.3e-05, (26.5% identity in 196 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Note that previously known as fecB2.; fecB2" /codon_start=1 /transl_table=11 /product="periplasmic IRON-transport lipoprotein" /protein_id="YP_177705.1" /db_xref="GI:57116710" /db_xref="GeneID:886650" /translation="MRQGCSRRGFLQVAEAAAATGLFAGCSSPKPPPGTPGGAAVTIT HLFGQTVIKEPPKRVVSAGYTEQDDLLAVDVVPIAVTDWFGDQPFAVWPWAAPKLGGA RPAVLNLDNGIQIDRIAALKPDLIVAINAGVDADTYQQLSAIAPTVAQSGGDAFFEPW KDQARSIGQAVFAADRMRSLIEAVDQKFAAVAQRHPRWRGKKALLLQGRLWQGNVVAT LAGWRTDFLNDMGLVIADSIKPFAVDQRGVIPRDHIKAVLDAADVLIWMTESPEDEKA LLADPEIAASQATAQRRHIFTSKEQAGAIAFSSVLSYPVVAEQLPPQISQILGA" misc_feature complement(317426..317458) /locus_tag="Rv0265c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(317525..321154) /gene="oplA" /locus_tag="Rv0266c" /db_xref="GeneID:886642" CDS complement(317525..321154) /gene="oplA" /locus_tag="Rv0266c" /EC_number="3.5.2.9" /function="CATALYZES THE CLEAVAGE OF 5-OXO-L-PROLINE TO FORM L-GLUTAMATE COUPLED TO THE HYDROLYSIS OF ATP TO ADP AND INORGANIC PHOSPHATE [CATALYTIC ACTIVITY: ATP + 5-oxo-L-proline + 2 H2O = ADP + phosphate + L-glutamate]." /note="Rv0266c, (MTCY06A4.10c), len: 1209 aa. Probable oplA, 5-oxoprolinase (EC 3.5.2.9), highly similar to others or to hypothetical proteins e.g. AAK24340.1|AE005906 hydantoinase/oxoprolinase from Caulobacter crescentus (1196 aa); NP_103129.1|14022305|BAB48915.1|AP002997 5-oxoprolinase from Mesorhizobium loti (1210 aa); CAC48426.1|AL603642 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (1205 aa); S77037|slr0697|1006579|BAA10729.1|D6400 HYPOTHETICAL PROTEIN from Synechocystis sp. strain PCC 6803 (1252 aa), FASTA scores: opt: 2016, E(): 0, (51.4% identity in 1247 aa overlap); P97608|OPLA_RAT|T42756|11278797 5-OXOPROLINASE (5-OXO-L-PROLINASE) (PYROGLUTAMASE) (5-OPASE) from Rattus norvegicus (1288 aa); etc. BELONGS TO THE OXOPROLINASE FAMILY." /codon_start=1 /transl_table=11 /product="5-oxoprolinase" /protein_id="NP_214780.1" /db_xref="GI:15607407" /db_xref="GeneID:886642" /translation="MVGAGWHFWVDRGGTFTDVVARRPDGRLLTHKLLSDNPARYRDA AVAGIRALLANGEAGTRVDAVRMGTTVATNALLERTGERTLLVITRGFGDALRIAYQN RPRIFDRRIVLPEMLYERVVEVDERVTADGRVLRAPDLEALGEKMRQAHADGIRAVAV VCLHSYLYPGHEREIGTLAQRIGFAQISLSSEVSPLMKLVPRGDTTVVDAYLSPVLRR YINQVADQMRGVRLMFMQSNGGLAQAGHFRGKDAILSGPAGGIVGMVRMSALAGFDHV IGFDMGGTSTDVSHYAGEYERVFTTQVAGVRLRAPMLDIHTVAAGGGSILHFDGSRYR VGPDSAGADPGPACYRGGGPLCVTDANVMLGRIQPTHFPSVFGPSGDQPLDAGTVRRG FTDLAADIAARTGDDRSPEQVAEGYLRIAVANMANAVKKISVQKGHDVTRYALTTFGG AGGQHACAVADALGIRTVLIPPMAGVLSALGIGLADTTAMREQSVEIPLGPAAPQRLA SVAESLERAARAELLDEGVPGERIRVVRRVHLRYEGTDTAIPVQLAEIETMATAFESS HRALYTFLLDRPLIAEAISVEATGLTDQPDLSQLGDQANDTTGSSETVRIYSNGLWRD APLRRREAMRPGDVLTGPAIIAEANATTVVDDGWQATMTETGHLLAQRVVTPPRPDAA TRAGFEAGFEADPVLLEIFNNLFMSIAEQMGFRLEATAQSVNIRERLDFSCALFDPDG NLVANAPHIPVHLGSMGTTVKEVIRRRLSGMKPGDVYAVNDPYHGGTHLPDITVITPV FNTGGEDVLFFVASRGHHAEIGGITPGSMPADSREIHEEGVLFDNWLLAENGRFREAE TRRLLTEAPFGSRNPDTNLADLRAQIAANQKGVDEVGKMIDHFGRDVVAAYMRHVQDN AEEAVRRVIDRLDNGAYRYRMDSGATIAVRITVDRAARSATIDFTGTSAQLDTNFNAP TSVVNAAVLYVFRTLVADDIPLNDGCLRPLRIVVPEGSMLAPTHPAAVVAGNVETSQA ITGALFAALGVQAEGSGTMNNVTFGNERHQYYETVGSGSGAGDGYHGASVVQTHMTNS RLTDPEVLEWRYPVLLREFAVRQGSGGAGRWRGGDGAVRRLEFTEPMTVSTLSGHRRV RPYGMAGGSPGELGRNRVERADGSTVELAGCGSTHVEPGDTLVIETPGGGGYGPASTS ARRRR" gene 321331..322722 /gene="narU" /locus_tag="Rv0267" /db_xref="GeneID:886644" CDS 321331..322722 /gene="narU" /locus_tag="Rv0267" /function="INVOLVED IN EXCRETION OF NITRITE PRODUCED BY THE DISSIMILATORY REDUCTION OF NITRATE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0267, (MTCY06A4.11), len: 463 aa. Probable narU, nitrite extrusion protein, integral membrane protein possibly member of major facilitator superfamily (MFS), similar to other nitrite extrusion proteins e.g. NARU_ECOLI|P37758 nitrite extrusion protein 2 from Escherichia coli (462 aa), FASTA scores: opt: 630, E(): 4.4e-33, (38.9% identity in 463 aa overlap); and NARK_ECOLI|P10903|B1223 nitrite extrusion protein 1 from Escherichia coli strain K12 (463 aa), FASTA scores: opt: 607, E(): 1.3e-31, (42.0% identity in 457 aa overlap). Also similar to Rv0261c, Rv2329c, Rv1737c, and to MLCB22_25 from Mycobacterium leprae (517 aa), FASTA score: (35.1 identity in 459 aa overlap). BELONGS TO THE NARK/NASA FAMILY OF TRANSPORTERS." /codon_start=1 /transl_table=11 /product="integral membrane nitrite extrusion protein NarU" /protein_id="NP_214781.1" /db_xref="GI:15607408" /db_xref="GeneID:886644" /translation="MALTTAPAIDYALPRQQDEGDHWIDDWRPEDPVFWETIGRPIAR RNLIFSIFAEHVGFSVWMLWSIVVVQMTAAAPGHPAASGWALSASQALCLVAVPSGVG AFLRLPYTFAIPIFGGRNWTTVSAALLVIPCLLLAWAVSHPSLPFAVLVVIAATAGFG GGNFASSMANISFFYPEKDKGWALGLNAAGGNIGVAVVQKIIPPIVVAGSGVALSRAG LFFVPLAVAAAVCAFLFMNNLTEAKADVKPVWQSLRHADTWIMSLLYIGTFGSFIGYS AAFPTLLKTVFGRGDIALGWAFLGAGIGSLVRPLGGKLADRIGGARITAASFVMLAAG AAAALWSVQSVNLPVFFVSFMFLFVATGIGNGSSYRMISRIFQVKGEVAGGDPETMVN MRRQAAGALGIISSIGAFGGFVVPLAYAWSKVHFGNIEPALHFYVAFFLALLVVTWYC YLRRTTPMGQVGV" gene complement(322764..323273) /locus_tag="Rv0268c" /db_xref="GeneID:886647" CDS complement(322764..323273) /locus_tag="Rv0268c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0268c, (MTCY06A4.12c), len: 169 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214782.1" /db_xref="GI:15607409" /db_xref="GeneID:886647" /translation="MGTRSKSRTRQLKQSNGCTATTSGASDRRRRARRRTAPAWLRED EWLRHHLPHPPRQLSRCLHRRRRSACHHRYSRRTPKGGLPMTSSLVPISEARAHLSRL VRESADDDVVLMNHGRPAAILISAERYESLMEELEDLRDRLSVHEREHVTMPLDKLGA ELGVDIGRV" gene complement(323338..324531) /locus_tag="Rv0269c" /db_xref="GeneID:886640" CDS complement(323338..324531) /locus_tag="Rv0269c" /function="UNKNOWN" /note="Rv0269c, (MTCY06A4.13c), len: 397 aa. Conserved hypothetical protein, highly similar to AL079355|SC4C6_19 hypothetical protein from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1019, E(): 0, (46.5% identity in 344 aa overlap), and similar to other proteins e.g. CAC49016.1|AL603644 putative ATP-dependent DNA ligase protein from Sinorhizobium meliloti (636 aa); O34398 YKOU PROTEIN from Bacillus subtilis (611 aa), FASTA score: (27.2% identity in 283 aa overlap). Also similar to proteins from Mycobacterium tuberculosis e.g. Rv3062, Rv3731 (both DNA ligases), and Rv0938, Rv3730c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214783.1" /db_xref="GI:15607410" /db_xref="GeneID:886640" /translation="MSRMAAPVSLDVHGRQVIVTHPGRVVFPAHNDRKGYTKFDLVRY YLAVAEGAMRGVAGRPMILKRFVKGISAEAVFQKRAPANRPDWVDVAELHYASGRSAA EAVIHDAAGLAWVINLGCVDLNPHPVLAGDLDHPDELRVDLDPMPGVAWQRVVEVALV VREVLEDYGLTAWPKTSGSRGFHVYARIAPCWSFPQVRLAAQTVAREVERRLPDAATS RWWKEEREGVFVDFNQNAKDRTVASAYSVRATPDARVSTPLHWEEVPGCDPAVFTMAT VPSRLADIGDPWAGMDDAVGRLDRLLMLAEELGPPQKAQSAKPLIEIARAKTRAEAMA ALDIWRDRYPGAAALLRPADVLVDGMRGPSSIWYRIRINLQHVPADQRPPQEELIADY SPWPR" gene 324567..326249 /gene="fadD2" /locus_tag="Rv0270" /db_xref="GeneID:886637" CDS 324567..326249 /gene="fadD2" /locus_tag="Rv0270" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_214784.1" /db_xref="GI:15607411" /db_xref="GeneID:886637" /translation="MPNLTDLPGQAVSKLQKSIGQYVARGTAELHYLRKIIESGAIGL EPPLNYAALAADIRKWGEVGMLPSHNARRAPNRAAVIDEEGTLTFSELDEAAHAVANG LLAKGVRAGDGVAILARNHRWFVIANYGAARVGARIILLNSEFSGPQIKEVSDREGAK VIIYDDEYTKAVSLAQPPLGKLRALGVNPDDDKPSGSSDETLAELIAHSSTAPAPKAS RRASIIILTSGTTGTPKGANRNTPPTLAPIGGILSHVPFKAGEVTLLPSPMFHALGYM HAALAMFLGSTLVLRRRFKPALVLEDIEKHKATSMVVVPVMLSRILDQLEKTEPKPDL SSLKIVFVSGSQLGAELATRALGDLGPVIYNMYGSTEVAFATIAGPKDLQFNPSTVGP VVKGVTVKILDENGNEVPQGAVGRIFVGNAFPFEGYTGGGGKQIIDGLLSSGDVGYFD ERGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEAAAIGVDDKEFGARLRAFV VKKPGADLDEDTIKQYVRDHLARYKVPREVIFLDELPRNPTGKVLKRELRKL" misc_feature 325236..325271 /gene="fadD2" /locus_tag="Rv0270" /note="PS00455 Putative AMP-binding domain signature" gene complement(326266..328461) /gene="fadE6" /locus_tag="Rv0271c" /db_xref="GeneID:886641" CDS complement(326266..328461) /gene="fadE6" /locus_tag="Rv0271c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0271c, (MTCY06A4.15c), len: 731 aa. Probable fadE6, acyl-CoA dehydrogenase (EC 1.3.99.-), with C-terminal half similar to many e.g. ACDS_HUMAN|P16219 acyl-CoA dehydrogenase (short-chain) from Homo sapiens (412 aa), FASTA scores: opt: 339, E(): 1.3e-13, (28.1% identity in 288 aa overlap)." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE6" /protein_id="NP_214785.1" /db_xref="GI:15607412" /db_xref="GeneID:886641" /translation="MSIAITPEHYELADSVRSLVARVAPSEVLHAALESPVENPPPYW QAAAEQGLQGVHLAESVGGQGFGILELAVVLAEFGYGAVPGPFVPSAIASALIAAHDP QAKVLAELATGAAIAAYALDSGLTATRHGDVLVIRGEVRAVPAAAQASVLVLPVAIES RDEWVVLRNDQLEIEAVKSLDPLRPIAHVRANAVDVSDDALLSNLTMTTAHALMSTLL SAEAVGVARWATDTASAYAKIREQFGRPIGQFQAIKHKCAEMIADTERATAAVWDAAR ALDDAGESSSDVEFAAAVAATLAPATAQRCTQDCIQVHGGIGFTWEHDTNVYYRRALM LAACFGRGSEYPQRVVDTATTAGMRPVDIDLDPSTEKLRAQIRAEVAALKAMPREPRT VAIAEGGWVLPYLPKPWGRAASPVEQIIIAQEFTAGRVKRPQIAIATWIVPSIVAFGT DNQKQRLLPPTFRGDIFWCQLFSEPGAGSDLASLATKATRVDGGWRITGQKIWTTGAQ YSQWGALLARTDPSAPKHNGITYFLLDMKSEGVQVKPLRELTGKEFFNTVYLDDVFVP DELVLGEVNRGWEVSRNTLTAERVSIGGSDSTFLPTLGEFVDFVRDYRFEGQFDQVAR HRAGQLIAEGHATKLLNLRSTLLTLAGGDPMAPAAISKLLSMRTGQGYAEFAVSSFGT DAVIGDTERLPGKWGEYLLASRATTIYGGTSEVQLNIIAERLLGLPRDP" gene complement(328575..329708) /locus_tag="Rv0272c" /db_xref="GeneID:886635" CDS complement(328575..329708) /locus_tag="Rv0272c" /function="UNKNOWN" /note="Rv0272c, (MTCY06A4.16c), len: 377 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214786.1" /db_xref="GI:15607413" /db_xref="GeneID:886635" /translation="MTGRAATPGVIREFVGLPSRTAGRAAAGGHPCQGLYHHSVGRKP KVALIAAHYQIDFSEHYLAEYMAIRGIGFLGWNTRFRGFESSFLLDHALVDIGVGVRW LREVQGVETVVLLGNSGGGSLMAAYQSQAVDPNVTPLDGMRPAAGVTELPAADAYVAA AAHPGRPDVLTAWMDAAVIDENDPVATDPELDLFDERNGPPYSPEFISRYRSAQVKRN HTITDWAESELKRVRAAGFSDRPFSVMRTWADPRMVDPSIEPTKRRPNQCYAGTPVKA NRSAHGIAAACTLRGWLGMWSLRVAQTRAAPHLARITCPALVLNAEADTGIFPSDAQQ IYDGLASSDKTQVSIDTDHYFTTPGARSEQADTIAKWIAKRWR" gene complement(329705..330325) /locus_tag="Rv0273c" /db_xref="GeneID:886633" CDS complement(329705..330325) /locus_tag="Rv0273c" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0273c, (MTV035.01c), len: 206 aa (start uncertain). Possible transcriptional regulator, showing some similarity to hypothetical regulators from Mycobacterium tuberculosis e.g. P96222|Rv3855|MTCY01A6.13c (216 aa); O08377|Rv1534|MTCY07A7A.03 (225 aa), FASTA scores: opt: 123, E(): 3.2e-06, (28.5% identity in 172 aa overlap). TBparse score is 0.945." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214787.1" /db_xref="GI:15607414" /db_xref="GeneID:886633" /translation="MPDFPTQRGRRTQAAIDAAARTVVVRNGILATTVADITAEAGRS AASFYNYYDSKEAMVRQWALRFRDDANQRALSVIRHGLSDRERAYEAAAAHWYTYRNR LAEAISVSQLAMVSDDFAQYWSEICQIPISFITETVKRAQAHGYCVGDDPQLMAEAIV AMFNQFCYLQLSGKRSRRGQPDDQACIQTLANIYYRAIYSKEDSSN" gene 330422..331003 /locus_tag="Rv0274" /db_xref="GeneID:886631" CDS 330422..331003 /locus_tag="Rv0274" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0274, (MTV035.02), len: 193 aa. Conserved hypothetical protein, highly similar to AAK25058.1|AE005973 conserved hypothetical protein from Caulobacter crescentus (174 aa). Shows also some similarity to others hypothetical proteins e.g. AJ002571|BSAJ2571_7 from Bacillus subtilis (316 aa), FASTA scores: opt: 138, E(): 0.033, (27.1% identity in 133 aa overlap). Previous hits with Q56415|M85195 FOSFOMYCIN-RESISTANCE PROTEIN from SERRATIA MARCESCENS (141 aa), FASTA scores: opt: 82, E(): 1.1e -08, (29.1% identity in 151 aa overlap). Contains PS00082 Extradiol ring-cleavage dioxygenases signature near C-terminus. TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214788.1" /db_xref="GI:15607415" /db_xref="GeneID:886631" /translation="MIKPHNTNTEFELGGINHVALVCSDMARTVDFYSNILGMPLIKA LDLPGGQGQHFFFDAGNGDCVAFFWFADAPDRVPGLSSPVAIPGIGDITSAVSTMNHL AFHVPAERFDAYRQRLKDKGVRVGPVLNHDDSETQVSAVVHPGVYVRSFYFQDPDGIT LEFACWTKEFTTSDAQAVPKTAADRRPPVAADR" misc_feature 330842..330907 /locus_tag="Rv0274" /note="PS00082 Extradiol ring-cleavage dioxygenases signature" gene complement(330933..331658) /locus_tag="Rv0275c" /db_xref="GeneID:886629" CDS complement(330933..331658) /locus_tag="Rv0275c" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0275c, (MTV035.03c), len: 241 aa. Possible transcriptional regulator, tetR family, similar to others e.g. Q9RJE7|SCF81.04c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (219 aa); Q9FBI8|SCP8.33c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (213 aa); Q9I2Q9|PA1836 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (193 aa); etc. Also shows some similarity with Rv0825c from Mycobacterium tuberculosis (213 aa), FASTA scores: opt: 230, E(): 2.7e-07, (32.6% identity in 190 aa overlap). SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS (M. tuberculosis regulatory protein family with many TetR orthologues)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="YP_177706.1" /db_xref="GI:57116711" /db_xref="GeneID:886629" /translation="MTRSDRPYRGVEAAERLATRRRQSLSAGLDLLGSDQHDIAELTI RTICRRAGLSVRYFYESFTDKDEFVGRVFDWVVAELVATTQAAVTAVPAREQTRAGMA NIVRTITADARVGRLLFSTQLANAVITRKRAESSALFAMLSGQHAVDTLHAPANDHVK AVAHFAVGGVGQTISAWLAGDVRLDPDQLVDQLAALLDELTDPNLSRPRVAATAAKSG ANDPQPPEVAGQPPSSARPARRS" gene 331748..332668 /locus_tag="Rv0276" /db_xref="GeneID:886627" CDS 331748..332668 /locus_tag="Rv0276" /function="UNKNOWN" /note="Rv0276, (MTV035.04), len: 306 aa. Conserved hypothetical protein, similar to Rv2237|Z70692|MTCY427.18 from Mycobacterium tuberculosis (296 aa), FASTA scores: opt: 874, E(): 0, (49.6% identity in 282 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214790.1" /db_xref="GI:15607417" /db_xref="GeneID:886627" /translation="MAISLVAHQPIPHVERPMADPPRLQLARRRRSAAGPGGNEDSLM GVALLAGPANVIMELAMPGVGYGVLESRVESGRLDRHPIKRARTTFTYVAVAVAGSDD QKAAFRRAVNKVHAQVYSTPESPVSYHAFDPELQLWVAACLYKGGVDVYRTFVGEMDD EEADHHYRAGMAMGTTLQVPPQMWPPDRAAFDRYWRQSLDRVHIDDVVRDYLYPIVAL RIRGIALPGPLRRLSEGIALLITTGFLPQRFRDEMRLPWDATKQRRFDALMAVLRTVN RLMPRFVREFPFNLMLWDLDRRMRRGRPLV" gene complement(332708..333136) /locus_tag="Rv0277c" /db_xref="GeneID:886625" CDS complement(332708..333136) /locus_tag="Rv0277c" /function="UNKNOWN" /note="Rv0277c, (MTV035.05c), len: 152 aa. Conserved hypothetical protein, highly similar to Rv0749|H70824|2911023|CAA17516.1|AL021958 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (142 aa); and similar to several other hypothetical Mycobacterium tuberculosis proteins: Rv0277c, Rv2530c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214791.1" /db_xref="GI:15607418" /db_xref="GeneID:886625" /translation="MFLIDVNVLLAAHRGDHPNHRTVRPWFDRLLAADDPFTVPNLVW ASFLRLTTNRRIFEIPSPRADAFAFVEAVNAQPHHLPTSPGPRHLVLLRKLCDEADAS GDLIPDAVLGAIAVEHHCAVVSLDRDFARFASVRHIRPPI" gene complement(333437..336310) /gene="PE_PGRS3" /locus_tag="Rv0278c" /db_xref="GeneID:886623" CDS complement(333437..336310) /gene="PE_PGRS3" /locus_tag="Rv0278c" /function="UNKNOWN" /note="Rv0278c, (MTV035.06c), len: 957 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), PGRS subfamily of gly-rich proteins, similar to many e.g. Z95890|MTCY28_25|Rv1759c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 3849, E(): 0, (67.8% identity in 903 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177707.1" /db_xref="GI:57116712" /db_xref="GeneID:886623" /translation="MSFVIAAPEVIAAAATDLASLGSSISAANAAAAANTTALMAAGA DEVSTAIAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAAVSPLLDPI NEFFLANTGRPLIGNGANGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAG GNGGAGGLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGAGGAGGAGGGVVALTGGAG GAGGAGGNAGLLFGAAGVGGAGGFTNGSALGGAGGAGGAGGLFATGGVGGSGGAGSSG GAGGAGGAGGLFGAGGTGGHGGFADSSFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGG DGGAGGNAGMLALGAAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGG FGFADGGQGGPGGNAGTVFGSGGAGGNGGVGQGFAGGIGGAGGTPGLIGNGGNGGNGG ASAVTGGNGGIGGTGVLIGNGGNGGSGGIGAGKAGVGGVSGLLLGLDGFNAPASTSPL HTLQQNVLNVVNEPFQTLTGRPLIGNGANGTPGTGADGGAGGWLFGNGANGTPGTGAA GGAGGWLFGNGGNGGHGATNTAATATGGAGGAGGILFGTGGNGGTGGIATGAGGIGGA GGAGGVSLLIGSGGTGGNGGNSIGVAGIGGAGGRGGDAGLLFGAAGTGGHGAAGGVPA GVGGAGGNGGLFANGGAGGAGGFNAAGGNGGNGGLFGTGGTGGAGTNFGAGGNGGNGG LFGAGGTGGAAGSGGSGITTGGGGHGGNAGLLSLGASGGAGGSGGASSLAGGAGGTGG NGALLFGFRGAGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGSGANALG AGTGGTGGDGGHAGVFGNGGDGGCRRVWRRYRRQRWCRRQRRADRQRRQRRQRRQSRG HARCRRHRRAAARRERTQRLAIAGRPATTRGVEGISCSPQMMP" misc_feature complement(334136..334207) /gene="PE_PGRS3" /locus_tag="Rv0278c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(336560..339073) /gene="PE_PGRS4" /locus_tag="Rv0279c" /db_xref="GeneID:886621" CDS complement(336560..339073) /gene="PE_PGRS4" /locus_tag="Rv0279c" /function="UNKNOWN" /note="Rv0279c, (MTV035.07c), len: 837 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to many e.g. Z95890|MTCY28_25|Rv0278c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 2677, E(): 0, (64.5% identity in 926 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177708.1" /db_xref="GI:57116713" /db_xref="GeneID:886621" /translation="MSFVIAAPEVIAAAATDLASLESSIAAANAAAAANTTALLAAGA DEVSTAVAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAATSPLLAPI NEFFLANTGRPLIGNGTNGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAG GLIGNGGAGGAGGRASTGTGGAGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGAGGAG GNGGLFADGGVGGAGGATDAGTGGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGG SGGLFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGG IGGDGGTLFGSGGAGGVCGLGFDAGGAGGAGGKAGLLIGAGGAGGAGGGSFAGAGGTG GAGGAPGLVGNAGNGGNGGASANGAGAAGGAGGSGVLIGNGGNGGSGGTGAPAGTAGA GGLGGQLLGRDGFNAPASTPLHTLQQQILNAINEPTQALTGRPLIGNGANGTPGTGAD GGAGGWLFGNGGNGGHGATGADGGDGGSGGAGGILSGIGGTGGSGGIGTTGQGGTGGT GGAALLIGSGGTGGSGGFGLDTGGAGGRGGDAGLFLGAAGTGGQAALSQNFIGAGGTA GAGGTGGLFANGGAGGAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAGGHGGLFG AGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGS GGSLFGFGGAGGTGGSSGIGSSGGTGGDGGTAGVFGNGGDGGAGGFGADTGGNSSSVP NAVLIGNGGNGGNGGKAGGTPGAGGTSGLIIGENGLNGL" gene 339364..340974 /gene="PPE3" /locus_tag="Rv0280" /db_xref="GeneID:886619" CDS 339364..340974 /gene="PPE3" /locus_tag="Rv0280" /function="UNKNOWN" /note="Rv0280, (MTV035.08), len: 536 aa. Member of the Mycobacterium tuberculosis PPE family, similar to others e.g. Z80108|MTCY21B4_4|Rv0453 from Mycobacterium tuberculosis (539 aa), FASTA scores: opt: 1131, E(): 0, (51.7% identity in 540 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177709.1" /db_xref="GI:57116714" /db_xref="GeneID:886619" /translation="MTLWMASPPEVHSALLSSGPGPGSVLSAAGVWSSLSAEYAAVAD ELIGLLGAVQTGAWQGPSAAAYVAAHAPYLAWLMRASETSAEAAARHETVAAAYTTAV AAMPTLVELAANHTLHGVLVATNFFGINTIPIALNEADYARMWTQAASTMATYQAVAE AAVASAPQTTPAPPILAAEAADDDHDHDHDHGGEPTPLDYLVAEILRIISGGRLIWDP AEGTMNGIPFEDYTDAAQPIWWVVRAIEFSKDFETFVQELFVNPVEAFQFYFELLLFD YPTHIVQIVEALSQSPQLLAVALGSVISNLGAVTGFAGLSGLAGMQPAAIPALAPVAA APSTLPAVAMAPTMAAPGAAVASAAAPASAPAASTVASATPAPPPAPGAAGFGYPYAI APPGIGFGSGMSASASAQRKAPQPDSAAAAAAAAAVRDQARARRRRRVTRRGYGDEFM DMNIDVDPDWGPPPGEDPVTSTVASDRGAGHLGFAGTARREAVADAAGMTTLAGDDFG DGPTTPMVPGSWDPDRDAPGSAEPGDRG" gene 340998..341906 /locus_tag="Rv0281" /db_xref="GeneID:886618" CDS 340998..341906 /locus_tag="Rv0281" /function="UNKNOWN" /note="Rv0281, (MTV035.09), len: 302 aa. Conserved hypothetical protein; member of Mycobacterium tuberculosis protein family that includes Rv0726c, Rv0731c, Rv3399, Rv1729c, etc. MTCY31_23 (325 aa), FASTA scores: opt: 1386, E(): 0, (69. 1% identity in 301 aa overlap). Contains possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214795.1" /db_xref="GI:15607422" /db_xref="GeneID:886618" /translation="MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAF CRAVGGSWADVLDGKLPDHKLKSTDFGEHFVNFQGARTKYFDEYFRRAAAAGARQVVI LAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRREIAVDLRDDW PQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDALAGRRSHVAVEDGAPMGP DEYAAKVEEERAAIAEGAEEHPFFQLVYNERCAPAAEWFGERGWTAVATLLNDYLEAV GRPVPGPESEAGPMFARNTLVSAARV" gene 342130..344025 /locus_tag="Rv0282" /db_xref="GeneID:886613" CDS 342130..344025 /locus_tag="Rv0282" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0282, (MTV035.10), len: 631 aa. Conserved hypothetical protein, similar to Y14967|MLCB628.18c hypothetical protein from Mycobacterium leprae (573 aa), FASTA scores: opt: 916, E(): 0, (38.7% identity in 568 aa overlap). Also similar to Mycobacterium tuberculosis proteins e.g. Z94121|MTY15F10.26 (619 aa), FASTA scores: opt: 743, E(): 0, (29.9% identity in 612 aa overlap). Member of CFXQ, CBXP family - 9 members in Mycobacterium tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214796.1" /db_xref="GI:15607423" /db_xref="GeneID:886613" /translation="MAGVGEGDSGGVERDDIGMVAASPVASRVNGKVDADVVGRFATC CRALGIAVYQRKRPPDLAAARSGFAALTRVAHDQCDAWTGLAAAGDQSIGVLEAASRT ATTAGVLQRQVELADNALGFLYDTGLYLRFRATGPDDFHLAYAAALASTGGPEEFAKA NHVVSGITERRAGWRAARWLAVVINYRAERWSDVVKLLTPMVNDPDLDEAFSHAAKIT LGTALARLGMFAPALSYLEEPDGPVAVAAVDGALAKALVLRAHVDEESASEVLQDLYA AHPENEQVEQALSDTSFGIVTTTAGRIEARTDPWDPATEPGAEDFVDPAAHERKAALL HEAELQLAEFIGLDEVKRQVSRLKSSVAMELVRKQRGLTVAQRTHHLVFAGPPGTGKT TIARVVAKIYCGLGLLKRENIREVHRADLIGQHIGETEAKTNAIIDSALDGVLFLDEA YALVATGAKNDFGLVAIDTLLARMENDRDRLVVIIAGYRADLDKFLDTNEGLRSRFTR NIDFPSYTSHELVEIAHKMAEQRDSVFEQSALHDLEALFAKLAAESTPDTNGISRRSL DIAGNGRFVRNIVERSEEEREFRLDHSEHAGSGEFSDEELMTITADDVGRSVEPLLRG LGLSVRA" misc_feature 343282..343305 /locus_tag="Rv0282" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 344022..345638 /locus_tag="Rv0283" /db_xref="GeneID:886645" CDS 344022..345638 /locus_tag="Rv0283" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0283, (MTV035.11), len: 538 aa. Possible conserved membrane protein, similar to several hypothetical mycobacterial proteins e.g. Z94121|MTY15F10_16|Rv3895c from Mycobacterium tuberculosis (495 aa), FASTA scores: opt: 698, E(): 0, (37.6% identity in 492 aa overlap); Rv1782; Rv3450c; Rv3869; and Y14967|MLCB628_16|MLCB628.17c from Mycobacterium leprae (481 aa), FASTA scores: opt: 672, E(): 1.5e-31, (37.2% identity in 506 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214797.1" /db_xref="GI:15607424" /db_xref="GeneID:886645" /translation="MTNQQHDHDFDHDRRSFASRTPVNNNPDKVVYRRGFVTRHQVTG WRFVMRRIAAGIALHDTRMLVDPLRTQSRAVLMGVLIVITGLIGSFVFSLIRPNGQAG SNAVLADRSTAALYVRVGEQLHPVLNLTSARLIVGRPVSPTTVKSTELDQFPRGNLIG IPGAPERMVQNTSTDANWTVCDGLNAPSRGGADGVGVTVIAGPLEDTGARAAALGPGQ AVLVDSGAGTWLLWDGKRSPIDLADHAVTSGLGLGADVPAPRIIASGLFNAIPEAPPL TAPIIPDAGNPASFGVPAPIGAVVSSYALKDSGKTISDTVQYYAVLPDGLQQISPVLA AILRNNNSYGLQQPPRLGADEVAKLPVSRVLDTRRYPSEPVSLVDVTRDPVTCAYWSK PVGAATSSLTLLAGSALPVPDAVHTVELVGAGNGGVATRVALAAGTGYFTQTVGGGPD APGAGSLFWVSDTGVRYGIDNEPQGVAGGGKAVEALGLNPPPVPIPWSVLSLFVPGPT LSRADALLAHDTLVPDSRPARPVSAEGGYR" misc_feature 344931..344954 /locus_tag="Rv0283" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 345635..349627 /locus_tag="Rv0284" /db_xref="GeneID:886611" CDS 345635..349627 /locus_tag="Rv0284" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0284, (MTV035.12), len: 1330 aa. Possible conserved membrane protein, similar to products of two adjacent Mycobacterium leprae genes, MLCB628.16c (744 aa) and MLCB628.15c (597 aa); and throughout its length to several large Mycobacterium tuberculosis proteins: Rv3447c, Rv3870, Rv1784, etc. Y14967|MLCB628_ 15 (744 aa), FASTA scores: opt: 942, E(): 0, (33.8% identity in 730 aa overlap); Y14967|MLCB628_14 (597 aa), FASTA scores: opt: 613, E(): 3.1e-30, (31.7% identity in 615 aa overlap); Z94121|MTY15F10_17 (1396 aa), FASTA scores: opt: 652, E(): 2.2e-32, (35.4% identity in 1321 aa overlap); Z95389|MTCY77_19 (1236 aa), FASTA scores: opt 652, E(): 2.2e-32, (35.4% identity in 1321 aa overlap). Contains three PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214798.1" /db_xref="GI:15607425" /db_xref="GeneID:886611" /translation="MSRLIFEARRRLAPPSSHQGTIIIEAPPELPRVIPPSLLRRALP YLIGILIVGMIVALVATGMRVISPQTLFFPFVLLLAATALYRGNDKKMRTEEVDAERA DYLRYLSVVRDNIRAQAAEQRASALWSHPDPTALASVPGSRRQWERDPHDPDFLVLRA GRHTVPLATTLRVNDTADEIDLEPVSHSALRSLLDTQRSIGDVPTGIDLTKVSPITVL GERAQVRAVLRAWIAQAVTWHDPTVLGVALAARDLEGRDWNWLKWLPHVDIPGRLDAL GPARNLSTDPDELIALLGPVLADRPAFTGQPTDALRHLLIVVDDPDYDLGASPLAVGR AGVTVVHCSASAPHREQYSDPEKPILRVAHGAIERWQTGGWQPYIDAADQFSADEAAH LARRLSRWDSNPTHAGLRSAATRGASFTTLLGIEDASRLDVPALWAPRRRDEELRVPI GVTGTGEPLMFDLKDEAEGGMGPHGLMIGMTGSGKSQTLMSILLSLLTTHSAERLIVI YADFKGEAGADSFRDFPQVVAVISNMAEKKSLADRFADTLRGEVARREMLLREAGRKV QGSAFNSVLEYENAIAAGHSLPPIPTLFVVADEFTLMLADHPEYAELFDYVARKGRSF RIHILFASQTLDVGKIKDIDKNTAYRIGLKVASPSVSRQIIGVEDAYHIESGKEHKGV GFLVPAPGATPIRFRSTYVDGIYEPPQTAKAVVVQSVPEPKLFTAAAVEPDPGTVIAD TDEQEPADPPRKLIATIGEQLARYGPRAPQLWLPPLDETIPLSAALARAGVGPRQWRW PLGEIDRPFEMRRDPLVFDARSSAGNMVIHGGPKSGKSTALQTFILSAASLHSPHEVS FYCLDYGGGQLRALQDLAHVGSVASALEPERIRRTFGELEQLLLSRQQREVFRDRGAN GSTPDDGFGEVFLVIDNLYGFGRDNTDQFNTRNPLLARVTELVNVGLAYGIHVIITTP SWLEVPLAMRDGLGLRLELRLHDARDSNVRVVGALRRPADAVPHDQPGRGLTMAAEHF LFAAPELDAQTNPVAAINARYPGMAAPPVRLLPTNLAPHAVGELYRGPDQLVIGQREE DLAPVILDLAANPLLMVFGDARSGKTTLLRHIIRTVREHSTADRVAFTVLDRRLHLVD EPLFPDNEYTANIDRIIPAMLGLANLIEARRPPAGMSAAELSRWTFAGHTHYLIIDDV DQVPDSPAMTGPYIGQRPWTPLIGLLAQAGDLGLRVIVTGRATGSAHLLMTSPLLRRF NDLQATTLMLAGNPADSGKIRGERFARLPAGRAILLTDSDSPTYVQLINPLVDAAAVS GETQQKGSQS" misc_feature 347069..347092 /locus_tag="Rv0284" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 348119..348142 /locus_tag="Rv0284" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 348953..348976 /locus_tag="Rv0284" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 349624..349932 /gene="PE5" /locus_tag="Rv0285" /db_xref="GeneID:886608" CDS 349624..349932 /gene="PE5" /locus_tag="Rv0285" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0285, (MTV035.13), len: 102 aa. Member of the Mycobacterium tuberculosis PE family (see Brennan & Delogu 2002), similar to others e.g. AL0212|MTV012_37 from Mycobacterium tuberculosis (105 aa), FASTA scores: opt: 497, E(): 2.6e-24, (80.4% identity in 102 aa overlap); Z80108|MTCY21B4.03 from Mycobacterium tuberculosis (102 aa), FASTA scores: opt: 413, E(): 3.7e-19, (66.7% identity in 102 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177710.1" /db_xref="GI:57116715" /db_xref="GeneID:886608" /translation="MTLRVVPEGLAAASAAVEALTARLAAAHASAAPVITAVVPPAAD PVSLQTAAGFSAQGVEHAVVTAEGVEELGRAGVGVGESGASYLAGDAAAAATYGVVGG" gene 349935..351476 /gene="PPE4" /locus_tag="Rv0286" /db_xref="GeneID:886607" CDS 349935..351476 /gene="PPE4" /locus_tag="Rv0286" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0286, (MTV035.14), len: 513 aa. Member of the Mycobacterium tuberculosis PPE family, similar to others e.g. AL0212|MTV012_32 from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 958, E(): 0, (43.5% identity in 522 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177711.1" /db_xref="GI:57116716" /db_xref="GeneID:886607" /translation="MAAPIWMASPPEVHSALLSNGPGPGSLVAAATAWSQLSAEYAST AAELSGLLGAVPGWAWQGPSAEWYVAAHLPYVAWLTQASADAAGAAAQHEAAAAAYTT ALAAMPTLAELAANHVIHTVLVATNFFGINTIPITLNEADYVRMWLQAAAVMGLYQAA SGAALASAPRTVPAPTVMNPGGGAASTVGAVNPWQWLLALLQQLWNAYTGFYGWMLQL IWQFLQDPIGNSIKIIIAFLTNPIQALITYGPLLFALGYQIFFNLVGWPTWGMILSSP FLLPAGLGLGLAAIAFLPIVLAPAVIPPASTPLAAAAVAAGSVWPAVSMAVTGAGTAG AATPAAGAAPSAGAAPAPAAPATASFAYAVGGSGDWGPSLGPTVGGRGGIKAPAATVP AAAAAAATRGQSRARRRRRSELRDYGDEFLDMDSDSGFGPSTGDHGAQASERGAGTLG FAGTATKERRVRAVGLTALAGDEFGNGPRMPMVPGTWEQGSNEPEAPDGSGRGGGDGL PHDSK" gene 351525..351818 /gene="esxG" /locus_tag="Rv0287" /db_xref="GeneID:886604" CDS 351525..351818 /gene="esxG" /locus_tag="Rv0287" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0287, (MTV035.15), len: 97 aa. esxG, ESAT-6 like protein. PE-family related protein; distant member of the Mycobacterium tuberculosis PE family, similar to Rv3020c|AL0212|MTV012.34 (97 aa), FASTA scores: opt: 564, E(): 0, (91.8% identity in 97 aa overlap). Contains probable helix-turn-helix motif at aa 14-35 (Score 144, +4.11 SD). SEEMS TO BELONG TO THE ESAT6 FAMILY (see Gey Van Pittius et al., 2001). Note that previously known as TB9.8.; TB9.8" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214801.1" /db_xref="GI:15607428" /db_xref="GeneID:886604" /translation="MSLLDAHIPQLVASQSAFAAKAGLMRHTIGQAEQAAMSAQAFHQ GESSAAFQAAHARFVAAAAKVNTLLDVAQANLGEAAGTYVAADAAAASTYTGF" gene 351848..352138 /gene="esxH" /locus_tag="Rv0288" /db_xref="GeneID:886603" CDS 351848..352138 /gene="esxH" /locus_tag="Rv0288" /function="UNKNOWN. MAY BE INVOLVED IN VIRULENCE." /experiment="experimental evidence, no additional details recorded" /note="Rv0288, (MT0301, MTV035.16), len: 96 aa. esxH, low molecular weight protein antigen 7 (10 kDa antigen) (CFP-7) (Protein TB10.4) (see citations below), ala-rich protein; member of mycobacterial protein family containing ESAT-6, very similar to MTV012_33 from Mycobacterium tuberculosis (96 aa), FASTA scores: opt: 566, E(): 0, (84.4% identity in 96 aa overlap). Alternative start codon possible position 351878 (see Rosenkrands et al., 2000). BELONG TO THE ESAT6 FAMILY (see Skjot et al., 2000; 2002; Gey Van Pittius et al., 2001). Note that previously known as cfp7 (alternate gene name: TB10.4).; cfp7, TB10.4" /codon_start=1 /transl_table=11 /product="low molecular weight protein antigen 7 ESXH (10 kDa antigen) (CFP-7) (protein TB10.4)" /protein_id="NP_214802.1" /db_xref="GI:15607429" /db_xref="GeneID:886603" /translation="MSQIMYNYPAMLGHAGDMAGYAGTLQSLGAEIAVEQAALQSAWQ GDTGITYQAWQAQWNQAMEDLVRAYHAMSSTHEANTMAMMARDTAEAAKWGG" gene 352149..353036 /locus_tag="Rv0289" /db_xref="GeneID:886602" CDS 352149..353036 /locus_tag="Rv0289" /function="UNKNOWN" /note="Rv0289, (MTV035.17), len: 295 aa. Conserved hypothetical protein, equivalent to CAC32061.1|AL583926 possible DNA-binding protein from Mycobacterium leprae (289 aa); and showing some similarity to Rv3866|G70656|CAB06238.1|Z94121|MTCY15F10.23 from Mycobacterium tuberculosis (276 aa), FASTA scores: opt: 149, E(): 0.0035, (27.7% identity in 289 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214803.1" /db_xref="GI:15607430" /db_xref="GeneID:886602" /translation="MDATPNAVELTVDNAWFIAETIGAGTFPWVLAITMPYSDAAQRG AFVDRQRDELTRMGLLSPQGVINPAVADWIKVVCFPDRWLDLRYVGPASADGACELLR GIVALRTGTGKTSNKTGNGVVALRNAQLVTFTAMDIDDPRALVPILGVGLAHRPPARF DEFSLPTRVGARADERLRSGVPLGEVVDYLGIPASARPVVESVFSGPRSYVEIVAGCN RDGRHTTTEVGLSIVDTSAGRVLVSPSRAFDGEWVSTFSPGTPFAIAVAIQTLTACLP DGQWFPGQRVSRDFSTQSS" gene 353083..354501 /locus_tag="Rv0290" /db_xref="GeneID:886599" CDS 353083..354501 /locus_tag="Rv0290" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0290, (MTV035.18), len: 472 aa. Probable conserved transmembrane protein, similar to several others in mycobacteria e.g. Z95389|MTCY77_20|Rv3887c from Mycobacterium tuberculosis (467 aa), FASTA scores: opt: 429, E(): 5.1e-19, (28. 6% identity in 479 aa overlap); Rv3877; Rv1795; Rv3448; and Y14967|MLCB628_9|MLCB628.10c from Mycobacterium leprae (480 aa), FASTA scores: opt: 269, E(): 3.1e-09, (26.0% identity in 503 aa overlap). TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214804.1" /db_xref="GI:15607431" /db_xref="GeneID:886599" /translation="MSGTVMQIVRVAILADSRLTEMALPAELPLREILPAVQRLVVPS AQNGDGGQADSGAAVQLSLAPVGGQPFSLDASLDTVGVVDGDLLVLQPVPAGPAAPGI VEDIADAAMIFSTSRLKPWGIAHIQRGALAAVIAVALLATGLTVTYRVATGVLAGLLA VAGIAVASALAGLLITIRSPRSGIALSIAALVPIGAALALAVPGKFGPAQVLLGAAGV AAWSLIALMIPSAERERVVAFFTAAAVVGASVALAAGAQLLWQLPLLSIGCGLIVAAL LVTIQAAQLSALWARFPLPVIPAPGDPTPSAPPLRLLEDLPRRVRVSDAHQSGFIAAA VLLSVLGSVAIAVRPEALSVVGWYLVAATAAAATLRARVWDSAACKAWLLAQPYLVAG VLLVFYTATGRYVAAFGAVLVLAVLMLAWVVVALNPGIASPESYSLPLRRLLGLVAAG LDVSLIPVMAYLVGLFAWVLNR" gene 354498..355883 /gene="mycP3" /locus_tag="Rv0291" /db_xref="GeneID:886615" CDS 354498..355883 /gene="mycP3" /locus_tag="Rv0291" /EC_number="3.4.21.-" /function="THOUGHT TO HAVE PROTEOLYTIC ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="Rv0291, (MTV035.19), len: 461 aa. Probable mycP3, membrane-anchored serine protease (mycosin) (EC 3.4.21.-) (see Brown et al., 2000), similar to several others in mycobacteria e.g. Z94121|MTY15F10_28|Rv1796 from Mycobacterium tuberculosis (446 aa), FASTA scores: opt: 1168, E(): 0, (44.6% identity in 453 aa overlap); Rv3886c; Rv3883c; Rv3449; and Y14967|MLCB628_4|MLCB628.04 from Mycobacterium leprae (446 aa), FASTA scores: opt: 1159, E(): 0, (43.5 identity in 446 aa overlap). Has signal sequence and hydrophobic stretch at C-terminus, followed by short positively charged segment, that seems to act as a membrane anchor. Contains PS00137 Serine proteases, subtilase family, histidine active site signature. BELONGS TO PEPTIDASE FAMILY S8 (ALSO KNOWN AS THE SUBTILASE FAMILY), PYROLYSIN SUBFAMILY." /codon_start=1 /transl_table=11 /product="membrane-anchored mycosin" /protein_id="NP_214805.1" /db_xref="GI:15607432" /db_xref="GeneID:886615" /translation="MIRAAFACLAATVVVAGWWTPPAWAIGPPVVDAAAQPPSGDPGP VAPMEQRGACSVSGVIPGTDPGVPTPSQTMLNLPAAWQFSRGEGQLVAIIDTGVQPGP RLPNVDAGGDFVESTDGLTDCDGHGTLVAGIVAGQPGNDGFSGVAPAARLLSIRAMST KFSPRTSGGDPQLAQATLDVAVLAGAIVHAADLGAKVINVSTITCLPADRMVDQAALG AAIRYAAVDKDAVIVAAAGNTGASGSVSASCDSNPLTDLSRPDDPRNWAGVTSVSIPS WWQPYVLSVASLTSAGQPSKFSMPGPWVGIAAPGENIASVSNSGDGALANGLPDAHQK LVALSGTSYAAGYVSGVAALVRSRYPGLNATEVVRRLTATAHRGARESSNIVGAGNLD AVAALTWQLPAEPGGGAAPAKPVADPPVPAPKDTTPRNVAFAGAAALSVLVGLTAATV AIARRRREPTE" misc_feature 354873..354905 /gene="mycP3" /locus_tag="Rv0291" /note="PS00137 Serine proteases, subtilase family, histidin e active site" gene 355880..356875 /locus_tag="Rv0292" /db_xref="GeneID:886601" CDS 355880..356875 /locus_tag="Rv0292" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0292, (MTV035.20), len: 331 aa. Probable conserved transmembrane protein (has two hydrophobic segments at N-terminal end), equivalent to CAC32058.1|AL583926 conserved membrane protein from Mycobacterium leprae (339 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214806.1" /db_xref="GI:15607433" /db_xref="GeneID:886601" /translation="MNPIPSWPGRGRVTLVLLAVVPVALAYPWQSTRDYVLLGVAAAV VIGLFGFWRGLYFTTIARRGLAILRRRRRIAEPATCTRTTVLVWVGPPASDTNVLPLT LIARYLDRYGIRADTIRITSRVTASGDCRTWVGLTVVADDNLAALQARSARIPLQETA QVAARRLADHLREIGWEAGTAAPDEIPALVAADSRETWRGMRHTDSDYVAAYRVSANA ELPDTLPAIRSRPAQETWIALEIAYAAGSSTRYTVAAACALRTDWRPGGTAPVAGLLP QHGNHVPALTALDPRSTRRLDGHTDAPADLLTRLHWPTPTAGAHRAPLTNAVSRT" gene complement(356862..358064) /locus_tag="Rv0293c" /db_xref="GeneID:886594" CDS complement(356862..358064) /locus_tag="Rv0293c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0293c, (MTV035.21c), len: 400 aa. Conserved hypothetical protein, similar in C-terminal part to Rv2627c|B70573|MTCY01A10.05|CAB08637.1|Z95387 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (413 aa), FASTA scores: opt: 394, E(): 2.1e-17, (31.1% identity in 299 aa overlap). TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214807.1" /db_xref="GI:15607434" /db_xref="GeneID:886594" /translation="MSGTFTADAIGPPVPIPDVPGADAGAEGLPSRSVLSARQRILVE SSAIADVALRTAVASVLSATVTPAVVANALRHVNEGSERSNLNFYAELAAAHDPAKSF PAPTELPKVTSRPASPLTEWVARGTVDNIAFASGFRAINPTMRQRWSALTANNIVHAQ HWRHRDGPRPTLCVIHGFMGSSYLLNGLFFSLPWYYRSGYDVLLYTLPFHGQRAEKFS PFSGFGYFTSGLSGFAEAMAQAVYDFRSIVDYLRHIGVDRIALTGISLGGYTSALLAS VESRLEAVIPNCPVVMPAKLFDEWFPANKLVKLGLRLTNISRDELIAGLAYHGPLNYR PLLPKDRRMIITGLGDRMAPPEHAVTLWKQWDRCALHWFPGSHLLHVSQLDYLRRMTV FLQGLMFD" gene 358171..358956 /gene="tam" /locus_tag="Rv0294" /db_xref="GeneID:886593" CDS 358171..358956 /gene="tam" /locus_tag="Rv0294" /EC_number="2.1.1.-" /function="POSSIBLY CATALYZES THE S-ADENOSYLMETHIONINE MONOMETHYL ESTERIFICATION OF TRANS-ACONITATE AT HIGH AFFINITY AND OF CIS-ACONITATE, ISOCITRATE, AND CITRATE AT LOWER VELOCITIES AND AFFINITIES." /note="catalyzes the formation of (E)-3-(methoxycarbonyl)pent-2-enedioate and S-adenosyl-L-homocysteine from S-adenosyl-L-methionine and trans-aconitate" /codon_start=1 /transl_table=11 /product="trans-aconitate 2-methyltransferase" /protein_id="NP_214808.1" /db_xref="GI:15607435" /db_xref="GeneID:886593" /translation="MWDPDVYLAFSGHRNRPFYELVSRVGLERARRVVDLGCGPGHLT RYLARRWPGAVIEALDSSPEMVAAAAERGIDATTGDLRDWKPKPDTDVVVSNAALHWV PEHSDLLVRWVDELAPGSWIAVQIPGNFETPSHAAVRALARREPYAKLMRDIPFRVGA VVQSPAYYAELLMDTGCKVDVWETTYLHQLTGEHPVLDWITGSALVPVRERLSDESWQ QFRQELIPLLNDAYPPRADGSTIFPFRRLFMVAEVGGARRSGG" gene complement(358945..359748) /locus_tag="Rv0295c" /db_xref="GeneID:886596" CDS complement(358945..359748) /locus_tag="Rv0295c" /function="UNKNOWN" /note="Rv0295c, (MTV035.23c), len: 267 aa. Conserved hypothetical protein, showing weak similarity with CAC46877.1|AL591790 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (213 aa); and NP_104818.1|14023999|BAB50604.1|AP00300 Protein with weak similarity to NodH from Mesorhizobium loti (257 aa). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214809.1" /db_xref="GI:15607436" /db_xref="GeneID:886596" /translation="MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPST GMAPQPREWFAGVDDDTILQLLDPLDPGTPDTATPVAWREHVRTSGRTPNGVWGGKLM WNQTALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQ VWRGHPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNL TAIVASVLDAIGQDPKLAPAPMLERQANQRSDEWVDRYRAEAPRLGLPT" gene complement(359758..361155) /locus_tag="Rv0296c" /db_xref="GeneID:886600" CDS complement(359758..361155) /locus_tag="Rv0296c" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0296c, (MTCY63.01c, MTV035.24c), len: 465 aa. Probable sulfatase, possibly an aryl-/steryl-sulfatase (EC 3.1.6.-) or a sulfamidase (sulfohydrolase) (sulphamidase) (EC 3.10.1.-). Similar to various hydrolases e.g. AAG41945.1|AF304053_1|AF304053 heparan N-sulfatase from Mus musculus (502 aa); NP_061292.1|6851181|AAF29460.1|AF153827_1|AF153827 N-sulfoglucosamine sulfohydrolase (sulfamidase) (sulphamidase) from Mus musculus (502 aa); AAG17206.1|AF217203_1|AF217203 heparan sulfate sulfamidase from Canis familiaris (507 aa); P08842|STS_HUMAN|1360652 STERYL-SULFATASE PRECURSOR (EC 3.1.6.2) (STEROID SULFATASE) (STERYL-SULFATE SULFOHYDROLASE) (ARYLSULFATASE C) (ASC) from Homo sapiens (583 aa); ARSB_FELCA|P33727 arylsulfatase B precursor (EC 3.1.6.1) (535 aa), FASTA scores: opt: 231, E(): 1.7e-08, (30.3% identity in 261 aa overlap). Also similarity with 4 others sulfatases in Mycobacterium tuberculosis. Contains sulfatases signature 1 (PS00523). Note that previously known as atsG.; atsG" /codon_start=1 /transl_table=11 /product="sulfatase" /protein_id="YP_177712.1" /db_xref="GI:57116717" /db_xref="GeneID:886600" /translation="MTSERATGQRENLLIVHWHDLGRYLGVYHHPDVYSPRLDRLAAE GILFTRAHATAPLCTPSRGSLFTGRYPQSNGLVGLAHHGWEYRTGVQTLPQLLSESGW YSALFGMQHETSYPKRLGFDEFDVSNSYCEYVVAKAQDWLHNRVPALDGQRFLLTAGF FETHRPYPHERYRPADSAAVELPDYLPDTPEVRQDVAEFYGSIATADEAVGRLLDTLA DTGLDASTWVVFVTDHGPAFPRAKSTLYDAGTGIALIIRPPTRRAMAPRVYDELFSGV DLVPTLLDLLRLEVPADVEGVSHAPALLAPDTENAAVRDHVYTAKTYHDSFDPIRAIR TKEYSYIENYAPRPLLDLPWDIQESPAGMAVAPLVKAPRPQRELYDLRADPTETNNLL AGDDSTQGVAAIAADLAVRLHDWRQRTADVIPSDFAGSRIAERYTETYLRIHRKTPTG RSAIAADRGIDEHCS" misc_feature complement(360952..360990) /locus_tag="Rv0296c" /note="PS00523 Sulfatases signature 1" gene 361334..363109 /gene="PE_PGRS5" /locus_tag="Rv0297" /db_xref="GeneID:885981" CDS 361334..363109 /gene="PE_PGRS5" /locus_tag="Rv0297" /function="UNKNOWN" /note="Rv0297, (MTCY63.02), len: 591 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to others e.g. Y03A_MYCTU|Q10637 from Mycobacterium tuberculosis (603 aa), FASTA scores: opt: 1884, E(): 0, (53.7% identity in 635 aa overlap). TBparse score is 0.850." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177713.1" /db_xref="GI:57116718" /db_xref="GeneID:885981" /translation="MSFVIAQPEMIAAAAGELASIRSAINAANAAAAAQTTGVMSAAA DEVSTAVAALFSSHAQAYQAASAQAAAFHAQVVRTLTVDAGAYASAEAANAGPNMLAA VNAPAQALLGRPLIGNGANGAPGTGQAGGDGGLLFGNGGNGGSGAPGQAGGAGGAAGF FGNGGNGGDGGAGANGGAGGTAGWFFGFGGNGGAGGIGVAGINGGLGGAGGDGGNAGF FGNGGNGGMGGAGAAGVNAVNPGLATPVTPAANGGNGLNLVGVPGTAGGGADGANGSA IGQAGGAGGDGGNASTSGGIGIAQTGGAGGAGGAGGDGAPGGNGGNGGSVEHTGATGS SASGGNGATGGNGGVGAPGGAGGNGGHVSGGSVNTAGAGGKGGNGGTGGAGGPGGHGG SVLSGPVGDSGNGGAGGDGGAGVSATDIAGTGGRGGNGGHGGLWIGNGGDGGAGGVGG VGGAGAAGAIGGHGGDGGSVNTPIGGSEAGDGGKGGLGGDGGGRGIFGQFGAGGAGGA GGVGGAGGAGGTGGGGGNGGAIFNAGTPGAAGTGGDGGVGGTGAAGGKGGAGGSGGVN GATGADGAKGLDGATGGKGNNGNPG" gene 363252..363479 /locus_tag="Rv0298" /db_xref="GeneID:886590" CDS 363252..363479 /locus_tag="Rv0298" /function="UNKNOWN" /note="Rv0298, (MTCY63.03), len: 75 aa. Hypothetical unknown protein. TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214812.1" /db_xref="GI:15607439" /db_xref="GeneID:886590" /translation="MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRV ALRDYTAKTVPALDIDAYAQRVYQANRAAGS" gene 363476..363778 /locus_tag="Rv0299" /db_xref="GeneID:886598" CDS 363476..363778 /locus_tag="Rv0299" /function="UNKNOWN" /note="Rv0299, (MTCY63.04), len: 100 aa. Hypothetical unknown protein. Equivalent to AAK44536.1 from Mycobacterium tuberculosis strain CDC1551 (49 aa) but longer 51 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214813.1" /db_xref="GI:15607440" /db_xref="GeneID:886598" /translation="MIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGR VPEDLLAMVVAVEQPNGTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC" gene 363826..364047 /locus_tag="Rv0300" /db_xref="GeneID:886588" CDS 363826..364047 /locus_tag="Rv0300" /function="UNKNOWN" /note="Rv0300, (MTCY63.05), len: 73 aa. Conserved hypothetical protein, similar to Rv1721c|MTCY04C12.06c|Z81360|MTCY4C12_4 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (75 aa), FASTA scores: opt: 84, E(): 8.3, (39.5% identity in 38 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214814.1" /db_xref="GI:15607441" /db_xref="GeneID:886588" /translation="MSDVLIRDIPDDVLASLDAIAARLGLSRTEYIRRRLAQDAQTAR VTVTAADLRRLRGAVAGLGDPELMRQAWR" gene 364044..364469 /locus_tag="Rv0301" /db_xref="GeneID:886586" CDS 364044..364469 /locus_tag="Rv0301" /function="UNKNOWN" /note="Rv0301, (MTCY63.06), len: 141 aa. Conserved hypothetical protein, similar to other hypothetical Mycobacterium tuberculosis proteins e.g. Rv2757c, Rv0229c, Rv2546, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214815.1" /db_xref="GI:15607442" /db_xref="GeneID:886586" /translation="MTDQRWLIDKSALVRLTDSPDMEIWSNRIERGLVHITGVTRLEV GFSAECGEIARREFREPPLSAMPVEYLTPRIEDRALEVQTLLADRGHHRGPSIPDLLI AATAELSGLTVLHVDKDFDAIAALTGQKTERLTHRPPSA" gene 364605..365237 /locus_tag="Rv0302" /db_xref="GeneID:886584" CDS 364605..365237 /locus_tag="Rv0302" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0302, (MTCY63.07), len: 210 aa. Probable transcription regulatory protein, TetR family (see citation below), with its N-terminus similar to N-terminus of several repressors and regulatory proteins of TetR/AcrR family e.g. ACRR_ECOLI|P34000 potential acraB operon repressor from Escherichia coli (215 aa), FASTA scores: opt: 172, E(): 3.1e-05, (22.7% identity in 194 aa overlap). Also similar in N-terminus to N-terminus of MTCY07A7.24 hypothetical regulator from Mycobacterium tuberculosis FASTA score: (38.7% identity in 62 aa overlap). Contains probable helix-turn helix motif from aa 35-56 (Score 1728, +5.07 SD). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="TetR/ACRR family transcriptional regulator" /protein_id="NP_214816.1" /db_xref="GI:15607443" /db_xref="GeneID:886584" /translation="MGVPAKKKQQQGERSRESILDATERLMATKGYAATSISDIRDAC GLAPSSIYWHFGSKEGVLAAMMERGAQRFFAAIPTWDEAHGPVEQRSERQLTELVSLQ SQHPDFLRLFYLLSMERSQDPAVAAVVRRVRNTAIARFRDSITHLLPSDIPPGKADLV VAELTAFAVALSDGVYFAGHLEPDTTDVERMYRRLRQALEALIPVLLEET" gene 365234..366142 /locus_tag="Rv0303" /db_xref="GeneID:886581" CDS 365234..366142 /locus_tag="Rv0303" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0303, (MTCY63.08), len: 302 aa. Possible dehydrogenase/reductase (EC 1.-.-.-), similar to various NADPH dehydrogenases and other NADPH oxidoreductases e.g. O48741|PORC_ARATH|7488284|T00897 PROTOCHLOROPHYLLIDE REDUCTASE C CHLOROPLAST PRECURSOR (EC 1.3.1.33) (NADPH-PROTOCHLOROPHYLLIDE OXIDOREDUCTASE C) from Arabidopsis thaliana (401 aa); Q42850 NADPH DEHYDROGENASE (EC 1.6.99. 1) (395 aa), FASTA scores: opt: 347, E(): 3.8e-16, (35.4% identity in 319 aa overlap). TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="dehydrogenase/reductase" /protein_id="NP_214817.1" /db_xref="GI:15607444" /db_xref="GeneID:886581" /translation="MNTGTAVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRA AMEELGEPNRCSVLEVDLASVRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGIAFTD DGVEMTFGVNHLGHFALVTGILDWLARPARIVVVSSGTHDPSKHTGMPDPRYTCAADL AHPPTDQNTPAEGRRRYTTSKLCNVLFTYELDRRLDHGEQGVMVNAFDPGLMPGSGLA RDYPPILRLAYRLLSPMLRVLPFVHSTRVSGEHLAALAVDPRFAGVTGQYFAGAKAIR SSAESYDRAKALDLWETSERLLAQVT" gene complement(366150..372764) /gene="PPE5" /locus_tag="Rv0304c" /db_xref="GeneID:886592" CDS complement(366150..372764) /gene="PPE5" /locus_tag="Rv0304c" /function="UNKNOWN" /note="Rv0304c, (MTCY63.9c), len: 2204 aa. Member of the Mycobacterium tuberculosis PE family (PPE, MPTR), similar to others e.g. Z95324|MTY13E10_16 from Mycobacterium tuberculosis (1443 aa), FASTA scores: E(): 0, (50.6% identity in 1403 aa overlap); Y04H_MYCTU|Q10778 from Mycobacterium tuberculosis (734 aa), FASTA scores: opt: 989, E(): 0, (42.3% identity in 522 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177714.1" /db_xref="GI:57116719" /db_xref="GeneID:886592" /translation="MNLVSTTSGMSGFLNVGALGSGVANVGNTISGIYNVGTSDLSTP AVNSGLANIGTNIAGLLRDGAGTAAINLGLANHGNLNVGFASLGGFNFGGATIGHNNV GIGNTGIFDVGLANLGSYNIGFGNLGDDNLGFGNFGSYNIGFGNVGNDNLGFANAGGG NIGFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSGNIGLFNSGSNNIGFFN SGSGNFGIANSGSFNTGIGNTGNTNTGLFNSGDVNTGAFNPGSFNTGSFNTGSFNTGG FNPGNTNTGYLNIGNYNTGIANTGDVDTGAFITGNYSNGLFLSGDYQGLVGLNLVIDM PLPISLGVNIPIDIPITASAGNITLMGVTIPPTGDIVLSSIAGQRAHFGPITIPNITV VGPTTTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGNSTTNPSSGFFNTGAGGASG FGNFGGANSGFWNLASATSGASGLLNVGALGSGLANVGTTVSGFYNTSTSDLATPAFN SGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDYNIGFANLGSANFGSA NIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSYNVGFGNLGNDNLG FANTGSNNIGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSGSGNIGFFNSGN GNVGIGNTGTANFGLGNTGSTNTGFFNSGDVNTGIGNTGSFNTGSFNPGDSNTGDFNP GSYNTGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALTFGVDIPI HIPINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSIGITAS AGIGSITIPIIDIPATSGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYLNVG ALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRDGMGTMTLNLGLANLGSNNAGFG NTGIFDVGVANLGNYNIGFGNFGDDNLGFANLGSYNIGVANTGSNNIGFANTGSNNIG IGLTGTGQIGIGALNSGSGNIGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGS TSTGLFNSGDGNTGGFNPGNFNTGNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANT GDVSTGAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVNIPIDIPITGSFTD LVVDNFTIPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISA TAGLGPITIPIIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASA ASGVSGLLNVGALGSGVANVGNTISGIYNTSPLDLGTPAFGSGLANIAGLLQGGAGTT ILDLAGLGNLNVGLANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVGFANSG NYHIGIANTGSANIGFANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGF FNSGTGNVGIGNTGTANFGIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSINT GSINTGSFNTGSTNTGSFNLGDHNTGSFNSGDYNTGYFNAGDYNTGVANTGNVNTGAF ISGNYSNGFFWRGDYQGLIGLSTTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTI PGFQIRVLLGPAAVLVNEMIGPITIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISI GGTPGFGNSTTGPSSGFFHTGAGHVSGFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNS GLLNFGALQSGLANLGNTISGVYNTSTLDLATPAFGSGIANIGANLAGLFLDNTGNLT LNFGVANQGGLNAGIGNLGSVNIGFVNTGDSNLGIGNLGDLNFGGVNIGGNNIGIANT GIFDIGLANLGSYNIGLANLGDDNLGFGNAGSYNIGFANFGSDNLGFANTGSYNIGFA NTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFNSGSGNIGFFNSGTGNVGIFNTGTGN FGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSFNPGSFNTGGFNPGSGNTGYLNTGD YNTGVANTGDVDTGAFITGSYSNGFLVSGDYQGLIGLPLLGIPVTPGYFNLTGGPSSG FFNSGAGSVSGFVNSGAGLSGYLNTGALGSGVANVGNTISGWLNASALDLATPGFLSG IGNFGTNLAGFFRG" gene complement(372820..375711) /gene="PPE6" /locus_tag="Rv0305c" /db_xref="GeneID:885978" CDS complement(372820..375711) /gene="PPE6" /locus_tag="Rv0305c" /function="UNKNOWN" /note="Rv0305c, (MTCY63.10c), len: 963 aa. Member of the Mycobacterium tuberculosis PE family (PPE, MPTR), similar to others e.g. Y04H_MYCTU|Q10778 from Mycobacterium tuberculosis (734 aa), FASTA scores: opt: 1340, E(): 0, (40.9% identity in 815 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177715.1" /db_xref="GI:57116720" /db_xref="GeneID:885978" /translation="MDFVVSAPEVNSLRMYLGAGSGPMLAAAAAWDGLADELAVAASW FGSVTSGLADAAWRGPAAVAMARAVAPYLGWLISATAQAEQAAAQARVAVATFEAARA ATVHPAIVAANRAVLVSLVSSNLLGFNAPAIAATEAAYERMWAQDVAAMVGYHAGASA AVSALMPFTQQLKKLAGLSERLTSAAAAAAGPPSAAGFNLGLANVGANNVGNGNVGVF NVGFGNLGSYNLGFANLGSDNLGLANLGGHNIGFANTGSNNVGFGNTGSNNVGIGLTG NGQIGFGSFNSGSHNIGLFNSGSGNVGLFNSGTGNFGIGNSGTGNFGLGNTGSTNTGW FNTGDVNTGGFNPGSYNTGNFNTGNYNTGSFNAGNYNTGYFNTGDYNTGVANTGNVNT GAFIAGNYSNGVLWRGDYQGLIGADIALEIPAIPINAQLFSMPIHQVMVMPGSVMTIP GMRLPFTSIVPFVVYYGPVELPQSTLTLPTVTITVGGPTTTIDGNLTGMVGGVSIPLI KIPAAPGFGNSTTSPSSGFFNAGAGTASGFGNFGGGASGFWNLASATSGLSGFGNVGA LGSGVANVGNTISGLYNTSTSNLATPAFNSGLLHHSVGTMTLNFGLANVGGNNVGGAN AGIFNVGLANLGDYNIGFGNLGGDNLGFAHAGSYNIGFANTGSNNLGFANTGDNNIGF ANIGSNNIGIGLTGSGQIGFGSLNSGSHNIGLFNSGDGNIGLFNSGSGNFGIGNAGTG NWGIGNSGAGNFGIGNAGSTNTGLFNSGDLNTGSLNPGSYNTGSVNTGSVNTGGFNAG NYNTGYFNTGDLQHRHGEHRQYQHRRFHLRQPQQRPSVAGRQPGSDRPRHRRRHSRNP DCERRREYPDSHTDHRQLHGHRIQRARSSTEHSRHCYFFRTRRYRPLHRPSDTDNRSH TCGHGGWTHYRDQYRRHCGRRRHQHPDYPYSSDSRLRQLDRRTVVGLLQ" gene 375914..376585 /locus_tag="Rv0306" /db_xref="GeneID:886577" CDS 375914..376585 /locus_tag="Rv0306" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0306, (MTCY63.11), len: 223 aa. Putative oxidoreductase (EC 1.-.-.-), highly similar to H83485|9947208|AAG04663.1|AE004557_4|AE004557 conserved hypothetical protein from Pseudomonas aeruginosa strain PAO1 (218 aa); and to other putative oxidoreductases e.g. middle part of CAB76073.1|AL157953 putative nitroreductase from Streptomyces coelicolor (1212 aa); Q52685|BLUB protein involved in cobalamin (vitamin B12) synthesis from Rhodobacter capsulatus (206 aa), FASTA scores: opt: 318, E(): 2e-15, (35.6% identity in 191 aa overlap). TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="putative oxidoreductase" /protein_id="NP_214820.1" /db_xref="GI:15607447" /db_xref="GeneID:886577" /translation="MFSAPERRAVYRVIAERRDMRRFVPGGVVSEDVLARLLHAAHAA PSVGLMQPWRFIRITDETLKRRIHALVDDERLLTAEALGAREEEFLALKVEGILDCAE LLVVALCDRRGSYIFGRRTLPQMDLASVSCAIQNLWLAARSEGLGMGWVSLFDPQRLA ALLAMPADAEPVAILCLGPVPEFPDRPALELDGWAYARPLAEFVSENRWSYPSALATD HHHGE" gene complement(376573..377055) /locus_tag="Rv0307c" /db_xref="GeneID:886580" CDS complement(376573..377055) /locus_tag="Rv0307c" /function="UNKNOWN" /note="Rv0307c, (MTCY63.12c), len: 160 aa. Hypothetical unknown protein. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214821.1" /db_xref="GI:15607448" /db_xref="GeneID:886580" /translation="MAVIVRKWFGLGRLPADLRCQVEAEGLIYLAEYVAVTRRFTGVI PGLRASHSIASYVGALAFTEQRVLGTLSMVPKLAGRVVDARWDGPQAGAATAEISPTG LQLDLDVADVDPKFSGQLALHFKATIGEDVLSRLPRRSLAFDVPAEYVNLAVGVTYSP" gene 377113..377829 /locus_tag="Rv0308" /db_xref="GeneID:886583" CDS 377113..377829 /locus_tag="Rv0308" /function="UNKNOWN" /note="Rv0308, (MTCY63.13), len: 238 aa. Probable conserved integral membrane protein, with C-terminus highly similar to C-terminus of other integral membrane proteins or phosphatases e.g. AAK25788.1|AF336822_1|13430250|AAK25789.1|AF336823_1 putative phosphatase from Streptococcus pyogenes (201 aa); Q06074 HYPOTHETICAL 24.9 kDa PROTEIN (216 aa), FASTA scores: opt: 209, E(): 2e-07, (27.9% identity in 140 aa overlap). Could be a phosphatase. TBparse score is 0.961." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214822.1" /db_xref="GI:15607449" /db_xref="GeneID:886583" /translation="MTRPQALLAVSLAFVATAVYAVMWVGHSQDWGWLHSFDWSLLNA AHDIGIKNPAWVRFWDGVSLILGPVVLRPLGLLAAMVALAKRKIRIALLLLACLPLNA IMTIAAKSVAHRPRPATALVSAHSTSFPSGHALEATASVLALLTVLLPMLHSRFTRHI AITVGALCVLTVGVARVALNVHHPTDVVAGWALGYLYFLVCLCVFRPPSIFGAQRASH ALSPPVEVSRQPEPEVDTAR" gene 377931..378587 /locus_tag="Rv0309" /db_xref="GeneID:886574" CDS 377931..378587 /locus_tag="Rv0309" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0309, (MTCY63.14), len: 218 aa. Possible conserved exported protein (has putative N-terminal signal sequence), equivalent to AC32053.1|AL583926 putative secreted protein from Mycobacterium leprae (218 aa). Also similar to others e.g. AB76092.1|AL157956 putative secreted protein from Streptomyces coelicolor (238 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214823.1" /db_xref="GI:15607450" /db_xref="GeneID:886574" /translation="MSRLLALLCAAVCTGCVAVVLAPVSLAVVNPWFANSVGNATQVV SVVGTGGSTAKMDVYQRTAAGWQPLKTGITTHIGSAGMAPEAKSGYPATPMGVYSLDS AFGTAPNPGGGLPYTQVGPNHWWSGDDNSPTFNSMQVCQKSQCPFSTADSENLQIPQY KHSVVMGVNKAKVPGKGSAFFFHTTDGGPTAGCVAIDDATLVQIIRWLRPGAVIAIAK" gene complement(378657..379148) /locus_tag="Rv0310c" /db_xref="GeneID:886570" CDS complement(378657..379148) /locus_tag="Rv0310c" /function="UNKNOWN" /note="Rv0310c, (MTCY63.15c), len: 163 aa. Conserved hypothetical protein, similar to some bile acid dehydratases e.g. P19412|BAIE_EUBSP|98749|D37844|1381566|AAC45413.1|U57489 BILE ACID-INDUCIBLE OPERON PROTEIN E from Eubacterium sp (166 aa), FASTA scores: opt: 302, E(): 1e-11, (38.8% identity in 134 aa overlap); AAF22847.1|AF210152_4 bile acid 7a-dehydratase from Clostridium sp. (168 aa). TBparse score is 0.863." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214824.1" /db_xref="GI:15607451" /db_xref="GeneID:886570" /translation="MCCNGVVTPGDPADIAAIKQLKYRYLRALDTKHWDDFTDTLAED VTGDYGSSVGTELHFTNRADLVDYLRQALGPGVITEHRVTHPEITVTGDTATGIWYLQ DRVIVAEFNFMLIGAAFYHDQYRRTTDGWRISATGYDRTYEATMSLAGLNFNIRPGRA LAD" gene 379172..380401 /locus_tag="Rv0311" /db_xref="GeneID:886579" CDS 379172..380401 /locus_tag="Rv0311" /function="UNKNOWN" /note="Rv0311, (MTCY63.16), len: 409 aa. Hypothetical unknown protein. Contains PS00881 Protein splicing signature. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214825.1" /db_xref="GI:15607452" /db_xref="GeneID:886579" /translation="MSQSRYAGLSRSELAVLLPELLLIGQLIDRSGMAWCIQAFGRQE MLQIAIEEWAGASPIYTKRMQKALNFEGDDVPTIFKGLQLDIGAPPQFMDFRFTLHDR WHGEFHLDHCGALLDVEPMGDDYVVGMCHTIEDPTFDATAIATNPRAQVRPIHRPPRK PADRHPHCAWTVIIDESYPEAEGIPALDAVRETKAATWELDNVDASDDGLVDYSGPLV SDLDFGAFSHSALVRMADEVCLQMHLLNLSFAIAVRKRAKADAQLAISVNTRQLIGVA GLGAERIHRAMALPGGIEGALGVLELHPLLNPAGYVLAETSPDRLVVHNSPAHADGAW ISLCTPASVQPLQAIATAVDPHLKVRISGTDTDWTAELIEADAPASELPEVLVAKVSR GSVFQFEPRRSLPLTVK" misc_feature 380132..380149 /locus_tag="Rv0311" /note="PS00881 Protein splicing signature" gene 380556..382418 /locus_tag="Rv0312" /db_xref="GeneID:886566" CDS 380556..382418 /locus_tag="Rv0312" /function="UNKNOWN" /note="Rv0312, (MTCY63.17), len: 620 aa. Conserved hypothetical protein with highly Pro-, Thr-rich C-terminus. Similar to Pro-,Thr-rich region in Rv2264c|AL021925|MTV022_14 from Mycobacterium tuberculosis (592 aa), FASTA scores: opt: 1075, E(): 0, (38.9% identity in 627 aa overlap). Also some similarity with Rv0350|dnaK from Mycobacterium tuberculosis. Possibly membrane protein; has hydrophobic stetch in its middle part." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214826.1" /db_xref="GI:15607453" /db_xref="GeneID:886566" /translation="MYDPLGLSIGTTNLVAAGNGGPPVTRRAVLTLYPHCAPKIGVPS QNPNLIEPGALMSGFVERIGDAVALVSPDGSVHDPDLLLVEALDAMVLTAGADASSSE IAIAVPAHWKPGAVHALRNGLRTHVGFVRSGMAPRLVSDAIAALTAVNSELGLPHGSV VGLLDFGGSATYVTLVETKSDSRTSDFQPVSATARYQDFSGSQIDQALLLRVIDQFGY GDDVDPASTAAVGQLGQLREQCRAAKERLSTDVATELFAELAGCSSSIEMTREQLEDL IQDPLTGFIYAFDDMLARHNASWADLAAVVTVGGGANIPLVTQRLSFHTRRPVLTASQ PGCAAAMGALLLANRGGERDSRTRTSIGLATAAAAGTSVIELPAGDVMVIDHEALTDR ELAWSQTDFPSEAPARFEGDSYNEGGPCWSMRLNAVEPPKGPAWRRIRVSQLLIGVSA VVAMTAIGGVALTLTAIERRPSPLPTPIVPGLAPMPPGSVVPSSRAPTPPPPPSTVAP LPSAAPAPTTVAPAPPPPTQVVTTTTAPPVTTTPRPSPTTTTTTAPPSTTTTTEPPVT TTSTIPTIPTTTTTVKMTTEWLHVPFLPVPIPVPIPQNPGAGEPQNPFGSLGSG" gene 382490..382876 /locus_tag="Rv0313" /db_xref="GeneID:886572" CDS 382490..382876 /locus_tag="Rv0313" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0313, (MTCY63.18), len: 128 aa. Conserved hypothetical protein, equivalent only to CAC32049.1|AL583926 conserved hypothetical protein from Mycobacterium leprae (130 aa). TBparse score is 0.877." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214827.1" /db_xref="GI:15607454" /db_xref="GeneID:886572" /translation="MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTG WSAIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKDNT DPKRKVRFLPYGIAVSVLDDPVDEAQ" gene complement(382879..383541) /locus_tag="Rv0314c" /db_xref="GeneID:886564" CDS complement(382879..383541) /locus_tag="Rv0314c" /function="UNKNOWN" /note="Rv0314c, (MTCY63.19c), len: 220 aa. Possible conserved membrane protein, with hydrophobic stretch from residues 75-100. Similar in C-terminal part to Mycobacterium tuberculosis proteins Rv0679c and Rv0680c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214828.1" /db_xref="GI:15607455" /db_xref="GeneID:886564" /translation="MIVVWEHLCMNPEDDPEARIRELERPLADVARASELGGSQSGGY TYPPGPPPPPYSYGGPFGGPSPRSSSGNRAWWILAAVVVVGVLVLVGGIAAFSAQRLS QGNFVVLSPTPSVSRAVPTPTAQPATTLPPAGASLSVSGVNVNRTIACNDSIVSVSGM SNTVVITGHCTSLTVSGMRNSVTVDSVDTIEAAGFNNEVTYHSGSPKISNAGGSNSVQ QG" gene 383602..384486 /locus_tag="Rv0315" /db_xref="GeneID:886563" CDS 383602..384486 /locus_tag="Rv0315" /EC_number="3.2.1.-" /function="POSSIBLY HYDROLYZES SPECIFIC SUGAR (HYDROLYZATION OF GLYCOSIDIC BOND) AND COULD BE INVOLVED IN EXOPOLYSACCHARIDE BIOSYNTHESIS/DEGRADATION. COULD ALSO HAVE A LYTIC ACTIVITY AGAINST CELL WALLS." /note="Rv0315, (MTCY63.20), len: 294 aa. Possible beta-1,3-glucanase precursor (EC 3.2.1.-) (has hydrophobic stretch in its N-terminal part), similar to others e.g. Q51333|AAC44371.1 BETA-1,3-GLUCANASE II A from Oerskovia xanthineolytica (306 aa), FASTA scores: opt: 76, E(): 3e-14, (34.1% identity in 302 aa overlap); and AAC38290.1|AF052745 beta-1,3-glucanase II from Oerskovia xanthineolytica (435 aa). Contains glycosyl hydrolases family 16 active site signature (PS01034). TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="beta-1,3-glucanase precursor" /protein_id="NP_214829.1" /db_xref="GI:15607456" /db_xref="GeneID:886563" /translation="MLMPEMDRRRMMMMAGFGALAAALPAPTAWADPSRPAAPAGPTP APAAPAAATGGLLFHDEFDGPAGSVPDPSKWQVSNHRTPIKNPVGFDRPQFFGQYRDS RQNVFLDGNSNLVLRATREGNRYFGGLVHGLWRGGIGTTWEARIKFNCLAPGMWPAWW LSNDDPGRSGEIDLIEWYGNGTWPSGTTVHANPDGTAFETCPIGVDGGWHNWRVTWNP SGMYFWLDYADGIEPYFSVPATGIEDLNEPIREWPFNDPGYKVFPVLNLAVGGSGGGD PATGSYPQEMLVDWVRVF" misc_feature 384112..384147 /locus_tag="Rv0315" /note="PS01034 Glycosyl hydrolases family 16 active sites" gene 384535..385149 /locus_tag="Rv0316" /db_xref="GeneID:886560" CDS 384535..385149 /locus_tag="Rv0316" /EC_number="5.3.3.-" /function="COULD BE INVOLVED IN THE CATABOLISM OF CATECHOL TO SUCCINATE- AND ACETYL-CoA IN THE BETA-KETOADIPATE PATHWAY (AT THE THIRD STEP) [CATALYTIC ACTIVITY: 2,5-dihydro-5-oxofuran-2-acetate = 3,4-dihydro-5-oxofuran-2-acetate]." /note="Rv0316, (MTCY63.21), len: 204 aa. Possible muconolactone isomerase (EC 5.3.3.-), showing weak similarity with some muconolactone isomerases e.g. O33947|CTC1_ACILW MUCONOLACTONE DELTA-ISOMERASE 1 (MIASE 1)(96 aa), FASTA scores: opt: 179, E(): 3.9e-05, (32.6% identity in 92 aa overlap). TBparse score is 0.882." /codon_start=1 /transl_table=11 /product="muconolactone isomerase" /protein_id="NP_214830.1" /db_xref="GI:15607457" /db_xref="GeneID:886560" /translation="MEFLVTMTTRVPDSMPADAVERVRAREAARSRELAAQGKLLRLW RPPLRPGEWRTLGLFAADDNGELEQLLASMPPRSWRTDDVTPLGAHPNDPVGQGITIA PGKGPEFLIATTIMVPPGTPAQVVDDTVAREARRAPELAGRGHLVRLWALPDGPDGQR TLGLWRARDPGELMAILESLPLAGWMTIETTPLSPHPDDPIRMP" gene complement(385173..385943) /gene="glpQ2" /locus_tag="Rv0317c" /db_xref="GeneID:886559" CDS complement(385173..385943) /gene="glpQ2" /locus_tag="Rv0317c" /EC_number="3.1.4.46" /function="GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE HYDROLYZES DEACYLATED PHOSPHOLIPIDS TO G3P AND THE CORRESPONDING ALCOHOLS [CATALYTIC ACTIVITY: A GLYCEROPHOSPHODIESTER + H(2)O = AN ALCOHOL + SN-GLYCEROL 3-PHOSPHATE]." /note="Rv0317c, (MTCY63.22c), len: 256 aa (start uncertain, chosen by homology). Possible glpQ2, glycerophosphoryl diester phosphodiesterase (EC 3.1.4.46), similar to others e.g. E75317|6459876|AAF11631.1|AE002044_4 glycerophosphoryl diester phosphodiesterase from Deinococcus radiodurans (285 aa); P10908|UGPQ_ECOLI from Escherichia coli (247 aa), FASTA scores: opt: 220, E(): 5.2e-07, (28.0% identity in 250 aa overlap). Also similar to MTCY01A6.27 from Mycobacterium tuberculosis FASTA score: (27.5% identity in 247 aa overlap). TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="glycerophosphoryl diester phosphodiesterase" /protein_id="NP_214831.1" /db_xref="GI:15607458" /db_xref="GeneID:886559" /translation="MEFLRHGGRIAMAHRGFTSFRLPMNSMGAFQEAAKLGFRYIETD VRATRDGVAVILHDRRLAPGVGLSGAVDRLDWRDVRKAQLGAGQSIPTLEDLLTALPD MRVNIDIKAASAIEPTVNVIERCNAHNRVLIGSFSERRRRRALRLLTKRVASSAGTGA LLAWLTARPLGSRAYAWRMMRDIDCVQLPSRLGGVPVITPARVRGFHAAGRQVHAWTV DEPDVMHTLLDMDVDGIITDRADLLRDVLIARGEWDGA" gene complement(386204..386274) /locus_tag="Rvnt04" /note="tRNA-Gly(CCC)" /db_xref="GeneID:2700441" tRNA complement(386204..386274) /locus_tag="Rvnt04" /product="tRNA-Gly" /note="codon recognized: GGG" /anticodon=(pos:386240..386242,aa:Gly) /db_xref="GeneID:2700441" gene complement(386305..387099) /locus_tag="Rv0318c" /db_xref="GeneID:886576" CDS complement(386305..387099) /locus_tag="Rv0318c" /function="UNKNOWN" /note="Rv0318c, (MTCY63.23c), len: 258 aa. Probable conserved integral membrane protein, with some similarity to C-terminus of GUFA_MYXXA|Q06916 (254 aa), FASTA scores: opt: 157, E (): 0.0032, (28.3% identity in 198 aa overlap). Also similar to O26573 CONSERVED PROTEIN from Methanobacterium thermoauto (259 aa), FASTA scores: opt: 173, E(): 5.2e-05, (32.7% identity in 214 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="YP_177716.1" /db_xref="GI:57116721" /db_xref="GeneID:886576" /translation="MSLAVTMFKRARAEIFDRNREVGISNVTTAASLVTFPVLAGILG GVVPSVRTPSAAMVSGVQHFAAGIVMAAVAGEVLPDLRSRGPLWLIVVGFSAGVAVLV ALRRFDGHGEHQDGDDVGELPVGFLTVVAVDLFIDGLLVATGATVSSRTAIIITIALT VEVLFLGLAVALRLAGSGMPRIRAAATTSALSLVIAVGGVSGAVALGRAGNTVLTLVL AFAAGALLWLVVEELLVEAHETPERPWMAVMFFAGFLILYGLGVME" gene 387148..387816 /gene="pcp" /locus_tag="Rv0319" /db_xref="GeneID:886555" CDS 387148..387816 /gene="pcp" /locus_tag="Rv0319" /EC_number="3.4.19.3" /function="REMOVES 5-OXOPROLINE FROM VARIOUS PENULTIMATE AMINO ACID RESIDUES EXCEPT L-PROLINE [CATALYTIC ACTIVITY: 5-oxoprolyl-peptide + H2O = 5-oxoproline + peptide]." /note="catalyzes the removal of 5-oxoproline from various penultimate amino acid residues except L-proline" /codon_start=1 /transl_table=11 /product="pyrrolidone-carboxylate peptidase" /protein_id="NP_214833.1" /db_xref="GI:15607460" /db_xref="GeneID:886555" /translation="MSKVLVTGFGPYGVTPVNPAQLTAEELDGRTIAGATVISRIVPN TFFESIAAAQQAIAEIEPALVIMLGEYPGRSMITVERLAQNVNDCGRYGLADCAGRVL VGEPTDPAGPVAYHATVPVRAMVLAMRKAGVPADVSDAAGTFVCNHLMYGVLHHLAQK GLPVRAGWIHLPCLPSVAALDHNLGVPSMSVQTAVAGVTAGIEAAIRQSADIREPIPS RLQI" gene 387888..388550 /locus_tag="Rv0320" /db_xref="GeneID:886553" CDS 387888..388550 /locus_tag="Rv0320" /function="UNKNOWN" /note="Rv0320, (MTCY63.25), len: 220 aa. Possible conserved exported protein, similar to some hypothetical proteins and to the middle part of a peptidase: NP_066789.1|10657900|AAG21739.1|AF116907 putative peptidase from Rhodococcus equi (546 aa). Also similar to Rv1728c|MTCY04C12.13c from Mycobacterium tuberculosis (256 aa), FASTA scores: opt: 497, E(): 1.2e-26, (41.8% identity in 225 aa overlap). TBparse score is 0.943." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214834.1" /db_xref="GI:15607461" /db_xref="GeneID:886553" /translation="MGRHELARDRRKSSAVLAAVLAPAAVFFATGGDVSTLAARADAN PVLGDDAPCCVQIVPVAPLAFSSQISGGEIGTGLAASQFASASRWRIVSRYLPVGVAP EQGLQVKTVLTARSISAAFPEIREIGGVRPDALRWHPNGLALDVMVPNPGTAEGIALG NEIVAFVLKNATRFGMQDVIWRGAYYTPNGARTTGAGHYDHIHITTVGGGYPTGEELY IR" gene 388582..389154 /gene="dcd" /locus_tag="Rv0321" /db_xref="GeneID:886552" CDS 388582..389154 /gene="dcd" /locus_tag="Rv0321" /EC_number="3.5.4.13" /function="INVOLVED IN INTERCONVERSION OF dCTP AND dUTP [CATALYTIC ACTIVITY: dCTP + H2O = dUTP + NH3]." /note="Catalyzes the formation of dUTP from dCTP in thymidylate biosynthesis" /codon_start=1 /transl_table=11 /product="deoxycytidine triphosphate deaminase" /protein_id="NP_214835.1" /db_xref="GI:15607462" /db_xref="GeneID:886552" /translation="MLLSDRDLRAEISSGRLGIDPFDDTLVQPSSIDVRLDCLFRVFN NTRYTHIDPAKQQDELTSLVQPVDGEPFVLHPGEFVLGSTLELFTLPDNLAGRLEGKS SLGRLGLLTHSTAGFIDPGFSGHITLELSNVANLPITLWPGMKIGQLCMLRLTSPSEH PYGSSRAGSKYQGQRGPTPSRSYQNFIRST" misc_feature 388864..388887 /gene="dcd" /locus_tag="Rv0321" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 389260..390591 /gene="udgA" /locus_tag="Rv0322" /db_xref="GeneID:886550" CDS 389260..390591 /gene="udgA" /locus_tag="Rv0322" /EC_number="1.1.1.22" /function="POSSIBLY INVOLVED IN POLYSACCHARIDE BIOSYNTHESIS [CATALYTIC ACTIVITY: UDP-glucose + 2 NAD+ + H2O = UDP-glucuronate + 2 NADH]." /note="Rv0322, (MTCY63.27), len: 443 aa. Probable udg (alternate gene name: rkpK), UDP-glucose 6-dehydrogenase (EC 1.1.1.22), highly similar to others e.g. CAC44517.1|AL596138 putative UDP-glucose 6-dehydrogenase from Streptomyces coelicolor (447 aa); Q56812 UDP-GLUCOSE DEHYDROGENASE from Xanthomonas campestris (445 aa), FASTA scores: opt: 713, E(): 0, (41.9% identity in 351 aa overlap); etc. Also similar to several GDP-mannose 6-dehydrogenase. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UDP-GLUCOSE/GDP-MANNOSE DEHYDROGENASES FAMILY. TBparse score is 0.905.; rkpK" /codon_start=1 /transl_table=11 /product="UDP-glucose 6-dehydrogenase UdgA" /protein_id="NP_214836.1" /db_xref="GI:15607463" /db_xref="GeneID:886550" /translation="MRCSVFGTGYLGATHAVGMAQLGHEVVGVDIDPGKVAKLAGGDI PFYEPGLRKLLTDNLAAGRLRFTTDYDMAADFADVHFLGVGTPQKIGEYGADLRHVHA VIDALVPRLVRASILVGKSTVPVGTAAELGHRAGALAPRGVDVEIAWNPEFLREGFAV HDTLNPDRIVLGVQDDSTRAEVAVRELYAPLLAAGVPFLVTDLQTAELVKVSANAFLA TKISFINAISEVCEAAGADVSQLADALGYDPRIGRQCLNAGLGFGGGCLPKDIRAFMA RAGELGADQALTFLREVDSINMRRRTKMVELATTACGGSLLGANIAVLGAAFKPESDD VRDSPALNVAGQLQLNGATVHVYDPKALDNAHRLFPTLNYAVSVAEACERADAVLVLT EWREFIDLEPADLANRVRARVIVDGRNCLDVTRWRRAGWRVFRLGVPRLGH" misc_feature 389599..389622 /gene="udgA" /locus_tag="Rv0322" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(390580..391251) /locus_tag="Rv0323c" /db_xref="GeneID:886557" CDS complement(390580..391251) /locus_tag="Rv0323c" /function="UNKNOWN" /note="Rv0323c, (MTCY63.28c), len: 223 aa. Conserved hypothetical protein, similar to others e.g. YPJG_BACSU|P42981 hypothetical 24.8 kDa protein from Bacillus subtilis (224 aa), FASTA scores: opt: 182, E(): 1.3e-05, (27.5% identity in 211 aa overlap). Also some similarity to MLU15183_8 from Mycobacterium tuberculosis FASTA score: (32.0% identity in 147 aa overlap). TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214837.1" /db_xref="GI:15607464" /db_xref="GeneID:886557" /translation="MNSCNRLPCAHEVLAVFAHPDDESFGLGAVLGDFTAQGTRLRGL CFTHGEASTLGRTDRNLGEVRREELAAAAQVLGVDHVQLLAYPDNGLAQIPLNELTQR VVDALAGADLLLVFDDNGVTGHPDHRRATEAALAAASTPSIPVLAWALPQPIADRLNA EFSASFGGRGHGHLDIMIEVDRSRQLAAIGCHFTQSADNPVLWRRLELLGDREYLRWL RRSVP" gene 391352..392032 /locus_tag="Rv0324" /db_xref="GeneID:886548" CDS 391352..392032 /locus_tag="Rv0324" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0324, (MTCY63.29), len: 226 aa. Possible transcriptional regulator, arsR family, with its N-terminus similar to the N-terminus of other DNA-binding proteins e.g. P30346|MERR_STRLI probable mercury resistance operon from Streptomyces lividans (125 aa), FASTA scores: opt: 154, E(): 0.002, (32.2% identity in 90 aa overlap)), and its C-terminal part similar to hypothetical bacterial proteins e.g. P54510|YQHL_BACSU hypothetical 14.6 kDa protein from Bacillus subtilis (126 aa), FASTA scores: opt: 159, E(): 0.00097, (35.5% identity in 76 aa overlap)). Most similar to AJ005575|SPE005575_2 ORF1 required for antibiotic production from Streptomyces peucetius (226 aa), FASTA scores: opt: 816, E(): 0, (60.7% identity in 211 aa overlap). Also similar in C-terminus to MTCY164.26 molybdopterin biosynthesis moeb protein from Mycobacterium tuberculosis FASTA score: (36.8% identity in 114 aa overlap)." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="NP_214838.1" /db_xref="GI:15607465" /db_xref="GeneID:886548" /translation="MAGQSDRKAALLDQVARVGKALANGRRLQILDLLAQGERAVEAI ATATGMNLTTASANLQALKSGGLVEARREGTRQYYRIAGEDVARLFALVQVVADEHLA DVAVAAADVLGSPEDAITRAELLRRREAGEVTLVDVRPHEEYQAGHIPGAINIPIAEL ADRLAELTGDRDIVAYCRGAYCVMAPDAVRIARDAGREVKRLDDGMLEWRLAGLPVDE GAPVGHGD" gene 392039..392263 /locus_tag="Rv0325" /db_xref="GeneID:886546" CDS 392039..392263 /locus_tag="Rv0325" /function="UNKNOWN" /note="Rv0325, (MTCY63.30), len: 74 aa. Hypothetical unknown protein. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214839.1" /db_xref="GI:15607466" /db_xref="GeneID:886546" /translation="MGPKGSLRLVKRQPELLVAQHEHWQDTYRAHPVLYGTRPSEPGV YAAEVFNADGVQRVLELAAGHGRDTLYFAG" gene 392273..392728 /locus_tag="Rv0326" /db_xref="GeneID:886544" CDS 392273..392728 /locus_tag="Rv0326" /function="UNKNOWN" /note="Rv0326, (MTCY63.31), len: 151 aa. Hypothetical unknown protein. TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214840.1" /db_xref="GI:15607467" /db_xref="GeneID:886544" /translation="MVATDFSDVAVAQLRRSAQARGVSARVQPIVHDLRQPLPVKTGS IDGAFAHMALCMALSTSEIHAVVAEVGRVLRPGGKFIYTVRHTGDAHYGAGQAHGDDI FECAGFAVHFFRRELVARLATGWVLEEVHDFEEGELPRRLWRVTVTKPA" gene complement(392696..394045) /gene="cyp135A1" /locus_tag="Rv0327c" /db_xref="GeneID:886538" CDS complement(392696..394045) /gene="cyp135A1" /locus_tag="Rv0327c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv0327c, (MT0342, MTCY63.32c), len: 449 aa. Possible cyp135A1, cytochrome P450 (EC 1.14.-.-), similar to cytochrome P-450 monoxygenases and other cytochrome P-450 related enzymes e.g. FQ12609 PUTATIVE P450 MONOOXYGENASE (EC 1.14.14.1) (506 aa), FASTA scores: opt: 276, E() : 1.7e-11, (27.9% identity in 433 aa overlap). Also similar to other Mycobacterium tuberculosis proteins e.g. MTV039.06|Rv0568 PUTATIVE CYTOCHROME P450 (472 aa); MTCI5.10 cytochrome p450 FASTA score: (30.4% identity in 434 aa overlap). Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY. Alternative start possible at 33706 but no RBS." /codon_start=1 /transl_table=11 /product="cytochrome P450 135A1" /protein_id="NP_214841.1" /db_xref="GI:15607468" /db_xref="GeneID:886538" /translation="MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFS LRVPPYADNLVVYTRPEHIKEIFAADPRSLHAGEGNHILGFVMGEHSVLMTDEAEHAR MRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALTLDIILRVVFG VTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWKRFFHNQTKIDEILYREIA SRRIDSDLTARTDVLSRLLQTKDTPTKPLTDAELRDQLITLLLAGHETTAAALSWTLW ELAHAPEIQSQVVWAAVGGDDGFLEAVLKEGMRRHTVIASTARKVTAPAEIGGWRLPA GTVVNTSILLAHASEVSHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTE GAVILQEIFRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP" misc_feature complement(392891..392920) /gene="cyp135A1" /locus_tag="Rv0327c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand si gnature" gene 394111..394713 /locus_tag="Rv0328" /db_xref="GeneID:886542" CDS 394111..394713 /locus_tag="Rv0328" /function="POSSIBLY INVOLVED IN A TRANSCRIPTIONAL MECHANISM." /note="Rv0328, (MTCY63.33), len: 200 aa. Possible transcription regulator, tetR/acrR family, similar in part to various hypothetical transcriptional regulators e.g. T36696|4726006|CAB41735.1|AL049731 probable regulatory protein from Streptomyces coelicolor (197 aa). Also some similarity with YX44_MYCTU|Q10829 hypothetical transcriptional regulator from Mycobacterium tuberculosis (195 aa), FASTA scores: opt: 154, E(): 0.00061, (26.7% identity in 202 aa overlap). Contains probable helix-turn helix motif from aa 27-48 (Score 1408, +3.98 SD). SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="TetR/AcrR family transcriptional regulator" /protein_id="NP_214842.1" /db_xref="GI:15607469" /db_xref="GeneID:886542" /translation="MQQQRTNRDKLLDGALACLRERGYGNTSSRDIARAAGVNIASIN YHFGSKDALLDDALGRCFSTWNQRVQEAFDHSRAAGPAGQILAVLEATVDSFEQIRPA VYACVESYAPALRSEALRERLAAGYADVRQHSVDLAGAALAGTDIAPPENLSTIVSVL MAVIDGLMIQWIADPSATPRSTEVIRALASIGAVVTSQLR" gene complement(394694..395320) /locus_tag="Rv0329c" /db_xref="GeneID:886536" CDS complement(394694..395320) /locus_tag="Rv0329c" /function="UNKNOWN" /note="Rv0329c, (MTCY63.34c), len: 208 aa. Conserved hypothetical protein, showing some similarity with others hypothetical proteins and methyltransferases e.g. MitM|AF127374_14 methyltransferase from Streptomyces lavendulae (283 aa), FASTA scores: opt: 242, E(): 1.8e-08, (37.2% identity in 145 aa overlap); Q48938 from Methanosarcina barkeri (262 aa), FASTA scores: opt: 194, E(): 3.6e-06, (31.1% identity in 119 aa overlap). TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214843.1" /db_xref="GI:15607470" /db_xref="GeneID:886536" /translation="MRLTHPARRYLSSQAARPTGAFGRLLGRIWRAETADVNRIAVEL LAPGPGERVCEIGFGPGRTLGLLAAAGAQVSGVEVSTTMIAIAAHHNAKAIAAGLISL YHGDGVTLPVADHSLDKVLGVHNFYFWPDPRASLCDIARALRPGGRLVLTSISDDQPL AARFDPAIYRVPPTLDTAAWLGAAGFIDVGIKRSADHPATVWFTATAT" gene complement(395347..396087) /locus_tag="Rv0330c" /db_xref="GeneID:886540" CDS complement(395347..396087) /locus_tag="Rv0330c" /function="UNKNOWN" /note="Rv0330c, (MTCY63.35c), len: 246 aa. Hypothetical unknown protein. TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214844.1" /db_xref="GI:15607471" /db_xref="GeneID:886540" /translation="MARSIPADRFSAIVAASARVFIAHGYQRTQVQDVADALALAKGT LYGYAQGKAALFAAAVRYGDAQEALPLASELPVAAPVAGEIAAVVSARLAGEVTDMRL THALRATLPPGATTGDARAELAGIVTDLYSRLARHRIALKLVDRCAPELPDLAEVWFG TGRNAQVDAVQAYLVHRERAGLLILPGPAPMVARTIVELCALWAVHLHFDPSPEPWSI VQPGVIDDDAIAATLAEFVVRATTASSD" gene 396201..397367 /locus_tag="Rv0331" /db_xref="GeneID:886534" CDS 396201..397367 /locus_tag="Rv0331" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0331, (MTCY63.36), len: 388 aa. Possible dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases e.g. NP_103779.1|14022957|BAB49565.1|AP002999 flavoprotein reductase from Mesorhizobium loti (377 aa); NP_147681.1 predicted NAD(FAD)-dependent dehydrogenase from Aeropyrum pernix (381 aa); DHSU_CHRVI|Q06530 sulfide dehydrogenase (431 aa), FASTA scores: opt: 347, E(): 6.8e-15, (25.6% identity in 348 aa overlap). TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="dehydrogenase/reductase" /protein_id="NP_214845.1" /db_xref="GI:15607472" /db_xref="GeneID:886534" /translation="MSKTVLILGAGVGGLTTADTLRQLLPPEDRIILVDRSFDGTLGL SLLWVLRGWRRPDDVRVRPTAASLPGVEMVTATVAHIDIAAQVVHTDNSVIGYDALVI ALGAALNTDAVPGLSDALDADVAGQFYTLDGAAELRAKVEALEHGRIAVAIAGVPFKC PAAPFEAAFLIAAQLGDRYATGTVQIDTFTPDPLPMPVAGPEVGEALVSMLKDHGVGF HPRKALARVDEAARTMHFGDGTSEPFDLLAVVPPHVPSAAARSAGLSESGWIPVDPRT LSTSADNVWAIGDATVLTLPNGKPLPKAAVFAEAQAAVVAHGVARHLGYDVAERHFTG TGACYVETGDHQAAKGDGDFFAPSAPSVTLYPPSREFHEEKVAQELAWLTRWKT" gene 397442..398227 /locus_tag="Rv0332" /db_xref="GeneID:886532" CDS 397442..398227 /locus_tag="Rv0332" /function="UNKNOWN" /note="Rv0332, (MTCY63.37), len: 261 aa. Conserved hypothetical protein, similar to several conserved hypothetical proteins from Streptomyces coelicolor e.g. SC6A9.18c|AL031035|SC6A9_18|T35449 hypothetical protein (266 aa), FASTA scores: opt: 508, E(): 5.7e-27, (36.7% identity in 251 aa overlap). TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214846.1" /db_xref="GI:15607473" /db_xref="GeneID:886532" /translation="MRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWS LGQLFRHVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLVDA VEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFTLEPNVAADGISE FLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEAGEWTVRRDERGVTWSHRHGKGA VALRGGATELLLAMVRRLSVADTGIELLGDAGVWQKWLDRTPL" gene 398254..398628 /locus_tag="Rv0333" /db_xref="GeneID:886528" CDS 398254..398628 /locus_tag="Rv0333" /function="UNKNOWN" /note="Rv0333, (MTCY63.38), len: 124 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214847.1" /db_xref="GI:15607474" /db_xref="GeneID:886528" /translation="MTTSEIATVLAWHDALNAADIETLVALSTDDIDIGDAHGAVQGH DALRGWASSLTTTAELGRMYVHHGVVVVEQKITSGEDPGIARTGAAAFRVVQDHVASV FRHEDLASALAATELTEDDLVD" gene 398658..399524 /gene="rmlA" /locus_tag="Rv0334" /db_xref="GeneID:886568" CDS 398658..399524 /gene="rmlA" /locus_tag="Rv0334" /EC_number="2.7.7.24" /function="DTDP-L-RHAMNOSE BIOSYNTHESIS WITHIN THE O ANTIGEN BIOSYNTHESIS PATHWAY OF LIPOPOLYSACCHARIDE BIOSYNTHESIS [CATALYTIC ACTIVITY: dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose]." /experiment="experimental evidence, no additional details recorded" /note="Rv0334, (MTCY279.01), len: 288 aa. rmlA (alternate gene name: rfbA), alpha-D-glucose-1-phosphate thymidylyl-transferase (EC 2.7.7.24) (see citations below), equivalent to CAC32020.1|AL583925 glucose-1-phosphate thymidyltransferase from Mycobacterium leprae (288 aa). Also highly similar to others e.g. AAG29804.1|AF235050 glucose-1-phosphate thymidylyltransferase from Streptomyces rishiriensis (296 aa); RBA1_ECOLI|P37744 glucose-1-phosphate thymidylyltransferase from Escherichia coli strain K12 (293 aa), FASTA scores: opt: 1199, E(): 0, (62.0% identity in 284 aa overlap). BELONGS TO THE GLUCOSE-1-PHOSPHATE THYMIDYLYLTRANSFERASE FAMILY.; rfbA" /codon_start=1 /transl_table=11 /product="alpha-D-glucose-1-phosphate thymidylyltransferase RmlA" /protein_id="NP_214848.1" /db_xref="GI:15607475" /db_xref="GeneID:886568" /translation="MRGIILAGGSGTRLYPITMGISKQLLPVYDKPMIYYPLTTLMMA GIRDIQLITTPHDAPGFHRLLGDGAHLGVNISYATQDQPDGLAQAFVIGANHIGADSV ALVLGDNIFYGPGLGTSLKRFQSISGGAIFAYWVANPSAYGVVEFGAEGMALSLEEKP VTPKSNYAVPGLYFYDNDVIEIARGLKKSARGEYEITEVNQVYLNQGRLAVEVLARGT AWLDTGTFDSLLDAADFVRTLERRQGLKVSIPEEVAWRMGWIDDEQLVQRARALVKSG YGNYLLELLERN" gene complement(399535..400050) /gene="PE6" /locus_tag="Rv0335c" /db_xref="GeneID:886527" CDS complement(399535..400050) /gene="PE6" /locus_tag="Rv0335c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0335c, (MTCY279.02c), len: 171 aa. Member of the Mycobacterium tuberculosis PE family (see Brennan & Delogu 2002); contains short region of similarity to part of the unique N-terminus of the Mycobacterium tuberculosis PGRS family of Glycine-rich proteins e.g. Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores: opt: 219, E(): 1.1e-08, (51.5% identity in 66 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177717.1" /db_xref="GI:57116722" /db_xref="GeneID:886527" /translation="MRSMGFLHRACRAPSSLPAPLMARPGRSVLARPAATPPGPLCAT TRPRPPQGNQPPASRISNFPPKRHKTRVLAAAEDEVSAAVAALISAHGRRHHSLNNQA AAFHGQFAQNLNVGAGSCASAETTADAPTQALLGPADRQRRQRRAVRQWLVRWAAHPG RATRGFHNHRQ" gene 400192..401703 /locus_tag="Rv0336" /db_xref="GeneID:886524" CDS 400192..401703 /locus_tag="Rv0336" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv0336, (MTCY279.03), len: 503 aa. Part of Mycobacterium tuberculosis 13E12 repeat family; almost identical to Rv0515|MTCY20G10.05 hypothetical protein from Mycobacterium tuberculosis FASTA scores: (99.8% identity in 503 aa overlap), possibly due to a recent gene duplication. Also similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv1148c, Rv1945, etc." /codon_start=1 /transl_table=11 /product="13E12 repeat family protein" /protein_id="NP_214850.1" /db_xref="GI:15607477" /db_xref="GeneID:886524" /translation="MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAA AQLVALGELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAMRE RLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVARWPSMTKARLA GQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIGGSLLAVDAHALDARLSAL AGTVCEHDPRSREQRRADALGALAGGADRLGCGCGRADCAAGKRPAAPPVVIHLIAEA ATINGTGSAPASQMNADGLITAELVAELAKTATLVPLVHPGDAPPEPGYAPSKALADF VRCRDLTCRWPGCDEPATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQ QLPDGTLILTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDPNDDPPPF" gene complement(401873..403162) /gene="aspC" /locus_tag="Rv0337c" /db_xref="GeneID:886522" CDS complement(401873..403162) /gene="aspC" /locus_tag="Rv0337c" /EC_number="2.6.1.2" /function="GENERATES OXALOACETATE AND L-GLUTAMATE FROM L-ASPARTATE AND 2-OXOGLUTARATE [CATALYTIC ACTIVITY: L-ASPARTATE + 2-OXOGLUTARATE = OXALOACETATE + L-GLUTAMATE]." /note="broad specificity; family IV; in Corynebacterium glutamicum this protein can use glutamate, 2-aminobutyrate, and aspartate as amino donors and pyruvate as the acceptor" /codon_start=1 /transl_table=11 /product="aminotransferase AlaT" /protein_id="NP_214851.1" /db_xref="GI:15607478" /db_xref="GeneID:886522" /translation="MDNDGTIVDVTTHQLPWHTASHQRQRAFAQSAKLQDVLYEIRGP VHQHAARLEAEGHRILKLNIGNPAPFGFEAPDVIMRDIIQALPYAQGYSDSQGILSAR RAVVTRYELVPGFPRFDVDDVYLGNGVSELITMTLQALLDNGDQVLIPSPDYPLWTAS TSLAGGTPVHYLCDETQGWQPDIADLESKITERTKALVVINPNNPTGAVYSCEILTQM VDLARKHQLLLLADEIYDKILYDDAKHISLASIAPDMLCLTFNGLSKAYRVAGYRAGW LAITGPKEHASSFIEGIGLLANMRLCPNVPAQHAIQVALGGHQSIEDLVLPGGRLLEQ RDIAWTKLNEIPGVSCVKPAGALYAFPRLDPEVYDIDDDEQLVLDLLLSEKILVTQGT GFNWPAPDHLRLVTLPWSRDLAAAIERLGNFLVSYRQ" gene complement(403193..405841) /locus_tag="Rv0338c" /db_xref="GeneID:886520" CDS complement(403193..405841) /locus_tag="Rv0338c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0338c, (MTCY279.05c), len: 882 aa. Probable iron-sulphur-binding reductase (EC 1.-.-.-), possibly membrane-bound, equivalent to CAC32018.1|AL583925 probable iron-sulphur-binding reductase from Mycobacterium leprae (880 aa). Also highly similar to others e.g. T36608|5019323|CAB44376.1|AL078610 probable iron-sulfur-binding reductase from Streptomyces coelicolor (760 aa), FASTA scores: opt: 1658, E(): 0, (49.9% identity in 772 aa overlap); BAB07521.1|AP001520 iron-sulphur-binding reductase from Bacillus halodurans (700 aa). Contains PS00070 Aldehyde dehydrogenases cysteine active site and two of PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. First of several possible start sites chosen." /codon_start=1 /transl_table=11 /product="iron-sulfur-binding reductase" /protein_id="NP_214852.1" /db_xref="GI:15607479" /db_xref="GeneID:886520" /translation="MTTQTLIRLILGMSMTAVVGVFALRRVWWLYKLVMSGQPASGRT DNLGTRIWTQISEVLGQRRLLKWSIPGLAHFFTMWGFFILLTVYIEAYGLLFEERFHI PVIGRWDALGFLQDFFATAVFLGITTFAIIRILRNPREIGRSSRFYGSHNGGAWLVLL MIFNVIWTYVLVRGSAVNNGTLPYGNGAFLSQLFGAILRPLGQPANEIIETTALLLHI GVMLAFLILVLHSKHLHIFLAPINVTFKRLPDGLGPLLPLEADGKPIDFENPSEDAVF GRGKIEDFTWKGMLDFATCTECGRCQSQCPAWNTGKPLSPKLVIMDLRDHWMAKAPYI LGQKDASAGGEAGHQEHHHVPESGFGRVPGHGPEQATRPLVGTEEQGGVIDPDVLWSC VTCGACVEQCPVDIEHVDHIVDMRRYQVMMESEFPSELSVLFKNLETKGNPWGQNASD RTNWIDEVDFDVPVYGQDVDSFDGYEYLFWVGCAGAYDDKAKKTTKAVAELLAVARVK YLVLGAGETCNGDSARRSGNEFLFQQLAQQAVETLDGLFEGVETVDRKIVVTCPHCFN TIGKEYRQLGANYTVLHHTQLLNRLVRDKRLVPVTPVSQDITYHDPCYLGRHNKAYEA PRELIGAAGASLTEMPRHADRSFCCGAGGARMWMEEHIGKRINHERVDEALATDATAI ATACPFCRVMVTDGVNDRQEEAGRSGVEVLDVAQVLLGSLDHDKAQLPAKGTAAKQAQ ERAPKAAPKAAAPVTPVEAPAEAPQAPAPAAPAAPVKGLGMAAGAKRPGAKKAAPTPA APAAPAAPVKGLGIAAGAKRPGAKKTPPPAPGLAEPAAQPQPEAKPQPEPAAPPKPQT DGDPAAPAAPVKGLGIARGARPPGKR" misc_feature complement(404276..404311) /locus_tag="Rv0338c" /note="PS00070 Aldehyde dehydrogenases cysteine active site" misc_feature complement(404633..404668) /locus_tag="Rv0338c" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding regi on signature" misc_feature complement(404924..404959) /locus_tag="Rv0338c" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding regi on signature" gene complement(405950..408448) /locus_tag="Rv0339c" /db_xref="GeneID:886516" CDS complement(405950..408448) /locus_tag="Rv0339c" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0339c, (MTCY279.06c), len: 832 aa. Possible transcriptional regulator, showing very weak similarity with parts of others. Contains PS00017 ATP/GTP-binding site motif A (P-loop ); and probable helix-turn helix motif from aa 778-799 (Score 1041, +2.73 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214853.1" /db_xref="GI:15607480" /db_xref="GeneID:886516" /translation="MQHRGCKNRGQAYDASVTDSLTEVPPAARRALLELANAPTVPVK VLITGGIGTGKTTVLAAARDTLRRSGLTVLACPPPDGEPPETALVIDDAQLLTDTELL RLTERVADSRLTVVAAAEAREHHRALRALTMALERDRPRISLGPLPVAEHLRDCTAGL PFLIHAVSARAQAPAQAAKVALIERLRRLDEPTLDTLLMMSLTHELGVSDVAAALGIS VTDARGLVDRAHASGLIESSHTAAFLQSVHDAIAQIVGNAHHHEVETSLLRSQLDISP VSAELALRLAEHGLRDERLADILTRYAADTRDASVRCARLYRAAVHAGAKGLTVRLAD ALARTGDCTAAATLADDLLSSPDATERAAAVRVAASVAVHDGNTGHAAELFGWLGPHP DTMVSSAATIVFAANGDLATARATLRLKDAGPPTMAARCARNLAEGLLLTMDQPYPVA MAKLGQAIATEQSLSQVIPDSPAALVTLAAIHAGDPVRARSVIGRAVRAGADPLFQRR HLLLSGWIKMQEGQLPSASADVAAASAGTHLHRRDALWAAALQTAISRRTGDIGALQQ HWYAAMEALAEYSLDLFALLPLGELWVAAARMRQVDQLQHTLDQALTLLDSLGNPALW SNSLHWAGVHAGILANSPESVAPHGQALGAMVAHSTLAQALSDAGRTWLRVLAENVDA DEVTAAARSLSHVGLTSDATRLAGQAALQTSDARVSGAMLQLARDLKLGNDFGEPPSG AGDTEPASGTPPAPRQPPAGSPLSDREREVAELLLLGMPYRDIGARLFISAKTVEHHV ARIRQRLGAGSRSEMLSMLRAMLAPESLTADERR" misc_feature complement(408281..408304) /locus_tag="Rv0339c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 408634..409173 /locus_tag="Rv0340" /db_xref="GeneID:886514" CDS 408634..409173 /locus_tag="Rv0340" /function="UNKNOWN" /note="Rv0340, (MTCY279.07), len: 179 aa. Conserved hypothetical protein; MEME-MAST analysis shows similarity to product of downstream gene, Rv0341|iniB." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214854.1" /db_xref="GI:15607481" /db_xref="GeneID:886514" /translation="MANSLLDFVISLVRDPEAAARYAANPERSIAEAHLTDVTRADVN SLIPVVSDSLSMSEPIGAAGGAHAGDRGNVWASGAATAALDAFAPHADAGVVQQHGAV GSVLNQPTPPGPGVTPTDPRPFRAGPHETSALLTSAEIPDTTSEDGGLPTDHPAVWNH PVVDPHTVEPDHHGYDIHG" gene 409362..410801 /gene="iniB" /locus_tag="Rv0341" /db_xref="GeneID:886518" CDS 409362..410801 /gene="iniB" /locus_tag="Rv0341" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0341, iniB, (MTCY13E10.01), len: 479 aa. iniB, isoniazid-inducible gene, (see citations below). Protein very Gly-, Ala-rich, similar to cell wall proteins e.g. P27483|GRP_ARATH GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN from A.thaliana (338 aa), FASTA scores: opt: 532, E(): 5.2e-13, (39.3% identity in 321 aa overlap). MEME-MAST analysis shows similarity to product of upstream gene, Rv0340." /codon_start=1 /transl_table=11 /product="isoniazid inductible gene protein INIB" /protein_id="NP_214855.1" /db_xref="GI:15607482" /db_xref="GeneID:886518" /translation="MTSLIDYILSLFRSEDAARSFVAAPGRAMTSAGLIDIAPHQISS VAANVVPGLNLGAGDPMSGLRQAVAARHGFAQDVANVGFAGDAGAGVASVITTDVGAG LASGLGAGFLGQGGLALAASSGGFGGQVGLAAQVGLGFTAVIEAEVGAQVGAGLGIGT GLGAQAGMGFGGGVGLGLGGQAGGVIGGSAAGAIGAGVGGRLGGNGQIGVAGQGAVGA GVGAGVGGQAGIASQIGVSAGGGLGGVGNVSGLTGVSSNAVLASNASGQAGLIASEGA ALNGAAMPHLSGPLAGVGVGGQAGAAGGAGLGFGAVGHPTPQPAALGAAGVVAKTEAA AGVVGGVGGATAAGVGGAHGDILGHEGAALGSVDTVNAGVTPVEHGLVLPSGPLIHGG TGGYGGMNPPVTDAPAPQVPARAQPMTTAAEHTPAVTQPQHTPVEPPVHDKPPSHSVF DVGHEPPVTHTPPAPIELPSYGLFGLPGF" gene 410838..412760 /gene="iniA" /locus_tag="Rv0342" /db_xref="GeneID:886510" CDS 410838..412760 /gene="iniA" /locus_tag="Rv0342" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0342, iniA, (MTCY13E10.02), len: 640 aa. iniA, isoniazid-inducible gene, (see citations below). Shows slight similarity to some hypothetical bacterial proteins e.g. P40983|YOR6_THER hypothetical protein (402 aa), FASTA scores: opt: 242, E(): 1.4e-07, (22.3% identity in 349 aa overlap). Also some similarity to downstream ORF Rv0343|iniC. Possible transmembrane stretch around residue 490. Alternative translational start at 410824. Contains a phosphopantetheine attachment site motif suggestive of an acyl carrier protein. Note that the iniA gene is also induced by the antibiotic ethambutol, an agent that inhibits cell wall biosynthesis by a mechanism that is distinct from isoniazid." /codon_start=1 /transl_table=11 /product="isoniazid inductible gene protein INIA" /protein_id="NP_214856.1" /db_xref="GI:15607483" /db_xref="GeneID:886510" /translation="MVPAGLCAYRDLRRKRARKWGDTVTQPDDPRRVGVIVELIDHTI AIAKLNERGDLVQRLTRARQRITDPQVRVVIAGLLKQGKSQLLNSLLNLPAARVGDDE ATVVITVVSYSAQPSARLVLAAGPDGTTAAVDIPVDDISTDVRRAPHAGGREVLRVEV GAPSPLLRGGLAFIDTPGVGGLGQPHLSATLGLLPEADAVLVVSDTSQEFTEPEMWFV RQAHQICPVGAVVATKTDLYPRWREIVNANAAHLQRARVPMPIIAVSSLLRSHAVTLN DKELNEESNFPAIVKFLSEQVLSRATERVRAGVLGEIRSATEQLAVSLGSELSVVNDP NLRDRLASDLERRKREAQQAVQQTALWQQVLGDGFNDLTADVDHDLRTRFRTVTEDAE RQIDSCDPTAHWAEIGNDVENAIATAVGDNFVWAYQRSEALADDVARSFADAGLDSVL SAELSPHVMGTDFGRLKALGRMESKPLRRGHKMIIGMRGSYGGVVMIGMLSSVVGLGL FNPLSVGAGLILGRMAYKEDKQNRLLRVRSEAKANVRRFVDDISFVVSKQSRDRLKMI QRLLRDHYREIAEEITRSLTESLQATIAAAQVAETERDNRIRELQRQLGILSQVNDNL AGLEPTLTPRASLGRA" gene 412757..414238 /gene="iniC" /locus_tag="Rv0343" /db_xref="GeneID:886508" CDS 412757..414238 /gene="iniC" /locus_tag="Rv0343" /function="UNKNOWN" /note="Rv0343, (MTCY13E10.03), len: 493 aa. iniC, isoniazid-inducible gene, (see citations below). Shows slight similarity to P40983|YOR6_THER8 hypothetical protein (402 aa), FASTA scores: opt: 196, E(): 2.6e-05, (25.9% identity in 228 aa overlap). Also some similarity to upstream ORF Rv0342|iniA. Contains (PS00017) ATP/GTP-binding site motif A (P-loop). Note that the iniA gene is also induced by the antibiotic ethambutol, an agent that inhibits cell wall biosynthesis by a mechanism that is distinct from isoniazid." /codon_start=1 /transl_table=11 /product="isoniazid inductible gene protein INIC" /protein_id="NP_214857.1" /db_xref="GI:15607484" /db_xref="GeneID:886508" /translation="MSTSDRVRAILHATIQAYRGAPAYRQRGDVFCQLDRIGARLAEP LRIALAGTLKAGKSTLVNALVGDDIAPTDATEATRIVTWFRHGPTPRVTANHRGGRRA NVPITRRGGLSFDLRRINPAELIDLEVEWPAEELIDATIVDTPGTSSLACDASERTLR LLVPADGVPRVDAVVFLLRTLNAADVALLKQIGGLVGGSVGALGIIGVASRADEIGAG RIDAMLSANDVAKRFTRELNQMGICQAVVPVSGLLALTARTLRQTEFIALRKLAGAER TELNRALLSVDRFVRRDSPLPVDAGIRAQLLERFGMFGIRMSIAVLAAGVTDSTGLAA ELLERSGLVALRNVIDQQFAQRSDMLKAHTALVSLRRFVQTHPVPATPYVIADIDPLL ADTHAFEELRMLSLLPSRATTLNDDEIASLRRIIGGSGTSAAARLGLDPANSREAPRA ALAAAQHWRRRAAHPLNDPFTTRACRAAVRSAEAMVAEFSARR" misc_feature 412907..412930 /gene="iniC" /locus_tag="Rv0343" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(414381..414941) /gene="lpqJ" /locus_tag="Rv0344c" /db_xref="GeneID:886512" CDS complement(414381..414941) /gene="lpqJ" /locus_tag="Rv0344c" /function="UNKNOWN" /note="Rv0344c, (MTCY13E10.04c), len: 186 aa. Probable lipoprotein, without homology. Has an appropriately positioned prokaryotic lipoprotein signature (PS00013)." /codon_start=1 /transl_table=11 /product="lipoprotein LpqJ" /protein_id="NP_214858.1" /db_xref="GI:15607485" /db_xref="GeneID:886512" /translation="MRLSLIARGMAALLAATALVAGCNTTIDGRPVASPGSGPTEPTF PTPRPTTAPPGTTAPTLPTTPVSPTAPAGAIPLPPDSNGYVFIETKSGMTRCQINRDS VGCEAPFTNSPLRDGEHANGIHITAGGSVQWVLGNLGAIPTVSIDYRTYEAQGWTIDA TTDGTRFTNNRTGHGMFVSIEKVDTF" misc_feature complement(414873..414905) /gene="lpqJ" /locus_tag="Rv0344c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attac hment site" gene 415050..415460 /locus_tag="Rv0345" /db_xref="GeneID:886505" CDS 415050..415460 /locus_tag="Rv0345" /function="UNKNOWN" /note="Rv0345, (MTCY13E10.05), len: 136 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. AL13282 4|SCAH10_9 hypothetical protein from Streptomyces coelicolor (207 aa), FASTA scores: opt: 188, E(): 1.5e-05, (41.0% identity in 117 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214859.1" /db_xref="GI:15607486" /db_xref="GeneID:886505" /translation="MLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCND VILVLGAVEVSAPAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVN AKVVARVLGRALVSRSGLAGRGRIPAHSARRRGC" gene complement(415502..416965) /gene="ansP2" /locus_tag="Rv0346c" /db_xref="GeneID:886530" CDS complement(415502..416965) /gene="ansP2" /locus_tag="Rv0346c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF L-ASPARAGINE ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv0346c, (MTCY13E10.06c), len: 487 aa. Possible ansP2, L-asparagine permease, integral membrane protein belonging to family containing many amino acid permeases, highly similar to G467030|B2126_F2_85|NP_301937.1|NC_002677 probable L-asparagine permease from Mycobacterium leprae (498 aa); and NP_301938.1|NC_002677 probable L-asparagine permease from Mycobacterium leprae (505 aa). Also highly similar to others e.g. P77610|ANSP_ECOLI L-ASPARAGINE PERMEASE from Escherichia coli strain K-12 (499 aa). Also highly similar to ANSP1|Rv2127|MT2186|MTCY261_22|O33261 PROBABLE L-ASPARAGINE PERMEASE from Mycobacterium tuberculosis (489 aa), FASTA score: (72.1% identity in 473 aa overlap). And shows some similarity to MTCY3G12.14 from Mycobacterium tuberculosis. BELONGS TO THE AMINO ACID PERMEASE FAMILY (APC FAMILY). Note that previously known as aroP2.; aroP2" /codon_start=1 /transl_table=11 /product="L-asparagine ABC transporter permease" /protein_id="YP_177718.1" /db_xref="GI:57116723" /db_xref="GeneID:886530" /translation="MPPLDITDERLTREDTGYHKGLHSRQLQMIALGGAIGTGLFLGA GGRLASAGPGLFLVYGICGIFVFLILRALGELVLHRPSSGSFVSYAREFYGEKVAFVA GWMYFLNWAMTGIVDTTAIAHYCHYWRAFQPIPQWTLALIALLVVLSMNLISVRLFGE LEFWASLIKVIALVTFLIVGTVFLAGRYKIDGQETGVSLWSSHGGIVPTGLLPIVLVT SGVVFAYAAIELVGIAAGETAEPAKIMPRAINSVVLRIACFYVGSTVLLALLLPYTAY KEHVSPFVTFFSKIGIDAAGSVMNLVVLTAALSSLNAGLYSTGRILRSMAINGSGPRF TAPMSKTGVPYGGILLTAGIGLLGIILNAIKPSQAFEIVLHIAATGVIAAWATIVACQ LRLHRMANAGQLQRPKFRMPLSPFSGYLTLAFLAGVLILMYFDEQHGPWMIAATVIGV PALIGGWYLVRNRVTAVAHHAIDHTKSVAVVHSADPI" gene 417304..418290 /locus_tag="Rv0347" /db_xref="GeneID:886501" CDS 417304..418290 /locus_tag="Rv0347" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0347, (MTCY13E10.07), len: 328 aa (alternative start possible). Probable conserved membrane protein, similar to Rv0831c|AL022004|MTV043_23 from Mycobacterium tuberculosis (271 aa), FASTA scores: E(): 9.6e-21, (33.1% identity in 266 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214861.1" /db_xref="GI:15607488" /db_xref="GeneID:886501" /translation="MPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGT RPRWVSFLVIVLVIMNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELAR WTPILEQEEVRQVNLETGEHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFR SIVHAMVTARQDVAPVDGCIRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADL KLTTTAQRHVIQCEGPEPGDSLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDID SAWSDPCKGIPALDAHLVDEVAERLHTPIGPLFESLITSELRTKVLQQPGQE" gene 418293..418946 /locus_tag="Rv0348" /db_xref="GeneID:886500" CDS 418293..418946 /locus_tag="Rv0348" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0348, (MTCY13E10.08), len: 217 aa. Possible transcriptional regulator, showing some similarity to O53334|RV3188|MTV014.32 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (115 aa), FASTA score: (30.0% identity in 100 aa overlap). Contains probable helix-turn helix motif from aa 89-110 (Score 1407, +3.98 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214862.1" /db_xref="GI:15607489" /db_xref="GeneID:886500" /translation="MTISFSSSNLRDDATSGNGDYRLDKLPETTPSTSVFDRADVTYR QFTELHGQARDTRREAHVVELESKTGERARCAPMHALEQLADYGFAWRDIARVVGVSV PAITKWRKGAGVTGENRLKIARLLALIDMLSDRFIGEPASWLEMPIQAGVGITRMDLL ERGRYDLVLALASTHTGDGTVEYVLNETDKDWRETVVDNAFESYTAEDGVISIRPKR" gene 418949..419608 /locus_tag="Rv0349" /db_xref="GeneID:886506" CDS 418949..419608 /locus_tag="Rv0349" /function="UNKNOWN" /note="Rv0349, (MTCY13E10.09), len: 219 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214863.1" /db_xref="GI:15607490" /db_xref="GeneID:886506" /translation="MPELETPDDPESIYLARLEDVGEHRPTFTGDIYRLGDGRMVMIL QHPCALRHGVDLHPRLLVAPVRPDSLRSNWARAPFGTMPLPKLIDGQDHSADFINLEL IDSPTLPTCERIAVLSQSGVNLVMQRWVYHSTRLAVPTHTYSDSTVGPFDEADLIEEW VTDRVDDGADPQAAEHECASWLDERISGRTRRALLSDRQHASSIRREARSHRKSVKLA D" gene 419835..421712 /gene="dnaK" /locus_tag="Rv0350" /db_xref="GeneID:885946" CDS 419835..421712 /gene="dnaK" /locus_tag="Rv0350" /EC_number="3.6.1.-" /function="ACTS AS A CHAPERONE. INVOLVED IN INDUCTION BY STRESS CONDITIONS e.g. HEAT SHOCK. POSSIBLY HAS AN ATPASE ACTIVITY. SEEMS TO BE REGULATED POSITIVELY BY SIGH (Rv3223c PRODUCT) AND NEGATIVELY BY HSPR (Rv0353 PRODUCT)." /experiment="experimental evidence, no additional details recorded" /note="heat shock protein 70; assists in folding of nascent polypeptide chains; refolding of misfolded proteins; utilizes ATPase activity to help fold; co-chaperones are DnaJ and GrpE; multiple copies in some bacteria" /codon_start=1 /transl_table=11 /product="molecular chaperone DnaK" /protein_id="NP_214864.1" /db_xref="GI:15607491" /db_xref="GeneID:885946" /translation="MARAVGIDLGTTNSVVSVLEGGDPVVVANSEGSRTTPSIVAFAR NGEVLVGQPAKNQAVTNVDRTVRSVKRHMGSDWSIEIDGKKYTAPEISARILMKLKRD AEAYLGEDITDAVITTPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKG EKEQRILVFDLGGGTFDVSLLEIGEGVVEVRATSGDNHLGGDDWDQRVVDWLVDKFKG TSGIDLTKDKMAMQRLREAAEKAKIELSSSQSTSINLPYITVDADKNPLFLDEQLTRA EFQRITQDLLDRTRKPFQSVIADTGISVSEIDHVVLVGGSTRMPAVTDLVKELTGGKE PNKGVNPDEVVAVGAALQAGVLKGEVKDVLLLDVTPLSLGIETKGGVMTRLIERNTTI PTKRSETFTTADDNQPSVQIQVYQGEREIAAHNKLLGSFELTGIPPAPRGIPQIEVTF DIDANGIVHVTAKDKGTGKENTIRIQEGSGLSKEDIDRMIKDAEAHAEEDRKRREEAD VRNQAETLVYQTEKFVKEQREAEGGSKVPEDTLNKVDAAVAEAKAALGGSDISAIKSA MEKLGQESQALGQAIYEAAQAASQATGAAHPGGEPGGAHPGSADDVVDAEVVDDGREA K" gene 421709..422416 /gene="grpE" /locus_tag="Rv0351" /db_xref="GeneID:886497" CDS 421709..422416 /gene="grpE" /locus_tag="Rv0351" /function="STIMULATES, JOINTLY WITH DNAJ|Rv0352, THE ATPASE ACTIVITY OF DNAK|Rv0350. HELPS TO RELEASE ADP FROM DNAK THUS ALLOWING DNAK TO RECYCLE MORE EFFICIENTLY. SEEMS TO BE REGULATED NEGATIVELY BY HSPR (Rv0353 PRODUCT)." /experiment="experimental evidence, no additional details recorded" /note="with DnaK and DnaJ acts in response to hyperosmotic and heat shock by preventing the aggregation of stress-denatured proteins; may act as a thermosensor" /codon_start=1 /transl_table=11 /product="heat shock protein GrpE" /protein_id="NP_214865.1" /db_xref="GI:15607492" /db_xref="GeneID:886497" /translation="MTDGNQKPDGNSGEQVTVTDKRRIDPETGEVRHVPPGDMPGGTA AADAAHTEDKVAELTADLQRVQADFANYRKRALRDQQAAADRAKASVVSQLLGVLDDL ERARKHGDLESGPLKSVADKLDSALTGLGLVAFGAEGEDFDPVLHEAVQHEGDGGQGS KPVIGTVMRQGYQLGEQVLRHALVGVVDTVVVDAAELESVDDGTAVADTAENDQADQG NSADTSGEQAESEPSGS" misc_feature 422132..422266 /gene="grpE" /locus_tag="Rv0351" /note="PS01071 grpE protein signature" gene 422452..423639 /gene="dnaJ1" /locus_tag="Rv0352" /db_xref="GeneID:886495" CDS 422452..423639 /gene="dnaJ1" /locus_tag="Rv0352" /function="ACTS AS A CO-CHAPERONE. STIMULATES, JOINTLY WITH GRPE|Rv0351, THE ATPASE ACTIVITY OF DNAK|Rv0350. SEEMS TO BE REGULATED NEGATIVELY BY HSPR (Rv0353 PRODUCT)." /experiment="experimental evidence, no additional details recorded" /note="chaperone Hsp40; co-chaperone with DnaK; Participates actively in the response to hyperosmotic and heat shock by preventing the aggregation of stress-denatured proteins and by disaggregating proteins, also in an autonomous, dnaK-independent fashion" /codon_start=1 /transl_table=11 /product="chaperone protein DnaJ" /protein_id="YP_177719.1" /db_xref="GI:57116724" /db_xref="GeneID:886495" /translation="MAQREWVEKDFYQELGVSSDASPEEIKRAYRKLARDLHPDANPG NPAAGERFKAVSEAHNVLSDPAKRKEYDETRRLFAGGGFGGRRFDSGFGGGFGGFGVG GDGAEFNLNDLFDAASRTGGTTIGDLFGGLFGRGGSARPSRPRRGNDLETETELDFVE AAKGVAMPLRLTSPAPCTNCHGSGARPGTSPKVCPTCNGSGVINRNQGAFGFSEPCTD CRGSGSIIEHPCEECKGTGVTTRTRTINVRIPPGVEDGQRIRLAGQGEAGLRGAPSGD LYVTVHVRPDKIFGRDGDDLTVTVPVSFTELALGSTLSVPTLDGTVGVRVPKGTADGR ILRVRGRGVPKRSGGSGDLLVTVKVAVPPNLAGAAQEALEAYAAAERSSGFNPRAGWA GNR" misc_feature 422605..422664 /gene="dnaJ1" /locus_tag="Rv0352" /note="PS00636 Nt-dnaJ domain signature" misc_feature 422980..423054 /gene="dnaJ1" /locus_tag="Rv0352" /note="PS00637 CXXCXGXG dnaJ domain signature" gene 423639..424019 /gene="hspR" /locus_tag="Rv0353" /db_xref="GeneID:885929" CDS 423639..424019 /gene="hspR" /locus_tag="Rv0353" /function="INVOLVED IN TRANSCRIPTIONAL REGULATION (REPRESSION) OF HEAT SHOCK PROTEINS e.g. DNAK|Rv0350, GRPE|Rv0351, DNAJ1|Rv0352. BINDS TO THREE INVERTED REPEATS (IR1-IR3) IN THE PROMOTER REGION OF THE DNAK OPERON. INDUCTION: BY HEAT SHOCK." /experiment="experimental evidence, no additional details recorded" /note="Rv0353, (MTCY13E10.13), len: 126 aa. Probable hspR, heat shock regulatory protein (see Stewart et al., 2001), merR family, highly similar to others e.g. HspR|P40183 heat shock regulatory protein from Streptomyces coelicolor (151 aa), FASTA scores: E(): 4.9e-22, (55.7% identity in 140 aa overlap), that binds to three inverted repeats (IR1-IR3) in the promoter region of the dnaK operon. Has possible coiled coil region in C-terminal half. BELONGS TO THE MERR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="HEAT shock protein transcriptional repressor HspR" /protein_id="NP_214867.1" /db_xref="GI:15607494" /db_xref="GeneID:885929" /translation="MAKNPKDGESRTFLISVAAELAGMHAQTLRTYDRLGLVSPRRTS GGGRRYSLHDVELLRQVQHLSQDEGVNLAGIKRIIELTSQVEALQSRLQEMAEELAVL RANQRREVAVVPKSTALVVWKPRR" gene complement(424269..424694) /gene="PPE7" /locus_tag="Rv0354c" /db_xref="GeneID:886498" CDS complement(424269..424694) /gene="PPE7" /locus_tag="Rv0354c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0354c, (MTCY13E10.14c), len: 141 aa. Member of the Mycobacterium tuberculosis PPE family, similar to others e.g. MTCY63_9 from Mycobacterium tuberculosis (2411 aa), FASTA scores: E(): 3.6e-11, (47.6% identity in 103 aa overlap). Possible continuation of ORF upstream, but no sequence error apparent." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177720.1" /db_xref="GI:57116725" /db_xref="GeneID:886498" /translation="MSVCVIYIPFKGCVKHVSVTIPITTEHLGPYEIDASTINPDQPI DTAFTQTLDFAGSGTVGAFPFGFGWQQSPGFFNSTTTPSSGFFNSGAGGASGFLNDAA AAVSGLGNVFTETSGFFNAGGVGIRASKTSATCCRAGRT" gene complement(424777..434679) /gene="PPE8" /locus_tag="Rv0355c" /db_xref="GeneID:886491" CDS complement(424777..434679) /gene="PPE8" /locus_tag="Rv0355c" /function="UNKNOWN" /note="Rv0355c, (MTCY13E10.15c, MTCY13E10.16c, MTCY13E10.17c) len: 3300 aa. Member of the Mycobacterium tuberculosis PPE family, similar to others e.g. AL009198|MTV004_5 from Mycobacterium tuberculosis (3716 aa), FASTA scores: opt: 2906, E(): 0, (40.9% identity in 3833 aa overlap); MTV004_3 FASTA scores: (39.0% identity in 3531 aa overlap); etc. Gene contains large number of clustered Major Polymorphic Tandem Repeats (MPTR). Related to MTCY13E10.16c, E(): 0; MTCY13E10.17c, E(): 0; MTCY48.17, E(): 0; MTCY98.0034c, E(): 0; MTCY03C7.23 E(): 0; MTCY98.0031c, E(): 0; MTCY31.06c, E(): 5.6e-17; MTCY359.33, E(): 2.3e-16." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177721.1" /db_xref="GI:57116726" /db_xref="GeneID:886491" /translation="MSFAVLPPEINSARLYVGAGLAPMLDAAAAWDGLADELGSAAAS FSAVTAGLAGSSWLGAASTAMTGAAAPYLGWLSAAAAQAQQAATQTRLAAAAFEAALA ATVHPAIISANRALFVSLVVSNLLGQNAPAIAATEAAYEQMWAQDVAAMFGYHAGASA AVSALTPFGQALPTVAGGGALVSAAAAQVTTRVFRNLGLANVGEGNVGNGNVGNFNLG SANIGNGNIGSGNIGSSNIGFGNVGPGLTAALNNIGFGNTGSNNIGFGNTGSNNIGFG NTGDGNRGIGLTGSGLLGFGGLNSGTGNIGLFNSGTGNVGIGNSGTGNWGIGNSGNSY NTGFGNSGDANTGFFNSGIANTGVGNAGNYNTGSYNPGNSNTGGFNMGQYNTGYLNSG NYNTGLANSGNVNTGAFITGNFNNGFLWRGDHQGLIFGSPGFFNSTSAPSSGFFNSGA GSASGFLNSGANNSGFFNSSSGAIGNSGLANAGVLVSGVINSGNTVSGLFNMSLVAIT TPALISGFFNTGSNMSGFFGGPPVFNLGLANRGVVNILGNANIGNYNILGSGNVGDFN ILGSGNLGSQNILGSGNVGSFNIGSGNIGVFNVGSGSLGNYNIGSGNLGIYNIGFGNV GDYNVGFGNAGDFNQGFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNIASGWNSGTG NSGLFNSGTNNVGIFNAGTGNVGIANSGTGNWGIGNPGTDNTGILNAGSYNTGILNAG DFNTGFYNTGSYNTGGFNVGNTNTGNFNVGDTNTGSYNPGDTNTGFFNPGNVNTGAFD TGDFNNGFLVAGDNQGQIAIDLSVTTPFIPINEQMVIDVHNVMTFGGNMITVTEASTV FPQTFYLSGLFFFGPVNLSASTLTVPTITLTIGGPTVTVPISIVGALESRTITFLKID PAPGIGNSTTNPSSGFFNSGTGGTSGFQNVGGGSSGVWNSGLSSAIGNSGFQNLGSLQ SGWANLGNSVSGFFNTSTVNLSTPANVSGLNNIGTNLSGVFRGPTGTIFNAGLANLGQ LNIGSANLGDFNLGSGNVGSFNVFSGNQGSYNIGPANLGNYNIGFANLGNYNIGFGNA GDFNQGFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNFAGGWNSGTANIGLFNSGTN NVGIGNSGTGNWGIGNSGSGNTGIGNTGSTNTGFFNTGIVNTGVANAGSYNTGWYNTG DTNTGIANLGDFNTGFYNTGNFSTGFANQGDIATGAFITGDMGNGAFWRGDQQGLFSA GYRVHVPEIPAHVTVEVPVNIPITASFTNTVYSGITLEQINFGFTIDIAGIPLLAGAI SKAVLPPITGTGPAITVNIGDPGGSTAIRIPATASVGPFDVTFVNIAATTGFFNATTD PSSGFFNGGPGTVSGIANIGANISGFQNVANSATSGFNNYGSLQSGLANLGDTVSGVF NTGIGAPANVSGMFNIGSNLAGFFHDQATGMSMFNLGLGNIGQFNVGFSNVGDSNAGL ANIGSFNLGSGNLGSFNVFGGNQGSYNIGPANLGNYNIGLGNLGSYNFGFGNAGDFNL GFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNFAGGWNSGSGNSGLFNSGTNNIGLF NSGTGNIGIGNSGTGNWGIANTGDTNTGIFNTGDVNTGLLNAGNVNTGIFNTGHYNTG SFNAGSFNTAGFNPGSYNTGYLNTGSYNTGLANSGDVNTGGFITGNYSNGFWWRGDYQ GLAGISQTITVPDTAVPVKLHVPIFLDIPVTGTLGTFTVHGFRFPEITGDIFLIGIPF NAATLDAFSFPNISIVLPNIGINLGSGPDPLIDIAGTGGLLPIKIPLIDIPAAPGFGN STTTPSSGFFNAGTGTVSGVGNVGSNSSGFFNLTSGSSGISGVQNFGELISGGFNFGN TVSGLVNASTLGLSMPANLSGGGNVGATVAGFVNNTQILNLGFGNVGSGNVGHGNIGD SNVGLGNLGNANVGHGNIGSFNVFSGNRGSYNIGPANLGNYNIGLGNLGSYNFGFGNA GDFNLGFANSGSNNIGFANTGNNNIGIGLSGHNQQGFGSWNSGTANTGLFNSGTNNIG LFNSGTGNIGIGNSGIGNTGIGNPGVGNTGLGNSGTGNWGLWNPGTGNMGVANVGTYN TGGYNVGSTNTGIANVGIANTGSYNTGSTNTGSFNDGDFNTGFYNTGDYNTGFYNTGD VNTGAFIGGNFSNGAFWQSDHQGQWGAHYAITVPQIPLLNFSLNIPVNIPIHLDFGTL AVNGFQIPAITLRALGVTHFSVGPIIVPRIAGTLPVIDINIGDPGGSSSIPITITSGA GPVVIPLLDIPPAPGFGNSTTGPSSGFFNSGTGSSSGFGNVGANNSGFWNTAFAGIGN SGLQNFGSLQSGWANLGNTVSGFYNTSAADFATPANLSGLSNVGADLTGVLRGPNGST FNAGLANLGQFNVGSANLGSANLGSANLGSANLGNSNVGFGNIGNANIGGANIGDFNV GIANTGPGLTAAVNNIGIGNTGNYNIGVGNTGNYNIGFGNTGNNNIGIGLSGDNQIGF GPLNAGIANMGLFNLGDNNFGMANAGNFNQGIANTGNNNIGLFNTGNNNVGIWLTGDG LSGFSSLNSGAGNTGFFNSGTANTGLFNSGTGNTGLFNSGTGNVGIGNMGTGGFGVGL SGDSQVGIGGTNSGSFNIGLFNSGTGNVGIGNSGTGNVGIGNTGTGNTGIGNSGNYNT GLLNAGLVNTGIANPGNHNTGLFNIGTFNTGIANPGHYNTGSYNTGSYNTGMANAGDY GTGAFITGSMNNGLLWRADRQGLLAANYTITIERPAAFLNVDIPVNIPITGDITNVSI PAITFPRIDASGSVDIGILSGTVLAPVGPITLHGGDASAPLDTPIEIDFGPSPAINLN IGKPDGSTVINIVGGAGAGPISIPIIDLRPAPGFFNATTGPSSGFLNWGAGSASGLLN FGNNSGLYNFATSSMGNSGFQNYGSLQSGWANLGNSISGIYNTGLGAPANVSGLLNIG TNLAGWLQNGPTETTFSVGLANLGFWNLGSANIGNYNLGSANIGVYNLGSANIGDFNL GSANIGDFNLGSANIGSSNIGFGNVGPGLTAAIGNIGFGNTGNGNIGIGNTGTGNIGF GNTGNGNIGIGLTGDTMTGFGGWNSGTGNIGLFNSGTGNIGFGNSGTGNWGIGNSGDY NTGIGNTGSTNSGFFNTGLVNTGIGNSGDYNTGLFNAGNTNTGSFNPGDYNTGGFNPG NYNTGYFNPGNSNTGIANSGDVNTGAFNSGNYSNGFFWRGDYQGLGGFAYQSAVSEIP WSYDRFQH" gene complement(434830..435474) /locus_tag="Rv0356c" /db_xref="GeneID:886490" CDS complement(434830..435474) /locus_tag="Rv0356c" /function="UNKNOWN" /note="Rv0356c, (MTCY13E10.18c), len: 214 aa. Conserved hypothetical protein, equivalent to AL023514|MLCB4_12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (218 aa), FASTA scores: opt: 1067, E(): 0, (73.4% identity in 214 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214870.1" /db_xref="GI:15607497" /db_xref="GeneID:886490" /translation="MTDASVHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQ DLAVAADPGDAVWDEAAERAAALVELLSPFEADEGKAPAGRTPGLPGMGSLLLPPWTV TRYGTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRPISRTAFLHVD YRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLAEGNGLMVRLLPGQP" gene complement(435471..436769) /gene="purA" /locus_tag="Rv0357c" /db_xref="GeneID:886484" CDS complement(435471..436769) /gene="purA" /locus_tag="Rv0357c" /EC_number="6.3.4.4" /function="INVOLVED IN AMP BIOSYNTHESIS (FIRST COMMITTED STEP). PLAYS AN IMPORTANT ROLE IN THE DE NOVO PATHWAY OF PURINE NUCLEOTIDE BIOSYNTHESIS [CATALYTIC ACTIVITY: GTP + IMP + L-aspartate = GDP + phosphate + adenylosuccinate]." /note="catalyzes the formation of N6-(1,2,-dicarboxyethyl)-AMP from L-aspartate, inosine monophosphate and GTP in AMP biosynthesis" /codon_start=1 /transl_table=11 /product="adenylosuccinate synthetase" /protein_id="NP_214871.1" /db_xref="GI:15607498" /db_xref="GeneID:886484" /translation="MPAIVLIGAQWGDEGKGKATDLLGGRVQWVVRYQGGNNAGHTVV LPTGENFALHLIPSGVLTPGVTNVIGNGVVIDPGVLLNELRGLQDRGVDTAKLLISAD AHLLMPYHIAIDKVTERYMGSKKIGTTGRGIGPCYQDKIARIGIRVADVLDPEQLTHK VEAACEFKNQVLVKIYNRKALDPAQVVDALLEQAEGFKHRIADTRLLLNAALEAGETV LLEGSQGTLLDVDHGTYPYVTSSNPTAGGAAVGSGIGPTRIGTVLGILKAYTTRVGSG PFPTELFDEHGEYLSKTGREFGVTTGRRRRCGWFDAVIARYAARVNGITDYFLTKLDV LSSLESVPVCVGYEIDGRRTRDMPMTQRDLCRAKPVYEELPGWWEDISGAREFDDLPA KARDYVLRLEQLAGAPVSCIGVGPGREQTIVRRDVLQDRP" gene 436860..437507 /locus_tag="Rv0358" /db_xref="GeneID:886493" CDS 436860..437507 /locus_tag="Rv0358" /function="UNKNOWN" /note="Rv0358, (MTCY13E10.20), len: 215 aa. Conserved hypothetical protein, highly similar to ML0281|AL023514|MLCB4_14 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (229 aa), FASTA scores: opt: 852, E(): 0, (62.9% identity in 229 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214872.1" /db_xref="GI:15607499" /db_xref="GeneID:886493" /translation="MYTAENAPGVAVLLSGDADVPGPLTGLPTHQDNLDTVIGRYSRL IVVGADADLGAVLTRLLRTDRLDVEVGYVPRRRSPATRAYRLPAGRRAARRARCGVAR RVPLIRDETGSVIVGRAQWLPAEEQALIHGEAVVDDTVLFDGDVAGVCIEPTLTLPGL RAAVDGAGKWRRWIGGRAAQLGTTGAAVLRDGVAAPRPVRRSTFYRNVEGWLLVR" gene 437518..438297 /locus_tag="Rv0359" /db_xref="GeneID:886482" CDS 437518..438297 /locus_tag="Rv0359" /function="UNKNOWN" /note="Rv0359, (MTCY13E10.21), len: 259 aa. Probable conserved integral membrane protein, highly similar to hypothetical or other membrane proteins e.g. AL133220|SCC75A_6|T50569 probable membrane protein from Streptomyces coelicolor (265 aa), FASTA scores: opt: 642, E(): 0, (43.1% identity in 248 aa overlap); P70995 HYPOTHETICAL 24.7 kDa PROTEIN from Bacillus subtilis (219 aa), FASTA scores: E(): 1.5e-12, (31.3% identity in 192 aa overlap). Contains neutral zinc metallopeptidases, zinc-binding region signature (PS00142)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_214873.1" /db_xref="GI:15607500" /db_xref="GeneID:886482" /translation="MSETGQRESVRPSPIFLGLLGLTAVGGALAWLAGETVQPLAYAG VFVMVIAGWLVSLCLHEFGHAFTAWRFGDHDVAVRGYLTLDPRRYSHPMLSLGLPMLF IALGGIGLPGAAVYVHTWFMTTARRTLVSLAGPTVNLALAMLLLAATRLLFDPIHAVL WAGVAFLAFLQLTALVLNLLPIPGLDGYAALEPHLRPETQRALAPAKQFALVFLLVLF LAPTLNGWFFGVVYWLFDLSGVSHRLAAAGSVLARFWSIWF" misc_feature 437686..437715 /locus_tag="Rv0359" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(438302..438739) /locus_tag="Rv0360c" /db_xref="GeneID:886480" CDS complement(438302..438739) /locus_tag="Rv0360c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0360c, (MTCY13E10.22c), len: 145 aa. Conserved hypothetical protein, equivalent to AL023514|MLCB4_16|CAA18948.1|AL023514|MLCB4.27c hypothetical protein from Mycobacterium leprae (137 aa), FASTA scores: opt: 793, E(): 0, (85.4% identity in 137 aa overlap). And similar to AL049754|SCH10_25c|T36537 hypothetical protein from Streptomyces coelicolor (143 aa), FASTA scores: opt: 497, E(): 3.2e-27, (55.8% identity in 138 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214874.1" /db_xref="GI:15607501" /db_xref="GeneID:886480" /translation="MTKRTITPMTSMGDLLGPEPILLPGDSDAEAELLANESPSIVAA AHPSASVAWAVLAEGALADDKTVTAYAYARTGYHRGLDQLRRHGWKGFGPVPYSHQPN RGFLRCVAALARAAAAIGETDEYGRCLDLLDDCDPAARPALGL" gene 438822..439649 /locus_tag="Rv0361" /db_xref="GeneID:886478" CDS 438822..439649 /locus_tag="Rv0361" /function="UNKNOWN" /note="Rv0361, (MTCY13E10.23), len: 275 aa. Probable conserved membrane protein (has hydrophobic stretch from residues 132-156), equivalent to AL023514|MLCB4_17|AA18949.1|AL023514 putative membrane protein from Mycobacterium leprae (292 aa), FASTA scores: opt: 1044, E(): 0, (58.6% identity in 292 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214875.1" /db_xref="GI:15607502" /db_xref="GeneID:886478" /translation="MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETE TVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPP RMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGK HSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSA AKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN" gene 439871..441253 /gene="mgtE" /locus_tag="Rv0362" /db_xref="GeneID:886476" CDS 439871..441253 /gene="mgtE" /locus_tag="Rv0362" /function="THOUGHT TO BE INVOLVED IN Mg2+ TRANSPORT. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0362, (MTCY13E10.24), len: 460 aa. Possible mgtE, magnesium (Mg2+) transport transmembrane protein; C-terminal region is highly similar to MGTE|G780283 putative Mg2+ transporter from Providencia stuarti (314 aa), FASTA scores: E(): 0, (47.2% identity in 307 aa overlap) (N-terminus extends approx. 150 aa further upstream compared to P. stuarti ORF). Also similar in part to others e.g. AAK20879.1|AF334760_1|AF334760 putative Mg2+ transporter from Aeromonas hydrophila (455 aa); NP_231292.1|NC_002505 magnesium transporter from Vibrio cholerae (451 aa); NP_102305.1|NC_002678 Mg2+ transport protein from Mesorhizobium loti (454 aa); etc. Also similar to Rv1232c|MTV006.04c from Mycobacterium tuberculosis (435 aa). Extended hydrophobic segment spanning last 130 residues. BELONG TO THE MGTE FAMILY." /codon_start=1 /transl_table=11 /product="Mg2+ transport transmembrane protein MgtE" /protein_id="NP_214876.1" /db_xref="GI:15607503" /db_xref="GeneID:886476" /translation="MSIRPAENSTLDIRHVIGIGTPKAVDLWLDVVTELPDRARELGS LSKAELGKLGPLLDGTNAVELFESIDDKLAAEALHAMDPSLAATFLEALDSDHAANIL REFKEPKREALLTLLPLERAMVLRGLLSWPEDCAAAHMVPETLTVRPNMTVSQAVASV RERASGLRSDARTTAYVYVTDADSHLLGVIAFRALVLANPEQRVRELMGDDLIVVSPL TDKELAAQTIMGHNLMAVPVVDADNRLLGIIAEDEAIDIAEEEATEDAERQGGSAPLE VPYLRASPWLLWRKRVVWLLVLFAAEAYTGSVLRAFSDEMEAVIALAFFIPLLIGTGG NTGTQIATTLVRAMATGQVRFRDVPAVLAKELSTGVLVGLTMAAAAVVRAWTLGVGPQ VTLTVALTVAAIVVWSSLVAAVLPPLLKKLRIDPAIVSGPMIATIVDGTGLLIYFLVA HLTLTELHGL" gene complement(441265..442299) /gene="fba" /locus_tag="Rv0363c" /db_xref="GeneID:886474" CDS complement(441265..442299) /gene="fba" /locus_tag="Rv0363c" /EC_number="4.1.2.13" /function="INVOLVED IN GLYCOLYSIS (AT THE SIXTH STEP) [CATALYTIC ACTIVITY: D-FRUCTOSE 1,6-BISPHOSPHATE = GLYCERONE PHOSPHATE + D-GLYCERALDEHYDE 3-PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of glycerone phosphate and glyceraldehyde 3-phosphate from fructose 1,6, bisphosphate" /codon_start=1 /transl_table=11 /product="fructose-bisphosphate aldolase" /protein_id="NP_214877.1" /db_xref="GI:15607504" /db_xref="GeneID:886474" /translation="MPIATPEVYAEMLGQAKQNSYAFPAINCTSSETVNAAIKGFADA GSDGIIQFSTGGAEFGSGLGVKDMVTGAVALAEFTHVIAAKYPVNVALHTDHCPKDKL DSYVRPLLAISAQRVSKGGNPLFQSHMWDGSAVPIDENLAIAQELLKAAAAAKIILEI EIGVVGGEEDGVANEINEKLYTSPEDFEKTIEALGAGEHGKYLLAATFGNVHGVYKPG NVKLRPDILAQGQQVAAAKLGLPADAKPFDFVFHGGSGSLKSEIEEALRYGVVKMNVD TDTQYAFTRPIAGHMFTNYDGVLKVDGEVGVKKVYDPRSYLKKAEASMSQRVVQACND LHCAGKSLTH" gene 442395..443078 /locus_tag="Rv0364" /db_xref="GeneID:886473" CDS 442395..443078 /locus_tag="Rv0364" /function="UNKNOWN" /note="Rv0364, (MTCY13E10.26), len: 227 aa. Possible conserved transmembrane protein, equivalent to O69601|Y364_MYCLE|ML0287|CAA18951.1|AL023514|AL023514|MLC B4 _19 HYPOTHETICAL 24.3 KDA PROTEIN from Mycobacterium leprae (222 aa), FASTA scores: opt: 1027, E(): 0, (66.1% identity in 227 aa overlap). Shows strong similarity to DEDA_ECOLI|P09548 DedA PROTEIN protein from Escherichia coli FASTA scores: E(): 1.3e-28, (39.5% identity in 195 aa overlap). Similar also to Mycobacterium tuberculosis DedA protein Rv2637|MTCY441.0." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214878.1" /db_xref="GI:15607505" /db_xref="GeneID:886473" /translation="MSTAVTAMPDILDPMYWLGANGVFGSAVLPGILIIVFIETGLLF PLLPGESLLFTGGLLSASPAPPVTIGVLAPCVALVAVLGDQTAYFIGRRIGPALFKKE DSRFFKKHYVTESHAFFEKYGKWTIILARFVPIARTFVPVIAGVSYMRYPVFLGFDIV GGVAWGAGVTLAGYFLGSVPFVHMNFQLIILAIVFVSLLPALVSAARVYRARRNAPQS DPDPLVLPE" gene complement(443067..444197) /locus_tag="Rv0365c" /db_xref="GeneID:886487" CDS complement(443067..444197) /locus_tag="Rv0365c" /function="UNKNOWN: MAY BE INVOLVED IN THE ABILITY TO SURVIVE IN MACROPHAGES." /experiment="experimental evidence, no additional details recorded" /note="Rv0365c, (MTCY13E10.27c), len: 376 aa (start uncertain). Conserved hypothetical protein (see citation below), very similar to G388212|CAA35191.1, a truncated ORF immediately upstream of the Corynebacterium glutamicum fda gene encoding fructose-1,6-biphosphate aldolase (304 aa), FASTA scores: E(): 7.1e-19, (42.2% identity in 296 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214879.1" /db_xref="GI:15607506" /db_xref="GeneID:886487" /translation="MNLANRAASAETAVTQRHLRRLWALPGTQLAVVAWPSTRRDRLF GSWHYWWQAHLLDCLVDAQLRDPQPQRRARINRQVRSHRVRNNFSWLNSYYDDMAWLA LALERADRVAGVRRRRALPKLTNQFVEAWVPEDGGGIPWRKQDQFFNAPANGPAGLFL ARYPDQYGKRLKRAEQMADWIDRTLIDPETHLVFDGIKAGSLVRAQYTYCQGVVLGLE TELAVRTGPAARARHCARVHRLVAAVNEHMAPLGVLRGAGGGDGGLFAGITARYLALV ATTLPGDSADDAAARDTARAIVLASAQSAWDYRQTVDGLPVFGAFWDREAELPTAGGE QARSVRGAVHSSAIAERDLSVQLSGWMLMEAAHSAAAVSSLG" gene complement(444222..444815) /locus_tag="Rv0366c" /db_xref="GeneID:886471" CDS complement(444222..444815) /locus_tag="Rv0366c" /function="UNKNOWN" /note="Rv0366c, (MTV036.01c), len: 197 aa. Conserved hypothetical protein, showing weak similarity to HI1395|P44173|YD95_HAEIN HYPOTHETICAL PROTEIN from Haemophilus influenzae (140 aa), FASTA scores: opt: 152, E(): 0.0015, (27.0% identity in 126 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00850 Glycine radical signature. TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214880.1" /db_xref="GI:15607507" /db_xref="GeneID:886471" /translation="MKRLDLVAGPNGAGKSTFVALTLAPLLPGIVFVNADEIAKQRWP DDPTSHAYQAAQVAADTRARLIDLGRPFIAETVFSHPSKLELIRTARTAGYTVVLHVL VIPEGLAVERVRHRVAAGGHDVPETKIRERHRRLAELVAQAITLADGATVYDNSRLAG PRIVAQFSGGGIIGRACWPSWTPPPLMSRWSNRPETA" misc_feature complement(444525..444551) /locus_tag="Rv0366c" /note="PS00850 Glycine radical signature" misc_feature complement(444768..444791) /locus_tag="Rv0366c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(444844..445233) /locus_tag="Rv0367c" /db_xref="GeneID:886468" CDS complement(444844..445233) /locus_tag="Rv0367c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0367c, (MTV036.02c), len: 129 aa. Hypothetical unknown protein. TBparse score is 0.850." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214881.1" /db_xref="GI:15607508" /db_xref="GeneID:886468" /translation="MPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAV SNQHTASRRRVEAALAGHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGI TTVALNDAGDIVEHRPDGTSVVLAATP" gene complement(445314..446525) /locus_tag="Rv0368c" /db_xref="GeneID:886469" CDS complement(445314..446525) /locus_tag="Rv0368c" /function="UNKNOWN" /note="Rv0368c, (MTV036.03c), len: 403 aa. Conserved hypothetical protein, showing some similarity to AJ224684|BJAJ4684_4 cooxS protein from Bradyrhizobium japonicum (422 aa), FASTA scores: opt: 341, E(): 4.3e-13, (27.4% identity in 387 aa overlap); Rv2425c|MTCY428_22 hypothetical protein from Mycobacterium tuberculosis FASTA score: (30.7% identity in 238 aa overlap). Contains PS00213 Lipocalin signature. TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214882.1" /db_xref="GI:15607509" /db_xref="GeneID:886469" /translation="MATPALLPGVDLAAFAAALAARLRDAGIPVSASGQASLVQALQQ LVPRTPAALYWGARLTLVSRVDELATFDAVFASLFGVFGSAEPDGANRPPPPIAGPRT PVAGVGHRAKRRSCAAQAQNLPWDTRSLTMASAGQGGPSRTLPDVLPSRIVARADEPF DQFDPDDLRLLGAWLEATMARWPRRRSMRFESSPHGKRIDLRATMNASRSTGWESVLL ARIRPRRRPRRVLLLCDVSRSMQPYAAIYLRLMRAAVLRRAGGHPEVFAFSTSLTRLT SVLSHRSAEMALHRANARVTDRYGGTFIGRSVAALLAPPHGNALRGAVVIIASDGWDS DPPDVLVHALTRVRRRAELLVWLNPRAAHPEFQPRAGSMAAALPYCDLFLPAHSLAGL HQLLLALAGAR" misc_feature complement(445995..446036) /locus_tag="Rv0368c" /note="PS00213 Lipocalin signature" gene complement(446531..447046) /locus_tag="Rv0369c" /db_xref="GeneID:886485" CDS complement(446531..447046) /locus_tag="Rv0369c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0369c, (MTV036.04c), len: 171 aa. Possible membrane protein oxidoreductase (EC 1.-.-.-), similar to ORF 4 of the Pseudomonas thermocarboxydovorans protein of cutA-cutB-cutC gene cluster: X77931|PTC2CUTAC_4 ORF4 from Pseudomonas thermocarboxydovorans (171 aa), FASTA scores: opt: 226, E(): 9.8e-08, (31.3% identity in 166 aa overlap). Also similar to MTV036.05, MTV036.08, MTV036.09, and MTV026.10." /codon_start=1 /transl_table=11 /product="membrane oxidoreductase" /protein_id="NP_214883.1" /db_xref="GI:15607510" /db_xref="GeneID:886485" /translation="MPGAQLIGHEGDEYLGKVKVKVGPVTSEFSGKVHFVEQDRNQHR AVFDAKGKEARGTGNAAATVAAQLHEVGERTRVTVDTDLKIVGKLAQFGSGMLQQVSE KLLGQFVDSLEAELAAQSSESPQGTPPATEAAPIDLLQLADGGQLKKYGSALLAALTV LLLIWVLRRRR" gene complement(447147..448043) /locus_tag="Rv0370c" /db_xref="GeneID:886465" CDS complement(447147..448043) /locus_tag="Rv0370c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0370c, (MTV036.05c), len: 298 aa. Possible oxidoreductase (EC 1.-.-.-), similar to many hypothetical proteins, but also similar to ORF4|X82447|OCCOXMSL4_4 Protein of coxMSL gene cluster from Pseudomonas/Oligotropha carboxidovorans (295 aa), FASTA scores: opt: 851, E(): 0, (48.2% identity in 282 aa overlap); AJ224684|BJAJ4684_3 cooxS from Bradyrhizobium japonicum (302 aa), FASTA scores: opt: 881, E(): 0, (47.6% identity in 290 aa overlap). Also highly similar to MTCY428_21 from Mycobacterium tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_214884.1" /db_xref="GI:15607511" /db_xref="GeneID:886465" /translation="MTFASPDDVIRRFDEQNYLLDTGTASAIYLAVTLGRPLLLEGEP GVGKTTAAKTLAVVLDTTLIRLQCYEGLTANEALYDWNYQRQLLSIRLAEARGKGISD ISEADLYTEAYLVDRPILRCVRHRGPTPPVLLIDEIDRADDEFEALLLEFLGESAVTV PELGTFLAECPPIAVLTSNRSRDLHDALRRRCLYHWIDYPGPDRAAAIVRRTVPGATA PLIENATQFVCTARDLDLDKPPGVAETIDWVAALVALGVADLTAADSSPALASLGALA KTPDDRTQIRDAYQAFTECSHA" misc_feature complement(447897..447920) /locus_tag="Rv0370c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(448040..448633) /locus_tag="Rv0371c" /db_xref="GeneID:886463" CDS complement(448040..448633) /locus_tag="Rv0371c" /function="UNKNOWN" /note="Rv0371c, (MTV036.06c), len: 197 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. AL132824|SCAH10.09c|CAB60163.1|AL132824 hypothetical protein from Streptomyces coelicolor (207 aa), FASTA scores: opt: 247, E(): 4.5e-09, (32.3% identity in 195 aa overlap). Also weak similarity with YURE|D70017|Z99120|BSUB0017_134 hypothetical protein yurE from Bacillus subtilis (197 aa), FASTA scores: opt: 217, E(): 2.5e-08, (27.0% identity in 174 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214885.1" /db_xref="GI:15607512" /db_xref="GeneID:886463" /translation="MTATQITGVVLAAGRSNRLGTPKQLLPYRDTTVLGATLDVARQA GFDQLILTLGGAASAVRAAMALDGTDVVVVEDVERGCAASLRVALARVHPRATGIVLM LGDQPQVAPATLRRIIDVGPATEIMVCRYADGVGHPFWFSRTVFGELARLHGDKGVWK LVHSGRHPVRELAVDGCVPLDVDTWDDYRRLLESVPS" gene complement(448630..449385) /locus_tag="Rv0372c" /db_xref="GeneID:886460" CDS complement(448630..449385) /locus_tag="Rv0372c" /function="UNKNOWN" /note="Rv0372c, (MTV036.07c), len: 251 aa. Conserved hypothetical protein, showing some similarity with CAB76248.1|X82447|COXF CoxF protein from Pseudomonas/Oligotropha carboxidovorans (280 aa); AJ224684|BJAJ4684_6 cooxS from Bradyrhizobium japonicum (176 aa), FASTA scores: opt: 186, E(): 1.6e-05, (41.1% identity in 95 aa overlap). Also similar to upstream ORF Rv0376c from Mycobacterium tuberculosis (380 aa), FASTA scores: E(): 6.8e-07, (31.0% identity in 277 aa overlap). TBparse score is 0.862." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214886.1" /db_xref="GI:15607513" /db_xref="GeneID:886460" /translation="MSISDRAAQLVAARTPFVRATVVRAQQPTSARPGDEAILLADGT IEGFVGGHCAQNSVRKAAMGVLQAGESVLLRVLPDGDVHFPEAPGACVVVNPCLAGGS LEIFLTPQLPAPLIQIYGETPIADALIELCGLLGYDARRDTDPADTDALPTAIVIASH GGPEAEIIRTALDNGVGYVGLVASTVRGASILDSLDLSDAERARVHTPVGLAIGAKTP AEIAVSIAAELIATLRGGGPRGRKALADENGGA" gene complement(449404..451803) /locus_tag="Rv0373c" /db_xref="GeneID:886472" CDS complement(449404..451803) /locus_tag="Rv0373c" /EC_number="1.2.99.2" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: CO + H(2)O + acceptor = CO(2) + reduced acceptor]." /note="Rv0373c, (MTV036.08c), len: 799 aa. Probable carbon monoxide dehydrogenase, large chain (EC 1.2.99.2), highly similar to others e.g. AAD00363.1| U80806|CUTL carbon monoxide dehydrogenase large subunit CutL protein from Hydrogenophaga pseudoflava (803 aa); S49124|509391|CAA54902.1|X77931|1094915|2107180C|CUTA carbon-monoxide dehydrogenase large chain (EC 1.2.99.2) (cut operon) from Pseudomonas thermocarboxydovorans (842 aa); C56279|809566|CAA57829.1|X82447|OCCOXMSL4_3|COXL carbon-monoxide dehydrogenase large chain (EC 1.2.99.2) (cluster coxMSL) from Pseudomonas/Oligotropha carboxydovorans (809 aa), FASTA scores: opt: 2484, E(): 0, (56.0% identity in 804 aa overlap); etc." /codon_start=1 /transl_table=11 /product="carbon monoxyde dehydrogenase large subunit" /protein_id="NP_214887.1" /db_xref="GI:15607514" /db_xref="GeneID:886472" /translation="MTTIESRPPSPEDLADNAQQPCGHGRMMRKEDPRFIRGRGTYVD DVALPGMLHLAILRSPYAHARIVRIDVTAAQAHPKVKAVVTGADLAAKGLAWMPTLAN DVQAVLATDKTRFQGQEVAFVVAEDRYSARDACELVDVDYEPRDPVVDARTALDPSAP VIRTDLEGKSDNHIFDWETGDAAATEAVFAKADVVVQQEIVYPRVHPAPMETCGAVAD LDPVTGKLTLWTTSQAPHAHRTLYALVAGLPEHKIRVISPDIGGGFGNKVPIYPGYVC AIVASLLLDKPVKWMEDRSENLTSTGFARDYIMVGEIAANRDGKILAIRSNVLADHGA FNAQAAPAKYPAGFFGVFTGSYDIEAAYCHMTAVYTNKAPGGVAYACSFRITEAVYFV ERLVDCLAFELKMDPAELRLRNLLRPNQFPYQSKTGWVYDSGDYETTMRKAMNMIGYE ALRAEQKQRRARGELMGIGMSFFTEAVGAGPRKDMDILGLGMADGCELRVHPTGKAVL RLSVQTQGQGHETTFAQIVAEELGIAPDDIEVVHGDTDQTPFGLGTYGSRSTPVSGGA AALVARKVRDKAKIIASGMLEVSVADLQWEKGKFHVKGDPSAAVTIADIAMRAHGAGD LPEGIEGGLDAEVCYNPSNLTYPYGAYFCVVDIDPGTAVVKVRRFLAVDDCGTRINPM IIEGQVHGGIVDGIGMALMEMIAFDEDGNCLGGSLMDYLIPTALEVPHLETGHTVTPS PHHPIGAKGIGESATVGSPPAVVNAVVDALAPFGVRHADMPLTPSRVWEAMQGRATPP I" gene complement(451800..452279) /locus_tag="Rv0374c" /db_xref="GeneID:886462" CDS complement(451800..452279) /locus_tag="Rv0374c" /EC_number="1.2.99.2" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: CO + H(2)O + acceptor = CO(2) + reduced acceptor]." /note="Rv0374c, (MTV036.09c), len: 159 aa. Probable carbon monoxide dehydrogenase, small chain (EC 1.2.99.2), highly similar to others e.g. B56279|5822285|X82447|OCCOXMSL4_2|COXS carbon-monoxide dehydrogenase small chain (EC 1.2.99.2) from Pseudomonas/Oligotropha carboxydovorans (166 aa), FASTA scores: opt: 662, E(): 0, (59.3% identity in 150 aa overlap); CAA12063.1|AJ224684 putative carbon monoxide dehydrogenase small subunit from Bradyrhizobium japonicum (161 aa); S49123|509390|CAA54901.1|X77931|CUTC carbon-monoxide dehydrogenase small chain (EC 1.2.99.2) from Pseudomonas thermocarboxydovorans (163 aa); etc. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="carbon monoxyde dehydrogenase small subunit" /protein_id="NP_214888.1" /db_xref="GI:15607515" /db_xref="GeneID:886462" /translation="MQVNMTVNGEPVTAEVEPRMLLVHFLRDQLRLTGTHWGCDTSNC GTCVVEVDGVPVKSCTMLAVMASGHSIRTVEGLAGPDGQLDPVQEGFMRCHGLQCGFC TPGMLITARALLDRNPDPDEQTIREAISGQICRCTGYTTIVRSIQWAAAHQTVKAQS" gene complement(452294..453154) /locus_tag="Rv0375c" /db_xref="GeneID:886456" CDS complement(452294..453154) /locus_tag="Rv0375c" /EC_number="1.2.99.2" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: CO + H(2)O + acceptor = CO(2) + reduced acceptor]." /note="Rv0375c, (MTV036.10c), len: 286 aa. Probable carbon monoxide dehydrogenase, medium chain (EC 1.2.99.2), similar to others e.g. AAD00361.1|U80806|CUTM carbon monoxide dehydrogenase middle subunit from Hydrogenophaga pseudoflava (287 aa); S49122|509389|CAA54900.1|X77931|CUTB carbon-monoxide dehydrogenase medium chain (EC 1.2.99.2) from Pseudomonas thermocarboxydovorans (287 aa); A56279|809564|CAA57827.1|X82447|OCCOXMSL4_1|COXM|CODH carbon-monoxide dehydrogenase medium chain (EC 1.2.99.2) from Pseudomonas/Oligotropha carboxydovorans (288 aa), FASTA scores: opt: 594, E(): 0, (37.5% identity in 277 aa overlap); etc." /codon_start=1 /transl_table=11 /product="carbon monoxyde dehydrogenase medium subunit" /protein_id="NP_214889.1" /db_xref="GI:15607516" /db_xref="GeneID:886456" /translation="MDHAIGLLDRLGEGARVVAGGHSLLPMMKLRIANPEYLVDINDL APELGYVVVGGINNPNLVRLGAMTRHREILDSDALAAVCPIFRDAERVIADPVVRNRG TLGGSLCQADPAEDLSTVCTVLDAVCLAKGPSGEREIAIDDFLVGPYETALAHNEVLI EVRIPLRHNTSSAYAKVERRVGDWAITAAGAAVTLDGQTILAARVGLTAVNPDPVALA ELSAGLVGQPATEEVFAEAGRRAAQACTPVTDVRGTAEYKRHLAGELTVRTLRTAAGR VLGAPAAPEA" gene complement(453230..454372) /locus_tag="Rv0376c" /db_xref="GeneID:886454" CDS complement(453230..454372) /locus_tag="Rv0376c" /function="UNKNOWN" /note="Rv0376c, (MTV036.11c), len: 380 aa. Conserved hypothetical protein, highly similar to T35481|4008539|CAA22508.1|AL034492|SC6C5.10 hypothetical protein from Streptomyces coelicolor (395 aa); and AAK64260.1|AF373840_20 ORF377 hypothetical CoxI from Arthrobacter nicotinovorans (377 aa). And similar to other conserved hypothetical proteins e.g. NP_101963.1|14021136|BAB47749.1|AP002994 hypothetical protein from Mesorhizobium loti (245 aa). Note that C-terminus shows similarity with C-termini of CAB76248.1|X82447|COXF CoxF protein from Pseudomonas/Oligotropha carboxidovorans (280 aa); CAB76250.1|X82447|COXI CoxI protein from Pseudomonas/Oligotropha carboxidovorans (330 aa); and AJ224684|BJAJ4684_6 cooxS from Bradyrhizobium japonicum (176 aa), FASTA scores: E(): 1.9e-17, (47.1% identity in 138 aa overlap). Also some partial similarity with AJ224684|BJAJ4684_5 cooxS from Bradyrhizobium japonicum (107 aa), FASTA scores: opt: 321, E(): 4.2e-14, (53.3% identity in 92 aa overlap); E1184330|Z99120|YURF YURF PROTEIN from Bacillus subtilis (330 aa), FASTA scores: opt: 170, E(): 2.9e- 16, (27.5% identity in 345 aa overlap). Also similar to downstream ORF Rv0372c from Mycobacterium tuberculosis (251 aa), FASTA scores: E(): 2.1e-06, (30.7% identity in 277 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214890.1" /db_xref="GI:15607517" /db_xref="GeneID:886454" /translation="MAIWAAGDTAGVATVVRTLRSAPRPPGAAMVVAPDGSVSGSVSG GCVEGAVYELAAEVAQTGIPRLEHYGVSDDTAFAVGLTCGGIIDVFVEPVSRATFPEL GELADDIGAQRPVAIATVIAHPDERRVGRRLVIRPDTKSPVTGSLGSARADAAVIDDA RGLLAVGRSEILEYGPDGQRRGEGMEVFVSSHAPRPRMLVFGAIDFAAALARQGSFLG YRVTVCDARAVFATPARFPTADDVVVAWPHRYLAAQAEAGGIDERTVICVLTHDPKFD VPVLEVALRLGVGYVGAMGSRKTHDDRMDRLRAAGLTDAELSRLSSPIGLDLGARTPE ETAVSIAADIIARRWGGGGRPLADIAGRIHHDAQVAGEFKDYLTRH" gene 454421..455386 /locus_tag="Rv0377" /db_xref="GeneID:886452" CDS 454421..455386 /locus_tag="Rv0377" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0377, (MTV036.12), len: 321 aa. Probable transcription regulator, lysR family, showing similarity with many hypothetical transcriptional regulators lysR homolog e.g. P32484|YEIE_ECOLI|M89774 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Escherichia coli (293 aa), FASTA scores: opt: 265, E(): 4.9e-11, (28.6% identity in 266 aa overlap). Also similar to Rv2282c from Mycobacterium tuberculosis. Contains PS00044 bacterial regulatory protein lysR family signature. SEEMS TO BELONG TO THE LYSR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="LysR family transcriptional regulator" /protein_id="NP_214891.1" /db_xref="GI:15607518" /db_xref="GeneID:886452" /translation="MTPAQLRAYSAVVRLGSVRAAAAELGLSDAGVSMHVAALRKELD DPLFTRTGAGLAFTPGGLRLASRAVEILGLQQQTAIEVTEAAHGRRLLRIAASSAFAE HAAPGLIELFSSRADDLSVELSVHPTSRFRELICSRAVDIAIGPASESSIGSDGSIFL RPFLKYQIITVVAPNSPLAAGIPMPALLRHQQWMLGPSAGSVDGEIATMLRGLAIPES QQRIFQSDAAALEEVMRVGGATLAIGFAVAKDLAAGRLVHVTGPGLDRAGEWCVATLA PSARQPAVSELVGFISTPRCIQAMIPGSGVGVTRFRPKVHVTLWS" misc_feature 454469..454561 /locus_tag="Rv0377" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene 455637..455858 /locus_tag="Rv0378" /db_xref="GeneID:886450" CDS 455637..455858 /locus_tag="Rv0378" /function="UNKNOWN" /note="Rv0378, (MTV036.13), len: 73 aa. Conserved hypothetical gly-rich protein, showing some similarity to Mycobacterium tuberculosis PE_PGRS family; also similar to MTCY06H11_16|Z85982 hypothetical glycine-rich 88.5 KD protein (1011 aa), FASTA scores: opt: 237, E(): 0.0032, (58.7% identity in 63 aa overlap); MTV043_25. TBparse score is 0.860." /codon_start=1 /transl_table=11 /product="glycine rich protein" /protein_id="NP_214892.1" /db_xref="GI:15607519" /db_xref="GeneID:886450" /translation="MSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDA GASGSINGNAGDPGNSGERGAVGKPGAPG" gene 455977..456192 /gene="secE2" /locus_tag="Rv0379" /db_xref="GeneID:886449" CDS 455977..456192 /gene="secE2" /locus_tag="Rv0379" /function="THOUGHT TO BE INVOLVED IN PROTEIN TRANSPORT (EXPORT)." /note="Rv0379, (MTV036.14), len: 71 aa. Possible secE2, protein transport protein, showing similarity with P27340|S61G_SULSO|SECE PREPROTEIN TRANSLOCASE SECE SUBUNIT (PROTEIN TRANSPORT PROTEIN SEC61 GAMMA SUBUNIT HOMOLOG) from Sulfolobus acidocaldarius (65 aa), FASTA scores: opt: 79, E(): 4.7. (30.3% identity in 66 aa overlap); and hypothetical proteins e.g. Q9HPW4|VNG1446H HYPOTHETICAL PROTEIN from Halobacterium sp. strain NRC-1 (77 aa); Q9I794|PA0038 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (71 aa); etc. Also highly similar to U85467|MTU85467_1 hypothetical Mycobacterium tuberculosis protein from a patient isolate (116 aa), FASTA scores: opt: 443, E(): 7.7e-29, (98.6% identity in 71 aa overlap). Note that for Rv0379|MTV036.14, a translation initiation region different to the one in U85467|MTU85467_1 was chosen. COULD BE A PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG|Rv1440 AND SECY|Rv0732." /codon_start=1 /transl_table=11 /product="protein transport protein" /protein_id="YP_177722.1" /db_xref="GI:57116727" /db_xref="GeneID:886449" /translation="MSVYKVIDIIGTSPTSWEQAAAEAVQRARDSVDDIRVARVIEQD MAVDSAGKITYRIKLEVSFKMRPAQPR" gene complement(456268..456819) /locus_tag="Rv0380c" /db_xref="GeneID:886446" CDS complement(456268..456819) /locus_tag="Rv0380c" /EC_number="2.1.1.-" /function="POSSIBLY CAUSES METHYLATION OF RNA." /note="Rv0380c, (MTV036.15c), len: 183 aa. Possible RNA methyltransferase (EC 2.1.1.-), equivalent to CAC32002.1|AL583925 possible RNA methyltransferase from Mycobacterium leprae (182 aa). Also some similarity with others methyltransferases e.g. P19396|TRMH_ECOLI|78514|JV0043 TRNA (GUANOSINE-2'-O-)-METHYLTRANSFERASE (TRNA METHYLTRANSFERASE) from Escherichia coli (229 aa), FASTA scores: opt: 227, E(): 1.4e-09, (28.9% identity in 166 aa overlap). Also similar to Rv0881, Rv3579c, Rv1644 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="RNA methyltransferase" /protein_id="NP_214894.1" /db_xref="GI:15607521" /db_xref="GeneID:886446" /translation="MLLRDGDARNVVDAYRYWTREAIIADIDTRRHPLHVAIENFGHD ANIGSVVRTANAFAVHTVHIVGRRRWNRRGAMVTDRYQRLCHHDSTTGLLEFAAGAGL TVVAVDNVPGAARLEQTALPRECLLLFGQEGPGITDDARAGAAVTVSIAQFGSTRSIN AGVAAGIAMHAWIRQHADLGRAW" gene complement(456915..457823) /locus_tag="Rv0381c" /db_xref="GeneID:886444" CDS complement(456915..457823) /locus_tag="Rv0381c" /function="UNKNOWN" /note="Rv0381c, (MTV036.16c), len: 302 aa. Hypothetical unknown protein. Equivalent to AAK44616.1 from Mycobacterium tuberculosis strain CDC1551 (254 aa) but longer 48 aa. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214895.1" /db_xref="GI:15607522" /db_xref="GeneID:886444" /translation="MRILVAWATCGAVVLSGLTGCSGSSHSGRTYGAQSARTGESLAV LGWNMSVSNLRWSGDYVLIDVDASPTDPHAPHAKPEDIRFGLYGALAHPMESAALGSC GDAMAHVRDVVSPLSAPAGRLTGTVCLGPLKERSAVRGVYTYSPRDRIPGTAAAYPAA FPVGMLPTNQNDAGLVVKTTSVSAWRADGMQLGKPQLGDPVAFTGNGYMLLGLEVDAV PDRYRDDSAARGGPMMLLAAPTLPGRGLSPACATYGSSVLILPDALLDAVHISASLCT QGEINEALLYATVATVGTHAALWTSR" gene complement(457841..458380) /gene="pyrE" /locus_tag="Rv0382c" /db_xref="GeneID:886443" CDS complement(457841..458380) /gene="pyrE" /locus_tag="Rv0382c" /EC_number="2.4.2.10" /function="INVOLVED IN PYRIMIDINE BIOSYNTHESIS (AT THE FIFTH STEP) [CATALYTIC ACTIVITY: Orotidine 5'-phosphate + diphosphate = orotate + 5-phospho-alpha-D-ribose 1-diphosphate]." /note="involved in fifth step of pyrimidine biosynthesis; converts orotidine 5'-phosphate and diphosphate to orotate and 5-phospho-alpha-D-ribose 1-diphosphate" /codon_start=1 /transl_table=11 /product="orotate phosphoribosyltransferase" /protein_id="YP_177723.1" /db_xref="GI:57116728" /db_xref="GeneID:886443" /translation="MAGPDRAELAELVRRLSVVHGRVTLSSGREADYYVDLRRATLHH RASALIGRLMRELTADWDYSVVGGLTLGADPVATAIMHAPGRPIDAFVVRKSAKAHGM QRLIEGSEVTGQRVLVVEDTSTTGNSALTAVHAVQDVGGEVVGVATVVDRATGAAEAI EAEGLRYRSVLGLADLGLD" misc_feature complement(457850..457897) /gene="pyrE" /locus_tag="Rv0382c" /note="PS00589 PTS HPR component serine phosphorylation site signature" gene complement(458461..459315) /locus_tag="Rv0383c" /db_xref="GeneID:886458" CDS complement(458461..459315) /locus_tag="Rv0383c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0383c, (MTV036.18c), len: 284 aa. Possible conserved secreted protein, with hydrophobic stretch in N-terminus and Pro-rich C-terminus. Equivalent to CAC32006.1|AL583925 possible secreted protein from Mycobacterium leprae (286 aa). TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214897.1" /db_xref="GI:15607524" /db_xref="GeneID:886458" /translation="MVPLWFTLSALCFVGAVVLLYVDIDRRRGRSRRRKSWARSHGFD YERESTEILKRWTRGVMSTVGDVAAHNVVLGQIRGEAVYIFDLEEVATVIALHRKVGT NVVVDLRLKGLKEPRESDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAE IMWNEQNWTLVSMPIASTRAQWDEGLRTVRQFNDLLRVLPPLPQEMPQQTGVGPRGAA PGRPVAPGGPAELPPRRAQPDPATTVLPDPARRAPEPIRRDEGRSEGVRRPPPAGRNG QQATNYQH" gene complement(459456..462002) /gene="clpB" /locus_tag="Rv0384c" /db_xref="GeneID:886440" CDS complement(459456..462002) /gene="clpB" /locus_tag="Rv0384c" /EC_number="3.-.-.-" /function="THOUGHT TO BE AN ATPASE SUBUNIT OF AN INTRACELLULAR ATP-DEPENDENT PROTEASE. SEEMS TO BE REGULATED POSITIVELY BY SIGH (Rv3223c PRODUCT) AND NEGATIVELY BY HSPR (Rv0353 PRODUCT)." /experiment="experimental evidence, no additional details recorded" /note="Rv0384c, (MTV036.19c), len: 848 aa. Probable clpB (alternate gene name: htpM), endopeptidase ATP-binding protein, chain B (EC 3.-.-.-), equivalent to AC32007.1|AL583925 heat shock protein from Mycobacterium leprae (848 aa). Also highly similar to others e.g. P53532|CLPB_CORGL|1163118|AAB49540.1|U43536|CGU43536_1 CLPB PROTEIN (heat-inducible expression) from Corynebacterium glutamicum (852 aa), FASTA scores: opt: 4113, E(): 0, (74.5% identity in 846 aa overlap); T36551|4753885|CAB42048.1|AL049754|clpB|SCOEDB|SCH10.39c probable ATP-dependent proteinase ATP-binding chain from Streptomyces coelicolor (853 aa); P03815|CLPB_ECOLI|1788943|AAC75641.1|AE000345 CLPB PROTEIN (HEAT SHOCK PROTEIN F84.1) from Escherichia coli strains K12 and O157:H7 (857 aa); etc. Also similar to Rv3596c|ClpC from Mycobacterium tuberculosis. Contains PS00870 and PS00871 Chaperonins clpA/B signatures and two PS000017 ATP/GTP-binding site motives A (P-loop). BELONGS TO THE CLPA/CLPB FAMILY. Contains probable coiled-coil domain from aa 411-503.; htpM" /codon_start=1 /transl_table=11 /product="endopeptidase ATP binding protein" /protein_id="NP_214898.1" /db_xref="GI:15607525" /db_xref="GeneID:886440" /translation="MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDG IAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQPQLSRESLAAITTAQQLATELD DEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEG LAQRIVAGDVPESLRDKTIVALDLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITF IDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEYRKHIEKDAALERRF QQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAI DLVDEAASRLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELA DQKEKLAELTTRWQNEKNAIEIVRDLKEQLEALRGESERAERDGDLAKAAELRYGRIP EVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAKLLRMED ELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFL FDDERAMVRIDMSEYGEKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIE KAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTSNLGSGGSAEQVLAAVRATFK PEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGF DPVYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG" misc_feature complement(460050..460106) /gene="clpB" /locus_tag="Rv0384c" /note="PS00871 Chaperonins clpA/B signature 2" misc_feature complement(460161..460184) /gene="clpB" /locus_tag="Rv0384c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(461082..461120) /gene="clpB" /locus_tag="Rv0384c" /note="PS00870 Chaperonins clpA/B signature 1" misc_feature complement(461364..461387) /gene="clpB" /locus_tag="Rv0384c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 462135..463307 /locus_tag="Rv0385" /db_xref="GeneID:886441" CDS 462135..463307 /locus_tag="Rv0385" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0385, (MTV036.20), len: 390 aa. Probable monooxygenase (EC 1.-.-.-), similar to T37003|5738846|CAB52917.1|AL109949 probable flavohemoprotein from Streptomyces coelicolor (435 aa); and similar in part (C-termini) to various monooxygenases e.g. P19734|DMPP_PSESP|94993|F37831 PHENOL HYDROXYLASE P5 PROTEIN (PHENOL 2-MONOOXYGENASE P5 COMPONENT) (EC 1.14.13.7) from Pseudomonas putida (353 aa), FASTA scores: opt: 363, E(): 4.2e-16, (31.8% identity in 255 aa overlap); S47292|2120861|pir|S70085 phenol 2-monooxygenase (EC 1.14.13.7) chain mopP from Acinetobacter calcoaceticus (350 aa); P21394|XYLA_PSEPU|94933|B37316 XYLENE MONOOXYGENASE ELECTRON TRANSFER COMPONENT (EC 1.18.1.3) [INCLUDES: FERREDOXIN; FERREDOXIN--NAD(+) REDUCTASE] from Pseudomonas putida plasmid pWW0 (350 aa); AAC38360.1|AF043544|NtnMA|ntnA reductase component of 4-nitrotoluene monooxygenase from Pseudomonas sp. (328 aa); etc. TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214899.1" /db_xref="GI:15607526" /db_xref="GeneID:886441" /translation="MGLEDRDALRVLQNAFKLDDPELVRRFYAHWFALDASVRDLFPP DMGAQRAAFGQALHWVYGELVAQRAEEPVAFLAQLGRDHRKYGVLPTQYDTLRRALYT TLRDYLGHPSRGAWTDAVDEAAGQSLNLIIGVMSGAADADDAPAWWDGTVVEHIRVSR DLAVARLQLDRPLHYYPGQYVNVHVPQCPRRWRYLSPAIPADPNGRIEFHVRVVPGGL VSNAIVGETRPGDRWRLSGPHGAFRVDRDGGDVLMVAGSTGLAPLRALIIDLSRFAVN PRVHLFFGARYACELYDLPTLWQIAAHNPWLSVSPVSEYNGDPAWAADYPDVSAPRGL HVRQTGRLPDVVSRYGGWGDRQILICGGPAMVRATKAALIAKGAPPERIQHDPLSR" gene 463411..466668 /locus_tag="Rv0386" /db_xref="GeneID:886030" CDS 463411..466668 /locus_tag="Rv0386" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0386, (MTV036.21), len: 1085 aa. Probable regulatory protein, LuxR/uhpA family, highly similar to CAC30706.1|AL583923 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also similar in part to other regulatory proteins e.g. CAB95788.1|AL359949 putative multi-domain regulatory protein from Streptomyces coelicolor (780 aa); N-terminus of CAB92369.1|AL356612 putative AfsR-like regulatory protein from Streptomyces coelicolor (1114 aa); N-terminus of NP_107139.1|14026327|BAB52925.1|AP003009 transcriptional regulator from Mesorhizobium loti (952 aa); AFSR_STRCO|P25941 regulatory protein afsr from Streptomyces coelicolor (993 aa), FASTA scores: opt: 224, E() : 1.1e-06, (26.1% identity in 867 aa overlap); etc. Also similar to many putative Mycobacterium tuberculosis regulatory proteins e.g. AL0212|MTV008_44 (1137 aa), FASTA scores: opt: 3756, E(): 0, (56.7% identity in 1089 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00622 Bacterial regulatory proteins, luxR family signature and probable helix-turn-helix motif at aa 1042-1063 (Score 1025, +2.68 S D). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="LuxR/UHPA family transcriptional regulator" /protein_id="NP_214900.1" /db_xref="GI:15607527" /db_xref="GeneID:886030" /translation="MSKLLPRGTVTLLLADVEGSTWLWETHPDDMGAAVARLDKAVSG VIAAHDGVRPVEQGEGDSFVLAFACASDAVAAALDLQRARLAPIRLRIGVHTGEVALR DEGNYAGPTINRTARLRDLAHGGQTVLSGVTESLVIDRLPDKAWLVDLGTHALRDLSR PERVMQLCHPELRIDFPPLRVANDDVAHGLPVHLTRFVGRGAQITEVHRLVTDNRLVT LTGAGGVGKTRLAAQLAAQIAGEFGRAWFVDLAPITDPDLVPVTVAGALGLHDQPGRS TTDTVLRFLGGRPALVVLDNCEHLLDATAALVLALVKACRGVRLLATCREPLRVEGEV SYRVPSLSLSDEAVEMFCYRAQRVRPDFRLTDDNSAAVTEICKRLDGLPLAIELAAAR LRSMTLDEIIDGLRDRFALLTGGARTAAHRQQTLWASVDWSYTLLTEPERTLFRRLAV FVGCFFVDDAQAVACSGDVQRYQVLDEITLLVDKSLVMADDNSGRTCYRLCETMRHYA LEKLSEAGEVDAVFARHRDYYTALAARVDNPGPSDYSHCLDQAETEIDNLRAAFVWNR ENSDTEGALALASSLLRVWMTRGRIQEGRAWFDSILADENARHLEVAAAVRARALADK ALLDIFVDAAAGMEQAQQALVIAREVDEPALLSRALTACGLIAVAVARADAAASYFAE AIDLARAVDDRWRLAQILTFQAVDAVVAGDPVAARPAAQEARELAAAIGDHSNALWCR WCLGYAQLMRGELAAAAAQFGEVVDEAEASQEVLHKANSLQGLAFALAYQGELSAARA AADAALEAAELGEYFAGMGYSALTTAALAAGDVQTAQHASEAAWRNLSLALPLSAAVQ RAFNAQAALAGGDLSAARRWCDDAVQSMTGHHLAMALATRARIAVAEGKREEAERDAH KALACAAESGAHLDLPDVLECLAGLASDAGTHHAAARLFGAAEAIRQQIGSVRFAIYR SDYVQSVTALRDAMGEKDFDAAWAEGAALSIKETIAYAQRGHSWRKRPATGWESLTPT EIDVVRLVGEGLANKDIATRLFVSPRTVQTHLTHVYTKLGFTSRLQLAQAAARRT" misc_feature 464071..464094 /locus_tag="Rv0386" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 466531..466614 /locus_tag="Rv0386" /note="PS00622 Bacterial regulatory proteins, luxR family signature" gene complement(466672..467406) /locus_tag="Rv0387c" /db_xref="GeneID:886436" CDS complement(466672..467406) /locus_tag="Rv0387c" /function="UNKNOWN" /note="Rv0387c, (MTV036.22c), len: 244 aa. Conserved hypothetical protein, showing some similarity to MTCI237.20c, and M17282|HUMEL20_1 Human elastin gene, exon 1, Elastin (687 aa), FASTA scores: opt: 193, E(): 0.35, (34.4% identity in 189 aa overlap). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="NP_214901.1" /db_xref="GI:15607528" /db_xref="GeneID:886436" /translation="MSLLPTLQSFLPPPFDAIPNPIEDLDVLVAAAVAVAAGSLGVSA AQLGEIYRHDVVDEAQKAPHCPAESDQTPAGAAGDGDLPEVGGRVTSPPQPPVAALTG YSANIGGLSVPHSWNLPPAVRQVAAMFPGATPMYMTGSSDGSYAGLAAAGLAGTGLAG LAARGGSAPTPAAAAPAGAGGAGPAATRPAAQQTPAVPAAAAGSAIPGLPPGLPPGVV ANLAATLAAIPGATIIVVPPSPNANQ" gene complement(467459..468001) /gene="PPE9" /locus_tag="Rv0388c" /db_xref="GeneID:886439" CDS complement(467459..468001) /gene="PPE9" /locus_tag="Rv0388c" /function="UNKNOWN" /note="Rv0388c, (MTV036.23c), len: 180 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to others e.g. MTCY10G2_10|Z92539 from Mycobacterium tuberculosis (391 aa), FASTA scores: opt: 667, E(): 0, (58.3% identity in 180 aa overlap) but much shorter." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177724.1" /db_xref="GI:57116729" /db_xref="GeneID:886439" /translation="MDFGALPPEINSARIYSGPGSRPLMQAAAAWQRLANELTATAAS YSSVISGLTGDDWLGPSALSMAAAAVPYVAWMRATAASAEQAAAQAVAAANAYESAYA ATVPPTVIAANRRTMLSLVQTNVFGQNTPAIATSETHYGEMWAHDILAMDGYAGASGA ASQLRRSPATGDHQRGRVAE" gene 468335..469594 /gene="purT" /locus_tag="Rv0389" /db_xref="GeneID:886032" CDS 468335..469594 /gene="purT" /locus_tag="Rv0389" /EC_number="2.1.2.-" /function="INVOLVED IN THIRD STEP (FIRST OF TWO TRANSFORMYLATION REACTIONS) IN DE NOVO PURINE BIOSYNTHESIS. THIS IS AN ALTERNATIVE ENZYME TO THE PURN|Rv0956 GAR TRANSFORMYLASE (5'-PHOSPHORIBOSYLGLYCINAMIDE FORMYLTRANSFERASE). CATALYZES TWO REACTIONS: THE FIRST ONE IS THE PRODUCTION OF BETA-FORMYL GLYCINAMIDE RIBONUCLEOTIDE (GAR) FROM FORMATE, ATP AND BETA GAR; THE SECOND, A SIDE REACTION, IS THE PRODUCTION OF ACETYL PHOSPHATE AND ADP FROM ACETATE AND ATP [CATALYTIC ACTIVITY: FORMATE + ATP + 5'-PHOSPHO-RIBOSYLGLYCINAMIDE = 5'-PHOSPHORIBOSYL-N-FORMYLGLYCINAMIDE + ADP + PYROPHOSPHATE]." /note="non-folate utilizing enzyme, catalyzes the production of beta-formyl glycinamide ribonucleotide from formate, ATP, and beta-GAR and a side reaction producing acetyl phosphate and ADP from acetate and ATP; involved in de novo purine biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosylglycinamide formyltransferase 2" /protein_id="NP_214903.1" /db_xref="GI:15607530" /db_xref="GeneID:886032" /translation="MIDGWTEEQHEPTVRHERPAAPQDVRRVMLLGSAEPSRELAIAL QGLGAEVIAVDGYVGAPAHRIADQSVVVTMTDAEELTAVIRRLQPDFLVTVTAAVSVD ALDAVEQADGECTELVPNARAVRCTADREGLRRLAADQLGLPTAPFWFVGSLGELQAV AVHAGFPLLVSPVAGVAGQGSSVVAGPNEVEPAWQRAAGHQVQPQTGGVSPRVCAESV VEIEFLVTMIVVCSQGPNGPLIEFCAPIGHRDADAGELESWQPQKLSTAALDAAKSIA ARIVKALGGRGVFGVELMINGDEVYFADVTVCPAGSAWVTVRSQRLSVFELQARAILG LAVDTLMISPGAARVINPDHTAGRAAVGAAPPADALTGALGVPESDVVIFGRGLGVAL ATAPEVAIARERAREVASRLNVPDSRE" gene 469591..470013 /locus_tag="Rv0390" /db_xref="GeneID:886433" CDS 469591..470013 /locus_tag="Rv0390" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0390, (MTCY04D9.02), len: 140 aa. Conserved hypothetical protein, equivalent to AL023514|MLCB4_11|CAA18942.1|AL023514 hypothetical protein from Mycobacterium leprae (147 aa), FASTA scores: opt: 778, E(): 0, (79.0% identity in 138 aa overlap). Also similar to hypothetical proteins from several Rickettsia species." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214904.1" /db_xref="GI:15607531" /db_xref="GeneID:886433" /translation="MSYAGDITPLQAWEMLSDNPRAVLVDVRCEAEWRFVGVPDLSSL GREVVYVEWATSDGTHNDNFLAELRDRIPADADQHERPVIFLCRSGNRSIGAAEVATE AGITPAYNVLDGFEGHLDAEGHRGATGWRAVGLPWRQG" gene 470010..471230 /gene="metZ" /locus_tag="Rv0391" /db_xref="GeneID:886431" CDS 470010..471230 /gene="metZ" /locus_tag="Rv0391" /EC_number="4.2.99.-" /function="INVOLVED IN METHIONINE BIOSYNTHESIS. CONVERTS O-SUCCINYLHOMOSERINE INTO HOMOCYSTEINE." /note="Rv0391, (MTCY04D9.03), len: 406 aa. Probable metZ, O-succinylhomoserine sulfhydrylase (EC 4.2.99.-), equivalent, but shorter 20 aa in N-terminus, to AA18941.1|AL023514 O-succinylhomoserine sulfhydrylase from Mycobacterium leprae (426 aa). Also highly similar to others e.g. METZ_PSEAE|P55218 o-succinylhomoserine sulfhydrylase from Pseudomonas aeruginosa (403 aa), FASTA scores: opt: 1175, E(): 0, (47.2% identity in 392 aa overlap); etc. BELONGS TO THE TRANS-SULFURATION ENZYMES FAMILY. Could also be a cystathionine gamma-synthase (EC 4.2.99.9)." /codon_start=1 /transl_table=11 /product="O-succinylhomoserine sulfhydrylase" /protein_id="NP_214905.1" /db_xref="GI:15607532" /db_xref="GeneID:886431" /translation="MTDESSVRTPKALPDGVSQATVGVRGGMLRSGFEETAEAMYLTS GYVYGSAAVAEKSFAGELDHYVYSRYGNPTVSVFEERLRLIEGAPAAFATASGMAAVF TSLGALLGAGDRLVAARSLFGSCFVVCSEILPRWGVQTVFVDGDDLSQWERALSVPTQ AVFFETPSNPMQSLVDIAAVTELAHAAGAKVVLDNVFATPLLQQGFPLGVDVVVYSGT KHIDGQGRVLGGAILGDREYIDGPVQKLMRHTGPAMSAFNAWVLLKGLETLAIRVQHS NASAQRIAEFLNGHPSVRWVRYPYLPSHPQYDLAKRQMSGGGTVVTFALDCPEDVAKQ RAFEVLDKMRLIDISNNLGDAKSLVTHPATTTHRAMGPEGRAAIGLGDGVVRISVGLE DTDDLIADIDRALS" gene complement(471227..472639) /gene="ndhA" /locus_tag="Rv0392c" /db_xref="GeneID:886430" CDS complement(471227..472639) /gene="ndhA" /locus_tag="Rv0392c" /EC_number="1.6.99.3" /function="TRANSFER OF ELECTRONS FROM NADH TO THE RESPIRATORY CHAIN. THE IMMEDIATE ELECTRON ACCEPTOR FOR THE ENZYME IS BELIEVED TO BE UBIQUINONE. DOES NOT COUPLE THE REDOX REACTION TO PROTON TRANSLOCATION [CATALYTIC ACTIVITY: NADH + acceptor = NAD+ + reduced acceptor]." /note="Rv0392c, (MTCY04D9.04c), len: 470 aa. Probable ndhA, membrane NADH dehydrogenase (EC 1.6.99.3), equivalent to many e.g. AF038423|AF038423_1 NADH dehydrogenase from Mycobacterium smegmatis (457 aa), FASTA scores: opt: 1991, E(): 0, (67.9% identity in 458 aa overlap); MLCB1788_3 NADH dehydrogenase from Mycobacterium leprae (466 aa), FASTA score: (62.5% identity in 467 aa overlap). Also similar to others from several organisms e.g. P00393|DHNA_ECOLI|66211|581140|CAA23586.1|V00306 NADH DEHYDROGENASE from Escherichia coli (434 aa); and Rv0392c|ndhB from Mycobacterium tuberculosis. Has hydrophobic stretch in C-terminus. BELONGS TO THE NADH DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="membrane NADH dehydrogenase" /protein_id="NP_214906.1" /db_xref="GI:15607533" /db_xref="GeneID:886430" /translation="MTLSSGEPSAVGGRHRVVIIGSGFGGLNAAKALKRADVDITLIS KTTTHLFQPLLYQVATGILSEGDIAPTTRLILRRQKNVRVLLGEVNAIDLKAQTVTSK LMDMTTVTPYDSLIVAAGAQQSYFGNDEFATFAPGMKTIDDALELRGRILGAFEAAEV STDHAERERRLTFVVVGAGPTGVEVAGQIVELAERTLAGAFRTITPSECRVILLDAAP AVLPPMGPKLGLKAQRRLEKMDVEVQLNAMVTAVDYKGITIKEKDGGERRIECACKVW AAGVAASPLGKMIAEGSDGTEIDRAGRVIVEPDLTVKGHPNVFVVGDLMFVPGVPGVA QGAIQGARYATTVIKHMVKGNDDPANRKPFHYFNKGSMATISRHSAVAQVGKLEFAGY FAWLAWLVLHLVYLVGYRNRIAALFAWGISFMGRARGQMAITSQMIYARLVMTLMEQQ AQGALAAAEQAEHAEQEAAG" gene 472781..474106 /locus_tag="Rv0393" /db_xref="GeneID:886428" CDS 472781..474106 /locus_tag="Rv0393" /function="UNKNOWN" /note="Rv0393, (MTCY04D9.05), len: 441 aa. Member of Mycobacterium tuberculosis 13E12 repeat family of conserved proteins, similar to many e.g. Rv1148c, Rv1945, Rv3467, Rv0336|MTCY279_3 (503 aa), FASTA scores: E(): 0, (61.1% identity in 347 aa overlap)." /codon_start=1 /transl_table=11 /product="13E12 repeat family protein" /protein_id="NP_214907.1" /db_xref="GI:15607534" /db_xref="GeneID:886428" /translation="MAVGRCAIPRFDQAASGSAINGGQVHLSDGSTSPARQLPAPWPG DAGAAAEGRAGVCCRGNRLPHVSDVGVSHRFDHRPAGVGAGGCRAGAAGAGLAVDDPG QLAAAIDRIVAVADPDAVRQVRERARDREVSIWNSADGMGEVYAQLYATDAQALDARL NALVATVCAGDPRSTDQRRADALGALAAGADRLACRCDNPDCAAEGRPVSAVVIHVVA EQASVKGHGQAPAALLGGDGLIPAELVAELAKTAGLQPIPVPAGTEPGYRPSVKLAAF VRARDLTCRAPGCDRPATQCDLDHTIAFADGGATHAANLKCLCRLHHLLATFCGWRAQ QLPDGTVIWTLPGNQTYVTTPGSALLFPALCTPTGDPPAPEPARADRRGQRTAMMPRR ASTRTQNRAHCIAAERHRNHQARRIAQAAVIATETHGPPPDPDDDPPPF" gene complement(474122..474841) /locus_tag="Rv0394c" /db_xref="GeneID:886435" CDS complement(474122..474841) /locus_tag="Rv0394c" /function="UNKNOWN" /note="Rv0394c, (MTCY04D9.06c), len: 239 aa. Possible secreted protein, sharing no homology with other proteins. Has hydrophobic stretch at its N-terminus." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214908.1" /db_xref="GI:15607535" /db_xref="GeneID:886435" /translation="MTEPRPVFAVVISAGLSAIPMVGGPLQTVFDAIEERTRHRAETT TREICESVGGADTVLSRIDKNPELEPLLSQAIEAATRTSMEAKRRLLAQAAAAALEDD QKVEPASLIVATLSQLEPVHIHALVRLAKAAKSSPDQDEIQRREVMRAASKVEPVPVL AALIQTGVAIATTTVWHGNGTGTPAEESGHILIHDVSDFGHRLLAYLRAADAGAELLI LPSGGSAPTGDHPTPHPSTSR" gene 474940..475344 /locus_tag="Rv0395" /db_xref="GeneID:886425" CDS 474940..475344 /locus_tag="Rv0395" /function="UNKNOWN" /note="Rv0395, (MTCY04D9.07), len: 134 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214909.1" /db_xref="GI:15607536" /db_xref="GeneID:886425" /translation="MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEV LDEHLAVRRRGVPAAIGCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGF PNVALLRLRDMAPSEHGSRCSSARGRLCLSMS" gene 475350..475742 /locus_tag="Rv0396" /db_xref="GeneID:886423" CDS 475350..475742 /locus_tag="Rv0396" /function="UNKNOWN" /note="Rv0396, (MTCY04D9.08), len: 130 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214910.1" /db_xref="GI:15607537" /db_xref="GeneID:886423" /translation="MRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFE PRTVWWGSANWTVRAGSHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPN LVQVEFDDAAMAEAMEEMAEPDDDGEDW" gene 475816..476184 /locus_tag="Rv0397" /db_xref="GeneID:886421" CDS 475816..476184 /locus_tag="Rv0397" /function="UNKNOWN" /note="Rv0397, (MTCY04D9.09), len: 122 aa. Part of 13E12 repeat family of conserved Mycobacterium tuberculosis proteins, similar to downstream Rv0393|Z84725|MTCY4D9_5 CONSERVED 13E12 REPEAT FAMILY PROTEIN (441 aa), FASTA scores: E(): 0, (87.7% identity in 122 aa overlap)." /codon_start=1 /transl_table=11 /product="13E12 repeat family protein" /protein_id="NP_214911.1" /db_xref="GI:15607538" /db_xref="GeneID:886421" /translation="MLATFWGWRAQQLPDGTVIWTLPGDQTYVTTPGSALLFPALCTP TGDPPRPDPARADRRGQRTAMMPRRASTRAQNRAHYIAAERHRNHQARRIAHVVTQTA TTAPETNGPPPDPDDDPPPF" gene complement(476679..477320) /locus_tag="Rv0398c" /db_xref="GeneID:886419" CDS complement(476679..477320) /locus_tag="Rv0398c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0398c, (MTCY04D9.10c), len: 213 aa. Possible secreted protein, sharing no homology with other proteins. Has potential signal sequence with hydrophobic stretch from aa 7-25." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214912.1" /db_xref="GI:15607539" /db_xref="GeneID:886419" /translation="MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGP VIGDGDAALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQRV LGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACVKSGWRKATAG TPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAPNANYRTTASSWPG" gene complement(477327..478556) /gene="lpqK" /locus_tag="Rv0399c" /db_xref="GeneID:886416" CDS complement(477327..478556) /gene="lpqK" /locus_tag="Rv0399c" /function="UNKNOWN" /note="Rv0399c, (MTCY04D9.11c), len: 409 aa. Possible lpqK, conserved lipoprotein, showing some similarity to penicillin binding proteins and various peptidases e.g. DAC_STRSQ|P15555 d-alanyl-d-alanine carboxypeptidase protein (406 aa), FASTA scores: opt: 348, E(): 5.6e-16, (29.2% identity in 301 aa overlap). Also similar to other Mycobacterium tuberculosis PBPs and esterases. Has possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013)." /codon_start=1 /transl_table=11 /product="lipoprotein LpqK" /protein_id="NP_214913.1" /db_xref="GI:15607540" /db_xref="GeneID:886416" /translation="MPVLRRLGCSVLALGLLAGCAPPRTGPASSPTNNGAKADAVIRI VRDFMTQAHLKAVLVRVTVAGKEVVTRAVGDSMTGVPATTAMHFRNGAVAISYVATLL LKLVDEKKLRLDDKLSRWLPDFPHADRVTLGQLAQMTSGYPDYVLGNEAFDAELYANP FRQWTTQELLDQISSRPLLYDPGTNWNYAHTNYLLLGLALEKAAGQDMPTLLQRKVLS PLGLTATANSDTPAIPEPALHAFTSERRAALKIPAGVPFYEESTFWNPSWTITHGAIQ TTTIYDMEATAVGIGSGRLLSADSYKKMVSTELRGKTRAQPGCPTCFEQNDGYSYGLG IVISGHWLLQNPMFAGYAAVEAYLPSQRVAVAVAVTYAPEAFDDQGNYRNQADILFRK IGAEVAPNDAPPMPPGR" gene complement(478566..479753) /gene="fadE7" /locus_tag="Rv0400c" /db_xref="GeneID:886427" CDS complement(478566..479753) /gene="fadE7" /locus_tag="Rv0400c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0400c, (MTCY04D9.12c), len: 395 aa. Probable fadE7, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. CAC12923.1|AL445403 putative acyl CoA dehydrogenase from Streptomyces coelicolor (397 aa); G624219 GLUTARYL-CoA DEHYDROGENASE PRECURSOR (438 aa), FASTA scores: opt: 1161, E(): 0, (48.1% identity in 391 aa overlap); etc." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE7" /protein_id="NP_214914.1" /db_xref="GI:15607541" /db_xref="GeneID:886427" /translation="MSTPTPPALDRDDPLGLDASLSSDEIAVRDTVRRFCAEHVTPHV AAWFEDGDLPVARDLAKQFGELGLLGMQLHGHGCGGASAVHYGLACRELEAADSGIRS LVSVQGSLAMFAIASFGSDEQKRQWLPGMATGDLLGCFGLTEPDVGSDPAAMKTRARR DGPDWVITGGKMWITNGSVADVAIVWAATDDGIRGFIVPTDTPGFTANTIGHKLSLRA SITSELVLDNVRLPADAMLPGATGLRAPLACLSEARYGIVWGAMGAARSAWQCALDYA RQRTQFGRPIAGFQLTQAKLVDMAVELHKGQLLSLHLGRLKDRVGLRPDQVSFGKLNN TREALKICRTARTILGGNGISLEYPVIRHMVNLESVLTYEGTPEMHQLVLGQAFTGLA AFR" gene 479789..480160 /locus_tag="Rv0401" /db_xref="GeneID:886438" CDS 479789..480160 /locus_tag="Rv0401" /function="UNKNOWN" /note="Rv0401, (MTCY04D9.14), len: 129 aa. Probable conserved transmembrane protein, equivalent to AL023514|MLCB4_9 putative integral membrane protein from Mycobacterium leprae (122 aa), FASTA scores: opt: 548, E(): 4.4e-32, (66.9% identity in 121 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214915.1" /db_xref="GI:15607542" /db_xref="GeneID:886438" /translation="MRPRRALAGLAADVVAVLVFCAVGRRSHAEGLSVTGLAATAWPF LTGTGIGWVLARGWRRPTALAPTGVIVWLCTIVVGMVLRKVSSAGVAASFVVVASAVT AVLLLGWRAAVALMAPHRADG" gene complement(480355..483231) /gene="mmpL1" /locus_tag="Rv0402c" /db_xref="GeneID:886413" CDS complement(480355..483231) /gene="mmpL1" /locus_tag="Rv0402c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0402c, (MTCY04D9.15c), len: 958 aa. Probable mmpL1, conserved transmembrane transport protein (see Tekaia et al., 1999), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. YV34_MYCTU|Q11171 hypothetical 106.2 kDa membrane protein from Mycobacterium tuberculosis (968 aa), FASTA scores: opt: 3551, E(): 0, (55.4% identity in 933aa overlap); YV34_MYCLE|P54881 hypothetical 105.2 kDa protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3615, E(): 0, (55.5% identity in 941 aa overlap); etc. Highly similar to many other mycobacterial MmpL proteins from Mycobacterium tuberculosis and Mycobacterium leprae e.g. Rv0450c, Rv0676c, Rv0507, etc. BELONGS TO THE MMPL FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL1" /protein_id="NP_214916.1" /db_xref="GI:15607543" /db_xref="GeneID:886413" /translation="MRSQRLAGHLSAAARTIHALSLPIILFWVALTIVVNVVAPQLQS VARTHSVALGPHDAPSLIAMKRIGKDFQQFDSDTTAMVLLEGQEKLGDEAHRFYDVLV TKLSQDTTHVQHIENFWGDPLTAAGSQSADGKAAYVQLNLTGDQGGSQANESVAAVQR IVDSVPPPPGIKAYVTGPGPLGADRVVYGDRSLHTITGISIAVIAIMLFIAYRSLSAA LIMLLTVGLELLAVRGIISTFAVNDLMGLSTFTVNVLVALTIAASTDYIIFLVGRYQE ARATGQNREAAYYTMFGGTAHVVLASGLTVAGAMYCLGFTRLPYFNTLASPCAIGLVT VMLASLTLAPAIIAVASRFGLFDPKRATTKRRWRRIGTVVVRWPGPVLAATLLIALIG LLALPKYQTNYNERYYIPSAAPSNIGYLASDRHFPQARMEPEVLMVEADHDLRNPTDM LILDRIAKTVFHTPGIARVQSITRPLGAPIDHSSIPFQLGMQSTMTIENLQNLKDRVA DLSTLTDQLQRMIDITQRTQELTRQLTDATHDMNAHTRQMRDNANELRDRIADFDDFW RPLRSFTYWERHCFDIPICWSMRSLLNSMDNVDKLTEDLANLTDDTERMDTTQRQLLA QLDPTIATMQTVKDLAQTLTSAFSGLVTQMEDMTRNATVMGRTFDAANNDDSFYLPPE AFQNPDFQRGLKLFLSPDGTCARFVITHRGDPASAEGISHIDPIMQAADEAVKGTPLQ AASIYLAGTSSTYKDIHEGTLYDVMIAVVASLCLIFIIMLGITRSVVASAVIVGTVAL SLGSAFGLSVLIWQHILHMPLHWLVLPMAIIVMLAVGSDYNLLLIARFQEEIGAGLKT GMIRAMAGTGRVVTIAGLVFAFTMGSMVASDLRVVGQIGTTIMIGLLFDTLVVRSYMT PALATLLGRWFWWPRRVDRLARQPQVLGPRRTTALSAERAALLQ" gene complement(483228..483656) /gene="mmpS1" /locus_tag="Rv0403c" /db_xref="GeneID:886411" CDS complement(483228..483656) /gene="mmpS1" /locus_tag="Rv0403c" /function="UNKNOWN" /note="Rv0403c, (MTCY04D9.16c), len: 142 aa. Probable mmpS1, conserved membrane protein (see citation below), highly similar to other Mycobacterial proteins e.g. YV33_MYCLE|P54880 hypothetical 16.9 kDa protein from Mycobacterium leprae (154 aa), FASTA scores: opt: 458, E(): 1.6e-26, (46.9% identity in 143 aa overlap); YV33_MYCTU|Q11170 hypothetical 15 .9 kDa protein from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 362, E(): 1.1e-19, (42.1% identity in 140 aa overlap); etc. Also similar to other MmpS proteins from Mycobacterium tuberculosis e.g. Rv0677c, Rv0451c, etc. BELONGS TO THE MMPS FAMILY." /codon_start=1 /transl_table=11 /product="membrane protein" /protein_id="NP_214917.1" /db_xref="GI:15607544" /db_xref="GeneID:886411" /translation="MFGVAKRFWIPMVIVIVVAVAAVTVSRLHSVFGSHQHAPDTGNL DPIIAFYPKHVLYEVFGPPGTVASINYLDADAQPHEVVNAAVPWSFTIVTTLTAVVAN VVARGDGASLGCRITVNEVIREERIVNAYHAHTSCLVKSA" gene 483977..485734 /gene="fadD30" /locus_tag="Rv0404" /db_xref="GeneID:886409" CDS 483977..485734 /gene="fadD30" /locus_tag="Rv0404" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_214918.1" /db_xref="GI:15607545" /db_xref="GeneID:886409" /translation="MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSR VTAVSAYLISYGRHADRRRTAAISAPQGLDYVAGFLGALCAGWTPVPLPEPLGSLRDK RTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEPSGDNCDLDSQ LSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYFRHEGGAPRLPSSVVSWLP LYHDMGLMVGLFIPLFVGCPVILTSPEAFIRKPARWMQLLAKHQAPFSAAPNFAFDLA VAKTSEEDMAGLDLGHVNTIINGAEQVQPNTITKFLRRFRPYNLMPAAVKPSYGMAEA VVYLATTKAGSPPTSTEFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDP DSNIELGPGRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGD LGFIVGDEFYIVGRMKDLIIQDGVNHYPDDIETTVKEFTGGRVAAFSVSDDGVEHLVI AAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLLVPPGALPKTTSGKISR AACAKQYGANKLQRVATFP" gene 485731..489939 /gene="pks6" /locus_tag="Rv0405" /db_xref="GeneID:886407" CDS 485731..489939 /gene="pks6" /locus_tag="Rv0405" /function="POLYKETIDE SYNTHASE POSSIBLY INVOLVED IN LIPID SYNTHESIS." /note="Rv0405, (MTCY22G10.01), len: 1402 aa. Probable pks6, membrane-bound polyketide synthase (see citation below), highly similar to others e.g. CAC29643.1|AL583917 putative polyketide synthase from Mycobacterium leprae (2103 aa); Y06K_MYCTU|Q10977 probable polyketide synthase (1876 aa), FASTA scores: opt: 2303, E(): 0, (38.7% identity in 1232 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, 2 x PS00017 ATP/GTP-binding site motif A (P-loop), and PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="membrane bound polyketide synthase" /protein_id="NP_214919.1" /db_xref="GI:15607546" /db_xref="GeneID:886407" /translation="MTDGSVTADKLQKWFREYLSTHIECHPNEVSLDVPIRDLGLKSI DVLAIPGDLGDRFGFCIPDLAVWDNPSANDLIDSLLNQRSADSLRESHGHADRNTQGR GSINEPVAVIGVGCRFPGDIDGPERLWDFLTEKKCAITAYPDRGFTNAGTFAESGGFL KDVAGFDNRFFDIPPDEALRMDPQQRLLLEVSWEALEHAGIIPESLRLSRTGVFVGVS STDYVRLVSASAQQKSTIWDNTGGSSSIIANRISYFLDIQGPSIVIDTACSSSLVAVH LACRSLSTWDCDIALVGGTNVLISPEPWGGFREAGILSQTGCCHAFDKSADGMVRGEG CGVIVLQRLSDARLEGRRILAILTGSAVNQDGKSNGIMAPNPSAQIGVLENACKSARV DPLEIGYVEAHGTGTSLGDRIEAHALGMVFGRKRPGSGPLMIGSIKPNIGHLEGAAGI AGLIKAVLMVERGSLLPSGGFTEPNPAIPFTELGLRVVDELQEWPVVAGRPRRAGVSS FGFGGTNAHVIVEEAGSVGADTVSGRADVGGSGGGVVAWVISGKTASALAAQAGRLGR YVRARPALDVVDVGYSLVSTRSVFDHRAVVVGQTRDELLAGLAGVVAGRPEAGVVCGV GKPAGKTAFVFAGQGSQWLGMGSELYAAYPVFAEALDAVVDELDRHLRYPLRDVIWGH DQDLLNTTEFAQPALFAVEVALYRLLMSWGVRPGLVLGHSVGELAAAHVAGALCLPDA AMLVAARGRLMQALPAGGAMFAVQAREDEVAPMLGHDVSIAAVNGPASVVISGAHDAV SAIADRLRGQGRRVHRLAVSHAFHSALMEPMIAEFTAVAAELSVGLPTIPVISNVTGQ LVADDFASADYWARHIRAVVRFGDSVRSAHCAGASRFIEVGPGGGLTSLIEASLADAQ IVSVPTLRKDRPEPVSVMTAAAQGFVSGMGLDWASVFSGYRPKRVELPTYAFQHQKFW LAPAPSVSDPTAAGQIGASDGGAELLASSGFAARLAGRSADEQLAAAIEVVCEHAAAV LGRDGAAGLDAGQAFADSGFNSLSAVELRNRLTAVTAVTLPATAIFDHPTPTELAQYL ITQIDGHGSSAAAAANPAERIDALTDLFLQACDAGRDADGWKMVALASNTRERMSSPV RNNVSKNVALLADGISDVVVICIPTLTVLSDQREYRDIANAMTGRHSVYSLTLPGFDS SDALPQNADMIVETVSNAIIDVVGGSCRFVLSGYSSGGVLAYALCSHLSVKHQRNPLG VALIDTYLPSQIANPSMNEGFSPNDTGKGLSREVIRVARMLNRLTATRLTAAATYAAI FQAWEPGRSMAPVLNIVAKDRIATVENLREERINRWRTAAAEAAYSVAEVPGDHFGMM STSSEAIATEIHDWISGLVRGPHR" misc_feature 486505..486555 /gene="pks6" /locus_tag="Rv0405" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 486811..486834 /gene="pks6" /locus_tag="Rv0405" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 487366..487389 /gene="pks6" /locus_tag="Rv0405" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 488869..488916 /gene="pks6" /locus_tag="Rv0405" /note="PS00012 Phosphopantetheine attachment site" gene complement(489887..490705) /locus_tag="Rv0406c" /db_xref="GeneID:886403" CDS complement(489887..490705) /locus_tag="Rv0406c" /function="UNKNOWN" /note="Rv0406c, (MTCY22G10.02c), len: 272 aa. Beta-lactamase-like protein, equivalent to AAD38170.1|AF152397_1 beta-lactamase-like protein from Mycobacterium phlei (243 aa); AL023514|MLCB4_8 hypothetical protein from Mycobacterium leprae (251 aa), FASTA scores: opt: 1284, E(): 0, (74.9% identity in 243 aa overlap); and AAD38164.1|AF152394_2 beta-lactamase-like protein from Mycobacterium avium (247 aa), FASTA scores: opt: 1301, E(): 0, (74.2% identity in 244 aa overlap); etc. Also slight similarity to others beta-lactamases and hypothetical proteins e.g. P52700|BLA1_XANMA|628530|S45349 METALLO-BETA-LACTAMASE L1 PRECURSOR (BETA-LACTAMASE, TYPE II) (PENICILLINASE) from Xanthomonas maltophilia (290 aa), FASTA scores: (34.4% identity in 96 aa overlap)." /codon_start=1 /transl_table=11 /product="beta lactamase like protein" /protein_id="NP_214920.1" /db_xref="GI:15607547" /db_xref="GeneID:886403" /translation="MVATRGTRLAALALAPRLAGMAELVQITDKVHLARGHAVNWVLV TDDTGVLLIDAGYPGDRAEVLASLNKLGYTPGDVRAIVLTHAHIDHLGSAIWFAREHS TPVYCHAEEVGHAKREYRENASVFDVALRSWRPRVAVWGIHLLRRGGLTGDGIPTAQP LTAEAAAGLPGQPMAIFTPGHTSGHCSYVVDGVLASGDALITGHPMLRHRGPQLLPAV FSHSQQNSIRSLAALALLETNILAPGHGELWHGPIRKATDEALERAQKSNHVFR" gene 490783..491793 /gene="fgd1" /locus_tag="Rv0407" /db_xref="GeneID:886418" CDS 490783..491793 /gene="fgd1" /locus_tag="Rv0407" /function="CATALYZES OXIDATION OF GLUCOSE-6-PHOSPHATE TO 6-PHOSPHOGLUCONOLACTONE USING COENZYME F420 (AN *-HYDROXY-5-DEAZAFLAVIN DERIVATIVE) AS THE ELECTRON ACCEPTOR." /experiment="experimental evidence, no additional details recorded" /note="Rv0407, (MTCY22G10.03), len: 336 aa. Probable fgd1, F420-dependent glucose-6-phosphate dehydrogenase (EC 1.-.-.-), equivalent to others from Mycobacteria e.g. AAD38165.1|AF152394_3 from Mycobacterium avium (336 aa), FASTA scores: opt: 2082, E(): 0, (89.9% identity in 336 aa overlap); AL023514|MLCB 4_7 from Mycobacterium leprae (336 aa), FASTA scores: opt: 2069, E(): 0, (89.0% identity in 336 aa overlap). Also similar to other dehydrogenases e.g. CAA77276.1|Y18730 F420-dependent alcohol dehydrogenase from Methanofollis liminatans (330 aa). Also similar to many proteins from Mycobacterium tuberculosis e.g. Rv0953c, Rv0791c, etc. Note that previously known as fgd.; fgd" /codon_start=1 /transl_table=11 /product="F420-dependent glucose-6-phosphate dehydrogenase" /protein_id="NP_214921.1" /db_xref="GI:15607548" /db_xref="GeneID:886418" /translation="MAELKLGYKASAEQFAPRELVELAVAAEAHGMDSATVSDHFQPW RHQGGHAPFSLSWMTAVGERTNRLLLGTSVLTPTFRYNPAVIAQAFATMGCLYPNRVF LGVGTGEALNEIATGYEGAWPEFKERFARLRESVGLMRQLWSGDRVDFDGDYYRLKGA SIYDVPDGGVPVYIAAGGPAVAKYAGRAGDGFICTSGKGEELYTEKLMPAVREGAAAA DRSVDGIDKMIEIKISYDPDPELALNNTRFWAPLSLTAEQKHSIDDPIEMEKAADALP IEQIAKRWIVASDPDEAVEKVGQYVTWGLNHLVFHAPGHDQRRFLELFQSDLAPRLRR LG" gene 491786..493858 /gene="pta" /locus_tag="Rv0408" /db_xref="GeneID:886401" CDS 491786..493858 /gene="pta" /locus_tag="Rv0408" /EC_number="2.3.1.8" /function="INVOLVED AT THE LAST STEP (OF TWO) IN THE CONVERSION OF ACETATE TO ACETYL-CoA [CATALYTIC ACTIVITY: Acetyl-CoA + phosphate = CoA + acetyl phosphate]." /note="catalyzes the synthesis of acetylphosphate or propionylphosphate from acetyl-CoA or propionyl-CoA and inorganic phosphate; when using propionyl-CoA the enzyme is functioning in the anaerobic pathway catabolizing threonine to propionate" /codon_start=1 /transl_table=11 /product="phosphate acetyltransferase" /protein_id="NP_214922.1" /db_xref="GI:15607549" /db_xref="GeneID:886401" /translation="MADSSAIYLAAPESQTGKSTIALGLLHRLTAMVAKVGVFRPITR LSAERDYILELLLAHTSAGLPYERCVGVTYQQLHADRDDAIAEIVDSYHAMADECDAV VVVGSDYTDVTSPTELSVNGRIAVNLGAPVLLTVRAKDRTPDQVASVVEVCLAELDTQ RAHTAAVVANRCELSAIPAVTDALRRFTPPSYVVPEEPLLSAPTVAELTQAVNGAVVS GDVALREREVMGVLAAGMTADHVLERLTDGMAVITPGDRSDVVLAVASAHAAEGFPSL SCIVLNGGFQLHPAIAALVSGLRLRLPVIATALGTYDTASAAASARGLVTATSQRKID TALELMDRHVDVAGLLAQLTIPIPTVTTPQMFTYRLLQQARSDLMRIVLPEGDDDRIL KSAGRLLQRGIVDLTILGDEAKVRLRAAELGVDLDGATVIEPCASELHDQFADQYAQL RKAKGITVEHAREIMNDATYFGTMLVHNCHADGMVSGAAHTTAHTVRPALEIIKTVPG ISTVSSIFLMCLPDRVLAYGDCAIIPNPTVEQLADIAICSARTAAQFGIEPRVAMLSY STGDSGKGADVDKVRAATELVRAREPQLPVEGPIQYDAAVEPSVAATKLRDSPVAGRA TVLIFPDLNTGNNTYKAVQRSAGAIAIGPVLQGLRKPVNDLSRGALVDDIVNTVAITA IQAQGVHE" gene 493851..495008 /gene="ackA" /locus_tag="Rv0409" /db_xref="GeneID:886399" CDS 493851..495008 /gene="ackA" /locus_tag="Rv0409" /EC_number="2.7.2.1" /function="INVOLVED AT THE FIRST STEP (OF TWO) IN THE CONVERSION OF ACETATE TO ACETYL-CoA [CATALYTIC ACTIVITY: ATP + acetate = ADP + acetyl phosphate]." /note="AckA utilizes acetate and can acetylate CheY which increases signal strength during flagellar rotation; utilizes magnesium and ATP; also involved in conversion of acetate to aceyl-CoA" /codon_start=1 /transl_table=11 /product="acetate kinase" /protein_id="NP_214923.1" /db_xref="GI:15607550" /db_xref="GeneID:886399" /translation="MSSTVLVINSGSSSLKFQLVEPVAGMSRAAGIVERIGERSSPVA DHAQALHRAFKMLAEDGIDLQTCGLVAVGHRVVHGGTEFHQPTLLDDTVIGKLEELSA LAPLHNPPAVLGIKVARRLLANVAHVAVFDTAFFHDLPPAAATYAIDRDVADRWHIRR YGFHGTSHQYVSERAAAFLGRPLDGLNQIVLHLGNGASASAIARGRPVETSMGLTPLE GLVMGTRSGDLDPGVISYLWRTARMGVEDIESMLNHRSGMLGLAGERDFRRLRLVIET GDRSAQLAYEVFIHRLRKYLGAYLAVLGHTDVVSFTAGIGENDAAVRRDALAGLQGLG IALDQDRNLGPGHGARRISSDDSPIAVLVVPTNEELAIARDCLRVLGGRRA" misc_feature 493863..493898 /gene="ackA" /locus_tag="Rv0409" /note="PS01075 Acetate and butyrate kinases family signature 1" misc_feature 494748..494777 /gene="ackA" /locus_tag="Rv0409" /note="PS00758 ArgE / dapE / ACY1 / CPG2 / yscS family signature 1" gene complement(495062..497314) /gene="pknG" /locus_tag="Rv0410c" /db_xref="GeneID:886397" CDS complement(495062..497314) /gene="pknG" /locus_tag="Rv0410c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO REGULATE AMINO-ACID UPTAKE AND STATIONARY-PHASE METABOLISM. PHOSPHORYLATES THE PEPTIDE SUBSTRATE MYELIN BASIC PROTEIN (MBP) AT SERINE RESIDUES [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv0410c, (MTCY22G10.06c), len: 750 aa. pknG, serine/threonine-protein kinase (EC 2.7.1.-) (see citations below), equivalent to PKNG_MYCLE|P57993|13092623|CAC29812.1|AL583918 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium leprae (767 aa). Also similar to others e.g. AB76890.1|AL159139 putative serine/threonine protein kinase from Streptomyces coelicolor (774 aa); etc. Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES." /codon_start=1 /transl_table=11 /product="serine/threonine-protein kinase PKNG (protein kinase G) (STPK G)" /protein_id="NP_214924.1" /db_xref="GI:15607551" /db_xref="GeneID:886397" /translation="MAKASETERSGPGTQPADAQTATSATVRPLSTQAVFRPDFGDED NFPHPTLGPDTEPQDRMATTSRVRPPVRRLGGGLVEIPRAPDIDPLEALMTNPVVPES KRFCWNCGRPVGRSDSETKGASEGWCPYCGSPYSFLPQLNPGDIVAGQYEVKGCIAHG GLGWIYLALDRNVNGRPVVLKGLVHSGDAEAQAMAMAERQFLAEVVHPSIVQIFNFVE HTDRHGDPVGYIVMEYVGGQSLKRSKGQKLPVAEAIAYLLEILPALSYLHSIGLVYND LKPENIMLTEEQLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTVGRT LAALTLDLPTRNGRYVDGLPEDDPVLKTYDSYGRLLRRAIDPDPRQRFTTAEEMSAQL TGVLREVVAQDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDGQVHAEKLTANEIV TALSVPLVDPTDVAASVLQATVLSQPVQTLDSLRAARHGALDADGVDFSESVELPLME VRALLDLGDVAKATRKLDDLAERVGWRWRLVWYRAVAELLTGDYDSATKHFTEVLDTF PGELAPKLALAATAELAGNTDEHKFYQTVWSTNDGVISAAFGLARARSAEGDRVGAVR TLDEVPPTSRHFTTARLTSAVTLLSGRSTSEVTEEQIRDAARRVEALPPTEPRVLQIR ALVLGGALDWLKDNKASTNHILGFPFTSHGLRLGVEASLRSLARVAPTQRHRYTLVDM ANKVRPTSTF" misc_feature complement(496463..496501) /gene="pknG" /locus_tag="Rv0410c" /note="PS00108 Serine/Threonine protein kinases active-site signature" gene complement(497314..498300) /gene="glnH" /locus_tag="Rv0411c" /db_xref="GeneID:886393" CDS complement(497314..498300) /gene="glnH" /locus_tag="Rv0411c" /function="INVOLVED IN ACTIVE TRANSPORT OF GLUTAMINE ACROSS THE MEMBRANE (IMPORT). INTERACTS WITH THE GLUTAMINE-TRANSPORT SYSTEM." /note="Rv0411c, (MTCY22G10.07c), len: 328 aa. Probable glnH, glutamine-binding protein, membrane-bound lipoprotein (see citation below), equivalent to AL035159|MLCB1450_15|T44736|4154051|CAA22704.1 glutamine-binding protein homolog from Mycobacterium leprae (325 aa), FASTA scores: opt: 1747, E(): 0, (79.3% identity in 328 aa overlap). Also similar to others e.g. GLNH_BACST|P27676 glutamine-binding protein precursor from Bacillus stearothermophilus (262 aa), FASTA scores: opt: 493, E(): 7.5e-22, (37.8% identity in 193 aa overlap); etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site, PS01039 Bacterial extracellular solute-binding proteins, family 3 signature. BELONGS TO THE BACTERIAL EXTRACELLULAR SOLUTE-BINDING PROTEIN FAMILY 3. Presumed attached to the membrane by a lipid anchor." /codon_start=1 /transl_table=11 /product="glutamine-binding lipoprotein" /protein_id="NP_214925.1" /db_xref="GI:15607552" /db_xref="GeneID:886393" /translation="MTRRALLARAAAPLAPLALAMVLASCGHSETLGVEATPTLPLPT PVGMEIMPPQPPLPPDSSSQDCDPTASLRPFATKAEADAAVADIRARGRLIVGLDIGS NLFSFRDPITGEITGFDVDIAGEVARDIFGVPSHVEYRILSAAERVTALQKSQVDIVV KTMSITCERRKLVNFSTVYLDANQRILAPRDSPITKVSDLSGKRVCVARGTTSLRRIR EIAPPPVIVSVVNWADCLVALQQREIDAVSTDDTILAGLVEEDPYLHIVGPDMADQPY GVGINLDNTGLVRFVNGTLERIRNDGTWNTLYRKWLTVLGPAPAPPTPRYVD" misc_feature complement(497911..497952) /gene="glnH" /locus_tag="Rv0411c" /note="PS01039 Bacterial extracellular solute-binding proteins, family 3 signature" misc_feature complement(498223..498255) /gene="glnH" /locus_tag="Rv0411c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(498300..499619) /locus_tag="Rv0412c" /db_xref="GeneID:886415" CDS complement(498300..499619) /locus_tag="Rv0412c" /function="UNKNOWN" /note="Rv0412c, (MTCY22G10.08c), len: 439 aa. Possible conserved membrane protein, equivalent to AL035159|MLCB1450_16|T44737 probable membrane protein from Mycobacterium leprae (403 aa), FASTA scores: opt: 2027, E(): 0, (80.4% identity in 403 aa overlap). Also some similarity with CAB71201.1|AL138538 putative secreted protein from Streptomyces coelicolor (429 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214926.1" /db_xref="GI:15607553" /db_xref="GeneID:886415" /translation="MTVELAHPSTEPLGSRSPAEPAHPRRWFISTTPGRIMTIGIVLA ALGVASAFATSTTIEHRQQVLTAVLDHTEPLSFAAGRLYTTLSVADAAAATAFIAQAE PGGVRLRYEQAITDASVAVTRASSGLTDESLVQLLGRINAELAVYTGLVEIARANNRA GNPVGSSYLSEASGLMQSTILPDAQRLYQATSARVDRETTASTQIPAPVILVVATTVV FGAFAHRWLARRTRRRINPGLVVGALGILVMVVWVGTALTISTTASRSAKDTAAESLK TITNLAITAQQARADETLSLIRRGDEEVRKQAFYQRIDAMQRQLNDYMARRHAVDKPD LQGADQLLVRWRQANDRINSDISVGNYRAATQVALGKGEDDATPAFDKLDEALTKAMG QSRTQLRHDILNAHRGLAGAQVGGVVLSLGAAIAVALGLWPRLKEYR" gene 499713..500366 /gene="mutT3" /locus_tag="Rv0413" /db_xref="GeneID:886405" CDS 499713..500366 /gene="mutT3" /locus_tag="Rv0413" /EC_number="3.6.1.-" /function="POSSIBLY INVOLVED IN THE GO SYSTEM RESPONSIBLE FOR REMOVING AN OXIDATIVELY DAMAGED FORM OF GUANINE (7,8-DIHYDRO-8-OXOGUANINE) FROM DNA AND THE NUCLEOTIDE POOL. 8-OXO-DGTP IS INSERTED OPPOSITE DA AND DC RESIDUES OF TEMPLATE DNA WITH ALMOST EQUAL EFFICIENCY THUS LEADING TO A.T TO G.C TRANSVERSIONS. MUTT SPECIFICALLY DEGRADES 8-OXO-DGTP TO THE MONOPHOSPHATE [CATALYTIC ACTIVITY: 8-OXO-DGTP + H2O = 8-OXO-DGMP + PYROPHOSPHATE]." /note="Rv0413, (MTCY22G10.10), len: 217 aa. Possible mutT3, mutator protein (EC 3.6.1.-) (see citation below), showing some similarity with e.g. MUTT_PROVU|P32090 mutator mutt protein from Proteus vulgaris (112 aa), FASTA scores: opt: 151, E(): 0.0008, (40.7% identity in 59 aa overlap). SEEMS TO BELONG TO THE NUDIX HYDROLASE FAMILY." /codon_start=1 /transl_table=11 /product="7,8-dihydro-8-oxoguanine-triphosphatase" /protein_id="NP_214927.1" /db_xref="GI:15607554" /db_xref="GeneID:886405" /translation="MPSCPPAYSEQVRGDGDGWVVSDSGVAYWGRYGAAGLLLRAPRP DGTPAVLLQHRALWSHQGGTWGLPGGARDSHETPEQTAVRESSEEAGLSAERLEVRAT VVTAEVCGVDDTHWTYTTVVADAGELLDTVPNRESAELRWVAENEVADLPLHPGFAAS WQRLRTAPATVPLARCDERRQRLPRTIQIEAGVFLWCTPGDADQAPSPLGRRISSLL" gene complement(500350..501018) /gene="thiE" /locus_tag="Rv0414c" /db_xref="GeneID:886391" CDS complement(500350..501018) /gene="thiE" /locus_tag="Rv0414c" /EC_number="2.5.1.3" /function="INVOLVED IN THIAMINE BIOSYNTHESIS. CONDENSES 4-METHYL-5-(BETA-HYDROXYETHYL)-THIAZOLE MONOPHOSPHATE (THZ-P) AND 4-AMINO-5-HYDROXYMETHYL PYRIMIDINE PYROPHOSPHATE (HMP-PP) TO FORM THIAMINE MONOPHOSPHATE (TMP) [CATALYTIC ACTIVITY: 2-methyl-4-amino-5-hydroxymethylpyrimidine diphosphate + 4-4-methyl-5-(2-phosphonooxyethyl)-thiazole = diphosphate + thiamine monophosphate]." /note="Condenses 4-methyl-5-(beta-hydroxyethyl)-thiazole monophosphate and 4-amino-5-hydroxymethyl pyrimidine pyrophosphate to form thiamine monophosphate" /codon_start=1 /transl_table=11 /product="thiamine-phosphate pyrophosphorylase" /protein_id="NP_214928.1" /db_xref="GI:15607555" /db_xref="GeneID:886391" /translation="MHESRLASARLYLCTDARRERGDLAQFAEAALAGGVDIIQLRDK GSPGELRFGPLQARDELAACEILADAAHRYGALFAVNDRADIARAAGADVLHLGQRDL PVNVARQILAPDTLIGRSTHDPDQVAAAAAGDADYFCVGPCWPTPTKPGRAAPGLGLV RVAAELGGDDKPWFAIGGINAQRLPAVLDAGARRIVVVRAITSADDPRAAAEQLRSAL TAAN" gene 501148..502170 /gene="thiO" /locus_tag="Rv0415" /db_xref="GeneID:886390" CDS 501148..502170 /gene="thiO" /locus_tag="Rv0415" /function="POSSIBLY INVOLVED IN THIAMINE BIOSYNTHESIS." /note="Rv0415, (MTCY22G10.12), len: 340 aa. Possible thiO, thiamine biosynthesis oxidoreductase (EC 1.-.-.-), equivalent to T44739|4154054|CAA22708.1|AL035159|MLCB1450.24 hypothetical protein from Mycobacterium leprae (340 aa), FASTA scores: opt: 1867, E(): 0, (82.0% identity in 338 aa overlap). Shows some similarity to other thiO proteins e.g. THIO_RHIET|O34292 Putative thiamine biosynthesis oxidoreductase from Rhizobium etli plasmid pb (327 aa) (see citation below); AAG31046.1|AF264948_8|THIO putative amino acid oxidase flavoprotein ThiO from Erwinia amylovora (349 aa); NP_106392.1|14025578|BAB52178.1|AP003007|THIO THIAMINE BIOSYNTHESIS OXIDOREDUCTASE THIO from Mesorhizobium loti (333 aa); etc." /codon_start=1 /transl_table=11 /product="thiamine biosynthesis oxidoreductase ThiO" /protein_id="NP_214929.1" /db_xref="GI:15607556" /db_xref="GeneID:886390" /translation="MASDLHTGSLAVIGGGVIGLSVARRAAQAGWPVRVHRSDERGAS WVAGGMLAPHSEGWPGEERLLRLGLQSLRLWREGSFLDGLGPQLVTAHESLVVAVDRA DVADLRTVADWLSAQGHPVIWESAARDVEPLLAQGIRHGFRAPTELAVDNRALLDALC RDCERLGVRWSSQVSSLSDVDAHTVVIANGIDAPALWPGLPIRPVKGEVLRLRWRPGC MPLPQRVIRARVRGRQVYLVPRSDGVVVGATQYEHGRDTAPVVSGVRDLLDDACTVLP ALGEYELAECEAGLRPMTPDNLPLVQRLDSRTLVAAGHGRSGFLLAPWTAEQIVSELV SVGAAS" gene 502167..502373 /gene="thiS" /locus_tag="Rv0416" /db_xref="GeneID:886395" CDS 502167..502373 /gene="thiS" /locus_tag="Rv0416" /function="POSSIBLY INVOLVED IN THIAMINE BIOSYNTHESIS." /note="with ThiF, ThiG, and ThiO catalyzes the formation of the thiazole moiety of thiamine pyrophosphate" /codon_start=1 /transl_table=11 /product="sulfur carrier protein ThiS" /protein_id="NP_214930.1" /db_xref="GI:15607557" /db_xref="GeneID:886395" /translation="MIVVVNEQQVEVDEQTTIAALLDSLGFGDRGIAVALNFSVLPRS DWATKICELRKPVRLEVVTAVQGG" gene 502366..503124 /gene="thiG" /locus_tag="Rv0417" /db_xref="GeneID:886396" CDS 502366..503124 /gene="thiG" /locus_tag="Rv0417" /function="INVOLVED IN THIAMINE BIOSYNTHESIS. Required for the synthesis of the thiazole moiety of thiamine." /note="functions in thiamine (vitamin B1) biosynthesis; in Bacillus subtilis this enzyme catalyzes the formation of thiazole from dehydroxyglycine and 1-deoxy-D-xylulose-5-phosphate and ThiS-thiocarboxylate" /codon_start=1 /transl_table=11 /product="thiazole synthase" /protein_id="NP_214931.1" /db_xref="GI:15607558" /db_xref="GeneID:886396" /translation="MAESKLVIGDRSFASRLIMGTGGATNLAVLEQALIASGTELTTV AIRRVDADGGTGLLDLLNRLGITPLPNTAGSRSAAEAVLTAQLAREALNTNWVKLEVI ADERTLWPDAVELVRAAEQLVDDGFVVLPYTTDDPVLARRLEDTGCAAVMPLGSPIGT GLGIANPHNIEMIVAGARVPVVLDAGIGTASDAALAMELGCDAVLLASAVTRAADPPA MAAAMAAAVTAGYLARCAGRIPKRFWAQASSPAR" gene 503496..504998 /gene="lpqL" /locus_tag="Rv0418" /db_xref="GeneID:886381" CDS 503496..504998 /gene="lpqL" /locus_tag="Rv0418" /EC_number="3.4.11.-" /function="UNKNOWN; HYDROLYZES PEPTIDES AND/OR PROTEINS." /note="Rv0418, (MTCCY22G10.15), len: 500 aa. Probable lpqL, lipoprotein aminopeptidase (EC 3.4.11.-), similar to others e.g. B83278|9949035|AAG06327.1|AE004720_3|AE004720|PA2939 probable aminopeptidase from Pseudomonas aeruginosa (536 aa); P80561|APX_STRGR|SGAP|S66427 aminopeptidase (EC 3.4.11.-) from Streptomyces griseus (284 aa) (homology only with C-terminus of Rv0418); P37302|APE3_YEAST|1077010|A54134 aminopeptidase Y (EC 3.4.11.-) from Saccharomyces cerevisiae (537 aa); etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein aminopeptidase LpqL" /protein_id="NP_214932.1" /db_xref="GI:15607559" /db_xref="GeneID:886381" /translation="MVNKSRMMPAVLAVAVVVAFLTTGCIRWSTQSRPVVNGPAAAEF AVALRNRVSTDAMMAHLSKLQDIANANDGTRAVGTPGYQASVDYVVNTLRNSGFDVQT PEFSARVFKAEKGVVTLGGNTVEARALEYSLGTPPDGVTGPLVAAPADDSPGCSPSDY DRLPVSGAVVLVDRGVCPFAQKEDAAAQRGAVALIIADNIDEQAMGGTLGANTDVKIP VVSVTKSVGFQLRGQSGPTTVKLTASTQSFKARNVIAQTKTGSSANVVMAGAHLDSVP EGPGINDNGSGVAAVLETAVQLGNSPHVSNAVRFAFWGAEEFGLIGSRNYVESLDIDA LKGIALYLNFDMLASPNPGYFTYDGDQSLPLDARGQPVVPEGSAGIERTFVAYLKMAG KTAQDTSFDGRSDYDGFTLAGIPSGGLFSGAEVKKSAEQAELWGGTADEPFDPNYHQK TDTLDHIDRTALGINGAGVAYAVGLYAQDLGGPNGVPVMADRTRHLIAKP" misc_feature 503538..503570 /gene="lpqL" /locus_tag="Rv0418" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 505086..506582 /gene="lpqM" /locus_tag="Rv0419" /db_xref="GeneID:886388" CDS 505086..506582 /gene="lpqM" /locus_tag="Rv0419" /EC_number="3.4.11.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS." /note="Rv0419, (MTCY22G10.16), len: 498 aa. Possible lpqM, lipoprotein peptidase (EC 3.4.-.-); has potential N-terminal signal peptide and contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site, PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="lipoprotein peptidase LpqM" /protein_id="NP_214933.1" /db_xref="GI:15607560" /db_xref="GeneID:886388" /translation="MHGRGRYRPLVRCVRPRRVAASVRTPIACLAAVVVIAGCTTVVD GRALSILNDPFRVGGLPATNGPSGARPDAPAASGTVINTNNGAIDKLSLLSVNDIEDY WMAVYSESLKGTFRPVGKLVSYDSNDPSSPIVCHIDTYQLVNAFFSSRCNLIAWDRGV FMAVAQEYFGDMSVNGVLAHEFGHALQVMANLVTRKDPTIVREQQADCFAGVYLWWVA EGKSTRFTLSTADGLDHVLAGIITTRDPVMEADAENDDEHGSALDRVSAFQLGFINGT PACAAIDEDEVERRRGDLPTALRVDASGNPETGEVGINEETLSTLMELMGKIFSPKNP PTLSYQPAGCPDAKPSPPAAYCPATNTIVVDLPALARMGKVASAAEHSLPQGDDTSLS IVMSRYALAVQHERGLPMQSPWTALRTACLTGVAHRKMAVPIDLPSGQQLVLTAGDLD EAVSGLLTNRMVASDADGVSVPAGFTRIAAFRAGVGGDMDACYARYPG" misc_feature 505170..505202 /gene="lpqM" /locus_tag="Rv0419" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" misc_feature 505614..505643 /gene="lpqM" /locus_tag="Rv0419" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(506561..506971) /locus_tag="Rv0420c" /db_xref="GeneID:886383" CDS complement(506561..506971) /locus_tag="Rv0420c" /function="UNKNOWN" /note="Rv0420c, (MTCY22G10.17c), len: 136 aa. Possible transmembrane protein; has potential transmembrane domains aa 53-99 and aa 100-122." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214934.1" /db_xref="GI:15607561" /db_xref="GeneID:886383" /translation="MRLHDASAAAPESRMHIARHGEAVNRRQMFIGITGLLLAVIGLM ALWFPVYLDQYDAYGIKVTCGSGWRSNLTQALYADGNDNTQALVTRCDTALLVRRAWA IPSVALGWLLVTGFLVMWVHNDQHQGQSYPGYRA" gene complement(507132..507761) /locus_tag="Rv0421c" /db_xref="GeneID:886377" CDS complement(507132..507761) /locus_tag="Rv0421c" /function="UNKNOWN" /note="Rv0421c, (MTCY22G10.18c), len: 209 aa. Conserved hypothetical protein, showing similarity with NP_103507.1|14022684|BAB49293.1|AP002998 hypothetical protein from Mesorhizobium loti (214 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214935.1" /db_xref="GI:15607562" /db_xref="GeneID:886377" /translation="MNLDQIAGVAHQPAGPPHGVVVLTHGAGGSRESTLLQQVCAEWT RRGWLAVRYNLPYRRRRPTGPPSGSGSGDRAGIVEAIQLCRGLAEGPLIAGGHSYGGR QTSMVVAAGQAPVDVLTLFSYPVHPPGKPERVRTEHLPGIAVPTVFTHGTADPFGTLA QVRSAAAMVSAPTEVVEITGARHDLGSKTLDVARLAVDAALRLSAGQIA" gene complement(507758..508555) /gene="thiD" /locus_tag="Rv0422c" /db_xref="GeneID:886375" CDS complement(507758..508555) /gene="thiD" /locus_tag="Rv0422c" /EC_number="2.7.4.7" /function="INVOLVED IN THIAMINE BIOSYNTHESIS. CATALYZES THE PHOSPHORYLATION OF HMP-P TO HMP-PP [CATALYTIC ACTIVITY: ATP + 4-amino-2-methyl-5-phosphomethylpyrimidine = ADP + 4-amino-2-methyl-5-diphosphomethylpyrimidine]." /note="catalyzes the formation of 4-amino-2-methyl-5-diphosphomethylpyrimidine" /codon_start=1 /transl_table=11 /product="phosphomethylpyrimidine kinase" /protein_id="NP_214936.1" /db_xref="GI:15607563" /db_xref="GeneID:886375" /translation="MTPPRVLSIAGSDSGGGAGIQADMRTMALLGVHACVAVTAVTVQ NTLGVKDIHEVPNDVVAGQIEAVVTDIGVQAAKTGMLASSRIVATVAATWRRLELSVP LVVDPVCASMHGDPLLAPSALDSLRGQLFPLATLLTPNLDEARLLVDIEVVDAESQRA AAKALHALGPQWVLVKGGHLRSSDGSCDLLYDGVSCYQFDAQRLPTGDDHGGGDTLAT AIAAALAHGFTVPDAVDFGKRWVTECLRAAYPLGRGHGPVSPLFRLS" gene complement(508582..510225) /gene="thiC" /locus_tag="Rv0423c" /db_xref="GeneID:886379" CDS complement(508582..510225) /gene="thiC" /locus_tag="Rv0423c" /function="INVOLVED IN THIAMINE BIOSYNTHESIS. REQUIRED FOR THE SYNTHESIS OF THE HYDROMETHYLPYRIMIDINE (HMP) MOIETY OF THIAMINE (4-AMINO-2-METHYL-5-HYDROXYMETHYLPYRIMIDINE)." /note="required for the synthesis of the hydromethylpyrimidine moiety of thiamine" /codon_start=1 /transl_table=11 /product="thiamine biosynthesis protein ThiC" /protein_id="NP_214937.1" /db_xref="GI:15607564" /db_xref="GeneID:886379" /translation="MTITVEPSVTTGPIAGSAKAYREIEAPGSGATLQVPFRRVHLST GDHFDLYDTSGPYTDTDTVIDLTAGLPHRPGVVRDRGTQLQRARAGEITAEMAFIAAR EDMSAELVRDEVARGRAVIPANHHHPESEPMIIGKAFAVKVNANIGNSAVTSSIAEEV DKMVWATRWGADTIMDLSTGKNIHETREWILRNSPVPVGTVPIYQALEKVKGDPTELT WEIYRDTVIEQCEQGVDYMTVHAGVLLRYVPLTAKRVTGIVSRGGSIMAAWCLAHHRE SFLYTNFEELCDIFARYDVTFSLGDGLRPGSIADANDAAQFAELRTLGELTKIAKAHG AQVMIEGPGHIPMHKIVENVRLEEELCEEAPFYTLGPLATDIAPAYDHITSAIGAAII AQAGTAMLCYVTPKEHLGLPDRKDVKDGVIAYKIAAHAADLAKGHPRAQERDDALSTA RFEFRWNDQFALSLDPDTAREFHDETLPAEPAKTAHFCSMCGPKFCSMRITQDVREYA AEHGLETEADIEAVLAAGMAEKSREFAEHGNRVYLPITQ" gene complement(510377..510652) /locus_tag="Rv0424c" /db_xref="GeneID:886374" CDS complement(510377..510652) /locus_tag="Rv0424c" /function="UNKNOWN" /note="Rv0424c, (MTCY22G10.21c), len: 91 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214938.1" /db_xref="GI:15607565" /db_xref="GeneID:886374" /translation="MAEKNTRRATSQREAVAKIREAETIVMNLPICGQVKIPRPEHLA YYGGLAALAALELIDWPVALVIATGHILANNHHNRVLEELGEAMEEA" gene complement(510702..515321) /gene="ctpH" /locus_tag="Rv0425c" /db_xref="GeneID:886373" CDS complement(510702..515321) /gene="ctpH" /locus_tag="Rv0425c" /EC_number="3.6.1.-" /function="METAL CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF UNDETERMINATED METAL CATION WITH HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED METAL CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED METAL CATION(OUT)]." /note="Rv0425c, (MTCY22G10.22c), len: 1539 aa. Possible ctpH, metal cation-transporting P-type ATPase (transmembrane protein) (EC 3.6.1.-), showing some similarity with CAA17934.1|AL022118|13093871|CAC32203.1|AL583926 putative cation-transporting ATPase from Mycobacterium leprae (1609 aa). Also similar to others ATPases e.g. AE000873_1 CATION-TRANSPORTING P-ATPASE from Methanobacterium thermoautotrop (844 aa), FASTA score: (30.5% identity in 827 aa overlap); AB69720.1|AL137166 putative transport ATPase from Streptomyces coelicolor (1472 aa); etc. C-terminal region similar to other ATPases from Mycobacterium tuberculosis e.g. Y05Q_MYCTU|Q10900 putative cation-transporting ATPase C (855 aa), FASTA scores: opt: 770, E(): 5.3e-32, (44.9% identity in 820 aa overlap)." /codon_start=1 /transl_table=11 /product="metal cation transporting P-type ATPase CtpH" /protein_id="NP_214939.1" /db_xref="GI:15607566" /db_xref="GeneID:886373" /translation="MPVRAVATGFRATATLTGASITAATAVSATLAKTGVGTGMKVAI IPLRAGAKALSGELSRETLGRNCWRGERRAWIEVRGLRSGGDDELGRVVLNAIQAHPG VGSASLNYPLSRVVVAIDDPDTSLRELCRIVDDAEKAERHRHPDQAADQLAQSPGSLP GDGVLLAVRAVTVAATAAGLGLALGGRALRWPRFPLVIEAAVAAVDHQPLLRRLLEDR IGTEATATVLELAMAAAHTVTLSPAALSVDLTIQALKAAECRAGARAWRRHEPQLALH ADEPADQPQSLWPRPARSTQPVQRSVARFALIQALSAVLVGAGTRDADMAATATLVAT PKASRTTPEAFAAALGQGLADQHAVLPLRPESLRRLDRVDAIVIDPRVLCTDDLRVAR IRGCGADELSTAWNRAQLVLTESGLRPGWHRVPGVSASGSDSAVEALFRPMHDRLASA VVAEAHRTGADLVSVDVDALGELRPVFDDIRPLDDGASGSLDEALARAVAELRQAGRT VAVLSSVGKQALSAADVALGVLPPPGAGAPPWYADVLLPDLGAAWRVLHAIPAARAAR QRGNEISGGASALGALLMLPGVRGLGPGPVTTGAAAGLLSGYLLARKVVDAQAPRPAP AHEWHAMSVEQVRKALPSPDEQAPAKAPPSPYPARALAGGLHTAKRGAQITQAPLNAL WQLTKAMRAELSDPLTPMLALGAMASAVLGSPVDAVMVGSVLTGNSILAASQRLRAES RLNRLLAQQIPPARKVLAGADDQPRYIEVRAEELRPGDIIEVRTHEVVPADARVIEEV DVEVDESALTGESLSVTKQVEPTPGVDLIERRCMLYAGTTVVSGTAVAVVTAVGPDTQ ERRAAELVSGDLSSVGLQHQLSRLTNQAWPVSMTGGALVTGLGLLRRRGLRQAVASGI AVTVAAVPEGMPLVATLAQQASARRLSHFGALVRIPRSVEALGRVDMVCFDKTGTLSE NRLRVAQVRPVAGHSREEVLRCAAHAAPASNGPQVHATDVAIVQAAAAAAASGTDGAE PGAAEPAAHLPFRSGRSFSASVSGTELTVKGAPEVVLAACEGIGSSMDDAVAELAANG LRVIAVAHRQLTAQQAQSVVDDPDEIARLCRDELSLVGFLGLSDTPRAQAAALLADLH EHDLDIRLITGDHPITAAAIAEELGMQVSPEQVISGAEWDALSRKDQERAVAERVIFA RMTPENKVQIVQTLEHSGRVCAMVGDGSNDAAAIRAATVGIGVVAHGSDPARVAADLV LVDGRIESLLPAILEGRQLWQRVQAAVSVLLGGNAGEVAFAIIGSAITGTSPLNTRQL LLVNMLTDALPAAALAVSKPSDPVTPATRGPDQRELWRAVGIRGATTAAAATVAWVMA GFTGLPRRASTVALVALVAAQLGQTLVDSHAWLVVLTALGSLAALATLISIPVVSQLL GCTPLDPLGWAQATAAATAATVAVAVLNRVLTGRDKSGQPNPQPPETDALSRDASPGA PPGPRRRRRATARRKAPVKAPSATRQTTKPKGPPAHRSSSTYPRR" gene complement(515373..515816) /locus_tag="Rv0426c" /db_xref="GeneID:886387" CDS complement(515373..515816) /locus_tag="Rv0426c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0426c, (MTCY22G10.23c), len: 147 aa. Possible transmembrane protein; has potential transmembrane domains aa 19-41, and aa 61-83." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214940.1" /db_xref="GI:15607567" /db_xref="GeneID:886387" /translation="MSVVGGTVRTVGRTVSGAATATTAAAGAVGGAAVSGIVGGVTGA AKGIQKGLSSGSKSTAAAALAIGAIGVAGLVDWPILLAVGGGALLLRKLNRTPEVAAP PVKAKLAPVPDKPAAAKEAPAKASKTTARKTSGRRAGTAELRSTN" gene complement(516017..516892) /gene="xthA" /locus_tag="Rv0427c" /db_xref="GeneID:886370" CDS complement(516017..516892) /gene="xthA" /locus_tag="Rv0427c" /EC_number="3.1.11.2" /function="INVOLVED IN BASE EXCISION REPAIR. APURINIC-APYRIMIDINIC ENDONUCLEASE. SUPPOSED TO REMOVE THE DAMAGED DNA AT CYTOSINES AND GUANINES BY CLEAVING AT THE 3' SIDE OF THE AP SITE BY A BETA-ELIMINATION REACTION. POSSIBLY EXHIBITES 3'-5'-EXONUCLEASE, 3'-PHOSPHOMONOESTERASE, 3'-REPAIR DIESTERASE AND RIBONUCLEASE H ACTIVITIES [CATALYTIC ACTIVITY: Degradation of double-stranded DNA. It acts progressively in a 3'- to 5'-direction, releasing 5'-phosphomononucleotides]." /note="Rv0427c, (MTCY22G10.24c), len: 291 aa. Probable xthA (alternate gene name: xth), exodeoxyribonuclease III protein (EC 3.1.11.2) (see citation below), similar to others e.g. EX3_ECOLI|P09030 exodeoxyribonuclease III from Escherichia Coli strain K12 (268 aa), FASTA scores: opt: 360, E(): 1.2e-17, (29.3% identity in 270 aa overlap); etc. BELONGS TO THE AP/EXOA FAMILY OF DNA REPAIR ENZYMES.; xth" /codon_start=1 /transl_table=11 /product="exodeoxyribonuclease III protein" /protein_id="NP_214941.1" /db_xref="GI:15607568" /db_xref="GeneID:886370" /translation="MPDGTIDGGHPQRPASPRLRSPLLRLATWNVNSIRTRLDRVLDW LGRADVDVLAMQETKCPDGQFPALPLFELGYDVAHVGFDQWNGVAIASRVGLDDVRVG FDGQPSWSGKPEVAATTEARALGATCGGIRVWSLYVPNGRALDDPHYTYKLDWLAALR DTAEGWLRDDPAAPIALMGDWNIAPTDDDVWSTEFFAGCTHVSEPERKAFNAIVDAQF TDVVRPFTPGPGVYTYWDYTQLRFPKKQGMRIDFILGSPALAARVMDAQIVREERKGK APSDHAPVLVDLHAG" gene complement(516895..517803) /locus_tag="Rv0428c" /db_xref="GeneID:886368" CDS complement(516895..517803) /locus_tag="Rv0428c" /function="UNKNOWN" /note="Rv0428c, (MTCY22G10.25c), len: 302 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214942.1" /db_xref="GI:15607569" /db_xref="GeneID:886368" /translation="MVSWPGLGTRVTVRYRRPAGSMPPLTDAVGRLLAVDPTVRVQTK TGTIVEFSPVDVVALRVLTDAPVRTAAIRALEHAAAAAWPGVERTWLDGWLLRAGHGA VLAANSAVPLDISAHTNTITEISAWYASRDLQPWLAVPDRLLPLPADLAGERREQVLV RDVSTGEPDRSVTLLDHPDDTWLRLYHQRLPLDMATPVIDGELAFGSYLGVAVARAAV TDAPDGTRWVGLSAMRAADEQSATGSAGRQLWEALLGWGAGRGATRGYVRVHDTATSV LAESLGFRLHHHCRYLPAQSVGWDTF" gene complement(517803..518396) /gene="def" /locus_tag="Rv0429c" /db_xref="GeneID:886366" CDS complement(517803..518396) /gene="def" /locus_tag="Rv0429c" /EC_number="3.5.1.88" /function="REMOVES THE FORMYL GROUP FROM THE N-TERMINAL MET OF NEWLY SYNTHESIZED PROTEINS [CATALYTIC ACTIVITY: N-formyl-L-methionine + H2O = formate + L-methionine]." /note="cleaves off formyl group from N-terminal methionine residues of newly synthesized proteins; binds iron(2+)" /codon_start=1 /transl_table=11 /product="peptide deformylase" /protein_id="NP_214943.1" /db_xref="GI:15607570" /db_xref="GeneID:886366" /translation="MAVVPIRIVGDPVLHTATTPVTVAADGSLPADLAQLIATMYDTM DAANGVGLAANQIGCSLRLFVYDCAADRAMTARRRGVVINPVLETSEIPETMPDPDTD DEGCLSVPGESFPTGRAKWARVTGLDADGSPVSIEGTGLFARMLQHETGHLDGFLYLD RLIGRYARNAKRAVKSHGWGVPGLSWLPGEDPDPFGH" gene 518733..519041 /locus_tag="Rv0430" /db_xref="GeneID:886364" CDS 518733..519041 /locus_tag="Rv0430" /function="UNKNOWN" /note="Rv0430, (MTCY22G10.27), len: 102 aa. Conserved hypothetical protein, equivalent to AC30882.1|AL583923 conserved hypothetical protein from Mycobacterium leprae (102 aa). Also highly similar to CAB93047.1|SCD95A.20|AL357432 hypothetical protein from Streptomyces coelicolor (84 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214944.1" /db_xref="GI:15607571" /db_xref="GeneID:886364" /translation="MDSAMARAIRSGDDAEVADGLTRREHDILAFERQWWKFAGVKEE AIKELFSMSATRYYQVLNALVDRPEALAADPMLVKRLRRLRASRQKARAARRLGFEVT" gene 519073..519567 /locus_tag="Rv0431" /db_xref="GeneID:886362" CDS 519073..519567 /locus_tag="Rv0431" /function="UNKNOWN" /note="Rv0431, (MTCY22G10.28), len: 164 aa. Putative tuberculin related peptide; almost identical to D00815|MSGAT103_1 AT103 from Mycobacterium tuberculosis (172 aa), FASTA score: (99.4% identity in 163 aa overlap). Highly similar to to CAC30881.1|AL583923 tuberculin related peptide (AT103) from Mycobacterium leprae (167 aa). Some similarity to G550415|HRPC (282 aa), FASTA scores: opt: 120, E(): 0.36, (33.3% identity in 111 aa overlap). Potential transmembrane domain at aa 19-37." /codon_start=1 /transl_table=11 /product="putative tuberculin related peptide" /protein_id="NP_214945.1" /db_xref="GI:15607572" /db_xref="GeneID:886362" /translation="MLVTVGSMNERVPDSSGLPLRAMVMVLLFLGVVFLLLVWQALGS SPNSEDDSSAISTMTTTTAAPTSTSVKPAAPRAEVRVYNISGTEGAAARTADRLKAAG FTVTDVGNLSLPDVAATTVYYTEVEGERATADAVGRTLGAAVELRLPELSDQPPGVIV VVTG" gene 519600..520322 /gene="sodC" /locus_tag="Rv0432" /db_xref="GeneID:886358" CDS 519600..520322 /gene="sodC" /locus_tag="Rv0432" /EC_number="1.15.1.1" /function="DESTROYS RADICALS WHICH ARE NORMALLY PRODUCED WITHIN THE CELLS AND ARE TOXIC TO BIOLOGICAL SYSTEMS [CATALYTIC ACTIVITY: 2 superoxide + 2 H+ = O2 + H2O2]." /note="Rv0432, (MTCY22G10.29), len: 240 aa. Probable sodC, periplasmic superoxide dismutase [Cu-Zn] (EC 1.15.1.1), equivalent to CAC30880.1|AL583923 superoxide dismutase precursor (Cu-Zn) from Mycobacterium leprae (240 aa); and AAK20038.1|AF326234_1 copper zinc superoxide dismutase from Mycobacterium avium subsp. paratuberculosis (226 aa). Also similar to others e.g. SODC_PHOLE|P00446 superoxide dismutase precursor (cu-zn) from Photobacterium leiognathi (173 aa), FASTA scores: opt: 214, E(): 5.2 e-06, (36.5% identity in 181 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. BELONGS TO THE CU-ZN SUPEROXIDE DISMUTASE FAMILY. Possibly localized in periplasm, membrane-bound." /codon_start=1 /transl_table=11 /product="periplasmic superoxide dismutase [Cu-Zn] SodC" /protein_id="NP_214946.1" /db_xref="GI:15607573" /db_xref="GeneID:886358" /translation="MPKPADHRNHAAVSTSVLSALFLGAGAALLSACSSPQHASTVPG TTPSIWTGSPAPSGLSGHDEESPGAQSLTSTLTAPDGTKVATAKFEFANGYATVTIAT TGVGKLTPGFHGLHIHQVGKCEPNSVAPTGGAPGNFLSAGGHYHVPGHTGTPASGDLA SLQVRGDGSAMLVTTTDAFTMDDLLSGAKTAIIIHAGADNFANIPPERYVQVNGTPGP DETTLTTGDAGKRVACGVIGSG" misc_feature 519666..519698 /gene="sodC" /locus_tag="Rv0432" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 520324..521454 /locus_tag="Rv0433" /db_xref="GeneID:886356" CDS 520324..521454 /locus_tag="Rv0433" /function="UNKNOWN" /note="ATP-dependent carboxylate-amine ligase" /codon_start=1 /transl_table=11 /product="carboxylate-amine ligase" /protein_id="NP_214947.1" /db_xref="GI:15607574" /db_xref="GeneID:886356" /translation="MPARRSAARIDFAGSPRPTLGVEWEFALVDSQTRDLSNEATAVI AEIGENPRVHKELLRNTVEIVSGICECTAEAMQDLRDTLGPARQIVRDRGMELFCAGT HPFARWSAQKLTDAPRYAELIKRTQWWGRQMLIWGVHVHVGIRSAHKVMPIMTSLLNY YPHLLALSASSPWWGGEDTGYASNRAMMFQQLPTAGLPFHFQRWAEFEGFVYDQKKTG IIDHMDEIRWDIRPSPHLGTLEVRICDGVSNLRELGALVALTHCLIVDLDRRLDAGET LPTMPPWHVQENKWRAARYGLDAVIILDADSNERLVTDDLADVLTRLEPVAKSLNCAD ELAAVSDIYRDGASYQRQLRVAQQHDGDLRAVVDALVAELVI" gene 521514..522167 /locus_tag="Rv0434" /db_xref="GeneID:886360" CDS 521514..522167 /locus_tag="Rv0434" /function="UNKNOWN" /note="Rv0434, (MTCY22G10.31), len: 217 aa. Conserved hypothetical protein, similar to AE002052_2 from Deinococcus radiodurans (213 aa), FASTA scores: opt: 258, E(): 4e-10, (31.9% identity in 213 aa overlap); SYCSLRB_122|Q55701 hypothetical 24.5 kDa protein from Synechocystis (214 aa), FASTA scores: opt: 156, E(): 0.00041, (28.4% identity in 204 aa overlap); MXABSGA_1|LON2_MYXXA|P36774 ATP-dependent protease la 2 from Myxococcus xanthus (826 aa), FASTA scores: opt: 160, E(): 0.00068, (28.4% identity in 197 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214948.1" /db_xref="GI:15607575" /db_xref="GeneID:886360" /translation="MADFAPVELAMFPLESAPLPDEDLPLHIFEPRYAALVRDCMDTA DPRFGVVLISRGREVGGGDTRCDVGTLARITECADAGSGRYMLRCRVGERIRVCDWLP DDPYPRAKVRFWPDQPGHPVTAAQLLEVEDRVVALFERIAAARGVRLPAREVVLGYPV VDPADTGQRLYALACRVPMGPADRYAVLATPSAADRLVRLGDALDSVAAMVEFELST" gene complement(522347..524533) /locus_tag="Rv0435c" /db_xref="GeneID:886352" CDS complement(522347..524533) /locus_tag="Rv0435c" /EC_number="3.6.1.-" /function="ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF UNDETERMINATED SUBSTRATE WITH HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED SUBSTRATE(IN) = ADP + PHOSPHATE + UNDETERMINATED SUBSTRATE(OUT)]." /note="Rv0435c, (MTCY22G10.32c), len: 728 aa. Putative conserved ATPase (EC 3.6.1.-), similar to others e.g. SAV_SULAC|Q07590 sav protein involved in cell division from sulfolobus acidocaldarius (780 aa), FASTA scores: opt: 897, E(): 0, (34.5% identity in 693 aa overlap); NP_148637.1|7435761|B72479 transitional endoplasmic reticulum ATPase from Aeropyrum pernix (699 aa); etc. Also similar to Rv3610c and Rv2115c from Mycobacterium tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00674 AAA-protein family signature." /codon_start=1 /transl_table=11 /product="putative ATPase" /protein_id="NP_214949.1" /db_xref="GI:15607576" /db_xref="GeneID:886352" /translation="MTHPDPARQLTLTARLNTSAVDSRRGVVRLHPNAIAALGIREWD AVSLTGSRTTAAVAGLAAADTAVGTVLLDDVTLSNAGLREGTEVIVSPVTVYGARSVT LSGSTLATQSVPPVTLRQALLGKVMTVGDAVSLLPRDLGPGTSTSAASRALAAAVGIS WTSELLTVTGVDPDGPVSVQPNSLVTWGAGVPAAMGTSTAGQVSISSPEIQIEELKGA QPQAAKLTEWLKLALDEPHLLQTLGAGTNLGVLVSGPAGVGKATLVRAVCDGRRLVTL DGPEIGALAAGDRVKAVASAVQAVRHEGGVLLITDADALLPAAAEPVASLILSELRTA VATAGVVLIATSARPDQLDARLRSPELCDRELGLPLPDAATRKSLLEALLNPVPTGDL NLDEIASRTPGFVVADLAALVREAALRAASRASADGRPPMLHQDDLLGALTVIRPLSR SASDEVTVGDVTLDDVGDMAAAKQALTEAVLWPLQHPDTFARLGVEPPRGVLLYGPPG CGKTFVVRALASTGQLSVHAVKGSELMDKWVGSSEKAVRELFRRARDSAPSLVFLDEL DALAPRRGQSFDSGVSDRVVAALLTELDGIDPLRDVVMLGATNRPDLIDPALLRPGRL ERLVFVEPPDAAARREILRTAGKSIPLSSDVDLDEVAAGLDGYSAADCVALLREAALT AMRRSIDAANVTAADLATARETVRASLDPLQVASLRKFGTKGDLRS" misc_feature complement(522674..522730) /locus_tag="Rv0435c" /note="PS00674 AAA-protein family signature" misc_feature complement(522998..523021) /locus_tag="Rv0435c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(524530..525390) /gene="pssA" /locus_tag="Rv0436c" /db_xref="GeneID:886385" CDS complement(524530..525390) /gene="pssA" /locus_tag="Rv0436c" /EC_number="2.7.8.8" /function="INVOLVED IN PHOSPHOLIPID BIOSYNTHESIS. GENERATES PHOSPHATIDYLSERINE [CATALYTIC ACTIVITY: CDP-diacylglycerol + L-serine = CMP + O-Sn-phosphatidyl-L-serine]." /note="Rv0436c, (MTCY22G10.33c), len: 286 aa. Probable pssA, PS synthase (CDP-diacylglycerol--serine O-phosphatidyltransferase) (EC 2.7.8.8) (see citation below), integral membrane protein, equivalent to AL035159|MLCB1450_9|T44730 from Mycobacterium leprae (300 aa), FASTA scores: opt: 1506, E(): 0, (77.9% identity in 285 aa overlap). Also highly similar to others e.g. NP_108059.1|14027250|BAB54204.1|AP003012 phosphatidylserine synthase from Mesorhizobium loti (248 aa); PSS_BACSU|P39823 cdp-diacylglycerol--serine o-phosphatidyltransferase from Bacillus subtilis (177 aa), FASTA scores: opt: 277, E(): 9.9e-12, (33.3% identity in 183 aa overlap); etc. Contains PS00379 CDP-alcohol phosphatidyltransferases signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY." /codon_start=1 /transl_table=11 /product="CDP-diacylglycerol--serine O-phosphatidyltransferase" /protein_id="NP_214950.1" /db_xref="GI:15607577" /db_xref="GeneID:886385" /translation="MIGKPRGRRGVNLQILPSAMTVLSICAGLTAIKFALEHQPKAAM ALIAAAAILDGLDGRVARILDAQSRMGAEIDSLADAVNFGVTPALVLYVSMLSKWPVG WVVVLLYAVCVVLRLARYNALQDDGTQPAYAHEFFVGMPAPAGAVSMIGLLALKMQFG EGWWTSGWFLSFWVTGTSILLVSGIPMKKMHAVSVPPNYAAALLAVLAICAAAAVLAP YLLIWVIIIAYMCHIPFAVRSQRWLAQHPEVWDDKPKQRRAVRRASRRAHPYRPSMAR LGLRKPGRRL" misc_feature complement(525154..525222) /gene="pssA" /locus_tag="Rv0436c" /note="PS00379 CDP-alcohol phosphatidyltransferases signature" gene complement(525387..526082) /gene="psd" /locus_tag="Rv0437c" /db_xref="GeneID:886350" CDS complement(525387..526082) /gene="psd" /locus_tag="Rv0437c" /EC_number="4.1.1.65" /function="UNKNOWN, BUT INVOLVED IN LIPID METABOLISM [CATALYTIC ACTIVITY: PHOSPHATIDYL-L-SERINE = PHOSPHATIDYLETHANOLAMINE + CO(2)]." /note="catalyzes the decarboxylaton of phospatidyl-L-sering to phosphatidylethanolamine" /codon_start=1 /transl_table=11 /product="phosphatidylserine decarboxylase" /protein_id="NP_214951.1" /db_xref="GI:15607578" /db_xref="GeneID:886350" /translation="MARRPRPDGPQHLLALVRSAVPPVHPAGRPFIAAGLAIAAVGHR YRWLRGTGLLAAAACAGFFRHPQRVPPTRPAAIVAPADGVICAIDSAAPPAELSMGDT PLPRVSIFLSILDAHVQRAPVSGEVIAVQHRPGRFGSADLPEASDDNERTSVRIRMPN GAEVVAVQIAGLVARRIVCDAHVGDKLAIGDTYGLIRFGSRLDTYLPAGAEPIVNVGQ RAVAGETVLAECR" gene complement(526143..527360) /gene="moeA2" /locus_tag="Rv0438c" /db_xref="GeneID:886348" CDS complement(526143..527360) /gene="moeA2" /locus_tag="Rv0438c" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS: INVOLVED IN THE BIOSYNTHESIS OF A DEMOLYBDO-COFACTOR (MOLYBDOPTERIN), NECESSARY FOR MOLYBDO-ENZYMES (BY SIMILARITY)." /note="Rv0438c, (MTV037.02c), len: 405 aa. Probable moeA2, molybdenum cofactor biosynthesis protein, highly similar to many e.g. Y10817|ANY10817_2 from A. nicotinovorans (429 aa), FASTA scores: opt: 786, E(): 0, (39.2% identity in 398 aa overlap); etc. Also similar to MOEA1|Rv0994|MTCI237.08|O05577 PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis (426 aa), FASTA scores: opt: 667, E(): 2e-32, (36.5% identity in 425 aa overlap). TBparse score is 0.889. Note that previously known as moeA3.; moeA3" /codon_start=1 /transl_table=11 /product="molybdopterin biosynthesis protein MoeA2" /protein_id="YP_177725.1" /db_xref="GI:57116730" /db_xref="GeneID:886348" /translation="MRSVQEHQRVVAEMMRACRPITVPLTQAQGLVLGGDVVAPLSLP VFDNSAMDGYAVRAEDTSGATPQNPVMLPVAEDIPAGRADMLTLQPVTAHRIMTGAPV PTGATAIVPVEATDGGVDSVAIRQQATPGKHIRRSGEDVAAGTTVLHNGQIVTPAVLG LAAALGLAELPVLPRQRVLVISTGSELASPGTPLQPGQIYESNSIMLAAAVRDAGAAV VATATAGDDVAQFGAILDRYAVDADLIITSGGVSAGAYEVVKDAFGSADYRGGDHGVE FVKVAMQPGMPQGVGRVAGTPIVTLPGNPVSALVSFEVFIRPPLRMAMGLPDPYRPHR SAVLTASLTSPRGKRQFRRAILDHQAGTVISYGPPASHHLRWLASANGLLDIPEDVVE VAAGTQLQVWDLT" gene complement(527379..528314) /locus_tag="Rv0439c" /db_xref="GeneID:886342" CDS complement(527379..528314) /locus_tag="Rv0439c" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0439c, (MTV037.03c), len: 311 aa. Probable dehydrogenase/reductase (EC 1.-.-.-), equivalent to AL035159|MLCB1450_6|T44727 probable oxidoreductase from Mycobacterium leprae (304 aa), FASTA scores: opt: 1360, E(): 0, (69.2% identity in 302 aa overlap). Also highly similar to various oxidoreductases, generally dehydrogenases/reductases e.g. PA5031|C83017|9951320|AAG08416.1|AE004916_5|AE004916 probable short chain dehydrogenase from Pseudomonas aeruginosa (309 aa); Q03326|OXIR_STRAT PROBABLE OXIDOREDUCTASE from Streptomyces antibioticus (298 aa), FASTA scores: opt: 400, E(): 1.2e-18, (34.6% identity in 298 aa overlap); etc. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_214953.1" /db_xref="GI:15607580" /db_xref="GeneID:886342" /translation="MTANDNKTRKWSAADVPDQSGRVVVVTGANTGIGYHTAAVFADR GAHVVLAVRNLEKGNAARARIMAARPGAHVTLQQLDLCSLDSVRAAADALRTAYPRID VLINNAGVMWTPKQVTKDGFELQFGTNHLGHFALTGLVLDHMLPVPGSRVVTVSSQGH RIHAAIHFDDLQWERRYNRVAAYGQAKLANLLFTYELQRRLGEAGKSTIAVAAHPGGS NTELTRNLPRLIRPVATVLGPLLFQSPEMGALPTLRAATDPTTQGGQYYGPDGFGEQR GHPKVVQSSAQSHDKDLQRRLWTVSEELTGVSFGV" gene 528608..530230 /gene="groEL" /locus_tag="Rv0440" /db_xref="GeneID:886354" CDS 528608..530230 /gene="groEL" /locus_tag="Rv0440" /function="PREVENTS MISFOLDING AND PROMOTES THE REFOLDING AND PROPER ASSEMBLY OF UNFOLDED POLYPEPTIDES GENERATED UNDER STRESS CONDITIONS." /experiment="experimental evidence, no additional details recorded" /note="60 kDa chaperone family; promotes refolding of misfolded polypeptides especially under stressful conditions; forms two stacked rings of heptamers to form a barrel-shaped 14mer; ends can be capped by GroES; misfolded proteins enter the barrel where they are refolded when GroES binds; many bacteria have multiple copies of the groEL gene which are active under different environmental conditions; the B.japonicum protein in this cluster is expressed constitutively; in Rhodobacter, Corynebacterium and Rhizobium this protein is essential for growth" /codon_start=1 /transl_table=11 /product="chaperonin GroEL" /protein_id="NP_214954.1" /db_xref="GI:15607581" /db_xref="GeneID:886354" /translation="MAKTIAYDEEARRGLERGLNALADAVKVTLGPKGRNVVLEKKWG APTITNDGVSIAKEIELEDPYEKIGAELVKEVAKKTDDVAGDGTTTATVLAQALVREG LRNVAAGANPLGLKRGIEKAVEKVTETLLKGAKEVETKEQIAATAAISAGDQSIGDLI AEAMDKVGNEGVITVEESNTFGLQLELTEGMRFDKGYISGYFVTDPERQEAVLEDPYI LLVSSKVSTVKDLLPLLEKVIGAGKPLLIIAEDVEGEALSTLVVNKIRGTFKSVAVKA PGFGDRRKAMLQDMAILTGGQVISEEVGLTLENADLSLLGKARKVVVTKDETTIVEGA GDTDAIAGRVAQIRQEIENSDSDYDREKLQERLAKLAGGVAVIKAGAATEVELKERKH RIEDAVRNAKAAVEEGIVAGGGVTLLQAAPTLDELKLEGDEATGANIVKVALEAPLKQ IAFNSGLEPGVVAEKVRNLPAGHGLNAQTGVYEDLLAAGVADPVKVTRSALQNAASIA GLFLTTEAVVADKPEKEKASVPGGGDMGGMDF" misc_feature 529814..529849 /gene="groEL" /locus_tag="Rv0440" /note="PS00296 Chaperonins cpn60 signature" gene complement(530296..530724) /locus_tag="Rv0441c" /db_xref="GeneID:886344" CDS complement(530296..530724) /locus_tag="Rv0441c" /function="UNKNOWN" /note="Rv0441c, (MTV037.05c), len: 142 aa. Hypothetical unknown protein. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214955.1" /db_xref="GI:15607582" /db_xref="GeneID:886344" /translation="MGAKKVDLKRLAAALPDYPFAYLITVDDGHRVHTVAVEPVLREL PDGPDGPRAVVDVGLIGGRTRQNLAHRSEVTLLWPPSDPSGYSLIVDGRAQASDAGPD DDTARCGVVPIRALLHRDAAPDSPTAAKGCLHDCVVFSVP" gene complement(530751..532214) /gene="PPE10" /locus_tag="Rv0442c" /db_xref="GeneID:886340" CDS complement(530751..532214) /gene="PPE10" /locus_tag="Rv0442c" /function="UNKNOWN" /note="Rv0442c, (MTV037.06c), len: 487 aa. Member of the Mycobacterium tuberculosis PPE family, nearly identical to hypothetical protein from Mycobacterium tuberculosis (strain Erdman) and to AN5S46909_1 protein fragment from Mycobacterium bovis (302 aa); P42611|YHS6_MYCTU HYPOTHETICAL 50.6 kDa PROTEIN (517 aa), FASTA scores: opt: 3144, E(): 0, (98.4 identity in 492 aa overlap); and S46909|S46909_1 (302 aa), FASTA scores: opt: 1897, E(): 0, (98.0% identity in 302 aa overlap). TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177726.1" /db_xref="GI:57116731" /db_xref="GeneID:886340" /translation="MTSPHFAWLPPEINSALMFAGPGSGPLIAAATAWGELAEKLLAS IASLGSVTSELTSGAWLGPSAAAMMAVATQYLAWLSTAAAQAEQAAAQAMAIATAFEA ALAATVQPAVVAANRGLMQLLAATNWFGQNAPALMDVEAAYEQMWALDVAAMAGYHFD ASAAVAQLAPWQQVLRNLGIDIGKNGQINLGFGNTGSGNIGNNNIGNNNIGSGNTGTG NIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNIGFGITGDHQMGFGGFNSGSGNIGFGN SGTGNVGLFNSGSGNIGIGNSGSLNSGIGTSGTINAGLGSAGSLNTSFWNAGMQNAAL GSAAGSEAALVSSAGYATGGMSTAALSSGILASALGSTGGLQHGLANVLNSGLTNTPV AAPASAPVGGLDSGNPNPGSGSAAAGSGANPGLRSPGTSYPSFVNSGSNDSGLRNTAV REPSTPGSGIPKSNFYPSPDRESAYASPRIGQPVGSE" gene 532396..532911 /locus_tag="Rv0443" /db_xref="GeneID:886336" CDS 532396..532911 /locus_tag="Rv0443" /function="UNKNOWN" /note="Rv0443, (MTV037.07), len: 171 aa. Conserved hypothetical protein, highly similar to AL049863|SC5H1_23|T35339 hypothetical protein from Streptomyces coelicolor (171 aa), FASTA scores: opt: 561, E(): 2.3e-32, (49.7% identity in 165 aa overlap); and CAC42482.1|AJ318385 hypothetical protein from Amycolatopsis mediterranei (163 aa). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214957.1" /db_xref="GI:15607584" /db_xref="GeneID:886336" /translation="MASTDAAAQELLRDAFTRLIEHVDELTDGLTDQLACYRPTPSAN SIAWLLWHSARVQDIQVAHVAGVEEVWTRDGWVDRFGLDLPRHDTGYGHRPEDVAKVR APADLLSGYYHAVHKLTLEYIAGMTADELSRVVDTSWNPPVTVSARLVSIVDDCAQHL GQAAYLRGIAR" gene complement(533091..533789) /locus_tag="Rv0444c" /db_xref="GeneID:886346" CDS complement(533091..533789) /locus_tag="Rv0444c" /function="UNKNOWN" /note="Rv0444c, (MTV037.08c), len: 232 aa. Conserved hypothetical protein; C-terminus similar to P12752|Y24K_STRGR HYPOTHETICAL 24.7 kDa PROTEIN from Streptomyces griseus (238 aa), FASTA scores: opt: 207, E(): 2.2e-05, (32.9% identity in 158 aa overlap). TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214958.1" /db_xref="GI:15607585" /db_xref="GeneID:886346" /translation="MTEHTDFELLELATPYALNAVSDDERADIDRRVAAAPSPVAAAF NDEVRAVRETMAVVSAATTAEPPAHLRTAILDATKPEVRRQSRWRTAAFASAAAIAVG LGAFGLGVLTRPSPPPTVAEQVLTAPDVRTVSRPLGAGTATVVFSRDRNTGLLVMNNV APPSRGTVYQMWLLGGAKGPRSAGTMGTAAVTPSTTATLTDLGASTALAFTVEPGTGS PQPTGTILAELPLG" gene complement(533833..534396) /gene="sigK" /locus_tag="Rv0445c" /db_xref="GeneID:886334" CDS complement(533833..534396) /gene="sigK" /locus_tag="Rv0445c" /function="INVOLVED IN TRANSCRIPTION MECHANISM. THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED." /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription; in M. bovis this protein has been shown to be involved in expression of antigenic proteins" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigK" /protein_id="NP_214959.1" /db_xref="GI:15607586" /db_xref="GeneID:886334" /translation="MTGPPRLSSDLDALLRRVAGHDQAAFAEFYDHTKSRVYGLVMRV LRDTGYSEETTQEIYLEVWRNASEFDSAKGSALAWLLTMAHRRAVDRVRCEQAGNQRE VRYGAANVDPASDVVADLAIAGDERRRVTECLKALTDTQRQCIELAYYGGLTYVEVSR RLAANLSTIKSRMRDALRSLRNCLDVS" gene complement(534445..535215) /locus_tag="Rv0446c" /db_xref="GeneID:886332" CDS complement(534445..535215) /locus_tag="Rv0446c" /function="UNKNOWN" /note="Rv0446c, (MTV037.10c), len: 256 aa. Possible conserved transmembrane protein, similar at N-terminus to U1740AF|U15183|MLU15183_40 from Mycobacterium leprae (117 aa), FASTA scores: opt: 175, E(): 2.5e-05, (62.5% identity in 40 aa overlap); and at C-terminus to AL021529|SC10A5_3 from Streptomyces coelicolor (226 aa), FASTA scores: opt: 207, E(): 9.8e-07, (34.2% identity in 114 aa overlap). Also similar to others hypothetical proteins e.g. AAK04680.1|AE006291_14|AE006291) HYPOTHETICAL PROTEIN from Lactococcus lactis subsp. lactis (257 aa). TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214960.1" /db_xref="GI:15607587" /db_xref="GeneID:886332" /translation="MVTSVSALAVAVVHSVAFAIGRRIGRYNVVDVVWGLGFVAVAVA AATLGHGDPVRRWLLLALVSTWGLRLSWHMYRKTAGQGEDPRYADLLRGATPVQALRK VFGLQGLLTLFVSFPLQLSAVTGPTPKPLLAVGGVGLAVWLVGITFEAVGDWQLWVFK SDPANRGVIMDRGLWAWTRHPNYFGDACVWWGLWLITINDWAPLATVGSPLLMTYLLV DVSGARLTERYLKGRPGFAEYQRRTAYFVPRPPRSARR" gene complement(535224..536507) /gene="ufaA1" /locus_tag="Rv0447c" /db_xref="GeneID:886330" CDS complement(535224..536507) /gene="ufaA1" /locus_tag="Rv0447c" /EC_number="2.1.1.79" /function="TRANSFERS A METHYLENE GROUP FROM S-ADENOSYL-L-METHIONINE TO THE CIS DOUBLE BOND OF AN UNSATURATED FATTY ACID CHAIN RESULTING IN THE REPLACEMENT OF THE DOUBLE BOND WITH A METHYLENE BRIDGE [CATALYTIC ACTIVITY: S-adenosyl-L-methionine + phospholipid olefinic fatty acid = S-adenosyl-L-homocysteine + phospholipid cyclopropane fatty acid]." /note="Rv0447c, (MTV037.11c), len: 427 aa (start uncertain). Probable ufaA1, cyclopropane-fatty-acyl-phospholipid synthase (EC 2.1.1.79), similar to others e.g. NP_102178.1|14021351|BAB47964.1|AP002994 cyclopropane-fatty-acyl-phospholipid synthase from Mesorhizobium loti (378 aa); B82240|9655593|AAF94281.1|AE004192 cyclopropane-fatty-acyl-phospholipid synthase from Vibrio cholerae (432 aa); P30010|CFA_ECOLI CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE from Escherichia coli strain K-12 (382 aa); X55704|PPLPD_3 LPD-3 from P.putida (394 aa), FASTA scores: opt: 556, E(): 2.8e-30, (33.3% identity in 387 aa overlap); AE0005|HPAE000557_9 from Helicobacter pylori (389 aa), FASTA scores: opt: 539, E(): 3.9e-29, (34.3% identity in 382 aa overlap). TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="cyclopropane-fatty-acyl-phospholipid synthase" /protein_id="NP_214961.1" /db_xref="GI:15607588" /db_xref="GeneID:886330" /translation="MTVETSQTPSAAIDSDRWPAVAKVPRGPLAAASAAIANRLLRRT ATHLPLRLVYSDGTATGAADPRAPSLFIHRPDALARRIGRHGLIGFGESYMAGEWSSK ELTRVLTVLAGSVDELVPRSLHWLRPITPTFRPSWPDHSRDQARRNIAVHYDLSNDLF AAFLDETMTYSCAMFTDLLAQPTPAWTELAAAQRRKIDRLLDVAGVQQGSHVLEIGTG WGELCIRAAARGAHIRSVTLSVEQQRLARQRVAAAGFGHRVEIDLCDYRDVDGQYDSV VSVEMIEAVGYRSWPRYFAALEQLVRPGGPVAIQAITMPHHRMLATRHTQTWIQKYIF PGGLLPSTQAIIDITGQHTGLRIVDAASLRPHYAETLRLWRERFMQRRDGLAHLGFDE VFARMWELYLAYSEAGFRSGYLDVYQWTLIREGPP" gene complement(536504..537169) /locus_tag="Rv0448c" /db_xref="GeneID:886328" CDS complement(536504..537169) /locus_tag="Rv0448c" /function="UNKNOWN" /note="Rv0448c, (MTV037.12c), len: 221 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. Z74841|BOD5A2_1 from B. oleracea (283 aa), FASTA scores: opt: 257, E(): 1.4e-10, (32.0% identity in 197 aa overlap); etc. Some similarity to U15183|MLU15183_38 from Mycobacterium leprae (82 aa), FASTA scores: opt: 134, E(): 0.014, (71.0% identity in 31 aa overlap). TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214962.1" /db_xref="GI:15607589" /db_xref="GeneID:886328" /translation="MHHSFAYRSYSWYVDVDNLPQLPWWLRPFARFHADDHFADPFSC PPHSSLRDRLDAFFAARGLAVPDGRITALLQARVLGYVFNPLSIFWCHDRDGQLRHVI AEVHNTYGGRHAYLLPPADLPVVTAKNFYVSPFHQLAGYYLIRAPRPDRELDVTVTLH RDRRQVCPEFTATLRGQRRPATTRQIAMMQIISPLAPMVVAARIRIQGIRLWLRRVPV VPR" gene complement(537229..538548) /locus_tag="Rv0449c" /db_xref="GeneID:886326" CDS complement(537229..538548) /locus_tag="Rv0449c" /function="UNKNOWN" /note="Rv0449c, (MTV037.13c), len: 439 aa. Conserved hypothetical protein, some similarity with several hypothetical proteins and various enzymes e.g. AAK24569.1|AE005927 amine oxidase, flavin-containing from Caulobacter crescentus (454 aa); BAB02771.1|AB023036 mycolic acid methyl transferase-like protein from Arabidopsis thaliana (842 aa); BAB01742.1|AP000374 protein which contains similarity to cyclopropane fatty acid synthase from Arabidopsis thaliana (793 aa); etc. Has hydrophobic stretch at N-terminus. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214963.1" /db_xref="GI:15607590" /db_xref="GeneID:886326" /translation="MQQSLRRSVAVVGSGVAGLTAAYILSGRDRVTLYEADGRLGGHA HTHYLDNGGGPRGTDVVGVDSAFLVHNDRTYPTLCRLFAELGVATQESEMSMSVRADD IGLEYAGALGARGLFACRQSLRPRYLCMLAEILRFHRAAARLLREETDNAEDKPETLE AFLSRHHFSQYFVDYFITPLVAAVWSCGGADALRYPARYLFVFLDHHGMLSVFGSPTW RTVTGGSANYVQAIAAQLDEVSTRTPVHSLRRLPDGVLVGAGDGPSRRFDAAVVAVHP DQALLLLDEPTPAERAVLGAIAYSTNSAQLHTDESVLPRHHRARASWNYLVTPGQHQV VVSYDISRLMRLDGGRRYLVTLGGHDRVDPSSVIAEMTYSHPLYTPESVAAQRLLPTL GDNRVVFAGAYHGWGFHEDGAASGLRAARRLGADWPAAIPQEAMVAC" gene complement(538588..541491) /gene="mmpL4" /locus_tag="Rv0450c" /db_xref="GeneID:886323" CDS complement(538588..541491) /gene="mmpL4" /locus_tag="Rv0450c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0450c, (MTV037.14c), len: 967 aa. Probable mmpL4, conserved transmembrane transport protein (see citations below), member of RND superfamily, equivalent to U1740V|P54881|YV34_MYCLE HYPOTHETICAL 105.2 kDa PROTEIN from Mycobacterium leprae (959 aa), FASTA scores: opt: 5051, E(): 0, (78.4% identity in 962 aa overlap). Also highly similar to other proteins from Mycobacterium tuberculosis e.g. Z83860|MTCY98.08 (962 aa), FASTA scores: opt: 3917, E(): 0, (61.3% identity in 950 aa overlap), MTCY20G9.34, etc. Contains PS00211 ABC transporters family signature. BELONGS TO THE MMPL FAMILY. TBparse score is 0.948." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL4" /protein_id="NP_214964.1" /db_xref="GI:15607591" /db_xref="GeneID:886323" /translation="MSTKFANDSNTNARPEKPFIARMIHAFAVPIILGWLAVCVVVTV FVPSLEAVGQERSVSLSPKDAPSFEAMGRIGMVFKEGDSDSFAMVIIEGNQPLGDAAH KYYDGLVAQLRADKKHVQSVQDLWGDPLTAAGVQSNDGKAAYVQLSLAGNQGTPLANE SVEAVRSIVESTPAPPGIKAYVTGPSALAADMHHSGDRSMARITMVTVAVIFIMLLLV YRSIITVVLLLITVGVELTAARGVVAVLGHSGAIGLTTFAVSLLTSLAIAAGTDYGIF IIGRYQEARQAGEDKEAAYYTMYRGTAHVILGSGLTIAGATFCLSFARMPYFQTLGIP CAVGMLVAVAVALTLGPAVLHVGSRFGLFDPKRLLKVRGWRRVGTVVVRWPLPVLVAT CAIALVGLLALPGYKTSYNDRDYLPDFIPANQGYAAADRHFSQARMKPEILMIESDHD MRNPADFLVLDKLAKGIFRVPGISRVQAITRPEGTTMDHTSIPFQISMQNAGQLQTIK YQRDRANDMLKQADEMATTIAVLTRMHSLMAEMASTTHRMVGDTEEMKEITEELRDHV ADFDDFWRPIRSYFYWEKHCYGIPICWSFRSIFDALDGIDKLSEQIGVLLGDLREMDR LMPQMVAQIPPQIEAMENMRTMILTMHSTMTGIFDQMLEMSDNATAMGKAFDAAKNDD SFYLPPEVFKNKDFQRAMKSFLSSDGHAARFIILHRGDPQSPEGIKSIDAIRTAAEES LKGTPLEDAKIYLAGTAAVFHDISEGAQWDLLIAAISSLCLIFIIMLIITRAFIAAAV IVGTVALSLGASFGLSVLLWQHILAIHLHWLVLAMSVIVLLAVGSDYNLLLVSRFKQE IGAGLKTGIIRSMGGTGKVVTNAGLVFAVTMASMAVSDLRVIGQVGTTIGLGLLFDTL IVRSFMTPSIAALLGRWFWWPLRVRSRPARTPTVPSETQPAGRPLAMSSDRLG" misc_feature complement(540445..540489) /gene="mmpL4" /locus_tag="Rv0450c" /note="PS00211 ABC transporters family signature" gene complement(541488..541910) /gene="mmpS4" /locus_tag="Rv0451c" /db_xref="GeneID:886321" CDS complement(541488..541910) /gene="mmpS4" /locus_tag="Rv0451c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0451c, (MTV037.15c), len: 140 aa. Probable mmpS4, conserved membrane protein (see citations below), equivalent to U1740W|P54880|YV33_MYCLE HYPOTHETICAL 16.9 kDa PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 727, E(): 0, (75.9% identity in 137 aa overlap). Also similar to other Mycobacterial proteins e.g. Z84725|MTCY04D9.16c from Mycobacterium tuberculosis (142 aa), FASTA scores: opt: 451, E(): 3.2e-24, (50.0% identity in 138 aa overlap); etc. BELONGS TO THE MMPS FAMILY. TBparse score is 0.953." /codon_start=1 /transl_table=11 /product="membrane protein" /protein_id="NP_214965.1" /db_xref="GI:15607592" /db_xref="GeneID:886321" /translation="MLMRTWIPLVILVVVIVGGFTVHRIRGFFGSENRPSYSDTNLEN SKPFNPKHLTYEIFGPPGTVADISYFDVNSEPQRVDGAVLPWSLHITTNDAAVMGNIV AQGNSDSIGCRITVDGKVRAERVSNEVNAYTYCLVKSA" gene 542142..542852 /locus_tag="Rv0452" /db_xref="GeneID:886319" CDS 542142..542852 /locus_tag="Rv0452" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0452, (MTV037.16), len: 236 aa. Possible transcriptional regulator, similar to several putative TetR-family transcriptional regulators from Streptomyces coelicolor. Also similar in N-terminus to U1740Y|U15183|MLU15183_33 from Mycobacterium leprae (67 aa), FASTA score: (76.1% identity in 67 aa overlap). Contains probable helix-turn-helix motif at aa 44-65 (Score 1727, +5.07 SD). TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214966.1" /db_xref="GI:15607593" /db_xref="GeneID:886319" /translation="MRYPLAVAQLGFQRARTEENKRQRAAALVEAARSLALETGVASV TLTAVAGRAGIHYSAVRRYFTSHKEVLLHLAAEGWARWSGTVCEQLGEPGPMSAPRVA EALANGLAADPLFCDLLANLHLHLEQEVDVDRVIEVKRTSIAAVIALVDAIESALPAL GRSGAFDILLAAYSLAATLWQIANPPERLTDAYAEEPELLPPEWNLDFAAALTRLLTA TLLGLLAGSPCECRSPTR" gene 543174..544730 /gene="PPE11" /locus_tag="Rv0453" /db_xref="GeneID:886317" CDS 543174..544730 /gene="PPE11" /locus_tag="Rv0453" /function="UNKNOWN" /note="Rv0453, (MTV037.17), len: 518 aa. Member of the Mycobacterium tuberculosis PPE family, similar to many e.g. AL0212|MTV012_32 from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 882, E(): 7e-31, (41.8% identity in 514 aa overlap). TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177727.1" /db_xref="GI:57116732" /db_xref="GeneID:886317" /translation="MTSALIWMASPPEVHSALLSSGPGPGPVLAAATGWSSLGREYAA VAEELGALLAAVQAGVWQGPSAESFAAACLPYLSWLTQASADCAAAAARLEAVTAAYA AALVAMPTLAELAANHATHGAMVATNFFGINTIPIAVNEADYVRMWLQAATTMATYQA VADSAVRSIPDSVPPPRILKSNAQSQHSSSNNSGGADPVDDFIAEILKIITGGRVIWD PEAGTVNGLPYDAYTNPGTLMWWIARSLELLQDFQEFAKLLFTNPVKAFQFLVDLILF DWPTHMLQLATWLAENPQLLVAALTPAISGLGAVSGLAGLTGLVPQPPVVPAPAPDAV VPTVLPLAGTATPTTAPASAPAAGAAPGPPAGTATATSASVPTSAGGFPPYLVGSGPG IDFDAGTPAGSRRAQPAADNVTAVAAAQVSARHQARRRRRAAAKERGNADEFVDMDSG PAIPPSGERDAWASNSGVGGLGFAGTASNETVAAPAGLTTLADDEFQCGPRMPMLPGA WDLGTWDRGD" gene 544835..545185 /locus_tag="Rv0454" /db_xref="GeneID:886339" CDS 544835..545185 /locus_tag="Rv0454" /function="UNKNOWN" /note="Rv0454, (MTV037.18), len: 116 aa (start uncertain). Conserved hypothetical protein, showing similarity with AAA63007.1|U15183 hypothetical protein from Mycobacterium leprae (115 aa), FASTA scores: opt: 151, E(): 0.0019, (31.5% identity in 89 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214968.1" /db_xref="GI:15607595" /db_xref="GeneID:886339" /translation="MKQDFGLDVPQAGNAQNFDGVPEWVQVGVVTFVYRMQMHHVTRP VGAPGSGLAGDSTPVQGRQRVWDLVAGRLTHAPRSSVQAMRPTMFTSAPQRHGIPARG RWWLGYQERSRAWP" gene complement(545375..545821) /locus_tag="Rv0455c" /db_xref="GeneID:886314" CDS complement(545375..545821) /locus_tag="Rv0455c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0455c, (MTV037.19c), len: 148 aa. Conserved hypothetical protein, equivalent to CAC31896.1|AL583925 possible secreted protein from Mycobacterium leprae (153 aa). Has hydrophobic stretch at N-terminus. TBparse score is 0.947." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214969.1" /db_xref="GI:15607596" /db_xref="GeneID:886314" /translation="MSRLSSILRAGAAFLVLGIAAATFPQSAAADSTEDFPIPRRMIA TTCDAEQYLAAVRDTSPVYYQRYMIDFNNHANLQQATINKAHWFFSLSPAERRDYSEH FYNGDPLTFAWVNHMKIFFNNKGVVAKGTEVCNGYPAGDMSVWNWA" gene complement(545889..546803) /gene="echA2" /locus_tag="Rv0456c" /db_xref="GeneID:886312" CDS complement(545889..546803) /gene="echA2" /locus_tag="Rv0456c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_214970.1" /db_xref="GI:15607597" /db_xref="GeneID:886312" /translation="MPTPDFQTLLYTTAGPVATITLNRPEQLNTIVPPMPDEIEAAIG LAERDQDIKVIVLRGAGRAFSGGYDFGGGFQHWGDAMMTDGRWDPGKDFAMVTARETG PTQKFMAIWRASKPVIAQVHGWCVGGASDYALCADIVIASEDAVIGTPYSRMWGAYLT GMWLYRLSLAKVKWHSLTGRPLTGVQAAEAELINEAVPFERLEARVAEIATELARIPL SQLQAQKLIVNQAYENMGLASTQLLGGILDGLMRNTPDALEFIRTAQTQGVRAAVERR DGPFGDYSQAPPELRPDPTHVITPDGSM" gene complement(547076..547357) /locus_tag="Rv0456A" /db_xref="GeneID:3205039" CDS complement(547076..547357) /locus_tag="Rv0456A" /function="UNKNOWN" /note="Rv0456A, len: 93 aa. Conserved hypothetical protein; N-terminus highly similar to N-terminal part of P71650|Rv2801c|MT2869|MTCY16B7.42 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (118 aa), FASTA scores: opt: 303, E(): 1e-14, (60.44% identity in 91 aa overlap). Also some similarity in part with other hypothetical proteins e.g. Q9PHH8|XFA0027 Plasmid maintenance protein from Xylella fastidiosa (108 aa), FASTA scores: opt: 169, E(): 3.9e-05, (50.820% identity in 61 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177621.1" /db_xref="GI:57116733" /db_xref="GeneID:3205039" /translation="MLRGEIWQVDLDPARGSAANMRRPAVIVSNDRANAAAIRLDRGV VPVVPVTSNTEKVPIPGVVAGSERWPGRRFEGAGPAGWIRRCATSPLPS" repeat_region complement(547488..547517) /note="3 copies of a 10 bp near-perfect direct repeat, ATTACTACCTATTACTACGTATTACTATCT" gene complement(547586..549607) /locus_tag="Rv0457c" /db_xref="GeneID:886310" CDS complement(547586..549607) /locus_tag="Rv0457c" /EC_number="3.4.-.-" /function="UNKNOWN; HYDROLYZES PEPTIDES AND/OR PROTEINS." /note="Rv0457c, (MTCI429A.01, MTV038.01c), len: 673 aa. Probable peptidase (EC 3.4.-.-), similar to many e.g. NP_102851.1|14022026|BAB48637.1 probable endopeptidase from Mesorhizobium loti (687 aa); Y4NA_RHISN|P55577 probable peptidase (EC 3.4.21.-) (726 aa), FASTA scores: opt: 1126, E(): 0, (40.9% identity in 491 aa overlap). Also similar to Mycobacterium tuberculosis protein MTCY369.26 FASTA score: (33.8% identity in 299 aa overlap)." /codon_start=1 /transl_table=11 /product="peptidase" /protein_id="NP_214971.1" /db_xref="GI:15607598" /db_xref="GeneID:886310" /translation="MTFEPAPDGADPYLWLEDVTGAEALDWVRARNKPTTAAFCDAEF ERMRVEALEVLDTDARIPYVNRRGNYLYNFWRDAANPRGLWRRTTLDSYRTDSPGWDV LIDVDELGRADDQKWVWGGAGVIEPDYTRALIGLSPGGSDASIVREFDMLTREFVEDG FQLPPAKSQITWEDPDTVLLGTDFGGDSLTTSGYPRVIKRWRRGKPLADAETIFEGAG TDVRVNASADRTPGFERTLLGRALDFWNEEVYELRGSELIRIEAPTDASVSIHRDWLL IELRTDWTVATTRYTAGSLLAAEYDEFLAGSAELQVVFEPDEHTALYQYAWTRDRLLI VTLADVASRVEIATPGSWRREPLSGIPAATNTVIVSADSHGDEFFLDSSGFDTPSRLM RGTDDGRLAEIKSAPAFFDAENMAVTQYFATSDDGTSIPYFVVRRTDADNPGPTLLNG YGGFETSRTPTYDGVLGRLWLARGGTYALANIRGGGEYGPGWHTQAMREGRDKVAQDF AAVATDLVTRGITTAEQLGARGGSNGGLLMGIMLTGYPEKFGALVCDVPLLDMKRYHL LLAGASWMAEYGDPDNPDDWKFISEYSPYQNISANRKYPPVLMTTSTRDDRVHPGHAR KMTAALQAAGHPVWYYENIEGGHAGAADNAQIAFKSALSFAFLWRMLAG" gene 549675..551198 /locus_tag="Rv0458" /db_xref="GeneID:886306" CDS 549675..551198 /locus_tag="Rv0458" /EC_number="1.2.1.3" /function="INTERCONVERSION ALDEHYDE AND ACID [CATALYTIC ACTIVITY: An aldehyde + NAD+ + H2O = an acid + NADH]." /note="Rv0458, (MTV038.02), len: 507 aa. Probable aldehyde dehydrogenase (EC 1.2.1.3), highly similar to many, closest to P46369|THCA_RHOER EPTC-INDUCIBLE ALDEHYDE DEHYDROGENASE from Rhodococcus erythropolis (506 aa), FASTA scores: opt: 2767, E(): 0, (79.7% identity in 507 aa overlap); AAC13641.1|AF029733 chloroacetaldehyde dehydrogenase from Xanthobacter autotrophicus (505 aa), FASTA scores: opt: 2563, E(): 0, (75.4% identity in 492 aa overlap); Q9RJZ6|DHAL_STRCO PROBABLE ALDEHYDE DEHYDROGENASE from Streptomyces coelicolor (507 aa). Also similar to other semialdehyde dehydrogenases in Mycobacterium tuberculosis e.g. Rv0768, Rv2858c. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY. TBparse score is 0.866." /codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase" /protein_id="NP_214972.1" /db_xref="GI:15607599" /db_xref="GeneID:886306" /translation="MTVFSRPGSAGALMSYESRYQNFIGGQWVAPVHGRYFENPTPVT GQPFCEVPRSDAADIDKALDAAHAAAPGWGKTAPAERAAILNMIADRIDKNAAALAVA EVWDNGKPVREALAADIPLAVDHFRYFAAAIRAQEGALSQIDEDTVAYHFHEPLGVVG QIIPWNFPILMAAWKLAPALAAGNTAVLKPAEQTPASVLYLMSLIGDLLPPGVVNVVN GFGAEAGKPLASSDRIAKVAFTGETTTGRLIMQYASHNLIPVTLELGGKSPNIFFADV LAAHDDFCDKALEGFTMFALNQGEVCTCPSRSLIQADIYDEFLELAAIRTKAVRQGDP LDTETMLGSQASNDQLEKVLSYIEIGKQEGAVIIAGGERAELGGDLSGGYYMQPTIFT GTNNMRIFKEEIFGPVVAVTSFTDYDDAIGIANDTLYGLGAGVWSRDGNTAYRAGRDI QAGRVWVNCYHLYPAHAAFGGYKQSGIGREGHQMMLQHYQHTKNLLVSYSDKALGFF" gene 551198..551689 /locus_tag="Rv0459" /db_xref="GeneID:886372" CDS 551198..551689 /locus_tag="Rv0459" /function="UNKNOWN" /note="Rv0459, (MTV038.03), len: 163 aa. Conserved hypothetical protein, highly similar to other hypothetical proteins. Note that highly similar to products of unidentified ORFs in Xanthobacter autotrophicus, AF029733_2 (139 aa), and Rhodococcus erythropolis, REREUTP BC_1 (186 aa). Like MTV038.03, these ORF's are linked to aldehyde dehydrogenase genes. FASTA scores: AF0297|AF029733_2 (139 aa), opt: 439, E(): 6.2e-24, (50.0% identity in 126 aa overlap); and L24492|REREUTPBC_1 (186 aa), opt: 347, E(): 2.1e-17, (52.7% identity in 169 aa overlap). N-terminus also highly similar to AAA63041.1|U15183 ethanolamine permease (eutP) match from Mycobacterium leprae (53 aa). TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214973.1" /db_xref="GI:15607600" /db_xref="GeneID:886372" /translation="MNAPAGVLITAEAAALLAGLQDRHGPVMFHQSGGCCDGSAPMCY PRADFLVGDRDILLGVLDVGEDGVPVWISGPQYQAWKHTQLIIDVVPGRGGGFSLEAP EGVRFLSRGRVFSDAEKAMREAAPVITGAAYECGERPLVRGLVVDLDDPDATPGVCRA SRR" gene 551749..551988 /locus_tag="Rv0460" /db_xref="GeneID:886304" CDS 551749..551988 /locus_tag="Rv0460" /function="UNKNOWN" /note="Rv0460, (MTV038.04), len: 79 aa. Conserved hydrophobic protein, highly similar AAA63024.1|U15183 hypothetical protein from Mycobacterium leprae (56 aa), FASTA scores: opt: 197, E(): 3.7e-09, (63.8% identity in 47 aa overlap). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214974.1" /db_xref="GI:15607601" /db_xref="GeneID:886304" /translation="MLVGNAIGLLAGVACSVLVHARIRPDIVIAMVVGIPSAIGLLVI LFSGRRWVTMLGAFILALAPGWFGVLVAIQVASSG" gene 552026..552550 /locus_tag="Rv0461" /db_xref="GeneID:886302" CDS 552026..552550 /locus_tag="Rv0461" /function="UNKNOWN" /note="Rv0461, (MTV038.05), len: 174 aa (start uncertain). Probable transmembrane protein. TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214975.1" /db_xref="GI:15607602" /db_xref="GeneID:886302" /translation="MPDFDTGAHSQRFLSLAGQQDRAGKSWPGSTPKPQEDPVGVAPS ASVEVLGSEPAATLAHSVTVPGRYTYLKWWKFVLVVLGVWIGAGEVGLSLFYWWYHTL DKTAAVFVVLVYVVACTVGGLILALVPGRPLITALSLGVMSGPFASVAAAAPLYGYYY CERMSHCLVGVIPY" gene 552614..554008 /gene="lpd" /locus_tag="Rv0462" /db_xref="GeneID:886300" CDS 552614..554008 /gene="lpd" /locus_tag="Rv0462" /EC_number="1.8.1.4" /function="INVOLVED IN ENERGY METABOLISM. LIPOAMIDE DEHYDROGENASE IS A COMPONENT OF THE ALPHA-KETOACID DEHYDROGENASE COMPLEXE [CATALYTIC ACTIVITY: DIHYDROLIPOAMIDE + NAD(+) = LIPOAMIDE + NADH]." /experiment="experimental evidence, no additional details recorded" /note="E3 component of alpha keto acid dehydrogenase complexes LpdC; forms a homodimer; binds one molecule of FAD monomer; catalyzes NAD+-dependent oxidation of dihydrolipoyl cofactors that are covalently linked to the E2 component" /codon_start=1 /transl_table=11 /product="dihydrolipoamide dehydrogenase" /protein_id="NP_214976.1" /db_xref="GI:15607603" /db_xref="GeneID:886300" /translation="MTHYDVVVLGAGPGGYVAAIRAAQLGLSTAIVEPKYWGGVCLNV GCIPSKALLRNAELVHIFTKDAKAFGISGEVTFDYGIAYDRSRKVAEGRVAGVHFLMK KNKITEIHGYGTFADANTLLVDLNDGGTESVTFDNAIIATGSSTRLVPGTSLSANVVT YEEQILSRELPKSIIIAGAGAIGMEFGYVLKNYGVDVTIVEFLPRALPNEDADVSKEI EKQFKKLGVTILTATKVESIADGGSQVTVTVTKDGVAQELKAEKVLQAIGFAPNVEGY GLDKAGVALTDRKAIGVDDYMRTNVGHIYAIGDVNGLLQLAHVAEAQGVVAAETIAGA ETLTLGDHRMLPRATFCQPNVASFGLTEQQARNEGYDVVVAKFPFTANAKAHGVGDPS GFVKLVADAKHGELLGGHLVGHDVAELLPELTLAQRWDLTASELARNVHTHPTMSEAL QECFHGLVGHMINF" gene 554016..554309 /locus_tag="Rv0463" /db_xref="GeneID:886299" CDS 554016..554309 /locus_tag="Rv0463" /function="UNKNOWN" /note="Rv0463, (MTV038.07), len: 97 aa. Probable conserved transmembrane protein, highly similar to AAA63017.1|U15183 hypothetical protein from Mycobacterium leprae (101 aa), FASTA scores: opt: 364, E(): 4e-21, (57.9% identity in 95 aa overlap). TBparse score is 0.893." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214977.1" /db_xref="GI:15607604" /db_xref="GeneID:886299" /translation="MTRRASTDTPQIIMGAIGGVVTGYILWLAAISVGDGLTTVSQWS RVVLLLSVLVAVCGAAGGLRLRSRGKLAWSAFAFSLPIPPVVLTVAVLADIYL" gene complement(554313..554885) /locus_tag="Rv0464c" /db_xref="GeneID:886296" CDS complement(554313..554885) /locus_tag="Rv0464c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0464c, (MTV038.08c), len: 190 aa. Conserved hypothetical protein, highly similar to CAC31982.1|AL583925 conserved hypothetical protein from Mycobacterium leprae (188 aa). Also some similarity with Rv1531|AL022000|MTV045_5|D70820 hypothetical protein from Mycobacterium tuberculosis (188 aa), FASTA scores: E(): 9.6e-10, (30.9% identity in 175 aa overlap). TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214978.1" /db_xref="GI:15607605" /db_xref="GeneID:886296" /translation="MTGQNGQVARISPGKFRQLGPVNWLVAKLAARAVGAPQMHLFTT LGYRQYLFWTFAIYTGRLLHGRLPGVDTELVILRVAHLRSCEYELQHHRRMARRRGLD ANTQATIFAWPDVPDGDGPRKVLSARQQALLQATDELIKDRTITAGTWERLATHLDPR LLIEFCLLATQYDAIAATITALAIPPDNPQ" gene complement(554882..556306) /locus_tag="Rv0465c" /db_xref="GeneID:886320" CDS complement(554882..556306) /locus_tag="Rv0465c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0465c, (MTV038.09c), len: 474 aa. Probable transcriptional regulator, highly similar to AC44331.1|AL596102 putative DNA-binding protein from Streptomyces coelicolor (489 aa); and similar to several hypothetical proteins and others transcriptional regulators. Some similarity in N-terminal region (1-100 aa) with repressors e.g. P06153|RPC_BPPH1 IMMUNITY REPRESSOR PROTEIN (144 aa), FASTA scores: opt: 130, E(): 0.084,(27.0% identity in 100 aa overlap). Very similar to Rv1129c|Z95585|MTCY22G8.18c from Mycobacterium tuberculosis (486 aa), FASTA scores: opt: 1475, E(): 0, (47.4% identity in 468 aa overlap). Contains probable helix-turn-helix motif at aa 19-40 (1827, +5.41 SD). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214979.1" /db_xref="GI:15607606" /db_xref="GeneID:886320" /translation="MSKTYVGSRVRQLRNERGFSQAALAQMLEISPSYLNQIEHDVRP LTVAVLLRITEVFGVDATFFASQDDTRLVAELREVTLDRDLDIAIDPHEVAEMVSAHP GLACAVVNLHRRYRITTAQLAAATEERFSDGSGRGSITMPHEEVRDYFYQRQNYLHAL DTAAEDLTAQMRMHHGDLARELTRRLTEVHGVRINKRIDLGDTVLHRYDPATNTLEIS SHLSPGQQVFKMAAELAYLEFGDLIDAMVTDGKFTSAESRTLARLGLANYFAAATVLP YRQFHDVAENFRYDVERLSAFYSVSYETIAHRLSTLQRPSMRGVPFTFVRVDRAGNMS KRQSATGFHFSSSGGTCPLWNVYETFANPGKILVQIAQMPDGRNYLWVARTVELRAAR YGQPGKTFAIGLGCELRHAHRLVYSEGLDLSGDPNTAATPIGAGCRVCERDNCPQRAF PALGRALDLDEHRSTVSPYLVKQL" gene 556458..557252 /locus_tag="Rv0466" /db_xref="GeneID:886294" CDS 556458..557252 /locus_tag="Rv0466" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0466, (MTV038.10), len: 264 aa. Conserved hypothetical protein, equivalent to CAC31980.1|AL583925 conserved hypothetical protein from Mycobacterium leprae (264 aa). Similar to Rv2001|Z74025|MTCY39.17c HYPOTHETICAL 28.7 KDA PROTEIN from Mycobacterium tuberculosis (250 aa), FASTA scores: opt: 592, E(): 0, (38.0% identity in 263 aa overlap). Some similarity to several THIOESTERASES e.g. Q42561|ATACPTE17_1 ACYL-(ACYL CARRIER PROTEIN) THIOESTER from A. thaliana (362 aa), FASTA scores: E(): 0.0092, (24.4% identity in 197 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214980.1" /db_xref="GI:15607607" /db_xref="GeneID:886294" /translation="MSLDKKLMPVPDGHPDVFDREWPLRVGDIDRAGRLRLDAACRHI QDIGQDQLREMGFEETHPLWIVRRTMVDLIRPIEFGDMLRCRRWCSGTSNRWCEMRVR VDGRKGGLIESEAFWIHVNRETEMPARIADDFLAGLHRTTSVDRLRWKGYLKPGSRDD ASEIHEFPVRVTDIDLFDHMNNAVYWSVIEDYLASHAELLRGPLRVTIEHEAPVALGD KLEIISHVHPAGSTEIFGPGLVDRAVTTLTYVVGDEPKAVASLFNL" gene 557527..558813 /gene="icl" /locus_tag="Rv0467" /db_xref="GeneID:886291" CDS 557527..558813 /gene="icl" /locus_tag="Rv0467" /EC_number="4.1.3.1" /function="INVOLVED IN GLYOXYLATE BYPASS (AT THE FIRST STEP), AN ALTERNATIVE TO THE TRICARBOXYLIC ACID CYCLE (IN BACTERIA, PLANTS, AND FUNGI) [CATALYTIC ACTIVITY: ISOCITRATE = SUCCINATE + GLYOXYLATE]. INVOLVED IN THE PERSISTENCE IN THE HOST." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the first step in the glyoxalate cycle, which converts lipids to carbohydrates" /codon_start=1 /transl_table=11 /product="isocitrate lyase" /protein_id="YP_177728.1" /db_xref="GI:57116734" /db_xref="GeneID:886291" /translation="MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVE EHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLS GHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGG ALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADV ADVPTVVIARTDAEAATLITSDVDERDQPFITGERTREGFYRTKNGIEPCIARAKAYA PFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQK ELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATK HQREVGAGYFDRIATTVDPNSSTTALTGSTEEGQFH" gene 558895..559755 /gene="fadB2" /locus_tag="Rv0468" /db_xref="GeneID:886288" CDS 558895..559755 /gene="fadB2" /locus_tag="Rv0468" /EC_number="1.1.1.157" /function="BUTYRATE/BUTANOL-PRODUCING PATHWAY [CATALYTIC ACTIVITY: (S)-3-hydroxybutanoyl-CoA + NADP+ = 3-acetoacetyl-CoA + NADPH]" /experiment="experimental evidence, no additional details recorded" /note="converts (S)-3-hydroxybutanoyl-CoA to 3-acetoacetyl-CoA" /codon_start=1 /transl_table=11 /product="3-hydroxybutyryl-CoA dehydrogenase" /protein_id="NP_214982.1" /db_xref="GI:15607609" /db_xref="GeneID:886288" /translation="MSDAIQRVGVVGAGQMGSGIAEVSARAGVEVTVFEPAEALITAG RNRIVKSLERAVSAGKVTERERDRALGLLTFTTDLNDLSDRQLVIEAVVEDEAVKSEI FAELDRVVTDPDAVLASNTSSIPIMKVAAATKQPQRVLGLHFFNPVPVLPLVELVRTL VTDEAAAARTEEFASTVLGKQVVRCSDRSGFVVNALLVPYLLSAIRMVEAGFATVEDV DKAVVAGLSHPMGPLRLSDLVGLDTLKLIADKMFEEFKEPHYGPPPLLLRMVEAGQLG KKSGRGFYTY" gene 559888..560748 /gene="umaA" /locus_tag="Rv0469" /db_xref="GeneID:886286" CDS 559888..560748 /gene="umaA" /locus_tag="Rv0469" /EC_number="2.-.-.-" /function="Involved in mycolic acid modification or synthesis." /note="Rv0469, (MTV038.13), len: 286 aa. Possible umaA, mycolic acid synthase (EC 2.-.-.-) (see citations below), highly similar to CAC30854.1|AL583923 methyl mycolic acid synthase 1 from Mycobacterium leprae (286 aa); and CAC31976.1|AL583925 Mycolic acid synthase from Mycobacterium leprae (295 aa), FASTA scores: opt: 1402, E(): 0, (69.6% identity in 286 aa overlap). Also very similar to mycobacterial methyltransferases e.g. U77466|CmaD|MBU77466_1 (286 aa); MTCY20H10.26c|Z92772|MTY20H10_27 (296 aa); highly similar to CFA1_MYCTU|Q11195|U66108|MTU66108_1 cyclopropane-fatty-acyl-phospholipid synthase 1 (287 aa), FASTA scores: opt: 1360, E(): 0, (67.8% identity in 286 aa overlap) (see citation below); and very similar also to methoxy mycolic acid synthase 1 from Mycobacterium tuberculosis e.g. MTU66108_1 (286 aa). TBparse score is 0.944. Note that previously known as umaA1.; umaA1" /codon_start=1 /transl_table=11 /product="mycolic acid synthase" /protein_id="YP_177729.1" /db_xref="GI:57116735" /db_xref="GeneID:886286" /translation="MTELRPFYEESQSIYDVSDEFFSLFLDPTMAYTCAYFEREDMTL EEAQNAKFDLALDKLHLEPGMTLLDIGCGWGGGLQRAIENYDVNVIGITLSRNQFEYS KAKLAKIPTERSVQVRLQGWDEFTDKVDRIVSIGAFEAFKMERYAAFFERSYDILPDD GRMLLHTILTYTQKQMHEMGVKVTMSDVRFMKFIGEEIFPGGQLPAQEDIFKFAQAAD FSVEKVQLLQQHYARTLNIWAANLEANKDRAIALQSEEIYNKYMHYLTGCEHFFRKGI SNVGQFTLTK" gene complement(560848..561711) /gene="pcaA" /locus_tag="Rv0470c" /db_xref="GeneID:886284" CDS complement(560848..561711) /gene="pcaA" /locus_tag="Rv0470c" /EC_number="2.-.-.-" /function="INVOLVED IN THE MYCOLIC ACID MODIFICATION OR SYNTHESIS; ESSENTIAL FOR THE CYCLOPROPANATION FUNCTION. REQUIRED FOR CORDING AND MYCOLIC ACID CYCLOPROPANE RING SYNTHESIS IN THE CELL WALL." /experiment="experimental evidence, no additional details recorded" /note="Rv0470c, (MTV038.14), len: 287 aa. pcaA (previously known as umaA2), mycolic acid synthase (cyclopropane synthase) (EC 2.-.-.-) (see citations below), equivalent to CAC31976.1|AL583925 Mycolic acid synthase from Mycobacterium leprae (295 aa); and highly similar to S72886|B2168_F3_130|467038|AAA17222.1|U00018 hypothetical protein from Mycobacterium leprae (308 aa); Q11195|CFA1_MYCTU CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE 1 (CYCLOPROPANE MYCOLIC ACID SYNTHASE 1) (287 aa) (see Glickman et al., 2000); U27357|MTU27357_1 cyclopropane mycolic acid synthase from Mycobacterium tuberculosis (287 aa), FASTA scores: opt: 1415, E(): 0, (72.8% identity in 287 aa overlap); and related enzymes e.g. MTCY20H10.25c|Z92772|MTY20H10_26 (287 aa), FASTA scores: opt: 1387, E(): 0, (72.5% identity in 287 aa overlap). TBparse score is 0.893.; umaA2" /codon_start=1 /transl_table=11 /product="mycolic acid synthase PcaA" /protein_id="YP_177730.1" /db_xref="GI:57116736" /db_xref="GeneID:886284" /translation="MSVQLTPHFGNVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMT LQEAQIAKIDLALGKLNLEPGMTLLDIGCGWGATMRRAIEKYDVNVVGLTLSENQAGH VQKMFDQMDTPRSRRVLLEGWEKFDEPVDRIVSIGAFEHFGHQRYHHFFEVTHRTLPA DGKMLLHTIVRPTFKEGREKGLTLTHELVHFTKFILAEIFPGGWLPSIPTVHEYAEKV GFRVTAVQSLQLHYARTLDMWATALEANKDQAIAIQSQTVYDRYMKYLTGCAKLFRQG YTDVDQFTLEK" gene complement(561854..562294) /locus_tag="Rv0470A" /db_xref="GeneID:3205059" CDS complement(561854..562294) /locus_tag="Rv0470A" /function="UNKNOWN" /note="Rv0470A, len: 146 aa. Hypothetical unknown protein. GC plot suggests CDS for Cys-rich protein, could possibly be continuation of Rv0471c but no frameshift found to allow this. Sequence same in Mycobacterium bovis and Mycobacterium tuberculosis strain CDC1551. Weak hits to Cys-rich region (aa 258-314) of D63395|D63395_1 mRNA for NOTCH4 from Homo sapiens (1095 aa), FASTA scores: opt: 132, E(): 1.1, (39.35% identity in 61 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177622.1" /db_xref="GI:57116737" /db_xref="GeneID:3205059" /translation="MGAGGWEVVLASLPYGLLCTTVLMGKHIDKIGYDEPLGIRTLPV LLGETCARTVTLAMMVGFYLLIAVNVMLAAMPWPRCWSPGRCPGWRKCGPISCDGGPS SRHRRFRCGRCGMPRWPGCTCVRPVRCWLWAWRSVPGGAPGDFR" gene complement(562225..562713) /locus_tag="Rv0471c" /db_xref="GeneID:886280" CDS complement(562225..562713) /locus_tag="Rv0471c" /function="UNKNOWN" /note="Rv0471c, (MTV038.15c), len: 162 aa. Hypothetical unknown protein. TBparse score is 0.937." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214985.1" /db_xref="GI:15607612" /db_xref="GeneID:886280" /translation="MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLP MTLVSGLVAGLLAIGEPGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARA RYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAP RC" gene complement(562723..563427) /locus_tag="Rv0472c" /db_xref="GeneID:886308" CDS complement(562723..563427) /locus_tag="Rv0472c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0472c, (MTV038.16c), len: 234 aa. Probable regulatory protein, possibly tetR family, equivalent to CAC31974.1|AL583925 possible TetR-family transcriptional regulator from Mycobacterium leprae (233 aa). Also similar to CAC01492.1|AL391017 putative transcriptional regulatory protein from Streptomyces coelicolor (218 aa); and CAC01371.1|AL390975 putative tetR-family transcriptional regulator from Streptomyces coelicolor (228 aa). Also similar to AL0212|MTV012_65 from Mycobacterium tuberculosis (246 aa), FASTA scores: opt: 327, E(): 1.8e-15, (31.0% identity in 232 aa overlap); and Z95120|MTCY07D11.18c (228 aa), FASTA scores: opt: 190, E(): 4.4e-06, (23.1% identity in 186 aa overlap). Contains probable helix-turn-helix doimain at aa 45-66 (Score 1429, +4.05 SD). TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_214986.1" /db_xref="GI:15607613" /db_xref="GeneID:886308" /translation="MAERIPAVTVKTDGRKRRWHQHKVERRNELVDGTIEAIRRHGRF LSMDEIAAEIGVSKTVLYRYFVDKNDLTTAVMMRFTQTTLIPNMIAALSADMDGFELT REIIRVYVETVAAQPEPYRFVMANSSASKSKVIADSERIIARMLAVMLRRRMQEAGMD TGGVEPWAYLIVGGVQLATHSWMSDPRMSSDELIDYLTMLSWSALCGIVEAGGSLEKF REQPHPSPIVPAWGQV" gene 563564..564934 /locus_tag="Rv0473" /db_xref="GeneID:886279" CDS 563564..564934 /locus_tag="Rv0473" /function="UNKNOWN" /note="Rv0473, (MTV038.17), len: 456 aa. Possible conserved transmembrane protein, showing some similarity to hypothetical proteins e.g. NP_102800.1|14021975|BAB48586.1|AP002996 hypothetical protein from Mesorhizobium loti (431 aa); P39385|YJIN_ECOLI|YJIN|B4336 HYPOTHETICAL 48.2 kDa PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Escherichia coli strain K12 (426 aa), FASTA scores: opt: 396, E(): 9.8e-19, (31.8 % identity in 424 aa overlap); etc. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214987.1" /db_xref="GI:15607614" /db_xref="GeneID:886279" /translation="MVAHKAEVSGSPPPRLNLSTQPTVARRVRASFAESFAAADPEAD AARRMALRRMKVVAVGFLVGATGVFLACRWAQADGADHAWLGYLGAAAEAGMVGALAD WFAVTALFKHPLGIPIPHTAIIKRKKDQLGEGLGTFVRENFLSPPVVETKLRDAQIPS RLGKWLSEATHAQRVAAETATVLRVLVELLRDEDIQQVIDRMIVRRIAEPQWGPPAGR VLATLLAENRQEAFIQLLADRAFQWSLNAGVVIQRVVERDSPSWSPRFIDHLVGDRIH RELMEFTDKVRRNPDHELRRSATRFLFDFADDLQHDPATVARADAIKEELMARDEIAT AAAAAWKTLKRLVLEGVDDPSSALRTRITDAVIRIGESLRDDADLRDKVDSWTVRAAQ HLVSEYGVEITAIITETIERWDAEEASRRIELHVGRDLQFIRINGTVVGAMAGLAIYA IAQLLF" gene 565021..565443 /locus_tag="Rv0474" /db_xref="GeneID:886276" CDS 565021..565443 /locus_tag="Rv0474" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0474, (MTV038.18), len: 140 aa. Probable transcriptional regulator, highly similar to others e.g. CAC04034.1|AL391406 putative DNA-binding protein from Streptomyces coelicolor (141 aa); N-terminus of NP_104173.1|14023352|BAB49959.1|AP003000 transcriptional regulator from Mesorhizobium loti (219 aa); N-terminus of A83618|PA0225 probable transcription regulator from Pseudomonas aeruginosa (179 aa); SINR_BACSU|P06533 sinr protein from Bacillus subtilis (111 aa), FASTA scores: opt: 147, E(): 8.9e-06, (30.6% identity in 111 aa overlap). Also similar to other hypothetical proteins e.g. X66407|RRPHAS_1|ORF1 from Rhodococcus ruber (171 aa), FASTA scores: opt: 280, E(): 4.8e-12, (43.6% identity in 117 aa overlap). Also similar to Rv2745c from Mycobacterium tuberculosis. Contains probable helix-turn-helix domain at aa 35-56 (Score 1709, +5.01 SD). TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214988.1" /db_xref="GI:15607615" /db_xref="GeneID:886276" /translation="MSSEEKLAAKVSTKASDVASDIGSFIRSQRETAHVSMRQLAERS GVSNPYLSQVERGLRKPSADVLSQIAKALRVSAEVLYVRAGILEPSETSQVRDAIITD TAITERQKQILLDIYASFTHQNEATREECPSDPTPTDD" gene 565797..566396 /gene="hbhA" /locus_tag="Rv0475" /db_xref="GeneID:886272" CDS 565797..566396 /gene="hbhA" /locus_tag="Rv0475" /function="REQUIRED FOR EXTRAPULMONARY DISSEMINATION. MEDIATES ADHERENCE TO EPITHELIAL CELLS BY BINDING TO SULFATED GLYCOCONJUGATES PRESENT AT THE SURFACE OF THESE CELLS; BINDS HEPARIN, DEXTRAN SULFATE, FUCOIDAN AND CHONDROITIN SULFATE. PROMOTES HEMAGGLUTINATION OF ERYTHROCYTES OF CERTAIN HOST SPECIES. INDUCES MYCOBACTERIAL AGGREGATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0475, hbhA (MTCY20G9.01), len: 199 aa. hbhA, iron-regulated heparin-binding hemagglutinin (see citations below), equivalent to CAC31971.1|AL583925 possible hemagglutinin from Mycobacterium leprae (188 aa). Contains possible N-terminal signal sequence and K-A-rich region at C-terminus: SUBCELLULAR LOCATION: SURFACE ASSOCIATED." /codon_start=1 /transl_table=11 /product="iron-regulated heparin binding hemagglutinin hbhA (adhesin)" /protein_id="NP_214989.1" /db_xref="GI:15607616" /db_xref="GeneID:886272" /translation="MAENSNIDDIKAPLLAALGAADLALATVNELITNLRERAEETRT DTRSRVEESRARLTKLQEDLPEQLTELREKFTAEELRKAAEGYLEAATSRYNELVERG EAALERLRSQQSFEEVSARAEGYVDQAVELTQEALGTVASQTRAVGERAAKLVGIELP KKAAPAKKAAPAKKAAPAKKAAAKKAPAKKAAAKKVTQK" gene 566508..566771 /locus_tag="Rv0476" /db_xref="GeneID:886282" CDS 566508..566771 /locus_tag="Rv0476" /function="UNKNOWN" /note="Rv0476, (MTCY20G9.02), len: 87 aa. Possible conserved transmembrane protein, equivalent to CAC31970.1|AL583925 conserved membrane protein from Mycobacterium leprae (95 aa). Also highly similar to CAC04036.1|AL391406 putative membrane protein from Streptomyces coelicolor (113 aa). Contains PS00606 Beta-ketoacyl synthases active site." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_214990.1" /db_xref="GI:15607617" /db_xref="GeneID:886282" /translation="MLVLLVAVLVTAVYAFVHAALQRPDAYTAADKLTKPVWLVILGA AVALASILYPVLGVLGMAMSACASGVYLVDVRPKLLEIQGKSR" misc_feature 566676..566726 /locus_tag="Rv0476" /note="PS00606 Beta-ketoacyl synthases active site" gene 566776..567222 /locus_tag="Rv0477" /db_xref="GeneID:886273" CDS 566776..567222 /locus_tag="Rv0477" /function="UNKNOWN" /note="Rv0477, (MTCY20G9.03), len: 148 aa. Possible conserved secreted protein, equivalent to CAC31969.1|AL583925 hypothetical protein from Mycobacterium leprae (123 aa). Also similar to G83406|PA1914 conserved hypothetical protein from Pseudomonas aeruginosa (408 aa). Contains possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214991.1" /db_xref="GI:15607618" /db_xref="GeneID:886273" /translation="MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPR LVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQ FICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF" gene 567222..567896 /gene="deoC" /locus_tag="Rv0478" /db_xref="GeneID:888425" CDS 567222..567896 /gene="deoC" /locus_tag="Rv0478" /EC_number="4.1.2.4" /function="INVOLVED IN NUCLEOTIDE AND DEOXYRIBONUCLEOTIDE CATABOLISM [CATALYTIC ACTIVITY: 2-deoxy-D-ribose 5-phosphate = D-glyceraldehyde 3-phosphate + acetaldehyde]." /note="catalyzes the formation of D-glyceraldehyde 3-phosphate and acetaldehyde from 2-deoxy-D-ribose-5-phosphate" /codon_start=1 /transl_table=11 /product="deoxyribose-phosphate aldolase" /protein_id="NP_214992.1" /db_xref="GI:15607619" /db_xref="GeneID:888425" /translation="MLGQPTRAQLAALVDHTLLKPETTRADVAALVAEAAELGVYAVC VSPSMVPVAVQAGGVRVAAVTGFPSGKHVSSVKAHEAAAALASGASEIDMVIDIGAAL CGDIDAVRSDIEAVRAAAAGAVLKVIVESAVLLGQSNAHTLVDACRAAEDAGADFVKT STGCHPAGGATVRAVELMAETVGPRLGVKASGGIRTAADAVAMLNAGATRLGLSGTRA VLDGLS" gene complement(567921..568967) /locus_tag="Rv0479c" /db_xref="GeneID:885535" CDS complement(567921..568967) /locus_tag="Rv0479c" /function="UNKNOWN" /note="Rv0479c, (MTCY20G9.04c), len: 348 aa. Probable conserved membrane protein, equivalent to CAC31967.1|AL583925 possible secreted protein from Mycobacterium leprae (254 aa); and C-terminus highly similar to AAF74996.1|AF143402_1|AF143402 putative multicopper oxidase from Mycobacterium avium (149 aa). Contains hydrophobic domain in centre of protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214993.1" /db_xref="GI:15607620" /db_xref="GeneID:885535" /translation="MTNPQGPPNDPSPWARPGDQGPLARPPASSEASTGRLRPGEPAG HIQEPVSPPTQPEQQPQTEHLAASHAHTRRSGRQAAHQAWDPTGLLAAQEEEPAAVKT KRRARRDPLTVFLVLIIVFSLVLAGLIGGELYARHVANSKVAQAVACVVKDQATASFG VAPLLLWQVATRHFTNISVETAGNQIRDAKGMQIKLTIQNVRLKNTPNSRGTIGALDA TITWSSEGIKESVQNAIPILGAFVTSSVVTHPADGTVELKGLLNNITAKPIVAGKGLE LQIINFNTLGFSLPKETVQSTLNEFTSSLTKNYPLGIHADSVQVTSTGVVSRFSTRDA AIPTGIQNPCFSHI" gene complement(568964..569806) /locus_tag="Rv0480c" /db_xref="GeneID:887163" CDS complement(568964..569806) /locus_tag="Rv0480c" /EC_number="3.-.-.-" /function="UNKNOWN; HYDROLYTIC ENZYME PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0480c, (MTCY20G9.06c), len: 280 aa. Possible amidohydrolase (EC 3.-.-.-), highly similar to NP_302587.1|NC_002677|CAC31966.1|AL583925 putative hydrolase from Mycobacterium leprae (271 aa). Also similar to other hydrolases and hypothetical proteins e.g. NP_601985.1|NC_003450 Predicted amidohydrolase from Corynebacterium glutamicum (266 aa); NP_459623.1|NC_003197 putative hydrolase from Salmonella typhimurium LT2 (262 aa); AL096822|SCGD3_8|NP_627996.1|NC_003888 probable hydrolase from Streptomyces coelicolor (264 aa), FASTA scores: opt: 368, E(): 6.1e-15, (34.2% identity in 272 aa overlap); YAUB_SCHPO|Q10166 hypothetical 35.7 kDa protein c26a3.11 from S. pombe (322 aa), FASTA scores: opt: 338, E():1.4e-13, (30.3% identity in 277 aa overlap); etc. Start changed since first submission (-60 aa)." /codon_start=1 /transl_table=11 /product="amidohydrolase" /protein_id="NP_214994.2" /db_xref="GI:57116738" /db_xref="GeneID:887163" /translation="MRIALAQIRSGTDPAANLQLVGKYAGEAATAGAQLVVFPEATMC RLGVPLRQVAEPVDGPWANGVRRIATEAGITVIAGMFTPTGDGRVTNTLIAAGPGTPN QPDAHYHKIHLYDAFGFTESRTVAPGREPVVVVVDGVRVGLTVCYDIRFPALYTELAR RGAQLIAVCASWGSGPGKLEQWTLLARARALDSMSYVAAAGQADPGDARTGVGASSAA PTGVGGSLVASPLGEVVVSAGTQPQLLVADIDVDNVAAARDRIAVLRNQTDFVQIDKA QSRG" gene complement(569988..570512) /locus_tag="Rv0481c" /db_xref="GeneID:887161" CDS complement(569988..570512) /locus_tag="Rv0481c" /function="UNKNOWN" /note="Rv0481c, (MTCY20G9.07c), len: 174 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_214995.1" /db_xref="GI:15607622" /db_xref="GeneID:887161" /translation="MPRSFDMSADYEGSVEEVHRAFYEADYWKARLAETPVDVATLES IRVGGDSGDDGTIEVVTLQMVRSHNLPGLVTQLHRGDLSVRREETWGPVKEGIATASI AGSIVDAPVNLWGTAVLSPIPESGGSRMTLQVTIQVRIPFIGGKLERLIGTQLSQLVT IEQRFTTLWITNNV" gene 570539..571648 /gene="murB" /locus_tag="Rv0482" /db_xref="GeneID:887169" CDS 570539..571648 /gene="murB" /locus_tag="Rv0482" /EC_number="1.1.1.158" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS [CATALYTIC ACTIVITY: UDP-N-acetylmuramate + NADP+ = UDP-N-acetyl-3-O-(1-carboxyvinyl)-D-glucosamine + NADPH]." /note="catalyzes the reduction of UDP-N-acetylglucosamine enolpyruvate to form UDP-N-acetylmuramate in peptidoglycan biosynthesis" /codon_start=1 /transl_table=11 /product="UDP-N-acetylenolpyruvoylglucosamine reductase" /protein_id="NP_214996.1" /db_xref="GI:15607623" /db_xref="GeneID:887169" /translation="MKRSGVGSLFAGAHIAEAVPLAPLTTLRVGPIARRVITCTSAEQ VVAALRHLDSAAKTGADRPLVFAGGSNLVIAENLTDLTVVRLANSGITIDGNLVRAEA GAVFDDVVVRAIEQGLGGLECLSGIPGSAGATPVQNVGAYGAEVSDTITRVRLLDRCT GEVRWVSARDLRFGYRTSVLKHADGLAVPTVVLEVEFALDPSGRSAPLRYGELIAALN ATSGERADPQAVREAVLALRARKGMVLDPTDHDTWSVGSFFTNPVVTQDVYERLAGDA ATRKDGPVPHYPAPDGVKLAAGWLVERAGFGKGYPDAGAAPCRLSTKHALALTNRGGA TAEDVVTLARAVRDGVHDVFGITLKPEPVLIGCML" gene 571710..573065 /gene="lprQ" /locus_tag="Rv0483" /db_xref="GeneID:887167" CDS 571710..573065 /gene="lprQ" /locus_tag="Rv0483" /function="UNKNOWN" /note="Rv0483, (MTCY20G9.09), len: 451 aa. Probable lprQ, conserved lipoprotein, equivalent to CAC31963.1|AL583925|ML2446 possible lipoprotein from Mycobacterium leprae (441 aa); appears longer than ML2446, so start may be further downstream. Shows also similarity with MLCL383_24|O07707 HYPOTHETICAL 43.6 kDa PROTEIN from Mycobacterium leprae; and to Q49706|B1496_F2_81 (271 aa). Similar to others lipoproteins from other organisms. Also similar to several Mycobacterium tuberculosis hypothetical proteins e.g. Rv0116c, Rv0192, Rv1433, Rv2518c. Contains potential N-terminal signal sequence and appropriately positioned PS00013 prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LprQ" /protein_id="NP_214997.1" /db_xref="GI:15607624" /db_xref="GeneID:887167" /translation="MVIRVLFRPVSLIPVNNSSTPQSQGPISRRLALTALGFGVLAPN VLVACAGKVTKLAEKRPPPAPRLTFRPADSAADVVPIAPISVEVGDGWFQRVALTNSA GKVVAGAYSRDRTIYTITEPLGYDTTYTWSGSAVGHDGKAVPVAGKFTTVAPVKTINA GFQLADGQTVGIAAPVIIQFDSPISDKAAVERALTVTTDPPVEGGWAWLPDEAQGARV HWRPREYYPAGTTVDVDAKLYGLPFGDGAYGAQDMSLHFQIGRRQVVKAEVSSHRIQV VTDAGVIMDFPCSYGEADLARNVTRNGIHVVTEKYSDFYMSNPAAGYSHIHERWAVRI SNNGEFIHANPMSAGAQGNSNVTNGCINLSTENAEQYYRSAVYGDPVEVTGSSIQLSY ADGDIWDWAVDWDTWVSMSALPPPAAKPAATQIPVTAPVTPSDAPTPSGTPTTTNGPG G" gene complement(573046..573801) /locus_tag="Rv0484c" /db_xref="GeneID:887158" CDS complement(573046..573801) /locus_tag="Rv0484c" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0484c, (MTCY20G9.10c), len: 251 aa. Probable short-chain oxidoreductase (EC 1.-.-.-), highly similar to others e.g. T36118|4678912|CAB41284.1|AL049707 probable oxidoreductase from Streptomyces coelicolor (260 aa); YDFG_HAEIN|P45200|HI1430 hypothetical oxidoreductase (SDR family) from Haemophilus influenzae (252 aa), FASTA scores: opt: 496, E(): 7.9e-25, (35.0 % identity in 243 aa overlap); etc. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. STRONG SIMILARITY, TO BACTERIAL YDFG HOMOLOGS." /codon_start=1 /transl_table=11 /product="short-chain type oxidoreductase" /protein_id="NP_214998.1" /db_xref="GI:15607625" /db_xref="GeneID:887158" /translation="MTTIGTRKRVAVVTGASSGIGEATARTLAAQGFHVVAVARRADR ITALANQIGGTAIVADVTDDAAVEALARALSRVDVLVNNAGGAKGLQFVADADLEHWR WMWDTNVLGTLRVTRALLPKLIDSGDGLIVTVTSIAAIEVYDGGAGYTAAKHAQGALH RTLRGELLGKPVRLTEIAPGAVETEFSLVRFDGDQQRADAVYAGMTPLVAADVAEVIG FVATRPSHVNLDQIVIRPRDQASASRRATHPVR" gene 573984..575300 /locus_tag="Rv0485" /db_xref="GeneID:887170" CDS 573984..575300 /locus_tag="Rv0485" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0485, (MTCY20G9.11), len: 438 aa. Possible transcriptional repressor, member of the NAGC/XYLR repressor FAMILY; similar to several e.g. D87820_3|O32446|D82254 NAGC N-acetylglucosamine repressor from Vibrio cholerae (404 aa), FASTA scores: opt: 378, E(): 1.2e-17, (26.9% identity in 350 aa overlap); NAGC_ECOLI|P15301 N-acetylglucosamine repressor from Escherichia coli (406 aa), FASTA scores: opt: 305, E(): 1.8e-12, (21.8% identity in 357 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_214999.1" /db_xref="GI:15607626" /db_xref="GeneID:887170" /translation="MYSTNRTSQSLSRKPGRKHQLRSHRYVMPPSLHLSDSAAASVFR AVRLRGPVGRDVIAGSTSLSIATVNRQVIALLEAGLLRERADLAVSGAIGRPRVPVEV NHEPFVTLGIHIGARTTSIVATDLFGRTLDTVETPTPRNAAGAALTSLADSADRYLQR WRRRRALWVGVTLGGAVDSATGHVDHPRLGWRQAPVGPVLADALGLPVSVASHVDAMA GAELMLGMRRFAPSSSTSLYVYARETVGYALMIGGRVHCPASGPGTIAPLPVHSEMLG GTGQLESTVSDEAVLAAARRLRIIPGIASRTRTGGSATAITDLLRVARAGNQQAKELL AERARVLGGAVALLRDLLNPDEVVVGGQAFTEYPEAMEQVEAAFTAGSVLAPRDIRVT VFGNRVQEAGAGIVSLSGLYADPLGALRRSGALDARLQDTAPEALA" gene 575348..576790 /locus_tag="Rv0486" /db_xref="GeneID:887160" CDS 575348..576790 /locus_tag="Rv0486" /EC_number="2.4.1.-" /function="THOUGHT TO BE INVOLVED IN POLYPRENOLMANNOSE SYNTHESIS." /note="Rv0486, (MTCY20G9.12), len: 480 aa. Mannosyltransferase (EC 2.4.1.-) (see citations below), highly similar to P54138|Y486_MYCLE|ML2443 possible glycosyl transferase from Mycobacterium leprae (428 aa); and S72892|B2168_C2_201 probable hexosyltransferase (EC 2.4.1.-) from Mycobacterium leprae (409 aa), FASTA scores: opt: 2375, E(): 0, (86.4% identity in 413 aa overlap). Also highly similar to CAC04040.1|AL391406 putative transferase from Streptomyces coelicolor (496 aa); and similar to various transferases e.g. NP_437172.1|NC_003078 putative membrane-anchored glycosyltransferase protein from Sinorhizobium meliloti (416 aa); O26550|U67601_1 LPS BIOSYNTHESIS RELATED PROTEIN from Methanococcus jannaschii (411 aa), FASTA score: (25.3% identity in 387 aa overlap); etc. Also similar to CAC87824.1|AJ316594 putative sucrose-phosphate synthase from Nostoc punctiforme (422 aa). Contains PS00039 DEAD-box subfamily ATP-dependent helicases signature." /codon_start=1 /transl_table=11 /product="mannosyltransferase" /protein_id="NP_215000.1" /db_xref="GI:15607627" /db_xref="GeneID:887160" /translation="MAGVRHDDGSGLIAQRRPVRGEGATRSRGPSGPSNRNVSAADDP RRVALLAVHTSPLAQPGTGDAGGMNVYMLQSALHLARRGIEVEIFTRATASADPPVVR VAPGVLVRNVVAGPFEGLDKYDLPTQLCAFAAGVLRAEAVHEPGYYDIVHSHYWLSGQ VGWLARDRWAVPLVHTAHTLAAVKNAALADGDGPEPPLRTVGEQQVVDEADRLIVNTD DEARQVISLHGADPARIDVVHPGVDLDVFRPGDRRAARAALGLPVDERVVAFVGRIQP LKAPDIVLRAAAKLPGVRIIVAGGPSGSGLASPDGLVRLADELGISARVTFLPPQSHT DLATLFRAADLVAVPSYSESFGLVAVEAQACGTPVVAAAVGGLPVAVRDGITGTLVSG HEVGQWADAIDHLLRLCAGPRGRVMSRAAARHAATFSWENTTDALLASYRRAIGEYNA ERQRRGGEVISDLVAVGKPRHWTPRRGVGA" misc_feature 575963..575989 /locus_tag="Rv0486" /note="PS00039 DEAD-box subfamily ATP-dependent helicases signature" gene 576787..577338 /locus_tag="Rv0487" /db_xref="GeneID:887162" CDS 576787..577338 /locus_tag="Rv0487" /function="UNKNOWN" /note="Rv0487, (MTCY20G9.13), len: 183 aa. Conserved hypothetical protein, highly similar to P54139|Y487_MYCLE|U00018_38|ML2442 HYPOTHETICAL 20.8 KDA PROTEIN from Mycobacterium leprae (184 aa), FASTA scores: opt: 760, E(): 2.4 e-34, (73.0% identity in 159 aa overlap). Also highly similar to CAC04041.1|AL391406 conserved hypothetical protein from Streptomyces coelicolor (168 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215001.1" /db_xref="GI:15607628" /db_xref="GeneID:887162" /translation="MTSSLPTVQRVIQNALEVSQLKYSQHPRPGGAPPALIVELPGER KLKINTILSVGEHSVRVEAFVCRKPDENREDVYRFLLRRNRRLYGVAYTLDNVGDIYL VGQMALSAVDADEVDRVLGQVLEVVDSDFNALLELGFRSSIQREWQWRLSRGESLQNL QAFAHLRPTTMQSAQRDEKELGG" gene 577664..578269 /locus_tag="Rv0488" /db_xref="GeneID:887171" CDS 577664..578269 /locus_tag="Rv0488" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF LYSINE ACROSS THE MEMBRANE." /note="Rv0488, (MTCY20G9.14), len: 201 aa. Probable conserved integral membrane protein, LysE family possibly involved in transport of Lysine, similar to others and conserved hypothetical proteins e.g. AB93746.1|AL357613 putative membrane transport protein from Streptomyces coelicolor (204 aa); D83100|PA4365 probable transporter from Pseudomonas aeruginosa (200 aa); YGGA_ECOLI|P11667 hypothetical 21.7 kDa protein from Escherichia coli (197 aa), FASTA scores: opt: 382, E(): 1.1e-19, (39.1% identity in 179 aa overlap); CGLYSEG_2 C|P94633 LYSINE EXPORTER PROTEIN (236 aa), FASTA scores: E(): 2.3e-07, (33.3% identity in 219 aa overlap). Also similar to Rv1986 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215002.1" /db_xref="GI:15607629" /db_xref="GeneID:887171" /translation="MMTLKVAIGPQNAFVLRQGIRREYVLVIVALCGIADGALIAAGV GGFAALIHAHPNMTLVARFGGAAFLIGYALLAARNAWRPSGLVPSESGPAALIGVVQM CLVVTFLNPHVYLDTVVLIGALANEESDLRWFFGAGAWAASVVWFAVLGFSAGRLQPF FATPAAWRILDALVAVTMIGVAVVVLVTSPSVPTANVALII" gene 578426..579175 /gene="gpmA" /locus_tag="Rv0489" /db_xref="GeneID:887183" CDS 578426..579175 /gene="gpmA" /locus_tag="Rv0489" /EC_number="5.4.2.1" /function="INVOLVED IN GLYCOLYSIS [CATALYTIC ACTIVITY: 1,3-DIPHOSPHOGLYCERATE + 3-PHOSPHOGLYCERATE = 2,3-DIPHOSPHOGLYCERATE + 3-PHOSPHOGLYCERATE]." /note="2,3-bisphosphoglycerate-dependent; catalyzes the interconversion of 2-phosphoglycerate to 3-phosphoglycerate" /codon_start=1 /transl_table=11 /product="phosphoglyceromutase" /protein_id="YP_177731.1" /db_xref="GI:57116739" /db_xref="GeneID:887183" /translation="MANTGSLVLLRHGESDWNALNLFTGWVDVGLTDKGQAEAVRSGE LIAEHDLLPDVLYTSLLRRAITTAHLALDSADRLWIPVRRSWRLNERHYGALQGLDKA ETKARYGEEQFMAWRRSYDTPPPPIERGSQFSQDADPRYADIGGGPLTECLADVVARF LPYFTDVIVGDLRVGKTVLIVAHGNSLRALVKHLDQMSDDEIVGLNIPTGIPLRYDLD SAMRPLVRGGTYLDPEAAAAGAAAVAGQGRG" misc_feature 578450..578479 /gene="gpmA" /locus_tag="Rv0489" /note="PS00175 Phosphoglycerate mutase family phosphohistidine signature" misc_feature 578933..578956 /gene="gpmA" /locus_tag="Rv0489" /note="PS00017 ATP/GTP-binding site motif A" gene 579349..580581 /gene="senX3" /locus_tag="Rv0490" /db_xref="GeneID:887185" CDS 579349..580581 /gene="senX3" /locus_tag="Rv0490" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM. PROBABLY FORMS PART OF A TWO-COMPONENT REGULATORY SYSTEM SENX3/REGX3; PHOSPHORYLATES REGX3." /note="Rv0490, (MTCY20G9.16), len: 410 aa. Putative senX3, two-component sensor histidine kinase (EC 2.7.3.-), transmembrane protein (see citations below), equivalent to O07129|SEX3_MYCBO SENSOR-LIKE HISTIDINE KINASE SENX3 from Mycobacterium bovis BCG (410 aa), FASTA scores: E(): 0, (99.5% identity in 410 aa overlap); and highly similar to P54883|SEX3_MYCLE|SENX3 SENSOR-LIKE HISTIDINE KINASE from Mycobacterium leprae (443 aa), FASTA score: (83.8% identity in 408 aa overlap). Also highly similar, except in N-terminus, to CAC31957.1|AL583925 probable two-component system sensor histidine kinase from Mycobacterium leprae (441 aa). Also highly similar to sensor kinase proteins from other organisms e.g. CAB77323.1|AL160331 putative sensor kinase protein from Streptomyces coelicolor (426 aa)." /codon_start=1 /transl_table=11 /product="putative two component sensor histidine kinase SENX3" /protein_id="NP_215004.1" /db_xref="GI:15607631" /db_xref="GeneID:887185" /translation="MTVFSALLLAGVLSALALAVGGAVGMRLTSRVVEQRQRVATEWS GITVSQMLQCIVTLMPLGAAVVDTHRDVVYLNERAKELGLVRDRQLDDQAWRAARQAL GGEDVEFDLSPRKRSATGRSGLSVHGHARLLSEEDRRFAVVFVHDQSDYARMEAARRD FVANVSHELKTPVGAMALLAEALLASADDSETVRRFAEKVLIEANRLGDMVAELIELS RLQGAERLPNMTDVDVDTIVSEAISRHKVAADNADIEVRTDAPSNLRVLGDQTLLVTA LANLVSNAIAYSPRGSLVSISRRRRGANIEIAVTDRGIGIAPEDQERVFERFFRGDKA RSRATGGSGLGLAIVKHVAANHDGTIRVWSKPGTGSTFTLALPALIEAYHDDERPEQA REPELRSNRSQREEELSR" repeat_region 580578..580654 /note="77 bp Mycobacterial Interspersed Repetitive Unit, Class I. See citation below." repeat_region 580655..580731 /note="77 bp Mycobacterial Interspersed Repetitive Unit, Class I. See citation below." repeat_region 580732..580808 /note="77 bp Mycobacterial Interspersed Repetitive Unit, Class I. See citation below." gene 580809..581492 /gene="regX3" /locus_tag="Rv0491" /db_xref="GeneID:887195" CDS 580809..581492 /gene="regX3" /locus_tag="Rv0491" /function="TRANSCRIPTIONAL REGULATORY PROTEIN PART OF THE TWO COMPONENT REGULATORY SYSTEM REGX3/SENX3." /note="Rv0491, (MTCY20G9.17), len: 227 aa. regX3, response regulator protein (sensory transduction protein) (see citations below), equivalent to O07130|RGX3_MYCBO|REGX3 SENSORY TRANSDUCTION PROTEIN from Mycobacterium bovis BCG (227 aa); AAG09797.1|AF258346_2|AF258346|REGX3 response regulator from Mycobacterium smegmatis (228 aa); equivalent to P54884|RGX3_MYCLE|REGX3 SENSORY TRANSDUCTION PROTEIN from Mycobacterium leprae (198 aa), FASTA scores : E(): 0, (95.4% identity in 197 aa overlap). Also highly similar to other response regulators e.g. AAG43239.1|AF123314_2 |AF123314 putative response regulator from Corynebacterium glutamicum (232 aa)." /codon_start=1 /transl_table=11 /product="two component sensory transduction protein RegX3" /protein_id="NP_215005.1" /db_xref="GI:15607632" /db_xref="GeneID:887195" /translation="MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGPAALAEFDRA GADIVLLDLMLPGMSGTDVCKQLRARSSVPVIMVTARDSEIDKVVGLELGADDYVTKP YSARELIARIRAVLRRGGDDDSEMSDGVLESGPVRMDVERHVVSVNGDTITLPLKEFD LLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIEADPANPVHLVTV RGLGYKLEG" gene complement(581489..583378) /locus_tag="Rv0492c" /db_xref="GeneID:887199" CDS complement(581489..583378) /locus_tag="Rv0492c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0492c, (MT0511/MT0512, MTCY20G9.18c), len: 629 aa. Probable oxidoreductase GMC type (EC 1.-.-.-), similar to others except in N-terminus e.g. P55582|AE000087_5|Y4NJ_RHISN HYPOTHETICAL GMC-TYPE OXIDOREDUCTASE from Rhizobium sp. (505 aa), FASTA scores: opt: 873, E():0, (34.3% identity in 502 aa overlap); YTH2_RHOER|P46371 HYPOTHETICAL 53.0 kDa GMC-TYPE OXIDOREDUCTASE from Rhodococcus erythropolis (493 aa), FASTA score: (25.7% identity in 521 aa overlap); YTH2_RHOSO|P46371 hypothetical 53.0 kDa gmc-type oxidoreductase from Rhodococcus erythropolis (493 aa), FASTA score: (25.7% identity in 521 aa overlap); NP_085596.1|NC_002679 probable oxidoreductase from Mesorhizobium loti (507 aa); NP_285451.1|NC_001264 GMC oxidoreductase from Deinococcus radiodurans (722 aa); NP_249055.1|NC_002516 probable oxidoreductase from Pseudomonas aeruginosa (531 aa); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature, and PS00624 GMC oxidoreductases signature 2. BELONGS TO THE GMC OXIDOREDUCTASES FAMILY. COFACTOR: FAD (BY SIMILARITY). Note that start changed since first submission (previously 684 aa)." /codon_start=1 /transl_table=11 /product="oxidoreductase GMC-type" /protein_id="NP_215006.2" /db_xref="GI:57116740" /db_xref="GeneID:887199" /translation="MSRLADRAKSYPLASFGAALLPPELGGPLPAQFVQRVDRYVTRL PATSRFAVRAGLASLAAASYLTTGRSLPRLHPDERARVLHRIAALSPEVAAAVEGLKA IVLLANGADTYAHELLARAQEHDAARPDAELTVILSADSPSVTRADAVVVGSGAGGAM VARTLARAGLDVVVLEEGRRWTVEEFRSTHPVDRYAGLYRGAGATVALGRPAVVLPMG RAVGGTTVVNSGTCFRPSLAVQRRWRDEFGLGLADPDQLGRRLDDAEQTLRVAPVPLE IMGRNGRLLLQAAKSLGWRAAPIPRNAPGCRGCCQCAIGCPSNAKFGVHLNALPQACA AGARIISWARVERILHRAGRAYGVRARRPDGTTLDVLADAVVVAAGATETPGLLRRSG LGGHPRLGHNLALHPATMLAGLFDDDVFAWRGVLQSAAVHEFHESDGVLIEATSTPPG MGSMVFPGYGAELLRWLDRAPQIATFGAMVADRGVGTVRSVRGETVVRYDIAPGEIAK LRVALQAIGRLLFAAGAVEVLTGIPGAPPMRSLPELQDVLRRANPRSLHLAAFHPTGT AAAGADEQLCPVDATGRLRGVEGVWVADASILPSCPEVNPQLSIMAMALAVADQTVAK VVGVR" misc_feature complement(582197..582241) /locus_tag="Rv0492c" /note="PS00624 GMC oxidoreductases signature 2" misc_feature complement(582428..582463) /locus_tag="Rv0492c" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" gene complement(583375..583704) /locus_tag="Rv0492A" /db_xref="GeneID:3205069" CDS complement(583375..583704) /locus_tag="Rv0492A" /function="UNKNOWN" /note="Rv0492A, len: 109 aa. Hypothetical unknown protein. GC plot suggests CDS." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177623.1" /db_xref="GI:57116741" /db_xref="GeneID:3205069" /translation="MSFLLDPPLLFVCGVLIERRLPVDRRDAAEAAALGVFFGASFGL YHNVPGLGMLWRPFRAQNGRDFMWNSGVFSVDVARAEWPLHAMAAAIFATYPFFIKLG RRLGRRI" gene complement(583701..584690) /locus_tag="Rv0493c" /db_xref="GeneID:887200" CDS complement(583701..584690) /locus_tag="Rv0493c" /function="UNKNOWN" /note="Rv0493c, (MTCY20G9.19), len: 329 aa. Conserved hypothetical protein, showing some similarity to U00018_33|B2168_F2_93 from Mycobacterium leprae (167 aa), FASTA scores: opt: 166, E(): 0.00077, (35.9% identity in 131 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215007.1" /db_xref="GI:15607634" /db_xref="GeneID:887200" /translation="MGESTTQPAGGAAVDDETRSAALPRWRGAAGRLEVWYATLSDPL TRTGLWVHCETVAPTTGGPYAHGWVTWFPPDAPPGTERFGPQPAQPAAGPAWFDIAGV RMAPAELTGRTRSLAWELSWKDTAAPLWTFPRVAWERELLPGAQVVIAPTAVFAGSLA VGETTHRVDSWRGSVAHIYGHGNAKRWGWIHADLGDGDVLEVVTAVSHKPGLRRLAPL AFVRFRIDGKDWPASPLPSLRMRTTLGVRHWQLEGRIGGREALIRVDQPPERCVSLGY TDPDGAKAVCTNTEQADIHIELGGRHWSVLGTGHAEVGLRGTAAPAIKEGTPA" gene 584695..585423 /locus_tag="Rv0494" /db_xref="GeneID:887166" CDS 584695..585423 /locus_tag="Rv0494" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0494, (MTCY20G9.20), len: 242 aa. Probable transcriptional regulator, GntR family, with C-terminal part highly similar to S72893|B2168_C2_205 hypothetical protein from Mycobacterium leprae (105 aa). Also similar to other transcription regulators e.g. PDHR_ECOLI|P06957 pyruvate dehydrogenase complex repressor PDHR or GENA from Escherichia coli (254 aa), FASTA scores: opt: 284, E(): 1.2e-11, (32.6% identity in 224 aa overlap); etc. Contains PS00043 Bacterial regulatory proteins, gntR family signature, and probable helix-turn helix motif from aa 50-71 (Score 1229, +3.37 SD)." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="NP_215008.2" /db_xref="GI:57116742" /db_xref="GeneID:887166" /translation="MVEPMNQSSVFQPPDRQRVDERIATTIADAILDGVFPPGSTLPP ERDLAERLGVNRTSLRQGLARLQQMGLIEVRHGSGSVVRDPEGLTHPAVVEALVRKLG PDFLVELLEIRAALGPLIGRLAAARSTPEDAEALCAALEVVQQADTAAARQAADLAYF RVLIHSTRNRALGLLYRWVEHAFGGREHALTGAYDDADPVLTDLRAINGAVLAGDPAA AAATVEAYLNASALRMVKSYRDRA" misc_feature 584827..584892 /locus_tag="Rv0494" /note="PS00043 Bacterial regulatory proteins, gntR family signature" gene complement(585424..586314) /locus_tag="Rv0495c" /db_xref="GeneID:887198" CDS complement(585424..586314) /locus_tag="Rv0495c" /function="UNKNOWN" /note="Rv0495c, (MTCY20G9.21c), len: 296 aa. Conserved hypothetical protein, highly similar to S72915|B2168_F1_37 hypothetical protein from Mycobacterium leprae (323 aa), FASTA scores: opt: 1615, E(): 0, (82.7% identity in 271 aa overlap); and P54579|Y495_MYCLE|ML243|13094009|CAC31952.1|AL583925 conserved hypothetical protein from Mycobacterium leprae (277 aa). Also highly similar to Q9X8H2|Y716_STRCO|SCE7.16 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (271 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215009.1" /db_xref="GI:15607636" /db_xref="GeneID:887198" /translation="MWRPAQGARWHVPAVLGYGGIPRRASWSNVESVANSRRRPVHPG QEVELDFAREWVEFYDPDNPEHLIAADLTWLLSRWACVFGTPACQGTVAGRPNDGCCS HGAFLSDDDDRTRLADAVHKLTDDDWQFRAKGLRRKGYLELDEHDGQPQHRTRKHKGA CIFLNRPGFAGGAGCALHSKALKLGVPPLTMKPDVCWQLPIRRSQEWVTRPDGTEILK TTLTEYDRRGWGSGGADLHWYCTGDPAAHVGTKQVWQSLADELTELLGEKAYGELAAM CKRRSQLGLIAVHPATRAAQ" gene 586394..587380 /locus_tag="Rv0496" /db_xref="GeneID:887234" CDS 586394..587380 /locus_tag="Rv0496" /function="UNKNOWN" /note="Rv0496, (MTCY20G9.22), len: 328 aa. Conserved hypothetical protein, highly similar to S72894|467046|AAA17230.1|U00018 exopolyphosphatase (EC 3.6.1.11) ppx from Mycobacterium leprae (406 aa), FASTA scores: opt: 1902, E(): 0, (86.6% identity in 343 aa overlap); and P54882|Y496_MYCLE|ML2434|13094008|CAC31951.1|AL583925 HYPOTHETICAL 36.2 KDA PROTEIN from Mycobacterium leprae (339 aa). Also highly similar to hypothetical proteins and exopolyphosphatases e.g. Q9X8H1|Y715_STRCO|SCE7.15c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (309 aa). C-terminal region similar to CGU31224_1|Q46054 protein similar to ppx gene product of Mycobacterium leprae from Cornybacterium glutamicum (140 aa), FASTA scores: opt: 615, E(): 2.7e-33, (70.9% identity in 134 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215010.1" /db_xref="GI:15607637" /db_xref="GeneID:887234" /translation="MVDAHRGGHPTPMSSTKATLRLAEATDSSGKITKRGADKLISTI DEFAKIAISSGCAELMAFATSAVRDAENSEDVLSRVRKETGVELQALRGEDESRLTFL AVRRWYGWSAGRILNLDIGGGSLEVSSGVDEEPEIALSLPLGAGRLTREWLPDDPPGR RRVAMLRDWLDAELAEPSVTVLEAGSPDLAVATSKTFRSLARLTGAAPSMAGPRVKRT LTANGLRQLIAFISRMTAVDRAELEGVSADRAPQIVAGALVAEASMRALSIEAVEICP WALREGLILRKLDSEADGTALIESSSVHTSVRAVGGQPADRNAANRSRGSKP" gene 587377..588309 /locus_tag="Rv0497" /db_xref="GeneID:887240" CDS 587377..588309 /locus_tag="Rv0497" /function="UNKNOWN" /note="Rv0497, (MTCY20G9.23), len: 310 aa. Probable conserved transmembrane protein, equivalent (but shorter in C-terminus) to P54580|Y497_MYCLE|ML2433 HYPOTHETICAL 37.9 KDA PROTEIN from Mycobacterium leprae (355 aa). N-terminus highly similar to S72922|B2168_C1_166|467074 hypothetical protein from Mycobacterium leprae (118 aa), FASTA scores: opt: 350, E(): 1.4e-12, (57.9% identity in 114 aa overlap); and hydrophobic C-terminus, highly similar to S72895|B2168_C2_209|467047 hypothetical protein from Mycobacterium leprae (241 aa), FASTA scores: opt: 473, E(): 8e-19, (53.9% identity in 241 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215011.1" /db_xref="GI:15607638" /db_xref="GeneID:887240" /translation="MTGPHPETESSGNRQISVAELLARQGVTGAPARRRRRRRGDSDA ITVAELTGEIPIIRDDHHHAGPDAHASQSPAANGRVQVGEAAPQSPAEPVAEQVAEEP TRTVYWSQPEPRWPKSPPQDRRESGPELSEYPRPLRHTHSDRAPAGPPSGAEHMSPDP VEHYPDLWVDVLDTEVGEAEAETEVREAQPGRGERHAAAAAAGTDVEGDGAAEARVAR RALDVVPTLWRGALVVLQSILAVAFGAGLFIAFDQLWRWNSIVALVLSVMVILGLVVS VRAVRKTEDIASTLIAVAVGALITLGPLALLQSG" gene 588325..589167 /locus_tag="Rv0498" /db_xref="GeneID:887238" CDS 588325..589167 /locus_tag="Rv0498" /function="UNKNOWN" /note="Rv0498, (MTCY20G9.24), len: 280 aa. Conserved hypothetical protein, highly similar to P54581|Y498_MYCLE|ML2432 HYPOTHETICAL 30.5 KDA PROTEIN from Mycobacterium leprae (280 aa); and S72896|B2168_C2_210 hypothetical protein from Mycobacterium leprae (244 aa), FASTA scores: opt: 1486, E():0, (89.3% identity in 244 aa overlap). Also similar to Q9X8H0|Y714_STRCO|SCE7.14c HYPOTHETICAL PROTEIN from Streptomyces coelicolor." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215012.1" /db_xref="GI:15607639" /db_xref="GeneID:887238" /translation="MRPAIKVGLSTASVYPLRAEAAFEYADRLGYDGVELMVWGESVS QDIDAVRKLSRRYRVPVLSVHAPCLLISQRVWGANPILKLDRSVRAAEQLGAQTVVVH PPFRWQRRYAEGFSDQVAALEAASTVMVAVENMFPFRADRFFGAGQSRERMRKRGGGP GPAISAFAPSYDPLDGNHAHYTLDLSHTATAGTDSLDMARRMGPGLVHLHLCDGSGLP ADEHLVPGRGTQPTAEVCQMLAGSGFVGHVVLEVSTSSARSANERESMLAESLQFART HLLR" gene 589183..590058 /locus_tag="Rv0499" /db_xref="GeneID:887243" CDS 589183..590058 /locus_tag="Rv0499" /function="UNKNOWN" /note="Rv0499, (MTCY20G9.25), len: 291 aa. Conserved hypothetical protein, showing some similarity to AL031184|SC2A11_16|T34762 hypothetical protein from Streptomyces coelicolor (340 aa), FASTA scores: opt: 240, E(): 1.8e-07, (28.9% identity in 270 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215013.1" /db_xref="GI:15607640" /db_xref="GeneID:887243" /translation="MNALFTTAMALRPLDSDPGNPACRVFEGELNEHWTIGPKVHGGA MVALCANAARTAYGAAGQQPMRQPVAVSASFLWAPDPGTMRLVTSIRKRGRRISVADV ELTQGGRTAVHAVVTLGEPEHFLPGVDGSGGASGTAPLLSANPVVELMAPEPPEGVVP IGPGHQLAGLVHLGEGCDVRPVLSTLRSATDGRPPVIQLWARPRGVAPDALFALLCGD LSAPVTFAVDRTGWAPTVALTAYLRALPADGWLRVLCTCVEIGQDWFDEDHIVVDRLG RIVVQTRQLAMVPAQ" gene 590083..590970 /gene="proC" /locus_tag="Rv0500" /db_xref="GeneID:887256" CDS 590083..590970 /gene="proC" /locus_tag="Rv0500" /EC_number="1.5.1.2" /function="INVOLVED AT THE TERMINAL (THIRD) STEP IN PROLINE BIOSYNTHESIS [CATALYTIC ACTIVITY: L-proline + NAD(P)+ = 1-pyrroline-5-carboxylate + NAD(P)H]." /note="catalyzes the formation of L-proline from pyrroline-5-carboxylate" /codon_start=1 /transl_table=11 /product="pyrroline-5-carboxylate reductase" /protein_id="NP_215014.1" /db_xref="GI:15607641" /db_xref="GeneID:887256" /translation="MLFGMARIAIIGGGSIGEALLSGLLRAGRQVKDLVVAERMPDRA NYLAQTYSVLVTSAADAVENATFVVVAVKPADVEPVIADLANATAAAENDSAEQVFVT VVAGITIAYFESKLPAGTPVVRAMPNAAALVGAGVTALAKGRFVTPQQLEEVSALFDA VGGVLTVPESQLDAVTAVSGSGPAYFFLLVEALVDAGVGVGLSRQVATDLAAQTMAGS AAMLLERMEQDQGGANGELMGLRVDLTASRLRAAVTSPGGTTAAALRELERGGFRMAV DAAVQAAKSRSEQLRITPE" gene 591111..591347 /locus_tag="Rv0500A" /db_xref="GeneID:3205035" CDS 591111..591347 /locus_tag="Rv0500A" /function="UNKNOWN" /note="Rv0500A, len: 78 aa. Conserved hypothetical protein, similar to proteins from Mycobacterium leprae and Streptomyces coelicolor e.g. U00018_25 from Mycobacterium leprae cosmid B2168 (86 aa), FASTA scores: opt: 428, E(): 1.3e-27, (82.6% identity in 86 aa overlap); AL079345|SCE68_26 from Streptomyces coelicolor cosmid E6 (70 aa), FASTA scores: opt: 252, E(): 1.2 e-13, (72.2 identity in 54 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177624.1" /db_xref="GI:57116743" /db_xref="GeneID:3205035" /translation="MTSTNGPSARDTGFVEGQQAKTQLLTVAEVAALMRVSKMTVYRL VHNGELPAVRVGRSFRVHAKAVHDMLETSYFDAG" gene 591475..591576 /locus_tag="Rv0500B" /db_xref="GeneID:3205036" CDS 591475..591576 /locus_tag="Rv0500B" /function="UNKNOWN" /note="Rv0500B, len: 33 aa. Conserved hypothetical protein. Basic protein 18 of the 33 aa are Arg or Lys, with strong similarity to AL079345|SCE68_25 protein from Streptomyces coelicolor cosmid E6 (32 aa), FASTA scores: opt: 176, E(): 1e-06, (93.1% identity in 29 aa overlap). Same gene arrangement in both actinomycetes." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177625.1" /db_xref="GI:57116744" /db_xref="GeneID:3205036" /translation="MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK" gene 591654..592784 /gene="galE2" /locus_tag="Rv0501" /db_xref="GeneID:887228" CDS 591654..592784 /gene="galE2" /locus_tag="Rv0501" /EC_number="5.1.3.2" /function="INVOLVED IN GALACTOSE METABOLISM [CATALYTIC ACTIVITY: UDP-GLUCOSE = UDP-GALACTOSE]." /note="Rv0501, (MTCY20G9.28), len: 376 aa. Possible galE2, UDP-glucose 4-epimerase (EC 5.1.3.2), highly similar (except in N-terminus) to CAC31944.1|AL583925 possible glucose epimerase/dehydratase from Mycobacterium leprae (364 aa). N-terminus highly similar to S72923|B2168_C1_174|467075|AAA17259.1|U00018 hypothetical protein from Mycobacterium leprae (180 aa), FASTA scores: opt: 934, E(): 0, (89.6% identity in 164 aa overlap); and C-terminus highly similar to S72898|467050|AAA17234.1|U00018 hypothetical protein from Mycobacterium leprae (168 aa), FASTA scores: opt: 928, E(): 0, (82.7% identity in 168 aa overlap). Also highly similar to T36274|5123671|CAB45360.1|AL079345 probable epimerase from Streptomyces coelicolor (353 aa); and similar in part to other epimerases e.g. GALE_ECOLI|P09147 UDP-glucose 4-epimerase from Escherichia coli (338 aa), FASTA scores: opt: 241, E(): 6.7e-09, (28.2% identity in 294 aa overlap); etc. BELONGS TO THE SUGAR EPIMERASE FAMILY. COFACTOR: NAD. Note that previously known as galE1.; galE1" /codon_start=1 /transl_table=11 /product="UDP-glucose 4-epimerase" /protein_id="NP_215050.2" /db_xref="GI:57116745" /db_xref="GeneID:887228" /translation="MSSSNGRGGAGGVGGSSEHPQYPKVVLVTGACRFLGGYLTARLA QNPLINRVIAVDAIAPSKDMLRRMGRAEFVRADIRNPFIAKVIRNGEVDTVVHAAAAS YAPRSGGSAALKELNVMGAMQLFAACQKAPSVRRVVLKSTSEVYGSSPHDPVMFTEDS SSRRPFSQGFPKDSLDIEGYVRALGRRRPDIAVTILRLANMIGPAMDTTLSRYLAGPL VPTIFGRDARLQLLHEQDALGALERAAMAGKAGTFNIGADGILMLSQAIRRAGRIPVP VPGFGVWALDSLRRANHYTELNREQFAYLSYGRVMDTTRMRVELGYQPKWTTVEAFDD YFRGRGLTPIIDPHRVRSWEGRAVGLAQRWGSRNPIPWSGLR" gene 592791..593867 /locus_tag="Rv0502" /db_xref="GeneID:887260" CDS 592791..593867 /locus_tag="Rv0502" /function="UNKNOWN" /note="Rv0502, (MTCY20G9.29), len: 358 aa. Conserved hypothetical protein, equivalent to P54878|Y502_MYCLE|ML2427 HYPOTHETICAL 40.5 KDA PROTEIN from Mycobacterium leprae (367 aa), FASTA scores: opt: 2042, E(): 0, (84.1% identity in 365 aa overlap). Also similar to T36273|SCE68.23c hypothetical protein from Streptomyces coelicolor (355 aa). C-terminal similar to AL021529|SC10A5_4|T34572 hypothetical protein from Streptomyces coelicolor (295 aa), FASTA score: (57.8% identity in 263 aa overlap); and to hypothetical proteins from Mycobacterium tuberculosis Rv1920|G70808 (287 aa); and Rv1428c|G70914 (275 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215016.1" /db_xref="GI:15607643" /db_xref="GeneID:887260" /translation="MGNVAGETRANVIPLHTNRSRVAARRRAGQRAESRQHPSLLSDP NDRASAEQIAAVVREIDEHRRAAGATTSSTEATPNDLAQLVAAVAGFLRQRLTGDYSV DEFGFDPHFNSAIVRPLLRFFFKSWFRVEVSGVENIPRDGAALVVANHAGVLPFDGLM LSVAVHDEHPAHRDLRLLAADMVFDLPVIGEAARKAGHTMACTTDAHRLLASGELTAV FPEGYKGLGKRFEDRYRLQRFGRGGFVSAALRTKAPIVPCSIIGSEEIYPMLTDVKLL ARLFGLPYFPITPLFPLAGPVGLVPLPSKWRIAFGEPICTADYASTDADDPMVTFELT DQVRETIQQTLYRLLAGRRNIFFG" gene complement(593871..594779) /gene="cmaA2" /locus_tag="Rv0503c" /db_xref="GeneID:887264" CDS complement(593871..594779) /gene="cmaA2" /locus_tag="Rv0503c" /EC_number="2.1.1.79" /function="ESSENTIAL FOR THE CYCLOPROPANATION FUNCTION. TRANSFERS A METHYLENE GROUP FROM S-ADENOSYL-L-METHIONINE TO THE CIS DOUBLE BOND OF AN UNSATURATED FATTY ACID CHAIN RESULTING IN THE REPLACEMENT OF THE DOUBLE BOND WITH A METHYLENE BRIDGE. MYCOLIC ACIDS, WHICH REPRESENT THE MAJOR CONSTITUENT OF MYCOBACTERIAL CELL WALL COMPLEX, ACT AS SUBSTRATES [CATALYTIC ACTIVITY: S-adenosyl-L-methionine + phospholipid olefinic fatty acid = S-adenosyl-L-homocysteine + phospholipid cyclopropane fatty acid]." /experiment="experimental evidence, no additional details recorded" /note="Rv0503c, (MTCY20G9.30c), len: 302 aa. cmaA2 (alternate gene name: cma2), cyclopropane-fatty-acyl-phospholipid synthase 2 (mycolic acid trans-cyclopropane synthetase) (EC 2.1.1.79) (see citations below). Note that this protein has 302 aa and not 322 aa: we have chosen a different initiation codon on the basis of homology). Equivalent to S72886|B2168_F3_130 hypothetical protein from Mycobacterium leprae (308 aa), FASTA score: (78.9% identity in 303 aa overlap); and highly similar to other proteins from Mycobacterium leprae. Also similar to other proteins from Mycobacterium tuberculosis and Mycobacterium bovis BCG e.g. MTV038_14|UMAA2|Rv0470c|MTV038.14 PUTATIVE MYCOLIC ACID SYNTHESIS/MODIFICATION PROTEIN (287 aa) (57.2% identity in 297 aa overlap).; cma2" /codon_start=1 /transl_table=11 /product="cyclopropane-fatty-acyl-phospholipid synthase 2" /protein_id="NP_215017.1" /db_xref="GI:15607644" /db_xref="GeneID:887264" /translation="MTSQGDTTSGTQLKPPVEAVRSHYDKSNEFFKLWLDPSMTYSCA YFERPDMTLEEAQYAKRKLALDKLNLEPGMTLLDIGCGWGSTMRHAVAEYDVNVIGLT LSENQYAHDKAMFDEVDSPRRKEVRIQGWEEFDEPVDRIVSLGAFEHFADGAGDAGFE RYDTFFKKFYNLTPDDGRMLLHTITIPDKEEAQELGLTSPMSLLRFIKFILTEIFPGG RLPRISQVDYYSSNAGWKVERYHRIGANYVPTLNAWADALQAHKDEAIALKGQETYDI YMHYLRGCSDLFRDKYTDVCQFTLVK" gene complement(594802..595302) /locus_tag="Rv0504c" /db_xref="GeneID:887268" CDS complement(594802..595302) /locus_tag="Rv0504c" /function="UNKNOWN" /note="Rv0504c, (MTCY20G9.31c), len: 166 aa. Conserved hypothetical protein, equivalent to P54879|Y504_MYCLE|ML2425 HYPOTHETICAL 18.7 KDA PROTEIN from Mycobacterium leprae (166 aa), FASTA scores: opt: 884, E(): 0, (83.1% identity in 166 aa overlap); and highly similar to other proteins from Mycobacterium leprae. Also highly similar to CAB77410.1|AL160431|SCD82.07 hypothetical protein from Streptomyces coelicolor (150 aa). Also similar to M. tuberculosis hypothetical proteins Rv0635|H70612 (158 aa); and Rv0637|B70613 (166 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215018.1" /db_xref="GI:15607645" /db_xref="GeneID:887268" /translation="MTVPEEAQTLIGKHYRAPDHFLVGREKIREFAVAVKDDHPTHYS EPDAAAAGYPALVAPLTFLAIAGRRVQLEIFTKFNIPINIARVFHRDQKFRFHRPILA NDKLYFDTYLDSVIESHGTVLAEIRSEVTDAEGKPVVTSVVTMLGEAAHHEADADATV AAIASI" gene complement(595464..596585) /gene="serB1" /locus_tag="Rv0505c" /db_xref="GeneID:887270" CDS complement(595464..596585) /gene="serB1" /locus_tag="Rv0505c" /EC_number="3.1.3.3" /function="REMOVES A PHOSPHATE FROM PHOSPHOSERINE [CATALYTIC ACTIVITY: Phosphoserine + H2O = serine + phosphate]." /note="Rv0505c, (MTCY20G9.32c), len: 373 aa. Possible serB1, phosphoserine phosphatase (EC 3.1.3.3), equivalent (but longer 70 aa in N-terminus) to S72914|serB phosphoserine phosphatase from Mycobacterium leprae (300 aa), FASTA scores: opt: 1570, E(): 0, (83.0% identity in 306 aa overlap). C-terminus highly similar to CAB55344.1|AJ010584 phosphoserine phosphatase from Streptomyces coelicolor (266 aa). Low similarity to SERB_ECOLI|P06862 phosphoserine phosphatase from Escherichia coli strains K12 and O157:H7 (322 aa), FASTA scores: opt: 148, E(): 0.043, (24.0% identity in 150 aa overlap). C-terminus is also similar to O33611|AB004855_1|IMD_STRCN PROTEIN INVOLVED IN INHIBITION OF MORPHOLOGICAL DIFFERENTIATION from Streptomyces cyaneus (277 aa), FASTA score: (37.7% identity in 252 aa overlap). SEEMS TO BELONG TO THE SERB FAMILY. Note that previously known as serB.; serB" /codon_start=1 /transl_table=11 /product="phosphoserine phosphatase" /protein_id="YP_177732.1" /db_xref="GI:57116746" /db_xref="GeneID:887270" /translation="MGLTCWPRTAAGRVHDESRCGLANFDTALGLQINPRQPRAPPRI CRIGLITAAASATGQAPRLGVMMVSSHLGSPDQAGHVDLASPADPPPPDASASHSPVD MPAPVAAAGSDRQPPIDLTAAAFFDVDNTLVQGSSAVHFGRGLAARHYFTYRDVLGFL YAQAKFQLLGKENSNDVAAGRRKALAFIEGRSVAELVALGEEIYDEIIADKIWDGTRE LTQMHLDAGQQVWLITATPYELAATIARRLGLTGALGTVAESVDGIFTGRLVGEILHG TGKAHAVRSLAIREGLNLKRCTAYSDSYNDVPMLSLVGTAVAINPDARLRSLARERGW EIRDFRIARKAARIGVPSALALGAAGGALAALASRRQSR" gene 596759..597202 /gene="mmpS2" /locus_tag="Rv0506" /db_xref="GeneID:887279" CDS 596759..597202 /gene="mmpS2" /locus_tag="Rv0506" /function="UNKNOWN" /note="Rv0506, (MTCY20G9.33), len: 147 aa. Probable mmpS2, conserved membrane protein (see citation below), highly similar to other Mycobacterial proteins e.g. C-terminus of AAD44232.1|AF143772_38|AF143772|TmtpA from Mycobacterium avium (221 aa); P54880|MMS4_MYCLE|MMPS4 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 392, E(): 1.3e-20, (43.7% identity in 151 aa overlap); and the PUTATIVE MEMBRANE PROTEINS from Mycobacterium tuberculosis MTV040_5, MTCY4D9_16, MTV037_15. BELONGS TO THE MMPS FAMILY." /codon_start=1 /transl_table=11 /product="membrane protein" /protein_id="NP_215020.1" /db_xref="GI:15607647" /db_xref="GeneID:887279" /translation="MRMISVSGAVKRMWLLLAIVVVAVVGGLGIYRLHSIFGVHEQPT VMVKPDFDVPLFNPKRVTYEVFGPAKTAKIAYLDPDARVHRLDSVSLPWSVTVETTLP AVSVNLMAQSNADVISCRIIVNGAVKDERSETSPRALTSCQVSSG" gene 597199..600105 /gene="mmpL2" /locus_tag="Rv0507" /db_xref="GeneID:887248" CDS 597199..600105 /gene="mmpL2" /locus_tag="Rv0507" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /note="Rv0507, (MTCY20G9.34), len: 968 aa. Probable mmpL2, conserved transmembrane transport protein (see citations below), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. YV34_MYCLE from Mycobacterium leprae (959 aa), FASTA scores: opt: 3699, E(): 0, (58.3% identity in 940 aa overlap); and the Mycobacterium tuberculosis proteins MTV037_14, MTV040_4, MTCY98_8, MTCY4D9_15, MTCY48_8, MTCY19G5_6, MTV005_19, etc. Also similar to STMACTII_3|SC10A5_9 from Streptomyces coelicolor; and BSUB0|004_12 from Bacillus subtilis. C-terminal half similar to Q50086|U1740AB from Mycobacterium leprae (386 aa), FASTA scores: opt: 1526, E(): 0, (61.5% identity in 371 aa overlap). BELONGS TO THE MMPL FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL2" /protein_id="NP_215021.1" /db_xref="GI:15607648" /db_xref="GeneID:887248" /translation="MSERHAALTSLPPILPRLIRRFAVVIVLLWLGFTAFVNLAVPQL EVVGKAHSVSMSPSDAASIQAIKRVGQVFGEFDSDNAVTIVLEGDQPLGGDAHRFYSD LMRKLSADTRHVAHIQDFWGDPLTAAGSQSADDRAAYVVVYLVGNNETEAYDSVHAVR HMVDTTPPPHGVKAYVTGPAALNADQAEAGDKSIAKVTAITSMVIAAMLLVIYRSVIT AVLVLIMVGIDLGAIRGFIALLADHNIFSLSTFATNLLVLMAIAASTDYAIFMLGRYH ESRYAGEDRETAFYTMFHGTAHVILGSGLTIAGAMYCLSFARLPYFETLGAPIAIGML VAVLAALTLGPAVLTVGSFFKLFDPKRRMNTRRWRRVGTAIVRWPGPVLAATCLVASI GLLALPSYRTTYDLRKFMPASMPSNVGDAAAGRRFSRARLNPEVLLIETDHDMRNPVD MLVLDKVAKNIYHSPGIEQVKAITRPLGTTIKHTSIPFIISMQGVNSSEQMEFMKDRI DDILVQVAAMNTSIETMHRMYALMGEVIDNTVDMDHLTHDMSDITATLRDHLADFEDF FRPIRSYFYWEKHCFDVPLCWSIRSIFDMFDSVDQLSEKLEYLVKDMDILITLLPQMR AQMPPMISAMTTMRDMMLIWHGTLGAFYKQQERNNKDPGAMGRVFDAAQIDDSFYLPQ SAFENPDFKRGLKMFLSPDGKAARFVIALEGDPATPEGISRVEPIKREAREAIKGTPL QGAAIYLGGTAATFKDIREGARYDLLIAGVAAISLILIIMMIITRSVVAAVVIVGTVV LSMGASFGLSVLVWQDILGIELYWMVLAMSVILLLAVGSDYNLLLISRLKEEIGAGLN TGIIRAMAGTGGVVTAAGMVFAVTMSLFVFSDLRIIGQIGTTIGLGLLFDTLVVRSFM TPSIAALLGRWFWWPLRVRPRPASQMLRPFAPRRLVRALLLPSGQHPSATGAHE" gene 600098..600391 /locus_tag="Rv0508" /db_xref="GeneID:887301" CDS 600098..600391 /locus_tag="Rv0508" /function="UNKNOWN" /note="Rv0508, (MTCY20G9.35), len: 97 aa. Conserved hypothetical protein, showing similarity with T36269|5123666|CAB45355.1|AL079345 probable redoxin from Streptomyces coelicolor (101 aa), FASTA scores: opt: 160, E(): 3.4e-05, (33.3% identity in 75 aa overlap); and E81943|NMA0966 probable thioredoxin from Neisseria meningitidis group A strain Z2491 (77 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215022.1" /db_xref="GI:15607649" /db_xref="GeneID:887301" /translation="MSRPQVELLTRAGCAICVRVAEQLAELSSELGFDMMTIDVDVAA STGNPGLRAEFGDRLPVVLLDGREHSYWEVDEHRLRADIARSTFGSPPDKRLP" gene 600441..601847 /gene="hemA" /locus_tag="Rv0509" /db_xref="GeneID:887292" CDS 600441..601847 /gene="hemA" /locus_tag="Rv0509" /EC_number="1.2.1.-" /function="INVOLVED IN PORPHYRIN BIOSYNTHESIS BY THE C5 PATHWAY (AT THE FIRST STEP) [CATALYTIC ACTIVITY: GLUTAMYL-TRNA(GLU) + NADPH = GLUTAMATE-1-SEMIALDEHYDE + NADP+ + TRNA(GLU)]." /note="catalyzes the formation of glutamate-1-semialdehyde from glutamyl-tRNA(Glu) and NADPH; the second step of the pathway is catalyzed by glutamate-1-semialdehyde aminomutase which results in the formation of 5-aminolevulinic acid; functions in porphyrin (tetrapyrroles) biosynthesis; the crystal structure showed a C-terminal dimerization domain that appears to be absent in Chlamydial proteins" /codon_start=1 /transl_table=11 /product="glutamyl-tRNA reductase" /protein_id="NP_215023.1" /db_xref="GI:15607650" /db_xref="GeneID:887292" /translation="MSVLLFGVSHRSAPVVVLEQLSIDESDQVKIIDRVLASPLVTEA MVLSTCNRVEVYAVVDAFHGGLSVIGQVLAEHSGMSMGELTKYAYVRYSEAAVEHLFA VASGLDSAVIGEQQVLGQVRRAYAVAESNRTVGRVLHELAQRALSVGKRVHSETAIDA AGASVVSVALGMAERKLGSLAGTTAVVIGAGAMGALSAVHLTRAGVGHIQVLNRSLSR AQRLARRIRESGVPAEALALDRLANVLADADVVVSCTGAVRPVVSLADVHHALAAARR DEATRPLVICDLGMPRDVDPAVARLPCVWVVDVDSVQHEPSAHAAAADVEAARHIVAA EVASYLVGQRMAEVTPTVTALRQRAAEVVEAELLRLDNRLPGLQSVQREEVARTVRRV VDKLLHAPTVRIKQLASAPGGDSYAEALRELFELDQTAVDAVATAGELPVVPSGFDAE SRRGGGDMQSSPKRSPSN" misc_feature 600735..600806 /gene="hemA" /locus_tag="Rv0509" /note="PS00747 Glutamyl-tRNA reductase signature" gene 601857..602786 /gene="hemC" /locus_tag="Rv0510" /db_xref="GeneID:887306" CDS 601857..602786 /gene="hemC" /locus_tag="Rv0510" /EC_number="2.5.1.61" /function="INVOLVED IN PORPHYRIN BIOSYNTHESIS BY THE C5 PATHWAY (AT THE FOURTH STEP). TETRAPOLYMERIZATION OF THE MONOPYRROLE PBG INTO THE HYDROXYMETHYLBILANE PREUROPORPHYRINOGEN IN SEVERAL DISCRETE STEPS [CATALYTIC ACTIVITY: 4 porphobilinogen + H2O = hydroxymethylbilane + 4 NH3]." /note="transformation of porphobilinogen to hydroxymethylbilane in porphyrin biosynthesis" /codon_start=1 /transl_table=11 /product="porphobilinogen deaminase" /protein_id="NP_215024.1" /db_xref="GI:15607651" /db_xref="GeneID:887306" /translation="MIRIGTRGSLLATTQAATVRDALIAGGHSAELVTISTEGDRSMA PIASLGVGVFTTALREAMEAGLVDAAVHSYKDLPTAADPRFTVAAIPPRNDPRDAVVA RDGLTLGELPVGSLVGTSSPRRAAQLRALGLGLEIRPLRGNLDTRLNKVSSGDLDAIV VARAGLARLGRLDDVTETLEPVQMLPAPAQGALAVECRAGDSRLVAVLAELDDADTRA AVTAERALLADLEAGCSAPVGAIAEVVESIDEDGRVFEELSLRGCVAALDGSDVIRAS GIGSCGRARELGLSVAAELFELGARELMWGVRH" gene 602819..604516 /gene="hemD" /locus_tag="Rv0511" /db_xref="GeneID:887280" CDS 602819..604516 /gene="hemD" /locus_tag="Rv0511" /EC_number="2.1.1.107" /function="Possibly involved in the biosynthesis of siroheme and cobalamin [CATALYTIC ACTIVITY: 2 S-adenosyl-L-methionine + uroporphyrin III = 2 S-adenosyl-L-homocysteine + sirohydrochlorin]." /note="Rv0511, (MTCY21C8.02), len: 565 aa. Probable hemD (alternate gene name: cysG), uroporphyrin-III C-methyltransferase (EC 2.1.1.107), highly similar to others e.g. CAC31936.1|AL583925 possible uroporphyrin-III C-methyltransferase from Mycobacterium leprae (563 aa); and S72909|CYSG from Mycobacterium leprae (472 aa), FASTA scores: opt: 1946, E(): 0, (83.3% identity in 472 aa overlap); T36265|5123662|CAB45351.1|AL079345 probable uroporphyrin-III C-methyltransferase from Streptomyces coelicolor (565 aa); and similar to others e.g. AAK00606.1|AF221100_3|AF221100 from Selenomonas ruminantium subsp. ruminantium (505 aa); etc. Also similar to Rv2071c and Rv2847c from Mycobacterium tuberculosis. Note that previously known as cysG.; cysG" /codon_start=1 /transl_table=11 /product="uroporphyrin-III C-methyltransferase HemD" /protein_id="YP_177733.1" /db_xref="GI:57116747" /db_xref="GeneID:887280" /translation="MTRGRKPRPGRIVFVGSGPGDPGLLTTRAAAVLANAALVFTDPD VPEPVVALIGTDLPPVSGPAPAEPVAGNGDAAGGGSAQEHGRAASAVVSGGPDIRPAL GDPADVAKTLTAEARSGVDVVRLVAGDPLTVDAVISEVNAVARTHLHIEIVPGLAASS AVPTYAGLPLGSSHTVADVRIDPENTDWDALAAAPGPLILQATASHLAESARSLIDHQ LAESTPCVVTAHGTTCQQRSVETTLQGLTDPAVLGATDPACSANGRDSQAGPLIVTIG KTVTSRAKLNWWESRALYGWTVLVPRTKDQAGEMSERLTSYGALPVEVPTIAVEPPRS PAQMERAVKGLVDGRFQWIVFTSTNAVRAVWEKFGEFGLDARAFSGVKIACVGESTAD RVRAFGISPELVPSGEQSSLGLLDDFPPYDSVFDPVNRVLLPRADIATETLAEGLRER GWEIEDVTAYRTVRAAPPPATTREMIKTGGFDAVCFTSSSTVRNLVGIAGKPHARTII ACIGPKTAETAAEFGLRVDVQPDTAAIGPLVDALAEHAARLRAEGALPPPRKKSRRR" gene 604602..605591 /gene="hemB" /locus_tag="Rv0512" /db_xref="GeneID:887312" CDS 604602..605591 /gene="hemB" /locus_tag="Rv0512" /EC_number="4.2.1.24" /function="INVOLVED IN PORPHYRIN AND HEME BIOSYNTHESIS (AT THE SECOND STEP) [CATALYTIC ACTIVITY: 2 5-aminolevulinate = porphobilinogen + 2 H2O]." /note="catalyzes the formation of porphobilinogen from 5-aminolevulinate" /codon_start=1 /transl_table=11 /product="delta-aminolevulinic acid dehydratase" /protein_id="NP_215026.1" /db_xref="GI:15607653" /db_xref="GeneID:887312" /translation="MSMSSYPRQRPRRLRSTVAMRRLVAQTSLEPRHLVLPMFVADGI DEPRPITSMPGVVQHTRDSLRRAAAAAVAAGVGGLMLFGVPRDQDKDGVGSAGIDPDG ILNVALRDLAKDLGEATVLMADTCLDEFTDHGHCGVLDDRGRVDNDATVARYVELAVA QAESGAHVVGPSGMMDGQVAAIRDGLDAAGYIDVVILAYAAKFASAFYGPFREAVSSS LSGDRRTYQQEPGNAAEALREIELDLDEGADIVMVKPAMGYLDVVAAAADVSPVPVAA YQVSGEYAMIRAAAANNWIDERAAVLESLTGIRRAGADIVLTYWAVDAAGWLT" misc_feature 605340..605378 /gene="hemB" /locus_tag="Rv0512" /note="PS00169 Delta-aminolevulinic acid dehydratase active site" gene 605604..606152 /locus_tag="Rv0513" /db_xref="GeneID:887307" CDS 605604..606152 /locus_tag="Rv0513" /function="UNKNOWN" /note="Rv0513, (MTCY20G10.03), len: 182 aa. Possible conserved transmembrane protein, with its N-terminus highly similar to S72925|B2168_C1_182 hypothetical protein from Mycobacterium leprae (103 aa), FASTA scores: opt: 217, E(): 8.2e-14, (45.3 % identity in 106 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215027.1" /db_xref="GI:15607654" /db_xref="GeneID:887307" /translation="MTPTGDTKPKLLFYEPGASWYWVLTGPLAAVSVLLLEISSGAGV GLITPAIFLVMVSAFVALQVKAARIHTSVELTHDALRQGTETIRLAEIVKIYPEADGR ETSGEEPAKWQSARTLGELVGVPRGRVGIGLKLTGGRTAQAWARRHQQLRAALTPLVQ ERLGPVDSDVADVNGDDAGPAR" gene 606149..606448 /locus_tag="Rv0514" /db_xref="GeneID:887319" CDS 606149..606448 /locus_tag="Rv0514" /function="UNKNOWN" /note="Rv0514, (MTCY20G10.04), len: 99 aa. Possible transmembrane protein." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215028.1" /db_xref="GI:15607655" /db_xref="GeneID:887319" /translation="MIARYRAGAELFLACAALAGSAASWSRTRSTVAVAPVIDGQPVT LSVVYHPQPLVLTLLLATIAGVLSVVGTARLRRARAGLNAHPDGLNQRPPGGWCH" gene 606551..608062 /locus_tag="Rv0515" /db_xref="GeneID:887322" CDS 606551..608062 /locus_tag="Rv0515" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv0515, (MTCY20G10.05), len: 503 aa. Part of M. tuberculosis 13E12 repeat family. Almost identical to Rv0336 (99.8% identity in 503 aa overlap), possibly due to a recent gene duplication. Also similar to other M. tuberculosis hypothetical 13E12 repeat proteins e.g. Rv1148c, Rv1945, etc." /codon_start=1 /transl_table=11 /product="13E12 repeat family protein" /protein_id="NP_215029.1" /db_xref="GI:15607656" /db_xref="GeneID:887322" /translation="MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAA AQLVALGELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAMRE RLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVARWPSMTKARLA GQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIGGSLLAVDAHALDARLSAL AGTVCEHDPRSREQRRADALGALAGGADRLGCGCGRADCAAGKRPAAPPVVIHLIAEA ATINGTGSAPASQMNADGLITAELVAELAKTATLVPLVHPGDAPPEPGYAPSKALADF VRCRDLTCRWPGCDEPATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQ QLPDGTLILTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDHNDDPPPF" gene complement(608059..608535) /locus_tag="Rv0516c" /db_xref="GeneID:887324" CDS complement(608059..608535) /locus_tag="Rv0516c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0516c, (MTCY20G10.06c), len: 158 aa. Conserved hypothetical protein, showing some similarity to Rv1365c|MTCY02B10_29 from Mycobacterium tuberculosis (128 aa), FASTA scores: E(): 0.0012, (27.4% identity in 124 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215030.1" /db_xref="GI:15607657" /db_xref="GeneID:887324" /translation="MTTTIPTSKSACSVTTRPGNAAVDYGGAQIRAYLHHLATVVTIR GEIDAANVEQISEHVRRFSLGTNPMVLDLSELSHFSGAGISLLCILDEDCRAAGVQWA LVASPAVVEQLGGRCDQGEHESMFPMARSVHKALHDLADAIDRRRQLVLPLISRSA" gene 608746..610056 /locus_tag="Rv0517" /db_xref="GeneID:887323" CDS 608746..610056 /locus_tag="Rv0517" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0517, (MTCY20G10.07), len: 436 aa. Possible acyltransferase (EC 2.3.1.-), integral membrane protein, equivalent (but longer 26 aa in N-terminus) to AAK44761.1|AE006954 putative acyltransferase from Mycobacterium tuberculosis strain CDC1551 (410 aa). Also similar to many acyltransferases e.g. MDMB_STRMY|Q00718 from Streptomyces mycarofaciens (387 aa), FASTA scores: opt: 200, E(): 1.1e-08, (28.2% identity in 394 aa overlap). And similar to Rv0111, Rv0228, Rv1254, Rv1565c from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="membrane acyltransferase" /protein_id="NP_215031.1" /db_xref="GI:15607658" /db_xref="GeneID:887323" /translation="MAGGMDQPPGQPRRRTRQQSSDGKNGVRAAEITGEIRALTGLRI VAAVWVVLFHFRPMLGDASPGFRDALAPVLDCGAQGVDLFFILSGFVLTWNYLDRMGR SWSVRANLHFLWLRLARVWPVYLVTLHLAAVWVIFTLHVGHVPSPEAGQLTAISYVRQ ILLVQLWFQPYFDGSSWDGPAWSISAEWLAYLLFGLLILVIFRMKHATRARGLMWLAF AASLPPVVLLLASGQFYTPWSWLPRIVTQFAAGALACAAVRRLRPTDRARRIAGYLSV LVGVAIVGILYLLHAHPLAGVEDSGGVVDVLFVPLVISLAIGVGSLPALLSTRLMVFG GQISFCLYMVHELVHTAWGWAVQQYELALQDQPWKWNVVGLLAIALGAAILLYHFVEE PGRRWMRRMVDVKAASARSEPGEPVGSTRYQIDDALEGVSARAV" gene 610188..610883 /locus_tag="Rv0518" /db_xref="GeneID:887321" CDS 610188..610883 /locus_tag="Rv0518" /function="UNKNOWN" /note="Rv0518, (MTCY20G10.08), len: 231 aa. Possible exported protein; has hydrophobic N-terminus." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215032.1" /db_xref="GI:15607659" /db_xref="GeneID:887321" /translation="MSRPGTYVIGLTLLVGLVVGNPGCPRSYRPLTLDYRLNPVAVIG DSYTTGTDEGGLGSKSWTARTWQMLAARGVRIAADVAAEGRAGYGVPGDHGNVFEDLT ARAVQPDDALVVFFGSRNDQGMDPEDPEMLAEKVRDTFDLARHRAPSASLLVIAPPWP TADVPGPMLRIRDVLGAQARAAGAVFVDPIADHWFVDRPELIGADGVHPNDAGHEYLA DKIAPLISMELVG" gene complement(611172..612074) /locus_tag="Rv0519c" /db_xref="GeneID:887334" CDS complement(611172..612074) /locus_tag="Rv0519c" /function="UNKNOWN; COULD HAVE POSSIBLY A LIPOLYTIC ACTIVITY." /note="Rv0519c, (MTCY20G10.09c), len: 300 aa. Possible conserved membrane protein, with hydrophobic region near N-terminus. Could be a lipase (EC 3.1.-.-). Similar to Rv0774c|MTCY369.19c|A70708 from Mycobacterium tuberculosis (312 aa), FASTA scores: opt: 1092, E(): 0, (57.9% identity in 299 aa overlap). Contains PS00120 Lipases, serine active site." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215033.1" /db_xref="GI:15607660" /db_xref="GeneID:887334" /translation="MLRRGCAGNTDRRGIMTPMADLTRRALLRWGAGAGAGAAGVWAF GALVDPLEPQAAPAPFEPPTAGSSLPTRISGSFISAARGGIKTNWVISMPPGQSGQLR PVIALHGKDGNAGMMLDLGVEQGLARLVKEGKPAFAVVGVDGGNTYWHRRSSGGDSGA MVLDELLPMLTSMGMDTSRVGFLGWSMGGYGALLLGARLGPARTAGICAISPALFTSF TGSTPGAFDSYDDYVQHSVLGLPALNSIPLRVDCGTSDRFYFATRQFVNQLHQPPAGS FSPGGHDASYWREQLPGELAWMAS" misc_feature complement(611508..611537) /locus_tag="Rv0519c" /note="PS00120 Lipases, serine active site" gene 612255..612605 /locus_tag="Rv0520" /db_xref="GeneID:887331" CDS 612255..612605 /locus_tag="Rv0520" /function="COULD CAUSE METHYLATION." /note="Rv0520, (MTCY20G10.10), len: 116 aa. Possible fragment of methyltransferase (possibly first part) (EC 2.1.1.-), highly similar to part of several methyltransferases e.g. Q43445|U43683 S-ADENOSYL-L-METHIONINE:DELTA24-STEROL-C-METHYLTRANSFERAS E from Glycine max (Soybean)(367 aa), FASTA scores: opt: 190, E(): 2.3e-12, (39.2% identity in 74 aa overlap). Also some similarity to MTCY19G5_5 from Mycobacterium tuberculosis. Possibly continues as Rv0521 but we can find no frameshift to account for this." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215034.1" /db_xref="GI:15607661" /db_xref="GeneID:887331" /translation="MGGCSITCLNISEVPNETNRKKNRQAGLDRSIRVIHGSFDDIPE PDSGYDVVWSQDAILHAPDRRKVLEEAFRVLRPGGELIFTDPMQADDVPDGVLQPVYD RLNLRDLGSMRFYA" gene 612598..612903 /locus_tag="Rv0521" /db_xref="GeneID:3205045" CDS 612598..612903 /locus_tag="Rv0521" /function="COULD CAUSE METHYLATION" /note="Rv0521, (replaces MTCY20G10.11), len: 101 aa. Possible fragment of methyltransferase (possibly second part) (EC 2.1.1.-), highly similar to C-terminus of several methyltransferases e.g. AAF87203.1|AF216282 sarcosine-dimethylglycine methyltransferase from Halorhodospira halochloris (279 aa). Possibly continuation of Rv0520 but we can find no frameshift to account for this." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177626.1" /db_xref="GI:57116748" /db_xref="GeneID:3205045" /translation="MREAAQALGFEVLDQRDLVRNLRTHYSRVFEELEARRLELEGKS SQEYLDKMRVGLKNWVEAADNGHSRVGHPTFPRTRLTPICQLPTAAIDSTAGRRRYR" gene 613038..614342 /gene="gabP" /locus_tag="Rv0522" /db_xref="GeneID:886261" CDS 613038..614342 /gene="gabP" /locus_tag="Rv0522" /function="INVOLVED IN 4-AMINOBUTYRATE (GABA) DEGRADATION PATHWAY. TRANSPORTER FOR GABA. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0522, (MTCY20G10.12), len: 434 aa. Probable gabP, GABA permease (gamma-aminobutyrate permease), integral membrane protein, highly similar to others e.g. GABP_ECOLI|P25527 gaba permease from Escherichia coli (466 aa), FASTA scores: opt: 1218, E(): 0, (44.3% identity in 424 aa overlap); etc. Also similar to other M. tuberculosis permeases e.g. MTCY13E10.06c FASTA score: (34.4% identity in 407 aa overlap). Contains PS00218 Amino acid permeases signature. Overlaps and extends Rv0523c|MTCY25D10.01 from overlapping cosmid. BELONGS TO THE AMINO ACID PERMEASE FAMILY (APC FAMILY)." /codon_start=1 /transl_table=11 /product="GABA permease GabP" /protein_id="YP_177734.1" /db_xref="GI:57116749" /db_xref="GeneID:886261" /translation="MIAIGGVIGAGLFVGSGVVIRATGPAAFLTYALCGALIVLVMRM LGEMAAANPSTGAFADYAAKALGGWAGFSVGWLYWYFWVIVVGFEAVAGGKVLTYWID APLWLASLCLMMMMTATNLVSVSSFGEFEFWFAGVKVATIVGFLVLGTAFAFGLLPGH GMDFSNLSAHGGFFPDGVGAVFAAIVVAIFSMTGTEVVTIAAAEAPDPQRAVQRAMST VVARIVIFFVGSVFLLTVILPWNSLELGASPYVAALRHMGIGGADQIMNAVVLTAVLS CLNSGLYTASRMLFVLAARQEAPAQLVKVNRRGVPTFAIMGSSVVGFLCVIMAWVSPA TVFVFLLNSSGAVILFVYLLIALSQIVLRRQTSGQNLGVRMWLFPGLSIVTVTGIVAV LARMAFDYAARSQLWLSLLSWAVVVGCYLVTTLVRRPLNRPW" misc_feature 613104..613196 /gene="gabP" /locus_tag="Rv0522" /note="PS00218 Amino acid permeases signature" gene complement(614326..614721) /locus_tag="Rv0523c" /db_xref="GeneID:887341" CDS complement(614326..614721) /locus_tag="Rv0523c" /function="UNKNOWN" /note="Rv0523c, (MTCY25D10.02), len: 131 aa. Conserved hypothetical protein, showing some similarity to M. tuberculosis proteins Rv1598c|MTCY336.06; and Rv1871c|MTCY336_06|O06592 (136 aa), FASTA scores: opt: 197, E(): 5e-08, (38.4% identity in 99 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215037.1" /db_xref="GI:15607663" /db_xref="GeneID:887341" /translation="MQLPQWLARFNRYVTNPIQRLWAGWLPAFAILEHVGRRSGKPYR TPLNVFSADVDGRAGVAILLTYGPNRDWLKNITAAGGGRMRRYGKTFGVANPRRLTKA EAAPYVSSRWRPVFARLPFDEAVLLTKAD" gene 614835..616223 /gene="hemL" /locus_tag="Rv0524" /db_xref="GeneID:887349" CDS 614835..616223 /gene="hemL" /locus_tag="Rv0524" /EC_number="5.4.3.8" /function="INVOLVED IN PORPHYRIN BIOSYNTHESIS BY THE C5 PATHWAY (AT THE SECOND STEP) [CATALYTIC ACTIVITY: (S)-4-amino-5-oxopentanoate = 5-aminolevulinate]." /note="Converts (S)-4-amino-5-oxopentanoate to 5-aminolevulinate during the porphyrin biosynthesis pathway" /codon_start=1 /transl_table=11 /product="glutamate-1-semialdehyde aminotransferase" /protein_id="NP_215038.1" /db_xref="GI:15607664" /db_xref="GeneID:887349" /translation="MGSTEQATSRVRGAARTSAQLFEAACSVIPGGVNSPVRAFTAVG GTPRFITEAHGCWLIDADGNRYVDLVCSWGPMILGHAHPAVVEAVAKAAARGLSFGAP TPAETQLAGEIIGRVAPVERIRLVNSGTEATMSAVRLARGFTGRAKIVKFSGCYHGHV DALLADAGSGVATLGLCDDPQRPASPRSQSSRGLPSSPGVTGAAAADTIVLPYNDIDA VQQTFARFGEQIAAVITEASPGNMGVVPPGPGFNAALRAITAEHGALLILDEVMTGFR VSRSGWYGIDPVPADLFAFGKVMSGGMPAAAFGGRAEVMQRLAPLGPVYQAGTLSGNP VAVAAGLATLRAADDAVYTALDANADRLAGLLSEALTDAVVPHQISRAGNMLSVFFGE TPVTDFASARASQTWRYPAFFHAMLDAGVYPPCSAFEAWFVSAALDDAAFGRIANALP AAARAAAQERPA" misc_feature 615630..615740 /gene="hemL" /locus_tag="Rv0524" /note="PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site" gene 616223..616831 /locus_tag="Rv0525" /db_xref="GeneID:887358" CDS 616223..616831 /locus_tag="Rv0525" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0525, (MTCY25D10.04), len: 202 aa. Conserved hypothetical protein, equivalent to Q49821|B2168_C3_276|S72912 hypothetical protein from Mycobacterium leprae (202 aa), FASTA scores: opt: 1151, E(): 0, (82.5% identity in 200 aa overlap). Also highly similar to CAC08377.1|AL392176 putative phosphoglycerate mutase from Streptomyces coelicolor (233 aa); and similar to SLL0395|Q55734 hypothetical 23.8 kDa protein from SYNECHOCYSTIS SP. (212 aa), FASTA scores: opt: 207, E(): 5.1e-07, (28.2% identity in 195 aa overlap). Also some similarity to Rv2228c|Y019_MYCTU|Q10512|cy427.09 hypothetical 39.2 kDa protein from Mycobacterium tuberculosis (364 aa), FASTA scores: opt: 236, E(): 1.1e-08, (34.3% identity in 198 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215039.1" /db_xref="GI:15607665" /db_xref="GeneID:887358" /translation="MPEETQVHVVRHGEVHNPTGILYGRLPGFHLSATGAAQAAAVAD ALADRDIVAVIASPLQRAQETAAPIAARHDLAVETDPDLIESANFFEGRRVGPGDGAW RDPRVWWQLRNPFTPSWGEPYVDIAARMTTAVDKARVRGAGHEVVCVSHQLPVWTLRL YLTGKRLWHDPRRRDCALASVTSLIYDGDRLVDVVYSQPAAL" repeat_region 616828..616878 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 616846..617496 /locus_tag="Rv0526" /db_xref="GeneID:887339" CDS 616846..617496 /locus_tag="Rv0526" /function="POSSIBLY ACTS ON THIOREDOXIN" /note="Rv0526, (MTCY25D10.05), len: 216 aa. Possible thioredoxin protein (thiol-disulfide interchange protein) (EC 1.-.-.-), equivalent to Q49816|U2168C|S72901 hypothetical protein from Mycobacterium leprae (216 aa), FASTA scores: opt: 1144, E(): 0, (78.5% identity in 214 aa overlap). C-terminus shows some similarity to C-terminus of thioredoxins e.g. RESA_BACSU|P35160 resa protein from Bacillus subtilis (181 aa), FASTA scores: opt: 200, E(): 7.4e-06, (24.2% identity in 132 aa overlap); etc. Also similar to Mycobacterium tuberculosis thioredoxin-like proteins Rv1470, Rv1471, Rv1677, etc. Contains PS00194 Thioredoxin family active site. SEEMS TO BELONG TO THE THIOREDOXIN FAMILY." /codon_start=1 /transl_table=11 /product="thioredoxin protein" /protein_id="NP_215040.1" /db_xref="GI:15607666" /db_xref="GeneID:887339" /translation="MQSRATRRSGALTMRRLVIAAAVSALLLTGCSGRDAVAQGGTFE FVSPGGKTDIFYDPPASRGRPGPLSGPELADPARSVSLDDFPGQVVVVNVWGQWCGPC RAEVSQLQRVYDATRGAGVSFLGIDVRDNNRQAPQDFINDRHVTYPSIYDPAMRTLIA FGGKYPTSVIPSTLVLDRQHRVAAVFLRELLAADLQPVVERVAEEEPSGRAPVGAQ" misc_feature 617116..617172 /locus_tag="Rv0526" /note="PS00194 Thioredoxin family active site" gene 617493..618272 /gene="ccdA" /locus_tag="Rv0527" /db_xref="GeneID:887362" CDS 617493..618272 /gene="ccdA" /locus_tag="Rv0527" /function="POSSIBLY INVOLVED IN CYTOCHROME C SYNTHESIS. MIGHT TRANSFER REDUCING EQUIVALENTS ACROSS THE CYTOPLASMIC MEMBRANE, PROMOTING EFFICIENT DISULFIDE BOND ISOMERIZATION OF PROTEINS LOCALIZED ON THE OUTER SURFACE OF THE MEMBRANE." /note="Rv0527, (MTCY25D10.06), len: 259 aa. Possible ccdA, cytochrome C-type biogenesis protein, integral membrane protein, equivalent to Q49810|B2168_C1_192|S72890 hypothetical protein from Mycobacterium leprae (262 aa), FASTA scores: opt: 1341, E(): 0, (79.0% identity in 262 aa overlap). Also highly similar to others e.g. CAC08380.1 (253 aa); CCDA_BACSU|P45706 cytochrome C-type biogenesis protein from Bacillus subtilis (235 aa), FASTA scores: opt: 307, E(): 7.4e-13, (30.4% identity in 237 aa overlap); etc. SEEMS TO BELONG TO THE DSBD SUBFAMILY. Note that previously known as ccsA.; ccsA" /codon_start=1 /transl_table=11 /product="cytochrome C-type biogenesis protein CcdA" /protein_id="YP_177735.1" /db_xref="GI:57116750" /db_xref="GeneID:887362" /translation="MTGFTEIAAVGPLLVAVGVCLLAGLVSFASPCVVPLVPGYLSYL AAVVGVDEQLPAGVVKPPVAARWRVAGSAALFVAGFTTVFVLGTVAVLGMTTTLITNQ LLLQRVGGVLIVVMGLVFVGFIGALQRQARFTPRQLTSVAGAPVLGAVFALGWTPCLG PTLTGVITVASATEGASVARGIVLVIAYCLGLGIPFVLLAFGSAWAVAGLGWLRRHTR AIQIFGGALLIAVGAALVTGVWNDVVSWLRDAFVSDVRLPI" gene 618305..619894 /locus_tag="Rv0528" /db_xref="GeneID:887372" CDS 618305..619894 /locus_tag="Rv0528" /function="UNKNOWN" /note="Rv0528, (MTCY25D10.07), len: 529 aa. Probable conserved transmembrane protein, equivalent (shorter 14 aa in N-terminus) to CAC31926.1|AL583925 conserved membrane protein from Mycobacterium leprae (542 aa). Also highly similar to Q49817|B2168_C2_237|S72902 hypothetical protein from Mycobacterium leprae (364 aa), FASTA scores: opt: 1846, E(): 0, (81.1% identity in 338 aa overlap); and Q49811|B2168_C1_194|S72891 hypothetical protein from Mycobacterium leprae (106 aa), FASTA scores: opt: 506, E(): 3.8e-26, (73.6% identity in 106 aa overlap). Also highly similar to CAC08381.1|AL392176 putative integral membrane protein from Streptomyces coelicolor (574 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215042.1" /db_xref="GI:15607668" /db_xref="GeneID:887372" /translation="MWRSLTSMGTALVLLFLLALAAIPGALLPQRGLNAAKVDDYLAA HPLIGPWLDELQAFDVFSSFWFTAIYVLLFVSLVGCLAPRTIEHARSLRATPVAAPRN LARLPKHAHARLAGEPAALAATITGRLRGWRSITRQQGDSVEVSAEKGYLREFGNLVF HFALLGLLVAVAVGKLFGYEGNVIVIADGGPGFCSASPAAFDSFRAGNTVDGTSLHPI CVRVNNFQAHYLPSGQATSFAADIDYQADPATADLIANSWRPYRLQVNHPLRVGGDRV YLQGHGYAPTFTVTFPDGQTRTSTVQWRPDNPQTLLSAGVVRIDPPAGSYPNPDERRK HQIAIQGLLAPTEQLDGTLLSSRFPALNAPAVAIDIYRGDTGLDSGRPQSLFTLDHRL IEQGRLVKEKRVNLRAGQQVRIDQGPAAGTVVRFDGAVPFVNLQVSHDPGQSWVLVFA ITMMAGLLVSLLVRRRRVWARITPTTAGTVNVELGGLTRTDNSGWGAEFERLTGRLLA GFEARSPDMAEAAAGTGRDVD" gene 619891..620865 /gene="ccsA" /locus_tag="Rv0529" /db_xref="GeneID:887380" CDS 619891..620865 /gene="ccsA" /locus_tag="Rv0529" /function="REQUIRED DURING CYTOCHROME BIOGENESIS AT THE STEP OF HEME ATTACHMENT." /note="Rv0529, (MTCY25D10.08), len: 324 aa. Possible ccsA, cytochrome C-type biogenesis protein, integral membrane protein, equivalent to NP_302558.1|NC_002677|B2168_C3_281 possible cytochrome C biogenesis protein from Mycobacterium leprae (327 aa), FASTA scores: opt: 1779, E(): 0, (82.9% identity in 327 aa overlap). Also highly similar to others e.g. CAC08382.1|AL392176 putative cytochrome biogenesis related protein from Streptomyces coelicolor (380 aa); CCSA_CHLRE|P48269 probable cytochrome c biogenesis protein from Chlamydomonas reinhardtii (353 aa), FASTA scores: opt: 449, E(): 1.3e-23, (34.4% identity in 247 aa overlap); etc. BELONGS TO THE CCMF/CYCK/CCL1/NRFE/CCSA FAMILY. Note that previously known as ccsB.; ccsB" /codon_start=1 /transl_table=11 /product="cytochrome C-type biogenesis protein CcsA" /protein_id="NP_215041.2" /db_xref="GI:57116751" /db_xref="GeneID:887380" /translation="MNTLHVNVGLARYSDWAFTSAVVALVVALLLLAFEFAQVRGRGL APLAVPAGSVATDSATPGIVADQRHRPFDERVGRGGLAVAYLGIGLLLACVVLRGLAT QRVPWGNMYEFINLTCLSGLIAGAVVLRRARYRPLWVFLLVPVLILLTVSGRWLYANA APVMPALQSYWLPIHVSVVSLGSGVFLVAGVASILFLVRTSRLGEPTGEGALAGMVRR LPDAQTLDGIAYRTTIFAFPVFGFGVIFGAIWAEEAWGRYWGWDPKETVSFVAWVVYA AYLHARSTAGWRDRKAAWINVAGFVAMVFNLFFVNLVTVGLHSYAGVG" gene 620907..622124 /locus_tag="Rv0530" /db_xref="GeneID:887385" CDS 620907..622124 /locus_tag="Rv0530" /function="UNKNOWN" /note="Rv0530, (MTCY25D10.09), len: 405 aa. Conserved hypothetical protein, similar in part to other hypothetical proteins e.g. AL031231|SC3C3_3|CAA20252.1 from Streptomyces coelicolor (1083 aa), FASTA scores: opt: 870, E(): 0, (39.5% identity in 443 aa overlap); etc. Also similar to Mycobacterium tuberculosis proteins e.g. Rv3868, Rv0282, Rv1798, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215044.1" /db_xref="GI:15607670" /db_xref="GeneID:887385" /translation="MLVTEHPRTGVGAPDSGNGGTDHPTVQLPPVPSVGAPPAAAGGE TPTRSVAGFRTQRLDPTAYGAYYSGPDEGPASPAERPPYRLEPVPHTPYPELATTTLL RPVKPPPSEGWRRLLYLLSGRLINAGEGPRAAHLNDLVAQVNRPLRGCYRIAVLSLKG GVGKTTITATLGATFADLRGDRVVAVDANPDRGTLSQKVPLETPATVRHLLRDADGIE RYSDVRGYTSKGPSGLEVLASDSDPASSDAFSADDYTRTLDILERFYGLVLTDCGTGL LHSAMSAVLPRSDVLVVVSSGSIDGARSAAATLDWLQAHGHDDQVRNSIAVVNAVRPR AGKVDVGKVVEHFSRRCRAVRVVPFDPHLEEGAEIALDRLRRETREALTELAAVVAAG FPGDPRRCKPSFT" gene 622329..622646 /locus_tag="Rv0531" /db_xref="GeneID:887376" CDS 622329..622646 /locus_tag="Rv0531" /function="UNKNOWN" /note="Rv0531, (MTCY25D10.10), len: 105 aa. Possible conserved membrane protein, highly similar to Y13803|MLB1306_1|CAA74131.1 hypothetical protein from Mycobacterium leprae (86 aa), FASTA scores: E(): 2.1e-24, (74.4% identity in 86 aa overlap); and NP_302557.1|NC_002677 putative membrane protein from Mycobacterium leprae (111 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215045.1" /db_xref="GI:15607671" /db_xref="GeneID:887376" /translation="MSEAPNDKTTRGVVDILVYATARLLLVVAVSAAIFGVARLIGLT EFPVVVATLFGLIIAMPLGIWVFSPLRRRATAALAVAGERRRAERERLRARLRGESLP EEQ" gene 622793..624577 /gene="PE_PGRS6" /locus_tag="Rv0532" /db_xref="GeneID:887391" CDS 622793..624577 /gene="PE_PGRS6" /locus_tag="Rv0532" /function="UNKNOWN" /note="Rv0532, (MTCY25D10.11), len: 594 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to others e.g. Y0DP_MYCTU|Q50615 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 1703, E(): 0, (58.2% identity in 536 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177736.1" /db_xref="GI:57116752" /db_xref="GeneID:887391" /translation="MSNLLVTPELVAAAAADLAGIGSAIGAANAAAGAPTMALLAAGA DEVSAAVAAVFSSYAQQYQALSAAAAAFHDQFVRALAAGAGAYAGAEAANVEQQLLNA INAPTLALLGRPLIGNGADGAAGTGQAGGAGGLLYGNGGNGGSGAAGQAGGAGGAAGL IGHGGTGGAVTGVSTTGGPGGHGGDAGLYGFGGAGGAGGFGQSGAAGGAGGAGGWLYG DGGDGGAGDNGGNESGTGVSAVGGVGGAGGAGGLLFGNGGDGGVGGDGGDGSSTQDSG GDGGAGGAGGAGGWLLGNGGAGGAGGAASIKVATGGLGGDGGDAGLFGFGGDGGWGGR GVDARFGAAGGAAGAGGAGGWLYGDGGAGGVGGVGGAVFSLSSGDGGAGGAGGGGGWL FGNGGDGGAGGGGGGRFGSGSGAGGDGAVGGAGGAGAWFGNGGAGGVGGGGGRGTTAI GGDGGAGGAGGAGGWLYGDGGAGGAGGGGGRGGTGNDGGDGGDGGRGGDAQLLGNGGD GGAGGAGGPAGLALPPGPARPAGAAVPAVRCSAAPARPARTADPWLAPIFARSTLRHS HHLGGIAQTGAVADQQGQIAGLGRAGRQ" gene complement(624473..625480) /gene="fabH" /locus_tag="Rv0533c" /db_xref="GeneID:887381" CDS complement(624473..625480) /gene="fabH" /locus_tag="Rv0533c" /EC_number="2.3.1.41" /function="INVOLVED IN FATTY ACID BIOSYNTHESIS. CATALYZES THE CONDENSATION REACTION OF FATTY ACID SYNTHESIS BY THE ADDITION TO AN ACYL ACCEPTOR OF TWO CARBONS FROM MALONYL-ACP. KAS III CATALYZES THE FIRST CONDENSATION REACTION WHICH INITIATES FATTY ACID SYNTHESIS AND MAY THEREFORE PLAY A ROLE IN GOVERNING THE TOTAL RATE OF FATTY ACID PRODUCTION. POSSESSES BOTH ACETOACETYL-ACP SYNTHASE AND ACETYL TRANSACYLASE ACTIVITIES [CATALYTIC ACTIVITY: Acyl-[acyl-carrier protein] + malonyl-[acyl-carrier protein] = 3-oxoacyl-[acyl-carrier protein] + CO2 + [acyl-carrier protein]]." /note="FabH; beta-ketoacyl-acyl carrier protein synthase III; catalyzes the condensation of acetyl-CoA with malonyl-ACP to initiate cycles of fatty acid elongation; differs from 3-oxoacyl-(acyl carrier protein) synthase I and II in that it utilizes CoA thioesters as primers rather than acyl-ACPs" /codon_start=1 /transl_table=11 /product="3-oxoacyl-(acyl carrier protein) synthase III" /protein_id="NP_215047.1" /db_xref="GI:15607673" /db_xref="GeneID:887381" /translation="MTEIATTSGARSVGLLSVGAYRPERVVTNDEICQHIDSSDEWIY TRTGIKTRRFAADDESAASMATEACRRALSNAGLSAADIDGVIVTTNTHFLQTPPAAP MVAASLGAKGILGFDLSAGCAGFGYALGAAADMIRGGGAATMLVVGTEKLSPTIDMYD RGNCFIFADGAAAVVVGETPFQGIGPTVAGSDGEQADAIRQDIDWITFAQNPSGPRPF VRLEGPAVFRWAAFKMGDVGRRAMDAAGVRPDQIDVFVPHQANSRINELLVKNLQLRP DAVVANDIEHTGNTSAASIPLAMAELLTTGAAKPGDLALLIGYGAGLSYAAQVVRMPK G" gene complement(625562..626440) /gene="menA" /locus_tag="Rv0534c" /db_xref="GeneID:887408" CDS complement(625562..626440) /gene="menA" /locus_tag="Rv0534c" /EC_number="2.5.1.-" /function="INVOLVED IN MENAQUINONE BIOSYNTHESIS. CONVERSION OF 1,4-DIHYDROXY-2-NAPHTHOATE (DHNA) TO DIMETHYLMENAQUINONE (DMK). ATTACHES OCTAPRENYLPYROPHOSPHATE, A MEMBRANE-BOUND 40-CARBON SIDE CHAIN TO DHNA. THE CONVERSION OF DHNA TO DMK PROCEEDS IN THREE STAGES: THE REMOVAL OF THE CARBOXYL GROUP OF DHNA AS CO2, THE ATTACHMENT OF THE ISOPRENOID SIDE CHAIN, AND A QUINOL-TO-QUINONE OXIDATION, WHICH IS THOUGHT TO BE SPONTANEOUS." /note="catalyzes the formation of dimethylmenaquinone from 1,4-dihydroxy-2-naphthoate and octaprenyl diphosphate" /codon_start=1 /transl_table=11 /product="1,4-dihydroxy-2-naphthoate octaprenyltransferase" /protein_id="NP_215048.1" /db_xref="GI:15607674" /db_xref="GeneID:887408" /translation="MASFAQWVSGARPRTLPNAIAPVVAGTGAAAWLHAAVWWKALLA LAVAVALVIGVNYANDYSDGIRGTDDDRVGPVRLVGSRLATPRSVLTAAMTSLALGAL AGLVLALLSAPWLIAVGAICIAGAWLYTGGSKPYGYAGFGELAVFVFFGPVAVLGTQY TQALRVDWVGLAQAVATGALSCSVLVANNLRDIPTDARADKITLAVRLGDARTRMLYQ GLLAVAGVLTFVLMLATPWCVVGLVAAPLALRAAGPVRSGRGGRELIPVLRDTGLAML VWALAVAGALAFGQLS" gene 626457..627251 /gene="pnp" /locus_tag="Rv0535" /db_xref="GeneID:887430" CDS 626457..627251 /gene="pnp" /locus_tag="Rv0535" /EC_number="2.4.2.28" /function="PHOSPHORYLATES 5'-methylthioadenosin [CATALYTIC ACTIVITY: 5'-methylthioadenosine + phosphate = adenine + 5-methylthio-D-ribose 1-phosphate]." /note="Catalyzes the reversible phosphorolysis of 5'-deoxy-5'- methylthioadenosine (MTA) to adenine and 5-methylthio-D-ribose-1- phosphate" /codon_start=1 /transl_table=11 /product="5'-methylthioadenosine phosphorylase" /protein_id="NP_215049.1" /db_xref="GI:15607675" /db_xref="GeneID:887430" /translation="MHNNGRMLGVIGGSGFYTFFGSDTRTVNSDTPYGQPSAPITIGT IGVHDVAFLPRHGAHHQYSAHAVPYRANMWALRALGVRRVFGPCAVGSLDPELEPGAV VVPDQLVDRTSGRADTYFDFGGVHAAFADPYCPTLRAAVTGLPGVVDGGTMVVIQGPR FSTRAESQWFAAAGCNLVNMTGYPEAVLARELELCYAAIALVTDVDAGVAAGDGVKAA DVFAAFGENIELLKRLVRAAIDRVADERTCTHCQHHAGVPLPFELP" gene 627248..628288 /gene="galE3" /locus_tag="Rv0536" /db_xref="GeneID:887457" CDS 627248..628288 /gene="galE3" /locus_tag="Rv0536" /EC_number="5.1.3.2" /function="POSSIBLY INVOLVED IN GALACTOSE METABOLISM [CATALYTIC ACTIVITY: UDP-GLUCOSE = UDP-GALACTOSE]." /note="Rv0536, (MTCY25D10.15), len: 346 aa. Possible galE3, UDP-glucose 4-epimerase (EC 5.1.3.2), highly similar to CAB76986.1|AL159178 putative epimerase from Streptomyces coelicolor (334 aa); and similar to other epimerases e.g. NP_436775.1|NC_003078 putative NDP-glucose dehydrataseepimerase protein from Sinorhizobium meliloti (368 aa); AF143772|AF143772_7 GepiA from Mycobacterium avium strain 2151 (353 aa), FASTA scores: opt: 577, E(): 3.9e-29, (36.6% identity in 352 aa overlap); GALE_METJA|Q57664 putative UDP-glucose 4-epimerase (305 aa), FASTA scores: opt: 300, E(): 1.6e-12, (30.9% identity in 343 aa overlap); etc. Also similar to Mycobacterium tuberculosis proteins e.g. Rv3634c, Rv3784, etc. SEEMS TO BELONG TO THE SUGAR EPIMERASE FAMILY. Note that previously known as galE2.; galE2" /codon_start=1 /transl_table=11 /product="UDP-glucose 4-epimerase" /protein_id="YP_177737.1" /db_xref="GI:57116753" /db_xref="GeneID:887457" /translation="MRVLLTGAAGFIGSRVDAALRAAGHDVVGVDALLPAAHGPNPVL PPGCQRVDVRDASALAPLLAGVDLVCHQAAMVGAGVNAADAPAYGGHNDFATTVLLAQ MFAAGVRRLVLASSMVVYGQGRYDCPQHGPVDPLPRRRADLDNGVFEHRCPGCGEPVI WQLVDEDAPLRPRSLYAASKTAQEHYALAWSEASGGSVVALRYHNVYGPGMPRDTPYS GVAAIFRSAVEKGKPPKVFEDGGQMRDFVHVDDVAAANLAAVHLGEADRDGFTAVNVC SGRPISILQVATAICDARGGSMSPAITGHYRSGDVRHIVADPARAARVLGFRAAVDPG EGLREFAFAPLR" gene complement(628298..629731) /locus_tag="Rv0537c" /db_xref="GeneID:887465" CDS complement(628298..629731) /locus_tag="Rv0537c" /function="UNKNOWN" /note="Rv0537c, (MTCY25D10.16c), len: 477 aa. Probable integral membrane protein, showing weak similarity to YDNK_STRCO|P40180 hypothetical 41.2 kDa protein from Streptomyces coelicolor (411 aa), FASTA scores: opt: 122, E(): 0.85, (28.2% identity in 373 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215051.1" /db_xref="GI:15607677" /db_xref="GeneID:887465" /translation="MGLSSDDTRRREVVRDLAAGALLIGALFFPWNLYFGFRIPDSSK TVFGLLLAVTSLSLASLAVTFAGRRSQLRLGLNVPYLLLVLAFVVFDAIQTIRLGGTV HVPGGVGPGGWLGITGALLSAQPALTGATTDEGSHSRWLRATQFLGYASMLGAALSTG FNLSWRVRYALEPAAGASGFGKQNLAVIDTAVVYGVVALAAVLVASRWLLRPTAAEAL STVALGGSTLIAGSIVWSLPIGREIDAFHGIAQNTSTAGVGYEGYLVWAAAAAMCAPL TLFRSPNAPPIDKTVWRAASRNGLLLIAVWCLGSVAMRLTDLVVAVLLNYPFSRYDSM ALAAFDLATAVLAIWLRFNMATEALPARLISSLCGLLCTFTVSRVIVGVVLAPRFQAS SGGSAHPVYGNDLAQQITSTFDVVLCGLALSILAAAIVIGRLRQLPQPPHTPALSRPA GSPRIFRSAGSTHPVRPKIYRPPDHSS" gene 630040..631686 /locus_tag="Rv0538" /db_xref="GeneID:887473" CDS 630040..631686 /locus_tag="Rv0538" /function="UNKNOWN" /note="Rv0538, (MTCY25D10.17), len: 548 aa. Possible conserved membrane protein. Middle region highly similar to AAB63811.1|AF009829|MBE4863a|O32850 unknown protein from Mycobacterium bovis (295 aa) possible transmembrane protein with a repetitive proline, threonine-rich region at C-terminus." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215052.1" /db_xref="GI:15607678" /db_xref="GeneID:887473" /translation="MDVALGVAVTDRVARLALVDSAAPGTVIDQFVLDVAEHPVEVLT ETVVGTDRSLAGENHRLVATRLCWPDQAKADELQHALQDSGVHDVAVISEAQAATALV GAAHAGSAVLLVGDETATLSVVGDPDAPPTMVAVAPVAGADATSTVDTLMARLGDQAL APGDVFLVGRSAEHTTVLADQLRAASTMRVQTPDDPTFALARGAAMAAGAATMAHPAL VADATTSLPRAEAGQSGSEGEQLAYSQASDYELLPVDEYEEHDEYGAAADRSAPLSRR SLLIGNAVVAFAVIGFASLAVAVAVTIRPTAASKPVEGHQNAQPGKFMPLLPTQQQAP VPPPPPDDPTAGFQGGTIPAVQNVVPRPGTSPGVGGTPASPAPEAPAVPGVVPAPVPI PVPIIIPPFPGWQPGMPTIPTAPPTTPVTTSATTPPTTPPTTPVTTPPTTPPTTPVTT PPTTPPTTPVTTPPTTVAPTTVAPTTVAPTTVAPTTVAPATATPTTVAPQPTQQPTQQ PTQQMPTQQQTVAPQTVAPAPQPPSGGRNGSGGGDLFGGF" gene 631743..632375 /locus_tag="Rv0539" /db_xref="GeneID:887476" CDS 631743..632375 /locus_tag="Rv0539" /EC_number="2.4.1.-" /function="SUBSTRATE (SUGAR) UNKNOWN [CATALYTIC ACTIVITY: NDP-sugar + dolichyl phosphate = NDP + dolichyl sugar phosphate]." /note="Rv0539, (MTCY25D10.18), len: 210 aa. Probable dolichol-P-sugar synthase (EC 2.4.1.-), highly similar to CAB76989.1|AL159178 putative glycosyltransferase from Streptomyces coelicolor (242 aa), and similar to various dolichol-P-sugar synthetases and sugar transferases e.g. NP_126257.1|NC_000868 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE RELATED PROTEIN from Pyrococcus abyssi (211 aa); N-terminus of NP_127133.1|NC_000868 DOLICHOL-P-GLUCOSE SYNTHETASE from Pyrococcus abyssi (378 aa); N-terminus of NP_068880.1|NC_000917 putative dolichol-P-glucose synthetase from Archaeoglobus fulgidus (369 aa), FASTA scores: E(): 2.4e-13, (32. 1% identity in 193 aa overlap); Q26732 DOLICHYL-PHOSPHATE-MANNOSE SYNTHASE PRECURSOR from TRYPANOSOMA BRUCEI (267 aa), FASTA scores: opt: 179, E(): 0.0011, (30.7% identity in 205 aa overlap); etc. Also similar to Rv2051c|MTY25D10_18 from Mycobacterium tuberculosis. Contains S00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="dolichyl-phosphate sugar synthase" /protein_id="NP_215053.1" /db_xref="GI:15607679" /db_xref="GeneID:887476" /translation="MLPCLNEEESLPAVLAAIPAGYRALVVDNNSTDDTATVAARHGA QVVVEPRPGYGSAVHAGVLAATTPIVAVIDADGSMDAGDLPKLVAELDKGADLVTGRR RPVAGLHWPWVARVGTVVMSWRLRTRHRLPVHDIAPMRVARREALLDLGVVDRRSGYP LELLVRAAAAGWRVVELDVSYGPRTGGKSKVSGSLRGSIIAILDFWKVIS" misc_feature 632286..632309 /locus_tag="Rv0539" /note="PS00017 ATP/GTP-binding site motif A" gene 632372..633034 /locus_tag="Rv0540" /db_xref="GeneID:887489" CDS 632372..633034 /locus_tag="Rv0540" /function="UNKNOWN" /note="Rv0540, (MTCY25D10.19), len: 220 aa. Conserved hypothetical protein, similar to hypothetical proteins from Streptomyces coelicolor: CAB76990.1|AL159178 (213 aa); N-terminus of BAA84086.1|AB032065 (446 aa); and CAB61872.1|AL133252|SCE46_21 (210 aa), FASTA scores: opt: 267, E(): 5.3e-10, (32.7% identity in 202 aa overlap). Also some similarity with D90913_63|PCC6803 from Synecho cystis sp (211 aa), FASTA scores: opt: 189, E(): 4.7e-06, (25.3 identity in 194 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215054.1" /db_xref="GI:15607680" /db_xref="GeneID:887489" /translation="MSCLPVSVLVVAKAPEPGRVKTRLAAAIGDKVAADIAAAALLDT LDAVAAAPVTARAVALTGDLDSAADSAEIRRRLKSFTVFRQRGDAFADRLANAHVDAA DGYPVLQIGMDTPQVTAELLADCARLLLQIPAVLGLAFDGGWWVLGIRTPTAAECLRA VPMSQPDTGELTLKALRDNGIDVTLVQRLGDFDIVDDIALVRDCCAPGSRFAQATRAA GL" gene complement(633055..634404) /locus_tag="Rv0541c" /db_xref="GeneID:887416" CDS complement(633055..634404) /locus_tag="Rv0541c" /function="UNKNOWN" /note="Rv0541c, (MTCY25D10.20c), len: 449 aa. Probable conserved integral membrane protein, highly similar (except first 40 residues) to CAB76994.1|AL159178 putative integral membrane protein from Streptomyces coelicolor (456 aa). Also some similarity to Q13724|GCS1_HUMAN MANNOSYL-OLIGOSACCHARIDE GLUCOSIDASE (834 aa), FASTA scores: opt: 150, E(): 0.013, (27.1% identity in 339 aa overlap). Contains PS00041 Bacterial regulatory proteins, araC family signature." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215055.1" /db_xref="GI:15607681" /db_xref="GeneID:887416" /translation="MRIGRREGLAVAIGFVLVGAAFVLPRLNLGIKPRSDIGLERFAT RAGAAPIFGYWDAHVGWGTAPAVLTAVAVVAWGPVVAHRLPWRVLTLSTWATAAAWAF SLAMIDGWQRGFAGRLTTRDEYLWQVPGIADIPATLRTFTSRILDFQPNSWVTHVSGH PPGALLTFVWLDRIGLRGGGWAGLVCLLVGSSAAAAVLIAVRVLASEQMARRTAPFVA VAPTAIWIAVSADGYFAGVAAWGIALLAVAVHGATRFPALVAAGAGLLLGWGVFLNYG LVLIVLPGMAVLAAADWRPVLRALGPAVLAALVVAVSFAVAGFSWFDGYTLVQQRYWQ GIAKDRPFGYWSWANLACVVCAIGLGSVAGLSRVFDRAAISRRSGCHLLLLAVLAAIA LADLSMLSKAETERIWLPFTIWLTAAPALLPPRSHRLWLAVNAAGALLLNSIIFTNW" misc_feature complement(633667..633801) /locus_tag="Rv0541c" /note="PS00041 Bacterial regulatory proteins, araC family signature" gene complement(634416..635504) /gene="menE" /locus_tag="Rv0542c" /db_xref="GeneID:887507" CDS complement(634416..635504) /gene="menE" /locus_tag="Rv0542c" /EC_number="6.2.1.26" /function="INVOLVED IN MENAQUINONE BIOSYNTHESIS. O-SUCCINYLBENZOIC ACID (OSB) TO O-SUCCINYLBENZOYL-COA (OSB-COA) [CATALYTIC ACTIVITY: ATP + O-succinylbenzoate + CoA = AMP + diphosphate + O-succinylbenzoyl-CoA]." /note="Rv0542c, (MTCY25D10.21c), len: 362 aa. Possible menE, O-succinylbenzoic acid-CoA ligase (EC 6.2.1.26), highly similar to Q50170|AAA63145.1|U15187|XCLB 4-Coumarate--CoA ligase from Mycobacterium leprae (352 aa), FASTA scores: opt: 1815, E(): 0, (78.9% identity in 351 aa overlap). Also similar to N-terminus of acid-CoA ligases e.g. NP_471116.1|NC_003212 O-succinylbenzoic acid-CoA ligase from Listeria innocua (469 aa); NP_390957.1|NC_000964 O-succinylbenzoic acid-CoA ligase from Bacillus subtilis (486 aa); MENE_HAEIN|P44565 O-succinylbenzoic acid-CoA ligase from Haemophilus influenzae (452 aa), FASTA scores: opt: 307, E(): 4.6e-12, (25.4% identity in 339 aa overlap); etc. Also some similarity with fadD proteins from Mycobacterium tuberculosis. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY." /codon_start=1 /transl_table=11 /product="O-succinylbenzoic acid--CoA ligase" /protein_id="NP_215056.1" /db_xref="GI:15607682" /db_xref="GeneID:887507" /translation="MLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTG PPKGAMLTAAALTASASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVELN VSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTDPAATAALAELDAVLIGGGPAP RPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLRVLAGGRIAIGGATLAKGY RNPVSPDPFAEPGWFHTDDLGALESGDSGVLTVLGRADEAISTGGFTVLPQPVEAALG THPAVRDCAVFGLADDRLGQRVVAAIVVGDGCPPPTLEALRAHVARTLDVTAAPRELH VVNVLPRRGIGKVDRAALVRRFAGEADQ" misc_feature complement(635364..635399) /gene="menE" /locus_tag="Rv0542c" /note="PS00455 Putative AMP-binding domain signature" gene complement(635573..635875) /locus_tag="Rv0543c" /db_xref="GeneID:887494" CDS complement(635573..635875) /locus_tag="Rv0543c" /function="UNKNOWN" /note="Rv0543c, (MTCY25D10.22c), len: 100 aa. Conserved hypothetical protein, equivalent to Q50171|MLU15187_32|NP_302469.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (100 aa), FASTA scores: opt: 493, E(): 6.1e-30, (73.5% identity in 98 aa overlap). Some similarity to Rv3046c|NP_217562.1 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215057.1" /db_xref="GI:15607683" /db_xref="GeneID:887494" /translation="MNRFLTSIVAWLRAGYPEGIPPTDSFAVLALLCRRLSHDEVKAV ANELMRLGDFDQIDIGVVITHFTDELPSPEDVERVRARLAAQGWPLDDVRDREEHA" gene complement(635935..636213) /locus_tag="Rv0544c" /db_xref="GeneID:887511" CDS complement(635935..636213) /locus_tag="Rv0544c" /function="UNKNOWN" /note="Rv0544c, (MTCY25D10.23c), len: 92 aa. Possible conserved transmembrane protein, equivalent to NP_302470.1|NC_002677 possible membrane protein from Mycobacterium leprae (96 aa); and shows some similarity to MLU15187_33|Q50172|U296V from Mycobacterium leprae (36 aa), FASTA scores: opt: 151, E(): 2.1e-05, (71.4% identity in 35 aa overlap). Also some similarity with VATL_NEPNO|Q26250 vacuolar ATP synthase 16 kDa proteolipid from Nephrops norvegicus (159 aa), FASTA scores: opt: 80, E(): 11, (26.1% identity in 88 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215058.1" /db_xref="GI:15607684" /db_xref="GeneID:887511" /translation="MSAWFNYTATLKILIFSLLAGALLPGLFAVGVRLQAAGDGADAT ARRRPLLVAVSWAIFALVLAVVIIGVLYIARDFIAHHTGWAFLGATPK" gene complement(636210..637463) /gene="pitA" /locus_tag="Rv0545c" /db_xref="GeneID:887517" CDS complement(636210..637463) /gene="pitA" /locus_tag="Rv0545c" /function="INVOLVED IN LOW-AFFINITY INORGANIC PHOSPHATE TRANSPORT ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0545c, (MTCY25D10.24c), len: 417 aa. Probable pitA, low-affinity inorganic phosphate transporter, integral membrane protein, equivalent to Q50173|NP_302471.1 pitA from Mycobacterium leprae (414 aa), FASTA scores: opt: 2035, E(): 0, (76.3% identity in 418 aa overlap). Also highly similar to others e.g. CAB59461.1|AL132644 putative low-affinity phosphate transport protein from Streptomyces coelicolor (423 aa); PITA_ECOLI|P37308 low-affinity inorganic phosphate transporter from Escherichia coli (499 aa), FASTA scores: opt: 304, E(): 6.9e-10, (32.5 % identity in 234 aa overlap); etc. BELONGS TO THE PHO-4 FAMILY OF TRANSPORTERS, PIT SUBFAMILY." /codon_start=1 /transl_table=11 /product="inorganic phosphate transporter" /protein_id="NP_215059.1" /db_xref="GI:15607685" /db_xref="GeneID:887517" /translation="MNLQLFLLLIVVVTALAFDFTNGFHDTGNAMATSIASGALAPRV AVALPAVLNLIGAFLSTAVAATIAKGLIDANLVTLELVFAGLVGGIVWNLLTWLLGIP SSSSHALIGGIVGATIAAVGLRGVIWSGVVSKVIVPAVVAALLATLVGAVGTWLVYRT TRGVAEKRTERGFRRGQIGSASLVSLAHGTNDAQKTMGVIFLALMSYGAVSTTASVPP LWVIVSCAVAMAAGTYLGGWRIIRTLGKGLVEIKPPQGMAAESSSAAVILLSAHFGYA LSTTQVATGSVLGSGVGKPGAEVRWGVAGRMVVAWLVTLPLAGLVGAFTYGLVHFIGG YPGAILGFALLWLTATAIWLRSRRAPIDHTNVNADWEGNLTAGLEAGAQPLADQRPPV PAPPAPTPPPNHRAPQFGVTTRNAP" gene complement(637583..637969) /locus_tag="Rv0546c" /db_xref="GeneID:887509" CDS complement(637583..637969) /locus_tag="Rv0546c" /function="UNKNOWN" /note="Rv0546c, (MTCY25D10.25c), len: 128 aa. Conserved hypothetical protein, equivalent to AAA63111.1|U15187|Q50174|U296X hypothetical protein from Mycobacterium leprae (144 aa), FASTA scores: opt: 748, E(): 0, (84.2% identity in 133 aa overlap). Also highly similar to CAB95979.1|AL360034 conserved hypothetical protein from Streptomyces coelicolor (130 aa); and similar to AE000854_8|O26852 S-D-LACTOYLGLUTATHIONE METHYLGLYOXAL LYASE from Methanobacterium thermoautotropto (116 aa), FASTA scores: opt: 155, E(): 0.00019, (30.6% identity in 108 aa overlap); YAER_ECOLI hypothetical 14.7 kDa protein from Escherichia coli (129 aa), FASTA scores: opt: 104, E(): 0.42, (28.7% identity in 115 aa overlap). Also similar to Rv2068c from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215060.1" /db_xref="GI:15607686" /db_xref="GeneID:887509" /translation="MEILASRMLLRPADYQRSLSFYRDQIGLAIAREYGAGTVFFAGQ SLLELAGYGEPDHSRGPFPGALWLQVRDLEATQTELVSRGVSIAREPRREPWGLHEMH VTDPDGITLIFVEVPEGHPLRTDTRA" gene complement(638032..638916) /locus_tag="Rv0547c" /db_xref="GeneID:887527" CDS complement(638032..638916) /locus_tag="Rv0547c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0547c, (MTCY25D10.26c), len: 294 aa. Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases e.g. fatty acyl-CoA reductase from Acinetobacter calcoaceticus (295 aa); NP_280196.1|NC_002607 3-oxoacyl-[acyl-carrier-protein] reductase from Halobacterium sp. NRC-1 (255 aa); NP_349214.1|NC_003030 Short-chain alcohol dehydrogenase family protein from Clostridium acetobutylicum (255 aa); etc. Also similar to several proteins from Mycobacterium tuberculosis e.g. Y04M_MYCTU|Q10783 putative oxidoreductase (341 aa), FASTA scores: opt: 644, E(): 0, (46.1% identity in 258 aa overlap)." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_215061.1" /db_xref="GI:15607687" /db_xref="GeneID:887527" /translation="MSKRPLRWLTEQITLAGMRPPISPQLLINRPAMQPVDLTGKRIL LTGASSGIGAAATKQFGLHRAVVVAVARRKDLLDAVADRITGDGGTAMSLPCDLSDME AIDALVEDVEKRIGGIDILINNAGRSIRRPLAESLERWHDVERTMVLNYYAPLRLIRG LAPGMLERGDGHIINVATWGVLSEASPLFSVYNASKAALSAVSRIIETEWGSQGVHST TLYYPLVATPMIAPTKAYDGLPALTAAEAAEWMVTAARTRPVRIAPRVAVAVNALDSI GPRWVNALMQRRNEQLNP" gene complement(639012..639956) /gene="menB" /locus_tag="Rv0548c" /db_xref="GeneID:887529" CDS complement(639012..639956) /gene="menB" /locus_tag="Rv0548c" /EC_number="4.1.3.36" /function="INVOLVED IN MENAQUINONE BIOSYNTHESIS. CONVERT O-SUCCINYLBENZOYL-CoA (OSB-COA) TO 1,4-DIHYDROXY-2-NAPHTHOIC ACID (DHNA) [CATALYTIC ACTIVITY: O-succinylbenzoyl-CoA = 1,4-dihydroxy-2-naphthoate + CoA]." /note="catalyzes the formation of 1,4-dihydroxy-2-naphthoate from O-succinylbenzoyl-CoA" /codon_start=1 /transl_table=11 /product="naphthoate synthase" /protein_id="NP_215062.1" /db_xref="GI:15607688" /db_xref="GeneID:887529" /translation="MVAPAGEQGRSSTALSDNPFDAKAWRLVDGFDDLTDITYHRHVD DATVRVAFNRPEVRNAFRPHTVDELYRVLDHARMSPDVGVVLLTGNGPSPKDGGWAFC SGGDQRIRGRSGYQYASGDTADTVDVARAGRLHILEVQRLIRFMPKVVICLVNGWAAG GGHSLHVVCDLTLASREYARFKQTDADVGSFDGGYGSAYLARQVGQKFAREIFFLGRT YTAEQMHQMGAVNAVAEHAELETVGLQWAAEINAKSPQAQRMLKFAFNLLDDGLVGQQ LFAGEATRLAYMTDEAVEGRDAFLQKRPPDWSPFPRYF" gene complement(640228..640641) /locus_tag="Rv0549c" /db_xref="GeneID:887534" CDS complement(640228..640641) /locus_tag="Rv0549c" /function="UNKNOWN" /note="Rv0549c, (MTCY25D10.28c), len: 137 aa. Conserved hypothetical protein, similar to Rv0960, Rv0065, and Rv1720c from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215063.1" /db_xref="GI:15607689" /db_xref="GeneID:887534" /translation="MRASPTSPPEQVVVDASAMVDLLARTSDRCSAVRARLARTAMHA PAHFDAEVLSALGRMQRAGALTVAYVDAALEELRQVPVTRHGLSSLLAGAWSRRDTLR LTDALYVELAETAGLVLLTTDERLARAWPSAHAIG" gene complement(640638..640904) /locus_tag="Rv0550c" /db_xref="GeneID:887515" CDS complement(640638..640904) /locus_tag="Rv0550c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0550c, (MTCY25D10.29c), len: 88 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215064.1" /db_xref="GI:15607690" /db_xref="GeneID:887515" /translation="MLSRRTKTIVVCTLVCMARLNVYVPDELAERARARGLNVSALTQ AAISAELENSATDAWLEGLEPRSTGARHDDVLGAIDAARDEFEA" gene complement(641096..642811) /gene="fadD8" /locus_tag="Rv0551c" /db_xref="GeneID:887526" CDS complement(641096..642811) /gene="fadD8" /locus_tag="Rv0551c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_215065.1" /db_xref="GI:15607691" /db_xref="GeneID:887526" /translation="MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELL RSPTHNGHLLVGALKRHQNKPVLFLGDTRLTGGQLADRISQYIQAFEALGAGTGVAVG LLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLIIDPNPMFVER ALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQPQPLVAADLPPDQVIGLTY TGGTTGKPKGVIGTAQSIATMTSIQLAEWEWPANPRFLMCTPLSHAGAAFFTPTVIKG GEMIVLAKFDPAEVLRIIEEQRITATMLVPSMLYALLDHPDSHTRDLSSLETVYYGAS AINPVRLAEAIRRFGPIFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVAL LDEHGKPVKQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYY IVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVVGAPDEKWGEAVTAVVVLRSNA ARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGLGKPDKKAVRARFWEGA GRAVG" misc_feature complement(642131..642166) /gene="fadD8" /locus_tag="Rv0551c" /note="PS00455 Putative AMP-binding domain signature" repeat_region complement(642754..642811) /note="58 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene 642889..644493 /locus_tag="Rv0552" /db_xref="GeneID:887537" CDS 642889..644493 /locus_tag="Rv0552" /function="UNKNOWN" /note="Rv0552, (MTCY25D10.31), len: 534 aa. Conserved hypothetical protein, similar to others from several organisms. Also shows some similarity with regulatory proteins e.g. AEPA_ERWCA|Q06555 exoenzymes regulatory protein aepA [Precursor] from Erwinia carotovora (465 aa), FASTA scores: opt: 278, E(): 7.6e-11, (23.0% identity in 408 aa overlap). Also similar to Z99119|BSUB0016_28 from Bacillus subtilis (529 aa), FASTA scores: opt: 436, E(): 8.3e-20, (23.8% identity in 547 aa overlap). C-terminus is similar to MLRRNOPR_1 HYPOTHETICAL 17.7 kDa PROTEIN from Mycobacterium leprae (154 aa), FASTA score: (43.1% identity in 160 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215066.1" /db_xref="GI:15607692" /db_xref="GeneID:887537" /translation="MADADLVMTGTVLTVDDARPTAEAIAVADGRVIAVGDRSEVAGL VGANTRVIDLGAGCVMPGFVEAHGHPLLEAVVLSDRFVDIRPVTMRDADDVVAAIRGE VARRGPAGAYLVGWDPLLQSGLGEPTLTWLDSLAPNGPLVIIHNSGHKAYFNSHAAWL NGLTRDTADPKGAKYGRDGNGELDGTAEEIGAILPLLAGVADPSNFGAMLRAECARLN RAGLTTCSEMAFDPGYRPMVEAVRAELTVRLCTYEISNARMCTDATPGQGDDMLRQVG IKIWVDGSPWVGNIDLTFPYLDTPATRAIGVPPGSRGCANYTREQLAEIVGAYFPRGW QIACHVHGDGGVDTILDVYEEALRRNPRDDHRLRLEHVGAIRPDQLRRAAELGVTCSI FVDQIHYWGDVIVDDLFGAQRGSRWMPAGSAVAAGMRISLHNDPPVTPEEPLRNISVA ATRVAPSGRVLAPEERLTVEQAIRAQTIDAAWQLFAEDAIGSLQVGKYADMVVLSADP RTVPPEQIADLAVRATFLAGRQVYRR" gene 644490..645470 /gene="menC" /locus_tag="Rv0553" /db_xref="GeneID:887544" CDS 644490..645470 /gene="menC" /locus_tag="Rv0553" /EC_number="5.5.1.1" /function="POSSIBLY INVOLVED IN MENAQUINONE BIOSYNTHESIS. CATALYZES A SYN CYCLOISOMERIZATION [CATALYTIC ACTIVITY: 2,5-dihydro-5-oxofuran-2-acetate = cis,cis-hexadienedioate]." /note="catalyzes the dehydration of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylic acid to form O-succinylbenzoate" /codon_start=1 /transl_table=11 /product="O-succinylbenzoate synthase" /protein_id="NP_215067.1" /db_xref="GI:15607693" /db_xref="GeneID:887544" /translation="MIPVLPPLEALLDRLYVVALPMRVRFRGITTREVALIEGPAGWG EFGAFVEYQSAQACAWLASAIETAYCAPPPVRRDRVPINATVPAVAAAQVGEVLARFP GARTAKVKVAEPGQSLADDIERVNAVRELVPMVRVDANGGWGVAEAVAAAAALTADGP LEYLEQPCATVAELAELRRRVDVPIAADESIRKAEDPLAVVRAQAADIAVLKVAPLGG ISALLDIAARIAVPVVVSSALDSAVGIAAGLTAAAALPELDHACGLGTGGLFEEDVAE PAAPVDGFLAVARTTPDPARLQALGAPPQRRQWWIDRVKACYSLLVPSFG" gene 645467..646255 /gene="bpoC" /locus_tag="Rv0554" /db_xref="GeneID:887535" CDS 645467..646255 /gene="bpoC" /locus_tag="Rv0554" /EC_number="1.11.1.-" /function="SUPPOSED INVOLVED IN DETOXIFICATION REACTIONS." /note="Rv0554, (MTCY25D10.33), len: 262 aa. Possible bpoC, peroxidase (non-haem peroxidase) (EC 1.11.1.-), equivalent to NP_302477.1|NC_002677 putative hydrolase from Mycobacterium leprae (265 aa). Also highly similar or similar to various hydrolases and peroxidases e.g. CAB38877.1|AL035707|T36181 probable hydrolase from Streptomyces coelicolor (272 aa); CAC48368.1|Y16952 putative hydrolase from Amycolatopsis mediterranei (284 aa); P29715|BPA2_STRAU non-haem bromoperoxidase bpo-a2 (bromide peroxidase) (EC 1.11.1.-) from Streptomyces aureofaciens (277 aa), FASTA scores: opt: 325, E(): 2.3e-15, (29.5% identity in 268 aa overlap); O31168|PRXC_STRAU|CPO|CPOT non-heme chloroperoxidase (chloride peroxidase) (EC 1.11.1.10) from Streptomyces aureofaciens (278 aa); etc. Also similar to M. tuberculosis non-heme haloperoxidases and epoxide hydrolases e.g. Rv1938, Rv3617, etc." /codon_start=1 /transl_table=11 /product="peroxidase BpoC" /protein_id="NP_215068.1" /db_xref="GI:15607694" /db_xref="GeneID:887535" /translation="MINLAYDDNGTGDPVVFIAGRGGAGRTWHPHQVPAFLAAGYRCI TFDNRGIGATENAEGFTTQTMVADTAALIETLDIAPARVVGVSMGAFIAQELMVVAPE LVSSAVLMATRGRLDRARQFFNKAEAELYDSGVQLPPTYDARARLLENFSRKTLNDDV AVGDWIAMFSMWPIKSTPGLRCQLDCAPQTNRLPAYRNIAAPVLVIGFADDVVTPPYL GREVADALPNGRYLQIPDAGHLGFFERPEAVNTAMLKFFASVKA" gene 646298..647962 /gene="menD" /locus_tag="Rv0555" /db_xref="GeneID:887554" CDS 646298..647962 /gene="menD" /locus_tag="Rv0555" /function="INVOLVED IN MENAQUINONE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY 1: ISOCHORISMATE + 2-KETOGLUTARATE = 2-SUCCINYL-6-HYDROXY-2,4-CYCLOHEXADIENE-1-CARBOXYLATE + PYRUVATE + CO(2)] [CATALYTIC ACTIVITY 2: 2-OXOGLUTARATE = SUCCINATE SEMIALDEHYDE + CO(2)]." /note="SEPHCHC synthase; forms 5-enolpyruvoyl-6-hydroxy-2-succinyl-cyclohex-3-ene-1- carboxylate from 2-oxoglutarate and isochorismate in menaquinone biosynthesis" /codon_start=1 /transl_table=11 /product="2-succinyl-5-enolpyruvyl-6-hydroxy-3- cyclohexene-1-carboxylate synthase" /protein_id="NP_215069.1" /db_xref="GI:15607695" /db_xref="GeneID:887554" /translation="MNPSTTQARVVVDELIRGGVRDVVLCPGSRNAPLAFALQDADRS GRIRLHVRIDERTAGYLAIGLAIGAGAPVCVAMTSGTAVANLGPAVVEANYARVPLIV LSANRPYELLGTGANQTMEQLGYFGTQVRASISLGLAEDAPERTSALNATWRSATCRV LAAATGARTANAGPVHFDIPLREPLVPDPEPLGAVTPPGRPAGKPWTYTPPVTFDQPL DIDLSVDTVVISGHGAGVHPNLAALPTVAEPTAPRSGDNPLHPLALPLLRPQQVIMLG RPTLHRPVSVLLADAEVPVFALTTGPRWPDVSGNSQATGTRAVTTGAPRPAWLDRCAA MNRHAIAAVREQLAAHPLTTGLHVAAAVSHALRPGDQLVLGASNPVRDVALAGLDTRG IRVRSNRGVAGIDGTVSTAIGAALAYEGAHERTGSPDSPPRTIALIGDLTFVHDSSGL LIGPTEPIPRSLTIVVSNDNGGGIFELLEQGDPRFSDVSSRIFGTPHDVDVGALCRAY HVESRQIEVDELGPTLDQPGAGMRVLEVKADRSSLRQLHAAIKAAL" gene 647959..648474 /locus_tag="Rv0556" /db_xref="GeneID:887551" CDS 647959..648474 /locus_tag="Rv0556" /function="UNKNOWN" /note="Rv0556, (MTCY25D10.35), len: 171 aa. Probable conserved transmembrane protein, equivalent to NP_302479.1|NC_002677 putative membrane protein from Mycobacterium leprae (175 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215070.1" /db_xref="GI:15607696" /db_xref="GeneID:887551" /translation="MISPKPLLHILIHGLSDELPDTRGRIVLRWLRIAVLIVTGLVTL QSVLLVAGAWRNDIAIQRNMGVAQAEVLSAGPRRSTIEFVTPDRITYRPQLGVLYPSE LSTGMRIYVEYNKRDPNLVRVQHRNAGLAIIPAGSIAVVAWLIAAAALVVLAVLDKRL ERRENSASATG" gene 648536..649672 /gene="pimB" /locus_tag="Rv0557" /db_xref="GeneID:887609" CDS 648536..649672 /gene="pimB" /locus_tag="Rv0557" /EC_number="2.4.1.-" /function="INVOLVED IN LIPOARABINOMANNAN (LAM) BIOSYNTHESIS." /experiment="experimental evidence, no additional details recorded" /note="Rv0557, (MTCY25D10.36), len: 378 aa. pimB (alternate gene name: mtfB), mannosyltransferase (EC 2.4.1.-) (see citation below), similar to other various transferases e.g. NP_243554.1|NC_002570 alpha-D-mannose-alpha(1-6)phosphatidyl myo-inositol monomannoside transferase from Bacillus halodurans (381 aa); NP_249533.1|NC_002516 probable glycosyl transferase from Pseudomonas aeruginosa (406 aa); NP_419573.1|NC_002696 glycosyl transferase, group 1 family protein, from Caulobacter crescentus (455 aa); etc. Also similar to Q55598 hypothetical 44.9 kDa protein from SYNECHOCYSTIS SP (409 aa), FASTA scores: opt: 703, E(): 0, (33.9% identity in 378 aa overlap); GPI3_YEAST|P32363 n-acetylglucosaminyl-phosphatidylinositol biosynthetic protein (452 aa), FASTA scores: opt: 230, E(): 1.1e-07, (23.5% identity in 328 aa overlap).; mtfB" /codon_start=1 /transl_table=11 /product="mannosyltransferase PIMB" /protein_id="NP_215071.1" /db_xref="GI:15607697" /db_xref="GeneID:887609" /translation="MCGVRVAIVAESFLPQVNGVSNSVVKVLEHLRRTGHEALVIAPD TPPGEDRAERLHDGVRVHRVPSRMFPKVTTLPLGVPTFRMLRALRGFDPDVVHLASPA LLGYGGLHAARRLGVPTVAVYQTDVPGFASSYGIPMTARAAWAWFRHLHRLADRTLAP STATMESLIAQGIPRVHRWARGVDVQRFAPSARNEVLRRRWSPDGKPIVGFVGRLAPE KHVDRLTGLAASGAVRLVIVGDGIDRARLQSAMPTAVFTGARYGKELAEAYASMDVFV HSGEHETFCQVVQEALASGLPVIAPDAGGPRDLITPHRTGLLLPVGEFEHRLPDAVAH LVHERQRYALAARRSVLGRSWPVVCDELLGHYEAVRGRRTTQAA" gene 649689..650393 /gene="ubiE" /locus_tag="Rv0558" /db_xref="GeneID:887591" CDS 649689..650393 /gene="ubiE" /locus_tag="Rv0558" /EC_number="2.1.1.-" /function="INVOLVED IN MENAQUINONE BIOSYNTHESIS (AT THE LAST STEP) CONVERTS DMKH2 INTO MKH2 [CATALYTIC ACTIVITY: S-ADENOSYL-L-METHIONINE + DEMETHYLMENAQUINOL = S-ADENOSYL-L-HOMOCYSTEINE + MENAQUINOL]." /note="Catalyzes the carbon methylation reaction in the biosynthesis of ubiquinone" /codon_start=1 /transl_table=11 /product="ubiquinone/menaquinone biosynthesis methyltransferase" /protein_id="YP_177738.1" /db_xref="GI:57116754" /db_xref="GeneID:887591" /translation="MSRAALDKDPRDVASMFDGVARKYDLTNTVLSLGQDRYWRRATR SALRIGPGQKVLDLAAGTAVSTVELTKSGAWCVAADFSVGMLAAGAARKVPKVAGDAT RLPFGDDVFDAVTISFGLRNVANQQAALREMARVTRPGGRLLVCEFSTPTNALFATAY KEYLMRALPRVARAVSSNPEAYEYLAESIRAWPDQAVLAHQISRAGWSGVRWRNLTGG IVALHAGYKPGKQTPQ" gene complement(650407..650745) /locus_tag="Rv0559c" /db_xref="GeneID:887569" CDS complement(650407..650745) /locus_tag="Rv0559c" /function="UNKNOWN" /note="Rv0559c, (MTCY25D10.38c), len: 112 aa. Possible conserved secreted protein, similar to NP_302481.1|NC_002677 putative secreted protein from Mycobacterium leprae (112 aa). Also similar to Y08B_MYCTU|Q11048 hypothetical 11.6 kDa protein FASTA scores: opt: 111, E(): 011, (25.4% identity in 114 aa overlap). Contains possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215073.1" /db_xref="GI:15607699" /db_xref="GeneID:887569" /translation="MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGP QDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEH VGVLQRAGTR" gene complement(650779..651504) /locus_tag="Rv0560c" /db_xref="GeneID:887637" CDS complement(650779..651504) /locus_tag="Rv0560c" /EC_number="2.1.1.-" /function="POSSIBLY CAUSES METHYLATION" /experiment="experimental evidence, no additional details recorded" /note="Rv0560c, (MTCY25D10.39c), len: 271 aa. Possible benzoquinone methyltransferase (EC 2.1.1.-) (see citation below), similar to other hypothetical proteins and methyltransferases e.g. Q54300 METHYLTRANSFERASE (211 aa), FASTA scores: opt: 203, E(): 4.8e-07, (30.9% identity in 136 aa overlap). Similar to Rv3699, Rv1377c, Rv2675c, etc from Mycobacterium tuberculosis. Rv0560c can be induced by salicylate and para-amino-salicylate (PAS)." /codon_start=1 /transl_table=11 /product="benzoquinone methyltransferase" /protein_id="NP_215074.1" /db_xref="GI:15607700" /db_xref="GeneID:887637" /translation="MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPW SIGEPQPELAALIVQGKFRGDVLDVGCGEAAISLALAERGHTTVGLDLSPAAVELARH EAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYLQSIVRAAAPG ASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIKPARLYARFPAGFAGMPAL LDIREEPNGLQSIGGWLLSAHLG" gene complement(651529..652755) /locus_tag="Rv0561c" /db_xref="GeneID:887638" CDS complement(651529..652755) /locus_tag="Rv0561c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0561c, (MTCY25D10.40c), len: 408 aa. Possible oxidoreductase (EC 1.-.-.-), highly similar (except in first 30 aa) to NP_302482.1|NC_002677 putative FAD-linked oxidoreductase from Mycobacterium leprae (408 aa). Also similar to T34627 probable electron transfer oxidoreductase from Streptomyces coelicolor (430 aa); and some bacteriochlorophyll synthases e.g. NP_069300.1|NC_000917 bacteriochlorophyll synthase from Archaeoglobus fulgidus (410 aa); Q55087 GERANYLGERANYL HYDROGENASE (407 aa), FASTA scores: opt: 208, E(): 1.7e-06, (26.9% identity in 327 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215075.1" /db_xref="GI:15607701" /db_xref="GeneID:887638" /translation="MSVDDSADVVVVGAGPAGSAAAAWAARAGRDVLVIDTATFPRDK PCGDGLTPRAVAELHQLGLGKWLADHIRHRGLRMSGFGGEVEVDWPGPSFPSYGSAVA RLELDDRIRKVAEDTGARMLLGAKAVAVHHDSSRRVVSLTLADGTEVGCRQLIVADGA RSPLGRKLGRRWHRETVYGVAVRGYLSTAYSDDPWLTSHLELRSPDGAVLPGYGWIFP LGNGEVNIGVGALSTSRRPADLALRPLISYYTDLRRDEWGFTGQPRAVSSALLPMGGA VSGVAGSNWMLIGDAAACVNPLNGEGIDYGLETGRLAAELLDSRDLARLWPSLLADRY GRGFSVARRLALLLTFPRFLPTTGPITMRSTALMNIAVRVMSNLVTDDDRDWVARVWR GGGQLSRLVDRRPPFS" gene 652771..653778 /gene="grcC1" /locus_tag="Rv0562" /db_xref="GeneID:887647" CDS 652771..653778 /gene="grcC1" /locus_tag="Rv0562" /EC_number="2.5.1.-" /function="POSSIBLY SUPPLIES POLYPRENYL DIPHOSPHATE." /note="Rv0562, (MTCY25D10.41), len: 335 aa. Probable grcC1, polyprenyl diphosphate synthetase (EC 2.5.1.-), equivalent to NP_302483.1|NC_002677 polyprenyl diphosphate synthase component from Mycobacterium leprae (330 aa). Also similar to others (generally hepta (EC 2.5.1.30) or hexaprenyl) e.g. GRC3_BACSU|P31114 probable heptaprenyl diphosphate syntetase (348 aa), FASTA scores: opt: 599, E(): 4e-31, (33.2% identity in 307 aa overlap); etc. Also highly similar to Mycobacterium tuberculosis proteins Rv0989c|grcC2|NP_215504.1|MTCI237.03c PROBABLE POLYPRENYL-DIPHOSPHATE SYNTHASE (325 aa); Rv3383c, Rv3398c, etc. Contains PS00444 Polyprenyl synthetases signature 2. BELONGS TO THE FPP/GGPP SYNTHETASES FAMILY." /codon_start=1 /transl_table=11 /product="polyprenyl-diphosphate synthase" /protein_id="NP_215076.1" /db_xref="GI:15607702" /db_xref="GeneID:887647" /translation="MRTPATVVAGVDLGDAVFAAAVRAGVARVEQLMDTELRQADEVM SDSLLHLFNAGGKRFRPLFTVLSAQIGPQPDAAAVTVAGAVIEMIHLATLYHDDVMDE AQVRRGAPSANAQWGNNVAILAGDYLLATASRLVARLGPEAVRIIADTFAQLVTGQMR ETRGTSENVDSIEQYLKVVQEKTGSLIGAAGRLGGMFSGATDEQVERLSRLGGVVGTA FQIADDIIDIDSESDESGKLPGTDVREGVHTLPMLYALRESGPDCARLRALLNGPVDD DAEVREALTLLRASPGMARAKDVLAQYAAQARHELALLPDVPGRRALAALVDYTVSRH G" misc_feature 653413..653451 /gene="grcC1" /locus_tag="Rv0562" /note="PS00444 Polyprenyl synthetases signature 2" gene 653879..654739 /gene="htpX" /locus_tag="Rv0563" /db_xref="GeneID:887649" CDS 653879..654739 /gene="htpX" /locus_tag="Rv0563" /EC_number="3.4.24.-" /function="POSSIBLY INVOLVED IN ADAPTATION. HYDROLIZES SPECIFIC PEPTIDES AND/OR PROTEINS." /experiment="experimental evidence, no additional details recorded" /note="putative metalloprotease" /codon_start=1 /transl_table=11 /product="heat shock protein HtpX" /protein_id="NP_215077.1" /db_xref="GI:15607703" /db_xref="GeneID:887649" /translation="MTWHPHANRLKTFLLLVGMSALIVAVGALFGRTALMLAALFAVG MNVYVYFNSDKLALRAMHAQPVSELQAPAMYRIVRELATSAHQPMPRLYISDTAAPNA FATGRNPRNAAVCCTTGILRILNERELRAVLGHELSHVYNRDILISCVAGALAAVITA LANMAMWAGMFGGNRDNANPFALLLVALLGPIAATVIRMAVSRSREYQADESGAVLTG DPLALASALRKISGGVQAAPLPPEPQLASQAHLMIANPFRAGERIGSLFSTHPPIEDR IRRLEAMARG" misc_feature 654272..654301 /gene="htpX" /locus_tag="Rv0563" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(654924..655949) /gene="gpsA" /locus_tag="Rv0564c" /db_xref="GeneID:887651" CDS complement(654924..655949) /gene="gpsA" /locus_tag="Rv0564c" /EC_number="1.1.1.94" /function="Involved in de novo phospholipid biosynthesis; glycerol-3 phosphate formation [CATALYTIC ACTIVITY: Sn-glycerol 3-phosphate + NAD(P)+ = glycerone phosphate + NAD(P)H]." /note="catalyzes the NAD(P)H-dependent reduction of glycerol 3-phosphate to glycerone phosphate" /codon_start=1 /transl_table=11 /product="NAD(P)H-dependent glycerol-3-phosphate dehydrogenase" /protein_id="NP_215078.1" /db_xref="GI:15607704" /db_xref="GeneID:887651" /translation="MAANKREPKVVVLGGGSWGTTVASICARRGPTLQWVRSAVTAQD INDNHRNSRYLGNDVVLSDTLRATTDFTEAANCADVVVMGVPSHGFRGVLVELSKELR PWVPVVSLVKGLEQGTNMRMSQIIEEVLPGHPAGILAGPNIAREVAEGYAAAAVLAMP DQHLATRLSAMFRTRRFRVYTTDDVVGVETAGALKNVFAIAVGMGYSLGIGENTRALV IARALREMTKLGVAMGGKSETFPGLAGLGDLIVTCTSQRSRNRHVGEQLGAGKPIDEI IASMSQVAEGVKAAGVVMEFANEFGLNMPIAREVDAVINHGSTVEQAYRGLIAEVPGH EVHGSGF" misc_feature complement(655239..655262) /gene="gpsA" /locus_tag="Rv0564c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(656010..657470) /locus_tag="Rv0565c" /db_xref="GeneID:887662" CDS complement(656010..657470) /locus_tag="Rv0565c" /EC_number="1.14.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0565c, (MTV039.03c), len: 486 aa. Probable monoxygenase (EC 1.14.-.-), highly similar to NP_301173.1|NC_002677 putative monooxygenase from Mycobacterium leprae (494 aa). Also highly similar to others e.g. NP_421371.1|NC_002696 monooxygenase (flavin-binding family) from Caulobacter crescentus (498 aa); C-terminus of NP_051574.1|NC_000958 arylesterase/monoxygenase from Deinococcus radiodurans (833 aa); P12015|CYMO_ACISP CYCLOHEXANONE MONOOXYGENASE (EC 1.14.13.22) from Acinetobacter sp. (542 aa), FASTA scores: opt: 354, E(): 2.1e-16, (23.7% identity in 435 aa overlap); etc. Also similar to other putative monoxygenases from Mycobacterium tuberculosis e.g. Rv3854c (489 aa), MTCY01A6.14 (489 aa), MTV013_4 (495 aa), MTCY31.20 (495 aa). TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_215079.1" /db_xref="GI:15607705" /db_xref="GeneID:887662" /translation="MSVTPNAGCVDVVIVGAGISGLGAAYRIIERNPQLTYTILERRA RIGGTWDLFRYPGVRSDSSIFTLSFPYEPWTREEGIADGAHIREYLTDMAHKYGIDRH IEFNSYVRAADWDSSTDTWTVTFEQNGVHKHYRSRFVFFGSGYYNYDEGYTPDFGGIE KFGGAVVHPQHWPEDLDYTGKKIVVIGSGATAVTLIPSLTDRAEKVTMLQRSPTYLIS ASKYSTFAAVVRKALPPKTSHLIVRMYNALLEAVFWFLSRKTPVFVKWLLRRTAIKNL PEGYDIETHFTPRYNPWDQRLCLIPDADLYNAITSGRAEVVTDHIDHFDATGIALKSG GHLDADIIVTATGLQLQALGGAAISLDGVEIDPRDRFVYKAHMLEDVPNLFWCVGYTN ASWTLRADMTARATAKLLAHMAAHGHTRAAPHLGDEPMDEKPSWDIQAGYVKRAPYAL PKSGTKRPWNVRQNYLADAIDYRFDRIEEAMVFGAA" gene complement(657548..658039) /locus_tag="Rv0566c" /db_xref="GeneID:887633" CDS complement(657548..658039) /locus_tag="Rv0566c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="putative nucleotide binding property based on structural studies of Haemophilus influenzae crystallized protein in PDB Accession Number 1IN0 and NMR studies of Escherichia coli YajQ; the YajQ protein from Pseudomonas synringae appears to play a role in activation of bateriophage phi6 segment L transcription" /codon_start=1 /transl_table=11 /product="putative nucleotide-binding protein" /protein_id="NP_215080.1" /db_xref="GI:15607706" /db_xref="GeneID:887633" /translation="MADSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTKIAWK GDEAVELTSSTEERVKAAVDVFKEKLIRRDISLKAFEAGEPQASGKTYKVTGALKQGI SSENAKKITKLIRDAGPKNVKTQIQGDEVRVTSKKRDDLQAVIAMLKKADLDVALQFV NYR" gene 658109..658189 /locus_tag="Rvnt05" /note="tRNA-Tyr(GTA)" /db_xref="GeneID:2700426" tRNA 658109..658189 /locus_tag="Rvnt05" /product="tRNA-Tyr" /note="codon recognized: UAC" /anticodon=(pos:658143..658145,aa:Tyr) /db_xref="GeneID:2700426" gene 658321..659340 /locus_tag="Rv0567" /db_xref="GeneID:887667" CDS 658321..659340 /locus_tag="Rv0567" /EC_number="2.1.1.-" /function="CAUSES METHYLATION" /note="Rv0567, (MTV039.05), len: 339 aa. Probable methyltransferase (EC 2.1.1.-), similar to several e.g. P39896|TCMO_STRGA TETRACENOMYCIN POLYKETIDE SYNTHESIS 8-O-METHYLTRANSFERASE from Streptomyces glaucescens (339 aa), FASTA scores: opt: 685, E(): 0, (35.8% identity in 335 aa overlap); P10950|HIOM_BOVIN HYDROXYINDOLE O-METHYLTRANSFERASE (EC 2.1.1.4) from Bos taurus (345 aa), FASTA scores: opt: 509, E(): 3.4e-27, (30.7% identity in 332 aa overlap) etc. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="methyltransferase/methylase" /protein_id="NP_215081.1" /db_xref="GI:15607707" /db_xref="GeneID:887667" /translation="MELSPDRIMAIGGGYGPSKVLLTAVGLGLFTELGDEAMTAEAIA DRLGLLKRPAIDFLDALVSLDLLARDGDGPGSHYRNTPETAHFLDEARPTYAGGLLKI WNERNYRFWADLTEALKTGKAQSEVKQTGRPFFEALYADPRRLEAFMAAMDAASRRNI ELLAKRFPFERYRRLCDVGCADGLLSRIVAAAHPHLQCVSFDLPAVTEIARRKLTAEG LGERVQACAGDFLADPLPAADVITMGQILHDWNLDRKQQLVAKAYEALSKEGAFIVIE TLIDDARRENTTGLMMSLNMLIEFGDAFDYSAADFRGWCGEAGFRSFEVIPLAGGSSA AVAYK" gene 659450..660868 /gene="cyp135B1" /locus_tag="Rv0568" /db_xref="GeneID:887654" CDS 659450..660868 /gene="cyp135B1" /locus_tag="Rv0568" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv0568, (MT0594, MTV039.06), len: 472 aa. Possible cyp135B1, cytochrome P450 (EC 1.14.-.-), similar to putative cytochrome P-450 monoxygenases and other cytochrome P-450 related enzymes e.g. P29980|CPXN_ANASP PROBABLE CYTOCHROME P450 from Anabaena sp. strain PCC 7120 (459 aa), FASTA scores: opt: 525, E(): 7.2e-27, (31.9% identity in 417 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0327c|NP_214841.1|NC_000962|CYP135A1|MT0342|MTCY63.32c PUTATIVE CYTOCHROME P450 (449 aa), FASTA scores: opt: 1080, E(): 0, (40.5% identity in 444 aa overlap); Rv3685c|NP_218202.1|NC_000962 PUTATIVE CYTOCHROME P450 (476 aa); Rv0136|NP_214650.1|NC_000962 PUTATIVE CYTOCHROME P450 (441 aa); etc. Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="cytochrome P450 135B1" /protein_id="NP_215082.1" /db_xref="GI:15607708" /db_xref="GeneID:887654" /translation="MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFT LHVAGFGHMVYLSDPAAIKTVFAGNPSVFHAGEANSMLAGLLGDSSLLLIDDDVHRDR RRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITLEVILRTVIGA SDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSRLRRRIEEADALLYAEIAD RRADPDLAARTDTLAMLVRAADEDGRTMTERELRDQLITLLVAGHDTTATGLSWALER LTRHPVTLAKAVQAADASAAGDPAGDEYLDAVAKETLRIRPVVYDVGRVLTEAVEVAG YRLPAGVMVVPAIGLVHASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGAT FAMVEMRVVLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQA TAQGAGCPAARGGGPSRAVGSQ" misc_feature 660590..660619 /gene="cyp135B1" /locus_tag="Rv0568" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene 661003..661269 /locus_tag="Rv0569" /db_xref="GeneID:887678" CDS 661003..661269 /locus_tag="Rv0569" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0569, (MTV039.07), len: 88 aa. Conserved hypothetical protein. C-terminus highly similar to AAA63065.1|U15184|MLU15184_10 hypothetical protein from Mycobacterium leprae (53 aa), FASTA scores: opt: 140, E(): 0.0046, (64.7% identity in 34 aa overlap). Also similar to T36824|SCI35.11 hypothetical protein from Streptomyces coelicolor (64 aa); and N-terminus of T36956 probable DNA-binding protein from Streptomyces coelicolor (323 aa). Also highly similar to Rv2302|MTCY339.07c|NP_216818.1|NC_000962 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (80 aa), FASTA scores: opt: 300, E(): 1.4e-13, (61.8% identity in 76 aa overlap). TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215083.1" /db_xref="GI:15607709" /db_xref="GeneID:887678" /translation="MKAKVGDWLVIKGATIDQPDHRGLIIEVRSSDGSPPYVVRWLET DHVATVIPGPDAVVVTAEEQNAADERAQHRFGAVQSAILHARGT" gene 661295..663373 /gene="nrdZ" /locus_tag="Rv0570" /db_xref="GeneID:887666" CDS 661295..663373 /gene="nrdZ" /locus_tag="Rv0570" /EC_number="1.17.4.-" /function="INVOLVED IN THE DNA REPLICATION PATHWAY (AT THE FIRST REACTION). PROVIDES THE PRECURSORS NECESSARY FOR DNA SYNTHESIS [CATALYTIC ACTIVITY: 2'-deoxyribonucleoside diphosphate + oxidized thioredoxin + H2O = ribonucleoside diphosphate + reduced thioredoxin]." /note="Rv0570, (MTV039.08), len: 692 aa. Probable nrdZ, ribonucleoside-diphosphate reductase, large subunit (EC 1.17.4.-), highly similar to others e.g. NP_070492.1|NC_000917|NRD|AE000988_11 ribonucleotide reductase from Archaeoglobus fulgidus (752 aa), FASTA scores: opt: 2001, E(): 0, (52.5% identity in 562 aa overlap) (N-terminus shorter); U73619|TAU73619_1|T37459 ribonucleotide reductase from Thermoplasma acidophilum (857 aa), FASTA scores: opt: 1678, E(): 0, (43.7% identity in 723 aa overlap); etc. BELONGS TO THE RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE LARGE CHAIN FAMILY. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="ribonucleoside-diphosphate reductase large subunit NrdZ" /protein_id="NP_215084.1" /db_xref="GI:15607710" /db_xref="GeneID:887666" /translation="MGVSWPAKVRRRDGTLVPFDIARIEAAVTRAAREVACDDPDMPG TVAKAVADALGRGIAPVEDIQDCVEARLGEAGLDDVARVYIIYRQRRAELRTAKALLG VRDELKLSLAAVTVLRERYLLHDEQGRPAESTGELMDRSARCVAAAEDQYEPGSSRRW AERFATLLRNLEFLPNSPTLMNSGTDLGLLAGCFVLPIEDSLQSIFATLGQAAELQRA GGGTGYAFSHLRPAGDRVASTGGTASGPVSFLRLYDSAAGVVSMGGRRRGACMAVLDV SHPDICDFVTAKAESPSELPHFNLSVGVTDAFLRAVERNGLHRLVNPRTGKIVARMPA AELFDAICKAAHAGGDPGLVFLDTINRANPVPGRGRIEATNPCGEVPLLPYESCNLGS INLARMLADGRVDWDRLEEVAGVAVRFLDDVIDVSRYPFPELGEAARATRKIGLGVMG LAELLAALGIPYDSEEAVRLATRLMRRIQQAAHTASRRLAEERGAFPAFTDSRFARSG PRRNAQVTSVAPTGTISLIAGTTAGIEPMFAIAFTRAIVGRHLLEVNPCFDRLARDRG FYRDELIAEIAQRGGVRGYPRLPAEVRAAFPTAAEIAPQWHLRMQAAVQRHVEAAVSK TVNLPATATVDDVRAIYVAAWKAKVKGITVYRYGSREGQVLSYAAPKPLLAQADTEFS GGCAGRSCEF" gene complement(663487..664818) /locus_tag="Rv0571c" /db_xref="GeneID:887710" CDS complement(663487..664818) /locus_tag="Rv0571c" /function="UNKNOWN" /note="Rv0571c, (MTV039.09c), len: 443 aa. Conserved hypothetical protein, highly similar to the products of two adjacent orfs in Mycobacterium leprae: AAA63059.1|U15184|U650S|Q50111 hypothetical protein (258 aa), FASTA scores: opt: 1071, E(): 0, (72.5% identity in 233 aa overlap); and AAA63058.1|U15184|U650T hypothetical protein (86 aa), FASTA scores: opt: 192, E(): 6.4e-06, (70.8% identity in 48 aa overlap). Also similar to others e.g. NP_107072.1|NC_002678 hypothetical protein from Mesorhizobium loti (235 aa); NP_213031.1|NC_000918 hypothetical protein from Aquifex aeolicus (175 aa); etc. And similar to part of hypothetical proteins from Mycobacterium tuberculosis e.g. C-terminus of Rv2143|MTCY270.25c|Z95388|NP_216659.1|NC_000962 (352 aa), FASTA scores: opt: 592, E(): 7e-32, (49.3% identity in 205 aa overlap); N-terminus of Rv2030c|NP_216546.1|NC_000962 (681 aa). TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215085.1" /db_xref="GI:15607711" /db_xref="GeneID:887710" /translation="MKLFDDRGDAGRQLAQRLAQLSGKAVVVLGLPRGGVPVAFEVAK SLQAPLDVLVVRKLGVPFQPELAFGAIGEDGVRVLNDDVVRGTHLDAAAMDAVERKQL IELQRRAERFRRGRDRIPLTGRIAVIVDDGIATGATAKAACQVARAHGADKVVLAVPI GPDDIVARFAGYADEVVCLATPALFFAVGQGYRNFTQTSDDEVVAFLDRAHRDFAEAG AIDAAADPPLRDEEVQVVAGPVPVAGHLTVPEKPRGIVVFAHGSGSSRHSIRNRYVAE VLTGAGFATLLFDLLTPEEERNRANVFDIELLASRLIDVTGWLATQPDTASLPVGYFG ASTGAGAALVAAADPRVNVRAVVSRGGRPDLAGDSLGSVVAPTLLIVGGRDQVVLELN QRAQAVIPGKCQLTVVPGATHLFEEPGTLEQVAKLACDWFIDHLCGPGPSG" gene complement(665042..665383) /locus_tag="Rv0572c" /db_xref="GeneID:887696" CDS complement(665042..665383) /locus_tag="Rv0572c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0572c, (MTV039.10c), len: 113 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215086.1" /db_xref="GI:15607712" /db_xref="GeneID:887696" /translation="MGEHAIKRHMRQRKPTKHPLAQKRGARILVFTDDPRRSVLIVPG CHLDSMRREKNAYYFQDGNALVGMVVSGGTVEYDADDRTYVVQLTDGRHTTESSFEHS SPSRSPQSDDL" gene complement(665851..667242) /locus_tag="Rv0573c" /db_xref="GeneID:887716" CDS complement(665851..667242) /locus_tag="Rv0573c" /EC_number="2.4.2.11" /function="UNKNOWN" /note="catalyzes the formation of 5-phospho-alpha-D-ribose 1-diphosphate and nicotinate from nicotinate D-ribonucleotide and diphosphate" /codon_start=1 /transl_table=11 /product="nicotinate phosphoribosyltransferase" /protein_id="NP_215087.1" /db_xref="GI:15607713" /db_xref="GeneID:887716" /translation="MAIRQHVGALFTDLYEVTMAQAYWAERMSGTAVFEIFFRKLPPG RSYIMAAGLADVVEFLEAFRFDEQDLRYLRGLGQFSDEFLRWLAGVRFTGDVWAAPEG TVIFPNEPAVQLIAPIIEAQLVETFVLNQIHLQSVLASKAARVVAAARGRPVVDFGAR RAHGTDAACKVARTSYLAGAAGTSNLLAARQYGIPTFGTMAHSFVQAFDSEVAAFEAF ARLYPATMLLVDTYDTLRGVDHVIELAKRLGNRFDVRAVRLDSGDLDELSKATRARLD TAGLEQVEIFASSGLDENRIAALLAARCPIDGFGVGTQLVVAQDAPALDMAYKLVAYD GSGRTKFSSGKVIYPGRKQVFRKLEHGVFCGDTLGEHGENLPGDPLLVPIMTNGRRIR QHAPTLDGARDWARQQIDALPPELRSLEDTGYSYPVAVSDRIVGELARLRHADTAEAH PGSNVVGAKAKRP" gene complement(667252..668394) /locus_tag="Rv0574c" /db_xref="GeneID:887721" CDS complement(667252..668394) /locus_tag="Rv0574c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0574c, (MTV039.12c), len: 380 aa. Conserved hypothetical protein, showing similarity with other hypothetical proteins and polyglutamate synthases (encapsulation proteins) e.g. AAK64444.1|AF377339_5|AF377339 polyglutamate synthase CapA from Myxococcus xanthus (405 aa); M24150|BACCAPABC_3|CapA polyglutamate synthase (encapsulation protein) from B.anthracis (411 aa), FASTA scores: opt: 261, E(): 4.3e-10, (25.8% identity in 287 aa overlap); etc. TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215088.1" /db_xref="GI:15607714" /db_xref="GeneID:887721" /translation="MAGNPDVVTVLLGGDVMLGRGVDQILPHPGKPQLRERYMRDATG YVRLAERVNGRIPLPVDWRWPWGEALAVLENTATDVCLINLETTITADGEFADRKPVC YRMHPDNVPALTALRPHVCALANNHILDFGYQGLTDTVAALAGAGIQSVGAGADLLAA RRSALVTVGHERRVIVGSVAAESSGVPESWAARRDRPGVWLIRDPAQRDVADDVAAQV LADKRPGDIAIVSMHWGSNWGYATAPGDVAFAHRLIDAGIDMVHGHSSHHPRPIEIYR GKPILYGCGDVVDDYEGIGGHESFRSELRLLYLTVTDPASGNLISLQMLPLRVSRMRL QRASQTDTEWLRNTIERISRRFGIRVVTRPDNLLEVVPAANLTSKE" gene complement(668579..669745) /locus_tag="Rv0575c" /db_xref="GeneID:887720" CDS complement(668579..669745) /locus_tag="Rv0575c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0575c, (MTV039.13c), len: 388 aa. Possible oxidoreductase (EC 1.-.-.-), similar to many diverse oxidoreductases and monooxygenases e.g. AL109974|SCF34_5|T36404 probable monooxygenase from Streptomyces coelicolor (407 aa), FASTA scores: opt: 786, E(): 0, (38.7% identity in 398 aa overlap); P96555|AB000564 SALICYLATE HYDROXYLASE from SPHINGOMONAS (395 aa), FASTA scores: opt: 267, E():5e-11, (26.4% identity in 390 aa overlap). Also similar to Rv1260|Z77137|MTCY50.22C from Mycobacterium tuberculosis (372 aa), FASTA scores: opt: 762, E(): 0, (40.9% identity in 345 aa overlap). TBparse score is 0.868. The transcription of this CDS seems to be activated in macrophages (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215089.1" /db_xref="GI:15607715" /db_xref="GeneID:887720" /translation="MKVAISGAGVAGAALAHWLQRTGHTPTVIERAPKFRTGGYMIDF WGVGYQVAKRMGITDQIAAAGYHMEHVRSVGPTGKVKADLGVDVFRRMVGDDFTSLPR GDLAAAIYTTIEDQVETIFDDSIATIDEHRDGVRLTFERTAPRDFDLVIGADGLHSNV RRLVFGPERDFEHYLGCKVAACVVDGYRPRDERSYVLYNTVDRQLARFALRGDRTMFL FVFRAEHDNPGVAPKDELRDQFGDVGWESRDILAALDDVEDLYFDVVSQIRMDRWSRG RVLLIGDAAGCISLLGGEGTGLAITEAYVLAGELARAGGDHRRAFDAYEKRLRPFIEG KQASAAKFIWFFATRTRFGLWFRNVAMRTMNFGPLATLFAGSVRDDFELPDYTW" gene 669848..671152 /locus_tag="Rv0576" /db_xref="GeneID:887717" CDS 669848..671152 /locus_tag="Rv0576" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0576, (MTV039.14), len: 434 aa. Probable transcriptional regulator, ArsR family. N-terminus highly similar to others e.g. NP_102487.1|NC_002678 transcriptional regulator from Mesorhizobium loti (104 aa); NP_242952.1|NC_002570 transcriptional regulator (ArsR family) from Bacillus halodurans (109 aa); etc. C-terminal region ( 240-434) shows similarity with D67028_1 from Rhodococcus rhodochrous (112 aa); and Rv0738 from Mycobacterium tuberculosis (182 aa). N-terminus also highly similar to Rv2034 from Mycobacterium tuberculosis (107 aa). Contains helix-turn-helix motif at aa 23-43 (Score 1628, +4.73 SD). TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="NP_215090.1" /db_xref="GI:15607716" /db_xref="GeneID:887717" /translation="MLEVAAEPTRRRLLQLLAPGERTVTQLASQFTVTRSAISQHLGM LAEAGLVTARKQGRERYYRLDERGVLRLRALMESFWSDELDRLVADAAHYPPSQGDCA MPFEKAVVVPLDPTSTFALITQPDRLRRWMAVAARIELRTGGAYRWTVTPGHSAAGTV IDVDPGKRVVFTWGWEDHGDPPPGGSTVTITLTPVDGGTEVRLVHDGLTAQQAARHAK GWNHFLDRLVVAGQRGDAGPDEWAAAPDPLDELSCAEATLAVLQHVLRGIGASDLTRQ TPCTEYDVSQLADHLLRSLAIIGAAAGAQLAPRDVDAPLETQVADAAQAVMEAWRRRG LAGTVELNSNQVPATVPVGILCLEFLVHAWDFAIATGSQVIASEPVSEYVLAVAGKVI TPATRNSAGFAAPAAVGSFAPVLDRLIAFTGRQPTAGHVSAT" gene 671166..671951 /gene="TB27.3" /locus_tag="Rv0577" /db_xref="GeneID:887732" CDS 671166..671951 /gene="TB27.3" /locus_tag="Rv0577" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0577, (MTV039.15), len: 261 aa. TB27.3, conserved hypothetical protein. Corresponds to O53774|CF30_MYCTU 27 kDa ANTIGEN CFP30B from Mycobacterium tuberculosis culture filtrate (260 aa), FASTA scores: opt: 1781, E(): 0, (100.0% identity in 260 aa overlap). Also similar to several hypothetical proteins and hydroxylases from Steptomyces sp. e.g. T35032 probable hydroxylase from Streptomyces coelicolor (263 aa); Q55078 orfA gene product from Streptomyces sp. (275 aa), FASTA scores: E(): 1.5e-1 9, (38.6% identity in 264 aa overlap); D89734_1|P95754 DNA for SgaA SGAA PROTEIN from Streptomyces griseus; and SC9B10_20 from Streptomyces coelicolor (267 aa), FASTA score: (38.9 identity in 252 aa overlap). Also similar to Rv0911|MTCY21C12.05 from Mycobacterium tuberculosis (257 aa), FASTA scores: E(): 1.1e-20, (32.0% identity in 259 aa overlap). TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215091.1" /db_xref="GI:15607717" /db_xref="GeneID:887732" /translation="MPKRSEYRQGTPNWVDLQTTDQSAAKKFYTSLFGWGYDDNPVPG GGGVYSMATLNGEAVAAIAPMPPGAPEGMPPIWNTYIAVDDVDAVVDKVVPGGGQVMM PAFDIGDAGRMSFITDPTGAAVGLWQANRHIGATLVNETGTLIWNELLTDKPDLALAF YEAVVGLTHSSMEIAAGQNYRVLKAGDAEVGGCMEPPMPGVPNHWHVYFAVDDADATA AKAAAAGGQVIAEPADIPSVGRFAVLSDPQGAIFSVLKPAPQQ" gene complement(671996..675916) /gene="PE_PGRS7" /locus_tag="Rv0578c" /db_xref="GeneID:887725" CDS complement(671996..675916) /gene="PE_PGRS7" /locus_tag="Rv0578c" /function="UNKNOWN" /note="Rv0578c, (MTV039.16c), len: 1306 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many other PGRS proteins e.g. MTCY493.04|Z95844 from Mycobacterium tuberculosis (1329 aa), FASTA scores: opt: 3994, E(): 0, (54.6% identity in 1375 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures possibly fortuitously. TBparse score is 0.867." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177739.1" /db_xref="GI:57116755" /db_xref="GeneID:887725" /translation="MSFVIATPEMLTTAATDLAKIGSTITAANTAAAAVAKVLPASAD EVSVAVAALFGTHAQEYQTVSAQVATFHDRFVQTLSAAASSYVAAEAVNVEQSLLAAV NAPTQALFGRPLIGNGADGSPGTGQAGGPGGILYGNGGNGGSGAPGQRGGAGGAAGLI GNGGNGGAGGVGTTGGAGGHGGAGGWLYGNGGAGGFGGAGAVGGNGGAGGTAGLFGVG GAGGAGGNGIAGVTGTSASTPGGSGTAGGAGGIGGNGGAGGAGGVLMGNGGNGGAGGE GGPGGAGGAGASGAHATNLGADGQAGGNGGNGGAGGTGGVGGPGGGHGLLGLGGSHGA GGAGGSGGDGGAPGDGGNGATGTWGHNLGAGGTGGNGGNPGAGGAGGAGGASVGGSAH GANGAPGTTSTSGGNGGDGGKGADAISSGQTGANGGRGGDGGQVGNGGAGGAGGRGGA GGLGFGSEAPGRPGGAGGTGGAGGNGGTQAGDGGTGGAGGAGGDGGSGGAGSIGFNAS APGAAGSPGGNGGNGGPGGAGGEGGAGGLALAASGQNGSQGAGGDGGAGGNGGTPGNG GHGAAGALGVNGGVGGAGGHGGDPGVGGAGGQGGSGSTPGANGAPGNTPTSGGNGGNG GRGADATGFGQTGASGGRGGDGGLVGNGGAGGAGGNGSKGLPGLGRLGNPGLDGGTGG NGGAGGSGGAWAGNGGTGGAGGTGGVGGTGGSGSDGVNGSSAGADGHPGGTGGVGGTG GKGGDGGDGGAAPNGVAGSQGPGGAGGDGGTGGVGGNGGRGIDGADGATAGARGQDGG AGGAGGKGGRGGTGGPGGAGPAGTTGSQGAGGNGGSGGTGGDPGDGGNGANGSVFTNN GIGGNGGNGGNAGPSGAGGSGGAGSTFGATGSSSSIHVNGGNGGNGGNGDHALSGNGA AGGNGGNGGNGSLRGSGGAGGHGGNGGNASRGMGGDGGTGGAGGNAGQIGNGGAGGNG GDGGTGSDGNPGAITGSGGRGGDGGVGGQGGSVAGDGADGGRGGAGGTGGTGLRGTTG ATGATGTFDAGADGHGGNGGTGGVGGTGGAGGGGGNGGAGGKALSPTGNNGSQGAGGD GGAGGAGGTGGTGGDGGRGAHGTLFSSLAGTGGTGGNGGTGGTGGTGGAGGAGGTGST LGATGATGAAGRAGNGGVGGSGGLGSAFGPGGTGGMGGAGGTSTVSAGGDGGRGGFGG DGLDASSGGNGGDGGHGGDGFRTAGAGGRGGDGGKGADPGGLFPIPGAGGKGGTGGTG GTAHLGPLAIIGQSGQPGQFGSPGADGRGGAGGAGGGGGAGGSF" misc_feature complement(673034..673108) /gene="PE_PGRS7" /locus_tag="Rv0578c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" misc_feature complement(673880..673954) /gene="PE_PGRS7" /locus_tag="Rv0578c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene 676238..676996 /locus_tag="Rv0579" /db_xref="GeneID:887738" CDS 676238..676996 /locus_tag="Rv0579" /function="UNKNOWN" /note="Rv0579, (MTV039.17), len: 252 aa. Conserved hypothetical protein, showing some similarity to others e.g. AE001747_4 hypothetical protein from Thermotoga maritima (247 aa), FASTA scores: opt: 612, E(): 0, (39.6% identity in 235 aa overlap); AE001004_2 hypothetical protein from Archaeoglobus fulgidus (159 aa), FASTA scores: opt: 196, E(): 1e-06, (28.3% identity in 159 aa overlap); etc. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215093.1" /db_xref="GI:15607719" /db_xref="GeneID:887738" /translation="MVGYVDVRAYAELNEFVELQARGLTVRRPFRSHQTVKDVLEAMG IPHTEVDLILVNGDPADFSYRPVAGDRIAAYPMFEALDIGSTARLRPAPLRNPRFVVD VNLGQLARLLRLLGFDTRWSSAADDPTLADISLGEQRILLTRDRGLLKRRAITHGLFV HSQHPEEQALEVLRRLDLNGRLAPLSRCLRCNGELAAVSKDEVIGQLEPLTRRYYESF SRCFGCGRIYWPGSHHARLVRLVERLRDQLTTST" gene complement(677125..677616) /locus_tag="Rv0580c" /db_xref="GeneID:887712" CDS complement(677125..677616) /locus_tag="Rv0580c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0580c, (MTV039.18c), len: 163 aa. Conserved hypothetical protein, equivalent to AAA90989.1|U20446|MK35 lipoprotein precursor from Mycobacterium kansasii (225 aa). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215094.1" /db_xref="GI:15607720" /db_xref="GeneID:887712" /translation="MTDQSYAVDIAHPPAALLRLVNPILRSLLHTPLAGPLRTQLMVV SFTGRKTGRHFSIPLSAHVIDNDLYALTEAGWKHNFSDGAAAQVVYDGKTTAMRGELI RDRAVVSELFLRAAQAYGVKRGQRMLGLSFRDRRIPTLEEFAEAVDRLKLVAIRLTPA DNS" gene 677710..677925 /locus_tag="Rv0581" /db_xref="GeneID:887739" CDS 677710..677925 /locus_tag="Rv0581" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0581, (MTV039.19), len: 71 aa. Conserved hypothetical protein, showing weak similarity to several Mycobacterium tuberculosis proteins including P95003|Z83863|Rv2550c|MTCY159_6 CONSERVED HYPOTHETICAL PROTEIN (81 aa), FASTA scores: opt: 93, E(): 3.2, (25.7% identity in 70 aa overlap); Rv2871; Rv1241; etc. Also shows weak similarity to X05648|SGSPH_1 from Streptomyces glaucescens (77 aa), FASTA scores: opt: 92, E(): 3.6, (35.4% identity in 65 aa overlap). TBparse score is 0.864." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215095.1" /db_xref="GI:15607721" /db_xref="GeneID:887739" /translation="MDKTTVYLPDELKAAVKRAARQRGVSEAQVIRESIRAAVGGAKP PPRGGLYAGSEPIARRVDELLAGFGER" gene 677922..678329 /locus_tag="Rv0582" /db_xref="GeneID:887747" CDS 677922..678329 /locus_tag="Rv0582" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0582, (MTV039.20), len: 135 aa. Hypothetical unknown protein. TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215096.1" /db_xref="GI:15607722" /db_xref="GeneID:887747" /translation="MIIDTSALLAYFDAAEPDHAAVSECIDSSADALVVSPYVVAELD YLVATRVGVDAELAVLRELAGGAWELANCGAAEIEQAARIVTKYQDQRIGIADAANVV LADRYRTRTILTLDRRHFSALRPIGGGRFTVIP" gene complement(678389..679075) /gene="lpqN" /locus_tag="Rv0583c" /db_xref="GeneID:887733" CDS complement(678389..679075) /gene="lpqN" /locus_tag="Rv0583c" /function="UNKNOWN" /note="Rv0583c, (MTV039.21c), len: 228 aa. Probable lpqN, conserved lipoprotein, equivalent to AAA90989.1|U20446|MK35|U20446|MKU20446_1 lipoprotein precursor from Mycobacterium kansasii (225 aa), FASTA scores: opt: 945, E(): 0, (62.7% identity in 228 aa overlap); and similar to others from Mycobacteria e.g. Rv0040c and Rv1016c from Mycobacterium tuberculosis. Contains N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="lipoprotein LpqN" /protein_id="NP_215097.1" /db_xref="GI:15607723" /db_xref="GeneID:887733" /translation="MKHFTAAVATVALSLALAGCSFNIKTDSAPTTSPTTTSPTTSTT TTSATTSAQAAGPNYTIADYIRDNHIQETPVHHGDPGSPTIDLPVPDDWRLLPESSRA PYGGIVYTQPADPNDPPTIVAILSKLTGDIDPAKVLQFAPGELKNLPGFQGSGDGSAA TLGGFSAWQLGGSYSKNGKLRTVAQKTVVIPSQGAVFVLQLNADALDDETMTLMDAAN VIDEQTTITP" misc_feature complement(679016..679048) /gene="lpqN" /locus_tag="Rv0583c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 679229..681862 /locus_tag="Rv0584" /db_xref="GeneID:887752" CDS 679229..681862 /locus_tag="Rv0584" /function="UNKNOWN" /note="Rv0584, (MTV039.22), len: 877 aa. Possible conserved exported protein, similar to other hypothetical proteins which are not necessarily secreted e.g. CAB61925.1|AL133278 putative secreted protein from Streptomyces coelicolor (772 aa); AAD51075.1|AF175722_1|AF175722 immunoreactive 89kD antigen PG87 from Porphyromonas gingivalis (781 aa), FASTA scores: opt: 637, E(): 2.1e-30, (29.1% identity in 794 aa overlap); etc. Contains PS00699 Nitrogenases component 1 alpha and beta subunits signature 1. Has potential N-terminal signal peptide. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215098.1" /db_xref="GI:15607724" /db_xref="GeneID:887752" /translation="MRARRLRRALAALLAVAGLFVPFIVGVPTAYDGEPVFVAIPVEH VNTLIGTGTGAAIVGEINNFPGASVPFGMVQYSPDTVDNYAGYDYDNPHSTGFSMTHA SVGCPAFGDISMLPTTTPLGSQPWSAWEEIAHDDTEVGVPGYYTVRFPGTGVIAELTA TTRTGVGRFRYPRNGWPALFHVRSGASLAGNYAATLQIEDNTTITGSATSGGFCGKKN LYTVYFAMKFSQPFSSYGTWDGYAVYPGSHSMNSSYSGGYVGFPAGSVLEVRTALSYV SVDGARANLDAEGGASFDDIRAATSSEWNAALSRIAVAGRGPGDVDTFYTCLYRSLLH PNTFNDVDGRYIGFDGVIHSVASGHTHYANFSDWDTYRSLAPLQGLLFPQRASDMIQS LVTDAEQSGAYPRWALANSATGMMSGDSVVPLIVNLYAFGARDFDLKSALHYMVNAAT QGGVGLDGFLERPGIAAYLRLGYGPQTAEFRANGRIAGASVTLEWSVDDFAISRFADS LGDTATAAVFQNRSQYWQNLFNPTTGYISPRSAAGFFPDGPGFVAYPSGFGQDGYDEG NAEQYLWWVPHNVAGLVTALGGRTAVVKRLDRFTKKLNVGPNEPYLWAGNEPGFGVPW LYNYIGQPWKTQRTVDRVRGLFGPTPGGAPGNDDLGALSSWYVWAALGLYPSTPGTTI LTVNTPLFDRAVIALPTGKSIQITAPGASGRNRLKYIDGLTIDRQPSNQTFLPESIVR TGGDLTFSLAGTPNKVWGTAASAAPPSFGAGSSAVTVNIARPIIGIVPGATGTVTVDA QRMIDGVDDYTVTPTSYVVGIAAEPLSGQFDDDGAVSASVAITVARSVPSGYYPIYVT TSAGDSARTLIVLVVVAEAVE" misc_feature 679523..679546 /locus_tag="Rv0584" /note="PS00699 Nitrogenases component 1 alpha and beta subunits signature 1" gene complement(681885..684272) /locus_tag="Rv0585c" /db_xref="GeneID:887753" CDS complement(681885..684272) /locus_tag="Rv0585c" /function="UNKNOWN" /note="Rv0585c, (MTV039.23c, MTCY19H5.37), len: 795 aa. Probable conserved integral membrane protein. C-terminus similar to CAB88984.1|AL353864 putative integral membrane protein from Streptomyces coelicolor (299 aa); and C-terminal region of CAC01311.1|AL390968 putative integral membrane protein from Streptomyces coelicolor (925 aa). Also some similarity with Rv0204 from Mycobacterium tuberculosis. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215099.1" /db_xref="GI:15607725" /db_xref="GeneID:887753" /translation="MRVDGRDIGVSGNLLQPLTRRTNDIIRAVLAAIYLVAVITSSLI TRPQWVALEKSISEIVGVLSPSQSDLVYLGYGLAILALPFVILIGLIVSRQWKLLGAY AAAGLMAVLPLSISSSRIAAPRWHFDLSDRLATLLAQFLDDPRWIAMLAAVLTVSGPW LPARWRHWWWALLLAFVPIHLVVSAIVPARSLLGLAVGWLVGALVVLVVGTPALEVPL DGAIRALAKRGFAVSGLAVVRPAGPGPLVLSAACEQPNAGACSEALIELYGPHQSGGG ALRQLWLKLTLRGTETAPLQASMRRAVEHRALMAIAFGDLGMANTTVIAVSPLDRGWT LYAHRPARGIGISECTKTTPTAHVWEALRTLHDQQISHGDLCSAEITVDNGAVLFGGF GEAEYGATDAQLQSDLAQLLVTTSALYDAEAAVTAAIDTFGKQAILAASRRLTKSAVP KRIRESITDPNAVIASTRAEVMRQTGADQIKAETITRFSRGQLIQLVLIGALVYVAYP FISTVPTFFSQLRTANWWWALLGLAVSALTYVGAAAALWACADGLVGFWKLSIMQVAN TFAATTTPAGVGGLALSTRFLQKGGLTAVRATAAVALQQSVQVIVHLVLLILFSALAG TSTDLSHFVPNATVLYLIAGVALGIVGTFLFVPKLRRWLATAVRPKLREVTNDLIALA REPKRLALIVLGCAGTTLGAALALWASIEAFGGGTTFVTVTVVTMVGGTLASAAPTPG GVGAVEAALIGGLAAFGVPAALGVPSVLLYRLLTCWLPVFAGWQVMHWLTRHEMI" gene 684410..685132 /locus_tag="Rv0586" /db_xref="GeneID:887754" CDS 684410..685132 /locus_tag="Rv0586" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0586, (MTCY19H5.36c), len: 240 aa. Probable transcriptional regulator, GntR family, similar to many e.g. P33233|LLDR_ECOLI putative L-lactate dehydrogenase operon regulatory protein from Escherichia coli (258 aa), FASTA scores: opt: 225, E(): 9.3e-08, (26.7% identity in 232 aa overlap); etc. Also similar to other M. tuberculosis transcriptional regulators GntR proteins e.g. Rv3060c, Rv0792c, etc. Contains PS00043 Bacterial regulatory proteins, gntR family signature and probable helix-turn helix motif from aa 35-56 (Score 1531, +4.40 SD)." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="NP_215100.1" /db_xref="GI:15607726" /db_xref="GeneID:887754" /translation="MALQPVTRRSVPEEVFEQIATDVLTGEMPPGEALPSERRLAELL GVSRPAVREALKRLSAAGLVEVRQGDVTTVRDFRRHAGLDLLPRLLFRNGELDISVVR SILEARLRNFPKVAELAAERNEPELAELLQDSLRALDTEEDPIVWQRHTLDFWDHVVD SAGSIVDRLMYNAFRAAYEPTLAALTTTMTAAAKRPSDYRKLADAICSGDPTGAKKAA QDLLELANTSLMAVLVSQASRQ" misc_feature 684518..684583 /locus_tag="Rv0586" /note="PS00043 Bacterial regulatory proteins, gntR family signature" gene 685129..685926 /gene="yrbE2A" /locus_tag="Rv0587" /db_xref="GeneID:887755" CDS 685129..685926 /gene="yrbE2A" /locus_tag="Rv0587" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0587, (MTCY19H5.35c), len: 265 aa. yrbE2A, hypothetical unknown integral membrane protein, part of mce2 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07412|Rv0167|MTCI28.07|yrbE1A (265 aa); O53965|Rv1964|MTV051.02|yrbE3A (265 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEA type, e.g. P45392|YRBE_ECOLI hypothetical 27.9 kDa protein from Escherichia coli (260 aa), FASTA scores: opt: 287, E(): 6.1e-12, (21.5% identity in 256 aa overlap); P45030|YRBE_HAEIN|HI1086 hypothetical protein from Haemophilus influenzae (261 aa), FASTA scores: opt: 311, E(): 1.8e-83, (24.2% identity in 265 aa overlap); NP_302654.1|NC_002677 conserved membrane protein from Mycobacterium leprae (267 aa); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein YrbE2a" /protein_id="NP_215101.1" /db_xref="GI:15607727" /db_xref="GeneID:887755" /translation="MTTHAVIITYLRDQTQPAVDAIGGFYRTCVLTGKALVRRPFHWR EAIEQGWFITSVSLLPTLAVSIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQLG PLTTVLVIAGAGATAICADLGARTIREEIDAMEVLGIDPIHRLVVPRVVAATIVAALL NGAVITIGLVGGFVFSVFIQHVSAGAYVGTLTLVTGLPEVIISVVKSATFGLIAGLVG CYRGLTTKGGPKGVGTAVNETLVLCVIALFATNVVLTTIGVRFGTGH" gene 685928..686815 /gene="yrbE2B" /locus_tag="Rv0588" /db_xref="GeneID:887761" CDS 685928..686815 /gene="yrbE2B" /locus_tag="Rv0588" /function="UNKNOWN" /note="Rv0588, (MTCY19H5.34c), len: 295 aa. yrbE2B, hypothetical unknown integral membrane protein, part of mce2 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07413|Rv0168|MTCI28.08|yrbE1B (289 aa); O53966|Rv1965|MTV051.03|yrbE3B (271 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEB type, e.g. P45392|YRBE_ECOLI hypothetical 27.9 kDa protein from Escherichia coli (260 aa), FASTA scores: opt: 232, E(): 8.4e-08, (22.1 % identity in 267 aa overlap); P45030|YRBE_HAEIN|HI1086 hypothetical protei from Haemophilus influenzae (261 aa), FASTA scores: opt: 234, E(): 6.3e-08, (24.2% identity in 215 aa overlap); NP_302655.1|NC_002677 conserved membrane protein from Mycobacterium leprae (289 aa); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein YrbE2b" /protein_id="NP_215102.1" /db_xref="GI:15607728" /db_xref="GeneID:887761" /translation="MVESSTASAAAVLRARYPRTAASLDRYGGGTARRLERTGTFARF TRISVVQIGWALRRYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAIQG FASLGNIGVEAFTGFFAALANTRVAAPIVSGVALAATVGAGATAQLGAMRISEEIDAL EVMGIKSISFLVSTRILGGLVVIMPLYALALDMAFTSGQVVTTVFYGQSNGTYEHYFR TFLRPEDVGWSVVEVVIIAVVVMITHCYYGYTASGGPVGVGQAVGRSMRFSLVSVVVV VLLAELALYGVDPNFNLTV" gene 686821..688035 /gene="mce2A" /locus_tag="Rv0589" /db_xref="GeneID:887745" CDS 686821..688035 /gene="mce2A" /locus_tag="Rv0589" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv0589, (MTCY19H5.33c), len: 404 aa. mce2A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins P72013|MCE1|Rv0169|MTCI28.09|mce1A (454 aa); O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa); etc. Also highly similar to others e.g. AAD52105.1|AF113402_1|AF113402 mycobacterial cell entry protein from Mycobacterium bovis BCG (454 aa); NP_302656.1|NC_002677 putative cell invasion protein from Mycobacterium leprae (441 aa); CAC12798.1|AL445327 putative secreted protein from Streptomyces coelicolor (418 aa); etc. Also highly similar, but longer 21 aa, to P72013|CAA50257.1|X70901|MTCI28.08 Mcep protein from Mycobacterium tuberculosis (432 aa), FASTA scores: opt: 1324, E(): 0, (62.6% identity in 436 aa overlap). Contains a possible N-terminal signal or anchor sequence. Note that previously known as mce2.; mce2" /codon_start=1 /transl_table=11 /product="MCE-family protein MCE2A" /protein_id="YP_177740.1" /db_xref="GI:57116756" /db_xref="GeneID:887745" /translation="MPTLVTRKNRRAWLYVEGVVLLLVGALVLVLVYKQFRGEFTPKT ELTMVAFRAGLVMEAGSKVTYNGVEIGRVGSISEIERDGRPAAKLVLDVNPRYISLIP VNVVADIEAATLFGNKYVALSAPKIPQQQRISSHDVIDVGSVTTEFNTLFETITSIAE KVDPIELNATLSAVAQALDGLGGKFGESIVNGNQILAQLNPRLPQLGYDVRRLADLGE VYVDASPDLWSFLQNALTTARTLTSQQRDLDAALLAATGAGNTGEDVFARGGPYLARA AADLVPTATLLDTYSPELFCMIRNFHDAAPKVADAVGGNGYSLAAAGTILGAPNPYVY PDNLPRVNAHGGPGGRPGCWQTITRELWPAPYLVMDTGASLAPYNHVELGQPMFTEYV WGRQYGENTINP" gene 688032..688859 /gene="mce2B" /locus_tag="Rv0590" /db_xref="GeneID:887771" CDS 688032..688859 /gene="mce2B" /locus_tag="Rv0590" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv0590, (MTCY19H5.32c), len: 275 aa. mce2B; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07414|Rv0170|MTCI28.10|mce1B (346 aa); O53968|Rv1967|MTV051.05|mce3B (342 aa); etc. Also highly similar to others e.g. NP_302657.1|NC_002677 putative secreted protein from Mycobacterium leprae (346 aa); P45391|YRBD_ECOLI hypothetical 19.6 kDa protein from Escherichia coli (183 aa), FASTA scores: opt: 160, E(): 0.00099, (28.3% identity in 166 aa overlap); P45029|YRBD_HAEIN|HI1085 hypothetical protein from Haemophilus influenzae (167 aa), FASTA scores: opt: 135, E():0.035, (25.9% identity in 143 aa overlap); etc. Contains possible N-terminal signal or anchor sequence." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE2B" /protein_id="NP_215104.1" /db_xref="GI:15607730" /db_xref="GeneID:887771" /translation="MKTTGTTIKLGIVWLVLSVFTVMIIVVFGQVRFHHTTGYSAVFT HVSGLRAGQFVRAAGVEVGKVAKVTLIDGDKQVLVDFTVDRSLSLDQATTASIRYLNL IGDRYLELGRGHSGQRLAPGATIPLEHTHPALDLDALLGGFRPLFQTLDPDKVNSIAS SIITVFQGQGATINDILDQTASLTATLADRDHAIGEVVNNLNTVLATTVKHQTEFDRT VDKLEVLITGLKNRADPLAAAAAHISSAAGTLADLLGRIVHCCTAASGTSRASSSRS" gene 688808..689062 /locus_tag="Rv0590A" /db_xref="GeneID:3205078" CDS 688808..689062 /locus_tag="Rv0590A" /function="UNKNOWN, BUT COULD BE INVOLVED IN HOST CELL INVASION." /note="Rv0590A, len: 84 aa. Probable continuation of mce2B|Rv0590. Can find no frameshift to account for this. Possible nucleotide G missing at 688793 as there are 5 in Mycobacterium bovis but only 4 in CDC1551. Strong similarity to C-terminus of other Mce proteins e.g. AL583926|AL583926_38 from Mycobacterium leprae strain TN (346 aa), FASTA scores: E(): 1.2e-20, (67.85% identity in 84 aa overlap)." /codon_start=1 /transl_table=11 /product="MCE family-like protein" /protein_id="YP_177627.1" /db_xref="GI:57116757" /db_xref="GeneID:3205078" /translation="MLHSSFGHLEGIQQPLIDELAELDHVLGKLPDAYRIIGRAGGIY GDFFNFYLCDISLKVNGLQPGGPVRTVKLFGQPTGRCTPQ" gene 689059..690504 /gene="mce2C" /locus_tag="Rv0591" /db_xref="GeneID:887770" CDS 689059..690504 /gene="mce2C" /locus_tag="Rv0591" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv0591, (MTCY19H5.31c), len: 481 aa. mce2C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07415|R0171|MTCI28.11|mce1C (515 aa); O53969|Rv1968|MTV051.06|mce3C (410 aa); etc. Also highly similar to others e.g. NP_302658.1|NC_002677 putative secreted protein from Mycobacterium leprae (519 aa); CAC12796.1|AL445327 putative secreted protein from Streptomyces coelicolor (351 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and may contain N-terminal signal or anchor sequence. Has highly Pro-rich C-terminus." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE2C" /protein_id="NP_215105.1" /db_xref="GI:15607731" /db_xref="GeneID:887770" /translation="MRTLTEFNRGRVGMMGAVVTVLVVGVAQSFTSVPMLFATPTYYA QFADTGGINTGDKVEIAGVNVGLVRSLAIRGNRVLIGFSLPGKTIGMQSRAAIRTDTI LGRKNLEIEPRGSEPLKPNGFLPLAQTTTPYQIYDAFVDVTKAATGWDIDAVKRSLNV LSETFDQTAPHLSAALEGVKAFSDTVGRRGEQIEQLLANANRIARVLGDRSEQVNGLL VNAKTLLAAFKQRSQALRILLTNVSEASAQVSGLITDNPNLNHVLAQLRTVSEELVKR KNELADVAVLLGRYTAALTEAVGSGPFFKAMVVNLLPYQILQPWVDAAFKKRGIDPEN FWRSAGLPEFRWPDPNGTRFPNGAPPAAPPVREGTPKHPGPAVPPGTPCSYTPAAGAL PRPDTPLPCAGATVGPFGGPDFPAPLDVQPSPPNPDGPPPTPGILSAGRPGEPAPAVP GIPMPLPPNAPPGARTQPLEPFPDGTGGSNQ" misc_feature 689299..689322 /gene="mce2C" /locus_tag="Rv0591" /note="PS00017 ATP/GTP-binding site motif A" gene 690501..692027 /gene="mce2D" /locus_tag="Rv0592" /db_xref="GeneID:887786" CDS 690501..692027 /gene="mce2D" /locus_tag="Rv0592" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv0592, (MTCY19H5.30c), len: 508 aa. mce2D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07416|Rv0172|MTCI28.12|mce1D (530 aa); O53970|Rv1969|MTV051.07|mce3D (423 aa); etc. Also highly similar to others e.g. NP_302659.1|NC_002677 putative secreted protein from Mycobacterium leprae (531 aa); CAC12795.1|AL445327 putative secreted protein from Streptomyces coelicolor (337 aa); etc. Has highly Pro-rich C-terminus and may contain N-terminal signal or anchor sequence." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE2D" /protein_id="NP_215106.1" /db_xref="GI:15607732" /db_xref="GeneID:887786" /translation="MSTIFDIRSLRLPKLSAKVVVVGGLVVVLAVVAAAAGARLYRKL TTTTVVAYFSEALALYPGDKVQIMGVRVGSIDKIEPAGDKMRVTLHYSNKYQVPATAT ASILNPSLVASRTIQLSPPYTGGPVLQDGAVIPIERTQVPVEWDQLRDSINGILRQLG PTERQPKGPFGDLIESAADNLAGKGRQLNETLNSLSQALTALNEGRGDFVAITRSLAL FVSALYQNDQQFVALNENLAEFTDWFTKSDHDLADTVERIDDVLGTVRKFVSDNRSVL AADVNNLADATTTLVQPEPRDGLETALHVLPTYASNFNNLYYPLHSSLVGQFVFPNFA NPIQLICSAIQAGSRLGYQESAELCAQYLAPVLDALKFNYLPFGSNPFSSAATLPKEV AYSEERLRPPPGYKDTTVPGIFSRDTPFSHGNHEPGWVVAPGMQGMQVQPFTANMLTP ESLAELLGGPDIAPPPPGTNLPGPPNAYDESNPLPPPWYPQPASLPAAGATGQPGPGQ" gene 692024..693232 /gene="lprL" /locus_tag="Rv0593" /db_xref="GeneID:887829" CDS 692024..693232 /gene="lprL" /locus_tag="Rv0593" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv0593, (MTCY19H5.29c), len: 402 aa. Possible lprL (alternate gene name: mce2E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07417|LPRK|Rv0173|MTCI28.13|mce1E (390 aa); O53971|LPRM|Rv1970|MTV051.08|mce3E (377 aa); etc. Also highly similar to others e.g. NP_302660.1|NC_002677 putative lipoprotein from Mycobacterium leprae (392 aa); CAC12794.1|AL445327 putative secreted protein from Streptomyces coelicolor (413 aa); etc. Contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site.; mce2E" /codon_start=1 /transl_table=11 /product="MCE-family lipoprotein LprL" /protein_id="NP_215107.1" /db_xref="GI:15607733" /db_xref="GeneID:887829" /translation="MRCGVSAGSANGKPNRWTLRCGVSAGHRGSVFLLAVLLAPVVLT SCTWRGIANVPLPVGRGMGPDRMTIYVQMPDTLALNTNSRVRVADVWVGTVRDISLRN WIATLTLELEPTVRLPANATAKIGQTSLLGTQHVELAAPPIPSPQPLKSGDTIGLKNS SAYPTVERTLASVALILTGGGIVNLDVIQTEILNILDGHAGQIREFLERLATFTAELN NQRGDLTRAIDSTNQLLTIIANRNDTLDRVLTDVPPLIEHFADTGQLFADATESLGRF SEVANRALAATRPNLHQTLQSLQRPLRQLERASPYVVGALKLGLTAPFNIDEVPNVIR GDYVNVSATFDVTLSALDNALLSGTGISGMLRALEQAWGRDPDTMIPDVRYTPNPNDA PGGPLVERAE" misc_feature 692129..692161 /gene="lprL" /locus_tag="Rv0593" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 693237..694787 /gene="mce2F" /locus_tag="Rv0594" /db_xref="GeneID:887797" CDS 693237..694787 /gene="mce2F" /locus_tag="Rv0594" /function="UNKNOWN, BUT THOUGHT INVOLVED IN HOST CELL INVASION." /note="Rv0594, (MTCY19H5.28c), len: 516 aa. mce2F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), similar to Mycobacterium tuberculosis proteins O07418|Rv0174|MTCI28.14|mce1F (515 aa); O53972|Rv1971|MTV051.09|mce3F (437 aa); etc. Also highly similar to others e.g. NP_302661.1|NC_002677 putative secreted protein from Mycobacterium leprae (516 aa); AAF74993.1|AF143400_1|AF143400|996A027a protein from Mycobacterium avium (80 aa) (similarity on C-terminus); CAC12793.1|AL445327 putative secreted protein from Streptomyces coelicolor (433 aa); etc. Contains possible N-terminal signal or anchor sequence." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE2F" /protein_id="NP_215108.1" /db_xref="GI:15607734" /db_xref="GeneID:887797" /translation="MLTRAIKTQLVLLTVLAVIAVVVLGWYFLRIPSLVGIGRYTLYA ELPRSGGLYRTANVTYRGITIGKVTGVEPTERGARATMSIDNGYQIPTDASANVHSVS AVGEQFVDLVSTRTSGPYLRHGQTITTTTVPSQIGPALDAANRGLAVLPKDRVASVLH EASEAVGGLGSSLNRLIEATQAIAHDVRGSLEDIDDIIERSAPIIDSQVNSGNEIARW AANLNTLAAQTAQTDPAVRSILANAAPTADQVNATFSDVRESLPQTLANLEVVIDMLK RYHNGVEQALVFLPQSGAIAQSVTTEFPGQAGLGVGGLALNQPPPCLTGFLPASEWRS PADTSTAPLPKGTYCRIPMDASNVVRGARNNPCVDVPGKRAATPRECRSNEAYVPGGT NPWYGDPNQMLSCPAPAARCDQPVKPGQVIPAPSVNNGINPLPADQLPGTPPPVNDPL QRPGSGTVQCNGQQPNPCVYTPSTFPTTIYDVQSGKVVAPDGVVYSVEASTHAGADGW KVMLAPTG" gene complement(694839..695231) /locus_tag="Rv0595c" /db_xref="GeneID:887835" CDS complement(694839..695231) /locus_tag="Rv0595c" /function="UNKNOWN" /note="Rv0595c, (MTCY19H5.27), len: 130 aa. Conserved hypothetical protein, similar to other conserved hypothetical proteins e.g. Rv0627 (135 aa) and Rv0665 (112 aa) from Mycobacterium tuberculosis; and STBB_PSESM|Q52562 plasmid stability protein from Pseudomonas syringae (139 aa), FASTA scores: opt: 131, E(): 0.0035, (35.2% identity in 88 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215109.1" /db_xref="GI:15607735" /db_xref="GeneID:887835" /translation="MNVRRALADTSVFIGIEATRFDPDRFAGYEWGVSVVTLGELRLG VLQASGPEAAARRLSTYQLAQRFEPLGIDEAVSEAWALLVSKLRAAKLRVPINDSWIA ATAVAHGIAILTQDNDYAAMPDVEVITI" gene complement(695228..695485) /locus_tag="Rv0596c" /db_xref="GeneID:887846" CDS complement(695228..695485) /locus_tag="Rv0596c" /function="UNKNOWN" /note="Rv0596c, (MTCY19H5.26), len: 85 aa. Conserved hypothetical protein, highly similar in part to other M. tuberculosis hypothetical proteins e.g. Rv0626, Rv3181c, Rv3385c, Rv3407, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215110.1" /db_xref="GI:15607736" /db_xref="GeneID:887846" /translation="MSATIPARDLRNHTAEVLRRVAAGEEIEVLKDNRPVARIVPLKR RRQWLPAAEVIGELVRLGPDTTNLGEELRETLTQTTDDVRW" gene complement(695668..696903) /locus_tag="Rv0597c" /db_xref="GeneID:887853" CDS complement(695668..696903) /locus_tag="Rv0597c" /function="UNKNOWN" /note="Rv0597c, (MTCY19H5.25), len: 411 aa. Conserved hypothetical protein, highly similar to Rv3179 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (429 aa). Also similar to AAF76191.1|AF271296_1|AF271296 putative ATP/GTP binding protein from Mycobacterium smegmatis (428 aa); Rv2008c|YW09_MYCTU|Q10849 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (441 aa), FASTA scores: opt: 270, E(): 3.6e-11, (30.5% identity in 416 aa overlap) (N-terminus longer). Also similar to other hypothetical proteins e.g. NP_085874.1|NC_002679 hypothetical protein from Mesorhizobium loti (435 aa) (N-terminus longer). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215111.1" /db_xref="GI:15607737" /db_xref="GeneID:887853" /translation="MGVVERAIAPSVLAALADTPVVVVNGARQVGKTTLVARLDYPGS SEVVSLDDVANRDAARDDPRAFVSRPVDTLVIDEAQLEPGLFRAIKAEVDRDRRPGRF LLTGSARLLSAPDMADALVGRVEIIELWPFSQGERAGIADGFVDALFTAPRELIHGSD MRRADLVDRIATGGFPDIVARSPSRRRAWFDNYLTTATQSVIREISPIERLAEMPRVL RLCAARTGAELNVSALANDLSIPARTTAGYLALLEAAFLIHRVPAWSTNLSRKVIRRP KLVVSDSGLACHLLGVTGATLDRPGRPLGPLLETFVANEIRKQLTWSTERPSLWHFRD RGGAEVDLVLEHPDGRVCGIEVKATSTPRAEDLRGLRYLAERLDDRFQFGVLLTAAPE ATPFGPTLAALPVSTLWAG" misc_feature complement(696805..696828) /locus_tag="Rv0597c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(697154..697567) /locus_tag="Rv0598c" /db_xref="GeneID:887861" CDS complement(697154..697567) /locus_tag="Rv0598c" /function="UNKNOWN" /note="Rv0598c, (MTCY19H5.24), len: 137 aa. Conserved hypothetical protein; similar to Rv2596|Y0B5_MYCTU|Q50625 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (134 aa), FASTA scores: opt: 254, E(): 8.2e-12, (41.5% identity in 130 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215112.1" /db_xref="GI:15607738" /db_xref="GeneID:887861" /translation="MKPPLAVDTSVAIPLLVRTHTAHAAVVAWWAHREAALCGHALAE TYSVLTRLPRDLRLAPMDAARLLTERFAAPLLLSSRTTEHLPRVLAQFEITGGAVYDA LVALAAAEHRAELATRDARAKDTYEKIGVHVVVAA" gene complement(697564..697800) /locus_tag="Rv0599c" /db_xref="GeneID:887856" CDS complement(697564..697800) /locus_tag="Rv0599c" /function="UNKNOWN" /note="Rv0599c, (MTCY19H5.23), len: 78 aa. Conserved hypothetical protein, similar to Rv2595|Y0B6_MYCTU|Q50626 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (81 aa), FASTA scores: opt: 160, E(): 6.2e-07, (35.8% identity in 81 aa overlap). N-terminus shows stong similarity with N-terminus of NP_104908.1|NC_002678 hypothetical protein from Mesorhizobium loti (89 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215113.1" /db_xref="GI:15607739" /db_xref="GeneID:887856" /translation="MKAVVDAAGRIVVPKPLREALGLQPGSTVEISRYGAGLHLIPTG RTARLEEENGVLVATGETTIDDEVVFGLIDSGRK" gene complement(697904..698410) /locus_tag="Rv0600c" /db_xref="GeneID:887847" CDS complement(697904..698410) /locus_tag="Rv0600c" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv0600c, (MTCY19H5.22), len: 168 aa (probable partial CDS). Probable two-component sensor kinase (second part) (EC 2.7.3.-), similar to part (C-termini) of many others e.g. Q04943|AFQ2_STRCO sensor protein afsq2 from Streptomyces coelicolor (535 aa), FASTA scores: opt: 347, E(): 1.9e-12, (33.0% identity in 206 aa overlap); etc. Note that sequence was checked and no errors were detected, which would allow this and the upstream ORF to be joined. Start changed since first submission (- 39 aa)." /codon_start=1 /transl_table=11 /product="two component sensor kinase" /protein_id="NP_215114.2" /db_xref="GI:57116758" /db_xref="GeneID:887847" /translation="MPITPLLHESVARFAATGADITTRAEPDLFVSIDPDHLRRILTA VLDNAITHGDGEIAVTAHARDGAVDIGVRDHGPGFADHFLPVAFDRFTRADTARGGRG SGLGLAIVAALTTTHGGHANATNHPDGGAELRITLPTPRPPFHEELPRITSSDTKDPN REHDTSDQ" gene complement(698524..698994) /locus_tag="Rv0601c" /db_xref="GeneID:887868" CDS complement(698524..698994) /locus_tag="Rv0601c" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv0601c, (MTCY19H5.21), len: 156 aa (probable partial CDS). Probable two-component sensor kinase (first part) (EC 2.7.3.-), similar to part (N-termini) of others e.g. Q0375|CUTS_STRLI cuts protein from streptomyces lividans (414 aa), FASTA scores: opt: 230, E(): 3.1e-08, (39.1% identity in 115 aa overlap). Note that the sequence was checked and no errors were detected that would allow this and the downstream ORF to be joined." /codon_start=1 /transl_table=11 /product="two component sensor kinase" /protein_id="NP_215115.1" /db_xref="GI:15607741" /db_xref="GeneID:887868" /translation="MALVLAAAGAVTVVQFRDAAHEADPDGALRGLTDDITADLVREL VTILPIVLVIAAVAAYLLSRAALRPVDRIRAAAQTLTTTPHPDTDAPLPVPPTDDEIA WLATTLNTMLTRLQRALAHEQQFVADASHELRTPLALLTTELELRCAGPDPPTS" gene complement(699038..699799) /gene="tcrA" /locus_tag="Rv0602c" /db_xref="GeneID:887870" CDS complement(699038..699799) /gene="tcrA" /locus_tag="Rv0602c" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv0602c, (MTCY19H5.20), len: 253 aa. Probable tcrA, two-component DNA-binding response regulator, highly similar to others e.g. NP_107959.1|NC_002678 two-component response regulator from Mesorhizobium loti (239 aa); etc. Also similar to many other Mycobacterium tuberculosis two-component regulators e.g. Q50806|MTCY10G2.16|Rv1033c RESPONSE REGULATOR HOMOLOG TRCR (TCRV) (257 aa), FASTA score: (47.4 identity in 232 aa overlap); etc." /codon_start=1 /transl_table=11 /product="two component DNA binding transcriptional regulatory protein TCRA" /protein_id="NP_215116.1" /db_xref="GI:15607742" /db_xref="GeneID:887870" /translation="MADETTMRAGRGPGRACGRVSGVRILVVEDEPKMTALLARALTE EGHTVDTVADGRHAVAAVDGGDYDAVVLDVMLPGIDGFEVCARLRRQRVWTPVLMLTA RGAVTDRIAGLDGGADDYLTKPFNLDELFARLRALSRRGPIPRPPTLEAGDLRLDPSE HRVWRADTEIRLSHKEFTLLEALIRRPGIVHTRAQLLERCWDAAYEARSNIVDVYIRY LRDKIDRPFGVTSLETIRGAGYRLRKDGGRHALPR" gene 699856..700167 /locus_tag="Rv0603" /db_xref="GeneID:887863" CDS 699856..700167 /locus_tag="Rv0603" /function="UNKNOWN" /note="Rv0603, (MTCY19H5.19c), len: 103 aa. Possible exported protein with hydrophobic stretch at aa 7-29." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215117.1" /db_xref="GI:15607743" /db_xref="GeneID:887863" /translation="MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRAR AAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDG G" gene 700239..701189 /gene="lpqO" /locus_tag="Rv0604" /db_xref="GeneID:887879" CDS 700239..701189 /gene="lpqO" /locus_tag="Rv0604" /function="UNKNOWN" /note="Rv0604, (MTCY19H5.18c), len: 316 aa. Probable lpqO, conserved lipoprotein, highly similar to Rv2999|lppY PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (321 aa), FASTA scores: opt: 1153, E(): 0, (53.2% identity in 312 aa overlap). Contains probable N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein lpqo" /protein_id="NP_215118.1" /db_xref="GI:15607744" /db_xref="GeneID:887879" /translation="MIRRRGARMAALLAAAALALTACAGSDDKGEPDDGGDRGASLAT TSDADWKPVADILGRTGKLNDGSVYKIGFARSDLSVQTKGVTVAPALSLGSWVAFART PDGQTMLMGDLVVTEDELASVTDAVQAGGLQQTALHKHLLEQSPPIWWTHIAGHGDAA DLARAVRSALDATDTPPPASATSGQTSLDLDTAAIDEALGRSGTIAGGVYKFFIARRD PVTMSGMLIPPSMGLATALNFQPTGNGRAAINGDFVMTAAEVQDVVQALRGGGIDIVA IHNHGFDEQPRLFYMHFWAENDAVALARTLRAAVDATAAR" misc_feature 700275..700307 /gene="lpqO" /locus_tag="Rv0604" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" repeat_region complement(701247..701369) /note="123 bp imperfect direct repeat 2, 92/103 bp identical to first copy at 709425..709548, AGCCCCGGCTCGACGCGGCATAGGGTGGCCACCGTGGCCGAAGCGTTCCATGCGACC GTGCCGTGGCGAGGATCCCGGCCGAACATGGCCCATTGAACGAGGACGTCATCGCAC GACGCCTGC" repeat_region 701384..702767 /note="IS1536, len: 1384 bp. Partial copy of insertion sequence IS_1536." /mobile_element="insertion sequence:IS1536" gene 701406..702014 /locus_tag="Rv0605" /db_xref="GeneID:887872" CDS 701406..702014 /locus_tag="Rv0605" /function="PREVENTS THE COINTEGRATION OF FOREIGN DNA BEFORE INTEGRATION INTO THE CHROMOSOME." /experiment="experimental evidence, no additional details recorded" /note="Rv0605, (MTCY19H5.17c), len: 202 aa. Possible resolvase for IS_Y349 element, similar to several Mycobacterial hypothetical proteins and weakly similar to Q52563 resolvase from Pseudomonas syringae (210 aa), FASTA scores: opt: 99, E(): 3.1, (35.7% identity in 98 aa overlap). Contains PS00397 Site-specific recombinases active site and probable helix-turn helix motif from aa 9-30 (Score 1815, +5.37 SD)." /codon_start=1 /transl_table=11 /product="resolvase" /protein_id="NP_215119.1" /db_xref="GI:15607745" /db_xref="GeneID:887872" /translation="MACCRNRGMNLAAWAERNGVARVTAYRWFHAGLLPVPARKVGRL ILVDELASEAGAQPKTAVYARVSSADQKSDLDRQVARVTSWATAEQIPVDKVVTEVGS VLNGHRRKFPAVLRDLSVTRIVVEHRDRFCRFGSEYVHAALAAQGRELVVVDSAEVDD DLVWDMTEILTSMCARLYGKRAAQNRAKRAVAAAAVDDHEAA" misc_feature 701592..701618 /locus_tag="Rv0605" /note="PS00397 Site-specific recombinases active site" gene 702016..702759 /locus_tag="Rv0606" /db_xref="GeneID:887889" CDS 702016..702759 /locus_tag="Rv0606" /function="THOUGHT TO BE REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS_1536." /note="Rv0606, (MTCY19H5.16c), len: 247 aa. Possible truncated transposase for IS_1536 element, highly similar to N-terminus of other transposases from Mycobacterium tuberculosis e.g. YX16_MYCTU|Q10809|Rv2885c|MT2953|MTCY274.16c PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis (460 aa), FASTA scores: opt: 1368, E(): 0, (83.5% identity in 237 aa overlap); Rv2978c, Rv0922, Rv3827c, etc. Also similar to N-terminus of MTV002_57|Rv2792 RESOLVASE from M. tuberculosis (193 aa), FASTA score: (87.4% identity in 238 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215120.1" /db_xref="GI:15607746" /db_xref="GeneID:887889" /translation="MPRLEIPNGWCVQAFRFTLDPTAEQAHALARHFGARRKAYNWTV AQLKADIQAWRATGAQTAKPSLRVLRKRWNTVKDEVCVNAETGTVWWPECSKEAYADG IAGAVDAYWNWQQRRAGKRDGKRMGFPRFKKKGRDADRVSFTTGAMRVEPDRRHLTLP VIGCVRTHENTRRIERLIAKDRARVLAITVRRNGTRLDASVRVLVQRPQQPNVELPES RIGVDVGVRRLATVATADGACCPVLVPDG" gene 702813..703199 /locus_tag="Rv0607" /db_xref="GeneID:887883" CDS 702813..703199 /locus_tag="Rv0607" /function="UNKNOWN" /note="Rv0607, (MTCY19H5.15c), len: 128 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215121.1" /db_xref="GI:15607747" /db_xref="GeneID:887883" /translation="MGAWQTADTMGIFQALPDVWGGWRTECWEDRFEEQLIRCNGALR LPELDLAAGMDSAREWLRDRIFQRFSDSPAGQILKLSELLADVGPGLVVSDDAVTNGG ARPNNEEWARFVAACDLVRGAHAESA" gene 703244..703489 /locus_tag="Rv0608" /db_xref="GeneID:887895" CDS 703244..703489 /locus_tag="Rv0608" /function="UNKNOWN" /note="Rv0608, (MTCY19H5.14c), len: 81 aa. Conserved hypothetical protein, similar to several other M. tuberculosis hypothetical short proteins e.g. Rv0623|P96913|MTCY20H10.04 (84 aa), FASTA scores: opt: 159, E(): 1.2e-09, (43.0% identity in 86 aa overlap); Rv2760c (89 aa); Rv1740 (70 aa), etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215122.1" /db_xref="GI:15607748" /db_xref="GeneID:887895" /translation="MALNIKDPSVHQAVKQIAKITGESQARAVATAVNERLARLRSDD LAARLLAIGHKTASRMSPEAKRLDHDALLYDERGLPA" gene 703486..703887 /locus_tag="Rv0609" /db_xref="GeneID:887896" CDS 703486..703887 /locus_tag="Rv0609" /function="UNKNOWN" /note="Rv0609, (MTCY19H5.13c), len: 133 aa. Conserved hypothetical protein, similar to several Mycobacterium tuberculosis hypothetical proteins e.g. YW37_MYCTU|Q10874|Rv1982c|MT2034|MTCY39.37 CONSERVED HYPOTHETICAL PROTEIN (139 aa), FASTA scores: opt: 262, E(): 8.1e-12, (39.1% identity in 128 aa overlap); MTCY20H10.05|Rv0624|MT0652|MTCY20H10.05 CONSERVED HYPOTHETICAL PROTEIN (131 aa), FASTA score: (42.9% identity in 126 aa overlap), Rv0565c, Rv3854c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215123.1" /db_xref="GI:15607749" /db_xref="GeneID:887896" /translation="MIVDTSAIIAILRDEDDAAAYADALANADVRRLSAASYLECGIV LDSQRDPVISRALDELIEEAEFVVEPVTERQARLARAAYADFGRGSGHPAGLNFGDCL SYALAIDRREPLLWKGNDFGHTGVQRALDRR" gene 703830..704057 /locus_tag="Rv0609A" /db_xref="GeneID:3205046" CDS 703830..704057 /locus_tag="Rv0609A" /function="UNKNOWN" /note="Rv0609A, len: 75 aa. Conserved hypothetical protein, highly similar to part of upstream ORF Rv0612|MTCY19H5.09c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (201 aa), FASTA scores: opt: 154, E(): 1.8e-05, (74.3% identity in 35 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177628.1" /db_xref="GI:57116759" /db_xref="GeneID:3205046" /translation="MEGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSDDPVAA AGFVVSAVWSCGRGPGNATGSGRLPKPLRHS" repeat_region complement(703912..703985) /note="74 bp imperfect direct repeat 2, 64/73 bp identical to first copy at 706790..706863, CACAGCGGACACCACAAAGCCCGCCGCTGCCACCGGATCGTCGGAACGAAAATAGTC GTACCCGTGAGCCTCGC" gene complement(704752..705909) /locus_tag="Rv0610c" /db_xref="GeneID:887890" CDS complement(704752..705909) /locus_tag="Rv0610c" /function="UNKNOWN" /note="Rv0610c, (MTCY19H5.11), len: 385 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215124.1" /db_xref="GI:15607750" /db_xref="GeneID:887890" /translation="MDDELRGLLARYARGELSADDARRAILRYPKWRVAEIDGELETV ALDDGTPMLIAESSASDGREYSGLELVRDIAPLVGGLSFDPDEPWGSAFRPGALPELQ NWARTVELEDAVAKPGPGQRDLLYEGPWWVAVSPGTGRPAVHRADGLDVITIMTAPDA AATFRRTERHRGLDVVRLGPALWGDLAKRSDFDGVRLNPLRPLAQLWPPHVPAMLVAG CDPRPNAEPLPARTVAEIHLWLDQHGARQEKRELSNRATPVGEVTVARAWWNYDRREI AFTRVAPASDTEGLGSVPSRILCAGKLRQSIQSKLAGLPRLTWRADAWHRQRAALAVG WALELEKLVCGERVPFAALRTPEGAHLWHLEPQAFTARAIRKLRDRAASFR" gene complement(705961..706344) /locus_tag="Rv0611c" /db_xref="GeneID:887906" CDS complement(705961..706344) /locus_tag="Rv0611c" /function="UNKNOWN" /note="Rv0611c, (MTCY19H5.10), len: 127 aa. Hypothetical unknown protein. Note that first start has been taken although this overlaps slightly with the upstream ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215125.1" /db_xref="GI:15607751" /db_xref="GeneID:887906" /translation="MPDRPQHPTASRQSSMVSWNHGAAGWLHCVQCGSATNPTACLDW LPPIHARSGPMYAEHDVVVLTRDVPDKSLIAGDVGAVVGRYAAGGYEVDFTAANGCTV AVVTLAGDDIRPRRRREIPHVREVA" gene 706324..706929 /locus_tag="Rv0612" /db_xref="GeneID:887908" CDS 706324..706929 /locus_tag="Rv0612" /function="UNKNOWN" /note="Rv0612, (MTCY19H5.09c), len: 201 aa. Conserved hypothetical protein, highly similar, but in part, to downstream ORF Rv0609A CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (75 aa); and showing weak similarity with other hypothetical proteins from Mycobacterium tuberculosis. Note that first start has been taken although this overlaps slightly with the upstream ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215126.1" /db_xref="GI:15607752" /db_xref="GeneID:887908" /translation="MLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDED RLRKALWNLYWRGTANMRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLAR SGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADG YDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAIPP" repeat_region complement(706790..706863) /note="74 bp imperfect direct repeat 1, 64/73 bp identical to second copy at 703912..703985, CACATCGGACACGACGAAACCCGCCGCTGCCACCGGATCGTCGGAGCGGAAGTAGTC GTACCCGTCGGCCTCGC" gene complement(706948..709515) /locus_tag="Rv0613c" /db_xref="GeneID:887913" CDS complement(706948..709515) /locus_tag="Rv0613c" /function="UNKNOWN" /note="Rv0613c, (MTCY19H5.08), len: 855 aa. Hypothetical unknown protein. Contains a very short region with strong similarity to several preprotein translocases e.g. P47847|SECA_LISMO preprotein translocase seca subunit (836 aa), FASTA scores: opt: 138, E(): 0.18, (38.6% identity in 70 aa overlap, and 72.7% identity in 22 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215127.1" /db_xref="GI:15607753" /db_xref="GeneID:887913" /translation="MAEAFDATQAVARILAEHGPLSEDDIARRLLDSGVADPDAVLRA LRLETEWPARQLVDDRWVWLPTLLAGRVFTHRLGADEAVHDMLGVTPDLDPITTLCEH EEYGRLADGSAARIVLAGYDEELLERRGIPDEAIDPGGALLLEPGTLATLGAAAGDLV GVRLTAAGLVLERIGTAGADTSVGARLAELVDPDEPAFFPAAVWTACVDDPAAFTEPV APLREILDQHGLTHEDDWLAPGGFNFDAWRFENRCELLAFRHDLDPNDAVALYTLIKL HETMSLLLEATDPDELPRDVLATAAETATETGSDSLVDLLGDIGAALADPLLAELLVA ETVGTDSGGAAALGLLTEMLEPKVPRAARVAVRWLRAVALDRIGDVEAAERELLAAES MDTEWPLPLLDLARIASDRGDAERGLALLRRAGTEPDHPLVRLLERHRAQPRRDLGRN EACWCGSGRKYKKCHLGREALPLAERVDWLYAKASQHALSGDWTGLLAEVSYERFRYA DSDDEDALAAALADPLVLDAVLFEGGAFAEFLEVRGSLLPDDERLLAEQWLLVERSVF EVEHVQPGEGVIVRDVRTGDTHEVHERAASRQLRAGQLICARPVPAGDTMVFFGGIEP VALHERAVLIELLDDEPDPVTLVAQLSRRFAPPTLVNTEGDSLAICEASVRVDDPAGI QGALDGVYDRVDGEEPPRWIEHVTNDGMLRVRATLVLDGDTLRVETNSEPRMDRVLAT LTRLDPAMTVLDDDRRPLRNTREAAALAEQMPVTGAGAPDPDSPELAAALEEFIRDYE TSWLDQPIPALDGHTPRQAADDPTRRADLIKLLDTFPAGAGARGGMDADRLRTALGL" gene 709356..710348 /locus_tag="Rv0614" /db_xref="GeneID:887914" CDS 709356..710348 /locus_tag="Rv0614" /function="UNKNOWN" /note="Rv0614, (MTCY19H5.07c), len: 330 aa. Conserved hypothetical protein, similar in part to Mycobacterium tuberculosis hypothetical proteins e.g. YY16_MYCTU|Q10685|Rv2077c|MT2137|MTCY49.16c CONSERVED HYPOTHETICAL PROTEIN (323 aa), FASTA scores: opt: 200, E(): 0.00016, (28.3% identity in 269 aa overlap); MTCY9F9_15 FASTA score: (40.3% identity in 144 aa overlap), Rv1949c, Rv2542, etc. Several start sites are possible; first start has been chosen. Note that this ORF overlaps with the upstream ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215128.1" /db_xref="GI:15607754" /db_xref="GeneID:887914" /translation="MPAIPFQGEARAGRRPGRPRRCPAGVVRCRPRSMGHVRPGFSPR LGSHRTLRPRWPPYAAASRGLTSGTSRWGWPRLGFGVVTAPTRWTLADGRELLFFSLP GPRTSGTAAERVARHAQAQTFAGDIRQRAIQLVVSEQEVASKITAATAGIATTTFPET PSIDDTIIGNDNRDTGVRLVDVKQDGGTSPPPPFAPWDTPDGTPPPGTGLSPTLQQMI LGGDPANLTGQGLADNVQRFVQSLPANDPNTAWLRGQVADLQAHVADIEYARTHCSTN DWIDRTAQFASGAIVFSIGVLTAETGAGVVAAAAGGVGAATAGVSLLQCLVGSK" repeat_region complement(709425..709548) /note="123 bp imperfect direct repeat 1, 92/103 bp identical to second copy at 701247..701369, AGCCTCGGCTGGCCGCGGCATAAGGTGGCCACCGTGGCCGAAGCGTTCGATGCGACC CAAGCCGTGGCGAGAATCCTGGCCGAACATGGCCCATTGAGCGAGGACGACATCGCA CGACGCCTGC" repeat_region 709585..709663 /note="79 bp imperfect direct repeat 1, 73/78 bp identical to second copy at 711624..711702, TAGGGTTCGGCGTTGTGACGGCGCCGACGCGGTGGACCCTGGCCGACGGACGTGAGC TGCTGTTCTTTTCGCTGCCCGG" gene 710345..710587 /locus_tag="Rv0615" /db_xref="GeneID:887920" CDS 710345..710587 /locus_tag="Rv0615" /function="UNKNOWN" /note="Rv0615, (MTCY19H5.06c), len: 80 aa. Probable integral membrane protein." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215129.1" /db_xref="GI:15607755" /db_xref="GeneID:887920" /translation="MMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGL LVVTGQTLMAISVAFLVALGGPLVVVNHRRAERSRG" gene complement(710584..710850) /locus_tag="Rv0616c" /db_xref="GeneID:887921" CDS complement(710584..710850) /locus_tag="Rv0616c" /function="UNKNOWN" /note="Rv0616c, (MTCY19H5.05), len: 88 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215130.1" /db_xref="GI:15607756" /db_xref="GeneID:887921" /translation="MRIPGNRQCLLVQVLRQVDGSAHRLILTSLHRDARADAHRYSNG TDHAGRAADEPAETAHEPCWVAARGLASQASRAMSATYRPSSFI" gene 711006..711407 /locus_tag="Rv0617" /db_xref="GeneID:887926" CDS 711006..711407 /locus_tag="Rv0617" /function="UNKNOWN" /note="Rv0617, (MTCY19H5.04c), len: 133 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv2494, Rv3320c, Rv0749, Rv0277c, Rv2530c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215131.1" /db_xref="GI:15607757" /db_xref="GeneID:887926" /translation="MTVLLDANVLIALVVAEHVHHDAAADWLMASDTGFATCPMTQGS LVRFLVRSGQSAAAARDVVSAVQCTSRHEFWPDALSFAGVEVAGVVGHRQVTDAYLAQ LARSHDGQLATLDSGLAHLHGDVAVLIPTTT" gene 711536..712231 /gene="galTa" /locus_tag="Rv0618" /db_xref="GeneID:887932" CDS 711536..712231 /gene="galTa" /locus_tag="Rv0618" /EC_number="2.7.7.10" /function="INVOLVED IN GALACTOSE METABOLISM (LELOIR PATHWAY) [CATALYTIC ACTIVITY: UTP + alpha-D-galactose 1-phosphate = diphosphate + UDP-galactose]." /note="FIRST PART; Rv0618, (MTCY19H5.03c), len: 231 aa (probable partial CDS). Probable galTa, first part of galactose-1-phosphate uridylyltransferase (EC 2.7.7.10), highly similar to N-terminal half of other galT proteins e.g. P13212|GAL7_STRLI galactose-1-phosphate uridylyltransferase from Streptomyces lividans (354 aa), FASTA scores: opt: 296, E(): 1.4e-11, (50.8% identity in 177 aa overlap); etc. Also highly similar to N-terminal half of some UDP glucose--hexose-1-phosphate uridylyltransferases (EC 2.7.7.12). N-terminal 28 aa similar to MTCY20H11.08|Rv0627|MTCY20H11.08 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (135 aa), FASTA score: (71.4% identity in 28 overlap). Cosmid sequence is correct but there may be a frameshift mutation in this region which would allow the two ORFs to be joined. BELONGS TO THE GALACTOSE-1-PHOSPHATE URIDYLYLTRANSFERASE FAMILY 1. Note that previously known as galT'.; galT'" /codon_start=1 /transl_table=11 /product="galactose-1-phosphate uridylyltransferase galTa" /protein_id="YP_177741.1" /db_xref="GI:57116760" /db_xref="GeneID:887932" /translation="MSATPPPGGLDASVFIANERGRQLDEALPVGFCVVTAPTRWTLA DGRDLLFFSLPGHVPAPVSDRRPLPERDPAPSRLRFDRATGQWVIVAAQRQDRTYKPP AARCPLCPGPTGLSSEVPAPDYDVVVFENRFPSLAGAGIAPIGAPDGDGFVSAPGHGR CEVICFSADHTGSFAGLDPAHARLVVHAWRHRTAELTALPGVAQVFCFENRGEEIGVT LPTRTARFTPIRI" repeat_region 711624..711702 /note="79 bp imperfect direct repeat 2, 73/78 bp identical to first copy at 709585..709663, TAGGGTTCTGCGTTGTGACGGCGCCGACGCGGTGGACCCTGGCCGATGGCCGTGACC TGCTGTTCTTTTCGCTGCCCGG" gene 712174..712719 /gene="galTb" /locus_tag="Rv0619" /db_xref="GeneID:887943" CDS <712174..712719 /gene="galTb" /locus_tag="Rv0619" /EC_number="2.7.7.10" /function="INVOLVED IN GALACTOSE METABOLISM (LELOIR PATHWAY) [CATALYTIC ACTIVITY: UTP + alpha-D-galactose 1-phosphate = diphosphate + UDP-galactose]." /note="SECOND PART; Rv0619, (MTCY19H5.02c), len: 181 aa (probable partial CDS). Probable galTb, second part of galactose-1-phosphate uridylyltransferase (EC 2.7.7.10), highly similar to C-terminal half of other galT proteins e.g. P13212|GAL7_STRLI galactose-1-phosphate uridylyltransferase from Streptomyces lividans (354 aa), FASTA scores: opt: 416, E(): 5.2e-22, (43.0% identity in 186 aa overlap), etc. Cosmid sequence is correct but there may be a frameshift mutation in this region which would allow the two ORFS to be joined. BELONGS TO THE GALACTOSE-1-PHOSPHATE URIDYLYLTRANSFERASE FAMILY 1. Note that previously known as 'galT.; 'galT" /codon_start=1 /transl_table=11 /product="galactose-1-phosphate uridylyltransferase GalTb" /protein_id="YP_177742.1" /db_xref="GI:57116761" /db_xref="GeneID:887943" /translation="GDRGDPAHPHGQIYAYPYLTPRTAAMLRQARRHRKRHGDNLFAS LLAREVADGSRIVVRGELFTAFVPFAARWPVEVHIYPNRLVRNLTELNDGELDEFARI YLDVLQRFDRMYSSPLPYMSALHQFSEVQRDGYFHVELMSIRRSATKLKYLAAAESAM DAFIADVIPESVATRLRELGP" gene 712716..713807 /gene="galK" /locus_tag="Rv0620" /db_xref="GeneID:887936" CDS 712716..713807 /gene="galK" /locus_tag="Rv0620" /EC_number="2.7.1.6" /function="INVOLVED IN GALACTOSE METABOLISM (LELOIR PATHWAY) (AT THE FIRST REACTION) [CATALYTIC ACTIVITY: ATP + D-galactose = ADP + D-galactose 1-phosphate]." /note="catalyzes the formation of alpha-D-galactose 1-phosphate from D-galactose in galactose metabolism" /codon_start=1 /transl_table=11 /product="galactokinase" /protein_id="NP_215134.1" /db_xref="GI:15607760" /db_xref="GeneID:887936" /translation="MTVSYGAPGRVNLIGEHTDYNLGFALPIALPRRTVVTFTPEHTG AITARSDRADGSARIPLDTTPGQVTGWAAYAAGAIWALRGAGHPVPGGAMSITSDVEI GSGLSSSAALIGAVLGAVGAATGTRIDRLERARLAQRAENDYVGAPTGLLDHLAALFG APKTALLIDFRDITVRPVAFDPDACDVVLLLMDSRARHCHAGGEYALRRASCERAAAD LGVSSLRAVQDRGLAALGAIADPIDARRARHVLTENQRVLDFAAALADSDFTAAGQLL TASHESMREDFAITTERIDLIAESAVRAGALGARMTGGGFGGAVIALVPADRARDVAD TVRRAAVTAGYDEPAVSRTYAAPGAAECR" misc_feature 712740..712775 /gene="galK" /locus_tag="Rv0620" /note="PS00106 Galactokinase signature" misc_feature 712959..712985 /gene="galK" /locus_tag="Rv0620" /note="PS00560 Serine carboxypeptidases, histidine active site" gene 714202..715266 /locus_tag="Rv0621" /db_xref="GeneID:887901" CDS 714202..715266 /locus_tag="Rv0621" /function="UNKNOWN" /note="Rv0621, (MTCY20H10.02), len: 354 aa. Possible membrane protein; contains potential membrane spanning regions. Also contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215135.1" /db_xref="GI:15607761" /db_xref="GeneID:887901" /translation="MAGDRGADPGPANVTPGADDHAQHASPTVLCPQGHVNAWDYRFC ERCGSPIGVVPWPSEESGTRQTAPARSFVPLVVLAATLLVVAVVVTAVGYAVTRPARN DREEPSSARGAATTGVPFAQAEAASCPDDPVLEAESIDLTSDGLAVSAAFMSACAGGD VESNSALEVTVADGRRDVAAGSFDFSADPLRIEPGVPARRTLVFPPGMYWRTPDMLSG APALAATRKGRSDRSAARGGSARTTMVAAASAAPAYGSINAVAGAVLVELRDSDFPYV RVGIANRWVPQVSSKRVGLVAAGKTWTSADILRDHLALRQRFGGARLVWSGHWTTFSG PDFWVTVVGPAQPTAAEANR" misc_feature 715081..715104 /locus_tag="Rv0621" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 715370..716317 /locus_tag="Rv0622" /db_xref="GeneID:887942" CDS 715370..716317 /locus_tag="Rv0622" /function="UNKNOWN" /note="Rv0622, (MTCY20H10.03), len: 315 aa. Possible membrane protein; contains potential membrane spanning region. Shows weak similarity with Mycobacterium tuberculosis hypothetical proteins Rv1804c, Rv1810, etc. Start changed since first submission (-26 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215136.2" /db_xref="GI:57116762" /db_xref="GeneID:887942" /translation="MSFCVYCGAELADPTRCGACGAYKIGSTWHRTTTPTVGAATTAT GWRPDPTGRHEGRYFVAGQPTDLVREGDAEAVDPLGQQQLDQSGAVGVSPSAVSGWVR SGHRRLWWALAGVVAFLGLVGAGVVGTLFLNRDRESIDDKYLAALRRSGLTGEFNSDA NAIARGKQVCRQLQDGGEQQGMPVDQVAVQYYCPQFSDGFHILETITVTGSFTLKDES PNVYAPAITVSGSGCSGSAGYADIDRGTQVTVKNGQGDILATAFLQAGQGGRFLCTFP FSFEITEGEDRYVVSVSRRGEMSYSFADLKANGLSLVLG" gene 716410..716664 /locus_tag="Rv0623" /db_xref="GeneID:887970" CDS 716410..716664 /locus_tag="Rv0623" /function="UNKNOWN" /note="Rv0623, (MTCY20H10.04), len: 84 aa. Conserved hypothetical protein, highly similar to NP_384911.1|NC_003047 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (84 aa). Also similar to several Mycobacterium tuberculosis hypothetical proteins e.g MTCY28_2|Rv1740|MTCY28.02|MTCY04C12.25 CONSERVED HYPOTHETICAL PROTEIN (70 aa), FASTA score: (73.5% identity in 68 aa overlap); MTCY4C12_25|Rv0608|MTCY19H5.14c CONSERVED HYPOTHETICAL PROTEIN (81 aa), FASTA score: (73.5 identity in 68 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215137.1" /db_xref="GI:15607763" /db_xref="GeneID:887970" /translation="MALSIKHPEADRLARALAARTGETLTEAVVTALRERLARETGRA RVVPLRDELAAIRHRCAALPVVDNRSAEAILGYDERGLPA" gene 716664..717059 /locus_tag="Rv0624" /db_xref="GeneID:887951" CDS 716664..717059 /locus_tag="Rv0624" /function="UNKNOWN" /note="Rv0624, (MTCY20H10.05), len: 131 aa. Conserved hypothetical protein, highly similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1741, Rv0609, Rv2759c,Rv0565c, Rv3854c, Rv3083, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215138.1" /db_xref="GI:15607764" /db_xref="GeneID:887951" /translation="MVIDTSALVAMLSDEPDAERFEAAVEADHIRLMSTASYLETALV IEARFGEPGGRELDLWLHRAAVDLVAVHADQADAARAAYRTYGKGRHRAGLNYGDCFS YGLAKISGQPLLFKGEDFQHTDIATVALP" gene complement(717153..717893) /locus_tag="Rv0625c" /db_xref="GeneID:887967" CDS complement(717153..717893) /locus_tag="Rv0625c" /function="UNKNOWN" /note="Rv0625c, (MTCY20H10.06c), len: 246 aa. Probable conserved transmembrane protein, showing similarity with others e.g. CAB61866.1|AL133252 putative integral membrane protein from Streptomyces coelicolor (249 aa). Also similar to Rv1491c|MTCY277_13 from Mycobacterium tuberculosis. Contains potential membrane spanning regions." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215139.1" /db_xref="GI:15607765" /db_xref="GeneID:887967" /translation="MSTHNDSAPTSRRRHIVRLVVFAGFLVGMFYLVAATDVIDVAAV RGAVSATGPAAPLTYVVVSAVLGALFVPGPILAASSGLLFGPLVGVFVTLGATVGTAV VASLVGRRAGRASARALLGGERADRTDALIERCGLWAVVGQRFVPGISDAFASYAFGT FGVPLWQMAVGAFIGSAPRAFAYTALGAAIGDRSPLLASCAIAVWCVTAIIGAFAARH GYRQWRAHARGDGADGGVEDPDREVGAR" gene 718025..718285 /locus_tag="Rv0626" /db_xref="GeneID:887996" CDS 718025..718285 /locus_tag="Rv0626" /function="UNKNOWN" /note="Rv0626, (MTCY20H10.07), len: 86 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv0596c, Rv3385c, Rv3407,Rv3181c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215140.1" /db_xref="GI:15607766" /db_xref="GeneID:887996" /translation="MSEVASRELRNDTAGVLRRVRAGEDVTITVSGRPVAVLTPVRPR RRRWLSKTEFLSRLRGAQADPGLRNDLAVLAGDTTEDLGPIR" gene 718282..718689 /locus_tag="Rv0627" /db_xref="GeneID:887991" CDS 718282..718689 /locus_tag="Rv0627" /function="UNKNOWN" /note="Rv0627, (MTCY20H11.08), len: 135 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins Rv0595c and Rv0665." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215141.1" /db_xref="GI:15607767" /db_xref="GeneID:887991" /translation="MSTTPAAGVLDTSVFIATESGRQLDEALIPDRVATTVVTLAELR VGVLAAATTDIRAQRLATLESVADMETLPVDDDAARMWARLRIHLAESGRRVRINDLW IAAVAASRALPVITQDDDFAALDGAASVEIIRV" gene complement(718761..719912) /locus_tag="Rv0628c" /db_xref="GeneID:887986" CDS complement(718761..719912) /locus_tag="Rv0628c" /function="UNKNOWN" /note="Rv0628c, (MTCY20H10.09c), len: 383 aa. Conserved hypothetical protein, highly similar to Rv0874c|YZ02_MYCTU|Q10536 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (386 aa), FASTA scores: opt: 2082, E(): 0, (81.5% identity in 383 aa overlap). Also some similarity to P72543|SPU62616_1 HYPOTHETICAL PROTEIN from Synechococcus, FASTA scores: E(): 2.8e-28, (36.6 identity in 265 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215142.1" /db_xref="GI:15607768" /db_xref="GeneID:887986" /translation="MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSH TDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDF VRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGR RRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGG RPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGA IGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMF GVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD" gene complement(720005..721732) /gene="recD" /locus_tag="Rv0629c" /db_xref="GeneID:887999" CDS complement(720005..721732) /gene="recD" /locus_tag="Rv0629c" /EC_number="3.1.11.5" /function="INVOLVED IN HOMOLOGOUS RECOMBINATION." /note="Rv0629c, (MTCY20H10.10c), len: 575 aa. Probable recD, exonuclease V, alpha chain (exodeoxyribonuclease V, alpha chain) (EC 3.1.11.5) (see citation below), highly similar to other exonucleases e.g. AF157643_3|AAD46809.1|recD Escherichia coli RecD protein homolog from Mycobacterium smegmatis (554 aa); P04993|EX5A_ECOLI|B2819 exodeoxyribonuclease v 67kd polypeptide (EC 3.1.11.5) (EXONUCLEASE V ALPHA CHAIN) from Escherichia coli strain K12 (608 aa), FASTA scores: opt: 512, E(): 1.9e-24, (36.9% identity in 582 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). CONSIST OF THREE SUBUNITS; RECB|Rv0630c, RECC|Rv0631c AND RECD." /codon_start=1 /transl_table=11 /product="exonuclease V alpha chain" /protein_id="NP_215143.1" /db_xref="GI:15607769" /db_xref="GeneID:887999" /translation="MKLTDVDFAVEASGMVRAFNQAGVLDVSDVHVAQRLCALAGESD ERVALAVAVAVRALRAGSVCVDLLSIARVAGHDDLPWPDPADWLAAVRASPLLADPPV LHLYDDRLLYLDRYWREEEQVCADLLALLTSRRPAGVPDLRRLFPTGFDEQRRAAEIA LSQGVTVLTGGPGTGKTTTVARLLALVAEQAELAGEPRPRIALAAPTGKAAARLAEAV RREMAKLDATDRARLGDLHAVTLHRLLGAKPGARFRQDRQNRLPHNVIVVDETSMVSL TLMARLAEAVRPGARLILVGDADQLASVEAGAVLADLVDGFSVRDDALVAQLRTSHRF GKVIGTLAEAIRAGDGDAVLGLLRSGEERIEFVDDEDPAPRLRAVLVPHALRLREAAL LGASDVALATLDEHRLLCAHRDGPTGVLHWNRRVQAWLAEETGQPPWTPWYAGRPLLV TANDYGLRVYNGDTGVVLAGPTGLRAVISGASGPLDVATGRLGDVETMHAMTIHKSQG SQVDEVTVLMPQEDSRLLTRELLYTAVTRAKRKVRVVGSEASVRAAIARRAVRASGLR MRLQSTGCG" misc_feature complement(721202..721225) /gene="recD" /locus_tag="Rv0629c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(721729..725013) /gene="recB" /locus_tag="Rv0630c" /db_xref="GeneID:888004" CDS complement(721729..725013) /gene="recB" /locus_tag="Rv0630c" /EC_number="3.1.11.5" /function="INVOLVED IN HOMOLOGOUS RECOMBINATION." /note="Rv0630c, (MTCY20H10.11c), len: 1094 aa. Probable recB, exonuclease V, beta chain (exodeoxyribonuclease V, beta chain) (EC 3.1.11.5) (see citation below), highly similar to other exonucleases e.g. AF157643_2|recB|AAD46808.1 Escherichia coli RecB protein homolog from Mycobacterium smegmatis (1083 aa); P08394|EX5B_ECOLI|RORA|B2820 exodeoxyribonuclease v 135 kDa polypeptide (EC 3.1.11.5) (EXONUCLEASE V BETA CHAIN) from Escherichia coli strain K12 (1180 aa), FASTA scores: opt: 289, E(): 4.3e-11, (29.5 identity in 1059 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE HELICASE FAMILY, UVRD SUBFAMILY. CONSIST OF THREE SUBUNITS; RECB, RECC|Rv0631c AND RECD|Rv0629c." /codon_start=1 /transl_table=11 /product="exonuclease V beta chain" /protein_id="NP_215144.1" /db_xref="GI:15607770" /db_xref="GeneID:888004" /translation="MDRFELLGPLPREGTTTVLEASAGTGKTFALAGLVTRYLAETAA TLDEMLLITFNRAASRELRERVRGQIVEAVGALQGDAPPSGELVEHLLRGSDAERAQK RSRLRDALANFDAATIATTHEFCGSVLKSLGVAGDNAADVELKESLTDLVTEIVDDRY LANFGRQETDPELTYAEALALALAVVDDPCAQLRPPDPEPGSKAAVRLRFAAEVLEEL ERRKGRLRAQGFNDLLIRLATALEAADSPARDRMRERWRIVLVDEFQDTDPMQWRVLE RAFSRHSALILIGDPKQAIYGFRGGDIHTYLKAAGTADARYTLGVNWRSDRALVESLQ TVLRDATLGHADIVVRGTDAHHAGHRLASAPRPAPFRLRVVKRHTLGYDGTAHVPIEA LRRHIPDDLAADVAALLASGATFAGRPVVAADIAVIVEHHKDARACRNALAEAGIPAI YTGDTDVFASQAAKDWLCLLEAFDAPQRSGLVRAAACTMFFGETAESLAAEGDALTDR VAGTLREWADHARHRGVAAVFQAAQLAGMGRRVLSQRGGERDLTDLAHIAQLLHEAAH RERLGLPGLRDWLRRQAKAGAGPPEHNRRLDSDAAAVQIMTVFVAKGLQFPIVYLPFA FNRNVRSDDILLYHDDGTRCLYIGGKDGGAQRRTVEGLNRVEAAHDNLRLTYVALTRA QSQVVAWWAPTFDEVNGGLSRLLRGRRPGQSQVPDRCTPRVTDEQAWAVFAQWEAAGG PSVEESVIGARSSLEKPVPVPGFEVRHFHRRIDTTWRRTSYSDLVRGSEAVTVTSEPA AGGRADEVEIAVVAAPGSGADLTSPLAALPSGASFGSLVHAVLETADPAAPDLAAELE AQVRRHAPWWTVDVDHAQLAPELARALLPMHDTPLGPAAAALTLRQIGVRDRLRELDF EMPLAGGDLRGRSPDVSLADVGELLASHLPGDDPLSPYADRLGSAGLGDQPLRGYLAG SIDVVLRLPGQRYLVVDYKTNHLGDTAADYGFERLTEAMLHSDYPLQALLYVVVLHRF LRWRQRDYAPARHLGGVLYLFVRGMCGAATPVTAGHPAGVFTWNPPTALVVALSDLLD RGRLQS" misc_feature complement(724930..724953) /gene="recB" /locus_tag="Rv0630c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(725013..728306) /gene="recC" /locus_tag="Rv0631c" /db_xref="GeneID:888008" CDS complement(725013..728306) /gene="recC" /locus_tag="Rv0631c" /EC_number="3.1.11.5" /function="INVOLVED IN HOMOLOGOUS RECOMBINATION." /note="Rv0631c, (MTCY20H10.12c), len: 1097 aa. Probable recC, exonuclease V, gamma chain (exodeoxyribonuclease V, gamma chain) (EC 3.1.11.5) (see Mizrahi & Andersen 1998), highly similar to other exonucleases e.g. AF157643_1|RecC|AAD46807.1 Escherichia coli RecC protein homolog from Mycobacterium smegmatis (1085 aa); P07648|EX5C_ECOLI|B2822 exodeoxyribonuclease v 125 kDa polypeptide (EC 3.1.11.5) (EXONUCLEASE V GAMMA CHAIN) from Escherichia coli strain K12 (1122 aa), FASTA scores: opt: 954, E(): 0, (29.2% identity in 1109 aa overlap); etc. CONSIST OF THREE SUBUNITS; RECB|Rv0630c, RECC AND RECD|Rv0629c. The transcription of this CDS seems to be activated specifically in host granulomas (see Ramakrishnan et al., 2000)." /codon_start=1 /transl_table=11 /product="exonuclease V gamma chain" /protein_id="NP_215145.1" /db_xref="GI:15607771" /db_xref="GeneID:888008" /translation="MALHLHRAERTDLLADGLGALLADPQPDPFAQELVLVAARGVER WLSQRLSLVLGCGPGRADGVCAGIAFRNPQSLIAEITGTLDDDPWSPEALAWPLLAVI DASLDEPWCRTLASHLGHFATTDAEAELRRGRRYSVARRLAGLFASYARQRPGLLAAW LDGDLGELPGDLAWQPPLWRALVTTVGADPPHVRHDKTIARLRDGPADLPARLSLFGH TRLACTDVQLLDALAVHHDLHLWLPHPSDELWRALAGFQGADGLLPRRQDTSRRAAQH PLLETLGRDVRELQRALPAARATDEFLGATTKPDTLLGWLQADIAGNAPRPAGRSLSD ADRSVQVHACHGPARQIDVLREVLLGLLEDDPTLQPRDIVVMCPDIDTYAPLIVAGFG LGEVAGDCHPAHRLRVRLADRALTQTNPLLSVAAELLTIAETRATASQLLNLAQAAPV RAKFGFADDDLDTITTWVRESNIRWGFDPTHRRRYGLDTVVHNTWRFGLDRILTGVAM SEDSQAWLDTALPLDDVGSNRVELAGRLAEFVERLHHVVGGLSGARPLVAWLDALATG IDLLTACNDGWQRAQVQREFADVLARAGSRAAPLLRLPDVRALLDAQLAGRPTRANFR TGTLTVCTMVPMRSVPHRVVCLVGLDDGVFPRLSHPDGDDVLAREPMTGERDIRSEDR QLLLDAIGAATQTLVITYTGADERTGQPRPPAVPLAELLDALDQTTSAPVRERILVTH PLQPFDRKNVTPGALLGAKPFTFDPAALAAAQAAAGKRCPPTAFISGRLPAPPAADVT LADLLDFFKDPVKGFFRALDYTLPWDVDTVEDSIPVQVDALAEWTVGERMLRDMLRGL HPDDAAHSEWRRGTLPPGRLGVRRAKEIRNRARDLAAAALAHRDGHGQAHDVDVDLGD GRRLSGTVTPVFGGRTVSVTYSKLAPKHVLPAWIGLVTLAAQEPGREWSALCIGRSKT RNHIARRLFVPPPDPVAVLRELVLLYDAGRREPLPLPLKTSCAWAQARRDGQDPYPPA RECWQTNRFRPGDDDAPAHVRAWGPRAPFEVLLGKPRAGEEVAGEETRLGALAARLWL PLLAAEGSV" gene complement(728583..729278) /gene="echA3" /locus_tag="Rv0632c" /db_xref="GeneID:888015" CDS complement(728583..729278) /gene="echA3" /locus_tag="Rv0632c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215146.1" /db_xref="GI:15607772" /db_xref="GeneID:888015" /translation="MSDPVSYTRKDSIAVISMDDGKVNALGPAMQQALNAAIDNADRD DVGALVITGNGRVFSGGFDLKILTSGEVQPAIDMLRGGFELAYRLLSYPKPVVMACTG HAIAMGAFLLSCGDHRVAAHAYNIQANEVAIGMTIPYAALEIMKLRLTRSAYQQATGL AKTFFGETALAAGFIDEIALPEVVVSRAEEAAREFAGLNQHAHAATKLRSRADALTAI RAGIDGIAAEFGL" gene complement(729327..730166) /locus_tag="Rv0633c" /db_xref="GeneID:888001" CDS complement(729327..730166) /locus_tag="Rv0633c" /function="UNKNOWN" /note="Rv0633c, (MTCY20H11.14c), len: 279 aa. Possible exported protein; has hydrophobic stretch at aa 23-41." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215147.1" /db_xref="GI:15607773" /db_xref="GeneID:888001" /translation="MVDSMGWVLSSWHEVTGVDSGTWLAWAAWAALGLGVVALVVTKR QIQRNRRLAAEQTRPYVAMFMEPHVADWHVIELVVRNFGRTAAYDVRFSFPNPPTVAQ YENAANGYADVVELRLPQELPMLAPGQEWRMVWDSALDRAEIGRGIESRFPGTVTYYD RPEQPRRWRFWRRGRRPLETKVVLDWDALPPVARIELMTTHDLAKREKQKLELLRSLL TYFHYASKETRPDVFRSEIDRINRAAAETQDRWRARQVEVPTEVSQRSEGQGPQPTRI PAG" gene complement(730320..731033) /locus_tag="Rv0634c" /db_xref="GeneID:888016" CDS complement(730320..731033) /locus_tag="Rv0634c" /EC_number="3.1.2.6" /function="THOUGHT TO BE INVOLVED IN GLYOXAL PATHWAY. THIOLESTERASE THAT CATALYSES THE HYDROLYSIS OF S-D-LACTOYL-GLUTATHIONE TO FORM GLUTATHIONE AND D-LACTIC ACID [CATALYTIC ACTIVITY: (S)-(2-hydroxyacyl)glutathione + H2O = glutathione + a 2-hydroxy acid anion]." /note="Rv0634c, (MTCY20H10.15c), len: 237 aa. Possible glyoxalase II (EC 3.1.2.6), equivalent to NP_302290.1|NC_002677 putative glyoxylase II from Mycobacterium leprae (238 aa); and similar to U00011_3|Y0BK_MYCLE|Q49649 hypothetical 23.9 kDa protein from Mycobacterium leprae (218 aa), FASTA scores: opt: 281, E(): 3.9e-12, (31.8% identity in 201 aa overlap). Also similar to other glyoxalases and metallo-beta-lactamase family proteins e.g. NP_386770.1|NC_003047 PUTATIVE HYDROXYACYLGLUTATHIONE HYDROLASE from Sinorhizobium meliloti (256 aa); etc. Also similar to other putative glyoxylases from Mycobacterium tuberculosis e.g. Rv1637c. BELONGS TO THE GLYOXALASE II FAMILY. COFACTOR: BINDS TWO ZINC IONS." /codon_start=1 /transl_table=11 /product="glyoxalase II" /protein_id="NP_215148.1" /db_xref="GI:15607774" /db_xref="GeneID:888016" /translation="MSKDRLYFRQLLSGRDFAVGDMFATQMRNFAYLIGDRTTGDCVV VDPAYAAGDLLDALESDDMQLSGVLVTHHHPDHVGGSMMGFQLPGLAELLERASVPVH VNTHEALWVSRVTGIPVGDLITHEHGDKVSVGDIDIELLHTPGHTPGSQCFLLDGRLV AGDTLFLEGCGRTDFPGGDSDEMYRSLRQLAELPGDPTVFPGHWYSAEPSASLSEVKR SNYVYRPASLDQWRMLMGG" gene 731113..731364 /locus_tag="Rv0634A" /db_xref="GeneID:3205041" CDS 731113..731364 /locus_tag="Rv0634A" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0634A, len: 83 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177629.1" /db_xref="GI:57116763" /db_xref="GeneID:3205041" /translation="MGSDCGCGGYLWSMLKRVEIEVDDDLIQKVIRRYRVKGAREAVN LALRTLLGEADTAEHGHDDEYDEFSDPNAWVPRRSRDTG" gene 731494..731566 /locus_tag="Rvnt06" /note="tRNA-Thr(GGT)" /db_xref="GeneID:2700463" tRNA 731494..731566 /locus_tag="Rvnt06" /product="tRNA-Thr" /note="codon recognized: ACC" /anticodon=(pos:731527..731529,aa:Thr) /db_xref="GeneID:2700463" gene 731603..731676 /locus_tag="Rvnt07" /note="tRNA-Met(CAT)" /db_xref="GeneID:2700431" tRNA 731603..731676 /locus_tag="Rvnt07" /product="tRNA-Met" /note="codon recognized: AUG" /anticodon=(pos:731637..731639,aa:Met) /db_xref="GeneID:2700431" gene 731712..731879 /gene="rpmG" /locus_tag="Rv0634B" /db_xref="GeneID:3205042" CDS 731712..731879 /gene="rpmG" /locus_tag="Rv0634B" /function="INVOLVED IN TRANSLATION MECHANISM." /note="in Escherichia coli BM108, a mutation that results in lack of L33 synthesis had no effect on ribosome synthesis or function; there are paralogous genes in several bacterial genomes, and a CXXC motif for zinc binding and an upstream regulation region of the paralog lacking this motif that are regulated by zinc similar to other ribosomal proteins like L31; the proteins in this group have the CXXC motif" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L33" /protein_id="YP_177630.1" /db_xref="GI:57116764" /db_xref="GeneID:3205042" /translation="MASSTDVRPKITLACEVCKHRNYITKKNRRNDPDRLELKKFCPN CGKHQAHRETR" gene 731930..732406 /locus_tag="Rv0635" /db_xref="GeneID:888032" CDS 731930..732406 /locus_tag="Rv0635" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="functions as a heterodimer along with HadB in fatty acid biosynthesis; fatty acid synthase type II; FAS-II" /codon_start=1 /transl_table=11 /product="(3R)-hydroxyacyl-ACP dehydratase subunit HadA" /protein_id="NP_215149.1" /db_xref="GI:15607775" /db_xref="GeneID:888032" /translation="MALSADIVGMHYRYPDHYEVEREKIREYAVAVQNDDAWYFEEDG AAELGYKGLLAPLTFICVFGYKAQAAFFKHANIATAEAQIVQVDQVLKFEKPIVAGDK LYCDVYVDSVREAHGTQIIVTKNIVTNEEGDLVQETYTTLAGRAGEDGEGFSDGAA" gene 732393..732821 /locus_tag="Rv0636" /db_xref="GeneID:888031" CDS 732393..732821 /locus_tag="Rv0636" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="functions as a heterodimer along with HadA or HadC in fatty acid biosynthesis; fatty acid synthase type II; FAS-II" /codon_start=1 /transl_table=11 /product="(3R)-hydroxyacyl-ACP dehydratase subunit HadB" /protein_id="NP_215150.1" /db_xref="GI:15607776" /db_xref="GeneID:888031" /translation="MALREFSSVKVGDQLPEKTYPLTRQDLVNYAGVSGDLNPIHWDD EIAKVVGLDTAIAHGMLTMGIGGGYVTSWVGDPGAVTEYNVRFTAVVPVPNDGKGAEL VFNGRVKSVDPESKSVTIALTATTGGKKIFGRAIASAKLA" gene 732825..733325 /locus_tag="Rv0637" /db_xref="GeneID:888019" CDS 732825..733325 /locus_tag="Rv0637" /function="UNKNOWN" /note="functions as a heterodimer along with HadB in fatty acid biosynthesis; fatty acid synthase type II; FAS-II" /codon_start=1 /transl_table=11 /product="(3R)-hydroxyacyl-ACP dehydratase subunit HadC" /protein_id="NP_215151.1" /db_xref="GI:15607777" /db_xref="GeneID:888019" /translation="MALKTDIRGMIWRYPDYFIVGREQCREFARAVKCDHPAFFSEEA AADLGYDALVAPLTFVTILAKYVQLDFFRHVDVGMETMQIVQVDQRFVFHKPVLAGDK LWARMDIHSVDERFGADIVVTRNLCTNDDGELVMEAYTTLMGQQGDGSARLKWDKESG QVIRTA" gene 733524..733596 /locus_tag="Rvnt08" /note="tRNA-Trp(CCA)" /db_xref="GeneID:2700453" tRNA 733524..733596 /locus_tag="Rvnt08" /product="tRNA-Trp" /note="codon recognized: UGG" /anticodon=(pos:733557..733559,aa:Trp) /db_xref="GeneID:2700453" gene 733737..734222 /gene="secE" /locus_tag="Rv0638" /db_xref="GeneID:888042" CDS 733737..734222 /gene="secE" /locus_tag="Rv0638" /function="ESSENTIAL FOR PROTEIN EXPORT." /note="forms a complex with SecY and SecG; SecYEG forms a putative protein-conducting channel to which secA binds and translocates targeted polypeptides across the cytoplasmic membrane, a process driven by ATP and a proton-motive force" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecE" /protein_id="YP_177743.1" /db_xref="GI:57116765" /db_xref="GeneID:888042" /translation="MSDEGDVADEAVADGAENADSRGSGGRTALVTKPVVRPQRPTGK RSRSRAAGADADVDVEEPSTAASEATGVAKDDSTTKAVSKAARAKKASKPKARSVNPI AFVYNYLKQVVAEMRKVIWPNRKQMLTYTSVVLAFLAFMVALVAGADLGLTKLVMLVF G" misc_feature 734058..734144 /gene="secE" /locus_tag="Rv0638" /note="PS01067 Protein secE/sec61-gamma signature" gene 734254..734970 /gene="nusG" /locus_tag="Rv0639" /db_xref="GeneID:888039" CDS 734254..734970 /gene="nusG" /locus_tag="Rv0639" /function="INFLUENCES TRANSCRIPTION TERMINATION AND ANTITERMINATION. ACTS AS A COMPONENT OF THE TRANSCRIPTION COMPLEX, AND INTERACTS WITH THE TERMINATION FACTOR RHO AND RNA POLYMERASE." /experiment="experimental evidence, no additional details recorded" /note="Modulates Rho-dependent transcription termination" /codon_start=1 /transl_table=11 /product="transcription antitermination protein NusG" /protein_id="NP_215153.1" /db_xref="GI:15607779" /db_xref="GeneID:888039" /translation="MTTFDGDTSAGEAVDLTEANAFQDAAAPAEEVDPAAALKAELRS KPGDWYVVHSYAGYENKVKANLETRVQNLDVGDYIFQVEVPTEEVTEIKNGQRKQVNR KVLPGYILVRMDLTDDSWAAVRNTPGVTGFVGATSRPSALALDDVVKFLLPRGSTRKA AKGAASTAAAAEAGGLERPVVEVDYEVGESVTVMDGPFATLPATISEVNAEQQKLKVL VSIFGRETPVELTFGQVSKI" misc_feature 734914..734943 /gene="nusG" /locus_tag="Rv0639" /note="PS01014 Transcription termination factor nusG signature" gene 735022..735450 /gene="rplK" /locus_tag="Rv0640" /db_xref="GeneID:888045" CDS 735022..735450 /gene="rplK" /locus_tag="Rv0640" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="binds directly to 23S ribosomal RNA" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L11" /protein_id="NP_215154.1" /db_xref="GI:15607780" /db_xref="GeneID:888045" /translation="MAPKKKVAGLIKLQIVAGQANPAPPVGPALGQHGVNIMEFCKAY NAATENQRGNVIPVEITVYEDRSFTFTLKTPPAAKLLLKAAGVAKGSAEPHKTKVAKV TWDQVREIAETKKTDLNANDVDAAAKIIAGTARSMGITVE" misc_feature 735403..735447 /gene="rplK" /locus_tag="Rv0640" /note="PS00359 Ribosomal protein L11 signature" gene 735517..736224 /gene="rplA" /locus_tag="Rv0641" /db_xref="GeneID:888043" CDS 735517..736224 /gene="rplA" /locus_tag="Rv0641" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA AND IS LOCATED IN THE NEIGHBORHOOD OF THE SITE WHERE ELONGATION FACTOR TU IS BOUND TO THE RIBOSOME." /experiment="experimental evidence, no additional details recorded" /note="in Escherichia coli and Methanococcus, this protein autoregulates expression; the binding site in the mRNA mimics the binding site in the 23S rRNA" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L1" /protein_id="NP_215155.1" /db_xref="GI:15607781" /db_xref="GeneID:888043" /translation="MSKTSKAYRAAAAKVDRTNLYTPLQAAKLAKETSSTKQDATVEV AIRLGVDPRKADQMVRGTVNLPHGTGKTARVAVFAVGEKADAAVAAGADVVGSDDLIE RIQGGWLEFDAAIATPDQMAKVGRIARVLGPRGLMPNPKTGTVTADVAKAVADIKGGK INFRVDKQANLHFVIGKASFDEKLLAENYGAAIDEVLRLKPSSSKGRYLKKITVSTTT GPGIPVDPSITRNFAGE" gene complement(736298..737203) /gene="mmaA4" /locus_tag="Rv0642c" /db_xref="GeneID:888056" CDS complement(736298..737203) /gene="mmaA4" /locus_tag="Rv0642c" /EC_number="2.1.1.-" /function="INVOLVED IN MYCOLIC ACIDS MODIFICATION. CATALYZES UNUSUAL S-ADENOSYL-METHIONINE-DEPENDENT TRANSFORMATION OF A CIS-OLEFIN MYCOLIC ACID INTO A SECONDARY ALCOHOL. CATALYZES INTRODUCTION OF A HYDROXYL GROUP AT THE DISTAL POSITION ON MYCOLIC ACID CHAINS TO PRODUCE THE HYDROXYL MYCOLATE. Mycolic acids represent a major constituent of the mycobacterial cell wall complex. Methyl transfer results in formation of a secondary hydroxy group with an adjacent methyl branch; Olefinic mycolic acid methyl transferase." /experiment="experimental evidence, no additional details recorded" /note="Rv0642c, (MTCY20H10.23c), len: 301 aa. mmaA4, methoxy mycolic acid synthase 4 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to AAC44876|AAC44876.1|cmaA methyl transferase (mycolic acid modification protein) from Mycobacterium bovis BCG strain Pasteur (298 aa); NP_302280.1|NC_002677 methyl mycolic acid synthase 4 from Mycobacterium leprae (298 aa); and highly similar to others from Mycobacteria e.g. downstream ORF P72027|mmaA3|Rv0643c|MTCY20H10.24c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 3 from Mycobacterium tuberculosis (293 aa)." /codon_start=1 /transl_table=11 /product="methoxy mycolic acid synthase" /protein_id="NP_215156.1" /db_xref="GI:15607782" /db_xref="GeneID:888056" /translation="MTRMAEKPISPTKTRTRFEDIQAHYDVSDDFFALFQDPTRTYSC AYFEPPELTLEEAQYAKVDLNLDKLDLKPGMTLLDIGCGWGTTMRRAVERFDVNVIGL TLSKNQHARCEQVLASIDTNRSRQVLLQGWEDFAEPVDRIVSIEAFEHFGHENYDDFF KRCFNIMPADGRMTVQSSVSYHPYEMAARGKKLSFETARFIKFIVTEIFPGGRLPSTE MMVEHGEKAGFTVPEPLSLRPHYIKTLRIWGDTLQSNKDKAIEVTSEEVYNRYMKYLR GCEHYFTDEMLDCSLVTYLKPGAAA" gene complement(737268..738149) /gene="mmaA3" /locus_tag="Rv0643c" /db_xref="GeneID:888058" CDS complement(737268..738149) /gene="mmaA3" /locus_tag="Rv0643c" /EC_number="2.1.1.-" /function="INVOLVED IN MYCOLIC ACIDS MODIFICATION. CATALYZES UNUSUAL S-ADENOSYL-METHIONINE-DEPENDENT TRANSFORMATION OF A CIS-OLEFIN MYCOLIC ACID INTO A SECONDARY ALCOHOL. CATALYZES INTRODUCTION OF A HYDROXYL GROUP AT THE DISTAL POSITION ON MYCOLIC ACID CHAINS TO PRODUCE THE HYDROXYL MYCOLATE. Mycolic acids represent a major constituent of the mycobacterial cell wall complex. Methyl transfer results in formation of a secondary hydroxy group with an adjacent methyl branch; Olefinic mycolic acid methyl transferase." /experiment="experimental evidence, no additional details recorded" /note="Rv0643c, (MTCY20H10.24c), len: 293 aa. mmaA3, methoxy mycolic acid synthase 3 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to AAC44875|AAC44875.1|cmaB methyl transferase (mycolic acid modification protein) from Mycobacterium bovis BCG strain Pasteur (289 aa); and highly similar to others from Mycobacteria e.g. upstream ORF P72028|mmaA4|Rv0642c|MTCY20H10.23c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 4 from Mycobacterium tuberculosis (301 aa)." /codon_start=1 /transl_table=11 /product="methoxy mycolic acid synthase" /protein_id="NP_215157.1" /db_xref="GI:15607783" /db_xref="GeneID:888058" /translation="MSDNSTGTTKSRSNVDDVQAHYDLSDAFFALFQDPTRTYSCAYF ERDDMTLHEAQVAKLDLTLGKLGLEPGMTLLDVGCGWGSVMKRAVERYDVNVVGLTLS KNQHAYCQQVLDKVDTNRSHRVLLSDWANFSEPVDRIVTIEAIEHFGFERYDDFFKFA YNAMPADGVMLLHSITGLHVKQVIERGIPLTMEMAKFIRFIVTDIFPGGRLPTIETIE EHVTKAGFTITDIQSLQPHFARTLDLWAEALQAHKDEAIEIQSAEVYERYMKYLTGCA KAFRMGYIDCNQFTLAK" gene complement(738297..739160) /gene="mmaA2" /locus_tag="Rv0644c" /db_xref="GeneID:888061" CDS complement(738297..739160) /gene="mmaA2" /locus_tag="Rv0644c" /EC_number="2.1.1.-" /function="INVOLVED IN MYCOLIC ACIDS MODIFICATION. CATALYZES UNUSUAL S-ADENOSYL-METHIONINE-DEPENDENT TRANSFORMATION OF A CIS-OLEFIN MYCOLIC ACID INTO A SECONDARY ALCOHOL. CATALYZES INTRODUCTION OF A HYDROXYL GROUP AT THE DISTAL POSITION ON MYCOLIC ACID CHAINS TO PRODUCE THE HYDROXYL MYCOLATE. Mycolic acids represent a major constituent of the mycobacterial cell wall complex. Methyl transfer results in formation of a secondary hydroxy group with an adjacent methyl branch; Olefinic mycolic acid methyl transferase. HAS ALSO CYCLOPROPANE FUNCTION." /note="Rv0644c, (MTCY20H10.25c), len: 287 aa. mmaA2, methoxy mycolic acid synthase 2 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to AAC44874|AAC44874.1|cmaC methyl transferase (mycolic acid modification protein) from Mycobacterium bovis BCG strain Pasteur (287 aa); and highly similar to others from Mycobacteria e.g. upstream ORF P72028|mmaA4|Rv0642c|MTCY20H10.23c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 4 from Mycobacterium tuberculosis (301 aa). Note that alternative start is at position 739247." /codon_start=1 /transl_table=11 /product="methoxy mycolic acid synthase" /protein_id="NP_215158.1" /db_xref="GI:15607784" /db_xref="GeneID:888061" /translation="MVNDLTPHFEDVQAHYDLSDDFFRLFLDPTQTYSCAHFEREDMT LEEAQIAKIDLALGKLGLQPGMTLLDIGCGWGATMRRAIAQYDVNVVGLTLSKNQAAH VQKSFDEMDTPRDRRVLLAGWEQFNEPVDRIVSIGAFEHFGHDRHADFFARAHKILPP DGVLLLHTITGLTRQQMVDHGLPLTLWLARFLKFIATEIFPGGQPPTIEMVEEQSAKT GFTLTRRQSLQPHYARTLDLWAEALQEHKSEAIAIQSEEVYERYMKYLTGCAKLFRVG YIDVNQFTLAK" gene complement(739327..740187) /gene="mmaA1" /locus_tag="Rv0645c" /db_xref="GeneID:888060" CDS complement(739327..740187) /gene="mmaA1" /locus_tag="Rv0645c" /EC_number="2.1.1.-" /function="INVOLVED IN MYCOLIC ACIDS MODIFICATION. CATALYZES UNUSUAL S-ADENOSYL-METHIONINE-DEPENDENT TRANSFORMATION OF A CIS-OLEFIN MYCOLIC ACID INTO A SECONDARY ALCOHOL. CATALYZES INTRODUCTION OF A HYDROXYL GROUP AT THE DISTAL POSITION ON MYCOLIC ACID CHAINS TO PRODUCE THE HYDROXYL MYCOLATE. Mycolic acids represent a major constituent of the mycobacterial cell wall complex. Methyl transfer results in formation of a secondary hydroxy group with an adjacent methyl branch; Olefinic mycolic acid methyl transferase." /note="Rv0645c, (MTCY20H10.26c), len: 286 aa. mmaA1, methoxy mycolic acid synthase 1 (methyltransferase) (EC 2.1.1.-) (see citations below). Equivalent to NP_302279.1|NC_002677 methyl mycolic acid synthase 1 from Mycobacterium leprae (286 aa); and highly similar to others from Mycobacteria e.g. upstream ORF P72028|mmaA4|Rv0642c|MTCY20H10.23c PUTATIVE METHOXY MYCOLIC ACID SYNTHASE 4 from Mycobacterium tuberculosis (301 aa)." /codon_start=1 /transl_table=11 /product="methoxy mycolic acid synthase" /protein_id="NP_215159.1" /db_xref="GI:15607785" /db_xref="GeneID:888060" /translation="MAKLRPYYEESQSAYDISDDFFALFLDPTWVYTCAYFERDDMTL EEAQLAKVDLALDKLNLEPGMTLLDVGCGWGGALVRAVEKYDVNVIGLTLSRNHYERS KDRLAAIGTQRRAEARLQGWEEFEENVDRIVSFEAFDAFKKERYLTFFERSYDILPDD GRMLLHSLFTYDRRWLHEQGIALTMSDLRFLKFLRESIFPGGELPSEPDIVDNAQAAG FTIEHVQLLQQHYARTLDAWAANLQAARERAIAVQSEEVYNNFMHYLTGCAERFRRGL INVAQFTMTK" gene complement(740234..741139) /gene="lipG" /locus_tag="Rv0646c" /db_xref="GeneID:888065" CDS complement(740234..741139) /gene="lipG" /locus_tag="Rv0646c" /EC_number="3.1.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /note="Rv0646c, (MTCY20H10.27c), len: 301 aa. Probable lipG, lipase/esterase (EC 3.1.-.-), equivalent to NP_302278.1|NC_002677 probable hydrolase from Mycobacterium leprae (304 aa). Also highly similar to various hydrolases, especially lipases e.g. AA61351.1|X88895 carboxyl esterase from Acinetobacter calcoaceticus (312 aa), FASTA scores: opt: 867, E(): 0, (50.2% identity in 279 aa overlap); etc. Also similar to transferases e.g. P77026 MACROLIDE 2'-PHOSPHOTRANSFERASE II from Escherichia coli (279 aa), FASTA scores: E(): 1.3e-14, (32.5% identity in 286 aa overlap). Similar to M. tuberculosis non-heme bromoperoxidases and epoxide hydrolases." /codon_start=1 /transl_table=11 /product="lipase/esterase LipG" /protein_id="NP_215160.1" /db_xref="GI:15607786" /db_xref="GeneID:888065" /translation="MDIRSGTAVSGDVKLYYEDMGDLDHPPVLLIMGLGAQMLLWRTD FCARLVAKGLRVIRYDNRDVGLSTKTERHRPGQPLATRLVRSWLGLPSQAAYTLEDMA ADAAALLDHLDVKHAHVVGASMGGMIAQIFAARFAQRTKTLAVIFSSNNHRFLPPPAP RALLALLTGPPPDSPRDVIVDNAVRVSKIIGSPAYPIPEDQVRAEAAESYDRNFHPWG IAQQFSAILGSGSLLRYDRRIVAPTVVIHGRADKLMRPFGGRAVARAINGARLVLIDG MGHDLPRQLWDRVIGELTRNFSEAG" gene complement(741151..742617) /locus_tag="Rv0647c" /db_xref="GeneID:888070" CDS complement(741151..742617) /locus_tag="Rv0647c" /function="UNKNOWN" /note="Rv0647c, (MTCY20H10.28c), len: 488 aa. Conserved hypothetical protein, equivalent to NP_302277.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (448 aa). Also showing similarity to a variety of hypothetical ABC1-LIKE proteins or conserved hypothetical proteins e.g. D90908_28|P73627 ABC1-LIKE PROTEIN from Synechocystis (585 aa), FASTA scores: E(): 1.8e-31, (29.1% identity in 474 aa overlap); Q55884 HYPOTHETICAL6 5.0 KD PROTEIN (567 aa), FASTA scores: opt: 583, E(): 5.7e-30, (28.1% identity in 416 aa overlap); etc. Also similar to Rv3197 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215161.1" /db_xref="GI:15607787" /db_xref="GeneID:888070" /translation="MRAEIGPDFRPHYTFGDAYPASERAHVNWELSAPVWHTAQMGST THREVAKLDRVPLPVEAARVAATGWQVTRTAVRFIGRLPRKGPWQQKVIKELPQTFAD LGPTYVKFGQIIASSPGAFGESLSREFRGLLDRVPPAKTDEVHKLFVEELGDEPARLF ASFEEEPFASASIAQVHYATLRSGEEVVVKIQRPGIRRRVAADLQILKRFAQTVELAK LGRRLSAQDVVADFADNLAEELDFRLEAQSMEAWVSHLHASPLGKNIRVPQVHWDFTT ERVLTMERVHGIRIDNAAAIRKAGFDGVELVKALLFSVFEGGLRHGLFHGDLHAGNLY VDEAGRIVFFDFGIMGRIDPRTRWLLRELVYALLVKKDHAAAGKIVVLMGAVGTMKPE TQAAKDLERFATPLTMQSLGDMSYADIGRQLSALADAYDVKLPRELVLIGKQFLYVER YMKLLAPRWQMMSDPQLTGYFANFMVEVSREHQSDIEV" gene 742719..746366 /locus_tag="Rv0648" /db_xref="GeneID:888048" CDS 742719..746366 /locus_tag="Rv0648" /EC_number="3.2.1.-" /function="Alpha-mannosidase activity: hydrolysis of terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides." /experiment="experimental evidence, no additional details recorded" /note="Rv0648, (MTCY20H10.29), len: 1215 aa. Alpha-mannosidase (EC 3.2.1.-) (see citation below), showing some similarity to hypothetical proteins and various sugar hydrolases e.g. SYCSLRA_6|Q55528 HYPOTHETICAL 1 20.4 kDa PROTEIN from Synechocystis (1042 aa), FASTA scores: opt: 260, E(): 3.6e-08, (23.4% identity in 602 aa overlap); etc. Contains PS00659 Glycosyl hydrolases family 5 signature." /codon_start=1 /transl_table=11 /product="alpha-mannosidase" /protein_id="NP_215162.1" /db_xref="GI:15607788" /db_xref="GeneID:888048" /translation="MMGGTYNEPNTNLTSPETTIRNLVHGIGFQRDVLGAEPATAWQL DVFGHDPQFPGLAADAGLTSSSWARGPHHQWGPAQGGVDRMQFCSEFEWIAPSGRGLL THYMPAHYSAGWSMDSSTSLADAEAATYALFDQLKKVALTRNVLLPVGTDYTPPNKWV TAIHRDWGARYTWPRFVCALPKEFFAAVRAELAKRGWVPSPQTRDMNPIYTGKDVSYI DTKQANRAAENAVLEAERFAVFAALLTGAEYPQAALAKAWVQLAYGAHHDAITGSESD QVYLDLLTGWRDAWELGRAARDNSLRLLSGAVAASHDRVVVWNPLTQRRTDIVTARVD PPLQAGVRVFDPDGAEVAALVEHDGRSVTWLACDVPSLGWRVYRLVPADEAPGWELVP GTDIANEHYRLAVDPERGGALSSLVQDGRQLIAAGRVANELALYEEYPSHPTQGEGPW HLLPTGPVVCSSACPAQVQAYRGPLGQRLVVRGRIGTLLRYTQTLTLWDGVDRVDCRT SIDEFTGEDRLLRLRWPCPVPGAMPISEVGDAVVGRGFALLHEGPESVDTAQHPWTLD NPAYGWFGLSSAVRVRAGDGVRAVSVAEVVSPTETVSGPMARDLMVALVRAGVTATCS GADKPRYGHLDVDSNLPDARIALGGPDRNTFTKAVLAEAAPAYTAELQRQLAKTGTAR VWVPAANPLARAWLPGADLRAPCALPVLVIDGRDEKHLRAAVASLADDLADAEIVVHQ RAAPQMEPFEDRTVALLNRGVPSFAVDSEGTLHTALMRSCTGWPSGVWIDQPRRTAPD GSNFQLQHWTHHFDYALVCGGGDWRRAGIPARSAQFSHPLLAVAPRRPQGELPAVGSL LHVEPADSVQLGALKAAGNPLAAGSARPVQPAAVALRLVQTTGADTPVTIGCELGKVG ALRPADLLETPLAMARARKSSIDLHGYQVATVLARLDVAADMANVLAADDVALAPHAE TAQPQYARYWLHNRGPAPLGGLPAVAHLHPRRVRGQPGDDVVLRLTAASDCTDSVLGG VVDVVCPLGWPATPARLPFTLGAGAHLQADIALSIPAGAPPGPYPVRAQLRVVDTAVP AAWRQVVEDVCVVTVGADSDLEELVYLVDGPADIELAAGDRARLAVTIGSRAHAELAL DAHSISPWGTWEWIGPPALGAVLPARGMAKLAFDVTPPAWLEPGQWWALVRVGCAGQL VYSPAVKVSVT" misc_feature 742719..742748 /locus_tag="Rv0648" /note="PS00659 Glycosyl hydrolases family 5 signature" gene 746363..747037 /gene="fabD2" /locus_tag="Rv0649" /db_xref="GeneID:888079" CDS 746363..747037 /gene="fabD2" /locus_tag="Rv0649" /EC_number="2.3.1.39" /function="INVOLVED IN LIPID METABOLISM; FATTY ACID BIOSYNTHESIS [CATALYTIC ACTIVITY: MALONYL-CoA + [ACYL-CARRIER PROTEIN] = CoA + MALONYL-[ACYL-CARRIER PROTEIN]]." /note="Rv0649, (MTCY20H10.30), len: 224 aa. Possible fabD2, malonyl CoA-acyl carrier protein transacylase (EC 2.3.1.39), similar to MTFABD|FABD_MYCTU|Q10501|Rv2243 malonyl CoA-acyl carrier protein transacylase from Mycobacterium tuberculosis (302 aa), FASTA scores: opt: 133, E(): 0.074, (31.3% identity in 147 aa overlap)." /codon_start=1 /transl_table=11 /product="malonyl CoA-acyl carrier protein transacylase" /protein_id="YP_177744.1" /db_xref="GI:57116766" /db_xref="GeneID:888079" /translation="MSGRSRLPGSSSRRDAARIVAERVVATVAGVAVAVDEVDAAEAR LRDGPRAAALPASGTSEGRQLRRWLTQLIVTERVVAAEAAARGLTAAGAPAEADLLPD ATARLEIGSVAAAVLADPLARALFAAVTARVAVTDDAVADYHARNPLRFAAPCPGQHG WRAPAAAAPPLDQVRRAITEHLLGAARRRAFRVWLDARRNALVVLAPGYEHPGDPRQP DNTRRH" gene 747037..747945 /locus_tag="Rv0650" /db_xref="GeneID:888082" CDS 747037..747945 /locus_tag="Rv0650" /EC_number="2.7.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN SPECIFIC SUGAR METABOLISM OR REGULATION." /note="Rv0650, (MTCY20H10.31), len: 302 aa. Possible sugar kinase, highly similar to others e.g. CAB95296.1|AL359779 putative sugar kinase from Streptomyces coelicolor (317 aa); NP_406512.1|NC_003143 putative sugar kinase from Yersinia pestis (290 aa); NP_229269.1|NC_000853 glucokinase from Thermotoga maritima (317 aa); etc. Contains PS01125 ROK family signature. BELONGS TO THE ROK (NAGC/XYLR) FAMILY." /codon_start=1 /transl_table=11 /product="sugar kinase" /protein_id="NP_215164.1" /db_xref="GI:15607790" /db_xref="GeneID:888082" /translation="MLTLCLDIGGTKIAAGLADPAGTLVHTAQRPTPAYGGAEQVWAA VAEMIADALGVAGGAVGGVGIASAGPIDLHSGRVSPINIGSWGGFPLRDRVAAAVPGV PVRLGGDGVCMALGEHWLGAGRGARFLLGLVVSTGVGGGLVLDGAPCLGRTGNAGHVG HVVVDPDGSPCPCGGRGCVETIASGPSLARWARANGWSAPPGAGAKELAEAAGAGDPV ALRAFRRGAAALAAMIASVGAVCDLDLAVIGGGVAKSGRLLFEPLRAALADHARLDFL AGLRVVPAELGGAAGLVGAARLAAIA" misc_feature 747436..747519 /locus_tag="Rv0650" /note="PS01125 ROK family signature" gene 748276..748812 /gene="rplJ" /locus_tag="Rv0651" /db_xref="GeneID:888049" CDS 748276..748812 /gene="rplJ" /locus_tag="Rv0651" /function="INVOLVED IN TRANSLATION MECHANISMS." /experiment="experimental evidence, no additional details recorded" /note="binds the two ribosomal protein L7/L12 dimers and anchors them to the large ribosomal subunit" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L10" /protein_id="NP_215165.1" /db_xref="GI:15607791" /db_xref="GeneID:888049" /translation="MARADKATAVADIAAQFKESTATLITEYRGLTVANLAELRRSLT GSATYAVAKNTLIKRAASEAGIEGLDELFVGPTAIAFVTGEPVDAAKAIKTFAKEHKA LVIKGGYMDGHPLTVAEVERIADLESREVLLAKLAGAMKGNLAKAAGLFNAPASQLAR LAAALQEKKACPGPDSAE" gene 748849..749241 /gene="rplL" /locus_tag="Rv0652" /db_xref="GeneID:888078" CDS 748849..749241 /gene="rplL" /locus_tag="Rv0652" /function="INVOLVED IN TRANSLATION MECHANISMS: SEEMS TO BE THE BINDING SITE FOR SEVERAL OF THE FACTORS INVOLVED IN PROTEIN SYNTHESIS AND APPEARS TO BE ESSENTIAL FOR ACCURATE TRANSLATION." /experiment="experimental evidence, no additional details recorded" /note="present in two forms; L12 is normal, while L7 is aminoacylated at the N-terminal serine; the only multicopy ribosomal protein; 4:1 ratio of L7/L12 per ribosome; two L12 dimers bind L10; critically important for translation efficiency and fidelity; stimulates GTPase activity of translation factors" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L7/L12" /protein_id="NP_215166.1" /db_xref="GI:15607792" /db_xref="GeneID:888078" /translation="MAKLSTDELLDAFKEMTLLELSDFVKKFEETFEVTAAAPVAVAA AGAAPAGAAVEAAEEQSEFDVILEAAGDKKIGVIKVVREIVSGLGLKEAKDLVDGAPK PLLEKVAKEAADEAKAKLEAAGATVTVK" gene complement(749234..749929) /locus_tag="Rv0653c" /db_xref="GeneID:888087" CDS complement(749234..749929) /locus_tag="Rv0653c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0653c, (MTCI376.23, MTCY20H10.34c), len: 231 aa. Possible transcriptional regulator, TetR family, similar in N-terminus to others e.g. CAC03642.1|AL391338 putative TetR-family transcriptional regulator from Streptomyces coelicolor (190 aa); Q51597 CAM REPRESSOR from Pseudomonas putida (186 aa), FASTA scores: opt: 150, E(): 0.00085, (27.8% identity in 97 aa overlap); etc. Also some similarity to Mycobacterium tuberculosis hypothetical transcriptional regulators Rv0681 and Rv1816. Contains probable helix-turn helix motif from aa 27-48 (Score 1156, +3.12 SD)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_215167.1" /db_xref="GI:15607793" /db_xref="GeneID:888087" /translation="MTSQTGVRDELLHAGVRLLDDHGPDALQTRKVAAAAGTSTMAVY THFGGMRGLIAAIAEEGLRQFDVALTVPQTADPVADLLAIGTAYRRYAIERPHMYRLM FGSTSAHGINVPARDVLTLKVAEIEHQHPSFAHVVRAVHRCLLAGRFATALGADDDTA IVATAAQFWSQIHGFVMLELAGFYGDRGAAVEPVLAAMTVNLLVALGDSPERAQCSLR AEQTQKNTLGRAT" gene 750000..751505 /locus_tag="Rv0654" /db_xref="GeneID:888089" CDS 750000..751505 /locus_tag="Rv0654" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0654, (MTCI376.22), len: 501 aa. Probable dioxygenase (EC 1.-.-.-), highly similar to others eg AAK06796.1|AF324838_15|AF324838|SimC5 putative dioxygenase (involved in tetraene formation) from Streptomyces antibioticus (456 aa); CAB56138.1| AL117669 putative dioxygenase from Streptomyces coelicolor (503 aa); T51734 neoxanthin cleavage enzyme (9-cis-epoxy-carotenoid dioxygenase) from Arabidopsis thaliana (538 aa); Q53353 LIGNOSTILBENE-ALPHA,BETA-DIOXYGENASE from Pseudomonas paucimobilis (Sphingomonas paucimobilis), FASTA scores: opt: 280, E(): 2.3e-11, (28.5% identity in 523 aa overlap); etc. Also some similarity with Rv0913c|MTCY21C12.07c POSSIBLE DIOXYGENASE from Mycobacterium tuberculosis (501 aa), FASTA score: (29.5% identity in 522 aa overlap)." /codon_start=1 /transl_table=11 /product="dioxygenase" /protein_id="NP_215168.1" /db_xref="GI:15607794" /db_xref="GeneID:888089" /translation="MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRY LRNGPNPVAEVDPATYHWFTGDAMVHGVALRDGKARWYRNRWVRTPAVCAALGEPISA RPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCDFDGTLHGGYT AHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTVDIEVAGSPMMHSFSLTDN YVVIYDLPVTFDPMQVVPASVPRWLQRPARLVIQSVLGRVRIPDPIAALGNRMQGHSD RLPYAWNPSYPARVGVMPREGGNEDVRWFDIEPCYVYHPLNAYSECRNGAEVLVLDVV RYSRMFDRDRRGPGGDSRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHR FAYTVGIEGGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAE DDGILMGYGWHRGRDEGQLLLLDAQTLESIATVHLPQRVPMGFHGNWAPTT" gene 751517..752596 /gene="mkl" /locus_tag="Rv0655" /db_xref="GeneID:888081" CDS 751517..752596 /gene="mkl" /locus_tag="Rv0655" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF RIBONUCLEOTIDE ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv0655, (MTCI376.21), len: 359 aa. Possible mkl, ribonucleotide-transport ATP-binding protein ABC transporter (see Braibant et al., 2000), equivalent to P30769|MKL_MYCLE|ML1892 POSSIBLE RIBONUCLEOTIDE TRANSPORT ATP-BINDING PROTEIN from Mycobacterium leprae (347 aa), FASTA scores: opt: 2021, E(): 0, (92.2% identity in 335 aa overlap). Also highly similar to many e.g. AB92896.1|AL356992 putative ABC-transporter ATP-binding protein from Streptomyces coelicolor (343 aa); NP_253146.1|NC_002516 probable ATP-binding component of ABC transporter from Pseudomonas aeruginosa (269 aa); P45393|YRBF_ECOLI hypothetical ABC transporter ATP-binding protein from Escherichia coli (269 aa), FASTA scores: opt: 644, E(): 3.4e-33, (38.5% identity in 244 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis ABC transporters e.g. P71747|CYSA|Rv2397c|MTCY253.24 (351 aa), FASTA score: (33.6% identity in 241 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="ribonucleotide ABC transporter ATP-binding protein" /protein_id="NP_215169.1" /db_xref="GI:15607795" /db_xref="GeneID:888081" /translation="MRYSDSYHTTGRWQPRASTEGFPMGVSIEVNGLTKSFGSSRIWE DVTLTIPAGEVSVLLGPSGTGKSVFLKSLIGLLRPERGSIIIDGTDIIECSAKELYEI RTLFGVLFQDGALFGSMNLYDNTAFPLREHTKKKESEIRDIVMEKLALVGLGGDEKKF PGEISGGMRKRAGLARALVLDPQIILCDEPDSGLDPVRTAYLSQLIMDINAQIDATIL IVTHNINIARTVPDNMGMLFRKHLVMFGPREVLLTSDEPVVRQFLNGRRIGPIGMSEE KDEATMAEEQALLDAGHHAGGVEEIEGVPPQISATPGMPERKAVARRQARVREMLHTL PKKAQAAILDDLEGTHKYAVHEIGQ" misc_feature 751694..751717 /gene="mkl" /locus_tag="Rv0655" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 752006..752050 /gene="mkl" /locus_tag="Rv0655" /note="PS00211 ABC transporters family signature" gene complement(752984..753367) /locus_tag="Rv0656c" /db_xref="GeneID:888106" CDS complement(752984..753367) /locus_tag="Rv0656c" /function="UNKNOWN" /note="Rv0656c, (MTCI376.20), len: 127 aa. Conserved hypothetical protein, showing similarity with proteins from Mycobacterium tuberculosis e.g. Rv2757c, Rv2546, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215170.1" /db_xref="GI:15607796" /db_xref="GeneID:888106" /translation="MAAATTTGTHRGLELRAAQRAVGSCEPQRAEFCRSARNADEFDQ MSRMFGDVYPDVPVPKSVWRWIDSAQHRLARAGAVGALSVVDLLICDTAAARGLVVLH DDADYELAERHLPDIRVRRVVSADD" gene complement(753462..753617) /locus_tag="Rv0657c" /db_xref="GeneID:888077" CDS complement(753462..753617) /locus_tag="Rv0657c" /function="UNKNOWN" /note="Rv0657c, (MTCI376.19), len: 51 aa. Conserved hypothetical protein, showing similarity with hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2009|MT2064.1|MTCY39.08c|YW08_MYCTU|Q10848 (80 aa), FASTA scores: opt: 107, E(): 0.0038, (45.8% identity in 48 aa overlap), Rv2871, Rv1560, etc. Also some similarity with AL020958|SC4H8_7 from Streptomyces coelicolor (66 aa), FASTA score: (41.0% identity in 39 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215171.1" /db_xref="GI:15607797" /db_xref="GeneID:888077" /translation="MSVTQIDLDDEALADVMRIAAVHTKKEAVNLAMRDYVERFRRIE ALARSRE" gene complement(753693..754409) /locus_tag="Rv0658c" /db_xref="GeneID:888102" CDS complement(753693..754409) /locus_tag="Rv0658c" /function="UNKNOWN. SUPPOSED INVOLVED IN STATIONARY-PHASE SURVIVAL." /experiment="experimental evidence, no additional details recorded" /note="Rv0658c, (MTCI376.18), len: 238 aa. Probable conserved integral membrane protein, equivalent to a predicted homologous protein from Mycobacterium smegmatis (see citation below), and showing some similarity with P33774|YPRB_ECOLI hypothetical 24.3 kDa protein from Escherichia coli (217 aa), FASTA scores: opt: 174, E(): 5.3e-05, (25.6% identity in 223 aa overlap). Also similar to Rv1863c and Rv0804 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215172.1" /db_xref="GI:15607798" /db_xref="GeneID:888102" /translation="MEAGRADTVAPSHRWGLGAFLVVELVFLVASTSLAVVLTGHGPV SAGVLALALAAPTVVAAGLAILITRLRGNGLRTDLRLRWSWRGLRLGLMFGFGGMLVT IPASLVYTAIVGPEANSAVVRIFGGVRASWPWALVVFLVVVFVAPLCEEIIYRGLLWG AVDRRWGRWAALVVTTVVFALAHLEFARAPLLVVVAIPIALARFYSGGLLASIVTHQV TNLLPGIVLLLGLTGAISLP" gene complement(754685..754993) /locus_tag="Rv0659c" /db_xref="GeneID:888134" CDS complement(754685..754993) /locus_tag="Rv0659c" /function="UNKNOWN" /note="Rv0659c, (MTCI376.17), len: 102 aa. Conserved hypothetical protein, weakly similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv1942c, Rv1495, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215173.1" /db_xref="GI:15607799" /db_xref="GeneID:888134" /translation="MRRGELWFAATPGGDRPVLVLTRDPVADRIGAVVVVALTRTRRG LVSELELTAVENRVPSDCVVNFDNIHTLPRTAFRRRITRLSPARLHEACQTLRASTGC" gene complement(754980..755225) /locus_tag="Rv0660c" /db_xref="GeneID:888141" CDS complement(754980..755225) /locus_tag="Rv0660c" /function="UNKNOWN" /note="Rv0660c, (MTCI376.16), len: 81 aa. Conserved hypothetical protein, showing some similarity to AF016485_130 from Halobacterium sp (100 aa), FASTA scores: (32.4% identity in 74 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215174.1" /db_xref="GI:15607800" /db_xref="GeneID:888141" /translation="MLSFRADDHDVDLADAWARRLHIGRSELLRDALRRHLAALAADQ DVQAYTERPLTDDENALAEIADWGPAEDWADWADAAR" gene complement(755335..755772) /locus_tag="Rv0661c" /db_xref="GeneID:888143" CDS complement(755335..755772) /locus_tag="Rv0661c" /function="UNKNOWN" /note="Rv0661c, (MTCI376.15), len: 145 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv2863|MTV003.09|MTV003_7 (126 aa), FASTA scores: E(): 0.00087, (30.4% identity in 125 aa overlap), Rv0749|MTV041.23 (163 aa); Rv0277c, Rv2530c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215175.1" /db_xref="GI:15607801" /db_xref="GeneID:888143" /translation="MIVLDTTVLVYAKGAEHPLRDPCRDLVAAIADERIAATTTAEVI QEFVHVRARRRDRSDAAALGRVTMPNCSRRYSPSIEATSKRGLTLFETTPGLEACDAV LAAVAASAGATALVSADPAFADLSDVVHVIPDAAGMVSLLGDR" gene complement(755769..756137) /locus_tag="Rv0662c" /db_xref="GeneID:888117" CDS complement(755769..756137) /locus_tag="Rv0662c" /function="UNKNOWN" /note="Rv0662c, (MTCI376.14), len: 133 aa. Conserved hypothetical protein, showing weak similarity with other hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2871, Rv1241, Rv2550c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215176.1" /db_xref="GI:15607802" /db_xref="GeneID:888117" /translation="MFLPNTRAYRRYNRSVWAVRGSTRPQWQPPPKFQHAKCMSMRLA HRLQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAADM SVPEPRELKQELEALRARRG" gene 756137..758500 /gene="atsD" /locus_tag="Rv0663" /db_xref="GeneID:888144" CDS 756137..758500 /gene="atsD" /locus_tag="Rv0663" /EC_number="3.1.6.1" /function="THOUGHT TO PLAY AN IMPORTANT ROLE IN THE MINERALIZATION OF SULFATES [CATALYTIC ACTIVITY: A phenol sulfate + H2O = a phenol + sulfate]." /note="Rv0663, (MTCI376.13c), len: 787 aa. Possible atsD, arylsulfatase (EC 3.1.6.1), similar to others e.g. P5169|ARS_PSEAE arylsulfatase from Pseudomonas aeruginosa (532 aa), FASTA scores: opt: 653, E(): 0, (33.1% identity in 544 aa overlap); etc. Also similar to P95059|MTCY210.30|ATSA|Rv0711|MTCY210.30 from Mycobacterium tuberculosis (787 aa), FASTA score: (38.9% identity in 769 aa overlap); and other arylsulfatases from Mycobacterium tuberculosis e.g. Rv3299c|ATSB (970 aa), Rv0711, etc. Contains PS00523 Sulfatases signature 1. BELONGS TO THE SULFATASE FAMILY." /codon_start=1 /transl_table=11 /product="arylsulfatase AtsD" /protein_id="NP_215177.1" /db_xref="GI:15607803" /db_xref="GeneID:888144" /translation="MPQPRTHLPIPSAARTGLITYDAKDPDSTYPPIEQLRPPAGAPN VLLILLDDVGFGASSAFGGPCRTSTAELLAGNGLRYNRFHTTALCSPTRQALLTGRNH HSAGMGGITEIATGAPGYSSVLPNTMSPIARTLKLNGYNTAQFGKCHEVPVWQTSPVG PFDAWPSGGGGFEYFYGFIGGEANQWYPSLYEGTTPVEVNRTPEEGYHFMADMTDKAL GWIGQQKALAPDRPFFVYFAPGATHAPHHVPREWADKYRGRFDVGWDALREETFARQK ELGVIPADCQLTARHAEIPAWDDMPEDLKPVLCRQMEVYAGFLEYTDHHVGRLVDGLQ RLGVLDDTLVFYIIDDNGASAEGTINGTYNEMLNFNGLADIETPRFMTDRLDKFGGPE SYNHYSVGWAHAMDTPYQWTKQVASHWGGTRNGTIVHWPNGIAAKGEMRWQFHHVIDV APTILEAAGLPEPLFVNGVQQHPIEGVSMAYSFDDAQAPDRHETQYFEMFGNRGIYHK GWTAVTKHKTPWILVGEQTVAFDDDVWELYDTTKDWSQAKDLAKEMPEKLHELQRLWL IEATRYNVLPLDDDTASRINPDLAGRPVLIRGNTQVLFSNMGRLSENCVLNLKNKSHT VTAEVEVPETGAEGVIVAQGASIGGWSLYANDGKLKYCYNLGGIKHFYAESADPLPAG AHQVRMEFAYAGGGLGKGGEVTLYVDGQQVGEGHVEATLAIVFSADDGCDVGMDSGSP VSPDYAPGSNAFNGRIKGVQLAIAEAAAAAGHLVDPEHAIRIALARQ" misc_feature 756395..756433 /gene="atsD" /locus_tag="Rv0663" /note="PS00523 Sulfatases signature 1" gene 758532..758804 /locus_tag="Rv0664" /db_xref="GeneID:888146" CDS 758532..758804 /locus_tag="Rv0664" /function="UNKNOWN" /note="Rv0664, (MTCI376.12c), len: 90 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215178.1" /db_xref="GI:15607804" /db_xref="GeneID:888146" /translation="MEKSRCHAVAHGGGCAGSAKSHKSGGRCGQGRGAGDSHGTRGAG RRYRAASAPHPLAVGAHLRDELAKRSADPRLTDELNDLAGHTLDDL" gene 758801..759139 /locus_tag="Rv0665" /db_xref="GeneID:888149" CDS 758801..759139 /locus_tag="Rv0665" /function="UNKNOWN" /note="Rv0665, (MTCI376.11c), len: 112 aa. Conserved hypothetical protein, similar to Rv0627 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (135 aa), and showing similarity with Rv0595c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215179.1" /db_xref="GI:15607805" /db_xref="GeneID:888149" /translation="MTEGEVGVGLLDTSVFIARESGGAIADLPERVALSVMTIGELQL GLLNAGDSATRSRRADTLALARTADQIPVSEAVMISLARLVADCRAAGVRRSVKLTDA LIAATAEIKV" gene 759136..759309 /locus_tag="Rv0666" /db_xref="GeneID:888158" CDS 759136..759309 /locus_tag="Rv0666" /function="UNKNOWN" /note="Rv0666, (MTCI376.10c), len: 57 aa. Possible membrane protein; has hydrophobic stretch at aa 29-47." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215180.1" /db_xref="GI:15607806" /db_xref="GeneID:888158" /translation="MTPRTDEGAAAPCLMPDVTMPVKRGDARGALGVGPALFVVSVSS SLVRARSCRCTAD" gene 759807..763325 /gene="rpoB" /locus_tag="Rv0667" /db_xref="GeneID:888164" CDS 759807..763325 /gene="rpoB" /locus_tag="Rv0667" /EC_number="2.7.7.6" /function="CATALYZES THE TRANSCRIPTION OF DNA INTO RNA USING THE FOUR RIBONUCLEOSIDE TRIPHOSPHATES AS SUBSTRATES [CATALYTIC ACTIVITY: N NUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {RNA}(N)]." /experiment="experimental evidence, no additional details recorded" /note="DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates; beta subunit is part of the catalytic core which binds with a sigma factor to produce the holoenzyme" /codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit beta" /protein_id="NP_215181.1" /db_xref="GI:15607807" /db_xref="GeneID:888164" /translation="MADSRQSKTAASPSPSRPQSSSNNSVPGAPNRVSFAKLREPLEV PGLLDVQTDSFEWLIGSPRWRESAAERGDVNPVGGLEEVLYELSPIEDFSGSMSLSFS DPRFDDVKAPVDECKDKDMTYAAPLFVTAEFINNNTGEIKSQTVFMGDFPMMTEKGTF IINGTERVVVSQLVRSPGVYFDETIDKSTDKTLHSVKVIPSRGAWLEFDVDKRDTVGV RIDRKRRQPVTVLLKALGWTSEQIVERFGFSEIMRSTLEKDNTVGTDEALLDIYRKLR PGEPPTKESAQTLLENLFFKEKRYDLARVGRYKVNKKLGLHVGEPITSSTLTEEDVVA TIEYLVRLHEGQTTMTVPGGVEVPVETDDIDHFGNRRLRTVGELIQNQIRVGMSRMER VVRERMTTQDVEAITPQTLINIRPVVAAIKEFFGTSQLSQFMDQNNPLSGLTHKRRLS ALGPGGLSRERAGLEVRDVHPSHYGRMCPIETPEGPNIGLIGSLSVYARVNPFGFIET PYRKVVDGVVSDEIVYLTADEEDRHVVAQANSPIDADGRFVEPRVLVRRKAGEVEYVP SSEVDYMDVSPRQMVSVATAMIPFLEHDDANRALMGANMQRQAVPLVRSEAPLVGTGM ELRAAIDAGDVVVAEESGVIEEVSADYITVMHDNGTRRTYRMRKFARSNHGTCANQCP IVDAGDRVEAGQVIADGPCTDDGEMALGKNLLVAIMPWEGHNYEDAIILSNRLVEEDV LTSIHIEEHEIDARDTKLGAEEITRDIPNISDEVLADLDERGIVRIGAEVRDGDILVG KVTPKGETELTPEERLLRAIFGEKAREVRDTSLKVPHGESGKVIGIRVFSREDEDELP AGVNELVRVYVAQKRKISDGDKLAGRHGNKGVIGKILPVEDMPFLADGTPVDIILNTH GVPRRMNIGQILETHLGWCAHSGWKVDAAKGVPDWAARLPDELLEAQPNAIVSTPVFD GAQEAELQGLLSCTLPNRDGDVLVDADGKAMLFDGRSGEPFPYPVTVGYMYIMKLHHL VDDKIHARSTGPYSMITQQPLGGKAQFGGQRFGEMECWAMQAYGAAYTLQELLTIKSD DTVGRVKVYEAIVKGENIPEPGIPESFKVLLKELQSLCLNVEVLSSDGAAIELREGED EDLERAAANLGINLSRNESASVEDLA" gene 763370..767320 /gene="rpoC" /locus_tag="Rv0668" /db_xref="GeneID:888177" CDS 763370..767320 /gene="rpoC" /locus_tag="Rv0668" /EC_number="2.7.7.6" /function="CATALYZES THE TRANSCRIPTION OF DNA INTO RNA USING THE FOUR RIBONUCLEOSIDE TRIPHOSPHATES AS SUBSTRATES [CATALYTIC ACTIVITY: N NUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {RNA}(N)]." /experiment="experimental evidence, no additional details recorded" /note="DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Subunit beta' binds to sigma factor allowing it to bind to the -10 region of the promoter" /codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit beta'" /protein_id="NP_215182.1" /db_xref="GI:15607808" /db_xref="GeneID:888177" /translation="MLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKD GLFCEKIFGPTRDWECYCGKYKRVRFKGIICERCGVEVTRAKVRRERMGHIELAAPVT HIWYFKGVPSRLGYLLDLAPKDLEKIIYFAAYVITSVDEEMRHNELSTLEAEMAVERK AVEDQRDGELEARAQKLEADLAELEAEGAKADARRKVRDGGEREMRQIRDRAQRELDR LEDIWSTFTKLAPKQLIVDENLYRELVDRYGEYFTGAMGAESIQKLIENFDIDAEAES LRDVIRNGKGQKKLRALKRLKVVAAFQQSGNSPMGMVLDAVPVIPPELRPMVQLDGGR FATSDLNDLYRRVINRNNRLKRLIDLGAPEIIVNNEKRMLQESVDALFDNGRRGRPVT GPGNRPLKSLSDLLKGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKLMALE LFKPFVMKRLVDLNHAQNIKSAKRMVERQRPQVWDVLEEVIAEHPVLLNRAPTLHRLG IQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQMAVHLPLSAEAQAEARILMLSSNNIL SPASGRPLAMPRLDMVTGLYYLTTEVPGDTGEYQPASGDHPETGVYSSPAEAIMAADR GVLSVRAKIKVRLTQLRPPVEIEAELFGHSGWQPGDAWMAETTLGRVMFNELLPLGYP FVNKQMHKKVQAAIINDLAERYPMIVVAQTVDKLKDAGFYWATRSGVTVSMADVLVPP RKKEILDHYEERADKVEKQFQRGALNHDERNEALVEIWKEATDEVGQALREHYPDDNP IITIVDSGATGNFTQTRTLAGMKGLVTNPKGEFIPRPVKSSFREGLTVLEYFINTHGA RKGLADTALRTADSGYLTRRLVDVSQDVIVREHDCQTERGIVVELAERAPDGTLIRDP YIETSAYARTLGTDAVDEAGNVIVERGQDLGDPEIDALLAAGITQVKVRSVLTCATST GVCATCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGVGEDITGGLPR VQELFEARVPRGKAPIADVTGRVRLEDGERFYKITIVPDDGGEEVVYDKISKRQRLRV FKHEDGSERVLSDGDHVEVGQQLMEGSADPHEVLRVQGPREVQIHLVREVQEVYRAQG VSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFEAENRRVVAEGGEPAAGR PVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPAGT GINRYRNIAVQPTEEARAAAYTIPSYEDQYYSPDFGAATGAAVPLDDYGYSDYR" gene complement(767684..769597) /locus_tag="Rv0669c" /db_xref="GeneID:888181" CDS complement(767684..769597) /locus_tag="Rv0669c" /EC_number="3.-.-.-" /function="UNKNOWN; HYDROLYTIC ENZYME PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0669c, (MTCI376.05), len: 637 aa. Possible hydrolase (EC 3.-.-.-), highly similar to various hydrolases (N-terminus shorter) e.g. BAA88409.1|AB028646 alkaline ceramidase from Pseudomonas aeruginosa (670 aa,) FASTA scores: opt: 1490, E(): 0, (41.2% identity in 651 aa overlap); NP_063946.1|NM_019893 mitochondrial ceramidase from Homo sapiens (761 aa); P_446098.1|NM_053646 N-acylsphingosine amidohydrolase 2 from Rattus norvegicus (761 aa); BAB09641.1|AB016885 neutral ceramidase from Arabidopsis thaliana (705 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_215183.1" /db_xref="GI:15607809" /db_xref="GeneID:888181" /translation="MLSVGRGIADITGEAADCGMLGYGKSDQRTAGIHQRLRSRAFVF RDDSQDGDARLLLIVAELPLPMQNVNEEVLRRLADLYGDTYSEQNTLITATHTHAGPG GYCGYLLYNLTTSGFRPATFAAIVDGIVESVEHAHADVAPAEVSLSHGELYGASINRS PSAFDRNPPADKAFFPKRVDPHTTLVRIDRGEATVGVIHFFATHGTSMTNRNHLISGD NKGFAAYHWERTVGGADYLAGQPDFIAAFAQTNPGDMSPNVDGPLSPEAPPDREFDNT RRTGLCQFEDAFTQLSGATPIGAGIDARFTYVDLGSVLVRGEYTPDGEERRTGRPMFG AGAMAGTDEGPGFHGFRQGRNPFWDRLSRAMYRLARPTAAAQAPKGIVMPARLPNRIH PFVQEIVPVQLVRIGRLYLIGIPGEPTIVAGLRLRRMVASIVGADLADVLCVGYTNAY IHYVTTPEEYLEQRYEGGSTLFGRWELCALMQTVAELAEAMRDGRPVTLGRRPRPTRE LSWVRGAPADAGSFGAVIAEPSATYRPGQAVEAVFVSALPNNDLRRGGTYLEVVRREG ASWVRIADDGDWATSFRWQRQGRAGSHVSIRWDVPGDTTPGQYRIVHHGTARDRNGML TAFSATTREFTVV" misc_feature complement(769520..769543) /locus_tag="Rv0669c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 769792..770550 /gene="end" /locus_tag="Rv0670" /db_xref="GeneID:888190" CDS 769792..770550 /gene="end" /locus_tag="Rv0670" /EC_number="3.1.21.2" /function="INVOLVED IN BASE EXCISION REPAIR. ENDONUCLEASE IV PLAYS A ROLE IN DNA REPAIR. IT CLEAVES PHOSPHODIESTER BONDS AT APURINIC OR APYRIMIDINIC SITES (AP SITES) TO PRODUCE NEW 5' ENDS THAT ARE BASE-FREE DEOXYRIBOSE 5-PHOSPHATE RESIDUES [CATALYTIC ACTIVITY: Endonucleolytic cleavage to 5'-phosphooligonucleotide end-products]." /note="Assists in DNA repair by cleaving phosphodiester bonds at apurinic or apyrimidinic sties to produce new 5' ends that are base-free deoxyribose 5-phosphate residues" /codon_start=1 /transl_table=11 /product="endonuclease IV" /protein_id="NP_215184.1" /db_xref="GI:15607810" /db_xref="GeneID:888190" /translation="MLIGSHVSPTDPLAAAEAEGADVVQIFLGNPQSWKAPKPRDDAA ALKAATLPIYVHAPYLINLASANNRVRIPSRKILQETCAAAADIGAAAVIVHGGHVAD DNDIDKGFQRWRKALDRLETEVPVYLENTAGGDHAMARRFDTIARLWDVIGDTGIGFC LDTCHTWAAGEALTDAVDRIKAITGRIDLVHCNDSRDEAGSGRDRHANLGSGQIDPDL LVAAVKAAGAPVICETADQGRKDDIAFLRERTGS" misc_feature 769957..769983 /gene="end" /locus_tag="Rv0670" /note="PS00729 AP endonucleases family 2 signature 1" misc_feature 770263..770286 /gene="end" /locus_tag="Rv0670" /note="PS00730 AP endonucleases family 2 signature 2" gene 770582..771424 /gene="lpqP" /locus_tag="Rv0671" /db_xref="GeneID:888194" CDS 770582..771424 /gene="lpqP" /locus_tag="Rv0671" /function="UNKNOWN" /note="Rv0671, (MTCI376.03c), len: 280 aa. Possible lpqP, conserved lipoprotein, similar to U00012|B1308_F2_43|Q49658 from Mycobacterium leprae (302 aa), FASTA scores: opt: 449, E(): 2.4e-22, (37.6% identity in 242 aa overlap). Also highly similar to lpqC|Rv3298c|MTCY71.38c PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (304 aa). Also similar to a large variety of proteins including various esterases and poly(3-hydroxyalkanoate) depolymerases, e.g. NP_249234.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (322 aa); C-terminus of AAD45376.1|AF164516_1|AF164516 cinnamoyl ester hydrolase EstA from Piromyces equi (536 aa); part of P52090|PHA1_PSELE POLY(3-HYDROXYALKANOATE) DEPOLYMERASE C PRECURSOR from Pseudomonas lemoignei (414 aa); CAC10310.1|AL442629 putative secreted protein from Streptomyces coelicolor (348 aa); etc. Has a 17 aa signal sequence and contains appropriately positioned (PS00013) Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqP" /protein_id="NP_215185.1" /db_xref="GI:15607811" /db_xref="GeneID:888194" /translation="MLRRVAILLAAVLAFAGCSGGTRLAAGFGNGNSVHTLDVDGAGR SYRLYKPVGLPSSAPLVVMLHGGFGSAKQAERSYGWDELADSEKFLVAYPDGYHRAWN ANGGGCCGRPAREGVDDIGFVRAVVADIANNVSIDPARVYVTGMSNGAIMSYTLACNT SIFAAIGVVSGTQLDPCQSPRPVSVIHIHGTADPLVRYHGGPGAGFARIDGPPVPDLN AFWREVNRCGALDTTTEGPVTTSGATCADNRRVVLLTVDDAGHRWPSFATQTLWRFFA AHFR" misc_feature 770603..770635 /gene="lpqP" /locus_tag="Rv0671" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 771484..773112 /gene="fadE8" /locus_tag="Rv0672" /db_xref="GeneID:888198" CDS 771484..773112 /gene="fadE8" /locus_tag="Rv0672" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0672, (MTCI376.02c), len: 545 aa. Probable fadE8, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. CAC33951.1|AL589708 putative acyl-CoA dehydrogenase from Streptomyces coelicolor (557 aa); P33224|AIDB_ECOLI|B4187 aidb protein (ACYL-COA DEHYDROGENASES FAMILY) from Escherichia coli strain K12 (546 aa), FASTA scores: opt: 1369, E(): 0, (44.1% identity in 524 aa overlap); etc. Also similar to several other M. tuberculosis proteins e.g. Rv0154cRv0154c|MTCI5.28c FASTA score: (26.3% identity in 342 aa overlap); etc. Contains acyl-CoA dehydrogenases signature 2 (PS00073). BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE8" /protein_id="NP_215186.1" /db_xref="GI:15607812" /db_xref="GeneID:888198" /translation="MSDTHVVTNQVPPLENYNPASSPVLIEALIQEGGQWGLDEVNEV GAISASCQAQRWGELADRNRPILHTHDAYGYRVDEVEYDPAYHELMRTAITHGMHAAP WADDRPGAHVVRAAKTSVWTVEPGHICPISMTYAVVPALRYNSELAAVYEPLLTSREY DPELKPATTKAGITAGMSMTEKQGGSDVRAGTTQATPNADGSYSLTGHKWFTSAPMCD IFLVLAQAPDGLSCFLLPRVLPDGTRNRMFLQRLKDKLGNHANASSEVEYDGAVAWLV GEEGRGVPTIIEMVNLTRLDCALGSATSMRTGLTRAVHHAQHRKAFGAYLIDQPLMRN VLADLAVEAEAATIVAMRMAGATDNAVRGNETEALLRRIGLAAAKYWVCKRSTAHAAE ALECLGGNGYVEDSGMPRLYREAPLMGIWEGSGNVSALDTLRAMATRPACVEVLFDEL ARSAGQDPRLDGHVERLRPQLGDLDTIGYRARKIAEDICLALQGSLLVRHGHPAVAEA FLATRLGGQWGGAYGTMPAGLDLAPILERALVKG" misc_feature 772666..772725 /gene="fadE8" /locus_tag="Rv0672" /note="PS00073 Acyl-CoA dehydrogenases signature 2" gene 773123..774061 /gene="echA4" /locus_tag="Rv0673" /db_xref="GeneID:888175" CDS 773123..774061 /gene="echA4" /locus_tag="Rv0673" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215187.1" /db_xref="GI:15607813" /db_xref="GeneID:888175" /translation="MTHAIRPVDFDNLKTMTYEVTGRIARITFNRPEKGNAIIADTPL ELSALVERADLDPGVHVILVSGRGEGFCAGFDLSAYAEGSSSTGGGGAYQGTVLDGKT QAVNHLPNQPWDPMIDYQMMSRFVRGFASLMHADKPTVVKIHGYCVAGGTDIALHADQ VIAAADAKIGYPPTRVWGVPAAGLWAHRLGDQRAKRLLFTGDCITGAQAAEWGLAVEA PEPADLDERTERLVARIAALPVNQLIMVKLALNSALLQQGVATSRMVSTVFDGAARHT PEGHAFVADAVEHGFRDAVRRRDEPFGDYGRQASRV" misc_feature 773405..773428 /gene="echA4" /locus_tag="Rv0673" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 774064..774786 /locus_tag="Rv0674" /db_xref="GeneID:888203" CDS 774064..774786 /locus_tag="Rv0674" /function="UNKNOWN" /note="Rv0674, (MTV040.02), len: 240 aa. Conserved hypothetical protein, highly similar to AC13063.1|AL445503 conserved hypothetical protein from Streptomyces coelicolor (268 aa); and similar to NP_438100.1|NC_003078 putative regulator of phenylacetic acid degradation ArsR family protein from Sinorhizobium meliloti (306 aa) and other proteins e.g. AB011837|AB011837_13 hypothetical protein from Bacillus halodurans (298 aa), FASTA scores: opt: 148, E(): 0.0081, (25.1% identity in 235 aa overlap); etc. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215188.1" /db_xref="GI:15607814" /db_xref="GeneID:888203" /translation="MPAMTARSVVLSVLLGAHPAWATASELIQLTADFGIKETTLRVA LTRMVGAGDLVRSADGYRLSDRLLARQRRQDEAMRPRTRAWHGNWHMLIVTSIGTDAR TRAALRTCMHHKRFGELREGVWMRPDNLDLDLESDVAARVRMLTARDEAPADLAGQLW DLSGWTEAGHRLLGDMAAATDMPGRFVVAAAMVRHLLTDPMLPAELLPADWPGAGLRA AYHDFATAMAKRRDATQLLEVT" gene 774783..775574 /gene="echA5" /locus_tag="Rv0675" /db_xref="GeneID:888222" CDS 774783..775574 /gene="echA5" /locus_tag="Rv0675" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="YP_177745.1" /db_xref="GI:57116767" /db_xref="GeneID:888222" /translation="MSDLVRVERKGRVTTVILNRPASRNAVNGPTAAALCAAFEQFDR DDAASVAVLWGAGGTFCAGADLKAFGTPEANSVHRTGPGPMGPSRMMLSKPVIAAVSG YAVAGGLELALWCDLRVAEEDAVFGVFCRRWGVPLIDGGTVRLPRLIGHSRAMDMILT GRGVPADEALAMGLANRVVPKGQARQAAEELAAQLAALPQQCLRSDRLSALHQWGLPE SAALDLEFASIARVAGEALEGARRFAAGAGRHGAPAPRAEQGDTL" misc_feature 775071..775133 /gene="echA5" /locus_tag="Rv0675" /note="PS00166 Enoyl-CoA hydratase/isomerase signature" gene complement(775586..778480) /gene="mmpL5" /locus_tag="Rv0676c" /db_xref="GeneID:888219" CDS complement(775586..778480) /gene="mmpL5" /locus_tag="Rv0676c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0676c, (MTV040.04c), len: 964 aa. Probable mmpL5, conserved transmembrane transport protein (see Tekaia et al., 1999), member of RND superfamily, highly similar to other Mycobacterial proteins e.g. MTV037_14, MTCY98_8, MTCY20G9_34, MTCY4D9_15, MTCY48_8, MTCY19G5_6, MTV005_19, etc. Also similar to other Mycobacterial mmpl proteins e.g. P54881|MML4_MYCLE PUTATIVE MEMBRANE PROTEIN MMPL4 from Mycobacterium leprae (959 aa), FASTA scores: opt: 3991, E(): 0, (62.8% identity in 933 aa overlap); etc. BELONGS TO THE MMPL FAMILY. TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL5" /protein_id="NP_215190.1" /db_xref="GI:15607816" /db_xref="GeneID:888219" /translation="MIVQRTAAPTGSVPPDRHAARPFIPRMIRTFAVPIILGWLVTIA VLNVTVPQLETVGQIQAVSMSPDAAPSMISMKHIGKVFEEGDSDSAAMIVLEGQRPLG DAAHAFYDQMIGRLQADTTHVQSLQDFWGDPLTATGAQSSDGKAAYVQVKLAGNQGES LANESVEAVKTIVERLAPPPGVKVYVTGSAALVADQQQAGDRSLQVIEAVTFTVIIVM LLLVYRSIITSAIMLTMVVLGLLATRGGVAFLGFHRIIGLSTFATNLLVVLAIAAATD YAIFLIGRYQEARGLGQDRESAYYTMFGGTAHVVLGSGLTIAGATFCLSFTRLPYFQT LGVPLAIGMVIVVAAALTLGPAIIAVTSRFGKLLEPKRMARVRGWRKVGAAIVRWPGP ILVGAVALALVGLLTLPGYRTNYNDRNYLPADLPANEGYAAAERHFSQARMNPEVLMV ESDHDMRNSADFLVINKIAKAIFAVEGISRVQAITRPDGKPIEHTSIPFLISMQGTSQ KLTEKYNQDLTARMLEQVNDIQSNIDQMERMHSLTQQMADVTHEMVIQMTGMVVDVEE LRNHIADFDDFFRPIRSYFYWEKHCYDIPVCWSLRSVFDTLDGIDVMTEDINNLLPLM QRLDTLMPQLTAMMPEMIQTMKSMKAQMLSMHSTQEGLQDQMAAMQEDSAAMGEAFDA SRNDDSFYLPPEVFDNPDFQRGLEQFLSPDGHAVRFIISHEGDPMSQAGIARIAKIKT AAKEAIKGTPLEGSAIYLGGTAAMFKDLSDGNTYDLMIAGISALCLIFIIMLITTRSV VAAAVIVGTVVLSLGASFGLSVLIWQHILGIELHWLVLAMAVIILLAVGADYNLLLVA RLKEEIHAGINTGIIRAMGGSGSVVTAAGLVFAFTMMSFAVSELTVMAQVGTTIGMGL LFDTLIVRSFMTPSIAALLGKWFWWPQVVRQRPIPQPWPSPASARTFALV" gene complement(778477..778905) /gene="mmpS5" /locus_tag="Rv0677c" /db_xref="GeneID:888233" CDS complement(778477..778905) /gene="mmpS5" /locus_tag="Rv0677c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0677c, (MTV040.05c), len: 142 aa. Possible mmpS5, conserved membrane protein (see Tekaia et al., 1999), highly similar to other Mycobacterial proteins e.g. P54880|MMS4_MYCLE PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 443, E(): 1.4e-23, (47.1% identity in 155 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis. BELONGS TO THE MMPS FAMILY. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215191.1" /db_xref="GI:15607817" /db_xref="GeneID:888233" /translation="MIGTLKRAWIPLLILVVVAIAGFTVQRIRTFFGSEGILVTPKVF ADDPEPFDPKVVEYEVSGSGSYVNINYLDLDAKPQRIDGAALPWSLTLKTTAPSAAPN ILAQGDGTSITCRITVDGEVKDERTATGVDALTYCFVKSA" gene 778990..779487 /locus_tag="Rv0678" /db_xref="GeneID:888235" CDS 778990..779487 /locus_tag="Rv0678" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0678, (MTV040.06), len: 165 aa. Conserved hypothetical protein, showing weak similarity with AL049754|SCH10_10 hypothetical protein from Streptomyces coelicolor (152 aa), FASTA scores: opt: 149, E(): 0.0018, (22.9% identity in 140 aa overlap). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215192.1" /db_xref="GI:15607818" /db_xref="GeneID:888235" /translation="MSVNDGVDQMGAEPDIMEFVEQMGGYFESRSLTRLAGRLLGWLL VCDPERQSSEELATALAASSGGISTNARMLIQFGFIERLAVAGDRRTYFRLRPNAFAA GERERIRAMAELQDLADVGLRALGDAPPQRSRRLREMRDLLAYMENVVSDALGRYSQR TGEDD" gene complement(779543..780040) /locus_tag="Rv0679c" /db_xref="GeneID:888230" CDS complement(779543..780040) /locus_tag="Rv0679c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0679c, (MTV040.07c), len: 165 aa. Conserved hypothetical Thr-rich protein, similar in part to neighboring ORF Rv0680c (124 aa), FASTA score: (35.1% identity in 131 aa overlap); and Rv0314c (220 aa). Contains probable N-terminal signal sequence. TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="putative threonine rich protein" /protein_id="NP_215193.1" /db_xref="GI:15607819" /db_xref="GeneID:888230" /translation="MVEKPLRADRATHSRLATFALALAAAALPLAGCSSTANPPAATT TPATATTTTATSGPTAAPTVTTGESTTASIQIGDMLTYGSIGTTATLDCADGKSLNVA GSDNTLTVNGTCETVTVGGANNKIAFDRIDERLVVVGLDNTVTYKNGDPTIDNLGAGN RINKE" gene complement(780042..780416) /locus_tag="Rv0680c" /db_xref="GeneID:888229" CDS complement(780042..780416) /locus_tag="Rv0680c" /function="UNKNOWN" /note="Rv0680c, (MTV040.08c), len: 124 aa. Possible conserved transmembrane protein, showing similarity with C-terminal part of Rv0314c|Z96800|MTCY63.19c conserved hypothetical protein from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 175, E(): 2.2e-05, (31.4% identity in 102 aa overlap). Also some similarity to upstream ORF Rv0679c|MTV040.07c CONSERVED HYPOTHETICAL THREONINE RICH PROTEIN (124 aa), FASTA score: (35.1% identity in 131 aa overlap). Contains probable N-terminal signal sequence. TBparse score is 0.877." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215194.1" /db_xref="GI:15607820" /db_xref="GeneID:888229" /translation="MKWNTVAASLAAGVITIAVALAAPPPAAHAKNGDTHVTGQGIER TLDCNESTLLVNGTQNIVTALGTCWAVTVMGSSNTVVADTIINDITVYGWDETVFFRN GDPFIWDRGRELGMVNRLQRVG" gene 780721..781311 /locus_tag="Rv0681" /db_xref="GeneID:888239" CDS 780721..781311 /locus_tag="Rv0681" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0681, (MTV040.09), len: 196 aa. Probable transcription regulator, TetR family, similar to others and especially many tetracycline repressors e.g. T34657 probable transcription regulator from Streptomyces coelicolor (189 aa); AF0278|AF027868_40|NP_389788.1|NC_000964 yobS regulator from Bacillus subtilis (191 aa), FASTA scores: opt: 213, E(): 1.6e-07, (28.8% identity in 153 aa overlap); P09164|TER4_ECOLI TETRACYCLINE REPRESSOR PROTEIN from Escherichia coli (217 aa), FASTA scores: opt: 145, E(): 0.0068, (39.0% identity in 59 aa overlap); etc. Contains helix-turn-helix motif at aa 28-49 (Score 1020, +2.66 SD). TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_215195.1" /db_xref="GI:15607821" /db_xref="GeneID:888239" /translation="MARPAKLSRESIVEGALTFLDREGWDSLTINALATQLGTKGPSL YNHVDSLEDLRRAVRIRVIDDIITMLNRVGAGRARDDAVLVMAGAYRSYAHHHPGRYS AFTRMPLGGDDPEYTAATRGAAAPVIAVLSSYGLDGEQAFYAALEFWSALHGFVLLEM TGVMDDIDTDAVFTDMVLRLAAGMERRTTHGGTAST" gene 781560..781934 /gene="rpsL" /locus_tag="Rv0682" /db_xref="GeneID:888259" CDS 781560..781934 /gene="rpsL" /locus_tag="Rv0682" /function="PROTEIN S12 IS INVOLVED IN THE TRANSLATION INITIATION STEP." /note="interacts with and stabilizes bases of the 16S rRNA that are involved in tRNA selection in the A site and with the mRNA backbone; located at the interface of the 30S and 50S subunits, it traverses the body of the 30S subunit contacting proteins on the other side; mutations in the S12 gene confer streptomycin resistance" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S12" /protein_id="NP_215196.1" /db_xref="GI:15607822" /db_xref="GeneID:888259" /translation="MPTIQQLVRKGRRDKISKVKTAALKGSPQRRGVCTRVYTTTPKK PNSALRKVARVKLTSQVEVTAYIPGEGHNLQEHSMVLVRGGRVKDLPGVRYKIIRGSL DTQGVKNRKQARSRYGAKKEKG" misc_feature 781686..781709 /gene="rpsL" /locus_tag="Rv0682" /note="PS00055 Ribosomal protein S12 signature" gene 781934..782404 /gene="rpsG" /locus_tag="Rv0683" /db_xref="GeneID:888245" CDS 781934..782404 /gene="rpsG" /locus_tag="Rv0683" /function="PROTEIN S7 BINDS SPECIFICALLY TO PART OF THE 3' END OF 16S RIBOSOMAL RNA." /note="binds directly to 16S rRNA where it nucleates assembly of the head domain of the 30S subunit" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S7" /protein_id="NP_215197.1" /db_xref="GI:15607823" /db_xref="GeneID:888245" /translation="MPRKGPAPKRPLVNDPVYGSQLVTQLVNKVLLKGKKSLAERIVY GALEQARDKTGTDPVITLKRALDNVKPALEVRSRRVGGATYQVPVEVRPDRSTTLALR WLVGYSRQRREKTMIERLANEILDASNGLGASVKRREDTHKMAEANRAFAHYRW" misc_feature 781991..782071 /gene="rpsG" /locus_tag="Rv0683" /note="PS00052 Ribosomal protein S7 signature" gene 782485..784590 /gene="fusA1" /locus_tag="Rv0684" /gene_synonym="fusA" /db_xref="GeneID:888240" CDS 782485..784590 /gene="fusA1" /locus_tag="Rv0684" /gene_synonym="fusA" /function="THIS PROTEIN PROMOTES THE GTP-DEPENDENT TRANSLOCATION OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE RIBOSOME." /experiment="experimental evidence, no additional details recorded" /note="EF-G; promotes GTP-dependent translocation of the ribosome during translation; many organisms have multiple copies of this gene" /codon_start=1 /transl_table=11 /product="elongation factor G" /protein_id="YP_177746.1" /db_xref="GI:57116768" /db_xref="GeneID:888240" /translation="MAQKDVLTDLSRVRNFGIMAHIDAGKTTTTERILYYTGINYKIG EVHDGAATMDWMEQEQERGITITSAATTTFWKDNQLNIIDTPGHVDFTVEVERNLRVL DGAVAVFDGKEGVEPQSEQVWRQADKYDVPRICFVNKMDKIGADFYFSVRTMGERLGA NAVPIQLPVGAEADFEGVVDLVEMNAKVWRGETKLGETYDTVEIPADLAEQAEEYRTK LLEVVAESDEHLLEKYLGGEELTVDEIKGAIRKLTIASEIYPVLCGSAFKNKGVQPML DAVVDYLPSPLDVPPAIGHAPAKEDEEVVRKATTDEPFAALAFKIATHPFFGKLTYIR VYSGTVESGSQVINATKGKKERLGKLFQMHSNKENPVDRASAGHIYAVIGLKDTTTGD TLSDPNQQIVLESMTFPDPVIEVAIEPKTKSDQEKLSLSIQKLAEEDPTFKVHLDSET GQTVIGGMGELHLDILVDRMRREFKVEANVGKPQVAYKETIKRLVQNVEYTHKKQTGG SGQFAKVIINLEPFTGEEGATYEFESKVTGGRIPREYIPSVDAGAQDAMQYGVLAGYP LVNLKVTLLDGAYHEVDSSEMAFKIAGSQVLKKAAALAQPVILEPIMAVEVTTPEDYM GDVIGDLNSRRGQIQAMEERAGARVVRAHVPLSEMFGYVGDLRSKTQGRANYSMVFDS YSEVPANVSKEIIAKATGE" misc_feature 782542..782565 /gene="fusA1" /locus_tag="Rv0684" /gene_synonym="fusA" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 782644..782691 /gene="fusA1" /locus_tag="Rv0684" /gene_synonym="fusA" /note="PS00301 GTP-binding elongation factors signature" gene 784821..786011 /gene="tuf" /locus_tag="Rv0685" /db_xref="GeneID:888262" CDS 784821..786011 /gene="tuf" /locus_tag="Rv0685" /EC_number="3.6.5.3" /function="THIS PROTEIN PROMOTES THE GTP-DEPENDENT BINDING OF AMINOACYL-TRNA TO THE A-SITE OF RIBOSOMES DURING PROTEIN BIOSYNTHESIS." /experiment="experimental evidence, no additional details recorded" /note="EF-Tu; promotes GTP-dependent binding of aminoacyl-tRNA to the A-site of ribosomes during protein biosynthesis; when the tRNA anticodon matches the mRNA codon, GTP hydrolysis results; the inactive EF-Tu-GDP leaves the ribosome and release of GDP is promoted by elongation factor Ts; many prokaryotes have two copies of the gene encoding EF-Tu" /codon_start=1 /transl_table=11 /product="elongation factor Tu" /protein_id="NP_215199.1" /db_xref="GI:15607825" /db_xref="GeneID:888262" /translation="MAKAKFQRTKPHVNIGTIGHVDHGKTTLTAAITKVLHDKFPDLN ETKAFDQIDNAPEERQRGITINIAHVEYQTDKRHYAHVDAPGHADYIKNMITGAAQMD GAILVVAATDGPMPQTREHVLLARQVGVPYILVALNKADAVDDEELLELVEMEVRELL AAQEFDEDAPVVRVSALKALEGDAKWVASVEELMNAVDESIPDPVRETDKPFLMPVED VFTITGRGTVVTGRVERGVINVNEEVEIVGIRPSTTKTTVTGVEMFRKLLDQGQAGDN VGLLLRGVKREDVERGQVVTKPGTTTPHTEFEGQVYILSKDEGGRHTPFFNNYRPQFY FRTTDVTGVVTLPEGTEMVMPGDNTNISVKLIQPVAMDEGLRFAIREGGRTVGAGRVT KIIK" misc_feature 784875..784898 /gene="tuf" /locus_tag="Rv0685" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 784977..785024 /gene="tuf" /locus_tag="Rv0685" /note="PS00301 GTP-binding elongation factors signature" gene 786149..786946 /locus_tag="Rv0686" /db_xref="GeneID:888271" CDS 786149..786946 /locus_tag="Rv0686" /function="UNKNOWN" /note="Rv0686, (MTCY210.03), len: 265 aa. Probable membrane protein, with hydrophobic N-terminus. TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215200.1" /db_xref="GI:15607826" /db_xref="GeneID:888271" /translation="MLARYIKMQLLVLLCGGLVGPIFLVVYFTLGLGSLMSWMFYVGL IITVADVLVALALTNYGAKTAAKTAALERSGVLALAQITGLSETGTRINDQPLVKVHL HISGPGITPFDTEDRVIASVTRLGNLTARKLVVLVNPATQQYLIDWERSALVNGLVPA QFTVAEDNKTYDLSGQTGPLMEILQILKANNVPLNRMVDIRSNPALRQQVQAVVRRAA ERQAPAAEPASQGSIAERLAELESLRASGAVNAAEYESKRAQIISEI" gene 787099..787926 /gene="fabG" /locus_tag="Rv0687" /db_xref="GeneID:888279" CDS 787099..787926 /gene="fabG" /locus_tag="Rv0687" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="catalyzes the first of the two reduction steps in the elongation cycle of fatty acid synthesis" /codon_start=1 /transl_table=11 /product="3-ketoacyl-(acyl-carrier-protein) reductase" /protein_id="NP_215201.1" /db_xref="GI:15607827" /db_xref="GeneID:888279" /translation="MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICA PVSGSVTYPPATSEDLGETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGRLD IVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAGNGGSIVVVSS SAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNSIHPYSVDTPMIEPEAMIQ TFAKHPGYVHSFPPMPLQPKGFMTPDEISDVVVWLAGDGSGALSGNQIPVDKGALKY" misc_feature 787576..787662 /gene="fabG" /locus_tag="Rv0687" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 787940..789160 /locus_tag="Rv0688" /db_xref="GeneID:888280" CDS 787940..789160 /locus_tag="Rv0688" /function="FERREDOXINS ARE IRON-SULFUR PROTEINS THAT TRANSFER ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS." /note="Rv0688, (MTCY210.05), len: 406 aa. Putative ferredoxin reductase (EC 1.-.-.-), highly similar to others e.g. BAB55881.1|AB054975 ferredoxin reductase from Terrabacter sp. DBF63 (410 aa); CAC04223.1|AL391515 putative ferredoxin reductase from Streptomyces coelicolor (420 aa); PPU24215_8|Q51973 P-CUMATE DIOXYGENASE FERREDOXIN REDUCTASE SUBUNIT from Pseudomonas putida (402 aa), FASTA scores: opt: 738, E(): 0, (38.8% identity in 330 aa overlap); etc. Also similar to Rv0253 and Rv1869c from Mycobacterium tuberculosis. COULD BELONG TO THE BACTERIAL TYPE FERREDOXIN FAMILY. TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="putative ferredoxin reductase" /protein_id="NP_215202.1" /db_xref="GI:15607828" /db_xref="GeneID:888280" /translation="MNAHVTSREGVNEFDDGIVIVGGGLAAARTAEQLRRAGYSGRLT IVSDEVHLPYDRPPLSKEVLRSEVDDVALKPREFYDEKDIALRLGSAAVSLDTGEQTV TLADGTVLGYDELVIATGLVPRRIPSLPDLDGIRVLRSFDESMALRKHASAARHAVVV GAGFIGCEVAASLRGLGVDVVLVEPQPAPLASVLGEQIGQLVTRLHRDEGVDVRTGVT VAEVRGKGHVDAVVLTDGTELPADLVVVGIGSTPATEWLEGSGVEVDNGVICDKAGRT SAPNVWALGDVASWRDPMGHQARVEHWSNVADQARVVVPAMLGTDVPTGVVVPYFWSD QYDVKIQCLGEPHATDVVHLVEDDGRKFLAYYERDGVLVGVVGGGMAGKVMKVRGKIA AGAPIAEVLDQTQA" gene complement(789157..789411) /locus_tag="Rv0689c" /db_xref="GeneID:888283" CDS complement(789157..789411) /locus_tag="Rv0689c" /function="UNKNOWN" /note="Rv0689c, (MTCY210.06c), len: 84 aa. Hypothetical unknown protein. TBparse score is 0.879." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215203.1" /db_xref="GI:15607829" /db_xref="GeneID:888283" /translation="MLGWTVKPGRVADGWQAPGVHLMARCSGPQPASERRADMDGGDI DAAVARVRAAGALAEPSRQPDDMSAECADDQGARCHLGQL" gene complement(790024..791073) /locus_tag="Rv0690c" /db_xref="GeneID:888292" CDS complement(790024..791073) /locus_tag="Rv0690c" /function="UNKNOWN" /note="Rv0690c, (MTCY210.07c), len: 349 aa. Conserved hypothetical protein, showing similarity with NP_386956.1|NC_003047 CONSERVED HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (358 aa); NP_356573.1|NC_003063 AGR_L_1570p from Agrobacterium tumefaciens (346 aa); NP_421938.1|NC_002696 conserved hypothetical protein from Caulobacter crescentus (370 aa). TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215204.1" /db_xref="GI:15607830" /db_xref="GeneID:888292" /translation="MTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFA SILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRT ATDQPESLRAALDRPPQTNEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPD RYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNA LSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQ YLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHAR VLGECHPHGPPVTWQ" gene complement(791070..791666) /locus_tag="Rv0691c" /db_xref="GeneID:888296" CDS complement(791070..791666) /locus_tag="Rv0691c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv0691c, (MTCY210.08c), len: 198 aa. Probable transcriptional regulator, highly similar to AAC77476.1|U17129 unknown protein from Rhodococcus erythropolis (185 aa); and showing similarity with putative regulatory proteins eg STMTCREP_1|TCMR_STRGA|P39885 tetracenomycin c transcriptional repressor from Streptomyces glaucescens (226 aa), FASTA scores: opt: 178, E(): 8.5e-06, (27.9% identity in 201 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and probable helix-turn helix motifs from aa 34-55 (Score 1100, +2.93 SD) and 151-172 (Score 1124, +3.02 SD). TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215205.1" /db_xref="GI:15607831" /db_xref="GeneID:888296" /translation="MPHESRVGRRRSTTPHHISDVAIELFAAHGFTDVSVDDIARAAG IARRTLFRYYASKNAIPWGDFSTHLAQLQGLLDNIDSRIQLRDALRAALLAFNTFDES ETIRHRKRMRVILQTPELQAYSMTMYAGWREVIAKFVARRSGGKTTDFMPQTVAWTML GVALSAYEHWLRDESVSLTEALGAAFDVVGAGLDRLNQ" misc_feature complement(791226..791249) /locus_tag="Rv0691c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 791831..792160 /locus_tag="Rv0692" /db_xref="GeneID:888285" CDS 791831..792160 /locus_tag="Rv0692" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0692, (MTCY210.09), len: 109 aa. Conserved hypothetical protein, highly similar to U17129|RSU17129_3|AAC77477.1 unknown protein from Rhodococcus erythropolis (95 aa), FASTA scores: opt: 393, E(): 8.8e-22, (68.2% identity in 88 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215206.1" /db_xref="GI:15607832" /db_xref="GeneID:888285" /translation="MWGLLTVPAPAQARRADSSEFDPDRGWRLHPQVAVRPEPFGALL YHFGTRKLSFLKNRTILAVVQTLADYPDIRSACRGAGVDDCDQDPYLHALSVLAGSNM LVPRQTT" gene 792157..793332 /gene="pqqE" /locus_tag="Rv0693" /db_xref="GeneID:888302" CDS 792157..793332 /gene="pqqE" /locus_tag="Rv0693" /function="REQUIRED FOR COENZYME PYRROLO-QUINOLINE-QUINONE (PQQ) BIOSYNTHESIS." /note="Rv0693, (MTCY210.10), len: 391 aa. Probable pqqE (alternate gene name: pqqIII), coenzyme PQQ synthesis protein E, similar to others AE001109_9|O30258|PQQE COENZYME PQQ SYNTHESIS PROTEIN from Archaeoglobus fulgidus (375 aa), FASTA scores: E(): 1.6e-16, (28.1% identity in 377 aa overlap); PQQE_ACICA|P07782 coenzyme pqq synthesis protein e from Acinetobacter calcoaceticus (384 aa), FASTA scores: opt: 302, E(): 1.8e-12, (23.9% identity in 377 aa overlap); etc. Also similar to C-terminus of heme biosynthesis proteins e.g. O28270|AF2009 HEME BIOSYNTHESIS PROTEIN (NIRJ-2) from Archaeoglobus fulgidus (468 aa). Note that also highly similar to U17129|RSU17129_4|AAC77478.1 unknown protein from Rhodococcus erythropolis (405 aa), FASTA scores: opt: 1997, E(): 0, (73.3% identity in 390 aa overlap). COULD BELONG TO THE MOAA / NIFB / PQQE FAMILY. TBparse score is 0.919.; pqqIII" /codon_start=1 /transl_table=11 /product="coenzyme PQQ synthesis protein E" /protein_id="NP_215207.1" /db_xref="GI:15607833" /db_xref="GeneID:888302" /translation="MTSPVPRLIEQFERGLDAPICLTWELTYACNLACVHCLSSSGKR DPGELSTRQCKDIIDELERMQVFYVNIGGGEPTVRPDFWELVDYATAHHVGVKFSTNG VRITPEVATRLAATDYVDVQISLDGATAEVNDAIRGTGSFDMAVRALQNLAAAGFAGV KISVVITRRNVAQLDEFATLASRYGATLRITRLRPSGRGTDVWADLHPTADQQVQLYD WLVSKGERVLTGDSFFHLAPLGQSGALAGLNMCGAGRVVCLIDPVGDVYACPFAIHDH FLAGNVLSDGGFQNVWKNSSLFRELREPQSAGACGSCGHYDSCRGGCMAAKFFTGLPL DGPDPECVQGHSEPALARERHLPRPRADHSRGRRVSKPVPLTLSMRPPKRPCNESPV" gene 793335..794525 /gene="lldD1" /locus_tag="Rv0694" /db_xref="GeneID:888310" CDS 793335..794525 /gene="lldD1" /locus_tag="Rv0694" /EC_number="1.1.2.3" /function="INVOLVED IN RESPIRATION; CATALYZES CONVERSION OF LACTATE INTO PYRUVATE [CATALYTIC ACTIVITY: (S)-LACTATE + 2 FERRICYTOCHROME C = PYRUVATE + 2 FERROCYTOCHROME C]." /note="Rv0694, (MTCY210.11), len: 396 aa. Possible lldD1, L-lactate dehydrogenase (cytochrome) (EC 1.1.2.3), similar to NP_302368.1|NC_002677 L-lactate dehydrogenase from Mycobacterium leprae (414 aa). Also similar to others e.g. NP_384560.1|NC_003047 PUTATIVE L-LACTATE DEHYDROGENASE (CYTOCHROME) PROTEIN from Sinorhizobium meliloti (403 aa); NP_251072.1|NC_002516 L-lactate dehydrogenase from Pseudomonas aeruginosa (383 aa); P33232|LLDD_ECOLI L-lactate dehydrogenase (cytochrome) from Escherichia coli strain K12 (396 aa), FASTA scores: opt: 697, E(): 0, (34.5 identity in 380 aa overlap); etc; and also similar to other oxidoreductases. Note that also highly similar to RSU17129_5|AAC77479.1|U17129 unknown protein from Rhodococcus erythropolis (392 aa), FASTA scores: opt: 2006, E(): 0, (74.1% identity in 386 aa overlap). Also similar to lldD2|Rv1872c|MTCY180.46|MTCY359.01 POSSIBLE L-LACTATE DEHYDROGENASE (CYTOCHROME) from Mycobacterium tuberculosis (414 aa). BELONGS TO THE FMN-DEPENDENT ALPHA-HYDROXY ACID DEHYDROGENASES FAMILY. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="L-lactate dehydrogenase (cytochrome) LldD1" /protein_id="NP_215208.1" /db_xref="GI:15607834" /db_xref="GeneID:888310" /translation="MAEAWFETVAIAQQRAKRRLPKSVYSSLIAASEKGITVADNVAA FSELGFAPHVIGATDKRDLSTTVMGQEVSLPVIISPTGVQAVDPGGEVAVARAAAARG TVMGLSSFASKPIEEVIAANPKTFFQVYWQGGRDALAERVERARQAGAVGLVVTTDWT FSHGRDWGSPKIPEEMNLKTILRLSPEAITRPRWLWKFAKTLRPPDLRVPNQGRRGEP GPPFFAAYGEWMATPPPTWEDIGWLRELWGGPFMLKGVMRVDDAKRAVDAGVSAISVS NHGGNNLDGTPASIRALPAVSAAVGDQVEVLLDGGIRRGSDVVKAVALGARAVMIGRA YLWGLAANGQAGVENVLDILRGGIDSALMGLGHASVHDLSPADILVPTGFIRDLGVPS RRDV" gene 794715..795470 /locus_tag="Rv0695" /db_xref="GeneID:888314" CDS 794715..795470 /locus_tag="Rv0695" /function="UNKNOWN" /note="Rv0695, (MTCY210.12), len: 251 aa. Conserved hypothetical protein, similar to many creatinine amidohydrolases or hypothetical proteins e.g. NP_443048.1|NC_000911 creatinine amidohydrolase from Synechococcus sp. PCC 6803 (273 aa); NP_466169.1|NC_003210 protein similar to creatinine amidohydrolase from Listeria monocytogenes (249 aa); T35153|SC5A7.04c hypothetical protein from Streptomyces coelicolor (273 aa); etc. Note that highly similar to RSU17129_10|AAC77474.1|U17129 unknown protein from Rhodococcus erythropolis (230 aa), FASTA scores: opt: 693, E(): 0, (55.7% identity in 237 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215209.1" /db_xref="GI:15607835" /db_xref="GeneID:888314" /translation="MNSSYHRRVPVVGELGSATSSQLPSTSPSIVIPLGSTEQHGPHL PLDTDTRIATAVARTVTARLHAEDLPIAQEEWLMAPAIAYGASGEHQRFAGTISIGTE ALTMLLVEYGRSAACWARRLVFVNGHGGNVGALTRAVGLLRAEGRDAGWCPCTCPGGD PHAGHTETSVLLHLSPADVRTERWRAGNRAPLPVLLPSMRRGGVAAVSETGVLGDPTT ATAAEGRRIFAAMVDDCVRRVARWMPQPDGMLT" repeat_region 795467..795518 /note="52 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 795519..796931 /locus_tag="Rv0696" /db_xref="GeneID:888307" CDS 795519..796931 /locus_tag="Rv0696" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0696, (MTCY210.13), len: 470 aa. Probable membrane sugar transferase (EC 2.-.-.-), similar (except in N-terminus) to NP_069157.1|NC_000917 glycosyl transferase from Archaeoglobus fulgidus (324 aa); NP_279985.1|NC_002607 rhamnosyl transferase from Halobacterium sp. NRC-1 (299 aa); NP_059113.1|NM_017417 polypeptide N-acetylgalactosaminyltransferase 8 from (637 aa). Note that also highly similar to P46370|YTH1_RHOER HYPOTHETICAL 55.3 KDA PROTEIN from Rhodococcus erythropolis (513 aa), FASTA scores: opt: 1514, E(): 0, (51.8% identity in 469 aa overlap). TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="membrane sugar transferase" /protein_id="NP_215210.1" /db_xref="GI:15607836" /db_xref="GeneID:888307" /translation="MTATRLPDGFAVQVDRRVRVLGDGSALLGGSPTRLLRLAPAARG LLCDGRLKVRDEVSAELARILLDATVAHPRPPSGPSHRDVTVVIPVRNNASGLRRLVT SLRGLRVIVVDDGSACPVESDDFVGAHCDIEVLHHPHSKGPAAARNTGLAACTTDFVA FLDSDVTPRRGWLESLLGHFCDPTVALVAPRIVSLVEGENPVARYEALHSSLDLGQRE APVLPHSTVSYVPSAAIVCRSSAIRDVGGFDETMHSGEDVDLCWRLIEAGARLRYEPI ALVAHDHRTQLRDWIARKAFYGGSAAPLAVRHPDKTAPLVISGGALMAWILMSIGTGL GRLASLVIAVLTGRRIARAMRCAETSFLDVLAVATRGLWAAALQLASAICRHYWPLAL LAAILSRRCRRVVLIAAVVDGVVDWLRRREGADDDAEPIGPLTYLVLKRVDDLAYGAG LWYGVVRERNIGALKPQIRT" gene 796933..798372 /locus_tag="Rv0697" /db_xref="GeneID:888316" CDS 796933..798372 /locus_tag="Rv0697" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0697, (MTCY210.14, unknown), len: 479 aa. Probable dehydrogenase (EC 1.-.-.-), highly similar to P30772|YTUR_MYCLE HYPOTHETICAL 24 kDa PROTEIN from Mycobacterium leprae (220 aa), FASTA scores: opt: 557, E(): 1.7e-28, (46.2% identity in 223 aa overlap). Also highly similar to P46371|YTH2_RHOER HYPOTHETICAL 53.0 KDA GMC-TYPE OXIDOREDUCTASE from Rhodococcus erythropolis (493 aa); and similar to many dehydrogenases e.g. NP_250814.1|NC_002516 probable dehydrogenase from Pseudomonas aeruginosa (545 aa); BAA13145.1|D86622 FAD dependent L-sorbose dehydrogenase from Gluconobacter oxydans (531 aa); etc. Also similar to Rv1279 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_215211.1" /db_xref="GI:15607837" /db_xref="GeneID:888316" /translation="MTAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLA DPGLLAQTANGLQLPIGAGSPLVERYRTRLTDRPVRHLPIVRGATVGGSGAINGGYFC RGLPSDFDRASIPGWAWSDVLEHFRAIETDLDFETPVHGRSGPIPVRRTHEMTGITES FMAAAEDAGFAWIADLNDVGPEMPSGVGAVPLNIVNGVRTSSAVGYLMPALGRPNLTL LARTRAVRLRFSATTAVGVDAIGPGGPVSLSADRIVLCAGAIQSAHLLMLSGVGEEEV LRSAGVKVLMALPVGMGCSDHPEWVMPTNWAVAVDRPVLEVLLSTHDGIEIRPYTGGF VAMTGDGTAGHRDWPHIGVALMQPRARGRITLVSSDPQIPVRIEHRYDSEPADVAALR QGSALAHELCGAATRIGPAVWATSQHLCGSAPMGTDDDPRAVVDPRCRVRGIENLWVI DGSVLPSITSRGPHATIVMLGHRAAEFVQ" gene 798833..799444 /locus_tag="Rv0698" /db_xref="GeneID:888327" CDS 798833..799444 /locus_tag="Rv0698" /function="UNKNOWN" /note="Rv0698, (MTCY210.15), len: 203 aa. Conserved hypothetical protein, highly similar to C-terminus of Rv3639c|MTY15C10.12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (188 aa), FASTA scores: E(): 2.1e-07, (54.8% identity in 73 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215212.1" /db_xref="GI:15607838" /db_xref="GeneID:888327" /translation="MGRRGNRRVHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGAN GLPLAVCTTTAHTCHTSHTHPSRWTPNPVPATKGVPAGLVQATFIIENLDPGNNDTPT PPTPKLRLARKPGHHRRSEYDADSVLRRKDTSRRCVQADDVRCVQLVQDPRRGRVELG GYRAELTVGRRAAVNCQRPQYGADGWPVRLGCGVGGAARGDQR" gene 799629..799850 /locus_tag="Rv0699" /db_xref="GeneID:888335" CDS 799629..799850 /locus_tag="Rv0699" /function="UNKNOWN" /note="Rv0699, (MTCY210.17), len: 73 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215213.1" /db_xref="GI:15607839" /db_xref="GeneID:888335" /translation="MGDRRVDLLAAKDSEIRRSMGAVPVGAGSSQVATSWASDRCIRC RAAILSADCANLARANSRGGLAVGGSAVS" gene 800487..800792 /gene="rpsJ" /locus_tag="Rv0700" /gene_synonym="nusE" /db_xref="GeneID:888331" CDS 800487..800792 /gene="rpsJ" /locus_tag="Rv0700" /gene_synonym="nusE" /function="THIS PROTEIN IS INVOLVED IN THE BINDING OF tRNA TO THE RIBOSOMES, AND IN THE REGULATION OF rRNA BIOSYNTHESIS (BY MODULATING THE EFFICIENCY OF TRANSCRIPTIONAL TERMINATION). INTERACTS WITH NUSB|Rv2533c." /experiment="experimental evidence, no additional details recorded" /note="NusE; involved in assembly of the 30S subunit; in the ribosome, this protein is involved in the binding of tRNA; in Escherichia coli this protein was also found to be involved in transcription antitermination; NusB/S10 heterodimers bind boxA sequences in the leader RNA of rrn operons which is required for antitermination; binding of NusB/S10 to boxA nucleates assembly of the antitermination complex" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S10" /protein_id="NP_215214.1" /db_xref="GI:15607840" /db_xref="GeneID:888331" /translation="MAGQKIRIRLKAYDHEAIDASARKIVETVVRTGASVVGPVPLPT EKNVYCVIRSPHKYKDSREHFEMRTHKRLIDIIDPTPKTVDALMRIDLPASVDVNIQ" misc_feature 800571..800618 /gene="rpsJ" /locus_tag="Rv0700" /gene_synonym="nusE" /note="PS00361 Ribosomal protein S10 signature" gene 800809..801462 /gene="rplC" /locus_tag="Rv0701" /db_xref="GeneID:888343" CDS 800809..801462 /gene="rplC" /locus_tag="Rv0701" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA AND MAY PARTICIPATE IN THE FORMATION OF THE PEPTIDYLTRANSFERASE CENTER OF THE RIBOSOME." /experiment="experimental evidence, no additional details recorded" /note="binds directly near the 3' end of the 23S rRNA, where it nucleates assembly of the 50S subunit; essential for peptidyltransferase activity; mutations in this gene confer resistance to tiamulin" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L3" /protein_id="NP_215215.1" /db_xref="GI:15607841" /db_xref="GeneID:888343" /translation="MARKGILGTKLGMTQVFDESNRVVPVTVVKAGPNVVTRIRTPER DGYSAVQLAYGEISPRKVNKPLTGQYTAAGVNPRRYLAELRLDDSDAATEYQVGQELT AEIFADGSYVDVTGTSKGKGFAGTMKRHGFRGQGASHGAQAVHRRPGSIGGCATPARV FKGTRMAGRMGNDRVTVLNLLVHKVDAENGVLLIKGAVPGRTGGLVMVRSAIKRGEK" misc_feature 801124..801195 /gene="rplC" /locus_tag="Rv0701" /note="PS00474 Ribosomal protein L3 signature" gene 801462..802133 /gene="rplD" /locus_tag="Rv0702" /db_xref="GeneID:888345" CDS 801462..802133 /gene="rplD" /locus_tag="Rv0702" /function="THIS PROTEIN BINDS DIRECTLY AND SPECIFICALLY TO 23S RRNA." /experiment="experimental evidence, no additional details recorded" /note="L4 is important during the early stages of 50S assembly; it initially binds near the 5' end of the 23S rRNA" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L4" /protein_id="NP_215216.1" /db_xref="GI:15607842" /db_xref="GeneID:888345" /translation="MAAQEQKTLKIDVKTPAGKVDGAIELPAELFDVPANIALMHQVV TAQRAAARQGTHSTKTRGEVSGGGRKPYRQKGTGRARQGSTRAPQFTGGGVVHGPKPR DYSQRTPKKMIAAALRGALSDRARNGRIHAITELVEGQNPSTKSARAFLASLTERKQV LVVIGRSDEAGAKSVRNLPGVHILAPDQLNTYDVLRADDVVFSVEALNAYIAANTTTS EEVSA" gene 802133..802435 /gene="rplW" /locus_tag="Rv0703" /db_xref="GeneID:888353" CDS 802133..802435 /gene="rplW" /locus_tag="Rv0703" /function="BINDS TO A SPECIFIC REGION ON THE 23S RRNA." /experiment="experimental evidence, no additional details recorded" /note="binds third domain of 23S rRNA and protein L29; part of exit tunnel" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L23" /protein_id="NP_215217.1" /db_xref="GI:15607843" /db_xref="GeneID:888353" /translation="MATLADPRDIILAPVISEKSYGLLDDNVYTFLVRPDSNKTQIKI AVEKIFAVKVASVNTANRQGKRKRTRTGYGKRKSTKRAIVTLAPGSRPIDLFGAPA" misc_feature 802370..802417 /gene="rplW" /locus_tag="Rv0703" /note="PS00050 Ribosomal protein L23 signature" repeat_region 802429..802477 /note="49 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 802528..803370 /gene="rplB" /locus_tag="Rv0704" /db_xref="GeneID:888341" CDS 802528..803370 /gene="rplB" /locus_tag="Rv0704" /function="THIS PROTEIN IS A PRIMARY 23S RRNA-BINDING PROTEIN. IT HAS PEPTIDYLTRANSFERASE ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="one of the primary rRNA-binding proteins; required for association of the 30S and 50S subunits to form the 70S ribosome, for tRNA binding and peptide bond formation" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L2" /protein_id="NP_215218.1" /db_xref="GI:15607844" /db_xref="GeneID:888341" /translation="MAIRKYKPTTPGRRGASVSDFAEITRSTPEKSLVRPLHGRGGRN AHGRITTRHKGGGHKRAYRMIDFRRNDKDGVNAKVAHIEYDPNRTARIALLHYLDGEK RYIIAPNGLSQGDVVESGANADIKPGNNLPLRNIPAGTLIHAVELRPGGGAKLARSAG SSIQLLGKEASYASLRMPSGEIRRVDVRCRATVGEVGNAEQANINWGKAGRMRWKGKR PSVRGVVMNPVDHPHGGGEGKTSGGRHPVSPWGKPEGRTRNANKSSNKFIVRRRRTGK KHSR" misc_feature 803182..803217 /gene="rplB" /locus_tag="Rv0704" /note="PS00467 Ribosomal protein L2 signature" gene 803411..803692 /gene="rpsS" /locus_tag="Rv0705" /db_xref="GeneID:888356" CDS 803411..803692 /gene="rpsS" /locus_tag="Rv0705" /function="PROTEIN S19 FORMS A COMPLEX WITH S13 THAT BINDS STRONGLY TO THE 16S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="protein S19 forms a complex with S13 that binds strongly to the 16S ribosomal RNA" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S19" /protein_id="NP_215219.1" /db_xref="GI:15607845" /db_xref="GeneID:888356" /translation="MPRSLKKGPFVDEHLLKKVDVQNEKNTKQVIKTWSRRSTIIPDF IGHTFAVHDGRKHVPVFVTESMVGHKLGEFAPTRTFKGHIKDDRKSKRR" misc_feature 803567..803641 /gene="rpsS" /locus_tag="Rv0705" /note="PS00323 Ribosomal protein S19 signature" gene 803689..804282 /gene="rplV" /locus_tag="Rv0706" /db_xref="GeneID:888359" CDS 803689..804282 /gene="rplV" /locus_tag="Rv0706" /function="THIS PROTEIN BINDS SPECIFICALLY TO 23S RRNA; ITS BINDING IS STIMULATED BY OTHER RIBOSOMAL PROTEINS, E.G., L4, L17, AND L20. IT IS IMPORTANT DURING THE EARLY STAGES OF 50S RECONSTITUTION." /experiment="experimental evidence, no additional details recorded" /note="binds specifically to 23S rRNA during the early stages of 50S assembly; makes contact with all 6 domains of the 23S rRNA in the assembled 50S subunit and ribosome; mutations in this gene result in erythromycin resistance; located near peptidyl-transferase center" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L22" /protein_id="NP_215220.1" /db_xref="GI:15607846" /db_xref="GeneID:888359" /translation="MTAATKATEYPSAVAKARFVRVSPRKARRVIDLVRGRSVSDALD ILRWAPQAASGPVAKVIASAAANAQNNGGLDPATLVVATVYADQGPTAKRIRPRAQGR AFRIRRRTSHITVVVESRPAKDQRSAKSSRARRTEASKAASKVGATAPAKKAAAKAPA KKAPASSGVKKTPAKKAPAKKAPAKASETSAAKGGSD" misc_feature 803965..804039 /gene="rplV" /locus_tag="Rv0706" /note="PS00464 Ribosomal protein L22 signature" gene 804282..805106 /gene="rpsC" /locus_tag="Rv0707" /db_xref="GeneID:888357" CDS 804282..805106 /gene="rpsC" /locus_tag="Rv0707" /function="THIS PROTEIN IS INVOLVED IN THE BINDING OF INITIATOR MET-TRNA." /experiment="experimental evidence, no additional details recorded" /note="forms a complex with S10 and S14; binds the lower part of the 30S subunit head and the mRNA in the complete ribosome to position it for translation" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S3" /protein_id="NP_215221.1" /db_xref="GI:15607847" /db_xref="GeneID:888357" /translation="MGQKINPHGFRLGITTDWKSRWYADKQYAEYVKEDVAIRRLLSS GLERAGIADVEIERTRDRVRVDIHTARPGIVIGRRGTEADRIRADLEKLTGKQVQLNI LEVKNPESQAQLVAQGVAEQLSNRVAFRRAMRKAIQSAMRQPNVKGIRVQCSGRLGGA EMSRSEFYREGRVPLHTLRADIDYGLYEAKTTFGRIGVKVWIYKGDIVGGKRELAAAA PAGADRPRRERPSGTRPRRSGASGTTATGTDAGRAAGGEEAAPDAAAPVEAQSTES" gene 805110..805526 /gene="rplP" /locus_tag="Rv0708" /db_xref="GeneID:888377" CDS 805110..805526 /gene="rplP" /locus_tag="Rv0708" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA AND IS LOCATED AT THE A SITE OF THE PEPTIDYLTRANSFERASE CENTER." /experiment="experimental evidence, no additional details recorded" /note="located in the peptidyl transferase center and may be involved in peptidyl transferase activity; similar to archaeal L10e" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L16" /protein_id="NP_215222.1" /db_xref="GI:15607848" /db_xref="GeneID:888377" /translation="MLIPRKVKHRKQHHPRQRGIASGGTTVNFGDYGIQALEHAYVTN RQIESARIAINRHIKRGGKVWINIFPDRPLTKKPAETRMGSGKGSPEWWVANVKPGRV LFELSYPNEGVARAALTRAIHKLPIKARIITREEQF" misc_feature 805353..805388 /gene="rplP" /locus_tag="Rv0708" /note="PS00701 Ribosomal protein L16 signature 2" gene 805526..805759 /gene="rpmC" /locus_tag="Rv0709" /db_xref="GeneID:888374" CDS 805526..805759 /gene="rpmC" /locus_tag="Rv0709" /function="INVOLVED IN TRANSLATION MECHANISMS." /experiment="experimental evidence, no additional details recorded" /note="one of the stabilizing components for the large ribosomal subunit" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L29" /protein_id="NP_215223.1" /db_xref="GI:15607849" /db_xref="GeneID:888374" /translation="MAVGVSPGELRELTDEELAERLRESKEELFNLRFQMATGQLNNN RRLRTVRQEIARIYTVLRERELGLATGPDGKES" gene 805756..806166 /gene="rpsQ" /locus_tag="Rv0710" /db_xref="GeneID:888391" CDS 805756..806166 /gene="rpsQ" /locus_tag="Rv0710" /function="PROTEIN S17 BINDS SPECIFICALLY TO THE 5' END OF 16S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="primary binding protein; helps mediate assembly; involved in translation fidelity" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S17" /protein_id="NP_215224.1" /db_xref="GI:15607850" /db_xref="GeneID:888391" /translation="MMAEAKTGAKAAPRVAKAAKAAPKKAAPNDAEAIGAANAANVKG PKHTPRTPKPRGRRKTRIGYVVSDKMQKTIVVELEDRMRHPLYGKIIRTTKKVKAHDE DSVAGIGDRVSLMETRPLSATKRWRLVEILEKAK" misc_feature 806080..806118 /gene="rpsQ" /locus_tag="Rv0710" /note="PS00056 Ribosomal protein S17 signature" gene 806335..808698 /gene="atsA" /locus_tag="Rv0711" /db_xref="GeneID:888394" CDS 806335..808698 /gene="atsA" /locus_tag="Rv0711" /EC_number="3.1.6.1" /function="THOUGHT TO PLAY AN IMPORTANT ROLE IN THE MINERALIZATION OF SULFATES [CATALYTIC ACTIVITY: A phenol sulfate + H2O = a phenol + sulfate]." /note="Rv0711, (MTCY210.30), len: 787 aa. Possible atsA, arylsulfatase (EC 3.1.6.1), similar to others e.g. P51691|ARS_PSEAE arylsulfatase from Pseudomonas aeruginosa (532 aa), FASTA scores: opt: 439, E(): 2.9e-21, (30.8% identity in 552 aa overlap); etc. Also similar to other hypothetical arylsulfatases from Mycobacterium tuberculosis e.g. Rv3299c, Rv0663, etc. Contains PS00523 Sulfatases signature 1, and PS00149 Sulfatases signature 2. BELONGS TO THE SULFATASE FAMILY." /codon_start=1 /transl_table=11 /product="arylsulfatase AtsA" /protein_id="NP_215225.1" /db_xref="GI:15607851" /db_xref="GeneID:888394" /translation="MAPEATEAFNGTIELDIRDSEPDWGPYAAPVAPEHSPNILYLVW DDVGIATWDCFGGLVEMPAMTRVAERGVRLSQFHTTALCSPTRASLLTGRNATTVGMA TIEEFTDGFPNCNGRIPADTALLPEVLAEHGYNTYCVGKWHLTPLEESNMASTKRHWP TSRGFERFYGFLGGETDQWYPDLVYDNHPVSPPGTPEGGYHLSKDIADKTIEFIRDAK VIAPDKPWFSYVCPGAGHAPHHVFKEWADRYAGRFDMGYERYREIVLERQKALGIVPP DTELSPINPYLDVPGPNGETWPLQDTVRPWDSLSDEEKKLFCRMAEVFAGFLSYTDAQ IGRILDYLEESGQLDNTIIVVISDNGASGEGGPNGSVNEGKFFNGYIDTVAESMKLFD HLGGPQTYNHYPIGWAMAFNTPYKLFKRYASHEGGIADPAIISWPNGIAAHGEIRDNY VNVSDITPTVYDLLGMTPPGTVKGIPQKPMDGVSFIAALADPAADTGKTTQFYTMLGT RGIWHEGWFANTIHAATPAGWSNFNADRWELFHIAADRSQCHDLAAEHPDKLEELKAL WFSEAAKYNGLPLADLNLLETMTRSRPYLVSERASYVYYPDCADVGIGAAVEIRGRSF AVLADVTIDTTGAEGVLFKHGGAHGGHVLFVRDGRLHYVYNFLGERQQLVSSSGPVPS GRHLLGVRYLRTGTVPNSHTPVGDLELFFDENLVGALTNVLTHPGTFGLAGAAISVGR NGGSAVSSHYEAPFAFTGGTITQVTVDVSGRPFEDVESDLALAFSRD" misc_feature 806452..806496 /gene="atsA" /locus_tag="Rv0711" /note="PS00678 Beta-transducin family Trp-Asp repeats signature" misc_feature 806575..806613 /gene="atsA" /locus_tag="Rv0711" /note="PS00523 Sulfatases signature 1" misc_feature 806731..806760 /gene="atsA" /locus_tag="Rv0711" /note="PS00149 Sulfatases signature 2" gene 808746..809645 /locus_tag="Rv0712" /db_xref="GeneID:888346" CDS 808746..809645 /locus_tag="Rv0712" /function="UNKNOWN" /note="Rv0712, (MTCY210.31), len: 299 aa. Conserved hypothetical protein, similar to others e.g. NP_106128.1|NC_002678 hypothetical protein from Mesorhizobium loti (372 aa); D90901_33|P72841 HYPOTHETICAL 48.1 kDa PROTEIN from Synechocystis sp (410 aa), FASTA scores: E(): 1.1e-07, (28.8% identity in 299 aa overlap); etc. Slight similarity to carboxykinases. Similar to C-terminal part of Rv3703c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (425 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215226.1" /db_xref="GI:15607852" /db_xref="GeneID:888346" /translation="MLTELVDLPGGSFRMGSTRFYPEEAPIHTVTVRAFAVERHPVTN AQFAEFVSATGYVTVAEQPLDPGLYPGVDAADLCPGAMVFCPTAGPVDLRDWRQWWDW VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYARWAGRRLPTEAEWEYAARGGTT ATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTSPVGRFPANGFGLLDMIGNVW EWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKGGSHLCAPEYCHRYRPAAR SPQSQDTATTHIGFRCVADPVSG" gene 809946..810887 /locus_tag="Rv0713" /db_xref="GeneID:888405" CDS 809946..810887 /locus_tag="Rv0713" /function="UNKNOWN" /note="Rv0713, (MTCY210.32), len: 313 aa. Probable conserved transmembrane protein, similar to Rv3435c|MTCY77_7|O06252 from Mycobacterium tuberculosis (284 aa), FASTA scores: opt: 557, E(): 2.1e-29, (35.8% identity in 282 aa overlap); MLCB2492_12|O32991 HYPOTHETICAL 10.7 kDa PROTEIN from Mycobacterium leprae (95 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215227.1" /db_xref="GI:15607853" /db_xref="GeneID:888405" /translation="MAGSDPPTGGPASQAGSDAGASPEHKHMSRRKHLVLDVCIILGV LIAYVFSLLGYDWLAHTPGPLPQPDVGTTDDTVVLIRFEELHTVANRLDVKVLVLPDD SMIDHRLQVLTTDTSVRLYPENELGDLQYPVGKLPAQVATTIEAHGNPGAWPFDTYTT DTVQADVLVGAGDNRQYVPARVEVTGSLEGWDISAVRVGESSQTSDRPDNVIITLKRA KGPLVFDLGICLVLITLPTLALFVAIQMITGRRKFQPPFGTWYAAMLFAVVPLRTILP GSPPAGAWIDRAVVIWVLIALAAAMVVYIVAWYRESD" gene 811373..811741 /gene="rplN" /locus_tag="Rv0714" /db_xref="GeneID:888411" CDS 811373..811741 /gene="rplN" /locus_tag="Rv0714" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="binds to the 23S rRNA between the centers for peptidyl transferase and GTPase" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L14" /protein_id="NP_215228.1" /db_xref="GI:15607854" /db_xref="GeneID:888411" /translation="MIQQESRLKVADNTGAKEILCIRVLGGSSRRYAGIGDVIVATVK DAIPGGNVKRGDVVKAVVVRTVKERRRPDGSYIKFDENAAVIIKPDNDPRGTRIFGPV GRELREKRFMKIISLAPEVL" misc_feature 811550..811630 /gene="rplN" /locus_tag="Rv0714" /note="PS00049 Ribosomal protein L14 signature" gene 811742..812059 /gene="rplX" /locus_tag="Rv0715" /db_xref="GeneID:888421" CDS 811742..812059 /gene="rplX" /locus_tag="Rv0715" /function="THIS PROTEIN IS FOUND IN THE RIBONUCLEOPROTEIN CORE AND IS INVOLVED IN THE EARLY ASSEMBLY OF THE 50S SUBUNIT. IT IS NOT INVOLVED IN THE FUNCTIONS OF THE MATURE 50S SUBUNIT." /experiment="experimental evidence, no additional details recorded" /note="assembly initiator protein; binds to 5' end of 23S rRNA and nucleates assembly of the 50S; surrounds polypeptide exit tunnel" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L24" /protein_id="NP_215229.1" /db_xref="GI:15607855" /db_xref="GeneID:888421" /translation="MKVHKGDTVLVISGKDKGAKGKVLQAYPDRNRVLVEGVNRIKKH TAISTTQRGARSGGIVTQEAPIHVSNVMVVDSDGKPTRIGYRVDEETGKRVRISKRNG KDI" misc_feature 811766..811810 /gene="rplX" /locus_tag="Rv0715" /note="PS01108 Ribosomal protein L24 signature" gene 812059..812622 /gene="rplE" /locus_tag="Rv0716" /db_xref="GeneID:888400" CDS 812059..812622 /gene="rplE" /locus_tag="Rv0716" /function="THIS IS ONE OF 3 PROTEINS THAT MEDIATE THE ATTACHMENT OF THE 5S RNA INTO THE LARGE RIBOSOMAL SUBUNIT." /experiment="experimental evidence, no additional details recorded" /note="part of 50S and 5S/L5/L18/L25 subcomplex; contacts 5S rRNA and P site tRNA; forms a bridge to the 30S subunit in the ribosome by binding to S13" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L5" /protein_id="NP_215230.1" /db_xref="GI:15607856" /db_xref="GeneID:888400" /translation="MTTAQKVQPRLKERYRSEIRDALRKQFGYGNVMQIPTVTKVVVN MGVGEAARDAKLINGAVNDLALITGQKPEVRRARKSIAQFKLREGMPVGVRVTLRGDR MWEFLDRLTSIALPRIRDFRGLSPKQFDGVGNYTFGLAEQAVFHEVDVDKIDRVRGMD INVVTSAATDDEGRALLRALGFPFKEN" gene 812627..812812 /gene="rpsN" /locus_tag="Rv0717" /db_xref="GeneID:888414" CDS 812627..812812 /gene="rpsN" /locus_tag="Rv0717" /function="KNOWN TO BE REQUIRED FOR THE ASSEMBLY OF 30S PARTICLES AND MAY ALSO BE RESPONSIBLE FOR DETERMINING THE CONFORMATION OF THE 16S RRNA AT THE A SITE." /note="located in the peptidyl transferase center and involved in assembly of 30S ribosome subunit; similar to what is observed with proteins L31 and L33, some proteins in this family contain CXXC motifs that are involved in zinc binding; if two copies are present in a genome, then the duplicated copy appears to have lost the zinc-binding motif and is instead regulated by zinc; the proteins in this group appear to contain the zinc-binding motif" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S14" /protein_id="YP_177747.1" /db_xref="GI:57116769" /db_xref="GeneID:888414" /translation="MAKKALVNKAAGKPRFAVRAYTRCSKCGRPRAVYRKFGLCRICL REMAHAGELPGVQKSSW" misc_feature 812693..812761 /gene="rpsN" /locus_tag="Rv0717" /note="PS00527 Ribosomal protein S14 signature" repeat_region 812835..812921 /note="87 bp Mycobacterial Interspersed Repetitive Unit, Class III" repeat_region 812922..812975 /note="54 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 812976..813374 /gene="rpsH" /locus_tag="Rv0718" /db_xref="GeneID:888424" CDS 812976..813374 /gene="rpsH" /locus_tag="Rv0718" /function="BINDS DIRECTLY TO THE CENTRAL DOMAIN OF 16S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="binds directly to 16S rRNA central domain where it helps coordinate assembly of the platform of the 30S subunit" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S8" /protein_id="NP_215232.1" /db_xref="GI:15607858" /db_xref="GeneID:888424" /translation="MTMTDPIADFLTRLRNANSAYHDEVSLPHSKLKANIAQILKNEG YISDFRTEDARVGKSLVIQLKYGPSRERSIAGLRRVSKPGLRVYAKSTNLPRVLGGLG VAIISTSSGLLTDRQAARQGVGGEVLAYVW" misc_feature 813279..813332 /gene="rpsH" /locus_tag="Rv0718" /note="PS00053 Ribosomal protein S8 signature" gene 813398..813937 /gene="rplF" /locus_tag="Rv0719" /db_xref="GeneID:888433" CDS 813398..813937 /gene="rplF" /locus_tag="Rv0719" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA AND IS LOCATED AT THE AMINOACYL-TRNA BINDING SITE OF THE PEPTIDYLTRANSFERASE CENTER." /note="ribosomal protein L6 appears to have arisen as a result of an ancient gene duplication as based on structural comparison of the Bacillus stearothermophilus protein; RNA-binding appears to be in the C-terminal domain; mutations in the L6 gene confer resistance to aminoglycoside antibiotics such as gentamicin and these occur in truncations of the C-terminal domain; it has been localized to a region between the base of the L7/L12 stalk and the central protuberance" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L6" /protein_id="NP_215233.1" /db_xref="GI:15607859" /db_xref="GeneID:888433" /translation="MSRIGKQPIPVPAGVDVTIEGQSISVKGPKGTLGLTVAEPIKVA RNDDGAIVVTRPDDERRNRSLHGLSRTLVSNLVTGVTQGYTTKMEIFGVGYRVQLKGS NLEFALGYSHPVVIEAPEGITFAVQAPTKFTVSGIDKQKVGQIAANIRRLRRPDPYKG KGVRYEGEQIRRKVGKTGK" misc_feature 813860..813886 /gene="rplF" /locus_tag="Rv0719" /note="PS00525 Ribosomal protein L6 signature 1" gene 813940..814308 /gene="rplR" /locus_tag="Rv0720" /db_xref="GeneID:888457" CDS 813940..814308 /gene="rplR" /locus_tag="Rv0720" /function="THIS IS ONE OF 3 PROTEINS THAT MEDIATE THE ATTACHMENT OF THE 5S RNA INTO THE LARGE RIBOSOMAL SUBUNIT." /note="binds 5S rRNA along with protein L5 and L25" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L18" /protein_id="NP_215234.1" /db_xref="GI:15607860" /db_xref="GeneID:888457" /translation="MAQSVSATRRISRLRRHTRLRKKLSGTAERPRLVVHRSARHIHV QLVNDLNGTTVAAASSIEADVRGVPGDKKARSVRVGQLIAERAKAAGIDTVVFDRGGY TYGGRIAALADAARENGLSF" gene 814328..814990 /gene="rpsE" /locus_tag="Rv0721" /db_xref="GeneID:888465" CDS 814328..814990 /gene="rpsE" /locus_tag="Rv0721" /function="PROTEIN S5 IS IMPORTANT IN THE ASSEMBLY AND FUNCTION OF THE 30S RIBOSOMAL SUBUNIT." /experiment="experimental evidence, no additional details recorded" /note="located at the back of the 30S subunit body where it stabilizes the conformation of the head with respect to the body; contacts S4 and S8; with S4 and S12 plays a role in translational accuracy; mutations in this gene result in spectinomycin resistance" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S5" /protein_id="NP_215235.1" /db_xref="GI:15607861" /db_xref="GeneID:888465" /translation="MAEQPAGQAGTTDNRDARGDREGRRRDSGRGSRERDGEKSNYLE RVVAINRVSKVVKGGRRFSFTALVIVGDGNGMVGVGYGKAKEVPAAIAKGVEEARKSF FRVPLIGGTITHPVQGEAAAGVVLLRPASPGTGVIAGGAARAVLECAGVHDILAKSLG SDNAINVVHATVAALKLLQRPEEVAARRGLPIEDVAPAGMLKARRKSEALAASVLPDR TI" misc_feature 814502..814600 /gene="rpsE" /locus_tag="Rv0721" /note="PS00585 Ribosomal protein S5 signature" misc_feature 814607..814654 /gene="rpsE" /locus_tag="Rv0721" /note="PS00589 PTS HPR component serine phosphorylation site signature" gene 814993..815190 /gene="rpmD" /locus_tag="Rv0722" /db_xref="GeneID:888505" CDS 814993..815190 /gene="rpmD" /locus_tag="Rv0722" /function="INVOLVED ION TRANSLATION MECHANISMS." /note="L30 binds domain II of the 23S rRNA and the 5S rRNA; similar to eukaryotic protein L7" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L30" /protein_id="NP_215236.1" /db_xref="GI:15607862" /db_xref="GeneID:888505" /translation="MSQLKITQVRSTIGARWKQRESLRTLGLRRIRHSVIREDNAATR GLIAVVRHLVEVEPAQTGGKT" gene 815190..815630 /gene="rplO" /locus_tag="Rv0723" /db_xref="GeneID:888531" CDS 815190..815630 /gene="rplO" /locus_tag="Rv0723" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA." /experiment="experimental evidence, no additional details recorded" /note="late assembly protein" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L15" /protein_id="NP_215237.1" /db_xref="GI:15607863" /db_xref="GeneID:888531" /translation="MTLKLHDLRPARGSKIARTRVGRGDGSKGKTAGRGTKGTRARKQ VPVTFEGGQMPIHMRLPKLKGFRNRFRTEYEIVNVGDINRLFPQGGAVGVDDLVAKGA VRKNALVKVLGDGKLTAKVDVSAHKFSGSARAKITAAGGSATEL" misc_feature 815259..815282 /gene="rplO" /locus_tag="Rv0723" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 815517..815609 /gene="rplO" /locus_tag="Rv0723" /note="PS00475 Ribosomal protein L15 signature" gene 815663..817534 /gene="sppA" /locus_tag="Rv0724" /db_xref="GeneID:888535" CDS 815663..817534 /gene="sppA" /locus_tag="Rv0724" /EC_number="3.4.21.-" /function="INVOLVED IN DIGESTION OF THE CLEAVED SIGNAL PEPTIDES. THIS ACTIVITY IS NECESSARY TO MAINTAIN PROPER SECRETION OF MATURE PROTEINS ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv0724, (MTCY210.43), len: 623 aa. Possible sppA, protease IV (endopeptidase IV) (EC 3.4.21.-), equivalent (but longer 23 aa) to MLCB2492_24|O33003 ENDOPEPTIDASE IV from Mycobacterium leprae (602 aa). Also similar to others e.g. NP_419743.1|NC_002696 signal peptide peptidase SppA from Caulobacter crescentus (594 aa); P08395|SPPA_ECOLI|B1766 protease IV (endopeptidase) from Escherichia coli strain K-12 (618 aa), FASTA scores: opt: 582, E(): 8.9e-27, (34.1% identity in 525 aa overlap); etc. BELONGS TO PEPTIDASE FAMILY S49." /codon_start=1 /transl_table=11 /product="protease IV SppA" /protein_id="NP_215238.1" /db_xref="GI:15607864" /db_xref="GeneID:888535" /translation="MPIFGGFCVCSRALGGRWVRWVNMVAFLPSIPVVEDLRALVGRV DTARHHGVPNGCVLEFNLRSVPPETTGFDPLTVLTGGGRPMALRDAVAAIHRAAEDPR VAGLIARVQLPPSPAGAVQELREAIAAFSAVKPSLAWAETYPGTLSYYLASAFGEVWM QPSGSVGLVGFATNATFLRDALHKAGIEAQFVARGEYKSAANLFTEDGFTDAHREAVT RMLDSLQDQVWQAVAKSRNIGVDALDELADRAPLLRDDAVTCGLIDRIGFRDQAYARM AELVGVEKGSPESSGSQTSPDEKPPRMYLARYASSARPRLTPPVPSIPGRRSKPTIAV VTLEGPIVNGRGGPQFLPLGPSSAGGDTIAAALREVAADDSVSAIVLRVDSPGGSVTA SETIWREVARARDRGKPVVASMGAVAASGGYYVSMGADAIVANPGTITGSIGVITGKL VVRDLKDRLGVGSDAVRTNANADAWSIDAPFTPDQQAHREAEADLFYSDFVERVAEGR KMTTDAVDVVARGRVWTGADALDRGLVDELGGLRTAVRRAKVLAGLDEDTEVRIVSYP GSSLWDMVRPRPSSRPAAASLPDAMGALLARSIVGIVEQVEQTLSGASVLWLGESRL" gene complement(817531..817866) /locus_tag="Rv0724A" /db_xref="GeneID:3205058" CDS complement(817531..>817866) /locus_tag="Rv0724A" /function="UNKNOWN" /note="Rv0724A, len: 111 aa. Similarity suggests that this CDS should be continuation of Rv0725c but we can find no frame-shift to account for this. Possible extended protein is very similar to other hypothetical Mycobacterium tuberculosis proteins e.g. Rv1729c|Z81360_12 (312 aa), FASTA scores: opt: 399, E(): 2e-19, (58.7% identity in 109 aa overlap); Rv0731c, Rv0726c, etc. Frame-shift could occur at nt 817866. Same sequence for strain CDC1551 and Mycobacterium bovis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177631.1" /db_xref="GI:57116770" /db_xref="GeneID:3205058" /translation="SQDRLFDNSTELSVAGSTIATELVPGIVDFDAGRVREMADSFRK HGVDIDMASLVYSGERSHVVDYLRAKGWDVEGTVRTDLFRRNGLPVPAPHDDDPLGEI IFISGRLNG" gene complement(817539..818444) /locus_tag="Rv0725c" /db_xref="GeneID:888447" CDS complement(817539..818444) /locus_tag="Rv0725c" /function="UNKNOWN" /note="Rv0725c, (MTCY210.44c), len: 301 aa. Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0726c, Rv0731c, Rv3399, etc, e.g. Y893_MYCTU|Q10552|Rv0893C hypothetical 36.1 kDa protein cy31.21c (325 aa), FASTA scores: opt: 600, E(): 3.9e-32, (43.8% identity in 219 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215239.1" /db_xref="GI:15607865" /db_xref="GeneID:888447" /translation="MPRAHDDNWDLASSVGATATMVAAGRALATKDPRGLINDPFAEP LVRAVGLDFFTKLIDGELDIATTGNLSPGRAQAMIDGIAVRTKYFDDYFRTATDGGVR QVVILAAGLDARAYRLPWPAGTVVYEIDQPQVIDFKTTTLAGIGAKPTAIRRTVYIDL RADWPAALQAAGLDSTAPTAWLAEGMLIYLPPDPRTGCSTTAPNSVLRAARSLPNLSR ALWISTQAGYEKWRIRFASTAWTSTWRRWCIPANAATSSTTCAPRAGTLRAQCGPTYS GAMVCPFPPHTTTIRSAKSSSSAVV" gene complement(818537..819640) /locus_tag="Rv0726c" /db_xref="GeneID:888552" CDS complement(818537..819640) /locus_tag="Rv0726c" /function="UNKNOWN" /note="Rv0726c, (MTCY210.45c), len: 367 aa. Conserved hypothetical protein, highly similar to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Q10552|Y893_MYCTU|Rv0893c|MT0917|MTCY31.21c (325 aa), FASTA scores: opt: 646, E(): 0, (38.3% identity in 329 aa overlap); Rv0731c|MTV041.05c (318 aa), Rv3399, etc. Also similar to proteins from Mycobacterium leprae and other organisms e.g. T35930 hypothetical protein SC9B5.10 from Streptomyces coelicolor (303 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215240.1" /db_xref="GI:15607866" /db_xref="GeneID:888552" /translation="MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLIND QFAEPLVRAVGVDVLTRLASGELTASDIDDPERPNASMVRMAEHHAVRTKFFDEFFMD ATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLAELGATPTADR RVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPEAQDRLLDNVTALSVPDSR FATESIRNFKPHHEERMRERMTILANRWRAYGFDLDMNELVYFGDRNEPASYLSDNGW LLTEIKSQDLLTANGFQPFEDEEVPLPDFFYVSARLQRKHRQYPAHRKPAPSWRHTAC PVNELSKSAAYTMTRSDAHQASTTAPPPPGLTG" gene complement(819843..820499) /gene="fucA" /locus_tag="Rv0727c" /db_xref="GeneID:888550" CDS complement(819843..820499) /gene="fucA" /locus_tag="Rv0727c" /EC_number="4.1.2.17" /function="INVOLVED IN FUCOSE METABOLISM (AT THE THIRD STEP) [CATALYTIC ACTIVITY: L-FUCULOSE 1-PHOSPHATE = GLYCERONE PHOSPHATE + (S)-LACTALDEHYDE]." /note="catalyzes the formation of glycerone phosphate and (S)-lactaldehyde from L-fuculose 1-phosphate" /codon_start=1 /transl_table=11 /product="L-fuculose-phosphate aldolase" /protein_id="NP_215241.1" /db_xref="GI:15607867" /db_xref="GeneID:888550" /translation="MNFVDAPESAVLAAAKDMLRRGLVEGTAGNISARRSDGNVVITP SSVDYAEMLLHDLVLVDAGGAVLHAKDGRSPSTELNLHLACYRAFDDIGSVIHSHPVW ATMFAVAHEPIPACIDEFAIYCGGDVRCTEYAASGTPEVGRNAVRALEGRAAALIANH GLVAVGPRPDQVLRVTALVERTAQIVWGARALGGPVPIPEDVCRNFTGVYGYLRANPL" gene complement(820496..821476) /gene="serA2" /locus_tag="Rv0728c" /db_xref="GeneID:888555" CDS complement(820496..821476) /gene="serA2" /locus_tag="Rv0728c" /EC_number="1.1.1.95" /function="INVOLVED AT THE FIRST COMMITTED STEP IN THE 'PHOSPHORYLATED' PATHWAY OF L-SERINE BIOSYNTHESIS. CATALYZES THE OXIDATION OF D-3-PHOSPHOGLYCERATE TO 3-PHOSPHOHYDROXYPYRUVATE [CATALYTIC ACTIVITY: 3-PHOSPHOGLYCERATE + NAD(+) = 3-PHOSPHOHYDROXYPYRUVATE + NADH]." /note="Rv0728c, (MTV041.02c), len: 326 aa. Possible serA2, D-3-phosphoglycerate dehydrogenase (EC 1.1.1.95), similar to others e.g. AF0278|AF027868_5|YoaD D-3-phosphoglycerate dehydrogenase from Bacillus subtilis (344 aa), FASTA scores: opt: 594, E(): 3.1e-31, (35.9% identity in 309 aa overlap); etc. Also similar to Rv2996c|MTV012.10|SERA1 D-3-phosphoglycerate dehydrogenase from Mycobacterium tuberculosis (528 aa). TBparse score is 0.882." /codon_start=1 /transl_table=11 /product="D-3-phosphoglycerate dehydrogenase" /protein_id="NP_215242.1" /db_xref="GI:15607868" /db_xref="GeneID:888555" /translation="MTPRPRALVTAPLRGPGFAQLRRLADVVYDPWIDQRPLRIYSAE QLADRITAVAADVLVVESDSVGGPVFERGLRVVAATRGDPSNVDIPGATAAGIPVLHT PARNADAVAEMTVALLLAVARHLIPADADVRSGNIFRDGTIPYQRFRGAEIAGLTAGL VGLGAVGRAVRWRLSGLGLRVIAHDPYRDDAGHSLDELLAEADIVSMHAAVTDDTIGM IGAQQFAAMRDGAVFLNTARSQLRDTDALVDALRGGKLAAAGLDHFTGEWLPTDHPLV SMPNVVLTPHIGGATWNTEARQARMVADDLGALLSGNRPAHVVNPEVLGS" gene 821507..822853 /gene="xylB" /locus_tag="Rv0729" /db_xref="GeneID:888548" CDS 821507..822853 /gene="xylB" /locus_tag="Rv0729" /EC_number="2.7.1.17" /function="PHOSPHORYLATES D-XYLULOSE [CATALYTIC ACTIVITY: ATP + D-XYLULOSE = ADP + D-XYLULOSE 5-PHOSPHATE]." /note="Rv0729, (MTV041.03), len: 448 aa. Possible xylB, D-xylulose-kinase (xylulokinase) (EC 2.7.1.17). C-terminus highly similar to AAD09880.1|U77912 unknown protein from Mycobacterium bovis (102 aa); and N-terminus highly similar to T45387|Z98756|MLCB2492_25 hypothetical protein from Mycobacterium leprae (110 aa), FASTA scores: opt: 427, E(): 1.1e-19, (60.9% identity in 110 aa overlap). Also similar to xylA/xylB genes from various bacterial species e.g. AAC26499.1|AF045245 D-xylulose-kinase from Klebsiella pneumoniae (487 aa); NP_418021.1|NC_000913 xylulokinase from Escherichia coli strain K12 (484 aa), FASTA scores: opt: 260, E(): 7.5e-09, (25.9% identity in 478 aa overlap); etc. Also similar to Rv3696c|glpK PROBABLE GLYCEROL KINASE (EC 2.7.1.30) from Mycobacterium tuberculosis (517 aa). BELONGS TO THE FUCOKINASE / GLUCONOKINASE / GLYCEROKINASE / XYLULOKINASE FAMILY. TBparse score is 0.895." /codon_start=1 /transl_table=11 /product="D-xylulose kinase XylB" /protein_id="NP_215243.1" /db_xref="GI:15607869" /db_xref="GeneID:888548" /translation="MSRDDVTIGIDIGTTAVKAVAADDNGRVTARVRIGHQLAVPAPD RLEHDADEAWRRGPLAALDRLVGPDTRALAVAAMVPSLTAVDPAGRPITPGLLYGDAR GRVPNASVARAQSVPSVGETAEFLRWTAGQALDASGYWPAPAVANYALSGEAVIDYAT AVTTLPLFDGTGWNATACADCGVTVDRMPRVETFGVGVGQVRGTGAVLAVGAVDALCE QIVAGADRDGDVLVLCGATLIVWTTISAARQVPGLWTIPHTAPGKSQIGGASNAGGLF LNWVDRVIGPGDPALADPRRVPVWLPYIRGERTPFHEPDRRAVLDGVDLSQDAASVRR AAYEASGFVVRQLIELSGAPVARIVAAGGGTRIQPWMQAIADATGRPVEVSRVAEGAA LGAAFLGRLAAGLESSIADAARWASTDRIVEPSADWAGPTKERYRRFLALSGSKLA" gene 822866..823594 /locus_tag="Rv0730" /db_xref="GeneID:888558" CDS 822866..823594 /locus_tag="Rv0730" /function="UNKNOWN" /note="Rv0730, (MTV041.04), len: 242 aa. Conserved hypothetical protein, only equivalent to Z98756|MLCB2492_26 HYPOTHETICAL PROTEIN from Mycobacterium leprae (227 aa), FASTA scores: opt: 1180, E(): 0, (83.5% identity in 218 aa overlap). TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215244.1" /db_xref="GI:15607870" /db_xref="GeneID:888558" /translation="MHGARTGVSFYAYAMTDHDQTAARREIADALLAALERRHEVADA IVEAANKAAAVEAIVNLLGTSHLAAEAVMSMSFDQLTQDARTKIIAELDDLNKQLSFT VKERPASSGEGLELRPFSPDEDRDIFARRTEEMGAAGDGSGGPAGSVDDEIRAAQKRV DDEEAAWFVAVDSGVKVGMVFGELVHGEVDVRIWIHPDHRKKGYGTAALRKSRSEMAW AFPAVPMVARAPAAQPAQPGSAGR" gene complement(823683..824639) /locus_tag="Rv0731c" /db_xref="GeneID:888556" CDS complement(823683..824639) /locus_tag="Rv0731c" /function="UNKNOWN" /note="Rv0731c, (MTV041.05c), len: 318 aa. Conserved hypothetical protein, highly similar to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0726c|MTCY210.45c (367 aa), FASTA score: (60.9% identity in 317 aa overlap); Rv3399, Rv1729c, etc. TBparse score is 0.880." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215245.1" /db_xref="GI:15607871" /db_xref="GeneID:888556" /translation="MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVND QFAEPLVRAVGVDFFVRMASGELDPDELAEDEANGLRRFADAMAIRTHYFDNFFLDAT RAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGLGAAPTTDRRT VAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQDRLLDQITAQSVPGSQFA TEVLRDINRLNEEELRGRMRRLAERFRRHGLDLDMSGLVYFGDRTDARTYLADHGWRT ASASTTDLLAEHGLPPIDGDDAPFGEVIYVSAELKQKHQDTR" gene 824800..826125 /gene="secY" /locus_tag="Rv0732" /db_xref="GeneID:888559" CDS 824800..826125 /gene="secY" /locus_tag="Rv0732" /function="ESSENTIAL FOR PROTEIN EXPORT. INTERACTS WITH SECA|Rv3240c AND SECE|Rv0638 TO ALLOW THE TRANSLOCATION OF PROTEINS ACROSS THE PLASMA MEMBRANE, BY FORMING PART OF A CHANNEL." /note="forms heterotrimeric complex in the membrane; in bacteria the complex consists of SecY which forms the channel pore and SecE and SecG; the SecG subunit is not essential; in bacteria translocation is driven via the SecA ATPase" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecY" /protein_id="NP_215246.1" /db_xref="GI:15607872" /db_xref="GeneID:888559" /translation="MLSAFISSLRTVDLRRKILFTLGIVILYRVGAALPSPGVNFPNV QQCIKEASAGEAGQIYSLINLFSGGALLKLTVFAVGVMPYITASIIVQLLTVVIPRFE ELRKEGQAGQSKMTQYTRYLAIALAILQATSIVALAANGGLLQGCSLDIIADQSIFTL VVIVLVMTGGAALVMWMGELITERGIGNGMSLLIFVGIAARIPAEGQSILESRGGVVF TAVCAAALIIIVGVVFVEQGQRRIPVQYAKRMVGRRMYGGTSTYLPLKVNQAGVIPVI FASSLIYIPHLITQLIRSGSGVVGNSWWDKFVGTYLSDPSNLVYIGIYFGLIIFFTYF YVSITFNPDERADEMKKFGGFIPGIRPGRPTADYLRYVLSRITLPGSIYLGVIAVLPN LFLQIGAGGTVQNLPFGGTAVLIMIGVGLDTVKQIESQLMQRNYEGFLK" misc_feature 825022..825081 /gene="secY" /locus_tag="Rv0732" /note="PS00755 Protein secY signature 1" misc_feature 825325..825381 /gene="secY" /locus_tag="Rv0732" /note="PS00756 Protein secY signature 2" gene 826122..826667 /gene="adk" /locus_tag="Rv0733" /db_xref="GeneID:888567" CDS 826122..826667 /gene="adk" /locus_tag="Rv0733" /EC_number="2.7.4.3" /function="THIS SMALL UBIQUITOUS ENZYME IS ESSENTIAL FOR MAINTENANCE AND CELL GROWTH [CATALYTIC ACTIVITY: ATP + AMP = ADP + ADP]." /experiment="experimental evidence, no additional details recorded" /note="essential enzyme that recycles AMP in active cells; converts ATP and AMP to two molecules of ADP" /codon_start=1 /transl_table=11 /product="adenylate kinase" /protein_id="NP_215247.1" /db_xref="GI:15607873" /db_xref="GeneID:888567" /translation="MRVLLLGPPGAGKGTQAVKLAEKLGIPQISTGELFRRNIEEGTK LGVEAKRYLDAGDLVPSDLTNELVDDRLNNPDAANGFILDGYPRSVEQAKALHEMLER RGTDIDAVLEFRVSEEVLLERLKGRGRADDTDDVILNRMKVYRDETAPLLEYYRDQLK TVDAVGTMDEVFARALRALGK" misc_feature 826362..826397 /gene="adk" /locus_tag="Rv0733" /note="PS00113 Adenylate kinase signature" gene 826670..827470 /gene="mapA" /locus_tag="Rv0734" /db_xref="GeneID:888564" CDS 826670..827470 /gene="mapA" /locus_tag="Rv0734" /EC_number="3.4.11.18" /function="REMOVES THE AMINO-TERMINAL METHIONINE FROM NASCENT PROTEINS [CATALYTIC ACTIVITY: L-METHIONYLPEPTIDE + H2O = L-METHIONINE + PEPTIDE]." /note="catalyzes the removal of N-terminal amino acids from peptides and arylamides; generally Co(II) however activity has been shown for some methionine aminopeptidases with Zn, Fe, or Mn" /codon_start=1 /transl_table=11 /product="methionine aminopeptidase" /protein_id="YP_177748.1" /db_xref="GI:57116771" /db_xref="GeneID:888564" /translation="MRPLARLRGRRVVPQRSAGELDAMAAAGAVVAAALRAIRAAAAP GTSSLSLDEIAESVIRESGATPSFLGYHGYPASICASINDRVVHGIPSTAEVLAPGDL VSIDCGAVLDGWHGDAAITFGVGALSDADEALSEATRESLQAGIAAMVVGNRLTDVAH AIETGTRAAELRYGRSFGIVAGYGGHGIGRQMHMDPFLPNEGAPGRGPLLAAGSVLAI EPMLTLGTTKTVVLDDKWTVTTADGSRAAHWEHTVAVTDDGPRILTLG" gene 827543..828076 /gene="sigL" /locus_tag="Rv0735" /db_xref="GeneID:888609" CDS 827543..828076 /gene="sigL" /locus_tag="Rv0735" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED." /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription; in M. tuberculosis this protein regulates polyketide synthases and secreted or membrane proteins" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigL" /protein_id="NP_215249.1" /db_xref="GI:15607875" /db_xref="GeneID:888609" /translation="MARVSGAAAAEAALMRALYDEHAAVLWRYALRLTGDAAQAEDVV QETLLRAWQHPEVIGDTARPARAWLFTVARNMIIDERRSARFRNVVGSTDQSGTPEQS TPDEVNAALDRLLIADALAQLSAEHRAVIQRSYYRGWSTAQIATDLGIAEGTVKSRLH YAVRALRLTLQELGVTR" misc_feature 827660..827698 /gene="sigL" /locus_tag="Rv0735" /note="PS01063 Sigma-70 factors ECF subfamily signature" gene 828140..828892 /locus_tag="Rv0736" /db_xref="GeneID:888611" CDS 828140..828892 /locus_tag="Rv0736" /function="UNKNOWN" /note="Rv0736, (MTV041.10), len: 250 aa. Probable conserved membrane protein, showing weak similarity with AL133469|SCM10_32 putative membrane protein from Streptomyces coelicolor (216 aa), FASTA scores: opt: 180, E(): 0.00018, (34.3% identity in 216 aa overlap). TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215250.1" /db_xref="GI:15607876" /db_xref="GeneID:888611" /translation="MTMPLRGLGPPDDTGVREVSTGDDHHYAMWDAAYVLGALSAADR REFEAHLAGCPECRGAVTELCGVPALLSQLDRDEVAAISESAPTVVASGLSPELLPSL LAAVHRRRRRTRLITWVASSAAAAVLAIGVLVGVQGHSAAPQRAAVSALPMAQVGTQL LASTVSISGEPWGTFINLRCVCLAPPYASHDTLAMVVVGRDGSQTRLATWLAEPGHTA TPAGSISTPVDQIAAVQVVAADTGQVLLQRSL" gene 829207..829704 /locus_tag="Rv0737" /db_xref="GeneID:888619" CDS 829207..829704 /locus_tag="Rv0737" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0737, (MTV041.11), len: 165 aa. Possible transcriptional regulator, similar to others e.g. BAB69161.1|AB070937 regulator protein from Streptomyces avermitilis (169 aa); NP_419731.1|NC_002696 transcriptional regulator MarR family from Caulobacter crescentus (148 aa) (homology only at C-terminus); etc. Also shows weak similarity to AB0014|AB001488_14 hypothetical protein from Bacillus subtilis (164 aa), FASTA scores: opt: 163, E(): 9.3e-05, (32.8% identity in 116 aa overlap), which is similar to slyY gene of S. typhimurium required for survival in macrophage. Contains possible helix-turn helix motif from aa 73-94 (Score 1138, +3.06 SD). TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215251.1" /db_xref="GI:15607877" /db_xref="GeneID:888619" /translation="MASDNRDPIAAARANWERSGWGDVSLGMVAVTSVMRAHQILLAR VETALRPYDLSFSRFELLRLLAFSRIGALPITKASDRLQVHVTSVTHAIRRLEADGLV RRVPHPTDGRTTLVQITELGRSTVEDATVTLNEQVFANVGMGAEESQALVSAVETLRR NAGDF" gene 830062..830610 /locus_tag="Rv0738" /db_xref="GeneID:888620" CDS 830062..830610 /locus_tag="Rv0738" /function="UNKNOWN" /note="Rv0738, (MTV041.12), len: 182 aa. Conserved hypothetical protein, showing weak similarity with hypothetical proteins from Mycobacterium tuberculosis: Rv1727|MTCY04C12.12 (189 aa); MTY13D12_7|Z80343 hypothetical protein from Mycobacterium tuberculosis (194 aa), FASTA scores: opt: 172, E(): 0.0004, (24.2% identity in 178 aa overlap); and C-terminus of Rv0576. TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215252.1" /db_xref="GI:15607878" /db_xref="GeneID:888620" /translation="MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHV VGGNEQVGRWAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPG QVFIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADE KPCPRERPPADQLAAFLGRTVR" gene 830855..831661 /locus_tag="Rv0739" /db_xref="GeneID:888622" CDS 830855..831661 /locus_tag="Rv0739" /function="UNKNOWN" /note="Rv0739, (MTV041.13), len: 268 aa. Conserved hypothetical protein, showing some similarity to Mycobacterium tuberculosis proteins Rv0026 (448 aa), FASTA score: (37.6% identity in 101 aa overlap)and Rv0025 (120 aa), FASTA score: (32.4% identity in 142 aa overlap). TBparse score is 0.942." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215253.1" /db_xref="GI:15607879" /db_xref="GeneID:888622" /translation="MVLTRRAREVALTQHIGVSAETDRAVVPKLRQAYDSLVCGRRRL GAIGAEIENAVAHQRALGLDTPAGARNFSRFLATKAHDITRVLAATAAESQAGAARLR SLASSYQAVGFGPKPQEPPPDPVPFPPYQPKVWAACRARGQDPDKVVRTFHHAPMSAR FRSLPAGDSVLYCGNDKYGLLHIQAKHGRQWHDIADARWPSAGNWRYLADYAIGATLA YPERVEYNQDNDTFAVYRRMSLPDGRYVFTTRVIISARDGKIITAFPQTT" gene 831776..832303 /locus_tag="Rv0740" /db_xref="GeneID:888638" CDS 831776..832303 /locus_tag="Rv0740" /function="UNKNOWN" /note="Rv0740, (MTV041.14), len: 175 aa. Conserved hypothetical protein; C-terminus (possibly part of truncated IS1557) shows nearly perfect identity to Rv0750|MTV041_24 (81 aa), FASTA score: (92.6% identity in 81 aa overlap). Also shows weak similarity to MTV007_5 hypothetical protein from Mycobacterium tuberculosis (313 aa), FASTA score: (34.5% identity in 110 aa overlap); and MLCL536_27 hypothetical protein from Mycobacterium leprae (315 aa), FASTA score: (34.5% identity in 84 aa overlap). TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215254.1" /db_xref="GI:15607880" /db_xref="GeneID:888638" /translation="MLPKNTRPTSETAEEFWDNSLWCSWGDRETGYTRTVTVSICQVA DGEREAEGVRDMMRLECPAGLDLRTPNPEAYEITGQRPGEFVFVLGYLGHVRAIVGNC YIEIMPMGTRVELSKLADVALDIGRSVGCSAYENDFTLPDIPTQWRNQPLGWYTQGLA PYLPGLSDPKDAAEG" repeat_region 832352..832868 /note="IS1557'-1, len: 517 bp. Region similar to Insertion sequence IS1557 on MTCY373- (IS1557- 1st copy)." /mobile_element="insertion sequence:IS1557'-1" gene 832534..832848 /locus_tag="Rv0741" /db_xref="GeneID:888644" CDS 832534..832848 /locus_tag="Rv0741" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1557." /note="Rv0741, (MTV041.15), len: 104 aa. Probable truncated transposase for IS1557, showing similarity to transposases and IS elements e.g. U63997|EFU63997_1 insertion sequence from Enterococcus faecium (424 aa), FASTA score: (31.0% identity in 87 aa overlap). Very high similarity with the C-terminal part of Z73419|MTCY373_3 2 IS1557 from Mycobacterium tuberculosis (444 aa), FASTA score: (86.5% identity in 104 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215255.1" /db_xref="GI:15607881" /db_xref="GeneID:888644" /translation="MFSVKGEEGKQALDRWISWARRCRIPVFVELAGGIVRHRQAIDA ALDHGLWQGLIESTNTKIRLLTRIAFGFRSPEALIALAMLALGGRRPALPGRTKHPRI SQ" gene 832981..833508 /gene="PE_PGRS8" /locus_tag="Rv0742" /db_xref="GeneID:888645" CDS 832981..833508 /gene="PE_PGRS8" /locus_tag="Rv0742" /function="UNKNOWN" /note="Rv0742, (MTV041.16), len: 175 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to many Mycobacterium tuberculosis PGRS-type proteins e.g. Z78020|MTCY1A11_25 (498 aa), FASTA scores: opt: 766, E(): 6.1e-25, (73.6% identity in 178 aa overlap). Similarity suggests ORF starts with ATA start codon. TBparse score is 0.846." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177749.1" /db_xref="GI:57116772" /db_xref="GeneID:888645" /translation="MSFVIAAPEAIAAAATDLASIGSTIGAANAAAAANTTAVLAAGA DQVSVAIAAAFGAHGQAYQALSAQAATFHIQFVQALTAGAGSYAAAEAASAASITSPL LDAINAPFLAALGRPLIGNGADGAPGTGAAGGAGGLLFGNGGAGGSGAPGGAGGLLFG NGGAGGPGASGGALG" gene complement(833886..834443) /locus_tag="Rv0743c" /db_xref="GeneID:888641" CDS complement(833886..834443) /locus_tag="Rv0743c" /function="UNKNOWN" /note="Rv0743c, (MTV041.17c), len: 185 aa. Hypothetical unknown protein. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215257.1" /db_xref="GI:15607883" /db_xref="GeneID:888641" /translation="MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQAT ASQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLV SWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLP EETDPRIGQRIAAWLNYYGAGNHSS" gene complement(834440..834946) /locus_tag="Rv0744c" /db_xref="GeneID:888648" CDS complement(834440..834946) /locus_tag="Rv0744c" /function="THOUGHT TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0744c, (MTV041.18c), len: 168 aa. Possible transcriptional regulator, showing weak similarity with O86661|SC4A2.05 PUTATIVE TWO-COMPONENT SENSOR from Streptomyces coelicolor (436 aa), FASTA scores: opt: 117, E(): 0.88, (37.25% identity in 94 aa overlap); and some putative excisionases or transposases. Also weakly similar to P71902|YN10_MYCTU|Rv2310|MT2372|MTCY3G12.24c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (114 aa); and Q11144|Y477_MYCTU|Rv0477|MT0495|MTCY20G9.03 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (148 aa). Equivalent to AAK45006 from Mycobacterium tuberculosis strain CDC1551 (179 aa) but shorter 11 aa. Contains probable helix-turn helix motif from aa 5-26 (Score 1350, +3.78 SD). TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215258.1" /db_xref="GI:15607884" /db_xref="GeneID:888648" /translation="METLLKTSEAAQILGVSRQHVVNMCDRGEMVCVHVGSHRRVPSS EVERVTSRRLTREEERSLWLHRALLSPLLTEPDTVVSAARENLRRWSGMHRRDGMAGW YFTKWQRVLNDGLDAVMHVLTSPSEDAREMRQNSPFAGILPEATRVAVLRSFKDHWDR EHERAMTE" gene 835154..835681 /locus_tag="Rv0745" /db_xref="GeneID:888595" CDS 835154..835681 /locus_tag="Rv0745" /function="UNKNOWN" /note="Rv0745, (MTV041.19), len: 175 aa. Conserved hypothetical protein; shows high similarity to a 50 aa region of Rv3649|Z95436|MTY15C10_3 CONSERVED HYPOTHETICAL PROTEIN, similar to ATP-dependent helicases, from Mycobacterium tuberculosis (771 aa), FASTA scores: opt: 225, E(): 7e-06, (70.0% identity in 50 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215259.1" /db_xref="GI:15607885" /db_xref="GeneID:888595" /translation="MGPPHRSRPPLPSPGPTCQVLPTTAVIHTVTAEALGRIGIDAPR IPGSLDVAAHAAIGLLPLVAGCDRRHRRPVRGARAGRAAQVSLCMTAIRVEPVSSNAV CTGPAAQVGDQSRSPQRDYAHQALQPDVPRRRARRHRPRRCSAKTGSSSSTMRCTCHQ NQCLWSSGVSWALAR" gene 835701..838052 /gene="PE_PGRS9" /locus_tag="Rv0746" /db_xref="GeneID:888664" CDS 835701..838052 /gene="PE_PGRS9" /locus_tag="Rv0746" /function="UNKNOWN" /note="Rv0746, (MTV041.20), len: 783 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to part of MTCY28.25c|Rv1759c|Z95890 antigen wag22 from M. tuberculosis (914 aa), FASTA scores: opt: 2429, E(): 0, (56.9% identity in 873 aa overlap). Also similar to other PE-PGRS FAMILY PROTEINS e.g. AL0212|MTV008_46 FASTA score: (48.8% identity in 887 aa overlap); etc. TBparse score is 0.860." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177750.1" /db_xref="GI:57116773" /db_xref="GeneID:888664" /translation="MSFVLAMPEVLGSAATDLAALGSVLGAADAAAAATTTGIVAAAQ DEVSAAIAALFSAHGRAYQVASAQAAAVHAQFVEALSAGAGAYASAEAAGAAVLANPA QSVQQDLLAAVNAQSVALTGRPLIGNGANGAPGTGANGAPGGWLLGNGGAGGSAAAGS GLPGGAGGAAGLFGTGGAGGAGGSSTVGDGEAGGAGGSGGWLLGTGGVGGVGGLGAGA GGAGGVGGAGGLLGAGGHGGAGGLGAVTGGVGGTGGAGGLLAGLLAGPGGAGGTGGRG FLNNGGVGGAGGNAGLLFGAGGTGGSGGAGLGGDGGAGGAGGNTGVLFGNAGSGGTGG FGDTDGGAGGAGGDAGWLGSGGVGGAGGFGETGDGGVGGAGGKAGLLIGNGGAGGAGG QGAVTGGTGGAGGDGVLIGNGGNAGIGGTGPTAGDTGAGGISGLLLGADGFNTPASAS PLHTLKQQALAAINAPTQTLTGRPLIGNGTPGAVGSGATGAPGGWLLGDGGAGGSGAA GSGAPGGAGGAAGLWGTGGAGGAGGSSAGGGGAGGAGGAGGWLLGDGGAGGIGGASTV LGGTGGGGGVGGLWGAGGAGGAGGTGLVGGDGGAGGAGGTGGLLAGLIGAGGGHGGTG GLSTNGDGGVGGAGGNAGMLAGPGGAGGAGGDGENLDTGGDGGAGGSAGLLFGSGGAG GAGGFGFLGGDGGAGGNAGLLLSSGGAGGFGGFGTAGGVGGAGGNAGWLGFGGAGGVG GSAGLIGTGGNGGNGGTGANAGSPGTGGAGGLLLGQNGLNGLP" gene 838451..840856 /gene="PE_PGRS10" /locus_tag="Rv0747" /db_xref="GeneID:888662" CDS 838451..840856 /gene="PE_PGRS10" /locus_tag="Rv0747" /function="UNKNOWN" /note="Rv0747, (MTV041.21), len: 801 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to part of MTCY28.25c|Rv1759c|Z95890 antigen wag22 from M. tuberculosis (914 aa), FASTA scores: opt: 2772, E(): 0, (60.9% identity in 941 aa overlap). Also similar to other PE-PGRS FAMILY PROTEINS e.g. Z95844|MTCY493_2 FASTA score: (50.2% identity in 815 aa overlap). Contains PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177751.1" /db_xref="GI:57116774" /db_xref="GeneID:888662" /translation="MSWVMVSPELVVAAAADLAGIGSAISSANAAAAVNTTGLLTAGA DEVSTAIAALFGAQGQAYQAASAQAAAFYAQFVQALSAGGGAYAAAEAAAVSPLLAPI NAQFVAATGRPLIGNGANGAPGTGANGGPGGWLIGNGGAGGSGAPGAGAGGNGGAGGL FGSGGAGGASTDVAGGAGGAGGAGGNAGMLFGAAGVGGVGGFSNGGATGGAGGAGGAG GLFGAGRERGSGGSGNLTGGAGGAGGNAGTLATGDGGAGGTGGASRSGGFGGAGGAGG DAGMFFGSGGSGGAGGISKSVGDSAAGGAGGAPGLIGNGGNGGNGGASTGGGDGGPGG AGGTGVLIGNGGNGGSGGTGATLGKAGIGGTGGVLLGLDGFTAPASTSPLHTLQQDVI NMVNDPFQTLTGRPLIGNGANGTPGTGADGGAGGWLFGNGGNGGQGTIGGVNGGAGGA GGAGGILFGTGGTGGSGGPGATGLGGIGGAGGAALLFGSGGAGGSGGAGAVGGNGGAG GNAGALLGAAGAGGAGGAGAVGGNGGAGGNGGLFANGGAGGPGGFGSPAGAGGIGGAG GNGGLFGAGGTGGAGGGSTLAGGAGGAGGNGGLFGAGGTGGAGSHSTAAGVSGGAGGA GGDAGLLSLGASGGAGGSGGSSLTAAGVVGGIGGAGGLLFGSGGAGGSGGFSNSGNGG AGGAGGDAGLLVGSGGAGGAGASATGAATGGDGGAGGKSGAFGLGGDGGAGGATGLSG AFHIGGKGGVGGSAVLIGNGGNGGNGGNSGNAGKSGGAPGPSGAGGAGGLLLGENGLN GLM" misc_feature 840371..840418 /gene="PE_PGRS10" /locus_tag="Rv0747" /note="PS00012 Phosphopantetheine attachment site" gene 840947..841204 /locus_tag="Rv0748" /db_xref="GeneID:888682" CDS 840947..841204 /locus_tag="Rv0748" /function="UNKNOWN" /note="Rv0748, (MTV041.22), len: 85 aa. Conserved hypothetical protein, N-terminus similar to N-terminal region of NP_436939.1|NC_003078 HYPOTHETICAL PROTEIN from Sinorhizobium meliloti (75 aa). Also similar to Mycobacterium tuberculosis proteins Rv2871 CONSERVED HYPOTHETICAL PROTEIN (75 aa); Rv1241, Rv2132, Rv3321c, etc. TBparse score is 0.875." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215262.1" /db_xref="GI:15607888" /db_xref="GeneID:888682" /translation="MRTTVSISDEILAAAKRRARERGQSLGAVIEDALRREFAAAHVG GARPTVPVFDGGTGPRRGIDLTSNRALSEVLDEGLELNSRK" gene 841228..841656 /locus_tag="Rv0749" /db_xref="GeneID:888681" CDS 841228..841656 /locus_tag="Rv0749" /function="UNKNOWN" /note="Rv0749, (MTV041.23), len: 142 aa. Conserved hypothetical protein, similar to other hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0749, Rv0277c, Rv2530c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215263.1" /db_xref="GI:15607889" /db_xref="GeneID:888681" /translation="MFLLDANVLLAAHRGDHPNHRTVRPWFDRLLAADDPFTVPNLVW ASFLRLATNRRIFEIPSPRAEAFAFVEAVTAQPHHLPTNPGPRHLMLLRKLCDEADAS GDLIPDAVLAAIAVGHHCAVVSLDRDFARFASVRHIRPPL" gene complement(841737..841874) /locus_tag="Rv0749A" /db_xref="GeneID:3205052" CDS complement(841737..841874) /locus_tag="Rv0749A" /function="UNKNOWN" /note="Rv0749A, len: 45 aa. Conserved hypothetical protein (probably gene fragment), similar to part (aa 250-292) of Rv2807|Z81331_12 from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 238, E(): 1.9e-13, (79.07% identity in 43 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177632.1" /db_xref="GI:57116775" /db_xref="GeneID:3205052" /translation="MVRKHAFHWRYDSTEELELLNQLWQLVSLRLNFFTPTKKALGFR P" gene 842033..842278 /locus_tag="Rv0750" /db_xref="GeneID:888688" CDS 842033..842278 /locus_tag="Rv0750" /function="UNKNOWN" /note="Rv0750, (MTV041.24), len: 81 aa. Conserved hypothetical protein, showing almost perfect overlap with C-terminus of Rv0740|MTV041_14 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (175 aa), FASTA scores: (93.8% identity in 81 aa overlap). Possible duplication. TBparse score is 0.872." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215264.1" /db_xref="GI:15607890" /db_xref="GeneID:888688" /translation="MRAIVGDCVIHIMPMGTGVELSKLADLALDIGRSVGCSAYENDF TLPDIPTQWRNQPLGWYTQGLAPYLPGLSDPKDAAEG" gene complement(842347..843231) /gene="mmsB" /locus_tag="Rv0751c" /db_xref="GeneID:888658" CDS complement(842347..843231) /gene="mmsB" /locus_tag="Rv0751c" /EC_number="1.1.1.31" /function="Catalyzes the NAD-dependent, reversible oxidation of 3-hydroxbutyrate to methylmalonate [CATALYTIC ACTIVITY: 3-hydroxy-2-methylpropanoate + NAD+ = 2-methyl-3-oxopropanoate + NADH]." /note="Rv0751c, (MTV041.25c), len: 294 aa. Probable mmsB, 3-hydroxyisobutyrate dehydrogenase (EC 1.1.1.31), highly similar to others e.g. NP_102847.1|NC_002678 3-hydroxyisobutyrate dehydrogenase from Mesorhizobium loti (294 aa); NP_420167.1|NC_002696 3-hydroxyisobutyrate dehydrogenase from Caulobacter crescentus (298 aa); A32867 3-hydroxyisobutyrate dehydrogenase from Rattus norvegicus (346 aa); etc. Also similar to methylmalonate semialdehyde dehydrogenases e.g. M84911|PSE MMSRAB_3 methylmalonate semialdehyde dehydrogenase from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 786, E(): 0, (45.8% identity in 297 aa overlap). Also similar to 6-phosphogluconate dehydrogenases from Mycobacterium tuberculosis e.g. Rv1122 and Rv1844c. Contains PS00895 3-hydroxyisobutyrate dehydrogenase signature. BELONGS TO THE 3-HYDROXYISOBUTYRATE DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="3-hydroxyisobutyrate dehydrogenase MmsB" /protein_id="NP_215265.1" /db_xref="GI:15607891" /db_xref="GeneID:888658" /translation="MTTIAFLGLGNMGAPMSANLVGAGHVVRGFDPAPTAASGAAAHG VAVFRSAPEAVAEADVVITMLPTGEVVRRCYTDVLAAARPATLFIDSSTISVTDAREV HALAESHGMLQLDAPVSGGVKGAAAATLAFMVGGDESTLRRARPVLEPMAGKIIHCGA AGAGQAAKVCNNMVLAVQQIAIAEAFVLAEKLGLSAQSLFDVITGATGNCWAVHTNCP VPGPVPTSPANNDFKPGFSTALMNKDLGLAMDAVAATGATAPLGSHAADIYAKFAADH ADLDFSAVIHTLRARADA" misc_feature complement(843172..843216) /gene="mmsB" /locus_tag="Rv0751c" /note="PS00895 3-hydroxyisobutyrate dehydrogenase signature" gene complement(843242..844414) /gene="fadE9" /locus_tag="Rv0752c" /db_xref="GeneID:888684" CDS complement(843242..844414) /gene="fadE9" /locus_tag="Rv0752c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0752c, (MTV041.26c), len: 390 aa. Probable fadE9, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. NP_437985.1|NC_003078 putative acyl-CoA dehydrogenase protein from Sinorhizobium meliloti (380 aa); Z99123|BSUB0020_14 from Bacillus subtilis (379 aa), FASTA scores: opt: 853, E(): 0, (39.8% identity in 384 aa overlap); etc. Contains PS00072 Acyl-CoA dehydrogenases signature 1, and PS00073 Acyl-Co Adehydrogenases signature 2. BELONGS TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE9" /protein_id="NP_215266.1" /db_xref="GI:15607892" /db_xref="GeneID:888684" /translation="MFVLNDDERVIVETAAAFAGKRLAPHALEWDAAKHFPVDVLREA AELGMAAIYCRDDVGGSGLRRLDGVRIFEQLAIADPVTAAFLSIHNMCAWMIDSFGTD EQRKDWIPRLATMGVIASYCLTEPGAGSDAGALSTRAVRHGSGKGGDYVLDGVKQFIS GAAASDVYVVMARTGAEGPRGVSAFIVEKGTPGLSFGAPEAKMGWHAQPTAQVVLDGV RVPAEAMLGGADGEGAGFGIAMSGLNGGRLNIAACSLGGAQAAFDKAGAYVRDRQAFG GSLLDEPTVRFTLADMATGLQTSRMLLWRAASALDDDDADKVELCAMAKRYVTDTCFE VADQALQLHGGYGYLREYGLEKIVRDLRVHRILEGTNEIMRLVIGRAEAARFRATV" misc_feature complement(843335..843394) /gene="fadE9" /locus_tag="Rv0752c" /note="PS00073 Acyl-CoA dehydrogenases signature 2" misc_feature complement(844013..844051) /gene="fadE9" /locus_tag="Rv0752c" /note="PS00072 Acyl-CoA dehydrogenases signature 1" gene complement(844421..845953) /gene="mmsA" /locus_tag="Rv0753c" /db_xref="GeneID:888707" CDS complement(844421..845953) /gene="mmsA" /locus_tag="Rv0753c" /EC_number="1.2.1.27" /function="PLAYS A ROLE IN VALINE AND PYRIMIDINE METABOLISM. BINDS FATTY ACYL-CoA [CATALYTIC ACTIVITY: 2-methyl-3-oxopropanoate + CoA + NAD+ = propanoyl-CoA + CO2 + NADH]." /experiment="experimental evidence, no additional details recorded" /note="Rv0753c, (MTV041.27c), len: 510 aa. Probable mmsA, methylmalonic acid semialdehyde dehydrogenase (EC 1.2.1.27), highly similar to others e.g. NP_420115.1|NC_002696 putative methylmalonate-semialdehyde dehydrogenase from Caulobacter crescentus (499 aa); L48550|STMMSDA_1|CAB75315.1|AL139164 methylmalonic acid semialdehyde dehydrogenase from Streptomyces coelicolor (500 aa), FASTA score: (51.6% identity in 498 aa overlap); M84911|PSEMMSRAB_2|NP_252260.1|NC_002516 methylmalonate-semialdehyde dehydrogenase from Pseudomonas aeruginosa (497 aa), FASTA scores: opt: 1127, E(): 0, (47.9% identity in 507 aa overlap); etc. Note that also highly similar to malonic semialdehyde oxidative decarboxylases e.g. NP_104968.1|NC_002678 malonic semialdehyde oxidative decarboxylase from Mesorhizobium loti (498 aa); NP_384832.1|NC_003047 PUTATIVE MALONIC SEMIALDEHYDE OXIDATIVE DECARBOXYLASE PROTEIN from Sinorhizobium meliloti (498 aa); etc. Contains PS00070 Aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="methylmalonate-semialdehyde dehydrogenase" /protein_id="NP_215267.1" /db_xref="GI:15607893" /db_xref="GeneID:888707" /translation="MTTQISHFIDGQRTAGQSTRSADVFDPNTGQIQAKVPMAGKSDI DAAVASAVEAQKGWAAWNPQRRARVLMRFIELVNDTIDELAELLSREHGKTLADARGD VQRGIEVIEFCLGIPHLLKGEYTEGAGPGIDVYSLRQPLGVVAGITPFNFPAMIPLWK AGPALACGNAFVLKPSERDPSVPVRLAELFIEAGLPAGVFQVVHGDKEAVDAILHHPD IKAVGFVGSSDIAQYIYAGAAATGKRAQCFGGAKNHMIVMPDADLDQAVDALIGAGYG SAGERCMAISVAVPVGDQTAERLRARLIERINNLRVGHSLDPKADYGPLVTGAALARV RDYIGQGVAAGAELVVDGRDRASDDLTFGLPEGDANLEGGFFIGPTLFDHVAAHMSIY TDEIFGPVLCMVRARDYEEALRLPSEHEYGNGVAIFTRDGDAARDFVSRVQVGMVGVN VPIPVPVAYHTFGGWKRSGFGDLNQHGPAAIQFYTKVKTVTSRWPSGIKDGAEFVIPT MS" misc_feature complement(845096..845131) /gene="mmsA" /locus_tag="Rv0753c" /note="PS00070 Aldehyde dehydrogenases cysteine active site" gene 846159..847913 /gene="PE_PGRS11" /locus_tag="Rv0754" /db_xref="GeneID:888695" CDS 846159..847913 /gene="PE_PGRS11" /locus_tag="Rv0754" /function="UNKNOWN" /note="Rv0754, (MTV041.28), len: 584 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to others e.g. AL0212|MTV008_46 from Mycobacterium tuberculosis (1660 aa), FASTA score: (48.7% identity in 345 aa overlap); Z80225|MTCY441_4 from Mycobacterium tuberculosis (778 aa), FASTA score: (41.6% identity in 442 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177752.1" /db_xref="GI:57116776" /db_xref="GeneID:888695" /translation="MSFVIVARDALAAAAADLAQIGSAVNAGNLAAANPTTAVAAAAA DEVSAALAALFGAHAREYQAAAAQAAAYHEQFVHRLSAAATSYAVTEVTIATSLRGAL GSAPASVSDGFQAFVYGPIHATGQQWINSPVGEALAPIVNAPTNVLLGRDLIGNGVTG TAAAPNGGPGGLLFGDGGAGYTGGNGGSAGLIGNGGTGGAGFAGGVGGMGGTGGWLMG NGGMGGAGGVGGNGGAGGQALLFGNGGLGGAGGAGGVDGAIGRGGWFIGTGGMATIGG GGNGQSIVIDFVRHGQTPGNAAMLIDTAVPGPGLTALGQQQAQAIANALAAKGPYAGI FDSQLIRTQQTAAPLANLLGMAPQVLPGLNEIHAGIFEDLPQISPAGLLYLVGPIAWT LGFPIVPMLAPGSTDVNGIVFNRAFTGAVQTIYDASLANPVVAADGNITSVAYSSAFT IGVGTMMNVDNPHPLLLLTHPVPNTGAVVVQGNPEGGWTLVSWDGIPVGPASLPTALF VDVRELITAPQYAAYDIWESLFTGDPAAVINAVRDGADEVGAAVVQFPHAVADDVIDA TGHPYLSGLPIGLPSLIP" gene complement(848103..850040) /gene="PPE12" /locus_tag="Rv0755c" /db_xref="GeneID:888708" CDS complement(848103..850040) /gene="PPE12" /locus_tag="Rv0755c" /function="UNKNOWN" /note="Rv0755c, (MTV041.29), len: 645 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to others e.g. Z82098|MTCY3C7_23 from Mycobacterium tuberculosis (582 aa), FASTA scores: (56.1% identity in 636 aa overlap); Z92774|MTCY6G11_5 from Mycobacterium tuberculosis (552 aa), FASTA scores: (55.8% identity in 590 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177753.1" /db_xref="GI:57116777" /db_xref="GeneID:888708" /translation="MVGFAWLPPETNSLRMYLGAGSRPLLAAAGAWDGLAEELHAAAS SFGSVTSELAGGAWQGPASAAMANAAGPYASWLTAAGAQAELAARQARAAAGAFEEAL AGVVHPAVVQANRVRTWLLAVSNVFGQNAPAIAAMESTYEQMWAQDVAVMAGYHAASS AAAAQLASWQPALPNINLGVGNIGNLNVGNGNTGDYNLGNGNLGNANFGGGNGSAFHG QISSFNVGSGNIGNFNLGSGNGNVGIGPSSFNVGSGNIGNANVGGGNSGDNNFGFGNF GNANIGIGNAGPNMSSPAVPTPGNGNVGIGNGGNGNFGGGNTGNANIGLGNVGDGNVG FGNSGSYNFGFGNTGNNNIGIGLTGSNQIGFGGLNSGSGNIGFGNSGTGNIGFFNSGS GNFGVGNSGVTNTGVANSGNINTGFGNSGFINTGFGNALSVNTGFGNSGQANTGIGNA GDFNTGNFNGGIINTGSFNSGAFNSGSFNGGDANSGFLNSGLTNTGFANSGNINTGGF NAGNLNTGFGNTTDGLGENSGFGNAGSGNSGFNNSGRGNSGAQNVGNLQISGFANSGQ SVTGYNNSVSVTSGFGNKGTGLFSGFMSGFGNTGFLQSGFGNLEANPDNNSATSGFGN SGKQDSGGFNSIDFVSGFFHR" gene complement(850342..850527) /locus_tag="Rv0755A" /db_xref="GeneID:3205072" CDS complement(850342..850527) /locus_tag="Rv0755A" /function="COULD BE REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv0755A, len: 61 aa. Putative transposase (possibly gene fragment), similar to C-terminal part of Q9EZM2|ISMav2|AF286339_1 putative transposase from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 284, E(): 5e-13, (83.02% identity in 53 aa overlap); and to SCJ11.25c|Q9RI80 possible noncomposite transposon transposase from Streptomyces coelicolor (283 aa)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="YP_177633.1" /db_xref="GI:57116778" /db_xref="GeneID:3205072" /translation="MKELSVAEQRYQAVLAVISDGLSISQVAEKVGVSRQTLHTWLAR YEAEGLDGLRIGTGTAL" gene complement(850642..850713) /locus_tag="Rvnt09" /note="tRNA-Thr(TGT)" /db_xref="GeneID:2700448" tRNA complement(850642..850713) /locus_tag="Rvnt09" /product="tRNA-Thr" /note="codon recognized: ACA" /anticodon=(pos:850679..850681,aa:Thr) /db_xref="GeneID:2700448" gene complement(850741..851466) /locus_tag="Rv0756c" /db_xref="GeneID:888730" CDS complement(850741..851466) /locus_tag="Rv0756c" /function="UNKNOWN" /note="Rv0756c, (MTCY369.01c), len: 241 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215270.1" /db_xref="GI:15607896" /db_xref="GeneID:888730" /translation="MNLGQTLVGIATWPARAGLAAADTGLNMAGAAVDMAKQALGDAG GASGSTSMANMLGIDDTIARANRLARLLDDDMPLGRAIAPNGPMDRMLRPGGVVDLLT QPGGLLDRLTAEGGAMQRALQPGGLADQLLAEDGLIERVLSEDGLADRLLAEGGLIDK ITAKDGPLEQLADVADTLARLTPGMEALEPAIATLQDAVIALTMVVNPLSSIAERIPL PGRRPARRSSSRSVRSQRVVDSE" gene 851608..852351 /gene="phoP" /locus_tag="Rv0757" /db_xref="GeneID:888772" CDS 851608..852351 /gene="phoP" /locus_tag="Rv0757" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM. PART OF THE TWO COMPONENT REGULATORY SYSTEM PHOP/PHOQ. THIS PROTEIN IS THOUGHT TO BE A POSITIVE REGULATOR FOR THE PHOSPHATE REGULON, REQUIRED FOR INTRACELLULAR GROWTH. TRANSCRIPTION OF THIS OPERON IS POSITIVELY REGULATED BY PHOB AND PHOR|Rv0758 WHEN PHOSPHATE IS LIMITED." /experiment="experimental evidence, no additional details recorded" /note="Rv0757, (MTCY369.02), len: 247 aa. Possible phoP, two component system response phosphate regulon transcriptional regulator (see citations below), highly similar to various transcriptional regulators e.g. CAC32360.1|AL583945 putative two component system response regulator from Streptomyces coelicolor (271 aa); T45446 probable two-component response regulator from Mycobacterium leprae (253 aa); and similar to phoP proteins e.g. P13792|PHOP_BACSU alkaline phosphatase synthesis transcription regulatory protein from Bacillus subtilis (240 aa), FASTA scores: opt: 594, E(): 2.3e-33, (41.0% identity in 234 aa overlap); etc. Also highly similar to Rv3765c from Mycobacterium tuberculosis (234 aa), Rv1033c (257 aa), RV0903c|MTCY31.31c|Q10531 (236 aa), FASTA score: (45.4% identity in 229 aa overlap); MTCY10G2_16 and MTU88959_1." /codon_start=1 /transl_table=11 /product="two component system response transcriptional positive regulator PHOP" /protein_id="NP_215271.1" /db_xref="GI:15607897" /db_xref="GeneID:888772" /translation="MRKGVDLVTAGTPGENTTPEARVLVVDDEANIVELLSVSLKFQG FEVYTATNGAQALDRARETRPDAVILDVMMPGMDGFGVLRRLRADGIDAPALFLTARD SLQDKIAGLTLGGDDYVTKPFSLEEVVARLRVILRRAGKGNKEPRNVRLTFADIELDE ETHEVWKAGQPVSLSPTEFTLLRYFVINAGTVLSKPKILDHVWRYDFGGDVNVVESYV SYLRRKIDTGEKRLLHTLRGVGYVLREPR" gene 852396..853853 /gene="phoR" /locus_tag="Rv0758" /db_xref="GeneID:888775" CDS 852396..853853 /gene="phoR" /locus_tag="Rv0758" /EC_number="2.7.-.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM. THIS PROTEIN IS THOUGHT TO BE A SENSOR KINASE FOR THE PHOSPHATE REGULON. TRANSCRIPTION OF THIS OPERON IS POSITIVELY REGULATED BY PHOB|Rv0757 AND PHOR WHEN PHOSPHATE IS LIMITED." /note="Rv0758, (MTCY369.03), len: 485 aa. Possible phoR, two component system response phosphate sensor kinase membrane-associated (EC 2.7.-.-), highly similar to various sensor kinases e.g. CAC32361.1|AL583945 putative two component system histidine kinase from Streptomyces coelicolor (524 aa); NP_349365.1|NC_003030 Membrane-associated sensory histidine kinase with HAMP domain from Clostridium acetobutylicum (482 aa); and similar to phoP proteins e.g. NP_372216.1|NC_002758 alkaline phosphatase synthesis sensor protein from Staphylococcus aureus (554 aa); P23545|PHOR_BACSU alkaline phosphatase synthesis sensor from Bacillus subtilis (579 aa), FASTA scores: opt: 515, E(): 1.9e-25, (40.0% identity in 230 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. MTCY20G9.16 FASTA scores: (34.5% identity in 264 aa overlap), MTU88959_2 (509 aa), MTCY10G2_17, etc." /codon_start=1 /transl_table=11 /product="two component system response sensor kinase membrane associated PHOR" /protein_id="NP_215272.1" /db_xref="GI:15607898" /db_xref="GeneID:888775" /translation="MARHLRGRLPLRVRLVAATLILVATGLVASGIAVTSMLQHRLTS RIDRVLLEEAQIWAQITLPLAPDPYPGHNPDRPPSRFYVRVISPDGQSYTALNDNTAI PAVPANNDVGRHPTTLPSIGGSKTLWRAVSVRASDGYLTTVAIDLADVRSTVRSLVLL QVGIGSAVLVVPGVAGYAVVRRSLRPLAEFEQTAAAIGAGQLDRRVPQWHPRTEVGRL SLALNGMLAQIQRAVASAESSAEKARDSEDRMRQFITDASHELRTPLTTIRGFAELYR QGAARDVGMLLSRIESEASRMGLLVDDLLLLARLDAHRPLELCRVDLLALASDAAHDA RAMDPKRRITLEVLDGPGTPEVLGDESRLRQVLRNLVANAIQHTPESADVTVRVGTEG DDAILEVADDGPGMSQEDALRVFERFYRADSSRARASGGTGLGLSIVDSLVAAHGGAV TVTTALGEGCCFRVSLPRVSDVDQLSLTPVVPGPP" gene complement(853825..854157) /locus_tag="Rv0759c" /db_xref="GeneID:888776" CDS complement(853825..854157) /locus_tag="Rv0759c" /function="UNKNOWN" /note="Rv0759c, (MTCY369.04c), len: 110 aa. Conserved hypothetical protein, highly similar (but shorter 45 aa in N-terminus) to P49774|YHIT_MYCLE|ML2237|MLCB5.04c|U296A HYPOTHETICAL HIT-LIKE PROTEIN from Mycobacterium leprae (155 aa), FASTA scores: opt: 766, E(): 0, (78.7% identity in 150 aa overlap). Also highly similar (but N-terminus always shorter) to HIT-like proteins and protein kinase inhibitors e.g. AAF72728.1|AF265258_1|AF265258 HIT-like protein from Rhodococcus sp. (141 aa); NP_212513.1|NC_001318 protein kinase C1 inhibitor (pkcI) from Borrelia burgdorferi (149 aa) ; P94252|YHIT_BORBU|BB0379 HYPOTHETICAL HIT-LIKE PROTEIN from Borrelia burgdorferi (139 aa); NP_110768.1|NC_002689 HIT (histidine triad) family protein from Thermoplasma volcanium (158 aa); P16436|IPK1_BOVIN protein kinase C inhibitor 1 (pkci-1) from Bos taurus (Bovine) (125 aa), FASTA scores: opt: 195, E(): 5.2e-08, (33.3% identity in 111 aa overlap); etc. Also shows similarity with Rv2613c|MTCY01A10.20A CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (195 aa) and Rv1262c|MTCY50.20 HYPOTHETICAL HIT-LIKE PROTEIN (144 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215273.1" /db_xref="GI:15607899" /db_xref="GeneID:888776" /translation="MAFLTIEPMTQGHTLVVPRAEIDHWQNVDPALFGRVMSVSQLIG KAVCRAFSTQRAGMIIAGLEVPHLHIHVFPTRSLSDFGFANVDRNPSPGSLDEAQAKI RAALAQLA" gene complement(854267..854686) /locus_tag="Rv0760c" /db_xref="GeneID:888784" CDS complement(854267..854686) /locus_tag="Rv0760c" /function="UNKNOWN" /note="Rv0760c, (MTCY369.05), len: 139 aa. Conserved hypothetical protein, similar to N-terminal part of Rv2042c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (265 aa), FASTA scores: opt: 150, E(): 4.1e-05, (28.7% identity in 136 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215274.1" /db_xref="GI:15607900" /db_xref="GeneID:888784" /translation="MTQTTQSPALIASQSSWRCVQAHDREGWLALMADDVVIEDPIGK SVTNPDGSGIKGKEAVGAFFDTHIAANRLTVTCEETFPSSSPDEIAHILVLHSEFDGG FTSEVRGVFTYRVNKAGLITNMRGYWNLDMMTFGNQE" gene complement(854699..855826) /gene="adhB" /locus_tag="Rv0761c" /db_xref="GeneID:888738" CDS complement(854699..855826) /gene="adhB" /locus_tag="Rv0761c" /EC_number="1.1.1.1" /function="THOUGHT TO CATALYZE THE REVERSIBLE OXIDATION OF ETHANOL TO ACETALDEHYDE WITH THE CONCOMITANT REDUCTION OF NAD. PROBABLY ACTS ON PRIMARY OR SECONDARY ALCOHOLS OR HEMIACETALS [CATALYTIC ACTIVITY: An alcohol + NAD+ = an aldehyde or ketone + NADH]." /note="Rv0761c, (MTCY369.06c), len: 375 aa. Possible adhB, zinc-containing alcohol dehydrogenase NAD-dependent (EC 1.1.1.1), similar to others e.g. AAC15839.1|AF060871_4 hypothetical alcohol dehydrogenase from Rhodococcus rhodochrous (370 aa), FASTA scores: opt: 1234, E(): 0, (46.8% identity in 370 aa overlap); P80468|ADH2_STRCA ALCOHOL DEHYDROGENASE II from Struthio camelus (Ostrich) (379 aa); Q03505|ADH1_RABIT alcohol dehydrogenase alpha chain from Oryctolagus cuniculus (Rabbit) (374 aa), FASTA scores: opt: 872, E(): 0, (39.1% identity in 379 aa overlap); etc. Also similar to adhD alcohol dehydrogenase from Mycobacterium tuberculosis (368 aa). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="zinc-containing alcohol dehydrogenase NAD dependent ADHB" /protein_id="YP_177754.1" /db_xref="GI:57116779" /db_xref="GeneID:888738" /translation="MKTKGALIWEFNQPWSVEEIEIGDPRKDEVKIQMEAAGMCRSDH HLVTGDIPMAGFPVLGGHEGAGIVTEVGPGVDDFAPGDHVVLAFIPSCGKCPSCQAGM RNLCDLGAGLLAGESVTDGSFRIQARGQNVYPMTLLGTFSPYMVVHRSSVVKIDPSVP FEVACLVGCGVTTGYGSAVRTADVRPGDDVAIVGLGGVGMAALQGAVSAGARYVFAVE PVEWKRDQALKFGATHVYPDINAALMGIAEVTYGLMAQKVIITVGKLDGADVDSYLTI TAKGGTCVLTAIGSLVDTQVTLNLAMLTLLQKNIQGTIFGGGNPHYDIPKLLSMYKAG KLNLDDMVTTAYKLEQINDGYQDMLNGKNIRGVIRYTDDDR" misc_feature complement(855602..855646) /gene="adhB" /locus_tag="Rv0761c" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene complement(855925..856470) /locus_tag="Rv0762c" /db_xref="GeneID:888807" CDS complement(855925..856470) /locus_tag="Rv0762c" /function="UNKNOWN" /note="Rv0762c, (MTCY369.07c), len: 181 aa. Conserved hypothetical protein, showing weak similarity to D90907_77|P73575 HYPOTHETICAL 31.3KD PROTEIN from Synechocystis sp, FASTA scores: E(): 0.0012, (30.4% identity in 92 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215276.1" /db_xref="GI:15607902" /db_xref="GeneID:888807" /translation="MAGYPRDELEDVVHRWLQANRTAERRGDWTLLADFYTDDATYGW NVGPNEDVMCVGIDEIRDIALGQEMDGLQGWRYPYQRVVIDEKQGEVVGFWKQVATDA NGAEQEVYGIGGSWFRYAGGGKWNWQRDFFDFGHVSALYLELIKAGKLSPGMQKRIER AVSGNKVPGYYPLGKTPVPLW" misc_feature complement(855943..855966) /locus_tag="Rv0762c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(856473..856679) /locus_tag="Rv0763c" /db_xref="GeneID:888814" CDS complement(856473..856679) /locus_tag="Rv0763c" /function="FERREDOXINS ARE IRON-SULFUR PROTEINS THAT TRANSFER ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS. PROBABLY INVOLVED IN ELECTRON TRANSPORT FOR CYTOCHROME P-450 SYSTEM." /note="Rv0763c, (MTCY369.08c), len: 68 aa. Possible ferredoxin, similar to others and related proteins e.g. P18324|FER1_STRGO|SUAB ferredoxin 1 (fd-1) from Streptomyces griseolus (68 aa); AAK31349.1|AF350429_2|AF350429 putative ferredoxin from Nocardioides sp (63 aa); AAK16536.1|AF331043_16|AF331043 phthalate dioxygenase ferredoxin subunit from Arthrobacter keyseri (64 aa); etc. Probably involved in electron transport for cytochrome P-450 system e.g. downstream ORF Rv0764c|MTCY369.09c PROBABLE CYTOCHROME P450 51 from Mycobacterium tuberculosis (451 aa), FASTA scores: opt: 137, E(): 0.00013, (36.4% identity in 66 aa overlap). Also similar to putative ferredoxins Rv3503c and Rv1786 from Mycobacterium tuberculosis. COULD BELONG TO THE BACTERIAL TYPE FERREDOXIN FAMILY." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="NP_215277.1" /db_xref="GI:15607903" /db_xref="GeneID:888814" /translation="MGYRVEADRDLCQGHAMCELEAPEYFRVPKRGQVEILDPEPPEE ARGVIKHAVWACPTQALSIRETGE" gene complement(856682..858037) /gene="cyp51" /locus_tag="Rv0764c" /db_xref="GeneID:888819" CDS complement(856682..858037) /gene="cyp51" /locus_tag="Rv0764c" /EC_number="1.14.14.-" /function="INVOLVED IN STEROL BIOSYNTHESIS. ITS PRECISE BIOLOGICAL SUBSTRATE IS NOT KNOWN. CATALYZES C14-DEMETHYLATION OF LANOSTEROL, 24,25-DIHYDROLANOSTEROL AND OBTUSIFOLIOL WHICH IS CRITICAL FOR ERGOSTEROL BIOSYNTHESIS. IT TRANSFORMS LANOSTEROL INTO 4,4'-DIMETHYL CHOLESTA-8,14,24-TRIENE-3-BETA-OL." /experiment="experimental evidence, no additional details recorded" /note="Rv0764c, (MT0788, MTCY369.09c), len: 451 aa. cyp51, cytochrome P450 51 (sterol 14-alpha demethylase) (EC 1.14.14.-), similar to others e.g. Q16850|CP51_HUMAN CYTOCHROME P450 51 (CYPL1) (P450L1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 14-ALPHA DEMETHYLASE) from Homo sapiens (509 aa), FASTA scores: opt: 848, E(): 0, (33.9% identity in 439 aa overlap); NP_172633.1|NC_003070 putative obtusifoliol 14-alpha demethylase from Arabidopsis thaliana (488 aa); P93596|CP51_WHEAT CYTOCHROME P450 51 (CYPL1) (P450-L1A1) (OBTUSIFOLIOL 14-ALPHA DEMETHYLASE) from Triticum aestivum (453 aa); etc. Also similar to many other Mycobacterium tuberculosis cytochromes P450 e.g. Rv1394c, FASTA score: (22.5% identity in 444 aa overlap). Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 sterol 14-alpha demethylase" /protein_id="NP_215278.1" /db_xref="GI:15607904" /db_xref="GeneID:888819" /translation="MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQ LAGKQVVLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEMLHN AALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSACLIGKKFRDQ LDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNGLVALVADIMNGRIANPPT DKSDRDMLDVLIAVKAETGTPRFSADEITGMFISMMFAGHHTSSGTASWTLIELMRHR DAYAAVIDELDELYGDGRSVSFHALRQIPQLENVLKETLRLHPPLIILMRVAKGEFEV QGHRIHEGDLVAASPAISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRH RCVGAAFAIMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTG V" misc_feature complement(856850..856879) /gene="cyp51" /locus_tag="Rv0764c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(858037..858864) /locus_tag="Rv0765c" /db_xref="GeneID:888793" CDS complement(858037..858864) /locus_tag="Rv0765c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM, POSSIBLY ELECTRON TRANSFERT." /note="Rv0765c, (MTCY369.10c), len: 275 aa. Probable oxidoreductase (EC 1.-.-.-), similar others e.g. P39071|DHBA_BACSU 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase from Bacillus subtilis (261 aa), FASTA scores: opt: 385, E(): 1.8e-17, (30.6% identity in 252 aa overlap); AAF81239.1|AF263012 putative beta-ketoacyl reductase from Streptomyces griseus (274 aa); NP_436514.1|NC_003037 putative oxidoreductase from Sinorhizobium meliloti (240 aa); etc. Also similar to several other oxidoreductases from Mycobacterium tuberculosis e.g. Rv1544|MTCY48.21, FASTA score: (32.6% identity in 267 aa overlap); etc. Contains PS00061 Short-chain alcohol dehydrogenase family signature." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_215279.1" /db_xref="GI:15607905" /db_xref="GeneID:888793" /translation="MPRFEPHPARRTTVVAGASSGIGAATATELAGRGFPVALGARRM DKLAELVDKIRADGGEAVAFPLDVTDPESVKSFVAQTVEALGEVELLVSSAGDMLPGQ LHEVSTEAFAEQVQIHLVGANRLATAVLPAMVARRRGDLIFVGSDVGLRQRPHMGAYG AAKAGLAAMVTNLQMELEGTGVRASIVHPGPTLTGMGWQLSAEQVGPMLADWAKWGQA RHNYFLRPSDLARAIAFVAETPRGCVVVNMEIQPEAPLRDAPAHRQKLVLGEEGMPG" misc_feature complement(858343..858429) /locus_tag="Rv0765c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(858864..860072) /gene="cyp123" /locus_tag="Rv0766c" /db_xref="GeneID:888834" CDS complement(858864..860072) /gene="cyp123" /locus_tag="Rv0766c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /experiment="experimental evidence, no additional details recorded" /note="Rv0766c, (MT0790, MTCY369.11c), len: 402 aa. Probable cyp123, cytochrome P-450 (EC 1.14.-.-), similar to others e.g. P33271|CPXK_SACER cytochrome P-450 107B1 from Saccharopolyspora erythraea (405 aa), FASTA scores: opt: 770, E(): 0, (36.9% identity in 406 aa overlap); T36526 probable cytochrome P450 hydroxylase from Streptomyces coelicolor (411 aa); P27632|CPXM_BACSU CYTOCHROME P450 109 from Bacillus subtilis (405 aa); etc. Also similar to several other cytochromes P-450 from Mycobacterium tuberculosis e.g. Rv1256c|MTCY50.26 (405 aa), FASTA score: (35.2% identity in 389 aa overlap); etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 123" /protein_id="NP_215280.1" /db_xref="GI:15607906" /db_xref="GeneID:888834" /translation="MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNF WAVSRHHDVLQGFRDSTALSNAYGVSLDPSSRTSEAYRVMSMLAMDDPAHLRMRTLVS KGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVISELIGVPDTDR ARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIAEFRRRPANNLTSALLAAE LDGDRLSDQEIMAFLFLMVIAGNETTTKLLANAVYWAAHHPGQLARVFADHSRIPMWV EETLRYDTSSQILARTVAHDLTLYDTTIPEGEVLLLLPGSANRDDRVFDDPDDYRIGR EIGCKLVSFGSGAHFCLGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGF AHLPISVQAR" misc_feature complement(859017..859046) /gene="cyp123" /locus_tag="Rv0766c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(860069..860710) /locus_tag="Rv0767c" /db_xref="GeneID:888833" CDS complement(860069..860710) /locus_tag="Rv0767c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0767c, (MTCY369.12c), len: 213 aa. Conserved hypothetical protein, showing weak similarity with AL133220|SCC75A_26 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (215 aa), FASTA scores: opt: 152, E(): 0.0048, (28.4% identity in 204 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215281.1" /db_xref="GI:15607907" /db_xref="GeneID:888833" /translation="MSSDVLVTTPAQRQTEPHAEAVSRNRRQQATFRKVLAAAMATLR EKSYADLTVRLVAARAKVAPATAYTYFSSKNHLIAEVYLDLVRQVPCVTDVNVPMPIR VTSSLRHLALVVADEPEIGAACTAALLDGGADPAVRAVRDRIGAEIHRRITSAIGPGA DPGTVFALEMAFFGALVQAGSGTFTYHEIADRLGYVVGLILAGANEPSTGGSE" gene 860912..862381 /gene="aldA" /locus_tag="Rv0768" /db_xref="GeneID:888832" CDS 860912..862381 /gene="aldA" /locus_tag="Rv0768" /EC_number="1.2.1.-" /function="OXIDIZES A VARIETY OF ALDEHYDES [CATALYTIC ACTIVITY: An aldehyde + NAD+ + H2O = an acid + NADH]." /experiment="experimental evidence, no additional details recorded" /note="Rv0768, (MTCY369.13), len: 489 aa. Probable aldA, NAD-dependent aldehyde dehydrogenase (EC 1.2.1.-), highly similar to others e.g. AAL14238.1|AY052630 6-oxolauric acid dehydrogenase from Rhodococcus ruber (474 aa); NP_285450.1|NC_001264 aldehyde dehydrogenase from Deinococcus radiodurans (495 aa); NP_241405.1|NC_002570 NADP-dependent aldehyde dehydrogenase from Bacillus halodurans (498 aa); P42757|DHAB_ATRHO betaine-aldehyde dehydrogenase precursor from Atriplex hortensis (Mountain spinach) (502 aa), FASTA scores: opt: 1001, E(): 0, (35.6% identity in 486 aa overlap); etc. Also highly similar to Rv0223c ALDEHYDE DEHYDROGENASE from Mycobacterium tuberculosis (487 aa). Contains PS00687 Aldehyde dehydrogenases glutamic acid active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase NAD dependent AldA" /protein_id="NP_215282.1" /db_xref="GI:15607908" /db_xref="GeneID:888832" /translation="MALWGDGISALLIDGKLSDGRAGTFPTVNPATEEVLGVAADADA EDMGRAIEAARRAFDSTDWSRNTELRVRCVRQLRDAMQQHVEELRELTISEVGAPRML TASAQLEGPVGDLSFAADTAESYPWKQDLGEASPLGIATRRTLAREAVGVVGAITPWN FPHQINLAKLGPALAAGNTVVLKPAPDTPWCAAALGEIIVEHTDFPPGVVNIVTSSSH ALGALLAKDPRVDMISFTGSTATGRAVMADAAATIKKVFLELGGKSAFVVLDDADLAA ASAVSAFSACMHAGQGCAITTRLVVPRARYEEAVAIAAATMSSIRPGDPNDPGTVCGP LISARQRDRVQGYLDLAVAEGGRFACGGARPADREVGFYIEPTVIAGLTNDARVAREE IFGPVLTVIAHDGDDDAVRIANDSPYGLSGTVYGADPQRAARIASRLRVGTVNVNGGV WYCADAPFGGYKQSGIGREMGLLGFEEYLEAKLIATAAN" misc_feature 861683..861706 /gene="aldA" /locus_tag="Rv0768" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" gene 862412..863158 /locus_tag="Rv0769" /db_xref="GeneID:888837" CDS 862412..863158 /locus_tag="Rv0769" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0769, (MTCY369.14), len: 248 aa. Probable dehydrogenase/reductase (EC 1.-.-.-), similar to others, especially short-chain type dehydrogenases/reductases and 3-oxoacyl-(acyl-carrier protein) reductases e.g. NP_106890.1|NC_002678 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mesorhizobium loti (374 aa); NP_243357.1|NC_002570 3-oxoacyl-(acyl-carrier protein) reductase from Bacillus halodurans (246 aa); P28643|FABG_CUPLA 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE from Cuphea lanceolata (320 aa); P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa), FASTA scores: opt: 536, E(): 6.5e-27, (37.7% identity in 247 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. MTCY02B10.14, FASTA score: (33.7% identity in 249 aa overlap); etc." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_215283.1" /db_xref="GI:15607909" /db_xref="GeneID:888837" /translation="MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAV AKQIVADGGTAIHVPVDVSDEDSAKAMVDRAVGAFGGIDYLVNNAAIYGGMKLDLLLT VPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYSNFYGLAKVGV NGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELVKNMVQTIPLSRMGTPEDL VGMCLFLLSDSASWITGQIFNVDGGQIIRS" repeat_region 863155..863255 /note="101 bp Mycobacterial Interspersed Repetitive Unit, Class I" gene 863256..864143 /locus_tag="Rv0770" /db_xref="GeneID:888868" CDS 863256..864143 /locus_tag="Rv0770" /EC_number="1.1.1.-" /function="UNKNOWN; 3-HYDROXYISOBUTYRATE DEHYDROGENASE FAMILY PROTEIN PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0770, (MTCY369.15), len: 295 aa. Probable dehydrogenase/reductase, 3-hydroxyisobutyrate dehydrogenase family (EC 1.1.1.-), possibly 3-hydroxyisobutyrate dehydrogenase (EC 1.1.1.31) or 2-hydroxy-3-oxopropionate reductase (EC 1.1.1.60), similar to others e.g. P23523|GARR_ECOLI 2-HYDROXY-3-OXOPROPIONATE REDUCTASE (TARTRONATE SEMIALDEHYDE REDUCTASE) (TSAR) from Escherichia coli strain K12 (294 aa), FASTA scores: opt: 469, E(): 6.7e-22, (34.4% identity in 282 aa overlap); P28811|MMSB_PSEAE 3-hydroxyisobutyrate dehydrogenase (HIBADH) from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 439, E(): 4.3e-20, (34.9% identity in 269 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv1122 and Rv1844c. SEEMS TO BELONG TO THE 3-HYDROXYISOBUTYRATE DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="dehydrogenase/reductase" /protein_id="NP_215284.1" /db_xref="GI:15607910" /db_xref="GeneID:888868" /translation="MTAHPETPRLGYIGLGNQGAPMAKRLLDWPGGLTVFDVRVEAMA PFVEGGATAAASVSDVAEADIISITVFDDAQVSSVITADNGLATHAKPGTIVAIHSTI ADTTAVDLAEKLKPQGIHIVDAPVSGGAAAAAKGELAVMVGADDEAFQRIKEPFSRWA SLLIHAGEPGAGTRMKLARNMLTFVSYAAAAEAQRLAEACGLDLVALGKVVRHSDSFT GGAGAIMFRNTTAPMEPADPLRPLLEHTRGLGEKDLSLALALGEVVSVDLPLAQLALQ RLAAGLGVPHPDTEPAKET" gene 864140..864574 /locus_tag="Rv0771" /db_xref="GeneID:888872" CDS 864140..864574 /locus_tag="Rv0771" /EC_number="4.1.1.44" /function="INVOLVED IN AROMATIC HYDROCARBONS CATABOLISM. THOUGHT TO BE INVOLVED IN THE CATABOLISM OF PROTOCATECHUATE TO SUCCINATE-AND ACETYL-CoA IN THE BETA-KETOADIPATE PATHWAY (AT THE THIRD STEP) [CATALYTIC ACTIVITY: 2-CARBOXY-5-OXO-2,5-DIHYDROFURAN-2-ACETATE = 5-OXO-4,5-DIHYDROFURAN-2-ACETATE + CO(2)]." /note="Rv0771, (MTCY369.16), len: 144 aa. Possible 4-carboxymuconolactone decarboxylase (EC 4.1.1.44), showing similarity with other carboxymuconolactone decarboxylases e.g. AAD39557.1|AF031417 PcaC-like protein from Pseudomonas putida (130 aa); P20370|DC4C_ACICA 4-CARBOXYMUCONOLACTONE DECARBOXYLASE (CMD) from Acinetobacter sp. ADP1 (134 aa), FASTA scores: opt: 174, E(): 0.00075, (31.4% identity in 121 aa overlap); C-terminus of NP_421214.1|NC_002696 3-oxoadipate enol-lactone hydrolase/4-carboxymuconolactone decarboxylase from Caulobacter crescentus (393 aa); C-terminus of T47115 probable 4-carboxymuconolactone decarboxylase / 3-oxoadipate enol-lactone hydrolase from Streptomyces sp (373 aa); NP_407104.1|NC_003143 putative gamma carboxymuconolactone decarboxylase from Yersinia pestis (131 aa); etc." /codon_start=1 /transl_table=11 /product="4-carboxymuconolactone decarboxylase" /protein_id="NP_215285.1" /db_xref="GI:15607911" /db_xref="GeneID:888872" /translation="MMDELRRTGLDKMNEVYAWDMPDMPGEFFALTVDHLFGRIWTRP GLSMRDRRMAVIAVLTAQGQSDLLEVQVNAVLHNDELTIDELRELAVFITHYVGFPLG SRLNSAIERVAAKRKQAAENGSLPDTKANVAEVLAKESGKSS" gene 864586..865854 /gene="purD" /locus_tag="Rv0772" /db_xref="GeneID:888873" CDS 864586..865854 /gene="purD" /locus_tag="Rv0772" /EC_number="6.3.4.13" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE SECOND STEP) [CATALYTIC ACTIVITY: ATP + 5-PHOSPHORIBOSYLAMINE + GLYCINE = ADP + PHOSPHATE + 5'-PHOSPHORIBOSYLGLYCINAMIDE]." /note="catalyzes the formation of N(1)-(5-phospho-D-ribosyl)glycinamide from 5-phospho-D-ribosylamine and glycine in purine biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosylamine--glycine ligase" /protein_id="NP_215286.1" /db_xref="GI:15607912" /db_xref="GeneID:888873" /translation="MRVLVIGSGAREHALLLALGKDPQVSGLIVAPGNAGTARIAEQH DVDITSAEAVVALAREVGADMVVIGPEVPLVLGVADAVRAAGIVCFGPGKDAARIEGS KAFAKDVMAAAGVRTANSEIVDSPAHLDAALDRFGPPAGDPAWVVKDDRLAAGKGVVV TADRDVARAHGAALLEAGHPVLLESYLDGPEVSLFCVVDRTVVVPLLPAQDFKRVGED DTGLNTGGMGAYAPLPWLPDNIYREVVSRIVEPVAAELVRRGSSFCGLLYVGLAITAR GPAVVEFNCRFGDPETQAVLALLESPLGQLLHAAATGKLADFGELRWRDGVAVTVVLA AENYPGRPRVGDVVVGSEAEGVLHAGTTRRDDGAIVSSGGRVLSVVGTGADLSAARAH AYEILSSIRLPGGHFRSDIGLRAAEGKISV" gene complement(865851..867389) /gene="ggtA" /locus_tag="Rv0773c" /db_xref="GeneID:888893" CDS complement(865851..867389) /gene="ggtA" /locus_tag="Rv0773c" /EC_number="3.5.1.-" /EC_number="2.3.2.2" /function="BESIDES THE CEPHALOSPORIN ACYLASE I ACTIVITY WHICH CONVERTS GL-7ACA INTO 7-ACA; THIS ENZYME DISPLAYS SOME GAMMA GLUTAMYLTRANSPEPTIDASE ACTIVITY: GGT PLAYS A KEY ROLE IN THE GAMMA-GLUTAMYL CYCLE, A PATHWAY FOR THE SYNTHESIS AND DEGRADATION OF GLUTATHIONE. [CATALYTIC ACTIVITY 1: 7-BETA-(4-CARBOXYBUTANAMIDO)-CEPHALOSPORANIC ACID + H2O = 7-AMINOCEPHALOSPORANIC ACID + GLUTARIC ACID] [CATALYTIC ACTIVITY 2: (5-L-glutamyl)-peptide + an amino acid = peptide + 5-L-glutamyl-amino acid]." /note="Rv0773c, (MTCY369.18), len: 512 aa. Probable ggtA, bifunctional acylase including cephalosporin acylase (EC 3.5.1.-), and gamma-glutamyl transpeptidase (EC 2.3.2.2); highly similar to others e.g. NP_295247.1|NC_001263 cephalosporin acylase from Deinococcus radiodurans (535 aa); NP_248854.1|NC_002516 probable gamma-glutamyltranspeptidase from Pseudomonas aeruginosa (538 aa); P15557|PAC1_PSES3 ACYLASE ACY 1 [INCLUDES: CEPHALOSPORIN ACYLASE (GL-7ACA ACYLASE); GAMMA-GLUTAMYLTRANSPEPTIDASE (GGT)] from Pseudomonas sp. strain SE83 (558 aa), FASTA scores: opt: 784, E(): 0, (34.2% identity in 526 aa overlap); NP_391491.1|NC_000964|Z93767|BSZ93767_6|O0521 protein similar to gamma-glutamyltransferase from Bacillus subtilis (525 aa), FASTA scores: opt: 1169, E(): 0, (40.1% identity in 516 aa overlap); etc. Also similar to Rv2394|ggtB from Mycobacterium tuberculosis. Member of GL-7ACA ACYLASES AND TO GGT group." /codon_start=1 /transl_table=11 /product="bifunctional cephalosporin acylase/gamma-glutamyltranspeptidase" /protein_id="NP_215287.1" /db_xref="GI:15607913" /db_xref="GeneID:888893" /translation="MPILATNVVCTSQPLAAQAGLRMLADGGNAVDAAVATAITLTVV EPVSNGIGSDAFSIVWDGQKLHGLNASGRSPSAWTPEYFGGNAVPVLGWNSVTVPGAV SAWVELHARFGRLPFETLFEPAISYGRNGFLVSPTVAAQWAAQVPLFASQPGFADAFM PGGRAPKPGELFTFPDHAATLEKIAATNGEEFYRGELAAKLEAHSAANGGVMRADDLA AHRVDWVDTITGTYRGYTIHQIPPNGQGIVALIALGILEHFDMSSWSVDSAESVHVQI EALKLAFADAQACVADIDYMPVHPKRLLDKEYLRQRATLIDPKRAMPAATGIPRGGTV YLAAADAAGMMVSMIQSNYLGFGSGVVVPGTGISLHNRGSDFTVVPRHPNRVGPRKRP YHTIIPGFVTRDGAPVMSFGVMGGMMQPQGHVQVLVRIADYGQNPQAACDGPRFRWVN GMRVSFENGFPDSTLDELRQRGHDLVAVADYSQFGSCQAIWRLDDGYLAASDPRRDGQ AAAC" gene complement(867440..868351) /locus_tag="Rv0774c" /db_xref="GeneID:888895" CDS complement(867440..868351) /locus_tag="Rv0774c" /function="UNKNOWN; COULD HAVE POSSIBLY A LIPOLYTIC ACTIVITY." /note="Rv0774c, (MTCY369.19c), len: 303 aa. Possible conserved exported protein with hydrophobic region near N-terminus, highly similar, except in N-terminus, to Rv0519c|Z97831|MTY20G10.09c|O33364 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (300 aa), FASTA scores: opt: 1092, E(): 0, (57.9% identity in 299 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature, and PS00120 Lipases, serine active site. So could be a lipase (EC 3.1.-.-). Start changed since first submission (-9 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215288.2" /db_xref="GI:57116780" /db_xref="GeneID:888895" /translation="MMARMPELSRRAVLGLGAGTVLGATSAYAIDMLLQPRTSHAAPA AAIGTNVPLAPTPALDPAPPAQAAPTMSTGSFVSAARAGKMTNWAIARPPGQTQALRP VIALHGLGGSASAVMDGGVEQGLAQAVNAGLPPFAVVSVDGGSSYWHQRASGEDAGAM VLNELIPLLDTQRLDTSRVAFLGWSMGGYGALLLGSRLGPARTAAICAVSPALWLSAG SVAPGSFDGPDDWSANSVFGLPALGSIPIRVDCGNSDPFYAATKQFVAQLPHPPAGGF SPGGHNGGFWSAQLPAELTWFAPLLTG" misc_feature complement(867533..867619) /locus_tag="Rv0774c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" misc_feature complement(867788..867817) /locus_tag="Rv0774c" /note="PS00120 Lipases, serine active site" gene 868407..869030 /locus_tag="Rv0775" /db_xref="GeneID:888899" CDS 868407..869030 /locus_tag="Rv0775" /function="UNKNOWN" /note="Rv0775, (MTCY369.20), len: 207 aa. Conserved hypothetical protein, showing some similarity to other proteins e.g. ECAE000186_11|MG1655 HYPOTHETICAL PROTEIN from Escherichia coli strain K-12 (178 aa), FASTA scores: E(): 6.4e-05, (27.2% identity in 147 aa overlap); P41037|BIH_ECOLI hypothetical transcriptional regulator from Escherichia coli (103 aa), FASTA scores: opt: 138, E(): 0.003, (30.9% identity in 97 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215289.1" /db_xref="GI:15607915" /db_xref="GeneID:888899" /translation="MGVTAAVTPKGERRRYALVSAAAELLGEGGFEAVRHRAVARRAG LPLASTTYYFSSLDDLIARAVEHIGMIEVAQLRARVSALSRRRRGPETTAVVLVDLLV GEMSSPGLAEQLISRYERHIACTRLPDLRESMRRSLRQRAEAVAEAIERSGRSAQIEL VCTLICAVDGSVVSALVEGRDPRAAALATVVDLIDVLAPVDQRPVPF" gene complement(868984..869763) /locus_tag="Rv0776c" /db_xref="GeneID:888918" CDS complement(868984..869763) /locus_tag="Rv0776c" /function="UNKNOWN" /note="Rv0776c, (MTCY369.21a), len: 259 aa. Conserved hypothetical protein, similar (except first 50 aa) to P72737|D90900_57 hypothetical protein from Synechocystis sp. strain PCC 6803 (261 aa), FASTA scores: opt: 337, E(): 1.7e-15, (30.5% identity in 266 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215290.1" /db_xref="GI:15607916" /db_xref="GeneID:888918" /translation="MYFVGVDLAWAGRNPTGVAAVDADGCLVGVGAARDDASVLAALR PYVVGDCLVAFDAPLVVANRTGQRPAEAALNRDFRQFEAGAYPANTEKPEFADVPRAA RLARQLALDMDPLSSATRRAIEVYPHPATVALFRLPRALKYKAKPGRSVDLLKSELLR LMDGVEGLAQAGVRMQVAGQPDWVSLRRQVTVAQRKSDLRAAEDPIDAVVCAYVALYA QRRPADVTIYGDFTTGYIVTPSLPTDFRTAPDAGRRARARR" gene 870008..871426 /gene="purB" /locus_tag="Rv0777" /db_xref="GeneID:888929" CDS 870008..871426 /gene="purB" /locus_tag="Rv0777" /EC_number="4.3.2.2" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE EIGHT STEP) [CATALYTIC ACTIVITY: 1-(5-PHOSPHORIBOSYL)-4-(N-SUCCINO-CARBOXAMIDE) -5-AMINOIMIDAZOLE = FUMARATE + 5'-PHOSPHORIBOSYL-5-AMINO-4-IMIDAZOLECARBOXAMIDE (ALSO CATALYZES: N6-(1,2-DICARBOXYETHYL)AMP = FUMARATE + AMP)]." /note="Catalyzes two discrete reactions in the de novo synthesis of purines: the cleavage of adenylosuccinate and succinylaminoimidazole carboxamide ribotide" /codon_start=1 /transl_table=11 /product="adenylosuccinate lyase" /protein_id="NP_215291.1" /db_xref="GI:15607917" /db_xref="GeneID:888929" /translation="MSIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELG VAVADSVLADYERVVDDVDLASISARERVLRHDVKARIEEFNALAGHEHVHKGMTSRD LTENVEQLQIRRSLEVIFAHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRF ASAAQEMMIALRRLRELIDRYPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFL GFATVFNSVGQVYPRSLDHDVVSALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVG SSAMPHKMNTRSCERVNGLQVVLRGYASMVAELAGAQWNEGDVFCSVVRRVALPDSFF AVDGQIETFLTVLDEFGAYPAVIGRELDRYLPFLATTKVLMAAVRAGMGRESAHRLIS EHAVATALAMREHGAEPDLLDRLAADPRLTLGRDALEAALADKKAFAGAAGDQVDDVV AMVDALVSRYPDAAKYTPGAIL" misc_feature 870833..870862 /gene="purB" /locus_tag="Rv0777" /note="PS00163 Fumarate lyases signature" gene 871431..872675 /gene="cyp126" /locus_tag="Rv0778" /db_xref="GeneID:888913" CDS 871431..872675 /gene="cyp126" /locus_tag="Rv0778" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv0778, (MT0802, MTCY369.22), len: 414 aa. Possible cyp126, cytochrome P-450 (EC 1.14.-.-), similar to other cytochromes and related proteins e.g. AAG29781.1|AF235050_4|AF235050 cytochrome P-450 from Streptomyces rishiriensis (407 aa); Q59723|PSECYTOCHR_1 cytochrome p-450 linalool 8-monooxygenase (EC 1.14.99.28) (lin C) from Pseudomonas incognita (406 aa), FASTA scores: opt: 769, E(): 0, (37.0% identity in 411 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0766c, Rv2266, Rv3545c, etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature." /codon_start=1 /transl_table=11 /product="cytochrome P450 126" /protein_id="NP_215292.1" /db_xref="GI:15607918" /db_xref="GeneID:888913" /translation="MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEH TPDGEGFWSVATYAETLEVLRDPVTYSSVTGGQRRFGGTVLQDLPVAGQVLNMMDDPR HTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIAAELPMQMICI LLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGSRLYTYALELIAGKRAEPA DDMLSVVANATIDDPDAPALSDAELYLFFHLLFSAGAETTRNSIAGGLLALAENPDQL QTLRSDFELLPTAIEEIVRWTSPSPSKRRTASRAVSLGGQPIEAGQKVVVWEGSANRD PSVFDRADEFDITRKPNPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEP AEWTRSNRHTGIRHLVVELRGG" misc_feature 872496..872525 /gene="cyp126" /locus_tag="Rv0778" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(872672..873292) /locus_tag="Rv0779c" /db_xref="GeneID:888941" CDS complement(872672..873292) /locus_tag="Rv0779c" /function="UNKNOWN" /note="Rv0779c, (MTCY369.23c), len: 206 aa. Possible conserved transmembrane protein, equivalent to Z95151|MLCB5_14 O05747 conserved hypothetical protein from Mycobacterium leprae (206 aa), FASTA scores: opt: 902, E(): 0, (67.2% identity in 204 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215293.1" /db_xref="GI:15607919" /db_xref="GeneID:888941" /translation="MRSRFLPYATTPGRLLAQLISDITVAVWTTLWMLVGLAVHDAIS IIGEAGRQIEIGSHGIAGNLAAAGQDAQRIPVVGDALSNPITAASQAALDIAGAGHNL DTTAGWLAVVLALAVAATPILAVAMPWLFLRLRFCRRKWTVTTLAATPAGRQLLALRA LANRPPGKLAAVSTDPVGAWRREDPATMRALAALELRAAGIPLRGD" gene 873343..874236 /gene="hemH" /locus_tag="Rv0780" /db_xref="GeneID:888928" CDS 873343..874236 /gene="hemH" /locus_tag="Rv0780" /EC_number="6.3.2.6" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE SEVENTH STEP) [CATALYTIC ACTIVITY: ATP + 1-(5-phosphoribosyl)-4-carboxy-5-aminoimidazole + L-aspartate = ADP + phosphate + 1-(5-phosphoribosyl)-4-(N-succino-carboxamide)-5-aminoimi da zole]." /note="catalyzes the formation of (S)-2-(5-amino-1-(5-phospho-D-ribosyl)imidazole-4- carboxamido)succinate from 5-amino-1-(5-phospho-D-ribosyl)imidazole-4-carboxylate and L-aspartate in purine biosynthesis; SAICAR synthase" /codon_start=1 /transl_table=11 /product="phosphoribosylaminoimidazole-succinocarboxamide synthase" /protein_id="NP_215294.1" /db_xref="GI:15607920" /db_xref="GeneID:888928" /translation="MRPALSDYQHVASGKVREIYRVDDEHLLLVASDRISAYDYVLDS TIPDKGRVLTAMSAFFFGLVDAPNHLAGPPDDPRIPDEVLGRALVVRRLEMLPVECVA RGYLTGSGLLDYQATGKVCGIALPPGLVEASRFATPLFTPATKAALGDHDENISFDRV VEMVGALRANQLRDRTLQTYVQAADHALTRGIIIADTKFEFGIDRHGNLLLADEIFTP DSSRYWPADDYRAGVVQTSFDKQFVRSWLTGSESGWDRGSDRPPPPLPEHIVEATRAR YINAYERISELKFDDWIGPGA" misc_feature 873922..873948 /gene="hemH" /locus_tag="Rv0780" /note="PS01058 SAICAR synthetase signature 2" gene 874233..874943 /gene="ptrBa" /locus_tag="Rv0781" /db_xref="GeneID:885840" CDS 874233..874943 /gene="ptrBa" /locus_tag="Rv0781" /EC_number="3.4.21.83" /function="CLEAVES PEPTIDE BONDS ON THE C-TERMINAL SIDE OF LYSYL AND ARGININYL RESIDUES [CATALYTIC ACTIVITY: HYDROLYSIS OF ARG-|-XAA AND LYS-|-XAA BONDS IN OLIGOPEPTIDES, EVEN WHEN P1' RESIDUE IS PROLINE]." /note="Rv0781, (MTCY369.25), len: 236 aa. Probable ptrBa, first part of protease II (EC 3.4.21.83), equivalent to N-terminus of NP_302455.1|NC_002677 protease II from Mycobacterium leprae (724 aa). Also highly similar to N-termini of many proteases II e.g. P24555|PTRB_ECOLI|TLP|B1845 protease II from Escherichia coli strains K12 and HB101 (707 aa), FASTA scores: opt: 204, E(): 7.4e-07, (29.6% identity in 230 aa overlap); etc. ORFs Rv0782 and Rv0781 appear to be a frameshifted homologues of protease II, but we can find no error in the cosmid sequence to account for this. BELONGS TO PEPTIDASE FAMILY S9A; ALSO KNOWN AS THE PROLYL OLIGOPEPTIDASE FAMILY. Note that previously known as ptrBb.; ptrBb" /codon_start=1 /transl_table=11 /product="oligopeptidase B" /protein_id="NP_215296.2" /db_xref="GI:57116781" /db_xref="GeneID:885840" /translation="MMHRTALPSPPVAKRVQTRREHHGDVFVDPYEWLRDKDSPEVIA YLEAENDYTERTTAHLEPLRQKIFHEIKARTKETDLSVPTRRGNWWYYARTFEGKQYG VHCRCPVTDPDDWNPPEFDERTEIPGEQLLLDENVEADGHDFFALGAASVSLDDNLLA YSVDVVGDERYTLRFKDLRTGEQYPDEIAGIGAGVTWAADNHCLLHHRGRGLASGHSV AIPTRVRRIVGAGLPRSR" gene 874732..876390 /gene="ptrBb" /locus_tag="Rv0782" /db_xref="GeneID:885862" CDS 874732..876390 /gene="ptrBb" /locus_tag="Rv0782" /EC_number="3.4.21.83" /function="CLEAVES PEPTIDE BONDS ON THE C-TERMINAL SIDE OF LYSYL AND ARGININYL RESIDUES [CATALYTIC ACTIVITY: HYDROLYSIS OF ARG-|-XAA AND LYS-|-XAA BONDS IN OLIGOPEPTIDES, EVEN WHEN P1' RESIDUE IS PROLINE]." /note="Rv0782, (MTCY369.26), len: 552 aa. Probable ptrBb, second part of protease II (EC 3.4.21.83), equivalent to C-terminus of NP_302455.1|NC_002677 protease II from Mycobacterium leprae (724 aa). Also highly similar to N-termini of many proteases II e.g. P24555|PTRB_ECOLI|TLP|B1845 protease II from Escherichia coli strains K12 and HB101 (707 aa), FASTA scores: opt: 1251, E(): 0, (42.7% identity in 489 aa overlap); etc. ORFs Rv0782 and Rv0781 appear to be a frameshifted homologues of protease II, but we can find no error in the cosmid sequence to account for this. BELONGS TO PEPTIDASE FAMILY S9A; ALSO KNOWN AS THE PROLYL OLIGOPEPTIDASE FAMILY. Note that previously known as ptrBa.; ptrBa" /codon_start=1 /transl_table=11 /product="oligopeptidase B" /protein_id="NP_215295.2" /db_xref="GI:57116782" /db_xref="GeneID:885862" /translation="MTNDIPCGSRIYAPENSTRTRSPGSERESPGQLTTTVYYTTVDA AWRPDTVWRYRLGSGESSERVYHEADDRFWLAVGRTRSNAYLLIAAGSSITSEVRYAH AADPTAQFSVVLPRRDGVEYSVEHAVIAGQDRFLILHNDGAVNFTLVEAPVEDPARQR TLIAHRDDVRLDAVDALAGHLVVSYRREALPRVQLWPIGPDGNYGEPEEISFDSELMS AGLGPNPNWDSPKLRVGAGSFVTPVRIYDIDLVTGERTLLKEQPVLGGYRREDYVERR DWAYGDDGTRIPVSIVHRADIEFPAPALIYGYGAYEICEDPRFSIARLSLLDRGMVFV VAHVRGGGEMGRLWYENGKLLDKKNTFTDFIAVARHLVDTGLTSQQQLVALGGSAGGL LMGAVANMAPDLFAGILAQVPFVDPLTTILDPSLPLTVTEWDEWGNPLNDSDVYAYVK SYSPYENVTAQKYPAILAMTSLNDTRVYYVEPAKWVAALRHAKTDGNSVLLKTQMHAG HGGISGRYERWKETAFQYGWLLATADSDRYGGGQGNDLDGAAPA" gene complement(876818..878440) /gene="emrB" /locus_tag="Rv0783c" /db_xref="GeneID:885836" CDS complement(876818..878440) /gene="emrB" /locus_tag="Rv0783c" /function="TRANSLOCASE THAT CONFERS RESISTANCE TO SUBSTANCES OF HIGH HYDROPHOBICITY. INVOLVED IN TRANSPORT OF MULTIDRUG ACROSS THE MEMBRANE (EXPORT): MULTIDRUG RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0783c, (MTCY369.27c), len: 540 aa. Possible emrB, integral membrane drug efflux protein, member of major facilitator superfamily (MFS), equivalent to AAL16083.1|AF421382_1|AF421382 EmrB efflux protein from Mycobacterium avium (538 aa). Also similar to other membrane proteins e.g. CAB61606.1|AL133210 putative export protein from Streptomyces coelicolor (496 aa); NP_108371.1|NC_002678 efflux pump protein FarB from Mesorhizobium loti (511 aa); P44927|EMRB_HAEINHI0897| multidrug resistance protein b homologue from Haemophilus influenzae (510 aa), FASTA scores: opt: 706, E(): 1.3e-36, (30.4% identity in 408 aa overlap); etc. Also similar to Rv2333c|MTCY3G12.01 from Mycobacterium tuberculosis (537 aa), FASTA score: (28.2% identity in 408 aa overlap); and Rv1410c|MTCY21B4.27c from Mycobacterium tuberculosis (518 aa), FASTA score: (26.8% identity in 496 aa overlap). BELONGS TO THE MAJOR FACILITATOR FAMILY; ALSO KNOWN AS THE DRUG RESISTANCE TRANSLOCASE FAMILY." /codon_start=1 /transl_table=11 /product="multidrug resistance integral membrane efflux protein EmrB" /protein_id="NP_215297.1" /db_xref="GI:15607923" /db_xref="GeneID:885836" /translation="MLGNAMVEACPAEGDAPVPITPAGRPRSGQRSYPDRLDVGLLRT AGVCVLASVMAHVDVTVVSVAQRTFVADFGSTQAVVAWTMTGYMLALATVIPTAGWAA DRFGTRRLFMGSVLAFTLGSLLCAVAPNILLLIIFRVVQGFGGGMLTPVSFAILAREA GPKRLGRVMAVVGIPMLLGPVGGPILGGWLIGAYGWRWIFLVNLPVGLSALVLAAIVF PRDRPAASENFDYMGLLLLSPGLATFLFGVSSSPARGTMADRHVLIPAITGLALIAAF VAHSWYRTEHPLIDMRLFQNRAVAQANMTMTVLSLGLFGSFLLLPSYLQQVLHQSPMQ SGVHIIPQGLGAMLAMPIAGAMMDRRGPAKIVLVGIMLIAAGLGTFAFGVARQADYLP ILPTGLAIMGMGMGCSMMPLSGAAVQTLAPHQIARGSTLISVNQQVGGSIGTALMSVL LTYQFNHSEIIATAKKVALTPESGAGRGAAVDPSSLPRQTNFAAQLLHDLSHAYAVVF VIATALVVSTLIPAAFLPKQQASHRRAPLLSA" gene 878638..879324 /locus_tag="Rv0784" /db_xref="GeneID:885863" CDS 878638..879324 /locus_tag="Rv0784" /function="UNKNOWN" /note="Rv0784, (MTC369.28), len: 228 aa. Conserved hypothetical protein, with some similarity to MLCB5_20|O05752 hypothetical protein from Mycobacterium leprae (193 aa), FASTA scores: opt: 141, E(): 0.0022, (36.0% identity in 114 aa overlap). Also similar to N-terminus of NP_253002.1|NC_002516 conserved hypothetical protein from Pseudomonas aeruginosa (253 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215298.1" /db_xref="GI:15607924" /db_xref="GeneID:885863" /translation="MSVSGIGESTLADVDAFCAEMDARSVPVSLLVAPRMRDDYRLDR DPRTVDWLTGRRAAGDALVLHGYDEAATKRRRGEFAMLRAHEANLRLMAADRVLEHLG LRTRLFAAPGWLVSPGVRTALPANGFRLLADLHGITDLVRLTTVRARVLGIGEGFLAE PWWCRMVVMSAERIARRGGVVRIAVAARHLRKSGPLQAMLDAVDLAMLQGCTPMVYRW RADAAVLDAA" gene 879340..881040 /locus_tag="Rv0785" /db_xref="GeneID:885864" CDS 879340..881040 /locus_tag="Rv0785" /function="UNKNOWN" /note="proposed role in polysaccahride synthesis" /codon_start=1 /transl_table=11 /product="putative FAD-binding dehydrogenase" /protein_id="NP_215299.1" /db_xref="GI:15607925" /db_xref="GeneID:885864" /translation="MALTCTDMSDAVAGSDAEGLTADAIVVGAGLAGLVAACELADRG LRVLILDQENRANVGGQAFWSFGGLFLVNSPEQRRLGIRDSHELALQDWLGTAAFDRP EDYWPEQWAHAYVDFAAGEKRSWLRARGLKIFPLVGWAERGGYDAQGHGNSVPRFHIT WGTGPALVDIFVRQLRDRPTVRFAHRHQVDKLIVEGNAVTGVRGTVLEPSDEPRGAPS SRKSVGKFEFRASAVIVASGGIGGNHELVRKNWPRRMGRIPKQLLSGVPAHVDGRMIG IAQKAGAAVINPDRMWHYTEGITNYDPIWPRHGIRIIPGPSSLWLDAAGKRLPVPLFP GFDTLGTLEYITKSGHDYTWFVLNAKIIEKEFALSGQEQNPDLTGRRLGQLLRSRAHA GPPGPVQAFIDRGVDCVHANSLRELVAAMNELPDVVPLDYETVAAAVTARDREVVNKY SKDGQITAIRAARRYRGDRFGRVVAPHRLTDPKAGPLIAVKLHILTRKTLGGIETDLD ARVLKADGTPLAGLYAAGEVAGFGGGGVHGYRALEGTFLGGCIFSGRAAGRGAAEDIR" gene complement(881075..881464) /locus_tag="Rv0786c" /db_xref="GeneID:885615" CDS complement(881075..881464) /locus_tag="Rv0786c" /function="UNKNOWN" /note="Rv0786c, (MTCY369.30c), len: 129 aa. Conserved hypothetical protein, similar to three other hypothetical proteins from Streptomyces coelicolor e.g. SC7H1.08c|T35703 hypothetical protein (202 aa), FASTA scores: opt: 241, E(): 5.1e-10, (41.0% identity in 105 aa overlap); SC3A7.08|T29426 (211 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215300.1" /db_xref="GI:15607926" /db_xref="GeneID:885615" /translation="MHVGDELPLAELTVRAVGGCHAVIHPEIPVIENISYLVGDSKHR ARLMHPGDALFVPGEQVDVLATPAAAPWMKISEAVDYLRAVAPARAVPIHQAIVAPDA RGIYYGRLTEMTTTDFQVLPEESAVTF" gene 881459..882418 /locus_tag="Rv0787" /db_xref="GeneID:885468" CDS 881459..882418 /locus_tag="Rv0787" /function="UNKNOWN" /note="Rv0787, (MTCY369.31), len: 319 aa. Hypothetical unknown protein, equivalent to AAK45053.1 from Mycobacterium tuberculosis strain CDC1551 (242 aa) but longer 77 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215301.1" /db_xref="GI:15607927" /db_xref="GeneID:885468" /translation="MHRPPWLAQLRRRLRIGVQLGSRVVLEQGRQPRDVYVIGVLVGD QDRGQTGDSLEAVRESTGIEEQAGLTELSEEAGMAEMRELHVYDCALMGAFPMRLILA TMLVAGRLLATLMAAPSAQAEPETCPPICDQIPATAWISTHAVPLNSQYRWPAMAGAA VAVTRATPRFGFEQVCATPAFPHDSRDWAVAGRVTVVHPDGQWQLQAQVLHWRGDTAR GGQIAASVFGTAVAALRACQLGAPLQSPSVTDDEPTRMAAVISGPVIMYTYLVAHVSS STISELTLWSSGPPQVPWPTVADSAVLDALTAPLCEAYIGSCP" gene 882524..882763 /locus_tag="Rv0787A" /db_xref="GeneID:886264" CDS 882524..882763 /locus_tag="Rv0787A" /EC_number="6.3.5.3" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="With PurL and PurQ catalyzes the conversion of formylglycinamide ribonucleotide, ATP, and glutamine to formylglycinamidine ribonucleotide, ADP, and glutamate in the fourth step of the purine biosynthetic pathway" /codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine synthase subunit PurS" /protein_id="YP_177755.1" /db_xref="GI:57116783" /db_xref="GeneID:886264" /translation="MARVVVHVMPKAEILDPQGQAIVGALGRLGHLGISDVRQGKRFE LEVDDTVDDTTLAEIAESLLANTVIEDWTISRDPQ" gene 882760..883434 /gene="purQ" /locus_tag="Rv0788" /db_xref="GeneID:885181" CDS 882760..883434 /gene="purQ" /locus_tag="Rv0788" /EC_number="6.3.5.3" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE FOURTH STEP) [CATALYTIC ACTIVITY: ATP + 5'-phosphoribosylformylglycinamide + L-glutamine + H2O = ADP + phosphate + 5'-phosphoribosylformylglycinamidine + L-glutamate]." /note="catalyzes the formation of 2-(formamido)-N1-(5-phospho-D-ribosyl)acetamidine from N2-formyl-N1-(5-phospho-D-ribosyl)glycinamide and L-glutamine in purine biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine synthase I" /protein_id="NP_215303.1" /db_xref="GI:15607928" /db_xref="GeneID:885181" /translation="MTARIGVVTFPGTLDDVDAARAARQVGAEVVSLWHADADLKGVD AVVVPGGFSYGDYLRAGAIARFAPVMDEVVAAADRGMPVLGICNGFQVLCEAGLLPGA LTRNVGLHFICRDVWLRVASTSTAWTSRFEPDADLLVPLKSGEGRYVAPEKVLDELEG EGRVVFRYHDNVNGSLRDIAGICSANGRVVGLMPHPEHAIEALTGPSDDGLGLFYSAL DAVLTG" misc_feature 883003..883038 /gene="purQ" /locus_tag="Rv0788" /note="PS00442 Glutamine amidotransferases class-I active site" gene complement(883451..884050) /locus_tag="Rv0789c" /db_xref="GeneID:885153" CDS complement(883451..884050) /locus_tag="Rv0789c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0789c, (MTCY369.33c), len: 199 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215304.1" /db_xref="GI:15607929" /db_xref="GeneID:885153" /translation="MSRRAIHSGRAAPRRSGNSHLVLRNRVPSSKDSPRRRPHHEFMT ESIGEPLSTNLIERYLRARGRRYFRGHHDAEFFFVANAHLRLHVHLEISPAYRDVFTI RVSPAYFFPATDHTRLAEIVNAWNLQNHEVTAIVHGSSDPHRIGVAAERSLIRDRIRF DDFATFVDNAVSAATELFGQLTAAGLPPTATPPLLRDAG" gene complement(884072..884800) /locus_tag="Rv0790c" /db_xref="GeneID:885253" CDS complement(884072..884800) /locus_tag="Rv0790c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0790c, (MTCY369.34c), len: 242 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215305.1" /db_xref="GI:15607930" /db_xref="GeneID:885253" /translation="MTLANNGTGMDHFLTPTEYLDAGHPLVRTTAATLIRDAVSDTER VRRIYYYVRDVPYDVLASFRYLAQGHHRASDVIGHGVAFCMGKASSFVALCRAAGVPA RIAFQTIDAPDKEFLSPQVRALWGGRTGRPFPWHSLGEAYLGRRWVKLDATIDAPTAA RLGKPYRQEFDGATPIPTVEGTILRENGSYADYPSAVAQWYERIAQSVLKALQSTEVH ALVAADEELWTGPPVELADATHRL" gene complement(884797..885840) /locus_tag="Rv0791c" /db_xref="GeneID:885273" CDS complement(884797..885840) /locus_tag="Rv0791c" /function="UNKNOWN" /note="Rv0791c, (MTV042.01c, MTCY369.35c), len: 347 aa. Conserved hypothetical protein, similar (except in N-terminus) to others e.g. CAC44585.1|AL596162 conserved hypothetical protein from Streptomyces coelicolor (307 aa); NP_252643.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (364 aa); etc. Also some similarity with oxidoreductases e.g. AAK38097.1|AF323606_3|AF323606 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa); etc. And also similar in part to other proteins from Mycobacterium tuberculosis e.g. Rv1855c|MTCY359.18|Z83859 (307 aa), FASTA scores: opt: 366, E(): 4e-16, (35.0% identity in 226 aa overlap); Rv3079c|MTCY22D7.02|Z83866 CONSERVED HYPOTHETICAL PROTEIN (275 aa), FASTA scores: opt: 342, E(): 1.2e-14, (31.6% identity in 234 aa overlap); Rv0044c POSSIBLE OXIDOREDUCTASE (264 aa). TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215306.1" /db_xref="GI:15607931" /db_xref="GeneID:885273" /translation="MNAKDDPHFGLMLAATVNGLAVGSYREMVVVSQTAEEYGFDSVW LCDHFLTISPGEYAKVAGIAADTGSATGTETGGAGQCAPSRSLPLLECWTALAALSRD TTKLRLGTSVLCNSYRHPSVLAKMAATLDVISQGRLDLGLGAGWFRRESQAYGIPFPP VGDRVSALAESLQVIKAVWTEPNPTYAGRFYTLDGATCDPPPVQRPHPPLWIGGEGDR VQRIAAKHAQGLNVRWWSPQQVTQRRGFLTQASEAAGRDPDTLRLSVTLLLAPTQSGE EEVRIREEFASIPEPGLIVGTPDRCVERIREYQDRGVGHFLFTIPHVVKSDYLHIIGS DIIPRVKTEVTIP" gene complement(885837..886646) /locus_tag="Rv0792c" /db_xref="GeneID:885142" CDS complement(885837..886646) /locus_tag="Rv0792c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0792c, (MTV042.02c), len: 269 aa. Probable transcriptional regulator, GntR-family, similar to many others of GntR family e.g. BSUB0018_189|Z99121 from Bacillus subtilis (243 aa), FASTA scores: opt: 367, E(): 1.5e-17, (32.1% identity in 246 aa overlap); P31453|YIDP_ECOLI from Escherichia coli (238 aa), FASTA scores: opt: 236, E(): 8.8e-09, (26.4% identity in 235 aa overlap); etc. TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="NP_215307.1" /db_xref="GI:15607932" /db_xref="GeneID:885142" /translation="MTSVKLDLDAADLRISRGSVPASTQLAEALKAQIIQQRLPRGGR LPSERELIDRSGLSRVTVRAAVGMLQRQGWLVRRQGLGTFVADPVEQELSCGVRTITE VLLSCGVTPQVDVLSHQTGPAPQRISETLGLVEVLCIRRRIRTGDQPLALVTAYLPPG VGPAVEPLLSGSADTETTYAMWERRLGVRIAQATHEIHAAGASPDVADALGLAVGSPV LVVDRTSYTNDGKPLEVVVFHHRPERYQFSVTLPRTLPGSGAGIIEKRDFA" gene 886719..887024 /locus_tag="Rv0793" /db_xref="GeneID:885497" CDS 886719..887024 /locus_tag="Rv0793" /function="UNKNOWN" /note="Rv0793, (MTV042.03), len: 101 aa. Conserved hypothetical protein, similar to others e.g. NP_250888.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (114 aa); AE 001908|AE001908_7 hypothetical protein from Deinococcus radiodurans (101 aa), FASTA scores: opt: 215, E(): 3.1e-09, (40.4% identity in 99 aa overlap); NP_440966.1|NC_000911|D90908|PCC6803|D90908_2 unknown protein from Synechocystis sp. strain PCC 6803 (147 aa), FASTA scores: opt: 194, E(): 4.5e-08, (31.1% identity in 90 aa overlap); etc. Also similar to Rv2749|MTV002.14|AL0089|MTV002_15 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (104 aa), FASTA scores: opt: 143, E(): 0.00026, (26.9% identity in 93 aa overlap). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215308.1" /db_xref="GI:15607933" /db_xref="GeneID:885497" /translation="MTSPVAVIARFMPRPDARSALRALLDAMITPTRAEDGCRSYDLY ESADGGELVLFERYRSRIALDEHRGSPHYLNYRAQVGELLTRPVAVTVLAPLDEASA" gene complement(887137..888636) /locus_tag="Rv0794c" /db_xref="GeneID:885076" CDS complement(887137..888636) /locus_tag="Rv0794c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0794c, (MTV042.04c), len: 499 aa. Probable oxidoreductase (EC 1.-.-.-), possibly dihydrolipoamide dehydrogenase (EC 1.8.1.4) or mercuric reductase (EC 1.16.1.1). Highly similar to CAB62675.1|AL133422 probable oxidoreductase from Streptomyces coelicolor (477 aa); and similar to various oxidoreductases e.g. P08663|MERA_STAAU MERCURIC REDUCTASE (HG(II) REDUCTASE) (EC 1.16.1.1) from Staphylococcus aureus (547 aa); AAK70920.1|AC087551_19|AC087551 putative lipoamide dehydrogenase from Oryza sativa (563 aa); NP_437349.1|NC_003078 putative FAD-dependent pyridine nucleotide-disulphide oxidoreductase, similar to mercuric reductases protein from Sinorhizobium meliloti (473 aa); Q04829|DLDH_HALVO DIHYDROLIPOAMIDE DEHYDROGENASE (EC 1.8.1.4) from Haloferax volcanii (475 aa); P08332|MERA_SHIFL MERCURIC REDUCTASE (EC 1.16.1.1) (564 aa), FASTA scores: opt: 522, E(): 3.7e-26, (31.7% identity in 467 aa overlap); P72740|DLDH_SYNY3|Q53395|LPDA|PDHD|SLR1096 DIHYDROLIPOAMIDE DEHYDROGENASE (EC 1.8.1.4) from Synechocystis sp. strain PCC 6803 (474 aa), FASTA scores: opt: 602, E(): 2.3e-31, (31.0% identity in 493 aa overlap); etc. TBparse score is 0.909. Note that previously known as lpdB.; lpdB" /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="YP_177756.1" /db_xref="GI:57116784" /db_xref="GeneID:885076" /translation="MTAAQQDQAPMATPGCREGETYDVVVLGAGPVGQNVADRARAGG LRVAVVERELVGGECSYWACVPSKALLRPVIAISDARRVDGAREAVDGSINTAGVFGR RNRYVAHWDDTGQADWVSGIGATLIRGDGRLDGPRRVVVTKSSGESVALTARHAVVIC TGSRPALPDLPGITEARPWTNRQATDNSTVPDRLAIVGAGGVGVEMATAWQGLGASVT LLARGSGLLPRMEPFVGELIGRGLADAGVDVRVGVSVRALGRPNPTGPVVLELDDGTE LRVDEVLFATGRAPRTDDIGLETIGLTPGSWLDVDDTCRVRAVDDGWLYAAGDVNHRA LLTHQGKYQARIAGTAIGARAAGRPLDTTSWGMHATTADHHAVPQAFFTDPEAAAVGL TADQAAQAGHRIKAIDVEIGDVVMGAKLFADGYTGRARMVVDVDRGHLLGVTMVGPGA AELLHSATVAVAGQVPIDRLWHAVPCFPTISELWLRLLESYRDSFYLLV" repeat_region 889017..889020 /note="4 bp direct repeat: GAGG, at the right end of IS6110" repeat_region 889021..890375 /note="IS6110-1, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-1" repeat_region 889021..889048 /note="28 bp inverted repeat at the left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 889072..889398 /locus_tag="Rv0795" /db_xref="GeneID:885454" CDS 889072..889398 /locus_tag="Rv0795" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv0795, (MTV042.05), len: 108 aa. Putative transposase for IS6110 (fragment), identical to Q50686 INSERTION ELEMENT IS6110 (108 aa), FASTA score: (100.0 % identity in 108 aa overlap). TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transposase IS6110" /protein_id="NP_215310.1" /db_xref="GI:15607935" /db_xref="GeneID:885454" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 889395..890333 /locus_tag="Rv0796" /db_xref="GeneID:885099" CDS <889395..890333 /locus_tag="Rv0796" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv0796, (MTV042.06), len: 312 aa. Putative transposase for IS6110. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transposase IS6110" /protein_id="NP_215311.1" /db_xref="GI:15607936" /db_xref="GeneID:885099" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(890348..890375) /note="28 bp inverted repeat at the right end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" repeat_region 890376..890379 /note="4 bp direct repeat: GAGG, at the left end of IS6110" gene 890388..891482 /locus_tag="Rv0797" /db_xref="GeneID:885476" CDS 890388..891482 /locus_tag="Rv0797" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1547." /note="Rv0797, (MTCI249B.03c, MTV042.07), len 364 aa. Putative transposase for IS1547; almost identical to (but 20 aa shorter than) Y13470|MTY13470_2 from Mycobacterium tuberculosis (383 aa). Also similar to other transposases e.g. MAIS1110A _1|Q48909 transposase from Mycobacterium avium (464 aa), FASTA scores: opt: 226, E(): 2.4e-08, (30.7% identity in 199 aa overlap). Also slight similarity to Rv2014|MTCY39.03c from Mycobacterium tuberculosis (222 aa), FASTA score: (24.8% identity in 141 aa overlap)." /codon_start=1 /transl_table=11 /product="IS1547 transposase" /protein_id="NP_215312.1" /db_xref="GI:15607937" /db_xref="GeneID:885476" /translation="MVVVGTDAHKYSHTFVATDEVGRQLGEKTVKATTAGHATAIMWA REQFGLELIWGIEDCRNMSARLERDLLAAGQQVVRVPTKLMAQTRKSARSRGKSDPID ALAVARAVMRETDLPLATHDETSRELKLLTDRRDVLVAQRTSAINRLRWLVHELDPER APAARSLDAAKHQQALRTWLDTQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQ VAPALLEIPGCAELTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMRLS RSGNRQLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLKRRLARTVFQA LRTVHQPSSEHTQPAAACHRSYCSRSCLSG" repeat_region 890388..891479 /note="IS1547-1, len: 1092 bp. Insertion sequence IS1547." /mobile_element="insertion sequence:IS1547-1" gene complement(891472..892269) /gene="cfp29" /locus_tag="Rv0798c" /db_xref="GeneID:885460" CDS complement(891472..892269) /gene="cfp29" /locus_tag="Rv0798c" /function="UNKNOWN. SUPPOSED RELEASED FROM THE ENVELOPE TO THE OUTSIDE DURING GROWTH." /experiment="experimental evidence, no additional details recorded" /note="Rv0798c, (MTCI429B.02), len: 265 aa. cfp29, 29 kDa antigen (see citations below). Highly similar to Q45296|BLLINM18P_1|CAA63787.1|X93588 linocin M18 from Brevibacterium linens (266 aa), FASTA scores: (58.5% identity in 265 aa overlap). Also shows similarity with NP_228594.1|NC_000853 bacteriocin from Thermotoga maritima (262 aa)." /codon_start=1 /transl_table=11 /product="29 kDa antigen CFP29" /protein_id="NP_215313.1" /db_xref="GI:15607938" /db_xref="GeneID:885460" /translation="MNNLYRDLAPVTEAAWAEIELEAARTFKRHIAGRRVVDVSDPGG PVTAAVSTGRLIDVKAPTNGVIAHLRASKPLVRLRVPFTLSRNEIDDVERGSKDSDWE PVKEAAKKLAFVEDRTIFEGYSAASIEGIRSASSNPALTLPEDPREIPDVISQALSEL RLAGVDGPYSVLLSADVYTKVSETSDHGYPIREHLNRLVDGDIIWAPAIDGAFVLTTR GGDFDLQLGTDVAIGYASHDTDTVRLYLQETLTFLCYTAEASVALSH" gene complement(892266..893273) /locus_tag="Rv0799c" /db_xref="GeneID:885388" CDS complement(892266..893273) /locus_tag="Rv0799c" /function="UNKNOWN" /note="Rv0799c, (MTCY07H7A.10, MTCI429B.01), len: 335 aa. Conserved hypothetical protein, similar to Q50021|U2266C from Mycobacterium leprae (146 aa), FASTA scores: opt: 147, E(): 0.0016, (33.3% identity in 117 aa overlap); Q50020|U2266B from Mycobacterium leprae (27 aa), FASTA scores: opt: 94, E(): 1.3, (56.5% identity in 23 aa overlap). Also highly similar to others e.g. CAC01593.1|AL391041 conserved hypothetical protein from Streptomyces coelicolor (316 aa); AF088897|AF088897_9 hypothetical protein from Zymomonas mobilis (322 aa), FASTA scores: opt: 1132, E(): 0, (56.1% identity in 303 aa overlap); P76536|ECAE000330_8 hypothetical protein from Escherichia coli strain K-12 (308 aa), FASTA scores: E(): 2.2e-30, (37.4% identity in 297 aa overlap); etc. Also similar to some tyrA proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215314.1" /db_xref="GI:15607939" /db_xref="GeneID:885388" /translation="MAVPAVSPQPILAPLTPAAIFLVATIGADGEATVHDALSKISGL VRAIGFRDPTKHLSVVVSIGSDAWDRLFAGPRPTELHPFVELTGPRHTAPATPGDLLF HIRAETMDVCFELAGRILKSMGDAVTVVDEVHGFRFFDNRDLLGFVDGTENPSGPIAI KATTIGDEDRNFAGSCYVHVQKYVHDMASWESLSVTEQERVIGRTKLDDIELDDNAKP ANSHVALNVITDDDGTERKIVRHNMPFGEVGKGEYGTYFIGYSRTPTVTEQMLRNMFL GDPAGNTDRVLDFSTAVTGGLFFSPTIDFLDHPPPLPQAATPTLAAGSLSIGSLKGSP R" gene 893318..894619 /gene="pepC" /locus_tag="Rv0800" /db_xref="GeneID:885461" CDS 893318..894619 /gene="pepC" /locus_tag="Rv0800" /EC_number="3.4.11.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the removal of amino acids from the N termini of peptides" /codon_start=1 /transl_table=11 /product="putative aminopeptidase 2" /protein_id="NP_215315.1" /db_xref="GI:15607940" /db_xref="GeneID:885461" /translation="MAATAHGLCEFIDASPSPFHVCATVAGRLLGAGYRELREADRWP DKPGRYFTVRAGSLVAWNAEQSGHTQVPFRIVGAHTDSPNLRVKQHPDRLVAGWHVVA LQPYGGVWLHSWLDRDLGISGRLSVRDGTGVSHRLVLIDDPILRVPQLAIHLAEDRKS LTLDPQRHINAVWGVGERVESFVGYVAQRAGVAAADVLAADLMTHDLTPSALIGASVN GTASLLSAPRLDNQASCYAGMEALLAVDVDSASSGFVPVLAIFDHEEVGSASGHGAQS DLLSSVLERIVLAAGGTREDFLRRLTTSMLASADMAHATHPNYPDRHEPSHPIEVNAG PVLKVHPNLRYATDGRTAAAFALACQRAGVPMQRYEHRADLPCGSTIGPLAAARTGIP TVDVGAAQLAMHSARELMGAHDVAAYSAALQAFLSAELSEA" gene 894631..894978 /locus_tag="Rv0801" /db_xref="GeneID:885376" CDS 894631..894978 /locus_tag="Rv0801" /function="UNKNOWN" /note="Rv0801, (MTCY07H7A.08c), len: 115 aa. Conserved hypothetical protein, similar to many hypothetical proteins from Streptomyces sp. e.g. SCD840A.20|AB81865.1|AL161691 hypothetical protein from Streptomyces coelicolor (145 aa); AF072709|AF072709_8 from Streptomyces lividans (131 aa), FASTA scores: opt: 120, E(): 0.2, (26.3% identity in 118 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215316.1" /db_xref="GI:15607941" /db_xref="GeneID:885376" /translation="MALKVEMVTFDCSDPAKLAGWWAEQFDGTTRELLPGEFVVVART DGPRLGFQKVPDPAPGKNRVHLDFTTKDLDAEVLRLVAAGASEVGRHQVGESFRWVVL ADPEGNAFCVAGQ" gene complement(894972..895628) /locus_tag="Rv0802c" /db_xref="GeneID:885332" CDS complement(894972..895628) /locus_tag="Rv0802c" /function="UNKNOWN" /note="Rv0802c, (MTCY07H7A.07c), len: 218 aa. Conserved hypothetical protein, showing partial similarity with many acetyltransferases and hypothetical proteins e.g. P96579|BSUB0003_68 PROBABLE ACETYLTRANSFERASE from Bacillus subtilis (183 aa), FASTA scores: E(): 0.0044, (26.4% identity in 110 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215317.1" /db_xref="GI:15607942" /db_xref="GeneID:885332" /translation="MSRHWPLFDLRITTPRLQLQLPTEELCDQLIDTILEGVHDPDRM PFSVPWTRASREDLPFNTLSHLWQQLAGFKRDDWSLPLAVLVDGRAVGVQALSSKDFP ITRQVDSGSWLGLRYQGHGYGTEMRAAVLYFAFAELEAQVATSRSFVDNPASIAVSRR NGYRDNGLDRVAREGAMAEALLFRLTRDDWQRHRTVEVRVDGFDRCRPLFGPLEPPRY" gene 895820..898084 /gene="purL" /locus_tag="Rv0803" /db_xref="GeneID:885358" CDS 895820..898084 /gene="purL" /locus_tag="Rv0803" /EC_number="6.3.5.3" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE FOURTH STEP) [CATALYTIC ACTIVITY: ATP + 5'-PHOSPHORIBOSYLFORMYLGLYCINAMIDE + L-GLUTAMINE + H(2)O = ADP + PHOSPHATE + 5'-PHOSPHORIBOSYLFORMYLGLYCINAMIDINE + L-GLUTAMATE]." /note="catalyzes the formation of 2-(formamido)-N1-(5-phospho-D-ribosyl)acetamidine from N2-formyl-N1-(5-phospho-D-ribosyl)glycinamide and L-glutamine in purine biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosylformylglycinamidine synthase II" /protein_id="NP_215318.1" /db_xref="GI:15607943" /db_xref="GeneID:885358" /translation="MLDTVEHAATTPDQPQPYGELGLKDDEYRRIRQILGRRPTDTEL AMYSVMWSEHCSYKSSKVHLRYFGETTSDEMRAAMLAGIGENAGVVDIGDGWAVTFKV ESHNHPSYVEPYQGAATGVGGIVRDIMAMGARPVAVMDQLRFGAADAPDTRRVLDGVV RGIGGYGNSLGLPNIGGETVFDPCYAGNPLVNALCVGVLRQEDLHLAFASGAGNKIIL FGARTGLDGIGGVSVLASDTFDAEGSRKKLPSVQVGDPFMEKVLIECCLELYAGGLVI GIQDLGGAGLSCATSELASAGDGGMTIQLDSVPLRAKEMTPAEVLCSESQERMCAVVS PKNVDAFLAVCRKWEVLATVIGEVTDGDRLQITWHGETVVDVPPRTVAHEGPVYQRPV ARPDTQDALNADRSAKLSRPVTGDELRATLLALLGSPHLCSRAFITEQYDRYVRGNTV LAEHADGGMLRIDESTGRGIAVSTDASGRYTLLDPYAGAQLALAEAYRNVAVTGATPV AVTNCLNFGSPEDPGVMWQFTQAVRGLADGCADLGIPVTGGNVSFYNQTGSAAILPTP VVGVLGVIDDVRRRIPTGLGAEPGETLMLLGDTRDEFDGSVWAQVTADHLGGLPPVVD LAREKLLAAVLSSASRDGLVSAAHDLSEGGLAQAIVESALAGETGCRIVLPEGADPFV LLFSESAGRVLVAVPRTEESRFRGMCEARGLPAVRIGVVDQGSDAVEVQGLFAVSLAE LRATSEAVLPRYFG" gene 898081..898710 /locus_tag="Rv0804" /db_xref="GeneID:885330" CDS 898081..898710 /locus_tag="Rv0804" /function="UNKNOWN" /note="Rv0804, (MTCY07H7A.05c), len: 209 aa. Conserved hypothetical protein, showing similarity with C-terminus of Rv1863c|MTCY359.10 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (256 aa), FASTA scores: opt: 199, E(): 1.2e-05, (33.2% identity in 220 aa overlap); and Rv0658c. Contains PS01151 Fimbrial biogenesis outer membrane usher protein signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215319.1" /db_xref="GI:15607944" /db_xref="GeneID:885330" /translation="MSRLRALSLAAGLVGWSLVSPRLPAPWRIPLQAGLGSVLVLVTR ATMGLWPPRLWAGLRLGWAAGAAAATAIAATTPVPMVRLSMSARELPASVPVWLVWHI PGGTVWAEEAAFRGALATIGARAFGRSGGRILQAGAFGLSHIADARATGEPLVLTVLA TGIAGWMFGWLADRSGSLAAPLLTHLAINEAGAVAAVLVQRRSGISTRL" misc_feature 898480..898512 /locus_tag="Rv0804" /note="PS01151 Fimbrial biogenesis outer membrane usher protein signature" gene 898831..899787 /locus_tag="Rv0805" /db_xref="GeneID:885326" CDS 898831..899787 /locus_tag="Rv0805" /function="UNKNOWN" /note="Rv0805, (MTCY07H7A.04c), len: 318 aa. Conserved hypothetical protein, equivalent to Q50024 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (317 aa), FASTA scores: opt: 1713, E(): 0, (82.5% identity in 315 aa overlap). Also shows similarity with hypothetical proteins and icc proteins e.g. SC9B1.22c|T35867 hypothetical protein from Streptomyces coelicolor (305 aa); P36650|ICC_ECOLI icc protein from Escherichia coli (275 aa), FASTA scores: opt: 310, E(): 8.9e-14, (31.3% identity in 214 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215320.1" /db_xref="GI:15607945" /db_xref="GeneID:885326" /translation="MHRLRAAEHPRPDYVLLHISDTHLIGGDRRLYGAVDADDRLGEL LEQLNQSGLRPDAIVFTGDLADKGEPAAYRKLRGLVEPFAAQLGAELVWVMGNHDDRA ELRKFLLDEAPSMAPLDRVCMIDGLRIIVLDTSVPGHHHGEIRASQLGWLAEELATPA PDGTILALHHPPIPSVLDMAVTVELRDQAALGRVLRGTDVRAILAGHLHYSTNATFVG IPVSVASATCYTQDLTVAAGGTRGRDGAQGCNLVHVYPDTVVHSVIPLGGGETVGTFV SPGQARRKIAESGIFIEPSRRDSLFKHPPMVLTSSAPRSPVD" gene complement(899732..901330) /gene="cpsY" /locus_tag="Rv0806c" /db_xref="GeneID:885243" CDS complement(899732..901330) /gene="cpsY" /locus_tag="Rv0806c" /EC_number="5.1.3.2" /function="THOUGHT TO BE INVOLVED IN EXOPOLYSACCHARIDE AND/OR LIPOPOLYSACCHARIDE BIOSYNTHETIC PATHWAY [CATALYTIC ACTIVITY: UDP-GLUCOSE = UDP-GALACTOSE]." /note="Rv0806c, (MTCY07H7A.03), len: 532 aa. Possible cpsY, UDP-glucose-4-epimerase (EC 5.1.3.2), equivalent to Q50025|CPSY probable UDP-glucose-4-epimerase from Mycobacterium leprae (542 aa), FASTA scores: opt: 2964, E(): 0, (82.3% identity in 530 aa overlap). Also similar to AAC38286.1|AF019760|SACB CpsY homolog (involved in meningococcal capsule biosynthesis) from Neisseria meningitidis serogroup A (545 aa); Q51151 CAPSULE GENE COMPLEX UPD-GLUCOSE-4-EPIMERASE (GALE) from Neisseria meningitidis (373 aa), FASTA scores: opt: 496, E(): 9.5e-27, (29.3% identity in 358 aa overlap); C-terminus of CAB75373.1|AL139298 putative transferase from Streptomyces coelicolor (942 aa); and many hypothetical proteins from Streptomyces coelicolor. SEEMS TO BELONG TO THE SUGAR EPIMERASE FAMILY." /codon_start=1 /transl_table=11 /product="UDP-glucose-4-epimerase CpsY" /protein_id="NP_215321.1" /db_xref="GI:15607946" /db_xref="GeneID:885243" /translation="MPKISSRDGGRPAQRTVNPIIVTRRGKIARLESGLTPQEAQIED LVFLRKVLNRADIPYLLIRNHKNRPVLAINIELRAGLERALAAACATEPMYAKTIDEP GLSPVLVATDGLSQLVDPRVVRLYRRRIAPGGFRYGPAFGVELQFWVYEETVIRCPVE NSLSRKVLPRNEITPTNVKLYGYKWPTLDGMFAPHASDVVFDIDMVFSWVDGSDPEFR ARRMAQMSQYVVGEGDDAEARIRQIDELKYALRSVNMFAPWIRRIFIATDSTPPPWLA EHPKITIVRAEDHFSDRSALPTYNSHAVESQLHHIPGLSEHFLYSNDDMFFGRPLKAS MFFSPGGVTRFIEAKTRIGLGANNPARSGFENAARVNRQLLFDRFGQVITRHLEHTAV PLRKSVLIEMEREFPEEFARTAASPFRSDTDISVTNSFYHYYALMTGRAVPQEKAKVL YVDTTSYAGLRLLPKLRKHRGYDFFCLNDGSFPEVPAAQRAERVVSFLERYFPIPAPW EKIAADVSRRDFAVPRTSAPSEGA" gene 901635..902024 /locus_tag="Rv0807" /db_xref="GeneID:885272" CDS 901635..902024 /locus_tag="Rv0807" /function="UNKNOWN" /note="Rv0807, (MTCY07H7A.02c), len: 129 aa. Conserved hypothetical protein, equivalent to O05761|MLCB5_31 HYPOTHETICAL 14.0 kDa PROTEIN from Mycobacterium leprae (131 aa), FASTA scores: E(): 0, (73.4% identity in 128 aa overlap). Also highly similar to BAA89438.1|AB003158|ORF3 HYPOTHETICAL PROTEIN from Corynebacterium ammoniagenes (132 aa); and C-terminus of SCD25.20|CAB56364.1|AL118514 hypothetical protein from Streptomyces coelicolor (202 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215322.1" /db_xref="GI:15607947" /db_xref="GeneID:885272" /translation="MSARDRVDPAKTRQVVLALADWLRDETLPAPDTDVLAAAVRLTA RTLAALAPGASVEVRIPPFAAVQCISGPRHTRGTPPNVVQTDPRTWLLVATGLSGVAQ ARGSGALQLSGSRAGEIEAWLPLVDLG" gene 902111..903694 /gene="purF" /locus_tag="Rv0808" /db_xref="GeneID:885085" CDS 902111..903694 /gene="purF" /locus_tag="Rv0808" /EC_number="2.4.2.14" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 5-phospho-beta-D-ribosylamine + diphosphate + L-glutamate = L-glutamine + 5-phospho-alpha-D-ribose 1-diphosphate + H2O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes first step of the de novo purine nucleotide biosynthetic pathway" /codon_start=1 /transl_table=11 /product="amidophosphoribosyltransferase" /protein_id="NP_215323.1" /db_xref="GI:15607948" /db_xref="GeneID:885085" /translation="MAVDSDYVTDRAAGSRQTVTGQQPEQDLNSPREECGVFGVWAPG EDVAKLTYYGLYALQHRGQEAAGIAVADGSQVLVFKDLGLVSQVFDEQTLAAMQGHVA IGHCRYSTTGDTTWENAQPVFRNTAAGTGVALGHNGNLVNAAALAARARDAGLIATRC PAPATTDSDILGALLAHGAADSTLEQAALDLLPTVRGAFCLTFMDENTLYACRDPYGV RPLSLGRLDRGWVVASETAALDIVGASFVRDIEPGELLAIDADGVRSTRFANPTPKGC VFEYVYLARPDSTIAGRSVHAARVEIGRRLARECPVEADLVIGVPESGTPAAVGYAQE SGVPYGQGLMKNAYVGRTFIQPSQTIRQLGIRLKLNPLKEVIRGKRLIVVDDSIVRGN TQRALVRMLREAGAVELHVRIASPPVKWPCFYGIDFPSPAELIANAVENEDEMLEAVR HAIGADTLGYISLRGMVAASEQPTSRLCTACFDGKYPIELPRETALGKNVIEHMLANA ARGAALGELAADDEVPVGR" misc_feature 903251..903289 /gene="purF" /locus_tag="Rv0808" /note="PS00103 Purine/pyrimidine phosphoribosyl transferases signature" gene 903725..904819 /gene="purM" /locus_tag="Rv0809" /db_xref="GeneID:885134" CDS 903725..904819 /gene="purM" /locus_tag="Rv0809" /EC_number="6.3.3.1" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE FIFTH STEP) [CATALYTIC ACTIVITY: ATP + 5'-PHOSPHORIBOSYLFORMYLGLYCINAMIDINE = ADP + PHOSPHATE + 5'-PHOSPHORIBOSYL-5-AMINOIMIDAZOLE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 1-(5-phosphoribosyl)-5-aminoimidazole from 2-(formamido)-N1-(5-phosphoribosyl)acetamidine and ATP in purine biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosylaminoimidazole synthetase" /protein_id="NP_215324.1" /db_xref="GI:15607949" /db_xref="GeneID:885134" /translation="MTDLAKGPGKDPGSRGITYASAGVDIEAGDRAIDLFKPLASKAT RPEVRGGLGGFAGLFTLRGDYREPVLAASSDGVGTKLAIAQAMDKHDTVGLDLVAMVV DDLVVCGAEPLFLLDYIAVGRIVPERLSAIVAGIADGCMRAGCALLGGETAEHPGLIE PDHYDISATGVGVVEADNVLGPDRVKPGDVIIAMGSSGLHSNGYSLVRKVLLEIDRMN LAGHVEEFGRTLGEELLEPTRIYAKDCLALAAETRVRTFCHVTGGGLAGNLQRVIPHG LIAEVDRGTWTPAPVFTMIAQRGRVRRTEMEKTFNMGVGMIAVVAPEDTTRALAVLTA RHLDCWVLGTVCKGGKQGPRAKLVGQHPRF" gene complement(904905..905087) /locus_tag="Rv0810c" /db_xref="GeneID:885410" CDS complement(904905..905087) /locus_tag="Rv0810c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0810c, (MTV043.02c), len: 60 aa. Conserved hypothetical protein, with its N-terminus highly similar to NP_302445.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (62 aa); and AL118514|SCD25_24 hypothetical protein from Streptomyces coelicolor (84 aa), FASTA scores: opt: 180, E(): 5.7e-07, (51.8% identity in 56 aa overlap). TBparse score is 0.876." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215325.1" /db_xref="GI:15607950" /db_xref="GeneID:885410" /translation="MGRGRAKAKQTKVARELKYSSPQTDFQRLQRELSGTGTDRLDGD GPSDDDSWNDEDDWRR" gene complement(905234..906340) /locus_tag="Rv0811c" /db_xref="GeneID:885401" CDS complement(905234..906340) /locus_tag="Rv0811c" /function="UNKNOWN" /note="Rv0811c, (MTV043.03c), len: 368 aa. Conserved hypothetical protein, equivalent to U2266F|U15182|MLU15182_13 HYPOTHETICAL PROTEIN from Mycobacterium leprae (366 aa), FASTA scores: opt: 1870, E(): 0, (77.4% identity in 367 aa overlap). Also highly similar to BAA89441.1|AB003158|ORF4 HYPOTHETICAL PROTEIN from Corynebacterium ammoniagenes (359 aa); and CAB94085.1|AL358692 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (321 aa). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215326.1" /db_xref="GI:15607951" /db_xref="GeneID:885401" /translation="MAAVPAPDPGPDAGAIWHYGDPLGEQRAGQADAVLVDRSHRAVL TLDGGDRQTWLHSISTQHVSDLPEGASTQNLSLDGQGRVEDHWIQTELGGTTYLDTEP WRGEPLLAYLRKMVFWSMVTPRAADMAVLSLLGPRLAEERVLDALGLDVLPAEWLAVP LAGGGIVRRMPDGLAGQIELDVVVKRGDRADWQRRLTQAGVRPAGIWAYEAHRVAHRV PARRPRLGVDTDERTIPHEVGWIGGPGAGAVHLNKGCYRGQETVARVHNLGRPPRMLV LLHLDESVQRPSTGDAVLAGGRTVGRLGTVVEHVELGPVALALLKRGLPGDTALVTGP EAEVAAVIDVDSLPPADDVGAGRRAVERLRGGIR" gene 906423..907292 /locus_tag="Rv0812" /db_xref="GeneID:885397" CDS 906423..907292 /locus_tag="Rv0812" /EC_number="4.1.3.38" /function="ACTS ON AMINO ACIDS." /note="catalyzes the formation of 4-aminobenzoate and pyruvate from 4-amino-4-deoxychorismate" /codon_start=1 /transl_table=11 /product="4-amino-4-deoxychorismate lyase" /protein_id="YP_177757.1" /db_xref="GI:57116785" /db_xref="GeneID:885397" /translation="MVVTLDGEILQPGMPLLHADDLAAVRGDGVFETLLVRDGRACLV EAHLQRLTQSARLMDLPEPDLPRWRRAVEVATQRWVASTADEGALRLIYSRGREGGSA PTAYVMVSPVPARVIGARRDGVSAITLDRGLPADGGDAMPWLIASAKTLSYAVNMAVL RHAARQGAGDVIFVSTDGYVLEGPRSTVVIATDGDQGGGNPCLLTPPPWYPILRGTTQ QALFEVARAKGYDCDYRALRVADLFDSQGIWLVSSMTLAARVHTLDGRRLPRTPIAEV FAELVDAAIVSDR" gene complement(907338..908018) /locus_tag="Rv0813c" /db_xref="GeneID:885395" CDS complement(907338..908018) /locus_tag="Rv0813c" /function="UNKNOWN" /note="Rv0813c, (MTV043.05c), len: 226 aa. Conserved hypothetical protein, highly similar to U15182|MLU15182_16 HYPOTHETICAL PROTEIN from Mycobacterium leprae (242 aa), FASTA scores: opt: 1191, E(): 0, (78.3% identity in 226 aa overlap); and NP_302442.1|NC_002677 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (228 aa). Also similar to AB94083.1|AL358692|SCD66.16 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (191 aa); and Rv2717c|MTCY05A6_37 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (164 aa), FASTA score: (30.4% identity in 171 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215328.1" /db_xref="GI:15607953" /db_xref="GeneID:885395" /translation="MSSGAGSDATGAGGVHAAGSGDRAVAAAVERAKATAARNIPAFD DLPVPADTANLREGADLNNALLALLPLVGVWRGEGEGRGPDGDYRFGQQIVVSHDGGD YLNWESRSWRLTATGDYQEPGLREAGFWRFVADPYDPSESQAIELLLAHSAGYVELFY GRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDLAYVEERVDADGGLVPHLS ARLSRFVG" gene complement(908181..908483) /gene="sseC2" /locus_tag="Rv0814c" /db_xref="GeneID:885435" CDS complement(908181..908483) /gene="sseC2" /locus_tag="Rv0814c" /function="THOUGHT TO BE INVOLVED IN SULPHUR METABOLISM." /note="Rv0814c, (MTV043.06c, O05794), len: 100 aa. sseC2, conserved hypothetical protein, highly similar to AAA62972.1|U15182|MLU15182_17 hypothetical protein from Mycobacterium leprae (143 aa), FASTA scores: opt: 545, E(): 0, (84.0% identity in 100 aa overlap); and NP_302441.1|NC_002677|Z95150|MTCY164_29 conserved hypothetical protein from Mycobacterium leprae (100 aa), FASTA scores: opt: 647, E(): 0, (100.0% identity in 100 aa overlap). Also highly similar to M29612|SERCYSA_5 rhodanese-like protein from Saccharopolyspora erythraea (101 aa), FASTA scores: opt: 345, E(): 1.2e-18, (57.1% identity in 98 aa overlap); and similar at the C-terminus to the C-terminus of CAB94069.1|AL358692 conserved hypothetical protein from Streptomyces coelicolor (95 aa). Identical second copy present as Rv3118|MTCY164.28|SSEC1 from Mycobacterium tuberculosis (100 aa) (100.0% identity in 100 aa overlap). TBparse score is 0.853." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215329.1" /db_xref="GI:15607954" /db_xref="GeneID:885435" /translation="MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLD SSDEFTAEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT" gene complement(908485..909318) /gene="cysA2" /locus_tag="Rv0815c" /db_xref="GeneID:885449" CDS complement(908485..909318) /gene="cysA2" /locus_tag="Rv0815c" /EC_number="2.8.1.1" /function="MAY BE A SULFOTRANSFERASE INVOLVED IN THE FORMATION OF THIOSULFATE [CATALYTIC ACTIVITY: THIOSULFATE + CYANIDE = SULFITE + THIOCYANATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv0815c, (MTV043.07c, MT0837, O05793), len: 277 aa. Probable cysA2 (alternate gene name: sseC4), thiosulfate sulfurtransferase (EC 2.8.1.1) (see Wooff et al., 2002), equivalent to Q50036|CYSA|CYSA3|ML2198|THTR_MYCLE PUTATIVE SULFURTRANSFERASE THIOSULFATE from Mycobacterium leprae (277 aa). Also highly similar to other putative thiosulfate sulfurtransferases e.g. P16385|THTR_SACER PUTATIVE THIOSULFATE SULFURTRANSFERASE from Saccharopolyspora erythraea (Streptomyces erythraeus) (281 aa); NP_293941.1|NC_001263 thiosulfate sulfurtransferase from Deinococcus radiodurans (286 aa); etc. Identical second copy present as Rv3117|MTCY164.27|MT3199|O05793|cysA3 (277 aa) (100.0% identity in 277 aa overlap). Contains PS00683 Rhodanese C-terminal signature at C-terminus. BELONGS TO THE RHODANESE FAMILY. TBparse score is 0.901.; sseC4" /codon_start=1 /transl_table=11 /product="thiosulfate sulfurtransferase CysA2" /protein_id="NP_215330.1" /db_xref="GI:15607955" /db_xref="GeneID:885449" /translation="MARCDVLVSADWAESNLHAPKVVFVEVDEDTSAYDRDHIAGAIK LDWRTDLQDPVKRDFVDAQQFSKLLSERGIANEDTVILYGGNNNWFAAYAYWYFKLYG HEKVKLLDGGRKKWELDGRPLSSDPVSRPVTSYTASPPDNTIRAFRDEVLAAINVKNL IDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSRAANEDGTFKSDEELAKLY ADAGLDNSKETIAYCRIGERSSHTWFVLRELLGHQNVKNYDGSWTEYGSLVGAPIELG S" misc_feature complement(908524..908547) /gene="cysA2" /locus_tag="Rv0815c" /note="PS00683 Rhodanese C-terminal signature" gene complement(909611..910033) /gene="thiX" /locus_tag="Rv0816c" /db_xref="GeneID:885087" CDS complement(909611..910033) /gene="thiX" /locus_tag="Rv0816c" /function="THIOREDOXIN PARTICIPATES IN VARIOUS REDOX REACTIONS THROUGH THE REVERSIBLE OXIDATION OF ITS ACTIVE CENTER DITHIOL, TO A DISULFIDE, & CATALYZES DITHIOL-DISULFIDE EXCHANGE REACTIONS." /note="Rv0816c, (MTV043.08c), len: 140 aa. Probable thiX, thioredoxin (EC 1.-.-.-), equivalent to ThiX|U15182|MLU15182_21 thioredoxin from Mycobacterium leprae (172 aa), FASTA scores: opt: 556, E(): 8.8e-31, (63.8% identity in 141 aa overlap); and similar to AAL08576.1|AF418548_2|AF418548 thioredoxin from Mycobacterium avium subsp. paratuberculosis (117 aa). Also similar to other bacterial thioredoxins e.g. CAB95303.1|AL359779 putative thioredoxin from Streptomyces coelicolor (126 aa); P33791|THIO_STRAU|TRX|TRXA THIOREDOXIN from Streptomyces aureofaciens (106 aa); etc. And similar to Rv3914|MT4033|MTV028.05|NP_218431.1|NC_000962|trxC THIOREDOXIN (TRX) (MPT46) from Mycobacterium tuberculosis (116 aa). Has hydrophobic stretch at N-terminus. SEEMS TO BELONG TO THE THIOREDOXIN FAMILY. TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="thioredoxin ThiX" /protein_id="NP_215331.1" /db_xref="GI:15607956" /db_xref="GeneID:885087" /translation="MTTMIVASVATGALATIARWLLTRRSVILREVGPETTPAAPART AELGLSGAGPTVVHFRAPGCAPCDRVRRGVGDVCADLGDVAHIEVDLDSNPQAARRFS VLSLPTTLIFDVDGRQRYRTSGVPKAADLRSALKPLLA" gene complement(910030..910842) /locus_tag="Rv0817c" /db_xref="GeneID:885440" CDS complement(910030..910842) /locus_tag="Rv0817c" /function="UNKNOWN" /note="Rv0817c, (MTV043.09c), len: 270 aa. Probable conserved exported protein, with N-terminal signal sequence, equivalent (but shorter 13 aa) to U15182|MLU15182_22|U2266M probable exported protein from Mycobacterium leprae (283 aa), FASTA scores: opt: 1287, E(): 0, (73.0% identity in 270 aa overlap). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215332.1" /db_xref="GI:15607957" /db_xref="GeneID:885440" /translation="MPMRKVLVGVTGAAIVVAVLIVGAVGADFGASIYAEYRLSTTVR KAANLRSDPFVAILRFPFIPQAMREHYAELEIKAFAVEHAGSGTATLEATMHSIDLSY ASWLIRPDAKLPVGELESRIIIDSMHLGRYLGISDLMVAAPRQESNDATGGTTESGIS GSRGLVFSGTPISANFAHRVSVLVDLSVASDDRATLVITPTAVVTGPDTADQPVPDDK RDAVLHAFASKLPNQKLPFGVVPNTVGARGSDVIIEGITRGVTISLDEFKQS" gene 910972..911739 /locus_tag="Rv0818" /db_xref="GeneID:885144" CDS 910972..911739 /locus_tag="Rv0818" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0818, (MTV043.10), len: 255 aa. Probable transcriptional regulatory protein, highly similar to Q05943|GLNR_STRCO|L03213|STMGLNR_1|SCD84.26c TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (267 aa), FASTA scores: opt: 945, E(): 0, (61.5 identity in 239 aa overlap); and similar to others from other organisms. Also similar to Rv2884|MTCY274.15|Z74024 from Mycobacterium tuberculosis (252 aa), FASTA scores: opt: 662, E(): 0, (47.8% identity in 226 aa overlap). TBparse score is 0.889." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215333.1" /db_xref="GI:15607958" /db_xref="GeneID:885144" /translation="MLELLLLTSELYPDPVLPALSLLPHTVRTAPAEASSLLEAGNAD AVLVDARNDLSSGRGLCRLLSSTGRSIPVLAVVSEGGLVAVSADWGLDEILLLSTGPA EIDARLRLVVGRRGDLADQESLGKVSLGELVIDEGTYTARLRGRPLDLTYKEFELLKY LAQHAGRVFTRAQLLHEVWGYDFFGGTRTVDVHVRRLRAKLGPEHEALIGTVRNVGYK AVRPARGRPPAADPDDEDADPGRDGMQEPLVDPLRSQ" gene 911736..912683 /locus_tag="Rv0819" /db_xref="GeneID:885251" CDS 911736..912683 /locus_tag="Rv0819" /function="UNKNOWN" /note="Rv0819, (MTV043.11), len: 315 aa. Conserved hypothetical protein, equivalent to U2266N|U15182|MLU15182_24 HYPOTHETICAL PROTEIN from Mycobacterium leprae (312 aa), FASTA scores: opt: 1540, E(): 0, (75.2% identity in 314 aa overlap). Also highly similar to CAB88484.1|AL353816 putative acetyltransferase from Streptomyces coelicolor (309 aa). TBparse score is 0.893" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215334.1" /db_xref="GI:15607959" /db_xref="GeneID:885251" /translation="MTALDWRSALTADEQRSVRALVTATTAVDGVAPVGEQVLRELGQ QRTEHLLVAGSRPGGPIIGYLNLSPPRGAGGAMAELVVHPQSRRRGIGTAMARAALAK TAGRNQFWAHGTLDPARATASALGLVGVRELIQMRRPLRDIPEPTIPDGVVIRTYAGT SDDAELLRVNNAAFAGHPEQGGWTAVQLAERRGEAWFDPDGLILAFGDSPRERPGRLL GFHWTKVHPDHPGLGEVYVLGVDPAAQRRGLGQMLTSIGIVSLARRLGGRKTLDPAVE PAVLLYVESDNVAAVRTYQSLGFTTYSVDTAYALAGTDN" gene 912726..913502 /gene="phoT" /locus_tag="Rv0820" /db_xref="GeneID:885136" CDS 912726..913502 /gene="phoT" /locus_tag="Rv0820" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT); RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM. THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /note="ATP-binding protein; PstABCS is an ATP dependent phosphate uptake system which is responsible for inorganic phosphate uptake during phosphate starvation" /codon_start=1 /transl_table=11 /product="phosphate transporter ATP-binding protein" /protein_id="NP_215335.1" /db_xref="GI:15607960" /db_xref="GeneID:885136" /translation="MAKRLDLTDVNIYYGSFHAVADVSLAILPRSVTAFIGPSGCGKT TVLRTLNRMHEVIPGARVEGAVLLDDQDIYAPGIDPVGVRRAIGMVFQRPNPFPAMSI RNNVVAGLKLQGVRNRKVLDDTAESSLRGANLWDEVKDRLDKPGGGLSGGQQQRLCIA RAIAVQPDVLLMDEPCSSLDPISTMAIEDLISELKQQYTIVIVTHNMQQAARVSDQTA FFNLEAVGKPGRLVEIASTEKIFSNPNQKATEDYISGRFG" misc_feature 912834..912857 /gene="phoT" /locus_tag="Rv0820" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 913170..913214 /gene="phoT" /locus_tag="Rv0820" /note="PS00211 ABC transporters family signature" gene complement(913558..914199) /gene="phoY2" /locus_tag="Rv0821c" /db_xref="GeneID:885270" CDS complement(913558..914199) /gene="phoY2" /locus_tag="Rv0821c" /function="INVOLVED IN TRANSCRIPTIONAL REGULATION OF ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv0821c, (MTV043.13c), len: 213 aa. Probable phoY2, phosphate-transport system regulatory protein, highly similar to PhoY|MLU15182_29|U15182 phosphate transport system regulator from Mycobacterium leprae (222 aa), FASTA scores: opt: 1268, E(): 0, (93.0% identity in 213 aa overlap). Also similar to others e.g. NP_384620.1|NC_003047 PROBABLE PHOSPHATE TRANSPORT SYSTEM TRANSCRIPTIONAL REGULATOR PROTEIN from Sinorhizobium meliloti (237 aa); etc. Also highly similar to MTCI418A.03c|Z96070|PhoY1 PROBABLE PHOSPHATE TRANSPORT SYSTEM TRANSCRIPTIONAL REGULATOR PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 937, E(): 0, (63.4% identity in 213 aa overlap). BELONGS TO THE PHOU FAMILY. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="phosphate transport regulator" /protein_id="NP_215336.1" /db_xref="GI:15607961" /db_xref="GeneID:885270" /translation="MRTAYHEQLSELSERLGEMCGLAGIAMERATQALLQADLVLAEQ VISDHEKIATLSARAEESAFVLLALQAPVAGDLRAIVSAIQMVADIDRMGALALHVAK IARRRHPQHALPEEVNGYFAEMGRVAVELGNSAQEVVLSHDPEKAAQIREEDDAMDDL HRHLFTVLMDREWKHGVAAAVDVTLLSRFYERFADHAVEVARRVIFQATGAFP" gene complement(914257..916311) /locus_tag="Rv0822c" /db_xref="GeneID:885374" CDS complement(914257..916311) /locus_tag="Rv0822c" /function="UNKNOWN" /note="Rv0822c, (MTV043.14c), len: 684 aa. Conserved hypothetical protein, highly similar in the region between aa 370 - 580 to U2266O|U15182|MLU15182_30 HYPOTHETICAL PROTEIN from Mycobacterium leprae (222 aa), FASTA scores: opt: 819, E(): 0, (60.6% identity in 221 aa overlap). More extended similarity to Rv3267|Z92771|MTCY71_7 from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 434, E(): 2.2e-17, (26.6% identity in 541 aa overlap), and Rv3484. Also similar to various proteins, preferiously putative membrane proteins and membrane-bound regulatory proteins e.g. CAC44512.1|AL596138 putative membrane protein from Streptomyces coelicolor (524 aa); U56901|BSU56901_1 regulatory protein from Bacillus subtilis (391 aa), FASTA scores: opt: 225, E(): 1.3e-05, (24.7% identity in 340 aa overlap). Contains hydrophobic stretch (aa 160-195) and PS00041 Bacterial regulatory proteins, araC family signature. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215337.1" /db_xref="GI:15607962" /db_xref="GeneID:885374" /translation="MSDGESAAPWARLSESAFPDGVDRWITVPPATWVAAQGPRDTQN VGCHATGAVSVADLIARLGPAFPDLPTHRHVAPEPEPSGRGPKVHDDADDQQDTEAIA IPAHSLEFLSELPDLRAANYPRADHARREPELPGKQLTGSARVRPLRIRRTSPAPAKP APNSGRRPMVLAARSLAALFAALALALTGGAWQWSASKNSRLNMVSALDPHSGDIVNP SGQHGDENFLLVGMDSRAGANANIGAGDAEDAGGARSDTVMLVNIPASRERVVAVSFP RDLAITPIQCEAWNPETGKYGPIYDEKTGTMGPRLVYTETKLNSAFSFGGPKCLVKVI QKLSGLSINRFIAIDFVGFARMVEALGGVEVCSTTPLRDYELGTVLEHAGRQVIDGPT ALNYVRARQVTTESNGDYGRIKRQQLFLSSLLRSMISTDTLFNLSRLNNVVNMFIGNS YVDNVKTKDLVELGRSLQHMAAGHVTFVTVPTGITDQNGDEPPRTSDMKALFTAIIDD DPLPLENDHNAQRLGNTPSTPPTTTKKAPQAGLTNEIQHQQVTTTSPKEVTVQVSNST GQAGLATTATDQLKRNGFNVMAPDDYPSSLLATTVFFSPGNEQAAATVAAVFGQSKIE RVTGIGQLVQVVLGQDFSAVRAPLPSGSTVSVQISRNSSSPPTKLPEDLTVTNAADTT CE" misc_feature complement(914365..914487) /locus_tag="Rv0822c" /note="PS00041 Bacterial regulatory proteins, araC family signature" gene complement(916477..917646) /locus_tag="Rv0823c" /db_xref="GeneID:885380" CDS complement(916477..917646) /locus_tag="Rv0823c" /function="THOUGHT TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0823c, (MTV043.15c), len: 389 aa. Possible transcriptional regulator (resembles nitrogen regulation protein), equivalent (but longer 24 aa in N-terminus) to MLU15182_31|U15182|NtrB NtrB protein from Mycobacterium leprae (384 aa), FASTA scores: opt: 2070, E(): 0, (82.3% identity in 384 aa overlap) (see citation below). Also highly similar to CAB63312.1|AL133471|SCC82.03c hypothetical protein from Streptomyces coelicolor (406 aa); and to many transcriptional regulators members of UPF0034 FAMILY (NIFR3/SMM1) e.g. D26185|BAC180K_143 protein similar to transcriptional regulator (nitrogen regulation protein) from Bacillus subtilis (333 aa), FASTA scores: opt: 609, E(): 1.4e-32, (38.3% identity in 326 aa overlap); NP_349795.1|NC_003030 NifR3 family enzyme from Clostridium acetobutylicum (321 aa); etc. Contains PS01136 Uncharacterized protein family UPF0034 signature. TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215338.1" /db_xref="GI:15607963" /db_xref="GeneID:885380" /translation="MSRRRAIQPSPALRIGPIELASPVVLAPMAGVTNVAFRALCRQL EQSKVGTVSGLYVCEMVTARALIERHPVTMHMTTFSADESPRSLQLYTVDPDTTYAAA RMIAGEGLADHIDMNFGCPVPKVTKRGGGAALPFKRRLFGQIVAAAVRATEGTDIPVT VKFRIGIDDAHHTHLDAGRIAEAEGAAAVALHARTAAQRYSGTADWEQIARLKQHVRT IPVLGNGDIYDAGDALAMMSTTGCDGVVIGRGCLGRPWLFAELSAAFTGSPAPTPPTL GEVADIIRRHGTLLAAHFGEDKGMRDIRKHIAWYLHGFPAGSALRRALAMVKTFDELD CLLDRLDGTVPFPDSATGARGRQGSPARVALPDGWLTDPDDCRVPEGADAMGSGG" misc_feature complement(917251..917307) /locus_tag="Rv0823c" /note="PS01136 Uncharacterized protein family UPF0034 signature" gene complement(917734..918750) /gene="desA1" /locus_tag="Rv0824c" /db_xref="GeneID:885444" CDS complement(917734..918750) /gene="desA1" /locus_tag="Rv0824c" /EC_number="1.14.19.2" /function="CATALYZES THE PRINCIPAL CONVERSION OF SATURATED FATTY ACIDS TO UNSATURATED FATTY ACIDS. THOUGHT TO CONVERT STEAROYL-ACP TO OLEOYL-ACP BY INTRODUCTION OF A CIS DOUBLE BOND BETWEEN CARBONS DELTA-9 AND DELTA-10 OF THE ACYL CHAIN [CATALYTIC ACTIVITY: Stearoyl-[acyl-carrier protein] + AH2 + O2 = oleoyl-[acyl-carrier protein] + A + 2 H2O]." /experiment="experimental evidence, no additional details recorded" /note="Rv0824c, (MTV043.16c), len: 338 aa. Probable desA1 (alternate gene name: des), acyl-[acyl-carrier protein] desaturase (stearoyl-ACP desaturase) (EC 1.14.99.6) (see Jackson et al., 1997), equivalent to U15182|MLU15182_32 acyl-[ACP] desaturase from Mycobacterium leprae (338 aa), FASTA scores: opt: 1880, E(): 0, (79.9% identity in 338 aa overlap); and highly similar in part to fragment CAB96061.1|AJ250019 Steroyl-ACP-desaturase from Mycobacterium avium subsp. paratuberculosis (93 aa). Also similar to other fatty acid desaturases e.g. T35035 probable acyl-[acyl-carrier protein] desaturase from Streptomyces coelicolor (328 aa); Q40731|STAD_ORYSA ACYL-[ACYL-CARRIER PROTEIN] DESATURASE PRECURSOR from Oryza sativa (Rice) (390 aa); etc. Also highly similar to desA2|Rv1094 from Mycobacterium tuberculosis (275 aa). Contains PS00225 Crystallins beta and gamma 'Greek key' motif signature. BELONGS TO THE FATTY ACID DESATURASE FAMILY. COFACTOR: FERREDOXIN, FERREDOXIN NADPH REDUCTASE, AND NADPH. TBparse score is 0.898.; des" /codon_start=1 /transl_table=11 /product="acyl-[acyl-carrier protein] desaturase" /protein_id="YP_177758.1" /db_xref="GI:57116786" /db_xref="GeneID:885444" /translation="MSAKLTDLQLLHELEPVVEKYLNRHLSMHKPWNPHDYIPWSDGK NYYALGGQDWDPDQSKLSDVAQVAMVQNLVTEDNLPSYHREIAMNMGMDGAWGQWVNR WTAEENRHGIALRDYLVVTRSVDPVELEKLRLEVVNRGFSPGQNHQGHYFAESLTDSV LYVSFQELATRISHRNTGKACNDPVADQLMAKISADENLHMIFYRDVSEAAFDLVPNQ AMKSLHLILSHFQMPGFQVPEFRRKAVVIAVGGVYDPRIHLDEVVMPVLKKWRIFERE DFTGEGAKLRDELALVIKDLELACDKFEVSKQRQLDREARTGKKVSAHELHKTAGKLA MSRR" misc_feature complement(917896..917943) /gene="desA1" /locus_tag="Rv0824c" /note="PS00225 Crystallins beta and gamma 'Greek key' motif signature" gene complement(918912..919553) /locus_tag="Rv0825c" /db_xref="GeneID:885354" CDS complement(918912..919553) /locus_tag="Rv0825c" /function="UNKNOWN" /note="Rv0825c, (MTV043.17c), len: 213 aa. Conserved hypothetical protein, highly similar, but in part (between aa 43-96) to fadD27|Rv0275c|MTV035.03 PUTATIVE FATTY-ACID-CoA LIGASE from Mycobacterium tuberculosis (241 aa), FASTA scores: E(): 7.3e-09, (32.6% identity in 190 aa overlap). Also shows similarity with other proteins from Mycobacterium tuberculosis e.g. Rv0078|AL0214|MTV030_22 (201 aa), FASTA scores: opt:118, E(): 0.32, (34.5% identity in 113 aa overlap); etc. TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215340.1" /db_xref="GI:15607965" /db_xref="GeneID:885354" /translation="MQTGQNRGRWSGVPLESRHALRRDNLVAAGVQLLGGAGGPALTV RAVCRHAGLTERYFYESFADREHFVRAVYDDVCTRAMATLTSAQTPREAVEQFVELMV DDPVRGRVLLLAPAVEPALTRSGAEWMPNFIELLQRKLSRIVDPVLQKLVATSLIGAL TGLFTAYLNGRLGATRKQFIDYCVNMLLSTAATYAPHRERGESEHSIPAGPHN" gene 919634..920689 /locus_tag="Rv0826" /db_xref="GeneID:885360" CDS 919634..920689 /locus_tag="Rv0826" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0826, (MTV043.18), len: 351 aa. Conserved hypothetical protein, similar to CAB94053.1|AL358672|SC7A12.06 hypothetical protein from Streptomyces coelicolor (300 aa); and NP_421372.1|NC_002696 hypothetical protein from Caulobacter crescentus (299 aa). Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv1645c|Z85982|MTCY06H11.09 (351 aa), FASTA scores: opt: 1199, E(): 0, (57.5% identity in 299 aa overlap); Rv2237; Rv0276; etc. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215341.1" /db_xref="GI:15607966" /db_xref="GeneID:885360" /translation="MTQDTSATCPLTSTVQDSSPVAGQLGRPIGFRGLAGGCPVSPLG YESPPLPLGPDSLTWRYFGDWRGMLQGPWAGSMQNMHPQLGAAVEDHSTFFRERWPRL LRSLYPIGGVVFDGDRAPVTGVQVRDYHITIKGVDGAGRRYHALNPDVFYWAHATFFV GTLHVAERFCGGLTEAQRRQLFDEHVQWYRMYGMSMRPVPATWEEFQDYWDHMCRNVL ENNFAARAVLDLTELPKPPFAQRVPDWLWAAPRKLLARFFVWLTVGLYDPPVRELMGY RWLRRDEWLHRRFGDIVRLVFALVPFRFRKHPRARAGWDRATGRIPADAPLVQTPARN LPPPDERDNPTHYCPKV" gene complement(920741..921133) /locus_tag="Rv0827c" /db_xref="GeneID:885375" CDS complement(920741..921133) /locus_tag="Rv0827c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0827c, (MTV043.19c), len: 130 aa. Probable transcriptional regulator, similar to many e.g. CAC42856.1|AL592292 putative regulatory protein from Streptomyces coelicolor (115 aa); NP_301626.1|NC_002677 putative ArsR-family transcriptional regulator from Mycobacterium leprae (140 aa); BSUB0011_75|O31844|Z99114 YOZA PROTEIN from Bacillus subtilis (107 aa), FASTA scores: opt: 208, E(): 3.2e-08, (35.5% identity in 93 aa overlap); etc. Also similar to MTCY27.22c|Z95208 from Mycobacterium tuberculosis (135 aa), FASTA scores: opt: 201, E(): 1.2e-07, (35.7% identity in 98 aa overlap). Contains probable helix-turn helix motif from aa 42-63 (Score 1300, +3.61 SD). BELONGS TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215342.1" /db_xref="GI:15607967" /db_xref="GeneID:885375" /translation="MYADSGPDPLPDDQVCLVVEVFRMLADATRVQVLWSLADREMSV NELAEQVGKPAPSVSQHLAKLRMARLVRTRRDGTTIFYRLENEHVRQLVIDAVFNAEH AGPGIPRHHRAAGGLQSVAKASATKDVG" gene complement(921191..921613) /locus_tag="Rv0828c" /db_xref="GeneID:885265" CDS complement(921191..921613) /locus_tag="Rv0828c" /EC_number="3.5.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN DEAMINATION OF SPECIFIC SUBSTRATE." /note="Rv0828c, (MTV043.20c), len: 140 aa. Possible deaminase (EC 3.5.-.-), with its N-terminus highly similar to middle part of NP_302602.1|NC_002677 possible cytidine/deoxycytidylate deaminase from Mycobacterium leprae (171 aa). Also similar to other deaminases e.g. CAC18715.2|AL451182 putative deaminase from Streptomyces coelicolor (167 aa); NP_251189.1|NC_002516 probable deaminase from Pseudomonas aeruginosa (151 aa); NP_108387.1|NC_002678 nitrogen fixation protein gene from Mesorhizobium loti (149 aa); etc. Also similar to many conserved hypothetical proteins e.g. NP_389200.1|NC_000964 hypothetical protein from Bacillus subtilis (156 aa), FASTA scores: E(): 1.3e-07, (38.9% identity in 95 aa overlap); etc. And similar to Rv3752c possible deaminase from Mycobacterium tuberculosis. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY." /codon_start=1 /transl_table=11 /product="deaminase" /protein_id="NP_215343.1" /db_xref="GI:15607968" /db_xref="GeneID:885265" /translation="MPAGMAGFRRWAQTNDPTAHAESLAIRAACTKLGTEHLVGTTLN VLAHPCPMCYGSLYYCSPDEVVFLTSRDAYEPHYVDDRRYFEPATFYDEFAKEWQDRR LPMRQEHRPDIRAGAVDVYRFRQEPNGGERSAIAAPTG" misc_feature complement(921443..921556) /locus_tag="Rv0828c" /note="PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature" gene 921575..921865 /locus_tag="Rv0829" /db_xref="GeneID:885403" CDS 921575..921865 /locus_tag="Rv0829" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1605'." /note="Rv0829, (MTV043.21), len: 96 aa. Possible transposase for IS1605' (fragment), similar to C-terminal end of many mycobacterial transposases and hypothetical proteins e.g. Z74024|MTCY274_16 from Mycobacterium tuberculosis (460 aa), FASTA scores: opt: 668, E(): 6.2e-32, (98.9% identity in 93 aa overlap); MTV002_57|O33333 TRANSPOSASE from Mycobacterium tuberculosis ; L07627|SERRY1_1 insertion element IS1136 from Saccharopolyspora erythraea (90 aa), FASTA score: (34.9% identity in 83 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215344.1" /db_xref="GI:15607969" /db_xref="GeneID:885403" /translation="MGPSSKTCHACRHVQDIGWDEKWQCDGCSITHQRDDNAAINLAR YEEPPSVVGPVGAAVKRGADRKTGPGPAGGREARKATGHPAGEQPRDGVLVA" repeat_region 921575..921862 /note="IS1605', len: 288 bp. Insertion sequence IS1605'." /mobile_element="insertion sequence:IS1605'" gene 921970..922875 /locus_tag="Rv0830" /db_xref="GeneID:885886" CDS 921970..922875 /locus_tag="Rv0830" /function="UNKNOWN" /note="Rv0830, (MTV043.22), len: 301 aa. Conserved hypothetical protein, member of Mycobacterium tuberculosis protein family consisting of the proteins Rv0726c, Rv0731c, Rv3399, Rv1729c|Z81360|MTCY4C12_14c (312 aa), FASTA scores: opt: 1014, E(): 0, (54.1% identity in 292 aa overlap); etc. TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215345.1" /db_xref="GI:15607970" /db_xref="GeneID:885886" /translation="MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAP LVRAVGMDVYTRLVDWQIPVEGDSEFDPQRMATGMACRTRFFDQFFLDATHSGIGQFV ILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATERRTVAVDLRDD WATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNITALSAPGSRLAFEFVPDTA IFADERWRNYHNRMSELGFDIDLNELVYHGQRGHVLDYLTRDGWQTSALTVTQLYEAN GFAYPDDELATAFADLTYSSATLMR" gene complement(922894..923709) /locus_tag="Rv0831c" /db_xref="GeneID:885349" CDS complement(922894..923709) /locus_tag="Rv0831c" /function="UNKNOWN" /note="Rv0831c, (MTV043.23c), len: 271 aa. Conserved hypothetical protein, similar to Rv0347|MTY13E10_7|Z95324 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (328 aa), FASTA scores: opt: 426, E(): 2.6e-21, (33.6% identity in 262 aa overlap). TBparse score is 0.939." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215346.1" /db_xref="GI:15607971" /db_xref="GeneID:885349" /translation="MLPETNQDEVQPNAPVALVTVEIRHPTTDSLTESANRELKHLLI NDLPIERQAQDVSWGMTAPGGAPTPVADRFVRYVNRDNTTAASLKNQAIVVETTAYRS FEAFTDVVMRVVDARAQVSSIVGLERIGLRFVLEIRVPAGVDGRITWSNWIDEQLLGP QRFTPGGLVLTEWQGAAVYRELQPGKSLIVRYGPGMGQALDPNYHLRRITPAQTGPFF LLDIDSFWTPSGGSIPEYNRDALVSTFQDLYGPAQVVFQEMITSRLKDELLRQ" gene complement(923803..923875) /locus_tag="Rvnt10" /note="tRNA-Lys(TTT)" /db_xref="GeneID:2700435" tRNA complement(923803..923875) /locus_tag="Rvnt10" /product="tRNA-Lys" /note="codon recognized: AAA" /anticodon=(pos:923840..923842,aa:Lys) /db_xref="GeneID:2700435" gene 923999..924072 /locus_tag="Rvnt11" /note="tRNA-Glu(TTC)" /db_xref="GeneID:2700445" tRNA 923999..924072 /locus_tag="Rvnt11" /product="tRNA-Glu" /note="codon recognized: GAA" /anticodon=(pos:924034..924036,aa:Glu) /db_xref="GeneID:2700445" gene 924110..924183 /locus_tag="Rvnt12" /note="tRNA-Asp(GTC)" /db_xref="GeneID:2700454" tRNA 924110..924183 /locus_tag="Rvnt12" /product="tRNA-Asp" /note="codon recognized: GAC" /anticodon=(pos:924144..924146,aa:Asp) /db_xref="GeneID:2700454" gene 924213..924286 /locus_tag="Rvnt13" /note="tRNA-Phe(GAA)" /db_xref="GeneID:2700446" tRNA 924213..924286 /locus_tag="Rvnt13" /product="tRNA-Phe" /note="codon recognized: UUC" /anticodon=(pos:924247..924249,aa:Phe) /db_xref="GeneID:2700446" gene 924951..925364 /gene="PE_PGRS12" /locus_tag="Rv0832" /db_xref="GeneID:885236" CDS 924951..925364 /gene="PE_PGRS12" /locus_tag="Rv0832" /function="UNKNOWN" /note="Rv0832, (MTV043.24), len: 137 aa. Member of the Mycobacterium tuberculosis PE family, possibly PGRS subfamily of gly-rich proteins (see citation below), highly similar to many others e.g. MTCY1A11.25c|Z78020 (498 aa), FASTA scores: opt: 529, E(): 5.2e-22, (61.8% identity in 136 aa overlap); etc. Appears to have incurred frameshift as next ORF should be continuation; sequence has been checked but no error found. TBparse score is 0.875." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177759.1" /db_xref="GI:57116787" /db_xref="GeneID:885236" /translation="MSYVSVLPATLATAATEVARIGSALSLASAVAAAQTSAVQAAAA DEVSAAIAALFSAHGRDFQALSARAAAFHHEFVQALAAGAGSYAVAEIAAASPLQSLI DVFNAPIQAATGRPLIGNGANGQPGTGAPGGPAGG" gene 925361..927610 /gene="PE_PGRS13" /locus_tag="Rv0833" /db_xref="GeneID:885391" CDS 925361..927610 /gene="PE_PGRS13" /locus_tag="Rv0833" /function="UNKNOWN" /note="Rv0833, (MTV043.25), len: 749 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), but lacking N-terminal domain (present in preceding ORF), possibly due to frameshift. Similar in part to many others e.g. MTCY28_25|Z95890 (914 aa), FASTA scores: opt: 2726, E(): 0, (60.1% identity in 776 aa overlap); etc. TBparse score is 0.859." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177760.1" /db_xref="GI:57116788" /db_xref="GeneID:885391" /translation="MIGNGGAGGSGAPGAIGGAGGPAGLIGVGGAGGAGGDSAVAGVI GGAGGAGGAALLFGAGGAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFASTG TGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGGTGGAGGLFASGGAGGAGGSGGT GGAGGTGGAGGLFGAGGAGGLGGQGNHTGGHGGAGGSAGLLALGDGGAGGAGGAATTG TGGAGGAGGKAGLLFGSGGAGGSGGAAGTFGDTGNSGGAGGAGGKAGLLFGSGGAGGS GGAGGFANGSTGGAGGAGGGAGLIGNGGNGGSGGTSVATGGAGNGGAGGAGGGAGLIG NGGNGGSGGMGDAPGGTGVGGIGGLLLGLDGANAPASTNPLHTAQQQALAAVNAPIQA VTGRPLIGNGANGAPGSGAPGGHGGWLFGGGGTGGSGVSGGAGGDGGAGGILFGAGGA GGAGGAVTGTGATGGSGGAGGGALLFGAGGAGGAGGSSGIGGFAAGGAGGPGGAGGLF NGGGAGGAGGSGVSGGAGGEGGAGGAGGLFAGGGAGGAGGSGNNVGGAGGAGGVGGLF GAGGAGGSGGGGSVAGDSGAGGNAGLLAPGLAGGAGGGGGQGFDTGGAGGPGGDAGLL VGSGGVGGAGGFGLTTGGPGAAGGDAGLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGL IGNGGNGGAGGAGGNGGGDGGPGGAAFGLGNGGNGGNGGTGTSAGSPGAGGAGGSLIG AEGLPGLLP" gene complement(927837..930485) /gene="PE_PGRS14" /locus_tag="Rv0834c" /db_xref="GeneID:885369" CDS complement(927837..930485) /gene="PE_PGRS14" /locus_tag="Rv0834c" /function="UNKNOWN" /note="Rv0834c, (MTV043.26c), len: 882 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002), highly similar to many others e.g. MTCY493_4|Z95844 (1329 aa), FASTA scores: opt: 2577, E(): 0, (52.0% identity in 950 aa overlap); etc. TBparse score is 0.860. Thought to be differentially expressed within host cells (see Triccas et al., 1999)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177761.1" /db_xref="GI:57116789" /db_xref="GeneID:885369" /translation="MSFVIAAPDLVAMATEDLAGIGASLTAANAAAAVPTSGLLAAAG DEVSAAIAALFSSHGQQYQAMSAQAAAFHARFVQALAGAMGAYAAAEAANASPLQTLE QGLLGAINAPAAALSGRPFIGNGTNGAPGTGEAGGPGGWLLGNGGNGGSGAPGQTGGA GGAAGLLGHGGTGGAGGTGASGGKGGTGGWLWGSGGAGGAGGSGGGSGGAGGNALMFG IGGNGGAGGAASGVGNGGVGGAGGAGGALVAIGGAGGAGGAATTGTGGAGGAGSNALG LFLGLGGSGGQGGDSAMGSGGAGGAGGSGGAASPFGIDIGIGGAGGHGGAGTNGGAGG AGGAGGSSGTVFALDLSWGGAGGNGGAATTGTGGAGGTGGFAVAPDFIGFGAAYGGAG GLGGAATGAGGTGGTGGVGAGGFAALGVGVGGAGGAGGAATETGGIGGAGGLGVGLLG GAGGAGGPGGAASAGSGGHGGTGGDALGLIGAGIGGVGGVGGAATDTGGNGGAGGSGT GLLGGVGGAGGHGGGASVGTGGSGGAGGDGFGFVGAGGNGGNAGTGVGVNGANGGNGG SATGALAAVGGAGAAGGDATSGTGGFGGAGGSARGLIFALGGAGAAGGDASTGVGGPG GPGGTGTASSPFGIAIAIGGAGAQGGAGTSGATGGAGGDGVFEGIAVLGLGFGGAAGA GGAATGDGATGGAGGFGGAGAGIANFLGFSVLHGGAGGAGGTATGTGGNGGAGGGGGL SSPVILGIGIGGAGGDGGGALGVLGGMGGDGGDGGEAVAVGIAVGGAGGAGGAAPTGN GGAGGNGGDALGLVGVGGNGGNAGTGFGANTGGNGGDTTIVVNGMLAPSTLGYGGNGG NGVNGGAGGTGGKAGVFGAPGQNGLP" gene 930953..931597 /gene="lpqQ" /locus_tag="Rv0835" /db_xref="GeneID:885883" CDS 930953..931597 /gene="lpqQ" /locus_tag="Rv0835" /function="UNKNOWN" /note="Rv0835, (MTV043.27), len: 214 aa. Possible lpqQ, lipoprotein. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.944." /codon_start=1 /transl_table=11 /product="lipoprotein LpqQ" /protein_id="NP_215350.1" /db_xref="GI:15607975" /db_xref="GeneID:885883" /translation="MCCSTAAKSAVIVCCAAIATTACSFQATSTQPSTAPPTSRVDSL IVSIEDVRRIANYEELAAHFQTDLREPPEADTNVPGPCRVVGSSDRTFGTDWSEFRSA GYHGVTDDLRPGGPVMVETVSQAIALYPDPSTARGVFHRLESSLAECAGLHDPYFDFI LDRPDASTVRIGAAGWSHVYRLKSSVFISVGVLGIEPAEPIANVILQTISDRIQ" misc_feature 930989..931021 /gene="lpqQ" /locus_tag="Rv0835" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(932279..932932) /locus_tag="Rv0836c" /db_xref="GeneID:885356" CDS complement(932279..932932) /locus_tag="Rv0836c" /function="UNKNOWN" /note="Rv0836c, (MTV043.29c), len: 217 aa (start uncertain). Hypothetical unknown protein. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215351.1" /db_xref="GI:15607976" /db_xref="GeneID:885356" /translation="MLVGAQCRDLLHWRFCRGVPPRATNDTDIAGTLNNWDHFEAIRA TFRALGSTGHRFLIADRAVDALPFGEVESPTGTTRHPPGNQLMNVHGCTDAYLRADVL PLPGGLTVHLPQPPNYAVLKLHAWLDRSADHDYKDGPDLALVVHWYAGDLDRLYAKPD QWALRRHDFDLRTAAAALLGHDMRASVSAPEAAVLATRATQADHDLLAQHFAVGRPG" gene complement(933003..934031) /locus_tag="Rv0837c" /db_xref="GeneID:885109" CDS complement(933003..934031) /locus_tag="Rv0837c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0837c, (MTV043.30c), len: 342 aa. Hypothetical unknown protein. TBparse score is 0.941." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215352.1" /db_xref="GI:15607977" /db_xref="GeneID:885109" /translation="MDQIGADLAEAVERHLTEYGVRVLGGLSALNSAHPESLDLEIDA HPLTITALYLPHLSATAALQAWDTAGAGSPLLVVGPRLHPSSAETLRARGLWYIDGAG NAYLRHQGGLLIDVRGRRSAVSAQPGTLGDGLHSDGPRNPFTPKRAQVVCVLLDAPQL VDAPLRAIAASAGVSVGMAKETMDTLRTTGFFEHLGSRRRLVRTDELLDLWAAAYPGG LGRANKLLVASGDIHTWSAPDGLAVAVSGEQALPDEIRNPESLMLYVDTPAPGLPADL LIHNRWHRDPHGSIVIRKLFWRNLPDEQPGLAPTALIYADLLASREPRQVEVAHLMRR QDERLARL" gene 934720..935490 /gene="lpqR" /locus_tag="Rv0838" /db_xref="GeneID:885417" CDS 934720..935490 /gene="lpqR" /locus_tag="Rv0838" /function="UNKNOWN" /note="Rv0838, (MTV043.31), len: 256 aa. Probable lpqR, conserved lipoprotein. Similar (except in N-terminus) to hypothetical proteins and D-alanyl-D-alanine dipeptidases e.g. NP_416005.1|NC_000913 hypothetical protein from Escherichia coli strain K12 (193 aa); NP_421076.1|NC_002696 D-alanyl-D-alanine dipeptidase from Caulobacter crescentus (212 aa); Q06241|VANX_ENTFC D-ALANYL-D-ALANINE DIPEPTIDASE from Enterococcus faecium (202 aa), FASTA scores: opt: 198, E(): 1.9e-05, (28.1% identity in 199 aa overlap); etc. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.931." /codon_start=1 /transl_table=11 /product="lipoprotein LpqR" /protein_id="NP_215353.1" /db_xref="GI:15607978" /db_xref="GeneID:885417" /translation="MRLIGRLRLLMVGLVVICGACACDRVSAGRWSESPSATSWPVRP VNTTTPSGPVPPVSEAARAAGLVDVRGVVPDAAIDLRYATANNFTGTQLYPPGARCLV HESMAEGLAAAAAVLRPHGQVLVFWDCYRPHDVQVRMFDVVPNPAWVARPGKYAHSHE AGRSVDVTFASAQRQCPSVRRSGELCLADMGTDFDDFSSRATAFATQGVSAEAQANRA HLRAAMQAGGLTVYSGEWWHFDGPGAGVDRPILEVPVD" misc_feature 934756..934788 /gene="lpqR" /locus_tag="Rv0838" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 935577..936389 /locus_tag="Rv0839" /db_xref="GeneID:885255" CDS 935577..936389 /locus_tag="Rv0839" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0839, (MTV043.32), len: 270 aa. Conserved hypothetical protein, similar to various hypothetical proteins or methyltransferases from yeast and bacteria e.g. T34740|SC1E6.19c|AL033505|SC1E6_19 hypothetical protein from Streptomyces coelicolor (273 aa), FASTA scores: opt: 1102, E(): 0, (58.6% identity in 263 aa overlap); T38024|Z98598|SPAC1B3.06c hypothetical protein from Schizosaccharomyces pombe (278 aa), FASTA scores: opt: 562, E(): 1.9e-3, (36.4% identity in 269 aa overlap); JC6531 avermectin B 5-O-methyltransferase (EC 2.1.1.-) from Streptomyces avermitilis (283 aa); etc. Also similar to other Mycobacterium tuberculosis hypothetical proteins that may be methyltransferases e.g. Rv1523, Rv2952, Rv1405c, etc. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215354.1" /db_xref="GI:15607979" /db_xref="GeneID:885255" /translation="MNDKRRAIYTHGYHESVLRSHRRRTAENSAGYLLPYLVPGLSVL DVGCGPGTITVDLAARVVPGSVTGVEPTDDALSLARAEAQLHRLSNISFTTSDVHKLD FPDDAFDVVHAHQVLQHVADPVRALQEMRRVCTPGGIVAARDADYSGFIWFPKLPALD RWLDLYERAARANGGEPDAGRRLLSWARAAGFDDVTPTASVWCFATASAREWWGLVWA DRILQSDLAHQLVDSGLATAAQLEEISTAWREWAAAPDGWLAIPHGEILCRA" gene complement(936457..937317) /gene="pip" /locus_tag="Rv0840c" /db_xref="GeneID:885611" CDS complement(936457..937317) /gene="pip" /locus_tag="Rv0840c" /EC_number="3.4.11.5" /function="SPECIFICALLY CATALYZES THE REMOVAL OF N-TERMINAL PROLINE RESIDUES FROM PEPTIDES. THOUGHT TO RELEASE THE N-TERMINAL PROLINE FROM THE DIPEPTIDES, PRO-PRO, PRO-GLN, PRO-TRP AND PRO-TYR; ALSO FROM AMIDES (PRO-BETA NA) AND OLIGOPEPTIDES, PRO-LEU-GLYNH2, PRO-LEU-GLY AND PRO-PHE-GLY-LYS. HIGHER ACTIVITY TOWARD SMALL PEPTIDES (UP TO THREE RESIDUES), BUT VERY LOW ACTIVITY FOR LONGER PEPTIDES [CATALYTIC ACTIVITY: Release of a N-terminal proline from a peptide]." /note="Rv0840c, (MTV043.33c), len: 286 aa. Possible pip, proline iminopeptidase (EC 3.4.11.5), similar to many e.g. P46541|PIP_BACCO PROLINE IMINOPEPTIDASE from BACILLUS COAGULANS (288 aa), FASTA scores: opt: 657, E(): 0, (37.6% identity in 282 aa overlap); NP_386922.1|NC_003047 PUTATIVE PROLINE IMINOPEPTIDASE PROTEIN from Sinorhizobium meliloti (296 aa); etc. BELONGS TO PEPTIDASE FAMILY S33. TBparse score is 0.948." /codon_start=1 /transl_table=11 /product="proline iminopeptidase" /protein_id="NP_215355.1" /db_xref="GI:15607980" /db_xref="GeneID:885611" /translation="MEGTIAVPGGRVWFQRIGGGPGRPLLVVHGGPGLPHNYLAPLRR LSDEREVIFWDQLGCGNSACPSDVDLWTMNRSVAEMATVAEALALTRFHIFSHSWGGM LAQQYVLDKAPDAVSLTIANSTASIPEFSASLVSLKSCLDVATRSAIDRHEAAGTTHS AEYQAAIRTWNETYLCRTRPWPRELTEAFANMGTEIFETMFGPSDFRIVGNVRDWDVV DRLADIAVPTLLVVGRFDECSPEHMREMQGRIAGSRLEFFESSSHMPFIEEPARFDRV MREFLRLHDI" gene 937593..937835 /locus_tag="Rv0841" /db_xref="GeneID:3205068" CDS 937593..937835 /locus_tag="Rv0841" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0841, 80 aa. Conserved transmembrane protein, highly similar to C-terminus of next ORF Rv0842|O53854 PUTATIVE MEMBRANE PROTEIN from Mycobacterium tuberculosis (442 aa), FASTA scores: opt: 246, E(): 3.3e-10, (59.7% identity in 72 aa overlap). Replace previous Rv0841c." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="YP_177634.1" /db_xref="GI:57116790" /db_xref="GeneID:3205068" /translation="MVAASIVHHSAAPANRGRYHGIWSMTPVVASVVVPIMASYGPIH GAHLLAAVVVGSAGAALCLPLARALRRPTPSAMTTD" gene 938112..939404 /locus_tag="Rv0842" /db_xref="GeneID:885616" CDS 938112..939404 /locus_tag="Rv0842" /function="UNKNOWN" /note="Rv0842, (MT0864, MTV043.35), len: 430 aa. Probable conserved integral membrane protein, showing similarity with other integral membrane proteins e.g. P28246|BCR_ECOLI BICYCLOMYCIN RESISTANCE PROTEIN from EScherichia coli (396 aa), FASTA scores: opt: 216, E(): 5.4e-07, (23.7% identity in 376 aa overlap); etc. TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215357.1" /db_xref="GI:15607982" /db_xref="GeneID:885616" /translation="MRYTGPERCSGDGQVRAAGDRYSTVIWLLGGNLLVRSAGFGYPF LAYHVAGRGHGAGAVGAVVAAYGLGWAVGQLLCGWLVDRVGARVTLVSTMLVAAAVLV LMAGLHTVPGLLVGAMIAGLVCDAPRPVLGAVIAELVADPQRRAQLDGWRYGWVLNIG AAITGGVGGVVAGWLDTPVLYWINGIGCAIFAGLAGRCIPADVCRRTESGLRACTAMS KVGYRQALSDKRLVLLAVSGLATLTTLMGFFAAVPMLMSASGLGVGAYGWVQLINALA VVAVTPLLTPWLSKQLALGPRPDILAGAGVWVTLCMAAAGLARTTVGFSVAAAACSPG EIAWFVVAAGIVHRIAPPAHGGRYHGIWSMAVAASSVAAPILAAFNLANGGRLVLAAT TVTVGFFGAALCLPLARVLAAASCGPLSSKEPSRDSYQ" gene 939388..940392 /locus_tag="Rv0843" /db_xref="GeneID:885554" CDS 939388..940392 /locus_tag="Rv0843" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0843, (MTV043.36), len: 334 aa. Probable dehydrogenase (EC 1.-.-.-), similar to various dehydrogenases e.g. Q46142|Q46142 TPP-DEPENDENT ACETOIN DEHYDROGENASE (326 aa), FASTA scores: opt: 500, E(): 2.4e-26, (32.3% identity in 300 aa overlap); P51267|ODPA_PORPU PYRUVATE DEHYDROGENASE E1 COMPONENT from Porphyra purpurea (344 aa), FASTA scores: opt: 451, E(): 4.7e-23, (29.6% identity in 311 aa overlap); etc. Also similar to Rv2497c|pdhA pyruvate dehydrogenase E1 component from Mycobacterium tuberculosis (367 aa). TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_215358.1" /db_xref="GI:15607983" /db_xref="GeneID:885554" /translation="MTRTSEGLAAFVVDQLEELYRRMWVLRLLDMALEQLRIEGLING PLQGGFGQEAVSVGAAAALGEGDVIITTHRPHAQHVGTDAPLGPVIADMLGATAGDLE GADEDAHIADPRAGLPAAIRVVKQSPLLAIGHAYALWLRDTGRVTLCVTQDCDVDADA FNEAADLAAVWQLPVVILVENIRGALSVHLDRYTHEPRVYRRAVAYGMPGVSVDGNDV EAVRDCVANAVVRARAGGGPTLVQAITYRTTDFSGSDRGGYRDLAGSEQFLDPLIFAR RRLIAAGTTRGRLDEQERAACQQVADAVAFAKARARPNGGGPISRPTSGWHQQPKTRF" gene complement(940456..941106) /gene="narL" /locus_tag="Rv0844c" /db_xref="GeneID:885603" CDS complement(940456..941106) /gene="narL" /locus_tag="Rv0844c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM AND REGULATES NITRATE/NITRITE." /note="Rv0844c, (MTV043.37c), len: 216 aa. Possible narL, nitrate/nitrite response regulator protein, similar to many e.g. CAB44989.1|AJ131854 NarL protein from Pseudomonas stutzeri (218 aa); CAA75536.1|Y15252 nitrate/nitrite regulatory protein from Pseudomonas aeruginosa (216 aa); PCC6803|D64005|SYCSLRG_24 NarL protein from Synechocystis sp. (209 aa), FASTA scores: opt: 438, E(): 1.5e-23, (34.6% identity in 208 aa overlap); etc. Also similar to unidentified regulator e.g. CAB76009.1|AL157916 putative two-component system response regulator from Streptomyces coelicolor (224 aa); etc. Contains probable helix-turn helix motif from aa 170-191 (Score 1124, +3.02 SD). TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="nitrate/nitrite response transcriptional regulatory protein NarL" /protein_id="NP_215359.1" /db_xref="GI:15607984" /db_xref="GeneID:885603" /translation="MSNPQPEKVRVVVGDDHPLFREGVVRALSLSGSVNVVGEADDGA AALELIKAHLPDVALLDYRMPGMDGAQVAAAVRSYELPTRVLLISAHDEPAIVYQALQ QGAAGFLLKDSTRTEIVKAVLDCAKGRDVVAPSLVGGLAGEIRQRAAPVAPVLSARER EVLNRIACGQSIPAIAAELYVAPSTVKTHVQRLYEKLGVSDRAAAVAEAMRQRLLD" gene 941190..942467 /locus_tag="Rv0845" /db_xref="GeneID:885218" CDS 941190..942467 /locus_tag="Rv0845" /EC_number="2.7.-.-" /function="POSSIBLE SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv0845, (MTV043.38), len: 425 aa. Possible two-component sensor kinase (EC 2.7.-.-), with its C-terminus similar to C-terminal part of others e.g. NP_294951.1|NC_001263 two-component sensor histidine kinase from Deinococcus radiodurans (469 aa); CAC32293.1|AL583943 putative two component system histidine kinase from Streptomyces coelicolor (404 aa); NP_464546.1|NC_003210 protein similar to two-component sensor histidine kinase from Listeria monocytogenes (352 aa); BSUB0017_193|Z9912 two-component sensor kinase from Bacillus subtilis (360 aa), FASTA scores: opt: 275, E(): 1.6e-11, (30.3% identity in 234 aa overlap); etc. TBparse score is 0.938." /codon_start=1 /transl_table=11 /product="two component sensor kinase" /protein_id="NP_215360.1" /db_xref="GI:15607985" /db_xref="GeneID:885218" /translation="MPSYGNLGRLGGRHEYGVLVAMTSSAELDRVRWAHQLRSYRIAS VLRIGVVGLMVAAMVVGTSRSEWPQQIVLIGVYAVAALWALLLAYSASRRFFALRRFR SMGRLEPFAFTAVDVLILTGFQLLSTDGIYPLLIMILLPVLVGLDVSTRRAAVVLACT LVGFAVAVLGDPVMLRAIGWPETIFRFALYAFLCATALMVVRIEERHTRSVAGLSALR AELLAQTMTASEVLQRRIAEAIHDGPLQDVLAARQELIELDAVTPGDERVGRALAGLQ SASERLRQATFELHPAVLEQVGLGPAVKQLAASTAQRSGIKISTDIDYPIRSGIDPIV FGVVRELLSNVVRHSGATTASVRLGITDEKCVLDVADDGVGVTGDTMARRLGEGHIGL ASHRARVDAAGGVLVFLATPRGTHVCVELPLKR" gene complement(942680..944194) /locus_tag="Rv0846c" /db_xref="GeneID:885207" CDS complement(942680..944194) /locus_tag="Rv0846c" /function="MAY HAVE MULTICOPPER OXIDASE ACTIVITY." /note="Rv0846c, (MTV043.39c), len: 504 aa. Probable oxidase (EC 1.-.-.-), showing similarity with several oxidases, mainly L-ascorbate oxidases and copper resistance proteins A (precursors) e.g. P24792|ASO_CUCMA L-ASCORBATE OXIDASE PRECURSOR (ASCORBASE) (EC 1.10.3.3) from Cucurbita maxima (Pumpkin) (Winter squash) (579 aa), FASTA scores: opt: 423, E(): 5.8e-18, (28.4% identity in 493 aa overlap); AF010496|AF010496_32 potential multicopper oxidase from Rhodobacter capsulatus (491 aa), FASTA scores: opt: 490, E(): 2.7e-22, (28.8% identity in 510 aa overlap); 47452|PCOA_ECOLI COPPER RESISTANCE PROTEIN A PRECURSOR (BELONGS TO THE FAMILY OF MULTICOPPER OXIDASES) from Escherichia coli strain K12 (605 aa); etc. Contains PS00080 Multicopper oxidases signature 2 at C-terminus. SEEMS TO BELONG TO THE FAMILY OF MULTICOPPER OXIDASES. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="oxidase" /protein_id="NP_215361.1" /db_xref="GI:15607986" /db_xref="GeneID:885207" /translation="MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGA AGMTAAIDAAEAARPHSGRTVTATLTPQPARIDLGGPIVSTLTYGNTIPGPLIRATVG DEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRFSVPDPGTYWA HPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTDGIGKSPQQLYGELTDPNK PTMQNTTGMPEGEGVDSNLLGGDGGDIAYPYYLINGRIPVAATSFKAKPGQRIRIRII NSAADTAFRIALAGHSMTVTHTDGYPVIPTEVDALLIGMAERYDVMVTAAGGVFPLVA LAEGKNALARALLSTGAGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLP VTLGGTMAKYDWTINGEPYSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIK ADGSPGARKDTVIVLPKQKMRAVLVADNPGVWVMHCHNNYHQVAGMATRLDYIL" misc_feature complement(942707..942742) /locus_tag="Rv0846c" /note="PS00080 Multicopper oxidases signature 2" gene 944343..944735 /gene="lpqS" /locus_tag="Rv0847" /db_xref="GeneID:885051" CDS 944343..944735 /gene="lpqS" /locus_tag="Rv0847" /function="UNKNOWN" /note="Rv0847, (MTV043.40), len: 130 aa. Probable lpqS, lipoprotein. Contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="lipoprotein LpqS" /protein_id="NP_215362.1" /db_xref="GI:15607987" /db_xref="GeneID:885051" /translation="MVWMRSAIVAVALGVTVAAVAAACWLPQLHRHVAHPNHPLTTSV GSEFVINTDHGHLVDNSMPPCPERLATAVLPRSATPVLLPDVVAAAPGMTAALTDPVA PAARGPPAAQGSVRTGQDLLTRFCLARR" misc_feature 944382..944414 /gene="lpqS" /locus_tag="Rv0847" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 944938..946056 /gene="cysK2" /locus_tag="Rv0848" /db_xref="GeneID:885545" CDS 944938..946056 /gene="cysK2" /locus_tag="Rv0848" /EC_number="2.5.1.47" /function="THOUGHT TO BE INVOLVED IN CYSTEINE BIOSYNTHESIS [CATALYTIC ACTIVITY: O3-ACETYL-L-SERINE + H(2)S = L-CYSTEINE + ACETATE]." /note="Rv0848, (MTV043.41), len: 372 aa. Possible cysK2, cysteine synthase A (EC 4.2.99.8), but could be also a cysteine synthase B (EC 4.2.99.8) cysM2-product, similar to many e.g. NP_109408.1|NC_002682 cysteine synthase from Mesorhizobium loti (357 aa); Q44004|CYSM_ALCEU CYSTEINE SYNTHASE from Alcaligenes eutrophus strain CH34 (Ralstonia eutropha) (339 aa), FASTA scores: opt: 511, E(): 1.7e-25, (35.0% identity in 314 aa overlap); etc. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY. COFACTOR: PYRIDOXAL PHOSPHATE. Note that previously known as cysM3.; cysM3" /codon_start=1 /transl_table=11 /product="cysteine synthase A CysK2" /protein_id="YP_177762.1" /db_xref="GI:57116791" /db_xref="GeneID:885545" /translation="MRSRQTRDRYRLLPEGYQVTPGRNRHPGTMVGNTPVLWIPELSG TSDPDRGFWAKLEGFNPGGMKDRPALYMVECARARGDIAPGAAIVESTGGTLGLGLAL AGKVYRHPVTLVTDPGLEPIIARMLTAYGAGVDMVTQPHPVGGWQQARKDRVAQLMAE YPGAWNPNQYGNPDNVGAYRSLALELVAQLGRIDVLVCSVGTGGHSAGVARVLREFNP DMRLIGVDTIGSTIFGQPASNRLMRGLGSSIYPRNVDYRAFDEVHWVAPPEAVWACRS LAATHYASGGWSVGAVALVAGWAARNLPADTTIAAVFPDGPQRYFDTIYNDAYCNEHE LLGGQPPTEPDEIASPLDAVVTRWTRSTTVIDPTQVVS" gene 946056..947315 /locus_tag="Rv0849" /db_xref="GeneID:885111" CDS 946056..947315 /locus_tag="Rv0849" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY DRUG) ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0849, (MTV043.42), len: 419 aa. Probable conserved integral membrane transport protein, possibly member of major facilitator superfamily (MFS) involved in transport of drug, showing similarity with others e.g. T35055 probable transport system permease protein from Streptomyces coelicolor (436 aa); NP_295031.1|NC_001263 major facilitator family protein from Deinococcus radiodurans (458 aa); NP_455659.1|NC_003198 putative membrane transporter from Salmonella enterica subsp. enterica serovar Typhi (402 aa); etc. TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_215364.1" /db_xref="GI:15607989" /db_xref="GeneID:885111" /translation="MGARAIFRGFNRPSRVLMINQFGINIGFYMLMPYLADYLAGPLG LAAWAVGLVMGVRNFSQQGMFFVGGTLADRFGYKPLIIAGCLIRTGGFALLVVAQSLP SVLIAAAATGFAGALFNPAVRGYLAAEAGERKIEAFAMFNVFYQSGILLGPLVGLVLL ALDFRITVLAAAGVFGLLTVAQLVALPQHRADSEREKTSILQDWRVVVRNRPFLTLAA AMTGCYALSFQIYLALPMQASILMPRNQYLLIAAMFAVSGLVAVGGQLRITRWFAVRW GAERSLVVGATILAASFIPVAVIPNGQRFGVAVAVMALVLSASLLAVASAALFPFEMR AVVALSGDRLVATHYGFYSTIVGVGVLVGNLAIGSLMSAARRLNTDEIVWGGLILVGI VAVAGLRRLDTFTSGSQNMTGRWAAPR" repeat_region 947311..947641 /note="IS1606', len: 331 bp. Insertion sequence IS1606'" /mobile_element="insertion sequence:IS1606'" gene 947312..947644 /locus_tag="Rv0850" /db_xref="GeneID:885054" CDS 947312..947644 /locus_tag="Rv0850" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv0850, (MTV043.43), len: 110 aa. Putative transposase (fragment), similar in part to others e.g. Q45144|Q4514 TRANSPOSABLE ELEMENT IS31831 (436 aa), FASTA scores: opt: 175, E(): 4.3e-05, (38.6% identity in 57 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215365.1" /db_xref="GI:15607990" /db_xref="GeneID:885054" /translation="MTRDPHSPDCGREGSYRDTITRPLTDLPVAGYPLVPRVASPRYR CTTPQCGRAVFNQDLANVDQYLVVNQLAHQLIDGSSLIPDADKRWDARRHADMTHHLT SSLKENQS" gene complement(947641..948468) /locus_tag="Rv0851c" /db_xref="GeneID:885550" CDS complement(947641..948468) /locus_tag="Rv0851c" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0851c, (MTV043.44c), len: 275 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to many e.g. Q01198|LIGD_PSEPA C ALPHA-DEHYDROGENASE (EC 1.1.1.-)(SDR FAMILY) from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (305 aa); D11473|PSELIG_1 C alpha-dehydrogenase from P. paucimobilis (305 aa), FASTA scores: opt: 468, E(): 4.9e-23, (30.8% identity in 279 aa overlap); NP_421969.1|NC_002696 short chain dehydrogenase family protein from Caulobacter crescentus (278 aa); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_215366.1" /db_xref="GI:15607991" /db_xref="GeneID:885550" /translation="MDGFPGRGAVITGGASGIGLATGTEFARRGARVVLGDVDKPGLR QAVNHLRAEGFDVHSVMCDVRHREEVTHLADEAFRLLGHVDVVFSNAGIVVGGPIVEM THDDWRWVIDVDLWGSIHTVEAFLPRLLEQGTGGHVVFTASFAGLVPNAGLGAYGVAK YGVVGLAETLAREVTADGIGVSVLCPMVVETNLVANSERIRGAACAQSSTTGSPGPLP LQDDNLGVDDIAQLTADAILANRLYVLPHAASRASIRRRFERIDRTFDEQAAEGWRH" misc_feature complement(947956..948042) /locus_tag="Rv0851c" /note="PS00061 Short-chain dehydrogenases/reductases family signature" gene 948559..949395 /gene="fadD16" /locus_tag="Rv0852" /db_xref="GeneID:885044" CDS 948559..949395 /gene="fadD16" /locus_tag="Rv0852" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0852, (MTV043.45), len: 278 aa. Possible fadD16, fatty-acid-CoA synthetase (EC 6.2.1.-), similar in part to various CoA ligases e.g. P18163|LCFB_RAT LONG-CHAIN-FATTY-ACID--CoA LIGASE from Rattus norvegicus (Rat) (699 aa); D49366|LEP4CCOALA_1 4-coumarate:CoA ligase from Lithospermum erythrorhizon (636 aa), FASTA scores: opt: 134, E(): 0.15, (26.8% identity in 213 aa overlap); orgp|L09229|HUMFACAL_1 long-chain acyl-coenzyme A from homo sapiens (human) (699 aa), FASTA score: (50.0% identity in 40 aa overlap); etc. Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2." /codon_start=1 /transl_table=11 /product="fatty-acid-CoA ligase" /protein_id="NP_215367.1" /db_xref="GI:15607992" /db_xref="GeneID:885044" /translation="MFTIGYSCASRGADSWLIRRCSVVQGCLDDPGATVEAIDDDGWP HTGDPCSPNSAASGKYGERPASVSTGDIHSLVIASDYRVPDPGRVWPLLQRNKSALAD IGAHHVLIYASTHDSGRVLVMIGVRSREPIVELLRSRVFFDWFDAMGVDDIPAVFAGE IVDRFVAAPTTTQSTPRVPGVVVAAFASVNNVSNLTAEVRSAIARFTAAGIRKTWVFQ AFDDAHEVLILQEFADEAGARQWIEHPDAAAEWMSGAGVGAYPPLFVGRFFDMMRIEA LQ" misc_feature 948757..948789 /gene="fadD16" /locus_tag="Rv0852" /note="PS00626 Regulator of chromosome condensation (RCC1) signature 2" gene complement(949436..951118) /gene="pdc" /locus_tag="Rv0853c" /db_xref="GeneID:885576" CDS complement(949436..951118) /gene="pdc" /locus_tag="Rv0853c" /EC_number="4.1.1.-" /function="POSSIBLE INDOLE-3-PYRUVATE DECARBOXYLASE; EC 4.1.1.74 [CATALYTIC ACTIVITY: 3-(indol-3-yl)pyruvate = 2-(indol-3-yl)acetaldehyde + CO2], OR POSSIBLE PYRUVATE DECARBOXYLASE; EC 4.1.1.1 [CATALYTIC ACTIVITY: A 2-oxo acid = an aldehyde + CO2]." /note="Rv0853c, (MTV043.46c), len: 560 aa. Probable pdc, pyruvate or indole-pyruvate decarboxylase (EC 4.1.1.-), equivalent to NP_302424.1|NC_002677 pyruvate (or indolepyruvate) decarboxylase from Mycobacterium leprae (569 aa). Also highly similar to others e.g. AAB06571.1|L80006 indolepyruvate decarboxylase from Pantoea agglomerans (550 aa); Q12629|DCPY_KLULA PYRUVATE DECARBOXYLASE (EC 4.1.1.1) from Kluyveromyces marxianus var. lactis (563 aa); P71323 INDOLEPYRUVATE DECARBOXYLASE (EC 4.1.1.74) from Enterobacter herbicola (550 aa), FASTA scores: opt: 1642, E(): 0, (48.1% identity in 547 aa overlap); P23234|DCIP_ENTCL INDOLE-3-PYRUVATE DECARBOXYLASE (INDOLEPYRUVATE DECARBOXYLASE) from Enterobacter cloacae (552 aa), FASTA scores: opt: 1596, E(): 0, (46.8% identity in 551 aa overlap); etc. Contains PS00187 Thiamine pyrophosphate enzymes signature and PS00017 ATP/GTP-binding site motif A (P-loop). COFACTOR: THIAMINE PYROPHOSPHATE. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="pyruvate or indole-3-pyruvate decarboxylase pdc" /protein_id="NP_215368.1" /db_xref="GI:15607993" /db_xref="GeneID:885576" /translation="MTPQKSDACSDPVYTVGDYLLDRLAELGVSEIFGVPGDYNLQFL DHIVAHPTIRWVGSANELNAGYAADGYGRLRGMSAVVTTFGVGELSVTNAIAGSYAEH VPVVHIVGGPTKDAQGTRRALHHSLGDGDFEHFLRISREITCAQANLMPATAGREIDR VLSEVREQKRPGYILLSSDVARFPTEPPAAPLPRYPGGTSPRALSLFTKAAIELIADH QLTVLADLLVHRLQAVKELEALLAADVVPHATLMWGKSLLDESSPNFLGIYAGAASAE RVRAAIEGAPVLVTAGVVFTDMVSGFFSQRIDPARTIDIGQYQSSVADQVFAPLEMSA ALQALATILTGRGISSPPVVPPPAEPPPAMPARDEPLTQQMVWDRVCSALTPGNVVLA DQGTSFYGMADHRLPQGVTFIGQPLWGSIGYTLPAAVGAAVAHPDRRTVLLIGDGAAQ LTVQELGTFSREGLSPVIVVVNNDGYTVERAIHGETAPYNDIVSWNWTELPSALGVTN HLAFRAQTYGQLDDALTVAAARRDRMVLVEVVLPRLEIPRLLGQLVGSMAPQ" misc_feature complement(949775..949834) /gene="pdc" /locus_tag="Rv0853c" /note="PS00187 Thiamine pyrophosphate enzymes signature" misc_feature complement(950351..950374) /gene="pdc" /locus_tag="Rv0853c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 951183..951626 /locus_tag="Rv0854" /db_xref="GeneID:885127" CDS 951183..951626 /locus_tag="Rv0854" /function="UNKNOWN" /note="Rv0854, (MTV043.47), len: 147 aa. Conserved hypothetical protein, similar to several hypothetical protein from Mycobacterium leprae e.g. NP_301674.1|NC_002677 (144 aa); NP_302683.1|NC_002677|Z95398|MLCL622.27c (156 aa), FASTA scores: opt: 193, E(): 1.6e-06, (24.6% identity in 134 aa overlap); NP_301218.1|NC_002677 (146 aa); MTCI28.04|Z97050 (184 aa), FASTA scores: opt: 171, E(): 5.8e-05, (21.5% identity in 135 aa overlap). Also similar to SC6G10.02c|T35511|AL049497|SC6G10_2 hypothetical protein from Streptomyces coelicolor (144 aa), FASTA scores: opt: 344, E(): 6.1e- 17, (37.6% identity in 141 aa overlap). And similar to many proteins from Mycobacterium tuberculosis e.g. downstreams ORFs Rv0856 and Rv0857, etc. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215369.1" /db_xref="GI:15607994" /db_xref="GeneID:885127" /translation="MAIKESRDIVIEASPEEILDVIADFEAMTEWSPAHQSVEILETG DDGRPSKVKMKVKTAGITDEQVVAYSWTDRSVRWTLVSSTQQRSQDGKYELTPKGDNT LVQFEITVDPQVPLPGFVLKRAIKGTIDTATEALRSQVLKVKKGQ" gene 951632..952711 /gene="far" /locus_tag="Rv0855" /db_xref="GeneID:885790" CDS 951632..952711 /gene="far" /locus_tag="Rv0855" /EC_number="5.1.-.-" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION (RACEMIZATION)." /note="Rv0855, (MTV043.48), len: 359 aa. Probable far, fatty acid-CoA racemase (EC 5.1.-.-), highly similar to CAB08122.1|Z94723 unknown protein from Mycobacterium leprae (253 aa) (C-terminus shorter). Also similar to many eukaryotic and bacteria racemases e.g. T35425 probable fatty acid CoA racemase from Streptomyces coelicolor (387 aa); P70473|AMAC_RAT ALPHA-METHYLACYL-CoA RACEMASE (2-METHYLACYL-CoA RACEMASE) (2-ARYLPROPIONYL-COA EPIMERASE) (EC 5.1.99.4) from Rattus norvegicus (Rat) (382 aa); NP_103687.1|NC_002678 probable fatty acid Co-A racemase from Mesorhizobium loti (389 aa); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. Rv1143|MTCI65.10|MCR from Mycobacterium tuberculosis (360 aa), FASTA scores: opt: 1373, E(): 0, (56.8% identity in 359 aa overlap), Rv1866|MTCY359.07 (C-terminal half) (778 aa), Rv3272 (360 aa). TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="fatty-acid-CoA racemase" /protein_id="NP_215370.1" /db_xref="GI:15607995" /db_xref="GeneID:885790" /translation="MTTGGPLAGVKVIELGGIGPGPHAGMVLADLGADVVRVRRPGGL TMPSEDRDLLHRGKRIVDLDVKTQPQAMLELAAKADVLLDCFRPGTCERLGIGPDDCA SVNPRLIFARITGWGQDGPLASTAGHDINYLSQTGALAAFGYADRPPMPPLNLVADFG GGSMLVLLGIVVALYERERSGVGQVVDAAMVDGVSVLAQMMWTMKGIGSLRDQRESFL LDGGAPFYRCYETSDGKYMAVGAIEPQFFAALLSGLGLSAADVPTQLDVAGYPQMYDI FAERFASRTRDEWTRVFAGTDACVTPVLAWSEAANNDHLKARSTVITAHGVQQAAPAP RFSRTPAGPVRPPPAAATPIDEINW" gene 952825..953229 /locus_tag="Rv0856" /db_xref="GeneID:885783" CDS 952825..953229 /locus_tag="Rv0856" /function="UNKNOWN" /note="Rv0856, (MTV043.49), len: 134 aa. Conserved hypothetical protein, showing weak similarity with NP_301674.1| (NC_002677) conserved hypothetical protein from Mycobacterium leprae (144 aa); and SC6G10.02c|T35511 hypothetical protein from Streptomyces coelicolor (144 aa). Also highly similar to other proteins from Mycobacterium tuberculosis e.g. neighbouring ORF downstream Rv0857 CONSERVED HYPOTHETICAL PROTEIN (126 aa), FASTA scores: E(): 7.4e-27, (62.0% identity in 100 aa overlap); neighbouring ORF Rv0854|MTV043_47 CONSERVED HYPOTHETICAL PROTEIN (147 aa), FASTA scores: E(): 1.6e-15, (36.6% identity in 123 aa overlap), MTCI28.04|Z97050|MTCI28_4 (184 aa), FASTA scores: opt: 127, E(): 0.036, (26.0% identity in 127 aa overlap); and MLCL622.27c|Z95398 (156 aa), FASTA scores: opt: 123, E(): 0.06, (26.4% identity in 125 aa overlap). TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215371.1" /db_xref="GI:15607996" /db_xref="GeneID:885783" /translation="MEALADVGVLASWSPLHKQVEVIDYYPDGRPHHVRATVKILGLV DKEVLEYHWGPDWVCWDADQTFQQHGQHIEYTVKPEGVDRARVRFDITVEPAGPIPGF IVKRASEHVLDAAAKGLQKLIAGAGDQGNAKS" gene 953257..953730 /locus_tag="Rv0857" /db_xref="GeneID:885078" CDS 953257..953730 /locus_tag="Rv0857" /function="UNKNOWN" /note="Rv0857, (MTV043.50), len: 157 aa. Conserved hypothetical protein, showing weak similarity with Q9X7Y8|SC6G10.02c|T35511 hypothetical protein from Streptomyces coelicolor (144 aa), FASTA scores: opt: 215, E(): 7.6e-08, (30.282% identity in 142 aa overlap). Also highly similar to other proteins from Mycobacterium tuberculosis e.g. upstream ORF Rv0856 (134 aa), FASTA scores: opt: 566, E(): 2e-32, (58.15% identity in 129 aa overlap); upstream ORF Rv0854 (147 aa), FASTA scores: opt: 401, E(): 7.2e-21, (41.8% identity in 146 aa overlap); MTCI28.04|Z97050 (184 aa), FASTA scores: opt: 122, E(): 0.031, (29.4% identity in 85 aa overlap); and MLCL622.27c|Z95398 (156 aa), FASTA scores: opt: 114, E(): 0.1, (30.9% identity in 55 aa overlap). Length extended since first submission (+33 aa). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215372.2" /db_xref="GI:57116792" /db_xref="GeneID:885078" /translation="MIANLVAVAIRASREVVIEAPPEVIVEALADMDAVPSWSSVHKR VEVVDTYSDGRPHHVKVTIKVAGIVDTELLEYHWGPDWVVWDAAKTAQQHGQHGEYNL RREDNDKTRVRFTLTVEPSAPLPAFWVNIARKKILHAATEGLRKQVVGRRRFTSG" gene complement(953727..954920) /locus_tag="Rv0858c" /db_xref="GeneID:885784" CDS complement(953727..954920) /locus_tag="Rv0858c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0858c, (MTV043.51c), len: 397 aa. Probable aminotransferase (EC 2.6.1.-), highly similar to others from Eukaryota and bacteria, especially aspartate aminotransferases (transaminases) (EC 2.6.1.1), e.g. NP_177890.1|NC_003070 putative aminotransferase from Arabidopsis thaliana (440 aa); NP_419555.1|NC_002696 aminotransferase class I from Caulobacter crescentus (385 aa); NP_415133.1|NC_000913|AE0001|ECAE000165_8 putative aminotransferase from Escherichia coli strain K12 (386 aa), FASTA scores: opt: 830, E(): 0, (38.0% identity in 389 aa overlap); X99521|TAX99521_1 aspartate aminotransferase from Thermus aquaticus (383 aa), FASTA scores: opt: 702, E(): 0, (34.9% identity in 393 aa overlap); etc. Also similar to other putative aminotransferases from Mycobacterium tuberculosis e.g. Rv2294, Rv3565, etc. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="aminotransferase" /protein_id="NP_215373.1" /db_xref="GI:15607998" /db_xref="GeneID:885784" /translation="MTVSRLRPYATTVFAEMSALATRIGAVNLGQGFPDEDGPPKMLQ AAQDAIAGGVNQYPPGPGSAPLRRAIAAQRRRHFGVDYDPETEVLVTVGATEAIAAAV LGLVEPGSEVLLIEPFYDSYSPVVAMAGAHRVTVPLVPDGRGFALDADALRRAVTPRT RALIINSPHNPTGAVLSATELAAIAEIAVAANLVVITDEVYEHLVFDHARHLPLAGFD GMAERTITISSAAKMFNCTGWKIGWACGPAELIAGVRAAKQYLSYVGGAPFQPAVALA LDTEDAWVAALRNSLRARRDRLAAGLTEIGFAVHDSYGTYFLCADPRPLGYDDSTEFC AALPEKVGVAAIPMSAFCDPAAGQASQQADVWNHLVRFTFCKRDDTLDEAIRRLSVLA ERPAT" gene 955077..956288 /gene="fadA" /locus_tag="Rv0859" /db_xref="GeneID:885774" CDS 955077..956288 /gene="fadA" /locus_tag="Rv0859" /EC_number="2.3.1.9" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_215374.1" /db_xref="GI:15607999" /db_xref="GeneID:885774" /translation="MSEEAFIYEAIRTPRGKQKNGSLHEVKPLSLVVGLIDELRKRHP DLDENLISDVILGCVSPVGDQGGDIARAAVLASGMPVTSGGVQLNRFCASGLEAVNTA AQKVRSGWDDLVLAGGVESMSRVPMGSDGGAMGLDPATNYDVMFVPQSIGADLIATIE GFSREDVDAYALRSQQKAAEAWSGGYFAKSVVPVRDQNGLLILDHDEHMRPDTTKEGL AKLKPAFEGLAALGGFDDVALQKYHWVEKINHVHTGGNSSGIVDGAALVMIGSAAAGK LQGLTPRARIVATATSGADPVIMLTGPTPATRKVLDRAGLTVDDIDLFELNEAFASVV LKFQKDLNIPDEKLNVNGGAIAMGHPLGATGAMILGTMVDELERRNARRALITLCIGG GMGVATIIERV" misc_feature 955338..955394 /gene="fadA" /locus_tag="Rv0859" /note="PS00098 Thiolases acyl-enzyme intermediate signature" misc_feature 956121..956171 /gene="fadA" /locus_tag="Rv0859" /note="PS00737 Thiolases signature 2" misc_feature 956226..956267 /gene="fadA" /locus_tag="Rv0859" /note="PS00099 Thiolases active site" gene 956293..958455 /gene="fadB" /locus_tag="Rv0860" /db_xref="GeneID:885799" CDS 956293..958455 /gene="fadB" /locus_tag="Rv0860" /function="INVOLVED IN FATTY ACID DEGRADATION (PROBABLY IN FATTY ACID BETA-OXIDATION CYCLE)." /experiment="experimental evidence, no additional details recorded" /note="Rv0860, (MTV043.53), len: 720 aa. Probable fadB, fatty oxidation protein, equivalent to NP_302422.1|NC_002677 putative fatty oxidation complex alpha subunit from Mycobacterium leprae (714 aa). Also highly similar to others and various proteins involved in fatty acid metabolism, e.g. T35429 probable fatty oxidation protein from Streptomyces coelicolor (733 aa); NP_250428.1|NC_002516 probable 3-hydroxyacyl-CoA dehydrogenase from Pseudomonas aeruginosa (714 aa); NP_418895.1|NC_002696 fatty oxidation complex alpha subunit from Caulobacter crescentus (709 aa); P40939|ECHA_HUMAN TRIFUNCTIONAL ENZYME ALPHA SUBUNIT [INCLUDES: LONG-CHAIN ENOYL-CoA HYDRATASE (EC 4.2.1.17); LONG CHAIN 3-HYDROXYACYL-CoA DEHYDROGENASE (EC 1.1.1.35)] from Homo sapiens (763 aa), FASTA scores: opt: 1176, E(): 0, (32.4% identity in 722 aa overlap); P21177|FADB_ECOLI FATTY OXIDATION COMPLEX ALPHA SUBUNIT [INCLUDES: ENOYL-COA HYDRATASE (EC 4.2.1.17); DELTA(3)-CIS-DELTA(2)-TRANS-ENOYL-CoA ISOMERASE (EC 5.3.3.8); 3-HYDROXYACYL-CoA DEHYDROGENASE (EC 1.1.1.35); 3- HYDROXYBUTYRYL-CoA EPIMERASE (EC 5.1.2.3)] from Escherichia coli strain K12 (729 aa), FASTA scores: opt: 873, E(): 0, (33.6% identity in 693 aa overlap); etc. TBparse score is 0.864." /codon_start=1 /transl_table=11 /product="fatty oxidation protein FadB" /protein_id="NP_215375.1" /db_xref="GI:15608000" /db_xref="GeneID:885799" /translation="MPDNTIQWDKDADGIVTLTMDDPSGSTNVMNEAYIESMGKAVDR LVAEKDSITGVVVASAKKTFFAGGDVKTMIQARPEDAGDVFNTVETIKRQLRTLETLG KPVVAAINGAALGGGLEIALACHHRIAADVKGSQLGLPEVTLGLLPGGGGVTRTVRMF GIQNAFVSVLAQGTRFKPAKAKEIGLVDELVATVEELVPAAKAWIKEELKANPDGAGV QPWDKKGYKMPGGTPSSPGLAAILPSFPSNLRKQLKGAPMPAPRAILAAAVEGAQVDF DTASRIESRYFASLVTGQVAKNMMQAFFFDLQAINAGGSRPEGIGKTPIKRIGVLGAG MMGAGIAYVSAKAGYEVVLKDVSLEAAAKGKGYSEKLEAKALERGRTTQERSDALLAR ITPTADAADFKGVDFVIEAVFENQELKHKVFGEIEDIVEPNAILGSNTSTLPITGLAT GVKRQEDFIGIHFFSPVDKMPLVEIIKGEKTSDEALARVFDYTLAIGKTPIVVNDSRG FFTSRVIGTFVNEALAMLGEGVEPASIEQAGSQAGYPAPPLQLSDELNLELMHKIAVA TRKGVEDAGGTYQPHPAEAVVEKMIELGRSGRLKGAGFYEYADGKRSGLWPGLRETFK SGSSQPPLQDMIDRMLFAEALETQKCLDEGVLTSTADANIGSIMGIGFPPWTGGSAQF IVGYSGPAGTGKAAFVARARELAAAYGDRFLPPESLLS" gene complement(958523..960151) /gene="ercc3" /locus_tag="Rv0861c" /db_xref="GeneID:885425" CDS complement(958523..960151) /gene="ercc3" /locus_tag="Rv0861c" /EC_number="3.6.1.-" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. HAS HELICASE ACTIVITY: ACTS BY OPENING DNA EITHER AROUND THE RNA TRANSCRIPTION START SITE OR THE DNA DAMAGE." /note="Rv0861c, (MTV043.54c), len: 542 aa. Probable ercc3, DNA helicase (EC 3.6.1.-) (see citation below), equivalent to NP_302420.1|NC_002677 probable DNA helicase from Mycobacterium leprae (549 aa). Also highly similar to others (shorter than several eukaryotic enzymes) e.g. NP_218820.1|NC_000919|AE001217|AE0 01217_6 putative DNA repair helicase from Treponema pallidum (606 aa), FASTA scores: opt: 1275, E(): 0, (47.5% identity in 592 aa overlap); Q00578|RA25_YEAST DNA REPAIR HELICASE from Saccharomyces cerevisiae (843 aa), FASTA scores: opt: 777, E(): 0, (30.4% identity in 605 aa overlap); P49135|XPB_MOUSE DNA-REPAIR PROTEIN COMPLEMENTING XP-B CELLS from Mus musculus (Mouse) (783 aa), FASTA scores: opt: 761, E(): 0, (36.3% identity in 375 aa overlap); etc. SEEMS TO BELONG TO THE HELICASE FAMILY. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="DNA helicase ErcC3" /protein_id="NP_215376.1" /db_xref="GI:15608001" /db_xref="GeneID:885425" /translation="MQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITPL ALWNARAAGHDAEQVVDALVSYSRYAVPQPLLVDIVDTMARYGRLQLVKNPAHGLTLV SLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQLLLKIGWPAEDLAGYVD GEAHPISLHQEGWQLRDYQRLAADSFWAGGSGVVVLPCGAGKTLVGAAAMAKAGATTL ILVTNIVAARQWKRELVARTSLTENEIGEFSGERKEIRPVTISTYQMITRRTKGEYRH LELFDSRDWGLIIYDEVHLLPAPVFRMTADLQSKRRLGLTATLIREDGREGDVFSLIG PKRYDAPWKDIEAQGWIAPAECVEVRVTMTDSERMMYATAEPEERYRICSTVHTKIAV VKSILAKHPDEQTLVIGAYLDQLDELGAELGAPVIQGSTRTSEREALFDAFRRGEVAT LVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRILRPKADGGGAIFYSVVARD SLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI" repeat_region complement(960173..960225) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(960226..960278) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(960279..960333) /note="55 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene complement(960342..962612) /locus_tag="Rv0862c" /db_xref="GeneID:885413" CDS complement(960342..962612) /locus_tag="Rv0862c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0862c, (MTV043.55c), len: 756 aa. Conserved hypothetical protein, equivalent to NP_302419.1|NC_002677 possible DNA-binding protein from Mycobacterium leprae (753 aa); and highly similar (except in C-terminus) to MLCB57.01|Z99494|T45333 hypothetical protein from Mycobacterium leprae (>577 aa, truncated), FASTA scores: opt: 3047, E(): 0, (78.9% identity in 578 aa overlap). Also similar in part to SCD12A.03c|AB93395.1|AL357524 hypothetical protein from Streptomyces coelicolor (867 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215377.1" /db_xref="GI:15608002" /db_xref="GeneID:885413" /translation="MTEHTPDIPLGSWLAALPDERLTQLLELRPDLAQPPPGSIAALA ARAQARQSVKAATDELDFLRLAVFDALLVLQADTAPVPIVRLLAVIGDRAAQADVLGA LADLKQRALAWGETAVRVATDAGTALPWHPGQVTLEGSSRSGDQLADLIAGLDPAQRD VLDKLLQGSPVGRTRDAAPGAPSDRPVPRLLAMGLLRRIDAETVILPRHVGQVLRGEQ PGPMELTAPDPVVSTTTPDDADAAAAGAVIDLLREVDVLLENLGATPVAELRSGGLGV REFKRLAKATGIDEPRLGLILEIAAAAGLIASGMPDPEPPHSDGPFWAPTVAADRFAT MSPAERWHLLASAWLDLPGRPALIGTRGPDAKPYGALSDSLFSTAAPLDRRLLLGMLA ELPAGAGVDASRASATLIWRRPRWARRLQPAPIADLLTEGHALGLVGRGAISTPARAL LDEALEPATAPAAAVGVMARALPKPIDHFLVQADLTVVVPGPLQRELADDLTTVATVE SAGTAMVYRVSEQSIRHALDVGKSRDWLQEFFANRSKTPVPQGLTYLIDDVARRHGQL RIGMAASFVRCEDPTLLAQVVAAPEADGLALRALAPTVAVSPAPISEVLVTLRGAGFA PAAEDSTGAVVDVRTRGARVPTPQRRRPYRPPPRPNSEALKAVVAVLREVTAAPFANV RVDPAVTMSLLQRAAKDQATLVISYLDAAGVATQRVVAPITLRGGQLVAFDSSSGRLR DFAIHRITLVVSAHDR" gene 962599..962880 /locus_tag="Rv0863" /db_xref="GeneID:885423" CDS 962599..962880 /locus_tag="Rv0863" /function="UNKNOWN" /note="Rv0863, (MTV043.56), len: 93 aa. Conserved hypothetical protein, highly similar to NP_302418.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (74 aa). Also weakly similar in part to U82598|ECU82598_135 HYPOTHETICAL PROTEIN from Escherichia coli, FASTA scores: (32.4% identity in 71 aa overlap); and M74011|YEPYSCOP_8 HYPOTHETICAL PROTEIN from Yersinia enterocolitica (165 aa), FASTA scores: (38.6 identity in 57 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215378.1" /db_xref="GI:15608003" /db_xref="GeneID:885423" /translation="MCSVIADQRRPDQPCGVGGCKTCQNGFVADIAEGKARKTRYVDH GWPTTDPDDHAVSELVTDRTGALSPFGELTFPVPSDDLPYIHPVTVINR" gene 962890..963393 /gene="moaC" /locus_tag="Rv0864" /db_xref="GeneID:885826" CDS 962890..963393 /gene="moaC" /locus_tag="Rv0864" /function="INVOLVED IN THE BIOSYNTHESIS OF MOLYBDOPTERIN." /note="MoaC; along with MoaA is involved in conversion of a guanosine derivative into molybdopterin precursor Z; involved in molybdenum cofactor biosynthesis" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein MoaC" /protein_id="NP_215379.1" /db_xref="GI:15608004" /db_xref="GeneID:885826" /translation="MARASGASDYRSGELSHQDERGAAHMVDITEKATTKRTAVAAGI LRTSAQVVALISTGGLPKGDALATARVAGIMAAKRTSDLIPLCHQLALTGVDVDFTVG QLDIEITATVRSTDRTGVEMEALTAVSVAALTLYDMIKAVDPGALIDDIRVLHKEGGR RGTWTRR" gene 963390..963872 /gene="mog" /locus_tag="Rv0865" /db_xref="GeneID:885348" CDS 963390..963872 /gene="mog" /locus_tag="Rv0865" /function="INVOLVED IN MOLYBDOPTERIN BIOSYNTHESIS; INVOLVED IN THE BIOSYNTHESIS OF A DEMOLYBDO-COFACTOR (MOLYBDOPTERIN), NECESSARY FOR MOLYBDO-ENZYMES." /note="Rv0865, (MTV043.58), len: 160 aa. Probable mog, molybdopterin biosynthesis MOG protein, highly similar or similar to other molybdenum cofactor biosynthesis proteins e.g. CAB59675.1|AL132674 molybdenum cofactor biosynthesis protein from Streptomyces coelicolor (179 aa); NP_301253.1|NC_002677 putative molybdenum cofactor biosynthesis protein from Mycobacterium leprae (181 aa); CAC39235.1|AJ312124 Mog protein from Eubacterium acidaminophilum (162 aa); P44645|MOG_HAEIN|MOGA|HI0336 MOLYBDOPTERIN BIOSYNTHESIS MOG PROTEIN from Haemophilus influenzae (197 aa), FASTA scores: opt: 306, E(): 9e-13, (39.6% identity in 139 aa overlap); P28694|MOG_ECOLI MOLYBDOPTERIN BIOSYNTHESIS MOG PROTEIN from Escherichia coli (195 aa), FASTA scores: opt: 265, E(): 3.6e-10, (34.2 identity in 146 aa overlap); etc. Also highly similar to Rv0984|MTV044.12|MOAB2 POSSIBLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium tuberculosis (181 aa). TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="molybdopterin biosynthesis Mog protein" /protein_id="NP_215380.1" /db_xref="GI:15608005" /db_xref="GeneID:885348" /translation="MSTRSARIVVVSSRAAAGVYTDDCGPIIAGWLEQHGFSSVQPQV VADGNPVGEALHDAVNAGVDVIITSGGTGISPTDTTPEHTVAVLDYVIPGLADAIRRS GLPKVPTSVLSRGVCGVAGRTLIINLPGSPGGVRDGLGVLADVLDHALEQIAGGDHPR" gene 963869..964294 /gene="moaE2" /locus_tag="Rv0866" /db_xref="GeneID:885431" CDS 963869..964294 /gene="moaE2" /locus_tag="Rv0866" /function="POSSIBLY A MOLYBDENUM BIOSYNTHESIS COFACTOR. CONVERSION OF MOLYBDOPTERIN PRECURSOR Z INTO MOLYBDOPTERIN REQUIRES TRANSFER OF TWO SULFUR ATOMS TO PRECURSOR Z (TO GENERATE THE DITHIOLENE GROUP). THIS IS CATALYZED BY THE CONVERTING FACTOR COMPOSED OF A SMALL AND LARGE SUBUNIT." /note="Rv0866, (MTV043.59), len: 141 aa. Probable moaE2, molybdopterin converting factor E (molybdopterin converting factor (subunit 2)), similar to others e.g. Y10817|ANY10817_4|T44853 molybdopterin biosynthesis protein E chain from Arthrobacter nicotinovorans plasmid pAO1 (155 aa), FASTA scores: opt: 460, E(): 3.5e-27, (49.3 identity in 146 aa overlap); CAC01331.1|AL390968 moaE-like protein from Streptomyces coelicolor (152 aa); NP_389313.1|NC_000964 molybdopterin converting factor (subunit 2) from Bacillus subtilis (157 aa); etc. Also highly similar to Rv3119|MOAE1|Z95150|MTCY164_30 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN E from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 321, E(): 5.9e-17, (40.9% identity in 132 aa overlap); and O53375|GPHA|Rv3323c|MTV016.23c MOAD-MOAE FUSION PROTEIN from Mycobacterium tuberculosis (221 aa). TBparse score is 0.889." /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein E2" /protein_id="NP_215381.1" /db_xref="GI:15608006" /db_xref="GeneID:885431" /translation="MTQVLRAALTDQPIFLAEHEELVSHRSAGAIVGFVGMIRDRDGG RGVLRLEYSAHPSAAQVLADLVAEVAEESSGVRAVAASHRIGVLQVGEAALVAAVAAD HRRAAFGTCAHLVETIKARLPVWKHQFFEDGTDEWVGSV" gene complement(964312..965535) /gene="rpfA" /locus_tag="Rv0867c" /db_xref="GeneID:885749" CDS complement(964312..965535) /gene="rpfA" /locus_tag="Rv0867c" /function="UNKNOWN. MAY BE PROMOTE THE RESUSCITATION AND GROWTH OF DORMANT, NONGROWING CELL." /note="Rv0867c, (MTV043.60c), len: 407 aa. Possible rpfA, resuscitation-promoting factor (see citation below). N-terminus highly similar to N-terminal part (1-125 aa) of Z99494|MLCB57_3|NP_302417.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (174 aa), FASTA scores: opt: 785, E(): 1.8e-18, (63.0% identity in 200 aa overlap); and highly similar to C-terminus of NP_301299.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (375 aa); and middle part of NP_302360.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (157 aa). N-terminus also highly similar in part of three secreted proteins from Streptomyces coelicolor e.g. CAC09538.1|AL442120 putative secreted protein (244 aa). Regions highly similar to CAB76321.1|AL158060 putative membrane protein from Streptomyces coelicolor (121 aa); and middle part of CAB09664.1|Z96935 rpf from Micrococcus luteus (220 aa). Also highly similar in part to four resuscitation-promoting factors from Mycobacterium tuberculosis: Rv2450 (172 aa), Rv1009 (362 aa), Rv1884c (176 aa), and Rv2389c (154 aa). Contains a probable secretory signal sequence in N-terminus. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="resuscitation-promoting factor RpfA" /protein_id="NP_215382.1" /db_xref="GI:15608007" /db_xref="GeneID:885749" /translation="MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEW DQVARCESGGNWSINTGNGYLGGLQFTQSTWAAHGGGEFAPSAQLASREQQIAVGERV LATQGRGAWPVCGRGLSNATPREVLPASAAMDAPLDAAAVNGEPAPLAPPPADPAPPV ELAANDLPAPLGEPLPAAPADPAPPADLAPPAPADVAPPVELAVNDLPAPLGEPLPAA PADPAPPADLAPPAPADLAPPAPADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPA ELAPPADLAPASADLAPPAPADLAPPAPAELAPPAPADLAPPAAVNEQTAPGDQPATA PGGPVGLATDLELPEPDPQPADAPPPGDVTEAPAETPQVSNIAYTKKLWQAIRAQDVC GNDALDSLAQPYVIG" gene complement(965983..966261) /gene="moaD2" /locus_tag="Rv0868c" /db_xref="GeneID:885763" CDS complement(965983..966261) /gene="moaD2" /locus_tag="Rv0868c" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS." /note="Rv0868c, (MTV043.61c), len: 92 aa. Probable moaD2, molybdenum cofactor biosynthesis protein (molybdopterin converting factor (subunit 1)), similar to CAB88494.1|AL353816 putative molybdopterin converting factor from Streptomyces coelicolor (84 aa); and weakly similar to others MoaD proteins e.g. Z99111|BSUB0008_103 from Bacillus subtilis (77 aa), FASTA scores: opt: 86, E(): 2.8, (22.9% identity in 83 aa overlap); etc. Also some similarity with Rv3112|MOAD1|MTCY164.22 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D from Mycobacterium tuberculosis (83 aa), FASTA scores: opt: 113, E(): 0.024, (31.3% identity in 83 aa overlap). TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein D" /protein_id="NP_215383.1" /db_xref="GI:15608008" /db_xref="GeneID:885763" /translation="MTQVSDESAGIQVTVRYFAAARAAAGAGSEKVTLRSGATVAELI DGLSVRDVRLATVLSRCSYLRDGIVVRDDAVALSAGDTIDVLPPFAGG" gene complement(966265..967347) /gene="moaA" /locus_tag="Rv0869c" /db_xref="GeneID:885773" CDS complement(966265..967347) /gene="moaA" /locus_tag="Rv0869c" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS; INVOLVED IN THE BIOSYNTHESIS OF MOLYBDOPTERIN PRECURSOR Z FROM GUANOSINE." /note="together with moaC, is involved in the conversion of a guanosine derivative (GXP) into molybdopterin precursor Z" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein A" /protein_id="NP_215384.1" /db_xref="GI:15608009" /db_xref="GeneID:885773" /translation="MTLTALGMPALRSRTNGIADPRVVPTTGPLVDTFGRVANDLRVS LTDRCNLRCSYCMPERGLRWLPGEQLLRPDELARLIHIAVTRLGVTSVRFTGGEPLLA HHLDEVVAATARLRPRPEISLTTNGVGLARRAGALAEAGLDRVNVSLDSIDRAHFAAI TRRDRLAHVLAGLAAAKAAGLTPVKVNAVLDPTTGREDVVDLLRFCLERGYQLRVIEQ MPLDAGHSWRRNIALSADDVLAALRPHFRLRPDPAPRGSAPAELWLVDAGPNTPRGRF GVIASVSHAFCSTCDRTRLTADGQIRSCLFSTEETDLRRLLRGGADDDAIEAAWRAAM WSKPAGHGINAPDFIQPDRPMSAIGG" gene complement(967344..967733) /locus_tag="Rv0870c" /db_xref="GeneID:885709" CDS complement(967344..967733) /locus_tag="Rv0870c" /function="UNKNOWN" /note="Rv0870c, (MTV043.63c), len: 129 aa. Possible conserved integral membrane protein, highly similar to other membrane proteins: putative secreted proteins or hypothetical proteins e.g. CAC08263.1| AL392146 putative integral membrane protein from Streptomyces coelicolor (138 aa); NP_233433.1|NC_002506 conserved hypothetical protein from Vibrio cholerae (143 aa); NP_455572.1|NC_003198 putative membrane protein from Salmonella enterica subsp. enterica serovar Typhi (148 aa); P37065|YCCF_ECOLI HYPOTHETICAL 16.3 kDa PROTEIN from Escherichia coli (148 aa), FASTA scores: opt: 183, E(): 1.9e-06, (36.6% identity in 134 aa overlap); etc. TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215385.1" /db_xref="GI:15608010" /db_xref="GeneID:885709" /translation="MRLILNVIWLVFGGLWLALGYLLASLVCFLLIITIPFGFAALRI ASYALWPFGRTIVEKPTAGTGALIGNVIWVLLFGIWLALGHLVSAAAMAVTIIGIPLA LANLKLIPVSLVPLGKDIVGVNSQVPT" gene 967898..968305 /gene="cspB" /locus_tag="Rv0871" /db_xref="GeneID:885725" CDS 967898..968305 /gene="cspB" /locus_tag="Rv0871" /function="UNKNOWN; THOUGHT TO ACT IN RESPONSE TO LOW TEMPERATURE." /experiment="experimental evidence, no additional details recorded" /note="Rv0871, (MTV043.64), len: 135 aa. Probable cspB, cold shock-like protein B, equivalent to Z99494|MLCB57_7|MLCB57.11 probable cold shock protein from Mycobacterium leprae (136 aa), FASTA scores: opt: 787, E(): 0, (86.0% identity in 136 aa overlap). Also highly similar (but often longer than) to others e.g. CAB93399.1|AL357524 cold shock protein B from Streptomyces coelicolor (127 aa); Q45099|CSPD_BACCE COLD SHOCK-LIKE PROTEIN CSPD from Bacillus cereus (66 aa); Y101 81|LLCSPB_1 cold shock protein from Lactococcus lactis (66 aa), FASTA scores: opt: 220, E(): 2.5e-07, (48.3% identity in 60 aa overlap); etc. SEEMS TO BELONG TO THE COLD-SHOCK DOMAIN (CSD) FAMILY. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="cold shock-like protein B CspB" /protein_id="NP_215386.1" /db_xref="GI:15608011" /db_xref="GeneID:885725" /translation="MPTGKVKWYDPDKGFGFLSQEGGEDVYVRSSALPTGVEALKAGQ RVEFGIASGRRGPQALSLRLIEPPPSLSRPRREPAAEHKHSPDELHGMVEDMITLLES TVQPELRKGRYPDRKTARRVAEVVRAVAREFES" gene complement(968424..970244) /gene="PE_PGRS15" /locus_tag="Rv0872c" /db_xref="GeneID:885742" CDS complement(968424..970244) /gene="PE_PGRS15" /locus_tag="Rv0872c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0872c, (MTV043.65c), len: 606 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002), similar to many e.g. MTCY24A1.04c|Z95207 (615 aa), FASTA scores: opt: 2636, E(): 0, (64.6% identity in 619 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177763.1" /db_xref="GI:57116793" /db_xref="GeneID:885742" /translation="MSYVLATPEMVAAAANNLAQIGSTLSAANAAALAPTTGVLAAGA DEVSAAVASLFSGHAQAYQTLGTQAAAFHERFIQALSTAAGAYGSAEAANASPLQQAL NVINAPTQTLLGRPLIGNGTNGAPGTGQAGGPGGLLYGNGGNGGSGGVGQAGGAGGSA GLIGIGGTGGAGGAGAVGGVGGNGGWLYGNGGAGGLGGTGVAGVNGGMGAAGGAGGNA YLFGSGGAGGQGGMGAAGADGVNPTPTGTADAGSTGTDQTLGGNAIGGNGGPGDAGDA MTSGGAGGSGGNAVSTVNGDAVGGEGGKGGEGAYGGAGGAGGSAASIGNAAIGGNGGA GGNAQAPGGVGGAGGEGGDAQVGTNSPSNAEAGNGGSGGNGFDSFASGGTGGAGGTGG AGGRGGLLIGDGGAGGAGGVGGTGGSGAPGGGGGAGGDGGAANTDSAGSSRKAFGGDG GVGGDGASALGTGGEGGIGGQGGNGGAGGLLIGNGGAGGVGGTAGAGGTGGSGGAGGA GGAGGGGTNSGPGAAFGGNGNTGGNGGNGGAPGALGGKGGSGGLIGRAGSDGGVGAGG AGGAGGAGGTGGEGGTGGDGKTTDGNPGMGGSPGSAGQPG" misc_feature complement(968481..968504) /gene="PE_PGRS15" /locus_tag="Rv0872c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 970505..972457 /gene="fadE10" /locus_tag="Rv0873" /db_xref="GeneID:885636" CDS 970505..972457 /gene="fadE10" /locus_tag="Rv0873" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0873, (MTV043.66-MTCY31.01), len: 650 aa. Probable fadE10, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. CAB91129.1|AL355913 putative acyl CoA dehydrogenase from Streptomyces coelicolor (658 aa); P50544|ACDV_MOUSE ACYL-CoA DEHYDROGENASE from Mus musculus (656 aa); D30647|RATVLCAD_1 very-long-chain Acyl-CoA dehydrogenase from Rattus norvegicus (655 aa), FASTA scores: opt: 675, E(): 0, (33.9% identity in 380 aa overlap); etc." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE10" /protein_id="NP_215388.1" /db_xref="GI:15608013" /db_xref="GeneID:885636" /translation="MAQQTQVTEEQARALAEESRESGWDKPSFAKELFLGRFPLGLIH PFPKPSDAEEARTEAFLVKLREFLDTVDGSVIERAAQIPDEYVKGLAELGCFGLKIPS EYGGLNMSQVAYNRVLMMVTTVHSSLGALLSAHQSIGVPEPLKLAGTAEQKRRFLPRC AAGAISAFLLTEPDVGSDPARMASTATPIDDGQAYELEGVKLWTTNGVVADLLVVMAR VPRSEGHRGGISAFVVEADSPGITVERRNKFMGLRGIENGVTRLHRVRVPKDNLIGRE GDGLKIALTTLNAGRLSLPAIATGVAKQALKIAREWSVERVQWGKPVGQHEAVASKIS FIAATNYALDAVVELSSQMADEGRNDIRIEAALAKLWSSEMACLVGDELLQIRGGRGY ETAESLAARGERAVPVEQMVRDLRINRIFEGSSEIMRLLIAREAVDAHLTAAGDLANP KADLRQKAAAAAGASGFYAKWLPKLVFGEGQLPTTYREFGALATHLRFVERSSRKLAR NTFYGMARWQASLEKKQGFLGRIVDIGAELFAISAACVRAEAQRTADPVEGEQAYELA EAFCQQATLRVEALFDALWSNTDSIDVRLANDVLEGRYTWLEQGILDQSEGTGPWIAS WEPGPSTEANLARRFLTVSPSSEAKL" gene complement(972546..973706) /locus_tag="Rv0874c" /db_xref="GeneID:885658" CDS complement(972546..973706) /locus_tag="Rv0874c" /function="UNKNOWN" /note="Rv0874c, (MTCY31.02c), len: 386 aa. Conserved hypothetical protein, highly similar in part to SPU62616_1 hypothetical protein from Synechococcus sp. (280 aa), FASTA scores: E(): 6.3e-26, (35.2% identity in 264 aa overlap); SYCSLLLH_102 from Synechocystis sp. (447 aa), FASTA scores: E(): 1.1e-18, (29.5% identity in 400 aa overlap). Also highly similar to Rv0628c|MTCY20H10_9 from Mycobacterium tuberculosis (383 aa), FASTA scores: E():0, (81.5% identity in 383 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215389.1" /db_xref="GI:15608014" /db_xref="GeneID:885658" /translation="MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAH TDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDF VRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGR RRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGG RPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGS IEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMF GVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMALFVDDME" gene complement(973806..974294) /locus_tag="Rv0875c" /db_xref="GeneID:885669" CDS complement(973806..974294) /locus_tag="Rv0875c" /function="UNKNOWN" /note="Rv0875c, (MTCY31.03c), len: 162 aa. Possible conserved exported protein, equivalent to MLCB57_11|O33056 possible exported protein from Mycobacterium leprae (162 aa), FASTA scores: opt: 789, E(): 0, (71.4% identity in 161 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215390.1" /db_xref="GI:15608015" /db_xref="GeneID:885669" /translation="MKRGVATLPVILVILLSVAAGAGAWLLVRGHGPQQPEISAYSHG HLTRVGPYLYCNVVDLDDCQTPQAQGELPVSERYPVQLSVPEVISRAPWRLLQVYQDP ANTTSTLFRPDTRLAVTIPTVDPQRGRLTGIVVQLLTLVVDHSGELRDVPHAEWSVRL IF" gene complement(974291..975937) /locus_tag="Rv0876c" /db_xref="GeneID:885562" CDS complement(974291..975937) /locus_tag="Rv0876c" /function="UNKNOWN" /note="Rv0876c, (MTCY31.04c), len: 548 aa. Possible conserved transmembrane protein, equivalent to MLCB57_12|O33057 possible membrane protein from Mycobacterium leprae (579 aa), FASTA scores: opt: 2850, E(): 0, (81.0% identity in 568 aa overlap). Also highly similar (except in N-terminus) to CAB93403.1|AL357524 putative integral membrane protein from Streptomyces coelicolor (463 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215391.1" /db_xref="GI:15608016" /db_xref="GeneID:885562" /translation="MAPTPGRRTRNGSVNGHPGMANYPPDDANYRRSRRPPPMPSANR YLPPLGEQPEPERSRVPPRTTRAGERITVTRAAAMRSREMGSRMYLLVHRAATADGAD KSGLTALTWPVMANFAVDSAMAVALANTLFFAAASGESKSRVALYLLITIAPFAVIAP LIGPALDRLQHGRRVALALSFGLRTALAVVLIMNYDGATGSFPSWVLYPCALAMMVFS KSFSVLRSAVTPRVMPPTIDLVRVNSRLTVFGLLGGTIAGGAIAAGVEFVCTHLFQLP GALFVVVAITIAGASLSMRIPRWVEVTSGEVPATLSYHRDRGRLRRRWPEEVKNLGGT LRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHEANGWVQLGMLGLIGAAAAVG NFAGNFTSARLQLGRPAVLVVRCTVLVTVLAIAAAVAGSLAATAIATLITAGSSAIAK ASLDASLQHDLPEESRASGFGRSESTLQLAWVLGGAVGVLVYTELWVGFTAVSALLIL GLAQTIVSFRGDSLIPGLGGNRPVMAEQETTRRGAAVAPQ" gene 976075..976863 /locus_tag="Rv0877" /db_xref="GeneID:885601" CDS 976075..976863 /locus_tag="Rv0877" /function="UNKNOWN" /note="Rv0877, (MTCY31.05), len: 262 aa. Conserved hypothetical protein, equivalent to MLCB57_13|O33058 conserved hypothetical protein from Mycobacterium leprae (269 aa), FASTA scores: E(): 0, (80.5% identity in 257 aa overlap). Also highly similar (except in C-terminus) to SCD12A.13|CAB93404.1|AL357524 hypothetical protein from Streptomyces coelicolor (308 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215392.1" /db_xref="GI:15608017" /db_xref="GeneID:885601" /translation="MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAV GDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALL APDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVM SAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSAD GHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKPAES" gene complement(976872..978203) /gene="PPE13" /locus_tag="Rv0878c" /db_xref="GeneID:885617" CDS complement(976872..978203) /gene="PPE13" /locus_tag="Rv0878c" /function="UNKNOWN" /note="Rv0878c, (MTCY31.06c), len: 443 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. P4261|YHS6_MYCTU (517 aa), FASTA scores: opt: 1044, E(): 0, (47.4% identity in 397 aa overlap); MTV014_3, MTCI65_2, MTCY98_24, MTCY3C7_23, MTCY48_17, MTV004_5, MTV004_3, etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177764.1" /db_xref="GI:57116794" /db_xref="GeneID:885617" /translation="MNFMVLPPEVNSARIYAGAGPAPMLAAAVAWDGLAAELGMAAAS FSLLISGLTAGPGSAWQGPAAAAMAAAAAPYLSWLNAATARAEGAAAGAKAAAAVYEA ARAATAHPALVAANRNQLLSLVLSNLFGQNLPAIAATEASYEQLWAQDVAAMVGYHGG ASTVASQLTPWQQLLSVLPPVVTAAPAGAVGVPAALAIPALGVENIGVGNFLGIGNIG NNNVGSGNTGDYNFGIGNIGNANLGNGNIGNANLGSGNAGFFNFGNGNDGNTNFGSGN AGFLNIGSGNEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNSGDLNTGIGSPVTQGVAN SGFGNTGTGHSGFFNSGNSGSGFQNLGNGSSGFGNASDTSSGFQNAGTALTRASSTWA DSPRAWPIRAPSRLQVWRTRATTARECSIRVIISRVSSTGAPPQKKVGNSG" gene complement(978481..978756) /locus_tag="Rv0879c" /db_xref="GeneID:885095" CDS complement(978481..978756) /locus_tag="Rv0879c" /function="UNKNOWN" /note="Rv0879c, (MTCY31.07c), len: 91 aa. Possible conserved transmembrane protein, C-terminus highly similar to C-terminal part of MLCB57_14|O33059 conserved hypothetical protein from Mycobacterium leprae (91 aa), FASTA scores: E(): 1.2e-25, (76.9% identity in 91 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215394.1" /db_xref="GI:15608019" /db_xref="GeneID:885095" /translation="MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPG LASWRPVTVAGLATGLLGTTIFVWQLAAARRGARGAQAGLETYLDPK" gene 978934..979365 /locus_tag="Rv0880" /db_xref="GeneID:885205" CDS 978934..979365 /locus_tag="Rv0880" /function="THOUGHT TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0880, (MTCY31.08), len: 143 aa. Possible transcriptional regulator, MarR family, equivalent to MLCB57_15|O3306|NP_302411.1|NC_002677 putative MarR-family protein from Mycobacterium leprae (143 aa), FASTA scores: opt: 818, E(): 0, (89.5% identity in 143 aa overlap). Also similar to many others e.g. CAB93410.1|AL357524 putative marR-family protein from Streptomyces coelicolor (145 aa); NP_251757.1|NC_002516 probable transcriptional regulator from Pseudomonas aeruginosa (147 aa); etc. Also similar to Rv2327 from Mycobacterium tuberculosis (163 aa)." /codon_start=1 /transl_table=11 /product="MarR family transcriptional regulator" /protein_id="NP_215395.1" /db_xref="GI:15608020" /db_xref="GeneID:885205" /translation="MLDSDARLASDLSLAVMRLSRQLRFRNPSSPVSLSQLSALTTLA NEGAMTPGALAIRERVRPPSMTRVIASLADMGFVDRAPHPIDGRQVLVSVSESGAELV KAARRARQEWLAERLATLNRSERDILRSAADLMLALVDESP" gene 979362..980228 /locus_tag="Rv0881" /db_xref="GeneID:885121" CDS 979362..980228 /locus_tag="Rv0881" /EC_number="2.1.1.-" /function="CAUSES METHYLATION." /note="Rv0881, (MTCY31.09), len: 288 aa. Possible rRNA methyltransferase (EC 2.1.1.-), highly similar to others and hypothetical proteins e.g. CAB76071.1|AL157953 putative rRNA methylase from Streptomyces coelicolor (272 aa); NP_421117.1|NC_002696 spoU rRNA methylase family protein from Caulobacter crescentus (268 aa); D90913_93|P74261 rRNA METHYLASE from Synechocystis sp. (274 aa), FASTA scores: E(): 1.1e-13, (26.3% identity in 278 aa overlap); P18644|TSNR_STRCN rRNA METHYLTRANSFERASE (EC 2.1.1.66) from Streptomyces cyaneus (Streptomyces curacoi) (269 aa), FASTA scores: E(): 3.7e-08, (23.9% identity in 268 aa overlap); etc. Equivalent to AAK45146.1 from Mycobacterium tuberculosis strain CDC1551 (242 aa) but longer 46 aa." /codon_start=1 /transl_table=11 /product="rRNA methyltransferase" /protein_id="NP_215396.1" /db_xref="GI:15608021" /db_xref="GeneID:885121" /translation="MTEGRCAQHPDGLDVQDVCDPDDPRLDDFRDLNSIDRRPDLPTG KALVIAEGVLVVQRMLASRFTPLALFGTDRRLAELKDDLAGVGAPYYRASADVMARVI GFHLNRGVLAAAGRVPEPSVAQVVAGARTVAVLEGVNDHENLGSIFRNAAGLSVDAVV FGTGCADPLYRRAVRVSMGHALLVPYARAADWPTELMTLKESGFRLLAMTPHGNACKL PEAIAAVSHERIALLVGAEGPGLTAAALRISDVRVRIPMSRGTDSLNVATAAALAFYE RTRSGHHIGPGT" gene 980225..980509 /locus_tag="Rv0882" /db_xref="GeneID:885248" CDS 980225..980509 /locus_tag="Rv0882" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0882, (MTCY31.10), len: 94 aa. Probable transmembrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215397.1" /db_xref="GI:15608022" /db_xref="GeneID:885248" /translation="MNDQRDQAVPWATGLAVAGFVAAVIAVAVVVLSLGLIRVHPLLA VGLNIVAVSGLAPTLWGWRRTPVLRWFVLGAAVGVAGAWLALLALTLGDG" gene complement(980506..981267) /locus_tag="Rv0883c" /db_xref="GeneID:885139" CDS complement(980506..981267) /locus_tag="Rv0883c" /function="UNKNOWN" /note="Rv0883c, (MTCY31.11c), len: 253 aa. Conserved hypothetical protein, equivalent to O3306|MLCB57_16 CONSERVED HYPOTHETICAL PROTEI from Mycobacterium leprae (251 aa), FASTA scores: E(): 0, (79.4% identity in 253 aa overlap). Also highly similar to N_terminus of AL009204|SC9B10_22 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (352 aa), FASTA scores: E(): 6.1e-20, (35.0% identity in 246 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215398.1" /db_xref="GI:15608023" /db_xref="GeneID:885139" /translation="MRELKVVGLDADGKNIICQGAIPSEQFKLPVDDRLRAALRDDSV QPEQAQLDIEVTNVLSPKEIQARIRAGASVEQVAAASGSDIARIRRFAHPVLLERSRA AELATAAHPVLADGPAVLTMQETVAAALVARGLNPDSLTWDAWRNEDSRWTVQLAWKA GRSDNLAHFRFTPGAHGGTATAIDDTAHELINPTFNRPLRPLAPVAHLDFDEPEPAQP TLTVPSAQPVSNRRGKPAIPAWEDVLLGVRSGGRR" gene complement(981424..982554) /gene="serC" /locus_tag="Rv0884c" /db_xref="GeneID:885140" CDS complement(981424..982554) /gene="serC" /locus_tag="Rv0884c" /EC_number="2.6.1.52" /function="CATALYZES THE REVERSIBLE INTERCONVERSION OF PHOSPHOSERINE AND 2-OXOGLUTARATE TO 3-PHOSPHONOOXYPYRUVATE AND GLUTAMATE. REQUIRE BOTH IN THE MAJOR PHOSPHORYLATED PATHWAY OF SERINE BIOSYNTHESIS AND IN PYRIDOXINE BIOSYNTHESIS [CATALYTIC ACTIVITY: O-phospho-L-serine + 2-oxoglutarate = 3-phosphonooxypyruvate + L-glutamate]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 3-phosphonooxypyruvate and glutamate from O-phospho-L-serine and 2-oxoglutarate" /codon_start=1 /transl_table=11 /product="phosphoserine aminotransferase" /protein_id="NP_215399.1" /db_xref="GI:15608024" /db_xref="GeneID:885140" /translation="MADQLTPHLEIPTAIKPRDGRFGSGPSKVRLEQLQTLTTTAAAL FGTSHRQAPVKNLVGRVRSGLAELFSLPDGYEVILGNGGATAFWDAAAFGLIDKRSLH LTYGEFSAKFASAVSKNPFVGEPIIITSDPGSAPEPQTDPSVDVIAWAHNETSTGVAV AVRRPEGSDDALVVIDATSGAGGLPVDIAETDAYYFAPQKNFASDGGLWLAIMSPAAL SRIEAIAATGRWVPDFLSLPIAVENSLKNQTYNTPAIATLALLAEQIDWLVGNGGLDW AVKRTADSSQRLYSWAQERPYTTPFVTDPGLRSQVVGTIDFVDDVDAGTVAKILRANG IVDTEPYRKLGRNQLRVAMFPAVEPDDVSALTECVDWVVERL" gene 982762..983784 /locus_tag="Rv0885" /db_xref="GeneID:885285" CDS 982762..983784 /locus_tag="Rv0885" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0885, (MTCY31.13), len: 340 aa. Conserved hypothetical protein, equivalent to O33063|MLCB57_18 possible transmembrane protein from Mycobacterium leprae (341 aa), FASTA score: (83.9% identity in 341 aa overlap). Also similar except in C-terminus to T35630 probable membrane protein from Streptomyces coelicolor (312 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215400.1" /db_xref="GI:15608025" /db_xref="GeneID:885285" /translation="MDRTRIVRRWRRNMDVADDAEYVEMLATLSEGSVRRNFNPYTDI DWESPEFAVTDNDPRWILPATDPLGRHPWYQAQSRERQIEIGMWRQANVAKVGLHFES ILIRGLMNYTFWMPNGSPEYRYCLHESVEECNHTMMFQEMVNRVGADVPGLPRRLRWV SPLVPLVAGPLPVAFFIGVLAGEEPIDHTQKNVLREGKSLHPIMERVMSIHVAEEARH ISFAHEYLRKRLPRLTRMQRFWISLYFPLTMRSLCNAIVVPPKAFWEEFDIPREVKKE LFFGSPESRKWLCDMFADARMLAHDTGLMNPIARLVWRLCKIDGKPSRYRSEPQRQHL AAAPAA" gene 983803..985530 /gene="fprB" /locus_tag="Rv0886" /db_xref="GeneID:885195" CDS 983803..985530 /gene="fprB" /locus_tag="Rv0886" /EC_number="1.18.1.2" /function="SERVES AS THE FIRST ELECTRON TRANSFER PROTEIN IN ALL THE P450 SYSTEMS [CATALYTIC ACTIVITY: REDUCED ADRENODOXIN + NADP+ = OXIDIZED ADRENODOXIN + NADPH]." /note="Rv0886, (MTCY31.14), len: 525 aa. Probable fprB, ferredoxin/ferredoxin-NADP(+) reductase (NADPH:adrenodoxin oxidoreductase) (EC 1.18.1.2), equivalent to O3306|MLCB57_19 FERREDOXIN/FERREDOXIN--NADP REDUCTASE from Mycobacterium leprae (555 aa), FASTA scores: E(): 0, (76.6 identity in 560 aa overlap). Also highly similar or similar to others e.g. NP_294219.1|NC_001263 putative ferredoxin/ferredoxin--NADP reductase from Deinococcus radiodurans (479 aa) (N-terminus shorter); P22570|ADRO_HUMAN NADPH:adrenodoxin oxidoreductase from homo sapiens (497 aa), FASTA scores: opt: 624, E(): 3e-30, (39.7% identity in 484 aa overlap); P08165|ADRO_BOVIN NADPH:ADRENODOXIN OXIDOREDUCTASE from Bos taurus (492 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv3106, Rv3858c, etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature." /codon_start=1 /transl_table=11 /product="NADPH:adrenodoxin oxidoreductase FprB" /protein_id="NP_215401.1" /db_xref="GI:15608026" /db_xref="GeneID:885195" /translation="MPHVITQSCCNDASCVFACPVNCIHPTPDEPGFATSEMLYIDPV ACVDCGACVTACPVSAIAPNTRLDFEQLPFVEINASYYPKRPAGVKLAPTSKLAPVTP AAEVRVRRQPLTVAVVGSGPAAMYAADELLVQQGVQVNVFEKLPTPYGLVRSGVAPDH QNTKRVTRLFDRIAGHRRFRFYLNVEIGKHLGHAELLAHHHAVLYAVGAPDDRRLTID GMGLPGTGTATELVAWLNGHPDFNDLPVDLSHERVVIIGNGNVALDVARVLAADPHEL AATDIADHALSALRNSAVREVVVAARRGPAHSAFTLPELIGLTAGADVVLDPGDHQRV LDDLAIVADPLTRNKLEILSTLGDGSAPARRVGRPRIRLAYRLTPRRVLGQRRAGGVQ FSVTGTDELRQLDAGLVLTSIGYRGKPIPDLPFDEQAALVPNDGGRVIDPGTGEPVPG AYVAGWIKRGPTGFIGTNKSCSMQTVQALVADFNDGRLTDPVATPTALDQLVQARQPQ AIGCAGWRAIDAAEIARGSADGRVRNKFTDVAEMLAAATSAPKEPLRRRVLARLRDLG QPIVLTVPL" misc_feature 983938..983973 /gene="fprB" /locus_tag="Rv0886" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" gene complement(985513..985971) /locus_tag="Rv0887c" /db_xref="GeneID:885113" CDS complement(985513..985971) /locus_tag="Rv0887c" /function="UNKNOWN" /note="Rv0887c, (MTCY31.15c), len: 152 aa. Conserved hypothetical protein, highly similar to others e.g. NP_436346.1|NC_003037 Hypothetical protein from Sinorhizobium meliloti (149 aa); AL132644|SCI8_26 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (194 aa), FASTA scores: opt: 220, E(): 1.5e-07, (33.6% identity in 131 aa overlap); etc. Also shows weak similarity with transposases and related proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215402.1" /db_xref="GI:15608027" /db_xref="GeneID:885113" /translation="MAINVEPALSPHLVVDDAASAIDFYVKAFDAVELGRVPGPDGKL IHAALRINGFTVMLNDDVPQMCGGKSMTPTSLGGTPVTIHLTVTDVDAKFQRALNAGA TVVTALEDQLWGDRYGVVADPFGHHWSLGQPVREVNMDEIQAAMSSQGDG" gene 987233..988705 /locus_tag="Rv0888" /db_xref="GeneID:885210" CDS 987233..988705 /locus_tag="Rv0888" /function="UNKNOWN" /note="Rv0888, (MTCY31.16), len: 490 aa. Probable exported protein. Equivalent to AAK45157.1 from Mycobacterium tuberculosis strain CDC1551 (507 aa) but shorter 17 aa. Contains possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215403.1" /db_xref="GI:15608028" /db_xref="GeneID:885210" /translation="MDYAKRIGQVGALAVVLGVGAAVTTHAIGSAAPTDPSSSSTDSP VDACSPLGGSASSLAAIPGASVPQVGVRQVDPGSIPDDLLNALIDFLAAVRNGLVPII ENRTPVANPQQVSVPEGGTVGPVRFDACDPDGNRMTFAVRERGAPGGPQHGIVTVDQR TASFIYTADPGFVGTDTFSVNVSDDTSLHVHGLAGYLGPFHGHDDVATVTVFVGNTPT DTISGDFSMLTYNIAGLPFPLSSAILPRFFYTKEIGKRLNAYYVANVQEDFAYHQFLI KKSKMPSQTPPEPPTLLWPIGVPFSDGLNTLSEFKVQRLDRQTWYECTSDNCLTLKGF TYSQMRLPGGDTVDVYNLHTNTGGGPTTNANLAQVANYIQQNSAGRAVIVTGDFNARY SDDQSALLQFAQVNGLTDAWVQVEHGPTTPPFAPTCMVGNECELLDKIFYRSGQGVTL QAVSYGNEAPKFFNSKGEPLSDHSPAVVGFHYVADNVAVR" gene complement(988740..989861) /gene="citA" /locus_tag="Rv0889c" /db_xref="GeneID:885466" CDS complement(988740..989861) /gene="citA" /locus_tag="Rv0889c" /EC_number="2.3.3.1" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE (KREBS CYCLE) [CATALYTIC ACTIVITY: Citrate + CoA = acetyl-CoA + H2O + oxaloacetate]." /note="forms citrate from oxaloacetate and acetyl-CoA; functions in TCA cycle" /codon_start=1 /transl_table=11 /product="citrate synthase 2" /protein_id="NP_215404.1" /db_xref="GI:15608029" /db_xref="GeneID:885466" /translation="MTVVPENFVPGLDGVVAFTTEIAEPDKDGGALRYRGVDIEDLVS QRVTFGDVWALLVDGNFGSGLPPAEPFPLPIHSGDVRVDVQAGLAMLAPIWGYAPLLD IDDATARQQLARASVMALSYVAQSARGIYQPAVPQRIIDECSTVTARFMTRWQGEPDP RHIEAIDAYWVSAAEHGMNASTFTARVIASTGADVAAALSGAIGAMSGPLHGGAPARV LPMLDEVERAGDARSVVKGILDRGEKLMGFGHRVYRAEDPRARVLRAAAERLGAPRYE VAVAVEQAALSELRERRPDRAIETNVEFWAAVVLDFARVPANMMPAMFTCGRTAGWCA HILEQKRLGKLVRPSAIYVGPGPRSPESVDGWERVLTTA" misc_feature complement(989085..989123) /gene="citA" /locus_tag="Rv0889c" /note="PS00480 Citrate synthase signature" gene complement(989948..992596) /locus_tag="Rv0890c" /db_xref="GeneID:885227" CDS complement(989948..992596) /locus_tag="Rv0890c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0890c, (MTCY31.18c), len: 882 aa. Probable transcriptional regulatory protein, LuxR family, highly similar (but shorter 238 aa in N-terminus) to NP_302202.1|NC_002677 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also highly similar (generally in part) to others e.g. T50568 probable multi-domain regulatory protein from Streptomyces coelicolor (1334 aa); P10957|NARL_ECOLI nitrate/nitrite response regulator protein from Escherichia coli (216 aa), FASTA scores: opt: 193, E(): 6e-06, (37.4% identity in 99 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. MTCY02B10_22, MTV008_44, MTV036_21, and MTCY31_24. Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00622 Bacterial regulatory proteins, luxR family signature, and probable helix-turn helix motif from aa 836 to 857 (Score 1559, +4.50 SD). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="NP_215405.1" /db_xref="GI:15608030" /db_xref="GeneID:885227" /translation="MRALLAQNRLVTLCGTGGVGKTRLAIQIASASELRDGLCFVDLA PITESGIVAATAARAVGLPDQPGRSTMDSLRRFIGNRRMLMVLDNCEHLLDACAALVV ELLGACPELTILATSREPIGMAGEITWRVPSMSITDEAVELFADRASRVQPGFTIANH NAAAVGEICRRLDGIPLAIEFAAARVRSMSPLEIADGLDDCFRLLAGGVRGAVQRQQT LRASIDWSHALLTETEQILFRRLAPFVGGFDLAAVRAVAAGSDLDPFSVLDQLTLLVD KSLVVADDCQGRTRYRLLETVRRYALEKLGDSGEADVHARHRDYYTALAASLNTPADN DHQRLVARAETEIDNLRAAFAWSRENGHITEALQLASSLQPIWFGRAHLREGLSWFNS ILEDQRFHRLAVSTAVRARALADKAMLSTWLATSPVGATDIIAPAQQALAMAREVGDP AALVRALTACGCSSGYNAEAAAPYFAEATDLARAIDDKWTLCQILYWRGVGTCISGDP NALRAAAEECRDLADTIGDRFVSRHCSLWLSLAQMWAGNLTEALELSREITAEAEASN DVPTKVLGLYTQAQVLAYCGASAAHAIAGACIAAATELGGVYQGIGYAAMTYAALAAG DVTAALEASDAARPILRAQPDQVTMHQVLMAQLALAGGDAIAARQFANDAVDATNGWH RMVALTIRARVATARGEPELARDDAHAALACGAELHIYQGMPDAMELLAGLAGEVGSH SEGVRLLGAAAALRQQTRQVRFKIWDAGYQASVTALREAMGDEDFDRAWAEGAALSTD EAIAYAQRGRGERKRPARGWGSLTPTERDVVRLVSEGLSNKDIAKRLFVSPRTVQTHL THVYAKLGLPSRVQLVDEAARRGSPS" misc_feature complement(990011..990094) /locus_tag="Rv0890c" /note="PS00622 Bacterial regulatory proteins, luxR family signature" misc_feature complement(992531..992554) /locus_tag="Rv0890c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(992598..993455) /locus_tag="Rv0891c" /db_xref="GeneID:885493" CDS complement(992598..993455) /locus_tag="Rv0891c" /function="THOUGHT TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0891c, (MTCY31.19c), len: 285 aa. Possible transcriptional regulator, highly similar in N-terminus to NP_302202.1|NC_002677 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also highly similar to several Mycobacterium tuberculosis putative transcriptional regulators e.g. Q1102|MTCY02B10_22 PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN (1159 aa), FASTA scores: opt: 702, E(): 8.3e-40, (50.6% identity in 247 aa overlap); MTV036_21; MTV008_44; MTCY02B10_23. Also shows similarity with several adenylate cyclases and hydrolases from other organisms." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215406.1" /db_xref="GI:15608031" /db_xref="GeneID:885493" /translation="MLFNAVHNSLPPNIDIDHAILRGEDHPPTCAKCVARVRISALGS LDLRYHSLRCYAAPPDVGRCEFVPPRRRVLIANQGLDVSRLPPTGTVTLLLADVEEST HLWQMCPEDMATAIAHLDHTVSEAITNHGGVQPVKRYEGDSFVAAFTRASDAAACALD LQRTSLAPIRLRIGLHTGEVQLRDELYVGPTINRTARLRDLAHGGQVVLSAATGDLVT GRLPADAWLVDLGRHPLRGLPRPEWVMQLCHPDIREKFPPLRTAKSSPTSILPAQFTT FVGRRAQIS" gene 993853..995340 /locus_tag="Rv0892" /db_xref="GeneID:885225" CDS 993853..995340 /locus_tag="Rv0892" /EC_number="1.14.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0892, (MTCY31.20), len: 495 aa. Probable monooxygenase (EC 1.14.-.-), highly similar to others e.g. NP_250787.1|NC_002516 probable flavin-binding monooxygenase from Pseudomonas aeruginosa (491 aa); CAB59668.1|AL132674 monooxygenase from Streptomyces coelicolor (519 aa); P12015|CYMO_ACIS cyclohexanone monooxygenase from Acinetobacter sp. (542 aa), FASTA scores: opt: 489, E(): 6.8e-26, (30.3% identity in 492 aa overlap); etc. Also highly similar to Rv0565c, Rv3854c, Rv3083, etc from Mycobacterium tuberculosis. Has hydrophobic stretch at N-terminus." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_215407.1" /db_xref="GI:15608032" /db_xref="GeneID:885225" /translation="MTGRCPTVAVVGAGMSGMCVAITLLSAGITDVCIYEKADDVGGT WRDNTYPGLTCDVPSRLYQYSFAKNPNWTQMFSRGGEIQDYLRGIAERYGLRHRIRFG ATVVSARFDDGRWVLRTDSGTESTVDFLISATGVLHHPRIPPIAGLDDFRGTVFHSAR WDHTVPLLGRRIAVIGTGSTGVQLVCGLAGVAGKVTMFQRTAQWVLPWPNPRYSKLAR VFHRAFPCLGSLAYKAYSLSFETFAVALSNPGLHRKLVGAVCRASLRRVRDPRLRRAL TPDYEPMCKRLVMSGGFYRAIQRDDVELVTAGIDHVEHRGIVTDDGVLHEVDVIVLAT GFDSHAFFRPMQLTGRDGIRIDDVWQDGPHAHQTVAIPGFPNFFMMLGPHSPVGNFPL TAVAESQAEHIVQWIKRWRHGEFDTMEPKSAATEAYNTVLRAAMPNTVWTTGCDSWYL NKDGIPEVWPFAPAKHRAMLANLHPEEYDLRRYAAVRATSRPQSA" gene complement(995318..996295) /locus_tag="Rv0893c" /db_xref="GeneID:885477" CDS complement(995318..996295) /locus_tag="Rv0893c" /function="UNKNOWN" /note="Rv0893c, (MTCY31.21c), len: 325 aa. Conserved hypothetical protein, belongs in family with P96823|Rv0146|MTCI5.20 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (310 aa), FASTA scores: opt: 784, E(): 0, (43.2% identity in 308 aa overlap); Rv0726c, Rv0731c, Rv3399, etc. Also shows some similarity with others e.g. SC9B5.10|T35930 hypothetical protein from Streptomyces coelicolor (303 aa); BSUB0008_141|Q45500 HYPOTHETICAL 34.8 kDa PROTEIN from Bacillus subtilis (304 aa), FASTA scores: E(): 0.00033, (26.8% identity in 168 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215408.1" /db_xref="GI:15608033" /db_xref="GeneID:885477" /translation="MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVF CRAAGGEWADVLDGKLPDHYLTTGDFGEHFVNFQGARTRYFDEYFSRATAAGMKQVVI LAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRRSVAVDLRDEW QIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDTLASPGSHVAVEEATPLDP CEFAAKLERERAANAQGDPRRFFQMVYNERWARATEWFDERGWRATATPLAEYLRRVG RAVPEADTEAAPMVTAITFVSAVRTGLVADPARTSPSSTSIGFKRFEAD" gene 996524..997705 /locus_tag="Rv0894" /db_xref="GeneID:885199" CDS 996524..997705 /locus_tag="Rv0894" /function="THOUGHT TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv0894, (MTCY31.22), len: 393 aa. Possible regulatory protein, LuxR family, highly similar in part to NP_302202.1|NC_002677 possible transcriptional regulator from Mycobacterium leprae (1106 aa). Also similar to others e.g. CAB95788.1|AL359949 putative multi-domain regulatory protein from Streptomyces coelicolor (780 aa); NP_107293.1|NC_002678 transcriptional regulator from Mesorhizobium loti (903 aa); etc. Also similar to other regulatory proteins from Mycobacterium tuberculosis e.g. Rv2488c|MTV008_44 (1137 aa), FASTA score: (53.2% identity in 363 aa overlap); Rv1358|MTCY02B10_22 (1159 aa), FASTA score: (52.3% identity in 365 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="NP_215409.1" /db_xref="GI:15608034" /db_xref="GeneID:885199" /translation="MPSRATVQEFSDSYPFCHNGFRPIMMPKIVSVQHSTRRHLTSFV GRKAELNDVRRLLSDKRLVTLTGPDGMGKSRLALQIGAQIAHEFTYGRWDCDLATVTD RDCVSISMLNALGLPVQPGLSAIDTLVGVINDARVLLVLDHCEHLLDACAAIIDSLLR SCPRLTILTTSTEAIGLAGELTWRVPPLSLTNDAIELFVDRARRVRSDFAINADTAVT VGEICRRLDGVPLAIELAAARTDTLSPVEILAGLNDRFRLVAGAAGNAVRPEQTLCAT VQWSHALLSGPERALLHRLAVFAGGFDLDGAQAVGANDEDFEGYQTLGRFAELVDKAF VVVENNRGRAGYRLLYSVRQYALEKLSESGEADAVLARYRKHLKQPNQVVRAGSGGVR Y" misc_feature 996722..996745 /locus_tag="Rv0894" /note="PS00017 ATP/GTP-binding site motif A" gene 997782..999299 /locus_tag="Rv0895" /db_xref="GeneID:885481" CDS 997782..999299 /locus_tag="Rv0895" /function="UNKNOWN" /note="Rv0895, (MTCY31.23), len: 505 aa. Conserved hypothetical protein; member of family with: Rv3740c, Rv3734c, Rv1425, Rv1760, etc. Shows some similarity with NP_301898.1|NC_002677 conserved membrane protein from Mycobacterium leprae (491 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215410.1" /db_xref="GI:15608035" /db_xref="GeneID:885481" /translation="MRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAADALFLWAET PTRPLHVGALAVLSQPDNGTGRYLRKVFSAAVARQQVAPWWRRRPHRSLTSLGQWSWR TETEVDLDYHVRLSALPPRAGTAELWALVSELHAGMLDRSRPLWQVDLIEGLPGGRCA VYVKVHHALADGVSVMRLLQRIVTADPHQRQMPTLWEVPAQASVAKHTAPRGSSRPLT LAKGVLGQARGVPGMVRVVADTTWRAAQCRSGPLTLAAPHTPLNEPIAGARSVAGCSF PIERLRQVAEHADATINDVVLAMCGGALRAYLISRGALPGAPLIAMVPVSLRDTAVID VFGQGPGNKIGTLMCSLATHLASPVERLSAIRASMRDGKAAIAGRSRNQALAMSALGA APLALAMALGRVPAPLRPPNVTISNVPGPQGALYWNGARLDALYLLSAPVDGAALNIT CSGTNEQITFGLTGCRRAVPALSILTDQLAHELELLVGVSEAGPGTRLRRIAGRR" gene 999472..1000767 /gene="gltA" /locus_tag="Rv0896" /db_xref="GeneID:885208" CDS 999472..1000767 /gene="gltA" /locus_tag="Rv0896" /EC_number="2.3.3.1" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE (KREBS CYCLE) [CATALYTIC ACTIVITY: Citrate + CoA = acetyl-CoA + H2O + oxaloacetate]." /note="type II enzyme; in Escherichia coli this enzyme forms a trimer of dimers which is allosterically inhibited by NADH and competitively inhibited by alpha-ketoglutarate; allosteric inhibition is lost when Cys206 is chemically modified which also affects hexamer formation; forms oxaloacetate and acetyl-CoA and water from citrate and coenzyme A; functions in TCA cycle, glyoxylate cycle and respiration; enzyme from Helicobacter pylori is not inhibited by NADH" /codon_start=1 /transl_table=11 /product="type II citrate synthase" /protein_id="NP_215411.1" /db_xref="GI:15608036" /db_xref="GeneID:885208" /translation="MADTDDTATLRYPGGEIDLQIVHATEGADGIALGPLLAKTGHTT FDVGFANTAAAKSSITYIDGDAGILRYRGYPIDQLAEKSTFIEVCYLLIYGELPDTDQ LAQFTGRIQRHTMLHEDLKRFFDGFPRNAHPMPVLSSVVNALSAYYQDALDPMDNGQV ELSTIRLLAKLPTIAAYAYKKSVGQPFLYPDNSLTLVENFLRLTFGFPAEPYQADPEV VRALDMLFILHADHEQNCSTSTVRLVGSSRANLFTSISGGINALWGPLHGGANQAVLE MLEGIRDSGDDVSEFVRKVKNREAGVKLMGFGHRVYKNYDPRARIVKEQADKILAKLG GDDSLLGIAKELEEAALTDDYFIERKLYPNVDFYTGLIYRALGFPTRMFTVLFALGRL PGWIAHWREMHDEGDSKIGRPRQIYTGYTERDYVTIDAR" misc_feature 1000387..1000425 /gene="gltA" /locus_tag="Rv0896" /note="PS00480 Citrate synthase signature" gene complement(1000808..1002415) /locus_tag="Rv0897c" /db_xref="GeneID:885641" CDS complement(1000808..1002415) /locus_tag="Rv0897c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0897c, (MTCY31.25c), len: 535 aa. Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases from diverse organisms e.g. CAB94055.1|AL358672 putative oxidoreductase from Streptomyces coelicolor (540 aa); NP_147877.1|NC_000854 phytoene dehydrogenase from Aeropyrum pernix (549 aa); Q01671|CRTD_RHOSH methoxyneurosporene dehydrogenase from Rhodobacter sphaeroides (495 aa), FASTA scores: opt: 139, E(): 2.6e-06, (23.8% identity in 538 aa overlap); etc. Also similar to Rv1432, Rv2997, and Rv3829c from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215412.1" /db_xref="GI:15608037" /db_xref="GeneID:885641" /translation="MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGG AAVSIQAFDGVEVALSRYSYLVSLLPSRIVADLGAPVRLARRPFSSYTPAPATAGRSG LLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPLRTREQARRDI VEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATDALIGTFARMHEPSLMQNI CFLYHLVGGGTGVWHVPIGGMGSVTSALATAAARHGAEIVTGADVFALDPDGTVRYHS DGSDGAEHLVRGRFVLVGVTPAVLASLLGEPVAALAPGAQVKVNMVVRRLPRLRDDSV TPQQAFAGTFHVNETWSQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAG AQTLTVFGLHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIET TTTLDLQRTLGMTGGNIFHGALSWPFADNDDPLDTPARQWGVATDHERIMLCGSGARR GGAVSGIGGHNAAMAVLACLASRRKSP" gene complement(1002441..1002704) /locus_tag="Rv0898c" /db_xref="GeneID:885206" CDS complement(1002441..1002704) /locus_tag="Rv0898c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0898c, (MTCY31.26c), len: 87 aa. Conserved hypothetical protein, highly similar to CAC01589.1|AL391041 hypothetical protein from Streptomyces coelicolor (87 aa). Also shows some similarity to Rv0709|MTCY210.28|rpmC from Mycobacterium tuberculosis (77 aa), FASTA score: (28.8% identity in 73 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215413.1" /db_xref="GI:15608038" /db_xref="GeneID:885206" /translation="MGKGRKPTDSETLAHIRDLVAEEKALRAQLRHGGISESEEQQQL RRIEIELDQCWDLLRQRRALRQTGGDPREAVVRPADQVEGYTG" gene 1002812..1003792 /gene="ompA" /locus_tag="Rv0899" /db_xref="GeneID:885286" CDS 1002812..1003792 /gene="ompA" /locus_tag="Rv0899" /function="THE PROTEIN BEHAVED AS A PORIN OF LOW SPECIFIC ACTIVITY. STRUCTURAL PROTEIN THAT MAY PROTECT THE INTEGRITY OF THE BACTERIUM." /experiment="experimental evidence, no additional details recorded" /note="Rv0899, (MTCY31.27), len: 326 aa. ompA, outer membrane protein A (see citation below). C-terminal region similar to C-terminus of many members of the OMPA family of outer membrane proteins, e.g. NP_458280.1|NC_003198 putative outer membrane protein from Salmonella enterica subsp. enterica serovar Typhi (220); NP_418008.1|NC_000913 putative outer membrane protein from Escherichia coli strain K12 (219 aa), FASTA scores: opt: 296, E(): 2.2e-11, (45.3% identity in 117 aa overlap); NP_231844.1|NC_002505 outer membrane protein OmpA from Vibrio cholerae (321 aa); Q05146|OMPA_BORAV OUTER MEMBRANE PROTEIN A PRECURSOR from Bordetella avium (194 aa); etc. A signal peptide sequence probably exists at the N-terminus. Contains PS00044 Bacterial regulatory proteins, lysR family signature. BELONGS TO THE OMPA FAMILY." /codon_start=1 /transl_table=11 /product="outer membrane protein A OMPA" /protein_id="NP_215414.1" /db_xref="GI:15608039" /db_xref="GeneID:885286" /translation="MASKAGLGQTPATTDARRTQKFYRGSPGRPWLIGAVVIPLLIAA IGYGAFERPQSVTGPTGVLPTLTPTSTRGASALSLSLLSISRSGNTVTLIGDFPDEAA KAALMTALNGLLAPGVNVIDQIHVDPVVRSLDFSSAEPVFTASVPIPDFGLKVERDTV TLTGTAPSSEHKDAVKRAATSTWPDMKIVNNIEVTGQAPPGPPASGPCADLQSAINAV TGGPIAFGNDGASLIPADYEILNRVADKLKACPDARVTINGYTDNTGSEGINIPLSAQ RAKIVADYLVARGVAGDHIATVGLGSVNPIASNATPEGRAKNRRVEIVVN" misc_feature 1003304..1003381 /gene="ompA" /locus_tag="Rv0899" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene 1003805..1003957 /locus_tag="Rv0900" /db_xref="GeneID:885179" CDS 1003805..1003957 /locus_tag="Rv0900" /function="UNKNOWN" /note="Rv0900, (MTCY31.28), len: 50 aa. Possible membrane protein, with hydrophobic domain from aa 4-26." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215415.1" /db_xref="GI:15608040" /db_xref="GeneID:885179" /translation="MDFVIQWSCYLLAFLGGSAVAWVVVTLSIKRASRDEGAAEAPSA AETGAQ" gene 1003957..1004484 /locus_tag="Rv0901" /db_xref="GeneID:885203" CDS 1003957..1004484 /locus_tag="Rv0901" /function="UNKNOWN" /note="Rv0901, (MTCY31.29), len: 175 aa. Possible conserved exported or membrane protein, with hydrophobic N-terminus at aa 7-25. Shows some similarity in C-terminus to O33070|Z99494|MLCB57.59 HYPOTHETICAL PROTEIN from Mycobacterium leprae (113 aa), FASTA scores: opt: 204, E(): 3.2e-12, (44.9% identity in 78 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215416.1" /db_xref="GI:15608041" /db_xref="GeneID:885203" /translation="MEHVHWWLAGLAFTLGMVLTSTLMVRPVEHQVLVKKSVRGSSAK SKPPTARKPAVKSGTKREESPTAKTKVATESAAEQIPVAGEPAAEPIPVAGEPAARIP VVPYAPYGPGSARAGADGSGPQGWLVKGRSDTRLYYTPEDPTYDPTVAQVWFQDEESA ARAFFTPWRKSTRRT" gene complement(1004501..1005841) /gene="prrB" /locus_tag="Rv0902c" /db_xref="GeneID:885647" CDS complement(1004501..1005841) /gene="prrB" /locus_tag="Rv0902c" /EC_number="2.7.3.-" /function="SENSOR PART OF THE TWO COMPONENT REGULATORY SYSTEM PRRA/PRRB. THOUGHT TO BE INVOLVED IN THE ENVIRONMENTAL ADAPTATION , SPECIFICALLY IN AN EARLY PHASE OF THE INTRACELLULAR GROWTH." /experiment="experimental evidence, no additional details recorded" /note="Rv0902c, (MTCY31.30c), len: 446 aa. prrB, two-component sensor histidine kinase (EC 2.7.3.-) (see citations below), transmembrane protein, equivalent to MLCB57_26|NP_302403.1|NC_002677 sensor histidine kinase from Mycobacterium leprae (446 aa); and similar at C-termini to NP_301251.1|NC_002677 putative two-component system sensor kinase from Mycobacterium leprae (519 aa). C-terminus also similar to the C-termini of many sensor-like histidine kinase proteins e.g. P08336|CPXA_ECOLI|ECFB|SSD|EUP|B3911|Z5456|ECS4837 sensor protein from Escherichia coli strain K12 (457 aa), FASTA scores: opt: 364, E(): 1.7e-15, (27.1% identity in 398 aa overlap); CAB89748.1|AL354616 putative two-component histidine kinase from Streptomyces coelicolor (483 aa); CAB82845.1|AJ277081 putative histidine kinase from Amycolatopsis mediterranei (472 aa); etc. Also similar in part to Mycobacterium tuberculosis proteins Rv3764c (475 aa); and Rv0982 (504 aa). Thought to be induced at phagocytosis (see Graham & Clark-Curtiss 1999)." /codon_start=1 /transl_table=11 /product="two component sensor histidine kinase PRRB" /protein_id="NP_215417.1" /db_xref="GI:15608042" /db_xref="GeneID:885647" /translation="MNILSRIFARTPSLRTRVVVATAIGAAIPVLIVGTVVWVGITND RKERLDRRLDEAAGFAIPFVPRGLDEIPRSPNDQDALITVRRGNVIKSNSDITLPKLQ DDYADTYVRGVRYRVRTVEIPGPEPTSVAVGATYDATVAETNNLHRRVLLICTFAIGA AAVFAWLLAAFAVRPFKQLAEQTRSIDAGDEAPRVEVHGASEAIEIAEAMRGMLQRIW NEQNRTKEALASARDFAAVSSHELRTPLTAMRTNLEVLSTLDLPDDQRKEVLNDVIRT QSRIEATLSALERLAQGELSTSDDHVPVDITDLLDRAAHDAARIYPDLDVSLVPSPTC IIVGLPAGLRLAVDNAIANAVKHGGATLVQLSAVSSRAGVEIAIDDNGSGVPEGERQV VFERFSRGSTASHSGSGLGLALVAQQAQLHGGTASLENSPLGGARLVLRLPGPS" gene complement(1005852..1006562) /gene="prrA" /locus_tag="Rv0903c" /db_xref="GeneID:885209" CDS complement(1005852..1006562) /gene="prrA" /locus_tag="Rv0903c" /function="TRANSCRIPTIONAL REGULATOR PART OF THE TWO COMPONENT REGULATORY SYSTEM PRRA/PRRB. THOUGHT TO BE INVOLVED IN THE ENVIRONMENTAL ADAPTATION , SPECIFICALLY IN AN EARLY PHASE OF THE INTRACELLULAR GROWTH." /experiment="experimental evidence, no additional details recorded" /note="Rv0903c, (MTCY31.31c), len: 236 aa. prrA, two-component response regulator (see citations below), equivalent to Z99494|MLCB57_27|NP_302402.1|NC_002677 two-component response regulator from Mycobacterium leprae (233 aa), FASTA scores: opt: 1414, E(): 0, (95.7% identity in 233 aa overlap); and similar to T45446 probable two-component response regulator from Mycobacterium leprae (253 aa). Also similar to many sensor-like histidine kinase proteins e.g. CAB88489.1|AL353816 putative two-component systen response regulator from Streptomyces coelicolor (248 aa); AAG36759.1|AF119221_1 |AF119221 response regulator from Corynebacterium glutamicum (232 aa); Q02540|COPR_PSESM transcriptional activator protein COPR from Pseudomonas syringae (pv. tomato) (227 aa), FASTA scores: opt: 600, E(): 0, (44.4% identity in 225 aa overlap); etc. Also similar to Rv0981 from Mycobacterium tuberculosis (230 aa), Rv3765c (234 aa), phoP (247 aa), etc. Thought to be induced at phagocytosis (see Graham & Clark-Curtiss 1999)." /codon_start=1 /transl_table=11 /product="two component response transcriptional regulatory protein PRRA" /protein_id="NP_215418.1" /db_xref="GI:15608043" /db_xref="GeneID:885209" /translation="MGGMDTGVTSPRVLVVDDDSDVLASLERGLRLSGFEVATAVDGA EALRSATENRPDAIVLDINMPVLDGVSVVTALRAMDNDVPVCVLSARSSVDDRVAGLE AGADDYLVKPFVLAELVARVKALLRRRGSTATSSSETITVGPLEVDIPGRRARVNGVD VDLTKREFDLLAVLAEHKTAVLSRAQLLELVWGYDFAADTNVVDVFIGYLRRKLEAGG GPRLLHTVRGVGFVLRMQ" gene complement(1006693..1008180) /gene="accD3" /locus_tag="Rv0904c" /db_xref="GeneID:885279" CDS complement(1006693..1008180) /gene="accD3" /locus_tag="Rv0904c" /EC_number="6.4.1.2" /function="THIS PROTEIN IS A COMPONENT OF THE ACETYL COENZYME A CARBOXYLASE COMPLEX; FIRST, BIOTIN CARBOXYLASE CATALYZES THE CARBOXYLATION OF THE CARRIER PROTEIN AND THEN THE TRANSCARBOXYLASE TRANSFERS THE CARBOXYL GROUP TO FORM MALONYL-CoA [CATALYTIC ACTIVITY ATP + ACETYL-CoA + HCO(3)(-) = ADP + PHOSPHATE + MALONYL-COA]." /note="Rv0904c, (MTCY31.32c, MT0927), len: 495 aa. Putative accD3, acetyl-CoA carboxylase carboxyl transferase, beta subunit (carboxyltransferase subunit of acetyl-CoA carboxylase) (EC 6.4.1.2), highly similar in part to AAA63045.1|U15184 zinc finger protein from Mycobacterium leprae (201 aa). Also highly similar to others e.g. CAC42827.1|Y17592 putative carboxyltransferase subunit of acetyl-CoA carboxylase from Corynebacterium glutamicum (491 aa); CAB86110.1|AL163003 putative acetyl CoA carboxylase (alpha and beta subunits) from Streptomyces coelicolor (458 aa); Q54776|ACCD_SYNP7 ACETYL-COENZYME A CARBOXYLASE CARBOXYL TRANSFERASE SUBUNIT BETA from Synechococcus sp. (305 aa); P12217|ACCD_MARPO ACETYL-COENZYME A CARBOXYLASE CARBOXYL TRANSFERASE SUBUNIT BETA from Marchantia polymorpha (316 aa), FASTA scores: opt: 519, E():1.6e-24, (40.2% identity in 219 aa overlap); etc. Also similar to Rv3280, Rv2502c, etc from Mycobacterium tuberculosis. BELONGS TO THE ACCD/PCCB FAMILY." /codon_start=1 /transl_table=11 /product="putative acetyl-coenzyme A carboxylase carboxyl transferase subunit beta" /protein_id="NP_215419.1" /db_xref="GI:15608044" /db_xref="GeneID:885279" /translation="MSRITTDQLRHAVLDRGSFVSWDSEPLAVPVADSYARELAAARA ATGADESVQTGEGRVFGRRVAVVACEFDFLGGSIGVAAAERITAAVERATAERLPLLA SPSSGGTRMQEGTVAFLQMVKIAAAIQLHNQARLPYLVYLRHPTTGGVFASWGSLGHL TVAEPGALIGFLGPRVYELLYGDPFPSGVQTAENLRRHGIIDGVVALDRLRPMLDRAL TVLIDAPEPLPAPQTPAPVPDVPTWDSVVASRRPDRPGVRQLLRHGATDRVLLSGTDQ GEAATTLLALARFGGQPTVVLGQQRAVGGGGSTVGPAALREARRGMALAAELCLPLVL VIDAAGPALSAAAEQGGLAGQIAHCLAELVTLDTPTVSILLGQGSGGPALAMLPADRV LAALHGWLAPLPPEGASAIVFRDTAHAAELAAAQGIRSADLLKSGIVDTIVPEYPDAA DEPIEFALRLSNAIAAEVHALRKIPAPERLATRLQRYRRIGLPRD" gene 1008207..1008938 /gene="echA6" /locus_tag="Rv0905" /db_xref="GeneID:885825" CDS 1008207..1008938 /gene="echA6" /locus_tag="Rv0905" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215420.1" /db_xref="GI:15608045" /db_xref="GeneID:885825" /translation="MIGITQAEAVLTIELQRPERRNALNSQLVEELTQAIRKAGDGSA RAIVLTGQGTAFCAGADLSGDAFAADYPDRLIELHKAMDASPMPVVGAINGPAIGAGL QLAMQCDLRVVAPDAFFQFPTSKYGLALDNWSIRRLSSLVGHGRARAMLLSAEKLTAE IALHTGMANRIGTLADAQAWAAEIARLAPLAIQHAKRVLNDDGAIEEAWPAHKELFDK AWGSQDVIEAQVARMEKRPPKFQGA" gene 1008944..1010062 /locus_tag="Rv0906" /db_xref="GeneID:885150" CDS 1008944..1010062 /locus_tag="Rv0906" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0906, (MTCY31.34), len: 372 aa. Conserved hypothetical protein, highly similar to others e.g. SC6A5.25|AL049485|T35416 hypothetical protein from Streptomyces coelicolor (370 aa), FASTA scores: opt: 1125, E(): 0, (51.3% identity in 335 aa overlap); NP_242955.1|NC_002570|BH2089 conserved protein from Bacillus halodurans (370 aa); etc. Also shows some similarity to C-terminus of Q48412|ROMA_KLEPN Q48412 outer membrane protein roma (fragment) from Klebsiella pneumoniae (132 aa), FASTA scores: opt: 319, E(): 8.5e-14, (46.2% identity in 104 aa overlap); NP_105215.1|NC_002678 hypothetical protein which contains similarity to outer membrane protein romA from Enterobacter cloacae (350 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215421.1" /db_xref="GI:15608046" /db_xref="GeneID:885150" /translation="MVRRALRLAAGTASLAAGTWLLRALHGTPAALGADAASIRAVSE QSPNYRDGAFVNLDPASMFTLDREELRLIVWELVARHSASRPAAPIPLASPNIYRGDA SRLAVSWFGHSTALLEIDGYRVLTDPVWSDRCSPSDVVGPQRLHPPPVQLAALPAVDA VVISHDHYDHLDIDTVVALVGMQRAPFLVPLGVGAHLRSWGVPQDRIVELDWNQSAQV DELTVVCVPARHFSGRFLSRNTTLWASWAFVGPNHRAYFGGDTGYTKSFTQIGADHGP FDLTLLPIGAYNTAWPDIHMNPEEAVRAHLDVTDSGSGMLVPVHWGTFRLAPHPWGEP VERLLAAAEPEHVTVAVPLPGQRVDPTGPMRLHPWWRL" gene 1010136..1011734 /locus_tag="Rv0907" /db_xref="GeneID:885146" CDS 1010136..1011734 /locus_tag="Rv0907" /function="UNKNOWN, POSSIBLY INVOLVED IN CELL WALL BIOSYNTHESIS." /note="Rv0907, (MTCY21C12.01), len: 532 aa. Conserved hypothetical protein, possibly involved in cell wall biosynthesis: similar to many beta-lactamases, penicillin-binding proteins and hypothetical proteins e.g. NP_298910.1|NC_002488 beta-lactamase from Xylella fastidiosa (455 aa); Q06317|PBP4_NOCLA PENICILLIN-BINDING PROTEIN 4 (PBP-4) (381 aa), FASTA scores: opt: 299, E(): 8.8e-05, (28.7% identity in 401 aa overlap); etc. N-terminus highly similar to AAA63047.1|U15184 hypothetical protein from Mycobacterium leprae (58 aa). Related to other putative esterases and penicillin binding proteins in Mycobacterium tuberculosis e.g. Rv1730c|MTCY04C12.15c (517 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215422.1" /db_xref="GI:15608047" /db_xref="GeneID:885146" /translation="MATICGHDQTSGNGRHGDVADVNGCGSTHQALGPPSGLPDASPN ERSAIQIPAGRIDDAVAKVDGLVGELMQNTGIPGMAVAIVHGGKTLYAKGFGVRDVGK GGGPDNKVDADTVFQLASVSKSVGATVVAHAVTDNVVTWDTPVVSKLPWFALRDPYVT GQVTIADLYSHRSGLPDHAGDLLEDLGYDRRQVLQRLKYLPLAPFRISYAYTNFGVTA AAEAVAAAAGQSWEDLSDEVLYRPLGMGSTSSRFTDFLARPNHAVNHVKVADRWEARY QRDPDAQSPAGGVSSSLNDMTHWLAMVLADGVYNGRRITSPEALLPVYTPQVISRHPV SPRARASFYGYGFNVGVTSSGRTEYSHSGAFGLGAAANFVVLPSEDLAIIALTNAGPI GVPETLTAEFMDLVQYGQVREDWAALYKKAFAPLNELAGSLVGKQSPANPAPSRPLND YVGVYANDYWGPATVTYHDGQLRLSLGPKNQTFDLTHWDGDTFTFTLSTENALPGSIS KATFAGDTLNLEYYDADKLGTFTR" gene 1011731..1014124 /gene="ctpE" /locus_tag="Rv0908" /db_xref="GeneID:885657" CDS 1011731..1014124 /gene="ctpE" /locus_tag="Rv0908" /EC_number="3.6.1.-" /function="METAL CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF UNDETERMINATED METAL CATION WITH HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED METAL CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED METAL CATION(OUT)]." /note="Rv0908, (MTCY21C12.02), len: 797 aa. Probable ctpE, metal cation-transporting ATPase P-type, transmembrane protein, E1-E2 family, highly similar to many e.g. AB93406.1|AL357524 putative integral membrane ATPase from Streptomyces coelicolor (802 aa); NP_346063.1|NC_003028 cation-transporting ATPase (E1-E2 family) from Streptococcus pneumoniae (778 aa); P37278|ATCL_SYNP7|PACL cation-transporting atpase from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (926 aa), FASTA scores: opt: 257, E(): 4.8e-33, (27.7% identity in 905 aa overlap); etc. Contains E1-E2 ATPases phosphorylation site (PS00154). BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES)." /codon_start=1 /transl_table=11 /product="metal cation transporter ATPase P-type CtpE" /protein_id="NP_215423.1" /db_xref="GI:15608048" /db_xref="GeneID:885657" /translation="MTRSASATAGLTDAEVAQRVAEGKSNDIPERVTRTVGQIVRANV FTRINAILGVLLLIVLATGSLINGMFGLLIIANSVIGMVQEIRAKQTLDKLAIIGQAK PLVRRQSGTRTRSTNEVVLDDIIELGPGDQVVVDGEVVEEENLEIDESLLTGEADPIA KDAGDTVMSGSFVVSGAGAYRATKVGSEAYAAKLAAEASKFTLVKSELRNGINRILQF ITYLLVPAGLLTIYTQLFTTHVGWRESVLRMVGALVPMVPEGLVLMTSIAFAVGVVRL GQRQCLVQELPAIEGLARVDVVCADKTGTLTESGMRVCEVEELDGAGRQESVADVLAA LAAADARPNASMQAIAEAFHSPPGWVVAANAPFKSATKWSGVSFRDHGNWVIGAPDVL LDPASVAARQAERIGAQGLRVLLLAAGSVAVDHAQAPGQVTPVALVVLEQKVRPDARE TLDYFAVQNVSVKVISGDNAVSVGAVADRLGLHGEAMDARALPTGREELADTLDSYTS FGRVRPDQKRAIVHALQSHGHTVAMTGDGVNDVLALKDADIGVAMGSGSPASRAVAQI VLLNNRFATLPHVVGEGRRVIGNIERVANLFLTKTVYSVLLALLVGIECLIAIPLRRD PLLFPFQPIHVTIAAWFTIGIPAFILSLAPNNERAYPGFVRRVMTSAVPFGLVIGVAT FVTYLAAYQGRYASWQEQEQASTAALITLLMTALWVLAVIARPYQWWRLALVLASGLA YVVIFSLPLAREKFLLDASNLATTSIALAVGVVGAATIEAMWWIRSRMLGVKPRVWR" misc_feature 1012631..1012651 /gene="ctpE" /locus_tag="Rv0908" /note="PS00154 E1-E2 ATPases phosphorylation site" gene 1014681..1014860 /locus_tag="Rv0909" /db_xref="GeneID:885197" CDS 1014681..1014860 /locus_tag="Rv0909" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0909, (MTCY21C12.03), len: 59 aa. Conserved hypothetical protein, equivalent to NP_302399.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (56 aa). Also some similarity with AL022268|SC4H2_10c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (97 aa), FASTA scores: opt: 106, E(): 0.13, (43.2% identity in 37 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215424.1" /db_xref="GI:15608049" /db_xref="GeneID:885197" /translation="MGILDKVKNLLSQNADKVETVINKAGEFVDEQTQGNYSDAIHKL HDAASNVVGMSDQQS" gene 1014866..1015300 /locus_tag="Rv0910" /db_xref="GeneID:885137" CDS 1014866..1015300 /locus_tag="Rv0910" /function="UNKNOWN" /note="Rv0910, (MTCY21C12.04), len: 144 aa. Conserved hypothetical protein, equivalent to NP_302398.1|NC_002677 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (181 aa), FASTA scores: opt: 820, E(): 0, (83.9% identity in 143 aa overlap). Also similar to Rv1546|MTCY48.19c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (143 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215425.1" /db_xref="GI:15608050" /db_xref="GeneID:885137" /translation="MAKLSGSIDVPLPPEEAWMHASDLTRYREWLTIHKVWRSKLPEV LEKGTVVESYVEVKGMPNRIKWTIVRYKPPEGMTLNGDGVGGVKVKLIAKVAPKEHGS VVSFDVHLGGPALLGPIGMIVAAALRADIRESLQNFVTVFAG" gene 1015398..1016171 /locus_tag="Rv0911" /db_xref="GeneID:885093" CDS 1015398..1016171 /locus_tag="Rv0911" /function="UNKNOWN" /note="Rv0911, (MTCY21C12.05), len: 257 aa. Conserved hypothetical protein, showing similarity with hydroxylases and hypothetical proteins e.g. T35325 probable hydroxylase from Streptomyces coelicolor (265 aa); Q54242 hypothetical protein from Streptomyces, FASTA scores: opt: 372, E(): 8.8e-18, (32.0% identity in 256 aa overlap); AAD04716.1|U77891 doxorubicin biosynthesis enzyme DnrV from Streptomyces peucetius (275 aa); AAA63051.1|U15184 hypothetical protein from Mycobacterium leprae (94 aa); etc. Also similar to Rv0577 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (261 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215426.1" /db_xref="GI:15608051" /db_xref="GeneID:885093" /translation="MPTRSSAPLGAPCWIDLTTSDVDRAQDFYGTVFGWAFESAGPDY GGYINAAKGGHPVAGLMANRPEFQSPDGWATYFHTVDIGATVAKLAAAGGSSCLDPME VPGKGFMSLAVDPSGAAFGLWQPLQHHGFEVIGEAGSPVWHQLTTRDYRSVIDFYRQV FGWRTEQISDTDEFCYTTAWFDDQQLLGVMDGSSCLPEGVPSNWTIFFGAEDVDETLR VICDNGGSVVRAAENTPYGRLAAAADPMGVVFNLSSLQA" gene 1016236..1016685 /locus_tag="Rv0912" /db_xref="GeneID:885103" CDS 1016236..1016685 /locus_tag="Rv0912" /function="UNKNOWN" /note="Rv0912, (MTCY21C12.06), len: 149 aa. Probable conserved transmembrane protein, equivalent to Q50121|NP_302397.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (144 aa), FASTA scores: opt: 677, E(): 6.9e-38, (69.5% identity in 141 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215427.1" /db_xref="GI:15608052" /db_xref="GeneID:885103" /translation="MTRRLRPGWLVALSAAVIAASTWMPWLTTTVGGGGWVNAIGGTH GSLELPHGFGPGQLIVLLSSTLLVVGAMAGRGLSVKLSSIAALVVSLLIVALTVWYYK LNVNPPVSAEYGLYFGAAGGVCAVGCSLWAAVSAASPGRRRHREVVR" gene complement(1017217..1018725) /locus_tag="Rv0913c" /db_xref="GeneID:885042" CDS complement(1017217..1018725) /locus_tag="Rv0913c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0913c, (MTCY21C12.07c), len: 501 aa. Possible dioxygenase (EC 1.-.-.-), showing similarity with others e.g. AAK38744.1|AY029525 carotenoid 9,10-9',10' cleavage dioxygenase from Phaseolus vulgaris (543 aa); CAB56138.1|AL117669 putative dioxygenase from Streptomyces coelicolor (503 aa); AAK06796.1|AF324838_15|AF324838 putative dioxygenase SimC5 from Streptomyces antibioticus (456 aa); Q53353|S65040 LIGNOSTILBENE-ALPHA,BETA-DIOXYGENASE (EC 1.13.11.43) from Pseudomonas paucimobilis (485 aa), FASTA scores: opt: 310, E(): 3.4e-20, (28.9% identity in 495 aa overlap); etc. Also some similarity with Rv0654|MTCI376.22 PROBABLE DIOXYGENASE from Mycobacterium tuberculosis (501 aa)." /codon_start=1 /transl_table=11 /product="dioxygenase" /protein_id="NP_215428.1" /db_xref="GI:15608053" /db_xref="GeneID:885042" /translation="MDITIVGKYLSTLPEDDDHPYRTGPWRPQTTEWDADDLTTVTGE VPADLDGIYLRNTENPLHPAFATYHPFDGDGMIHVVGFRDGKAFYRNRFIRTDGFLAE NEAGGPLWPGLAEPVQLAKREHGWGARGLMKDASSTDVIVHRGIALTSFYQCGDLYRI DPYSANTLGKESWHGRFPFDWGVSAHPKVDNKTGELLFFNYSKQEPYMRYGVVDQNNE LVHYVDVPLPGPRLPHDMAFTENYVILNDFPLFWDPRLLERDVHLPRFYPEIPSRFAV VARRGNDIRWFEADPTFVLHFTNAYEQGDEIVLDGFYEGDPQPLDTGGTKWEKLFRFL ALDRLQSRLHRWRLNMVTGAVHEEQLSESITEFGTINADYAASSYRYTYAATGKPSWF LFDGLVKHDLLTGNHECYSFGDGVYGSETAMAPRVGSSAEDDGYLVTLTTDMNDDASY CLVFDAARPGDGPICKLALPERISSGTHSAWVPGAELRRWDHAESPAAAVGL" gene complement(1018727..1019965) /locus_tag="Rv0914c" /db_xref="GeneID:885046" CDS complement(1018727..1019965) /locus_tag="Rv0914c" /EC_number="2.3.1.9" /function="THOUGHT TO BE INVOLVED IN DEGRADATIVE PATHWAYS SUCH AS FATTY ACID BETA_OXIDATION." /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_215429.1" /db_xref="GI:15608054" /db_xref="GeneID:885046" /translation="MDDGVWILGGYQSDFARNLSKENRDFADLTREVVDGTLTAAKVD AADLAAAGVVHVANAFGEMFARQGHLGAMPATVCDDLWDTPATRHEAACASGSVATLA AMADLRSGAYRVALVVGLELEKTVPGDTAAEHLSAAAWTGHEGAEARYLWPSMFAQVA DEYDRRYGLDDTHLRAIAQLNFANARRNPNAQTRGWTIPDPITDDDATNPLTEGRLRR FDCSQMTDGGAGLVLVSDAYLRDHRDARPIGRIDGWGHRTVGLGLRQKLDRVAQGDSA PYLLPHVRATVLDALRRARVTLDDLDGIEVHDCFTPSEYLAIDHIGLTGPGESWKAIE NGEIEIGGRLPINPSGGLIGGGHPVGASGVRMLLDAAKQVSGIAGDYQVENAEAFGTL NFGGSTATTVSFVVSTTRGS" gene complement(1020058..1021329) /gene="PPE14" /locus_tag="Rv0915c" /db_xref="GeneID:885069" CDS complement(1020058..1021329) /gene="PPE14" /locus_tag="Rv0915c" /function="UNKNOWN. POSSIBLY A PROTECTIVE ANTIGEN INVOLVED WITH THE EARLY CONTROL OF INFECTION." /experiment="experimental evidence, no additional details recorded" /note="Rv0915c, (MTCY21C12.09c), len: 423 aa. PPE14 (alternate gene name: MTB41). Member of the Mycobacterium tuberculosis PPE family (see citation below), highly similar to many e.g. Rv1807 from Mycobacterium tuberculosis (403 aa), FASTA scores: opt: 966, E(): 4.4e-30, (45.7% identity in 392 aa overlap); etc. Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2.; MTB41" /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177765.1" /db_xref="GI:57116795" /db_xref="GeneID:885069" /translation="MDFGLLPPEVNSSRMYSGPGPESMLAAAAAWDGVAAELTSAAVS YGSVVSTLIVEPWMGPAAAAMAAAATPYVGWLAATAALAKETATQARAAAEAFGTAFA MTVPPSLVAANRSRLMSLVAANILGQNSAAIAATQAEYAEMWAQDAAVMYSYEGASAA ASALPPFTPPVQGTGPAGPAAAAAATQAAGAGAVADAQATLAQLPPGILSDILSALAA NADPLTSGLLGIASTLNPQVGSAQPIVIPTPIGELDVIALYIASIATGSIALAITNTA RPWHIGLYGNAGGLGPTQGHPLSSATDEPEPHWGPFGGAAPVSAGVGHAALVGALSVP HSWTTAAPEIQLAVQATPTFSSSAGADPTALNGMPAGLLSGMALASLAARGTTGGGGT RSGTSTDGQEDGRKPPVVVIREQPPPGNPPR" misc_feature complement(1020346..1020378) /gene="PPE14" /locus_tag="Rv0915c" /note="PS00626 Regulator of chromosome condensation (RCC1) signature 2" gene complement(1021344..1021643) /gene="PE7" /locus_tag="Rv0916c" /db_xref="GeneID:885167" CDS complement(1021344..1021643) /gene="PE7" /locus_tag="Rv0916c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0916c, (MTCY21C12.10c), len: 99 aa. PE7 (alternate gene name: MTB10). Member of the Mycobacterium tuberculosis PE family (see citations below), similar to many e.g. Rv1788 from Mycobacterium tuberculosis (99 aa), FASTA scores: opt: 321, E(): 1.3e-11, (53.5% identity in 99 aa overlap); etc.; MTB10" /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177766.1" /db_xref="GI:57116796" /db_xref="GeneID:885167" /translation="MSFVTIQPVVLAAATGDLPTIGTAVSARNTAVCAPTTGVLPPAA NDVSVLTAARFTAHTKHYRVVSKPAALVHGMFVALPAATADAYATTEAVNVVATG" gene 1022087..1023868 /gene="betP" /locus_tag="Rv0917" /db_xref="GeneID:885172" CDS 1022087..1023868 /gene="betP" /locus_tag="Rv0917" /function="HIGH-AFFINITY UPTAKE OF GLYCINE BETAINE. SUPPOSED RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv0917, (MTCY21C12.11), len: 593 aa. Possible betP, glycine betaine transporter, integral membrane protein, highly similar to many transporters, mainly glycine betaine transporters, e.g. P54582|BETP_CORGL glycine betaine transporter from Corynebacterium glutamicum (Brevibacterium flavum) (595 aa), FASTA scores: opt: 1367, E(): 0, (42.7% identity in 504 aa overlap); T35264 probable BCCT family transporter from Streptomyces coelicolor (578 aa); NP_243511.1|NC_002570 glycine betaine transporter from Bacillus halodurans (504 aa); NP_439848.1|NC_000907 high-affinity choline transport protein (betT) from Haemophilus influenzae (669 aa); etc. SEEMS TO BELONG TO THE BCCT (TC 2.33) FAMILY OF TRANSPORTERS." /codon_start=1 /transl_table=11 /product="glycine betaine transport integral membrane protein BetP" /protein_id="NP_215432.1" /db_xref="GI:15608057" /db_xref="GeneID:885172" /translation="MSAKERGDQNAVVDALRSIQPAVFIPASVVIVAMIVVSVVYSSV AENAFVRLNSAITGGVGWWYILVATGFVVFALYCGISRIGTIRLGRDDELPEFSFWAW LAMLFSAGMGIGLVFYGVAEPLSHYLRPPRSRGVPALTDAAANQAMALTVFHWGLHAW AIYVVVGLGMAYMTYRRGRPLSVRWLLEPVVGRGRVEGALGHAVDVIAIVGTLFGVAT SLGFGITQIASGLEYLGWIRVDNWWMVGMIAAITATATASVVSGVSKGLKWLSNINMA LAAALALFVLLLGPTLFLLQSWVQNLGGYVQSLPQFMLRTAPFSHDGWLGDWTIFYWG WWISWAPFVGMFIARISRGRTIREFIGAVLLVPTVIASLWFTIFGDSALLRQRNNGDM LVNGAVDTNTSLFRLLDGLPIGAITSVLAVLVIVFFFVTSSDSGSLVIDILSAGGELD PPKLTRVYWAVLEGVAAAVLLLIGGAGSLTALRTAAIATALPFSIVMVVACYAMTKAF HFDLAATPRLLHVTVPDVVAAGNRRRHDISATLSGLIAVRDVDSGTYIVHPDTGALTV TAPPDPLDDHVFESDRHVTRRNTTSSR" gene 1024211..1024687 /locus_tag="Rv0918" /db_xref="GeneID:885198" CDS 1024211..1024687 /locus_tag="Rv0918" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0918, (MTCY21C12.12), len: 158 aa. Conserved hypothetical protein, similar in part to Q50116 hypothetical protein from Mycobacterium leprae (44 aa), FASTA scores: opt: 132, E(): 0.0055, (65.6% identity in 32 aa overlap). Also some similarity in C-terminus with other hypothetical proteins e.g. NP_289961.1|NC_002655 hypothetical protein from Escherichia coli strain O157:H7 (94 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215433.1" /db_xref="GI:15608058" /db_xref="GeneID:885198" /translation="MHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSL GSSGLPALNCTPIRHTIDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFT VTAALAHARDVLADRRLFVLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG" gene 1024684..1025184 /locus_tag="Rv0919" /db_xref="GeneID:885221" CDS 1024684..1025184 /locus_tag="Rv0919" /function="UNKNOWN" /note="Rv0919, (MTCY21C12.13), len: 166 aa. Conserved hypothetical protein, some similarity to Q50115 hypothetical protein from Mycobacterium leprae (90 aa), FASTA scores: opt: 243, E(): 5.3e-11, (56.5% identity in 85 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215434.1" /db_xref="GI:15608059" /db_xref="GeneID:885221" /translation="MSGYSAPRRISDADDVTSFSSGEPSLDDYLRKRALANHVQGGSR CFVTCRDGRVVGFYALASGSVAHADAPGRVRRNMPDPVPVILLSRLAVDRKEQGRGLG SHLLRDAIGRCVQAADSIGLRAILVHALHDEARAFYVHFDFEISPTDPLHLMLLMKDA RALIGD" gene complement(1025321..1025393) /locus_tag="Rvnt14" /note="tRNA-Arg(CCT)" /db_xref="GeneID:2700422" tRNA complement(1025321..1025393) /locus_tag="Rvnt14" /product="tRNA-Arg" /note="codon recognized: AGG" /anticodon=(pos:1025358..1025360,aa:Arg) /db_xref="GeneID:2700422" repeat_region complement(1025458..1026893) /note="IS1554, len: 1436 bp. Putative Insertion sequence element bounded by 15 bp inverted repeats." /mobile_element="insertion sequence:IS1554" repeat_region 1025458..1025472 /note="15 bp inverted repeat, ATTCGGTGTAAGTGG, at the left end of IS1554 element" gene complement(1025497..1026816) /locus_tag="Rv0920c" /db_xref="GeneID:885549" CDS complement(1025497..1026816) /locus_tag="Rv0920c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1554." /note="Rv0920c, (MTCY21C12.14c), len: 439 aa. Probable transposase for IS1554, highly similar to others e.g. MTCY441.35|Q45111 transposase from Mycobacterium tuberculosis (419 aa), FASTA scores: opt: 1113, E(): 0, (43.9% identity in 378 aa overlap); etc. Contains transposases mutator family signature (PS01007)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215435.1" /db_xref="GI:15608060" /db_xref="GeneID:885549" /translation="MDAAQVIEPAHAGQDVDEAAVAARELSGAERALVGDLVRQARAE GVALTGPDGLLKALTKTVLEAALQEEMTEHLGYDRHAAAGRGSGNSRNGSRNKKVITD ACGQVEIAVPRDRNGTFEPVIVGKRKRRVTDVDRVVLSLYAKGLTTGEIAAHFADVYG VSVSKDTISRITDRVIEEMQAWWSRPLEKVYAAVFIDAIMVKIRDGQVRNRPVYAAIG VDLDGHKDILGMWAGEGDGESAKFWLAVLTDLRNRGVKDIFFLVCDGLKGLPDSVSAA FPLATVQTCIIHLIRNTFRYASRKYWDKISVDLKPIYTAASAAEARLRYEEFAEKWGK PYPAITRLWDSAWEEFIPFLDYDVEIRRVPCSTNAIESLNARYRRAVRARGHFPNEQS ALKTLYLVTRSLDPKGTGQTKWAVRWKPALNALAITFADRMPAAEER" misc_feature complement(1025953..1026027) /locus_tag="Rv0920c" /note="PS01007 Transposases, Mutator family, signature, D" repeat_region complement(1026879..1026893) /note="15 bp inverted repeat, ATTCGGTGTAAGTGG, at the right end of IS1554 element" repeat_region 1027061..1029360 /note="IS1535, len: 2300 bp. Putative Insertion sequence element bounded by 16 bp inverted repeats." /mobile_element="insertion sequence:IS1535" repeat_region 1027061..1027076 /note="16 bp inverted repeat, TTGAGTGTGTTTTAGT, at the left end of IS element IS1535" gene 1027104..1027685 /locus_tag="Rv0921" /db_xref="GeneID:885557" CDS 1027104..1027685 /locus_tag="Rv0921" /function="PREVENTS THE COINTEGRATION OF FOREIGN DNA BEFORE INTEGRATION INTO THE CHROMOSOME." /experiment="experimental evidence, no additional details recorded" /note="Rv0921, (MTCY21C12.15), len: 193 aa. Possible resolvase for IS1535, highly similar to many bacterial resolvases e.g. MTCY274.17c|YX1C_MYCTU Q10831 from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 537, E(): 5.7e-29, (51.8% identity in 166 aa overlap). Presents an helix turn helix motif." /codon_start=1 /transl_table=11 /product="resolvase" /protein_id="NP_215436.1" /db_xref="GI:15608061" /db_xref="GeneID:885557" /translation="MNLADWAESVGVNRHTAYRWFREGTLPVPAERVGRLILVKTAAS ASAAAAGVVLYARVSSHDRRSDLDRQVARLTAWATERDLGVGQVVCEVGSGLNGKRPK LRRILSDPDARVIVVEHRDRLARFGVEHLEAALSAQGRRIVVADPGETTDDLVCDMIE VLTGMCARLYGRRGARNRAMRAVTEAKREPGAG" gene 1027685..1029337 /locus_tag="Rv0922" /db_xref="GeneID:885564" CDS 1027685..1029337 /locus_tag="Rv0922" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1535" /note="Rv0922, (MTCY21C12.16), len: 550 aa. Possible transposase for IS1535, similar to many e.g. YX16_MYCTU|Q10809|MTCY274.16c from Mycobacterium tuberculosis (460 aa), FASTA scores: opt 939, E(): 0, (40.6% identity in 465 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215437.1" /db_xref="GI:15608062" /db_xref="GeneID:885564" /translation="MIVRMRSCAQAAKVAEATGGVQLAGKPKPDGTPTFSRYVEIGVD FEAHRPVVESVSVLFELYDGDANSYAATGGPGAQLPSGWMVTAAKFEVEWPADPQRAG LVRSHFGARRKAFNWGLAQVKADLDAKAADPAHESVDWDLKSLRWAWNRAKDDVAPWW AENSKECYSSGLADLAQGLANWKAGKNGTRKGRRVGFPRFKSGRRDPGRVRFTTGTMR IEDDRRTITVPVIGPLRAKENTRRVQRHLVSGRAQILNMTLSQRWGRLFVAVCYALRT PTTRSPLTQPTVRAGMDLGVRTLATVATLDTATGEQTIIEYPNPAPLKATLVARRRAG RELSRRIPGSHGHRAVKAKLARLDRRCVHLRREAAHQLTTELAGTYGQVVIEDLDVAA MKRSMRRRAFRRSVSDAAMGLVAPQLAYKTAKCSGVLTVADRWFASSQIHHGCTSPDG TPCRLQGKGRIDKHLLCPVTGEVVDRDRNAALNLRDWPDNASRGPVGTTAPSAPGPTT TVGTGHGADTGSSGAGGASVRPRPRRAGRGEAKTQTPQGDAA" repeat_region complement(1029345..1029360) /note="16 bp inverted repeat, TTGAGTGTGTTTTAGT, at the right end of IS element IS1535" gene complement(1029513..1030577) /locus_tag="Rv0923c" /db_xref="GeneID:885568" CDS complement(1029513..1030577) /locus_tag="Rv0923c" /function="UNKNOWN" /note="Rv0923c, (MTCY21C12.17c), len: 354 aa. Conserved hypothetical protein, showing similarity with C-terminal part of AF034138|AF034138_7|yjoB HYPOTHETICAL PROTEIN from Bacillus subtilis (200 aa), FASTA scores: opt: 193, E(): 4.2e-05, (32.3% identity in 167 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215438.1" /db_xref="GI:15608063" /db_xref="GeneID:885568" /translation="MPDRRHPYFAYGSNLCAHQMASRCPDAGAPRPAVLSDHNWLINQ RGVATVEPFAGNKVHGVLWQLSERDLVRLDSAEGVPVRYRRERLTVHTDDTALPAWVY IDHRVMPGRPRPGYLPRVIDGARHHGLPQRWIDYLHRWDPARWPLPVLPSSRSGPAPQ SLSELLSQPGVIETSQLRSRFGFLAIHGGGLEQVTDLIAERSAEAAGASVYLLRHPDN YPHHLPSARFDPAESARLAEFLDHVDVAVSLHGYDRIGRSTQLLAGGRNRALAAHLAR HIQLPGYRVVTDLAAIPEELRGLHPDNPVNRVRDGGTQLELSIRVRGLGPRSTLPGVG GMSPVTATLVQGLVTAARSW" gene complement(1030578..1031864) /gene="mntH" /locus_tag="Rv0924c" /db_xref="GeneID:885569" CDS complement(1030578..1031864) /gene="mntH" /locus_tag="Rv0924c" /function="H(+)-STIMULATED, HIGHLY SELECTIVE, DIVALENT CATION UPTAKE SYSTEM. RESPONSIBLE FOR THE TRANSLOCATION OF THE DIVALENT METAL ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv0924c, (MTCY21C12.18c), len: 428 aa. mntH (alternative gene name: Nramp, Mramp), H+-dependent divalent cation-transport integral membrane protein (see citations below), equivalent to O69443|MNTH_MYCBO PROBABLE MANGANESE TRANSPORT PROTEIN MNTH (BRAMP) from Mycobacterium bovis (415 aa); and NP_302396.1|NC_002677 probable manganese transport protein from Mycobacterium leprae (426 aa). Also similar (but longer 51 aa in N-terminus) to AAA63075.1|U15184 SMF2 protein from Mycobacterium leprae (377 aa), FASTA scores: opt: 1780, E(): 0, (74.5% identity in 376 aa overlap). Also similar to many orthologues of the eukaryotic Nramp (natural resistance-associated macrophage protein), also known as mntH, e.g. NP_456951.1|NC_003198 manganese transport protein MntH from Salmonella enterica subsp. enterica serovar Typhi (413 aa); etc. BELONGS TO THE NRAMP FAMILY.; Nramp; Mramp" /codon_start=1 /transl_table=11 /product="manganese transport protein MntH" /protein_id="YP_177767.1" /db_xref="GI:57116797" /db_xref="GeneID:885569" /translation="MAGEFRLLSHLCSRGSKVGELAQDTRTSLKTSWYLLGPAFVAAI AYVDPGNVAANVSSGAQFGYLLLWVIVAANVMAALVQYLSAKLGLVTGRSLPEAIGKR MGRPARLAYWAQAEIVAMATDVAEVIGGAIALRIMFNLPLPIGGIITGVVSLLLLTIQ DRRGQRLFERVITALLLVIAIGFTASFFVVTPPPNAVLGGLAPRFQGTESVLLAAAIM GATVMPHAVYLHSGLARDRHGHPDPGPQRRRLLRVTRWDVGLAMLIAGGVNAAMLLVA ALNMRGRGDTASIEGAYHAVHDTLGATIAVLFAVGLLASGLASSSVGAYAGAMIMQGL LHWSVPMLVRRLITLGPALAILTLGFDPTRTLVLSQVVLSFGIPFAVLPLVKLTGSPA VMGGDTNHRATTWVGWVVAVMVSLLNVMLIYLTVTG" gene complement(1031896..1032633) /locus_tag="Rv0925c" /db_xref="GeneID:885570" CDS complement(1031896..1032633) /locus_tag="Rv0925c" /function="UNKNOWN" /note="Rv0925c, (MTCY21C12.19c), len: 245 aa. Conserved hypothetical protein, similar to AL132991|SCF55_19 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (197 aa), FASTA scores: opt: 459, E(): 1.2e-23, (39.3% identity in 201 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215440.1" /db_xref="GI:15608065" /db_xref="GeneID:885570" /translation="MTTTSDQNAAAPPRFDGLRALFINATLKRSPELSHTDGLIERSS GIMREHGVQVDTLRAVDHDIATGVWPDMTEHGWATDEWPALYRRVLDAHILVLCGPIW LGDNSSVMKRVIERLYACSSLLNEDGQYAYYGRAGGCLITGNEDGVKHCAMNVLYSLQ HLGYTIPPQADAGWIGEAGPGPSYLDPGSGGPENDFTNRNTTFMTFNLMHIAQMLRVA GGIPAYGNQRTKWDAGCRPDFANPDYR" gene complement(1032710..1033786) /locus_tag="Rv0926c" /db_xref="GeneID:885387" CDS complement(1032710..1033786) /locus_tag="Rv0926c" /function="UNKNOWN" /note="Rv0926c, (MTCY21C12.20c), len: 358 aa. Conserved hypothetical protein, similar to Rv1059 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (354 aa). Also shows some similarity to AF170923|AF170923_3 dihydrodipicolinate reductase from Mastigocladus laminosus (278 aa), FASTA scores: opt: 170, E(): 0.00088, (25.7% identity in 276 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215441.1" /db_xref="GI:15608066" /db_xref="GeneID:885387" /translation="MAIPVVQLGTGNVGVHSLRALIADPEFELTGVWVSSDAKAGKDA AELAGLADSTGVRASTDLNAVLATGPRCAVYNAMADNRLPEALEDYRRILAAGINIVG SGPVFLQYPWQVIPDEIIKPLQDAARAGNSSLYVNGIDPGFANDLLPMALAGTCESIE QIRCMEIVDYATYDSAVVMFDVMGFGKPMDQIPMLLQPGVLSLAWGSVVRQLAAGLGI SLDGVEEMYVREPAPEAFNIASGHIPKGSAAALRFEVLGLVDGVPAVVLEHVTRLRAD LCPEWPQPAQPGGSYRIEISGEPCYAMDICLSSRHGDHNHAGLVATAMRIVNAIPAVV AAEPGIRTTLDLPLITGEGRYAAA" gene complement(1033840..1034631) /locus_tag="Rv0927c" /db_xref="GeneID:885571" CDS complement(1033840..1034631) /locus_tag="Rv0927c" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0927c, (MTCY21C12.21c), len: 263 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases, notably 7-alpha-hydroxysteroid dehydrogenases and glucose 1-dehydrogenases e.g. P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa), FASTA scores: opt: 551, E(): 1e-26, (39.5% identity in 248 aa overlap); NP_252778.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (253 aa); AAC44307.1|U59433 3-ketoacyl-acyl carrier protein reductase from Bacillus subtilis (246 aa); etc. Also similar to other dehydrogenases from Mycobacterium tuberculosis e.g. MTCY09F9.36, E():1.4e-18; MTCY369.14, E():8e-17; MTCY02B10.14, E():2.5e-14; MTCY09F9.23c, E():1.5e-13; MTCY03C7.07, E():1.9e-13. Contains PS00061 Short-chain dehydrogenases/reductases family signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_215442.1" /db_xref="GI:15608067" /db_xref="GOA:O05919" /db_xref="UniProtKB/TrEMBL:O05919" /db_xref="GeneID:885571" /translation="MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTS SELDAVAEQIRAAGRRAHTVAADLAHPEVTAQLAGQAVGAFGKLDIVVNNVGGTMPNT LLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMGRLAARGFAAY GTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVVAANDELRAPMEQATPLRR LGDPVDIAAAAVYLASPAGSFLTGKTLEVDGGLTFPNLDLPIPDL" misc_feature complement(1033900..1033923) /locus_tag="Rv0927c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(1034107..1034193) /locus_tag="Rv0927c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 1034903..1036015 /gene="pstS3" /locus_tag="Rv0928" /db_xref="GeneID:885366" CDS 1034903..1036015 /gene="pstS3" /locus_tag="Rv0928" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT). THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0928, (MTCY21C12.22), len: 370 aa. pstS3 (previously known as phoS2), phosphate-binding lipoprotein component of inorganic phosphate transport system (see citations below), highly similar to others from Mycobacterium leprae e.g. Q50099|PSTS3|PHOS1 phosphate-binding protein 3 precursor (328 aa), FASTA scores: opt: 1772, E(): 0, (79.6% identity in 328 aa overlap); and highly similar to others e.g. AAF74819.1|AF137360_1|AF137360 periplasmic phosphate permease from Mycobacterium avium (369 aa). Also highly similar to Rv0932c|MTCY08D9.07|pstS2 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN (370 aa); and Rv0934|pstS1 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN (374 aa) from Mycobacterium tuberculosis (Mycobacterium tuberculosis seems to have three PstS-like proteins, others being Rv0932c and Rv0934c). Contains lipoprotein signature (PS00013) at N-terminus. BELONGS TO FAMILY OF PHOSPHATE RECEPTORS FOR BACTERIAL ABC-TYPE LIPOPROTEIN TRANSPORTERS.; phoS2" /codon_start=1 /transl_table=11 /product="periplasmic phosphate-binding lipoprotein PSTS3 (PBP-3) (PSTS3) (PHOS1)" /protein_id="YP_177768.1" /db_xref="GI:57116798" /db_xref="GOA:O86343" /db_xref="UniProtKB/Swiss-Prot:O86343" /db_xref="GeneID:885366" /translation="MKLNRFGAAVGVLAAGALVLSACGNDDNVTGGGATTGQASAKVD CGGKKTLKASGSTAQANAMTRFVNVFEQACPGQTLNYTANGSGAGISEFNGNQTDFGG SDVPLSKDEAAAAQRRCGSPAWNLPVVFGPIAVTYNLNSVSSLNLDGPTLAKIFNGSI TQWNNPAIQALNRDFTLPGERIHVVFRSDESGTTDNFQRYLQAASNGAWGKGAGKSFQ GGVGEGARGNDGTSAAAKNTPGSITYNEWSFAQAQHLTMANIVTSAGGDPVAITIDSV GQTIAGATISGVGNDLVLDTDSFYRPKRPGSYPIVLATYEIVCSKYPDSQVGTAVKAF LQSTIGAGQSGLGDNGYIPIPDEFKSRLSTAVNAIA" misc_feature 1034939..1034971 /gene="pstS3" /locus_tag="Rv0928" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1036028..1037002 /gene="pstC2" /locus_tag="Rv0929" /db_xref="GeneID:885585" CDS 1036028..1037002 /gene="pstC2" /locus_tag="Rv0929" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT); RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE. THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /note="Rv0929, (MTCY21C12.23), len: 324 aa. pstC2, phosphate-transport integral membrane ABC transporter (see citations below), highly similar to others e.g. NP_302394.1|NC_002677 membrane-bound component of phosphate transport from Mycobacterium leprae (319 aa); CAB88474.1|AL353816 phosphate ABC transport system permease protein from Streptomyces coelicolor (336 aa); NP_290359.1| NC_002655 high-affinity phosphate-specific transport system (cytoplasmic membrane component) from Escherichia coli strain O157:H7 (319 aa); etc. Also similar to Rv935|MTCY08D9.04c|PSTC1 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (338 aa). Contains binding-protein-dependent transport systems inner membrane component signature (PS00402)." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter transmembrane protein" /protein_id="NP_215444.1" /db_xref="GI:15608069" /db_xref="GOA:O86344" /db_xref="UniProtKB/Swiss-Prot:O86344" /db_xref="GeneID:885585" /translation="MVTEPLTKPALVAVDMRPARRGERLFKLAASAAGSTIVIAILLI AIFLLVRAVPSLRANHANFFTSTQFDTSDDEQLAFGVRDLFMVTALSSITALVLAVPV AVGIAVFLTHYAPRRLSRPFGAMVDLLAAVPSIIFGLWGIFVLAPKLEPIARFLNRNL GWLFLFKQGNVSLAGGGTIFTAGIVLSVMILPIVTSISREVFRQTPLIQIEAALALGA TKWEVVRMTVLPYGRSGVVAASMLGLGRALGETVAVLVILRSAARPGTWSLFDGGYTF ASKIASAASEFSEPLPTGAYISAGFALFVLTFLVNAAARAIAGGKVNG" misc_feature 1036631..1036717 /gene="pstC2" /locus_tag="Rv0929" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene 1036999..1037925 /gene="pstA1" /locus_tag="Rv0930" /db_xref="GeneID:885589" CDS 1036999..1037925 /gene="pstA1" /locus_tag="Rv0930" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT); RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE. THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /note="Rv0930, (MTCY21C12.24), len: 308 aa. Probable pstA1, phosphate-transport integral membrane ABC transporter (see citation below), highly similar to others e.g. NP_302393.1|NC_002677 membrane-bound component of phosphate transport from Mycobacterium leprae (304 aa); CAB88473.1|AL353816 phosphate ABC transport system permease protein from Streptomyces coelicolor (354 aa) (N-terminus longer); NP_312689.1|NC_002695 phosphate transport system permease protein PstA from Escherichia coli strain O157:H7 (296 aa); etc. Also similar to Rv0936|MTCY08D9.03c|PSTA2 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (301 aa)." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter transmembrane protein" /protein_id="NP_215445.2" /db_xref="GI:57116799" /db_xref="GOA:O86345" /db_xref="UniProtKB/Swiss-Prot:O86345" /db_xref="GeneID:885589" /translation="MSPSMSIEALDQPVKPVVFRPLTLRRRIKNSVATTFFFTSFVVA LIPLVWLLWVVIARGWFAVTRSGWWTHSLRGVLPEQFAGGVYHALYGTLVQAGVAAVL AVPLGLMTAVYLVEYGTGRMSRVTTFTVDVLAGVPSIVAALFVFSLWIATLGFQQSAF AVALALVLLMLPVVVRAGEEMLRLVPDELREASYALGVPKWKTIVRIVAPIAMPGIVS GILLSIARVVGETAPVLVLVGYSHSINLDVFHGNMASLPLLIYTELTNPEHAGFLRVW GAALTLIIVVATINLAAAMIRFVATRRRRLPL" gene complement(1037920..1039914) /gene="pknD" /locus_tag="Rv0931c" /db_xref="GeneID:885607" CDS complement(1037920..1039914) /gene="pknD" /locus_tag="Rv0931c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO REGULATE PHOSPHATE TRANSPORT. CAN PHOSPHORYLATE THE PEPTIDE SUBSTRATE MYELIN BASIC PROTEIN (MBP) AT SERINE AND THREONINE RESIDUES. CAN BE AUTOPHOSPHORYLATED ON THREONINE RESIDUES [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv0931c, (MTCY08D9.08), len: 664 aa. pknD (alternate gene name: mbk), transmembrane serine/threonine protein kinase (EC 2.7.1.-) (see citations below), equivalent to CAB62227.1|AJ250200 putative serine/threonine protein kinase from Mycobacterium bovis BCG (291 aa); and highly similar in N-terminus to P54744|PKNB_MYCLE probable serine/threonine-specific protein kinase (EC 2.7.1.-) from Mycobacterium leprae (622 aa). Also highly similar to others, particularly in N-terminal half e.g. NP_243370.1|NC_002570 serine/threonine protein kinase from Bacillus halodurans (664 aa); NP_268044.1|NC_002662 serine/threonine protein kinase from Lactococcus lactis (627 aa); etc. Also highly similar to other serine/threonine protein kinases from Mycobacterium tuberculosis e.g. pknH (626 aa), FASTA scores: opt: 1398, E: 0, (49.3% identity in 540 aa overlap); pknE (566 aa); pknB (626 aa); Rv3524 (343 aa); etc. Contains Hank's kinase subdomain. Contains two transmembrane segments, which flank a highly repetitive region, suggesting a receptor-like anchoring. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation on a serine residue. Appears to be co-transcribed with Rv0932c|pstS2.; mbk" /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase D PKND (protein kinase D) (STPK D)" /protein_id="NP_215446.1" /db_xref="GI:15608071" /db_xref="GOA:O05871" /db_xref="UniProtKB/Swiss-Prot:O05871" /db_xref="GeneID:885607" /translation="MSDAVPQVGSQFGPYQLLRLLGRGGMGEVYEAEDTRKHRVVALK LISPQYSDNAVFRARMQREADTAGRLTEPHIVPIHDYGEINGQFFVEMRMIDGTSLRA LLKQYGPLTPARAVAIVRQIAAALDAAHANGVTHRDVKPENILVTASDFAYLVDFGIA RAASDPGLTQTGTAVGTYNYMAPERFTGDEVTYRADIYALACVLGECLTGAPPYRADS VERLIAAHLMDPAPQPSQLRPGRVPPALDQVIAKGMAKNPAERFMSAGDLAIAAHDAL TTSEQHQATTILRRGDNATLLATPADTGLSQSESGIAGAGTGPPTPGAARWSPGDSAT VAGPLAADSRGGNWPSQTGHSPAVPNALQASLGHAVPPAGNKRKVWAVVGAAAIVLVA IVAAAGYLVLRPSWSPTQASGQTVLPFTGIDFRLSPSGVAVDSAGNVYVTSEGMYGRV VKLATGSTGTTVLPFNGLYQPQGLAVDGAGTVYVTDFNNRVVTLAAGSNNQTVLPFDG LNYPEGLAVDTQGAVYVADRGNNRVVKLAAGSKTQTVLPFTGLNDPDGVAVDNSGNVY VTDTDNNRVVKLEAESNNQVVLPFTDITAPWGIAVDEAGTVYVTEHNTNQVVKLLAGS TTSTVLPFTGLNTPLAVAVDSDRTVYVADRGNDRVVKLTS" misc_feature complement(1039477..1039515) /gene="pknD" /locus_tag="Rv0931c" /note="PS00108 Serine/Threonine protein kinases active-site signature" misc_feature complement(1039783..1039854) /gene="pknD" /locus_tag="Rv0931c" /note="PS00107 Protein kinases ATP-binding region signature" gene complement(1039936..1041048) /gene="pstS2" /locus_tag="Rv0932c" /db_xref="GeneID:885613" CDS complement(1039936..1041048) /gene="pstS2" /locus_tag="Rv0932c" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT). THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0932c, (MTCY08D9.07), len: 370 aa. pstS2, phosphate-binding lipoprotein component of inorganic phosphate transport system (see citations below), highly similar to AAF74819.1|AF137360_1|AF137360 periplasmic phosphate permease from Mycobacterium avium (369 aa); Rv0928|MTCY21C12.22|pstS3 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (370 aa), FASTA scores: opt: 1601, E(): 0, (64.5% identity in 372 aa overlap); and Rv0934|MTCY08D9.05c|pstS1 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (374 aa) (Mycobacterium tuberculosis seems to have three PstS-like proteins, others being Rv0928 and Rv0934c). Also highly similar to MTCY08D9.05c|P15712|PAB_MYCTU PROTEIN ANTIGEN B PRECURSOR from Mycobacterium tuberculosis (374 aa), FASTA scores: opt: 460, E(): 2.7e-20, (31.2% identity in 375 aa overlap). Contains prokaryotic membrane lipoprotein lipid attachment site (PS00013) at N-terminus so the leader peptide of 22 aa is probably removed. BELONGS TO FAMILY OF PHOSPHATE RECEPTORS FOR BACTERIAL ABC-TYPE LIPOPROTEIN TRANSPORTERS. Appears to be co-transcribed with Rv0931c|pknD|mbk." /codon_start=1 /transl_table=11 /product="periplasmic phosphate-binding lipoprotein PSTS2 (PBP-2) (PSTS2)" /protein_id="YP_177769.1" /db_xref="GI:57116800" /db_xref="GOA:O05870" /db_xref="UniProtKB/Swiss-Prot:O05870" /db_xref="GeneID:885613" /translation="MKFARSGAAVSLLAAGTLVLTACGGGTNSSSSGAGGTSGSVHCG GKKELHSSGSTAQENAMEQFVYAYVRSCPGYTLDYNANGSGAGVTQFLNNETDFAGSD VPLNPSTGQPDRSAERCGSPAWDLPTVFGPIAITYNIKGVSTLNLDGPTTAKIFNGTI TVWNDPQIQALNSGTDLPPTPISVIFRSDKSGTSDNFQKYLDGASNGAWGKGASETFN GGVGVGASGNNGTSALLQTTDGSITYNEWSFAVGKQLNMAQIITSAGPDPVAITTESV GKTIAGAKIMGQGNDLVLDTSSFYRPTQPGSYPIVLATYEIVCSKYPDATTGTAVRAF MQAAIGPGQEGLDQYGSIPLPKSFQAKLAAAVNAIS" misc_feature complement(1040980..1041012) /gene="pstS2" /locus_tag="Rv0932c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1041264..1042094 /gene="pstB" /locus_tag="Rv0933" /db_xref="GeneID:885653" CDS 1041264..1042094 /gene="pstB" /locus_tag="Rv0933" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT); RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM. THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT. HAVE ATP-BINDING ABILITY AND ATPase ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="ATP-binding protein; PstABCS is an ATP dependent phosphate uptake system which is responsible for inorganic phosphate uptake during phosphate starvation" /codon_start=1 /transl_table=11 /product="phosphate ABC transporter ATP-binding protein" /protein_id="NP_215448.1" /db_xref="GI:15608073" /db_xref="GOA:P95302" /db_xref="UniProtKB/Swiss-Prot:P95302" /db_xref="GeneID:885653" /translation="MACERLGGQSGAADVDAAAPAMAAVNLTLGFAGKTVLDQVSMGF PARAVTSLMGPTGSGKTTFLRTLNRMNDKVSGYRYSGDVLLGGRSIFNYRDVLEFRRR VGMLFQRPNPFPMSIMDNVLAGVRAHKLVPRKEFRGVAQARLTEVGLWDAVKDRLSDS PFRLSGGQQQLLCLARTLAVNPEVLLLDEPTSALDPTTTEKIEEFIRSLADRLTVIIV THNLAQAARISDRAALFFDGRLVEEGPTEQLFSSPKHAETARYVAGLSGDVKDAKRGN" misc_feature 1041423..1041446 /gene="pstB" /locus_tag="Rv0933" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1041753..1041797 /gene="pstB" /locus_tag="Rv0933" /note="PS00211 ABC transporters family signature" gene 1042115..1043239 /gene="pstS1" /locus_tag="Rv0934" /db_xref="GeneID:885724" CDS 1042115..1043239 /gene="pstS1" /locus_tag="Rv0934" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT). THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0934, (MTCY08D9.05c), len: 374 aa. pstS1 (previously known as phoS1 or phoS), phosphate-binding lipoprotein component of inorganic phosphate transport system (see citations below), highly similar to Rv0932c|MTCY08D9.07|pstS2 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (370 aa), FASTA scores: opt: 460, E(): 5.9e-19, (31.2% identity in 375 aa overlap); and Rv0928|MTCY21C12.22|pstS3 PHOSPHATE-BINDING PERIPLASMIC LIPOPROTEIN from Mycobacterium tuberculosis (374 aa), FASTA scores: opt: 435, E():1.1e-17, (30.0% identity in 380 aa overlap) (Mycobacterium tuberculosis seems to have three PstS-like proteins, others being Rv0932c and Rv0928c). Also highly similar to MTCY08D9.05c|P15712|PAB_MYCTU PROTEIN ANTIGEN B PRECURSOR from Mycobacterium tuberculosis (374 aa), FASTA scores: opt: 2459, E(): 0, (100% identity in 374 aa overlap). Contains a prokaryotic membrane lipoprotein lipid attachment site (PS00013) at the N-terminus so the 23 aa leader peptide sequence is probably removed. BELONGS TO FAMILY OF PHOSPHATE RECEPTORS FOR BACTERIAL ABC-TYPE LIPOPROTEIN TRANSPORTERS.; phoS1; phoS" /codon_start=1 /transl_table=11 /product="periplasmic phosphate-binding lipoprotein PSTS1 (PBP-1) (PSTS1)" /protein_id="YP_177770.1" /db_xref="GI:57116801" /db_xref="GOA:P15712" /db_xref="UniProtKB/Swiss-Prot:P15712" /db_xref="GeneID:885724" /translation="MKIRLHTLLAVLTAAPLLLAAAGCGSKPPSGSPETGAGAGTVAT TPASSPVTLAETGSTLLYPLFNLWGPAFHERYPNVTITAQGTGSGAGIAQAAAGTVNI GASDAYLSEGDMAAHKGLMNIALAISAQQVNYNLPGVSEHLKLNGKVLAAMYQGTIKT WDDPQIAALNPGVNLPGTAVVPLHRSDGSGDTFLFTQYLSKQDPEGWGKSPGFGTTVD FPAVPGALGENGNGGMVTGCAETPGCVAYIGISFLDQASQRGLGEAQLGNSSGNFLLP DAQSIQAAAAGFASKTPANQAISMIDGPAPDGYPIINYEYAIVNNRQKDAATAQTLQA FLHWAITDGNKASFLDQVHFQPLPPAVVKLSDALIATISS" misc_feature 1042154..1042186 /gene="pstS1" /locus_tag="Rv0934" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1043299..1044315 /gene="pstC1" /locus_tag="Rv0935" /db_xref="GeneID:885644" CDS 1043299..1044315 /gene="pstC1" /locus_tag="Rv0935" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT); RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE. THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0935, (MTCY08D9.04c), len: 338 aa. pstC1, phosphate-transport integral membrane ABC transporter (see citations below), highly similar to others e.g. NP_104768.1|NC_002678|pstC phosphate ABC transporter permease protein from Mesorhizobium loti (327 aa); NP_245372.1|NC_002663|PstC PstC protein from Pasteurella multocida (320 aa); P45191|PSTC_HAEIN PHOSPHATE TRANSPORT SYSTEM PERMEASE from Haemophilus influenza (315 aa), FASTA scores: opt: 667, E(): 0, (36.2% identity in 309 aa overlap); etc. Also similar to Rv0929|MTCY21C12.23|PSTC2 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (324 aa), FASTA scores: opt: 487, E(): 4.1e-21, (32.3% identity in 303 aa overlap); and shows slight similarity to MTCY08D9.03c|PSTA2|Rv0936 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (301 aa). Contains binding-protein-dependent transport systems inner membrane comp signature (PS00402)." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter transmembrane protein" /protein_id="YP_177771.1" /db_xref="GI:57116802" /db_xref="GOA:P95303" /db_xref="UniProtKB/Swiss-Prot:P95303" /db_xref="GeneID:885644" /translation="MLARAGEVGRAGPAIRWLGGIGAVIPLLALVLVLVVLVIEAMGA IRLNGLHFFTATEWNPGNTYGETVVTDGVAHPVGAYYGALPLIVGTLATSAIALIIAV PVSVGAALVIVERLPKRLAEAVGIVLELLAGIPSVVVGLWGAMTFGPFIAHHIAPVIA HNAPDVPVLNYLRGDPGNGEGMLVSGLVLAVMVVPIIATTTHDLFRQVPVLPREGAIA LGMSNWECVRRVTLPWVSSGIVGAVVLGLGRALGETMAVAMVSGAVLGAMPANIYATM TTIAATIVSQLDSAMTDSTNFAVKTLAEVGLVLMVITLLTNVAARGMVRRVSRTALPV GRGI" misc_feature 1043911..1043997 /gene="pstC1" /locus_tag="Rv0935" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene 1044317..1045222 /gene="pstA2" /locus_tag="Rv0936" /db_xref="GeneID:885756" CDS 1044317..1045222 /gene="pstA2" /locus_tag="Rv0936" /function="INVOLVED IN ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE (IMPORT); RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE. THIS IS ONE OF THE PROTEINS REQUIRED FOR BINDING-PROTEIN-MEDIATED PHOSPHATE TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv0936, (MTCY08D9.03c), len: 301 aa. pstA2, phosphate-transport integral membrane ABC transporter (see citations below), highly similar to others e.g. NP_442269.1|NC_000911|PstA phosphate transport system permease protein from Synechocystis sp. strain PCC 6803 (287 aa); NP_232473.1|NC_002506 phosphate ABC transporter permease protein from Vibrio cholerae (289 aa); P07654|PSTA_ECOLI PHOSPHATE TRANSPORT SYSTEM PERMEASE from Escherichia coli (296 aa), FASTA scores: opt: 464, E(): 6.7e-24, (30.5% identity in 282 aa overlap); etc. Also similar to O86345|MTCY21C12.24|PSTA1|Rv0930 PROBABLE TRANSMEMBRANE ABC TRANSPORTER COMPONENT OF PHOSPHATE UPTAKE SYSTEM from Mycobacterium tuberculosis (304 aa), FASTA scores: opt: 369, E(): 6.1e-15, (32.7% identity in 248 aa overlap). Contains binding-protein-dependent transport systems inner membrane comp signature (PS00402)." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter transmembrane protein" /protein_id="NP_215451.1" /db_xref="GI:15608076" /db_xref="GOA:Q50796" /db_xref="UniProtKB/Swiss-Prot:Q50796" /db_xref="GeneID:885756" /translation="MGESAESGSRQLPAMSPPRRSVAYRRKIVDALWWAACVCCLAVV ITPTLWMLIGVVSRAVPVFHWSVLVQDSQGNGGGLRNAIIGTAVLAIGVILVGGTVSV LTGIYLSEFATGKTRSILRGAYEVLSGIPSIVLGYVGYLALVVYFDWGFSLAAGVLVL SVMSIPYIAKATESALAQVPTSYREAAEALGLPAGWALRKIVLKTAMPGIVTGMLVAL ALAIGETAPLLYTAGWSNSPPTGQLTDSPVGYLTYPIWTFYNQPSKSAQDLSYDAALL LIVFLLLLIFIGRLINWLSRRRWDV" misc_feature 1044842..1044928 /gene="pstA2" /locus_tag="Rv0936" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene complement(1045199..1046020) /locus_tag="Rv0937c" /db_xref="GeneID:885050" CDS complement(1045199..1046020) /locus_tag="Rv0937c" /function="UNKNOWN" /note="Rv0937c, (MTCY08D9.02), len: 273 aa. Conserved hypothetical protein, highly similar to others e.g. SC6G9.24c|T35620|AL079356 hypothetical protein from Streptomyces coelicolor (365 aa), FASTA scores: opt: 648, E(): 0, (36.5% identity in 274 aa overlap); Z99110|BSUB0007_223|NP_389224.1|NC_000964 hypothetical proteins from Bacillus subtilis (311 aa), FASTA scores: opt: 623, E(): 1.1e-31, (33.9% identity in 274 aa overlap); O28548|AE000984|AF1726|NP_070554.1|NC_000917 conserved hypothetical protein from Archaeoglobus fulgidus (286 aa), FASTA scores: opt: 583, E(): 0, (36.6% identity in 262 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215452.1" /db_xref="GI:15608077" /db_xref="GOA:O05866" /db_xref="UniProtKB/TrEMBL:O05866" /db_xref="GeneID:885050" /translation="MRAIWTGSIAFGLVNVPVKVYSATADHDIRFHQVHAKDNGRIRY KRVCEACGEVVDYRDLARAYESGDGQMVAITDDDIASLPEERSREIEVLEFVPAADVD PMMFDRSYFLEPDSKSSKSYVLLAKTLAETDRMAIVHFTLRNKTRLAALRVKDFGKRE VMMVHTLLWPDEIRDPDFPVLDQKVEIKPAELKMAGQVVDSMADDFNPDRYHDTYQEQ LQELIDTKLEGGQAFTAEDQPRLLDEPEDVSDLLAKLEASVKARSKANSNVPTPP" gene 1046136..1048415 /locus_tag="Rv0938" /db_xref="GeneID:885561" CDS 1046136..1048415 /locus_tag="Rv0938" /EC_number="6.5.1.1" /function="THIS PROTEIN IS THOUGHT TO SEAL NICKS IN DOUBLE-STRANDED DNA DURING DNA REPLICATION, DNA RECOMBINATION AND DNA REPAIR [CATALYTIC ACTIVITY:ATP + {DEOXYRIBONUCLEOTIDE}(N) + {DEOXYRIBONUCLEOTIDE}(M) = AMP + DIPHOSPHATE + {DEOXYRIBONUCLEOTIDE}(N+M)]." /note="catalyzes the ATP-dependent formation of a phosphodiester at the site of a single-strand break in duplex DNA and has been shown to have polymerase activity" /codon_start=1 /transl_table=11 /product="ATP-dependent DNA ligase" /protein_id="NP_215453.1" /db_xref="GI:15608078" /db_xref="GOA:P71571" /db_xref="UniProtKB/Swiss-Prot:P71571" /db_xref="GeneID:885561" /translation="MGSASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHI AGRPATRKRWPNGVDQPAFFEKQLALSAPPWLSRATVAHRSGTTTYPIIDSATGLAWI AQQAALEVHVPQWRFVAEPGSGELNPGPATRLVFDLDPGEGVMMAQLAEVARAVRDLL ADIGLVTFPVTSGSKGLHLYTPLDEPVSSRGATVLAKRVAQRLEQAMPALVTSTMTKS LRAGKVFVDWSQNSGSKTTIAPYSLRGRTHPTVAAPRTWAELDDPALRQLSYDEVLTR IARDGDLLERLDADAPVADRLTRYRRMRDASKTPEPIPTAKPVTGDGNTFVIQEHHAR RPHYDFRLECDGVLVSWAVPKNLPDNTSVNHLAIHTEDHPLEYATFEGAIPSGEYGAG KVIIWDSGTYDTEKFHDDPHTGEVIVNLHGGRISGRYALIRTNGDRWLAHRLKNQKDQ KVFEFDNLAPMLATHGTVAGLKASQWAFEGKWDGYRLLVEADHGAVRLRSRSGRDVTA EYPQLRALAEDLADHHVVLDGEAVVLDSSGVPSFSQMQNRGRDTRVEFWAFDLLYLDG RALLGTRYQDRRKLLETLANATSLTVPELLPGDGAQAFACSRKHGWEGVIAKRRDSRY QPGRRCASWVKDKHWNTQEVVIGGWRAGEGGRSSGVGSLLMGIPGPGGLQFAGRVGTG LSERELANLKEMLAPLHTDESPFDVPLPARDAKGITYVKPALVAEVRYSEWTPEGRLR QSSWRGLRPDKKPSEVVRE" gene 1048412..1050346 /locus_tag="Rv0939" /db_xref="GeneID:885560" CDS 1048412..1050346 /locus_tag="Rv0939" /EC_number="5.3.3.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM, POSSIBLY IN A DEGRADATION PATHWAY." /note="Rv0939, (MTCY10D7.35c), len: 644 aa. Possible bifunctional enzyme, including 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase activity (EC 5.3.3.-), and cyclase/dehydrase activity (EC undetermined). N-terminal part similar to many isomerases e.g. NP_343861.1|NC_002754 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (hpcE-1) from Sulfolobus solfataricus (318 aa); NP_068932.1|NC_000917 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (hpcE-1) from Archaeoglobus fulgidus (324 aa), FASTA scores: opt: 400, E(): 5.8e-15, (33.9% identity in 289 aa overlap); etc. And C-terminal part highly similar to many cyclases/dehydrases e.g. AAK61721.1|AY033994 cyclase-like protein from Streptomyces aureofaciens (305 aa); CAC44204.1|AL593842 cyclase from Streptomyces coelicolor (297 aa), FASTA scores: opt: 375, E(): 2.7e-26, (35.6% identity in 284 aa overlap); NP_343860.1|NC_002754 putative Cyclase/dehydrase from Sulfolobus solfataricus (308 aa); etc. Also similar to Rv2993c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="bifunctional 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase/cyclase/dehydrase" /protein_id="NP_215454.1" /db_xref="GI:15608079" /db_xref="GOA:O86346" /db_xref="UniProtKB/TrEMBL:O86346" /db_xref="GeneID:885560" /translation="MKWVTYRSDHGERTGVLSGDAIYAMPPDVSLLDLVGRGADGLRT AGERAVRSPAAVVALDEVTLAAPIPRPPSIRDSLCFLDHMRNCQEAMGGGRVLMDTWY RIPAFYFACPSTVLGPYDDAPTAPGSAWQDFELEIAAVIGTSGKDLTVEQAERSIIGY TIFNDWSARDLQMLEGQLRIGQAKGKDSGITLGPYLVTPDELEPYCRGGKLSLRVIAL VNGTVIGSGSTAQMDWSFGEVIAYASRGVTLTPGDVFGSGTVPTCTLVEHLRPPESFP GWLHDGDVVTLQVEGLGETRQTVRTSGTPFPLALRPNPDAEPDRRGVNPAPTRVPFTR GLHEVADRVWAWTLPDGGYGFSNAGLVAGDGASLLVDTLFDLALTREMLAAMKPVTER APITDALITHSNGDHTHGTQLLDRSVRIIAAKGTSEEIEHGPAPEMLARIQTADLGPV ATRYLRDRFGHFDFSGIKLRNADLTFDRDLAIELGGRRVDLLNLGPAHTTADSVVHVA DAGVLFAGDLLFIGCTPIVWAGPIANWVAACDAMIALDAPTVVPGHGPVTGPDGIRAV RGYLAHIAEQAEAAYRKGLSLPEAVETIDLGEYASWLDSERVVVNVYQRYRELDPDTP RQDLLALLVMQAEWAARHCT" gene complement(1050593..1051459) /locus_tag="Rv0940c" /db_xref="GeneID:885412" CDS complement(1050593..1051459) /locus_tag="Rv0940c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv0940c, (MTCY10D7.34), len: 288 aa. Possible oxidoreductase (EC 1.-.-.-), similar to hypothetical proteins and oxidoreductases e.g. AAK38097.1|AF323606_3|AF323606 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa); AAG52987.1|AF040570|Rif17 putative alkanal monooxygenase from Amycolatopsis mediterranei (356 aa); etc. Also similar to putative oxidoreductases from Mycobacterium tuberculosis such as Rv0953c|P71557|YT21_MYCTU (282 aa), FASTA scores: opt: 311, E(): 3.7e-08, (31.0% identity in 248 aa overlap), Rv3079c (275 aa), Rv0791c (347 aa), etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215455.1" /db_xref="GI:15608080" /db_xref="UniProtKB/Swiss-Prot:P64761" /db_xref="GeneID:885412" /translation="MRFSYAEAMTDFTFYIPLAKAAEAAGYSSMTIPDSIAYPFESDS KYPYTPDGNREFMDGKPFIETFVLTAALGAVTTRLRFNFFVLKLPIRPPALVAKQAGS LAALIGNRVGLGVGTSPWPEDYELMGVPFAKRGKRIDECIEIVRGLTTGDYFEFHGEF YDIPKTKMTPAPTQPIPILVGGHADAALRRAARADGWMHGGGDPDELDRLIARVKRLR EEAGKTSPFEIHVISLDGFTVDGVKRLEDKGVTDVIVGFRVPYTMGPDTEPLQTKIRN LEMFAENVIAKV" gene complement(1051544..1052317) /locus_tag="Rv0941c" /db_xref="GeneID:885914" CDS complement(1051544..1052317) /locus_tag="Rv0941c" /function="UNKNOWN" /note="Rv0941c, (MTCY10D7.33), len: 257 aa. Conserved hypothetical protein, showing some similarity with parts of several hypothetical proteins from Streptomyces coelicolor e.g. AL035161|SC9C7_20 (860 aa), FASTA scores: opt: 197, E(): 2.6e-05, (34.2% identity in 114 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215456.1" /db_xref="GI:15608081" /db_xref="UniProtKB/TrEMBL:P71568" /db_xref="GeneID:885914" /translation="MVAVSTAAKSPTALAIAVRTQDSVVILTADGALDSSSSALLRDS LTRATLEQPSAVIVNVTELQVAEESAWSVFISARWQADFRADVPVLLVCGHRAGRAAV TRTGVARFMPVYPTEKAASKAIGRLARRNFKRSDAQLPANLNSLRESRQLVREWLTQW SRPGLIPVALVVVNVFVENVLKHTGSDPVMRIESDGPTATIAVSDGSSAPAVRLASPP KGIDVSGLAIVAALSRAWGSSPTSSGKTVWAIIGPENQL" gene 1052360..1052638 /locus_tag="Rv0942" /db_xref="GeneID:885913" CDS 1052360..1052638 /locus_tag="Rv0942" /function="UNKNOWN" /note="Rv0942, (MTCY10D7.32c), len: 92 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215457.1" /db_xref="GI:15608082" /db_xref="UniProtKB/Swiss-Prot:P64763" /db_xref="GeneID:885913" /translation="MGRSATIAMVPKRRDAMNRHSGPILSSGFIASSSNSCPANSLRM PSALAAETLSFDDRAVRRSTHHPGGGYPQKHAINLQSGLCPAYANASR" gene complement(1052696..1053736) /locus_tag="Rv0943c" /db_xref="GeneID:885889" CDS complement(1052696..1053736) /locus_tag="Rv0943c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0943c, (MTCY10D7.31), len: 346 aa. Possible monooxygenase (EC 1.-.-.-), similar in part to others e.g. NP_250229.1|NC_002516 probable flavin-containing monooxygenase from Pseudomonas aeruginosa (527 aa); AAC36351.1|AF090329 cyclohexanone monooxygenase homolog from Pseudomonas fluorescens (437 aa); CAB59668.1|AL132674 monooxygenase from Streptomyces coelicolor (519 aa); etc. Also similar to putative monooxygenases from Mycobacterium tuberculosis e.g. Rv1393c|P71662|CY21B4.10C (492 aa). FASTA scores: opt: 129, E(): 8.5e-21, (27.5% identity in 236 aa overlap); Rv0892 (495 aa); Rv3049c (524 aa); etc." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_215458.1" /db_xref="GI:15608083" /db_xref="UniProtKB/Swiss-Prot:P64765" /db_xref="GeneID:885889" /translation="MAGVSEAERRGHRKLVRFQARRAIGPIRPTSAAWDRDFDPAGKR IAVVGTDAAAAHYISRLSESAASVTVFTQAPRRVVTGVPLWTTRAKRWLRRRTGAEHP AVAWATAAIDALTSSGIRTSDGVEHPVDAIIYGTGFAIADQVGDQTLVGAGGVTIRQA WDDGMEPYLGVAVHGFPNYFFITGPDTAAQARCVVECMKLMERTASRRIEVRRSSQQV FNERAQLKPAQPHRQTGGLEAFDLSSAATEDDQTYDGAATLTLAGARFRVRVRLTGHL DPIDGNYHWQGTVFDSLPETSLTHARAATLTIGGRSAPARITEQTPWGTHSVAGVGPP PYARSGPASATT" gene 1053765..1054241 /locus_tag="Rv0944" /db_xref="GeneID:885888" CDS 1053765..1054241 /locus_tag="Rv0944" /EC_number="3.2.2.23" /function="THIS ENZYME MAY PLAY A SIGNIFICANT ROLE IN PROCESSES LEADING TO RECOVERY FROM MUTAGENESIS AND/OR CELL DEATH BY ALKYLATING AGENTS [CATALYTIC ACTIVITY: Hydrolysis of DNA containing ring-opened N7-methylguanine residues, releasing 2,6-diamino-4-hydroxy-5-(N-methyl)formamidopyrimide]." /note="Rv0944, (MTCY10D7.30c), len: 158 aa. Possible formamidopyrimidine-DNA glycosylase (EC 3.2.2.23), similar to C-terminus of formamidopyrimidine-DNA glycosylases e.g. CAB63194.1|AL133469 putative formamidopyrimidine-DNA glycosylase from Streptomyces coelicolor (287 aa); FPG_LACLA|NP_266509.1|NC_002662 formamidopyrimidine-DNA glycosylase (EC 3.2.2.23) from Lactococcus lactis subsp. lactis (273 aa), FASTA scores: opt: 246, E(): 2.4e-09, (28.9% identity in 142 aa overlap); O50606|FPG_THETH|MUTM|FPG FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Thermus thermophilus (267 aa); etc. Also similar to C-termini of endonucleases or DNA glycosylases of Mycobacterium tuberculosis e.g. Rv3297, Rv2464c, Rv2924c. MAY BE BELONG TO THE FPG FAMILY." /codon_start=1 /transl_table=11 /product="formamidopyrimidine-DNA glycosylase" /protein_id="NP_215459.1" /db_xref="GI:15608084" /db_xref="GOA:P71565" /db_xref="UniProtKB/TrEMBL:P71565" /db_xref="GeneID:885888" /translation="MAGTPQPRALGPDALDVSTDDLAGLLAGNTGRIKTVITDQKVIA GIGNAYSDEILHVAKISPFATAGKLSGAQLTCLHEAMASVLSDAVRRSVGQGAAMLKG EKRSGLRVHARTGLPCPVCGDTVREVSFADKSFQYCPTCQTGGKALADRRMSRLLK" gene 1054247..1055008 /locus_tag="Rv0945" /db_xref="GeneID:885629" CDS 1054247..1055008 /locus_tag="Rv0945" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0945, (MTCY10D7.29c), len: 253 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases e.g. NP_346338.1|NC_003028 oxidoreductase (short chain dehydrogenase/reductase family) from Streptococcus pneumoniae (253 aa); AAB70845.1|AF019986|PksB from Dictyostelium discoideum (260 aa); AAF86624.1|U87786 clavaldehyde dehydrogenase from Streptomyces clavuligerus (247 aa); P37440|UCPA_ECOLI oxidoreductase from Escherichia coli (285 aa), FASTA scores: opt: 275, E(): 1.1e-12, (33.8% identity in 201 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_215460.1" /db_xref="GI:15608085" /db_xref="GOA:P71564" /db_xref="UniProtKB/Swiss-Prot:P71564" /db_xref="GeneID:885629" /translation="MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRL TELKAELSQRYPDIKIAVAELDVNDHERVPKVFAELSDEIGGIDRVIVNAGIGKGARL GSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVKGVPGVKAAYA ASKAGVRSLGESLRAEYAQRPIRVTVLEPGYIESEMTAKSASTMLMVDNATGVKALVA AIEREPGRAAVPWWPWAPLVRLMWVLPPRLTRRFA" misc_feature 1054682..1054768 /locus_tag="Rv0945" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(1055024..1056685) /gene="pgi" /locus_tag="Rv0946c" /db_xref="GeneID:885533" CDS complement(1055024..1056685) /gene="pgi" /locus_tag="Rv0946c" /EC_number="5.3.1.9" /function="INVOLVED IN GLYCOLYSIS AND IN GLUCONEOGENESIS [CATALYTIC ACTIVITY: D-glucose 6-phosphate = D-fructose 6-phosphate]." /note="functions in sugar metabolism in glycolysis and the Embden-Meyerhof pathways (EMP) and in gluconeogenesis; catalyzes reversible isomerization of glucose-6-phosphate to fructose-6-phosphate; member of PGI family" /codon_start=1 /transl_table=11 /product="glucose-6-phosphate isomerase" /protein_id="NP_215461.1" /db_xref="GI:15608086" /db_xref="GOA:P64192" /db_xref="UniProtKB/Swiss-Prot:P64192" /db_xref="GeneID:885533" /translation="MTSAPIPDITATPAWDALRRHHDQIGNTHLRQFFADDPGRGREL TVSVGDLYIDYSKHRVTRETLALLIDLARTAHLEERRDQMFAGVHINTSEDRAVLHTA LRLPRDAELVVDGQDVVTDVHAVLDAMGAFTDRLRSGEWTGATGKRISTVVNIGIGGS DLGPVMVYQALRHYADAGISARFVSNVDPADLIATLADLDPATTLFIVASKTFSTLET LTNATAARRWLTDALGDAAVSRHFVAVSTNKRLVDDFGINTDNMFGFWDWVGGRYSVD SAIGLSLMTVIGRDAFADFLAGFHIIDRHFATAPLESNAPVLLGLIGLWYSNFFGAQS RTVLPYSNDLSRFPAYLQQLTMESNGKSTRADGSPVSADTGEIFWGEPGTNGQHAFYQ LLHQGTRLVPADFIGFAQPLDDLPTAEGTGSMHDLLMSNFFAQTQVLAFGKTAEEIAA DGTPAHVVAHKVMPGNRPSTSILASRLTPSVLGQLIALYEHQVFTEGVVWGIDSFDQW GVELGKTQAKALLPVITGAGSPPPQSDSSTDGLVRRYRTERGRAG" misc_feature complement(1055144..1055173) /gene="pgi" /locus_tag="Rv0946c" /note="PS00174 Phosphoglucose isomerase signature 2" misc_feature complement(1055846..1055881) /gene="pgi" /locus_tag="Rv0946c" /note="PS00765 Phosphoglucose isomerase signature 1" gene complement(1057303..1057530) /locus_tag="Rv0947c" /pseudo /db_xref="GeneID:885494" misc_feature complement(1057303..1057530) /locus_tag="Rv0947c" /note="Rv0947c, (MTCY10D7.27), len: 76 aa. Probable mycolyl transferase pseudogene (EC 2.-.-.-), similar to part of P31953|A85C_MYCTU|fbpC2 antigen 85-c precursor (85c) (FIBRONECTIN-BINDING PROTEIN C) from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 213, E(): 2e-08, (69.6% identity in 46 aa overlap).;PROBABLE MYCOLYL TRANSFERASE" /pseudo gene complement(1057646..1057963) /locus_tag="Rv0948c" /db_xref="GeneID:885485" CDS complement(1057646..1057963) /locus_tag="Rv0948c" /function="UNKNOWN" /note="Rv0948c, (MTCY10D7.26), len: 105 aa. Conserved hypothetical protein, equivalent to NP_301237.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (105 aa). Also similar (except in N-terminus) to SCD63.16c|CAB82023.1|AL161755 hypothetical protein from Streptomyces coelicolor (110 aa); and to N-terminus of two chorismate mutase/prephenate dehydratase." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215463.1" /db_xref="GI:15608088" /db_xref="GOA:P64767" /db_xref="UniProtKB/Swiss-Prot:P64767" /db_xref="GeneID:885485" /translation="MRPEPPHHENAELAAMNLEMLESQPVPEIDTLREEIDRLDAEIL ALVKRRAEVSKAIGKARMASGGTRLVHSREMKVIERYSELGPDGKDLAILLLRLGRGR LGH" gene 1058260..1060575 /gene="uvrD1" /locus_tag="Rv0949" /db_xref="GeneID:885442" CDS 1058260..1060575 /gene="uvrD1" /locus_tag="Rv0949" /EC_number="3.6.1.-" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. HAS A 3'-5' HELICASE ACTIVITY IN PRESENCE OF ATP. PREFERRED SUBSTRATE BEING ONE WITH BOTH SINGLE AND DOUBLE STRANDED REGIONS OF DNA." /note="Rv0949, (MTCY10D7.25c), len: 771 aa. Probable uvrD1, ATP dependent DNA helicase II (EC 3.6.1.-) (see citation below), equivalent to P_301239.1|NC_002677 putative ATP-dependent DNA helicase from Mycobacterium leprae (778 aa). Also highly similar to others e.g. CAB92660.1|AL356832 from Streptomyces coelicolor (831 aa) (N-terminus longer); P56255|PCRA_BACST from Bacillus stearothermophilus (724 aa); Q10213|YAY5_SCHPO from Schizosaccharomyces pombe (Fission yeast) (887 aa), FASTA scores: opt: 927, E(): 0, (33.5% identity in 659 aa overlap); etc. Also similar to several other UvrD-like proteins in Mycobacterium tuberculosis e.g. Rv3201c, Rv3198c, Rv3202c. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UVRD SUBFAMILY OF HELICASES. Note that previously known as uvrD.; uvrD" /codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase II UVRD1" /protein_id="YP_177772.1" /db_xref="GI:57116803" /db_xref="GOA:P71561" /db_xref="UniProtKB/Swiss-Prot:P71561" /db_xref="GeneID:885442" /translation="MSVHATDAKPPGPSPADQLLDGLNPQQRQAVVHEGSPLLIVAGA GSGKTAVLTRRIAYLMAARGVGVGQILAITFTNKAAAEMRERVVGLVGEKARYMWVST FHSTCVRILRNQAALIEGLNSNFSIYDADDSRRLLQMVGRDLGLDIKRYSPRLLANAI SNLKNELIDPHQALAGLTEDSDDLARAVASVYDEYQRRLRAANALDFDDLIGETVAVL QAFPQIAQYYRRRFRHVLVDEYQDTNHAQYVLVRELVGRDSNDGIPPGELCVVGDADQ SIYAFRGATIRNIEDFERDYPDTRTILLEQNYRSTQNILSAANSVIARNAGRREKRLW TDAGAGELIVGYVADNEHDEARFVAEEIDALAEGSEITYNDVAVFYRTNNSSRSLEEV LIRAGIPYKVVGGVRFYERKEIRDIVAYLRVLDNPGDAVSLRRILNTPRRGIGDRAEA CVAVYAENTGVGFGDALVAAAQGKVPMLNTRAEKAIAGFVEMFDELRGRLDDDLGELV EAVLERTGYRRELEASTDPQELARLDNLNELVSVAHEFSTDRENAAALGPDDEDVPDT GVLADFLERVSLVADADEIPEHGAGVVTLMTLHTAKGLEFPVVFVTGWEDGMFPHMRA LDNPTELSEERRLAYVGITRARQRLYVSRAIVRSSWGQPMLNPESRFLREIPQELIDW RRTAPKPSFSAPVSGAGRFGSARPSPTRSGASRRPLLVLQVGDRVTHDKYGLGRVEEV SGVGESAMSLIDFGSSGRVKLMHNHAPVTKL" misc_feature 1058383..1058406 /gene="uvrD1" /locus_tag="Rv0949" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1060656..1061654) /locus_tag="Rv0950c" /db_xref="GeneID:885438" CDS complement(1060656..1061654) /locus_tag="Rv0950c" /function="UNKNOWN" /note="Rv0950c, (MTCY10D7.24), len: 332 aa. Conserved hypothetical protein, highly similar to AL035500|MLCL373.02c|T45433 hypothetical protein from Mycobacterium leprae (343 aa), FASTA scores: opt: 1500, E(): 0, (71.0% identity in 331 aa overlap). C-terminus highly similar to part of various proteins e.g. C-terminal part of NP_441943.1|NC_000911|NlpD lipoprotein from Synechocystis sp (715 aa); N-terminal part of NP_066789.1|NC_002576 putative peptidase from Rhodococcus equi (546 aa); C-terminal part of NP_212396.1|NC_001318 conserved hypothetical protein from Borrelia burgdorferi (417 aa); C-terminal part of P33648|NLPD_ECOLI|nlpd lipoprotein from Escherichia coli (379 aa), FASTA scores: opt: 276, E(): 2e-10, (29.9% identity in 234 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215465.1" /db_xref="GI:15608090" /db_xref="GOA:P71560" /db_xref="UniProtKB/TrEMBL:P71560" /db_xref="GeneID:885438" /translation="MAAIRTPRDRWPHHHRNEVTEIIPLDGFLDGLALYDELDFAELD DLDLGDDCVFDYEAQLLAAPELDDLDDADDLAPEWLVAPTVVLTPEVTPVSRRVGQHR KQPIGAARGRLLISAMAAGAAAAAAHTAIQQSETPRTETVLTAHASALNEGSGSNPPR GVQVIAAQPAASAAVHNAEFARGVAFAEERAEREARLQRPLYVMPTKGIFTSSFGYRW GVLHAGIDLANAIGTPIYAVSDGVVIDAGPTAGYGMWVKLLHADGTVTLYGHVNTTLV SVGERVMAGDQIATMGSRGFSTGPHLHFEVLLGGTERVDPVPWLAKRGLSVGNYTG" gene 1061964..1063127 /gene="sucC" /locus_tag="Rv0951" /db_xref="GeneID:885434" CDS 1061964..1063127 /gene="sucC" /locus_tag="Rv0951" /EC_number="6.2.1.5" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY: ATP + succinate + CoA = ADP + succinyl-CoA + phosphate]." /note="catalyzes the interconversion of succinyl-CoA and succinate" /codon_start=1 /transl_table=11 /product="succinyl-CoA synthetase subunit beta" /protein_id="NP_215466.1" /db_xref="GI:15608091" /db_xref="GOA:P71559" /db_xref="UniProtKB/Swiss-Prot:P71559" /db_xref="GeneID:885434" /translation="MDLFEYQAKELFAKHNVPSTPGRVTDTAEGAKAIATEIGRPVMV KAQVKIGGRGKAGGVKYAATPQDAYEHAKNILGLDIKGHIVKKLLVAEASDIAEEYYL SFLLDRANRTYLAMCSVEGGMEIEEVAATKPERLAKVPVNAVKGVDLDFARSIAEQGH LPAEVLDTAAVTIAKLWELFVAEDATLVEVNPLVRTPDHKILALDAKITLDGNADFRQ PGHAEFEDRAATDPLELKAKEHDLNYVKLDGQVGIIGNGAGLVMSTLDVVAYAGEKHG GVKPANFLDIGGGASAEVMAAGLDVVLGDQQVKSVFVNVFGGITSCDAVATGIVKALG MLGDEANKPLVVRLDGNNVEEGRRILTEANHPLVTLVATMDEAADKAAELASA" gene 1063140..1064051 /gene="sucD" /locus_tag="Rv0952" /db_xref="GeneID:885426" CDS 1063140..1064051 /gene="sucD" /locus_tag="Rv0952" /EC_number="6.2.1.5" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY: ATP + succinate + CoA = ADP + succinyl-CoA + phosphate]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the only substrate-level phosphorylation in the TCA cycle" /codon_start=1 /transl_table=11 /product="succinyl-CoA synthetase subunit alpha" /protein_id="NP_215467.1" /db_xref="GI:15608092" /db_xref="GOA:P71558" /db_xref="UniProtKB/Swiss-Prot:P71558" /db_xref="GeneID:885426" /translation="MTHMSIFLSRDNKVIVQGITGSEATVHTARMLRAGTQIVGGVNA RKAGTTVTHEDKGGRLIKLPVFGSVAEAMEKTGADVSIIFVPPTFAKDAIIEAIDAEI PLLVVITEGIPVQDTAYAWAYNLEAGHKTRIIGPNCPGIISPGQSLAGITPANITGPG PIGLVSKSGTLTYQMMFELRDLGFSTAIGIGGDPVIGTTHIDAIEAFERDPDTKLIVM IGEIGGDAEERAADFIKTNVSKPVVGYVAGFTAPEGKTMGHAGAIVSGSSGTAAAKQE ALEAAGVKVGKTPSATAALAREILLSL" misc_feature 1063908..1063925 /gene="sucD" /locus_tag="Rv0952" /note="PS00399 ATP-citrate lyase and succinyl-CoA ligases active site" misc_feature 1063980..1064003 /gene="sucD" /locus_tag="Rv0952" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1064114..1064962) /locus_tag="Rv0953c" /db_xref="GeneID:885419" CDS complement(1064114..1064962) /locus_tag="Rv0953c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv0953c, (MTCY10D7.21), len: 282 aa. Possible oxidoreductase (EC 1.-.-.-), equivalent to CAA48222.1|X68102 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (166 aa). Similar to several hypothetical proteins and oxidoreductases e.g. AAK38097.1|AF323606_3|AF323606 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa); NP_070025.1|NC_000917 N5,N10-methylenetetrahydromethanopterin reductase (mer-2) from Archaeoglobus fulgidus (348 aa); etc. Also similar to several hypothetical proteins and oxidoreductases from Mycobacterium tuberculosis e.g. Rv2161c|O06216|Z95388|MTCY270.07 (288 aa), FASTA scores: opt: 633, E(): 0, (40.4% identity in 277 aa overlap), Rv3079c (275 aa), Rv0791c (347 aa), etc. Contains PS00201 Flavodoxin signature." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215468.1" /db_xref="GI:15608093" /db_xref="UniProtKB/Swiss-Prot:P64769" /db_xref="GeneID:885419" /translation="MHYGLVLFTSDRGITPAAAARLAESHGFRTFYVPEHTHIPVKRQ AAHPTTGDASLPDDRYMRTLDPWVSLGAASAVTSRIRLATAVALPVEHDPITLAKSIA TLDHLSHGRVSVGVGFGWNTDELVDHGVPPGRRRTMLREYLEAMRALWTQEEACYDGE FVKFGPSWAWPKPVQPHIPVLVGAAGTEKNFKWIARSADGWITTPRDVDIDEPVKLLQ DIWAAAGRDGLPQIVALDVKPVPDKLARWAELGVTEVLFGMPDRSADDAAAYVERLAA KLACCV" misc_feature complement(1064897..1064947) /locus_tag="Rv0953c" /note="PS00201 Flavodoxin signature" gene 1065127..1066038 /locus_tag="Rv0954" /db_xref="GeneID:885411" CDS 1065127..1066038 /locus_tag="Rv0954" /function="UNKNOWN" /note="Rv0954, (MTCY10D7.20c), len: 303 aa. Probable conserved transmembrane protein, highly similar to 34KD_MYCPA|Q04959 34 kDa antigenic protein from Mycobacterium paratuberculosis (298 aa), FASTA scores: opt: 1023, E(): 7.2e-36, (59.3% identity in 305 aa overlap); AAC69251.1|U82111 34 kDa antigen precursor from Mycobacterium leprae (336 aa); and AL035500|MLCL373.06 hypothetical membrane protein from Mycobacterium leprae (297 aa), FASTA score: (55.6% identity in 315 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215469.1" /db_xref="GI:15608094" /db_xref="GOA:P65637" /db_xref="UniProtKB/Swiss-Prot:P65637" /db_xref="GeneID:885411" /translation="MTYSPGNPGYPQAQPAGSYGGVTPSFAHADEGASKLPMYLNIAV AVLGLAAYFASFGPMFTLSTELGGGDGAVSGDTGLPVGVALLAALLAGVALVPKAKSH VTVVAVLGVLGVFLMVSATFNKPSAYSTGWALWVVLAFIVFQAVAAVLALLVETGAIT APAPRPKFDPYGQYGRYGQYGQYGVQPGGYYGQQGAQQAAGLQSPGPQQSPQPPGYGS QYGGYSSSPSQSGSGYTAQPPAQPPAQSGSQQSHQGPSTPPTGFPSFSPPPPVSAGTG SQAGSAPVNYSNPSGGEQSSSPGGAPV" gene 1066078..1067445 /locus_tag="Rv0955" /db_xref="GeneID:885408" CDS 1066078..1067445 /locus_tag="Rv0955" /function="UNKNOWN" /note="Rv0955, (MTCY10D7.19c), len: 455 aa. Probable conserved integral membrane protein, highly similar to AL035500|MLCL373_6 putative membrane protein from Mycobacterium leprae (430 aa), FASTA score: (75.9% identity in 419 aa overlap); and AAL05878.1|AF411607_2|AF411607 unknown protein from Mycobacterium avium subsp. paratuberculosis (409 aa)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215470.1" /db_xref="GI:15608095" /db_xref="GOA:P64771" /db_xref="UniProtKB/Swiss-Prot:P64771" /db_xref="GeneID:885408" /translation="MNRVSASADDRAAGARPARDLVRVAFGPGVVALGIIAAVTLLQL LIANSDMTGAWGAIASMWLGVHLVPISIGGRALGVMPLLPVLLMVWATARSTARATSP QSSGLVVRWVVASALGGPLLMAAIALAVIHDASSVVTELQTPSALRAFTSVLVVHSVG AATGVWSRVGRRALAATALPDWLHDSMRAAAAGVLALLGLSGVVTAGSLVVHWATMQE LYGITDSIFGQFSLTVLSVLYAPNVIVGTSAIAVGSSAHIGFATFSSFAVLGGDIPAL PILAAAPTPPLGPAWVALLIVGASSGVAVGQQCARRALPFVAAMAKLLVAAVAGALVM AVLGYGGGGRLGNFGDVGVDEGALVLGVLFWFTFVGWVTVVIAGGISRRPKRLRPAPP VELDADESSPPVDMFDGAASEQPPASVAEDVPPSHDDIANGLKAPTADDEALPLSDEP PPRAD" gene 1067561..1068208 /gene="purN" /locus_tag="Rv0956" /db_xref="GeneID:885407" CDS 1067561..1068208 /gene="purN" /locus_tag="Rv0956" /EC_number="2.1.2.2" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE THIRD STEP) [CATALYTIC ACTIVITY: 10-FORMYLTETRAHYDROFOLATE + 5'-PHOSPHORIBOSYLGLYCINAMIDE = TETRAHYDROFOLATE + 5'-PHOSPHORIBOSYL-N-FORMYLGLYCINAMIDE]." /note="glycinamide ribonucleotide transformylase; GAR Tfase; catalyzes the synthesis of 5'-phosphoribosylformylglycinamide from 5'-phosphoribosylglycinamide and 10-formyltetrahydrofolate; PurN requires formyl folate for the reaction unlike PurT which uses formate" /codon_start=1 /transl_table=11 /product="phosphoribosylglycinamide formyltransferase" /protein_id="NP_215471.1" /db_xref="GI:15608096" /db_xref="GOA:P71554" /db_xref="UniProtKB/TrEMBL:P71554" /db_xref="GeneID:885407" /translation="MQEPLRVPPSAPARLVVLASGTGSLLRSLLDAAVGDYPARVVAV GVDRECRAAEIAAEASVPVFTVRLADHPSRDAWDVAITAATAAHEPDLVVSAGFMRIL GPQFLSRFYGRTLNTHPALLPAFPGTHGVADALAYGVKVTGATVHLVDAGTDTGPILA QQPVPVLDGDDEETLHERIKVTERRLLVAAVAALATHGVTVVGRTATMGRKVTIG" gene 1068205..1069776 /gene="purH" /locus_tag="Rv0957" /db_xref="GeneID:885406" CDS 1068205..1069776 /gene="purH" /locus_tag="Rv0957" /EC_number="3.5.4.10" /EC_number="2.1.2.3" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS (AT THE NINTH AND TENTH STEPS) [CATALYTIC ACTIVITY 1: 10-formyltetrahydrofolate + 5'-phosphoribosyl-5-amino-4-imidazolecarboxamide = tetrahydrofolate + 5'-phosphoribosyl-5-formamido-4-imidazolecarboxamide] [CATALYTIC ACTIVITY 2: IMP + H2O = 5-formamido-1-(5-phosphoribosyl)imidazole-4-carboxamide]." /note="involved in de novo purine biosynthesis" /codon_start=1 /transl_table=11 /product="bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase" /protein_id="NP_215472.1" /db_xref="GI:15608097" /db_xref="GOA:P67541" /db_xref="UniProtKB/Swiss-Prot:P67541" /db_xref="GeneID:885406" /translation="MSTDDGRRPIRRALISVYDKTGLVDLAQGLSAAGVEIISTGSTA KTIADTGIPVTPVEQLTGFPEVLDGRVKTLHPRVHAGLLADLRKSEHAAALEQLGIEA FELVVVNLYPFSQTVESGASVDDCVEQIDIGGPAMVRAAAKNHPSAAVVTDPLGYHGV LAALRAGGFTLAERKRLASLAFQHIAEYDIAVASWMQQTLAPEHPVAAFPQWFGRSWR RVAMLRYGENPHQQAALYGDPTAWPGLAQAEQLHGKDMSYNNFTDADAAWRAAFDHEQ TCVAIIKHANPCGIAISSVSVADAHRKAHECDPLSAYGGVIAANTEVSVEMAEYVSTI FTEVIVAPGYAPGALDVLARKKNIRVLVAAEPLAGGSELRPISGGLLIQQSDQLDAHG DNPANWTLATGSPADPATLTDLVFAWRACRAVKSNAIVIAADGATVGVGMGQVNRVDA ARLAVERGGERVRGAVAASDAFFPFPDGLETLAAAGVTAVVHPGGSVRDEEVTEAAAK AGVTLYLTGARHFAH" gene 1069883..1071262 /locus_tag="Rv0958" /db_xref="GeneID:885405" CDS 1069883..1071262 /locus_tag="Rv0958" /EC_number="4.99.1.-" /function="CHELATION, INTRODUCING A MAGNESIUM ION INTO SPECIFIC SUBSTRATE." /note="Rv0958, (MTCY10D7.16c), len: 459 aa. Possible magnesium chelatase (EC 4.99.1.-), similar to others (especially in N-terminal parts) e.g. NP_296313.1|NC_001263|AE002088_10 putative magnesium protoporphyrin chelatase from Deinococcus radiodurans (487 aa), FASTA scores: opt: 1148, E(): 0, (42.4% identity in 450 aa overlap); Q44498|CHLI_ANAVA MAGNESIUM-CHELATASE SUBUNIT CHLI from Anabaena variabilis (338 aa); T31460 probable magnesium chelatase (EC 4.99.1.-) chain I bchI from Heliobacillus mobilis (363 aa); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="magnesium chelatase" /protein_id="NP_215473.1" /db_xref="GI:15608098" /db_xref="GOA:P71552" /db_xref="UniProtKB/TrEMBL:P71552" /db_xref="GeneID:885405" /translation="MSPSNLPRTVGELRAAGHRERGVKQEIRENLLTALADGDNVWPG ILGFDDTVIPQVERALIAGHDFVLLGERGQGKTRLLRALAGLLDEWTPVIAGAELGEH PYTPITPESIRRAAQLGDDLPVAWKHRSERYTEKLATPDTSVADLVGDVDPIKVAEGR SLGDPETIAYGLIPRAHRGIVAVNELPDLAERIQVSMLNVMEERDIQVRGYTLRLPLD VLVVASANPEDYTNRGRIITPIKDRFGAEIRTHYPLELEAEMGVIVQEAHLSAQVSDY LMQVLARFARYLRESRSIDQRSGVSARFAIAAAETVAAAARHRGAVLGETDPVARVVD LGTVIDVLRGKLEFESGEEGREQAVLEHLLRRATADTASRVLGGIDVGSLVTAVEGGS AVTTGERVSAKDVLAAVPGLPVVDRIARKLGAESEGERAAALELALEALYLAKRVDKV CGEGQTVYG" misc_feature 1070090..1070113 /locus_tag="Rv0958" /note="PS00017 ATP/GTP-binding site motif A" gene 1071255..1073273 /locus_tag="Rv0959" /db_xref="GeneID:885329" CDS 1071255..1073273 /locus_tag="Rv0959" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0959, (MTCY10D7.15c), len: 672 aa. Conserved hypothetical protein, similar to AE002069|AE002069_12 hypothetical protein from Deinococcus radiodurans (403 aa), FASTA scores: opt: 395, E(): 1.3e-15, (26.8% identity in 426 aa overlap). Contains a single copy at the N-terminus of a short repeat found three times in the M. tuberculosis ORF O33341|MTV003.05c|AL008883." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215474.1" /db_xref="GI:15608099" /db_xref="UniProtKB/Swiss-Prot:P71551" /db_xref="GeneID:885329" /translation="MAKSDGDDPLRPASPRLRSSRRHSLRYSAYTGGPDPLAPPVDLR DALEQIGQDVMAGASPRRALSELLRRGTRNLTGADRLAAEVNRRRRELLRRNNLDGTL QEIKKLLDEAVLAERKELARALDDDARFAELQLDALPASPAKAVQELAEYRWRSGQAR EKYEQIKDLLGRELLDQRFAGMKQALAGATDDDRRRVTEMLDDLNDLLDKHARGEDTQ RDFDEFMTKHGEFFPENPRNVEELLDSLAKRAAAAQRFRNSLSQEQRDELDALAQQAF GSPALMRALDRLDAHLQAARPGEDWTGSQQFSGDNPFGMGEGTQALADIAELEQLAEQ LSQSYPGASMDDVDLDALARQLGDQAAVDARTLAELERALVNQGFLDRGSDGQWRLSP KAMRRLGETALRDVAQQLSGRHGERDHRRAGAAGELTGATRPWQFGDTEPWHVARTLT NAVLRQAAAVHDRIRITVEDVEVAETETRTQAAVALLVDTSFSMVMENRWLPMKRTAL ALHHLVCTRFRSDALQIIAFGRYARTVTAAELTGLAGVYEQGTNLHHALALAGRHLRR HAGAQPVVLVVTDGEPTAHLEDFDGDGTSVFFDYPPHPRTIAHTVRGFDDMARLGAQV TIFRLGSDPGLARFIDQVARRVQGRVVVPDLDGLGAAVVGDYLRFRRR" gene 1073545..1073928 /locus_tag="Rv0960" /db_xref="GeneID:885158" CDS 1073545..1073928 /locus_tag="Rv0960" /function="UNKNOWN" /note="Rv0960, (MTCY10D7.14c), len: 127 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv0065|MTV030.08 (133 aa), FASTA scores: E(): 1.5e-14, (38.3% identity in 128 aa overlap), Rv1720c (129 aa), and Rv0549c (137 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215475.1" /db_xref="GI:15608100" /db_xref="UniProtKB/Swiss-Prot:P64773" /db_xref="GeneID:885158" /translation="MIVVDASAALAALLNDGQARQLIAAERLHVPHLVDSEIASGLRR LAQRDRLGAADGRRALQTWRRLAVTRYPVVGLFERIWEIRANLSAYDASYVALAEALN CALVTADLRLSDTGQAQCPITVVPR" gene 1074074..1074421 /locus_tag="Rv0961" /db_xref="GeneID:885173" CDS 1074074..1074421 /locus_tag="Rv0961" /function="UNKNOWN" /note="Rv0961, (MTCY10D7.13c), len: 115 aa. Probable integral membrane protein." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215476.1" /db_xref="GI:15608101" /db_xref="GOA:P64775" /db_xref="UniProtKB/Swiss-Prot:P64775" /db_xref="GeneID:885173" /translation="MRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAM ATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLA LGLVYVAADAVLH" gene complement(1074440..1075114) /gene="lprP" /locus_tag="Rv0962c" /db_xref="GeneID:885177" CDS complement(1074440..1075114) /gene="lprP" /locus_tag="Rv0962c" /function="UNKNOWN" /note="Rv0962c, (MTCY10D7.12), len: 224 aa. Possible lprP, lipoprotein. Contains possible N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LprP" /protein_id="NP_215477.1" /db_xref="GI:15608102" /db_xref="GOA:P71548" /db_xref="UniProtKB/Swiss-Prot:P71548" /db_xref="GeneID:885177" /translation="MKRTSRSLTAALLGIAALLAGCIKPNTFDPYANPGRGELDRRQK IVNGRPDLETVQQQLANLDATIRAMIAKYSPQTRFSTGVTVSHLTNGCNDPFTRTIGR QEASELFFGRPAPTPQQWLQIVTELAPVFKAAGFRPNNSVPGDPPQPLGAPNYSQIRD DGVTINLVNGDNRGPLGYSYNTGCHPPAAWRTAPPPLNMRPANDPDVHYPYLYGSPGG RTRDAY" gene complement(1075297..1076097) /locus_tag="Rv0963c" /db_xref="GeneID:885184" CDS complement(1075297..1076097) /locus_tag="Rv0963c" /function="UNKNOWN" /note="Rv0963c, (MTCCY10D7.11), len: 266 aa. Conserved hypothetical protein, similar in part to other CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis e.g. Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: E(): 1.2e-23, (39.0% identity in 254 aa overlap); Rv2542 (403 aa); Rv2079 (656 aa). Also similar in part to AL133423|SC4A7_3 HYPOTHETICAL SECRETED PROTEIN from Streptomyces coelicolor (406 aa), FASTA scores: opt: 231, E(): 6.8e-07, (31.4% identity in 204 aa overlap); and SCH10.21c|T36533 hypothetical protein from Streptomyces coelicolor (329 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215478.1" /db_xref="GI:15608103" /db_xref="UniProtKB/Swiss-Prot:P64777" /db_xref="GeneID:885184" /translation="MLQRELTRLQNGWLSRDGVWHTDTDKLADLRALRDTLAAHPGTS LILLDTASDPRKVLAAVGVGDVDNAERVGVTMGGLNTRVSSSVGDMVKEAGIQRAKAA ELRERAGWPNYDAVASIAWLGYDAPDGLKDVMHDWSARDAAGPLNRFDKGLAATTNVS DQHITAFGHSYGSLVTSLALQQGAPVSDVVLYGSPGTELTHASQLGVEPGHAFYMIGV NDHVANTIPEFGAFGSAPQDVPGMTQLSVNTGLAPGPLLGDGQLHERA" gene complement(1076196..1076678) /locus_tag="Rv0964c" /db_xref="GeneID:885186" CDS complement(1076196..1076678) /locus_tag="Rv0964c" /function="UNKNOWN" /note="Rv0964c, (MTCY10D7.10), len: 160 aa. Hypothetical unknown protein. Equivalent to AAK45241.1 from Mycobacterium tuberculosis strain CDC1551 (138 aa) but longer 22 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215479.1" /db_xref="GI:15608104" /db_xref="UniProtKB/Swiss-Prot:P71546" /db_xref="GeneID:885186" /translation="MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAE EVAAIKMRLQVIRDAAREHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNV CWPTPRPPMRIWPRRFDAPPGPCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR" gene complement(1076778..1077197) /locus_tag="Rv0965c" /db_xref="GeneID:885230" CDS complement(1076778..1077197) /locus_tag="Rv0965c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0965c, (MTCY10D7.09), len: 139 aa. Conserved hypothetical protein, showing weak similarity with Rv2798c|MTCY16B7.45 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (108 aa), FASTA scores: E(): 5.6e-12, (38.9% identity in 90 aa overlap). Equivalent to AAK45242.1 from Mycobacterium tuberculosis strain CDC1551 (146 aa) but shorter 7 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215480.1" /db_xref="GI:15608105" /db_xref="UniProtKB/Swiss-Prot:P71545" /db_xref="GeneID:885230" /translation="MRVNRPQCARVPYSAESLVRVEASWYGRTLRAIPEVLSQVGYQQ ADHGESLLTSHHCCLGAAEGARPGWVGSSAGALSGLLDSWAEASTAHAARIGDHSYGM HLAAVGFAEMEEHNAAALAAVYPTGGGSARCDGVDVS" gene complement(1077233..1077835) /locus_tag="Rv0966c" /db_xref="GeneID:885043" CDS complement(1077233..1077835) /locus_tag="Rv0966c" /function="UNKNOWN" /note="Rv0966c, (MTCY10D7.08), len: 200 aa. Conserved hypothetical protein, equivalent to AL035500|MLCL373_12 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (200 aa), FASTA scores: opt: 1080, E(): 0, (79.5% identity in 200 aa overlap). Also highly similar to SCE6.30c|CAB88834.1|AL353832 hypothetical protein from Streptomyces coelicolor (277 aa). Some similarity to Rv2862c|MTV007.08 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (194 aa), FASTA scores: E(): 3.1e-06, (31.5% identity in 184 aa overlap). Equivalent to AAK45243.1 from Mycobacterium tuberculosis strain CDC1551 (230 aa) but shorter 30 aa. Note that Rv0966c has been shortened since first entry." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215481.2" /db_xref="GI:57116804" /db_xref="UniProtKB/Swiss-Prot:P71544" /db_xref="GeneID:885043" /translation="MSNSAQRDARNSRDESARASDTDRIQIAQLLAYAAEQGRLQLTD YEDRLARAYAATTYQELDRLRADLPGAAIGPRRGGECNPAPSTLLLALLGGFERRGRW NVPKKLTTFTLWGSGVLDLRYADFTSTEVDIRAYSIMGAQTILLPPEVNVEIHGHRVM GGFDRKVVGEGTRGVPTVRIRGFSLWGDVGIKRKPRKPRK" gene 1077975..1078334 /locus_tag="Rv0967" /db_xref="GeneID:885312" CDS 1077975..1078334 /locus_tag="Rv0967" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0967, (MTCY10D7.07c), len: 119 aa. Conserved hypothetical protein, similar to hypothetical proteins from several organisms e.g. AE002074|AE002074_11 from Deinococcus radiodurans (102 aa), FASTA scores: opt: 233, E(): 8.6e-10, (47.0% identity in 83 aa overlap); O32222|Z99121|YVGZ from Bacillus subtilis (101 aa), FASTA scores: opt:228, E(): 3.2e-15, (38.0% identity in 92 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical proteins Rv0190, and Rv1766." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215482.1" /db_xref="GI:15608107" /db_xref="UniProtKB/TrEMBL:P71543" /db_xref="GeneID:885312" /translation="MSKELTAKKRAALNRLKTVRGHLDGIVRMLESDAYCVDVMKQIS AVQSSLERANRVMLHNHLETCFSTAVLDGHGQAAIEELIDAVKFTPALTGPHARLGGA AVGESATEEPMPDASNM" gene 1078391..1078687 /locus_tag="Rv0968" /db_xref="GeneID:885052" CDS 1078391..1078687 /locus_tag="Rv0968" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0968, (MTCY10D7.06c), len: 98 aa. Conserved hypothetical protein, similar to NP_301579.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (92 aa). Also highly similar to CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis e.g. Rv3269 (93 aa), FASTA score: (51.1% identity in 94 aa overlap); and Rv1993c (90 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215483.1" /db_xref="GI:15608108" /db_xref="UniProtKB/Swiss-Prot:P64779" /db_xref="GeneID:885052" /translation="MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVA AWGIRLAREAERKAGESAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH" gene 1078743..1081055 /gene="ctpV" /locus_tag="Rv0969" /db_xref="GeneID:885254" CDS 1078743..1081055 /gene="ctpV" /locus_tag="Rv0969" /EC_number="3.6.3.-" /function="METAL CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF UNDETERMINATED METAL CATION WITH HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED METAL CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED METAL CATION(OUT)]." /experiment="experimental evidence, no additional details recorded" /note="Rv0969, (MTCY10D7.05c), len: 770 aa. Probable ctpV, metal cation transporter P-type ATPase (transmembrane protein) (EC 3.6.3.-) (see citation below), highly similar (except in N-terminus) to others e.g. NP_391230.1|NC_000964 similar to heavy metal-transporting ATPase from Bacillus subtilis (803 aa); P37279|ATCS_SYNP7|PACS cation-transporting ATPase from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (747 aa), FASTA scores: opt: 1851, E(): 0, (52.1% identity in 664 aa overlap); etc. Equivalent to AAK45246.1 from Mycobacterium tuberculosis strain CDC1551 (792 aa) but shorter 22 aa. Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES)." /codon_start=1 /transl_table=11 /product="metal cation transporter P-type ATPase CtpV" /protein_id="NP_215484.1" /db_xref="GI:15608109" /db_xref="GOA:P77894" /db_xref="UniProtKB/Swiss-Prot:P77894" /db_xref="GeneID:885254" /translation="MRVCVTGFNVDAVRAVAIEETVSQVTGVHAVHAYPRTASVVIWY SPELGDTAAVLSAITKAQHVPAELVPARAPHSAGVRGVGVVRKITGGIRRMLSRPPGV DKPLKASRCGGRPRGPVRGSASWPGEQNRRERRTWLPRVWLALPLGLLALGSSMFFGA YPWAGWLAFAATLPVQFVAGWPILRGAVQQARALTSNMDTLIALGTLTAFVYSTYQLF AGGPLFFDTSALIIAFVVLGRHLEARATGKASEAISKLLELGAKEATLLVDGQELLVP VDQVQVGDLVRVRPGEKIPVDGEVTDGRAAVDESMLTGESVPVEKTAGDRVAGATVNL DGLLTVRATAVGADTALAQIVRLVEQAQGDKAPVQRLADRVSAVFVPAVIGVAVATFA GWTLIAANPVAGMTAAVAVLIIACPCALGLATPTAIMVGTGRGAELGILVKGGEVLEA SKKIDTVVFDKTGTLTRARMRVTDVIAGQRRQPDQVLRLAAAVESGSEHPIGAAIVAA AHERGLAIPAANAFTAVAGHGVRAQVNGGPVVVGRRKLVDEQHLVLPDHLAAAAVEQE ERGRTAVFVGQDGQVVGVLAVADTVKDDAADVVGRLHAMGLQVAMITGDNARTAAAIA KQVGIEKVLAEVLPQDKVAEVRRLQDQGRVVAMVGDGVNDAPALVQADLGIAIGTGTD VAIEASDITLMSGRLDGVVRAIELSRQTLRTIYQNLGWAFGYNTAAIPLAALGALNPV VAGAAMGFSSVSVVTNSLRLRRFGRDGRTA" misc_feature 1080120..1080140 /gene="ctpV" /locus_tag="Rv0969" /note="PS00154 E1-E2 ATPases phosphorylation site" gene 1081052..1081684 /locus_tag="Rv0970" /db_xref="GeneID:885242" CDS 1081052..1081684 /locus_tag="Rv0970" /function="UNKNOWN" /note="Rv0970, (MTCY10D7.04c), len: 210 aa. Probable conserved integral membrane protein, equivalent to NP_302348.1|NC_002677 probable integral membrane protein from Mycobacterium leprae (210 aa)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215485.1" /db_xref="GI:15608110" /db_xref="GOA:P64781" /db_xref="UniProtKB/Swiss-Prot:P64781" /db_xref="GeneID:885242" /translation="MIHDLMLRWVVTGLFVLTAAECGLAIIAKRRPWTLIVNHGLHFA MAVAMAVMAWPWGARVPTTGPAVFFLLAAVWFGATAVVAVRGTATRGLYGYHGLMMLA TAWMYAAMNPRLLPVRSCTEYATEPDGSMPAMDMTAMNMPPNSGSPIWFSAVNWIGTV GFAVAAVFWACRFVMERRQEATQSRLPGSIGQAMMAAGMAMLFFAMLFPV" gene complement(1081775..1082584) /gene="echA7" /locus_tag="Rv0971c" /db_xref="GeneID:885308" CDS complement(1081775..1082584) /gene="echA7" /locus_tag="Rv0971c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215486.1" /db_xref="GI:15608111" /db_xref="GOA:P71540" /db_xref="UniProtKB/TrEMBL:P71540" /db_xref="GeneID:885308" /translation="MDSPVDYAGPAACGGPFARLTLNSPHNRNALSSTLVSQLHQGLS AAEADPAVRLVVLGHTGGTFCAGADLSEAGGGGGDPYRMAVARAREMTALLRAIVESP LPVVGAINGHVRAGGFGLVGACDMVVAGPESTFALTEARIGVAPAIISLTLLPKLSPR AAARYYLTGEKFGAREAADIGLITMAADDVDAAVAALVADVGRGSPQGLAASKALTTA AVLEGFDRDAERLTEESARLFVSDEAREGMLAFLQKRPPRWVQPATMRAAD" gene complement(1082584..1083750) /gene="fadE12" /locus_tag="Rv0972c" /db_xref="GeneID:885237" CDS complement(1082584..1083750) /gene="fadE12" /locus_tag="Rv0972c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv0972c, (MTCY10D7.02), len: 388 aa. Probable fadE12, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. CAB95893.1|AL359988 putative acyl CoA dehydrogenase from Streptomyces coelicolor (382 aa); P45857|ACDB_BACSU from Bacillus subtilis (379 aa), FASTA scores: opt: 576, E(): 2.3e-26, (29.7% identity in 381 aa overlap); etc." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE12" /protein_id="NP_215487.1" /db_xref="GI:15608112" /db_xref="GOA:P71539" /db_xref="UniProtKB/Swiss-Prot:P71539" /db_xref="GeneID:885237" /translation="MTDTSFIESEERQALRKAVASWVANYGHEYYLDKARKHEHTSEL WAEAGKLGFLGVNLPEEYGGGGAGMYELSLVMEEMAAAGSALLLMVVSPAINGTIIAK FGTDDQKKRWLPGIADGSLTMAFAITEPDAGSNSHKITTTARRDGSDWIIKGQKVFIS GIDQAQAVLVVGRSEEAKTGKLRPALFVVPTDAPGFSYTPIEMELVSPERQFQVFLDD VRLPADALVGAEDAAIAQLFAGLNPERIMGAASAVGMGRFALGRAVDYVKTRKVWSTP IGAHQGLAHPLAQCHIEVELAKLMTQKAATLYDHGDDFGAAEAANMAKYAAAEASSRA VDQAVQSMGGNGLTKEYGVAAMMTSARLARIAPISREMVLNFVAQTSLGLPRSY" gene complement(1083747..1085750) /gene="accA2" /locus_tag="Rv0973c" /db_xref="GeneID:885922" CDS complement(1083747..1085750) /gene="accA2" /locus_tag="Rv0973c" /EC_number="6.3.4.14" /function="THIS PROTEIN CARRIES TWO FUNCTIONS: BIOTIN CARBOXYL CARRIER PROTEIN AND BIOTIN CARBOXYLTRANSFERASE. INVOLVED IN THE FIRST STEP OF LONG-CHAIN FATTY ACID SYNTHESIS [CATALYTIC ACTIVITY: ATP + BIOTIN-CARBOXYL-CARRIER PROTEIN + CO(2) = ADP + PHOSPHATE + CARBOXYBIOTIN-CARBOXYL-CARRIER PROTEIN]." /note="Rv0973c, (MTV044.01c, MTCY10D7.01), len: 667 aa. Probable accA2 (alternate gene name: bccA), acetyl-/propionyl-coenzyme A carboxylase (alpha subunit) [INCLUDES: BIOTIN CARBOXYLASE (EC 6.3.4.14); BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)], highly similar to others e.g. CAB95892.1|AL359988 putative acetyl/propionyl CoA carboxylase alpha subunit from Streptomyces coelicolor (614 aa); NP_250702.1|NC_002516 probable acyl-CoA carboxylase alpha chain from Pseudomonas aeruginosa (655 aa); NP_420971.1|NC_002696 acetyl/propionyl-CoA carboxylase alpha subunit from Caulobacter crescentus ( 654 aa); NP_251581.1|NC_002516 probable biotin carboxylase/biotin carboxyl carrier protein from Pseudomonas aeruginosa (661 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv2501c|P46401|MTCY07A7.07c|BCCA_MYCTU|ACCA1 PROBABLE ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE ALPHA CHAIN (ALPHA SUBUNIT) (654 aa), FASTA scores, opt: 250, E(): 4e-09, (28.6% identity in 182 aa overlap); and Rv3285|MTCY71.25|ACCA3 (600 aa); Z83018|MTCY349_20 (1127 aa), FASTA scores: opt: 838, E(): 0, (40.2% identity in 500 aa overlap). Contains PS00867 Carbamoyl-phosphate synthase subdomain signature 2 and PS00188 Biotin-requiring enzymes attachment site.; bccA" /codon_start=1 /transl_table=11 /product="acetyl-/propionyl-coenzyme A carboxylase subunit alpha" /protein_id="NP_215488.1" /db_xref="GI:15608113" /db_xref="GOA:P71538" /db_xref="UniProtKB/TrEMBL:P71538" /db_xref="GeneID:885922" /translation="MGITRVLVANRGEIARRVFATCRRLGLGTVAVYTDPDAAAPHVA EADARVRLPQTTDYLNAEAIIAAAQAAGADAVHPGYGFLSENAEFAAAVQEAGLTWVG PPVDAVRAMGSKIESKKLMAAAGVPVLEELDPDAVTTAQLPVLVKASAGGGGRGMRVV HELSALPAEVEAARREAQSAFGDPTVFCERYLPTGHHVEVQVMADTHGTVWAVGEREC SIQRRHQKIIEEAPSPLVERVPGMRAKLFDAARLAASAIGYTGAGTVEFLADDSPGRE GEFYFLEMNTRLQVEHPVTEETTGLDLVELQLMIADCGRLDTEPPPAQGYSIEARLYA EDPAHGWQPQAGVMHTIEVPGVRAQFDSLGQRTGIRLDSGIVDGSTVSIHYDPMLAKV VSYGATRRQAALVLADALVRARLHGLRTNRELLVNVLRHPAFLDGATDTGFFDTHGMA ELSTPLADTATLRLSAIAAALADAEHNRASAGVFSSIPSGWRNLASGYQVKTYRDDAD TEHRVEYRFTRTGLALPGDPVVQLVSADVDQVVLAQDGVAHGFTVARHGPDVYVDSAR GPVHLVALSRFPEPSSAVEQGSLVAPMPGNVIRIGAEVGDTVTAGQPLIWLEAMKMEH TIAAPADGVLTHVSVNTGQQVEVGAILARVEAPQNGPAEGDSP" misc_feature complement(1083867..1083920) /gene="accA2" /locus_tag="Rv0973c" /note="PS00188 Biotin-requiring enzymes attachment site" misc_feature complement(1084887..1084910) /gene="accA2" /locus_tag="Rv0973c" /note="PS00867 Carbamoyl-phosphate synthase subdomain signature 2" gene complement(1085756..1087345) /gene="accD2" /locus_tag="Rv0974c" /db_xref="GeneID:886064" CDS complement(1085756..1087345) /gene="accD2" /locus_tag="Rv0974c" /EC_number="6.4.1.-" /function="INVOLVED IN FATTY ACID METABOLISM." /note="Rv0974c, (MTV044.02c), len: 529 aa. Probable accD2, acetyl-/propionyl-CoA carboxylase (beta subunit) (EC 6.4.1.-), highly similar to many e.g. CAB95891.1|AL35998 putative acetyl/propionyl CoA carboxylase beta subunit from Streptomyces coelicolor (532 aa); NP_250704.1|NC_002516 probable acyl-CoA carboxyltransferase beta chain from Pseudomonas aeruginosa (535 aa); BAB16296.1|AB039884 acetyl-CoA carboxylase carboxyltransferase from Myxococcus xanthus (538 aa); NP_420973.1|NC_002696 putative propionyl-CoA carboxylase beta subunit from Caulobacter crescentus (530 aa); etc. Also similar to other from Mycobacterium tuberculosis: Rv2502c|ACCD1, Rv3799c|ACCD4, etc. COULD BELONG TO THE ACCD/PCCB FAMILY. TBparse score is 0.875." /codon_start=1 /transl_table=11 /product="acetyl-/propionyl-CoA carboxylase subunit beta" /protein_id="NP_215489.1" /db_xref="GI:15608114" /db_xref="GOA:O86318" /db_xref="UniProtKB/TrEMBL:O86318" /db_xref="GeneID:886064" /translation="MLQSTLDPNASAYDEAAATMSGKLDEINAELAKALAGGGPKYVD RHHARGNLTPRERIELLVDPDSPFLELSPLAAYGSNFQIGASLVTGIGAVCGVECMIV ANDPTVKGGTSNPWTLRKILRANQIAFENRLPVISLVESGGADLPTQKEIFIPGGQMF RDLTRLSAAGIPTIALVFGNSTAGGAYVPGMSDHVVMIKERSKVFLAGPPLVKMATGE ESDDESLGGAEMHARISGLADYFALDELDAIRIGRRIVARLNWIKQGPAPAPVTEPLF DAEELIGIVPPDLRIPFDPREVIARIVDGSEFDEFKPLYGSSLVTGWARLHGYPLGIL ANARGVLFSEESQKATQFIQLANRADTPLLFLHNTTGYMVGKDYEEGGMIKHGSMMIN AVSNSTVPHISLLIGASYGAGHYGMCGRAYDPRFLFAWPSAKSAVMGGAQLSGVLSIV ARAAAEARGQQVDEAADAAMRAAVEGQIEAESLPLVLSGMLYDDGVIDPRDTRTVLGM CLSAIANGPIKGTSNFGVFRM" gene complement(1087348..1088496) /gene="fadE13" /locus_tag="Rv0975c" /db_xref="GeneID:885856" CDS complement(1087348..1088496) /gene="fadE13" /locus_tag="Rv0975c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv0975c, (MTV044.03c), len: 382 aa. Probable fadE13, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. T35427 probable acyl-CoA dehydrogenase from Streptomyces coelicolor (382 aa); M74096|HUMACADL_1 Human long chain acyl-CoA dehydrogenase from Homo sapiens (430 aa), FASTA scores: opt: 819, E(): 0, (37.0% identity in 376 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. fadE20|Z98209|MTCY154_4 (386 aa), FASTA scores: (40.3% identity in 375 aa overlap). Contains PS00073 Acyl-CoA dehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE13" /protein_id="NP_215490.1" /db_xref="GI:15608115" /db_xref="GOA:O86319" /db_xref="UniProtKB/TrEMBL:O86319" /db_xref="GeneID:885856" /translation="MNIWTTPERQQLRKTVRAFAEREILPHVDEWERIGELPRGLHRL AGAAGLLGAGFPEAVGGGGGDGADPVIICEEMHQAGAPGGVYASLFTCGIAVPHMVAS GDERLIATYVRPTLAGEKIGALAITEPGGGSDVGHLRTSAVRDGDHYVINGAKTYITS GVRADYVVTAVRTGGPGAAGVSLLVVEKDTPGFEVTRKLDKMGWRSSDTAELCYTDVA VPATNLVGAENSGFTQIARAFVSERIGLAAQAYSSAQRCLDLTAQWCRDRETFGRPLI SRQSVQNTLAEMARRIDVARVYAHHVVERQLAGETDLIAQVCFAKNTAVQAGEWVANQ AVQLFGGMGYMAESEVERQYRDMRILGIGGGTTEILTALAAKTLGYQS" misc_feature complement(1087429..1087488) /gene="fadE13" /locus_tag="Rv0975c" /note="PS00073 Acyl-CoA dehydrogenases signature 2" gene complement(1088493..1090175) /locus_tag="Rv0976c" /db_xref="GeneID:885860" CDS complement(1088493..1090175) /locus_tag="Rv0976c" /function="UNKNOWN" /note="Rv0976c, (MTV044.04c), len: 560 aa. Conserved hypothetical protein, highly similar to others e.g. CAB95890.1|AL359988 conserved hypothetical protein from Streptomyces coelicolor (558 aa); P_251576.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (600 aa); etc. N-terminal part highly similar to AL035500|MLCL373_14 probable pseudogene from Mycobacterium leprae (163 aa), FASTA score: (50.0% identity in 122 aa overlap). TBparse score is 0.860." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215491.1" /db_xref="GI:15608116" /db_xref="UniProtKB/TrEMBL:O86320" /db_xref="GeneID:885860" /translation="MRIGNCSGFYGDRLSAMREMLTGGELDYLTGDYLAELTMLILGR DRMKNPDRGYAKTFLAQLEDCLGLAHDRGVRIVTNAGGLNPAGLANAVRALAARLGIP AQVAHVEGDDLQPRAAELGLGTPLTANAYLGAWGIVDCFERGADVVVTGRVTDASVVV GAAAAHFGWGRTDYHRLAGAVVAGHVIECGVQATGGNYAFFTEIGDLTHAGFPLAEIA ADGSSVITKHHGTGGLVSVDTITAQLLYEITGARYANPDVTARMDSVELSPDGPDRVR ISGVIGEPPPPTYKVSLNSIGGFRNAMTFVLTGLDIDAKADLVRRQLEAALTVKPAEL QWTLARTDHPDADTEETASALLTCVARDPDPANVGRQFSSAAVELALASYPGFTATAP PGDGQVYGVFTPGYVDAGKVAHIAVHADGTRTEIPCATETLELAPAHPPALPDPLPAG PTRRVPLGLIAGARSGDKGGSANVGVWVRTDEQWRWLAHTLTVELLKELLPETAGLVV TRHVLPNLRALNFVIEAILGQGVAYQARFDPQAKGLGEWLRSRHVEIPETLL" gene 1090373..1093144 /gene="PE_PGRS16" /locus_tag="Rv0977" /db_xref="GeneID:885264" CDS 1090373..1093144 /gene="PE_PGRS16" /locus_tag="Rv0977" /function="UNKNOWN" /note="Rv0977, (MTV044.05), len: 923 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to other PGRS-type sequences e.g. AL0091|MTV004_1 from Mycobacterium tuberculosis (1125 aa), FASTA score: (45.4% identity in 959 aa overlap); Z80225|MTCY441_4 from Mycobacterium tuberculosis (778 aa), FASTA score: (51.5% identity in 750 aa overlap); etc. TBparse score is 0.868." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177773.1" /db_xref="GI:57116805" /db_xref="UniProtKB/TrEMBL:Q79FU3" /db_xref="GeneID:885264" /translation="MSFVVTAPPVLASAASDLGGIASMISEANAMAAVRTTALAPAAA DEVSAAIAALFSSYARDYQTLSVQVTAFHVQFAQTLTNAGQLYAVVDVGNGVLLKTEQ QVLGVINAPTQTLVGRPLIGDGTHGAPGTGQNGGAGGILWGNGGNGGSGAPGQPGGRG GDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNGANGGSGGIGGAGGAGGNGGLLFGNG GAGGQGGSGGLGGSGGTGGAGMAAGPAGGTGGIGGIGGIGGAGGVGGHGSALFGHGGI NGDGGTGGMGGQGGAGGNGWAAEGITVGIGEQGGQGGDGGAGGAGGIGGSAGGIGGSQ GAGGHGGDGGQGGAGGSGGVGGGGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNGGRG GAGGMATAGSDGGNGGGGGNGGVGVGSAGGAGGTGGDGGAAGAGGAPGHGYFQQPAPQ GLPIGTGGTGGEGGAGGAGGDGGQGDIGFDGGRGGDGGPGGGGGAGGDGSGTFNAQAN NGGDGGAGGVGGAGGTGGTGGVGADGGRGGDSGRGGDGGNAGHGGAAQFSGRGAYGGE GGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNGGNGGNGGFGGINGTFGTNGAG GTGGLGTLLGGHNGNIGLNGATGGIGSTTLTNATVPLQLVNTTEPVVFISLNGGQMVP VLLDTGSTGLVMDSQFLTQNFGPVIGTGTAGYAGGLTYNYNTYSTTVDFGNGLLTLPT SVNVVTSSSPGTLGNFLSRSGAVGVLGIGPNNGFPGTSSIVTAMPGLLNNGVLIDESA GILQFGPNTLTGGITISGAPISTVAVQIDNGPLQQAPVMFDSGGINGTIPSALASLPS GGFVPAGTTISVYTSDGQTLLYSYTTTATNTPFVTSGGVMNTGHVPFAQQPIYVSYSP TAIGTTTFN" gene complement(1093361..1094356) /gene="PE_PGRS17" /locus_tag="Rv0978c" /db_xref="GeneID:885077" CDS complement(1093361..1094356) /gene="PE_PGRS17" /locus_tag="Rv0978c" /function="UNKNOWN" /note="Rv0978c, (MTV044.06c), len: 331 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to others e.g. Z95387|MTCY1A10_19 from Mycobacterium tuberculosis (461 aa), FASTA score: (73.6% identity in 277 aa overlap); etc. TBparse score is 0.861." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177774.1" /db_xref="GI:57116806" /db_xref="UniProtKB/TrEMBL:Q79FU2" /db_xref="GeneID:885077" /translation="MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAQD EVSTAIAALFGSHGQHYQAISAQVAAYQQRFVLALSQAGSTYAVAEAASATPLQNVLD AINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAG LIGNGGAGGTGGAVSLARAGTAGGAGRGPVGGIGGAGGVGGAGGAAGAVTTITHASFN DPHGVAVNPGGNVYVTNFGSGTVSVINPATNTVTGSPITIGNGPSGVAVSPVTGLVFV TNFDSNTVSVIDPTTNTVTGSPITVGTAPTGVAVNPVTGEVYVTNFAGDTVSVIS" gene complement(1094670..1094864) /locus_tag="Rv0979c" /db_xref="GeneID:885063" CDS complement(1094670..1094864) /locus_tag="Rv0979c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0979c, (MTV044.07c), len: 64 aa (unlikely ORF). Hypothetical unknown protein. Start codon changed since first submission (-44 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215494.2" /db_xref="GI:57116807" /db_xref="UniProtKB/TrEMBL:O53892" /db_xref="GeneID:885063" /translation="MGFRTQVGAATIASTMTWRIPVEDGPAQFRAGVGPGRDRQFTVV APMVVGLWDRNRRPGWQWPS" gene 1094886..1095059 /gene="rpmF" /locus_tag="Rv0979A" /db_xref="GeneID:3205057" CDS 1094886..1095059 /gene="rpmF" /locus_tag="Rv0979A" /function="INVOLVED IN TRANSLATION MECHANISM." /note="some L32 proteins have zinc finger motifs consisting of CXXC while others do not" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L32" /protein_id="YP_177635.1" /db_xref="GI:57116808" /db_xref="GOA:P58287" /db_xref="UniProtKB/Swiss-Prot:P58287" /db_xref="GeneID:3205057" /translation="MAVPKRRKSRSNTRSRRSQWKAAKTELVGVTVAGHAHKVPRRLL KAARLGLIDFDKR" gene complement(1095078..1096451) /gene="PE_PGRS18" /locus_tag="Rv0980c" /db_xref="GeneID:885327" CDS complement(1095078..1096451) /gene="PE_PGRS18" /locus_tag="Rv0980c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0980c, (MTV044.08c), len: 457 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002), highly similar to others e.g. Z95387|MTCY1A10_19 from Mycobacterium tuberculosis (461 aa), FASTA score: (66.7% identity in 405 aa overlap); Z95844|MTCY493_2 from Mycobacterium tuberculosis (741 aa), FASTA score: (53.0% identity in 394 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177775.1" /db_xref="GI:57116809" /db_xref="UniProtKB/TrEMBL:Q79FU0" /db_xref="GeneID:885327" /translation="MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAHD EVSTAIAALFGSHGQHYQAISAQVAAYQERFVLALSQASSTYAVAEAASATPLQNVLD AINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAG LIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDG GTGGVGGHGGLIGVGGHGGDGGTGGTGGAVSLARAGTAGGAGGGPAGGIGGAGGVGGA GGAAGAVTTITHASFNDPHGVAVNPGGNIYVTNQGSNTVSVIDPVTNTVTGSITDGNG PSGVAVSPVTGLVFVTNFDSNTVSVIDPNTNTVTGSIPVGTGAYGVAVNPGGNIYVTN QFSNTVSVIDPATNTVTGSPIPVGLDPTGVAVNPVTGVVYVTNSLDDTVSVITGEPAR SVCSAAI" gene 1096816..1097508 /gene="mprA" /locus_tag="Rv0981" /db_xref="GeneID:885038" CDS 1096816..1097508 /gene="mprA" /locus_tag="Rv0981" /function="REGULATOR PART OF A TWO COMPONENT REGULATORY SYSTEM (SUPPOSED MPRAB SYSTEM)." /note="Rv0981, (MTV044.09), len: 230 aa. mprA, mycobacterial persistence regulator, a two-component response regulator whose expression is required for entrance into and maintenance of persistent infection (see citation below), equivalent to NP_301250.1|NC_002677 putative two-component response regulator from Mycobacterium leprae (228 aa); and highly similar to others from Mycobacterium leprae. Also highly similar to others e.g. AAG36759.1|AF119221_1|AF119221 response regulator from Corynebacterium glutamicum (232 aa); CAB88489.1|AL353816 putative two-component system response regulator from Streptomyces coelicolor (248 aa); BJY09666_1 two-component response regulator (ragA, ragB and rpoH3) from B.japonicum (226 aa), FASTA score: (43.8% identity in 224 aa overlap); BSAJ2571_44 two-component response regulator from Bacillus subtilis (228 aa), FASTA score: (46.4% identity in 224 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv1033c (257 aa); Rv0903c (236 aa), FASTA score: (50.7 identity in 225 aa overlap); etc. Contains PS00217 Sugar transport proteins signature 2." /codon_start=1 /transl_table=11 /product="two component response transcriptional regulatory protein MprA" /protein_id="NP_215496.1" /db_xref="GI:15608121" /db_xref="GOA:O53894" /db_xref="UniProtKB/TrEMBL:O53894" /db_xref="GeneID:885038" /translation="MSVRILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIAS DRPDALVLDVMMPRLDGLEVCRQLRGTGDDLPILVLTARDSVSERVAGLDAGADDYLP KPFALEELLARMRALLRRTKPEDAAESMAMRFSDLTLDPVTREVNRGQRRISLTRTEF ALLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYVGYLRRKTEADGEPRLIHTV RGVGYVLRETPP" misc_feature 1097083..1097160 /gene="mprA" /locus_tag="Rv0981" /note="PS00217 Sugar transport proteins signature 2" gene 1097508..1099022 /gene="mprB" /locus_tag="Rv0982" /db_xref="GeneID:885062" CDS 1097508..1099022 /gene="mprB" /locus_tag="Rv0982" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM (SUPPOSED MPRAB SYSTEM)." /note="Rv0982, (MTV044.10), len: 504 aa. Probable mprB, two component sensor kinase, transmembrane protein (EC 2.7.3.-) (see citation below), equivalent to AL035500|MLCL373_16|NP_301251.1|NC_002677 putative two-component system sensor kinase from Mycobacterium leprae (519 aa), FASTA score: (81.0% identity in 521 aa overlap). Also highly similar to others (especially in C-terminal part) e.g. AAG36760.1|AF119221_2|AF119221 sensor kinase from Corynebacterium glutamicum (455 aa); CAB89748.1|AL354616 putative two-component histidine kinase from Streptomyces coelicolor (481 aa); X58793|SLCUTRS_2 sensor kinase from S.lividans (414 aa), FASTA scores: opt: 451, E(): 4.2e-21, (36.0% identity in 303 aa overlap); P30847|BAES_ECOLI SENSOR PROTEIN (EC 2.7.3.-) from Escherichia coli (467 aa), FASTA scores: opt: 412, E(): 1.3e-18, (30.4% identity in 336 aa overlap); etc. Also similar in C-terminal region to C-terminus of Rv0902c|Z73101|MTCY31_33 from Mycobacterium tuberculosis (446 aa), FASTA scores: opt: 423, E(): 2.6e-19, (28.4 identity in 462 aa overlap)." /codon_start=1 /transl_table=11 /product="two component sensor kinase MprB" /protein_id="NP_215497.1" /db_xref="GI:15608122" /db_xref="GOA:O53895" /db_xref="UniProtKB/TrEMBL:O53895" /db_xref="GeneID:885062" /translation="MWWFRRRDRAPLRATSSLSLRWRVMLLAMSMVAMVVVLMSFAVY AVISAALYSDIDNQLQSRAQLLIASGSLAADPGKAIEGTAYSDVNAMLVNPGQSIYTA QQPGQTLPVGAAEKAVIRGELFMSRRTTADQRVLAIRLTNGSSLLISKSLKPTEAVMN KLRWVLLIVGGIGVAVAAVAGGMVTRAGLRPVGRLTEAAERVARTDDLRPIPVFGSDE LARLTEAFNLMLRALAESRERQARLVTDAGHELRTPLTSLRTNVELLMASMAPGAPRL PKQEMVDLRADVLAQIEELSTLVGDLVDLSRGDAGEVVHEPVDMADVVDRSLERVRRR RNDILFDVEVIGWQVYGDTAGLSRMALNLMDNAAKWSPPGGHVGVRLSQLDASHAELV VSDRGPGIPVQERRLVFERFYRSASARALPGSGLGLAIVKQVVLNHGGLLRIEDTDPG GQPPGTSIYVLLPGRRMPIPQLPGATAGARSTDIENSRGSANVISVESQSTRAT" gene 1099066..1100460 /gene="pepD" /locus_tag="Rv0983" /db_xref="GeneID:885382" CDS 1099066..1100460 /gene="pepD" /locus_tag="Rv0983" /EC_number="3.4.21.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS (SEEMS TO CLEAVE PREFERENTIALLY AFTER SERINE RESIDUES)." /experiment="experimental evidence, no additional details recorded" /note="Rv0983, (MTV044.11), len: 464 aa. Probable pepD (alternate gene name: mtb32b), secreted or membrane serine protease (EC 3.4.21.-) (see citation below), equivalent (but longer 18 aa in N-terminus) to AL035500|MLCL373_17|T45448 probable serine proteinase (EC 3.4.21.-) from Mycobacterium leprae (452 aa), FASTA score: (74.2% identity in 466 aa overlap); and highly similar to others from Mycobacterium leprae. Also highly similar (except in N-terminus) to other proteases e.g. CAC01350.1|AL390975 putative protease from Streptomyces coelicolor (542 aa); NP_440705.1|NC_000911|HtrA serine protease from Synechocystis sp. (452 aa); NP_346646.1|NC_003028 serine protease from Streptococcus pneumoniae (393 aa); etc. Also similar in part to members of the htrA-antigen family e.g. U87242|MTU87242_3|HtrA serine protease from M. tuberculosis (542 aa), FASTA scores: opt: 846, E(): 2e-28, (40.6% identity in 392 aa overlap); and similar to other hypothetical serine proteases e.g. Rv0983, Rv0125, etc. BELONGS TO THE SERINE PROTEASE FAMILY.; mtb32b" /codon_start=1 /transl_table=11 /product="serine protease PepD" /protein_id="NP_215498.1" /db_xref="GI:15608123" /db_xref="GOA:O53896" /db_xref="UniProtKB/TrEMBL:O53896" /db_xref="GeneID:885382" /translation="MAKLARVVGLVQEEQPSDMTNHPRYSPPPQQPGTPGYAQGQQQT YSQQFDWRYPPSPPPQPTQYRQPYEALGGTRPGLIPGVIPTMTPPPGMVRQRPRAGML AIGAVTIAVVSAGIGGAAASLVGFNRAPAGPSGGPVAASAAPSIPAANMPPGSVEQVA AKVVPSVVMLETDLGRQSEEGSGIILSAEGLILTNNHVIAAAAKPPLGSPPPKTTVTF SDGRTAPFTVVGADPTSDIAVVRVQGVSGLTPISLGSSSDLRVGQPVLAIGSPLGLEG TVTTGIVSALNRPVSTTGEAGNQNTVLDAIQTDAAINPGNSGGALVNMNAQLVGVNSA IATLGADSADAQSGSIGLGFAIPVDQAKRIADELISTGKASHASLGVQVTNDKDTLGA KIVEVVAGGAAANAGVPKGVVVTKVDDRPINSADALVAAVRSKAPGATVALTFQDPSG GSRTVQVTLGKAEQ" gene 1100460..1101005 /gene="moaB2" /locus_tag="Rv0984" /db_xref="GeneID:885378" CDS 1100460..1101005 /gene="moaB2" /locus_tag="Rv0984" /EC_number="4.2.1.96" /function="THOUGHT TO BE INVOLVED IN MOLYBDOPTERIN BIOSYNTHESIS. CATALYZES THE DEHYDRATATION OF 4A-HYDROXYTETRAHYDROPTERINS [CATALYTIC ACTIVITY: (6R)-6-(L-ERYTHRO-1,2-DIHYDROXYPROPYL)-5,6,7,8-TETRAHYDRO -4 A-HYDROXYPTERIN = (6R)-6-(L-ERYTHRO-1,2- DIHYDROXYPROPYL)-7,8-DIHYDRO-6H-PTERIN + H(2)O.]." /experiment="experimental evidence, no additional details recorded" /note="Rv0984, (MTV044.12), len: 181 aa. Possible moaB2, pterin-4-alpha-carbinolamine dehydratase (EC 4.2.1.96), highly similar to NP_301253.1|NC_002677 putative molybdenum cofactor biosynthesis protein from Mycobacterium leprae (181 aa), FASTA score: (92.3% identity in 181 aa overlap). Also similar to others e.g. CAB59675.1|AL132674 molybdenum cofactor biosynthesis protein from Streptomyces coelicolor (179 aa); Q56208|MOCB_SYNP7 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN CB from Synechococcus sp. (319 aa), FASTA score: (37.3% identity in 142 aa overlap); C-terminus of NP_197599.1|NC_003076 MOLYBDOPTERIN BIOSYNTHESIS CNX1 PROTEIN from Arabidopsis thaliana (670 aa); etc. Also similar to Rv0865|MOG from Mycobacterium tuberculosis (160 aa); and other mog proteins e.g. CAC39235.1|AJ312124 Mog protein from Eubacterium acidaminophilum (162 aa). COULD BELONG TO THE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE FAMILY. Alternative start codon has been suggested in position 1100508." /codon_start=1 /transl_table=11 /product="pterin-4-alpha-carbinolamine dehydratase MoaB2" /protein_id="NP_215499.1" /db_xref="GI:15608124" /db_xref="GOA:O53897" /db_xref="UniProtKB/TrEMBL:O53897" /db_xref="GeneID:885378" /translation="MKVAAQCSKLGYTVAPMEQRAELVVGRALVVVVDDRTAHGDEDH SGPLVTELLTEAGFVVDGVVAVSADEVEIRNALNTAVIGGVDLVVSVGGTGVTPRDVT PEATRDILDREILGIAEAIRASGLSAGIVDAGLSRGLAGVSGSTLVVNLAGSRYAVRD GMATLNPLAAQIIGQLSSLEI" gene complement(1101025..1101480) /gene="mscL" /locus_tag="Rv0985c" /db_xref="GeneID:885368" CDS complement(1101025..1101480) /gene="mscL" /locus_tag="Rv0985c" /function="ION CHANNEL THAT OPENS IN RESPONSE TO STRETCH FORCES IN THE MEMBRANE LIPID BILAYER. MAY PARTICIPATE IN THE REGULATION OF OSMOTIC PRESSURE CHANGES WITHIN THE CELL." /note="forms homopentamer; channel that opens in response to pressure or hypoosmotic shock" /codon_start=1 /transl_table=11 /product="large-conductance mechanosensitive channel" /protein_id="NP_215500.1" /db_xref="GI:15608125" /db_xref="GOA:O53898" /db_xref="UniProtKB/Swiss-Prot:O53898" /db_xref="GeneID:885368" /translation="MLKGFKEFLARGNIVDLAVAVVIGTAFTALVTKFTDSIITPLIN RIGVNAQSDVGILRIGIGGGQTIDLNVLLSAAINFFLIAFAVYFLVVLPYNTLRKKGE VEQPGDTQVVLLTEIRDLLAQTNGDSPGRHGGRGTPSPTDGPRASTESQ" gene 1101803..1102549 /locus_tag="Rv0986" /db_xref="GeneID:885364" CDS 1101803..1102549 /locus_tag="Rv0986" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF ADHESION COMPONENT ACROSS THE MEMBRANE: INVOLVED IN ATACHMENT AND VIRULENCE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv0986, (MTV044.14), len: 248 aa. Probable ATP-binding protein ABC transporter supposed involved in transport of adhesion component (see citation below), highly similar to many ATP-binding proteins e.g. AE0010|AE001033_8 ABC transporter ATP-binding protein from Archaeoglobus fulgidus (228 aa), FASTA scores: opt: 669, E(): 0, (45.7% identity in 219 aa overlap); CAB81857.1|AL161691 putative ABC-transporter ATP-binding protein from Streptomyces coelicolor (246 aa); X84019|ZMDNAGRP_4 glutamate uptake regulatory protein (grp) from Z.mobilis (232 aa), FASTA score: (44.4% identity in 225 aa overlap); Z99111|BSUB0008_108 from Bacillus subtilis (230 aa), FASTA score: (38.7% identity in 222 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="adhesion component transport ATP-binding protein ABC transporter" /protein_id="NP_215501.1" /db_xref="GI:15608126" /db_xref="GOA:O53899" /db_xref="UniProtKB/Swiss-Prot:O53899" /db_xref="GeneID:885364" /translation="MNRQPIVQLSNLSWTFREGETRRQVLDHITFDFEPGEFVALLGQ SGSGKSTLLNLISGIEKPTTGDVTINGFAITQKTERDRTLFRRDQIGIVFQFFNLIPT LTVLENITLPQELAGVSQRKAAVVARDLLEKVGMADRERTFPDKLSGGEQQRVAISRA LAHNPMLVLADEPTGNLDSDTGDKVLDVLLDLTRQAGKTLIMATHSPSMTQHADRVVN LQGGRLIPAVNRENQTDQPASTILLPTSYE" misc_feature 1101929..1101952 /locus_tag="Rv0986" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1102241..1102285 /locus_tag="Rv0986" /note="PS00211 ABC transporters family signature" gene 1102542..1105109 /locus_tag="Rv0987" /db_xref="GeneID:885363" CDS 1102542..1105109 /locus_tag="Rv0987" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF ADHESION COMPONENT ACROSS THE MEMBRANE: INVOLVED IN ATACHMENT AND VIRULENCE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv0987, (MTV044.15, MTCI237.01), len: 855 aa. Probable transmembrane protein ABC transporter supposed involved in transport of adhesion component (see citation below), whose N-terminus shows similarity with hypothetical proteins, generally transmembrane proteins, e.g. CAB96016.1|AL360055 putative ABC transport system integral membrane protein from Streptomyces coelicolor (855 aa); P44252|YCFU_HAEIN|HI1555 HYPOTHETICAL PROTEIN from Haemophilus influenzae (393 aa), FASTA scores: opt: 265, E(): 1.7e-09, (23.6% identity in 402 aa overlap); etc. N- and C-termini respectively show similarity to O32735 ATTF PROTEIN (420 aa), FASTA scores: E(): 1e-09, (26.7% identity in 430 aa overlap), and G2340078 ATTG PROTEIN (359 aa), FASTA scores: E(): 2.7e-08, (27.8% identity in 356 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="adhesion component transport transmembrane protein ABC transporter" /protein_id="NP_215502.1" /db_xref="GI:15608127" /db_xref="GOA:O53900" /db_xref="UniProtKB/TrEMBL:O53900" /db_xref="GeneID:885363" /translation="MNDQAPVAYAPLWRTAWRRLRQRPFQYILLVLGIALGVAMIVAI DVSSNSAQRAFDLSAAAITGKSTHRLVSGPAGVDQQLYVDLRRHGYDFSAPVIEGYVL ARGLGNRAMQFMGTDPFAESAFRSPLWSNQNIAELGGFLTRPNGVVLSRQVAQKYGLA VGDRIALQVKGAPTTVTLVGLLTPADEVSNQKLSDLIIADISTAQELFHMPGRLSHID LIIKDEATATRIQQRLPAGVRMETSDTQRDTVKQMTDAFTVNLTALSLIALLVGIFLI YNTVTFNVVQRRPFFAILRCLGVTREQLFWLIMTESLVAGLIGTGLGLLIGIWLGEGL IGLVTQTINDFYFVINVRNVSVSAESLLKGLIIGIFAAMLATLPPAIEAMRTVPASTL RRSSLESKITKLMPWLWVAWFGLGSFGVLMLWLPGNNLVVAFVGLFSVLIALALIAPP LTRFVMLRLAPGLGRLLGPIGRMAPRNIVRSLSRTSIAIAALMMAVSLMVGVSISVGS FRQTLANWLEVTLKSDVYVSPPTLTSGRPSGNLPVDAVRNISKWPGVRDAVMARYSSV FAPDWGREVELMAVSGDISDGKRPYRWIDGNKDTLWPRFLAGKGVMLSEPMVSRQHLQ MPPRPITLMTDSGPQTFPVLAVFSDYTSDQGVILMDRASYRAHWQDDDVTTMFLFLAS GANSGALIDQLQAAFAGREDIVIQSTHSVREASMFIFDRSFTITIALQLVATVVAFIG VLSALMSLELDRAHELGVFRAIGMTTRQLWKLMFIETGLMGGMAGLMALPTGCILAWI LVRIINVRSFGWTLQMHFESAHFLRALLVAVVAALAAGMYPAWRLGRMTIRTAIREE" misc_feature 1102716..1102739 /locus_tag="Rv0987" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1105116..1106276 /locus_tag="Rv0988" /db_xref="GeneID:885353" CDS 1105116..1106276 /locus_tag="Rv0988" /function="UNKNOWN" /note="Rv0988, (MTCI237.02), len: 386 aa. Possible conserved exported protein, with potential N-terminal signal sequence, similar (except in N-terminus) to O32737|L63540 ATTH PROTEIN from Agrobacterium tumefaciens (355 aa), FASTA scores: opt: 651, E(): 5.7e-33, (33.4% identity in 344 aa overlap); and NP_231265.1|NC_002505 conserved hypothetical protein from Vibrio cholerae (372 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215503.1" /db_xref="GI:15608128" /db_xref="UniProtKB/TrEMBL:O86370" /db_xref="GeneID:885353" /translation="MRKAGLTGVVLVLTLTLVAFWWWQRPRTNAVAADSLVGVLVDEN NAGYSLATVPGAVRFPRDLGPHYDYQTEWWYYTGNLETADGRLFGYQLTFFRRALAPP GEGVAIADASSWRTTQVYMAHFAISDISNRGFYPAEKFSRQALGLAGASSEPYAVWLD DWYARESNNNSVQLFARTQNTVLDLTLTQTLPPILQGNAGLSVKGAQPGNASNYYSLV RQESRGTVSVNGDTFMVSGLSWKDHEYMTSALAPEDVGWDWFGLQFYNGTALMLFQIR QADGSVTRFSSGTFVAGDGGVIPLESSDFRIKTTDRWTSDQSGATYPIAWEIEIERIG LTLRGAALMANQELRLSRTYWEGAVALEGRYQGMPISGRGYVEMTGYVQRLS" gene complement(1106405..1107382) /gene="grcC2" /locus_tag="Rv0989c" /db_xref="GeneID:885355" CDS complement(1106405..1107382) /gene="grcC2" /locus_tag="Rv0989c" /EC_number="2.5.1.-" /function="POSSIBLE SUPPLIER OF POLYPRENYL DIPHOSPHATE." /note="Rv0989c, (MTCI237.03c), len: 325 aa. Probable grcC2, polyprenyl diphosphate synthetase (EC 2.5.1.-), highly similar to NP_302483.1|NC_002677 polyprenyl diphosphate synthase component from Mycobacterium leprae (330 aa). Also similar to others (generally hepta (EC 2.5.1.30) or hexaprenyl e.g. NP_471378.1|NC_003212 protein similar to heptaprenyl diphosphate synthase component II (menaquinone biosynthesis) from Listeria innocua (321 aa); NP_371994.1|NC_002758 heptaprenyl diphosphate syntase component II from Staphylococcus aureus subsp. aureus Mu50 (319 aa); P55785|HEP2_BACST heptaprenyl diphosphate synthase component from Bacillus subtilis (323 aa), FASTA scores: opt: 496, E(): 1.4e-24, (31.4% identity in 306 aa overlap); etc. Also highly similar to Mycobacterium tuberculosis proteins e.g. Rv0562|grcC1|NP_215076.1|MTCY25D10.41 PROBABLE POLYPRENYL-DIPHOSPHATE SYNTHASE (335 aa); Rv3383, Rv3398c, Rv2173, etc. SEEMS TO BELONG TO THE FPP/GGPP SYNTHETASES FAMILY." /codon_start=1 /transl_table=11 /product="polyprenyl-diphosphate synthase" /protein_id="NP_215504.1" /db_xref="GI:15608129" /db_xref="GOA:O05572" /db_xref="UniProtKB/TrEMBL:O05572" /db_xref="GeneID:885355" /translation="MIPAVSLGDPQFTANVHDGIARITELINSELSQADEVMRDTVAH LVDAGGTPFRPLFTVLAAQLGSDPDGWEVTVAGAAIELMHLGTLCHDRVVDESDMSRK TPSDNTRWTNNFAILAGDYRFATASQLASRLDPEAFAVVAEAFAELITGQMRATRGPA SHIDTIEHYLRVVHEKTGSLIAASGQLGAALSGAAEEQIRRVARLGRMIGAAFEISRD IIAISGDSATLSGADLGQAVHTLPMLYALREQTPDTSRLRELLAGPIHDDHVAEALTL LRCSPGIGKAKNVVAAYAAQAREELPYLPDRQPRRALATLIDHAISACD" gene complement(1107443..1108099) /locus_tag="Rv0990c" /db_xref="GeneID:885343" CDS complement(1107443..1108099) /locus_tag="Rv0990c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0990c, (MTCI237.04c), len: 218 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215505.1" /db_xref="GI:15608130" /db_xref="UniProtKB/TrEMBL:O05573" /db_xref="GeneID:885343" /translation="MAESSLNPSLVSRISAFLRPDWTRTVRARRFAAAGLVMLAGVAA LRSNPEDDRSEVVVAAHDLRPGTALTPGDVRLEKRSATTLPDGSQADLDAVVGSTLAS PTRRGEVLTDVRLLGSRLAESTAGPDARIVPLHLADSALVDLVRVGDVVDVLAAPVTD SPAALRLLATDAIVVLVSAQQKAQAADSDRVVLVALPARLANTVAGAALGQTVTLTLH" gene complement(1108172..1108504) /locus_tag="Rv0991c" /db_xref="GeneID:885350" CDS complement(1108172..1108504) /locus_tag="Rv0991c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv0991c, (MTCI237.05c), len: 110 aa. Conserved hypothetical ser-rich protein (especially in C-terminus), highly similar to N-terminus of NP_301255.1|NC_002677 conserved hypothetical protein (Ser-rich C-terminus) from Mycobacterium leprae (99 aa). Also highly similar to SCE22.04|AB90971.1|AL355832 hypothetical protein from Streptomyces coelicolor (110 aa); and similar to others." /codon_start=1 /transl_table=11 /product="putative serine rich protein" /protein_id="NP_215506.1" /db_xref="GI:15608131" /db_xref="UniProtKB/TrEMBL:O05574" /db_xref="GeneID:885350" /translation="MPTYSYECTQCANRFDVVQAFTDDALTTCERCSGRLRKLFNAVG VVFKGTGFYRTDSRESGKKSKSQTNGSSTSESTKSSGSSGSSGSSESKASGSTEKSTS STTAAAAV" gene complement(1108578..1109171) /locus_tag="Rv0992c" /db_xref="GeneID:885337" CDS complement(1108578..1109171) /locus_tag="Rv0992c" /function="UNKNOWN" /note="Rv0992c, (MTCI237.06c), len: 197 aa. Conserved hypothetical protein, equivalent to NP_301256.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (197 aa). Also similar, except in N-terminus, to other hypothetical proteins and ligases e.g. SCE87.34|CAB59679.1|AL132674 hypothetical protein from Streptomyces coelicolor (204 aa); NP_461977.1|NC_003197 putative ligase from Salmonella typhimurium (182 aa); P09160|YGFA_ECOLI HYPOTHETICAL 21.1 kDa PROTEIN from Escherichia coli (182 aa), FASTA scores: opt: 191, E(): 1.1e-09, (29.5% identity in 146 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215507.1" /db_xref="GI:15608132" /db_xref="GOA:O05575" /db_xref="UniProtKB/TrEMBL:O05575" /db_xref="GeneID:885337" /translation="MAMASKSALRDQLLAARRRVADDVRAAEARMLRGHLERMVTSDS TVCAYVPVGGEPGSIEMLDVLLRRAGRVLLPVARTAGGDLPLPLRWGEYRAGGLARAR WGLLEPPEPWLPEAALAQASLVLVPALAVDRQGVRLGRGRGFYDRSLRCRDPHARLVA VVRTVELVDVLPSEPHDVPMTHALTPERGLIALPCGE" gene 1109272..1110192 /gene="galU" /locus_tag="Rv0993" /db_xref="GeneID:885341" CDS 1109272..1110192 /gene="galU" /locus_tag="Rv0993" /EC_number="2.7.7.9" /function="MAY PLAY A ROLE IN STATIONARY PHASE SURVIVAL [CATALYTIC ACTIVITY: UTP + alpha-D-glucose 1-phosphate = diphosphate + UDP-glucose]." /note="Rv0993, (MTCI237.07), len: 306 aa. Probable galU, UTP--glucose-1-phosphate uridylyltransferase (EC 2.7.7.9), equivalent to AL035500|MLCL373_22 putative UTP-glucose-1-phosphate uridylyltransferase from Mycobacterium leprae (306 aa), FASTA score: (89.7% identity in 302 aa overlap). Also highly similar to others e.g. AB59678.1|AL132674 UTP-glucose-1-phosphate uridylyltransferase from Streptomyces coelicolor (303 aa); NP_244519.1|NC_002570 UTP-glucose-1-phosphate uridylyltransferase from Bacillus halodurans (297 aa); P25520|GALU_ECOLI|B1236|Z2012|ECS17 UTP--glucose-1-phosphate uridylyltransferase from Escherichia coli strains K12 and O157:H7 (301 aa), FASTA scores: opt: 624, E(): 2.4e-33, (38.8% identity in 299 aa overlap); etc. BELONGS TO THE PROKARYOTIC UDPGP FAMILY." /codon_start=1 /transl_table=11 /product="UTP--glucose-1-phosphate uridylyltransferase GalU" /protein_id="NP_215508.1" /db_xref="GI:15608133" /db_xref="GOA:O05576" /db_xref="UniProtKB/TrEMBL:O05576" /db_xref="GeneID:885341" /translation="MSRPEVLTPFTAIVPAAGLGTRFLPATKTVPKELLPVVDTPGIE LVAAEAAAAGAERLVIVTSEGKDGVVAHFVEDLVLEGTLEARGKIAMLAKVRRAPALI KVESVVQAEPLGLGHAIGCVEPTLSPDEDAVAVLLPDDLVLPTGVLETMSKVRASRGG TVLCAIEVAREEISAYGVFDVEPVPDGDYTDDPNVLKVRGMVEKPKAETAPSRYAAAG RYVLDRAIFDALRRIDQGAGGEVQLTDAIALLIAEGHPVHVVVHQGSRHDLGNPGGYL KAAVDFALDRDDYGPDLRRWLVARLGLTEQ" gene 1110269..1111549 /gene="moeA1" /locus_tag="Rv0994" /db_xref="GeneID:885404" CDS 1110269..1111549 /gene="moeA1" /locus_tag="Rv0994" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS: INVOLVED IN THE BIOSYNTHESIS OF A DEMOLYBDO-COFACTOR (MOLYBDOPTERIN), NECESSARY FOR MOLYBDO-ENZYMES." /note="Rv0994, (MTCI237.08), len: 426 aa. Probable moeA1, molybdenum cofactor biosynthesis protein, equivalent to AL035500|MLCL373_23 putative molybdopterin biosynthesis protein from Mycobacterium leprae (424 aa), FASTA score: (88.3% identity in 426 aa overlap). Also highly similar to many e.g. CAB59677.1|AL132674 molybdopterin biosynthesis protein from Streptomyces coelicolor (424 aa); NP_385769.1|NC_003047 PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Sinorhizobium meliloti (406 aa); P12281|MOEA_ECOLI molybdopterin biosynthesis moea protein from Escherichia coli (411 aa), FASTA scores: opt: 519, E(): 1.3e-24, (32.3% identity in 402 aa overlap); etc. Also similar to MOEA2|Rv0438c|MTV037.02c PROBABLE MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis (405 aa). Note that previously known as moeA.; moeA" /codon_start=1 /transl_table=11 /product="molybdopterin biosynthesis protein MoeA1" /protein_id="YP_177776.1" /db_xref="GI:57116810" /db_xref="GOA:O05577" /db_xref="UniProtKB/Swiss-Prot:O05577" /db_xref="GeneID:885404" /translation="MRSVEEQQARISAAAVAPRPIRVAIAEAQGLMCAEEVVTERPMP GFDQAAIDGYAVRSVDVAGVGDTGGVQVFADHGDLDGRDVLTLPVMGTIEAGARTLSR LQPRQAVRVQTGAPLPTLADAVLPLRWTDGGMSRVRVLRGAPSGAYVRRAGDDVQPGD VAVRAGTIIGAAQVGLLAAVGRERVLVHPRPRLSVMAVGGELVDISRTPGNGQVYDVN SYALAAAGRDACAEVNRVGIVSNDPTELGEIVEGQLNRAEVVVIAGGVGGAAAEAVRS VLSELGEMEVVRVAMHPGSVQGFGQLGRDGVPTFLLPANPVSALVVFEVMVRPLIRLS LGKRHPMRRIVSARTLSPITSVAGRKGYLRGQLMRDQDSGEYLVQALGGAPGASSHLL ATLAEANCLVVVPTGAEQIRTGEIVDVAFLAQHG" gene 1111612..1112223 /gene="rimJ" /locus_tag="Rv0995" /db_xref="GeneID:885399" CDS 1111612..1112223 /gene="rimJ" /locus_tag="Rv0995" /EC_number="2.3.1.128" /function="THIS ENZYME ACETYLATES THE N-TERMINAL ALANINE OF RIBOSOMAL PROTEIN S5 [CATALYTIC ACTIVITY: ACETYL-CoA + RIBOSOMAL-PROTEIN L-ALANINE = CoA + RIBOSOMAL-PROTEIN N-ACETYL-L-ALANINE]." /note="Rv0995, (MTCI237.09), len: 203 aa. Possible rimJ, ribosomal-protein-alanine acetyltransferase (EC 2.3.1.128), equivalent to AL035500|MLCL373_24 probable acyltransferase from Mycobacterium leprae (218 aa), FASTA scores: (86.0% identity in 200 aa overlap). Also similar to others and many acyltransferases e.g. BAB69252.1|AB070946 possible acyltransferase from Streptomyces avermitilis (156 aa); NP_385025.1|NC_003047 PROBABLE RIBOSOMAL-PROTEIN-ALANINE ACETYLTRANSFERASE from Sinorhizobium meliloti (203 aa); P09454|RIMJ_ECOLI|B1066|Z1703|ECS1444 ribosomal-protein-alanine acetyltransferase from Escherichia coli strains K12 and O157:H7 (194 aa), FASTA scores: opt: 247, E(): 1.5e-10, (26.9% identity in 186 aa overlap). SEEMS TO BELONG TO THE ACETYLTRANSFERASE FAMILY, RIMJ SUBFAMILY." /codon_start=1 /transl_table=11 /product="ribosomal-protein-alanine acetyltransferase" /protein_id="NP_215510.1" /db_xref="GI:15608135" /db_xref="GOA:O05578" /db_xref="UniProtKB/TrEMBL:O05578" /db_xref="GeneID:885399" /translation="MAVGPLRVSAGVIRLRPVRMRDGVHWSRIRLADRAHLEPWEPSA DGEWTVRHTVAAWPAVCSGLRSEARNGRMLPYVIELDGQFCGQLTIGNVTHGALRSAW IGYWVPSAATGGGVATGALALGLDHCFGPVMLHRVEATVRPENAASRAVLAKVGFREE GLLRRYLEVDRAWRDHLLMAITVEEVYGSVASTLVRAGHASWP" gene 1112384..1113460 /locus_tag="Rv0996" /db_xref="GeneID:885393" CDS 1112384..1113460 /locus_tag="Rv0996" /function="UNKNOWN" /note="Rv0996, (MTCI237.10), len: 358 aa. Probable conserved transmembrane protein, equivalent to AL035500|MLCL373_25 putative membrane protein from Mycobacterium leprae (342 aa), FASTA scores: (66.4% identity in 360 aa overlap). Contains possible signal sequence and other hydrophobic domains." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215511.1" /db_xref="GI:15608136" /db_xref="UniProtKB/TrEMBL:O05579" /db_xref="GeneID:885393" /translation="MPSIPQSLLWISLVVLWLFVLVPMLISKRDAVRRTSDVALATRV LNGGAGARLLKRGGPAAGHRWGYLPPEGQGDDPDWKPEEDWRDDPVEDGFADVEHDID EDQEADDARRRGAVVMKVAAPQTAGADEPDYLDVDVVEEDSEALPVGAGAAVGESADE ADAEAADGVAGHADPEADPVEYEYEYEYVEDTCGLELEEDDQEAPPTVASGTSRRRRF DTKTAAAVSARKYTFRKRALIVMAVILVGSAAAAFELTPVAWWICGSATGVTVLYLAY LRRQTRIEEKVRRRRMQRIARARLGVENTRDREYDVVPSRLRRPGAVVLEIDDEDPIF THLESAAPIRNYGWPRDLPRAVGQ" gene 1113511..1113583 /locus_tag="Rvnt15" /note="tRNA-Ala(CGC)" /db_xref="GeneID:2700430" tRNA 1113511..1113583 /locus_tag="Rvnt15" /product="tRNA-Ala" /note="codon recognized: GCG" /anticodon=(pos:1113544..1113546,aa:Ala) /db_xref="GeneID:2700430" gene 1114293..1114724 /locus_tag="Rv0997" /db_xref="GeneID:885386" CDS 1114293..1114724 /locus_tag="Rv0997" /function="UNKNOWN" /note="Rv0997, (MTCI237.11), len: 143 aa. Hypothetical unknown protein, equivalent to AAK45276.1 from Mycobacterium tuberculosis strain CDC1551 (87 aa) but longer 56 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215512.1" /db_xref="GI:15608137" /db_xref="UniProtKB/TrEMBL:O05580" /db_xref="GeneID:885386" /translation="MAGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECF VAEWHHAGVAADMTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTD IEHSVGAAEVQRHRGAVPLGSGGDAAGKVEGGRTPQPFVQP" gene 1114748..1115749 /locus_tag="Rv0998" /db_xref="GeneID:885385" CDS 1114748..1115749 /locus_tag="Rv0998" /function="UNKNOWN" /note="Rv0998, (MTCI237.12), len: 333 aa. Conserved hypothetical protein, possibly cyclic nucleotide-dependent protein kinase (EC 2.7.-.-), highly similar to NP_301261.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (353 aa); and AL035500|MLCL373.38|T45457 hypothetical protein from Mycobacterium leprae (143 aa), FASTA score: (61.5% identity in 143 aa overlap). Also similar to many hypothetical proteins and cyclic-NMP-dependent protein kinases (generally at C-terminus) e.g. N-terminus of SC9B10.09|T35878 hypothetical protein from Streptomyces coelicolor (1039 aa); P05987|KAPR_DICDI CAMP-DEPENDENT PROTEIN KINASE REGULATORY CHAIN (EC 2.7.1.37) from Dictyostelium discoideum (327 aa), FASTA scores: opt: 177, E(): 0.00036, (32.0% identity in 122 aa overlap); NP_104403.1|NC_002678 hypothetical protein (contains similarity to cAMP-dependent protein kinase regulatory subunit) from Mesorhizobium loti (151 aa); etc. Contains PS00889 Cyclic nucleotide-binding domain signature 2." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215513.1" /db_xref="GI:15608138" /db_xref="GOA:O05581" /db_xref="UniProtKB/TrEMBL:O05581" /db_xref="GeneID:885385" /translation="MDGIAELTGARVEDLAGMDVFQGCPAEGLVSLAASVQPLRAAAG QVLLRQGEPAVSFLLISSGSAEVSHVGDDGVAIIARALPGMIVGEIALLRDSPRSATV TTIEPLTGWTGGRGAFATMVHIPGVGERLLRTARQRLAAFVSPIPVRLADGTQLMLRP VLPGDRERTVHGHIQFSGETLYRRFMSARVPSPALMHYLSEVDYVDHFVWVVTDGSDP VADARFVRDETDPTVAEIAFTVADAYQGRGIGSFLIGALSVAARVDGVERFAARMLSD NVPMRTIMDRYGAVWQREDVGVITTMIDVPGPGELSLGREMVDQINRVARQVIEAVG" misc_feature 1115006..1115059 /locus_tag="Rv0998" /note="PS00889 Cyclic nucleotide-binding domain signature 2" gene 1115767..1116525 /locus_tag="Rv0999" /db_xref="GeneID:886043" CDS 1115767..1116525 /locus_tag="Rv0999" /function="UNKNOWN" /note="Rv0999, (MTCI237.13), len: 252 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215514.1" /db_xref="GI:15608139" /db_xref="UniProtKB/TrEMBL:O05582" /db_xref="GeneID:886043" /translation="MRPPLAPQFAADLLVKTVSTLRSSGAALGRLTTMRKAVLAVGSV CWLVGCSSGASSTTASTGDIAKVAEVKSGFGPEYTVTDVTPRAIDPGFFSARKLPDGL SFDPANCAQVAAGPQLPTGLQGNMAAVSAEGNGNRFVVIAVETSQPLPAPSPGKDCSK VTFSGTQLRGGIEVVDVPHIDGTQTLGVHRVLQAVVGGSARTGELYDYSARFGDYQVI VIANPLVIPGRPVARVDTQRARDLLVQAVAAVRG" gene complement(1116531..1117148) /locus_tag="Rv1000c" /db_xref="GeneID:886266" CDS complement(1116531..1117148) /locus_tag="Rv1000c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1000c, len: 205 aa. Conserved hypothetical protein, equivalent to ML0190|NP_301263.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (205 aa). Also highly similar to SC5F8.12c|CAB93740.1|AL357613 hypothetical protein from Streptomyces coelicolor (210 aa), FASTA scores: E(): 2.1e-45, (56.8% identity); 9106290|AAF84108.1|AE003963_5|NP_298588.1|NC_002488 protein described as DNA repair system specific for alkylated DNA from Xylella fastidiosa (200 aa), FASTA scores: E(): 3.4e-14, (38.55% identity); and similar in C-terminus to other hypothetical proteins. Note that replaces original Rv1000 predicted on other strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177777.1" /db_xref="GI:57116811" /db_xref="UniProtKB/TrEMBL:Q8VK98" /db_xref="GeneID:886266" /translation="MCDKLGGVAIAVQGALFEHNERRQLGDGAFIDIRSGWLTGGEEL LDALLSTVPWRAERRQMYDRVVDVPRLVSFHDLTIEDPPHPQLARMRRRLNDIYGGEL GEPFTTAGLCYYRDGSDSVAWHGDTIGRGSTEDTMVAIVSLGATRVFALRPRGRGPSL RLPLAHGDLLVMGGSCQRTFEHAVPKTSAPTGPRVSIQFRPRDVR" gene 1117185..1118393 /gene="arcA" /locus_tag="Rv1001" /db_xref="GeneID:888313" CDS 1117185..1118393 /gene="arcA" /locus_tag="Rv1001" /EC_number="3.5.3.6" /function="ARGININE DEGRADATION [CATALYTIC ACTIVITY:L-ARGININE + H(2)O = L-CITRULLINE + NH(3)]" /note="catalyzes the degradation of arginine to citruline and ammonia" /codon_start=1 /transl_table=11 /product="arginine deiminase" /protein_id="NP_215517.1" /db_xref="GI:15608141" /db_xref="GOA:P63551" /db_xref="UniProtKB/Swiss-Prot:P63551" /db_xref="GeneID:888313" /translation="MGVELGSNSEVGALRVVILHRPGAELRRLTPRNTDQLLFDGLPW VSRAQDEHDEFAELLASRGAEVLLLSDLLTEALHHSGAARMQGIAAAVDAPRLGLPLA QELSAYLRSLDPGRLAHVLTAGMTFNELPSDTRTDVSLVLRMHHGGDFVIEPLPNLVF TRDSSIWIGPRVVIPSLALRARVREASLTDLIYAHHPRFTGVRRAYESRTAPVEGGDV LLLAPGVVAVGVGERTTPAGAEALARSLFDDDLAHTVLAVPIAQQRAQMHLDTVCTMV DTDTMVMYANVVDTLEAFTIQRTPDGVTIGDAAPFAEAAAKAMGIDKLRVIHTGMDPV VAEREQWDDGNNTLALAPGVVVAYERNVQTNARLQDAGIEVLTIAGSELGTGRGGPRC MSCPAARDPL" gene complement(1118428..1119939) /locus_tag="Rv1002c" /db_xref="GeneID:887882" CDS complement(1118428..1119939) /locus_tag="Rv1002c" /function="UNKNOWN" /note="Rv1002c, (MTCI237.17c), len: 503 aa. Conserved membrane protein. Similar to AL132674|SCE87.05 hypothetical protein from Streptomyces coelicolor (591 aa), FASTA scores: opt: 666, E(): 0, (39.0% identity in 546 aa overlap); weakly similar and to TSCC_PSEAM|P55019 thiazide-sensitive sodium-chloride cotransporter from Pseudopleuronectes americanus (1023 aa), FASTA scores: opt: 44, E(): 4.2e-06, (22.4% identity in 326 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215518.1" /db_xref="GI:15608142" /db_xref="GOA:O05586" /db_xref="UniProtKB/Swiss-Prot:O05586" /db_xref="GeneID:887882" /translation="MVPVVSPGPLVPVADFGPLDRLRGWIVTGLITLLATVTRFLNLG SLTDAGTPIFDEKHYAPQAWQVLNNHGVEDNPGYGLVVHPPVGKQLIAIGEAIFGYNG FGWRFTGALLGVVLVALVVRIVRRISRSTLVGAIAGVLLICDGVSFVTARTALLDGFL TFFVVAAFGALIVDRDQVRERMHIALLAGRSAATVWGPRVGVRWWRFGAGVLLGLACA TKWSGVYFVLFFGAMALAFDVAARRQYQVQRPWLGTVRRDVLPSGYALGLIPFAVYLA TYAPWFASETAIDRHAVGQAVGRNSVVPLPDAVRSLWHYTAKAFHFHAGLTNSAGNYH PWESKPWTWPMSLRPVLYAIDQQDVAGCGAQSCVKAEMLVGTPAMWWLAVPVLAYAGW RMFVRRDWRYAVVLVGYCAGWLPWFADIDRQMYFFYAATMAPFLVMGISLVLGDILYH PGQGSERRTLGLIVVCCYVALVVTNFAWLYPVLTGLPISQQTWNLEIWLPSWR" misc_feature complement(1119289..1119321) /locus_tag="Rv1002c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1120022..1120879 /locus_tag="Rv1003" /db_xref="GeneID:887935" CDS 1120022..1120879 /locus_tag="Rv1003" /function="UNKNOWN" /note="Rv1003, (MTCI237.19), len: 285 aa. Conserved hypothetical protein, similar to others e.g. AL132674|SCE87.04 Streptomyces coelicolor (286 aa), FASTA scores: opt: 877, E(): 0, (53.2% identity in 280 aa overlap); and YRAL_ECOLI|P45528 hypothetical 31.3 kd protein (286 aa), FASTA scores: opt: 561, E(): 4.4e-27, (36.9% identity in 279 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215519.1" /db_xref="GI:15608143" /db_xref="GOA:O05588" /db_xref="UniProtKB/Swiss-Prot:O05588" /db_xref="GeneID:887935" /translation="MSSGRLLLGATPLGQPSDASPRLAAALATADVVAAEDTRRVRKL AKALDIRIGGRVVSLFDRVEALRVTALLDAINNGATVLVVSDAGTPVISDPGYRLVAA CIDAGVSVTCLPGPSAVTTALVMSGLPAEKFCFEGFAPRKGAARRAWLAELAEERRTC VFFESPRRLAACLNDAVEQLGGARPAAICRELTKVHEEVVRGSLDELAIWAAGGVLGE ITVVVAGAAPHAELSSLIAQVEEFVAAGIRVKDACSEVAAAHPGVRTRQLYDAVLQSR RETGGPAQP" gene complement(1120889..1122148) /locus_tag="Rv1004c" /db_xref="GeneID:886039" CDS complement(1120889..1122148) /locus_tag="Rv1004c" /function="UNKNOWN" /note="Rv1004c, (MTCI237.20c), len: 419 aa. Probable membrane protein. Contains repetitive sequences, which have similarities with elastin, and possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215520.1" /db_xref="GI:15608144" /db_xref="UniProtKB/TrEMBL:O05589" /db_xref="GeneID:886039" /translation="MSISCRVREGFVMRLAIVGTAAAAAIGGTLAVAPLTLSTPERVA GGTCSAGQQCDRLAAVLMPDTATPSGPAAAEHAVPAPFEPVADTIAPGLVPRPGVPAA AAVPRVGPPAVPGLPNIPGAAGPALPPPPALPNLAAPSVPGVGIPGIGIPGIGIPGIG IPGVPDPITGVNTAAAVVNGVLGVGGTAAGVVTASAVAVTYLVLAVNALESSGILPTA RGTASTVASLLLPGAQSAAAALPAVGLPALPGVTPASLLAMAAAAGLPGVGFPSLPGV SPTDLMAMAAAAGLPTSLPGLAGMSPAELTALVAGGLPMLAAAGLPAGLAGVDPATLA AALPALAAGGLPPGLPALPGVDPAALAAALPALAAGLPALPAGLPPLPAVPALPAPPP LPGPPPLPALPSRLCTPGFGPIGVCIP" gene complement(1122222..1123598) /gene="pabB" /locus_tag="Rv1005c" /db_xref="GeneID:888205" CDS complement(1122222..1123598) /gene="pabB" /locus_tag="Rv1005c" /function="CATALYZES THE BIOSYNTHESIS OF 4-AMINO-4-DEOXYCHORISMATE (ADC) FROM CHORISMATE AND GLUTAMINE" /note="catalyzes the formation of 4-amino-4-deoxychorismate from chorismate and glutamine" /codon_start=1 /transl_table=11 /product="aminodeoxychorismate synthase component I" /protein_id="NP_215521.1" /db_xref="GI:15608145" /db_xref="GOA:O05591" /db_xref="UniProtKB/TrEMBL:O05591" /db_xref="GeneID:888205" /translation="MNLAWELSTRTKSPRSHLRCENPQFCQARTVRIDRLGDLGGAPA VLRAVGRATSRLDLPPPAALTGEWFGALAVIAPSVSIQPVSGDDVFSGPPGTGGPDAT GAVGGGWVGYLSYPDAGADGRPHRIPEAAGGWTDCVLRRDRDGQWWYESLSGAPIADW LASALATTRASVARPAPACRIDWEPADRAAHRDGVLACLEAIGAGEVYQACVCTQFAG TVTGSPLDFFIDGFGRTAPSRSAFVAGPWGAVASLSPELFLRRRGSVVTSSPIKGTLP LDAPPSALRASAKEVAENIMIVDLVRNDLGRVAVTGTVTVPELLVVRPAPGVWHLVST VSARVPLEEPMSALLDAAFPPASVTGTPKLRARQLISQWERYRRGIYCGTVGLASPVA GCELNVAIRTVEFDTAGNAVLGVGGGITADSDPDAEWAECLHKAAPIVGLPAATRTTP ARLASKVR" gene 1123714..1125417 /locus_tag="Rv1006" /db_xref="GeneID:888234" CDS 1123714..1125417 /locus_tag="Rv1006" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1006, (MTCI237.23), len: 567 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215522.1" /db_xref="GI:15608146" /db_xref="UniProtKB/TrEMBL:O05592" /db_xref="GeneID:888234" /translation="MVLRSRKSTLGVVVCLALVLGGPLNGCSSSASHRGPLNAMGSPA IPSTAQEIPNPLRGQYEDLMEPLFPQGNPAQQRYPPWPASYDASLRVSWRQLQPTDPR TLPPDAPDDRKYDFSVIDNALTRLADRGMRLTLRVYAYSSCCKASYPDGTNIAIPDWE RAIASTNTSYPGPATDPSTGVVQVVPNFNDSTYLNDFAQLLAALGRRYDGDERLSVFE FSGYGDFSENHVAYLRDTLGAPGPGPDESVATLGYYSQFRDQNITTASIKQLIAANVS AFPHTQLVTSPANPEIVRELFADEVTNKLAAPVGVRSDCLGVDAPLPAWAESSTSHYV QTKDPVVAALRQRLATAPVITEWCELPTGSSPRAYYEKGLRDVIRYHVSMTSSVNFPD QTATSPMDPALYLVWAQANAAAGYRYSVEAQPGSQALAGKVATISVTWTNYGAAAATE KWVPGYRLVDSTGQVVRTLPAAVDLKTLVSDQRGDRSSDQPTPASVAETVRVDLSGLP AGHYTLRAAIDWQQHKPNGSHVVNYPPMLLSRDGRDDSGFYPVATLDIPRDAQTAVNA S" gene complement(1125444..1127003) /gene="metG" /locus_tag="Rv1007c" /db_xref="GeneID:886050" CDS complement(1125444..1127003) /gene="metG" /locus_tag="Rv1007c" /EC_number="6.1.1.10" /function="IT IS PROBABLY ESSENTIAL FOR CELL SURVIVAL, BEING REQUIRED NOT ONLY FOR ELONGATION OF PROTEIN SYNTHESIS BUT ALSO FOR THE INITIATION OF ALL MRNA TRANSLATION THROUGH INITIATOR TRNA(FMET) AMINOACYLATION [CATALYTIC ACTIVITY: ATP + L-METHIONINE + TRNA(MET) = AMP + DIPHOSPHATE + L-METHIONYL-TRNA(MET)]" /note="methionine--tRNA ligase; MetRS; adds methionine to tRNA(Met) with cleavage of ATP to AMP and diphosphate; some MetRS enzymes form dimers depending on a C-terminal domain that is also found in other proteins such as Trbp111 in Aquifex aeolicus and the cold-shock protein CsaA from Bacillus subtilis while others do not; four subfamilies exist based on sequence motifs and zinc content" /codon_start=1 /transl_table=11 /product="methionyl-tRNA synthetase" /protein_id="NP_215523.1" /db_xref="GI:15608147" /db_xref="GOA:O05593" /db_xref="UniProtKB/Swiss-Prot:O05593" /db_xref="GeneID:886050" /translation="MKPYYVTTAIAYPNAAPHVGHAYEYIATDAIARFKRLDRYDVRF LTGTDEHGLKVAQAAAAAGVPTAALARRNSDVFQRMQEALNISFDRFIRTTDADHHEA SKELWRRMSAAGDIYLDNYSGWYSVRDERFFVESETQLVDGTRLTVETGTPVTWTEEQ TYFFRLSAYTDKLLAHYHANPDFIAPETRRNEVISFVSGGLDDLSISRTSFDWGVQVP EHPDHVMYVWVDALTNYLTGAGFPDTDSELFRRYWPADLHMIGKDIIRFHAVYWPAFL MSAGIELPRRIFAHGFLHNRGEKMSKSVGNIVDPVALAEALGVDQVRYFLLREVPFGQ DGSYSDEAIVTRINTDLANELGNLAQRSLSMVAKNLDGRVPNPGEFADADAALLATAD GLLERVRGHFDAQAMHLALEAIWLMLGDANKYFSVQQPWVLRKSESEADQARFRTTLY VTCEVVRIAALLIQPVMPESAGKILDLLGQAPNQRSFAAVGVRLTPGTALPPPTGVFP RYQPPQPPEGK" misc_feature complement(1126938..1126967) /gene="metG" /locus_tag="Rv1007c" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene 1127089..1127883 /gene="tatD" /locus_tag="Rv1008" /db_xref="GeneID:886047" CDS 1127089..1127883 /gene="tatD" /locus_tag="Rv1008" /EC_number="3.1.21.-" /function="DNase INVOLVED IN PROTEINS EXPORT. THIS SEC-INDEPENDENT PATHWAY IS TERMED TAT FOR TWIN-ARGININE TRANSLOCATION SYSTEM. THIS SYSTEM MAINLY TRANSPORTS PROTEINS WITH BOJ Biol Chem 2000;275:16717-16722UND COFACTORS THAT REQUIRE FOLDING PRIOR TO EXPORT (BY SIMILARITY)." /inference="non-experimental evidence, no additional details recorded" /note="Rv1008, (MTCI237.25), len: 264 aa. Probable tatD (alternate gene name: yjjV), deoxyribonuclease (EC 3.1.21.-), component of twin arginine translocation protein export system (see citations below). Similar to many members of the YBL055C/YJJV family e.g. YCFH_ECOLI|P37346 Putative deoxyribonuclease ycfH (EC 3.1.21.-) (265 aa), fasta scores: opt: 487, E(): 1.4e-24, (36.7% identity in 270 aa overlap). Also similar to P37545|YABD_BACSU Putative deoxyribonuclease yabD (255 aa), FASTA scores: opt: 599, E(): 7.7e-33, (40.1% identity in 262 aa overlap). Contains PS01137 Hypothetical YBL055c/yjjV family signature 1, and PS01091 Hypothetical YBL055c/yjjV family signature 3.; yjjV" /codon_start=1 /transl_table=11 /product="deoxyribonuclease TatD (YjjV protein)" /protein_id="NP_215524.1" /db_xref="GI:15608148" /db_xref="UniProtKB/TrEMBL:O08343" /db_xref="GeneID:886047" /translation="MVDAHTHLDACGARDADTVRSLVERAAAAGVTAVVTVADDLESA RWVTRAAEWDRRVYAAVALHPTRADALTDAARAELERLVAHPRVVAVGETGIDMYWPG RLDGCAEPHVQREAFAWHIDLAKRTGKPLMIHNRQADRDVLDVLRAEGAPDTVILHCF SSDAAMARTCVDAGWLLSLSGTVSFRTARELREAVPLMPVEQLLVETDAPYLTPHPHR GLANEPYCLPYTVRALAELVNRRPEEVALITTSNARRAYGLGWMRQ" misc_feature 1127089..1127115 /gene="tatD" /locus_tag="Rv1008" /note="PS01137 Hypothetical YBL055c/yjjV family signature 1" misc_feature 1127668..1127718 /gene="tatD" /locus_tag="Rv1008" /note="PS01091 Hypothetical YBL055c/yjjV family signature 3" gene 1128091..1129179 /gene="rpfB" /locus_tag="Rv1009" /db_xref="GeneID:886048" CDS 1128091..1129179 /gene="rpfB" /locus_tag="Rv1009" /function="THOUGHT TO PROMOTE THE RESUSCITATION AND GROWTH OF DORMANT, NONGROWING CELL. COULD ALSO STIMULATES THE GROWTH OF SEVERAL OTHER HIGH G+C GRAM+ ORGANISMS, e.g. Mycobacterium avium, Mycobacterium bovis (BCG), Mycobacterium kansasii, Mycobacterium smegmatis." /note="Rv1009, (MTCI237.26), len: 362 aa. Probable rpfB, resuscitation-promoting factor (see citation below), similar to others from Mycobacterium tuberculosis: Rv2450c|MTV008.06c|RPFE PROBABLE RESUSCITATION-PROMOTING FACTOR (172 aa), FASTA scores: E(): 1.9e-19, (42.9% identity in 147 aa overlap); Rv0867c|RPFA, Rv1884c|RPFC, and Rv2389c|RPFD. Possible lipoprotein; contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="resuscitation-promoting factor rpfB" /protein_id="NP_215525.1" /db_xref="GI:15608149" /db_xref="UniProtKB/TrEMBL:O05594" /db_xref="GeneID:886048" /translation="MLRLVVGALLLVLAFAGGYAVAACKTVTLTVDGTAMRVTTMKSR VIDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRRSRPLQISLDGHDAKQVWTTAST VDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNVA GLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEM NMSREVVEDPGVPGTQDVTFAVAEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEV PPVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANGGLRYAPRADLATRE EQIAVAEVTRLRQGWGAWPVCAARAGAR" misc_feature 1128130..1128162 /gene="rpfB" /locus_tag="Rv1009" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1129152..1130105 /gene="ksgA" /locus_tag="Rv1010" /db_xref="GeneID:888792" CDS 1129152..1130105 /gene="ksgA" /locus_tag="Rv1010" /EC_number="2.1.1.-" /function="SPECIFICALLY DIMETHYLATES TWO ADJACENT ADENOSINES IN THE LOOP OF A CONSERVED HAIRPIN NEAR THE 3'-END OF 16S RRNA IN THE 30S PARTICLE. ITS INACTIVATION LEADS TO KASUGAMYCIN RESISTANCE" /note="catalyzes the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases A1518 and A1519 in 16S rRNA; mutations in ksgA causes resistance to the translation initiation inhibitor kasugamycin" /codon_start=1 /transl_table=11 /product="dimethyladenosine transferase" /protein_id="NP_215526.1" /db_xref="GI:15608150" /db_xref="GOA:P66660" /db_xref="UniProtKB/Swiss-Prot:P66660" /db_xref="GeneID:888792" /translation="MCCTSGCALTIRLLGRTEIRRLAKELDFRPRKSLGQNFVHDANT VRRVVAASGVSRSDLVLEVGPGLGSLTLALLDRGATVTAVEIDPLLASRLQQTVAEHS HSEVHRLTVVNRDVLALRREDLAAAPTAVVANLPYNVAVPALLHLLVEFPSIRVVTVM VQAEVAERLAAEPGSKEYGVPSVKLRFFGRVRRCGMVSPTVFWPIPRVYSGLVRIDRY ETSPWPTDDAFRRRVFELVDIAFAQRRKTSRNAFVQWAGSGSESANRLLAASIDPARR GETLSIDDFVRLLRRSGGSDEATSTGRDARAPDISGHASAS" misc_feature 1129329..1129412 /gene="ksgA" /locus_tag="Rv1010" /note="PS01131 Ribosomal RNA adenine dimethylases signature" gene 1130191..1131111 /gene="ispE" /locus_tag="Rv1011" /db_xref="GeneID:886034" CDS 1130191..1131111 /gene="ispE" /locus_tag="Rv1011" /EC_number="2.7.1.148" /function="THOUGHT TO BE INVOLVED IN DEOXYXYLULOSE-5-PHOSPHATE PATHWAY (DXP) OF ISOPRENOID BIOSYNTHESIS (AT THE FOURTH STEP). CATALYZES THE PHOSPHORYLATION OF THE POSITION 2 HYDROXY GROUP OF 4-DIPHOSPHOCYTIDYL-2C-METHYL-D-ERYTHRITOL [CATALYTIC ACTIVITY: ATP + 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol = ADP + 2-phospho-4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol]." /note="catalyzes the phosphorylation of 4-diphosphocytidyl-2-C-methyl-D-erythritol in the nonmevalonate pathway of isoprenoid biosynthesis" /codon_start=1 /transl_table=11 /product="4-diphosphocytidyl-2-C-methyl-D-erythritol kinase" /protein_id="NP_215527.1" /db_xref="GI:15608151" /db_xref="GOA:P65178" /db_xref="UniProtKB/Swiss-Prot:P65178" /db_xref="GeneID:886034" /translation="MPTGSVTVRVPGKVNLYLAVGDRREDGYHELTTVFHAVSLVDEV TVRNADVLSLELVGEGADQLPTDERNLAWQAAELMAEHVGRAPDVSIMIDKSIPVAGG MAGGSADAAAVLVAMNSLWELNVPRRDLRMLAARLGSDVPFALHGGTALGTGRGEELA TVLSRNTFHWVLAFADSGLLTSAVYNELDRLREVGDPPRLGEPGPVLAALAAGDPDQL APLLGNEMQAAAVSLDPALARALRAGVEAGALAGIVSGSGPTCAFLCTSASSAIDVGA QLSGAGVCRTVRVATGPVPGARVVSAPTEV" gene 1131128..1131421 /locus_tag="Rv1012" /db_xref="GeneID:886045" CDS 1131128..1131421 /locus_tag="Rv1012" /function="UNKNOWN" /note="Rv1012, (MTCI237.29), len: 97 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215528.1" /db_xref="GI:15608152" /db_xref="UniProtKB/TrEMBL:O05597" /db_xref="GeneID:886045" /translation="MPRAARGIRACRGRWVDRLAHQHASGRAAGIRPREVGGAHQSQA QKPYHDATEPLGESLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVTKL" gene 1131625..1133259 /gene="pks16" /locus_tag="Rv1013" /db_xref="GeneID:886035" CDS 1131625..1133259 /gene="pks16" /locus_tag="Rv1013" /EC_number="2.3.1.86" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--CoA ligase" /protein_id="NP_215529.1" /db_xref="GI:15608153" /db_xref="GOA:O05598" /db_xref="UniProtKB/TrEMBL:O05598" /db_xref="GeneID:886035" /translation="MSRFTEKMFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAG GLAAAGVGLGDVVGVLAGFPVEIAPTAQALWMRGASLTMLHQPTPRTDLAVWAEDTMT VIGMIEAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVGEDDLALMQLT SGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSWLPCFHDMGMVGFLTIPMF FGAELVKVTPMDFLRDTLLWAKLIDKYQGTMTAAPNFAYALLAKRLRRQAKPGDFDLS TLRFALSGAEPVEPADVEDLLDAGKPFGLRPSAILPAYGMAETTLAVSFSECNAGLVV DEVDADLLAALRRAVPATKGNTRRLATLGPLLQDLEARIIDEQGDVMPARGVGVIELR GESLTPGYLTMGGFIPAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPT DIERAAGRVDGVRPGCAVAVRLDAGHSRESFAVAVESNAFEDPAEVRRIEHQVAHEVV AEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT" misc_feature 1132093..1132128 /gene="pks16" /locus_tag="Rv1013" /note="PS00455 Putative AMP-binding domain signature" gene complement(1133333..1133908) /gene="pth" /locus_tag="Rv1014c" /db_xref="GeneID:886037" CDS complement(1133333..1133908) /gene="pth" /locus_tag="Rv1014c" /EC_number="3.1.1.29" /function="THE NATURAL SUBSTRATE FOR THIS ENZYME MAY BE PEPTIDYL-TRNAS WHICH DROP OFF THE RIBOSOME DURING PROTEIN SYNTHESIS [CATALYTIC ACTIVITY : N-SUBSTITUTED AMINOACYL-TRNA + H(2)O = N-SUBSTITUTED AMINO ACID + TRNA]" /note="Enables the recycling of peptidyl-tRNAs produced at termination of translation" /codon_start=1 /transl_table=11 /product="peptidyl-tRNA hydrolase" /protein_id="NP_215530.1" /db_xref="GI:15608154" /db_xref="GOA:P65865" /db_xref="UniProtKB/Swiss-Prot:P65865" /db_xref="GeneID:886037" /translation="MAEPLLVVGLGNPGANYARTRHNLGFVVADLLAARLGAKFKAHK RSGAEVATGRSAGRSLVLAKPRCYMNESGRQIGPLAKFYSVAPANIIVIHDDLDLEFG RIRLKIGGGEGGHNGLRSVVAALGTKDFQRVRIGIGRPPGRKDPAAFVLENFTPAERA EVPTICEQAADATELLIEQGMEPAQNRVHAW" gene complement(1133921..1134568) /gene="rplY" /locus_tag="Rv1015c" /db_xref="GeneID:885992" CDS complement(1133921..1134568) /gene="rplY" /locus_tag="Rv1015c" /function="BINDS TO THE 50S RRNA" /note="the Ctc family of proteins consists of two types, one that contains the N-terminal ribosomal protein L25 domain only which in Escherichia coli binds the 5S rRNA while a subset of proteins contain a C-terminal extension that is involved in the stress response" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L25/general stress protein Ctc" /protein_id="NP_215531.1" /db_xref="GI:15608155" /db_xref="GOA:P66121" /db_xref="UniProtKB/Swiss-Prot:P66121" /db_xref="GeneID:885992" /translation="MAKSASNQLRVTVRTETGKGASRRARRAGKIPAVLYGHGAEPQH LELPGHDYAAVLRHSGTNAVLTLDIAGKEQLALTKALHIHPIRRTIQHADLLVVRRGE KVVVEVSVVVEGQAGPDTLVTQETNSIEIEAEALSIPEQLTVSIEGAEPGTQLTAGQI ALPAGVSLISDPDLLVVNVVKAPTAEELEGEVAGAEEAEEAAVEAGEAEAAGESE" gene complement(1134785..1135465) /gene="lpqT" /locus_tag="Rv1016c" /db_xref="GeneID:886066" CDS complement(1134785..1135465) /gene="lpqT" /locus_tag="Rv1016c" /function="UNKNOWN" /note="Rv1016c, (MTCY10G2.33), len: 226 aa. Probable lpqT, conserved lipoprotein. Similar to several Mycobacterium tuberculosis hypothetical proteins e.g. Rv0040c|Y0H3_MYCTU|P71697 Proline rich 28 kDA antigen (310 aa), FASTA scores: opt: 329, E(): 2e-17, (32.3% identity in 229 aa overlap); Rv0583c. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqT" /protein_id="NP_215532.1" /db_xref="GI:15608156" /db_xref="GOA:P96384" /db_xref="UniProtKB/Swiss-Prot:P96384" /db_xref="GeneID:886066" /translation="MAGRRCPQDSVRPLAVAVAVATLAMSAVACGPKSPDFQSILSTS PTTSAVSTTTEVPVPLWKYLESVGVTGEPVAPSSLTDLTVSIPTPPGWAPMKNPNITP NTEMIAKGESYPTAMLMVFKLHRDFDIAEALKHGTADARLSTNFTELDSSTADFNGFP SSMIQGSYDLHGRRLHTWNRIVFPTGAPPAKQRYLVQLTITSLANEAVKHASDIEAII AGFVVAAK" misc_feature complement(1135376..1135408) /gene="lpqT" /locus_tag="Rv1016c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(1135501..1136481) /gene="prsA" /locus_tag="Rv1017c" /db_xref="GeneID:885993" CDS complement(1135501..1136481) /gene="prsA" /locus_tag="Rv1017c" /EC_number="2.7.6.1" /function="Catalyzes the formation of PRPP from ATP and ribose 5-phosphate. PRPP is then used in various biosynthetic pathways, as for example in the formation of purines, pyrimidines, histidine and tryptophan. [CATALYTIC ACTIVITY: ATP + D-ribose 5-phosphate = AMP + 5-phospho-alpha-D-ribose 1-diphosphate]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 5-phospho-alpha-D-ribose 1-phosphate from D-ribose 5-phosphate and ATP" /codon_start=1 /transl_table=11 /product="ribose-phosphate pyrophosphokinase" /protein_id="NP_215533.1" /db_xref="GI:15608157" /db_xref="GOA:P65232" /db_xref="UniProtKB/Swiss-Prot:P65232" /db_xref="GeneID:885993" /translation="MSHDWTDNRKNLMLFAGRAHPELAEQVAKELDVHVTSQDAREFA NGEIFVRFHESVRGCDAFVLQSCPAPVNRWLMEQLIMIDALKRGSAKRITAVMPFYPY ARQDKKHRGREPISARLIADLLKTAGADRIVTVDLHTDQIQGFFDGPVDHMRGQNLLT GYIRDNYPDGNMVVVSPDSGRVRIAEKWADALGGVPLAFIHKTRDPRVPNQVVSNRVV GDVAGRTCVLIDDMIDTGGTIAGAVALLHNDGAGDVIIAATHGVLSDPAAQRLASCGA REVIVTNTLPIGEDKRFPQLTVLSIAPLLASTIRAVFENGSVTGLFDGDA" misc_feature complement(1135762..1135788) /gene="prsA" /locus_tag="Rv1017c" /note="PS00144 Asparaginase / glutaminase active site signature 1" misc_feature complement(1135768..1135806) /gene="prsA" /locus_tag="Rv1017c" /note="PS00103 Purine/pyrimidine phosphoribosyl transferases signature" gene complement(1136573..1138060) /gene="glmU" /locus_tag="Rv1018c" /db_xref="GeneID:886069" CDS complement(1136573..1138060) /gene="glmU" /locus_tag="Rv1018c" /EC_number="2.7.7.23" /function="PEPTIDOGLYCAN AND LIPOPOLYSACCHARIDE BIOSYNTHESIS" /note="forms a homotrimer; catalyzes the acetylation of glucosamine-1-phosphate and uridylation of N-acetylglucosamine-1-phosphate to produce UDP-GlcNAc; function in cell wall synthesis" /codon_start=1 /transl_table=11 /product="bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase" /protein_id="NP_215534.1" /db_xref="GI:15608158" /db_xref="GOA:P96382" /db_xref="UniProtKB/TrEMBL:P96382" /db_xref="GeneID:886069" /translation="MTFPGDTAVLVLAAGPGTRMRSDTPKVLHTLAGRSMLSHVLHAI AKLAPQRLIVVLGHDHQRIAPLVGELADTLGRTIDVALQDRPLGTGHAVLCGLSALPD DYAGNVVVTSGDTPLLDADTLADLIATHRAVSAAVTVLTTTLDDPFGYGRILRTQDHE VMAIVEQTDATPSQREIREVNAGVYAFDIAALRSALSRLSSNNAQQELYLTDVIAILR SDGQTVHASHVDDSALVAGVNNRVQLAELASELNRRVVAAHQLAGVTVVDPATTWIDV DVTIGRDTVIHPGTQLLGRTQIGGRCVVGPDTTLTDVAVGDGASVVRTHGSSSSIGDG AAVGPFTYLRPGTALGADGKLGAFVEVKNSTIGTGTKVPHLTYVGDADIGEYSNIGAS SVFVNYDGTSKRRTTVGSHVRTGSDTMFVAPVTIGDGAYTGAGTVVREDVPPGALAVS AGPQRNIENWVQRKRPGSPAAQASKRASEMACQQPTQPPDADQTP" gene complement(1138076..1138147) /locus_tag="Rvnt16" /note="tRNA-Gln(TTG)" /db_xref="GeneID:2700462" tRNA complement(1138076..1138147) /locus_tag="Rvnt16" /product="tRNA-Gln" /note="codon recognized: CAA" /anticodon=(pos:1138112..1138114,aa:Gln) /db_xref="GeneID:2700462" gene 1138315..1138908 /locus_tag="Rv1019" /db_xref="GeneID:886056" CDS 1138315..1138908 /locus_tag="Rv1019" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1019, (MTCY10G2.30c), len: 197 aa. Probable transcriptional regulator, similar to many memebers of the tetR family e.g. MTCY7D11.18c (34.4% identity in 189 aa overlap). Helix turn helix motif from aa 27-48 (+5.42 SD)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_215535.1" /db_xref="GI:15608159" /db_xref="GOA:P96381" /db_xref="UniProtKB/TrEMBL:P96381" /db_xref="GeneID:886056" /translation="MTGTERRHQLIGIARSLFAERGYDGTSIEEIAQRANVSKPVVYE HFGGKEGLYAVVVDREMSALLDGITSSLTNNRSRVRVERVALALLTYVEERTDGFRIM IRDSPASISSGTYSSLLNDAVSQVSSILAGDFARRGLDPDLAPLYAQALVGSVSMTAQ WWLDAREPKKEVVAAHLVNLVWNGLTHLEADPRLQDE" gene 1138967..1142671 /gene="mfd" /locus_tag="Rv1020" /db_xref="GeneID:886077" CDS 1138967..1142671 /gene="mfd" /locus_tag="Rv1020" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. NECESSARY FOR STRAND-SPECIFIC REPAIR. A LESION IN THE TEMPLATE STRAND BLOCKS THE RNA POLYMERASE COMPLEX (RNAP). THE RNAP-DNA-RNA COMPLEX IS SPECIFICALLY RECOGNIZED BY TRCF WHICH RELEASES RNAP AND THE TRUNCATED TRANSCRIPT; THE TCRF MAY REPLACE RNAP AT THE LESION SITE AND THEN RECRUIT THE UVRA/B/C REPAIR SYSTEM." /note="Rv1020, (MTCY10G2.29c), len: 1234 aa. Probable mfd (alternate gene name: trcF), transcription-repair coupling factor (see citation below), similar to many e.g. MFD_ECOLI|P30958 transcription-repair coupling factor from Escherichia coli (1148 aa), FASTA scores: opt: 1900, E(): 0, (37.9% identity in 1107 aa overlap); similar to M. tuberculosis Rv2973c and Rv1633. Contains PS00017 ATP/GTP-binding site motif A (P-loop). IN THE N-TERMINAL SECTION; BELONGS TO THE UVRB FAMILY. IN THE C-TERMINAL SECTION; BELONGS TO THE HELICASE FAMILY. RECG SUBFAMILY.; trcF" /codon_start=1 /transl_table=11 /product="transcription-repair coupling factor Mfd (TRCF)" /protein_id="NP_215536.1" /db_xref="GI:15608160" /db_xref="GOA:P64326" /db_xref="UniProtKB/Swiss-Prot:P64326" /db_xref="GeneID:886077" /translation="MTAPGPACSDTPIAGLVELALSAPTFQQLMQRAGGRPDELTLIA PASARLLVASALARQGPLLVVTATGREADDLAAELRGVFGDAVALLPSWETLPHERLS PGVDTVGTRLMALRRLAHPDDAQLGPPLGVVVTSVRSLLQPMTPQLGMMEPLTLTVGD ESPFDGVVARLVELAYTRVDMVGRRGEFAVRGGILDIFAPTAEHPVRVEFWGDEITEM RMFSVADQRSIPEIDIHTLVAFACRELLLSEDVRARAAQLAARHPAAESTVTGSASDM LAKLAEGIAVDGMEAVLPVLWSDGHALLTDQLPDGTPVLVCDPEKVRTRAADLIRTGR EFLEASWSVAALGTAENQAPVDVEQLGGSGFVELDQVRAAAARTGHPWWTLSQLSDES AIELDVRAAPSARGHQRDIDEIFAMLRAHIATGGYAALVAPGTGTAHRVVERLSESDT PAGMLDPGQAPKPGVVGVLQGPLRDGVIIPGANLVVITETDLTGSRVSAAEGKRLAAK RRNIVDPLALTAGDLVVHDQHGIGRFVEMVERTVGGARREYLVLEYASAKRGGGAKNT DKLYVPMDSLDQLSRYVGGQAPALSRLGGSDWANTKTKARRAVREIAGELVSLYAKRQ ASPGHAFSPDTPWQAELEDAFGFTETVDQLTAIEEVKADMEKPIPMDRVICGDVGYGK TEIAVRAAFKAVQDGKQVAVLVPTTLLADQHLQTFGERMSGFPVTIKGLSRFTDAAES RAVIDGLADGSVDIVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVD VLTMSATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQIAAALRRELLR DGQAFYVHNRVSSIDAAAARVRELVPEARVVVAHGQMPEDLLETTVQRFWNREHDILV CTTIVETGLDISNANTLIVERADTFGLSQLHQLRGRVGRSRERGYAYFLYPPQVPLTE TAYDRLATIAQNNELGAGMAVALKDLEIRGAGNVLGIEQSGHVAGVGFDLYVRLVGEA LETYRDAYRAAADGQTVRTAEEPKDVRIDLPVDAHLPPDYIASDRLRLEGYRRLAAAS SDREVAAVVDELTDRYGALPEPARRLAAVARLRLLCRGSGITDVTAASAATVRLSPLT LPDSAQVRLKRMYPGAHYRATTATVQVPIPRAGGLGAPRIRDVELVQMVADLITALAG KPRQHIGITNPSPPGEDGRGRNTTIKERQP" misc_feature 1140992..1141015 /gene="mfd" /locus_tag="Rv1020" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1142671..1143648 /locus_tag="Rv1021" /db_xref="GeneID:886052" CDS 1142671..1143648 /locus_tag="Rv1021" /EC_number="3.6.1.19" /function="UNKNOWN" /note="functions in degradation of stringent response intracellular messenger ppGpp; in Escherichia coli this gene is co-transcribed with the toxin/antitoxin genes mazEF; activity of MazG is inhibited by MazEF in vitro; ppGpp inhibits mazEF expression; MazG thus works in limiting the toxic activity of the MazF toxin induced during starvation; MazG also interacts with the GTPase protein Era" /codon_start=1 /transl_table=11 /product="nucleoside triphosphate pyrophosphohydrolase" /protein_id="NP_215537.1" /db_xref="GI:15608161" /db_xref="UniProtKB/TrEMBL:P96379" /db_xref="GeneID:886052" /translation="MIVVLVDPRRPTLVPVEAIEFLRGEVQYTEEMPVAVPWSLPAAR SAHAGNDAPVLLSSDPNHPAVITRLAAGARLISAPDSQRGERLVDAVAMMDKLRTAGP WESEQTHDSLRRYLLEETYELLDAVRSGSVDQLREELGDLLLQVLFHARIAEDASQSP FTIDDVADTLMRKLGNRAPGVLAGESISLEDQLAQWEAAKASEKARKSVADDVHTGQP ALALAQKVIQRAQKAGLPAHLIPDEITSVSVSADVDAENTLRTAVLDFIDRLRCAERA IAVARRGSNVAEQLDVTPLGVITEQEWLAHWPTAVNDSRGGSKKRKGMR" gene 1143736..1144467 /gene="lpqU" /locus_tag="Rv1022" /db_xref="GeneID:886076" CDS 1143736..1144467 /gene="lpqU" /locus_tag="Rv1022" /function="UNKNOWN" /note="Rv1022, (MTCY10G2.27c), len: 243 aa. Probable lpqU conserved lipoprotein. Similar to Mycobacterium tuberculosis hypothetical protein Rv1230c|MTV006.02C, FASTA scores: E(): 2.8e-18, (37.9% identity in 240 aa overlap). Similar to AL133423|SC4A7.37 hypothetical protein from Streptomyces coelicolor (421 aa), FASTA scores: opt: 474, E(): 2.7e-21, (42.2% identity in 211 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqU" /protein_id="NP_215538.1" /db_xref="GI:15608162" /db_xref="UniProtKB/TrEMBL:P96378" /db_xref="GeneID:886076" /translation="MSPRRWLRAVAVIGATAMLLASSCTWQLSLFITDGVPPPPGDPV PPVDTHAGGRPADQLREWAEKRAAALGIPVIALEAYAYAARVAEVENPKCHLAWTTLA GIGRVESHHGTYRGATIAPNGDVSPPIRGVRLDGTGGTLRIVDRDGGGLDGDAAVERA MGPMQFISETWRLYGVAARNDGIANVDNIDDAALSAAGYLCWRGKDLATPRGWITALR AYNNSVIYARAVRDWATAYAAGHPL" misc_feature 1143775..1143807 /gene="lpqU" /locus_tag="Rv1022" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1144564..1145853 /gene="eno" /locus_tag="Rv1023" /db_xref="GeneID:886062" CDS 1144564..1145853 /gene="eno" /locus_tag="Rv1023" /EC_number="4.2.1.11" /function="GLYCOLYSIS [CATALYTIC ACTIVITY:2-PHOSPHO-D-GLYCERATE = PHOSPHOENOLPYRUVATE + H(2)O]" /note="enolase; catalyzes the formation of phosphoenolpyruvate from 2-phospho-D-glycerate in glycolysis" /codon_start=1 /transl_table=11 /product="phosphopyruvate hydratase" /protein_id="NP_215539.1" /db_xref="GI:15608163" /db_xref="GOA:P96377" /db_xref="UniProtKB/Swiss-Prot:P96377" /db_xref="GeneID:886062" /translation="MPIIEQVRAREILDSRGNPTVEVEVALIDGTFARAAVPSGASTG EHEAVELRDGGDRYGGKGVQKAVQAVLDEIGPAVIGLNADDQRLVDQALVDLDGTPDK SRLGGNAILGVSLAVAKAAADSAELPLFRYVGGPNAHILPVPMMNILNGGAHADTAVD IQEFMVAPIGAPSFVEALRWGAEVYHALKSVLKKEGLSTGLGDEGGFAPDVAGTTAAL DLISRAIESAGLRPGADVALALDAAATEFFTDGTGYVFEGTTRTADQMTEFYAGLLGA YPLVSIEDPLSEDDWDGWAALTASIGDRVQIVGDDIFVTNPERLEEGIERGVANALLV KVNQIGTLTETLDAVTLAHHGGYRTMISHRSGETEDTMIADLAVAIGSGQIKTGAPAR SERVAKYNQLLRIEEALGDAARYAGDLAFPRFACETK" gene 1145858..1146544 /locus_tag="Rv1024" /db_xref="GeneID:886059" CDS 1145858..1146544 /locus_tag="Rv1024" /function="UNKNOWN" /note="Rv1024, (MTCY10G2.25c), len: 228 aa. Possible conserved membrane protein, with a hydrophobic region from aa 83-101. Equivalent to ML0256|NP_301311.1|NC_002677 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (227 aa), S&W scores: 178, E()= 2e-72, Identities: 145/203 (71%)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215540.1" /db_xref="GI:15608164" /db_xref="GOA:P96376" /db_xref="UniProtKB/TrEMBL:P96376" /db_xref="GeneID:886059" /translation="MPEAKRPESKRRSPASRPGKAGDSVRGGRATKPSAKPSTPAPHA SRKTTRTPHEHIVEPIKRAITESVEKRSEQRLGFTARRAAILAAVVCVLTLTIARPVR TYFAQRAEMEQLAATEAMLRRQIADLEEQQVKLADPAYIAAQARERLGFVMPGDIPFQ VQLPSTPLAPPQPGSDAATATNNEPWYTALWHTIADDPHLPPAAPPAPEPGRPGPLPP ASPNPEQPGG" gene 1146561..1147028 /locus_tag="Rv1025" /db_xref="GeneID:886042" CDS 1146561..1147028 /locus_tag="Rv1025" /function="UNKNOWN" /note="Rv1025, (MTCY10G2.24c), len: 155 aa. Conserved hypothetical protein, similar to hypothetical protein AE001768|AE001768_4 Thermotoga maritima (170 aa) FASTA scores: opt: 254, E(): 9.5e-10, (35.7% identity in 143 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215541.1" /db_xref="GI:15608165" /db_xref="UniProtKB/TrEMBL:P96375" /db_xref="GeneID:886042" /translation="MVTRQLGRAPRGVLAIAYRCPNGEPGVVKTAPRLPDGTPFPTLY YLTHPVLTAAASRLETTGLMREMNRRLGQDAELAAAYRRAHESYLSERDALEPLGTTV SAGGMPDRVKCLHVLIAHSLAKGPGLNPFGDEALALLAAEPRTAATLVAGQWR" gene 1147019..1147978 /locus_tag="Rv1026" /db_xref="GeneID:886089" CDS 1147019..1147978 /locus_tag="Rv1026" /function="UNKNOWN. COULD BE INVOLVED IN AN ADAPTIVE PROCESS THAT ALLOWS BACTERIA TO RESPOND TO AMINO ACID STARVATION." /note="Rv1026, (MTCY10G2.23c), len: 319 aa. Conserved hypothetical protein. Similar to GPPA_ECOLI|P25552 guanosine-5'-triphosphate,3'-diphosphate pyrophoshatase from Escherichia coli (494 aa), FASTA scores: opt: 281, E(): 3.2e-11, (30.6% identity in 291 aa overlap). Equivalent to AL023514|MLCB4.02 hypothetical protein from Mycobacterium leprae (317 aa) (77.9% identity in 321 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215542.1" /db_xref="GI:15608166" /db_xref="UniProtKB/TrEMBL:P96374" /db_xref="GeneID:886089" /translation="MALTRVAAIDCGTNSIRLLIADVGAGLARGELHDVHRETRIVRL GQGVDATGRFAPEAIARTRTALTDYAELLTFHHAERVRMVATSAARDVVNRDVFFAMT ADVLGAALPGSAAEVITGAEEAELSFRGAVGELGSAGAPFVVVDLGGGSTEIVLGEHE VVASYSADIGCVRLTERCLHSDPPTLQEVSTARRLVRERLEPALRTVPLELARTWVGL AGTMTTLSALAQSMTAYDAAAIHLSRVPGADLLEVCQRLIGMTRKQRAALAPMHPGRA DVIGGGAIVVEELARELRERAGIDQLTVSEHDILDGIALSLAG" gene complement(1148427..1149107) /gene="kdpE" /locus_tag="Rv1027c" /db_xref="GeneID:886046" CDS complement(1148427..1149107) /gene="kdpE" /locus_tag="Rv1027c" /function="MEMBER OF THE TWO-COMPONENT REGULATORY SYSTEM KDPD/KDPE INVOLVED IN THE REGULATION OF THE KDP OPERON." /note="Rv1027c, (MTCY10G2.22), len: 226 aa. Probable KdpE, transcriptional regulatory protein, similar to others e.g. KDPE_ECOLI|P21866 kdp operon transcriptional regulatory protein from Escherichia coli strain K12 (225 aa), FASTA scores: opt: 691, E(): 0, (47.8% identity in 224 aa overlap); AL021530|SC2E9.13 from Streptomyces coelicolor (227 aa), FASTA scores: opt: 981, E(): 0, (66.4% identity in 226 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein KdpE" /protein_id="NP_215543.1" /db_xref="GI:15608167" /db_xref="GOA:P96373" /db_xref="UniProtKB/TrEMBL:P96373" /db_xref="GeneID:886046" /translation="MTLVLVIDDEPQILRALRINLTVRGYQVITASTGAGALRAAAEH PPDVVILDLGLPDMSGIDVLGGLRGWLTAPVIVLSARTDSSDKVQALDAGADDYVTKP FGMDEFLARLRAAVRRNTAAAELEQPVIETDSFTVDLAGKKVIKDGAEVHLTPTEWGM LEMLARNRGKLVGRGELLKEVWGPAYATETHYLRVYLAQLRRKLEDDPSHPKHLLTES GMGYRFEA" gene complement(1149104..1151686) /gene="kdpD" /locus_tag="Rv1028c" /db_xref="GeneID:886084" CDS complement(1149104..1151686) /gene="kdpD" /locus_tag="Rv1028c" /EC_number="2.7.3.-" /function="MEMBER OF THE TWO-COMPONENT REGULATORY SYSTEM KDPD/KDPE INVOLVED IN THE REGULATION OF THE KDP OPERON. KDPD MAY FUNCTION AS A MEMBRANE-ASSOCIATED PROTEIN KINASE THAT PHOSPHORYLATES KDPE|Rv1027c IN RESPONSE TO ENVIRONMENTAL SIGNALS." /note="Rv1028c, (MTCY10G2.21), len: 860 aa. Probable kdpD, sensor protein (EC 2.7.3.-), similar to others e.g. KDPD_ECOLI|P21865 sensor protein from Escherichia coli strain K12 (894 aa), FASTA scores: opt: 1041, E(): 0, (32.3% identity in 888 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="sensor protein KDPD" /protein_id="NP_215544.1" /db_xref="GI:15608168" /db_xref="GOA:P96372" /db_xref="UniProtKB/Swiss-Prot:P96372" /db_xref="GeneID:886084" /translation="MTLLFADLCAIFTPYRWMIEHVTTKRGQLRIYLGAAPGVGKTYA MLGEAHRRLERGTDVVAAVVETHGRNKTAKLLEGIEMIPPRYVEYRGARFPELDVEAV LRRHPQVVLVDELAHTNTPGSKNPKRWQDVQEILDAGITVISTVNIQHLEGLNDVVEQ ITGIEQKEKIPDEIVRAADQVELVDITPEALRRRLAHGNVYAAERVDAALSNYFRTGN LTALREIALLWLADQVDAALEKYRADKKITATWEARERVVVAVTGGPESETLVRRASR IASKSSAELMVVHVIRGDGLAGVSAPQLGRVRELATSLGATMHTVVGDDVPTALLDFA REMNATQLVVGTSRRSRWARLFDEGIGARTVQEPGGIDVHMVTHPAASRASGWSRVSP RERHIASWLAALVVPSVICAITVAWLDRFMGIGGESALFFIGVLIVALLGGVAPAALS ALLSGMLLNYFLTEPRYTWTIAEPDAAVTEFVLLAMAVAVAVLVDGAASRTREARRAS QEAELLALFAGSVLRGADLATLLQRVRETYSQRAVTMLRVRQGASTGETVACVGTNPC RDVDSADTAIEVGDDEFWMLMAGRKLAARDRRVLTAVATQAAGLVKQRELAEEAGQAE AIARADELRRSLLSAVSHDLRTPLAAAKVAVSSLRTEDVAFSPEDTAELLATIEESID QLTALVANLLDSSRLAAGVIRPQLRRAYLEEAVQRALVSIGKGATGFYRSGIDRVKVD VGDAVAMADAGLLERVLANLIDNALRYAPDCVVRVNAGRVRERVLINVIDEGPGVPRG TEEQLFAPFQRPGDHDNTTGVGLGMSVARGFVEAMGGTISATDTPGGGLTVVIDLAAP EDRP" misc_feature complement(1151561..1151584) /gene="kdpD" /locus_tag="Rv1028c" /note="PS00017 ATP/GTP-binding site motif A" gene 1151920..1152012 /gene="kdpF" /locus_tag="Rv1028A" /db_xref="GeneID:3205056" CDS 1151920..1152012 /gene="kdpF" /locus_tag="Rv1028A" /function="THOUGHT TO BE INVOLVED IN STABILIZATION OF THE KDP COMPLEX." /note="Rv1028A, len: 30 aa. Probable kdpF, membrane protein, showing similarity with P36937|KDPF_ECOLI|B0698.1 PROTEIN KDPF from Escherichia coli strain K12 (see citation below) (27% identity); and KdpF protein from Streptomyces coelicolor (51% identity)." /codon_start=1 /transl_table=11 /product="membrane protein kdpF" /protein_id="YP_177636.1" /db_xref="GI:57116812" /db_xref="UniProtKB/TrEMBL:Q79FT7" /db_xref="GeneID:3205056" /translation="MTTVDNIVGLVIAVALMAFLFAALLFPEKF" gene 1152012..1153727 /gene="kdpA" /locus_tag="Rv1029" /db_xref="GeneID:887414" CDS 1152012..1153727 /gene="kdpA" /locus_tag="Rv1029" /EC_number="3.6.3.12" /function="ONE OF THE COMPONENTS OF THE HIGH-AFFINITY ATP-DRIVEN POTASSIUM TRANSPORT (OR KDP) SYSTEM, WHICH CATALYZES THE HYDROLYSIS OF ATP COUPLED WITH THE EXCHANGE OF HYDROGEN AND POTASSIUM IONS [CATALYTIC ACTIVITY: ATP + H(2)O + K(+)(OUT) = ADP + PHOSPHATE + K(+)(IN)]." /note="catalyzes the hydrolysis of ATP coupled with the exchange of hydrogen and potassium ions" /codon_start=1 /transl_table=11 /product="potassium-transporting ATPase subunit A" /protein_id="NP_215545.1" /db_xref="GI:15608169" /db_xref="GOA:P65209" /db_xref="UniProtKB/Swiss-Prot:P65209" /db_xref="GeneID:887414" /translation="MSGTSWLQFAALIAVLLLTAPALGGYLAKIYGDEAKKPGDRVFG PIERVIYQVCRVDPGSEQRWSTYALSVLAFSVMSFLLLYGIARFQGVLPFNPTDKPAV TDHVAFNAAVSFMTNTNWQSYSGEATMSHFTQMTGLAVQNFVSASAGMCVLAALIRGL ARKRASTLGNFWVDLARTVLRIMFPLSFVVAILLVSQGVIQNLHGFIVANTLEGAPQL IPGGPVASQVAIKQLGTNGGGFFNVNSAHPFENYTPIGNFVENWAILIIPFALCFAFG KMVHDRRQGWAVLAIMGIIWIGMSVAAMSFEAKGNPRLDALGVTQQTTVDQSGGNLEG KEVRFGVGASGLWAASTTGTSNGSVNSMHDSYTPLGGMVPLAHMMLGEVSPGGTGVGL NGLLVMAILAVFIAGLMVGRTPEYLGKKIQATEMKLVTLYILAMPIALLSFAAASVLI SSALASRNNPGPHGLSEILYAYTSGANNNGSAFAGLTASTWSYDTTIGVAMLIGRFFL IIPVLAIAGSLARKGTTPVTAATFPTHKPLFVGLVIGVVLIVGGLTFFPALALGPIVE QLSTQ" gene 1153724..1155853 /gene="kdpB" /locus_tag="Rv1030" /db_xref="GeneID:887343" CDS 1153724..1155853 /gene="kdpB" /locus_tag="Rv1030" /EC_number="3.6.3.12" /function="ONE OF THE COMPONENTS OF THE HIGH-AFFINITY ATP-DRIVEN POTASSIUM TRANSPORT (OR KDP) SYSTEM, WHICH CATALYZES THE HYDROLYSIS OF ATP COUPLED WITH THE EXCHANGE OF HYDROGEN AND POTASSIUM IONS [CATALYTIC ACTIVITY:ATP + H(2)O + K(+)(OUT) = ADP + PHOSPHATE + K(+)(IN)]." /note="One of the components of the high-affinity ATP-driven potassium transport (or KDP) system, which catalyzes the hydrolysis of ATP coupled with the exchange of hydrogen and potassium ions" /codon_start=1 /transl_table=11 /product="potassium-transporting ATPase subunit B" /protein_id="NP_215546.1" /db_xref="GI:15608170" /db_xref="GOA:P63681" /db_xref="UniProtKB/Swiss-Prot:P63681" /db_xref="GeneID:887343" /translation="MMIARMETSATAAAATSAPRLRLAKRSLFDPMIVRSALPQSLRK LAPRVQARNPVMLVVLVGAVITTLAFLRDLASSTAQENVFNGLVAAFLWFTVLFANFA EAMAEGRGKAQAAALRKVRSETMANRRTAAGNIESVPSSRLDLDDVVEVSAGETIPSD GEIIEGIASVDESAITGESAPVIRESGGDRSAVTGGTVVLSDRIVVRITAKQGQTFID RMIALVEGAARQQTPNEIALNILLAGLTIIFLLAVVTLQPFAIYSGGGQRVVVLVALL VCLIPTTIGALLSAIGIAGMDRLVQHNVLATSGRAVEAAGDVNTLLLDKTGTITLGNR QATEFVPINGVSAEAVADAAQLSSLADETPEGRSIVVLAKDEFGLRARDEGVMSHARF VPFTAETRMSGVDLAEVSGIRRIRKGAAAAVMKWVRDHGGHPTEEVGAIVDGISSGGG TPLVVAEWTDNSSARAIGVVHLKDIVKVGIRERFDEMRRMSIRTVMITGDNPATAKAI AQEAGVDDFLAEATPEDKLALIKREQQGGRLVAMTGDGTNDAPALAQADVGVAMNTGT QAAREAGNMVDLDSDPTKLIEVVEIGKQLLITRGALTTFSIANDVAKYFAIIPAMFVG LYPVLDKLNVMALHSPRSAILSAVIFNALVIVALIPLALRGVRFRAESASAMLRRNLL IYGLGGLVVPFIGIKLVDLVIVALGVS" misc_feature 1154693..1154713 /gene="kdpB" /locus_tag="Rv1030" /note="PS00154 E1-E2 ATPases phosphorylation site" gene 1155853..1156422 /gene="kdpC" /locus_tag="Rv1031" /db_xref="GeneID:886008" CDS 1155853..1156422 /gene="kdpC" /locus_tag="Rv1031" /EC_number="3.6.3.12" /function="ONE OF THE COMPONENTS OF THE HIGH-AFFINITY ATP-DRIVEN POTASSIUM TRANSPORT (OR KDP) SYSTEM, WHICH CATALYZES THE HYDROLYSIS OF ATP COUPLED WITH THE EXCHANGE OF HYDROGEN AND POTASSIUM IONS. THE C SUBUNIT MAY BE INVOLVED IN ASSEMBLY OF THE KDP COMPLEX. [CATALYTIC ACTIVITY: ATP + H(2)O + K(+)(OUT) = ADP + PHOSPHATE + K(+)(IN)]." /note="one of the components of the high-affinity ATP-driven potassium transport (or KDP)system, which catalyzes the hydrolysis of ATP coupled with the exchange of hydrogen and potassium ions; the C subunit may be involved in assembly of the KDP complex" /codon_start=1 /transl_table=11 /product="potassium-transporting ATPase subunit C" /protein_id="NP_215547.1" /db_xref="GI:15608171" /db_xref="GOA:P65211" /db_xref="UniProtKB/Swiss-Prot:P65211" /db_xref="GeneID:886008" /translation="MRRQLLPALTMLLVFTVITGIVYPLAVTGVGQLFFGDQANGALL ERDGQVIGSAHIGQQFTAAKYFHPRPSSAGDGYDAAASSGSNLGPTNEKLLAAVAERV TAYRKENNLPADTLVPVDAVTGSGSGLDPAISVVNAKLQAPRVAQARNISIRQVERLI EDHTDARGLGFLGERAVNVLRLNLALDRL" gene complement(1156426..1157955) /gene="trcS" /locus_tag="Rv1032c" /db_xref="GeneID:887790" CDS complement(1156426..1157955) /gene="trcS" /locus_tag="Rv1032c" /EC_number="2.7.3.-" /function="SENSOR PART OF THE TWO COMPONENT REGULATORY SYSTEM TRCS/TRCR." /experiment="experimental evidence, no additional details recorded" /note="Rv1032c, (MTCY10G2.17), len: 509 aa. trcS, two component sensor histidine kinase protein (EC 2.7.3.-) (see citations below), similar to YV16_MYCLE|P54883 probable sensor-like histidine kinase from Mycobacterium leprae (443 aa), FASTA scores: opt: 392, E(): 3.8e-18, (31.7% identity in 334 aa overlap). Note that in vitro autophosphorylation of TrcS requires the presence of Mn2+or Ca2+as a divalent cation cofactor and subsequent transphosphorylation of TrcR is evident in the presence of TrcS-phosphate and Ca2+." /codon_start=1 /transl_table=11 /product="two component sensor histidine kinase TRCS" /protein_id="NP_215548.1" /db_xref="GI:15608172" /db_xref="GOA:P96368" /db_xref="UniProtKB/TrEMBL:P96368" /db_xref="GeneID:887790" /translation="MIPDRNTRSRKAPCWRPRSLRQQLLLGVLAVVTVVLVAVGVVSV LSLSGYVTAMNDAELVESLHALNHSYTRYRDSAQTSTPTGNLPMSQAVLEFTGQTPGN LIAVLHDGVVIGSAVFSEDGARPAPPDVIRAIEAQVWDGGPPRVESLGSLGAYQVDSS AAGADRLFVGVSLSLANQIIARKKVTTVALVGAALVVTAALTVWVVGYALRPLRRVAA TAAEVATMPLTDDDHQISVRVRPGDTDPDNEVGIVGHTLNRLLDNVDGALAHRVDSDL RMRQFITDASHELRTPLAAIQGYAELTRQDSSDLPPTTEYALARIESEARRMTLLVDE LLLLSRLSEGEDLETEDLDLTDLVINAVNDAAVAAPTHRWVKNLPDEPVWVNGDHARL HQLVSNLLTNAWVHTQPGVTVTIGITCHRTGPNAPCVELSVTDDGPDIDPEILPHLFD RFVRASKSRSNGSGHGLGLAIVSSIVKAHRGSVTAESGNGQTVFRVRLPMIEQQIATT A" gene complement(1157963..1158736) /gene="trcR" /locus_tag="Rv1033c" /db_xref="GeneID:887957" CDS complement(1157963..1158736) /gene="trcR" /locus_tag="Rv1033c" /function="SENSOR PART OF THE TWO COMPONENT REGULATORY SYSTEM TRCS/TRCR. INVOLVED IN TRANSCRIPTIONAL AUTOACTIVATION: TRCR ACTIVATES ITS OWN EXPRESSION BY INTERACTING WITH THE AT-RICH SEQUENCE OF THE TRCR PROMOTER." /experiment="experimental evidence, no additional details recorded" /note="Rv1033c, (MTCY10G2.16), len: 257 aa. trcR, two-component regulatory protein (see citations below), similar to Q50825 TWO COMPONENT RESPONSE REGULATOR from Mycobacterium tuberculosis (234 aa), FASTA scores: opt: 628, E(): 0, (46.0% identity in 226 aa overlap). Note that in vitro autophosphorylation of TrcS requires the presence of Mn2+or Ca2+as a divalent cation cofactor and subsequent transphosphorylation of TrcR is evident in the presence of TrcS-phosphate and Ca2+." /codon_start=1 /transl_table=11 /product="two component transcriptional regulator TRCR" /protein_id="NP_215549.1" /db_xref="GI:15608173" /db_xref="GOA:Q50806" /db_xref="UniProtKB/TrEMBL:Q50806" /db_xref="GeneID:887957" /translation="MTTMSGYTRSQRPRQAILGQLPRIHRADGSPIRVLLVDDEPALT NLVKMALHYEGWDVEVAHDGQEAIAKFDKVGPDVLVLDIMLPDVDGLEILRRVRESDV YTPTLFLTARDSVMDRVTGLTSGADDYMTKPFSLEELVARLRGLLRRSSHLERPADEA LRVGDLTLDGASREVTRDGTPISLSSTEFELLRFLMRNPRRALSRTEILDRVWNYDFA GRTSIVDLYISYLRKKIDSDREPMIHTVRGIGYMLRPPE" gene complement(1158918..1159307) /locus_tag="Rv1034c" /db_xref="GeneID:886010" CDS complement(1158918..1159307) /locus_tag="Rv1034c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1560." /note="Rv1034c, (MTCY10G2.15), len: 129 aa. Probable IS1560 transposase fragment, similar to part of Rv3387|E1202305|MTV004.45 (225 aa) (65.1% identity in 129 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215550.1" /db_xref="GI:15608174" /db_xref="UniProtKB/TrEMBL:P96367" /db_xref="GeneID:886010" /translation="MQQGNPPDAPQLAPAVAWVKKRAGRTPRTVTADRGYGEAAVDQQ LTEVGVKNVLIPRKGKPSQDRRAEEHRKAFRRTIKWRTGCEGRISHLKRGYGWDRGRI GGLEGTRTWVGHGVFAHNLVTISALPA" repeat_region complement(1158921..1160433) /note="IS1560-1, len: 1513 bp. Insertion sequence IS1560." /mobile_element="insertion sequence:IS1560-1" gene complement(1159375..1160061) /locus_tag="Rv1035c" /db_xref="GeneID:888206" CDS complement(1159375..1160061) /locus_tag="Rv1035c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1560." /note="Rv1035c, (MTCY10G2.14), len: 228 aa. Probable IS1560 transposase fragment, similar to parts of Rv3387|E1202305|MTV004.45 (225 aa) (47.8% identity in 67 aa overlap) and Rv3386|E1202304|MTV004.44 (234 aa) (55.1% identity in 127 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215551.1" /db_xref="GI:15608175" /db_xref="UniProtKB/TrEMBL:P96366" /db_xref="GeneID:888206" /translation="MPHPTTLMKLTTRCGSAAIDGLNEALLAKAAEAKLLGTNRIRAD TTVARANVSYPTDLGLLAKAMRRIAATGKRIQAAGGAVRTRVGDRSRAAGRRAHAVAA KLRSRAELGRDEARAAVLRFTGELAELAQAAAQEAQQLLDNAKQAVLRAKAKAAALAA RGERDAVAGRRCGGLVRAVNDLTELLNATRQIVAQTRQRVAGITSDGASRRVSLHDGD ARPDHQGSAR" gene complement(1160095..1160433) /locus_tag="Rv1036c" /db_xref="GeneID:888227" CDS complement(1160095..1160433) /locus_tag="Rv1036c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1560." /note="Rv1036c, (MTCY10G2.13), len: 112 aa. Probable IS1560 transposase fragment, similar to part of Rv3386|E1202304|MTV004.44 (234 aa) (82.8% identity in 87 aa overlap)." /codon_start=1 /transl_table=11 /product="truncated IS1560 transposase" /protein_id="NP_215552.1" /db_xref="GI:15608176" /db_xref="UniProtKB/TrEMBL:P96365" /db_xref="GeneID:888227" /translation="MIPGRMVLNWEDGLNALVAEGIEAIVFRTLGDQCWLWESLLPDE VRRLPEELARVDALLDDPAFFAPFVPFFDPRRGRPSTPMEVYLQLMFVKFRYRLGYES LCREVADSIT" gene complement(1160544..1160828) /gene="esxI" /locus_tag="Rv1037c" /db_xref="GeneID:888299" CDS complement(1160544..1160828) /gene="esxI" /locus_tag="Rv1037c" /function="UNKNOWN" /note="Rv1037c, (MTCY10G2.12), len: 94 aa. esxI, ESAT-6 like protein (see citations below), highly similar to Q49946|ES6X_MYCLE|U1756D PUTATIVE ESAT-6 LIKE PROTEIN X from Mycobacterium leprae (95 aa), FASTA scores: opt: 409, E(): 6.3e-23, (64.15% identity in 92 aa overlap); Rv3619c, Rv1198, Rv2346c, etc from Mycobacterium tuberculosis. Strictly identical to P96364|ES61_MYCTU|Rv3619c|MTCY15C10.33|MTCY07H7B.03|MT372 1 PUTATIVE ESAT-6 LIKE PROTEIN 1 (94 aa). BELONGS TO THE ESAT6 FAMILY.; ES6_1, Mtb9.9D" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXI (ESAT-6 like protein 1)" /protein_id="NP_215553.1" /db_xref="GI:15608177" /db_xref="UniProtKB/Swiss-Prot:P96364" /db_xref="GeneID:888299" /translation="MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGG AGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" gene complement(1160855..1161151) /gene="esxJ" /locus_tag="Rv1038c" /db_xref="GeneID:888372" CDS complement(1160855..1161151) /gene="esxJ" /locus_tag="Rv1038c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1038c, (MT1067, MTCY10G2.11), len: 98 aa. esxJ, ESAT-6 like protein (see Gey Van Pittius et al., 2001), similar to Q49945|U1756C, Mycobacterium leprae (100 aa), FASTA scores: opt: 375, E(): 7.7e-21, (58.3% identity in 96 aa overlap). Member of M. tuberculosis hypothetical QILSS protein family with Rv1197, Rv1792, Rv2347c and Rv3620c. BELONGS TO THE ESAT6 FAMILY.; ES6_2, TB11.0, QILSS" /codon_start=1 /transl_table=11 /product="Esat-6 like protein esxJ (Esat-6 like protein 2)" /protein_id="NP_215554.1" /db_xref="GI:15608178" /db_xref="UniProtKB/Swiss-Prot:P96363" /db_xref="GeneID:888372" /translation="MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" gene complement(1161297..1162472) /gene="PPE15" /locus_tag="Rv1039c" /db_xref="GeneID:888477" CDS complement(1161297..1162472) /gene="PPE15" /locus_tag="Rv1039c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1039c, (MTCY10G2.10), len: 391 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to Rv2768c|AL008967|MTV002_33 Mycobacterium tuberculosis H37Rv (394 aa), FASTA scores: opt: 1721, E(): 0, (70.4% identity in 398 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177778.1" /db_xref="GI:57116813" /db_xref="UniProtKB/TrEMBL:Q7D8Y7" /db_xref="GeneID:888477" /translation="MDFGALPPEINSARMYAGAGAGPMMAAGAAWNGLAAELGTTAAS YESVITRLTTESWMGPASMAMVAAAQPYLAWLTYTAEAAAHAGSQAMASAAAYEAAYA MTVPPEVVAANRALLAALVATNVLGINTPAIMATEALYAEMWAQDALAMYGYAAASGA AGMLQPLSPPSQTTNPGGLAAQSAAVGSAAATAAVNQVSVADLISSLPNAVSGLASPV TSVLDSTGLSGIIADIDALLATPFVANIINSAVNTAAWYVNAAIPTAIFLANALNSGA PVAIAEGAIEAAEGAASAAAAGLADSVTPAGLGASLGEATLVGRLSVPAAWSTAAPAT TAGATALEGSGWTVAAEEAGPVTGMMPGMASAAKGTGAYAGPRYGFKPTVMPKQVVV" gene complement(1162549..1163376) /gene="PE8" /locus_tag="Rv1040c" /db_xref="GeneID:888533" CDS complement(1162549..1163376) /gene="PE8" /locus_tag="Rv1040c" /function="UNKNOWN" /note="Rv1040c, (MTCY10G2.09), len: 275 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), most similar to AL008967|MTV002_34 Mycobacterium tuberculosis H37Rv (275 aa), FASTA scores: opt: 1111, E(): 0, (68.6% identity in 283 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177779.1" /db_xref="GI:57116814" /db_xref="UniProtKB/TrEMBL:Q7D8Y6" /db_xref="GeneID:888533" /translation="MSFLKTVPEELTAAAAQLGTIGAAMAAQNAAAAAPTTAIAPAAL DEVSALQAALFTAYGTFYQQVSAEAQAMHDMFVNTLGISAGTYGVTESLNSSAAASPL SGITGEASAIIQATTGLFPPELSGGIGNILNIGAGNWASATSTLIGLAGGGLLPAEEA AEAASALGGEAALGELGALGAAEAALGEAGIAAGLGSASAIGMLSVPPAWAGQATLVS TTSTLPGAGWTAAAPQAAAGTFIPGMPGVASAARNSAGFGAPRYGVKPIVMPKPATV" repeat_region complement(1164572..1165549) /note="IS-LIKE-1, len: 978 bp. Insertion sequence, ISLIKE, region identical to cosmid y348, blast score= 4902 (+1) 9377 10354 EM_NEW:MTAD20 Ad000020 Mycobacterium tuberculosis sequence from clone y348" /mobile_element="insertion sequence:IS-LIKE-1" gene complement(1164572..1165435) /locus_tag="Rv1041c" /db_xref="GeneID:888546" CDS complement(1164572..1165435) /locus_tag="Rv1041c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /experiment="experimental evidence, no additional details recorded" /note="Rv1041c, (MTCY10G2.08), len: 287 aa. Probable IS like-2 transposase, overlaps MTCY10G2.07. Similar to Q00430|X53945 insertion element IS869 hypothetical protein from Agrobacterium tumefaciens (186 aa), FASTA scores: opt: 173, E(): 0.00016, (40.9% identity in 176 aa overlap). Similar to Rv1150, C-terminal part of transposase of putative Mycobacterium tuberculosis IS like-1. MTCY10G2.07 and MTCY10G2.08 are frameshifted with respect to Mycobacterium tuberculosis Q50761 transposase, the 10G2 cosmid sequence appears to be correct." /codon_start=1 /transl_table=11 /product="IS like-2 transposase" /protein_id="NP_215557.1" /db_xref="GI:15608181" /db_xref="GOA:P96360" /db_xref="UniProtKB/TrEMBL:P96360" /db_xref="GeneID:888546" /translation="MRASPADGLAITGLSWKGSRGGSVREVRGGTCPLSSGRGKRCGS AITVGRWMVPATRCSPTLPRCSGWTLRWPRISRSCCRWIPRTCGHTSIRRAPARTRSP QGALSDYKKSADEPDDHAIGRSRGGLTTKIHALTDQREAPVRIRLTAGQAGDNPQLLP LLDDYRHASTEYALGSTDFRLLADKAYSHPSTRAALRSKKIKHTIPERQDQIDRRKAK GSAGGRPPAFDAALYGLRNTVERGFHRLKQWRGIATRYDKYALTYLGGVLLACAVIHA RVGTPKLGDTP" repeat_region 1164572..1164589 /note="18 bp inverted repeat at the left end of IS-LIKE element, CTAGGGCGTGTCTCCCAA" gene complement(1165092..1165499) /locus_tag="Rv1042c" /db_xref="GeneID:888607" CDS complement(1165092..1165499) /locus_tag="Rv1042c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv1042c, (MTCY10G2.07), len: 135 aa. Probable IS like-2 transposase, similar to Q50761 TRANSPOSASE from Mycobacterium tuberculosis (308 aa), FASTA scores: opt: 823, E(): 0, (99.1% identity in 117 aa overlap). Second copy is Rv1149." /codon_start=1 /transl_table=11 /product="IS like-2 transposase" /protein_id="NP_215558.1" /db_xref="GI:15608182" /db_xref="UniProtKB/TrEMBL:P96359" /db_xref="GeneID:888607" /translation="MTRVGVISDEFWAVVEPLMPSHEGKPGRRFSDHRLILEGIAWRF RTGSPWRDLPAEFGPWQTVWKRHHRWSLDGTCDEVFAHVAAVFGVDAEVAEDIEKLLS VDSTNVRAHQHSAGACSDTLATGGTVGLQEIRR" repeat_region complement(1165532..1165549) /note="18 bp inverted repeat at the right end of a IS-LIKE element, CTAGGGCGTGTCTCCCAA" gene complement(1165781..1166806) /locus_tag="Rv1043c" /db_xref="GeneID:888634" CDS complement(1165781..1166806) /locus_tag="Rv1043c" /function="UNKNOWN" /note="Rv1043c, (MTCY10G2.06), len: 341 aa. Conserved hypothetical protein similar to AL096872|SC5F7.08 PUTATIVE LIPOATE-PROTEIN LIGASE from Streptomyces coelicolor (362 aa), FASTA scores: opt: 206, E(): 1.4e-05, (30.3% identity in 201 aa overlap). Weak similarity to P39668|YYXA_BACSU HYPOTHETICAL PROTEASE from Bacillus subtitis (400 aa), FASTA scores: opt: 159, E(): 0.013, (27.1% identity in 210 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215559.1" /db_xref="GI:15608183" /db_xref="GOA:P96358" /db_xref="UniProtKB/TrEMBL:P96358" /db_xref="GeneID:888634" /translation="MCAHQFFGLVHNPVVAAAIGKPEPPPVDSDIGLPTTVPFEPWSV ADFSRYLSTLGLPAAGDAVTLHRILSSMERAGLLLPLGWDPRLPVMGQKYISQGAISK GQRGGNLWLSEVFGAELIIPSYNAVTVQLAGHDDAGNPVDSWGTGLVVDHNHVITNKH VVTGLAGTSAGLSVYPSSNHAEAELVNFSGTAHPHPTLDVAVIKFEMPEGKYIPRLGG MAFRDPDWADEVYVFGYPRVPMTAEMAITVQRGEVVNPAATTIPGRQKIFLYSAIARP GNSGGPIVAQDGRVIGLVVEDSAEAPSTGTGPNAAPFYRGIPSSEVIRALDELDFGGI VEMDTLP" gene 1167053..1167676 /locus_tag="Rv1044" /db_xref="GeneID:888712" CDS 1167053..1167676 /locus_tag="Rv1044" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1044, (MTCY10G2.05c), len: 207 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein MTCY06G11.02C|P96837 (289 aa), fasta scores: E(): 8.9e-06, (30.7% identity in 150 aa overlap). Some similarity to U36837|LLU36837_1 Lactococcus lactis plasmid pNP40 (287 aa), FASTA scores: opt: 147, E (): 0.0087, (29.7% identity in 91 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215560.1" /db_xref="GI:15608184" /db_xref="UniProtKB/TrEMBL:P96357" /db_xref="GeneID:888712" /translation="MCAKPYLIDTIAHMAIWDRLVEVAAEQHGYVTTRDARDIGVDPV QLRLLAGRGRLERVGRGVYRVPVLPRGEHDDLAAAVSWTLGRGVISHESALALHALAD VNPSRIHLTVPRNNHPRAAGGELYRVHRRDLQAAHVTSVDGIPVTTVARTIKDCVKTG TDPYQLRAAIERAEAEGTLRRGSAAELRAALDETTAGLRARPKRASA" gene 1167673..1168554 /locus_tag="Rv1045" /db_xref="GeneID:888783" CDS 1167673..1168554 /locus_tag="Rv1045" /function="UNKNOWN" /note="Rv1045, (MTCY10G2.04c), len: 293 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215561.1" /db_xref="GI:15608185" /db_xref="UniProtKB/TrEMBL:P96356" /db_xref="GeneID:888783" /translation="MTKPYSSPPTNLRSLRDRLTQVAERQGVVFGRLQRHVAMIVVAQ FAATLTDDTGAPLLLVKGGSSLELRRGIPDSRTSKDFDTVARRDIELIHEQLADAGET GWEGFTAIFTAPEEIDVPGMPVKPRRFTAKLSYRGRAFATVPIEVSSVEAGNADQFDT LTSDALGLVGVPAAVAVPCMTIPWQIAQKLHAVTAVLEEPKVNDRAHDLVDLQLLEGL LLDADLMPTRSACIAIFEARAQHPWPPRVATLPHWPLIYAGALEGLDHLELARTVDAA AQAVQRFVARIDRATKR" gene complement(1168704..1169228) /locus_tag="Rv1046c" /db_xref="GeneID:886071" CDS complement(1168704..1169228) /locus_tag="Rv1046c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1046c, (MTCY10G2.03), len: 174 aa. Hypothetical unknown protein. Start changed since first submission (-65 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215562.2" /db_xref="GI:57116815" /db_xref="UniProtKB/TrEMBL:O86321" /db_xref="GeneID:886071" /translation="MKVQARVGWNRRQLSAVGGRGQQLFANAPGHIPSTSHRRGTGDI NRKIDESLAGAARPQANANYGATSDPPLTHQPKPGSPTQVGPRSPSPPGLRGLVKQLP EVHQSSLHLDTVASLPSSRPSPHHTPLALRSRSGHFSPDEIRNRRSRKRSQSHMPPRT PPRGRCLRAPEALA" repeat_region 1169298..1170732 /note="IS1081-1, len: 1435 bp. Insertion sequence IS1081, almost identical to Mycobacterium bovis IS1081 (7157 (-1) 60 14 94 EM_BA:MBBIS1081 X84741 Mycobacterium bovis BCG IS1081 DNA. 4/96" /mobile_element="insertion sequence:IS1081-1" gene 1169423..1170670 /locus_tag="Rv1047" /db_xref="GeneID:886060" CDS 1169423..1170670 /locus_tag="Rv1047" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1081." /note="Rv1047, (MTCY10G2.02c), len: 415 aa. IS1081 transposase, most similar to TRA1_MYCBO|P35882 transposase for insertion sequence element (415 aa), FASTA scores: opt: 2675, E(): 0, (99.8% identity in 415 aa overlap). Contains PS01007 Transposases, Mutator family, signature" /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215563.1" /db_xref="GI:15608187" /db_xref="GOA:P96354" /db_xref="UniProtKB/TrEMBL:P96354" /db_xref="GeneID:886060" /translation="MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" misc_feature 1170119..1170193 /locus_tag="Rv1047" /note="PS01007 Transposases, Mutator family, signature" gene complement(1171038..1172153) /locus_tag="Rv1048c" /db_xref="GeneID:885295" CDS complement(1171038..1172153) /locus_tag="Rv1048c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1048c, (MTV017.01c-MTCY10G2.01), len: 371 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215564.1" /db_xref="GI:15608188" /db_xref="UniProtKB/TrEMBL:P96353" /db_xref="GeneID:885295" /translation="MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSAL EGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAP TMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRA TLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLI VDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSA SLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQ NLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK" gene 1172386..1172832 /locus_tag="Rv1049" /db_xref="GeneID:886091" CDS 1172386..1172832 /locus_tag="Rv1049" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1049, (MTV017.02), len: 148 aa. Probable transcriptional repressor protein, similar to many e.g. P74870 NEGATIVE REGULATOR OF EMR LOCUS EMR from Salmonella typhimurium (149 aa), FASTA scores: opt: 146, E(): 0.0011, (31.6% identity in 95 aa overlap). TBparse score is 0.892. Contains probable helix-turn -helix motif at aa 58-79 (Score 1495, +4.28 SD)." /codon_start=1 /transl_table=11 /product="transcriptional repressor protein" /protein_id="NP_215565.1" /db_xref="GI:15608189" /db_xref="GOA:O53397" /db_xref="UniProtKB/TrEMBL:O53397" /db_xref="GeneID:886091" /translation="MGKGAAFDECACYTTRRAARQLGQAYDRALRPSGLTNTQFSTLA VISLSEGSAGIDLTMSELAARIGVERTTLTRNLEVMRRDGLVRVMAGADARCKRIELT AKGRAALQKAVPLWRGVQAEVTASVGDWPRVRRDIANLGQAAEACR" gene 1172881..1173786 /locus_tag="Rv1050" /db_xref="GeneID:887146" CDS 1172881..1173786 /locus_tag="Rv1050" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1050, (MTV017.03), len: 301 aa. Probable oxidoreductase (EC 1.-.-.-) similar to many e.g. Rv1543|MTCY48.22C|Q10783 PUTATIVE OXIDOREDUCTASE CY48.22C (341 aa), FASTA scores: opt: 462, E(): 3e-22, (33.6% identity in 265 aa overlap). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215566.1" /db_xref="GI:15608190" /db_xref="GOA:O53398" /db_xref="UniProtKB/TrEMBL:O53398" /db_xref="GeneID:887146" /translation="MARQRFRDQVVLITGASSGIGEATAKAFAREGAVVALAARREGA LRRVAREIEAAGGRAMVAPLDVSSSESVRAMVADVVGEFGRIDVVFNNAGVSLVGPVD AETFLDDTREMLEIDYLGTVRVVREVLPIMKQQRSGRIMNMSSVVGRKAFARFAGYSS AMHAIAGFSDALRQELRGSGIAVSVIHPALTQTPLLANVDPADMPPPFRSLTPIPVHW VAAAVLDGVARRRARVVVPFQPRLLMVGDAFSPRYGDRVVRLLESKIFGRLIGSYRGS VYRHQPTESAKAQAAQPERGYSSAR" gene complement(1173945..1174700) /locus_tag="Rv1051c" /db_xref="GeneID:887142" CDS complement(1173945..1174700) /locus_tag="Rv1051c" /function="UNKNOWN" /note="Rv1051c, (MTV017.04c), len: 251 aa. Conserved hypothetical protein, similar to LLU36837|U36837.1 protein encoded by Lactococcus lactis plasmid pNP40 (298 aa), FASTA scores: opt: 194, E(): 3.5e-06, (30.3% identity in 155 aa overlap). TBparse score is 0.912. Contains possible helix-turn-helix motif at aa 197-218 (Score 1097, +2.92 SD)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215567.1" /db_xref="GI:15608191" /db_xref="UniProtKB/TrEMBL:O53399" /db_xref="GeneID:887142" /translation="MRADVTAEHLTQVVRDIAVIDIDDGVAFNLDTSSVQEIRERADY PGLRVRVAMSVGPWQGIAAWDVSTGEPIAPWPTRVTIDRILGEPITLLGYAPETIIAE KGVTILERGITSTRWRDYVDIVQLDRRGIDDDELLRSARAVAQYRGATLEPVAPHLAG YGAVAQAKWATEHGRCQHCWRHWKPAHVGRRNMDLLDAKQVSEMIGVPVGTLRHWRHS DIGPASFTLGRRVVYRRDEVSRWISKRESATRR" gene 1175723..1176112 /locus_tag="Rv1052" /db_xref="GeneID:888441" CDS 1175723..1176112 /locus_tag="Rv1052" /function="UNKNOWN" /note="Rv1052, (MTV017.05), len: 129 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215568.1" /db_xref="GI:15608192" /db_xref="UniProtKB/TrEMBL:O53400" /db_xref="GeneID:888441" /translation="MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTA VQMPLTPGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTA VEGSLSLDELLSDHFVRDLHARMFGPV" gene complement(1176011..1176286) /locus_tag="Rv1053c" /db_xref="GeneID:887141" CDS complement(1176011..1176286) /locus_tag="Rv1053c" /function="UNKNOWN" /note="Rv1053c, (MTV017.06c), len: 91 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215569.1" /db_xref="GI:15608193" /db_xref="UniProtKB/TrEMBL:O53401" /db_xref="GeneID:887141" /translation="MDSHKVCMNNNTQLPTGPIIGVHPAVRDGVERVAYLDGDLLRCN TDVEFTSSPPPGPVLYRTKHTRVEIADEMVTEKLIKRQRAFNSRRHQ" gene 1176928..1177242 /locus_tag="Rv1054" /db_xref="GeneID:887139" CDS 1176928..1177242 /locus_tag="Rv1054" /function="USE FOR SEQUENCE INTEGRATION. INTEGRASE IS NECESSARY FOR INTEGRATION OF A PHAGE INTO THE HOST GENOME BY SITE-SPECIFIC RECOMBINATION. IN CONJUNCTION WITH EXCISIONASE, INTEGRASE IS ALSO NECESSARY FOR EXCISION OF THE PROPHAGE FROM THE HOST GENOME (BY SIMILARITY)." /note="Rv1054, (MTV017.07), len: 104 aa. Probable integrase (fragment), similar to Rv2309c|MTCY3G12_25|Z79702 hypothetical protein (shows similarity to integrases) from Mycobacterium tuberculosis (151 aa), FASTA scores: opt: 273, E(): 8.8e-13, (64.7% identity in 68 aa overlap); and to L39071|MSGINT_1 integrase from Mycobacterium paratuberculosis (191 aa), FASTA scores: opt: 105, E(): 0.9, (31.8% identity in 85 aaoverlap). This ORF continues in another frame as Rv1055|MTV017.08 but no error can be found to account for frameshift. Length extended since first submission (+36 aa)." /codon_start=1 /transl_table=11 /product="integrase" /protein_id="NP_215570.2" /db_xref="GI:57116816" /db_xref="UniProtKB/TrEMBL:O53402" /db_xref="GeneID:887139" /translation="MTGKGIVESTTKTKRDRHVPVPEPVWRRLHAELPTDPNALVFPG RKGGFLPLGEYRWAFDNAGDQVGIEGWYRTVWGTPRPRWRSAQALTSRSCNGSLDTQQ RR" gene 1177239..1177373 /locus_tag="Rv1055" /db_xref="GeneID:887138" CDS 1177239..1177373 /locus_tag="Rv1055" /function="USE FOR SEQUENCE INTEGRATION. INTEGRASE IS NECESSARY FOR INTEGRATION OF A PHAGE INTO THE HOST GENOME BY SITE-SPECIFIC RECOMBINATION. IN CONJUNCTION WITH EXCISIONASE, INTEGRASE IS ALSO NECESSARY FOR EXCISION OF THE PROPHAGE FROM THE HOST GENOME (BY SIMILARITY)." /note="Rv1055, (MTV017.08), len: 44 aa. Possible integrase (fragment); first 49 aa similar to Rv2309c|MTCY3G12_25|Z79702 hypothetical protein (shows similarity to integrases) from Mycobacterium tuberculosis (151 aa), FASTA scores: opt: 291, E(): 2.2e-16, (74.3% identity in 70 aa overlap); and to L39071|MSGINT_1 integrase from Mycobacterium paratuberculosis (191 aa), FASTA scores: opt: 146, E(): 8.3e-05, (52.1% identity in 48 aa overlap); and to many other integrases or transposases. Shortened since first submission (-34 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215571.2" /db_xref="GI:57116817" /db_xref="UniProtKB/TrEMBL:O53403" /db_xref="GeneID:887138" /translation="MTLDRHGHLLNDDLAVWPMRCAKSSRTLRYHCGMRRRNRVGLRA" gene 1177396..1177469 /locus_tag="Rvnt17" /note="tRNA-Leu(TAA)" /db_xref="GeneID:2700427" tRNA 1177396..1177469 /locus_tag="Rvnt17" /product="tRNA-Leu" /note="codon recognized: UUA" /anticodon=(pos:1177430..1177432,aa:Leu) /db_xref="GeneID:2700427" gene 1177628..1178392 /locus_tag="Rv1056" /db_xref="GeneID:887147" CDS 1177628..1178392 /locus_tag="Rv1056" /function="UNKNOWN" /note="Rv1056, (MTV017.09), len: 254 aa. Conserved hypothetical protein, some similarity in C-terminal region of Rv0140|MTCI5.14|Z92770 Mycobacterium tuberculosis (126 aa), FASTA scores: opt: 254, E(): 1.2e-10, (43.4% identity in 106 aa overlap); and to Rv1670. C-terminal region is similar to AL035569|SC8D9.02 hypothetical protein from Streptomyces coelicolor (113 aa), FASTA scores: opt: 282, E(): 4.5e-12, (48.0% identity in 100 aa overlap). TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215572.1" /db_xref="GI:15608196" /db_xref="UniProtKB/TrEMBL:O53404" /db_xref="GeneID:887147" /translation="MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVP YYPQYYIPLADVRMEFLRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPV AGTVRFNWDPLRWFEEDEPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLL FETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHY PLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS" repeat_region 1179345..1179395 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 1179396..1180577 /locus_tag="Rv1057" /db_xref="GeneID:887135" CDS 1179396..1180577 /locus_tag="Rv1057" /function="UNKNOWN" /note="Rv1057, (MTV017.10), len: 393 aa. Conserved hypothetical protein, some similarity to X84710|MMSAG_1 surface antigen of Methanosarcina mazeii (491 aa), FASTA scores: opt: 363, E():6.2e-15, (31.3% identity in 294 aa overlap). TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215573.1" /db_xref="GI:15608197" /db_xref="UniProtKB/TrEMBL:O53405" /db_xref="GeneID:887135" /translation="MSVMNGREVARESRDAQVFEFGTAPGSAVVKIPVQGGPIGGIAI SRDGSLLVVTNNGTDTVSVVGTDTCRVTQTVTSVNEPFAIAMGNAEANRAYVSTVSSA YDAIAVIDVATNTVLGTHPLALSVSDLTLSPDDKYLYVSRNGTRGADVAVLDTTTGAL IDVVDVSQAPGTTTQCVRMSPDGSVLYVGANGPSGGLLVVITTRAQSDGGRIGSRSRS RQKSSKPRGNQAAAGLRVVATIDIGSSVRDVALSPDGAIAYVASCGSDFGAVVDVIDT RTHQITSSRAISEIGGLVTRVSVSGDADRAYLVSEDRVTVLCTRTHDVIGTIRTGQPS CVVESPDGKYLYIADYSGTITRTAVASTIVSGTEQLALQRRGSMQWFSPELQQYAPAL A" gene 1180684..1182315 /gene="fadD14" /locus_tag="Rv1058" /db_xref="GeneID:887133" CDS 1180684..1182315 /gene="fadD14" /locus_tag="Rv1058" /EC_number="2.3.1.86" /function="INVOLVED IN THE FATTY ACID BETA OXIDATION PATHWAY (DEGRADATION)." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--CoA ligase" /protein_id="NP_215574.1" /db_xref="GI:15608198" /db_xref="GOA:O53406" /db_xref="UniProtKB/TrEMBL:O53406" /db_xref="GeneID:887133" /translation="MYGTMQDFPLTITAIMRHGCGVHGRRTVTTATGEGYRHSSYRDV GQRAGQLANALRRLGVTGDQRVATFMWNNTEHLVTYFAVPSMGAVLHTLNIRLFPEQI AYVTNEAEDRVILVDLSLARLLAPVLPKLDTVHTVIAVGEGDTTPLREAGKTVLRFAE LIDAESPDFGWPQIDENSAAAMCYTSGTTGNPKGVVYSHRSSFLHTMAACTTNGIGVG SSDKVLPIVPMFHANGWGLPYAALMAGADLVLPDRHLDARSLIHMVETLKPTLAGAVP TIWNDVMHYLEKDPDHDMSSLRLVACGGSAVPESLMRTFEDKHDVQIRQLWGMTETSP LATMAWPPPGTPDDQHWAFRITQGQPVCGVETRIVDDDGQVLPNDGNAVGEVEVRGPW IAGSYYGGRDESKFDSGWLRTGDVGRIDEQGFITLTDRAKDVIKSGGEWISSVELENC LIAHPDVLEAAVVGVPDERWQERPLAVVVVREGATVSAGDLRAFLADKVVRWWLPERW AFVDEIPRTSVGKYDKKAIRSRYAEGAYQITEVHT" misc_feature 1181227..1181262 /gene="fadD14" /locus_tag="Rv1058" /note="PS00455 Putative AMP-binding domain signature" gene 1182391..1183455 /locus_tag="Rv1059" /db_xref="GeneID:886015" CDS 1182391..1183455 /locus_tag="Rv1059" /function="UNKNOWN" /note="Rv1059, (MTV017.12), len: 354 aa. Conserved hypothetical protein, similar to Rv0926c|MTCY21C12.20c hypothetical protein from Mycobacterium tuberculosis (358 aa), FASTA scores: opt: 338, E(): 1.4e-14, (33.1% identity in 363 aa overlap). TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215575.1" /db_xref="GI:15608199" /db_xref="UniProtKB/TrEMBL:O53407" /db_xref="GeneID:886015" /translation="MTMSLRVIQWATGSVGVAAIKGVLQHPELELVGCWVHSAAKSGK DVGEIIGSPPLGVIATNSIDDVLALDADAVIYAPLLPSVDEVAALLRSGKNVVTPLGW FYPSEKEAAPLEVAAQAGNATLHGAGIGPGAVTELFPLLLSVMSTGVTFVRSEEFSDL RSYGAPDVLRYVMGFGGTPDSALTGPMQKILDGGFLQSVRLCVDRLGFAADPQIRTSQ EVAVATAPIDSPIGVIEPGQVAGRRFHWEALVEDTVVVQIAVNWLMGSENLDPPWSFG PAGERYEIEVRGSPDTCVTIKGWQPQTVAAGLKSNPGIVATAAHCVNAIPATCAAPAG IQSFFDLPLITGRAAPGLAR" gene 1183508..1183981 /locus_tag="Rv1060" /db_xref="GeneID:887130" CDS 1183508..1183981 /locus_tag="Rv1060" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1060, (MTV017.13), len: 157 aa. Hypothetical unknown protein. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215576.1" /db_xref="GI:15608200" /db_xref="UniProtKB/TrEMBL:O53408" /db_xref="GeneID:887130" /translation="MAKSVVVEQSRAIPVQSEDAFGGTLAAALPVICSHWYGLIPPIK EVRDQTGAWDSVGQARVITMVGGGRVREELTSVDPPRSFGYTLTDIKGPLAPLVALVE GKWSFAPADTGTTVTWQWTIHPRSALAAPVLPVFARMWRGYARGVLEKLSALLVG" gene 1184015..1184878 /locus_tag="Rv1061" /db_xref="GeneID:887136" CDS 1184015..1184878 /locus_tag="Rv1061" /function="UNKNOWN" /note="Rv1061, (MTV017.14), len: 287 aa. Conserved hypothetical protein, similar to hypothetical proteins from various bacteria e.g. D64002|SYCSLRD_75 Synechocystis sp. PCC6803 (304 aa),FASTA scores: opt: 245, E():1.2e-09, (27.1% identity in 258 aa overlap). TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215577.1" /db_xref="GI:15608201" /db_xref="GOA:O53409" /db_xref="UniProtKB/TrEMBL:O53409" /db_xref="GeneID:887136" /translation="MCRLFGLHSGTDAVTATFWLLNASDSLAEQSRRNPDGTGLGVFD EHHQPRLHKQPIAAWQDADFATEAHELTGTTFVAHVRYATTGSLDIRNTHPFLQDGRI FAHNGVVEGLDVLDERLREVGADDLVLGQTDSERVFALITASIRARDGNESAGLIDAL RWLAANVPIYAVNVLLSTATDVWALRYPESHELYILDRRGDGAPEFHLRSKRIRAHST HLRERSSVVFATEPMDDNPRWRLLDAGELVHVDAALRVNRSLVLPDPPRHPIRREDLS EPVLHAQHTSA" gene 1184883..1185740 /locus_tag="Rv1062" /db_xref="GeneID:887129" CDS 1184883..1185740 /locus_tag="Rv1062" /function="UNKNOWN" /note="Rv1062, (MTV017.15), len: 285 aa. Conserved hypothetical protein, some similarity to AL079356|SC6G9_10 hypothetical protein in Streptomyces coelicolor (289 aa), FASTA scores: opt: 556, E(): 1.2e-27, (39.0% identity in 287 aa overlap), and Z99111|BSUB0008_176 Bacillus subtilis (260aa), FASTA scores: opt: 163, E(): 0.0013, (27.4% identity in 179aa overlap). TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215578.1" /db_xref="GI:15608202" /db_xref="GOA:O53410" /db_xref="UniProtKB/TrEMBL:O53410" /db_xref="GeneID:887129" /translation="MTTRRALVLAGGGLAGIAWETGVLRGIADESPAAARLLLDSDVL VGTSAGATVAAQISSGCPLDTLYERQLAETSAEIDPGVDIDAITDLFLTAVTEPHIST RRRLQRIGAVALAVDTVPESVRRQVIAQRLPSHDWPDRVLRVTAIDIATGELVVFHRE SNVALVDAVAASCSVPGAWPPVTIAGRRYMDGGVASSVNLGVADDCDAAVVLVPAGAD APSPFGGGAAAEIAAATGMVFAVFADDDSLAAFGPNPLDPLCRVNSAMAGRQQGRREA QAVARLLGV" gene complement(1185741..1186823) /locus_tag="Rv1063c" /db_xref="GeneID:887128" CDS complement(1185741..1186823) /locus_tag="Rv1063c" /function="UNKNOWN" /note="Rv1063c, (MTV017.16c), len: 360 aa. Conserved hypothetical protein, similar to P37053|YCHK_ECOLI hypothetical protein from Escherichia coli (314 aa), FASTA scores: opt: 487, E(): 7.2e-23, (32.7% identity in 321 aa overlap). Also partially similar to Rv3239c|MTCY20B11.14c. TBparse score is 0.893. BELONGS TO THE UPF0028 (SWS) FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215579.1" /db_xref="GI:15608203" /db_xref="GOA:P67098" /db_xref="UniProtKB/Swiss-Prot:P67098" /db_xref="GeneID:887128" /translation="MPAPAALRVRGSSSPRVALALGSGGARGYAHIGVIQALRERGYD IVGIAGSSMGAVVGGVHAAGRLDEFAHWAKSLTQRTILRLLDPSISAAGILRAEKILD AVRDIVGPVAIEQLPIPYTAVATDLLAGKSVWFQRGPLDAAIRASIAIPGVIAPHEVD GRLLADGGILDPLPMAPIAGVNADLTIAVSLNGSEAGPARDAEPNVTAEWLNRMVRST SALFDVSAARSLLDRPTARAVLSRFGAAAAESDSWSQAPEIEQRPAGPPADREEAADT PGLPKMGSFEVMNRTIDIAQSALARHTLAGYPADLLIEVPRSTCRSLEFHRAVEVIAV GRALATQALEAFEIDDDESAAATIEG" gene complement(1186904..1187323) /gene="lpqV" /locus_tag="Rv1064c" /db_xref="GeneID:887126" CDS complement(1186904..1187323) /gene="lpqV" /locus_tag="Rv1064c" /function="UNKNOWN" /note="Rv1064c, (MTV017.17c), len: 139 aa. Possible lipoprotein LpqV. Has N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="lipoprotein LpqV" /protein_id="NP_215580.1" /db_xref="GI:15608204" /db_xref="GOA:P65310" /db_xref="UniProtKB/Swiss-Prot:P65310" /db_xref="GeneID:887126" /translation="MRPSRYAPLLCAMVLALAWLSAVAGCSRGGSSKAGRSSSVAGTL PAGVVGVSPAGVTTRVDAPAESTEEEYYQACHAARLWMDAQPGSGESLIEPYLAVVQA SPSGVAGSWHIRWAALTPARQAAVIVAARAAANAECG" misc_feature complement(1187246..1187278) /gene="lpqV" /locus_tag="Rv1064c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1187435..1188001 /locus_tag="Rv1065" /db_xref="GeneID:887125" CDS 1187435..1188001 /locus_tag="Rv1065" /function="UNKNOWN" /note="Rv1065, (MTV017.18), len: 188 aa. Conserved hypothetical protein, some similarity to AL0209|SC4H8_11 hypothetical protein from Streptomyces coelicolor (182 aa), FASTA scores: opt: 156, E(): 0.0011, (31.3% identity in 195 aa overlap). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215581.1" /db_xref="GI:15608205" /db_xref="UniProtKB/TrEMBL:O53413" /db_xref="GeneID:887125" /translation="MVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHL LPDGGVPQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNEYR WDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTLSVHAYSPPLT AMSYYEITERNTLRRQRTELTDQPEGSG" gene 1187998..1188393 /locus_tag="Rv1066" /db_xref="GeneID:887127" CDS 1187998..1188393 /locus_tag="Rv1066" /function="UNKNOWN" /note="Rv1066, (MTV017.19), len: 131 aa. Conserved hypothetical protein, strong similarity to AL0209|SC4H8.10 hypothetical protein from Streptomyces coelicolor (132 aa), FASTA scores: opt: 429, E(): 5.2e-23, (57.1% identity in 119 aa overlap). TBparse score is 0.859." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215582.1" /db_xref="GI:15608206" /db_xref="UniProtKB/TrEMBL:O53414" /db_xref="GeneID:887127" /translation="MSRIDRVLEAARRRYRRLAADQVPEAARRGAVLVDIRPQAQRAR EGEVPGALVIERNVLEWRCDPTSDARLPQAVDDDVEWVILCSEGYTSSLAAASLLDLG LHRATDVVGGYRALAAGGVLAELGGAVGG" gene complement(1188421..1190424) /gene="PE_PGRS19" /locus_tag="Rv1067c" /db_xref="GeneID:887122" CDS complement(1188421..1190424) /gene="PE_PGRS19" /locus_tag="Rv1067c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1067c, (MTV017.20c), len: 667 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002). Similar to Rv3388|MTV004.46 M. tuberculosis (731 aa), FASTA scores: opt: 2227, E(): 0, (55.6% identity in 710 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1, probably fortuitous. TBparse score is 0.837." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177780.1" /db_xref="GI:57116818" /db_xref="UniProtKB/TrEMBL:Q79FT3" /db_xref="GeneID:887122" /translation="MSFVLVSPSQLMAAAADVAGIGSAISAANAAALAPTSVLAAAGA DEVSAAVAALFSAHAGQYQQLGARAALFHEQFVQALTGAASAYASAEATNVEQQVLGL INAPTQALLGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGLTGGTGGSAGL IGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPAGAIGAPGVAGGAGGAGGT AGLFGNGGVGGVGGDGGQGGNGAGAGASGTKGGDAGAGGAGGAGGWIHGHGGAGGDGG AGGAGGQASPGAPGPPSQPGGAGGAGGAGGRGGDGGSAGWLSGNGGDAGNGGGGGTAG GAGNGGQFGGDGGTGGTGGTAGAGGNGGRGAVLFGHGGNAGHGGAGGNGAAAGAGGEH VVATAGKGGTGGVGGDGGGGGAGGGGGLLYGNGGAGGAGNSGGDGGTGLNAALGGNGG GGGVGGNAGAGGTGGSAGWLSGNGGAGGSGGSAGAGGAGGKGGDTPNGLAINPGIGGN GGDTGNAGNGGNGGSAARLFGGGGAGGAGGTGSTAGSGGSGGTNPPTGLQAAGGNGGS GHAGGHGGNGGGAGLLGGGGTGGNGGGGGQGGLGAAAGGVDGNGGNGGNGGKGGDAQL VGDGGNGGNGGKGGAGLIAGLDGAGGAGGTRGLIFGNAGTPGQ" misc_feature complement(1189042..1189116) /gene="PE_PGRS19" /locus_tag="Rv1067c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(1190757..1192148) /gene="PE_PGRS20" /locus_tag="Rv1068c" /db_xref="GeneID:887123" CDS complement(1190757..1192148) /gene="PE_PGRS20" /locus_tag="Rv1068c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1068c, (MTV017.21c), len: 463 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002). Similar to AL021897|MTV017_19 Mycobacterium tuberculosis H37Rv (667 aa), FASTA scores: opt: 1875, E(): 0, (55.0% identity in 667 aa overlap). TBparse score is 0.849." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177781.1" /db_xref="GI:57116819" /db_xref="UniProtKB/Swiss-Prot:O53416" /db_xref="GeneID:887123" /translation="MSYMIAVPDMLSSAAGDLASIGSSINASTRAAAAATTRLLPAAA DEVSAHIAALFSGHGEGYQAIARQMAAFHDQFTLALTSSAGAYASAEATNVEQQVLGL INAPTQALLGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGLTGGTGGSAGL IGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPAGAIGAPGVAGGAGGAGGT AGLFGNGGAGGAGGAGGAGGRGGDGGSAGWLSGNGGDAGTGGGGGNAGNGGNGGSAGW LSGNGGTGGGGGTAGAGGQGGNGNSGIDPGNGGQGADTGNAGNGGHGGSAAKLFGDGG AGGAGGMGSTGGTGGGGGFGGGTGGNGGNGHAGGAGGSGGTAGLLGSGGSGGTGGDGG NGGLGAGSGAKGNGGNGGDGGKGGDAQLIGNGGNGGNGGKGGTGLMPGINGTGGAGGS RGQISGNPGTPGQ" gene complement(1192510..1194273) /locus_tag="Rv1069c" /db_xref="GeneID:887120" CDS complement(1192510..1194273) /locus_tag="Rv1069c" /function="UNKNOWN" /note="Rv1069c, (MTV017.22c), len: 587 aa. Conserved hypothetical protein, hydrophobic regions in N-terminal domain. Similar in part to O07136|B1306.04C B1306.04c protein from Mycobacterium leprae (89 aa), FASTA scores: opt: 229, E(): 1.3e-07, (54.2% identity in 72 aa overlap). TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215585.1" /db_xref="GI:15608209" /db_xref="UniProtKB/TrEMBL:O53417" /db_xref="GeneID:887120" /translation="MTEPAAATTTNASDEPATGAEQAVDTAATPQTPEPQPIRSTWWI RHYTFTGTAMGLVFVWFSMTPSLLPRGPLFQGLVSGICGAFGYGLGVFAVWLVRYMRS HNSSPPPPRWAWPPLIAVGAVGMVGMAVQFHVWQDDVRDLMGVEHLRWYDYPLAAALS LVVLFTLVEIGQFIRWLFRFLVGQVDRIAPFRVSAAIVVVLLVVLTITLLNGVVLKFA MNSMNSTFAAVNNEMNPDSAPPKTPLRSGGPGSLVSWESLGHQGRIFVHSGPTIADLT AFNGTPAVEPIRTYAGLNSADGIMATAELAARELARTGGLRRAVVAVATSTGTGWINE AEASALEYMYNGDTAIVSMQYSFLPSWLSFLVDKENARHAGEALFEAVDKLIRQLPES QRPKLVVFGESLGSFGGEAPFMNLNNILARTDGALFSGPTFNNTVWNSLTANRDAGSP QWLPIYDDGRNVRFVARARDLQRPDAPWGRPRVVYLQHASDPIAWWTPRLLFREPDWL REQRGYDVLPQTRWIPVVTFVQVSADMAVATHVPDGHGHRYVATVADGWAAVLSPPGW TQQKTERLQPLLHANAKPFGS" gene complement(1194270..1195043) /gene="echA8" /locus_tag="Rv1070c" /db_xref="GeneID:887117" CDS complement(1194270..1195043) /gene="echA8" /locus_tag="Rv1070c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215586.1" /db_xref="GI:15608210" /db_xref="GOA:P64016" /db_xref="UniProtKB/Swiss-Prot:P64016" /db_xref="GeneID:887117" /translation="MTYETILVERDQRVGIITLNRPQALNALNSQVMNEVTSAATELD DDPDIGAIIITGSAKAFAAGADIKEMADLTFADAFTADFFATWGKLAAVRTPTIAAVA GYALGGGCELAMMCDVLIAADTAKFGQPEIKLGVLPGMGGSQRLTRAIGKAKAMDLIL TGRTMDAAEAERSGLVSRVVPADDLLTEARATATTISQMSASAARMAKEAVNRAFESS LSEGLLYERRLFHSAFATEDQSEGMAAFIEKRAPQFTHR" misc_feature complement(1194690..1194752) /gene="echA8" /locus_tag="Rv1070c" /note="PS00166 Enoyl-CoA hydratase/isomerase signature." gene complement(1195055..1196092) /gene="echA9" /locus_tag="Rv1071c" /db_xref="GeneID:887116" CDS complement(1195055..1196092) /gene="echA9" /locus_tag="Rv1071c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="catalyzes the formation of 3-hydroxy-2-methylpropanoate from 3-hydroxy-2-methylpropanoyl-CoA" /codon_start=1 /transl_table=11 /product="3-hydroxyisobutyryl-CoA hydrolase" /protein_id="NP_215587.1" /db_xref="GI:15608211" /db_xref="GOA:O53419" /db_xref="UniProtKB/TrEMBL:O53419" /db_xref="GeneID:887116" /translation="MTGESHEVLTNVEGGVGFVTLNRPKAINSLNQTMVDLLATVLMS WEHEDAVHAVVLSGAGERGLCAGGDVVAVYHSARKDGVEARRFWRHEYLLNALIGRFA KPYVALMDGIVMGGGVGVSAHANTRVVTDTSKVAMPEVGIGFIPDVGGVYLLSRAPGA LGLHAALTGAPFSGADAIALGFADHFVPHGDLDAFTQKIVTGGVESALAAHAVEPPPS TLAAQRDWIDECYAGDSVADIVAALRKQGGEPAVNASDLIASRSPIALSVTLQAVRRA AKLDTLEDVLIQDYRVSSASLRSHDLVEGIRAQLIDKDRNPNWSPATLDAITAADIEA YFEPVDDDLSF" gene 1196279..1197115 /locus_tag="Rv1072" /db_xref="GeneID:887114" CDS 1196279..1197115 /locus_tag="Rv1072" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1072, (MTV017.25), len: 278 aa. Probable conserved transmembrane protein, equivalent to O07139|B1306.07|Y13803 Protein B1306.07 from Mycobacterium leprae (220 aa), FASTA scores: opt:1032, E(): 0, (75.0% identity in 220 aa overlap); and at the C-terminal end to Q50056|U1740D Mycobacterium leprae (96 aa), FASTA scores: opt: 381, E(): 1.2e-18, (71.6% identity in 81 aa overlap). Similar to Q54192|M80628|STMBLDA_1 TRANSFER RNA-LEU (BLDA) GENE AND ORF from Streptomyces griseus (293 aa), FASTA scores: opt:558, E(): 4.7e-30, (41.5% identity in 299 aa overlap). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_215588.1" /db_xref="GI:15608212" /db_xref="UniProtKB/TrEMBL:O53420" /db_xref="GeneID:887114" /translation="MRETSNPVFRSLPKQRGGYAQFGTGTAQQGFPADPYLAPYREAK ATRPLTIDDVVTKTGLTLAMLAGTAVVSYFLVASNVALAMPLTLVGALGGLALVLVAT FGRKQDNPAIVLSYAALEGLFLGAISFVLANFTVASANAGVLIGEAILGTMGVFFGML VVYKTGAIRVTPKFTRMVVAALFGVLVLMLGNLVLAMFNVGGGEGLGLRSPGPLGIIF SLVCIGIAAFSFLIDFDAADQMIRAGAPEKAAWGVALGLTVTLVWLYIEILRLLSYLQ NE" gene 1197231..1198082 /locus_tag="Rv1073" /db_xref="GeneID:887132" CDS 1197231..1198082 /locus_tag="Rv1073" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1073, (MTV017.26), len: 283 aa. Conserved hypothetical protein, similar to several hypothetical mycobacterial proteins e.g. Rv1482c|Z79701|MTCY277.03 Mycobacterium tuberculosis (339 aa), FASTA scores: opt: 810, E(): 0, (47.4% identity in 272 aa overlap); Rv3555c|Z92774|MTCY6G11_2 Mycobacterium tuberculosis (289 aa), FASTA scores: opt: 704, E(): 0, (44.4% identity in 259 aa overlap); and Rv3517, etc., and GIR10|AF002133_10 Mycobacterium avium strain GIR10 (346 aa), FASTA scores: opt: 802, E(): 0, (48.1% identity in 270 aa overlap). TBparse score is 0.942." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215589.1" /db_xref="GI:15608213" /db_xref="UniProtKB/TrEMBL:O53421" /db_xref="GeneID:887132" /translation="MGAQPFIGSEALAAGLISWHELGKYYTAIMPNVYLDKRLKPSLR QRVIAAWLWSGRKGVIAGASASALHGAKWVDDHALVELIWRNARAPNGVRTKDELLLD GEVQRLCGLTVTTVERTAFDLGRRPPLGQAITRLDALANATDFKINDVRELARKHPHT RGLRQLDKALDLVDPGAQSPKETWLRLLLINAGFPRPSTQIPLLGVYGHPKYFLDMGW EDIMLAVEYDGEQHRLSRDQFVKDVERLEYIRRAGWTHIRVLADHKGPDVVRRVRQAW DTLTSRR" gene complement(1198156..1199373) /gene="fadA3" /locus_tag="Rv1074c" /db_xref="GeneID:887113" CDS complement(1198156..1199373) /gene="fadA3" /locus_tag="Rv1074c" /EC_number="2.3.1.9" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN LIPID DEGRADATION (BETA OXYDATION)." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_215590.1" /db_xref="GI:15608214" /db_xref="UniProtKB/TrEMBL:O53422" /db_xref="GeneID:887113" /translation="MPEAVIVSTARSPIGRAMKGSLVGMRPDDLAVQMVRAALDKVPA LNPHQIDDLMMGCGLPGGESGFNIARVVAVALGYDFLPGTTVNRYCSSSLQTTRMAFH AIKAGEGDAFISAGVETVSRFAKGNSDSWPDTKNPLFDGAQERSAAAAAGADEWHDPR TDQKLPDIYIAMGQTAENVAIMTGISREEQDRWGVRSQNRAEEAIKNGFFEREITPVT LPDGTTVSTDDGPRPGTTYEKVSELKPAFRPNGTVTAGNACPLNDGAAAVVITSDTKA KELGLTPLARIVSTGVSGLSPEIMGLGPIEASKKALERAGMAITDIDLVEINEAFAVQ VLGSARELGIDEDKLNISGGAIALGHPFGMTGARITTTLLNNLQTYDKTFGLETMCVG GGQGMAMVIERLA" misc_feature complement(1198276..1198326) /gene="fadA3" /locus_tag="Rv1074c" /note="PS00737 Thiolases signature 2." misc_feature complement(1199254..1199316) /gene="fadA3" /locus_tag="Rv1074c" /note="PS00445 FGGY family of carbohydrate kinases signature 2" gene complement(1199426..1200370) /locus_tag="Rv1075c" /db_xref="GeneID:887110" CDS complement(1199426..1200370) /locus_tag="Rv1075c" /function="UNKNOWN" /note="Rv1075c, (MTV017.28c), len: 314 aa. Possibly exported protein, as it contains a N-terminal signal sequence, hydrophobic domain from aa 7-25. Similar to U15183|MLU15183_2 Mycobacterium leprae cosmid B1740 (106 aa), FASTA scores: opt: 207, E(): 1.6e-06, (42.6% identity in 101 aa overlap). Also weak similarity to many glyceraldehyde-3-phosphate dehydrogenases e.g. Q41595|G3PC_TAXBA Taxus baccata (340 aa), FASTA scores: opt: 147, E(): 0.027, (27.5% identity in 189 aa overlap). TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215591.1" /db_xref="GI:15608215" /db_xref="GOA:O53423" /db_xref="UniProtKB/TrEMBL:O53423" /db_xref="GeneID:887110" /translation="MPRRSTIALATAGALASTGTAYLGARNLLVGQATHARTVIPKSF DAPPRADGVYTRGGGPVQRWRREVPFDVHLMIFGDSTATGYGCASAEEVPGVLIARGL AEQTGKRIRLSTKAIVGATSKGVCGQVDAMFVVGPPPDAAVIMIGANDITALNGIGPS AQRLADCVRRLRTRGAVVVVGTCPDLGVITAIPQPLRALAHTRGVRLARAQTAAVKAA GGVPVPLGHLLAPKFRAMPELMFSADRYHPSAPAYALAADLLFLALRDALTEKLDIPI HETPSRPGTATLEPGHTRHSMMSRLRRPRPARAVPTGG" gene 1200767..1201660 /gene="lipU" /locus_tag="Rv1076" /db_xref="GeneID:887112" CDS 1200767..1201660 /gene="lipU" /locus_tag="Rv1076" /EC_number="3.1.-.-" /function="HYDROLYSES LIPIDS" /note="Rv1076, (MTV017.29), len: 297 aa. Possible lipU, lipase (EC 3.1.-.-), very similar to several Mycobacterium tuberculosis proteins e.g. Z95390|Rv3487c|MTCY13E12.41c (277 aa), FASTA scores: opt: 1225, E(): 0, (76.0% identity in 246 aa overlap); Rv1426c, etc. Also similar to esterases and lipases of around 300 aa e.g. Q44087 ESTERASE PRECURSOR from Acinetobacter lwoffii esterase (303), FASTA scores: opt: 427, E(): 1.9e-21, (32.5% identity in 280 aa overlap). Equivalent to AL035159|MLCB1450 _7 Mycobacterium leprae (335 aa), FASTA scores: opt: 1588, E(): 0, (79.7% identity in 296 aa overlap). TBparse score is 0.935." /codon_start=1 /transl_table=11 /product="lipase LipU" /protein_id="NP_215592.1" /db_xref="GI:15608216" /db_xref="GOA:O53424" /db_xref="UniProtKB/TrEMBL:O53424" /db_xref="GeneID:887112" /translation="MAVRPVLAVGSYLPHAPWPWGVIDQAARVLLPASTTVRAAVSLP NASAQLVRASGVLPADGTRRAVLYLHGGAFLTCGANSHGRLVELLSKFADSPVLVVDY RLIPKHSIGMALDDCHDGYRWLRLLGYEPEQIVLAGDSAGGYLALALAQRLQEVGEEP AALVAISPLLQLAKEHKQAHPNIKTDAMFPARAFDALDALVASAAARNQVDGEPEELY EPLEHITPGLPRTLIHVSGSEVLLHDAQLAAAKLAAAGVPAEVRVWPGQVHDFQVAAS MLPEAIRSLRQIGEYIREATG" gene 1201717..1203111 /gene="cbs" /locus_tag="Rv1077" /db_xref="GeneID:887108" CDS 1201717..1203111 /gene="cbs" /locus_tag="Rv1077" /EC_number="4.2.1.22" /function="THOUGHT TO BE INVOLVED IN HOMOCYSTEINE TRANSULFURATION [CATALYTIC ACTIVITY: L-serine + L-homocysteine = cystathionine + H2O]" /experiment="experimental evidence, no additional details recorded" /note="Rv1077, (MTV017.30), len: 464 aa. Probable cbs (previously cysM2), cystathionine beta-synthase (EC 4.2.1.22), similar throughout its length to many eukaryotic cystathionine beta-synthases e.g. P32232|CBS_RAT CYSTATHIONINE BETA-SYNTHASE (560 aa), FASTA scores: opt: 951, E(): 0, (40.2% identity in 450 aa overlap); also similar in N-terminal domain (aa 1 - 330) to Rv2334|MTCY98.03 CysK Mycobacterium tuberculosis (310 aa), FASTA scores: opt: 855, E(): 0, (46.8% identity in 314 overlap); and other cysteine synthase proteins e.g. Rv1336, Rv0848, etc. Contains PS00217 Sugar transport proteins signature 2 probably spurious. TBparse score is 0.891. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY.; cysM2" /codon_start=1 /transl_table=11 /product="cystathionine beta-synthase CBS (Serine sulfhydrase) (Beta-thionase) (hemoprotein H-450)" /protein_id="YP_177782.1" /db_xref="GI:57116820" /db_xref="GOA:Q7D8W0" /db_xref="UniProtKB/TrEMBL:Q7D8W0" /db_xref="GeneID:887108" /translation="MRIAQHISELIGGTPLVRLNSVVPDGAGTVAAKVEYLNPGGSSK DRIAVKMIEAAEASGQLKPGGTIVEPTSGNTGVGLALVAQRRGYKCVFVCPDKVSEDK RNVLIAYGAEVVVCPTAVPPHDPASYYSVSDRLVRDIDGAWKPDQYANPEGPASHYVT TGPEIWADTEGKVTHFVAGIGTGGTITGAGRYLKEVSGGRVRIVGADPEGSVYSGGAG RPYLVEGVGEDFWPAAYDPSVPDEIIAVSDSDSFDMTRRLAREEAMLVGGSCGMAVVA ALKVAEEAGPDALIVVLLPDGGRGYMSKIFNDAWMSSYGFLRSRLDGSTEQSTVGDVL RRKSGALPALVHTHPSETVRDAIGILREYGVSQMPVVGAEPPVMAGEVAGSVSERELL SAVFEGRAKLADAVSAHMSPPLRMIGAGELVSAAGKALRDWDALMVVEEGKPVGVITR YDLLGFLSEGAGRR" misc_feature 1202245..1202322 /gene="cbs" /locus_tag="Rv1077" /note="PS00217 Sugar transport proteins signature 2" gene 1203313..1204035 /gene="pra" /locus_tag="Rv1078" /db_xref="GeneID:887111" CDS 1203313..1204035 /gene="pra" /locus_tag="Rv1078" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1078, (MTV017.31), len: 240 aa. Probable pra, Proline-rich antigen homolog, equivalent to X65546|MLPRAG_1 proline rich antigen from Mycobacterium leprae (249 aa), FASTA scores: opt: 1162, E(): 3.3e-30, (64.8% identity in 253 aa overlap). Has potential hydrophobic domains. TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="proline-rich antigen" /protein_id="NP_215594.1" /db_xref="GI:15608218" /db_xref="UniProtKB/Swiss-Prot:O53426" /db_xref="GeneID:887111" /translation="MTEQPPPGGSYPPPPPPPGPSGGHEPPPAAPPGGSGYAPPPPPS SGSGYPPPPPPPGGGAYPPPPPSAGGYAPPPPGPAIRTMPTESYTPWITRVLAAFIDW APYVVLVGIGWVIMLVTQTSSCVTSISEYDVGQFCVSQPSMIGQLVQWLLSVGGLAYL VWNYGYRQGTIGSSIGKSVLKFKVVSETTGQPIGFGMSVVRQLAHFIDAIICFVGFLF PLWDAKRQTLADKIMTTVCVPI" gene 1204067..1205233 /gene="metB" /locus_tag="Rv1079" /db_xref="GeneID:887103" CDS 1204067..1205233 /gene="metB" /locus_tag="Rv1079" /EC_number="2.5.1.48" /function="INVOLVED IN METHIONINE BIOSYNTHESIS: CONVERTS O-SUCCINYL-L-HOMOSERINE TO CYSTATHIONINE [CATALYTIC ACTIVITY : O-SUCCINYL-L-HOMOSERINE + L-CYSTEINE = CYSTATHIONINE + SUCCINATE (CAN ALSO USE HYDROGEN SULFIDE AND METHANETHIOL AS SUBSTRATES)]." /note="catalyzes the formation of cystathionine from L-cysteine and O-succinyl-L-homoserine" /codon_start=1 /transl_table=11 /product="cystathionine gamma-synthase" /protein_id="NP_215595.1" /db_xref="GI:15608219" /db_xref="GOA:P66875" /db_xref="UniProtKB/Swiss-Prot:P66875" /db_xref="GeneID:887103" /translation="MSEDRTGHQGISGPATRAIHAGYRPDPATGAVNVPIYASSTFAQ DGVGGLRGGFEYARTGNPTRAALEASLAAVEEGAFARAFSSGMAATDCALRAMLRPGD HVVIPDDAYGGTFRLIDKVFTRWDVQYTPVRLADLDAVGAAITPRTRLIWVETPTNPL LSIADITAIAELGTDRSAKVLVDNTFASPALQQPLRLGADVVLHSTTKYIGGHSDVVG GALVTNDEELDEEFAFLQNGAGAVPGPFDAYLTMRGLKTLVLRMQRHSENACAVAEFL ADHPSVSSVLYPGLPSHPGHEIAARQMRGFGGMVSVRMRAGRRAAQDLCAKTRVFILA ESLGGVESLIEHPSAMTHASTAGSQLEVPDDLVRLSVGIEDIADLLGDLEQALG" misc_feature 1204664..1204708 /gene="metB" /locus_tag="Rv1079" /note="PS00868 Cys/Met metabolism enzymes pyridoxal-phosphate attachment site" gene complement(1205304..1205798) /gene="greA" /locus_tag="Rv1080c" /db_xref="GeneID:887115" CDS complement(1205304..1205798) /gene="greA" /locus_tag="Rv1080c" /function="NECESSARY FOR EFFICIENT RNA POLYMERASE TRANSCRIPTION ELONGATION PAST TEMPLATE-ENCODED ARRESTING SITES. THE ARRESTING SITES IN DNA HAVE THE PROPERTY OF TRAPPING A CERTAIN FRACTION OF ELONGATING RNA POLYMERASES THAT PASS THROUGH, RESULTING IN LOCKED TERNARY COMPLEXES. CLEAVAGE OF THE NASCENT TRANCRIPT BY CLEAVAGE FACTORS SUCH AS GREA OR GREB ALLOWS THE RESUMPTION OF ELONGATION FROM THE NEW 3'TERMINUS. GREA RELEASES SEQUENCES OF 2 TO 3 NUCLEOTIDES" /experiment="experimental evidence, no additional details recorded" /note="necessary for efficient RNA polymerase transcription elongation past template-encoded arresting sites; arresting sites in DNA have the property of trapping a certain fraction of elongating RNA polymerases that pass through, resulting in locked ternary complexes. Cleavage of the nascent transcript by cleavage factors such as GreA or GreB allows the resumption of elongation from the new 3'terminus" /codon_start=1 /transl_table=11 /product="transcription elongation factor GreA" /protein_id="NP_215596.1" /db_xref="GI:15608220" /db_xref="GOA:P64279" /db_xref="UniProtKB/Swiss-Prot:P64279" /db_xref="GeneID:887115" /translation="MTDTQVTWLTQESHDRLKAELDQLIANRPVIAAEINDRREEGDL RENGGYHAAREEQGQQEARIRQLQDLLSNAKVGEAPKQSGVALPGSVVKVYYNGDKSD SETFLIATRQEGVSDGKLEVYSPNSPLGGALIDAKVGETRSYTVPNGSTVSVTLVSAE PYHS" misc_feature complement(1205379..1205429) /gene="greA" /locus_tag="Rv1080c" /note="PS00830 Prokaryotic transcription elongation factors signature 2" misc_feature complement(1205649..1205771) /gene="greA" /locus_tag="Rv1080c" /note="PS00829 Prokaryotic transcription elongation factors signature 1" gene complement(1205984..1206418) /locus_tag="Rv1081c" /db_xref="GeneID:887102" CDS complement(1205984..1206418) /locus_tag="Rv1081c" /function="UNKNOWN" /note="Rv1081c, (MTV017.34c), len: 144 aa. Probable conserved membrane protein, with hydrophobic stretch from aa 26 - 48, highly similar to NP_302548.1|NC_002677 conserved membrane protein from Mycobacterium leprae. TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215597.1" /db_xref="GI:15608221" /db_xref="UniProtKB/TrEMBL:O53429" /db_xref="GeneID:887102" /translation="MTHTPIPRPDARYGRPRLSRRARRRVAIALGVLVAAAGIVIAVI GYQRISTSAVTGSLVGYRLVDDETASVTISVTRSDPSRPVACIVRVRATNGSETGRRE LLVPPSEATTVQVTTTVKSSQPPVMADVYGCGTEVPSYLRLP" gene 1206520..1207386 /gene="mca" /locus_tag="Rv1082" /db_xref="GeneID:887101" CDS 1206520..1207386 /gene="mca" /locus_tag="Rv1082" /function="Mycothiol-dependent detoxification enzyme, involved in mycothiol biosynthesis." /experiment="experimental evidence, no additional details recorded" /note="Rv1082, (MTV017.35), len: 288 aa. mca, mycothiol conjugate amidase (see citation below), equivalent to NP_302547.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (290 aa), FASTA scores: opt: 1737, E(): 0, (86.4% identity in 287 aa overlap); and similar to Q54358|X79146 lmbE protein from Streptomyces lincolnensis (270 aa). Also similar to Rv1170|MTV005.06|MSHB GlcNAc-Ins deacetylase from Mycobacterium tuberculosis (303 aa), FASTA scores: opt: 411, E(): 9.4e-20, (35.8% identity in 299 aa overlap). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="Mycothiol conjugate amidase Mca (Mycothiol S-conjugate amidase)" /protein_id="NP_215598.1" /db_xref="GI:15608222" /db_xref="UniProtKB/TrEMBL:O53430" /db_xref="GeneID:887101" /translation="MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGER GEILNPAMDLPDVHGRIAEIRRDEMTKAAEILGVEHTWLGFVDSGLPKGDLPPPLPDD CFARVPLEVSTEALVRVVREFRPHVMTTYDENGGYPHPDHIRCHQVSVAAYEAAGDFC RFPDAGEPWTVSKLYYVHGFLRERMQMLQDEFARHGQRGPFEQWLAYWDPDHDFLTSR VTTRVECSKYFSQRDDALRAHATQIDPNAEFFAAPLAWQERLWPTEEFELARSRIPAR PPETELFAGIEP" gene 1207383..1207649 /locus_tag="Rv1083" /db_xref="GeneID:887100" CDS 1207383..1207649 /locus_tag="Rv1083" /function="UNKNOWN" /note="Rv1083, (MTV017.36), len: 88 aa. Conserved hypothetical protein, similar to U15183|MLU15183_9 hypothetical protein from Mycobacterium leprae (167 aa), FASTA scores: opt: 332, E(): 1.2e-13, (58.4% identity in 101 aa overlap). Hydrophobic domain aa 25-43. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215599.1" /db_xref="GI:15608223" /db_xref="UniProtKB/TrEMBL:O53431" /db_xref="GeneID:887100" /translation="MNQILLSVIAEGGPGNTGPDFGKASPVGLLVIVLLVIATLFLVR SMNQQLKKVPKSFDRDHPELDQAADEGTDRDGPARPPGPPHESG" gene 1207636..1209657 /locus_tag="Rv1084" /db_xref="GeneID:887098" CDS 1207636..1209657 /locus_tag="Rv1084" /function="UNKNOWN" /note="Rv1084, (MTV017.37), len: 673 aa. Conserved hypothetical protein, similar to P37512|YYAL_BACSU hypothetical protein from Bacillus subtilis (689 aa), FASTA scores: opt: 1063, E() : 0, (36.5% identity in 696 aa overlap); AE0009|AE000983_10 Archaeoglobus fulgidus section 1 (642 aa), FASTA scores: opt: 1018, E(): 0, (37.2% identity in 600 aa overlap). Also similar to AE001938|AE001938_9 Deinococcus radiodurans (690 aa), FASTA scores: opt: 1097, E(): 0, (41.6% identity in 694 aa overlap). TBparse score is 0.872." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215600.1" /db_xref="GI:15608224" /db_xref="UniProtKB/TrEMBL:O53432" /db_xref="GeneID:887098" /translation="MSPANPSGTNTLALATSPYLRQHADNPVHWQQWTPQALAEAAAR AVPILLSVGYAACHWCHVMAHESFDDDEVAAAMNAGFVCIKVDREERPDIDAVYMNAT VALTGQGGWPMTCFLTPNGRPFFCGTYYPKAAFLQLLSAISETWRERRAEVEQASDHI AAELRSMASGLPGGGPEVAPELCDDAVAGVLREQDTAHGGFGGAPKFPPSALLEALMR HYERTRSPAALEAVARTGNAMARGGIYDQLGGGFARYSVDGAWVVPHFEKMLYDNALL LRAYAHWARRTGDPLARRVAAQTARFLLDELGSKAPADMFTSSLDADADGREGSTYVW TPVQLTEVLGGDDGRWAAEVFGVTEAGTFEHGTSVLQLPADPDDAARLDRVRAALLVA RLARAQPARDDKVVTSWNGLAITALAEASVALDDPALAHAARRCATRLLDLHVVDGRL RRASLGGVVGDSAAILEDHAMLATGLLALYQLTSEGAWLTAATGLLDTAVAHFGDPQR PGRWFDTADDAERLMLRPSDPLDGATPSGASSIAEALLTAGHVVDGARAERYWQLAAD TLRAHAVLLARAPRSAGHWLAVAEAVVRGPLQIAVACDLPRSSLLADARRLAPGGAIV VGGAAGSSALLVGRDRVAGADAAYVCRGRVCDLPVTSAAELATALGVPG" gene complement(1209756..1210484) /locus_tag="Rv1085c" /db_xref="GeneID:887107" CDS complement(1209756..1210484) /locus_tag="Rv1085c" /function="NOT KNOWN, BUT SUPPOSED INVOLVED IN VIRULENCE" /note="Rv1085c, (MTV017.38c), len: 242 aa. Possible hemolysin-like protein, integral membrane protein, similar to many hemolysins, and hypothetical proteins e.g. U28375|ECU28375_49 Hypothetical protein from Escherichia coli (219 aa), FASTA scores: opt: 308, E(): 7.5e-15, (30.6% identity in 180 aa overlap); AE0011|HIAE001124_2 Hypothetical protein from Borrelia burgdorferi (233 aa), FASTA scores: opt: 305, E(): 1.3e-14, (25.6% identity in 203 aa overlap). Also weakly similar to HLY3_BACCE|P54176 haemolysin from Bacillus cereus (219 aa), FASTA scores: opt: 247, E(): 8.7e-12, (27.5% identity in 171 aa overlap). Also similar to AE002027|AE002027_8 probable hemolysin from Deinococcus radiodurans (219 aa), FASTA scores: opt: 354, E(): 1.8e-16, (31.1% identity in 219 aa overlap). TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="hemolysin-like protein" /protein_id="NP_215601.1" /db_xref="GI:15608225" /db_xref="GOA:P67157" /db_xref="UniProtKB/Swiss-Prot:P67157" /db_xref="GeneID:887107" /translation="MSGQADTATTAEARTPAHAAHHLVEGVARVLTKPRFRGWIHVYS AGTAVLAGASLVAVSWAVGSAKAGLTTLAYTAATITMFTVSATYHRVNWKSATARNWM KRADHSMIFVFIAGSYTPFALLALPAHDGRVVLSIVWGGAIAGILLKMCWPAAPRSVG VPLYLLLGWVAVWYTATILHNAGVTALVLLFVGGALYSIGGILYAVRWPDPWPTTFGY HEFFHACTAVAAICHYIAMWFVVF" gene 1210595..1211383 /locus_tag="Rv1086" /db_xref="GeneID:887097" CDS 1210595..1211383 /locus_tag="Rv1086" /EC_number="2.5.1.10" /function="CATALYZES THE FIRST COMMITTED STEP IN THE SYNTHESIS OF DECAPRENYL DIPHOSPHATE, A MOLECULE WHICH HAS A CENTRAL ROLE IN THE BIOSYNTHESIS OF MOST FEATURES OF THE MYCOBACTERIAL CELL WALL. ADDS ONE ISOPRENE UNIT TO OMEGA,E-GERANYL DIPHOSPHATE. THE PRODUCT, OMEGA,E, Z-FARNESYL DIPHOSPHATE, IS THE PUTATIVE SUBSTRATE OF Rv2361c PRODUCT [CATALYTIC ACTIVITY: Geranyl diphosphate + isopentenyl diphosphate = diphosphate + trans,trans-farnesyl diphosphate]." /experiment="experimental evidence, no additional details recorded" /note="Rv1086, (MTV017.39), len: 262 aa. Short (C15) chain Z-isoprenyl diphosphate synthase (EC 2.5.1.10) (see citations below), equivalent to NP_302598.1|NC_002677 possible undecaprenyl pyrophosphate synthetase from Mycobacterium leprae (262 aa), similar to many hypothetical proteins and several potential members of the upp synthase family e.g. NP_296167.1|NC_001263 undecaprenyl diphosphate synthase from Deinococcus radiodurans (339 aa); P20182|YT14_STRFR Hypothetical protein from Streptomyces fradiae (259 aa), FASTA scores: opt: 840, E(): 0, (51.0% identity in 259 aa overlap); and P38118|YARF_CORGL Hypothetical protein from Corynebacterium glutamicicum (234 aa), FASTA scores: opt: 729, E(): 0, (56.0% identity in 209 aa overlap); etc. Also similar to Rv2361c|MTCY27.19 (296 aa) (35.6% identity in 233 aa overlap). Contains PS01066 Uncharacterized protein family UPF0015 signature. SEEMS TO BELONG TO THE UPP SYNTHETASE FAMILY." /codon_start=1 /transl_table=11 /product="short (C15) chain Z-isoprenyl diphosphate synthase (Z-FPP synthase) (Z-farnesyl diphosphate synthase) (Z-FPP synthetase) (Z-farnesyl diphosphate synthetase) (geranyltranstransferase) (farnesyl pyrophosphate synthetase)" /protein_id="NP_215602.1" /db_xref="GI:15608226" /db_xref="GOA:O53434" /db_xref="UniProtKB/Swiss-Prot:O53434" /db_xref="GeneID:887097" /translation="MEIIPPRLKEPLYRLYELRLRQGLAASKSDLPRHIAVLCDGNRR WARSAGYDDVSYGYRMGAAKIAEMLRWCHEAGIELATVYLLSTENLQRDPDELAALIE IITDVVEEICAPANHWSVRTVGDLGLIGEEPARRLRGAVESTPEVASFHVNVAVGYGG RREIVDAVRALLSKELANGATAEELVDAVTVEGISENLYTSGQPDPDLVIRTSGEQRL SGFLLWQSAYSEMWFTEAHWPAFRHVDFLRALRDYSARHRSYGR" misc_feature 1211213..1211269 /locus_tag="Rv1086" /note="PS01066 Uncharacterized protein family UPF0015 signature" gene 1211560..1213863 /gene="PE_PGRS21" /locus_tag="Rv1087" /db_xref="GeneID:887094" CDS 1211560..1213863 /gene="PE_PGRS21" /locus_tag="Rv1087" /function="UNKNOWN" /note="Rv1087, (MTV017.40), len: 767 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Similar to Rv1090|AL021897|MTV017_43 Mycobacterium tuberculosis H37Rv (853 aa), FASTA scores: opt: 2819, E(): 0, (59.8% identity in 860 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1 near C -terminus. TBparse score is 0.859." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177783.1" /db_xref="GI:57116821" /db_xref="UniProtKB/TrEMBL:Q79FT0" /db_xref="GeneID:887094" /translation="MSFVVVAPEVLAAAASDLAGIGSTLAQANAAALAPTTAVLAAGA DEVSAAIASLFGAHGQAYQAVSAQMSAFHAQFMQALTGAGGAYAAAEAVNVSAAQSVE QDLLAAINARFERIFGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTSTTVGMAGGNG GAAGLIGNGGFGGGGGPGAAGGNGGAGGWLFGNGGAGGAGGLGVAPGVPGGAGGAGGA GGVGGPAGLWGHGGAGGAGGAGVAGAGGFEGTIGAGGAGGVGGAGGVGGAGGAGGWLY GDAGAGGDGGVGGAGGTGGLGNRGGAGGAGGAGGVGGAGGAAGLWGGGGAGGVGGTGG GAGLGAQSVTFSSSLSGLSGGDGGAGGAGGAGGAGGTGGWLYGGGGAAGSGGDGGTGG QGGAGGAGVFSLFGSGGGPGGNGGVGGVGGVGGAGGRAGLFGVGGLGGAGGDAGDSGE GGFGGPGLAGGLFGNPGNGGVGGIGGDAAAGGAGGAGGNGGAGGNGGWLFGNGGAGGS GGDGGAAGRGGAGNLGSAGGINAPAGNPGSGSVGIGGAGGAGGTAGLFGDGGAGGAGG AGAAGGFGGISAATPSAGSEGAMGGAGGVGGNARLLGTGGAGGVGGGGGAGGDGGRGG VATPGGQGGDAGDGGAGGAGGNGGGASGAGGWLLGTGGAGGAGGNGGNGGKAGFSPGP TNFGLNGAGGGGGVGGNGATGPWLFGDGGPTPGSTGAGAAGGHGGDAQLIGNGGHGGA GGTGVPNGSGGAGGLSGLLFGEPGANG" misc_feature 1213639..1213713 /gene="PE_PGRS21" /locus_tag="Rv1087" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene 1214040..1214360 /locus_tag="Rv1087A" /db_xref="GeneID:3205067" CDS 1214040..1214360 /locus_tag="Rv1087A" /function="UNKNOWN" /note="Rv1087A, 106 aa (fragment). Conserved hypothetical protein, highly similar to C-terminus of near ORF O53434|YA86_MYCTU|Rv1086|MT1118|MTV017.39 SHORT (C15) CHAIN Z-ISOPRENYL DIPHOSPHATE SYNTHASE from Mycobacterium tuberculosis (262 aa), FASTA scores: opt: 200, E(): 1.1e-06, (57.9% identity in 76 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177637.1" /db_xref="GI:57116822" /db_xref="GOA:Q8VK75" /db_xref="UniProtKB/TrEMBL:Q8VK75" /db_xref="GeneID:3205067" /translation="MPCVGYGDRREFVDAVAVEAICENLNTSGQPDPDLVIRTSGEQR LSGHRGPTGGVSRRRLLRALRDYSTPHASIPYVPPPYRSDGIHASRLAVESVFDALAG RVEL" gene 1214513..1214947 /gene="PE9" /locus_tag="Rv1088" /db_xref="GeneID:887096" CDS 1214513..1214947 /gene="PE9" /locus_tag="Rv1088" /function="UNKNOWN" /note="Rv1088, (MTV017.41), len: 144 aa. Member of Mycobacterium tuberculosis PE family (see citation below), similar to many others e.g. Z96071|MTCI418B_6 Mycobacterium tuberculosis cosmid (487 aa), FASTA scores: opt: 318, E(): 7.3e-14, (60.9% identity in 87 aa overlap) - except it appears to be frameshifted around codon 84. No error to account for frameshift could be found. TBparse score is 0.943" /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177784.1" /db_xref="GI:57116823" /db_xref="UniProtKB/TrEMBL:Q79FS8" /db_xref="GeneID:887096" /translation="MSYMIATPAALTAAATDIDGIGSAVSVANAAAVAATTGVLAAGG DEVLAAIARLFNANAEEYHALSAQVAAFQTLFVRTLTGGCGVFRRRRGRQCVTAAEHR AAGAGRRQRRRRSGDGQWRLRQQRHFGCGGQPEFRQHSEHRR" gene 1214769..1215131 /gene="PE10" /locus_tag="Rv1089" /db_xref="GeneID:887090" CDS <1214769..1215131 /gene="PE10" /locus_tag="Rv1089" /function="UNKNOWN" /note="Rv1089, (MTV017.42), len: 120 aa. Member of the Mycobacterium tuberculosis PE family of glycine-rich proteins (see citation below). Partial ORF that appears to be frameshifted continuation of Rv1088|MTV017.41. Sequence has been checked and appears correct. Similar to Z95555|MTCY06F7_4 Mycobacterium tuberculosis cosmid (401 aa), FASTA scores: opt:126, E(): 2, (29.6% identity in 125 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177785.1" /db_xref="GI:57116824" /db_xref="UniProtKB/TrEMBL:Q79FS7" /db_xref="GeneID:887090" /translation="SFAGAEAANASQLQSIARQVRGAVNAVAGQVTGNGGSGNSGTSA AAANPNSDNTASIADRGTSAIMTTASATASSTGVDGGIAATYAVASQWDGGYVANYTI TQFGRDFDDRLAVAIHFA" gene 1215517..1215621 /gene="celA2a" /locus_tag="Rv1089A" /db_xref="GeneID:3205066" CDS 1215517..1215621 /gene="celA2a" /locus_tag="Rv1089A" /EC_number="3.2.1.4" /function="THE BIOLOGICAL CONVERSION OF CELLULOSE TO GLUCOSE GENERALLY REQUIRES THREE TYPES OF HYDROLYTIC ENZYMES: (1) ENDOGLUCANASES WHICH CUT INTERNAL BETA-1,4-GLUCOSIDIC BONDS; (2) EXOCELLOBIOHYDROLASES THAT CUT THE DISSACCHARIDE CELLOBIOSE FROM THE NONREDUCING END OF THE CELLULOSE POLYMER CHAIN; (3) BETA-1,4-GLUCOSIDASES WHICH HYDROLYZE THE CELLOBIOSE AND OTHER SHORT CELLO-OLIGOSACCHARIDES TO GLUCOSE [CATALYTIC ACTIVITY:Endohydrolysis of 1,4-beta-D-glucosidic linkages in cellulose]." /note="Rv1089A, len: 34 aa. Probable celA2a, first part of cellulase (endoglucanase) (EC 3.2.1.4), similar to N-terminus of others." /codon_start=1 /transl_table=11 /product="endo-1,4-beta-glucanase" /protein_id="YP_177638.1" /db_xref="GI:57116825" /db_xref="GOA:Q79FS6" /db_xref="UniProtKB/TrEMBL:Q79FS6" /db_xref="GeneID:3205066" /translation="MNGAAPTNGAPLSYPSICEGVHWGHLVGGHQPAY" gene 1215599..1216054 /gene="celA2b" /locus_tag="Rv1090" /db_xref="GeneID:885637" CDS 1215599..1216054 /gene="celA2b" /locus_tag="Rv1090" /EC_number="3.2.1.4" /function="THE BIOLOGICAL CONVERSION OF CELLULOSE TO GLUCOSE GENERALLY REQUIRES THREE TYPES OF HYDROLYTIC ENZYMES: (1) ENDOGLUCANASES WHICH CUT INTERNAL BETA-1,4-GLUCOSIDIC BONDS; (2) EXOCELLOBIOHYDROLASES THAT CUT THE DISSACCHARIDE CELLOBIOSE FROM THE NONREDUCING END OF THE CELLULOSE POLYMER CHAIN; (3) BETA-1,4-GLUCOSIDASES WHICH HYDROLYZE THE CELLOBIOSE AND OTHER SHORT CELLO-OLIGOSACCHARIDES TO GLUCOSE [CATALYTIC ACTIVITY:Endohydrolysis of 1,4-beta-D-glucosidic linkages in cellulose]." /note="Rv1090, (MTV017.43), len: 151 aa. Probable celA2b, second part of cellulase (endoglucanase) (EC 3.2.1.4), similar to C-terminus of others e.g. O08468 cellulase CEL2 from Streptomyces halstedi (377 aa), FASTA scores: opt: 554, E(): 1.2e-30, (52.0% identity in 152 aa overlap); etc. TBparse score is 0.876. Gene appears to have been inactivated by frameshift mutations but no errors could be found that would account for this." /codon_start=1 /transl_table=11 /product="endo-1,4-beta-glucanase" /protein_id="NP_215606.1" /db_xref="GI:15608230" /db_xref="GOA:O53438" /db_xref="UniProtKB/TrEMBL:O53438" /db_xref="GeneID:885637" /translation="MGTNLPTEVGQILSAPTSIDYNYPTTGVWDASYDICLDSTPKTT GVNQQEIMIWFNHQGSIQPVGSPVGNTTIEGKNFVVWDGSNGMNNAMAYVATEPIEVW SFDVMSFVDHTATMEPITDSWYLTSIRAGLEPWSDGVGLGVDSFSAKVN" gene 1216469..1219030 /gene="PE_PGRS22" /locus_tag="Rv1091" /db_xref="GeneID:885258" CDS 1216469..1219030 /gene="PE_PGRS22" /locus_tag="Rv1091" /function="UNKNOWN" /note="Rv1091, (MTV017.44), len: 853 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Similar to Rv1087|AL021897|MTV017_39 Mycobacterium tuberculosis H37Rv (767 aa), FASTA scores: opt: 2819, E(): 0, (60.0% identity in 860 aa overlap). TBparse score is 0.859." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177786.1" /db_xref="GI:57116826" /db_xref="UniProtKB/TrEMBL:Q79FS5" /db_xref="GeneID:885258" /translation="MSFVIAAPEALVAVASDLAGIGSALAEANAAALAPTTALLAAGA DEVSAAIAALFGAHGQAYQTVSAQASAFHAQFVQALTGGGGAYAAAEAANVSAAQSTD QRLLDLINGPTQALLGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTSTTAGVAGGNG GAAGLIGNGGAGGGGGAGAAGGNGGAGGWLYGNGGAGGAGGTSVIPGVAGGNGGAGGS AGLWGTGGAGGDGGNGRSGPVNVAGSAGGNGGAGGAAGLFGDAGAGGNGGKGGAGGAA FSINFTAGDGGAGGAGGSGGHALLWGAGGAGGNGGSGGTGGAGGSTAGAGGNGGAGGG GGTGGLLFGNGGAGGHGAAAGNGLAAGNGVSSSGGGGAGGTGGAGGDGGAGGAGGNAR LWGVGGAGGAGGDGGAGGAGGKGGSGLSGNANGGAGGDSGRGGTGGAGGEGGAAGLLV GTGGHGGDGGAGGAAVKGGDGGAAAGTGIAGAGGRGGAGGSGGSGGDGGGGAAGPAGW LFGDGGAGGNGGAAAAGGAGGQAGGGGGNGGNGGNGGNGGNGGNGATGGWLYGNGGAG GQGATAGAGGAGANGVSSTNGGGTGGNGGIGGTGGSGGAGGNAGLLGVGGAGGHGASG GAGDRGGAGGTGFISSDGGAGGDGGDGGNGGAGGTGGLLFGAGGNGGPGGSGGAADIG GNGGAGNGGGTDGNGGNGGSGGGAGSGGDGGGAGGNGAWLFGNGGAGGGGGKGGNGAG GGLGGGSFGLPGLNGSGGDGGDGGNGAPGGVLYGNGGAGGQGSSGGIGGPGATGGAGG KGGDGGDAQLIGDGGNGGNGGAGGTGGTPGPGGPGGSGGLGGLLFGQTGTAGVSP" gene complement(1219248..1220186) /gene="coaA" /locus_tag="Rv1092c" /db_xref="GeneID:885120" CDS complement(1219248..1220186) /gene="coaA" /locus_tag="Rv1092c" /EC_number="2.7.1.33" /function="Coenzyme A (CoA) biosynthesis [CATALYTIC ACTIVITY : ATP + PANTOTHENATE = ADP + D-4'- PHOSPHOPANTOTHENATE.]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of (R)-4'-phosphopantothenate in coenzyme A biosynthesis" /codon_start=1 /transl_table=11 /product="pantothenate kinase" /protein_id="NP_215608.1" /db_xref="GI:15608232" /db_xref="GOA:P63810" /db_xref="UniProtKB/Swiss-Prot:P63810" /db_xref="GeneID:885120" /translation="MSRLSEPSPYVEFDRRQWRALRMSTPLALTEEELVGLRGLGEQI DLLEVEEVYLPLARLIHLQVAARQRLFAATAEFLGEPQQNPDRPVPFIIGVAGSVAVG KSTTARVLQALLARWDHHPRVDLVTTDGFLYPNAELQRRNLMHRKGFPESYNRRALMR FVTSVKSGSDYACAPVYSHLHYDIIPGAEQVVRHPDILILEGLNVLQTGPTLMVSDLF DFSLYVDARIEDIEQWYVSRFLAMRTTAFADPESHFHHYAAFSDSQAVVAAREIWRTI NRPNLVENILPTRPRATLVLRKDADHSINRLRLRKL" misc_feature complement(1219875..1219898) /gene="coaA" /locus_tag="Rv1092c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1220574..1221854 /gene="glyA" /locus_tag="Rv1093" /db_xref="GeneID:885338" CDS 1220574..1221854 /gene="glyA" /locus_tag="Rv1093" /EC_number="2.1.2.1" /function="INTERCONVERSION OF SERINE AND GLYCINE. KEY ENZYME IN THE BIOSYNTHESIS OF PURINES, LIPIDS, HORMONES AND OTHER COMPONENTS [CATALYTIC ACTIVITY: 5,10-METHYLENETETRAHYDROFOLATE + GLYCINE + H(2)O = TETRAHYDROFOLATE + L-SERINE.]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the reaction of glycine with 5,10-methylenetetrahydrofolate to form L-serine and tetrahydrofolate" /codon_start=1 /transl_table=11 /product="serine hydroxymethyltransferase" /protein_id="YP_177787.1" /db_xref="GI:57116827" /db_xref="GOA:O53441" /db_xref="UniProtKB/Swiss-Prot:O53441" /db_xref="GeneID:885338" /translation="MSAPLAEVDPDIAELLAKELGRQRDTLEMIASENFVPRAVLQAQ GSVLTNKYAEGLPGRRYYGGCEHVDVVENLARDRAKALFGAEFANVQPHSGAQANAAV LHALMSPGERLLGLDLANGGHLTHGMRLNFSGKLYENGFYGVDPATHLIDMDAVRATA LEFRPKVIIAGWSAYPRVLDFAAFRSIADEVGAKLLVDMAHFAGLVAAGLHPSPVPHA DVVSTTVHKTLGGGRSGLIVGKQQYAKAINSAVFPGQQGGPLMHVIAGKAVALKIAAT PEFADRQRRTLSGARIIADRLMAPDVAKAGVSVVSGGTDVHLVLVDLRDSPLDGQAAE DLLHEVGITVNRNAVPNDPRPPMVTSGLRIGTPALATRGFGDTEFTEVADIIATALAT GSSVDVSALKDRATRLARAFPLYDGLEEWSLVGR" gene 1221959..1222786 /gene="desA2" /locus_tag="Rv1094" /db_xref="GeneID:885339" CDS 1221959..1222786 /gene="desA2" /locus_tag="Rv1094" /EC_number="1.14.19.2" /function="THOUGHT TO CATALYZE THE PRINCIPAL CONVERSION OF SATURATED FATTY ACIDS TO UNSATURATED FATTY ACIDS. THOUGHT TO CONVERT STEAROYL-ACP TO OLEOYL-ACP BY INTRODUCTION OF A CIS DOUBLE BOND BETWEEN CARBONS DELTA-9 AND DELTA-10 OF THE ACYL CHAIN [CATALYTIC ACTIVITY: Stearoyl-[acyl-carrier protein] + AH2 + O2 = oleoyl-[acyl-carrier protein] + A + 2 H2O]." /experiment="experimental evidence, no additional details recorded" /note="Rv1094, (MTV017.47), len: 275 aa. Possible desA2, acyl-[acyl-carrier protein] desaturase (stearoyl-ACP desaturase) (EC 1.14.99.6), equivalent to AL049491|MLCB1222_15 from Mycobacterium leprae (275 aa), FASTA score: (78.1% identity in 274 aa overlap). Also weakly similar to plant stearoyl-acyl carrier protein desaturases, and very similar to U49839|MTV043.16C|Rv0824c enzyme desA1 from Mycobacterium tuberculosis (338 aa), FASTA scores: opt: 525, E(): 8.5e-30, (32.2% identity in 270 aa overlap); and to U15182|MLU15182_32 acyl-carrier protein desaturase precursor from Mycobacterium leprae (338 aa), FASTA scores: opt: 506, E(): 1.9e-28, (34.1% identity in 261 aa overlap). TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="acyl-[acyl-carrier protein] desaturase" /protein_id="NP_215610.1" /db_xref="GI:15608234" /db_xref="GOA:O53442" /db_xref="UniProtKB/TrEMBL:O53442" /db_xref="GeneID:885339" /translation="MAQKPVADALTLELEPVVEANMTRHLDTEDIWFAHDYVPFDQGE NFAFLGGRDWDPSQSTLPRTITDACEILLILKDNLAGHHRELVEHFILEDWWGRWLGR WTAEEHLHAIALREYLVVTREVDPVANEDVRVQHVMKGYRAEKYTQVETLVYMAFYER CGAVFCRNLAAQIEEPILAGLIDRIARDEVRHEEFFANLVTHCLDYTRDETIAAIAAR AADLDVLGADIEAYRDKLQNVADAGIFGKPQLRQLISDRITAWGLAGEPSLKQFVTG" gene 1222997..1224298 /gene="phoH2" /locus_tag="Rv1095" /db_xref="GeneID:885592" CDS 1222997..1224298 /gene="phoH2" /locus_tag="Rv1095" /function="FUNCTION NOT REALLY KNOWN." /experiment="experimental evidence, no additional details recorded" /note="Rv1095, (MTV017.48), len: 433 aa. Probable phoH2, phoH-like protein (phosphate starvation-induced protein), probably ATP-binding protein. Equivalent to AL049491 MLCB1222_14 Mycobacterium leprae (433 aa) (92.8% identity in 432 aa overlap). Similar to many proteins described as PhoH-like e.g. Z97025|BSZ97025_12 Bacillus subtilis (442 aa), FASTA scores: opt: 605, E(): 0, (40.1% identity in 444 aa overlap); or Mycobacterium tuberculosis Rv2368c|O05830|PHOL_MYCTU Mycobacterium tuberculosis (352 aa), FASTA scores: opt: 390, E(): 4e-19, (31.5% identity in 241 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.885. BELONGS TO THE PHOH FAMILY." /codon_start=1 /transl_table=11 /product="PhoH-like protein PhoH2 (phosphate starvation-inducible protein PsiH)" /protein_id="NP_215611.1" /db_xref="GI:15608235" /db_xref="GOA:O53443" /db_xref="UniProtKB/TrEMBL:O53443" /db_xref="GeneID:885592" /translation="MTDTRTYVLDTSVLLSDPWACSRFAEHDVVVPLVVISELEAKRH HHELGWFARQALRLFDDLRLEHGRLDQPIPVGTQGGTLHVELNHTDPAVLPAGFRTDS NDSRILSCAANLAAEGKRVTLVSKDIPLRVKAAAVGLAADEYHAQDVVVSGWSGMHEL ETASADIDALFADGEIDLVEARDLPCHTGIRLLGGGSHALGRVNAHKRVQLVRGDREA FGLRGRSAEQRVALDLLLDESVGIVSLGGKAGTGKSALALCAGLEAVLERRTHRKVVV FRPLYAVGGQELGYLPGSESEKMGPWAQAVFDTLEGLASPAVLEEVLSRGMLEVLPLT HIRGRSLHDSFVIVDEAQSLERNVLLTVLSRLGTGSRVVLTHDIAQRDNLRVGRHDGV AAVIEKLKGHPLFAHITLLRSERSPIAALVTEMLEEITGPR" misc_feature 1223735..1223758 /gene="phoH2" /locus_tag="Rv1095" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1224385..1225260 /locus_tag="Rv1096" /db_xref="GeneID:885949" CDS 1224385..1225260 /locus_tag="Rv1096" /EC_number="3.-.-.-" /function="PROBABLY INVOLVED IN CARBOHYDRATE DEGRADATION. May Hydrolyse the glycosidic bond between two or more carbohydrates or between a carbohydrate and a non-carbohydrate moiety." /note="Rv1096, (MTV017.49), len: 291 aa. Possible glycosyl hydrolase (EC 3.-.-.-), possibly deacetylase or esterase. Equivalent to AL049491|MLCB1222_13 Mycobacterium leprae (291 aa) (81.3% identity in 289 aa overlap). Similar at the C-terminus to enzymes involved in carbohydrate degradation including Z99110|BSUB0007_92 endo-1,4-beta-xylanase homolog yjeA from Bacillus subtilis (467 aa), FASTA scores: opt: 418, E(): 2.6e-17, (38.6% identity in 184 aa overlap); M64552|STMXLNB_2 acetyl-xylan esterase from Streptomyces lividans (335 aa), FASTA scores: opt: 371, E(): 1.1e-14, (31.6% identity in 237 aa overlap); NP_345933.1|NC_003028 peptidoglycan N-acetylglucosamine deacetylase A from Streptococcus pneumoniae (463 aa); etc. Has possible N-terminal signal sequence with TMhelix at aa 13-31. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="glycosyl hydrolase" /protein_id="NP_215612.1" /db_xref="GI:15608236" /db_xref="GOA:O53444" /db_xref="UniProtKB/TrEMBL:O53444" /db_xref="GeneID:885949" /translation="MPKRPDNQTWRYWRTVTGVVVAGAVLVVGGLSGRVTRAENLSCS VIKCVALTFDDGPGPYTDRLLHILTDNDAKATFFLIGNKVAANPAGARRIADAGMEIG SHTWEHPNMTTIPPEDIPGQFSRANDVIAAATGRTPTLYRPAGGLSNDAVRQAAAKVG QAEILWDVIPFDWINDSNTAATRHMLMTQIKPGSVVLFHDTYSSTVDVVYQFIPVLKA NGYRLVTVSELLGPRAPGSSYGSRENGPPVNELRDIPASEIPPLPNTSSPKPMPNFPI TDIAGQNSGGPNNGA" gene complement(1225263..1226144) /locus_tag="Rv1097c" /db_xref="GeneID:885962" CDS complement(1225263..1226144) /locus_tag="Rv1097c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1097c, (MTV017.50c), len: 293 aa. Probable membrane Gly-, Pro-rich protein, similar to Mycobacterium tuberculosis Rv2507|MTCY07A7. 13|Z95556 (273 aa), FASTA scores: opt: 219, E(): 0.023, (30.5% identity in 266 aa overlap); and Rv2507. Contains potential membrane spanning region (aa 68-92). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215613.1" /db_xref="GI:15608237" /db_xref="UniProtKB/TrEMBL:O53445" /db_xref="GeneID:885962" /translation="MTVPPAGPYGNYPYGPNTYGQDPYWGGQPQGGSYPPAYPPQQYP PGWPAGPYPPGPPPPGPGSKTPWLILAGLAVLGVILLVVILVIGLRGDNKSTTATSPA TSAPTSQPFSQQTATGCTPNVSGGVQPIGDSISAGKLSFPTSAAPGWSAFSDDQNPNL IDAVGVGHEVAGADQWMMQAEVAITNFVTTMDVAAQASKLMQCVADGPGYAGSSPTLG PTKTSSITVDGVRAARVDADITIADSSRNVKGDSVTIIAVDTKPVTVFLGATPIGDAT SRATVERVIEALKVNKS" gene complement(1226141..1227565) /gene="fumC" /locus_tag="Rv1098c" /db_xref="GeneID:885651" CDS complement(1226141..1227565) /gene="fumC" /locus_tag="Rv1098c" /EC_number="4.2.1.2" /function="Involved in the tricarboxylic acid cycle. Catalyzes the reversible hydration of fumarate to L-malate [CATALYTIC ACTIVITY: (S)-malate = fumarate + H2O]" /experiment="experimental evidence, no additional details recorded" /note="class II family (does not require metal); tetrameric enzyme; fumarase C; reversibly converts (S)-malate to fumarate and water; functions in the TCA cycle" /codon_start=1 /transl_table=11 /product="fumarate hydratase" /protein_id="NP_215614.1" /db_xref="GI:15608238" /db_xref="GOA:O53446" /db_xref="UniProtKB/Swiss-Prot:O53446" /db_xref="GeneID:885651" /translation="MAVDADSANYRIEHDTMGEVRVPAKALWRAQTQRAVENFPISGR GLERTQIRALGLLKGACAQVNSDLGLLAPEKADAIIAAAAEIADGQHDDQFPIDVFQT GSGTSSNMNTNEVIASIAAKGGVTLHPNDDVNMSQSSNDTFPTATHIAATEAAVAHLI PALQQLHDALAAKALDWHTVVKSGRTHLMDAVPVTLGQEFSGYARQIEAGIERVRACL PRLGELAIGGTAVGTGLNAPDDFGVRVVAVLVAQTGLSELRTAANSFEAQAARDGLVE ASGALRTIAVSLTKIANDIRWMGSGPLTGLAEIQLPDLQPGSSIMPGKVNPVLPEAVT QVAAQVIGNDAAIAWGGANGAFELNVYIPMMARNILESFKLLTNVSRLFAQRCIAGLT ANVEHLRRLAESSPSIVTPLNSAIGYEEAAAVAKQALKERKTIRQTVIDRGLIGDRLS IEDLDRRLDVLAMAKAEQLDSDRL" misc_feature complement(1226588..1226617) /gene="fumC" /locus_tag="Rv1098c" /note="PS00163 Fumarate lyases signature" gene complement(1227596..1228582) /gene="glpX" /locus_tag="Rv1099c" /db_xref="GeneID:885861" CDS complement(1227596..1228582) /gene="glpX" /locus_tag="Rv1099c" /EC_number="3.1.3.11" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="type II fructose 1,6-bisphosphatae; in Escherichia coli this protein forms a dimer and binds manganese" /codon_start=1 /transl_table=11 /product="fructose 1,6-bisphosphatase II" /protein_id="NP_215615.1" /db_xref="GI:15608239" /db_xref="GOA:O53447" /db_xref="UniProtKB/TrEMBL:O53447" /db_xref="GeneID:885861" /translation="MELVRVTEAGAMAAGRWVGRGDKEGGDGAAVDAMRELVNSVSMR GVVVIGEGEKDHAPMLYNGEEVGNGDGPECDFAVDPIDGTTLMSKGMTNAISVLAVAD RGTMFDPSAVFYMNKIAVGPDAAHVLDITAPISENIRAVAKVKDLSVRDMTVCILDRP RHAQLIHDVRATGARIRLITDGDVAGAISACRPHSGTDLLAGIGGTPEGIIAAAAIRC MGGAIQAQLAPRDDAERRKALEAGYDLNQVLTTEDLVSGENVFFCATGVTDGDLLKGV RYYPGGCTTHSIVMRSKSGTVRMIEAYHRLSKLNEYSAIDFTGDSSAVYPLP" gene 1228683..1229384 /locus_tag="Rv1100" /db_xref="GeneID:885852" CDS 1228683..1229384 /locus_tag="Rv1100" /function="UNKNOWN" /note="Rv1100, (MTV017.53), len: 233 aa. Conserved hypothetical protein, slightly similar to Rv1906c|MTCY180.12 hypothetical protein from Mycobacterium tuberculosis (156 aa), FASTA scores: opt: 122, E(): 6.9, (27.4% identity in 135 aa overlap). Equivalent to AL049491|MLCB1222_9 Mycobacterium leprae (257 aa) (63.8% identity in 257 aa overlap). TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215616.1" /db_xref="GI:15608240" /db_xref="UniProtKB/TrEMBL:O53448" /db_xref="GeneID:885852" /translation="MVGDCPRSRTVRWSWDTGHVTAEPQPTPRPAKPRLLQDGRDMFW SLAPLVVGCILLAGLVGMCSFQLGGTKRGPIPSYDAAQALRADAKTLGFPIRLPQLPG GWTPNSGGRGGIENGRADPATGQRRNAATSIVGFISPTGRYLSLTQSNADEDKLVGSI HPSMYPTGTVDVGGTRWVVYEGSDENGAVEPVWTTRLTGPGGATQLAITGAGSIDQFR TLASATQSQPPLPAR" gene complement(1229391..1230548) /locus_tag="Rv1101c" /db_xref="GeneID:885482" CDS complement(1229391..1230548) /locus_tag="Rv1101c" /function="UNKNOWN" /note="Rv1101c, (MTV017.54c), len: 385 aa. Conserved membrane protein, shows some similarity to other bacterial proteins e.g. P77406|PERM_ECOLI PUTATIVE PERMEASE PERM from Escherichia coli (353 aa), FASTA scores: opt: 287, E(): 8.8e-12, (24.9% identity in 349 aa overlap). TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215617.1" /db_xref="GI:15608241" /db_xref="GOA:O53449" /db_xref="UniProtKB/Swiss-Prot:O53449" /db_xref="GeneID:885482" /translation="MNTEFTLTQKRALAILTLIALLFGAYFLRNYFVLIVVAAVGAYL FTPLFKWFTKRFNTGLSAACTLLSALAAVVVPVGALVGLAIVQIARMVDSVADWVRTT DLSTLGDKILQFVNGLFDRVPFLHITVTADALRKAMISVAQNVGEWLLHFLRDAAGSL AGVITSAIIFVYVFVALLVNREKLRTLIGQLNPLGEDVTDLYLQKMGSMVRGTVNGQF VIAACQGVAGAASIYIAGFHHGFFIFAIVLTALSIIPLGGGIVTIPFGIGMIFYGNIA GGIFVLLWHLLVVTNIDNVLRPILVPRDARLNSALMLLSVFAGITMFGPWGIIIGPVL MILIVTTIDVYLAVYKGVELEQFEAPPVRRRWLPRRGPATSRNAPPPSTAE" gene complement(1230660..1230971) /locus_tag="Rv1102c" /db_xref="GeneID:886001" CDS complement(1230660..1230971) /locus_tag="Rv1102c" /function="UNKNOWN" /note="Rv1102c, (MTV017.55c), len: 103 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protens e.g. Rv1942c|MTCY9F9_22 (109 aa), FASTA scores: opt: 158, E(): 3.6e-05, (33.3% identity in 93 aa overlap); Rv0659c|MTCI376_17 (102aa), opt: 140, E(): 0.00072, (40.6% identity in 69aa overlap); and Rv1495. TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215618.1" /db_xref="GI:15608242" /db_xref="GOA:O53450" /db_xref="UniProtKB/TrEMBL:O53450" /db_xref="GeneID:886001" /translation="MRPIHIAQLDKARPVLILTREVVRPHLTNVTVAPITTTVRGLAT EVPVDAVNGLNQPSVVSCDNTQTIPVCDLGRQIGYLLASQEPALAEAIGNAFDLDWVV A" gene complement(1230971..1231291) /locus_tag="Rv1103c" /db_xref="GeneID:885941" CDS complement(1230971..1231291) /locus_tag="Rv1103c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1103c, (MTV017.56c), len: 106 aa. Conserved hypothetical protein, similar to part of Mycobacterium tuberculosis hypothetical protein Rv2472|AL021246|MTV008_27 Mycobacterium tuberculosis (97 aa), FASTA score: opt: 135, E(): 0.0091, (45.8% identity in 72 aa overlap). TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215619.1" /db_xref="GI:15608243" /db_xref="UniProtKB/TrEMBL:O53451" /db_xref="GeneID:885941" /translation="MYLPWGVVLAGGANGFGAGAYQTGTICEVSTQIAVRLPDEIVAF IDDEVRGQHARSRAAVVLRALERERRRRLAERDAEILATNTSATGDLDTLAGHCARTA LDID" gene 1231301..1231990 /locus_tag="Rv1104" /db_xref="GeneID:885835" CDS 1231301..1231990 /locus_tag="Rv1104" /function="IN COMBINATION WITH MTV017.58|Rv1105 CATALYZES HYDROLYSIS OF SEVERAL BETA-LACTAM ANTIBIOTIC PNB ESTERS TO THE CORRESPONDING FREE ACID AND PNB ALCOHOL" /note="Rv1104, (MTV017.57), len: 229 aa. Possible para-nitrobenzyl esterase (fragment; possibly first part) (EC 3.1.1.-). Similar to the N-terminal domain of many e.g. P37967|PNBA_BACSU Bacillus subtilis (489 aa), FASTA scores: opt: 715, E(): 0, (53.4% identity in 191 aa overlap). Gene may be inactivated as a frameshift is required to obtain a product continuing in MTV017.58|Rv1105. TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215620.1" /db_xref="GI:15608244" /db_xref="GOA:O53452" /db_xref="UniProtKB/TrEMBL:O53452" /db_xref="GeneID:885835" /translation="MVVDSCVAESRYGPVRGADDGRVKVWKGIRYAAPPLGDLRFRTP EPPERWTEVADATTFGPACPQPAIPNMPLDLGASQSEDCWSLNIWAPADTEPGDGKPV MVWLHGGAYILGSGSQPLYNGRRLAASGDVVVVTVNYRLGALGFLDLSSFNTSRRRFD SNIGLRDVLAVLRWVADNIAVFGGDPEKVTLFGESARESSRPCSPPRRPRVCSRRRSP RAHRRHRSTTR" gene 1232311..1232826 /locus_tag="Rv1105" /db_xref="GeneID:885977" CDS 1232311..1232826 /locus_tag="Rv1105" /function="IN COMBINATION WITH MTV017.57|Rv1104 CATALYZES HYDROLYSIS OF SEVERAL BETA-LACTAM ANTIBIOTIC PNB ESTERS TO THE CORRESPONDING FREE ACID AND PNB ALCOHOL" /note="Rv1105, (MTV017.58), len: 171 aa. Possible para-nitrobenzyl esterase (fragment; possibly second part) (EC 3.1.1.-). Similar to C-terminal domain of many e.g. P71048 PARA-NITROBENZYL ESTERASE from Bacillus subtilis (489 aa), FASTA scores: opt: 248, E(): 2.7e-10, (32.3% identity in 167 aa overlap). Gene may be inactivated as a frameshift is required to obtain a product continuing from MTV017.57|Rv1104. Start changed since first submission. TBparse score is 0.936." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215621.2" /db_xref="GI:57116828" /db_xref="UniProtKB/TrEMBL:O53453" /db_xref="GeneID:885977" /translation="MFTQIAAEQPDLQVPTEEQIGSAYSRWRRKARSLSMATDVGFRM PSVWLAEGHSGVAPVYLYRFDYSTPLLKLLLVRAAHATELPYVWGNLGGSQDPALKLG DAKAAIAVSRRVRTRWINFATRGKPTGPDGEPDWPCYEEAHRACLIIGRRDAVVHDVD AHIRATWGSKW" gene complement(1232844..1233956) /locus_tag="Rv1106c" /db_xref="GeneID:886004" CDS complement(1232844..1233956) /locus_tag="Rv1106c" /EC_number="1.1.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1106c, (MTV017.59c), len: 370 aa. Probable cholesterol dehydrogenase (EC 1.1.1.-). Equivalent to AL049491|MLCB1222_7 Mycobacterium leprae (376 aa ) (75.5% identity in 375 aa overlap). Highly similar to Q03704 NAD(P)-dependent cholesterol dehydrogenase from Nocardia sp. (364 aa), FASTA scores: opt: 1789, E(): 0, (74.5% identity in 361 aa overlap). Also similar to U32426|MCU32426_1 3-beta-hydroxy-Delta5-steroid dehydrogenase from Molluscum contagiosum virus (354 aa), FASTA scores: opt: 432, E(): 1.7e-22, (34.6% identity in 347 aa overlap). Also similar to series of Mycobacterium tuberculosis hypothetical proteins described as sugar epimerases or dehydratases e.g. Rv3634c, Rv3784, Rv3464, etc. TBparse score is 0.885. The transcription of this CDS seems to be activated specifically in host granulomas (see Ramakrishnan et al., 2000)." /codon_start=1 /transl_table=11 /product="cholesterol dehydrogenase" /protein_id="NP_215622.1" /db_xref="GI:15608246" /db_xref="GOA:O53454" /db_xref="UniProtKB/TrEMBL:O53454" /db_xref="GeneID:886004" /translation="MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSF DRAPSLLPAHPQLEVLQGDITDADVCAAAVDGIDTIFHTAAIIELMGGASVTDEYRQR SFAVNVGGTENLLHAGQRAGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTE TKVVAERFVLAQNGVDGMLTCAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSA RLDNSYVHNLIHGFILAAAHLVPDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWP KMRISGPAVRWVMTGWQRLHFRFGFPAPLLEPLAVERLYLDNYFSIAKARRDLGYEPL FTTQQALTECLPYYVSLFEQMKNEARAEKTAATVKP" gene complement(1233966..1234223) /gene="xseB" /locus_tag="Rv1107c" /db_xref="GeneID:886005" CDS complement(1233966..1234223) /gene="xseB" /locus_tag="Rv1107c" /EC_number="3.1.11.6" /function="BIDIRECTIONALLY DEGRADES SINGLE-STRANDED DNA INTO LARGE ACID-INSOLUBLE OLIGONUCLEOTIDES, WHICH ARE THEN DEGRADED FURTHER INTO SMALL ACID-SOLUBLE OLIGONUCLEOTIDES [CATALYTIC ACTIVITY: EXONUCLEOLYTIC CLEAVAGE IN EITHER 5'- TO 3'- OR 3'- TO 5'-DIRECTION TO YIELD 5'-PHOSPHOMONONUCLEOTIDES.]" /note="catalyzes the bidirectional exonucleolytic cleavage of DNA" /codon_start=1 /transl_table=11 /product="exodeoxyribonuclease VII small subunit" /protein_id="NP_215623.1" /db_xref="GI:15608247" /db_xref="GOA:P67456" /db_xref="UniProtKB/Swiss-Prot:P67456" /db_xref="GeneID:886005" /translation="MVCDPNGDDTGRTHATVPVSQLGYEACRDELMEVVRLLEQGGLD LDASLRLWERGEQLAKRCEEHLAGARQRVSDVLAGDEAQNG" gene complement(1234213..1235460) /gene="xseA" /locus_tag="Rv1108c" /db_xref="GeneID:885984" CDS complement(1234213..1235460) /gene="xseA" /locus_tag="Rv1108c" /EC_number="3.1.11.6" /function="BIDIRECTIONALLY DEGRADES SINGLE-STRANDED DNA INTO LARGE ACID-INSOLUBLE OLIGONUCLEOTIDES, WHICH ARE THEN DEGRADED FURTHER INTO SMALL ACID-SOLUBLE OLIGONUCLEOTIDES [CATALYTIC ACTIVITY: EXONUCLEOLYTIC CLEAVAGE IN EITHER 5'- TO 3'- OR 3'- TO 5'-DIRECTION TO YIELD 5'-PHOSPHOMONONUCLEOTIDES]" /experiment="experimental evidence, no additional details recorded" /note="bidirectionally degrades single-stranded DNA into large acid-insoluble oligonucleotides" /codon_start=1 /transl_table=11 /product="exodeoxyribonuclease VII large subunit" /protein_id="NP_215624.1" /db_xref="GI:15608248" /db_xref="GOA:P67447" /db_xref="UniProtKB/Swiss-Prot:P67447" /db_xref="GeneID:885984" /translation="MTQNSAENPFPVRAVAIRVAGWIDKLGAVWVEGQLAQITMRPDA KTVFMVLRDPAADMSLTVTCSRDLVLSAPVKLAEGVQVVVCGKPSFYTGRGTFSLRLS EIRAVGIGELLARIDRLRRLLDAEGLFDPRLKRPIPYLPNMIGLITGRASAAERDVTT VASARWPAARFAVRNVAVQGPNAVGQIVEALRELDRDPDVDVIVLARGGGSVEDLLPF SDETLCRAIAACRTPVVSAVGHEPDNPLCDLVVDLRAATPTDAAKKVVPDTAAEQRLI DDLRRRSAQALRNWVSREQRAVAQLRSRPVLADPMTMVSVRAEEVHRARSTLRRNLTL MVAAETERIGHLAARLATLGPAATLARGYAIVQTVAQTGPEGGSEPQVLRSVHDAPEG TKLRVRVADGALAAVSEGQTNGL" gene complement(1235457..1236095) /locus_tag="Rv1109c" /db_xref="GeneID:885828" CDS complement(1235457..1236095) /locus_tag="Rv1109c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1109c, (MTV017.62c), len: 212 aa. Conserved hypothetical protein. Equivalent to AL049491|MLCB1222_4 hypothetical protein from Mycobacterium leprae (205 aa) (68.1% identity in 213 aa overlap). TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215625.1" /db_xref="GI:15608249" /db_xref="UniProtKB/TrEMBL:O53457" /db_xref="GeneID:885828" /translation="MATAPYGVRLLVGAATVAVEETMKLPRTILMYPMTLASQAAHVV MRFQQGLAELVIKGDNTLETLFPPKDEKPEWATFDEDLPDALEGTSIPLLGLSDASEA KNDDRRSDGRFALYSVSDTPETTTASRSADRSTNPKTAKHPKSAAKPTVPTPAVAAEL DYPALTLAQLRARLHTLDVPELEALLAYEQATKARAPFQTLLANRITRATAK" gene 1236185..1237192 /gene="ispH" /locus_tag="Rv1110" /gene_synonym="lytB" /db_xref="GeneID:885830" CDS 1236185..1237192 /gene="ispH" /locus_tag="Rv1110" /gene_synonym="lytB" /EC_number="1.17.1.2" /function="NOT KNOWN. IN OTHER ORGANISMS, LYTB PRODUCT IS INVOLVED IN PENICILLIN TOLERANCE AND CONTROL OF THE STRINGENT RESPONSE." /note="catalyzes the conversion of 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate into isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP); functions in the nonmevalonate isoprenoid biosynthesis pathway" /codon_start=1 /transl_table=11 /product="4-hydroxy-3-methylbut-2-enyl diphosphate reductase" /protein_id="YP_177788.1" /db_xref="GI:57116829" /db_xref="GOA:O53458" /db_xref="UniProtKB/Swiss-Prot:O53458" /db_xref="GeneID:885830" /translation="MVPTVDMGIPGASVSSRSVADRPNRKRVLLAEPRGYCAGVDRAV ETVERALQKHGPPVYVRHEIVHNRHVVDTLAKAGAVFVEETEQVPEGAIVVFSAHGVA PTVHVSASERNLQVIDATCPLVTKVHNEARRFARDDYDILLIGHEGHEEVVGTAGEAP DHVQLVDGVDAVDQVTVRDEDKVVWLSQTTLSVDETMEIVGRLRRRFPKLQDPPSDDI CYATQNRQVAVKAMAPECELVIVVGSRNSSNSVRLVEVALGAGARAAHLVDWADDIDS AWLDGVTTVGVTSGASVPEVLVRGVLERLAECGYDIVQPVTTANETLVFALPRELRSP R" gene complement(1237209..1238192) /locus_tag="Rv1111c" /db_xref="GeneID:885447" CDS complement(1237209..1238192) /locus_tag="Rv1111c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1111c, (MTV017.64c), len: 327 aa. Conserved hypothetical protein, N-terminal domain is hydrophobic, C-terminal half is very rich in Arg. Equivalent to AL049491|MLCB1222_2 hypothetical protein from Mycobacterium leprae (379 aa) (46.0% identity in 374 aa overlap). Start changed since first submission. TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215627.2" /db_xref="GI:57116830" /db_xref="UniProtKB/TrEMBL:O86351" /db_xref="GeneID:885447" /translation="MSAQRARSAVQASHRSIHPHIPGVPWWAAILIAVTATAIGYAID AGSGHKALTLVFTGCYIAGCVGAVLAVRQSDLFTALVQPPLILFCAVPGAYWLFHGGT IGKFKDLLINCGYSLIERFPLMLGTAAGVLLIGLVRWYLGTALFDSIARKLSSLMTGD SDDDGGRRSAQRPARTRSRHARPPSEDNREPIAERRSRRRPRPQNDPHPRRNAHERPA PRSSRFDSYRSYQPSEPSGPAEPVNRYERRGARYQPYARYEPTYEPQRRRARPSEPTN PTHHPISQVRYRGSATRDARRDNYREEQRFDRRDRSRAPRRPPAESWEYDV" gene 1238255..1239328 /locus_tag="Rv1112" /gene_synonym="ychF" /db_xref="GeneID:887151" CDS 1238255..1239328 /locus_tag="Rv1112" /gene_synonym="ychF" /function="UNKNOWN" /note="translation-associated GTPase; the crystal structure of the Haemophilus influenzae YchF protein showed similarity to the yeast structure (PDB: 1NI3); fluorescence spectroscopy revealed nucleic acid binding; the yeast protein YBR025c interacts with the translation elongation factor eEF1" /codon_start=1 /transl_table=11 /product="GTP-dependent nucleic acid-binding protein EngD" /protein_id="NP_215628.1" /db_xref="GI:15608252" /db_xref="GOA:O53459" /db_xref="UniProtKB/TrEMBL:O53459" /db_xref="GeneID:887151" /translation="MSLSLGIVGLPNVGKSTLFNALTRNNVVAANYPFATIEPNEGVV SLPDPRLDKLAELFGSQRVVPAPVTFVDIAGLVKGASEGAGLGNKFLAHIRECDAICQ VVRVFVDDDVTHVTGRVDPQSDIEVVETELILADLQTLERATGRLEKEARTNKARKPV YDAALRAQQVLDAGKTLFAAGVDAAALRELNLLTTKPFLYVFNADEAVLTDPARVGEL RALVAPADAVFLDAAIESELTELDDESAAELLESIGQSERGLDALARAGFHTLKLQTF LTAGPKEARAWTIHQGDTAPKAAGVIHSDFEKGFIKAEIVSYDDLVAAGSMAAAKAAG KVRIEGKDYVMADGDVVEFRFNV" misc_feature 1238279..1238302 /locus_tag="Rv1112" /gene_synonym="ychF" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1239416..1239613 /locus_tag="Rv1113" /db_xref="GeneID:885450" CDS 1239416..1239613 /locus_tag="Rv1113" /function="UNKNOWN" /note="Rv1113, (MTCY22G8.02), len: 65 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein Rv2758c|AL00896 7|MTV002.23 (88 aa) FASTA scores: opt: 97, E(): 0.86, (33.3% identity in 69 aa overlap). Part of family including Rv2871, Rv1241, Rv2132, Rv3321c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215629.1" /db_xref="GI:15608253" /db_xref="UniProtKB/TrEMBL:O06565" /db_xref="GeneID:885450" /translation="MRTTVTVDDALLAKAAELTGVKEKSTLLREGLQTLVRVESARRL AALGGTDPQATAAPRRRTSPR" gene 1239610..1239984 /locus_tag="Rv1114" /db_xref="GeneID:885490" CDS 1239610..1239984 /locus_tag="Rv1114" /function="UNKNOWN" /note="Rv1114, (MTCY22G8.03), len: 124 aa. Conserved hypothetical protein, slight similarity to Mycobacterium tuberculosis hypothetical proteins MTCY159.08c (33.0% identity in 115 aa overlap); Rv1561 and Rv2010." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215630.1" /db_xref="GI:15608254" /db_xref="UniProtKB/TrEMBL:O06566" /db_xref="GeneID:885490" /translation="MILVDTSVWIEHLRAADARLVELLGDDEAGCHPLVIEELALGSI KQRDVVLDLLANLYQFPVVTHDEVLRLVGRRRLWGRGLGAVDANLLGSVALVGGARLW TRDKRLKAACAESGVALAEEVS" gene 1240187..1240885 /locus_tag="Rv1115" /db_xref="GeneID:885951" CDS 1240187..1240885 /locus_tag="Rv1115" /function="UNKNOWN" /note="Rv1115, (MTCY22G8.04), len: 232 aa. Possible exported protein, contains possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215631.1" /db_xref="GI:15608255" /db_xref="UniProtKB/TrEMBL:O06567" /db_xref="GeneID:885951" /translation="MISTTRIDFLWILSVAFASMIALATLLTLINQVVGTPYIPGGDS PAGTDCSELASWVSNAATARPVFGDRFNTGNEEAALAARGFQQGTAPNALVIGWNGHH TAVTLPDGTPVSSGEGGGVRVGGGGAYQPKFTHHMYLPMDVDAGEDQPPAPDEPVTAV DDVEPEMPAPCPTQRPPVTPRHNLCNKLRTMPGALSAALAAAAPVWPAPISGCRGFST SLLAKRNHPVIVGK" gene 1241003..1241188 /locus_tag="Rv1116" /db_xref="GeneID:885969" CDS 1241003..1241188 /locus_tag="Rv1116" /function="UNKNOWN" /note="Rv1116, (MTCY22G8.05), len: 61 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215632.1" /db_xref="GI:15608256" /db_xref="UniProtKB/TrEMBL:O06568" /db_xref="GeneID:885969" /translation="MCSRMADEPRLEAGAHPFEEGRDKAPELRATQMDHVRFTEGRRE RNRDRLERSQQFRQPGR" gene complement(1241115..1241390) /locus_tag="Rv1116A" /db_xref="GeneID:3205101" CDS complement(1241115..1241390) /locus_tag="Rv1116A" /function="UNKNOWN" /note="Rv1116A, len: 91 aa. Conserved hypothetical protein (possibly gene fragment), similar to C-terminal part of Rv1646|Z85982_9 from Mycobacterium tuberculosis (310 aa), FASTA scores: opt: 301, E(): 9.3e-13, (68.05% identity in 72 aa overlap). Also overlaps gene on other strand, Rv1116, at 3'-end." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177639.1" /db_xref="GI:57116831" /db_xref="UniProtKB/TrEMBL:Q8VK68" /db_xref="GeneID:3205101" /translation="MGALGTVRGLQDSNTAFVGALHSGNLLGATGAVLQAPGNAVNGF LFGQTSISQSIDVSPEYGYELVAVSDPVGGTAGSARAGHGYVHADLR" gene 1241633..1241956 /locus_tag="Rv1117" /db_xref="GeneID:885847" CDS 1241633..1241956 /locus_tag="Rv1117" /function="UNKNOWN" /note="Rv1117, (MTCY22G8.06), len: 107 aa. Conserved hypothetical protein, some similarity to P94425|D50453 hypothetical protein from Bacillus subtilis (95 aa), fasta scores: opt: 128, E(): 5.1e-06, (28.3% identity in 92 aa overlap); and AL117322|SCF1.02 Streptomyces coelicolor (109 aa), FASTA scores: opt: 437, E(): 1.6e-25, (57.5% identity in 106 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215633.1" /db_xref="GI:15608257" /db_xref="GOA:O06569" /db_xref="UniProtKB/TrEMBL:O06569" /db_xref="GeneID:885847" /translation="MIFIVVKFETKPEWTERWPDLVASFTAATRAEEGNLWFEWSRSL DDPAEYVLVESFRDGEAGGVHVNSDHFRQAMRELPKALASTPKIISQTIDATGWSAMG EMTVG" gene complement(1241971..1242831) /locus_tag="Rv1118c" /db_xref="GeneID:888936" CDS complement(1241971..1242831) /locus_tag="Rv1118c" /function="UNKNOWN" /note="Rv1118c, (MTCY22G8.07c), len: 286 aa. Conserved hypothetical protein, similar to pseudogene ML0942 in Mycobacterium leprae." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215634.1" /db_xref="GI:15608258" /db_xref="UniProtKB/TrEMBL:O06570" /db_xref="GeneID:888936" /translation="MQSGPHLVGRVGTSFPLIARHQGATRDDAGDTGQPDPLPHVAHP DRLYPPMVHGVDPSTLALDRALNETRTGDLWLFRGRSRPDRAIQTLTNAPVNHVGMTV AIDDLPPLIWHAELGDKLLDVWTGTNHRGVQLNDARQVVQQWAGRYRQRCWLRQLTPH ANRDQEDKLLRVIARMNGTPFPTTARLTGRWLRGRLPTLNDWLRGIPVLDRKVREQTQ RRKQQQRTMGLATAYCAETVAITYEEMGLLVTDKDAHWFDPGKFWSGDSLPLAPGYRL GHEIAVDVGG" gene complement(1242864..1243013) /locus_tag="Rv1119c" /db_xref="GeneID:885853" CDS complement(1242864..1243013) /locus_tag="Rv1119c" /function="UNKNOWN" /note="Rv1119c, (MTCY22G8.08c), len: 49 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215635.1" /db_xref="GI:15608259" /db_xref="UniProtKB/TrEMBL:O06571" /db_xref="GeneID:885853" /translation="MTARVAGQAVGGQILVGEPVHDAVSDCADIRFGSYRLFSLDAAP GPDLD" gene complement(1243010..1243504) /locus_tag="Rv1120c" /db_xref="GeneID:885633" CDS complement(1243010..1243504) /locus_tag="Rv1120c" /function="UNKNOWN" /note="Rv1120c, (MTCY22G8.09c), len: 164 aa. Conserved hypothetical protein, some similarity at C-terminus to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1890c|MTCY180.28 (462 aa), FASTA scores: opt: 187, E(): 2.2e-05, (36.6% identity in 93 aa overlap) and Rv2488c|YZ19_MYCTU|Q10551 (285 aa), FASTA scores: opt: 156, E(): 0.00074, (32.7% identity in 107 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215636.1" /db_xref="GI:15608260" /db_xref="GOA:O06572" /db_xref="UniProtKB/TrEMBL:O06572" /db_xref="GeneID:885633" /translation="MLSGGREAVKTVWQTANLVRKEGFGAAVRSSIEDPADWAEVERP DLARVTPDGRVVILFSDIEESTALDERIGDRTWVKLIGAHDKLVHELVRRWSGHMVTS QGDGFMIAFARAEQAVRCGIDIQDALRNSAKRKRNQGIRVRIGTTWGARCGTVTICSA ATSQ" gene 1243707..1245107 /gene="zwf1" /locus_tag="Rv1121" /db_xref="GeneID:885817" CDS 1243707..1245107 /gene="zwf1" /locus_tag="Rv1121" /EC_number="1.1.1.49" /function="INVOLVED IN PENTOSE PHOSPHATE PATHWAY (FIRST STEP) [CATALYTIC ACTIVITY : D-GLUCOSE 6-PHOSPHATE + NADP(+) = D-GLUCONO-1,5-LACTONE 6-PHOSPHATE + NADPH.]" /note="catalyzes the formation of D-glucono-1,5-lactone 6-phosphate from D-glucose 6-phosphate" /codon_start=1 /transl_table=11 /product="glucose-6-phosphate 1-dehydrogenase" /protein_id="YP_177789.1" /db_xref="GI:57116832" /db_xref="GOA:O06573" /db_xref="UniProtKB/Swiss-Prot:O06573" /db_xref="GeneID:885817" /translation="MVDGGGGASDLLVIFGITGDLARKMTFRALYRLERHQLLDCPIL GVASDDMSVGQLVKWARESIGRTEKIDDAVFDRLAGRLSYLHGDVTDSQLYDSLAELI GSACRPLYYLEMPPALFAPIVENLANVRLLERARVAVEKPFGHDLASALELNARLRAV LGEDQILRVDHFLGKQPVVELEYLRFANQALAELWDRNSISEIHITMAEDFGVEDRGK FYDAVGALRDVVQNHLLQVLALVTMEPPVGSSADDLNDKKAEVFRAMAPLDPDRCVRG QYLGYTEVAGVASDSATETYVALRTEIDNWRWAGVPIFVRAGKELPAKVTEVRLFLRR VPALAFLPNRRPAEPNQIVLRIDPDPGMRLQISAHTDDSWRDIHLDSSFAVDLGEPIR PYERLLYAGLVGDHQLFAREDSIEQTWRIVQPLLDNPGEIHRYDRGSWGPEAAQSLLR GHRGWQSPWLPRGTDA" gene 1245129..1246151 /gene="gnd2" /locus_tag="Rv1122" /db_xref="GeneID:885820" CDS 1245129..1246151 /gene="gnd2" /locus_tag="Rv1122" /EC_number="1.1.1.44" /function="INVOLVED IN HEXOSE MONOPHOSPHATE SHUNT (PENTOSE PHOSPHATE PATHWAY). [CATALYTIC ACTIVITY: 6-phospho-D-gluconate + NADP+ = D-ribulose 5-phosphate + CO2 + NADPH]" /note="similar to full-length Gnd, these proteins seems to have a truncated C-terminal 6PGD domainin; in Methylobacillus flagellatus this gene is essential for NAD+-dependent oxidation of 6-phosphogluconate" /codon_start=1 /transl_table=11 /product="6-phosphogluconate dehydrogenase-like protein" /protein_id="NP_215638.1" /db_xref="GI:15608262" /db_xref="GOA:O06574" /db_xref="UniProtKB/TrEMBL:O06574" /db_xref="GeneID:885820" /translation="MQLGMIGLGRMGANIVRRLAKGGHDCVVYDHDPDAVKAMAGEDR TTGVASLRELSQRLSAPRVVWVMVPAGNITTAVIEELANTLEAGDIVIDGGNTYYRDD LRHEKLLFKKGIHLLDCGTSGGVWGRERGYCLMIGGDGDAFARAEPIFATVAPGVAAA PRTPGRDGEVAPSEQGYLHCGPCGSGHFVKMVHNGIEYGMMASLAEGLNILRNADVGT RVQHGDAETAPLPNPECYQYDFDIPEVAEVWRRGSVIGSWLLDLTAIALRESPDLAEF SGRVSDSGEGRWTAIAAIDEGVPAPVLTTALQSRFASRDLDDFANKALSAMRKQFGGH AEKPAN" gene complement(1246144..1247052) /gene="bpoB" /locus_tag="Rv1123c" /db_xref="GeneID:885965" CDS complement(1246144..1247052) /gene="bpoB" /locus_tag="Rv1123c" /EC_number="1.11.1.-" /function="SUPPOSED INVOLVED IN DETOXIFICATION REACTIONS." /note="Rv1123c, (MTCY22G8.12c), len: 302 aa. Possible bpoB, peroxidase (non-haem peroxidase) (EC 1.11.1.-), with some similarity to a range of enzymes from several organisms including: DEH1_MORSP|Q01398 haloacetate dehalogenase (EC 3.8.1.3) from Moraxella sp. (294 aa), FASTA scores: opt: 201, E(): 2.1e-06, (35.8% identity in 134 aa overlap); and BPA1_STRAU|P33912 non-haem bromoperoxidase bpo-a1 (EC 1.11.1.-) from Streptomyces aureofaciens (274 aa), FASTA scores: opt: 187, E(): 1.6e-05, (23.1% identity in 281 aa overlap). Similar to several other Mycobacterium tuberculosis proteins, probable epoxide hydrolases and non-heme bromoperoxidases e.g. Rv1938, Rv3617, Rv3473c, Rv3171c, etc. Contains PS00216 Sugar transport proteins signature 1." /codon_start=1 /transl_table=11 /product="peroxidase BpoB" /protein_id="NP_215639.1" /db_xref="GI:15608263" /db_xref="GOA:O06575" /db_xref="UniProtKB/TrEMBL:O06575" /db_xref="GeneID:885965" /translation="MTIWRVPSKVTSGPVSAVSSSPQAVAFSGARGITLVADEWNRGA AAADRPTILMLHGGGQNRFSWKNTGQILADEGHHVVALDTRGPGDSDRAPGADYAVET PTTDVLHVVEAIGRRVVVVEASMGGLTGILVAERAGPQTVNGLVLVDVVPRYEKEGNA RIRDFMLGNIDGFGSLEEAADAVAEYLPHRDKPRSPEGLKRNLRLRDGRWHWHWDPAM MTAPGHDPQLRTENFERAAMGLTIPVLLIRGKLSDVVSSDGARDFLAKVPNAEFVELS NAGRTAAGDDNDAFTDVVVDFVRRLS" misc_feature complement(1246681..1246734) /gene="bpoB" /locus_tag="Rv1123c" /note="PS00216 Sugar transport proteins signature 1" gene 1247127..1248077 /gene="ephC" /locus_tag="Rv1124" /db_xref="GeneID:886022" CDS 1247127..1248077 /gene="ephC" /locus_tag="Rv1124" /function="THOUGHT TO BE INVOLVED IN DETOXIFICATION REACTIONS FOLLOWING OXIDATIVE DAMAGE TO LIPIDS [CATALYTIC ACTIVITY: AN EPOXIDE + H(2)O = A GLYCOL]." /note="Rv1124, (MTCY22G8.13), len: 316 aa. Probable ephC, epoxide hydrolase (EC 3.3.2.3) (see citation below), similar to Q42566 epoxide hydrolase from Arabidopsis thaliana (321 aa), FASTA scores: opt: 298, E(): 8.2e-13, (27.6% identity in 333 aa overlap). Similar to other M. tuberculosis epoxide hydrolases and non-heme bromoperoxidases e.g. Rv1938, Rv3617, Rv3670, Rv3473c, etc." /codon_start=1 /transl_table=11 /product="epoxide hydrolase EphC" /protein_id="NP_215640.1" /db_xref="GI:15608264" /db_xref="GOA:O06576" /db_xref="UniProtKB/TrEMBL:O06576" /db_xref="GeneID:886022" /translation="MRAGRGERESTWRTTMAEPHWIDVKGPNGDLKALTWGPAGAPVA LCLHGFPDTAYGWRKVAPRLAESGWHVVAPFMRGYAPSSIPADGSYHVGALMHDALRV RSAAGGTERDVIIGHDWGAIAATGLAAMPDSPFAKAVIMSVPPSAAFRPLGRVPERGR LLRELPHQLLRSWYILYFQLPWLPERSASWVVPLLWRRWSPGYHAEEDLRHVDAAIGT PEGRRAALGPYRATMRNTRAPADYADLNRLWTEAPKLPVLYLHGHDDGCATSAFTHWT ARVLPAGSEVAVVEHAGHFLQLEQPDKIAELIVAFIGSPG" gene 1248082..1249326 /locus_tag="Rv1125" /db_xref="GeneID:886021" CDS 1248082..1249326 /locus_tag="Rv1125" /function="UNKNOWN" /note="Rv1125, (MTCY22G8.14), len: 414 aa. Conserved hypothetical protein. Similar to AL133278|SCM11.13 hypothetical protein from Streptomyces coelicolor (446 aa), FASTA scores: opt: 182, E(): 0.0005, (28.1% identity in 437 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215641.1" /db_xref="GI:15608265" /db_xref="UniProtKB/TrEMBL:O06577" /db_xref="GeneID:886021" /translation="MAGHRMAAVDAQFYWMSAKVPNDQFLLYAFDGEPTDLERAVAQV YRRARGCPGLGMRVQDRGALAYPQWVPTPVQRDQLVCHDLADRSWQGCLAAVVGLASK QLDMRRMPWRLHVFTPVHDVPGVSGLGTVAVMQFAHALGDGARASAMAAWLFGRPAAV PEIARSRAGFLPWRAAHAARAHLRLVRDTNAGLVAPGVGSRPPLSTNARPEGVRAVRT LLRRRSQLAGPTVTVTVLAAVSTGLLGLLGGDVDTLGAEVPMAKPGVPRSYNHFGNVV VGLYPRLEPDERVRRIATDLANARRRFEHPAMLSADRAFAAVPAALLRWGVSQFDAEV RPVRVAGNTVVSSVYRGAADLSFGDAPVVLTAGYPALSPAMGLTHGVHGIGDTVAISV HAAESAVSDIDAYMRLLDAALQ" gene complement(1249330..1249935) /locus_tag="Rv1126c" /db_xref="GeneID:885845" CDS complement(1249330..1249935) /locus_tag="Rv1126c" /function="UNKNOWN" /note="Rv1126c, (MTCY22G8.15c), len: 201 aa. Conserved hypothetical protein, similar in N-terminus to O05567|MLCB33.17 hypothetical protein from Mycobacterium leprae (141 aa), FASTA scores: opt: 332, E(): 1.4e-23, (58.4% identity in 101 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215642.1" /db_xref="GI:15608266" /db_xref="UniProtKB/TrEMBL:O06578" /db_xref="GeneID:885845" /translation="MSELTVLQAVRLKGRVITTDLAQTLGEDLADVAATVDRLTAAGL LVDATPLRISPSGRMRLDDLLAEERNRADSTVLAAAYRDFRSVNADFKRLVTDWQLKG EKPNTHDDAEYDAAVLSRLDGVHRRVGPIIGTVAMQLPRLSRYPVKLRAALDKVKAGD IAWLTRPLIDSYHTVWFELHEELIQAVGLTRDEAAKSGDAQ" gene complement(1249932..1251404) /gene="ppdK" /locus_tag="Rv1127c" /db_xref="GeneID:885956" CDS complement(1249932..1251404) /gene="ppdK" /locus_tag="Rv1127c" /EC_number="2.7.9.1" /function="CATALYZES THE REVERSIBLE PHOSPHORYLATION OF PYRUVATE AND PHOSPHATE [CATALYTIC ACTIVITY: ATP + pyruvate + phosphate = AMP + phosphoenolpyruvate + diphosphate]" /note="catalyzes the formation of phosphoenolpyruvate from pyruvate" /codon_start=1 /transl_table=11 /product="pyruvate phosphate dikinase" /protein_id="NP_215643.1" /db_xref="GI:15608267" /db_xref="GOA:O06579" /db_xref="UniProtKB/TrEMBL:O06579" /db_xref="GeneID:885956" /translation="MTRITRANGCPDGTLENAVVALDGGANYPREILGNKGHGIDMMR RHHLPVPPAFCITTEVGVRYLAAPGSTIAAIWDDVLDRMSWLETETSCTFGRGPNPLL VSVRSGATQSMPGMMDTILDVGMTDAVERVLARPGAADFAHDTRRRFTSMYRRIVGSA GPITDDPYAQLRASIEAVFASWNSPRAVAYRDHHGLDDQGGTAVVVQAMVFGNLTANS GAGVLSSRNPITGANEPFGEWLPGGQGDDVVSGLVAVAPITALRDQQPAVYDQLMAAA RSLERMAGDVQEIEFTVEDSQLWLLQTRGAERSAQAAVRLALQLHHEGLIDDTETLRR VTPTHIETLLRPSLQTETRLAAPLLAKGLPACPGVVSGTAYTEVDEALDAADRGEPVI LVRDHTRPEDVMGMLAAQGIVTEVGGAASHAAVVSRELGRVAVVGCGPGVAAALAGKE ITVDGYEGEVRQGVLALSAWSESDTPELRELADIAQRISS" gene complement(1251617..1252972) /locus_tag="Rv1128c" /db_xref="GeneID:885849" CDS complement(1251617..1252972) /locus_tag="Rv1128c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1128c, (MTCY22G8.17c), len: 451 aa. Conserved hypothetical protein, in REP13E12 degenerate repeat, highly similar to several Mycobacterium tuberculosis proteins in REP13E12 repeats e.g. Rv1148c, Rv1945, Rv3467, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215644.1" /db_xref="GI:15608268" /db_xref="UniProtKB/Swiss-Prot:O06580" /db_xref="GeneID:885849" /translation="MCSTREEITEAFASLATALSRVLGLTFDALTTPERLALLEHCET ARRQLPSVEHTLINQIGEQSTEEELGGKLGLTLADRLRITRSEAKRRVAEAADLGQRR ALTGEPLPPLLTATAKAQRHGLIGDGHVEVIRAFVHRLPSWVDLKTLEKAERDLAKQA TQYRPDQLAKLAARIMDCLNPDGDYTDEDRARRRGLTLGKQDVDGMSRLSGYVTPELR ATIEAVWAKLAAPGMCNPEQKAPCVNGAPSKEQARRDTRSCPQRNHDALNAELRSLLT SGNLGQHNGLPASIIVTTTLKDLEAAAGAGLTGGGTILPISDVIRLARHANHYLAIFD RGKALALYHTKRLASPAQRIMLYAKDSGCSAPGCDVPGYYCEVHHVTPYAQCRNTDVN DLTLGCGGHHPLAERGWTTRKNAHGDTEWLPPPHLDHGQPRVNTFHHPEKLLADDEGD P" repeat_region 1251621..1252945 /note="REP-3, len: 1325 bp. REP22G8, member of REP13E12 family.; REP-3" /rpt_type=DIRECT gene complement(1253074..1254534) /locus_tag="Rv1129c" /db_xref="GeneID:885963" CDS complement(1253074..1254534) /locus_tag="Rv1129c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1129c, (MTCY22G8.18c), len: 486 aa. Possible transcriptional regulator protein, similar to Rv0465c|MTV038.09c Mycobacterium tuberculosis (474 aa), FASTA scores: E(): 0, (47.4% identity in 468 aa overlap). Helix turn helix motif present from aa 32-53." /codon_start=1 /transl_table=11 /product="transcriptional regulator protein" /protein_id="NP_215645.1" /db_xref="GI:15608269" /db_xref="GOA:O06581" /db_xref="UniProtKB/TrEMBL:O06581" /db_xref="GeneID:885963" /translation="MTRSNVLPVARTYSRTFSGARLRRLRQERGLTQVALAKALDLST SYVNQLENDQRPITVPVLLLLTERFDLSAQYFSSDSDARLVADLSDVFTDIGVEHAVS GAQIEEFVARMPEVGHSLVAVHRRLRAATEELEGYRSRATAETELPPARPMPFEEVRD FFYDRNNYIHDLDMAAERMFTESGMRTGGLDIQLAELMRDRFGISVVIDDNLPDTAKR RYHPDTKVLRVAHWLMPGQRAFQIATQLALVGQSDLISSIVATDDQLSTEARGVARIG LANYFAGAFLLPYREFHRAAEQLRYDIDLLGRRFGVGFETVCHRLSTLQRPRQRGIPF IFVRTDKAGNISKRQSATAFHFSRVGGSCPLWVVHDAFAQPERIVRQVAQMPDGRSYF WVAKTTAADGLGYLGPHKNFAVGLGCDLAHAHKLVYSTGVVLDDPSTEVPIGAGCKIC NRTSCAQRAFPYLGGRVAVDENAGSSLPYSSTEQSV" gene 1254555..1256135 /locus_tag="Rv1130" /db_xref="GeneID:885843" CDS 1254555..1256135 /locus_tag="Rv1130" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1130, (MTCY22G8.19), len: 526 aa. Conserved hypothetical protein, some similarity to AP000063|AP000063_192 hypothetical protein from Aeropyrum pernix (479 aa), FASTA scores: opt: 717, E(): 0, (34.3% identity in 443 a a overlap), and to PRPD_ECOLI|P77243 prpd protein from Escherichia coli (483aa), FASTA scores: opt: 234, E(): 3.3e-08, (27.0% identity in 429 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215646.1" /db_xref="GI:15608270" /db_xref="UniProtKB/TrEMBL:O06582" /db_xref="GeneID:885843" /translation="MPDQDTKVRFFRVFCWCPVLRMVRIMLMHAVRAWRSADDFPCTE HMAYKIAQVAADPVDVDPEVADMVCNRIIDNAAVSAASMVRRPVTVARHQALAHPVRH GAKVFGVEGSYSADWAAWANGVAARELDFHDTFLAADYSHPADNIPPLVAVAQQLGVC GAELIRGLVTAYEIHIDLTRGICLHEHKIDHVAHLGPAVAAGIGTMLRLDQETIYHAI GQALHLTTSTRQSRKGAISSWKAFAPAHAGKVGIEAVDRAMRGEGSPAPIWEGEDGVI AWLLAGPEHTYRVPLPAPGEPKRAILDSYTKQHSAEYQSQAPIDLACRLRERIGDLDQ IASIVLHTSHHTHVVIGTGSGDPQKFDPDASRETLDHSLPYIFAVALQDGCWHHERSY APERARRSDTVALWHKISTVEDPEWTRRYHCADPAKKAFGARAEVTLHSGEVIVDELA VADAHPLGTRPFERKQYVEKFTELADGVVEPVEQQRFLAVVESLADLESGAVGGLNVL VDPRVLDKAPVIPPGIFR" gene 1256132..1257313 /gene="gltA1" /locus_tag="Rv1131" /db_xref="GeneID:888949" CDS 1256132..1257313 /gene="gltA1" /locus_tag="Rv1131" /EC_number="2.3.3.5" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE (KREBS CYCLE) [CATALYTIC ACTIVITY: Citrate + CoA = acetyl-CoA + H2O + oxaloacetate]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of citrate from acetyl-CoA and oxaloacetate" /codon_start=1 /transl_table=11 /product="citrate synthase" /protein_id="NP_215647.1" /db_xref="GI:15608271" /db_xref="GOA:O08395" /db_xref="UniProtKB/TrEMBL:O08395" /db_xref="GeneID:888949" /translation="MTGPLAAARSVAATKSMTAPTVDERPDIKKGLAGVVVDTTAISK VVPQTNSLTYRGYPVQDLAARCSFEQVAFLLWRGELPTDAELALFSQRERASRRVDRS MLSLLAKLPDNCHPMDVVRTAISYLGAEDPDEDDAAANRAKAMRMMAVLPTIVAIDMR RRRGLPPIAPHSGLGYAQNFLHMCFGEVPETAVVSAFEQSMILYAEHGFNASTFAARV VTSTQSDIYSAVTGAIGALKGRLHGGANEAVMHDMIEIGDPANAREWLRAKLARKEKI MGFGHRVYRHGDSRVPTMKRALERVGTVRDGQRWLDIYQVLAAEMASATGILPNLDFP TGPAYYLMGFDIASFTPIFVMSRITGWTAHIMEQATANALIRPLSAYCGHEQRVLPGT F" misc_feature 1256963..1257001 /gene="gltA1" /locus_tag="Rv1131" /note="PS00480 Citrate synthase signature" gene 1257325..1259055 /locus_tag="Rv1132" /db_xref="GeneID:885824" CDS 1257325..1259055 /locus_tag="Rv1132" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1132, (MTCY22G8.21), len: 576 aa. Conserved membrane protein, similar to O06827|Rv1431|MTCY493.23C membrane protein from Mycobacterium tuberculosis (589 aa), fasta scores: opt: 1811, E(): 0, (48.2% identity in 585 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215648.1" /db_xref="GI:15608272" /db_xref="UniProtKB/TrEMBL:O06583" /db_xref="GeneID:885824" /translation="MGFLQPRLPDIDLAEWSQGSRSQKIRPMAQHWAEVGFGTPVLLH LFYVAKILLYVLVGWLIVLTTKGIDGFTDAAAWYAEPIVFEKVVLYTMLFEVIGLGCG FGPLNNRFFPPMGSILYWMRFGTIRLPPWPDRVPWTRGTKRKPVDVALYALLVMMLLS ALFTDGAGPIPELGTTVGLLPAWQIVLILLLLGVLGLRDKVIFLAARGEVYATLTVTF LFGRLNGIDMIVAAKLVFLVIWIGAATSKLNRHFPFVISTMMSNNPLFRPRFIKRMFF KKFPGDLRPGLLSRIVAHVSTVIEMCVPVVLFVAHGGWPTVVAATIMVCFHLGILTAI PMGVPLEWNVFMIFGVLSLFVGHACLGLADVKNPVPLAILIAVVAGIVIAGNVFPRKI SFLAAMRYYAGNWDTTLWCIKPSAEDKINRGIVAIASMPAAQLERFYGKDRAQIPMYL GYAFRAMNSHGRALFTLAHRAMAGHDEDDYVITDGERVCSTAVGWNFGDGHLHNEQLI AAMQQRCGFQPGEVRVVLLDAQPIHRQTQEYRLVDAATGEFERGYVRVADMVNRQPWD DDVPVHVLPG" gene complement(1259067..1261346) /gene="metE" /locus_tag="Rv1133c" /db_xref="GeneID:888947" CDS complement(1259067..1261346) /gene="metE" /locus_tag="Rv1133c" /EC_number="2.1.1.14" /function="CATALYZES THE TRANSFER OF A METHYL GROUP FROM 5-METHYLTETRAHYDROFOLATE TO HOMOCYSTEINE RESULTING IN METHIONINE FORMATION (PATHWAY: TERMINAL STEP IN THE DE NOVO BIOSYNTHESIS OF METHIONINE) [CATALYTIC ACTIVITY : 5-METHYLTETRAHYDROPTEROYLTRI-L-GLUTAMATE + L- HOMOCYSTEINE = TETRAHYDROPTEROYLTRI-L-GLUTAMATE + L-METHIONINE.]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the transfer of a methyl group from 5-methyltetrahydrofolate to homocysteine to form methionine" /codon_start=1 /transl_table=11 /product="5-methyltetrahydropteroyltriglutamate-- homocysteine S-methyltransferase" /protein_id="NP_215649.1" /db_xref="GI:15608273" /db_xref="GOA:P65340" /db_xref="UniProtKB/Swiss-Prot:P65340" /db_xref="GeneID:888947" /translation="MTQPVRRQPFTATITGSPRIGPRRELKRATEGYWAGRTSRSELE AVAATLRRDTWSALAAAGLDSVPVNTFSYYDQMLDTAVLLGALPPRVSPVSDGLDRYF AAARGTDQIAPLEMTKWFDTNYHYLVPEIGPSTTFTLHPGKVLAELKEALGQGIPARP VIIGPITFLLLSKAVDGAGAPIERLEELVPVYSELLSLLADGGAQWVQFDEPALVTDL SPDAPALAEAVYTALCSVSNRPAIYVATYFGDPGAALPALARTPVEAIGVDLVAGADT SVAGVPELAGKTLVAGVVDGRNVWRTDLEAALGTLATLLGSAATVAVSTSCSTLHVPY SLEPETDLDDALRSWLAFGAEKVREVVVLARALRDGHDAVADEIASSRAAIASRKRDP RLHNGQIRARIEAIVASGAHRGNAAQRRASQDARLHLPPLPTTTIGSYPQTSAIRVAR AALRAGEIDEAEYVRRMRQEITEVIALQERLGLDVLVHGEPERNDMVQYFAEQLAGFF ATQNGWVQSYGSRCVRPPILYGDVSRPRAMTVEWITYAQSLTDKPVKGMLTGPVTILA WSFVRDDQPLADTANQVALAIRDETVDLQSAGIAVIQVDEPALRELLPLRRADQAEYL RWAVGAFRLATSGVSDATQIHTHLCYSEFGEVIGAIADLDADVTSIEAARSHMEVLDD LNAIGFANGVGPGVYDIHSPRVPSAEEMADSLRAALRAVPAERLWVNPDCGLKTRNVD EVTASLHNMVAAAREVRAG" gene 1261922..1262158 /locus_tag="Rv1134" /db_xref="GeneID:885983" CDS 1261922..1262158 /locus_tag="Rv1134" /function="UNKNOWN" /note="Rv1134, (MTCI65.01), len: 78 aa. Hypothetical unknown protein. TBparse score is 0.838." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215650.1" /db_xref="GI:15608274" /db_xref="UniProtKB/TrEMBL:O06534" /db_xref="GeneID:885983" /translation="MAAYQKFGQEHAAAIRGGAVLHPTATATTVRVTGARGGDVVTGD GPYEAADLDEQGPFPMETVYLWEDGPNGTTRMTL" gene complement(1262272..1264128) /gene="PPE16" /locus_tag="Rv1135c" /db_xref="GeneID:885131" CDS complement(1262272..1264128) /gene="PPE16" /locus_tag="Rv1135c" /function="UNKNOWN" /note="Rv1135c, (MTCI65.02c), len: 618 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins. Similar to Rv2356c (59.6% identity in 627 aa overlap); etc.. TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177790.1" /db_xref="GI:57116833" /db_xref="UniProtKB/TrEMBL:Q79FS0" /db_xref="GeneID:885131" /translation="MSFLVLPPEVNSALMFAGAGSGPTLAAAAAWDGLAAELGQAANS FSSATAALADTAWQGPAATAMAAAAAPYASWLSTAATRALSAAAQAKAAAAVYEAARA ATVDPLLVAANRHQLVSLVLSNLFGQNAPAIAATEAAYEQLWAADVAAMVSYHSGASA VAAQLAPWAQAVRALPNPTAPALASGPAALAIPALGIGNTGIGNIFSIGNIGDYNLGN GNTGNANLGSGNTGQANLGSGNTGFFNFGSGNTANTNFGSGNLGNLNLGSGNDGNGNF GLGNIGDGNRGSGNVGSFNFGTANAGSFNVGSANHGSPNVGFANLGNNNLGIANLGNN NLGIANLGNNNIGIGLTGDNMIGIGALNSGIGNLGFGNSGNNNIGLFNSGNNNIGFFN SGDSNFGFFNSGDTNTGFGNAGFTNTGFGNAGSGNFGFGNAGNNNFGFGNSGFENMGV GNSGAYNTGSFNSGTLNTGDLNSGDFNTGWANSGDINTGGFHSGDLNTGFGSPVDQPV MNSGFGNIGTGNSGFNNSGDANSGFQNTNTGAFFIGHSGLLNSGGGQHVGISNSGTGF NTGLFNTGFNNTGIGNSATNAAFTTTSGVANSGDNSSGGFNAGNDQSGFFDG" gene 1264314..1264556 /locus_tag="Rv1135A" /db_xref="GeneID:3205047" CDS 1264314..1264556 /locus_tag="Rv1135A" /EC_number="2.3.1.9" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION [CATALYTIC ACTIVITY: 2 ACETYL-CoA = CoA + ACETOACETYL-COA]." /note="Rv1135A, len: 80 aa. Possible acetyl-CoA acetyltransferase (EC 2.3.1.9) (possible gene fragment), highly similar to other acetyl-CoA acetyltransferases e.g. C-terminal part of Rv3556c|Z92774|MTCY6G11_2|MTCY06G11.03|fadA6 ACETYL-COA ACETYLTRANSFERASE from Mycobacterium tuberculosis (386 aa), FASTA scores: opt: 219, E(): 5.7e-09, (63.6% identity in 55 aa overlap)." /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="YP_177640.1" /db_xref="GI:57116834" /db_xref="GOA:Q8VK64" /db_xref="UniProtKB/TrEMBL:Q8VK64" /db_xref="GeneID:3205047" /translation="MQLGNQNTMRFAGRPQRFRQSAYPLFNPNSAIALGHPFGGSGAR LMTTVLHHMPDKGIRYGLQTMCEGRGQANATIVELL" gene 1264606..1264947 /locus_tag="Rv1136" /db_xref="GeneID:888948" CDS 1264606..1264947 /locus_tag="Rv1136" /EC_number="5.-.-.-" /function="INVOLVED IN CARNITINE METABOLISM." /note="Rv1136, (MTCI65.03), len: 113 aa. Probable enoyl-CoA hydratase (possible gene fragment) (EC 5.-.-.-). Some similarity to N-terminus of carnitine racemases and enoyl-CoA hydratases (but much shorter) e.g. I41014 carnitine racemase from Escherichia coli (297 aa), FASTA scores: opt: 258, E(): 2.5e-11, (44.5% identity in 110 aa overlap); and Rv0222 putative enoyl-CoA hydratase from M. tuberculosis (262 aa). TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215652.1" /db_xref="GI:15608276" /db_xref="GOA:O06536" /db_xref="UniProtKB/TrEMBL:O06536" /db_xref="GeneID:888948" /translation="MVITINRPEARNAVNGAVSIVVGDALEEAHDNPDVRAVVITGAG DKSLCAGADLKAIARRENPYHPHHGEWGIAGYRHHFIDKPTSAAVSGTALDDGAEPAL ASDLVVADEHT" gene complement(1265087..1265455) /locus_tag="Rv1137c" /db_xref="GeneID:885071" CDS complement(1265087..1265455) /locus_tag="Rv1137c" /function="UNKNOWN" /note="Rv1137c, (MTCI65.04c), len: 122 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215653.1" /db_xref="GI:15608277" /db_xref="UniProtKB/TrEMBL:O06537" /db_xref="GeneID:885071" /translation="MLSARCHIRHIGSPGKDARCAHLSATLRPGIGISPTNVGNATVL ADGTPAKPIQGAETMQRARHTGSCFSANARGPAISSGNPSRAGCGVPSSTTTPSSTPQ AIRLLACTDSDALTVTRTAR" gene complement(1265472..1266488) /locus_tag="Rv1138c" /db_xref="GeneID:885119" CDS complement(1265472..1266488) /locus_tag="Rv1138c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv1138c, (MTCI65.05c), len: 338 aa. Possible oxidoreductase (EC 1.-.-.-), similar to Q9EWQ8 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (343 aa). Also similar to many Mycobacterium tuberculosis hypothetical proteins e.g. Rv1751|P72008|MTCY04C12.35 (412 aa), fasta scores: opt: 89, E(): 4.5e-09, (24.6% identity in 358 aa overlap). TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215654.1" /db_xref="GI:15608278" /db_xref="GOA:O06538" /db_xref="UniProtKB/TrEMBL:O06538" /db_xref="GeneID:885119" /translation="MTSYDTDLLVVGGGPGGLATALHARARGLSVIVAEPRENPIDKA CGEGLMPGGLAELTSLGVDPVGLPFHGIAYVGEHRRVQARFRTGPGRGVRRTTLHAAL AARAKEQDTEWIRSRVATIQQDAHGVTAAGVRAKWLVAADGLHSAVRRAVGIKATAGT PRRYGVRWHYRLPVWSDFVEVHWSRWGEAYVTPVEPDLVGVAILSRQRPELAWFPSLA HHLQDASRGHARGCGPLRQVVSRRVAGRVLLVGDAAGYEDALTGEGISLAVKQAAAAV SAIVDDTPASYEAAWHRITRDYRLVTRGLVLASTPRAARRAIVPLCALLPTAFRYGVN ILAY" gene complement(1266485..1266985) /locus_tag="Rv1139c" /db_xref="GeneID:885110" CDS complement(1266485..1266985) /locus_tag="Rv1139c" /function="UNKNOWN" /note="Rv1139c, (MTCI65.06c), len: 166 aa. Conserved hypothetical membrane protein. Highly similar to P54158|YBPQ_BACSU hypothetical Bacillus subtilis protein, YBPQ (168 aa), FASTA scores: opt: 446, E(): 2.2e-26, (38.4% identity in 164 aa overlap). Some similarity to Mycobacterium tuberculosis hypothetical proteins, Rv0740, Rv0750. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215655.1" /db_xref="GI:15608279" /db_xref="GOA:O06539" /db_xref="UniProtKB/TrEMBL:O06539" /db_xref="GeneID:885110" /translation="MYYLLILAVVFERLAELVVAQRNARWSFAQGGKEFGRPHYVVMV ILHTALLLGCVVEPWALHRPFIPWLGWPMLAVVVASQGLRWWCVKSLGKRWNTRVIVL PHATLVRRGPYRWMRHPNYVAVVAEGFALPLVHTAWLTALVFTLANATLLTVRLRVEN SVLGYI" gene 1267347..1268195 /locus_tag="Rv1140" /db_xref="GeneID:885117" CDS 1267347..1268195 /locus_tag="Rv1140" /function="UNKNOWN" /note="Rv1140, (MTCI65.07), len: 282 aa. Probable integral membrane protein. Weak similarity in C-terminus to hypothetical Escherichia coli proteins YPRA and YPRB, possibly membrane-bound e.g. YPRA_ECOLI HYPOTHETICAL 24.3 kDa PROTEIN (URF 1) (217 aa), FASTA scores: opt: 166, E(): 0.00062, (31.0% identity in 158 aa overlap). TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215656.1" /db_xref="GI:15608280" /db_xref="UniProtKB/TrEMBL:O06540" /db_xref="GeneID:885117" /translation="MPRDYTAPRWAHAWAGEPRPARWHPANQPAHPDHSNRESPACMS QSTTPYRSSVLAEFRRAITNVAVPHHEPPGIVRRRRVVVGVTLVIGAVMLGFSLRRTP GESSFYWLTLALAAVWIAGALMSGPLHLGGICWRGRNQRPVITGTTVGLLLAGIFGVG AMIVRAIPGAAEPIARVLQFAHQGTLLPILLITLINGIAEEMFFRGALYTALGRRYPV TISTVLYVGATMASANLMLGFAAIFVGTVCALERRASGGVLAPILTHFVWGLIMVFAL PPLFAV" gene complement(1268203..1269009) /gene="echA11" /locus_tag="Rv1141c" /db_xref="GeneID:886024" CDS complement(1268203..1269009) /gene="echA11" /locus_tag="Rv1141c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215657.1" /db_xref="GI:15608281" /db_xref="GOA:O06541" /db_xref="UniProtKB/TrEMBL:O06541" /db_xref="GeneID:886024" /translation="MPDSGIAALTPVTGLNVTLTDRVLSVRINRPSSLNSLTVPILTG IADTLERAAADPVVKVVRLGGVGRGFSSGVSMSVDDVWGGGPPTAIVEEANRAVRAVA ALPHPVVAVVQGPAVGVAVSLALACDFILASDSAFFMLANTKVALMPDGGASALVAAA TGRIRAMRLALLAEQLPAREALAWGLISAVYPDSDFEAEVDKVISRLLAGPALAFAQA KNAINAAALTELEPTFARELDGQEVLLRTHDFAEGAAAFLQRRTPNFTGS" gene complement(1269152..1269958) /gene="echA10" /locus_tag="Rv1142c" /db_xref="GeneID:885458" CDS complement(1269152..1269958) /gene="echA10" /locus_tag="Rv1142c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215658.1" /db_xref="GI:15608282" /db_xref="GOA:O06542" /db_xref="UniProtKB/TrEMBL:O06542" /db_xref="GeneID:885458" /translation="MSNYRIDTRTIVPGLAVTLADGVLSVTIDRPESLNSLTKPVLAG MADAIEGAATDPRVKVVRLGGAGRGFSSGGAISVDDVWASGPPTDTVAEANRTVRAIV ALPQPVVAVVQGPTVGCGVSLALACDLVLASDNAFFMLAHTNVGLMPDGGASALVQAA IGRIRAMHMALLPDRVPAAEALSWGLVSAVYPAADFDAEVDKLISRLLAGPALAIAKT KNAINAATLTELAPTLLRELDGQALLLRTDDFAEGATAFQQRRTPMFTGR" gene 1270062..1271144 /gene="mcr" /locus_tag="Rv1143" /db_xref="GeneID:885067" CDS 1270062..1271144 /gene="mcr" /locus_tag="Rv1143" /EC_number="5.1.99.4" /function="REQUIRED FOR BILE ACID SYNTHESIS AND FOR CATABOLISM OF BRANCHED-CHAIN FATTY ACIDS" /note="Rv1143, (MTCI65.10), len: 360 aa. Probable mcr, alpha-methylacyl-CoA racemase (EC 5.1.99.4). Strong similarity to other alpha-methylacyl-CoA racemases and also some similarity to L-carnitine dehydratase (EC 4.2.1.89) e.g. U89905|g1552373 methylacyl-CoA racemase alpha from Norway rat (361 aa), FASTA scores: opt: 1035, E():0, (47.2% identity in 339 aa overlap). Equivalent to (but longer than) Z94723|MLCB33_13 Mycobacterium leprae (253 aa) (85.3% identity in 245 aa overlap). Also similar to Mycobacterium tuberculosis putative racemases Rv0855, Rv1866, Rv3272. TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="alpha-methylacyl-CoA racemase" /protein_id="NP_215659.1" /db_xref="GI:15608283" /db_xref="GOA:O06543" /db_xref="UniProtKB/TrEMBL:O06543" /db_xref="GeneID:885067" /translation="MAGPLSGLRVVELAGIGPGPHAAMILGDLGADVVRIDRPSSVDG ISRDAMLRNRRIVTADLKSDQGLELALKLIAKADVLIEGYRPGVTERLGLGPEECAKV NDRLIYARMTGWGQTGPRSQQAGHDINYISLNGILHAIGRGDERPVPPLNLVGDFGGG SMFLLVGILAALWERQSSGKGQVVDAAMVDGSSVLIQMMWAMRATGMWTDTRGANMLD GGAPYYDTYECADGRYVAVGAIEPQFYAAMLAGLGLDAAELPPQNDRARWPELRALLT EAFASHDRDHWGAVFANSDACVTPVLAFGEVHNEPHIIERNTFYEANGGWQPMPAPRF SRTASSQPRPPAATIDIEAVLTDWDG" gene 1271156..1271908 /locus_tag="Rv1144" /db_xref="GeneID:885932" CDS 1271156..1271908 /locus_tag="Rv1144" /function="UNKNOWN; SUPPOSED INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1144, (MTCI65.11), len: 250 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases e.g. NP_104056.1|NC_002678 3-hydroxyacyl-CoA dehydrogenase type II from Mesorhizobium loti (253 aa); NP_251244.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (255 aa); AAK15008.1|AF233685_1|AF233685 short chain L-3-hydroxyacyl-CoA dehydrogenase from Mus musculus (261 aa); HSU73514|g1778354|XH98G2 human short-chain alcohol dehydrogenase from Homo sapiens (261 aa), FASTA scores: opt: 875, E(): 0, (60.1% identity in 253 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. TBparse score is 0.864." /codon_start=1 /transl_table=11 /product="short-chain type dehydrogenase/reductase" /protein_id="NP_215660.1" /db_xref="GI:15608284" /db_xref="GOA:O06544" /db_xref="UniProtKB/TrEMBL:O06544" /db_xref="GeneID:885932" /translation="MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVG GLGDRARFAQADVTDEAAVSNALELADSLGPVRVVVNCAGTGNAIRVLSRDGVFPLAA FRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQIGQAAYSASK GGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAEAKASLGQQVPHPSRLGNP DEYGALVLHIIENPMLNGEVIRLDGAIRMAPR" misc_feature 1271582..1271668 /locus_tag="Rv1144" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 1272423..1273334 /gene="mmpL13a" /locus_tag="Rv1145" /db_xref="GeneID:885798" CDS 1272423..1273334 /gene="mmpL13a" /locus_tag="Rv1145" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /note="Rv1145, (MTCI65.12), len: 303 aa. Probable mmpL13a, conserved transmembrane transport protein (see citation below), member of RND superfamily, showing some similarity to putative Mycobacterial and Streptomyces membrane proteins e.g. MTCY987|g1781238 from Mycobacterium tuberculosis (962 aa), FASTA scores: opt: 213, E(): 1.9e-06, (28.0% identity in 296 aa overlap); etc. Strong similarity to U92075|MMU92075_5 hypothetical protein from Mycobacterium marinum (256 aa), FASTA scores: opt: 957, E(): 0, (57.6% identity in 257 aa overlap). Should continue as mmpL13B|Rv1146, but frameshift required. Sequence has been checked and is identical in M. tuberculosis strain CDC1551, and Mycobacterium bovis strain AF2122/97. BELONGS TO THE MMPL FAMILY. TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL13A" /protein_id="NP_215661.1" /db_xref="GI:15608285" /db_xref="GOA:O06545" /db_xref="UniProtKB/TrEMBL:O06545" /db_xref="GeneID:885798" /translation="MLQRIARLAIAAPRRIIGFAVFVFIAAAVFGVPVADSLSPGGFQ DPRSESARAIEVLTDKFGQSGQKMLIVVTAAAGADSPPAREVGTDIVEVLRRSPLVYN VTSPWTVPPTAAADLLSTDGKSGLIVVNVKGGENDAQNHAQTLSDEVAHDRDGVTVRA GGSAMEYAQINRQNKDDLLVMELIAIPLSFLVLIWVFGGLLAAGLPMAQAVLAVVGSM AVLRLVTFATEVSTFALNLSTALGLALAIDYTLLIVSRYRDELAEGSDRDEALIRTMA LRGARCCFRRSPWRCRCRRLRCSRCTF" gene 1273355..1274767 /gene="mmpL13b" /locus_tag="Rv1146" /db_xref="GeneID:885575" CDS 1273355..1274767 /gene="mmpL13b" /locus_tag="Rv1146" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /note="Rv1146, (MTCI65.13), len: 470 aa. Probable mmpL13b, conserved transmembrane transport protein (see citation below), member of RND superfamily, showing some similarity to putative Mycobacterial and Streptomyces membrane proteins e.g. Q53902|C40046 antibiotic transport-associated protein from Streptomyces coelicolor (711 aa), FASTA scores: opt: 193, E(): 2.1e-05, (28.9% identity in 394 aa overlap); etc. Could be in frame with previous ORF mmpL13A|Rv1145, but no sequence error apparent to account for this; sequence is identical in M. tuberculosis strain CDC1551, and Mycobacterium bovis strain AF2122/97. BELONGS TO THE MMPL FAMILY. TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL13B" /protein_id="NP_215662.1" /db_xref="GI:15608286" /db_xref="UniProtKB/TrEMBL:O06546" /db_xref="GeneID:885575" /translation="MATVAFVATASIVITPAAIVLLGPRLDALDVRRLVRRLLGRPDP VHKPVKQLFWYRSSKFVMRRWLPVGTAVVALLVLLGLPFLSVKWGFPDDRVLPRSASA RQVGDILRDDFGHDPATQIPIVVPDARGLGPVELDSYAAELSRVPDVSAVAAPTGTFV DGSWVGTPRGATGLAEGSAFLTVSSTAPLFSRASDIQLKRLHQVAGPAGRSVVMAGVA QVNRDSVDAVTDRLPMVLGLIAAITYVLLFLLTGSVVLPAKALVCNVLSLTAAFGALV WIFQEGHFGALGTTPSGTLVANMPVLLFCIAFGLSMDYEVFLVSRIREYWLESGAARP ARRSVAEVHAANDESVALGVARTGRVITAAALVMSMSFAALIAAHVSFMRMFGLGLTL AVAADATLVRMVVVPAFMHVTGRWNWWAPRPLAWLHERFGVSEAAEPVSRRRSHAGGL GKIAGRSDGQTIPASLTRNG" gene 1274900..1275550 /locus_tag="Rv1147" /db_xref="GeneID:885471" CDS 1274900..1275550 /locus_tag="Rv1147" /function="UNKNOWN" /note="Rv1147, (MTCI65.14), len: 216 aa. Conserved hypothetical protein, similar to many conserved hypothetical proteins, and some similarity to several methyltransferases e.g. Q05197|PMTA_RHOSH phosphatidylethanolamine N-methyltransferase (EC 2.1.1.17) from R. sphaeroides (203 aa), FASTA scores: opt: 156, E( ): 0.00073, (27.6% identity in 156 aa overlap). TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215663.1" /db_xref="GI:15608287" /db_xref="GOA:O06547" /db_xref="UniProtKB/TrEMBL:O06547" /db_xref="GeneID:885471" /translation="MTSGAAASASRVDHPLFARIWPVVAAHEAEAIRALRRENLAGLS GRVLEVGAGVGTNFAYYPVAVEQVIAMEPEPRLAAKARIAAADAPVPIVVTDKTVEEF RDTETFDAVVCSLVLCSVSDPGAVLAHLRSLLRRGGELRYLEHVASAGARGRVQRFVD ATFWPRLAGNCHTHRHTERAILDAGFVVDSSRREWAFPAWVPLPVSELALGRAHRT" repeat_region 1276296..1277643 /note="REP-4, len: 1348 bp. REP165, member of REP13E12 family.; REP-4" /rpt_type=DIRECT gene complement(1276300..1277748) /locus_tag="Rv1148c" /db_xref="GeneID:885451" CDS complement(1276300..1277748) /locus_tag="Rv1148c" /function="UNKNOWN" /note="Rv1148c, (MTCI65.15c), len: 482 aa. Conserved hypothetical ORF in REP13E12 degenerate repeat, nearly identical to other hypothetical Mycobacterium tuberculosis proteins in REP13E12 repeats, although similarity extends upstream past proposed f-Met start. Very similar to other REP13E12 proteins e.g. Rv1945, Rv3467, Rv0094c, Rv1128c etc. TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215664.1" /db_xref="GI:15608288" /db_xref="UniProtKB/Swiss-Prot:O06548" /db_xref="GeneID:885451" /translation="MSETFCLTDHSEPMTARFLSVVLRRIRGMRSDTREEISAALDAY HASLSRVLDLKCDALTTPELLACLQRLEVERRRQGAAEHALINQLAGQACEEELGGTL RTALANRLHITPGEASRRIAEAEDLGERRALTGEPLPAQLTATAAAQREGKIGREHIK EIQAFFKELSAAVDLGIREAAEAQLAELATSRRPDHLHGLATQLMDWLHPDGNFSDQE RARKRGITMGKQEFDGMSRISGLLTPELRATIEAVLAKLAAPGACNPDDQTPLVDDTP DADAVRRDTRSQAQRNHDAFLAALRGLLASGELGQHKGLPVTIVVSTTLKELEAATGK GVTGGGSRVPMSDLIRMASHANHYLALFDGAKPLALYHTKRLASPAQRIMLYAKDRGC SRPGCDAPAYHSEVHHVTPWTTTHRTDINDLTLACGPDNRLVEKGWKTRKNAHGDTEW LPPPHLDHGQPRINRYHHPAKILCEQDDDEPH" repeat_region 1277843..1278826 /note="IS-LIKE-2, len: 984 bp. Insertion sequence element IS-LIKE." /mobile_element="insertion sequence:IS-LIKE-2" repeat_region 1277843..1277846 /note="4 bp direct repeat, CTAG, generated by IS element on insertion. Proposed by Mariani et al. 1993. J. Gen. Microbiol., 139: 1767-1772. Note that as motif palindromic could be part of inverted repeat itself." repeat_region 1277847..1277863 /note="17 bp Inverted repeat at the left end of putative IS-LIKE-2 element : GGCGTGTCTCCCAAATT. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139: 1767-1772." gene 1277893..1278300 /locus_tag="Rv1149" /db_xref="GeneID:885164" CDS 1277893..1278300 /locus_tag="Rv1149" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT" /experiment="experimental evidence, no additional details recorded" /note="Rv1149, (MTCI65.16), len: 135 aa. Possible transposase. Identical to 117 aa N-terminal region of S21394|X65618 transposase of Mycobacterium tuberculosis (308 aa), FASTA scores: opt: 823, E(): 0, (99.1% identity in 117 aa overlap). Second copy is Rv1042c|MTCY10G2.07. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215665.1" /db_xref="GI:15608289" /db_xref="UniProtKB/TrEMBL:P96359" /db_xref="GeneID:885164" /translation="MTRVGVISDEFWAVVEPLMPSHEGKPGRRFSDHRLILEGIAWRF RTGSPWRDLPAEFGPWQTVWKRHHRWSLDGTCDEVFAHVAAVFGVDAEVAEDIEKLLS VDSTNVRAHQHSAGACSDTLATGGTVGLQEIRR" gene 1278269..1278817 /locus_tag="Rv1150" /pseudo /db_xref="GeneID:886020" misc_feature 1278269..1278817 /locus_tag="Rv1150" /experiment="experimental evidence, no additional details recorded" /note="Rv1150, (MTCI65.17), len: 183 aa. Possible fragment of transposase (pseudogene). Identical to C-terminal part of S21394 transposase of putative Mycobacterium tuberculosis IS element (308 aa), FASTA scores: opt: 959, E(): 0, (99.3% identity in 145 aa overlap). The transposase described here may be made by a -1 frame shifting mechanism during translation that fuses Rv1149|MTCI65.16 and Rv1150|MTCI65.17. No evidence found to account for discrepancy with previously published sequence. Second copy is Rv1041c|MTCY10G2.08. TBparse score is 0.914.;POSSIBLE TRANSPOSASE (FRAGMENT)" /pseudo repeat_region complement(1278800..1278816) /note="17 bp Inverted repeat at the right end of putative IS-LIKE-2 element :GGCGTGTCTCCCAATTT. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139: 1767-1772" repeat_region 1278817..1278820 /note="4 bp direct repeat, CTAG generated by IS element on insertion. Proposed by Mariani et al. 1993. J. Gen. Microbiol. 139: 1767-1772. Note that as motif palindromic could be part of inverted repeat itself." gene complement(1278904..1279617) /locus_tag="Rv1151c" /db_xref="GeneID:886026" CDS complement(1278904..1279617) /locus_tag="Rv1151c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Modulates the activities of several enzymes which are inactive in their acetylated form" /codon_start=1 /transl_table=11 /product="NAD-dependent deacetylase" /protein_id="NP_215667.1" /db_xref="GI:15608291" /db_xref="GOA:P66813" /db_xref="UniProtKB/Swiss-Prot:P66813" /db_xref="GeneID:886026" /translation="MRVAVLSGAGISAESGVPTFRDDKNGLWARFDPYELSSTQGWLR NPERVWGWYLWRHYLVANVEPNDGHRAIAAWQDHAEVSVITQNVDDLHERAGSGAVHH LHGSLFEFRCARCGVPYTDALPEMPEPAIEVEPPVCDCGGLIRPDIVWFGEPLPEEPW RSAVEATGSADVMVVVGTSAIVYPAAGLPDLALARGTAVIEVNPEPTPLSGSATISIR ESASQALPGLLERLPALLK" gene 1279655..1280020 /locus_tag="Rv1152" /db_xref="GeneID:885985" CDS 1279655..1280020 /locus_tag="Rv1152" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1152, (MTCI65.19), len: 121 aa (Start uncertain). Probable transcriptional regulatory protein, some similarity to others e.g. YHCF_BACSU HYPOTHETICAL TRANSCRIPTIONAL REGULATOR (121 aa), FASTA scores: opt: 187, E(): 1.9e-06, (34.9% identity in 106 aa overlap). TBparse score is 0.876. Helix turn helix motif from aa 42-63 (+3.10 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215668.1" /db_xref="GI:15608292" /db_xref="GOA:O06550" /db_xref="UniProtKB/TrEMBL:O06550" /db_xref="GeneID:885985" /translation="MELRDWLRVDVKAGKPLFDQLRTQVIDGVRAGALPPGTRLPTVR DLAGQLGVAANTVARAYRELESAAIVETRGRFGTFISRFDPTDAAMAAAAKEYVGVAR ALGLTKSDAMRYLTHVPDD" gene complement(1279998..1280846) /gene="omt" /locus_tag="Rv1153c" /db_xref="GeneID:885994" CDS complement(1279998..1280846) /gene="omt" /locus_tag="Rv1153c" /EC_number="2.1.1.-" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN LIPID METABOLISM" /note="Rv1153c, (MTCI65.20c), len: 282 aa. Probable omt, O-methyltransferase (EC 2.1.1-), similar to TCMP_STRGA|P39887 Tetracenomycin polyketide synthesis O-methyltransferase tcmP (EC 2.1.1.-) from Streptomyces glaucescens (270 aa), FASTA scores: opt: 368, E(): 1.7e-17, (31.3% identity in 233 aa overlap). TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="O-methyltransferase" /protein_id="NP_215669.1" /db_xref="GI:15608293" /db_xref="GOA:O06551" /db_xref="UniProtKB/TrEMBL:O06551" /db_xref="GeneID:885994" /translation="MSAHKPAKQRVALTGVSETALLTLNARAAEARRRDAIIDDPMAV ALVESIDFDFAKFGPTGQGFALRARAFDMAAQHYLDQHPAATVVALAEGLQTSFWRLD VAIPGGQFRWLTVDLPPIVDLRTRLLPSSPRVSVCAQSALDYSWMDSVDPAGGVFITA EGLLMYLQPEQALGLIAQCAQTFPGGQMLFDLPPRWFAGWSRLGLRTSLRYKVPRMPF SMSVAQAADLVNKVPGVVAVRDLRVPPGRGLWVNMALSTVYRLPVFDPLRPCLTLLEF SRPARG" gene complement(1280843..1281484) /locus_tag="Rv1154c" /db_xref="GeneID:885991" CDS complement(1280843..1281484) /locus_tag="Rv1154c" /function="UNKNOWN" /note="Rv1154c, (MTCI65.21c), len: 213 aa. Hypothetical unknown protein, start uncertain. TBparse score is 0.911" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215670.1" /db_xref="GI:15608294" /db_xref="UniProtKB/TrEMBL:O06552" /db_xref="GeneID:885991" /translation="MEFPLITANSLSSKTWRAMPRAYVAVASFSGGLVQSGMAKFAAF LRGVNVGGVNLKMAEVATALTDAGFCNVRTILASGNVLLESTCGAAEVREKTEATLRE RFGYDAWALIYDVDTVRTIVTAYPFECELEGYQSYVTFVADAAILDELSALADTAGPD ENISRGPDPLGVLYWQVPKGSTLDSTIGQTMGKKRYKSSTTTRNLRTLAKVLR" gene 1281429..1281872 /locus_tag="Rv1155" /db_xref="GeneID:885604" CDS 1281429..1281872 /locus_tag="Rv1155" /function="UNKNOWN" /note="Rv1155, (MTCI65.22), len: 147 aa. Conserved hypothetical protein. Similar to other hypothetical proteins e.g. AL079356|SC6G9.20 Streptomyces coelicolor (144 aa), FASTA scores: opt: 478, E(): 2.8e-26, (55.7% identity in 140 aa overlap); and Mycobacterium tuberculosis proteins Rv1875, Rv0121c, Rv2074. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215671.1" /db_xref="GI:15608295" /db_xref="UniProtKB/TrEMBL:O06553" /db_xref="GeneID:885604" /translation="MARQVFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRK LLIQVSIAEPRAKTRNLRRDPRASILVDADDGWSYAVAEGTAQLTPPAAAPDDDTVEA LIALYRNIAGEHSDWDDYRQAMVTDRRVLLTLPISHVYGLPPGMR" gene 1282306..1282893 /locus_tag="Rv1156" /db_xref="GeneID:885593" CDS 1282306..1282893 /locus_tag="Rv1156" /function="UNKNOWN" /note="Rv1156, (MTCI65.23), len: 195 aa. Conserved hypothetical protein, highly similar to CAC32318.1|AL583944 conserved hypothetical protein from Streptomyces coelicolor (197 aa). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215672.1" /db_xref="GI:15608296" /db_xref="GOA:O06554" /db_xref="UniProtKB/TrEMBL:O06554" /db_xref="GeneID:885593" /translation="MPNLQLVQEPAADALLNANPFALLVGMLLDQQVPMETAFAGPKK IADRMGSFDAGDIADYDPDKFVALCSERPAIHRFPGSMAKRIQALAQIIVDRYDGDAA ALWTAGEPDGNELLRRLKGLPGFGEQKARIFLALLGKQYGVTPKGWQVAAGEFGQPGT YLSVADIVDAGSLGQVRSHKRQRKAAAKAEGKAPT" gene complement(1283056..1284171) /locus_tag="Rv1157c" /db_xref="GeneID:885878" CDS complement(1283056..1284171) /locus_tag="Rv1157c" /function="UNKNOWN" /note="Rv1157c, (MTCI65.24c), len: 371 aa. Conserved hypothetical Ala-, Pro-rich protein, similar to other proline rich proteins and extensins e.g. GBU04267|g451543 sea-island cotton proline-rich protein of cotton fiber (214 aa), FASTA scores: opt: 305, E(): 3.9e-05, (35.7% identity in 182 aa overlap). Has hydrophobic stretch at N-terminus suggestive of secretion signal. First start taken. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215673.1" /db_xref="GI:15608297" /db_xref="GOA:O06555" /db_xref="UniProtKB/TrEMBL:O06555" /db_xref="GeneID:885878" /translation="MRRLTNTEHRENTTVASTWSVCKGLAAVVITSAAAFALCPNAAA DPATPQPNPTQQLPGLPALAQLSPIIQQAAMNPAQATQLLMAAASAFAGNPAVPTESK NVASSVNQFVAEPTNPDSAALGVPAPHGVALPEAIPVPHVPPLGAEPGVQAHLPTGID PSHAAGPAPAVAPTVTPPVAAPPASAPAPAPDAAQPVAVPGPPPAPPAPRAAAPAPAS AAPAPAAAPAPASGFGADAPPTQDFMYPSIGPNCVADGSNSIATALSVAGPAKIPLPG PGPGQTAYVFTAVGTPGPADVQRLPLNVTWVNLTTGKSGSATLRPRSDINPDGPTTLT VIADTGSGSIMSTIFGQVTTKDRQCQFMPTIGSTVVP" gene complement(1284179..1284862) /locus_tag="Rv1158c" /db_xref="GeneID:888930" CDS complement(1284179..1284862) /locus_tag="Rv1158c" /function="UNKNOWN" /note="Rv1158c, (MTCI65.25c), len: 227 aa. Conserved hypothetical Ala-, Pro-rich protein, similar to other proline rich proteins and extensins e.g. MMSAP62|g633250 house mouse (485 aa), FASTA scores: opt: 367, E(): 1.2e-08, (36.3% identity in 212 aa overlap). Has hydrophobic stretch at N-terminus suggestive of secretion signal. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215674.1" /db_xref="GI:15608298" /db_xref="UniProtKB/TrEMBL:O06556" /db_xref="GeneID:888930" /translation="MPTIWTFVRAAAVLVGSSAALLTGGIAHADPAPAPAPAPNIPQQ LISSAANAPQILQNLATALGATPPLSAPKVAEPAPAAPGITATFPGLTPAAPAAAAAP ALTPSIPGVNAPIPGITPAAPALPVTAPAAAPTIPGVNAPIPGITAPAPAAAAVPASV PGVPSAKVDLPQLPYLPLQVPQQLSLPADLPALASGVIPAAPIAPTPPAPGAPALPPG PPSLLAALP" gene 1284992..1286287 /gene="pimE" /locus_tag="Rv1159" /db_xref="GeneID:885899" CDS 1284992..1286287 /gene="pimE" /locus_tag="Rv1159" /function="UNKNOWN" /note="involved in the fifth mannose transfer of phosphatidylinositol mannoside synthesis" /codon_start=1 /transl_table=11 /product="mannosyltransferase" /protein_id="NP_215675.1" /db_xref="GI:15608299" /db_xref="UniProtKB/TrEMBL:O06557" /db_xref="GeneID:885899" /translation="MCRTLIDGPVRSAIAKVRQIDTTSSTPAAARRVTSPPARETRAA VLLLVLSVGARLAWTYLAPNGANFVDLHVYVSGAASLDHPGTLYGYVYADQTPDFPLP FTYPPFAAVVFYPLHLVPFGLIALLWQVVTMAALYGAVRISQRLMGGTAETGHFAAML WTAIAIWIEPLRSTFDYGQINVLLMLAALWAVYTPRWWLSGLLVGVASGVKLTPAITA VYLVGVRRLHAAAFSVVVFLATVGVSLLVVGDEARYYFTDLLGDAGRVGPIATSFNQS WRGAISRILGHDAGFGPLVLAAIASTAVLAILAWRALDRSDRLGKLLVVELFGLLLSP ISWTHHWVWLVPLMIWLIDGPARERPGARILGWGWLVLTIVGVPWLLSFAQPSIWQIG RPWYLAWAGLVYVVATLATLGWIAASERYVRIRPRRMAN" gene complement(1286284..1286568) /gene="phhB" /locus_tag="Rv1159A" /db_xref="GeneID:3205109" CDS complement(1286284..1286568) /gene="phhB" /locus_tag="Rv1159A" /EC_number="4.2.1.96" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="4-alpha-hydroxytetrahydrobiopterin dehydratase activity; catalyzes the formation of (6R)-6-(L-erythro-1,2-dihydroxypropyl)-7, 8-dihydro-6H-pterin from (6R)-6-(L-erythro-1,2-dihydroxypropyl)-5,6,7, 8-tetrahydro-4a-hydroxypterin; functions in recycling tetrahydrobiopterin (BH4) in phenylalanine hydroxylase reaction" /codon_start=1 /transl_table=11 /product="pterin-4-alpha-carbinolamine dehydratase" /protein_id="YP_177641.1" /db_xref="GI:57116835" /db_xref="GOA:P58241" /db_xref="UniProtKB/Swiss-Prot:P58241" /db_xref="GeneID:3205109" /translation="MAVLTDEQVDAALHDLNGWQRAGGVLRRSIKFPTFMAGIDAVRR VAERAEEVNHHPDIDIRWRTVTFALVTHAVGGITENDIAMAHDIDAMFGA" gene 1286595..1287020 /gene="mutT2" /locus_tag="Rv1160" /db_xref="GeneID:888485" CDS 1286595..1287020 /gene="mutT2" /locus_tag="Rv1160" /EC_number="3.6.1.-" /function="INVOLVED IN THE GO SYSTEM RESPONSIBLE FOR REMOVING AN OXIDATIVELY DAMAGED FORM OF GUANINE (7,8-DIHYDRO-8-OXOGUANINE) FROM DNA AND THE NUCLEOTIDE POOL. 8-OXO-DGTP IS INSERTED OPPOSITE DA AND DC RESIDUES OF TEMPLATE DNA WITH ALMOST EQUAL EFFICIENCY THUS LEADING TO A.T TO G.C TRANSVERSIONS. MUTT SPECIFICALLY DEGRADES 8-OXO-DGTP TO THE MONOPHOSPHATE." /note="Rv1160, (MTCI65.27), len: 141 aa. Probable mutT2, mutator protein or homolog (EC 3.6.1.-) (see citation below). More similar to D908197|g1742860 MutT homolog from Escherichia coli (135 aa), FASTA scores: opt: 226, E():1.1e-08, (39.7% identity in 116 aa overlap); than to MUTT_ECOLI|P08337 MUTATOR MUTT PROTEIN from Escherichia coli (129 aa), FASTA scores: opt: 180, E(): 1.2e-05, (27.1% identity in 129 aa overlap). Contains PS00893 mutT domain signature. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="7,8-dihydro-8-oxoguanine-triphosphatase" /protein_id="NP_215676.1" /db_xref="GI:15608300" /db_xref="GOA:O06558" /db_xref="UniProtKB/TrEMBL:O06558" /db_xref="GeneID:888485" /translation="MLNQIVVAGAIVRGCTVLVAQRVRPPELAGRWELPGGKVAAGET ERAALARELAEELGLEVADLAVGDRVGDDIALNGTTTLRAYRVHLLGGEPRARDHRAL CWVTAAELHDVDWVPADRGWIADLARTLNGSAADVHRRC" misc_feature 1286703..1286762 /gene="mutT2" /locus_tag="Rv1160" /note="PS00893 mutT domain signature" gene 1287328..1291026 /gene="narG" /locus_tag="Rv1161" /db_xref="GeneID:885573" CDS 1287328..1291026 /gene="narG" /locus_tag="Rv1161" /EC_number="1.7.99.4" /function="NITRATE REDUCTION [CATALYTIC ACTIVITY: Nitrite + acceptor = nitrate + reduced acceptor]." /note="Rv1161, (MTCI65.28), len: 1232 aa. Probable narG, respiratory nitrate reductase alpha chain (EC 1.7.99.4). Similar to others e.g. NARG_BACSU NITRATEREDUCTASE ALPHA CHAIN from Bacillus subtilis (1228 aa), FASTA scores: opt: 4218, E(): 0, (50.3% identity in 1229 aa overlap); etc. Also highly similar to N-terminal part of Rv1736c|MTCY04C12.21c|NARX PROBABLE NITRATE REDUCTASE from Mycobacterium tuberculosis (85.1% identity in 281 aa overlap). Contains prokaryotic molybdopterin oxidoreductase signatures 1 and 2 (PS00551, PS00490). TBparse score is 0.908. BELONGS TO THE PROKARYOTIC MOLYBDOPTERIN-CONTAINING OXIDOREDUCTASE FAMILY." /codon_start=1 /transl_table=11 /product="respiratory nitrate reductase subunit alpha NarG" /protein_id="NP_215677.1" /db_xref="GI:15608301" /db_xref="GOA:O06559" /db_xref="UniProtKB/TrEMBL:O06559" /db_xref="GeneID:885573" /translation="MTVTPHVGGPLEELLERSGRFFTPGEFSADLRTVTRRGGREGDV FYRDRWSHDKVVRSTHGVNCTGSCSWKIYVKDGIITWETQQTDYPSVGPDRPEYEPRG CPRGASFSWYSYSPTRVRYPYARGVLVEMYREAKTRLGDPVLAWADIQADPERRRRYQ QARGKGGLVRVSWAEASEMVAAAHVHTIKTYGPDRVAGFSPIPAMSMVSHAAGSRFVE LIGGVMTSFYDWYADLPVASPQVFGDQTDVPESGDWWDASYLVMWGSNVPITRTPDAH WMAEARYRGAKVVVVSPDYADNTKFADEWVRCAAGTDTALAMAMGHVILSECYVRNQV PFFVDYVRRYTDLPFLIKLEKRGDLLVPGKFLTAADIGEESENAAFKPALLDELTNTV VVPQGSLGFRFGEDGVGKWNLDLGSVVPALSVEMDKAVNGDRSAELVTLPSFDTIDGH GETVSRGVPVRRAGKHLVCTVFDLMLAHYGVARAGLPGEWPTGYHDRTQQNTPAWQES ITGVPAAQAIRFAKEFARNATESGGRSMIIMGGGICHWFHSDVMYRSVLALLMLTGSM GRNGGGWAHYVGQEKVRPLTGWQTMAMATDWSRPPRQVPGASYWYAHTDQWRYDGYGA DKLASPVGRGRFAGKHTMDLLTSATAMGWSPFYPQFDRSSLDVADEARAAGRDVGDYV AEQLAQHKLKLSITDPDNPVNWPRVLTVWRANLIGSSGKGGEYFLRHLLGTDSNVQSD PPTDGVHPRDVVWDSDIPEGKLDLIMSIDFRMTSTTLVSDVVLPAATWYEKSDLSSTD MHPYVHSFSPAIDPPWETRSDFDAFAAIARAFSALAKRHLGTRTDVVLTALQHDTPDE MAYPDGTERDWLATGEVPVPGRTMSKLTVVERDYTAIYDKWLTLGPLIDQFGMTTKGY TVHPFREVSELAANFGVMNSGVAVGRPAITTAKRMADVILALSGTCNGRLAVEGFLEL EKRTGQRLAHLAEGSEERRITYADTQARPVPVITSPEWSGSESGGRRYAPFTINIEHL KPFHTLTGRMHFYLAHDWVEELGEQLPVYRPPLDMARLFNQPELGPTDDGLGLTVRYL TPHSKWSFHSTYQDNLYMLSLSRGGPTMWMSPGDAAKINVRDNDWVEAVNANGIYVCR AIVSHRMPEGVVFVYHVQERTVDTPRTETNGKRGGNHNALTRVRIKPSHLAGGYGQHA FAFNYLGPTGNQRDEVTVVRRRSQEVRY" misc_feature 1287499..1287555 /gene="narG" /locus_tag="Rv1161" /note="PS00551 Prokaryotic molybdopterin oxidoreductases signature 1" misc_feature 1289644..1289697 /gene="narG" /locus_tag="Rv1161" /note="PS00490 Prokaryotic molybdopterin oxidoreductases signature 2" gene 1291065..1292741 /gene="narH" /locus_tag="Rv1162" /db_xref="GeneID:888265" CDS 1291065..1292741 /gene="narH" /locus_tag="Rv1162" /EC_number="1.7.99.4" /function="NITRATE REDUCTION [CATALYTIC ACTIVITY: Nitrite + acceptor = nitrate + reduced acceptor]." /note="Rv1162, (MTCI65.29), len: 558 aa. Probable narH, respiratory nitrate reductase beta chain (EC 1.7.99.4). Similar to others e.g. NARH_BACSU|P42176 NITRATE REDUCTASE BETA CHAIN from Bacillus subtilis (487 aa), FASTA scores: opt: 2049, E(): 0, (56.8% identity in 488 aa overlap); etc. Contains PS00190 cytochrome c family heme-binding site signature. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="respiratory nitrate reductase subunit beta NarH" /protein_id="NP_215678.1" /db_xref="GI:15608302" /db_xref="GOA:O06560" /db_xref="UniProtKB/TrEMBL:O06560" /db_xref="GeneID:888265" /translation="MKVMAQMAMVMNLDKCIGCHTCSVTCKQAWTNRSGTEYVWFNNV ETRPGVGYPRTYEDQERWRGGWVRDKKGRLRLRDGGRIHKLLRIFANPKLPTIGDYYE PWTYDYENLTSAPAGDTFPTAAPRSLISGNPMKVSWGSNWDDNLAGSPEIVPNDPVLK KVNQVNQEVKLKLEETFMFYLPRICEHCLNPSCVASCPSGAMYKRTEDGIVLVDQDRC RGWRMCVSGCPYKKVYFNHKTGKAEKCTLCYPRIEVGLPTVCSETCVGRLRYLGLVLY DVDQVLQAASVESDTDLYEAQRRILLDPHDPRVIAGARAEGIADEWIEAAQRSPVYAL INTYRVALPLHPEYRTMPMVWYIPPLSPVVDAVSRDGHDGEDLGNLFGALDALRIPIA YLAELFTAGDTEVVAGVLRRLAAMRCYMRDINLGRETQPHIPESVGMTEEQIYQMYRL LAVAKYEERYVIPTSYAGELPAAAMTDDMGCSLSVDGGPGMYESGPFGQGSPTPVPIA VESFHALQHAGSAATGGAGRSRVNLLNWDPNGAAAGLFPEPQPSKDVVQR" misc_feature 1291110..1291127 /gene="narH" /locus_tag="Rv1162" /note="PS00190 Cytochrome c family heme-binding site signature" gene 1292798..1293403 /gene="narJ" /locus_tag="Rv1163" /db_xref="GeneID:885890" CDS 1292798..1293403 /gene="narJ" /locus_tag="Rv1163" /EC_number="1.7.99.4" /function="NITRATE REDUCTAION [CATALYTIC ACTIVITY: Nitrite + acceptor = nitrate + reduced acceptor]." /note="Rv1163, (MTCI65.30), len: 201 aa. Probable narJ, respiratory nitrate reductase delta chain (EC 1.7.99.4). Similar to others e.g. P42178|NARJ_BACSU NITRATE REDUCTASE DELTA CHAIN from Bacillus subtilis (184 aa), FASTA scores: opt: 254, E(): 1.9e-10, (31.8% identity in 179 aa overlap); etc. Strong similarity to region from aa 260 - 410 of Rv1736c|MTCY04C12.21c|NARX PROBABLE NITRATE REDUCTASE from Mycobacterium tuberculosis (64.8% identity in 159 aa overlap). TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="respiratory nitrate reductase subunit delta NarJ" /protein_id="NP_215679.1" /db_xref="GI:15608303" /db_xref="GOA:O06561" /db_xref="UniProtKB/TrEMBL:O06561" /db_xref="GeneID:885890" /translation="MWQSASLLLAYPDDGLAERLHMVDALRAHQTGPAAALLGRTVAE LRALAPMAAAAQYVETFDMRRRSTMYLTYWTAGDTRNRGREMLAFATAYRDAGVKPPR TEAPDYLPVVLEFAATVDPEAGRRLLTEHRVPIDVLRGALADAKSPYEYTVAAICETL PAATNQEVRRAQRLAQSGPPAEAVGLQPFTLTVPPKRAEGA" gene 1293406..1294146 /gene="narI" /locus_tag="Rv1164" /db_xref="GeneID:888935" CDS 1293406..1294146 /gene="narI" /locus_tag="Rv1164" /EC_number="1.7.99.4" /function="NITRATE REDUCTION [CATALYTIC ACTIVITY: Nitrite + acceptor = nitrate + reduced acceptor]." /note="Rv1164, (MTCI65.31), len: 246 aa. Probable narI, respiratory nitrate reductase gamma chain (EC 1.7.99.4). Similar to others e.g. NARI_BACSU|P42177 NITRATE REDUCTASE GAMMA CHAIN from Bacillus subtilis (223 aa), FASTA scores: opt: 652, E(): 0; (41.6% identity in 221 aa overlap); etc. Highly similar to C-terminal part of Rv1736c|MTCY04C12.21c|NARX PROBABLE NITRATE REDUCTASE (GAMMA CHAIN) from Mycobacterium tuberculosis (68.6% identity in 239 aa overlap). TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="respiratory nitrate reductase subunit gamma NarI" /protein_id="NP_215680.1" /db_xref="GI:15608304" /db_xref="GOA:O06562" /db_xref="UniProtKB/TrEMBL:O06562" /db_xref="GeneID:888935" /translation="MAVLDLVEIFWDAAPYVVVAIAVVGTWWRYRYDKFGWTTRSSQL YESRLLSIGSPMFHFGSLLVIMGHVMGLFIPDSWTRAFGMSDHLYHLQALLLGAPAGF ATLLGIGLLIYRRRIQTPVWLATTRNDKLMYLVLVCAIVAGLACTLMGATHEGDMHDY RRSVSVWFRSIWMLAPRGDLMAQATLYYQVHVLIALALFALWPFTRLVHAFSAPIAYL FRPYIVYRSREVAAKHELIGSAPRRRGW" gene 1294168..1296054 /gene="typA" /locus_tag="Rv1165" /db_xref="GeneID:886038" CDS 1294168..1296054 /gene="typA" /locus_tag="Rv1165" /function="UNKNOWN; PROBABLY INTERACTS WITH THE RIBOSOMES IN A GTP DEPENDENT MANNER" /note="Rv1165, (MTV005.01-MTCI65.32), len: 628 aa. Possible typA (alternate gene name: bipA), GTP-binding translation elongation factor, similar to several e.g. P32132|TYPA_ECOLI|BIPA|B387 Escherichia coli (591 aa); YIHK_SYNY3|P72749 gtp-binding protein TYPA/BIPA homolog from synechocystis sp. (597 aa), FASTA scores: E(): 0, (46.9% identity in 610 aa overlap); and to elongation factor EF-G from many organims e.g. EFG_MICLU|P09952 micrococcus luteus (701 aa), FASTA scores: E(): 3e-24, (29.8% identity in 500 aa overlap). BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY, TYPA SUBFAMILY.; bipA" /codon_start=1 /transl_table=11 /product="GTP-binding translation elongation factor TypA" /protein_id="NP_215681.1" /db_xref="GI:15608305" /db_xref="GOA:O06563" /db_xref="UniProtKB/TrEMBL:O06563" /db_xref="GeneID:886038" /translation="MPFRNVAIVAHVDHGKTTLVDAMLRQSGALRERGELQERVMDTG DLEREKGITILAKNTAVHRHHPDGTVTVINVIDTPGHADFGGEVERGLSMVDGVLLLV DASEGPLPQTRFVLRKALAAHLPVILVVNKTDRPDARIAEVVDASHDLLLDVASDLDD EAAAAAEHALGLPTLYASGRAGVASTTAPPDGQVPDGTNLDPLFEVLEKHVPPPKGEP DAPLQALVTNLDASTFLGRLALIRIYNGRIRKGQQVAWIRQVDGQQTVTTAKITELLA TEGVERKPTDAAVAGDIVAVAGLPEIMIGDTLAASANPVALPRITVDEPAISVTIGTN TSPLAGKVGGHKLTARMVRSRLDAELVGNVSIRVVDIGAPDAWEVQGRGELALAVLVE QMRREGFELTVGKPQVVTKTIDGTLHEPFESMTVDCPEEYIGAVTQLMAARKGRMVEM ANHTTGWVRMDFVVPSRGLIGWRTDFLTETRGSGVGHAVFDGYRPWAGEIRARHTGSL VSDRAGAITPFALLQLADRGQFFVEPGQQTYEGMVVGINPRPEDLDINVTREKKLTNM RSSTADVIETLAKPLQLDLERAMELCAPDECVEVTPEIVRIRKVELAAAARARSRART KARG" gene 1296152..1298059 /gene="lpqW" /locus_tag="Rv1166" /db_xref="GeneID:886036" CDS 1296152..1298059 /gene="lpqW" /locus_tag="Rv1166" /function="UNKNOWN" /note="Rv1166, (MTV005.02), len: 635 aa. Probable lpqW, conserved lipoprotein, almost identical in part to G2384665|AF009358 Mycobacterium tuberculosis gene fragment ORFA2-898 (FRAGMENT) (59 aa) (93.9% identity in 49 aa overlap) (see * below). Also similar to Rv1280c and Rv2585c. Contains possible N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. [* Note: Unpublished. Identification of Mycobacterium tuberculosis peptides that stimulate immune human peripheral blood monocytes. Nano F.E., Doran J.L., Treit J.D., Moran A.J.]" /codon_start=1 /transl_table=11 /product="lipoprotein LpqW" /protein_id="NP_215682.1" /db_xref="GI:15608306" /db_xref="GOA:O50422" /db_xref="UniProtKB/TrEMBL:O50422" /db_xref="GeneID:886036" /translation="MGVPSPVRRVCVTVGALVALACMVLAGCTVSPPPAPQSTDTPRS TPPPPRRPTQIIMGIDWIGPGFNPHLLSDLSPVNAAISALVLPSAFRPIPDPNTPTGS RWEMDPTLLVSADVTNNHPFTVTYKIRPEAQWTDNAPIAADDFWYLWQQMVTQPGVVD PAGYHLITSVQSLEGGKQAVVTFAQPYPAWRELFTDILPAHIVKDIPGGFASGLARAL PVTGGQFRVENIDPQRDEILIARNDRYWGPPSKPGIILFRRAGAPAALADSVRNGDTQ VAQVHGGSAAFAQLSAIPDVRTARIVTPRVMQFTLRANVPKLADTQVRKAILGLLDVD LLAAVGAGTDNTVTLDQAQIRSPSDPGYVPTAPPAMSSAAALGLLEASGFQVDTNTSV SPAPSVPDSTTTSVSTGPPEVIRGRISKDGEQLTLVIGVAANDPTSVAVANTAADQLR DVGIAATVLALDPVTLYHDALNDNRVDAIVGWRQAGGNLATLLASRYGCPALQATTVP AANAPTTAPSAPIGPTPSAAPDTATPPPTAPRRPSDPGALVKAPSNLTGICDRSIQSN IDAALNGTKNINDVITAVEPRLWNMSTVLPILQDTTIVAAGPSVQNVSLSGAVPVGIV GDAGQWVKTGQ" misc_feature 1296203..1296235 /gene="lpqW" /locus_tag="Rv1166" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(1298087..1298692) /locus_tag="Rv1167c" /db_xref="GeneID:888758" CDS complement(1298087..1298692) /locus_tag="Rv1167c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1167c, (MTV005.03c), len: 201 aa. Probable transcriptional regulator, similar to several e.g. D1022772|D85417 hemR from Propionibacterium freudenreichii (243 aa), FASTA scores: opt: 268, E(): 5.4e-16, (35.9% identity in 198 aa overlap) and AL022268|SC4H2.32 Streptomyces coelicolor (111 aa), FASTA scores: opt: 274, E(): 5e-11, (55.1% identity in 89 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215683.1" /db_xref="GI:15608307" /db_xref="GOA:O50423" /db_xref="UniProtKB/TrEMBL:O50423" /db_xref="GeneID:888758" /translation="MTVSAPAKANPYRRRGEVLERALYDATLAELESAGYGGLTMEGI AARAQTGKAALYRRWAGKRELVLAAVQYALPPVPEPRADRSARENLLAVFTANCEILA GKTALPSMEIVSQLLHEPELRAIFINSVWAPRLRIVESILQAGVRSGEIDPATLTPMT ARIGPALIHQHVLFTGSPPDREQLTRIIDAMILTTGERRES" gene complement(1298764..1299804) /gene="PPE17" /locus_tag="Rv1168c" /db_xref="GeneID:885990" CDS complement(1298764..1299804) /gene="PPE17" /locus_tag="Rv1168c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1168c, (MTV005.04c), len: 346 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. E332789|Z98268|MTCI125.27C (385 aa), FASTA scores: opt: 504, E(): 0, (36.6% identity in 388 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177791.1" /db_xref="GI:57116836" /db_xref="UniProtKB/TrEMBL:Q7D8Q2" /db_xref="GeneID:885990" /translation="MDFTIFPPEFNSLNIQGSARPFLVAANAWKNLSNELSYAASRFE SEINGLITSWRGPSSTIMAAAVAPFRAWIVTTASLAELVADHISVVAGAYEAAHAAHV PLPVIETNRLTRLALATTNIFGIHTPAIFALDALYAQYWSQDGEAMNLYATMAAAAAR LTPFSPPAPIANPGALARLYELIGSVSETVGSFAAPATKNLPSKLWTLLTKGTYPLTA ARISSIPVEYVLAFVEGSNMGQMMGNLAMRSLTPTLKGPLELLPNAVRPAVSATLGNA DTIGGLSVPPSWVADKSITPLAKAVPTSAPGGPSGTSWAQLGLASLAGGAVGAVAART RSGVILRSPAAG" gene complement(1299822..1300124) /gene="PE11" /locus_tag="Rv1169c" /db_xref="GeneID:885930" CDS complement(1299822..1300124) /gene="PE11" /locus_tag="Rv1169c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1169c, (MTV005.05c), len: 100 aa. Member of the Mycobacterium tuberculosis PE family of proteins (see Brennan & Delogu 2002), e.g. O05297|Z93777|MTCI364.07 (99 aa), FASTA scores: opt: 209, E(): 1.6e-15, (37.4% identity in 99 aa overlap). Also simlar to the N-terminus of P77909|U76006 ESTERASE/LIPASE (EC 3.1.1.3) from Mycobacterium tuberculosis (437 aa), FASTA scores: opt: 193, E(): 4.4e-14, (37.2% identity in 94 aa overlap). Contains a helix-turn-helix motif from aa 88-109 (+2.76 SD)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177792.1" /db_xref="GI:57116837" /db_xref="UniProtKB/TrEMBL:Q79FR5" /db_xref="GeneID:885930" /translation="MSFVTTRPDSIGETAANLHEIGVTMSAHDDGVTPLITNVESPAH DLVSIVTSMLFSMHGELYKAIARQAHVIHESFVQTLQTSKTSYWLTELANRAGTST" gene 1300304..1301215 /gene="mshB" /locus_tag="Rv1170" /db_xref="GeneID:885997" CDS 1300304..1301215 /gene="mshB" /locus_tag="Rv1170" /function="Involved in mycothiol biosynthesis. 1-D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranos id e (GlcNAc-Ins)is converted to 1-D-myo-inosityl-2-amino-2-deoxy-alpha-D-glucopyranoside (GlcN-Ins) by this enzyme. Seems to possesse weak mycothiol conjugate amidase activity but shows substantial deacetylation activity with 1-D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranos id e (GlcNAc-Ins), a hypothetical mycothiol biosynthetic precursor. GlcNAc-Ins is an intermediate in MSH biosynthesis." /experiment="experimental evidence, no additional details recorded" /note="Rv1170, (MTV005.06), len: 303 aa. mshB, N-Acetyl-1-D-myo-Inosityl-2-Amino-2-Deoxy-alpha-D-Glucopy ra noside Deacetylase (GlcNAc-Ins deacetylase) (see citation below), similar to Q54358|X79146 lmbE gene from Streptomyces lincolnensis (270 aa), FASTA scores: opt: 308, E(): 1.2e-15, (32.0% identity in 278 aa overlap). Also similar to Rv1082|MCA Mycothiol conjugate amidase from Mycobacterium tuberculosis (288 aa)." /codon_start=1 /transl_table=11 /product="N-Acetyl-1-D-myo-Inosityl-2-amino-2-deoxy-alpha- D-glucopyranoside deacetylase mshB (GlcNAc-Ins deacetylase)" /protein_id="NP_215686.1" /db_xref="GI:15608310" /db_xref="UniProtKB/TrEMBL:O50426" /db_xref="GeneID:885997" /translation="MSETPRLLFVHAHPDDESLSNGATIAHYTSRGAQVHVVTCTLGE EGEVIGDRWAQLTADHADQLGGYRIGELTAALRALGVSAPIYLGGAGRWRDSGMAGTD QRSQRRFVDADPRQTVGALVAIIRELRPHVVVTYDPNGGYGHPDHVHTHTVTTAAVAA AGVGSGTADHPGDPWTVPKFYWTVLGLSALISGARALVPDDLRPEWVLPRADEIAFGY SDDGIDAVVEADEQARAAKVAALAAHATQVVVGPTGRAAALSNNLALPILADEHYVLA GGSAGARDERGWETDLLAGLGFTASGT" gene 1301307..1301747 /locus_tag="Rv1171" /db_xref="GeneID:885986" CDS 1301307..1301747 /locus_tag="Rv1171" /function="UNKNOWN" /note="Rv1171, (MTV005.07), len: 146 aa. Conserved hypothetical protein, possibly transmembrane protein. Start has been changed since first submission." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215687.2" /db_xref="GI:57116838" /db_xref="UniProtKB/TrEMBL:O50427" /db_xref="GeneID:885986" /translation="MGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAG ALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFG GPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG" gene complement(1301755..1302681) /gene="PE12" /locus_tag="Rv1172c" /db_xref="GeneID:885988" CDS complement(1301755..1302681) /gene="PE12" /locus_tag="Rv1172c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1172c, (MTV005.08c), len: 308 aa. Member of the Mycobacterium tuberculosis PE family of proteins (see Brennan & Delogu 2002), e.g. P71748|Z81368|MTCY253.25C (361 aa), FASTA scores: opt: 483, E(): 7.8e-22, (46.4% identity in 192 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177793.1" /db_xref="GI:57116839" /db_xref="UniProtKB/TrEMBL:Q7D8P8" /db_xref="GeneID:885988" /translation="MSFVFAAPEALAAAAADMAGIGSTLNAANVVAAVPTTGVLAAAA DEVSTQVAALLSAHAQGYQQLSRQMMTAFHDQFVQALRASADAYATAEASAAQTMVNA VNAPARALLGHPLISADASTGGGSNALSRVQSMFLGTGGSSALGGSAAANAAASGALQ LQPTGGASGLSAVGALLPRAGAAAAAALPALAAESIGNAIKNLYNAVEPWVQYGFNLT AWAVGWLPYIGILAPQINFFYYLGEPIVQAVLFNAIDFVDGTVTFSQALTNIETATAA SINQFINTEINWIRGFLPPLPPISPPGFPSLP" gene 1302931..1305501 /gene="fbiC" /locus_tag="Rv1173" /db_xref="GeneID:886061" CDS 1302931..1305501 /gene="fbiC" /locus_tag="Rv1173" /function="ESSENTIAL FOR COENZYME F420 PRODUCTION: PARTICIPATES IN A PORTION OF THE F420 BIOSYNTHETIC PATHWAY BETWEEN PYRIMIDINEDIONE AND FO (BIOSYNTHESIS INTERMEDIATE), BEFORE THE DEAZAFLAVIN RING IS FORMED." /note="7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase; catalyzes radical-mediated transfer of hydroxybenzyl group from 4-hydroxyphenylpyruvate (HPP) to 5-amino-6-ribitylamino-2,4(1H,3H)-pyrimidinedione to form 7,8-didemethyl-8-hydroxy-5-deazariboflavin (FO); functions in F420 biosynthesis" /codon_start=1 /transl_table=11 /product="FO synthase" /protein_id="NP_215689.1" /db_xref="GI:15608313" /db_xref="GOA:O50429" /db_xref="UniProtKB/TrEMBL:O50429" /db_xref="GeneID:886061" /translation="MPQPVGRKSTALPSPVVPPQANASALRRVLRRARDGVTLNVDEA AIAMTARGDELADLCASAARVRDAGLVSAGRHGPSGRLAISYSRKVFIPVTRLCRDNC HYCTFVTVPGKLRAQGSSTYMEPDEILDVARRGAEFGCKEALFTLGDRPEARWRQARE WLGERGYDSTLSYVRAMAIRVLEQTGLLPHLNPGVMSWSEMSRLKPVAPSMGMMLETT SRRLFETKGLAHYGSPDKDPAVRLRVLTDAGRLSIPFTTGLLVGIGETLSERADTLHA IRKSHKEFGHIQEVIVQNFRAKEHTAMAAFPDAGIEDYLATVAVARLVLGPGMRIQAP PNLVSGDECRALVGAGVDDWGGVSPLTPDHVNPERPWPALDELAAVTAEAGYDMVQRL TAQPKYVQAGAAWIDPRVRGHVVALADPATGLARDVNPVGMPWQEPDDVASWGRVDLG AAIDTQGRNTAVRSDLASAFGDWESIREQVHELAVRAPERIDTDVLAALRSAERAPAG CTDGEYLALATADGPALEAVAALADSLRRDVVGDEVTFVVNRNINFTNICYTGCRFCA FAQRKGDADAYSLSVGEVADRAWEAHVAGATEVCMQGGIDPELPVTGYADLVRAVKAR VPSMHVHAFSPMEIANGVTKSGLSIREWLIGLREAGLDTIPGTAAEILDDEVRWVLTK GKLPTSLWIEIVTTAHEVGLRSSSTMMYGHVDSPRHWVAHLNVLRDIQDRTGGFTEFV PLPFVHQNSPLYLAGAARPGPSHRDNRAVHALARIMLHGRISHIQTSWVKLGVRRTQV MLEGGANDLGGTLMEETISRMAGSEHGSAKTVAELVAIAEGIGRPARQRTTTYALLAA" repeat_region 1305495..1305556 /note="62 bp direct repeat copy 1, GGCCTAGCCCCGGCGACGATGCCGGGTCGCGGGATGCGGCCCGTTGAGGAGCGGGGC AATCT" repeat_region 1305557..1305618 /note="62 bp direct repeat copy 2, GGCCTAGCCCCGGCGACGATGCCGGGTCGCGGGATGCGGCCCGTTGAGGAGCGGGGC AATCT" repeat_region 1305619..1305661 /note="62 bp direct repeat partial copy 3 (43/62 bp), GGCCTAGCCCCGGCGACGATGCCGGGTCGCGGGATGGGGCCCG" gene complement(1305669..1306001) /gene="TB8.4" /locus_tag="Rv1174c" /db_xref="GeneID:886082" CDS complement(1305669..1306001) /gene="TB8.4" /locus_tag="Rv1174c" /function="UNKNOWN FUNCTION (SECRETED PROTEIN)" /experiment="experimental evidence, no additional details recorded" /note="Rv1174c, (MTV005.10c), len: 110 aa. TB8.4, low molecular weight T-cell antigen (see citations below), hypothetical unknown secreted protein." /codon_start=1 /transl_table=11 /product="low molecular weight T-cell antigen TB8.4" /protein_id="NP_215690.1" /db_xref="GI:15608314" /db_xref="UniProtKB/TrEMBL:O50430" /db_xref="GeneID:886082" /translation="MRLSLTALSAGVGAVAMSLTVGAGVASADPVDAVINTTCNYGQV VAALNATDPGAAAQFNASPVAQSYLRNFLAAPPPQRAAMAAQLQAVPGAAQYIGLVES VAGSCNNY" gene complement(1306202..1308226) /gene="fadH" /locus_tag="Rv1175c" /db_xref="GeneID:886053" CDS complement(1306202..1308226) /gene="fadH" /locus_tag="Rv1175c" /EC_number="1.3.1.34" /function="CATALYZES THE NADP-DEPENDENT REDUCTION OF 2,4-DIENOYL-CoA TO YIELD TRANS-2- ENOYL-CoA [CATALYTIC ACTIVITY: TRANS-2,3-DIDEHYDROACYL-CoA + NADP(+) = TRANS,TRANS-2,3,4,5-TETRADEHYDROACYL-CoA + NADPH]." /note="Rv1175c, (MTV005.11c), len: 674 aa. Probable fadH, NADPH-dependent 2,4-dienoyl-CoA reductase (EC 1.3.1.34), highly similar to others e.g. NP_251782.1|NC_002516 2,4-dienoyl-CoA reductase FadH1 from Pseudomonas aeruginosa (679 aa); CAC01564.1|AL391039 2,4-dienoyl-CoA reductase [NADPH] from Streptomyces coelicolor (671 aa); P42593|FADH_ECOLI 2,4-dienoyl-CoA reductase from Escherichia coli (671 aa), FASTA scores: opt: 2344, E(): 0, (53.1% identity in 671 aa overlap); etc. Also similar to Rv3359|MTV004.16 PUTATIVE OXIDOREDUCTASE from Mycobacterium tuberculosis (396 aa)." /codon_start=1 /transl_table=11 /product="NADPH dependent 2,4-dienoyl-CoA reductase" /protein_id="NP_215691.1" /db_xref="GI:15608315" /db_xref="GOA:O50431" /db_xref="UniProtKB/TrEMBL:O50431" /db_xref="GeneID:886053" /translation="MTNPYPNLLSPLDLGFTTLRNRVVMGSMHTGLEDRARHIDRLAD YFAERARGGVGLIITGGYAPNRTGWLLPFASELVTSAQARRHRRITRAVHDSGAKILL QILHAGRYAYHPLAVSASPIKAPITPFRPRALSARGVEATIADFARCAQLARDAGYDG VEIMGSEGYLLNQFLAPRTNKRTDSWGGTPANRRRFPVEIIRRSRAAVGCDFIICYRL SMADYVAEGQSWDEIVALATEVEGAGATIINSGFGWHEARVPTIVTSVPGGAFVDISS AVAEHVTIPVVASNRINMPQAAERILAETQVRLISMARPMLSDPDWVLKAQSNRVDEI NTCISCNQACLDHAFARKTVSCLLNPRAGRETQLVLSPTRRARSVAVVGAGPAGLATA ANAAQRGHRVTLFEANDFIGGQFDMARRIPGKEEFSETIRYFSTILAKHGVEVRLGTR VAAQELTGYDEVVLATGVAPRIPAIPGIDHPMVLTYAEAITGVRPVGRTVAVVGAGGI GFDVTELLVTDSSPTLNLKEWKAEWGVADPREARGALTTPLPAPPAREVYLLQRTKGP QGKRLGKTTGWVHRASLKAKGVHQLSGVNYEQINDDGLHISFGPKRRRPQLLAVDNVV VCAGQEPVRDLESELRRHGINPHIIGGAAVAAELDAKRAIKQGTELAARL" gene complement(1308223..1308792) /locus_tag="Rv1176c" /db_xref="GeneID:886080" CDS complement(1308223..1308792) /locus_tag="Rv1176c" /function="UNKNOWN" /note="Rv1176c, (MTV005.12c), len: 189 aa. Conserved hypothetical protein, some similarity to P94443|D78508 hypothetical protein from Bacillus subtilis (182 aa), FASTA scores: opt: 219, E(): 1.7e-15, (25.1% identity in 183 aa overlap). Similar to Mycobacterium tuberculosis hypothetical protein Rv0047c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215692.1" /db_xref="GI:15608316" /db_xref="UniProtKB/TrEMBL:O50432" /db_xref="GeneID:886080" /translation="MALPHAILVSLCEQASSGYELARRFDRSIGYFWTATHQQIYRTL RVMENNNWVRATTVLQHGRPDKKVYAISDSGRAELARWIAEPLSPTRPGRGSALTDSS TRDIAVKLRGAGYGDVAALYTQVTALRAERVKSLDTYRGIEKRTFADPSALDGAALHQ YLVLRGGIRAEESAIDWLDEVAEALQEKR" gene 1309005..1309331 /gene="fdxC" /locus_tag="Rv1177" /db_xref="GeneID:885869" CDS 1309005..1309331 /gene="fdxC" /locus_tag="Rv1177" /function="FERREDOXINS ARE IRON-SULFUR PROTEINS THAT TRANSFER ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS." /note="Rv1177, (MTV005.13), len: 108 aa. Probable fdxC, ferredoxin (EC 1.-.-.-), equivalent to NP_302047.1|NC_002677 ferredoxin from Mycobacterium leprae (108 aa); P00215|FER_MYCSM FERREDOXIN from Mycobacterium smegmatis (106 aa), FASTA scores: opt: 705, E(): 0, (87.7% identity in 106 aa overlap). Also highly similar to many e.g. JH0239 ferredoxin precursor from Saccharopolyspora erythraea (105 aa); P24496|FER_SACER FERREDOXIN from Saccharopolyspora erythraea (106 aa); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. BELONGS TO THE BACTERIAL TYPE FERREDOXIN FAMILY. COFACTOR: BINDS 1 4FE-4S CLUSTER AND A 3FE-4S CLUSTER (BY SIMILARITY)." /codon_start=1 /transl_table=11 /product="ferredoxin FdxC" /protein_id="NP_215693.1" /db_xref="GI:15608317" /db_xref="GOA:O50433" /db_xref="UniProtKB/TrEMBL:O50433" /db_xref="GeneID:885869" /translation="MTYTIAEPCVDIKDKACIEECPVDCIYEGARMLYIHPDECVDCG ACEPVCPVEAIFYEDDVPEQWSHYTQINADFFAELGSPGGAAKVGMTENDPQAVKDLA PQSEDA" misc_feature 1309122..1309157 /gene="fdxC" /locus_tag="Rv1177" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" gene 1309364..1310452 /locus_tag="Rv1178" /db_xref="GeneID:886031" CDS 1309364..1310452 /locus_tag="Rv1178" /EC_number="2.6.1.17" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="catalyzes the formation of N-succinyl-LL-2,6-diaminopimelate from N-succinyl-L-2-amino-6-oxopimelate in lysine biosynthesis" /codon_start=1 /transl_table=11 /product="N-succinyldiaminopimelate aminotransferase" /protein_id="NP_215694.1" /db_xref="GI:15608318" /db_xref="GOA:O50434" /db_xref="UniProtKB/TrEMBL:O50434" /db_xref="GeneID:886031" /translation="MSASLPVFPWDTLADAKALAGAHPDGIVDLSVGTPVDPVAPLIQ EALAAASAAPGYPATAGTARLRESVVAALARRYGITRLTEAAVLPVIGTKELIAWLPT LLGLGGADLVVVPELAYPTYDVGARLAGTRVLRADALTQLGPQSPALLYLNSPSNPTG RVLGVDHLRKVVEWARGRGVLVVSDECYLGLGWDAEPVSVLHPSVCDGDHTGLLAVHS LSKSSSLAGYRAGFVVGDLEIVAELLAVRKHAGMMVPAPVQAAMVAALDDDAHERQQR ERYAQRRAALLPALGSAGFAVDYSDAGLYLWATRGEPCRDSAAWLAQRGILVAPGDFY GPGGAQHVRVALTATDERVAAAVGRLTC" misc_feature 1310015..1310056 /locus_tag="Rv1178" /note="PS00105 Aminotransferases class-I pyridoxal-phosphate attachment site" gene complement(1310480..1313299) /locus_tag="Rv1179c" /db_xref="GeneID:886083" CDS complement(1310480..1313299) /locus_tag="Rv1179c" /function="UNKNOWN" /note="Rv1179c, MTV005.15c, len: 939 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215695.1" /db_xref="GI:15608319" /db_xref="GOA:O50435" /db_xref="UniProtKB/TrEMBL:O50435" /db_xref="GeneID:886083" /translation="MDPHRDLESRAFAGNWRVYQQQALDAFDADVAAGDNRAYLVLPP GAGKTMIGLEAARRLGRRSLVLVPNTAVQAQWAAAWDNSFPSSDRSASKCGTERGLAS AMNVLTYQSLAVIDAETDSTVRREVLRNRDQQALLDLLHPNGRAVIERAATLGPWTLV LDECHHLLATWGALVSALASVLGAQTALIGLTATPATELTAWQHTLHDELFGTADFVI PTPALVREGDLAPYQELVYLTQPTPEEQAWIGTHRARFADLMLALIDQKVGSMSLAAW LHTRIVDRATREGNQIAWSTFERAEPDLACSGLRFAYDGLIPLPDGVRLREQHRIAPD AQDWVNVLTDFSVGHLQQSADPRDAHALTAIKRVLPGLGYRLTSRGVRVATSPVDRLC ALSESKIAATAHILDTEDAVLGARLRALVLCDFESMTGALPTSLKGAPVSEQSGSAQL VAAMLAASDHRRRTPLHALLVTGQTFACPAAIEDDLIAFCAERGALVTAEPLDAHPSL RVMRGTGGFTPRTWVALATEYFLAGRARVLVGTRSLLGEGWDCAAVNVNIDLTSATTQ AAITQMRGRAIRNDPSDGHKVADNWSVCCIATEHPRGDADYLRLVRKHDGYYAATPQG LIESGVTHCDPSLSPYGPPVTDTHAITARALQRVAERAQARSWWRIGEPYEGVDVATI RVRSRQPLGVAAPRIPASALTPPVPGQFSPVRLARGAVAAVSVVGASTATAVASANLG MLAGAGTAGAIVAAGVGLVATAAAAESRRLDHAPNALEQLAAVVADALYAAGGAQRGS AALRLASDPEGWIRCQLDGVPTEQSLRFTAALDELLAPLAEPRYLIGRKILTPPARPV ARRLFAVRAVVGLSLPGTVAWHAVPRWFARNKDRRQHLAQAWRKHIGPPRQLPADSPQ GQAILDLFRGDNPLSVTTQLRTTWR" gene 1313725..1315191 /gene="pks3" /locus_tag="Rv1180" /db_xref="GeneID:886055" CDS 1313725..1315191 /gene="pks3" /locus_tag="Rv1180" /EC_number="2.3.1.-" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM. SUPPOSED INVOLVED IN STATIONARY-PHASE SURVIVAL." /experiment="experimental evidence, no additional details recorded" /note="Rv1180, (MTV005.16), len: 488 aa. Probable polyketide beta-ketoacyl synthase (EC 2.3.1.-), equivalent to a predicted homologous protein from Mycobacterium smegmatis (see citation below), and similar to the N-terminus of many polyketide synthases e.g. MCAS_MYCBO|Q02251 mycocerosic acid synthase from mycobacterium bovis (2110 aa), FASTA scores: opt: 2115, E(): 0, (66.5% identity in 472 aa overlap). Also similar to, and same length as P96284|Z83858|MTCY24G1.02 M. tuberculosis (496 aa), FASTA scores: opt: 1424, E(): 0, (50.9% identity in 444 aa overlap). Contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site, also PS00606 Beta-ketoacyl synthases active site. BELONGS TO THE BETA-KETOACYL-ACP SYNTHASES FAMILY." /codon_start=1 /transl_table=11 /product="polyketide beta-ketoacyl synthase PKS3" /protein_id="NP_215696.1" /db_xref="GI:15608320" /db_xref="GOA:O50436" /db_xref="UniProtKB/TrEMBL:O50436" /db_xref="GeneID:886055" /translation="MRTATATSVAVIGMACRLPGGIDSPQRLWEALLRGDDLVGEIPA DRWDANVYYDPEPGVPGRSVSRWGAFLDDVGGFDCDFFGLTEREATAIDPQHRLLLEV SWEAIEHAGVDPATLAESQTGVFVGLTHGDYELLSADCGAAEGPYGFTGTSNSFASGR VAYTLGLHGPAVTVDTACSSGLTAVHQACRSLDDGESDLALAGGVVVTLEPRKSVSGS LQGMLSPTGRCHAFDEAADGFVSGEGCVVLLLKRLPDAVRDGDRVLAIVRGTAANQDG RTVNIAAPSAQAQIAVYQQALAAAGVEASTVGMVEAHGTGTPVGDPVEYASLAAVYGT EGPCALTSVKTNFGHLQSASGPLGLMKTILALRHGVVPQNLHFCRLPDQLAEIDTELF VPQANTSWPDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEPAASGPELTPEAGGLAL FPVSATSAEQLHVTAARLADWVDQNGNAGSRVSMRDLG" misc_feature 1313740..1313772 /gene="pks3" /locus_tag="Rv1180" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" misc_feature 1314229..1314279 /gene="pks3" /locus_tag="Rv1180" /note="PS00606 Beta-ketoacyl synthases active site" gene 1315234..1319982 /gene="pks4" /locus_tag="Rv1181" /db_xref="GeneID:886081" CDS 1315234..1319982 /gene="pks4" /locus_tag="Rv1181" /EC_number="2.3.1.-" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1181, (MTV005.17), len: 1582 aa. Probable polyketide synthase, similar to many e.g. MCAS_MYCBO|Q02251 mycocerosic acid synthase from mycobacterium bovis (2110 aa), FASTA scores: opt: 3518, E(): 0, (59.7% identity in 1614 aa overlap). Note that this similarity extends upstream of the first initiation codon into the upstream MTV005.16; however the stop codon at the end of MTV005.16 is present in at least 4 independent clones (BAC, cosmid and pUC) from the genome. The two CDS's may represent separate modules of the polyketide synthase." /codon_start=1 /transl_table=11 /product="polyketide beta-ketoacyl synthase PKS4" /protein_id="NP_215697.1" /db_xref="GI:15608321" /db_xref="GOA:O50437" /db_xref="UniProtKB/TrEMBL:O50437" /db_xref="GeneID:886081" /translation="MTASSFDELSAALRDVAGDQIPYQPAVGHDDRGPVWVFSGQGSQ WPGMGTELLVAEPVFAATVAAMEPVIARESGFSVTEAMSAPQTVSGIDRVQPTIFAVQ VALAAALKSYGVRPGAIIGHSLGEAAAAVVAGALSLHDGLRVICRRSRLMSRIAGSGA MASVELPGQQVLSELAIRGISDVVLSVVASPTSTVVGGATQSIRDLVAAWEQQDVLAR EVAVDVASHTPQVDPILDELLEVLAEVDPTAPEIPYYSATLWDPRERPSFTGEYWVEN LRYTVRFAAAVQAALKDGYRVFGELAPHPLLTYAVEQNAASLDMPIATLAAMRRGEQL PFGLRGFVADVHNAGAKVDFSVQYPDGRLVDAPLPSWTHRTLMLSREDSHRSHTGAVQ AVHPLLGAHVHLLEEPERHVWQAGVGTGAHPWLGDHRIHNVAAFPGAAYCEMALAAAR TTLGELSEVRDIKFEQTLLLDEQTVVSSAATIAAPGILQFAVESHQEGEPARRASAML HALEEMPQPPGYDTNALTAAHESSMSGEELRKMFNSLGIQYGPAFSGLVAVHTARGDV TTVLAEVALPGAIRSQQSAYASHPALLDACFQSVLVHPEVQKATVGGLMLPVGVRRLR NYHSTRSAHYCLARVTSSSRAGECEADLDVFDQAGTVLLTVEGLRLAAGISEHERANR VFDERLLTIEWERGELPEVPQIDAGSWLLLSASEADPLTAQLADALNAVGAQSTSVAS ASDVAQLRSLLGGRLTGVVVVTGPPTGGLTQCGRDYVSQLVGIARELAELPGEPPRLF VVTRSAASVLPSDLANLEQAGLRGLMRVIDSEHPHLGATAIDVDNDETVAALVASQLQ SGSQEDETAWRNGIWYTARLRPGPLRPAERRTAVVEYRRDGMRLQIRTPGDLESLEFV TFDRVAPGPGEIEVAVTASSVNFADVLVAFGRYPTFEGYRQQLGIDFAGVVTAVGPDV TEHRIGDHVGGMSANGCWSTFVRCDARLAVTLPPELPVAAAAAVPTASATAWYALHDL ARICSDDKVLIHSGTGGVGQAAIAIARAAGCEIFATAGSAQRRQLLHDMGVEHVYDSR STEFAEQIRGDTDGYGVDVVLNSLPGAAQRAGIELLAFGGRFVEIGKRDIYGDTRLGL FPFRRNLSLYAVDLALLTHSHPHTVRRLLKTVYQHTVEGTLPVPQTTHYPIHDAAVAI RLVGGAGHTGKVVLDVPRTGEGVAVVPPEQVRTSRPDGAYLVTGGLGGLGLFLAGELA AAGCGRIVLNSRSTPSPHATRVIERLRAAGADIQVECGDIADAATAHRVVAVATASGL PVRGVLHAAAVVEDATLANVTDELIDRCWAPKVHGAWNIHRATAAQPLEWFCLFSSAA ALVGSPGQGAYAAANSWLDAFAHWRRAQGLPATSIAWGAWAEIGRATALAEGTGAAIA PAEGARAFQTLLRYGRAYSGYAPIMGTPWLTAFAQRSRFAEAFHATGQNQPATGKFLA ELGSLPREEWPRTVRRLVSDQISLLLRRTIDPDRPLSDYGLDSLGNLELRTRIETETG IRVSPTKITTVRGLAEHVCDELAAAQSAPV" gene 1320035..1321453 /gene="papA3" /locus_tag="Rv1182" /db_xref="GeneID:886072" CDS 1320035..1321453 /gene="papA3" /locus_tag="Rv1182" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1182, (MTV005.18), len: 472 aa. Probable papA3, conserved polyketide synthase (PKS) associated protein, similar to other Mycobacterial hypothetical proteins e.g. Q49618|U00010 B1170_C1_180 from Mycobacterium leprae (471 aa), FASTA scores: opt: 2526, E(): 0, (75.6% identity in 471 aa overlap). Similar to other Mycobacterium tuberculosis hypothetical papA proteins; Rv3824c, Rv3820c, Rv1528c." /codon_start=1 /transl_table=11 /product="polyketide synthase associated protein PapA3" /protein_id="NP_215698.1" /db_xref="GI:15608322" /db_xref="GOA:O50438" /db_xref="UniProtKB/TrEMBL:O50438" /db_xref="GeneID:886072" /translation="MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVP VSYMQAQHIRGYCEQKAKGLDYSRLMVVSCQQPGQCDIRAANYVINAHLRRHDTYRSW FQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQWGCFRFGIVQG CDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAPLELPPAGSYDDFCRRQHT FSSTLTVESPQVRAWTKFAEGTNGSFPDFPLPLGDPSKPSDADIVTVMMLDEEQTAQF ESVCTAAGARFIGGVLACCGLAEHELTGTTTYYGLTPRDTRRTPADAMTQGWFTGLIP ITVPIAGSAFGDAARAAQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAG AAPLSVLLTAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVAR YLATLKSVFQRVAESGQQQNVA" gene 1321520..1324528 /gene="mmpL10" /locus_tag="Rv1183" /db_xref="GeneID:886068" CDS 1321520..1324528 /gene="mmpL10" /locus_tag="Rv1183" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv1183, (MTV005.19), len: 1002 aa. Probable mmpL10, conserved transmembrane transport protein (see Tekaia et al., 1999), member of RND superfamily, similar to many Mycobacterial hypothetical membrane proteins e.g. Q49619|U00010 from Mycobacterium leprae (1008 aa), FASTA scores: opt: 4545, E(): 0, (70.6% identity in 978 aa overlap); etc. BELONGS TO THE MMPL FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL10" /protein_id="NP_215699.1" /db_xref="GI:15608323" /db_xref="GOA:P65372" /db_xref="UniProtKB/Swiss-Prot:P65372" /db_xref="GeneID:886068" /translation="MVGCWVALALVLPMAVPSLAEMAQRHPVAVLPADAPSSVAVRQM AEAFHESGSENILVVLLTDEKGLGAADENVYHTLVDRLRNDAKDVVMLQDFLTTPPLR EVLGSKDGKAWILPIGLAGDLGTPKSYHAYTDVERIVKRTVAGTTLTANVTGPAATVA DLTDAGARDRASIELAIAVMLLVILMVIYRNPVTMLLPLVTIGASLMTAQALVAGVSL VGGLAVSNQAIVLLSAMIAGAGTDYAVFLISRYHEYVRLGEHPERAVQRAMMSVGKVI AASAATVGITFLGMRFAKLGVFSTVGPALAIGIAVSFLAAVTLLPAILVLASPRGWVA PRGERMATFWRRAGTRIVRRPKAYLGASLIGLVALASCASLAHFNYDDRKQLPPSDPS SVGYAAMEHHFSVNQTIPEYLIIHSAHDLRTPRGLADLEQLAQRVSQIPGVAMVRGVT RPNGETLEQARATYQAGQVGNRLGGASRMIDERTGDLNRLASGANLLADNLGDVRGQV SRAVAGVRSLVDALAYIQNQFGGNKTFNEIDNAARLVSNIHALGDALQVNFDGIANSF DWLDSVVAALDTSPVCDSNPMCGNARVQFHKLQTARDNGTLDKVVGLARQLQSTRSPQ TVSAVVNDLGRSLNSVVRSLKSLGLDNPDAARARLISMQNGANDLASAGRQVADGVQM LVDQTKNMGIGLNQASAFLMAMGNDASQPSMAGFNVPPQVLKSEEFKKVAQAFISPDG HTVRYFIQTDLNPFSTAAMDQVNTIIDTAKGAQPNTSLADASISMSGYPVMLRDIRDY YERDMRLIVAVTVVVVILILMALLRAIVAPLYLVGSVVISYMSAIGLGVVVFQVFLGQ ELHWSVPGLAFVVLVAVGADYNMLLASRLRDESALGVRSSVIRTVRCTGGVITAAGLI FAASMSGLLFSSIGTVVQGGFIIGVGILIDTFVVRTITVPAMATLLGRASWWPGHPWQ RCAPEEGQMSARMSARTKTVFQAVADGSKR" gene complement(1324532..1325611) /locus_tag="Rv1184c" /db_xref="GeneID:886063" CDS complement(1324532..1325611) /locus_tag="Rv1184c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1184c, (MTV005.20c), len: 359 aa. Possible exported protein with potential N-terminal signal sequence. Similar to several Mycobacterial hypothetical proteins e.g. Q49633|U00010) Protein B1170_F3_112 from M. leprae (391 aa), FASTA scores: opt: 1422, E(): 0, (62.7% identity in 338 aa overlap). Also similar to Rv3822, Rv3539, Rv1430, Rv0151c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215700.1" /db_xref="GI:15608324" /db_xref="UniProtKB/TrEMBL:O50440" /db_xref="GeneID:886063" /translation="MKRVIAGAFAVWLVGWAGGFGTAIAASEPAYPWAPGPPPSPSPV GDASTAKVVYALGGARMPGIPWYEYTNQAGSQYFPNAKHDLIDYPAGAAFSWWPTMLL PPGSHQDNMTVGVAVKDGTNSLDNAIHHGTDPAAAVGLSQGSLVLDQEQARLANDPTA PAPDKLQFTTFGDPTGRHAFGASFLARIFPPGSHIPIPFIEYTMPQQVDSQYDTNHVV TAYDGFSDFPDRPDNLLAVANAAIGAAIAHTPIGFTGPGDVPPQNIRTTVNSRGATTT TYLVPVNHLPLTLPLRYLGMSDAEVDQIDSVLQPQIDAAYARNDNWFTRPVSVDPVRG LDPLTAPGSIVEGARGLLGSPAFGG" gene complement(1325776..1327512) /gene="fadD21" /locus_tag="Rv1185c" /db_xref="GeneID:886065" CDS complement(1325776..1327512) /gene="fadD21" /locus_tag="Rv1185c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_215701.1" /db_xref="GI:15608325" /db_xref="GOA:P63523" /db_xref="UniProtKB/Swiss-Prot:P63523" /db_xref="GeneID:886065" /translation="MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEV FRRTRIVAHEVRRHCTTGDRAVILAPQGLAYIAAFLGSMQAGAIAVPLSVPQIGSHDE RVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLTGNSPSFRVKD LPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFGDRNGVAPPDTTIVSWLPF YHDMGLVLGIIAPILGGYRSELTSPLAFLQRPARWLHSLANGSPSWSAAPNFAFELAV RKTTDADIEGLDLGNVLGITSGAERVHPNTLSRFCNRFAPYNFREDMIRPSYGLAEAT LYVASRNSGDKPEVVYFEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCI ECPAGTIGEIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLS EDEMFIVGRMKDMLIVYGRNHYPEDIESTVQEITGGRVAAISVPVDHTEKLVTVIELK LLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGSIPTTTSGKIRRAACVE QYRLQQFTRLDG" gene complement(1327689..1329305) /locus_tag="Rv1186c" /db_xref="GeneID:886057" CDS complement(1327689..1329305) /locus_tag="Rv1186c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1186c, (MTV005.22c), len: 538 aa. Conserved hypothetical protein, similar to AL117385|SC5G9.24 hypothetical protein from Streptomyces coelicolor (555 aa), FASTA scores: opt: 485, E(): 2.3e-23, (32.6% identity in 568 aa overlap). Contains helix turn helix motif from aa 488-509 (+2.81 SD)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215702.1" /db_xref="GI:15608326" /db_xref="GOA:O50442" /db_xref="UniProtKB/TrEMBL:O50442" /db_xref="GeneID:886057" /translation="MRIAGVGLGQLLLALDATVVSLVDAPRGLDLPVASTALIDSDDV RLGLAAAAGSADVFFLIGVTDDEAVRWVDDQARQRAPVAIFVKHPSDSVVAGAVRAGS AVVAVEPRARWERLYHLVNHVLEHHGDRADPTDDSGTDLFGLAQSLADRIHGMISIED AQSHVLAYSASNDEADELRRLSILGRAGPPEHLQWIGQWGIFDALRPGREVVRVAERP ELGLRPRLAIGIHQPGVGALRPPVFAGTIWVQQGSQPLADDAEEMLRGAAVLAARIMS RLATQPNTHALRVQQLLGLAELNATTAPVDVSTIARELGVAAEGNATLIGFDTAENRD TAVRHVRLVDVMALSASAFRHDAQVAANGSRIYVLLPQTTTGRAVTSWVRGTISALRA ELGVALRAAIAGPVAGLAEVNPARVEVDRVLESAERHPILGQVTSLAEARTTVLLDEI VTLVGTDQRLVDPRIRDLGAQDPVLAQTLRAYLDAFGDIGAAARSLQVHPNTVRYRIR RIEQLLSTSLGDPDVRLLFSLGLRAMERTA" gene 1329390..1331021 /gene="rocA" /locus_tag="Rv1187" /db_xref="GeneID:886058" CDS 1329390..1331021 /gene="rocA" /locus_tag="Rv1187" /EC_number="1.5.1.12" /function="INVOLVED IN THE ARGINASE PATHWAY [CATALYTIC ACTIVITY: 1-PYRROLINE-5-CARBOXYLATE + NAD(+) + H(2)O = L-GLUTAMATE + NADH]" /experiment="experimental evidence, no additional details recorded" /note="Rv1187, (MTV005.23), len: 543 aa. Probable rocA, pyrroline-5-carboxylate dehydrogenase (EC 1.5.1.12), similar to many e.g. PUT2_HUMAN|P30038 human delta-1-pyrroline-5-carboxylate dehydrogenase (563 aa), FASTA scores: opt: 1596, E():0, (46.0% identity in 531 aa overlap). Also similar to other Mycobacterium tuberculosis hypothetical dehydrogenases e.g. Rv0768, Rv2858c, etc. Contains PS00687 Aldehyde dehydrogenases glutamic acid active site and PS00070 Aldehyde dehydrogenases cysteine active site." /codon_start=1 /transl_table=11 /product="pyrroline-5-carboxylate dehydrogenase ROCA" /protein_id="NP_215703.1" /db_xref="GI:15608327" /db_xref="GOA:O50443" /db_xref="UniProtKB/TrEMBL:O50443" /db_xref="GeneID:886058" /translation="MDAITQVPVPANEPVHDYAPKSPERTRLRTELASLADHPIDLPH VIGGRHRMGDGERIDVVQPHRHAARLGTLTNATHADAAAAVEAAMSAKSDWAALPFDE RAAVFLRAADLLAGPWREKIAAATMLGQSKSVYQAEIDAVCELIDFWRFNVAFARQIL EQQPISGPGEWNRIDYRPLDGFVYAITPFNFTSIAGNLPTAPALMGNTVIWKPSITQT LAAYLTMQLLEAAGLPPGVINLVTGDGFAVSDVALADPRLAGIHFTGSTATFGHLWQW VGTNIGRYHSYPRLVGETGGKDFVVAHASARPDVLRTALIRGAFDYQGQKCSAVSRAF IAHSVWQRMGDELLAKAAELRYGDITDLSNYGGALIDQRAFVKNVDAIERAKGAAAVT VAVGGEYDDSEGYFVRPTVLLSDDPTDESFVIEYFGPLLSVHVYPDERYEQILDVIDT GSRYALTGAVIADDRQAVLTALDRLRFAAGNFYVNDKPTGAVVGRQPFGGARGSGTND KAGSPLNLLRWTSARSIKETFVAATDHIYPHMAVD" gene 1331021..1332010 /locus_tag="Rv1188" /db_xref="GeneID:886054" CDS 1331021..1332010 /locus_tag="Rv1188" /EC_number="1.5.99.8" /function="OXIDIZES PROLINE TO GLUTAMATE FOR USE AS A CARBON AND NITROGEN SOURCE [CATALYTIC ACTIVITY: L-proline + acceptor + H2O = (S)-1-pyrroline-5-carboxylate + reduced acceptor]" /note="Rv1188, (MTV005.24), len: 329 aa. Possible putA, proline dehydrogenase (EC 1.5.99.8), similar to part of Q52711|X78346 proline dehydrogenase from Rhodobacter capsulatus (1127 aa), FASTA scores: opt: 194, E(): 1.5e-07, (31.2% identity in 349 aa overlap). Also similar to two Bacillus subtilis proline dehydrohenases E1184363|Z99120 (302 aa), FASTA scores: opt: 509, E(): 0, (37.1% identity in 313 aa overlap); and E1182272|Z99105 (303 aa), FASTA scores: opt: 513, E(): 0, (32.5% identity in 311 aa overlap). Highly similar to AL035569|SC8D9.31 Streptomyces coelicolor (308 aa), FASTA scores: opt: 984, E(): 0, (50.0% identity in 312 aa overlap)." /codon_start=1 /transl_table=11 /product="proline dehydrogenase" /protein_id="NP_215704.1" /db_xref="GI:15608328" /db_xref="GOA:O50444" /db_xref="UniProtKB/TrEMBL:O50444" /db_xref="GeneID:886054" /translation="MAGWFAHTLRPAMLAAGRSDRLGRIVERSPLTRGVVRRFVPGDT LDDVVDIVTALRDSGRYLSIDYLGENVTDADDAAAAVRAYLGLLDVLGRRGDIACDGV RPLEVSLKLSALGQALDRDGQKIALDNARAICERAERVGAWVTVDAEDHTTTDSTLSI SGDLRVDFPWLGTVVQAYLRRTLADCAELAAVGARVRLCKGAYDEPASVAYRDAAQVT DSYLRCLRVLTAGRGYPMVATHDPVIIAAVPGITRESGRSQGDFEYQMLYGVRDDEQR RLTGAGNHVRVYVPFGTRWYGYFLRRLAERPANLAFFLRALTDRRRARGCAER" gene 1332092..1332964 /gene="sigI" /locus_tag="Rv1189" /db_xref="GeneID:886079" CDS 1332092..1332964 /gene="sigI" /locus_tag="Rv1189" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED" /experiment="experimental evidence, no additional details recorded" /note="member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigI" /protein_id="NP_215705.1" /db_xref="GI:15608329" /db_xref="GOA:O50445" /db_xref="UniProtKB/TrEMBL:O50445" /db_xref="GeneID:886079" /translation="MSQHDPVSAAWRAHRAYLVDLAFRMVGDIGVAEDMVQEAFSRLL RAPVGDIDDERGWLIVVTSRLCLDHIKSASTRRERPQDIAAWHDGDASVSSVDPADRV TLDDEVRLALLIMLERLGPAERVVFVLHEIFGLPYQQIATTIGSQASTCRQLAHRARR KINESRIAASVEPAQHRVVTRAFIEACSNGDLDTLLEVLDPGVAGEIDARKGVVVVGA DRVGPTILRHWSHPATVLVAQPVCGQPAVLAFVNRALAGVLALSIEAGKITKIHVLVQ PSTLDPLRAELGGG" gene 1332980..1333858 /locus_tag="Rv1190" /db_xref="GeneID:886049" CDS 1332980..1333858 /locus_tag="Rv1190" /function="UNKNOWN" /note="Rv1190, (MTCI364.02), len: 292 aa. Conserved hypothetical protein, similar to Rv1833c|Y0DA_MYCTU|Q50600 hypothetical 32.2 kDa protein cy1a11.10 (286 aa), fasta scores: opt: 331, E(): 1.4e-15, (29.0% identity in 272 aa overlap), also YU14_MYCTU|Q50670 putative haloalkane dehalogenase (300 aa), FASTA scores: opt: 239, E(): 2.2e-09, (29.9% identity in 298 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215706.1" /db_xref="GI:15608330" /db_xref="GOA:O86348" /db_xref="UniProtKB/TrEMBL:O86348" /db_xref="GeneID:886049" /translation="MTMKSLAALDRPSWLSSSAWPWQPYLLSHHQGGIAVTDIGDGPA VLFVHVGSWSFVWRDVLLRLANDFRCVAIDAPGCGLSDRLSTPPTLAQAADAITSVID ALQLRDLTLVAHDLGGPAGFLAAARRGDRVAALAAVNCFAWRPTGPLFRGMLAAMGSA PVRELDAAINALARATSTRFGAGRHWSRADRAAFRAGIDAPARRAWHAYFRDARRAHA LYTDVDAALRGGLADRPLLTIFGQFNDPLRFQPRWKELFPTARQLQVRRGNHFPMCDD PDLVAGALTSFVQRST" gene 1333931..1334845 /locus_tag="Rv1191" /db_xref="GeneID:886041" CDS 1333931..1334845 /locus_tag="Rv1191" /function="UNKNOWN" /note="Rv1191, (MTCI364.03), len: 304 aa. Conserved hypothetical protein, similar to Q54528 RDMC from Streptomyces purpurascens (298 aa), FASTA scores: opt: 196, E(): 1.5e-05, (27.5% identity in 269 aa overlap); Rv0134|MTCI5.08 (300 aa), FASTA scores: opt: 197, E(): 6.6e-06, (26.4% identity in 299 aa overlap), some similarity to PIP_NEIGO|P42786 proline iminopeptidase (EC 3.4. 11.5) (310 aa), FASTA scores: opt: 196, E(): 1.3e-05, (32.2% identity in 152 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215707.1" /db_xref="GI:15608331" /db_xref="GOA:O05293" /db_xref="UniProtKB/TrEMBL:O05293" /db_xref="GeneID:886041" /translation="MAVAIARPKLEGNIAVGEDRRIGFAEFGAPQGRAVFWLHGTPGA RRQIPTEARVYAEHHNIRLIGVDRPGIGASTPHQYETILAFADDLRTIADTLGIDKMA VVGLSGGGPYTLACAAGLPDRVVAAGVLGGVAPTRGPDAISGGLMRLGSAVAPLLQVG GTPLRLGASLLIRAARPVASPALDLYGLLSPRADRHLLARPEFKAMFLDDLLNGSRKQ LAAPFADVIAFARDWGFRLDEVKVPVRWWHGDHDHIVPFSHGEHVVSRLPDAKLLHLP GESHLAGLGRGEEILSTLMQIWDRDLRK" misc_feature 1334141..1334218 /locus_tag="Rv1191" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene 1334927..1335754 /locus_tag="Rv1192" /db_xref="GeneID:886007" CDS 1334927..1335754 /locus_tag="Rv1192" /function="UNKNOWN" /note="Rv1192, (MTCI364.04), len: 275 aa. Hypothetical unknown protein, contains PS00120 lipases, serine active site." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215708.1" /db_xref="GI:15608332" /db_xref="UniProtKB/TrEMBL:O05294" /db_xref="GeneID:886007" /translation="MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRM LPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDL LDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWS FNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRS SHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA" misc_feature 1335266..1335295 /locus_tag="Rv1192" /note="PS00120 Lipases, serine active site" gene 1335794..1337215 /gene="fadD36" /locus_tag="Rv1193" /db_xref="GeneID:886074" CDS 1335794..1337215 /gene="fadD36" /locus_tag="Rv1193" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_215709.1" /db_xref="GI:15608333" /db_xref="GOA:O05295" /db_xref="UniProtKB/TrEMBL:O05295" /db_xref="GeneID:886074" /translation="MLLASLNPAVVSAADIADAVRIDGDVLSRSDLVGAATSVAERVA GAHRVAVLATPTASTVLAITGCLIAGVPVVPVPADVGVTERRHMLTDSGVQAWLGPLP DDPAGLPHIPVRTHARSWHRYPEPSPGAIAMVVYTSGTTGPPKGVQLSRRAIAADLDA LAEAWQWTAEDVLVHGLPLYHVHGLVLGLLGSLRFGNRFVHTGKPTPAGYAQACYEAH GTLFFGVPTVWSRVAADQAAAGALKPARLLVSGSAALPVPVFDKLVQLTGHRPVERYG ASESLITLSTRADGERRPGWVGLPLAGVQTRLVDDDGGEVPHDGETVGKLQVRGPTLF DGYLNQPDATAAAFDADSWYRTGDVAVVDGSGMHRIVGRESVDLIKSGGYRVGAGEIE TVLLGHPDVAEAAVVGVPDDDLGQRIVAYVVGSANVDADGLINFVAQQLSVHKRPREV RIVDALPRNALGKVLKKQLLSEG" misc_feature 1336193..1336228 /gene="fadD36" /locus_tag="Rv1193" /note="PS00455 Putative AMP-binding domain signature" gene complement(1337248..1338513) /locus_tag="Rv1194c" /db_xref="GeneID:886040" CDS complement(1337248..1338513) /locus_tag="Rv1194c" /function="UNKNOWN" /note="Rv1194c, (MTCI364.06c), len: 421 aa. Conserved hypothetical protein, highly similar to Q50018 possible transcriptional activator from Mycobacterium leprae (517 aa), FASTA scores: opt: 1960, E(): 0, (69.8% identity in 421 aa overlap). Also similar to Mycobacterium tuberculosis Rv2370c|MTCY27.10, (62.0% identity in 421 aa overlap) and Rv1453|MTCY493.01c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215710.1" /db_xref="GI:15608334" /db_xref="UniProtKB/TrEMBL:O05296" /db_xref="GeneID:886040" /translation="MAWQQPSPRIRELIREGARIALNPSPEWIEELDRATIAANPAIA NDPVLAKVVQTANRANLVYWAAANLRDPGARVPANLGTEPLRMARDLVRRGLDTVAFN IYRTGEHIGWRFWMGIAFELTSDPQELRELLDVSARSVNDFIEATLTGIAAQVQSEHD ELTRSTHAERLEVVGLILDGAPISPERAEAKLGYPLSRAHTAAIIWSDELDGDHSYLD RAADLFCHAVGSTRPLTVVAGAASRWAWVTDADGLDIDTVQAAVDNAPGARIAIGTTA NGVEGFRRSHLEALITQRTLSRLRSTQRVAFFADVKMVALISQNPDAASEFITSTLGD LESASPDLQTALLTFINEQCNASRAAKRLHTHRNTFLRRLESAQRLLPRPLDHTSVHV AVALEALQWRGNKAHALSSPGRRSNSVPA" gene 1339003..1339302 /gene="PE13" /locus_tag="Rv1195" /db_xref="GeneID:886044" CDS 1339003..1339302 /gene="PE13" /locus_tag="Rv1195" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1195, (MTCI364.07), len: 99 aa. Member of Mycobacterium tuberculosis PE family (see Brennan & Delogu 2002), e.g. Y0DP_MYCTU|Q50615 hypothetical glycine-rich 40.8 kd protein (498 aa), FASTA scores: opt: 307, E(): 1.4e-12, (56.3% identity in 96 aa overlap), similar to MTCY21C12.10c (99 aa), FASTA scores: opt:295, E(): 1.9e-11, (51.5% identity in 97 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177794.1" /db_xref="GI:57116840" /db_xref="UniProtKB/TrEMBL:Q79FR3" /db_xref="GeneID:886044" /translation="MSFVMAYPEMLAAAADTLQSIGATTVASNAAAAAPTTGVVPPAA DEVSALTAAHFAAHAAMYQSVSARAAAIHDQFVATLASSASSYAATEVANAAAAS" gene 1339349..1340524 /gene="PPE18" /locus_tag="Rv1196" /db_xref="GeneID:886073" CDS 1339349..1340524 /gene="PPE18" /locus_tag="Rv1196" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1196, (MTCI364.08), len: 391 aa. PPE18 (alternate gene name: mtb39a). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, highly similar to others e.g. Y07P_MYCTU|Q11031 hypothetical 40.0 kDa protein cy02b10.25c (396 aa), FASTA scores: opt: 2124, E(): 0, (85.1% identity in 397 aa overlap). Note that expression of Rv1196 was demonstrated in lysates by immunodetection (see Dillon et al., 1999).; mtb39a" /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177795.1" /db_xref="GI:57116841" /db_xref="UniProtKB/TrEMBL:Q7D8M9" /db_xref="GeneID:886073" /translation="MVDFGALPPEINSARMYAGPGSASLVAAAQMWDSVASDLFSAAS AFQSVVWGLTVGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYETAY GLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAAAMFGYAAATA TATATLLPFEEAPEMTSAGGLLEQAAAVEEASDTAAANQLMNNVPQALQQLAQPTQGT TPSSKLGGLWKTVSPHRSPISNMVSMANNHMSMTNSGVSMTNTLSSMLKGFAPAAAAQ AVQTAAQNGVRAMSSLGSSLGSSGLGGGVAANLGRAASVGSLSVPQAWAAANQAVTPA ARALPLTSLTSAAERGPGQMLGGLPVGQMGARAGGGLSGVLRVPPRPYVMPHSPAAG" gene 1340659..1340955 /gene="esxK" /locus_tag="Rv1197" /db_xref="GeneID:886051" CDS 1340659..1340955 /gene="esxK" /locus_tag="Rv1197" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1197, (MT1235, MTCI364.09), len: 98 aa. esxK, ESAT-6 like protein (see citation below). Member of M. tuberculosis hypothetical QILSS protein family with Rv1038c, etc. Almost identical to MTCY98.023c (98 aa) (99.0% identity in 98 aa overlap) and MTCY10G2.11 (98 aa), FASTA scores: opt: 643, E(): 0, (99.0% identity in 98 aa overlap); highly similar to Q49945|U1756C from Mycobacterium leprae (100 aa), FASTA scores: opt: 377, E(): 8e-21, (58.3% identity in 96 aa overlap). BELONGS TO THE ESAT6 FAMILY.; ES6_3, TB11.0, QILSS" /codon_start=1 /transl_table=11 /product="Esat-6 like protein esxK (Esat-6 like protein 3)" /protein_id="NP_215713.1" /db_xref="GI:15608337" /db_xref="UniProtKB/Swiss-Prot:O05299" /db_xref="GeneID:886051" /translation="MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGMAEATSLDTMAQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" gene 1341006..1341290 /gene="esxL" /locus_tag="Rv1198" /db_xref="GeneID:886090" CDS 1341006..1341290 /gene="esxL" /locus_tag="Rv1198" /function="UNKNOWN" /note="Rv1198, (MT1236, MTCI364.10), len: 94 aa. esxL, ESAT-6 like protein (see citation below). Member of the ESAT-6 family with Rv3619c, Rv1037c, etc. Almost identical to MTCY10G2.12 (94 aa) (97.9% identity in 94 aa overlap) and MTCY98.022c (94 aa) (94.7% identity in 94 aa overlap). Highly similar to Q49946|U1756D Mycobacterium leprae (95 aa), FASTA scores: opt: 403, E(): 1.1e-22, (64.1% identity in 92 aa overlap). SEEMS TO BELONG TO THE ESAT6 FAMILY.; ES6_4, Mtb9.9C" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXL (ESAT-6 like protein 4)" /protein_id="NP_215714.1" /db_xref="GI:15608338" /db_xref="UniProtKB/Swiss-Prot:O05300" /db_xref="GeneID:886090" /translation="MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIIRDVLTASDFWGG AGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" gene complement(1341358..1342605) /locus_tag="Rv1199c" /db_xref="GeneID:886092" CDS complement(1341358..1342605) /locus_tag="Rv1199c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1081." /note="Rv1199c, (MTCI364.11c), len: 415 aa. Possible transposase for IS1081, identical to TRA1_MYCBO|P35882 transposase for insertion sequence element (415 aa); region identical to MTCY441.35 (100.0% identity in 261 aa overlap); and almost identical to MTCY10G2.02c (415 aa) (99.8% identity in 415 aa overlap). Contains PS01007 Transposases, Mutator family, signature, PS00435 Peroxidases proximal heme-ligand signature." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215715.1" /db_xref="GI:15608339" /db_xref="GOA:P60230" /db_xref="UniProtKB/Swiss-Prot:P60230" /db_xref="GeneID:886092" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" repeat_region complement(1341361..1342789) /note="IS1081-2, len: 1429 bp. Insertion sequence IS1081." /mobile_element="insertion sequence:IS1081-2" misc_feature complement(1341835..1341909) /locus_tag="Rv1199c" /note="PS01007 Transposases, Mutator family, signature" misc_feature complement(1342051..1342083) /locus_tag="Rv1199c" /note="PS00435 Peroxidases proximal heme-ligand signature" gene 1342942..1344219 /locus_tag="Rv1200" /db_xref="GeneID:886087" CDS 1342942..1344219 /locus_tag="Rv1200" /function="UNKNOWN; PROBABLY INVOLVED IN TRANSPORT ACCROSS THE MEMBRANE (PROBABLY SUGAR TRANSPORT)." /note="Rv1200, (MTCI364.12), len: 425 aa. Probable conserved integral membrane transport protein, possibly member of major facilitator superfamily (MFS), similar to others e.g. YHJE_ECOLI|P37643 hypothetical metabolite transport protein from Escherichia coli (440 aa), FASTA scores: opt: 1047, E(): 0, (39.1% identity in 427 aa overlap); etc. Contains PS00217 Sugar transport proteins signature 2. The transcription of this CDS seems to be activated in macrophages (see citation below)." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_215716.1" /db_xref="GI:15608340" /db_xref="GOA:O05301" /db_xref="UniProtKB/TrEMBL:O05301" /db_xref="GeneID:886087" /translation="MKRVALACLVGSAIEFYDFLIYGTAAALVFPTVFFPHLDPTVAA VASMGTFAVAFLSRPFGAAVFGYFGDRLGRKKTLVATLLIMGLATVTVGLVPTTVAIG AAAPLILTTMRLLQGFAVGGEWAGSALLSAEYAPASKRGWYGMFTVVGGGIALVLTSL TFLGVNYTIGESSPTFMQWGWRIPFLVSAALIAVALYVRFNIDETPVFARERADEKTR LGPAETPIAQVLRRQRREIVLAAGSAVCCFGFVYLASTYLASYAQTRLGYSRGSILFD SVLGGLLCIVFTALSSALCDQLGRRRVLLAGWAVALPWSLLVMPLIDSGSPSLFAVAV VGMYAIGGFGFGPTASFIPELFATSYRYTGSALAANLAGVAGGALPPVIAGALVATYG SWAIGVMLAILALISLVCTYRLPETAGSALVSR" misc_feature 1343284..1343361 /locus_tag="Rv1200" /note="PS00217 Sugar transport proteins signature 2" gene complement(1344216..1345169) /locus_tag="Rv1201c" /db_xref="GeneID:886088" CDS complement(1344216..1345169) /locus_tag="Rv1201c" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1201c, (MTCI364.13c), len: 317 aa. Probable transferase (EC 2.-.-.-). Highly similar to Q49948|U1756F Mycobacterium leprae (317 aa), FASTA scores: opt: 1776, E(): 0, (84.9% identity in 317 aa overlap), also Q46064 ORF3 protein from CORYNEBACTERIUM GLUTAMICUM (316 aa), FASTA scores: opt: 864, E(): 0, (44.1% identity in 311 aa overlap)." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="NP_215717.1" /db_xref="GI:15608341" /db_xref="GOA:O05302" /db_xref="UniProtKB/TrEMBL:O05302" /db_xref="GeneID:886088" /translation="MSTVTGAAGIGLATLAADGSVLDTWFPAPELTESGTSATSRLAV SDVPVELAALIGRDDDRRTETIAVRTVIGSLDDVAADPYDAYLRLHLLSHRLVAPHGL NAGGLFGVLTNVVWTNHGPCAIDGFEAVRARLRRRGPVTVYGVDKFPRMVDYVVPTGV RIADADRVRLGAHLAPGTTVMHEGFVNYNAGTLGASMVEGRISAGVVVGDGSDVGGGA SIMGTLSGGGTHVISIGKRCLLGANSGLGISLGDDCVVEAGLYVTAGTRVTMPDSNSV KARELSGSSNLLFRRNSVSGAVEVLARDGQGIALNEDLHAN" gene 1345260..1346324 /gene="dapE" /locus_tag="Rv1202" /db_xref="GeneID:887386" CDS 1345260..1346324 /gene="dapE" /locus_tag="Rv1202" /EC_number="3.5.1.18" /function="INVOLVED IN LYSINE BIOSYNTHESIS [CATALYTIC ACTIVITY: N-succinyl-LL-2,6-diaminoheptanedioate + H2O = succinate + LL-2,6-diaminoheptanedioate]" /note="catalyzes the formation of succinate and diaminoheptanedioate from succinyldiaminoheptanedioate" /codon_start=1 /transl_table=11 /product="succinyl-diaminopimelate desuccinylase" /protein_id="YP_177796.1" /db_xref="GI:57116842" /db_xref="GOA:Q7D8M5" /db_xref="UniProtKB/TrEMBL:Q7D8M5" /db_xref="GeneID:887386" /translation="MLDLRGDPIELTAALIDIPSESRKEARIADEVEAALRAQASGFE IIRNGNAVLARTKLNRSSRVLLAGHLDTVPVAGNLPSRRENDQLHGCGAADMKSGDAV FLHLAATLAEPTHDLTLVFYDCEEIDSAANGLGRIQRELPDWLSADVAILGEPTAGCI EAGCQGTLRVVLSVTGTRAHSARSWLGDNAIHKLGAVLDRLAVYRARSVDIDGCTYRE GLSAVRVAGGVAGNVIPDAASVTINYRFAPDRSVAAALQHVHDVFDGLDVQIEQTDAA AGALPGLSEPAAKALVEAAGGQVRAKYGWTDVSRFAALGIPAVNYGPGDPNLAHCRDE RVPVGNITAAVDLLRRYLGG" gene complement(1346321..1346905) /locus_tag="Rv1203c" /db_xref="GeneID:886086" CDS complement(1346321..1346905) /locus_tag="Rv1203c" /function="UNKNOWN" /note="Rv1203c, (MTCI364.15c), len: 194 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215719.1" /db_xref="GI:15608343" /db_xref="UniProtKB/TrEMBL:O05304" /db_xref="GeneID:886086" /translation="MLLAYVLITKGEFGAAASMLEPAAATLERTGYSWGPLSLMLLAT AIAQQGHIAESAKTLQRAEARHGTKSALFAPELGLARAWTRAAAQDMTGAIAAAREAA RTAERAGQAAVALCAWHNAVRLGDIRAVDPVTRLAAEIDCTVGNILVKHARGLADGDA AELTAVAEELAGIGMAAAAADATKAAARLGPQQR" gene complement(1346936..1348624) /locus_tag="Rv1204c" /db_xref="GeneID:887552" CDS complement(1346936..1348624) /locus_tag="Rv1204c" /function="UNKNOWN" /note="Rv1204c, (MTCI364.16c), len: 562 aa. Conserved hypothetical protein, some similarity to Q55103 CHO-ORF2 from STREPTOMYCES SP. (642 aa), FASTA scores: opt: 215, E(): 3.6e-06, (26.4% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215720.1" /db_xref="GI:15608344" /db_xref="GOA:O05305" /db_xref="UniProtKB/TrEMBL:O05305" /db_xref="GeneID:887552" /translation="MRVWKHVEAAVDSPDRCGVVLVGPHGVGKTLLAQLAAEQVMSED GRSGRARWVVGTAPGRAIPFGAFRHLISLPASGADIGRPAALLRAARSSLTGDAGDLL LVVDDAHNLDPLSATLVYQLARAGAARLVVTVASEAEPPDAIAALWSDDLLTRVAIEP LDRAQTAAFVESALDATLDVADADELFRRSLGNPLYLRHLIDGGGLEHVDGRWRCRDE DRRPLSGVIDEYLCALPEPARAVVDYLAIAEPLARTDLVALVGGEQLDTLGQAEAAGA VRVGPDSDTSEIFVGHPLYADRARAVLTAEHAHALRVSLVAQLAKHPSDHVSDQLRLS SLAIDVPASATPAAVTDAATAAGQALRLGDVRLAERLARAALDRSDALAARLPLAYAL GWQGRGREADAVLAAVNPAELTETELMAWAIPRAANRFWMLNEPERATAFLQTTRSRV TEPTARSTLDALAATFAMNSGNLPRAITLATEVLSGPAADDMAVAWAASAAALSSARM GRFGDVDRLAERASAAEHPGLLRFTVGLAQITSLLLAGDVAPAQELAKRFTDFA" misc_feature complement(1348535..1348558) /locus_tag="Rv1204c" /note="PS00017 ATP/GTP-binding site motif A" gene 1348719..1349282 /locus_tag="Rv1205" /db_xref="GeneID:886075" CDS 1348719..1349282 /locus_tag="Rv1205" /function="UNKNOWN" /note="Rv1205, (MTCI364.17), len: 187 aa. Conserved hypothetical protein, similar to Q49952 cosmid B1756 from Mycobacterium leprae (187 aa), FASTA scores: opt: 865, E(): 0, (72.4% identity in 174 aa overlap), also similar to FAS6_RHOFA|P46378 hypothetical 21.1 kDa protein in fasciation locus (ORF6) (198 aa), FASTA scores: opt: 368, E(): 1.3e-17, (37.4% identity in 174 aa overlap). Some similarity to YJL055W Hypothetical protein in BTN1-PEP8 intergenic region from Saccharomyces cerevisiae and P48636 HYPOTHETICAL protein in AZU 5'REGION from Pseudomonas aeruginosa. The transcription of this CDS seems to be activated specifically in host granulomas (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215721.1" /db_xref="GI:15608345" /db_xref="UniProtKB/TrEMBL:O05306" /db_xref="GeneID:886075" /translation="MSAKIDITGDWTVAVYCAASPTHAELLELAAEVGAAIAGRGWTL VWGGGHVSAMGAVASAARACGGWTVGVIPKMLVYRELADHDADELIVTDTMWERKQIM EDRSDAFIVLPGGVGTLDELFDAWTDGYLGTHDKPIVMVDPWGHFDGLRAWLNGLLDT GYVSPTAMERLVVVDNVKDALRACAPS" gene 1349332..1351125 /gene="fadD6" /locus_tag="Rv1206" /db_xref="GeneID:887549" CDS 1349332..1351125 /gene="fadD6" /locus_tag="Rv1206" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A; may be involved in acyclic terpene utilization" /codon_start=1 /transl_table=11 /product="long-chain-acyl-CoA synthetase" /protein_id="NP_215722.1" /db_xref="GI:15608346" /db_xref="GOA:O05307" /db_xref="UniProtKB/TrEMBL:O05307" /db_xref="GeneID:887549" /translation="MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPN SKASIGTVFQDRAARYGDRVFLKFGDQQLTYRDANATANRYAAVLAARGVGPGDVVGI MLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIAESDLVSAVAE CGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKDTAFYIFTSGTTGFPKASV MTHHRWLRALAVFGGMGLRLKGSDTLYSCLPLYHNNALTVAVSSVINSGATLALGKSF SASRFWDEVIANRATAFVYIGEICRYLLNQPAKPTDRAHQVRVICGNGLRPEIWDEFT TRFGVARVCEFYAASEGNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRV RRVPDGEPGLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMG HAAFVDRLGDTFRWKGENVATTQVEAALASDQTVEECTVYGVQIPRTGGRAGMAAITL RAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKSRKVELRNQAYGADIED PLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG" misc_feature 1349941..1349976 /gene="fadD6" /locus_tag="Rv1206" /note="PS00455 Putative AMP-binding domain signature" misc_feature 1350133..1350156 /gene="fadD6" /locus_tag="Rv1206" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1351191..1352147 /gene="folP2" /locus_tag="Rv1207" /db_xref="GeneID:887447" CDS 1351191..1352147 /gene="folP2" /locus_tag="Rv1207" /EC_number="2.5.1.15" /function="INVOVLED IN DIHYDROFOLATE BIOSYNTHESIS [CATALYTIC ACTIVITY : 2-AMINO-4-HYDROXY-6-HYDROXYMETHYL-7,8-DIHYDROPTERIDINE DIPHOSPHATE + 4-AMINOBENZOATE = DIPHOSPHATE + DIHYDROPTEROATE]" /note="Rv1207, (MTCI364.19), len: 318 aa. Probable folP2, Dihydropteroate synthase 2 (EC 2.5.1.15), similar to many e.g. DHPS_ECOLI|P26282 Escherichia coli (282 aa), FASTA scores: opt: 480, E(): 1.9e-22, (34.4% identity in 270 aa overlap). Contains PS00792 dihydropteroate synthase signature 1, PS00793 dihydropteroate synthase signature 2." /codon_start=1 /transl_table=11 /product="dihydropteroate synthase 2 FolP2" /protein_id="NP_215723.1" /db_xref="GI:15608347" /db_xref="GOA:P64139" /db_xref="UniProtKB/Swiss-Prot:P64139" /db_xref="GeneID:887447" /translation="MRSTPPASAGRSTPPALAGHSTPPALAGHSTLCGRPVAGDRALI MAIVNRTPDSFYDKGATFSDAAARDAVHRAVADGADVIDVGGVKAGPGERVDVDTEIT RLVPFIEWLRGAYPDQLISVDTWRAQVAKAACAAGADLINDTWGGVDPAMPEVAAEFG AGLVCAHTGGALPRTRPFRVSYGTTTRGVVDAVISQVTAAAERAVAAGVAREKVLIDP AHDFGKNTFHGLLLLRHVADLVMTGWPVLMALSNKDVVGETLGVDLTERLEGTLAATA LAAAAGARMFRVHEVAATRRVLEMVASIQGVRPPTRTVRGLA" misc_feature 1351320..1351367 /gene="folP2" /locus_tag="Rv1207" /note="PS00792 Dihydropteroate synthase signature 1" misc_feature 1351422..1351463 /gene="folP2" /locus_tag="Rv1207" /note="PS00793 Dihydropteroate synthase signature 2" gene 1352144..1353118 /locus_tag="Rv1208" /db_xref="GeneID:886085" CDS 1352144..1353118 /locus_tag="Rv1208" /function="UNKNOWN" /note="Rv1208, (MTCI364.20), len: 324 aa. Conserved hypothetical protein, similar to Q49955|U1756L Mycobacterium leprae (318 aa), FASTA scores, opt: 1621, E(): 0, (80.5% identity in 318 aa overlap)." /codon_start=1 /transl_table=11 /product="putative glucosyl-3-phosphoglycerate synthase" /protein_id="NP_215724.1" /db_xref="GI:15608348" /db_xref="UniProtKB/TrEMBL:O05309" /db_xref="GeneID:886085" /translation="MTASELVAGDLAGGRAPGALPLDTTWHRPGWTIGELEAAKAGRT ISVVLPALNEEATIESVIDSISPLVDGLVDELIVLDSGSTDDTEIRAIASGARVVSRE QALPEVPVRPGKGEALWRSLAATSGDIVVFIDSDLINPHPLFVPWLVGPLLTGEGIQL VKSFYRRPLQVSDVTSGVCATGGGRVTELVARPLLAALRPELGCVLQPLSGEYAASRE LLTSLPFAPGYGVEIGLLIDTFDRLGLDAIAQVNLGVRAHRNRPLDELGAMSRQVIAT LLSRCGIPDSGVGLTQFLPGGPDDSDYTRHTWPVSLVDRPPMKVMRPR" gene 1353157..1353525 /locus_tag="Rv1209" /db_xref="GeneID:887600" CDS 1353157..1353525 /locus_tag="Rv1209" /function="UNKNOWN" /note="Rv1209, (MTCI364.21), len: 122 aa. Conserved hypothetical protein, containing a hydrophobic N-terminus. Similar to Q49956|U1756M hypothetical protein from Mycobacterium leprae (114 aa), FASTA scores: opt: 524, E(): 8.9e-29, (78.6% identity in 112 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215725.1" /db_xref="GI:15608349" /db_xref="UniProtKB/TrEMBL:O05310" /db_xref="GeneID:887600" /translation="MALVLVYLVVLVLVAIVLFAAASLLFGRGEQLPPLPRATTATTL PAFGVTRADVDAVKFTQVLRGYKTSEVDWVLERLGRELEALRSQLGAIHASSEDAEAE SDASNPSRGETVVHYRSDPA" gene 1353522..1354136 /gene="tagA" /locus_tag="Rv1210" /db_xref="GeneID:888035" CDS 1353522..1354136 /gene="tagA" /locus_tag="Rv1210" /EC_number="3.2.2.20" /function="INVOLVED IN BASE EXCISION REPAIR. HYDROLYSIS OF THE DEOXYRIBOSE N-GLYCOSIDIC BOND TO EXCISE 3-METHYLADENINE FROM THE DAMAGED DNA POLYMER FORMED BY ALKYLATION LESIONS" /note="Rv1210, (MTCI364.22), len: 204 aa. Probable tagA, DNA-3-methyladenine glycosidase I (EC 3.2.2.20) (see citation below), similar to several e.g. 3MG1_ECOLI|P05100 DNA-3-methyladenine glycosidase I from Escherichia coli (187 aa), FASTA scores: opt: 530, E(): 1.3e-27, (44.2% identity in 190 aa overlap); similar to Q49957 Mycobacterium leprae cosmid B1756 (192 aa), FASTA scores: opt: 1042, E(): 0, (80.2% identity in 192 aa overlap)." /codon_start=1 /transl_table=11 /product="DNA-3-methyladenine glycosylase I" /protein_id="NP_215726.1" /db_xref="GI:15608350" /db_xref="GOA:O05311" /db_xref="UniProtKB/TrEMBL:O05311" /db_xref="GeneID:888035" /translation="MSGDGLVRCPWAEVRPGPDAQLYRDYHDNEWGRPLYGRVALFER MSLEAFQSGLSWLIILRKRENFRRAFSGFDIDKIARYTDTDVRRLLADDGIVRNRAKI EATIANARAAADLGSSEDLSELLWSFAPPPRPRPVDGSEIPSVSTESKAMSRELKRRG FRFVGPTTAYALMQATGMVDDHIQACWVPTERPFDQPGCPMAAR" gene 1354243..1354470 /locus_tag="Rv1211" /db_xref="GeneID:887990" CDS 1354243..1354470 /locus_tag="Rv1211" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1211, (MTCI364.23), len: 75 aa. Conserved hypothetical protein, similar to Q49958|U1756N Mycobacterium leprae (75 aa), FASTA scores: opt: 460, E(): 0, (90.7% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215727.1" /db_xref="GI:15608351" /db_xref="UniProtKB/TrEMBL:O05312" /db_xref="GeneID:887990" /translation="MLGADQARAGGPARIWREHSMAAMKPRTGDGPLEATKEGRGIVM RVPLEGGGRLVVELTPDEAAALGDELKGVTS" gene complement(1354498..1355661) /locus_tag="Rv1212c" /db_xref="GeneID:887805" CDS complement(1354498..1355661) /locus_tag="Rv1212c" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1212c, (MTCI364.24c), len: 387 aa. Putative glycosyl transferase (EC 2.-.-.-), highly similar to AJ243803|SCO243803_2 Putative glycosyl transferase from Streptomyces coelicolor (387 aa), FASTA scores: opt: 1344, E(): 0, (54.9% identity in 388 aa overlap). Also similar to MJ1607 probable hexosyltransferase (EC 2.4.1.-) from Methanococcus jannaschii (390 aa), FASTA scores: opt: 445, E(): 7.8e-23, (27.9% identity in 401 aa overlap). The region from aa 267-355 highly similar to Q49959 COSMID B1756 from Mycobacterium leprae (91 aa), FASTA scores, opt: 471, E(): 4.8e-25, (80.9% identity in 89 aa overlap). Similar to Mycobacterium tuberculosis hypothetical protein, Rv3032." /codon_start=1 /transl_table=11 /product="putative glycosyl transferase" /protein_id="NP_215728.1" /db_xref="GI:15608352" /db_xref="GOA:O05313" /db_xref="UniProtKB/TrEMBL:O05313" /db_xref="GeneID:887805" /translation="MRVAMLTREYPPEVYGGAGVHVTELVAYLRRLCAVDVHCMGAPR PGAFAYRPDPRLGSANAALSTLSADLVMANAASAATVVHSHTWYTALAGHLAAILYDI PHVLTAHSLEPLRPWKKEQLGGGYQVSTWVEQTAVLAANAVIAVSSAMRNDMLRVYPS LDPNLVHVIRNGIDTETWYPAGPARTGSVLAELGVDPNRPMAVFVGRITRQKGVVHLV TAAHRFRSDVQLVLCAGAADTPEVADEVRVAVAELARNRTGVFWIQDRLTIGQLREIL SAATVFVCPSVYEPLGIVNLEAMACATAVVASDVGGIPEVVADGITGSLVHYDADDAT GYQARLAEAVNALVADPATAERYGHAGRQRCIQEFSWAYIAEQTLDIYRKVCA" gene 1355836..1357050 /gene="glgC" /locus_tag="Rv1213" /db_xref="GeneID:887933" CDS 1355836..1357050 /gene="glgC" /locus_tag="Rv1213" /EC_number="2.7.7.27" /function="INVOLVED IN GLYCOGEN BIOSYNTHESIS (FIRST STEP) [CATALYTIC ACTIVITY:ATP + ALPHA-D-GLUCOSE 1-PHOSPHATE = DIPHOSPHATE + ADP-GLUCOSE]." /note="catalyzes the formation of ADP-glucose and diphosphate from ATP and alpha-D-glucose 1-phosphate" /codon_start=1 /transl_table=11 /product="glucose-1-phosphate adenylyltransferase" /protein_id="NP_215729.1" /db_xref="GI:15608353" /db_xref="GOA:P64241" /db_xref="UniProtKB/Swiss-Prot:P64241" /db_xref="GeneID:887933" /translation="MREVPHVLGIVLAGGEGKRLYPLTADRAKPAVPFGGAYRLIDFV LSNLVNARYLRICVLTQYKSHSLDRHISQNWRLSGLAGEYITPVPAQQRLGPRWYTGS ADAIYQSLNLIYDEDPDYIVVFGADHVYRMDPEQMVRFHIDSGAGATVAGIRVPRENA TAFGCIDADDSGRIRSFVEKPLEPPGTPDDPDTTFVSMGNYIFTTKVLIDAIRADADD DHSDHDMGGDIVPRLVADGMAAVYDFSDNEVPGATDRDRAYWRDVGTLDAFYDAHMDL VSVHPVFNLYNKRWPIRGESENLAPAKFVNGGSAQESVVGAGSIISAASVRNSVLSSN VVVDDGAIVEGSVIMPGTRVGRGAVVRHAILDKNVVVGPGEMVGVDLEKDRERFAISA GGVVAVGKGVWI" gene complement(1357293..1357625) /gene="PE14" /locus_tag="Rv1214c" /db_xref="GeneID:888362" CDS complement(1357293..1357625) /gene="PE14" /locus_tag="Rv1214c" /function="UNKNOWN" /note="Rv1214c, (MTCI364.26c), len: 110 aa. Member of Mycobacterium tuberculosis PE family (see citation below), appears to be frameshifted but sequence appears to be correct. The 5'-end is atypical as first 9 aa appear to be missing." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177797.1" /db_xref="GI:57116843" /db_xref="UniProtKB/TrEMBL:Q7D8L5" /db_xref="GeneID:888362" /translation="MLASAATDLAGIGSALSAANAAAAAPTTAMLAACADEVSAVVAS LFARHAQAYQALSLQATAFHQQFVQALTGAGGAYAAAEAVNAAVAQSVQQDVLNVINA PTQALFDR" gene complement(1357759..1359444) /locus_tag="Rv1215c" /db_xref="GeneID:887684" CDS complement(1357759..1359444) /locus_tag="Rv1215c" /function="UNKNOWN" /note="Rv1215c, (MTCI364.27c), len: 561 aa. Conserved hypothetical protein, low similarity to Rv1835c|Y0D8_MYCTU|Q50598 hypothetical 69.9 kDa protein cy1a11.08 (628 aa), FASTA scores: opt: 257, E(): 1.3e-09, (34.1% identity in 185 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215731.1" /db_xref="GI:15608355" /db_xref="GOA:O05316" /db_xref="UniProtKB/TrEMBL:O05316" /db_xref="GeneID:887684" /translation="MARNPSPALDRPWRRPGALRYALERVRGVAKPPITVTDPPADVV IERDVEVPTRDGTLLRINVFRSAEGGARPVIASIHPYGKDALPRRRGNRWTFSPQYRM LRQPKPLTFSALTGWEAPDPAWWTAQGFVVVNADSRGCGRSDGTGDLLSHQEAEDTYD LVGWLADQAWSDGRVVMLGVSYLAISQYAVAALQPPALRAICPWEGFTDAYRDLAFPG GIRESGFTRLWSRGVRRRTRQTYDMEQMQEAHPLRDDFWRSRVPDLSAIKVPMLVCGS FSDNNLHSRGSIRAFTRSGCGHARLYTHRGGKWETFYSATALSEQLKFLRDALAGSSG SRSVRLEVREDRDTITAVREETQWPLAGTRWRPMYLAGPGLLATEPPPTAGSIRFQTR SRAAAFNWTIPEDIELTGPMAARLWVQLDGCDDANLFVGVEKWRDGQFVAFEGSYGWG RDRVTTGWQRVSLRELDPELSQPWEPVPACARPRPVTAGEVVAVDVALGPSATLFRAG EQLRLVVGGRWLSPRNPLTGQFPAAYPRPPRGRVTLHWGPRYDAHLLIPEVPG" gene complement(1359472..1360146) /locus_tag="Rv1216c" /db_xref="GeneID:888125" CDS complement(1359472..1360146) /locus_tag="Rv1216c" /function="UNKNOWN" /note="Rv1216c, (MTCI364.28c), len: 224 aa. Probable conserved integral membrane protein, C-terminal region similar to Q49963|U1756P from Mycobacterium leprae (134 aa), FASTA scores: opt: 311, E(): 3.3e-15, (52.2% identity in 113 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215732.1" /db_xref="GI:15608356" /db_xref="GOA:O05317" /db_xref="UniProtKB/TrEMBL:O05317" /db_xref="GeneID:888125" /translation="MHIGLKIFIWGVLGLVVFGALLFGPAGTFDYWQAWVFLAAFVST TIGPTIYLARNDPAALQRRMRSGPLAEGRTIQKFIVIGAFLGFFAMMVLSACDHRYGW SSVPAAVCVIGDVLVMTGLGIAMLVVIQNRYAASTVRVEAGQILASDGLYKIVRHPMY AGNVVMMTGIPLALGSYWAMFILVPGTLVLVFRILDEEKLLTQELSGYREYRQLVRYR LVPYVW" gene complement(1360155..1361801) /locus_tag="Rv1217c" /db_xref="GeneID:888401" CDS complement(1360155..1361801) /locus_tag="Rv1217c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF TETRONASIN ACROSS THE MEMBRANE (EXPORT): TETRONASIN RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1217c, (MTCI364.29c), len: 548 aa. Probable tetronasin-transport integral membrane ABC transporter (see citation below), similar to many e.g. AL049754|SCH10_12 probable ABC-type transport system membrane-spanning protein from Streptomyces coelicolor (539 aa), FASTA scores: opt: 1309, E(): 0, (40.9% identity in 550 aa overlap); Q54407|X73633 TnrB3 protein from Streptomyces longisporoflavus (337 aa), FASTA scores: opt: 692, E(): 0, (39.5% identity in 324 aa overlap); etc. Also has regions similar to Mycobacterium leprae proteins Q49964|U1756Q (109 aa), FASTA scores: opt: 431, E(): 3.1e-20, (64.8% identity in 105 aa overlap) and Q49965|U1756R (82 aa), FASTA scores: opt:154, E(): 0.0028, (61.0% identity in 41 aa overlap)." /codon_start=1 /transl_table=11 /product="tetronasin-transport integral membrane protein ABC transporter" /protein_id="NP_215733.1" /db_xref="GI:15608357" /db_xref="UniProtKB/TrEMBL:O05318" /db_xref="GeneID:888401" /translation="MSSTVIDRARPAGHRAPHRGSGFTGTLGLLRLYLRRDRVSLPLW VLLLSVPLATVYIASVETVYPDRSARAAAAAAIMASPAQRALYGPVYNDSLGAVGIWK AGMFHTLIAVAVILTVIRHTRADEESGRAELIDSTVVGRYANLTGALLLSFGASIATG AIGALGLLATDVAPAGSVAFGVALAASGMVFTAVAAVAAQLSPSARFTRAVAFAVLGT AFALRAIGDAGSGTLSWCSPLGWSLQVRPYAGERWWVLLLSLATAAVLTVLAYRLRAG RDVGAGLIAERPGAGTAGPMLSEPFGLAWRLNRGSLLLWTVGLCLYGLVMGSVVHGIG DQLGDNTAVRDIVTRMGGTGALEQAFLALAFTMIGMVAAAFAVSLTLRLHQEETGLRA ETLLAGAVSRTHWLASHLAMALAGSAVATLISGVAAGLAYGMTVGDVGGKLPTVVGTA AVQLPAVWLLSAVTVGLFGLAPRFTPVAWGVLVGFIALYLLGSLAGFPQMLLNLEPFA HIPRVGGGDFTAVPLLWLLAIDAALITLGAMAFRRRDVRC" gene complement(1361798..1362733) /locus_tag="Rv1218c" /db_xref="GeneID:888518" CDS complement(1361798..1362733) /locus_tag="Rv1218c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF TETRONASIN ACROSS THE MEMBRANE (EXPORT): TETRONASIN RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv1218c, (MTCI61.01c), len: 311 aa. Probable tetronasin-transport ATP-binding protein ABC transporter (see citation below), similar to many e.g. Q54406|X73633|TNRB2 TNRB2 PROTEIN from Streptomyces longisporoflavus (300 aa), FASTA scores: opt: 1133, E(): 0, (60.8% identity in 291 aa overlap); etc. Also similar to others in Mycobacterium tuberculosis e.g. MTCY19H9.04 (30.0% identity in 297 aa overlap); etc. Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="tetronasin-transport ATP-binding protein ABC transporter" /protein_id="NP_215734.1" /db_xref="GI:15608358" /db_xref="GOA:O86311" /db_xref="UniProtKB/TrEMBL:O86311" /db_xref="GeneID:888518" /translation="MSADNHQVPIEIRGLTKHFGSVRALDGLDLTVREGEVHGFLGPN GAGKSTTLRILLGLVKADGGSVRLLGGDPWTDAVDLHRHIAYVPGDVTLWPSLTGGET IDLLARMRGGIDNARRAELIERFGLDPTKKARTYSKGNRQKVSLISALSSHATLLLLD EPSSGLDPLMENVFQQCIGEARQRGVTVLLSSHILAETEALCEKVTIIRAGKTVESGS LDALRHLSRTSIKAEMIGDPGDLSQIKGVEDISIEGTTVRAQVDSESLRELIQVLGHA GVRSLVSQPPTLEELFLRHYSLGPEVAAEQQVATP" misc_feature complement(1362284..1362328) /locus_tag="Rv1218c" /note="PS00211 ABC transporters family signature" misc_feature complement(1362587..1362610) /locus_tag="Rv1218c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1362723..1363361) /locus_tag="Rv1219c" /db_xref="GeneID:888582" CDS complement(1362723..1363361) /locus_tag="Rv1219c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1219c, (MTCI61.02c), len: 212 aa. Probable transcriptional regulatory protein, some similarity in N-terminus to YBIH_ECOLI|P41037 hypothetical transcriptional regulator from Escherichia coli (103 aa), FASTA scores: opt: 143, E(): 8.9e-06, (39.7% identity in 63 aa overlap); Helix turn helix motif from aa 28-49." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215735.1" /db_xref="GI:15608359" /db_xref="GOA:O86312" /db_xref="UniProtKB/TrEMBL:O86312" /db_xref="GeneID:888582" /translation="MRSADLTAHARIREAAIEQFGRHGFGVGLRAIAEAAGVSAALVI HHFGSKEGLRKACDDFVAEEIRSSKAAALKSNDPTTWLAQMAEIESYAPLMAYLVRSM QSGGELAKMLWQKMIDNAEEYLDEGVRAGTVKPSRDPRARARFLAITGGGGFLLYLQM HENPTDLRAALRDYAHDMVLPSLEVYTEGLLADRAMYEAFLAEAQQGEAHVG" gene complement(1363503..1364150) /locus_tag="Rv1220c" /db_xref="GeneID:888419" CDS complement(1363503..1364150) /locus_tag="Rv1220c" /EC_number="2.1.1.-" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM" /note="Rv1220c, (MTCI61.03c), len: 215 aa. Possible methyltransferase (EC 2.1.1.-), some similarity to MDMC_STRMY|Q00719 o-methyltransferase from Streptomyces mycarofaciens (221 aa), FASTA scores; opt: 289, E(): 1.3e-07, (30.0% identity in 203 aa overlap). Also similar to Mycobacterium tuberculosis methyltransferases Rv0187|MTCI28.26 (32.9% identity in 222 aa overlap) and Rv1703c. Start site chosen by homology; other possible start sites exist upstream." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="NP_215736.1" /db_xref="GI:15608360" /db_xref="GOA:O33219" /db_xref="UniProtKB/TrEMBL:O33219" /db_xref="GeneID:888419" /translation="MPGQPAPSRGESLWAHAEGSISEDVILAGARERATDIGAGAVTP AVGALLCLLAKLSGGKAVAEVGTGAGVSGLWLLSGMRDDGVLTTIDIEPEHLRLARQA FAEAGIGPSRTRLISGRAQEVLTRLADASYDLVFIDADPIDQPDYVAEGVRLLRSGGV IVVHRAALGGRAGDPGARDAEVIAVREAARLIAEDERLTPALVPLGDGVLAAVRD" gene 1364413..1365186 /gene="sigE" /locus_tag="Rv1221" /db_xref="GeneID:888751" CDS 1364413..1365186 /gene="sigE" /locus_tag="Rv1221" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED. SEEMS TO BE REGULATED BY SIGH (Rv3223c PRODUCT). SEEMS TO REGULATE THE HEAT-SHOCK RESPONSE." /experiment="experimental evidence, no additional details recorded" /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription; in M. tuberculosis this protein is involved in heat shock, oxidative stress and virulence" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigE" /protein_id="NP_215737.1" /db_xref="GI:15608361" /db_xref="GOA:O06289" /db_xref="UniProtKB/TrEMBL:O06289" /db_xref="GeneID:888751" /translation="MELLGGPRVGNTESQLCVADGDDLPTYCSANSEDLNITTITTLS PTSMSHPQQVRDDQWVEPSDQLQGTAVFDATGDKATMPSWDELVRQHADRVYRLAYRL SGNQHDAEDLTQETFIRVFRSVQNYQPGTFEGWLHRITTNLFLDMVRRRARIRMEALP EDYDRVPADEPNPEQIYHDARLGPDLQAALASLPPEFRAAVVLCDIEGLSYEEIGATL GVKLGTVRSRIHRGRQALRDYLAAHPEHGECAVHVNPVR" gene 1365344..1365808 /locus_tag="Rv1222" /db_xref="GeneID:885196" CDS 1365344..1365808 /locus_tag="Rv1222" /function="UNKNOWN" /note="Rv1222, (MTCI61.05), len: 154 aa. Conserved hypothetical protein. Identical to O06290|MTU87242 (but shorter due to different start site chosen by proximity of RBS). Equivalent to O05736|U87308|MAU87308_2 hypothetical protein from Mycobacterium avium (133 aa), FASTA scores: opt: 644, E(): 7e-32, (86.2% identity in 109 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215738.1" /db_xref="GI:15608362" /db_xref="UniProtKB/TrEMBL:Q79FQ8" /db_xref="GeneID:885196" /translation="MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSI EAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGL LSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR" gene 1365875..1367461 /gene="htrA" /locus_tag="Rv1223" /db_xref="GeneID:888912" CDS 1365875..1367461 /gene="htrA" /locus_tag="Rv1223" /EC_number="3.4.21.-" /function="POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS (SEEMS TO CLEAVE PREFERENTIALLY AFTER SERINE RESIDUE)." /experiment="experimental evidence, no additional details recorded" /note="Rv1223, (MTCI61.06), len: 528 aa. Probable htrA (alternate gene name: degP), serine protease precursor (EC 3.4.21.-) (see citations below), equivalent to U15180|MLU15180_31|Q49972|ML1078|HTRA POSSIBLE SERINE PROTEASE from Mycobacterium leprae (533 aa), FASTA scores: opt: 2777, E(): 4.1e-141, (81.6% identity in 533 aa overlap). Also similar to many others e.g. HTRA_ECOLI|P09376 protease do precursor from Escherichia coli (EC 3.4.21.-) (474 aa), FASTA scores: opt: 581, E(): 9.1e-27, (36.3% identity in 278 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Start changed since first submission (-21 aa).; degP" /codon_start=1 /transl_table=11 /product="serine protease HtrA" /protein_id="NP_215739.2" /db_xref="GI:57116844" /db_xref="GOA:O06291" /db_xref="UniProtKB/TrEMBL:O06291" /db_xref="GeneID:888912" /translation="MDTRVDTDNAMPARFSAQIQNEDEVTSDQGNNGGPNGGGRLAPR PVFRPPVDPASRQAFGRPSGVQGSFVAERVRPQKYQDQSDFTPNDQLADPVLQEAFGR PFAGAESLQRHPIDAGALAAEKDGAGPDEPDDPWRDPAAAAALGTPALAAPAPHGALA GSGKLGVRDVLFGGKVSYLALGILVAIALVIGGIGGVIGRKTAEVVDAFTTSKVTLST TGNAQEPAGRFTKVAAAVADSVVTIESVSDQEGMQGSGVIVDGRGYIVTNNHVISEAA NNPSQFKTTVVFNDGKEVPANLVGRDPKTDLAVLKVDNVDNLTVARLGDSSKVRVGDE VLAVGAPLGLRSTVTQGIVSALHRPVPLSGEGSDTDTVIDAIQTDASINHGNSGGPLI DMDAQVIGINTAGKSLSDSASGLGFAIPVNEMKLVANSLIKDGKIVHPTLGISTRSVS NAIASGAQVANVKAGSPAQKGGILENDVIVKVGNRAVADSDEFVVAVRQLAIGQDAPI EVVREGRHVTLTVKPDPDST" misc_feature 1367072..1367095 /gene="htrA" /locus_tag="Rv1223" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1367463..1367858 /gene="tatB" /locus_tag="Rv1224" /db_xref="GeneID:887143" CDS 1367463..1367858 /gene="tatB" /locus_tag="Rv1224" /function="INVOLVED IN PROTEINS EXPORT. THIS SEC-INDEPENDENT PATHWAY IS TERMED TAT FOR TWIN-ARGININE TRANSLOCATION SYSTEM. THIS SYSTEM MAINLY TRANSPORTS PROTEINS WITH BOUND COFACTORS THAT REQUIRE FOLDING PRIOR TO EXPORT (BY SIMILARITY)." /note="mediates the export of protein precursors bearing twin-arginine signal peptides" /codon_start=1 /transl_table=11 /product="sec-independent translocase" /protein_id="NP_215740.1" /db_xref="GI:15608364" /db_xref="GOA:O33220" /db_xref="UniProtKB/Swiss-Prot:O33220" /db_xref="GeneID:887143" /translation="MFANIGWWEMLVLVMVGLVVLGPERLPGAIRWAASALRQARDYL SGVTSQLREDIGPEFDDLRGHLGELQKLRGMTPRAALTKHLLDGDDSLFTGDFDRPTP KKPDAAGSAGPDATEQIGAGPIPFDSDAT" gene complement(1367891..1368721) /locus_tag="Rv1225c" /db_xref="GeneID:887148" CDS complement(1367891..1368721) /locus_tag="Rv1225c" /function="UNKNOWN" /note="Rv1225c, (MTCI61.08c), len: 276 aa. Conserved hypothetical protein, some similarity to other hypothetical proteins e.g. AE001078|AE001078_2 Archaeoglobus fulgidus (265 aa), FASTA scores: opt: 339, E(): 5.1e-15, (27.1% identity in 262 aa overlap), and to NAGD_ECOLI|P15302 nagd protein from Escherichia coli (250 aa), FASTA scores: opt: 167, E(): 6.4e-12, (24.8% identity in 258 aa overlap). Also weakly similar to Mycobacterium tuberculosis hypothetical protein Rv3400|MTCY78.28c (29.1% identity in 251 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215741.1" /db_xref="GI:15608365" /db_xref="GOA:O33221" /db_xref="UniProtKB/TrEMBL:O33221" /db_xref="GeneID:887148" /translation="MDVAHLMAAAVLFDIDGVLVLSWRAIPGAAETVRQLTHRGIACA YLTNTTTRTRRQIAEALGAAGIPVAADDVITAGVLTAEYLHGAYPGARCFLVNNGDIT EDLPGIDVVLSTEIGPEDCPEAPDVVVLGSAGPQFDHRTLSRVYGWMLDGVPVVAMHR NMTWNTTDGLRIDTGMYLTGMEQACGKTATAIGKPAAEGFLAAADRVGVDPQQMVMIG DDLHNDVLAAQAVGMTGVLVRTGKFRQQTLDRWLAGASATRPHHVIDSVAGLPPLLGC" gene complement(1368832..1370295) /locus_tag="Rv1226c" /db_xref="GeneID:887144" CDS complement(1368832..1370295) /locus_tag="Rv1226c" /function="UNKNOWN" /note="Rv1226c, (MTCI61.09c), len: 487 aa. Probable transmembrane protein. Some similarity to AL049841|SCE9.01 Streptomyces coelicolor (436 aa), FASTA scores: opt: 203, E(): 1.2e-05, (29.8% identity in 346 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215742.1" /db_xref="GI:15608366" /db_xref="UniProtKB/TrEMBL:O33222" /db_xref="GeneID:887144" /translation="MTDRPHDWHRLSPRMLLVHPVHEMLRQLPVLIGSVVLGSATGNP VWPLAALGVTVVFGVLRWFFTTYRIDDENVSLRTGILSRRAVSVPRNRIRSVQTEARL LHRLLGLTVLRVGTGQEARGEAAFELDAVDSARVPRLRALLLAESLAPVEPTGRVLAR WQSSWLRYAPLSFSGLVMIGAVIGLGYQTGLAVRLPESGFARSAVDAAQRAGVVLVVA VTVLLVVGVSALLAVLFSWLTYGNLLLRRGGSGQEGVLHLRHGLLRVREHTYDMRRLR GATLREPLLVRLLRGARLDAVMTGVHGEGQSSMLLPPCPFETATAVLTDLIDNTDAAA GPLRRHGPAAARRRWTRALLVPTLAGVALIAAAPILGVPGWAWTLWAVLTAGCAGLAV DRVRSLGHRVADGWLVARAGSLQRRRDCIACTGIIGWTVRQTLFQRRAGVATLVAATV AGRKGYQVLDVPAELAWSVAGAASPWVADSVWLRHGS" gene complement(1370292..1370825) /locus_tag="Rv1227c" /db_xref="GeneID:885872" CDS complement(1370292..1370825) /locus_tag="Rv1227c" /function="UNKNOWN" /note="Rv1227c, (MTCI61.10c), len: 177 aa. Possible transmembrane protein, similar to P96615 hypothetical protein ydbS from Bacillus subtilis (159 aa), fasta scores: E(): 3.6e-07, (30.1% identity in 163 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215743.1" /db_xref="GI:15608367" /db_xref="GOA:O33223" /db_xref="UniProtKB/TrEMBL:O33223" /db_xref="GeneID:885872" /translation="MDHARNVPSATGPQRNHLALAEPAHRPSSQAPVMWALSASLGWI LPVIAQLVWWAVHPQPPWPHLAAAALTAVAMVVHIGVVPLWRYRVHRWEISPQAVFTR TGWLVQERRITPISRVQTVDTYRGPMDRLFGLANVTVTTASSAGAVHIEALDTDVADR VVAQLTDIAALRGEDAT" gene 1370920..1371477 /gene="lpqX" /locus_tag="Rv1228" /db_xref="GeneID:886013" CDS 1370920..1371477 /gene="lpqX" /locus_tag="Rv1228" /function="UNKNOWN" /note="Rv1228, (MTCI61.11), len: 185 aa. Probable lipoprotein LpqX. Contains possible signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqX" /protein_id="NP_215744.1" /db_xref="GI:15608368" /db_xref="UniProtKB/TrEMBL:O33224" /db_xref="GeneID:886013" /translation="MSRQWHWLAATLLLITTAACSRPGTEEPDCPTKITLPPGATPTT TLDPRCIVRATTTGTADGDAASRWTGTVRIAGFYASICNAVWDGNVSLAGKDELTGKA TLILVETSCPGKVVAGELVLKGNVGSDSLAITWAHPELPQRAFDLGAGQGTIRRSGDR AEGTFNSDMGGGTEFFLTWSLTMRN" misc_feature 1370947..1370979 /gene="lpqX" /locus_tag="Rv1228" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(1371777..1372949) /gene="mrp" /locus_tag="Rv1229c" /db_xref="GeneID:886067" CDS complement(1371777..1372949) /gene="mrp" /locus_tag="Rv1229c" /function="UNKNOWN: THOUGHT TO BE A ATP-BINDING PROTEIN." /experiment="experimental evidence, no additional details recorded" /note="Rv1229c, (MT1267, MTCI61.12c, MTV006.01c), len: 390 aa. Probable Mrp protein, similar to others e.g. MRP_ECOLI|P21590 mrp protein from Escherichia coli (379 aa), FASTA scores: E(): 0, (34.1% identity in 355 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop); and PS01215 MRP Prosite domain. BELONGS TO THE MRP/NBP35 FAMILY OF ATP-BINDING PROTEINS." /codon_start=1 /transl_table=11 /product="MRP family ATP-binding protein" /protein_id="NP_215745.1" /db_xref="GI:15608369" /db_xref="GOA:P65441" /db_xref="UniProtKB/Swiss-Prot:P65441" /db_xref="GeneID:886067" /translation="MPSRLHSAVMSGTRDGDLNAAIRTALGKVIDPELRRPITELGMV KSIDTGPDGSVHVEIYLTIAGCPKKSEITERVTRAVADVPGTSAVRVSLDVMSDEQRT ELRKQLRGDTREPVIPFAQPDSLTRVYAVASGKGGVGKSTVTVNLAAAMAVRGLSIGV LDADIHGHSIPRMMGTTDRPTQVESMILPPIAHQVKVISIAQFTQGNTPVVWRGPMLH RALQQFLADVYWGDLDVLLLDLPPGTGDVAISVAQLIPNAELLVVTTPQLAAAEVAER AGSIALQTRQRIVGVVENMSGLTLPDGTTMQVFGEGGGRLVAERLSRAVGADVPLLGQ IPLDPALVAAGDSGVPLVLSSPDSAIGKELHSIADGLSTRRRGLAGMSLGLDPTRR" misc_feature complement(1372527..1372550) /gene="mrp" /locus_tag="Rv1229c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1372962..1374197) /locus_tag="Rv1230c" /db_xref="GeneID:887134" CDS complement(1372962..1374197) /locus_tag="Rv1230c" /function="UNKNOWN" /note="Rv1230c, (MTV006.02c), len: 411 aa. Possible membrane protein with two hydrophobic stretches near N-terminus. Some similarity to Rv1022|MTCY10G2.27c|Z92539 probable lpqU protein Mycobacterium tuberculosis (243 aa), FASTA score: opt: 408, E(): 1e-11, (43.6% identity in 172 aa overlap). Similar to AL133423|SC4A7.37 hypothetical protein from Streptomyces coelicolor (421 aa), FASTA score: opt: 679, E(): 5.1e-23, (36.4% identity in 398 aa overlap). TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215746.1" /db_xref="GI:15608370" /db_xref="UniProtKB/TrEMBL:O86313" /db_xref="GeneID:887134" /translation="MHIGGRWGARPAVAAVRRGACRLTRAPAFGVAAIAPLVFASAVG SAAPVFPGRTAPVHAVITPVAAVAASGIDLSGPVVIAMKRPPTSFRVAVATISAPPPP MIVNSPGALGIPAMALSAYRNAELKMAAAAPGCGVSWNLLAGIGRIESMHANGGATDA RGTAIQPIYGPTLDGTLPGNEIIIQSSVGNRVTYARAMGPMQFLPGTWARYATDGDDD GVADPQNLFDSTLAAARYLCSGGLNLRDPAQVMAALLRYNNSMPYAQNVLGWAAGYAT GVFPVDLPPITGPPPPLGDAHLENPEGLGPGLPINVNGLTADGPMAHLPLIDLTPRQA ALNPPPMFPWMAPDPSAPMPGCTLICIGSHGPPVGAPPFPPTAPPPPFLPAAPPPPDP LAGPPGDAGLAPPAPAPAG" gene complement(1374322..1374864) /locus_tag="Rv1231c" /db_xref="GeneID:887140" CDS complement(1374322..1374864) /locus_tag="Rv1231c" /function="UNKNOWN" /note="Rv1231c, (MTV006.03c), len: 180 aa. Probable membrane protein, similar to others e.g. AL390975 Streptomyces coelicolor (198 aa). TBparse score is 0.885." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215747.1" /db_xref="GI:15608371" /db_xref="UniProtKB/TrEMBL:O86314" /db_xref="GeneID:887140" /translation="MSKPFAPRRLYTPRTSRTLAPRLDPEAVGRTTESIARFFGTGRY LLVQTLLVLTWIVLNLFAVGLRWDPYPFILLNLAFSTQASYAAPLILLAQNRQEKRDR AVFEEDRRRAAQTKADTEYNARELAALRLAIGEVPTRDYLRHELDSLRALLAELQPTD PDVAQPRVADEAEQHAKKSG" gene complement(1374861..1376168) /locus_tag="Rv1232c" /db_xref="GeneID:886014" CDS complement(1374861..1376168) /locus_tag="Rv1232c" /function="UNKNOWN" /note="Rv1232c, (MTV006.04c), len: 435 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. AB013374|AB013374_2 Bacillus halodurans C-125 mamX (449 aa), FASTA scores: opt: 381, E(): 1e-16, (29.9% identity in 251 aa overlap). Some similarity in N-terminus to U15180|MLU1518033 hypothetical Mycobacterium leprae protein u1756u (329 aa), FASTA scores: opt: 300, E(): 4.1e-12, (69.3% identity in 75 aa overlap). TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215748.1" /db_xref="GI:15608372" /db_xref="GOA:O86315" /db_xref="UniProtKB/TrEMBL:O86315" /db_xref="GeneID:886014" /translation="MGSVNRVYLARLSRMSVLGPLGESFGRVRDVVISISIVRQQPRV LGLVVDLATRRKIFIPILRVAAIEPHAVTLSTGNVSLHRFEQRPGEALALGQVLDTLV KVNDPALPELAGVDVVVTDLGVEQTRSRDWMVTRVAVRTQRRLRRRCPVHVVDWHNVA GLTPSALAMPGQDVAQLLDQFEGWKAVDVADAIRGLPPKRRHEVFKALHDKRLADVLQ ELPELDQAEVLSQLGTERAADVLEEMDPDDAADLLAVLNPTEAELLLTRMDPGDSGQV RRLLTHSPDTAGGLMTSDPVVLTPDTSIAEALARVRDPDLTPALASMVFVARPPTATP TGHYLGCVHLQRLLRDPPAELVGGVVDTDLLTLTPETPLAAVTRYFAAYNLVCGPVVD DENHLLGAVTVDDLLDHLLPHDWRVDMPELDPSGAPDRPGGPR" gene complement(1376230..1376826) /locus_tag="Rv1233c" /db_xref="GeneID:887131" CDS complement(1376230..1376826) /locus_tag="Rv1233c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1233c, (MTV006.05c), len: 198 aa. Conserved hypothetical membrane protein, N-terminus is highly proline rich, C-terminus has two hydrophobic stretches. Proline-rich N-terminus has some similarity to CBPA_DICDI calcium binding protein from Dictyostelium discoideum (467 aa), FASTA scores: E(): 4.8e-06, (35.5% identity in 183 aa overlap). Both sequences share multiple copies of a Tyr-Pro-Pro motif." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215749.1" /db_xref="GI:15608373" /db_xref="UniProtKB/TrEMBL:O86316" /db_xref="GeneID:887131" /translation="MTAPSGSSGESAHDAAGGPPPVGERPPEQPIADAPWAPPASSPM ANHPPPAYPPSGYPPAYQPGYPTGYPPPMPPGGYAPPGYPPPGTSSAGYGDIPYPPMP PPYGGSPGGYYPEPGYLDGYGPSQPGMNTMALVSLISALVGVLCCIGSIVGIVFGAIA INQIKQTREEGYGLAVAGIVIGIATLLVYMIAGIFAIP" gene 1376976..1377503 /locus_tag="Rv1234" /db_xref="GeneID:887137" CDS 1376976..1377503 /locus_tag="Rv1234" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1234, (MTV006.06), len: 175 aa. Possible transmembrane protein with two TM helices. TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215750.1" /db_xref="GI:15608374" /db_xref="UniProtKB/TrEMBL:O50451" /db_xref="GeneID:887137" /translation="MTSPFQPRQVPGSTPAAAGAGRRGVPALPTPPKGWPVGSYPTYA EAQRAVDYLSEQQFPVQQVTIVGVDLMQVERVTGRLTWPKVLGGGVLSGAWLGLFIGL VLGFFSPNPWSALVTGLVAGVFFGLITSAVPYAMARGTRDFSSTMQLVAGRYDVLCDP QNAEKARDLLARLAI" gene 1377524..1378930 /gene="lpqY" /locus_tag="Rv1235" /db_xref="GeneID:887145" CDS 1377524..1378930 /gene="lpqY" /locus_tag="Rv1235" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT)." /note="Rv1235, (MTV006.07), len: 468 aa. Probable lpqY, sugar-binding lipoprotein component of sugar transport system (see citation below), equivalent to MLU1518034 protein u1756v from Mycobacterium leprae (469 aa), FASTA scores: opt: 2442, E(): 0, (77.4% identity in 470 aa overlap). Also similar to P18815|MALE_ENTAE MALTOSE-BINDING PERIPLASMIC PROTEIN from Enterobacter aerogenes (396 aa), FASTA scores: opt: 193, E(): 2.3e-05, (24.2% identity in 297 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="sugar-binding lipoprotein LpqY" /protein_id="NP_215751.1" /db_xref="GI:15608375" /db_xref="GOA:Q7D8J9" /db_xref="UniProtKB/TrEMBL:Q7D8J9" /db_xref="GeneID:887145" /translation="MVMSRGRIPRLGAAVLVALTTAAAACGADSQGLVVSFYTPATDG ATFTAIAQRCNQQFGGRFTIAQVSLPRSPNEQRLQLARRLTGNDRTLDVMALDVVWTA EFAEAGWALPLSDDPAGLAENDAVADTLPGPLATAGWNHKLYAAPVTTNTQLLWYRPD LVNSPPTDWNAMIAEAARLHAAGEPSWIAVQANQGEGLVVWFNTLLVSAGGSVLSEDG RHVTLTDTPAHRAATVSALQILKSVATTPGADPSITRTEEGSARLAFEQGKAALEVNW PFVFASMLENAVKGGVPFLPLNRIPQLAGSINDIGTFTPSDEQFRIAYDASQQVFGFA PYPAVAPGQPAKVTIGGLNLAVAKTTRHRAEAFEAVRCLRDQHNQRYVSLEGGLPAVR ASLYSDPQFQAKYPMHAIIRQQLTDAAVRPATPVYQALSIRLAAVLSPITEIDPESTA DELAAQAQKAIDGMGLLP" misc_feature 1377569..1377601 /gene="lpqY" /locus_tag="Rv1235" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1378927..1379850 /gene="sugA" /locus_tag="Rv1236" /db_xref="GeneID:887124" CDS 1378927..1379850 /gene="sugA" /locus_tag="Rv1236" /function="INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1236, (MTV006.08), len: 307 aa. Probable sugA, sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to U15180|MLU1518035 protein malFM from Mycobacterium leprae (310 aa), FASTA scores: opt: 1566, E(): 0, (81.8% identity in 292 aa overlap). Also similar to numerous bacterial sugar transport system components. Also similar to Rv2316|MTCY3G12.18c from Mycobacterium tuberculosis (290 aa), FASTA scores: opt: 514, E(): 7.3e-27, (33.2% identity in 283 aa overlap). Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein ABC transporter SugA" /protein_id="NP_215752.1" /db_xref="GI:15608376" /db_xref="GOA:O50452" /db_xref="UniProtKB/TrEMBL:O50452" /db_xref="GeneID:887124" /translation="MTSVEQRTATAVFSRTGSRMAERRLAFMLVAPAAMLMVAVTAYP IGYALWLSLQRNNLATPNDTAFIGLGNYHTILIDRYWWTALAVTLAITAVSVTIEFVL GLALALVMHRTLIGKGLVRTAVLIPYGIVTVVASYSWYYAWTPGTGYLANLLPYDSAP LTQQIPSLGIVVIAEVWKTTPFMSLLLLAGLALVPEDLLRAAQVDGASAWRRLTKVIL PMIKPAIVVALLFRTLDAFRIFDNIYVLTGGSNNTGSVSILGYDNLFKGFNVGLGSAI SVLIFGCVAVIAFIFIKLFGAAAPGGEPSGR" misc_feature 1379497..1379583 /gene="sugA" /locus_tag="Rv1236" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene 1379855..1380679 /gene="sugB" /locus_tag="Rv1237" /db_xref="GeneID:887121" CDS 1379855..1380679 /gene="sugB" /locus_tag="Rv1237" /function="INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1237, (MTV006.09), len: 274 aa. Probable sugB, sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to U15180|MLU1518036 protein MalGM from Mycobacterium leprae (296 aa), FASTA scores: opt: 1571, E(): 0, (89.8% identity in 274 aa overlap). Also similar to numerous bacterial sugar transport protein. Related to Rv2834c|MTCY16B7.08 from Mycobacterium tuberculosis (275 aa), FASTA scores: opt: 370, E(): 2.4e-17, (26.8% identity in 269 aa overlap). TBparse score is 0.895." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein ABC transporter SugB" /protein_id="NP_215753.1" /db_xref="GI:15608377" /db_xref="GOA:O50453" /db_xref="UniProtKB/TrEMBL:O50453" /db_xref="GeneID:887121" /translation="MGARRATYWAVLDTLVVGYALLPVLWIFSLSLKPTSTVKDGKLI PSTVTFDNYRGIFRGDLFSSALINSIGIGLITTVIAVVLGAMAAYAVARLEFPGKRLL IGAALLITMFPSISLVTPLFNIERAIGLFDTWPGLILPYITFALPLAIYTLSAFFREI PWDLEKAAKMDGATPGQAFRKVIVPLAAPGLVTAAILVFIFAWNDLLLALSLTATKAA ITAPVAIANFTGSSQFEEPTGSIAAGAIVITIPIIVFVLIFQRRIVAGLTSGAVKG" gene 1380684..1381865 /gene="sugC" /locus_tag="Rv1238" /db_xref="GeneID:887104" CDS 1380684..1381865 /gene="sugC" /locus_tag="Rv1238" /function="INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1238, (MTV006.10), len: 393 aa. Probable sugC, sugar-transport ATP-binding protein ABC transporter (see citation below). Highly similar to U15180 protein ugpC from Mycobacterium leprae (392 aa), FASTA score: opt: 2007, E(): 0, (79.9% identity in 389 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="sugar-transport ATP-binding protein ABC transporter SugC" /protein_id="NP_215754.1" /db_xref="GI:15608378" /db_xref="GOA:O50454" /db_xref="UniProtKB/TrEMBL:O50454" /db_xref="GeneID:887104" /translation="MAEIVLDHVNKSYPDGHTAVRDLNLTIADGEFLILVGPSGCGKT TTLNMIAGLEDISSGELRIAGERVNEKAPKDRDIAMVFQSYALYPHMTVRQNIAFPLT LAKMRKADIAQKVSETAKILDLTNLLDRKPSQLSGGQRQRVAMGRAIVRHPKAFLMDE PLSNLDAKLRVQMRGEIAQLQRRLGTTTVYVTHDQTEAMTLGDRVVVMYGGIAQQIGT PEELYERPANLFVAGFIGSPAMNFFPARLTAIGLTLPFGEVTLAPEVQGVIAAHPKPE NVIVGVRPEHIQDAALIDAYQRIRALTFQVKVNLVESLGADKYLYFTTESPAVHSVQL DELAEVEGESALHENQFVARVPAESKVAIGQSVELAFDTARLAVFDADSGANLTIPHR A" misc_feature 1380792..1380815 /gene="sugC" /locus_tag="Rv1238" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1381086..1381130 /gene="sugC" /locus_tag="Rv1238" /note="PS00211 ABC transporters family signature" gene complement(1381942..1383042) /gene="corA" /locus_tag="Rv1239c" /db_xref="GeneID:887106" CDS complement(1381942..1383042) /gene="corA" /locus_tag="Rv1239c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF MAGNESIUM AND COBALT IONS ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1239c, (MTV006.11c), len: 366 aa. Possible corA, magnesium and cobalt transport transmembrane protein, highly similar to U15180 corA protein from Mycobacterium leprae (373 aa), FASTA scores: opt: 1985, E(): 0, (79.1% identity in 369 aa overlap). Also similar to various CorA proteins of Gram negative bacteria e.g. P27841|CORA_ECOLI|B3816|Z5333|ECS4746 Magnesium and cobalt transport protein from Escherichia coli strains K12 and O157:H7 (316 aa), FASTA scores: opt: 236, E(): 8e-08, (24.5% identity in 306 aa overlap); etc. SEEMS TO BELONG TO THE MIT FAMILY. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="magnesium/cobalt transporter CorA" /protein_id="NP_215755.1" /db_xref="GI:15608379" /db_xref="GOA:O50455" /db_xref="UniProtKB/TrEMBL:O50455" /db_xref="GeneID:887106" /translation="MFPGFDALPEVLRPVARPQPPNAHPVAQPPAQALVDCGVYVCGQ RLPGKYTYAAALREVREIELTGQEAFVWIGLHEPDENQMQDVADVFGLHPLAVEDAVH AHQRPKLERYDETLFLVLKTVNYVPHESVVLAREIVKTGEIMIFVGKDFVVTVRHGEH GGLSEVRKRMDADPEHLRLGPYAVMHAIADYVVDHYLEVTNLMETDIDSIEEVAFAPG RKLDIEPIYLLKREVVELRRCVNPLSTAFQRMQTESKDLISKEVRRYLRDVADHQTEA ADQIASYDDMLNSLVQAALARVGMQQNMDMRKISAWAGIIAVPTMIAGIYGMNFHFMP ELDSRWGYPTVIGGMVLICLFLYHVFRNRNWL" gene 1383213..1384202 /gene="mdh" /locus_tag="Rv1240" /db_xref="GeneID:887119" CDS 1383213..1384202 /gene="mdh" /locus_tag="Rv1240" /EC_number="1.1.1.37" /function="INVOLVED IN THE CONVERSION OF MALATE TO OXALOACETATE [CATALYTIC ACTIVITY: (S)-malate + NAD+ = oxaloacetate + NADH]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the oxidation of malate to oxaloacetate" /codon_start=1 /transl_table=11 /product="malate dehydrogenase" /protein_id="NP_215756.1" /db_xref="GI:15608380" /db_xref="GeneID:887119" /translation="MSASPLKVAVTGAAGQIGYSLLFRLASGSLLGPDRPIELRLLEI EPALQALEGVVMELDDCAFPLLSGVEIGSDPQKIFDGVSLALLVGARPRGAGMERSDL LEANGAIFTAQGKALNAVAADDVRVGVTGNPANTNALIAMTNAPDIPRERFSALTRLD HNRAISQLAAKTGAAVTDIKKMTIWGNHSATQYPDLFHAEVAGKNAAEVVNDQAWIED EFIPTVAKRGAAIIDARGASSAASAASATIDAARDWLLGTPADDWVSMAVVSDGSYGV PEGLISSFPVTTKGGNWTIVSGLEIDEFSRGRIDKSTAELADERSAVTELGLI" misc_feature 1383678..1383716 /gene="mdh" /locus_tag="Rv1240" /note="PS00068 Malate dehydrogenase active site signature" gene 1384278..1384538 /locus_tag="Rv1241" /db_xref="GeneID:887118" CDS 1384278..1384538 /locus_tag="Rv1241" /function="UNKNOWN" /note="Rv1241, (MTV006.13), len: 86 aa. Conserved hypothetical protein, member of family of 16 hypothetical Mycobacterium tuberculosis proteins including: Rv2871|Q10799|YS71_MYCTU HYPOTHETICAL 13.2 kDa PROTEIN CY2 (124 aa), FASTA scores: opt: 172, E(): 9.5e-06, (37.2% identity in 86 aa overlap); Rv2132, Rv3321c, etc. TBparse score is 0.875." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215757.1" /db_xref="GI:15608381" /db_xref="GeneID:887118" /translation="MRTTLTLDDDVVRLVEDAVHRERRPMKQVINDALRRALAPPVKR QEQYRLEPHESAVRSGLDLAGFNKLADELEDEALLDATRRAR" gene 1384535..1384966 /locus_tag="Rv1242" /db_xref="GeneID:887095" CDS 1384535..1384966 /locus_tag="Rv1242" /function="UNKNOWN" /note="Rv1242, (MTV006.14), len: 143 aa. Conserved hypothetical protein, member of family of 14 hypothetical Mycobacterium tuberculosis proteins including: Rv2872|Q10800|YS72_MYCTU (147 aa), FASTA scores: opt: 226, E(): 2.7e-09, (32.1% identity in 137 aa overlap); Rv0749, Rv0277c, Rv2530c, etc. TBparse score is 0.893." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215758.1" /db_xref="GI:15608382" /db_xref="GeneID:887095" /translation="MIIPDINLLLYAVITGFPQHRRAHAWWQDTVNGHTRIGLTYPAL FGFLRIATSARVLAAPLPTADAIAYVREWLSQPNVDLLTAGPRHLDIALGLLDKLGTA SHLTTDVQLAAYGIEYDAEIHSSDTDFARFADLKWTDPLRE" gene complement(1384989..1386677) /gene="PE_PGRS23" /locus_tag="Rv1243c" /db_xref="GeneID:887109" CDS complement(1384989..1386677) /gene="PE_PGRS23" /locus_tag="Rv1243c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1243c, (MTV006.15c), len: 562 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002). TBparse score is 0.875." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177798.1" /db_xref="GI:57116845" /db_xref="GeneID:887109" /translation="MEYLIAAQDVLVAAAADLEGIGSALAAANRAAEAPTTGLLAAGA DEVSAAIASLFSGNAQAYQALSAQAAAFHQQFVRALSSAAGSYAAAEAANASPMQAVL DVVNGPTQLLLGRPLIGDGANGGPGQNGGDGGLLYGNGGNGGSSSTPGQPGGRGGAAG LIGNGGAGGAGGPGANGGAGGNGGWLYGNGGLGGNGGAATQIGGNGGNGGHGGNAGLW GNGGAGGAGAAGAAGANGQNPVSHQVTHATDGADGTTGPDGNGTDAGSGSNAVNPGVG GGAGGIGGDGTNLGQTDVSGGAGGDGGDGANFASGGAGGNGGAAQSGFGDAVGGNGGA GGNGGAGGGGGLGGAGGSANVANAGNSIGGNGGAGGNGGIGAPGGAGGAGGNANQDNP PGGNSTGGNGGAGGDGGVGASADVGGAGGFGGSGGRGGLLLGTGGAGGDGGVGGDGGI GAQGGSGGNGGNGGIGADGMANQDGDGGDGGNGGDGGAGGAGGVGGNGGTGGAGGLFG QSGSPGSGAAGGLGGAGGNGGAGGGGGTGFNPGAPGDPGTQGATGANGQHGLNG" gene 1386857..1387717 /gene="lpqZ" /locus_tag="Rv1244" /db_xref="GeneID:887093" CDS 1386857..1387717 /gene="lpqZ" /locus_tag="Rv1244" /function="UNKNOWN" /note="Rv1244, (MTV006.16), len: 286 aa. Probable lipoprotein lpqZ, equivalent toU15180|MLU1518042 protein u1756x from Mycobacterium leprae (228 aa), FASTA scores: opt: 1039, E(): 0, (72.5% identity in 229 aa overlap). Similar to Mycobacterium tuberculosis hypothetical protein Rv3759c. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.878." /codon_start=1 /transl_table=11 /product="lipoprotein LpqZ" /protein_id="NP_215760.1" /db_xref="GI:15608384" /db_xref="GeneID:887093" /translation="MRITRILALLLAVLLAVSGVAGCSADTGDRHPELVVGSTPDSEA MLLAAIYVAALRSYGFAAHAETAADPVAKLDSGAFTVVPAFTGQMLQTLQPDASVRSD AQVYRAIVSALPEGIAAGDYTTAAEDKPALVVTQSTAKAWGGGDLSELPSHCRGLLVG RVAGAHTPAAVGPCRLPAPREFRNDATMFAALRAGQLVAAWTTTADPDIPADLIMLTD GKPALIRAENIVPLYRRNALTERQLLAVNEVAGVLDTTALIGMRRQVAAGADPAAVAA GWLAEHPLGR" misc_feature 1386893..1386925 /gene="lpqZ" /locus_tag="Rv1244" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(1387798..1388628) /locus_tag="Rv1245c" /db_xref="GeneID:887091" CDS complement(1387798..1388628) /locus_tag="Rv1245c" /function="UNKNOWN; SUPPOSED INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1245c, (MTV006.17c), len: 276 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), equivalent to NP_301801.1|NC_002677 short chain alcohol dehydrogenase from Mycobacterium leprae (277 aa). Also highly similar to various dehydrogenases and oxidoreductases e.g. NP_250228.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (295 aa); NP_421969.1|NC_002696 short chain dehydrogenase family protein from Caulobacter crescentus (278 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv3085|MTV013.06 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE (276 aa), FASTA scores: opt: 368, E(): 1.2e-16, (35.3% identity in 224 aa overlap); Rv3057c|MTCY22D7.24 PUTATIVE SHORT CHAIN ALCOHOL DEHYDROGENASE/REDUCTASE (287 aa), FASTA scores: opt: 471, E(): 1.3e-21, (32.4% identity in 281 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="short-chain type dehydrogenase/reductase" /protein_id="NP_215761.1" /db_xref="GI:15608385" /db_xref="GeneID:887091" /translation="MEGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLA DTEHRLKAISTPVKTDRLDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIEVS QFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVINISSVFGLFSAPGQAAYNSAKF AVRGFTEALRQEMALAGHPVKVTTVHPGGVKTAIARNATAAEGLDQAELAETFDKRVA HLSPQRAAQIILTGVAKNKARVLVGVDAKVLDLVVRLTGSGYQRIFPIITGRLIPRPR" misc_feature complement(1388119..1388205) /locus_tag="Rv1245c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(1388685..1388978) /locus_tag="Rv1246c" /db_xref="GeneID:887099" CDS complement(1388685..1388978) /locus_tag="Rv1246c" /function="UNKNOWN" /note="Rv1246c, (MTV006.18c), len: 97 aa. Conserved hypothetical protein, highly similar to Rv2866|MTV003.12 hypothetical Mycobacterium tuberculosis protein (87 aa), FASTA scores: opt: 290, E(): 3.9e-24, (54.1% identity in 85 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215762.1" /db_xref="GI:15608386" /db_xref="GeneID:887099" /translation="MSDDHPYHVAITATAARDLQRLPEKIAAACVEFVFGPLLNNPHR LGKPLRNDLEGLHSARRGDYRVVYAIDDGHHRVEIIHIARRSASYRMNPCRPR" gene complement(1388975..1389244) /locus_tag="Rv1247c" /db_xref="GeneID:887086" CDS complement(1388975..1389244) /locus_tag="Rv1247c" /function="UNKNOWN" /note="Rv1247c, (MTV006.19c), len: 89 aa. Conserved hypothetical protein, some similarity to hypothetical proteins including Mycobacterium tuberculosis proteins Rv2865|MTV003.11 (93 aa), FASTA scores: opt: 249, E(): 5.4e-13, (44.2% identity in 86 aa overlap); Rv0268|Z86089|P95225 (169 aa) opt: 125, E(): 0.0089, (41.8% identity in 55 aa overlap); etc. and Escherichia coli AE000293|ECAE0002933 (92 aa), FASTA scores: opt: 127, E(): 0.0038, (29.3% identity in 82 aa overlap). TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215763.1" /db_xref="GI:15608387" /db_xref="GeneID:887086" /translation="MAVVPLGEVRNRLSEYVAEVELTHERITITRHGHPAAVLISADD LASIEETLEVLRTPGASEAIREGLADVAAGRFVSNDEIRNRYTAR" gene complement(1389357..1393052) /gene="kgd" /locus_tag="Rv1248c" /db_xref="GeneID:887084" CDS complement(1389357..1393052) /gene="kgd" /locus_tag="Rv1248c" /EC_number="1.2.4.2" /function="THE 2-OXOGLUTARATE DEHYDROGENASE COMPLEX CATALYZES THE OVERALL CONVERSION OF 2-OXOGLUTARATE TO SUCCINYL-CoA & CO(2). IT CONTAINS MULTIPLE COPIES OF 3 ENZYMATIC COMPONENTS:2-OXOGLUTARATE DEHYDROGENASE (E1), DIHYDROLIPOAMIDE SUCCINYLTRANSFERASE (E2) AND LIPOAMIDE DEHYDROGENASE (E3) [CATALYTIC ACTIVITY: 2-oxoglutarate + lipoamide = S-succinyldihydrolipoamide + CO2]" /note="kgd; produces succinic semialdehyde; part of alternative pathway from alpha-ketoglutarate to succinate; essential for normal growth" /codon_start=1 /transl_table=11 /product="alpha-ketoglutarate decarboxylase" /protein_id="NP_215764.2" /db_xref="GI:161352467" /db_xref="GeneID:887084" /translation="MANISSPFGQNEWLVEEMYRKFRDDPSSVDPSWHEFLVDYSPEP TSQPAAEPTRVTSPLVAERAAAAAPQAPPKPADTAAAGNGVVAALAAKTAVPPPAEGD EVAVLRGAAAAVVKNMSASLEVPTATSVRAVPAKLLIDNRIVINNQLKRTRGGKISFT HLLGYALVQAVKKFPNMNRHYTEVDGKPTAVTPAHTNLGLAIDLQGKDGKRSLVVAGI KRCETMRFAQFVTAYEDIVRRARDGKLTTEDFAGVTISLTNPGTIGTVHSVPRLMPGQ GAIIGVGAMEYPAEFQGASEERIAELGIGKLITLTSTYDHRIIQGAESGDFLRTIHEL LLSDGFWDEVFRELSIPYLPVRWSTDNPDSIVDKNARVMNLIAAYRNRGHLMADTDPL RLDKARFRSHPDLEVLTHGLTLWDLDRVFKVDGFAGAQYKKLRDVLGLLRDAYCRHIG VEYAHILDPEQKEWLEQRVETKHVKPTVAQQKYILSKLNAAEAFETFLQTKYVGQKRF SLEGAESVIPMMDAAIDQCAEHGLDEVVIGMPHRGRLNVLANIVGKPYSQIFTEFEGN LNPSQAHGSGDVKYHLGATGLYLQMFGDNDIQVSLTANPSHLEAVDPVLEGLVRAKQD LLDHGSIDSDGQRAFSVVPLMLHGDAAFAGQGVVAETLNLANLPGYRVGGTIHIIVNN QIGFTTAPEYSRSSEYCTDVAKMIGAPIFHVNGDDPEACVWVARLAVDFRQRFKKDVV IDMLCYRRRGHNEGDDPSMTNPYVYDVVDTKRGARKSYTEALIGRGDISMKEAEDALR DYQGQLERVFNEVRELEKHGVQPSESVESDQMIPAGLATAVDKSLLARIGDAFLALPN GFTAHPRVQPVLEKRREMAYEGKIDWAFGELLALGSLVAEGKLVRLSGQDSRRGTFSQ RHSVLIDRHTGEEFTPLQLLATNSDGSPTGGKFLVYDSPLSEYAAVGFEYGYTVGNPD AVVLWEAQFGDFVNGAQSIIDEFISSGEAKWGQLSNVVLLLPHGHEGQGPDHTSARIE RFLQLWAEGSMTIAMPSTPSNYFHLLRRHALDGIQRPLIVFTPKSMLRHKAAVSEIKD FTEIKFRSVLEEPTYEDGIGDRNKVSRILLTSGKLYYELAARKAKDNRNDLAIVRLEQ LAPLPRRRLRETLDRYENVKEFFWVQEEPANQGAWPRFGLELPELLPDKLAGIKRISR RAMSAPSSGSSKVHAVEQQEILDEAFG" gene complement(1393194..1393982) /locus_tag="Rv1249c" /db_xref="GeneID:887077" CDS complement(1393194..1393982) /locus_tag="Rv1249c" /function="UNKNOWN" /note="Rv1249c, (MTV006.21c), len: 262 aa. Possible membrane protein. Start uncertain. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215765.1" /db_xref="GI:15608389" /db_xref="GeneID:887077" /translation="MSARRIRSWKRFDNRSANAAEPDPQLAGTGGRPKVSTRALAQVI ERSSRIQGPAAQAYVARLRRAHPGASPAKIVAKLEKRFLSVVTASGAAVGAAATLPGI GTLAAWFAAAGEVVVFLEATALFVLALASVHAIPLDHRERRRALVLAVLVGDNTTAVA DLLGPGRTSGGWVSETMASLPLPAISSLNSRMLKYVVKRFALKRGALMFGKLVPMGIG AIIGAIGNRLVGKKLVRNARSAFGTPPARWPVTLHVLPTVRDAS" gene 1394179..1395918 /locus_tag="Rv1250" /db_xref="GeneID:887073" CDS 1394179..1395918 /locus_tag="Rv1250" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF DRUG ACROSS THE MEMBRANE (EXPORT): DRUG RESISTANCE BY AN EXPORT MECHANISM (CONFERES RESISTANCE TO TOXIC COMPOUNDS BY REMOVING THEM FOR THE CELLS). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv1250, (MTV006.22), len: 579 aa. Probable drug-transport integral membrane protein, member of major facilitator superfamily (MFS), highly similar to several including P39886|TCMA_STRGA TETRACENOMYCIN C RESISTANCE PROTEIN from Streptomyces glaucescens (538 aa), FASTA scores: opt: 847, E(): 0, (32.9% identity in 517 aa overlap); etc. Also similar to MTCY20B11.14c|Rv3239C from Mycobacterium tuberculosis (1048 aa), FASTA scores: opt: 629, E(): 6.7e-13, (31.9% identity in 423 aa overlap). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="drug-transport integral membrane protein" /protein_id="NP_215766.1" /db_xref="GI:15608390" /db_xref="GeneID:887073" /translation="MTTAIRRAAGSSYFRNPWPALWAMMVGFFMIMLDSTVVAIANPT IMAQLRIGYATVVWVTSAYLLAYAVPMLVAGRLGDRFGPKNLYLIGLGVFTVASLGCG LSSGAGMLIAARVVQGVGAGLLTPQTLSTITRIFPAHRRGVALGAWGTVASVASLVGP LAGGALVDSMGWEWIFFVNVPVGVIGLILAAYLIPALPHHPHRFDWFGVGLSGAGMFL IVFGLQQGQSANWQPWIWAVIVGGIGFMSLFVYWQARNAREPLIPLEVFNDRNFSLSN LRIAIIAFAGTGMMLPVTFYAQAVCGLSPTHTAVLFAPTAIVGGVLAPFVGMIIDRSH PLCVLGFGFSVLAIAMTWLLCEMAPGTPIWRLVLPFIALGVAGAFVWSPLTVTATRNL RPHLAGASSGVFNAVRQLGAVLGSASMAAFMTSRIAAEMPGGVDALTGPAGQDATVLQ LPEFVREPFAAAMSQSMLLPAFVALFGIVAALFLVDFTGAAVAKEPLPESDGDADDDD YVEYILRREPEEDCDTQPLRASRPAAAAASRSGAGGPLAVSWSTSAQGMPPGPPGRRA WQADTESTAPSAL" gene complement(1395821..1399240) /locus_tag="Rv1251c" /db_xref="GeneID:887080" CDS complement(1395821..1399240) /locus_tag="Rv1251c" /function="UNKNOWN" /note="Rv1251c, (MTV006.23c), len: 1139 aa. Conserved hypothetical protein, showing some similarity in C-terminal region with other proteins from eukaryotes and bacteria e.g. NP_142121.1 hypothetical protein from Pyrococcus horikoshii (1188 aa); and some similarity to GTP-binding proteins e.g. P23249|MV10_MOUSE PUTATIVE GTP-BINDING PROTEIN (1004 aa), FASTA scores: opt: 228, E(): 1.7e-06, (27.7% identity in 560 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215767.1" /db_xref="GI:15608391" /db_xref="GeneID:887080" /translation="MFVTGDSIVYSASDLAAAARCQYALLREFDAKLGRGPAVAVDDE LMARAAVLGSAHEGRRLDQLRHEFGDAVAIIGRPAYTPAGLAAAADATRRAIANHAPV VYQAAMFDGRFVGFADFLIRDGHRYRVADTKLARSPTVTALLQLAAYADALVHSGVPV AADAELELGDGTIVRYRVGELIPVYRSQRALLQRLLDGHYTAGTAVRWDDERVQACFR CPQCTERLRASDDLLLVGGMRVRQRDKLLEAGITTIAELADHTAPVPGLTTNALGKLT AQAKLQIRQRDTGAPQFEIVDPRPLTLLPEPNPGDLFFDFEGDPLWTADGKQWGLEYL FGVLEAGRAGVFRPLWAHDRTAERQALTDFLAIVARRRRRHPNMHIYHYAPYEKTALL RLVGRYGIGEDDVDDLLRNGVLVDLYPLVRKSIRVGTDSFSLKALEPLYLGTQPRSGD VTTAADSINSYARYCELRAAGRIDEAATVLKEIEGYNHYDCRSTRALRDWLLMRAWEA GVTPIGAQPVPDADPIDDGDSLASVLSKFTGDAAAGERTPEQTAVALLAAARGYHRRE DKPFWWAHFDRLNYPVDEWSDSTDVFLASEASVTVDWHMPPRARKPQRRVRLTGELAR GDLNGNVFALYEPPAPPGMTDNPDRRAAGPAAVVETDDPTVPTEVVIVERTGSDGNTF QQLPFALAPGPPVPTTALRESIESTAAAVASGSPQLPSTALMDVLLRRPPRTRSGAAL PRSSDPVTDIAAAALDLDSSYLAVHGPPGTGKTYTAARVIAELVTEHAWRIGVVAQSH ATVENLLEGVISAGLDPGQVAKKPHDHTAGRWQSIDGSQYTEFIRDTAGCVIGGTAWD FANGNRVPKASLDLLVIDEAGQFCLANTIAVAPAATNLLLLGDPQQLPQVSQGTHPEP VDTSALSWLVDGQHTLPDERGYFLDRSYRMHPAVCAAVSALSYEGRLCSHTERTAVRR LDGYPPGVHTRGVHHKGNSIESPEEAEAILAELRQLLGSPWTDEHGTRPLAASDVLVL APYNAQVALVRRRLASAGLGGADGVRVGTVDKFQGGQAPVVFISMTASSADDVPRGIS FLLNRNRLNVAVSRAQYAAVIVRSELLTQYLPATPDGLVDLGAFLGLTSTS" misc_feature complement(1396922..1396945) /locus_tag="Rv1251c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1399296..1399904) /gene="lprE" /locus_tag="Rv1252c" /db_xref="GeneID:887064" CDS complement(1399296..1399904) /gene="lprE" /locus_tag="Rv1252c" /function="UNKNOWN" /note="Rv1252c, (MTCY50.30), len: 202 aa. Probable lipoprotein lprE, some similarity to Mycobacterium tuberculosis protein Rv3483c|MTCY13E12.36C (220 aa), FASTA scores: E(): 7e-05, (29.5% identity in 200 aa overlap). Contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013)." /codon_start=1 /transl_table=11 /product="lipoprotein LprE" /protein_id="NP_215768.1" /db_xref="GI:15608392" /db_xref="GeneID:887064" /translation="MPGVWSPPCPTTPRVGVVAALVAATLTGCGSGDSTVAKTPEATP SLSTAHPAPPSSEPSPPSATAAPPSNHSAAPVDPCAVNLASPTIAKVVSELPRDPRSE QPWNPEPLAGNYNECAQLSAVVIKANTNAGNPTTRAVMFHLGKYIPQGVPDTYGFTGI DTSQCTGDTVALTYASGIGLNNVVKFRWNGGGVELIGNTTGG" gene 1399970..1401661 /gene="deaD" /locus_tag="Rv1253" /db_xref="GeneID:887069" CDS 1399970..1401661 /gene="deaD" /locus_tag="Rv1253" /function="HAS A HELIX-DESTABILIZING ACTIVITY" /note="Rv1253, (MTCY50.29c), len: 563 aa. Probable Dead, Cold-shock DEAD-box protein A homolog, similar to many e.g. DEAD_ECOLI|P23304 Escherichia coli (646 aa), FASTA scores: opt: 1490, E(): 0, (46.7% identity in 578 aa overlap); similar to Mycobacterium tuberculosis Rv3211. Contains PS00017 ATP/GTP-binding site motif A, PS00039 DEAD-box subfamily ATP-dependent helicases signature. BELONGS TO THE DEAD BOX FAMILY HELICASE." /codon_start=1 /transl_table=11 /product="cold-shock DEAD-box protein A" /protein_id="NP_215769.1" /db_xref="GI:15608393" /db_xref="GeneID:887069" /translation="MAFPEYSPAASAATFADLQIHPRVLRAIGDVGYESPTAIQAATI PALMAGSDVVGLAQTGTGKTAAFAIPMLSKIDITSKVPQALVLVPTRELALQVAEAFG RYGAYLSQLNVLPIYGGSSYAVQLAGLRRGAQVVVGTPGRMIDHLERATLDLSRVDFL VLDEADEMLTMGFADDVERILSETPEYKQVALFSATMPPAIRKLSAKYLHDPFEVTCK AKTAVAENISQSYIQVARKMDALTRVLEVEPFEAMIVFVRTKQATEEIAEKLRARGFS AAAISGDVPQAQRERTITALRDGDIDILVATDVAARGLDVERISHVLNYDIPHDTESY VHRIGRTGRAGRSGAALIFVSPRELHLLKAIEKATRQTLTEAQLPTVEDVNTQRVAKF ADSITNALGGPGIELFRRLVEEYEREHDVPMADIAAALAVQCRGGEAFLMAPDPPLSR RNRDQRRDRPQRPKRRPDLTTYRVAVGKRHKIGPGAIVGAIANEGGLHRSDFGQIRIG PDFSLVELPAKLPRATLKKLAQTRISGVLIDLRPYRPPDAARRHNGGKPRRKHVG" misc_feature 1400138..1400161 /gene="deaD" /locus_tag="Rv1253" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1400450..1400476 /gene="deaD" /locus_tag="Rv1253" /note="PS00039 DEAD-box subfamily ATP-dependent helicases signature" gene 1401658..1402809 /locus_tag="Rv1254" /db_xref="GeneID:887076" CDS 1401658..1402809 /locus_tag="Rv1254" /EC_number="2.3.1.-" /function="CATALYZES THE ACYLATION OF THE MYCAMINOSE SUGAR DURING MIDECAMYCIN BIOSYNTHESIS" /note="Rv1254, (MTCY50.28c), len: 383 aa. Probable Acyltransferase (EC 2.3.1.-), similar to G927228 midecamycin 4-0-propionyl transferase (fragment) (388 aa), FASTA scores, opt: 305, E(): 5.6e-14, (28.4% identity in 377 aa overlap). Also similar to other Mycobacterium tuberculosis acyltransferases e.g. Rv0111, Rv0228, etc. Contains PS00881 Protein splicing signature." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="NP_215770.1" /db_xref="GI:15608394" /db_xref="GeneID:887076" /translation="MTLPKERAAQGGLERIAHVDRVASLTGIRAVAALLVVGTHAAYT TGKYTHGYWGLMSSRMEIGVPIFFVLSGFLLFRPWVKSAATGGPPPSLSRYAWHRVRR IMPAYTVTVLLAYLVYHFRTAGPNPGHTWVGLFRNLTLTQIYTDGYLGAFLHQGLTQM WSLAVEVAFYLALPALAYLLLVLVCRRRWQPRLLLATMAGLTMISPAWLILVHNTHWM PDGARLWLPTYLAWFVGGMMLAVLAAMGVRCYAFVAIPLAVICYFIVSTPIAGAPTTS PTALAEALVKTAFYAVIAVLAVAPLALGDQGWYAQLLASRPMVFLGEISYEIFLIHLV TMEIAMVDVLGYRVYTSSMVNLCLVTLVLTIPLAWLLHRFTRVQGDRPS" misc_feature 1402285..1402302 /locus_tag="Rv1254" /note="PS00881 Protein splicing signature" gene complement(1402778..1403386) /locus_tag="Rv1255c" /db_xref="GeneID:887068" CDS complement(1402778..1403386) /locus_tag="Rv1255c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1255c, (MTCY50.27), len: 202 aa. Possible regulatory protein, similar to others e.g. ACRR_ECOLI|P34000 potential acrab operon repressor from E. coli (215 aa), FASTA scores: opt: 128, E(): 0.25, (42.1% identity in 57 aa overlap). Helix turn helix motif present at aa 36-57 (+5.48 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215771.1" /db_xref="GI:15608395" /db_xref="GeneID:887068" /translation="MAGTDWLSARRTELAADRILDAAERLFTQRDPASIGMNEIAKAA GCSRATLYRYFDSREALRTAYVHRETRRLGREIMVKIADVVEPAERLLVSITTTLRMV RDNPALAAWFTTTRPPIGGEMAGRSEVIAALAAAFLNSLGPDDPTTVERRARWVVRML TSLLMFPGRDEADERAMIAEFVVPIVTPASAAARKAGHPGPE" gene complement(1403386..1404603) /gene="cyp130" /locus_tag="Rv1256c" /db_xref="GeneID:887059" CDS complement(1403386..1404603) /gene="cyp130" /locus_tag="Rv1256c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv1256c, (MT1295, MTCY50.26), len: 405 aa. Probable cyp130, cytochrome P450 (EC 1.14.-.-), similar to other cytochromes P-450 e.g. S51594 cytochrome P450 mycG from Micromonospora griseorubida (397 aa); T36526 probable cytochrome P450 hydroxylase from Streptomyces coelicolor (411 aa); CPXK_SACER|P33271|107B1 CYTOCHROME P450 from Saccharopolyspora erythraea (405 aa), FASTA scores: opt: 639, E(): 2.7e-33, (33.2% identity in 391 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0766c|MTCY369.11c CYTOCHROME P450 (402 aa); etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 130 CYP130" /protein_id="NP_215772.1" /db_xref="GI:15608396" /db_xref="GeneID:887059" /translation="MTSVMSHEFQLATAETWPNPWPMYRALRDHDPVHHVVPPQRPEY DYYVLSRHADVWSAARDHQTFSSAQGLTVNYGELEMIGLHDTPPMVMQDPPVHTEFRK LVSRGFTPRQVETVEPTVRKFVVERLEKLRANGGGDIVTELFKPLPSMVVAHYLGVPE EDWTQFDGWTQAIVAANAVDGATTGALDAVGSMMAYFTGLIERRRTEPADDAISHLVA AGVGADGDTAGTLSILAFTFTMVTGGNDTVTGMLGGSMPLLHRRPDQRRLLLDDPEGI PDAVEELLRLTSPVQGLARTTTRDVTIGDTTIPAGRRVLLLYGSANRDERQYGPDAAE LDVTRCPRNILTFSHGAHHCLGAAAARMQCRVALTELLARCPDFEVAESRIVWSGGSY VRRPLSVPFRVTS" misc_feature complement(1403536..1403565) /gene="cyp130" /locus_tag="Rv1256c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(1404717..1406084) /locus_tag="Rv1257c" /db_xref="GeneID:887063" CDS complement(1404717..1406084) /locus_tag="Rv1257c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1257c, (MTCY50.25), len: 455 aa. Probable oxidoreductase (EC 1.-.-.-), similar to e.g. GLCD_ECOLI|P52075 glycolate oxidase subunit glcd (499 aa), FASTA scores: E(): 0, (38.9% identity in 458 aa overlap). Similar to Mycobacterium tuberculosis oxidoreductases e.g. Rv3107c" /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215773.1" /db_xref="GI:15608397" /db_xref="GeneID:887063" /translation="MNTDVLAGLMAELPEGMVVTDPAVTDGYRQDRAFDPSAGKPLAI IRPRRTEEVQTVLRWASANQVPVVTRGAGSGLSGGATALDGGIVLSTEKMRDITVDPV TRTAVCQPGLYNAEVKEAAAEHGLWYPPDPSSFEICSIGGNIATNAGGLCCVKYGVTG DYVLGMQVVLANGTAVRLGGPRLKDVAGLSLTKLFVGSEGTLGVITEVTLRLLPAQNA SSIVVASFGSVQAAVDAVLGVTGRLRPAMLEFMDSVAINAVEDTLRMDLDRDAAAMLV AGSDERGRAATEDAAVMAAVFAENGAIDVFSTDDPDEGEAFIAARRFAIPAVESKGAL LLEDVGVPLPALGELVTGIARIAEERNLMISVIAHAGDGNTHPLLVYDPADAAMLERA HLAYGEIMDLAVGLGGTITGEHGVGRLKRPWLAGYLGPDVLALNQRIKQALDPQGILN PGSAI" gene complement(1406081..1407340) /locus_tag="Rv1258c" /db_xref="GeneID:887056" CDS complement(1406081..1407340) /locus_tag="Rv1258c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY MACROLIDE) ACROSS THE MEMBRANE (EXPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE UNDETERMINATED SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1258c, MTCY50.24, len: 419 aa. Probable conserved integral membrane transport (efflux) protein, possibly member of major facilitator superfamily (MFS), highly similar to O32859|TAP PROTEIN multidrug-resistance efflux pump from Mycobacterium fortuitum (409 aa), FASTA scores: E(): 0, (68.4% identity in 408 aa overlap). Contains PS00216 Sugar transport proteins signature 1." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_215774.1" /db_xref="GI:15608398" /db_xref="GeneID:887056" /translation="MRNSNRGPAFLILFATLMAAAGDGVSIVAFPWLVLQREGSAGQA SIVASATMLPLLFATLVAGTAVDYFGRRRVSMVADALSGAAVAGVPLVAWGYGGDAVN VLVLAVLAALAAAFGPAGMTARDSMLPEAAARAGWSLDRINGAYEAILNLAFIVGPAI GGLMIATVGGITTMWITATAFGLSILAIAALQLEGAGKPHHTSRPQGLVSGIAEGLRF VWNLRVLRTLGMIDLTVTALYLPMESVLFPKYFTDHQQPVQLGWALMAIAGGGLVGAL GYAVLAIRVPRRVTMSTAVLTLGLASMVIAFLPPLPVIMVLCAVVGLVYGPIQPIYNY VIQTRAAQHLRGRVVGVMTSLAYAAGPLGLLLAGPLTDAAGLHATFLALALPIVCTGL VAIRLPALRELDLAPQADIDRPVGSAQ" misc_feature complement(1407107..1407157) /locus_tag="Rv1258c" /note="PS00216 Sugar transport proteins signature 1" gene 1407339..1408238 /locus_tag="Rv1259" /db_xref="GeneID:887047" CDS 1407339..1408238 /locus_tag="Rv1259" /function="UNKNOWN" /note="Rv1259, (MTCY50.23c), len: 299 aa. Conserved hypothetical protein. Similar to AL109732|SC7H2.04 hypothetical protein from Streptomyces coelicolor (237 aa), FASTA scores: opt: 870, E(): 0, (57.1% identity in 231 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215775.1" /db_xref="GI:15608399" /db_xref="GeneID:887047" /translation="MNIAAESSAKPVWGPPNFCAAAARMQDVRVLMHPKTGRAFRSPV EPGSGWPGDPATPQTPVAADAAQVSALAGGAGSICELNALISVCRACPRLVSWREEVA VVKRRAFADQPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTGDRSGDQLYAA LHRAGLVNSPVSVDAADGLRANRIRITAPVRCAPPGNSPTPAERLTCSPWLNAEWRLV SDHIRAIVALGGFAWQVALRLAGASGTPKPRFGHGVVTELGAGVRLLGCYHPSQQNMF TGRLTPTMLDDIFREAKKLAGIE" gene 1408240..1409358 /locus_tag="Rv1260" /db_xref="GeneID:887044" CDS 1408240..1409358 /locus_tag="Rv1260" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv1260, (MTCY50.22c), len: 372 aa. Probable oxidoreductase (EC 1.-.-.-), highly similar to E1245747|AL021411 putative oxidoreductase SC7H1.18 from Streptomyces coelicolor (397 aa), FASTA scores: E(): 1.4e-29, (45.9% identity in 355 aa overlap); also some similarity to G912582 FAD binding protein homologue from Pseudomonas aeruginosa (286 aa), FASTA scores: opt: 245, E(): 2e-09, (27.5% identity in 251 aa overlap); PCPB_FLASP|P42535 pentachlorophenol 4-monooxygenase (537 aa), FASTA scores: opt: 219, E(): 1.7e-07, (23.3% identity in 360 aa overlap); TETX_BACFR|Q01911 tetracycline resistance protein (388 aa), FASTA scores: opt: 183, E(): 3e-05, (22.8% identity in 373 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv0575c and Rv1751." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215776.1" /db_xref="GI:15608400" /db_xref="GeneID:887044" /translation="MKTVVVSGASVAGTAAAYWLGRHGYSVTMVERHPGLRPGGQAID VRGPALDVLERMGLLAAAQEHKTRIRGASFVDRDGNELFRDTESTPTGGPVNSPDIEL LRDDLVELLYGATQPSVEYLFDDSISTLQDDGDSVRVTFERAAAREFDLVIGADGLHS NVRRLVFGPEEQFVKRLGTHAAIFTVPNFLELDYWQTWHYGDSTMAGVYSARNNTEAR AALAFMDTELRIDYRDTEAQFAELQRRMAEDGWVRAQLLHYMRSAPDFYFDEMSQILM DRWSRGRVALVGDAGYCCSPLSGQGTSVALLGAYILAGELKAAGDDYQLGFANYHAEF HGFVERNQWLVSDNIPGGAPIPQEEFERIVHSITIKDY" gene complement(1409484..1409933) /locus_tag="Rv1261c" /db_xref="GeneID:887055" CDS complement(1409484..1409933) /locus_tag="Rv1261c" /function="UNKNOWN" /note="Rv1261c, (MTCY50.21), len: 149 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1558|MTCY48.07c (39.2% identity in 125 aa overlap); Rv3547 and Rv3178." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215777.1" /db_xref="GI:15608401" /db_xref="GeneID:887055" /translation="MDISRWLERHVGVQLLRLHDAIYRGTNGRIGHRIPGAPPSLLLH TTGAKTSQPRTTSLTYARDGDAYLIVASKGGDPRSPGWYHNLKANPDVEINVGPKRFG VTAKPVQPHDPDYARLWQIVNENNANRYTNYQSRTSRPIPVVVLTRR" gene complement(1409938..1410372) /locus_tag="Rv1262c" /db_xref="GeneID:887036" CDS complement(1409938..1410372) /locus_tag="Rv1262c" /function="UNKNOWN" /note="Rv1262c, (MTCY50.20), len: 144 aa. Hypothetical HIT-like protein, similar to Q04344|HIT_YEAST hit1 protein (orf u) (144 aa), FASTA scores: opt: 306, E(): 3e-14, (35.9 % identity in 142 aa overlap); also similar to YHIT_MYCGE|P47378 hypothetical 15.6 kDa protein (141 aa), FASTA scores: opt: 250, E(): 1.6e-10, (35.5% identity in 107 aa overlap); and YHIT_MYCLE|P49774 hypothetical 17.0 kDa protein hit-like (155 aa), FASTA scores: opt: 196, E(): 7e-07, (30.6% identity in 144 aa overlap). Similar to other proteins from Mycobacterium tuberculosis e.g. Rv2613c, Rv0759c. Contains PS00892 HIT family signature. BELONGS TO THE HIT FAMILY." /codon_start=1 /transl_table=11 /product="HIT-like protein" /protein_id="NP_215778.1" /db_xref="GI:15608402" /db_xref="GeneID:887036" /translation="MPCVFCAIIAGEAPAIRIYEDGGYLAILDIRPFTRGHTLVLPKR HTVDLTDTPPEALADMVAIGQRIARAARATKLADATHIAINDGRAAFQTVFHVHLHVL PPRNGDKLSVAKGMMLRRDPDREATGRILREALAQQDAAAQD" misc_feature complement(1410073..1410120) /locus_tag="Rv1262c" /note="PS00892 HIT family signature" gene 1410431..1411819 /gene="amiB2" /locus_tag="Rv1263" /db_xref="GeneID:887041" CDS 1410431..1411819 /gene="amiB2" /locus_tag="Rv1263" /EC_number="3.5.1.4" /function="INVOLVED IN CELLULAR METABOLISM, ACTIVE ON 2- to 6- CARBON ALIPHATIC AMIDES AND ON MANY AROMATIC AMIDES [CATALYTIC ACTIVITY : A MONOCARBOXYLIC ACID AMIDE + H(2)O = A MONOCARBOXYLATE + NH(3)]." /note="catalyzes the hydrolysis of a monocarboxylic acid amid to form a monocarboxylate and ammonia" /codon_start=1 /transl_table=11 /product="amidase" /protein_id="NP_215779.1" /db_xref="GI:15608403" /db_xref="GeneID:887041" /translation="MDPTDLAFAGAAAQARMLADGALTAPMLLEVYLQRIERLDSHLR AYRVVQFDRARAEAEAAQQRLDAGERLPLLGVPIAIKDDVDIAGEVTTYGSAGHGPAA TSDAEVVRRLRAAGAVIIGKTNVPELMIMPFTESLAFGATRNPWCLNRTPGGSSGGSA AAVAAGLAPVALGSDGGGSIRIPCTWCGLFGLKPQRDRISLEPHDGAWQGLSVNGPIA RSVMDAALLLDATTTVPGPEGEFVAAAARQPGRLRIALSTRVPTPLPVRCGKQELAAV HQAGALLRDLGHDVVVRDPDYPASTYANYLPRFFRGISDDADAQAHPDRLEARTRAIA RLGSFFSDRRMAALRAAEVVLSSRIQSIFDDVDVVVTPGAATGPSRIGAYQRRGAVST LLLVVQRVPYFQVWNLTGQPAAVVPWDFDGDGLPMSVQLVGRPYDEATLLALAAQIES ARPWAHRRPSVS" misc_feature 1410776..1410799 /gene="amiB2" /locus_tag="Rv1263" /note="PS00017 ATP/GTP-binding site motif A" gene 1411894..1413087 /locus_tag="Rv1264" /db_xref="GeneID:887035" CDS 1411894..1413087 /locus_tag="Rv1264" /EC_number="4.6.1.1" /function="POSSIBLY INVOLVED IN cAMP SYNTHESIS [CATALYTIC ACTIVITY: ATP = 3',5'-CYCLIC AMP + DIPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv1264, (MTCY50.18c), len: 397 aa. Adenylate cyclase (EC 4.6.1.1) (function proven experimentally: see Linder et al., 2002), showing some similarity to other adenylate cyclases e.g. CYAA_BRELI|P27580 (403 aa), FASTA scores, opt: 270, E(): 1.3e-10, (29.3% identity in 317 aa overlap); etc. Similar to other putative cyclases in M. tuberculosis e.g. Rv2212, Rv1647. The C terminus seems to code for a catalytic domain belonging to a subfamily of adenylyl cyclase isozymes (mostly found in Gram-positive bacteria). The N terminus seems to be a potential novel regulator of adenylyl cyclase activity (autoinhibitory domain). BELONGS TO THE ADENYLYL CYCLASE CLASS-4/GUANYLYL CYCLASE FAMILY." /codon_start=1 /transl_table=11 /product="adenylyl cyclase" /protein_id="NP_215780.1" /db_xref="GI:15608404" /db_xref="GeneID:887035" /translation="MTDHVREADDANIDDLLGDLGGTARAERAKLVEWLLEQGITPDE IRATNPPLLLATRHLVGDDGTYVSAREISENYGVDLELLQRVQRAVGLARVDDPDAVV HMRADGEAAARAQRFVELGLNPDQVVLVVRVLAEGLSHAAEAMRYTALEAIMRPGATE LDIAKGSQALVSQIVPLLGPMIQDMLFMQLRHMMETEAVNAGERAAGKPLPGARQVTV AFADLVGFTQLGEVVSAEELGHLAGRLAGLARDLTAPPVWFIKTIGDAVMLVCPDPAP LLDTVLKLVEVVDTDNNFPRLRAGVASGMAVSRAGDWFGSPVNVASRVTGVARPGAVL VADSVREALGDAPEADGFQWSFAGPRRLRGIRGDVRLFRVRRGATRTGSGGAAQDDDL AGSSP" gene 1413260..1413940 /locus_tag="Rv1265" /db_xref="GeneID:887058" CDS 1413260..1413940 /locus_tag="Rv1265" /function="UNKNOWN. SEEMS TO BE EXPRESSED DURING MACROPHAGE INFECTION." /experiment="experimental evidence, no additional details recorded" /note="Rv1265, (MTCY50.17c), len: 226 aa. Hypothetical unknown protein (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215781.1" /db_xref="GI:15608405" /db_xref="GeneID:887058" /translation="MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMH GRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLES PEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVP VMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLR AQGPDLPA" gene complement(1413960..1415840) /gene="pknH" /locus_tag="Rv1266c" /db_xref="GeneID:887023" CDS complement(1413960..1415840) /gene="pknH" /locus_tag="Rv1266c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO BE INVOLVED IN ARABINAN METABOLISM, PHOSPHORYLATING PERHAPS EMBR|Rv1267c [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /note="Rv1266c, (MTCY50.16), len: 626 aa. Probable pknH, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), similar to many e.g. PKN1_MYXXA|P33973 pkn1 (693 aa), FASTA scores: opt: 611, E(): 1.4e- 14, (29.7% identity in 492 aa overlap); etc. Contains PS00107 Protein kinases ATP-binding region signature; PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase H" /protein_id="NP_215782.1" /db_xref="GI:15608406" /db_xref="GeneID:887023" /translation="MSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAV KLMTAEFSKDPVFRERMKREARIAGRLQEPHVVPIHDYGEVDGQMFLEMRLVEGTDLD SVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILITRDDFAYLVDFGI ASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADIYALACVLHECLTGAPPYRAD SAGTLVSSHLMGPIPQPSAIRPGIPKAFDAVVARGMAKKPEDRYASAGDLALAAHEAL SDPDQDHAADILRRSQESTLPAPPKPVPPPTMPATAMAPRQPPAPPVTPPGVQPAPKP SYTPPAQPGPAGQRPGPTGQPSWAPNSGPMPASGPTPTPQYYQGGGWGAPPSGGPSPW AQTPRKTNPWPLVAGAAAVVLVLVLGAIGIWIAIRPKPVQPPQPVAEERLSALLLNSS EVNAVMGSSSMQPGKPITSMDSSPVTVSLPDCQGALYTSQDPVYAGTGYTAINGLISS EPGDNYEHWVNQAVVAFPTADKARAFVQTSADKWKNCAGKTVTVTNKAKTYRWTFADV KGSPPTITVIDTQEGAEGWECQRAMSVANNVVVDVNACGYRITNQAGQIAAKIVDKVN KE" misc_feature complement(1415400..1415438) /gene="pknH" /locus_tag="Rv1266c" /note="PS00108 Serine/Threonine protein kinases active-site signature" misc_feature complement(1415706..1415777) /gene="pknH" /locus_tag="Rv1266c" /note="PS00107 Protein kinases ATP-binding region signature" gene complement(1416181..1417347) /gene="embR" /locus_tag="Rv1267c" /db_xref="GeneID:887026" CDS complement(1416181..1417347) /gene="embR" /locus_tag="Rv1267c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM. THOUGHT TO REGULATE THE BIOSYNTHESIS OF THE MYCOBACTERIAL CELL WALL ARABINAN AND RESISTANCE TO ETHAMBUTOL (Emb; dextro-2,2'-(ethylenediimino)-di-1-butanol), REGULATING EMBA|Rv3794 AND EMBB|Rv3795." /note="Rv1267c, (MT1305, MTCY50.15), len: 388 aa. Probable embR, regulatory protein (see citation below), similar to many e.g. AFSR_STRCO|P25941 regulatory protein AfsR from Streptomyces coelicolor (993 aa), FASTA scores: opt: 489, E(): 1e-25, (33.5% identity in 361 aa overlap); etc. BELONGS TO THE AFSR/DNRI/REDD FAMILY OF REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein EMBR" /protein_id="NP_215783.1" /db_xref="GI:15608407" /db_xref="GeneID:887026" /translation="MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVI NRNRPVGVDALITALWEEWPPSGARASIHSYVSNLRKLLGGAGIDPRVVLAAAPPGYR LSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVE PFATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLS DRQSDALGAYRRVKTTLADDLGIDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTV LDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVLDSANVSRHHAVIVD TGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT" gene complement(1417658..1418356) /locus_tag="Rv1268c" /db_xref="GeneID:887033" CDS complement(1417658..1418356) /locus_tag="Rv1268c" /function="UNKNOWN" /note="Rv1268c, (MTCY50.14), len: 232 aa. Hypothetical unknown protein, probably secreted protein : contains possible signal peptide sequence (score 7.9 at residue 28)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215784.1" /db_xref="GI:15608408" /db_xref="GeneID:887033" /translation="MTTSKIATAFKTATFALAAGAVALGLASPADAAAGTMYGDPAAA AKYWRQQTYDDCVLMSAADVIGQVTGREPSERAIIKVAQSTPSVVHPGSIYTKPADAE HPNSGMGTSVADIPTLLAHYGVDAVITDEDHATATGVATGMAALEQYLGSGHAVIVSI NAEMIWGQPVEETDSAGNPRSDHAVVVTGVDTENGIVHLNDSGTPTGRDEQIPMETFV EAWATSHDFMAVTT" gene complement(1418579..1418953) /locus_tag="Rv1269c" /db_xref="GeneID:887039" CDS complement(1418579..1418953) /locus_tag="Rv1269c" /function="UNKNOWN" /note="Rv1269c, (MTCY50.13), len: 124 aa. Conserved probable exported protein with putative N-terminal signal sequence. Similar to Mycobacterium tuberculosis protein Rv1813c|Y0DU_MYCTU|Q50620 hypothetical protein cy1a11.30 (137 aa), FASTA scores: E(): 9e-21, (41.6% identity in 137 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215785.1" /db_xref="GI:15608409" /db_xref="GeneID:887039" /translation="MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAY SGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTL AAAMKDALTKLGGGYIDTWACN" gene complement(1419014..1419748) /gene="lprA" /locus_tag="Rv1270c" /db_xref="GeneID:887017" CDS complement(1419014..1419748) /gene="lprA" /locus_tag="Rv1270c" /function="UNKNOWN" /note="Rv1270c, (MTCY50.12), len: 244 aa. Possible lprA, lipoprotein. Similar to O32852|AJ000500 lipoprotein from Mycobacterium bovis (236 aa), fasta scores: E(): 5.2e-23, (35.1% identity in 245 aa overlap). Similar to M. tuberculosis lipoproteins: Rv1368, Rv1411c, Rv2945c. Contains probable N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="lipoprotein LprA" /protein_id="NP_215786.1" /db_xref="GI:15608410" /db_xref="GeneID:887017" /translation="MKHPPCSVVAAATAILAVVLAIGGCSTEGDAGKASDTAATASNG DAAMLLKQATDAMRKVTGMHVRLAVTGDVPNLRVTKLEGDISNTPQTVATGSATLLVG NKSEDAKFVYVDGHLYSDLGQPGTYTDFGNGASIYNVSVLLDPNKGLANLLANLKDAS VAGSQQADGVATTKITGNSSADDIATLAGSRLTSEDVKTVPTTVWIASDGSSHLVQIQ IAPTKDTSVTLTMSDWGKQVTATKPV" gene complement(1419961..1420302) /locus_tag="Rv1271c" /db_xref="GeneID:887019" CDS complement(1419961..1420302) /locus_tag="Rv1271c" /function="UNKNOWN" /note="Rv1271c, (MTCY50.11), len: 113 aa. Conserved hypothetical exported protein with potential N-terminal signal sequence. Similar to Mycobacterium tuberculosis hypothetical proteins Rv1804c, Rv1810, Rv0622, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215787.1" /db_xref="GI:15608411" /db_xref="GeneID:887019" /translation="MLSPLSPRIIAAFTTAVGAAAIGLAVATAGTAGANTKDEAFIAQ MESIGVTFSSPQVATQQAQLVCKKLASGETGTEIAEEVLSQTNLTTKQAAYFVVDATK AYCPQYASQLT" gene complement(1420410..1422305) /locus_tag="Rv1272c" /db_xref="GeneID:887021" CDS complement(1420410..1422305) /locus_tag="Rv1272c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DRUGS ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1272c, (MTCY50.10), len: 631 aa. Probable drugs-transport transmembrane ATP-binding protein ABC transporter (see citation below), similar to e.g. Y015_MYCGE|P47261 hypothetical ABC transporter mg015m from Mycoplasma genitalium (589 aa), FASTA scores: opt: 1054, E(): 0, (34.3% identity in 522 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop); and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS), MSBA SUBFAMILY." /codon_start=1 /transl_table=11 /product="drugs-transport transmembrane ATP-binding protein ABC transporter" /protein_id="NP_215788.1" /db_xref="GI:15608412" /db_xref="GeneID:887021" /translation="MTAPPGARPRAASPPPNMRSRDFWGSAARLVKRLAPQRRLSIAV ITLGIAGTTIGVIVPRILGHATDLLFNGVIGRGLPGGITKAQAVASARARGDNTFADL LSGMNVVPGQGVDFAAVERTLALALALYLAAALMIWAQARLLNLTVQKTMVRLRTDVE DKVHRLPLSYFDGQQRGELLSRVTNDIDNLQSSLSMTISQLVTSILTMVAVLAMMVSI SGLLALITLLTVPLSLLVTRAITRRSQPLFVAHWTSTGRLNAHLEETYSGFTVVKTFG HQAAARERFHELNDDVYQAGFGAQFLSGLVQPATAFIGNLGYVAVAVAGGLQVATGQI TLGSIQAFIQYIRQFNMPLSQLAGMYNALQSGVASAERVFDVLDEPEESPEPEPELPN LTGRVEFEHVNFAYLPGTPVIRDLSLVAEPGSTVAIVGPTGAGKTTLVNLLMRFYEIG SGRILIDGVDIASVSRQSLRSRIGMVLQDTWLYDGTIAENIAYGRPEATTDEIVEAAR AAHVDRFVNTLPAGYQTRVSGDGGSISVGEKQLITIARAFLARPQLLILDEATSSVDT RTELLIQRAMRELRRDRTSFIIAHRLSTIRDADHILVVQTGQIVERGNHAELLARRGV YYQMTRA" misc_feature complement(1420662..1420706) /locus_tag="Rv1272c" /note="PS00211 ABC transporters family signature" misc_feature complement(1420995..1421018) /locus_tag="Rv1272c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1422302..1424050) /locus_tag="Rv1273c" /db_xref="GeneID:887025" CDS complement(1422302..1424050) /locus_tag="Rv1273c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DRUGS ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1273c, (MTCY50.09), len: 582 aa. Probable drugs-transport transmembrane ATP-binding protein ABC transporter (see citation below), similar to e.g. YWJA_BACSU|P45861 hypothetical abc transporter from B. subtilis (575 aa), FASTA scores: opt: 810, E(): 0, (27.0% identity in 578 aa overlap); etc. Contains PS00136 Serine proteases, subtilase family, aspartic acid active site; 2 x PS00211 ABC transporters family signature; and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS), MSBA SUBFAMILY." /codon_start=1 /transl_table=11 /product="drugs-transport transmembrane ATP-binding protein ABC transporter" /protein_id="NP_215789.1" /db_xref="GI:15608413" /db_xref="GeneID:887025" /translation="MLLALLRQHIRPYRRLVAMLMMLQLVSTLASLYLPTVNAAIVDD GVAKGDTATIVRLGAVMLGVTGLQVLCAIGAVYLGSRTGAGFGRDLRSAMFEHIITFS ERETARFGAPTLLTRSTNDVRQILFLVQMTATVLVTAPIMCVGGIIMAIHQEAALTWL LLVSVPILAVANYWIISHMLPLFRRMQSLIDGINRVMRDQLSGVRVVRAFTREGYERD KFAQANTALSNAALSAGNWQALMLPVTTLTINASSVALIWFGGLRIDSGQMQVGSLIA FLSYFAQILMAVLMATMTLAVLPRASVCAERITEVLSTPAALGNPDNPKFPTDGVTGV VRLAGATFTYPGADCPVLQDISLTARPGTTTAIVGSTGSGKSTLVSLICRLYDVTAGA VLVDGIDVREYHTERLWSAIGLVPQRSYLFSGTVADNLRYGGGPDQVVTEQEMWEALR VAAADGFVQTDGLQTRVAQGGVNFSGGQRQRLAIARAVIRRPAIYVFDDAFSALDVHT DAKVHASLRQVSGDATIIVVTQRISNAAQADQVIVVDNGKIVGTGTHETLLADCPTYA EFAASQSLSATVGGVG" misc_feature complement(1422587..1422631) /locus_tag="Rv1273c" /note="PS00211 ABC transporters family signature" misc_feature complement(1422923..1422946) /locus_tag="Rv1273c" /note="PS00017 ATP/GTP-binding site motif A" misc_feature complement(1423313..1423357) /locus_tag="Rv1273c" /note="PS00211 ABC transporters family signature" misc_feature complement(1423901..1423936) /locus_tag="Rv1273c" /note="PS00136 Serine proteases, subtilase family, aspartic acid active site" gene 1424197..1424754 /gene="lprB" /locus_tag="Rv1274" /db_xref="GeneID:887011" CDS 1424197..1424754 /gene="lprB" /locus_tag="Rv1274" /function="UNKNOWN" /note="Rv1274, (MTCY50.08c), len: 185 aa. Possible lprB, lipoprotein; contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013) . Some similarity to Rv1275." /codon_start=1 /transl_table=11 /product="lipoprotein LprB" /protein_id="NP_215790.1" /db_xref="GI:15608414" /db_xref="GeneID:887011" /translation="MRRKVRRLTLAVSALVALFPAVAGCSDSGDNKPGATIPSTPANA EGRHGPFFPQCGGVSDQTVTELTRVTGLVNTAKNSVGCQWLAGGGILGPHFSFSWYRG SPIGRERKTEELSRASVEDINIDGHSGFIAIGNEPSLGDSLCEVGIQFSDDFIEWSVS FSQKPFPLPCDIAKELTRQSIANSK" gene 1424751..1425293 /gene="lprC" /locus_tag="Rv1275" /db_xref="GeneID:887014" CDS 1424751..1425293 /gene="lprC" /locus_tag="Rv1275" /function="UNKNOWN" /note="Rv1275, (MTCY50.07c), len: 180 aa. Possible lprC, lipoprotein; contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Some similarity to Rv1274." /codon_start=1 /transl_table=11 /product="lipoprotein LprC" /protein_id="NP_215791.1" /db_xref="GI:15608415" /db_xref="GeneID:887014" /translation="MRRVLVGAAALITALLVLTGCTKSISGTAVKAGGAGVPRNNNSQ ERYPNLLKECEVLTTDILAKTVGADPLDIQSTFVGAICRWQAANPAGLIDITRFWFEQ GSLSNERKVAEGLKYQVETRAIQGVDSIVMRTGDPNGACGVASDAAGVVGWWVNPQAP GIDACGQAIKLMELTLATNA" gene complement(1425438..1425914) /locus_tag="Rv1276c" /db_xref="GeneID:887000" CDS complement(1425438..1425914) /locus_tag="Rv1276c" /function="UNKNOWN" /note="Rv1276c, (MTCY50.06), len: 158 aa. Conserved hypothetical protein, similar to AL096844|SCI28.03 hypothetical protein from Streptomyces coelicolor (172 aa), FASTA scores: opt: 385, E(): 3.3e-19, (43.5% identity in 161 aa overlap). Some similarity to P76502|SIXA_ECOLI PHOSPHOHISTIDINE PHOSPHATASE SIXA (161 aa), FASTA scores: opt: 146, E(): 0.0047, (31.9% identity in 116 aa overlap). BELONGS TO THE SIXA FAMILY OF PHOSPHATASES." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215792.1" /db_xref="GI:15608416" /db_xref="GeneID:887000" /translation="MRHAKSAYPDGIADHDRPLAPRGIREAGLAGGWLRANLPAVDAV LCSTATRARQTLAHTGIDAPARYAERLYGAAPGTVIEEINRVGDNVTTLLVVGHEPTT SALAIVLASISGTDAAVAERISEKFPTSGIAVLRVAGHWADVEPGCAALVGFHVPR" gene 1426164..1427417 /locus_tag="Rv1277" /db_xref="GeneID:887001" CDS 1426164..1427417 /locus_tag="Rv1277" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1277, (MTCY50.05c), len: 417 aa. Conserved hypothetical protein, some similarity to 3914967|O68033|SBCD_RHOCA EXONUCLEASE SBCD HOMOLOG from Rhodobacter capsulatus (405 aa). May be sbcD protein (see Mizrahi & Andersen 1998)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215793.1" /db_xref="GI:15608417" /db_xref="GeneID:887001" /translation="MSPRPGPAGRGPAPCRCADLHSLCVDSHALRRDGMRFLHTADWQ LGMTRHFLAGDAQPRYSAARRDAVAGLKALAADVGAEFVVVAGDVFEHNQLAPQIVGQ SLEAMRVIGLPVYLLPGNHDPLDASSVYTSTLFRAERPDNVVVLDRAGVHEVRPGVQI VAAPWRSKAPTTDPVAEVLAGLPTDAAIRLLVAHGGVDALDPDHDKPSLIRLAALDDA LTRQAIHYVALGDKHSLTQVGSSGRVWYSGAPEVTNFDDVEPDPGHVLVVDIDESDPR HPVTVDARRIGRWRFVTLHHQVDTSRDIADLDLNLDLMTDKDRTVVRLALTGSLTVTD RAALDTCLDKYARLFAWLGLWERHTDLAVIPVDAEFTDLGIGGFAAAAVDELVATARG GDDESAVDAQAALALLLRLADRGAA" gene 1427414..1430041 /locus_tag="Rv1278" /db_xref="GeneID:887005" CDS 1427414..1430041 /locus_tag="Rv1278" /function="UNKNOWN" /note="Rv1278, (MTCY50.04c), len: 875. Hypothetical unknown protein, possible coiled-coil regions, contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215794.1" /db_xref="GI:15608418" /db_xref="GeneID:887005" /translation="MKLHRLALTNYRGIAHRDVEFPDHGVVVVCGANEIGKSSMVEAL DLLLEYKDRSTKKEVKQVKPTNADVGSEVIAEISSGPYRFVYRKRFHKRCETELTVLA PRREQLTGDEAHERVRTMLAETVDTELWHAQRVLQAASTAAVDLSGCDALSRALDLAA GDDAALSGTESLLIERIEAEYARYFTPTGRPTGEWSAAVSRLAAAEAAVADCAAAVAE VDDGVRRHTELTEQVAELSQQLLAHQLRLEAARVAAEKIAAITDDAREAKLIATAAAA TSGASTAAHAGRLGLLTEIDTRTAAVVAAEAKARQAADEQATARAEAEACDAALTEAT QVLTAVRLRAESARRTLDQLADCEEADRLAARLARIDDIEGDRDRVCAELSAVTLTEE LLSRIERAAAAVDRGGAQLASISAAVEFTAAVDIELGVGDQRVSLSAGQSWSVTATGP TEVKVPGVLTARIVPGATALDFQAKYAAAQQELADALAAGEVADLAAARSADLCRREL LSRRDQLTATLAGLCGDEQVDQLRSRLEQLCAGQPAELDLVSTDTATARAELDAVEAA RIAAEKDCETRRQIAAGAARRLAETSTRATVLQNAAAAESAELGAAMTRLACERASVG DDELAAKAEADLRVLQTAEQRVIDLADELAATAPDAVAAELAEAADAVELLRERHDEA IRALHEVGVELSVFGTQGRKGKLDAAETEREHAASHHARVGRRARAARLLRSVMARHR DTTRLRYVEPYRAELHRLGRPVFGPSFEVEVDTDLRIRSRTLDDRTVPYECLSGGAKE QLGILARLAGAALVAKEDAVPVLIDDALGFTDPERLAKMGEVFDTIGADGQVIVLTCS PTRYGGVKGAHRIDLDAIQ" misc_feature 1427504..1427527 /locus_tag="Rv1278" /note="PS00017 ATP/GTP-binding site motif A" gene 1430062..1431648 /locus_tag="Rv1279" /db_xref="GeneID:887002" CDS 1430062..1431648 /locus_tag="Rv1279" /EC_number="1.1.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM, PROBABLY ELECTRON-TRANSFER-LINKED." /note="Rv1279, (MTCY50.03c), len: 528 aa. Probable dehydrogenase, FAD flavoprotein GMC oxidoreductase (EC 1.1.-.-), similar to several e.g. dBETA_ECOLI|P17444 choline dehydrogenase from Escherichia coli (556 aa), FASTA scores, opt: 1047, E(): 0, (37.7% identity in 541 aa overlap). Similar to Rv0697 putative Mycobacterium tuberculosis GMC oxidoreductase. Contains PS00623 GMC oxidoreductases signature 1, and PS00624 GMC oxidoreductases signature 2. BELONGS TO THE GMC OXIDOREDUCTASES FAMILY." /codon_start=1 /transl_table=11 /product="dehydrogenase FAD flavoprotein GMC oxidoreductase" /protein_id="NP_215795.1" /db_xref="GI:15608419" /db_xref="GeneID:887002" /translation="MDTQSDYVVVGTGSAGAVVASRLSTDPATTVVALEAGPRDKNRF IGVPAAFSKLFRSEIDWDYLTEPQPELDGREIYWPRGKVLGGSSSMNAMMWVRGFASD YDEWAARAGPRWSYADVLGYFRRIENVTAAWHFVSGDDSGVTGPLHISRQRSPRSVTA AWLAAARECGFAAARPNSPRPEGFCETVVTQRRGARFSTADAYLKPAMRRKNLRVLTG ATATRVVIDGDRAVGVEYQSDGQTRIVYARREVVLCAGAVNSPQLLMLSGIGDRDHLA EHDIDTVYHAPEVGCNLLDHLVTVLGFDVEKDSLFAAEKPGQLISYLLRRRGMLTSNV GEAYGFVRSRPELKLPDLELIFAPAPFYDEALVPPAGHGVVFGPILVAPQSRGQITLR SADPHAKPVIEPRYLSDLGGVDRAAMMAGLRICARIAQARPLRDLLGSIARPRNSTEL DEATLELALATCSHTLYHPMGTCRMGSDEASVVDPQLRVRGVDGLRVADASVMPSTVR GHTHAPSVLIGEKAADLIRS" misc_feature 1430302..1430373 /locus_tag="Rv1279" /note="PS00623 GMC oxidoreductases signature 1" misc_feature 1430827..1430871 /locus_tag="Rv1279" /note="PS00624 GMC oxidoreductases signature 2" gene complement(1431665..1433440) /gene="oppA" /locus_tag="Rv1280c" /db_xref="GeneID:886985" CDS complement(1431665..1433440) /gene="oppA" /locus_tag="Rv1280c" /function="INVOLVED IN ACTIVE TRANSPORT OF OLIGOPEPTIDE ACROSS THE MEMBRANE (IMPORT). THIS PROTEIN IS A COMPONENT OF THE OLIGOPEPTIDE PERMEASE, A BINDING PROTEIN-DEPENDENT TRANSPORT SYSTEM; IT BINDS PEPTIDES UP TO FIVE AMINO ACIDS LONG WITH HIGH AFFINITY." /note="Rv1280c, (MTCY50.02), len: 591 aa. Probable oppA, oligopeptide-binding lipoprotein component of peptide transport system (see citation below), sharing some similarity to other periplasmic solute binding proteins e.g. OPPA_SALTY|P06202 periplasmic oligopeptide-binding protein from Salmonella typhimurium (542 aa), FASTA scores: E(): 5.1e-05, (22.1% identity in 458 aa overlap); etc. Also similar to Rv1166 and Rv2585c from Mycobacterium tuberculosis. Has possible N-terminal signal sequence and prokaryotic lipoprotein lipid attachment site (PS00013). BELONGS TO THE BACTERIAL EXTRACELLULAR SOLUTE-BINDING PROTEIN FAMILY 5." /codon_start=1 /transl_table=11 /product="periplasmic oligopeptide-binding lipoprotein OppA" /protein_id="NP_215796.1" /db_xref="GI:15608420" /db_xref="GeneID:886985" /translation="MADRGQRRGCAPGIASALRASFQGKSRPWTQTRYWAFALLTPLV VAMVLTGCSASGTQLELAPTADRRAAVGTTSDINQQDPATLQDGGNLRLSLTDFPPNF NILHIDGNNAEVAAMMKATLPRAFIIGPDGSTTVDTNYFTSIELTRTAPQVVTYTINP EAVWSDGTPITWRDIASQIHAISGADKAFEIASSSGAERVASVTRGVDDRQAVVTFAK PYAEWRGMFAGNGMLLPASMTATPEAFNKGQLDGPGPSAGPFVVSALDRTAQRIVLTR NPRWWGARPRLDSITYLVLDDAARLPALQNNTIDATGVGTLDQLTIAARTKGISIRRA PGPSWYHFTLNGAPGSILADKALRLAIAKGIDRYTIARVAQYGLTSDPVPLNNHVFVA GQDGYQDNSGVVAYNPEQAKRELDALGWRRSGAFREKDGRQLVIRDLFYDAQSTRQFA QIAQHTLAQIGVKLELQAKSGSGFFSDYVNVGAFDIAQFGWVGDAFPLSSLTQIYASD GESNFGKIGSPQIDAAIERTLAELDPGKARALANQVDELIWAEGFSLPLTQSPGTVAV RSTLANFGATGLADLDYTAIGFMRR" gene complement(1433433..1435271) /gene="oppD" /locus_tag="Rv1281c" /db_xref="GeneID:886997" CDS complement(1433433..1435271) /gene="oppD" /locus_tag="Rv1281c" /function="INVOLVED IN ACTIVE TRANSPORT OF OLIGOPEPTIDE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv1281c, (MTCY50.01), len: 612 aa. Probable oppD, oligopeptide-transport ATP-binding protein ABC transporter (see citation below), similar to others e.g. DPPD_BACSU|P26905 dipeptide transport ATP-binding protein from Bacillus subtilis (335 aa), FASTA scores: opt: 983, E(): 0, (48.6% identity in 319 aa overlap); etc. Contains 2 x PS00017 ATP/GTP-binding site motif A (P-loop); 2 x PS00211 ABC transporters family signature.BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="oligopeptide-transport ATP-binding protein ABC transporter OppD" /protein_id="NP_215797.1" /db_xref="GI:15608421" /db_xref="GeneID:886997" /translation="MSPLLEVTDLAVTFRTDGDPVTAVRGISYRVEPGEVVAMVGESG SGKSAAAMAVVGLLPEYAQVRGSVRLQGTELLGLADNAMSRFRGKAIGTVFQDPMSAL TPVYTVGDQIAEAIEVHQPRVGKKAARRRAVELLDLVGISQPQRRSRAFPHELSGGER QRVVIAIAIANDPDLLICDEPTTALDVTVQAQILDVLKAARDVTGAGVLIITHDLGVV AEFADRALVMYAGRVVESAGVNDLYRDRRMPYTVGLLGSVPRLDAAQGTRLVPIPGAP PSLAGLAPGCPFAPRCPLVIDECLTAEPELLDVATDHRAACIRTELVTGRSAADIYRV KTEARPAALGDASVVVRVRHLVKTYRLAKGVVLRRAIGEVRAVDGISLELRQGRTLGI VGESGSGKSTTLHEILELAAPQSGSIEVLGTDVATLGTAERRSLRRDIQVVFQDPVAS LDPRLPVFDLIAEPLQANGFGKNETHARVAELLDIVGLRHGDASRYPAEFSGGQKQRI GIARALALQPKILALDEPVSALDVSIQAGIINLLLDLQEQFGLSYLFVSHDLSVVKHL AHQVAVMLAGTVVEQGDSEEVFGNPKHEYTRRLLGAVPQPDPARRG" misc_feature complement(1433730..1433774) /gene="oppD" /locus_tag="Rv1281c" /note="PS00211 ABC transporters family signature" misc_feature complement(1434069..1434092) /gene="oppD" /locus_tag="Rv1281c" /note="PS00017 ATP/GTP-binding site motif A" misc_feature complement(1434765..1434809) /gene="oppD" /locus_tag="Rv1281c" /note="PS00211 ABC transporters family signature" misc_feature complement(1435128..1435151) /gene="oppD" /locus_tag="Rv1281c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1435268..1436143) /gene="oppC" /locus_tag="Rv1282c" /db_xref="GeneID:886980" CDS complement(1435268..1436143) /gene="oppC" /locus_tag="Rv1282c" /function="INVOLVED IN ACTIVE TRANSPORT OF OLIGOPEPTIDE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1282c, (MTCY373.01c-MTCY3H3.01), len: 291 aa. Probable oppC, oligopeptide-transport integral membrane protein ABC transporter (see citation below), similar to other integral membrane proteins e.g. OPPC_ECOLI|P77664 oligopeptide transport system permease from Escherichia coli (302 aa), FASTA scores: E(): 4.6e-33, (40.7% identity in 275 aa overlap); etc. Also similar to Rv3664c|DPPC probable peptide-transport integral membrane protein from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="oligopeptide-transport integral membrane protein ABC transporter OppC" /protein_id="NP_215798.1" /db_xref="GI:15608422" /db_xref="GeneID:886980" /translation="MTEFASRRTLVVRRFLRNRAAVASLAALLLLFVSAYALPPLLPY SYDDLDFNALLQPPGTKHWLGTNALGQDLLAQTLRGMQKSMLIGVCVAVISTGIAATV GAISGYFGGWRDRTLMWVVDLLLVVPSFILIAIVTPRTKNSANIMFLVLLLAGFGWMI SSRMVRGMTMSLREREFIRAARYMGVSSRRIIVGHVVPNVASILIIDAALNVAAAILA ETGLSFLGFGIQPPDVSLGTLIADGTASATAFPWVFLFPASILVLILVCANLTGDGLR DALDPASRSLRRGVR" gene complement(1436140..1437117) /gene="oppB" /locus_tag="Rv1283c" /db_xref="GeneID:886981" CDS complement(1436140..1437117) /gene="oppB" /locus_tag="Rv1283c" /function="INVOLVED IN ACTIVE TRANSPORT OF OLIGOPEPTIDE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1283c, (MTCY373.02c), len: 325 aa. Probable oppB, oligopeptide-transport integral membrane protein ABC transporter (see citation below), similar to other integral membrane proteins e.g. DPPB_ECOLI|P37316 dipeptide transport system permease protein from Escherichia coli (339 aa), FASTA scores: opt: 402, E(): 3.4e-20, (31.0% identity in 345 aa overlap); etc. Also similar to Rv3665c|DppB probable peptide-transport integral membrane protein from Mycobacterium tuberculosis. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="oligopeptide-transport integral membrane protein ABC transporter OppB" /protein_id="NP_215799.1" /db_xref="GI:15608423" /db_xref="GeneID:886981" /translation="MTRYLARRLLNYLVLLALASFLTYCLTSLAFSPLESLMQRSPRP PQAVIDAKAHDLGLDRPILARYANWVSHAVRGDFGTTITGQPVGTELGRRIGVSLRLL VVGSVFGTVAGVVIGAWGAIRQYRLSDRVMTTLALLVLSTPTFVVANLLILGALRVNW AVGIQLFDYTGETSPGVAGGVWDRLGDRLQHLILPSLTLALAAAAGFSRYQRNAMLDV LGQDFIRTARAKGLTRRRALLKHGLRTALIPMATLFAYGVAGLVTGAVFVEKIFGWHG MGEWMVRGISTQDTNIVAAITVFSGAVVLLAGLLSDVIYAALDPRVRVS" misc_feature complement(1436386..1436472) /gene="oppB" /locus_tag="Rv1283c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature." gene 1437324..1437815 /locus_tag="Rv1284" /db_xref="GeneID:886987" CDS 1437324..1437815 /locus_tag="Rv1284" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1284, (MTCY373.03), len: 163 aa. Conserved hypothetical protein, similar to AL109663|SC4A10.26 hypothetical protein from Streptomyces coelicolor (167 aa), FASTA scores: opt: 567, E(): 1.5e-32, (53.4% identity in 163 aa overla); shows some similarity to hypothetical protein from Methanobacterium thermoautotrophicum. Weak similarity to carbonic anhydrases e.g. U51624|MTU516242|P17582 Methanothermobacter thermautotrophicus (171 aa), FASTA score: opt: 305, E(): 1 .2e-14, (35.2% identity in 165 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215800.1" /db_xref="GI:15608424" /db_xref="GeneID:886987" /translation="MTVTDDYLANNVDYASGFKGPLPMPPSKHIAIVACMDARLDVYR MLGIKEGEAHVIRNAGCVVTDDVIRSLAISQRLLGTREIILLHHTDCGMLTFTDDDFK RAIQDETGIRPTWSPESYPDAVEDVRQSLRRIEVNPFVTKHTSLRGFVFDVATGKLNE VTP" gene 1437909..1438907 /gene="cysD" /locus_tag="Rv1285" /db_xref="GeneID:886979" CDS 1437909..1438907 /gene="cysD" /locus_tag="Rv1285" /EC_number="2.7.7.4" /function="INVOLVED IN SULFATE ACTIVATION PATHWAY. FIRST STEP IN THE SULFATE ACTIVATION PATHWAY. THIS REACTION OCCURS EARLY IN THE REDUCTIVE BRANCH OF THE CYSTEINE BIOSYNTHETIC PATHWAY [CATALYTIC ACTIVITY :ATP + SULFATE = DIPHOSPHATE + ADENYLYLSULFATE]" /experiment="experimental evidence, no additional details recorded" /note="with CysN catalyzes the formation of adenylylsulfate from sulfate and ATP" /codon_start=1 /transl_table=11 /product="sulfate adenylyltransferase subunit 2" /protein_id="NP_215801.1" /db_xref="GI:15608425" /db_xref="GeneID:886979" /translation="MAITINMVNPTGFIRYEDVEQEAMTSDVTVGPAPGQYQLSHLRL LEAEAIHVIREVAAEFERPVLLFSGGKDSIVMLHLALKAFRPGRLPFPVMHVDTGHNF DEVIATRDELVAAAGVRLVVASVQDDIDAGRVVETIPSRNPIQTVTLLRAIRENQFDA AFGGARRDEEKARAKERVFSFRDEFGQWDPKAQRPELWNLYNGRHHKGEHIRVFPLSN WTEFDIWSYIGAEQVRLPSIYFAHRRKVFQRDGMLLAVHRHMQPRADEPVFEATVRFR TVGDVTCTGCVESSASTVAEVIAETAVARLTERGATRADDRISEAGMEDRKRQGYF" gene 1438907..1440751 /gene="cysN" /locus_tag="Rv1286" /db_xref="GeneID:886978" CDS 1438907..1440751 /gene="cysN" /locus_tag="Rv1286" /EC_number="2.7.7.4" /EC_number="2.7.1.25" /function="ATP SULFURYLASE MAY BE THE GTPASE, REGULATING ATP SULFURYLASE ACTIVITY [CATALYTIC ACTIVITY 1: ATP + SULFATE = DIPHOSPHATE + ADENYLYLSULFATE] AND APS KINASE CATALYZES THE SYNTHESIS OF ACTIVATED SULFATE [CATALYTIC ACTIVITY 2: ATP + ADENYLYLSULFATE = ADP + 3'- PHOSPHOADENYLYLSULFATE]. FIRST AND SECOND STEPS IN THE SULFATE ACTIVATION PATHWAY. THESE REACTIONS OCCURS EARLY IN THE REDUCTIVE BRANCH OF THE CYSTEINE BIOSYNTHETIC PATHWAY." /experiment="experimental evidence, no additional details recorded" /note="in Rhizobium meliloti this protein is involved in the synthesis of nodulation factors that are active on the roots of alfalfa; catalyzes formation of activated sulfate intermediate; converts ATP and sulfate to diphosphate and adenylylsulfate and then ATP and adenylyl sulfate to ADP and 3'-phosphoadenylyl sulfate; the activated intermediate is transferred to the nodulation factors by NodH; may interact with NodP and NodQ; similar to the CysD and CysN proteins from EScherichia coli involved in cysteine biosynthesis" /codon_start=1 /transl_table=11 /product="bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein" /protein_id="NP_215802.1" /db_xref="GI:15608426" /db_xref="GeneID:886978" /translation="MTTLLRLATAGSVDDGKSTLIGRLLYDSKAVMEDQWASVEQTSK DRGHDYTDLALVTDGLRAEREQGITIDVAYRYFATPKRKFIIADTPGHIQYTRNMVTG ASTAQLVIVLVDARHGLLEQSRRHAFLASLLGIRHLVLAVNKMDLLGWDQEKFDAIRD EFHAFAARLDVQDVTSIPISALHGDNVVTKSDQTPWYEGPSLLSHLEDVYIAGDRNMV DVRFPVQYVIRPHTLEHQDHRSYAGTVASGVMRSGDEVVVLPIGKTTRITAIDGPNGP VAEAFPPMAVSVRLADDIDISRGDMIARTHNQPRITQEFDATVCWMADNAVLEPGRDY VVKHTTRTVRARIAGLDYRLDVNTLHRDKTATALKLNELGRVSLRTQVPLLLDEYTRN ASTGSFILIDPDTNGTVAAGMVLRDVSARTPSPNTVRHRSLVTAQDRPPRGKTVWFTG LSGSGKSSVAMLVERKLLEKGISAYVLDGDNLRHGLNADLGFSMADRAENLRRLSHVA TLLADCGHLVLVPAISPLAEHRALARKVHADAGIDFFEVFCDTPLQDCERRDPKGLYA KARAGEITHFTGIDSPYQRPKNPDLRLTPDRSIDEQAQEVIDLLESSS" misc_feature 1438937..1438960 /gene="cysN" /locus_tag="Rv1286" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1439078..1439125 /gene="cysN" /locus_tag="Rv1286" /note="PS00301 GTP-binding elongation factors signature" misc_feature 1440254..1440277 /gene="cysN" /locus_tag="Rv1286" /note="PS00017 ATP/GTP-binding site motif A" gene 1440805..1441290 /locus_tag="Rv1287" /db_xref="GeneID:886998" CDS 1440805..1441290 /locus_tag="Rv1287" /function="UNKNOWN" /note="Rv1287, (MTCY373.06), len: 161 aa. Conserved hypothetical protein, similar to VJEB family of proteins e.g. FASTA score: P44675|Y379_HAEIN HYPOTHETICAL PROTEIN HI0379 (150 aa), FASTA scores: opt: 213, E(): 2.5e-08, (30.0% identity in 130 aa overlap) and YJEB_ECOLI|P21498 hypothetical 15.6 kDa protein in pura-vacb (141 aa), opt: 167, E(): 9.5e-06, (25.0% identity in 136 aa overlap). BELONGS TO THE UPF0074 (RFF2) FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215803.1" /db_xref="GI:15608427" /db_xref="GeneID:886998" /translation="MRMSAKAEYAVRAMVQLATAASGTVVKTDDLAAAQGIPPQFLVD ILTNLRTDRLVRSHRGREGGYELARPGTEISIADVLRCIDGPLASVRDIGLGDLPYSG PTTALTDVWRALRASMRSVLEETTLADVAGGALPEHVAQLADDYRAQESTRHGASRHG D" gene 1441348..1442718 /locus_tag="Rv1288" /db_xref="GeneID:886974" CDS 1441348..1442718 /locus_tag="Rv1288" /function="UNKNOWN" /note="Rv1288, (MTCY373.07), len: 456 aa. Conserved hypothetical protein, some similarity to A85B_MYCTU|P31952 antigen 85-b precursor (85b) (325 aa), FASTA scores: opt: 199, E(): 2.7e-06, (24.7% identity in 279 aa overlap). Also similar to Q01377|CSP1_CORGL PS1 PROTEIN PRECURSOR (related to antigen 85 complex) from Corynebacterium glutamicum (657 aa), FASTA scores: opt: 280, E(): 1.9e-10, (26.4% identity in 352 aa overlap). SEEMS TO CONTAIN 3 LYSM REPEATS" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215804.1" /db_xref="GI:15608428" /db_xref="GeneID:886974" /translation="MVSTHAVVAGETLSALALRFYGDAELYRLIAAASGIADPDVVNV GQRLIMPDFTRYTVVAGDTLSALALRFYGDAELNWLIAAASGIADPDVVNVGQRLIMP DFTRYTVVAGDTLSALAARFYGDASLYPLIAAVNGIADPGVIDVGQVLVIFIGRSDGF GLRIVDRNENDPRLWYYRFQTSAIGWNPGVNVLLPDDYRTSGRTYPVLYLFHGGGTDQ DFRTFDFLGIRDLTAGKPIIIVMPDGGHAGWYSNPVSSFVGPRNWETFHIAQLLPWIE ANFRTYAEYDGRAVAGFSMGGFGALKYAAKYYGHFASASSHSGPASLRRDFGLVVHWA NLSSAVLDLGGGTVYGAPLWDQARVSADNPVERIDSYRNKRIFLVAGTSPDPANWFDS VNETQVLAGQREFRERLSNAGIPHESHEVPGGHVFRPDMFRLDLDGIVARLRPASIGA AAERAD" gene 1442767..1443399 /locus_tag="Rv1289" /db_xref="GeneID:886994" CDS 1442767..1443399 /locus_tag="Rv1289" /function="UNKNOWN" /note="Rv1289, (MTCY373.08), len: 210 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215805.1" /db_xref="GI:15608429" /db_xref="GeneID:886994" /translation="MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLG VGTRFRTALRDSLDIYGVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWL HGHADESSVEFEVSPYVNASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPP DYQLSWYDHVFFISVWWGWQDHFREIVNVDRASLVALDFGDLWNGWTPVG" gene complement(1443482..1445047) /locus_tag="Rv1290c" /db_xref="GeneID:886971" CDS complement(1443482..1445047) /locus_tag="Rv1290c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN VIRULENCE." /note="Rv1290c, (MTCY373.09c), len: 521 aa. Conserved hypothetical protein (see citation below), similar to AL031013|SC8A6.09 hypothetical protein from Streptomyces coelicolor (443 aa), FASTA scores: opt: 371, E(): 9.5e-17, (28.3% identity in 446 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215806.1" /db_xref="GI:15608430" /db_xref="GeneID:886971" /translation="MLQRSLGVNGRKLAMSARSAKRERKNASTAASKCYVVPPSARGW VHAYSVTATSMLNRRKAILDYLQGAVWVLPTFGVAIGLGSGAVLSMIPVKSGTLIDKL MFQGTPGDARGVLIVVSATMITTIGIVFSLTVLSLQIASSQFSVRLLRTFLRDVPNQV VLAIFACTFAYSTGGLHTVGEHRDGGAFIPKVAVTGSLALAFVSIAALIYFLHHLMHS IQIDTIMDKVRLRTLGLVDQLYPESDTADRQVETPPSPPADAVPLLAPHSGYLQTVDV DDIAELAAASRYTALLVTFVGDYVTAGGLLGWCWRRGTAPGAPGSDFPQRCLRHVHIG FERTLQQDIRFGLRQMVDIALRALSPALNDPYTAIQVVHHLSAVESVLASRALPDDVR RDRAGELLFWLPYPSFATYLHVGCAQIRRYGSREPLVLTALLQLLSAVAQNCVDPSRR VAVQTQIALVVRAAQREFADESDRAMVLGAAARATEVVERPGTLAPPPSTFGQVAAAQ AAASTIRSADRDG" gene 1445058..1445372 /locus_tag="Rv1290A" /db_xref="GeneID:3205114" CDS 1445058..1445372 /locus_tag="Rv1290A" /function="UNKNOWN" /note="Rv1290A, len: 104 aa. Hypothetical unknown protein, equivalent to AAK45590 from Mycobacterium tuberculosis strain CDC1551 (122 aa) but shorter 18 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177642.1" /db_xref="GI:57116846" /db_xref="GeneID:3205114" /translation="MLALHGLSEGVSGSGGSGGRWGAGEVLEGARIGVIADGVSCFPT KADCRRIRGVPVFDGYTRMVARLMGSLAVLRSVSIPKGYRDFGFGSLRAVAPKNCPDV SG" gene complement(1445499..1445834) /locus_tag="Rv1291c" /db_xref="GeneID:886975" CDS complement(1445499..1445834) /locus_tag="Rv1291c" /function="UNKNOWN" /note="Rv1291c, (MTCY373.10c), len: 111 aa. Conserved hypothetical secreted protein, similar to others in Mycobacterium tuberculosis e.g. Rv1271c|Q11048|YC71_MYCTU HYPOTHETICAL 11.6 kDa PROTEIN (113 aa), FASTA score: opt: 246, E(): 1.7e-09, (40.0% identity in 110 aa overlap); Rv1804c, Rv1810, Rv0622, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215807.1" /db_xref="GI:15608431" /db_xref="GeneID:886975" /translation="MFTRRFAASMVGTTLTAATLGLAALGFAGTASASSTDEAFLAQL QADGITPPSAARAIKDAHAVCDALDEGHSAKAVIKAVAKATGLSAKGAKTFAVDAASA YCPQYVTSS" gene complement(1446193..1446265) /locus_tag="Rvnt18" /note="tRNA-Arg(CCG)" /db_xref="GeneID:2700468" tRNA complement(1446193..1446265) /locus_tag="Rvnt18" /product="tRNA-Arg" /note="codon recognized: CGG" /anticodon=(pos:1446230..1446232,aa:Arg) /db_xref="GeneID:2700468" gene 1446379..1448031 /gene="argS" /locus_tag="Rv1292" /db_xref="GeneID:886964" CDS 1446379..1448031 /gene="argS" /locus_tag="Rv1292" /EC_number="6.1.1.19" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-arginine + tRNA(Arg) = AMP + diphosphate + L-arginyl-tRNA(Arg)]." /note="catalyzes a two-step reaction, first charging an arginine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; class-I aminoacyl-tRNA synthetase" /codon_start=1 /transl_table=11 /product="arginyl-tRNA synthetase" /protein_id="NP_215808.1" /db_xref="GI:15608432" /db_xref="GeneID:886964" /translation="MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGD YASNLAMQLAKKVGTNPRELAGWLAEALTKVDGIASAEVAGPGFINMRLETAAQAKVV TSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGGTRWAAVGDALGRLLTTQGAD VVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQKAPDALSLP DAELRETFRAIGVDLMFDHIKQSLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGN IYEKDGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKRQRGFDLCIYMLGAD HHGYIARLKAAAAAFGDDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAI GVDAARYSLIRSSVDTAIDIDLALWSSASNENPVYYVQYAHARLSALARNAAELALIP DTNHLELLNHDKEGTLLRTLGEFPRVLETAASLREPHRVCRYLEDLAGDYHRFYDSCR VLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM" misc_feature 1446772..1446801 /gene="argS" /locus_tag="Rv1292" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene 1448028..1449371 /gene="lysA" /locus_tag="Rv1293" /db_xref="GeneID:886960" CDS 1448028..1449371 /gene="lysA" /locus_tag="Rv1293" /EC_number="4.1.1.20" /function="INVOLVED IN BIOSYNTHESIS OF LYSINE (LAST STEP) [CATALYTIC ACTIVITY : MESO-2,6-DIAMINOHEPTANEDIOATE = L-LYSINE + CO(2)]." /note="Rv1293, (MTCY373.13), len: 447 aa. Probable lysA, diaminopimelate decarboxylase (EC 4.1.1.20) (see citation below), almost identical to DCDA_MYCTU|P31848. Contains PS00878 Orn/DAP/Arg decarboxylases family 2 pyridoxal-P attachment site, PS00879 Orn/DAP/Arg decarboxylases family 2 signature 2. BELONGS TO FAMILY 2 OF ORNITHINE, DAP, AND ARGININE DECARBOXYLASES." /codon_start=1 /transl_table=11 /product="diaminopimelate decarboxylase LysA" /protein_id="NP_215809.1" /db_xref="GI:15608433" /db_xref="GeneID:886960" /translation="MNELLHLAPNVWPRNTTRDEVGVVCIAGIPLTQLAQEYGTPLFV IDEDDFRSRCRETAAAFGSGANVHYAAKAFLCSEVARWISEEGLCLDVCTGGELAVAL HASFPPERITLHGNNKSVSELTAAVKAGVGHIVVDSMTEIERLDAIAGEAGIVQDVLV RLTVGVEAHTHEFISTAHEDQKFGLSVASGAAMAAVRRVFATDHLRLVGLHSHIGSQI FDVDGFELAAHRVIGLLRDVVGEFGPEKTAQIATVDLGGGLGISYLPSDDPPPIAELA AKLGTIVSDESTAVGLPTPKLVVEPGRAIAGPGTITLYEVGTVKDVDVSATAHRRYVS VDGGMSDNIRTALYGAQYDVRLVSRVSDAPPVPARLVGKHCESGDIIVRDTWVPDDIR PGDLVAVAATGAYCYSLSSRYNMVGRPAVVAVHAGNARLVLRRETVDDLLSLEVR" misc_feature 1448232..1448288 /gene="lysA" /locus_tag="Rv1293" /note="PS00878 Orn/DAP/Arg decarboxylases family 2 pyridoxal-P attachment site" misc_feature 1448775..1448807 /gene="lysA" /locus_tag="Rv1293" /note="PS00879 Orn/DAP/Arg decarboxylases family 2 signature 2" gene 1449375..1450700 /gene="thrA" /locus_tag="Rv1294" /db_xref="GeneID:886962" CDS 1449375..1450700 /gene="thrA" /locus_tag="Rv1294" /EC_number="1.1.1.3" /function="INVOLVED IN THE CONVERSION OF L-ASPARTATE TO HOMOSERINE (THIRD STEP). HOMOSERINE PARTICIPATES IN THE BIOSYNTHESIS OF THREONINE AND THEN ISOLEUCINE AND IN THE BIOSYNTHESIS OF METHIONINE [CATALYTIC ACTIVITY : L-HOMOSERINE + NAD(P)(+) = L-ASPARTATE 4-SEMIALDEHYDE + NAD(P)H.]" /note="catalyzes the formation of L-aspartate 4-semialdehyde from L-homoserine" /codon_start=1 /transl_table=11 /product="homoserine dehydrogenase" /protein_id="NP_215810.1" /db_xref="GI:15608434" /db_xref="GeneID:886962" /translation="MPGDEKPVGVAVLGLGNVGSEVVRIIENSAEDLAARVGAPLVLR GIGVRRVTTDRGVPIELLTDDIEELVAREDVDIVVEVMGPVEPSRKAILGALERGKSV VTANKALLATSTGELAQAAESAHVDLYFEAAVAGAIPVIRPLTQSLAGDTVLRVAGIV NGTTNYILSAMDSTGADYASALADASALGYAEADPTADVEGYDAAAKAAILASIAFHT RVTADDVYREGITKVTPADFGSAHALGCTIKLLSICERITTDEGSQRVSARVYPALVP LSHPLAAVNGAFNAVVVEAEAAGRLMFYGQGAGGAPTASAVTGDLVMAARNRVLGSRG PRESKYAQLPVAPMGFIETRYYVSMNVADKPGVLSAVAAEFAKREVSIAEVRQEGVVD EGGRRVGARIVVVTHLATDAALSETVDALDDLDVVQGVSSVIRLEGTGL" misc_feature 1449654..1449677 /gene="thrA" /locus_tag="Rv1294" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1449927..1449995 /gene="thrA" /locus_tag="Rv1294" /note="PS01042 Homoserine dehydrogenase signature" gene 1450697..1451779 /gene="thrC" /locus_tag="Rv1295" /db_xref="GeneID:886957" CDS 1450697..1451779 /gene="thrC" /locus_tag="Rv1295" /EC_number="4.2.3.1" /function="INVOLVED IN THREONINE BIOSYNTHESIS [CATALYTIC ACTIVITY : O-PHOSPHO-L-HOMOSERINE + H(2)O = L-THREONINE + PHOSPHATE]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of L-threonine from O-phospho-L-homoserine" /codon_start=1 /transl_table=11 /product="threonine synthase" /protein_id="NP_215811.1" /db_xref="GI:15608435" /db_xref="GeneID:886957" /translation="MTVPPTATHQPWPGVIAAYRDRLPVGDDWTPVTLLEGGTPLIAA TNLSKQTGCTIHLKVEGLNPTGSFKDRGMTMAVTDALAHGQRAVLCASTGNTSASAAA YAARAGITCAVLIPQGKIAMGKLAQAVMHGAKIIQIDGNFDDCLELARKMAADFPTIS LVNSVNPVRIEGQKTAAFEIVDVLGTAPDVHALPVGNAGNITAYWKGYTEYHQLGLID KLPRMLGTQAAGAAPLVLGEPVSHPETIATAIRIGSPASWTSAVEAQQQSKGRFLAAS DEEILAAYHLVARVEGVFVEPASAASIAGLLKAIDDGWVARGSTVVCTVTGNGLKDPD TALKDMPSVSPVPVDPVAVVEKLGLA" misc_feature 1450874..1450915 /gene="thrC" /locus_tag="Rv1295" /note="PS00165 Serine/threonine dehydratases pyridoxal-phosphate attachment site" gene 1451997..1452947 /gene="thrB" /locus_tag="Rv1296" /db_xref="GeneID:886958" CDS 1451997..1452947 /gene="thrB" /locus_tag="Rv1296" /EC_number="2.7.1.39" /function="THREONINE BIOSYNTHESIS FROM ASPARATE (FOURTH STEP) [CATALYTIC ACTIVITY : ATP + L-HOMOSERINE = ADP + O-PHOSPHO-L-HOMOSERINE]." /note="catalyzes the formation of O-phospho-L-homoserine from L-homoserine in threonine biosynthesis from asparate" /codon_start=1 /transl_table=11 /product="homoserine kinase" /protein_id="NP_215812.1" /db_xref="GI:15608436" /db_xref="GeneID:886958" /translation="MVTQALLPSGLVASAVVAASSANLGPGFDSVGLALSLYDEIIVE TTDSGLTVTVDGEGGDQVPLGPEHLVVRAVQHGLQAAGVSAAGLAVRCRNAIPHSRGL GSSAAAVVGGLAAVNGLVVQTDSSPSSDAELIQLASEFEGHPDNAAAAVLGGAVVSWT DHSGDRPNYSAVSLRLHPDIRLFTAIPEQRSSTAETRVLLPAQVSHDDARFNVSRAAL LVVALTERPDLLMAATEDLLHQPQRAAAMTASAEYLRLLRRHNVAAALSGAGPSLIAL STDSELPTDAVEFGAAKGFAVTELTVGEAVRWSPTVRVPG" misc_feature 1452216..1452248 /gene="thrB" /locus_tag="Rv1296" /note="PS00639 Eukaryotic thiol (cysteine) proteases histidine active site" misc_feature 1452282..1452317 /gene="thrB" /locus_tag="Rv1296" /note="PS00627 GHMP kinases putative ATP-binding domain" gene 1453204..1455012 /gene="rho" /locus_tag="Rv1297" /db_xref="GeneID:886952" CDS 1453204..1455012 /gene="rho" /locus_tag="Rv1297" /function="FACILITATES TRANSCRIPTION TERMINATION BY A MECHANISM THAT INVOLVES RHO BINDING TO THE NASCENT RNA, ACTIVATION OF RHO'S RNA-DEPENDENT ATPASE ACTIVITY, AND RELEASE OF THE MRNA FROM THE DNA TEMPLATE" /experiment="experimental evidence, no additional details recorded" /note="An RNA-DNA helicase that actively releases nascent mRNAs from paused transcription complexes" /codon_start=1 /transl_table=11 /product="transcription termination factor Rho" /protein_id="NP_215813.1" /db_xref="GI:15608437" /db_xref="GeneID:886952" /translation="MTDTDLITAGESTDGKPSDAAATDPPDLNADEPAGSLATMVLPE LRALANRAGVKGTSGMRKNELIAAIEEIRRQANGAPAVDRSAQEHDKGDRPPSSEAPA TQGEQTPTEQIDSQSQQVRPERRSATREAGPSGSGERAGTAADDTDNRQGGQQDAKTE ERGTDAGGDQGGDQQASGGQQARGDEDGEARQGRRGRRFRDRRRRGERSGDGAEAELR EDDVVQPVAGILDVLDNYAFVRTSGYLPGPHDVYVSMNMVRKNGMRRGDAVTGAVRVP KEGEQPNQRQKFNPLVRLDSINGGSVEDAKKRPEFGKLTPLYPNQRLRLETSTERLTT RVIDLIMPIGKGQRALIVSPPKAGKTTILQDIANAITRNNPECHLMVVLVDERPEEVT DMQRSVKGEVIASTFDRPPSDHTSVAELAIERAKRLVEQGKDVVVLLDSITRLGRAYN NASPASGRILSGGVDSTALYPPKRFLGAARNIEEGGSLTIIATAMVETGSTGDTVIFE EFKGTGNAELKLDRKIAERRVFPAVDVNPSGTRKDELLLSPDEFAIVHKLRRVLSGLD SHQAIDLLMSQLRKTKNNYEFLVQVSKTTPGSMDSD" gene 1455163..1455405 /gene="rpmE" /locus_tag="Rv1298" /db_xref="GeneID:886955" CDS 1455163..1455405 /gene="rpmE" /locus_tag="Rv1298" /function="INVOLVED IN TRANSLATION" /experiment="experimental evidence, no additional details recorded" /note="RpmE; there appears to be two types of ribosomal proteins L31 in bacterial genomes; some contain a CxxC motif while others do not; Bacillus subtilis has both types; the proteins in this cluster have the CXXC motif; RpmE is found in exponentially growing Bacilli while YtiA was found after exponential growth; expression of ytiA is controlled by a zinc-specific transcriptional repressor; RpmE contains one zinc ion and a CxxC motif is responsible for this binding; forms an RNP particle along with proteins L5, L18, and L25 and 5S rRNA; found crosslinked to L2 and L25 and EF-G; may be near the peptidyltransferase site of the 50S ribosome" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L31" /protein_id="NP_215814.1" /db_xref="GI:15608438" /db_xref="GeneID:886955" /translation="MKSDIHPAYEETTVVCGCGNTFQTRSTKPGGRIVVEVCSQCHPF YTGKQKILDSGGRVARFEKRYGKRKVGADKAVSTGK" misc_feature 1455274..1455312 /gene="rpmE" /locus_tag="Rv1298" /note="PS01143 Ribosomal protein L31 signature" gene 1455495..1456568 /gene="prfA" /locus_tag="Rv1299" /db_xref="GeneID:886948" CDS 1455495..1456568 /gene="prfA" /locus_tag="Rv1299" /function="PEPTIDE CHAIN RELEASE FACTOR 1 DIRECTS THE TERMINATION OF TRANSLATION IN RESPONSE TO THE PEPTIDE CHAIN TERMINATION CODONS UAG AND UAA" /note="recognizes the termination signals UAG and UAA during protein translation a specificity which is dependent on amino acid residues residing in loops of the L-shaped tRNA-like molecule of RF1; this protein is similar to release factor 2" /codon_start=1 /transl_table=11 /product="peptide chain release factor 1" /protein_id="NP_215815.1" /db_xref="GI:15608439" /db_xref="GeneID:886948" /translation="MTQPVQTIDVLLAEHAELELALADPALHSNPAEARRVGRRFARL APIVATHRKLTSARDDLETARELVASDESFAAEVAALEARVGELDAQLTDMLAPRDPH DADDIVLEVKSGEGGEESALFAADLARMYIRYAERHGWAVTVLDETTSDLGGYKDATL AIASKADTPDGVWSRMKFEGGVHRVQRVPVTESQGRVHTSAAGVLVYPEPEEVGQVQI DESDLRIDVFRSSGKGGQGVNTTDSAVRITHLPTGIVVTCQNERSQLQNKTRALQVLA ARLQAMAEEQALADASADRASQIRTVDRSERIRTYNFPENRITDHRIGYKSHNLDQVL DGDLDALFDALSAADKQSRLRQS" misc_feature 1456179..1456229 /gene="prfA" /locus_tag="Rv1299" /note="PS00745 Prokaryotic-type class I peptide chain release factors signature" gene 1456565..1457542 /gene="hemK" /locus_tag="Rv1300" /db_xref="GeneID:886950" CDS 1456565..1457542 /gene="hemK" /locus_tag="Rv1300" /function="POSSIBLY INVOLVED IN THE OXIDATION OF PROTOPORPHYRINOGEN INTO PROTOPORPHYRIN IX" /note="Rv1300, (MTCY373.20), len: 325 aa. Probable hemK protein homolog (EC 2.1.1.-), homology suggests translation may start at aa 22, highly similar to many e.g. HEMK_MYCLE|P45832 Mycobacterium leprae (288 aa), FASTA scores: opt: 936, E(): 0, (76.7% identity in 189 aa overlap). BELONGS TO THE HEMK FAMILY OF MODIFICATION METHYLASES." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215816.1" /db_xref="GI:15608440" /db_xref="GeneID:886950" /translation="MTSAPATMRWGNLPLAGESGTMTLRQAIDLAAALLAEAGVDSAR CDAEQLAAHLAGTDRGRLPLFEPPGDEFFGRYRDIVTARARRVPLQHLIGTVSFGPVV LHVGPGVFVPRPETEAILAWATAQSLPARPLIVDACTGSGALAVALAQHRANLGLKAR IIGIDDSDCALDYARRNAAGTPVELVRADVTTPRLLPELDGQVDLMVSNPPYIPDAAV LEPEVAQHDPHHALFGGPDGMTVISAVVGLAGRWLRPGGLFAVEHDDTTSSSTVDLVS STKLFVDVQARKDLAGRPRFVTAMRWGHLPLAGENGAIDPRQRRCRAKR" repeat_region 1456585..1456627 /note="43 bp Mycobacterial Interspersed Repetitive Unit, Class III" repeat_region 1457453..1457504 /note="52 bp Mycobacterial Interspersed Repetitive Unit, Class III" repeat_region 1457505..1457557 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 1457558..1458211 /locus_tag="Rv1301" /db_xref="GeneID:886970" CDS 1457558..1458211 /locus_tag="Rv1301" /function="UNKNOWN" /note="Rv1301, (MTCY373.21), len: 217 aa. Conserved hypothetical protein, highly similar to YRFE_MYCLE|P45831 hypothetical 22.7 kDa protein in rfe-hemk intergenic region, (220 aa), FASTA scores: opt: 1168, E(): 0, (82.8% identity in 215 aa overlap). Contains PS01147 Hypothetical SUA5/yciO/yrdC family signature. BELONGS TO THE SUA5/YRDC/YCIO/YWLC FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215817.1" /db_xref="GI:15608441" /db_xref="GeneID:886970" /translation="MTETFDCADPEQRSRGIVSAVGAIKAGQLVVMPTDTVYGIGADA FDSSAVAALLSAKGRGRDMPVGVLVGSWHTIEGLVYSMPDGARELIRAFWPGALSLVV VQAPSLQWDLGDAHGTVMLRMPLHPVAIELLREVGPMAVSSANISGHPPPVDAEQARS QLGDHVAVYLDAGPSEQQAGSTIVDLTGATPRVLRPGPVSTERIAEVLGVDAASLFG" misc_feature 1457642..1457680 /locus_tag="Rv1301" /note="PS01147 Hypothetical SUA5/yciO/yrdC family signature" gene 1458295..1459509 /gene="rfe" /locus_tag="Rv1302" /db_xref="GeneID:886947" CDS 1458295..1459509 /gene="rfe" /locus_tag="Rv1302" /EC_number="2.4.1.-" /function="THOUGHT TO BE INVOLVED IN AG BIOSYNTHESIS. MAY BE THE TUNICAMYCIN SENSITIVE TRANSFERASE THAT CATALYZES THE SYNTHESIS OF GLCNAC-PYROPHOSPHORYLUNDECAPRENOL (LIPID I), THE FIRST LIPID-LINKED INTERMEDIATE INVOLVED IN ECA SYNTHESIS." /note="Rv1302, (MTCY373.22), len: 404 aa. Probable rfe (alternate gene name: wecA), undecaprenyl-phosphate alpha-N-acetylglucosaminyltransferase (EC 2.4.1.-) (see citation below), equivalent to RFE_MYCLE|P45830 Mycobacterium leprae (398 aa), FASTA scores, opt: 2285, E(): 0, (89.2% identity in 398 aa overlap).; wecA" /codon_start=1 /transl_table=11 /product="undecapaprenyl-phosphate alpha-N-acetylglucosaminyltransferase rfe (UDP-GlcNAc transferase)" /protein_id="NP_215818.1" /db_xref="GI:15608442" /db_xref="GeneID:886947" /translation="MQYGLEVSSDVAGVAGGLLALSYRGAGVPLRELALVGLTAAIIT YFATGPVRMLASRLGAVAYPRERDVHVTPTPRMGGLAMFLGIVGAVFLASQLPALTRG FVYSTGMPAVLVAGAVIMGIGLIDDRWGLDALTKFAGQITAASVLVTMGVAWSVLYIP VGGVGTIVLDQASSILLTLALTVSIVNAMNFVDGLDGLAAGLGLITALAICMFSVGLL RDHGGDVLYYPPAVISVVLAGACLGFLPHNFHRAKIFMGDSGSMLIGLMLAAASTTAA GPISQNAYGARDVFALLSPFLLVVAVMFVPMLDLLLAIVRRTRAGRSAFSPDKMHLHH RLLQIGHSHRRVVLIIYLWVGIVAFGAASSIFFNPRDTAAVMLGAIVVAGVATLIPLL RRGDDYYDPDLD" gene 1459766..1460251 /locus_tag="Rv1303" /db_xref="GeneID:886944" CDS 1459766..1460251 /locus_tag="Rv1303" /function="UNKNOWN" /note="Rv1303, (MTCY373.23), len: 161 aa. Conserved hypothetical transmembrane protein, highly similar to P53431|Y02N_MYCLE hypothetical Mycobacterium leprae protein (153 aa), FASTA score: opt: 636, E():0, (69.8% identity in 149 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215819.1" /db_xref="GI:15608443" /db_xref="GeneID:886944" /translation="MTTPAQDAPLVFPSVAFRPVRLFFINVGLAAVAMLVAGVFGHLT VGMFLGLGLLLGLLNALLVRRSAESITAKEHPLKRSMALNSASRLAIITILGLIIAYI FRPAGLGVVFGLAFFQVLLVATTALPVLKKLRTATEEPVATYSSNGQTGGSEGRSASD D" gene 1460244..1460996 /gene="atpB" /locus_tag="Rv1304" /db_xref="GeneID:886941" CDS 1460244..1460996 /gene="atpB" /locus_tag="Rv1304" /EC_number="3.6.3.14" /function="KEY COMPONENT OF THE PROTON CHANNEL; IT MAY PLAY A DIRECT ROLE IN THE TRANSLOCATION OF PROTONS (H+) ACROSS THE MEMBRANE" /experiment="experimental evidence, no additional details recorded" /note="Produces ATP from ADP in the presence of a proton gradient across the membrane. Subunit A is part of the membrane proton channel F0" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit A" /protein_id="NP_215820.1" /db_xref="GI:15608444" /db_xref="GeneID:886941" /translation="MTETILAAQIEVGEHHTATWLGMTVNTDTVLSTAIAGLIVIALA FYLRAKVTSTDVPGGVQLFFEAITIQMRNQVESAIGMRIAPFVLPLAVTIFVFILISN WLAVLPVQYTDKHGHTTELLKSAAADINYVLALALFVFVCYHTAGIWRRGIVGHPIKL LKGHVTLLAPINLVEEVAKPISLSLRLFGNIFAGGILVALIALFPPYIMWAPNAIWKA FDLFVGAIQAFIFALLTILYFSQAMELEEEHH" misc_feature 1460787..1460816 /gene="atpB" /locus_tag="Rv1304" /note="PS00449 ATP synthase a subunit signature" gene 1461045..1461290 /gene="atpE" /locus_tag="Rv1305" /db_xref="GeneID:886937" CDS 1461045..1461290 /gene="atpE" /locus_tag="Rv1305" /EC_number="3.6.3.14" /function="THIS IS ONE OF THE THREE CHAINS OF THE NONENZYMATIC COMPONENT (CF(0) SUBUNIT) OF THE ATPASE COMPLEX." /experiment="experimental evidence, no additional details recorded" /note="Produces ATP from ADP in the presence of a proton gradient across the membrane. Subunit C is part of the membrane proton channel F0" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit C" /protein_id="NP_215821.1" /db_xref="GI:15608445" /db_xref="GeneID:886937" /translation="MDPTIAAGALIGGGLIMAGGAIGAGIGDGVAGNALISGVARQPE AQGRLFTPFFITVGLVEAAYFINLAFMALFVFATPVK" misc_feature 1461162..1461227 /gene="atpE" /locus_tag="Rv1305" /note="PS00605 ATP synthase c subunit signature" gene 1461321..1461836 /gene="atpF" /locus_tag="Rv1306" /db_xref="GeneID:886939" CDS 1461321..1461836 /gene="atpF" /locus_tag="Rv1306" /EC_number="3.6.3.14" /function="THIS IS ONE OF THE THREE CHAINS OF THE NONENZYMATIC COMPONENT (CF(0) SUBUNIT) OF THE ATPASE COMPLEX." /experiment="experimental evidence, no additional details recorded" /note="Produces ATP from ADP in the presence of a proton gradient across the membrane. Subunit B is part of the membrane proton channel." /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit B" /protein_id="NP_215822.1" /db_xref="GI:15608446" /db_xref="GeneID:886939" /translation="MGEVSAIVLAASQAAEEGGESSNFLIPNGTFFVVLAIFLVVLAV IGTFVVPPILKVLRERDAMVAKTLADNKKSDEQFAAAQADYDEAMTEARVQASSLRDN ARADGRKVIEDARVRAEQQVASTLQTAHEQLKRERDAVELDLRAHVGTMSATLASRIL GVDLTASAATR" gene 1461843..1463183 /gene="atpH" /locus_tag="Rv1307" /db_xref="GeneID:886934" CDS 1461843..1463183 /gene="atpH" /locus_tag="Rv1307" /EC_number="3.6.3.14" /function="THIS PROTEIN SEEMS TO BE PART OF THE STALK THAT LINKS CF(0) TO CF(1). IT EITHER TRANSMITS CONFORMATIONAL CHANGES FROM CF(0) INTO CF(1) OR IS IMPLICATED IN PROTON CONDUCTION [CATALYTIC ACTIVITY : ATP + H(2)O + H(+)(IN) = ADP + PHOSPHATE + H(+)(OUT)]" /experiment="experimental evidence, no additional details recorded" /note="produces ATP from ADP in the presence of a proton gradient across the membrane; the delta subunit is part of the catalytic core of the ATP synthase complex" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit delta" /protein_id="NP_215823.1" /db_xref="GI:15608447" /db_xref="GeneID:886934" /translation="MSTFIGQLFGFAVIVYLVWRFIVPLVGRLMSARQDTVRQQLADA AAAADRLAEASQAHTKALEDAKSEAHRVVEEARTDAERIAEQLEAQADVEAERIKMQG ARQVDLIRAQLTRQLRLELGHESVRQARELVRNHVADQAQQSATVDRFLDQLDAMAPA TADVDYPLLAKMRSASRRALTSLVDWFGTMAQDLDHQGLTTLAGELVSVARLLDREAV VTRYLTVPAEDATPRIRLIERLVSGKVGAPTLEVLRTAVSKRWSANSDLIDAIEHVSR QALLELAERAGQVDEVEDQLFRFSRILDVQPRLAILLGDCAVPAEGRVRLLRKVLERA DSTVNPVVVALLSHTVELLRGQAVEEAVLFLAEVAVARRGEIVAQVGAAAELSDAQRT RLTEVLSRIYGHPVTVQLHIDAALLGGLSIAVGDEVIDGTLSSRLAAAEARLPD" gene 1463228..1464877 /gene="atpA" /locus_tag="Rv1308" /db_xref="GeneID:886936" CDS 1463228..1464877 /gene="atpA" /locus_tag="Rv1308" /EC_number="3.6.3.14" /function="PRODUCES ATP FROM ADP IN THE PRESENCE OF A PROTON GRADIENT ACROSS THE MEMBRANE. THE ALPHA CHAIN IS A REGULATORY SUBUNIT [CATALYTIC ACTIVITY:ATP + H(2)O + H(+)(IN) = ADP + PHOSPHATE + H(+)(OUT)]" /experiment="experimental evidence, no additional details recorded" /note="produces ATP from ADP in the presence of a proton gradient across the membrane; the alpha chain is a catalytic subunit" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit alpha" /protein_id="NP_215824.1" /db_xref="GI:15608448" /db_xref="GeneID:886936" /translation="MAELTIPADDIQSAIEEYVSSFTADTSREEVGTVVDAGDGIAHV EGLPSVMTQELLEFPGGILGVALNLDEHSVGAVILGDFENIEEGQQVKRTGEVLSVPV GDGFLGRVVNPLGQPIDGRGDVDSDTRRALELQAPSVVHRQGVKEPLQTGIKAIDAMT PIGRGQRQLIIGDRKTGKTAVCVDTILNQRQNWESGDPKKQVRCVYVAIGQKGTTIAA VRRTLEEGGAMDYTTIVAAAASESAGFKWLAPYTGSAIAQHWMYEGKHVLIIFDDLTK QAEAYRAISLLLRRPPGREAYPGDVFYLHSRLLERCAKLSDDLGGGSLTGLPIIETKA NDISAYIPTNVISITDGQCFLETDLFNQGVRPAINVGVSVSRVGGAAQIKAMKEVAGS LRLDLSQYRELEAFAAFASDLDAASKAQLERGARLVELLKQPQSQPMPVEEQVVSIFL GTGGHLDSVPVEDVRRFETELLDHMRASEEEILTEIRDSQKLTEEAADKLTEVIKNFK KGFAATGGGSVVPDEHVEALDEDKLAKEAVKVKKPAPKKKK" misc_feature 1463741..1463764 /gene="atpA" /locus_tag="Rv1308" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1464323..1464352 /gene="atpA" /locus_tag="Rv1308" /note="PS00152 ATP synthase alpha and beta subunits signature" gene 1464884..1465801 /gene="atpG" /locus_tag="Rv1309" /db_xref="GeneID:886929" CDS 1464884..1465801 /gene="atpG" /locus_tag="Rv1309" /EC_number="3.6.3.14" /function="PRODUCES ATP FROM ADP IN THE PRESENCE OF A PROTON GRADIENT ACROSS THE MEMBRANE. THE GAMMA CHAIN IS BELIEVED TO BE IMPORTANT IN REGULATING ATPASE ACTIVITY AND THE FLOW OF PROTONS THROUGH THE CF(0) COMPLEX." /experiment="experimental evidence, no additional details recorded" /note="Produces ATP from ADP in the presence of a proton gradient across the membrane. The gamma chain is a regulatory subunit" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit gamma" /protein_id="NP_215825.1" /db_xref="GI:15608449" /db_xref="GeneID:886929" /translation="MAATLRELRGRIRSAGSIKKITKAQELIATSRIARAQARLESAR PYAFEITRMLTTLAAEAALDHPLLVERPEPKRAGVLVVSSDRGLCGAYNANIFRRSEE LFSLLREAGKQPVLYVVGRKAQNYYSFRNWNITESWMGFSEQPTYENAAEIASTLVDA FLLGTDNGEDQRSDSGEGVDELHIVYTEFKSMLSQSAEAHRIAPMVVEYVEEDIGPRT LYSFEPDATMLFESLLPRYLTTRVYAALLESAASELASRQRAMKSATDNADDLIKALT LMANRERQAQITQEISEIVGGANALAEAR" misc_feature 1465742..1465783 /gene="atpG" /locus_tag="Rv1309" /note="PS00153 ATP synthase gamma subunit signature" gene 1465841..1467301 /gene="atpD" /locus_tag="Rv1310" /db_xref="GeneID:886932" CDS 1465841..1467301 /gene="atpD" /locus_tag="Rv1310" /EC_number="3.6.3.14" /function="PRODUCES ATP FROM ADP IN THE PRESENCE OF A PROTON GRADIENT ACROSS THE MEMBRANE. THE BETA CHAIN IS THE CATALYTIC SUBUNIT [CATALYTIC ACTIVITY : ATP + H(2)O + H(+)(IN) = ADP + PHOSPHATE + H(+)(OUT)]" /experiment="experimental evidence, no additional details recorded" /note="Produces ATP from ADP in the presence of a proton gradient across the membrane. The beta chain is a regulatory subunit" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit beta" /protein_id="NP_215826.1" /db_xref="GI:15608450" /db_xref="GeneID:886932" /translation="MTTTAEKTDRPGKPGSSDTSGRVVRVTGPVVDVEFPRGSIPELF NALHAEITFESLAKTLTLEVAQHLGDNLVRTISLQPTDGLVRGVEVIDTGRSISVPVG EGVKGHVFNALGDCLDEPGYGEKFEHWSIHRKPPAFEELEPRTEMLETGLKVVDLLTP YVRGGKIALFGGAGVGKTVLIQEMINRIARNFGGTSVFAGVGERTREGNDLWVELAEA NVLKDTALVFGQMDEPPGTRMRVALSALTMAEWFRDEQGQDVLLFIDNIFRFTQAGSE VSTLLGRMPSAVGYQPTLADEMGELQERITSTRGRSITSMQAVYVPADDYTDPAPATT FAHLDATTELSRAVFSKGIFPAVDPLASSSTILDPSVVGDEHYRVAQEVIRILQRYKD LQDIIAILGIDELSEEDKQLVNRARRIERFLSQNMMAAEQFTGQPGSTVPVKETIEAF DRLCKGDFDHVPEQAFFLIGGLDDLAKKAESLGAKL" misc_feature 1466351..1466374 /gene="atpD" /locus_tag="Rv1310" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1466903..1466932 /gene="atpD" /locus_tag="Rv1310" /note="PS00152 ATP synthase alpha and beta subunits signature" gene 1467315..1467680 /gene="atpC" /locus_tag="Rv1311" /db_xref="GeneID:886967" CDS 1467315..1467680 /gene="atpC" /locus_tag="Rv1311" /EC_number="3.6.3.14" /function="PRODUCES ATP FROM ADP IN THE PRESENCE OF A PROTON GRADIENT ACROSS THE MEMBRANE [CATALYTIC ACTIVITY : ATP + H(2)O + H(+)(IN) = ADP + PHOSPHATE + H(+)(OUT)]" /note="part of catalytic core of ATP synthase; alpha(3)beta(3)gamma(1)delta(1)epsilon(1); involved in producing ATP from ADP in the presence of the proton motive force across the membrane" /codon_start=1 /transl_table=11 /product="F0F1 ATP synthase subunit epsilon" /protein_id="NP_215827.1" /db_xref="GI:15608451" /db_xref="GeneID:886967" /translation="MAELNVEIVAVDRNIWSGTAKFLFTRTTVGEIGILPRHIPLVAQ LVDDAMVRVEREGEKDLRIAVDGGFLSVTEEGVSILAESAEFESEIDEAAAKQDSESD DPRIAARGRARLRAVGAID" gene 1467688..1468131 /locus_tag="Rv1312" /db_xref="GeneID:886930" CDS 1467688..1468131 /locus_tag="Rv1312" /function="UNKNOWN" /note="Rv1312, (MTCY373.32), len: 147 aa. Conserved hypothetical secreted protein with potential N-terminal signal sequence. Highly similar to P53432|Y02W_MYCLE hypothetical Mycobacterium leprae protein (147 aa), FASTA score: opt: 884, E(): 0, (88.4% identity in 147 aa overlap). N-terminus hydrophobic." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215828.1" /db_xref="GI:15608452" /db_xref="GeneID:886930" /translation="MSAPMIGMVVLVVVLGLAVLALSYRLWKLRQGGTAGIMRDIPAV GGHGWRHGVIRYRGGEAAFYRLSSLRLWPDRRLSRRGVEIISRRAPRGDEFDIMTDEI VVVELCDSTQDRRVGYEIALDRGALTAFLSWLESRPSPRARRRSM" repeat_region complement(1468143..1469651) /note="IS1557-2, len: 1509 bp. Insertion sequence IS1557." /mobile_element="insertion sequence:IS1557-2" repeat_region 1468143..1468161 /note="19 bp inverted repeat, GCAGACGCAAAAGCCCCCA, at the left end of IS1557" gene complement(1468171..1469505) /locus_tag="Rv1313c" /db_xref="GeneID:886922" CDS complement(1468171..1469505) /locus_tag="Rv1313c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1557." /note="Rv1313c, (MTCY373.33c), len: 444 aa. Possible IS1557 transposase, similar to several transposases e.g. U57649|DBU57649 ORF1 from dibenzofuran-degrading bacterium DPO360 (163 aa), FASTA scores: opt: 767, E(): 0, (67.3% identity in 168 aa overlap); TNPA_BORPA|Q06126 transposase for insertion sequence element IS1001 from Bordetella parapertussis (406 aa), FASTA scores: opt: 254, E(): 3.3e-10, (24.9% identity in 402 aa overlap). Also similar to putative Mycobacterium tuberculosis transposases, Rv3798 and Rv0741." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215829.1" /db_xref="GI:15608453" /db_xref="GeneID:886922" /translation="MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSA VLRRCGRCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVPWA RHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADTEKRIDRFANL RRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATLGLFFDALGAERAAQITHV SADAADWIADVVTERCPDAIQCADPFHVVAWATEALDVERRRAWNDARAIARTEPKWG RGRPGKNAAPRPGRERARRLKGARYALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLL KESLRHVFSVKGEEGKQALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQ GLIESTNTKIRLLTRIAFGFRSPQALIALAMLTLAGHRPTLPGRHNHPQISQ" repeat_region complement(1469633..1469651) /note="19 bp inverted repeat, GCAGACGCGAAAGCCCCCA, at the right end of IS1557. Single base difference at 3-end." gene complement(1469671..1470252) /locus_tag="Rv1314c" /db_xref="GeneID:886925" CDS complement(1469671..1470252) /locus_tag="Rv1314c" /function="UNKNOWN" /note="Rv1314c, (MTCY373.34c), len: 193 aa. Conserved hypothetical protein, highly similar to P53523|Y02Y_MYCLE hypothetical Mycobacterium leprae protein (191 aa), FASTA score: opt:1019, E(): 0, (81.2% identity in 191 aa overlap). Some similarity with YDHW_CITFR|P45515 hypothetical 19.8 kDa protein in dhar-dhat intergenic region (176 aa), FASTA scores: opt: 297, E(): 1.6e-13, (37.6% identity in 178 aa overlap). Also similar to hypothetical protein AE002007|AE002007_3 Deinococcus radiodurans (185 aa), FASTA score: opt: 386, E(): 7.7e-19, (42.4% identity in 172 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215830.1" /db_xref="GI:15608454" /db_xref="GeneID:886925" /translation="MAVHLTRIYTRTGDDGTTGLSDMSRVAKTDARLVAYADCDEANA AIGAALALGHPDTQITDVLRQIQNDLFDAGADLSTPIVENPKHPPLRIAQSYIDRLEG WCDAYNAGLPALKSFVLPGGSPLSALLHVARTVVRRAERSAWAAVDAHPEGVSVLPAK YLNRLSDLLFILSRVANPDGDVLWRPGGDRTAS" gene 1470321..1471577 /gene="murA" /locus_tag="Rv1315" /db_xref="GeneID:886921" CDS 1470321..1471577 /gene="murA" /locus_tag="Rv1315" /EC_number="2.5.1.7" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS. ADDS ENOLPYRUVYL TO UDP-N-ACETYLGLUCOSAMINE [CATALYTIC ACTIVITY: PHOSPHOENOLPYRUVATE + UDP-N-ACETYL-D- GLUCOSAMINE = PHOSPHATE + UDP-N-ACETYL-3-O-(1-CARBOXYVINYL)-D-GLUCOSAMINE]" /experiment="experimental evidence, no additional details recorded" /note="adds enolpyruvyl to UDP-N-acetylglucosamine as a component of cell wall formation; gram-positive bacteria have 2 copies of MurA which are active" /codon_start=1 /transl_table=11 /product="UDP-N-acetylglucosamine 1-carboxyvinyltransferase" /protein_id="NP_215831.1" /db_xref="GI:15608455" /db_xref="GeneID:886921" /translation="MAERFVVTGGNRLSGEVAVGGAKNSVLKLMAATLLAEGTSTITN CPDILDVPLMAEVLRGLGATVELDGDVARITAPDEPKYDADFAAVRQFRASVCVLGPL VGRCKRARVALPGGDAIGSRPLDMHQAGLRQLGAHCNIEHGCVVARAETLRGAEIQLE FPSVGATENILMAAVVAEGVTTIHNAAREPDVVDLCTMLNQMGAQVEGAGSPTMTITG VPRLHPTEHRVIGDRIVAATWGIAAAMTRGDISVAGVDPAHLQLVLHKLHDAGATVTQ TDASFRVTQYERPKAVNVATLPFPGFPTDLQPMAIALASIADGTSMITENVFEARFRF VEEMIRLGADARTDGHHAVVRGLPQLSSAPVWCSDIRAGAGLVLAGLVADGDTEVHDV FHIDRGYPLFVENLVSLGAEIERVCC" gene 1471846..1473382 /gene="rrs" /locus_tag="Rvnr01" /db_xref="GeneID:2700429" rRNA 1471846..1473382 /gene="rrs" /locus_tag="Rvnr01" /product="ribosomal RNA 16S" /note="rrs (alternate gene name: rrnS), len: 1537 nt. 16s rRNA gene." /db_xref="GeneID:2700429" gene 1473658..1476795 /gene="rrl" /locus_tag="Rvnr02" /db_xref="GeneID:2700466" rRNA 1473658..1476795 /gene="rrl" /locus_tag="Rvnr02" /product="ribosomal RNA 23S" /note="rrl, len: 3138 nt. 23S rRNA gene (approximate coordinates)." /db_xref="GeneID:2700466" gene 1476899..1477013 /gene="rrf" /locus_tag="Rvnr03" /db_xref="GeneID:2700459" rRNA 1476899..1477013 /gene="rrf" /locus_tag="Rvnr03" /product="ribosomal RNA 5S" /note="rrf, len: 115 nt. 5S rRNA gene. Identical to Em_ba:MT5SRR, D10035 M.tuberculosis 5S rRNA." /db_xref="GeneID:2700459" gene complement(1477134..1477631) /gene="ogt" /locus_tag="Rv1316c" /db_xref="GeneID:886913" CDS complement(1477134..1477631) /gene="ogt" /locus_tag="Rv1316c" /EC_number="2.1.1.63" /function="REPAIR OF ALKYLATED GUANINE IN DNA BY STOICHIOMETRICALLY TRANSFERRING THE ALKYL GROUP AT THE O-6 POSITION TO A CYSTEINE RESIDUE IN THE ENZYME. THIS IS A SUICIDE REACTION: THE ENZYME IS IRREVERSIBLY INACTIVATED [CATALYTIC ACTIVITY : DNA (CONTAINING 6-O-METHYLGUANINE) + [PROTEIN]-L-CYSTEINE = DNA (WITHOUT 6-O-METHYLGUANINE) + PROTEIN S-METHYL-L-CYSTEINE.]" /note="Rv1316c, (MTCY130.01c), len: 165 aa. Probable ogt, methylated-dna--protein-cysteine methytransferase (EC 2.1.1.63) (see citation below), similar to many e.g. OGT_HAEIN|P44687 Haemophilus influenzae (190 aa), FASTA scores: opt: 405, E(): 6.5e-20, (41.9% identity in 155 aa overlap). Contains PS00374 Methylated-DNA--protein-cysteine methyltransferase active site." /codon_start=1 /transl_table=11 /product="methylated-DNA--protein-cysteine methyltransferase" /protein_id="NP_215832.1" /db_xref="GI:15608456" /db_xref="GeneID:886913" /translation="MIHYRTIDSPIGPLTLAGHGSVLTNLRMLEQTYEPSRTHWTPDP GAFSGAVDQLNAYFAGELTEFDVELDLRGTDFQQRVWKALLTIPYGETRSYGEIADQI GAPGAARAVGLANGHNPIAIIVPCHRVIGASGKLTGYGGGINRKRALLELEKSRAPAD LTLFD" misc_feature complement(1477242..1477262) /gene="ogt" /locus_tag="Rv1316c" /note="PS00374 Methylated-DNA--protein-cysteine methyltransferase active site" gene complement(1477628..1479118) /gene="alkA" /locus_tag="Rv1317c" /db_xref="GeneID:886916" CDS complement(1477628..1479118) /gene="alkA" /locus_tag="Rv1317c" /EC_number="2.1.1.63" /function="INVOLVED IN DAMAGE REVERSAL AND IN BASE EXCISION REPAIR. THE METHYLATED ADA PROTEIN ACTS AS A POSITIVE REGULATOR OF ITS OWN SYNTHESIS, AS WELL AS THAT OF OTHER PROTEINS. THE TRANSCRIPTION-ACTIVATING FUNCTION OF THE ADA PROTEIN RESIDES IN ITS N-TERMINUS. REPAIR OF ALKYLATED GUANINE IN DNA BY STOICHIOMETRICALLY TRANSFERRING THE ALKYL GROUP AT THE O-6 POSITION TO A CYSTEINE RESIDUE IN THE ENZYME. THIS IS A SUICIDE REACTION: THE ENZYME IS IRREVERSIBLY INACTIVATED. CAN ALSO REPAIR O-4-METHYLTHYMINE [CATALYTIC ACTIVITY: DNA (CONTAINING 6-O-METHYLGUANINE) + [PROTEIN]-L-CYSTEINE = DNA (WITHOUT 6-O-METHYLGUANINE) + PROTEIN S-METHYL-L-CYSTEINE]" /note="Rv1317c, (MTCY130.02c), len: 496 aa. Probable alkA (alternate gene name: ada), regulatory protein (EC 2.1.1.63) (see citation below), similar to 3MG2_ECOLI|P04395 dna-3-methyladenine glycosidase II from Escherichia coli (282 aa), FASTA scores, opt: 437, E(): 8.6e-22, (32.8% identity in 293 aa overlap), also similar to other ada proteins e.g. ADA_SALTY|P26189 Salmonella typhimurium (352 aa), FASTA scores: E(): 5.3e-08, (35.9% identity in 156 aa overlap). Contains PS00041 Bacterial regulatory proteins, araC family signature.; ada" /codon_start=1 /transl_table=11 /product="bifunctional methylated-DNA--protein-cysteine methyltransferase/O-6-methylguanine-DNA transcription regulator" /protein_id="NP_215833.1" /db_xref="GI:15608457" /db_xref="GeneID:886916" /translation="MHDDFERCYRAIQSKDARFDGWFVVAVLTTGVYCRPSCPVRPPF ARNVRFLPTAAAAQGEGFRACKRCRPDASPGSPEWNVRSDVVARAMRLIADGTVDRDG VSGLAAQLGYTIRQLERLLQAVVGAGPLALARAQRMQTARVLIETTNLPFGDVAFAAG FSSIRQFNDTVRLACDGTPTALRARAAARFESATASAGTVSLRLPVRAPFAFEGVFGH LAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLLVLDDFRDLMTATARC RRLLDLDADPEAIVEALGADPDLRAVVGKAPGQRIPRTVDEAEFAVRAVLAQQVSTKA ASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHLAVPKARQRTINALVASL ADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDAFPASDLGLRLAAKK LGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQWPPQEKIA" misc_feature complement(1478582..1478710) /gene="alkA" /locus_tag="Rv1317c" /note="PS00041 Bacterial regulatory proteins, araC family signature" gene complement(1479199..1480824) /locus_tag="Rv1318c" /db_xref="GeneID:886910" CDS complement(1479199..1480824) /locus_tag="Rv1318c" /EC_number="4.6.1.1" /function="THOUGHT TO PLAY AN ESSENTIAL ROLES IN REGULATION OF CELLULAR METABOLISM BY CATALYSING THE SYNTHESIS OF A SECOND MESSENGER, CAMP [CATALYTIC ACTIVITY: ATP = 3',5'-CYCLIC AMP + PYROPHOSPHATE]." /note="Rv1318c, (MTCY130.03c), len: 541 aa. Possible adenylate cyclase (EC 4.6.1.1). Some similarity at the c-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores, opt: 270, E(): 2.5e-11, (28.8% identity in 184 aa overlap); similar to other mycbacterium tuberculosis putative adenylate cyclases e.g. Rv1319c|MTCY130.04c (535 aa), FASTA scores: opt: 2505, E(): 0, (71.0% identity in 534 aa overlap), also similar to Rv1320c|MTCY130.05c (567 aa), FASTA scores, opt: 2423, E(): 0, (68.7% identity in 534 aa overlap). N-terminus is hydrophobic. BELONGS TO ADENYLYL CYCLASE CLASS-3 FAMILY." /codon_start=1 /transl_table=11 /product="adenylate cyclase" /protein_id="NP_215834.1" /db_xref="GI:15608458" /db_xref="GeneID:886910" /translation="MSAKKSTAQRLGRVLETVTRQSGRLPETPAYGSWLLGRVSESQR RRRVRIQVMLTALVVTANLLGIGVALLLVTIAIPEPSIVRDTPRWLTFGVVPGYVLLA LALGSYALTRQTVQALRWAIEGRKPTREEERRTFLAPWRVAVGHLMFWGVGTALLTTL YGLINNAFIPRFLFAVSFCGVLVATATYLHTEFALRPFAAQALEAGPPPRRLAPGILG RTMVVWLLGSGVPVVGIALMAMFEMVLLNLTRMQFATGVLIISMVTLVFGFILMWILA WLTATPVRVVRAALRRVERGELRTNLVVFDGTELGELQRGFNAMVAGLRERERVRDLF GRHVGREVAAAAERERSKLGGEERHVAVVFIDIVGSTQLVTSRPPADVVKLLNKFFAI VVDEVDRHHGLVNKFEGDASLTIFGAPNRLPCPEDKALAAARAIADRLVNEMPECQAG IGVAAGQVIAGNVGARERFEYTVIGEPVNEAARLCELAKSRPGKLLASAQAVDAASEE ERARWSLGRHVKLRGHDQPVRLAKPVGLTKPRR" gene complement(1480894..1482501) /locus_tag="Rv1319c" /db_xref="GeneID:886911" CDS complement(1480894..1482501) /locus_tag="Rv1319c" /EC_number="4.6.1.1" /function="THOUGHT TO PLAY AN ESSENTIAL ROLES IN REGULATION OF CELLULAR METABOLISM BY CATALYSING THE SYNTHESIS OF A SECOND MESSENGER, CAMP [CATALYTIC ACTIVITY: ATP = 3',5'-CYCLIC AMP + PYROPHOSPHATE]." /note="Rv1319c, (MTCY130.04c), len: 535 aa. Possible adenylate cyclase (EC 4.6.1.1). Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 254, E(): 2.4e-10, (33.3% identity in 144 aa overlap); similar to other mycbacterium tuberculosis putative adenylate cyclases e.g. Rv1318c|MTCY130.03c (541 aa), FASTA scores: opt: 2505, E(): 0, (71.0% identity in 534 aa overlap); Rv1320c|MTCY130.05c (567 aa), FASTA scores: opt: 2354, E(): 0, (66.3% identity in 534 aa overlap). N-terminus is hydrophobic. BELONGS TO ADENYLYL CYCLASE CLASS-3 FAMILY." /codon_start=1 /transl_table=11 /product="adenylate cyclase" /protein_id="NP_215835.1" /db_xref="GI:15608459" /db_xref="GeneID:886911" /translation="MPAKKTMAQRLGQALETMTRQCGQLPETPAYGSWLLGRVSESPS RRWVRIKRIVTVYIMTANLTGIVVALLVVTFAFPVPSIYTDAPWWVTFGVAPAYATLA LAIGTYWITTRIVRASIRWAIEERAPSQADGRNTLLLPFRVAAVHLILWDIGGALLAT LYGLANRVFVTIILFSVTICGVLVATNCYLFTEFALRPVAAKALEAGRPPRRFAPGIM GRTMTVWSLGSGVPVTGIATTALYVLLVHNLTETQLASAVLILSITTLIFGFLVMWIL AWLTAAPVRVVRAALKRVEQGDLRGDLVVFDGTELGELQRGFNAMVNGLRERERVRDL FGRHVGREVAAAAERERPQLGGEDRHAAVVFVDIVGSTQLVDNQPAAHVVKLLNRFFA IVVNEVDRHHGLINKFAGDAALAIFGAPNRLDRPEDAALAAARAIADRLANEMPEVQA GIGVAAGQIVAGNVGAKQRFEYTVVGKPVNQAARLCELAKSHPARLLASSDTLHAASE TERAHWSLGETVTLRGHEQPTRLAVPT" gene complement(1482514..1484217) /locus_tag="Rv1320c" /db_xref="GeneID:886906" CDS complement(1482514..1484217) /locus_tag="Rv1320c" /EC_number="4.6.1.1" /function="THOUGHT TO PLAY AN ESSENTIAL ROLES IN REGULATION OF CELLULAR METABOLISM BY CATALYSING THE SYNTHESIS OF A SECOND MESSENGER, cAMP. MAY BE INVOLVED IN VIRULENCE [CATALYTIC ACTIVITY: ATP = 3',5'-CYCLIC AMP + PYROPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv1320c, (MTCY130.05c), len: 567 aa. Possible adenylate cyclase (EC 4.6.1.1) (see Rindi et al., 1999). Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 277, E(): 2e-12, (34.0% identity in 156 aa overlap); similar to other mycbacterium tuberculosis putative adenylate cyclases e.g. Rv1318c|MTCY130.03c (541 aa), FASTA scores: opt: 2423, E(): 0, (68.7% identity in 534 aa overlap); Rv1319c|MTCY130.04c (535 aa), FASTA scores: opt: 2354, E(): 0, (66.3% identity in 534 aa overlap). N-terminus is hydrophobic. BELONGS TO ADENYLYL CYCLASE CLASS-3 FAMILY." /codon_start=1 /transl_table=11 /product="adenylate cyclase" /protein_id="NP_215836.1" /db_xref="GI:15608460" /db_xref="GeneID:886906" /translation="MPSEKATTRHLPGAVETLSPRTGRRPETPAYGSWLLGRVSESPR MRRVRIQGMLTVAILVTNVIGLIVGAMLLTVAFPKPSVILDAPHWVSFGIVPGYCVLA FILGTYWLTRQTARALRWAIEERTPSHDEARSAFLVPLRVALAVLFLWGAAAALWTII YGLANRLFIPRFLFSMGVIGVVAATSCYLLTEFALRPMAAQALEVGATPRSLVRGIVG RTMLVWLLCSGVPNVGVALTAIFDDTFWELSNDQFMITVLILWAPLLIFGFILMWILA WLTATPVRVVREALNRVEQGDLSGDLVVFDGTELGELQRGFNRMVEGLRERERVRDLF GRHVGREVAAAAERERPKLGGEERHVAVVFVDIVGSTQLVTSRPAAEVVMLLNRFFTV IVDEVNHHRGLVNKFQGDASLAVFGAPNRLSHPEDAALATARAIADRLASEMPECQAG IGVAAGQVVAGNVGAHERFEYTVIGEPVNEAARLCELAKSYPSRLLASSQTLRGASEN ECARWSLGETVTLRGHDQPIRLTSPVQQLQMPAQSADIVGGALGDHQTHTIYRGAHPT D" gene 1484279..1484959 /locus_tag="Rv1321" /db_xref="GeneID:886908" CDS 1484279..1484959 /locus_tag="Rv1321" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1321, (MTCY130.06), len: 226 aa. Conserved hypothetical protein. Equivalent to P53524|YD21_MYCLE hypothetical protein from Mycobacterium leprae (201 aa), FASTA scores: opt: 1144, E(): 0, (87.6% identity in 193 aa overlap). Some similarity to hypothetical proteins from other organisms e.g. Y225_METJA|Q57678 Methanococcus jannaschii (263 aa), FASTA scores: E(): 6.5e-05, (25.0% identity in 212 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215837.1" /db_xref="GI:15608461" /db_xref="GeneID:886908" /translation="MSRVRLVIAQCTVDYIGRLTAHLPSARRLLLFKADGSVSVHADD RAYKPLNWMSPPCWLTEESGGQAPVWVVENKAGEQLRITIEGIEHDSSHELGVDPGLV KDGVEAHLQALLAEHIQLLGEGYTLVRREYMTAIGPVDLLCSDERGGSVAVEIKRRGE IDGVEQLTRYLELLNRDSVLAPVKGVFAAQQIKPQARILATDRGIRCLTLDYDTMRGM DSGEYRLF" gene 1484982..1485278 /locus_tag="Rv1322" /db_xref="GeneID:886927" CDS 1484982..1485278 /locus_tag="Rv1322" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1322, (MTCY130.07), len: 98 aa. Conserved hypothetical protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215838.1" /db_xref="GI:15608462" /db_xref="GeneID:886927" /translation="MARRRKPLHRQRPEPPSWALRRVEAGPDGHEYEVRPVAAARAVK TYRCPGCDHEIRSGTAHVVVWPTDLPQAGVDDRRHWHTPCWANRATRGPTRKWT" gene complement(1485313..1485771) /locus_tag="Rv1322A" /db_xref="GeneID:3205097" CDS complement(1485313..1485771) /locus_tag="Rv1322A" /function="UNKNOWN" /note="Rv1322A, len: 152 aa. Conserved hypothetical protein, similar to proteins from Mycobacterium leprae and Streptomyces coelicolor e.g. AL583921_2|ML1157 from M. leprae strain TN (155 aa), FASTA scores: opt: 771, E(): 5.1e-43, (75.3% identity in 154 aa overlap); and AL137242_2 from Streptomyces coelicolor (146 aa), FASTA scores: opt: 404, E(): 2e-19, (43.165% identity in 139 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177643.1" /db_xref="GI:57116847" /db_xref="GeneID:3205097" /translation="MMTTDQVHARHMLATSLVTGLDHVGIAVADLDVAIEWYHDHLGM ILVHEEINDDQGIREALLAVPGSAAQIQLMAPLDESSVIAKFLDKRGPGIQQLACRVS DLDAMCRRLRSQGVRLVYETARRGTANSRINFIHPKDAGGVLIELVEPAP" gene 1485862..1487031 /gene="fadA4" /locus_tag="Rv1323" /db_xref="GeneID:886904" CDS 1485862..1487031 /gene="fadA4" /locus_tag="Rv1323" /EC_number="2.3.1.9" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION [CATALYTIC ACTIVITY: 2 ACETYL-CoA = CoA + ACETOACETYL-COA]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation; in Rhizobia and Ralstonia is involved in PHB biosynthesis" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_215839.1" /db_xref="GI:15608463" /db_xref="GeneID:886904" /translation="MIVAGARTPIGKLMGSLKDFSASELGAIAIKGALEKANVPASLV EYVIMGQVLTAGAGQMPARQAAVAAGIGWDVPALTINKMCLSGIDAIALADQLIRARE FDVVVAGGQESMTKAPHLLMNSRSGYKYGDVTVLDHMAYDGLHDVFTDQPMGALTEQR NDVDMFTRSEQDEYAAASHQKAAAAWKDGVFADEVIPVNIPQRTGDPLQFTEDEGIRA NTTAAALAGLKPAFRGDGTITAGSASQISDGAAAVVVMNQEKAQELGLTWLAEIGAHG VVAGPDSTLQSQPANAINKALDREGISVDQLDVVEINEAFAAVALASIRELGLNPQIV NVNGGAIAVGHPLGMSGTRITLHAALQLARRGSGVGVAALCGAGGQGDALILRAG" misc_feature 1486099..1486155 /gene="fadA4" /locus_tag="Rv1323" /note="PS00098 Thiolases acyl-enzyme intermediate signature" misc_feature 1486864..1486914 /gene="fadA4" /locus_tag="Rv1323" /note="PS00737 Thiolases signature 2" misc_feature 1486969..1487010 /gene="fadA4" /locus_tag="Rv1323" /note="PS00099 Thiolases active site" gene 1487161..1488075 /locus_tag="Rv1324" /db_xref="GeneID:886897" CDS 1487161..1488075 /locus_tag="Rv1324" /function="THIOREDOXIN PARTICIPATES IN VARIOUS REDOX REACTIONS THROUGH THE REVERSIBLE OXIDATION OF ITS ACTIVE CENTER DITHIOL, TO A DISULFIDE, & CATALYZES DITHIOL-DISULFIDE EXCHANGE REACTIONS" /experiment="experimental evidence, no additional details recorded" /note="Rv1324, (MTCY130.09), len: 304 aa. Possible thioredoxin (EC 1.-.-.-), similar to several e.g. U00014|Q49716 TRXA from Mycobacterium leprae (255 aa), FASTA scores: opt: 1014, E(): 0, (69.7% identity in 228 aa overlap); THIO_RHOSH|P08058 TrxA from Rhodobacter sphaeroides (105 aa), FASTA scores: opt 196, E(): 1.9e-06, (33.0% identity in 103 aa overlap). Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2." /codon_start=1 /transl_table=11 /product="thioredoxin" /protein_id="NP_215840.1" /db_xref="GI:15608464" /db_xref="GeneID:886897" /translation="MTRPRPPLGPAMAGAVDLSGIKQRAQQNAAASTDADRALSTPSG VTEITEANFEDEVIVRSDEVPVVVLLWSPRSEVCVDLLDTLSGLAAAAKGKWSLASVN VDVAPRVAQIFGVQAVPTVVALAAGQPISSFQGLQPADQLSRWVDSLLSATAGKLKGA ASSEESTEVDPAVAQARQQLEDGDFVAARKSYQAILDANPGSVEAKAAIRQIEFLIRA TAQRPDAVSVADSLSDDIDAAFAAADVQVLNQDVSAAFERLIALVRRTSGEERTRVRT RLIELFELFDPADPEVVAGRRNLANALY" misc_feature 1487917..1487946 /locus_tag="Rv1324" /note="PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2" gene complement(1488154..1489965) /gene="PE_PGRS24" /locus_tag="Rv1325c" /db_xref="GeneID:886899" CDS complement(1488154..1489965) /gene="PE_PGRS24" /locus_tag="Rv1325c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1325c, (MTCY130.10c), len: 603 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of ala-, gly-rich proteins (see Brennan & Delogu 2002), similar to many e.g. YQ04_MYCTU|P71933 hypothetical 63.1 kDa glycine-rich protein (778 aa), FASTA scores: E(): 0, (52.3% identity in 724 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177799.1" /db_xref="GI:57116848" /db_xref="GeneID:886899" /translation="MSFVIAAPETLVRAASDLANIGSTLGAANAAALGPTTELLAAGA DEVSAAIASLFAAHGQAYQAVSAQMSAFHAQFVQTFTAGAGAYASAEAAAAAPLEGLL NIVNTPTQLLLGRPLIGNGANGAPGTGQAGGAGGLLYGNGGAGGSGAPGQAGGPGGAA GLFGNGGAGGAGGDGPGNGAAGGAGGAGGLLFGSGGAGGPGGVGNTGTGGLGGDGGAA GLFGAGGIGGAGGPGFNGGAGGAGGRSGLFEVLAAGGAGGTGGLSVNGGTGGTGGTGG GGGLFSNGGAGGAGGFGVSGSAGGNGGTGGDGGIFTGNGGTGGTGGTGTGNQLVGGEG GAGGAGGNAGILFGAGGIGGTGGTGLGAPDPGGTGGKGGVGGIGGAGALFGPGGAGGT GGFGASSADQMAGGIGGSGGSGGAAKLIGDGGAGGTGGDSVRGAAGSGGTGGTGGLIG DGGAGGAGGTGIEFGSVGGAGGAGGNAAGLSGAGGAGGAGGFGETAGDGGAGGNAGLL NGDGGAGGAGGLGIAGDGGNGGKGGKAGMVGNGGDGGAGGASVVANGGVGGSGGNATL IGNGGNGGNGGVGSAPGKGGAGGTAGLLGLNGSPGLS" gene complement(1490117..1492312) /gene="glgB" /locus_tag="Rv1326c" /db_xref="GeneID:886893" CDS complement(1490117..1492312) /gene="glgB" /locus_tag="Rv1326c" /EC_number="2.4.1.18" /function="INVOLVED IN GLYCOGEN BIOSYNTHESIS (CYTOPLASMIC POLYSACCHARIDES) (THIRD STEP) [CATALYTIC ACTIVITY : FORMATION OF 1,6-GLUCOSIDIC LINKAGES OF GLYCOGEN]." /note="catalyzes the transfer of a segment of a 1,4-alpha-D-glucan chain to a primary hydroxy group in a similar glucan chain" /codon_start=1 /transl_table=11 /product="glycogen branching enzyme" /protein_id="NP_215842.1" /db_xref="GI:15608466" /db_xref="GeneID:886893" /translation="MSRSEKLTGEHLAPEPAEMARLVAGTHHNPHGILGAHEYDDHTV IRAFRPHAVEVVALVGKDRFSLQHLDSGLFAVALPFVDLIDYRLQVTYEGCEPHTVAD AYRFLPTLGEVDLHLFAEGRHERLWEVLGAHPRSFTTADGVVSGVSFAVWAPNAKGVS LIGEFNGWNGHEAPMRVLGPSGVWELFWPDFPCDGLYKFRVHGADGVVTDRADPFAFG TEVPPQTASRVTSSDYTWGDDDWMAGRALRNPVNEAMSTYEVHLGSWRPGLSYRQLAR ELTDYIVDQGFTHVELLPVAEHPFAGSWGYQVTSYYAPTSRFGTPDDFRALVDALHQA GIGVIVDWVPAHFPKDAWALGRFDGTPLYEHSDPKRGEQLDWGTYVFDFGRPEVRNFL VANALYWLQEFHIDGLRVDAVASMLYLDYSRPEGGWTPNVHGGRENLEAVQFLQEMNA TAHKVAPGIVTIAEESTPWSGVTRPTNIGGLGFSMKWNMGWMHDTLDYVSRDPVYRSY HHHEMTFSMLYAFSENYVLPLSHDEVVHGKGTLWGRMPGNNHVKAAGLRSLLAYQWAH PGKQLLFMGQEFGQRAEWSEQRGLDWFQLDENGFSNGIQRLVRDINDIYRCHPALWSL DTTPEGYSWIDANDSANNVLSFMRYGSDGSVLACVFNFAGAEHRDYRLGLPRAGRWRE VLNTDATIYHGSGIGNLGGVDATDDPWHGRPASAVLVLPPTSALWLTPA" gene complement(1492320..1494425) /gene="glgE" /locus_tag="Rv1327c" /db_xref="GeneID:886895" CDS complement(1492320..1494425) /gene="glgE" /locus_tag="Rv1327c" /function="UNKNOWN; PROBABLY INVOLVED IN POLYSACCHARIDES DEGRADATION." /note="Rv1327c, (MTCY130.12c), len: 701 aa. Probable glgE, glucanase, similar to AF172946|AF172946_2 putative glucanase GlgE from Mycobacterium smegmatis (697 aa), FASTA scores: opt: 3816, E(): 0, (78.5% identity in 692 aa overlap). Similar to putative alpha-amylases e.g. Q9L1K2 Streptomyces coelicolor (675 aa), FASTA scores: opt: 2243, E(): 7.4e-132, (54.2% identity in 684 aa overlap). Start changed since original submission (-36) based on similarity to GlgE of Mycobacterium smegmatis; previous start at position 1494531." /codon_start=1 /transl_table=11 /product="glucanase GLGE" /protein_id="NP_215843.2" /db_xref="GI:57116849" /db_xref="GeneID:886895" /translation="MSGRAIGTETEWWVPGRVEIDDVAPVVSCGVYPAKAVVGEVVPV SAAVWREGHEAVAATLVVRYLGVRYPHLTDRPRARVLPTPSEPQQRVKPLLIPMTSGQ EPFVFHGQFTPDRVGLWTFRVDGWGDPIHTWRHGLIAKLDAGQGETELSNDLLVGAVL LERAATGVPRGLRDPLLAAAAALRTPGDPVTRTALALTPEIEELLADYPLRDLVTRGE QFGVWVDRPLARFGAWYEMFPRSTGGWDDDGNPVHGTFATAAAELPRIAGMGFDVVYL PPIHPIGKVHRKGRNNSPTAAPTDVGSPWAIGSDEGGHDTVHPSLGTIDDFDDFVSAA RDLGMEVALDLALQCAPDHPWAREHRQWFTELPDGTIAYAENPPKKYQDIYPLNFDND PEGLYDEVLRVVQHWVNHGVKFFRVDNPHTKPPNFWAWLIAQVKTVDPDVLFLSEAFT PPARQYGLAKLGFTQSYSYFTWRTTKWELTEFGNQIAELADYRRPNLFVNTPDILHAV LQHNGPGMFAIRAVLAATMSPAWGMYCGYELFEHRAVREGSEEYLDSEKYELRPRDFA SALDQGRSLQPFITRLNIIRRLHPAFQQLRTIHFHHVDNDALLAYSKFDPATGDCVLV VVTLNAFGPEEATLWLDMAALGMEDYDRFWVRDEITGEEYQWGQANYIRIDPARAVAH IINMPAVPYESRNTLLRRR" gene 1494564..1497155 /gene="glgP" /locus_tag="Rv1328" /db_xref="GeneID:886886" CDS 1494564..1497155 /gene="glgP" /locus_tag="Rv1328" /EC_number="2.4.1.1" /function="PHOSPHORYLASE IS AN IMPORTANT ALLOSTERIC ENZYME IN CARBOHYDRATE METABOLISM. ENZYMES FROM DIFFERENT SOURCES DIFFER IN THEIR REGULATORY MECHANISMS AND IN THEIR NATURAL SUBSTRATES. HOWEVER, ALL KNOWN PHOSPHORYLASES SHARE CATALYTIC AND STRUCTURAL PROPERTIES [CATALYTIC ACTIVITY : {(1,4)-ALPHA-D-GLUCOSYL}(N) + PHOSPHATE = {(1,4)-ALPHA-D-GLUCOSYL}(N-1) + ALPHA-D-GLUCOSE 1-PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv1328, (MTCY130.13), len: 863 aa. Probable glgP, glycogen phosphorylase (EC 2.4.1.1), similar to many e.g. PHSG_HAEIN|P45180 glycogen phosphorylase from Haemophilus influenzae (821 aa), FASTA scores: E(): 6.9e-08, (25.6% identity in 675 aa overlap). BELONGS TO THE GLYCOGEN PHOSPHORYLASE FAMILY." /codon_start=1 /transl_table=11 /product="glycogen phosphorylase GlgP" /protein_id="NP_215844.1" /db_xref="GI:15608468" /db_xref="GeneID:886886" /translation="MKALRRFTVRAHLPERLAALDQLSTNLRWSWDKPTQDLFAAIDP ALWEQCGHDPVALLGAVNPARLDELALDAEFLGALDELAADLNDYLSRPLWYQEQQDA GVAAQALPTGIAYFSLEFGVAEVLPNYSGGLGILAGDHLKSASDLGVPLIAVGLYYRS GYFRQSLTADGWQHETYPSLDPQGLPLRLLTDANGDPVLVEVALGDNAVLRARIWVAQ VGRVPLLLLDSDIPENEHDLRNVTDRLYGGDQEHRIKQEILAGIGGVRAIRAYTAVEK LTPPEVFHMNEGHAGFLGIERIRELVTDAGLDFDTALTVVRSSTVFTTHTPVPAGIDR FPLEMVQRYVNDQRGDGRSRLLPGLPADRIVALGAEDDPAKFNMAHMGLRLAQRANGV SLLHGRVSRAMFNELWAGFDPDEVPIGSVTNGVHAPTWAAPQWLQLGRELAGSDSLRE PVVWQRLHQVDPAHLWWIRSQLRSMLVEDVRARLRQSWLERGATDAELGWIATAFDPN VLTVGFARRVPTYKRLTLMLRDPDRLEQLLLDEQRPIQLIVAGKSHPADDGGKALIQQ VVRFADRPQVRHRIAFLPNYDMSMARLLYWGCDVWLNNPLRPLEACGTSGMKSALNGG LNLSIRDGWWDEWYDGENGWEIPSADGVADENRRDDLEAGALYDLLAQAVAPKFYERD ERGVPQRWVEMVRHTLQTLGPKVLASRMVRDYVEHYYAPAAQSFRRTAGAQFDAAREL ADYRRRAEEAWPKIEIADVDSTGLPDTPLLGSQLTLTATVRLAGLRPNDVTVQGVLGR VDAGDVLMDPVTVEMAHTGTGDGGYEIFSTTTPLPLAGPVGYTVRVLPRHPMLAASNE LGLVTLA" gene complement(1497195..1499189) /gene="dinG" /locus_tag="Rv1329c" /db_xref="GeneID:886889" CDS complement(1497195..1499189) /gene="dinG" /locus_tag="Rv1329c" /function="PROBABLE HELICASE INVOLVED IN DNA REPAIR AND PERHAPS ALSO REPLICATION." /note="Rv1329c, (MTCY130.14c), len: 664 aa. Probable dinG, ATP-dependent helicase (see citation below), similar to several e.g. DING_HAEIN|P44680 probable ATP-dependent helicase ding from Haemophilus influenzae (640 aa), FASTA scores: opt: 685, E(): 2.3e-38, (32.8% identity in 644 aa overlap). Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="ATP-dependent helicase DING" /protein_id="NP_215845.1" /db_xref="GI:15608469" /db_xref="GeneID:886889" /translation="MSESVSMSVPELLAIAVAALGGTRRRGQQEMAAAVAHAFETGEH LVVQAGTGTGKSLAYLVPAIIRALCDDAPVVVSTATIALQRQLVDRDLPQLVDSLTNA LPRRPKFALLKGRRNYLCLNKIHNSVTASDHDDERPQEELFDPVAVTALGRDVQRLTA WASTTVSGDRDDLKPGVGDRSWSQVSVSARECLGVARCPFGSECFSERARGAAGLADV VVTNHALLAIDAVAESAVLPEHRLLVVDEAHELADRVTSVAAAELTSATLGMAARRIT RLVDPKVTQRLQAASATFSSAIHDARPGRIDCLDDEMATYLSALRDAASAARSAIDTG SDTTTASVRAEAGAVLTEISDTASRILASFAPAIPDRSDVVWLEHEDNHESARAVLRV APLSVAELLATQVFARATTVLTSATLTIGGSFDAMATAWGLTADTPWRGLDVGSPFQH AKSGILYVAAHLPPPGRDGSGSAEQLTEIAELITAAGGRTLGLFSSMRAARAATEAMR ERLSTPVLCQGDDSTSTLVEKFTADAATSLFGTLSLWQGVDVPGPSLSLVLIDRIPFP RPDDPLLSARQRAVAARGGNGFMTVAASHAALLLAQGSGRLLRRVTDRGVVAVLDSRM ATARYGEFLRASLPPFWQTTNATQVRAALRRLARADAKAH" misc_feature complement(1499022..1499045) /gene="dinG" /locus_tag="Rv1329c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1499213..1500559) /locus_tag="Rv1330c" /db_xref="GeneID:886884" CDS complement(1499213..1500559) /locus_tag="Rv1330c" /EC_number="2.4.2.11" /function="UNKNOWN" /note="catalyzes the formation of 5-phospho-alpha-D-ribose 1-diphosphate and nicotinate from nicotinate D-ribonucleotide and diphosphate" /codon_start=1 /transl_table=11 /product="nicotinate phosphoribosyltransferase" /protein_id="NP_215846.2" /db_xref="GI:57116850" /db_xref="GeneID:886884" /translation="MGPPPAARRREGEPDNQDPAGLLTDKYELTMLAAALRDGSANRP TTFEVFARRLPTGRRYGVVAGTGRLLEALPQFRFDADACELLAQFLDPATVRYLREFR FRGDIDGYAEGELYFPGSPVLSVRGSFAECVLLETLVLSIFNHDTAIASAAARMVSAA GGRPLIEMGSRRTHERAAVAAARAAYIAGFAASSNLAAQRRYGVPAHGTAAHAFTMLH AQHGGPTELAERAAFRAQVEALGPGTTLLVDTYDVTTGVANAVAAAGAELGAIRIDSG ELGVLARQAREQLDRLGATRTRIVVSGDLDEFSIAALRGEPVDSYGVGTSLVTGSGAP TANMVYKLVEVDGVPVQKRSSYKESPGGRKEALRRSRATGTITEELVHPAGRPPVIVE PHRVLTLPLVRAGQPVADTSLAAARQLVASGLRSLPGDGLKLAPGEPAIPTRTIPA" gene 1500661..1500966 /gene="clpS" /locus_tag="Rv1331" /db_xref="GeneID:886891" CDS 1500661..1500966 /gene="clpS" /locus_tag="Rv1331" /function="UNKNOWN" /note="involved in the modulation of the specificity of the ClpAP-mediated ATP-dependent protein degradation; binds to the N-terminal domain of the chaperone ClpA" /codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease adaptor protein ClpS" /protein_id="NP_215847.1" /db_xref="GI:15608471" /db_xref="GeneID:886891" /translation="MAVVSAPAKPGTTWQRESAPVDVTDRAWVTIVWDDPVNLMSYVT YVFQKLFGYSEPHATKLMLQVHNEGKAVVSAGSRESMEVDVSKLHAAGLWATMQQDR" gene 1500926..1501582 /locus_tag="Rv1332" /db_xref="GeneID:886901" CDS 1500926..1501582 /locus_tag="Rv1332" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1332, (MTCY130.17), len: 218 aa. Possible regulatory protein, high similarity to ML014|U00014 M. leprae B1549_C3_236 (222 aa), FASTA scores: opt: 1158, E(): 0, (75.6% identity in 221 aa overlap). Helix turn helix motif fram aa 8-29 (+3.03 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215848.1" /db_xref="GI:15608472" /db_xref="GeneID:886901" /translation="MPPVCGRRCSRTGEIRGYSGSIVRRWKRVETRDGPRFRSSLAPH EAALLKNLAGAMIGLLDDRDSSSPSDELEEITGIKTGHAQRPGDPTLRRLLPDFYRPD DLDDDDPTAVDGSESFNAALRSLHEPEIIDAKRVAAQQLLDTVPDNGGRLELTESDAN AWIAAVNDLRLALGVMLEIGPRGPERLPGNHPLAAHFNVYQWLTVLQEYLVLVLMGSR" gene 1501599..1502633 /locus_tag="Rv1333" /db_xref="GeneID:886882" CDS 1501599..1502633 /locus_tag="Rv1333" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1333, (MTCY130.18), len: 344 aa. Possible hydrolase (EC 3.-.-.-), similar to Q57326|D26094 endo-type 6-aminohexanoate oligomer hydrolase (355 aa), fasta scores: E(): 1.4e-10, (31.9% identity in 339 aa overlap). Equivalent to P53425|YD33_MYCLE HYPOTHETICAL 36.1 KD PROTEIN B154 Mycobacterium leprae (362 aa), FASTA scores: opt: 1735, E(): 0, (76.7% identity in 352 aa overlap)." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_215849.1" /db_xref="GI:15608473" /db_xref="GeneID:886882" /translation="MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGA VDCRGGAPGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVAMD SGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVGVGARAGALKG GVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADLVGEFALRAPPAEQIAALA QLSSPLGAFNTPFNTTIGVIACDAALSPAACRRIAIAAHDGLARTIRPAHTPLDGDTV FALATGAVAVPPEAGVPAALSPETQLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTY RDMFPGAFGS" gene 1502641..1503081 /locus_tag="Rv1334" /db_xref="GeneID:886875" CDS 1502641..1503081 /locus_tag="Rv1334" /function="UNKNOWN" /note="Rv1334, (MTCY130.19), len: 146 aa. Conserved hypothetical protein, similar to AL096852|SCE19A_13 hypothetical protein from Streptomyces coelicolor (140 aa), Fasta scores: opt: 579, E(): 0, (65.0% identity in 140 aa overlap); and Q54330|M29166 MEC+ from Streptomyces kasugaensis (115 aa), FASTA scores; E(): 7.6e-33, (56.9% identity in 109 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215850.1" /db_xref="GI:15608474" /db_xref="GeneID:886875" /translation="MLLRKGTVYVLVIRADLVNAMVAHARRDHPDEACGVLAGPEGSD RPERHIPMTNAERSPTFYRLDSGEQLKVWRAMEDADEVPVVIYHSHTATEAYPSRTDV KLATEPDAHYVLVSTRDPHRHELRSYRIVDGAVTEEPVNVVEQY" gene 1503103..1503384 /locus_tag="Rv1335" /db_xref="GeneID:886879" CDS 1503103..1503384 /locus_tag="Rv1335" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1335, (MT1376.1, MTCY130.20), len: 93 aa. 9.5 kDa culture filtrate antigen cfp10A (see citation below). Similar to hypothetical proteins from other organisms e.g. P74060|D90911 Synechocystis (109 aa), FASTA scores: E(): 2.3e-20, (49.5% identity in 93 aa overlap)." /codon_start=1 /transl_table=11 /product="9.5 kDa culture filtrate antigen CFP10A" /protein_id="NP_215851.1" /db_xref="GI:15608475" /db_xref="GeneID:886879" /translation="MNVTVSIPTILRPHTGGQKSVSASGDTLGAVISDLEANYSGISE RLMDPSSPGKLHRFVNIYVNDEDVRFSGGLATAIADGDSVTILPAVAGG" gene 1503394..1504365 /gene="cysM" /locus_tag="Rv1336" /db_xref="GeneID:886867" CDS 1503394..1504365 /gene="cysM" /locus_tag="Rv1336" /EC_number="2.5.1.47" /function="INVOLVED IN CYSTEINE BIOSYNTHESIS [CATALYTIC ACTIVITY : O3-ACETYL-L-SERINE + H(2)S = L-CYSTEINE + ACETATE]" /note="Rv1336, (MTCY130.21), len: 323 aa. Probable cysM, cysteine synthase B (EC 4.2.99.8), similar to many e.g. CYSM_ECOLI|P16703 Escherichia coli (303 aa), FASTA scores: opt: 720, E(): 4.6e-40, (41.1% identity in 302 aa overlap). Also similar to other Mycobacterium tuberculosis cysteine synthase subunits e.g. Rv1077, Rv2334, Rv0848, etc. Contains PS00901 Cysteine synthase/cystathionine beta-synthase P-phosphate attachment site. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY." /codon_start=1 /transl_table=11 /product="cysteine synthase B CysM" /protein_id="NP_215852.1" /db_xref="GI:15608476" /db_xref="GeneID:886867" /translation="MTRYDSLLQALGNTPLVGLQRLSPRWDDGRDGPHVRLWAKLEDR NPTGSIKDRPAVRMIEQAEADGLLRPGATILEPTSGNTGISLAMAARLKGYRLICVMP ENTSVERRQLLELYGAQIIFSAAEGGSNTAVATAKELAATNPSWVMLYQYGNPANTDS HYCGTGPELLADLPEITHFVAGLGTTGTLMGTGRFLREHVANVKIVAAEPRYGEGVYA LRNMDEGFVPELYDPEILTARYSVGAVDAVRRTRELVHTEGIFAGISTGAVLHAALGV GAGALAAGERADIALVVADAGWKYLSTGAYAGSLDDAETALEGQLWA" misc_feature 1503511..1503567 /gene="cysM" /locus_tag="Rv1336" /note="PS00901 Cysteine synthase/cystathionine beta-synthase P-phosphate attachment site" gene 1504356..1505078 /locus_tag="Rv1337" /db_xref="GeneID:886870" CDS 1504356..1505078 /locus_tag="Rv1337" /function="UNKNOWN" /note="Rv1337, (MTCY130.22), len: 240 aa. Probable integral membrane protein. Highly similar to P53426 hypothetical protein B1549_C3_240 from M.leprae (251); and P74553|D90916 hypothetical protein from Synechocystis sp. (198 aa), FASTA scores: E(): 2.3e-25, (43.6% identity in 181 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215853.1" /db_xref="GI:15608477" /db_xref="GeneID:886870" /translation="MGMTPRRKRRGGAVQITRPTGRPRTPTTQTTKRPRWVVGGTTIL TFVALLYLVELIDQLSGSRLDVNGIRPLKTDGLWGVIFAPLLHANWHHLMANTIPLLV LGFLMTLAGLSRFVWATAIIWILGGLGTWLIGNVGSSCGPTDHIGASGLIFGWLAFLL VFGLFVRKGWDIVIGLVVLFVYGGILLGAMPVLGQCGGVSWQGHLSGAVAGVVAAYLL SAPERKARALKRAGARSGHPKL" gene 1505075..1505890 /gene="murI" /locus_tag="Rv1338" /db_xref="GeneID:886866" CDS 1505075..1505890 /gene="murI" /locus_tag="Rv1338" /EC_number="5.1.1.3" /function="INVOLVED IN PEPTIDOGLYCAN BIOSYNTHESIS. PROVIDES THE (R)-GLUTAMIC ACID REQUIRED FOR CELL WALL BIOSYNTHESIS [CATALYTIC ACTIVITY : L-GLUTAMATE = D-GLUTAMATE]" /note="converts L-glutamate to D-glutamate, a component of peptidoglycan" /codon_start=1 /transl_table=11 /product="glutamate racemase" /protein_id="NP_215854.1" /db_xref="GI:15608478" /db_xref="GeneID:886866" /translation="MNSPLAPVGVFDSGVGGLTVARAIIDQLPDEDIVYVGDTGNGPY GPLTIPEIRAHALAIGDDLVGRGVKALVIACNSASSACLRDARERYQVPVVEVILPAV RRAVAATRNGRIGVIGTRATITSHAYQDAFAAARDTEITAVACPRFVDFVERGVTSGR QVLGLAQGYLEPLQRAEVDTLVLGCTHYPLLSGLIQLAMGENVTLVSSAEETAKEVVR VLTEIDLLRPHDAPPATRIFEATGDPEAFTKLAARFLGPVLGGVQPVHPSRIH" misc_feature 1505615..1505647 /gene="murI" /locus_tag="Rv1338" /note="PS00924 Aspartate and glutamate racemases signature 2" gene 1505917..1506738 /locus_tag="Rv1339" /db_xref="GeneID:886972" CDS 1505917..1506738 /locus_tag="Rv1339" /function="UNKNOWN" /note="Rv1339, (MTCY130.24), len: 273 aa. Conserved hypothetical protein, highly similar to Y211_MYCLE|P50474 hypothetical protein b1549_c2_211 from Mycobacterium leprae (284 aa), FASTA scores: opt: 1672, E(): 0, (86.2% identity in 276 aa overlap). Also similar to AL096852|SCE19A.08 hypothetical protein from Streptomyces coelicolor (250 aa), FASTA scores: opt: 630, E(): 0, (42.2% identity in 256 aa overlap). Similar to M. tuberculosis hypothetical proteins Rv3796, Rv2407." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215855.1" /db_xref="GI:15608479" /db_xref="GeneID:886972" /translation="MRRCIPHRCIGHGTVVSVRITVLGCSGSVVGPDSPASGYLLRAP HTPPLVIDFGGGVLGALQRHADPASVHVLLSHLHADHCLDLPGLFVWRRYHPSRPSGK ALLYGPSDTWSRLGAASSPYGGEIDDCSDIFDVHHWADSEPVTLGALTIVPRLVAHPT ESFGLRITDPSGASLAYSGDTGICDQLVELARGVDVFLCEASWTHSPKHPPDLHLSGT EAGMVAAQAGVRELLLTHIPPWTSREDVISEAKAEFDGPVHAVVCDETFEVRRAG" gene 1506755..1507534 /gene="rph" /locus_tag="Rv1340" /db_xref="GeneID:886864" CDS 1506755..1507534 /gene="rph" /locus_tag="Rv1340" /EC_number="2.7.7.56" /function="RNase PH IS A PHOSPHOROLYTIC EXORIBONUCLEASE THAT REMOVES NUCLEOTIDE RESIDUES FOLLOWING THE -CCA TERMINUS OF tRNA AND ADDS NUCLEOTIDES TO THE ENDS OF RNA MOLECULES BY USING NUCLEOSIDE DIPHOSPHATES AS SUBSTRATES [CATALYTIC ACTIVITY: {TRNA}(N+1) + PHOSPHATE = {TRNA}(N) + A NUCLEOSIDE DIPHOSPHATE]." /note="RNase PH; tRNA nucleotidyltransferase; forms hexamers in Bacillus subtilis; phosphoroltic 3'-5' exoribonuclease; involved in maturation of tRNA precursors and removes terminal nucleotides near CCA acceptor arms of mature tRNAs" /codon_start=1 /transl_table=11 /product="ribonuclease PH" /protein_id="NP_215856.1" /db_xref="GI:15608480" /db_xref="GeneID:886864" /translation="MSKREDGRLDHELRPVIITRGFTENPAGSVLIEFGHTKVLCTAS VTEGVPRWRKATGLGWLTAEYAMLPSATHSRSDRESVRGRLSGRTQEISRLIGRSLRA CIDLAALGENTIAIDCDVLQADGGTRTAAITGAYVALADAVTYLSAAGKLSDPRPLSC AIAAVSVGVVDGRIRVDLPYEEDSRAEVDMNVVATDTGTLVEIQGTGEGATFARSTLD KLLDMALGACDTLFAAQRDALALPYPGVLPQGPPPPKAFGT" repeat_region 1507531..1507581 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 1507573..1508187 /locus_tag="Rv1341" /db_xref="GeneID:886861" CDS 1507573..1508187 /locus_tag="Rv1341" /function="UNKNOWN" /note="HAM1-like protein; Rec-dependent growth; RgdB; yggV; it is suspected that this protein functions to remove misincorporated bases such as xanthine or hypoxanthine" /codon_start=1 /transl_table=11 /product="putative deoxyribonucleotide triphosphate pyrophosphatase" /protein_id="NP_215857.1" /db_xref="GI:15608481" /db_xref="GeneID:886861" /translation="MALVTKLLVASRNRKKLAELRRVLDGAGLSGLTLLSLGDVSPLP ETPETGVTFEDNALAKARDAFSATGLASVADDSGLEVAALGGMPGVLSARWSGRYGDD AANTALLLAQLCDVPDERRGAAFVSACALVSGSGEVVVRGEWPGTIAREPRGDGGFGY DPVFVPYGDDRTAAQLSPAEKDAVSHRGRALALLLPALRSLATG" gene complement(1508184..1508546) /locus_tag="Rv1342c" /db_xref="GeneID:886862" CDS complement(1508184..1508546) /locus_tag="Rv1342c" /function="UNKNOWN" /note="Rv1342c, (MTCY02B10.06c), len: 120 aa. Conserved membrane protein. Highly similar to G466926|P54133 hypothetical protein B1549_F2_59 from Mycobacterium leprae (119 aa), FASTA scores, opt: 544, E(): 1.9e-29, (68.3 % identity in 120 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177800.1" /db_xref="GI:57116851" /db_xref="GeneID:886862" /translation="MTAPETPAAQHAEPAIAVERIRTALLGYRIMAWTTGLWLIALCY EIVVRYVVKVDNPPTWIGVVHGWVYFTYLLLTLNLAVKVRWPLGKTAGVLLAGTIPLL GIVVEHFQTKEIKARFGL" gene complement(1508543..1508923) /gene="lprD" /locus_tag="Rv1343c" /db_xref="GeneID:886852" CDS complement(1508543..1508923) /gene="lprD" /locus_tag="Rv1343c" /function="UNKNOWN" /note="Rv1343c, (MTCY02B10.07c), len: 126 aa. Probable lprD, conserved lipoprotein, highly similar to G466928 Mycobacterium leprae protein B1549_F3_106 (126 aa), FASTA scores, opt: 704, E(): 7.5e-36, (78.4 % identity in 125 aa overlap). Has N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein attachment site. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LprD" /protein_id="NP_215859.1" /db_xref="GI:15608483" /db_xref="GeneID:886852" /translation="MSTTRRRRPALIALVIIATCGCLALGWWQWTRFQSTSGTFQNLG YALQWPLFAWFCVYAYRNFVRYEETPPQPPTGGAAAEIPAGLLPERPKPAQQPPDDPV LREYNAYLAELAKDDARKQNRTTA" gene 1508968..1509288 /locus_tag="Rv1344" /db_xref="GeneID:886848" CDS 1508968..1509288 /locus_tag="Rv1344" /function="THOUGHT TO BE INVOLVED IN DE NOVO FATTY ACID BIOSYNTHESIS; THIS PROTEIN IS THE CARRIER OF THE GROWING FATTY ACID CHAIN IN FATTY ACID BIOSYNTHESIS" /note="carries the fatty acid chain in fatty acid biosynthesis" /codon_start=1 /transl_table=11 /product="acyl carrier protein" /protein_id="NP_215860.1" /db_xref="GI:15608484" /db_xref="GeneID:886848" /translation="MWRYPLSTRLALPNTPGVASFAMTSSPSTVSTTLLSILRDDLNI DLTRVTPDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIAAK YRDE" gene 1509281..1510846 /gene="fadD33" /locus_tag="Rv1345" /db_xref="GeneID:886855" CDS 1509281..1510846 /gene="fadD33" /locus_tag="Rv1345" /EC_number="2.3.1.86" /function="UNKNOWN, BUT POSSIBLY INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="converts medium- to long-chain aliphatic fatty acids into acyl adenylate; involved in mycobactin synthesis" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--[acyl-carrier-protein] ligase" /protein_id="NP_215861.1" /db_xref="GI:15608485" /db_xref="GeneID:886855" /translation="MSELAAVLTRSMQASAGDLMVLDRETSLWCRHPWPEVHGLAESV AAWLLDHDRPAAVGLVGEPTVELVAAIQGAWLAGAAVSILPGPVRGANDQRWADATLT RFLGIGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVASEGPAVLQGTA GSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWLPLYHDMGLAFVLSAALAG APLWLAPTTAFTASPFRWLSWLSDSGATMTAAPNFAYNLIGKYARRVSEVDLGALRVT LNGGEPVDCDGLTRFAEAMAPFGFDAGAVLPSYGLAESTCAVTVPVPGIGLLADRVID GSGAHKHAVLGNPIPGMEVRISCGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPD DWFATGDLGYLGAGGLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGT GDRSTRPGLVVAAEFRGPDEANARAELIQRVASECGIVPSDVVFVSPGSLPRTSSGKL RRLAVRRSLEMAD" gene 1510846..1512006 /gene="fadE14" /locus_tag="Rv1346" /db_xref="GeneID:886844" CDS 1510846..1512006 /gene="fadE14" /locus_tag="Rv1346" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1346, (MTCY02B10.10), len: 386 aa. Possible fadE14, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. NP_251579.1|NC_002516 probable acyl-CoA dehydrogenase from Pseudomonas aeruginosa (386 aa); NP_036951.1|NM_012819|ACDL_RAT|P15650 acyl Coenzyme A dehydrogenase (long chain) from Rattus norvegicus (430 aa), FASTA scores: opt: 414, E(): 1.2e-18, (26.1% identity in 376 aa overlap); etc." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="NP_215862.1" /db_xref="GI:15608486" /db_xref="GeneID:886844" /translation="MTAGSDLDDFRGLLAKAFDERVVAWTAEAEAQERFPRQLIEHLG VCGVFDAKWATDARPDVGKLVELAFALGQLASAGIGVGVSLHDSAIAILRRFGKSDYL RDICDQAIRGAAVLCIGASEESGGSDLQIVETEIRSRDGGFEVRGVKKFVSLSPIADH IMVVARSVDHDPTSRHGNVAVVAVPAAQVSVQTPYRKVGAGPLDTAAVCIDTWVPADA LVARAGTGLAAISWGLAHERMSIAGQIAASCQRAIGITLARMMSRRQFGQTLFEHQAL RLRMADLQARVDLLRYALHGIAEQGRLELRTAAAVKVTAARLGEEVISECMHIFGGAG YLVDETTLGKWWRDMKLARVGGGTDEVLWELVAAGMTPDHDGYAAVVGASKA" gene complement(1511973..1512605) /locus_tag="Rv1347c" /db_xref="GeneID:886846" CDS complement(1511973..1512605) /locus_tag="Rv1347c" /function="UNKNOWN" /note="Rv1347c, (MTCY02B10.11c), len: 210 aa. Conserved hypothetical protein, some similarity to the C-terminus of malonyl-coenzyme A carboxylases e.g. G545170 malonyl-coenzyme A carboxylase (417 aa), FASTA scores: opt: 392, E(): 4.9 e-20, (35.6% identity in 174 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215863.1" /db_xref="GI:15608487" /db_xref="GeneID:886846" /translation="MTKPTSAGQADDALVRLARERFDLPDQVRRLARPPVPSLEPPYG LRVAQLTDAEMLAEWMNRPHLAAAWEYDWPASRWRQHLNAQLEGTYSLPLIGSWHGTD GGYLELYWAAKDLISHYYDADPYDLGLHAAIADLSKVNRGFGPLLLPRIVASVFANEP RCRRIMFDPDHRNTATRRLCEWAGCKFLGEHDTTNRRMALYALEAPTTAA" gene 1512728..1512811 /locus_tag="Rvnt19" /note="tRNA-Leu(TAG)" /db_xref="GeneID:2700424" tRNA 1512728..1512811 /locus_tag="Rvnt19" /product="tRNA-Leu" /note="codon recognized: CUA" /anticodon=(pos:1512762..1512764,aa:Leu) /db_xref="GeneID:2700424" gene 1513047..1515626 /locus_tag="Rv1348" /db_xref="GeneID:886853" CDS 1513047..1515626 /locus_tag="Rv1348" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DRUGS ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1348, (MTCY02B10.12), len: 859 aa. Probable drugs-transport transmembrane protein ATP binding protein ABC transporter (see citation below), similar to HMT1_SCHPO|Q02592 heavy metal tolerance protein precursor from Schizosaccharomyces pombe (830 aa), FASTA scores: opt: 806, E(): 5.1e-39, (32.9% identity in 504 aa overlap); etc. Also similar to MTCY02B10.13 from Mycobacterium tuberculosis, FASTA score: (31.9% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="drugs-transport transmembrane ATP-binding protein ABC transporter" /protein_id="NP_215864.1" /db_xref="GI:15608488" /db_xref="GeneID:886853" /translation="MARGLQGVMLRSFGARDHTATVIETISIAPHFVRVRMVSPTLFQ DAEAEPAAWLRFWFPDPNGSNTEFQRAYTISEADPAAGRFAVDVVLHDPAGPASSWAR TVKPGATIAVMSLMGSSRFDVPEEQPAGYLLIGDSASIPGMNGIIETVPNDVPIEMYL EQHDDNDTLIPLAKHPRLRVRWVMRRDEKSLAEAIENRDWSDWYAWATPEAAALKCVR VRLRDEFGFPKSEIHAQAYWNAGRAMGTHRATEPAATEPEVGAAPQPESAVPAPARGS WRAQAASRLLAPLKLPLVLSGVLAALVTLAQLAPFVLLVELSRLLVSGAGAHRLFTVG FAAVGLLGTGALLAAALTLWLHVIDARFARALRLRLLSKLSRLPLGWFTSRGSGSIKK LVTDDTLALHYLVTHAVPDAVAAVVAPVGVLVYLFVVDWRVALVLFGPVLVYLTITSS LTIQSGPRIVQAQRWAEKMNGEAGSYLEGQPVIRVFGAASSSFRRRLDEYIGFLVAWQ RPLAGKKTLMDLATRPATFLWLIAATGTLLVATHRMDPVNLLPFMFLGTTFGARLLGI AYGLGGLRTGLLAARHLQVTLDETELAVREHPREPLDGEAPATVVFDHVTFGYRPGVP VIQDVSLTLRPGTVTALVGPSGSGKSTLATLLARFHDVERGAIRVGGQDIRSLAADEL YTRVGFVLQEAQLVHGTAAENIALAVPDAPAEQVQVAAREAQIHDRVLRLPDGYDTVL GANSGLSGGERQRLTIARAILGDTPVLILDEATAFADPESEYLVQQALNRLTRDRTVL VIAHRLHTITRADQIVVLDHGRIVERGTHEELLAAGGRYCRLWDTGQGSRVAVAAAQD GTR" misc_feature 1514973..1514996 /locus_tag="Rv1348" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1515282..1515326 /locus_tag="Rv1348" /note="PS00211 ABC transporters family signature" gene 1515623..1517362 /locus_tag="Rv1349" /db_xref="GeneID:886834" CDS 1515623..1517362 /locus_tag="Rv1349" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DRUGS ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1349, (MTCY02B10.13), len: 579 aa. Probable drugs-transport transmembrane ATP binding protein ABC transporter (see citation below), most similar to YWJA_BACSU|P45861 hypothetical ABC transporter from Bacillus subtilis (575 aa), FASTA scores: opt: 721, E(): 1.8e-35, (28.9% identity in 567 aa overlap); etc. Also similar to MTCY02B10.12 from Mycobacterium tuberculosis, FASTA score: (31.9% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="drugs-transport transmembrane ATP-binding protein ABC transporter" /protein_id="NP_215865.1" /db_xref="GI:15608489" /db_xref="GeneID:886834" /translation="MIRTWIALVPNDHRARLIGFALLAFCSVVARAVGTVLLVPLMAA LFGEAPQRAWLWLGWLSAATVAGWVLDAVTARIGIELGFAVLNHTQHDVADRLPVVRL DWFTAENTATARQAIAATGPELVGLVVNLVTPLTSAILLPAVIALALLPISWQLGVAA LAGVPLLLGALWASAAFARRADTAADKANTALTERIIEFARTQQALRAARRVEPARSL VGNALASQHTATMRLLGMQIPGQLLFSIASQLALIVLAGTTAALTITGTLTVPEAIAL IVVMVRYLEPFTAVSELAPALESTRATLGRIGSVLTAPVMVAGSGTWRDGAVVPRIEF DDVAFGYDGGSGPVLDGVSFCLQPGTTTAIVGPSGCGKSTILALIAGLHQPTRGRVLI DGTDVATLDARAQQAVCSVVFQHPYLFHGTIRDNVFAADPGASDDQFAQAVRLARVDE LIARLPDGANTIVGEAGSALSGGERQRVSIARALLKAAPVLLVDEATSALDAENEAAV VDALAADPRSRTRVIVAHRLASIRHADRVLFVDDGRVVEDGSISELLTAGGRFSQFWR QQHEAAEWQILAE" misc_feature 1516718..1516741 /locus_tag="Rv1349" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1517030..1517074 /locus_tag="Rv1349" /note="PS00211 ABC transporters family signature" gene 1517491..1518234 /gene="fabG" /locus_tag="Rv1350" /db_xref="GeneID:886837" CDS 1517491..1518234 /gene="fabG" /locus_tag="Rv1350" /EC_number="1.1.1.100" /function="INVOLVED IN THE FATTY ACID BIOSYNTHESIS PATHWAY (FIRST REDUCTION STEP) [CATALYTIC ACTIVITY: (3R)-3-hydroxyacyl-[acyl-carrier protein] + NADP+ = 3-oxoacyl-[acyl-carrier protein] + NADPH]." /note="Catalyzes the first of the two reduction steps in the elongation cycle of fatty acid synthesis" /codon_start=1 /transl_table=11 /product="3-ketoacyl-(acyl-carrier-protein) reductase" /protein_id="NP_215866.1" /db_xref="GI:15608490" /db_xref="GeneID:886837" /translation="MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEAT EVAAKRLGGDDVALAVRCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMRTM TEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMVGQTNYSAAKA GIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRIWDQKLAEVPMGRAGEPSE VASVAVFLASDLSSYMTGTVLDVTGGRFI" misc_feature 1517914..1518000 /gene="fabG" /locus_tag="Rv1350" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 1518231..1518560 /locus_tag="Rv1351" /db_xref="GeneID:886831" CDS 1518231..1518560 /locus_tag="Rv1351" /function="UNKNOWN" /note="Rv1351, (MTCY02B10.15), len: 109 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215867.1" /db_xref="GI:15608491" /db_xref="GeneID:886831" /translation="MTPRSLPRYGNSSRRKSFPMHRPSNVATATRKKSSIGWVLLACS VAGCKGIDTTEFILGRAGAFELAVRAAQHRHRYLTMVNVGRAPPRRCRTVCMAATDTP RNIRLNG" gene 1518763..1519134 /locus_tag="Rv1352" /db_xref="GeneID:886833" CDS 1518763..1519134 /locus_tag="Rv1352" /function="UNKNOWN" /note="Rv1352, (MTCY02B10.16), len: 123 aa. Conserved hypothetical protein, some similarity to Rv1906c|MTCY180.12 hypothetical protein from Mycobacterium tuberculosis (156 aa), FASTA scores: E(): 4e-05, (36.2% identity in 116 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215868.1" /db_xref="GI:15608492" /db_xref="GeneID:886833" /translation="MARTLALRASAGLVAGMAMAAITLAPGARAETGEQFPGDGVFLV GTDIAPGTYRTEGPSNPLILVFGRVSELSTCSWSTHSAPEVSNENIVDTNTSMGPMSV VIPPTVAAFQTHNCKLWMRIS" gene complement(1519200..1519985) /locus_tag="Rv1353c" /db_xref="GeneID:886827" CDS complement(1519200..1519985) /locus_tag="Rv1353c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1353c, (MTCY02B10.17c), len: 261 aa. Probable transcriptional regulatory protein, similar to TER1_ECOLI|P03038 tetracycline repressor protein class A from Escherichia coli (216 aa), FASTA scores, opt: 231, E(): 1.6e-08, (31.3% identity in 211 aa overlap). Helix turn helix motif present at aa 3859 (+3.59 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215869.1" /db_xref="GI:15608493" /db_xref="GeneID:886827" /translation="MQTTPGKRQRRQRGSINPEDIISGAFELAQQVSIDNLSMPLLGK HLGVGVTSIYWYFRKKDDLLNAMTDRALSKYVFATPYIEAGDWRETLRNHARSMRKTF ADNPVLCDLILIRAALSPKTARLGAQEMEKAIANLVTAGLSLEDAFDIYSAVSVHVRG SVVLDRLSRKSQSAGSGPSAIEHPVAIDPATTPLLAHATGRGHRIGAPDETNFEYGLE CILDHAGRLIEQSSKAAGEVAVRRPTATADAPTPGARAKAVAR" gene complement(1520005..1521876) /locus_tag="Rv1354c" /db_xref="GeneID:886829" CDS complement(1520005..1521876) /locus_tag="Rv1354c" /function="UNKNOWN" /note="Rv1354c, (MTCY02B10.18c), len: 623 aa. Conserved hypothetical protein, similar to many hypothetical proteins e.g. the C-terminus of G1001455 Synechocystis sp. (1244 aa), FASTA scores: opt: 933, E(): 0, (36.8% identity in 462 aa overlap); also similar to Rv1357c|MTCY02B10.21c (34.0% identity in 253 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215870.1" /db_xref="GI:15608494" /db_xref="GeneID:886829" /translation="MCNDTATPQLEELVTTVANQLMTVDAATSAEVSQRVLAYLVEQL GVDVSFLRHNDRDRRATRLVAEWPPRLNIPDPDPLRLIYFADADPVFALCEHAKEPLV FRPEPATEDYQRLIEEARGVPVTSAAAVPLVSGEITTGLLGFIKFGDRKWHEAELNAL MTIATLFAQVQARVAAEARLRYLADHDDLTGLHNRRALLQHLDQRLAPGQPGPVAALF LDLDRLKAINDYLGHAAGDQFIHVFAQRIGDALVGESLIARLGGDEFVLIPASPMSAD AAQPLAERLRDQLKDHVAIGGEVLTRTVSIGVASGTPGQHTPSDLLRRADQAALAAKH AGGDSVAIFTADMSVSGELRNDIELHLRRGIESDALRLVYLPEVDLRTGDIVGTEALV RWQHPTRGLLAPGCFIPVAESINLAGELDRWVLRRACNEFSEWQSAGLGHDALLRINV SAGQLVTGGFVDFVADTIGQHGLDASSVCLEITENVVVQDLHTARATLARLKEVGVHI AIDDFGTGYSAISLLQTLPIDTLKIDKTFVRQLGTNTSDLVIVRGIMTLAEGFQLDVV AEGVETEAAARILLDQRCYRAQGFLFSRPVPGEAMRHMLSARRLPPTCIPATDPALS" gene complement(1521885..1524032) /gene="moeY" /locus_tag="Rv1355c" /db_xref="GeneID:886839" CDS complement(1521885..1524032) /gene="moeY" /locus_tag="Rv1355c" /function="INVOLVED IN BIOSYNTHESIS OF A DEMOLYBDO COFACTOR (MOLYBDOPTERIN), NECESSARY FOR MOLYBDOENZYMES. PLAYS A ROLE IN THE ACTIVATION OF THE SMALL SUBUNIT OF THE MOLYBDOPTERIN CONVERTING FACTOR (MOAD)" /experiment="experimental evidence, no additional details recorded" /note="Rv1355c, (MTCY02B10.19c), len: 715 aa. Possible moeY, Molybdopterin biosynthesis protein, very weak similarity to MOEB_ECOLI|P12282 molybdopterin biosynthesis moeb protein (249 aa), FASTA scores, opt: 180, E(): 8.5e-05, (29.3% identity in 174 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215871.1" /db_xref="GI:15608495" /db_xref="GeneID:886839" /translation="MTIPHEGGSTGILVLRDDDHDDVLVLDRLRSDPSIEFVDRFAEQ LAGVRRLLPQPDPDLLEEAKRWAYYPWRRMVVAILGLRGFRAVRLDRNRHLITAEEQR ALHALRVGVVGLSAGHAIAYTLAAEGACGTLRLADFDKIELSNLNRVPVGVFDIGLNK AMIAARRIAELDPYLAVDLVTSGLSPESVDEFLDGLDVVIEECDSLDIKVILRQAACA RGVPVLMATSDRGLVDVERYDVEPGRPIFHGLLGDIDADKLCGLTTKDKVPHVLNILD CQELSARCAASMIEVDQTLWGWPQLAGDIWVGAATVAEAVRRIGLGEPLESGRVRVDV SAALDRLDQPPMPSRGNGWLLESVPPTAPAEPQPTSEIVAQAAIRAPSGGNVQPWHVV AKQHSLTIRLAPEHTSAMDIAFRGSAVAVGAAMFNARVAAAAHRVLGSVEFDESQPDS PLQATMHFGRGDDPSLAALYRPMLLRTTNRHHGMPGHVHPATVELLTNTAAAEGARLQ LLLSRNEIDRAATILAAADRIRYLTPRLHEEMMSELRWPGDPSLDAGIDVRSLELDSG ELRVLDILRRSDVVARLAQWDCGTALEDNTNERVSASSALAIVYVDGATLTDFARGGS AMQAVWIVAQQHGLAVQPMSPIFLYARGRHDLDQASPHFAAQLHRLQLDFRELVKPGK EGHEVLIFRLFHAPPPSVCSRRRVRHAIPEPHR" gene complement(1524029..1524820) /locus_tag="Rv1356c" /db_xref="GeneID:886850" CDS complement(1524029..1524820) /locus_tag="Rv1356c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1356c, (MTCY02B10.20c), len: 263 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215872.1" /db_xref="GI:15608496" /db_xref="GeneID:886850" /translation="MLIAGYLTDWRIMTTAQLRPIAPQKLHFSENLSVWVSDAQCRLV VSQPALDPTLWNTYLQGALRAYSKHGVECTLDLDAISDGSDTQLFFAAIDIGGDVVGG ARVIGPLRSADDSHAVVEWAGNPGLSAVRKMINDRAPFGVVEVKSGWVNSDAQRSDAI AAALARALPLSMSLLGVQFVMGTAAAHALDRWRSSGGVIAARIPAAAYPDERYRTKMI WWDRRTLANHAEPKQLSRMLVESRKLLRDVEALSATTAATAGAEQ" gene complement(1525293..1526216) /locus_tag="Rv1357c" /db_xref="GeneID:886815" CDS complement(1525293..1526216) /locus_tag="Rv1357c" /function="UNKNOWN" /note="Rv1357c, (MTCY02B10.21c), len: 307 aa. Conserved hypothetical protein, similar to members of the YEGE/YHJK/YJCC family e.g. Y4LL_RHISN|P55552 hypothetical protein Y4ll from Rhizobium sp. (827 aa), FASTA scores: E(): 0, (37.7% identity in 257 aa overlap), also similar to Rv1354c|MTCY02B10.18c (34.0% identity in 253 aa overlap). BELONGS TO THE YEGE/YHDA/YHJK/YJCC FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215873.1" /db_xref="GI:15608497" /db_xref="GeneID:886815" /translation="MDRCCQRATAFACALRPTKLIDYEEMFRGAMQARAMVANPDQWA DSDRDQVNTRHYLSTSMRVALDRGEFFLVYQPIIRLADNRIIGAEALLRWEHPTLGTL LPGRFIDRAENNGLMVPLTAFVLEQACRHVRSWRDHSTDPQPFVSVNVSASTICDPGF LVLVEGVLGETGLPAHALQLELAEDARLSRDEKAVTRLQELSALGVGIAIDDFGIGFS SLAYLPRLPVDVVKLGGKFIECLDGDIQARLANEQITRAMIDLGDKLGITVTAKLVET PSQAARLRAFGCKAAQGWHFAKALPVDFFRE" gene 1526612..1530091 /locus_tag="Rv1358" /db_xref="GeneID:886817" CDS 1526612..1530091 /locus_tag="Rv1358" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1358, (MTCY02B10.22), len: 1159 aa. Probable transcriptional regulatory protein, some similarity to AFSR_STRCO|P25941 regulatory protein afsr from Streptomyces coelicolor (993 aa), FASTA scores: opt: 210, E(): 5.5e-06, (27.5% identity in 739 aa overlap). Similar also to Rv0890C|MTCY31.18c (65.5% identity in 884 aa overlap) and to Rv1359|MTCY02B10.23 (43.7% identity in 197 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, PS00622 Bacterial regulatory proteins, luxR family signature. Helix turn helix motif present at aa 1116-1137, (Score 1291, +3.59 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215874.1" /db_xref="GI:15608498" /db_xref="GeneID:886817" /translation="MFLSAPAFRVEPTRSRHSALRWARHRRFADGPRWQMLRSLQIAD QIARTGHMPVRRLDLIWISARNAARRELDLGVAALVEAVTLLTADVEGSTRLSQTRLN ELAADYPTLDQNISEAVAAHGGVTRPVDQEVGSGLVVAFLRAGDAIACALELQLSTLA PMRPRVGVHTGDVRLRGDGTITGSAINESACLRDLAHEGQTLLSAATGDLVIDQLPAN TWLTDVGKYPLRGLHRQERVIQLCHRDLRNEFPPLRMSVGNRSSLPAQFTTFVGRDAQ INEVQEVLTNYRLVTLRGEGGVGKTRLAIQIAAASEFRDGLCFVDLAPIADPGMVSTT AAHALGLIDRPGSSTFDTLSHAIGNCHMLMVLDNCEHVLDACAELVVELLGACPELSI LATSRESIGVTGEVTWVVPSLSPANEAIQLFTERARLVQPNFEIVADNFDAVSEICRR LDGMPLAIELAAARLRSLSPNEIANSLDDRFRLLTGGARSTVQRQQTLRASMDWSYAL LTDTERILFRRLAVFVGGFDLTAASEVAAAGGDDFVERYSVLDQLTLLVDKSLVVAEE SRGSTRYRLLETVRQYALEKLNESEEIDGVRARHRTHYATMAAGLNVPASTDYEQRLL QAEAEIDNLRAAFTWSRGNGDIAAALQLASALQPLWSQGRMREGLAWLESILEREGDN HLVPAGVWARALAEKVILKAWPATSPMGAPDIVAQAHHALALARDAGDCAVLARALVA CGCGSGCDTEAAQPYFAEAIELARAINDEWTLSQIDYWQVVGIFISGQPIPLRAAAEQ ARELADSIGNRFVSRQCRLFACLAQIWEGDANGALALSRDVTAEAEVANDVVTKVLGL YVEAMALSYIGDSAARTIAGAALEAATELGGIYQDLGYGAITRAALAAGDVAAIEASE ASWDLRNQHNVVTAHHELMAQAALVRGDVTTARRFADEAVLASTGWHLMMALIARARV AIAQDELGKARDDAHAAVACGVGVQTYLAMPDALELLAGLAGEAGNHGQAVRLFGAAA AQRQRTGEVRHKIWDAGYEAATAALRDAMGDEDFTAAWAEGAAAPLDEAIAYAQRGRG ERKRPSNGWDALTPAEHKIVKLVTEGLVTKDIAARLFVSPRTVQTHLTHIYTKLDVTS RVQLVQEAAQHST" misc_feature 1527491..1527514 /locus_tag="Rv1358" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1529951..1530034 /locus_tag="Rv1358" /note="PS00622 Bacterial regulatory proteins, luxR family signature" gene 1530173..1530925 /locus_tag="Rv1359" /db_xref="GeneID:886841" CDS 1530173..1530925 /locus_tag="Rv1359" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1359, (MTCY02B10.23), len: 250 aa. Probable transcriptional regulatory protein, similar to Rv0891c|MTCY31.19c, (48.5% identity in 204 aa overlap) and to Rv1358|MTCY02B10.22 (43.7% identity in 197 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215875.1" /db_xref="GI:15608499" /db_xref="GeneID:886841" /translation="MFMALRAPMLERMNGLHTDDAPVNWLERRGGRLTSRRRVTLLHA GVEHPMRLWGVQSEAITAAMVLSRKVSAIIAGHCGVRLVDQGVGDGFVAAFAHASDAV ACALELHQAPLSPIVLRIGIHTGEAQLVDERIYAGATMNLAAELRDLAHGGQTVMSGA TEDAVLGRLPMRAWLIGLRPMEGSPEGHNFPQSQRIAQLCHPNLRNTFPPLRMRIADA SGIPYVGRILVNVQVVPHWEGGCAAAGMVLAG" gene 1531348..1532370 /locus_tag="Rv1360" /db_xref="GeneID:886813" CDS 1531348..1532370 /locus_tag="Rv1360" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1360, (MTCY02B10.24), len: 340 aa. Probable oxidoreductase (EC 1.-.-.-). Similar to Q49598|G1002714 coenzyme F420-dependent n5, n10-methylenetetrahydromethanopterin reductase from Methanopyrus kandleri (349 aa), FASTA scores: opt: 264, E(): 4.4e-11, (26.3% identity in 323 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_215876.1" /db_xref="GI:15608500" /db_xref="GeneID:886813" /translation="MGGARRLKLDGSIPNQLARAADAAVALERNGFDGGWTAEASHDP FLPLLLAAEHTSRLELGTNIAVAFARNPMIVANVGWDLQTYSKGRLILGLGTQIRPHI EKRFSMPWGHPARRMREFVAALRAIWLAWQDGTKLCFEGEFYTHKIMTPMFTPEPQPY PVPRVFIAAVGEAMTEMCGEVADGHLGHPMVSKRYLTEVSVPALLRGLARSGRDRSAF EVSCEVMVATGADDAELAAACTATRKQIAFYGSTPAYRKVLEQHGWGDLHPELHRLSK LGEWEAMGGLIDDEMLGAFAVVGPVDTIAGALRNRCEGVVDRVLPIFMAASQECINAA LQDFRR" gene complement(1532443..1533633) /gene="PPE19" /locus_tag="Rv1361c" /db_xref="GeneID:886819" CDS complement(1532443..1533633) /gene="PPE19" /locus_tag="Rv1361c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1361c, (MTCY02B10.25c), len: 396 aa. PPE19 (alternate gene name: mtb39b). Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, highly similar to many e.g. Rv1196|MTCI364.08|PPE18, FASTA scores: E(): 0, (84.9% identity in 397 aa overlap); MTCY274.23c (42.3% identity in 416 aa overlap); etc. Contains PS00501 Signal peptidases I serine active site. Note that expression of Rv1361c was demonstrated in lysates by immunodetection (see Dillon et al., 1999).; mtb39b" /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177801.1" /db_xref="GI:57116852" /db_xref="GeneID:886819" /translation="MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAAS AFQSVVWGLTTGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYETAY GLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAAAMFGYAATAA TATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQLMNNVPQALQQLAQPTKSI WPFDQLSELWKAISPHLSPLSNIVSMLNNHVSMTNSGVSMASTLHSMLKGFAPAAAQA VETAAQNGVQAMSSLGSQLGSSLGSSGLGAGVAANLGRAASVGSLSVPQAWAAANQAV TPAARALPLTSLTSAAQTAPGHMLGGLPLGQLTNSGGGFGGVSNALRMPPRAYVMPRV PAAG" misc_feature complement(1532848..1532871) /gene="PPE19" /locus_tag="Rv1361c" /note="PS00501 Signal peptidases I serine active site" gene complement(1533948..1534610) /locus_tag="Rv1362c" /db_xref="GeneID:886809" CDS complement(1533948..1534610) /locus_tag="Rv1362c" /function="UNKNOWN" /note="Rv1362c, (MTCY02B10.26c), len: 220 aa. Possible membrane protein, similar to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1362c|MTCY02B10.27c (25.9% identity in 216 aa overlap), Rv0177, Rv1973, Rv1972, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215878.1" /db_xref="GI:15608502" /db_xref="GeneID:886809" /translation="MTDDVRDVNTETTDATEVAEIDSAAGEAGDSATEAFDTDSATES TAQKGQRHRDLWRMQVTLKPVPVILILLMLISGGATGWLYLEQYRPDQQTDSGAARAA VAAASDGTIALLSYSPDTLDQDFATARSHLAGDFLSYYDQFTQQIVAPAAKQKSLKTT AKVVRAAVSELHPDSAVVLVFVDQSTTSKDSPNPSMAASSVMVTLAKVDGNWLITKFT PV" gene complement(1534607..1535392) /locus_tag="Rv1363c" /db_xref="GeneID:886811" CDS complement(1534607..1535392) /locus_tag="Rv1363c" /function="UNKNOWN" /note="Rv1363c, (MTCY02B10.27c), len: 261 aa. Possible membrane protein, similar to Mycobacterium tuberculosis hypothetical proteins Rv1362c|MTCY02B10.26c (25.9% identity in 216 aa overlap ); Rv1972|MTV051.10 and Rv0177 etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215879.1" /db_xref="GI:15608503" /db_xref="GeneID:886811" /translation="MAETTEPPSDAGTSQADAMALAAEAEAAEAEALAAAARARARAA RLKREALAMAPAEDENVPEEYADWEDAEDYDDYDDYEAADQEAARSASWRRRLRVRLP RLSTIAMAAAVVIICGFTGLSGYIVWQHHEATERQQRAAAFAAGAKQGVINMTSLDFN KAKEDVARVIDSSTGEFRDDFQQRAADFTKVVEQSKVVTEGTVNATAVESMNEHSAVV LVAATSRVTNSAGAKDEPRAWRLKVTVTEEGGQYKMSKVEFVP" gene complement(1535683..1537644) /locus_tag="Rv1364c" /db_xref="GeneID:886802" CDS complement(1535683..1537644) /locus_tag="Rv1364c" /function="UNKNOWN" /note="Rv1364c, (MTCY02B10.28c), len: 653 aa. Conserved hypothetical protein, some similarity to RSBU_BACSU|P40399 sigma factor sibg regulation protein from Bacillus subtilis (335 aa), FASTA scores: opt: 224, E(): 2e-07, (25.8% identity in 244 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177802.1" /db_xref="GI:57116853" /db_xref="GeneID:886802" /translation="MAAEMDWDKTVGAAEDVRRIFEHIPAILVGLEGPDHRFVAVNAA YRGFSPLLDTVGQPAREVYPELEGQQIYEMLDRVYQTGEPQSGSEWRLQTDYDGSGVE ERYFDFVVTPRRRADGSIEGVQLIVDDVTSRVRARQAAEARVEELSERYRNVRDSATV MQQALLAASVPVVPGADIAAEYLVAAEDTAAGGDWFDALALGDRLVLVVGDVVGHGVE AAAVMSQLRTALRMQISAGYTVVEALEAVDRFHKQVPGSKSATMCVGSLDFTSGEFQY CTAGHPPPLLVTADASARYVEPTGAGPLGSGTGFPVRSEVLNIGDAILFYTDGLIERP GRPLEASTAEFADLAASIASGSGGFVLDAPARPIDRLCSDTLELLLRSTGYNDDVTLL AMQRRAPTPPLHITLDATINAARTVRAQLREWLAEIGADHSDIADIVHAISEFVENAV EHGYATDVSKGIVVAAALAGDGNVRASVIDRGQWKDHRDGARGRGRGLAMAEALVSEA RIMHGAGGTTATLTHRLSRPARFVTDTMVRRAAFQQTIDSEFVSLVESGRIVVRGDVD STTAATLDRQIAVESRSGIAPVTIDLSAVTHLGSAGVGALAAACDRARKQGTECVLVA PPGSPAHHVLSLVQLPVVGADTEDIFAQE" gene complement(1537783..1538169) /gene="rsfA" /locus_tag="Rv1365c" /db_xref="GeneID:886794" CDS complement(1537783..1538169) /gene="rsfA" /locus_tag="Rv1365c" /function="REGULATES NEGATIVELY Rv3287c|RSBW|USFX. REGULATED BY REDOX POTENTIAL." /experiment="experimental evidence, no additional details recorded" /note="Rv1365c, (MTCY02B10.29c), len: 128 aa. rsfA, anti-anti-sigma factor (see citation below), similar to other Mycobacterium tuberculosis proteins e.g. Rv2638|MTCY441.08 (148 aa), FASTA scores: E(): 0, (53.6% identity in 125 aa overlap); Rv1904, Rv3687c. Weak similarity to putative anti-anti-sigma factors e.g. AF134889|AF134889_1 Streptomyces coelicolor (113 aa), FASTA scores: opt: 137, E(): 0.004, (26.0% identity in 100 aa overlap)." /codon_start=1 /transl_table=11 /product="anti-anti-sigma factor RSFA (anti-sigma factor antagonist) (regulator of sigma F A)" /protein_id="NP_215881.1" /db_xref="GI:15608505" /db_xref="GeneID:886794" /translation="MNPTQAGSFTTPVSNALKATIQHHDSAVIIHARGEIDAANEHTW QDLVTKAAAATTAPEPLVVNLNGLDFMGCCAVAVLAHEAERCRRRGVDVRLVSRDRAV ARIIHACGYGDVLPVHPTTESALSAT" gene 1538390..1539211 /locus_tag="Rv1366" /db_xref="GeneID:886805" CDS 1538390..1539211 /locus_tag="Rv1366" /function="UNKNOWN" /note="Rv1366, (MTCY02B10.30), len: 273 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215882.1" /db_xref="GI:15608506" /db_xref="GeneID:886805" /translation="MVVALVGSAIVDLHSRPPWSNNAVRRLGVALRDGVDPPVDCPSY AEVMLWHADLAAEVQDRIEGRSWSASELLVTSRAKSQDTLLAKLRRRPYLQLNTIQDI AGVRIDADLLLGEQTRLAREIADHFGADQPAIHDLRDHPHAGYRAVHVWLRLPAGRVE IQIRTILQSLWANFYELLADAYGRGIRYDERPEQLAAGVVPAQLQELVGVMQDASADL AMHEAEWQHCAEIEYPGQRAMALGEASKNKATVLATTKFRLERAINEAESAGGGG" gene complement(1539512..1540645) /locus_tag="Rv1367c" /db_xref="GeneID:886793" CDS complement(1539512..1540645) /locus_tag="Rv1367c" /function="UNKNOWN (POSSIBLY INVOLVED IN CELL WALL BIOSYNTHESIS)." /note="Rv1367c, (MTCY02B12.01c,MTCY02B10.31c), len: 377 aa. Conserved hypothetical protein. Some similarity to penicillin binding proteins e.g. PBPE_BACSU|P32959 penicillin-binding protein 4* (pbp 4*) from Bacillus subtilis (451 aa), FASTA scores: E(): 6.9e-06, (23.6% identity in 373 aa overlap). Similar to AL031107|SC5A7.06 hypothetical protein from Streptomyces coelicolor (409 aa), FASTA scores: opt: 675, E(): 0, (40.4% identity in 339 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215883.1" /db_xref="GI:15608507" /db_xref="GeneID:886793" /translation="MVWQREKLLQVNEIGYRDIDAGVPMQRDTLFRIASMTKPVTVAA AMSLVDEGKLALRDPITRWAPELCKVAVLDDAAGPLDRTHPARRAILIEDLLTHTSGL AYGFSVSGPISRAYQRLPFGQGPDVWLAALATLPLVHQPGDRVTYSHAIDVLGVIVSR IEDAPLYQIIDERVLGPAGMTDTGFYVSADAQRRAATMYRLDEQDRLRHDVMGPPHVT PPSFCNAGGGLWSTADDYLRFVRMLLGDGTVDGVRVLSPESVRLMRTDRLTDEQKRHS FLGAPFWVGRGFGLNLSVVTDPAKSRPLFGPGGLGTFSWPGAYGTWWQADPSADLILL YLIQHCPDLSVDAAAAVAGNPSLAKLRTAQPKFVRRTYRALGL" gene 1541020..1541805 /gene="lprF" /locus_tag="Rv1368" /db_xref="GeneID:886798" CDS 1541020..1541805 /gene="lprF" /locus_tag="Rv1368" /function="UNKNOWN" /note="Rv1368, (MTCY02B12.02), len: 261 aa. Probable lprF, conserved lipoprotein; similar to Mycobacterium tuberculosis hypothetical lipoproteins e.g. Rv1270c|Y08C_MYCTU|Q11049 hypothetical 26.4 kDa protein cy50.12. (257 aa), FASTA scores: opt: 286, E(): 5.3e-11, (26.3% identity in 270 aa overlap), also Rv1411c|MTCY21B4.28c, (32.8% identity in 253 aa overlap) and Rv2945c. Contains possible N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). BELONGS TO THE LPPX/LPRAFG FAMILY OF LIPOPROTEINS." /codon_start=1 /transl_table=11 /product="lipoprotein LprF" /protein_id="NP_215884.1" /db_xref="GI:15608508" /db_xref="GeneID:886798" /translation="MNGLISQACGSHRPRRPSSLGAVAILIAATLFATVVAGCGKKPT TASSPSPGSPSPEAQQILQDSSKATKGLHSVHVVVTVNNLSTLPFESVDADVTNQPQG NGQAVGNAKVRMKPNTPVVATEFLVTNKTMYTKRGGDYVSVGPAEKIYDPGIILDKDR GLGAVVGQVQNPTIQGRDAIDGLATVKVSGTIDAAVIDPIVPQLGKGGGRLPITLWIV DTNASTPAPAANLVRMVIDKDQGNVDITLSNWGAPVTIPNPAG" repeat_region 1541949..1541951 /note="3 bp direct repeat, CGG, at 3' end of IS6110 target sequence" repeat_region complement(1541952..1543306) /note="IS6110-2, len: 1355 bp. Almost identical to Insertion sequence IS986 element." /mobile_element="insertion sequence:IS6110-2" gene complement(1541994..1542878) /locus_tag="Rv1369c" /db_xref="GeneID:886789" CDS complement(1541994..1542878) /locus_tag="Rv1369c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv1369c, (MTCY02B12.03c), len: 294 aa. Probable transposase subunit for IS6110, identical from aa 69 to TRA9_MYCTU|P19774 putative transposase for insertion sequence." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215885.1" /db_xref="GI:15608509" /db_xref="GeneID:886789" /translation="MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKE HISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIAD PATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVAST MATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGA VGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVP PVELEAAYYAQRQRPAAG" gene complement(1542929..1543255) /locus_tag="Rv1370c" /db_xref="GeneID:886791" CDS complement(1542929..1543255) /locus_tag="Rv1370c" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv1370c, (MTCY02B12.04c), len: 108 aa. Probable transposase subunit for IS6110, highly similar to G309867 IS401 transposase subunit (107 aa), FASTA scores: opt: 325, E(): 2.7e-1 6, (52.9% identity in 102 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_215886.1" /db_xref="GI:15608510" /db_xref="GeneID:886791" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region 1543307..1543309 /note="3 bp direct repeat, CGG, at 5' end of IS6110 target sequence" gene 1543359..1544828 /locus_tag="Rv1371" /db_xref="GeneID:886800" CDS 1543359..1544828 /locus_tag="Rv1371" /function="UNKNOWN" /note="Rv1371, (MTCY02B12.05), len: 489 aa. Probable membrane protein. Weak similarity to delta 5 fatty acid desaturases e.g. AB022097|AB022097_1 Dictyostelium discoideum (467 aa), FASTA score: opt: 173, E(): 0.00052, (22.4% identity in 438 aa overlap); and Homo sapiens." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215887.1" /db_xref="GI:15608511" /db_xref="GeneID:886800" /translation="MTNDLPDVRERDGGPRPAPPAGGPRLSDVWVYNGRAYDLSEWIS KHPGGAFFIGRTKNRDITAIVKSYHRDPAIVERILQRRYALGRDATPRDIHPKHNAPA FLFKDDFNSWRDTPKYRFDDPNDLLHRVKARLAEPALAARIKRMDTLFNAIVAVLAVG YFAVQGVRLVEPSWMPLWAFVIAMVLLRSSLAGFGHYALHRAQRGLNRVFNNAFDLNY VALSLVTADGHTLLHHPYTQSEVDIKKNVFTMMMRLPWLYRVPVHTIHKFGHMLSGMA IRIVDVFRITRKVGVEESYGSWRAALPHFLGSAGVRLLLVSELVVFAIAGDFWPWALQ FVATLWVSTFLVVASHEFEDDTQGGAVNGEDWGIDQLEHANDLTVIGNRYVDCFLSAG LSSHRVHHVLPFQRSGFANIVTEDVLREEAAKFGVEWLPAKGFITDRLPRLCRKYLLT PSRQAKERHWGFVREHCSPAALKASASYVVAGFVGIGSV" gene 1544825..1546006 /locus_tag="Rv1372" /db_xref="GeneID:886797" CDS 1544825..1546006 /locus_tag="Rv1372" /function="UNKNOWN" /note="Rv1372, (MTCY02B12.06), len: 393 aa. Conserved hypothetical protein, similar to several chalcone synthases e.g. CHS2_GERHY|P48391 chalcone synthase 2 from gerbra hybrid (402 aa), FASTA scores: opt: 511, E(): 7e-26, (28.4% identity in 380 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical chalcone synthases, Rv1665, Rv1660." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177803.1" /db_xref="GI:57116854" /db_xref="GeneID:886797" /translation="MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPR RVVNQSDAADRVAELFLDPGQRERIPRVYQKSRITTRRMAVDPLDAKFDVFRREPATI RDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPGVDVAIVKELG LSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVVCIELCSVNAVFADDINDV VIHSLFGDGCAALVIGASQVQEKLEPGKVVVRSSFSQLLDNTEDGIVLGVNHNGITCE LSENLPGYIFSGVAPVVTEMLWDNGLQISDIDLWAIHPGGPKIIEQSVRSLGISAELA AQSWDVLARFGNMLSVSLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIR R" gene 1546012..1546992 /locus_tag="Rv1373" /db_xref="GeneID:886781" CDS 1546012..1546992 /locus_tag="Rv1373" /EC_number="2.8.2.-" /function="INVOLVED IN SULFATION: ACTIVITY TOWARDS TYPICAL CERAMIDE GLYCOLIPIDS AND TREHALOSE GLYCOLIPIDS." /experiment="experimental evidence, no additional details recorded" /note="Rv1373, (MTCY02B12.07), len: 326 aa. Glycolipid sulfotransferase (EC 2.8.2.-) (see citation below); slight similarity to sulfotransferases e.g. SUOE_CAVPO|P49887 estrogen sulfotransferase from Cavia porcellus (Guinea pig) (EC 2.8.2.4) (296 aa), FASTA scores, opt: 165, E():0.00054, (24.5% identity in 294 aa overlap)." /codon_start=1 /transl_table=11 /product="glycolipid sulfotransferase" /protein_id="NP_215889.1" /db_xref="GI:15608513" /db_xref="GeneID:886781" /translation="MNSEHPMTDRVVYRSLMADNLRWDALQLRDGDIIISAPSKSGLT WTQRLVSLLVFDGPDLPGPLSTVSPWLDQTIRPIEEVVATLDAQQHRRFIKTHTPLDG LVLDDRVSYICVGRDPRDAAVSMLYQSANMNEDRMRILHEAVVPFHERIAPPFAELGH ARSPTEEFRDWMEGPNQPPPGIGFTHLKGIGTLANILHQLGTVWVRRHLPNVALFHYA DYQADLAGELLRPARVLGIAATRDRARDLAQYATLDAMRSRASEIAPNTTDGIWHSDE RFFRRGGSGDWQQFFTEAEHLRYYHRINQLAPPDLLAWAHEGRRGYDPAN" gene complement(1547072..1547530) /locus_tag="Rv1374c" /db_xref="GeneID:886783" CDS complement(1547072..1547530) /locus_tag="Rv1374c" /function="UNKNOWN" /note="Rv1374c, (MTCY02B12.08c), len: 152 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215890.2" /db_xref="GI:57116855" /db_xref="GeneID:886783" /translation="MVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPAR PNAPIGARSFAVGRKICRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCREV GNYAQRRVGRFAFFEQTFVRHALTPRCSRTDSKTSYTQLNRICKFPPHWV" gene 1547832..1549151 /locus_tag="Rv1375" /db_xref="GeneID:886778" CDS 1547832..1549151 /locus_tag="Rv1375" /function="UNKNOWN" /note="Rv1375, (MTCY02B12.09), len: 439 aa. Conserved hypothetical protein, similar to hypothetical proteins from several organisms e.g. Q52871|U39409 Rhizobium leguminosarum (420 aa), FASTA scores: E(): 2e-30, (34.4% identity in 378 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215891.1" /db_xref="GI:15608515" /db_xref="GeneID:886778" /translation="MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWP SRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQ AVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYD PAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDT TGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDA GDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITA ISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATA VANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE" gene 1549148..1550641 /locus_tag="Rv1376" /db_xref="GeneID:886777" CDS 1549148..1550641 /locus_tag="Rv1376" /function="UNKNOWN" /note="Rv1376, (MTCY02B12.10), len: 497 aa. Conserved hypothetical protein, some similarity to hypothetical proteins from several organisms e.g. Q52872|U39409 Rhizobium leguminosarum (247 aa), FASTA scores: E(): 2.1e-12, (34.7% identity in 219 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215892.1" /db_xref="GI:15608516" /db_xref="GeneID:886777" /translation="MTACGRIVVTAGPTISAADIRSVVPDAEVAPPIAFGQALSYDLR SGDTLLIVDGLFFQQPSVRHKELLTLMADGVRVVGSSSMGALRAAELHPFGMEGYGWV FESYRDGVLEADDEVGVVHGDADDGYPVFVDALVNMRHTLARAVATGVVCSELAERII ETARATPFTMRTWARLLSEVGAPDQRGLAAQLRSLRVDVKHADALLALRQLGQRPRVE PLRPGPPPTVWSRRWRQRWAPPTSVAASADHGESFVDVTDLEVLSFLSVSSVDYWAYR PALQQVAAWYWTLKHPEQSGSVGERAARAVAEVASEGYGRALEFIAYRYALATGIIDE TGFPEAVAAHWLTTEERHGLGNDPISISARVITRTLFVVRLLPAIDHFLDLLRKDSRL PRWRAMAAHALCKRDDLARQKPHLNLGRPDPTQLKRLFGARWGTQVNRIELARRGLMT EDAFYAAATPFAVAAVDDQLPRIEVGTLGPAPLSADVPERHFDFGSV" gene complement(1550579..1551217) /locus_tag="Rv1377c" /db_xref="GeneID:886775" CDS complement(1550579..1551217) /locus_tag="Rv1377c" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1377c, (MTCY02B12.11c), len: 212 aa. Putative transferase (EC 2.-.-.-), similar to YQEM_BACSU|P54458 hypothetical 28.3 kDa protein from Bacillus subtilis (247 aa), FASTA scores: opt: 221, E(): 7.6e-08, (30.6% identity in 144 aa overlap); some similarity to methyltransferases, also similar to Mycobacterium tuberculosis hypothetical proteins Rv0560c, Rv3699, and Rv2675c ( 39.1% identity in 197 aa overlap)." /codon_start=1 /transl_table=11 /product="putative transferase" /protein_id="NP_215893.1" /db_xref="GI:15608517" /db_xref="GeneID:886775" /translation="MPGIDFDALYRGESPGEGLPPITTPPWDTKAPKDNVIGWHTGGW VHGDVLDIGCGLGDNAIYLARNGYQVTGLDISPTALTTAKRRASDAGVDVKFAVGDAT KLTGYTGAFDTVIDCGMFHCLDDDGKRSYAASVHRATRPGATLLLSCFSNAMPPDEEW PRSTVSEQTLRDVLGGAGWDIESLEPATVRRELDGTEVEMAFWNVRAQRRGS" gene complement(1551228..1552655) /locus_tag="Rv1378c" /db_xref="GeneID:886773" CDS complement(1551228..1552655) /locus_tag="Rv1378c" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv1378c, (MTCY02B12.12c), len: 475 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv3074|MTCY22D7.07C (424 aa), FASTA scores: E(): 0, (73.0% identity in 429 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215894.1" /db_xref="GI:15608518" /db_xref="GeneID:886773" /translation="MGNLDLLLRLSGRIVKGCRPLGSVALARCGPAVRWPRWPRPAIL EHMFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAG VPARRRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATL IVRESACLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARA ETERTVTIRPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVER VTGQPAEAAQPVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSR ATLRRLYRHPRSGALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPH HRGGPTTATNGLGSCERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPPL PGPLEIDVSQVEARIGVALTHLHAA" gene 1552654..1553235 /gene="pyrR" /locus_tag="Rv1379" /db_xref="GeneID:886769" CDS 1552654..1553235 /gene="pyrR" /locus_tag="Rv1379" /EC_number="2.4.2.9" /function="BINDS TO THE CONSERVED SEQUENCE IN THE PYR OPERON MRNA AND DISRUPTS THE ANTITERMINATOR, PERMITTING TERMINATOR HAIRPIN FORMATION AND PROMOTING TRANSCRIPTION TERMINATION" /note="regulates pyrimidine biosynthesis by binding to the mRNA of the pyr genes, also has been shown to have uracil phosphoribosyltransferase activity" /codon_start=1 /transl_table=11 /product="bifunctional pyrimidine regulatory protein PyrR uracil phosphoribosyltransferase" /protein_id="NP_215895.1" /db_xref="GI:15608519" /db_xref="GeneID:886769" /translation="MGAAGDAAIGRESRELMSAADVGRTISRIAHQIIEKTALDDPVG PDAPRVVLLGIPTRGVTLANRLAGNITEYSGIHVGHGALDITLYRDDLMIKPPRPLAS TSIPAGGIDDALVILVDDVLYSGRSVRSALDALRDVGRPRAVQLAVLVDRGHRELPLR ADYVGKNVPTSRSESVHVRLREHDGRDGVVISR" gene 1553232..1554191 /gene="pyrB" /locus_tag="Rv1380" /db_xref="GeneID:886771" CDS 1553232..1554191 /gene="pyrB" /locus_tag="Rv1380" /EC_number="2.1.3.2" /function="INVOLVED IN PYRIMIDINE BIOSYNTHESIS (SECOND STEP) [CATALYTIC ACTIVITY : CARBAMOYL PHOSPHATE + L-ASPARTATE = PHOSPHATE + N-CARBAMOYL-L-ASPARTATE]" /note="catalyzes the transfer of the carbamoyl moiety from carbamoyl phosphate to L- aspartate in pyrimidine biosynthesis" /codon_start=1 /transl_table=11 /product="aspartate carbamoyltransferase catalytic subunit" /protein_id="NP_215896.1" /db_xref="GI:15608520" /db_xref="GeneID:886771" /translation="MTPRHLLTAADLSRDDATAILDDADRFAQALVGRDIKKLPTLRG RTVVTMFYENSTRTRVSFEVAGKWMSADVINVSAAGSSVGKGESLRDTALTLRAAGAD ALIIRHPASGAAHLLAQWTGAHNDGPAVINAGDGTHEHPTQALLDALTIRQRLGGIEG RRIVIVGDILHSRVARSNVMLLDTLGAEVVLVAPPTLLPVGVTGWPATVSHDFDAELP AADAVLMLRVQAERMNGGFFPSVREYSVRYGLTERRQAMLPGHAVVLHPGPMVRGMEI TSSVADSSQSAVLQQVSNGVQVRMAVLFHVLVGAQDAGKEGAA" misc_feature 1553382..1553405 /gene="pyrB" /locus_tag="Rv1380" /note="PS00097 Aspartate and ornithine carbamoyltransferases signature" gene 1554188..1555480 /gene="pyrC" /locus_tag="Rv1381" /db_xref="GeneID:886765" CDS 1554188..1555480 /gene="pyrC" /locus_tag="Rv1381" /EC_number="3.5.2.3" /function="INVOLVED IN PYRIMIDINE BIOSYNTHESIS (THIRD STEP) [CATALYTIC ACTIVITY: (S)-DIHYDROOROTATE + H(2)O = N-CARBAMOYL-L-ASPARTATE]" /note="catalyzes the formation of N-carbamoyl-L-aspartate from (S)-dihydroorotate in pyrimidine biosynthesis" /codon_start=1 /transl_table=11 /product="dihydroorotase" /protein_id="NP_215897.1" /db_xref="GI:15608521" /db_xref="GeneID:886765" /translation="MSVLIRGVRPYGEGERVDVLVDDGQIAQIGPDLAIPDTADVIDA TGHVLLPGFVDLHTHLREPGREYAEDIETGSAAAALGGYTAVFAMANTNPVADSPVVT DHVWHRGQQVGLVDVHPVGAVTVGLAGAELTEMGMMNAGAAQVRMFSDDGVCVHDPLI MRRALEYATGLGVLIAQHAEEPRLTVGAVAHEGPMAARLGLAGWPRAAEESIVARDAL LARDAGARVHICHASAAGTVEILKWAKDQGISITAEVTPHHLLLDDARLASYDGVNRV NPPLREASDAVALRQALADGIIDCVATDHAPHAEHEKCVEFAAARPGMLGLQTALSVV VQTMVAPGLLSWRDIARVMSENPACIARLPDQGRPLEVGEPANLTVVDPDATWTVTGA DLASRSANTPFESMSLPATVTATLLRGKVTARDGKIRA" misc_feature 1555091..1555126 /gene="pyrC" /locus_tag="Rv1381" /note="PS00483 Dihydroorotase signature 2" gene 1555477..1555974 /locus_tag="Rv1382" /db_xref="GeneID:886767" CDS 1555477..1555974 /locus_tag="Rv1382" /function="UNKNOWN" /note="Rv1382, (MTCY02B12.16), len: 165 aa. Possible exported or membrane protein, hydrophobic domain at N-terminus." /codon_start=1 /transl_table=11 /product="export or membrane protein" /protein_id="NP_215898.1" /db_xref="GI:15608522" /db_xref="GeneID:886767" /translation="MNSGTLAGSLIFAAVLVMLIAVLARLMMRGWRRRSERQAELLGD LPDVPEHVSSATVTTRGLYVGATLSPAWNERVTVGDLGYRSKAVLTRYPSGIMVERAR AQPIWIPTESIAAIRMERGVAGKVVAGIGILAIRWRLPSGTEIDVGFRADNRDEYQEW LEEPV" gene 1555971..1557101 /gene="carA" /locus_tag="Rv1383" /db_xref="GeneID:886761" CDS 1555971..1557101 /gene="carA" /locus_tag="Rv1383" /EC_number="6.3.5.5" /function="INVOLVED IN BOTH ARGININE AND PYRIMIDINE BIOSYNTHESIS [CATALYTIC ACTIVITY : 2 ATP + L-GLUTAMINE + CO(2) + H(2)O = 2 ADP + PHOSPHATE + GLUTAMATE + CARBAMOYL PHOSPHATE]" /note="catalyzes production of carbamoyl phosphate from bicarbonate and glutamine in pyrimidine and arginine biosynthesis pathways; forms an octamer composed of four CarAB dimers" /codon_start=1 /transl_table=11 /product="carbamoyl phosphate synthase small subunit" /protein_id="NP_215899.1" /db_xref="GI:15608523" /db_xref="GeneID:886761" /translation="MSKAVLVLEDGRVFTGRPFGATGQALGEAVFSTGMSGYQETLTD PSYHRQIVVATAPQIGNTGWNGEDSESRGERIWVAGYAVRDPSPRASNWRATGTLEDE LIRQRIVGIAGIDTRAVVRHLRSRGSMKAGVFSDGALAEPADLIARVRAQQSMLGADL AGEVSTAEPYVVEPDGPPGVSRFTVAALDLGIKTNTPRNFARRGIRCHVLPASTTFEQ IAELNPHGVFLSNGPGDPATADHVVALTREVLGAGIPLFGICFGNQILGRALGLSTYK MVFGHRGINIPVVDHATGRVAVTAQNHGFALQGEAGQSFATPFGPAVVSHTCANDGVV EGVKLVDGRAFSVQYHPEAAAGPHDAEYLFDQFVELMAGEGR" misc_feature 1556733..1556768 /gene="carA" /locus_tag="Rv1383" /note="PS00442 Glutamine amidotransferases class-I active site" gene 1557101..1560448 /gene="carB" /locus_tag="Rv1384" /db_xref="GeneID:886253" CDS 1557101..1560448 /gene="carB" /locus_tag="Rv1384" /EC_number="6.3.5.5" /function="INVOLVED IN BOTH ARGININE AND PYRIMIDINE BIOSYNTHESIS [CATALYTIC ACTIVITY : 2 ATP + L-GLUTAMINE + CO(2) + H(2)O = 2 ADP + PHOSPHATE + GLUTAMATE + CARBAMOYL PHOSPHATE.]" /note="four CarB-CarA dimers form the carbamoyl phosphate synthetase holoenzyme that catalyzes the production of carbamoyl phosphate; CarB is responsible for the amidotransferase activity" /codon_start=1 /transl_table=11 /product="carbamoyl phosphate synthase large subunit" /protein_id="YP_177804.1" /db_xref="GI:57116856" /db_xref="GeneID:886253" /translation="MPRRTDLHHVLVIGSGPIVIGQACEFDYSGTQACRVLRAEGLQV SLVNSNPATIMTDPEFADHTYVEPITPAFVERVIAQQAERGNKIDALLATLGGQTALN TAVALYESGVLEKYGVELIGADFDAIQRGEDRQRFKDIVAKAGGESARSRVCFTMAEV RETVAELGLPVVVRPSFTMGGLGSGIAYSTDEVDRMAGAGLAASPSANVLIEESIYGW KEFELELMRDGHDNVVVVCSIENVDPMGVHTGDSVTVAPAMTLTDREYQRMRDLGIAI LREVGVDTGGCNIQFAVNPRDGRLIVIEMNPRVSRSSALASKATGFPIAKIAAKLAIG YTLDEIVNDITGETPACFEPTLDYVVVKAPRFAFEKFPGADPTLTTTMKSVGEAMSLG RNFVEALGKVMRSLETTRAGFWTAPDPDGGIEEALTRLRTPAEGRLYDIELALRLGAT VERVAEASGVDPWFIAQINELVNLRNELVAAPVLNAELLRRAKHSGLSDHQIASLRPE LAGEAGVRSLRVRLGIHPVYKTVDTCAAEFEAQTPYHYSSYELDPAAETEVAPQTERP KVLILGSGPNRIGQGIEFDYSCVHAATTLSQAGFETVMVNCNPETVSTDYDTADRLYF EPLTFEDVLEVYHAEMESGSGGPGVAGVIVQLGGQTPLGLAHRLADAGVPIVGTPPEA IDLAEDRGAFGDLLSAAGLPAPKYGTATTFAQARRIAEEIGYPVLVRPSYVLGGRGME IVYDEETLQGYITRATQLSPEHPVLVDRFLEDAVEIDVDALCDGAEVYIGGIMEHIEE AGIHSGDSACALPPVTLGRSDIAKVRKATEAIAHGIGVVGLLNVQYALKDDVLYVLEA NPRASRTVPFVSKATAVPLAKACARIMLGATIAQLRAEGLLAVTGDGAHAARNAPIAV KEAVLPFHRFRRADGAAIDSLLGPEMKSTGEVMGIDRDFGSAFAKSQTAAYGSLPAQG TVFVSVANRDKRSLVFPVKRLADLGFRVLATEGTAEMLRRNGIPCDDVRKHFEPAQPG RPTMSAVDAIRAGEVNMVINTPYGNSGPRIDGYEIRSAAVAGNIPCITTVQGASAAVQ GIEAGIRGDIGVRSLQELHRVIGGVER" misc_feature 1558004..1558027 /gene="carB" /locus_tag="Rv1384" /note="PS00867 Carbamoyl-phosphate synthase subdomain signature 2" misc_feature 1559270..1559314 /gene="carB" /locus_tag="Rv1384" /note="PS00866 Carbamoyl-phosphate synthase subdomain signature 1" misc_feature 1559657..1559680 /gene="carB" /locus_tag="Rv1384" /note="PS00867 Carbamoyl-phosphate synthase subdomain signature 2" gene 1560445..1561269 /gene="pyrF" /locus_tag="Rv1385" /db_xref="GeneID:886763" CDS 1560445..1561269 /gene="pyrF" /locus_tag="Rv1385" /EC_number="4.1.1.23" /function="INVOLVED IN THE BIOSYNTHESIS OF PYRIMIDINES [CATALYTIC ACTIVITY : OROTIDINE 5'-PHOSPHATE = UMP + CO(2)]" /note="OMP decarboxylase; OMPDCase; OMPdecase; type 2 subfamily; involved in last step of pyrimidine biosynthesis; converts orotidine 5'-phosphate to UMP and carbon dioxide; OMP decarboxylase; OMPDCase; OMPdecase" /codon_start=1 /transl_table=11 /product="orotidine 5'-phosphate decarboxylase" /protein_id="NP_215901.1" /db_xref="GI:15608524" /db_xref="GeneID:886763" /translation="MTGFGLRLAEAKARRGPLCLGIDPHPELLRGWDLATTADGLAAF CDICVRAFADFAVVKPQVAFFESYGAAGFAVLERTIAELRAADVLVLADAKRGDIGAT MSAYATAWVGDSPLAADAVTASPYLGFGSLRPLLEVAAAHGRGVFVLAATSNPEGAAV QNAAADGRSVAQLVVDQVGAANEAAGPGPGSIGVVVGATAPQAPDLSAFTGPVLVPGV GVQGGRPEALGGLGGAASSQLLPAVAREVLRAGPGVPELRAAGERMRDAVAYLAAV" misc_feature 1560712..1560753 /gene="pyrF" /locus_tag="Rv1385" /note="PS00156 Orotidine 5'-phosphate decarboxylase active site" gene 1561464..1561772 /gene="PE15" /locus_tag="Rv1386" /db_xref="GeneID:886757" CDS 1561464..1561772 /gene="PE15" /locus_tag="Rv1386" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1386, (MTCY21B4.03), len: 102 aa. Member of Mycobacterium tuberculosis PE family (see Brennan & Delogu 2002), similar to many e.g. G913039 ORF 3' OF PGRS TANDEM REPEAT (polymorphic GC-rich sequence) (100 aa), FASTA scores: opt: 149, E(): 0.0013, (31.5% identity in 92 aa overlap); also similar to Q49943|U1756A (99 aa) (34.7% identity in 95 aa overlap) and G466937|U1620K (100 aa) (36.2% identity in 69 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177805.1" /db_xref="GI:57116857" /db_xref="GeneID:886757" /translation="MTLRVVPESLAGASAAIEAVTARLAAAHAAAAPFIAAVIPPGSD SVSVCNAVEFSVHGSQHVAMAAQGVEELGRSGVGVAESGASYAARDALAAASYLSGGL" gene 1561769..1563388 /gene="PPE20" /locus_tag="Rv1387" /db_xref="GeneID:886784" CDS 1561769..1563388 /gene="PPE20" /locus_tag="Rv1387" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1387, (MTCY21B4.04), len: 539 aa. Member of Mycobacterium tuberculosis PPE family of proteins, similar to many e.g. Y05F_MYCTU|Q10892 hypothetical 46.9 kd protein cy251.15 (463 aa), FASTA scores: E(): 4.2e-26, (37.7% identity in 531 aa overlap); similar also to MTCY274.23c (37.5% identity in 168 aa overlap). Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177806.1" /db_xref="GI:57116858" /db_xref="GeneID:886784" /translation="MTEPWIAFPPEVHSAMLNYGAGVGPMLISATQNGELSAQYAEAA SEVEELLGVVASEGWQGQAAEAFVAAYMPFLAWLIQASADCVEMAAQQHVVIEAYTAA VELMPTQVELAANQIKLAVLVATNFFGINTIPIAINEAEYVEMWVRAATTMATYSTVS RSALSAMPHTSPPPLILKSDELLPDTGEDSDEDGHNHGGHSHGGHARMIDNFFAEILR GVSAGRIVWDPVNGTLNGLDYDDYVYPGHAIWWLARGLEFFQDGEQFGELLFTNPTGA FQFLLYVVVVDLPTHIAQIATWLGQYPQLLSAALTGVIAHLGAITGLAGLSGLSAIPS AAIPAVVPELTPVAAAPPMLAVAGVGPAVAAPGMLPASAPAPAAAAGATAAGPTPPAT GFGGFPPYLVGGGGPGIGFGSGQSAHAKAAASDSAAAESAAQASARAQARAARRGRSA AKARGHRDEFVTMDMGFDAAAPAPEHQPGARASDCGAGPIGFAGTVRKEAVVKAAGLT TLAGDDFGGGPTMPMMPGTWTHDQGVFDEHR" misc_feature 1562315..1562332 /gene="PPE20" /locus_tag="Rv1387" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene 1563694..1564266 /gene="mihF" /locus_tag="Rv1388" /db_xref="GeneID:886751" CDS 1563694..1564266 /gene="mihF" /locus_tag="Rv1388" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1388, (MTCY21B4.05), len: 190 aa. Putative mihF, integration host factor. Almost identical to, but longer than, P96802|U75344 Mycobacterium smegmatis integration host factor (mIHF) for mycobacteriophage L5 (105 aa), FASTA scores: E(): 0, (96.1% identity in 102 aa overlap)." /codon_start=1 /transl_table=11 /product="putative integration host factor MIHF" /protein_id="NP_215904.1" /db_xref="GI:15608527" /db_xref="GeneID:886751" /translation="MLGNTIHVPCQPCRHGHGAPSRGLRGRPADRWPVARATPTLHVC PQNQGVGLDFVRKPEYGRLRWPAYPAGTNNDRLISMRDGGIVALPQLTDEQRAAALEK AAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALPKVGKVKAQEI MTELEIAPTRRLRGLGDRQRKALLEKFGSA" gene 1564401..1565027 /gene="gmk" /locus_tag="Rv1389" /db_xref="GeneID:886787" CDS 1564401..1565027 /gene="gmk" /locus_tag="Rv1389" /EC_number="2.7.4.8" /function="ESSENTIAL FOR RECYCLING GMP AND INDIRECTLY, CGMP [CATALYTIC ACTIVITY : ATP + GMP = ADP + GDP]" /note="Essential for recycling GMP and indirectly, cGMP" /codon_start=1 /transl_table=11 /product="guanylate kinase" /protein_id="NP_215905.1" /db_xref="GI:15608528" /db_xref="GeneID:886787" /translation="MSVGEGPDTKPTARGQPAAVGRVVVLSGPSAVGKSTVVRCLRER IPNLHFSVSATTRAPRPGEVDGVDYHFIDPTRFQQLIDQGELLEWAEIHGGLHRSGTL AQPVRAAAATGVPVLIEVDLAGARAIKKTMPEAVTVFLAPPSWQDLQARLIGRGTETA DVIQRRLDTARIELAAQGDFDKVVVNRRLESACAELVSLLVGTAPGSP" misc_feature 1564482..1564505 /gene="gmk" /locus_tag="Rv1389" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1564563..1564616 /gene="gmk" /locus_tag="Rv1389" /note="PS00856 Guanylate kinase signature" gene 1565093..1565425 /gene="rpoZ" /locus_tag="Rv1390" /db_xref="GeneID:886754" CDS 1565093..1565425 /gene="rpoZ" /locus_tag="Rv1390" /EC_number="2.7.7.6" /function="PROMOTES RNA POLYMERASE ASSEMBLY. LATCHES THE N-AND C-TERMINAL REGIONS OF THE BETA' SUBUNIT THEREBY FACILTATING ITS INTERACTION WITH THE BETA AND ALPHA SUBUNITS (BY SIMILARITY) [CATALYTIC ACTIVITY: N nucleoside triphosphate = N diphosphate + {RNA}N]." /note="Promotes RNA polymerase assembly. Latches the N- and C-terminal regions of the beta' subunit thereby facilitating its interaction with the beta and alpha subunits" /codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit omega" /protein_id="NP_215906.1" /db_xref="GI:15608529" /db_xref="GeneID:886754" /translation="MSISQSDASLAAVPAVDQFDPSSGASGGYDTPLGITNPPIDELL DRVSSKYALVIYAAKRARQINDYYNQLGEGILEYVGPLVEPGLQEKPLSIALREIHAD LLEHTEGE" gene 1565441..1566697 /gene="dfp" /locus_tag="Rv1391" /db_xref="GeneID:886749" CDS 1565441..1566697 /gene="dfp" /locus_tag="Rv1391" /EC_number="4.1.1.36" /EC_number="6.3.2.5" /function="FLAVOPROTEIN AFFECTING SYNTHESIS OF DNA AND PANTOTHENATE METABOLISM" /note="catalyzes the conjugation of cysteine to 4'-phosphopantothenate to form 4-phosphopantothenoylcysteine, which is then decarboxylated to form 4'-phosphopantotheine" /codon_start=1 /transl_table=11 /product="bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase" /protein_id="NP_215907.1" /db_xref="GI:15608530" /db_xref="GeneID:886749" /translation="MVDHKRIPKQVIVGVSGGIAAYKACTVVRQLTEASHRVRVIPTE SALRFVGAATFEALSGEPVCTDVFADVPAVPHVHLGQQADLVVVAPATADLLARAAAG RADDLLTATLLTARCPVLFAPAMHTEMWLHPATVDNVATLRRRGAVVLEPATGRLTGA DSGAGRLPEAEEITTLAQLLLERHDALPYDLAGRKLLVTAGGTREPIDPVRFIGNRSS GKQGYAVARVAAQRGADVTLIAGHTAGLVDPAGVEVVHVSSAQQLADAVSKHAPTADV LVMAAAVADFRPAQVATAKIKKGVEGPPTIELLRNDDVLAGVVRARAHGQLPNMRAIV GFAAETGDANGDVLFHARAKLRRKGCDLLVVNAVGEGRAFEVDSNDGWLLASDGTESA LQHGSKTLMASRIVDAIVTFLAGCSS" gene 1566825..1568036 /gene="metK" /locus_tag="Rv1392" /db_xref="GeneID:886741" CDS 1566825..1568036 /gene="metK" /locus_tag="Rv1392" /EC_number="2.5.1.6" /function="Involved in the activated Methyl cycle. Catalyzes the formation of S-adenosylmethionine from methionine and ATP. The overall synthetic reaction is composed of two sequential steps, AdoMet formation and the subsequent tripolyphosphate hydrolysis which occurs prior to release of AdoMet from the enzyme. [CATALYTIC ACTIVITY : ATP + L-METHIONINE + H(2)O = PHOSPHATE + DIPHOSPHATE + S-ADENOSYL-L-METHIONINE]" /experiment="experimental evidence, no additional details recorded" /note="methionine adenosyltransferase; catalyzes the formation of S-adenosylmethionine from methionine and ATP; methionine adenosyltransferase" /codon_start=1 /transl_table=11 /product="S-adenosylmethionine synthetase" /protein_id="NP_215908.1" /db_xref="GI:15608531" /db_xref="GeneID:886741" /translation="MSEKGRLFTSESVTEGHPDKICDAISDSVLDALLAADPRSRVAV ETLVTTGQVHVVGEVTTSAKEAFADITNTVRARILEIGYDSSDKGFDGATCGVNIGIG AQSPDIAQGVDTAHEARVEGAADPLDSQGAGDQGLMFGYAINATPELMPLPIALAHRL SRRLTEVRKNGVLPYLRPDGKTQVTIAYEDNVPVRLDTVVISTQHAADIDLEKTLDPD IREKVLNTVLDDLAHETLDASTVRVLVNPTGKFVLGGPMGDAGLTGRKIIVDTYGGWA RHGGGAFSGKDPSKVDRSAAYAMRWVAKNVVAAGLAERVEVQVAYAIGKAAPVGLFVE TFGTETEDPVKIEKAIGEVFDLRPGAIIRDLNLLRPIYAPTAAYGHFGRTDVELPWEQ LDKVDDLKRAI" misc_feature 1567215..1567232 /gene="metK" /locus_tag="Rv1392" /note="PS00376 S-adenosylmethionine synthetase signature 1" misc_feature 1567659..1567685 /gene="metK" /locus_tag="Rv1392" /note="PS00377 S-adenosylmethionine synthetase signature 2" gene complement(1568109..1569587) /locus_tag="Rv1393c" /db_xref="GeneID:886743" CDS complement(1568109..1569587) /locus_tag="Rv1393c" /EC_number="1.14.13.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1393c, (MTCY21B4.10c), len: 492 aa. Probable monooxygenase (EC 1.14.13.-), similar to others e.g. CYMO_ACISP|P12015 cyclohexanone monooxygenase (EC 1.14.13.22) from Acinetobacter sp. (542 aa), FASTA scores: E(): 0, (33.0% identity in 473 aa overlap); also to Rv3083|MTCY31.20|E241788 hypothetical 55.0 kDa protein from Mycobacterium tuberculosis (495 aa) (36.3% identity in 490 aa overlap); and Rv0565c, Rv3854c, Rv3049c, Rv0892." /codon_start=1 /transl_table=11 /product="monoxygenase" /protein_id="NP_215909.1" /db_xref="GI:15608532" /db_xref="GeneID:886743" /translation="MMPDYHALIVGAGFSGIGAAIKLDRAGFSDYLVVEAGDGVGGTW HWNTYPGIAVDIPSFSYQFSFEQSRHWSRTYAPGHELKAYAEHCVDKYGIRSRIRLNT KVLAAEFDDEHSLWRVQTDPGGEITARFLISACGILTVPKLPDIDGVDSFEGVTMHTA RWDHTQDLTGKRVGIIGTGASAVQVIPEMAPIVSHLTVFQRTPIWCFPKFDVPLPTAV RWAMRIPGGKAVHRLLSQAFVEATFPIAAHYFAVFPLAKHMESAGRRYLRQQVHDPVV REQLTPRYAVGCKRPGFHNTYLSTFNRDNVRLVTEPIDKITPTAVATTDGASHEIDVL VLATGFKVLDTDSIPTYAVTGTGGASLSRFWDEHRLQAYEGVSVPGYPNFFTVFGPYG YVGSSYFALIETQAHHIIRCLKRARRTGATRIEVTEEANARYFAEVMRRRHRQVFWQD SCRLANSYYFDKNGDVPLRPTTTVEAYWRSRRFDLGDYRISS" gene complement(1569584..1570969) /gene="cyp132" /locus_tag="Rv1394c" /db_xref="GeneID:886738" CDS complement(1569584..1570969) /gene="cyp132" /locus_tag="Rv1394c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. IT OXIDIZES A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv1394c, (MT1439, MTCY21B4.11c), len: 461 aa. Probable cyp132, cytochrome P450 132 (EC 1.14.-.-). Some similarity to others e.g. CP4B_HUMAN|P13584 human cytochrome p450 (511 aa), FASTA scores: opt: 486, E(): 7.4e-21, (28.6% identity in 423 aa overlap); etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. MAY BELONG TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 132" /protein_id="YP_177807.1" /db_xref="GI:57116859" /db_xref="GeneID:886738" /translation="MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGS DITRFRCAGRRFVSISHPDYVDHVLHEARLKYVKSDEYGPIRATAGLNLLTDEGDSWA RHRGALNSTFARRHLRGLVGLMIDPIADVTAARVPGAQFDMHQSMVETTLRVVANALF SQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYDTLIWCIYSGVHLPPPLRE MQEITLTLDRAINSVIDRRLAEPTNSADLLNVLLSADGGIWPRQRVRDEALTFMLAGH ETTANAMSWFWYLMALNPQARDHMLTELDDVLGMRRPTADDLGKLAWTTACLQESQRY FSSVWIIAREAVDDDIIDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPT DRPRCAYLPFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRP KHGVHVIGRRR" misc_feature complement(1569737..1569766) /gene="cyp132" /locus_tag="Rv1394c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene 1571047..1572081 /locus_tag="Rv1395" /db_xref="GeneID:886251" CDS 1571047..1572081 /locus_tag="Rv1395" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1395, (MTCY21B4.12), len: 344 aa. Probable transcriptional regulatory protein (see citation below), similar to many e.g. URER_PROMI|Q02458 urease operon transcriptional activator from Proteus mirabilis (293 aa), FASTA scores: E():1.5e-08, (41.7% identity in 84 aa overlap); YHIX_ECOLI|P37639 hypothetical transcriptional regulatory protein from Escherichia coli (274 aa), FASTA scores: opt: 238, E(): 3.5e-09, (27.3% identity in 249 aa overlap); and G296916|X68281 POSSIBLE VIRULENCE-REGULATING protein from Mycobacterium tuberculosis (339 aa), FASTA scores: opt: 228, E(): 1.9e-08, (27.0% identity in 278 aa overlap). Helix turn helix motif present, aa 261-282 (+4.68 SD). BELONGS TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS. 3' part corrected since first submission (-14 aa)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="YP_177808.1" /db_xref="GI:57116860" /db_xref="GeneID:886251" /translation="MGHLPPPAEVRHPVYATRVLCEVANERGVPTADVLAGTAIEPAD LDDPDAVVGALDEITAVRRLLARLPDDAGIGIDVGSRFALTHFGLFGFAVMSCGTLRE LLTIAMRYFALTTMHVDITLFETADDCLVELDASHLPADVRGFFIERDIAGIIATTTS FALPLAAKYADQVSAELAVDAELLRPLLELVPVHDVAFGRAHNRVHFPRAMFDEPLPQ ADRHTLEMCIAQCDVLMQRNERRRGITALVRSKLFRDSGLFPTFTDVAGELDMHPRTL RRRLAEEGTSFRALLGEARSTVAVDLLRNVGLTVQQVSTRLGYTEVSTFSHAFKRWYG VAPSEYSRRG" gene complement(1572127..1573857) /gene="PE_PGRS25" /locus_tag="Rv1396c" /db_xref="GeneID:886745" CDS complement(1572127..1573857) /gene="PE_PGRS25" /locus_tag="Rv1396c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1396c, (MTCY21B4.13c), len: 576 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002), strong similarity to many e.g. glycine rich protein MTCY130.10C|E245019 (603 aa), FASTA scores: opt: 1945, E(): 0, (57.5% identity in 619 aa overlap). Contains PS00017 ATP/GTP-binding site motif A, similar to other PGRS-type sequences." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177809.1" /db_xref="GI:57116861" /db_xref="GeneID:886745" /translation="MSFLFAQPEMLGAAATDLASIGSAISTANAAAAAATTRVLAAGA DEVSAAVAALFSGHAQTYQALRTQAAAFHQQIVQTLTSTAGAYASAEAANVEQQLLGA INAPTMALLGRPLIGHGADGAPGTGQAGGAGGILYGNGGNGGSGATGQAGGAGGAAGL IGHGGAGGLGGTGASGGAGGAGGWLWGNGGAGGNGGVGVAGDPGGVGGAGGAGGAAGL WGSGGSGGTGGQGGVGGGKSGDGGTGGIGGAGGGGGWLHGDGGAGGHGGQGGTGVSSG GNGGAGGTGGDGRGLSGSGGAGGRGGQTGVGGKVGENNFGGAGGAGGTGGLIGNGGAG GNGGQGAISGAGGAGGNAWLIGDGGAGGNGGDIRGQGGGAGGAGGAGGQLIGNGGTGG AGGTVTSPNGLGGAGGAGGSAGLIGHGGTGGAGGHSAQGPDGNGGIGGAGGAGGNGGQ LYGTGGTGGTGGKGGDGFGVFGKGGAGGTGGRGGAAGLIGDAGTGGTGGKGGTAGEDG TGGNGGTGGNGGAAVLIGNGGGGGAGGNGGAGNDGTPGNGGGGGVGGTGGTLFGQPGQ PGPPGQPGPA" misc_feature complement(1573144..1573167) /gene="PE_PGRS25" /locus_tag="Rv1396c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1574112..1574513) /locus_tag="Rv1397c" /db_xref="GeneID:886736" CDS complement(1574112..1574513) /locus_tag="Rv1397c" /function="UNKNOWN" /note="Rv1397c, (MTCY21B4.14c), len: 133 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis protein MTCY159.08C|Rv2548 (125 aa), FASTA scores: E(): 2.3e-14, (42.4% identity in 125 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215913.1" /db_xref="GI:15608535" /db_xref="GeneID:886736" /translation="MILVDSDVLIAHLRGVVAARDWLVSARKDGPLAISVVSTAELIG GMRTAERREVWRLLASFRVQPATEVIARRAGDMMRRYRRSHNRIGLGDYLIAATADVQ DLQLATLNVWHFPMFEQLKPPFAVPGHRPRA" gene complement(1574510..1574767) /locus_tag="Rv1398c" /db_xref="GeneID:886759" CDS complement(1574510..1574767) /locus_tag="Rv1398c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1398c, (MTCY21B4.15c), len: 85 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis proteins Rv2547|MTCY159.09C (85 aa), FASTA scores: E(): 0.0035, (37.1% identity in 62 aa overlap); Rv0581, Rv2871, Rv1241, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215914.1" /db_xref="GI:15608536" /db_xref="GeneID:886759" /translation="MKRTNIYLDEEQTASLDKLAAQEGVSRAELIRLLLNRALTTAGD DLASDLQAINDSFGTLRHLDPPVRRSGGREQHLAQVWRATS" gene complement(1574850..1575809) /gene="lipH" /locus_tag="Rv1399c" /db_xref="GeneID:886731" CDS complement(1574850..1575809) /gene="lipH" /locus_tag="Rv1399c" /EC_number="3.1.-.-" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN LIPID METABOLISM" /note="Rv1399c, (MTCY21B4.16c), len: 319 aa. Possible LipH, lipase (EC 3.1.-.-), most similar to G695278 lipase like enzyme from Ralstonia eutropha (364 aa), FASTA scores: opt: 648, E(): 4.4e-34, (37.3% identity in 327 aa ov erlap), similar to Mycobacterium tuberculosis hypothetical lipases e.g. Rv2284, Rv2485c, Rv1426c, etc." /codon_start=1 /transl_table=11 /product="lipase LipH" /protein_id="NP_215915.1" /db_xref="GI:15608537" /db_xref="GeneID:886731" /translation="MTEPTVARPDIDPVLKMLLDTFPVTFTAADGVEVARARLRQLKT PPELLPELRIEERTVGYDGLTDIPVRVYWPPVVRDNLPVVVYYHGGGWSLGGLDTHDP VARAHAVGAQAIVVSVDYRLAPEHPYPAGIDDSWAALRWVGENAAELGGDPSRIAVAG DSAGGNISAVMAQLARDVGGPPLVFQLLWYPTTMADLSLPSFTENADAPILDRDVIDA FLAWYVPGLDISDHTMLPTTLAPGNADLSGLPPAFIGTAEHDPLRDDGACYAELLTAA GVSVELSNEPTMVHGYVNFALVVPAAAEATGRGLAALKRALHA" gene complement(1575834..1576796) /gene="lipI" /locus_tag="Rv1400c" /db_xref="GeneID:886728" CDS complement(1575834..1576796) /gene="lipI" /locus_tag="Rv1400c" /EC_number="3.1.-.-" /function="UNKNOWN, BUT POSSIBLY INVOLVED IN LIPID METABOLISM" /note="Rv1400c, (MTCY21B4.17c), len: 320 aa. Possible lipI, lipase (EC 3.1.-.-), most similar to G695278 lipase like enzyme (364 aa), FASTA sscores: opt: 611, E(): 3.5e-30, (36.6% identity in 352 aa overlap); similar to M. tuberculosis hypothetical lipases e.g. Rv1399c|MTCY21B4.16c (58.1% identical in 315 aa overlap); Rv1426c, Rv2284, etc." /codon_start=1 /transl_table=11 /product="lipase LipH" /protein_id="NP_215916.1" /db_xref="GI:15608538" /db_xref="GeneID:886728" /translation="MPSLDNTADEKPAIDPILLKVLDAVPFRLSIDDGIEAVRQRLRD LPRQPVHPELRVVDLAIDGPAGPIGTRIYWPPTCPDQAEAPVVLYFHGGGFVMGDLDT HDGTCRQHAVGADAIVVSVDYRLAPEHPYPAAIEDAWAATRWVAEHGRQVGADLGRIA VAGDSAGGTIAAVIAQRARDMGGPPIVFQLLWYPSTLWDQSLPSLAENADAPILDVKA IAAFSRWYAGEIDLHNPPAPMAPGRAENLADLPPAYIAVAGYDPLRDDGIRYGELLAA AGVPVEVHNAQTLVHGYVGYAGVVPAATEATNRGLVALRVVLHG" gene 1576930..1577532 /locus_tag="Rv1401" /db_xref="GeneID:886733" CDS 1576930..1577532 /locus_tag="Rv1401" /function="UNKNOWN" /note="Rv1401, (MTCY21B4.18), len: 200 aa. Possible membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215917.1" /db_xref="GI:15608539" /db_xref="GeneID:886733" /translation="MLQPAFKASMAVLLAAAAVAHPIGRERRWLVPALLLSATGDWLL AIPWWTWAFVFGLGAFLLAHLCFIGALLPLARQAAPSRGRVAAVVAMCVASAGLLVWF WPHLGKDNLTIPVTVYIVALSAMVCTALLARLPTIWTAVGAVCFAASDSMIGIGRFIL GNEALAVPIWWSYAAAEILITAGFFFGREVPDNAAAPTDS" gene 1577613..1579580 /gene="priA" /locus_tag="Rv1402" /db_xref="GeneID:886716" CDS 1577613..1579580 /gene="priA" /locus_tag="Rv1402" /function="RECOGNIZES A SPECIFIC HAIRPIN SEQUENCE ON PHIX SSDNA; THIS STRUCTURE IS THEN RECOGNIZED AND BOUND BY PROTEINS PRIB AND PRIC. FORMATION OF THE PRIMOSOME PROCEEDS WITH THE SUBSEQUENT ACTIONS OF DNAB, DNAC, DNAT AND PRIMASE. PRIA THEN FUNCTIONS AS A HELICASE WITHIN THE PRIMOSOME" /note="binding of PriA to forked DNA starts the assembly of the primosome, also possesses 3'-5' helicase activity" /codon_start=1 /transl_table=11 /product="primosome assembly protein PriA" /protein_id="NP_215918.1" /db_xref="GI:15608540" /db_xref="GeneID:886716" /translation="MLSVPHLDRDFDYLVPAEHSDDAQPGVRVRVRFHGRLVDGFVLE RRSDSDHHGKLGWLDRVVSPEPVLTTEIRRLVDAVAARYAGTRQDVLRLAVPARHARV EREITTAPGRPVVAPVDPSGWAAYGRGRQFLAALADSRAARAVWQALPGELWADRFAE AAAQTVRAGRTVLAIVPDQRDLDTLWQAATALVDEHSVVALSAGLGPEARYRRWLAAL RGSARLVIGTRSAVFAPLSELGLVMVWADADDSLAEPRAPYPHAREVAMLRAHQARCA ALIGGYARTAEAHALVRSGWAHDVVAPRPEVRARSPRVVALDDSGYDDARDPAARTAR LPSIALRAARSALQSGAPVLVQVPRRGYIPSLACGRCRAIARCRSCTGPLSLQGAGSP GAVCRWCGRVDPTLRCVRCGSDVVRAVVVGARRTAEELGRAFPGTAVITSAGDTLVPQ LDAGPALVVATPGAEPRAPGGYGAALLLDSWALLGRQDLRAAEDALWRWMTAAALVRP RGAGGVVTVVAESSIPTVQSLIRWDPVGHAEAELAARTEVGLPPSVHIAALDGPAGTV TALLEAARLPDPDRLQADLLGPVDLPPGVRRPAGIPADAPVIRMLLRVCREQGLELAA SLRRGIGVLSARQTRQTRSLVRVQIDPLHIG" gene complement(1579598..1580422) /locus_tag="Rv1403c" /db_xref="GeneID:886717" CDS complement(1579598..1580422) /locus_tag="Rv1403c" /EC_number="2.1.1.-" /function="CAUSES METHYLATION" /note="Rv1403c, (MTCY21B4.20c), len: 274 aa. Putative methyltransferase (EC 2.1.1.-), similar to PMTA_RHOSH|Q05197 phosphatidylethanolamine m-methyltransferase (203 aa), FASTA scores: opt: 217, E(): 1.1e-07, (37.1% identity in 105 aa overlap); similar to Rv1405c|MTCY21B4.22c (59.3% identity in 273 aa overlap) and to Rv1523, Rv2952, etc." /codon_start=1 /transl_table=11 /product="putative methyltransferase" /protein_id="NP_215919.1" /db_xref="GI:15608541" /db_xref="GeneID:886717" /translation="MTVYTPTSERQAPATTHRQMWALGDYAAIAEELLAPLGPILVST SGIRRGDRVLDVAAGSGNVSIPAAMAGAHVTASDLTPELLRRAQARAAAAGLELGWRE ANAEALPFSAGEFDAVLSTIGVMFAPRHQRTADELARVCRRGGKISTLNWTPEGFYGK LLSTIRPYRPTLPAGAPHEVWWGSEDYVSGLFRDHVSDIRTRRGSLTVDRFGCPDECR DYFKNFYGPAINAYRSIADSPECVATLDAEITELCREYLCDGVMQWEYLIFTARKC" gene 1580591..1581073 /locus_tag="Rv1404" /db_xref="GeneID:886712" CDS 1580591..1581073 /locus_tag="Rv1404" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1404, (MTCY21B4.21), len: 160 aa. Probable transcriptional regulatory protein, some similarity to MARR_ECOLI|P27245 multiple antibiotic resistance protein from Escherichia coli (125 aa), FASTA scores: opt: 136, E(): 0.004, (35.1% identity in 74 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215920.1" /db_xref="GI:15608542" /db_xref="GeneID:886712" /translation="MMPTEYPATAEESVDVITDALLTASRLLVAISAHSIAQVDENIT IPQFRTLVILSNHGPINLATLATLLGVQPSATGRMVDRLVGAELIDRLPHPTSRRELL AALTKRGRDVVRQVTEHRRTEIARIVEQMAPAERHGLVRALTAFTEAGGEPDARYEIE" gene complement(1581145..1581969) /locus_tag="Rv1405c" /db_xref="GeneID:886714" CDS complement(1581145..1581969) /locus_tag="Rv1405c" /EC_number="2.1.1.-" /function="CAUSES METHYLATION" /note="Rv1405c, (MTCY21B4.22c), len: 274 aa. Putative methyltransferase (EC 2.1.1.-), most similar to PMTA_RHOSH|Q05197 phosphatidylethanolamine m-methyltransferase (203 aa), FASTA scores: opt: 219, E(): 2.6e-07, (29.9% identity in 144 aa overlap); similar to Rv1403c|MTCY21B4.20c (59.3% identity in 273 aa overlap), Rv1523, Rv2952, etc." /codon_start=1 /transl_table=11 /product="putative methyltransferase" /protein_id="NP_215921.1" /db_xref="GI:15608543" /db_xref="GeneID:886714" /translation="MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAA AGIGPGVRVLDVAAGSGNISLPAAKTGATVISTDLTPELLQRSQARAAQQGLTLQYQE ANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVISWTCEGFFGR MLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLKTARGLLEVKRFDTAQAVH DYFKNNYGPTIEAYAHIGDNAVLAAELDRQLVELAAQYLSDGVMEWEYLLLTAEKR" gene 1582166..1583104 /gene="fmt" /locus_tag="Rv1406" /db_xref="GeneID:886706" CDS 1582166..1583104 /gene="fmt" /locus_tag="Rv1406" /EC_number="2.1.2.9" /function="MODIFY THE FREE AMINO GROUP OF THE AMINOACYL MOIETY OF METHIONYL-TRNA(FMET). THE FORMYL GROUP APPEARS TO PLAY A DUAL ROLE IN THE INITIATOR IDENTITY OF N-FORMYLMETHIONYL-TRNA BY:(I) PROMOTING ITS RECOGNITION BY IF2 AND (II) IMPAIRING ITS BINDING TO EFTU-GTP. [CATALYTIC ACTIVITY : 10-FORMYLTETRAHYDROFOLATE + L-METHIONYL-TRNA + H(2)O = TETRAHYDROFOLATE + N-FORMYLMETHIONYL-TRNA]" /note="modifies the free amino group of the aminoacyl moiety of methionyl-tRNA(fMet) which is important in translation initiation; inactivation of this gene in Escherichia coli severely impairs growth" /codon_start=1 /transl_table=11 /product="methionyl-tRNA formyltransferase" /protein_id="NP_215922.1" /db_xref="GI:15608544" /db_xref="GeneID:886706" /translation="MRLVFAGTPEPALASLRRLIESPSHDVIAVLTRPDAASGRRGKP QPSPVAREAAERGIPVLRPSRPNSAEFVAELSDLAPECCAVVAYGALLGGPLLAVPPH GWVNLHFSLLPAWRGAAPVQAAIAAGDTITGATTFQIEPSLDSGPIYGVVTEVIQPTD TAGDLLKRLAVSGAALLSTTLDGIADQRLTPRPQPADGVSVAPKITVANARVRWDLPA AVVERRIRAVTPNPGAWTLIGDLRVKLGPVHLDAAHRPSKPLPPGGIHVERTSVWIGT GSEPVRLGQIQPPGKKLMNAADWARGARLDLAARAT" gene 1583101..1584474 /gene="fmu" /locus_tag="Rv1407" /db_xref="GeneID:886720" CDS 1583101..1584474 /gene="fmu" /locus_tag="Rv1407" /function="UNKNOWN" /note="Rv1407, (MTCY21B4.24), len: 457 aa. Probable fmu protein, similar to SUN_ECOLI|P36929 sun protein (fmu protein) from Escherichia coli (429 aa), FASTA scores: E(): 2.5e-20, (30.6% identity in 451 aa overlap)." /codon_start=1 /transl_table=11 /product="Fmu protein (SUN protein)" /protein_id="NP_215923.1" /db_xref="GI:15608545" /db_xref="GeneID:886720" /translation="MTPRSRGPRRRPLDPARRAAFETLRAVSARDAYANLVLPALLAQ RGIGGRDAAFATELTYGTCRARGLLDAVIGAAAERSPQAIDPVLLDLLRLGTYQLLRT RVDAHAAVSTTVEQAGIEFDSARAGFVNGVLRTIAGRDERSWVGELAPDAQNDPIGHA AFVHAHPRWIAQAFADALGAAVGELEAVLASDDERPAVHLAARPGVLTAGELARAVRG TVGRYSPFAVYLPRGDPGRLAPVRDGQALVQDEGSQLVARALTLAPVDGDTGRWLDLC AGPGGKTALLAGLGLQCAARVTAVEPSPHRADLVAQNTRGLPVELLRVDGRHTDLDPG FDRVLVDAPCTGLGALRRRPEARWRRQPADVAALAKLQRELLSAAIALTRPGGVVLYA TCSPHLAETVGAVADALRRHPVHALDTRPLFEPVIAGLGEGPHVQLWPHRHGTDAMFA AALRRLT" gene 1584499..1585197 /gene="rpe" /locus_tag="Rv1408" /db_xref="GeneID:886702" CDS 1584499..1585197 /gene="rpe" /locus_tag="Rv1408" /EC_number="5.1.3.1" /function="INVOLVED IN THE CALVIN CYCLE [CATALYTIC ACTIVITY : D-RIBULOSE 5-PHOSPHATE = D-XYLULOSE 5- PHOSPHATE]" /note="catalyzes the interconversion of D-ribulose 5-phosphate to xylulose 5-phosphate" /codon_start=1 /transl_table=11 /product="ribulose-phosphate 3-epimerase" /protein_id="NP_215924.1" /db_xref="GI:15608546" /db_xref="GeneID:886702" /translation="MSLMAGSTGGPLIAPSILAADFARLADEAAAVNGADWLHVDVMD GHFVPNLTIGLPVVESLLAVTDIPMDCHLMIDNPDRWAPPYAEAGAYNVTFHAEATDN PVGVARDIRAAGAKAGISVKPGTPLEPYLDILPHFDTLLVMSVEPGFGGQRFIPEVLS KVRAVRKMVDAGELTILVEIDGGINDDTIEQAAEAGVDCFVAGSAVYGADDPAAAVAA LRRQAGAASLHLSL" misc_feature 1584610..1584654 /gene="rpe" /locus_tag="Rv1408" /note="PS01085 Ribulose-phosphate 3-epimerase family signature 1" misc_feature 1584910..1584984 /gene="rpe" /locus_tag="Rv1408" /note="PS01086 Ribulose-phosphate 3-epimerase family signature 2" gene 1585194..1586213 /gene="ribG" /locus_tag="Rv1409" /db_xref="GeneID:886721" CDS 1585194..1586213 /gene="ribG" /locus_tag="Rv1409" /EC_number="3.5.4.26" /EC_number="1.1.1.193" /function="INVOLVED IN RIBOFLAVIN BIOSYNTHESIS (AT THE SECOND AND THIRD STEPS). CONVERTS 2,5-DIAMINO-6-(RIBOSYLAMINO)-4(3H)-PYRIMIDINONE 5'-PHOSPHATE INTO 5-AMINO-6-(RIBOSYLAMINO)-2,4(1H,3H)-PYRIMIDINEDIONE 5'-PHOSPHATE [CATALYTIC ACTIVITY 1: 2,5-DIAMINO-6-HYDROXY-4-(5-PHOSPHORIBOSYLAMINO)PYRIMIDINE + H(2)O = 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL + NH(3)] [CATALYTIC ACTIVITY 2: 5-AMINO-6-(5-PHOSPHORIBITYLAMINO)URACIL + NADP(+) = 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL + NADPH]." /note="Rv1409, (MTCY21B4.26), len: 339 aa. Probable ribG (alternate gene name: ribD), bifunctional riboflavin biosynthesis protein, including diaminohydroxyphosphoribosylaminopyrimidine deaminase and 5-amino-6-(5-phosphoribosylamino) uracil reductase (EC 3.5.4.26 and 1.1.1.193), similar to many e.g. RIBD_ECOLI|P25539 riboflavin-specific deaminase from Escherichia coli (367 aa), FASTA scores: E(): 0, (39.8% identity in 364 aa overlap); etc. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. IN THE N-TERMINAL SECTION; BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY. IN THE C-TERMINAL SECTION; BELONGS TO THE HTP REDUCTASE FAMILY.; ribD" /codon_start=1 /transl_table=11 /product="bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino) uracil reductase" /protein_id="NP_215925.1" /db_xref="GI:15608547" /db_xref="GeneID:886721" /translation="MNVEQVKSIDEAMGLAIEHSYQVKGTTYPKPPVGAVIVDPNGRI VGAGGTEPAGGDHAEVVALRRAGGLAAGAIVVVTMEPCNHYGKTPPCVNALIEARVGT VVYAVADPNGIAGGGAGRLSAAGLQVRSGVLAEQVAAGPLREWLHKQRTGLPHVTWKY ATSIDGRSAAADGSSQWISSEAARLDLHRRRAIADAILVGTGTVLADDPALTARLADG SLAPQQPLRVVVGKRDIPPEARVLNDEARTMMIRTHEPMEVLRALSDRTDVLLEGGPT LAGAFLRAGAINRILAYVAPILLGGPVTAVDDVGVSNITNALRWQFDSVEKVGPDLLL SLVAR" misc_feature 1585362..1585478 /gene="ribG" /locus_tag="Rv1409" /note="PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature" gene complement(1586210..1587766) /locus_tag="Rv1410c" /db_xref="GeneID:886709" CDS complement(1586210..1587766) /locus_tag="Rv1410c" /function="INVOLVED IN TRANSPORT OF AMINOGLYCOSIDES AND TETRACYCLINE ACROSS THE MEMBRANE (EXPORT): DRUG RESISTANCE BY AN EXPORT MECHANISM (CONFERES RESISTANCE TO TOXIC COMPOUNDS BY REMOVING THEM FOR THE CELLS). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv1410c, (MTCY21B4.27c), len: 518 aa. Aminoglycoside/tetracycline-transport integral membrane protein (see citation below), member of major facilitator superfamily (MFS), similar to others e.g. AC22_STRCO|P46105 probable actinorhodin transporter from Streptomyces coelicolor (578 aa), FASTA scores: opt: 442, E(): 4.9e-21, (28.5% identity in 466 aa overlap); etc. Contains PS00216 Sugar transport proteins signature 1. Could be termed P55. Note that the Rv1410c-Rv1411c operon seems transcribed from two promoters in Mycobacterium bovis BCG (see Bigi et al., 2000).; P55" /codon_start=1 /transl_table=11 /product="aminoglycosides/tetracycline-transport integral membrane protein" /protein_id="NP_215926.1" /db_xref="GI:15608548" /db_xref="GeneID:886709" /translation="MRAGRRVAISAGSLAVLLGALDTYVVVTIMRDIMNSVGIPINQL HRITWIVTMYLLGYIAAMPLLGRASDRFGRKLMLQVSLAGFIIGSVVTALAGHFGDFH MLIAGRTIQGVASGALLPITLALGADLWSQRNRAGVLGGIGAAQELGSVLGPLYGIFI VWLLHDWRDVFWINVPLTAIAMVMIHFSLPSHDRSTEPERVDLVGGLLLALALGLAVI GLYNPNPDGKHVLPDYGAPLLVGALVAAVAFFGWERFARTRLIDPAGVHFRPFLSALG ASVAAGAALMVTLVDVELFGQGVLQMDQAQAAGMLLWFLIALPIGAVTGGWIATRAGD RAVAFAGLLIAAYGYWLISHWPVDLLADRHNILGLFTVPAMHTDLVVAGLGLGLVIGP LSSATLRVVPSAQHGIASAAVVVARMTGMLIGVAALSAWGLYRFNQILAGLSAAIPPN ASLLERAAAIGARYQQAFALMYGEIFTITAIVCVFGAVLGLLISGRKEHADEPEVQEQ PTLAPQVEPL" misc_feature complement(1587524..1587574) /locus_tag="Rv1410c" /note="PS00216 Sugar transport proteins signature 1" gene complement(1587772..1588482) /gene="lprG" /locus_tag="Rv1411c" /db_xref="GeneID:886700" CDS complement(1587772..1588482) /gene="lprG" /locus_tag="Rv1411c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1411c, (MTCY21B4.28c), len: 236. Probable lprG (alternate gene name: P27), conserved lipoprotein, similar to Mycobacterium tuberculosis hypothetical lipoproteins e.g. Rv1270c|MTCY50.12 (35.1% identity in 245 aa overlap); Rv1368, Rv2945c. Contains N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013). Note that the Rv1410c-Rv1411c operon seems transcribed from two promoters in Mycobacterium bovis BCG (see Bigi et al., 2000).; P27" /codon_start=1 /transl_table=11 /product="lipoprotein LprG" /protein_id="NP_215927.1" /db_xref="GI:15608549" /db_xref="GeneID:886700" /translation="MRTPRRHCRRIAVLAAVSIAATVVAGCSSGSKPSGGPLPDAKPL VEEATAQTKALKSAHMVLTVNGKIPGLSLKTLSGDLTTNPTAATGNVKLTLGGSDIDA DFVVFDGILYATLTPNQWSDFGPAADIYDPAQVLNPDTGLANVLANFADAKAEGRDTI NGQNTIRISGKVSAQAVNQIAPPFNATQPVPATVWIQETGDHQLAQAQLDRGSGNSVQ MTLSKWGEKVQVTKPPVS" gene 1588567..1589172 /gene="ribC" /locus_tag="Rv1412" /db_xref="GeneID:886690" CDS 1588567..1589172 /gene="ribC" /locus_tag="Rv1412" /EC_number="2.5.1.9" /function="INVOLVED IN RIBOFLAVIN SYNTHESIS. RIBOFLAVIN SYNTHASE IS A BIFUNCTIONAL ENZYME COMPLEX CATALYZING THE FORMATION OF RIBOFLAVIN FROM 5-AMINO-6-(1'-D)- RIBITYL-AMINO-2,4(1H,3H)-PYRIMIDINEDIONE AND L-3,4-DIHYDROHY-2- BUTANONE-4-PHOSPHATE VIA 6,7-DIMETHYL-8-LUMAZINE. THE ALPHA SUBUNIT CATALYZES THE DISMUTATION OF 6,7-DIMETHYL-8-LUMAZINE TO RIBOFLAVIN AND 5-AMINO-6-(1'-D)-RIBITYL-AMINO-2,4(1H,3H)- PYRIMIDINEDIONE." /note="catalyzes the formation of riboflavin from 6,7-dimethyl-8-(1-D-ribityl)lumazine" /codon_start=1 /transl_table=11 /product="riboflavin synthase subunit alpha" /protein_id="NP_215928.1" /db_xref="GI:15608550" /db_xref="GeneID:886690" /translation="MFTGIVEERGEVTGREALVDAARLTIRGPMVTADAGHGDSIAVN GVCLTVVDVLPDGQFTADVMAETLNRSNLGELRPGSRVNLERAAALGSRLGGHIVQGH VDATGEIVARCPSEHWEVVRIEMPASVARYVVEKGSITVDGISLTVSGLGAEQRDWFE VSLIPTTRELTTLGSAAVGTRVNLEVDVVAKYVERLMRSAG" misc_feature 1588783..1588821 /gene="ribC" /locus_tag="Rv1412" /note="PS00693 Riboflavin synthase alpha chain family signature" misc_feature 1589083..1589121 /gene="ribC" /locus_tag="Rv1412" /note="PS00693 Riboflavin synthase alpha chain family signature" gene 1589386..1589901 /locus_tag="Rv1413" /db_xref="GeneID:886692" CDS 1589386..1589901 /locus_tag="Rv1413" /function="UNKNOWN" /note="Rv1413, (MTCY21B4.30), len: 171 aa. Conserved hypothetical protein, similar to part of AB010956|AB010956_1 metal-activated pyridoxal enzyme from Arthrobacter sp. (379 aa), FASTA scores: opt: 187, E(): 0.00026, (29.0% identity in 162 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215929.1" /db_xref="GI:15608551" /db_xref="GeneID:886692" /translation="MATIGEVEVFVDHGADDVFITYPLWIGTRQADRLRQLADRARIA VGAGTAEGASNTGARLADAAGAIDVLIEIDSGHHRSGVRAEQVLEVAHAVGEAGLHLV GVFTFPGHSYAPGKPGEAGEQERRALNDAANALVAVGFPISCRSGGSTPTALLTAADG ASETSRRLCAR" gene 1589891..1590292 /locus_tag="Rv1414" /db_xref="GeneID:886696" CDS 1589891..1590292 /locus_tag="Rv1414" /function="UNKNOWN" /note="Rv1414, (MTCY21B4.31), len: 133 aa. Conserved hypothetical protein, similar to C-terminal part of AB010956|AB010956_1 novel metal-activated pyridoxal enzyme from Arthrobacter sp. (379 aa), FASTA scores: opt: 163, E(): 0.00063, (32.1% identity in 112 aa overlap). Rv1413 is similar to N-terminal part of same enzyme suggesting possible frameshift. Sequence has been checked and no errors found, it is identical in Mycobacterium bovis strain AF2122/97 and in Mycobacterium tuberculosis CDC1551." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215930.1" /db_xref="GI:15608552" /db_xref="GeneID:886696" /translation="MLGDAQQLELGRCAPADIALTVAATVVSRQDCRSGLRRIVLDCG SKILGSDRPAWATGFGRLIDHADARIAALSEHHATVVWPDDAPLPPVGTRLRVIPNHV CLTTNLVDDVAVVRDATLIDRWKVAARGKNH" gene 1590397..1591674 /gene="ribA2" /locus_tag="Rv1415" /db_xref="GeneID:886694" CDS 1590397..1591674 /gene="ribA2" /locus_tag="Rv1415" /EC_number="3.5.4.25" /function="INVOLVED IN RIBOFLAVIN BIOSYNTHESIS [CATALYTIC ACTIVITY : GTP + 3 H(2)O = FORMATE + 2,5-DIAMINO-6-HYDROXY-4-(5-PHOSPHORIBOSYLAMINO)PYRIMIDINE + DIPHOSPHATE]." /note="bifunctional enzyme DHBP synthase/GTP cyclohydrolase II; functions in riboflavin synthesis; converts GTP to 2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)pyrimidine; converts ribulose 5-phopshate to 3,4-dihydroxy-2-butanone 4-phosphate" /codon_start=1 /transl_table=11 /product="bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein" /protein_id="NP_215931.1" /db_xref="GI:15608553" /db_xref="GeneID:886694" /translation="MTRLDSVERAVADIAAGKAVIVIDDEDRENEGDLIFAAEKATPE MVAFMVRYTSGYLCVPLDGAICDRLGLLPMYAVNQDKHGTAYTVTVDARNGIGTGISA SDRATTMRLLADPTSVADDFTRPGHVVPLRAKDGGVLRRPGHTEAAVDLARMAGLQPA GAICEIVSQKDEGSMAHTDELRVFADEHGLALITIADLIEWRRKHEKHIERVAEARIP TRHGEFRAIGYTSIYEDVEHVALVRGEIAGPNADGDDVLVRVHSECLTGDVFGSRRCD CGPQLDAALAMVAREGRGVVLYMRGHEGRGIGLMHKLQAYQLQDAGADTVDANLKLGL PADARDYGIGAQILVDLGVRSMRLLTNNPAKRVGLDGYGLHIIERVPLPVRANAENIR YLMTKRDKLGHDLAGLDDFHESVHLPGEFGGAL" gene 1591689..1592153 /gene="ribH" /locus_tag="Rv1416" /db_xref="GeneID:886681" CDS 1591689..1592153 /gene="ribH" /locus_tag="Rv1416" /EC_number="2.5.1.9" /function="RIBOFLAVIN SYNTHASE IS A BIFUNCTIONAL ENZYME COMPLEX INVOLVED IN RIBOFLAVIN SYNTHESIS. RIBOFLAVIN SYNTHASE CATALYZES THE FORMATION OF RIBOFLAVIN FROM 5-AMINO-6-(1'-D)- RIBITYL-AMINO-2,4(1H,3H)-PYRIMIDINEDIONE AND L-3,4-DIHYDROHY-2- BUTANONE-4-PHOSPHATE VIA 6,7-DIMETHYL-8-LUMAZINE. THE BETA SUBUNIT CATALYZES THE CONDENSATION OF 5-AMINO-6-(1'-D)-RIBITYL- AMINO-2,4(1H,3H)-PYRIMIDINEDIONE WITH L-3,4-DIHYDROHY-2-BUTANONE- 4-PHOSPHATE YIELDING 6,7-DIMETHYL-8-LUMAZINE." /note="RibE; 6,7-diimethyl-8-ribityllumazine synthase; DMRL synthase; lumazine synthase; beta subunit of riboflavin synthase; condenses 5-amino-6-(1'-D)-ribityl-amino-2,4(1H,3H)-pyrimidinedione with L-3,4-dihydrohy-2-butanone-4-phosphate to generate 6,6-dimethyl-8-lumazine (DMRL); riboflavin synthase then uses 2 molecules of DMRL to produce riboflavin (vitamin B12); involved in the last steps of riboflavin biosynthesis; forms a 60mer (icosahedral shell) in both Bacillus subtilis and Escherichia coli; in Bacillus subtilis this 60mer is associated with the riboflavin synthase subunit (alpha) while in Escherichia coli it is not" /codon_start=1 /transl_table=11 /product="6,7-dimethyl-8-ribityllumazine synthase" /protein_id="NP_215932.1" /db_xref="GI:15608554" /db_xref="GeneID:886681" /translation="MPDLPSLDASGVRLAIVASSWHGKICDALLDGARKVAAGCGLDD PTVVRVLGAIEIPVVAQELARNHDAVVALGVVIRGQTPHFDYVCDAVTQGLTRVSLDS STPIANGVLTTNTEEQALDRAGLPTSAEDKGAQATVAALATALTLRELRAHS" gene 1592150..1592614 /locus_tag="Rv1417" /db_xref="GeneID:886704" CDS 1592150..1592614 /locus_tag="Rv1417" /function="UNKNOWN" /note="Rv1417, (MTCY21B4.35), len: 154 aa. Possible conserved membrane protein, similar to others e.g. AL133213|SC6D7_2 Streptomyces coelicolor (156 aa), FASTA scores: opt: 212, E(): 4.4e-07, (32.4% identity in 136 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215933.1" /db_xref="GI:15608555" /db_xref="GeneID:886704" /translation="MTAAPNDWDVVLRPHWTPLFAYAAAFLIAVAHVAGGLLLKVGSS GVVFQTADQVAMGALGLVLAGAVLLFARPRLRVGSAGLSVRNLLGDRIVGWSEVIGVS FPGGSRWARIDLADDEYIPVMAIQAVDKDRAVAAMDTVRSLLARYRPDLCAR" gene 1592639..1593325 /gene="lprH" /locus_tag="Rv1418" /db_xref="GeneID:886687" CDS 1592639..1593325 /gene="lprH" /locus_tag="Rv1418" /function="UNKNOWN" /note="Rv1418, (MTCY21B4.36), len: 228 aa. Probable lprH, lipoprotein. Contains N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site (PS00013)." /codon_start=1 /transl_table=11 /product="lipoprotein LprH" /protein_id="NP_215934.1" /db_xref="GI:15608556" /db_xref="GeneID:886687" /translation="MACLGRPGCRGWAGASLVLVVVLALAACTESVAGRAMRATDRSS GLPTSAKPARARDLLLQDGDRAPFGQVTQSRVGDSYFTSAVPPECSAALLFKGSPLRP DGSSDHAEAAYNVTGPLPYAESVDVYTNVLNVHDVVWNGFRDVSHCRGDAVGVSRAGR STPMRLRYFATLSDGVLVWTMSNPRWTCDYGLAVVPHAVLVLSACGFKPGFPMAEWAS KRRAQLDSQV" gene 1593505..1593978 /locus_tag="Rv1419" /db_xref="GeneID:886683" CDS 1593505..1593978 /locus_tag="Rv1419" /function="UNKNOWN" /note="Rv1419, (MTCY21B4.37), len: 157 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215935.1" /db_xref="GI:15608557" /db_xref="GeneID:886683" /translation="MGELRLVGGVLRVLVVVGAVFDVAVLNAGAASADGPVQLKSRLG DVCLDAPSGSWFSPLVINPCNGTDFQRWNLTDDRQVESVAFPGECVNIGNALWARLQP CVNWISQHWTVQPDGLVKSDLDACLTVLGGPDPGTWVSTRWCDPNAPDQQWDSVP" gene 1594042..1595982 /gene="uvrC" /locus_tag="Rv1420" /db_xref="GeneID:886672" CDS 1594042..1595982 /gene="uvrC" /locus_tag="Rv1420" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. THE ABC EXCISION NUCLEASE IS A DNA REPAIR ENZYME THAT CATALYZES THE EXCISION REACTION OF UV-DAMAGED NUCLEOTIDE SEGMENTS PRODUCING OLIGOMERS HAVING THE MODIFIED BASE(S). ATTACHES TO THE UVRA-UVRB COMPLEX, DISPLACING UVRA, AND THE DAMAGED DNA STRAND IS NICKED ON BOTH SIDES OF THE DAMAGED SITE" /note="The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrC both incises the 5' and 3' sides of the lesion. The N-terminal half is responsible for the 3' incision and the C-terminal half is responsible for the 5' incision" /codon_start=1 /transl_table=11 /product="excinuclease ABC subunit C" /protein_id="NP_215936.1" /db_xref="GI:15608558" /db_xref="GeneID:886672" /translation="MPDPATYRPAPGSIPVEPGVYRFRDQHGRVIYVGKAKSLRSRLT SYFADVASLAPRTRQLVTTAAKVEWTVVGTEVEALQLEYTWIKEFDPRFNVRYRDDKS YPVLAVTLGEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRVFPARTCSAGV FKRHRQIDRPCLLGYIDKCSAPCIGRVDAAQHRQIVADFCDFLSGKTDRFARALEQQM NAAAEQLDFERAARLRDDLSALKRAMEKQAVVLGDGTDADVVAFADDELEAAVQVFHV RGGRVRGQRGWIVEKPGEPGDSGIQLVEQFLTQFYGDQAALDDAADESANPVPREVLV PCLPSNAEELASWLSGLRGSRVVLRVPRRGDKRALAETVHRNAEDALQQHKLKRASDF NARSAALQSIQDSLGLADAPLRIECVDVSHVQGTDVVGSLVVFEDGLPRKSDYRHFGI REAAGQGRSDDVACIAEVTRRRFLRHLRDQSDPDLLSPERKSRRFAYPPNLYVVDGGA PQVNAASAVIDELGVTDVAVIGLAKRLEEVWVPSEPDPIIMPRNSEGLYLLQRVRDEA HRFAITYHRSKRSTRMTASALDSVPGLGEHRRKALVTHFGSIARLKEATVDEITAVPG IGVATATAVHDALRPDSSGAAR" gene 1595979..1596884 /locus_tag="Rv1421" /db_xref="GeneID:886676" CDS 1595979..1596884 /locus_tag="Rv1421" /function="UNKNOWN" /note="Rv1421, (MTCY21B4.39), len: 301 aa. Conserved hypothetical protein, similar to many hypothetical proteins e.g. YHBJ_ECOLI|P33995 hypothetical 32.5 kd protein from Escherichia coli (284 aa), FASTA scores: opt: 648, E(): 6.3e-36, (38.7% identity in 282aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215937.1" /db_xref="GI:15608559" /db_xref="GeneID:886676" /translation="MMNHARGVENRSEGGGIDVVLVTGLSGAGRGTAAKVLEDLGWYV ADNLPPQLITRMVDFGLAAGSRITQLAVVMDVRSRGFTGDLDSVRNELATRAITPRVV FMEASDDTLVRRYEQNRRSHPLQGEQTLAEGIAAERRMLAPVRATADLIIDTSTLSVG GLRDSIERAFGGDGGATTSVTVESFGFKYGLPMDADMVMDVRFLPNPHWVDELRPLTG QHPAVRDYVLHRPGAAEFLESYHRLLSLVVDGYRREGKRYMTIAIGCTGGKHRSVAIA EALMGLLRSDQQLSVRALHRDLGRE" gene 1596881..1597909 /locus_tag="Rv1422" /db_xref="GeneID:886670" CDS 1596881..1597909 /locus_tag="Rv1422" /function="UNKNOWN" /note="Rv1422, (MTCY21B4.40), len: 342 aa. Conserved hypothetical protein, similar to many hypothetical proteins e.g. YAMB_THETU|P38541 Thermoanaerobacterium thermosulfurigenes (323 aa), FASTA scores: opt: 519, E(): 1.6e-25, (33.1% identity in 320 aa overlap); and AF106003|AF106003_3 Streptomyces coelicolor (363 aa), FASTA scores: opt: 1047, E(): 0, (54.5% identity in 308 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215938.1" /db_xref="GI:15608560" /db_xref="GeneID:886670" /translation="MTDGIVALGGGHGLYATLSAARRLTPYVTAVVTVADDGGSSGRL RSELDVVPPGDLRMALAALASDSPHGRLWATILQHRFGGSGALAGHPIGNLMLAGLSE VLADPVAALDELGRILGVKGRVLPMCPVALQIEADVSGLEADPRMFRLIRGQVAIATT PGKVRRVRLLPTDPPATRQAVDAIMAADLVVLGPGSWFTSVIPHVLVPGLAAALRATS ARRALVLNLVAEPGETAGFSVERHLHVLAQHAPGFTVHDIIIDAERVPSEREREQLRR TATMLQAEVHFADVARPGTPLHDPGKLAAVLDGVCARDVGASEPPVAATQEIPIDGGR PRGDDAWR" gene 1597906..1598883 /gene="whiA" /locus_tag="Rv1423" /db_xref="GeneID:886674" CDS 1597906..1598883 /gene="whiA" /locus_tag="Rv1423" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1423, (MTCY21B4.41-MTCY493.31c), len: 325 aa. Putative whiA, transcriptional regulator, probably equivalent to AL035591|SCC54.10 whiA protein from Streptomyces coelicolor (328 aa), FASTA scores: opt: 1505, E(): 0, (70.4% identity in 324 aa overlap). Also some similarity to O06975|YVCL hypothetical protein from Bacillus subtilis (316 aa), FASTA scores: E(): 1.8e-0 8, (25.7% identity in 304 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIA" /protein_id="NP_215939.1" /db_xref="GI:15608561" /db_xref="GeneID:886674" /translation="MTTDVKDELSRLVVKSVSARRAEVTSLLRFAGGLHIVGGRVVVE AELDLGSIARRLRKEIFELYGYTAVVHVLSASGIRKSTRYVLRVANDGEALARQTGLL DMRGRPVRGLPAQVVGGSIDDAEAAWRGAFLAHGSLTEPGRSSALEVSCPGPEAALAL VGAARRLGVGAKAREVRGADRVVVRDGEAIGALLTRMGAQDTRLVWEERRLRREVRAT ANRLANFDDANLRRSARAAVAAAARVERALEILGDTVPEHLASAGKLRVEHRQASLEE LGRLADPPMTKDAVAGRIRRLLSMADRKAKVDGIPDTESVVTPDLLEDA" gene complement(1598893..1599654) /locus_tag="Rv1424c" /db_xref="GeneID:886685" CDS complement(1598893..1599654) /locus_tag="Rv1424c" /function="UNKNOWN" /note="Rv1424c, (MTCY21B4.42c,MTCY493.30), len: 253 aa. Possible membrane protein, contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215940.1" /db_xref="GI:15608562" /db_xref="GeneID:886685" /translation="MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPP AEKLVFPVLLGILTLLLSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDS KLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYC YPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGA GGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK" misc_feature complement(1599403..1599489) /locus_tag="Rv1424c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene 1599658..1601037 /locus_tag="Rv1425" /db_xref="GeneID:886668" CDS 1599658..1601037 /locus_tag="Rv1425" /function="UNKNOWN" /note="Rv1425, (MTCY21B4.43,MTCY493.29c), len: 459 aa. Conserved hypothetical protein, similar to many M. tuberculosis hypothetical proteins e.g. Rv3740c, Rv3734c, Rv1760, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215941.1" /db_xref="GI:15608563" /db_xref="GeneID:886668" /translation="MKRLSSVDAAFWSAETAGWHMHVGALAICDPSDAPEYSFQRLRE LIIERLPEIPQLRWRVTGAPLGLDRPWFVEDEELDIDFHIRRIGVPAPGGRRELEELV GRLMSYKLDRSRPLWELWVIEGVEGGRIATLTKMHHAIVDGVSGAGLGEILLDITPEP RPPQQETVGFVGFQIPGLERRAIGALINVGIMTPFRIVRLLEQTVRQQIAALGVAGKP ARYFEAPKTRFNAPVSPHRRVTGTRVELARAKAVKDAFGVKLNDVVLALVAGAARQYL QKRDELPAKPLIAQIPVSTRSEETKADVGNQVSSMTASLATHIEDPAKRLAAIHESTL SAKEMAKAPSAHQIMGLTETTPPGLLQLAARAYTASGLSHNLAPINLVVSNVPGPPFP LYMAGARLDSLVPLGPPVMDVALNITCFSYQDYLDFGLVTTPEVANDIDEMADAIEPA LAELERAAE" gene complement(1601059..1602321) /gene="lipO" /locus_tag="Rv1426c" /db_xref="GeneID:886660" CDS complement(1601059..1602321) /gene="lipO" /locus_tag="Rv1426c" /EC_number="3.1.-.-" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN LIPID METABOLISM" /note="Rv1426c, (MTCY493.28), len: 420 aa. Possible Lipo, esterase (EC 3.1.-.-), similar to several Mycobacterium tuberculosis hypothetical lipases and esterases e.g. Rv1399c, Rv2284, etc. Also similar in central region to AAAD_HUMAN|P22760 human arylacetamide deacetylase (398 aa), FASTA scores: opt:210, E(): 7.6e-07, (29.3% identity in 191 aa overlap)." /codon_start=1 /transl_table=11 /product="esterase LipO" /protein_id="NP_215942.1" /db_xref="GI:15608564" /db_xref="GeneID:886660" /translation="MRFRRMARPRPLTRAAVELLNAANGLRPLSGSGYSTVLAFWLGW PTSEVPGVYLGASVLDALRRGRRGDFGGLKGKAALALTAAAWVILAVIRYRGATTPGP VLEAGLTEQLGPDYAKELATLPTEPMRSRGRNLPLRTAMARRRYVETTNVVCYGPYGR ANLADIWRRRDLPRDAKAPVLVQVPGGAWVLGWRRPQAYPLMSHLAARGWVCVSLNYR VSPRHTWPDHIVDVKRALAWVKENIAAYGGDPNFVAISGGSAGGHLCALAALTPNDPR FQPGFEQVDTSVAAAVPVYGRYDWFTTDAPGRREFVGLLETFVVKRKFSTHRDIFVDA SPIHHVRADAPPFFVLHGRHDSLIPVAEAHAFVEELRAVSKSPVAYADLPHAQHAFDV FGSPRAHHTAEAVARFLSWVYATNPPAT" gene complement(1602321..1603928) /gene="fadD12" /locus_tag="Rv1427c" /db_xref="GeneID:886679" CDS complement(1602321..1603928) /gene="fadD12" /locus_tag="Rv1427c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_215943.1" /db_xref="GI:15608565" /db_xref="GeneID:886679" /translation="MRIRQAFGLIATMRRAGLIAPLRPDRYLRIVAAMRREGMGFTAG FAGAARRCPDRPGLIDELGTLTWRQLDERGNALAAALQALPAGPPRVVGIMCRNHRGF VDALLAVNRIGAHILLLNTSFAGPALAEVVTREGVDTVVYDEEFSATVDRALAEKPQA TRIVAWTDEDHDLTVEKLVAAHAGRRPEHTGSHGKVILLTSGTTGTPKGARHSGGGIG TLKAILDRTPWRAEEVTVIVAPMFHAWGFSQLVLASSLACTIVTRRRFDPEATLDLID RHHATGLVVVPVMFDRIMDLPAEIRNRYDGRSLRFAAASGSRMRPDVVIAFMDQFGDV IYNNYNATEAGMIATATPADLRTAPDTAGRPAEGTEIRILDQQFTEVPTGEVGTIYVR NDSQFDGYTSGAAKDFHAGFMSSGDVGYLDENGRLFVVGRDDEMIVSGGENIYPIEVE KTLATHPDVAEAAVIGVDDQQYGQRLAAFVVLKPGVSATPETLKQHVRDNLANYKVPR DIAVLDELPRGITGKILRTELQSRVGS" misc_feature complement(1603305..1603340) /gene="fadD12" /locus_tag="Rv1427c" /note="PS00455 Putative AMP-binding domain signature" gene complement(1603932..1604759) /locus_tag="Rv1428c" /db_xref="GeneID:886656" CDS complement(1603932..1604759) /locus_tag="Rv1428c" /function="UNKNOWN" /note="Rv1428c, (MTCY493.26), len: 275 aa. Conserved hypothetical protein, some similarity to hypothetical proteins from Mycobacterium tuberculosis e.g. Rv0502|YV29_MYCTU|Q11167 (358 aa), FASTA scores: opt: 355, E(): 5e-16, (32.6% identity in 273 aa overlap); and Rv1920." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215944.1" /db_xref="GI:15608566" /db_xref="GeneID:886656" /translation="MSETDSPGNGDDAGIGDIGKFDPGLTQRLISVLRPVLKTYHRSQ VHGLDSFPPGGALVVANHSGGMFPMDVPVFSVDFYDKFGYDRPVYTLSHDILFMGLTG DLFRRTGYIRATRENAAKALRSGGVVVVFPGGDYDAYRPTFAENVIDFNGRKGYVSTA VEAGVPIVPAVSIGGQESQLYLSRGTWLARRLGLKRLLRSDILPISFGFPFGFSAAIP PNLPLPAKIVMQVLDPINLTKQFGEDPDVDAVDEHVRSVMQQALNDLAAKRRFPILG" gene 1604878..1606146 /locus_tag="Rv1429" /db_xref="GeneID:886658" CDS 1604878..1606146 /locus_tag="Rv1429" /function="UNKNOWN" /note="Rv1429, (MTCY493.25c), len: 422 aa. Conserved hypothetical protein, some similarity to transcriptional regulator proteins e.g. CDAR_ECOLI|P37047 Carbohydrate diacid regulator from Escherichia coli (391 aa), FASTA scores: opt: 210, E(): 3e-06, (27.7% identity in 296 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv2370c, Rv1194c, Rv1453, Rv2242, and Rv1186c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215945.1" /db_xref="GI:15608567" /db_xref="GeneID:886658" /translation="MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARM ADLWRASITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHRLG HARFLEVAMQYVSLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEHEHDRWLSRRS GLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWVDSAVPIGDVVAQFDQVRC LLAGELGPELGPVANSLMVPTDEREARLWFSPAPTRAFAPSRIRAAFESAGIRARLAC GRVGDGLRGFRASLKQAERVKALALAGGARPGGRVMFYDDVAPVALLADDLEELRRFV TDVLGDLSVDDERNSWLRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQ NLDDPDAAFRVQMALEVCRWMAPAVLRAKQ" gene 1606386..1607972 /gene="PE16" /locus_tag="Rv1430" /db_xref="GeneID:886652" CDS 1606386..1607972 /gene="PE16" /locus_tag="Rv1430" /function="UNKNOWN" /note="Rv1430, (MTCY493.24c), len: 528 aa. Member of the Mycobacterium tuberculosis PE family of proteins (see citation below), e.g. Y0D4_MYCTU|Q50594 (55.9% identity in 127 aa overlap). The C-terminus shows similarity to Q49633|LEPB1170_F3_112 hypothetical Mycobacterium leprae protein (391 aa), FASTA scores: opt: 342, E(): 1.2e-13, (29.8% identity in 292 aa overlap). Possible TMhelix aa 500-522." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177810.1" /db_xref="GI:57116862" /db_xref="GeneID:886652" /translation="MSFVFAVPEMVAATASDLASLGAALSEATAAAAIPTTQVLAAAA DEVSAAIAELFGAHGQEFQALSAQASAFHDRFVRALSAAAGWYVDAEAANAALVDTAA TGASELGSGGRTALILGSTGTPRPPFDYMQQVYDRYIAPHYLGYAFSGLYTPAQFQPW TGIPSLTYDQSVAEGAGYLHTAIMQQVAAGNDVVVLGFSQGASVATLEMRHLASLPAG VAPSPDQLSFVLLGNPNNPNGGILARFPGLYLQSLGLTFNGATPDTDYATTIYTTQYD GFADFPKYPLNILADVNALLGIYYSHSLYYGLTPEQVASGIVLPVSSPDTNTTYILLP NEDLPLLQPLRGIVPEPLLDLIEPDLRAIIELGYDRTGYADVPTPAALFPVHIDPIAV PPQIGAAIGGPLTALDGLLDTVINDQLNPVVTSGIYQAGAELSVAAAGYGAPAGVTNA IFIGQQVLPILVEGPGALVTADTHYLVDAIQDLAAGDLSGFNQNLQLIPATNIALLVF AAGIPAVAAVAILTGQDFPV" gene 1608083..1609852 /locus_tag="Rv1431" /db_xref="GeneID:886662" CDS 1608083..1609852 /locus_tag="Rv1431" /function="UNKNOWN" /note="Rv1431, (MTCY493.23c), len: 589 aa. Conserved membrane protein, shows strong similarity to another M. tuberculosis hypothetical protein Rv1132|MTCY22G8.21 (48.2% identity in 585 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215947.1" /db_xref="GI:15608569" /db_xref="GeneID:886662" /translation="MGFLKPDLPDVDHDTWLTQPRRTRLQVVTRDWVEHGFGTPYAVY LLYLTKIAVYVAAGAAIISLNPGLGGLSRIGDWWTQPIVYQKVIVFTLLFEVLGFGCG SGPLTGRFWPPIGGFLYWLRPNTIRLPAWPDKVPFTQGDTRTVVDVALYAIVLIGGVW ALLSPGSPGPGGTPVTAAGDVGLINPVLVVPTIVALGVLGLRDKTIFLAARGEHYWLK LFVFFFPFTDQIAAFKIIMLCLWWGAATSKLNHHFPYVVAVMTSNNALLRSRVFNPIK HLLYRDHANDLRPSWLPKLMAHGGGTTAEFLVPGILVLVADGHPWRWFLIGFMVLFHL NILSNLPMGVPLEWNVFFIFSLCYLFGHYGAITATDLRSPLLLAIVIAVVAVVIMGNL LPEKISFLPAMRYYAGNWATSIWCFRGDAEATMETSVVKSSALVVNQLAKLYDGATAE IMTDKVAAFRAMHTHGRALNGLLPRALDDEAHYRIREGEIVAGPLVGWNFGEGHLHNE QLVAAVQRRCNFADGDLRVIILEGQPIHVQKQWYRIVDAKTGLFEAGYVTVEDMLSRQ PWPEPGDEFPVHVTTQRGTPSKP" gene 1609849..1611270 /locus_tag="Rv1432" /db_xref="GeneID:886643" CDS 1609849..1611270 /locus_tag="Rv1432" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1432, (MTCY493.22c), len: 473 aa. Probable dehydrogenase (EC 1.-.-.-), shows strong simlarity to P49_STRLI|P06108 p49 protein from Streptomyces lividans (469 aa), FASTA scores: opt: 1362, E(): 0, (44.9% identity in 474 aa overlap); and weak simlarity to other dehydrogenases." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_215948.1" /db_xref="GI:15608570" /db_xref="GeneID:886643" /translation="MTTAVVVGAGPNGLAAAIHLARHGVDVQVLEARDTIGGGARSGE LTVPGVIHDHCSAFHPLGVGSPFWAAIDLQRYGLTWKWPDVDCAHPLDDGTAGVLYRS IEATAAGLGPDGKRWQRAVGDLAAGFDELAEDLLRPVLNMPRHPIRLARFGPRAALPA TAMARRFHTERARALFGGAAAHVYTRLDRPLTASLGLMILASGHRHGWPVARGGSGSI TKALAAALDAYGGTVATGVTVTSRRDIPDADIVMLDLSPAAVLGIYGDVMPTRINRSY RRYRAGSSAFKVDFAIEGDVGWTNPDCRRAGTVHLGGTFAEIADTERQRAQGTMVQRP FVLVGQQYLADPSRSVGNINPIWAYAHVPFGYTGDATAAVIDQIERFAPGFRDRIVAT VSTSTTELQTYNRNFIGGDIIGGANDRLQVIFRPRVAVDPYAIGVPGVYLCSQSAPPG AGIHGLCGYHAAESALRWLRKRR" gene 1611434..1612249 /locus_tag="Rv1433" /db_xref="GeneID:886649" CDS 1611434..1612249 /locus_tag="Rv1433" /function="UNKNOWN" /note="Rv1433, (MTCY493.21c), len: 271 aa. Possible exported protein with N-terminal signal sequence, highly similar to Q49706 hypothetical protein from Mycobacterium leprae (271 aa), FASTA scores: opt: 1341, E(): 0, (68.3% identity in 271 aa overlap). Also shows similarity to M. tuberculosis lipoprotein Rv2518c|MTV009.03c lppS (408 aa) (40.0% identity in 230 aa overlap); and others e.g. Rv0116c, Rv0192, Rv2518c, Rv0483." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215949.1" /db_xref="GI:15608571" /db_xref="GeneID:886649" /translation="MRAVFGCAIAVVGIAGSVVAGPADIHLVAAKQSYGFAVASVLPT RGQVVGVAHPVVVTFSAPITNPANRHAAERAVEVKSTPAMTGKFEWLDNDVVQWVPDR FWPAHSTVELSVGSLSSDFKTGPAVVGVASISQHTFTVSIDGVEEGPPPPLPAPHHRV HFGEDGVMPASMGRPEYPTPVGSYTVLSKERSVIMDSSSVGIPVDDPDGYRLSVDYAV RITSRGLYVHSAPWALPALGLENVSHGCISLSREDAEWYYNAVDIGDPVIVQE" gene 1612256..1612393 /locus_tag="Rv1434" /db_xref="GeneID:886634" CDS 1612256..1612393 /locus_tag="Rv1434" /function="UNKNOWN" /note="Rv1434, (MTCY493.20c), len: 45 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215950.1" /db_xref="GI:15608572" /db_xref="GeneID:886634" /translation="MRASPAERVDGAYAGAGPHTQSVLEEDQRQRAPAGAEAEGPGRT G" gene complement(1612342..1612950) /locus_tag="Rv1435c" /db_xref="GeneID:886653" CDS complement(1612342..1612950) /locus_tag="Rv1435c" /function="UNKNOWN" /note="Rv1435c, (MTCY493.19), len: 202 aa. Probable conserved Pro-, Gly-, Val-rich secreted protein (see citation below) with a N-terminal signal sequence. Similar at C-terminus to AF017099|AF017099_1 Mycobacterium tuberculosis pGB1 (87 aa), FASTA scores: opt: 550, E(): 2.3e-17, (97.7% identity in 86 aa overlap). Shows some similarity to N-terminus of CPN_DROME|Q02910 calphotin. drosophila melanogaster (865 aa), FASTA scores: opt: 266, E(): 2.5e-05, (37.2% identity in 191 aa overlap). Contains at least five 7 aa imperfect repeats. Also shows similarity to other Mycobacterium tuberculosis proteins e.g. MTCI237.20c (34.7% identity in 193 aa overlap), MTCI65.25c (36.9% identity in 160 aa overlap) and MTCI65.24c (34.2% identity in 196 aa overlap)." /codon_start=1 /transl_table=11 /product="proline, glycine, valine-rich secreted protein" /protein_id="NP_215951.1" /db_xref="GI:15608573" /db_xref="GeneID:886653" /translation="MTLMAIVNRFNIKVIAGAGLFAAAIALSPDAAADPLMTGGYACI QGMAGDAPVAAGDPVAAGGPAAAGACSAALTDMAGVPFVAPGPVPAAAPVPIGAPVPI PGAPVPIPGAPVPIPGGPVPIPGAPVPVPAVPAPVIPVGTPLIALGPVLAGAPGDGVV SAPIIGMSGVKDALTDPAPAGGPVPGQPVLPGPSASAPAGAR" repeat_region complement(1612558..1612578) /note="21 bp imperfect direct repeat 5, GGCGCACCGGTACCGGTACCC" repeat_region complement(1612579..1612599) /note="21 bp imperfect direct repeat 4, GGCGGACCGGTACCGATACCG" repeat_region complement(1612600..1612620) /note="21 bp imperfect direct repeat 3, GGCGCACCGGTACCAATCCCC" repeat_region complement(1612621..1612641) /note="21 bp imperfect direct repeat 2, GGCGCACCGGTACCGATACCG" repeat_region complement(1612642..1612662) /note="21 bp imperfect direct repeat 1, GGCGCACCGGTACCAATCCCT" gene 1613307..1614326 /gene="gap" /locus_tag="Rv1436" /db_xref="GeneID:886632" CDS 1613307..1614326 /gene="gap" /locus_tag="Rv1436" /EC_number="1.2.1.12" /function="INVOLVED IN SECOND PHASE OF GLYCOLYSIS (FIRST STEP) [CATALYTIC ACTIVITY: D-GLYCERALDEHYDE 3-PHOSPHATE + PHOSPHATE + NAD(+) = 3-PHOSPHO-D-GLYCEROYL PHOSPHATE + NADH.]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 3-phospho-D-glyceroyl phosphate from D-glyceraldehyde 3-phosphate" /codon_start=1 /transl_table=11 /product="glyceraldehyde-3-phosphate dehydrogenase" /protein_id="NP_215952.1" /db_xref="GI:15608574" /db_xref="GeneID:886632" /translation="MTVRVGINGFGRIGRNFYRALLAQQEQGTADVEVVAANDITDNS TLAHLLKFDSILGRLPCDVGLEGDDTIVVGRAKIKALAVREGPAALPWGDLGVDVVVE STGLFTNAAKAKGHLDAGAKKVIISAPATDEDITIVLGVNDDKYDGSQNIISNASCTT NCLAPLAKVLDDEFGIVKGLMTTIHAYTQDQNLQDGPHKDLRRARAAALNIVPTSTGA AKAIGLVMPQLKGKLDGYALRVPIPTGSVTDLTVDLSTRASVDEINAAFKAAAEGRLK GILKYYDAPIVSSDIVTDPHSSIFDSGLTKVIDDQAKVVSWYDNEWGYSNRLVDLVTL VGKSL" misc_feature 1613772..1613795 /gene="gap" /locus_tag="Rv1436" /note="PS00071 Glyceraldehyde 3-phosphate dehydrogenase active site" gene 1614329..1615567 /gene="pgk" /locus_tag="Rv1437" /db_xref="GeneID:886636" CDS 1614329..1615567 /gene="pgk" /locus_tag="Rv1437" /EC_number="2.7.2.3" /function="INVOLVED IN THE SECOND PHASE OF GLYCOLYSIS (SECOND STEP) [CATALYTIC ACTIVITY : ATP + 3-PHOSPHO-D-GLYCERATE = ADP + 3-PHOSPHO-D-GLYCEROYL PHOSPHATE]" /note="Converts 3-phospho-D-glycerate to 3-phospho-D-glyceroyl phosphate during the glycolysis pathway" /codon_start=1 /transl_table=11 /product="phosphoglycerate kinase" /protein_id="NP_215953.1" /db_xref="GI:15608575" /db_xref="GeneID:886636" /translation="MSVANLKDLLAEGVSGRGVLVRSDLNVPLDEDGTITDAGRIIAS APTLKALLDADAKVVVAAHLGRPKDGPDPTLSLAPVAVALGEQLGRHVQLAGDVVGAD ALARAEGLTGGDILLLENIRFDKRETSKNDDDRRALAKQLVELVGTGGVFVSDGFGVV HRKQASVYDIATLLPHYAGTLVADEMRVLEQLTSSTQRPYAVVLGGSKVSDKLGVIES LATKADSIVIGGGMCFTFLAAQGFSVGTSLLEDDMIEVCRGLLETYHDVLRLPVDLVV TEKFAADSPPQTVDVGAVPNGLMGLDIGPGSIKRFSTLLSNAGTIFWNGPMGVFEFPA YAAGTRGVAEAIVAATGKGAFSVVGGGDSAAAVRAMNIPEGAFSHISTGGGASLEYLE GKTLPGIEVLSREQPTGGVL" misc_feature 1615298..1615321 /gene="pgk" /locus_tag="Rv1437" /note="PS00111 Phosphoglycerate kinase signature" gene 1615564..1616349 /gene="tpiA" /locus_tag="Rv1438" /db_xref="GeneID:886628" CDS 1615564..1616349 /gene="tpiA" /locus_tag="Rv1438" /EC_number="5.3.1.1" /function="PLAYS AN IMPORTANT ROLE IN SEVERAL METABOLIC PATHWAYS [CATALYTIC ACTIVITY : D-GLYCERALDEHYDE 3-PHOSPHATE = GLYCERONE PHOSPHATE]" /experiment="experimental evidence, no additional details recorded" /note="Reversibly isomerizes the ketone sugar dihydroxyacetone phosphate to the aldehyde sugar glyceraldehyde-3-phosphate" /codon_start=1 /transl_table=11 /product="triosephosphate isomerase" /protein_id="NP_215954.1" /db_xref="GI:15608576" /db_xref="GeneID:886628" /translation="MSRKPLIAGNWKMNLNHYEAIALVQKIAFSLPDKYYDRVDVAVI PPFTDLRSVQTLVDGDKLRLTYGAQDLSPHDSGAYTGDVSGAFLAKLGCSYVVVGHSE RRTYHNEDDALVAAKAATALKHGLTPIVCIGEHLDVREAGNHVAHNIEQLRGSLAGLL AEQIGSVVIAYEPVWAIGTGRVASAADAQEVCAAIRKELASLASPRIADTVRVLYGGS VNAKNVGDIVAQDDVDGGLVGGASLDGEHFATLAAIAAGGPLP" misc_feature 1616071..1616103 /gene="tpiA" /locus_tag="Rv1438" /note="PS00171 Triosephosphate isomerase active site" gene complement(1616961..1617386) /locus_tag="Rv1439c" /db_xref="GeneID:886630" CDS complement(1616961..1617386) /locus_tag="Rv1439c" /function="UNKNOWN" /note="Rv1439c, (MTCY493.15), len: 141 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215955.1" /db_xref="GI:15608577" /db_xref="GeneID:886630" /translation="MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRH GLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLL RGDKAVERVSYFDSLPLLIKVVKHPSAWRGWLTTMRSRA" gene 1617837..1618070 /gene="secG" /locus_tag="Rv1440" /db_xref="GeneID:886624" CDS 1617837..1618070 /gene="secG" /locus_tag="Rv1440" /function="INVOLVED IN PROTEIN EXPORT. PARTICIPATES IN A EARLY EVENT OF PROTEIN TRANSLOCATION." /note="Rv1440, (MTCY493.14c), len: 77 aa. Probable secG, protein-export membrane protein (translocase subunit) (see citation below), similar to many e.g. P38388|SECG_MYCLE PROBABLE PROTEIN-EXPORT MEMBRANE (77 aa), FASTA scores: opt: 450, E(): 6.7e-24, (96.1% identity in 77 aa overlap). Start changed since original submission (-40 aa). PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA|Rv3240c, SECD|Rv2587c, SECE|Rv0638, SECF|Rv2586c, SECG AND SECY|Rv0732." /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecG" /protein_id="NP_215956.2" /db_xref="GI:57116863" /db_xref="GeneID:886624" /translation="MELALQITLIVTSVLVVLLVLLHRAKGGGLSTLFGGGVQSSLSG STVVEKNLDRLTLFVTGIWLVSIIGVALLIKYR" gene complement(1618209..1619684) /gene="PE_PGRS26" /locus_tag="Rv1441c" /db_xref="GeneID:886626" CDS complement(1618209..1619684) /gene="PE_PGRS26" /locus_tag="Rv1441c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1441c, (MTCY493.13), len: 491 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002), similar to Y0DP_MYCTU|Q50615 hypothetical glycine-rich 40.8 kDa protein (498 aa), fasta scores: opt: 1625, E(): 0, (55.2% identity in 518 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177811.1" /db_xref="GI:57116864" /db_xref="GeneID:886626" /translation="MSNVMVVPGMLSAAAADVASIGAALSAANGAAAPTTAGVLAAGA DEVSAAIASLFSGYARDYQALSAQMARFHQQFVQALTASVGSYAAAEAANASPLQALE QQVLAAINAPTQTLLGRPLIGNGADGLPGQNGGAGGLLWGNGGNGGAGDAAHPNGGNG GDAGMFGNGGAGGAGYSPAAGTGAAGGAGGAGGAGGWLSGNGGAGGNGGTGASGADGG GGLPPVPASPGGNGGGGDAGGAAGMFGTGGAGGTGGDGGAGGAGDSPNSGANGARGGD GGNGAAGGAGGRLFGNGGAGGNGGTAGQGGDGGTALGAGGIGGDGGTGGAGGTGGTAG IGGSSAGAGGAGGDGGAGGTGGGSSMIGGKGGTGGNGGVGGTGGASALTIGNGSSAGA GGAGGAGGTGGTGGYIESLDGKGQAGNGGNGGNGAAGGAGGGGTGAGGNGGAGGNGGD GGPSQGGGNPGFGGDGGTGGPGGVGVPDGIGGANGAQGKHG" gene 1619791..1622091 /gene="bisC" /locus_tag="Rv1442" /db_xref="GeneID:886620" CDS 1619791..1622091 /gene="bisC" /locus_tag="Rv1442" /function="THIS ENZYME MAY SERVE AS A SCAVENGER, ALLOWING THE CELL TO UTILIZE BIOTIN SULFOXIDE AS A BIOTIN SOURCE" /note="Rv1442, (MTCY493.12c), len: 766 aa. Probable bisC, Biotin sulfoxide reductase (EC 1.-.-.-), similar to BISC_ECOLI|P20099 biotin sulfoxide reductase from Escherichia coli (739 aa), FASTA scores: opt: 1271, E():0, (40.2% identity in 744 aa overlap)." /codon_start=1 /transl_table=11 /product="biotin sulfoxide reductase" /protein_id="NP_215958.1" /db_xref="GI:15608580" /db_xref="GeneID:886620" /translation="MQVYTSATHWGVFTARVHGGDIAAVAALASDTNPAPQLQNLPGA VRHRSRIANPAVRRGWLQHGPGPSSARGAEEFVEVSWDELIELLASELRRTVDRYGNE AIYGSSYGWASAGRFHHAQSQVHRFLNMLGGYTASRHSYSAGASEVIFPHIVGAALFE ALAETTTWDVIVDHTALLVAFGGLPVKNTAVMPGGTTAHPDRDYVGRYRARGGRLVSV SPLRDDIAAIAGPLDDRCRWLAPVPGTDVAIMLGLAYVLATESLADRAFLGRYCTGYE RFERYLLGLDDGIPKTPEWAAALSGLAAGDLRDLARRMAEHRTLITTSLSLQRIEHGE QTVWMAATLAAMLGQIGLPGGGFGHGYSSNGVGNPPLACGLPALPQGNNPVSTFIPVA AISELLQRPGQRLAYNGRLLELPDIKCVYWAGGNPFHHHQNLPRLRRALSRVDTIVVH EQYWTAMAKHADIVVPTTTSFERDDFAASKTNPTLIAMPAMVPPYANARDDYHTFSAL AHRLGFGKQFTEGRSAREWLEHMYDKWSAELDFPVPSFAEFWRTGRLELPTRTGLTWL ADFRADPAAHPLGTPSGRIEIFSDTVDAFALPDCAGHPTWYEPSEWLGGPRAARYPLH LIANQPRTRLHSQLDHGGASMASKIRGREPIRIHPDDAAARELTDGDIVRVFNDRGAC LAGVVIDDGLRPKVVQLSTGAWFDPADPRDPDSMCVHGNPNALSNDSGTSSLAHGSTG QHVLVQIERFTGELPPVRAHEPPRLA" gene complement(1622207..1622692) /locus_tag="Rv1443c" /db_xref="GeneID:886622" CDS complement(1622207..1622692) /locus_tag="Rv1443c" /function="UNKNOWN" /note="Rv1443c, (MTCY493.11), len: 161 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215959.1" /db_xref="GI:15608581" /db_xref="GeneID:886622" /translation="MVGYAEPVLIERQSVVAAPAEQVWQRVVTPEGINDELRPWMTMS VPRGAKGMTVDTVPIGAPIGRAWLRLFGVLPFDYDRLSIAELEPGRRFREDSTMLSMR QWQHERTVTPEGDTKTIVRDRITFQTRAGLRFAAPLIAAGLRALFGHRHRRLQRHFAQ G" gene complement(1623287..1623697) /locus_tag="Rv1444c" /db_xref="GeneID:886616" CDS complement(1623287..1623697) /locus_tag="Rv1444c" /function="UNKNOWN" /note="Rv1444c, (MTCY493.10), len: 136 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215960.1" /db_xref="GI:15608582" /db_xref="GeneID:886616" /translation="MTVMADRSGRPAPVRRRMKTLTQAALNADKTVEQVEDVLDGLGK TMAELNSSLSQLNSTVERLEDGLDHLEGTLHSLDDLAKRLIVLVEPVEAIVDRIDYIV SLGETVMSPLSVTEHAVRGVLDRLRNRTVHEPTN" gene complement(1623714..1624457) /gene="devB" /locus_tag="Rv1445c" /db_xref="GeneID:886617" CDS complement(1623714..1624457) /gene="devB" /locus_tag="Rv1445c" /EC_number="3.1.1.31" /function="INVOLVED IN PENTOSE PHOSPHATE PATHWAY. HYDROLYSIS OF 6-PHOSPHOGLUCONOLACTONE TO 6- PHOSPHOGLUCONATE. [CATALYTIC ACTIVITY : 6-PHOSPHO-D-GLUCONO-1,5-LACTONE + H(2)O = 6- PHOSPHO-D-GLUCONATE]" /note="catalyzes the formation of 6-phospho-D-gluconate from 6-phospho-D-glucono-1,5-lactone" /codon_start=1 /transl_table=11 /product="6-phosphogluconolactonase" /protein_id="NP_215961.1" /db_xref="GI:15608583" /db_xref="GeneID:886617" /translation="MSSSIEIFPDSDILVAAAGKRLVGAIGAAVAARGQALIVLTGGG NGIALLRYLSAQAQQIEWSKVHLFWGDERYVPEDDDERNLKQARRALLNHVDIPSNQV HPMAASDGDFGGDLDAAALAYEQVLAASAAPGDPAPNFDVHLLGMGPEGHINSLFPHS PAVLESTRMVVAVDDSPKPPPRRITLTLPAIQRSREVWLLVSGPGKADAVAAAIGGAD PVSVPAAGAVGRQNTLWLLDRDAAAKLPS" gene complement(1624454..1625365) /gene="opcA" /locus_tag="Rv1446c" /db_xref="GeneID:886612" CDS complement(1624454..1625365) /gene="opcA" /locus_tag="Rv1446c" /function="MAY BE INVOLVED IN THE FUNCTIONAL ASSEMBLY OF GLUCOSE 6-PHOSPHATE DEHYDROGENASE" /note="Rv1446c, (MTCY493.08), len: 303 aa. Putative opcA, OxPP cycle protein. Highly similar to S72774 B1496_F1_30 protein from Mycobacterium leprae (265 aa), FASTA scores: opt: 1056, E(): 0, (70.3% identity in 239 aa overlap). Also similar to OPCA_NOSS2|P48971 putative oxppcycle protein opca from Nostoc punctiforme (465 aa), fasta scores: opt: 177, E(): 7.3e-05, (23.4% identity in 321 aa overlap). AIDS IN G6PD ACTIVITY." /codon_start=1 /transl_table=11 /product="putative OXPP cycle protein OPCA" /protein_id="NP_215962.1" /db_xref="GI:15608584" /db_xref="GeneID:886612" /translation="MIVDLPDTTTTAVNKKLDELREKIGAVAMGRVLTLIIAPDSEAM LEESIEAANDASHEHPSRIIVTMRGDPYADRPRLDAQLRVGADAGAGEFVVLRLSGPL AGHADSVVIPFLLPDIPVVAWWPDIAPAVPAQDALGKLAIRRITDATNAIDPLSAIKS RLAGYGAGDTDLAWSRITYWRALLTSAVDQPRHEPIESALVSGLKTEPALDVLAGWLA SRIEGPVRRAVGELKVELVRNSETIVLSRPQEGITATLTRTGKPDALVPLARRVTGEC LAEDLRRLDPDEIYCAALEGIKKVQYR" repeat_region complement(1625366..1625418) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene complement(1625418..1626962) /gene="zwf2" /locus_tag="Rv1447c" /db_xref="GeneID:886614" CDS complement(1625418..1626962) /gene="zwf2" /locus_tag="Rv1447c" /EC_number="1.1.1.49" /function="INVOLVED IN PENTOSE PHOSPHATE PATHWAY [CATALYTIC ACTIVITY : D-GLUCOSE 6-PHOSPHATE + NADP(+) = D-GLUCONO- 1,5-LACTONE 6-PHOSPHATE + NADPH]" /note="catalyzes the formation of D-glucono-1,5-lactone 6-phosphate from D-glucose 6-phosphate" /codon_start=1 /transl_table=11 /product="glucose-6-phosphate 1-dehydrogenase" /protein_id="NP_215963.1" /db_xref="GI:15608585" /db_xref="GeneID:886614" /translation="MKPAHAAASWRNPLRDKRDKRLPRIAGPCGMVIFGVTGDLARKK VMPAVYDLANRGLLPPTFSLVGFARRDWSTQDFGQVVYNAVQEHCRTPFRQQNWDRLA EGFRFVPGTFDDDDAFAQLAETLEKLDAERGTGGNHAFYLAIPPKSFPVVCEQLHKSG LARPQGDRWSRVVIEKPFGHDLASARELNKAVNAVFPEEAVFRIDHYLGKETVQNILA LRFANQLFDPIWNAHYVDHVQITMAEDIGLGGRAGYYDGIGAARDVIQNHLMQLLALT AMEEPVSFHPAALQAEKIKVLSATRLAEPLDQTTSRGQYAAGWQGGEKVVGLLDEEGF AEDSTTETFAAITLEVDTRRWAGVPFYLRTGKRLGRRVTEIALVFRRAPHLPFDATMT DELGTNAMVIRVQPDEGVTLRFGSKVPGTAMEVRDVNMDFSYGSAFAEDSPEAYERLI LDVLLGEPSLFPVNAEVELAWEILDPALEHWAAHGTPDAYEAGTWGPESSLEMLRRTG REWRRP" misc_feature complement(1626330..1626350) /gene="zwf2" /locus_tag="Rv1447c" /note="PS00069 Glucose-6-phosphate dehydrogenase active site" gene complement(1626959..1628080) /gene="tal" /locus_tag="Rv1448c" /db_xref="GeneID:886606" CDS complement(1626959..1628080) /gene="tal" /locus_tag="Rv1448c" /EC_number="2.2.1.2" /function="TRANSALDOLASE IS IMPORTANT FOR THE BALANCE OF METABOLITES IN THE PENTOSE-PHOSPHATE PATHWAY [CATALYTIC ACTIVITY : SEDOHEPTULOSE 7-PHOSPHATE + D-GLYCERALDEHYDE 3-PHOSPHATE = D-ERYTHROSE 4-PHOSPHATE + D-FRUCTOSE 6-PHOSPHATE]" /note="catalyzes the reversible formation of D-erythrose 4-phosphate and D-fructose 6-phosphate from sedoheptulose 7-phosphate and D-glyceraldehyde 3-phosphate" /codon_start=1 /transl_table=11 /product="transaldolase" /protein_id="NP_215964.1" /db_xref="GI:15608586" /db_xref="GeneID:886606" /translation="MTAQNPNLAALSAAGVSVWLDDLSRDRLRSGNLQELIDTKSVVG VTTNPSIFQKALSEGHTYDAQIAELAARGADVDATIRTVTTDDVRSACDVLVPQWEDS DGVDGRVSIEVDPRLAHETEKTIQQAIELWKIVDRPNLFIKIPATKAGLPAISAVLAE GISVNVTLIFSVQRYREVMDAYLTGMEKARQAGHSLSKIHSVASFFVSRVDTEIDKRL DRIGSRQALELRGQAGVANARLAYATYREVFEDSDRYRSLKVDGARVQRPLWASTGVK NPDYSDTLYVTELVAPHTVNTMPEKTIDAVADHGVIQGDTVTGTASDAQAVFDQLGAI GIDLTDVFAVLEEEGVRKFEASWNELLQETRAHLDTAAQ" gene complement(1628097..1630199) /gene="tkt" /locus_tag="Rv1449c" /db_xref="GeneID:886638" CDS complement(1628097..1630199) /gene="tkt" /locus_tag="Rv1449c" /EC_number="2.2.1.1" /function="This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. It catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor [CATALYTIC ACTIVITY: Sedoheptulose 7-phosphate + D-glyceraldehyde 3-phosphate = D-ribose 5-phosphate + D-xylulose 5-phosphate]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of ribose 5-phosphate and xylulose 5-phosphate from sedoheptulose 7-phosphate and glyceraldehyde 3-phosphate; can transfer ketol groups between several groups; in Escherichia coli there are two tkt genes, tktA expressed during exponential growth and the tktB during stationary phase" /codon_start=1 /transl_table=11 /product="transketolase" /protein_id="NP_215965.1" /db_xref="GI:15608587" /db_xref="GeneID:886638" /translation="MTTLEEISALTRPRHPDYWTEIDSAAVDTIRVLAADAVQKVGNG HPGTAMSLAPLAYTLFQRTMRHDPSDTHWLGRDRFVLSAGHSSLTLYIQLYLGGFGLE LSDIESLRTWGSKTPGHPEFRHTPGVEITTGPLGQGLASAVGMAMASRYERGLFDPDA EPGASPFDHYIYVIASDGDIEEGVTSEASSLAAVQQLGNLIVFYDRNQISIEDDTNIA LCEDTAARYRAYGWHVQEVEGGENVVGIEEAIANAQAVTDRPSFIALRTVIGYPAPNL MDTGKAHGAALGDDEVAAVKKIVGFDPDKTFQVREDVLTHTRGLVARGKQAHERWQLE FDAWARREPERKALLDRLLAQKLPDGWDADLPHWEPGSKALATRAASGAVLSALGPKL PELWGGSADLAGSNNTTIKGADSFGPPSISTKEYTAHWYGRTLHFGVREHAMGAILSG IVLHGPTRAYGGTFLQFSDYMRPAVRLAALMDIDTIYVWTHDSIGLGEDGPTHQPIEH LSALRAIPRLSVVRPADANETAYAWRTILARRNGSGPVGLILTRQGVPVLDGTDAEGV ARGGYVLSDAGGLQPGEEPDVILIATGSEVQLAVAAQTLLADNDILARVVSMPCLEWF EAQPYEYRDAVLPPTVSARVAVEAGVAQCWHQLVGDTGEIVSIEHYGESADHKTLFRE YGFTAEAVAAAAERALDN" misc_feature complement(1630047..1630109) /gene="tkt" /locus_tag="Rv1449c" /note="PS00801 Transketolase signature 1" gene complement(1630638..1634627) /gene="PE_PGRS27" /locus_tag="Rv1450c" /db_xref="GeneID:886605" CDS complement(1630638..1634627) /gene="PE_PGRS27" /locus_tag="Rv1450c" /function="UNKNOWN" /note="Rv1450c, (MTCY493.04), len: 1329 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kDa protein (603 aa), fasta scores: opt: 2112, E(): 0, (56.5% identity in 630 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177812.1" /db_xref="GI:57116865" /db_xref="GeneID:886605" /translation="MSLVIVAPETVAAAALDVARIGSSIGAANAAAAGSTTSVLAAGA DEVSAAIATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLATLE HNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAGGSGAPGQVGGA GGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGGAGGQSLLGGATGGAGGNAG LFGVGGTGGPGGPGGPGGVGGTGGAGGLGGTLYGAGGHGGAGGPGPIGGVGGHGGVGG AAGLLGVGGHGGAGGHGAEGVAGAAGEDLSPHGTSGGVGGDAGDGGTGGRGGWLAGAG GAGGAGGVGGTGGAGGAGFSRALIVAGDNGGDPGAGGAGGTGGAGSTIGAHGAAGASP TSGGNGGAGGNGAHFSSGGKAGGNGGAGGAGGLVGNGGAGGAGGNGAPGAPPSGGDPN GGGGGAGGAGGKGGDGGAQAGDGGAGGAGGKGGNGGNGATGATGLNGLGAGADGTDGG KGGNGGAGGGGGAGGQGGKALAATHQDGSMGAGGAGGNGGAGGMGGDGGNGAKGTFDN GGDGVGGNGGNGGSRGIGGAGGIGGAGSTAGADGARGATPTSGGNGGTGGNGANATVA GGAGGAGGKGGNGGLVGNGGAGGKGGDGMAGVAGSSPTTAGESGTSGQNGGAGGAGGA GGRGGDFGGDGGTGGAGGNGANGANATTPGAKGGDGGHGGPGAQGGNGGQGGPGGLAG NLFGQNGIQGVGGSGGKGGAGGLAGDGGNGANGNFAFGDGNGGHGGNGGNPGAGGQGG SGGAGSTPGAKGAHGFTPTSGGDGGDGGNGGNSQVVGGNGGDGGNGGNGGSAGTGGNG GRGGDGAFGGMSANATNPGENGPNGNPGGNGGAGGAGGAGLNGGNGGAGGNGGLGGFG GNGAAGANGVAVGAPGQPGGAGGHGGAGGNGGAGGNGGQGVVSDGAGGAGGAGGDGGA PGDGANGGNGQGAGAFAGGGGGRGGDGGNAGNAGAGGPGGTGSTAGKAGPAGSILHDG GNGGHGGHGAASGGNGGPGGHGGNGGNGGTGANGGNGGIGGTGGAGSTGAKGVLGTNE GDGGDGGRGGNGGRGGNGGQGLTGAGGNGGTGGTPGNGGNGGNGASGDLVTSPGDGGG GGRGGDAGRGGDAGLGGSSGPGGTPGDWGTGGTGGTGGTGGQGANGGLTGGRGGTGGN GGNGNTGGTGGAGGTGGTGHNGSQPGMGGNGGAGGFGGNGFAGVGGRGGMGGSGGTGG TGDAGPFGTGTGGTGGHGGQGGGGGFSILLGLGGLGGLGSPGSIATGTAGGAGGGGGF GGLGGGEFV" repeat_region complement(1633531..1634790) /note="1260 bp imperfect direct repeat 2, first copy at 1637133..1638392" gene 1635029..1635955 /gene="ctaB" /locus_tag="Rv1451" /db_xref="GeneID:886666" CDS 1635029..1635955 /gene="ctaB" /locus_tag="Rv1451" /function="THOUGHT TO BE INVOLVED IN AEROBIC RESPIRATION." /note="converts protoheme IX and farnesyl diphosphate to heme O" /codon_start=1 /transl_table=11 /product="protoheme IX farnesyltransferase" /protein_id="NP_215967.1" /db_xref="GI:15608589" /db_xref="GeneID:886666" /translation="MNVRGRVAPRRVTGRAMSTLLAYLALTKPRVIELLLVTAIPAML LADRGAIHPLLMLNTLVGGMMAAAGANTLNCVADADIDKVMKRTARRPLAREAVPTRN ALALGLTLTVISFFWLWCATNLLAGVLALVTVAFYVFVYTLWLKRRTSQNVVWGGAAG CMPVMIGWSAITGTIAWPALAMFAIIFFWTPPHTWALAMRYKQDYQVAGVPMLPAVAT ERQVTKQILIYTWLTVAATLVLALATSWLYGAVALVAGGWFLTMAHQLYAGVRAGEPV RPLRLFLQSNNYLAVVFCALAVDSVIALPTLH" gene complement(1636004..1638229) /gene="PE_PGRS28" /locus_tag="Rv1452c" /db_xref="GeneID:886595" CDS complement(1636004..1638229) /gene="PE_PGRS28" /locus_tag="Rv1452c" /function="UNKNOWN" /note="Rv1452c, (MTCY493.02), len: 741 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kDa protein (603 aa), fasta scores: opt: 2090, E(): 0, (56.3% identity in 641 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177813.1" /db_xref="GI:57116866" /db_xref="GeneID:886595" /translation="MSLVIVTPETVAAAASDVARIGSSIGVANSAAAGSTTSVLAAGA DEVSAAIATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLATLE HNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAGGSGAPGQVGGA GGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGGAGGQSLLGGATGGAGGNAG LFGVGGTGGPGGPGGPGGVGGTGGAGGLGGTLYGAGGHGGAGGPGPIGGVGGHGGVGG AAGLLGVGGHGGAGGHGAEGVAGAAGEDLSPHGTSGGVGGDAGDGGTGGRGGWLAGAG GAGGAGGVGGTGGAGGAGFSRALIVAGDNGGDGGNGGMGGAGGAGGPGGAGGLISLLG GQGAGGAGGTGGAGGVGGDRGAGGPGNQAFNAGAGGAGGHGGDPGAGGAGGTGGAGSI TGAQGAIGATPTSGGNGGAGGNGANATTAGTNGANGGPGGHGGLVGNGGAGGNGANGA AGTNASDSGAVGGKGNSGGNGGQGGAGGDGGTLAGNGGAGGTGGRGADGGLGGSGAEG ANATTAGERGQDGGKGGNGGVGGTGGNAVAPGANGGHGGNGGNPGFSGAGGLGGLSGD GVTRAAQGATPDFADTGGKGGNGGNGANAVAPGGTGASGGAGGNAGAGGKGGENIIGD GGGGNGGAGGKGGAGTLLGLTVFGDNGGAGVLGDSTDPDGSGGAGGAGGAGGAGGDPT I" repeat_region complement(1637133..1638392) /note="1260 bp imperfect direct repeat 1, second copy at 1633531..1634790" gene 1638381..1639646 /locus_tag="Rv1453" /db_xref="GeneID:886610" CDS 1638381..1639646 /locus_tag="Rv1453" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1453, (MTCY493.01c), len: 421 aa. Possible transcriptional activator, similar to Q50018 putative transcriptional activator trx from Mycobacterium leprae (517 aa), FASTA scores: opt: 1719, E(): 0, (54.0% identity in 500 aa overlap). Also highly similar to Mycobacterium tuberculosis proteins Rv2370c, Rv1194c, Rv2242, Rv1186c, and to the further upstream ORF's Rv1429|MTCY493.25c (28.1% identity in 335 aa overlap). Start changed since first submission (-11 aa)." /codon_start=1 /transl_table=11 /product="transcriptional activator protein" /protein_id="NP_215969.2" /db_xref="GI:57116867" /db_xref="GeneID:886610" /translation="MALRETSPRIHELIREAARIALNPTQEWLDEFDRAILAANPSIA ADPALATVVKRSNRAHLIHFAAANLRNPGAPVPANLGPEPLRMARDLVRVGLDALALD IYRIGQNVAWRRWTDIAFGLTSDPDELHELLDVPFRTANEFVDTTLAGITTEMQLERD KLTRDVPAERRKIVQLLIDGAPISREHAEARLGYPLDRSHTAAVIWGDQAQGDHSHLD RVADAFGHAGGCPHPLVVVAGAATRWVWVKDAPGFDIDLIHEVLHDIPDARIAIGATA PGIEGFRRSHRDALTTARMIIRLESPHRVAFFTDVEMVALLTENAEGADDFIQRTLGN LESASPALKTTLLTFINQQCNASRAARLLFTHRNTLMNRLETAQRLLPRPLADTTIHV AVALEAQQWREKPTSDPPAKKESNGTKMR" gene complement(1639674..1640660) /gene="qor" /locus_tag="Rv1454c" /db_xref="GeneID:886589" CDS complement(1639674..1640660) /gene="qor" /locus_tag="Rv1454c" /EC_number="1.6.5.5" /function="Catalyzes the one electron reduction of certain quinones [CATALYTIC ACTIVITY: NADPH + quinone = NADP+ + semiquinone]" /experiment="experimental evidence, no additional details recorded" /note="Rv1454c, (MTV007.01c), len: 328 aa. Probable qor, quinone oxidoreductase (EC 1.6.5.5), simiar to U87282|RCU87282_2 quinone oxidoreductase from Rhodobacter capsulatus (323 aa), FASTA scores: opt: 849, E(): 0, (44.7% identity in 329 aa overlap). Also similar to MTCY180.06 Hypothetical protein from Mycobacterium tuberculosis (334 aa), FASTA scores: opt: 430, E(): 2e-14, (32.3% identity in 350 aa overlap). TBparse score is 0.887. Contains PS01162 Quinone oxidoreductase / zeta-crystallin signature." /codon_start=1 /transl_table=11 /product="quinone reductase" /protein_id="NP_215970.1" /db_xref="GI:15608592" /db_xref="GeneID:886589" /translation="MHAIEVTETGGPGVLRHVDQPQPQPGHGELLIKAEAIGVNFIDT YFRSGQYPRELPFVIGSEVCGTVEAVGPGVTAADTAISVGDRVVSASANGAYAEFCTA PASLTAKVPDDVTSEVAASALLKGLTAHYLLKSVYPVKRGDTVLVHAGAGGVGLILTQ WATHLGVRVITTVSTAEKAKLSKDAGADVVLDYPEDAWQFAGRVRELTGGTGVQAVYD GVGATTFDASLASLAVRGTLALFGAASGPVPPVDPQRLNAAGSVYLTRPSLFHFTRTG EEFSWRAAELFDAIGSEAITVAVGGRYPLADALRAHQDLEARKTVGSVVLLP" misc_feature complement(1640181..1640234) /gene="qor" /locus_tag="Rv1454c" /note="PS01162 Quinone oxidoreductase / zeta-crystallin signature" gene 1640680..1641543 /locus_tag="Rv1455" /db_xref="GeneID:886597" CDS 1640680..1641543 /locus_tag="Rv1455" /function="UNKNOWN" /note="Rv1455, (MTV007.02), len: 287 aa. Conserved hypothetical protein, some similarity from aa 80-160 to Z99125|MLCL536.35c hypothetical Mycobacterium leprae protein (101 aa), FASTA scores: opt: 238, E(): 1.8e-08, (51.3% identity in 78 aa overlap). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215971.1" /db_xref="GI:15608593" /db_xref="GeneID:886597" /translation="MKLARPDVFHPRVVLAGWPQQPAGDGDDAGLVAALRHRGLHAGW LSWDDPEIVHADLVILRATRDYPARLDEFLAWTTRVANLLNSRPVVAWNVERRYLRDL MDRGVPTVPGEVYVPGEPVRLPRKGQVFVGPTIGTGTRRCSARFAAEFVAQLHAAGQA VLVQPGGSGDETVLVFLGGEPSHAFTKQADTWRQTEPDFEIWDVGAAAVAGAAAQVGV DPGELLYARAHITGGSRDPRLLELQLVDPSLGWQWLDPDIRNLAQRDFALCVQSALER LGLGPFSHRRP" gene complement(1641493..1642425) /locus_tag="Rv1456c" /db_xref="GeneID:886591" CDS complement(1641493..1642425) /locus_tag="Rv1456c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF ANTIBIOTIC ACROSS THE MEMBRANE (EXPORT): UNIDENTIFIED ANTIBIOTIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1456c, (MTV007.03c), len: 310 aa. Possible unidentified antibiotic-transport integral membrane protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.34 from Mycobacterium leprae (311 aa), FASTA scores: opt: 1607, E(): 0, (83.3% identity in 300 aa overlap). TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="unidentified antibiotic-transport integral membrane ABC transporter" /protein_id="NP_215972.1" /db_xref="GI:15608594" /db_xref="GeneID:886591" /translation="MPYDRAVSPSLRVQRVIAAIVILTQGGIAVTGAIVRVTASGLGC PTWPQCFPGSFTPVVVAEVPRVHQAVEFGNRMVTFAVVIAAALAVLVVTRARRRTEVL AYAWLMPVSTVVQAMIGGITVRTGLLWWTVAIHLLASMTMVWLAVLLYVKIGQPDDGV VHELVVSPLRALTALSALNLAAVLVTGTLVTAAGPHAGDRSPSRTVPRLKVEITTLVH MHSSLLVAYLALLIGLGFGLLAVGATRAILVRLAVLLALVATQAAVGTTQYFTGVPAA LVAIHVAGAAAVTAATAALWASMGERAQPQPLQR" gene complement(1642537..1643322) /locus_tag="Rv1457c" /db_xref="GeneID:886587" CDS complement(1642537..1643322) /locus_tag="Rv1457c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF ANTIBIOTIC ACROSS THE MEMBRANE (EXPORT): UNIDENTIFIED ANTIBIOTIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1457c, (MTV007.04c), len: 261 aa. Possible unidentified antibiotic-transport integral membrane protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.32 from Mycobacterium leprae (265 aa), FASTA scores: opt: 1415, E(): 0, (83.1% identity in 260 aa overlap). TBparse score is 0.877." /codon_start=1 /transl_table=11 /product="unidentified antibiotic-transport integral membrane ABC transporter" /protein_id="NP_215973.1" /db_xref="GI:15608595" /db_xref="GeneID:886587" /translation="MTQTNRPAFPAGTFSPDPRPNAVPLMLAAQFSLELKLLLRNGEQ LLLTMFIPITLLVGLTLLPMGSFGHNRAATFVPVIMALAVISTAFTGQAIAVAFDRRY GALKRLGATPLPVWGIIAGKSLAVVAVVFLQAIILGAIGFALGWRPALTALTLGAGII ALGTAGFAALGLLLGGTLRAEIVLAVANLMWFVFAGFGALTLESNVIPTAFKWVARVT PSGALTEALSQAMTVSVDWFGIVVLAVWGALAALAALRWFRFT" gene complement(1643319..1644260) /locus_tag="Rv1458c" /db_xref="GeneID:886582" CDS complement(1643319..1644260) /locus_tag="Rv1458c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF ANTIBIOTIC ACROSS THE MEMBRANE (EXPORT): UNIDENTIFIED ANTIBIOTIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1458c, (MTV007.05c), len: 313 aa. Possible unidentified antibiotic-transport ATP-binding protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.31 from Mycobacterium leprae (315 aa), FASTA scores: opt: 1812, E(): 0, (88.0% identity in 308 aa overlap). Similar to AF027770|AF027770_7 ABC-type transporter in FxbA region in Mycobacterium smegmatis (284 aa), FASTA scores: opt: 1412, E(): 0, (85.1% identity in 248 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.874." /codon_start=1 /transl_table=11 /product="unidentified antibiotic-transport ATP-binding protein ABC transporter" /protein_id="NP_215974.1" /db_xref="GI:15608596" /db_xref="GeneID:886582" /translation="MNRAPDTPEVVLRLRGVCKRYGSITAVSNLDLDVHDAEVMALLG PNGAGKTTTVEMCEGFVRPDAGSIEVLGLDPITDNARLRARIGVMLQGGGGYPAARAG EMLDLVASYAANPLDPHWLLDTLGLTEAARTTYRRLSGGQQQRLALACALVGRPQLVF LDEPTAGMDAHARVLVWELIDALRRDGVTVVLTTHHLKEAEELADRLVIIDHGVTVAA GTPAELMRSGAKDQLRFTAPPRLDLSLLASALPEGYQATELTPGEYLVEGPVDPQVLA TVTAWCAQIDVLATDMRVEQRSLEDVFLDLTGRKLRQ" misc_feature complement(1643805..1643849) /locus_tag="Rv1458c" /note="PS00211 ABC transporters family signature" misc_feature complement(1644108..1644131) /locus_tag="Rv1458c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" repeat_region complement(1644261..1644313) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(1644314..1644364) /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene complement(1644363..1646138) /locus_tag="Rv1459c" /db_xref="GeneID:886585" CDS complement(1644363..1646138) /locus_tag="Rv1459c" /function="UNKNOWN" /note="Rv1459c, (MTV007.06c), len: 591 aa. Possible conserved integral membrane protein, equivalent to MLCL536.30|Z99125 hypothetical protein from Mycobacterium leprae (593 aa), FASTA scores: opt: 1670, E(): 0, (78.6% identity in 585 aa overlap). Also similar to M. tuberculosis protein Rv2174|MTV021.07 (33.1% identity in 523 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_215975.1" /db_xref="GI:15608597" /db_xref="GeneID:886585" /translation="MAARHHTLSWSIASLHGDEQAVGAPLTTTELTALARTRLFGATG TVLMAIGALGAGARPVVQDPTFGVRLLNLPSRIQTVSLTMTTTGAVMMALAWLMLGRF TLGRRRMSRGKLDRTLLLWMLPLLIAPPMYSKDVYSYLAQSEIGRDGLDPYRVGPASG LGLGHVFTLSVPSLWRETPAPYGPLFLWIGRGISSLTGENIVAAVLCHRLVVLIGVTL IVWATPRLAQRCGVAEVSALWLGAANPLLIMHLVAGIHNEALMLGLMLTGVEFALRGL DMANTPRPSPETWRLGPATIRASRRPELGASPRAGASRAVKPRPEWGPLAMLLAGSIL ITLSSQVKLPSLLAMGFVTTVLAYRWGGNLRALLLAAAVMASLTLAIMAILGWASGLG FGWINTLGTANVVRSWMSPPTLLALGTGHVGILLGLGDHTTAVLSLTRAIGVLIITVM VCWLLLAVLRGRLHPIGGLGVALAVTVLLFPVVQPWYLLWAIIPLAAWATRPGFRVAA ILATLIVGIFGPTANGDRFALFQIVDATAASAIIVILLIALTYTRLPWRPLAAEQVVT AAESASKTPATRRPTAAPDAYADST" gene 1646186..1646992 /locus_tag="Rv1460" /db_xref="GeneID:886573" CDS 1646186..1646992 /locus_tag="Rv1460" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1460, (MTV007.07), len: 268 aa. Probable transcriptional regulatory protein. Equivalent to Z99125|MLCL536.29c hypothetical protein from Mycobacterium leprae (254 aa), FASTA scores: opt: 1273, E(): 0, (79.6% identity in 250 aa overlap). Possible helix-turn-helix motif between aa 68 - 89. Start changed since original submission. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215976.2" /db_xref="GI:57116868" /db_xref="GeneID:886573" /translation="MTSTTLPHRASLVDRSTEFCHTDVVKIPAVSTTVPAAVSDGHTR RAIVRLLLESGSITAGEIGDRLGLSAAGVRRHLDALIEAGDAEASAAAPWQQVGRGRP AKRYRLTAAGRAKLDHSYDDLASAAMRQLREIGGEEAVRTFARRRIDAILADVAPADG PDDAALEAAAERIATALSKAGYVATTTRVGGPIHGVQICQHHCPVSHVAEEFPELCET EQQAMAEVLGTHVQRLATIVNGDCACTTHVPLSPAPSPRPPATSTEGASR" gene 1646989..1649529 /locus_tag="Rv1461" /db_xref="GeneID:886609" CDS 1646989..1649529 /locus_tag="Rv1461" /function="UNKNOWN" /note="Rv1461, (MTV007.08), len: 846 aa. Conserved hypothetical protein. Equivalent of spliced protein from Mycobacterium leprae MLCL536.28c len: 869. Residues 1-253 represent N-extein, and 613-846 the C-extein. The intein present from residues 254 - 612 is different in sequence and site of the insertion from the one present in MLCL536.28c. FASTA scores: Z99125|MLCL536_23 Mycobacterium leprae cosmid L536 (869 aa), opt: 1498 E(): 0, (54.1% identity in 917 aa overlap). The mature protein is similar to Z99120|BSUB0017_150 hypothetical Bacillus subtilis protein (465 aa), FASTA scores: opt:1053, E(): 0, (34.8% identity in 821 aa overlap). The intein shows some similarity to inteins from U67548|MJU67548_6 Methanococcus jannaschii (895 aa), FASTA scores: opt: 181, E(): 0.00023, (25.2% identity in 274 aa overlap). TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215977.1" /db_xref="GI:15608599" /db_xref="GeneID:886609" /translation="MTLTPEASKSVAQPPTQAPLTQEEAIASLGRYGYGWADSDVAGA NAQRGLSEAVVRDISAKKNEPDWMLQSRLKALRIFDRKPIPKWGSNLDGIDFDNIKYF VRSTEKQAASWDDLPEDIRNTYDRLGIPEAEKQRLVAGVAAQYESEVVYHQIREDLEA QGVIFLDTDTGLREHPDIFKEYFGTVIPAGDNKFSALNTAVWSGGSFIYVPPGVHVDI PLQAYFRINTENMGQFERTLIIADEGSYVHYVEGCLPAGELITTADGDLRPIESIRVG DFVTGHDGRPHRVTAVQVRDLDGELFTFTPMSPANAFSVTAEHPLLAIPRDEVRVMRK ERNGWKAEVNSTKLRSAEPRWIAAKDVAEGDFLIYPKPKPIPHRTVLPLEFARLAGYY LAEGHACLTNGCESLIFSFHSDEFEYVEDVRQACKSLYEKSGSVLIEEHKHSARVTVY TKAGYAAMRDNVGIGSSNKKLSDLLMRQDETFLRELVDAYVNGDGNVTRRNGAVWKRV HTTSRLWAFQLQSILARLGHYATVELRRPGGPGVIMGRNVVRKDIYQVQWTEGGRGPK QARDCGDYFAVPIKKRAVREAHEPVYNLDVENPDSYLAYGFAVHNCTAPIYKSDSLHS AVVEIIVKPHARVRYTTIQNWSNNVYNLVTKRARAEAGATMEWIDGNIGSKVTMKYPA VWMTGEHAKGEVLSVAFAGEDQHQDTGAKMLHLAPNTSSNIVSKSVARGGGRTSYRGL VQVNKGAHGSRSSVKCDALLVDTVSRSDTYPYVDIREDDVTMGHEATVSKVSENQLFY LMSRGLTEDEAMAMVVRGFVEPIAKELPMEYALELNRLIELQMEGAVG" gene 1649526..1650719 /locus_tag="Rv1462" /db_xref="GeneID:886567" CDS 1649526..1650719 /locus_tag="Rv1462" /function="UNKNOWN" /note="Rv1462, (MTV007.09), len: 397 aa. Conserved hypothetical protein. Equivalent to MLCL536.27c|Z99125 hypothetical protein from Mycobacterium leprae (392 aa), FASTA scores: opt: 2059, E(): 0, (80.4% identity in 392 aa overlap). Also similar to nearby Mycobacterium tuberculosis hypothetical protein Rv1461. TBparse score is 0.873." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215978.1" /db_xref="GI:15608600" /db_xref="GeneID:886567" /translation="MTAPGLTAAVEGIAHNKGELFASFDVDAFEVPHGRDEIWRFTPL RRLRGLHDGSARATGSATITVSERPGVYTQTVRRGDPRLGEGGVPTDRVAAQAFSSFN SATLVTVERDTQVVEPVGITVTGPGEGAVAYGHLQVRIEELGEAVVVIDHRGGGTYAD NVEFVVDDAARLTAVWIADWADNTVHLSAHHARIGKDAVLRHVTVMLGGDVVRMSAGV RFCGAGGDAELLGLYFADDGQHLESRLLVDHAHPDCKSNVLYKGALQGDPASSLPDAH TVWVGDVLIRAQATGTDTFEVNRNLVLTDGARADSVPNLEIETGEIVGAGHASATGRF DDEQLFYLRSRGIPEAQARRLVVRGFFGEIIAKIAVPEVRERLTAAIEHELEITESTE KTTVS" gene 1650716..1651516 /locus_tag="Rv1463" /db_xref="GeneID:886571" CDS 1650716..1651516 /locus_tag="Rv1463" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv1463, (MTV007.10), len: 266 aa. Probable conserved ATP-binding protein ABC transporter, equivalent to Z99125|MLCL536.26c putative ABC transporter ATP-binding protein from Mycobacterium leprae (260 aa), FASTA scores: opt: 1444, E(): 0, (86.0% identity in 267 aa overlap). Very similar to U38804|PPU38804_55 ATP-DEPENDENT TRANSPORTER YCF16 from PORPHYRA PURPUREA chloroplast (251 aa), FASTA scores: opt: 822, E(): 0, (52.4% identity in 248 aa overlap); and similar to others. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.872." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="NP_215979.1" /db_xref="GI:15608601" /db_xref="GeneID:886571" /translation="MTILEIKDLHVSVENPAEADHEIPILRGVDLTVKSGETHALMGP NGSGKSTLSYAIAGHPKYHVTSGTITLDGADVLAMSIDERARAGLFLAMQYPVEVPGV SMSNFLRSAATAIRGEPPKLRHWVKEVKAAMAALDIDPAFAERSVNEGFSGGEKKRHE ILQLELLKPKIAILDETDSGLDVDALRVVSEGVNRYAESQHGGILLITHYTRILRYIH PEYVHVFVGGRIVESGGSELADELDQNGYVRFSPASGRYPHQPAPTGA" misc_feature 1650842..1650865 /locus_tag="Rv1463" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1651518..1652771 /gene="csd" /locus_tag="Rv1464" /db_xref="GeneID:886565" CDS 1651518..1652771 /gene="csd" /locus_tag="Rv1464" /EC_number="4.4.1.-" /function="CATALYZES THE REMOVAL OF ELEMENTAL SULFUR AND SELENIUM ATOMS FROM L-CYSTEINE, L-CYSTINE, L-SELENOCYSTEINE, AND L-SELENOCYSTINE TO PRODUCE L-ALANINE" /experiment="experimental evidence, no additional details recorded" /note="Rv1464, (MTV007.11), len: 417 aa. Probable csd, cysteine desulfurase (EC 4.4.1.- ). Equivalent to Q49690|MLCL536.25C cysteine desulfurase from Mycobacterium leprae (418 aa), FASTA scores: opt: 2333, E(): 0, (85.4% identity in 417 aa overlap); and similar to cysteine desulfurase from other organisms. Also similar to M. tuberculosis proteins Rv3025c|ISCS and Rv3778c. Contains PS00595 Aminotransferases class-V pyridoxal-phosphate attachment site. TBparse score is 0.881. BELONGS TO CLASS-V OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. CSD SUBFAMILY." /codon_start=1 /transl_table=11 /product="cysteine desulfurase" /protein_id="NP_215980.1" /db_xref="GI:15608602" /db_xref="GeneID:886565" /translation="MTASVNSLDLAAIRADFPILKRIMRGGNPLAYLDSGATSQRPLQ VLDAEREFLTASNGAVHRGAHQLMEEATDAYEQGRADIALFVGADTDELVFTKNATEA LNLVSYVLGDSRFERAVGPGDVIVTTELEHHANLIPWQELARRTGATLRWYGVTDDGR IDLDSLYLDDRVKVVAFTHHSNVTGVLTPVSELVSRAHQSGALTVLDACQSVPHQPVD LHELGVDFAAFSGHKMLGPNGIGVLYGRRELLAQMPPFLTGGSMIETVTMEGATYAPA PQRFEAGTPMTSQVVGLAAAARYLGAIGMAAVEAHERELVAAAIEGLSGIDGVRILGP TSMRDRGSPVAFVVEGVHAHDVGQVLDDGGVAVRVGHHCALPLHRRFGLAATARASFA VYNTADEVDRLVAGVRRSRHFFGRA" misc_feature 1652193..1652246 /gene="csd" /locus_tag="Rv1464" /note="PS00595 Aminotransferases class-V pyridoxal-phosphate attachment site" gene 1652768..1653256 /locus_tag="Rv1465" /db_xref="GeneID:886569" CDS 1652768..1653256 /locus_tag="Rv1465" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv1465, (MTV007.12), len: 162 aa. Possible nitrogen fixation related protein. Equivalent to Z99125|MLCL536.24c nitrogen fixation protein NIFU from Mycobacterium leprae (165 aa), FASTA scores: opt: 870, E(): 0, (81.8% identity in 165 aa overlap). Also similar to O32163|Z99120|NIFU_BACSU NifU-like protein from Bacillus subtilis (147 aa), FASTA scores: opt: 354, E(): 4.1e-17, (38.3% identity in 141 aa overlap) and to AL096839|SCC22.02 hypothetical protein from Streptomyces coelicolor (156 aa), FASTA scores: opt: 569, E(): 1.2e-31, (56.3% identity in 158 aa overlap). TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="nitrogen fixation related protein" /protein_id="NP_215981.1" /db_xref="GI:15608603" /db_xref="GeneID:886569" /translation="MTLRLEQIYQDVILDHYKHPQHRGLREPFGAQVYHVNPICGDEV TLRVALSEDGTRVTDVSYDGQGCSISQAATSVLTEQVIGQRVPRALNIVDAFTEMVSS RGTVPGDEDVLGDGVAFAGVAKYPARVKCALLGWMAFKDALAQASEAFEEVTDERNQR TG" gene 1653231..1653578 /locus_tag="Rv1466" /db_xref="GeneID:886561" CDS 1653231..1653578 /locus_tag="Rv1466" /function="UNKNOWN" /note="Rv1466, (MTV007.13), len: 115 aa. Conserved hypothetical protein. Equivalent to Z99125|MLCL536.23c hypothetical protein from Mycobacterium leprae (115 aa), FASTA scores: opt: 648, E(): 0, (81.7% identity in 115 aa overlap). Similar to ORF's downstream of sigma factors in Streptococcus mutans and Streptococcus pneumoniae e.g. O06451 ORF3 downstream of RpoD (SPDNAGCPO) (109 aa). Alternative TTG start possible at 13757 then avoids overlap with MTV007.12. TBparse score is 0.837." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215982.1" /db_xref="GI:15608604" /db_xref="GeneID:886561" /translation="MSETSAPAEELLADVEEAMRDVVDPELGINVVDLGLVYGLDVQD GDEGTVALIDMTLTSAACPLTDVIEDQSRSALVGSGLVDDIRINWVWNPPWGPDKITE DGREQLRALGFTV" gene complement(1653673..1655502) /gene="fadE15" /locus_tag="Rv1467c" /db_xref="GeneID:886562" CDS complement(1653673..1655502) /gene="fadE15" /locus_tag="Rv1467c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1467c, (MTV007.14c), len: 609 aa. Probable fadE15, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to NP_302639.1|NC_002677 acyl-CoA dehydrogenase from Mycobacterium leprae (611 aa). Also highly similar to many e.g. T36481 probable acyl-CoA dehydrogenase (fragment) from Streptomyces coelicolor (491 aa) (has its N-terminus very shorter); NP_384640.1|NC_003047 PUTATIVE ACYL-CoA DEHYDROGENASE PROTEIN from Sinorhizobium meliloti (598 aa); ACDS_MEGEL|Q06319 acyl-CoA dehydrogenase (short-chain specific) from Megasphaera elsdenii (383 aa), FASTA scores: E(): 2e-12, (25.4% identity in 410 aa overlap); etc. Also highly similar to fadE5|Rv0244c|MTV034.10c ACYL-CoA DEHYDROGENASE from Mycobacterium tuberculosis (611 aa); and similar to other proteins from Mycobacterium tuberculosis. TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE15" /protein_id="NP_215983.1" /db_xref="GI:15608605" /db_xref="GeneID:886562" /translation="MGHYIANVRDLEFNLLEVLDIGAVLGTGRYSDLDVDTVRTILAE AARLAEGPIAESFGYADRNPPVFDPNTHSISVPDELAKTVQAIKEAGWWRLGLAEEIG GMPAPPPLAWAVNEMIYCANPSACFFNLGPVLAQSLYIEGNDEQRRWAAEGVQRGWQA TMVLTEPDAGSDVGAGRTKAFEQPDGTWHIEGVKRFISGGDVGNTAENIFHLVLARPE GAGPGTKGLSLFYVPNYLFDPDTFELGARNGVYVTGLEHKMGLKSSPTCELTFGGADV PAVGYLVGGVHNGIAQMFTVIEHARMTIGVKSAGTLSTGYLNALAFAKERVQGADLTQ MTDKTAPRVTIMHHPDVRRSLMTQKAYAEGLRALYLYAAAHQDDAVAQRVSGADHDMA HRVDDLLLPIVKGVGSERAYEILTESLQTLGGSGFLVDYPLEQYIRDAKIDSLYEGTT AIQALDFFFRKIVRDHGKALQFVLAQVTHTVENIDPSLKPQAELLRTALDDITAMTGA LTGYLMSAAQHSSDIYKVGLGSVRYLLAVGDLLIGWRLLVLAGVAHAALADGPSQNDE AFYRGKIAVAAFFAKNMLPKLTGVRSVIENIDDDIMRVPEDAF" gene complement(1655609..1656721) /gene="PE_PGRS29" /locus_tag="Rv1468c" /db_xref="GeneID:886556" CDS complement(1655609..1656721) /gene="PE_PGRS29" /locus_tag="Rv1468c" /function="UNKNOWN" /note="Rv1468c, (MTV007.15c), len: 370 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). TBparse score is 0.856." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177814.1" /db_xref="GI:57116869" /db_xref="GeneID:886556" /translation="MSFVVANTEFVSGAAGNLARLGSMISAANSAAAAQTTAVAAAGA DEVSAAVAALFGAHGQTYQVLSAQAAAFHSQFVQALSGGAQAYAAAEATNFGPLQPLF DVINAPTLALLNRPLIGNGADGTAANPNGQAGGLLIGNGGNGFSPAAGPGGNGGAAGL LGHGGNGGVGALGANGGAGGTGGWLFGNGGAGGNSGGGGGAGGIGGSAVLFGAGGAGG ISPNGMGAGGSGGNGGLFFGNGGAGASSFLGGGGAGGRAFLFGDGGAGGAALSAGSAG RGGDAGFFYGNGGAGGSGAGGASSAHGGAGGQAGLFGNGGEGGDGGALGGNGGNGGNA QLIGNGGDGGDGGGAGAPGLGGRGGLLLGLPGANGT" gene 1656963..1658936 /gene="ctpD" /locus_tag="Rv1469" /db_xref="GeneID:886578" CDS 1656963..1658936 /gene="ctpD" /locus_tag="Rv1469" /EC_number="3.6.3.-" /function="CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A CATION (POSSIBLY CADMIUM) WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + CATION(IN) = ADP + PHOSPHATE + CATION(OUT)]." /note="Rv1469, (MTV007.16), len: 657 aa. Probable ctpD, cation-transporting P-type ATPase D (transmembrane protein) (EC 3.6.3.-), highly similar to others e.g. T35947 probable cation-transporting ATPase from Streptomyces coelicolor (638 aa); NP_442633.1|NC_000911 cation-transporting ATPase (E1-E2 ATPase) from Synechocystis sp. strain PCC 6803 (642 aa), FASTA scores: opt: 1438, E(): 0, (41.9% identity in 592 aa overlap); NP_389268.1|NC_000964 protein similar to heavy metal-transporting ATPase from Bacillus subtilis (637 aa); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Rv3743c|MTV025.091c|CTPJ (660 aa). Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="cation transporter P-type ATPase D" /protein_id="NP_215985.1" /db_xref="GI:15608607" /db_xref="GeneID:886578" /translation="MTLTACEVTAAEAPFDRVSKTIPHPLSWGAALWSVVSVRWATVA LLLFLAGLVAQLNGAPEAMWWTLYLACYLAGGWGSAWAGAQALRNKALDVDLLMIAAA VGAVAIGQIFDGALLIVIFATSGALDDIATRHTAESVKGLLDLAPDQAVVVQGDGSER VVAASELVVGDRVVVRPGDRIPADGAVLSGASDVDQRSITGESMPVAKARGDEVFAGT VNGSGVLHLVVTRDPSQTVVARIVELVADASATKAKTQLFIEKIEQRYSLGMVAATLA LIVIPLMFGADLRPVLLRAMTFMIVASPCAVVLATMPPLLSAIANAGRHGVLVKSAVV VERLADTSIVALDKTGTLTRGIPRLASVAPLDPNVVDARRLLQLAAAAEQSSEHPLGR AIVAEARRRGIAIPPAKDFRAVPGCGVHALVGNDFVEIASPQSYRGAPLAELAPLLSA GATAAIVLLDGVAIGVLGLTDQLRPDAVESVAAMAALTAAPPVLLTGDNGRAAWRVAR NAGITDVRAALLPEQKVEVVRNLQAGGHQVLLVGDGVNDAPAMAAARAAVAMGAGADL TLQTADGVTIRDELHTIPTIIGLARQARRVVTVNLAIAATFIAVLVLWDLFGQLPLPL GVVGHEGSTVLVALNGMRLLTNRSWRAAASAAR" misc_feature 1658001..1658021 /gene="ctpD" /locus_tag="Rv1469" /note="PS00154 E1-E2 ATPases phosphorylation site" gene 1658980..1659354 /gene="trxA" /locus_tag="Rv1470" /db_xref="GeneID:886558" CDS 1658980..1659354 /gene="trxA" /locus_tag="Rv1470" /function="THIOREDOXIN PARTICIPATES IN VARIOUS REDOX REACTIONS THROUGH THE REVERSIBLE OXIDATION OF ITS ACTIVE CENTER DITHIOL, TO A DISULFIDE, & CATALYZES DITHIOL-DISULFIDE EXCHANGE REACTIONS" /note="Rv1470, (MTV007.17), len: 124 aa. Probable trxA, thioredoxin (EC 1.-.-.-), similar to many e.g. P12243|THI1_SYNP7 THIOREDOXIN 1 from Synechococcus sp. (106 aa), FASTA scores: opt: 201, E(): 9.2e-08, (35.4% identity in 99 aa overlap); etc. Highly similar to downstream ORF Rv1471|trxB1 probable thioredoxin from Mycobacterium tuberculosis (123 aa), FASTA scores: opt: 402, E(): 0, (54.4% identity in 114 aa overlap). TBparse score is 0.925. Warning: note that Rv3914|MT4033|MTV028.05|trxC can be alternatively named trxA." /codon_start=1 /transl_table=11 /product="thioredoxin TRXA" /protein_id="NP_215986.1" /db_xref="GI:15608608" /db_xref="GeneID:886558" /translation="MTTRDLTAAYFQQTISANSNVLVYFWAPLCAPCDLFTPTYEASS RKHFDVVHGKVNIETEKDLASIAGVKLLPTLMAFKKGKLVFKQAGIANPAIMDNLVQQ LRAYTFKSPAGEGIGPGTKTSS" gene 1659370..1659741 /gene="trxB1" /locus_tag="Rv1471" /db_xref="GeneID:886554" CDS 1659370..1659741 /gene="trxB1" /locus_tag="Rv1471" /function="THIOREDOXIN PARTICIPATES IN VARIOUS REDOX REACTIONS THROUGH THE REVERSIBLE OXIDATION OF ITS ACTIVE CENTER DITHIOL, TO A DISULFIDE, & CATALYZES DITHIOL-DISULFIDE EXCHANGE REACTIONS." /experiment="experimental evidence, no additional details recorded" /note="Rv1471, (MTV007.18), len: 123 aa. Probable trxB1, thioredoxin (EC 1.-.-.-), similar to many bacterial thioredoxins e.g. P33636|THI2_ECOLI from Escherichia coli (139 aa), FASTA scores: opt: 290, E(): 1.8e-13, (44.3% identity in 97 aa overlap); etc. Highly similar to Rv1470|TrxA probable thioredoxin from Mycobacterium tuberculosis (124 aa), FASTA scores: opt: 402, E(): 1.2e-32, (54.4% identity in 114 aa overlap). Contains PS00194 Thioredoxin family active site. BELONGS TO THE THIOREDOXIN FAMILY. TBparse score is 0.882. Note that previously known as trxB.; trxB" /codon_start=1 /transl_table=11 /product="thioredoxin TRXB1" /protein_id="YP_177815.1" /db_xref="GI:57116870" /db_xref="GeneID:886554" /translation="MTTRDLTAAQFNETIQSSDMVLVDYWASWCGPCRAFAPTFAESS EKHPDVVHAKVDTEAERELAAAAQIRSIPTIMAFKNGKLLFNQAGALPPAALESLVQQ LKAYEVEAGEATTQNGRAQQA" misc_feature 1659433..1659489 /gene="trxB1" /locus_tag="Rv1471" /note="PS00194 Thioredoxin family active site" gene 1659763..1660620 /gene="echA12" /locus_tag="Rv1472" /db_xref="GeneID:886547" CDS 1659763..1660620 /gene="echA12" /locus_tag="Rv1472" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_215988.1" /db_xref="GI:15608610" /db_xref="GeneID:886547" /translation="MPHRCAAQVVAGYRSTVSLVLVEHPRPEIAQITLNRPERMNSMA FDVMVPLKEALAQVSYDNSVRVVVLTGAGRGFSPGADHKSAGVVPHVENLTRPTYALR SMELLDDVILMLRRLHQPVIAAVNGPAIGGGLCLALAADIRVASSSAYFRAAGINNGL TASELGLSYLLPRAIGSSRAFEIMLTGRDVSAEEAERIGLVSRQVPDEQLLDACYAIA ARMAGFSRPGIELTKRTLWSGLDAASLEAHMQAEGLGQLFVRLLTANFEEAVAARAEQ RAPVFTDDT" misc_feature 1660126..1660188 /gene="echA12" /locus_tag="Rv1472" /note="PS00166 Enoyl-CoA hydratase/isomerase signature" gene 1660656..1662284 /locus_tag="Rv1473" /db_xref="GeneID:886549" CDS 1660656..1662284 /locus_tag="Rv1473" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF MACROLIDE ACROSS THE MEMBRANE (EXPORT). MACROLIDE ANTIBIOTICS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1473, (MTV007.20), len: 542 aa. Possible macrolide-transport ATP-binding protein ABC transporter (see citation below), possibly in EF-3 subfamily. Similar to many ABC-transporters e.g. D90909_48|YHES_HAEIN from Synechocystis sp. strain PCC6803 (574 aa), FASTA scores: opt: 870, E(): 0, (33.3% identity in 525 aa overlap); P44808|YHES_HAEIN from Haemophilus influenzae (638 aa), FASTA scores: opt: 706, E(): 0, (33.7% identity in 517 aa overlap); etc. Contains two PS00017 ATP/GTP-binding site motif A (P-loop), and two PS00211 ABC transporter family signatures. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="macrolide ABC transporter ATP-binding protein" /protein_id="NP_215989.1" /db_xref="GI:15608611" /db_xref="GeneID:886549" /translation="MITATDLEVRAGARILLAPDGPDLRVQPGDRIGLVGRNGAGKTT TLRILAGEVEPYAGSVTRAGEIGYLPQDPKVGDLDVLARDRVLSARGLDVLLTDLEKQ QALMAEVADEDERDRAIRRYGQLEERFVALGGYGAESEAGRICASLGLPERVLTQRLR TLSGGQRRRVELARILFAASESGAGNSTTLLLDEPTNHLDADSLGWLRDFLRLHTGGL VVISHNVDLVADVVNKVWFLDAVRGQVDVYNMGWQRYVDARATDEQRRIRERANAERK AAALRAQAAKLGAKATKAVAAQNMLRRADRMMAALDEERVADKVARIKFPTPAACGRT PLVANGLGKTYGSLEVFTGVDLAIDRGSRVVILGLNGAGKTTLLRLLAGVEQPDTGVL EPGYGLRIGYFAQEHDTLDNDATVWENVRHAAPDAGEQDLRGLLGAFMFTGPQLEQPA GTLSGGEKTRLALAGLVASTANVLLLDEPTNNLDPASREQVLDALRSYRGAVVLVTHD PGAAAALGPQRVVLLPDGTEDYWSDEYRDLIELA" misc_feature 1660761..1660784 /locus_tag="Rv1473" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1661139..1661183 /locus_tag="Rv1473" /note="PS00211 ABC transporters family signature" misc_feature 1661757..1661780 /locus_tag="Rv1473" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1662012..1662056 /locus_tag="Rv1473" /note="PS00211 ABC transporters family signature" gene 1662381..1662572 /locus_tag="Rv1473A" /db_xref="GeneID:3205054" CDS 1662381..1662572 /locus_tag="Rv1473A" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1473A, len: 63 aa. Possible transcriptional regulator, CDS predicted by GC plot. Similar to SCI8.24c|AL132644_24 putative transcriptional regulator from Streptomyces coelicolor (73 aa), FASTA scores: opt: 210, E(): 1.5e-08, (56.15% identity in 57 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="YP_177644.1" /db_xref="GI:57116871" /db_xref="GeneID:3205054" /translation="MRKSKKTRDQLLRELRNAYEGGASIRNLAATTGRSYGSIHSMLR ESGTTMRGRGGPNRRSRPR" gene complement(1662641..1663204) /locus_tag="Rv1474c" /db_xref="GeneID:886543" CDS complement(1662641..1663204) /locus_tag="Rv1474c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1474c, (MTV007.21c), len: 187 aa. Probable transcription regulator, equivalent to AF0021|AF002133_1 transcriptional regulator from Mycobacterium avium strain GIR10 (82 aa), FASTA scores: opt: 490, E(): 6.7e-26, (92.5% identity in 80 aa overlap). Also similar to Q59431|UIDR_ECOLI UID OPERON REPRESSOR (GUS OPERON) from Escherichia coli (196 aa), FASTA scores: opt: 192, E(): 5.8e-06, (28.5% identity in 172 aa overlap). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Helix turn helix motif predicted at aa 33-54 (+3.40 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_215990.1" /db_xref="GI:15608612" /db_xref="GeneID:886543" /translation="MPKVSEDHLAARRRQILDGARRCFAEYGYDKATVRRLEQAIGMS RGAIFHHFRDKDALFFALAREDTERMAAVASREGLIGVMRDMLAAPDQFDWLATRLEI ARKLRNDPDFSRGWAERSAELAAATTDRLRRQKQANRVRDDVPSDVLRCYLDLVLDGL LARLASGEDPQRLAAVLDLVENSVRRS" gene complement(1663215..1666046) /gene="acn" /locus_tag="Rv1475c" /db_xref="GeneID:886545" CDS complement(1663215..1666046) /gene="acn" /locus_tag="Rv1475c" /EC_number="4.2.1.3" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY: Citrate = cis-aconitate + H2O]" /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the conversion of citrate to isocitrate" /codon_start=1 /transl_table=11 /product="aconitate hydratase" /protein_id="NP_215991.1" /db_xref="GI:15608613" /db_xref="GeneID:886545" /translation="MTSKSVNSFGAHDTLKVGEKSYQIYRLDAVPNTAKLPYSLKVLA ENLLRNEDGSNITKDHIEAIANWDPKAEPSIEIQYTPARVVMQDFTGVPCIVDLATMR EAIADLGGNPDKVNPLAPADLVIDHSVIADLFGRADAFERNVEIEYQRNGERYQFLRW GQGAFDDFKVVPPGTGIVHQVNIEYLASVVMTRDGVAYPDTCVGTDSHTTMVNGLGVL GWGVGGIEAEAAMLGQPVSMLIPRVVGFRLTGEIQPGVTATDVVLTVTEMLRQHGVVG KFVEFYGEGVAEVPLANRATLGNMSPEFGSTAAIFPIDEETIKYLRFTGRTPEQVALV EAYAKAQGMWHDPKHEPEFSEYLELNLSDVVPSIAGPKRPQDRIALAQAKSTFREQIY HYVGNGSPDSPHDPHSKLDEVVEETFPASDPGQLTFANDDVATDETVHSAAAHADGRV SNPVRVKSDELGEFVLDHGAVVIAAITSCTNTSNPEVMLGAALLARNAVEKGLTSKPW VKTTIAPGSQVVNDYYDRSGLWPYLEKLGFYLVGYGCTTCIGNSGPLPEEISKAVNDN DLSVTAVLSGNRNFEGRINPDVKMNYLASPPLVIAYALAGTMDFDFQTQPLGQDKDGK NVFLRDIWPSQQDVSDTIAAAINQEMFTRNYADVFKGDDRWRNLPTPSGNTFEWDPNS TYVRKPPYFEGMTAKPEPVGNISGARVLALLGDSVTTDHISPAGAIKPGTPAARYLDE HGVDRKDYNSFGSRRGNHEVMIRGTFANIRLRNQLLDDVSGGYTRDFTQPGGPQAFIY DAAQNYAAQHIPLVVFGGKEYGSGSSRDWAAKGTLLLGVRAVIAESFERIHRSNLIGM GVIPLQFPEGKSASSLGLDGTEVFDITGIDVLNDGKTPKTVCVQATKGDGATIEFDAV VRIDTPGEADYYRNGGILQYVLRNILKSG" gene 1666204..1666764 /locus_tag="Rv1476" /db_xref="GeneID:886539" CDS 1666204..1666764 /locus_tag="Rv1476" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1476, (MTV007.23), len: 186 aa. Possibly membrane protein, TMhelix 138-60. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215992.1" /db_xref="GI:15608614" /db_xref="GeneID:886539" /translation="MTGPYFPQTIPFLPSYIPQDVDMTAVKAEVAALGVSAPPAATPG LLEVVQHARDEGIDLKIVLLDHNPPNDTPLRDIATVVGADYSDATVLVLSPNYVGSYS TQYPRVTLEAGEDHSKTGNPVQSAQNFVHELSTPEFPWSALTIVLLIGVLAAAVGARL MQLRGRRSATSTDAAPGAGDDLNQGV" gene 1666990..1668408 /locus_tag="Rv1477" /db_xref="GeneID:886541" CDS 1666990..1668408 /locus_tag="Rv1477" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN VIRULENCE" /note="Rv1477, (MTV007.24), len: 472 aa. Hypothetical Invasion protein. Possibly exported protein with unusually long signal sequence. The last 277 residues are nearly identical to those of AF0060|AF006054_1 hypothetical invasion protein INV1 from Mycobacterium tuberculosis (277 aa), FASTA scores: opt: 1833, E(): 0, (98.2% identity in 277 aa overlap); also very similar to AF0021|AF002133_4 invasin 1 protein from Mycobacterium avium (273 aa), FASTA scores: opt: 1452, E(): 0, (78.1% identity in 279 aa overlap). Similar to Rv1566c|MTCY336.37|Z95586 Mycobacterium tuberculosis cosmid (230 aa), FASTA scores: opt: 528, E(): 4.4e-20, (52.0% identity in 150 aa overlap); and weakly similar to p60 proteins of Listeria spp throughout its length e.g. M80351|LISIAPB_1 Listeria monocytogenes iap-related protein (478 aa), FASTA scores: opt: 251, E(): 8e-06, (24.4% identity in 487 aa overlap). C-terminal domain highly similar to next orf Rv1478|MTV007.25." /codon_start=1 /transl_table=11 /product="invasion protein" /protein_id="NP_215993.1" /db_xref="GI:15608615" /db_xref="GeneID:886541" /translation="MRRNRRGSPARPAARFVRPAIPSALSVALLVCTPGLATADPQTD TIAALIADVAKANQRLQDLSDEVQAEQESVNKAMVDVETARDNAAAAEDDLEVSQRAV KDANAAIAAAQHRFDTFAAATYMNGPSVSYLSASSPDEIIATVTAAKTLSASSQAVMA NLQRARTERVNTESAARLAKQKADKAAADAKASQDAAVAALTETRRKFDEQREEVQRL AAERDAAQARLQAARLVAWSSEGGQGAPPFRMWDPGSGPAGGRAWDGLWDPTLPMIPS ANIPGDPIAVVNQVLGISATSAQVTANMGRKFLEQLGILQPTDTGITNAPAGSAQGRI PRVYGRQASEYVIRRGMSQIGVPYSWGGGNAAGPSKGIDSGAGTVGFDCSGLVLYSFA GVGIKLPHYSGSQYNLGRKIPSSQMRRGDVIFYGPNGSQHVTIYLGNGQMLEAPDVGL KVRVAPVRTAGMTPYVVRYIEY" gene 1668419..1669144 /locus_tag="Rv1478" /db_xref="GeneID:886535" CDS 1668419..1669144 /locus_tag="Rv1478" /function="UNKNOWN, BUT SUPPOSED INVOLVED IN VIRULENCE" /note="Rv1478, (MTV007.25), len: 241 aa. Hypothetical Invasion protein. Possibly exported protein, nearly identical to AF0060|AF006054_2 hypothetical invasion protein INV2 of Mycobacterium tuberculosis (240 aa), FASTA scores: opt: 1509, E(): 0, (95.0% identity in 241 aa overlap); very similar to AF0021|AF002133_5 hypothetical invasion protein INV2 from Mycobacterium avium (244 aa), FASTA scores: opt: 1269, E():0, (78.0% identity in 246 aa overlap). Also similar to Mycobacterium tuberculosis protein MTCY336.37 and weakly similar to C-terminal segment of p60 proteins of Listeria spp.e.g. Q01836|P60_LISIN PROTEIN P60 PRECURSOR (481 aa), FASTA scores: opt: 241, E():4e-07, (37.7% identity in 122 aa overlap). Highly similar to C-terminal domain of preceeding ORF Rv1477|MTV007.24 (472 aa), FASTA scores: opt: 864, E(): 0, (60.1% identity in 213 aa overlap). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="invasion protein" /protein_id="NP_215994.1" /db_xref="GI:15608616" /db_xref="GeneID:886535" /translation="MRHTRFHPIKLAWITAVVAGLMVGVATPADAEPGQWDPTLPALV SAGAPGDPLAVANASLQATAQATQTTLDLGRQFLGGLGINLGGPAASAPSAATTGASR IPRANARQAVEYVIRRAGSQMGVPYSWGGGSLQGPSKGVDSGANTVGFDCSGLVRYAF AGVGVLIPRFSGDQYNAGRHVPPAEAKRGDLIFYGPGGGQHVTLYLGNGQMLEASGSA GKVTVSPVRKAGMTPFVTRIIEY" gene 1669283..1670416 /gene="moxR1" /locus_tag="Rv1479" /db_xref="GeneID:886537" CDS 1669283..1670416 /gene="moxR1" /locus_tag="Rv1479" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1479, (MTV007.26), len: 377 aa. Probable moxR1, transcriptional regulatory protein, similar to X96434|BBGIDBMOX_2 moxR regulator from Borrelia burgdorferi (329 aa), FASTA scores: opt: 850, E():0, (43.5% identity in 317 aa overlap); and P. denitrificans. Highly similar to MoxR homologs of Mycobacterium tuberculosis and Mycobacterium avium (but these both differ at C-terminus) e.g. Rv3692, Rv3164c, and AF0021|AF002133_6 Mycobacterium avium strain GIR10 (309 aa), FASTA scores: opt: 1181, E(): 0, (83.7% identity in 227 aa overlap). Also similar to O33173|AF006054 MoxR fragment from Mycobacterium tuberculosis (211 aa), FASTA scores: opt: 1305, E(): 0, (94.3% identity in 212 aa overlap). TBparse score is 0.889. Note that previously known as moxR.; moxR" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein MOXR1" /protein_id="YP_177816.1" /db_xref="GI:57116872" /db_xref="GeneID:886537" /translation="MTSAGGFPAGAGGYQTPGGHSASPAHEAPPGGAEGLAAEVHTLE RAIFEVKRIIVGQDQLVERMLVGLLSKGHVLLEGVPGVAKTLAVETFARVVGGTFSRI QFTPDLVPTDIIGTRIYRQGREEFDTELGPVVANFLLADEINRAPAKVQSALLEVMQE RHVSIGGRTFPMPSPFLVMATQNPIEHEGVYPLPEAQRDRFLFKINVGYPSPEEEREI IYRMGVTPPQAKQILSTGDLLRLQEIAANNFVHHALVDYVVRVVFATRKPEQLGMNDV KSWVAFGASPRASLGIIAAARSLALVRGRDYVIPQDVIEVIPDVLRHRLVLTYDALAD EISPEIVINRVLQTVALPQVNAVPQQGHSVPPVMQAAAAASGR" gene 1670413..1671366 /locus_tag="Rv1480" /db_xref="GeneID:886531" CDS 1670413..1671366 /locus_tag="Rv1480" /function="UNKNOWN" /note="Rv1480, (MTV007.27,MTCY227.01), len: 317 aa. Conserved hypothetical protein, last 110 aa residues correspond to first 110 aa of YS01_MYCAV|O07394 hypothetical 18.7 kDa Mycobacterium avium protein MAV169 (169 aa), FASTA scores: opt: 642, E(): 0, (84.2% identity in 114 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv3163c and Rv3693. TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215996.1" /db_xref="GI:15608618" /db_xref="GeneID:886531" /translation="MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVL HGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTARTTHPHVRQMIADRELETWLVV DMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQH QHTMLRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAI AARHEVLAIEVLDPRDVELPDVGDVVLQDAESGVVREFSIDPALRDDFARAAAAHRAD VARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALAGHQ" gene 1671377..1672384 /locus_tag="Rv1481" /db_xref="GeneID:886533" CDS 1671377..1672384 /locus_tag="Rv1481" /function="UNKNOWN" /note="Rv1481, (MTCY277.02), len: 335 aa. Probable membrane protein, highly similar to YS02_MYCAV|O07395 hypothetical 36.1 kDa protein mav335 from Mycobacterium avium (335 aa), FASTA scores: opt: 1904, E(): 0, (89.0% identity in 337 aa overlap). Similar to AF116251|AF116251_1 BatA protein from Bacteroides fragilis (327 aa), FASTA scores: opt: 317, E(): 2e-12, (26.5% identity in 340 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215997.1" /db_xref="GI:15608619" /db_xref="GeneID:886533" /translation="MTLPLLGPMTLSGFAHSWFFLFLFVVAGLVALYILMQLARQRRM LRFANMELLESVAPKRPSRWRHVPAILLVLSLLLFTIAMAGPTHDVRIPRNRAVVMLV IDVSQSMRATDVEPSRMVAAQEAAKQFADELTPGINLGLIAYAGTATVLVSPTTNREA TKNALDKLQFADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLFSDGKETMPTN PDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGG NSYNAATLAELRAVYSSLQQQIGYETIKGDASVGWLRLGALALALAALAALLINRRLP T" gene complement(1672457..1673299) /locus_tag="Rv1482c" /db_xref="GeneID:886526" CDS complement(1672457..1673299) /locus_tag="Rv1482c" /function="UNKNOWN" /note="Rv1482c, (MTCY277.03c), len: 280 aa. Conserved hypothetical protein, highly similar to O07396|AF002133 Mycobacterium avium protein MAV346 (346 aa), FASTA scores: E(): 0, (65.2% identity in 342 aa overlap); slight similarity to GRPE_ECOLI|P09372 heat shock protein from E. coli (197 aa), FASTA scores: opt: 139, E(): 0.012, (28.3% identity in 159 aa overlap). Similar to Mycobacterium tuberculosis hypothetical proteins Rv3517, Rv3555c, Rv3714c, Rv1073, etc. Start changed since first submission (-59 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_215998.2" /db_xref="GI:57116873" /db_xref="GeneID:886526" /translation="MTDPFLGSEALAAGVLTPYELRSRYVALHKDVYVPQGVELTAQL RAKALWLRSRRRGVLAGYSASAFHGAKWIDADLPAAIIDTNRRRAPGLQVWEERIEPD EICVIEGMRVTTPERTALDLTSRFPLDPAVAAVDALIQATDLKVADVEPLIERYRGRR GMKAARAALDLVDGGAQSPKETWLRLLLIRAGFPRPQTQIAVRNEWGWAEAHLDMGWQ DIKVAAEYDGDHHLTSRYHYRKDILRHEKVQHRYGWIVVRVVAEDHPADIIRRVGEAR AFRA" gene 1673440..1674183 /gene="fabG1" /locus_tag="Rv1483" /db_xref="GeneID:886551" CDS 1673440..1674183 /gene="fabG1" /locus_tag="Rv1483" /EC_number="1.1.1.100" /function="INVOLVED IN THE FATTY ACID BIOSYNTHESIS PATHWAY (FIRST REDUCTION STEP) (MYCOLIC ACID BIOSYNTHESIS); REDUCES KASA/KASB PRODUCTS [CATALYTIC ACTIVITY: (3R)-3-hydroxyacyl-[acyl-carrier protein] + NADP+ = 3-oxoacyl-[acyl-carrier protein] + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="Rv1483, (MTCY277.04), len: 247 aa. fabG1 (alternate gene name: mabA), 3-oxoacyl-[acyl-carrier protein] reductase (EC 1.1.1.100) (see citations below), equivalent to O07399|FABG_MYCAV 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE from Mycobacterium avium (255 aa); P71534|FABG_MYCSM 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE from Mycobacterium smegmatis (255 aa); and NP_302228.1|NC_002677 3-oxoacyl-[ACP] reductase (aka MabA) from Mycobacterium leprae (253 aa). Also highly similar to many e.g. T36779 probable 3-oxacyl-(acyl-carrier-protein) reductase from Streptomyces coelicolor (234 aa); FABG_ECOLI|P25716|NP_415611.1|NC_000913 3-oxoacyl-[acyl-carrier-protein] reductase from Escherichia coli strain K12 (244 aa), FASTA scores: opt: 664, E(): 6.8e-35, (44.4% identity in 241 aa overlap); etc. Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY.; mabA" /codon_start=1 /transl_table=11 /product="3-oxoacyl-[acyl-carrier protein] reductase FabG1" /protein_id="NP_215999.1" /db_xref="GI:15608621" /db_xref="GeneID:886551" /translation="MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAV THRGSGAPKGLFGVECDVTDSDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRMTE EKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQANYAASKAGV IGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQGALQFIPAKRVGTPAEVA GVVSFLASEDASYISGAVIPVDGGMGMGH" misc_feature 1673857..1673943 /gene="fabG1" /locus_tag="Rv1483" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 1674202..1675011 /gene="inhA" /locus_tag="Rv1484" /db_xref="GeneID:886523" CDS 1674202..1675011 /gene="inhA" /locus_tag="Rv1484" /EC_number="1.3.1.10" /function="THIS ISOZYME IS INVOLVED IN MYCOLIC ACID BIOSYNTHESIS. SECOND REDUCTIVE STEP IN FATTY ACID BIOSYNTHESIS. INVOLVED IN THE RESISTANCE AGAINST THE ANTITUBERCULOSIS DRUGS ISONIAZID AND ETHIONAMIDE [CATALYTIC ACTIVITY: ACYL-[ACYL-CARRIER PROTEIN] + NAD(+) = TRANS-2,3-DEHYDROACYL-[ACYL-CARRIER PROTEIN] + NADH]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes a key regulatory step in fatty acid biosynthesis" /codon_start=1 /transl_table=11 /product="enoyl-(acyl carrier protein) reductase" /protein_id="NP_216000.1" /db_xref="GI:15608622" /db_xref="GeneID:886523" /translation="MTGLLDGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFDRL RLIQRITDRLPAKAPLLELDVQNEEHLASLAGRVTEAIGAGNKLDGVVHSIGFMPQTG MGINPFFDAPYADVSKGIHISAYSYASMAKALLPIMNPGGSIVGMDFDPSRAMPAYNW MTVAKSALESVNRFVAREAGKYGVRSNLVAAGPIRTLAMSAIVGGALGEEAGAQIQLL EEGWDQRAPIGWNMKDATPVAKTVCALLSDWLPATTGDIIYADGGAHTQLL" gene 1675017..1676051 /gene="hemH" /locus_tag="Rv1485" /db_xref="GeneID:886525" CDS 1675017..1676051 /gene="hemH" /locus_tag="Rv1485" /EC_number="4.99.1.1" /function="INVOLVED IN PROTOHEME BIOSYNTHESIS (LAST STEP). CATALYZES THE INSERTION OF FERROUS IRON INTO PROTOPORPHYRIN IX TO FORM PROTOHEME [CATALYTIC ACTIVITY : PROTOPORPHYRIN + FE(2+) = PROTOHEME + 2 H(+)]." /experiment="experimental evidence, no additional details recorded" /note="protoheme ferro-lyase; catalyzes the insertion of a ferrous ion into protoporphyrin IX to form protoheme; involved in protoheme biosynthesis; in some organisms this protein is membrane-associated while in others it is cytosolic" /codon_start=1 /transl_table=11 /product="ferrochelatase" /protein_id="NP_216001.1" /db_xref="GI:15608623" /db_xref="GeneID:886525" /translation="MQFDAVLLLSFGGPEGPEQVRPFLENVTRGRGVPAERLDAVAEH YLHFGGVSPINGINRTLIAELEAQQELPVYFGNRNWEPYVEDAVTAMRDNGVRRAAVF ATSAWSGYSSCTQYVEDIARARRAAGRDAPELVKLRPYFDHPLFVEMFADAITAAAAT VRGDARLVFTAHSIPTAADRRCGPNLYSRQVAYATRLVAAAAGYCDFDLAWQSRSGPP QVPWLEPDVTDQLTGLAGAGINAVIVCPIGFVADHIEVVWDLDHELRLQAEAAGIAYA RASTPNADPRFARLARGLIDELRYGRIPARVSGPDPVPGCLSSINGQPCRPPHCVASV SPARPSAGSP" gene complement(1676017..1676883) /locus_tag="Rv1486c" /db_xref="GeneID:886519" CDS complement(1676017..1676883) /locus_tag="Rv1486c" /function="UNKNOWN" /note="Rv1486c, (MTCY277.07c), len: 288 aa. Conserved hypothetical protein, highly similar to YS07_MYCAV|O07402 hypothetical 33.5 kDa protein mav321 from Mycobacterium avium (320 aa), FASTA scores: opt: 1217, E(): 0, (71.1% identity in 315 aa overlap). Weak similarity to AL079332|SCI5.07 hypothetical protein from Streptomyces coelicolor (259 aa), FASTA scores: opt: 131, E(): 0.29, (32.3% identity in 279 aa overlap). Start changed since original submission." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216002.2" /db_xref="GI:57116874" /db_xref="GeneID:886519" /translation="MWCPSVSLSIWANAWLAGKAAPDDVLDALSLWAPTQSVAAYDAV AAGHTGLPWPDVHDAGTVSLLQTLRAAVGRRRLRGTINVVLPVPGDVRGLAAGTQFEH DALAAGEAVIVANPEDPGSAVGLVPEFSYGDVDEAAQSEPLTPELCALSWMVYSLPGA PVLEHYELGDAEYALRSAVRSAAEALSTIGLGSSDVAKPRGLVEQLLESSRQHRVPDH APSRALRVLENAAHVDAIIAVSAGLSRLPIGTQSLSDAQRATDALRPLTAVVRSARMS AVTAILHSAWPD" gene 1676941..1677375 /locus_tag="Rv1487" /db_xref="GeneID:886521" CDS 1676941..1677375 /locus_tag="Rv1487" /function="UNKNOWN" /note="Rv1487, (MTCY277.08), len: 144 aa. Conserved membrane protein. Highly similar to O07404|AF002133 MAV145 from Mycobacterium avium (145 aa), FASTA scores: opt: 667, E(): 0, (72.5% identity in 142 aa overlap). Also similar to AL079332|SCI5.05 hypothetical protein from Streptomyces coelicolor (143 aa), FASTA scores: opt: 344, E(): 1.3e-15, (44.8% identity in 134 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216003.1" /db_xref="GI:15608625" /db_xref="GeneID:886521" /translation="MPVALIWLIAALVLVGAEALTGDMFLLMLGGGALAASVSSWLLA WPMWADGAVFLLVSVLLLVLVRPAVRRRLTQTKGVQLGIEALEGKKAVVLGRVARDGG QVKLDGQVWTARPLNDGDVFEPGDSVTVVQIDGATAVVFKDV" gene 1677397..1678542 /locus_tag="Rv1488" /db_xref="GeneID:886515" CDS 1677397..1678542 /locus_tag="Rv1488" /function="UNKNOWN" /note="Rv1488, (MTCY277.09), len: 381 aa. Possible exported conserved protein; contains possible N-terminal signal sequence. Similar to YBBK_ECOLI|P77367 hypothetical protein ybbK from Escherichia coli (305 aa), FASTA scores: opt: 716, E(): 0, (37.1% identity in 307 aa overlap). Similar to stomatin-like proteins e.g. AF065260|AF065260_1 Clostridium difficile (320 aa), FASTA scores: opt: 767, E(): 0, (42.3% identity in 307 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216004.1" /db_xref="GI:15608626" /db_xref="GeneID:886515" /translation="MQGAVAGLVFLAVLVIFAIIVVAKSVALIPQAEAAVIERLGRYS RTVSGQLTLLVPFIDRVRARVDLRERVVSFPPQPVITEDNLTLNIDTVVYFQVTVPQA AVYEISNYIVGVEQLTTTTLRNVVGGMTLEQTLTSRDQINAQLRGVLDEATGRWGLRV ARVELRSIDPPPSIQASMEKQMKADREKRAMILTAEGTREAAIKQAEGQKQAQILAAE GAKQAAILAAEADRQSRMLRAQGERAAAYLQAQGQAKAIEKTFAAIKAGRPTPEMLAY QYLQTLPEMARGDANKVWVVPSDFNAALQGFTRLLGKPGEDGVFRFEPSPVEDQPKHA ADGDDAEVAGWFSTDTDPSIARAVATAEAIARKPVEGSLGTPPRLTQ" gene 1678552..1678908 /locus_tag="Rv1489" /db_xref="GeneID:3205064" CDS 1678552..1678908 /locus_tag="Rv1489" /function="UNKNOWN" /note="Rv1489, len: 118 aa. Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium avium subsp. paratuberculosis and Streptomyces coelicolor e.g. AJ250017_1 insertion sequence IS900, Locus 3, putative invasion protein from M. paratuberculosis (138 aa), FASTA scores: opt: 120, E(): 0.26, (34.375% identity in 96 aa overlap); SCD6.11c|AL353815_11 possible integral membrane protein from Streptomyces coelicolor (136 aa), FASTA scores: opt: 106, E(): 2.2, (35.9% identity in 103 aa overlap). ORF predicted by GC plot. Replaces previous Rv1489c on other strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177645.1" /db_xref="GI:57116875" /db_xref="GeneID:3205064" /translation="MSGLTSPKTYAVLAALQAGDAVACAIPLPPIARLLDDLDVPVSV RPVLPVVKAASAVGLLSVTRFPALARLTTAMLTLYFILAVGAHVRVRDRVVNAIPAAS FLTLFALMTAKGPERT" gene 1678942..1679172 /locus_tag="Rv1489A" /db_xref="GeneID:3205065" CDS 1678942..1679172 /locus_tag="Rv1489A" /function="UNKNOWN" /note="Rv1489A, len: 76 aa. Conserved hypothetical protein, similar to part of alpha subunit of many methylmalonyl-CoA mutases ( 750 aa). Size difference suggests possible gene fragment although Mycobacterium tuberculosis has intact methylmalonyl-CoA mutase gene. P71774|MUTB_MYCTU PROBABLE METHYLMALONYL-CoA MUTASE from Mycobacterium tuberculosis (750 aa), FASTA scores: opt: 258, E(): 3.2e-10, (73.35% identity in 60 aa overlap). ORF predicted by GC plot." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177646.1" /db_xref="GI:57116876" /db_xref="GeneID:3205065" /translation="MSVGEVEVLKVENSRVRAEQLAKLYELRSSRDRVRVDAALAELS RAAAARGCAGTSGLGNNLMAPGPPHSLLGRDR" gene 1679322..1680629 /locus_tag="Rv1490" /db_xref="GeneID:886511" CDS 1679322..1680629 /locus_tag="Rv1490" /function="UNKNOWN" /note="Rv1490, (MTCY277.12), len: 435 aa. Probable membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216006.1" /db_xref="GI:15608628" /db_xref="GeneID:886511" /translation="MSQCFAVKGIGGADQATLGSAEILVKYAQLADKRARVYVLVSTW LVVWGIWHVYFVEAVFPNAILWLHYYAASYEFGFVRRGLGGELIRMLTGDHFFAGAYT VLWTSITVWLIALAVVVWLILSTGNRSERRIMLALLVPVLPFAFSYAIYNPHPELFGM TALVAFSIFLTRAHTSRTRVILSTLYGLTMAVLALIHEAIPLEFALGAVLAIIVLSKN ATGATRRICTALAIGPGTVSVLLLAVVGRRDIADQLCAHIPHGMVENPWAVATTPQRV LDYIFGRVESHADYHDWVCEHVTPWFNLDWITSAKLVAVVGFRALFGAFLLGLLFFVA TTSMIRYVSAVPVRTFFAELRGNLALPVLASALLVPLFITAVDWTRWWVMITLDVAIV YILYAIDRPEIEQPPSRRNVQVFVCVVLVLAVIPTGSANNIGR" gene complement(1681208..1681966) /locus_tag="Rv1491c" /db_xref="GeneID:886513" CDS complement(1681208..1681966) /locus_tag="Rv1491c" /function="UNKNOWN" /note="Rv1491c, (MTCY277.13c), len: 252 aa. Conserved membrane protein. Similar to hypothetical proteins from many organisms e.g. YDJZ_ECOLI|P76221 Escherichia coli (235 aa), FASTA scores: opt: 223, E():6.7 e-07, (31.7% identity in 145 aa overlap); AL133252|SCE46.15 Streptomyces coelicolor (249 aa), FASTA scores: opt: 378, E(): 1.5e-17, (39.1% identity in 169 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical protein Rv0625c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216007.1" /db_xref="GI:15608629" /db_xref="GeneID:886513" /translation="MTAPAICNTTETVHGIATSLGAVARQASLPRIVGTVVGITVLVV VALLVPVPTAVELRDWAKSLGAWFPLAFLLVHTVVTVPPFPRTAFTLAAGLLFGSVVG VFIAVVGSTASAVIAMLLVRATGWQLNSLVRRRAINRLDERLRERGWLAILSLRLIPV VPFAAINYAAGASGVRILSFAWATLAGLLPGTAAVVILGDAFAGSGSPLLILVSVCTG ALGLTGLVYEIRNYRRQHRRMPGYDDPVREPALI" gene 1682157..1684004 /gene="mutA" /locus_tag="Rv1492" /db_xref="GeneID:886507" CDS 1682157..1684004 /gene="mutA" /locus_tag="Rv1492" /EC_number="5.4.99.2" /function="INVOLVED IN PROPIONIC ACID FERMENTATION. CATALYZES THE ISOMERIZATION OF SUCCINYL-CoA TO METHYLMALONYL-CoA DURING SYNTHESIS OF PROPIONATE FROM TRICARBOXYLIC ACID-CYCLE INTERMEDIATES [CATALYTIC ACTIVITY: (R)-2-METHYL-3-OXOPROPANOYL-CoA = SUCCINYL- COA]" /note="Rv1492, (MTCY277.14), len: 615 aa. Probable mutA, Methylmalonyl-CoA mutase small-subunit (EC 5.4.99.2), strong similarity to e.g. MUTA_STRCM|Q05064 methylmalonyl-CoA mutase beta-subunit from Streptomyces cinnamonensis (616 aa), FASTA scores: opt: 1512, E(): 0, (45.9% identity in 628 aa overlap). Contains PS00213 Lipocalin signature, PS00544 Methylmalonyl-CoA mutase signature. BELONGS TO THE METHYLMALONYL-CoA MUTASE FAMILY." /codon_start=1 /transl_table=11 /product="methylmalonyl-CoA mutase small subunit" /protein_id="NP_216008.1" /db_xref="GI:15608630" /db_xref="GeneID:886507" /translation="MSIDVPERADLEQVRGRWRNAVAGVLSKSNRTDSAQLGDHPERL LDTQTADGFAIRALYTAFDELPEPPLPGQWPFVRGGDPLRDVHSGWKVAEAFPANGAT ADTNAAVLAALGEGVSALLIRVGESGVAPDRLTALLSGVYLNLAPVILDAGADYRPAC DVMLALVAQLDPGQRDTLSIDLGADPLTASLRDRPAPPIEEVVAVASRAAGERGLRAI TVDGPAFHNLGATAATELAATVAAAVAYLRVLTESGLVVSDALRQISFRLAADDDQFM TLAKMRALRQLWARVAEVVGDPGGGAAVVHAETSLPMMTQRDPWVNMLRCTLAAFGAG VGGADTVLVHPFDVAIPGGFPGTAAGFARRIARNTQLLLLEESHVGRVLDPAGGSWFV EELTDRLARRAWQRFQAIEARGGFVEAHDFLAGQIAECAARRADDIAHRRLAITGVNE YPNLGEPALPPGDPTSPVRRYAAGFEALRDRSDHHLARTGARPRVLLLPLGPLAEHNI RTTFATNLLASGGIEAIDPGTVDAGTVGNAVADAGSPSVAVICGTDARYRDEVADIVQ AARAAGVSRVYLAGPEKALGDAAHRPDEFLTAKINVVQALSNLLTRLGA" misc_feature 1682184..1682219 /gene="mutA" /locus_tag="Rv1492" /note="PS00213 Lipocalin signature" misc_feature 1683246..1683323 /gene="mutA" /locus_tag="Rv1492" /note="PS00544 Methylmalonyl-CoA mutase signature" gene 1684005..1686257 /gene="mutB" /locus_tag="Rv1493" /db_xref="GeneID:886509" CDS 1684005..1686257 /gene="mutB" /locus_tag="Rv1493" /EC_number="5.4.99.2" /function="INVOLVED IN PROPIONIC ACID FERMENTATION. CATALYZES THE ISOMERIZATION OF SUCCINYL-CoA TO METHYLMALONYL-CoA DURING SYNTHESIS OF PROPIONATE FROM TRICARBOXYLIC ACID-CYCLE INTERMEDIATES [CATALYTIC ACTIVITY : (R)-2-METHYL-3-OXOPROPANOYL-CoA = SUCCINYL- COA.]" /note="MDM; functions in conversion of succinate to propionate" /codon_start=1 /transl_table=11 /product="methylmalonyl-CoA mutase" /protein_id="NP_216009.1" /db_xref="GI:15608631" /db_xref="GeneID:886509" /translation="MTTKTPVIGSFAGVPLHSERAAQSPTEAAVHTHVAAAAAAHGYT PEQLVWHTPEGIDVTPVYIAADRAAAEAEGYPLHSFPGEPPFVRGPYPTMYVNQPWTI RQYAGFSTAADSNAFYRRNLAAGQKGLSVAFDLATHRGYDSDHPRVQGDVGMAGVAID SILDMRQLFDGIDLSTVSVSMTMNGAVLPILALYVVAAEEQGVAPEQLAGTIQNDILK EFMVRNTYIYPPKPSMRIISDIFAYTSAKMPKFNSISISGYHIQEAGATADLELAYTL ADGVDYIRAGLNAGLDIDSFAPRLSFFWGIGMNFFMEVAKLRAGRLLWSELVAQFAPK SAKSLSLRTHSQTSGWSLTAQDVFNNVARTCIEAMAATQGHTQSLHTNALDEALALPT DFSARIARNTQLVLQQESGTTRPIDPWGGSYYVEWLTHRLARRARAHIAEVAEHGGMA QAISDGIPKLRIEEAAARTQARIDSGQQPVVGVNKYQVPEDHEIEVLKVENSRVRAEQ LAKLQRLRAGRDEPAVRAALAELTRAAAEQGRAGADGLGNNLLALAIDAARAQATVGE ISEALEKVYGRHRAEIRTISGVYRDEVGKAPNIAAATELVEKFAEADGRRPRILIAKM GQDGHDRGQKVIATAFADIGFDVDVGSLFSTPEEVARQAADNDVHVIGVSSLAAGHLT LVPALRDALAQVGRPDIMIVVGGVIPPGDFDELYAAGATAIFPPGTVIADAAIDLLHR LAERLGYTLD" misc_feature 1685193..1685270 /gene="mutB" /locus_tag="Rv1493" /note="PS00544 Methylmalonyl-CoA mutase signature" gene 1686271..1686573 /locus_tag="Rv1494" /db_xref="GeneID:886502" CDS 1686271..1686573 /locus_tag="Rv1494" /function="UNKNOWN" /note="Rv1494, (MTCY277.16), len: 100 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216010.1" /db_xref="GI:15608632" /db_xref="GeneID:886502" /translation="MPFLVALSGIISGVRDHSMTVRLDQQTRQRLQDIVKGGYRSANA AIVDAINKRWEALHDEQLDAAYAAAIHDNPAYPYESEAERSAARARRNARQQRSAQ" gene 1686570..1686887 /locus_tag="Rv1495" /db_xref="GeneID:886504" CDS 1686570..1686887 /locus_tag="Rv1495" /function="UNKNOWN" /note="Rv1495, (MTCY277.17), len: 105 aa. Conserved hypothetical protein, some similarity to Rv1942c|MTCY09F9.22 hypothetical protein from Mycobacterium tuberculosis (109 aa) (0.7% identity in 101 aa overlap) and Rv0659c, Rv1102c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216011.1" /db_xref="GI:15608633" /db_xref="GeneID:886504" /translation="MNAPLRGQVYRCDLGYGAKPWLIVSNNARNRHTADVVAVRLTTT RRTIPTWVAMGPSDPLTGYVNADNIETLGKDELGDYLGEVTPATMNKINTALATALGL PWP" gene 1686884..1687888 /locus_tag="Rv1496" /db_xref="GeneID:886496" CDS 1686884..1687888 /locus_tag="Rv1496" /EC_number="2.7.-.-" /function="possibly involved in transport (possibly arginine)" /note="functions in transport of arginine/ornithine; inner membrane ATPase that cleaves ATP and phosphorylates two periplasmic proteins that function as two distinct transport systems, the AO (arginine and ornithine) and LAO (lysine, arginine, and ornithine) periplasmic binding proteins" /codon_start=1 /transl_table=11 /product="arginine/ornithine transport system ATPase" /protein_id="NP_216012.1" /db_xref="GI:15608634" /db_xref="GeneID:886496" /translation="MMAASHDDDTVDGLATAVRGGDRAALPRAITLVESTRPDHREQA QQLLLRLLPDSGNAHRVGITGVPGVGKSTAIEALGMHLIERGHRVAVLAVDPSSTRTG GSILGDKTRMARLAVHPNAYIRPSPTSGTLGGVTRATRETVVLLEAAGFDVILIETVG VGQSEVAVANMVDTFVLLTLARTGDQLQGIKKGVLELADIVVVNKADGEHHKEARLAA RELSAAIRLIYPREALWRPPVLTMSAVEGRGLAELWDTVERHRQVLTGAGEFDARRRD QQVDWTWQLVRDAVLDRVWSNPTVRKVRSELERRVRAGELTPALAAQQILEIANLTDR" gene 1687941..1689230 /gene="lipL" /locus_tag="Rv1497" /db_xref="GeneID:886575" CDS 1687941..1689230 /gene="lipL" /locus_tag="Rv1497" /EC_number="3.1.-.-" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID METABOLISM" /note="Rv1497, (MTCY277.19), len: 429 aa. Probable LipL, esterase (EC 3.1.-.-), very similar to Mycobacterium tuberculosis hypothetical esterases and penicillin binding proteins e.g. Rv1923, Rv2463, Rv3775, etc. Also similar to G151214|M68491 esterase estA from Pseudomonas sp (389 aa), FASTA scores: opt: 604, E(): 1e-31, (34.4% identity in 389 aa overlap)." /codon_start=1 /transl_table=11 /product="esterase LipL" /protein_id="NP_216013.1" /db_xref="GI:15608635" /db_xref="GeneID:886575" /translation="MMVDTGVDHRAVSSHDGPDAGRRVFGAADPRFACVVRAFASMFP GRRFGGGALAVYLDGQPVVDVWKGWADRAGWVPWSADSAPMVFSATKGMTATVIHRLA DRGLIDYEAPVAEYWPAFGANGKATLTVRDVMRHQAGLSGLRGATQQDLLDHVVMEER LAAAVPGRLLGKSAYHALTFGWLMSGLARAVTGKDMRLLFREELAEPLDTDGLHLGRP PADAPTRVAEIIMPQDIAANAVLTCAMRRLAHRFSGGFRSMYFPGAIAAVQGEAPLLD AEIPAANGVATARALARMYGAIANGGEIDGIRFLSRELVTGLTRNRRQVLPDRNLLVP LNFHLGYHGMPIGNVMPGFGHVGLGGSIGWTDPETGVAFALVHNRLLSPLVMTDHAGF VGIYHLIRQAAAQARKRGYQPVTPFGAPYSEPGAAAG" gene complement(1689303..1689920) /locus_tag="Rv1498c" /db_xref="GeneID:886503" CDS complement(1689303..1689920) /locus_tag="Rv1498c" /EC_number="2.1.1.-" /function="CAUSES METHYLATION" /experiment="experimental evidence, no additional details recorded" /note="Rv1498c, (MTCY277.20c), len: 205 aa. Probable methyltransferase (EC 2.1.1.-). Similar to G2792343|AF040571 METHYLTRANSFERASE from AMYCOLATOPSIS MEDITERRANEI (272 aa), FASTA scores: E(): 5.1e-11, (32.3% identity in 124 aa overlap). Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="NP_216014.1" /db_xref="GI:15608636" /db_xref="GeneID:886503" /translation="MLDVGCGSGRMALPLTGYLNSEGRYAGFDISQKAIAWCQEHITS AHPNFQFEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPPDVEHYL DEISRVLKPGGRCLCTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIHKKRPEEAIGL PETFVRDVYGKFGLAVHEPLHYGSWSGREPRLSFQDIVIATKTAS" misc_feature complement(1689516..1689539) /locus_tag="Rv1498c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(1690134..1690346) /locus_tag="Rv1498A" /db_xref="GeneID:3205040" CDS complement(1690134..1690346) /locus_tag="Rv1498A" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1498A, len: 70 aa. Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. from Streptomyces coelicolor, Sinorhizobium meliloti and Pseudomonas aeruginosa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177647.1" /db_xref="GI:57116877" /db_xref="GeneID:3205040" /translation="MSNHTYRVIEIVGTSPDGVDAAIQGGLARAAQTMRALDWFEVQS IRGHLVDGAVAHFQVTMKVGFRLEDS" gene 1690407..1690805 /locus_tag="Rv1499" /db_xref="GeneID:886494" CDS 1690407..1690805 /locus_tag="Rv1499" /function="UNKNOWN" /note="Rv1499, (MTCY277.21), len: 132 aa. Hypothetical unknown protein; was initially longer but has been shortened (-24 aa) owing to overlap with Rv1498A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216015.2" /db_xref="GI:57116878" /db_xref="GeneID:886494" /translation="MPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIA TFDQKRPAVGVDEHDPGGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKRPA LRPTKAAATTAATTWIERVQNRRGRHSALV" gene 1690850..1691878 /locus_tag="Rv1500" /db_xref="GeneID:886492" CDS 1690850..1691878 /locus_tag="Rv1500" /EC_number="2.-.-.-" /function="UNKNOWN" /note="Rv1500, (MTCY277.22), len: 342 aa. Probable glycosyltransferase (EC 2.-.-.- ), hydrophobic domain near C-terminus. Some similarity to putative glycosyl-transferases from Bacillus subtilis e.g. O34319|YKCC_BACSU (323 aa), opt: 490, E(): 6.1e-25, (28.85% identity in 312 aa overlap) and to N-acetyl glucosamine transferases. Also similar to G1001347 hypothetical 36.7 kDa protein (318 aa), FASTA scores: opt: 523, E(): 7.2e-26, (30.6% identity in 307 aa overlap)." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="NP_216016.1" /db_xref="GI:15608638" /db_xref="GeneID:886492" /translation="MRLSIVTTMYMSEPYVLEFYRRARAAADKITPDVEIIFVDDGSP DAALQQAVSLLDSDPCVRVIQLSRNFGHHKAMMTGLAHATGDLVFLIDSDLEEDPALL EPFYEKLISTGADVVFGCHARRPGGWLRNFGPKIHYRASALLCDPPLHENTLTVRLMT ADYVRSLVQHQERELSIAGLWQITGFYQVPMSVNKAWKGTTTYTFRRKVATLVDNVTS FSNKPLVFIFYLGAAIFIISSSAAGYLIIDRIFFRALQAGWASVIVSIWMLGGVTIFC IGLVGIYVSKVFIETKQRPYTIIRRIYGSDLTTREPSSLKTAFPAAHLSNGKRVTSEP EGLATGNR" gene 1691890..1692711 /locus_tag="Rv1501" /db_xref="GeneID:886499" CDS 1691890..1692711 /locus_tag="Rv1501" /function="UNKNOWN" /note="Rv1501, (MTCY277.23), len: 273 aa. Conserved hypothetical protein, some similarity to O06374|Rv3633|MTCY15C10.19C hypothetical protein from Mycobacterium tuberculosis, FASTA scores: E(): 3.9e-10, (27.5% identity in 280 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216017.1" /db_xref="GI:15608639" /db_xref="GeneID:886499" /translation="MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMY RVQERILTEIGKERLARAGELGVLRLMMKYDPHFFTFLEIPEVLSIVDRVLSETAILH LQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFTRDTGATLVVP GSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAGRNTSGKDRLAINHQFTRS FFKQQIDYVRALGDAVVLEQPARTQQLLGWYSRVVTNLDEYYQPPDKRLYRKGQG" gene 1692924..1693823 /locus_tag="Rv1502" /db_xref="GeneID:886486" CDS 1692924..1693823 /locus_tag="Rv1502" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1502, (MTCY277.24), len: 299 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216018.1" /db_xref="GI:15608640" /db_xref="GeneID:886486" /translation="MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRD GQNRSSIGSVIVDLAVGGKILDIPAEPILRPGARGMFDDCGVSIGSIVRAGDTRLLYY TGWNLAVTVPWKNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYR MWYGSNLGWGEGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDA GVYRMWFCARGAKYRIYCATSEDGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRG QRFMLYSGDGYGRTGFGLAVLEN" gene complement(1693996..1694544) /locus_tag="Rv1503c" /db_xref="GeneID:886488" CDS complement(1693996..>1694544) /locus_tag="Rv1503c" /function="UNKNOWN" /note="Rv1503c, (MTCY277.25c), len: 182 aa. Conserved hypothetical protein, similar to C-terminal region of P27833|RFFA_ECOLI LIPOPOLYSACCHARIDE BIOSYNTHESIS PROTEIN from Escherichia coli (376 aa), FASTA scores: opt: 565, E(): 0, (49.4% identity in 170 aa overlap); Rv1503c and Rv1504c are both similar to RFFA_ECOLI but are separated by a stop codon, sequence appears to be correct so possible pseudogene." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216019.1" /db_xref="GI:15608641" /db_xref="GeneID:886488" /translation="DFLLRAEILREKGTNRSRFLRNEVDKYTWQDKGSSYLPSELVAA FLWAQFEEAERITRIRLDLWNRYHESFESLEQRGLLRRPIIPQGCSHNAHMYYVLLAP SADREEVLARLTSEGIGAVFHYVPLHDSPAGRRYGRTNGNLTVTNDVASRLIRLPMWV GLQEVDQSRVVEALTRILTLRA" gene complement(1694545..1695144) /locus_tag="Rv1504c" /db_xref="GeneID:886481" CDS complement(1694545..1695144) /locus_tag="Rv1504c" /function="UNKNOWN" /note="Rv1504c, (MTCY277.26c), len: 199 aa. Conserved hypothetical protein, similar to N-terminal region of P27833|RFFA_ECOLI LIPOPOLYSACCHARIDE BIOSYNTHESIS PROTEIN from Escherichia coli (376 aa), FASTA scores: opt: 863, E(): 0, (68.0% identity in 194 aa overlap); Rv1503c and Rv1504c are similar to RFFA_ECOLI but are separated by a stop codon, sequence appears to be correct so possible pseudogene." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216020.1" /db_xref="GI:15608642" /db_xref="GeneID:886481" /translation="MSDHKVPFNRPYMTGRELAYIAEAHSCGHLAGDGPFTRRSHAWL EQQTGCRKALLTPSCTAALEMMALLLDIEEGDEVILPSYTFVSTANAFVLRGGVPVFV DIRPDTLNIDETRIVDAITPRTKAIVPVHYAGVACEMDAIMKIATHHNLAVVEDAAQG AMASYRGRALGSIGDLGALSFHETKNVISGEGGALLVNS" gene complement(1695281..1695946) /locus_tag="Rv1505c" /db_xref="GeneID:886483" CDS complement(1695281..1695946) /locus_tag="Rv1505c" /function="UNKNOWN" /note="Rv1505c, (MTCY277.27c), len: 221 aa. Conserved hypothetical protein, some similarity to hypothetical proteins and glycosylases e.g. P71063|O08181 HYPOTHETICAL 22.5 kDa PROTEIN YVFD from Bacillus subtilis (216 aa), FASTA scores: E(): 2.4e-08, (25.5% identity in 196 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216021.1" /db_xref="GI:15608643" /db_xref="GeneID:886483" /translation="MTKPLVIFGSGDIAQLAHYYFTRDSEYEVVAFTVDRDYASVSEF CGLPLVAFDEVAQRFPPESHAMFVALAYAKLNGVRKEKYLAAKALGYELASYVSSHAT VLNDGRIGENVFLLEDNTIQPFVSIGNNVTLWSGNHIGHHSTIHDHCFLASHIVVSGG VVIEEQSFIGVNATLRDHITIGSRCVVGAGALLLGDADADGVYIGTKTERRPVPSTEL RKI" gene complement(1695943..1696443) /locus_tag="Rv1506c" /db_xref="GeneID:886479" CDS complement(1695943..1696443) /locus_tag="Rv1506c" /function="UNKNOWN" /note="Rv1506c, (MTCY277.28c), len: 166 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216022.1" /db_xref="GI:15608644" /db_xref="GeneID:886479" /translation="MRIVNAADPFSINDLGCGYGALLDYLDARGFKTDYTGIDVSPEM VRAAALRFEGRANADFICAARIDREADYSVASGIFNVRLKSLDTEWCAHIEATLDMLN AASRRGFSFNCLTSYSDASKMRDDLYYADPCALFDLCKRRYSKSVALLHDYGLYEFTI LVRKAS" gene complement(1696727..1697422) /locus_tag="Rv1507c" /db_xref="GeneID:886477" CDS complement(1696727..1697422) /locus_tag="Rv1507c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1507c, (MTCY277.29c), len: 231 aa. Conserved hypothetical protein. Similar to AJ007747|BBR007747_6 Hypothetical protein BbLPS1.06 from Bordetella bronchiseptica cosmid (239 aa), FASTA scores: opt: 362, E(): 1.3e-17, (30.8% identity in 221 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216023.1" /db_xref="GI:15608645" /db_xref="GeneID:886477" /translation="MKKVAIVQSNYIPWRGYFDLIAFVDEFIIYDDMQYTKRDWRNRN RIKTSQGLQWITVPVQVKGRFHQKIRETLIDGTDWAKAHWRALEFNYSAAAHFAEIAD WLAPIYLEEQHTNLSLLNRRLLNAICSYLGISTRLANSWDYELADGKTERLANLCQQA AATEYVSGPSARSYVDERVFDELSIRVTWFDYDGYRDYKQLWGGFEPAVSILDLLFNV GAEAPDYLRYCRQ" gene 1697356..1697859 /locus_tag="Rv1507A" /db_xref="GeneID:3205095" CDS 1697356..1697859 /locus_tag="Rv1507A" /function="UNKNOWN" /note="Rv1507A, len: 167 aa. Hypothetical unknow protein. Shows weak similarity with C-terminus of Q9XHQ7|CDA9 CYTIDINE DEAMINASE 9 from Arabidopsis thaliana (Mouse-ear cress) (298 aa), FASTA scores: opt: 104, E(): 4.2, (33.6% identity in 133 aa overlap), BLASTP scores: Score: 77, Identities: 39/133 (29%), Positives: 62/133 (46%)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177648.1" /db_xref="GI:57116879" /db_xref="GeneID:3205095" /translation="MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEF KRFCDIFNMVLGKARMGRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGRSV SGFVLMIKSASVHEIDSWSSPSVAMSIGVALCSYPHYAAARTSPPNRDWGEDTTRSRP VTGLLAG" gene complement(1698095..1699894) /locus_tag="Rv1508c" /db_xref="GeneID:886078" CDS complement(1698095..1699894) /locus_tag="Rv1508c" /function="UNKNOWN" /note="Rv1508c, (MTCY277.30c), len: 599 aa. Probable membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216024.1" /db_xref="GI:15608646" /db_xref="GeneID:886078" /translation="MIPVMSARFTGFPLLPVALRHGITSGRGCGFILDVGAQRPFGND VLLSVATRKIRSRLPGDRVGNHGALLPFRAEPRRIQMKRPPEVLRGAVTASRERLWAI GSQSERTLMLGTILLASVISAATAYALSQWYAVDVFSTLLVVPGDCWLDWGMNIGRHC FSDYAMVAAAGIQPNPADYLISLPADYQPTAVAAWAPARIPYAIFGLPSHWLGAPRLG LICYLVALTMAVISPAIWAARGARGLERVVIFVTLGAAAIPAWGVIDRGNSTGFVVPI ALAYFVALSRQRWGLATITVILAVLVKPQFVVLGVVLLAARQWRWAGIGITGVVVSNI AAFLLWPRGFPGTIAQSIHGIIKFNSSFGGLRDPRNVSFGKALLLIPDSIKNYQSGKI PEGFLTGPRTQIGFAVLVIVVVAVLALGRRIPPVMVGIVLLATATFSPADVAFYYLVF VLPIAALVARDPNGPPGAGIFDQLAAHGDRRRAVGVCVSLAVALSIVNVAVPGQPFYV PLYGQLGAKGVVGTTPLVFTTVTWAPFLWLVTCVVIIVSYARKPARPHDSHNGPTRES DQDTAASTTSCLPNPVEESSPRGPGPICQNYTP" gene 1699866..1700228 /locus_tag="Rv1508A" /db_xref="GeneID:3205096" CDS 1699866..1700228 /locus_tag="Rv1508A" /function="UNKNOWN" /note="Rv1508A, len: 120 aa. Conserved hypothetical protein, highly similar to central part of glycosyl transferases from various mycobacteria and eubacteria e.g. P71790|MTCY277.33|Rv1511 Hypothetical protein from M. tuberculosis (340 aa), FASTA scores: opt: 210, E(): 2.5 e-09, (42.9% identity in 105 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177649.1" /db_xref="GI:57116880" /db_xref="GeneID:3205096" /translation="MKRALITGITGPDGSYLAKLPLKGYVAAGSPAEVYFCWATRNYR ELYGLLAVNSIWFNHESPRHGETFMTRNPAPYRGRQRGADRCADADAPAHPDRYQYWG VPASVRGVIDRAMGVCVE" gene 1700212..1701093 /locus_tag="Rv1509" /db_xref="GeneID:886475" CDS 1700212..1701093 /locus_tag="Rv1509" /function="UNKNOWN" /note="Rv1509, (MTCY277.31), len: 298 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216025.1" /db_xref="GI:15608647" /db_xref="GeneID:886475" /translation="MFALSNNLNRVNACMDGFLARIRSHVDAHAPELRSLFDTMAAEA RFARDWLSEDLARLPVGAALLEVGGGVLLLSCQLAAEGFDITAIEPTGEGFGKFRQLG DIVLELAAARPTIAPCKAEDFISEKRFDFAFSLNVMEHIDLPDEAVRRVSEVLKPGAS YHFLCPNYVFPYEPHFNIPTFFTKELTCRVMRHRIEGNTGMDDPKGVWRSLNWITVPK VKRFAAKDATLTLRFHRAMLVWMLERALTDKEFAGRRAQWMVAAIRSAVKLRVHHLAG YVPATLQPIMDVRLTKR" gene 1701295..1702593 /locus_tag="Rv1510" /db_xref="GeneID:886466" CDS 1701295..1702593 /locus_tag="Rv1510" /function="UNKNOWN" /note="Rv1510, (MTCY277.32), len: 432 aa. Probable membrane protein. Highly similar to Rv3630|MTCY15C10.22 (431 aa), FASTA scores: E(): 0, (70.8% identity in 424 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216026.1" /db_xref="GI:15608648" /db_xref="GeneID:886466" /translation="MYERRHERGMCDRAVEMTDVGATAAPTGPIARGSVARVGAATAL AVACVYTVIYLAARDLPPACFSIFAVFWGALGIATGATHGLLQETTREVRWVRSTQIV AGHRTHPLRVAGMIGTVAAVVIAGSSPLWSRQLFVEGRWLSVGLLSVGVAGFCAQATL LGALAGVDRWTQYGSLMVTDAVIRLAVAAAAVVIGWGLAGYLWAATAGAVAWLLMLMA SPTARSAASLLTPGGIATFVRGAAHSITAAGASAILVMGFPVLLKVTSDQLGAKGGAV ILAVTLTRAPLLVPLSAMQGNLIAHFVDRRTQRLRALIAPALVVGGIGAVGMLAAGLT GPWLLRVGFGPDYQTGGALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYLLGWVSATV ASTLLLLLPMPLETRTVIALLFGPTVGIAIHVAALARRPD" gene 1703074..1704096 /gene="gmdA" /locus_tag="Rv1511" /db_xref="GeneID:886529" CDS 1703074..1704096 /gene="gmdA" /locus_tag="Rv1511" /EC_number="4.2.1.47" /function="unknown, probably involved in nucleotide-sugar metabolism" /experiment="experimental evidence, no additional details recorded" /note="Rv1511, (MTCY277.33), len: 340 aa. Probable gmdA, GDP-D-mannose dehydratase (EC 4.2.1.47), equivalent to AF125999|AF125999_13 Mycobacterium avium enzyme (343 aa), FASTA scores: opt: 2085, E(): 0, (89.1% identity in 338 aa overlap); similar to G755218 PSEUDOMONAS AERUGINOSA GDP-D-MANNOSE DEHYDRATASE (GCA) (323 aa), FASTA scores: opt: 1073, E(): 0, (51.9% identity in 320 aa overlap); and to S74433 GDP-D-mannose dehydratase rfbD - Syn (362 aa), FASTA scores: opt: 1405, E(): 0, (63.9% identity in 327 aa overlap)." /codon_start=1 /transl_table=11 /product="GDP-D-mannose dehydratase gmdA (GDP-mannose 4,6 dehydratase) (GMD)" /protein_id="NP_216027.1" /db_xref="GI:15608649" /db_xref="GeneID:886529" /translation="MKRALITGITGQDGSYLAELLLAKGYEVHGLIRRASTFNTSRID HLYVDPHQPGARLFLHYGDLIDGTRLVTLLSTIEPDEVYNLAAQSHVRVSFDEPVHTG DTTGMGSMRLLEAVRLSRVHCRFYQASSSEMFGASPPPQNELTPFYPRSPYGAAKVYS YWATRNYREAYGLFAVNGILFNHESPRRGETFVTRKITRAVARIKAGIQSEVYMGNLD AVRDWGYAPEYVEGMWRMLQTDEPDDFVLATGRGFTVREFARAAFEHAGLDWQQYVKF DQRYLRPTEVDSLIGDATKAAELLGWRASVHTDELARIMVDADMAALECEGKPWIDKP MIAGRT" gene 1704093..1705061 /gene="epiA" /locus_tag="Rv1512" /db_xref="GeneID:886461" CDS 1704093..1705061 /gene="epiA" /locus_tag="Rv1512" /function="unknown, probably involved in nucleotide-sugar metabolism" /experiment="experimental evidence, no additional details recorded" /note="Rv1512, (MTCY277.34), len: 322 aa. Probable epiA, NUCLEOTIDE SUGAR EPIMERASE, equivalent to AJ223832|MAS223832_4 from Mycobacterium avium silvaticum (339 aa), FASTA scores: opt: 1821, E(): 0, (84.6% identity in 318 aa overlap); and similar to WCAG_ECOLI|P32055 colanic acid biosynthesis protein wcaG (321 aa), FASTA scores: opt: 835, E(): 0, (53.5% identity in 316 aa overlap)." /codon_start=1 /transl_table=11 /product="nucleotide-sugar epimerase epiA" /protein_id="NP_216028.1" /db_xref="GI:15608650" /db_xref="GeneID:886461" /translation="MNAHTSVGPLDRAARVYIAGHRGLVGSALLRTFAGAGFTNLLVR SRAELDLTDRAATFDFVLESRPQVVIDAAARVGGILANDTYPADFLSENLQIQVNLLD AAVAARVPRLLFLGSSCIYPKLAPQPIPESALLTGPLEPTNDAYAIAKIAGILAVQAV RRQHGLPWISAMPTNLYGPGDNFSPSGSHLLPALIRRYDEAKASGAPNVTNWGTGTPR RELLHVDDLASACLYLLEHFDGPTHVNVGTGIDHTIGEIAEMVASAVGYSGETRWDPS KPDGTPRKLLDVSVLREAGWRPSIALRDGIEATVAWYREHAGTVRQ" gene 1705058..1705789 /locus_tag="Rv1513" /db_xref="GeneID:886464" CDS 1705058..1705789 /locus_tag="Rv1513" /function="UNKNOWN" /note="Rv1513, (MTCY277.35), len: 243 aa. Conserved hypothetical protein, similar to hypothetical proteins from several organisms e.g. AJ223833|MAP223833_3 from Mycobacterium avium paratuberculosis (240 aa), FASTA scores: opt: 1053 E(): 0, (66.3% identity in 243 aa overlap); P74191|SLL1173 from Synechocystis (244 aa), FASTA scores: opt: 276, E(): 1.1e-07, (32.2 % identity in 202 aa overlap). Also highly similar to P95136|Q50460|MTCY349.33c|Rv2956 from Mycobacterium tuberculosis (243 aa), (70.0% identity in 237 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216029.1" /db_xref="GI:15608651" /db_xref="GeneID:886464" /translation="MRLARRARNILRRNGIEVSRYFAELDWERNFLRQLQSHRVSAVL DVGANSGQYARGLRGAGFAGRIVSFEPLPGPFAVLQRSASTDPLWECRRCALGDVDGT ISINVAGNEGASSSVLPMLKRHQDAFPPANYVGAQRVPIHRLDSVAADVLRPNDIAFL KIDVQGFEKQVIAGGDSTVHDRCVGMQLELSFQPLYEGGMLIREALDLVDSLGFTLSG LQPGFTDPRNGRMLQADGIFFRGSD" gene complement(1705807..1706595) /locus_tag="Rv1514c" /db_xref="GeneID:886457" CDS complement(1705807..1706595) /locus_tag="Rv1514c" /function="UNKNOWN" /note="Rv1514c, (MTCY277.36c), len: 262 aa. Conserved hypothetical protein. Similar to other hypothetical proteins, and to WCAE_ECOLI|P71239 putative colanic acid biosynthesis glycosyl transferase (248 aa), FASTA scores: opt: 231, E(): 4.1e-08, (33.3% identity in 210 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical glycosyltransferase, Rv2957." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216030.1" /db_xref="GI:15608652" /db_xref="GeneID:886457" /translation="MTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGG SGDDVVAYLSGCEPGFAYWQSEPDGGRYDAMNQGIAHASGDLLWFLHSADRFSGPDVV AQAVEALSGKGPVSELWGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSS LVAKIGGYDLDFGIAADQEFILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDL RRMGDLHRRYPFGGRRISHAYLRGREFYAYNSRFWENVFTRMSK" gene complement(1706630..1707526) /locus_tag="Rv1515c" /db_xref="GeneID:886459" CDS complement(1706630..1707526) /locus_tag="Rv1515c" /function="UNKNOWN" /note="Rv1515c, (MTCY277.37c), len: 298 aa. Conserved hypothetical protein, similar to P71805|MTCY02B12.11C|Rv1377c Hypothetical protein from Mycobacterium tuberculosis, FASTA scores: E(): 1.3e-05, (25.4% identity in 134 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216031.1" /db_xref="GI:15608653" /db_xref="GeneID:886459" /translation="MSTNPGPAEGANQVMAQEHSAGAVQFTAHNVRLDDGTLTIPESS RTLDESSWFISARGILETVFPGDKSHLRLADVGCLEGGYAVGFARMGFQVLGIEVREL NMAACNYIKSKTNLPNLRFVHDNALNIANHGLFDTVFCCGLFYHLENPKQYLETLSSV TNKLLILQTHFSIINRSDKWLRLPTTARQLTDRLLRRPAPVKFMLSAPTEHEGLPGRW FTEFSDDRSFGQRDTAKWASWDNRRSFWIQREHLLQAIKDVGVDLVMEEYDNLEPSIA ESLLGGSYAANLRGTFIGIKTR" gene complement(1707529..1708539) /locus_tag="Rv1516c" /db_xref="GeneID:886455" CDS complement(1707529..1708539) /locus_tag="Rv1516c" /EC_number="2.-.-.-" /function="unknown; involved in cellular metabolism." /note="Rv1516c, (MTCY277.38c), len: 336 aa. Probable sugar transferase (EC 2.-.-.-), similar to AB010970|AB010970_6 glycosyltransferase from Streptococcus mutans (465 aa), FASTA scores: opt: 388, E(): 4.1e-18, (32.7% identity in 214 aa overlap), slight similarity to SPSA_BACSU|P39621 spore coat polysaccharide biosynthesis (256 aa), fasta scores: opt: 185, E(): 6.5e-05, (26.2% identity in 187 aa overlap), strong similarity to Rv1520|MTCY19G5.08c probable sugar transferase from Mycobacterium tuberculosis (63.5% identity in 318 aa overlap)." /codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="NP_216032.2" /db_xref="GI:57116881" /db_xref="GeneID:886455" /translation="MSPQLCPKVSIVSTTHNQAGYARQAFDSFLDQQTDFPVEIIVAD DASTDATPAIIREYAERYPHVFRPIFRTENLGLNGNLTGALSAARGEYVALCEADDYW IDPLKLSKQVAFLDRHPKTTVCFHPVRVIWEDGHAKDSKFPPVRVRGNLSLDALILMN FIQTNSAVYRRLERYDDIPADVMPLDWYLHVRHAVHGDIAMLPDTMAVYRRHAQGMWH NQVVDPPKFWLTQGPGHAATFDAMLDLFPGDPAREELIAVMADWILRQIANVPGPEGR AALQETIARHPRIAMLALQHRGATPARRLKTQWRKLAAATPSRRGLVDVWPSRLRRGC RA" gene 1708871..1709635 /locus_tag="Rv1517" /db_xref="GeneID:886467" CDS 1708871..1709635 /locus_tag="Rv1517" /function="UNKNOWN" /note="Rv1517, (MTCY277.39), len: 254 aa. Conserved hypothetical transmembrane protein, similar to G466802|LEPB1170_F2_64 from Mycobacterium leprae (230 aa), FASTA scores: opt: 282, E(): 2.2e-11, (34.1% identity in 255 aa overlap). Also similar to Mycobacterium tuberculosis Rv3821|MTCY409.09c (237 aa) (36.3% identity in 256 aa overlap); and Rv3481c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216033.1" /db_xref="GI:15608655" /db_xref="GeneID:886467" /translation="MWTMVLLLGLGMAIDPARLGLAVVMLSRRRPMLNLFAFWVGGMV AGVGIALAVLVFMRDVALAAIQGVVSAANEFREAVGILAGGRLHIVIGVIMLLLAARM VARARAQVGVPVGPVGVADGGMSALALAQRPPGLVARLEVRTQQMLQGDVVWPAFVVG VASSAPPFESVVALTVIMASGAEIGTQLGAFVVFTLLVLAVIEIPLVAYLAIPQQTQQ VMLRFQDWVRSNRRQISLTILIGVGFLFLYQGVTSL" gene 1709644..1710603 /locus_tag="Rv1518" /db_xref="GeneID:886451" CDS 1709644..1710603 /locus_tag="Rv1518" /function="unknown, possibly glycosyl transferase" /note="Rv1518, (MTCY277.40, MTCY19G5.11c), len: 319 aa. Conserved hypothetical protein, possibly glycosyl transferase involved in exopolysaccharide synthesis, similar to several hypothetical proteins and glycosyl transferases from diverse organisms e.g. P73996|D90911 from SYNECHO CYSTIS sp. (309 aa), Fasta scores: opt: 300, E(): 1.8e-13, (29.5% identity in 241 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216034.1" /db_xref="GI:15608656" /db_xref="GeneID:886451" /translation="MVPGDASSVVSVNPAKPLISVCIPMYNNGATIERCLRSILEQEG VEFEIVVVDDDSSDDCAAIAATMLRPGDRLLRNEPRLGLNRNHNKCLEVARGGLIQFV HGDDRLLPGALQTLSRRFEDPSVGMAFAPRRVESDDIKWQQRYGRVHTRFRKLRDRNH GPSLVLQMVLHGAKENWIGEPTAVMFRRQLALDAGGFRTDIYQLVDVDFWLRLMLRSA VCFVPHELSVRRHTAATETTRVMATRRNVLDRQRILTWLIVDPLSPNSVRSAAALWWI PAWLAMIVEVAVLGPQRRTHLKALAPAPFREFAHARRQLPMAD" gene 1710733..1711002 /locus_tag="Rv1519" /db_xref="GeneID:886453" CDS 1710733..1711002 /locus_tag="Rv1519" /function="UNKNOWN" /note="Rv1519, (MTCY19G5.09c), len: 89 aa. Conserved hypothetical protein, high similarity to C-terminus of Q50723|MTCY78.26|Rv3402c (412 aa) (58.1% identity in 74 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216035.1" /db_xref="GI:15608657" /db_xref="GeneID:886453" /translation="MRCGCLACDGVLCANGPGRPRRPALTCTAVATRTLHSLATNAEL VESADLTVTEDICSRIVSLPVHDHMAIADVARVVAPFGEGLARGG" gene 1711028..1712068 /locus_tag="Rv1520" /db_xref="GeneID:886447" CDS 1711028..1712068 /locus_tag="Rv1520" /EC_number="2.-.-.-" /function="unknown; thought to be involved in cellular metabolism." /note="Rv1520, (MTCY19G5.08c), len: 346 aa. Probable sugar transferase (EC 2.-.-.-), similar to several e.g. AB010970|AB010970_6 Streptococcus mutans glycosyltransferase (465 aa), FASTA scores: opt: 381, E(): 1.2e-18, (31.7% identity in 240 aa overlap); O34234|Y07786 SUGAR TRANSFERASE from Vibrio cholerae (337 aa), FASTA scores: opt: 214, E(): 8.4e-05, (25.9% identity in 212 aa overlap). Also strongly similar to Mycobacterium tuberculosis probable sugar transferase Rv1516c." /codon_start=1 /transl_table=11 /product="sugar transferase" /protein_id="NP_216036.1" /db_xref="GI:15608658" /db_xref="GeneID:886447" /translation="MSIVSISYNQEEYIREALDGFAAQRTEFPVEVIIADDASTDATP RIIGEYAARYPQLFRPILRQTNIGVHANFKDVLSAARGEYLALCEGDDYWTDPLKLSK QVKYLDRHPETTVCFHPVRVIYEDGAKDSEFPPLSWRRDLSVDALLARNFIQTNSVVY RRQPSYDDIPANVMPIDWYLHVRHAVGGEIAMLPETMAVYRRHAHGIWHSAYTDRRKF WETRGHGMAATLEAMLDLVHGHREREAIVGEVSAWVLREIGKTPGRQGRALLLKSIAD HPRMTMLSLQHRWAQTPWRRFKRRLSTELSSLAALAYATRRRALEGRDGGYRETTSPP TGRGRNVRGSHA" gene 1712302..1714053 /gene="fadD25" /locus_tag="Rv1521" /db_xref="GeneID:886448" CDS 1712302..1714053 /gene="fadD25" /locus_tag="Rv1521" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_216037.1" /db_xref="GI:15608659" /db_xref="GeneID:886448" /translation="MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWS QLYRRTLNLAAQLREHGSTGDRALILAPQSLDYVVSFIASLQAGIVAVPLSIPQGGAH DERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLDLDARPSSGSR SAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIMTSYYGVYGKVAPPGSTVV SWLPFYHDMGFVLGLILPILAGIPAVLTSPIGFLQRPARWIQMLASNTLAFTAAPNFA FDLASRKTKDEDMEGLDLGGVHGILNGSERVQPVTLKRFIDRFAPFNLDPKAIRPSYG MAEATVYVATRKAGQPPKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVD PDTGIERPAGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGD SGFLSEGELFIMGRIKDLLIVYGRNHSPDDIEATIQTISPGRCAAIAVSEHGAEKLVA IIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVLVSPGSIPITTSGKIRR AQCVELYRQDEFTRLDA" gene complement(1714172..1717612) /gene="mmpL12" /locus_tag="Rv1522c" /db_xref="GeneID:886445" CDS complement(1714172..1717612) /gene="mmpL12" /locus_tag="Rv1522c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv1522c, (MTCY19G5.06), len: 1146 aa. Probable mmpL12, conserved transmembrane transport protein (see Tekaia et al., 1999), member of RND superfamily. Strong similarity to many Mycobacterial membrane proteins e.g. Q49619|G466786 putative transport protein B1170_C1_181 from Mycobacterium leprae (1008 aa), FASTA scores: opt: 2418, E(): 0, (51.0% identity in 1006 aa overlap); etc. Also highly similar to MmpL8|MTCY48.08c|Rv3823c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis, FASTA score: (34.3% identity in 376 aa overlap); and some similarity to MmpL10|MTCY20G9|Rv1183 PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN, FASTA score: (27.2% identity in 1011 aa overlap). BELONGS TO THE MMPL FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL12" /protein_id="NP_216038.1" /db_xref="GI:15608660" /db_xref="GeneID:886445" /translation="MARHDEAKAGGLFDRIGNFVVRWPLIVIGCWIAVAAALTLLLPT LQAQAAKREQAPLPPGAPSMVLQKEMSAAFQEKIETSALLLVLLTNENGLGPADEAVY RKLIENLRADTQDKISVQDFLAVPEMKELLASKDNKAWNLPITFAGDAASPETQAAFK RVAAIVKQTVAGTSLTVHLSGPIATVADLTELGEKDVRIIEIGTAVSVLIILILVYRN LVTMLVPLATIGASVVTAQGTLSGLAEFGLAVNMQAIVFMSAVMIGAGTDYAVFLISR YHDYVRHGEKSDMAVKKALMSIGKVITASAATVAVTFLAMVFTKLEVFSAVGPAIAVA ITVSLLGAVTLLPAILTLTGRRGWIKPRRDLTSRMWRRSGVRIVRRSTIHLVGSLIVL VALAGCTLLIRFNYDDLKTVPQHVESVKGYEAMNRHFPMNAMTPMVLFIKSPRDLRTP GALADIEMMSREIAELPNIVMVRGLTRPNGEPLKETKVSFQAGEVGGKLDEATTLLEE HGGELDQLTGGAHQLADALAQIRNEINGAVASSSGIVNTLQAMMDLMGGDKTIRQLEN ASQYVGRMRALGDNLSGTVTDAEQIATWASPMVNALNSSPVCNSDPACRTSRAQLAAI VQAQDDGLLRSIRALAVTLQQTQEYQTLARTVSTLDGQLKQVVSTLKAVDGLPTKLAQ MQQGANALADGSAALAAGVQELVDQVKKMGSGLNEAADFLLGIKRDADKPSMAGFNIP PQIFSRDEFKKGAQIFLSADGHAARYFVQSALNPATTEAMDQVNDILRVADSARPNTE LEDATIGLAGVPTALRDIRDYYNSDMKFIVIATIVIVFLILVILLRALVAPIYLIGSV LISYLSALGIGTLVFQLILGQEMHWSLPGLSFILLVAIGADYNMLLISRIRDESPHGI RIGVIRTVGSTGGVITSAGLIFAASMFGLVGASINTMAQAGFTIGIGIVLDTFLVRTV TVPALTTMIGRANWWPSELGRDPSTPPTKADRWLRRVKGHRRKAPIPAPKPPHTKVVR NTNGHASKAATKSVPNGKPADLAEGNGEYLIDHLRRHSLPLFGYAAMPAYDVVDGVSK PNGDGAHIGKEPVDHLLGHSLPLFGLAGLPSYDRWDDTSIGEPAVGHAGSKPDAKLST" gene 1717653..1718696 /locus_tag="Rv1523" /db_xref="GeneID:886489" CDS 1717653..1718696 /locus_tag="Rv1523" /EC_number="2.1.1.-" /function="causes methylation" /note="Rv1523, (MTCY19G5.05c), len: 347 aa (start uncertain). Probable methyltransferase (EC 2.1.1.-), similar to G560513|U0002O Mycobacterium leprae (270 aa), FASTA scores: opt: 965, E(): 0, (60.3% identity in 247 aa overlap). Also similar to many e.g. Q54303|X86780 METHYLTRANSFERASE RAPM from Streptomyces hygroscopicus (317 aa), FASTA scores: opt: 323, E(): 1e-15, (41.2% identity in 136 aa overlap). And similar to M. tuberculosis hypothetical proteins Rv2952, Rv1405c, Rv1403c, Rv0839." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="NP_216039.1" /db_xref="GI:15608661" /db_xref="GeneID:886489" /translation="MTITALTVTLPLLWRRLTTAGVKYADQGHFVGSAGVPAADAGGR DAASEQIARWTQTCTVVLVCGHGPAKWAFRSWCTSRSCDTLPVALRYRLQSNPLVGKL TTKYFLPLGTRQVGDHVVFFNFGYEEDPPMALPLSESDEPNRYCIQLYHQTASQVDLT GKEVLEVSCGAGGGASYIARNLGPASYTGLDLNPASIDLCRAKHRLPGLQFVQGDAQN LPFPDESFDAVVNVEASHQYPDFRGFLAEVARVLRPGGHFLYTDSRRNPVVAEWEAAL ADAPLRTISQRDIGAQAKRGLDANTARSQEAIGRRAPVLLAGLTRCAVRVLDWDLRRG GGFSYRIYLFAKD" gene 1718726..1719970 /locus_tag="Rv1524" /db_xref="GeneID:885814" CDS 1718726..1719970 /locus_tag="Rv1524" /EC_number="2.4.1.-" /function="UNKNOWN" /note="Rv1524, (MTCY19G5.04c), len: 414 aa. Probable glycosyltransferase (EC 2.4.1.-), similar to many e.g. P96559|U84349 GLYCOSYLTRANSFERASE GTFB from Amycolatopsis orientalis (407 aa), FASTA scores: opt: 363, E(): 6.2e-23, (28.8% identity in 430 aa overlap); also high similarity to Rv1526c|MTCY19G5.02 Mycobacterium tuberculosis hypothetical protein (58.7% identity in 416 aa overlap); and AF143772|AF143772_15 glycosyltransferase gtfB from Mycobacterium avium strain 215 (418 aa), FASTA scores: opt: 1801, E(): 0, (65.2% identity in 417 aa overlap)." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="NP_216040.1" /db_xref="GI:15608662" /db_xref="GeneID:885814" /translation="MKFVVASYGTRGDIEPCAAVGLELQRRGHDVCLAVPPNLIGFVE TAGLSAVAYGSRDSQEQLDEQFLHNAWKLQNPIKLLREAMAPVTEGWAELSAMLTPVA AGADLLLTGQIYQEVVANVAEHHGIPLAALHFYPVRANGEIAFPARLPAPLVRSTITA IDWLYWRMTKGVEDAQRRELGLPKASTPAPRRMAVRGSLEIQAYDALCFPGLAAEWGG RRPFVGALTMESATDADDEVASWIAADTPPIYFGFGSMPIGSLADRVAMISAACAELG ERALICSGPSDATGIPQFDHVKVVRVVSHAAVFPTCRAVVHHGGAGTTAAGLRAGIPT LILWVTSDQPIWAAQIKQLKVGRGRRFSSATKESLIADLRTILAPDYVTRAREIASRM TKPAASVTATADLLEDAARRAR" gene 1720017..1720802 /gene="wbbL2" /locus_tag="Rv1525" /db_xref="GeneID:886470" CDS 1720017..1720802 /gene="wbbL2" /locus_tag="Rv1525" /EC_number="2.-.-.-" /function="POSSIBLY INVOLVED IN CELL WALL ARABINOGALACTAN LINKER FORMATION: USES DTDP-L-RHAMNOSE AS SUBSTRATE TO INSERT THE RHAMNOSYL RESIDUE INTO THE CELL WALL." /note="Rv1525, (MT1576, MTCY19G5.03c), len: 261 aa. Possible wbbL2, rhamnosyl transferase (EC 2.-.-.-) (see citation below), showing weak similarity to several rhamnosyl transferases. Similar to AF105060|AF105060_1 Riftia pachyptila endosymbiont (746 aa), FASTA scores: opt: 183, E(): 0.00013, (35.2% identity in 105 aa overlap)." /codon_start=1 /transl_table=11 /product="rhamnosyl transferase WbbL2" /protein_id="NP_216041.1" /db_xref="GI:15608663" /db_xref="GeneID:886470" /translation="MYAPLVSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDY PRIGTERVSTPGENLGWAGGSELGFRLAFAEGYSHAMTLNNDTRVSKGFVAALLDSRL PADAGMVGPMFDVGFPFAVADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGG MDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGAS AAMIRGLRRTHGWPAAMGILREMGMAHHRKWHKSFPLTCPASC" gene complement(1720780..1722060) /locus_tag="Rv1526c" /db_xref="GeneID:886434" CDS complement(1720780..1722060) /locus_tag="Rv1526c" /EC_number="2.4.1.-" /function="unknown; thought to be involved in cellular metabolism." /note="Rv1526c, (MTCY19G5.02), len: 426 aa. Probable glycosyltransferase (EC 2.4.1.-), highly similar to G467196 Protein L518_C2_147 from Mycobacterium leprae (421 aa), FASTA scores, opt: 1497, E(): 0, (55.0% identity in 424 aa overlap); similar to G452504 rhamnosyltransferase (24.7% identity in 433 aa overlap); and P96565|U84350 GLYCOSYLTRANSFERASE GTFE from Amycolatopsis orientalis (408 aa), E(): 3.4e-24, (28.4% identity in 429 aa overlap), also high similarity to Rv1524|MTCY19G5.04c (58.7 % identity in 416 aa overlap)." /codon_start=1 /transl_table=11 /product="glycosyltransferase" /protein_id="NP_216042.1" /db_xref="GI:15608664" /db_xref="GeneID:886434" /translation="MKFVLAVHGTRGDVEPCAAVGVELRRRGHAVHMAVPPNLIEFVE SAGLTGVAYGPDSDEQINTVAAFVRNLTRAQNPLNLARAVKELFVEGWAEMGTTLTTL ADGADLVMTGQTYHGVAANVAEYYDIPAAALHHFPMQVNGQIAIPSIPTPATLVRATM KVSWRLYAYVSKDADRAQRRELGLPPAPAPAVRRLAERGAPEIQAYDPVFFPGLAAEW SDRRPFVGPLTMELHSEPNEELESWIAAGTPPIYFGFGSTPVQTPVQTLAMISDVCAQ LGERALIYSPAANSTRIRHADHVKRVGLVNYSTILPKCRAVVHHGGAGTTAAGLRAGM PTLILWDVADQPIWAGAVQRLKVGSAKRFTNITRGSLLKELRSILAPECAARAREIST RMTRPTAAVTAAADLLEATARQTPGSTPSSSPGR" gene complement(1722083..1728409) /gene="pks5" /locus_tag="Rv1527c" /db_xref="GeneID:886442" CDS complement(1722083..1728409) /gene="pks5" /locus_tag="Rv1527c" /function="Involved in polyketide metabolism." /note="Rv1527c, (MTV045.01c-MTCY19G5.01), len: 2108 aa. Probable pks5, polyketide synthase, highly similar to many e.g. MCAS_MYCBO|Q02251 mycocerosic acid synthase from Mycobacterium bovis (2110 aa), FASTA scores: opt: 6270, E(): 0, (63.6% identity in 2126 aa overlap)." /codon_start=1 /transl_table=11 /product="polyketide synthase pks5" /protein_id="NP_216043.1" /db_xref="GI:15608665" /db_xref="GeneID:886442" /translation="MGKERTKTVDRTRVTPVAVIGMGCRLPGGIDSPDRLWEALLRGD DLVTEIPADRWDIDEYYDPEPGVPGRTDCKWGAYLDNVGDFDPEFFGIGEKEAIAIDP QHRLLLETSWEAMEHGGLTPNQMASRTGVFVGLVHTDYILVHADNQTFEGPYGNTGTN ACFASGRVAYAMGLQGPAITVDTACSSGLTAIHLACRSLHDGESDIALAGGVYVMLEP RRFASGSALGMLSATGRCHAFDVSADGFVSGEGCVMLALKRLPDALADGDRILAVIRG TAANQDGHTVNIATPSRSAQVAAYREALDVAGVDPATVGMVEAHGPGTPVGDPIEYAS LAEVYGNDGPCALASVKTNFGHTQSAAGALGLMKAVLALQHGVVPQNLHFTALPDKLA AIETNLFVPQEITPWPGADQETPRRAAVSSYGMTGTNVHAIVEQAPVPAPESGAPGDT PATPGIDGALLFALSASSQDALRQTAARLADWVDAQGPELAPADLAYTLARRRGHRPV RTAVLAATTAELTEALREVATGEPPYPPAVGQDDRGPVWVFSGQGSQWAGMGADLLAT EPVFAATIAAIEPLIAAESGFSVTEAMTAPEVVTGIDRVQPTLFAMQVALAATMKSYG VAPGAVIGHSLGESAAAVVAGALCLEDGVRVICRRSALMTRIAGAGAMASVELPAQQV LSELMARGVNDAVVAVVASPQSTVIGGATQTVRDLVAAWEQRDVLAREVAVDVASHSP QVDPILDELAEALAEISPLQPEIPYYSATSFDPREEPYCDAYYWVDNLRHTVRFAAAV QAALEDGYRVFTELTPHPLLTHAVDQTARSLDMSAAALAGMRREQPLPHGLRALAGDL YAAGAAVDFAVLYPTGRLINAPLPTWNHRRLLLDDTTRRIAHANTVAVHPLLGSHVRL PEEPERHVWQGEVGTVTQPWLADHQIHGAAALPGAAYCEMALAAARAVLGEASEVRDI RFEQMLLLDDETPIGVTATVEAPGVVPLTVETSHDGRYTRQLAAVLHVVREADDAPDQ PPQKNIAELLASHPHKVDGAEVRQWLDKRGHRLGPAFAGLVDAYIAEGAGDTVLAEVN LPGPLRSQVKAYGVHPVLLDACFQSVAAHPAVQGMADGGLLLPLGVRRLRSYGSARHA RYCCTTVTACGVGVEADLDVLDEHGAVVLAVRGLQLGTGASQASERARVLGERLLSIE WHERELPENSHAEPGAWLLISTCDATDLVAAQLTDALKVHDAQCTTMSWPQRADHAAQ AARLRDQLGTGGFTGVFVLTAPQTGDPDAESPVRGGELVKHVVRIAREIPEITAQEPR LYVLTHNAQAVLSGDRPNLEQGGMRGLLRVIGAEHPHLKASYVDVDEQTGAESVARQL LAASGEDETAWRNDQWYTARLCPAPLRPEERQTTVVDHAEAGMRLQIRTPGDLQTLEF AAFDRVPPGPGEIEVAVTASSINFADVLVTFGRYQTLDGRQPQLGTDFAGVVSAVGPG VSELKVGDRVGGMSPNGCWATFVTCDARLATRLPEGLTDAQAAAVTTASATAWYGLQD LARIKAGDKVLIHSATGGVGQAAIAIARAAGAQIYATAGNEKRRDLLRDMGIEHVYDS RSVEFAEQIRRDTAGYGVDIVLNSVTGAAQLAGLKLLALGGRFIEIGKRDIYSNTRLE LLPFRRNLAFYGLDLGLMSVSHPAAVRELLSTVYRLTVEGVLPMPQSTHYPLAEAATA IRVMGAAEHTGKLILDVPHAGRSSVVLPPEQARVFRSDGSYIITGGLGGLGLFLAEKM ANAGAGRIVLSSRSQPSQKALETIELVRAIGSDVVVECGDIAQPDTADRLVTAATATG LPLRGVLHAAAVVEDATLANITDELIERDWAPKAYGAWQLHRATADQPLDWFCSFSSA AALVGSPGQGAYAAANSWLDTFTHWRRAQDLPATSIAWGAWGQIGRAIAFAEQTGDAI APEEGAYAFETLLRHNRAYSGYAPVIGSPWLTAFAQHSPFAEKFQSLGQNRSGTSKFL AELVDLPREEWPDRLRRLLSKQVGLILRRTIDTDRLLSEYGLDSLSSQELRARVEAET GIRISATEINTTVRGLADLMCDKLAADRDAPAPA" gene complement(1728953..1729450) /gene="papA4" /locus_tag="Rv1528c" /db_xref="GeneID:886028" CDS complement(1728953..1729450) /gene="papA4" /locus_tag="Rv1528c" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1528c, (MTV045.02), len: 165 aa. Probable papA4, conserved polyketide synthase (PKS) associated protein; shows some similarity to C-terminal part of hypothetical proteins from Mycobacterium tuberculosis and Mycobacterium leprae e.g. Z97188|MTCY409_10 Mycobacterium tuberculosis cosmid (468) (37.9% identity in 66 aa overlap); or U00010_11 Mycobacterium leprae cosmid B1170 (35.7% identity in 84 aa overlap). Also similar to Mycobacterium tuberculosis PKS-associated proteins Rv1182, Rv3824c, Rv3820c." /codon_start=1 /transl_table=11 /product="polyketide synthase associated protein" /protein_id="NP_216044.1" /db_xref="GI:15608666" /db_xref="GeneID:886028" /translation="MTQLPQPTWRWWQQRETEQVQSSHIDGEIVGALIPDLAVLHSED ASRAAVGREKHRCSLDPLGGGFRSRRASMPAGALLLSAVIAIQLDRMNARVFGDGWIG AQACMWVNKFHEESTVTALSPSSPIAQGSIARHPETMQSAYVRIAEGGSRDVAPAAQL QRRRP" gene 1729502..1731256 /gene="fadD24" /locus_tag="Rv1529" /db_xref="GeneID:886432" CDS 1729502..1731256 /gene="fadD24" /locus_tag="Rv1529" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_216045.1" /db_xref="GI:15608667" /db_xref="GeneID:886432" /translation="MVASSIPTALRERASVHPNGAAITYIDYEQDWAGVAETLTWSQL YRRMLNVAEPLRHVGATGDRAVILAPQGIEYVVGFLGALQAGRIAVPLPVPHAGAHDE RTISVLSDTSPAVILTTSGAVDDVRECAQPQPGQSAPSIVELDLLDLDSRQRSRSPGA RPTGRDTPETAYLQYTSGSTRTPAGVMVSNKNVFANFEQIVADFFAPEGGVVPPDLTV VSWLPLYHDMGLLLGAIMPILAGVPTVLTSPVGFLQRPARWIQLLARNGRTISAGPNF AFELAVRKTSDDDMDGLDLAGVHTILNGSERVHPATLKRFAERFGRFNFAAAALRPAY GMAEATVYIATRNVNEPPEIVDFESEKLPAGQAIRCPSGSGTPLVSYGVPRSQLVRIV DPDTCIECPQGSVGEIWVQGGNVASGYWHKPEESKRTFGARIVTPSAGTPEAPWLRTG DSGFVSGGELFIIGRIKDLLIVYGRNHAPDDIEATIQEITSGRCAAIAVPDHGTEKLV AIIELKKRGDSDEDVADRLRIVKRDVAAAIFDSHGLSVADLVLVSPGSIPITTSGKIR RAQCVQLYRRREFTRLDA" gene 1731373..1732476 /gene="adh" /locus_tag="Rv1530" /db_xref="GeneID:886426" CDS 1731373..1732476 /gene="adh" /locus_tag="Rv1530" /EC_number="1.1.1.1" /function="Catalyzes the reversible oxidation of ethanol to acetaldehyde with the concomitant reduction of NAD" /note="Rv1530, (MTV045.04), len: 367 aa. Probable adh, alcohol dehydrogenase (EC 1.1.1.1), zinc-dependent, similar to many e.g. AE0009|AE000958_23 Archaeoglobus fulgidus section 1 (402 aa), FASTA scores: opt: 423, E(): 1.8e-19, (31.7% identity in 341 aa overlap). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="alcohol dehydrogenase adh" /protein_id="NP_216046.1" /db_xref="GI:15608668" /db_xref="GOA:O53904" /db_xref="UniProtKB/TrEMBL:O53904" /db_xref="GeneID:886426" /translation="MSDGAVVRALVLEAPRRLVVRQYRLPRIGDDDALVRVEACGLCG TDHEQYTGELAGGFAFVPGHETVGTIAAIGPRAEQRWGVSAGDRVAVEVFQSCRQCAN CRGGEYRRCVRHGLADMYGFIPVDREPGLWGGYAEYQYLAPDSMVLRVAGDLSPEVAT LFNPLGAGIRWGVTIPETKPGDVVAVLGPGIRGLCAAAAAKGAGAGFVMVTGLGPRDA DRLALAAQFGADLAVDVAIDDPVAALTEQTGGLADVVVDVTAKAPAAFAQAIALARPA GTVVVAGTRGVGSGAPGFSPDVVVFKELRVLGALGVDATAYRAALDLLVSGRYPFASL PRRCVRLEGAEDLLATMAGERDGVPPIHGVLTP" misc_feature 1731559..1731603 /gene="adh" /locus_tag="Rv1530" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene 1732473..1733039 /locus_tag="Rv1531" /db_xref="GeneID:886437" CDS 1732473..1733039 /locus_tag="Rv1531" /function="UNKNOWN" /note="Rv1531, (MTV045.05), len: 188 aa. Conserved hypothetical protein, similar to Rv0464c|MTV038.08c (190 aa), FASTA scores: E(): 4.8e-10, (30.9% identity in 175 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216047.1" /db_xref="GI:15608669" /db_xref="GOA:O53905" /db_xref="UniProtKB/TrEMBL:O53905" /db_xref="GeneID:886437" /translation="MTTSRVPLLPVDEAKAAADEAGVPDYMAELSIFQVLLNHPRLAR TFNDLLATMLWHGTLDSRLRELVIMRIGWLTDCDYEWTQHWRVASGLGVSADDLLGVR DWQGYNGFGPAEQAVLAATDDVVREGAVSAQSWSACERELHCDKVVLIELVTVISAWR MVASILHSLEVPLEDGVSSWPPDGLSPR" gene complement(1733116..1733550) /locus_tag="Rv1532c" /db_xref="GeneID:886422" CDS complement(1733116..1733550) /locus_tag="Rv1532c" /function="UNKNOWN" /note="Rv1532c, (MTCY07A7A.01c), len: 144 aa. Conserved hypothetical protein, similar to P20378|YPHR_HALHA Hypothetical 15.6 kDa protein from Halobacterium halobium (151 aa), FASTA scores: opt: 152, E():4.5e-05, (30.1% identity in 103 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216048.1" /db_xref="GI:15608670" /db_xref="GOA:O06178" /db_xref="UniProtKB/TrEMBL:O06178" /db_xref="GeneID:886422" /translation="MSDPLTAQEQHKRRQAVRELMPRTPFIGGLGIVFERYEPDDVVI RLPFRTDLTNDGTYFHGGVIASVMDTAGAAAAWSNHDFDRGTRAATVAMSIQYTGAAK RCDLLCHARTARRRKELTFTEITATDPDGNIVAHAVQTYRIV" gene 1733610..1734737 /locus_tag="Rv1533" /db_xref="GeneID:886424" CDS 1733610..1734737 /locus_tag="Rv1533" /function="UNKNOWN" /note="Rv1533, (MTCY07A7A.02), len: 375 aa. Conserved hypothetical protein. Similar to 2NPD_NEUCR|Q01284 2-nitropropane dioxygenase precursor (378 aa), fasta scores: opt: 279, E(): 9.1e-11, (31.3% identity in 256 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv1894c, Rv0021c, Rv3553, Rv2781c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216049.1" /db_xref="GI:15608671" /db_xref="GOA:O06179" /db_xref="UniProtKB/TrEMBL:O06179" /db_xref="GeneID:886424" /translation="MRTRVAELLGAEFPICAFSHCRDVVAAVSNAGGFGILGAVAHSP KRLESELTWIEEHTGGKPYGVDVLLPPKYIGAEQGGIDAQQARELIPEGHRTFVDDLL VRYGIPAVTDRQRSSSAGGLHISPKGYQPLLDVAFAHDIRLIASALGPPPPDLVERAH NHDVLVAALAGTAQHARRHAAAGVDLIVAQGTEAGGHTGEVATMVLVPEVVDAVSPTP VLAAGGIARGRQIAAALALGAEGVWCGSVWLTTEEAETPPVVKDKFLAATSSDTVRSR SLTGKPARMLRTAWTDEWDRPDSPDPLGMPLQSALVSDPQLRINQAAGQPGAKARELA TYFVGQVVGSLDRVRSARSVVLDMVEEFIDTVGQLQGLVQR" gene 1734734..1735411 /locus_tag="Rv1534" /db_xref="GeneID:886420" CDS 1734734..1735411 /locus_tag="Rv1534" /function="Possibly involved in a transcriptional mechanism" /note="Rv1534, (MTCY07A7A.03), len: 225 aa. Probable transcriptional regulator, similar to YCDC_ECOLI|P75899 hypothetical transcriptional regulator from Escherichia coli (212 aa), FASTA scores: opt: 166, E(): 9.8e-05, (24.2% identity in 219 aa overlap). Contains PS01081 Bacterial regulatory proteins, tetR family signature and helix turn helix motif (aa 41-62)." /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="NP_216050.1" /db_xref="GI:15608672" /db_xref="GOA:O08377" /db_xref="UniProtKB/TrEMBL:O08377" /db_xref="GeneID:886420" /translation="MSRASARRRRAVSDEDKSQRRDEILAAAKIVFAHKGFHATTVAD IAKQAGLAYGLIYWYFDSKDDLFHALMAGEEEALRAHVAAELARVGGSTEAPLRALLQ AAVQATFEFFETDKATVKLLFRDAYALGGRFEEHLGGIYERFIDDIEAVVVAAQRRGE VVEAPSRMAAYTLAALVGQLAHRRLNTDDNVTAAQVADFVVSLVLDGLRPRALAVGAR GGRAART" misc_feature 1734839..1734931 /locus_tag="Rv1534" /note="PS01081 Bacterial regulatory proteins, tetR family signature" gene 1735976..1736212 /locus_tag="Rv1535" /db_xref="GeneID:886429" CDS 1735976..1736212 /locus_tag="Rv1535" /function="UNKNOWN" /note="Rv1535, (MTCY07A7A.04), len: 78 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216051.1" /db_xref="GI:15608673" /db_xref="UniProtKB/TrEMBL:O06180" /db_xref="GeneID:886429" /translation="MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGT GGDPLVDGAARLLSIPLRHLYAALWRVGLLEVQA" gene 1736519..1739644 /gene="ileS" /locus_tag="Rv1536" /db_xref="GeneID:886412" CDS 1736519..1739644 /gene="ileS" /locus_tag="Rv1536" /EC_number="6.1.1.5" /function="charging ile tRNA [CATALYTIC ACTIVITY : ATP + L-ISOLEUCINE + TRNA(ILE) = AMP + DIPHOSPHATE + L-ISOLEUCYL-TRNA(ILE)]." /note="IleRS; catalyzes the formation of isoleucyl-tRNA(Ile) from isoleucine and tRNA(Ile); since isoleucine and other amino acids such as valine are similar, there are additional editing function in this enzyme; one is involved in hydrolysis of activated valine-AMP and the other is involved in deacylation of mischarged Val-tRNA(Ile); there are two active sites, one for aminoacylation and one for editing; class-I aminoacyl-tRNA synthetase family type 2 subfamily; some organisms carry two different copies of this enzyme; in some organisms, the type 2 subfamily is associated with resistance to the antibiotic pseudomonic acid (mupirocin)" /codon_start=1 /transl_table=11 /product="isoleucyl-tRNA synthetase" /protein_id="NP_216052.1" /db_xref="GI:15608674" /db_xref="GOA:Q10765" /db_xref="UniProtKB/Swiss-Prot:Q10765" /db_xref="GeneID:886412" /translation="MTDNAYPKLAGGAPDLPALELEVLDYWSRDDTFRASIARRDGAP EYVFYDGPPFANGLPHYGHLLTGYVKDIVPRYRTMRGYKVERRFGWDTHGLPAELEVE RQLGITDKSQIEAMGIAAFNDACRASVLRYTDEWQAYVTRQARWVDFDNDYKTLDLAY MESVIWAFKQLWDKGLAYEGYRVLPYCWRDETPLSNHELRMDDDVYQSRQDPAVTVGF KVVGGQPDNGLDGAYLLVWTTTPWTLPSNLAVAVSPDITYVQVQAGDRRFVLAEARLA AYARELGEEPVVLGTYRGAELLGTRYLPPFAYFMDWPNAFQVLAGDFVTTDDGTGIVH MAPAYGEDDMVVAEAVGIAPVTPVDSKGRFDVTVADYQGQHVFDANAQIVRDLKTQSG PAAVNGPVLIRHETYEHPYPHCWRCRNPLIYRSVSSWFVRVTDFRDRMVELNQQITWY PEHVKDGQFGKWLQGARDWSISRNRYWGTPIPVWKSDDPAYPRIDVYGSLDELERDFG VRPANLHRPYIDELTRPNPDDPTGRSTMRRIPDVLDVWFDSGSMPYAQVHYPFENLDW FQGHYPGDFIVEYIGQTRGWFYTLHVLATALFDRPAFKTCVAHGIVLGFDGQKMSKSL RNYPDVTEVFDRDGSDAMRWFLMASPILRGGNLIVTEQGIRDGVRQVLLPLWNTYSFL ALYAPKVGTWRVDSVHVLDRYILAKLAVLRDDLSESMEVYDIPGACEHLRQFTEALTN WYVRRSRSRFWAEDADAIDTLHTVLEVTTRLAAPLLPLITEIIWRGLTRERSVHLTDW PAPDLLPSDADLVAAMDQVRDVCSAASSLRKAKKLRVRLPLPKLIVAVENPQLLRPFV DLIGDELNVKQVELTDAIDTYGRFELTVNARVAGPRLGKDVQAAIKAVKAGDGVINPD GTLLAGPAVLTPDEYNSRLVAADPESTAALPDGAGLVVLDGTVTAELEAEGWAKDRIR ELQELRKSTGLDVSDRIRVVMSVPAEREDWARTHRDLIAGEILATDFEFADLADGVAI GDGVRVSIEKT" misc_feature 1736675..1736710 /gene="ileS" /locus_tag="Rv1536" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene 1739841..1741247 /gene="dinX" /locus_tag="Rv1537" /db_xref="GeneID:886414" CDS 1739841..1741247 /gene="dinX" /locus_tag="Rv1537" /EC_number="2.7.7.7" /function="INVOLVED IN DNA METABOLISM [CATALYTIC ACTIVITY: N DEOXYNUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {DNA}N]." /note="involved in translesion DNA polymerization with beta clamp of polymerase III; belongs to Y family of polymerases; does not contain proofreading function" /codon_start=1 /transl_table=11 /product="DNA polymerase IV" /protein_id="NP_216053.2" /db_xref="GI:161352466" /db_xref="GOA:P63985" /db_xref="UniProtKB/Swiss-Prot:P63985" /db_xref="GeneID:886414" /translation="MESRWVLHLDMDAFFASVEQLTRPTLRGRPVLVGGLGGRGVVAG ASYEARAYGARSAMPMHQARRLIGVTAVVLPPRGVVYGIASRRVFDTVRGLVPVVEQL SFDEAFAEPPQLAGAVAEDVETFCERLRRRVRDETGLIASVGAGSGKQIAKIASGLAK PDGIRVVRHAEEQALLSGLPVRRLWGIGPVAEEKLHRLGIETIGQLAALSDAEAANIL GATIGPALHRLARGIDDRPVVERAEAKQISAESTFAVDLTTMEQLHEAIDSIAEHAHQ RLLRDGRGARTITVKLKKSDMSTLTRSATMPYPTTDAGALFTVARRLLPDPLQIGPIR LLGVGFSGLSDIRQESLFADSDLTQETAAAHYVETPGAVVPAAHDATMWRVGDDVAHP ELGHGWVQGAGHGVVTVRFETRGSGPGSARTFPVDTGDISNASPLDSLDWPDYIGQLS VEGSAGASAPTVDDVGDR" gene complement(1741212..1742192) /gene="ansA" /locus_tag="Rv1538c" /db_xref="GeneID:886410" CDS complement(1741212..1742192) /gene="ansA" /locus_tag="Rv1538c" /EC_number="3.5.1.1" /function="conversion of asparagine to aspartate [CATALYTIC ACTIVITY : L-ASPARAGINE + H(2)O = L-ASPARTATE + NH(3).]" /note="Rv1538c, (MTCY48.27), len: 326 aa. Probable ansA, L-aparaginase, most similar to ASPG_BACLI|P30363 L-asparaginase (322 aa), FASTA scores: opt: 417, E(): 8.8e-19, (30.9% identity in 314 aa overlap). Contains PS00917 Asparaginase / glutaminase active site signature 2." /codon_start=1 /transl_table=11 /product="L-aparaginase ansA" /protein_id="NP_216054.1" /db_xref="GI:15608676" /db_xref="GOA:P63627" /db_xref="UniProtKB/Swiss-Prot:P63627" /db_xref="GeneID:886410" /translation="MGANHVRNDPIMARLTVITTGGTISTTAGPDGVLRPTHCGATLI AGLDMDSDIEVVDLMALDSSKLTPADWDRIGAAVQEAFRGGADGVVITHGTDTLEETA LWLDLTYAGSRPVVLTGAMLSADAPGADGPANLRDALAVAADPAARDLGVLVSFGGRV LQPLGLHKVANPDLCGFAGESLGFTSGGVRLTRTKTRPYLGDLGAAVAPRVDIVAVYP GSDAVAMDACVAAGARAVVLEALGSGNAGAAVIEGVRRHCRDGSDPVVIAVSTRVAGA RVGAGYGPGHDLVEAGAVMVPRLPPSQARVLLMAALAANSPVADVIDRWG" misc_feature complement(1741899..1741931) /gene="ansA" /locus_tag="Rv1538c" /note="PS00917 Asparaginase / glutaminase active site signature 2" gene 1742244..1742852 /gene="lspA" /locus_tag="Rv1539" /db_xref="GeneID:886408" CDS 1742244..1742852 /gene="lspA" /locus_tag="Rv1539" /EC_number="3.4.23.36" /function="THIS PROTEIN SPECIFICALLY CATALYZES THE REMOVAL OF SIGNAL PEPTIDES FROM PROLIPOPROTEINS [CATALYTIC ACTIVITY : CLEAVAGE OF N-TERMINAL LEADER SEQUENCES FROM MEMBRANE PROLIPOPROTEINS. HYDROLYSES XAA-XBB-XBB-|-CYS, IN WHICH XAA IS HYDROPHOBIC (PREFERABLY LEU), XBB IS OFTEN SER OR ALA, XCC IS OFTEN GLY OR ALA, AND THE CYS IS ALKYLATED ON SULFUR WITH A DIACYLGLYCERYL GROUP]." /note="lipoprotein signal peptidase; integral membrane protein that removes signal peptides from prolipoproteins during lipoprotein biosynthesis" /codon_start=1 /transl_table=11 /product="lipoprotein signal peptidase" /protein_id="NP_216055.1" /db_xref="GI:15608677" /db_xref="GOA:P65262" /db_xref="UniProtKB/Swiss-Prot:P65262" /db_xref="GeneID:886408" /translation="MPDEPTGSADPLTSTEEAGGAGEPNAPAPPRRLRMLLSVAVVVL TLDIVTKVVAVQLLPPGQPVSIIGDTVTWTLVRNSGAAFSMATGYTWVLTLIATGVVV GIFWMGRRLVSPWWALGLGMILGGAMGNLVDRFFRAPGPLRGHVVDFLSVGWWPVFNV ADPSVVGGAILLVILSIFGFDFDTVGRRHADGDTVGRRKADG" gene 1742845..1743771 /locus_tag="Rv1540" /db_xref="GeneID:886404" CDS 1742845..1743771 /locus_tag="Rv1540" /function="UNKNOWN" /note="Rv1540, (MTCY48.25c), len: 308 aa. Member of the yabO/yceC/yfiI family of hypothetical proteins, similar to P44445|YFII_HAEIN hypothetical protein HI0176 from Haemophilus influenzae (324 aa), FASTA scores: opt: 437, E(): 1.2e-22, (33.2% identity in 322 aa overlap). Equivalent to AL049478|MLCL458_13 hypothetical protein from Mycobacterium leprae (308 aa), (89.3% identity in 307 aa overlap). Contains PS01129 hypothetical yabO/yceC/yfiI family signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216056.1" /db_xref="GI:15608678" /db_xref="GOA:Q10786" /db_xref="UniProtKB/Swiss-Prot:Q10786" /db_xref="GeneID:886404" /translation="MADRSMPVPDGLAGMRVDTGLARLLGLSRTAAAALAEEGAVELN GVPAGKSDRLVSGALLQVRLPEAPAPLQNTPIDIEGMTILYSDDDIVAVDKPAAVAAH ASVGWTGPTVLGGLAAAGYRITTSGVHERQGIVHRLDVGTSGVMVVAISERAYTVLKR AFKYRTVDKRYHALVQGHPDPSSGTIDAPIGRHRGHEWKFAITKNGRHSLTHYDTLEA FVAASLLDVHLETGRTHQIRVHFAALHHPCCGDLVYGADPKLAKRLGLDRQWLHARSL AFAHPADGRRVEIVSPYPADLQHALKILRGEG" misc_feature 1743250..1743291 /locus_tag="Rv1540" /note="PS01129 Hypothetical yabO/yceC/yfiI family signature" gene complement(1743778..1744371) /gene="lprI" /locus_tag="Rv1541c" /db_xref="GeneID:886406" CDS complement(1743778..1744371) /gene="lprI" /locus_tag="Rv1541c" /function="UNKNOWN" /note="Rv1541c, (MTCY48.24), len: 197 aa. Possible lipoprotein lprI, contains appropriately positioned prokaryotic membrane lipoprotein lipid attachment site (PS0013)." /codon_start=1 /transl_table=11 /product="lipoprotein LprI" /protein_id="NP_216057.1" /db_xref="GI:15608679" /db_xref="GOA:P65318" /db_xref="UniProtKB/Swiss-Prot:P65318" /db_xref="GeneID:886406" /translation="MRWIGVLVTALVLSACAANPPANTTSPTAGQSLDCTKPATIVQQ LVCHDRQLTSLDHRLSTAYQQALAHRRSAALEAAQSSWTMLRDACAQDTDPRTCVQEA YQTRLVQLAIADPATATPPVLTYRCPTQDGPLTAQFYNQFDPKTAVLNWKGDQVIVFV ELSGSGARYGRQGIEYWEHQGEVRLDFHGATFVCRTS" gene complement(1744426..1744836) /gene="glbN" /locus_tag="Rv1542c" /db_xref="GeneID:886402" CDS complement(1744426..1744836) /gene="glbN" /locus_tag="Rv1542c" /function="oxygen transport" /note="Rv1542c, (MTCY48.23), len: 136 aa. Probable glbN, hemoglobin. Belongs to the protozoan/cyanobacterial globin family. Similar to myoglobins e.g. GLB_PARCA|P15160 myoglobin (hemoglobin) paramecium (116 aa), FASTA scores, opt: 284, E(): 2.1e -13, (35.7% identity in 115 aa overlap). Similar to Mycobacterium tuberculosis hypothetical globin, Rv2470." /codon_start=1 /transl_table=11 /product="hemoglobin glbN" /protein_id="NP_216058.1" /db_xref="GI:15608680" /db_xref="GOA:Q10784" /db_xref="UniProtKB/Swiss-Prot:Q10784" /db_xref="GeneID:886402" /translation="MGLLSRLRKREPISIYDKIGGHEAIEVVVEDFYVRVLADDQLSA FFSGTNMSRLKGKQVEFFAAALGGPEPYTGAPMKQVHQGRGITMHHFSLVAGHLADAL TAAGVPSETITEILGVIAPLAVDVTSGESTTAPV" gene 1745064..1746089 /locus_tag="Rv1543" /db_xref="GeneID:886400" CDS 1745064..1746089 /locus_tag="Rv1543" /EC_number="1.2.1.-" /function="Thought to reduce acyl-CoA esters of fatty acids to fatty aldehydes." /experiment="experimental evidence, no additional details recorded" /note="Rv1543, (MTCY48.22c), len: 341 aa. Possible fatty-acyl CoA reductase (EC 1.2.1.-), highly similar to P94129|U77680 FATTY ACYL-CoA REDUCTASE ACR1 from Acinetobacter calcoaceticus (295 aa), FASTA scores: opt: 899, E(): 0, (48.5% identity in 293 aa overlap). Also highly similar to acrA1|Rv3391|MTV004.49|NP_217908.1|NC_000962 fatty acyl-CoA reductase from Mycobacterium tuberculosis (650 aa). Also highly similar to many oxidoreductases short-chain family." /codon_start=1 /transl_table=11 /product="fatty acyl-CoA reductase" /protein_id="NP_216059.1" /db_xref="GI:15608681" /db_xref="GOA:P66779" /db_xref="UniProtKB/Swiss-Prot:P66779" /db_xref="GeneID:886400" /translation="MNLGDLTNFVEKPLAAVSNIVNTPNSAGRYRPFYLRNLLDAVQG RNLNDAVKGKVVLITGGSSGIGAAAAKKIAEAGGTVVLVARTLENLENVANDIRAIRG NGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNAGRSIRRSLELSYDRIHDYQ RTMQLNYLGAVQLILKFIPGMRERHFGHIVNVSSVGVQTRAPRFGAYIASKAALDSLC DALQAETVHDNVRFTTVHMALVRTPMISPTTIYDKFPTLTPDQAAGVITDAIVHRPRR ASSPFGQFAAVADAVNPAVMDRVRNRAFNMFGDSSAAKGSESQTDTSELDKRSETFVR ATRGIHW" gene 1746094..1746897 /locus_tag="Rv1544" /db_xref="GeneID:886417" CDS 1746094..1746897 /locus_tag="Rv1544" /EC_number="1.3.1.-" /function="UNKNOWN, BUT POSSIBLY INVOLVEMENT IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1544, (MTCY48.21), len: 267 aa. Possible ketoacyl reductase (EC 1.3.1.-), highly similar to Z97179|MLCL383_26 putative oxidoreductase from Mycobacterium leprae (268 aa), FASTA score: (43.0% identity in 270 aa overlap). Also highly similar to others e.g. T29125 ketoacyl reductase homolog from Streptomyces coelicolor (276 aa); NP_470957.1|NC_003212 protein similar to ketoacyl reductases from Listeria innocua (253 aa); HETN_ANASP|P37694 ketoacyl reductase from Anabaena sp. strain PCC 7120 (287 aa), FASTA scores: opt: 379, E(): 7.5e-18, (31.6% identity in 250 aa overlap); etc. And highly similar to many oxidoreductases short-chain family. Also highly similar to Rv2509 from Mycobacterium tuberculosis (268 aa). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="ketoacyl reductase" /protein_id="NP_216060.1" /db_xref="GI:15608682" /db_xref="GOA:Q10782" /db_xref="UniProtKB/TrEMBL:Q10782" /db_xref="GeneID:886417" /translation="MSLPKPNNQTTVVITGASSGIGVELARGLAGRGFPLMLVARRRE RLDELADQLRQEHCVGVEVLPLDLADTQARAQLADRLRSDAIAGLCNSAGFGTSGRFW ELPFARESEEVVLNALALMELTHAALPGMVKRGAGAVLNIASIAGFQPIPYMAVYSAT KAFVLTFSEAVQEELHGTGVSVTALCPGPVPTEWAEIASAERFSIPLAQVSPHDVAEA AIAGMLSGKRTVVPGIVPKFVSTSGRFAPRSLLLPAIRIGNRLRGGPSR" misc_feature 1746523..1746609 /locus_tag="Rv1544" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 1746919..1747146 /locus_tag="Rv1545" /db_xref="GeneID:886398" CDS 1746919..1747146 /locus_tag="Rv1545" /function="UNKNOWN" /note="Rv1545, (MTCY48.20), len: 75 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216061.1" /db_xref="GI:15608683" /db_xref="UniProtKB/Swiss-Prot:P64871" /db_xref="GeneID:886398" /translation="MPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVA CREEVGITTILAGRDECGVCDKTAGLDGAAP" gene 1747195..1747626 /locus_tag="Rv1546" /db_xref="GeneID:886394" CDS 1747195..1747626 /locus_tag="Rv1546" /function="UNKNOWN" /note="Rv1546, (MTCY48.19c), len: 143 aa. Conserved hypothetical protein, similar to O05902|Rv0910|MTCY21C12.04 Hypothetical protein from Mycobacterium tuberculosis (144 aa), FASTA scores: E(): 5e-30, (37.3% identity in 142 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216062.1" /db_xref="GI:15608684" /db_xref="UniProtKB/Swiss-Prot:P64873" /db_xref="GeneID:886394" /translation="MASVELSADVPISPQDTWDHVSELSELGEWLVIHEGWRSELPDQ LGEGVQIVGVARAMGMRNRVTWRVTKWDPPHEVAMTGSGKGGTKYGVTLTVRPTKGGS ALGLRLELGGRALFGPLGSAAARAVKGDVEKSLKQFAELYG" gene 1747694..1751248 /gene="dnaE" /locus_tag="Rv1547" /db_xref="GeneID:886392" CDS 1747694..1751248 /gene="dnaE" /locus_tag="Rv1547" /EC_number="2.7.7.7" /function="DNA POLYMERASE III IS A COMPLEX, MULTICHAIN ENZYME RESPONSIBLE FOR MOST OF THE REPLICATIVE SYNTHESIS IN BACTERIA. THIS DNA POLYMERASE ALSO EXHIBITS 3' TO 5' EXONUCLEASE ACTIVITY. THE ALPHA CHAIN IS THE DNA POLYMERASE [CATALYTIC ACTIVITY : N DEOXYNUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {DNA}(N)]." /note="catalyzes DNA-template-directed extension of the 3'- end of a DNA strand by one nucleotide at a time; main replicative polymerase" /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit alpha" /protein_id="NP_216063.1" /db_xref="GI:15608685" /db_xref="GOA:P63977" /db_xref="UniProtKB/Swiss-Prot:P63977" /db_xref="GeneID:886392" /translation="MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVG MTDHGNMFGASEFYNSATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVSGS GSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPS GEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALN IPPLATNDCHYVTRDAAHNHEALLCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWD DEVPGACDSTLLIAERVQSYADVWTPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFP AGPPDGYRERAAYEIDVICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAY ALGITDIDPIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQV ITFGTIKTKAALKDSARIHYGQPGFAIADRITKALPPAIMAKDIPLSGITDPSHERYK EAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMSSEPLTEAIPLWKRPQD GAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDAIDNVRANRGIDLDLESVPLDDKAT YELLGRGDTLGVFQLDGGPMRDLLRRMQPTGFEDVVAVIALYRPGPMGMNAHNDYADR KNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRIAQKVASYSLARADILRKAMG KKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWT AYLKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNFASVGQD IRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISACNKKVTESLIKAGAF DSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTADPVFTIKVPDDE WEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPNDAQVRVGGIL ASVNRRVNKNGMPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRD DRIALIANDLTVPDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTSQVHLRL ISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS" gene complement(1751297..1753333) /gene="PPE21" /locus_tag="Rv1548c" /db_xref="GeneID:886384" CDS complement(1751297..1753333) /gene="PPE21" /locus_tag="Rv1548c" /function="UNKNOWN" /note="Rv1548c, (MTCY48.17), len: 678 aa. Member of the Mycobacterium tuberculosis PPE family, similar to several e.g. YHS6_MYCTU|P42611 hypothetical 50.6 kDa protein in hsp65 3' region (517 aa), FASTA scores: opt:1142, E(): 0, (40.6% identity in 616 aa overlap); also similar to MTCY31.06c (54.9% identity in 381 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177817.1" /db_xref="GI:57116882" /db_xref="GOA:Q10778" /db_xref="UniProtKB/Swiss-Prot:Q10778" /db_xref="GeneID:886384" /translation="MNFSVLPPEINSALMFAGAGPGPMLAAASAWTGLAGDLGSAAAS FSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSAFEAALA ATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASA VALSLTPFTPSPSAAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPG SANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYN LGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGD TNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFG NSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQ LSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTG SFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTG TNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSV PTITGTANISGFVNAGTSISGFFNFGSLMSGFANFDDEVSGYLNGDSRASGWIH" gene 1753510..1754037 /gene="fadD11.1" /locus_tag="Rv1549" /db_xref="GeneID:886386" CDS 1753510..1754037 /gene="fadD11.1" /locus_tag="Rv1549" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1549, (MTCY48.16c), len: 175 aa. Possible fadD11.1, fatty-acid-CoA synthetase (EC 6.2.1.-), similar to the N-terminus of many fatty-acid CoA synthetases e.g. NP_147860.1|NC_000854 long-chain-fatty-acid--CoA ligase from Aeropyrum pernix (651 aa); P31685|4CL2_SOLTU 4-coumarate--CoA ligase 2 (EC 6.2.1.12) from Solanum tuberosum (Potato) (545 aa), FASTA scores: opt: 168, E(): 4.4e-06, (30.4% identity in 112 aa overlap); etc. Possible frameshift with respect to next ORF Rv1550|MTCY48.15c but we can find no sequence error to account for this. Note that previously known as fadD11'.; fadD11'" /codon_start=1 /transl_table=11 /product="fatty-acid-CoA ligase" /protein_id="YP_177818.1" /db_xref="GI:57116883" /db_xref="GOA:Q10777" /db_xref="UniProtKB/Swiss-Prot:Q10777" /db_xref="GeneID:886386" /translation="MVAAPCFRVLRLWTYAHRCDLGHTDPLSRRTEMTTTERPTTMCE AFQRTAVMDPDAVALRTPGGNQTMTWRDYAAQVRRVAAGLAGLGVRRGDTVSLMMANR IEFYPLDVGAQHVGATSFSVYNTLPAEQLTYVFDNAGTKVVICEQQYVDRVRASGVPI EHIVCVDGAPPARSR" gene 1753716..1755431 /gene="fadD11" /locus_tag="Rv1550" /db_xref="GeneID:886380" CDS 1753716..1755431 /gene="fadD11" /locus_tag="Rv1550" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1550, (MTCY48.15c), len: 571 aa. Probable fadD11, fatty-acid-CoA synthetase (EC 6.2.1.-), similar, except in N-terminus, to many e.g. SC6A5.39|T35430 probable long-chain-fatty-acid--CoA ligase (EC 6.2.1.3) from Streptomyces coelicolor (612 aa); NP_301672.1|NC_002677 putative long-chain-fatty-acid-CoA ligase from Mycobacterium leprae (600 aa); P44446|LCFH_HAEIN putative long-chain-fatty-acid-CoA ligase from Haemophilus influenzae (607 aa), FASTA scores: opt: 762, E(): 2.3e-38, (34.4% identity in 436 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. Possible frameshift with respect to previous ORF Rv1549|MTCY48.16c but we can find no sequence error to account for this." /codon_start=1 /transl_table=11 /product="fatty-acid-CoA ligase" /protein_id="NP_216066.1" /db_xref="GI:15608688" /db_xref="GOA:Q10776" /db_xref="UniProtKB/Swiss-Prot:Q10776" /db_xref="GeneID:886380" /translation="MARLRGAGAAGRCRPGRFGSSARRHGLADDGEPDRVLPARRRCS ARRRHLVFGVQHPARRAADLRVRQRGDQGGHLRATVRRSRSRQRCAHRTHRLRRWRAP GTLSLTDLYAAASGDFFDFESTWRAVQPEDIVTLIYTSGTTGNPKGVEMTHANLLFEG YAIDEVLGIRFGDRVTSFLPSAHIADRMTGLYLQEMFGTQVTAVADARTIAAALPDVR PTVWGAVPRVWEKLKAGIEFTVARETDEMKRQALAWAMSVAGKRANALLAGESMSDQL VAEWAKADELVLSKLRERLGFGELRWALSGAAPIPKETLAFFAGIGIPIAEIWGMSEL SCVATASHPRDGRLGTVGKLLPGLQGKIAEDGEYLVRGPLVMKGYRKEPAKTAEAIDS DGWLHTGDVFDIDSDGYLRVVDRKKELIINAAGKNMSPANIENTILAACPMVGVMMAI GDGRTYNTALLVFDADSLGPYAAQRGLDASPAALAADPEVIARIAAGVAEGNAKLSRV EQIKRFRILPTLWEPGGDEITLTMKLKRRRIAAKYSAEIEELYASELRPQVYEPAAVP STQPA" misc_feature 1754121..1754156 /gene="fadD11" /locus_tag="Rv1550" /note="PS00455 Putative AMP-binding domain signature" gene 1755445..1757310 /gene="plsB1" /locus_tag="Rv1551" /db_xref="GeneID:886382" CDS 1755445..1757310 /gene="plsB1" /locus_tag="Rv1551" /EC_number="2.3.1.15" /function="Thought to be involved in lipid metabolism." /note="PlsB; catalyzes the formation of 1-acyl-sn-glycerol 3-phosphate by transfering the acyl moiety from acyl-CoA" /codon_start=1 /transl_table=11 /product="glycerol-3-phosphate acyltransferase" /protein_id="NP_216067.1" /db_xref="GI:15608689" /db_xref="GOA:P65734" /db_xref="UniProtKB/Swiss-Prot:P65734" /db_xref="GeneID:886382" /translation="MTAREVGRIGLRKLLQRIGIVAESMTPLATDPVEVTQLLDARWY DERLRALADELGRDPDSVRAEAAGYLREMAASLDERAVQAWRGFSRWLMRAYDVLVDE DQITQLRKLDRKATLAFAFSHRSYLDGMLLPEAILANRLSPALTFGGANLNFFPMGAW AKRTGAIFIRRQTKDIPVYRFVLRAYAAQLVQNHVNLTWSIEGGRTRTGKLRPPVFGI LRYITDAVDEIDGPEVYLVPTSIVYDQLHEVEAMTTEAYGAVKRPEDLRFLVRLARQQ GERLGRAYLDFGEPLPLRKRLQEMRADKSGTGSEIERIALDVEHRINRATPVTPTAVV SLALLGADRSLSISEVLATVRPLASYIAARNWAVAGAADLTNRSTIRWTLHQMVASGV VSVYDAGTEAVWGIGEDQHLVAAFYRNTAIHILVDRAVAELALLAAAETTTNGSVSPA TVRDEALSLRDLLKFEFLFSGRAQFEKDLANEVLLIGSVVDTSKPAAAADVWRLLESA DVLLAHLVLRPFLDAYHIVADRLAAHEDDSFDEEGFLAECLQVGKQWELQRNIASAES RSMELFKTALRLARHRELVDGADATDIAKRRQQFADEIATATRRVNTIAELARRQ" gene 1757681..1759432 /gene="frdA" /locus_tag="Rv1552" /db_xref="GeneID:886376" CDS 1757681..1759432 /gene="frdA" /locus_tag="Rv1552" /EC_number="1.3.99.1" /function="INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (ANAEROBIC RESPIRATION) [CATALYTIC ACTIVITY : SUCCINATE + ACCEPTOR = FUMARATE + REDUCED ACCEPTOR]." /experiment="experimental evidence, no additional details recorded" /note="part of four member fumarate reductase enzyme complex FrdABCD which catalyzes the reduction of fumarate to succinate during anaerobic respiration; FrdAB are the catalytic subcomplex consisting of a flavoprotein subunit and an iron-sulfur subunit, respectively; FrdCD are the membrane components which interact with quinone and are involved in electron transfer; the catalytic subunits are similar to succinate dehydrogenase SdhAB" /codon_start=1 /transl_table=11 /product="fumarate reductase flavoprotein subunit" /protein_id="NP_216068.1" /db_xref="GI:15608690" /db_xref="GOA:P64174" /db_xref="UniProtKB/Swiss-Prot:P64174" /db_xref="GeneID:886376" /translation="MTAQHNIVVIGGGGAGLRAAIAIAETNPHLDVAIVSKVYPMRSH TVSAEGGAAAVTGDDDSLDEHAHDTVSGGDWLCDQDAVEAFVAEAPKELVQLEHWGCP WSRKPDGRVAVRPFGGMKKLRTWFAADKTGFHLLHTLFQRLLTYSDVMRYDEWFATTL LVDDGRVCGLVAIELATGRIETILADAVILCTGGCGRVFPFTTNANIKTGDGMALAFR AGAPLKDMEFVQYHPTGLPFTGILITEAARAEGGWLLNKDGYRYLQDYDLGKPTPEPR LRSMELGPRDRLSQAFVHEHNKGRTVDTPYGPVVYLDLRHLGADLIDAKLPFVRELCR DYQHIDPVVELVPVRPVVHYMMGGVHTDINGATTLPGLYAAGETACVSINGANRLGSN SLPELLVFGARAGRAAADYAARHQKSDRGPSSAVRAQARTEALRLERELSRHGQGGER IADIRADMQATLESAAGIYRDGPTLTKAVEEIRVLQERFATAGIDDHSRTFNTELTAL LELSGMLDVALAIVESGLRREESRGAHQRTDFPNRDDEHFLAHTLVHRESDGTLRVGY LPVTITRWPPGERVYGR" misc_feature 1757804..1757833 /gene="frdA" /locus_tag="Rv1552" /note="PS00504 Fumarate reductase / succinate dehydrogenase FAD-binding site" gene 1759435..1760178 /gene="frdB" /locus_tag="Rv1553" /db_xref="GeneID:886378" CDS 1759435..1760178 /gene="frdB" /locus_tag="Rv1553" /EC_number="1.3.99.1" /function="INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (ANAEROBIC RESPIRATION) [CATALYTIC ACTIVITY : SUCCINATE + ACCEPTOR = FUMARATE + REDUCED ACCEPTOR]." /note="Rv1553, (MTCY48.12c), len: 247 aa. Probable frdB, fumarate reductase, iron-sulfur subunit (EC 1.3.99.1), highly similar to others e.g. P00364|FRDB_ECOLI fumarate reductase iron-sulfur protein from Escherichia coli strain K12 (243 aa), FASTA scores: opt: 846, E(): 0, (50.0% identity in 242 aa overlap); P20921|FRDB_PROVU FUMARATE REDUCTASE IRON-SULFUR PROTEIN from Proteus vulgaris (245 aa); G64097 fumarate reductase (EC 1.3.99.1) iron-sulfur protein from Haemophilus influenzae (276 aa); etc. Contains PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature. NOTE THAT FUMARATE REDUCTASE FORMS PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN (Rv1552|frdA), AN IRON-SULFUR (Rv1553|frdB), AND TWO HYDROPHOBIC ANCHOR PROTEINS (Rv1554|frdC and Rv1555|frdD)." /codon_start=1 /transl_table=11 /product="fumarate reductase iron-sulfur subunit FrdB" /protein_id="NP_216069.1" /db_xref="GI:15608691" /db_xref="GOA:Q10761" /db_xref="UniProtKB/Swiss-Prot:Q10761" /db_xref="GeneID:886378" /translation="MMDRIVMEVSRYRPEIESAPTFQAYEVPLTREWAVLDGLTYIKD HLDGTLSFRWSCRMGICGSSGMTINGDPKLACATFLADYLPGPVRVEPMRNFPVIRDL VVDISDFMAKLPSVKPWLVRHDEPPVEDGEYRQTPAELDAFKQFSMCINCMLCYSACP VYALDPDFLGPAAIALGQRYNLDSRDQGAADRRDVLAAADGAWACTLVGECSTACPKG VDPAGAIQRYKLTAATHALKKLLFPWGGG" misc_feature 1759879..1759914 /gene="frdB" /locus_tag="Rv1553" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" gene 1760175..1760555 /gene="frdC" /locus_tag="Rv1554" /db_xref="GeneID:886371" CDS 1760175..1760555 /gene="frdC" /locus_tag="Rv1554" /EC_number="1.3.99.1" /function="INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (ANAEROBIC RESPIRATION). THIS HYDROPHOBIC COMPONENT MAY BE REQUIRED TO ANCHOR THE CATALYTIC COMPONENTS OF THE FUMARATE REDUCTASE COMPLEX TO THE CYTOPLASMIC MEMBRANE." /note="part of four member fumarate reductase enzyme complex FrdABCD which catalyzes the reduction of fumarate to succinate during anaerobic respiration; FrdCD are the membrane components which interact with quinone and are involved in electron transfer; FrdAB are the catalytic subcomplex consisting of a flavoprotein subunit and an iron-sulfur subunit, respectively; the catalytic subunits are similar to succinate dehydrogenase SdhAB" /codon_start=1 /transl_table=11 /product="fumarate reductase subunit C" /protein_id="NP_216070.1" /db_xref="GI:15608692" /db_xref="GOA:Q10762" /db_xref="UniProtKB/Swiss-Prot:Q10762" /db_xref="GeneID:886371" /translation="MSAYRQPVERYWWARRRSYLRFMLREISCIFVAWFVLYLMLVLR AVGAGGNSYQRFLDFSANPVVVVLNVVALSFLLLHAVTWFGSAPRAMVIQVRGRRVPA RAVLAGHYAAWLVVSVIVAWMVLS" gene 1760552..1760929 /gene="frdD" /locus_tag="Rv1555" /db_xref="GeneID:886389" CDS 1760552..1760929 /gene="frdD" /locus_tag="Rv1555" /EC_number="1.3.99.1" /function="INVOLVED IN INTERCONVERSION OF FUMARATE AND SUCCINATE (ANAEROBIC RESPIRATION). THIS HYDROPHOBIC COMPONENT MAY BE REQUIRED TO ANCHOR THE CATALYTIC COMPONENTS OF THE FUMARATE REDUCTASE COMPLEX TO THE CYTOPLASMIC MEMBRANE." /note="in conjunction with FrdC acts to anchor the catalytic components of the fumarate reductase to the cytoplasmic membrane" /codon_start=1 /transl_table=11 /product="fumarate reductase subunit D" /protein_id="NP_216071.1" /db_xref="GI:15608693" /db_xref="GOA:P67643" /db_xref="UniProtKB/Swiss-Prot:P67643" /db_xref="GeneID:886389" /translation="MTPSTSDARSRRRSAEPFLWLLFSAGGMVTALVAPVLLLLFGLA FPLGWLDAPDHGHLLAMVRNPITKLVVLVLVVLALFHAAHRFRFVLDHGLQLGRFDRV IALWCYGMAVLGSATAGWMLLTM" gene 1760997..1761605 /locus_tag="Rv1556" /db_xref="GeneID:886367" CDS 1760997..1761605 /locus_tag="Rv1556" /function="Possibly involved in a transcriptional mechanism" /note="Rv1556, (MTCY48.09c), len: 202 aa. Possible regulatory protein, similar to X86780|SHGCPIR2|g987088 orfY, regulator of antibiotic transport complexes from Streptomyces hygroscopicus (204 aa), FASTA score: opt: 251, E(): 1.7e-10, (33.8% identity in 201 aa overlap) and others." /codon_start=1 /transl_table=11 /product="regulatory protein" /protein_id="NP_216072.1" /db_xref="GI:15608694" /db_xref="GOA:P67436" /db_xref="UniProtKB/Swiss-Prot:P67436" /db_xref="GeneID:886367" /translation="MVGAVTQIADRPTDPSPWSPRETELLAVTLRLLQEHGYDRLTVD AVAASARASKATVYRRWPSKAELVLAAFIEGIRQVAVPPNTGNLRDDLLRLGELICRE VGQHASTIRAVLVEVSRNPALNDVLQHQFVDHRKALIQYILQQAVDRGEISSAAISDE LWDLLPGYLIFRSIIPNRPPTQDTVQALVDDVILPSLTRSTG" gene 1761744..1762937 /gene="mmpL6" /locus_tag="Rv1557" /db_xref="GeneID:886033" CDS 1761744..1762937 /gene="mmpL6" /locus_tag="Rv1557" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /note="Rv1557, (MTCY48.08c), len: 397 aa. Probable mmpL6, conserved transmembrane transport protein (see citations below). Member of RND superfamily, with strong similarity to C-terminal part of members of large Mycobacterial membrane protein family belonging to RND superfamily including: mmpL1, mmpL2, mmpL3, etc. Probably truncated (see Brosch et al., 2002). BELONGS TO THE MMPL FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL6" /protein_id="NP_216073.1" /db_xref="GI:15608695" /db_xref="GOA:Q10773" /db_xref="UniProtKB/Swiss-Prot:Q10773" /db_xref="GeneID:886033" /translation="MQGISVTGLVKRGWMVRSVFDTIDGIDQLGEQLASVTVTLDKLA AIQPQLVALLPDEIASQQINRELALANYATMSGIYAQTAALIENAAAMGQAFDAAKND DSFYLPPEAFDNPDFQRGLKLFLSADGKAARMIISHEGDPATPEGISHIDAIKQAAHE AVKGTPMAGAGIYLAGTAATFKDIQDGATYDLLIAGIAALSLILLIMMIITRSLVAAL VIVGTVALSLGASFGLSVLVWQHLLGIQLYWIVLALAVILLLAVGSDYNLLLISRFKE EIGAGLNTGIIRAMAGTGGVVTAAGLVFAATMSSFVFSDLRVLGQIGTTIGLGLLFDT LVVRAFMTPSIAVLLGRWFWWPQRVRPRPASRMLRPYGPRPVVRELLLREGNDDPRTQ VATHR" gene 1762947..1763393 /locus_tag="Rv1558" /db_xref="GeneID:886363" CDS 1762947..1763393 /locus_tag="Rv1558" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1558, (MTCY48.07c), len: 148 aa. Conserved hypothetical protein, similar to other Mycobacterial tuberculosis proteins e.g. P71854|MTCY03C7.09c|Rv3547 (151 aa), FASTA scores opt: 330, E(): 9.1e-17, (39.7% identity in 151 aa overlap); also Q11057|Rv1261c (149 aa), and O53328|Rv3178 (119 aa). Similar also to AF072709|AF072709_5 Hypothetical protein with a new amplifiable element AUD4 from Streptomyces lividans (149 aa), FASTA scores: opt: 695, E(): 0, (69.1% identity in 149 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216074.1" /db_xref="GI:15608696" /db_xref="GOA:P64875" /db_xref="UniProtKB/Swiss-Prot:P64875" /db_xref="GeneID:886363" /translation="MPLSGEYAPSPLDWSREQADTYMKSGGTEGTQLQGKPVILLTTV GAKTGKLRKTPLMRVEHDGQYAIVASLGGAPKNPVWYHNVVKNPRVELQDGTVTGDYD AREVFGDEKAIWWQRAVAVWPDYASYQTKTDRQIPVFVLTPVRAGG" gene 1763428..1764717 /gene="ilvA" /locus_tag="Rv1559" /db_xref="GeneID:886365" CDS 1763428..1764717 /gene="ilvA" /locus_tag="Rv1559" /EC_number="4.3.1.19" /function="INVOLVED IN ISOLEUCINE BIOSYNTHESIS (FIRST STEP). CATALYZES THE FORMATION OF ALPHA-KETOBUTYRATE FROM THREONINE IN A TWO STEP REACTION. THE FIRST STEP IS A DEHYDRATION OF THREONINE, FOLLOWED BY REHYDRATION AND LIBERATION OF AMMONIA [CATALYTIC ACTIVITY : L-THREONINE + H(2)O = 2-OXOBUTANOATE + NH(3) + H(2)O]." /note="catalyzes the formation of 2-oxobutanoate from L-threonine; biosynthetic" /codon_start=1 /transl_table=11 /product="threonine dehydratase" /protein_id="NP_216075.1" /db_xref="GI:15608697" /db_xref="GOA:P66897" /db_xref="UniProtKB/Swiss-Prot:P66897" /db_xref="GeneID:886365" /translation="MSAELSQSPSSSPLFSLSGADIDRAAKRIAPVVTPTPLQPSDRL SAITGATVYLKREDLQTVRSYKLRGAYNLLVQLSDEELAAGVVCSSAGNHAQGFAYAC RCLGVHGRVYVPAKTPKQKRDRIRYHGGEFIDLIVGGSTYDLAAAAALEDVERTGATL VPPFDDLRTIAGQGTIAVEVLGQLEDEPDLVVVPVGGGGCIAGITTYLAERTTNTAVL GVEPAGAAAMMAALAAGEPVTLDHVDQFVDGAAVNRAGTLTYAALAAAGDMVSLTTVD EGAVCTAMLDLYQNEGIIAEPAGALSVAGLLEADIEPGSTVVCLISGGNNDVSRYGEV LERSLVHLGLKHYFLVDFPQEPGALRRFLDDVLGPNDDITLFEYVKRNNRETGEALVG IELGSAADLDGLLARMRATDIHVEALEPGSPAYRYLL" misc_feature 1763596..1763637 /gene="ilvA" /locus_tag="Rv1559" /note="PS00165 Serine/threonine dehydratases pyridoxal-phosphate attachment site" gene 1764755..1764973 /locus_tag="Rv1560" /db_xref="GeneID:886359" CDS 1764755..1764973 /locus_tag="Rv1560" /function="UNKNOWN" /note="Rv1560, (MTCY48.05c), len: 72 aa. Conserved hypothetical protein, part of a Mycobacterial tuberculosis family of proteins e.g. Q10848|Rv2009|MTCY39.08c (80 aa), FASTA score: (54.4% identity in 68 aa overlap); Q10799|Rv2871|MTCY274.02 (85 aa); O50456|Rv1241|MTV006.13 (86 aa), O06243|Rv2132|MTCY270.36C (76 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216076.1" /db_xref="GI:15608698" /db_xref="UniProtKB/Swiss-Prot:P64877" /db_xref="GeneID:886359" /translation="MYRWCMSRTNIDIDDELAAEVMRRFGLTTKRAAVDLALRRLVGS PLSREFLLGLEGVGWEGDLDDLRSDRPD" gene 1764979..1765383 /locus_tag="Rv1561" /db_xref="GeneID:886361" CDS 1764979..1765383 /locus_tag="Rv1561" /function="UNKNOWN" /note="Rv1561, (MTCY48.04c), len: 134 aa. Conserved hypothetical protein, similar to others from Mycobacterium tuberculosis e.g. Q10847|Rv2010|MTCY39.07c (132 aa), FASTA scores: (37.0% identity in 127 aa overlap); and O06566|Rv1114|MTCY22G8.03 (124 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216077.1" /db_xref="GI:15608699" /db_xref="UniProtKB/Swiss-Prot:P64879" /db_xref="GeneID:886361" /translation="MILIDTSAWVEYFRATGSIAAVEVRRLLSEEAARIAMCEPIAME ILSGALDDNTHTTLERLVNGLPSLNVDDAIDFRAAAGIYRAARRAGETVRSINDCLIA ALAIRHGARIVHRDADFDVIARITNLQAASFR" gene complement(1765400..1767142) /gene="treZ" /locus_tag="Rv1562c" /db_xref="GeneID:886355" CDS complement(1765400..1767142) /gene="treZ" /locus_tag="Rv1562c" /function="INVOLVED IN TREHALOSE BIOSYNTHESIS (PROTECTIVE EFFECT). Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway). Seems to have additional alpha-glucosidase activity." /experiment="experimental evidence, no additional details recorded" /note="Rv1562c, (MTCY48.03), len: 580 aa. treZ (previously called glgZ), Maltooligosyltrehalose trehalohydrolase, confirmed biochemically (see citation below). Similar to Q44316|D63343 TREZ MALTOOLIGOSYL TREHALOSE TREHALOHYDROLASE from ARTHROBACTER SP (598 aa), FASTA scores: opt: 2071, E(): 0, (52.2% identity in 582 aa overlap); also similar to 1,4-alpha-glucan branching enzymes e.g. GLGB_BACST|P30538 (639 aa), FASTA scores: opt: 313, E(): 3.8e-13, (27.5% identity in 462 aa overlap). Also similar to Mycobacterium tuberculosis proteins Rv1326c|glgB, and Rv1563c treY (previously glgY).; glgZ" /codon_start=1 /transl_table=11 /product="maltooligosyltrehalose trehalohydrolase TreZ" /protein_id="YP_177819.1" /db_xref="GI:57116884" /db_xref="GOA:Q10769" /db_xref="UniProtKB/Swiss-Prot:Q10769" /db_xref="GeneID:886355" /translation="MPEFRVWAPKPALVRLDVNGAVHAMTRSADGWWHTTVAAPADAR YGYLLDDDPTVLPDPRSARQPDGVHARSQRWEPPGQFGAARTDTGWPGRSVEGAVIYE LHIGTFTTAGTFDAAIEKLDYLVDLGIDFVELMPVNSFAGTRGWGYDGVLWYSVHEPY GGPDGLVRFIDACHARRLGVLIDAVFNHLGPSGNYLPRFGPYLSSASNPWGDGINIAG ADSDEVRHYIIDCALRWMRDFHADGLRLDAVHALVDTTAVHVLEELANATRWLSGQLG RPLSLIAETDRNDPRLITRPSHGGYGITAQWNDDIHHAIHTAVSGERQGYYADFGSLA TLAYTLRNGYFHAGTYSSFRRRRHGRALDTSAIPATRLLAYTCTHDQVGNRALGDRPS QYLTGGQLAIKAALTLGSPYTAMLFMGEEWGASSPFQFFCSHPEPELAHSTVAGRKEE FAEHGWAADDIPDPQDPQTFQRCKLNWAEAGSGEHARLHRFYRDLIALRHNEADLADP WLDHLMVDYDEQQRWVVMRRGQLMIACNLGAEPTCVPVSGELVLAWESPIIGDNSTEL AAYSLAILRAAEPA" gene complement(1767135..1769432) /gene="treY" /locus_tag="Rv1563c" /db_xref="GeneID:886357" CDS complement(1767135..1769432) /gene="treY" /locus_tag="Rv1563c" /function="INVOLVED IN TREHALOSE BIOSYNTHESIS (PROTECTIVE EFFECT). Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway)." /experiment="experimental evidence, no additional details recorded" /note="Rv1563c, (MTCY48.02), len: 765 aa. treY (previously called glgY), maltooligosyl trehalose synthase, confirmed biochemically (see citation below). Strong similarity to Q44315|63343 TREY MALTOOLIGOSYL TREHALOSE SYNTHASE from ARTHROBACTER SP (775 aa), fasta scores: opt: 1953, E(): 0; (46.0% identity in 789 aa overlap). Some similarity to alpha-amylases and to MTCY48.03 (30.2% identity in 215 aa overlap). May catalyse conversion of maltodextrins to maltooligosyl trehaloses. Also similar to Mycobacterium tuberculosis glgB (Rv1326c), treZ (Rv1562c).; glgY" /codon_start=1 /transl_table=11 /product="maltooligosyltrehalose synthase TreY" /protein_id="YP_177820.1" /db_xref="GI:57116885" /db_xref="GOA:Q10768" /db_xref="UniProtKB/Swiss-Prot:Q10768" /db_xref="GeneID:886357" /translation="MAFPVISTYRVQMRGRSNGFGFTFADAENLLDYLDDLGVSHLYL SPILTAVGGSTHGYDVTDPTTVSPELGGSDGLARLSAAARSRGMGLIVDIVPSHVGVG KPEQNAWWWDVLKFGRSSAYAEFFDIDWELGDGRIILPLLGSDSDVANLRVDGDLLRL GDLALPVAPGSGDGTGPAVHDRQHYRLVGWRHGLCGYRRFFSITSLAGLRQEDRAVFD ASHAEVARWFTEGLVDGVRVDHLDGLSDPSGYLAQLRELLGPNAWIVVEKILAVDEAL EPTLPVDGSTGYDVLREIGGVLVDPQGESPLTALVESAGVDYQEMPAMLADLKVHAAV HTLASELRRLRRCIAAAAGADHPLLPAAVAALLRHIGRYRCDYPGQAAVLPCALAETH STTPQLAPGLQLIAAAVARGGEPAVRLQQLCGAVSAKAVEDCMFYRDARLVSLNEVGG EPRRFGVGAAEFHHRAATRARLWPRSMTTLSTHDTKRGEDVRARIGVLSQVPWLWAKF IGHAQAIAPAPDAVTGQFLWQNVFGVWPVSGEVSAALRGRLHTYAEKAIREAAWHTSW HNPNRAFEDDVHGWLDLVLDGPLASELTGLVAHLNSHAESDALAAKLLALTVPGVPDV YQGSELWDDSLVDPDNRRPVDYGTRRVALKALQHPKIRVLAAALRLRRTHPESFLGGA YHPVFAAGPAADHVVAFRRGDDILVAVTRWTVRLQQTGWDHTVLPLPDGSWTDALTGF TASGHTPAVELFADLPVVLLVRDNA" gene complement(1769436..1771601) /gene="treX" /locus_tag="Rv1564c" /db_xref="GeneID:886353" CDS complement(1769436..1771601) /gene="treX" /locus_tag="Rv1564c" /function="POSSIBLY INVOLVED IN TREHALOSE BIOSYNTHESIS (PROTECTIVE EFFECT). Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway)." /note="Rv1564c, (MTCY48.01), len: 721 aa. Probable treX (previously called glgX), Maltooligosyltrehalose synthase. Strong similarity to D83245|g1890053 treX, glycogen debranching enzyme (glgX) from Sulfolobus acidocaldarius (713 aa), FASTA score: opt: 2396, E(): 0, (48.4% identity in 709 aa overlap); similar to GLGX_HAEIN|P45178 glycogen operon protein glgx (659 aa), FASTA scores: opt: 1512, E(): 0, (42.3% identity in 645 aa overlap).; glgX" /codon_start=1 /transl_table=11 /product="maltooligosyltrehalose synthase TreX" /protein_id="YP_177821.1" /db_xref="GI:57116886" /db_xref="GOA:Q10767" /db_xref="UniProtKB/Swiss-Prot:Q10767" /db_xref="GeneID:886353" /translation="MSSNNAGESDGTGPALPTVWPGNAYPLGATYDGAGTNFSLFSEI AEKVELCLIDEDGVESRIPLDEVDGYVWHAYLPNITPGQRYGFRVHGPFDPAAGHRCD PSKLLLDPYGKSFHGDFTFGQALYSYDVNAVDPDSTPPMVDSLGHTMTSVVINPFFDW AYDRSPRTPYHETVIYEAHVKGMTQTHPSIPPELRGTYAGLAHPVIIDHLNELNVTAV ELMPVHQFLHDSRLLDLGLRNYWGYNTFGFFAPHHQYASTRQAGSAVAEFKTMVRSLH EAGIEVILDVVYNHTAEGNHLGPTINFRGIDNTAYYRLMDHDLRFYKDFTGTGNSLNA RHPHTLQLIMDSLRYWVIEMHVDGFRFDLASTLARELHDVDRLSAFFDLVQQDPVVSQ VKLIAEPWDVGEGGYQVGNFPGLWTEWNGKYRDTVRDYWRGEPATLGEFASRLTGSSD LYEATGRRPSASINFVTAHDGFTLNDLVSYNDKHNEANGENNRDGESYNRSWNCGVEG PTDDPDILALRARQMRNMWATLMVSQGTPMIAHGDEIGRTQYGNNNVYCQDSELSWMD WSLVDKNADLLAFARKATTLRKNHKVFRRRRFFEGEPIRSGDEVRDIAWLTPSGREMT HEDWGRGFDRCVAVFLNGEAITAPDARGERVVDDSFLLCFNAHDHDVEFVMPHDGYAQ QWTGELDTNDPVGDIDLTVTATDTFSVPARSLLVLRKTL" gene complement(1771640..1773829) /locus_tag="Rv1565c" /db_xref="GeneID:886351" CDS complement(1771640..1773829) /locus_tag="Rv1565c" /function="UNKNOWN" /note="Rv1565c, (MTCY336.38), len: 729 aa. Conserved hypothetical membrane protein, some similarity to O05402 HYPOTHETICAL 72.2 kDa PROTEIN from Bacillus subtilis (634 aa), FASTA results: opt: 384, E(): 4.8e-17, (29.1% identity in 378 aa overlap); and to Y392_HAEIN|P43993 hypothetical protein hi0392 from H. influenzae (245 aa), FASTA results: opt: 265, E(): 5.5e-10, (28.3% identity in 247 aa overlap). C-terminal half equivalent to AL049478|MLCL458_19 (274 aa) (78.5% identity in 274 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv0111, Rv0228, Rv1254, Rv0517. N-terminal half hydrophobic. TBparse score is 0.930." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216081.1" /db_xref="GI:15608703" /db_xref="GOA:O06625" /db_xref="UniProtKB/TrEMBL:O06625" /db_xref="GeneID:886351" /translation="MLTLSPPRPPALTPEPALPPVTMGTRTTGFYRHDLDGLRGVAIA LVAVFHVWFGRVSGGVDVFLALSGFFFGGKILRAALNPDLSLSPIAEVIRLIRRLLPA LVVVLAGCALLTIAIQPQTRWEAFANQSLASLGYYQNWELASTVSNYLRAGEAVSPLQ HIWSMSVQGQFYLAFLLLVAGCAYLLRRLFRGPRAPYLRTMFVVLLSTLTLASFIYAI VAHHAYQATAYYNTFARAWELLAGALVGAVVPHVRWPMWLRTAVATAALAAILSCGAL IDGVKEFPGPWALVPVGATMLMILAGANRQGHPGTRDRLPLPNRLLATAPLVALGAMA YSWYLWHWPLLIFWLSYTGHRHANFVEGAAVLLVSGLLAYLTTRLVEDPLRYRAPAGV RSPAAVPPIPWRLRLRRPTIVLGSVVALLGVALTATSFTWREHVIVQRAAGKELSGLS SRDYPGARALIDHVRVPKLRMRPTVLEVRHDLPTSTKDGCISDFVNPAIINCTYGDVD APRTIALAGGSHAEHWLTALDLLGRMHHFKVVTYLKMGCPLSTEEVPLIMGNNAPYPQ CHQWVQAAMAKLVADHPDYVFTTSTRPWNIKPGDVMPATYVGIWQTFADNNIPVLAMR DTPWLVKDGQPFIPADCLAKGGNPQSCGIARSKVLVDRNPTLDFVARFPLLKPLDMSD AICRTDTCRAVEGNVLVYRDSHHLTPTYMRTMTSELGRQIAANTDWW" gene complement(1773928..1774620) /locus_tag="Rv1566c" /db_xref="GeneID:886347" CDS complement(1773928..1774620) /locus_tag="Rv1566c" /function="UNKNOWN" /note="Rv1566c, (MTCY336.37), len: 230 aa. Possible inv protein, probably exported as has QQAPV repeats at C-terminus. Similar to Q49634 inv protein from Mycobacterium leprae (246 aa), FASTA scores: opt: 957, E(): 0, (70.0% identity in 207 aa overlap); also to putative invasins 1,2 (O07390, O07391) from Mycobacterium avium. Slightly similar to C-terminus of P60_LISMO|P21171 Listeria invasion-associated protein p60 precursor. Also similar to Mycobacterium tuberculosis p60 homologues Rv1477, Rv1478, Rv0024, Rv2190c. TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="inv protein" /protein_id="NP_216082.1" /db_xref="GI:15608704" /db_xref="UniProtKB/TrEMBL:O06624" /db_xref="GeneID:886347" /translation="MKRSMKSGSFAIGLAMMLAPMVAAPGLAAADPATRPVDYQQITD VVIARGLSQRGVPFSWAGGGISGPTRGTGTGINTVGFDASGLIQYAYAGAGLKLPRSS GQMYKVGQKVLPQQARKGDLIFYGPEGTQSVALYLGKGQMLEVGDVVQVSPVRTNGMT PYLVRVLGTQPTPVQQAPVQPAPVQQAPVQQAPVQQAPVQQAPVQQAPVQQAPVQQAP VQPPPFGTARSR" gene complement(1774860..1775144) /locus_tag="Rv1567c" /db_xref="GeneID:886349" CDS complement(1774860..1775144) /locus_tag="Rv1567c" /function="UNKNOWN" /note="Rv1567c, (MTCY336.36), len: 94 aa. Probable membrane protein. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216083.1" /db_xref="GI:15608705" /db_xref="UniProtKB/TrEMBL:O06623" /db_xref="GeneID:886349" /translation="MVTMTSWPSRLFAFTDNVCPPDACPLVPFGVNYYIYPVMWGGIG AAIATAVIGPFVSMLKGWYMSFWPIISIAVITVTSIAGYAIAGFSERYWH" gene 1775392..1776705 /gene="bioA" /locus_tag="Rv1568" /db_xref="GeneID:886343" CDS 1775392..1776705 /gene="bioA" /locus_tag="Rv1568" /EC_number="2.6.1.62" /function="INVOLVED IN BIOCONVERSION OF PIMELATE INTO DETHIOBIOTIN [CATALYTIC ACTIVITY : S-ADENOSYL-L-METHIONINE + 8-AMINO-7- OXONONANOATE = S-ADENOSYL-4-METHYLTHIO-2-OXOBUTANOATE + 7,8- DIAMINONONANOATE]. SUPPOSED INVOLVED IN STATIONARY-PHASE SURVIVAL." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of S-adenosyl-4-methylthionine-2-oxobutanoate and 7,8-diaminononanoate from S-adenosyl-L-methionine and 8-amino-7-oxononanoate" /codon_start=1 /transl_table=11 /product="adenosylmethionine--8-amino-7-oxononanoate transaminase" /protein_id="NP_216084.1" /db_xref="GI:15608706" /db_xref="GOA:O06622" /db_xref="UniProtKB/Swiss-Prot:O06622" /db_xref="GeneID:886343" /translation="MAAATGGLTPEQIIAVDGAHLWHPYSSIGREAVSPVVAVAAHGA WLTLIRDGQPIEVLDAMSSWWTAIHGHGHPALDQALTTQLRVMNHVMFGGLTHEPAAR LAKLLVDITPAGLDTVFFSDSGSVSVEVAAKMALQYWRGRGLPGKRRLMTWRGGYHGD TFLAMSICDPHGGMHSLWTDVLAAQVFAPQVPRDYDPAYSAAFEAQLAQHAGELAAVV VEPVVQGAGGMRFHDPRYLHDLRDICRRYEVLLIFDEIATGFGRTGALFAADHAGVSP DIMCVGKALTGGYLSLAATLCTADVAHTISAGAAGALMHGPTFMANPLACAVSVASVE LLLGQDWRTRITELAAGLTAGLDTARALPAVTDVRVCGAIGVIECDRPVDLAVATPAA LDRGVWLRPFRNLVYAMPPYICTPAEITQITSAMVEVARLVGSLP" misc_feature 1776142..1776255 /gene="bioA" /locus_tag="Rv1568" /note="PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site" gene 1776702..1777862 /gene="bioF1" /locus_tag="Rv1569" /db_xref="GeneID:886345" CDS 1776702..1777862 /gene="bioF1" /locus_tag="Rv1569" /EC_number="2.3.1.47" /function="INVOLVED IN BIOTIN BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 6-CARBOXYHEXANOYL-CoA + L-ALANINE = 8-AMINO-7-OXONONANOATE + CoA + CO2]." /note="catalyzes the formation of 8-amino-7-oxononanoate from 6-carboxyhexanoyl-CoA and L-alanine" /codon_start=1 /transl_table=11 /product="8-amino-7-oxononanoate synthase" /protein_id="YP_177822.1" /db_xref="GI:57116887" /db_xref="GOA:O06621" /db_xref="UniProtKB/Swiss-Prot:O06621" /db_xref="GeneID:886345" /translation="MKAATQARIDDSPLAWLDAVQRQRHEAGLRRCLRPRPAVATELD LASNDYLGLSRHPAVIDGGVQALRIWGAGATGSRLVTGDTKLHQQFEAELAEFVGAAA GLLFSSGYTANLGAVVGLSGPGSLLVSDARSHASLVDACRLSRARVVVTPHRDVDAVD AALRSRDEQRAVVVTDSVFSADGSLAPVRELLEVCRRHGALLLVDEAHGLGVRGGGRG LLYELGLAGAPDVVMTTTLSKALGSQGGVVLGPTPVRAHLIDAARPFIFDTGLAPAAV GAARAALRVLQAEPWRPQAVLNHAGELARMCGVAAVPDSAMVSVILGEPESAVAAAAA CLDAGVKVGCFRPPTVPAGTSRLRLTARASLNAGELELARRVLTDVLAVARR" misc_feature 1777407..1777436 /gene="bioF1" /locus_tag="Rv1569" /note="PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site" gene 1777859..1778539 /gene="bioD" /locus_tag="Rv1570" /db_xref="GeneID:886338" CDS 1777859..1778539 /gene="bioD" /locus_tag="Rv1570" /EC_number="6.3.3.3" /function="INVOLVED IN BIOCONVERSION OF PIMELATE INTO DETHIOBIOTIN [CATALYTIC ACTIVITY : ATP + 7,8-DIAMINONONANOATE + CO(2) = ADP + PHOSPHATE + DETHIOBIOTIN]" /note="DTB synthetase; dethiobiotin synthase; involved in production of dethiobiotin from ATP and 7,8-diaminononanoate and carbon dioxide; contains magnesium" /codon_start=1 /transl_table=11 /product="dithiobiotin synthetase" /protein_id="NP_216086.1" /db_xref="GI:15608708" /db_xref="GOA:O06620" /db_xref="UniProtKB/Swiss-Prot:O06620" /db_xref="GeneID:886338" /translation="MTILVVTGTGTGVGKTVVCAALASAARQAGIDVAVCKPVQTGTA RGDDDLAEVGRLAGVTQLAGLARYPQPMAPAAAAEHAGMALPARDQIVRLIADLDRPG RLTLVEGAGGLLVELAEPGVTLRDVAVDVAAAALVVVTADLGTLNHTKLTLEALAAQQ VSCAGLVIGSWPDPPGLVAASNRSALARIAMVRAALPAGAASLDAGDFAAMSAAAFDR NWVAGLVG" gene 1778539..1779048 /locus_tag="Rv1571" /db_xref="GeneID:886341" CDS 1778539..1779048 /locus_tag="Rv1571" /function="UNKNOWN" /note="Rv1571, (MTCY336.32c), len: 169 aa. Conserved hypothetical protein, similar at N-terminal region to Q49625|LEPB1170_C3_227 hypothetical protein from Mycobacterium leprae (104 aa), FASTA results: opt: 473, E(): 3.9e-24, (74.5% identity in 102 aa overlap). Identical to O06619|AF041819|AF041819_6 Mycobacterium bovis BCG (169 aa). TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216087.1" /db_xref="GI:15608709" /db_xref="UniProtKB/TrEMBL:O06619" /db_xref="GeneID:886341" /translation="MVHSIELVFDSDTEAAIRRIWAGLAAAGIPSQAPASRPHVSLAV AERIAPEVDEPLGAVARRLPLDCVIGAPVLFGRANVVFTRLVVPTSELLALHAEVHRL CGPHLAPAPMANSLPGQWTAHVTLARRVGGHQLGRALRIAGRPSRIDGRFAGLRRWDG NTRAEYLLG" gene complement(1779194..1779298) /locus_tag="Rv1572c" /db_xref="GeneID:886333" CDS complement(1779194..1779298) /locus_tag="Rv1572c" /function="UNKNOWN" /note="Rv1572c, (MTCY336.31B), len: 34 aa. Partial ORF, part of REP13E12 repeat element; 3' end of Rv1587c (MTCY336.17) after phage-like element (see citation below). Similar to C-terminal ends of other REP13E12 repeat elements e.g. Rv1148, Rv1945, Rv3467, etc. Length extended since first submission (+7 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216088.2" /db_xref="GI:57116888" /db_xref="UniProtKB/TrEMBL:O06618" /db_xref="GeneID:886333" /translation="MECSSAVHGQPRTNTFHHHEKLLRHNDEDNHDDP" repeat_region complement(1779266..1779277) /note="12 bp direct repeat 1, ccacggccaacc, flanking phage-like element, second site at 1788514..1788525" gene 1779314..1779724 /locus_tag="Rv1573" /db_xref="GeneID:886337" CDS 1779314..1779724 /locus_tag="Rv1573" /function="UNKNOWN" /note="Rv1573, (MTCY336.31c), len: 136 aa. Probable phiRv1 phage protein (see citation below). TBparse score is 0.872." /codon_start=1 /transl_table=11 /product="phiRV1 phage protein" /protein_id="NP_216089.1" /db_xref="GI:15608711" /db_xref="UniProtKB/TrEMBL:O06617" /db_xref="GeneID:886337" /translation="MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLE VREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAINR QLGLAGDDEPDGDDTPPWSRMIGLGGGSPAEDER" gene 1779930..1780241 /locus_tag="Rv1574" /db_xref="GeneID:886331" CDS 1779930..1780241 /locus_tag="Rv1574" /function="UNKNOWN" /note="Rv1574, (MTCY336.30), len: 103 aa. Probable phiRV1 phage related protein (see citation below); some similarity to N-terminus of Rv1575|MTCY441.17 Probable phiRV1 phage protein (166 aa), E(): 1.5e-06; and Rv2647|MTCY336.29c Probable phiRV2 phage protein, E(): 3.5e-05. Helix turn helix motif present at aa 14-35 (+3.61 SD)." /codon_start=1 /transl_table=11 /product="phiRV1 phage related protein" /protein_id="NP_216090.1" /db_xref="GI:15608712" /db_xref="UniProtKB/TrEMBL:O06616" /db_xref="GeneID:886331" /translation="MGYKPESERHSTKTDTAIGAALGISAGTYRRLKRIDNATHSDDK EIRRFAEKQMAPLVAGSPSWNARKPRSANARVVASVHRSPMPALVPWNQSRLSATLTR R" repeat_region complement(1779959..1780047) /note="89 bp direct repeat 2, first copy at 1780485..1780573, GGGTTGCGTTGTCGATTCGTTTGAGCCGCCGGTAGGTGCCGGCGGAGATGCCGAGGG CTGCGCCGATAGCAGTGTCTGTTTTCGTCGAA" gene 1780199..1780699 /locus_tag="Rv1575" /db_xref="GeneID:886335" CDS 1780199..1780699 /locus_tag="Rv1575" /function="UNKNOWN" /note="Rv1575, (MTCY336.29c), len: 166 aa. Probable phiRV1 phage protein (see citation below), showing similarity in N-terminal part to Rv1574|MTCY336.30c Probable phiRV1 phage protein (103 aa), FASTA score: opt: 375, E(): 3.8e-16, (60.2% identity in 103 aa overlap); and Rv2647 Probable phiRV2 phage protein. Start changed since first submission (+49 aa)." /codon_start=1 /transl_table=11 /product="phiRV1 phage protein" /protein_id="NP_216091.2" /db_xref="GI:57116889" /db_xref="UniProtKB/TrEMBL:O06615" /db_xref="GeneID:886335" /translation="MEPKPSQRHTDKEVGAALGISAGTYKRLKRIDNATRSDDKEIRL FAEKQMAPLAAGSPSWNGRKPSSGNRKAATMAARLDILAWGPWAPSQNRSVVRRKQTL LSAQPSASPPAPTGGSNESTTQPAASWRVGGPAPLSRGRPRLALSYLRGSLHLQNSKR VAHQHI" repeat_region complement(1780485..1780573) /note="89 bp direct repeat 1, second copy at 1779959..1780047, GGGTTGCGTTGTCGATTCGTTTGAGCCGCCGGTAGGTGCCGGCGGAGATGCCGAGGG CTGCGCCGATAGCAGTGTCTGTTTTCGTCGAA. Many repeats, both direct and inverted, in this region" gene complement(1780643..1782064) /locus_tag="Rv1576c" /db_xref="GeneID:886327" CDS complement(1780643..1782064) /locus_tag="Rv1576c" /function="UNKNOWN" /note="Rv1576c, (MTCY336.28), len: 473 aa. Probable phiRV1 phage protein (capsid subunit) (see citation below). Highly similar to hypothetical Mycobacterium tuberculosis protein Rv2650c|MTCY441.19 phiRV2 phage related protein, FASTA scores: opt: 2782, E(): 0, (89.1% identity in 468 aa overlap). TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="phiRV1 phage protein" /protein_id="NP_216092.1" /db_xref="GI:15608714" /db_xref="UniProtKB/TrEMBL:O06614" /db_xref="GeneID:886327" /translation="MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRH AEELRAEQRRRGREAEEALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDS CVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHT VWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVAR VVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGD AASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVA ADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVL EVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFA WFRVGSDVLVRNAFRVLKVETTA" gene complement(1782072..1782584) /locus_tag="Rv1577c" /db_xref="GeneID:886329" CDS complement(1782072..1782584) /locus_tag="Rv1577c" /function="UNKNOWN" /note="Rv1577c, (MTCY336.27), len: 170 aa. Probable phiRv1 phage protein (prohead protease) (see citation below). Highly similar to hypothetical protein Rv2651c|MTCY441.20c phiRV2 prohead protease, FASTA scores: E(): 0, (89.3% identity in 169 aa overlap). Some similarity to VP4_BPHK7|P49860 putative bacteriophage HK97 prohead protease (gp4) (225 aa), FASTA results: opt: 176, E(): 1.3e-05, (27.3% identity in 165 aa overlap). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216093.1" /db_xref="GI:15608715" /db_xref="UniProtKB/TrEMBL:O06613" /db_xref="GeneID:886329" /translation="MAELRSGEGRTVHGTIVPYNEATTVRDFDGEFQEMFAPGAFRRS IAERGHKLKLLVSHDARTRYPVGRAVELREEPHGLFGAFEIADTPDGDEALANVKAGV VDSFSVGFRPIRDRREGDVLVRVEAALLEVSLTGVPAYSGAQIAGVRAESLTVVSRST AEAWLSLLDW" gene complement(1782758..1783228) /locus_tag="Rv1578c" /db_xref="GeneID:886322" CDS complement(1782758..1783228) /locus_tag="Rv1578c" /function="UNKNOWN" /note="Rv1578c, (MTCY336.26), len: 156 aa. Probable phiRv1 phage protein (terminase) (see citation below), highly similar to Rv2652c|MTCY441.21c phiRV2 phage protein from Mycobacterium tuberculosis, FASTA scores: E(): 4.8e-22, (48.1% identity in 156 aa overlap). Also similar to X65555|ARP3COS_1 hypothetical protein (cos site) - actinophage RP3 (210 aa), FASTA scores: opt: 373, E(): 6.5e-17, (50.0% identity in 114 aa overlap). Contains MIP family signature (PS00221). TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216094.1" /db_xref="GI:15608716" /db_xref="UniProtKB/TrEMBL:O06612" /db_xref="GeneID:886322" /translation="MPRPPKPARLKLVEGRSPGRDSGGRKVPESPKFIRQAPDAPDWL DAEALAEWRRVAPTLERLDLLKPEDRALLSAYCETWSVYVAAVQRVRAEGLTITSPKS GVVHRNPAVTVAETARMHLLRLASEFGLTPAAEQRLAVAPGDDGDGLNPFAPDR" misc_feature complement(1782887..1782913) /locus_tag="Rv1578c" /note="PS00221 MIP family signature" gene complement(1783309..1783623) /locus_tag="Rv1579c" /db_xref="GeneID:886369" CDS complement(1783309..1783623) /locus_tag="Rv1579c" /function="UNKNOWN" /note="Rv1579c, (MTCY336.25), len: 104 aa. Probable phiRv1 phage protein (see citation below). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216095.1" /db_xref="GI:15608717" /db_xref="UniProtKB/TrEMBL:O06611" /db_xref="GeneID:886369" /translation="MTPINRPLTNDERQLMHELAVQVVCSQTGCSPDAAVEALESFAK DGTLILRGDTENAYLEAGGNVLVHADRDWLAFHASYPGNDPLRDARPIEQDDDQGAGS PS" gene complement(1783620..1783892) /locus_tag="Rv1580c" /db_xref="GeneID:886313" CDS complement(1783620..1783892) /locus_tag="Rv1580c" /function="UNKNOWN" /note="Rv1580c, (MTCY336.24), len: 90 aa. Probable phiRv1 phage protein (see citation below). TBparse score is 0.878." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216096.1" /db_xref="GI:15608718" /db_xref="UniProtKB/TrEMBL:O06610" /db_xref="GeneID:886313" /translation="MAETPDHAELRRRIADMAFNADVGMATCKRCGDAVPYIILPNLQ TGEPVMGVADNKWKRANCPVDVGKPCPFLIAEGVADSTDDTIEVDQ" gene complement(1783906..1784301) /locus_tag="Rv1581c" /db_xref="GeneID:886318" CDS complement(1783906..1784301) /locus_tag="Rv1581c" /function="UNKNOWN" /note="Rv1581c, (MTCY336.23), len: 131 aa. Probable phiRv1 phage protein (see citation below). TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216097.1" /db_xref="GI:15608719" /db_xref="UniProtKB/TrEMBL:O06609" /db_xref="GeneID:886318" /translation="MTAVAITPASGGRHSVRFAYDSAIVSLIKSTIPAYARSWSAHTR CWFIDADWTPLLAAELRYHGHTVTGPADPAQQQCTDWAKALFRAVGPQRTPAVYRALS KVLHPDAPTGCPILQQQLNAARTALTNPA" gene complement(1784497..1785912) /locus_tag="Rv1582c" /db_xref="GeneID:886311" CDS complement(1784497..1785912) /locus_tag="Rv1582c" /function="UNKNOWN" /note="Rv1582c, (MTCY336.22), len: 471 aa. Probable phiRv1 phage protein (see citation below). N-terminus is similar to C-terminus of Q38030 ORF9 Bacteriophage phi-C31 (519 aa), FASTA scores: opt: 331, E(): 6.5e-15, (28.5% identity in 235 aa overlap); and C-terminus to whole of Q38031 ORF10 of Bacteriophage phi-C31 (202 aa), FASTA scores: opt: 353, E(): 1e-16, (31.1% identity in 190 aa overlap). Also similar to part of AB016282|AB016282_42 Bacteriophage phi-105 (806 aa), FASTA scores: opt: 790, E(): 0, (32.7% identity in 459 aa overlap). Similarity to other phage proteins described as putative DNA-polymerase or DNA-primase. Also slightly similar to MTCY441.24c, FASTA scores: E(): 0.0055, (36.0% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216098.1" /db_xref="GI:15608720" /db_xref="UniProtKB/TrEMBL:O06608" /db_xref="GeneID:886311" /translation="MADIPYGTDYPDAPWIDRDGHVLIDDGGKPTQVHRGQARIAYRL AERYQDKLLHVAGIGWHSWDGRRWAADDRGEAKRAVLAELRQALSDSLNDKELRADVR KCESASGVAGVLDLAAALVPFAATVADLDSDPHLLNVANGTLDLHTLKLRPHAPADRI TKICRGAYQSDTESPLWQAFLTRVLPDEGVRGFVQRLAGVGLLGTVREHVLAILIGVG ANGKSVFDKAIRYALGDYACTAEPDLFMHRENAHPTGEMDLRGVRWVAVSESEKDRRL AESTIKRLTGGDTIRARKMRQDFVEFTPSHTPLLITNHLPRVPGDDTAIWRRIRVVPF EVVIPADEQDRELDARLQLEADSILSWAVAGWSDYQRIGLSQPDAVLAATSNYREDSD TIKRFIDDECVTSSPVLKATTTHLFEAWQRWRVQEGVPEISRKAFGQSLDTHGYPVTD KARDGRWRAGIAVRGADDFDD" gene complement(1785912..1786310) /locus_tag="Rv1583c" /db_xref="GeneID:886315" CDS complement(1785912..1786310) /locus_tag="Rv1583c" /function="UNKNOWN" /note="Rv1583c, (MTCY336.21), len: 132 aa. Probable phiRv1 phage protein (see citation below), highly similar to Rv2656c|MTCY441.25c phiRV2 phage protein (130 aa), FASTA score: E(): 1.3e-33, (81.7% identity in 131 aa overlap). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216099.1" /db_xref="GI:15608721" /db_xref="UniProtKB/Swiss-Prot:O06607" /db_xref="GeneID:886315" /translation="MTAGAGGSPPTRRCPATEDRAPATVATPSSADPTASRAVSWWSV HEHVAPVLDAAGSWPMAGTPAWRQLDDADPRKWAAICDAARHWALRVETCQEAMAQAS RDVSAAADWPGIAREIVRRRGVYIPRAGVA" gene complement(1786307..1786528) /locus_tag="Rv1584c" /db_xref="GeneID:886307" CDS complement(1786307..1786528) /locus_tag="Rv1584c" /function="UNKNOWN" /note="Rv1584c, (MTCY336.20), len: 73 aa. Possible phiRv1 phage protein (putative excisionase) (see citation below). TBparse score is 0.883." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216100.1" /db_xref="GI:15608722" /db_xref="UniProtKB/TrEMBL:O06606" /db_xref="GeneID:886307" /translation="MSTIYHHRGRVAALSRSRASDDPEFIAAKTDLVAANIADYLIRT LAAAPPLTDEQRTRLAELLRPVRRSGGAR" gene complement(1786584..1787099) /locus_tag="Rv1585c" /db_xref="GeneID:886309" CDS complement(1786584..1787099) /locus_tag="Rv1585c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1585c, (MTCY336.19), len: 171 aa. Possible phage phiRv1 protein (see Hatfull 2000). TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="phiRv1 phage protein" /protein_id="NP_216101.1" /db_xref="GI:15608723" /db_xref="UniProtKB/TrEMBL:O06605" /db_xref="GeneID:886309" /translation="MSRHHNIVIVCDHGRKGDGRIEHERCDLVAPIIWVDETQGWLPQ APAVATLLDDDNQPRAVIGLPPNESRLRPEMRRDGWVRLHWEFACLRYGAAGVRTCEQ RPVRVRNGDLQTLCENVPRLLTGLAGNPDYAPGFAVQSDAVVVAMWLWRTLCESDTPN KLRATPTRGSC" gene complement(1787096..1788505) /locus_tag="Rv1586c" /db_xref="GeneID:886305" CDS complement(1787096..1788505) /locus_tag="Rv1586c" /function="integration of phiRv1 into chromosome." /note="Rv1586c, (MTCY336.18), len: 469 aa. Probable phiRv1 integrase, possibly member of the serine family of recombinases (see citation below), similar to several bacteriophage integrases e.g. Q37839 ORF469 PROTEIN from Bacteriophage R4 (469 aa), FASTA scores: opt: 623, E(): 1.6e-29, (31.1% identity in 482 aa overlap); and Bacteriophage TP901-1." /codon_start=1 /transl_table=11 /product="phiRv1 integrase" /protein_id="NP_216102.1" /db_xref="GI:15608724" /db_xref="GOA:O06604" /db_xref="UniProtKB/TrEMBL:O06604" /db_xref="GeneID:886305" /translation="MRYTTPVRAAVYLRISEDRSGEQLGVARQREDCLKLCGQRKWVP VEYLDNDVSASTGKRRPAYEQMLADITAGKIAAVVAWDLDRLHRRPIELEAFMSLADE KRLALATVAGDVDLATPQGRLVARLKGSVAAHETEHKKARQRRAARQKAERGHPNWSK AFGYLPGPNGPEPDPRTAPLVKQAYADILAGASLGDVCRQWNDAGAFTITGRPWTTTT LSKFLRKPRNAGLRAYKGARYGPVDRDAIVGKAQWSPLVDEATFWAAQAVLDAPGRAP GRKSVRRHLLTGLAGCGKCGNHLAGSYRTDGQVVYVCKACHGVAILADNIEPILYHIV AERLAMPDAVDLLRREIHDAAEAETIRLELETLYGELDRLAVERAEGLLTARQVKIST DIVNAKITKLQARQQDQERLRVFDGIPLGTPQVAGMIAELSPDRFRAVLDVLAEVVVQ PVGKSGRIFNPERVQVNWR" gene complement(1788162..1789163) /locus_tag="Rv1587c" /db_xref="GeneID:886303" CDS complement(1788162..1789163) /locus_tag="Rv1587c" /function="UNKNOWN" /note="Rv1587c, (MTCY336.17), len: 333 aa. Partial REP13E12 repeat protein (see citation below), nearly identical (but has been interrupted by phiRv1 prophage) to Q50655|MTCY251.13c|Rv0094c HYPOTHETICAL 34.6 kDa PROTEIN from M. tuberculosis (317 aa), FASTA results: opt: 1511, E(): 1.1e-84, (97.75% identity in 224 aa overlap). Codon usage suggests that translation may involve frameshifting of Rv1588c mRNA in poly_C stretch into reading frame of Rv1587c. 3' end found in Rv1572c. Lenght extended since first submission (+115 aa)." /codon_start=1 /transl_table=11 /product="REP13E12 repeat-containing protein" /protein_id="NP_216103.2" /db_xref="GI:57116890" /db_xref="UniProtKB/TrEMBL:O06603" /db_xref="GeneID:886303" /translation="MLAKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGL LAGLRALIASGKLGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMTSH AHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTKPGCDAPAYHS QAHHVTAWTSTGRTDITELTLACGPDNRLAEKGWTTHNNTHGHTEWLPPPHLDHGQPW TCEIHYTCACCCLPPNLRRPLRRTARRGPPTRGLPKAVRAAKMGARRVPRQRRQRINR QAPPRLRADVGRHHRRQDRRRGGLGPGPAPSPSHRAGSLHVISRREAAGPGHRRRRR" repeat_region complement(1788514..1789811) /note="REP-5, len: 1298 bp. REP336, member of REP13E12 family.; REP-5" /rpt_type=DIRECT repeat_region complement(1788514..1788525) /note="12 bp direct repeat 2, ccacggccaacc, flanking phage-like element, first site at 1779266..1779277" gene complement(1789168..1789836) /locus_tag="Rv1588c" /db_xref="GeneID:886325" CDS complement(1789168..1789836) /locus_tag="Rv1588c" /function="UNKNOWN" /note="Rv1588c, (MTCY336.16), len: 222 aa. Partial REP13E12 repeat protein (see citation below), nearly identical to ORF's in other Rep13E12 repeats, including Rv0095c|MTCY251.14c|Y05E_MYCTU|Q10891 hypothetical 15.4 kd protein cy251.14 from Mycobacterium tuberculosis (136 aa), FASTA results: opt: 613, E(): 9.9e-29, (86.5% identity in 111 aa overlap)." /codon_start=1 /transl_table=11 /product="REP13E12 repeat-containing protein" /protein_id="NP_216104.1" /db_xref="GI:15608726" /db_xref="UniProtKB/Swiss-Prot:O06602" /db_xref="GeneID:886325" /translation="MLANSREELVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLE CLVRRLPAVGHALINQLDAQASEEELGGTLCCALANRLRITKPDAARRIADAADLGPR RALTGEPLAPQLTATATAQRQGLIGEAHVKVIRALFRPPARRGGCVHPPGRRSRPGRQ SRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPEQPAIRRHVTAKWLPDPPS AGHL" gene 1790284..1791333 /gene="bioB" /locus_tag="Rv1589" /db_xref="GeneID:886301" CDS 1790284..1791333 /gene="bioB" /locus_tag="Rv1589" /EC_number="2.8.1.6" /function="INVOLVED IN BIOTIN SYNTHESIS." /note="catalyzes the formation of biotin from dethiobiotin and sulfur 2 S-adenosyl-L-methionine" /codon_start=1 /transl_table=11 /product="biotin synthase" /protein_id="NP_216105.1" /db_xref="GI:15608727" /db_xref="GOA:O06601" /db_xref="UniProtKB/Swiss-Prot:O06601" /db_xref="GeneID:886301" /translation="MTQAATRPTNDAGQDGGNNSDILVVARQQVLQRGEGLNQDQVLA VLQLPDDRLEELLALAHEVRMRWCGPEVEVEGIISLKTGGCPEDCHFCSQSGLFASPV RSAWLDIPSLVEAAKQTAKSGATEFCIVAAVRGPDERLMAQVAAGIEAIRNEVEINIA CSLGMLTAEQVDQLAARGVHRYNHNLETARSFFANVVTTHTWEERWQTLSMVRDAGME VCCGGILGMGETLQQRAEFAAELAELGPDEVPLNFLNPRPGTPFADLEVMPVGDALKA VAAFRLALPRTMLRFAGGREITLGDLGAKRGILGGINAVIVGNYLTTLGRPAEADLEL LDELQMPLKALNASL" gene 1791334..1791573 /locus_tag="Rv1590" /db_xref="GeneID:886292" CDS 1791334..1791573 /locus_tag="Rv1590" /function="UNKNOWN" /note="Rv1590, (MTCY336.14c), len: 79 aa. Conserved hypothetical protein, similar to Q49616|LEPB1170_C1_162|YF90_MYCLE from Mycobacterium leprae (80 aa), FASTA scores: opt: 368, E(): 1.7e-21, Smith-Waterman score: 368, (67.1% identity in 73 aa overlap). TBparse score is 0.909" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216106.1" /db_xref="GI:15608728" /db_xref="UniProtKB/Swiss-Prot:P64881" /db_xref="GeneID:886292" /translation="MVEIVAGKQRAPVAAGVYNVYTGELADTATPTAARMGLEPPRFC AQCGRRMVVQVRPDGWWARCSRHGQVDSADLATQR" gene 1791570..1792235 /locus_tag="Rv1591" /db_xref="GeneID:886295" CDS 1791570..1792235 /locus_tag="Rv1591" /function="UNKNOWN" /note="Rv1591, (MTCY336.13c), len: 221 aa. Probable transmembrane protein, similar to Q49626|LEPB1170_C3_229|YF91_MYCLE Hypothetical Mycobacterium leprae protein (198 aa), FASTA results: opt: 802, E(): 0, (63.8% identity in 188 aa overlap). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216107.1" /db_xref="GI:15608729" /db_xref="GOA:O06599" /db_xref="UniProtKB/Swiss-Prot:O06599" /db_xref="GeneID:886295" /translation="MTEPPGFGGPSEPSGAPRTSRTRAVLFVMLGLSATGVLVGGLWA WIAPPIHAVVAITRAGERVHEYLGSESQNFFIAPFMLLGLLSVLAVVASALMWQWREH RGPQMVAGLSIGLTTAAAIAAGVGALVVRLRYGALDFDTVPLSRGDHALTYVTQAPPV FFARRPLQIALTLMWPAGIASLVYALLAAGTARDDLGGYPAVDPSSNARTEALETPQA PVS" gene complement(1792400..1793740) /locus_tag="Rv1592c" /db_xref="GeneID:886287" CDS complement(1792400..1793740) /locus_tag="Rv1592c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1592c, (MTCY336.12), len: 446 aa. Conserved hypothetical protein, some similarity to Q49629|B1170_F1_46 from Mycobacterium leprae (132 aa), FASTA results: opt: 332, E(): 4.5e-14, (56.3% identity in 87 aa overlap). Nearly identical to truncated Mycobacterium bovis BCG protein (148 aa) AF041819|AF041819_11. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216108.1" /db_xref="GI:15608730" /db_xref="UniProtKB/TrEMBL:O06598" /db_xref="GeneID:886287" /translation="MVEPGNLAGATGAEWIGRPPHEELQRKVRPLLPSDDPFYFPPAG YQHAVPGTVLRSRDVELAFMGLIPQPVTATQLLYRTTNMYGNPEATVTTVIVPAELAP GQTCPLLSYQCAIDAMSSRCFPSYALRRRAKALGSLTQMELLMISAALAEGWAVSVPD HEGPKGLWGSPYEPGYRVLDGIRAALNSERVGLSPATPIGLWGYSGGGLASAWAAEAC GEYAPDLDIVGAVLGSPVGDLGHTFRRLNGTLLAGLPALVVAALQHSYPGLARVIKEH ANDEGRQLLEQLTEMTTVDAVIRMAGRDMGDFLDEPLEDILSTPEISHVFGDTKLGSA VPTPPVLIVQAVHDYLIDVSDIDALADSYTAGGANVTYHRDLFSEHVSLHPLSAPMTL RWLTDRFAGKPLTDHRVRTTWPTIFNPMTYAGMARLAVIAAKVITGRKLSRRPL" gene complement(1793997..1794707) /locus_tag="Rv1593c" /db_xref="GeneID:886289" CDS complement(1793997..1794707) /locus_tag="Rv1593c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1593c, (MTCY336.11), len: 236 aa. Conserved hypothetical protein, highly similar to Q49628|B1170_F1_44 from Mycobacterium leprae (286 aa), FASTA scores: opt: 1304, E (): 0, (85.4% identity in 233 aa overlap); similar to several putative DNA hydrolases e.g. Q9S233|SCI51.07C from Streptomyces coelicolor (239 aa), FASTA scores: opt: 415, E(): 4.6e-20, (34.8% identity in 221 aa overlap); also similar to P74291|SLR1690 hypothetical protein from synechocystis (261 aa), FASTA scores: opt: 228, E(): 1.4e-17, (31.5% identity in 213 aa overlap). TBparse score is 0.922" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216109.1" /db_xref="GI:15608731" /db_xref="GOA:O06597" /db_xref="UniProtKB/TrEMBL:O06597" /db_xref="GeneID:886289" /translation="MAHGSTAHEVLAVVFQVRGVGMSRGAAKPQLNVLLWQRAKEPQR GAWSLPGGRLRNDEDMTSSVRRQLAEKVDLRELAHLEQLAVFSDPHRLPGIRMIASTY LGVVPSPATPELPADTRWHPVSSLPPMAFDHGPMVTHARTRLIAKMSYTNIGFALAPK EFALSTLRDIYGAALGYQVDATNLQRVLARRRVITQTGTIAQSGRSGGRPAALYRFTD SQLRVTDEFAALRPPGQL" gene 1794756..1795805 /gene="nadA" /locus_tag="Rv1594" /db_xref="GeneID:886283" CDS 1794756..1795805 /gene="nadA" /locus_tag="Rv1594" /function="quinolinate biosynthesis" /experiment="experimental evidence, no additional details recorded" /note="3 different subfamilies; catalyzes the formation of quinolinate from iminoaspartate and dihydroxyacetone phosphate" /codon_start=1 /transl_table=11 /product="quinolinate synthetase" /protein_id="NP_216110.1" /db_xref="GI:15608732" /db_xref="GOA:P65497" /db_xref="UniProtKB/Swiss-Prot:P65497" /db_xref="GeneID:886283" /translation="MTVLNRTDTLVDELTADITNTPLGYGGVDGDERWAAEIRRLAHL RGATVLAHNYQLPAIQDVADHVGDSLALSRVAAEAPEDTIVFCGVHFMAETAKILSPH KTVLIPDQRAGCSLADSITPDELRAWKDEHPGAVVVSYVNTTAAVKALTDICCTSSNA VDVVASIDPDREVLFCPDQFLGAHVRRVTGRKNLHVWAGECHVHAGINGDELADQARA HPDAELFVHPECGCATSALYLAGEGAFPAERVKILSTGGMLEAAHTTRARQVLVATEV GMLHQLRRAAPEVDFRAVNDRASCKYMKMITPAALLRCLVEGADEVHVDPGIAASGRR SVQRMIEIGHPGGGE" gene 1795805..1797388 /gene="nadB" /locus_tag="Rv1595" /db_xref="GeneID:886285" CDS 1795805..1797388 /gene="nadB" /locus_tag="Rv1595" /EC_number="1.4.3.16" /function="QUINOLINATE BIOSYNTHESIS. CATALYZES THE OXIDATION OF L-ASPARTATE TO IMINOASPARTATE WHICH IS CONDENSED WITH DIHYDROXYACETONE PHOSPHATE TO QUINOLINATE UNDER THE ACTION OF QUINOLINATE SYNTHASE A [CATALYTIC ACTIVITY : L-ASPARTATE + H(2)O + O(2) = OXALOACETATE + NH(3) + H(2)O(2)]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of oxaloacetate from L-aspartate" /codon_start=1 /transl_table=11 /product="L-aspartate oxidase" /protein_id="NP_216111.1" /db_xref="GI:15608733" /db_xref="GOA:P65499" /db_xref="UniProtKB/Swiss-Prot:P65499" /db_xref="GeneID:886285" /translation="MAGPAWRDAADVVVIGTGVAGLAAALAADRAGRSVVVLSKAAQT HVTATHYAQGGIAVVLPDNDDSVDAHVADTLAAGAGLCDPDAVYSIVADGYRAVTDLV GAGARLDESVPGRWALTREGGHSRRRIVHAGGDATGAEVQRALQDAAGMLDIRTGHVA LRVLHDGTAVTGLLVVRPDGCGIISAPSVILATGGLGHLYSATTNPAGSTGDGIALGL WAGVAVSDLEFIQFHPTMLFAGRAGGRRPLITEAIRGEGAILVDRQGNSITAGVHPMG DLAPRDVVAAAIDARLKATGDPCVYLDARGIEGFASRFPTVTASCRAAGIDPVRQPIP VVPGAHYSCGGIVTDVYGQTELLGLYAAGEVARTGLHGANRLASNSLLEGLVVGGRAG KAAAAHAAAAGRSRATSSATWPEPISYTALDRGDLQRAMSRDASMYRAAAGLHRLCDS LSGAQVRDVACRRDFEDVALTLVAQSVTAAALARTESRGCHHRAEYPCTVPEQARSIV VRGADDANAVCVQALVAVC" gene 1797388..1798245 /gene="nadC" /locus_tag="Rv1596" /db_xref="GeneID:886281" CDS 1797388..1798245 /gene="nadC" /locus_tag="Rv1596" /EC_number="2.4.2.19" /function="DE NOVO BIOSYNTHESIS OF NAD AND NADP [CATALYTIC ACTIVITY : NICOTINATE D-RIBONUCLEOTIDE + DIPHOSPHATE + CO(2) = PYRIDINE-2,3-DICARBOXYLATE + 5-PHOSPHO-ALPHA-D-RIBOSE 1-DIPHOSPHATE]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of pyridine-2,3-dicarboxylate and 5-phospho-alpha-D-ribose 1-diphosphate from nictinate D-ribonucleotide" /codon_start=1 /transl_table=11 /product="nicotinate-nucleotide pyrophosphorylase" /protein_id="NP_216112.1" /db_xref="GI:15608734" /db_xref="GOA:O06594" /db_xref="UniProtKB/Swiss-Prot:O06594" /db_xref="GeneID:886281" /translation="MGLSDWELAAARAAIARGLDEDLRYGPDVTTLATVPASATTTAS LVTREAGVVAGLDVALLTLNEVLGTNGYRVLDRVEDGARVPPGEALMTLEAQTRGLLT AERTMLNLVGHLSGIATATAAWVDAVRGTKAKIRDTRKTLPGLRALQKYAVRTGGGVN HRLGLGDAALIKDNHVAAAGSVVDALRAVRNAAPDLPCEVEVDSLEQLDAVLPEKPEL ILLDNFAVWQTQTAVQRRDSRAPTVMLESSGGLSLQTAATYAETGVDYLAVGALTHSV RVLDIGLDM" gene 1798294..1799052 /locus_tag="Rv1597" /db_xref="GeneID:886297" CDS 1798294..1799052 /locus_tag="Rv1597" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1597, (MTCY336.07c), len: 252 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216113.1" /db_xref="GI:15608735" /db_xref="UniProtKB/TrEMBL:O06593" /db_xref="GeneID:886297" /translation="MARTFEDLVAEAASASVGGWGFSWLDGRATEERPSWGYQRQLSQ RLANATAALDLETGGGEVLAGAGNFPPTMVATEAWPPNAAMATRRLHPLGAVVVITGD KPPLPFADAAFDLVTSRHPSTRWWTEIARVLRAGGSYFAQHVGPATLWDLREHFLGPR EHNGADQYAQVVRTCITDAGLEIVDLQMERLRVEFFDVGAVIYFLRKVIWFLPDFTVE GYHDRLRALHERIQAEGPFVTYSTRALIEARKPS" gene complement(1799073..1799483) /locus_tag="Rv1598c" /db_xref="GeneID:886324" CDS complement(1799073..1799483) /locus_tag="Rv1598c" /function="UNKNOWN" /note="Rv1598c, (MTCY336.06), len: 136 aa. Conserved hypothetical protein, some similarity to O06389|Rv0523c|MTCY25D10.02 from Mycobacterium tuberculosis (131 aa), FASTA scores: E(): 2.2e-09, (38.4% identity in 99 aa overlap); and P95144|MTCY359.02|Rv1871c (129 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216114.1" /db_xref="GI:15608736" /db_xref="GOA:O06592" /db_xref="UniProtKB/TrEMBL:O06592" /db_xref="GeneID:886324" /translation="MSAKDHPNNAPGVPMVFPLWLERLQVKYINRALKPIARYLPGTA TIEHRGRKSGKPYQTIVTAYRKDGVLAIALAHGKTDWVKNVLAAGEADVHFARGVVHV INPRIVPAGSDGQGLPRMARLQLRRIGVFVGDIA" gene 1799583..1800899 /gene="hisD" /locus_tag="Rv1599" /db_xref="GeneID:886277" CDS 1799583..1800899 /gene="hisD" /locus_tag="Rv1599" /EC_number="1.1.1.23" /function="Involved in histidine biosynthesis pathway (tenth step). THIS PROTEIN IS CONSIDERED AS A BIFUNCTIONAL ENZYME, POSSESSING TWO ACTIVE SITES, ONE AN ALCOHOL DEHYDROGENASE AND THE OTHER AN ALDEHYDE DEHYDROGENASE [CATALYTIC ACTIVITY : L-HISTIDINOL + 2 NAD(+) + H(2)O = L-HISTIDINE + 2 NADH]." /note="catalyzes the oxidation of L-histidinol to L-histidinaldehyde and then to L-histidine in histidine biosynthesis; functions as a dimer" /codon_start=1 /transl_table=11 /product="histidinol dehydrogenase" /protein_id="NP_216115.1" /db_xref="GI:15608737" /db_xref="GOA:P63950" /db_xref="UniProtKB/Swiss-Prot:P63950" /db_xref="GeneID:886277" /translation="MLTRIDLRGAELTAAELRAALPRGGADVEAVLPTVRPIVAAVAE RGAEAALDFGASFDGVRPHAIRVPDAALDAALAGLDCDVCEALQVMVERTRAVHSGQR RTDVTTTLGPGATVTERWVPVERVGLYVPGGNAVYPSSVVMNVVPAQAAGVDSLVVAS PPQAQWDGMPHPTILAAARLLGVDEVWAVGGAQAVALLAYGGTDTDGAALTPVDMITG PGNIYVTAAKRLCRSRVGIDAEAGPTEIAILADHTADPVHVAADLISQAEHDELAASV LVTPSEDLADATDAELAGQLQTTVHRERVTAALTGRQSAIVLVDDVDAAVLVVNAYAA EHLEIQTADAPQVASRIRSAGAIFVGPWSPVSLGDYCAGSNHVLPTAGCARHSSGLSV QTFLRGIHVVEYTEAALKDVSGHVITLATAEDLPAHGEAVRRRFER" misc_feature 1800291..1800389 /gene="hisD" /locus_tag="Rv1599" /note="PS00611 Histidinol dehydrogenase signature" gene 1800896..1802038 /gene="hisC1" /locus_tag="Rv1600" /db_xref="GeneID:886298" CDS 1800896..1802038 /gene="hisC1" /locus_tag="Rv1600" /EC_number="2.6.1.9" /function="histidine biosynthesis (eighth step) [CATALYTIC ACTIVITY : L-HISTIDINOL-PHOSPHATE + 2-OXOGLUTARATE = 3-(IMIDAZOL-4-YL)-2-OXOPROPYL PHOSPHATE + L-GLUTAMATE]" /note="catalyzes the formation of L-histidinol phosphate from imidazole-acetol phosphate and glutamate in histidine biosynthesis" /codon_start=1 /transl_table=11 /product="histidinol-phosphate aminotransferase" /protein_id="YP_177823.1" /db_xref="GI:57116891" /db_xref="GOA:O06591" /db_xref="UniProtKB/Swiss-Prot:O06591" /db_xref="GeneID:886298" /translation="MTRSGHPVTLDDLPLRADLRGKAPYGAPQLAVPVRLNTNENPHP PTRALVDDVVRSVREAAIDLHRYPDRDAVALRADLAGYLTAQTGIQLGVENIWAANGS NEILQQLLQAFGGPGRSAIGFVPSYSMHPIISDGTHTEWIEASRANDFGLDVDVAVAA VVDRKPDVVFIASPNNPSGQSVSLPDLCKLLDVAPGIAIVDEAYGEFSSQPSAVSLVE EYPSKLVVTRTMSKAFAFAGGRLGYLIATPAVIDAMLLVRLPYHLSSVTQAAARAALR HSDDTLSSVAALIAERERVTTSLNDMGFRVIPSDANFVLFGEFADAPAAWRRYLEAGI LIRDVGIPGYLRATTGLAEENDAFLRASARIATDLVPVTRSPVGAP" misc_feature 1801580..1801609 /gene="hisC1" /locus_tag="Rv1600" /note="PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site" gene 1802035..1802667 /gene="hisB" /locus_tag="Rv1601" /db_xref="GeneID:886274" CDS 1802035..1802667 /gene="hisB" /locus_tag="Rv1601" /EC_number="4.2.1.19" /function="histidine biosynthesis (seventh step) [CATALYTIC ACTIVITY : D-ERYTHRO-1-(IMIDAZOL-4-YL)GLYCEROL 3- PHOSPHATE = 3-(IMIDAZOL-4-YL)-2-OXOPROPYL PHOSPHATE + H(2)O]" /note="catalyzes the dehydration of D-erythro-1-(imidazol-4-yl)glycerol 3-phosphate to 3-(imidazol-4-yl)-2-oxopropyl phosphate in histidine biosynthesis" /codon_start=1 /transl_table=11 /product="imidazoleglycerol-phosphate dehydratase" /protein_id="NP_216117.1" /db_xref="GI:15608739" /db_xref="GOA:P64368" /db_xref="UniProtKB/Swiss-Prot:P64368" /db_xref="GeneID:886274" /translation="MTTTQTAKASRRARIERRTRESDIVIELDLDGTGQVAVDTGVPF YDHMLTALGSHASFDLTVRATGDVEIEAHHTIEDTAIALGTALGQALGDKRGIRRFGD AFIPMDETLAHAAVDLSGRPYCVHTGEPDHLQHTTIAGSSVPYHTVINRHVFESLAAN ARIALHVRVLYGRDPHHITEAQYKAVARALRQAVEPDPRVSGVPSTKGAL" gene 1802664..1803284 /gene="hisH" /locus_tag="Rv1602" /db_xref="GeneID:885529" CDS 1802664..1803284 /gene="hisH" /locus_tag="Rv1602" /EC_number="2.4.2.-" /function="histidine biosynthesis pathway (fifth step). CATALYZES AN AMIDOTRANSFERASE REACTION THAT GENERATES IMIDAZOLE-GLYCEROL PHOSPHATE AND 5-AMINOIMIDAZOL-4-CARBOXAMIDE RIBONUCLEOTIDE, WHICH IS USED FOR PURINE SYNTHESIS." /note="with HisF IGPS catalyzes the conversion of phosphoribulosyl-formimino-5-aminoimidazole-4-carboxamide ribonucleotide phosphate and glutamine to imidazole-glycerol phosphate, 5-aminoimidazol-4-carboxamide ribonucleotide, and glutamate in histidine biosynthesis; the HisH subunit provides the glutamine amidotransferase activity that produces the ammonia necessary to HisF for the synthesis of imidazole-glycerol phosphate and 5-aminoimidazol-4-carboxamide ribonucleotide" /codon_start=1 /transl_table=11 /product="imidazole glycerol phosphate synthase subunit HisH" /protein_id="NP_216118.1" /db_xref="GI:15608740" /db_xref="GOA:O06589" /db_xref="UniProtKB/Swiss-Prot:O06589" /db_xref="GeneID:885529" /translation="MTAKSVVVLDYGSGNLRSAQRALQRVGAEVEVTADTDAAMTADG LVVPGVGAFAACMAGLRKISGERIIAERVAAGRPVLGVCVGMQILFACGVEFGVQTPG CGHWPGAVIRLEAPVIPHMGWNVVDSAAGSALFKGLDVDARFYFVHSYAAQRWEGSPD ALLTWATYRAPFLAAVEDGALAATQFHPEKSGDAGAAVLSSWVDGL" misc_feature 1802895..1802930 /gene="hisH" /locus_tag="Rv1602" /note="PS00442 Glutamine amidotransferases class-I active site" gene 1803294..1804031 /gene="hisA" /locus_tag="Rv1603" /db_xref="GeneID:885873" CDS 1803294..1804031 /gene="hisA" /locus_tag="Rv1603" /EC_number="5.3.1.16" /function="histidine biosynthesis pathway (fourth step) [CATALYTIC ACTIVITY : N-(5'-PHOSPHO-D-RIBOSYLFORMIMINO)-5-AMINO-1-(5''-PHOSPHOR IB OSYL)-4-IMIDAZOLECARBOXAMIDE = N-(5'-PHOSPHO-D-1'-RIBULOSYLFORMIMINO)-5-AMINO-1-(5''-PHO SP HORIBOSYL)-4- IMIDAZOLECARBOXAMIDE.]" /note="catalyzes the formation of 5-(5-phospho-1-deoxyribulos-1-ylamino)methylideneamino-l- (5-phosphoribosyl)imidazole-4-carboxamide from 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide and the formation of 1-(2-carboxyphenylamino)-1-deoxy-D-ribulose 5-phosphate from N-(5-phospho-beta-D-ribosyl)anthranilate; involved in histidine and tryptophan biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosyl isomerase A" /protein_id="NP_216119.1" /db_xref="GI:15608741" /db_xref="GOA:P60578" /db_xref="UniProtKB/Swiss-Prot:P60578" /db_xref="GeneID:885873" /translation="MMPLILLPAVDVVEGRAVRLVQGKAGSQTEYGSAVDAALGWQRD GAEWIHLVDLDAAFGRGSNHELLAEVVGKLDVQVELSGGIRDDESLAAALATGCARVN VGTAALENPQWCARVIGEHGDQVAVGLDVQIIDGEHRLRGRGWETDGGDLWDVLERLD SEGCSRFVVTDITKDGTLGGPNLDLLAGVADRTDAPVIASGGVSSLDDLRAIATLTHR GVEGAIVGKALYARRFTLPQALAAVRD" gene 1804039..1804851 /gene="impA" /locus_tag="Rv1604" /db_xref="GeneID:885567" CDS 1804039..1804851 /gene="impA" /locus_tag="Rv1604" /EC_number="3.1.3.25" /function="INVOLVED IN INOSITOL PHOSPHATE METABOLISM. IT IS RESPONSIBLE FOR THE PROVISION OF INOSITOL REQUIRED FOR SYNTHESIS OF PHOSPHATIDYLINOSITOL AND POLYPHOSPHOINOSITIDES. KEY ENZYME OF THE PHOSPHATIDYL INOSITOL SIGNALING PATHWAY [CATALYTIC ACTIVITY: INOSITOL 1(or 4)-MONOPHOSPHATE + H(2)O = INOSITOL + ORTHOPHOSPHATE]." /note="Rv1604, (MTV046.02), len: 270 aa. Probable impA, inositol monophosphatase (EC 3.1.3.25), similar to many e.g. AF0059|AF005905_2 inositol monophosphate phosphatase from Mycobacterium smegmatis (276 aa), FASTA scores: opt: 1241, E(): 0, (70.5% identity in 261 aa overlap). Also similar to Mycobacterium tuberculosis proteins Rv3137 and Rv2701c." /codon_start=1 /transl_table=11 /product="inositol-monophosphatase" /protein_id="NP_216120.1" /db_xref="GI:15608742" /db_xref="GOA:O53907" /db_xref="UniProtKB/TrEMBL:O53907" /db_xref="GeneID:885567" /translation="MHLDSLVAPLVEQASAILDAATALFLVGHRADSAVRKKGNDFAT EVDLAIERQVVAALVAATGIEVHGEEFGGPAVDSRWVWVLDPIDGTINYAAGSPLAAI LLGLLHDGVPVAGLTWMPFTDPRYTAVAGGPLIKNGVPQPPLADAELANVLVGVGTFS ADSRGQFPGRYRLAVLEKLSRVSSRLRMHGSTGIDLVFVADGILGGAISFGGHVWDHA AGVALVRAAGGVVTDLAGQPWTPASRSALAGPPRVHAQILEILGSIGEPEDY" gene 1804853..1805656 /gene="hisF" /locus_tag="Rv1605" /db_xref="GeneID:885261" CDS 1804853..1805656 /gene="hisF" /locus_tag="Rv1605" /function="histidine biosynthesis pathway (sixth step). CATALYZES THE CYCLIZATION REACTION THAT PRODUCES D-ERYTHRO-IMIDAZOLE GLYCEROL PHOSPHATE." /note="catalyzes the conversion of 5-[(5-phospho-1-deoxyribulos-1-ylamino)methylideneamino]- 1-(5-phosphoribosyl)imidazole-4-carboxamideand glutamine to imidazole-glycerol phosphate, 5-aminoimidazol-4-carboxamideribonucleotide and glutamate; the HisF subunit acts as a cyclase" /codon_start=1 /transl_table=11 /product="imidazole glycerol phosphate synthase subunit HisF" /protein_id="NP_216121.1" /db_xref="GI:15608743" /db_xref="GOA:O53908" /db_xref="UniProtKB/Swiss-Prot:O53908" /db_xref="GeneID:885261" /translation="MYADRDLPGAGGLAVRVIPCLDVDDGRVVKGVNFENLRDAGDPV ELAAVYDAEGADELTFLDVTASSSGRATMLEVVRRTAEQVFIPLTVGGGVRTVADVDS LLRAGADKVAVNTAAIACPDLLADMARQFGSQCIVLSVDARTVPVGSAPTPSGWEVTT HGGRRGTGMDAVQWAARGADLGVGEILLNSMDADGTKAGFDLALLRAVRAAVTVPVIA SGGAGAVEHFAPAVAAGADAVLAASVFHFRELTIGQVKAALAAEGITVR" gene 1805653..1806000 /gene="hisI" /locus_tag="Rv1606" /db_xref="GeneID:886011" CDS 1805653..1806000 /gene="hisI" /locus_tag="Rv1606" /EC_number="3.5.4.19" /function="Involved in histidine biosynthesis pathway (at the third step) [CATALYTIC ACTIVITY : 5-PHOSPHORIBOSYL-AMP + H(2)O = 5-(5-PHOSPHO-D-RIBOSYLAMINOFORMIMINO)-1-(5-PHOSPHO-RIBOSY L) IMIDAZOLE-4- CARBOXAMIDE.]" /note="PR-AMP cyclohydrolase; functions in histidine biosynthesis from PRPP; converts 1-(5-phosphoribosyl)-AMP to 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino]imidazole-4- carboxyamide during the histidine biosynthesis pathway; binds zinc and magnesium; forms homodimers" /codon_start=1 /transl_table=11 /product="phosphoribosyl-AMP cyclohydrolase" /protein_id="NP_216638.2" /db_xref="GI:57116892" /db_xref="GOA:O53909" /db_xref="UniProtKB/Swiss-Prot:O53909" /db_xref="GeneID:886011" /translation="MTLDPKIAARLKRNADGLVTAVVQERGSGDVLMVAWMNDEALAR TLQTREATYYSRSRAEQWVKGATSGHTQHVHSVRLDCDGDAVLLTVDQVGGACHTGDH SCFDAAVLLEPDD" gene 1806181..1807263 /gene="chaA" /locus_tag="Rv1607" /db_xref="GeneID:885524" CDS 1806181..1807263 /gene="chaA" /locus_tag="Rv1607" /function="INVOLVED IN TRANSPORT OF IONS (PRESUMABLY CALCIUM) ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1607, (MTV046.05), len: 360 aa. Probable chaA, ionic transporter integral membrane protein, putative calcium/proton antiporter, similar to many e.g. P31801|CHAA_ECOLI CALCIUM/PROTON ANTIPORTER from Escherichia coli (366 aa), FASTA scores: opt: 736, E(): 0, (35.9% identity in 351 aa overlap). Equivalent to Mycobacterium leprae AL049913|MLCB1610_21 (77.7% identity in 364 aa overlap). SEEMS TO BELONG TO THE CaCA FAMILY. TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="ionic transporter integral membrane protein chaA" /protein_id="NP_216123.1" /db_xref="GI:15608745" /db_xref="GOA:O53910" /db_xref="UniProtKB/TrEMBL:O53910" /db_xref="GeneID:885524" /translation="MLKRVPWTVVLPSLAFVALVLTWGKQIGPVVGLLAAVLLAGAVL AAVNHAEVVAARVGEPFGSLVLAVAVTTIEVALIVALMVSGGDDAATLARDTVFAAVM ITTNGIAGLSLLLGSLRYGVTLFNPHGSGAALATVTTLATLSLVLPTFTTSQSGPELS PGQLIFAGAASLGLYVLFLFTQTVRHRDFFLPVAQKGAVEDDSHADPPSTRAALLSLG LLLVALVAVVGLAKVESPVIEEVVSAAGFPQSFVGVVIATLVLLPETLAAARAARQGR LQTSLNLAYGSAMASIGLTIPTIALASLWLSGPLQLGLGAIQLVLLVLTVVVSVLTVV PGRATRLQGEVHLVLLAAYLFLAVVP" gene complement(1807298..1807762) /gene="bcpB" /locus_tag="Rv1608c" /db_xref="GeneID:885530" CDS complement(1807298..1807762) /gene="bcpB" /locus_tag="Rv1608c" /function="peroxide detoxification" /note="Rv1608c, (MTV046.06), len: 154 aa. Probable bcpB, peroxidoxin or bacterioferritin comigratory protein, similar to many, e.g. AE0003|ECAE000335_4 bacterioferritin comigratory protein from Escherichia coli K-12 MG1655 (156 aa), FASTA scores: opt: 329, E(): 1.2e-16, (38.2% identity in 152 aa overlap); Z97179|MLCL383_22 Mycobacterium leprae cosmid L383 (161 aa) (40.2% identity in 132 aa overlap). Also similar to Rv2428 AhpC, alkyl hydroperoxide reductase from Mycobacterium tuberculosis; and other Mycobacterium tuberculosis putative peroxidoxins Rv2521, Rv2238c, Rv1932." /codon_start=1 /transl_table=11 /product="peroxidoxin BcpB" /protein_id="NP_216124.1" /db_xref="GI:15608746" /db_xref="UniProtKB/TrEMBL:O53911" /db_xref="GeneID:885530" /translation="MKTGDTVADFELPDQTGTPRRLSVLLSDGPVVLFFYPAAMTPGC TKEACHFRDLAKEFAEVRASRVGISTDPVRKQAKFAEVRRFDYPLLSDAQGTVAAQFG VKRGLLGKLMPVKRTTFVIDTDRKVLDVISSEFSMDAHADKALATLRAIRSG" gene 1807903..1809453 /gene="trpE" /locus_tag="Rv1609" /db_xref="GeneID:885040" CDS 1807903..1809453 /gene="trpE" /locus_tag="Rv1609" /EC_number="4.1.3.27" /function="Involved in tryptophan biosynthesis pathway (at the first step). SUPPOSED TETRAMER OF TWO COMPONENTS I AND TWO COMPONENTS II: COMPONENT I (Rv1609|trpE) CATALYZES THE FORMATION OF ANTHRANILATE USING AMMONIA RATHER THAN GLUTAMINE, WHEREAS COMPONENT II (Rv0013|trpG) PROVIDES GLUTAMINE AMIDOTRANSFERASE ACTIVITY [CATALYTIC ACTIVITY: CHORISMATE + L-GLUTAMINE = ANTHRANILATE + PYRUVATE + L-GLUTAMATE]." /note="with component II, the glutamine amidotransferase, catalyzes the formation of anthranilate from chorismate and glutamine" /codon_start=1 /transl_table=11 /product="anthranilate synthase component I" /protein_id="NP_216125.1" /db_xref="GI:15608747" /db_xref="GOA:P67001" /db_xref="UniProtKB/Swiss-Prot:P67001" /db_xref="GeneID:885040" /translation="MHADLAATTSREDFRLLAAEHRVVPVTRKVLADSETPLSAYRKL AANRPGTFLLESAENGRSWSRWSFIGAGAPTALTVREGQAVWLGAVPKDAPTGGDPLR ALQVTLELLATADRQSEPGLPPLSGGMVGFFAYDMVRRLERLPERAVDDLCLPDMLLL LATDVAAVDHHEGTITLIANAVNWNGTDERVDWAYDDAVARLDVMTAALGQPLPSTVA TFSRPEPRHRAQRTVEEYGAIVEYLVDQIAAGEAFQVVPSQRFEMDTDVDPIDVYRIL RVTNPSPYMYLLQVPNSDGAVDFSIVGSSPEALVTVHEGWATTHPIAGTRWRGRTDDE DVLLEKELLADDKERAEHLMLVDLGRNDLGRVCTPGTVRVEDYSHIERYSHVMHLVST VTGKLGEGRTALDAVTACFPAGTLSGAPKVRAMELIEEVEKTRRGLYGGVVGYLDFAG NADFAIAIRTALMRNGTAYVQAGGGVVADSNGSYEYNEARNKARAVLNAIAAAETLAA PGANRSGC" gene 1809443..1810150 /locus_tag="Rv1610" /db_xref="GeneID:885293" CDS 1809443..1810150 /locus_tag="Rv1610" /function="UNKNOWN" /note="Rv1610, (MTCY01B2.02), len: 235 aa. Possible conserved membrane protein. Equivalent to AL049913|MLCB1610_23 hypothetical protein from Mycobacterium leprae (264 aa), FASTA score: (65.8% identity in 231 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216126.1" /db_xref="GI:15608748" /db_xref="UniProtKB/TrEMBL:O06128" /db_xref="GeneID:885293" /translation="MAANAGSVRPNRRARPMIGIAQLLLVVAAGALWMAARLPWVVIG SFDELGPPKEVTLTGASWSTALLPLALLMLAAAVAALAVRGWPLRALAVLLAAASFAV GYLGISLWVVPDVAARGADLAHVPVVTLVGSARHYWGAVAAVLAAVCALLAAVFLMSS AAIRGSAGEDMARYAAPRARRSIARRQHSNAAGRAAPQDDGPDMGPRMSERMIWEALD EGRDPTDREQESDTEGR" gene 1810240..1811058 /gene="trpC" /locus_tag="Rv1611" /db_xref="GeneID:885294" CDS 1810240..1811058 /gene="trpC" /locus_tag="Rv1611" /EC_number="4.1.1.48" /function="tryptophan biosynthesis pathway (fourth step) [CATALYTIC ACTIVITY : 1-(2-CARBOXYPHENYLAMINO)-1-DEOXY-D-RIBULOSE 5-PHOSPHATE = 1-(INDOL-3-YL)GLYCEROL 3-PHOSPHATE + CO(2) + H(2)O.]" /note="involved in tryptophan biosynthesis; amino acid biosynthesis; converts 1-(2-carboxyphenylamino)-1-deoxy-D-ribulose 5-phosphate to C(1)-(3-indolyl)-glycerol 3-phosphate and carbon dioxide and water" /codon_start=1 /transl_table=11 /product="indole-3-glycerol-phosphate synthase" /protein_id="NP_216127.1" /db_xref="GI:15608749" /db_xref="GOA:O06129" /db_xref="UniProtKB/Swiss-Prot:O06129" /db_xref="GeneID:885294" /translation="MSPATVLDSILEGVRADVAAREASVSLSEIKAAAAAAPPPLDVM AALREPGIGVIAEVKRASPSAGALATIADPAKLAQAYQDGGARIVSVVTEQRRFQGSL DDLDAVRASVSIPVLRKDFVVQPYQIHEARAHGADMLLLIVAALEQSVLVSMLDRTES LGMTALVEVHTEQEADRALKAGAKVIGVNARDLMTLDVDRDCFARIAPGLPSSVIRIA ESGVRGTADLLAYAGAGADAVLVGEGLVTSGDPRAAVADLVTAGTHPSCPKPAR" misc_feature 1810399..1810443 /gene="trpC" /locus_tag="Rv1611" /note="PS00614 Indole-3-glycerol phosphate synthase signature" gene 1811127..1812359 /gene="trpB" /locus_tag="Rv1612" /db_xref="GeneID:885297" CDS 1811127..1812359 /gene="trpB" /locus_tag="Rv1612" /EC_number="4.2.1.20" /function="tryptophan biosynthesis pathway (fifth last step). THE BETA SUBUNIT IS RESPONSIBLE FOR THE SYNTHESIS OF L-TRYPTOPHAN FROM INDOLE AND L-SERINE. [CATALYTIC ACTIVITY : L-SERINE + 1-(INDOL-3-YL)GLYCEROL 3-PHOSPHATE = L-TRYPTOPHAN + GLYCERALDEHYDE 3-PHOSPHATE + H(2)O]" /note="catalyzes the formation of L-tryptophan from L-serine and 1-(indol-3-yl)glycerol 3-phosphate" /codon_start=1 /transl_table=11 /product="tryptophan synthase subunit beta" /protein_id="NP_216128.1" /db_xref="GI:15608750" /db_xref="GOA:P66984" /db_xref="UniProtKB/Swiss-Prot:P66984" /db_xref="GeneID:885297" /translation="MSAAIAEPTSHDPDSGGHFGGPSGWGGRYVPEALMAVIEEVTAA YQKERVSQDFLDDLDRLQANYAGRPSPLYEATRLSQHAGSARIFLKREDLNHTGSHKI NNVLGQALLARRMGKTRVIAETGAGQHGVATATACALLGLDCVIYMGGIDTARQALNV ARMRLLGAEVVAVQTGSKTLKDAINEAFRDWVANADNTYYCFGTAAGPHPFPTMVRDF QRIIGMEARVQIQGQAGRLPDAVVACVGGGSNAIGIFHAFLDDPGVRLVGFEAAGDGV ETGRHAATFTAGSPGAFHGSFSYLLQDEDGQTIESHSISAGLDYPGVGPEHAWLKEAG RVDYRPITDSEAMDAFGLLCRMEGIIPAIESAHAVAGALKLGVELGRGAVIVVNLSGR GDKDVETAAKWFGLLGND" misc_feature 1811406..1811435 /gene="trpB" /locus_tag="Rv1612" /note="PS00168 Tryptophan synthase beta chain pyridoxal-phosphate attachment site" gene 1812359..1813171 /gene="trpA" /locus_tag="Rv1613" /db_xref="GeneID:885291" CDS 1812359..1813171 /gene="trpA" /locus_tag="Rv1613" /EC_number="4.2.1.20" /function="tryptophan biosynthesis pathway (fifth - last step). THE ALPHA SUBUNIT IS RESPONSIBLE FOR THE ALDOL CLEAVAGE OF INDOLEGLYCEROL PHOSPHATE TO INDOLE AND GLYCERALDEHYDE 3- PHOSPHATE. [CATALYTIC ACTIVITY: L-SERINE + 1-(INDOL-3-YL)GLYCEROL 3-PHOSPHATE = L-TRYPTOPHAN + GLYCERALDEHYDE 3-PHOSPHATE + H(2)O.]" /note="catalyzes the formation of indole and glyceraldehyde 3-phosphate from indoleglycerol phosphate in tryptophan biosynthesis" /codon_start=1 /transl_table=11 /product="tryptophan synthase subunit alpha" /protein_id="NP_216129.1" /db_xref="GI:15608751" /db_xref="GOA:P66980" /db_xref="UniProtKB/Swiss-Prot:P66980" /db_xref="GeneID:885291" /translation="MVAVEQSEASRLGPVFDSCRANNRAALIGYLPTGYPDVPASVAA MTALVESGCDIIEVGVPYSDPGMDGPTIARATEAALRGGVRVRDTLAAVEAISIAGGR AVVMTYWNPVLRYGVDAFARDLAAAGGLGLITPDLIPDEAQQWLAASEEHRLDRIFLV APSSTPERLAATVEASRGFVYAASTMGVTGARDAVSQAAPELVGRVKAVSDIPVGVGL GVRSRAQAAQIAQYADGVIVGSALVTALTEGLPRLRALTGELAAGVRLGMSA" gene 1813171..1814577 /gene="lgt" /locus_tag="Rv1614" /db_xref="GeneID:885292" CDS 1813171..1814577 /gene="lgt" /locus_tag="Rv1614" /EC_number="2.4.99.-" /function="prolipoprotein modification" /note="transfers the N-acyl diglyceride moiety to the prospective N-terminal cysteine in prolipoprotein" /codon_start=1 /transl_table=11 /product="prolipoprotein diacylglyceryl transferase" /protein_id="NP_216130.1" /db_xref="GI:15608752" /db_xref="GOA:O06131" /db_xref="UniProtKB/Swiss-Prot:O06131" /db_xref="GeneID:885292" /translation="MRMLPSYIPSPPRGVWYLGPLPVRAYAVCVITGIIVALLIGDRR LTARGGERGMTYDIALWAVPFGLIGGRLYHLATDWRTYFGDGGAGLAAALRIWDGGLG IWGAVTLGVMGAWIGCRRCGIPLPVLLDAVAPGVVLAQAIGRLGNYFNQELYGRETTM PWGLEIFYRRDPSGFDVPNSLDGVSTGQVAFVVQPTFLYELIWNVLVFVALIYIDRRF IIGHGRLFGFYVAFYCAGRFCVELLRDDPATLIAGIRINSFTSTFVFIGAVVYIILAP KGREAPGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVK AEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAE AASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPDGIRRQ DDFSSRRRRWWRLRRRRQ" gene 1815253..1815693 /locus_tag="Rv1615" /db_xref="GeneID:885499" CDS 1815253..1815693 /locus_tag="Rv1615" /function="UNKNOWN" /note="Rv1615, (MTCY01B2.07), len: 146 aa. Probable membrane protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216131.1" /db_xref="GI:15608753" /db_xref="UniProtKB/TrEMBL:O06132" /db_xref="GeneID:885499" /translation="MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPY PQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGL GIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRDGS" gene 1815683..1816081 /locus_tag="Rv1616" /db_xref="GeneID:885498" CDS 1815683..1816081 /locus_tag="Rv1616" /function="UNKNOWN" /note="Rv1616, (MTCY01B2.08), len: 132 aa. Conserved membrane protein, with some similarity to other hypothetical proteins e.g. AL096884|SC4G6_9 from Streptomyces coelicolor cosmid 4G6 (148 aa), FASTA scores: opt: 245, E(): 1.7e-1 0, (36.7% identity in 128 aa overlap); Q55401|SLL0543 HYPOTHETICAL 16.5 kDa PROTEIN from SYNECHOCYSTIS SP (148 aa), FASTA scores: opt: 225, E(): 6.5e-10, (35.9% identity in 117 aa overlap). Has cysteine cluster and contains a rubredoxin signature (PS00202)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216132.1" /db_xref="GI:15608754" /db_xref="UniProtKB/TrEMBL:O06133" /db_xref="GeneID:885498" /translation="MEASGRQRRYAAAGSVVLLAGALGYIGLVDPHNSNSLYPPCLFK LLTGWNCPACGGLRMIHDLLHGELAASINDNVFLLVGVPVLASWVLLRRRHGDLALPI PVMIAVAVAVIAWTVLRNLPGFPLVPTISG" misc_feature 1815815..1815847 /locus_tag="Rv1616" /note="PS00202 Rubredoxin signature" gene 1816189..1817607 /gene="pykA" /locus_tag="Rv1617" /db_xref="GeneID:885501" CDS 1816189..1817607 /gene="pykA" /locus_tag="Rv1617" /EC_number="2.7.1.40" /function="produces phosphoenol pyruvate in glycolysis [CATALYTIC ACTIVITY : ATP + PYRUVATE = ADP + PHOSPHOENOLPYRUVATE]" /note="catalyzes the formation of phosphoenolpyruvate from pyruvate" /codon_start=1 /transl_table=11 /product="pyruvate kinase" /protein_id="NP_216133.1" /db_xref="GI:15608755" /db_xref="GOA:O06134" /db_xref="UniProtKB/Swiss-Prot:O06134" /db_xref="GeneID:885501" /translation="MTRRGKIVCTLGPATQRDDLVRALVEAGMDVARMNFSHGDYDDH KVAYERVRVASDATGRAVGVLADLQGPKIRLGRFASGATHWAEGETVRITVGACEGSH DRVSTTYKRLAQDAVAGDRVLVDDGKVALVVDAVEGDDVVCTVVEGGPVSDNKGISLP GMNVTAPALSEKDIEDLTFALNLGVDMVALSFVRSPADVELVHEVMDRIGRRVPVIAK LEKPEAIDNLEAIVLAFDAVMVARGDLGVELPLEEVPLVQKRAIQMARENAKPVIVAT QMLDSMIENSRPTRAEASDVANAVLDGADALMLSGETSVGKYPLAAVRTMSRIICAVE ENSTAAPPLTHIPRTKRGVISYAARDIGERLDAKALVAFTQSGDTVRRLARLHTPLPL LAFTAWPEVRSQLAMTWGTETFIVPKMQSTDGMIRQVDKSLLELARYKRGDLVVIVAG APPGTVGSTNLIHVHRIGEDDV" gene 1817615..1818517 /gene="tesB1" /locus_tag="Rv1618" /db_xref="GeneID:885500" CDS 1817615..1818517 /gene="tesB1" /locus_tag="Rv1618" /EC_number="3.1.2.-" /function="Involved in fatty acid metabolism." /note="Rv1618, (MTCY01B2.10), len: 300 aa. Probable tesB1, acyl-CoA thioesterase II (EC 3.1.2.-), similar to other acyl-CoA thioesterases e.g. TESB_ECOLI|P23911 acyl-coa thioesterase II from Escherichia coli (285 aa), FASTA scores: opt: 495, E(): 2.9e-27, (32.5% identity in 283 aa overlap); etc. Also similar to Rv2605c|tesB2 from M. tuberculosis." /codon_start=1 /transl_table=11 /product="acyl-CoA thioesterase II TesB1" /protein_id="NP_216134.1" /db_xref="GI:15608756" /db_xref="GOA:O06135" /db_xref="UniProtKB/TrEMBL:O06135" /db_xref="GeneID:885500" /translation="MPDGKPMSDFDELLAVLDLNAVASDLFTGSHPSKNPLRTFGGQL MAQSFVASSRTLTRHHLPPSAFSVHFINGGDTAKDIEFQVIRLRDERRFANRRVDAVQ DGTLLSSAMVSYMAGGRGHEHALDPPQVAEPHTRPPIGELLRGYEETVPHFVNALQPI EWRYANDPAWIMRDKGDRLAYNRVWVKALGEMPDDPVLHTATLLYSSDTTVLDSVITT HGLSWGFDRIFAASANHSVWFHRQVNFDDWVLYSTSSPVAADSRGLGSGHFFDRSGKL IATVVQEGVLKYFPATPDSAAGRS" gene 1818575..1820029 /locus_tag="Rv1619" /db_xref="GeneID:885509" CDS 1818575..1820029 /locus_tag="Rv1619" /function="UNKNOWN" /note="Rv1619, (MTCY01B2.11), len: 484 aa. Conserved membrane protein. Some similarity to N-terminus of P94974|Rv1640c|MTCY06H11.04c PROBABLE LYSYL-TRNA SYNTHETASE 2 (EC 6.1.1.6) from Mycobacterium tuberculosis (1172 aa), FASTA scores: E(): 1.4e-16, (28.0% identity in 410 aa overlap); and similar in part to O69916| SC3C8.03C Putative intergral membrane protein from Streptomyces coelicolor cosmid 3C8 (589 aa), FASTA scores: opt: 453 E(): 8.4e-22, (31.3% identity in 313 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216135.1" /db_xref="GI:15608757" /db_xref="UniProtKB/TrEMBL:O06136" /db_xref="GeneID:885509" /translation="MVAAAGEPLNCQRANPEVTVKLPSADVVPRLRGRQRVVVHVDSR TARCVGALALVCAACWLIALLAGDYRHAQWAVAGRLGWSLTVLAAVAFIARGIFLGRP VTAMHATAAGLFLLAGLAAHVLVADLLGEILIAGSGWALMWPTSAHPRPEDLPRVWAL INATRADSLAPFAMQAGKSHHFSAAGTAALAYRTRIGYAVVSGDPIGDEAQFPQLVAD FAAMCHMHGWRIVVVGCSERRLGLWSDPMVVGQSLRPIPIGRDVVIDVSNFEMTGRRF RNLRQAVKRTHNFGVTTEIVAEQQLDDQRQAELAEVLAASPSGARTDRGFCMNLDGVL EGRYPGIQLIIARDASGRVQGFHRYATAGGGSDMSLDVPWRRRGAPNGIDERLSADMI AAAKDAGVQRLSLAFAAFPDLFGANQLGRLQRVCRALIHILDPLIALESLYRYLRKFH ALDERRYVLISMTQVFALALVLLSLEFVPRRRHL" gene complement(1819963..1821693) /gene="cydC" /locus_tag="Rv1620c" /db_xref="GeneID:885503" CDS complement(1819963..1821693) /gene="cydC" /locus_tag="Rv1620c" /function="INVOLVED IN ACTIVE TRANSPORT ACROSS THE MEMBRANE OF COMPONENT LINKED WITH THE ASSEMBLY OF CYTOCHROME: INVOLVED IN CYTOCHROME BIOGENESIS (aerobic respiration). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1620c, (MTCY01B2.12c), len: 576 aa. Probable cydC, transmembrane ATP-binding protein ABC transporter involved in transport of component linked with the assembly of cytochrome (see citation below), similar to others e.g. CYDC_ECOLI|P23886 transport ATP-binding protein from Escherichia coli (573 aa), FASTA scores: opt: 631, E(): 1.6e-30, (28.5% identity in 569 aa overlap); C-terminal part of AL034355|SCD78_14 from Streptomyces coelicolor (1172 aa), FASTA scores: opt: 956, E(): 0, (38.8% identity in 554 aa overlap); etc. Contains (PS00211) ABC transporters family signature, and (PS00017) ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="cytochrome' transport transmembrane ATP-binding protein ABC transporter CydC" /protein_id="NP_216136.1" /db_xref="GI:15608758" /db_xref="GOA:O06137" /db_xref="UniProtKB/TrEMBL:O06137" /db_xref="GeneID:885503" /translation="MNRPSAVSRRQRDLLAASGLLGPRLPRILAAVALGVLSLGSALA LAGVSAWLITRAWQMPPVLDLSVAVVAVRAFAISRGVLHYCERLATHDTALRAAGRAR TLIYHRLAHGPAAAAVGLHSGDLAARVGADVDELANMLVRALVPIAVAAVLAVAATAV VAAVSVPAAVVLAVCLLVAGVVAPWLAGRTAAAQEAIARQHRGMRDTSAMIALEHAPE LRVAGALRNVIADSQRRQHAWADALDAAARTGAIAEAMPTAAIGASLLGAVVAGIGMA PTVAPTTLAILMLLPLSAFEATVALPAAAVQLTRSRIAAARLLDLTGSNRVRETESTV SARLPVGTGVLAADVCCGHQEAQSIRVTIDLPPGARLAVTGASGAGKTTLLMTLAGLL PPVHGRVLLDGTNLSDFDEDELRSAVSFFAEDAHIFATTVRDNLLTARGDCPDDELIE ALDRVGLCGWLAGLPEGLSTVLIGGAQAVSAGQRRRLLLARAVLSPARIVLLDEPVEH LDAANADLLRDLLAPNSGIMSAMRTVVVATHHLPNDIQCAELSIATDQRCRRRGTNSS DNNTNASAKT" misc_feature complement(1820215..1820259) /gene="cydC" /locus_tag="Rv1620c" /note="PS00211 ABC transporters family signature" misc_feature complement(1820548..1820571) /gene="cydC" /locus_tag="Rv1620c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1821690..1823273) /gene="cydD" /locus_tag="Rv1621c" /db_xref="GeneID:885512" CDS complement(1821690..1823273) /gene="cydD" /locus_tag="Rv1621c" /function="INVOLVED IN ACTIVE TRANSPORT ACROSS THE MEMBRANE OF COMPONENT LINKED WITH THE ASSEMBLY OF CYTOCHROME: INVOLVED IN CYTOCHROME BIOGENESIS (aerobic respiration). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1621c, (MTCY01B2.13c), len: 527 aa. Probable cydD, transmembrane ATP-binding protein ABC transporter involved in transport of component linked with the assembly of cytochrome (see citation below), similar to others e.g. P94366|CYDC_BACSU TRANSPORT ATP-BINDING PROTEIN from Bacillus subtilis (567 aa), FASTA scores: opt: 784, E(): 0, (30.1% identity in 535 aa overlap); N-terminal part of AL034355|SCD78_14 from Streptomyces coelicolor (1172 aa), FASTA scores: opt: 1295, E(): 0, (44.6% identity in 534 aa overlap); etc. Also similar to Q11019|Y07D_MYCTU from Mycobacterium tuberculosis (579 aa), FASTA scores: opt: 530, E(): 6.9e-25, (29.1% identity in 530 aa overlap). Contains (PS00211) ABC transporters family signature, and (PS00017) ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="cytochrome' transport transmembrane ATP-binding protein ABC transporter CydD" /protein_id="NP_216137.1" /db_xref="GI:15608759" /db_xref="GOA:O06138" /db_xref="UniProtKB/TrEMBL:O06138" /db_xref="GeneID:885512" /translation="MACGVGISGCAIGSAIVLASIVAGVIDPANPGMAGLRRWLGPLS ILLVLWGLRASIQWLQARLAQRGASAVIADLSGQVLTAVTARRPSQLAAQRDAAAVLI TRGLDGLRPYFTGYLPTLLLAAILTPATVAVIGLYDLKSMAIVVITLPLIPIFMVLIG LATTNPSAAALAAMTAVQARLLDLIAGIPTLRALGRASGPEQRIAELSADHRRSAMAT LRIAFLSALVLELLATLGVALVAVGIGLRLVFGEMSLTAGLTVLLLAPEVYWPLRRVG VQFHAAADGRTAADKAFALLGESPSPTPGRRTVTARGGVIRLERLSVRGRDGRAPYDL TADIEPGRVTVLTGRNGAGKSTTLQAIAGLTAPSSGRITVAGVDVTNLAPAAWWRQLS WLPQRPVLVPGTVRHNLVLLGPVDDLERACAAAGFDAVLDELPRGLDTVLGRGGVGLS LGQRQRLGLARALGSPAAVLLLDEPTAHLDARTEQHVLGAIVERARAGATVLVVAHRQ QVAAAGDRVVEVNSDGFRR" misc_feature complement(1821885..1821929) /gene="cydD" /locus_tag="Rv1621c" /note="PS00211 ABC transporters family signature" misc_feature complement(1822209..1822232) /gene="cydD" /locus_tag="Rv1621c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1823360..1824400) /gene="cydB" /locus_tag="Rv1622c" /db_xref="GeneID:885510" CDS complement(1823360..1824400) /gene="cydB" /locus_tag="Rv1622c" /EC_number="1.10.3.-" /function="INVOLVED IN THE RESPIRATORY CHAIN (AT THE TERMINAL STEP): AEROBIC RESPIRATION. CYTOCHROME D TERMINAL OXIDASE COMPLEX IS THE COMPONENT OF THE AEROBIC RESPIRATORY CHAIN THAT IS SUPPOSED PREDOMINATED WHEN CELLS ARE GROWN AT LOW AERATION [CATALYTIC ACTIVITY: UBIQUINOL-8 + O(2) = UBIQUINONE-8 + H(2)O]." /note="Rv1622c, (MTCY01B2.14c), len: 346 aa. Probable cydB, cytochrome D ubiquinol oxidase subunit II (EC 1.10.3.-), integral membrane protein, similar to others e.g. P11027|CYDB_ECOLI CYTOCHROME D UBIQUINOL OXIDASE SUBUNIT II from Escherichia coli strain K12 (379 aa), FASTA scores: opt: 519, E(): 0, (32.3% identity in 372 aa overlap); P94365|CYDB_BACSU CYTOCHROME D UBIQUINOL OXIDASE SUBUNIT II from Bacillus subtilis (338 aa), FASTA scores: opt: 824, E(): 0, (39.5% identity in 337 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane cytochrome D ubiquinol oxidase (subunit II) cydB (cytochrome bd-I oxidase subunit II)" /protein_id="NP_216138.1" /db_xref="GI:15608760" /db_xref="GOA:O06139" /db_xref="UniProtKB/TrEMBL:O06139" /db_xref="GeneID:885510" /translation="MVLQELWFGVIAALFLGFFILEGFDFGVGMLMAPFAHVGMGDPE THRRTALNTIGPVWDGNEVWLITAGAAIFAAFPGWYATVFSALYLPLLAILFGMILRA VAIEWRGKIDDPKWRTGADFGIAAGSWLPALLWGVAFAILVRGLPVDANGHVALSIPD VLNAYTLLGGLATAGLFSLYGAVFIALKTSGPIRDDAYRFAVWLSLPVAGLVAGFGLW TQLAYGKDWTWLVLAVAGCAQAAATVLVWRRVSDGWAFMCTLIVVAAVVVLLFGALYP NLVPSTLNPQWSLTIHNASSTPYTLKIMTWVTAFFAPLTVAYQTWTYWVFRQRISAER IPPPTGLARRAP" gene complement(1824430..1825887) /gene="cydA" /locus_tag="Rv1623c" /db_xref="GeneID:885446" CDS complement(1824430..1825887) /gene="cydA" /locus_tag="Rv1623c" /EC_number="1.10.3.-" /function="INVOLVED IN THE RESPIRATORY CHAIN (AT THE TERMINAL STEP): AEROBIC RESPIRATION. CYTOCHROME D TERMINAL OXIDASE COMPLEX IS THE COMPONENT OF THE AEROBIC RESPIRATORY CHAIN THAT IS SUPPOSED PREDOMINATED WHEN CELLS ARE GROWN AT LOW AERATION [CATALYTIC ACTIVITY: UBIQUINOL-8 + O(2) = UBIQUINONE-8 + H(2)O]." /note="Rv1623c, (MTCY01B2.15c), len: 485 aa. Probable cydA (previously known as appC, but renamed cydA to conform with Mycobacterium smegmatis nomenclature), cytochrome D ubiquinol oxidase subunit I (EC 1.10.3.-), integral membrane protein, similar to others e.g. P26459|APPC_ECOLI|CYXA|CBDA|B0978 CYTOCHROME BD-II OXIDASE SUBUNIT I from Escherichia coli strain K12 (514 aa), FASTA scores: opt: 870, E(): 0, (35.9% identity in 485 aa overlap); AL034355|SCD78_12 from Streptomyces coelicolor (501 aa), FASTA scores: opt: 1099, E(): 0, (48.6% identity in 510 aa overlap); etc.; appC" /codon_start=1 /transl_table=11 /product="integral membrane cytochrome D ubiquinol oxidase (subunit I) cydA (cytochrome bd-I oxidase subunit I)" /protein_id="YP_177824.1" /db_xref="GI:57116893" /db_xref="GOA:Q7D892" /db_xref="UniProtKB/TrEMBL:Q7D892" /db_xref="GeneID:885446" /translation="MNVVDISRWQFGITTVYHFIFVPLTIGLAPLIAVMQTLWVVTDN PAWYRLTKFFGKLFLINFAIGVATGIVQEFQFGMNWSEYSRFVGDVFGAPLAMEGLAA FFFESTFIGLWIFGWNRLPRLVHLACIWIVAIAVNVSAFFIIAANSFMQHPVGAHYNP TTGRAELSSIVVLLTNNTAQAAFTHTVSGALLTAGTFVAAVSAWWLVRSSTTHADSDT QAMYRPATILGCWVALAATAGLLFTGDHQGKLMFQQQPMKMASAESLCDTQTDPNFSV LTVGRQNNCDSLTRVIEVPYVLPFLAEGRISGVTLQGIRDLQQEYQQRFGPNDYRPNL FVTYWSFRMMIGLMAIPVLFALIALWLTRGGQIPNQRWFSWLALLTMPAPFLANSAGW VFTEMGRQPWVVVPNPTGDQLVRLTVKAGVSDHSATVVATSLLMFTLVYAVLAVIWCW LLKRYIVEGPLEHDAEPAAHGAPRDDEVAPLSFAY" gene complement(1825998..1826585) /locus_tag="Rv1624c" /db_xref="GeneID:885135" CDS complement(1825998..1826585) /locus_tag="Rv1624c" /function="UNKNOWN" /note="Rv1624c, (MTCY01B2.16c), len: 195 aa. Probable membrane protein, first start taken. Some similarity to Rv3155 nuoK, NADH dehydrogenase chain K from M. tuberculosis. Also similar to AAK72093.1|AF196488 hypothetical protein from Mycobacterium smegmatis (205 aa). Identities = 117/195 (60%)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216140.1" /db_xref="GI:15608762" /db_xref="UniProtKB/TrEMBL:O06141" /db_xref="GeneID:885135" /translation="MCHTAPMEPSPVVSPLPRLLPHLWKSTLASGILSLILGVLVLAW PGISILVAAMAFGVYLLITGVAQVAFAFSLHVSAGGRILLFISGAASLILAVLAFRHF GDAVLLLAIWIGIGFIFRGVATTVSAISDPMLPGRGWSIFVGVISLIAGIVVMASPFE SIWILALVVGIWLVVIGTCEIASSFAIRKASQTLG" gene complement(1826614..1827945) /gene="cya" /locus_tag="Rv1625c" /db_xref="GeneID:888538" CDS complement(1826614..1827945) /gene="cya" /locus_tag="Rv1625c" /EC_number="4.6.1.1" /function="INVOLVED IN cAMP SYNTHESIS [CATALYTIC ACTIVITY: ATP = 3',5'-CYCLIC AMP + DIPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv1625c, (MT1661, MTCY01B2.17c), len: 418 aa. cya, membrane-anchored adenylyl cyclase (EC 4.6.1.1) (see citations below). C-terminal half is similar to region in numerous eukaryotic adenylate and guanylate cyclases. N-terminal half hydrophobic. FASTA score: CYG2_RAT|P22717 guanylate cyclase soluble, beta-2 chain (682 aa), FASTA scores: opt: 552, E(): 2.7e-26, (40.3% identity in 226 aa overlap). Some similarity to Rv2435c|MTCY428.11 from Mycobacterium tuberculosis (730 aa), E(): 7e-19. Start changed since first submission (+25 aa). BELONGS TO ADENYLYL CYCLASE CLASS-4/GUANYLYL CYCLASE FAMILY." /codon_start=1 /transl_table=11 /product="membrane-anchored adenylyl cyclase Cya (ATP pyrophosphate-lyase) (adenylate cyclase)" /protein_id="NP_216141.2" /db_xref="GI:57116894" /db_xref="GOA:O30820" /db_xref="UniProtKB/Swiss-Prot:O30820" /db_xref="GeneID:888538" /translation="MAARKCGAPPIAADGSTRRPDCVTAVRTQARAPTQHYAESVARR QRVLTITAWLAVVVTGSFALMQLATGAGGWYIALINVFTAVTFAIVPLLHRFGGLVAP LTFIGTAYVAIFAIGWDVGTDAGAQFFFLVAAALVVLLVGIEHTALAVGLAAVAAGLV IALEFLVPPDTGLQPPWAMSVSFVLTTVSACGVAVATVWFALRDTARAEAVMEAEHDR SEALLANMLPASIAERLKEPERNIIADKYDEASVLFADIVGFTERASSTAPADLVRFL DRLYSAFDELVDQHGLEKIKVSGDSYMVVSGVPRPRPDHTQALADFALDMTNVAAQLK DPRGNPVPLRVGLATGPVVAGVVGSRRFFYDVWGDAVNVASRMESTDSVGQIQVPDEV YERLKDDFVLRERGHINVKGKGVMRTWYLIGRKVAADPGEVRGAEPRTAGV" gene complement(1828015..1828088) /locus_tag="Rvnt20" /note="tRNA-Leu(CAA)" /db_xref="GeneID:2700456" tRNA complement(1828015..1828088) /locus_tag="Rvnt20" /product="tRNA-Leu" /note="codon recognized: UUG" /anticodon=(pos:1828052..1828054,aa:Leu) /db_xref="GeneID:2700456" gene 1828180..1828797 /locus_tag="Rv1626" /db_xref="GeneID:885080" CDS 1828180..1828797 /locus_tag="Rv1626" /function="Sensor part of a two component regulatory system" /experiment="experimental evidence, no additional details recorded" /note="Rv1626, (MTCY01B2.18), len: 205 aa. Probable two-component response system transcriptional regulator, similar to many e.g. CHEY_BACSU|P24072 chemotaxis protein chey homolog (119 aa), FASTA scores: opt: 283, E(): 1.6e-16, (43.0% identity in 114 aa overlap). Also similar to AL109732|SC7H2_27 hypothetical protein from Streptomyces coelicolor (218 aa), opt: 880, E(): 0, (69.4% identity in 196 aa overlap)." /codon_start=1 /transl_table=11 /product="two-component system transcriptional regulator" /protein_id="NP_216142.1" /db_xref="GI:15608764" /db_xref="GOA:O06143" /db_xref="UniProtKB/TrEMBL:O06143" /db_xref="GeneID:885080" /translation="MTGPTTDADAAVPRRVLIAEDEALIRMDLAEMLREEGYEIVGEA GDGQEAVELAELHKPDLVIMDVKMPRRDGIDAASEIASKRIAPIVVLTAFSQRDLVER ARDAGAMAYLVKPFSISDLIPAIELAVSRFREITALEGEVATLSERLETRKLVERAKG LLQTKHGMTEPDAFKWIQRAAMDRRTTMKRVAEVVLETLGTPKDT" gene complement(1828865..1830073) /locus_tag="Rv1627c" /db_xref="GeneID:885064" CDS complement(1828865..1830073) /locus_tag="Rv1627c" /function="Thought to be involved in lipid metabolism." /note="Rv1627c, (MTCY01B2.19c), len: 402 aa. Probable nonspecific lipid-transfer protein, similar to many lipid carrier proteins e.g. Q51797 ACETYL CoA SYNTHASE from Pyrococcus furiosus (388 aa), FASTA scores: opt: 400, E(): 3.2e-18, (34.4% identity in 407 aa overlap); etc. Also some similarity to Mycobacterium tuberculosis proteins Rv3523, Rv3540c, Rv0244, Rv2790c, Rv1323, etc." /codon_start=1 /transl_table=11 /product="lipid-transfer protein" /protein_id="NP_216143.1" /db_xref="GI:15608765" /db_xref="UniProtKB/TrEMBL:O06144" /db_xref="GeneID:885064" /translation="MRMSAPEPVYILGAGMHPWGKWGNDFTEYGVVAARAALRDAGVD WRHVQLVAGADTIRNGYPGFVAGATFAQKLGWTGVPVSSSYAACASGSQALQSARAQI LAGFCDVALVIGADTTPKGFFAPVGGERKGDPDWQRFHLIGATNTVYFALLARRRMDL YGATVEDFAQVKVKNSRHGLDNPNARYRKENSIDDVLASPVVSDPLRLLDICATSDGA AALIVASKSFTEKHLGSVAGVPSVRAISTVTPKYPQHLPELPDIATDSTAAVPAPERV FKDQILDAAYAEAGIGPEDLSLAEVYDLSTALELDWYEHLGLCPKGEAEALLRSGATT LGGRVPVNPSGGLACFGEAIPAQAIAQVCELTWQLRGQATGRQVADAKVGVTANQGLF GHGSSVIVAR" gene complement(1830070..1830561) /locus_tag="Rv1628c" /db_xref="GeneID:885289" CDS complement(1830070..1830561) /locus_tag="Rv1628c" /function="UNKNOWN" /note="Rv1628c, (MTCY01B2.20c), len: 163 aa. Conserved hypothetical protein, some similarity to others e.g. Q51796 ACAC PROTEIN in Pyrococcus furiosus (136 aa), FASTA scores: opt: 199, E(): 4.6e-06, (34.7% identity in 121 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216144.1" /db_xref="GI:15608766" /db_xref="UniProtKB/TrEMBL:O06145" /db_xref="GeneID:885289" /translation="MPEVTREEPAIDGWFTTDKAGNPHLLGGKCPQCGTYVFPPRADN CPNPACGSDTLESVGLSTRGKLWSYTENRYAPPPPYPAPDPFEPFAVAAVELADEGLI VLGKVVDGTLAADLKVGMEMELTTMPLFADDDGVQRIVYAWRIPSRAGDDAERSDAEE RRR" repeat_region complement(1830074..1830125) /note="52 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 1830665..1833379 /gene="polA" /locus_tag="Rv1629" /db_xref="GeneID:885074" CDS 1830665..1833379 /gene="polA" /locus_tag="Rv1629" /EC_number="2.7.7.7" /function="INVOLVED IN POST-INCISION EVENTS. IN ADDITION TO DNA POLYMERASE ACTIVITY, THIS DNA POLYMERASE EXHIBITS 3' TO 5' AND 5' TO 3' EXONUCLEASE ACTIVITY [CATALYTIC ACTIVITY : N DEOXYNUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {DNA}(N)]." /experiment="experimental evidence, no additional details recorded" /note="has 3'-5' exonuclease, 5'-3' exonuclease and 5'-3'polymerase activities, primarily functions to fill gaps during DNA replication and repair" /codon_start=1 /transl_table=11 /product="DNA polymerase I" /protein_id="NP_216145.1" /db_xref="GI:15608767" /db_xref="GOA:Q07700" /db_xref="UniProtKB/Swiss-Prot:Q07700" /db_xref="GeneID:885074" /translation="MVTTASAPSEDRAKPTLMLLDGNSLAFRAFYALPAENFKTRGGL TTNAVYGFTAMLINLLRDEAPTHIAAAFDVSRQTFRLQRYPEYKANRSSTPDEFAGQI DITKEVLGALGITVLSEPGFEADDLIATLATQAENEGYRVLVVTGDRDALQLVSDDVT VLYPRKGVSELTRFTPEAVVEKYGLTPRQYPDFAALRGDPSDNLPGIPGVGEKTAAKW IAEYGSLRSLVDNVDAVRGKVGDALRANLASVVRNRELTDLVRDVPLAQTPDTLRLQP WDRDHIHRLFDDLEFRVLRDRLFDTLAAAGGPEVDEGFDVRGGALAPGTVRQWLAEHA GDGRRAGLTVVGTHLPHGGDATAMAVAAADGEGAYLDTATLTPDDDAALAAWLADPAK PKALHEAKAAVHDLAGRGWTLEGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA ETPQQQQLSLLDDDDTDAETIQTTILRARAVIDLADALDAELARIDSTALLGEMELPV QRVLAKMESAGIAVDLPMLTELQSQFGDQIRDAAEAAYGVIGKQINLGSPKQLQVVLF DELGMPKTKRTKTGYTTDADALQSLFDKTGHPFLQHLLAHRDVTRLKVTVDGLLQAVA ADGRIHTTFNQTIAATGRLSSTEPNLQNIPIRTDAGRRIRDAFVVGDGYAELMTADYS QIEMRIMAHLSGDEGLIEAFNTGEDLHSFVASRAFGVPIDEVTGELRRRVKAMSYGLA YGLSAYGLSQQLKISTEEANEQMDAYFARFGGVRDYLRAVVERARKDGYTSTVLGRRR YLPELDSSNRQVREAAERAALNAPIQGSAADIIKVAMIQVDKALNEAQLASRMLLQVH DELLFEIAPGERERVEALVRDKMGGAYPLDVPLEVSVGYGRSWDAAAH" misc_feature 1832849..1832908 /gene="polA" /locus_tag="Rv1629" /note="PS00447 DNA polymerase family A signature" gene 1833542..1834987 /gene="rpsA" /locus_tag="Rv1630" /db_xref="GeneID:885188" CDS 1833542..1834987 /gene="rpsA" /locus_tag="Rv1630" /function="BINDS MRNA; THUS FACILITATING RECOGNITION OF THE INITIATION POINT. IT IS NEEDED TO TRANSLATE MRNA WITH A SHORT SHINE-DALGARNO (SD) PURINE-RICH SEQUENCE." /experiment="experimental evidence, no additional details recorded" /note="in Escherichia coli this protein is involved in binding to the leader sequence of mRNAs and is itself bound to the 30S subunit; autoregulates expression via a C-terminal domain; in most gram negative organisms this protein is composed of 6 repeats of the S1 domain while in gram positive there are 4 repeats; the S1 nucleic acid-binding domain is found associated with other proteins" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S1" /protein_id="NP_216146.1" /db_xref="GI:15608768" /db_xref="GOA:O06147" /db_xref="UniProtKB/Swiss-Prot:O06147" /db_xref="GeneID:885188" /translation="MPSPTVTSPQVAVNDIGSSEDFLAAIDKTIKYFNDGDIVEGTIV KVDRDEVLLDIGYKTEGVIPARELSIKHDVDPNEVVSVGDEVEALVLTKEDKEGRLIL SKKRAQYERAWGTIEALKEKDEAVKGTVIEVVKGGLILDIGLRGFLPASLVEMRRVRD LQPYIGKEIEAKIIELDKNRNNVVLSRRAWLEQTQSEVRSEFLNNLQKGTIRKGVVSS IVNFGAFVDLGGVDGLVHVSELSWKHIDHPSEVVQVGDEVTVEVLDVDMDRERVSLSL KATQEDPWRHFARTHAIGQIVPGKVTKLVPFGAFVRVEEGIEGLVHISELAERHVEVP DQVVAVGDDAMVKVIDIDLERRRISLSLKQANEDYTEEFDPAKYGMADSYDEQGNYIF PEGFDAETNEWLEGFEKQRAEWEARYAEAERRHKMHTAQMEKFAAAEAAGRGADDQSS ASSAPSEKTAGGSLASDAQLAALREKLAGSA" gene 1835013..1836236 /gene="coaE" /locus_tag="Rv1631" /db_xref="GeneID:885165" CDS 1835013..1836236 /gene="coaE" /locus_tag="Rv1631" /EC_number="2.7.1.24" /function="CATALYZES THE PHOSPHORYLATION OF THE 3'-HYDROXYL GROUP OF DEPHOSPHOCOENZYME A TO FORM COENZYME A [CATALYTIC ACTIVITY: ATP + dephospho-CoA <=> ADP + CoA]." /note="catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form coenzyme A; involved in coenzyme A biosynthesis" /codon_start=1 /transl_table=11 /product="dephospho-CoA kinase/unknown domain fusion protein" /protein_id="NP_216147.1" /db_xref="GI:15608769" /db_xref="GOA:P63826" /db_xref="UniProtKB/Swiss-Prot:P63826" /db_xref="GeneID:885165" /translation="MLRIGLTGGIGAGKSLLSTTFSQCGGIVVDGDVLAREVVQPGTE GLASLVDAFGRDILLADGALDRQALAAKAFRDDESRGVLNGIVHPLVARRRSEIIAAV SGDAVVVEDIPLLVESGMAPLFPLVVVVHADVELRVRRLVEQRGMAEADARARIAAQA SDQQRRAVADVWLDNSGSPEDLVRRARDVWNTRVQPFAHNLAQRQIARAPARLVPADP SWPDQARRIVNRLKIACGHKALRVDHIGSTAVSGFPDFLAKDVIDIQVTVESLDVADE LAEPLLAAGYPRLEHITQDTEKTDARSTVGRYDHTDSAALWHKRVHASADPGRPTNVH LRVHGWPNQQFALLFVDWLAANPGAREDYLTVKCDADRRADGELARYVTAKEPWFLDA YQRAWEWADAVHWRP" misc_feature 1835034..1835057 /gene="coaE" /locus_tag="Rv1631" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1836387..1836830) /locus_tag="Rv1632c" /db_xref="GeneID:885649" CDS complement(1836387..1836830) /locus_tag="Rv1632c" /function="UNKNOWN" /note="Rv1632c, (MTCY01B2.24c), len: 147 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216148.1" /db_xref="GI:15608770" /db_xref="UniProtKB/TrEMBL:O06149" /db_xref="GeneID:885649" /translation="MRAVDEYTVHPWGLYLARPTPGRAQFHYLESWLLPSLGLRATVF HFNPSHKRDHDYYLDVGEYTPGPSVWRSEDHYLDIEVRTGGGAELADVDELLDAVRHG LLTPTVAEQAVRHAVDAVEGLARNGYDLTRWLATKGMELTWRSGS" gene 1837075..1839171 /gene="uvrB" /locus_tag="Rv1633" /db_xref="GeneID:885249" CDS 1837075..1839171 /gene="uvrB" /locus_tag="Rv1633" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. THE ABC EXCISION NUCLEASE IS A DNA REPAIR ENZYME THAT CATALYZES THE EXCISION REACTION OF UV-DAMAGED NUCLEOTIDE SEGMENTS PRODUCING OLIGOMERS HAVING THE MODIFIED BASE(S). UVRB STIMULATES THE ATPASE ACTIVITY OF UVRA IN THE PRESENCE OF UV-IRRADIATED DOUBLE-STRANDED DNA. IT ALSO ENHANCES THE ABILITY OF UVRA TO BIND TO UV-IRRADIATED DUPLEX DNA" /experiment="experimental evidence, no additional details recorded" /note="The UvrABC repair system catalyzes the recognition and processing of DNA lesions. The beta-hairpin of the Uvr-B subunit is inserted between the strands, where it probes for the presence of a lesion" /codon_start=1 /transl_table=11 /product="excinuclease ABC subunit B" /protein_id="NP_216149.1" /db_xref="GI:15608771" /db_xref="GOA:P67422" /db_xref="UniProtKB/Swiss-Prot:P67422" /db_xref="GeneID:885249" /translation="MRAGGHFEVVSPHAPAGDQPAAIDELERRINAGERDVVLLGATG TGKSATTAWLIERLQRPTLVMAPNKTLAAQLANELREMLPHNAVEYFVSYYDYYQPEA YIAQTDTYIEKDSSINDDVERLRHSATSALLSRRDVVVVASVSCIYGLGTPQSYLDRS VELKVGEEVPRDGLLRLLVDVQYTRNDMSFTRGSFRVRGDTVEIIPSYEELAVRIEFF GDEIEALYYLHPLTGEVIRQVDSLRIFPATHYVAGPERMAHAVSAIEEELAERLAELE SQGKLLEAQRLRMRTNYDIEMMRQVGFCSGIENYSRHIDGRGPGTPPATLLDYFPEDF LLVIDESHVTVPQIGGMYEGDISRKRNLVEYGFRLPSACDNRPLTWEEFADRIGQTVY LSATPGPYELSQTGGEFVEQVIRPTGLVDPKVVVKPTKGQIDDLIGEIRTRADADQRV LVTTLTKKMAEDLTDYLLEMGIRVRYLHSEVDTLRRVELLRQLRLGDYDVLVGINLLR EGLDLPEVSLVAILDADKEGFLRSSRSLIQTIGRAARNVSGEVHMYADKITDSMREAI DETERRRAKQIAYNEANGIDPQPLRKKIADILDQVYREADDTAVVEVGGSGRNASRGR RAQGEPGRAVSAGVFEGRDTSAMPRAELADLIKDLTAQMMAAARDLQFELAARFRDEI ADLKRELRGMDAAGLK" misc_feature 1837195..1837218 /gene="uvrB" /locus_tag="Rv1633" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1839168..1840583 /locus_tag="Rv1634" /db_xref="GeneID:885115" CDS 1839168..1840583 /locus_tag="Rv1634" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF DRUG ACROSS THE MEMBRANE (EXPORT). DRUG RESISTANCE BY AN EXPORT MECHANISM (CONFERES RESISTANCE TO TOXIC COMPOUNDS BY REMOVING THEM FOR THE CELLS)." /note="Rv1634, (MTCY01B2.26), len: 471 aa. Possible drug efflux membrane protein of major facilitator superfamily (MFS), similar to many antibiotic resistance (efflux) proteins. FASTA best: Q56175 TU22 DTDP-GLUCOSE DEHYDRTATASE (GRAE) from Streptomyces violaceoruber (557 aa), opt: 415, E(): 1.7e-17, (26.7% identity in 446 aa overlap). Relatives in Mycobacterium tuberculosis: MTCY369.27c, E(): 4.8e-12; MTCY20B11.14c, E(): 2.9e-10." /codon_start=1 /transl_table=11 /product="drug efflux membrane protein" /protein_id="NP_216150.1" /db_xref="GI:15608772" /db_xref="GOA:O06151" /db_xref="UniProtKB/TrEMBL:O06151" /db_xref="GeneID:885115" /translation="MTETASETGSWRELLSRYLGTSIVLAGGVALYATNEFLTISLLP STIADIGGSRLYAWVTTLYLVGSVVAATTVNTMLLRVGARSSYLMGLAVFGLASLVCA AAPSMQILVAGRTLQGIAGGLLAGLGYALINSTLPKSLWTRGSALVSAMWGVATLIGP ATGGLFAQLGLWRWAFGVMTLLTALMAMLVPVALGAGGVGPGGETPVGSTHKVPVWSL LLMGAAALAISVAALPNYLVQTAGLLAAAALLVAVFVVVDWRIHAAVLPPSVFGSGPL KWIYLTMSVQMIAAMVDTYVPLFGQRLGHLTPVAAGFLGAALAVGWTVGEVASASLNS ARVIGHVVAAAPLVMASGLALGAVTQRADAPVGIIALWALALLIIGTGIGIAWPHLTV RAMDSVADPAESSAAAAAINVVQLISGAFGAGLAGVVVNTAKGGEVAAARGLYMAFTV LAAAGVIASYQATHRDRRLPR" gene complement(1840572..1842242) /locus_tag="Rv1635c" /db_xref="GeneID:885100" CDS complement(1840572..1842242) /locus_tag="Rv1635c" /function="UNKNOWN" /note="Rv1635c, (MTCY01B2.27c), len: 556 aa. Probable conserved transmembrane protein, equivalent to CAC31770.1|AL583921 Mycobacterium leprae membrane protein (527 aa), Identities = 332/527 (62%)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216151.1" /db_xref="GI:15608773" /db_xref="UniProtKB/TrEMBL:O06152" /db_xref="GeneID:885100" /translation="MHASRPGAPPHAGLPSRRTAGDQDHRADPKVTRIMSASTLEQPA AAHVDELVARMRGRLLDPLAIAVLAAVISGAWASRPSLWFDEGATISASASRTLPELW SLLGHIDAVHGLYYLLMHGWFAIFPPTELWSRLPSCLAIGAAAAGVVVFAKQFSGRTT AVCAGAVFAILPRVTWAGIEARSSALSVAAAVWLTVLLVAAVRCNTQRRWLLYALVLM LSILVSINLALLVPAYATMVPLLASGKSRKSPVIWWTVVTAAALGAMTPFILFAHGQV WQVGWIAGLNRNIILDVIHRQYFDHSVPFAILAGLIVAAGIAAHLAGARGPGGDTHRL VLVSAAWIVVPTAVVLIYSATVEPIYYPRYLILTAPAAAVILAVCVVTIARKPWLIAG VVFLLAAAAFPNYFFTQRGPYAKEGWDYSQVADVISAHAKPGDCLLVDNTAGWRPGPI RALLATRPAAFRSLIDVERGTYGPKVGTLWDGHVAVWLTTAKIDKCPTLWTIANRDKS LPDHQVGEMLSPGTGFGRTPVYRFPSYLGFRIVERWQFHYSQVVKSTR" gene 1842451..1842891 /gene="TB15.3" /locus_tag="Rv1636" /db_xref="GeneID:885473" CDS 1842451..1842891 /gene="TB15.3" /locus_tag="Rv1636" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1636, (MTCY01B2.28), len: 146 aa. TB15.3, iron-regulated conserved hypothetical protein (see citations below), similar to other hypothetical proteins from diverse organisms e.g. Q57951|MJ0531|Y531_METJA from Methanococcus jannaschii (170 aa), FASTA scores: opt: 188, E(): 6e-06, (32.2% identity in 149 aa overlap); also P42297|YXIE_BACSU hypothetical 15.9 kDa protein in bglh-wapa intergenic region precursor from Bacillus subtilis (148 aa), FASTA scores: opt: 162, E(): 0.00025, (30.8% identity in 156 aa overlap). Part of family of Mycobacterium tuberculosis hypothetical proteins (but lacks C-terminal region) including Rv2005c, Rv2623, Rv2026c, Rv1996, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216152.1" /db_xref="GI:15608774" /db_xref="GOA:O06153" /db_xref="UniProtKB/TrEMBL:O06153" /db_xref="GeneID:885473" /translation="MSAYKTVVVGTDGSDSSMRAVDRAAQIAGADAKLIIASAYLPQH EDARAADILKDESYKVTGTAPIYEILHDAKERAHNAGAKNVEERPIVGAPVDALVNLA DEEKADLLVVGNVGLSTIAGRLLGSVPANVSRRAKVDVLIVHTT" gene complement(1842898..1843692) /locus_tag="Rv1637c" /db_xref="GeneID:885132" CDS complement(1842898..1843692) /locus_tag="Rv1637c" /function="UNKNOWN" /note="Rv1637c, (MTCY01B2.29c,MTCY06H11.01c), len: 264 aa. Conserved hypothetical protein, some similarity to others e.g. P05446|GLO2_RHOBL PROBABLE HYDROXYACYLGLUTATHIONE HYDROLASE (EC 3.1.2.6) (255 aa), FASTA scores: opt: 252, E(): 2e-09, (39.0% identity in 146 aa overlap). Also similar to Q9Z505|AL035591|SCC54.20 putative hydrolase from Streptomyces coelicolor (218 aa), FASTA scores: opt: 732, E(): 0, (52.3% identity in 220 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins and putative glyoxylases e.g. Rv0634c, Rv3677c, Rv2581c, Rv2260." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216153.1" /db_xref="GI:15608775" /db_xref="UniProtKB/TrEMBL:O06154" /db_xref="GeneID:885132" /translation="MLCARTDNHQGTGNVVTSAHMTRANDDDAGAAGIGAVAHMTTVD DNYTGHVERGKAARRFLPGATILKASVGPMDNNAYLVTCSATGETLLIDAANDAEVLI DLVRRYAPKLALIVTSHQHFDHWQALQAVAAATGAPTAAHPIDADPLPVKPDRLLTHG DSVRIGELTFDVIHLRGHTPGSIALALGGPVTGGVTQLFTGDCLFPGGVGKTWQPADF TQLLDDVTTRVFDVYADSTVIYPGHGDDTELGAERPSLSEWRARGW" gene 1843741..1846659 /gene="uvrA" /locus_tag="Rv1638" /db_xref="GeneID:885685" CDS 1843741..1846659 /gene="uvrA" /locus_tag="Rv1638" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. THE ABC EXCISION NUCLEASE IS A DNA REPAIR ENZYME THAT CATALYZES THE EXCISION REACTION OF UV-DAMAGED NUCLEOTIDE SEGMENTS PRODUCING OLIGOMERS HAVING THE MODIFIED BASE(S). UVRA IS AN ATPASE AND A DNA-BINDING PROTEIN THAT PREFERENTIALLY BINDS SINGLE-STRANDED OR UV-IRRADIATED DOUBLE-STRANDED DNA." /experiment="experimental evidence, no additional details recorded" /note="The UvrABC repair system catalyzes the recognition and processing of DNA lesions. UvrA is an ATPase and a DNA-binding protein. A damage recognition complex composed of 2 uvrA and 2 uvrB subunits scans DNA for abnormalities. When the presence of a lesion has been verified by uvrB, the uvrA molecules dissociate" /codon_start=1 /transl_table=11 /product="excinuclease ABC subunit A" /protein_id="NP_216154.1" /db_xref="GI:15608776" /db_xref="GOA:P63380" /db_xref="UniProtKB/Swiss-Prot:P63380" /db_xref="GeneID:885685" /translation="MADRLIVKGAREHNLRSVDLDLPRDALIVFTGLSGSGKSSLAFD TIFAEGQRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTNRNPRSTVGTI TEVYDYLRLLYARAGTPHCPTCGERVARQTPQQIVDQVLAMPEGTRFLVLAPVVRTRK GEFADLFDKLNAQGYSRVRVDGVVHPLTDPPKLKKQEKHDIEVVVDRLTVKAAAKRRL TDSVETALNLADGIVVLEFVDHELGAPHREQRFSEKLACPNGHALAVDDLEPRSFSFN SPYGACPECSGLGIRKEVDPELVVPDPDRTLAQGAVAPWSNGHTAEYFTRMMAGLGEA LGFDVDTPWRKLPAKARKAILEGADEQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMS QTESEQMKERYEGFMRDVPCPVCAGTRLKPEILAVTLAGESKGEHGAKSIAEVCELSI ADCADFLNALTLGPREQAIAGQVLKEIRSRLGFLLDVGLEYLSLSRAAATLSGGEAQR IRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLIETLTRLRDLGNTLIVVEHDEDTIE HADWIVDIGPGAGEHGGRIVHSGPYDELLRNKDSITGAYLSGRESIEIPAIRRSVDPR RQLTVVGAREHNLRGIDVSFPLGVLTSVTGVSGSGKSTLVNDILAAVLANRLNGARQV PGRHTRVTGLDYLDKLVRVDQSPIGRTPRSNPATYTGVFDKIRTLFAATTEAKVRGYQ PGRFSFNVKGGRCEACTGDGTIKIEMNFLPDVYVPCEVCQGARYNRETLEVHYKGKTV SEVLDMSIEEAAEFFEPIAGVHRYLRTLVDVGLGYVRLGQPAPTLSGGEAQRVKLASE LQKRSTGRTVYILDEPTTGLHFDDIRKLLNVINGLVDKGNTVIVIEHNLDVIKTSDWI IDLGPEGGAGGGTVVAQGTPEDVAAVPASYTGKFLAEVVGGGASAATSRSNRRRNVSA" misc_feature 1843834..1843857 /gene="uvrA" /locus_tag="Rv1638" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1845241..1845285 /gene="uvrA" /locus_tag="Rv1638" /note="PS00211 ABC transporters family signature" misc_feature 1845700..1845723 /gene="uvrA" /locus_tag="Rv1638" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 1846267..1846311 /gene="uvrA" /locus_tag="Rv1638" /note="PS00211 ABC transporters family signature" gene complement(1846716..1846973) /locus_tag="Rv1638A" /db_xref="GeneID:3205103" CDS complement(1846716..1846973) /locus_tag="Rv1638A" /function="UNKNOWN" /note="Rv1638A, len: 85 aa. Conserved hypothetical protein, similar to C-terminal part of P31511|35KD_MYCTU 35kd immunogenic protein from Mycobacterium tuberculosis (270 aa), FASTA scores: opt: 159, E(): 0.002, (50.90% identity in 55 aa overlap); and to Mycobacterium leprae ML0981 possible pseudogene, an orthologue of 35kd immunogenic protein from Mycobacterium tuberculosis. Size difference suggests possible gene fragment." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177650.1" /db_xref="GI:57116895" /db_xref="UniProtKB/TrEMBL:Q8VJZ6" /db_xref="GeneID:3205103" /translation="MPDEPTPPEATTPNSESDPRYDSAGVPTFESVREKIETRYGTAL GATELDAESPQGRRLEDQYAQRQRAAAERLAQIRESMHTDE" gene complement(1846989..1848458) /locus_tag="Rv1639c" /db_xref="GeneID:885642" CDS complement(1846989..1848458) /locus_tag="Rv1639c" /function="UNKNOWN" /note="Rv1639c, (MTCY06H11.03c), len: 489 aa. Conserved hypothetical membrane protein. Some similarity to P35866|YLI2_CORGL Hypothetical 45.7 kDa protein from Corynebacterium glutamicum (426 aa), FASTA scores: opt: 511, E( ): 2.4e-23, (28.9% identity in 370 aa overlap). Contains PS00904 protein phenyltransferases alpha subunit repeat signature" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216155.1" /db_xref="GI:15608777" /db_xref="GOA:P94973" /db_xref="UniProtKB/TrEMBL:P94973" /db_xref="GeneID:885642" /translation="MAQNELVTASTPPAATQPLAVGHTSLMHGWVPLAVQVVTAVVLV LAAGWRSRHWQRRWLPTAAAIGATLAWGTRWYVTGNGLANERPPSTLWIWVALTGAAA TVLILGWRSARWWRRGASLLAVPLCLLSATLTLNLWVGYFPTVQTAWNQLTSGPLPDQ ADQAAVAALAHSGVRPSHGTLLPVVIPSDASHFKHRGELVYLPPAWFDREHRSENPPP PQLPTVMMIGGQFNTPADWARAGNAVKTLDDFAAAHSGNAPVVVFVDSGGAFNNDTEC VNGRRGNAADHLTKDVVPYMVSKFGVSPEQTSWGIVGWSMGGTCAVDLTVMHPTLFSA FVDIAGDFYPNAGNKTQTIVRLFGGNEDAWSAFDPTTVITRHGSYTGLSGWFAISSPG PPSPDNAVADTTTMRLAGRDAAANPGNQAAAANALCALGRANGIYCAVVPQPGKHDWP FADRVFAAALPWLAGQLATPGVPKIPLPGTTQQIAGTGR" misc_feature complement(1848003..1848032) /locus_tag="Rv1639c" /note="PS00904 Protein prenyltransferases alpha subunit repeat signature" gene complement(1848517..1852035) /gene="lysS" /locus_tag="Rv1640c" /db_xref="GeneID:885428" CDS complement(1848517..1852035) /gene="lysS" /locus_tag="Rv1640c" /EC_number="6.1.1.6" /function="charging Lys tRNA [CATALYTIC ACTIVITY: ATP + L-LYSINE + TRNA(LYS) = AMP + DIPHOSPHATE + L-LYSYL-TRNA(LYS)]" /note="catalyzes a two-step reaction, first charging a lysine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="lysyl-tRNA synthetase" /protein_id="NP_216156.1" /db_xref="GI:15608778" /db_xref="GOA:P94974" /db_xref="UniProtKB/Swiss-Prot:P94974" /db_xref="GeneID:885428" /translation="MGLHLTVPGLRRDGRGVQSNSHDTSSKTTADISRCPQHTDAGLQ RAATPGISRLLGISSRSVTLTKPRSATRGNSRYHWVPAAAGWTVGVIATLSLLASVSP LIRWIIKVPREFINDYLFNFPDTNFAWSFVLALLAAALTARKRIAWLVLLANMVLAAV VNAAEIAAGGNTAAESFGENLGFAVHVVAIVVLVLGYREFWAKVRRGALFRAAAVWLA GAVVGIVASWGLVELFPGSLAPDERLGYAANRVVGFALADPDLFTGRPHVFLNAIFGL FGAFALIGAAIVLFLSQRADNALTGEDESAIRGLLDLYGKDDSLGYFATRRDKSVVFA SSGRACITYRVEVGVCLASGDPVGDHRAWPQAVDAWLRLCQTYGWAPGVMGASSQGAQ TYREAGLTALELGDEAILRPADFKLSGPEMRGVRQAVTRARRAGLTVRIRRHRDIAED EMAQTITRADSWRDTETERGFSMALGRLGDPADSDCLLVEAIDPHNQVLAMLSLVPWG TTGVSLDLMRRSPQSPNGTIELMVSELALHAESLGITRISLNFAVFRAAFEQGAQLGA GPVARLWRGLLVFFSRWWQLETLYRSNMKYQPEWVPRYACYEDARVIPRVGVASVIAE GFLVLPFSRRNRVHTGHHPAVPERLAATGLLHHDGSAPDVSGLRQVGLTNGDGVERRL PEQVRVRFDKLEKLRSSGIDAFPVGRPPSHTVAQALAADHQASVSVSGRIMRIRNYGG VLFAQLRDWSGEMQVLLDNSRLDQGCAADFNAATDLGDLVEMTGHMGASKTGTPSLIV SGWRLIGKCLRPLPNKWKGLLDPEARVRTRYLDLAVNAESRALITARSSVLRAVRETL FAKGFVEVETPILQQLHGGATARPFVTHINTYSMDLFLRIAPELYLKRLCVGGVERVF ELGRAFRNEGVDFSHNPEFTLLEAYQAHADYLEWIDGCRELIQNAAQAANGAPIAMRP RTDKGSDGTRHHLEPVDISGIWPVRTVHDAISEALGERIDADTGLTTLRKLCDAAGVP YRTQWDAGAVVLELYEHLVECRTEQPTFYIDFPTSVSPLTRPHRSKRGVAERWDLVAW GIELGTAYSELTDPVEQRRRLQEQSLLAAGGDPEAMELDEDFLQAMEYAMPPTGGLGM GIDRVVMLITGRSIRETLPFPLAKPH" misc_feature complement(1848574..1848603) /gene="lysS" /locus_tag="Rv1640c" /note="PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2" misc_feature complement(1849225..1849278) /gene="lysS" /locus_tag="Rv1640c" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1" gene 1852273..1852878 /gene="infC" /locus_tag="Rv1641" /db_xref="GeneID:885478" CDS 1852273..1852878 /gene="infC" /locus_tag="Rv1641" /function="IF-3 BINDS TO THE 30S RIBOSOMAL SUBUNIT AND SHIFTS THE EQUILIBRUM BETWEEN 70S RIBOSOMES AND THEIR 50S AND 30S SUBUNITS IN FAVOR OF THE FREE SUBUNITS, THUS ENHANCING THE AVAILABILITY OF 30S SUBUNITS ON WHICH PROTEIN SYNTHESIS INITIATION BEGINS." /experiment="experimental evidence, no additional details recorded" /note="IF-3 has several functions that are required and promote translation initiation including; preventing association of 70S by binding to 30S; monitoring codon-anticodon interactions by promoting disassociation of fMet-tRNA(fMet) from initiation complexes formed on leaderless mRNAs or incorrectly bound noninitiatior tRNAs and complexes with noncanonical start sites; stimulates codon-anticodon interactions at P-site; involved in moving mRNA to the P-site; and in recycling subunits" /codon_start=1 /transl_table=11 /product="translation initiation factor IF-3" /protein_id="NP_216157.1" /db_xref="GI:15608779" /db_xref="GOA:P65135" /db_xref="UniProtKB/Swiss-Prot:P65135" /db_xref="GeneID:885478" /translation="MSTETRVNERIRVPEVRLIGPGGEQVGIVRIEDALRVAADADLD LVEVAPNARPPVCKIMDYGKYKYEAAQKARESRRNQQQTVVKEQKLRPKIDDHDYETK KGHVVRFLEAGSKVKVTIMFRGREQSRPELGYRLLQRLGADVADYGFIETSAKQDGRN MTMVLAPHRGAKTRARARHPGEPAGGPPPKPTAGDSKAAPN" gene 1852928..1853122 /gene="rpmI" /locus_tag="Rv1642" /db_xref="GeneID:885145" CDS 1852928..1853122 /gene="rpmI" /locus_tag="Rv1642" /function="translation" /experiment="experimental evidence, no additional details recorded" /note="Rv1642, (MTCY06H11.06), len: 64 aa. Probable rpmI, 50S ribosomal protein L35, similar to several e.g. RL35_SYNY3|P48959 from Synechocystis sp. (67 aa), fasta scores: opt: 179, E(): 2.7e-08, (51.6% identity in 64 aa overlap). BELONGS TO THE L35P FAMILY OF RIBOSOMAL PROTEINS." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L35" /protein_id="NP_216158.1" /db_xref="GI:15608780" /db_xref="GOA:P66271" /db_xref="UniProtKB/Swiss-Prot:P66271" /db_xref="GeneID:885145" /translation="MPKAKTHSGASKRFRRTGTGKIVRQKANRRHLLEHKPSTRTRRL DGRTVVAANDTKRVTSLLNG" gene 1853184..1853573 /gene="rplT" /locus_tag="Rv1643" /db_xref="GeneID:885455" CDS 1853184..1853573 /gene="rplT" /locus_tag="Rv1643" /function="THIS PROTEIN BINDS DIRECTLY TO 23S RIBOSOMAL RNA AND IS NECESSARY TO THE IN VITRO ASSEMBLY PROCESS OF THE 50S RIBOSOMAL SUBUNIT; IT IS NOT INVOLVED IN THE PROTEIN SYNTHESIZING FUNCTIONS OF THAT SUBUNIT" /note="binds directly to 23S ribosomal RNA prior to in vitro assembly of the 50S ribosomal subunit" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L20" /protein_id="NP_216159.1" /db_xref="GI:15608781" /db_xref="GOA:P66105" /db_xref="UniProtKB/Swiss-Prot:P66105" /db_xref="GeneID:885455" /translation="MARVKRAVNAHKKRRSILKASRGYRGQRSRLYRKAKEQQLHSLN YAYRDRRARKGEFRKLWIARINAAARLNDITYNRLIQGLKAAGVEVDRKNLADIAISD PAAFTALVDVARAALPEDVNAPSGEAA" misc_feature 1853343..1853393 /gene="rplT" /locus_tag="Rv1643" /note="PS00937 Ribosomal protein L20 signature" gene 1853606..1854388 /gene="tsnR" /locus_tag="Rv1644" /db_xref="GeneID:885163" CDS 1853606..1854388 /gene="tsnR" /locus_tag="Rv1644" /EC_number="2.1.1.-" /function="rRNA modification" /experiment="experimental evidence, no additional details recorded" /note="Rv1644, (MTCY06H11.08), len: 260 aa. Possible tsnR, 23S rRNA methyltransferase (EC 2.1.1.-), similar to several e.g. TSNR_STRLU|P52393 from Streptomyces laurentii (270 aa), FASTA scores: opt: 276, E(): 3.6e-11, (27.6% identity in 261 aa overlap). Also similar to M. tuberculosis hypothetical proteins Rv0881, Rv3579c, and Rv0380c." /codon_start=1 /transl_table=11 /product="23S rRNA methyltransferase TsnR" /protein_id="NP_216160.1" /db_xref="GI:15608782" /db_xref="GOA:P94978" /db_xref="UniProtKB/TrEMBL:P94978" /db_xref="GeneID:885163" /translation="MLTERSARVATAVKLHRHVGRRRAGRFLAEGPNLVAAALARGLV REVFVTEVAARRHELLLAAHEASVHLVTERAAKALSDTVTPAGLVAVCDLPATRLEDV LAGSPQLIAVTVEIREPGNAGTVIRIADAMGAAAVILAGRSVDPYNGKCLRASTGSIF AIPVVVAPDVGAAIADLRAAGLQVLATAVDGEMALDDADRLLAEPTAWLFGPEAHGLS AEIAALADHRVHILMSGGAESLNVAAAAAICLYESARALGRR" gene complement(1854399..1855454) /locus_tag="Rv1645c" /db_xref="GeneID:885287" CDS complement(1854399..1855454) /locus_tag="Rv1645c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1645c, (MTCY06H11.10c), len: 351 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. O53837|Rv0826|MTV043.18 (351 aa), FASTA scores: (57.5% identity in 299 aa overlap); Q10519|Rv2237|YM37_MYCTU (255 aa), O53682|Rv0276 (306 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216161.1" /db_xref="GI:15608783" /db_xref="UniProtKB/TrEMBL:P94979" /db_xref="GeneID:885287" /translation="MTVASRTSADPLGPDSLTWKYFGDLRTGMMGVWIGAIQNMYPEL GAGVEEHSILLREPLQRVARSVYPIMGVVYDGDRAAQTGQQIKGYHRTIKGVDAEGRR YHALNPDTFYWAHATFFMLVIKVAEYFCGGLTEAEKHQLFEEHVRWYRMYGMSMRPVP KSWEDFQDYWDRVCRDKLEINQATVDILQMRIPKPRFVLMPTPIWDQLFKPLIAGQRW IAAGLFDPAVREKAGMHWTPGDEVLLRVFGKVVELAFLAVPDEIRLHPRALAAYRRAA GRTRHDAPLVQAPGFMAPPRDRQGLPMHYFPPRSHRFTRSALDPAKALMERAGALVHS TLSLAGVRPARGPSRAA" gene 1855764..1856696 /gene="PE17" /locus_tag="Rv1646" /db_xref="GeneID:885486" CDS 1855764..1856696 /gene="PE17" /locus_tag="Rv1646" /function="UNKNOWN" /note="Rv1646, (MTCY06H11.11), len: 310 aa. Member of the Mycobacterium tuberculosis PE family of proteins (see citation below), similar to many e.g. YW36_MYCTU|Q10873 hypothetical 53.7 kd protein cy39.36c (558 aa), FASTA scores, opt: 411, E(): 1.3e-15, (34.4% identity in 320 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177825.1" /db_xref="GI:57116896" /db_xref="UniProtKB/TrEMBL:Q7D879" /db_xref="GeneID:885486" /translation="MSFLTVAPDMVTAAAGNLESVGSALNEAAAAAAPATVGLAAPAA DRVSAVVAAMLGAYARDFQGISAQIAGFHNQFVGALRGGAAAYASAEAANVQQTVVNA VNAPAQALLGHPLIGPETVGSSAAAVSFGFGPLLLAGSDPLLAVPFSYPASLPTPFGP VTMTLNGSFDPLTQQVVFDSGSLTAPAPFVYGLGAVGPALTTMTALQNSGTAFSGAVQ SGNLLGAAGALLQAPGNAVTGFLFGQTAISQSIPGPSNLGYESVGISVPVGGLLAPLQ PVTVTLTPTSGMPTAIQLSGTQFGGLLPALLNGF" gene 1856774..1857724 /locus_tag="Rv1647" /db_xref="GeneID:885432" CDS 1856774..1857724 /locus_tag="Rv1647" /function="UNKNOWN" /note="Rv1647, (MTCY06H11.12), len: 316 aa. Conserved hypothetical protein, some similarity to other Mycobacterium tuberculosis hypothetical proteins e.g. Q11055|Rv1264|YC64_MYCTU Hypothetical 42.2 kDa protein (397 aa), FASTA scores: opt: 197, E(): 9.4e-06, (27.1% identity in 181 aa overlap) and Q10400|Rv2212|YM12_MYCTU (378 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216163.1" /db_xref="GI:15608785" /db_xref="GOA:P94982" /db_xref="UniProtKB/TrEMBL:P94982" /db_xref="GeneID:885432" /translation="MAGSARTTYPCHVEVGPQDSESGAPDETATAMASPVPRQRSALR WLRTVNRSPGLVSFIHRARRLLPGDPEFGDPLSTAGEGGPRAAARAADRLLRDRDAAS REVGLSVLQVWQALTEAVSRRPANPEVTLVFTDLVGFSTWSLHAGDDATLTLLRQVAR AVESPLLDAGGHIVKRLGDGIMAVFRNPTVALRAVLVAQDAVKSLEVQGYTPRMRIGI HTGRPQRLAADWLGVDVNIAARVMERATKGGIMISQPTLDLIPQSELDALGVVARRVR KPVFASKPTGIPPDLAIYRIKTVSESTAADNFDEMSPDAQ" gene 1857731..1858537 /locus_tag="Rv1648" /db_xref="GeneID:885065" CDS 1857731..1858537 /locus_tag="Rv1648" /function="UNKNOWN" /note="Rv1648, (MTCY06H11.13), len: 268 aa. Probable transmembrane protein, some similarity to Rv3434c|MTCY77.06C (237 aa), FASTA scores: E(): 0.00039, (31.4% identity in 194 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216164.1" /db_xref="GI:15608786" /db_xref="UniProtKB/TrEMBL:P94983" /db_xref="GeneID:885065" /translation="MIYRVACLLARIRFTVGYVAALASVSTTILMHGPQVHAQVIRHA STNLHNLAHGHLGTLWNSAFVIDEGPLYFWLPCLACLLAVAELQLRSLRLTVAFVVGH IGATLLVAAVLAGAIEIGWLPWSISRVSDVGMSYGALAALGALTAAIPGRWRPAWIGW WVSLGLATATIGGGFTDAGHTVALLLGMLVTACFTRPARWTLGRCALLAVASGFCLVL LAHSWWSLVSGSALGLLGALGAAGFARWTRARATSLPPGALAIPQPALSR" gene 1858733..1859758 /gene="pheS" /locus_tag="Rv1649" /db_xref="GeneID:885105" CDS 1858733..1859758 /gene="pheS" /locus_tag="Rv1649" /EC_number="6.1.1.20" /function="charging phe tRNA [CATALYTIC ACTIVITY : ATP + L-PHENYLALANINE + TRNA(PHE) = AMP + DIPHOSPHATE + L-PHENYLALANYL-TRNA(PHE)]" /note="catalyzes a two-step reaction, first charging a phenylalanine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; forms a heterotetramer of alpha(2)beta(2); binds two magnesium ions per tetramer; type 1 subfamily" /codon_start=1 /transl_table=11 /product="phenylalanyl-tRNA synthetase subunit alpha" /protein_id="NP_216165.1" /db_xref="GI:15608787" /db_xref="GOA:P94984" /db_xref="UniProtKB/Swiss-Prot:P94984" /db_xref="GeneID:885105" /translation="MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALA RQALAVLPKEQRAEAGKRVNAARNAAQRSYDERLATLRAERDAAVLVAEGIDVTLPST RVPAGARHPIIMLAEHVADTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDT FYIAPEDSRQLLRTHTSPVQIRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEG LAVDRGLSMAHLRGTLDAFARAEFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGAA WVEWGGCGMVHPNVLRATGIDPDLYSGFAFGMGLERTLQFRNGIPDMRDMVEGDVRFS LPFGVGA" misc_feature 1859330..1859383 /gene="pheS" /locus_tag="Rv1649" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1" gene 1859758..1862253 /gene="pheT" /locus_tag="Rv1650" /db_xref="GeneID:885283" CDS 1859758..1862253 /gene="pheT" /locus_tag="Rv1650" /EC_number="6.1.1.20" /function="charging phe-tRNA [CATALYTIC ACTIVITY : ATP + L-PHENYLALANINE + TRNA(PHE) = AMP + DIPHOSPHATE + L-PHENYLALANYL-TRNA(PHE)]" /note="catalyzes a two-step reaction, first charging a phenylalanine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; forms a tetramer of alpha(2)beta(2); binds two magnesium ions per tetramer; type 2 subfamily" /codon_start=1 /transl_table=11 /product="phenylalanyl-tRNA synthetase subunit beta" /protein_id="NP_216166.1" /db_xref="GI:15608788" /db_xref="GOA:P94985" /db_xref="UniProtKB/Swiss-Prot:P94985" /db_xref="GeneID:885283" /translation="MRLPYSWLREVVAVGASGWDVTPGELEQTLLRIGHEVEEVIPLG PVDGPVTVGRVADIEELTGYKKPIRACAVDIGDRQYREIICGATNFAVGDLVVVALPG ATLPGGFTISARKAYGRNSDGMICSAAELNLGADHSGILVLPPGAAEPGADGAGVLGL DDVVFHLAITPDRGYCMSVRGLARELACAYDLDFVDPASNSRVPPLPIEGPAWPLTVQ PETGVRRFALRPVIGIDPAAVSPWWLQRRLLLCGIRATCPAVDVTNYVMLELGHPMHA HDRNRISGTLGVRFARSGETAVTLDGIERKLDTADVLIVDDAATAAIGGVMGAASTEV RADSTDVLLEAAIWDPAAVSRTQRRLHLPSEAARRYERTVDPAISVAALDRCARLLAD IAGGEVSPTLTDWRGDPPCDDWSPPPIRMGVDVPDRIAGVAYPQGTTARRLAQIGAVV THDGDTLTVTPPSWRPDLRQPADLVEEVLRLEGLEVIPSVLPPAPAGRGLTAGQQRRR TIGRSLALSGYVEILPTPFLPAGVFDLWGLEADDSRRMTTRVLNPLEADRPQLATTLL PALLEALVRNVSRGLVDVALFAIAQVVQPTEQTRGVGLIPVDRRPTDDEIAMLDASLP RQPQHVAAVLAGLREPRGPWGPGRPVEAADAFEAVRIIARASRVDVTLRPAQYLPWHP GRCAQVFVGESSVGHAGQLHPAVIERSGLPKGTCAVELNLDAIPCSAPLPAPRVSPYP AVFQDVSLVVAADIPAQAVADAVRAGAGDLLEDIALFDVFTGPQIGEHRKSLTFALRF RAPDRTLTEDDASAARDAAVQSAAERVGAVLRG" gene complement(1862347..1865382) /gene="PE_PGRS30" /locus_tag="Rv1651c" /db_xref="GeneID:885174" CDS complement(1862347..1865382) /gene="PE_PGRS30" /locus_tag="Rv1651c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN VIRULENCE." /note="Rv1651c, (MTCY06H11.16c), len: 1011 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citations below), similar to many e.g. Q10637|Y03A_MYCTU hypothetical glycine-rich 49.6 kd protein (603 aa), FASTA scores: opt: 1757, E(): 0, (50.8% identity in 714aa overlap). The transcription of this CDS seems to be activated in macrophages (see Ramakrishnan et al., 2000)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177826.1" /db_xref="GI:57116897" /db_xref="UniProtKB/TrEMBL:Q79FL8" /db_xref="GeneID:885174" /translation="MSFLLVEPDLVTAAAANLAGIRSALSEAAAAASTPTTALASAGA DEVSAAVSRLFGAYGQQFQALNARAATFHAEFVSLLNGGAAAYTGAEAASVSSMQALL DAVNAPTQTLLGRPLIGNGADGVAGTGSNAGGNGGPGGILYGNGGNGGAGGNGGAAGL IGNGGAGGAGGAGGAGGAGGAGGTGGLLYGNGGAGGNGGSAAAAGGAGGNALLFGNGG NGGSGASGGAAGHAGTIFGNGGNAGAGSGLAGADGGLFGNGGDGGSSTSKAGGAGGNA LFGNGGDGGSSTVAAGGAGGNTLVGNGGAGGAGGTSGLTGSGVAGGAGGSVGLWGSGG AGGDGGAATSLLGVGMNAGAGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLFDSGVGGA GGAGGNASLFGNGGTGGVGGKGGTSSDLASATSGAGGAGGAGGVGGLLYGNGGNGGAG GIGGAAINILANAGAGGAGGAAGSSFIGNGGNGGAGGAGGAAALFSSGVGGAGGSGGT ALLLGSGGAGGNGGTGGANSGSLFASPGGTGGAGGHGGAGGLIWGNGGAGGNGGNGGT TADGALEGGTGGIGGTGGSAIAFGNGGQGGAGGTGGDHSGGNGIGGKGGASGNGGNAG QVFGDGGTGGTGGAGGAGSGTKAGGTGSDGGHGGNATLIGNGGDGGAGGAGGAGSPAG APGNGGTGGTGGVLFGQSGSSGPPGAAALAFPSLSSSVPILGPYEDLIANTVANLASI GNTWLADPAPFLQQYLANQFGYGQLTLTALTDATRDFAIGLAGIPPSLQSALQALAAG DVSGAVTDVLGAVVKVFVSGVDASDLSNILLLGPVGDLFPILSIPGAMSQNFTNVVMT VTDTTIAFSIDTTNLTGVMTFGLPLAMTLNAVGSPITTAIAFAESTTAFVSAVQAGNL QAAAAALVGAPANVANGFLNGEARLPLALPTSATGGIPVTVEVPVGGILAPLQPFQAT AVIPVIGPVTVTLEGTPAGGIVPALVNYAPTQLAQAIAP" gene 1865576..1866634 /gene="argC" /locus_tag="Rv1652" /db_xref="GeneID:885278" CDS 1865576..1866634 /gene="argC" /locus_tag="Rv1652" /EC_number="1.2.1.38" /function="INVOLVED IN ARGININE BIOSYNTHESIS (AT THE THIRD STEP) [CATALYTIC ACTIVITY : N-ACETYL-L-GLUTAMATE 5-SEMIALDEHYDE + NADP(+) + PHOSPHATE = N-ACETYL-5-GLUTAMYL PHOSPHATE + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the reduction of N-acetyl-5-glutamyl phosphate to N-acetyl-L-glutamate 5-semialdehyde in arginine biosynthesis and the reduction of N-acetyl-gamma-aminoadipyl-phosphate to N-acetyl-L-aminoadipate-semialdehyde in lysine biosynthesis; involved in both the arginine and lysine biosynthetic pathways; lysine is produced via the AAA pathway, lysine from alpha-aminoadipate" /codon_start=1 /transl_table=11 /product="N-acetyl-gamma-glutamyl-phosphate reductase" /protein_id="NP_216168.1" /db_xref="GI:15608790" /db_xref="GOA:P63562" /db_xref="UniProtKB/Swiss-Prot:P63562" /db_xref="GeneID:885278" /translation="MQNRQVANATKVAVAGASGYAGGEILRLLLGHPAYADGRLRIGA LTAATSAGSTLGEHHPHLTPLAHRVVEPTEAAVLGGHDAVFLALPHGHSAVLAQQLSP ETLIIDCGADFRLTDAAVWERFYGSSHAGSWPYGLPELPGARDQLRGTRRIAVPGCYP TAALLALFPALAADLIEPAVTVVAVSGTSGAGRAATTDLLGAEVIGSARAYNIAGVHR HTPEIAQGLRAVTDRDVSVSFTPVLIPASRGILATCTARTRSPLSQLRAAYEKAYHAE PFIYLMPEGQLPRTGAVIGSNAAHIAVAVDEDAQTFVAIAAIDNLVKGTAGAAVQSMN LALGWPETDGLSVVGVAP" gene 1866631..1867845 /gene="argJ" /locus_tag="Rv1653" /db_xref="GeneID:885125" CDS 1866631..1867845 /gene="argJ" /locus_tag="Rv1653" /EC_number="2.3.1.35" /EC_number="2.3.1.1" /function="ARGININE BIOSYNTHESIS." /note="bifunctional arginine biosynthesis protein ArgJ; functions at the 1st and 5th steps in arginine biosynthesis; involved in synthesis of acetylglutamate from glutamate and acetyl-CoA and ornithine by transacetylation between acetylornithine and glutmate" /codon_start=1 /transl_table=11 /product="bifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein" /protein_id="NP_216169.1" /db_xref="GI:15608791" /db_xref="GOA:P63571" /db_xref="UniProtKB/Swiss-Prot:P63571" /db_xref="GeneID:885125" /translation="MTDLAGTTRLLRAQGVTAPAGFRAAGVAAGIKASGALDLALVFN EGPDYAAAGVFTRNQVKAAPVLWTQQVLTTGRLRAVILNSGGANACTGPAGFADTHAT AEAVAAALSDWGTETGAIEVAVCSTGLIGDRLPMDKLLAGVAHVVHEMHGGLVGGDEA AHAIMTTDNVPKQVALHHHDNWTVGGMAKGAGMLAPSLATMLCVLTTDAAAEPAALER ALRRAAAATFDRLDIDGSCSTNDTVLLLSSGASEIPPAQADLDEAVLRVCDDLCAQLQ ADAEGVTKRVTVTVTGAATEDDALVAARQIARDSLVKTALFGSDPNWGRVLAAVGMAP ITLDPDRISVSFNGAAVCVHGVGAPGAREVDLSDADIDITVDLGVGDGQARIRTTDLS HAYVEENSAYSS" gene 1867842..1868726 /gene="argB" /locus_tag="Rv1654" /db_xref="GeneID:888076" CDS 1867842..1868726 /gene="argB" /locus_tag="Rv1654" /EC_number="2.7.2.8" /function="ARGININE BIOSYNTHESIS (SECOND STEP) [CATALYTIC ACTIVITY : ATP + N-ACETYL-L-GLUTAMATE = ADP + N-ACETYL-L-GLUTAMATE 5-PHOSPHATE]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the phosphorylation of N-acetyl-L-glutamate to form N-acetyl-L-glutamate 5-phosphate" /codon_start=1 /transl_table=11 /product="acetylglutamate kinase" /protein_id="NP_216170.1" /db_xref="GI:15608792" /db_xref="GOA:P94989" /db_xref="UniProtKB/Swiss-Prot:P94989" /db_xref="GeneID:888076" /translation="MSRIEALPTHIKAQVLAEALPWLKQLHGKVVVVKYGGNAMTDDT LRRAFAADMAFLRNCGIHPVVVHGGGPQITAMLRRLGIEGDFKGGFRVTTPEVLDVAR MVLFGQVGRELVNLINAHGPYAVGITGEDAQLFTAVRRSVTVDGVATDIGLVGDVDQV NTAAMLDLVAAGRIPVVSTLAPDADGVVHNINADTAAAAVAEALGAEKLLMLTDIDGL YTRWPDRDSLVSEIDTGTLAQLLPTLESGMVPKVEACLRAVIGGVPSAHIIDGRVTHC VLVELFTDAGTGTKVVRG" gene 1868723..1869925 /gene="argD" /locus_tag="Rv1655" /db_xref="GeneID:885187" CDS 1868723..1869925 /gene="argD" /locus_tag="Rv1655" /EC_number="2.6.1.11" /function="ARGININE BIOSYNTHESIS (FOURTH STEP) [CATALYTIC ACTIVITY :N2-ACETYL-L-ORNITHINE + 2-OXOGLUTARATE = N-ACETYL-L-GLUTAMATE 5-SEMIALDEHYDE + L-GLUTAMATE]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of N-acetyl-l-glutamate 5-semialdehyde from 2-oxoglutarate and N(2)-acetyl-L-ornithine" /codon_start=1 /transl_table=11 /product="acetylornithine aminotransferase" /protein_id="NP_216171.1" /db_xref="GI:15608793" /db_xref="GOA:P63568" /db_xref="UniProtKB/Swiss-Prot:P63568" /db_xref="GeneID:885187" /translation="MTGASTTTATMRQRWQAVMMNNYGTPPIALASGDGAVVTDVDGR TYIDLLGGIAVNVLGHRHPAVIEAVTRQMSTLGHTSNLYATEPGIALAEELVALLGAD QRTRVFFCNSGAEANEAAFKLSRLTGRTKLVAAHDAFHGRTMGSLALTGQPAKQTPFA PLPGDVTHVGYGDVDALAAAVDDHTAAVFLEPIMGESGVVVPPAGYLAAARDITARRG ALLVLDEVQTGMGRTGAFFAHQHDGITPDVVTLAKGLGGGLPIGACLAVGPAAELLTP GLHGSTFGGNPVCAAAALAVLRVLASDGLVRRAEVLGKSLRHGIEALGHPLIDHVRGR GLLLGIALTAPHAKDAEATARDAGYLVNAAAPDVIRLAPPLIIAEAQLDGFVAALPAI LDRAVGAP" misc_feature 1869383..1869496 /gene="argD" /locus_tag="Rv1655" /note="PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site" gene 1869922..1870845 /gene="argF" /locus_tag="Rv1656" /db_xref="GeneID:885462" CDS 1869922..1870845 /gene="argF" /locus_tag="Rv1656" /EC_number="2.1.3.3" /function="INVOLVED IN ARGININE BIOSYNTHESIS [CATALYTIC ACTIVITY : CARBAMOYL PHOSPHATE + L-ORNITHINE = PHOSPHATE + L-CITRULLINE.]" /note="catalyzes the formation of L-citrulline from carbamoyl phosphate and L-ornithine in arginine biosynthesis and degradation" /codon_start=1 /transl_table=11 /product="ornithine carbamoyltransferase" /protein_id="NP_216172.1" /db_xref="GI:15608794" /db_xref="GOA:P94991" /db_xref="UniProtKB/Swiss-Prot:P94991" /db_xref="GeneID:885462" /translation="MIRHFLRDDDLSPAEQAEVLELAAELKKDPVSRRPLQGPRGVAV IFDKNSTRTRFSFELGIAQLGGHAVVVDSGSTQLGRDETLQDTAKVLSRYVDAIVWRT FGQERLDAMASVATVPVINALSDEFHPCQVLADLQTIAERKGALRGLRLSYFGDGANN MAHSLLLGGVTAGIHVTVAAPEGFLPDPSVRAAAERRAQDTGASVTVTADAHAAAAGA DVLVTDTWTSMGQENDGLDRVKPFRPFQLNSRLLALADSDAIVLHCLPAHRGDEITDA VMDGPASAVWDEAENRLHAQKALLVWLLERS" misc_feature 1870057..1870080 /gene="argF" /locus_tag="Rv1656" /note="PS00097 Aspartate and ornithine carbamoyltransferases signature" gene 1870842..1871354 /gene="argR" /locus_tag="Rv1657" /db_xref="GeneID:885091" CDS 1870842..1871354 /gene="argR" /locus_tag="Rv1657" /function="REGULATES ARGININE BIOSYNTHESIS GENES" /note="regulates arginine biosynthesis when complexed with arginine by binding at site that overlap the promotors of the arginine biosynthesis genes" /codon_start=1 /transl_table=11 /product="arginine repressor" /protein_id="NP_216173.1" /db_xref="GI:15608795" /db_xref="GOA:P94992" /db_xref="UniProtKB/Swiss-Prot:P94992" /db_xref="GeneID:885091" /translation="MSRAKAAPVAGPEVAANRAGRQARIVAILSSAQVRSQNELAALL AAEGIEVTQATLSRDLEELGAVKLRGADGGTGIYVVPEDGSPVRGVSGGTDRMARLLG ELLVSTDDSGNLAVLRTPPGAAHYLASAIDRAALPQVVGTIAGDDTILVVAREPTTGA QLAGMFENLR" gene 1871363..1872559 /gene="argG" /locus_tag="Rv1658" /db_xref="GeneID:885645" CDS 1871363..1872559 /gene="argG" /locus_tag="Rv1658" /EC_number="6.3.4.5" /function="ARGININE BIOSYNTHESIS [CATALYTIC ACTIVITY : ATP + L-CITRULLINE + L-ASPARTATE = AMP + DIPHOSPHATE + L-ARGININOSUCCINATE]" /note="catalyzes the formation of 2-N(omega)-(L-arginino)succinate from L-citrulline and L-aspartate in arginine biosynthesis, AMP-forming" /codon_start=1 /transl_table=11 /product="argininosuccinate synthase" /protein_id="NP_216174.1" /db_xref="GI:15608796" /db_xref="GOA:P63642" /db_xref="UniProtKB/Swiss-Prot:P63642" /db_xref="GeneID:885645" /translation="MSERVILAYSGGLDTSVAISWIGKETGREVVAVAIDLGQGGEHM DVIRQRALDCGAVEAVVVDARDEFAEGYCLPTVLNNALYMDRYPLVSAISRPLIVKHL VAAAREHGGGIVAHGCTGKGNDQVRFEVGFASLAPDLEVLAPVRDYAWTREKAIAFAE ENAIPINVTKRSPFSIDQNVWGRAVETGFLEHLWNAPTKDIYAYTEDPTINWGVPDEV IVGFERGVPVSVDGKPVSMLAAIEELNRRAGAQGVGRLDVVEDRLVGIKSREIYEAPG AMVLITAHTELEHVTLERELGRFKRQTDQRWAELVYDGLWYSPLKAALEAFVAKTQEH VSGEVRLVLHGGHIAVNGRRSAESLYDFNLATYDEGDSFDQSAARGFVYVHGLSSKLA ARRDLR" misc_feature 1871384..1871410 /gene="argG" /locus_tag="Rv1658" /note="PS00564 Argininosuccinate synthase signature 1" misc_feature 1871711..1871746 /gene="argG" /locus_tag="Rv1658" /note="PS00565 Argininosuccinate synthase signature 2" gene 1872639..1874051 /gene="argH" /locus_tag="Rv1659" /db_xref="GeneID:885365" CDS 1872639..1874051 /gene="argH" /locus_tag="Rv1659" /EC_number="4.3.2.1" /function="ARGININE BIOSYNTHESIS (LAST STEP) [CATALYTIC ACTIVITY : N-(L-ARGININO)SUCCINATE = FUMARATE + L- ARGININE]" /note="catalyzes the formation of arginine from (N-L-arginino)succinate" /codon_start=1 /transl_table=11 /product="argininosuccinate lyase" /protein_id="NP_216175.1" /db_xref="GI:15608797" /db_xref="GOA:P94994" /db_xref="UniProtKB/Swiss-Prot:P94994" /db_xref="GeneID:885365" /translation="MSTNEGSLWGGRFAGGPSDALAALSKSTHFDWVLAPYDLTASRA HTMVLFRAGLLTEEQRDGLLAGLDSLAQDVADGSFGPLVTDEDVHAALERGLIDRVGP DLGGRLRAGRSRNDQVAALFRMWLRDAVRRVATGVLDVVGALAEQAAAHPSAIMPGKT HLQSAQPILLAHHLLAHAHPLLRDLDRIVDFDKRAAVSPYGSGALAGSSLGLDPDAIA ADLGFSAAADNSVDATAARDFAAEAAFVFAMIAVDLSRLAEDIIVWSSTEFGYVTLHD SWSTGSSIMPQKKNPDIAELARGKSGRLIGNLAGLLATLKAQPLAYNRDLQEDKEPVF DSVAQLELLLPAMAGLVASLTFNVQRMAELAPAGYTLATDLAEWLVRQGVPFRSAHEA AGAAVRAAEQRGVGLQELTDDELAAISPELTPQVREVLTIEGSVSARDCRGGTAPGRV AEQLNAIGEAAERLRRQLVR" misc_feature 1873479..1873508 /gene="argH" /locus_tag="Rv1659" /note="PS00163 Fumarate lyases signature" misc_feature 1873518..1873541 /gene="argH" /locus_tag="Rv1659" /note="PS00017 ATP/GTP-binding site motif A" gene 1874160..1875221 /gene="pks10" /locus_tag="Rv1660" /db_xref="GeneID:885112" CDS 1874160..1875221 /gene="pks10" /locus_tag="Rv1660" /EC_number="2.3.1.74" /function="Possibly involved in the biosynthesis of Secondary Metabolites [CATALYTIC ACTIVITY: 3 Malonyl-CoA + 4-Coumaroyl-CoA = 4 CoA + Naringenin chalcone + 3 CO2]" /experiment="experimental evidence, no additional details recorded" /note="Rv1660, (MTCY06H11.25), len: 353 aa. Possible pks10, chalcone synthase (EC 2.3.1.74), similar to BCSA_BACSU|P54157 putative chalcone synthase from B. subtilis (365 aa), FASTA scores: opt: 701, E(): 0, (33.1% identity in 362 aa overlap). Also similar to M. tuberculosis Rv1665|pks11 polyketide synthase (chalcone synthase); and Rv1372|pks18 polyketide synthase. Other upstream initiation sites are possible but homology suggests this start." /codon_start=1 /transl_table=11 /product="chalcone synthase" /protein_id="NP_216176.1" /db_xref="GI:15608798" /db_xref="GOA:P94995" /db_xref="UniProtKB/TrEMBL:P94995" /db_xref="GeneID:885112" /translation="MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHAS AKVNSRHLVLPLEKYPKLTDFGEANKIFIEKAVDLGVQALAGALDESGLRPEDLDVLI TATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHDYLRGAPDGVA ALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGVKRAQDIGADGPDILDSRS HLYPDSLRTMGYDVGSAGFELVLSRDLAAVVEQYLGNDVTTFLASHGLSTTDVGAWVT HPGGPKIINAITETLDLSPQALELTWRSLGEIGNLSSASVLHVLRDTIAKPPPSGSPG LMIAMGPGFCSELVLLRWH" gene 1875304..1881684 /gene="pks7" /locus_tag="Rv1661" /db_xref="GeneID:885108" CDS 1875304..1881684 /gene="pks7" /locus_tag="Rv1661" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM" /note="Rv1661, (MTCY06H11.26), len: 2126 aa. Probable pks7, polyketide synthase, similar to many e.g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 (3567 aa), FASTA scores: E(): 0, (48.8% identity in 2131 aa overlap); also similar to Mycobacterium tuberculosis pks12. Contains PS00606 Beta-ketoacyl synthases active site, PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="polyketide synthase pks7" /protein_id="NP_216177.1" /db_xref="GI:15608799" /db_xref="GOA:P94996" /db_xref="UniProtKB/TrEMBL:P94996" /db_xref="GeneID:885108" /translation="MNSTPEDLVKALRRSLKQNERLKRENRDLLARTTEPVAVVGMGC RYPGGVDSPETLWELVAHGRDAVSEFPADRGWDVAGLFDPDPDAVGKSYTRCGGFLTD VAGFDAEFFGIAPSEALAMDPQQRLLLEVSWEALERAGIDPITLRGSQTGVFAGVFHG SYGGQGRVPGDLERYGLRGSTLSVASGRVAYVLGLQGPAVSVDTACSSSLVALHLAVQ SLRLGECDLALVGGVTVMATPAMFIEFSRQRALSADGRCKAYAGAADGTAFAEGAGVL VLARLADARRLGHPVLALVRGSAVNQDGASNGLATPNGPAQQRVITAALASARLGVAD VDVVEGHGTGTTLGDPIEAQAILATYGQRPADRPLWLGSIKSNIGHTSAAAGVAGVIK MVQAMRHGVLPKTLHVDVPTPHVDWSAGAVSLLTEPRPWHVPGRPRRAGVSSFGISGT NAHVILEEAPAVEPVGAAHGNDPVAVPWVLSARSAQALTNQARRLLAWVGADENVRPL DVGWSLVNTRSLFDHRAVVVGADRTQLMEGLTGLAAGVPGADVVAGRAQTVGKTAFVF PGQGAQWLGMGAQLCATAPVFAEHIHRCERALREHVEWSLLDVLRGAPGAPGLDRVDV VQPALWAVMVSLAELWRSVGVVPDAVIGHSQGEIAAAYVAGALSLRDAAAVVALRSRL LVRLGGAGGMVSLACGQPQAEKLASQWGDRLNIAAVNGVSSVVLAGETDAVTELMQRC EAEGIRARRIDVDYASHSAQVDAIREELIAALRGIEPRTSTVAFFSTVTGELMDTAGV NAEYWYRSIRQPVQFERAVRNAFDGGYRVFVESSPHPVLIAGIEETLVDCDRGATGEP IVIPTLGRDDGGVGRFWLSAGQAHVAGVGVDWRAAFADLGGRRVELPTYAFARQRFWL DGLGAVGGDLGGVGLVGAEHGLLAAVVQRPDSGGVVLTGRISVVAAPWLADHAVGPVV LFPGTGFVELALRAGDEVGCSVLQELTLQAPLVLPADGVRVQVVVGGVEQSGTRNVWV YSAAGQADSSPGWTLHAQGVLGVGSVQPAAELSVWPPVGARAMDVADGYQVLAARGYG YGPAFRGLQALWRRGAEVFADVTLPEGVPIRGFGIHPAVLDAALHAWGIVEGEQQTML PFSWQGVCLHASGAARVRVRLAPVGRGAVSVELADPQGLPVLSVRQLMVRPVSAAALS RSTAGDRGLLEMIWTPVPLEGGDIGDDAVVWELPPHAGAQAGGDVLAAVYRGVHEVLE VLQSWLASDATGLGVVVTRGAVGPVDDDVTDLAGAAVWGLVRSAQAEHPGRVVLVDTD GSVAVEDAVGFGARSGEPQLVVRRGRVYAARLAPVAAGLTLPSASAGGWRLVAGGGGT LADVVVAPVAPVELATGQVRVAVGAVGVNFRDVLVALGMYPGGGELGVDGAGVVVEVG PGVTGLAVGDRVMGLLGLVGSEAVVDARLVTMVPAGWSLVEAAAVPVAFLTAFYGLSV LAEVAAGQKVLVHAGTGGVGMAAVSLARYWGAEVFVTASRAKWDTLRAMGFDDIHISD SRSLEFEEAFLRATEGSGVDVVLNSLAGEFTDASLRLLPSGGRFIELGKTDIRDGQTV AERHRGVRYRAFDLVEAGPDRIAAMLSEVVGLLAAGVLARLPVKTFDARCAPAAYRFV SQARHIGKVVLTIPDGPGGQSGLAGGTVVVTGGTGMAGSAVATHLVRRHGVANLVLVS RSGEQADRAAEVAALLREGGAQVAVVSCDVADRDALAALLAGLDPRYPLKGVFHAAGV LDDAVITGLTPDRVDTVLRAKVDGAWNLHELTEDMDLSAFVVFSSMAGIVGTPAQGNY AAANAFLDGLVAYRRSRGLAGLSVAWGLWEQASAMTRHLGERDRARMTQAGLAPLTTE QALGFLDTALQADRAVVVAARLDRAALAGAGAALPALFSQLAAGPTRRRIDAADTAVS MSGLVSRLHALTPERRQRELTDLVISNAAAVLGRSSSVDINAHKAFQDLGFDSLTAVE LRNRLKTATGLTLSPTLIFDYPTPATLAEHLDSRLVTASGSDQQSLSDRVDDITRELV VLLDQPDLSANVKAHLRTRLQTMLTSLTTEDDDIAAATESQLFAILDEELGS" misc_feature 1875892..1875942 /gene="pks7" /locus_tag="Rv1661" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 1877362..1877394 /gene="pks7" /locus_tag="Rv1661" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" misc_feature 1881319..1881366 /gene="pks7" /locus_tag="Rv1661" /note="PS00012 Phosphopantetheine attachment site" gene 1881704..1886512 /gene="pks8" /locus_tag="Rv1662" /db_xref="GeneID:885527" CDS 1881704..1886512 /gene="pks8" /locus_tag="Rv1662" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM" /note="Rv1662, (MTCY275.01-MTCY06H11.27), len: 1602 aa. Probable pks8, polyketide synthase, similar to many polyketide synthases e.g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 from Saccharopolyspora erythraea (Streptomyces erythraeus) (3567 aa), FASTA scores: opt: 3319, E(): 0, (45.8% identity in 1619 aa overlap). Also similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks7 and pks12. Contains PS00606 Beta-ketoacyl synthases active site and PS01162 Quinone oxidoreductase/zeta-crystallin signature. Note that the similarity extends into the downstream ORF Rv1663 (MTCY275.02), and this could be accounted for by a frameshift, although the sequence has been checked and no discrepancy was found." /codon_start=1 /transl_table=11 /product="polyketide synthase pks8" /protein_id="NP_216178.1" /db_xref="GI:15608800" /db_xref="GOA:O65933" /db_xref="UniProtKB/TrEMBL:O65933" /db_xref="GeneID:885527" /translation="MSGTTTHVDYLKRLTADLRRTRRRLSDLEAKLSEPVAVVGMGCR YPGGVDSPETLWELVAQGRDAVSDFPADRGWDVDGLFDPDPDACGKMYTRRGTFLEHA GDFDAGFFGIGPSEALAMDPQQRLLLEVSWEALERTGIDPTKLRGSATGVFAGVIHAG YGGQLSGELEGYGLTGSTLSVASGRVAYVLGLEGPAVSVDTACSSSLVALHLAVQSLR SGECDLALAGGVTVMATPAAFVEFSRQRALARDGRCKVYAGAADGTAWSEGAGVLVVE RLVDARRLGHPVLALVRGSAVNQDGASNGLTAPNGPSQQRVIRAALASARLRAVEVDV VEGHGTGTMLGDPIEAQALLATYGQDRVEPLWLGSIKSNIGHTSAAAGVAGVIKMVQA MRHGVMPKTLHVDVPTPHVDWSVGAVSLLTQPRAWSVHGRPRRAGVSSFGISGTNAHV ILEQAPVVESVVPEVASPTAASAVPWVLSARSEQALAGQAQRLLAFVAANPDLDPIDV GWSLVKTRAMFEHRAVVVGADRGALLAGLAALAAGESGAGVAVGRARSVGKTVFVFPG QGAQWVGMGAQLYAELPLFALAFDAVAEELDRHLRLPLRNVLWEGDEALLTSTEFAQP ALFAIEVALATLLQHWGISPDFLIGHSVGEIAAAHLAGVLSLTDAAGLVAARGRLMAE LPAGGVMVVVAASEEEVLPVLVDGANLAAVNAPHSVVVSGCEAAVSDIADHFARRGRR VHRLAVSHAFHSLLMEPMLAEFTRIAAGISVSKPRIPLVSNVTGQMAGAGYGDGQYWV EHARRPVRFAEGVQLLNAVGATRFVEVGPGGGLTALVEQSLPLGEALSVAMMRREHPE VSSVLGAVATLFTAGAQMDWPAVFGSPGRRIELPTYAFQRQRYWLPPTSAGSADISGV GLLAARHGLLGAVVEQPDSDVVVLTGRLSVGEQRWLADHVIAGVVLLAGAAFVELALR AADQVDCGVVEELTVVTPLVLPTVGGVQLQVVVGVGEMGQRPVSIYSRNAESDSGWVL HARGVLGAKAVAPAADLSVWPPLGAAPVDVDGAYQRFAELGYEYGRAFQGLTAMWRRE SELFADVAVPDDVDVTLSGFGIHPLVLDAALHAMGMVGEQAATMLPFSWQGVSLHAAG ASRVRARIAPAGDGTVSVELADQAGLPVLSVQALVMRSVSSQLLSAAVAAADAAGRGL LEVAWLPVELAHNDISADLVVWELESFQDGVGPVYSATHRVLVALQSWLAQERAGRLV VLTQGSVGQDATNLAGAAVWGLVRSAQAEHPGRVMLVDSDGSMDVGDVIGCGEEQLMI RNGTAYAARLAQLRPQPILQLPDTNSGWRLVAGGAGALEDLTLASCPAKELAPGQVRI EVRALGVNFRDVLVALGIYPGAAELGAEGAGVVTEVGPGVTGLAVGDPVMGLLGVAGS EAVVDARLVVKLPNRWPLTDAAGVPVVFLTAYYALRVLAQVQPGESVLVHAAAGGVGM AAVQLARLWGLEVFATASRGKWDTLHTMGCDNTHVADSRTLAFEETFWLTTEGRGVDV VLNSLAGEFTDASLRLLPRGGRFIEMGKTEFGTPRSLPRTILGWPTGLST" misc_feature 1882283..1882333 /gene="pks8" /locus_tag="Rv1662" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 1886144..1886197 /gene="pks8" /locus_tag="Rv1662" /note="PS01162 Quinone oxidoreductase / zeta-crystallin signature" gene 1886512..1888020 /gene="pks17" /locus_tag="Rv1663" /db_xref="GeneID:885523" CDS 1886512..1888020 /gene="pks17" /locus_tag="Rv1663" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM" /note="Rv1663, (MTCY275.02), len: 502 aa. Probable pks17, polyketide synthase, similar to other polyketide synthases e g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 (3567 aa) from Saccharopolyspora erythraea (Streptomyces erythraeus), FASTA scores: opt: 1207, E(): 0, (43.9% identity in 531 aa overlap). Also similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks7 and pks1. Note that the similarity extends into the upstream ORF Rv1662 (MTCY275.01) and this could be accounted for by a frameshift, although the sequence has been checked and no discrepancy was found. Contains PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="polyketide synthase pks17" /protein_id="NP_216179.1" /db_xref="GI:15608801" /db_xref="GOA:O06585" /db_xref="UniProtKB/TrEMBL:O06585" /db_xref="GeneID:885523" /translation="MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFL SQARHVGKVVLTMPDAWAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGEHT ESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGL TPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDG LAAYRRSRGLAALSVAWGLWEQASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAA LLADRPVVVAARLDRAALAGAELPALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLT PEQRHRELTELVCSNAAIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTL PPTLIFDYPTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPD DKTRLIKRLQAILTDCTAPPASSGPSTTHDDEDITTATESQLFAILDDELGP" misc_feature 1887616..1887663 /gene="pks17" /locus_tag="Rv1663" /note="PS00012 Phosphopantetheine attachment site" gene 1888026..1891079 /gene="pks9" /locus_tag="Rv1664" /db_xref="GeneID:885519" CDS 1888026..1891079 /gene="pks9" /locus_tag="Rv1664" /function="POTENTIALLY INVOLVED IN SOME INTERMEDIATE STEPS FOR THE SYNTHESIS OF A POLYKETIDE MOLECULE WHICH MAY BE INVOLVED IN SECONDARY METABOLISM" /note="Rv1664, (MTCY275.03), len: 1017 aa. Probable pks9, polyketide synthase, similar to OL56_STRAT|Q07017 oleandomycin polyketide synthase, modules 5 and 6 from Streptomyces antibioticus (3519 aa), FASTA scores: opt: 1767, E(): 0, (41.6% identity in 919 aa overlap). Similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks6, pks8, etc. Contains PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="polyketide synthase pks9" /protein_id="NP_216180.1" /db_xref="GI:15608802" /db_xref="GOA:O06586" /db_xref="UniProtKB/TrEMBL:O06586" /db_xref="GeneID:885519" /translation="MQPTGIAIIGLACRFPTVVSPGDLWDLLRDGREAAGSIDNVADF DADFFNLSPREASAMDPRQRLALELTWELLEDAFVVPETLRGQPIAVYLGAMNDDYAV LTLAADRVDHHAFAGTSRAIIANRVSFAFGLRGPSVTIDSGQSSSLVAVHLACESVRT GEAPLAIAGGVHLNLARETAMLEQEFGAVSPSGHTYAFDERADGYVPGDGGGLVLLKP VQAALDDGDRIHAIIRGSAVGNAGHSATGLTVPSVAGQVDVIRRAMSGAGVDCHQVHY VEAHGTGTKIGDPIEARALGEIFAARQRRPVSVGSVKTNIGHTGGAAGIAGLLKAVLA IENAVIPPSLNYVGAAIDLDSLGLRVDTALTPWPVADEPRRAGVSSFGMGGTNAHVIL EQGPTQSPEIVESVAAAGSNAPVAVPWVLAARSPQALTNQAGRLLAHLTADDGLTALD VGWSLVSTRSVFDHRAVVVGADRGRLMAGLAGLAAGEPGAGVVVGRARSVGKTVFVFP GQGSQWLGMGRQLYGRYSVFARAFDEVVAVLDGQLRLSVRQVMWGADAGLLESTEFAQ PALFVVQVALAALLQDWGVLPDLVMGHSVGEIAAAYVAGALSLVDAARVVAARGRLMQ ALPAGGVMVAVAASEDEVAPLLTEGVCIAAVNAPESVVISGEQAAVGVVVDRLVGLGR RVRRLAVSHAFHSVLMDPMVEEFSKVLADVCVRAPRIGLVSNVTGQLAGAGYGSPAYW VEHVRKPVRFFDGVGLAESLGARVFVEVGPGAGLEASVALLARDRPEVESVLAGVGRL FAEGVAVDWSSVFAGLGGRRVELPTYGFARQRFWLGDNGELSVDQTGKDAGAIARLQS LAPPELQRQLVELVCFHAAIVLGRKSSHDIDPECAFQDLGFDSMSGVELRNRLQMAIG LPGLSLPRTLIFDYPTASALAECLGQLLGGQHESSDDESIWQLLKNIPIHQLRRTGLL DKLLLLAGQPEESLAGRTVSDEVIDSLSPEALIGLALDEDENDIR" misc_feature 1890705..1890752 /gene="pks9" /locus_tag="Rv1664" /note="PS00012 Phosphopantetheine attachment site" gene 1891226..1892287 /gene="pks11" /locus_tag="Rv1665" /db_xref="GeneID:885525" CDS 1891226..1892287 /gene="pks11" /locus_tag="Rv1665" /EC_number="2.3.1.74" /function="Possibly involved in the biosynthesis of Secondary Metabolites [CATALYTIC ACTIVITY: 3 Malonyl-CoA + 4-Coumaroyl-CoA = 4 CoA + Naringenin chalcone + 3 CO2]" /note="Rv1665, (MTCY275.04-MTV047.01), len 353 aa. Possible pks11, chalcone synthase (EC 2.3.1.74), some similarity to BCSA_BACSU|P54157 putative chalcone synthase from Bacillus subtilis (365 aa), FASTA scores: opt: 615, E(): 6.2e-32, (33.4% identity in 308 aa overlap); and to many plant chalcone synthases e.g. CHS_VIGUN|P51089 chalcone synthase (EC 2.3.1.74) (388 aa), FASTA scores: opt: 391, E(): 7.8e-18, (27.2% identity in 349 aa overlap). Highly similar to upstream ORF Rv1660|MTCY06H11.25 pks10 (72.7% identity in 308 aa overlap); and Rv1372 pks18." /codon_start=1 /transl_table=11 /product="chalcone synthase" /protein_id="NP_216181.1" /db_xref="GI:15608803" /db_xref="GOA:O06587" /db_xref="UniProtKB/TrEMBL:O06587" /db_xref="GeneID:885525" /translation="MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAA AKVNGRHLVLPLQQYPSLTDFGDANEIFIEKAVDLGVEALLGALDDANLRPSDIDMIA TATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRDYLRGAPDDVA VLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGDRRAEQVRAGGPDILDSRS SLYPDSLHIMGWDVGSHGLRLRLSPDLTNLIERYLANDVTTFLDAHRLTKDDIGAWVS HPGGPKVIDAVATSLALPPEALELTWRSLGEIGNLSSASILHILRDTIEKRPPSGSAG LMLAMGPGFCTELVLLRWR" gene complement(1892270..1893562) /gene="cyp139" /locus_tag="Rv1666c" /db_xref="GeneID:885528" CDS complement(1892270..1893562) /gene="cyp139" /locus_tag="Rv1666c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv1666c, (MT1706, MTV047.02c), len: 430 aa. Probable cyp139, cytochrome P450 (EC 1.14.-.-), similar to many e.g. U38537|APU38537_7 from Anabaena sp. (459 aa), FASTA scores: opt: 516, E(): 1.7e-26, (25.8% identity in 418 aa overlap). Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 139 CYP139" /protein_id="NP_216182.1" /db_xref="GI:15608804" /db_xref="GOA:P63719" /db_xref="UniProtKB/Swiss-Prot:P63719" /db_xref="GeneID:885528" /translation="MRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFAN ADAFSWSQTFESLVPVDGPTALIVSDGADHRRRRSVVAPGLRHHHVQRYVATMVSNID TVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFLGEQLQPLLDLTRRPPQ VMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADARTAPRPDDHMLTTLISGCSEEGTT LSDNEIRDSIVSLITAGYETTSGALAWAIYALLTVPGTWESAASEVARVLGGRVPAAD DLSALTYLNGVVHETLRLYSPGVISARRVLRDLWFDGHRIRAGRLLIFSAYVTHRLPE IWPEPTEFRPLRWDPNAADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVAR AMLQLPAQRTHRIRAANFAALRPWPGLTVEIRKSAPAQ" misc_feature complement(1892441..1892470) /gene="cyp139" /locus_tag="Rv1666c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(1893577..1894230) /locus_tag="Rv1667c" /db_xref="GeneID:885518" CDS complement(1893577..1894230) /locus_tag="Rv1667c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF MACROLIDE ACROSS THE MEMBRANE (EXPORT). MACROLIDE ANTIBIOTICS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1667c, (MTV047.03c), len: 217 aa. Probable second part of macrolide-transport ATP-binding protein ABC transporter (see citation below), with similarity to C-terminal end of putative ABC transporters/ATP binding proteins, e.g. Z99108|BSUB0005_6 ABC transporter (ATP-binding protein) homolog yfmR from Bacillus subtilis (629 aa), FASTA scores: opt: 411, E(): 6.9e-17, (37.8% identity in 217 aa overlap); etc. Similarity to other NBD components of ABC transporters suggests that Rv1667c and Rv1668c should be contiguous. However, sequence has been checked and no errors found, also same sequence in M. tuberculosis CSU93 and Mycobacterium bovis." /codon_start=1 /transl_table=11 /product="macrolide-transport ATP-binding protein ABC transporter" /protein_id="NP_216183.1" /db_xref="GI:15608805" /db_xref="GOA:O53915" /db_xref="UniProtKB/TrEMBL:O53915" /db_xref="GeneID:885518" /translation="MLGRLRGGYQVEGREVTPTQLLERLGFRRDQLSARVDDLSGGQR RRLQLMLTLLSEPNVLLLDEPTNDVDTEMLTATEDLLDSWAGTLIVVSHDRYLLERVT DQQYAILDDRLRHLPGGIDEYLQLAARVSAPAPAERPAPPAMSGAQRRATEKELAAVD RQLARLADRVAAKHTELAEHDQSDHVGITRLTQQLRVLQDHVAAMENRWLELSEMLE" gene complement(1894224..1895342) /locus_tag="Rv1668c" /db_xref="GeneID:885522" CDS complement(1894224..1895342) /locus_tag="Rv1668c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF MACROLIDE ACROSS THE MEMBRANE (EXPORT). MACROLIDE ANTIBIOTICS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1668c, (MTV047.04c), len: 372 aa. Probable first part of macrolide-transport ATP-binding protein ABC transporter (see citation below), similar to many ATP-binding proteins ABC transporter e.g. X80735|SEABCT_1|Q54072 Saccharopolyspora erythraea ertX gene (481 aa), FASTA scores: opt: 938, E(): 0, (45.6% identity in 353 aa overlap); etc. Similarity to other NBD components of ABC transporters suggests that Rv1667c and Rv1668c should be contiguous. However, sequence has been checked and no error found, also same sequence in Mycobacterium tuberculosis CSU93 and Mycobacterium bovis. Contains PS00211 ABC transporters family signature and two times PS00017 ATP/GTP-binding site motif A. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="macrolide-transport ATP-binding protein ABC transporter" /protein_id="NP_216184.1" /db_xref="GI:15608806" /db_xref="GOA:O53916" /db_xref="UniProtKB/TrEMBL:O53916" /db_xref="GeneID:885522" /translation="MAHLLGAEAVHLAYPTQVVFEAVTLGVNDGARIGIVGRNGDGKS SLLGLLTGQLRPDSGRVTRRSGLRVNALSQTDTLDPNRTVGWTLIGDQPEHQWAGNPR IRDVVAGLVSDIAWDTPVSTLSGGQRRRVQLASLLVGEWDVIALDEPTNHLDIQGITW LADHLRRRWARNTGGLLVVTHDRWFLDEVATTTWEVHDGIVEPFEGGYAAYVLQRVER DRLTAAAEAKRQNLLRKELAWLRRGAPARTCKPKFRIEAANQLIADVPPPRNTVELAK LAAARLGKDVVDLLGVSVSYQPSGGRPVLRDIEWRIGPGERIGIVGANGAGKSTLLGL IAGTVQPGVGRVKPSGWQCSISTGTIWHRLPTTGSPMC" misc_feature complement(1894356..1894379) /locus_tag="Rv1668c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(1894932..1894976) /locus_tag="Rv1668c" /note="PS00211 ABC transporters family signature" misc_feature complement(1895211..1895234) /locus_tag="Rv1668c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1895725..1896087 /locus_tag="Rv1669" /db_xref="GeneID:885701" CDS 1895725..1896087 /locus_tag="Rv1669" /function="UNKNOWN" /note="Rv1669, (MTV047.04B), len: 120 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216185.1" /db_xref="GI:15608807" /db_xref="UniProtKB/TrEMBL:O86371" /db_xref="GeneID:885701" /translation="MSRRPGYSNGRAGASRQAARGGSAGASSVAFSSQPNCGLTESVL GHQVTGICLGTIHLDAMQWPWSSAYRLEPAVATTLIGISAWWANGSVKQYAGDLTDRV ATMTVCRRTPAPRVHYRQ" gene 1896120..1896467 /locus_tag="Rv1670" /db_xref="GeneID:885702" CDS 1896120..1896467 /locus_tag="Rv1670" /function="UNKNOWN" /note="Rv1670, (MTV047.05), len: 115 aa. Conserved hypothetical protein, highly similar to D90908|D90908_87 Hypothetical protein of Synechocystis sp. PCC6803 complete (94 aa), FASTA scores opt: 378, E(): 3.5e-2, (55.2% identity in 96 aa overlap); also shows some similarity to Mycobacterium tuberculosis hypothetical proteins e.g. C-terminal region of O53404|Rv1056 (254 aa), and P96817|Rv0140 (126 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216186.1" /db_xref="GI:15608808" /db_xref="UniProtKB/TrEMBL:O53917" /db_xref="GeneID:885702" /translation="MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSI CPWKGLAHYYNVVVDGPYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESR HGLARRVVAWLGK" gene 1896475..1896867 /locus_tag="Rv1671" /db_xref="GeneID:885698" CDS 1896475..1896867 /locus_tag="Rv1671" /function="UNKNOWN" /note="Rv1671, (MTV047.06), len: 130 aa. Probable membrane protein. Weak similarity to mercuric transport proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216187.1" /db_xref="GI:15608809" /db_xref="UniProtKB/TrEMBL:O53918" /db_xref="GeneID:885698" /translation="MPTVGPADHAAGLDRRATPDQLPIWRIGIISGLVGMLCCVGPTI LALVGIISAATAFAWANDLYDNYAWWFRVSGLAVLAILVWWALRHRNRCSVNAIRRLR WRLMAVLAIAVGTYGVLSAVTTWFGTFV" gene complement(1896876..1898207) /locus_tag="Rv1672c" /db_xref="GeneID:885700" CDS complement(1896876..1898207) /locus_tag="Rv1672c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF UNDETERMINATED SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1672c, (MTV047.07c), len: 443 aa. Probable conserved integral membrane transport protein, major facilitator superfamily, similar to several phthalate transporters or tartrate transporters e.g. U25634|AVU25634_2 Agrobacterium vitis plasmid pTrAB (433 aa), FASTA scores: opt: 914, E(): 0, (37.1% identity in 426 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_216188.1" /db_xref="GI:15608810" /db_xref="GOA:O53919" /db_xref="UniProtKB/TrEMBL:O53919" /db_xref="GeneID:885700" /translation="MATIAASPTHNALGKAARRLLPLLFVLYVINFVDRANISVAALA MNADLRLSATAYGTAAGVFFLGYVLFQVPANAALARFGAGRTLTAVVLAWGVCSAATA LVTSAHTLYLARFALGVAEGGFFPGVIAYLTVWFPCAQRARAVATFLLAIPVANTVGL PLSGLIVGHVHMAGLPGWRAMFVIEALPALLLAPLLRRLLPDNPQRASWLTPEERAEL SARLTEDTPAPTGRSSGAGWDLVLFAVVYGGLYFALYALQFFLPQLVASLAHGTATLT AATLAALPYGVAALAMLAWSHRSIDRSGAQAGHITLPTTAAGSAALGAALSPMSPIVT LSWLTIAVAGILAAMPAFWSRCTAALAGPRVAVAIATVNAVASLASFAGPYATGHLKD ATGTYHLALLTVAAVLAAAAACSLLLRHAGRTVCANDSEIMLHPSPATPFV" gene complement(1898300..1899232) /locus_tag="Rv1673c" /db_xref="GeneID:885695" CDS complement(1898300..1899232) /locus_tag="Rv1673c" /function="UNKNOWN" /note="Rv1673c, (MTV047.08c), len: 310 aa. Conserved hypothetical protein, shows weak similarity to P44103|YA48_HAEIN Hypothetical protein HI10 48 precursor (369 aa), FASTA scores: E(): 8.3e-11, (26.1% identity in 330 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216189.1" /db_xref="GI:15608811" /db_xref="UniProtKB/TrEMBL:O53920" /db_xref="GeneID:885695" /translation="MTITDPAVSAHADATIGLFEITDHITIDSTQGAHTVEMWCPVIG DGAFQRVLDVEVTSEDPYDLTREPEFGNLMLYSRLRLATAASWSIRYVVERRAIGHAP DPARARPLATAQLFSRALIPEAHVDVDERTRTLAQDVVGPETNPLEQARRIYDYVTGA MDYDATKQSFLGSTEHALTCSVGNCNDIHALFVSLCRSVDIPARFVLGQALELPQPGA QDCEVCGYHCWAEFFVAGLGWLPADASCATKYGTHGLFANLQANHIAWSIGRDILLAP PQRAGRSLFFAGPYAEIDGETHPAQRQIRFTAMT" gene complement(1899260..1899916) /locus_tag="Rv1674c" /db_xref="GeneID:885696" CDS complement(1899260..1899916) /locus_tag="Rv1674c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1674c, (MTV047.09c), len: 218 aa. Probable transcriptional regulatory protein. Highly similar to AJ005575|SPE005575_2 Streptomyces peucetius (226 aa), FASTA scores: opt: 662, E(): 0, (50.0% identity in 208 aa overlap). Similar to Rv0324|Z96800|MTCY63.29 M. tuberculosis cosmid (226 aa), FASTA scores: opt: 579, E(): 0, (45.3% identity in 214 aa overlap). N-terminus is similar to transcriptional activators e.g. MERR_STRLI|P30346 probable mercury resistance operon regulator (125 aa), FASTA scores: opt: 183, E(): 1.9e-06, (35.6% identity in 90 aa overlap). Contains PS00380 Rhodanese signature 1." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216190.1" /db_xref="GI:15608812" /db_xref="GOA:O53921" /db_xref="UniProtKB/TrEMBL:O53921" /db_xref="GeneID:885696" /translation="MSGAKKLIFEQFALVGQALSSGHRLELLDLLVQGERSVDALARA SGLTFANASQHLLQLRRAGLVTSRRDGKRVIYALSDPQVWDVVRAVRAVAERNLASVG SLVRQYYTDRDSLEPISRDELQARVAAGSVLVLDVRPAMEYAAGHLPGAVSIPLDELA ERLDELPSGIDIVACCRGPYCVYAYDALELLRPNGFSARRLDGGFSEWLAADLPVVRT" misc_feature complement(1899455..1899490) /locus_tag="Rv1674c" /note="PS00380 Rhodanese signature 1" gene complement(1900241..1900975) /locus_tag="Rv1675c" /db_xref="GeneID:885693" CDS complement(1900241..1900975) /locus_tag="Rv1675c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1675c, (MTV047.10c), len: 244 aa. Probable transcriptional regulatory protein, weak similarity to D00496|LBATRP_7 trp operon from Lactobacillus casei (219 aa), FASTA scores: opt: 172, E(): 0.00011, (26.9% identity in 186 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216191.1" /db_xref="GI:15608813" /db_xref="GOA:O53922" /db_xref="UniProtKB/TrEMBL:O53922" /db_xref="GeneID:885693" /translation="MADRSVRPLRHLVHAVTGGQPPSEAQVRQAAWIARCVGRGGSAP LHRDDVSALAETLQVKEFAPGAVVFHADQTADGVWIVRHGLIELAVGSRRRRAVVNIL HPGDVDGDIPLLLEMPMVYTGRALTQATCLFLDRQAFERLLATHPAIARRWLSSVAQR VSTAQIRLMGMLGRPLPAQVAQLLLDEAIDARIELAQRTLAAMLGAQRPSINKILKEF ERDRLITVGYAVIEITDQHGLRARAQ" gene 1901047..1901751 /locus_tag="Rv1676" /db_xref="GeneID:885694" CDS 1901047..1901751 /locus_tag="Rv1676" /function="UNKNOWN" /note="Rv1676, (MTV047.11), len: 234 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216192.1" /db_xref="GI:15608814" /db_xref="UniProtKB/TrEMBL:O53923" /db_xref="GeneID:885694" /translation="MACPEWEISRSKRTRKPVLRPRHSVSTLTNRFLAEFCHRYGIGV PTRLARGATVPTRRLQDINDQPVDVPAATGRTHLQFRRFAACPICHLHLRSFANRHQE VADSGITEVVFFHSAADALRGYQSLLPFAVIADPDRVQYREFGVEKSLGAITHPRALW AAVRGSAAMLHRNDPERAGVGFGDGTTHLGLPADFLLDADGTVAAVHYGRHADDQWSV DQLIDINRSLGGKGTQ" gene 1901748..1902296 /gene="dsbF" /locus_tag="Rv1677" /db_xref="GeneID:885690" CDS 1901748..1902296 /gene="dsbF" /locus_tag="Rv1677" /function="UNKNOWN; possibly involved in thiol:disulfide interchange." /note="Rv1677, (MTV047.12), len: 182 aa. Probable dsbF, conserved lipoprotein possibly involved in thiol:disulfide interchange. Highly similar to C-terminus of Z74024|MTCY274.09 mpt53 soluble secreted antigen precursor from Mycobacterium tuberculosis (173 aa), FASTA scores: opt: 482, E(): 3.6e-23, (52.8% identity in 142 aa overlap) . Also some similarity to P52237|TIPB_PSEFL THIOL:DISULFIDE INTERCHANGE PROTEIN TIPB PRECURSOR from Pseudomonas fluorescens (178 aa), FASTA scores: opt: 190, E(): 4.4e-05, (28.5% identity in 151 aa overlap); and P33926|DSBE_ECOLI THIOL:DISULFIDE INTERCHANGE PROTEIN from Escherichia coli (185 aa), FASTA scores: opt: 194, E(): 2.6e-05, (29.1% identity in 175 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site and PS00194 Thioredoxin family active site." /codon_start=1 /transl_table=11 /product="lipoprotein DsbF" /protein_id="NP_216193.1" /db_xref="GI:15608815" /db_xref="GOA:O53924" /db_xref="UniProtKB/TrEMBL:O53924" /db_xref="GeneID:885690" /translation="MTHSRLIGALTVVAIIVTACGSQPKSQPAVAPTGDAAAATQVPA GQTVPAQLQFSAKTLDGHDFHGESLLGKPAVLWFWAPWCPTCQGEAPVVGQVAASHPE VTFVGVAGLDQVPAMQEFVNKYPVKTFTQLADTDGSVWANFGVTQQPAYAFVDPHGNV DVVRGRMSQDELTRRVTALTSR" misc_feature 1901775..1901807 /gene="dsbF" /locus_tag="Rv1677" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" misc_feature 1901970..1902026 /gene="dsbF" /locus_tag="Rv1677" /note="PS00194 Thioredoxin family active site" gene 1902397..1903299 /locus_tag="Rv1678" /db_xref="GeneID:885691" CDS 1902397..1903299 /locus_tag="Rv1678" /function="UNKNOWN" /note="Rv1678, (MTV047.13), len: 300 aa. Probable integral membrane protein." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216194.1" /db_xref="GI:15608816" /db_xref="UniProtKB/TrEMBL:O53925" /db_xref="GeneID:885691" /translation="MARVRRGTELLLSPQSPPATGGLIVLTGLRLLAGLIWLYNVVWK VPPDFGERGRRDLYHFTHLAVEHPVFTPFSWVIEHAVLPYFTAFGWGVLFAESALAVL LLTGTAVRLAALIGIGQSVAIGLSVAESPGEWPWAYAMLLGIHVVLLFTCSTRYAAVD AVRAAATGSAARTAAQRLLAGWGIVLGLIGLVAVWRGLGDDRPAYVGIRALEFSLGEY NLRGALALIAIALAMLAAAKRGWRTVALVAAVVAVAAAAAIYLQVGRTAVWLGGTNTT AAVFVCAAVVSLATEFRIGRVEGA" gene 1903299..1904420 /gene="fadE16" /locus_tag="Rv1679" /db_xref="GeneID:885688" CDS 1903299..1904420 /gene="fadE16" /locus_tag="Rv1679" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv1679, (MTV047.14, MTCI125.01), len: 373 aa. Possible fadE16, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to acyl/butyryl-CoA dehydrogenases e.g. NP_244665.1|NC_002570 acyl-CoA dehydrogenase from Bacillus halodurans (380 aa); NP_000008.1|NM_000017 acyl-Coenzyme A dehydrogenase from Homo sapiens (412 aa); Z99113|BSUB0010_119 from Bacillus subtilis (380 aa), FASTA scores: opt: 439, E(): 3.4e-20, (29.6% identity in 287 aa overlap); etc. Weakly similar to many dehydrogenases and to P31571|CAIA_ECOLI probable carnitine operon oxidoreductase from Escherichia coli (380 aa), FASTA scores: opt: 109, E(): 0.0066, (28.6% identity in 98 aa overlap)." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="NP_216195.1" /db_xref="GI:15608817" /db_xref="GOA:O53926" /db_xref="UniProtKB/TrEMBL:O53926" /db_xref="GeneID:885688" /translation="MATPGVVQEVVSVAAEHAERVDTDCAFPAEAVDALRKTGLLGLV LPREIGGMGSGPVEFTEVVAQLSAACGSTAMIYLMHMAAAVTVAASPPPGLPDLLADM ASGKQLGTLAFSEPGSRSHFWAPVSTASADGDGIAVRADKSWVTSAGFADVYVVSVGS ADGAAGDVDLYAVPADTPGLRVAGTFTGMGLRGNASAPMAVDIRIPDSYRLGEAGGGF GIMMQTVLPWFNLGNAAVSLGLATAATGAAVKHVGTARLEHLGGSLAELPTIRAQIAR MGTTLAAQKAYLEVAANSVSSPDDTTLTHVLGVKASVNDAALTITESAMRVCGGAAFS KHLPIERAFRDARAGSVMAPTADALYDFYGRAVTGLPLF" gene 1904429..1905253 /locus_tag="Rv1680" /db_xref="GeneID:885689" CDS 1904429..1905253 /locus_tag="Rv1680" /function="UNKNOWN" /note="Rv1680, (MTCI125.02), len: 274 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216196.1" /db_xref="GI:15608818" /db_xref="UniProtKB/TrEMBL:O33182" /db_xref="GeneID:885689" /translation="MSTEPLVVGAVAYTPNVVPIWEGIRGYFQDSESPDTQMDFVLYS NYARLVDSLIAGHIDIAWNTNLAYVRTVLQTGGRCTPLAQRDTDVDYTTVFVAHAGSD LHGAKDIAGKRLALGSADSAHAAILPLYYLRRAGIAESDLQVIRFDTDIGKHGDTGRS ELDAVDAVLAGEADVAAIGSSTWAAMGAAELMGESLTEVWRTDGYCHCMFTALDTLPA ERYQPWLDRLLAMSWDDSEHRKILELEGLRRWVPPHLDGYKPLFEAVQEQGIDPRW" gene 1905250..1906242 /gene="moeX" /locus_tag="Rv1681" /db_xref="GeneID:885683" CDS 1905250..1906242 /gene="moeX" /locus_tag="Rv1681" /function="INVOLVED IN MOLYBDOPTENUM COFACTOR BIOSYNTHESIS" /note="Rv1681, (MTCI125.03), len: 330 aa. Possible moeX, Molybdopterin biosynthesis protein, has weak similarity to MOAA_ECOLI|P30745 molybdenum cofactor biosynthesis protein (329 aa), FASTA scores: opt: 162, E(): 0.00081, (27.7% identity in 224 aa overlap) and to Rv3109|MTCY164.19 MoaA from Mycobacterium tuberculosis (28.5% identity in 165 aa overlap)." /codon_start=1 /transl_table=11 /product="molybdopterin biosynthesis protein MoeX" /protein_id="NP_216197.1" /db_xref="GI:15608819" /db_xref="GOA:O33183" /db_xref="UniProtKB/TrEMBL:O33183" /db_xref="GeneID:885683" /translation="MIIELMRRVVGLAQGATAEVAVYGDRDRDLAERWCANTGNTLVR ADVDQTGVGTLVVRRGHPPDPASVLGPDRLPGVRLWLYTNFHCNLCCDYCCVSSSPST PHRELGAERIGRIVGEAARWGVRELFLTGGEPFLLPDIDTIIATCVKQLPTTVLTNGM VFKGRGRRALESLPRGLALQISLDSATPELHDAHRGAGTWVKAVAGIRLALSLGFRVR VAATVASPAPGELTAFHDFLDGLGIAPGDQLVRPIALEGAASQGVALTRESLVPEVTV TADGVYWHPVAATDERALVTRTVEPLTPALDMVSRLFAEQWTRAAEEAALFPCA" gene 1906403..1907320 /locus_tag="Rv1682" /db_xref="GeneID:885686" CDS 1906403..1907320 /locus_tag="Rv1682" /function="UNKNOWN" /note="Rv1682, (MTCI125.04), len: 305 aa. Probable coiled-coil structural protein, weakly similar to many paramyosins, kinesins and plectins e.g. MYSP_ONCVO|Q02171 paramyosin from onchocerca volvulus (879 aa), fasta scores: opt: 180, E():2.6e-08, (24.4% identity in 234 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical coiled-coil proteins (wag31 antigen 84) Rv2145c and Rv2927c." /codon_start=1 /transl_table=11 /product="coiled-coil structural protein" /protein_id="NP_216198.1" /db_xref="GI:15608820" /db_xref="UniProtKB/TrEMBL:O33184" /db_xref="GeneID:885686" /translation="MLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDP VAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRL AVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRR AREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQ ECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLEN IPALLESARHSETEPLQSINGAVAELRAI" repeat_region 1907460..1907515 /note="56 bp direct repeat 1, AGTCGGGTGACGATGCGGGCCGGTGTGGTCCGAGGAGGAGCCCGACAATTTAAGCT" repeat_region 1907516..1907571 /note="56 bp direct repeat 2, AGTCGGGTGACGATGCGGGCCGGTGTGGTCCGAGGAGGAGCCCGACAATTTAAGCT" gene 1907594..1910593 /locus_tag="Rv1683" /db_xref="GeneID:885687" CDS 1907594..1910593 /locus_tag="Rv1683" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_216199.1" /db_xref="GI:15608821" /db_xref="GOA:O33185" /db_xref="UniProtKB/TrEMBL:O33185" /db_xref="GeneID:885687" /translation="MVDLNFSMVTRPIERLVATAQNGLEVLRLGGLETGSVPSPSQIV ESVPMYKLRRYFPPDNRPGQPPVGPPVLMVHPMMMSADMWDVTREDGAVGILHASGLD PWVIDFGSPDEVEGGMRRNLADHIVALSEAVDTVKDATGHDVHFVGYSQGGMFCYQAA AYRRSKDIASVVAFGSPVDTLAALPMGIPANMGAAVADFMADHVFNRLDIPSWMARMG FQMMDPLKTAKARVDFVRQLHDREALLPREQQRRFLESEGWIAWSGPAISELLKQFIA HNRMMTGGFAISGQMVTLTDITCPILAFVGEVDDIGQPASVRGIRRAAPNSEVYECLI RAGHFGLVVGSRAAQQSWPTVADWVRWISGDGTKPENIHLMADQPAEHTDSGVAFSSR VAHGIGEVSEAALALARGAADAVVAANRSVRTLAVETVRTLPRLARLGQLNDHTRISL GRIIDEQAHDAPKGEFLLFDGRVHTYEAVNRRINNVVRGLIAVGVRQGDRVGVLMETR PSALVAIAALSRLGAVAVVMRPDTDLSASVRLGRVTEILTDPTNLDAARQLPGQVLVL GGGESRDLDLPADALEQGQVIDMEKIDPDAVELPAWYRPNPGLARDLAFIAFSSADGD LVAKQITNYRWAVSAFGTASTAALGRRDTVYCLTPLHHESALLVSLGGAVVGGTRIAL SRGLRPDRFVAEVRQYGVTVVSYTWAMLRDVVDDPAFVLHGNHPVRLFIGSGMPTGLW ERVVEAFAPAHVVEFFATTDGQAVLANVAGAKIGSKGRPLPGAGRVELGAYDAEHDLI LENDRGFVQVAGVNQVGVLLAQSRGPIDPTASVKRGVFAPADTWISTDYLFWRDDDGD YWLAGGRGSVVRTARGMVYTEPVTNALGLITGVDLAVTYGVLVRGRHVAVSAVTLLPG ATITAADLTEAVASMPVGLGPDIVHVVPQLTLSGTYRPTVSALRANGIPKAGRQAWYF NSGGNEYRRLTPAVRTELTGQHRRGNA" misc_feature 1908023..1908052 /locus_tag="Rv1683" /note="PS00120 Lipases, serine active site" gene 1910586..1910810 /locus_tag="Rv1684" /db_xref="GeneID:885680" CDS 1910586..1910810 /locus_tag="Rv1684" /function="UNKNOWN" /note="Rv1684, (MTCI125.06), len: 74 aa. Conserved hypothetical protein, similar to P75844|YCAR_ECOLI Protein YCAR from Escherichia coli (60 aa), FASTA scores: opt: 108, E(): 0.00022, (39.0% identity in 59 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216200.1" /db_xref="GI:15608822" /db_xref="UniProtKB/TrEMBL:O33186" /db_xref="GeneID:885680" /translation="MLDEALLAILVCPADRGPLVLVEDGDIQVLYNPRLRRAYRIEDG IPVLLVDEAREVDEDEHARLMARGRPAAPQ" gene complement(1910776..1911399) /locus_tag="Rv1685c" /db_xref="GeneID:885682" CDS complement(1910776..1911399) /locus_tag="Rv1685c" /function="UNKNOWN" /note="Rv1685c, (MTCI125.07c), len: 207 aa. Conserved hypothetical protein, some similarity to other Mycobacterium tuberculosis hypothetical regulatory proteins e.g. Q10774|Rv1556|YF56_MYCTU (202 aa), FASTA scores: opt: 111, E(): 1.7e-05, (24.1% identity in 195 aa overlap); and P95215|Rv0258c|MTCY06A4.02c (151 aa) FASTA scores: (32.9% identity in 140 aa overlap); also similar to Q9X8G9|SCE7.13C|AL049819 putative Streptomyces coelicolor transcriptional regulator (204 aa), FASTA scores: opt: 480, E(): 6.4e-25, (40.4% identity in 203 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216201.1" /db_xref="GI:15608823" /db_xref="GOA:O33187" /db_xref="UniProtKB/TrEMBL:O33187" /db_xref="GeneID:885682" /translation="MAAPDNSRRRPGRPAGSSDTRERILSSARELFAHNGIDRTSIRA VAAKAGVDAALVHHYFGTKQQLFAAAIHIPIDPMVIIGPIREAPVEELGYKLPSLLLP IWDSELGAGLIATLRSLISGSDVGLARSFLEEVVTVELGSRVDNPPGTGKIRTQFVAS QLMGVVMARYIVRIEPFASLPAEQIVQTIAPNLQRYLTGELPDDLAP" gene complement(1911401..1912081) /locus_tag="Rv1686c" /db_xref="GeneID:885672" CDS complement(1911401..1912081) /locus_tag="Rv1686c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY DRUG) ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1686c, (MTCI125.08c), len: 226 aa. Probable conserved integral membrane protein ABC transporter (see citation below), similar to AL049819|SCE7.05 putative integral membrane protein from Streptomyces coelicolor (266 aa), FASTA sacores: opt: 661, E(): 0, (45.1% identity in 226 aa overlap); and Q53627|U43537 MEMBRANE PROTEIN INVOLVED IN MITHRAMYCIN RESISTANCE from STREPTOMYCES ARGILLACEUS (233 aa), FASTA scores: opt: 222, E(): 5.4e-10, (28.7% identity in 216 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein ABC transporter" /protein_id="NP_216202.1" /db_xref="GI:15608824" /db_xref="GOA:O33188" /db_xref="UniProtKB/TrEMBL:O33188" /db_xref="GeneID:885672" /translation="MILLVPILIITLMYFMFENVPHRPGTPSGFNTACLVLLGLFPLF VMFVITAITMQRERASGTLERILTTPLRRLDLLAGYGTAFSIAAAAQATLACIVAFWF LGFDTAGSPVWVFAIAIVNAVLGVGLGLLCSAFARTEFQAVQFIPLVMVPQLLLAGII VPRALMPTWLEWISNVMPASYALEALQQVGAHPELTGIAVRDVVVVLSFAVASLCLAA VTLRRRTS" gene complement(1912153..1912920) /locus_tag="Rv1687c" /db_xref="GeneID:885678" CDS complement(1912153..1912920) /locus_tag="Rv1687c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY DRUG) ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1687c, (MTCI125.09c), len: 255 aa. Probable conserved ATP-binding protein ABC transporter (see citation below), similar to many ABC-type transporters e.g. P55476|NODI_RHISN nodulation ATP-binding protein I from Rhizobium sp. (343 aa), FASTA scores: opt: 479, E(): 3.7e-23, (34.6% identity in 243 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis ABC-type transporters e.g. MTCY19H9.04 (34.5% identity in 238 aa overlap). Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Also contains PS00039 DEAD-box subfamily ATP-dependent helicases signature, though this may be spurious." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="NP_216203.1" /db_xref="GI:15608825" /db_xref="GOA:O33189" /db_xref="UniProtKB/TrEMBL:O33189" /db_xref="GeneID:885678" /translation="MMISSSDELLRDGADPAVIIDQLRVIRGKRLALQDVSVRVACGT ITGLLGPSGSGKTTLIRCIVGSQIIASGSVSVLGQPAGSAELRHRVGYMPQDPTIYND LRVIDNIRYFAELCGVDRQAADEVIEAVDLRDHRTARCANLSGGQRARVSLACALVGR PDLLVLDEPTIGLDPVLRVELWDRFTALARRGTTLLVSSHVMDEADRCGDLLLLRQGQ LLAHTTPHRLRKETGCTSLEEAFLSIVRRTTTVPAAG" misc_feature complement(1912294..1912320) /locus_tag="Rv1687c" /note="PS00039 DEAD-box subfamily ATP-dependent helicases signature" misc_feature complement(1912450..1912494) /locus_tag="Rv1687c" /note="PS00211 ABC transporters family signature" misc_feature complement(1912750..1912773) /locus_tag="Rv1687c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1912979..1913590 /gene="mpg" /locus_tag="Rv1688" /db_xref="GeneID:885679" CDS 1912979..1913590 /gene="mpg" /locus_tag="Rv1688" /EC_number="3.2.2.-" /function="THOUGHT TO BE INVOLVED IN BASE EXCISION REPAIR." /note="responsible for recognizing base lesions in the genome and initiating base excision DNA repair" /codon_start=1 /transl_table=11 /product="3-methyladenine DNA glycosylase" /protein_id="NP_216204.1" /db_xref="GI:15608826" /db_xref="GOA:P65412" /db_xref="UniProtKB/Swiss-Prot:P65412" /db_xref="GeneID:885679" /translation="MNAEELAIDPVAAAHRLLGATIAGRGVRAMVVEVEAYGGVPDGP WPDAAAHSYRGRNGRNDVMFGPPGRLYTYRSHGIHVCANVACGPDGTAAAVLLRAAAI EDGAELATSRRGQTVRAVALARGPGNLCAALGITMADNGIDLFDPSSPVRLRLNDTHR ARSGPRVGVSQAADRPWRLWLTGRPEVSAYRRSSRAPARGASD" gene 1913602..1914876 /gene="tyrS" /locus_tag="Rv1689" /db_xref="GeneID:885668" CDS 1913602..1914876 /gene="tyrS" /locus_tag="Rv1689" /EC_number="6.1.1.1" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-tyrosine + tRNA(Tyr) = AMP + diphosphate + L-tyrosyl-tRNA(Tyr)]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of tyrosyl-tRNA(Tyr) from tyrosine and tRNA(Tyr)" /codon_start=1 /transl_table=11 /product="tyrosyl-tRNA synthetase" /protein_id="NP_216205.1" /db_xref="GI:15608827" /db_xref="GOA:P67611" /db_xref="UniProtKB/Swiss-Prot:P67611" /db_xref="GeneID:885668" /translation="MSGMILDELSWRGLIAQSTDLDTLAAEAQRGPMTVYAGFDPTAP SLHAGHLVPLLTLRRFQRAGHRPIVLAGGATGMIGDPRDVGERSLNEADTVAEWTERI RGQLERFVDFDDSPMGAIVENNLEWTGSLSAIEFLRDIGKHFSVNVMLARDTIRRRLA GEGISYTEFSYLLLQANDYVELHRRHGCTLQIGGADQWGNIIAGVRLVRQKLGATVHA LTVPLVTAADGTKFGKSTGGGSLWLDPQMTSPYAWYQYFVNTADADVIRYLRWFTFLS ADELAELEQATAQRPQQRAAQRRLASELTVLVHGEAATAAVEHASRALFGRGELARLD EATLAAALRETTVAELKPGSPDGIVDLLVASGLSASKGAARRTIHEGGVSVNNIRVDN EEWVPQSSDFLHGRWLVLRRGKRSIAGVERIG" misc_feature 1913722..1913754 /gene="tyrS" /locus_tag="Rv1689" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene 1915527..1915910 /gene="lprJ" /locus_tag="Rv1690" /db_xref="GeneID:885670" CDS 1915527..1915910 /gene="lprJ" /locus_tag="Rv1690" /function="UNKNOWN" /note="Rv1690, (MTCI125.12), len: 127 aa. Probable lprJ, lipoprotein; contains possible signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Weakly similar to other Mycobacterium tuberculosis hypothetical proteins with conserved cysteines e.g. Rv1804c, Rv1810, Rv3354, etc" /codon_start=1 /transl_table=11 /product="lipoprotein LprJ" /protein_id="NP_216206.1" /db_xref="GI:15608828" /db_xref="UniProtKB/TrEMBL:O33192" /db_xref="GeneID:885670" /translation="MTAHTHDGTRTWRTGRQATTLLALLAGVFGGAASCAAPIQADMM GNAFLTALTNAGIAYDQPATTVALGRSVCPMVVAPGGTFESITSRMAEINGMSRDMAS TFTIVAIGTYCPAVIAPLMPNRLQA" misc_feature 1915599..1915631 /gene="lprJ" /locus_tag="Rv1690" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 1915949..1916701 /locus_tag="Rv1691" /db_xref="GeneID:885665" CDS 1915949..1916701 /locus_tag="Rv1691" /function="UNKNOWN" /note="Rv1691, MTCI125.13, len: 250 aa. Conserved hypothetical protein, similar to Q9S210|SCI51.30C|AL109848 Hypothetical protein from Streptomyces coelicolor (210 aa), FASTA score: opt: 556, E(): 6.4e-27, (50.6% identity in 180 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216207.1" /db_xref="GI:15608829" /db_xref="UniProtKB/TrEMBL:O33193" /db_xref="GeneID:885665" /translation="MVDDRQGRRGGRRPRSAAADNRPAFRDGPAIPPGIHARQLAPEI RRELSTLDRATADAVACHLVAAGELIDDDPEAALRHARAARVRASRIAAVREAVGIAA YRCGDWAQALAELRAARRMGSKSPLLALIADCERGLGRPQRAIELARGSEAVELSGDA ADELRIVAAGARADLGQLEQALTVLSTPQLDPGRTGSTAARLFYAYAEILLALGRGDE ALQWFLRSAAADIDGVTDAEDRVDELGAREQK" gene 1916698..1917759 /locus_tag="Rv1692" /db_xref="GeneID:885241" CDS 1916698..1917759 /locus_tag="Rv1692" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1692, (MTCI125.14), len: 353 aa. Probable phosphatase (EC 3.1.-.-), some similarity to others e.g. PNPP_SCHPO|Q00472 4-nitrophenylphosphatase (269 aa), FASTA scores: opt: 214, E(): 1.3e-10, (29.5% identity in 241 aa overlap); and to NAGD_ECOLI|P15302 nagd protein from Escherichia coli (250 aa), FASTA scores: opt: 314, E(): 9.8e-08, (28.2% identity in 245 aa overlap). Also similar to AL109848|SCI51.28 hypothetical protein from Streptomyces coelicolor (343 aa), FASTA scores: opt: 768, E(): 0, (44.8% identity in 315 aa overlap)." /codon_start=1 /transl_table=11 /product="phosphatase" /protein_id="NP_216208.1" /db_xref="GI:15608830" /db_xref="GOA:O33194" /db_xref="UniProtKB/TrEMBL:O33194" /db_xref="GeneID:885241" /translation="MKSIAQEHDCLLIDLDGTVFCGRQPTGGAVQSLSQVRSRKLFVT NNASRSADEVAAHLCELGFTATGEDVVTSAQSAAHLLAGQLAPGARVLIVGTEALANE VAAVGLRPVRRFEDRPDAVVQGLSMTTGWSDLAEAALAIRAGALWVAANVDPTLPTER GLLPGNGSMVAALRTATGMDPRVAGKPAPALMTEAVARGDFRAALVVGDRLDTDIEGA NAAGLPSLMVLTGVNSAWDAVYAEPVRRPTYIGHDLRSLHQDSKLLAVAPQPGWQIDV GGGAVTVCANGDVDDLEFIDDGLSIVRAVASAVWEARAADLHQRPLRIEAGDERARAA LQRWSLMRSDHPVTSVGTQ" gene 1917756..1917932 /locus_tag="Rv1693" /db_xref="GeneID:885662" CDS 1917756..1917932 /locus_tag="Rv1693" /function="UNKNOWN" /note="Rv1693, (MTCI125.15), len: 58 aa. Conserved hypothetical protein, shows some similarity to AL583921 hypothetical protein from Mycobacterium leprae (61 aa). Probable coiled-coil from aa 30 to 58." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216209.1" /db_xref="GI:15608831" /db_xref="UniProtKB/TrEMBL:O33195" /db_xref="GeneID:885662" /translation="MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEA HEVLLAALESAEKG" gene 1917940..1918746 /gene="tlyA" /locus_tag="Rv1694" /db_xref="GeneID:885396" CDS 1917940..1918746 /gene="tlyA" /locus_tag="Rv1694" /function="HAS A CONTACT-DEPENDENT HAEMOLYTIC ACTIVITY; POSSIBLY INVOLVED IN VIRULENCE (PORE FORMATION)." /experiment="experimental evidence, no additional details recorded" /note="Rv1694, (MTCI125.16), len: 268 aa. tlyA, cytotoxin/haemolysin homologue (see citations below), almost identical to NP_301968.1|NC_002677 cytotoxin/haemolysin homologue TlyA from Mycobacterium leprae (269 aa). TlyA homologues were also identified by PCR in Mycobacterium avium, Mycobacterium bovis BCG, but appeared absent in M. smegmatis, M. vaccae, M. kansasii, M. chelonae and M. phlei (see Wren et al., 1998). Also highly similar to CAB83047.1|AJ271681 putative haemolysin from Mycobacterium ulcerans (281 aa); and similar to HLYA_TREHY|Q06803 pore-forming haemolysin/cytotoxin virulence determinant from Treponema hyodysenteriae (240 aa), FASTA scores: opt: 514, E():3e-30, (37.3% identity in 236 aa overlap)." /codon_start=1 /transl_table=11 /product="cytotoxin/hemolysin TlyA" /protein_id="NP_216210.1" /db_xref="GI:15608832" /db_xref="GOA:Q50760" /db_xref="UniProtKB/TrEMBL:Q50760" /db_xref="GeneID:885396" /translation="MARRARVDAELVRRGLARSRQQAAELIGAGKVRIDGLPAVKPAT AVSDTTALTVVTDSERAWVSRGAHKLVGALEAFAIAVAGRRCLDAGASTGGFTEVLLD RGAAHVVAADVGYGQLAWSLRNDPRVVVLERTNARGLTPEAIGGRVDLVVADLSFISL ATVLPALVGCASRDADIVPLVKPQFEVGKGQVGPGGVVHDPQLRARSVLAVARRAQEL GWHSVGVKASPLPGPSGNVEYFLWLRTQTDRALSAKGLEDAVHRAISEGP" gene 1918746..1919669 /gene="ppnK" /locus_tag="Rv1695" /db_xref="GeneID:885660" CDS 1918746..1919669 /gene="ppnK" /locus_tag="Rv1695" /EC_number="2.7.1.23" /function="Catalyzes the phosphorylation of NAD to NADP. Utilizes ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus [CATALYTIC ACTIVITY: ATP + NAD+ = ADP + NADP+]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the phosphorylation of NAD to NADP" /codon_start=1 /transl_table=11 /product="inorganic polyphosphate/ATP-NAD kinase" /protein_id="NP_216211.1" /db_xref="GI:15608833" /db_xref="GOA:O33196" /db_xref="UniProtKB/Swiss-Prot:O33196" /db_xref="GeneID:885660" /translation="MTAHRSVLLVVHTGRDEATETARRVEKVLGDNKIALRVLSAEAV DRGSLHLAPDDMRAMGVEIEVVDADQHAADGCELVLVLGGDGTFLRAAELARNASIPV LGVNLGRIGFLAEAEAEAIDAVLEHVVAQDYRVEDRLTLDVVVRQGGRIVNRGWALNE VSLEKGPRLGVLGVVVEIDGRPVSAFGCDGVLVSTPTGSTAYAFSAGGPVLWPDLEAI LVVPNNAHALFGRPMVTSPEATIAIEIEADGHDALVFCDGRREMLIPAGSRLEVTRCV TSVKWARLDSAPFTDRLVRKFRLPVTGWRGK" gene 1919683..1921446 /gene="recN" /locus_tag="Rv1696" /db_xref="GeneID:885805" CDS 1919683..1921446 /gene="recN" /locus_tag="Rv1696" /function="INVOLVED IN RECOMBINATIONAL REPAIR OF DAMAGED DNA." /note="Rv1696, (MTCI125.18), len: 587 aa. Probable recN, DNA repair protein (see citation below), similar to many e.g. RECN_ECOLI|P05824 dna repair protein recN (553 aa), FASTA scores: opt: 508, E(): 1.9e-33, (31.5% identity in 587 aa overlap). Equivalent to Z95117|MLCB1351_12 recN from Mycobacterium leprae (587 aa), FASTA scores: (76.1% identit y in 589 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="DNA repair protein recN (recombination protein N)" /protein_id="NP_216212.1" /db_xref="GI:15608834" /db_xref="GOA:O33197" /db_xref="UniProtKB/Swiss-Prot:O33197" /db_xref="GeneID:885805" /translation="MLTELRIESLGAISVATAEFDRGFTVLTGETGTGKTMVVTGLHL LGGARADATRVRSGADRAVVEGRFTTTDLDDATVAGLQAVLDSSGAERDEDGSVIALR SISRDGPSRAYLGGRGVPAKSLSGFTNELLTLHGQNDQLRLMRPDEQRGALDRFAAAG EAVQRYRKLRDAWLTARRDLVDRRNRARELAQEADRLKFALNEIDTVDPQPGEDVALV ADIARLSELDTLREAATTARATLCGTPDADAFDRGAVDSLGRARAALQSSDDAALRGL AEQVGEALTVVVDAVAELGAYLDELPADASALDAKLARQAQLRTLTRKYAADIDGVLR WADEARARLAQLDVSEEGLAALERRTGELAHELGQAAVDLSTIRRKAAKRLAKEVSAE LSALAMADAEFTIGVTTELADHGDPVALALASGELARAGADGVDAVEFGFVAHRGMTV LPLAKSASGGELSRVMLSLEVVLATSRKQAAGTTMVFDEIDAGVGGWAAVQIGRRLAR LARTHQVIVVTHLPQVAAYADVHLMVQRTGRDGASGVRRLTSEDRVAELARMLAGLGD SDSGRAHARELLETAQNDELT" misc_feature 1919767..1919790 /gene="recN" /locus_tag="Rv1696" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1921542..1922723 /locus_tag="Rv1697" /db_xref="GeneID:885045" CDS 1921542..1922723 /locus_tag="Rv1697" /function="UNKNOWN" /note="Rv1697, (MTCI125.19), len: 393 aa. Conserved hypothetical protein, highly similar to Q49895|MLC1351.11C|U00021 Hypothetical protein of Mycobacterium leprae from cosmid L247 (430 aa), FASTA scores: opt: 2345, E(): 0, (90.6% identity in 393 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216213.1" /db_xref="GI:15608835" /db_xref="UniProtKB/TrEMBL:O33198" /db_xref="GeneID:885045" /translation="MRMSALLSRNTSRPGLIGIARVDRNIDRLLRRVCPGDIVVLDVL DLDRITADALVEAEIAAVVNASSSVSGRYPNLGPEVLVTNGVTLIDETGPEIFKKVKD GAKVRLYEGGVYAGDRRLIRGTERTDHDIADLMREAKSGLVAHLEAFAGNTIEFIRSE SPLLIDGIGIPDVDVDLRRRHVVIVADEPSGPDDLKSLKPFIKEYQPVLVGVGTGADV LRKAGYRPQLIVGDPDQISTEVLKCGAQVVLPADADGHAPGLERIQDLGVGAMTFPAA GSATDLALLLADHHGAALLVTAGHAANIETFFDRTRVQSNPSTFLTRLRVGEKLVDAK AVATLYRNHISGGAIALLALTMLIAIIVALWVSRTDGVVLHWIIDYWNRFSLWVQHLV S" gene 1922745..1923689 /locus_tag="Rv1698" /db_xref="GeneID:885047" CDS 1922745..1923689 /locus_tag="Rv1698" /function="UNKNOWN" /note="Rv1698, (MTCI125.20), len: 314 aa. Conserved hypothetical protein, possibly exported protein with potential N-terminal signal sequence. Equivalent to Q49894|MLC1351.10C|Z95117 Hypothetical protein from Mycobacterium leprae (317 aa), FASTA scores: (77.0% identity in 317 aa overlap). Probable coiled-coil from aa 31 to 67." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216214.1" /db_xref="GI:15608836" /db_xref="GOA:P64883" /db_xref="UniProtKB/Swiss-Prot:P64883" /db_xref="GeneID:885047" /translation="MISLRQHAVSLAAVFLALAMGVVLGSGFFSDTLLSSLRSEKRDL YTQIDRLTDQRDALREKLSAADNFDIQVGSRIVHDALVGKSVVIFRTPDAHDDDIAAV SKIVGQAGGAVTATVSLTQEFVEANSAEKLRSVVNSSILPAGSQLSTKLVDQGSQAGD LLGIALLSNADPAAPTVEQAQRDTVLAALRETGFITYQPRDRIGTANATVVVTGGALS TDAGNQGVSVARFAAALAPRGSGTLLAGRDGSANRPAAVAVTRADADMAAEISTVDDI DAEPGRITVILALHDLINGGHVGHYGTGHGAMSVTVSQ" gene 1923829..1925589 /gene="pyrG" /locus_tag="Rv1699" /db_xref="GeneID:885048" CDS 1923829..1925589 /gene="pyrG" /locus_tag="Rv1699" /EC_number="6.3.4.2" /function="PYRIMIDINE BIOSYNTHESIS (LAST STEP) [CATALYTIC ACTIVITY : ATP + UTP + GLUTAMINE = ADP + ORTHOPHOSPHATE + CTP (AMMONIA CAN REPLACE GLUTAMINE).]" /note="CTP synthase; CTP synthase; cytidine triphosphate synthetase; catalyzes the ATP-dependent amination of UTP to CTP with either L-glutamine or ammonia as the source of nitrogen; in Escherichia coli this enzyme forms a homotetramer" /codon_start=1 /transl_table=11 /product="CTP synthetase" /protein_id="NP_216215.1" /db_xref="GI:15608837" /db_xref="GOA:P96351" /db_xref="UniProtKB/Swiss-Prot:P96351" /db_xref="GeneID:885048" /translation="MRKHPQTATKHLFVSGGVASSLGKGLTASSLGQLLTARGLHVTM QKLDPYLNVDPGTMNPFQHGEVFVTEDGAETDLDVGHYERFLDRNLPGSANVTTGQVY STVIAKERRGEYLGDTVQVIPHITDEIKRRILAMAQPDADGNRPDVVITEIGGTVGDI ESQPFLEAARQVRHYLGREDVFFLHVSLVPYLAPSGELKTKPTQHSVAALRSIGITPD ALILRCDRDVPEALKNKIALMCDVDIDGVISTPDAPSIYDIPKVLHREELDAFVVRRL NLPFRDVDWTEWDDLLRRVHEPHETVRIALVGKYVELSDAYLSVAEALRAGGFKHRAK VEICWVASDGCETTSGAAAALGDVHGVLIPGGFGIRGIEGKIGAIAYARARGLPVLGL CLGLQCIVIEAARSVGLTNANSAEFDPDTPDPVIATMPDQEEIVAGEADLGGTMRLGS YPAVLEPDSVVAQAYQTTQVSERHRHRYEVNNAYRDKIAESGLRFSGTSPDGHLVEFV EYPPDRHPFVVGTQAHPELKSRPTRPHPLFVAFVGAAIDYKAGELLPVEIPEIPEHTP NGSSHRDGVGQPLPEPASRG" misc_feature 1924990..1925025 /gene="pyrG" /locus_tag="Rv1699" /note="PS00442 Glutamine amidotransferases class-I active site" gene 1925582..1926205 /locus_tag="Rv1700" /db_xref="GeneID:885049" CDS 1925582..1926205 /locus_tag="Rv1700" /function="UNKNOWN" /note="Rv1700, (MTCI125.22), len: 207 aa. Conserved hypothetical protein, equivalent to Q49891|MLC1351.08C|Z95117 Hypothetical protein from Mycobacterium leprae (177 aa), FASTA scores: (66.7% identity in 171 aa overlap); also similar to Q9S225|SCI51.15C|AL109848 Hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 508, E(): 1.2e-27, (43.1% identity in 197 aa overlap); similar to P54570|ADPP_BACSU ADP-RIBOSE PYROPHOSPHATASE (EC 3.6.1.13) (185 aa), FASTA scores: opt: 313, E(): 1.1e-06, (42.7% identity in 124 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216216.1" /db_xref="GI:15608838" /db_xref="UniProtKB/TrEMBL:O33199" /db_xref="GeneID:885049" /translation="MAEHDFETISSETLHTGAIFALRRDQVRMPGGGIVTREVVEHFG AVAIVAMDDNGNIPMVYQYRHTYGRRLWELPAGLLDVAGEPPHLTAARELREEVGLQA STWQVLVDLDTAPGFSDESVRVYLATGLREVGRPEAHHEEADMTMGWYPIAEAARRVL RGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAFAARRAER" gene 1926202..1927137 /gene="xerD" /locus_tag="Rv1701" /db_xref="GeneID:885055" CDS 1926202..1927137 /gene="xerD" /locus_tag="Rv1701" /function="SEQUENCE INTEGRATION/RECOMBINATION." /note="site-specific tyrosine recombinase which cuts and rejoins DNA molecules; binds cooperatively to specific DNA consensus sites; forms a heterotetrameric complex with XerC; XerCD exhibit similar sequences; essential to convert chromosome dimers to monomers during cell division and functions during plasmid segregation; XerD specifically exchanges the bottom strands; cell division protein FtsK may regulate the XerCD complex; enzyme from Streptococcus group has unusual active site motifs" /codon_start=1 /transl_table=11 /product="site-specific tyrosine recombinase XerD" /protein_id="NP_216217.1" /db_xref="GI:15608839" /db_xref="GOA:P67636" /db_xref="UniProtKB/Swiss-Prot:P67636" /db_xref="GeneID:885055" /translation="MKTLALQLQGYLDHLTIERGVAANTLSSYRRDLRRYSKHLEERG ITDLAKVGEHDVSEFLVALRRGDPDSGTAALSAVSAARALIAVRGLHRFAAAEGLAEL DVARAVRPPTPSRRLPKSLTIDEVLSLLEGAGGDKPSDGPLTLRNRAVLELLYSTGAR ISEAVGLDLDDIDTHARSVLLRGKGGKQRLVPVGRPAVHALDAYLVRGRPDLARRGRG TAAIFLNARGGRLSRQSAWQVLQDAAERAGITAGVSPHMLRHSFATHLLEGGADVRVV QELLGHASVTTTQIYTLVTVHALREVWAGAHPRAR" gene complement(1927211..1928575) /locus_tag="Rv1702c" /db_xref="GeneID:885061" CDS complement(1927211..1928575) /locus_tag="Rv1702c" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv1702c, (MTCI125.24c), len: 454 aa. Conserved hypothetical ORF in REP13E12 degenerate repeat. Similar to other hypothetical proteins inside REP13E12 elements (often in two parts) e.g. Rv0094c|Q50655|MTCY251.13c (317 aa), FASTA scores: opt: 1284, E(): 0, (59.7% identity in 315 aa overlap); and Rv1128c, Rv1945, Rv1148c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216218.1" /db_xref="GI:15608840" /db_xref="UniProtKB/Swiss-Prot:P64885" /db_xref="GeneID:885061" /translation="MYSSSREEAVAAFDNLDTALNRVLKVSPDDLTIPECLAMLQRCE KIRRRLPAAEHPFINKLADQTDQTELGGKLPFALAERLHISRGEASRRIHEAADLGPR RTLTGQPLPPLLTATAAAQRAGHLGPAHVQVIRCFLHQLPHHVDLPTREKAEAELATL GGRFRPDQLHKLATKLADCLNPDGNYNDTDRARRRSIILGNQGPDGMSAISGYLTPEA RATVDAVLAKLAAPGMANPADDTPCLAGTPSQAAIEADTRSAGQRHHDGLLAALRALL CSGELGQHNGLPAAIIVSTSLTELQSRAGHALTGGGTLLPMSDVIRLASHANHYLRIF DHGRELALYHTKRLASPGQRIVLYAKDRGCSFPNCDVPGYLTEVHHVTDFAQCQETDI NELTQGCGPHHQLATTGGWITRKRKDGTTEWLPPAHLDHGQPRTNSYFHPEKLLHDSD EDDP" repeat_region 1927218..1928589 /note="REP-6, len: 1372 bp. REPI125, member of REP13E12 family.; REP-6" /rpt_type=DIRECT gene complement(1929131..1929721) /locus_tag="Rv1703c" /db_xref="GeneID:885066" CDS complement(1929131..1929721) /locus_tag="Rv1703c" /EC_number="2.1.1.6" /function="CATALYZES THE O-METHYLATION [CATALYTIC ACTIVITY: S-ADENOSYL-L-METHIONINE + CATECHOL = S-ADENOSYL-L-HOMOCYSTEINE + GUAIACOL]" /experiment="experimental evidence, no additional details recorded" /note="Rv1703c, (MTCI125.25c), len: 196 aa. Probable catechol-o-methyltransferase (EC 2.1.1.6), most similar to COMT_HUMAN|P21964 soluble form of mammalian catechol o-methyltransferase (271 aa), FASTA scores: opt: 405, E(): 7 .8e-29, (38.9% identity in 190 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical methyltransferases Rv0187, Rv1220c." /codon_start=1 /transl_table=11 /product="catechol-o-methyltransferase" /protein_id="NP_216219.1" /db_xref="GI:15608841" /db_xref="GOA:O33202" /db_xref="UniProtKB/TrEMBL:O33202" /db_xref="GeneID:885066" /translation="MLATIDKFAYEKSMLINVGDEKGTLLDAAVRRADPALALELGTY LGYGALRIARAAPEARVYSVELAEANASNARRIWAHAGVDDRVVCVVGTIGDGGRTLD ALTEHGFATGTLDFVFLDHDKKAYLPDLQSILDRGWLHPGSIVVADNVRVPGAPKYRA YMRRQQGMSWNTIEHKTHLEYQTLVPDLVLESEYLG" gene complement(1929786..1931456) /gene="cycA" /locus_tag="Rv1704c" /db_xref="GeneID:888812" CDS complement(1929786..1931456) /gene="cycA" /locus_tag="Rv1704c" /function="PERMEASE THAT IS INVOLVED IN THE TRANSPORT ACROSS THE CYTOPLASMIC MEMBRANE OF D-ALANINE, D-SERINE AND GLYCINE" /note="Rv1704c, (MTCI125.26c), len: 556 aa. Probable cycA, D-serine/D-alanine/glycine transporter, highly similar to P39312|CYCA_ECOLI d-serine/d-alanine/glycine transporter from Escherichia coli (470 aa), FASTA scores: opt: 1906, E(): 0, (59.3% identity in 459 aa overlap); etc. Also similar to other Mycobacterium tuberculosis amino-acid permeases e.g. Rv2127, Rv0346c, etc. Contains PS00218 amino acid permeases signature. BELONGS TO THE AMINO ACID PERMEASE FAMILY (APC FAMILY)." /codon_start=1 /transl_table=11 /product="D-serine/alanine/glycine transporter protein CycA" /protein_id="NP_216220.1" /db_xref="GI:15608842" /db_xref="GOA:O33203" /db_xref="UniProtKB/TrEMBL:O33203" /db_xref="GeneID:888812" /translation="MPDDIAAADPTDTQPHLRRDLANRHIQLIAIGGAIGTGLFMGSG RTISLAGPAVMVVYGIIGFFVFFVLRAMGELLLSNLNYKSFVDFAADLRGPAAGFFVG WSYWFAWVVTGIADLVAITGYARFWWPGLPIWVPALVTVALILAVNLFSVRHFGELEF WFALIKVAAIVCLIAVGAILVATNFVSPHGVHATIENLWNDNGFFPTGFLGVVSGFQI AFFAYIGVELVGTAAAETADPRRTLPRAINAVPLRVAVFYIGALLAILAVVPWRQFAS GESPFVTMFSLAGLAAAASVVNFVVVTAAASSANSGFFSTGRMLFGLADEGHAPAAFH QLNRGGVPAPALLLTAPLLLTSIPLLYAGRSVIGAFTLVTTVSSLLFMFVWAMIIISY LVYRRRHPQRHTDSVYKMPGGVVMCWAVLVFFAFVIWTLTTETETATALAWFPLWFVL LAVGWLVTQRRQSRRSFGFHCQVVGVRQQLGRGMARLAMKIHARPKLRSAVVVEPVSA GEPGARRSAKSVRKLASDDSQSAHCPVAVVGLADGGRDPQYHHDGPDR" misc_feature complement(1931217..1931309) /gene="cycA" /locus_tag="Rv1704c" /note="PS00218 Amino acid permeases signature" gene complement(1931497..1932654) /gene="PPE22" /locus_tag="Rv1705c" /db_xref="GeneID:885068" CDS complement(1931497..1932654) /gene="PPE22" /locus_tag="Rv1705c" /function="UNKNOWN" /note="Rv1705c, (MTCI125.27c), len: 385 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. YX23_MYCTU|Q10813 hypothetical 41.1 kDa protein cy274.2 3 (404 aa), fasta scores: opt: 819, E(): 0, (46.2% identity in 413 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177827.1" /db_xref="GI:57116898" /db_xref="UniProtKB/TrEMBL:Q79FL6" /db_xref="GeneID:885068" /translation="MDFGALPPEVNSGRMYCGPGSAPMVAAASAWNGLAAELSVAAVG YERVITTLQTEEWLGPASTLMVEAVAPYVAWMRATAIQAEQAASQARAAAAAYETAFA AIVPPPLIAANRARLTSLVTHNVFGQNTASIAATEAQYAEMWAQDAMAMYGYAGSSAT ATKVTPFAPPPNTTSPSAAATQLSAVAKAAGTSAGAAQSAIAELIAHLPNTLLGLTSP LSSALTAAATPGWLEWFINWYLPISQLFYNTVGLPYFAIGIGNSLITSWRALGWIGPE AAEAAAAAPAAVGAAVGGTGPVSAGLGNAATIGKLSLPPNWAGASPSLAPTVGSASAP LVSDIVEQPEAGAAGNLLGGMPLAGSGTGTGGAGPRYGFRVTVMSRPPFAG" gene complement(1932694..1933878) /gene="PPE23" /locus_tag="Rv1706c" /db_xref="GeneID:885070" CDS complement(1932694..1933878) /gene="PPE23" /locus_tag="Rv1706c" /function="UNKNOWN" /note="Rv1706c, (MTCI125.28c), len: 394 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. YX23_MYCTU|Q10813 hypothetical 41.1 kDa protein cy274.23 (404 aa), fasta scores: opt: 841, E(): 3.9e-31, (46.8% identity in 408 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177828.1" /db_xref="GI:57116899" /db_xref="UniProtKB/TrEMBL:Q7D842" /db_xref="GeneID:885070" /translation="MTLDVPVNQGHVPPGSVACCLVGVTAVADGIAGHSLSNFGALPP EINSGRMYSGPGSGPLMAAAAAWDGLAAELSSAATGYGAAISELTNMRWWSGPASDSM VAAVLPFVGWLSTTATLAEQAAMQARAAAAAFEAAFAMTVPPPAIAANRTLLMTLVDT NWFGQNTPAIATTESQYAEMWAQDAAAMYGYASAAAPATVLTPFAPPPQTTNATGLVG HATAVAALRGQHSWAAAIPWSDIQKYWMMFLGALATAEGFIYDSGGLTLNALQFVGGM LWSTALAEAGAAEAAAGAGGAAGWSAWSQLGAGPVAASATLAAKIGPMSVPPGWSAPP ATPQAQTVARSIPGIRSAAEAAETSVLLRGAPTPGRSRAAHMGRRYGRRLTVMADRPN VG" gene complement(1934482..1934649) /locus_tag="Rv1706A" /db_xref="GeneID:3205102" CDS complement(1934482..1934649) /locus_tag="Rv1706A" /function="UNKNOWN" /note="Rv1706A, len: 55 aa. Conserved hypothetical protein, similar to part of several probable export proteins e.g. Rv0783c|Z80226_28 from Mycobacterium tuberculosis (540 aa), FASTA scores: opt: 125, E(): 0.011, (52.85% identity in 53 aa overlap). Size difference suggests possible gene fragment." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177651.1" /db_xref="GI:57116900" /db_xref="UniProtKB/TrEMBL:Q79FL4" /db_xref="GeneID:3205102" /translation="MGSLAAFKLGWLLSAMAPNVVLLTAFRVPQGLTMLTVFATGQAG QHRCRTFHVTP" gene 1934882..1936342 /locus_tag="Rv1707" /db_xref="GeneID:885073" CDS 1934882..1936342 /locus_tag="Rv1707" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF SULFATE ACROSS THE MEMBRANE." /note="Rv1707, (MTCI125.29), len: 486 aa. Probable conserved transmembrane protein, possibly involved in transport of sulfate, similar to several hypothetical proteins belonging to the sulfate permease family e.g. P40877|YCHM_ECOLI hypothetical 58.4 kDa protein in pth-prsa intergenic region from Escherichia coli (550 aa), FASTA scores: opt: 486, E(): 0, (33.1% identity in 492 aa overlap). Also similar to many other Mycobacterium tuberculosis membrane proteins e.g. Rv3273, Rv1739c. SEEMS TO BELONG TO THE SULP FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216223.1" /db_xref="GI:15608845" /db_xref="UniProtKB/TrEMBL:O33206" /db_xref="GeneID:885073" /translation="MLQRIARELLSGVAVAIVALPLAIAFGITATGTSQGALIGLYGA IFAGFFAAVFGGTPGQVTGPTGPITVVATATIAEHGLEGAFFAFILAGVFQILFGACR LGSLIRYVPHPVISGFMGGIAILIIMTQLDQVRSSSLLVLVTVVLLLASGRFIKAIPP SLLVLVLVSSVLPLAAPWLRDLRAGPVSINRTVDYIGEIPQAMPSFDFPQVANSTMLQ VLLSAVAIALLGSLDSLLTSLVMDNIRGTRHRSNKELIGQGIGNIAAGLFGGLSGAGA TVRSVVNVRNGGQTALSAATHSVVLFVFVAGLGAVVQYIPLAVLSGILILVAVGMFDW HAMRKAHVSPRGDVIVMFTTMIITVVVDLTIAVMVGIALSLLVHRLRSRQRKAKVTQD DTGTYRIDGPLSFLSVDGVFGSLRDGREDVSLDLQHVTYLDTSGARALLYFIDHSEKD GVAVSIKRIPPRLESQLTALADNEQRDKLRTVLESA" gene 1936360..1937316 /locus_tag="Rv1708" /db_xref="GeneID:885082" CDS 1936360..1937316 /locus_tag="Rv1708" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN CELL PROCESS." /note="Rv1708, (MTCI125.30), len: 318 aa. Putative initiation inhibitor protein, a soj-related protein probably involved in cell process, highly similar to many sporulation initiation inhibitor proteins soj e.g. P37522|SOJ_BACSU Soj protein from Bacillus subtilis (253 aa), FASTA scores: opt: 745, E(): 0, (46.0% identity in 248 aa overlap), and more weakly to various repA/para/incC proteins from various organisms e.g. Y4CK_RHISN|P55393 putative replication protein A from Rhizobium sp. (407 aa), FASTA scores: opt: 205, E(): 4e-13, (29.0% identity in 252 aa overlap). Also similar to Mycobacterium tuberculosis hyothetical proteins Rv3213c and Rv3918c." /codon_start=1 /transl_table=11 /product="putative initiation inhibitor protein" /protein_id="NP_216224.1" /db_xref="GI:15608846" /db_xref="GOA:O33207" /db_xref="UniProtKB/TrEMBL:O33207" /db_xref="GeneID:885082" /translation="MPAGLPGQASVAVRLSCDVPPDARHHEPRPGMTDHPDTGNGIGL TGRPPRAIPDPAPRSSHGPAKVIAMCNQKGGVGKTTSTINLGAALGEYGRRVLLVDMD PQGALSAGLGVPHYELDKTIHNVLVEPRVSIDDVLIHSRVKNMDLVPSNIDLSAAEIQ LVNEVGREQTLARALYPVLDRYDYVLIDCQPSLGLLTVNGLACTDGVIIPTECEFFSL RGLALLTDTVDKVRDRLNPKLDISGILITRYDPRTVNSREVMARVVERFGDLVFDTVI TRTVRFPETSVAGEPITTWAPKSAGALAYRALARELIDRFGM" gene 1937313..1938149 /locus_tag="Rv1709" /db_xref="GeneID:885083" CDS 1937313..1938149 /locus_tag="Rv1709" /function="UNKNOWN" /note="Rv1709, (MTCI125.31), len: 278 aa. Conserved hypothetical protein, similar to others e.g. P35154|YPUG_BACSU from Bacillus subtilis (251 aa), FASTA scores: opt: 271, E(): 8.2e-10, (27.0% identity in 248 aa overlap); Q9S230|SCI51.10C|AL109848 from Streptomyces coelicolor (264 aa), FASTA scores: opt: 855, E(): 0, (56.8% identity in 257 aa overlap). Equivalent to Q49888|MLC1351.05C|Z95117 from Mycobacterium leprae (268 aa), FASTA scores: (78.9% identity in 251 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216225.1" /db_xref="GI:15608847" /db_xref="UniProtKB/TrEMBL:O33208" /db_xref="GeneID:885083" /translation="MNGLQNSLANGGTAPENGYSAGFRVRLTNFEGPFDLLLQLIFAH QLDVTEVALHQVTDDFIAYTKAIGARLELEETTAFLVIAATLLDLKAARLLPAGQVDD EEDLALLEVRDLLFARLLQYRAFKHVAEMFAELEATALRSYPRAVSLEDGFVGLLPEV MLGVDAHRFAEIAAIALTPRPAPTVATEHLHELMVSVPEQAEHLLAMLKARGSGQWAS FSELVADCTAPIEIVGRFLALLELYRTRAVAFEQSEPLGALQVSWTGDDAERSDEKER RL" repeat_region 1938093..1938145 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 1938146..1938841 /locus_tag="Rv1710" /db_xref="GeneID:885116" CDS 1938146..1938841 /locus_tag="Rv1710" /function="UNKNOWN" /note="Rv1710, (MTCI125.32), len: 231 aa. Conserved hypothetical protein, similar to several hypothetical proteins e.g. P35155|YPUH_BACSU from Bacillus subtilis (197 aa), FASTA scores: opt: 339, E(): 1.3e-09, (36.0% identity in 186 aa overlap); Q9S231|SCI51.09C|AL109848 from Streptomyces coelicolor (223 aa), FASTA scores: opt: 626, E(): 0, (51.0% identity in 192 aa overlap). Equivalent to O05669|MLC1351.04C|Z95117 Hypothetical protein from Mycobacterium leprae (231 aa), FASTA scores: (77.9% identity in 231 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216226.1" /db_xref="GI:15608848" /db_xref="UniProtKB/TrEMBL:O33209" /db_xref="GeneID:885116" /translation="MTEHMPEHDPSYGIPDIAEPAELDADELKRVLEALLLVIDTPVT ADALAAATEQPVYRVAAKLQLMADELTGRDSGIDLRHTSEGWRMYTRARFAPYVEKLL LDGARTKLTRAALETLAVVAYRQPVTRARVSAVRGVNVDAVMRTLLARGLITEVGTDA DTGAVTFATTELFLERLGLTSLSELPDIAPLLPDVDTIDDLSESLDSEPRFIKLTGEL ASEQTLSFDVDRD" gene 1938838..1939602 /locus_tag="Rv1711" /db_xref="GeneID:885122" CDS 1938838..1939602 /locus_tag="Rv1711" /function="UNKNOWN" /note="Rv1711, (MTCI125.33), len: 254 aa. Conserved hypothetical protein, highly similar to a large family of hypothetical proteins e.g. P37765|YCIL_ECOLI from Escherichia coli (291 aa), FASTA scores: opt: 496, E(): 1.1e-29, (41.6% identity in 250 aa overlap); 9S232|SCI51.08C|AL109848 PUTATIVE PSEUDOURIDINE SYNTHASE from Streptomyces coelicolor (371 aa), FASTA scores: opt: 818, E(): 0, (53.1% identity in 245 aa overlap). Equivalent to O05668|MLCB1351.03C|Z95117 Hypothetical protein from Mycobacterium leprae (256 aa), (80.5% identity in 256 aa overlap). Contains PS01149 Hypothetical yciL/yejD/yjbC family signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216227.1" /db_xref="GI:15608849" /db_xref="GOA:P65842" /db_xref="UniProtKB/Swiss-Prot:P65842" /db_xref="GeneID:885122" /translation="MMAEPEESREPRGIRLQKVLSQAGIASRRAAEKMIVDGRVEVDG HVVTELGTRVDPQVAVVRVDGARVVLDDSLVYLALNKPRGMHSTMSDDRGRPCIGDLI ERKVRGTKKLFHVGRLDADTEGLMLLTNDGELAHRLMHPSHEVPKTYLATVTGSVPRG LGRTLRAGIELDDGPAFVDDFAVVDAIPGKTLVRVTLHEGRNRIVRRLLAAAGFPVEA LVRTDIGAVSLGKQRPGSVRALRSNEIGQLYQAVGL" misc_feature 1939183..1939233 /locus_tag="Rv1711" /note="PS01149 Hypothetical yciL/yejD/yjbC family signature" gene 1939599..1940291 /gene="cmk" /locus_tag="Rv1712" /db_xref="GeneID:885157" CDS 1939599..1940291 /gene="cmk" /locus_tag="Rv1712" /EC_number="2.7.4.14" /function="Catalyzes the transfer of a phosphate group from ATP to either CMP or UMP to form CDP or UDP and ADP [CATALYTIC ACTIVITY: ATP + CMP = ADP + CDP]." /note="Catalyzes the formation of (d)CDP from ATP and (d)CMP" /codon_start=1 /transl_table=11 /product="cytidylate kinase" /protein_id="NP_216228.1" /db_xref="GI:15608850" /db_xref="GOA:P63803" /db_xref="UniProtKB/Swiss-Prot:P63803" /db_xref="GeneID:885157" /translation="MSRLSAAVVAIDGPAGTGKSSVSRRLARELGARFLDTGAMYRIV TLAVLRAGADPSDIAAVETIASTVQMSLGYDPDGDSCYLAGEDVSVEIRGDAVTRAVS AVSSVPAVRTRLVELQRTMAEGPGSIVVEGRDIGTVVFPDAPVKIFLTASAETRARRR NAQNVAAGLADDYDGVLADVRRRDHLDSTRAVSPLQAAGDAVIVDTSDMTEAEVVAHL LELVTRRSEAVR" misc_feature 1939635..1939658 /gene="cmk" /locus_tag="Rv1712" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1940288..1941679 /gene="engA" /locus_tag="Rv1713" /db_xref="GeneID:885086" CDS 1940288..1941679 /gene="engA" /locus_tag="Rv1713" /function="BINDS BOTH GDP AND GTP. HAS AN INTRINSIC GTPASE ACTIVITY AND IS ESSENTIAL FOR CELL GROWTH." /note="synchronizes cellular events by interacting with multiple targets with tandem G-domains; overexpression in Escherichia coli suppresses rrmJ mutation; structural analysis of the Thermotoga maritima ortholog shows different nucleotide binding affinities in the two binding domains" /codon_start=1 /transl_table=11 /product="GTP-binding protein EngA" /protein_id="NP_216229.1" /db_xref="GI:15608851" /db_xref="GOA:P64057" /db_xref="UniProtKB/Swiss-Prot:P64057" /db_xref="GeneID:885086" /translation="MTQDGTWVDESDWQLDDSEIAESGAAPVVAVVGRPNVGKSTLVN RILGRREAVVQDIPGVTRDRVCYDALWTGRRFVVQDTGGWEPNAKGLQRLVAEQASVA MRTADAVILVVDAGVGATAADEAAARILLRSGKPVFLAANKVDSEKGESDAAALWSLG LGEPHAISAMHGRGVADLLDGVLAALPEVGESASASGGPRRVALVGKPNVGKSSLLNK LAGDQRSVVHEAAGTTVDPVDSLIELGGDVWRFVDTAGLRRKVGQASGHEFYASVRTH AAIDSAEVAIVLIDASQPLTEQDLRVISMVIEAGRALVLAYNKWDLVDEDRRELLQRE IDRELVQVRWAQRVNISAKTGRAVHKLVPAMEDALASWDTRIATGPLNTWLTEVTAAT PPPVRGGKQPRILFATQATARPPTFVLFTTGFLEAGYRRFLERRLRETFGFDGSPIRV NVRVREKRAGKRR" misc_feature 1940384..1940407 /gene="engA" /locus_tag="Rv1713" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1940903..1940926 /gene="engA" /locus_tag="Rv1713" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 1941853..1942665 /locus_tag="Rv1714" /db_xref="GeneID:885159" CDS 1941853..1942665 /locus_tag="Rv1714" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1714, (MTV048.01), len: 270 aa. Probable oxidoreductase (EC 1.-.-.-) similar to many e.g. AE0010|AE001021_4 Archaeoglobus fulgidus section 79 (281 aa), FASTA scores: opt: 578, E(): 3.3e-31, (38.9% identity in 265 aa overlap). Also similar to several other M. tuberculosis oxidoreductases e.g. Rv1544, etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_216230.1" /db_xref="GI:15608852" /db_xref="GOA:O53927" /db_xref="UniProtKB/TrEMBL:O53927" /db_xref="GeneID:885159" /translation="MEEMALAQQVPNLGLARFSVQDKSILITGATGSLGRVAARALAD AGARLTLAGGNSAGLAELVNGAGIDDAAVVTCRPDSLADAQQMVEAALGRYGRLDGVL VASGSNHVAPITEMAVEDFDAVMDANVRGAWLVCRAAGRVLLEQGQGGSVVLVSSVRG GLGNAAGYSAYCPSKAGTDLLAKTLAAEWGGHGIRVNALAPTVFRSAVTEWMFTDDPK GRATREAMLARIPLRRFAEPEDFVGALIYLLSDASSFYTGQVMYLDGGYTAC" gene 1942659..1943573 /gene="fadB3" /locus_tag="Rv1715" /db_xref="GeneID:885160" CDS 1942659..1943573 /gene="fadB3" /locus_tag="Rv1715" /EC_number="1.1.1.157" /function="THOUGHT TO BE INVOLVED IN FATTY ACID DEGRADATION. FADB AND FADA ARE THE ALPHA AND BETA SUBUNITS OF THE MULTIFUNCTIONAL ENZYME COMPLEX OF THE FATTY ACID DEGRADATION CYCLE [CATALYTIC ACTIVITY: (S)-3-hydroxybutanoyl-CoA + NADP+ = 3-acetoacetyl-CoA + NADPH]." /note="Rv1715, (MTV048.02), len: 304 aa. Probable fadB3, 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157), highly similar to many e.g. NP_107236.1|NC_002678 3-hydroxybutyryl-CoA dehydrogenase from Mesorhizobium loti (309 aa); NP_250319.1|NC_002516 probable 3-hydroxyacyl-CoA dehydrogenase from Pseudomonas aeruginosa (509 aa); P45856|HBD_BACSU PROBABLE 3-HYDROXYBUTYRYL-COA DEHYDROGENASE from Bacillus subtilis (287 aa), FASTA scores: opt: 488, E(): 1.5e-24, (38.7% identity in 279 aa overlap); etc. COULD BELONG TO THE 3-HYDROXYACYL-COA DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="3-hydroxybutyryl-CoA dehydrogenase FADB3" /protein_id="YP_177829.1" /db_xref="GI:57116901" /db_xref="GOA:Q7D836" /db_xref="UniProtKB/TrEMBL:Q7D836" /db_xref="GeneID:885160" /translation="MLTSHGFSRAAVVGAGLMGRRIAGVLASAGLDVAITDTNAEILH AAAVEAARVAGAGRGSVAAAADLAAAIPDADLVIEAVVENLAVKQELFERLATLAPDA VLATNTSVLPIGAVTERVEDGSRVIGTHFWNPPDLIPVVEVVPSARTAPDTADRVVAL LTQVGKLPVRVGRDVPGFIGNRLQHALWREAIALVAEGVCDPKTVDLVVRNTIGLRLA TLGPLENADYIGLDLTLAIHDAVIPSLNHDPHPSPLLRELVAAGQLGARTGHGFLDWP AGAREATTARLAQHIAAQLQANEKGRGT" gene 1943576..1944406 /locus_tag="Rv1716" /db_xref="GeneID:885162" CDS 1943576..1944406 /locus_tag="Rv1716" /function="UNKNOWN" /note="Rv1716, (MTV048.03,MTCY04C12.01) len: 276 aa. Conserved hypothetical protein, shows high similarity with AF1200|O29068|AE001021_11A conserved protein of Archaeoglobus fulgidus, gp fulgidus section 7 (278 aa), FASTA scores: E(): 0, (61.8% identity in 251 a a overlap); also weak similarity to several polyketide cyclases e.g. O68500|AF048833|DPSY from Streptomyces peucetius (272 aa), FASTA scores: opt: 194, E(): 1.7e-05, (29.6% identity in 223 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216232.1" /db_xref="GI:15608854" /db_xref="UniProtKB/TrEMBL:O53929" /db_xref="GeneID:885162" /translation="MTFAWPLGAAESTLEFYDLSHPWGHGAPAWPYFEDVQIERLHGM AKSRVLTQKITTVMHSGTHIDAPAHVVEGTPFLDEIPLSAFFGTGVVVSIPKGKWGMV TAEDLQNATPDIRPGDIVVVNTGWHHKYADSAEYYAYSPGFDKKAGEWFAAKGVKAVG TDTQALDHPLATAIAPHSPAEAQGGLLPWAVREYEAQTGRKVLDDFPDWEPCHRAILS QGIYGFENVGGDLDKVTGKRVTFAAFPWRWVGGDGCIVRLVAIVDPTGSYRIETGKAV" gene 1944406..1944756 /locus_tag="Rv1717" /db_xref="GeneID:885166" CDS 1944406..1944756 /locus_tag="Rv1717" /function="UNKNOWN" /note="Rv1717, (MTCY04C12.02), len: 116 aa. Conserved hypothetical protein, similar to O29060|AF1208|AE001021 Hypothetical protein from Arecheoglobus fulgidus (114 aa), FASTA scores: opt: 254, E(): 3.3e-09, (37.7% identity in 114 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216233.1" /db_xref="GI:15608855" /db_xref="UniProtKB/TrEMBL:O86372" /db_xref="GeneID:885166" /translation="MKLTRASQAPRYVAPAHHEVSTMRLQGREAGRTERFWVGLSVYR PGGTAEPAPTREETVYVVLDGELVVTVDGAETVLGWLDSVHLAKGELRSIHNRTDRQA LLLVTVAHPVAEVA" repeat_region 1944756..1944808 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 1944809..1945627 /locus_tag="Rv1718" /db_xref="GeneID:885169" CDS 1944809..1945627 /locus_tag="Rv1718" /function="UNKNOWN" /note="Rv1718, (MTCY04C12.03), len: 272 aa. Conserved hypothetical protein, similar to O29058|AF1210|AE001021 Hypothetical protein from Archeoglobus (313 aa), FASTA scores: opt: 301, E(): 8e-23, (31.6% identity in 301 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216234.1" /db_xref="GI:15608856" /db_xref="UniProtKB/TrEMBL:P71976" /db_xref="GeneID:885169" /translation="MSIVITVAPTGPIATKADNPALPTSPEEIATAVEQAYHAGAAVA HIHLRDENERPTADPNIARRAMDLIGERCPILIQLSTGVGLTVPFEQREQLVELRPRM ATLNPCSMSFGAGEFRNPPQAVRRLAARMRELDIKPELEIYDTGHLEACLRLWAEDLL AEPLQFSIVLGVRGGMAATADNLLTMVRRLPPGAIWQVIAIGKANMELTAMGLALGGN ARVGLEDTLYLRKGELAPSNLALVSRTIRLAEALDLPIASVEEAEAALQLPGTS" gene 1945641..1946420 /locus_tag="Rv1719" /db_xref="GeneID:885170" CDS 1945641..1946420 /locus_tag="Rv1719" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1719, (MTCY04C12.04), len: 259 aa. Probable transcriptional regulatory protein, similar to YIAJ_ECOLI|P37671 hypothetical transcriptional regulator from Escherichia coli (282 aa), FASTA scores: opt: 353, E(): 3.2e-15, (31.1% identity in 235 aa overlap). Similar to Mycobacterium tuberculosis hypothetical IclR-family transcriptional regulators Rv2989, Rv1773c. Helix-turn-helix motif from aa 34-55 (+6.94 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216235.1" /db_xref="GI:15608857" /db_xref="GOA:P71977" /db_xref="UniProtKB/TrEMBL:P71977" /db_xref="GeneID:885170" /translation="MSAEEQDTRSGGIQVIARAAELLRVLQAHPGGLSQAEIGERVGM ARSTVSRILNALEDEGLVASRGARGPYRLGPEITRMATTVRLGVVTEMHPFLTELSRE LDETVDLSILDGDRADVVDQVVPPQRLRAVSAVGESFPLYCCANGKALLAALPPERQA RALPSRLAPLTANTITDRAALRDELNRIRVDGVAYDREEQTEGICAVGAVLRGVSVEL VAVSVPVPAQRFYGREAELAGALLAWVSKVDAWFNGTEDRK" gene 1946613..1946686 /locus_tag="Rvnt21" /note="tRNA-Pro(GGG)" /db_xref="GeneID:2700455" tRNA 1946613..1946686 /locus_tag="Rvnt21" /product="tRNA-Pro" /note="codon recognized: CCC" /anticodon=(pos:1946647..1946649,aa:Pro) /db_xref="GeneID:2700455" gene complement(1947030..1947419) /locus_tag="Rv1720c" /db_xref="GeneID:885180" CDS complement(1947030..1947419) /locus_tag="Rv1720c" /function="UNKNOWN" /note="Rv1720c, (MTCY04C12.05c), len: 129 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. O53610|Rv0065|MTV030.08 (133 aa), FASTA scores: E(): 1.5e-10, (39.1% identity in 128 aa overlap); P71550|Rv0960|MTCY10D7.14C (129 aa) and O06415|Rv0549c|MTCY25D10.28C (137 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216236.1" /db_xref="GI:15608858" /db_xref="UniProtKB/TrEMBL:P71978" /db_xref="GeneID:885180" /translation="MIVLDASAAVELMLTTPAGAAVARRLRGETVHAPAHFDVEVIGA IRQAVVRQLISDHEGLVVVVNFLSLPVRRWPLKPFTQRAYQLRSTHTVADGAYVALAE GLGVPLITCDGRLAQSHGHNAEIELVA" gene complement(1947416..1947643) /locus_tag="Rv1721c" /db_xref="GeneID:885182" CDS complement(1947416..1947643) /locus_tag="Rv1721c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1721c, (MTCY04C12.06c), len: 75 aa. Conserved hypothetical protein, similar to Rv0300|MTCY63.05|O07227 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (73 aa). Start changed since original submission." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216237.2" /db_xref="GI:57116902" /db_xref="UniProtKB/TrEMBL:P71979" /db_xref="GeneID:885182" /translation="MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEP ALDDVLDRLAALPRRDLGASAAELVDEARSE" gene 1947861..1949345 /locus_tag="Rv1722" /db_xref="GeneID:885183" CDS 1947861..1949345 /locus_tag="Rv1722" /EC_number="6.4.1.2" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID METABOLISM" /note="an AccC homodimer forms the biotin carboxylase subunit of the acetyl CoA carboxylase, an enzyme that catalyzes the formation of malonyl-CoA, which in turn controls the rate of fatty acid metabolism" /codon_start=1 /transl_table=11 /product="biotin carboxylase-like protein" /protein_id="NP_216238.1" /db_xref="GI:15608860" /db_xref="UniProtKB/TrEMBL:P71980" /db_xref="GeneID:885183" /translation="MIVPAREPEPQPRRVLNGLSDVRAFFHNNTVPLYFISPTPFNLL GIYRWIRNFFYLTYYDSFEGEHSRVFVPRRRDRRDFDGMGDVCNHLLRDPETLEFIKN RGPGGKACFVMLDEETQALARQAGLEVMHPPAELRHRLESKIVMTRLADEAGVPSVPH VIGRVSSYDELSALAHGAGLGDDLVVEAAYGNAGSATFFVRGLRDWDQCAGGIVGQPE IKVMKRIRNVEVCIEATVTRHGTVIGPAMTSLVGYPELTPYRGAWCGNDVWRGALPPA QTRAAREMVAKLGDVLSREGYRGYFEVDLLHDLDADELYLGEVNPRLSGASPMTNLTT EAYADMPLFLFHLLEYMDVDYELDIEAINSRWERGYGEDEVWGQLIMSETSPDLELFT ATPRTGMWRLNHDGRVSFARQGNDWATMLDESEAFYMRVAAPGDLRCEGAQLGVLVTR GHLQTDDYQLTERGRRWIDGLKAQFASTPLTPAAPIVSRLVARA" gene 1949342..1950589 /locus_tag="Rv1723" /db_xref="GeneID:885185" CDS 1949342..1950589 /locus_tag="Rv1723" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1723, (MTCY04C12.08), len: 415 aa. Possible hydrolase (EC 3.-.-.-), similar to others e.g. NYLB_FLASP|P07061 6-aminohexanoate-dimer hydrolase from Flavobacterium sp. (392 aa), FASTA scores: opt: 717, E(): 0, (35.1% identity in 396 aa overlap). Also similar to M. tuberculosis hypothetical esterases and penicillin binding proteins e.g. Rv1923, Rv1497, Rv2463, etc" /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_216239.1" /db_xref="GI:15608861" /db_xref="GOA:P71981" /db_xref="UniProtKB/TrEMBL:P71981" /db_xref="GeneID:885185" /translation="MSGGVPAGLALDNWLSSPYSHWAFQHVEDFMPTTVIARGTEPVV TLPADNAPIADIGLTSTDGIATTVGAVMAATATDGWAVAHRGALVAEQYLDGLGPRTR HLLFSVSKSLVAAVVGALHGAGAIELDAPVTAYVPALADCGYAGATVRHLLDMRSGVA FSENYDDPAAEIHVREQVIGWAPKRGPDLPATLRDYLLTLRRKSAHGGPFEYRSCETD VLGWICEAAAGQPMPELMSELLWSRIGAQCDATIALDVAGAAGTGIFDGGISACLTDM IRFGSLYLRDGVSLAGQQVVPAAWIADTFDGGPDSRQAFAASPDDNPMPGGMYRNQVW FPYPGSNVALCVGMCGQLIYVNRAAEVVAAKLSTQPHSHEPHMLDTLRAFDAVAHELS GIRSSSTNDPQRPSPPAQEASPG" gene complement(1950632..1951051) /locus_tag="Rv1724c" /db_xref="GeneID:885189" CDS complement(1950632..1951051) /locus_tag="Rv1724c" /function="UNKNOWN" /note="Rv1724c, (MTCY04C12.09c), len: 139 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216240.1" /db_xref="GI:15608862" /db_xref="UniProtKB/TrEMBL:P71982" /db_xref="GeneID:885189" /translation="MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPV PHWPKYWIQALAKHFQRQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYN AVRNAGRAIENEQAALDHKLAEVRKRRMDTWDESYFR" gene complement(1951041..1951751) /locus_tag="Rv1725c" /db_xref="GeneID:885191" CDS complement(1951041..1951751) /locus_tag="Rv1725c" /function="UNKNOWN" /note="Rv1725c, (MTCY04C12.10c), len: 236 aa. Conserved hypothetical protein, similar to other hypothetical proteins from diverse organisms e.g. P70885|U44893 ORF108 from BUTYRIVIBRIO FIBRISOLVENS, (108 aa), FASTA scores: opt: 223, E(): 2e-09, (39.1% identity in 92 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical transcriptional regulator, O05774|Rv3095|YU95_MYCTU (158 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216241.1" /db_xref="GI:15608863" /db_xref="UniProtKB/TrEMBL:P71983" /db_xref="GeneID:885191" /translation="MQPYGQYCPVARAAELLGDRWTLLIVRELLFGPLRFTEIERGLP GISRSVLAQRLRRLQHDRIIEAVPEHTGGGYRFTVAGEELRPVLQTLGDWVSRWLMAD PTPAECDPELLTLWISRRVNTEALPGRRVVVEFRYHGERPLWAWLVLEPGDISVCLHD PCLPVDLTVRGHPRDLYRVYSGRSTLAAEISAERIELDGLPAMRRAFPSWMAWSPFAP AMRQAVVSVDQMPEAHGG" gene 1951852..1953237 /locus_tag="Rv1726" /db_xref="GeneID:885192" CDS 1951852..1953237 /locus_tag="Rv1726" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1726, (MTCY04C12.11), len: 461 aa. Probable oxidoreductase (EC 1.-.-.-), similar to HDNO_ARTOX|P08159 6-hydroxy-d-nicotine oxidase (458 aa), FASTA scores: opt: 678, E(): 0, (29.5% identity in 465 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical dehydrogenases e.g. Rv3107c, Rv1257c, etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_216242.1" /db_xref="GI:15608864" /db_xref="GOA:P71984" /db_xref="UniProtKB/TrEMBL:P71984" /db_xref="GeneID:885192" /translation="MTATLTKTLGSLDDFRGTLCVPGDPDYPRVRAIWNGQVAREPAL IATCHDACDVRTVLRRAVDAGMVTAVRGGGHNVAGTALCDGGVVIDLSAMRAVSLDPA TGRVRVQGGATLADLDHATVPFARVAPAGIVTTTGVGGLTLGGGVGWTTRRFGLSCDN LVAVRLVTAAGDYLSVDDERDPELMWGLRGGGGNFGIVTEFEFATHPFGPVAVAGFVV YRLDDGPAVLRGYRQFAAAAPEEVTTIVVLRHAPPAPWIPVDQRGKPVVMIGAVHTGS IQTGIEALRPVKSLARPVADTVWPTPFLAHQAVLDASNPAGHRYYWKSDHLAELNDEA IDLLVEQTAQLSSPDSLIGIFQLGGAAARGGERSCFPSRHARFMVNYATHWTEAREDD LHRQWTRDAIEALAPYGLGTAYVNFTADDAPMHVETLYSTTEFSRLVTLKNRLDPDNV FRNNHNIRPSA" gene 1953270..1953839 /locus_tag="Rv1727" /db_xref="GeneID:887422" CDS 1953270..1953839 /locus_tag="Rv1727" /function="UNKNOWN" /note="Rv1727, (MTCY04C12.12), len: 189 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins P72040|Rv3773c|MTCY13D12.07C (194 aa), FASTA scores: opt: 176, E(): 2.7e-08, (31.1% identity in 180 aa overlap); and O53801|Rv0738 (182 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216243.1" /db_xref="GI:15608865" /db_xref="UniProtKB/TrEMBL:P71985" /db_xref="GeneID:887422" /translation="MDLYSNLVEAEQRLVALVSSIEADSYSSPTPCDRWDVRALLSHA LASIDAFAAAVDGAPGPDMAQVFSGADIVGDDPLGATQRITRRSQAAWSTVRDLNAEL STFIGVMPAGQALAIITFSTVVHGWDLAVATGQAGELPEHLAEAAQQVAAELVPVLRP RGLFAHDVDLAGEATPTQRLVALTGRKPR" gene complement(1953864..1954634) /locus_tag="Rv1728c" /db_xref="GeneID:885193" CDS complement(1953864..1954634) /locus_tag="Rv1728c" /function="UNKNOWN" /note="Rv1728c, (MTCY04C12.13c), len: 256 aa. Conserved hypothetical protein, some similarity to O07246|Rv0320|MTCY63.25 possible exported protein from Mycobacterium tuberculosis (220 aa), FASTA scores: E(): 1.3e-31, (42.3% identity in 220 aa overlap). C-terminal region similar to Q9ZX60|AF068845|AF068845_17 segment of gp17 of Mycobacteriophage TM4 (1229 aa), FASTA scores: opt: 385, E(): 4.3e-17, (44.6% identity in 139 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216244.1" /db_xref="GI:15608866" /db_xref="UniProtKB/TrEMBL:P71986" /db_xref="GeneID:885193" /translation="MSVNGLPGAHNAGLQPIDSKGCHTRRTRHTKVLFVSKGVLANGR GRWLAIAASLVVSAAILYAQGAEHTCCRETPAAIPTGPDSAPANAPRIASPTEADLLA ASAPVAAQQFQFALPAGVASEEGLQVKTIWVARAVSVLFPQITNIFGYRQDPLKWHPN GLAIDVMIPNHHSDEGIQLGNQVAGLALANAKRWGVLHVIWRQGYYPGIGAPSWTADY GSETLNHYDHVHIATDGGGYPTGRETYYVGSMSPTPPE" gene complement(1954631..1955569) /locus_tag="Rv1729c" /db_xref="GeneID:885194" CDS complement(1954631..1955569) /locus_tag="Rv1729c" /function="UNKNOWN" /note="Rv1729c, (MTCY04C12.14c), len: 312 aa. Conserved hypothetical protein, similar to many Mycobacterium tuberculosis hypothetical proteins e.g. Q50726|Rv3399|YX99_MYCTU (348 aa), FASTA scores: opt: 1019, E(): 0, (55.7% identity in 296 aa overlap); P95074|Rv0726c (367 aa), O53795|Rv0731c (318 aa), and O53841|Rv0830 (301 aa), etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216245.1" /db_xref="GI:15608867" /db_xref="UniProtKB/TrEMBL:P71987" /db_xref="GeneID:885194" /translation="MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEP LVRAVGLDLFTKMMDGELDMSTIADVSPAVAQAMVYGNAVRTKYFDDYLLNATAGGIR QVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPSAIRRAVPIDL RADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFDNITALSAPGSMVATEFVT GIADFSAERARTISNPFRCHGVDVDLASLVYTGPRNHVLDYLAAKGWQPEGVSLAELF RRSGLDVRAADDDTIFISGCLTDHSSISPPTAAGWR" gene complement(1955692..1957245) /locus_tag="Rv1730c" /db_xref="GeneID:885202" CDS complement(1955692..1957245) /locus_tag="Rv1730c" /function="THOUGHT TO BE INVOLVED IN CELL WALL BIOSYNTHESIS AND MAY ALSO ACT AS A SENSOR OF EXTERNAL PENICILLINS" /note="Rv1730c, (MTCY04C12.15c), len: 517 aa. Possible penicillin-binding protein, similar to others e.g. PBP4_NOCLA|Q06317 penicillin-binding protein 4 (pbp-4) from Nocardia lactamdurans (381 aa), FASTA scores: opt: 643, E(): 3.8e-32, (33.8% identity in 370 aa overlap); etc. Also similar to other Mycobacterium tuberculosis hypothetical penicillin binding proteins and esterases e.g. Rv1923, Rv1497, etc." /codon_start=1 /transl_table=11 /product="penicillin-binding protein" /protein_id="NP_216246.1" /db_xref="GI:15608868" /db_xref="GOA:P71988" /db_xref="UniProtKB/TrEMBL:P71988" /db_xref="GeneID:885202" /translation="MCPPIILSSATPTGTRCGTRHGRAVVTEYVRALDRLPHEIATAV VETVNCADPGAAFDELDAKINAGMKAYAIPGVAVAVWAGGQEYVKGYGVTNVDHPMPV DGDTVFRIGSTTKTFTGTVMMRLVERGKVDLDSPVRRYIPDFAVADESASATVTVRQL LNHTAGWDGRNGQDFGRGDDAVALYVKAMTRLPQLTPPGTAFAYNNSGLVVAGRIIEL VAGTTYESTVQRLLLDPLQLAHTRYFSDQIIGLNVAASHSVVDGKPIAVTDFWTFPRS CNPTGGLMSTARDQLRYAQFHLGDGRAPNGEQILSRQSLKAMRSNPGAGGTLWVELTG MGVTWMLRPSAENVTIVEHGGTWKGQRSGFVMVPDRNFAMTVLTNSDGGFHMINDLFA SDWALQRFAGLSNLPATPQRLGAVDLAPYEGRYIAKQVAQNGDLETTVIDFRARDGQL AGSMSTDDANPDGQNSANLGLAFYRPDYGLDLGPDNKPTGSRSNFVRGPDGNIAWFCS QHGRLFRRQ" gene 1957677..1959233 /gene="gabD2" /locus_tag="Rv1731" /db_xref="GeneID:885204" CDS 1957677..1959233 /gene="gabD2" /locus_tag="Rv1731" /EC_number="1.2.1.16" /function="INVOLVED IN 4-AMINOBUTYRATE (GABA) DEGRADATION PATHWAY [CATALYTIC ACTIVITY: SUCCINATE SEMIALDEHYDE + NAD(P)(+) + H(2)O = SUCCINATE + NAD(P)H]." /experiment="experimental evidence, no additional details recorded" /note="NADP-dependent semialdehyde dehydrogenase; part of alternative pathway from alpha-ketoglutarate to succinate" /codon_start=1 /transl_table=11 /product="succinic semialdehyde dehydrogenase" /protein_id="NP_214748.2" /db_xref="GI:57116903" /db_xref="GOA:P96417" /db_xref="UniProtKB/TrEMBL:P96417" /db_xref="GeneID:885204" /translation="MPAPSAEVFDRLRNLAAIKDVAARPTRTIDEVFTGKPLTTIPVG TAADVEAAFAEARAAQTDWAKRPVIERAAVIRRYRDLVIENREFLMDLLQAEAGKARW AAQEEIVDLIANANYYARVCVDLLKPRKAQPLLPGIGKTTVCYQPKGVVGVISPWNYP MTLTVSDSVPALVAGNAVVLKPDSQTPYCALACAELLYRAGLPRALYAIVPGPGSVVG TAITDNCDYLMFTGSSATGSRLAEHAGRRLIGFSAELGGKNPMIVARGANLDKVAKAA TRACFSNAGQLCISIERIYVEKDIAEEFTRKFGDAVRNMKLGTAYDFSVDMGSLISEA QLKTVSGHVDDATAKGAKVIAGGKARPDIGPLFYEPTVLTNVAPEMECAANETFGPVV SIYPVADVDEAVEKANDTDYGLNASVWAGSTAEGQRIAARLRSGTVNVDEGYAFAWGS LSAPMGGMGLSGVGRRHGPEGLLKYTESQTIATARVFNLDPPFGIPATVWQKSLLPIV RTVMKLPGRR" misc_feature 1958385..1958435 /gene="gabD2" /locus_tag="Rv1731" /note="PS00216 Sugar transport proteins signature 1" misc_feature 1958433..1958456 /gene="gabD2" /locus_tag="Rv1731" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" gene complement(1959243..1959791) /locus_tag="Rv1732c" /db_xref="GeneID:885211" CDS complement(1959243..1959791) /locus_tag="Rv1732c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1732c, (MTCY04C12.17c), len: 182 aa. Conserved hypothetical protein, highly similar to hypothetical proteins from several organisms e.g. P73178|SLL1289|D90904 from Synechocystis (194 aa), FASTA scores: opt: 663, E(): 0, (53.1% identity in 179 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216248.1" /db_xref="GI:15608870" /db_xref="UniProtKB/TrEMBL:P71990" /db_xref="GeneID:885211" /translation="MAVESSMLALGTPAPSFTLPQPATGATVSLDELTGPALVVTFIC NHCPYVQHVAAGLATLGRDLADQGVPMVGISSNDVVTYPQDGPDQMVAEARRHGWTFP YLYDETQDVARAFSAACTPDTFVFDGQRRLVYRGQLDDSRPGNGRPVTAADVRAAVDA LLAGRPVNPDQRPSIGCGIKWR" gene complement(1959855..1960487) /locus_tag="Rv1733c" /db_xref="GeneID:885214" CDS complement(1959855..1960487) /locus_tag="Rv1733c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1733c, (MTCY04C12.18c), len: 210 aa. Probable conserved transmembrane protein. Similar to AL109962|SCJ1_26 hypothetical protein from Streptomyces coelicolor (193 aa), FASTA scores: opt: 287, E(): 3.8e-11, (35.2% identity in 182 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216249.1" /db_xref="GI:15608871" /db_xref="UniProtKB/TrEMBL:P71991" /db_xref="GeneID:885214" /translation="MIATTRDREGATMITFRLRLPCRTILRVFSRNPLVRGTDRLEAV VMLLAVTVSLLTIPFAAAAGTAVQDSRSHVYAHQAQTRHPATATVIDHEGVIDSNTTA TSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRVGIWVDSAGQLVDEPAPPARA IADAALAALGLWLSVAAVAGALLALTRAILIRVRNASWQHDIDSLFCTQR" gene complement(1960774..1961016) /locus_tag="Rv1734c" /db_xref="GeneID:885212" CDS complement(1960774..1961016) /locus_tag="Rv1734c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1734c, (MTCY04C12.19c), len: 80 aa. Conserved hypothetical protein, similar to C-terminal region Q9Z8N2|CP0452|AE001615 Dihydrolipoamide Acetyltransferase from Chlamydia pneumoniae (429 aa), FASTA scores: opt: 138, E(): 0.0012, (26.9% identity in 78 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216250.1" /db_xref="GI:15608872" /db_xref="UniProtKB/TrEMBL:P71992" /db_xref="GeneID:885212" /translation="MTNVGDQGVDAVFGVIYPPQVALVSFGKPAQRVCAVDGAIHVMT TVLATLPADHGCSDDHRGALFFLSINELTRCAAVTG" gene complement(1961291..1961788) /locus_tag="Rv1735c" /db_xref="GeneID:885216" CDS complement(1961291..1961788) /locus_tag="Rv1735c" /function="UNKNOWN" /note="Rv1735c, (MTCY04C12.20c), len: 165 aa. Hypothetical membrane protein, similar to part of O58614|PH0884|AP000004 Hypothetical malic acid transport protein from Pyrococcus horikoshii (330 aa), FASTA scores: opt: 167, E(): 0.0003, (29.2% identity in 120 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216251.1" /db_xref="GI:15608873" /db_xref="UniProtKB/TrEMBL:P71993" /db_xref="GeneID:885216" /translation="MGATAITVLAGAHIVEMADAPMAIVTSGLVAGASVVFWAFGPWL IPPLVAASIWKHVVHRVPLRYEATLWSVVFPLGMYGVGAYRLGLAAHLPIVESIGEFE GWVALAVWTITFVAMLHHLAATIGRSGRSSHAIGAADDTHAIICRPPRSFDHQVRAFR RNQPM" gene complement(1962228..1964186) /gene="narX" /locus_tag="Rv1736c" /db_xref="GeneID:885213" CDS complement(1962228..1964186) /gene="narX" /locus_tag="Rv1736c" /EC_number="1.7.99.4" /function="INVOLVED IN NITRATE REDUCTION, AND IN THE PERSISTENCE IN THE HOST [CATALYTIC ACTIVITY: Nitrite + acceptor = nitrate + reduced acceptor]" /experiment="experimental evidence, no additional details recorded" /note="Rv1736c, (MTCY04C12.21c), len: 652 aa. Probable narX, nitrate reductase (EC 1.7.99.4). Contains three domains: N-terminus (250 aa) is similar to e.g. N-terminus of NARG_ECOLI|P09152 respiratory nitrate reductase 1 alpha chain from Escherichia coli (1246 aa), FASTA scores: E(): 0, (58.6% identity in 251 aa overlap); and Rv1161|MTCI65.28|NARG PROBABLE RESPIRATORY NITRATE REDUCTASE (ALPHA CHAIN) from Mycobacterium tuberculosis (1232 aa). Central region (260-410 aa) is similar to Rv1163|O06561|NARJ PROBABLE RESPIRATORY NITRATE REDUCTASE (DELTA CHAIN) from Mycobacterium tuberculosis (201 aa), FASTA scores: E(): 0, (64.2% identity in 159 aa overlap). C-terminus (420 aa-) is similar to Rv1164|O06562|NARI PROBABLE RESPIRATORY NITRATE REDUCTASE (GAMMA CHAIN) from Mycobacterium tuberculosis (246 aa), FASTA scores: E(): 0, (68.6% identity in 239 aa overlap). Contains PS00551 Prokaryotic molybdopterin oxidoreductases signature 1." /codon_start=1 /transl_table=11 /product="nitrate reductase NarX" /protein_id="NP_216252.1" /db_xref="GI:15608874" /db_xref="GOA:P71994" /db_xref="UniProtKB/TrEMBL:P71994" /db_xref="GeneID:885213" /translation="MTVTPRTGSRIEELLARSGRFFIPGEISADLRTVTRRGGRDGDV FYRDRWSHDKVVRSTHGVNCTGSCSWKIYVKDDIITWETQETDYPSVGPDRPEYEPRG CPRGAAFSWYTYSPTRVRHPYARGVLVEMYREAKARLGDPVAAWADIQADPRRRRRYQ RARGKGGLVRVSWAEATEMIAAAHVHTISTYGPDRVAGFSPIPAMSMVSHAAGSRFVE LIGGVMTSFYDWYADLPVASPQVFGDQTDVPESGDWWDVVWQCASVLLTYPNSRQLGT AEELLAHIDGPAADLLGRTVSELRRADPLTAATRYVDTFDLRGRATLYLTYWTAGDTR NRGREMLAFAQTYRSTDVAPPRGETPDFLPVVLEFAATVDPEAGRRLLSGYRVPIAAL CNALTEAALPYAHTVAAVCRTGDMMGELFWTVVPYVTMTIVAVGSWWRYRYDKFGWTT RSSQLYESRLLRIASPMFHFGILVVIVGHGIGLVIPQSWTQAAGLSEGAYHVQAVVLG SIAGITTLAGVTLLIYRRRTRGPVFMATTVNDKVMYLVLVAAIVAGLGATALGSGVVG EAYNYRETVSVWFRSVWVLQPRGDLMAEAPLYYQIHVLIGLALFALWPFTRLVHAFSA PIGYLFRPYIIYRSREELVLTRPRRRGW" misc_feature complement(1963956..1964015) /gene="narX" /locus_tag="Rv1736c" /note="PS00551 Prokaryotic molybdopterin oxidoreductases signature 1" gene complement(1964183..1965370) /gene="narK2" /locus_tag="Rv1737c" /db_xref="GeneID:885231" CDS complement(1964183..1965370) /gene="narK2" /locus_tag="Rv1737c" /function="INVOLVED IN EXCRETION OF NITRITE, PRODUCED BY THE DISSIMILATORY REDUCTION OF NITRATE, ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv1737c, (MTCY04C12.22c), len: 395 aa. Possible narK2, nitrate/nitrite-transport integral membrane protein (see Hutter & Dick 2000), possibly member of major facilitator superfamily (MFS), similar to P46907|NARK_BACSU nitrite extrusion protein from Bacillus subtilis (395 aa), FASTA scores: opt: 742, E(): 0, (33.6% identity in 375 aa overlap); and to AL109989|SCJ12.23 hypothetical nitrate/nitrite transporter from Streptomyces coelicolor (412 aa), FASTA scores: opt: 1181, E(): 0, (49.4% identity in 389 aa overlap)." /codon_start=1 /transl_table=11 /product="nitrate/nitrite transporter NarK2" /protein_id="NP_216253.1" /db_xref="GI:15608875" /db_xref="GOA:P71995" /db_xref="UniProtKB/TrEMBL:P71995" /db_xref="GeneID:885231" /translation="MRGQAANLVLATWISVVNFWAWNLIGPLSTSYARDMSLSSAEAS LLVATPILVGALGRIVTGPLTDRFGGRAMLIAVTLASILPVLAVGVAATMGSYALLVF FGLFLGVAGTIFAVGIPFANNWYQPARRGFSTGVFGMGMVGTALSAFFTPRFVRWFGL FTTHAIVAAALASTAVVAMVVLRDAPYFRPNADPVLPRLKAAARLPVTWEMSFLYAIV FGGFVAFSNYLPTYITTIYGFSTVDAGARTAGFALAAVLARPVGGWLSDRIAPRHVVL ASLAGTALLAFAAALQPPPEVWSAATFITLAVCLGVGTGGVFAWVARRAPAASVGSVT GIVAAAGGLGGYFPPLVMGATYDPVDNDYTVGLLLLVATALVACTYTALHAREPVSEE ASR" gene 1965657..1965941 /locus_tag="Rv1738" /db_xref="GeneID:885215" CDS 1965657..1965941 /locus_tag="Rv1738" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1738, (MTCY04C12.23), len: 94 aa. Conserved hypothetical protein, similar to P71931|Rv2632c|YQ32_MYCTU Hypothetical 10.1 kDa protein from Mycobacterium tuberculosis (93 aa), FASTA scores: opt: 319, E(): 2.6e-27, (53.9% identity in 89 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216254.1" /db_xref="GI:15608876" /db_xref="UniProtKB/Swiss-Prot:P64887" /db_xref="GeneID:885215" /translation="MCGDQSDHVLQHWTVDISIDEHEGLTRAKARLRWREKELVGVGL ARLNPADRNVPEIGDELSVARALSDLGKRMLKVSTHDIEAVTHQPARLLY" gene complement(1965955..1967637) /locus_tag="Rv1739c" /db_xref="GeneID:887208" CDS complement(1965955..1967637) /locus_tag="Rv1739c" /function="INVOLVED IN SULPHATE TRANSPORT ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv1739c, (MTCY04C12.24c, MTCY28.01), len: 560 aa. Probable sulphate-transport transmembrane protein ABC transporter, similar to several e.g. P53392|G607186 high affinity sulphate transporter from Stylosanthes hamata (662 aa), FASTA scores: opt: 382, E(): 1.6e-16, (28.0% identity in 564 aa overlap); U59234.1|AAB88215.1 biotin carb. from Synechococcus sp. PCC 7942 (574 aa), FASTA scores: opt: 1838, E(): 0, (50.0% identity in 550 aa overlap); etc. Contains PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS), AND SEEMS TO BELONG TO THE SULP FAMILY." /codon_start=1 /transl_table=11 /product="sulphate-transport transmembrane protein ABC transporter" /protein_id="NP_216255.1" /db_xref="GI:15608877" /db_xref="GOA:P71997" /db_xref="UniProtKB/TrEMBL:P71997" /db_xref="GeneID:887208" /translation="MIPTMTSAGWAPGVVQFREYQRRWLRGDVLAGLTVAAYLIPQAM AYATVAGLPPAAGLWASIAPLAIYALLGSSRQLSIGPESATALMTAAVLAPMAAGDLR RYAVLAATLGLLVGLICLLAGTARLGFLASLRSRPVLVGYMAGIALVMISSQLGTITG TSVEGNEFFSEVHSFATSVTRVHWPTFVLAMSVLALLTMLTRWAPRAPGPIIAVLAAT MLVAVMSLDAKGIAIVGRIPSGLPTPGVPPVSVEDLRALIIPAAGIAIVTFTDGVLTA RAFAARRGQEVNANAELRAVGACNIAAGLTHGFPVSSSSSRTALADVVGGRTQLYSLI ALGLVVIVMVFASGLLAMFPIAALGALVVYAALRLIDLSEFRRLARFRRSELMLALAT TAAVLGLGVFYGVLAAVALSILELLRRVAHPHDSVLGFVPGIAGMHDIDDYPQAKRVP GLVVYRYDAPLCFANAEDFRRRALTVVDQDPGQVEWFVLNAESNVEVDLTALDALDQL RTELLRRGIVFAMARVKQDLRESLRAASLLDKIGEDHIFMTLPTAVQAFRRR" misc_feature complement(1967308..1967352) /locus_tag="Rv1739c" /note="PS00211 ABC transporters family signature" gene 1967705..1967917 /locus_tag="Rv1740" /db_xref="GeneID:885226" CDS 1967705..1967917 /locus_tag="Rv1740" /function="UNKNOWN" /note="Rv1740, (MTCY28.02-MTCY04C12.25), len: 70 aa. Conserved hypothetical protein, highly similar to other Mycobacterium tuberculosis hypothetical proteins e.g. P96913|Rv0623|MTCY20H10.04 (84 aa), (73.5% identity in 68 aa overlap); P71998|Rv1740 (70 aa), and O07770|Rv0608 (81 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216256.1" /db_xref="GI:15608878" /db_xref="UniProtKB/TrEMBL:P71998" /db_xref="GeneID:885226" /translation="MELAARMGETLTQAVVVAVREQLARRTGRTRSISLREELAAIGR RCAALPVLDTRAADTILGYDERGLPA" gene 1967917..1968165 /locus_tag="Rv1741" /db_xref="GeneID:887165" CDS 1967917..1968165 /locus_tag="Rv1741" /function="UNKNOWN" /note="Rv1741, (MTCY28.03,MTCY04C12.26), len: 82 aa. Conserved hypothetical protein, very similar in N-terminus to other Mycobacterium tuberculosis hypothetical proteins e.g. P96914|Rv0624|MTCY20H10.05 (131 aa), (80.4% identity in 56 aa overlap); P71999|Rv1741 (82 aa) and O07769|Rv0609 (133 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216257.1" /db_xref="GI:15608879" /db_xref="UniProtKB/TrEMBL:P71999" /db_xref="GeneID:887165" /translation="MVIDTSALVAMLNDEPEAQRFEIAVAADHVWLMSTASYPEMATV IETRFGEPGGREPKVSGQPLLYKGDDFACIDIRAVLAG" gene 1968173..1968910 /locus_tag="Rv1742" /db_xref="GeneID:885234" CDS 1968173..1968910 /locus_tag="Rv1742" /function="UNKNOWN" /note="Rv1742, (MTCY28.04,MTCY04C12.27), len: 245 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216258.1" /db_xref="GI:15608880" /db_xref="UniProtKB/TrEMBL:O33271" /db_xref="GeneID:885234" /translation="MSALLDGVLDAHGGLQRWRAAETVHGRVRTGGLLLRTRVPGNRF ADYRITVHVQQARTVLDPFPRDGYRGVFESGQVRIESHDGAVISSRAHPRAAFFGRSG LRRNIRWDPLDSVYFAGYAMWNYLTTPYLLTREGVAVEEGAPWQQEGETWRRLIVSFP PDIDTHSPRQTFYVDASGLLRRHDYVPEVVGHWARAAHYCADPVDVDGFVFPTCRWVH PIGPGNRSLPFPTLVSILLTDIRVETD" gene 1969004..1970704 /gene="pknE" /locus_tag="Rv1743" /db_xref="GeneID:885284" CDS 1969004..1970704 /gene="pknE" /locus_tag="Rv1743" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO BE INVOLVED IN MEMBRANE TRANSPORT [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /note="Rv1743, (MTCY28.05,MTCY04C12.28), len: 566 aa. Probable pknE, transmembrane serine/threonine protein kinase (EC 2.7.1.-) (see citation below), similar to PKN1_MYXXA|P33973 serine/threonine-protein kinase pkn1 (693 aa), fasta scores: opt: 542, E(): 1.1e-19, (35.8% identity in 302 aa overlap). Also highly similar to K08G_MYCTU|Q11053 probable serine/threonine-protein kinase (626 aa) (59.8% identity in 381 aa overlap). Contains PS00107 Protein kinases ATP-binding region signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase E" /protein_id="NP_216259.1" /db_xref="GI:15608881" /db_xref="GOA:P72001" /db_xref="UniProtKB/Swiss-Prot:P72001" /db_xref="GeneID:885284" /translation="MDGTAESREGTQFGPYRLRRLVGRGGMGDVYEAEDTVRERIVAL KLMSETLSSDPVFRTRMQREARTAGRLQEPHVVPIHDFGEIDGQLYVDMRLINGVDLA AMLRRQGPLAPPRAVAIVRQIGSALDAAHAAGATHRDVKPENILVSADDFAYLVDFGI ASATTDEKLTQLGNTVGTLYYMAPERFSESHATYRADIYALTCVLYECLTGSPPYQGD QLSVMGAHINQAIPRPSTVRPGIPVAFDAVIARGMAKNPEDRYVTCGDLSAAAHAALA TADQDRATDILRRSQVAKLPVPSTHPVSPGTRWPQPTPWAGGAPPWGPPSSPLPRSAR QPWLWVGVAVAVVVALAGGLGIALAHPWRSSGPRTSAPPPPPPADAVELRVLNDGVFV GSSVAPTTIDIFNEPICPPCGSFIRSYASDIDTAVADKQLAVRYHLLNFLDDQSHSKN YSTRAVAASYCVAGQNDPKLYASFYSALFGSDFQPQENAASDRTDAELAHLAQTVGAE PTAISCIKSGADLGTAQTKATNASETLAGFNASGTPFVWDGSMVVNYQDPSWLARLIG" misc_feature 1969067..1969138 /gene="pknE" /locus_tag="Rv1743" /note="PS00107 Protein kinases ATP-binding region signature" gene complement(1970989..1971390) /locus_tag="Rv1744c" /db_xref="GeneID:885235" CDS complement(1970989..1971390) /locus_tag="Rv1744c" /function="UNKNOWN" /note="Rv1744c, (MTCY28.06c), len: 133 aa. Probable membrane protein, contains four imperfect 10 aa repeats, some similarity to Q25946 (MSA-2) (FRAGMENT) from Plasmodium falciparum (205 aa), FASTA scores: opt: 145, E( ): 0.048, (52.4% identity in 63 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216260.1" /db_xref="GI:15608882" /db_xref="UniProtKB/TrEMBL:O06787" /db_xref="GeneID:885235" /translation="MVINRSIASIDSIAVAGSAATTGAVAVAGSVATAGSVAVAGSVA TAGSVAIAGAAATAGSVGIIGSLLTVLCVAVRQCVACLACITCTRCVACIGCVRCTDC VGCLWCVNCSGLRNVVGARNLRVGNLGRVSN" gene complement(1971380..1971991) /gene="idi" /locus_tag="Rv1745c" /db_xref="GeneID:885309" CDS complement(1971380..1971991) /gene="idi" /locus_tag="Rv1745c" /EC_number="5.3.3.2" /function="CATALYZES THE 1,3-ALLYLIC REARRANGEMENT OF THE HOMOALLYLIC SUBSTRATE ISOPENTEN TO ITS ALLYLIC ISOMER, DIMETHYLALLYL DIPHOSPHATE (DMAPP) [CATALYTIC ACTIVITY :ISOPENTENYL DIPHOSPHATE = DIMETHYLALLYL DIPHOSPHATE]" /note="catalyzes the rearrangement of isopentenyl diphosphate to dimethylallyl phosphate" /codon_start=1 /transl_table=11 /product="isopentenyl-diphosphate delta-isomerase" /protein_id="NP_216261.1" /db_xref="GI:15608883" /db_xref="GOA:P72002" /db_xref="UniProtKB/Swiss-Prot:P72002" /db_xref="GeneID:885309" /translation="MTRSYRPAPPIERVVLLNDRGDATGVADKATVHTGDTPLHLAFS SYVFDLHDQLLITRRAATKRTWPAVWTNSCCGHPLPGESLPGAIRRRLAAELGLTPDR VDLILPGFRYRAAMADGTVENEICPVYRVQVDQQPRPNSDEVDAIRWLSWEQFVRDVT AGVIAPVSPWCRSQLGYLTKLGPCPAQWPVADDCRLPKAAHGN" gene 1972138..1973568 /gene="pknF" /locus_tag="Rv1746" /db_xref="GeneID:885275" CDS 1972138..1973568 /gene="pknF" /locus_tag="Rv1746" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO BE INVOLVED IN MEMBRANE TRANSPORT. PHOSPHORYLATES THE PEPTIDE SUBSTRATE MYELIN BASIC PROTEIN (MBP) AT SERINE AND THREONINE RESIDUES [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv1746, (MTCY28.09, MTCY04C12.30), len: 476 aa. pknF, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citations below), highly similar to KY28_MYCTU|Q10697 probable serine/threonine-protein kinase from Mycobacterium tuberculosis (589 aa), FASTA scores: opt: 870, E(): 0, (41.6% identity in 406 aa overlap). Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation. Start site chosen by homology, may extend further upstream." /codon_start=1 /transl_table=11 /product="anchored-membrane serine/threonine-protein kinase PKNF (protein kinase F) (STPK F)" /protein_id="NP_216262.1" /db_xref="GI:15608884" /db_xref="GOA:P72003" /db_xref="UniProtKB/Swiss-Prot:P72003" /db_xref="GeneID:885275" /translation="MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLR ADVSADGEYRARFNREADAAASLWHPHIVAVHDRGEFDGQLWIDMDFVDGTDTVSLLR DRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDSPDRRIMLADF GIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRADQYALAATAFHLLTGSPPF QHANPAVVISQHLSASPPAIGDRVPELTPLDPVFAKALAKQPKDRYQRCVDFARALGH RLGGAGDPDDTRVSQPVAVAAPAKRSLLRTAVIVPAVLAMLLVMAVAVAVREFQRADD ERAAQPARTRTTTSAGTTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALC FPLGSTGTTKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESP IRVCMQQTGQTRRECREEIRRSNGWP" misc_feature 1972534..1972572 /gene="pknF" /locus_tag="Rv1746" /note="PS00108 Serine/Threonine protein kinases active-site signature" gene 1973630..1976227 /locus_tag="Rv1747" /db_xref="GeneID:885311" CDS 1973630..1976227 /locus_tag="Rv1747" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY LIPOOLIGOSACCHARIDE) ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1747, (MTCY28.10, MTCY04C12.31), len: 865 aa. Probable conserved transmembrane ATP-binding protein ABC transporter (see citation below), similar to others e.g Q55956 ABC transporter from Synechocystis sp. (790 aa), FASTA scores: opt: 738, E(): 6.3e-26, (31.6% identity in 632 aa overlap); etc. Also similar to other M. tuberculosis ABC-type transporters e.g. Rv2397c|MTCY253.24, FASTA score: (35.2% identity in 213 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="transmembrane ATP-binding protein ABC transporter" /protein_id="NP_216263.1" /db_xref="GI:15608885" /db_xref="GOA:O65934" /db_xref="UniProtKB/TrEMBL:O65934" /db_xref="GeneID:885311" /translation="MPMSQPAAPPVLTVRYEGSERTFAAGHDVVVGRDLRADVRVAHP LISRAHLLLRFDQGRWVAIDNGSLNGLYLNNRRVPVVDIYDAQRVHIGNPDGPALDFE VGRHRGSAGRPPQTTSIRLPNLSAGAWPTDGPPQTGTLGSGQLQQLPPATTRIPAAPP SGPQPRYPTGGQQLWPPSGPQRAPQIYRPPTAAPPPAGARGGTEAGNLATSMMKILRP GRLTGELPPGAVRIGRANDNDIVIPEVLASRHHATLVPTPGGTEIRDNRSINGTFVNG ARVDAALLHDGDVVTIGNIDLVFADGTLARREENLLETRVGGLDVRGVTWTIDGDKTL LDGISLTARPGMLTAVIGPSGAGKSTLARLVAGYTHPTDGTVTFEGHNVHAEYASLRS RIGMVPQDDVVHGQLTVKHALMYAAELRLPPDTTKDDRTQVVARVLEELEMSKHIDTR VDKLSGGQRKRASVALELLTGPSLLILDEPTSGLDPALDRQVMTMLRQLADAGRVVLV VTHSLTYLDVCDQVLLLAPGGKTAFCGPPTQIGPVMGTTNWADIFSTVADDPDAAKAR YLARTGPTPPPPPVEQPAELGDPAHTSLFRQFSTIARRQLRLIVSDRGYFVFLALLPF IMGALSMSVPGDVGFGFPNPMGDAPNEPGQILVLLNVGAVFMGTALTIRDLIGERAIF RREQAVGLSTTAYLIAKVCVYTVLAVVQSAIVTVIVLVGKGGPTQGAVALSKPDLELF VDVAVTCVASAMLGLALSAIAKSNEQIMPLLVVAVMSQLVFSGGMIPVTGRVPLDQMS WVTPARWGFAASAATVDLIKLVPGPLTPKDSHWHHTASAWWFDMAMLVALSVIYVGFV RWKIRLKAC" misc_feature 1974683..1974706 /locus_tag="Rv1747" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 1974989..1975033 /locus_tag="Rv1747" /note="PS00211 ABC transporters family signature" gene 1976600..1977331 /locus_tag="Rv1748" /db_xref="GeneID:885288" CDS 1976600..1977331 /locus_tag="Rv1748" /function="UNKNOWN" /note="Rv1748, (MTCY28.11, MTCY04C12.32), len: 243 aa. Hypothetical unknown protein. Possibly exported protein, hydrophobic domain, TM helix aa 23-45." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216264.1" /db_xref="GI:15608886" /db_xref="UniProtKB/TrEMBL:P72005" /db_xref="GeneID:885288" /translation="MPGGVCSGRPWGRPWWHPGLVGLLIRLAELLVVMLPLIGVLYVG IKALSSFTRRLGEASGDLASDSPAMPRPTTVENDAARWRAITRAVEAHERTDARWLEY ELDAAKLLDFPVMTDMRDPLTTAFHKAKLQADFHKPLRAEDLLDDPDAAGHYLDAVRD YVTAFDTAEAEAMRRRRTGFSREEQQRLARAQSLLRVASDAGATAQERERAYRLARTE LDGLIVLPDRTRAGIERGIAGELDD" gene complement(1977328..1977885) /locus_tag="Rv1749c" /db_xref="GeneID:885322" CDS complement(1977328..1977885) /locus_tag="Rv1749c" /function="UNKNOWN" /note="Rv1749c, (MTCY28.12c-MTCY04C12.33c), len: 185 aa. Possible integral membrane protein, similar to O27914|AE000940 hypothetical protein MTH1892 from Methanobacterium thermoautotrophicum (168 aa), fasta scores: E(): 9.3e-16, (37.4% identity in 123 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216265.1" /db_xref="GI:15608887" /db_xref="UniProtKB/TrEMBL:O65935" /db_xref="GeneID:885322" /translation="MLRAVNEIRQHDGTLKLGKGVGMFTIVGVIVALIGAFVQSRRHR HRPAADIHMLWWMVLIVGVVSIIGAGYHVFDGERTAELIGYTRGDGGFQWENAMGDLA IGVVGLMAYRFRGHFWLATIVVLTIQYVGDAAGHIYYWVVENNTNPYNIGVPLWTDIL LPIVMWALYAWSWHSNGDAVPKGQP" gene complement(1977969..1979567) /gene="fadD1" /locus_tag="Rv1750c" /db_xref="GeneID:885310" CDS complement(1977969..1979567) /gene="fadD1" /locus_tag="Rv1750c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_216266.1" /db_xref="GI:15608888" /db_xref="GOA:P72007" /db_xref="UniProtKB/TrEMBL:P72007" /db_xref="GeneID:885310" /translation="MTDTIQSLLRQHVSDPTIAVKYGGLQWTWSQYLAESAARAAALI TIADPQRPTHIGSLLGNTPEMLAQLAAAGLGGYVLCGLNTTRRGDALAADVRRADCQI VVTDADHRALLDGLDLAGARILDTSTPRWAELVAGDGAFVPYREVDTMDPFMMIFTSG TSGNPKAVPVSHLMATFAGRSLTERFGLTEQDTCYVSMPLFHSNAVVAGWAPAVVSGA AIAPATFSATGFLDDVRRYHATYMNYVGKPLAYILATPERDDDADNPLRVAFGNEAND KDIEEFSRRFGVQVEDGFGSTENAVIVIREPGTPPGSIGRGAHGVAVYNGETVTECAV ARFDAHGALTNADEAIGELVNTTGSGFFTGYYNDPEANAERMRHGMYWSGDLAYRDSE GWIYLAGRTADWMRVDGENLTAAPIERILLRYKAINRVAVYAVPDEYVGDQVMAALVL RAGDTFDPDAFEAFLDAQPDLSTKARPRYIRIAADLPSTATHKVLKRQLIDEGTAVGK ADTLWVREPRGSAYHHASGPAKAI" misc_feature complement(1979070..1979105) /gene="fadD1" /locus_tag="Rv1750c" /note="PS00455 Putative AMP-binding domain signature" gene 1979621..1981003 /locus_tag="Rv1751" /db_xref="GeneID:885489" CDS 1979621..1981003 /locus_tag="Rv1751" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1751, (MTCY28.14-MTCY04C12.35), len: 460 aa. Probable oxidoreductase (EC 1.-.-.-), possibly a monooxygenase or hydroxylase, similar to MHPA_ECOLI|P77397 3-(3-hydroxy-phenyl) propionate hydroxylase (554 aa), FASTA scores: opt: 239, E(): 2e-08, (24.6% identity in 435 aa overlap); and AJ007932|SAR7932.13 oxygenase from Streptomyces argillaceus (436 aa), FASTA scores: opt: 587, E(): 8.6e-30, (32.3% identity in 359 aa overlap). Contains PS00075 Dihydrofolate reductase signature. Also similar to Mycobacterium tuberculosis hypothetical oxidoreductases Rv1260 and Rv0575c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216267.1" /db_xref="GI:15608889" /db_xref="GOA:O65936" /db_xref="UniProtKB/TrEMBL:O65936" /db_xref="GeneID:885489" /translation="MIATMPSMARRSRHDNKITTPAVDCLTIERLDSPASGAPQVTPY ARALMGETTTCAIIGGGPAGMVLGLLLARAGVQVTLLEKHGDFLRDFRGDTVHPTTMR LLDELGLWERFAALPYSEVRTATLHSNGRAVTYIDFERLHQPYPYVAMVPQWDLLNLL AEAAQAEPSFTLRMKTEVTGLLREGGKVTGVRYQGAEGPGELRAELTVACDGRWSIAR HEAGLKAREFPVNFDVWWFKLPREGDAEFSFLPRFSPGKGLGVIPREGYFQIAYLGPK GTDAQLRERGIEEFRRDVSELLPEATASVAALASMDEVKHLNVKVNRLRRWHIDGLLC IGDAAHAMSPVAGVGINLAVQDAVAAATILAEPLREHRVSSRHLAAVRRRRAFPTAVT QAVQRVLHRRLLGPLLQGRDPTPPAALLGLVERLPWLSAVPAYFVGVGVRPEHAPAFA RRGPGNRKGP" misc_feature 1980878..1980904 /locus_tag="Rv1751" /note="PS00075 Dihydrofolate reductase signature" gene 1981130..1981579 /locus_tag="Rv1752" /db_xref="GeneID:885313" CDS 1981130..1981579 /locus_tag="Rv1752" /function="UNKNOWN" /note="Rv1752, (MTCY28.15), len: 149 aa. Conserved hypothetical protein, similar to C-terminal half of Q9TV68|AB021930|CAN2DD Dihydrodiol dehydrogenase (EC 1.3.1.20) from Canis familiaris (335 aa), FASTA score, opt: 168, E(): 0.00015, (31.3% identity in 112 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216268.1" /db_xref="GI:15608890" /db_xref="UniProtKB/TrEMBL:O06789" /db_xref="GeneID:885313" /translation="MDAGCYAVHMAHTFGGATPEVVSAQAKLRDPAVDRAMTAELKFP GGHTGGIRCSMRSSDLLNVSARVVGDRGELRVLNPVVPQLFHRLPPLACVSARRFRCR SAARASGQDDAQGRGREHERDPRDLSGRRAPIAQPELNMVAASGSAA" gene complement(1981614..1984775) /gene="PPE24" /locus_tag="Rv1753c" /db_xref="GeneID:885544" CDS complement(1981614..1984775) /gene="PPE24" /locus_tag="Rv1753c" /function="UNKNOWN" /note="Rv1753c, (MTCY28.16c), len: 1053 aa. Member of the Mycobacterium tuberculosis PPE family of Gly-, Asn-rich proteins, similar to many e.g. YF48_MYCTU|Q10778 hypothetical protein cy48.17 (678 aa), FASTA scores: opt: 1360, E(): 0, (48.9% identity in 550 aa overlap). Note that the Gly-, Asn-rich sequence is interrupted by six near-perfect 26 aa repeats, a unique region, and another, more degenerate region of five 25 aa repeats before resuming at the C-terminus. The end of the first Gly-, Asn- rich region and the start of the first set of repeats shows some similarity to Q50577|AT10S from Mycobacterium tuberculosis (170 aa) (40.2% identity in 189 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177830.1" /db_xref="GI:57116904" /db_xref="UniProtKB/TrEMBL:Q79FL2" /db_xref="GeneID:885544" /translation="MNFSVLPPEINSALIFAGAGPEPMAAAATAWDGLAMELASAAAS FGSVTSGLVGGAWQGASSSAMAAAAAPYAAWLAAAAVQAEQTAAQAAAMIAEFEAVKT AVVQPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYHAGASA IASALSPFSKPLQNLAGLPAWLASGAPAAAMTAAAGIPALAGGPTAINLGIANVGGGN VGNANNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVGVGNLGN LNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFF NSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMG DFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRGVGQ GSLQFSITTPDLTLPPLQIPGISVPAFSLPAITLPSLNIPAATTPANITVGAFSLPGL TLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLN IPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATT PANITVSGFQLPPLSIPSVAIPPVTVPPITVGAFNLPPLQIPEVTIPQLTIPAGITIG GFSLPAIHTQPITVGQIGVGQFGLPSIGWDVFLSTPRITVPAFGIPFTLQFQTNVPAL QPPGGGLSTFTNGALIFGEFDLPQLVVHPYTLTGPIVIGSFFLPAFNIPGIDVPAINV DGFTLPQITTPAITTPEFAIPPIGVGGFTLPQITTQEIITPELTINSIGVGGFTLPQI TTPPITTPPLTIDPINLTGFTLPQITTPPITTPPLTIDPINLTGFTLPQITTPPITTP PLTIEPIGVGGFTTPPLTVPGIHLPSTTIGAFAIPGGPGYFNSSTAPSSGFFNSGAGG NSGFGNNGSGLSGWFNTNPAGLLGGSGYQNFGGLSSGFSNLGSGVSGFANRGILPFSV ASVVSGFANIGTNLAGFFQGTTS" repeat_region complement(1982887..1982964) /note="78 bp imperfect direct repeat 6, CGGCGCCTTCAGCCTGCCCGGGTTGACGTTGCCGTCGTTGAACATCCCGGCCGCCAC CACACCCGCCAACATCACCGT" repeat_region complement(1982965..1983042) /note="78 bp imperfect direct repeat 5, CGGCGCCTTCAGCCTGCCCGGGTTGACGTTGCCGTCGTTGAACATCCCGGCCGCCAC CACACCAGCCAACATCACCGT" repeat_region complement(1983043..1983120) /note="78 bp imperfect direct repeat 4, CGGCGCCTTCAGCCTGCCCGGGTTGACGTTGCCGTCGTTGAACATCCCGGCCGCCAC CACACCAGCCAACATCACCGT" repeat_region complement(1983121..1983198) /note="78 bp imperfect direct repeat 3, GGGTGCCTTCAGCCTGCCCGGGTTGACGTTGCCGTCGTTGAACATCCCGGCCGCCAC CACACCAGCCAACATCACCGT" repeat_region complement(1983199..1983276) /note="78 bp imperfect direct repeat 2, CGGCGCCTTCAGCCTGCCCGGGTTGACGTTGCCGTCGTTGAACATCCCGGCCGCCAC CACACCAGCCAACATCACCGT" repeat_region complement(1983277..1983354) /note="78 bp imperfect direct repeat 1, TCCCGCCTTCAGTCTGCCGGCAATAACGCTGCCGTCGCTGAACATCCCGGCCGCCAC CACACCGGCCAACATCACCGT" gene complement(1984979..1986670) /locus_tag="Rv1754c" /db_xref="GeneID:885467" CDS complement(1984979..1986670) /locus_tag="Rv1754c" /function="UNKNOWN" /note="Rv1754c, (MTCY28.17c), len: 563 aa. Conserved hypothetical protein, has proline-rich central region. Some similarity in central region to other Mycobacterium tuberculosis proline-rich proteins e.g. O06555|Rv1157c|MTCI65.24c (371 aa), (32.5% identity in 191 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216270.1" /db_xref="GI:15608892" /db_xref="UniProtKB/TrEMBL:O06790" /db_xref="GeneID:885467" /translation="MYRYQVRVQQRRSEMNRWVATRSRRHTYQWITDHKSPRDHYRHI SELRTSIATSSPGRCDMSPIPRIVSVSLAWAAAIGLMVPIGLAPPAMAAPCSGDAANA PPPPSAIVTDPGATALGPVRPGHGPIPTGRKPRGANDRAPLPKLGPLISALLNPGARN AAPLQQQALVPRANPGPNPAPNPPATGPQPPNATQLTPNPAPAPDPAPAAAPDPGATL AGATTSLAEWVTGPDSPNKTLERFGISGTDLGIPWDNGDPANRQVLMIFGDTFGYCAV DGHQWRYNTLFRSQDRDLGNGVHVTSGDASNRYSGSPVRQPGFSKQLINSIKWARDET GIIPTAGIAVGKTQYVNFMSIRNWGRDGEWTTNYSGIAVSKDNGQTWGVFPGTIRASG PDSGGKARFVPGNENFQMGAYLKSNDGYLYSFGTPPGRGGSAYLARVPQRFVPDLTKY QYWNGDSNSWVPNKPDAATPVIPGPVGEMSVQYNTYLKQYLALYTNGMNDVVARTAPA PQGPWSAEQMLVSSWQMPGGIYAPMMHPWSTGKDVYFNLSLWSAYNVMLMHTVLP" misc_feature complement(1985630..1985653) /locus_tag="Rv1754c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(1986854..1987696) /gene="plcD" /locus_tag="Rv1755c" /db_xref="GeneID:885566" CDS complement(1986854..>1987696) /gene="plcD" /locus_tag="Rv1755c" /EC_number="3.1.4.3" /function="HYDROLYZES SPHINGOMYELIN IN ADDITION TO PHOSPHATIDYLCHOLINE. PROBABLE VIRULENCE FACTOR IMPLICATED IN THE PATHOGENESIS OF M.TUBERCULOSIS AT THE LEVEL OF INTRACELLULAR SURVIVAL, BY THE ALTERATION OF CELL SIGNALING EVENTS OR BY DIRECT CYTOTOXICITY [CATALYTIC ACTIVITY: A PHOSPHATIDYLCHOLINE + H(2)O = 1,2- DIACYLGLYCEROL + CHOLINE PHOSPHATE]." /note="Rv1755c, (MT1799, MTCY28.21c), len: 280 aa. Probable plcD, phospholipase C 4 (fragment) (EC 3.1.4.3) (see citations below), highly similar to C-terminus of other phospholipases e.g. CQ50771|Rv2351c|PLCA|MTP40|MT2416|MTCY98.20c phospholipase C 1 from Mycobacterium tuberculosis (512 aa), FASTA score: (71.1% identity in 284 aa overlap); etc. Note that this ORF has been interrupted by insertion of IS6110 element. BELONGS TO THE BACTERIAL PHOSPHOLIPASE C FAMILY." /codon_start=1 /transl_table=11 /product="phospholipase C 4 PLCD" /protein_id="NP_216271.1" /db_xref="GI:15608893" /db_xref="GOA:Q9XB13" /db_xref="UniProtKB/Swiss-Prot:Q9XB13" /db_xref="GeneID:885566" /translation="DAGVSWKVYRNKTLGPISSVLTYGSLVTSFKQSADPRSDLVRFG VAPSYPASFAADVLANRLPRVSWVIPNVLESEHPAVPAAAGAFAIVNILRILLANPAV WEKTALIVSYDENGGFFDHVVPATAPAGTPGEYVTVPDIDQVPGSGGIRGPIGLGFRV PCFVISPYSRGPQMVHDTFDHTSQLRLLETRFGVPVPNLTAWRRSVTGDMTSTFNFAV PPNSSWPNLDYPGLHALSTVPQCVPNAALGTINRGIPYRVPDPQIMPTQETTPTRGIP SGPC" repeat_region complement(1987703..1989057) /note="IS6110-3, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-3" repeat_region 1987703..1987730 /note="28 bp inverted repeat at the left end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC." gene complement(1987745..1988629) /locus_tag="Rv1756c" /db_xref="GeneID:885541" CDS complement(1987745..1988629) /locus_tag="Rv1756c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv1756c, (MTCY28.22c), len: 294 aa. Putative Transposase subunit for IS6110." /codon_start=1 /transl_table=11 /product="putative transposase" /protein_id="NP_216272.1" /db_xref="GI:15608894" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:885541" /translation="MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKE HISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIAD PATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVAST MATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGA VGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVP PVELEAAYYAQRQRPAAG" gene complement(1988680..1989006) /locus_tag="Rv1757c" /db_xref="GeneID:885558" CDS complement(1988680..1989006) /locus_tag="Rv1757c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv1757c, (MTCY28.23c), len: 108 aa. Putative Transposase subunit for IS6110" /codon_start=1 /transl_table=11 /product="putative transposase" /protein_id="NP_216273.1" /db_xref="GI:15608895" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:885558" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region complement(1989030..1989057) /note="28 bp inverted repeat at the right end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 1989042..1989566 /gene="cut1" /locus_tag="Rv1758" /db_xref="GeneID:885552" CDS 1989042..1989566 /gene="cut1" /locus_tag="Rv1758" /EC_number="3.1.1.-" /function="HYDROLYSIS OF CUTIN." /note="Rv1758, (MTCY28.24), len: 174 aa. Probable cut1, serine esterase, cutinase family (EC 3.1.1.-), similar to Rv2301|CUT2_MYCTU|Q50664 probable cutinase cy339.08c precursor from Mycobacterium tuberculosis (219 aa), FASTA scores: opt: 369, E(): 1. 1e-16, (39.1% identity in 179 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical cutinases Rv3452, Rv1984c, Rv3451 and Rv3724. CDS has been interrupted by IS6110 insertion element and 5'-end deleted. BELONGS TO THE CUTINASE FAMILY." /codon_start=1 /transl_table=11 /product="cutinase Cut1" /protein_id="NP_216274.1" /db_xref="GI:15608896" /db_xref="GOA:O06793" /db_xref="UniProtKB/TrEMBL:O06793" /db_xref="GeneID:885552" /translation="MPGRFREDFIDALRSKIGEKSMGVYGVDYPATTDFPTAMAGIYD AGTHVEQTAANCPQSKLVLGGFSQGAAVMGFVTAAAIPDGAPLDAPRPMPPEVADHVA AVTLFGMPSVAFMHSIGAPPIVIGPLYAEKTIQLCAPGDPVCSSGGNWAAHNGYADDG MVEQAAVFAAGRLG" gene complement(1989833..1992577) /gene="wag22" /locus_tag="Rv1759c" /db_xref="GeneID:885325" CDS complement(1989833..1992577) /gene="wag22" /locus_tag="Rv1759c" /function="UNKNOWN. HAS FIBRONECTIN-BINDING ACTIVITY (COULD THUS MEDIATE BACTERIAL ATTACHMENT TO HOST CELLS). THOUGHT TO BE EXPRESSED DURING INFECTION." /experiment="experimental evidence, no additional details recorded" /note="Rv1759c, (MT1807, MTCY28.25c), len: 914 aa. wag22, antigen member (see citations below) of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, highly similar to others e.g. MT1367|Q10637 hypothetical glycine-rich 49.6 kDa protein from Mycobacterium tuberculosis (603 aa), FASTA scores: opt: 2010, E(): 0, (53.0% identity in 724 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177831.1" /db_xref="GI:57116905" /db_xref="UniProtKB/Swiss-Prot:O06794" /db_xref="GeneID:885325" /translation="MSFVIAVPETIAAAATDLADLGSTIAGANAAAAANTTSLLAAGA DEISAAIAALFGAHGRAYQAASAEAAAFHGRFVQALTTGGGAYAAAEAAAVTPLLNSI NAPVLAATGRPLIGNGANGAPGTGANGGDAGWLIGNGGAGGSGAKGANGGAGGPGGAA GLFGNGGAGGAGGTATANNGIGGAGGAGGSAMLFGAGGAGGAGGAATSLVGGIGGTGG TGGNAGMLAGAAGAGGAGGFSFSTAGGAGGAGGAGGLFTTGGVGGAGGQGHTGGAGGA GGAGGLFGAGGMGGAGGFGDHGTLGTGGAGGDGGGGGLFGAGGDGGAGGSGLTTGGAA GNGGNAGTLSLGAAGGAGGTGGAGGTVFGGGKGGAGGAGGNAGMLFGSGGGGGTGGFG FAAGGQGGVGGSAGMLSGSGGSGGAGGSGGPAGTAAGGAGGAGGAPGLIGNGGNGGNG GESGGTGGVGGAGGNAVLIGNGGEGGIGALAGKSGFGGFGGLLLGADGYNAPESTSPW HNLQQDILSFINEPTEALTGRPLIGNGDSGTPGTGDDGGAGGWLFGNGGNGGAGAAGT NGSAGGAGGAGGILFGTGGAGGAGGVGTAGAGGAGGAGGSAFLIGSGGTGGVGGAATT TGGVGGAGGNAGLLIGAAGLGGCGGGAFTAGVTTGGAGGTGGAAGLFANGGAGGAGGT GSTAGGAGGAGGAGGLYAHGGTGGPGGNGGSTGAGGTGGAGGPGGLYGAGGSGGAGGH GGMAGGGGGVGGNAGSLTLNASGGAGGSGGSSLSGKAGAGGAGGSAGLFYGSGGAGGN GGYSLNGTGGDGGTGGAGQITGLRSGFGGAGGAGGASDTGAGGNGGAGGKAGLYGNGG DGGAGGDGATSGKGGAGGNAVVIGNGGNGGNAGKAGGTAGAGGAGGLVLGRDGQHGLT" gene 1993153..1994661 /locus_tag="Rv1760" /db_xref="GeneID:885556" CDS 1993153..1994661 /locus_tag="Rv1760" /function="UNKNOWN" /note="Rv1760, (MTCY28.26), len: 502 aa. Conserved hypothetical protein, similar to several other Mycobacterium tuberculosis hypothetical proteins e.g. Q10554|Y895_MYCTU|MTCY31.23 (505 aa), FASTA scores: opt: 692, E(): 0, (31.7% identity in 477 aa overlap). Member of family with at least 15 other members e.g. Rv3740c, Rv3734c, Rv1425, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216276.1" /db_xref="GI:15608898" /db_xref="GOA:O06795" /db_xref="UniProtKB/Swiss-Prot:O06795" /db_xref="GeneID:885556" /translation="MPRGCAGARFACNACLNFLAGLGISEPISPGWAAMERLSGLDAF FLYMETPSQPLNVCCVLELDTSTMPGGYTYGRFHAALEKYVKAAPEFRMKLADTELNL DHPVWVDDDNFQIRHHLRRVAMPAPGGRRELAEICGYIAGLPLDRDRPLWEMWVIEGG ARSDTVAVMLKVHHAVVDGVAGANLLSHLCSLQPDAPAPQPVRGTGGGNVLQIAASGL EGFASRPVRLATVVPATVLTLVRTLLRAREGRTMAAPFSAPPTPFNGPLGRLRNIAYT QLDMRDVKRVKDRFGVTINDVVVALCAGALRRFLLEHGVLPEAPLVATVPVSVHDKSD RPGRNQATWMFCRVPSQISDPAQRIRTIAAGNTVAKDHAAAIGPTLLHDWIQFGGSTM FGAAMRILPHISITHSPAYNLILSNVPGPQAQLYFLGCRMDSMFPLGPLLGNAGLNIT VMSLNGELGVGIVSCPDLLPDLWGVADGFPEALKELLECSDDQPEGSNHQDS" gene complement(1994671..1995054) /locus_tag="Rv1761c" /db_xref="GeneID:885436" CDS complement(1994671..1995054) /locus_tag="Rv1761c" /function="UNKNOWN" /note="Rv1761c, (MTCY28.27c), len: 127 aa. Possibly exported protein with hydrophobic stretch or TMhelix at aa 15-37." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216277.1" /db_xref="GI:15608899" /db_xref="UniProtKB/TrEMBL:O06796" /db_xref="GeneID:885436" /translation="MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRG FFRSNPERIQIGDWRYEVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVS RYGATVIPNINAAIEVLGTGTDYRF" gene complement(1995054..1995842) /locus_tag="Rv1762c" /db_xref="GeneID:885373" CDS complement(1995054..1995842) /locus_tag="Rv1762c" /function="UNKNOWN" /note="Rv1762c, (MTCY28.28c), len: 262 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216278.1" /db_xref="GI:15608900" /db_xref="UniProtKB/TrEMBL:O06797" /db_xref="GeneID:885373" /translation="MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELV MGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNW RHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAF VMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVV GVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETITPSFVLPMDS" repeat_region 1996101..1997455 /note="IS6110-4, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-4" repeat_region 1996101..1996128 /note="28 bp inverted repeat at the left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 1996152..1996478 /locus_tag="Rv1763" /db_xref="GeneID:885372" CDS 1996152..1996478 /locus_tag="Rv1763" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv1763, (MTCY28.29), len: 108 aa. Putative Transposase for IS6110 insertion element" /codon_start=1 /transl_table=11 /product="putative transposase" /protein_id="NP_216279.1" /db_xref="GI:15608901" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:885372" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 1996529..1997413 /locus_tag="Rv1764" /db_xref="GeneID:885238" CDS 1996529..1997413 /locus_tag="Rv1764" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv1764, (MTCY28.30), len: 294 aa. Putative Transposase for IS6110 insertion element" /codon_start=1 /transl_table=11 /product="putative transposase" /protein_id="NP_216280.1" /db_xref="GI:15608902" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:885238" /translation="MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKE HISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIAD PATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVAST MATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGA VGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVP PVELEAAYYAQRQRPAAG" gene complement(1997418..1998515) /locus_tag="Rv1765c" /db_xref="GeneID:885256" CDS complement(1997418..1998515) /locus_tag="Rv1765c" /function="UNKNOWN" /note="Rv1765c, (MTCY28.31c), len: 365 aa. Conserved hypothetical protein, highly similar to O53461|Rv2015c|MTV018.02c CONSERVED HYPOTHETICAL PROTEIN (418 aa), (97.8% identity in 364 aa overlap). BLAST hits with non-IS part of sequence submitted under MTU78639." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216281.1" /db_xref="GI:15608903" /db_xref="GOA:O06798" /db_xref="UniProtKB/TrEMBL:O06798" /db_xref="GeneID:885256" /translation="MSSTATSGAAVVSPAERVEVLFEELAELAGQRNAIDGRIVEIVA ELDRDGLWGVTGARSVAGLVAWKMGCSSGNAHTIATVARRLPEFPRCARGMREGRLSL DQVGVIAGRAGEGSDAHYAQLAGVATVNQLRTALKLEPRPEPEPDFRPEPRPSITRSA DEQFSCWRIKLPHVEAAKFDAALQSHLDALIAEYKRDHDNSDGVSDQRPPLPGNVEAF LRLVEAGWDAEVARRPHGQHTTVVMHLDVQERAAGLHLGPLLSESERRYLLCDATFEA WFERDGQVIGCGRTTRQINRRLRRALEHRDRTCVVPGCGATRGLHAHHIRHWQDGGAT ELANLVLVCPYHHRAHHRGLNRPGESGDSLI" repeat_region complement(1997428..1997455) /note="28 bp inverted repeat at the right end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" repeat_region complement(1998584..1999813) /note="ISB9', len: 1230 bp. Insertion sequence ISB9, nearly identical to EM_BA:MTU78639. Note that this sequence shows several differences to EM_BA: MTU78639, and the transposase ORFs are extensively frameshifted. Our sequence has been checked and is thought to be correct; the sequence in EM_BA:MTU78639 is from a different isolate of Mycobacterium tuberculosis." /mobile_element="insertion sequence:ISB9'" repeat_region 1998584..1998597 /note="14 bp Inverted repeat at the left end of ISB9', ATCACCCCGCAAAG" gene complement(1999142..1999357) /locus_tag="Rv1765A" /db_xref="GeneID:3205098" CDS complement(1999142..1999357) /locus_tag="Rv1765A" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv1765A, len: 71 aa. Putative transposase (fragment), similar to part of many transposase genes including IS6110 e.g. P19774|TRA9_MYCTU PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis (278 aa), FASTA scores: opt: 231, E(): 4.7e-11, (45.35% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="YP_177652.1" /db_xref="GI:57116906" /db_xref="GOA:Q79FL0" /db_xref="UniProtKB/TrEMBL:Q79FL0" /db_xref="GeneID:3205098" /translation="MWVADITFVRTWQGFCYTAFVTDVCTRKIVVWAVSATMRTEDLP VQVFNHAVWQSNSDLSELVHHSDPGSQ" gene 1999737..2000006 /locus_tag="Rv1766" /db_xref="GeneID:885427" CDS 1999737..2000006 /locus_tag="Rv1766" /function="UNKNOWN" /note="Rv1766, (MTCY28.32), len: 89 aa. Conserved hypothetical protein, highly similar to P54431|YRKD_BACSU Hypothetical 7.0 kDa protein in bltr-spoIIIC intergenic region from Bacillus subtilis (63 aa), FASTA scores: opt: 151, E(): 1.5e-05, (53.3% identity in 45 aa overlap). Also similar to Q9RD62|SCF56.04C|AL133424 Hypothetical protein from Streptomyces coelicolor (92 aa), FASTA scores: opt: 239, E(): 1.3e-11, (62.5% identity in 64 aa overlap). Also some similarity to other Mycobacterium tuberculosis hypothetical proteins e.g. O07434|Rv0190|MTCI28.29 (96 aa), (35.5% identity in 62 aa overlap); P71543|Rv0967 (119 aa), and P71600|Rv0030 (109 aa). Start changed since original submission." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216282.2" /db_xref="GI:57116907" /db_xref="UniProtKB/TrEMBL:O06799" /db_xref="GeneID:885427" /translation="MIGDQDSIAAVLNRLRRAQGQLAGVISMIEQGRDCRDVVTQLAA VSRALDRAGFKIVAAGLKECVSGATASGAAPLSAAELEKLFLALA" repeat_region complement(1999800..1999813) /note="14 bp Inverted repeat at the right end of ISB9, ATCACCCCGGCAAG" gene 2000074..2000433 /locus_tag="Rv1767" /db_xref="GeneID:885443" CDS 2000074..2000433 /locus_tag="Rv1767" /function="UNKNOWN" /note="Rv1767, (MTCY28.33), len: 119 aa. Conserved hypothetical protein, similar to Q57498|YA53_HAEIN HYPOTHETICAL PROTEIN HI1053 from Haemophilus influenzae (113 aa), FASTA scores: opt: 233, E(): 6.4e-10, (40.0% identity in 90 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216283.1" /db_xref="GI:15608905" /db_xref="GOA:O06800" /db_xref="UniProtKB/TrEMBL:O06800" /db_xref="GeneID:885443" /translation="MSDQPRHHQVLDDLLPQHRALRHQIPQVYQRFVALGDAALTDGA LSRKVKELVALAIAVVQGCDGCVASHAQAAVRAGATAQEAAEAIGVTILMHGGPATIH GARAYAAFCEFADTTPS" gene 2000614..2002470 /gene="PE_PGRS31" /locus_tag="Rv1768" /db_xref="GeneID:885429" CDS 2000614..2002470 /gene="PE_PGRS31" /locus_tag="Rv1768" /function="UNKNOWN" /note="Rv1768, (MTCY28.34), len: 618 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to Q50615 HYPOTHETICAL 40.8 kDa PROTEIN (498 aa), FASTA scores: opt: 1703, E(): 0, (57.4% identity in 566 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177832.1" /db_xref="GI:57116908" /db_xref="UniProtKB/TrEMBL:Q79FK9" /db_xref="GeneID:885429" /translation="MSYLVVVPELVAAAATDLANIGSSISAANAAAAAPTTALVAAGG DEVSAAIAALFGAHARAYQALSAQAAMFHEQFVRALAAGGNSYAVAEAATAQSVQQDL LNLINAPTQALLGRPLIGNGANGLPGTGQNGGDGGILYGNGGNGGSGGVNQAGGNGGN AGLWGNGGSGGAGGNATTAGRNGFNGGAGGSGGLLWGNGGAGGAGGNGGPAPLVGGVG TTGGAGGNGGGAGLFYGFGGAGGNGGMGGVAPSTGPSMGILPAGGVGGPGGSGGASAL AFGSGGVGGAGGLGGPTDGTVQGVGGFGGQGGNGGQSGLLFGNAGAGGAGAAGGAGTG DTESFGGHGGAGGDGGAVGLIGNGGAGGTGSPGAVVGGNGGVGGLGGAGSPGGLLYGT GGAGGNGGPGGDGGTGATVGFAGSGGFGGAGGIAQLFGTGGMGGSGGGIGAGTTTVVP PDVAPVGGTGGNGGRAGLLLGVGGMGGNGGATSVGGTLYAAGGNGGDGGLVWGNGGTG GSGGAGGAGSVGNGGAGGNAALLFGNGGAGGAGGAGGIGAGGAGGFGAVLFGNGGAGG SGAPGGIGAGGNGGNALLVGNGGNGGAGTGGAAGGAGGSGGLLFGQNGMPGP" gene 2002626..2003870 /locus_tag="Rv1769" /db_xref="GeneID:885420" CDS 2002626..2003870 /locus_tag="Rv1769" /function="UNKNOWN" /note="Rv1769, (MTCY28.35), len: 414 aa. Conserved hypothetical protein, similar to O88066|SCI35.31|AL031541 hypothetical protein from Streptomyces coelicolor (402 aa), FASTA scores: opt: 1341, E(): 0, (53.8% identity in 398 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216285.1" /db_xref="GI:15608907" /db_xref="UniProtKB/TrEMBL:O06802" /db_xref="GeneID:885420" /translation="MHEVAAREQRSDGPMRLDAQGRLQRYEEAFADYDAPFAFVDLDA MWGNADQLLARAGDKPIRVASKSLRCRPLQREILDASERFDGLLTFTLTETLWLAGQG FSNLLLAYPPTDRAALRALGELTAKDPDGAPIVMVDSVEHLDLIERTTDKPVRLCLDF DAGYWRAGGRIKIGSKRSPLHTPEQARALAVEIARRPALTLAALMCYEAHIAGLGDNV AGKRVHNAIIRRMQRMSFEELRERRARAVELVREVADIKIVNAGGTGDLQLVAQEPLI TEATAGSGFYAPTLFDSYSTFTLQPAAMFALPVCRRPGAKTVTALGGGYLASGVGAKD RMPTPYLPVGLKLNALEGTGEVQTPLSGDAARRLKLGDKVYFRHTKAGELCERFDHLH LVRGAEVVDTVPTYRGEGRTFL" gene 2003878..2005164 /locus_tag="Rv1770" /db_xref="GeneID:885415" CDS 2003878..2005164 /locus_tag="Rv1770" /function="UNKNOWN" /note="Rv1770, (MTCY28.36), len: 428 aa. Conserved hypothetical protein, highly similar in N-terminus to Q49882 Hypothetical protein from Mycobacterium leprae from cosmid L247 (83 aa), FASTA scores: opt: 301, E(): 1e-12, (56.5% identity in 85 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216286.1" /db_xref="GI:15608908" /db_xref="UniProtKB/TrEMBL:O06803" /db_xref="GeneID:885415" /translation="MDEAHPAHPADAGRPGGPIQGARRGAAMTPITALPTELAAMREV VETLAPIERAAGEPGEHKAAEWIVERLRTAGAQDARIEEEQYLDGYPRLHLKLSVIGV AAGVAGLLSRRLRIPAALAGVGAGLAIADDCANGPRIVRKRTETPRTTWNAVAEAGDP AGQLTVVVCAHHDAAHSGKFFEAHIEEVMVELFPGIVERIDTQLPNWWGPILAPALAG VGALRGSRPMMIAGTVGSALAAALFADIARSPVVPGANDNLSAVALLVALAERLRERP VKGVRVLLVSLGAEETLQGGIYGFLARHKPELDRDRTYFLNFDTIGSPELIMLEGEGP TVMEDYFYRPFRDLVIRAAERADAPLRRGIRSRNSTDAVLMSRAGYPTACFVSINRHK SVANYHLMSDTPENLCYETVSHAVTVAESVIRELAR" gene 2005161..2006447 /locus_tag="Rv1771" /db_xref="GeneID:885441" CDS 2005161..2006447 /locus_tag="Rv1771" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1771, (MTCY28.37), len: 428 aa. Probable oxidoreductase (EC 1.-.-.-), similar to e.g. GGLO_RAT|P10867 l-gulonolactone oxidase (ec 1.1.3.8) (439 aa), FASTA scores: opt: 862, E(): 0, (34.1% identity in 434 aa overlap). Also shows slight similarity to Mycobacterium tuberculosis oxidoreductase Rv1726|MTCY04C12.11 (22.9% identity in 441 aa overlap) and others e.g. Rv3107c, Rv1257c, Rv2251, etc. Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_216287.1" /db_xref="GI:15608909" /db_xref="GOA:O06804" /db_xref="UniProtKB/TrEMBL:O06804" /db_xref="GeneID:885441" /translation="MSPIWSNWPGEQVCAPSAIVRPTSEAELADVIAQAAKRGERVRA VGSGHSFTDIACTDGVMIDMTGLQRVLDVDQPTGLVTVEGGAKLRALGPQLAQRRLGL ENQGDVDPQSITGATATATHGTGVRFQNLSARIVSLRLVTAGGEVLSLSEGDDYLAAR VSLGALGVISQVTLQTVPLFTLHRHDQRRSLAQTLERLDEFVDGNDHFEFFVFPYADK ALTRTMHRSDEQPKPTPGWQRMVGENFENGGLSLICQTGRRFPSVAPRLNRLMTNMMS SSTVQDRAYKVFATQRKVRFTEMEYAIPRENGREALQRVIDLVRRRSLPIMFPIEVRF SAPDDSFLSTAYGRDTCYIAVHQYAGMEFESYFRAVEEIMDDYAGRPHWGKRHYQTAA TLRERYPQWDRFAAVRDRLDPDRVFLNDYTRRVLGP" misc_feature 2005206..2005307 /locus_tag="Rv1771" /note="PS00862 Oxygen oxidoreductases covalent FAD-binding site" gene 2006636..2006947 /locus_tag="Rv1772" /db_xref="GeneID:885920" CDS 2006636..2006947 /locus_tag="Rv1772" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1772, (MTCY28.38), len: 103 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216288.1" /db_xref="GI:15608910" /db_xref="UniProtKB/TrEMBL:O06805" /db_xref="GeneID:885920" /translation="MGSTGGSQPMTANRGPAAISSGSNSGRVLDTARGILIALRRCPA ETAFDELHNAAQRHRLPVFEIAWALVHLAVEGSTPCRSFVDAQSAARREWGQLFAHAA A" gene complement(2007020..2007766) /locus_tag="Rv1773c" /db_xref="GeneID:885342" CDS complement(2007020..2007766) /locus_tag="Rv1773c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1773c, (MTCY28.39), len: 248 aa. Probable transcriptional regulator belonging to IclR family, similar to ICLR_ECOLI|P16528 acetate operon repressor from Escherichia coli (274 aa), FASTA scores: opt: 261, E(): 3.3e-10, (26.9% identity in 249 aa overlap). Also similar to Mycobacterium tuberculosis protein Rv1719|MTCY04C12.04 (40.2% identity in 244 aa overlap); and Rv2989. Start site chosen by homology, but may extend further upstream. Contains possible helix-turn-helix motif at aa 37-58 (+3.24 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216289.1" /db_xref="GI:15608911" /db_xref="GOA:O06806" /db_xref="UniProtKB/TrEMBL:O06806" /db_xref="GeneID:885342" /translation="MPPTEGKSTTNRDEGIQVLRRAVAALDEIAAEPGHLRLVDLCER LGLAKSTTRRLLVGLVEVGLVSVDSHGRFALGERLLGFGSVTGAHIAAAFRPTVERVA RATDGETVDLSVLRGQRMWFVDQIESSYRLRAVSAVGLRFPLNGTANGKAALAALDDA DAEAALCRLDPMVAEGLRREIVEIRRTGIAFDRNEHTPGISAAAIARRALGDNVIAIS VPAPTARFLEKEQRIIAALRAAADSPDWTR" gene 2007832..2009172 /locus_tag="Rv1774" /db_xref="GeneID:885367" CDS 2007832..2009172 /locus_tag="Rv1774" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1774, (MTCY25C11.01), len : 446 aa. Probable oxidoreductase (EC 1.-.-.-), similar to several e.g. HDNO_ARTOX|P08159 6-hydroxy-d-nicotine oxidase (458 aa), FASTA scores: opt: 417, E(): 6e-20, (28.4% identity in 462 aa overlap). Also some similarity to Mycobacterium tuberculosis oxidoreductase MTCY04C12.11 (24.1% identity in 444 aa overlap). Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_216290.1" /db_xref="GI:15608912" /db_xref="GOA:O33177" /db_xref="UniProtKB/TrEMBL:O33177" /db_xref="GeneID:885367" /translation="MRALPAGRHFFRGSDGYEAARRGTVWHRRVPDRYPEVIVQAVSA DDIVSAIRYATVNGHKVSVVSGGHSFAASHLRDGAVLLDVSRIDHASIDADKGRAVVG PGKGGSVLMAELEAQGLFFPGGHCRGVCLGGYLLQGGYGWNSRIYGPACESVIGLDVI TADGAQIHCDADNHADLYWAARGAGPGFFGVVTSFYLKLYPRPATCGTSVYVYPFDLA DEVFTWARAVSAEVDPRVELQALASRGEPSMGIDVPVISLASPAFADSPEEAEQALAL FGTCPVVEQALVKVPYMPTDLPAWYDVAMTHYLSDHHYAVDNMWTSASAEDLLPGIRS ILDTLPPHPAHFLWLNWGPCPPRQDMAYSIEADIYLALYGSWKDPADEAKYADWARSH MAAMSHLAVGIQLADENLGARPARFASDAAMAKLDRVRAEYDPDGLFNSWMGRI" misc_feature 2007934..2008035 /locus_tag="Rv1774" /note="PS00862 Oxygen oxidoreductases covalent FAD-binding site" gene 2009172..2009990 /locus_tag="Rv1775" /db_xref="GeneID:885842" CDS 2009172..2009990 /locus_tag="Rv1775" /function="UNKNOWN" /note="Rv1775, (MTCY25C11.02), unknown, len: 272 aa. Conserved hypothetical protein, similar to O28806|AF1466 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (255 aa), FASTA scores: opt: 364, E(): 1e-17, (29.2% identity in 267 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216291.1" /db_xref="GI:15608913" /db_xref="UniProtKB/TrEMBL:O33178" /db_xref="GeneID:885842" /translation="MASDLYLGYRNDDADTPFGKFFKPEMAPLPQHVVVALQHGPQAG MALLAFDDAASIVDEGYQQTENGYGILGDGSMQVSVRTDMPGVTPAMWAWWFGWHGSD TRRYKLWHPRAHLSARWKDGDQDSGAGRRGAQRYVGRWSMISEYIGSTKLGAAIQFVE PAAMGLPDDSDDTVSICARLGSADAPVDAGWFVHQVRSTPGGSEMRSRFWMGGPHIAV RKAPEVASKAVRPIASKLIGVSESTARNLLVYCAQEMNHLAGFLADLWESFGDE" gene complement(2009995..2010555) /locus_tag="Rv1776c" /db_xref="GeneID:885821" CDS complement(2009995..2010555) /locus_tag="Rv1776c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1776c, (MTCY25C11.03c), len: 186 aa. Possible regulatory protein, some similarity to Mycobacterium tuberculosis Rv1255c|Q11063 hypothetical transcriptional regulator (202 aa), FASTA scores: opt: 270, E( ): 9.7e-09, (28.3% identity in 191 aa overlap) . Contains possible helix-turn-helix motif at aa 37-58 (+3.49 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216292.1" /db_xref="GI:15608914" /db_xref="GOA:O33179" /db_xref="UniProtKB/TrEMBL:O33179" /db_xref="GeneID:885821" /translation="MPGNDWIVGGNRRTIAAERIYAAATDLITRYGLNALDIDKLARE VHCSRATIYRRAGGKAQIRDVVLTRAAARIADGVRSDVETLRGRERVVAAILLSLQRI RSDPLGKLMFGSIHGGAGELAWLTESPLLADFATELTGIAGGDPQGAKWVVRVVLSLM YWPAENDEAERRLVEKYVAPAFAEQS" gene 2010656..2011960 /gene="cyp144" /locus_tag="Rv1777" /db_xref="GeneID:885839" CDS 2010656..2011960 /gene="cyp144" /locus_tag="Rv1777" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv1777, (MT1827, MTCY25C11.04), len: 434 aa. Probable cyp144, cytochrome p450 (EC 1.14.-.-), similar to CPXM_BACME|Q06069 cytochrome p450 (meg) (EC 1.14.99.-) (410 aa), FASTA scores: opt: 435 E(): 2.3e-16, (28.8% identity in 372 aa overlap). Also similar to several other Mycobacterium tuberculosis p450 genes including Rv0766c, Rv2266, etc. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome p450 144 CYP144" /protein_id="NP_216293.1" /db_xref="GI:15608915" /db_xref="GOA:O33180" /db_xref="UniProtKB/Swiss-Prot:O33180" /db_xref="GeneID:885839" /translation="MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAE SVQDPYPLYERMRAAGSVHRIANSDFYAVCGWDAVNEAIGRPEDFSSNLTATMTYTAE GTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQFTVQAADRLWV DGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKWGYAATQLLEGLVENDQLV AAGVALMELSGYIFEQFDRAAADPRDNLLGELATACASGELDTLTAQVMMVTLFAAGG ESTAALLGSAVWILATRPDIQQQVRANPELLGAFIEETLRYEPPFRGHYRHVRNATTL DGTELPADSHLLLLWGAANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALAR LEARIVLRLLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ" misc_feature 2011787..2011816 /gene="cyp144" /locus_tag="Rv1777" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(2012081..2012530) /locus_tag="Rv1778c" /db_xref="GeneID:885424" CDS complement(2012081..2012530) /locus_tag="Rv1778c" /function="UNKNOWN" /note="Rv1778c, (MTCY25C11.05c), len: 149 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216294.1" /db_xref="GI:15608916" /db_xref="UniProtKB/TrEMBL:O33181" /db_xref="GeneID:885424" /translation="MRVSLFLSDAAQADAQSGKVHALGLGWRQCQTPTPPFALVLFLD IDWDETNKQHQLKCQLLTADGDPVVVPGPHGPQRILFEAAAEAGRAPGAIHGTSVRMP LTLNIPAGIPLEPGIYEWRVEVEGYERATAVEAFIVAGGGHPPASCG" gene complement(2012686..2014479) /locus_tag="Rv1779c" /db_xref="GeneID:885829" CDS complement(2012686..2014479) /locus_tag="Rv1779c" /function="UNKNOWN" /note="Rv1779c, (MTV049.01c), len: 597. Possible integral membrane protein. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216295.1" /db_xref="GI:15608917" /db_xref="UniProtKB/TrEMBL:O53930" /db_xref="GeneID:885829" /translation="MCAHEYAEQRSAVSGIEGLLTWLGGGHWRELGERHERSTHAVAG VIVAVGAALAGLLASLAVSEAAQGPISSPIGAASLALVLGLLVGAVTRGTASGPARGR AGVTGRASVAVAVGFVVGELAALVMFSGAIDRRLDEQAMHSADATPAAVQASASLQQA RNARTALDSAVERARGRLDDALVVARCEYHPTPACPQTRITGVPGRGPETRTANQLLA DAQRELDNALAARDHQAPALDAKMAHDEQALAEVRQAVVADAGRGLGSRWVAMNDLTL ASAGALTARMLAIAFFALLYLLPLILRLWRGDTTHDRHAAARAERERAELEADTAIAI KRAEVRRAAEIMWAEHQLTQTRLAIEAQAEIDREQQRRRVVEALEGPVRASSERTLQP VEDEVYLPIAAETEAASRTVAQLPAGAAHHRPGIAKNLPAQVQPEGAVEPREKRATPV IRSIPDATKAAARWIRPLVPPFVARMLDNTTAPLRTARQVFEEVEEIAFSFKRTHKVT VNAEGSDPNDQPPLESHSPAAPAESNPIASSDSARRSRLATNDDHPPLAQVPPRDLAS LSVGSTGELTQREGPHELRSPDGPRQLPPPR" gene 2014699..2015262 /locus_tag="Rv1780" /db_xref="GeneID:885851" CDS 2014699..2015262 /locus_tag="Rv1780" /function="UNKNOWN" /note="Rv1780, (MTV049.02), len: 187 aa. Conserved hypothetical protein, equivalent to Q49881|ML1380|U00021_2 cosmid L247 from Mycobacterium leprae (187 aa), FASTA scores: opt: 1000, E(): 0, (82.4% identity in 187 aa overlap). TBparse score is 0.930" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216296.1" /db_xref="GI:15608918" /db_xref="UniProtKB/TrEMBL:O53931" /db_xref="GeneID:885851" /translation="MQNHDYVTYEEFGRRFFEVAVTPDRVAAAFADIAGSEFAMEPIS QGPGGIAKVSANVKIREPRVTRKLGDLITFVIHIPLSIDLLLDLRLDKQRFMVAGDIA LRATARAAEPLLLIVDVAKPRPSDITVNVSSKSIRGEVLRILAGVDGEIRRFIAQYVS AEIDSPKSQAAQVINVAEQLDSTWSGP" gene complement(2015302..2017476) /gene="malQ" /locus_tag="Rv1781c" /db_xref="GeneID:885854" CDS complement(2015302..2017476) /gene="malQ" /locus_tag="Rv1781c" /EC_number="2.4.1.25" /function="Transfers a segment of a (1,4)-alpha-D-glucan to a new 4-position in an acceptor, which may be glucose or (1,4)-alpha-D-glucan" /note="Rv1781c, (MTV049.03c), len: 724 aa. Probable malQ, 4-ALPHA-GLUCANOTRANSFERASE (EC 2.4.1.25), similar to many, e.g. P15977|MALQ_ECOLI 4-ALPHA-GLUCANOTRANSFERASE (694 aa), FASTA scores: opt: 964, E(): 0, (31.8% identity in 694 aa overlap). BELONGS TO THE DISPROPORTIONATING ENZYME FAMILY." /codon_start=1 /transl_table=11 /product="4-alpha-glucanotransferase MalQ" /protein_id="NP_216297.1" /db_xref="GI:15608919" /db_xref="GOA:P65336" /db_xref="UniProtKB/Swiss-Prot:P65336" /db_xref="GeneID:885854" /translation="MTELAPSLVELARRFGIATEYTDWTGRQVLVSEATLVAALAALG VPAQTEQQRNDALAAQLRSYWARPLPATIVMRAGEQTQFRVHVTDGAPADVWLQLEDG TTRAEVVQVDNFTPPFDLDGRWIGEASFVLPADLPLGYHRVNLRSGDSQASAAVVVTP DWLGLPDKLAGRRAWGLAVQLYSVRSRQSWGIGDLTDLANLALWSASAHGAGYVLVNP LHAATLPGPAGRSKPIEPSPYLPTSRRFVNPLYLRVEAIPELVDLPKRGRVQRLRTNV QQHADQLDTIDRDSAWAAKRAALKLVHRVPRSAGRELAYAAFRTREGRALDDFATWCA LAETYGDDWHRWPKSLRHPDASGVADFVDKHADAVDFHRWLQWQLDEQLASAQSQALR AGMSLGIMADLAVGVHPNGADAWALQDVLAQGVTAGAPPDEFNQLGQDWSQPPWRPDR LAEQEYRPFRALIQAALRHAGAVRIDHIIGLFRLWWIPDGAPPTQGTYVRYDHDAMIG IVALEAHRAGAVVVGEDLGTVEPWVRDYLLLRGLLGTSILWFEQDRDCGPAGTPLPAE RWREYCLSSVTTHDLPPTAGYLAGDQVRLRESLGLLTNPVEAELESARADRAAWMAEL RRVGLLADGAEPDSEEAVLALYRYLGRTPSRLLAVALTDAVGDRRTQNQPGTTDEYPN WRVPLTGPDGQPMLLEDIFTDRRAATLAEAVRAATTSPMSCW" gene 2017740..2019260 /locus_tag="Rv1782" /db_xref="GeneID:885347" CDS 2017740..2019260 /locus_tag="Rv1782" /function="UNKNOWN" /note="Rv1782, (MTV049.04), len: 506 aa. Probable conserved membrane protein, similar to four other Mycobacterium tuberculosis hypothetical membrane proteins e.g. O05449|Rv3895c|MTCY15F10.17|Z94121 (495 aa), FASTA scores: opt: 1106, E(): 0, (41.2% identity in 485 aa overlap); Rv0283, Rv3450c, and Rv3869, all located near ESAT-6 family genes. Also similar to O33088|MLCB628.17C|Y14967 cosmid B628 from Mycobacterium leprae (481 aa), (32.7% identity in 486 aa overlap); and equivalent to Q9Z5I3|MLCB596.27|AL035472 hypothetical protein from Mycobacterium leprae (506 aa) (82.6% identity in 506 aa overlap). Has hydrophobic stretch from aa 54-76." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216298.1" /db_xref="GI:15608920" /db_xref="UniProtKB/TrEMBL:O53933" /db_xref="GeneID:885347" /translation="MAEESRGQRGSGYGLGLSTRTQVTGYQFLARRTAMALTRWRVRM EIEPGRRQTLAVVASVSAALVICLGALLWSFISPSGQLNESPIIADRDSGALYVRVGD RLYPALNLASARLITGRPDNPHLVRSSQIATMPRGPLVGIPGAPSSFSPKSPPASSWL VCDTVATSSSIGSLQGVTVTVIDGTPDLTGHRQILSGSDAVVLRYGGDAWVIREGRRS RIEPTNRAVLLPLGLTPEQVSQARPMSRALFDALPVGPELLVPEVPNAGGPATFPGAP GPIGTVIVTPQISGPQQYSLVLGDGVQTLPPLVAQILQNAGSAGNTKPLTVEPSTLAK MPVVNRLDLSAYPDNPLEVVDIREHPSTCWWWERTAGENRARVRVVSGPTIPVAATEM NKVVSLVKADTSGRQADQVYFGPDHANFVAVTGNNPGAQTSESLWWVTDAGARFGVED SKEARDALGLTLTPSLAPWVALRLLPQGPTLSRADALVEHDTLPMDMTPAELVVPK" gene 2019257..2020564 /locus_tag="Rv1783" /db_xref="GeneID:885898" CDS 2019257..2020564 /locus_tag="Rv1783" /function="UNKNOWN" /note="Rv1783, (MTV049.05), len: 435 aa. Probable conserved membrane protein. Member of family of Mycobacterium tuberculosis hypothetical proteins including O05450|Rv3894c|MTY15F10.18|Z94121 (1396 aa), FASTA scores: opt: 542, E(): 1.5e-26, (31.4% identity in 440 aa overlap); Rv3447c, Rv0284, Rv3870, Rv1784, and Rv3871, all linked to ESAT-6 family gene. Similar to N-terminal part of Rv3894c (1396 aa), Rv1784 is similar to remainder of Rv3894c. Also similar to O33087|MLCB628.16C|Y14967 Hypothetical protein from Mycobacterium leprae (744 aa), (30.0% identity in 437 aa overlap) and equivalent to N-terminal part of Q9Z5I2|MLCB596.28|AL035472 hypothetical protein from Mycobacterium leprae (1345 aa), (86.4% identity in 397 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216299.1" /db_xref="GI:15608921" /db_xref="UniProtKB/TrEMBL:O53934" /db_xref="GeneID:885898" /translation="MKRGFARPTPEKPPVIKPENIVLSTPLSIPPPEGKPWWLIVVGV VVVGLLGGMVAMVFASGSHVFGGIGSIFPLFMMVGIMMMMFRGMGGGQQQMSRPKLDA MRAQFMLMLDMLRETAQESADSMDANYRWFHPAPNTLAAAVGSPRMWERKPDGKDLNF GVVRVGVGMTRPEVTWGEPQNMPTDIELEPVTGKALQEFGRYQSVVYNLPKMVSLLVE PWYALVGEREQVLGLMRAIICQLAFSHGPDHVQMIVVSSDLDQWDWVKWLPHFGDSRR HDAAGNARMVYTSVREFAAEQAELFAGRGSFTPRHASSSAQTPTPHTVIIADVDDPQW EYVISAEGVDGVTFFDLTGSSMWTDIPERKLQFDKTGVIEALPRDRDTWMVIDDKAWF FALTDQVSIAEAEEFAQKLAQWRLAEAYEEIGQRVAHIGARDI" gene 2020634..2023432 /locus_tag="Rv1784" /db_xref="GeneID:885834" CDS 2020634..2023432 /locus_tag="Rv1784" /function="UNKNOWN" /note="Rv1784, (MTV049.06), len: 932 aa. Conserved hypothetical protein, member of family of Mycobacterium tuberculosis hypothetical proteins including Rv3447c, Rv0284, Rv3870, Rv1783, Rv3871, Rv3894c, all linked to ESAT-6 family genes. Probably ATP-binding membrane proteins. Similar to C-terminal region of 006264|Rv3447c (1236 aa), (36.2% identity in 930 aa overlap). Equivalent to C-terminal region of Mycobacterium leprae hypothetical protein Q9Z512|MLCB596.28|AL035472 (1345 aa), (87.8% identity in 932 aa overlap); also similar to other hypothetical proteins e.g. MLCB628.14 from Mycobacterium leprae, (32.0% identity in 600 aa overlap); MLCB628.15 from Mycobacterium leprae, (35.0% identity in 280 aa overlap); and O86653|SC3C3.20|AL031231 ATP/GTP binding protein from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 618, E(): 4.6e-30, (34.3% identity in 937 aa overlap). Contains two times PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216300.1" /db_xref="GI:15608922" /db_xref="GOA:O53935" /db_xref="UniProtKB/TrEMBL:O53935" /db_xref="GeneID:885834" /translation="MGRSRLRAPFGNRSDNGELLFLDMKSLDEGGDGPHGVMSGTTGS GKSTLVRTVIESLMLSHPPEELQFVLADLKGGSAVKPFAGVPHVSRIITDLEEDQALM ERFLDALWGEIARRKAICDSAGVDDAKEYNSVRARMRARGQDMAPLPMLVVVIDEFYE WFRIMPTAVDVLDSIGRQGRAYWIHLMMASQTIESRAEKLMENMGYRLVLKARTAGAA QAAGVPNAVNLPAQAGLGYFRKSLEDIIRFQAEFLWRDYFQPGVSIDGEEAPALVHSI DYIRPQLFTNSFTPLEVSVGGPDIEPVVAQPNGEVLESDDIEGGEDEDEEGVRTPKVG TVIIDQLRKIKFEPYRLWQPPLTQPVAIDDLVNRFLGRPWHKEYGSACNLVFPIGIID RPYKHDQPPWTVDTSGPGANVLILGAGGSGKTTALQTLICSAALTHTPQQVQFYCLAY SSTALTTVSRIPHVGEVAGPTDPYGVRRTVAELLALVRERKRSFLECGIASMEMFRRR KFGGEAGPVPDDGFGDVYLVIDNYRALAEENEVLIEQVNVIINQGPSFGVHVVVTADR ESELRPPVRSGFGSRIELRLAAVEDAKLVRSRFAKDVPVKPGRGMVAVNYVRLDSDPQ AGLHTLVARPALGSTPDNVFECDSVVAAVSRLTSAQAPPVRRLPARFGVEQVRELASR DTRQGVGAGGIAWAISELDLAPVYLNFAENSHLMVTGRRECGRTTTLATIMSEIGRLY APGASSAPPPAPGRPSAQVWLVDPRRQLLTALGSDYVERFAYNLDGVVAMMGELAAAL AGREPPPGLSAEELLSRSWWSGPEIFLIVDDIQQLPPGFDSPLHKAVPFVNRAADVGL HVIVTRTFGGWSSAGSDPMLRALHQANAPLLVMDADPDEGFIRGKMKGGPLPRGRGLL MAEDTGVFVQVAATEVRR" misc_feature 2020751..2020774 /locus_tag="Rv1784" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 2021882..2021905 /locus_tag="Rv1784" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(2023447..2024628) /gene="cyp143" /locus_tag="Rv1785c" /db_xref="GeneID:885907" CDS complement(2023447..2024628) /gene="cyp143" /locus_tag="Rv1785c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv1785c, (MT1834, MTV049.07c), len: 393 aa. Probable cyp143, cytochrome P450 (1.14.-.-), similar to many e.g. AE0001|RZAE000101_4 Rhizobium sp. NGR234 (414 aa), FASTA scores: opt: 663, E(): 0, (32.4% identity in 413 aa overlap). Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 143" /protein_id="NP_216301.1" /db_xref="GI:15608923" /db_xref="GOA:P63723" /db_xref="UniProtKB/Swiss-Prot:P63723" /db_xref="GeneID:885907" /translation="MTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFM NGWYYLTRREDVLAALRNPKVFSSRKALQPPGNPLPVVPLAFDPPEHTRYRRILQPYF SPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRL IGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPLS EIEVLGLSHLLILAGLDTVTAAVGFSLLELARRPQLRAMLRDNPKQIRVFIEEIVRLE PSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDAMSTDELVMDGKVHRHWG FGGGPHRCLGSHLARLELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRW S" misc_feature complement(2023597..2023626) /gene="cyp143" /locus_tag="Rv1785c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene 2024828..2025031 /locus_tag="Rv1786" /db_xref="GeneID:885846" CDS 2024828..2025031 /locus_tag="Rv1786" /function="FERREDOXINS ARE IRON-SULFUR PROTEINS THAT TRANSFER ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS." /note="Rv1786, (MTV049.08), len: 67 aa. Probable ferredoxin (EC 1.-.-.-), similar to others e.g. X63601|FERS_STRGR FERREDOXIN from Streptomyces griseus (65 aa), FASTA scores: opt: 140, E(): 0.001, (38.1% identity in 63 aa overlap); T50943 probable ferredoxin DitA from Pseudomonas abietaniphila (78 aa); BAA84714.1|AB017795 ferredoxin from Nocardioides sp. (69 aa); etc. Also similar to Rv0763c|MTCY369.08 from Mycobacterium tuberculosis (68 aa), FASTA score: (30.6% identity in 62 aa overlap); and Rv0763c." /codon_start=1 /transl_table=11 /product="ferredoxin" /protein_id="NP_216302.1" /db_xref="GI:15608924" /db_xref="UniProtKB/TrEMBL:O53937" /db_xref="GeneID:885846" /translation="MKVRLDPSRCVGHAQCYAVDPDLFPIDDSGNSILAEHEVRPEDM QLTRDGVAACPEMALILEEDDAD" gene 2025301..2026398 /gene="PPE25" /locus_tag="Rv1787" /db_xref="GeneID:885827" CDS 2025301..2026398 /gene="PPE25" /locus_tag="Rv1787" /function="UNKNOWN" /note="Rv1787, (MTV049.09), len: 365 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to Z74024|MTCY274.24 Mycobacterium tuberculosis cosmid (404 aa), FASTA scores: opt: 837, E(): 0, (52.0% identity in 406 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177833.1" /db_xref="GI:57116909" /db_xref="UniProtKB/TrEMBL:Q79FK8" /db_xref="GeneID:885827" /translation="MDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATG YASVIAELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAAFV MTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVAMYGYAAASAS ASRLIPFAAPPKTTNSAGVVAQVAAVAAMPGLLQRLSSAASVSWSNPNDWWLVRLLGS ITPTERTTIVRLLGQSYFATGMAQFFASIAQQLTFGPGGTTAGSGGAWYPTPQFAGLG ASRAVSASLARANKIGALSVPPSWVKTTALTESPVAHAVSANPTVGSSHGPHGLLRGL PLGSRITRRSGAFAHRYGFRHSVVARPPSAG" gene 2026477..2026776 /gene="PE18" /locus_tag="Rv1788" /db_xref="GeneID:885895" CDS 2026477..2026776 /gene="PE18" /locus_tag="Rv1788" /function="UNKNOWN" /note="Rv1788, (MTV049.10), len: 99 aa. Member of the Mycobacterium tuberculosis PE family of gly-, ala-rich proteins (see citation below), similar to Z93777|MTCI364.07 Mycobacterium tuberculosis cosmid (99 aa), FASTA scores: opt: 414, E(): 3.6e-20, (72.4% identity in 98 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177834.1" /db_xref="GI:57116910" /db_xref="UniProtKB/TrEMBL:Q7D7Y9" /db_xref="GeneID:885895" /translation="MSFVTTQPEALAAAAGSLQGIGSALNAQNAAAATPTTGVVPAAA DEVSALTAAQFAAHAQIYQAVSAQAAAIHEMFVNTLQMSSGSYAATEAANAAAAG" gene 2026790..2027971 /gene="PPE26" /locus_tag="Rv1789" /db_xref="GeneID:885333" CDS 2026790..2027971 /gene="PPE26" /locus_tag="Rv1789" /function="UNKNOWN" /note="Rv1789, (MTV049.11), len: 393 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, highly similar to others e.g.Z98268|MTCI125.26 Mycobacterium tuberculosis cosmid (385 aa), FASTA score: opt: 1283, E(): 0, (62.7% identity in 408 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177835.1" /db_xref="GI:57116911" /db_xref="UniProtKB/TrEMBL:Q79FK6" /db_xref="GeneID:885333" /translation="MDFGALPPEVNSVRMYAGPGSAPMVAAASAWNGLAAELSSAATG YETVITQLSSEGWLGPASAAMAEAVAPYVAWMSAAAAQAEQAATQARAAAAAFEAAFA ATVPPPLIAANRASLMQLISTNVFGQNTSAIAAAEAQYGEMWAQDSAAMYAYAGSSAS ASAVTPFSTPPQIANPTAQGTQAAAVATAAGTAQSTLTEMITGLPNALQSLTSPLLQS SNGPLSWLWQILFGTPNFPTSISALLTDLQPYASFFYNTEGLPYFSIGMGNNFIQSAK TLGLIGSAAPAAVAAAGDAAKGLPGLGGMLGGGPVAAGLGNAASVGKLSVPPVWSGPL PGSVTPGAAPLPVSTVSAAPEAAPGSLLGGLPLAGAGGAGAGPRYGFRPTVMARPPFA G" gene 2028425..2029477 /gene="PPE27" /locus_tag="Rv1790" /db_xref="GeneID:885859" CDS 2028425..2029477 /gene="PPE27" /locus_tag="Rv1790" /function="UNKNOWN" /note="Rv1790, (MTV049.12), len: 350 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich protein, similar to Z74024|MTCY274.24 Mycobacterium tuberculosis cosmid (404 aa), FASTA scores: opt: 849, E(): 0, (50.0% identity in 406 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177836.1" /db_xref="GI:57116912" /db_xref="UniProtKB/TrEMBL:Q79FK5" /db_xref="GeneID:885859" /translation="MDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATG YASVIAELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAAFV MTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVAMYGYAAASAS ASRLIPFAAPPKTTNSAGVVAQAVASVSWPNPNDWWLVRLLGSITPTERTTIVRLLGQ SYLATGMARFLTSIAQQLTFGPGGTTAGSGGAWYPTPQFAGLGAGPAVSASLARAEPV GRLSVPPSWAVAAPAFAEKPEAGTPMSVIGEASSCGQGGLLRGIPLARAGRRTGAFAH RYGFRHSVITRSPSAG" gene 2029904..2030203 /gene="PE19" /locus_tag="Rv1791" /db_xref="GeneID:885445" CDS 2029904..2030203 /gene="PE19" /locus_tag="Rv1791" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1791, (MTV049.13), len: 99 aa. Member of the Mycobacterium tuberculosis PE family, but no glycine rich C-terminus (see Brennan & Delogu 2002), highly similar to Z93777|MTCI364.07 M.tuberculosis cosmid (99 aa) opt: 430 E(): 2.4e-21, (75.5% identity in 98 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177837.1" /db_xref="GI:57116913" /db_xref="UniProtKB/TrEMBL:Q79FK4" /db_xref="GeneID:885445" /translation="MSFVTTQPEALAAAAANLQGIGTTMNAQNAAAAAPTTGVVPAAA DEVSALTAAQFAAHAQMYQTVSAQAAAIHEMFVNTLVASSGSYAATEAANAAAAG" gene 2030347..2030643 /gene="esxM" /locus_tag="Rv1792" /pseudo /db_xref="GeneID:886252" misc_feature 2030347..2030643 /gene="esxM" /locus_tag="Rv1792" /experiment="experimental evidence, no additional details recorded" /note="Rv1792, (MTV049.14), len: 98 aa. esxM, ESAT-6 like protein (see Gey Van Pittius et al., 2001), member of Mycobacterium tuberculosis QILSS family of proteins with Rv1038c, Rv1197, Rv3620c and Rv2347c. Has in-frame stop codon at 18074, no error could be found to account for this. Identical (apart from stop codon) to P96363|Rv1038c|MTCY10G2.11 PUTATIVE ESAT-6 LIKE PROTEIN 2 (98 aa), FASTA scores: opt: 389, E(): 5.8e-26, (100.0% identity in 58 aa overlap). Similar protein present in Mycobacterium leprae e.g. Q49946|MLCB1701.06C|AL049191 PUTATIVE ESAT-6 LIKE PROTEIN X (95 aa), FASTA scores: opt: 343, E(): 1.6e-17, (57.6% identity in 92 aa overlap). SEEMS TO BELONG TO THE ESAT6 FAMILY.; TB11.0, QILSS;ESAT-6 LIKE PROTEIN ESXM" /pseudo gene 2030694..2030978 /gene="esxN" /locus_tag="Rv1793" /db_xref="GeneID:885448" CDS 2030694..2030978 /gene="esxN" /locus_tag="Rv1793" /function="UNKNOWN" /note="Rv1793, (MT1842, MTV049.15), len: 94 aa. esxN, ESAT-6 like protein (see citation below), almost identical to several mycobacterial proteins of the ESAT-6-like family including P95242|Rv2346c|MTCY98.15C|Z83860 PUTATIVE ESAT-6 LIKE PROTEIN 6 (94 aa), FASTA scores: opt: 610, E(): 0, (97.9 % identity in 94 aa overlap); Rv3619c, Rv1037c, and Rv1198, etc. Also present in Mycobacterium leprae. SEEMS TO BELONG TO THE ESAT6 FAMILY.; ES6_5, Mtb9.9A" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXN (ESAT-6 like protein 5)" /protein_id="YP_177838.1" /db_xref="GI:57116914" /db_xref="UniProtKB/Swiss-Prot:O53942" /db_xref="GeneID:885448" /translation="MTINYQFGDVDAHGAMIRAQAASLEAEHQAIVRDVLAAGDFWGG AGSVACQEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" gene 2031066..2031968 /locus_tag="Rv1794" /db_xref="GeneID:885881" CDS 2031066..2031968 /locus_tag="Rv1794" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1794, (MTV049.16), len: 300 aa. Conserved hypothetical protein, slight similarity to Mycobacterium tuberculosis O53694|Rv0289|MTV035.17, (295 aa), FASTA scores: opt: 172, E(): 0.00083, (25.7% identity in 261 aa overlap). Equivalent to Mycobacterium leprae hypothetical protein Q9Z5I1|MLCB596.31|AL035472 (300 aa), (88.0% identity in 300 aa overlap). Contains PS00211 ABC transporters family signature. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216310.1" /db_xref="GI:15608931" /db_xref="UniProtKB/TrEMBL:O53943" /db_xref="GeneID:885881" /translation="MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSN DWLNEHPGMAVMREQGIVVNDAVNEQVAARMKVLAAPDLEVVALLSRGKLLYGVIDDE NQPPGSRDIPDNEFRVVLARRGQHWVSAVRVGNDITVDDVTVSDSASIAALVMDGLES IHHADPAAINAVNVPMEEMLEATKSWQESGFNVFSGGDLRRMGISAATVAALGQALSD PAAEVAVYARQYRDDAKGPSASVLSLKDGSGGRIALYQQARTAGSGEAWLAICPATPQ LVQVGVKTVLDTLPYGEWKTHSRV" misc_feature 2031645..2031689 /locus_tag="Rv1794" /note="PS00211 ABC transporters family signature" gene 2032240..2033751 /locus_tag="Rv1795" /db_xref="GeneID:885628" CDS 2032240..2033751 /locus_tag="Rv1795" /function="UNKNOWN" /note="Rv1795, (MTV049.17), len: 503 aa. Conserved hypothetical membrane protein, has a hydrophilic stretch from 1-130 then very hydrophobic. Similar to several other mycobacterial proteins, all linked to ESAT-6 family e.g. Rv3887c|MTY15F10.24|Z94121 (509 aa), FASTA scores: opt: 360, E(): 1.6e-15, (26.7% identity in 514 aa overlap); Rv3448, and Rv0290. TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216311.1" /db_xref="GI:15608932" /db_xref="UniProtKB/TrEMBL:O53944" /db_xref="GeneID:885628" /translation="MTAVADAPQADIEGVASPQAVVVGVMAGEGVQIGVLLDANAPVS VMTDPLLKVVNSRLRELGEAPLEATGRGRWALCLVDGAPLRATQSLTEQDVYDGDRLW IRFIADTERRSQVIEHISTAVASDLSKRFARIDPIVAVQVGASMVATGVVLATGVLGW WRWHHNTWLTTIYTAVIGVLVLAVAMLLLMRAKTDADRRVADIMLMSAIMPVTVAAAA APPGPVGSPQAVLGFGVLTVAAALALRFTGRRLGIYTTIVIIGALTMLAALARMVAAT SAVTLLSSLLLICVVAYHAAPALSRRLAGIRLPVFPSATSRWVFEARPDLPTTVVVSG GSAPVLEGPSSVRDVLLQAERARSFLSGLLTGLGVMVVVCMTSLCDPHTGQRWLPLIL AGFTSGFLLLRGRSYVDRWQSITLAGTAVIIAAAVCVRYALELSSPLAVSIVAAILVL LPAAGMAAAAHVPHTIYSPLFRKFVEWIEYLCLMPIFPLALWLMNVYAAIRYR" gene 2033729..2035486 /gene="mycP5" /locus_tag="Rv1796" /db_xref="GeneID:885879" CDS 2033729..2035486 /gene="mycP5" /locus_tag="Rv1796" /EC_number="3.4.21.-" /function="THOUGHT TO HAVE PROTEOLYTIC ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="Rv1796, (MTV049.18), len: 585 aa. Probable mycP5, pro-rich membrane-anchored serine protease (mycosin) (EC 3.4.21.-) (see citations below). Member of family with four other Mycobacterium tuberculosis serine proteases: Rv3886c|O05458|MTCY15F10.26|Z94121 (550 aa), FASTA scores: opt: 1173, E(): 0, (47.9% identity in 578 aa overlap); Rv0291, Rv3883c, and Rv3449. Genes all linked to those of ESAT-6 family. Has possible N-terminal signal peptide and hydrophobic anchor-like stretch at C-terminus. Contains two serine protease, subtilase family active site motifs: a aspartic acid active site motif (PS00136); and a histidine active site motif (PS00137). BELONGS TO PEPTIDASE FAMILY S8 (ALSO KNOWN AS THE SUBTILASE FAMILY), PYROLYSIN SUBFAMILY. TBparse score is 0.930." /codon_start=1 /transl_table=11 /product="proline rich membrane-anchored mycosin MYCP5 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-5)" /protein_id="NP_216312.1" /db_xref="GI:15608933" /db_xref="GOA:O53945" /db_xref="UniProtKB/TrEMBL:O53945" /db_xref="GeneID:885879" /translation="MQRFGTGSSRSWCGRAGTATIAAVLLASGALTGLPPAYAISPPT IDPGALPPDGPPGPLAPMKQNAYCTEVGVLPGTDFQLQPKYMEMLNLNEAWQFGRGDG VKVAVIDTGVTPHPRLPRLIPGGDYVMAGGDGLSDCDAHGTLVASMIAAVPANGAVPL PSVPRRPVTIPTTETPPPPQTVTLSPVPPQTVTVIPAPPPEEGVPPGAPVPGPEPPPA PGPQPPAVDRGGGTVTVPSYSGGRKIAPIDNPRNPHPSAPSPALGPPPDAFSGIAPGV EIISIRQSSQAFGLKDPYTGDEDPQTAQKIDNVETMARAIVHAANMGASVINISDVMC MSARNVIDQRALGAAVHYAAVDKDAVIVAAAGDGSKKDCKQNPIFDPLQPDDPRAWNA VTTVVTPSWFHDYVLTVGAVDANGQPLSKMSIAGPWVSISAPGTDVVGLSPRDDGLIN AIDGPDNSLLVPAGTSFSAAIVSGVAALVRAKFPELSAYQIINRLIHTARPPARGVDN QVGYGVVDPVAALTWDVPKGPAEPPKQLSAPLVVPQPPAPRDMVPIWVAAGGLAGALL IGGAVFGTATLMRRSRKQQ" misc_feature 2034041..2034073 /gene="mycP5" /locus_tag="Rv1796" /note="PS00136 Serine proteases, subtilase family, aspartic acid active site" misc_feature 2034149..2034181 /gene="mycP5" /locus_tag="Rv1796" /note="PS00137 Serine proteases, subtilase family, histidine active site" gene 2035483..2036703 /locus_tag="Rv1797" /db_xref="GeneID:885452" CDS 2035483..2036703 /locus_tag="Rv1797" /function="UNKNOWN" /note="Rv1797, (MTV049.19), len: 406 aa. Conserved hypothetical protein, some similarity to Mycobacterium tuberculosis O05462|Rv3882c|MTCY15F10.30|Z94121 (462 aa), FASTA scores: opt: 181, E(): 9.2e-05, (25.4% identity in 283 aa overlap). Has hydrophobic stretch near N-terminus. TBparse score is 0.938" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216313.1" /db_xref="GI:15608934" /db_xref="UniProtKB/TrEMBL:O53946" /db_xref="GeneID:885452" /translation="MKAQRSFGLALSWPRVTAVFLVDVLILAVASHCPDSWQADHHVA WWVGVGVAAVVTLLSVVSYHGITVISGLATWVRDWSADPGTTLGAGCTPAIDHQRRFG RDTVGVREYNGRLVSVIEVTCGESGPSGRHWHRKSPVPMLPVVAVADGLRQFDIHLDG IDIVSVLVRGGVDAAKASASLQEWEPQGWKSEERAGDRTVADRRRTWLVLRMNPQRNV AAVACRDSLASTLVAATERLVQDLDGQSCAARPVTADELTEVDSAVLADLEPTWSRPG WRHLKHFNGYATSFWVTPSDITSETLDELCLPDSPEVGTTVVTVRLTTRVGSPALSAW VRYHSDTRLPKEVAAGLNRLTGRQLAAVRASLPAPTHRPLLVIPSRNLRDHDELVLPV GQELEHATSSFVGQ" gene 2036700..2038532 /locus_tag="Rv1798" /db_xref="GeneID:885543" CDS 2036700..2038532 /locus_tag="Rv1798" /function="UNKNOWN" /note="Rv1798, (MTV049.20), len: 610 aa. Conserved hypothetical protein, similar to several mycobacterial proteins e.g. O05460|MTCY15F10.28|Rv3884c|Z94121 from M. tuberculosis (619 aa), FASTA scores: opt: 669, E(): 0, (31.0% identity in 549 aa overlap); and O33089|MLCB628.18c|Y14967 from Mycobacterium leprae (573 aa), FASTA scores: opt: 723, E(): 0, (32.4% identity in 568 aa overlap). Also very similar to Rv0282. May belong to the CBXX/CFQX family as last 320 aa domain very similar to several family members. Contains ATP/GTP-binding site motif A (P-loop; PS00017). TBparse score is 0.903" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216314.1" /db_xref="GI:15608935" /db_xref="GOA:P63744" /db_xref="UniProtKB/Swiss-Prot:P63744" /db_xref="GeneID:885543" /translation="MTRPQAAAEDARNAMVAGLLASGISVNGLQPSHNPQVAAQMFTT ATRLDPKMCDAWLARLLAGDQSIEVLAGAWAAVRTFGWETRRLGVTDLQFRPEVSDGL FLRLAITSVDSLACAYAAVLAEAKRYQEAAELLDATDPRHPFDAELVSYVRGVLYFRT KRWPDVLAQFPEATQWRHPELKAAGAAMATTALASLGVFEEAFRRAQEAIEGDRVPGA ANIALYTQGMCLRHVGREEEAVELLRRVYSRDAKFTPAREALDNPNFRLILTDPETIE ARTDPWDPDSAPTRAQTEAARHAEMAAKYLAEGDAELNAMLGMEQAKKEIKLIKSTTK VNLARAKMGLPVPVTSRHTLLLGPPGTGKTSVARAFTKQLCGLTVLRKPLVVETSRTK LLGRYMADAEKNTEEMLEGALGGAVFFDEMHTLHEKGYSQGDPYGNAIINTLLLYMEN HRDELVVFGAGYAKAMEKMLEVNQGLRRRFSTVIEFFSYTPQELIALTQLMGRENEDV ITEEESQVLLPSYTKFYMEQSYSEDGDLIRGIDLLGNAGFVRNVVEKARDHRSFRLDD EDLDAVLASDLTEFSEDQLRRFKELTREDLAEGLRAAVAEKKTK" misc_feature 2037768..2037791 /locus_tag="Rv1798" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 2039159..2039350 /gene="lppT" /locus_tag="Rv1799" /db_xref="GeneID:885901" CDS 2039159..2039350 /gene="lppT" /locus_tag="Rv1799" /function="UNKNOWN" /note="Rv1799, (MTV049.21), len: 63. Probable lppT lipoprotein, has possible signal peptide and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.904" /codon_start=1 /transl_table=11 /product="lipoprotein LppT" /protein_id="NP_216315.1" /db_xref="GI:15608936" /db_xref="UniProtKB/TrEMBL:O53948" /db_xref="GeneID:885901" /translation="MSVKSKNGRLAARVLVALAALFAMIALTGSACLAEGPPLGRNPQ GAPAPVGGTVIVAPMHSGV" misc_feature 2039222..2039254 /gene="lppT" /locus_tag="Rv1799" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2039453..2041420 /gene="PPE28" /locus_tag="Rv1800" /db_xref="GeneID:885465" CDS 2039453..2041420 /gene="PPE28" /locus_tag="Rv1800" /function="UNKNOWN" /note="Rv1800, (MTV049.22), len: 655 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, C-terminal very similar to parts of PE proteins e.g. Z92770|MTCI5.25|Rv0151c (588 aa), FASTA scores: opt: 1269, E(): 0, (41.5% identity in 591 aa overlap). TBparse score is 0.925" /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177839.1" /db_xref="GI:57116915" /db_xref="UniProtKB/TrEMBL:Q79FK2" /db_xref="GeneID:885465" /translation="MLPNFAVLPPEVNSARVFAGAGSAPMLAAAAAWDDLASELHCAA MSFGSVTSGLVVGWWQGSASAAMVDAAASYIGWLSTSAAHAEGAAGLARAAVSVFEEA LAATVHPAMVAANRAQVASLVASNLFGQNAPAIAALESLYECMWAQDAAAMAGYYVGA SAVATQLASWLQRLQSIPGAASLDARLPSSAEAPMGVVRAVNSAIAANAAAAQTVGLV MGGSGTPIPSARYVELANALYMSGSVPGVIAQALFTPQGLYPVVVIKNLTFDSSVAQG AVILESAIRQQIAAGNNVTVFGYSQSATISSLVMANLAASADPPSPDELSFTLIGNPN NPNGGVATRFPGISFPSLGVTATGATPHNLYPTKIYTIEYDGVADFPRYPLNFVSTLN AIAGTYYVHSNYFILTPEQIDAAVPLTNTVGPTMTQYYIIRTENLPLLEPLRSVPIVG NPLANLVQPNLKVIVNLGYGDPAYGYSTSPPNVATPFGLFPEVSPVVIADALVAGTQQ GIGDFAYDVSHLELPLPADGSTMPSTAPGSGTPVPPLSIDSLIDDLQVANRNLANTIS KVAATSYATVLPTADIANAALTIVPSYNIHLFLEGIQQALKGDPMGLVNAVGYPLAAD VALFTAAGGLQLLIIISAGRTIANDISAIVP" gene 2042001..2043272 /gene="PPE29" /locus_tag="Rv1801" /db_xref="GeneID:885491" CDS 2042001..2043272 /gene="PPE29" /locus_tag="Rv1801" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1801, (MTV049.23), len: 423 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to AL022021|MTV049.29|Rv1808 (409 aa), FASTA scores: opt: 1229, E(): 0, (55.2% identity in 422 aa overlap). TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177840.1" /db_xref="GI:57116916" /db_xref="UniProtKB/TrEMBL:Q7D7X9" /db_xref="GeneID:885491" /translation="MDFGLLPPEINSGRMYTGPGPGPMLAAATAWDGLAVELHATAAG YASELSALTGAWSGPSSTSMASAAAPYVAWMSATAVHAELAGAQARLAIAAYEAAFAA TVPPPVIAANRAQLMVLIATNIFGQNTPAIMMTEAQYMEMWAQDAAAMYGYAGSSATA SRMTAFTEPPQTTNHGQLGAQSSAVAQTAATAAGGNLQSAFPQLLSAVPRALQGLALP TASQSASATPQWVTDLGNLSTFLGGAVTGPYTFPGVLPPSGVPYLLGIQSVLVTQNGQ GVSALLGKIGGKPITGALAPLAEFALHTPILGSEGLGGGSVSAGIGRAGLVGKLSVPQ GWTVAAPEIPSPAAALQATRLAAAPIAATDGAGALLGGMALSGLAGRAAAGSTGHPIG SAAAPAVGAAAAAVEDLATEANIFVIPAMDD" gene 2043384..2044775 /gene="PPE30" /locus_tag="Rv1802" /db_xref="GeneID:885542" CDS 2043384..2044775 /gene="PPE30" /locus_tag="Rv1802" /function="UNKNOWN" /note="Rv1802, (MTV049.24), len: 463 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to AL022021|MTV049.30|Rv1809 (468 aa), FASTA scores: opt: 1238, E(): 0, (51.0% identity in 471 aa overlap). TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177841.1" /db_xref="GI:57116917" /db_xref="UniProtKB/Swiss-Prot:O53951" /db_xref="GeneID:885542" /translation="MDFGVLPPEINSGRMYAGPGSGPMLAAAAAWDGLATELQSTAAD YGSVISVLTGVWSGQSSGTMAAAAAPYVAWMSATAALAREAAAQASAAAAAYEAAFAA TVPPPVVAANRAELAVLAATNIFGQNTGAIAAAEARYAEMWAQDAAAMYGYAGSSSVA TQVTPFAAPPPTTNAAGLATQGVAVAQAVGASAGNARSLVSEVLEFLATAGTNYNKTV ASLMNAVTGVPYASSVYNSMLGLGFAESKMVLPANDTVISTIFGMVQFQKFFNPVTPF NPDLIPKSALGAGLGLRSAISSGLGSTAPAISAGASQAGSVGGMSVPPSWAAATPAIR TVAAVFSSTGLQAVPAAAISEGSLLSQMALASVAGGALGGAAARATGGFLGGGRVTAV KKSLKDSDSPDKLRRVVAHMMEKPESVQHWHTDEDGLDDLLAELKKKPGIHAVHMAGG NKAEIAPTISESG" gene complement(2044923..2046842) /gene="PE_PGRS32" /locus_tag="Rv1803c" /db_xref="GeneID:885730" CDS complement(2044923..2046842) /gene="PE_PGRS32" /locus_tag="Rv1803c" /function="UNKNOWN" /note="Rv1803c, (MTV049.25c), len: 639 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Most similar to Rv1768|MTCY28.34|Z95890 (618 aa), FASTA scores: opt: 1827, E(): 0, (53.5% identity in 664 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures 1. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177842.1" /db_xref="GI:57116918" /db_xref="UniProtKB/TrEMBL:Q79FJ9" /db_xref="GeneID:885730" /translation="MWTSQMIVAPAFVDAAAKDLATIGSAISRANAEALVPITALLPA GADDVSAAIAALFATHGQAYQELSAHAVAFHEQFVQLMSAGAAQYASAEAANSSPLQI VGQTALDAINSPVQTLTGRPLIGNGANGVAGTGQNGGDGGWLYGNGGNGGSGGTGQNG GNGGSAGLWGSGGNGGQGGAGANGAAGQPGKAGGSGGNGGAGGWIYGHGGHGGAGGNG GNATAPGGASAGFDGGAGGNGGSGGRGGLLFGNGGNGSVGGMGGQGTNDTAGDSAGSG GLGGNGGNGAQGGWLIGNGGQGGDSGAGGGTDSTQTGVMNGASGGSAGIAGNGGDAGL VGNGGAGGNGGNGAAGSALGTTIFGGSGGVGGSGGDGGNGGWLFGSGASGGNGGQGGD AGTNGFAGFGGSAGGGGWVGAVNFGPISVQGFGLFGHGGDGGNGGDVGAGSLSIQFGA SGGDGGQGGVLYGNGGNGGNAGSGGGTGFEGSAGQGGAAILIGNGGAGGNGATGGTGV GNIIQEAGGDGSDGGAGGSGGLLFGSGGAGGIGGAGGVGGSGNDGGNGGDGGQGGASG LGIGNGGPGGSGGTGGAGGTGGSAGTGGAGGDGGNAALLIGTGGDGGDGVPPAPGGQG GKGGLIGLPGQNGQP" misc_feature complement(2045286..2045360) /gene="PE_PGRS32" /locus_tag="Rv1803c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" misc_feature complement(2045988..2046062) /gene="PE_PGRS32" /locus_tag="Rv1803c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(2047023..2047349) /locus_tag="Rv1804c" /db_xref="GeneID:885588" CDS complement(2047023..2047349) /locus_tag="Rv1804c" /function="UNKNOWN" /note="Rv1804c, (MTV049.26c), len: 108 aa. Conserved hypothetical protein, similar to several hypothetical Mycobacterium tuberculosis proteins that may be exported (hydrophobic stretch at N-terminus) e.g. O07222|Rv1810|MTCY16F9.04C|Z96073 (118 aa), FASTA scores: opt: 361, E(): 2.3e-19, (53.5% identity in 101 aa overlap); Rv0622, Rv1690, and Rv3067, etc. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216320.1" /db_xref="GI:15608941" /db_xref="UniProtKB/TrEMBL:O53953" /db_xref="GeneID:885588" /translation="MRVVSTLLSIPLMIGLAVPAHAGPSGDDAVFLASLERAGITYSH PDQAIASGKAVCALVESGESGLQVVNELRTRNPGFSMDGCCKFAAISAHVYCPHQITK TSVSAK" gene complement(2047687..2048034) /locus_tag="Rv1805c" /db_xref="GeneID:885470" CDS complement(2047687..2048034) /locus_tag="Rv1805c" /function="UNKNOWN" /note="Rv1805c, (MTV049.27c), len: 115 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216321.1" /db_xref="GI:15608942" /db_xref="UniProtKB/TrEMBL:O53954" /db_xref="GeneID:885470" /translation="MTASVVATSRERHSHKAAKQRACEITDFEPEGRFRVRKRRRGRI GTKRSSISDTDYRRDSFRSHLLTAGAHGDADAQHKGMTAQQTTELGTPLVRALAPHGV SGRSSRKPLGLNP" gene 2048072..2048371 /gene="PE20" /locus_tag="Rv1806" /db_xref="GeneID:885537" CDS 2048072..2048371 /gene="PE20" /locus_tag="Rv1806" /function="UNKNOWN" /note="Rv1806, (MTV049.28), len: 99 aa. Member of the Mycobacterium tuberculosis PE family of gly-, ala-rich proteins (see citation below), most similar to Rv1788|MTV049.10|AL022021 (99 aa), FASTA scores: opt: 334, E(): 4.7 e-15, (59.8% identity in 97 aa overlap). TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177843.1" /db_xref="GI:57116919" /db_xref="UniProtKB/TrEMBL:Q7D7X6" /db_xref="GeneID:885537" /translation="MAFVLVCPDALAIAAGQLRHVGSVIAARNAVAAPATAELAPAAA DEVSALTATQFNFHAAMYQAVGAQAIAMNEAFVAMLGASADSYAATEAANIIAVS" gene 2048398..2049597 /gene="PPE31" /locus_tag="Rv1807" /db_xref="GeneID:885072" CDS <2048398..2049597 /gene="PPE31" /locus_tag="Rv1807" /function="UNKNOWN" /note="Rv1807, (MTV049.29), len: 399 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to Rv1789|MTV049.11|AL022021 (393 aa), FASTA scores: opt: 1169, E(): 0, (49.5% identity in 412 aa overlap). TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177653.1" /db_xref="GI:57116920" /db_xref="UniProtKB/TrEMBL:Q79FJ7" /db_xref="GeneID:885072" /translation="LDFATLPPEINSARMYSGAGSAPMLAAASAWHGLSAELRASALS YSSVLSTLTGEEWHGPASASMTAAAAPYVAWMSVTAVRAEQAGAQAEAAAAAYEAAFA ATVPPPVIEANRAQLMALIATNVLGQNAPAIAATEAQYAEMWSQDAMAMYGYAGASAA ATQLTPFTEPVQTTNASGLAAQSAAIAHATGASAGAQQTTLSQLIAAIPSVLQGLSSS TAATFASGPSGLLGIVGSGSSWLDKLWALLDPNSNFWNTIASSGLFLPSNTIAPFLGL LGGVAAADAAGDVLGEATSGGLGGALVAPLGSAGGLGGTVAAGLGNAATVGTLSVPPS WTAAAPLASPLGSALGGTPMVAPPPAVAAGMPGMPFGTMGGQGFGRAVPQYGFRPNFV ARPPAAG" gene 2049921..2051150 /gene="PPE32" /locus_tag="Rv1808" /db_xref="GeneID:885590" CDS 2049921..2051150 /gene="PPE32" /locus_tag="Rv1808" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1808, (MTV049.30), len: 409 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to Rv1800|MTV049.22|AL022021 (655 aa), FASTA scores: opt: 1225, E(): 0, (55.1% identity in 423 aa overlap). Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177844.1" /db_xref="GI:57116921" /db_xref="UniProtKB/TrEMBL:Q79FJ6" /db_xref="GeneID:885590" /translation="MDFGALPPEINSGRMYAGPGSGPLLAAAAAWDALAAELYSAAAS YGSTIEGLTVAPWMGPSSITMAAAVAPYVAWISVTAGQAEQAGAQAKIAAGVYETAFA ATVPPPVIEANRALLMSLVATNIFGQNTPAIAATEAHYAEMWAQDAAAMYGYAGSSAT ASQLAPFSEPPQTTNPSATAAQSAVVAQAAGAAASSDITAQLSQLISLLPSTLQSLAT TATATSASAGWDTVLQSITTILANLTGPYSIIGLGAIPGGWWLTFGQILGLAQNAPGV AALLGPKAAAGALSPLAPLRGGYIGDITPLGGGATGGIARAIYVGSLSVPQGWAEAAP VMRAVASVLPGTGAAPALAAEAPGALFGEMALSSLAGRALAGTAVRSGAGAARVAGGS VTEDVASTTTIIVIPAD" misc_feature 2050947..2050964 /gene="PPE32" /locus_tag="Rv1808" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene 2051282..2052688 /gene="PPE33" /locus_tag="Rv1809" /db_xref="GeneID:885555" CDS 2051282..2052688 /gene="PPE33" /locus_tag="Rv1809" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1809, (MTV049.31), len: 468 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, most similar to RV1802AL022021|MTV049.23 (463 aa), FASTA scores: opt: 1238, E(): 0, (51.2% identity in 471 aa overlap). TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177845.1" /db_xref="GI:57116922" /db_xref="UniProtKB/TrEMBL:Q79FJ5" /db_xref="GeneID:885555" /translation="MDFGLQPPEITSGEMYLGPGAGPMLAAAVAWDGLAAELQSMAAS YASIVEGMASESWLGPSSAGMAAAAAPYVTWMSGTSAQAKAAADQARAAVVAYETAFA AVVPPPQIAANRSQLISLVATNIFGQNTAAIAATEAEYGEMWAQDTMAMFGYASSSAT ASRLTPFTAPPQTTNPSGLAGQAAATGQATALASGTNAVTTALSSAAAQFPFDIIPTL LQGLATLSTQYTQLMGQLINAIFGPTGATTYQNVFVTAANVTKFSTWANDAMSAPNLG MTEFKVFWQPPPAPEIPKSSLGAGLGLRSGLSAGLAHAASAGLGQANLVGDLSVPPSW ASATPAVRLVANTLPATSLAAAPATQIPANLLGQMALGSMTGGALGAAAPAIYTGSGA RARANGGTPSAEPVKLEAVIAQLQKQPDAVRHWNVDKADLDGLLDRLSKQPGIHAVHV SNGDKPKVALPDTQLGSH" gene 2052933..2053289 /locus_tag="Rv1810" /db_xref="GeneID:885591" CDS 2052933..2053289 /locus_tag="Rv1810" /function="UNKNOWN" /note="Rv1810, (MTCY16F9.04c), len: 118 aa. Conserved hypothetical protein, similar to several hypothetical Mycobacterium tuberculosis proteins that may be exported (possible N-terminal signal sequence) e.g. O53953|Rv1804c|MTV049.26c|AL022021 (108 aa), FASTA scores: opt: 361, E(): 9.6e-17, (53.5% identity in 101 aa overlap); Rv0622, and Rv1690, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216326.1" /db_xref="GI:15608947" /db_xref="UniProtKB/TrEMBL:O07222" /db_xref="GeneID:885591" /translation="MQLQRTMGQCRPMRMLVALLLSAATMIGLAAPGKADPTGDDAAF LAALDQAGITYADPGHAITAAKAMCGLCANGVTGLQLVADLRDYNPGLTMDSAAKFAA IASGAYCPEHLEHHPS" gene 2053443..2054147 /gene="mgtC" /locus_tag="Rv1811" /db_xref="GeneID:885439" CDS 2053443..2054147 /gene="mgtC" /locus_tag="Rv1811" /EC_number="3.6.3.1" /function="THOUGHT TO BE INVOLVED IN Mg2+ TRANSPORT (IMPORT). MAY ACT AS AN ACCESSORY PROTEIN FOR MGTB SO MEDIATING MAGNESIUM INFLUX INTO THE CYTOSOL [CATALYTIC ACTIVITY: ATP + H(2)O + Mg(2+)(OUT) = ADP + PHOSPHATE + Mg(2+)(IN)]." /note="Rv1811, (MTCY16F9.03c), len: 234 aa. Possible mgtC, magnesium (Mg2+) transport P-type ATPase C (transmembrane protein) (EC 3.6.3.1), highly similar to many e.g. NP_442124.1|NC_000911 Mg2+ transport ATPase from Synechocystis sp. strain PCC 6803 (234 aa); NP_251248.1|NC_002516 probable transport protein from Pseudomonas aeruginosa (230 aa); P22037|ATMC_SALTY|STM3764 magnesium transport ATPase protein C from Salmonella typhimurium (231 aa), FASTA scores: opt: 545, E(): 4.1e-30, (42.3% identity in 220 aa overlap); N-terminus of NP_213315.1|NC_000918 Mg(2+) transport ATPase from Aquifex aeolicus (225 aa); etc. BELONGS TO THE MGTC / SAPB FAMILY" /codon_start=1 /transl_table=11 /product="Mg2+ transport P-type ATPase C" /protein_id="NP_216327.1" /db_xref="GI:15608948" /db_xref="GOA:O07221" /db_xref="UniProtKB/TrEMBL:O07221" /db_xref="GeneID:885439" /translation="MQTLTVADFALRLAVGVGCGAIIGLERQWRARMAGLRTNALVAT GATLFVLYAVATEDSSPTRVASYVVSGIGFLGGGVILREGFNVRGLNTAATLWCSAAV GVLAASGHLVFTLIGTGTIVAVHLLGRPLGRLVDRDNAVEDEGLQPYQVRVICRPKAE TYVRAHIVQRTSSNDITLRGIRTGPAGDDNITLTAHLLMVGHTPAKLERLVAELSLQP GVYAVHWYAGEHAQAE" gene complement(2054157..2055359) /locus_tag="Rv1812c" /db_xref="GeneID:885487" CDS complement(2054157..2055359) /locus_tag="Rv1812c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1812c, (MTCY16F9.02), len: 400 aa. Probable dehydrogenase (EC 1.-.-.-), similar to other dehydrogenases/oxidases e.g. AE001947|AE001947_10 NADH dehydrogenase II of Deinococcus radiodurans (379 aa), FASTA scores: opt: 404, E(): 3.4e-18, (26.4% identity in 363 aa overlap) and DHNA_HAEIN|P44856 nadh dehydrogenase (EC 1.6.99.3) (444 aa), FASTA scores: opt: 200, E(): 8.5e-06, (23.3% identity in 258 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical dehydrogenases Rv0392c, and Rv1854c|MTCY359.19 ndh probable NADH dehydrogenase (31.5% identity in 321 aa overlap)." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_216328.1" /db_xref="GI:15608949" /db_xref="GOA:O07220" /db_xref="UniProtKB/TrEMBL:O07220" /db_xref="GeneID:885487" /translation="MTRVVVIGSGFAGLWAALGAARRLDELAVLAGTVDVMVVSNKPF HDIRVRNYEADLSACRIPLGDVLGPAGVAHVTAEVTAIDADGRRVTTSTGASYSYDRL VLASGSHVVKPALPGLAEFGFDVDTYDGAVRLQQHLQGLAGGPLTSAAATVVVVGAGL TGIETACELPGRLHALFARGDGVTPRVVLIDHNPFVGSDMGLSARPVIEQALLDNGVE TRTGVSVAAVSPGGVTLSSGERLAAATVVWCAGMRASRLTEQLPVARDRLGRLQVDDY LRVIGVPAMFAAGDVAAARMDDEHLSVMSCQHGRPMGRYAGCNVINDLFDQPLLALRI PWYVTVLDLGSAGAVYTEGWERKVVSQGAPAKTTKQSINTRRIYPPLNGSRADLLAAA APRVQPRP" gene complement(2055681..2056112) /locus_tag="Rv1813c" /db_xref="GeneID:885546" CDS complement(2055681..2056112) /locus_tag="Rv1813c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1813c, (MTCY16F9.01), len: 143 aa. Conserved hypothetical protein. Possibly a exported protein with potential N-terminal signal sequence. Similar to Q11050|Rv1269c|MTCY50.13 hypothetical protein from Mycobacterium tuberculosis (124 aa), (42.7% identity in 143 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216329.1" /db_xref="GI:15608950" /db_xref="GOA:P64889" /db_xref="UniProtKB/Swiss-Prot:P64889" /db_xref="GeneID:885546" /translation="MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMM SEIAGLPIPPIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTR CGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN" gene 2056521..2057423 /gene="erg3" /locus_tag="Rv1814" /db_xref="GeneID:885880" CDS 2056521..2057423 /gene="erg3" /locus_tag="Rv1814" /EC_number="1.3.-.-" /function="INVOLVED IN LIPID DESATURATION" /note="Rv1814, (MTCY1A11.29c), len: 300 aa. erg3, transmembrane C-5 sterol desaturase (EC 1.3.-.-) (see *), weak similarity to several e.g. ERG3_YEAST|P32353 c-5 sterol desaturase (365 aa), FASTA scores: opt: 154, E(): 0.0011, (22.9% identity in 288 aa overlap). BELONGS TO THE STEROL DESATURASE FAMILY. [* Note: work of Jackson, C.J., Lamb, D.C., Kelly, D.E., Kelly, S.L., Characterization of a sterol delta 5,6-desaturase homolog in Mycobacterium bovis (BCG). Submitted (JUN-2000) to the EMBL/GenBank/DDBJ databases]." /codon_start=1 /transl_table=11 /product="membrane-bound C-5 sterol desaturase erg3 (sterol-c5-desaturase)" /protein_id="NP_216330.1" /db_xref="GI:15608951" /db_xref="GOA:Q50619" /db_xref="UniProtKB/Swiss-Prot:P68435" /db_xref="GeneID:885880" /translation="MRDPVLFAIPCFLLLLILEWTAARKLESIETAATGQPRPASGAY LTRDSVASISMGLVSIATTAGWKSLALLGYAAIYAYLAPWQLSAHRWYTWVIAIVGVD LLYYSYHRIAHRVRLIWATHQAHHSSEYFNFATALRQKWNNSGEILMWVPLPLMGLPP WMVFCSWSLNLIYQFWVHTERIDRLPRWFEFVFNTPSHHRVHHGMDPVYLDKNYGGIL IIWDRLFGSFQPELFRPHYGLTKRVDTFNIWKLQTREYVAIVRDWRSATRLRDRLGYV FGPPGWEPRTIDKSNAAASLVTSR" gene 2057528..2058193 /locus_tag="Rv1815" /db_xref="GeneID:885430" CDS 2057528..2058193 /locus_tag="Rv1815" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1815, (MTCY1A11.28c), len: 221 aa. Conserved hypothetical protein, similar to G473456 hypothetical protein from Mycobacterium fortuitum (255 aa), FASTA scores: opt: 182, E(): 3.2e-05, (29.6% identity in 230 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216331.1" /db_xref="GI:15608952" /db_xref="UniProtKB/Swiss-Prot:Q50618" /db_xref="GeneID:885430" /translation="MVRLVPRAFAATVALLAAGFSPATASADPVLVFPGMEIRQDNHV CTLGYVDPALKIAFTAGHCRGGGAVTSRDYKVIGHLRAIRDNTPSGSTVATHELIADY EAIVLADDVTASNILPSGRALESRPGVVLHPGQAVCHFGVSTGETCGTVESVNNGWFT MSHGVLSEKGDSGGPVYLAPDGGPAQIVGIFNSVWGGFPAAVSWRSTSEQVHADLGVT PLA" gene 2058256..2058960 /locus_tag="Rv1816" /db_xref="GeneID:885340" CDS 2058256..2058960 /locus_tag="Rv1816" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1816, (MTCY1A11.27c), len: 234 aa. Possible transcriptional regulatory protein. MEME analysis suggests similarity to putative Mycobacterium tuberculosis transcriptional regulators, Rv0653c, Rv0681. Contains helix-turn-helix motif at aa 38-59 (+4.30 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216332.1" /db_xref="GI:15608953" /db_xref="GOA:P67438" /db_xref="UniProtKB/Swiss-Prot:P67438" /db_xref="GeneID:885340" /translation="MCQTCRVGKRRDAREQIEAKIVELGRRQLLDHGAAGLSLRAIAR NLGMVSSAVYRYVSSRDELLTLLLVDAYSDLADTVDRARDDTVADSWSDDVIAIARAV RGWAVTNPARWALLYGSPVPGYHAPPDRTAGVATRVVGAFFDAIAAGIATGDIRLTDD VAPQPMSSDFEKIRQEFGFPGDDRVVTKCFLLWAGVVGAISLEVFGQYGADMLTDPGV VFDAQTRLLVAVLAEH" repeat_region 2059441..2059498 /note="58 bp Mycobacterial Interspersed Repetitive Unit, Class III" repeat_region 2059518..2059575 /note="58 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene 2059595..2061058 /locus_tag="Rv1817" /db_xref="GeneID:885548" CDS 2059595..2061058 /locus_tag="Rv1817" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1817, (MTCY1A11.26c), len: 487 aa. Possible flavoprotein, similar to G746486 flavoprotein subunit of fumarate reductase fad domain homologue (474 aa), FASTA scores: opt: 223, E(): 5.7e-07, (24.1% identity in 489 aa overlap); and AJ236923|SFR236923_3 soluble fumarate reductase of Shewanella frigidimarina ifcA (588 aa), FASTA scores: opt: 310, E(): 2.5e-11, (27.3% identity in 484 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216333.1" /db_xref="GI:15608954" /db_xref="GOA:Q50616" /db_xref="UniProtKB/TrEMBL:Q50616" /db_xref="GeneID:885548" /translation="MSTDIPATVSAETVTSWSDDVDVTVIGFGIAGGCAAVSAAAAGA RVLVLERAAAAGGTTALAGGHFYLGGGTTVQLATGHPDSPEEMYKYLVAVSREPDHDK IRAYCDGSVEHFNWLEGLGFQFERSYFPGKAVIQPNTEGLMFTGNEKVWPFLELAVPA PRGHKVPVPGDTGGAAMVIDLLLKRAASLGIQIRYETGATELIVDGTGKVTGVMWKRF SETGAIKAKSVIIAAGGFVMNPDMVAKYTPKLAEKPFVLGNTYDDGLGIRLGVSAGGA TQHMDQMFITAPPYPPSILLTGIIVNKLGQRFVAEDSYHSRTAGFIMEQPDSAAYLIV DEAHLEHPKMPLVPLIDGWETVVEMEAALGIPPGNLAATLDRYNAYAARGADPDFHKQ PEFLAAQDNGPWGAFDMSLGKAMYAGFTLGGLATSVDGQVLRDDGAVVAGLYAVGACA SNIAQDGKGYASGTQLGEGSFFGRRAGAHAAARAQGM" gene complement(2061178..2062674) /gene="PE_PGRS33" /locus_tag="Rv1818c" /db_xref="GeneID:885551" CDS complement(2061178..2062674) /gene="PE_PGRS33" /locus_tag="Rv1818c" /function="UNKNOWN. SEEMS TO INFLUENCE BOTH CELL SURFACE INTERACTIONS AMONG MYCOBACTERIA AND THE INTERACTIONS OF BACTERIA WITH MACROPHAGES." /experiment="experimental evidence, no additional details recorded" /note="Rv1818c, (MTCY1A11.25), len: 498 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to many. Contains 2 x PS00583 pfkB family of carbohydrate kinases signature 1. Supposed localised to the cell surface (see citations below)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177846.1" /db_xref="GI:57116923" /db_xref="GOA:Q50615" /db_xref="UniProtKB/Swiss-Prot:Q50615" /db_xref="GeneID:885551" /translation="MSFVVTIPEALAAVATDLAGIGSTIGTANAAAAVPTTTVLAAAA DEVSAAMAALFSGHAQAYQALSAQAALFHEQFVRALTAGAGSYAAAEAASAAPLEGVL DVINAPALALLGRPLIGNGANGAPGTGANGGDGGILIGNGGAGGSGAAGMPGGNGGAA GLFGNGGAGGAGGNVASGTAGFGGAGGAGGLLYGAGGAGGAGGRAGGGVGGIGGAGGA GGNGGLLFGAGGAGGVGGLAADAGDGGAGGDGGLFFGVGGAGGAGGTGTNVTGGAGGA GGNGGLLFGAGGVGGVGGDGVAFLGTAPGGPGGAGGAGGLFGVGGAGGAGGIGLVGNG GAGGSGGSALLWGDGGAGGAGGVGSTTGGAGGAGGNAGLLVGAGGAGGAGALGGGATG VGGAGGNGGTAGLLFGAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDGGAGGNGTGAKG GDGGAGGGAILVGNGGNGGNAGSGTPNGSAGTGGAGGLLGKNGMNGLP" misc_feature complement(2061286..2061360) /gene="PE_PGRS33" /locus_tag="Rv1818c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" misc_feature complement(2061814..2061888) /gene="PE_PGRS33" /locus_tag="Rv1818c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(2062809..2064728) /locus_tag="Rv1819c" /db_xref="GeneID:885539" CDS complement(2062809..2064728) /locus_tag="Rv1819c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DRUGS ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1819c, (MTCY1A11.24), len: 639 aa. Probable drugs-transport transmembrane ATP-binding protein ABC transporter (see citation below), equivalent to AL008609|MLCB1788.47 hypothetical ABC transporter from Mycobacterium leprae (638 aa), (74.9% identity in 634 aa overlap). Also similar to other transmembrane ATP-binding proteins e.g. Q57335|Y036_HAEIN hypothetical ABC transporter ATP-binding protein from Haemophilus influenzae (592 aa), FASTA scores: opt: 1235, E(): 2.8e-61, (40.8% identity in 623 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="drugs-transport transmembrane ATP-binding protein ABC transporter" /protein_id="NP_216335.1" /db_xref="GI:15608956" /db_xref="GOA:Q50614" /db_xref="UniProtKB/Swiss-Prot:Q50614" /db_xref="GeneID:885539" /translation="MGPKLFKPSIDWSRAFPDSVYWVGKAWTISAICVLAILVLLRYL TPWGRQFWRITRAYFVGPNSVRVWLMLGVLLLSVVLAVRLNVLFSYQGNDMYTALQKA FEGIASGDGTVKRSGVRGFWMSIGVFSVMAVLHVTRVMADIYLTQRFIIAWRVWLTHH LTQDWLDGRAYYRDLFIDETIDNPDQRIQQDVDIFTAGAGGTPNAPSNGTASTLLFGA VQSIISVISFTAILWNLSGTLNIFGVSIPRAMFWTVLVYVFVATVISFIIGRPLIWLS FRNEKLNAAFRYALVRLRDAAEAVGFYRGERVEGTQLQRRFTPVIDNYRRYVRRSIAF NGWNLSVSQTIVPLPWVIQAPRLFAGQIDFGDVGQTATSFGNIHDSLSFFRNNYDAFA SFRAAIIRLHGLVDANEKGRALPAVLTRPSDDESVELNDIEVRTPAGDRLIDPLDVRL DRGGSLVITGRSGAGKTTLLRSLAELWPYASGTLHRPGGENETMFLSQLPYVPLGTLR DVVCYPNSAAAIPDATLRDTLTKVALAPLCDRLDEERDWAKVLSPGEQQRVAFARILL TKPKAVFLDESTSALDTGLEFALYQLLRSELPDCIVISVSHRPALERLHENQLELLGG GQWRLAPVEAAPAEV" misc_feature complement(2063034..2063078) /locus_tag="Rv1819c" /note="PS00211 ABC transporters family signature" misc_feature complement(2063328..2063351) /locus_tag="Rv1819c" /note="PS00017 ATP/GTP-binding site motif A" gene 2064799..2066442 /gene="ilvG" /locus_tag="Rv1820" /db_xref="GeneID:885738" CDS 2064799..2066442 /gene="ilvG" /locus_tag="Rv1820" /function="VALINE AND ISOLEUCINE BIOSYNTHESIS (FIRST STEP) [CATALYTIC ACTIVITY : 2-ACETOLACTATE + CO(2) = 2 PYRUVATE]" /note="Rv1820, (MTCY1A11.23c), len: 547 aa. Probable ilvG, acetolactate synthase (EC 4.1.3.18). Equivalent to AL008609|MLCB1788.46c ilvG from Mycobacterium leprae (548 aa) (86.1% identity in 548 aa overlap). Similar to ILVB_KLEPN|P27696 (559 aa), FASTA scores: opt: 660, E(): 2.9e-34, (29.1% identity in 549 aa overlap). Also similar to other Mycobacterium tuberculosis Ilv proteins e.g. Rv3003c (ilvB), etc. Contains PS00187 Thiamine pyrophosphate enzymes signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216336.1" /db_xref="GI:15608957" /db_xref="GOA:P66946" /db_xref="UniProtKB/Swiss-Prot:P66946" /db_xref="GeneID:885738" /translation="MSTDTAPAQTMHAGRLIARRLKASGIDTVFTLSGGHLFSIYDGC REEGIRLIDTRHEQTAAFAAEGWSKVTRVPGVAALTAGPGITNGMSAMAAAQQNQSPL VVLGGRAPALRWGMGSLQEIDHVPFVAPVARFAATAQSAENAGLLVDQALQAAVSAPS GVAFVDFPMDHAFSMSSDNGRPGALTELPAGPTPAGDALDRAAGLLSTAQRPVIMAGT NVWWGHAEAALLRLVEERHIPVLMNGMARGVVPADHRLAFSRARSKALGEADVALIVG VPMDFRLGFGGVFGSTTQLIVADRVEPAREHPRPVAAGLYGDLTATLSALAGSGGTDH QGWIEELATAETMARDLEKAELVDDRIPLHPMRVYAELAALLERDALVVIDAGDFGSY AGRMIDSYLPGCWLDSGPFGCLGSGPGYALAAKLARPQRQVVLLQGDGAFGFSGMEWD TLVRHNVAVVSVIGNNGIWGLEKHPMEALYGYSVVAELRPGTRYDEVVRALGGHGELV SVPAELRPALERAFASGLPAVVNVLTDPSVAYPRRSNLA" misc_feature 2066062..2066121 /gene="ilvG" /locus_tag="Rv1820" /note="PS00187 Thiamine pyrophosphate enzymes signature" gene 2066457..2068883 /gene="secA2" /locus_tag="Rv1821" /gene_synonym="azi" /gene_synonym="div" /db_xref="GeneID:885594" CDS 2066457..2068883 /gene="secA2" /locus_tag="Rv1821" /gene_synonym="azi" /gene_synonym="div" /function="INVOLVED IN PROTEIN EXPORT. MAY INTERACTS WITH THE SECY/SECE SUBUNITS. SECA HAS A CENTRAL ROLE IN COUPLING THE HYDROLYSIS OF ATP TO THE TRANSFER OF PRE-SECRETORY PERIPLASMIC AND OUTER MEMBRANE PROTEINS ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="SecA2; functions in protein export; can interact with acidic membrane phospholipids and the SecYEG protein complex; binds to preproteins; binds to ATP and undergoes a conformational change to promote membrane insertion of SecA/bound preprotein; ATP hydrolysis appears to drive release of the preprotein from SecA and deinsertion of SecA from the membrane; additional proteins SecD/F/YajC aid SecA recycling; exists in an equilibrium between monomers and dimers; may possibly form higher order oligomers; proteins in this cluster correspond to SecA2; which is non-essential and seems to play a role in secretion of a subset of proteins" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecA" /protein_id="NP_216337.1" /db_xref="GI:15608958" /db_xref="GOA:P66785" /db_xref="UniProtKB/Swiss-Prot:P66785" /db_xref="GeneID:885594" /translation="MNVHGCPRIAACRCTDTHPRGRPAFAYRWFVPKTTRAQPGRLSS RFWRLLGASTEKNRSRSLADVTASAEYDKEAADLSDEKLRKAAGLLNLDDLAESADIP QFLAIAREAAERRTGLRPFDVQLLGALRMLAGDVIEMATGEGKTLAGAIAAAGYALAG RHVHVVTINDYLARRDAEWMGPLLDAMGLTVGWITADSTPDERRTAYDRDVTYASVNE IGFDVLRDQLVTDVNDLVSPNPDVALIDEADSVLVDEALVPLVLAGTTHRETPRLEII RLVAELVGDKDADEYFATDSDNRNVHLTEHGARKVEKALGGIDLYSEEHVGTTLTEVN VALHAHVLLQRDVHYIVRDDAVHLINASRGRIAQLQRWPDGLQAAVEAKEGIETTETG EVLDTITVQALINRYATVCGMTGTALAAGEQLRQFYQLGVSPIPPNKPNIREDEADRV YITTAAKNDGIVEHITEVHQRGQPVLVGTRDVAESEELHERLVRRGVPAVVLNAKNDA EEARVIAEAGKYGAVTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVVGTGRHHTE RLDNQLRGRAGRQGDPGSSVFFSSWEDDVVAANLDHNKLPMATDENGRIVSPRTGSLL DHAQRVAEGRLLDVHANTWRYNQLIAQQRAIIVERRNTLLRTVTAREELAELAPKRYE ELSDKVSEERLETICRQIMLYHLDRGWADHLAYLADIRESIHLRALGRQNPLDEFHRM AVDAFASLAADAIEAAQQTFETANVLDHEPGLDLSKLARPTSTWTYMVNDNPLSDDTL SALSLPGVFR" gene 2069080..2069709 /gene="pgsA2" /locus_tag="Rv1822" /db_xref="GeneID:885126" CDS 2069080..2069709 /gene="pgsA2" /locus_tag="Rv1822" /EC_number="2.7.8.5" /function="THOUGHT TO BE INVOLVED IN CARDIOLIPIN BIOSYNTHESIS; GENERATES CARDIOLIPIN FROM PHOSPHATIDYLGLYCEROL AND CDP-DIACYLGLYCEROL [CATALYTIC ACTIVITY : MAY BE: PHOSPHATIDYLGLYCEROL + PHOSPHATIDYLGLYCEROL -> CARDIOLIPIN + GLYCEROL, OR: CDP-DIACYLGLYCEROL + GLYCEROL 3-PHOSPHATE = CMP + 3-(3-PHOSPHATIDYL)-GLYCEROL 1-PHOSPHATE]." /note="Rv1822, (MTCY1A11.21c), len: 209 aa. Probable pgsA2, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyl-transferase (EC 2.7.8.5) (see citation below), integral membrane protein, equivalent to AL008609|MLCB1788_17 phosphatidyltransferase from Mycobacterium leprae (206 aa), FASTA score: (76.6% identity in 205 aa overlap). Also highly similar or similar to others e.g. CAB88885.1|AL353861 putative CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyl-transferase from Streptomyces coelicolor (215 aa); AAC44003.1|U29587 phosphatidylglycerol phosphate synthase from Rhodobacter sphaeroides (227 aa); NP_405431.1|NC_003143 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase from Yersinia pestis (182 aa); P06978|PGSA_ECOLI CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase from Escherichia coli (181 aa), FASTA scores: opt: 252, E(): 2.8e-09, (29.7% identity in 175 aa overlap); etc. Also similar to Rv2746c|PGSA3|MTV002.11c CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE (PGP SYNTHASE) from Mycobacterium tuberculosis (209 aa). Contains PS00379 CDP-alcohol phosphatidyltransferases signature; and PS00075 Dihydrofolate reductase signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY." /codon_start=1 /transl_table=11 /product="CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase" /protein_id="NP_216338.1" /db_xref="GI:15608959" /db_xref="GOA:P63753" /db_xref="UniProtKB/Swiss-Prot:P63753" /db_xref="GeneID:885126" /translation="MEPVLTQNRVLTVPNMLSVIRLALIPAFVYVVLSAHANGWGVAI LVFSGVSDWADGKIARLLNQSSRLGALLDPAVDRLYMVTVPIVFGLSGIVPWWFVLTL LTRDALLAGTLPLLWSRGLSALPVTYVGKAATFGFMVGFPTILLGQCDPLWSHVLLAC GWAFLIWGMYAYLWAFVLYAVQMTMVVRQMPKLKGRAHRPAAQNAGERG" misc_feature 2069242..2069310 /gene="pgsA2" /locus_tag="Rv1822" /note="PS00379 CDP-alcohol phosphatidyltransferases signature" misc_feature 2069341..2069367 /gene="pgsA2" /locus_tag="Rv1822" /note="PS00075 Dihydrofolate reductase signature" gene 2069702..2070625 /locus_tag="Rv1823" /db_xref="GeneID:885547" CDS 2069702..2070625 /locus_tag="Rv1823" /function="UNKNOWN" /note="Rv1823, (MTCY01A11.20), len: 307 aa. Conserved hypothetical protein, similar to P71582|MTCY10H4.12|RV0012 hypothetical protein CY10H4.12 from Mycobacterium tuberculosis (262 aa), FASTA scores: opt: 304, E(): 1.5e-12, (30.1% identity in 246 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216339.1" /db_xref="GI:15608960" /db_xref="GOA:P64891" /db_xref="UniProtKB/Swiss-Prot:P64891" /db_xref="GeneID:885547" /translation="MAESDRLLGGYDPNAGYSAHAGAQPQRIPVPSLLRALLSEHLDA GYAAVAAERERAAAPRCWQARAVSWMWQALAATLVAAVFAAAVAQARSVAPGVRAAQQ LLVASVRSTQAAATTLAQRRSTLSAKVDDVRRIVLADDAEGQRLLARLDVLSLAAASA PVVGPGLTVTVTDPGASPNLSDVSKQRVSGSQQIILDRDLQLVVNSLWESGAEAISID GVRIGPNVTIRQAGGAILVDNNPTSSPYTILAVGPPHAMQDVFDRSAGLYRLRLLETS YGVGVSVNVGDGLALPAGATRDVKFAKQIGP" gene 2070654..2071019 /locus_tag="Rv1824" /db_xref="GeneID:885719" CDS 2070654..2071019 /locus_tag="Rv1824" /function="UNKNOWN" /note="Rv1824, (MTCY1A11.19c), len: 121 aa. Conserved hypothetical membrane protein similar to P28265|SBP_BACSU sbp protein from Bacillus subtilis (121 aa), FASTA scores: opt: 261, E(): 1.9e-12, (38.9% identity in 113 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216340.1" /db_xref="GI:15608961" /db_xref="GOA:P64893" /db_xref="UniProtKB/Swiss-Prot:P64893" /db_xref="GeneID:885719" /translation="MGSDTAWSPARMIGIAALAVGIVLGLVFHPGVPEVIQPYLPIAV VAALDAVFGGLRAYLERIFDPKVFVVSFVFNVLVAALIVYVGDQLGVGTQLSTAIIVV LGIRIFGNTAALRRRLFGA" gene 2071036..2071914 /locus_tag="Rv1825" /db_xref="GeneID:885726" CDS 2071036..2071914 /locus_tag="Rv1825" /function="UNKNOWN" /note="Rv1825, (MTCY1A11.18c), len: 292 aa. Conserved hypothetical protein, weak similarity to Mycobacterium tuberculosis hypothetical proteins Q50610|MTCY1A11.20C|Rv1823|Z78020 (307 aa), FASTA scores: opt: 182, E(): 0.00044, (29.9% identity in 204 aa overlap); and Rv0012. Has a hydrophobic stretch, TMhelix from aa 67 to 85." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216341.1" /db_xref="GI:15608962" /db_xref="GOA:P64895" /db_xref="UniProtKB/Swiss-Prot:P64895" /db_xref="GeneID:885726" /translation="MSENRPEPVAAETSAATTARHSQADAGAHDAVRRGRHELPADHP RSKVGPLRRTRLTEILRGGRSRLVFGTLAILLCLVLGVAIVTQVRQTDSGDSLETARP ADLLVLLDSLRQREATLNAEVIDLQNTLNALQASGNTDQAALESAQARLAALSILVGA VGATGPGVMITIDDPGPGVAPEVMIDVINELRAAGAEAIQINDAHRSVRVGVDTWVVG VPGSLTVDTKVLSPPYSILAIGDPPTLAAAMNIPGGAQDGVKRVGGRMVVQQADRVDV TALRQPKQHQYAQPVK" gene 2071952..2072356 /gene="gcvH" /locus_tag="Rv1826" /db_xref="GeneID:885720" CDS 2071952..2072356 /gene="gcvH" /locus_tag="Rv1826" /function="THE GLYCINE CLEAVAGE SYSTEM CATALYSES THE DEGRADATION OF GLYCINE. THE H PROTEIN SHUTTLES THE METHYLAMINE GROUP OF GLYCINE FROM THE P PROTEIN TO THE T PROTEIN" /note="part of multienzyme complex composed of H, L, P, and T proteins which catalyzes oxidation of glycine to yield carbon dioxide, ammonia, 5,10-CH2-H4folate and a reduced pyridine nucleotide; protein H is involved in transfer of methylamine group from the P to T protein; covalently bound to a lipoyl cofactor" /codon_start=1 /transl_table=11 /product="glycine cleavage system protein H" /protein_id="NP_216342.1" /db_xref="GI:15608963" /db_xref="GOA:Q50607" /db_xref="UniProtKB/Swiss-Prot:Q50607" /db_xref="GeneID:885720" /translation="MSDIPSDLHYTAEHEWIRRSGDDTVRVGITDYAQSALGDVVFVQ LPVIGTAVTAGETFGEVESTKSVSDLYAPISGKVSEVNSDLDGTPQLVNSDPYGAGWL LDIQVDSSDVAALESALTTLLDAEAYRGTLTE" misc_feature 2072096..2072185 /gene="gcvH" /locus_tag="Rv1826" /note="PS00189 2-oxo acid dehydrogenases acyltransferase component lipoyl binding site" gene 2072596..2073084 /gene="cfp17" /locus_tag="Rv1827" /db_xref="GeneID:885735" CDS 2072596..2073084 /gene="cfp17" /locus_tag="Rv1827" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1827, (MTCY1A11.16c), len: 162 aa. cfp17, conserved hypothetical protein (see citation below), equivalent to O32919|MLCB1788.36c hypothetical protein from Mycobacterium leprae (162 aa), FASTA scores: opt: 888, E(): 0, (87.0% identity in 161 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216343.1" /db_xref="GI:15608964" /db_xref="UniProtKB/Swiss-Prot:P64897" /db_xref="GeneID:885735" /translation="MTDMNPDIEKDQTSDEVTVETTSVFRADFLSELDAPAQAGTESA VSGVEGLPPGSALLVVKRGPNAGSRFLLDQAITSAGRHPDSDIFLDDVTVSRRHAEFR LENNEFNVVDVGSLNGTYVNREPVDSAVLANGDEVQIGKFRLVFLTGPKQGEDDGSTG GP" gene 2073081..2073824 /locus_tag="Rv1828" /db_xref="GeneID:885336" CDS 2073081..2073824 /locus_tag="Rv1828" /function="UNKNOWN" /note="Rv1828, (MTCY1A11.15c), len: 247 aa. Conserved hypothetical protein, equivalent to O32918|MLCB1788.35c|AL008609 hypothetical protein from Mycobacterium leprae (251 aa), FASTA scores: opt: 1397, E(): 0, (87.6% identity in 251 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216344.1" /db_xref="GI:15608965" /db_xref="GOA:P67669" /db_xref="UniProtKB/Swiss-Prot:P67669" /db_xref="GeneID:885336" /translation="MSAPDSPALAGMSIGAVLDLLRPDFPDVTISKIRFLEAEGLVTP RRASSGYRRFTAYDCARLRFILTAQRDHYLPLKVIRAQLDAQPDGELPPFGSPYVLPR LVPVAGDSAGGVGSDTASVSLTGIRLSREDLLERSEVADELLTALLKAGVITTGPGGF FDEHAVVILQCARALAEYGVEPRHLRAFRSAADRQSDLIAQIAGPLVKAGKAGARDRA DDLAREVAALAITLHTSLIKSAVRDVLHR" gene 2073943..2074437 /locus_tag="Rv1829" /db_xref="GeneID:885743" CDS 2073943..2074437 /locus_tag="Rv1829" /function="UNKNOWN" /note="Rv1829, (MTCY1A11.14c), len: 164 aa. Conserved hypothetical protein, equivalent to O32917|MLCB1788.34|AL008609 Hypothetical protein from Mycobacterium leprae (164 aa), FASTA scores: opt: 1011, E(): 0, (95.1% identity in 164 aa overlap). Also present in Aquifex aeolicus, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216345.1" /db_xref="GI:15608966" /db_xref="UniProtKB/Swiss-Prot:Q50604" /db_xref="GeneID:885743" /translation="MGEVRVVGIRVEQPQNQPVLLLREANGDRYLPIWIGQSEAAAIA LEQQGVEPPRPLTHDLIRDLIAALGHSLKEVRIVDLQEGTFYADLIFDRNIKVSARPS DSVAIALRVGVPIYVEEAVLAQAGLLIPDESDEEATTAVREDEVEKFKEFLDSVSPDD FKAT" gene 2074841..2075518 /locus_tag="Rv1830" /db_xref="GeneID:885740" CDS 2074841..2075518 /locus_tag="Rv1830" /function="UNKNOWN" /note="Rv1830, (MTCY1A11.13c), len: 225 aa. Conserved hypothetical protein, equivalent to Mycobacterium leprae hypothetical protein MLCB1788.33c|AL008609|O32916 (231 aa), FASTA scores: opt: 1307, E(): 0, (89.6% identity in 231 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216346.1" /db_xref="GI:15608967" /db_xref="GOA:P67671" /db_xref="UniProtKB/Swiss-Prot:P67671" /db_xref="GeneID:885740" /translation="MTQLVTRARSARGSTLGEQPRQDQLDFADHTGTAGDGNDGAAAA SGPVQPGLFPDDSVPDELVGYRGPSACQIAGITYRQLDYWARTSLVVPSIRSAAGSGS QRLYSFKDILVLKIVKRLLDTGISLHNIRVAVDHLRQRGVQDLANITLFSDGTTVYEC TSAEEVVDLLQGGQGVFGIAVSGAMRELTGVIADFHGERADGGESIAAPEDELASRRK HRDRKIG" gene 2075571..2075828 /locus_tag="Rv1831" /db_xref="GeneID:885718" CDS 2075571..2075828 /locus_tag="Rv1831" /function="UNKNOWN" /note="Rv1831, (MTCY1A11.12c), len: 85 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216347.1" /db_xref="GI:15608968" /db_xref="UniProtKB/Swiss-Prot:P64899" /db_xref="GeneID:885718" /translation="MRLCVCSAVDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTA RDYDASGKRWRPLAVLTRRWGKAIHLTVDRVAESLRRLACR" gene 2075877..2078702 /gene="gcvB" /locus_tag="Rv1832" /db_xref="GeneID:885716" CDS 2075877..2078702 /gene="gcvB" /locus_tag="Rv1832" /EC_number="1.4.4.2" /function="THE GLYCINE CLEAVAGE SYSTEM CATALYSES THE DEGRADATION OF GLYCINE. THE P PROTEIN BINDS THE ALPHA-AMINO GROUP OF GLYCINE THROUGH ITS PYRIDOXAL PHOSPHATE COFACTOR; CO(2) IS RELEASED AND THE REMAINING METHYLAMINE MOIETY IS THEN TRANSFERRED TO THE LIPOAMIDE COFACTOR OF THE H PROTEIN [CATALYTIC ACTIVITY : GLYCINE + LIPOYLPROTEIN = S- AMINOMETHYLDIHYDROLIPOYLPROTEIN + CO(2)]" /note="acts in conjunction with GvcH to form H-protein-S-aminomethyldihydrolipoyllysine from glycine" /codon_start=1 /transl_table=11 /product="glycine dehydrogenase" /protein_id="NP_216348.1" /db_xref="GI:15608969" /db_xref="GOA:Q50601" /db_xref="UniProtKB/Swiss-Prot:Q50601" /db_xref="GeneID:885716" /translation="MSDHSTFADRHIGLDSQAVATMLAVIGVDSLDDLAVKAVPAGIL DTLTDTGAAPGLDSLPPAASEAEALAELRALADANTVAVSMIGQGYYDTHTPPVLLRN IIENPAWYTAYTPYQPEISQGRLEALLNFQTLVTDLTGLEIANASMLDEGTAAAEAMT LMHRAARGPVKRVVVDADVFTQTAAVLATRAKPLGIEIVTADLRAGLPDGEFFGVIAQ LPGASGRITDWSALVQQAHDRGALVAVGADLLALTLIAPPGEIGADVAFGTTQRFGVP MGFGGPHAGYLAVHAKHARQLPGRLVGVSVDSDGTPAYRLALQTREQHIRRDKATSNI CTAQVLLAVLAAMYASYHGAGGLTAIARRVHAHAEAIAGALGDALVHDKYFDTVLARV PGRADEVLARAKANGINLWRVDADHVSVACDEATTDTHVAVVLDAFGVAAAAPAHTDI ATRTSEFLTHPAFTQYRTETSMMRYLRALADKDIALDRSMIPLGSCTMKLNAAAEMES ITWPEFGRQHPFAPASDTAGLRQLVADLQSWLVLITGYDAVSLQPNAGSQGEYAGLLA IHEYHASRGEPHRDICLIPSSAHGTNAASAALAGMRVVVVDCHDNGDVDLDDLRAKVG EHAERLSALMITYPSTHGVYEHDIAEICAAVHDAGGQVYVDGANLNALVGLARPGKFG GDVSHLNLHKTFCIPHGGGGPGVGPVAVRAHLAPFLPGHPFAPELPKGYPVSSAPYGS ASILPITWAYIRMMGAEGLRAASLTAITSANYIARRLDEYYPVLYTGENGMVAHECIL DLRGITKLTGITVDDVAKRLADYGFHAPTMSFPVAGTLMVEPTESESLAEVDAFCEAM IGIRAEIDKVGAGEWPVDDNPLRGAPHTAQCLLASDWDHPYTREQAAYPLGTAFRPKV WPAVRRIDGAYGDRNLVCSCPPVEAFA" gene complement(2078929..2079789) /locus_tag="Rv1833c" /db_xref="GeneID:885737" CDS complement(2078929..2079789) /locus_tag="Rv1833c" /EC_number="3.8.1.5" /function="May act on a wide range of 1-haloalkanes, haloalcohols, haloalkenes and some haloaromatic compounds [Catalytic activity: 1-haloalkane + H(2)O, a primary alcohol + halide]" /note="Rv1833c, (MTCY1A11.10), len: 286 aa. Possible haloalkane dehalogenase (EC 3.8.1.5). Similar to several haloalkane dehalogenase e.g. CAB45532.1|AJ243259 from Mycobacterium bovis (300 aa); also similar to LINB_PSEPA|P51698 1,3,4,6-tetrachloro-1,4-cyclohexadien from Pseudomonas paucimobilis (295 aa), FASTA scores: opt: 314, E(): 1.5e-13, (33.1% identity in 281 aa overlap)." /codon_start=1 /transl_table=11 /product="haloalkane dehalogenase" /protein_id="NP_216349.1" /db_xref="GI:15608970" /db_xref="GeneID:885737" /translation="MSIDFTPDPQLYPFESRWFDSSRGRIHYVDEGTGPPILLCHGNP TWSFLYRDIIVALRDRFRCVAPDYLGFGLSERPSGFGYQIDEHARVIGEFVDHLGLDR YLSMGQDWGGPISMAVAVERADRVRGVVLGNTWFWPADTLAMKAFSRVMSSPPVQYAI LRRNFFVERLIPAGTEHRPSSAVMAHYRAVQPNAAARRGVAEMPKQILAARPLLARLA REVPATLGTKPTLLIWGMKDVAFRPKTIIPRLSATFPDHVLVELPNAKHFIQEDAPDR IAAAIIERFG" gene 2079830..2080696 /locus_tag="Rv1834" /db_xref="GeneID:888761" CDS 2079830..2080696 /locus_tag="Rv1834" /EC_number="3.-.-.-" /function="UNKNOWN" /note="Rv1834, (MTCY1A11.09c), len: 288 aa. Probable hydrolase (EC 3.-.-.-), some similarity to haloalkane dehalogenases and D16262 hypothetical 38.9 kDa protein (335 aa), FASTA scores: opt: 507, E(): 7.6e-28, (33.0% identity in 300 aa overlap)." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_216350.1" /db_xref="GI:15608971" /db_xref="GeneID:888761" /translation="MTSPSVREWRDGGRWLPTAVGKVFVRSGPGDTPTMLLLHGYPSS SFDFRAVIPHLTGQAWVTMDFLGFGLSDKPRPHRYSLLEQAHLVETVVAHTVTGAVVV LAHDMGTSVTTELLARDLDGRLPFDLRRAVLSNGSVILERASLRPIQKVLRSPLGPVA ARLVSRGGFTRGFGRIFSPAHPLSAQEAQAQWELLCYNDGNRIPHLLISYLDERIRHA QRWHGAVRDWPKPLGFVWGLDDPVATTNVLNGLRELRPSAAVVELPGLGHYPQVEAPK AYAEAALSLLVD" gene complement(2080701..2082587) /locus_tag="Rv1835c" /db_xref="GeneID:885877" CDS complement(2080701..2082587) /locus_tag="Rv1835c" /function="UNKNOWN" /note="Rv1835c, (MTCY1A11.08), len: 628 aa. Conserved hypothetical protein, some similarity to putative acylases e.g. G216374 glutaryl 7-aca acylase precursor (634 aa) FASTA scores, opt: 202, E(): 3.5e-06, (25.1% identity in 669 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv2800 and Rv1215c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216351.1" /db_xref="GI:15608972" /db_xref="GeneID:885877" /translation="MTRRGGSDAAWYSAPDQRSAYPRYRGMRYSSCYVTMRDGVRIAI DLYLPAGLTSAARLPAILHQTRYYRSLQLRWPLRMLLGGKPLQHIAADKRRRRRFVAS GYAWVDVDVRGSGASFGARVCEWSSDEIRDGAEIVDWIVRQPWCNGTVAALGNSYDGT SAELLLVNQHPAVRVIAPCFSLFDVYTDIAFPGGIHAAWFTDTWGRYNEALDRNALHE VVGWWAKLPVTGMQPVQEDRDRSLRDGAIAAHRGNYDVHQIAGSLTFRDDVSASDPYR GQPDARLEPIGTPIESGSINLISPHNYWRDVQASGAAIYSYSGWFDGGYAHAAIKRFL TVSTPGSHLILGPWNHTGGWRVDPLRGLSRPDFDHDGELLRFIDHHVKGADTGIGSEP PVHYFTMVENRWKSADTWPPPATTQSYYLSADRQLRPDAPDCDSGADEYVVDQTAGTG ERSRWRSQVGIGGHVCYPDRKAQDAKLLTYTSAPLDHPLEVTGHVVVTLFITSTSSDG TFFVYLEDVDPRGRVAYITEGQLRAIHRRLSDGPPPYRQVVPYRTFASGDAWPLVPGE IARLTFDLLPTSYLFQPGHRIRIAIAGADASHFAILPGCAPTVRVYRSRMHASRIDLP VIQP" gene complement(2082603..2084636) /locus_tag="Rv1836c" /db_xref="GeneID:885707" CDS complement(2082603..2084636) /locus_tag="Rv1836c" /function="UNKNOWN" /note="Rv1836c, (MTCY1A11.07), len: 677 aa. Conserved hypothetical protein. Equivalent to MLCB1788.28|AL008609 hypothetical protein from Mycobacterium leprae (710 aa), FASTA scores: opt: 2938, E(): 0, (66.0% identity in 714 aa overlap). Contains PS00036 bZIP transcription factors basic domain signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216352.1" /db_xref="GI:15608973" /db_xref="GeneID:885707" /translation="MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDD GPLSSEGHYSAVGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDW QAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDT VAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGG QPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPG LQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVR TLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGS WLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKP PSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLS NVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQ YSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSA DPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS" misc_feature complement(2083080..2083121) /locus_tag="Rv1836c" /note="PS00036 bZIP transcription factors basic domain signature" gene complement(2084756..2086981) /gene="glcB" /locus_tag="Rv1837c" /db_xref="GeneID:885713" CDS complement(2084756..2086981) /gene="glcB" /locus_tag="Rv1837c" /EC_number="2.3.3.9" /function="INVOLVED IN GLYOXYLATE BYPASS (SECOND STEP), AN ALTERNATIVE TO THE TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY : L-MALATE + CoA = ACETYL-CoA + H(2)O + GLYOXYLATE]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of malate from glyoxylate and acetyl-CoA" /codon_start=1 /transl_table=11 /product="malate synthase G" /protein_id="NP_216353.1" /db_xref="GI:15608974" /db_xref="GeneID:885713" /translation="MTDRVSVGNLRIARVLYDFVNNEALPGTDIDPDSFWAGVDKVVA DLTPQNQALLNARDELQAQIDKWHRRRVIEPIDMDAYRQFLTEIGYLLPEPDDFTITT SGVDAEITTTAGPQLVVPVLNARFALNAANARWGSLYDALYGTDVIPETDGAEKGPTY NKVRGDKVIAYARKFLDDSVPLSSGSFGDATGFTVQDGQLVVALPDKSTGLANPGQFA GYTGAAESPTSVLLINHGLHIEILIDPESQVGTTDRAGVKDVILESAITTIMDFEDSV AAVDAADKVLGYRNWLGLNKGDLAAAVDKDGTAFLRVLNRDRNYTAPGGGQFTLPGRS LMFVRNVGHLMTNDAIVDTDGSEVFEGIMDALFTGLIAIHGLKASDVNGPLINSRTGS IYIVKPKMHGPAEVAFTCELFSRVEDVLGLPQNTMKIGIMDEERRTTVNLKACIKAAA DRVVFINTGFLDRTGDEIHTSMEAGPMVRKGTMKSQPWILAYEDHNVDAGLAAGFSGR AQVGKGMWTMTELMADMVETKIAQPRAGASTAWVPSPTAATLHALHYHQVDVAAVQQG LAGKRRATIEQLLTIPLAKELAWAPDEIREEVDNNCQSILGYVVRWVDQGVGCSKVPD IHDVALMEDRATLRISSQLLANWLRHGVITSADVRASLERMAPLVDRQNAGDVAYRPM APNFDDSIAFLAAQELILSGAQQPNGYTEPILHRRRREFKARAAEKPAPSDRAGDDAA R" gene complement(2087257..2087652) /locus_tag="Rv1838c" /db_xref="GeneID:885744" CDS complement(2087257..2087652) /locus_tag="Rv1838c" /function="UNKNOWN" /note="Rv1838c, (MTCY359.35), len: 131 aa. Conserved hypothetical protein. Part of 14-membered Mycobacterium tuberculosis protein family with Rv2863|MTV003.09|AL008883 (126 aa), FASTA scores: opt: 293, E(): 1.5e-14, (38.2% identity in 123 aa overlap); Rv0749, Rv0277c, Rv2530c, etc. Also similar to AJ248288|CNSPAX06_181 Pyrococcus abyssi complete genome (136 aa), FASTA scores: opt: 197, E(): 2.2e-07, (33. 1% identity in 133 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216354.1" /db_xref="GI:15608975" /db_xref="GeneID:885744" /translation="MILVDSNIPMYLVGASHPHKLDAQRLLESALSGGERLVTDAEVL QEICHRYVAIKRREAIQPAFDAIIGVVDEVLPIERTDVEHARDALLRYQTLSARDALH IAVMAHHDITRLMSFDRGFDSYPGIKRLA" gene complement(2087649..2087912) /locus_tag="Rv1839c" /db_xref="GeneID:885750" CDS complement(2087649..2087912) /locus_tag="Rv1839c" /function="UNKNOWN" /note="Rv1839c, (MTCY359.34), len: 87 aa. Conserved hypothetical protein. Some similarity to G217008 CHO-ORF1 (279 aa), FASTA scores: opt: 86, E(): 13, (38.7% identity in 62 aa overlap). TBparse score is 1.006." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216355.1" /db_xref="GI:15608976" /db_xref="GeneID:885750" /translation="MSKRLQVLLDPDEWEELREIARRHRTTVSEWVRRTLREAREREP RGDLDMKLRSVRAAARHEFPTADVEQMLEEIERGRGAEREGSR" gene complement(2087971..2089518) /gene="PE_PGRS34" /locus_tag="Rv1840c" /db_xref="GeneID:885753" CDS complement(2087971..2089518) /gene="PE_PGRS34" /locus_tag="Rv1840c" /function="UNKNOWN" /note="Rv1840c, (MTCY359.33), len: 515 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Similar to many e.g. Y03A_MYCTU|Q10637 hypothetical glycine-rich 49.6 kDa protein (603 aa), FASTA scores: opt: 1693, E(): 0, (53.1% identity in 612 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177847.1" /db_xref="GI:57116924" /db_xref="GeneID:885753" /translation="MSFVVAAPEVVVAAASDLAGIGSAIGAANAAAAVPTMGVLAAGA DEVSAAVADLFGAHAQAYQALSAQAALFHEQFVHAMTAGAGAYAGAEAADAAALDVLN GPFQALFGRPLIGDGANGAPGQPGGPGGLLYGNGGNGGNGGIGQPGGAGGDAGLIGNG GNGGIGGPGATGLAGGAGGVGGLLFGDGGNGGAGGLGTGPVGATGGIGGPGGAAVGLF GHGGAGGAGGLGKAGFAGGAGGTGGTGGLLYGNGGNGGNVPSGAADGGAGGDARLIGN GGDGGSVGAAPTGIGNGGNGGNGGWLYGDGGSGGSTLQGFSDGGTGGNAGMFGDGGNG GFSFFDGNGGDGGTGGTLIGNGGDGGNSVQTDGFLRGHGGDGGNAVGLIGNGGAGGAG SAGTGVFAPGGGSGGNGGNGALLVGNGGAGGSGGPTQIPSVAVPVTGAGGTGGNGGTA GLIGNGGNGGAAGVSGDGTPGTGGNGGYAQLIGDGGDGGPGDSGGPGGSGGTGGTLAG QNGSPGG" gene complement(2089681..2090718) /locus_tag="Rv1841c" /db_xref="GeneID:885656" CDS complement(2089681..2090718) /locus_tag="Rv1841c" /function="UNKNOWN" /note="Rv1841c, (MTCY359.32), len: 345 aa. Conserved hypothetical membrane protein. Some similarity to O07585|YHDP_BACSU HYPOTHETICAL 49.9 kDa PROTEIN from Bacillus subtilis (444 aa), FASTA scores: opt: 620, E(): 0, (31.1% identity in 350 aa overlap). Also similar to other Mycobacterium tuberculosis proteins e.g. Rv1842c, Rv2366c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216357.1" /db_xref="GI:15608978" /db_xref="GeneID:885656" /translation="MDVLSAVLLALLLIGANAFFVGAEFALISARRDRLEALAEQGKA TAVTVIRAGEQLPAMLTGAQLGVTVSSILLGRVGEPAVVKLLQLSFGLSGVPPALLHT LSLAIVVALHVLLGEMVPKNIALAGPERTAMLLVPPYLVYVRLARPFIAFYNNCANAI LRLVGVQPKDELDIAVSTAELSEMIAESLSEGLLDHEEHTRLTRALRIRTRLVADVAV PLVNIRAVQVSAVGSGPTIGGVEQALAQTGYSRFPVVDRGGRFIGYLHIKDVLTLGDN PQTVIDLAVVRPLPRVPQSLPLADALSRMRRINSHLALVTADNGSVVGMVALEDVVED LVGTMRDGTHR" gene complement(2090718..2092085) /locus_tag="Rv1842c" /db_xref="GeneID:885739" CDS complement(2090718..2092085) /locus_tag="Rv1842c" /function="UNKNOWN" /note="Rv1842c, (MTCY359.31), len: 455 aa. Conserved hypothetical membrane protein. Similar to Z99109|0O7589 Potential integral membrane protein from Bacillus subtilis (461 aa), FASTA scores: opt: 723, E(): 0, (31.2% identity in 449 aa overlap). Similar to other Mycobacterium tuberculosis putative integral membrane proteins e.g. Rv2366c, Rv1841c. TBPARSE score is 0.883." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216358.1" /db_xref="GI:15608979" /db_xref="GeneID:885739" /translation="MNLTDTVATILAILALTAGTGVFVAAEFSLTALDRSTVEANARG GTSRDRFIQRAHHRLSFQLSGAQLGISITTLATGYLTEPLVAELPHPGLVAVGMSDRV ADGLITFFALVIVTSLSMVFGELVPKYLAVARPLRTARSVVAGQVLFSLLLTPAIRLT NGAANWIVRRLGIEPAEELRSARTPQELVSLVRSSARSGALDDATAWLMRRSLQFGAL TAEELMTPRSKIVALQTDDTIADLVAAAAASGFSRFPVVEGDLDATVGIVHVKQVFEV PPGDRAHTLLTTVAEPVAVVPSTLDGDAVMAQVRASALQTAMVVDEYGGTAGMVTLED LIEEIVGDVRDEHDDATPDVVAAGNGWRVSGLLRIDEVASATGYRAPDGPYETIGGLV LRELGHIPVAGETVELTALDQDGLPDDSMRWLATVIQMDGRRIDLLELIKMGGHADPG SGRGR" gene complement(2092259..2093698) /gene="guaB1" /locus_tag="Rv1843c" /db_xref="GeneID:885714" CDS complement(2092259..2093698) /gene="guaB1" /locus_tag="Rv1843c" /EC_number="1.1.1.205" /function="INVOLVED IN GMP BIOSYNTHESIS [CATALYTIC ACTIVITY: Inosine 5'-phosphate + NAD+ + H2O = xanthosine 5'-phosphate + NADH]" /experiment="experimental evidence, no additional details recorded" /note="catalyzes the synthesis of xanthosine monophosphate by the NAD+ dependent oxidation of inosine monophosphate" /codon_start=1 /transl_table=11 /product="inosine 5-monophosphate dehydrogenase" /protein_id="NP_216359.1" /db_xref="GI:15608980" /db_xref="GeneID:885714" /translation="MMRFLDGHPPGYDLTYNDVFIVPNRSEVASRFDVDLSTADGSGT TIPVVVANMTAVAGRRMAETVARRGGIVILPQDLPIPAVKQTVAFVKSRDLVLDTPVT LAPDDSVSDAMALIHKRAHGVAVVILEGRPIGLVRESSCLGVDRFTRVRDIAVTDYVT APAGTEPRKIFDLLEHAPVDVAVLTDADGTLAGVLSRTGAIRAGIYTPATDSAGRLRI GAAVGINGDVGAKARALAEAGVDVLVIDTAHGHQVKTLDAIKAVSALDLGLPLAAGNV VSAEGTRDLLKAGANVVKVGVGPGAMCTTRMMTGVGRPQFSAVLECASAARQLGGHIW ADGGIRHPRDVALALAAGASNVMIGSWFAGTYESPGDLMRDRDDQPYKESYGMASKRA VVARTGADNPFDRARKALFEEGISTSRMGLDPDRGGVEDLIDHITSGVRSTCTYVGAS NLAELHERAVVGVQSGAGFAEGHPLPAGW" gene complement(2093731..2095188) /gene="gnd1" /locus_tag="Rv1844c" /db_xref="GeneID:885755" CDS complement(2093731..2095188) /gene="gnd1" /locus_tag="Rv1844c" /EC_number="1.1.1.44" /function="INVOLVED IN HEXOSE MONOPHOSPHATE SHUNT (PENTOSE PHOSPHATE PATHWAY) [CATALYTIC ACTIVITY: 6-phospho-D-gluconate + NADP+ = D-ribulose 5-phosphate + CO2 + NADPH]." /note="catalyzes the formation of D-ribulose 5-phosphate from 6-phospho-D-gluconate" /codon_start=1 /transl_table=11 /product="6-phosphogluconate dehydrogenase" /protein_id="YP_177848.1" /db_xref="GI:57116925" /db_xref="GeneID:885755" /translation="MSSSESPAGIAQIGVTGLAVMGSNIARNFARHGYTVAVHNRSVA KTDALLKEHSSDGKFVRSETIPEFLAALEKPRRVLIMVKAGEATDADAVINELADAME PGDIIIDGGNALYTDTMRREKAMRERGLHFVGAGISGGEEGALNGPSIMPGGPAESYQ SLGPLLEEISAHVDGVPCCTHIGPDGSGHFVKMVHNGIEYSDMQLIGEAYQLMRDGLG LTAPAIADVFTEWNNGDLDSYLVEITAEVLRQTDAKTGKPLVDVIVDRAEQKGTGRWT VKSALDLGVPVTGIAEAVFARALSGSVGQRSAASGLASGKLGEQPADPATFTEDVRQA LYASKIVAYAQGFNQIQAGSAEFGWDITPGDLATIWRGGCIIRAKFLNHIKEAFDASP NLASLIVAPYFRGAVESAIDSWRRVVSTAAQLGIPTPGFSSALSYYDALRTARLPAAL TQAQRDFFGAHTYGRIDEPGKFHTLWSSDRTEVPV" gene complement(2095218..2096168) /locus_tag="Rv1845c" /db_xref="GeneID:885370" CDS complement(2095218..2096168) /locus_tag="Rv1845c" /function="UNKNOWN" /note="Rv1845c, (MTCY359.28), len: 316 aa. Conserved hypothetical transmembrane protein. Equivalent to MLCB1788.18|AL008609 Hypothetical protein from Mycobacterium leprae (316 aa), FASTA scores: opt: 1762, E(): 0, (87.6% identity in 314 aa overlap). Similar to proteins in Streptomyces coelicolor e.g. SC10A7.04|AL078618.1. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216361.1" /db_xref="GI:15608982" /db_xref="GeneID:885370" /translation="MSALAFTILAVLLAGPTPALLARATWPLRAPRAAMVLWQAIALA AVLSSFSAGIAIASRLLMPGPDGRPTTSFVGAAGRLGWPLWAAYITVFALTVLVGARL AVAVVRVATATRRRRAHHRMVVDLVGVGHNGALAQPCARARDLRVLDVAQPLAYCLPG VRSRVVVSEGTLTALADAEVAAILTHERAHLRARHDLVLEAFTAVHAAFPRLVRSANA LGAVQLLVELLADDAAVRAAGRTPLARALVACASGRAPSGALAVGGPSTVLRVRRLSG RGNSAVLSAAAYLAAAAVLVVPTVALAVPWLTQLQRLFIA" gene complement(2096183..2096599) /locus_tag="Rv1846c" /db_xref="GeneID:885747" CDS complement(2096183..2096599) /locus_tag="Rv1846c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1846c, (MTCY359.27), len: 138 aa. Possible transcriptional regulatory protein. Equivalent to MLCB1788.17|AL008609 hypothetical protein from Mycobacterium leprae (142 aa), FASTA scores: opt: 736 E(): 0, (95.1% identity in 123 aa overlap). Also similar to BLAI_BACLI|P06555 penicillinase repressor (128 aa), fasta scores: opt: 114, E(): 0.12, (23.7% identity in 131 aa overlap). TBPARSE score is 0.921." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216362.1" /db_xref="GI:15608983" /db_xref="GeneID:885747" /translation="MAKLTRLGDLERAVMDHLWSRTEPQTVRQVHEALSARRDLAYTT VMTVLQRLAKKNLVLQIRDDRAHRYAPVHGRDELVAGLMVDALAQAEDSGSRQAALVH FVERVGADEADALRRALAELEAGHGNRPPAGAATET" gene 2096877..2097299 /locus_tag="Rv1847" /db_xref="GeneID:885734" CDS 2096877..2097299 /locus_tag="Rv1847" /function="UNKNOWN" /note="Rv1847, (MTCY359.26c), len: 140 aa. Conserved hypothetical protein, possible thioesterase, some similarity to YBDB proteins of Escherichia coli and H. influenzae e.g. P15050|YBDB_ECOLI HYPOTHETICAL 15.0 KD PROTEIN IN ENTA-CSTA INTERGENIC REGION (137 aa), FASTA scores: opt: 232, E(): 6.6e-10, (35.8% identity in 106 aa overlap); C48956|G142208 thioesterase from Arthrobacter sp (151 aa), FASTA score: opt: 254, E(): 1.7e-11, (33.3% identity in 138 aa overlap). Also similar to AF064959|AF064959_1 hypothetical protein from Coxiella burnetii (148 aa), FASTA score: opt: 264, E(): 9.3e- 12, (36.8% identity in 117 aa overlap). TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216363.1" /db_xref="GI:15608984" /db_xref="GeneID:885734" /translation="MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLL QLTGVVHGGVYCAMIESIASMAAFAWLNSHGEGGSVVGVNNNTDFVRSISSGMVYGTA EPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP" gene 2097348..2097650 /gene="ureA" /locus_tag="Rv1848" /db_xref="GeneID:885414" CDS 2097348..2097650 /gene="ureA" /locus_tag="Rv1848" /EC_number="3.5.1.5" /function="INVOLVED IN THE CONVERSION OF UREA TO NH3 [CATALYTIC ACTIVITY: Urea + H2O = CO2 + 2 NH3]" /note="UreA, with UreB and UreC catalyzes the hydrolysis of urea into ammonia and carbon dioxide; nickel metalloenzyme; accessory proteins UreD, UreE, UreF, and UreG are necessary for assembly of the metallocenter" /codon_start=1 /transl_table=11 /product="urease subunit gamma" /protein_id="NP_216364.1" /db_xref="GI:15608985" /db_xref="GeneID:885414" /translation="MRLTPHEQERLLLSYAAELARRRRARGLRLNHPEAIAVIADHIL EGARDGRTVAELMASGREVLGRDDVMEGVPEMLAEVQVEATFPDGTKLVTVHQPIA" gene 2097647..2097961 /gene="ureB" /locus_tag="Rv1849" /db_xref="GeneID:885710" CDS 2097647..2097961 /gene="ureB" /locus_tag="Rv1849" /EC_number="3.5.1.5" /function="INVOLVED IN THE CONVERSION OF UREA TO NH3 [CATALYTIC ACTIVITY: Urea + H2O = CO2 + 2 NH3]" /note="ureases catalyze the hydrolysis of urea into ammonia and carbon dioxide; in Helicobacter pylori and Yersinia enterocolitica the ammonia released plays a key role in bacterial survival by neutralizing acids when colonizing the gastric mucosa; the holoenzyme is composed of 3 UreC (alpha) and 3 UreAB (gamma/beta)" /codon_start=1 /transl_table=11 /product="urease subunit beta" /protein_id="NP_216365.1" /db_xref="GI:15608986" /db_xref="GeneID:885710" /translation="MIPGEIFYGSGDIEMNAAALSRLQMRIINAGDRPVQVGSHVHLP QANRALSFDRATAHGYRLDIPAATAVRFEPGIPQIVGLVPLGGRREVPGLTLNPPGRL DR" gene 2097961..2099694 /gene="ureC" /locus_tag="Rv1850" /db_xref="GeneID:885359" CDS 2097961..2099694 /gene="ureC" /locus_tag="Rv1850" /EC_number="3.5.1.5" /function="INVOLVED IN THE CONVERSION OF UREA TO NH3 [CATALYTIC ACTIVITY: Urea + H2O = CO2 + 2 NH3]" /note="ureases catalyze the hydrolysis of urea into ammonia and carbon dioxide; in Helicobacter pylori the ammonia released plays a key role in bacterial survival by neutralizing acids when colonizing the gastric mucosa; the holoenzyme is composed of 3 ureC (alpha) and 3 ureAB (gamma/beta) subunits" /codon_start=1 /transl_table=11 /product="urease subunit alpha" /protein_id="NP_216366.1" /db_xref="GI:15608987" /db_xref="GeneID:885359" /translation="MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAG DEAVFGGGKVLRESMGQGRASRADGAPDTVITGAVIIDYWGIIKADIGIRDGRIVGIG KAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTT IIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRG GASGFKLHEDWGSTPAAIDTCLAVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIH AYHTEGAGGGHAPDIITVAAQPNVLPSSTNPTRPHTVNTLDEHLDMLMVCHHLNPRIP EDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARR GALEGDPSGSQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFF GVRPHVVLKGGAIAWAAMGDANASIPTPQPVLPRPMFGAAAATAAATSVHFVAPQSID ARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDTFTVRIDGQVWQPQPAA ELPMTQRYFLF" misc_feature 2098930..2098980 /gene="ureC" /locus_tag="Rv1850" /note="PS00145 Urease active site" gene 2099694..2100329 /gene="ureF" /locus_tag="Rv1851" /db_xref="GeneID:885532" CDS 2099694..2100329 /gene="ureF" /locus_tag="Rv1851" /function="PROBABLY FACILITATES NICKEL INCORPORATION" /note="Rv1851, (MTCY359.22c), len: 211 aa. ureF, urease accessory protein. Identical to UREF_MYCTU|P50050 from M. tuberculosis. TBPARSE score is 0.871." /codon_start=1 /transl_table=11 /product="urease accessory protein uref" /protein_id="NP_216367.1" /db_xref="GI:15608988" /db_xref="GeneID:885532" /translation="MTSLAVLLTLADSRLPTGAHVHSGGIEEAIAAGMVTGLATLEAF LKRRVRTHGLLTASIAAAVHRGELAVDDADRETDARTPAPAARHASRSQGRGLIRLAR RVWPDSGWEELGPRPHLAVVAGRVGALSGLAPEHNALHLVYITMTGSAIAAQRLLALD PAEVTVVTFQLSELCEQIAQEATAGLADLSDPLLDTLAQRHDERVRPLFVS" gene 2100340..2101014 /gene="ureG" /locus_tag="Rv1852" /db_xref="GeneID:885729" CDS 2100340..2101014 /gene="ureG" /locus_tag="Rv1852" /function="PROBABLY FACILITATES NICKEL INCORPORATION" /note="Rv1852, (MTCY359.21c), len: 224 aa. ureG, urease accessory protein. Identical to UREG_MYCTU|P50051 from M. tuberculosis. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UREG FAMILY. TBPARSE score is 0.878." /codon_start=1 /transl_table=11 /product="urease accessory protein ureG" /protein_id="NP_216368.1" /db_xref="GI:15608989" /db_xref="GeneID:885729" /translation="MATHSHPHSHTVPARPRRVRKPGEPLRIGVGGPVGSGKTALVAA LCRQLRGELSLAVLTNDIYTTEDADFLRTHAVLPDDRIAAVQTGGCPHTAIRDDITAN LDAIDELMAAHDALDLILVESGGDNLTATFSSGLVDAQIFVIDVAGGDKVPRKGGPGV TYSDLLVVNKTDLAALVGADLAVMARDADAVRDGRPTVLQSLTEDPAASDVVAWVRSQ LAADGV" misc_feature 2100433..2100456 /gene="ureG" /locus_tag="Rv1852" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 2101022..2101648 /gene="ureD" /locus_tag="Rv1853" /db_xref="GeneID:885705" CDS 2101022..2101648 /gene="ureD" /locus_tag="Rv1853" /function="PROBABLY FACILITATES NICKEL INCORPORATION" /note="Rv1853, (MTCY359.20c), len: 208 aa. ureD, probable urease accessory protein. Similar to URED_YEREN|P42868 Urease operon ureD protein from Yersinia enterocolitica (325 aa), Fasta scores: opt: 114, E(): 0.37, (25.2% identity in 119 aa overlap). TBPARSE score is 0.904." /codon_start=1 /transl_table=11 /product="urease accessory protein ureD" /protein_id="NP_216369.1" /db_xref="GI:15608990" /db_xref="GeneID:885705" /translation="MVASPNRLPRIDCRGGVQARRTAPDTVHLVSAAATPLGGDTMRI RVIVERGAQLRLRSAAATVALPGVDTLTSHAHWEIDVTGTLDVDLEPTVVAASARHLS HATLRLHDDGRVRLRERVQIGRCNEREGFWSSSLQADRHGRPLLRHRVELGAGSLADD VIAAPRATISELRYPATAFTDAIDARSTVLALAGGGTLSTWQADRLPG" gene complement(2101651..2103042) /gene="ndh" /locus_tag="Rv1854c" /db_xref="GeneID:885746" CDS complement(2101651..2103042) /gene="ndh" /locus_tag="Rv1854c" /EC_number="1.6.99.3" /function="TRANSFER OF ELECTRONS FROM NADH TO THE RESPIRATORY CHAIN. THE IMMEDIATE ELECTRON ACCEPTOR FOR THE ENZYME IS BELIEVED TO BE UBIQUINONE. DOES NOT COUPLE THE REDOX REACTION TO PROTON TRANSLOCATION." /note="Rv1854c, (MTCY359.19), len: 463 aa. Probable ndh, NADH dehydrogenase (EC 1.6.99.3) (see citations below), similar to several e.g. S74826 NADH dehydrogenase from Synechocystis sp. (445 aa), FASTA score: opt: 1228, E(): 0, (46.3% identity in 432 aa overlap). Highly similar to Rv0392c|Z84725|g1817703 from Mycobacterium tuberculosis (470 aa), FASTA scores: opt: 1911, E(): 0, (64.7% identity in 459 aa overlap); and Rv1812c. TBPARSE score is 0.897." /codon_start=1 /transl_table=11 /product="NADH dehydrogenase" /protein_id="NP_216370.1" /db_xref="GI:15608991" /db_xref="GeneID:885746" /translation="MSPQQEPTAQPPRRHRVVIIGSGFGGLNAAKKLKRADVDIKLIA RTTHHLFQPLLYQVATGIISEGEIAPPTRVVLRKQRNVQVLLGNVTHIDLAGQCVVSE LLGHTYQTPYDSLIVAAGAGQSYFGNDHFAEFAPGMKSIDDALELRGRILSAFEQAER SSDPERRAKLLTFTVVGAGPTGVEMAGQIAELAEHTLKGAFRHIDSTKARVILLDAAP AVLPPMGAKLGQRAAARLQKLGVEIQLGAMVTDVDRNGITVKDSDGTVRRIESACKVW SAGVSASRLGRDLAEQSRVELDRAGRVQVLPDLSIPGYPNVFVVGDMAAVEGVPGVAQ GAIQGAKYVASTIKAELAGANPAEREPFQYFDKGSMATVSRFSAVAKIGPVEFSGFIA WLIWLVLHLAYLIGFKTKITTLLSWTVTFLSTRRGQLTITDQQAFARTRLEQLAELAA EAQGSAASAKVAS" gene complement(2103184..2104107) /locus_tag="Rv1855c" /db_xref="GeneID:885736" CDS complement(2103184..2104107) /locus_tag="Rv1855c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1855c, (MTCY359.18), len: 307 aa. Possible oxidoreductase (EC 1.-.-.-), possibly a monooxygenase. Contains PS00217 Sugar transport proteins signature 2, probably fortuitously. Similar to G487716 (78-11) LINCOMYCIN PRODUCTION GENES (29.2% identity in 154 aa overlap). Also similar to other Mycobacterium tuberculosis proteins e.g. Rv0953c, Rv0791c, Rv0132c, Rv2951c, etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_216371.1" /db_xref="GI:15608992" /db_xref="GeneID:885736" /translation="MTIRLGLQIPNFSYGTGVEKLFPSVIAQAREAEAAGYDSLFVMD HFYQLPMLGTPDQPMLEAYTALGALATATERLQLGALVTGNTYRSPTLLAKIITTLDV VSAGRAILGIGAGWFELEHRQLGFEFGTFSDRFNRLEEALQILEPMVKGERPTFFGDW YTTESAMAEPRYRDRIPILIGGGGEKKTFAIAARFADHLNIVAAVDELPRKMRALAAR CDEAGRDRSTLQTSLLLTVMIDETLSPDAIPAEMSGRVVVGSPAQIADQIQAKVLDAG VDGLIINLAPHGYLPGVITTAAEALRPLLGV" misc_feature complement(2103706..2103783) /locus_tag="Rv1855c" /note="PS00217 Sugar transport proteins signature 2" gene complement(2104146..2104823) /locus_tag="Rv1856c" /db_xref="GeneID:885708" CDS complement(2104146..2104823) /locus_tag="Rv1856c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1856c, (MTCY359.17), len: 225 aa. Possible oxidoreductase (EC 1.-.-.-). Equivalent to MLCB1788.11c|AL008609 OXIDOREDUCTASE from Mycobacterium leprae (224 aa), FASTA scores: opt: 1211, E(): 0; (80.4% identity in 224 aa overlap). Some similarity to dehydrogenases of short-chain dehydrogenase/reductase family and fatty-acyl CoA reductases e.g. P16543|DHK2_STRVN GRANATICIN POLYKETIDE SYNTHASE P (249 aa), FASTA score: opt: 194, E(): 1.1e-05, (32.5% identity in 237 aa overlap)." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216372.1" /db_xref="GI:15608993" /db_xref="GeneID:885708" /translation="MAVEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAK ELDVDAVVCDTTDPTSLTEARGLFPRHLDTIVNVPAPSWDAGDPRAYSVSDTANAWRN ALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALSNWIAGQAAVF GTRGITINTVACGRSVQTGYEGLSRTPAPVAAEIARLALFLTTPAARHITGQTLHVSH GALAHFG" gene 2104985..2105770 /gene="modA" /locus_tag="Rv1857" /db_xref="GeneID:885655" CDS 2104985..2105770 /gene="modA" /locus_tag="Rv1857" /function="INVOLVED IN THE ACTIVE TRANSPORT OF MOLYBDENUM INTO THE CELL ACROSS THE MEMBRANE (IMPORT). PART OF THE BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEM MODABC." /note="Rv1857, (MTCY359.16c), len: 261 aa. Probable modA, molybdate-binding protein attached to membrane by lipid-modified N-terminal cysteine (contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site), component of molybdate transport system (see citations below). Shows strong similarity to precursors of periplasmic molybdate/sulphate binding proteins e.g. O31229|Y10817|ANY108174 ModA from Arthrobacter nicotinovorans (260 aa), FASTA score: opt: 725, E(): 0, (47.8% identity in 249 aa overlap). TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="molybdate-binding lipoprotein" /protein_id="NP_216373.1" /db_xref="GI:15608994" /db_xref="GeneID:885655" /translation="MRWIGLSTGLVSAMLVAGLVACGSNSPASSPAGPTQGARSIVVF AAASLQSAFTQIGEQFKAGNPGVNVNFAFAGSSELATQLTQGATADVFASADTAQMDS VAKAGLLAGHPTNFATNTMVIVAAAGNPKKIRSFADLTRPGLNVVVCQPSVPCGSATR RIEDATGIHLNPVSEELSVTDVLNKVITGQADAGLVYVSDALSVATKVTCVRFPEAAG VVNVYAIAVLKRTSQPALARQFVAMVTAAAGRRILDQSGFAKP" misc_feature 2105018..2105050 /gene="modA" /locus_tag="Rv1857" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2105773..2106567 /gene="modB" /locus_tag="Rv1858" /db_xref="GeneID:885723" CDS 2105773..2106567 /gene="modB" /locus_tag="Rv1858" /function="PART OF THE BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEM MODABC FOR MOLYBDENUM; RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1858, (MTCY359.15c), len: 264 aa. Probable modB, molybdenum-transport integral membrane protein ABC transporter (see citation below), similar to others e.g. Y10817|ANY108175 ModB from Arthrobacter (239 aa), FASTA scores: opt: 937, E(): 0, (67.8% identity in 230 aa overlap); etc. Similar to other Mycobacterium tuberculosis transport proteins e.g. Rv2039c, Rv2316, etc. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="molbdenum-transport integral membrane protein ABC transporter" /protein_id="NP_216374.1" /db_xref="GI:15608995" /db_xref="GeneID:885723" /translation="MHPPTDLPRWVYLPAIAGIVFVAMPLVAIAIRVDWPRFWALITT PSSQTALLLSVKTAAASTVLCVLLGVPMALVLARSRGRLVRSLRPLILLPLVLPPVVG GIALLYAFGRLGLIGRYLEAAGISIAFSTAAVVLAQTFVSLPYLVISLEGAARTAGAD YEVVAATLGARPGTVWWRVTLPLLLPGVVSGSVLAFARSLGEFGATLTFAGSRQGVTR TLPLEIYLQRVTDPDAAVALSLLLVVVAALVVLGVGARTPIGTDTR" gene 2106574..2107683 /gene="modC" /locus_tag="Rv1859" /db_xref="GeneID:885731" CDS 2106574..2107683 /gene="modC" /locus_tag="Rv1859" /function="PART OF THE BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEM MODABC FOR MOLYBDENUM; RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv1859, (MTCY359.14c), len: 369 aa. Probable modC, molybdenum-transport ATP-binding protein ABC transporter (see citation below), similar to others e.g. Y10817|ANY108176 ModC from Arthrobacter (349 aa), FASTA scores: opt: 895, E(): 0, (46.0% identity in 361 aa overlap); etc. Shows similarity to other Mycobacterium tuberculosis ABC-transporter proteins e.g. Rv0073, Rv1238, Rv2564, etc. Contains both PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporters family signatures involved in molybdate uptake. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="molybdenum ABC transporter ATP-binding protein" /protein_id="NP_216375.1" /db_xref="GI:15608996" /db_xref="GeneID:885731" /translation="MSKLQLRAVVADRRLDVEFSVSAGEVLAVLGPNGAGKSTALHVI AGLLRPDAGLVRLGDRVLTDTEAGVNVATHDRRVGLLLQDPLLFPHLSVAKNVAFGPQ CRRGMFGSGRARTRASALRWLREVNAEQFADRKPRQLSGGQAQRVAIARALAAEPDVL LLDEPLTGLDVAAAAGIRSVLRSVVARSGCAVVLTTHDLLDVFTLADRVLVLESGTIA EIGPVADVLTAPRSRFGARIAGVNLVNGTIGPDGSLRTQSGAHWYGTPVQDLPTGHEA IAVFPPTAVAVYPEPPHGSPRNIVGLTVAEVDTRGPTVLVRGHDQPGGAPGLAACITV DAATELRVAPGSRVWFSVKAQEVALHPAPHQHASS" misc_feature 2106664..2106687 /gene="modC" /locus_tag="Rv1859" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 2106988..2107032 /gene="modC" /locus_tag="Rv1859" /note="PS00211 ABC transporters family signature" gene 2107736..2108713 /gene="apa" /locus_tag="Rv1860" /db_xref="GeneID:885896" CDS 2107736..2108713 /gene="apa" /locus_tag="Rv1860" /function="UNKNOWN (COULD MEDIATE BACTERIAL ATTACHMENT TO HOST CELLS)." /experiment="experimental evidence, no additional details recorded" /note="Rv1860, (MT1908, MTCY359.0013), len: 325 aa. apa (alternate gene names: mpt32, modD), Ala-, Pro-rich 45/47 kDa secreted protein, very similar to P46842|N43L_MYCLE from Mycobacterium leprae (287 aa), FASTA scores: opt: 1166, E(): 0, (66.4% identity in 298 aa overlap). Known to be glycosylated fibronectin-binding protein (see some citations). CHANGES IN THE MANNOSYLATION PATTERN OF THIS PROTEIN AFFECT ITS ABILITY TO STIMULATE T-LYMPHOCYTE RESPONSE. MAJOR IMMUNODOMINANT ANTIGEN THAT HAS POTENTIAL AS A VACCINE AGAINST TUBERCULOSIS. APA-ELISA COULD BE USED IN DIAGNOSIS. TBparse score is 0.924.; mpt32; modD" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177849.1" /db_xref="GI:57116926" /db_xref="GeneID:885896" /translation="MHQVDPNLTRRKGRLAALAIAAMASASLVTVAVPATANADPEPA PPVPTTAASPPSTAAAPPAPATPVAPPPPAAANTPNAQPGDPNAAPPPADPNAPPPPV IAPNAPQPVRIDNPVGGFSFALPAGWVESDAAHFDYGSALLSKTTGDPPFPGQPPPVA NDTRIVLGRLDQKLYASAEATDSKAAARLGSDMGEFYMPYPGTRINQETVSLDANGVS GSASYYEVKFSDPSKPNGQIWTGVIGSPAANAPDAGPPQRWFVVWLGTANNPVDKGAA KALAESIRPLVAPPPAPAPAPAEPAPAPAPAGEVAPTPTTPTPQRTLPA" gene 2109165..2109470 /locus_tag="Rv1861" /db_xref="GeneID:885741" CDS 2109165..2109470 /locus_tag="Rv1861" /function="UNKNOWN" /note="Rv1861, (MTCY359.12c), len: 101 aa. Probable conserved transmembrane protein, showing weak similarity to AE002069|AE002069_10 hypothetical protein from Deinococcus radiodurans (146 aa), FASTA scores: opt: 154, E(): 0.0027, (30.8% identity in 104 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.863." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216377.1" /db_xref="GI:15608998" /db_xref="GeneID:885741" /translation="MDITATTEFSAMNLDGKTGIGWLGYIVIGGIAGWLASKIVKGGG SGILMNVVIGVVGAFGAGLVLNALGVDVNHGGYWFTFFVALGGAVVLLWIVGMVRKT" misc_feature 2109195..2109218 /locus_tag="Rv1861" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 2109544..2110584 /gene="adhA" /locus_tag="Rv1862" /db_xref="GeneID:885652" CDS 2109544..2110584 /gene="adhA" /locus_tag="Rv1862" /EC_number="1.1.1.1" /function="Catalyzes the reversible oxidation of ethanol to acetaldehyde with the concomitant reduction of NAD" /note="Rv1862, (MTCY359.11), len: 346 aa. Probable adhA, alcohol dehydrogenase (EC 1.1.1.1), similar to ADH2_BACST|P42327 alcohol dehydrogenase (339 aa), FASTA scores: opt: 630, E(): 2.4e-32 (34.4% identity in 320 aa overlap). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. TBPARSE score is 0.899." /codon_start=1 /transl_table=11 /product="alcohol dehydrogenase AdhA" /protein_id="NP_216378.1" /db_xref="GI:15608999" /db_xref="GeneID:885652" /translation="MVSPATTATMSAWQVRRPGPMDTGPLERVTTRVPRPAPSELLVA VHACGVCRTDLHVTEGDLPVHRERVIPGHEVVGEVIEVGSAVGAAAGGEFDRGDRVGI AWLRHTCGVCKYCRRGSENLCPQSRYTGWDADGGYAEFTTVPAAFAHHLPSGYSDSEL APLLCAGIIGYRSLLRTELPPGGRLGLYGFGGSAHITAQVALAQGAEIHVMTRGARAR KLALQLGAASAQDAADRPPVPLDAAILFAPVGDLVLPALEALDRGGILAIAGIHLTDI PDLNYQQHLFQERQIRSVTSNTRADARAFFDFAAQHHIEVTTPEYPLGQADRALGDLS AGRIAGAAVLLI" misc_feature 2109757..2109801 /gene="adhA" /locus_tag="Rv1862" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene complement(2110591..2111361) /locus_tag="Rv1863c" /db_xref="GeneID:885722" CDS complement(2110591..2111361) /locus_tag="Rv1863c" /function="UNKNOWN" /note="Rv1863c, (MTCY359.10), len: 256 aa. Probable conserved integral membrane protein, similar to Rv0804|Z95618|MTCY7H7A.05 Hypothetical protein from Mycobacterium tuberculosis (209 aa), FASTA scores: opt: 199, E(): 1e-06, (33.2% identity in 220 aa overlap); and Rv0658c. TBPARSE score is 0.912." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216379.1" /db_xref="GI:15609000" /db_xref="GeneID:885722" /translation="MSDHLTACAAVHPGPLVSHLSVMHRFRIYVDIAVVVLVLVLTNL IAHFTTPWASIATVPAAAVGLVILVRSRGLGWAELGLSRQHWKSGLVYALAAVALVVA VISVGVLLPITRPMFMNHHYATISGAVIASMVMIPLQTVIPEELAFRGVLHGALNRAW GFRGVAVAGSVLFGLWHIATSLGLTSSNVGFTRLFGGGIIGLVAGVMLAVLATGVAGF VFSWLRRRSGSLIAPIALHWSLNGMGALAAALVWHLST" gene complement(2111354..2112109) /locus_tag="Rv1864c" /db_xref="GeneID:885800" CDS complement(2111354..2112109) /locus_tag="Rv1864c" /function="UNKNOWN" /note="Rv1864c, (MTCY359.09), len: 251 aa. Conserved hypothetical protein. Similar to other hypothetical proteins e.g. AL031317|SC6G4.43 from Streptomyces coelicolor cosmid 6G (233 aa), FASTA scores: opt: 716, E(): 0, (54.4% identity in 215 aa overlap); also P43976|YIIM_HAEIN hypothetical protein hi0278 (221 aa), FASTA scores: opt: 223, E(): 3.8e-08, (29.5% identity in 173 aa overlap). TBPARSE score is 0.919" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216380.1" /db_xref="GI:15609001" /db_xref="GeneID:885800" /translation="MTVAPRRLAWTNARQSYPVRVAHVLSVNLARVRANPDPRAQSKL TGIDKVAASEAVMVRAPGSMHAGVGSGLVGDTVGNPKLHGGDDQAVYAYAREDLDAWE TQLHRTLHNGMFGENLTTSGVDVTYARIGERWRIGSDGLVLEVSAPRIPCRTFAAFLD LRYWIKTFTRAAKPGAYLRVIAPGTVRAGDTITVDYRPEHNVTVGLVFRARTSESELL PQLLAADALAAELKAYARERTPSPPPVDSADDV" gene complement(2112106..2112966) /locus_tag="Rv1865c" /db_xref="GeneID:885806" CDS complement(2112106..2112966) /locus_tag="Rv1865c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1865c, (MTCY359.08), len: 286 aa. Probable short-chain dehydrogenase (EC 1.-.-.-), highly similar to C-terminus of NP_301650.1|NC_00267 putative oxidoreductase from Mycobacterium leprae (596 aa). Also similar to various dehydrogenases, generally belonging to short-chain family, e.g. AAG02168.1|AF212041_24|AF212041 3-oxoacyl-(acylcarrier protein) reductase from Zymomonas mobilis (251 aa); P50198|LINX_PSEPA 2,5-DICHLORO-2,5-CYCLOHEXADIENE-1,4-DIOL DEHYDROGENASE from Sphingomonas paucimobilis (250 aa); NP_105680.1|NC_002678 sorbitol dehydrogenase (also similar to acetoin reductase) from Mesorhizobium loti (256 aa); etc. And highly similar to C-terminus of ephD|Rv2214c|MTCY190.25c from Mycobacterium tuberculosis (592 aa); and many other oxidoreductases from Mycobacterium tuberculosis e.g. Y00P_MYCTU|Q10402 putative oxidoreductase (650 aa), FASTA scores: opt: 439, E(): 8.9e-20, (32.5% identity in 280 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216381.1" /db_xref="GI:15609002" /db_xref="GeneID:885806" /translation="MPGRTSIGVKIRDKVQDKVIAITGGARGIGLATAAALHNLGAKV AIGDIDEAMAKESGADLDLDMYGKLDVTDPDSFSGFLDAVERQLGPIDVLVNNAGIMP VGRIVDEPDPVTRRILDINVYGVILGSKLAAQRMVPRGRGHVINVASLAGEIYAVGVA TYCASKHAVVAFTDSARLEYRSAGVKFSMVLPSFVNTELIAGTGGIKGFKNAEPADIA DAIVGLIVHPKPRVRVTKAAGSMIVAQRFMPRQVSEGLNRLLGGEHVFTDDVDMEKRR TYEARARGEE" misc_feature complement(2112436..2112522) /locus_tag="Rv1865c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 2113140..2115476 /locus_tag="Rv1866" /db_xref="GeneID:885733" CDS 2113140..2115476 /locus_tag="Rv1866" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1866, (MTCY359.07c), len: 778 aa. Conserved hypothetical protein, N-terminal region similar to fatty acyl-CoA racemases e.g. Rv0855, Rv1143, and C-terminal region (from aa 370) similar to L-carnitine dehydratases, racemases, and Rv3272|MTCY71.12 Mycobacterium tuberculosis (394 aa), FASTA score: opt: 472, E(): 2.1e-21, (29.9% identity in 388 aa overlap). Also similar to P31572|CAIB_ECOLI L-CARNITINE DEHYDRATASE (EC 4.2.1.89) (405 aa), FASTA score: opt: 306, E(): 2.1e-11, (23.3% identity in 424 aa overlap). TBPARSE score is 0.921." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216382.1" /db_xref="GI:15609003" /db_xref="GeneID:885733" /translation="MVTRLLADLGADVLKVEPPGGSPGRHVRPTLAGTSIGFAMHNAN KRSAVLNPLDESDRRRFLDLAASADIVVDCGLPGQAAAYGASCAELADRYRHLVALSI TDFGAAGPRSSWRATDPVLYAMSGALSRSGPTAGTPVLPPDGIASATAAVQAAWAVLV AYFNRLRCGTGDYIDFSRFDAVVMALDPPFGAHGQVAAGIRSTGRWRGRPKNQDAYPI YPCRDGYVRFCVMAPRQWRGLRRWLGEPEDFQDPKYDVIGARLAAWPQISVLVAKLCA EKTMKELVAAGQALGVPITAVLTPSRILASEHFQAVGAITDAELVPGVRTGVPTGYFV VDGKRAGFRTPAPAAGQDEPRWLADPAPVPPPSGRVGGYPFEGLRILDLGIIVAGGEL SRLFGDLGAEVIKVESADHPDGLRQTRVGDAMSESFAWTHRNHLALGLDLRNSEGKAI FGRLVAESDAVFANFKPGTLTSLGFSYDVLHAFNPRIVLAGSSAFGNRGPWSTRMGYG PLVRAATGVTRVWTSDEAQPDNSRHPFYDATTIFPDHVVGRVGALLALAALIHRDRTG GGAHVHISQAEVVVNQLDTMFVAEAARATDVAEIHPDTSVHAVYPCAGDDEWCVISIR SDDEWRRATSVFGQPELANDPRFGASRSRVANRSELVAAVSAWTSTRTPVQAAGALQA AGVAAGPMNRPSDILEDPQLIERNLFRDMVHPLIARPLPAETGPAPFRHIPQAPQRPA PLPGQDSVQICRKLLGMTADETERLINERVMFGPAVTA" gene 2115764..2117248 /locus_tag="Rv1867" /db_xref="GeneID:885704" CDS 2115764..2117248 /locus_tag="Rv1867" /EC_number="2.3.1.9" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_216383.1" /db_xref="GI:15609004" /db_xref="GeneID:885704" /translation="MPVDPRTPVLIGYGQVNHRGDIDAEKQSIEPVDLMAAAARKAAD STVLEAVDSIRVVHMLSAHYRNPGQLLGERIKARTFTTGYSGVGGNMPQSLVNRACLD IQRGRAGVVLLAGAETWRTRTGLRAKGSKLEWTVQDESVPLPDMAGDDVPMAGAAELR INLDRPAYVYPIFEQALRIAYGESIENHRKRIGELWARFSAVAADNPHAWIRNPVTAD EIWQPGPQNRMVSWPYTKLMNSNNMVDQGAALLLTSVERATRLRIPAERWVYPQAGTD AHDTPAVADRHRLHRSTAIRIAGARALELAGLGLDDIEYVDLYSCFPSAVQVAAIELG LDTDDPARPLTVTGGLTFAGGPWSNYVTHSIATMAELLAANPGRRGLITANGGYLTKH SFGVYGTEPPSEFRWEDMQPAVDREPTGDGLVEWEGIGTVEAWTTPVNRDGQPEKAFL AVRTPDGSRSLAVITDPASVQATVREDIAGVKVAVAPDGTATLR" gene 2117347..2119446 /locus_tag="Rv1868" /db_xref="GeneID:885757" CDS 2117347..2119446 /locus_tag="Rv1868" /function="UNKNOWN" /note="Rv1868, (MTCY359.05c), len: 699 aa. Conserved hypothetical protein, similar to products of three consecutive ORFS in Mycobacterium leprae MLCB2052.18|Z98604|B2052 (257 aa), FASTA scores: opt: 314, E(): 9.9e-12, (35.2% identity in 213 aa overlap); MLCB2052.17, and MLCB2052.16. Also similar to M. tuberculosis hypothetical protein Rv2047c. TBPARSE score is 0.926." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216384.1" /db_xref="GI:15609005" /db_xref="GeneID:885757" /translation="MQILVTDATGAVGRSVTRQLIAAGHTVSGIAQHPHDALDPRVDY VCASLRNPVLQELAGEADAVIHLAPVDTSAPGGVGITGLAHVANAAARAGARLLFVSQ AAGRPELYRQAETLVSTGWAPSLVIRIAPPVGRQLDWMVCRTVATLLRSKVSARPIRV LHLDDLVRFLVLALNTDRNGVVDLATPDTTNVVTAWRLLRSVDPHLRTRRVRSWEQLI PEVDIAAVQEDWNFEFGWQATEAIVDTGRGLVGRRLHPAGATNGSGQLALPVEAPPRS VPSHGEPLGSAAPEGLEGEFDDRIDERFPVFSSASLAEALPGPLTPMTLDVQLSGLRA AGRAMGRVLALGGVVADEWERRAIAVFGHRPYIGVSANIVAAAQLPGWDAQAVARRAL GEQPQVTELLPFGRPQLAGGPLGSVAKVVVTARSLALLRHLRSDTHHYVAAADAEHLA AGQLASLPDAGLEVRIRLLRDRIHQGWILTVLWVIDTGVTAATLEHTRAGSAVSGGGM IMESGRIGAEIAPLAAVLRADPPLCALANDGNLASIRALSAPAAAAVDAVIARIGHRG LGEAELANLTFADDPALLLKTAAEIAARPAGPAHPATLIQRLAAGTRSARELAHDTTI RFTHELRMTLRELGSRRVAADVIDVVDDVFYLTCDELITTPADARLRIKRRRAERERL QAQRPPDVIDHAWVPVE" gene complement(2119460..2120695) /locus_tag="Rv1869c" /db_xref="GeneID:885796" CDS complement(2119460..2120695) /locus_tag="Rv1869c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1869c, (MTCY359.04), len: 411 aa. Probable reductase (1.-.-.-). Similar to several reductases e.g. CAC04223.1|AL391515 putative ferredoxin reductase from Streptomyces coelicolor (420 aa); THCD_RHOSO|P43494 rhodocoxin reductase (426 aa), FASTA scores: opt: 904, E(): 0, (40.8% identity in 370 aa overlap). Also similar to Mycobacterium tuberculosis proteins Rv0688 (406 aa) (39.9% identity in 391 aa overlap); and Rv0253 (nitrite reductase subunit). TBPARSE score is 0.918." /codon_start=1 /transl_table=11 /product="reductase" /protein_id="NP_216385.1" /db_xref="GI:15609006" /db_xref="GeneID:885796" /translation="MASSTTFVIVGGGLAGAKAVEALRRSDFGGRIILFGDEEHLPYD RPPLSKEFLAGKKSLSDFTIQTSDWYRDHDVDVRLGVRVSSLDRSAHTVELPDGAAVR YDKLLLATGSAPRRPPIPGSDAAGVHYLRSYNDAVALNSVLVQGSSLAVVGAGWIGLE VAASARQRGVDVTVVETAIQPLLAALGEAVGKVFADLHRDQGVDLRLQTQLEEITAAD GKATGLKMRDGSTVAADAVLVAVGAKPNVELAQQAGLAMGEGGVLVDASLRTSDPDIY AVGDIAAAEHPLLGTRVRTEHWANALKQPAVAAAGMLGRPGEYAELPYLFTDQYDLGM EYVGHAPSCDRVVFRGNVAGREFLSFWLDGDSRVLAGMNVNVWDVVDDVKGLIRSGNP VDVDRLVDPQWPLADLTTN" gene complement(2120795..2121430) /locus_tag="Rv1870c" /db_xref="GeneID:885797" CDS complement(2120795..2121430) /locus_tag="Rv1870c" /function="UNKNOWN" /note="Rv1870c, (MTCY359.03), len: 211 aa. Conserved hypothetical protein. Some similarity to SC6F7.17c hypothetical protein from Streptomyces coelicolor (216 aa). TBPARSE score is 0.939" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216386.1" /db_xref="GI:15609007" /db_xref="GeneID:885797" /translation="MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMP LFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRY DESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLRE VQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSNALLAAALVRVA" gene complement(2121495..2121884) /locus_tag="Rv1871c" /db_xref="GeneID:885804" CDS complement(2121495..2121884) /locus_tag="Rv1871c" /function="UNKNOWN" /note="Rv1871c, (MTCY359.02), len: 129 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins Q11057|Rv1261|MTCY50.21 (149 aa), FASTA score: opt: 125, E(): 0.019, (32.6% identity in 89 aa overlap); Rv0523c, and Rv1598c. TBPARSE score is 0.909" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216387.1" /db_xref="GI:15609008" /db_xref="GeneID:885804" /translation="MNAAMNLKREFVHRVQRFVVNPIGRQLPMTMLETIGRKTGQPRR TAVGGRVVDNQFWMVSEHGEHSDYVYNIKANPAVRVRIGGRWRSGTAYLLPDDDPRQR LRGLPRLNSAGVRAMGTDLLTIRVDLD" gene complement(2121907..2123151) /gene="lldD2" /locus_tag="Rv1872c" /db_xref="GeneID:885754" CDS complement(2121907..2123151) /gene="lldD2" /locus_tag="Rv1872c" /EC_number="1.1.2.3" /function="INVOLVED IN RESPIRATION; CATALYZES CONVERSION OF LACTATE INTO PYRUVATE [CATALYTIC ACTIVITY: (S)-LACTATE + 2 FERRICYTOCHROME C = PYRUVATE + 2 FERROCYTOCHROME C]." /experiment="experimental evidence, no additional details recorded" /note="Rv1872c, (MTCY180.46, MTCY359.01), len: 414 aa (start uncertain). Possible lldD2, L-lactate dehydrogenase (cytochrome) (EC 1.1.2.3), similar to other lactate dehydrogenases and other oxidases e.g. LLDD_ECOLI|P33232 l-lactate dehydrogenase (cytochrome) from Escherichia coli strain K12 (396 aa), FASTA results: opt: 674, E(): 1.1e-37, (40.5% identity in 279 aa overlap); Q51135 LACTATE DEHYDROGENASE from Neisseria meningitidis (390 aa), FASTA results: opt: 309, E(): 4.1e-15, (42.5% identity in 113 aa overlap); etc. Also shows similarity with Rv0694|lldD1|MTCY210.11 POSSIBLE L-LACTATE DEHYDROGENASE (CYTOCHROME) from Mycobacterium tuberculosis (396 aa). Contains PS00557 FMN-dependent alpha-hydroxy acid dehydrogenases active site. BELONGS TO THE FMN-DEPENDENT ALPHA-HYDROXY ACID DEHYDROGENASES FAMILY. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="L-lactate dehydrogenase (cytochrome) LldD2" /protein_id="NP_216388.1" /db_xref="GI:15609009" /db_xref="GeneID:885754" /translation="MAVNRRVPRVRDLAPLLQFNRPQFDTSKRRLGAALTIQDLRRIA KRRTPRAAFDYADGGAEDELSIARARQGFRDIEFHPTILRDVTTVCAGWNVLGQPTVL PFGIAPTGFTRLMHTEGEIAGARAAAAAGIPFSLSTLATCAIEDLVIAVPQGRKWFQL YMWRDRDRSMALVRRVAAAGFDTMLVTVDVPVAGARLRDVRNGMSIPPALTLRTVLDA MGHPRWWFDLLTTEPLAFASLDRWPGTVGEYLNTVFDPSLTFDDLAWIKSQWPGKLVV KGIQTLDDARAVVDRGVDGIVLSNHGGRQLDRAPVPFHLLPHVARELGKHTEILVDTG IMSGADIVAAIALGARCTLIGRAYLYGLMAGGEAGVNRAIEILQTGVIRTMRLLGVTC LEELSPRHVTQLRRLGPIGAPT" misc_feature complement(2122237..2122257) /gene="lldD2" /locus_tag="Rv1872c" /note="PS00557 FMN-dependent alpha-hydroxy acid dehydrogenases active site" gene 2123174..2123611 /locus_tag="Rv1873" /db_xref="GeneID:885789" CDS 2123174..2123611 /locus_tag="Rv1873" /function="UNKNOWN" /note="Rv1873, (MTCY180.45c), len: 145 aa. Conserved hypothetical protein. Some similarity to AL591783 hypothetical protein from Sinorhizobium meliloti. TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216389.1" /db_xref="GI:15609010" /db_xref="GeneID:885789" /translation="MKSASDPFDLKRFVYAQAPVYRSVVEELRAGRKRGHWMWFVFPQ LRGLGSSPLAVRYGISSLEEAQAYLQHDLLGPRLHECTGLVNQVQGRSIEEIFGPPDD LKLCSSMTLFARATDANQDFVALLAKYYGGGEDRRTVALLAVT" gene 2123684..2124370 /locus_tag="Rv1874" /db_xref="GeneID:885748" CDS 2123684..2124370 /locus_tag="Rv1874" /function="UNKNOWN" /note="Rv1874, (MTCY180.44c), len: 228 aa. Hypothetical unknown protein, TBparse score is 0.928" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216390.1" /db_xref="GI:15609011" /db_xref="GeneID:885748" /translation="MLMRPEPDDDWCARQRAQVADALLGLGVAGLSINVRDSTVRDSL MTLTTLYPPVAAVVSLWTQQCYGEQVAAALRLLAQECDELGAYLVTESVPLTFPSLVE SGSRTPGLANIALLRRPDGLDQATWLTRWQRDHTQVAIEAQATFGYTQNWVVRALTPE APGIAGIVEELFPVAATTDLKAFFGAADDNDLRNRISRMVASTSAFGANQNIDTVPTS RYVFRTPFKD" gene 2124381..2124824 /locus_tag="Rv1875" /db_xref="GeneID:885793" CDS 2124381..2124824 /locus_tag="Rv1875" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1875, (MTCY180.43c), len: 147 aa. Conserved hypothetical protein. Some similarity to Mycobacterium tuberculosis hypothetical proteins e.g. Rv1155|MTCI65.22|Z95584 (147 aa), FASTA scores: opt: 178, E(): 7.4e-06, (26.9% identity in 130 aa overlap); Rv0121c and Rv2074. Also similar to AL079356|SC6G9.21 hypothetical protein from Streptomyces coelicolor (144 aa), FASTA scores: opt: 239, E(): 3.1 e-09, (38.7% identity in 137 aa overlap). TBparse score is 0.908" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216391.1" /db_xref="GI:15609012" /db_xref="GeneID:885793" /translation="MTTLNEAAALAAAERGLAVVSTVRADGTVQASLVNVGLLPHPVS GEPSLGFTTYGKVKLGNLRARPQLAVTFRNGWQWATVEGRAQLVGPDDPRPWLVDGER LRLLLREVFTAAGGTHDDWDEYDRVMAQEQRAVVLITPTRIYSNG" gene 2125340..2125819 /gene="bfrA" /locus_tag="Rv1876" /db_xref="GeneID:885767" CDS 2125340..2125819 /gene="bfrA" /locus_tag="Rv1876" /function="INVOLVED IN IRON STORAGE (MAY PERFORM ANALOGOUS FUNCTIONS IN IRON DETOXIFICATION AND STORAGE AS THAT OF ANIMAL FERRITINS); FERRITIN IS AN INTRACELLULAR MOLECULE THAT STORES IRON IN A SOLUBLE, NONTOXIC, READILY AVAILABLE FORM. THE FUNCTIONAL MOLECULE, WHICH IS COMPOSED OF 24 CHAINS, IS ROUGHLY SPHERICAL AND CONTAINS A CENTRAL CAVITY IN WHICH THE POLYMERIC FERRIC IRON CORE IS DEPOSITED." /note="Rv1876, (MTCY180.42c), len: 159 aa. Probable bfrA (alternate gene name: bfr), bacterioferritin (see citation below), similar to BFR_MYCLE|P43315 bacterioferritin (bfr) from Mycobacterium leprae (159 aa), FASTA results: opt: 958, E(): 0, (90.6% identity in 159 aa overlap). Also similar to Rv3841|MTCY01A6.28c|bfrB POSSIBLE BACTERIOFERRITIN from Mycobacterium tuberculosis (181 aa). BELONGS TO THE BACTERIOFERRITIN FAMILY. TBparse score is 0.913.; bfr" /codon_start=1 /transl_table=11 /product="bacterioferritin" /protein_id="NP_216392.1" /db_xref="GI:15609013" /db_xref="GeneID:885767" /translation="MQGDPDVLRLLNEQLTSELTAINQYFLHSKMQDNWGFTELAAHT RAESFDEMRHAEEITDRILLLDGLPNYQRIGSLRIGQTLREQFEADLAIEYDVLNRLK PGIVMCREKQDTTSAVLLEKIVADEEEHIDYLETQLELMDKLGEELYSAQCVSRPPT" gene 2125904..2127967 /locus_tag="Rv1877" /db_xref="GeneID:885654" CDS 2125904..2127967 /locus_tag="Rv1877" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF DRUG ACROSS THE MEMBRANE." /note="Rv1877, (MTCY180.41c), len: 687 aa. Probable conserved integral membrane protein, part of major facilitator superfamily (MFS), similar to many antibiotic and drug efflux proteins. Similar to e.g. Q56175 TU22 DTDP-GLUCOSE DEHYDRTATASE from Streptomyces violaceoruber (557 aa), FASTA scores: opt: 895, E(): 0, (34.7% identity in 528 aa overlap). Also similar to Mycobacterium tuberculosis relatives protein, include Rv3728, Rv3239c, Rv2846c, etc. Contains PS00217 Sugar transport proteins signature 2 (PS00217). TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216393.1" /db_xref="GI:15609014" /db_xref="GeneID:885654" /translation="MAGPTAPTTAPTAIRAGGPLLSPVRRNIIFTALVFGVLVAATGQ TIVVPALPTIVAELGSTVDQSWAVTSYLLGGTVVVVVAGKLGDLLGRNRVLLGSVVVF VVGSVLCGLSQTMTMLAISRALQGVGAGAISVTAYALAAEVVPLRDRGRYQGVLGAVF GVNTVTGPLLGGWLTDYLSWRWAFWINVPVSIAVLTVAATAVPALARPPKPVIDYLGI LVIAVATTALIMATSWGGTTYAWGSATIVGLLIGAAVALGFFVWLEGRAAAAILPPRL FGSPVFAVCCVLSFVVGFAMLGALTFVPIYLGYVDGASATASGLRTLPMVIGLLIAST GTGVLVGRTGRYKIFPVAGMALMAVAFLLMSQMDEWTPPLLQSLYLVVLGAGIGLSMQ VLVLIVQNTSSFEDLGVATSGVTFFRVVGASFGTATFGALFVNFLDRRLGSALTSGAV PVPAVPSPAVLHQLPQSMAAPIVRAYAESLTQVFLCAVSVTVVGFILALLLREVPLTD IHDDADDLGDGFGVPRAESPEDVLEIAVRRMLPNGVRLRDIATQPGCGLGVAELWALL RIYQYQRLFEAVRLTDIGRHLHVPYQVFEPVFDRLVQTGYAARDGDILTLTPSGHRQV DSLAVLIRQWLLDHLAVAPGLKRQPDHQFEAALQHVTDAVLVQRDWYEDLGDLSESRQ LAATT" misc_feature 2126273..2126350 /locus_tag="Rv1877" /note="PS00217 Sugar transport proteins signature 2" gene 2128022..2129374 /gene="glnA3" /locus_tag="Rv1878" /db_xref="GeneID:885761" CDS 2128022..2129374 /gene="glnA3" /locus_tag="Rv1878" /EC_number="6.3.1.2" /function="INVOLVED IN GLUTAMINE BIOSYNTHESIS [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE + NH(3) = ADP + GLUTAMINE + ORTHOPHOSPHATE]." /note="Rv1878, (MTCY180.40c), len: 450 aa. Probable glnA3, glutamine synthetase class I (EC 6.3.1.2), similar to many e.g. GLNA_BACCE|P19064 from Bacillus cereus (443 aa), FASTA results: opt: 497, E(): 5.2e-23, (29.0% identity in 331 aa overlap); etc. Also similar to C-terminus of FLUG_EMENI|P38094 flug protein from emericella nidulans (865 aa), FASTA scores: opt: 227, E (): 6.4e-13, (29.9% identity in 394 aa overlap). Note that the downstream ORF MTCY180.39c is similar to the N-terminus. Also similar to three other potential glutamine synthases in M. tuberculosis: Q10378|GLN2_MYCTU|GLNA2|Rv2222c|MT2280|MTCY190.33c|MTCY42 7. 03c; Rv2860c|MTV003.06c|glnA4 and Rv2220|glnA1. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY. TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="glutamine synthetase" /protein_id="NP_216394.1" /db_xref="GI:15609015" /db_xref="GeneID:885761" /translation="MTATPLAAAAIAQLEAEGVDTVIGTVVNPAGLTQAKTVPIRRTN TFANPGLGASPVWHTFCIDQCSIAFTADISVVGDQRLRIDLSALRIIGDGLAWAPAGF FEQDGTPVPACSRGTLSRIEAALADAGIDAVIGHEVEFLLVDADGQRLPSTLWAQYGV AGVLEHEAFVRDVNAAATAAGIAIEQFHPEYGANQFEISLAPQPPVAAADQLVLTRLI IGRTARRHGLRVSLSPAPFAGSIGSGAHQHFSLTMSEGMLFSGGTGAAGMTSAGEAAV AGVLRGLPDAQGILCGSIVSGLRMRPGNWAGIYACWGTENREAAVRFVKGGAGSAYGG NVEVKVVDPSANPYLASAAILGLALDGMKTKAVLPSETTVDPTQLSDVDRDRAGILRL AADQADAIAVLDSSKLLRCILGDPVVDAVVAVRQLEHERYGDLDPAQLADKFRMAWSV" gene 2129377..2130513 /locus_tag="Rv1879" /db_xref="GeneID:885650" CDS 2129377..2130513 /locus_tag="Rv1879" /function="UNKNOWN" /note="Rv1879, (MTCY180.39c), len: 378 aa. Conserved hypothetical protein, similar to SCC22.14c|AL096839 hypothetical protein from Streptomyces coelicolor (368 aa), FASTA results: opt: 772, E(): 0 (40.3% identity in 372 aa overlap); and to N-terminal half of nodulin/glutamate-ammonia ligase-like protein. Some similarity to N-terminus of AL132958|ATT4D2_11 Arabidopsis thaliana (845 aa), FASTA results: opt: 354, E(): 3.1e-16, (29.2% identity in 383 aa overlap); and to P38094|FLUG_EMENI Flug protein of Emericella nidulans (865 aa), FASTA results: opt: 306, E(): 6.2e-13, (26.5% identity in 415 aa overlap). Note that the upstream ORF Rv1878|MTCY18 0.40c is similar to the C-terminus. TBparse score is 0.933" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216395.1" /db_xref="GI:15609016" /db_xref="GeneID:885650" /translation="MADSAGSDLTRHTAEVPLIDQHVHGCWLTEGNRRRFENALNEAN TEPLADFDSGFDSQLGFAVRNHCAPILGLPRHVDPQTYWDRRSQFSEAELARRFLQAA GVTDWLVETGIGYDVSGMASVAGLGELSGSHAHEVVRLEQVAEQAVQASGDYASAFNE ILRRRAATAVATKSILAYRGGFDGDLTEPPAAQVAEAAKRWRDRGGVRLQDRVLLRFG LHQALRLGKPLQFHVGFGDRDADLHKANPLYLLDFLRQSGNTPIVLLHCYPYEREAGY LAQAFNNVYLDGGLSVHYLGARSPAFIGRLLELAPFRKIVYSSDGFGPAELHFLGATL WRSGIQRVLRGFVERDDWCETDALRVVDLIAHGTAARIYRLGDR" gene complement(2130541..2131857) /gene="cyp140" /locus_tag="Rv1880c" /db_xref="GeneID:885758" CDS complement(2130541..2131857) /gene="cyp140" /locus_tag="Rv1880c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv1880c, (MT1929, MTCY180.38), len: 438 aa. Probable cyp140, cytochrome p450 (EC 1.14.-.-). Similar to Q00441|CPXJ_SACER 6-deoxyerythronolide beta hydroxylase (404 aa), FASTA scores: opt: 775, E(): 0, (44.2% identity in 319 aa overlap); and other members of the cytochrome P450 family. Related to Mycobacterium tuberculosis proteins include: Rv0766c, Rv2266, Rv0778, etc. Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="cytochrome p450 140 CYP140" /protein_id="NP_216396.1" /db_xref="GI:15609017" /db_xref="GeneID:885758" /translation="MKDKLHWLAMHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPF YDEVRSHGALVRNRANYLTVDHRLAHDLLRSDDFRVVSFGENLPPPLRWLERRTRGDQ LHPLREPSLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLLDRFAEQPGIV DVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSLDIGIPWRQYLRVQQGIRG FDCWLEGHLQQLRHAPGDDLMSQLIQIAESGDNETQLDETELRAIAGLVLVAGFETTV NLLGNGIRMLLDTPEHLATLRQHPELWPNTVEEILRLDSPVQLTARVACRDVEVAGVR IKRGEVVVIYLAAANRDPAVFPDPHRFDIERPNAGRHLAFSTGRHFCLGAALARAEGE VGLRTFFDRFPDVRAAGAGSRRDTRVLRGWSTLPVTLGPARSMVSP" misc_feature complement(2130709..2130738) /gene="cyp140" /locus_tag="Rv1880c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(2131907..2132329) /gene="lppE" /locus_tag="Rv1881c" /db_xref="GeneID:885762" CDS complement(2131907..2132329) /gene="lppE" /locus_tag="Rv1881c" /function="UNKNOWN" /note="Rv1881c, (MTCY180.37), len: 140 aa. Possible lppE, lipoprotein, showing some similarity to L12238|MSG18S19K_1 19K antigen from Mycobacterium intracellulare (162 aa), FASTA scores: opt: 137, E(): 0.0069, (27.6% identity in 156 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.941." /codon_start=1 /transl_table=11 /product="lipoprotein LppE" /protein_id="NP_216397.1" /db_xref="GI:15609018" /db_xref="GeneID:885762" /translation="MCNRLVTVTGVAMVVAAGLSACGQAQTVPRKAARLTIDGVTHTT RPATCSQEHSYRTIDIRNHDSTVQAVVLLSGDRVIPQWVKIRNVDGFNGSFWHGGVGN ARADRARNTYTVAGSAYGISSKKPNTVVSTDFNILAEC" gene complement(2132370..2133203) /locus_tag="Rv1882c" /db_xref="GeneID:885782" CDS complement(2132370..2133203) /locus_tag="Rv1882c" /EC_number="1.1.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1882c, (MTCY180.36), len: 277 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases, generally belonging to SDR family, e.g. NP_250789.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (251 aa); NP_421760.1|NC_002696 short chain dehydrogenase family protein from Caulobacter crescentus (270 aa); NP_107167.1|NC_002678 oxidoreductase (short chain dehydrogenase/reductase family) from Mesorhizobium loti (253 aa); P50197|LINC_PSEPA 2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (250 aa), FASTA scores: opt: 301, E(): 2.3e-12, (30.0% identity in 223 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. Rv3057c, Rv1245, etc. Contains possible helix-turn-helix motif at aa 246-267 (+4.32 SD). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216398.1" /db_xref="GI:15609019" /db_xref="GeneID:885782" /translation="MKAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQ LGAERLWARAVDVTDKAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYEAA VRVVDVNFKAVLTGAYAALPYLKKAPGSLMFSTSSSSGTYGMPRIAVYSATKHAVKGL TEALSVEWQRHGVRVADVLPGLIDTAILTSTRQHSDEGPYTISAEQIRAAAPKKGMFR LMPSSSVAEAAWRAYQHPTRLHWYVPRSIRWIDRLKGVSPEFVRRHIAKSLATLEPKR K" misc_feature complement(2132709..2132795) /locus_tag="Rv1882c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(2133231..2133692) /locus_tag="Rv1883c" /db_xref="GeneID:885484" CDS complement(2133231..2133692) /locus_tag="Rv1883c" /function="UNKNOWN" /note="Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08, (34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB from Streptomyces actuosus (173 aa), FASTA score: opt: 207, E(): 1.8e-07, (40.2% identity in 102 aa overlap). TBparse score is 0.923" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216399.1" /db_xref="GI:15609020" /db_xref="GeneID:885484" /translation="MCLDQVMEGSATVHMAAPPDKIWTLIADVRNTGRFSPETFEAEW LDGATGPALGARFRGHVRRNGIGPVYWTVCEPGREFGFAVLLGDRPVNNWHYRLTPTA DGTEVTESFRLPPSVLTTVYYRVFGGWLRQRRNIRDMTKTLQRIKDLVEAG" gene complement(2133731..2134261) /gene="rpfC" /locus_tag="Rv1884c" /db_xref="GeneID:885759" CDS complement(2133731..2134261) /gene="rpfC" /locus_tag="Rv1884c" /function="THOUGHT TO PROMOTE THE RESUSCITATION AND GROWTH OF DORMANT, NONGROWING CELL. COULD ALSO STIMULATES THE GROWTH OF SEVERAL OTHER HIGH G+C GRAM+ ORGANISMS, e.g. Mycobacterium avium, Mycobacterium bovis (BCG), Mycobacterium kansasii, Mycobacterium smegmatis." /note="Rv1884c, (MTCY180.34), len: 176 aa. Probable rpfC, resuscitation promoting factor (see citation below), similar to Z96935|MLRPF_1 resusicitation-promoting factor from Micrococcus luteus (220 aa), FASTA score: opt: 287, E() : 3.3e-11, (40.0% identity in 120 aa overlap). Also similar to others from Mycobacterium tuberculosis: Rv2389c|MTCY253.32|RPFD PROBABLE RESUSCITATION-PROMOTING FACTOR (154 aa), FASTA score: opt: 382, E(): 7.1e-17, (55.4% identity in 101 aa overlap); Rv0867c|RPFA (N-terminal part), Rv2450c|RPFE, and Rv1009|RPFB (C-terminal part). TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="resuscitation-promoting factor RpfC" /protein_id="NP_216400.1" /db_xref="GI:15609021" /db_xref="GeneID:885759" /translation="MHPLPADHGRSRCNRHPISPLSLIGNASATSGDMSSMTRIAKPL IKSAMAAGLVTASMSLSTAVAHAGPSPNWDAVAQCESGGNWAANTGNGKYGGLQFKPA TWAAFGGVGNPAAASREQQIAVANRVLAEQGLDAWPTCGAASGLPIALWSKPAQGIKQ IINEIIWAGIQASIPR" gene complement(2134273..2134872) /locus_tag="Rv1885c" /db_xref="GeneID:885772" CDS complement(2134273..2134872) /locus_tag="Rv1885c" /EC_number="5.4.99.5" /function="UNKNOWN" /note="catalyzes the interconversion of chorismate to prephenate" /codon_start=1 /transl_table=11 /product="chorismate mutase" /protein_id="NP_216401.1" /db_xref="GI:15609022" /db_xref="GeneID:885772" /translation="MLTRPREIYLATAVSIGILLSLIAPLGPPLARADGTSQLAELVD AAAERLEVADPVAAFKWRAQLPIEDSGRVEQQLAKLGEDARSQHIDPDYVTRVFDDQI RATEAIEYSRFSDWKLNPASAPPEPPDLSASRSAIDSLNNRMLSQIWSHWSLLSAPSC AAQLDRAKRDIVRSRHLDSLYQRALTTATQSYCQALPPA" gene complement(2134890..2135867) /gene="fbpB" /locus_tag="Rv1886c" /db_xref="GeneID:885785" CDS complement(2134890..2135867) /gene="fbpB" /locus_tag="Rv1886c" /EC_number="2.3.1.-" /function="INVOLVED IN CELL WALL MYCOLOYLATION. PROTEINS OF THE ANTIGEN 85 COMPLEX ARE RESPONSIBLE FOR THE HIGH AFFINITY OF MYCOBACTERIA TO FIBRONECTIN. POSSESSES A MYCOLYLTRANSFERASE ACTIVITY REQUIRED FOR THE BIOGENESIS OF TREHALOSE DIMYCOLATE (CORD FACTOR), A DOMINANT STRUCTURE NECESSARY FOR MAINTAINING CELL WALL INTEGRITY." /experiment="experimental evidence, no additional details recorded" /note="Rv1886c, (MT1934, MTCY180.32), len: 325 aa. fbpB (alternate gene names: mpt59, 85B), precursor of the 85-B antigen (fibronectin-binding protein B) (mycolyl transferase 85B) (EC 2.3.1.-) (see citations below), highly similar to other Mycobacterial antigen precursors e.g. P12942|A85B_MYCBO ANTIGEN 85-B PRECURSOR from Mycobacterium bovis (323 aa); P21160|A85B_MYCKA ANTIGEN 85-B PRECURSOR from Mycobacterium kansasii (325 aa); etc. Also highly similar to Mycobacterium tuberculosis antigen precursors: Rv3804c|fbpA (338 aa), Rv0129c|fbpC2 (340 aa), and Rv3803c|fbpC1 (299 aa). TBparse score is 0.912.; mpt59; 85B" /codon_start=1 /transl_table=11 /product="secreted antigen 85-B fbpB (85B) (antigen 85 complex B) (Mycolyl transferase 85B) (fibronectin-binding protein B) (extracellular alpha-antigen)" /protein_id="NP_216402.1" /db_xref="GI:15609023" /db_xref="GeneID:885785" /translation="MTDVSRKIRAWGRRLMIGTAAAVVLPGLVGLAGGAATAGAFSRP GLPVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLDGLRAQDDYNGWDINTPAFEWYY QSGLSIVMPVGGQSSFYSDWYSPACGKAGCQTYKWETFLTSELPQWLSANRAVKPTGS AAIGLSMAGSSAMILAAYHPQQFIYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAAD MWGPSSDPAWERNDPTQQIPKLVANNTRLWVYCGNGTPNELGGANIPAEFLENFVRSS NLKFQDAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMKGDLQSSLGAG" gene 2136258..2137400 /locus_tag="Rv1887" /db_xref="GeneID:885247" CDS 2136258..2137400 /locus_tag="Rv1887" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1887, (MTCY180.31), len: 380 aa. Hypothetical unknown protein; contains eukaryotic thiol (cysteine) proteases histidine active site at N-terminus (PS00639) and Pro-rich region near C-terminus. TBparse score is 0.935." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216403.1" /db_xref="GI:15609024" /db_xref="GeneID:885247" /translation="MDTVLGLSITPTTLGWVLAEGHGADGAILDRNELELHSGRNAQA IHTAEQLAAEVLLAHEVAAAGDHRLRVIGVTWNAEASAQAALLVESLTGAGFDNVVPV RRLRAIETLAQAIAPVIGYEQIAVCVLEHESATVVMVDTHDGKTQIAVKHVCRGLSGL TSWLTGMFGRDAWRPAGVVVVGSDSEVSEFSWQLERVLPVPVFAQTMAQVTVARGAAL AAAQSTEFTDAQLVADSVSQPTVAPRRSRHYAGAAAALAAAAVTFVASLSLAVGIQLA PHNDTGTAKHGAHKPTPRIAKAVAPAVPPPPTVTPPVPARAPRPAAQHEPPARVTSGE ALTEPNPPEEQPNASAPQQDRNDSQPITRVLEHIPGAYGDSAPPAE" misc_feature 2136426..2136458 /locus_tag="Rv1887" /note="PS00639 Eukaryotic thiol (cysteine) proteases histidine active site" gene complement(2137519..2138079) /locus_tag="Rv1888c" /db_xref="GeneID:885267" CDS complement(2137519..2138079) /locus_tag="Rv1888c" /function="UNKNOWN" /note="Rv1888c, (MTCY180.30), len: 186 aa. Possible transmembrane protein. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216404.1" /db_xref="GI:15609025" /db_xref="GeneID:885267" /translation="MQPDAYPVRVRGDLDPALSRWQWLVKWFLAIPHYIVLFFLHVAA VVVTVIAFFAILFTGRYPRTLFDFNVGVMRWRWRVAFYALSALGTDRYPPFSLQTKAE YPADLEVDYPERLSRGLVLIKWWLLAIPHYLILAVFLSSGWRVFLIDPHDRVGIMWPS LLVILLLVAVVALLFTGRYPIGLYNL" gene complement(2138444..2138617) /locus_tag="Rv1888A" /db_xref="GeneID:3205115" CDS complement(2138444..2138617) /locus_tag="Rv1888A" /function="UNKNOWN" /note="Rv1888A, len: 57 aa. Conserved hypothetical protein. Possibly continuation of Rv1889c, part of large family of Mycobacterium tuberculosis proteins with conserved N-terminal domain of 120 aa. Includes: C-terminus of Rv0726c|P95074 CONSERVED HYPOTHETICAL PROTEIN (367 aa), FASTA scores: opt: 295, E(): 3.1e-15, (73.684% identity in 57 aa overlap); C-terminus of Rv3399|Q50726|MTCY78.29c CONSERVED HYPOTHETICAL PROTEIN (348 aa), FASTA scores: opt: 504, E(): 7.3e-29, (64.2% identity in 120 aa overlap); C-terminus of Rv0731c; etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177654.1" /db_xref="GI:57116927" /db_xref="GeneID:3205115" /translation="MVPVDLRRDWPTPLRQAGFDPNQPSAWLAEGLLAFLPPDAQDRL LDNITALSAPGSR" gene complement(2138661..2139017) /locus_tag="Rv1889c" /db_xref="GeneID:885788" CDS complement(2138661..2139017) /locus_tag="Rv1889c" /function="UNKNOWN" /note="Rv1889c, (MTCY180.29), len: 118 aa. Conserved hypothetical protein. Part of large family of Mycobacterium tuberculosis proteins with conserved N-terminal domain of 120 aa. Includes: Rv3399|Q50726|MTCY78.29C CONSERVED HYPOTHETICAL PROTEIN (348 aa), FASTA results: opt: 504, E(): 7.3e-29, (64.2% identity in 120 aa overlap); Rv0726c|P95074; Rv0731c; etc. Rv1888A possibly continuation of this CDS. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216405.1" /db_xref="GI:15609026" /db_xref="GeneID:885788" /translation="MPRTNNDAWDLATSVGATATMVAAARAVATRADNPLIDDPFAEP LVRAVGIDFFTRWAAGNIKATDVDDPDGTWGLQRLADLLAARTRYFDAFFRDATSAGI RQAVILASGLDARAYR" gene complement(2139076..2139687) /locus_tag="Rv1890c" /db_xref="GeneID:885786" CDS complement(2139076..2139687) /locus_tag="Rv1890c" /function="UNKNOWN" /note="Rv1890c, (MTCY180.28), len: 203 aa. Hypothetical unknown protein. TBparse score is 0.933" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216406.1" /db_xref="GI:15609027" /db_xref="GeneID:885786" /translation="MAHKTRREGRAGRSSEYSRGVSDAVWTLDASDGELVLRTGVVGR AARLGHRLTIAMTRWQALVNWSGTDPVAGELVAEVDSFEVMRGEGGVKGLSEPEKALV RANALKTLNASRFPHIRFTTEAIAQTGNGYRLTGKLHIRGKSREHVIDLHTEDLGAAW RISADTTVRQSNYGVKPYSLLMGSIRVADEVSVAFTAVRAKDD" gene 2139741..2140148 /locus_tag="Rv1891" /db_xref="GeneID:885094" CDS 2139741..2140148 /locus_tag="Rv1891" /function="UNKNOWN" /note="Rv1891, (MTCY180.27c), len: 135 aa. Conserved hypothetical protein. Equivalent to MLCB561.09|AL049571 hypothetical protein from Mycobacterium leprae (134 aa), FASTA scores: opt: 800, E(): 0, (79.7% identity in 133 aa overlap). TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216407.1" /db_xref="GI:15609028" /db_xref="GeneID:885094" /translation="MIRELVTTAAITGAAIGGAPVAGADPQRYDGDVPGMNYDASLGA PCSSWERFIFGRGPSGQAEACHFPPPNQFPPAETGYWVISYPLYGVQQVGAPCPKPQA AAQSPDGLPMLCLGARGWQPGWFTGAGFFPPEP" gene 2140165..2140476 /locus_tag="Rv1892" /db_xref="GeneID:885090" CDS 2140165..2140476 /locus_tag="Rv1892" /function="UNKNOWN" /note="Rv1892, (MTCY180.26c), len: 103 aa. Probable membrane protein. TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216408.1" /db_xref="GI:15609029" /db_xref="GeneID:885090" /translation="MIMCEGRPTESPIPRWLRFVLTSDRAGSAWYIGAGFFFAPVLAV LSPWPTITAVLWWIIGLAGLWLGLLGIAMAVGLARVLRSGAEIPEAYWRTLVDYRSAN E" gene 2140486..2140704 /locus_tag="Rv1893" /db_xref="GeneID:885269" CDS 2140486..2140704 /locus_tag="Rv1893" /function="UNKNOWN" /note="Rv1893, (MTCY180.25c), len: 72 aa. Conserved hypothetical protein. Equivalent to MLCB561.11|AL049571 hypothetical protein from Mycobacterium leprae (74 aa), FASTA scores: opt: 317, E(): 4.6e-15, (69.4% identity in 72 aa overlap). TBparse score is 0.857." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216409.1" /db_xref="GI:15609030" /db_xref="GeneID:885269" /translation="MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASG IVKDSIDIATHAVDRTKEVFTGKTDDEG" gene complement(2140739..2141869) /locus_tag="Rv1894c" /db_xref="GeneID:885081" CDS complement(2140739..2141869) /locus_tag="Rv1894c" /function="UNKNOWN" /note="Rv1894c, (MTCY180.24), len: 376 aa. Conserved hypothetical protein, weak similarity to some oxidoreductases e.g. Q01284 2-NITROPROPANE DIOXYGENASE PRECURSOR (378 aa), FASTA results: opt: 204, E(): 5.8e-06, (34.3% identity in 140 aa overlap). Similar to hypothetical Mycobacterium tuberculosis proteins e.g. Rv3553|MTCY03C7.02c (355 aa), FASTA results: opt: 296, E(): 1.6e-10, (32.9% identity in 167 aa overlap); Rv1533 (375 aa) (48.1% identity in 376 aa overlap); Rv0021c, Rv2781c. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216410.1" /db_xref="GI:15609031" /db_xref="GeneID:885081" /translation="MHTAICDELGIEFPIFAFTHCRDVVVAVSKAGGFGVLGAVGFTP EQLEIELNWIDEHIGDHPYGVDIVIPNKYEGMDSQLSADELAKTLRSMVPQEHLDFAR KILADHGVPVEDADEDSLQLLGWTEATATPQVDAALKHPKMTMVANALGTPPADMIKH IHDSGRKVAALCGSPSQARKHADAGVDIIIAQGGEAGGHCGEVGSIVLWPQVVKEVAP VPVLAAGGIGSGQQIAAALALGTQGAWTGSQWLMVEEAANTAVQQAAYVKATSRDTVR SRSFTGKPARMLRNDWTEAWEQPESPKPLGMPLQYMVSGMAVKATHKYPNETVDVAFN PVGQVVGQFTKVEKTATVIERWVQEYLEATARLDALNAAASV" gene 2142521..2143675 /locus_tag="Rv1895" /db_xref="GeneID:885148" CDS 2142521..2143675 /locus_tag="Rv1895" /EC_number="1.1.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1895, (MTCY180.23c), len: 384 aa. Possible dehydrogenase (EC 1.1.-.-), similar to various sorbitol and alcohol dehydrogenases, and to putative glutathione-dependent aldehyde dehydrogenase e.g DHSO_BACSU|Q06004 Sorbitol dehydrogenase (EC 1.1.1.14) from Streptomyces coelicolor (352 aa), FASTA results: opt: 506, E(): 7.2e-24, (30.6% identity in 350 aa overlap); and AL109962|SCJ1.28 PUTATIVE ZINC-CONTAINING DEHYDROGENASE from Streptomyces coelicolor (356 aa), FASTA results: opt: 634, E(): 2.9e-30, (34.7% identity in 357 aa overlap). Also similar to other Mycobacterium tuberculosis dehydrogenases. Note that there is a substantial (134 bp) overlap at the C-terminus with the C-terminus of the downstream ORF, although both appear to be true coding regions. TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_216411.1" /db_xref="GI:15609032" /db_xref="GeneID:885148" /translation="MRAVVIDGAGSVRVNTQPDPALPGPDGVVVAVTAAGICGSDLHF YEGEYPFTEPVALGHEAVGTIVEAGPQVRTVGVGDLVMVSSVAGCGVCPGCETHDPVM CFSGPMIFGAGVLGGAQADLLAVPAADFQVLKIPEGITTEQALLLTDNLATGWAAAQR ADISFGSAVAVIGLGAVGLCALRSAFIHGAATVFAVDRVKGRLQRAATWGATPIPSPA AETILAATRGRGADSVIDAVGTDASMSDALNAVRPGGTVSVVGVHDLQPFPVPALTCL LRSITLRMTMAPVQRTWPELIPLLQSGRLDVDGIFTTTLPLDEAAKGYATARARSGEE LRFCLRPDSRDVLGAHETVDLYVHVRRCQSVADLQLEGAADGVDGPSMLN" gene complement(2143535..2144446) /locus_tag="Rv1896c" /db_xref="GeneID:885915" CDS complement(2143535..2144446) /locus_tag="Rv1896c" /function="UNKNOWN" /note="Rv1896c, (MTCY180.22), len: 303 aa. Conserved hypothetical protein. Similar to several (14) hypothetical Mycobacterium tuberculosis proteins e.g. Rv0145|MTCI5.19 (317 aa), FASTA results: opt: 720, E(): 0, (41.6% identity in 308 aa overlap); Q10552|YZ21_MYCTU (325 aa), opt: 689, E(): 0, (40.5% identity in 304 aa overlap); Rv0726c, Rv0731c, Rv3399, etc. and to related proteins in other actinomycetes. Note that there is a substantial (134 bp) overlap at the C-terminus with the C-terminus of the downstream ORF, although both appear to be true coding regions. TBparse score is 0.946" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216412.1" /db_xref="GI:15609033" /db_xref="GeneID:885915" /translation="MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLV QDEYAKHFITASADPYLEGLLANPRTSEDGTAFPRLYGVQTRFFDDFFNCADEAGIRQ AVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKAHRVAVPADLR TDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFARIDELCAPGSRVALGALGS RLDHEQLAALETAHPGVNMSGDVNFSALTYDDKTDPVEWLVEHGWAVDPVRSTLELQV GYGLTPPDVDVKIDSFMRSQYITAVRA" gene complement(2144451..2144882) /locus_tag="Rv1897c" /db_xref="GeneID:885893" CDS complement(2144451..2144882) /locus_tag="Rv1897c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="hydrolyzes D-tyrosyl-tRNA(Tyr) into D-tyrosine and free tRNA(Tyr); possible defense mechanism against a harmful effect of D-tyrosine" /codon_start=1 /transl_table=11 /product="D-tyrosyl-tRNA(Tyr) deacylase" /protein_id="NP_216413.1" /db_xref="GI:15609034" /db_xref="GeneID:885893" /translation="MRVLVQRVSSAAVRVDGRVVGAIRPDGQGLVAFVGVTHGDDLDK ARRLAEKLWNLRVLADEKSASDMHAPILVISQFTLYADTAKGRRPSWNAAAPGAVAQP LIAAFAAALRQLGAHVEAGVFGAHMQVELVNDGPVTVMLEG" gene 2144940..2145248 /locus_tag="Rv1898" /db_xref="GeneID:885089" CDS 2144940..2145248 /locus_tag="Rv1898" /function="UNKNOWN" /note="Rv1898, (MTCY180.20c), len: 102 aa. Conserved hypothetical protein, some similarity to other hypothetical proteins e.g. Q58452 from METHANOCOCCUS JANNASCH II (100 aa), FASTA results: opt: 152, E(): 9.1e-05, (31.5% identity in 92 aa overlap); and AE000771|AE000771_2 from Aquifex aeolicus (157 aa), FASTA results: opt: 246, E(): 3.2e-11, (39.0% identity in 100 aa overlap). TBparse score is 0.874." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216414.1" /db_xref="GI:15609035" /db_xref="GeneID:885089" /translation="MSVLVAFSVTPLGVGEGVGEIVTEAIRVVRDSGLPNQTDAMFTV IEGDTWAEVMAVVQRAVEAVAARAPRVSAVIKVDWRPGVTDAMTQKVATVERYLLRPE" gene complement(2145214..2146245) /gene="lppD" /locus_tag="Rv1899c" /db_xref="GeneID:885138" CDS complement(2145214..2146245) /gene="lppD" /locus_tag="Rv1899c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1899c, (MTCY180.19), len: 343 aa. Possible lipoprotein; contains appropriately localized lipoprotein lipid attachment site (PS00013). Some similarity to C-terminal part of AE000717|AE000717_4 hypothetical protein from Aquifex aeolicus section 49 (165 aa), FASTA results: opt: 372, E(): 2.3e-14, (43.5% identity in 147 aa overlap); and Q44020 4-hydroxybutyrate dehydrogenase (173 aa), FASTA results: opt: 272, E(): 4.7e-09, (35.8% identity in 165 aa overlap)." /codon_start=1 /transl_table=11 /product="lipoprotein LppD" /protein_id="NP_216415.1" /db_xref="GI:15609036" /db_xref="GeneID:885138" /translation="MSRAAGLPRLSWFAGLTWFAGGSTGAGCAAHPALAGLTAGARCP AYAAISASTARPAATAGTTPATGASGSARPTDAAGMADLARPGVVATHAVRTLGTTGS RAIGLCPCQPLDCPRSPQATLNLGSMGRSLDGPQWRRARVRLCGRWWRRSNTTRGASP RPPSTCRGDNVSMIELEVHQADVTKLELDAITNAANTRLRHAGGVAAAIARAGGPELQ RESTEKAPIGLGEAVETTAGDMPARYVIHAATMELGGPTSGEIITAATAATLRKADEL GCRSLALVAFGTGVGGFPLDDAARLMVGAVRRHRPGSLQRVVFAVHGDAAERAFSAAI QAGEDTARR" misc_feature complement(2146162..2146194) /gene="lppD" /locus_tag="Rv1899c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(2146245..2147633) /gene="lipJ" /locus_tag="Rv1900c" /db_xref="GeneID:885151" CDS complement(2146245..2147633) /gene="lipJ" /locus_tag="Rv1900c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1900c, (MTCY180.18), len: 462 aa. Probable lipJ, lignin peroxidase, with some similarity to esterases, hydrolases and hypothetical Mycobacterium tuberculosis proteins e.g. Q43936 BETA-KETOADIPATE ENOL-LACTONE HYDROLASE from Acinetobacter calcoaceticus (267 aa), FASTA results: opt: 217, E(): 1.7e-07, (29.2% identity in 260 aa overlap). Also similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Rv2212|Q10400|YM12_MYCTU (378 aa), FASTA results: opt: 216, E(): 6.7e-07, (27.7% identity in 285 aa overlap)." /codon_start=1 /transl_table=11 /product="lignin peroxidase LIPJ" /protein_id="NP_216416.1" /db_xref="GI:15609037" /db_xref="GeneID:885151" /translation="MAQAPHIHRTRYAKCGDMDIAYQVLGDGPTDLLVLPGPFVPIDS IDDEPSLYRFHRRLASFSRVIRLDHRGVGLSSRLAAITTLGPKFWAQDAIAVMDAVGC EQATIFAPSFHAMNGLVLAADYPERVRSLIVVNGSARPLWAPDYPVGAQVRRADPFLT VALEPDAVERGFDVLSIVAPTVAGDDVFRAWWDLAGNRAGPPSIARAVSKVIAEADVR DVLGHIEAPTLILHRVGSTYIPVGHGRYLAEHIAGSRLVELPGTDTLYWVGDTGPMLD EIEEFITGVRGGADAERMLATIMFTDIVGSTQHAAALGDDRWRDLLDNHDTIVCHEIQ RFGGREVNTAGDGFVATFTSPSAAIACADDIVDAVAALGIEVRIGIHAGEVEVRDASH GTDVAGVAVHIGARVCALAGPSEVLVSSTVRDIVAGSRHRFAERGEQELKGVPGRWRL CVLMRDDATRTR" gene 2147662..2148954 /gene="cinA" /locus_tag="Rv1901" /db_xref="GeneID:885149" CDS 2147662..2148954 /gene="cinA" /locus_tag="Rv1901" /function="UNKNOWN" /note="Rv1901, (MTCY180.17c), len: 430 aa. Probable cinA-like protein, strong similarity to competence damage proteins CinA of Bacillus subtilis and S. pneumoniae. FASTA results: Q55760 HYPOTHETICAL 44.7 kDa PROTEIN (416 aa) opt: 755, E(): 0, (36.0% identity in 433 aa overlap). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="competence damage-inducible protein A" /protein_id="NP_216417.1" /db_xref="GI:15609038" /db_xref="GeneID:885149" /translation="MAVSARAGIVITGTEVLTGRVQDRNGPWIADRLLELGVELAHIT ICGDRPADIEAQLRFMAEQGVDLIVTSGGLGPTADDMTVEVVARYCGRELVLDDELEN RIANILKKLMGRNPAIEPANFDSIRAANRKQAMIPAGSQVIDPVGTAPGLVVPGRPAV MVLPGPPRELQPIWSKAIQTAPVQDAIAGRTTYRQETIRIFGLPESSLADTLRDAEAA IPGFDLVEITTCLRRGEIEMVTRFEPNAAQVYTQLARLLRDRHGHQVYSEDGASVDEL VAKLLTGRRIATAESCTAGLLAARLTDRPGSSKYVAGAVVAYSNEAKAQLLGVDPALI EAHGAVSEPVAQAMAAGALQGFGADTATAITGIAGPSGGTPEKPVGTVCFTVLLDDGR TTTRTVRLPGNRSDIRERSTTVAMHLLRRTLSGIPGSP" gene complement(2149006..2150274) /gene="nanT" /locus_tag="Rv1902c" /db_xref="GeneID:885057" CDS complement(2149006..2150274) /gene="nanT" /locus_tag="Rv1902c" /function="INVOLVED IN TRANSPORT OF SIALIC ACID ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1902c, (MTCY180.16), len: 422 aa. Probable nanT, sialic acid-transport integral membrane protein, possibly member of major facilitator superfamily (MFS), similar to others e.g. Q48076 SIALIC ACID TRANSPORTER (407 aa), FASTA results: opt: 443, E(): 5.4e-22, (26.7% identity in 389 aa overlap); etc. Some similarity to MTCI364.12|O05301 conserved hypothetical protein from Mycobacterium tuberculosis (425 aa), FASTA results: opt: 251, E(): 1.1e-09, (23.5% identity in 417 aa overlap). Contains sugar transport proteins signature 2 (PS00217). TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="sialic acid-transport integral membrane protein NanT" /protein_id="NP_216418.1" /db_xref="GI:15609039" /db_xref="GeneID:885057" /translation="MAAPRLTGDQRNAFMASFLGWTMDAFDYFLVVLVYADIATTFHH TKTDVAFLTTATLAMRPVGALLFGLWADRVGRRVPLMVDVSFYSVIGFLCAFAPNFTV LVILRLLYGIGMGGEWGLGAALSMEKVPAERRGVFSGLLQEGYAFGYLLASVAALVVM NWLGLSWRWLFGLSIIPALISLIIRYRVKESEVWEAAQDRMRLTKTRIRDVLGNPAIV RRFVYLVLLMTAFNWMSHGTQDVYPTFLTATTDHGAGLSSLTARWIVVIYNIGAIIGG LAFGTLSQRFSRRYTIVFCAALGLPIVPLFAYSRTAAMLCLGSFLMQVFVQGAWGVIP AHLTEMSPDAIRGVYPGVTYQLGNLLAAFNLPIQERLAESHGYPFALAATIVPVLLVV AVLTAIGKDATGIRFGTTETAFLVRHRNRH" misc_feature complement(2149873..2149950) /gene="nanT" /locus_tag="Rv1902c" /note="PS00217 Sugar transport proteins signature 2" gene 2150364..2150768 /locus_tag="Rv1903" /db_xref="GeneID:885643" CDS 2150364..2150768 /locus_tag="Rv1903" /function="UNKNOWN" /note="Rv1903, (MTCY180.15c), len: 134 aa. Probable conserved membrane protein, similar to Q53868|YPT3_STRCO hypothetical 15.9 kDa protein from Streptomyces coelicolor (148 aa) opt: 323, E(): 1.3e-16, (42.9% identity in 126 aa overlap); and equivalent to AJ000521|MLCOSL672_3 from Mycobacterium leprae (139 aa), FASTA results: opt: 680, E(): 0, (80.6% identity in 129 aa overlap). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216419.1" /db_xref="GI:15609040" /db_xref="GeneID:885643" /translation="MVPFLMRAAVTGFALWVVTLFVPGMRFAGGDTTLQRVAIIFVVA VIFGLVNAFIKPIVQILSIPLYILTLGLFHVVVNASMLWLTAWITEHTTHWGLQIDHF WWTAIWAAILLSIVSWILSLLARDFRRVTRAH" gene 2150954..2151385 /locus_tag="Rv1904" /db_xref="GeneID:885281" CDS 2150954..2151385 /locus_tag="Rv1904" /function="UNKNOWN" /note="Rv1904, (MTCY180.14c), len: 143 aa. Conserved hypothetical protein, some similarity to other hypothetical Mycobacterium tuberculosis proteins e.g. Rv2638|MTCY441.08|P71937 (148 aa), FASTA results: opt: 456, E( ): 2.7e-23, (52.8% identity in 125 aa overlap); Rv1365|Q11035 (128 aa), FASTA results: opt: 393, E(): 1.4e-19, (48.8% identity in 123 aa overlap); and Rv3687c. Also weak similarity to Q9WVX8|RSBV_STRCO ANTI-SIGMA B FACTOR ANTAGONIST from Streptomyces coelicolor (113 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216420.1" /db_xref="GI:15609041" /db_xref="GeneID:885281" /translation="MRTVAIGPGAGPSSTRPSSQPSDLHSGLRAVTECTGSAVVVHVG GDIDASNEVAWQRLVSKSAAIAIAPGPFVIDIRDLDFMGSCAYAVLAQESVRCRRRGV NMRLVSNQPIVARTIAACGLRRLIPLYATVETALAPPPSAH" gene complement(2151433..2152395) /gene="aao" /locus_tag="Rv1905c" /db_xref="GeneID:885504" CDS complement(2151433..2152395) /gene="aao" /locus_tag="Rv1905c" /EC_number="1.4.3.3" /function="Wide specificity for D-amino acids. Also acts on glycine [CATALYTIC ACTIVITY: A D-AMINO ACID + H2O + O2 = A 2-OXO ACID + NH3 + H2O2]" /note="Rv1905c, (MTCY180.13), len: 320 aa. Probable aao, D-amino acid oxidase (EC 1.4.3.3), similar to many. Equivalent to AJ000521|MLCOSL672.02|O33145 Mycobacterium leprae (320 aa), FASTA results: opt: 1541, E(): 0, (71.7% identity in 315 aa overlap); also similar to OXDD_BOVIN|P31228 d-aspartate oxidase (EC 1.4.3.1) from bos taurus (338 aa), FASTA results: opt: 461, E(): 1.1e-21, (31.8% identity in 321 aa overlap). TBparse score is 0.932" /codon_start=1 /transl_table=11 /product="D-amino acid oxidase" /protein_id="NP_216421.1" /db_xref="GI:15609042" /db_xref="GeneID:885504" /translation="MAIGEQQVIVIGAGVSGLTSAICLAEAGWPVRVWAAALPQQTTS AVAGAVWGPRPKEPVAKVRGWIEQSLHVFRDLAKDPATGVRMTPALSVGDRIETGAMP PGLELIPDVRPADPADVPGGFRAGFHATLPMIDMPQYLDCLTQRLAATGCEIETRPLR SLAEAAEAAPIVINCAGLGARELAGDATVWPRFGQHVVLTNPGLEQLFIERTGGSEWI CYFAHPQRVVCGGISIPGRWDPTPEPEITERILQRCRRIQPRLAEAAVIETITGLRPD RPSVRVEAEPIGRALCIHNYGHGGDGVTLSWGCAREVVNLVGGG" gene complement(2152425..2152895) /locus_tag="Rv1906c" /db_xref="GeneID:885514" CDS complement(2152425..2152895) /locus_tag="Rv1906c" /function="UNKNOWN" /note="Rv1906c, (MTCY180.12), len: 156 aa. Conserved hypothetical protein, possibly exported protein, equivalent to Mycobacterium leprae AJ000521|MLCOSL672.01 (153 aa), FASTA scores: opt: 637, E(): 2.6e-28, (63.2% identity in 155 aa overlap). Also similar to M. tuberculosis hypothetical exported protein, Rv1352." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216422.1" /db_xref="GI:15609043" /db_xref="GeneID:885514" /translation="MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAI DSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTD KAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP" gene complement(2153235..2153882) /locus_tag="Rv1907c" /db_xref="GeneID:885402" CDS complement(2153235..2153882) /locus_tag="Rv1907c" /function="UNKNOWN" /note="Rv1907c, (MTCY180.11), len: 215 aa. Hypothetical unknown protein. Similar to Q50763 Ethyl methane sulphonate resistance protein from Mycobacterium tuberculosis (168 aa), FASTA scores: opt: 638, E(): 0, (69.7% identity in 152 aa overlap). Downstream of a cloned katG gene (EMBL:MTKATG). Differences are due to frameshift errors in the EMBL sequence and the use of an earlier start codon. TBparse score is 0.958." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216423.1" /db_xref="GI:15609044" /db_xref="GeneID:885402" /translation="MIGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARR DGDDETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTV GLTRRGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTH PDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA" gene complement(2153889..2156111) /gene="katG" /locus_tag="Rv1908c" /db_xref="GeneID:885638" CDS complement(2153889..2156111) /gene="katG" /locus_tag="Rv1908c" /EC_number="1.11.1.6" /function="MULTIFUNCTIONAL ENZYME, EXHIBITING BOTH A CATALASE, A BROAD-SPECTRUM PEROXIDASE, AND A PEROXYNITRITASE ACTIVITIES. MAY PLAY A ROLE IN THE INTRACELLULAR SURVIVAL OF MYCOBACTERIA WITHIN MACROPHAGES; PROTECTION AGAINST REACTIVE OXYGEN AND NITROGEN INTERMEDIATES PRODUCED BY PHAGOCYTIC CELLS. SEEMS REGULATED BY SIGB|Rv2710 [CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Rv1908c, (MTCY180.10), len: 740 aa. katG, catalase-peroxidase-peroxynitritase T (EC 1.11.1.6) (see citations below), HPI. FASTA results: Q57215 CATALASE-PEROXIDASE from Mycobacterium tuberculosis (740 aa) opt: 5081, E(): 0, (100% identity in 740 aa overlap). Contains peroxidases active site signature (PS00436) and ATP/GTP-binding site motif A (P-loop; PS00017). Cosmid sequence was corrected to agree with a sequencing read from the H37Rv genome. DELETIONS OR DEFECTS IN KATG GENE CAUSE ISONIAZID (INH) RESISTANCE. BELONGS TO THE PEROXIDASE FAMILY. BACTERIAL PEROXIDASE/CATALASE SUBFAMILY. KATG TRANSCRIPTION SEEMS TO BE REGULATED BY FURA|Rv1909c PRODUCT. The catalase-peroxidase activity is associated with the amino-terminal domain but no definite function has been assigned to the carboxy-terminal domain. TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="catalase-peroxidase-peroxynitritase T KATG" /protein_id="NP_216424.1" /db_xref="GI:15609045" /db_xref="GeneID:885638" /translation="MPEQHPPITETTTGAASNGCPVVGHMKYPVEGGGNQDWWPNRLN LKVLHQNPAVADPMGAAFDYAAEVATIDVDALTRDIEEVMTTSQPWWPADYGHYGPLF IRMAWHAAGTYRIHDGRGGAGGGMQRFAPLNSWPDNASLDKARRLLWPVKKKYGKKLS WADLIVFAGNCALESMGFKTFGFGFGRVDQWEPDEVYWGKEATWLGDERYSGKRDLEN PLAAVQMGLIYVNPEGPNGNPDPMAAAVDIRETFRRMAMNDVETAALIVGGHTFGKTH GAGPADLVGPEPEAAPLEQMGLGWKSSYGTGTGKDAITSGIEVVWTNTPTKWDNSFLE ILYGYEWELTKSPAGAWQYTAKDGAGAGTIPDPFGGPGRSPTMLATDLSLRVDPIYER ITRRWLEHPEELADEFAKAWYKLIHRDMGPVARYLGPLVPKQTLLWQDPVPAVSHDLV GEAEIASLKSQIRASGLTVSQLVSTAWAAASSFRGSDKRGGANGGRIRLQPQVGWEVN DPDGDLRKVIRTLEEIQESFNSAAPGNIKVSFADLVVLGGCAAIEKAAKAAGHNITVP FTPGRTDASQEQTDVESFAVLEPKADGFRNYLGKGNPLPAEYMLLDKANLLTLSAPEM TVLVGGLRVLGANYKRLPLGVFTEASESLTNDFFVNLLDMGITWEPSPADDGTYQGKD GSGKVKWTGSRVDLVFGSNSELRALVEVYGADDAQPKFVQDFVAAWDKVMNLDRFDVR" misc_feature complement(2155287..2155310) /gene="katG" /locus_tag="Rv1908c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(2155782..2155817) /gene="katG" /locus_tag="Rv1908c" /note="PS00436 Peroxidases active site signature" gene complement(2156149..2156601) /gene="furA" /locus_tag="Rv1909c" /db_xref="GeneID:885400" CDS complement(2156149..2156601) /gene="furA" /locus_tag="Rv1909c" /function="ACTS AS A GLOBAL NEGATIVE CONTROLLING ELEMENT, EMPLOYING FE(2+) AS A COFACTOR TO BIND THE OPERATOR OF THE REPRESSED GENES. SEEMS TO REGULATE TRANSCRIPTION OF KATG|Rv1908c GENE." /experiment="experimental evidence, no additional details recorded" /note="Rv1909c, (MTCY180.09), len: 150 aa. furA, Ferric uptake regulation protein, similar to Q48835 LEGIONELLA PNEUMOPHILA 130B (WADSWORTH) FERRIC UPTAKE REGULATION (136 aa), FASTA results: opt: 230, E(): 2.5e-09, (32.3% identity in 133 aa overlap). Also similar to Mycobacterium tuberculosis furB ferric uptake regulatory protein, Rv2359. BELONGS TO THE FUR FAMILY." /codon_start=1 /transl_table=11 /product="ferric uptake regulation protein furA (fur)" /protein_id="NP_216425.1" /db_xref="GI:15609046" /db_xref="GeneID:885400" /translation="MSSVSSIPDYAEQLRTADLRVTRPRVAVLEAVNAHPHADTETIF GAVRFALPDVSRQAVYDVLHALTAAGLVRKIQPSGSVARYESRVGDNHHHIVCRSCGV IADVDCAVGEAPCLTASDHNGFLLDEAEVIYWGLCPDCSISDTSRSHP" gene complement(2156706..2157299) /locus_tag="Rv1910c" /db_xref="GeneID:885897" CDS complement(2156706..2157299) /locus_tag="Rv1910c" /function="UNKNOWN" /note="Rv1910c, (MTCY180.08), len: 197 aa. Possible exported protein, very similar to upstream ORF MTCY180.07 (201 aa), FASTA score: E(): 0, (64.0% identity in 200 aa overlap). Also similar to Q9Z729|Y877_CHLPN PROTEIN CPN0877 from Chlamydophila pneumoniae (150 aa). TBparse score is 0.940." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216426.1" /db_xref="GI:15609047" /db_xref="GeneID:885897" /translation="MAHAFHRFALAILGLALPVALVAYGGNGDSRKAAPLAPKAAALG RSMPETPTGDVLTISSPAFADGAPIPEQYTCKGANIAPPLTWSAPFGGALVVDDPDAP REPYVHWIVIGIAPGAGSTADGETPGGGISLPNSSGQPAYTGPCPPAGTGTHHYRFTL YHLPAVPPLAGLAGTQAARVIAQAATMQARLIGTYEG" gene complement(2157382..2157987) /gene="lppC" /locus_tag="Rv1911c" /db_xref="GeneID:885646" CDS complement(2157382..2157987) /gene="lppC" /locus_tag="Rv1911c" /function="UNKNOWN" /note="Rv1911c, (MTCY180.07), len: 201 aa. Probable lipoprotein lppC, contains appropriately positioned prokaryotic membrane lipoprotein lipid attachment site (PS00013). Very similar to downstream ORF MTCY180.08 (204 aa) (although this lacks lipoprotein motif), FASTA score: opt: 831, E(): 0, (64.0% identity in 200 aa overlap). Also similar to Q9Z729|Y877_CHLPN HYPOTHETICAL PROTEIN CPN0877 from Chlamydia pneumoniae (strain CWL029) (150 aa). TBparse score is 0.940." /codon_start=1 /transl_table=11 /product="lipoprotein LppC" /protein_id="NP_216427.1" /db_xref="GI:15609048" /db_xref="GeneID:885646" /translation="MTSTLHRTPLATAGLALVVALGGCGGGGGDSRETPPYVPKATTV DATTPAPAAEPLTIASPMFADGAPIPVQFSCKGANVAPPLTWSSPAGAAELALVVDDP DAVGGLYVHWIVTGIAPGSGSTADGQTPAGGHSVPNSGGRQGYFGPCPPAGTGTHHYR FTLYHLPVALQLPPGATGVQAAQAIAQAASGQARLVGTFEG" misc_feature complement(2157916..2157948) /gene="lppC" /locus_tag="Rv1911c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(2158087..2159091) /gene="fadB5" /locus_tag="Rv1912c" /db_xref="GeneID:885245" CDS complement(2158087..2159091) /gene="fadB5" /locus_tag="Rv1912c" /function="THOUGHT TO BE INVOLVED IN FATTY ACID DEGRADATION. FADB AND FADA ARE THE ALPHA AND BETA SUBUNITS OF THE MULTIFUNCTIONAL ENZYME COMPLEX OF THE FATTY ACID DEGRADATION CYCLE." /note="Rv1912c, (MTCY180.06), len: 334 aa. Possible fadB5, oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases: 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), quinone oxidoreductases (EC 1.6.5.5), and polyketide synthases, e.g. NP_104067.1|NC_002678 probable oxidoreductase from Mesorhizobium loti (308 aa); NP_464140.1|NC_003210 protein similar to oxidoreductase from Listeria monocytogenes (313 aa); NP_193889.1|NC_003075 putative NADPH quinone oxidoreductase from Arabidopsis thaliana (325 aa); NP_001880.2|NM_001889 crystallin, zeta; quinone oxidoreductase; NADPH:quinone reductase from Homo sapiens (329 aa); part 2983 to 3197 of T17410 polyketide synthase type I from Streptomyces venezuelae (3739 aa); Q53927|SCBAC20F6.16 HYDROXYACYL-CoA DEHYDROGENASE from Streptomyces coelicolor (329 aa), FASTA scores: opt: 621, E(): 2e-30, (39.5% identity in 349 aa overlap); etc. Also similar to many hypothetical Mycobacterium tuberculosis proteins including: MTCY24G1.09, MTCY13D12.11, MTCY19H9.01, MTCY24G1.03, MTCY03A2.17c, etc. Contains quinone oxidoreductase/zeta-crystallin signature (PS01162). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="oxidoreductase FADB5" /protein_id="NP_216428.1" /db_xref="GI:15609049" /db_xref="GeneID:885245" /translation="MRAVVITKHGDPSVLQVRQRPDPPPPGPGQLRVAVRAAGVNFAD HLARVGLYPDAPKLPAVVGYEVAGTVEAVGDGVDPNRVGERVLAGTRFGGYCEIVNVA ATDSVVLPDALSFEQGAAVPVNYATAWAALHGYGSLRAGERVLIHAAAGGVGIAAVQF AKAAKAEVHGTASPQKHQKLAEFGVDRAIDYRRDGWWQGLGPYDVVLDALGGTSLRRS YTLLRPGGRLVGYGISNMQHGEKRSMRRVAPHALSMLRGFNLMKQLEESKTVIGLNML RLWDDRRTLEPWIAPLTKALNDGTILPIVHAIVPFAEAPEAHRILAARENVDKVVLVP" misc_feature complement(2158615..2158668) /gene="fadB5" /locus_tag="Rv1912c" /note="PS01162 Quinone oxidoreductase / zeta-crystallin signature" gene 2159191..2159943 /locus_tag="Rv1913" /db_xref="GeneID:885640" CDS 2159191..2159943 /locus_tag="Rv1913" /function="UNKNOWN" /note="Rv1913, (MTCY180.05c), len: 250 aa. Conserved hypothetical protein, slight similarity to dehydrase and beta-lactamase precursors e.g. Q02057 DEHYDRASE from Streptomyces coelicolor (297 aa), FASTA scores: opt: 184, E(): 4.3e-05, (31.6% identity in 215 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216429.1" /db_xref="GI:15609050" /db_xref="GeneID:885640" /translation="MHFDWERLTDSVHRCRLPFCDVTVGLVRGRTGILLVDTGTTLGE ATAIAADVKQIAGCQVTHVVLTHKHFDHVLGSSVFDQAEVFCAPEVVEYLRSATDRLR EDALSYGADTAEVDRAIAALKPPQHGIYDAAVDLGDRTVTITHPGSGHTTADLVVVAP ATGHADGPTVVFTGDLVEESADPDIDADSDLAAWPATLDRVLAIGGPDASYVPGHGKV VDAQFVRRQRAWLRTRASRQPRETPATLPCKR" gene complement(2159921..2160328) /locus_tag="Rv1914c" /db_xref="GeneID:885875" CDS complement(2159921..2160328) /locus_tag="Rv1914c" /function="UNKNOWN" /note="Rv1914c, (MTCY180.04), len: 135 aa. Hypothetical unknown protein, TBparse score is 0.924" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216430.1" /db_xref="GI:15609051" /db_xref="GeneID:885875" /translation="MVLSRTSTGRVILVPTQLRFDRWFLPLAVPLGLGPKNSELWVGA GSLHVKMGWAFAADIPLTSITKAEATNARVYAAGVHFGFGRWLVNGSRKGLVALTIDP PEQAKMWKKSMTVRELWVSVTDPDALVTACTAK" gene 2160463..2161566 /gene="aceAa" /locus_tag="Rv1915" /db_xref="GeneID:885639" CDS 2160463..2161566 /gene="aceAa" /locus_tag="Rv1915" /EC_number="4.1.3.1" /function="INVOLVED IN GLYOXYLATE BYPASS, AN ALTERNATIVE TO THE TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY: ISOCITRATE = SUCCINATE + GLYOXYLATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv1915, (MTCY180.03c), len: 367 aa. Probable aceAa, isocitrate lyase (EC 4.1.3.1) (see citations below). Highly similar to the N-terminus of ACEA_MYCLE ISOCITRATE LYASE (EC 4.1.3.1) from Mycobacterium leprae (606 aa), FASTA results: opt: 3314, E(): 0, (86.5% identity in 572 aa overlap). Contains PS00161 Isocitrate lyase signature. Although this ORF and the downstream ORF representing the C-terminal half of aceA could be joined by a frameshift, no error is apparent in the cosmid, or in a seqencing read from the genome of H37Rv. As the downstream ORF has a RBS and transcriptional start immediately following the stop of this ORF, it is possible that they are expressed as two separate modules. In Mycobacterium tuberculosis strain CDC1551, aceA exists as a single gene, MT1966: the corresponding protein has been purified experimentally and seems have an active isocitrate lyase activity (see Honer et al., 1999). For Mycobacterium tuberculosis strain H37Rv, immunoblot assay didn't detect AceAa or AceAb products (see Honer et al., 1999) but mRNA of AceAa|Rv1915 has been detected (see Betts et al., 2002); so AceAb|Rv1916 could be a pseudogene." /codon_start=1 /transl_table=11 /product="isocitrate lyase" /protein_id="NP_216431.1" /db_xref="GI:15609052" /db_xref="GeneID:885639" /translation="MAIAETDTEVHTPFEQDFEKDVAATQRYFDSSRFAGIIRLYTAR QVVEQRGTIPVDHIVAREAAGAFYERLRELFAARKSITTFGPYSPGQAVSMKRMGIEA IYLGGWATSAKGSSTEDPGPDLASYPLSQVPDDAAVLVRALLTADRNQHYLRLQMSER QRAATPAYDFRPFIIADAGTGHGGDPHVRNLIRRFVEVGVPGYHIEDQRPGTKKCGHQ GGKVLVPSDEQIKRLNAARFQLDIMRVPGIIVARTDAEAANLIDSRADERDQPFLLGA TKLDVPSYKSCFLAMVRRFTNWASRSSMVIFSMRLATASTRRPAVGLSAKAFSAWSPT RSTRGGRTASSRSTAFSTRSSRGSWRPGRTTRA" misc_feature 2161099..2161116 /gene="aceAa" /locus_tag="Rv1915" /note="PS00161 Isocitrate lyase signature" gene 2161566..2162762 /gene="aceAb" /locus_tag="Rv1916" /db_xref="GeneID:885383" CDS 2161566..2162762 /gene="aceAb" /locus_tag="Rv1916" /EC_number="4.1.3.1" /function="INVOLVED IN GLYOXYLATE BYPASS, AN ALTERNATIVE TO THE TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY: ISOCITRATE = SUCCINATE + GLYOXYLATE]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the first step in the glyoxalate cycle, which converts lipids to carbohydrates" /codon_start=1 /transl_table=11 /product="isocitrate lyase" /protein_id="NP_216432.1" /db_xref="GI:15609053" /db_xref="GeneID:885383" /translation="MTYGEAVADVLEFGQSEGEPIGMAPEEWRAFAARASLHAARAKA KELGADPPWDCELAKTPEGYYQIRGGIPYAIAKSLAAAPFADILWMETKTADLADARQ FAEAIHAEFPDQMLAYNLSPSFNWDTTGMTDEEMRRFPEELGKMGFVFNFITYGGHQI DGVAAEEFATALRQDGMLALARLQRKMRLVESPYRTPQTLVGGPRSDAALAASSGRTA TTKAMGKGSTQHQHLVQTEVPRKLLEEWLAMWSGHYQLKDKLRVQLRPQRAGSEVLEL GIHGESDDKLANVIFQPIQDRRGRTILLVRDQNTFGAELRQKRLMTLIHLWLVHRFKA QAVHYVTPTDDNLYQTSKMKSHGIFTEVNQEVGEIIVAEVNHPRIAELLTPDRVALRK LITKEA" gene complement(2162932..2167311) /gene="PPE34" /locus_tag="Rv1917c" /db_xref="GeneID:885362" CDS complement(2162932..2167311) /gene="PPE34" /locus_tag="Rv1917c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1917c, (MTV050.01c-MTCY180.01), len: 1459 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, MPTR subfamily (see citation below). Similar to MTCY28.16, MTCY13E10.17, MTCY63.10, MTV004.05 , MTCY98.24, MTCY6G11.05, etc. C-terminus is identical to Q50471. Unknown Mycobacterium tuberculosis protein (693 aa), FASTA results: opt: 2635, E(): 0, (99.7% identity in 391 aa overlap). Start changed since original submission (+23 aa). Thougth to be surface exposed, cell-wall associated." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177655.1" /db_xref="GI:57116928" /db_xref="GeneID:885362" /translation="MNFSTLPPEINSALIFGGAGSEPMSAAAVAWDQLAMELASAAAS FNSVTSGLVGESWLGPSSAAMAAAVAPYLGWLAAAAAQAQRSATQAAALVAEFEAVRA AMVQPALVAANRSDLVSLVFSNFFGQNAPAIAAIEAAYEQMWAIDVSVMSAYHAGASA VASALTPFTAPPQNLTDLPAQLAAAPAAVVTAAITSSKGVLANLSLGLANSGFGQMGA ANLGILNLGSLNPGGNNFGLGNVGSNNVGLGNTGNGNIGFGNTGNGNIGFGLTGDNQQ GFGGWNSGTGNIGLFNSGTGNIGIGNTGTGNFGIGNSGTSYNTGIGNTGQANTGFFNA GIANTGIGNTGNYNTGSFNLGSFNTGDFNTGSSNTGFFNPGNLNTGVGNTGNVNTGGF NSGNYSNGFFWRGDYQGLIGFSGTLTIPAAGLDLNGLGSVGPITIPSITIPEIGLGIN SSGALVGPINVPPITVPAIGLGINSTGALVGPINIPPITLNSIGLELSAFQVINVGSI SIPASPLAIGLFGVNPTVGSIGPGSISIQLGTPEIPAIPPFFPGFPPDYVTVSGQIGP ITFLSGGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGLGPFTVFPDGY SLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGAIGPLTTPPITIPSIPLGID VSGSLGPINIPIEIAGTPGFGNSTTTPSSGFFNSGTGGTSGFGNVGSGGSGFWNIAGN LGNSGFLNVGPLTSGILNFGNTVSGLYNTSTLGLATSAFHSGVGNTDSQLAGFMRNAA GGTLFNFGFANDGTLNLGNANLGDYNVGSGNVGSYNFGSGNIGNGSFGFGNIGSNNFG FGNVGSNNLGFANTGPGLTEALHNIGFGNIGGNNYGFANIGNGNIGFGNTGTGNIGIG LTGDNQVGFGALNSGSGNIGFFNSGNGNIGFFNSGNGNVGIGNSGNYNTGLGNVGNAN TGLFNTGNVNTGIGNAGSYNTGSYNAGDTNTGDLNPGNANTGYLNLGDLNTGWGNIGD LNTGALISGSYSNGILWRGDYQGLIGYSDTLSIPAIPLSVEVNGGIGPIVVPDITIPG IPLSLNALGGVGPIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNALGGVG PIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNALGGVGPITVPGVPISRI PLTINIRIPVNITLNELPFNVAGIFTGYIGPIPLSTFVLGVTLAGGTLESGIQGFSVN PFGLNIPLSGATNAVTIPGFAINPFGLNVPLSGGTSPVTIPGFAINPFGLNVPLSGGT SPVTIPGFTIPGSPLNLTANGGLGPINIPINITSAPGFGNSTTTPSSGFFNSGDGSAS GFGNVGPGISGLWNQVPNALQGGVSGIYNVGQLASGVANLGNTVSGFNNTSTVGHLTA AFNSGVNNIGQMLLGFFSPGAGP" repeat_region complement(2163323..2163392) /note="69 bp imperfect direct repeat 3, TTAATCCGTTTGGGTTGAATGTTCCGTTGAGCGGGGGCACGAGCCCGGTTACGATCC CCGGCTTCACCAT" repeat_region complement(2163393..2163461) /note="69 bp imperfect direct repeat 2, TTAATCCGTTTGGGTTGAATGTTCCGTTGAGCGGGGGCACGAGCCCGGTTACGATCC CTGGTTTCGCGA" repeat_region complement(2163462..2163530) /note="69 bp imperfect direct repeat 1, TTAATCCGTTCGGTTTGAATATTCCGCTGAGCGGTGCTACCAACGCTGTCACGATCC CTGGTTTCGCGA" repeat_region complement(2163741..2163809) /note="69 bp imperfect direct repeat 5, TCGGTCCGATTGTGGTGCCTGATATTACTATTCCTGGTATTCCGTTGAGCCTGAACG CGCTGGGTGGTG" repeat_region complement(2163810..2163878) /note="69 bp imperfect direct repeat 4, TCGGTCCGATTGTGGTGCCGGATATTACTATTCCTGGTATTCCGTTGAGCCTGAACG CGCTGGGTGGTG" repeat_region complement(2163879..2163947) /note="69 bp imperfect direct repeat 3, TCGGTCCGATTGTGGTGCCGGATATTACTATTCCTGGTATTCCGTTGAGCCTGAACG CGCTGGGTGGTG" repeat_region complement(2163948..2164016) /note="69 bp imperfect direct repeat 2, TCGGTCCGATTGTGGTGCCGGATATTACTATTCCTGGTATTCCGTTGAGCCTGAACG CGCTGGGTGGTG" repeat_region complement(2164017..2164085) /note="69 bp imperfect direct repeat 1, TCGGTCCGATTGTGGTGCCGGATATTACTATTCCTGGTATTCCGTTGAGCCTGAACG CGCTGGGTGGTG" misc_feature complement(2165392..2165445) /gene="PPE34" /locus_tag="Rv1917c" /note="PS00879 Orn/DAP/Arg decarboxylases family 2 signature 2" misc_feature complement(2165467..2165520) /gene="PPE34" /locus_tag="Rv1917c" /note="PS00879 Orn/DAP/Arg decarboxylases family 2 signature 2" misc_feature complement(2165542..2165595) /gene="PPE34" /locus_tag="Rv1917c" /note="PS00879 Orn/DAP/Arg decarboxylases family 2 signature 2" gene complement(2167649..2170612) /gene="PPE35" /locus_tag="Rv1918c" /db_xref="GeneID:885506" CDS complement(2167649..2170612) /gene="PPE35" /locus_tag="Rv1918c" /function="UNKNOWN" /note="Rv1918c, (MTV050.02c), len: 987 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins. Similar to MTCY28.16|Z95890 Mycobacterium tuberculosis cosmid (1053 aa), FASTA scores: opt: 3404, E(): 0, (65.6% identity in 1058 aa overlap). Also similar to MTV004.05, MTY13E10.17, MTV014.03, MTCY3C7.23, MTCY6G11.05, MTCY48.17, MTV004.03, MTCY31.07, MTCY4C12.36, MTCY180.01, etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177850.1" /db_xref="GI:57116929" /db_xref="GeneID:885506" /translation="MHYSVLPPEINSALIFAGAGSGPMLAAASAWDGLATELASAAVS FGSVTAGLVGGSWQGRSSVAMAAAAAPYAGWLAAAATQAEQAATQAQVMVAEFEAVRL AMVQPALVAANRSGLISLVISNLFGQNAPAIAAAEAAYEEMWALDVSAMAAYHSGASA VAVALPAFALPLRLPAGLAAGPAAVVTALTTAVGMPTFAGRAIAASLGLANVGGGNLG NANNGLGNIGNANLGNNNLGSGNFGSFNIGSANLGGNNIGIGNAGANNFGLANLGNLN TGFANAGIGNFGIANTGNNNIGNGLTGNNQIGIGGLNSGNGNVGLFNAGSANIGFFNS GNGNFGIGNSGNFSTGLFNPGHGNTGFLNAGSFNTGMFDVGNANTGSFNVGHYNFGAF NPGPSNTGTFNTGGANTGWFNTGSINTGAFNIGDMNNGLFNTGDMNNGVFYRGVGQGS LQFAITSPDLTLPSLEIPGISVPAFSLPAITLPSLTIPAVTTPANVTVGAFDLPGLTV PSLTIPAAMTPANITVGAFDLPGLTVPSLTIPATTTPANITVGAFNLPQLSIPSVTVP PITIPAGTALGAFNLPTLSIPSVTVPPITIPAGTTVGGFTLPTIHTPLISTPQISIGG FSTPGIATQANSGVINLPTFSLNGITITNLVVFIPNNITALQTNMPGVFPQIGGFANT PPAFINTGTITVGGGQINGVGFSIGAINVTPFTLPNVVIQPWSLGGISVDGFTLPEIS TQEFTTPALTISPIGVGALSLPDITTQQFTTPELTIDPITLGGFTLPQLSIPAITTPA FTIDPIALGGFTLPQIMTPEITTPPFAIDPIGLSGFTLPQVNIPEITTPEFTIQPVGL AAFTTPALTIASIHLPSTTMGGFAIPAGPGYFNSSATPSLGFFNAGIGGNSGFGNSGS GLSGWFNTSPVGLLAGSGYQNYGGLISGFSNLGSGISGFANTGTLPFAVTSLVSGLAN IGNNLSGLFFQSTTP" gene complement(2171061..2171525) /locus_tag="Rv1919c" /db_xref="GeneID:885876" CDS complement(2171061..2171525) /locus_tag="Rv1919c" /function="UNKNOWN" /note="Rv1919c, (MTV050.03c), len: 154 aa. Conserved hypothetical protein, shows weak similarity to several major pollen antigens e.g. Z72431|BVGC25_1 MAJOR ALLERGEN BET V 1 from Betula verrucosa (160 aa), FASTA scores: opt: 133, E(): 0.012, (26.8% identity in 149 aa overlap). Also shows some similarity to Rv2574|MTCY227.27C Hypothetical protein from Mycobacterium tuberculosis (167 aa), (27.4% identity in 124 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216435.1" /db_xref="GI:15609056" /db_xref="GeneID:885876" /translation="MSGRKFSFEVTKTSSAPAATLFRLVTDGGNWATWAKPIVAQSSW ARRGDPAPGGIGAIRKLGMWPVFVQEETVEYEQDRRHVYKLVGARTPVQDYFGEVVLT PNASGGTDLRWSGSFTEKVRGTGPVMRAALGGAVRFFAGQLVKAAEREAVRR" gene 2171623..2172486 /locus_tag="Rv1920" /db_xref="GeneID:885416" CDS 2171623..2172486 /locus_tag="Rv1920" /function="UNKNOWN" /note="Rv1920, (MTV050.04), len: 287 aa. Probable membrane protein, similar to AL0215|SC10A5.04 putative membrane protein from Streptomyces coelicolor cosmid 10A5 (295 aa), FASTA scores: opt: 292, E(): 3.6e-13, (31.3% identity in 243 aa overlap). Also weakly similar to several Mycobacterial putative proteins with unknown function e.g. Rv0502, Rv1428c, U00018_22 Mycobacterium leprae cosmid B2168." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216436.1" /db_xref="GI:15609057" /db_xref="GeneID:885416" /translation="MFPRWPQQAHNHEVSRADTVSVPRAPTQAEVAAVLRIMTPLRKV IKPKVYGIENVPTERALLVGNHNTLGLVDAPLLAAELWERGRIVRSLGDHAHFKIPGW RDALTRTGVVEGTREITSELMRRGELVMVFPGGAREVNKRKNERYKLVWKNRLGFARL AIQHGYPIVPFASVGAEHGIDIVLDNESPLLAPVQFLAEKLLGTKDGPALVRGVGLTP VPRPERQYYWFGEPIDTTEFMGQQADDNAARRVRERAAAAIEHGIELMLAERAADPNR SLVGRLLRSDA" gene complement(2172524..2173795) /gene="lppF" /locus_tag="Rv1921c" /db_xref="GeneID:885917" CDS complement(2172524..2173795) /gene="lppF" /locus_tag="Rv1921c" /function="UNKNOWN" /note="Rv1921c, (MTCY09F9.43-MTV050.05c), len: 423 aa. Probable lppF, conserved lipoprotein, similar to G403173 lipoprotein precursor (fragment) from Rhodococcus erythropolis (225 aa), fasta scores: opt: 364, E(): 9.2e-19, (41.9% identity in 148 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LppF" /protein_id="NP_216437.1" /db_xref="GI:15609058" /db_xref="GeneID:885917" /translation="MVRLIPSLLAMATVLGGVIGCSAHQPPTPASGCRQLDAFLKWHH GVREFLQSAIDANSRCTGTADGSARKVAIFDWDNTVVKNDIGYATNYYMLQHSLVLQP ANQDWHAASRYLTDAAANALSVACGKVVPAGKPLPTGSNALCANEILSLLDGETTTGQ PAFVGNNVRRLAGPYAWSNALSAGYTAEELAGFADQAKKQNLAADVGATQQVGTQQVD GYIRVYPQMKDLIGTLQAHGIDTWVVSASPEPIVKVWAGEVGLDDQHVVGVRSVADQS GKLTAHLVGCGGVRDGDDSVMTYLDGKRCWANQVIFGVTGPQAFNQLAADRRQVLAAG DSNSDATFVGDATVVSLVINRNQDDLMCRAYDGLFTRGGKWAINPMFIDPLPQHAPYV CGEAFINPDGSKQPVLRNDGTPIPDQVDSVF" misc_feature complement(2173733..2173765) /gene="lppF" /locus_tag="Rv1921c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2174067..2175182 /locus_tag="Rv1922" /db_xref="GeneID:885918" CDS 2174067..2175182 /locus_tag="Rv1922" /function="UNKNOWN" /note="Rv1922, (MTCY09F9.42c), len: 371 aa. Probable conserved lipoprotein, possibly peptidase (EC 3.4.-.-) similar to many peptidases, e.g. P15555|DAC_STRSQ D-alanyl-D-alanine carboxypeptidase from Streptomyces sp. (406 aa), FASTA scores: opt: 382, E(): 3.1e-17, (28.0% identity in 379 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv1497, Rv2463, Rv3775, etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein" /protein_id="NP_216438.1" /db_xref="GI:15609059" /db_xref="GeneID:885918" /translation="MDSTVTASIRRMLGLLAATLLLGGCTGQHTTRTAASTTYTPHIK ASSQDVLDGAINADEPGCSAAVGVEGKVIWSGVRGIADLASGAKITTDTVFDIASVSK QFTATAILLLVEAGKLTLDDPISQYVPELPDWAQTVTVEQLMHQTSGIPDYVALLAAR GYQVSDRTIEAEARQALAAAPELQFKPGTRFDYSNSNYLLLGEIVHRASGQPLPEFLS AEIFQPLGLAMVVDPVGKVPNKAVSYEKGTGGNRSEYRVGNPAWEQIGDGGIQTTPSQ LARWADNYRTGSVGGLKLLEAQLAGAVETEPGGGDRYGAGIVSRADGTLDHAGAWAGF VTAFHISSDRRTSVAISCNTDKPDPVAMADALGRLWM" misc_feature 2174109..2174141 /locus_tag="Rv1922" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2175173..2176513 /gene="lipD" /locus_tag="Rv1923" /db_xref="GeneID:885910" CDS 2175173..2176513 /gene="lipD" /locus_tag="Rv1923" /EC_number="3.1.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME PROBABLY INVOLVED IN CELLULAR METABOLISM" /experiment="experimental evidence, no additional details recorded" /note="Rv1923, (MTCY09F9.41c), len: 446 aa. Probable lipD, hydrolase lipase (EC 3.1.-.-), similar to esterases and beta-lactamases e.g. G151214 esterase, (389 aa), fasta scores: opt: 569, E(): 5.4e-29, (33.7% identity in 401 aa overlap). Also similar to Mycobacterium tuberculosis hypothetical proteins Rv1497, Rv2463, Rv3775, etc." /codon_start=1 /transl_table=11 /product="lipase LIPD" /protein_id="NP_216439.1" /db_xref="GI:15609060" /db_xref="GeneID:885910" /translation="MDVAGLPRLAAGTQAAIIHGMAQPPSLLTTDNGLPFGVQGACDS RFTGVIRAFAGLYPGRKFGGGALSVYIDGRQVVDVWTGWSDRQGKVPWTADTGAMVFS ATKGLAATVIHRLVDRGLLSYDAPVAEYWPEFGANGKSEVTVSDVLRHRSGLAHLKGV DKDEVMDHLLMEQKLAAAPLDRQHGKLAYHAVTYGWLLSGLARAVTGKGMRELFREEL ARPLNTDGIHLGRPPADSPTKAAQTLLPQAKVPTPLLDFIAPKVAGLSFSGLLGAVYF PGILSLLQDDMPFLDGEVPAVNGVVTARALAKTYGALANDGVIDGTRLLSSQAVRGLT GKSELWPDLNLGLPFTYHQGYQSSPVPGLLEGYGHIGLGGTIGWADPETGSAFGYVHN RLLTLLLFDIGSFAGLAALLNSAVVAARRDDPLEVPHFGAPYSEPRHEQAASGA" gene complement(2176550..2176930) /locus_tag="Rv1924c" /db_xref="GeneID:885352" CDS complement(2176550..2176930) /locus_tag="Rv1924c" /function="UNKNOWN" /note="Rv1924c, (MTCY09F9.40), len: 126 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216440.1" /db_xref="GI:15609061" /db_xref="GeneID:885352" /translation="MDPADVINPTSTRDAALARVLAYRQRVRARPLLIRATLAVVGGG LFVVSLPMIVLLPELGIPALLVAFRLLAVEAQWAVRAYAWTDWRFTQLREWFHRQVLV TRAAILVGLFLAAVALVWLLVYEF" gene 2177087..2178949 /gene="fadD31" /locus_tag="Rv1925" /db_xref="GeneID:885200" CDS 2177087..2178949 /gene="fadD31" /locus_tag="Rv1925" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVEMENT IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="putative fatty-acid--CoA ligase" /protein_id="NP_216441.1" /db_xref="GI:15609062" /db_xref="GeneID:885200" /translation="MNDGSRQELRVRSGLLQIEDCLDADGGIALPAGTTLISLIERNI KYVGDLVAYRYLDHARSAAGCALEVTWTQFGMRLAAIGAHVQRFAGPGDRVAILAPQG IDYVCGFYAAIKAGTVAVPLFAPELPGHAERLDTALRDSEPAVILTTAAAKNAVEGFL NNVPRLRKPTVLVIDQIPDREGELFVPVEMDIDAVSHLQYTSGSTRPPVGVEITHRAV GTNLVQMILSIDLLNRNTHGVSWLPLYHDMGLSMIGFPAVYGGHSTLMSPTAFVRRPL RWIQALSEGSRTGRVVTAAPNFAYEWAAQRGLPAQGDDVDLSNVVLIIGSEPVSIDAV TTFNKAFAPYGLPRTAFKPSYGIAEATLLVATIDHAAEPTVVYLDPEQLGAGHATRVA PDAPNAVVHVSCGHVARSLWAVIVDPDTGPEAGAELPDGEIGEVWLQGDNVARGYWGR PEETRMTFGARLQSPLAEGSHADGSAIDDTWLRTGDLGVYLDGELYITGRIADLLTID GRNHYPQDIEATAAEASPMVRRGYITAFTVPASDGDDRNQRLVIIAERAAGTSRSDPR PALDAIRAAVCNRHGLSVADLSFLPAGAIPRTTSGKLARQACRAQYLSGRLGVH" gene complement(2178957..2179436) /gene="mpt63" /locus_tag="Rv1926c" /db_xref="GeneID:885334" CDS complement(2178957..2179436) /gene="mpt63" /locus_tag="Rv1926c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1926c, (MT1977, MTCY09F9.38), len: 159 aa. mpt63 (alternate gene name: mpb63), immunogenic protein (see citations below), identical to MPT63|MPB63 from Mycobacterium bovis (159 aa). Exported protein containing a N-terminal signal sequence: see notes below about proteomics.; mpb63" /codon_start=1 /transl_table=11 /product="immunogenic protein MPT63 (antigen MPT63/MPB63) (16 kDa immunoprotective extracellular protein)" /protein_id="NP_216442.1" /db_xref="GI:15609063" /db_xref="GeneID:885334" /translation="MKLTTMIKTAVAVVAMAAIATFAAPVALAAYPITGKLGSELTMT DTVGQVVLGWKVSDLKSSTAVIPGYPVAGQVWEATATVNAIRGSVTPAVSQFNARTAD GINYRVLWQAAGPDTISGATIPQGEQSTGKIYFDVTGPSPTIVAMNNGMEDLLIWEP" gene 2179673..2180446 /locus_tag="Rv1927" /db_xref="GeneID:885912" CDS 2179673..2180446 /locus_tag="Rv1927" /function="UNKNOWN" /note="Rv1927, (MTCY09F9.37c), len: 257 aa. Conserved hypothetical protein, similar to SCG11A.10c|AL133210 hypothetical protein from Streptomyces coelicolor (252 aa), FASTA scores: opt: 729, E(): 0, (48.3% identity in 238 aa overlap). Slight similarity with P54543|YQJF_BACSU hypothetical 23.9 kDa protein from Bacillus subtilis (209 aa), FASTA scores, opt: 230, E(): 2.8e-08, (28.0% identity in 164 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216443.1" /db_xref="GI:15609064" /db_xref="GeneID:885912" /translation="MTAIPGPSGAEPGESRALAGYPVTPPALPRPVIFDQRWTDLTFI HWPVLPESVAGSYPPGTRPDVFADGMTYVGLVPFRMSSTKLGTALPIPYVGTFPETNV RLYSIDNAGRHGVLFRSLETARLTVVPLTRIGLGIPYAWSRMRMMRSGKHITYHSVRR WPRRGLRSLLTITIGDLVEPTPLEVWLTARWGAHTRKAGRTWWVPNEHKPWPLRAAEI AELNDELIDASGVQPTGDRLRALFSPGVHARFGRPCVVQ" gene complement(2180450..2181217) /locus_tag="Rv1928c" /db_xref="GeneID:885331" CDS complement(2180450..2181217) /locus_tag="Rv1928c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv1928c, (MTCY09F9.36), len: 255 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (261 aa); P87219|SOU1_CANAL SORBITOL UTILIZATION PROTEIN (SDR FAMILY) from Candida albicans (281 aa); P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa), FASTA scores: opt: 541, E(): 1.2e-27, (37.5% identity in 251 aa overlap); etc. Also similar to many mycobacterial tuberculosis proteins e.g. Rv1350, Rv0927c, Rv2002, Rv0769, Rv2766c, etc. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216444.1" /db_xref="GI:15609065" /db_xref="GeneID:885331" /translation="MSVLDLFDLHGKRALITGASTGIGKRVALAYVEAGAQVAIAARH LDALEKLADEIGTSGGKVVPVCCDVSQHQQVTSMLDQVTAELGGIDIAVCNAGIITVT PMLDMPLEEFQRLQNTNVTGVFLTAQAAAKAMVKQGQGGVIINTASMSGHIINVPQQV SHYCASKAAVIHLTKAMAVELAPHKIRVNSVSPGYILTELVEPYTEYQPLWEPKIPLG RLGRPEELAGLYLYLASEASSYMTGSDIVIDGGYTCP" misc_feature complement(2180684..2180770) /locus_tag="Rv1928c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(2181262..2181906) /locus_tag="Rv1929c" /db_xref="GeneID:885630" CDS complement(2181262..2181906) /locus_tag="Rv1929c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1929c, MTCY09F9.35, len: 214 aa. Conserved hypothetical protein, similar to SC4G6.14|AL096884 hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 416, E(): 2.4e-22, (39.8% identity in 206 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216445.1" /db_xref="GI:15609066" /db_xref="GeneID:885630" /translation="MADVPLDAQERLELCDLLEELGPAVATLIEGWTAHDLAAHIVLR ERDLVAGLCIVLPGPFQRFAERRRARLAQSKDFTWLVARIRSGPPMGFFRIGWVRTLA NLNEFFVHHEDVRRASGRGPRSLTPEMDAALWRNVRRGSHFLSRRLHGCGLEIEWVGT GKRVRVRSGEPTARLTGPPGELLLYVFGRRAVARVEVSGPLEAIAAVHRTHFGM" gene complement(2181918..2182442) /locus_tag="Rv1930c" /db_xref="GeneID:885344" CDS complement(2181918..2182442) /locus_tag="Rv1930c" /function="UNKNOWN" /note="Rv1930c, MTCY09F9.34, len: 174 aa. Conserved hypothetical protein, similar to SC5F2A.30|AL049587 hypothetical protein from Streptomyces coelicolor (211 aa), FASTA scores: opt: 307, E(): 2.8e-13, (54.8% identity in 84 aa overlap). Some similarity to M. tuber culosis hypothetical protein Rv0052|MTCY21D4.15 (43% identity in 93 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216446.1" /db_xref="GI:15609067" /db_xref="GeneID:885344" /translation="MTQIAFVAYPGVTALDVVGPYEVLRNLPHAQVRFVWLRGRRATS HWLTLPALKAFGAIPVADERIVHQDNIVTSAGVSAGLDLALWLAGQLGGEARAKAIQL AIEYDPQPPFDSGHMSKASPTTKAAATALLSKDSAKPANLTAATLLAWERALAAVQSR RRKRQPVGAQARRP" gene complement(2182460..2183239) /locus_tag="Rv1931c" /db_xref="GeneID:885437" CDS complement(2182460..2183239) /locus_tag="Rv1931c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv1931c, (MTCY09F9.33), len: 259 aa. Probable transcriptional regulatory protein. Similarity in C-terminal half to transcriptional activators e.g. Q43970 ARAC-LIKE PROTEIN (227 aa), FASTA scores: opt: 238, E(): 7.1e-07, (42.4% identity in 92 aa overlap). Similar to many probable transcription regulators in Streptomyces e.g. AL049587|SC5F2A.29 Streptomyces coelicolor (325 aa), FASTA scores: opt: 387, E(): 3.2e-16, (34.4% identity in 259 aa overlap)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216447.1" /db_xref="GI:15609068" /db_xref="GeneID:885437" /translation="MVIVGFPGDPVDTVILPGGAGVDAARSEPALIDWVKAVSGTARR VVTVCTGAFLAAEAGLLGRTPSDDALGLCRTFRPRISGRSGRCRPDLHAQFAEGVDRG WSHRRHRPRAGTGRRRPRHRDCPDGCPLARPVSAPTRWADPVRGSGVDATRQTDLDPP GAGGHRGRAGGAHRIGELAQRAAMSPRHFTRVFSDEVGEAPGRYVERIRTEAARRQLE ETHDTVVAIAARCGFGTAETMRRSFIRRVGISPDQYRKAFA" gene 2183372..2183869 /gene="tpx" /locus_tag="Rv1932" /db_xref="GeneID:885357" CDS 2183372..2183869 /gene="tpx" /locus_tag="Rv1932" /EC_number="1.11.1.-" /function="HAS ANTIOXIDANT ACTIVITY. COULD REMOVE PEROXIDES OR H(2)O(2)" /experiment="experimental evidence, no additional details recorded" /note="antioxidant activity; thioredoxin-dependent thiol peroxidase; forms homodimers in solution; shows substrate specificity to alkyl hydroperoxides; periplasmic protein" /codon_start=1 /transl_table=11 /product="thiol peroxidase" /protein_id="NP_216448.1" /db_xref="GI:15609069" /db_xref="GeneID:885357" /translation="MAQITLRGNAINTVGELPAVGSPAPAFTLTGGDLGVISSDQFRG KSVLLNIFPSVDTPVCATSVRTFDERAAASGATVLCVSKDLPFAQKRFCGAEGTENVM PASAFRDSFGEDYGVTIADGPMAGLLARAIVVIGADGNVAYTELVPEIAQEPNYEAAL AALGA" gene complement(2183866..2184957) /gene="fadE18" /locus_tag="Rv1933c" /db_xref="GeneID:885394" CDS complement(2183866..2184957) /gene="fadE18" /locus_tag="Rv1933c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1933c, (MTCY09F9.31), len: 363 aa. Probable fadE18, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. CAB61609.1|AL133210 putative acyl-CoA dehydrogenase from Streptomyces coelicolor (362 aa); NP_421282.1|NC_002696 acyl-CoA dehydrogenase family protein from Caulobacter crescentus (344 aa); ACDS_RAT|P15651 short-chain specific acyl-CoA dehydrogenase from Rattus norvegicus (Rat) (412 aa), fasta scores: opt: 239, E(): 2.1e-08, (28.4% identity in 331 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. N-terminus of fadE22 (721 aa); fadE33 (318 aa); N-terminus of fadE34 (711 aa); etc. COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE18" /protein_id="NP_216449.1" /db_xref="GI:15609070" /db_xref="GeneID:885394" /translation="MDFRYSTEQDDFRASLRGFLGRGAPVREMAAADGSDRRLWQRLC TELELPALHVPPEHGGLGATLVETAIAFAELGRALTPIPFAATVFAIEAILRMGDDEQ RKRLLAGLLTGARIGTIAVSGHDVASATTVRAVRRDGRPALTGECTPVLHGHVADLFV VPAVADGSIVLHVVAADAPGVTVTPLPSFDITRPVATLRLAGSPAEPLTAGTPDDMER VLDVARVLLAAEMLGGAEACLDLAVQYAGRRTQFDRPIGSFQAVKHACADMMIEIDAT RATVMFAAMSAANGDELQTVAPLAKAQTAETFVLCAGSALQIHGAIAFTWEHDLHLYY RRAKTTEALFGSSARNRALLAERAGLVKA" gene complement(2184959..2186188) /gene="fadE17" /locus_tag="Rv1934c" /db_xref="GeneID:885379" CDS complement(2184959..2186188) /gene="fadE17" /locus_tag="Rv1934c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /note="Rv1934c, (MTCY09F9.30), len: 409 aa. Probable fadE17, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to ACD_MYCLE|P46703 acyl-CoA dehydrogenase from Mycobacterium leprae (389 aa), FASTA scores: opt: 414, E(): 2.6e-19, (28.3% identity in 407 aa overlap). Also similar to many e.g. NP_249713.1|NC_002516 probable acyl-CoA dehydrogenase from Pseudomonas aeruginosa (381 aa); NP_420614.1|NC_002696 acyl-CoA dehydrogenase family protein from Caulobacter crescentus (355 aa); CAB61610.1|AL133210 putative acyl-CoA dehydrogenase from Streptomyces coelicolor (393 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. fadE30 (385 aa); fadE31 (377 aa); C-terminus of fadE34 (711 aa); etc. COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE17" /protein_id="NP_216450.1" /db_xref="GI:15609071" /db_xref="GeneID:885379" /translation="MDVSYPPEAEAFRDRIREFVAEHLPPGWPGPGALPPHEREEFAR HWRRALAGAGLVAVSWPTEYGGGGLSPMEQVVLAEEFARAGAPERAENDLLGIDLLGN TLIALGSEAQKRHFLPRILSGEHRWCQGFSEPEAGSDLASVRTRGVLDGDEWVINGHK IWTSAGTTANWIFLLARTDPSAAKHRGLSFLLVPMDQPGVVVRPIVNAAGHSSFSEVF LTDARTSAGNVVGRVGDGWSTAMTLLGFERGSHIATAAIDFERDLQRLCELARDRGLH TDPRVRDGLAWCYARVQIMRYRGYRDLTLALTGRPPGAEAAITKVIWSEYFRRYTDLA VEILGLEALGPRGPGNGGARLVPEAGTPNSPACWMDELLYARAATIYAGSSQIQRNVI GERLLGLPKEPRPEVLC" gene complement(2186203..2187159) /gene="echA13" /locus_tag="Rv1935c" /db_xref="GeneID:885240" CDS complement(2186203..2187159) /gene="echA13" /locus_tag="Rv1935c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_216451.1" /db_xref="GI:15609072" /db_xref="GeneID:885240" /translation="MFVGRVGPVDRRSDGERSRRPREFEYIRYETIDDGRIAAITLDR PKQRNAQTRGMLVELGAAFELAEADDTVRVVILRAAGPAFSAGHDLGSADDIRERSPG PDQHPSYRCNGATFGGVESRNRQEWHYYFENTKRWRNLRKITIAQVHGAVLSAGLMLA WCCDLIVASEDTVFADVVGTRLGMCGVEYFGHPWEFGPRKTKELLLTGDCIGADEAHA LGMVSKVFPADELATSTIEFARRIAKVPTMAALLIKESVNQTVDAMGFSAALDGCFKI HQLNHAHWGEVTGGKLSYGTVEYGLEDWRAAPQIRPAIKQRP" gene 2187384..2188493 /locus_tag="Rv1936" /db_xref="GeneID:885882" CDS 2187384..2188493 /locus_tag="Rv1936" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1936, (MTCY09F9.28c), len: 369 aa. Possible monooxygenase (EC 1.-.-.-), similar to LXA2_PHOLU|P23146 alkanal monooxygenase alpha chain (362 aa), FASTA scores: opt: 196, E(): 6.3e-06, (22.3% identity in 373 aa overlap). Also similar to many other Mycobacterium tuberculosis hypothetical oxidoreductases and monooxygenases e.g. Rv0953c, Rv0791c, Rv0132c, etc." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_216452.1" /db_xref="GI:15609073" /db_xref="GeneID:885882" /translation="MEIGIFLMPAHPPERTLYDATRWDLDVIELADQLGYVEAWVGEH FTVPWEPICAPDLLLAQALLRTQQIKLAPGAHLLPYHHPVELAHRVAYFDHLAQGRFM LGVGASGIPGDWALYDVDGKNGEHREMTREALEIMLRIWTEDEPWEHRGKYWNANGIA PMFEGLMRRHIKPYQKPHPPIGVTGFSAGSETLKLAGERGYIPMSLDLNTEYVATHWD AVEEGALRSGRTPDRRDWRLVREVLVAETDEQAFRYAVDGTMGRAMREYVLPTFRMFG MTKFYKHNPSVPDDEVTPEYLAENTFVVGSVQTVVDKLEATYDQVGGFGHLLILGFDY SDNPGPWKESLRLLAHEVMPRLNARLATKPATAVV" gene 2188496..2191015 /locus_tag="Rv1937" /db_xref="GeneID:885433" CDS 2188496..2191015 /locus_tag="Rv1937" /function="UNKNOWN; MAY BE INVOLVED IN ELECTRON TRANSFER." /note="Rv1937, (MTCY09F9.27c), len: 839 aa. Possible oxygenase (EC 1.-.-.-), similar in N-terminus to N-terminal part (approx. 350 aa) of dioxygenases (including ring-hydroxylating dioxygenase electron transfer components) and monooxygenases, e.g. AAC34815.1|AF071556 anthranilate dioxygenase reductase from Acinetobacter sp. (343 aa); AAK52291.1|AY026914|AntC putative anthranilate dioxygenase reductase from Pseudomonas putida (340 aa); AAF63450.1|AF218267_7|AF218267 benzoate dioxygenase / ferredoxin reductase from Pseudomonas putida (336 aa); P23101|XYLZ_PSEPU toluate 1,2-dioxygenase electron transfer component [INCLUDES: FERREDOXIN; FERREDOXIN--NAD(+) REDUCTASE (EC 1.18.1.3)] from Pseudomonas putida plasmid TOL pWW0 (336 aa), FASTA scores: opt: 700, E(): 0, (34.3% identity in 335 aa overlap); S23479 probable benzoate 1,2-dioxygenase (EC 1.14.12.10) reductase component benC from Acinetobacter calcoaceticus (338 aa); AAC45294.1|U81594 soluble methane monooxygenase protein C from Methylocystis sp. (343 aa); P22868|MEMC_METCA METHANE MONOOXYGENASE COMPONENT C from Methylococcus capsulatus (348 aa); etc. Also similar in part to Mycobacterium tuberculosis hypothetical electron transfer proteins Rv3554, Rv3571, etc. Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature." /codon_start=1 /transl_table=11 /product="oxygenase" /protein_id="NP_216453.1" /db_xref="GI:15609074" /db_xref="GeneID:885433" /translation="MAVRQVTVGYSDGTHKTMPVRCDQTVLDAAEEHGVAIVNECQSG ICGTCVATCTAGRYQMGRTEGLSDVERAARKILTCQTFVTSDCRIELQYPVDDNAALL VTGDGVVTAVELVSPSTAILRVDTSGMAGALRYRAGQFAQLQVPGTNVWRNYSYAHPA DGRGECEFIIRLLPDGVMSNYLRDRAQPGDHIALRCSKGSFYLRPIVRPVILVAGGTG LSAILAMAQSLDADVAHPVYLLYGVERTEDLCKLDELTELRRRVGRLEVHVVVARPDP DWDGRTGLVTDLLDERMLASGDADVYLCGPVAMVDAARTWLDHNGFHRVGLYYEKFVA SGAARRRTPARLDYAGVDIAEVCRRGRGTAVVIGGSIAGIAAAKMLSETFDRVIVLEK DGPHRRREGRPGAAQGWHLHHLLTAGQIELERIFPGIVDDMVREGAFKVDMAAQYRIR LGGTWKKPGTSDIEIVCAGRPLLEWCVRRRLDDEPRIDFRYESEVADLAFDRANNAIV GVAVDNGDADGGDGLQVVPAEFVVDASGKNTRVPEFLERLGVGAPEAEQDIINCFYST MQHRVPPERRWQDKVMVICYAYRPFEDTYAAQYYTDSSRTILSTSLVAYNCYSPPRTA REFRAFADLMPSPVIGENIDGLEPASPIYNFRYPNMLRLRYEKKRNLPRALLAVGDAY TSADPVSGLGMSLALKEVREMQALLAKYGAGHRDLPRRYYRAIAKMADTAWFVIREQN LRFDWMKDVDKKRPFYFGVLTWYMDRVLELVHDDLDAYREFLAVVHLVKPPSALMRPR IASRVLGKWARTRLSGQKTLIARNYENHPIPAEPADQLVNA" misc_feature 2188616..2188642 /locus_tag="Rv1937" /note="PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature" gene 2191027..2192097 /gene="ephB" /locus_tag="Rv1938" /db_xref="GeneID:885392" CDS 2191027..2192097 /gene="ephB" /locus_tag="Rv1938" /function="THIS ENZYME ACTS ON EPOXIDES (ALKENE OXIDES, OXIRANES) AND ARENE OXIDES. PLAYS A ROLE IN XENOBIOTIC METABOLISM BY DEGRADING POTENTIAL TOXIC EPOXIDES. ALSO DETERMINES STEADY-STATE LEVELS OF PHYSIOLOGICAL MEDIATORS." /note="Rv1938, (MTCY09F9.26c), len: 356 aa. Probable ephB, epoxide hydrolase (EC 3.3.2.3) (see citation below), similar to many e.g. G1109600 ATSEH (EC 3.3.2.3) (321 aa), FASTA scores: opt: 442, E(): 1.2e-21 (33.1% identity in 356 aa overlap); etc. Also similar to many other M. tuberculosis hypothetical epoxide hydrolases e.g. Rv3617, Rv3670, Rv0134, etc." /codon_start=1 /transl_table=11 /product="epoxide hydrolase EphB" /protein_id="NP_216454.1" /db_xref="GI:15609075" /db_xref="GeneID:885392" /translation="MSQVHRILNCRGTRIHAVADSPPDQQGPLVVLLHGFPESWYSWR HQIPALAGAGYRVVAIDQRGYGRSSKYRVQKAYRIKELVGDVVGVLDSYGAEQAFVVG HDWGAPVAWTFAWLHPDRCAGVVGISVPFAGRGVIGLPGSPFGERRPSDYHLELAGPG RVWYQDYFAVQDGIITEIEEDLRGWLLGLTYTVSGEGMMAATKAAVDAGVDLESMDPI DVIRAGPLCMAEGARLKDAFVYPETMPAWFTEADLDFYTGEFERSGFGGPLSFYHNID NDWHDLADQQGKPLTPPALFIGGQYDVGTIWGAQAIERAHEVMPNYRGTHMIADVGHW IQQEAPEETNRLLLDFLGGLRP" gene 2192094..2192609 /locus_tag="Rv1939" /db_xref="GeneID:885488" CDS 2192094..2192609 /locus_tag="Rv1939" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1939, (MTCY09F9.25c), len: 171 aa. Probable oxidoreductase (EC 1.-.-.-), similar to NP_302637.1|NC_002677 probable oxidoreductase from Mycobacterium leprae (162 aa) Also similar to NTAB_CHELE|P54990 nitrilotriacetate monooxygenase component from Chelatobacter heintzii (322 aa), fasta scores: opt: 269, E(): 5.3e-11, (33.1% identity in 151 aa overlap). And similar to Mycobacterium tuberculosis probable monooxygenase components Rv0246, Rv3567, and to a lesser extent, Rv3007c." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_216455.1" /db_xref="GI:15609076" /db_xref="GeneID:885488" /translation="MSCTFDMVPETVDHLDEVGLRRVFGCFPCGVIAVCAMVDDQPVG MAASSFTSVSVDPPLVSICVQNCSTTWPKLRDRPRLGVSVLAEGHDAACMSLSRKEGN RFAGVFWSELSSGGVVIAGAGAWLDCRPYAEIPAGDHLIALLEICAVRADPETPPLVF HGSRFRRLESR" gene 2192606..2193667 /gene="ribA1" /locus_tag="Rv1940" /db_xref="GeneID:885621" CDS 2192606..2193667 /gene="ribA1" /locus_tag="Rv1940" /EC_number="3.5.4.25" /function="INVOLVED IN RIBOFLAVIN BIOSYNTHESIS [CATALYTIC ACTIVITY : GTP + 3 H(2)O = FORMATE + 2,5-DIAMINO-6-HYDROXY-4-(5-PHOSPHORIBOSYLAMINO)PYRIMIDINE + DIPHOSPHATE]" /note="Rv1940, (MTCY09F9.24c), len: 353 aa. Probable ribA1, Riboflavin biosynthesis protein (EC 3.5.4.25), similar to GCH2_BACSU|P17620 gtp cyclohydrolase ii (EC 3.5.4.25) (398 aa), FASTA scores: opt: 682, E(): 0, (37.7% identity in 363 aa overlap), also similar to Rv1415|MTCY21B4.33|ribA2 (428 aa) (45.4% identity in 368 aa overlap). Note that previously known as ribA.; ribA" /codon_start=1 /transl_table=11 /product="riboflavin biosynthesis protein ribA1 (GTP cyclohydrolase II)" /protein_id="YP_177851.1" /db_xref="GI:57116930" /db_xref="GeneID:885621" /translation="MKTTDVRVRRAITAMAGGHAVVLTGDPNGDGYLVFAAQAATPRL VAFAVRHTSGYLRVALPGAECERLHLPPMCDRDTTHCVSVDVRGTGTGISASDRAWTI AALASATSVAADFQRPGHVVPVQAQADGVLGRRGPAEAAVDLARLAERRPAAALCEIV SPDNPVQMAHHAESVEFAVEHGLAMVSIGELVAYRRRIEPQVVRFTAATLPTWAGASR VIGFRDVYDLGEHLAVIVGAVGAGVPVPLHVHIECLTGDVFGSTACRCGEELNGALAR MSAQGSGVVLYLRPPGPAQACGLFARGDAATDVMPETVTWILRDLGVYAIRLSDDVPG FGLVMFGAIREASTLAAAG" gene 2193664..2194434 /locus_tag="Rv1941" /db_xref="GeneID:885622" CDS 2193664..2194434 /locus_tag="Rv1941" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv1941, (MTCY09F9.23c), len: 256 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases, generally belonging to SDR family, e.g. NP_299015.1|NC_002488 2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase from Xylella fastidiosa (255 aa); NP_250340.1|NC_002516 probable short-chain dehydrogenase from Pseudomonas aeruginosa (253 aa); NP_106890.1|NC_002678 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mesorhizobium loti (374 aa) (has its N-terminus longter); P50197|LINC_PSEPA 2,5-dichloro-2,5-cyclohexadiene-1,4-dehydrogenase from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (250 aa), FASTA scores: opt: 529, E(): 5.7e-25, (40.6% identity in 251 aa overlap); etc. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short-chain type dehydrogenase/reductase" /protein_id="NP_216457.1" /db_xref="GI:15609078" /db_xref="GeneID:885622" /translation="MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGDA ADAAATKIGCGAAACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLIDTT VEDFDRVIAINLRGAWLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGGTGAYGMSKAG IIQLSRITAAELRSSGIRSNTLLPAFVDTPMQQTAMAMFDGALGAGGARSMIARLQGR MAAPEEMAGIVVFLLSDDASMITGTTQIADGGTIAALW" misc_feature 2194084..2194170 /locus_tag="Rv1941" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(2194644..2194973) /locus_tag="Rv1942c" /db_xref="GeneID:885606" CDS complement(2194644..2194973) /locus_tag="Rv1942c" /function="UNKNOWN" /note="Rv1942c, (MTCY09F9.22), len: 109 aa. Conserved hypothetical protein, shows some similarity to Q10867|MTCY39.28|Rv1991 hypothetical 12.3 kDa protein (114 aa), FASTA scores: opt: 117, E(): 0.021, (24. 5% identity in 110 aa overlap) also P33645|CHPA_ECOLI pemk-like protein 1 (mazf protein) from Escherichia coli (111 aa), FASTA scores: opt: 104, E(): 0.18, (29.1% identity in 110 aa overlap). Also similar to Mycobacterium tuberculosis Rv0659c (102 aa) (32.7% identity in 101 aa overlap); Rv1102c (33.3% identity in 93 aa overlap) and Rv1495." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216458.1" /db_xref="GI:15609079" /db_xref="GeneID:885606" /translation="MTALPARGEVWWCEMAEIGRRPVVVLSRDAAIPRLRRALVAPCT TTIRGLASEVVLEPGSDPIPRRSAVNLDSVESVSVAVLVNRLGRLADIRMRAICTALE VAVDCSR" gene complement(2194970..2195347) /locus_tag="Rv1943c" /db_xref="GeneID:885605" CDS complement(2194970..2195347) /locus_tag="Rv1943c" /function="UNKNOWN" /note="Rv1943c, (MTCY09F9.21), len: 125 aa. Conserved hypothetical protein, showing some similarity with Rv1946c|MTCY09F9.18|lppG possible conserved lipoprotein from Mycobacterium tuberculosis (150 aa), FASTA score: (71.4% identity in 28 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216459.1" /db_xref="GI:15609080" /db_xref="GeneID:885605" /translation="MKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWH SSGMNRIRLSTTVDAALLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAAYD KHPVDEPDEWGDLASWRRAAGDS" gene complement(2195344..2195934) /locus_tag="Rv1944c" /db_xref="GeneID:885884" CDS complement(2195344..2195934) /locus_tag="Rv1944c" /function="UNKNOWN" /note="Rv1944c, (MTCY09F9.20), len: 196 aa. Conserved hypothetical protein, similar to C-terminal part of SCE20.29|AL136058|CAB65585.1 hypothetical protein from Streptomyces coelicolor (338 aa), BLASTP scores, Identities = 37/131 (28%), Positives = 51/131 (38%)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216460.1" /db_xref="GI:15609081" /db_xref="GeneID:885884" /translation="MISDTEDFAHGDKAAPPRLRASYAACGGDAAGCWTMSDNGASRV PPVDETPAAESAEPITAVSLAWLPAGDYERALDLWPDFAGSDLVTGPDGPVAHPLYCR RMQQKLVEFAEAGFPGLAVAAIRVAPFAAWCAEQGQEPDSPEARAEYAAYLTAHGDHD VMAWPPGRNQQCWCGSGHKYKKCCAAASFIDTEPAP" gene 2195989..2197353 /locus_tag="Rv1945" /db_xref="GeneID:885980" CDS 2195989..2197353 /locus_tag="Rv1945" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1945, (MTCY09F9.19c), len: 454 aa. Member of Mycobacterium tuberculosis REP13E12 repeat family. Similar to several others, best with Rv1148c|Z95584|MTCI65.15 (482 aa), FASTA score: opt: 2954, E(): 0, (97.1% identity in 454 aa overlap). Contains possible helix-turn-helix motif at aa 74-95 (+2.90 SD)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216461.1" /db_xref="GI:15609082" /db_xref="GeneID:885980" /translation="MRSDTREEISAALDAYHASLSRVLDLKCDALTTPELLACLQRLE VERRRQGAAEHALINQLAGQACEEELGGTLRTALANRLHITPGEASRRIAEAEDLGER RALTGEPLPAQLTATAAAQREGKIGREHIKEIQAFFKELSAAVDLGIREAAEAQLAEL ATSRRPDHLHGLATQLMDWLHPDGNFSDQERARKRGITMGKQEFDGMSRISGLLTPEL RATIEAVLAKLAAPGACNPDDQTPVVDDTPDADAVRRDTRSQAQRHHDGLLAGLRGLL ASGELGQHRGLPVTVVVSTTLKELEAATGKGVTGGGSRVPMSDLIRMASNAHHYLALF DGAKPLALYHTKRLASPAQRIMLYAKDRGCSRPGCDAPAYHSEVHHVTPWTTTHRTDI NDLTLACGPDNRLVEKGWKTRKNAKGDTEWLPPAHLDHGQPRINRYHHPEKILCEPDD DEPH" repeat_region 2195989..2197350 /note="REP-7, len: 1362 bp. REP09F9, member of the REP13E12 family.; REP-7" /rpt_type=DIRECT gene complement(2197508..2197960) /gene="lppG" /locus_tag="Rv1946c" /db_xref="GeneID:885979" CDS complement(2197508..2197960) /gene="lppG" /locus_tag="Rv1946c" /function="UNKNOWN" /note="Rv1946c, (MTCY09F9.18), len: 150 aa. Possible lppG, conserved lipoprotein, showing some similarity to Rv1943c|MTCY09F9.21 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (125 aa), FASTA score: (71.4% identity in 28 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein" /protein_id="NP_216462.1" /db_xref="GI:15609083" /db_xref="GeneID:885979" /translation="MIRGSAVSGLLMPSVNGGTAGSVACVQCLFLPKVAVDLINLSGI QCFARIEHVAHAQAHPFVVLVGKPAQHGARIGAVAGAILTGDVIVSHDGELYRAVTAL RQNGPRPHASRRLHAPALCSARSRRGHLRPSCWLPPPRFAGRQSLVAR" misc_feature complement(2197886..2197918) /gene="lppG" /locus_tag="Rv1946c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2198024..2198425 /locus_tag="Rv1947" /db_xref="GeneID:885976" CDS 2198024..2198425 /locus_tag="Rv1947" /function="UNKNOWN" /note="Rv1947, (MTCY09F9.17c), len: 133 aa. Hypothetical unknown protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216463.1" /db_xref="GI:15609084" /db_xref="GeneID:885976" /translation="MDRYNDQASGRALIEIRLCNERATPMPIPIGLWMFQTKLHVNAG GADVFLPVCDVLEQDLAERDEEVRQLNLQYRNRLEYAIGRTCSAAWSVNGSRRPSAVW TTWLPVAETPHTRARSVENALLSMDSRGGVT" gene complement(2198714..2199064) /locus_tag="Rv1948c" /db_xref="GeneID:885975" CDS complement(2198714..2199064) /locus_tag="Rv1948c" /function="UNKNOWN" /note="Rv1948c, (MTCY09F9.16), len: 116 aa. Hypothetical unknown protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216464.1" /db_xref="GI:15609085" /db_xref="GeneID:885975" /translation="MTVFGIKPDNYFGDVVLAAADRDGLRIFQYAVRSAHESGQATFD IDGVQQRIVRESGTADMELGSQTVVWRFDDTKLVEILDKLSPLIDGEGPGHQYIDDLN SPAPTLMISVDEYA" gene complement(2199075..2200034) /locus_tag="Rv1949c" /db_xref="GeneID:885974" CDS complement(2199075..>2200034) /locus_tag="Rv1949c" /function="UNKNOWN" /note="Rv1949c, (MTCY09F9.15), len: 319 aa. Conserved hypothetical protein, partial ORF. Rv1949c and Rv1950c|MTCY09F9.14 are similar but frameshifted with respect to Rv2077c|MTCY49.16C|Q10685 hypothetical 33.3 kd protein (323 aa), FASTA scores: opt: 459, E(): 2.8e-16, (54.8% identity in 157 aa overlap). Cosmid sequence appears to be correct, genomic sequence is also frameshifted in Mycobacterium bovis strain AF2122/97. Similar to Mycobacterium tuberculosis hypothetical proteins: Rv2542, Rv2077c, Rv2797c, Rv0963c, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216465.1" /db_xref="GI:15609086" /db_xref="GeneID:885974" /translation="WLRQRTGADLQIVSGIAEHLRQASGLAREGAGTIGAAQRRVIYA VQDAHNAGFNVEEDLSVTDTRTSRTFAEQAARQAQAQALAGDIRQRATQLIGVEHEVA AKIATATAPLNTVGFHEPPIAPSLPTPVPHNEKPQIHAVDRSWKQDPPSPMPGDPKDM TAVQARAAWDAVNADIARYNARCGRTFVLPNEQAAYDACIADKGSLFERQAAIRARLG ELGVPVEGEPPPAPDPAGPQPNEGLPPPGVSPPAESNLTVGPPSRPIQQARGGESLWD ENGGEWRYFPGDNYRYPHWDYNPHDSPTARWQNIPIGDLPTHK" gene complement(2199998..2200189) /locus_tag="Rv1950c" /db_xref="GeneID:885973" CDS complement(2199998..2200189) /locus_tag="Rv1950c" /function="UNKNOWN" /note="Rv1950c, (MTCY09F9.14), len: 63 aa. Conserved hypothetical protein, partial ORF. Highly similar to N-terminus of Rv2077c|MTCY49.16C|Q10685 hypothetical 33.3 kDa protein (323 aa), FASTA scores: opt: 280, E(): 1.2 e-16, (71.7% identity in 53 aa overlap) but homology continues in different frame ie MTCY09F9.15, cosmid sequence appears to be correct, genomic sequence is also frameshifted in Mycobacterium bovis strain AF2122/97." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216466.1" /db_xref="GI:15609087" /db_xref="GeneID:885973" /translation="MLPTLSHIHAWDTEHLIEAAYYWTKVADQWEDVFLEMRNRSHFI AWEGAGGDGCDSEPALTYR" gene complement(2200190..2200486) /locus_tag="Rv1951c" /db_xref="GeneID:885972" CDS complement(2200190..2200486) /locus_tag="Rv1951c" /function="UNKNOWN" /note="Rv1951c, (MTCY09F9.13), len: 98 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein Rv2541 (135 aa) (40.9% identity in 88 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216467.1" /db_xref="GI:15609088" /db_xref="GeneID:885972" /translation="MKAGELRVNIQQVAATASQWSGRSTELSVLAPPPLGQPFQPTTA AVGGAHAAVGLAVAAFTARTHATASAVEAAAAEYANNEAAAAAEMAAVPQTRLV" gene 2200726..2200941 /locus_tag="Rv1952" /db_xref="GeneID:885971" CDS 2200726..2200941 /locus_tag="Rv1952" /function="UNKNOWN" /note="Rv1952, (MTCY09F9.12c), len: 71 aa. Conserved hypothetical protein. Some similarity to P55510|Y4JJ_RHISN PUTATIVE PLASMID STABILITY PROTEIN (85 aa), FASTA scores: opt: 127, E(): 0.00096, (42.5% identity in 73 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216468.1" /db_xref="GI:15609089" /db_xref="GeneID:885971" /translation="MIRNLPEGTKAALRVRAARHHHSVEAEARAILTAGLLGEEVPMP VLLAADSGHDIDFEPERLGLIARTPQL" gene 2200938..2201249 /locus_tag="Rv1953" /db_xref="GeneID:885970" CDS 2200938..2201249 /locus_tag="Rv1953" /function="UNKNOWN" /note="Rv1953, (MTCY09F9.11c), len: 103 aa. Conserved hypothetical protein. Some similarity to O33827 PLASMID STABILITY-LIKE PROTEIN from Thiobacillus ferrooxidans (143 aa), FASTA scores: opt: 170, E(): 3.5e-06, (45.3% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216469.1" /db_xref="GI:15609090" /db_xref="GeneID:885970" /translation="MTYVLDTNVVSALRVPGRHPAVAAWADSVQVAEQFVVAITLAEI ERGVIAKERTDPTQSEHLRRWFDDKVLRIFVFARRGTNLIMQPLAGHIGYSLYSGISW F" gene complement(2201223..2201744) /locus_tag="Rv1954c" /db_xref="GeneID:885967" CDS complement(2201223..2201744) /locus_tag="Rv1954c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1954c, (MTCY09F9.10), len: 173 aa. Hypothetical unknown protein, end overlaps next ORF upstream, Rv1955 (MTCY09F9.09c)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216470.1" /db_xref="GI:15609091" /db_xref="GeneID:885967" /translation="MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPP RRCDTHPDGTSSAAAALVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSRRR SRLTRGRSFTSHLITSCPRLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPR WGPFRLKPAYTRI" gene 2201584..2202096 /locus_tag="Rv1955" /db_xref="GeneID:885966" CDS 2201584..2202096 /locus_tag="Rv1955" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1955, (MTCY09F9.09c), len: 170 aa. Hypothetical unknown protein, start overlaps another ORF, Rv1954c (MTCY09F9.10)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216471.1" /db_xref="GI:15609092" /db_xref="GeneID:885966" /translation="MPSGWVSHRLGGSPKCISALSLPSGTVGAPSKPDNDATRGRTRP TVPPPDPAAMGTWKFFRASVDGRPVFKKEFDKLPDQARAALIVLMQRYLVGDLAAGSI KPIRGDILELRWHEANNHFRVLFFRWGQHPVALTAFYKNQQKTPKTKIETALDRQKIW KRAFGDTPPI" gene 2202138..2202587 /locus_tag="Rv1956" /db_xref="GeneID:885964" CDS 2202138..2202587 /locus_tag="Rv1956" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv1956, (MTCY09F9.08c), len: 149 aa. Possible transcriptional regulatory protein, contains probable helix-turn-helix motif at aa 52-73 (+4.78 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216472.1" /db_xref="GI:15609093" /db_xref="GeneID:885964" /translation="MSIDFPLGDDLAGYIAEAIAADPSFKGTLEDAEEARRLVDALIA LRKHCQLSQVEVAKRMGVRQPTVSGFEKEPSDPKLSTLQRYARALDARLRLVLEVPTL REVPTWHRLSSYRGSARDHQVRVGADKEILMQTNWARHISVRQVEVA" gene 2202584..2203129 /locus_tag="Rv1957" /db_xref="GeneID:885961" CDS 2202584..2203129 /locus_tag="Rv1957" /function="UNKNOWN" /note="Rv1957, (MTCY09F9.07c), len: 181 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216473.1" /db_xref="GI:15609094" /db_xref="GeneID:885961" /translation="MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPA QGLTYDLEFEPAVDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATAD FEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLE ILSRPMPVSPGAQWPATRGTP" gene complement(2203018..2203632) /locus_tag="Rv1958c" /db_xref="GeneID:885960" CDS complement(2203018..2203632) /locus_tag="Rv1958c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1958c, (MTCY09F9.06), len: 204 aa. Hypothetical unknown protein, questionable ORF" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216474.1" /db_xref="GI:15609095" /db_xref="GeneID:885960" /translation="MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNP RRLSMNPGGMRIRCRRGDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASPVF ENLELRAAAGLAFGFRLRPFGGTAADSPPVAAQDLDPCRWADSPALHLAVGVETMVVG QLDSPSFGQGVPLVAGHWAPGETGIGRDNISRVNGGSARRPVRS" gene complement(2203681..2203977) /locus_tag="Rv1959c" /db_xref="GeneID:885958" CDS complement(2203681..2203977) /locus_tag="Rv1959c" /function="UNKNOWN" /note="Rv1959c, (MTCY09F9.05), len: 98 aa. Conserved hypothetical protein, similar to other hypothetical plasmid proteins e.g. AL117189|YPCD1.08 from Yersinia pestis (99 aa), FASTA scores: opt: 162, E(): 7.3e-05, (33.0% identity in 91 aa overlap); also some similarity to E145339 hypothetical protein (103 aa), FASTA scores: opt: 142, E(): 0.0003, (33.0% identity in 91 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216475.1" /db_xref="GI:15609096" /db_xref="GeneID:885958" /translation="MSSRYLLSPAAQAHLEEIWDCTYDRWGVDQAEQYLRELQHAIDR AAANPRIGRACDEIRPGYRKLSAGSHTLFYRVTGEGTIDVVRVLHQRMDVDRNL" gene complement(2203974..2204225) /locus_tag="Rv1960c" /db_xref="GeneID:885957" CDS complement(2203974..2204225) /locus_tag="Rv1960c" /function="UNKNOWN" /note="Rv1960c, (MTCY09F9.04), len: 83 aa. Conserved hypothetical protein, similar to O85269|AF102990|AF102990_51 hypothetical protein of Yersinia enterocolitica (80 aa), FASTA scores: opt: 149, E(): 0.00037, (42 .1% identity in 57 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216476.1" /db_xref="GI:15609097" /db_xref="GeneID:885957" /translation="MGKNTSFVLDEHYSAFIDGEIAAGRYRSASEVIRSALRLLEDRE TQLRALREALEAGERSGSSTPFDFDGFLGRKRADASRGR" gene 2204212..2204706 /locus_tag="Rv1961" /db_xref="GeneID:885954" CDS 2204212..2204706 /locus_tag="Rv1961" /function="UNKNOWN" /note="Rv1961, MTCY09F9.03c, len: 164 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216477.1" /db_xref="GI:15609098" /db_xref="GeneID:885954" /translation="MFLPTNAQYQLLVVGVSPWDTPSPSGRISWGSAWPHQARRAQTC QRVRRHWMIDTTEAAYRLTYQPDGTSITVRENLVDILARELLGPIRGPQEVLPFSPRS QYLVGHLAPVKLTGAALIDDNAVQARANAEALAEGGGVPAYAADETTPTPTTTPKTAH PSRA" gene complement(2204866..2205273) /locus_tag="Rv1962c" /db_xref="GeneID:885952" CDS complement(2204866..2205273) /locus_tag="Rv1962c" /function="UNKNOWN" /note="Rv1962c, (MTCY09F9.02), len: 135 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins Rv3408|MTCY78.20c (133 aa) (36.2% identity in 138 aa overlap); and Rv3384c (130 aa) (43.1% identity in 130 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216478.1" /db_xref="GI:15609099" /db_xref="GeneID:885952" /translation="MIYLETSALVKLIRIEVESDALADWLDDRTELRWITSALTEVEL SRAIRAVSPEGLPAVPSVLARLDRFEIDAVIRSTAAAYPNPALRSLDAIHLATAQTAG SVAPLTALVTYDNRLKEAAEALSLAVVAPGQAR" gene complement(2205582..2206802) /gene="mce3R" /locus_tag="Rv1963c" /db_xref="GeneID:885950" CDS complement(2205582..2206802) /gene="mce3R" /locus_tag="Rv1963c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM; REPRESSION OF THE MCE3 OPERON. COULD ALSO HAVE A REGULATORY ACTION ON THE MCE2 OPERON." /experiment="experimental evidence, no additional details recorded" /note="Rv1963c, (MTV051.01c-MTCY09F9.01), len: 406 aa. Probable mce3R, negative transcriptional regulatory protein, tetR family (see citation below); similar to several transcriptional regulator e.g. AL049485|SC6A5.30 Streptomyces coelicolor cosmid 6 A (404 aa), FASTA scores: opt: 319, E(): 6.4e-13, (29.5% identity in 373 aa overlap); and Z84498|MTCY9F9_1 (259 aa), FASTA scores: opt: 208, E(): 1.6e-07, (100.0% identity in 32 aa overlap). Contains probable helix-turn-helix at aa 36-57 (+4.23 SD) and two tet-R family signatures." /codon_start=1 /transl_table=11 /product="transcriptional repressor (probably TETR-family) MCE3R" /protein_id="NP_216479.1" /db_xref="GI:15609100" /db_xref="GeneID:885950" /translation="MASVAQPVRRRPKDRKKQILDQAVGLFIERGFHSVKLEDIAEAA GVTARALYRHYDNKQALLAEAIRTGQDQYQSARRLTEGETEPTPRPLNADLEDLIAAA VASRALTVLWQREARYLNEDDRTAVRRRINAIVAGMRDSVLLEVPDLSPQHSELRAWA VSSTLTSLGRHSLSLPGEELKKLLYQACMAAARTPPVCELPPLPAGDAARDEADVLFS RYETLLAAGARLFRAQGYPAVNTSEIGKGAGIAGPGLYRSFSSKQAILDALIRRLDEW RCLECIRALRANQQAAQRLRGLVQGHVRISLDAPDLVAVSVTELSHASVEVRDGYLRN QGDREAVWIDLIGKLVPATSVAQGRLLVAAAISFIEDVARTWHLTRYAGVADEISGLA LAILTSGAGNLLRA" gene 2207700..2208497 /gene="yrbE3A" /locus_tag="Rv1964" /db_xref="GeneID:885948" CDS 2207700..2208497 /gene="yrbE3A" /locus_tag="Rv1964" /function="UNKNOWN" /note="Rv1964, (MTV051.02), len: 265 aa. yrbE3A, hypothetical unknown integral membrane protein, part of mce3 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07412|Rv0167|MTCI28.07|yrbE1A (265 aa), O07791|Rv0587|MTCY19H5.35|yrbE2A (265 aa), Rv3501c|MTV023.08c|yrbE4A (254 aa), etc. Also highly similar to conserved hypothetical integral membrane proteins of yrbEA type, e.g. AAD24544.1|AF116213|YrbE1A from Mycobacterium leprae (112 aa); P45392|YRBE_ECOLI from Escherichia coli (260 aa), FASTA scores: opt: 893, E(): 0, (51.4% identity in 253 aa overlap); etc. TBparse score is 0.889. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002)." /codon_start=1 /transl_table=11 /product="integral membrane protein YrbE3A" /protein_id="NP_216480.1" /db_xref="GI:15609101" /db_xref="GeneID:885948" /translation="MVIVADKAAGRVADPVLRPVGALGDFFAMTLDTSVCMFKPPFAW REYLLQCWFVARVSTLPGVLMTIPWAVISGFLFNVLLTDIGAADFSGTGCAIFTVNQS APIVTVLVVAGAGATAMCADLGARTIREELDALRVMGINPIQALAAPRVLAATTVSLA LNSVVTATGLIGAFFCSVFLMHVSAGAWVTGLTTLTHTVDVVISMIKATLFGLMAGLI ACYKGMSVGGGPAGVGRAVNETVVFAFIVLFVINIVVTAVGIPFMVS" gene 2208507..2209322 /gene="yrbE3B" /locus_tag="Rv1965" /db_xref="GeneID:885947" CDS 2208507..2209322 /gene="yrbE3B" /locus_tag="Rv1965" /function="UNKNOWN" /note="Rv1965, (MTV051.03), len: 271 aa. yrbE4B, hypothetical unknown integral membrane protein, part of mce3 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07413|Rv0168|MTCI28.08|yrbE1B (289 aa), FASTA scores: opt: 937, E(): 0, (54.3% identity in 254 aa overlap); O07790|Rv0588|MTCY19H5.34|yrbE2B (295 aa); etc. Also highly similar to conserved hypothetical integral membrane proteins of the yrbEB type, e.g. AAD24545.1|AF116213|YrbE1B from Mycobacterium leprae (106 aa); P45392|YRBE_ECOLI HYPOTHETICAL 27.9 kDa PROTEIN from Escherichia coli (260 aa), FASTA scores: opt: 218, E(): 1.2e-07, (24.1% identity in 245 aa overlap); etc. TBparse score is 0.881. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002)." /codon_start=1 /transl_table=11 /product="integral membrane protein YrbE3b" /protein_id="NP_216481.1" /db_xref="GI:15609102" /db_xref="GeneID:885947" /translation="MTAAKALVSEWNRMGSQMRFFVGTLAGIPDALMHYRGELLRVIA QMGLGTGVLAVIGGTVAIVGFLAMTTGAIVAVQGYNQFASVGVEALTGFASAFFNTRE IQPGTVMVALAATVGAGTTAALGAMRINEEIDALEVIGIRSISYLASTRVLAGVVVAV PLFCVGLMTAYLAARVGTTAIYGQGSGVYDHYFNTFLRPTDVLWSSVEVVVVALMIML VCTYYGYAAHGGPAGVGEAVGRAVRASMVVASIAILVMTLAIYGQSPNFHLAT" gene 2209327..2210604 /gene="mce3A" /locus_tag="Rv1966" /db_xref="GeneID:885944" CDS 2209327..2210604 /gene="mce3A" /locus_tag="Rv1966" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv1966, (MTV051.04), len: 425 aa. mce3A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins P72013|MCE1|Rv0169|MTCI28.09|mce1A (454 aa); O07789|MCE2|Rv0589|MTCY19H5.33c|mce2A (404 aa); etc. Also highly similar to others e.g. AAD52105.1|AF113402_1|AF113402 mycobacterial cell entry protein from Mycobacterium bovis BCG (454 aa); NP_302656.1|NC_002677 putative cell invasion protein from Mycobacterium leprae (441 aa); CAC12798.1|AL445327 putative secreted protein from Streptomyces coelicolor (418 aa); etc. Contains a possible N-terminal signal sequence or membrane anchor. TBparse score is 0.897. Note that previously known as mce3. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002).; mce3" /codon_start=1 /transl_table=11 /product="MCE-family protein MCE3A" /protein_id="YP_177852.1" /db_xref="GI:57116931" /db_xref="GeneID:885944" /translation="MRRGPGRHRLHDAWWTLILFAVIGVAVLVTAVSFTGSLRSTVPV TLAADRSGLVMDSGAKVMMRGVQVGRVAQIGRIEWAQNGASLRLEIDPDQIRYIPANV EAQISATTAFGAKFVDLVMPQNPSRARLSAGAVLHSKNVSTEINTVFENVVDLLNMID PLKLNAVLTAVADAVRGQGERIGQATTDLNEVLEALNARGDTIGGNWRSLKNFTDTYD AAAQDILTILNAASTTSATVVNHSTQLDALLLNAIGLSNAGTNLLGSSRDNLVGAADI LAPTTSLLFKYNPEYTCFLQGAKWYLDNGGYAAWGGADGRTLQLDVALLFGNDPYVYP DNLPVVAAKGGPGGRPGCGPLPDATHNFPVRQLVTNTGWGTGLDIRPNPGIGHPCWAN YFPVTRAVPEPPSIRQCIPGPAIGPNPAAGEQP" gene 2210601..2211629 /gene="mce3B" /locus_tag="Rv1967" /db_xref="GeneID:885943" CDS 2210601..2211629 /gene="mce3B" /locus_tag="Rv1967" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv1967, (MTV051.05), len: 342 aa. mce3B; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07414|Rv0170|MTCI28.10|mce1B (346 aa); O07788|Rv0590|MTCY19H5.32c|mce2B (275 aa); etc. Also similar to others e.g. NP_302657.1|NC_002677 putative secreted protein from Mycobacterium leprae (346 aa); CAC12797.1|AL445327 putative secreted protein from Streptomyces coelicolor (354 aa); etc. Contains a possible N-terminal signal sequence or membrane anchor. TBparse score is 0.872. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002)." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE3B" /protein_id="NP_216483.1" /db_xref="GI:15609104" /db_xref="GeneID:885943" /translation="MRENLGGVVVRLGVFLAVCLLTAFLLIAVFGEVRFGDGKTYYAE FANVSNLRTGKLVRIAGVEVGKVTRISINPDATVRVQFTADNSVTLTRGTRAVIRYDN LFGDRYLALEEGAGGLAVLRPGHTIPLARTQPALDLDALIGGFKPLFRALNPEQVNAL SEQLLHAFAGQGPTIGSLLAQSAAVTNTLADRDRLIGQVITNLNVVLGSLGAHTDRLD QAVTSLSALIHRLAQRKTDISNAVAYTNAAAGSVADLLSQARAPLAKVVRETDRVAGI AAADHDYLDNLLNTLPDKYQALVRQGMYGDFFAFYLCDVVLKVNGKGGQPVYIKLAGQ DSGRCAPK" gene 2211626..2212858 /gene="mce3C" /locus_tag="Rv1968" /db_xref="GeneID:885942" CDS 2211626..2212858 /gene="mce3C" /locus_tag="Rv1968" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv1968, (MTV051.06), len: 410 aa. mce3C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07415|R0171|MTCI28.11|mce1C (515 aa); O07787|Rv0591|MTCY19H5.31|mce2C (481 aa); etc. Also similar to others e.g. CAC12796.1|AL445327 putative secreted protein from Streptomyces coelicolor (351 aa); NP_302658.1|NC_002677 putative secreted protein from Mycobacterium leprae (519 aa); etc. Contains a possible N-terminal signal sequence or membrane anchor. TBparse score is 0.875. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002)." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE3C" /protein_id="NP_216484.1" /db_xref="GI:15609105" /db_xref="GeneID:885942" /translation="MKSFAERNRLAIGTVGIVVVAAVALAALQYQRLPFFNQGTRVSA YFADAGGLRTGNTVEVSGYPVGKVSSISLDGPGVLVEFKVDTDVRLGNRTEVAIKTKG LLGSKFLDVTPRGDGRLDSPIPIERTTSPYQLPDALGDLAATISGLHTERLSESLATL AQTFADTPAHFRNAIHGVARLAQTLDERDNQLRSLLANAAKATGVLANRTDQIVGLVR DTNVVLAQLRTQSAALDRIWANISAVAEQLRGFIAENRQQLRPALDKLNGVLAIVENR KERVRQAIPLINTYVMSLGESLSSGPFFKAYVVNLLPGQFVQPFISAAFSDLGLDPAT LLPSQLTDPPTGQPGTPPLPMPYPRTGQGGEPRLTLPDAITGNPGDPRYPYRPEPPAP PPGGPPPGPPAQQPGDQP" gene 2212855..2214126 /gene="mce3D" /locus_tag="Rv1969" /db_xref="GeneID:885940" CDS 2212855..2214126 /gene="mce3D" /locus_tag="Rv1969" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv1969, (MTV051.07), len: 423 aa. mce3D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07416|Rv0172|MTCI28.12|mce1D (530 aa); O07786|Rv0592|MTCY19H5.30c|mce2D (508 aa); etc. Also highly similar to others e.g. NP_302659.1|NC_002677 putative secreted protein from Mycobacterium leprae (531 aa); CAC12795.1|AL445327 putative secreted protein from Streptomyces coelicolor (337 aa); etc. Contains a possible N-terminal signal sequence or membrane anchor. TBparse score is 0.872. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002)." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE3D" /protein_id="NP_216485.1" /db_xref="GI:15609106" /db_xref="GeneID:885940" /translation="MTTKLRRARSVLATALVLVAGVILAMRTADAAARTTVVAYFDNS NGVFAGDDVLIRGVPVGKIVKIEPQPLRAKISFWFDRKYRVPADAAAAILSPQLVTGR AIQLTPPYAGGPTMADGTVIPQERTVVPVEWDDLRAQLQRLTALLQPTRPGGVSTLGA LINTAADNLRGQGATIRDTIIKLSQAISALGDHSKDIFSTVTNLSTLVTALHDSADLL ERLNHNLAAVTSLLADGPDKIGQAAEDLNAVVADVGSFAAEHREAIGTASDKLASITT ALVDSLDDIKQTLHISPTVLQNFNNIFEPANGALTGALAGNNMANPIAFLCGAIQAAS RLGGEQAAKLCVQYLAPIVKNRQYNYPPLGANLFVGAQARPNEVTYSEDWLRPDYVAP VADTPPDPAAAVTVDPATGLRGMMMPPGGGS" gene 2214123..2215256 /gene="lprM" /locus_tag="Rv1970" /db_xref="GeneID:885939" CDS 2214123..2215256 /gene="lprM" /locus_tag="Rv1970" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv1970, (MTV051.08), len: 377 aa. Possible lprM (alternate gene name: mce3E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07417|LPRK|Rv0173|MTCI28.13|mce1E (390 aa); O07785|LPRL|Rv0593|MTCY19H5.29|mce2E (402 aa); etc. Also highly similar to others e.g. NP_302660.1|NC_002677 putative lipoprotein from Mycobacterium leprae (392 aa); CAC12794.1|AL445327 putative secreted protein from Streptomyces coelicolor (413 aa); etc. Contains possible N-terminal signal sequence or membrane anchor and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.880. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002).; mce3E" /codon_start=1 /transl_table=11 /product="MCE-family lipoprotein LprM" /protein_id="NP_216486.1" /db_xref="GI:15609107" /db_xref="GeneID:885939" /translation="MRIGLTLVMIAAVVASCGWRGLNSLPLPGTQGNGPGSFAVQAQL PDVNNIQPNSRVRVADVTVGHVTKIERQGWHALVTMRLDGDVDLPANATAKIGTTSLL GSYHIELAPPKGEARQGKLRDGSLIALSHGSAYPSTEQTLAALSLVLNGGGLGQVQDI TEALSTAFAGREHDLRGLIGQLDTFTAYLNNQSGDIIAATDSLNRLVGKFADQQPVFD RALATIPDALAVLADERDTLVEAAEQLSKFSALTVDSVNKTTANLVTELRQLGPVLES LANSGPALTRSLSLLATFPFPNETFQNFQRGEYANLTAIVDLTLSRIDQGLLTGTRWE CHLTQLELQWGRTIGQFPSPCTAGYRGTPGNPLTIAYRWDQGP" misc_feature 2214141..2214173 /gene="lprM" /locus_tag="Rv1970" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2215257..2216570 /gene="mce3F" /locus_tag="Rv1971" /db_xref="GeneID:885938" CDS 2215257..2216570 /gene="mce3F" /locus_tag="Rv1971" /function="UNKNOWN, BUT THOUGHT INVOLVED IN HOST CELL INVASION." /note="Rv1971, (MTV051.09), len: 437 aa. mce3F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), similar to Mycobacterium tuberculosis proteins O07418|Rv0174|MTCI28.14|mce1F (515 aa), O07784|Rv0594|MTCY19H5.28c|mce2F (516 aa); etc. Also highly similar to others e.g. NP_302661.1|NC_002677 putative secreted protein from Mycobacterium leprae (516 aa); CAC12793.1|AL445327 putative secreted protein from Streptomyces coelicolor (433 aa); etc. Contains a possible N-terminal signal sequence or membrane anchor. TBparse score is 0.881. The transcription of this CDS seems negatively regulated by the product of Rv1963c|mce3R (see Santangelo et al., 2002)." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE3F" /protein_id="NP_216487.1" /db_xref="GI:15609108" /db_xref="GeneID:885938" /translation="MLHLPRRVIVQLAVFTVIAVGVLAITFLHFVRLPAMLFGVGRYT VTMELVEAGGLYRTGNVTYRGFEVGRVAAVRLTDTGVQAVLALKSGIDIPSDLKAEVH SHTAIGETYVELLPRNAASPPLKNGDVIALADTSVPPDINDLLSAANTALEAIPHENL QTVIDESYTAVAGLGLELSRLIKGSAELAIDARANLDPLVALIDRAGPVLDSQTHTSD AIAAWAAQLAAVTGQLQTHDSAVGDLIDRGGPALGETRQLLERLQPTVPILLANLVSV GQVALTYHNDIEQLLVVFPMAIAAEQAGILANLNTKQAYRGQYLSFNLNLNLPPPCTT GFLPAQQRRIPTFEDYPDRPAGDLYCRVPQDSPFNVRGARNIPCETVPGKRAPTVKLC ESDAPYLPLNDGYNWKGDPNATVPGLGSGQDIPQTWQTMLLPPGS" gene 2216592..2217167 /locus_tag="Rv1972" /db_xref="GeneID:885937" CDS 2216592..2217167 /locus_tag="Rv1972" /function="UNKNOWN" /note="Rv1972, (MTV051.10), len: 191 aa. Probable conserved Mce-associated membrane protein. Probably part of mce3 operon. Similar to several Mycobacterium tuberculosis proteins e.g. Rv1363c|Z75555|MTCY02B10.27C (261 aa), FASTA scores: opt: 342, E(): 1.2e-15, (31.8% identity in 195 aa overlap); Rv1362c, Rv0177 (near Mce operon 1), etc. Has hydrophobic stretch at aa 20-40. TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="mce associated membrane protein" /protein_id="NP_216488.1" /db_xref="GI:15609109" /db_xref="GeneID:885937" /translation="MSVAVDSDAEDDAVSEIAEAAGVSPAPAKPSMSAPRRMLLFGLV VVVALAVLLCCWGFRVQRARHAQDQRGHFLQAARQCALNLTTIDWRNAEADVRRILDG ATGEFYNDFAQRSQPFVEVLRHAKASTVGTITEAGLQTQTADTAQALVAVSVQTSNAG EADPVPRAWRMRITVQRVGDRVKVSDVGFVP" gene 2217164..2217646 /locus_tag="Rv1973" /db_xref="GeneID:885936" CDS 2217164..2217646 /locus_tag="Rv1973" /function="UNKNOWN" /note="Rv1973, (MTV051.11), len: 160 aa. Possible conserved Mce-associated membrane protein. Probably part of mce3 operon. Similar to several other proteins from Mycobacterium tuberculosis e.g. Rv1362c|Z75555|MTCY02B10.26C (220 aa), FASTA scores: opt: 378, E(): 2.8e-19, (50.0% identity in 128 aa overlap); Rv1363c; Rv0177 (near Mce operon 1); etc. Contains possible N-terminal signal sequence or membrane anchor. TBparse score is 0.863." /codon_start=1 /transl_table=11 /product="MCE associated membrane protein" /protein_id="NP_216489.1" /db_xref="GI:15609110" /db_xref="GeneID:885936" /translation="MSWSRVIAYGLLPGLALALTCGAGLLKWQDGAVRDAAVARAESV RAATDGTTALLSYRPDTVQHDLESARSRLTGTFLDAYTQLTHDVVIPGAQQKQISAVA TVAAAASVSTSADRAVVLLFVNQTITVGKDAPTTAASSVRVTLDNINGRWLISQFEPI" gene 2217659..2218036 /locus_tag="Rv1974" /db_xref="GeneID:885935" CDS 2217659..2218036 /locus_tag="Rv1974" /function="UNKNOWN" /note="Rv1974, (MTV051.12), len: 125 aa. Probable conserved membrane protein, weakly similar to other Mycobacterium tuberculosis proteins e.g. Rv1271c|Z77137|MTCY50.11 (113 aa), FASTA scores: opt: 98, E(): 1.4, (24.5% identity in 110 aa overlap); Rv1804c; Rv1690. Has possible signal peptide or transmembrane stretch from aa 12-30. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216490.1" /db_xref="GI:15609111" /db_xref="GeneID:885935" /translation="MQRQSLMPQQTLAAGVFVGALLCGVVTAAVPPHARADVVAYLVN VTVRPGYNFANADAALSYGHGLCEKVSRGRPYAQIIADVKADFDTRDQYQASYLLSQA VNELCPALIWQLRNSAVDNRRSG" gene 2218052..2218717 /locus_tag="Rv1975" /db_xref="GeneID:885934" CDS 2218052..2218717 /locus_tag="Rv1975" /function="UNKNOWN" /note="Rv1975, (MTV051.13), len: 221 aa. Conserved hypothetical protein, showing some similarity to AJ251435 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (193 aa). TBparse score is 0.919." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216491.1" /db_xref="GI:15609112" /db_xref="GeneID:885934" /translation="MSRRASATCALSATTAVAIMAAPAARADDKRLNDGVVANVYTVQ RQAGCTNDVTINPQLQLAAQWHTLDLLNNRHLNDDTGSDGSTPQDRAHAAGFRGKVAE TVAINPAVAISGIELINQWYYNPAFFAIMSDCANTQIGVWSENSPDRTVVVAVYGQPD RPSAMPPRGAVTGPPSPVAAQENVPIDPSPDYDASDEIEYGINWLPWILRGVYPPPAM PPQ" gene complement(2218844..2219251) /locus_tag="Rv1976c" /db_xref="GeneID:885933" CDS complement(2218844..2219251) /locus_tag="Rv1976c" /function="UNKNOWN" /note="Rv1976c, (MTV051.14), len: 135 aa. Conserved hypothetical protein, similar to SC1C3.03c|AL023702 hypothetical protein from Streptomyces coelicolor (125 aa), FASTA score: opt: 223, E(): 3.3e-08, (39.6% identity in 111 aa overlap). TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216492.1" /db_xref="GI:15609113" /db_xref="GeneID:885933" /translation="MRWIVDGMNVIGSRPDGWWRDRHRAMVMLVERLEGWAITKARGD DVTVVFERPPSTAIPSSVVEVAHAPKAAANSADDEIVRLVRSGAQPQEIRVVTSDKAL TDRVRDLGAAVYPAERFRDLIDPRGSNAARRTQ" gene 2219754..2220800 /locus_tag="Rv1977" /db_xref="GeneID:885931" CDS 2219754..2220800 /locus_tag="Rv1977" /function="UNKNOWN" /note="Rv1977, (MTV051.15), len: 348 aa. Conserved hypothetical protein, similar to SCC123.20|AL136518 hypothetical protein from Streptomyces coelicolor (402 aa), BLASTP scores: Score = 311 bits (789), Expect = 5e-84 Identities = 156/316 ( 49%), Positives = 212/316 (66%); and PCC6803|D90907_31 Synechocystis sp. (303 aa), FASTA scores: opt: 533, E(): 4.7e- 29, (38.5% identity in 275 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216493.1" /db_xref="GI:15609114" /db_xref="GeneID:885931" /translation="MSQTPATTRKTFPEISSRAWEHPADRTALSALRRLKGFDQILKL MSGMLRERQHRLLYLASAARVGPRQFADLDALLDECVDVLDASAKPELYVMQSPIADA FTIGMGKPFTVITSGLYDLVTHDEMRFVMGHELGHALSGHAVYRTMMMHLLRLARSFG VLPVGGWALRAIVAALLEWQRKSELSGDRAGLLCAQDLDTALRVEMKLAGGCRLDKLD SEAFLAQAREYETSGDMRDGVLKLLNLELQTHPFSVLRAAALTHWVDTGGYAKVIAGE YPRRADDGNAKFADDLGAAARYYRDGFDQSNDPLIKGIRDGFGGIVEGVGRAASNAAD SLGRKITEWRQPSK" misc_feature 2220141..2220170 /locus_tag="Rv1977" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene 2220908..2221756 /locus_tag="Rv1978" /db_xref="GeneID:885928" CDS 2220908..2221756 /locus_tag="Rv1978" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1978, (MTV051.16), len: 282 aa. Conserved hypothetical protein, similar to several hypothetical proteins and methyltransferases e.g. X86780|SHGCPIR.15 methyltransferase from S. hygroscopicus (211 aa), FASTA scores: opt: 151, E(): 0.0072, (30.6% identity in 121 aa overlap). TBparse score is 0.933." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216494.1" /db_xref="GI:15609115" /db_xref="GeneID:885928" /translation="MGEANIREQAIATMPRGGPDASWLDRRFQTDALEYLDRDDVPDE VKQKIIGVLDRVGTLTNLHEKYARIALKLVSDIPNPRILELGAGHGKLSAKILELHPT ATVTISDLDPTSVANIAAGELGTHPRARTQVIDATAIDGHDHSYDLAVFALAFHHLPP TVACKAIAEATRVGKRFLIIDLKRQKPLSFTLSSVLLLPLHLLLLPWSSMRSSMHDGF ISALRAYSPSALQTLARAADPGMQVEILPAPTRLFPPSLAVVFSRSSSAPTESSECSA DRQPGE" gene complement(2221719..2223164) /locus_tag="Rv1979c" /db_xref="GeneID:885819" CDS complement(2221719..2223164) /locus_tag="Rv1979c" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF AMINO ACID ACROSS THE MEMBRANE." /note="(MTCY39.40-MTV051.17c), len: 481 aa. Possible permease, APC family possibly involved in transport of amino acid, showing some similarity to other permeases. Also similar to MTCY39.19 from Mycobacterium tuberculosis (28.2% identity in 277 aa overlap). Contains PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site." /codon_start=1 /transl_table=11 /product="permease" /protein_id="NP_216495.1" /db_xref="GI:15609116" /db_xref="GeneID:885819" /translation="MVGPRTRGYAIHKLGFCSVVMLGINSIIGAGIFLTPGEVIGLAG PFAPMAYVLAGIFAGVVAIVFATAARYVRTNGASYAYTTAAFGRRIGIYVGVTHAITA SIAWGVLASFFVSTLLRVAFPDKAWADAEQLFSVKTLTFLGFIGVLLAINLFGNRAIK WANGTSTVGKAFALSAFIVGGLWIITTQHVNNYATAWSAYSATPYSLLGVAEIGKGTF SSMALATIVALYAFTGFESIANAAEEMDAPDRNLPRAIPIAIFSVGAIYLLTLTVAML LGSNKIAASDDTVKLAAAIGNATFRTIIVVGALISMFGINVAASFGAPRLWTALADSG VLPTRLSRKNQYDVPMVSFAITASLALAFPLALRFDNLHLTGLAVIARFVQFIIVPIA LIALARSQAVEHAAVRRNAFTDKVLPLVAIVVSVGLAVSYDYRCIFLVRGGPNYFSIA LIVITFVVVPAMAYLHYYRIIRRVGDRPSTR" misc_feature complement(2222637..2222666) /locus_tag="Rv1979c" /note="PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site" gene complement(2223343..2224029) /gene="mpt64" /locus_tag="Rv1980c" /db_xref="GeneID:885925" CDS complement(2223343..2224029) /gene="mpt64" /locus_tag="Rv1980c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1980c, (MT2032, MTCY39.39), len: 228 aa. mpt64 (alternate gene name: mpb64), immunogenic protein (alternate gene name: mpb64) (see citations below), identical to MPT64|MPB64 from Mycobacterium bovis (228 aa). Similar to Rv3036c|MTV012.51c from Mycobacterium tuberculosis. Exported protein containing a N-terminal signal sequence: see notes below about proteomics.; mpb64" /codon_start=1 /transl_table=11 /product="immunogenic protein MPT64 (antigen MPT64/MPB64)" /protein_id="NP_216496.1" /db_xref="GI:15609117" /db_xref="GeneID:885925" /translation="MRIKIFMLVTAVVLLCCSGVATAAPKTYCEELKGTDTGQACQIQ MSDPAYNINISLPSYYPDQKSLENYIAQTRDKFLSAATSSTPREAPYELNITSATYQS AIPPRGTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITYDTLWQADTDPLPVVFP IVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFFFNPGELLPEAAGPTQVLV PRSAIDSMLA" gene complement(2224220..2225188) /gene="nrdF1" /locus_tag="Rv1981c" /db_xref="GeneID:885923" CDS complement(2224220..2225188) /gene="nrdF1" /locus_tag="Rv1981c" /EC_number="1.17.4.1" /function="INVOLVED IN THE DNA REPLICATION PATHWAY. CATALYZES THE BIOSYNTHESIS OF DEOXYRIBONUCLEOTIDES FROM THE CORRESPONDING RIBONUCLEOTIDES, PRECURSORS THAT ARE NECESSARY FOR DNA SYNTHESIS [CATALYTIC ACTIVITY: 2'-DEOXYRIBONUCLEOSIDE DIPHOSPHATE + OXIDIZED THIOREDOXIN + H(2)O = RIBONUCLEOSIDE DIPHOSPHATE + REDUCED THIOREDOXIN]." /experiment="experimental evidence, no additional details recorded" /note="B2 or R2 protein; type 1b enzyme; catalyzes the rate-limiting step in dNTP synthesis; converts nucleotides to deoxynucleotides; forms a homodimer and then a multimeric complex with NrdE" /codon_start=1 /transl_table=11 /product="ribonucleotide-diphosphate reductase subunit beta" /protein_id="YP_177853.1" /db_xref="GI:57116932" /db_xref="GeneID:885923" /translation="MTGKLVERVHAINWNRLLDAKDLQVWERLTGNFWLPEKIPLSND LASWQTLSSTEQQTTIRVFTGLTLLDTAQATVGAVAMIDDAVTPHEEAVLTNMAFMES VHAKSYSSIFSTLCSTKQIDDAFDWSEQNPYLQRKAQIIVDYYRGDDALKRKASSVML ESFLFYSGFYLPMYWSSRGKLTNTADLIRLIIRDEAVHGYYIGYKCQRGLADLTDAER ADHREYTCELLHTLYANEIDYAHDLYDELGWTDDVLPYMRYNANKALANLGYQPAFDR DTCQVNPAVRAALDPGAGENHDFFSGSGSSYVMGTHQPTTDTDWDF" misc_feature complement(2224841..2224888) /gene="nrdF1" /locus_tag="Rv1981c" /note="PS00368 Ribonucleotide reductase small subunit signature" gene complement(2225413..2225832) /locus_tag="Rv1982c" /db_xref="GeneID:885816" CDS complement(2225413..2225832) /locus_tag="Rv1982c" /function="UNKNOWN" /note="Rv1982c, (MTCY39.37), len: 139 aa. Conserved hypothetical protein. BELONGS TO THE UPF0110 FAMILY. Similar to Rv0624|Z92772|MTY20H10.05 from Mycobacterium tuberculosis (131 aa), FASTA scores: opt: 288, E(): 4.1e-14, (40.2% identity in 127 aa overlap); also similar to Rv0624, Rv2759c, and Rv0609" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216498.1" /db_xref="GI:15609119" /db_xref="GeneID:885816" /translation="MIVDTSAVVALVQGERPHATLVAAALAGAHSPVMSAPTVAECLI VLTARHGPVARTIFERLRSEIGLSVSSFTAEHAAATQRAFLRYGKGRHRAALNFGDCM TYATAQLGHQPLLAVGNDFPQTDLEFRGVVGYWPGVA" gene 2226244..2227920 /gene="PE_PGRS35" /locus_tag="Rv1983" /db_xref="GeneID:885921" CDS 2226244..2227920 /gene="PE_PGRS35" /locus_tag="Rv1983" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1983, (MTCY39.36c), len: 558 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002). Similar to other PE proteins e.g. Rv0977, etc. Contains PS00141 Eukaryotic and viral aspartyl proteases active site." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177854.1" /db_xref="GI:57116933" /db_xref="GeneID:885921" /translation="MSFLVVVPEFLTSAAADVENIGSTLRAANAAAAASTTALAAAGA DEVSAAVAALFARFGQEYQAVSAQASAFHQQFVQTLNSASGSYAAAEATIASQLQTAQ HDLLGAVNAPTETLLGRPLIGDGAPGTATSPNGGAGGLLYGNGGNGYSATASGVGGGA GGSAGLIGNGGAGGAGGPNAPGGAGGNGGWLLGNGGIGGPGGASSIPGMSGGAGGTGG AAGLLGWGANGGAGGLGDGVGVDRGTGGAGGRGGLLYGGYGVSGPGGDGRTVPLEIIH VTEPTVHANVNGGPTSTILVDTGSAGLVVSPEDVGGILGVLHMGLPTGLSISGYSGGL YYIFATYTTTVDFGNGIVTAPTAVNVVLLSIPTSPFAISTYFSALLADPTTTPFEAYF GAVGVDGVLGVGPNAVGPGPSIPTMALPGDLNQGVLIDAPAGELVFGPNPLPAPNVEV VGSPITTLYVKIDGGTPIPVPSIIDSGGVTGTIPSYVIGSGTLPANTNIEVYTSPGGD RLYAFNTNDYRPTVISSGLMNTGFLPFRFQPVYIDYSPSGIGTTVFDHPA" misc_feature 2227123..2227158 /gene="PE_PGRS35" /locus_tag="Rv1983" /note="PS00141 Eukaryotic and viral aspartyl proteases active site" gene complement(2227908..2228561) /gene="cfp21" /locus_tag="Rv1984c" /db_xref="GeneID:885813" CDS complement(2227908..2228561) /gene="cfp21" /locus_tag="Rv1984c" /EC_number="3.1.1.-" /function="HYDROLYZES CUTIN." /experiment="experimental evidence, no additional details recorded" /note="Rv1984c, (MTCY39.35), len: 217 aa. cfp21, probable cutinase precursor with N-terminal signal sequence (EC 3.1.1.-), similar to P41744|CUTI_ALTBR cutinase precursor from Alternaria brassicicola (209 aa), FASTA scores: opt: 283, E(): 2.2e-11, (32.6% identity in 193 aa overlap). Also similar to Mycobacterium tuberculosis proteins e.g. Rv3452, Rv3451, Rv2301, Rv1758, Rv3724. BELONGS TO THE CUTINASE FAMILY." /codon_start=1 /transl_table=11 /product="cutinase precursor CFP21" /protein_id="NP_216500.1" /db_xref="GI:15609121" /db_xref="GeneID:885813" /translation="MTPRSLVRIVGVVVATTLALVSAPAGGRAAHADPCSDIAVVFAR GTHQASGLGDVGEAFVDSLTSQVGGRSIGVYAVNYPASDDYRASASNGSDDASAHIQR TVASCPNTRIVLGGYSQGATVIDLSTSAMPPAVADHVAAVALFGEPSSGFSSMLWGGG SLPTIGPLYSSKTINLCAPDDPICTGGGNIMAHVSYVQSGMTSQAATFAANRLDHAG" misc_feature complement(2228202..2228219) /gene="cfp21" /locus_tag="Rv1984c" /note="PS00155 Cutinase, serine active site" gene complement(2228991..2229902) /locus_tag="Rv1985c" /db_xref="GeneID:885818" CDS complement(2228991..2229902) /locus_tag="Rv1985c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="specific inhibitor of chromosomal initiation of replication in vitro; binds the three 13-mers in the origin (oriC) to block initiation of replication; also controls genes involved in arginine transport" /codon_start=1 /transl_table=11 /product="chromosome replication initiation inhibitor protein" /protein_id="NP_216501.1" /db_xref="GI:15609122" /db_xref="GeneID:885818" /translation="MVDPQLDGPQLAALAAVVELGSFDAAAERLHVTPSAVSQRIKSL EQQVGQVLVVREKPCRATTAGIPLLRLAAQTALLESEALAEMGGNASLKRTRITIAVN ADSMATWFSAVFDGLGDVLLDVRIEDQDHSARLLREGVAMGAVTTERNPVPGCRVHPL GEMRYLPVASRPFVQRHLSDGFTAAAAAKAPSLAWNRDDGLQDMLVRKAFRRAITRPT HFVPTTEGFTAAARAGLGWGMFPEKLAASPLADGSFVRVCDIHLDVPLYWQCWKLDSP IIARITDTVRAAASGLYRGQQRRRRPG" misc_feature complement(2229759..2229836) /locus_tag="Rv1985c" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene 2230011..2230610 /locus_tag="Rv1986" /db_xref="GeneID:885635" CDS 2230011..2230610 /locus_tag="Rv1986" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF LYSINE ACROSS THE MEMBRANE." /note="Rv1986, (MTCY39.33c), len: 199 aa. Probable conserved integral membrane protein, LysE family possibly involved in transport of Lysine, similar to P11667|YGGA_ECOLI hypothetical 23.2 kDa protein in sbm-fba intergenic region (211 aa), FASTA scores: opt: 379, E(): 1.5e-19, (37.3% identity in 185 aa overlap); and Q11154|Rv0488 HYPOTHETICAL 20.9 kDa PROTEIN from M. tuberculosis (201 aa), FASTA scores: opt: 784, E(): 0, (63.4% identity in 186 aa overlap). BELONGS TO THE LYSE/YGGA FAMILY." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216502.1" /db_xref="GI:15609123" /db_xref="GeneID:885635" /translation="MNSPLVVGFLACFTLIAAIGAQNAFVLRQGIQREHVLPVVALCT VSDIVLIAAGIAGFGALIGAHPRALNVVKFGGAAFLIGYGLLAARRAWRPVALIPSGA TPVRLAEVLVTCAAFTFLNPHVYLDTVVLLGALANEHSDQRWLFGLGAVTASAVWFAT LGFGAGRLRGLFTNPGSWRILDGLIAVMMVALGISLTVT" gene 2231026..2231454 /locus_tag="Rv1987" /db_xref="GeneID:885815" CDS 2231026..2231454 /locus_tag="Rv1987" /EC_number="3.2.1.14" /function="HYDROLYSIS OF CHITIN" /note="Rv1987, (MTCY39.32c), len: 142 aa. Possible chitinase (EC 3.2.1.14), similar to several e.g. P36909|CHIT_STRLI chitinase c precursor (619 aa) FASTA scores, opt: 324, E(): 1.2e-14, (39.5% identity in 129 aa overlap)." /codon_start=1 /transl_table=11 /product="chitinase" /protein_id="NP_216503.1" /db_xref="GI:15609124" /db_xref="GeneID:885815" /translation="MAGLNIYVRRWRTALHATVSALIVAILGLAITPVASAATARATL SVTSTWQTGFIARFTITNSSTAPLTDWKLEFDLPAGESVLHTWNSTVARSGTHYVLSP ANWNRIIAPGGSATGGLRGGLTGSYSPPSSCLLNGQYPCT" gene 2231680..2232219 /locus_tag="Rv1988" /db_xref="GeneID:885632" CDS 2231680..2232219 /locus_tag="Rv1988" /EC_number="2.1.1.-" /function="THOUGHT TO CAUSE METHYLATION" /note="Rv1988, (MTCY39.31c), len: 179 aa. Probable methyltransferase (EC 2.1.1.-), similar to ERME_SACER|P07287 rrna adenine n-6-methyltransferase (370 aa), FASTA scores: opt: 259, E(): 2e-11, (35.1% identity in 171 aa overlap); contains PS00092 N-6 Adenine-specific DNA methylases signature. Also similar to Mycobacterium tuberculosis Rv1010 ksgA 16S rRNA dimethyltransferase." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="NP_216504.1" /db_xref="GI:15609125" /db_xref="GeneID:885632" /translation="MSALGRSRRAWGWHRLHDEWAARVVSAAAVRPGELVFDIGAGEG ALTAHLVRAGARVVAVELHPRRVGVLRERFPGITVVHADAASIRLPGRPFRVVANPPY GISSRLLRTLLAPNSGLVAADLVLQRALVCKFASRNARRFTLTVGLMLPRRAFLPPPH VDSAVLVVRRRKCGDWQGR" misc_feature 2231965..2231985 /locus_tag="Rv1988" /note="PS00092 N-6 Adenine-specific DNA methylases signature" gene complement(2232739..2233299) /locus_tag="Rv1989c" /db_xref="GeneID:885810" CDS complement(2232739..2233299) /locus_tag="Rv1989c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1989c, (MTCY39.30), len: 186 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216505.1" /db_xref="GI:15609126" /db_xref="GeneID:885810" /translation="MSDALDEGLVQRIDARGTIEWSETCYRYTGAHRDALSGEGARRF GGRWNPPLLFPAIYLADSAQACMVEVERAAQAASTTAEKMLEAAYRLHTIDVTDLAVL DLTTPQAREAVGLENDDIYGDDWSGCQAVGHAAWFLHMQGVLVPAAGGVGLVVTAYEQ RTRPGQLQLRQSVDLTPALYQELRAT" gene complement(2233296..2233637) /locus_tag="Rv1990c" /db_xref="GeneID:885422" CDS complement(2233296..2233637) /locus_tag="Rv1990c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1990c, (MTCY39.29), len: 113 aa. Probable transcriptional regulatory protein, similar to Mycobacterium tuberculosis Rv3188|AL021646|MTV014.32 (115 aa), FASTA scores: opt: 184, E(): 8.2e-07, (28.4% identity in 109 aa overlap). Contains probable helix-turn-helix motif at aa 20-44 (+4.22 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216506.1" /db_xref="GI:15609127" /db_xref="GeneID:885422" /translation="MGVNVLASTVSGAIERLGLTYEEVGDIVDASPRSVARWTAGQVV PQRLNKQRLIELAYVADALAEVLPRDQANVWMFSPNRLLEHRKPADLVRDGEYQRVLA LIDAMAEGVFV" gene complement(2233881..2234216) /locus_tag="Rv1990A" /db_xref="GeneID:3205104" CDS complement(2233881..2234216) /locus_tag="Rv1990A" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv1990A, len: 111 aa. Possible dehydrogenase (fragment) (EC 1.-.-.-), similar to N-terminal part of several dehydrogenases and hypothetical proteins, e.g. Rv2750|MTV002.15|AL008967 from Mycobacterium tuberculosis (272 aa), FASTA scores: opt: 151, E(): 0.0045, (47.45% identity in 78 aa overlap), but lacks C-terminal part. Maybe a pseudogene. Also similar to U17129|RSU17129_7 putative short-chain alcohol dehydrogenase from Rhodococcus erythropolis (275 aa), FASTA scores: opt: 142, E(): 0.018, (54.15% identity in 48 aa overlap)." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="YP_177656.1" /db_xref="GI:57116934" /db_xref="GeneID:3205104" /translation="MGRLEGKVAFITGVARGQGRSHAVRLADGQARALGKVDVEACGA LVGEVEVWGRDVRDDRRVFVESPADEFGACRRVARQGIRVVGLPVSQRELVEPEAGCA ARRSAAGSQ" gene complement(2234305..2234649) /locus_tag="Rv1991c" /db_xref="GeneID:885634" CDS complement(2234305..2234649) /locus_tag="Rv1991c" /function="UNKNOWN" /note="Rv1991c, (MTCY39.28), len: 114 aa. Conserved hypothetical protein, showing some similarity to P13976|PEMK_ECOLI pemk protein (133 aa), FASTA scores: opt: 113, E(): 0.043, (29.2% identity in 113 aa overlap); and P96622|YDCE PROTEIN from Bacillus subtilis (116 aa), FASTA scores: opt: 227, E(): 6.9e-09, (37.4% identity in 115 aa overlap). Also similar to Mycobacterium tuberculosis Rv2801c, and Rv0659c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216507.1" /db_xref="GI:15609128" /db_xref="GeneID:885634" /translation="MVISRAEIYWADLGPPSGSQPAKRRPVLVIQSDPYNASRLATVI AAVITSNTALAAMPGNVFLPATTTRLPRDSVVNVTAIVTLNKTDLTDRVGEVPASLMH EVDRGLRRVLDL" gene complement(2234991..2237306) /gene="ctpG" /locus_tag="Rv1992c" /db_xref="GeneID:888914" CDS complement(2234991..2237306) /gene="ctpG" /locus_tag="Rv1992c" /EC_number="3.6.3.-" /function="METAL CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A UNDETERMINATED METAL CATION WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED METAL CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED METAL CATION(OUT)]." /note="Rv1992c, (MTCY39.27), len: 771 aa. Probable ctpG, metal cation-transporting P-type ATPase G (transmembrane protein) (EC 3.6.3.-), similar to others, especially cadmium-transporting ATPases (EC 3.6.3.3), e.g. NP_244904.1|NC_002570 cadmium-transporting ATPase from Bacillus halodurans (707 aa); P30336|CADA_BACFI PROBABLE CADMIUM-TRANSPORTING ATPASE from Bacillus firmus (723 aa); BAB47609.1|AB037671 cadmium resistance protein B from Staphylococcus aureus (804 aa); 3121832|Q60048|CADA_LISMO PROBABLE CADMIUM-TRANSPORTING ATPase from Listeria monocytogenes (707 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv0969|MTCY10D7.05c|ctpV PUTATIVE CATION TRANSPORTER P-TYPE ATPASE V (770 aa); Rv1469; Rv0092; etc. Contains PS00435 Peroxidases proximal heme-ligand signature and PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB." /codon_start=1 /transl_table=11 /product="metal cation transporter P-type ATPase G CtpG" /protein_id="NP_216508.1" /db_xref="GI:15609129" /db_xref="GeneID:888914" /translation="MTTVVDAEVQLTVVSDAAGRMRVQATGFQFDAGRAVAIEDTVGK VAGVQAVHAYPRTASIVIWYSRAICDTAAILSAIIDAETVPAAAVPAYASRSASNRKA GVVQKIIDWSTRTLSGVRRDVAAQPSGETSDACCDGEDNEDREPEQLWQVAKLRRAAF SGVLLTASLVAAWAYPLWPVVLGLKALALAVGASTFVPSSLKRLAEGRVGVGTLMTIA ALGAVALGELGEAATLAFLFSISEGLEEYATARTRRGLRALLSLVPDQATVLREGTET IVASTELHVGDQMIVKPGERLATDGIIRAGRTALDVSAITGESVPVEVGPGDEVFAGS INGLGVLQVGVTATAANNSLARIVHIVEAEQVRKGASQRLADCIARPLVPSIMIAAAL IAGTGSVLGNPLVWIERALVVLVAAAPCALAIAVPVTVVASIGAASRLGVLIKGGAAL ETLGTIRAVALDKTGTLTANRPVVIDVATTNGATREEVLAVAAALEARSEHPLAVAVL AATQATTAASDVQAVPGAGLIGRLDGRVVRLGRPGWLDAAELADHVACMQQAGATAVL VERDQQLLGAIAVRDELRPEAAEVVAGLRTGGYQVTMLTGDNHATAAALAAQAGIEQV HAELRPEDKAHLVAQLRARQPTAMVGDGVNDAPALAAADLGIAMGAMGTDVAIETADV ALMGQDLRHLPQALDHARRSRQIMVQNVGLSLSIITVLMPLALFGILGLAAVVLVHEF TEVIVIANGVRAGRIKPLAGPPKTPDRTIPG" misc_feature complement(2235903..2235923) /gene="ctpG" /locus_tag="Rv1992c" /note="PS00154 E1-E2 ATPases phosphorylation site" misc_feature complement(2237154..2237186) /gene="ctpG" /locus_tag="Rv1992c" /note="PS00435 Peroxidases proximal heme-ligand signature" gene complement(2237303..2237575) /locus_tag="Rv1993c" /db_xref="GeneID:885631" CDS complement(2237303..2237575) /locus_tag="Rv1993c" /function="UNKNOWN" /note="Rv1993c, (MTCY39.26), len: 90 aa. Conserved hypothetical protein, very similar to Rv3269|Z92771|MTCY71.09 hypothetical protein from Mycobacterium tuberculosis (93 aa), FASTA results: opt: 309, E(): 3.2e-16, (63.3% identity in 79 aa overlap). Also similar to Rv0968 (98 aa) (51.1% identity in 94 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216509.1" /db_xref="GI:15609130" /db_xref="GeneID:885631" /translation="MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVM EWGLRGTRRAEAAAESARLTVADVVAEARGRIGEEAPLPAGARVDE" gene complement(2237628..2237984) /locus_tag="Rv1994c" /db_xref="GeneID:888889" CDS complement(2237628..2237984) /locus_tag="Rv1994c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv1994c, (MTCY39.25), len: 118 aa. Probable transcription regulator, similar to MERR_STRLI|P30346 probable mercury resistance operon repressor (125 aa), FASTA scores: opt: 199, E(): 3e-08, (36.3% identity in 102 aa overlap). Contains probable helix-turn-helix motif at aa 36-57 (+3.78 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216510.1" /db_xref="GI:15609131" /db_xref="GeneID:888889" /translation="MLTCEMRESALARLGRALADPTRCRILVALLDGVCYPGQLAAHL GLTRSNVSNHLSCLRGCGLVVATYEGRQVRYALADSHLARALGELVQVVLAVDTDQPC VAERAASGEAVEMTGS" gene 2238141..2238908 /locus_tag="Rv1995" /db_xref="GeneID:888917" CDS 2238141..2238908 /locus_tag="Rv1995" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1995, (MTCY39.24c), len: 255 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216511.1" /db_xref="GI:15609132" /db_xref="GeneID:888917" /translation="MVASGAATKGVTVMKQTPPAAVGRRHLLEISASAAGVIALSACS GSPPEPGKGRPDTTPEQEVPVTAPEDLMREHGVLKRILLIYREGIRRLQADDQSPAPA LNESAQIIRRFIEDYHGQLEEQYVFPKLEQAGKLTDITSVLRTQHQRGRVLTDRVLAA TTAAAAFDQPARDTLAQDMAAYIRMFEPHEAREDTVVFPALRDVMSAVEFRDMAETFE DEEHRRFGEAGFQSVVDKVADIEKSLGIYDLSQFTPS" gene 2239004..2239957 /locus_tag="Rv1996" /db_xref="GeneID:888863" CDS 2239004..2239957 /locus_tag="Rv1996" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1996, (MTCY39.23c), len: 317 aa. Conserved hypothetical protein. Similar to several Mycobacterium tuberculosis hypothetical proteins e.g. Rv2005c|Q10851|YK05_MYCTU (295 aa), FASTA scores: opt: 775, E(): 0, (50.3% identity in 316 aa overlap); Rv2026c (294 aa) (47.9% identity in 311 aa overlap); and Rv2623, etc. Also similar to SCJ1.30c|AL109962 hypothetical protein from Streptomyces coelicolor (328 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216512.1" /db_xref="GI:15609133" /db_xref="GeneID:888863" /translation="MSAQQTNLGIVVGVDGSPCSHTAVEWAARDAQMRNVALRVVQVV PPVITAPEGWAFEYSRFQEAQKREIVEHSYLVAQAHQIVEQAHKVALEASSSGRAAQI TGEVLHGQIVPTLANISRQVAMVVLGYRGQGAVAGALLGSVSSSLVRHAHGPVAVIPE EPRPARPPHAPVVVGIDGSPTSGLAAEIAFDEASRRGVDLVALHAWSDMGPLDFPRLN WAPIEWRNLEDEQEKMLARRLSGWQDRYPDVVVHKVVVCDRPAPRLLELAQTAQLVVV GSHGRGGFPGMHLGSVSRAVVNSGQAPVIVARIPQDPAVPA" gene 2240159..2242876 /gene="ctpF" /locus_tag="Rv1997" /db_xref="GeneID:888867" CDS 2240159..2242876 /gene="ctpF" /locus_tag="Rv1997" /EC_number="3.6.3.-" /function="METAL CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A UNDETERMINATED METAL CATION WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED METAL CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED METAL CATION(OUT)]." /experiment="experimental evidence, no additional details recorded" /note="Rv1997, (MTCY39.22c, MTCY39.21c), len: 905 aa. Probable ctpF, metal cation-transporting P-type ATPase F (transmembrane protein) (EC 3.6.3.-), highly similar to others e.g. NP_250120.1|NC_002516 probable cation-transporting P-type ATPase from Pseudomonas aeruginosa (902 aa); NP_441217.1|NC_000911 cation-transporting ATPase (E1-E2 ATPase) from Synechocystis sp. strain PCC 6803 (905 aa); NP_404093.1|NC_003143 putative cation-transporting P-type ATPase from Yersinia pestis (908 aa); P37367|ATA1_SYNY3 cation-transporting ATPase pma1 from Synechocystis sp. (915 aa), FASTA scores: opt: 2392, E(): 0, (46.5% identity in 852 aa overlap); etc. Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB. Was frame-shifted in original cosmid sequence." /codon_start=1 /transl_table=11 /product="metal cation transporter P-type ATPase A CtpF" /protein_id="NP_216513.1" /db_xref="GI:15609134" /db_xref="GeneID:888867" /translation="MSASVSATTAHHGLPAHEVVLLLESDPYHGLSDGEAAQRLERFG PNTLAVVTRASLLARILRQFHHPLIYVLLVAGTITAGLKEFVDAAVIFGVVVINAIVG FIQESKAEAALQGLRSMVHTHAKVVREGHEHTMPSEELVPGDLVLLAAGDKVPADLRL VRQTGLSVNESALTGESTPVHKDEVALPEGTPVADRRNIAYSGTLVTAGHGAGIVVAT GAETELGEIHRLVGAAEVVATPLTAKLAWFSKFLTIAILGLAALTFGVGLLRRQDAVE TFTAAIALAVGAIPEGLPTAVTITLAIGMARMAKRRAVIRRLPAVETLGSTTVICADK TGTLTENQMTVQSIWTPHGEIRATGTGYAPDVLLCDTDDAPVPVNANAALRWSLLAGA CSNDAALVRDGTRWQIVGDPTEGAMLVVAAKAGFNPERLATTLPQVAAIPFSSERQYM ATLHRDGTDHVVLAKGAVERMLDLCGTEMGADGALRPLDRATVLRATEMLTSRGLRVL ATGMGAGAGTPDDFDENVIPGSLALTGLQAMSDPPRAAAASAVAACHSAGIAVKMITG DHAGTATAIATEVGLLDNTEPAAGSVLTGAELAALSADQYPEAVDTASVFARVSPEQK LRLVQALQARGHVVAMTGDGVNDAPALRQANIGVAMGRGGTEVAKDAADMVLTDDDFA TIEAAVEEGRGVFDNLTKFITWTLPTNLGEGLVILAAIAVGVALPILPTQILWINMTT AIALGLMLAFEPKEAGIMTRPPRDPDQPLLTGWLVRRTLLVSTLLVASAWWLFAWELD NGAGLHEARTAALNLFVVVEAFYLFSCRSLTRSAWRLGMFANRWIILGVSAQAIAQFA ITYLPAMNMVFDTAPIDIGVWVRIFAVATAITIVVATDTLLPRIRAQPP" misc_feature 2241155..2241175 /gene="ctpF" /locus_tag="Rv1997" /note="PS00154 E1-E2 ATPases phosphorylation site" gene complement(2242945..2243721) /locus_tag="Rv1998c" /db_xref="GeneID:888853" CDS complement(2242945..2243721) /locus_tag="Rv1998c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv1998c, (MTCY39.20), len: 258 aa. Conserved hypothetical protein, showing some similarity with other hypothetical proteins e.g. U82823|SEU82823.03 Saccharopolyspora erythraea (266 aa), FASTA results: opt: 654, E(): 0, (43.8% identity in 249 aa overlap); and AL034446|SC1A9.07 Streptomyces coelicolor (251 aa), FASTA scores: opt: 592, E(): 1.5e-31, (43.4% identity in 251 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216514.1" /db_xref="GI:15609135" /db_xref="GeneID:888853" /translation="MSFHDLHHQGVPFVLPNAWDVPSALAYLAEGFTAIGTTSFGVSS SGGHPDGHRATRGANIALAAALAPLQCYVSVDIEDGYSDEPDAIADYVAQLSTAGINI EDSSAEKLIDPALAAAKIVAIKQRNPEVFVNARVDTYWLRQHADTTSTIQRALRYVDA GADGVFVPLANDPDELAELTRNIPCPVNTLPVPGLTIADLGELGVARVSTGSVPYSAG LYAAAHAARAVSDGEQLPRSVPYAELQARLVDYENRTSTT" gene complement(2243816..2245138) /locus_tag="Rv1999c" /db_xref="GeneID:888881" CDS complement(2243816..2245138) /locus_tag="Rv1999c" /function="UNKNOWN. POSSIBLY TRANSPORTER INVOLVED IN TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY CATIONIC AMINO ACIDS) ACROSS THE MEMBRANE: SO RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv1999c, (MTCY39.19), len: 440 aa. Probable conserved integral membrane protein, possibly transporter of cationic amino acid, similar to many transporters, especially amino acid transporters, e.g. CAC08265.1|AL392146 putative amino acid transporter from Streptomyces coelicolor (414 aa); P39277|YJEH_ECOLI hypothetical 44.8 kDa protein from Escherichia coli (418 aa), FASTA scores, opt: 343, E(): 6.6e-15, (27.2% identity in 408 aa overlap); etc. Also similar to Rv1979c from Mycobacterium tuberculosis, FASTA score: (28.2% identity in 277 aa overlap); Rv2127, Rv0346c, Rv0522, etc. SEEMS TO BELONG TO THE APC FAMILY." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216515.1" /db_xref="GI:15609136" /db_xref="GeneID:888881" /translation="MRRPLDPRDIPDELRRRLGLLDAVVIGLGSMIGAGIFAALAPAA YAAGSGLLLGLAVAAVVAYCNAISSARLAARYPASGGTYVYGRMRLGDFWGYLAGWGF VVGKTASCAAMALTVGFYVWPAQAHAVAVAVVVALTAVNYAGIQKSAWLTRSIVAVVL VVLTAVVVAAYGSGAADPARLDIGVDAHVWGMLQAAGLLFFAFAGYARIATLGEEVRD PARTIPRAIPLALGITLAVYALVAVAVIAVLGPQRLARAAAPLSEAMRVAGVNWLIPV VQIGAAVAALGSLLALILGVSRTTLAMARDRHLPRWLAAVHPRFKVPFRAELVVGAVV AALAATADIRGAIGFSSFGVLVYYAIANASALTLGLDEGRPRRLIPLVGLIGCVVLAF ALPLSSVAAGAAVLGVGVAAYGVRRIITRRARQTDSGDTQRSGHPSAT" gene 2245209..2246822 /locus_tag="Rv2000" /db_xref="GeneID:888864" CDS 2245209..2246822 /locus_tag="Rv2000" /function="UNKNOWN" /note="Rv2000, (MTCY39.18c), len: 537 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216516.1" /db_xref="GI:15609137" /db_xref="GeneID:888864" /translation="MRPGFVGLGFGQWPVYVVRWPKLHLTPRQRKRVLHRRRLLTDRP ISLSQIPIRTGGPMNDPWPRPTQGPAKTIETDYLVIGAGAMGMAFTDTLITESGARVV MIDRACQPGGHWTTAYPFVRLHQPSAYYGVNSRALGNNTIDLVGWNQGLNELAPVGEI CAYFDAVLQQQLLPTGRVDYFPMSEYLGDGRFRTLAGTEYVVTVNRRIVDATYLRAVV PSMRPAPYSVAPGVDCVAPNELPKLGTRDRYVVVGAGKTGMDVCLWLLRNDVCPDKLT WIMPRDSWLIDRATLQPGPTFVRQFRESYGATLEAIGAATSTDDLFDRLETAGTLLRI DPSVRPSMYRCATVSHLELEQLRRIRDIVRMGHVQRIEPTTIVLDGGSVPATPTALYI DCTADGAPQRPAKPVFDADHLTLQAVRGCQQVFSAAFIAHVEFAYEDDAVKNELCTPI PHPDCDLDWMRLMHSDLGNFQRWLNDPDLTDWLSSARLNLLADLLPPLSHKPRVRERV VSMFQKRLGTAGDQLAKLLDAATATTEQR" gene 2246832..2247584 /locus_tag="Rv2001" /db_xref="GeneID:888880" CDS 2246832..2247584 /locus_tag="Rv2001" /function="UNKNOWN" /note="Rv2001, (MTCY39.17c), len: 250 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0466, AL021933|MTV038_10 (264 aa), FASTA scores: opt: 592, E():0, (38.0% identity in 263 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216517.1" /db_xref="GI:15609138" /db_xref="GeneID:888880" /translation="MHHNRDVDLALVERPSSGYVYTTGWRLATTDIDEHQQLRLDGVA RYIQEVGAEHLADAQLAEVHPHWIVLRTVIDVINPIELPSDITFHRWCAALSTRWCSM RVQLQGSAGGRIETEGFWICVNKDTLTPSRLTDDCIARFGSTTENHRLKWRPWLTGPN IDGTETPFPLRRTDIDPFEHVNNTIYWHGVHEILCQIPTLTAPYRAVLEYRSPIKSGE PLTIRYEQHDDVVRMHFVVGDDVRAAALLRRL" gene 2247660..2248442 /gene="fabG3" /locus_tag="Rv2002" /db_xref="GeneID:888857" CDS 2247660..2248442 /gene="fabG3" /locus_tag="Rv2002" /EC_number="1.1.1.53" /function="NOT REALLY KNOWN; THOUGHT TO BE INVOLVED IN LIPID BIOSYNTHESIS [CATALYTIC ACTIVITY: Androstan-3-alpha,17-beta-diol + NAD+ = 17-beta-hydroxyandrostan-3-one + NADH]." /experiment="experimental evidence, no additional details recorded" /note="Rv2002, (MTCY39.16c), len: 260 aa. Possible fabG3, 20-beta-hydroxysteroid dehydrogenase (EC 1.1.1.53), similar to e.g. 2BHD_STREX|P19992 20-beta-hydroxysteroid dehydrogenase (255 aa), FASTA scores: opt: 718, E(): 2e-38, (49.8% identity in 243 aa overlap), and many mycobacterial proteins. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="20-beta-hydroxysteroid dehydrogenase" /protein_id="NP_216518.1" /db_xref="GI:15609139" /db_xref="GeneID:888857" /translation="MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEG KAVAAELADAARYVHLDVTQPAQWTAAVDTAVTAFGGLHVLVNNAGILNIGTIEDYAL TEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVACHGYTATKFAV RGLTKSTALELGPSGIRVNSIHPGLVKTPMTDWVPEDIFQTALGRAAEPVEVSNLVVY LASDESSYSTGAEFVVDGGTVAGLAHNDFGAVEVSSQPEWVT" misc_feature 2248077..2248163 /gene="fabG3" /locus_tag="Rv2002" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(2248563..2249420) /locus_tag="Rv2003c" /db_xref="GeneID:888818" CDS complement(2248563..2249420) /locus_tag="Rv2003c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2003c, (MTCY39.14), len: 285 aa. Conserved hypothetical protein. Some similarity with Methanococcus jannaschii 67555|U67555_3 (205 aa), FASTA scores: opt: 357, E(): 3.2e-17, (33.8% identity in 204 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216519.1" /db_xref="GI:15609140" /db_xref="GeneID:888818" /translation="MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADC CWNQLAVTPDTRMPASSAAGRDAAAYDAWYDSPTGRPILATEVAALRPLIEVFAQPRL EIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVSRHFGAVLMAF TLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLYALRAARGQPGYRDARFYT AAELEQLLADSGFRVIARRCTLHQPPGLARYDIEAAHDGIQAGAGFVAISAVDQAHEP KDDHPLESE" gene complement(2249478..2250974) /locus_tag="Rv2004c" /db_xref="GeneID:888817" CDS complement(2249478..2250974) /locus_tag="Rv2004c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2004c, (MTCY39.13), len: 498 aa. Conserved hypothetical protein similar to several e.g. >pir||T36945 hypothetical protein SCJ1.12 (508 aa) - Streptomyces coelicolor >gi|5748625|emb|CAB53130.1| (AL109962). Smith-Waterman score: 7e-94, Identities = 199/468 (42%). Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216520.1" /db_xref="GI:15609141" /db_xref="GeneID:888817" /translation="MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKP VVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRD KQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAEL RHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGE PALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLR DFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTG KSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALR KARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARA GGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI" misc_feature complement(2249967..2249990) /locus_tag="Rv2004c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2250996..2251883) /locus_tag="Rv2005c" /db_xref="GeneID:888831" CDS complement(2250996..2251883) /locus_tag="Rv2005c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2005c, (MTCY39.12), len: 295 aa. Conserved hypothetical protein, similar to MTCY39.23c, (50.3% identity in 316 aa overlap), C-terminus shows some similarity with YXIE_BACSU P42297 hypothetical 15.9 kd protein in bglh- (148 aa), FASTA scores, opt: 124, E(): 0.038, (28.5% identity in 144 aa overlap), also similar to Rv2623 (294 aa), (52.7% identity in 296 aa overlap) and other Mycobacterium tuberculosis hypothetical proteins e.g. Rv1996, Rv2624c, Rv2028c, Rv3134c, Rv1636. Some, possibly all, of these belong to universal stress protein family." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216521.1" /db_xref="GI:15609142" /db_xref="GeneID:888831" /translation="MSKPRKQHGVVVGVDGSLESDAAACWGATDAAMRNIPLTVVHVV NADVATWPPMPYPETWGVWQEDEGRQIVANAVKLAKEAVGADRKLSVKSELVFSTPVP TMVEISNEAEMVVLGSSGRGALARGLLGSVSSSLVRRAGCPVAVIHSDDAVIPDPQHA PVLVGIDGSPVSELATAVAFDEASRRGVELIAVHAWSDVEVVELPGLDFSAVQQEAEL SLAERLAGWQERYPDVPVSRVVVCDRPARKLVQKSASAQLVVVGSHGRGGLTGMLLGS VSNAVLHAARVPVIVARQS" gene 2252002..2255985 /gene="otsB1" /locus_tag="Rv2006" /db_xref="GeneID:888943" CDS 2252002..2255985 /gene="otsB1" /locus_tag="Rv2006" /EC_number="3.1.3.12" /function="INVOLVED IN TREHALOSE BIOSYNTHESIS (PROTECTIVE EFFECT). Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway) [CATALYTIC ACTIVITY: TREHALOSE 6-PHOSPHATE + H(2)O = TREHALOSE + ORTHOPHOSPHATE]." /note="Rv2006, (MTCY39.11c), len: 1327 aa. Probable otsB1, trehalose-6-phosphate phosphatase (EC 3.1.3.12) (see citations below); strong similarity in central domain to OTSB_ECOLI P31678 trehalose-phosphatase (266 aa) and M. leprae TREHALOSE-PHOSPHATASE Q49734 (429 aa). Belongs to Glycosyl hydrolases family 65 (http://www.expasy.ch/cgi-bin/lists?glycosid.txt). FASTA scores, sp|Q49734|Q49734 PUTATIVE TREHALOSE-PHOSPHATASE (429 aa) opt: 1283 E(): 0; 51.7% identity in 420 aa overlap opt: 278, E(): 3.6e-11, (29.4% identity in 255 aa overlap). Note that previously known as otsB.; otsB" /codon_start=1 /transl_table=11 /product="trehalose-6-phosphate phosphatase OtsB1" /protein_id="YP_177855.1" /db_xref="GI:57116935" /db_xref="GeneID:888943" /translation="MRCGIVVNVTGPPPTIDRRYHDAVIVGLDNVVDKATRVHAAAWT KFLDDYLTRRPQRTGEDHCPLTHDDYRRFLAGKPDGVADFLAARGIRLPPGSPTDLTD DTVYGLQNLERQTFLQLLNTGVPEGKSIASFARRLQVAGVRVAAHTSHRNYGHTLDAT GLAEVFAVFVDGAVTAELGLPAEPNPAGLIETAKRLGANPGRCVVIDSCQTGLRAGRN GGFALVIAVDAHGDAENLLSSGADAVVADLAAVTVGSGDAAISTIPDALQVYSQLKRL LTGRRPAVFLDFDGTLSDIVERPEAATLVDGAAEALRALAAQCPVAVISGRDLADVRN RVKVDGLWLAGSHGFELVAPDGSHHQNAAATAAIDGLAEAAAQLADALREIAGAVVEH KRFAVAVHYRNVADDSVDNLIAAVRRLGHAAGLRVTTGRKVVELRPDIAWDKGKALDW IGERLGPAEVGPDLRLPIYIGDDLTDEDAFDAVRFTGVGIVVRHNEHGDRRSAATFRL ECPYTVCQFLSQLACDLQEAVQHDDPWTLVFHGYDPGQERLREALCAVGNGYLGSRGC APESAESEAHYPGTYVAGVYNQLTDHIEGCTVDNESLVNLPNWLSLTFRIDGGAWFNV DTVELLSYRQTFDLRRATLTRSLRFRDAGGRVTTMTQERFASMNRPNLVALQTRIESE NWSGTVDFRSLVDGGVHNTLVDRYRQLSSQHLTTAEIEVLADSVLLRTQTSQSGIAIA VAARSTLWRDGQRVDAQYRVARDTNRGGHDIQVTLSAGQSVTLEKVATIFTSRDAATL TAAISAQRCLGEAGRYAELCQQHVRAWARLWERCAIDLTGNTEELRLVRLHLLHLLQT ISPHTAELDAGVPARGLNGEAYRGHVFWDALFVAPVLSLRMPKVARSLLDYRYRRLPA ARRAAHRAGHLGAMYPWQSGSDGSEVSQQLHLNPRSGRWTPDPSDRAHHVGLAVAYNA WHYYQVTGDRQYLVDCGAELLVEIARFWVGLAKLDDSRGRYLIRGVIGPDEFHSGYPG NEYDGIDNNAYTNVMAVWVILRAMEALDLLPLTDRRHLIEKLGLTTQERDQWDDVSRR MFVPFHDGVISQFEGYSELAELDWDHYRHRYGNIQRLDRILEAEGDSVNNYQASKQAD ALMLLYLLSSDELIGLLARLGYRFAPTQIPGTVDYYLARTSDGSTLSAVVHAWVLARA NRSNAMEYFRQVLRSDIADVQGGTTQEGIHLAAMAGSIDLLQRCYSGLELRDDRLVLS PQWPEALGPLEFPFVYRRHQLSLRISGRSATLTAESGDAEPIEVECRGHVQRLRCGHT IEVGCSR" misc_feature 2252671..2252697 /gene="otsB1" /locus_tag="Rv2006" /note="PS00148 Arginase family signature 2" gene complement(2256084..2256428) /gene="fdxA" /locus_tag="Rv2007c" /db_xref="GeneID:888887" CDS complement(2256084..2256428) /gene="fdxA" /locus_tag="Rv2007c" /function="INVOLVED IN ELECTRON TRANSFER." /experiment="experimental evidence, no additional details recorded" /note="Rv2007c, (MTCY39.10), len: 114 aa. Probable fdxA, ferredoxin, similar to e.g. FER_MYCSM P00215 ferredoxin, Mycobacterium smegmatis (106 aa), FASTA scores, opt: 448, E(): 1 .6e-21, (58.7% identity in 109 aa overlap), also similar to Rv0886|MTCY31.14, (34.2% identity in 117 aa overlap) and fdxC|Rv1177." /codon_start=1 /transl_table=11 /product="ferredoxin FDXA" /protein_id="NP_216523.1" /db_xref="GI:15609144" /db_xref="GeneID:888887" /translation="MTYVIGSECVDVMDKSCVQECPVDCIYEGARMLYINPDECVDCG ACKPACRVEAIYWEGDLPDDQHQHLGDNAAFFHQVLPGRVAPLGSPGGAAAVGPIGVD TPLVAAIPVECP" gene complement(2256617..2257942) /locus_tag="Rv2008c" /db_xref="GeneID:888813" CDS complement(2256617..2257942) /locus_tag="Rv2008c" /function="UNKNOWN" /note="Rv2008c, (MTCY39.09), len: 441 aa. Conserved hypothetical protein. Contains PS00017 ATP/GTP-binding site motif A, PS00501 Signal peptidases I serine active site. Also contains helix-turn-helix motif at aa 258-279. Similar to several conserved hypothetical proteins e.g. NP_085874.1|14028123|dbj|BAB54715.1 hypothetical protein from Mesorhizobium loti (435 aa). Smith-Waterman score: 1e-74, Identities = 158/359 (44%)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216524.1" /db_xref="GI:15609145" /db_xref="GeneID:888813" /translation="MDEIESLIGLRPTPLTWPVVIAGDFLGVWDPPPSLPGAANHEIS APTARISCMLIERRDAAARLRRALHRAPVVLLTGPRQAGKTTLSRLVGKSAPECTFDA ENPVDATRLADPMLALSGLSGLITIDEAQRIPDLFPVLRVLVDRPVMPARFLILGSAS PDLVGLASESLAGRVELVELSGLTVRDVGSSAADRLWLRGGLPPSFTARSNEDSAAWR DGYITTFLERDLAQLGVRIPAATMRRAWTMLAHYHGQLFSGAELARSLDVAQTTARRY LDALTDALVVRQLTPWFANIGKRQRRSPKIYIRDTGLLHRLLGIDDRLALERNPKLGA SWEGFVLEQLAALLAPNPLYYWRTQQDAELDLYVELSGRPYGFEIKRTSTPSISRSMR SALVDLQLARLAIVYPGEHRFPLSDTVVAVPADQILTTGSVDELLALLK" misc_feature complement(2256758..2256781) /locus_tag="Rv2008c" /note="PS00501 Signal peptidases I serine active site" misc_feature complement(2257688..2257711) /locus_tag="Rv2008c" /note="PS00017 ATP/GTP-binding site motif A" gene 2258030..2258272 /locus_tag="Rv2009" /db_xref="GeneID:888925" CDS 2258030..2258272 /locus_tag="Rv2009" /function="UNKNOWN" /note="Rv2009, (MTCY39.08c), len: 80 aa. Conserved hypothetical protein, very similar to Rv1560|MTCY48.05c (54.4% identity in 68 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216525.1" /db_xref="GI:15609146" /db_xref="GeneID:888925" /translation="MYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGE PLGRDEALALQGSGFDFSNDEIESFSDTDRKLADES" gene 2258273..2258671 /locus_tag="Rv2010" /db_xref="GeneID:888933" CDS 2258273..2258671 /locus_tag="Rv2010" /function="UNKNOWN" /note="Rv2010, (MTCY39.07c), len: 132 aa. Conserved hypothetical protein, similar to Rv1561|MTCY48.04c, (38.1% identity in 126 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216526.1" /db_xref="GI:15609147" /db_xref="GeneID:888933" /translation="MIVDTSVWIAYLSTSESLASRWLADRIAADSTVIVPEVVMMELL IGKTDEDTAALRRRLLQRFAIEPLAPVRDAEDAAAIHRRCRRGGDTVRSLIDCQVAAM ALRIGVAVAHRDRDYEAIRTHCGLRTEPLF" gene complement(2258854..2259285) /locus_tag="Rv2011c" /db_xref="GeneID:888922" CDS complement(2258854..2259285) /locus_tag="Rv2011c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2011c, (MTCY39.06), len: 143 aa. Conserved hypothetical protein, some similarity to putative regulatory proteins e.g. putative marR-family regulatory protein from Streptomyces coelicolor A3(2) (157 aa), emb|CAB63189.1| (AL133469) 34% identity in 110 aa overlap. Low similarity to PETP_RHOCA P31078 petp protein. Rhodobacter capsulatus (166 aa), FASTA scores, opt: 101, E(): 0 .36, (31.8% identity in 88 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216527.1" /db_xref="GI:15609148" /db_xref="GeneID:888922" /translation="MSDEIARLVADVFELAGLLRRSGEVVAAREGHTQARWQLLSVVS DRALTVPQAARRLGVTRQGVQRVANDLVVCGLAELRHNPDHRTSPLLVLTENGRRVLQ AITERAIVVNNRLADAVDPAALQATRDSLRRMIVALKAERP" gene 2259326..2259820 /locus_tag="Rv2012" /db_xref="GeneID:888927" CDS 2259326..2259820 /locus_tag="Rv2012" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2012, (MTCY39.05c), len: 164 aa. Conserved hypothetical protein, similar to AAK04358.1|AE006263_5 hypothetical protein from Lactococcus lactis (137 aa), (48% identity in 129 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216528.1" /db_xref="GI:15609149" /db_xref="GeneID:888927" /translation="MLSKSKRSCRRRETLRIGEKMSAPITNLQAAQRDAIMNRPAVNG FPHLAETLRRAGVRTNTWWLPAMQSLYETDYGPVLDQGVPLIDGVAEVPAFDRTALVT ALRADQAGQTSFREFAAAAWRAGVLRYVVDLENRTCTYFGLHDQTYMEHYAAVEPSGG APTS" repeat_region 2260443..2261670 /note="IS1607, len: 1228 bp. Vestigial Insertion sequence element, IS1607." /mobile_element="insertion sequence:IS1607" gene 2260665..2261144 /locus_tag="Rv2013" /db_xref="GeneID:887546" CDS 2260665..2261144 /locus_tag="Rv2013" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv2013, (MTCY39.04c), len: 159 aa. Possible transposase: shows similarity to N-terminal part of transposase and insertion element hypothetical proteins eg sp|Q53198|Y4UE_RHISN PUTATIVE TRANSPOSASE Y4UE (359 aa) opt: 383, E(): 1.3e-18; 35.1% identity in 225 aa overlap; sp|P 14707|YM3_STRCO MINI-CIRCLE HYPOTHETICAL 45.7 kDa P (414 aa) opt: 302, E(): 4.2e-13; 33.3% identity in 207 aa overlap; and YI90_MYCPA P14322 insertion element is900 hypothetical protein (399 aa), FASTA scores, opt: 146, E(): 0.0021, (26.9% identity in 145 aa overlap). Length changed since first submission (no clear start apparent)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216529.2" /db_xref="GI:57116936" /db_xref="GeneID:887546" /translation="MDTLLEAGITVVVISPNQLKNLRGRYGSAGNKDDRFDAFVLADT LRTDRSRLRPLLPDTPATATLRRTCRPRKDLVAHRVALANQLRAHLRVVFPGVVGLFA DLDSPISLAFLTFLPRFDCQDRADWLSVKRLAGWLAAAGYCGRAPRPAHRCPARRHR" gene 2261098..2261688 /locus_tag="Rv2014" /db_xref="GeneID:887547" CDS 2261098..2261688 /locus_tag="Rv2014" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv2014, (MTCY39.03c), len: 196 aa. Possible transposase, similar to insertion elements e.g. sp|P14707|YM3_STRCO MINI-CIRCLE HYPOT HETICAL 45.7 kDa P (414 aa) opt: 249 z-score: 307.0 E(): 1.4e-09; 33.1% identity in 169 aa overlap; and YI90_MYCPA P14322 insertion element is900 hypothetical protein (399 a a), FASTA scores, opt: 242, z-score: 299.9, E(): 3.7e-10, (3 2.5% identity in 163 aa overlap); possibly made by frameshifting with respect to upstream ORF. Length changed since first submission." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216530.2" /db_xref="GI:57116937" /db_xref="GeneID:887547" /translation="MLHDRLTGAPRGATGDEGAANAHITRAMVAALTSVATQIKTLDA QIAEQLSLHADAHIFTSLPRSGTVRAARLLAEIGDCRARFPTPESLACLAGVAPSTRQ SGKVKHVGFRWAADKQLRDAVCDFAGDSRRANLWAADRYNRAIARGHDHPHAVRILAR AWLYAIWHCWQDGAAYHPANHRALQALLNQDQDRAA" gene complement(2261816..2263072) /locus_tag="Rv2015c" /db_xref="GeneID:888378" CDS complement(2261816..2263072) /locus_tag="Rv2015c" /function="UNKNOWN" /note="Rv2015c, (MTV018.02c), len: 418 aa. Conserved hypothetical protein. Nearly identical to Mycobacterium tuberculosis Rv1765c|MTCY28.31c, (378 aa), an ORF starting next to ISB9, and ending in IS6110. Different N-terminus chosen and C-terminus differs as that of Rv1765c has been truncated by IS6110. Does NOT show similarities with transposases. BLAST hits with non-IS part of MTU78639. FASTA scores: Z95890|MTCY28_31 (378 aa) opt: 2417, E(): 0, (97.8% identity in 364 aa overlap). TBparse score is 0.939" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216531.1" /db_xref="GI:15609152" /db_xref="GeneID:888378" /translation="MSSTATSGAAVVSPAERVEVLFEELAELAGQRNAIDGRIVEIVA ELDRDGLWGVTGARSVAGLVAWKMGCSSGNAHTIATVARRLPEFPRCARGMREGRLSL DQVGVIAGRAGEGSDAHYAQLAGVATVNQLRTALKLEPRPEPEPDFRPEPRPSITRSA DEQFSCWRIKLPHVEAAKFDAALQSHLDALIAEYKRDHDNSDGVSDQRPPLPGNVEAF LRLVEAGWDAEVARRPHGQHTTVVMHLDVQERAAGLHLGPLLSESERRYLLCDATFEA WFERDGQVIGCGRTTRQINRRLRRALEHRDRTCVVPGCGATRGLHAHHIRHWQDGGAT ELANLVLVCPYHHRAHHRGLITITGPADNLTVADSAGRPLSAGSLARASTKPPPAVAP WPGPTGERADWWWYEPFQPQPPPISN" gene 2263426..2264001 /locus_tag="Rv2016" /db_xref="GeneID:888630" CDS 2263426..2264001 /locus_tag="Rv2016" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2016, (MTV018.03), len: 191 aa. Hypothetical protein. TBparse score is 0.927" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216532.1" /db_xref="GI:15609153" /db_xref="GeneID:888630" /translation="MTELGDKFLAALVGTIRDTRFDIADMRNWRPGWFPTMHSRCLSN LIHDRIWAHLVTLIASNPGTSIKDKGATREIVVGAHLRLRIKRHHAGDEISTYPTRTA IEFWQQGSQPAFPGLEEVRIAVGYRWDPDTREIGAPLLSLRDGKDHVIWVVELDEPAA GVKITWTPIEPTLPSIDFGDLGEDSGASGER" gene 2263998..2265038 /locus_tag="Rv2017" /db_xref="GeneID:888435" CDS 2263998..2265038 /locus_tag="Rv2017" /function="THOUGH TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2017, (MTV018.04), len: 346 aa. Possible regulatory protein; shows similarity at N-terminal end to several transcriptional regulators e.g. Z99115|BSUB0012_44 from Bacillus subtilis (108 aa), FASTA scores: opt: 154, E(): 0.0012; (35.5% identity in 62 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature in C-terminal half, may be fortuitous. TBparse score is 0.908. Contains probable helix-turn-helix motif at aa 18-39 (Score 2243, +6.83 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216533.1" /db_xref="GI:15609154" /db_xref="GeneID:888435" /translation="MNGLGDVLAVARKARGLTQIELAELVGLTQPAINRYESGDRDPD QHIVAKLAEILGVTDDLLIHGNRFRGALAVDAHMRRHKTTKASAWRQLEARLNLLRVH ASFLFEEVAINSEQHVPAFDPEFTAAEDAARLVRAQWRMPMGPVVNLTRWMEAAGCLV FEEDFATQRIDGLSQWVDDYPVMLINANAAPDRKRLTLAHELGHLVLHSTNPTENMET EATAFAAEFLMPESEIRPELRRLDLGKLLELKREWGVSMQALLARAYRMGLVSAEART KLYKAMNARGWKTKEPGIESIVREKPSLPAHIGMTLRSRGFTDQQAAAIAGYANPADN PFRPEGGRLHAI" misc_feature 2264586..2264615 /locus_tag="Rv2017" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene 2265280..2265999 /locus_tag="Rv2018" /db_xref="GeneID:887560" CDS 2265280..2265999 /locus_tag="Rv2018" /function="UNKNOWN" /note="Rv2018, (MTV018.05), len: 239 aa. Conserved hypothetical protein, similar to Rv2308|MTCY339.01c (238 aa). FASTA scores: Z77163|MTCY339_1 Mycobacterium tuberculosis cosmid (238 aa) opt: 142, E(): 0.029; (24.8% identity in 250 aa overlap). Contains probable helix-turn-helix motif at aa 215-236 (Score 1175, +3.19 SD). TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216534.1" /db_xref="GI:15609155" /db_xref="GeneID:887560" /translation="MAGDQELELRFDVPLYTLAEASRYLVVPRATLATWADGYERRPA NAPAVQGQPIITALPHPTGSHARLPFVGIAEAYVLNAFRRAGVPMQRIRPSLDWLIKN VGPHALASQDLCTDGAEVLWRFAERSGEGSPDDLVVRGLIVPRSGQYVFKEIVEHYLQ QISFADDNLASMIRLPQYGDANVVLDPRRGYGQPVFDGSGVRVADVLGPLRAGATFQA VADDYGVTPDQLRDALDAIAA" gene 2265989..2266405 /locus_tag="Rv2019" /db_xref="GeneID:887665" CDS 2265989..2266405 /locus_tag="Rv2019" /function="UNKNOWN" /note="Rv2019, (MTV018.06), len: 138 aa. Hypothetical protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216535.1" /db_xref="GI:15609156" /db_xref="GeneID:887665" /translation="MQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYG ETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAE QVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL" gene complement(2266421..2266720) /locus_tag="Rv2020c" /db_xref="GeneID:887656" CDS complement(2266421..2266720) /locus_tag="Rv2020c" /function="UNKNOWN" /note="Rv2020c, (MTV018.07c), len: 99 aa. Conserved hypothetical protein, nearly identical to C-terminal part of hypothetical protein RvD1-Rv2024c' from Mycobacterium bovis BCG (1606 aa) emb|CAB44655.1| (Y18605). Corresponds to deletion region RvD1 so probably truncated protein. TBparse score is 0.891" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216536.1" /db_xref="GI:15609157" /db_xref="GeneID:887656" /translation="MAPGMKWAAKTDHLAIVLLPRHHRRHSRRGRALPARSRSALGWI IERYRVTTDKASGIVNDPNDWCDEHDDPTYIVDLIKKVTTVSVETMKIVDGLAGG" gene complement(2266805..2267110) /locus_tag="Rv2021c" /db_xref="GeneID:888092" CDS complement(2266805..2267110) /locus_tag="Rv2021c" /function="THOUGH TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2021c, (MTV018.08c), len: 101 aa. Possible regulatory protein, similar to Rv3183|MTV014.27|AL021646 POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN from M. tuberculosis (109 aa), FASTA scores: opt: 214, E(): 1.2e-09, (43.0% identity in 107 aa overlap). TBparse score is 0.913. Contains probable helix-turn-helix at aa 45-66 (Score 1472, +4.20 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_216537.1" /db_xref="GI:15609158" /db_xref="GeneID:888092" /translation="MAMTLRDMDAVRPVNREAVDRHKARMRDEVRAFRLRELRAAQSL TQVQVAALAHIRQSRVSSIENGDIGSAQVNTLRKYVSALGGELDITVRLGDETFTLA" gene complement(2267119..2267724) /locus_tag="Rv2022c" /db_xref="GeneID:888129" CDS complement(2267119..2267724) /locus_tag="Rv2022c" /function="UNKNOWN" /note="Rv2022c, (MTV018.09c), len: 201 aa. Conserved hypothetical protein, similar to Mycobacterium tuberculosis hypothetical protein Rv3182, MTV014.26 (EMBL:AL 021646). FASTA scores; TR:E1248773 (114 aa) opt: 335, E(): 3e-22, 53.8% identity in 106 aa overlap and to hypothetical proteins from Yersinia pestis (115 aa) e.g. emb|CAB53172.1| (AL109969), 41% identity in 108 aa overlap. TBparse score is 0.912" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216538.1" /db_xref="GI:15609159" /db_xref="GeneID:888129" /translation="MNVPWENAHGGALYCLIRGDEFSAWHRLLFQRPGCAESVLACRH FLDGSPVARCSYPEEYHPCVISRIALLCDSVGWTADVERISAWLNGLDRETYELVFAA IEVLEEEGPALGCPLVDTVRGSRHKNMKELRPGSQGRSEVRILFAFDPARQAIMLAAG NKAGRWTQWYDEKIKAADEMFAEHLAQFEDTKPKRRKRKKG" gene complement(2267749..2268108) /locus_tag="Rv2023c" /db_xref="GeneID:887444" CDS complement(2267749..2268108) /locus_tag="Rv2023c" /function="UNKNOWN" /note="Rv2023c, (MTV018.10c), len: 119 aa. Hypothetical protein, alternative upstream start possible. TBparse score is 0.913" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216539.1" /db_xref="GI:15609160" /db_xref="GeneID:887444" /translation="MAARHARAGRWAAQPRPMLGSGAVRYEVGANIDATGFGGIAAVH RLVTRLGLVTRLGLVERVDAHSRFSSSNLPKSSRRISGRVSLSGMSNSAAKVVASTSS SPWGQPLSVGLRRRWRS" gene complement(2268268..2268726) /locus_tag="Rv2023A" /pseudo /db_xref="GeneID:3205050" misc_feature complement(2268268..2268726) /locus_tag="Rv2023A" /note="Rv2023A, len: 152 aa. Hypothetical unknown protein (pseudogene), equivalent to the C-terminus of Q8VJS0|MT2080 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (225 aa), FASTA scores: opt: 1028, E(): 3.6e-66, (99.342% identity in 152 aa overlap) and C-terminus of Mb2047c HYPOTHETICAL PROTEIN from Mycobacterium bovis (225 aa). And N-terminal part equivalent to the C-terminus of Q9XB17 HYPOTHETICAL 15.5 kDa PROTEIN from Mycobacterium bovis BCG (131 aa), FASTA scores: opt: 409, E(): 4.2e-22, (98.276% identity in 58 aa overlap). Note that a deletion of DNA (RvD1 region) in Mycobacterium tuberculosis strain H37Rv resulted in a truncated CDS comparatively to Mycobacterium bovis or Mycobacterium tuberculosis strain CDC1551 genomes (see citations below).;HYPOTHETICAL PROTEIN" /pseudo /db_xref="PSEUDO:CAE55448.1" gene complement(2268693..2270240) /locus_tag="Rv2024c" /db_xref="GeneID:888351" CDS complement(2268693..2270240) /locus_tag="Rv2024c" /function="UNKNOWN" /note="Rv2024c, (MTV018.11c), len: 515 aa. Conserved hypothetical protein. Identical to N-terminal part of much larger hypothetical protein, RvD1-Rv2024c' (1606 aa), from Mycobacterium bovis BCG: CAB44655.1|Y18605|13881753|AAK46361.1|AE007059 so probably truncated. Part of RvD1 chromosomal deletion region. Also similar to hypothetical protein from Helicobacter pylori. FASTA scores: AE0005|HPAE000580_2 Helicobacter pylori (607 aa) opt: 64, E(): 0, (36.2% identity in 464 aa overlap). TBparse score is 0.879" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216540.1" /db_xref="GI:15609161" /db_xref="REBASE:MtuHORF2024P" /db_xref="GeneID:888351" /translation="MGSVHDVIEAFRKAPSNAERGTKFEQLMVRYFELDPTMAQQYDA VWWWIDWPERRGRTDTGIDLVARERDTGNYTAIQCKFYEPTHTLAKGDIDSFFTASGK TGFTNRVIISTTDRWGRNAEDALADQLVPVQRIGMAEIAESPIDWDIAWPADDLQVNL TPAKRHELRPHQQQAIDAVFRGFAVGNDRGKLIMACGTGKTFTALKIAERIAADNGGS ARILLLVPSISLLSQTLREWTAQSELDVRAFAVCSDTKVSRSAEDYHVHDVPIPVTTD ARVLLHEMAHRRRAQGLTVVFCTYQSLPTVAKAQRLGVDEFDLVMCDEAHRTTGVTLA GDDESNFVRVHDGQYLKAARRLYMTATPRIFTESIKDRADQHSAELVSMDDELTFGPE FHRLSFGEAVERGLLTDYKVMVLTVDQGVIAPRLQQELSGVSGELMLDDASKIVGCWN GLAKRSGTGIVAGEPPMRRAVAFAKDIKTSKQVAELFPKVVEAYRELVDDGPGLACLN SSRRIQA" gene complement(2270750..2271748) /locus_tag="Rv2025c" /db_xref="GeneID:888782" CDS complement(2270750..2271748) /locus_tag="Rv2025c" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF METAL IONS ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv2025c, (MTV018.12c), len: 332 aa. Possible conserved transmembrane protein, CDF family possibly involved in transport of metal ions, similar to several hypothetical bacterial proteins e. g. Methanobacterium thermoautotrophicum AE000941_1 (298 aa; described as cation efflux system protein) and Archaeoglob us fulgidus AE001111_5 (384 aa). FASTA scores: AE000941_1 M ethanobacterium thermoautotrophicum (298 aa) opt: 452 E(): 3.3e-24; 30.8% identity in 266 aa overlap and AE001111_5 Archaeoglobus fulgidus section 16 (384 aa) opt: 371 E(): 1.7e-18; 27.7% identity in 267 aa overlap. TBparse score is 0.897" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216541.1" /db_xref="GI:15609162" /db_xref="GeneID:888782" /translation="MTHDHAHSRGVPAMIKEIFAPHSHDAADSVDDTLESTAAGIRTV KISLLVLGLTALIQIVIVVMSGSVALAADTIHNFADALTAVPLWIAFALGAKPATRRY TYGFGRVEDLAGSFVVAMITMSAIIAGYEAIARLIHPQQIEHVGWVALAGLVGFIGNE WVALYRIRVGHRIGSAALIADGLHARTDGFTSLAVLCSAGGVALGFPLADPIVGLLIT AAILAVLRTAARDVFRRLLDGVDPAMVDAAEQALAARPGVQAVRSVRMRWIGHRLHAD AELDVDPALDLAQAHRIAHDAEHELTHTVPKLTTALIHAYPAEHGSSIPDRGRTVE" gene complement(2271863..2272747) /locus_tag="Rv2026c" /db_xref="GeneID:887460" CDS complement(2271863..2272747) /locus_tag="Rv2026c" /function="UNKNOWN" /note="Rv2026c, (MTV018.13c), len: 294 aa. Conserved hypothetical protein, very similar to Mycobacterium tuberculosis hypothetical proteins Rv2005c, Rv2623, Rv1996, Rv2624c, Rv2028c, Rv3134c, Rv1636. Some, possibly all, of these belong to universal stress protein family. TBparse score is 0.946" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216542.1" /db_xref="GI:15609163" /db_xref="GeneID:887460" /translation="MSAATAKYGILVGVDGSAQSNAAVAWAAREAVMRQLPITLLHIV APVVVGWPVGQLYANMTEWQKDNAQQVIEQAREALTNSLGESKPPQVHTELVFSNVVP TLIDASQQAWLMVVGSQGMGALGRLLLGSISTALLHHARCPVAIIHSGNGATPDSDAP VLVGIDGSPASEAATALAFDEASRRRVDLVALHAWTDLGMFPVLGMDWREREKREAEV LAERLAGWQEQYPDVRVHRSLVCDKPARWLLEHSEQAQLVVVGSHGRGGFSGMLLGSV SSAVAHSVRIPVIVVRPS" gene complement(2272787..2274508) /locus_tag="Rv2027c" /db_xref="GeneID:888471" CDS complement(2272787..2274508) /locus_tag="Rv2027c" /function="signal transduction" /note="Rv2027c, (MTV018.14c), len: 573 aa. Probable histidine kinase response regulator, highly similar to others e.g. NP_628132.1|NC_003888 putative two component sensor from Streptomyces coelicolor (560 aa); NP_626695.1|NC_003888 putative two-component sensor histidine kinase from Streptomyces coelicolor (475 aa); etc. Highly similar to Mycobacterium tuberculosis protein Rv3132c, MTCY03A2.2 6. FASTA scores: Z83867|MTCY3A2_26 (578 aa) opt: 2330, E(): 0; 62.5% identity in 560 aa overlap. TBparse score is 0.903" /codon_start=1 /transl_table=11 /product="histidine kinase response regulator" /protein_id="NP_216543.1" /db_xref="GI:15609164" /db_xref="GeneID:888471" /translation="MTHPDRANVNPGSPPLRETLSQLRLRELLLEVQDRIEQIVEGRD RLDGLIDAILAITSGLKLDATLRAIVHTAAELVDARYGALGVRGYDHRLVEFVYEGID EETRHLIGSLPEGRGVLGALIEEPKPIRLDDISRHPASVGFPLHHPPMRTFLGVPVRI RDEVFGNLYLTEKADGQPFSDDDEVLVQALAAAAGIAVDNARLFEESRTREAWIEATR DIGTQMLAGADPAMVFRLIAEEALTLMAGAATLVAVPLDDEAPACEVDDLVIVEVAGE ISPAVKQMTVAVSGTSIGGVFHDRTPRRFDRLDLAVDGPVEPGPALVLPLRAADTVAG VLVALRSADEQPFSDKQLDMMAAFADQAALAWRLATAQRQMREVEILTDRDRIARDLH DHVIQRLFAVGLTLQGAAPRARVPAVRESIYSSIDDLQEIIQEIRSAIFDLHAGPSRA TGLRHRLDKVIDQLAIPALHTTVQYTGPLSVVDTVLANHAEAVLREAVSNAVRHANAT SLAINVSVEDDVRVEVVDDGVGISGDITESGLRNLRQRADDAGGEFTVENMPTGGTLL RWSAPLR" gene complement(2274569..2275408) /locus_tag="Rv2028c" /db_xref="GeneID:888494" CDS complement(2274569..2275408) /locus_tag="Rv2028c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2028c, (MTV018.15c), len: 279 aa. Conserved hypothetical protein, highly similar to Mycobacterium tuberculosis proteins Rv2005c, Rv2623, Rv1996, Rv2624c, Rv3134c, Rv1636. Some, possibly all, of these belong to universal stress protein family. Rv2624c|MTCY01A10.08 (272 aa) and Rv3134c|MTCY03A2.24 (268 aa). FASTA scores: Z95387|MTCY1A10_8 (272 aa) opt: 563, E(): 2.5e-31, (36.8% identity in 266 aa overlap) and Z83867|MTCY3A2_24 (268 aa) opt: 562, E(): 2.9e-31, (40.7% identity in 273 aa overlap). TBparse score is 0.904" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216544.1" /db_xref="GI:15609165" /db_xref="GeneID:888494" /translation="MNQSHKPPSIVVGIDGSKPAVQAALWAVDEAASRDIPLRLLYAI EPDDPGYAAHGAAARKLAAAENAVRYAFTAVEAADRPVKVEVEITQERPVTSLIRASA AAALVCVGAIGVHHFRPERVGSTAAALALSAQCPVAIVRPHRVPIGRDAAWIVVEADG SSDIGVLLGAVMAEARLRDSPVRVVTCRQSGVGDTGDDVRASLDRWLARWQPRYPDVR VQSAAVHGELLDYLAGLGRSVHMVVLSASDQEHVEQLVGAPGNAVLQEAGCTLLVVGQ QYL" gene complement(2275405..2276424) /gene="pfkB" /locus_tag="Rv2029c" /db_xref="GeneID:887491" CDS complement(2275405..2276424) /gene="pfkB" /locus_tag="Rv2029c" /EC_number="2.7.1.-" /function="Involved in glycolysis: converts sugar-1-P to sugar-1,6-P." /experiment="experimental evidence, no additional details recorded" /note="Rv2029c, (MTV018.16c), len: 339 aa. Probable pfkB, phosphofructokinase (EC 2.7.1.-), similar to others eg P06999|K6P2_ECOLI 6-PHOSPHOFRUCTOKINASE I SOZYME 2 from E. coli (309 aa), FASTA scores: opt: 705, E(): 0; (41.4% identity in 304 aa overlap); and LACC_STRMU phosphotagatosekinase (310 aa); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1. TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="phosphofructokinase PfkB (phosphohexokinase)" /protein_id="NP_216545.1" /db_xref="GI:15609166" /db_xref="GeneID:887491" /translation="MTEPAAWDEGKPRIITLTMNPALDITTSVDVVRPTEKMRCGAPR YDPGGGGINVARIVHVLGGCSTALFPAGGSTGSLLMALLGDAGVPFRVIPIAASTRES FTVNESRTAKQYRFVLPGPSLTVAEQEQCLDELRGAAASAAFVVASGSLPPGVAADYY QRVADICRRSSTPLILDTSGGGLQHISSGVFLLKASVRELRECVGSELLTEPEQLAAA HELIDRGRAEVVVVSLGSQGALLATRHASHRFSSIPMTAVSGVGAGDAMVAAITVGLS RGWSLIKSVRLGNAAGAAMLLTPGTAACNRDDVERFFELAAEPTEVGQDQYVWHPIVN PEASP" misc_feature complement(2276209..2276283) /gene="pfkB" /locus_tag="Rv2029c" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(2276441..2278486) /locus_tag="Rv2030c" /db_xref="GeneID:887536" CDS complement(2276441..2278486) /locus_tag="Rv2030c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2030c, (MTV018.17c), len: 681 aa. Conserved hypothetical protein that corresponds to products of two adjacent ORF's described previously MSGTUBDWN_4 (390 aa) and MSGTUBDWN_1 (385 aa). Also similar to C-terminal two-thirds of Mycobacterium tuberculosis protein Rv2143 (MTCY270.25c; 352 aa) and to Rv0571c (443 aa) and M. leprae protein U650s MLU15184_16 (258 aa). FASTA scores: M93129|MSGTUBDWN_4 (390 aa) opt: 2530 E(): 0; 97.7% identity in 385 aa overlap and M93129|MSGTUBDWN_1 (385 aa) opt: 1983 E(): 0; 99.0% identity in 309 aa overlap. Z95388| MTCY270_25 (352 aa) opt: 882 E(): 0; 61.1 % identity in 226 aa overlap. U15184|MLU15184_16 (258 aa) opt: 549 E(): 9.8e-29; 43.8% identity in 219 aa overlap. TBparse score is 0.907" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216546.1" /db_xref="GI:15609167" /db_xref="GeneID:887536" /translation="MLMTAAADVTRRSPRRVFRDRREAGRVLAELLAAYRDQPDVIVL GLARGGLPVAWEVAAALHAPLDAFVVRKLGAPGHDEFAVGALASGGRVVVNDDVVRGL RITPQQLRDIAEREGRELLRRESAYRGERPPTDITGKTVIVVDDGLATGASMFAAVQA LRDAQPAQIVIAVPAAPESTCREFAGLVDDVVCATMPTPFLAVGESFWDFRQVTDEEV RRLLATPTAGPSLRRPAASTAADVLRRVAIDAPGGVPTHEVLAELVGDARIVLIGESS HGTHEFYQARAAMTQWLIEEKGFGAVAAEADWPDAYRVNRYVRGLGEDTNADEALSGF ERFPAWMWRNTVVRDFVEWLRTRNQRYESGALRQAGFYGLDLYSLHRSIQEVISYLDK VDPRAAARARARYACFDHACADDGQAYGFAAAFGAGPSCEREAVEQLVDVQRNALAYA RQDGLLAEDELFYAQQNAQTVRDAEVYYRAMFSGRVTSWNLRDQHMAQTLGSLLTHLD RHLDAPPARIVVWAHNSHVGDARATEVWADGQLTLGQIVRERYGDESRSIGFSTYTGT VTAASEWGGIAQRKAVRPALHGSVEELFHQTADSFLVSARLSRDAEAPLDVVRLGRAI GVVYLPATERQSHYLHVRPADQFDAMIHIDQTRALEPLEVTSRWIAGENPETYPTGL" gene complement(2278498..2278932) /gene="hspX" /locus_tag="Rv2031c" /db_xref="GeneID:887579" CDS complement(2278498..2278932) /gene="hspX" /locus_tag="Rv2031c" /function="STRESS PROTEIN INDUCED BY ANOXIA. HAS A PROPOSED ROLE IN MAINTENANCE OF LONG-TERM VIABILITY DURING LATENT, ASYMPTOMATIC INFECTIONS, AND A PROPOSED ROLE IN REPLICATION DURING INITIAL INFECTION. REGULATED BY THE TWO COMPONENT REGULATORY SYSTEM DEVR|Rv3133c/DEVS|Rv3132c, IN RESPONSE TO A HYPOXIC SIGNAL." /experiment="experimental evidence, no additional details recorded" /note="Rv2031c, (MTV018.18c), len: 144 aa. hspX, heat shock protein localized in the inner membrane (see citations below). Identical to P30223|14KD_MYCTU 14 KD ANTIGEN (16 kDa ANTIGEN) (HSP 16.3) of Mycobacterium tuberculosis (143 aa), FASTA scores: opt: 933, E(): 0, (100.0% identity in 143 aa overlap). BELONGS TO THE SMALL HEAT SHOCK PROTEIN (HSP20) FAMILY. Also known as alpha-crystallin and gene as acr (see some citations below). TBparse score is 0.897.; acr" /codon_start=1 /transl_table=11 /product="heat shock protein hspX" /protein_id="NP_216547.1" /db_xref="GI:15609168" /db_xref="GeneID:887579" /translation="MATTLPVQRHPRSLFPEFSELFAAFPSFAGLRPTFDTRLMRLED EMKEGRYEVRAELPGVDPDKDVDIMVRDGQLTIKAERTEQKDFDGRSEFAYGSFVRTV SLPVGADEDDIKATYDKGILTVSVAVSEGKPTEKHIQIRSTN" gene 2279129..2280124 /gene="acg" /locus_tag="Rv2032" /db_xref="GeneID:887582" CDS 2279129..2280124 /gene="acg" /locus_tag="Rv2032" /function="Unknown. May have a role for bacteria within the host environment." /experiment="experimental evidence, no additional details recorded" /note="Rv2032, (MTV018.19), len: 331 aa. acg (for acr-coregulated gene), conserved hypothetical protein possibly member of a superfamily of classical nitroreductases (see Purkayastha et al., 2002), similar to hypothetical mycobacterial proteins Rv3127|MTCY164.37 (344 aa) and Rv3131|MTCY03A2.27c (332 aa). FASTA scores: Z95150|MTCY164_38 Mycobacterium tuberculosis cosmid (344 aa) opt: 1208, E(): 0, (56.4% identity in 321 aa overlap); Z83867| MTCY3A2_27 Mycobacterium tuberculosis cosmid (332 aa) opt: 568, E(): 8.6e-30, (36.8% identity in 321 aa overlap). Similar to proteins SCJ1.11 (330 aa; AL109962) and SCJ12.27c (335 aa; AL109989) in Streptomyces coelicolor. TBparse score is 0.931." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216548.1" /db_xref="GI:15609169" /db_xref="GeneID:887582" /translation="MPDTMVTTDVIKSAVQLACRAPSLHNSQPWRWIAEDHTVALFLD KDRVLYATDHSGREALLGCGAVLDHFRVAMAAAGTTANVERFPNPNDPLHLASIDFSP ADFVTEGHRLRADAILLRRTDRLPFAEPPDWDLVESQLRTTVTADTVRIDVIADDMRP ELAAASKLTESLRLYDSSYHAELFWWTGAFETSEGIPHSSLVSAAESDRVTFGRDFPV VANTDRRPEFGHDRSKVLVLSTYDNERASLLRCGEMLSAVLLDATMAGLATCTLTHIT ELHASRDLVAALIGQPATPQALVRVGLAPEMEEPPPATPRRPIDEVFHVRAKDHR" gene complement(2280240..2281082) /locus_tag="Rv2033c" /db_xref="GeneID:887972" CDS complement(2280240..2281082) /locus_tag="Rv2033c" /function="UNKNOWN" /note="Rv2033c, (MTV018.20), len: 280 aa. Conserved hypothetical protein, similar to hypothetical protein SCC77.24 (274 aa) from Streptomyces coelicolor A3(2) CAB66235.1|AL13650) (50% identity in 261 aa overlap). TBparse score is 0.897" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216549.1" /db_xref="GI:15609170" /db_xref="GeneID:887972" /translation="MLDRYGTDVLAAGGRRRPRSVEHPVELGMVVEDAETGYVGAVVR VEYGRIDLEDRYGKTRGFPLGPGYLLDGLPVILTAPRCAAAAGPRRTASGSVAVPGAR ARVARASRIYVEGRHDAELIAAVWGADLRIEGVVVEHLGGVDDLVEIVAKFRPGPRRR LGVLVDHLVAGSKEARIAEVVRRGPGGSDTLVVGHPYVDIWQAVKPQRVGLAAWPRVP RHIEWKHGVCDALGWPHADQADIAAAWRRIRSQVRDWTDLEPALIGRVEELIDFVTQP AGDE" gene 2281294..2281617 /locus_tag="Rv2034" /db_xref="GeneID:887859" CDS 2281294..2281617 /locus_tag="Rv2034" /function="Involved in transcriptional regulation." /experiment="experimental evidence, no additional details recorded" /note="Rv2034, (MTV018.21), len: 107 aa. Probable repressor protein similar to several belonging to the ARSR FAMILY e.g. Q53040 (112 aa). FASTA scores: sptr|Q53040|Q53040 NITRILE HYDRATASE REGULATAR 2 (112 aa) opt: 167, E( ): 6.7e-06; 44.7% identity in 76 aa overlap. TBparse score is 0.905. Contains probable helix-turn-helix at aa 32-53 (S core 1350, +3.78 SD)" /codon_start=1 /transl_table=11 /product="ArsR-type repressor protein" /protein_id="NP_216550.1" /db_xref="GI:15609171" /db_xref="GeneID:887859" /translation="MSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSR PAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAALRTDLDRFWTRALTGYAQLIDS EGDDT" gene 2281614..2282102 /locus_tag="Rv2035" /db_xref="GeneID:887445" CDS 2281614..2282102 /locus_tag="Rv2035" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2035, (MTV018.22), len: 162 aa. Conserved hypothetical protein, similar to conserved hypothetical protein (156 aa) from Sinorhizobium meliloti CAC46569.1|AL591789 (34% identity in 146 aa overlap). TBparse score is 0.925" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216551.1" /db_xref="GI:15609172" /db_xref="GeneID:887445" /translation="MTRPRTDAIHHHVVVNAPIERAFAVFTTRFGDFKPREHNLLAIP ITETVFECHAGGHIYDRGVDGSVCKWARVLVYEPPSRVLFTWDIGPTWRPETDLAKTS EVEVRFTAQSAETTRVDLEHRHLDRHGPGWESVADGVDSEAGWPLYLRRYTDLLCIQV QP" gene 2282099..2282740 /locus_tag="Rv2036" /db_xref="GeneID:887433" CDS 2282099..2282740 /locus_tag="Rv2036" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2036, (MTV018.23), len: 213 aa. Conserved hypothetical protein; slight similarity to Streptomyces lincolnensis protein involved in lincomycin production Q54375 (238 aa). FASTA scores: sptr|Q54375|Q54375 (78-11) LINCOMYCIN PRODUCTION GENES (238 aa) opt: 119, E(): 0.97; 31.3% identity in 99 aa overlap. TBparse score is 0.934" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216552.1" /db_xref="GI:15609173" /db_xref="GeneID:887433" /translation="MIAADDDTEKSMMDMARAERAELAAFLTTLTLQQWETPSLCAGW SVKEVVAHMISYEDLGVFGLLKRFAKGRIVRANEVGVDEFAGLSPQELADYVGRHLQP RGLTAGFGGMIALVDGMIHHQDIRRPLGQPRTIPAQRLDRVLRLMPKNPRLRARPRIK GLRLRATDLDWTIGTGPEVTGPGEALLMAMAGRPAAVSDLSGPGKPTLAGRLG" gene complement(2282747..2283721) /locus_tag="Rv2037c" /db_xref="GeneID:888021" CDS complement(2282747..2283721) /locus_tag="Rv2037c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2037c, (MTV018.24c), len: 324 aa. Possible conserved transmembrane protein, similar to hypothetical proteins from Mycobacterium leprae MLCB2052.31 (329 aa) and Bacillus subtilis P54513|YQHO_BACSU (291 aa). FASTA scores: Z98604|MLCB2052_1 6 Mycobacterium leprae cosmid B205 (329 aa) opt: 1764, E(): 0; 80.5% identity in 323 aa overlap and sp|P54513|YQHO_BACSU HYPOTHETICAL 32.9 KD PROTEIN IN G (291 aa ) opt: 328, E(): 8.8e-14; 36.6% identity in 306 aa overlap. TBparse score is 0.919" /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216553.1" /db_xref="GI:15609174" /db_xref="GeneID:888021" /translation="MALVSTARVDLVCEGGGVRGIGLVGAVDALADAGYRFPRVAGSS AGAIVASLVAALQTAGEPVTRLAEMMRSIDYPKFLDRNLIGHVPLIGGGLSLLLSDGV YRGAYLEQLLGGLLADLGVHTFGDLRTGEAPEQFAWSLVVTASDLSRRRLVRIPWDLD SYGIHPDDFSVARAVHASSAIPFVFEPVRVRGATWVDGGLLSNFPVALFDRTDAEPRW PTFGIRLSARPGIPPTRPVQGPVSLGIAAIETLVSNQDNAYIDDPCTVRRTIFVPAHD VSPIDFDITAEQREALYQRGFQAGQKFLANWNYADCLADCGGPFTPSL" gene complement(2283723..2284796) /locus_tag="Rv2038c" /db_xref="GeneID:887998" CDS complement(2283723..2284796) /locus_tag="Rv2038c" /function="Thought to be involved in active transport of sugar across the membrane (import). Responsible for energy coupling to the transport system." /note="Rv2038c, (MTV018.25c), len: 357 aa. Probable sugar-transport ATP-binding protein ABC transporter (see citation below), equivalent to MLCB2052.30|Z98604|MLCB2052_15 from Mycobacterium leprae (356 aa), FASTA scores: opt: 1866, E(): 0, (79.7% identity in 355 aa overlap). Also similar to multiple sugar import proteins e.g. Y08921|SRMSIK_1 msiK protein from Streptomyces reticuli (377 aa), FASTA scores: opt: 1336, E(): 0, (62.6% identity in 377 aa overlap); etc. Also similar to several proteins from Mycobacterium tuberculosis e.g. Rv2832c, Rv1238, Rv2397c, Rv3758c. Contains PS00211 ABC transporters family signature and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="sugar-transport ATP-binding protein ABC transporter" /protein_id="NP_216554.1" /db_xref="GI:15609175" /db_xref="GeneID:887998" /translation="MASVSFEQATRRYPGTDRPALDRLDLIVGDGEFVVLVGPSGCGK TTSLRMVAGLETLDCGRIRIGERDVTEVDPKDRDVAMVFQNYALYPHMTVAQNMGFAL KVAKIGKAEIRERVLAAAKLLDLQSYLDRKPKDLSGGQRQRVAMGRAIVRRPQVFLMD EPLSNLDAKLRGQTRNQIAALQRQLGTTTVYVTHDQVEAMTMGDRVAVLSDGVLQQCA SPRELYRNPGNVFVAGFIGSPAMNLFRLSIADSTVSLGDWQILLPRAVVGTAAEVIIG VRPEHLELGGAGIEMDVDMVEELGADAYLYGRIVSGGCEMDQSIVARVDGRGPPERGS RVRLCPTPGHLHFFAVDGRRIPG" misc_feature complement(2284347..2284391) /locus_tag="Rv2038c" /note="PS00211 ABC transporters family signature" misc_feature complement(2284662..2284685) /locus_tag="Rv2038c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(2284799..2285641) /locus_tag="Rv2039c" /db_xref="GeneID:887729" CDS complement(2284799..2285641) /locus_tag="Rv2039c" /function="Thought to be involved in active transport of sugar across the membrane (import). Responsible for the translocation of the substrate across the membrane." /note="Rv2039c, (MTV018.26c), len: 280 aa. Probable sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to MLCB2052.29|Z98604|MLCB2052_14 from Mycobacterium leprae (283 aa), FASTA scores: opt: 1593, E(): 0, (79.2% identity in 283 aa overlap). Also similar to maltose and lactose transport proteins e.g. X66092|CPMALGHOM_1 from C. perfringens (275 aa), FASTA scores: opt: 695, E(): 0, (41.2% identity in 228 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. Also contains possible helix-turn-helix motif at aa 171-192, although this is probably fortuitous. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein ABC transporter" /protein_id="NP_216555.1" /db_xref="GI:15609176" /db_xref="GeneID:887729" /translation="MGWADRIVHRHFIRGLALYAGLIGIAWCALFPIIWALSGSLKAD GEVTEPTLFPSHPQWSNYREVFALMPFWRMFFNTVLYAGCVTAGQVFFCSLAGYAFAR LQFRGRDTLFVLYLSTLMVPLTVTVIPQVILMRIVGWVDTPWAMIVPGLFGSAFGTYL MRQFFRTLPTDLEEAAILDGCSPWQIYWRILLPHSRPAVLVLGVLTWVNVWNDFLWPL LMIQRNSLATLTLGLVRLRGEYVARWPVLMAASMLMLVPLVILYAVAQRSFVRGIAVT GLGG" misc_feature complement(2285063..2285149) /locus_tag="Rv2039c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature." gene complement(2285628..2286530) /locus_tag="Rv2040c" /db_xref="GeneID:887893" CDS complement(2285628..2286530) /locus_tag="Rv2040c" /function="Thought to be involved in active transport of sugar across the membrane (import). Responsible for the translocation of the substrate across the membrane." /note="Rv2040c, (MTV018.27c), len: 300 aa. Probable sugar-transport integral membrane protein ABC transporter (see citation below), equivalent to MLCB2052.28|Z98604|MLCB2052_13 from Mycobacterium leprae (319 aa), FASTA scores: opt: 1606, E(): 0, (81.6% identity in 293 aa overlap). Also similar to many diverse sugar transport proteins. TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein ABC transporter" /protein_id="NP_216556.1" /db_xref="GI:15609177" /db_xref="GeneID:887893" /translation="MTRRRGRRAWAGRMFVAPNLAAVVVFMLFPLGFSLYMSFQKWDL FTHATFVRLDNFRNLFTSDPLFLIAVVNTAVYTVGTVVPTVIVSLVVAAFLNRKIKGI SLFRTVVFLPLAISSVVMAVVWQFVFNTDNGLLNIMLGWLGIGPIPWLIEPRWAMVSL CLVSVWRSVPFATVVLLAAMQGVPETVYEAARIDGAGEIRQFVSITVPLIRGALSFVV VISIIHAFQAFDLVYVLTGANGGPETATYVLGIMLFQHAFSFLEFGYASALAWVMFAI LLVLTVLQLRITHRRSWEASRGLG" misc_feature complement(2285907..2285993) /locus_tag="Rv2040c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp sign." gene complement(2286527..2287846) /locus_tag="Rv2041c" /db_xref="GeneID:887474" CDS complement(2286527..2287846) /locus_tag="Rv2041c" /function="Thought to be involved in active transport of sugar across the membrane (import)." /note="Rv2041c, (MTV018.28c), len: 439 aa. Probable sugar-binding lipoprotein component of sugar transport system, equivalent to Z98604|MLCB2052_1|MLCB2052.27 from Mycobacterium leprae (445 aa), FASTA scores: opt: 2324, E(): 0, (77.4% identity in 446 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="sugar-binding lipoprotein" /protein_id="NP_216557.1" /db_xref="GI:15609178" /db_xref="GeneID:887474" /translation="MVNKPFERRSLLRGAGALTAASLAPWAAGCAADDDDALTFFFAA NPDELRPRMRVVNEFQRRYPDIKVRALLSGPGVMQQLATFCAGGKCPDVLMAWELTYA ELADRGVLLDLNTLLARDQAFAAELKSDSIGALYETFTFNGGQYAFPEQWSGNFLFYN KQLFDDAGVPPPPGSWERPWSFAEFLDAAQALTKQGRSGRDRQWGFVNAWVSFYAAGL FAMNNGVPWSVPRMNPTHLNFDHDGFLEAVQFYADLTNKHKVAPSAAEQQSMSTADLF SVGKAGIALAGHWRYQTFDRADGLDFDVAPLPIGPRGRAACSDIGVTGLAIAATSRRK DQAWEFVKFATGPVGQALIGESRLFVPVLRSAINSHGFANAHRRVGNLAVLSEGPAYS EGLPVTPAWEKIAALMDRYFGPVLRGSRPATSLTGLSQAVDEVLRNP" misc_feature complement(2287757..2287789) /locus_tag="Rv2041c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(2287884..2288681) /locus_tag="Rv2042c" /db_xref="GeneID:887497" CDS complement(2287884..2288681) /locus_tag="Rv2042c" /function="UNKNOWN" /note="Rv2042c, (MTV018.29c), len: 265 aa. Conserved hypothetical protein,similar in N-terminal part to hypothetical proteins MLCB2052.24 (95 aa) and Rv0760c|MTCY369.05 (139 aa). FASTA scores: Z98604|MLCB2052_9 Mycobacterium leprae cosmid B2052 (95 aa) opt: 269, E(): 2.9e-12, (55.4% identity in 92 aa overlap) and Z80226|MTCY369_5 Mycobacterium tuberculosis cosmid (139 aa) opt: 150, E(): 0.001, (28.7% identity in 136 aa overlap). TBparse score is 0.909" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216558.1" /db_xref="GI:15609179" /db_xref="GeneID:887497" /translation="MAPPNRDELLAAVERSPQAAAAHDRAGWVGLFTGDARVEDPVGS QPQVGHEAIGRFYDTFIGPRDITFHRDLDIVSGTVVLRDLELEVAMDSAVTVFIPAFL RYDLRPVTGEWQIAALRAYWELPAMMLQFLRTGSGATRPALQLSRALLGNQGLGGTAG FLTGFRRAGRRHKKLVETFLNAASRADKSAAYHALSRTATMTLGEDELLDIVELFEQL RGASWTKVTGAGSTVAVSLASDHRRGIMFADVPWRGNRINRIRYFPA" gene complement(2288681..2289241) /gene="pncA" /locus_tag="Rv2043c" /db_xref="GeneID:888260" CDS complement(2288681..2289241) /gene="pncA" /locus_tag="Rv2043c" /EC_number="3.5.1.-" /function="Converts amides such as nicotinamide to corresponding acid." /experiment="experimental evidence, no additional details recorded" /note="Rv2043c, (MTV018.30c), len: 186 aa. pncA, pyrazinamidase/nicotinamidase (EC 3.5.1.-) (see citations below). Identical to PYRAZINAMIDASE/NICOTINAMIDASE involved in susceptibility or resistance to antituberculous drug pyrazinamide. FASTA scores: sptr|Q50575|Q50575 PYRAZINAMIDASE/NICOTINAMIDASE. (186 aa) opt: 1236, E(): 0; 100.0% identity in 186 aa overlap. TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="pyrazinamidase/nicotinamidas PNCA (PZase)" /protein_id="NP_216559.1" /db_xref="GI:15609180" /db_xref="GeneID:888260" /translation="MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHV VATKDFHIDPGDHFSGTPDYSSSWPPHCVSGTPGADFHPSLDTSAIEAVFYKGAYTGA YSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNGLATRVLVDLT AGVSADTTVAALEEMRTASVELVCSS" gene complement(2289282..2289599) /locus_tag="Rv2044c" /db_xref="GeneID:888243" CDS complement(2289282..2289599) /locus_tag="Rv2044c" /function="UNKNOWN" /note="Rv2044c, (MTV018.31c), len: 105 aa. Conserved hypothetical protein, similar to conserved hypothetical protein PA3386 (121 aa) from Pseudomonas aeruginosa |E83221 conserved hypothetical protein PA3386 [imported] - Pseudomonas aeruginosa (strain PAO1) 9949522|gb|AAG06774.1|AE004760_2 (AE004760). (46% identity in 92 aa overlap). TBparse score is 0.914" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216560.1" /db_xref="GI:15609181" /db_xref="GeneID:888243" /translation="MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCP LTWVERWARTKAAMTPLSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWL PRR" gene complement(2289685..2291220) /gene="lipT" /locus_tag="Rv2045c" /db_xref="GeneID:888358" CDS complement(2289685..2291220) /gene="lipT" /locus_tag="Rv2045c" /EC_number="3.1.1.-" /function="Converts unknown esters to corresponding free acid and alcohol" /experiment="experimental evidence, no additional details recorded" /note="Rv2045c, (MTV018.32c), len: 511 aa. Probable lipT, carboxylesterase similar to many e.g. O08472 (489 aa) and P37967|PNBA_ BACSU (489 aa). PARA-NITROBENZYL ESTERASE (EC 3.1.1.-). Contains PS00941 Carboxylesterases type-B signature 2. Contains PS00122 Carboxylesterases type-B serine active site. FASTA scores: sptr|O08472|O08472 INTRACELLULAR ESTERASE B (489 aa) opt: 849, E(): 0, (36.2% identity in 489 aa overlap) and sp|P37967|PNBA_BACSU PARA-NITROBENZYL ESTERASE (489 aa) opt: 838, E(): 0, (36.0% identity in 489 aa overlap). TBparse score is 0 .918" /codon_start=1 /transl_table=11 /product="carboxylesterase LipT" /protein_id="NP_216561.1" /db_xref="GI:15609182" /db_xref="GeneID:888358" /translation="MALESATVGSMHERTVRARTATGIVEGFTRDGVHRWRSIPYARA PVGSLRFRAPQPAQPWPGVRHCHTFANCAPQQRRYTVMGIGRYQTRSEDCLTLNVVTP EEPATQPLPVMVFIHGGGYILGSSATPIYDGAALARRGCVYVSVNYRLGALGCLDLSS LSTPQITLDSNVYLRDLVLALRWVHDNIAEFGGDPGNVTIFGESAGAHITATLLAVPA AKGLFARAISESPAAGMVRSREVAAEFAARFANLIGARTQDAANALMQASPAQLVEAQ HHLIRQGMRKRLGAFPIGPVFGDDYLPMDPVEAMRSGRVHAVPLIVGTNAEEGRLFTR FLGMLPTNEPMVEELLSGMKPADRERITAAYPNYPAPSACIQLGGDFAFSSAAWQIAE AHGANAPTYLYRYDYAPRTLRWSGFGATHATELFAVFDIYRTRFGALLTAAADRRAAL RVSNEVQRRWRCFSQIGVPGDDWPAYTQDDRAVLVFDRRCRIEFDPHQHRRIAWDGFS LAN" misc_feature complement(2290603..2290650) /gene="lipT" /locus_tag="Rv2045c" /note="PS00122 Carboxylesterases type-B serine active site" misc_feature complement(2290915..2290947) /gene="lipT" /locus_tag="Rv2045c" /note="PS00941 Carboxylesterases type-B signature 2" gene 2291269..2291925 /gene="lppI" /locus_tag="Rv2046" /db_xref="GeneID:888308" CDS 2291269..2291925 /gene="lppI" /locus_tag="Rv2046" /function="UNKNOWN" /note="Rv2046, (MTV018.33), len: 218 aa. Probable lppI, lipoprotein contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.898" /codon_start=1 /transl_table=11 /product="lipoprotein lppI" /protein_id="NP_216562.1" /db_xref="GI:15609183" /db_xref="GeneID:888308" /translation="MRIAALVAVSLLIAGCSREVGGDVGQSQTIAPPAPAPSAAPSTP PAAGAPITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDARH TSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGPFVYGNGPELA NGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPFGCLKPAPPPDGVGVAFGC" misc_feature 2291284..2291316 /gene="lppI" /locus_tag="Rv2046" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(2291962..2294526) /locus_tag="Rv2047c" /db_xref="GeneID:888529" CDS complement(2291962..2294526) /locus_tag="Rv2047c" /function="UNKNOWN" /note="Rv2047c, (MTV018.34c), len: 854 aa. Conserved hypothetical protein, similar to hypothetical protein from Mycobacterium tuberculosis Rv1868|MTCY359.05c (699 aa) and three possible pseudogene fragments from Mycobacterium leprae MLCB2052.16 (251 aa), MLCB2052.17 (120 aa), MLCB2052.18 (257 aa). FASTA scores: gp|Z98604|MLCB2052_7 (257 aa) opt: 1248, E(): 0, (78.6% identity in 248 aa overlap); and Z98604|MLCB2052_5 (251 aa) opt: 674, E(): 0, (50.0% identity in 250 aa overlap); and Z98604|MLCB2052_6 (120 aa) opt: 608 E() : 3.6e-30, (84.0% identity in 106 aa overlap); and Rv1868 Z83859|MTCY359_5 (699 aa) opt: 521 E(): 3e-24; (33.0% identity in 730 aa overlap). TBparse score is 0.917" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216563.1" /db_xref="GI:15609184" /db_xref="GeneID:888529" /translation="MRIAVTGASGVLGRGLTARLLSQGHEVVGIARHRPDSWPSSADF IAADIRDATAVESAMTGADVVAHCAWVRGRNDHINIDGTANVLKAMAETGTGRIVFTS SGHQPRVEQMLADCGLEWVAVRCALIFGRNVDNWVQRLFALPVLPAGYADRVVQVVHS DDAQRLLVRALLDTVIDSGPVNLAAPGELTFRRIAAALGRPMVPIGSPVLRRVTSFAE LELLHSAPLMDVTLLRDRWGFQPAWNAEECLEDFTLAVRGRIGLGKRTFSLPWRLANI QDLPAVDSPADDGVAPRLAGPEGANGEFDTPIDPRFPTYLATNLSEALPGPFSPSSAS VTVRGLRAGGVGIAERLRPSGVIQREIAMRTVAVFAHRLYGAITSAHFMAATVPFAKP ATIVSNSGFFGPSMASLPIFGAQRPPSESSRARRWLRTLRNIGVFGVNLVGLSAGSPR DTDAYVADVDRLERLAFDNLATHDDRRLLSLILLARDHVVHGWVLASGSFMLCAAFNV LLRGLCGRDTAPAAGPELVSARSVEAVQRLVAAARRDPVVIRLLAEPGERLDKLAVEA PEFHSAVLAELTLIGHRGPAEVEMAATSYADNPELLVRMVAKTLRAVPAPQPPTPVIP LRAKPVALLAARQLRDREVRRDRMVRAIWVLRALLREYGRRLTEAGVFDTPDDVFYLL VDEIDALPADVSGLVARRRAEQRRLAGIVPPTVFSGSWEPSPSSAAALAAGDTLRGVG VCGGRVRGRVRIVRPETIDDLQPGEILVAEVTDVGYTAAFCYAAAVVTELGGPMSHAA VVAREFGFPCVVDAQGATRFLPPGALVEVDGATGEIHVVELASEDGPALPGSDLSR" gene complement(2294531..2306986) /gene="pks12" /locus_tag="Rv2048c" /db_xref="GeneID:888350" CDS complement(2294531..2306986) /gene="pks12" /locus_tag="Rv2048c" /function="Involved in polyketide synthesis (product unknown)." /note="Rv2048c, (MTV018.35c), len: 4151 aa. Probable pks12, polyketide synthase similar to many polyketide synthases e.g. the second and third modules of polyketide synthase from S. erythraea (3567 aa), many other Streptomyces enzymes and putative Mycobacterium tuberculosis polyketide synthases, e.g. Z85982|MTCY06H11.26 (2126 aa), FASTA scores: opt: 6668, E(): 0 (61.2% identity in 2058 aa overlap); and Q03132|ERY2_SACER ERYTHRONOLIDE SYNTHASE, MODULES 3 from S. erythraea (3567 aa), FASTA scores: opt: 5309, E(): 0, (40.5% identity in 4141 aa overlap). Contains 2x PS00012 Phosphopantetheine attachment site, 2x PS00606 Beta-ketoacyl synthases active site, and PS00343 Gram-positive cocci surface proteins 'anchor ing' hexapeptide. TB parse score is 0.902." /codon_start=1 /transl_table=11 /product="polyketide synthase pks12" /protein_id="NP_216564.1" /db_xref="GI:15609185" /db_xref="GeneID:888350" /translation="MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSC RFPGGVDSPEGLWQMVADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFVDG VADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSATGVFAGLIVG GYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLR SGECDLALAGGVTVNATPTVFVEFSRHRGLAPDGRCKPYAGRADGVGWSEGGGMLVLQ RLSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGLSAAEVDV VEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKM VLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTN AHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDDGLDVADVG WSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQG SQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDWSLVDVLRGAPGAPGLDRVDVVQPV LFAVMVSLAELWKSVAVHPDAVIGHSQGEIAAAYVAGALSLRDAARVVTLRSKLLAGL AGPGGMVSIACGADQARDLLAPFGDRVSIAVVNGPSAVVVSGEVGALEELIAVCSTKE LRTRRIEVDYASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADY WYRNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSEAIVVPT LGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAFDKRRFWLSAEGSGA DVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPNVQPWLADHAVSDVVLFPGTGFV ELAIRAGDEVGCSVLDELTLAAPLLLPATGSVAVQVVVDAGRDSNSRGVSIFSRADAQ AGWLLHAEGILRPGSVEPGADLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTA MWARGEEIFAEVRLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFAWQGVSL HATGASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSGSGPD RLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQALAAVQSWLTDHES GVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHPGRIVLVDSDAATDDAAIAMA LATGEPQVVLRGGQVYTARVRGSRAADAILVPPGDGPWRLGLGSAGTFENLRLEPVPN ADAPLGPGQVRVAMRAIAANFRDIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVG DSVFGFFPDGSGTLVAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQ RVLIHAGTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFED KFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVR YRAFDLFEPGRPRMHQYMLELATLFGDGVLRPLPVTTFDVRRAPAALRYLSQARHTGK VVMLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVA ELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRV DVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHR RAHGLPAISLGWGLWDQASAMTGGLDAADLARLGREGVLALSTAEALELFDTAMIVDE PFLAPARIDLTALRAHAVAVPPMFSDLASAPTRRQVDDSVAAAKSKSALAHRLHGLPE AEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLTAVEMRNRLKSATGLSL SPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPAVRTTSEDPIAIVGMACRYPGGVNS PDDMWDMLIQGRDVLSEFPADRGWDLAGLYNPDPDAAGACYTRTGGFVDGVGDFDPAF FGVGPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSATGVFAGVMTQGYGMFAAE PVEGFRLTGQLSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLA LAGGVTVNATPDIFVEFSRWRGLSPDGRCKAFAAAADGTGFSEGGGMLVLQRLSDARR LGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTG TTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHE LLPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTNAHVIIEA VPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRS VFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMG MGLHAGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNSTEFAQPALFAVEVAL FRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRLMQALPAGGAMVA VQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAAVADQLRADGRRVHQLAVSHA FHSPLMDPMIDEFAAVAAGIAIGRPTIGVISNVTGQLAGDDFGSAAYWRRHIRQAVRF ADSVRFAQAAGGSRFLEVGPSGGLVASIEESLPDVAVTTMSALRKDRPEPATLTNAVA QGFVTGMDLDWRAVVGEAQFVELPTYAFQRRRFWLSGDGVAADAAGLGLAASEHALLG AVIDLPASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGVVD ESTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLHAEGALRAGSA EPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGLTAMWRRGDEVFAEVALPA DAGVSVTGFGVHPVLLDAALHAVVLSAESAERGQGSVLVPFSWQGVSLHAAGASAVRA RIAPVGPSAVSIELADGLGLPVLSVASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQ PSAAVEPLPVCAWGTTEDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAG VLVVMTRGAVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVV TTGEPQVLWRRGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENLRLELIPDA DAPLGPGQVRVAVSAIAANFRDVMIALGLYPDPDAVMGVEACGVVIETSLNKGSFAVG DRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAAATTSVVFATAHYALVDLAAARSGQ RVLIHAGTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFED KFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVR YRAFDLFEPGPDRIAQILAELATLFGDGVLRPLPVTTFDVRCAPAALRYLSQARHTGK VVMLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVA ELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRV DVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHR RAHGLPAISLGWGLWDQASAMTGGLATVDFKRFARDGIVAMSSADALQLFDTAMIVDE PFMLPAHIDFAALKVKFDGGTLPPMFVDLINAPTRRQVDDSLAAAKSKSALLQRLEGL PEDEQHAVLLDLVRSHIATVLGSASPEAIDPDRAFQELGFDSLTAVEMRNRLKSATGL ALSPTLIFDYPNSAALAGYMRRELLGSSPQDTSAVAAGEAELQRIVASIPVKRLRQAG VLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDDE" misc_feature complement(2294867..2294914) /gene="pks12" /locus_tag="Rv2048c" /note="PS00012 Phosphopantetheine attachment site" misc_feature complement(2300288..2300338) /gene="pks12" /locus_tag="Rv2048c" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature complement(2300963..2301010) /gene="pks12" /locus_tag="Rv2048c" /note="PS00012 Phosphopantetheine attachment site" misc_feature complement(2303978..2303995) /gene="pks12" /locus_tag="Rv2048c" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" misc_feature complement(2306357..2306409) /gene="pks12" /locus_tag="Rv2048c" /note="PS00606 Beta-ketoacyl synthases active site" gene complement(2307293..2307517) /locus_tag="Rv2049c" /db_xref="GeneID:888593" CDS complement(2307293..2307517) /locus_tag="Rv2049c" /function="UNKNOWN" /note="Rv2049c, (MTV018.36c), len: 74 aa. Hypothetical protein. TBparse score is 0.867" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216565.1" /db_xref="GI:15609186" /db_xref="GeneID:888593" /translation="MLTRGEVRALPADAVVLSADDAADLSDRVYQVRCAAEDVVTALD EGAAATELRDLCDELIRAARAADGWRRAGA" gene 2307821..2308156 /locus_tag="Rv2050" /db_xref="GeneID:888598" CDS 2307821..2308156 /locus_tag="Rv2050" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2050, (MTV018.37), len: 111 aa. Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium leprae, MLCB2052.03c (113 aa), and Streptomyces coelicolor A3(2), SC6D7.18c (124 aa). FASTA scores: Z98604|MLCB2052_3 Mycobacterium leprae cosmid B2052 (113 aa) opt: 737, E(): 0, (97.3% identity in 111 aa overlap) and (55% identity in 85 aa overlap) with emb|CAB61670.1|AL133213 hypothetical protein SC6D7.18c. TBparse score is 0.884" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216566.1" /db_xref="GI:15609187" /db_xref="GeneID:888598" /translation="MADRVLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPF ADDAEIPGTWLCRNGMEGTLIEGDLPEPKKVKPPRTHWDMLLERRSIEELEELLKERL ELIRSRRRG" gene complement(2308131..2310755) /gene="ppm1" /locus_tag="Rv2051c" /db_xref="GeneID:887402" CDS complement(2308131..2310755) /gene="ppm1" /locus_tag="Rv2051c" /function="Transfers mannose from GDP-Mannose to all endogenous polyprenol-phosphates." /experiment="experimental evidence, no additional details recorded" /note="Rv2051c, (MTV018.38c), len: 874 aa. ppm1, Polyprenol-monophosphomannose synthase. Transfers mannose from GDP-Mannose to all endogenous polyprenol-phosphates in Mycobacterium tuberculosis, proven experimentally (A. Baulard, Institut Pasteur de Lille: see citation below). Very similar to polyprenol-phosphate-mannose synthases from Mycobacterium smegmatis (594 aa). Two-domain protein similar to products of two adjacent ORFs in Mycobacterium leprae MLCB2052.01 (644 aa), probable membrane protein and MLCB2052.02 (277 aa). First domain (aa 1 - 590) corresponds to membrane protein with similarity to P23930|LNT_ECOLI apolipoprotein n-acyltransferase (512 aa) while second domain (aa 591 - 874) is similar to Schizosaccharomyces pombe dolichol monophosphate mannose synthase (236 aa) and to Mycobacterium tuberculosis Rv0539. FASTA scores: Z 98604|MLCB2052_1 (644 aa) opt: 2725 E(): 0 ; 67.7% identity in 601 aa overlap; and Z98604|MLCB2052_2 (277 aa) opt: 1449 E(): 0; 78.9% identity in 275 aa overlap; and gp|AF0078|AF007873_1 Schizosaccharomyces pombe dolichocholmonophosphate mannose synthase (236 aa) opt: 456 E(): 7.8e-19; 34.5% identity in 223 aa overlap and sp|P23930|LNT_ECOLI APOLIPOPROTEIN N-ACYLTRANSFERASE (512 aa) opt: 330 E(): 1.9e-11; 26.9% identity in 539 aa overlap; and polyprenol-phosphate-mannose synthases from Mycobacterium smegmatis (594 aa). CAC15462.1|AJ294477 putative polyprenol-phosphate-mannose synthase 2 (Ppm2): (55% identity in 533 aa overlap)." /codon_start=1 /transl_table=11 /product="polyprenol-monophosphomannose synthase Ppm1" /protein_id="NP_216567.1" /db_xref="GI:15609188" /db_xref="GeneID:887402" /translation="MKLGAWVAAQLPTTRTAVRTRLTRLVVSIVAGLLLYASFPPRNC WWAAVVALALLAWVLTHRATTPVGGLGYGLLFGLVFYVSLLPWIGELVGPGPWLALAT TCALFPGIFGLFAVVVRLLPGWPIWFAVGWAAQEWLKSILPFGGFPWGSVAFGQAEGP LLPLVQLGGVALLSTGVALVGCGLTAIALEIEKWWRTGGQGDAPPAVVLPAACICLVL FAAIVVWPQVRHAGSGSGGEPTVTVAVVQGNVPRLGLDFNAQRRAVLDNHVEETLRLA ADVHAGLAQQPQFVIWPENSSDIDPFVNPDAGQRISAAAEAIGAPILIGTLMDVPGRP RENPEWTNTAIVWNPGTGPADRHDKAIVQPFGEYLPMPWLFRHLSGYADRAGHFVPGN GTGVVRIAGVPVGVATCWEVIFDRAPRKSILGGAQLLTVPSNNATFNKTMSEQQLAFA KVRAVEHDRYVVVAGTTGISAVIAPDGGELIRTDFFQPAYLDSQVRLKTRLTPATRWG PILQWILVGAAAAVVLVAMRQNGWFPRPRRSEPKGENDDSDAPPGRSEASGPPALSES DDELIQPEQGGRHSSGFGRHRATSRSYMTTGQPAPPAPGNRPSQRVLVIIPTFNEREN LPVIHRRLTQACPAVHVLVVDDSSPDGTGQLADELAQADPGRTHVMHRTAKNGLGAAY LAGFAWGLSREYSVLVEMDADGSHAPEQLQRLLDAVDAGADLAIGSRYVAGGTVRNWP WRRLVLSKTANTYSRLALGIGIHDITAGYRAYRREALEAIDLDGVDSKGYCFQIDLTW RTVSNGFVVTEVPITFTERELGVSKMSGSNIREALVKVARWGIEGRLSRSDHARARPD IARPGAGGSRVSRADVTE" gene complement(2310913..2312517) /locus_tag="Rv2052c" /db_xref="GeneID:888608" CDS complement(2310913..2312517) /locus_tag="Rv2052c" /function="UNKNOWN" /note="Rv2052c, (MTV018.39c), len: 534 aa. Conserved hypothetical protein, very similar to hypothetical protein SC6D7.15 (536 aa) from Streptomyces coelicolor A3(2). Smith-Waterman scores >emb|CAB61667.1| (AL133213) hypothetical protein SC6D7.15 [Streptomyces coelicolor A3(2)] Expect = e-113 Identities = 247/533 (46%)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216568.1" /db_xref="GI:15609189" /db_xref="GeneID:888608" /translation="MSQIPVKLLVNGRVYSPTHPEATAMAVRGDVVAWLGSDDVGRDQ FPDADVQDLDGRFVAPGFVDSHIHLTATGLMLSGLDLRPATSRAQCLRMVADYAADHP GQPLWGHGWDESAWPENAAPSTADLDAVLGDCPAYLARIDSHSALVSSGLRRLVPELA AATGYTAQRPLTGDAHHLARAAARYLLTDVQLADARAVALQAIAAAGVVAVHECAGPE IGGLDDWLRLRALEHGVEVIGYWGEAVATPAQARDLVTETGARGLAGDLFVDGALGSR TAWLHEPYADAPDCIGTCHLDVDGIEAHVRACTKAEVTAGFHVIGDAAVSAAVAAFER VVADLGVVAVARCGHRLEHVEMVTADQAAKLGAWGVIASVQPNFDELWGGGDGMYARR LGAQRGSELNPLALLASQGVPLALGSDAPVTGFDPWASVRAAVNHRTPGSGVSARAAF AAATRGGWRAGGVRDGRIGTLVPGAPASYAIWDAGDFDVDAPRDAVQRWSTDPRSRVP ALPRLGPTDALPRCRQTVHRGAVIYG" gene complement(2312522..2313049) /gene="fxsA" /locus_tag="Rv2053c" /db_xref="GeneID:888768" CDS complement(2312522..2313049) /gene="fxsA" /locus_tag="Rv2053c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="F exclusion of bacteriophage T7; overproduction of this protein in Escherichia coli inhibits the F plasmid-mediated exclusion of bacteriophage T7; interacts with the F plasmid-encoded PifA protein; inner membrane protein" /codon_start=1 /transl_table=11 /product="FxsA" /protein_id="NP_216569.1" /db_xref="GI:15609190" /db_xref="GeneID:888768" /translation="MSRLLLSYAVVELAVVFALAATIGFGWTLLVLLATFVLGFGLLA PLGGWQLGRRLLWLRSGLAEPRSALSDGALVTVASVLVLVPGLVTTTMGLLLLVPPIR ALARPGLTAIAVRGFLRNVPLTADAAANMAGAFGESGTDPDFIDGEVIDVIDVEPLTL QPPRVAAEPPSPGSN" gene 2313125..2313838 /locus_tag="Rv2054" /db_xref="GeneID:888722" CDS 2313125..2313838 /locus_tag="Rv2054" /function="UNKNOWN" /note="Rv2054, (MTCY63A.06c), len: 237 aa. Conserved hypothetical protein, some similarity to various carboxymethylenebutenolidases e.g. sp|O67988|CLCD_RHOOP CARBOXYMETHYLENEBUTENOLIDASE (DIENELACTONE HYDROLASE) (DLH) >gi|2935034|gb|AAC38252.1| (AF003948) dienelactone hydrolase [Rhodococcus opacus] Smith-Waterman scores: Length = 252, Expect = 4e-08 Identities = 62/217 (28%). Also similar to Rv2765. TBparse score is 0.921" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216570.1" /db_xref="GI:15609191" /db_xref="GeneID:888722" /translation="MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKL ISERIARAGYVVLTPNMYARGGRARCITRVFRELLTKRGRALDDILAARDHLLAMPEC SGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACPIVASFGTRDP LGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQPLVRIAGFGYNEAATEDAW RRVFEFFGQHLRAGSPGEP" gene complement(2314087..2314353) /gene="rpsR" /locus_tag="Rv2055c" /db_xref="GeneID:887817" CDS complement(2314087..2314353) /gene="rpsR" /locus_tag="Rv2055c" /function="involved in translation, amino-acyl tRNA binding" /note="binds as a heterodimer with protein S6 to the central domain of the 16S rRNA; helps stabilize the platform of the 30S subunit" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S18" /protein_id="NP_216571.1" /db_xref="GI:15609192" /db_xref="GeneID:887817" /translation="MAAKSARKGPTKAKKNLLDSLGVESVDYKDTATLRVFISDRGKI RSRGVTGLTVQQQRQVAQAIKNAREMALLPYPGQDRQRRAALCP" gene complement(2314354..2314659) /gene="rpsN" /locus_tag="Rv2056c" /db_xref="GeneID:887819" CDS complement(2314354..2314659) /gene="rpsN" /locus_tag="Rv2056c" /function="involved in translation" /note="located in the peptidyl transferase center and involved in assembly of 30S ribosome subunit; similar to what is observed with proteins L31 and L33, some proteins in this family contain CXXC motifs that are involved in zinc binding; if two copies are present in a genome, then the duplicated copy appears to have lost the zinc-binding motif and is instead regulated by zinc; the proteins in this group do not appear to have the zinc-binding motif" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S14" /protein_id="NP_216572.1" /db_xref="GI:15609193" /db_xref="GeneID:887819" /translation="MAKKSKIVKNQRRAATVARYASRRTALKDIIRSPSSAPEQRSTA QRALARQPRDASPVRLRNRDAIDGRPRGHLRKFGLSRVRVRQLAHDGHLPGVRKASW" gene complement(2314661..2314825) /gene="rpmG" /locus_tag="Rv2057c" /db_xref="GeneID:887807" CDS complement(2314661..2314825) /gene="rpmG" /locus_tag="Rv2057c" /function="involved in translation" /note="in Escherichia coli BM108, a mutation that results in lack of L33 synthesis had no effect on ribosome synthesis or function; there are paralogous genes in several bacterial genomes, and a CXXC motif for zinc binding and an upstream regulation region of the paralog lacking this motif that are regulated by zinc similar to other ribosomal proteins like L31; the proteins in this group lack the CXXC motif" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L33" /protein_id="YP_177856.1" /db_xref="GI:57116938" /db_xref="GeneID:887807" /translation="MARTDIRPIVKLRSTAGTGYTYTTRKNRRNDPDRLILRKYDPIL RRHVDFREER" gene complement(2314825..2315061) /gene="rpmB" /locus_tag="Rv2058c" /db_xref="GeneID:887801" CDS complement(2314825..2315061) /gene="rpmB" /locus_tag="Rv2058c" /function="involved in ribosome activity" /note="required for 70S ribosome assembly" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L28" /protein_id="NP_216574.1" /db_xref="GI:15609195" /db_xref="GeneID:887801" /translation="MSAHCQVTGRKPGFGNTVSHSHRRSRRRWSPNIQQRTYYLPSEG RRIRLRVSTKGIKVIDRDGIEAVVARLRRQGQRI" gene 2315174..2316709 /locus_tag="Rv2059" /db_xref="GeneID:888402" CDS 2315174..2316709 /locus_tag="Rv2059" /function="UNKNOWN" /note="Rv2059, (MTCY63A.01c), len: 511 aa. Conserved hypothetical protein. Some similarity to EWLA protein gp|U52850|ERU52850_1 Erysipelothrix rhusiopathiae 36 k (304 aa), FASTA score, opt: 287 E(): 6.9e-09; 27.2% identity in 228 aa overlap. There appears to be a frameshift in this ORF around position 3315980 that causes an overlap with next ORF. C-terminal end of protein may be wrong. No error can be found to account for this." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216575.1" /db_xref="GI:15609196" /db_xref="GeneID:888402" /translation="MATPVILVTGHEGTAAVTADLLGLLTDHGTATLRSVAPGSVRRA DPRPRCHRREQRRRHRASMKSAIHPDHHPRRLPRCPVLRRDQVVLEMIVITMVGRPSG PGERKWDVWGSVARAVTGGHVPVKSILTGAHADPHSYQASPADAAAIVDAELVIYNGG GYDPWVDQVLAGHPGVQAVDAYSLLGAVGDDDAPNEHVFYDPNVAKAVAATIADRLAD LDPSNSGNYRANAAEFSRGADAIAISEHAIATTYPDAAVIATEPVVHYLLAAAGLKNR TPATFIAANENGNDPTPADMAAVLDMIAGREVAALLVNPQTPTAATDELQVAARRAGV PITELTETLPSGTDRDQFCAADRPDRRGRSLRADHADRGLSARGHRVGDLLPTALVCH RRSGGRGRPRRASARPGNCVRRTDGRGSRPGCPDRRGTPRDVFADHPRRGGRPGRGCP GRRDRDLGGLRRGFRRRRHPAVAGAWSPGVGVRGHHLVCDLPDLLVAPAAPLTSRSRF RPL" gene 2316279..2316680 /locus_tag="Rv2060" /db_xref="GeneID:888388" CDS 2316279..2316680 /locus_tag="Rv2060" /function="UNKNOWN" /note="Rv2060, (MTV019.01), len: 133 aa. Possible conserved integral membrane protein smaller than but similar to several hypothetical bacterial proteins e.g. >emb|CAC29843.1| (AL583918) putative ABC-transporter transmembrane protein [Mycobacterium leprae] Length = 286 and P44691|YEBI_HAEIN (261 aa). FASTA scores: P44691|YEBI_HAEIN HYPOTHETICAL PROTEIN HI0407 (261 aa) opt: 218, E(): 4.2e-08; 31.1% identity in 122 aa overlap. Maybe frameshift upstream at position 3315980 but no error can be found to account for this. TBparse score is 0.871" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216576.1" /db_xref="GI:15609197" /db_xref="GeneID:888388" /translation="MLTVVCLLVVTVLAICYRPLLFATVDPEVAAARGVPVRALGIVF AALMGVVAAQAVQIVGALLVMSLLITPAAAAARVVVAPVAAIATSVVFAEVSAVGGIL LSLAPGVPVSVFVATISFVIYLICWLLRRRR" gene complement(2316681..2317085) /locus_tag="Rv2061c" /db_xref="GeneID:888246" CDS complement(2316681..2317085) /locus_tag="Rv2061c" /function="UNKNOWN" /note="Rv2061c, (MTV019.02c), len: 134 aa. Conserved hypothetical protein. Similar to conserved hypothetical proteins from Mycobacterium leprae (128 aa) and Streptomyces coelicolor (153 aa). Smith-Waterman scores: >emb|CAC30396.1| (AL583922) [Mycobacterium leprae], Expect = 7e-47, Identities = 92/131 (70%); >emb|CAC14932.1| (AL449216) [Streptomyces coelicolor], Expect = 6e-19 Identities = 48/124 (38%). TBparse score is 0.862" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216577.1" /db_xref="GI:15609198" /db_xref="GeneID:888246" /translation="MTPTFSDLAEAQYLLLTTFTKDGRPKPVPIWAALDTDRGDRLLV ITEKKSWKVKRIRNTPRVTLATCTLRGRPTSEAVEATAAILDESQTGAVYDAIVKRYG IQGKLFTFVSKLRGGMRNNIGLELKVAESETG" gene complement(2317169..2320753) /gene="cobN" /locus_tag="Rv2062c" /db_xref="GeneID:888252" CDS complement(2317169..2320753) /gene="cobN" /locus_tag="Rv2062c" /EC_number="6.6.1.2" /function="REQUIRED FOR COBALT INSERTION." /note="with CobST catalyzes the formation of cobyrinic acid a,c-diamide from hydrogenobyrinic acid a,c-diamide in an ATP-dependent manner; involved in porphyrin and chlorophyll metabolism; vitamin B12 metabolism" /codon_start=1 /transl_table=11 /product="cobaltochelatase subunit CobN" /protein_id="NP_216578.1" /db_xref="GI:15609199" /db_xref="GeneID:888252" /translation="MPEPTVLLLSTSDTDLISARSSGKNYRWANPSRLSDLELTDLLA EASIVVIRILGGYRAWQSGIDTVIAGGVPAVLVSGEQAADAELTDRSTVAAGTALQAH IYLAHGGVDNLRELHAFLCDTVLMTGFGFTPPVATPTWGVLERPDAGKTGPTIAVLYY RAQHLAGNTGYVEALCRAIEDAGGRPLPLYCASLRTAEPRLLERLGGADAMVVTVLAA GGVKPAAASAGGDDDSWNVEHLAALDIPILQGLCLTSPRDQWCANDDGLSPLDVASQV AVPEFDGRIITVPFSFKEIDDDGLISYVADPERCARVAGLAVRHARLRQVAPADKRVA LVFSAYPTKHARIGNAVGLDTPASAVALLQAMRQRGYRVGDLPGVESNDGDALIHALI ECGGHDPDWLTEGQLAGNPIRVSAKEYRDWFATLPAELTDVVTAYWGPPPGELFVDRS HDPDGEIVIAALRAGNLVLMVQPPRGFGENPVAIYHDPDLPPSHHYLAAYRWLDTGFS NGFGAHAVVHLGKHGNLEWLPGKTLGMSASCGPDAALGDLPLIYPFLVNDPGEGTQAK RRAHAVLVDHLIPPMARAETYGDIARLEQLLDEHASVAALDPGKLPAIRQQIWTLIRA AKMDHDLGLTERPEEDSFDDMLLHVDGWLCEIKDVQIRDGLHILGQNPTGEQELDLVL AILRARQLFGGAHAIPGLRQALGLAEDGTDERATVDQTEAKARELVAALQATGWDPSA ADRLTGNADAAAVLRFAATEVIPRLAGTATEIEQVLRALDGRFIPAGPSGSPLRGLVN VLPTGRNFYSVDPKAVPSRLAWEAGVALADSLLARYRDEHGRWPRSVGLSVWGTSAMR TAGDDIAEVLALLGVRPVWDDASRRVIDLAPMQPAELGRPRIDVTVRISGFFRDAFPH VVTMLDDAVRLVADLDEAAEDNYVRAHAQADLAHHGDQRRATTRIFGSKPGTYGAGLL QLIDSRSWRDDADLAQVYTAWGGFAYGRDLDGREAIDDMNRQYRRIAVAAKNTDTREH DIADSDDYFQYHGGMVATVRALTGQAPAAYIGDNTRPDAIRTRTLSEETTRVFRARVV NPRWMAAMRRHGYKGAFEMAATVDYLFGYDATAGVMADWMYEQLTQRYVLDAQNRTFM TESNPWALHGMAERLLEAAGRGLWAQPAPETLDGLRQVLLETEGDLEA" gene 2320831..2321064 /locus_tag="Rv2063" /db_xref="GeneID:3205116" CDS 2320831..2321064 /locus_tag="Rv2063" /function="UNKNOWN" /note="Rv2063, len: 77 aa. Conserved hypothetical protein, showing some similarity to other conserved hypothetical proteins e.g. AL109974_2|SCF34.02c hypothetical protein from Streptomyces coelicolor (133 aa), FASTA scores: opt: 102, E(): 1.7, (34.35% identity in 67 aa overlap); and AE005182_1 from Escherichia coli strain O157:H7 (77 aa), FASTA scores: opt: 95, E(): 3.3, (34.85% identity in 66 aa overlap). This ORF replaces previous Rv2063c on other strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177657.1" /db_xref="GI:57116939" /db_xref="GeneID:3205116" /translation="MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAI FRAEREASHAETTTQAVRDEDREWEGTVGDGLG" gene 2321451..2322542 /gene="cobG" /locus_tag="Rv2064" /db_xref="GeneID:888140" CDS 2321451..2322542 /gene="cobG" /locus_tag="Rv2064" /function="REQUIRED FOR COBALAMIN BIOSYNTHESIS." /note="Rv2064, (MTCY49.03), len: 363 aa. Possible cobG, cobalamin biosynthesis protein. Some similarity to COBG_PSEDE P21637 cobg protein. pseudomonas (459 aa) FASTA scores, opt: 240, E(): 1.3e-08, (27.5% identity in 407 aa overlap); contains PS01156 TonB-dependent receptor proteins signature 2" /codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein CobG" /protein_id="NP_216580.1" /db_xref="GI:15609201" /db_xref="GeneID:888140" /translation="MAGTRDADACPGALRPHQAADGALARIRLPGGMITAAQLATLAS VASDFGSATLELTARGNVQLRGIRDVAAVADAVAKAGLLPSATHERVRNIVASPLSGR AGGLADVRAWVGELDAAIRAEPRLAELGGRFWFGLDDGRADVSGLGADVGVQVFPDGP RLLLTGRDTGVRVADVAETLIEVALRFVKIRETAWRVTELADIGELQSGVELGPSVRP VTKTPVGWIPQDDSRVTLGAAVPLGVLPARVAECLAAIEAPLVITPWRSVLICDLDDA TADAALRVLAPLGLVFDENSPWLNISACTGSPGCAHSAADVRADAARSLNVESAGHRH FVGCERACGSPPAGEVLVATGGGYRRLRP" misc_feature 2321802..2321855 /gene="cobG" /locus_tag="Rv2064" /note="PS01156 TonB-dependent receptor proteins signature 2" gene 2322552..2323178 /gene="cobH" /locus_tag="Rv2065" /db_xref="GeneID:888464" CDS 2322552..2323178 /gene="cobH" /locus_tag="Rv2065" /EC_number="5.4.1.2" /function="REQUIRED FOR COBALAMIN BIOSYNTHESIS." /note="catalyzes the interconversion of precorrin-8X and hydrogenobyrinate" /codon_start=1 /transl_table=11 /product="precorrin-8X methylmutase" /protein_id="NP_216581.1" /db_xref="GI:15609202" /db_xref="GeneID:888464" /translation="MLDYLRDAAEIYRRSFAVIRAEADLARFPADVARVVVRLIHTCG QVDVAEHVAYTDDVVARAGAALAAGAPVLCDSSMVAAGITTSRLPADNQIVSLVADPR ATELAARRQTTRSAAGVELCAERLPGAVLAIGNAPTALFRLLELVDEGAPPPAAVLGG PVGFVGSAQAKEELIERPRGMSYLVVRGRRGGSAMAAAAVNAIASDRE" gene 2323175..2324701 /gene="cobI" /locus_tag="Rv2066" /db_xref="GeneID:888483" CDS 2323175..2324701 /gene="cobI" /locus_tag="Rv2066" /EC_number="2.1.1.-" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS." /note="Rv2066, (MTCY49.05), len: 508 aa. Probable CobI-CobJ fusion protein, S-adenosyl-L-methionine-precorrin-2 methyl transferase and precorrin-3 methylase (EC 2.1.1.-). Similar in N-terminal half (aa 1-240) to COBI_PSEDE|P21639, S-adenosyl-L-methionine-precorrin-2 methyl transferase (244 aa), FASTA scores: opt: 759, E(): 4.4e-34, (49.2% identity in 238 aa overlap); and in C-terminal half (aa 240-508) to P21640|COBJ_PSEDE PRECORRIN-3 METHYLASE (EC 2.1.1.-) (254 aa), FASTA scores: opt: 695, E(): 0, (45.3% identity in 258 aa overlap)." /codon_start=1 /transl_table=11 /product="bifunctional S-adenosyl-L-methionine-precorrin-2 methyl transferase/precorrin-3 methylase Cob I/J" /protein_id="NP_216582.1" /db_xref="GI:15609203" /db_xref="GeneID:888483" /translation="MSARGTLWGVGLGPGDPELVTVKAARVIGEADVVAYHSAPHGHS IARGIAEPYLRPGQLEEHLVYPVTTEATNHPGGYAGALEDFYADATERIATHLDAGRN VALLAEGDPLFYSSYMHLHTRLTRRFNAVIVPGVTSVSAASAAVATPLVAGDQVLSVL PGTLPVGELTRRLADADAAVVVKLGRSYHNVREALSASGLLGDAFYVERASTAGQRVL PAADVDETSVPYFSLAMLPGGRRRALLTGTVAVVGLGPGDSDWMTPQSRRELAAATDL IGYRGYLDRVEVRDGQRRHPSDNTDEPARARLACSLADQGRAVAVVSSGDPGVFAMAT AVLEEAEQWPGVRVRVIPAMTAAQAVASRVGAPLGHDYAVISLSDRLKPWDVIAARLT AAAAADLVLAIYNPASVTRTWQVGAMRELLLAHRDPGIPVVIGRNVSGPVSGPNEDVR VVKLADLNPAEIDMRCLLIVGSSQTRWYSVDSQDRVFTPRRYPEAGRATATKSSRHSD" gene complement(2324647..2325870) /locus_tag="Rv2067c" /db_xref="GeneID:888752" CDS complement(2324647..2325870) /locus_tag="Rv2067c" /function="UNKNOWN" /note="Rv2067c, (MTCY49.06c), len: 407 aa. Conserved hypothetical protein, some similarity to YAT1_SYNP6 P08442 atp synthase subunits region ORF 1. (417 aa), FASTA scores, opt: 373, E(): 4.9e-18, (27.7% identity in 358 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216583.1" /db_xref="GI:15609204" /db_xref="GeneID:888752" /translation="MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSH RILWPDREYRPDLDILIAGCGTNQAAIFAFTNRAAKVVAIDISRPALDHQQYLKDKHG LANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLRRDGVVAAMLY GKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTYHPLRNYLTKARDLLSDSA LVDTFLHGRQRSYTVEECVDLVTSAGLVFQGWFHKAPYYPHDFFVPNSEFYAAVNTLP EVKAWSVMERLETLNATHLFMACRRDRPKEQYTIDFSTVAALDYVPLMRTRCGVSGTD MFWPGWRMAPSPAQLAFLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQS LWRLDFVAVALPASG" gene complement(2325886..2326809) /gene="blaC" /locus_tag="Rv2068c" /db_xref="GeneID:888742" CDS complement(2325886..2326809) /gene="blaC" /locus_tag="Rv2068c" /EC_number="3.5.2.6" /function="hydrolyses beta-lactams to generate corresponding beta-amino acid [CATALYTIC ACTIVITY: A BETA-LACTAM + H(2)O = A SUBSTITUTED BETA-AMINO ACID]." /experiment="experimental evidence, no additional details recorded" /note="Rv2068c, (MTCY49.07c), len: 307 aa. blaC, class A beta-lactamase (EC 3.5.2.6) (see citation below), similar to e.g. BLAC_NOCLA Q06316 beta-lactamase precursor (302 aa), FASTA scores, opt: 860, E(): 0, (50.2% identity in 283 aa overlap); eyc. Contains PS00013 Prokaryotic lipid attachment site near N-terminus, and PS00146 Beta-lactamase class-A active site. BELONGS TO THE CLASS-C BETA-LACTAMASE FAMILY." /codon_start=1 /transl_table=11 /product="class A BETA-lactamase BLAC" /protein_id="NP_216584.1" /db_xref="GI:15609205" /db_xref="GeneID:888742" /translation="MRNRGFGRRELLVAMAMLVSVTGCARHASGARPASTTLPAGADL ADRFAELERRYDARLGVYVPATGTTAAIEYRADERFAFCSTFKAPLVAAVLHQNPLTH LDKLITYTSDDIRSISPVAQQHVQTGMTIGQLCDAAIRYSDGTAANLLLADLGGPGGG TAAFTGYLRSLGDTVSRLDAEEPELNRDPPGDERDTTTPHAIALVLQQLVLGNALPPD KRALLTDWMARNTTGAKRIRAGFPADWKVIDKTGTGDYGRANDIAVVWSPTGVPYVVA VMSDRAGGGYDAEPREALLAEAATCVAGVLA" misc_feature complement(2326525..2326572) /gene="blaC" /locus_tag="Rv2068c" /note="PS00146 Beta-lactamase class-A active site" gene 2326944..2327501 /gene="sigC" /locus_tag="Rv2069" /db_xref="GeneID:888723" CDS 2326944..2327501 /gene="sigC" /locus_tag="Rv2069" /function="INVOLVED IN PROMOTER RECOGNITION, TRANSCRIPTION INITIATION." /experiment="experimental evidence, no additional details recorded" /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigC" /protein_id="NP_216585.1" /db_xref="GI:15609206" /db_xref="GeneID:888723" /translation="MTATASDDEAVTALALSAAKGNGRALEAFIKATQQDVWRFVAYL SDVGSADDLTQETFLRAIGAIPRFSARSSARTWLLAIARHVVADHIRHVRSRPRTTRG ARPEHLIDGDRHARGFEDLVEVTTMIADLTTDQREALLLTQLLGLSYADAAAVCGCPV GTIRSRVARARDALLADAEPDDLTG" gene complement(2327491..2328225) /gene="cobK" /locus_tag="Rv2070c" /gene_synonym="cbiJ" /db_xref="GeneID:887566" CDS complement(2327491..2328225) /gene="cobK" /locus_tag="Rv2070c" /gene_synonym="cbiJ" /EC_number="1.3.1.54" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS." /note="CobK/CbiJ; there are 2 pathways for cobalamin (vitamin B12) production, one aerobic (ex. P. denitrificans), the other anaerobic (ex. S. typhimurium); the CobK/CbiJ perform similar reactions in both; the anaerobic pathway includes the use of a chelated cobalt ion in order for ring contraction to occur; CobK thus converts precorrin 6 into dihydro-precorrin 6 while CbiJ converts cobalt-precorrin 6 into cobalt-deihydro-precorrin 6" /codon_start=1 /transl_table=11 /product="cobalt-precorrin-6x reductase" /protein_id="NP_216586.1" /db_xref="GI:15609207" /db_xref="GeneID:887566" /translation="MTRVLLLGGTAEGRALAKELHPHVEIVSSLAGRVPNPALPIGPV RIGGFGGVEGLRGWLREERIDAVVDATHPFAVTITAHAAQVCGELGLPYLVLARPPWD PGTAIIAVSDIEAADVVAEQGYSRVFLTTGRSGIAAFANSDAWFLIRVVTAPDGTALP RRHKLVLSRGPYGYHDEFALLREQRIDALVTKNSGGKMTRAKLDAAAALGISVVMIAR PLLPAGVAAVDSVHRAAMWVAGLPSR" gene complement(2328222..2328977) /gene="cobM" /locus_tag="Rv2071c" /db_xref="GeneID:888521" CDS complement(2328222..2328977) /gene="cobM" /locus_tag="Rv2071c" /EC_number="2.1.1.133" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS." /note="Rv2071c, (MTCY49.10c), len: 251 aa. Probable cobM, precorrin-3 methylase (EC 2.1.1.133), similar to e.g. L21196|g347169|RERCOBLMK2 RERCOBLMK from Rhodococococcus sp. NI86/21 (249 aa), FASTA scores: opt: 992, E(): 0, (62.4% identity in 245 aa overlap) and to COBM_ PSEDE|P21922 precorrin-3 methylase (253 aa), FASTA scores: opt: 863, E(): 0, (54.6% identity in 249 aa overlap). Contains PS00839 Uroporphyrin-III C-methyltransferase signature 1, and PS00840 Uroporphyrin-III C-methyltransferase signature 2." /codon_start=1 /transl_table=11 /product="precorrin-4 C11-methyltransferase CobM" /protein_id="NP_216587.1" /db_xref="GI:15609208" /db_xref="GeneID:888521" /translation="MTVYFIGAGPGAADLITVRGQRLLQRCPVCLYAGSIMPDDLLAQ CPPGATIVDTGPLTLEQIVRKLADADADGRDVARLHSGDPSLYSALAEQCRELDALGI GYEIVPGVPAFAAAAAALKRELTVPGVAQTVTLTRVATLSTPIPPGEDLAALARSRAT LVLHLAAAQIDAIVPRLLDGGYRPETPVAVVAFASWPQQRTLRGTLADIAARMHDAKI TRTAVIVVGDVLTAEGFTDSYLYSVARHGRYAQ" misc_feature complement(2328651..2328752) /gene="cobM" /locus_tag="Rv2071c" /note="PS00840 Uroporphyrin-III C-methyltransferase signature 2" misc_feature complement(2328918..2328962) /gene="cobM" /locus_tag="Rv2071c" /note="PS00839 Uroporphyrin-III C-methyltransferase signature 1" gene complement(2328974..2330146) /gene="cobL" /locus_tag="Rv2072c" /db_xref="GeneID:888321" CDS complement(2328974..2330146) /gene="cobL" /locus_tag="Rv2072c" /EC_number="2.1.1.132" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS." /note="Rv2072c, (MTCY49.11c), len: 390 aa. Probable cobL, methyl transferase (EC 2.1.1.132), similar to L21196|g347169|RERCOBLMK1 from Rhodocococcus sp. NI86/21 (447 aa), FASTA scores: opt: 892; E(): 0; (50.1% identity in 369 aa overlap), and to COBL_PSEDE|P21921 precorrin-6y methylase (413 aa), FASTA scores: opt: 830, E(): 0, (40.6% identity in 404 aa overlap)." /codon_start=1 /transl_table=11 /product="precorrin-6y methyltransferase CobL" /protein_id="NP_216588.1" /db_xref="GI:15609209" /db_xref="GeneID:888321" /translation="MIIVVGIGADGMTGLSEHSRSELRRATVIYGSKRQLALLDDTVT AERWEWPTPMLPAVQGLSPDGADLHVVASGDPLLHGIGSTLIRLFGHDNVTVLPHVSA VTLACARMGWNVYDTEVISLVTAQPHTAVRRGGRAIVLSGDRSTPQALAVLLTEHGRG DSKFSVLEQLGGPAERRRDGTARAWACDPPLDVDELNVIAVRYLLDERTSWAPDEAFA HDGQITKHPIRVLTLAALAPRPGQRLWDVGAGSGAIAVQWCRSWPGCTAVAFERDERR RRNIGFNAAAFGVSVDVRGDAPDAFDDAARPSVIFLGGGVTQPGLLEACLDSLPAGGN LVANAVTVESEAALAHAYSRLGGELRRFQHYLGEPLGGFTGWRPQLPVTQWSVTKR" repeat_region complement(2330147..2330225) /note="79 bp Mycobacterial Interspersed Repetitive Unit, Class I" gene complement(2330214..2330963) /locus_tag="Rv2073c" /db_xref="GeneID:887267" CDS complement(2330214..2330963) /locus_tag="Rv2073c" /function="UNKNOWN" /note="Rv2073c, (MTCY49.12c), len: 249 aa. Probable oxidoreductase (EC 1.-.-.-), belonging to shortchain dehydrogenase reductase (SDR) family, similar to e.g. YMP3_STRCO P43168 hypothetical 25.8 kDa protein in mpra 5' region (251 aa) FASTA scores: opt: 386, E(): 1.1e-18, (44.1% identity in 170 aa overlap). Similar to several M. tuberculosis hypothetical proteins, e.g. Rv3791, Rv1544,Rv0945, Rv0765c. etc." /codon_start=1 /transl_table=11 /product="shortchain dehydrogenase" /protein_id="NP_216589.1" /db_xref="GI:15609210" /db_xref="GeneID:887267" /translation="MDDTGAAPVVIFGGRSQIGGELARRLAAGATMVLAARNADQLAD QAAALRAAGAIAVHTREFDADDLAAHGPLVASLVAEHGPIGTAVLAFGILGDQARAET DAAHAVAIVHTDYVAQVSLLTHLAAAMRTAGRGSLVVFSSVAGIRVRRANYVYGSAKA GLDGFASGLADALHGTGVRLLIARPGFVIGRMTEGMTPAPLSVTPERVAAATARALVN GKRVVWIPWALRPMFVALRLLPRFVWRRMPR" gene 2330993..2331406 /locus_tag="Rv2074" /db_xref="GeneID:888523" CDS 2330993..2331406 /locus_tag="Rv2074" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2074, (MTCY49.13), len: 137 aa. Conserved hypothetical protein, similar to SCF43A.28 hypothetical protein from Streptomyces coelicolor (141 aa). Smith-Waterman scores: 5459242|CAB48915.1|AL096837 hypothetical protein from Streptomyces coelicolor A3(2) Expect = 1e-21, Identities = 56/106 (52%)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216590.1" /db_xref="GI:15609211" /db_xref="GeneID:888523" /translation="MAMVNTTTRLSDDALAFLSERHLAMLTTLRADNSPHVVAVGFTF DPKTHIARVITTGGSQKAVNADRSGLAVLSQVDGARWLSLEGRAAVNSDIDAVRDAEL RYAQRYRTPRPNPRRVVIEVQIERVLGSADLLDRA" gene complement(2331416..2332879) /locus_tag="Rv2075c" /db_xref="GeneID:888771" CDS complement(2331416..2332879) /locus_tag="Rv2075c" /function="UNKNOWN" /note="Rv2075c, (MTCY49.14c), len: 487 aa. Possibly exported or envelope protein; has potential signal peptide at N-terminus and hydrophobic stretch around residue 430." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216591.1" /db_xref="GI:15609212" /db_xref="GeneID:888771" /translation="MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCD VISPVAIPCVALGKFADAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHRTA RFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQLDIDVRALELD LHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVLPQIANWLNAPGHTEEVIL LYLEDQLKNASAYESVVATLDQVLRRADGTSLIYRPNPARRATNGCVPLPLDVSREEI RASGARAVLVGSCAPGWSAAVFDWSGVELESGSNSGYRPYPACDATYGRGVYAWRLVR YYEDSTLATALANPTRPPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWS WAPDEPRAGAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAGAALAC TAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP" gene complement(2333037..2333288) /locus_tag="Rv2076c" /db_xref="GeneID:887296" CDS complement(2333037..2333288) /locus_tag="Rv2076c" /function="UNKNOWN" /note="Rv2076c, (MTCY49.15c), len: 83 aa. Unknown, questionable ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216592.1" /db_xref="GI:15609213" /db_xref="GeneID:887296" /translation="MVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIAN AVKGSLWWDIWSLGLLVLIPAVVYGKMRRSRRISSDQDR" gene complement(2333323..2334294) /locus_tag="Rv2077c" /db_xref="GeneID:887785" CDS complement(2333323..2334294) /locus_tag="Rv2077c" /function="UNKNOWN" /note="Rv2077c, (MTCY49.16c), len: 323 aa. Possible conserved transmembrane protein. Part of Mycobacterium tuberculosis protein family with Rv2542, Rv2079, Rv2797c, Rv0963c, Rv1949c. Hydrophobic stretches at C-terminus." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216593.1" /db_xref="GI:15609214" /db_xref="GeneID:887785" /translation="MLATLSQIRAWSTEHLIDAAGYWTETADRWEDVFLQMRNQAHAI AWNGAGGDGLRQRTRADFSTVSGIADQLRRAATIARNGAGTIDAAQRRVMYAVEDAQD AGFNVGEDLSVTDTKTTQPAAVQAARLAQAQALAGDIRLRVGQLVAAENEVSGQLAAT TGDVGNVRFAGAPVVAHSAVQLVDFFKQDGPTPPPPGAPHPSGGADGPYSDPITSMML PPAGTEAPVSDATKRWVDNMVNELAARPPDDPIAVEARRLAFQALHRPCNSAEWTAAV AGFAGSSAGVVGTALAIPAGPADWALLGAALLGVGGSGAAVVNCATK" gene complement(2334295..2334594) /locus_tag="Rv2077A" /db_xref="GeneID:3205060" CDS complement(2334295..2334594) /locus_tag="Rv2077A" /function="UNKNOWN" /note="Rv2077A, len: 99 aa. Conserved hypothetical protein, similar to P95263|Rv1951c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (137 aa), FASTA scores: opt: 271, E(): 1.5e-11, (51.04% identity in 97 aa overlap); and some similarity with P95012|Rv2541 HYPOTHETICAL ALANINE RICH PROTEIN from Mycobacterium tuberculosis (135 aa), FASTA scores: opt: 140, E(): 0.014, (32.95% identity in 88 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177658.1" /db_xref="GI:57116940" /db_xref="GeneID:3205060" /translation="MGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATT VAVSGINAAICCAAAEFATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV" gene 2335059..2335373 /locus_tag="Rv2078" /db_xref="GeneID:888071" CDS 2335059..2335373 /locus_tag="Rv2078" /function="UNKNOWN" /note="Rv2078, (MTCY49.17), len: 104 aa. Unknown" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216594.1" /db_xref="GI:15609215" /db_xref="GeneID:888071" /translation="MFVDVELLHSGANESHYAGEHAHGGADQLSRGPLLSGMFGTFPV AQTFHDAVGAAHAQQMRNLHAHRQALITVGEKARHAATGFTDMDDGNAAELKAVVCSC AT" gene 2335355..2337325 /locus_tag="Rv2079" /db_xref="GeneID:887333" CDS 2335355..2337325 /locus_tag="Rv2079" /function="UNKNOWN" /note="Rv2079, (MTCY49.18), len: 656 aa. Conserved hypothetical protein; part of Mycobacterium tuberculosis protein family with Rv2542, Rv2077c, Rv2797c, Rv0963c, Rv1949c. Contains PS00120 Lipases, serine active site" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216595.1" /db_xref="GI:15609216" /db_xref="GeneID:887333" /translation="MQLRHINIRALIAEAGGDPWAIEHSLHAGRPAQIAELAEAFHAA GRYTAEANAAFEEARRRFEASWNRENGEHPINDSAEVQRVTAALGVQSLQLPKIGVDL ENIAADLAEAQRAAAGRIATLESQLQRIDDQLDQALELEHDPRLAAAERSELDALITC LEQDAIDDTASALGQLQSIRAGYSDHLQQSLAMLRADGYDGAGLQGLDAPQSPVKPEE PIQIPPPGTGAPEVHRWWTSLTSEERQRLIAEHPEQIGNLNGVPVSARSDANIAVMTR DLNRVRDIATRYRTSVDDVLGDPAKYGLSAGDITRYRNADETKKGLDHNARNDPRNPS PVYLFAYDPMAFGGKGRAAIAIGNPDTAKHTAVIVPGTSSSVKGGWLHDNHDDALNLF NQAKAADPNNPTAVIAWMGYDAPNDFTDPRIATPMLARIGGAALAEDVNGLWVTHLGV GQNVTVLGHSYGSTTVADAFALGGMHANDAVLLGCPGTDLAHSAASFHLDGGRVYVGA ASTDPISMLGQLDSLSQYVNRGNLAGQLQGLAVGLGTDPAGDGFGSVRFRAEVPNSDG INPHDHSYYYHRGSEALRSMADIASGHGDALASDGMLAQPRHQPGVEIDIPGLGSVEI DIPGTPASIDPEWSRPPGSITDDHVFDAPLHR" misc_feature 2336714..2336743 /locus_tag="Rv2079" /note="PS00120 Lipases, serine active site" gene 2337306..2337869 /gene="lppJ" /locus_tag="Rv2080" /db_xref="GeneID:887351" CDS 2337306..2337869 /gene="lppJ" /locus_tag="Rv2080" /function="UNKNOWN" /note="Rv2080, (MTCY49.19), len: 187 aa. Possible lppJ, lipoprotein; contains prokayotic lipoprotein modification site (PS00013) and signal sequence at N-terminus." /codon_start=1 /transl_table=11 /product="lipoprotein LppJ" /protein_id="NP_216596.1" /db_xref="GI:15609217" /db_xref="GeneID:887351" /translation="MPHSTADRRLRLTRQALLAAAVVPLLAGCALVMHKPHSAGSSNP WDDSAHPLTDDQAMAQVVEPAKQIVAAADLQAVRAGFSFTSCNDQGDPPYQGTVRMAF LLQGDHDAYFQHVRAAMLSHGWIDGPPPGQYFHGITLHKNGVTANMSLALDHSYGEMI LDGECRNTTDHHHDDETTNITNQLVQP" gene complement(2338065..2338505) /locus_tag="Rv2081c" /db_xref="GeneID:887348" CDS complement(2338065..2338505) /locus_tag="Rv2081c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2081c, (MTCY49.20c), len: 146 aa. Possible transmembrane unknown protein. Hydrophobic stretch from aa 32-54." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216597.1" /db_xref="GI:15609218" /db_xref="GeneID:887348" /translation="MFANAGLSPFVAIWTARAASLYTSHNFWCAAAVSAAVYVGSAVV PAAVAGPLFVGRVSATIKAAAPSTTAAIATLATAANGQLRERGGAGGWVGVHCPVVGG GGVGHPRKAIAAAVSVHSTCMPAAFGGHLGLGDRSRSVSLSGTP" gene 2338709..2340874 /locus_tag="Rv2082" /db_xref="GeneID:887795" CDS 2338709..2340874 /locus_tag="Rv2082" /function="UNKNOWN" /note="Rv2082, (MTCY49.21), len: 721 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0029, and to Rv3899c and Rv3900c which may be frameshifted." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216598.1" /db_xref="GI:15609219" /db_xref="GeneID:887795" /translation="MAGDLPPGRWSALLVGAWWPARPDAPMAGVTYWRKAAQLKRNEA NDLRNERSLLAVNQGRTADDLLERYWRGEQRLATIAHQCEVKSDQSEQVADAVNYLRD RLTEIAQSGNQQINQILAGKGPIEAKVAAVNAVIEQSNAMADHVGATAMSNIIDATQR VFDETIGGDAHTWLRDHGVSLDTPARPRPVTAEDMTSMTANSPAGSPFGAAPSAPSHS TTTSGPPTAPTPTSPFGTAPMVLSSSSTSSGPPTAPTPTSPFGTAPMPPGPPPPGTVS PPLPPSAPAVGVGGPSVPAAGMPPAAAAATAPLSPQSLGQSFTTGMTTGTPAAAGAQA LSAGALHAATEPLPPPAPPPTTPTVTTPTVATATTAGIPHIPDSAPTPSPAPIAPPTT DNASAMTPIAPMVANGPPASPAPPAAAPAGPLPAYGADLRPPVTTPPATPPTPTGPIS GAAVTPSSPAAGGSLMSPVVNKSTAPATTQAQPSNPTPPLASATAAATTGAAAGDTSR RAAEQQRLRRILDTVARQEPGLSWAAGLRDNGQTTLLVTDLASGWIPPHIRLPAHITL LEPAPRRRHATVTDLLGTTTVAAAHHPHGYLSQPDPDTPALTGDRTARIAPTIDELGP TLVETVRRHDTLPPIAQAVVVAATRNYGVPDNETDLLHHKTTEIHQAVLTTYPNHDIA TVVDWMLLAAINALIAGDQSGANYHLAWAIAAISTRRSR" gene 2340871..2341815 /locus_tag="Rv2083" /db_xref="GeneID:887281" CDS 2340871..2341815 /locus_tag="Rv2083" /function="UNKNOWN" /note="Rv2083, (MTCY49.22), len: 314 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv3898c (110 aa) and Rv3897c (210 aa)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216599.1" /db_xref="GI:15609220" /db_xref="GeneID:887281" /translation="MTSIESHPEQYWAAAGRPGPVPLALGPVHPGGPTLIDLLMALFG LSTNADLGGANADIEGDDTDRRAHAADAARKFSANEANAAEQMQGVGAQGMAQMASGI GGALSGALGGVMGPLTQLPQQAMQAGQGAMQPLMSAMQQAQGADGLAAVDGARLLDSI GGEPGLGSGAGGGDVGGGGAGGTTPTGYLGPPPVPTSSPPTTPAGAPTKSATMPPPGG ASPASAHMGAAGMPMVPPGAMGARGEGSGQEKPVEKRLTAPAVPNGQPVKGRLTVPPS APTTKPTDGKPVVRRRILLPEHKDFGRIAPDEKTDAGE" gene 2341808..2342944 /locus_tag="Rv2084" /db_xref="GeneID:887350" CDS 2341808..2342944 /locus_tag="Rv2084" /function="UNKNOWN" /note="Rv2084, (MTCY49.23), len: 378 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216600.1" /db_xref="GI:15609221" /db_xref="GeneID:887350" /translation="MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLD TQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLI DVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIH ALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGE LSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTA ATIQQSGTAGDGGGGRRQDSRRRNGPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAV EGVEEIGASLPGRESTPSDDGGSLHPSGRPRRVHRRRWCGLGLC" repeat_region 2342942..2344410 /note="IS1556, len: 1469 bp. Possible Insertion sequence-like region." /mobile_element="insertion sequence:IS1556" gene 2343027..2343332 /locus_tag="Rv2085" /db_xref="GeneID:888138" CDS 2343027..2343332 /locus_tag="Rv2085" /function="UNKNOWN" /note="Rv2085, (MTCY49.24), len: 101 aa. Conserved hypothetical protein, similar to YI32_MYCTU P19772 insertion element IS986 hypothetical 6.6 kda protein (59 aa), FASTA scores, opt: 119, E(): 0.002 9, (36.4% identity in 55 aa overlap); ORFs Rv2085, Rv2086 and Rv2087 (MTCY49.24,25,26, and 27) all show similarity to transposases but we can find no sequence errors to account for the frameshifts. Contains possible helix-turn-helix motif at aa 33 to 54,(+3.11 SD)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216601.1" /db_xref="GI:15609222" /db_xref="GeneID:888138" /translation="MSDMCDVVSFVGAAERVLRARFRPSPESGPPVHARRCGWSLGIS AETLRRWAGQAEVDSGVVAGVSASRSGSVKTSELEQTIEILKVATSFFARKCDPRHR" gene 2343311..2343916 /locus_tag="Rv2086" /db_xref="GeneID:888128" CDS 2343311..2343916 /locus_tag="Rv2086" /function="UNKNOWN" /note="Rv2086, (MTCY49.25), len: 201 aa. Conserved hypothetical protein: low similarity to transposases; ORFs Rv2085, Rv2086 and Rv2087 (MTCY49.24,25,26, and 27) all show similarity to transposases but we can find no sequence errors to account for the frameshifts. Start changed since first submission (-16 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216602.2" /db_xref="GI:57116941" /db_xref="GeneID:888128" /translation="MRPATPLICAFGDKHKHTYGVTPICRALAVHGVQIASRTYFADR AAAPSKRALWDTTITEILAGYYEPDAEGKRPPECLYGSLKMWAHLQRQGFRWPSATVK TIMRANGWRGVPLAAHITHHRTRPGRGPGPRPGGSAMAGFSNEPAGSGRLHLRADDVE FRLHRVRGRRLRRCDRGLGMLADQRRSVRRTRITPRPSRLT" gene 2343994..2344224 /locus_tag="Rv2087" /db_xref="GeneID:887934" CDS 2343994..2344224 /locus_tag="Rv2087" /function="UNKNOWN" /note="Rv2087, (MTCY49.27), len: 76 aa. Conserved hypothetical protein, with low similarity to transposases; ORFs Rv2085, Rv2086 and Rv2087 (MTCY49.24,25,26, and 27) all show similarity to transposases but we can find no sequence errors to account for the frameshifts. Start changed since first submission (-45 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216603.2" /db_xref="GI:57116942" /db_xref="GeneID:887934" /translation="MLAGLRPSIGIVGDALDNALCETTTGPHRTECSHGSPFRSGPIR TLADLEDIASAWVEHTCHTQQGVRIPGRLQPA" gene 2344411..2346180 /gene="pknJ" /locus_tag="Rv2088" /db_xref="GeneID:888322" CDS 2344411..2346180 /gene="pknJ" /locus_tag="Rv2088" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION) [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /note="Rv2088, (MTCY49.28), len: 589 aa. Probable pknJ, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), similar to other serine/threonine-protein kinases e.g. PKWA_THECU|P49695 putative serine/threonine-protein kinase (742 aa), FASTA scores: opt: 457, E(): 2.7e-15, (26.0% identity in 578 aa overlap); etc. Contains PS00108 Serine/Threonine protein kinases active-site signature. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES. Experimental studies show evidence of auto-phosphorylation." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase J" /protein_id="NP_216604.1" /db_xref="GI:15609225" /db_xref="GeneID:888322" /translation="MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKV LAAELSRDLDFRARFVREADVAAGLDHPNIVAVHQRGQFEGRLWIAMQFVDGGNAEDA LRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGGDERVLLSDFG IARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLYSLGCALFRLLTGEAPFAA GAGAAVAVVAGHLHQPPPTVSDRVPGLSAAMDAVIATAMAKDPMRRFTSAGEFAHAAA AALYGGATDGWVPPSPAPHVISQGAVPGSPWWQHPVGSVTALATPPGHGWPPGLPPLP RRPRRYRRGVAAVAAVMVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPP IVTRSRLPGLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSA YDLGTVIGFYGLTIDEPPTGTAPNQVGQLIVAFRDAATAQRHLADLASIWRRCGGRTV TLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDHAIAAKNNVLVDVDIMT PDTSRGQQAVIGITNYILAKIPG" misc_feature 2344804..2344842 /gene="pknJ" /locus_tag="Rv2088" /note="PS00108 Serine/Threonine protein kinases active-site signature" gene complement(2346197..2347324) /gene="pepE" /locus_tag="Rv2089c" /db_xref="GeneID:887719" CDS complement(2346197..2347324) /gene="pepE" /locus_tag="Rv2089c" /EC_number="3.4.13.-" /function="hydrolysis of peptide bonds" /note="Rv2089c, (MTCY49.29c), len: 375 aa. Probable pepE, dipeptidase, similar to e.g. PEPQ_LACDL P46545, xaa-pro dipeptidase (368 aa), FASTA scores, opt: 617, E(): 5.1 e-32, (34.7% identity in 363 aa overlap); contains PS00491 Aminopeptidase P and proline dipeptidase signature. Also similar to Mycobacterium tuberculosis peptidases Rv2861c, Rv0734, Rv2535c." /codon_start=1 /transl_table=11 /product="dipeptidase PepE" /protein_id="NP_216605.1" /db_xref="GI:15609226" /db_xref="GeneID:887719" /translation="MGSRRFDAEVYARRLALAAAATADAGLAGLVITPGYDLCYLIGS RAETFERLTALVLPAAGAPAVVLPRLELAALKQSAAAELGLRVCDWVDGDDPYGLVSA VLGGAPVATAVTDSMPALHMLPLADALGVLPVLATDVLRRLRMVKEETEIDALRKAGA AIDRVHARVPEFLVPGRTEADVAADIAEAIVAEGHSEVAFVIVGSGPHGADPHHGYSD RELREGDIVVVDIGGTYGPGYHSDSTRTYSIGEPDSDVAQSYSMLQRAQRAAFEAIRP GVTAEQVDAAARDVLAEAGLAEYFVHRTGHGIGLCVHEEPYIVAGNDLVLVPGMAFSI EPGIYFPGRWGARIEDIVIVTEDGAVSVNNCPHELIVVPVS" misc_feature complement(2346383..2346421) /gene="pepE" /locus_tag="Rv2089c" /note="PS00491 Aminopeptidase P and proline dipeptidase signature" gene 2347373..2348554 /locus_tag="Rv2090" /db_xref="GeneID:887924" CDS 2347373..2348554 /locus_tag="Rv2090" /EC_number="3.1.11.-" /function="DNA metabolism" /note="Rv2090, (MTCY49.30), len: 393 aa. Probable 5'-3' exonuclease (EC 3.1.11.-), similar to exonuclease part of DNA polymerase, e.g. DPO1_MYCTU Q07700 DNA polymerase I (EC 2.7.7.7) (pol i) (904 aa), FASTA scores, opt: 461, E(): 1.2e-17, (38.7% identity in 292 aa overlap). BELONGS TO FAMILY A OF DNA POLYMERASES" /codon_start=1 /transl_table=11 /product="5'-3' exonuclease" /protein_id="NP_216606.1" /db_xref="GI:15609227" /db_xref="GeneID:887924" /translation="MPAPDPMRGDPPHPAPPRLRSPLDPTSGDPLHPAPPRLRSPLDP TSGDPLHPAPPRLRSPLDPTSGDPLHPAPPRLRSPLVLLDGASMWFRSFFGVPSSITA PDGRPVNAVRGFIDSMAVVITQQRPNRLAVCLDLDWRPQFRVDLIPSYKAHRVAEPEP NGQPDVEEVPDELTPQVDMIMELLDAFGIAMAGAPGFEADDVLGTLATRERRDPVIVV SGDRDLLQVVADDPVPVRVLYLGRGLAKATLFGPAEVAERYGLPAHRAGAAYAELALL RGDPSDGLPGVPGVGEKTAATLLARHGSLDQIMAAADDRKTTMAKGLRTKLLAASAYI KAADRVVRVATDAPVTLSTPTDRFPLVAADPERTAELATRFGVESSIARLQKALDTLP G" gene complement(2348558..2349292) /locus_tag="Rv2091c" /db_xref="GeneID:887469" CDS complement(2348558..2349292) /locus_tag="Rv2091c" /function="UNKNOWN" /note="Rv2091c, (MTCY49.31c), len: 244 aa. Probable membrane protein; contains potential transmembrane region. Repetitive ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216607.1" /db_xref="GI:15609228" /db_xref="GeneID:887469" /translation="MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATW QAPAYTPQYQQPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQ PGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFW APGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTV SIDGTSKRVTVTFQDNKGTYEVGRPQ" gene complement(2349334..2352054) /gene="helY" /locus_tag="Rv2092c" /db_xref="GeneID:887736" CDS complement(2349334..2352054) /gene="helY" /locus_tag="Rv2092c" /EC_number="3.6.1.-" /function="DNA HELICASE ACTIVITY." /note="Rv2092c, (MTCY49.32c), len: 906 aa. Probable helY, DNA helicase (EC 3.6.1.-), with similarity to YJF0_YEAST P47047 hypothetical helicase in tdh1-gyp6 intergenic region, (1073 aa), FASTA scores, opt: 1004, E(): 0, (29.0% identity in 970 aa o verlap); contains PS00017 ATP/GTP-binding site motif A, PS00402 Binding-protein-dependent transport systems inner membrane comp signature. BELONGS TO THE SKI2 SUBFAMILY OF HELICASES." /codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase HelY" /protein_id="NP_216608.1" /db_xref="GI:15609229" /db_xref="GeneID:887736" /translation="MTELAELDRFTAELPFSLDDFQQRACSALERGHGVLVCAPTGAG KTVVGEFAVHLALAAGSKCFYTTPLKALSNQKHTDLTARYGRDQIGLLTGDLSVNGNA PVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADRMRGPVWEEVILQLPDDVRVV SLSATVSNAEEFGGWIQTVRGDTTVVVDEHRPVPLWQHVLVGKRMFDLFDYRIGEAEG QPQVNRELLRHIAHRREADRMADWQPRRRGSGRPGFYRPPGRPEVIAKLDAEGLLPAI TFVFSRAGCDAAVTQCLRSPLRLTSEEERARIAEVIDHRCGDLADSDLAVLGYYEWRE GLLRGLAAHHAGMLPAFRHTVEELFTAGLVKAVFATETLALGINMPARTVVLERLVKF NGEQHMPLTPGEYTQLTGRAGRRGIDVEGHAVVIWHPEIEPSEVAGLASTRTFPLRSS FAPSYNMTINLVHRMGPQQAHRLLEQSFAQYQADRSVVGLVRGIERGNRILGEIAAEL GGSDAPILEYARLRARVSELERAQARASRLQRRQAATDALAALRRGDIITITHGRRGG LAVVLESARDRDDPRPLVLTEHRWAGRISSADYSGTTPVGSMTLPKRVEHRQPRVRRD LASALRSAAAGLVIPAARRVSEAGGFHDPELESSREQLRRHPVHTSPGLEDQIRQAER YLRIERDNAQLERKVAAATNSLARTFDRFVGLLTEREFIDGPATDPVVTDDGRLLARI YSESDLLVAECLRTGAWEGLKPAELAGVVSAVVYETRGGDGQGAPFGADVPTPRLRQA LTQTSRLSTTLRADEQAHRITPSREPDDGFVRVIYRWSRTGDLAAALAAADVNGSGSP LLAGDFVRWCRQVLDLLDQVRNAAPNPELRATAKRAIGDIRRGVVAVDAG" misc_feature complement(2349667..2349753) /gene="helY" /locus_tag="Rv2092c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" misc_feature complement(2351917..2351940) /gene="helY" /locus_tag="Rv2092c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2352103..2353029) /gene="tatC" /locus_tag="Rv2093c" /db_xref="GeneID:888068" CDS complement(2352103..2353029) /gene="tatC" /locus_tag="Rv2093c" /function="INVOLVED IN PROTEINS EXPORT: REQUIRED FOR CORRECT LOCALIZATION OF PRECURSOR PROTEINS BEARING SIGNAL PEPTIDES WITH THE TWIN ARGININE CONSERVED MOTIF S/T-R-R-X-F-L-K. THIS SEC-INDEPENDENT PATHWAY IS TERMED TAT FOR TWIN-ARGININE TRANSLOCATION SYSTEM. THIS SYSTEM MAINLY TRANSPORTS PROTEINS WITH BOUND COFACTORS THAT REQUIRE FOLDING PRIOR TO EXPORT (BY SIMILARITY)." /note="Rv2093c, (MT2154, MTCY49.33c), len: 308 aa. Probable tatC, transmembrane protein, component of twin-arginine translocation protein export system (see citation below), equivalent to U00017|U00017_1 from Mycobacterium leprae (317 aa), FASTA scores: opt: 1722, E(): 0, (84.5% identity in 310 aa overlap). Similarity to others e.g. P27857|TATC_ECOLI|MTTB|B3839|Z5360|ECS4768 Sec-independent protein translocase protein from E. coli strain K12 and O157:H7 (258 aa), FASTA scores: opt: 344, E(): 6e-16, (32.5% identity in 265 aa overlap). BELONGS TO THE TATC FAMILY." /codon_start=1 /transl_table=11 /product="Sec-independent protein translocase transmembrane protein tatC" /protein_id="NP_216609.1" /db_xref="GI:15609230" /db_xref="GeneID:888068" /translation="MRAAGLLKRLNPRNRRSRVNPDATMSLVDHLTELRTRLLISLAA ILVTTIFGFVWYSHSIFGLDSLGEWLRHPYCALPQSARADISADGECRLLATAPFDQF MLRLKVGMAAGIVLACPVWFYQLWAFITPGLYQRERRFAVAFVIPAAVLFVAGAVLAY LVLSKALGFLLTVGSDVQVTALSGDRYFGFLLNLLVVFGVSFEFPLLIVMLNLAGLLT YERLKSWRRGLIFAMFVFAAIFTPGSDPFSMTALGAALTVLLELAIQIARVHDKRKAK REAAIPDDEASVIDPPSPVPAPSVIGSHDDVT" gene complement(2353046..2353297) /gene="tatA" /locus_tag="Rv2094c" /db_xref="GeneID:888086" CDS complement(2353046..2353297) /gene="tatA" /locus_tag="Rv2094c" /function="INVOLVED IN PROTEINS EXPORT: REQUIRED FOR CORRECT LOCALIZATION OF PRECURSOR PROTEINS BEARING SIGNAL PEPTIDES WITH THE TWIN ARGININE CONSERVED MOTIF S/T-R-R-X-F-L-K. THIS SEC-INDEPENDENT PATHWAY IS TERMED TAT FOR TWIN-ARGININE TRANSLOCATION SYSTEM. THIS SYSTEM MAINLY TRANSPORTS PROTEINS WITH BOUND COFACTORS THAT REQUIRE FOLDING PRIOR TO EXPORT (BY SIMILARITY)." /experiment="experimental evidence, no additional details recorded" /note="TatA; similar to TatE that is found in some proteobacteria; part of system that translocates proteins with a conserved twin arginine motif across the inner membrane; capable of translocating folded substrates typically those with bound cofactors; similar to a protein import system in thylakoid membranes" /codon_start=1 /transl_table=11 /product="twin arginine translocase protein A" /protein_id="NP_216610.1" /db_xref="GI:15609231" /db_xref="GeneID:888086" /translation="MGSLSPWHWAILAVVVIVLFGAKKLPDAARSLGKSLRIFKSEVR ELQNENKAEASIETPTPVQSQRVDPSAASGQDSTEARPA" misc_feature complement(2353193..2353216) /gene="tatA" /locus_tag="Rv2094c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2353365..2354315) /locus_tag="Rv2095c" /db_xref="GeneID:888403" CDS complement(2353365..2354315) /locus_tag="Rv2095c" /function="UNKNOWN" /note="Rv2095c, (MTCY49.35c), len: 316 aa. Conserved hypothetical protein. Highly similar to ML1330 P54075|YY35_MYCLE HYPOTHETICAL 27.0 kDa PROTEIN (247 aa) opt: 1127 E(): 0, (78.4% identity in 227 aa overlap). Also similar to ORF11(1) of Rhodococcus erythropolis. FASTA score: Z82004|REZ820043 REZ82004 NID: g1666179 - Rhodococcus (326 aa) opt: 624 E(): 1.1e-30; (56.7% identity in 319 aa overlap). Contains possible helix-turn-helix motif at aa 25-46, ( +2.92 SD)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216611.1" /db_xref="GI:15609232" /db_xref="GeneID:888403" /translation="MSALSTRLVRLLNMVPYFQANPRITRAEAAAELGVTAKQLEEDL NQLWMCGLPGYSPGDLIDFEFCGDTIEVTFSAGIDRPLKLTSPEATGLLVALRALADI PGVVDPQAARSAIAKIAAAAGAVAAVAEQAPTESPAAAAVRAAVRNSRALTIDYYAAS HDTLTTRIVDPIRVLLIGGHSYLEAWSREAEGVRLFRFDRIVDAAELGEPAVPPESAR QAPPDTSLFDGDLSLPSATLRVAPSASWMLEYYPIRELRQLPDGSCEVAMTYASEDWM TRLLLGFGSDVRVLAPESLAQRVRDAATAALDAYQAAAPP" gene complement(2354312..2355310) /locus_tag="Rv2096c" /db_xref="GeneID:888436" CDS complement(2354312..2355310) /locus_tag="Rv2096c" /function="UNKNOWN" /note="Rv2096c, (MTCY49.36c), len: 332 aa. Conserved hypothetical protein. Highly similar to ML1329, P54076|YY36_MYCLE HYPOTHETICAL 35.4 kDa PROTEIN B21 (331 aa) opt: 1676 E(): 0; (80.2% identity in 329 aa overlap) and to ORF10(1) of Rhodococcus erythropolis, Z82004|REZ820042 REZ 82004 NID: g1666179 (330 aa) opt: 1232, E(): 0; 59.9% identity in 332 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216612.1" /db_xref="GI:15609233" /db_xref="GeneID:888436" /translation="MATSKVERLVNLVIALLSTRGYITAEKIRSSVAGYSDSPSVEAF SRMFERDKNELRDLGIPLEVGRVSALEPTEGYRINRDAYALSPVELTPDEAAAVAVAT QLWESPELITATQGALLKLRAAGVDVDPLDTGAPVAIASAAAVSGLRGSEDVLGILLS AIDSGQVVQFSHRSSRAEPYTVRTVEPWGVVTEKGRWYLVGHDRDRDATRVFRLSRIG AQVTPIGPAGATTVPAGVDLRSIVAQKVTEVPTGEQATVWVAEGRATALRRAGRSAGP RQLGGRDGEVIELEIRSSDRLAREITGYGADAIVLQPGSLRDDVLARLRAQAGALA" gene complement(2355319..2356677) /locus_tag="Rv2097c" /db_xref="GeneID:888460" CDS complement(2355319..2356677) /locus_tag="Rv2097c" /function="UNKNOWN" /note="Rv2097c, (MTCY49.37c), len: 452 aa. Conserved hypothetical protein. Similarity to YTH6_ RHOSO P43484 hypothetical protein in thcr 5' region (333 aa), FASTA scores opt: 738, E(): 0, (38.5% identity in 330 aa overlap). Also highly similar to Mycobacterium leprae protein ML1328, P54077|YY37_MYCLE HYPOTHETICAL 38.1 KD PROTEIN (336 aa) opt: 1985 E(): 0; (96.4% identity in 307 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216613.1" /db_xref="GI:15609234" /db_xref="GeneID:888460" /translation="MQRRIMGIETEFGVTCTFHGHRRLSPDEVARYLFRRVVSWGRSS NVFLRNGARLYLDVGSHPEYATAECDSLVQLVTHDRAGEWVLEDLLVDAEQRLADEGI GGDIYLFKNNTDSAGNSYGCHENYLIVRAGEFSRISDVLLPFLVTRQLICGAGKVLQT PKAATYCLSQRAEHIWEGVSSATTRSRPIINTRDEPHADAEKYRRLHVIVGDSNMSET TTMLKVGTAALVLEMIESGVAFRDFSLDNPIRAIREVSHDVTGRRPVRLAGGRQASAL DIQREYYTRAVEHLQTREPNAQIEQVVDLWGRQLDAVESQDFAKVDTEIDWVIKRKLF QRYQDRYDMELSHPKIAQLDLAYHDIKRGRGIFDLLQRKGLAARVTTDEEIAEAVDQP PQTTRARLRGEFISAAQEAGRDFTVDWVHLKLNDQAQRTVLCKDPFRAVDERVKRLIA SM" gene complement(2356729..2358206) /gene="PE_PGRS36" /locus_tag="Rv2098c" /db_xref="GeneID:888312" misc_feature complement(2356729..2358206) /gene="PE_PGRS36" /locus_tag="Rv2098c" /note="Rv2098c, (MTCY49.38c), len: 434 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Frameshifted near N-terminus (see Rv2099c|PE21)." gene 2358389..2360041 /locus_tag="Rv2100" /db_xref="GeneID:888454" CDS 2358389..2360041 /locus_tag="Rv2100" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv2100, (MTCY49.40), len: 550 aa. Conserved hypothetical protein. Member of Mycobacterium tuberculosis REP13E12 repeat family with Rv1148c, Rv1945, Rv3467, Rv0094c, Rv1128c, Rv1587c, Rv1702c, Rv3466, Rv1588c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216616.1" /db_xref="GI:15609237" /db_xref="GeneID:888454" /translation="MAGALFEPSFAAAHPAGLLRRPVTRTVVLSVAATSIAHMFEISL PDPTELCRSDDGALVAAIEDCARVEAAASARRLSAIAELTGRRTGADQRADWACDFWD CAAAEVAAALTISHGKASGQMHLSLALNRLPQVAALFLAGHLGARLFSIIAWRTYLVR DPHALSLLDAALAEHAGAWGPLSAPKLEKAIDSWIDRYDPGALRRSRISARTRDLCIG DPDEDAGTAALWGRLYATDAAMLDRRLTEMAHGVCEDDPRTLAQRRADALGALAAGAD HLACGCGKPDCPSGAGNDERAAGVVIHVVADASALDAQPDPHLSGDEPPSRPLTPETT LFEALTPDPEPDPPATHAPAELITTGGGVVPAPLLAELIRGGATISQVRHPGDLAAEP HYRPSAKLAEFVRMRDLTCRFPGCDVPAEFCDIDHSAPWPLGPTHPSNLKCACRKHHL LKTFWTGWRDVQLPDGTVIWTAPNGHTYTTHPGSRIFFPTWHTTTAELPQTSTAAVNV DARGLMMPRRRRTRAAELAHRINAERALNDAYMAERNKPPSF" gene 2360240..2363281 /gene="helZ" /locus_tag="Rv2101" /db_xref="GeneID:888635" CDS 2360240..2363281 /gene="helZ" /locus_tag="Rv2101" /EC_number="3.6.-.-" /function="HAS HELICASE ACTIVITY." /note="Rv2101, (MTV020.01), len: 1013 aa. Probable helZ, helicase (EC 3.6.-.-), similar to many e.g. PCC6803|P74552|SLL1366 HELICASE OF THE SNF2/RAD54 FAMILY from Synechocystis sp. strain PCC 6803 (1039 aa), FASTA scores: opt: 2015, E(): 0, (38.4% identity in 1063 aa overlap); etc. TBparse score is 0.875." /codon_start=1 /transl_table=11 /product="helicase HelZ" /protein_id="NP_216617.1" /db_xref="GI:15609238" /db_xref="GeneID:888635" /translation="MLVLHGFWSNSGGMRLWAEDSDLLVKSPSQALRSARPHPFAAPA DLIAGIHPGKPATAVLLLPSLRSAPLDSPELIRLAPRPAARTDPMLLAWTVPVVDLDP TAALAAFDQPAPDVRYGASVDYLAELAVFARELVERGRVLPQLRRDTHGAAACWRPVL QGRDVVAMTSLVSAMPPVCRAEVGGHDPHELATSALDAMVDAAVRAALSPMDLLPPRR GRSKRHRAVEAWLTALTCPDGRFDAEPDELDALAEALRPWDDVGIGTVGPARATFRLS EVETENEETPAGSLWRLEFLLQSTQDPSLLVPAEQAWNDDGSLRRWLDRPQELLLTEL GRASRIFPELVPALRTACPSGLELDADGAYRFLSGTAAVLDEAGFGVLLPSWWDRRRK LGLVLSAYTPVDGVVGKASKFGREQLVEFRWELAVGDDPLSEEEIAALTETKSPLIRL RGQWVALDTEQMRRGLEFLERKPTGRKTTAEILALAASHPDDVDTPLEVTAVRADGWL GDLLAGAAAASLQPLDPPDGFTATLRPYQQRGLAWLAFLSSLGLGSCLADDMGLGKTV QLLALETLESVQRHQDRGVGPTLLLCPMSLVGNWPQEAARFAPNLRVYAHHGGARLHG EALRDHLERTDLVVSTYTTATRDIDELAEYEWNRVVLDEAQAVKNSLSRAAKAVRRLR AAHRVALTGTPMENRLAELWSIMDFLNPGLLGSSERFRTRYAIPIERHGHTEPAERLR ASTRPYILRRLKTDPAIIDDLPEKIEIKQYCQLTTEQASLYQAVVADMMEKIENTEGI ERRGNVLAAMAKLKQVCNHPAQLLHDRSPVGRRSGKVIRLEEILEEILAEGDRVLCFT QFTEFAELLVPHLAARFGRAARDIAYLHGGTPRKRRDEMVARFQSGDGPPIFLLSLKA GGTGLNLTAANHVVHLDRWWNPAVENQATDRAFRIGQRRTVQVRKFICTGTLEEKIDE MIEEKKALADLVVTDGEGWLTELSTRDLREVFALSEGAVGE" gene 2363391..2364107 /locus_tag="Rv2102" /db_xref="GeneID:888639" CDS 2363391..2364107 /locus_tag="Rv2102" /function="UNKNOWN" /note="Rv2102, (MTV020.02), len: 238 aa. Conserved hypothetical protein, similar to part of hypothetical protein D90916|D90916_18 from Synechocystis sp. PCC6803 (289 aa), FASTA scores: opt: 498, E(): 1.9e-25, (46.7% identity in 167 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216618.1" /db_xref="GI:15609239" /db_xref="GeneID:888639" /translation="MLEDIGLGNRLQRGRSYARKGQVISLQVDAGLVTALVQGSRARP YRIRIGIPAFGKSQWAHVERTLAENAWYAAKLLSGEMPEDIEDVFAGLGLSLFPGTAR ELSLDCSCPDYAVPCKHLAATFYLLAESFDEDPFAILAWRGREREDLLANLAAARADG AAPAADHAEQVAQPLTDCLDRYYARQADINVPSPPATPSTALLDQLPDTGLSARGRPL TELLRPAYHALTHHHNSAGG" misc_feature 2363538..2363561 /locus_tag="Rv2102" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(2364086..2364520) /locus_tag="Rv2103c" /db_xref="GeneID:888003" CDS complement(2364086..2364520) /locus_tag="Rv2103c" /function="UNKNOWN" /note="Rv2103c, (MTV020.03), len: 144 aa. Conserved hypothetical protein, similar to hypothetical mycobacterial proteins belonging to family, includes Rv0749, Rv0277c, Rv2530c, Rv3320c, Rv2494, Rv2872, Rv0617, Rv1242 etc. FASTA scores: sptr|Q49793|Q49793 B2126_C3_261 (97 aa) opt: 331, E(): 4.8e-18; 59.4% identity in 96 aa overlap and gp|Z74024|MTCY274_3 Mycobacterium tuberculosis cosmid (147 aa) opt: 234, E(): 1.2e-10; 34.8% identity in 141 aa overlap. TBparse score is 0.889" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216619.1" /db_xref="GI:15609240" /db_xref="GeneID:888003" /translation="MKIVDANVLLYAVNTTSEHHKPSLRWLDGALSGADRVGFAWVPL LAFVRLATKVGLFPRPLPREAAITQVADWLAAPSAVLVNPTVRHADILARMLTYVGTG ANLVNDAHLAALAVEHRASIVSYDSDFGRFEGVRWDQPPALL" gene complement(2364527..2364781) /locus_tag="Rv2104c" /db_xref="GeneID:888014" CDS complement(2364527..2364781) /locus_tag="Rv2104c" /function="UNKNOWN" /note="Rv2104c, (MTV020.04), len: 84 aa. Conserved hypothetical protein, similar to members of a family of hypothetical mycobacterial proteins including Rv2871, Rv1241, Rv2132, Rv3321c, Rv1113, Rv0657, Rv1560, etc. FASTA scores: sptr|Q49787|Q49787 B2126_C2_217 (97 aa) opt: 197, E(): 2e-07; 57.1% identity in 56 aa overlap and Z95388|MTCY270_36 Mycobacterium tuberculosis cosmid (76 aa ) opt: 142, E(): 0.0011; 41.8% identity in 55 aa overlap. TBparse score is 0.915" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216620.1" /db_xref="GI:15609241" /db_xref="GeneID:888014" /translation="MRTTVTLDDDVEQLVRRRMAERQVSFKKALNDAIRDGASGRPAP SHFSTRTADLGVPAVNLDRALQLAADLEDEELVRRQRRGS" repeat_region 2365414..2366768 /note="IS6110-5, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-5" repeat_region 2365414..2365441 /note="28bp inverted repeat at the left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 2365465..2365791 /locus_tag="Rv2105" /db_xref="GeneID:888395" CDS 2365465..2365791 /locus_tag="Rv2105" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2105, (MTCY261.01), len: 108 aa. Probable transposase subunit for IS6110, similar to eg. Q51647|IS401 transposase subunit (107 aa), FASTA scores; opt: 325, E(): 3.8e-24, (52.9% identity in 102 aa overlap). Identical to many other Mycobacterium tuberculosis IS6110 transposase subunits." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216621.1" /db_xref="GI:15609242" /db_xref="GeneID:888395" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 2365788..2366726 /locus_tag="Rv2106" /db_xref="GeneID:888398" CDS <2365788..2366726 /locus_tag="Rv2106" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2106, (MTCY261.02), len: 312 aa. Probable transposase subunit for IS6110. Identical to many other M. tuberculosis IS6110 transposase subunits." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216622.1" /db_xref="GI:15609243" /db_xref="GeneID:888398" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(2366741..2366768) /note="28bp inverted repeat at the right end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" gene 2367359..2367655 /gene="PE22" /locus_tag="Rv2107" /db_xref="GeneID:887811" CDS 2367359..2367655 /gene="PE22" /locus_tag="Rv2107" /function="UNKNOWN" /note="Rv2107, (MTCY261.03), len: 98 aa. Member of mycobacterial PE family (see citation below), e.g. Y03A_MYCTU Q10637 hypothetical glycine-rich 49.6 kDa protein (603 aa), FASTA scores; opt: 214 E(): 1.3e-14, (39.8% identity in 93 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177858.1" /db_xref="GI:57116945" /db_xref="GeneID:887811" /translation="MSFVNVDPFGMLAAAATLESLGSHMAVSNAAVASVTTKVPPPAA DYVSKKLSLFFSSHGQQYQVQAARGTAFHRKLVRTLANGALAYEEVEIANNEGF" gene 2367711..2368442 /gene="PPE36" /locus_tag="Rv2108" /db_xref="GeneID:887814" CDS 2367711..2368442 /gene="PPE36" /locus_tag="Rv2108" /function="UNKNOWN" /note="Rv2108, (MTCY261.04), len: 243 aa. Member of the Mycobacterium tuberculosis PE family: N-terminus is similar to N-terminal region of Mycobacterium tuberculosis PPE family proteins, e.g. YX23_MYCTU|Q10813 hypothetical 41.1 kDa protein (404 aa), FASTA scores: opt: 431, E(): 3.9e-32, (44.0% identity in 166 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177859.1" /db_xref="GI:57116946" /db_xref="GeneID:887814" /translation="MPNFWALPPEINSTRIYLGPGSGPILAAAQGWNALASELEKTKV GLQSALDTLLESYRGQSSQALIQQTLPYVQWLTTTAEHAHKTAIQLTAAANAYEQARA AMVPPAMVRANRVQTTVLKAINWFGQFSTRIADKEADYEQMWFQDALVMENYWEAVQE AIQSTSHFEDPPEMADDYDEAWMLNTVFDYHNENAKEEVIHLVPDVNKERGPIELVTK VDKEGTIRLVYDGEPTFSYKEHPKF" gene complement(2368983..2369729) /gene="prcA" /locus_tag="Rv2109c" /db_xref="GeneID:887538" CDS complement(2368983..2369729) /gene="prcA" /locus_tag="Rv2109c" /EC_number="3.4.25.1" /function="protein degradation" /experiment="experimental evidence, no additional details recorded" /note="Rv2109c, (MTCY261.05c), len: 248 aa. prcA, proteasome alpha-type subunit 1, highly similar to TR:Q53080 (EMBL:U26421 ) proteasome alpha-type subunit 1 from Rhodococcus (259 aa), FASTA scores; opt: 1035, E(): 0, 67.2% identity in 247 aa overlap." /codon_start=1 /transl_table=11 /product="proteasome (alpha subunit) PrcA" /protein_id="NP_216625.1" /db_xref="GI:15609246" /db_xref="GeneID:887538" /translation="MSFPYFISPEQAMRERSELARKGIARAKSVVALAYAGGVLFVAE NPSRSLQKISELYDRVGFAAAGKFNEFDNLRRGGIQFADTRGYAYDRRDVTGRQLANV YAQTLGTIFTEQAKPYEVELCVAEVAHYGETKRPELYRITYDGSIADEPHFVVMGGTT EPIANALKESYAENASLTDALRIAVAALRAGSADTSGGDQPTLGVASLEVAVLDANRP RRAFRRITGSALQALLVDQESPQSDGESSG" gene complement(2369726..2370601) /gene="prcB" /locus_tag="Rv2110c" /db_xref="GeneID:887508" CDS complement(2369726..2370601) /gene="prcB" /locus_tag="Rv2110c" /EC_number="3.4.25.1" /function="protein degradation" /experiment="experimental evidence, no additional details recorded" /note="Rv2110c, (MTCY261.06c), len: 291 aa. prcB, proteasome beta-type subunit 2, highly similar to eg. TR:Q53083 (EMBL:U264 22) proteasome beta-type subunit 2 from Rhodococcus (292 aa), FASTA scores; opt: 1103, E(): 0, 64.5% identity in 262 aa overlap." /codon_start=1 /transl_table=11 /product="proteasome (beta subunit) PrcB" /protein_id="NP_216626.1" /db_xref="GI:15609247" /db_xref="GeneID:887508" /translation="MTWPLPDRLSINSLSGTPAVDLSSFTDFLRRQAPELLPASISGG APLAGGDAQLPHGTTIVALKYPGGVVMAGDRRSTQGNMISGRDVRKVYITDDYTATGI AGTAAVAVEFARLYAVELEHYEKLEGVPLTFAGKINRLAIMVRGNLAAAMQGLLALPL LAGYDIHASDPQSAGRIVSFDAAGGWNIEEEGYQAVGSGSLFAKSSMKKLYSQVTDGD SGLRVAVEALYDAADDDSATGGPDLVRGIFPTAVIIDADGAVDVPESRIAELARAIIE SRSGADTFGSDGGEK" gene complement(2370598..2370792) /locus_tag="Rv2111c" /db_xref="GeneID:888788" CDS complement(2370598..2370792) /locus_tag="Rv2111c" /function="UNKNOWN" /note="Rv2111c, MTCY261.07c, len: 64 aa. Conserved hypothetical protein. Highly similar to a hypothetical protein TR:Q53078 (EMBL:U26422) (64 aa) upstream of Rhodococcus proteasome beta-type subunit 1, FASTA scores; opt: 349, E(): 7.3e-25, 84.4% identity in 64 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216627.1" /db_xref="GI:15609248" /db_xref="GeneID:888788" /translation="MAQEQTKRGGGGGDDDDIAGSTAAGQERREKLTEETDDLLDEID DVLEENAEDFVRAYVQKGGQ" gene complement(2370905..2372569) /locus_tag="Rv2112c" /db_xref="GeneID:888290" CDS complement(2370905..2372569) /locus_tag="Rv2112c" /function="UNKNOWN" /note="Rv2112c, (MTCY261.08c), len: 554 aa. Conserved hypothetical protein. Highly similar to a hypothetical protein TR:Q53081 (EMBL:U26422) (499 aa) upstream of Rhodococcus proteasome beta-type subunit 1, FASTA scores opt: 2832 E(): 0, 85.3% identity in 502 aa overlap. Also some similarity to Mycobacterium tuberculosis hypothetical protein Rv2097c (MTCY49.37c, 38.2% identity in 419 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216628.1" /db_xref="GI:15609249" /db_xref="GeneID:888290" /translation="MFWVGGPCLMPASSAARCAARIVGGRCLMPASSAARCAARIVGG PRLYGMQRIIGTEVEYGISSPSDPTANPILTSTQAVLAYAAAAGIQRAKRTRWDYEVE SPLRDARGFDLSRSAGPPPVVDADEVGAANMILTNGARLYVDHAHPEYSAPECTDPLD AVIWDKAGERVMEAAARHVASVPGAAKLQLYKNNVDGKGASYGSHENYLMSRQTPFSA IITGLTPFLVSRQVVTGSGRVGIGPSGDEPGFQLSQRSDYIEVEVGLETTLKRGIINT RDEPHADADRYRRLHVIIGDANLAETSTYLKLGTTALVLDLIEEGPAHAIDLTDLALA RPVHAVHAISRDPSLRATVALADGRELTGLALQRIYLDRVAKLVDSRDPDPRAADIVE TWAHVLDQLERDPMDCAELLDWPAKLRLLDGFRQRENLSWSAPRLHLVDLQYSDVRLD KGLYNRLVARGSMKRLVTEHQVLSAVENPPTDTRAYFRGECLRRFGADIAAASWDSVI FDLGGDSLVRIPTLEPLRGSKAHVGALLDSVDSAVELVEQLTAEPR" repeat_region 2372437..2372492 /note="56 bp direct repeat 1, GCCCGCCGACGATGCGGGCCGCGCAGCGGGCCGCTGAGGAGGCGGGCATCAAGCAA" repeat_region 2372494..2372549 /note="56 bp direct repeat 2, GCCCGCCGACGATGCGGGCCGCGCAGCGGGCCGCTGAGGAGGCGGGCATCAAGCAA" gene 2372630..2373823 /locus_tag="Rv2113" /db_xref="GeneID:887731" CDS 2372630..2373823 /locus_tag="Rv2113" /function="UNKNOWN" /note="Rv2113, (MTCY261.09), len: 397 aa. Probable integral membrane protein." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216629.1" /db_xref="GI:15609250" /db_xref="GeneID:887731" /translation="MSLSVRRPPAARAAAIVEAESWFLKRGLPSVLTMRGRCRRLWPR SAPMLAAWAVVEGCLMAVFFVTDGGEVFISATPTTAQWVILALLAVALPLASLVGWLV SQISSGRGQAAVATMAVAFAAASDVIESGPIQLLRTAVVVGLVLLQTGCGVGSVLGWA VRMTLEHLATVGTLAVRALPIVLLTALVFFNTYVWLMAANINGERLTLAMVFLLAIAG AFVVSKTVERVRPLLRSTTVMPQGSQSLAGTPFATMGDPSPGFPLTRAERLNVVFLLA ASQLVEILVVASVGAAIYLVLGMIILTPPLLREWTHYDSMTTTVLGMTFPAPDSLIRM CLFLGALTFMYISARAVDDAEYRAMFLDPLIDDLHTALLARNRYRNNVVTAPCAGVDA GHVDD" gene 2373834..2374457 /locus_tag="Rv2114" /db_xref="GeneID:887803" CDS 2373834..2374457 /locus_tag="Rv2114" /function="UNKNOWN" /note="Rv2114, (MTCY261.10), len: 207 aa. Unknown hypothetical protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216630.1" /db_xref="GI:15609251" /db_xref="GeneID:887803" /translation="MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAE LWSALDPQALATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLL SSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPK LGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQDDLTNSYSLVTA" gene complement(2374461..2376290) /locus_tag="Rv2115c" /db_xref="GeneID:887297" CDS complement(2374461..2376290) /locus_tag="Rv2115c" /EC_number="3.6.1.-" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2115c, (MTCY261.11c), len: 609 aa. Probable ATPase (EC 3.6.1.-), similar to e.g. YB56_METJA Q58556 cell division cycle protein 48 homolog (903 aa), FASTA scores; opt: 423, E(): 8.1e-32, 45.8% identity in 249 aa overlap. Contains PS00674 AAA-protein family signature and PS00017 ATP/GTP-binding site motif A (P-loop). Also some similarity to other Mycobacterium tuberculosis ATPases e.g. Rv0435c and Rv3610c. Equivalent to Mycobacterium leprae U00 017|U00017_18 (609 aa), FASTA scores; opt: 3670 E(): 0; 92.9% identity in 609 aa overlap." /codon_start=1 /transl_table=11 /product="ATPase" /protein_id="NP_216631.1" /db_xref="GI:15609252" /db_xref="GeneID:887297" /translation="MGESERSEAFGIPRDSPLSSGDAAELEQLRREAAVLREQLENAV GSHAPTRSARDIHQLEARIDSLAARNSKLMETLKEARQQLLALREEVDRLGQPPSGYG VLLATHDDDTVDVFTSGRKMRLTCSPNIDAASLKKGQTVRLNEALTVVEAGTFEAVGE ISTLREILADGHRALVVGHADEERVVWLADPLIAEDLPDGLPEALNDDTRPRKLRPGD SLLVDTKAGYAFERIPKAEVEDLVLEEVPDVSYADIGGLSRQIEQIRDAVELPFLHKE LYREYSLRPPKGVLLYGPPGCGKTLIAKAVANSLAKKMAEVRGDDAHEAKSYFLNIKG PELLNKFVGETERHIRLIFQRAREKASEGTPVIVFFDEMDSIFRTRGTGVSSDVETTV VPQLLSEIDGVEGLENVIVIGASNREDMIDPAILRPGRLDVKIKIERPDAEAAQDIYS KYLTEFLPVHADDLAEFDGDRSACIKAMIEKVVDRMYAEIDDNRFLEVTYANGDKEVM YFKDFNSGAMIQNVVDRAKKNAIKSVLETGQPGLRIQHLLDSIVDEFAENEDLPNTTN PDDWARISGKKGERIVYIRTLVTGKSSSASRAIDTESNLGQYL" misc_feature complement(2375010..2375066) /locus_tag="Rv2115c" /note="PS00674 AAA-protein family signature" misc_feature complement(2375391..2375414) /locus_tag="Rv2115c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 2376571..2377140 /gene="lppK" /locus_tag="Rv2116" /db_xref="GeneID:886029" CDS 2376571..2377140 /gene="lppK" /locus_tag="Rv2116" /function="UNKNOWN" /note="Rv2116, (MTCY261.12), len: 189 aa. Probable lppK, conserved lipoprotein, similar to Mycobacterium leprae B2126_F3_115 TR:Q49803 (194 aa), FASTA scores; opt: 624, E(): 3.1e-31, 51.6% identity in 190 aa overlap. Contains N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Some similarity to Rv2376c." /codon_start=1 /transl_table=11 /product="lipoprotein lppK" /protein_id="NP_216632.1" /db_xref="GI:15609253" /db_xref="GeneID:886029" /translation="MRRNIRVTLGAATIVAALGLSGCSHPEFKRSSPPAPSLPPVTSS PLEAAPITPLPAPEALIDVLSRLADPAVPGTNKVQLIEGATPENAAALDRFTTALRDG SYLPMTFAANDIAWSDNKPSDVMATVVVTTAHPDNREFTFPMEFVSFKGGWQLSRQTA EMLLAMGNSPDSTPSATSPAPAPSPTPPG" misc_feature 2376607..2376639 /gene="lppK" /locus_tag="Rv2116" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2377148..2377441 /locus_tag="Rv2117" /db_xref="GeneID:887685" CDS 2377148..2377441 /locus_tag="Rv2117" /function="UNKNOWN" /note="Rv2117, (MTCY261.13), len: 97 aa. Conserved hypothetical protein. Similar to hypothetical proteins from Mycobacterium leprae TR:Q49798 U2126J (97 aa), FASTA scores; opt: 554, E(): 0, 85.6% identity in 97 aa overlap, and Bacillus subtilis YLXP_BACSU P32730 hypothetical 10.7 kDa protein (92 aa), FASTA scores; opt: 173, E(): 1.4e-11, 34.1% identity in 82 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216633.1" /db_xref="GI:15609254" /db_xref="GeneID:887685" /translation="MWIGWLEFDVLLGDVRSLKQKRSVTRPLVAELQRKFSVSAAETG SHDLYRRAGIGVAVVSGDRSHAVDVLDNAERLVAAHPEFELLSVRRGLHRTDD" gene complement(2377470..2378312) /locus_tag="Rv2118c" /db_xref="GeneID:887374" CDS complement(2377470..2378312) /locus_tag="Rv2118c" /EC_number="2.1.1.-" /function="INVOLVED IN TRANSFER OF METHYL GROUP (FROM S-ADENOSYL-L-METHIONINE TO THE SUBSTRATE)." /experiment="experimental evidence, no additional details recorded" /note="Rv2118c, (MTCY261.14c), len: 280 aa. Possible S-adenosyl-l-methionine-dependent RNA methyltransferase (EC 2.1.1.-) (see citation below); corresponds to Mycobacterium leprae B2126_C1_165, similar to hypothetical proteins from several organisms e.g. Y134_METJA Q57598 hypothetical protein mj0134 (282 aa), FASTA scores; opt: 256, E(): 1e-13, FASTA scores; 30.2% identity in 285 aa overlap. The larger catalytic C-terminal domain binds the cofactor S-adenosyl-l-methionine (AdoMet) and is involved in the transfer of methyl group from AdoMet to the substrate." /codon_start=1 /transl_table=11 /product="RNA methyltransferase" /protein_id="NP_216634.1" /db_xref="GI:15609255" /db_xref="GeneID:887374" /translation="MSATGPFSIGERVQLTDAKGRRYTMSLTPGAEFHTHRGSIAHDA VIGLEQGSVVKSSNGALFLVLRPLLVDYVMSMPRGPQVIYPKDAAQIVHEGDIFPGAR VLEAGAGSGALTLSLLRAVGPAGQVISYEQRADHAEHARRNVSGCYGQPPDNWRLVVS DLADSELPDGSVDRAVLDMLAPWEVLDAVSRLLVAGGVLMVYVATVTQLSRIVEALRA KQCWTEPRAWETLQRGWNVVGLAVRPQHSMRGHTAFLVATRRLAPGAVAPAPLGRKRE GRDG" gene 2378386..2379222 /locus_tag="Rv2119" /db_xref="GeneID:888413" CDS 2378386..2379222 /locus_tag="Rv2119" /function="UNKNOWN" /note="Rv2119, (MTCY261.15), len: 278 aa. Conserved hypothetical protein. Similar to Mycobacterium leprae hypothetical protein TR:Q49799 U2126V (212 aa), FASTA scores; opt: 1153, E(): 0, 83.6% identity in 195 aa overlap. Orthologs present in Rhodococcus erythropolis (gb|AAC68687.1|(AF088800) and Streptomyces emb|CAB59506.1|(AL132648)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216635.1" /db_xref="GI:15609256" /db_xref="GeneID:888413" /translation="MADQPDPPTPRPALSPSRATDFKQCPLLYRFRAIDRLPEATSAA QLRGSVVHAALEQLYGLPAGLRSPDTARSLVQRAWDQMVAAEPELAGELDPGQPTQLL EDARALVSGYYRLEDPTRFDPQCCEQRVEVELADGTLLRGYIDRIDVAATGELRVVDY KTGKAPPAARALAEFKAMFQMKFYAVALFRSRGVPPTRLRLIYLADGQLLDYSPDRDE LLRFEKTLMAIWRAIQSAGETGDFRPNPSRLCDWCPHQQRCPAFGGTPPPYPGWPTEP AA" gene complement(2379245..2379727) /locus_tag="Rv2120c" /db_xref="GeneID:887821" CDS complement(2379245..2379727) /locus_tag="Rv2120c" /function="UNKNOWN" /note="Rv2120c, (MTCY261.16c), len: 160 aa. Probable conserved integral membrane protein, similar to hypothetical protein from Mesorhizobium loti (153 aa). Smith-Waterman scores: NP_104030.1 hypothetical protein [Mesorhizobium loti] >gi|14023209|dbj|BAB49816.1| (AP003000) Identities = 50/135 (37%)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216636.1" /db_xref="GI:15609257" /db_xref="GeneID:887821" /translation="MTHVLVLLLALLIGVVAGLRSLTAPAVVSWAAFLGWINLHGTWA SWMGNFVTVVIVSVLAVAELVNDKRPKTPPRTVTPVFAVRIILGAFAGAVIGTAWGYR WGGLGAGVIGAVLGTMGGYQARTRLVAARGGHDLPIALLEDSVAVLGGFAIVAAAAAL" gene complement(2379806..2380660) /gene="hisG" /locus_tag="Rv2121c" /db_xref="GeneID:888689" CDS complement(2379806..2380660) /gene="hisG" /locus_tag="Rv2121c" /EC_number="2.4.2.17" /function="THOUGHT TO BE INVOLVED IN HISTIDINE BIOSYNTHESIS." /note="long form of enzyme; catalyzes the formation of N'-5'-phosphoribosyl-ATP from phosphoribosyl pyrophosphate; crucial role in histidine biosynthesis; forms active dimers and inactive hexamers which is dependent on concentration of substrates and inhibitors" /codon_start=1 /transl_table=11 /product="ATP phosphoribosyltransferase" /protein_id="NP_216637.1" /db_xref="GI:15609258" /db_xref="GeneID:888689" /translation="MLRVAVPNKGALSEPATEILAEAGYRRRTDSKDLTVIDPVNNVE FFFLRPKDIAIYVGSGELDFGITGRDLVCDSGAQVRERLALGFGSSSFRYAAPAGRNW TTADLAGMRIATAYPNLVRKDLATKGIEATVIRLDGAVEISVQLGVADAIADVVGSGR TLSQHDLVAFGEPLCDSEAVLIERAGTDGQDQTEARDQLVARVQGVVFGQQYLMLDYD CPRSALKKATAITPGLESPTIAPLADPDWVAIRALVPRRDVNGIMDELAAIGAKAILA SDIRFCRF" gene complement(2380663..2380944) /gene="hisE" /locus_tag="Rv2122c" /db_xref="GeneID:888671" CDS complement(2380663..2380944) /gene="hisE" /locus_tag="Rv2122c" /EC_number="3.6.1.31" /function="THOUGHT TO BE INVOLVED IN HISTIDINE BIOSYNTHESIS." /note="catalyzes the formation of 1-(5-phosphoribosyl)-AMP from 1-(5-phosphoribolsyl)-ATP in histidine biosynthesis" /codon_start=1 /transl_table=11 /product="phosphoribosyl-ATP pyrophosphatase" /protein_id="YP_177860.1" /db_xref="GI:57116947" /db_xref="GeneID:888671" /translation="MQQSLAVKTFEDLFAELGDRARTRPADSTTVAALDGGVHALGKK LLEEAGEVWLAAEHESNDALAEEISQLLYWTQVLMISRGLSLDDVYRKL" gene 2381071..2382492 /gene="PPE37" /locus_tag="Rv2123" /db_xref="GeneID:888710" CDS 2381071..2382492 /gene="PPE37" /locus_tag="Rv2123" /function="UNKNOWN" /note="Rv2123, (MTCY261.19), len: 473 aa. PPE37 (alternate gene name: irg2), member of the Mycobacterium tuberculosis PPE family of proteins but the C-terminus is not repetitive (see citation below).; irg2" /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177861.1" /db_xref="GI:57116948" /db_xref="GeneID:888710" /translation="MTFPMWFAVPPEVPSAWLSTGMGPGPLLAAARAWHALAAQYTEI ATELASVLAAVQASSWQGPSADRFVVAHQPFRYWLTHAATVATAAAAAHETAAAGYTS ALGGMPTLAELAANHAMHGALVTTNFFGVNTIPIALNEADYLRMWIQAATVMSHYQAV AHESVAATPSTPPAPQIVTSAASSAASSSFPDPTKLILQLLKDFLELLRYLAVELLPG PLGDLIAQVLDWFISFVSGPVFTFLAYLVLDPLIYFGPFAPLTSPVLLPAGLTGLAGL GAVSGPAGPMVERVHSDGPSRQSWPAATGVTLVGTNPAALVTTPAPAPTTSAAPTAPS TPGSSAAQGLYAVGGPDGEGFNPIAKTTALAGVTTDAAAPAAKLPGDQAQSSASKATR LRRRLRQHRFEFLADDGRLTMPNTPEMADVAAGNRGLDALGFAGTIPKSAPGSATGLT HLGGGFADVLSQPMLPHTWDGSD" gene complement(2382489..2386067) /gene="metH" /locus_tag="Rv2124c" /db_xref="GeneID:888711" CDS complement(2382489..2386067) /gene="metH" /locus_tag="Rv2124c" /EC_number="2.1.1.13" /function="INVOLVED IN BIOSYNTHESIS OF METHIONINE (AT THE TERMINAL STEP) [CATALYTIC ACTIVITY: 5-methyltetrahydrofolate + L-homocysteine = tetrahydrofolate + L-methionine]." /note="Rv2124c, (MTCY261.20c), len: 1192 aa. Probable metH, methionine synthase (EC 2.1.1.13), similar to many e.g. METH_ECOLI|P13009 5-methyltetrahydrofolate--homocystein methyltransferase from Escherichia coli (1226 aa), FASTA scores: opt: 1446, E(): 0, (32.1% identity in 1223 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. BELONGS TO THE VITAMIN-B12 DEPENDENT METHIONINE SYNTHASE FAMILY." /codon_start=1 /transl_table=11 /product="5-methyltetrahydrofolate--homocystein methyltransferase" /protein_id="NP_216640.1" /db_xref="GI:15609261" /db_xref="GeneID:888711" /translation="MTAADKHLYDTDLLDVLSQRVMVGDGAMGTQLQAADLTLDDFRG LEGCNEILNETRPDVLETIHRNYFEAGADAVETNTFGCNLSNLGDYDIADRIRDLSQK GTAIARRVADELGSPDRKRYVLGSMGPGTKLPTLGHTEYAVIRDAYTEAALGMLDGGA DAILVETCQDLLQLKAAVLGSRRAMTRAGRHIPVFAHVTVETTGTMLLGSEIGAALTA VEPLGVDMIGLNCATGPAEMSEHLRHLSRHARIPVSVMPNAGLPVLGAKGAEYPLLPD ELAEALAGFIAEFGLSLVGGCCGTTPAHIREVAAAVANIKRPERQVSYEPSVSSLYTA IPFAQDASVLVIGERTNANGSKGFREAMIAEDYQKCLDIAKDQTRDGAHLLDLCVDYV GRDGVADMKALASRLATSSTLPIMLDSTETAVLQAGLEHLGGRCAINSVNYEDGDGPE SRFAKTMALVAEHGAAVVALTIDEEGQARTAQKKVEIAERLINDITGNWGVDESSILI DTLTFTIATGQEESRRDGIETIEAIRELKKRHPDVQTTLGLSNISFGLNPAARQVLNS VFLHECQEAGLDSAIVHASKILPMNRIPEEQRNVALDLVYDRRREDYDPLQELMRLFE GVSAASSKEDRLAELAGLPLFERLAQRIVDGERNGLDADLDEAMTQKPPLQIINEHLL AGMKTVGELFGSGQMQLPFVLQSAEVMKAAVAYLEPHMERSDDDSGKGRIVLATVKGD VHDIGKNLVDIILSNNGYEVVNIGIKQPIATILEVAEDKSADVVGMSGLLVKSTVVMK ENLEEMNTRGVAEKFPVLLGGAALTRSYVENDLAEIYQGEVHYARDAFEGLKLMDTIM SAKRGEAPDENSPEAIKAREKEAERKARHQRSKRIAAQRKAAEEPVEVPERSDVAADI EVPAPPFWGSRIVKGLAVADYTGLLDERALFLGQWGLRGQRGGEGPSYEDLVETEGRP RLRYWLDRLSTDGILAHAAVVYGYFPAVSEGNDIVVLTEPKPDAPVRYRFHFPRQQRG RFLCIADFIRSRELAAERGEVDVLPFQLVTMGQPIADFANELFASNAYRDYLEVHGIG VQLTEALAEYWHRRIREELKFSGDRAMAAEDPEAKEDYFKLGYRGARFAFGYGACPDL EDRAKMMALLEPERIGVTLSEELQLHPEQSTDAFVLHHPEAKYFNV" misc_feature complement(2385651..2385683) /gene="metH" /locus_tag="Rv2124c" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene 2386293..2387171 /locus_tag="Rv2125" /db_xref="GeneID:887798" CDS 2386293..2387171 /locus_tag="Rv2125" /function="UNKNOWN" /note="Rv2125, (MTCY261.21), len: 292 aa. Conserved hypothetical protein. Corresponds to Mycobacterium leprae hypothetical protein e.g. TR:Q49797 B2126_F1_36 (317 aa), FASTA scores; opt: 1648, E(): 0, 84.1% identity in 290 aa overlap. Very similar to Mycobacterium tuberculosis hypothetical protein Rv2714" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216641.1" /db_xref="GI:15609262" /db_xref="GeneID:887798" /translation="MTPSEGNAPLPELHNTVVVAAFEGWNDAGDAAGDAVAHLAASWQ ALPIVEIDDEAYYDYQVNRPVIRQVDGVTRELQWPAMRISHCRPPGSDRDVVLMCGVE PNMRWRTFCDELLAVIDKLNVDTVVILGALLADTPHTRPVPVSGAAYSAASARQFGLQ ETRYEGPTGIAGVFQSACVGAGIPAVTFWAAVPHYVSHPPNPKATIALLRRVEDVLDV EVPLADLPAQAEAWEREITETIAEDHELAEYVQTLEQHGDAAVDMNEALGNIDGDALA AEFERYLRRRRPGFGR" gene complement(2387202..2387972) /gene="PE_PGRS37" /locus_tag="Rv2126c" /db_xref="GeneID:887791" CDS complement(2387202..2387972) /gene="PE_PGRS37" /locus_tag="Rv2126c" /function="UNKNOWN" /note="Rv2126c, (MTCY261.22c), len: 256 aa. Possible PE_PGRS pseudogene fragment, similar to the Gly-rich C-terminus of many members of the Mycobacterium tuberculosis PGRS family e.g. MTCY441.04c (778 aa), FASTA scores; opt: 935, E(): 4.4e-18, 56.1% identity in 271 aa overlap." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177862.1" /db_xref="GI:57116949" /db_xref="GeneID:887791" /translation="MIGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGL IGNGGAGGAGGNGGIGGAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGAS GGMGGAGGAGGAGGAGGLLIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGG NAFGGRGGDGGDGGDGGTGGAGGARGAGGAGGAGGWLSGHSGAHGAMGSGGEGGAGGG GGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPPG" gene 2388616..2390085 /gene="ansP1" /locus_tag="Rv2127" /db_xref="GeneID:887715" CDS 2388616..2390085 /gene="ansP1" /locus_tag="Rv2127" /function="Involved in L-asparagine transport." /note="Rv2127, (MTCY261.26), len: 489 aa. Probable ansP1, L-asparagine permease, integral membrane protein highly similar to many e.g. ANSP_ECOLI P77610 L-asparagine permease (L-asparagine transport protein) (516 aa), FASTA scores: opt: 1880, E(): 0, (60.3% identity in 463 aa overlap); etc. Also highly similar to Mycobacterium tuberculosis permeases Rv0346c|MTCY13E10.06c, (72.1% identity in 473 aa overlap) and Rv1704c|MTCI125.26c|cycA. Contains PS00218 Amino acid permeases signature. SEEMS TO BELONG TO THE APC FAMILY." /codon_start=1 /transl_table=11 /product="L-asparagine permease ansP1" /protein_id="YP_177863.1" /db_xref="GI:57116950" /db_xref="GeneID:887715" /translation="MSAASQRVGAFGEEAGYHKGLKPRQLQMIGIGGAIGTGLFLGAG GRLAKAGPGLFLVYGVCGVFVFLILRALGELVLHRPSSGSFVSYAREFFGEKAAYAVG WMYFLHWAMTSIVDTTAIATYLQRWTIFTVVPQWILALIALTVVLSMNLISVEWFGEL EFWAALIKVLALMAFLVVGTVFLAGRYPVDGHSTGLSLWNNHGGLFPTSWLPLLIVTS GVVFAYSAVELVGTAAGETAEPEKIMPRAINSVVARIAIFYVGSVALLALLLPYTAYK AGESPFVTFFSKIGFHGAGDLMNIVVLTAALSSLNAGLYSTGRVMHSIAMSGSAPRFT ARMSKSGVPYGGIVLTAVITLFGVALNAFKPGEAFEIVLNMSALGIIAGWATIVLCQL RLHKLANAGIMQRPRFRMPFSPYSGYLTLLFLLVVLVTMASDKPIGTWTVATLIIVIP ALTAGWYLVRKRVMAVARERLGHTGPFPAVANPPVRSRD" misc_feature 2388763..2388855 /gene="ansP1" /locus_tag="Rv2127" /note="PS00218 Amino acid permeases signature" gene 2390085..2390288 /locus_tag="Rv2128" /db_xref="GeneID:887764" CDS 2390085..2390288 /locus_tag="Rv2128" /function="UNKNOWN" /note="Rv2128, (MTCY26.27), len: 67 aa. Probable conserved transmembrane protein." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216644.1" /db_xref="GI:15609265" /db_xref="GeneID:887764" /translation="MLRRGESIIRNRYASKPPLYGMAMVFLAMAVVAVTAYFRMGWWS IIGYAAAAIIGVIGFALAFRDLS" gene complement(2390308..2391189) /locus_tag="Rv2129c" /db_xref="GeneID:887841" CDS complement(2390308..2391189) /locus_tag="Rv2129c" /function="UNKNOWN" /note="Rv2129c, (MTCY261.28), len: 293 aa. Probable oxidoreductase (EC 1.-.-.-), similar to many e.g. FABG_SYNY3|P73826 3-oxoacyl-[acyl-carrier protein] reductase (240 aa), FASTA scores: opt: 241, E(): 5.1e-17, (32.7% identity in 196 aa overlap); etc. Also similar to a number of other Mycobacterium tuberculosis oxidoreductases e.g. MTCY210.04 (34.1% identity in 217 aa overlap)." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216645.1" /db_xref="GI:15609266" /db_xref="GeneID:887841" /translation="MTSLQGKVVFITGAARGIGAEVARRLHNKGAKLVLTDLSKSELA VMGAELGGDDRLLTVVADVRDLPAMQAAAETAVERFGGIDVVVANAGIASYGSVLKVD PQAFRRVLDVNLLGNFHTVRATLPALIDRRGYVLIVSSLAAFAAPPGMAPYNMSKAGN EHFANALRLEVAHLGVSVGSAHMSWIDTALVRDTKADLPAFAELLARLPWPLNKTTSV NKCAAAFVNGIEGRKDRVYCPGWVALFRWLKPLLSTRVGQRPIRNTVAKLMPQMDAEV AALGRFASAYTESLENS" gene complement(2391215..2392459) /gene="cysS" /locus_tag="Rv2130c" /db_xref="GeneID:887492" CDS complement(2391215..2392459) /gene="cysS" /locus_tag="Rv2130c" /EC_number="6.1.1.16" /function="tRNA charging" /note="catalyzes a two-step reaction; charges a cysteine by linking its carboxyl group to the alpha-phosphate of ATP then transfers the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="cysteinyl-tRNA synthetase" /protein_id="NP_216646.1" /db_xref="GI:15609267" /db_xref="GeneID:887492" /translation="MQSWYCPPVPVLPGRGPQLRLYDSADRQVRPVAPGSKATMYVCG ITPYDATHLGHAATYVTFDLIHRLWLDLGHELHYVQNITDIDDPLFERADRDGVDWRD LAQAEVALFCEDMAALRVLPPQDYVGATEAIAEMVELIEKMLACGAAYVIDREMGEYQ DIYFRADATLQFGYESGYDRDTMLRLCEERGGDPRRPGKSDELDALLWRAARPGEPSW PSPFGPGRPGWHVECAAIALSRIGSGLDIQGGGSDLIFPHHEFTAAHAECVSGERRFA RHYVHAGMIGWDGHKMSKSRGNLVLVSALRAQDVEPSAVRLGLLAGHYRADRFWSQQV LDEATARLHRWRTATALPAGPAAVDVVARVRRYLADDLDTPKAIAALDGWVTDAVEYG GHDAGAPKLVATAIDALLGVDL" gene complement(2392517..2393320) /gene="cysQ" /locus_tag="Rv2131c" /db_xref="GeneID:888142" CDS complement(2392517..2393320) /gene="cysQ" /locus_tag="Rv2131c" /function="COULD HELP CONTROL THE POOL OF 3'-PHOSPHOADENOSIDE 5'-PHOSPHOSULFATE, OR ITS USE IN SULFITE SYNTHESIS (BY SIMILARITY)." /note="Rv2131c, (MTCY270.37), len: 267 aa. Possible cysQ, monophosphatase, equivalent to CYSQ_MYCLE|P46726 cysQ protein homolog from Mycobacterium leprae (289 aa), FASTA scores: opt: 1374, E(): 0, (77.3% identity in 264 aa overlap). Contains inositol monophosphatase family signature 1 (PS00629), significance uncertain. SEEMS TO BELONG TO THE INOSITOL MONOPHOSPHATASE FAMILY." /codon_start=1 /transl_table=11 /product="monophosphatase CysQ" /protein_id="NP_216647.1" /db_xref="GI:15609268" /db_xref="GeneID:888142" /translation="MVSPAAPDLTDDLTDAELAADLAADAGKLLLQVRAEIGFDQPWT LGEAGDRQANSLLLRRLQAERPGDAVLSEEAHDDLARLKSDRVWIIDPLDGTREFSTP GRDDWAVHIALWRRSSNGQPEITDAAVALPARGNVVYRTDTVTSGAAPAGVPGTLRIA VSATRPPAVLHRIRQTLAIQPVSIGSAGAKAMAVIDGYVDAYLHAGGQWEWDSAAPAG VMLAAGMHASRLDGSPLRYNQLDPYLPDLLMCRAEVAPILLGAIADAWR" misc_feature complement(2393018..2393059) /gene="cysQ" /locus_tag="Rv2131c" /note="PS00629 Inositol monophosphatase family signature 1" gene 2393411..2393641 /locus_tag="Rv2132" /db_xref="GeneID:887827" CDS 2393411..2393641 /locus_tag="Rv2132" /function="UNKNOWN" /note="Rv2132, (MTCY270.36c), len: 76 aa. Conserved hypothetical protein. Function unknown but belongs to Mycobacterium tuberculosis protein family including Rv2871, Rv1241, Rv3321c, Rv1113, Rv0657c, Rv1560, Rv2104c, etc. Similarity to Mycobacterium tuberculosis protein Rv2871 (AL021924|MTV020_4, 84 aa). FASTA score: opt: 142, E(): 0.00036; 41.8% identity in 55 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216648.1" /db_xref="GI:15609269" /db_xref="GeneID:887827" /translation="MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVA NRFQQQTYDMGEGIDYSNIGDAIETLDGPASG" gene complement(2393851..2394639) /locus_tag="Rv2133c" /db_xref="GeneID:887616" CDS complement(2393851..2394639) /locus_tag="Rv2133c" /function="UNKNOWN" /note="Rv2133c, (MTCY270.35), len: 262 aa. Conserved hypothetical protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49774. FASTA best: Q49774 B2126_C1_150 (262 aa) opt: 1447, E(): 0; (79.0% identity in 262 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216649.1" /db_xref="GI:15609270" /db_xref="GeneID:887616" /translation="MLADGELTVLGRIRSASNATFLCESTLGLRSLHCVYKPVSGERP LWDFPDGTLAGRELSAYLVSTQLGWNLVPHTIIRDGPAGIGMLQLWVQQPGDAVDSDP LPGPDLVDLFPAHRPRPGYLPVLRAYDYAGDEVVLMHADDIRLRRMAVFDVLINNADR KGGHILCGIDGQVYGVDHGLCLHVENKLRTVLWGWAGKPIDDQILQAVAGLADALGGP LAEALAGRIAAAEIGALRRRAQSLLDQPVMPGPNGHRPIPWPAF" gene complement(2394650..2395237) /locus_tag="Rv2134c" /db_xref="GeneID:887844" CDS complement(2394650..2395237) /locus_tag="Rv2134c" /function="UNKNOWN" /note="Rv2134c, (MTCY270.34), len: 195 aa. Conserved hypothetical protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49789. FASTA best: Q49789 B2126_C3_228, opt: 1192, E( ): 0 (91.1% identity in 192 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216650.1" /db_xref="GI:15609271" /db_xref="GeneID:887844" /translation="MARAIHVFRTPDRFVAGTVGQPGNRTFYLQAVHDSRVVSVVLEK QQVAVLAERIGALLFEVNRRFGTPVPPEPTEIDDLSPLIMPVDAEFRVGTMGLGWDSE AQSVVVELLAVTDAEFDASVVLDDTEEGPDAVRVFLTPESARQFATRSYRVISAGRPP CPLCDEPLDPEGHICARTNGYRRDVLLGSGDDPAG" gene complement(2395301..2396011) /locus_tag="Rv2135c" /db_xref="GeneID:887273" CDS complement(2395301..2396011) /locus_tag="Rv2135c" /function="UNKNOWN" /note="Rv2135c, (MTCY270.33), len: 236 aa. Conserved hypothetical protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49773. FASTA best: Q49773 B2126_C1_148 opt: 1183, E() : 0; (74.8% identity in 250 aa overlap), also similar in C-terminus to PMG2_ECOLI P36942 probable phosphoglycerate mutase 2 (215 aa), FASTA scores; opt: 212, E(): 2.5e-07 27.9% identity in 190 aa overlap; and to Rv2228 and Rv2419c" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216651.1" /db_xref="GI:15609272" /db_xref="GeneID:887273" /translation="MTVILLRHARSTSNTAGVLAGRSGVDLDEKGREQATGLIDRIGD LPIRAVASSPMLRCQRTVEPLAEALCLEPLIDDRFSEVDYGEWTGRKIGDLVDEPLWR VVQAHPSAAVFPGGEGLAQVQTRAVAAVREHDRRLADQHGHDVLWLACTHGDVIKAVI ADAFGMHLDSFQRITADPGSVSVVRYTQLRPFVLHVNHTGARLAPALQAAASAQGASP EPNAAVPPGDAVIGGSTD" gene complement(2396008..2396838) /gene="uppP" /locus_tag="Rv2136c" /db_xref="GeneID:887612" CDS complement(2396008..2396838) /gene="uppP" /locus_tag="Rv2136c" /EC_number="3.6.1.27" /function="UNKNOWN" /note="BacA; phosphatase activity in Escherichia coli not kinase; involved in bacitracin resistance as bacitracin supposedly sequesters undecaprenyl disphosphate which reduces the pool of lipid carrier available to the cell" /codon_start=1 /transl_table=11 /product="undecaprenyl pyrophosphate phosphatase" /protein_id="NP_216652.1" /db_xref="GI:15609273" /db_xref="GeneID:887612" /translation="MSWWQVIVLAAAQGLTEFLPVSSSGHLAIVSRIFFSGDAGASFT AVSQLGTEAAVVIYFARDIVRILSAWLHGLVVKAHRNTDYRLGWYVIIGTIPICILGL FFKDDIRSGVRNLWVVVTALVVFSGVIALAEYVGRQSRHIERLTWRDAVVVGIAQTLA LVPGVSRSGSTISAGLFLGLDRELAARFGFLLAIPAVFASGLFSLPDAFHPVTEGMSA TGPQLLVATLIAFVLGLTAVAWLLRFLVRHNMYWFVGYRVLVGTGMLVLLATGTVAAT" gene complement(2396902..2397315) /locus_tag="Rv2137c" /db_xref="GeneID:888009" CDS complement(2396902..2397315) /locus_tag="Rv2137c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2137c, (MTCY270.31), len: 137 aa. Conserved hypothetical protein. C-terminus is very similar to hypothetical Mycobacterium leprae protein B2126_C2_188 (150 aa). FASTA best: Q49782 B2126_C2_188. (150 aa) opt: 469, E(): 9.6e-28; (77.2% identity in 101 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216653.1" /db_xref="GI:15609274" /db_xref="GeneID:888009" /translation="MRNMKSTSHESESGKLLSISSCRPREMVLQRYSLGMTVTADRHL ADKREEFAVEDISTGIFASGYGQVGDGRSFSFHIEHRSLVVEIYRPRVAGPVPQAEDV VAMAVRGLVDIDLTDERSLAAAVRDSVASAAPVSR" gene 2397330..2398406 /gene="lppL" /locus_tag="Rv2138" /db_xref="GeneID:887303" CDS 2397330..2398406 /gene="lppL" /locus_tag="Rv2138" /function="UNKNOWN" /note="Rv2138, (MTCY270.30c), len: 358 aa. Probable lppL, conserved lipoprotein, with appropriately placed lipoprotein signature (PS00013) strongly similar to hypothetical Mycobacterium leprae protein, Q49806. FASTA best: Q49806 B2126_F3_142. (298 aa) opt: 1495, E(): 0; (75.3% identity in 300 aa overlap)." /codon_start=1 /transl_table=11 /product="lipoprotein LppL" /protein_id="NP_216654.1" /db_xref="GI:15609275" /db_xref="GeneID:887303" /translation="MLTGNKPAVQRRFIGLLMLSVLVAGCSSNPLANFAPGYPPTIEP AQPAVSPPTSQDPAGAVRPLSGHPRAALFDNGTRQLVALRPGADSAAPASIMVFDDVH VAPRVIFLPGPAAALTSDDHGTAFLAARGGYFVADLSSGHTARVNVADAAHTDFTAIA RRSDGKLVLGSADGAVYTLAKNPAVDPASGAATVASRTKIFARVDALVTQGNTTVVLD RGQTSVTTIGADGHAQQALRAGQGATTMAADPLGRVLIADTRGGQLLVYGVDPLILRQ AYPVRQAPYGLAGSRELAWVSQTASNTVIGYDLTTGIPVEKVRYPTVQQPNSLAFDET SDTLYVVSGSGAGVQVIEHAAGTR" misc_feature 2397375..2397407 /gene="lppL" /locus_tag="Rv2138" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2398720..2399793 /gene="pyrD" /locus_tag="Rv2139" /db_xref="GeneID:887326" CDS 2398720..2399793 /gene="pyrD" /locus_tag="Rv2139" /EC_number="1.3.3.1" /function="pyrimidine biosynthesis" /note="catalyzes the conversion of dihydroorotate to orotate in the pyrimidine biosynthesis pathway; uses a flavin nucleotide as an essential cofactor; class 2 enzymes are monomeric and compared to the class 1 class 2 possess an extended N terminus, which plays a role in the membrane association of the enzyme and provides the binding site for the respiratory quinones that serve as physiological electron acceptors" /codon_start=1 /transl_table=11 /product="dihydroorotate dehydrogenase 2" /protein_id="NP_216655.1" /db_xref="GI:15609276" /db_xref="GeneID:887326" /translation="MYPLVRRLLFLIPPEHAHKLVFAVLRGVAAVAPVRRLLRRLLGP TDPVLASTVFGVRFPAPLGLAAGFDKDGTALSSWGAMGFGYAEIGTVTAHPQPGNPAP RLFRLADDRALLNRMGFNNHGARALAIRLARHRPEIPIGVNIGKTKKTPAGDAVNDYR ASARMVGPLASYLVVNVSSPNTPGLRDLQAVESLRPILSAVRAETSTPVLVKIAPDLS DSDLDDIADLAVELDLAGIVATNTTVSRDGLTTPGVDRLGPGGISGPPLAQRAVQVLR RLYDRVGDRLALISVGGIETADDAWERITAGASLLQGYTGFIYGGERWAKDIHEGIAR RLHDGGFGSLHEAVGSARRRQPS" misc_feature 2398987..2399031 /gene="pyrD" /locus_tag="Rv2139" /note="PS00911 Dihydroorotate dehydrogenase signature 1" misc_feature 2399581..2399643 /gene="pyrD" /locus_tag="Rv2139" /note="PS00912 Dihydroorotate dehydrogenase signature 2" gene complement(2399798..2400328) /gene="TB18.6" /locus_tag="Rv2140c" /db_xref="GeneID:888118" CDS complement(2399798..2400328) /gene="TB18.6" /locus_tag="Rv2140c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2140c, (MTCY270.28), len: 176 aa. TB18.6, conserved hypothetical protein; shows good similarity to hypothetical proteins from Streptomyces coelicolor (177 aa; 58% identity) >emb|CAC32358.1| (AL583945) and to 17.1 kDa Escherichia coli protein YbhB. FASTA best: YBHB_ECOLI P12994 hypothetical 17.1 kDa protein (158 aa) opt: 465 E( ): 2e-23; (46.2% identity in 156 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216656.1" /db_xref="GI:15609277" /db_xref="GeneID:888118" /translation="MTTSPDPYAALPKLPSFSLTSTSITDGQPLATPQVSGIMGAGGA DASPQLRWSGFPSETRSFAVTVYDPDAPTLSGFWHWAVANLPANVTELPEGVGDGREL PGGALTLVNDAGMRRYVGAAPPPGHGVHRYYVAVHAVKVEKLDLPEDASPAYLGFNLF QHAIARAVIFGTYEQR" gene complement(2400376..2401722) /locus_tag="Rv2141c" /db_xref="GeneID:887384" CDS complement(2400376..2401722) /locus_tag="Rv2141c" /function="UNKNOWN" /note="Rv2141c, (MTCY270.27), len: 448 aa. Conserved hypothetical protein. Shows some similarity to conserved hypothetical proteins and to acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase and contains ArgE/dapE/ACY1/CPG2/yscS family signature 1 (PS00758). FASTA best: CBPS_YEAST P27614 carboxypeptidases precursor (576 aa) opt: 234, E(): 4.3e-08; (24.3% identity in 412 aa overlap). Previously named dapE2; dapE2" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177864.1" /db_xref="GI:57116951" /db_xref="GeneID:887384" /translation="MTDETGASSDHSDDVAQVVSRLIRFDTTNSGEPGTTKGEAECAR WVAEQLAEVGYQPEYVESGAPGRGNVFARLAGADSSRGALLIHGHLDVVPAEPAEWSV HPFSGAIEDGYVWGRGAVDMKDMVGMMIVVARHLRQAAIVPPRDLVFAFVADEEHGGK YGSHWLVDNRPDLFDGITEAIGEVGGFSLTVPRHDGGERRLYLIETAEKGIQWMRLTA RGRAGHGSMVHDQNAVTAVCEAVARLGRHQFPLVCTDTVAQFLAVVGEETGLAFDLDS PDLAGTIDKLGPMARMLKAVLHDTANPTMLKAGYKANVVPATAEAVVDCRVLPGRRAA FEAEVDALIGPDVTREWVSDLPSYETTFDGDLVAAMNAAVLAVDPDGRTVPYMLSGGT DAKAFARLGIRCFGFSPLRLPPDLDFTSLFHGVDERVPIDGLRFGTEVLTHLLTHC" misc_feature complement(2401444..2401473) /locus_tag="Rv2141c" /note="PS00758 ArgE. dapE, ACY1, CPG2, yscS family signature 1" gene 2401987..2402072 /locus_tag="Rvnt22" /note="tRNA-Leu(GAG)" /db_xref="GeneID:2700467" tRNA 2401987..2402072 /locus_tag="Rvnt22" /product="tRNA-Leu" /note="codon recognized: CUC" /anticodon=(pos:2402020..2402022,aa:Leu) /db_xref="GeneID:2700467" gene complement(2402193..2402510) /locus_tag="Rv2142c" /db_xref="GeneID:888770" CDS complement(2402193..2402510) /locus_tag="Rv2142c" /function="UNKNOWN" /note="Rv2142c, (MTCY270.26), len: 105 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216658.1" /db_xref="GI:15609279" /db_xref="GeneID:888770" /translation="MTRRLRVHNGVEDDLFEAFSYYADAAPDQIDRLYNLFVDAVTKR IPQAPNAFAPLFKHYRHIYLRPFRYYVAYRTTDEAIDILAVRHGMENPNAVEAEISGR TFE" gene 2402977..2404035 /locus_tag="Rv2143" /db_xref="GeneID:887406" CDS 2402977..2404035 /locus_tag="Rv2143" /function="UNKNOWN" /note="Rv2143, (MTCY270.25c), len: 352 aa. Conserved hypothetical protein, strongly similar to two hypothetical mycobacterial proteins Rv2030c 2.1e-50 and Rv0571c from position 120 (Q50819; Q50111). FASTA best: Q50819 opt: 882, E() 0; (61.1% identity in 226 aa overlap). Also similar to AL021942|MTV039_9 (443 aa), FASTA scores: opt: 592, E(): 5e-30; 46.9% identity in 224 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216659.1" /db_xref="GI:15609280" /db_xref="GeneID:887406" /translation="MEAPPYAGDPTFERLRRSFQPADLLPELQAAGVHYTIAVEAADD PAENESLLATARHHDWIARVIGWVPLADPDEVTESSTHGRHRPDASWRRDLRCPGLLP PGCHQPVLVVGLVGQQPEMRPMNPPSGFLRRTPTRRFRDRRDAGRVLADELASYRGRD RLLVLGLARGGVPVGWEVASALGAELDVFLVRKLGVPQWRELAMGALASGGGVVMNDD VVSSLRITDQQVRAAIDSETAELQRRELAYRGGRPVVDPRARIVILVDDGIATGASML AAVRTIRATGPESIVVAVPVGPATACRELAAEADDVVCATMPAAFEAVGQVYNDFHQV TDDEVRELLATPTTGAAT" gene complement(2404165..2404521) /locus_tag="Rv2144c" /db_xref="GeneID:888202" CDS complement(2404165..2404521) /locus_tag="Rv2144c" /function="UNKNOWN" /note="Rv2144c, (MTCY270.24), len: 118 aa. Probable transmembrane protein." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216660.1" /db_xref="GI:15609281" /db_xref="GeneID:888202" /translation="MLIIALVLALIGLLALVFAVVTSNQLVAWVCIGASVLGVALLIV DALRERQQGGADEADGAGETGVAEEADVDYPEEAPEESQAVDAGVIGSEEPSEEASEA TEESAVSADRSDDSAK" gene complement(2404616..2405398) /gene="wag31" /locus_tag="Rv2145c" /db_xref="GeneID:888224" CDS complement(2404616..2405398) /gene="wag31" /locus_tag="Rv2145c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2145c, (MTCY270.23), len: 260 aa. wag31 (alternate gene name: ag84). Function unknown but corresponds to antigen 84 of Mycobacterium tuberculosis (wag31) (see Hermans et al., 1995). Predicted to contain significant amount of coiled coil structure. Some similarity to Rv1682 and Rv2927c. FASTA best: AG84_MYCTU P46816 antigen 84.; ag84" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216661.1" /db_xref="GI:15609282" /db_xref="GeneID:888224" /translation="MPLTPADVHNVAFSKPPIGKRGYNEDEVDAFLDLVENELTRLIE ENSDLRQRINELDQELAAGGGAGVTPQATQAIPAYEPEPGKPAPAAVSAGMNEEQALK AARVLSLAQDTADRLTNTAKAESDKMLADARANAEQILGEARHTADATVAEARQRADA MLADAQSRSEAQLRQAQEKADALQADAERKHSEIMGTINQQRAVLEGRLEQLRTFERE YRTRLKTYLESQLEELGQRGSAAPVDSNADAGGFDQFNRGKN" gene complement(2405666..2405956) /locus_tag="Rv2146c" /db_xref="GeneID:888759" CDS complement(2405666..2405956) /locus_tag="Rv2146c" /function="UNKNOWN" /note="Rv2146c, (MTCY270.22), len: 96 aa. Possible conserved transmembrane protein, orthologs present in M. leprae, ML0921 (96 aa) and Streptomyces coelicolor. Second start taken GTG alternative upstream but much less probable in TBparse. FASTA best: Q44935 SIMILAR TO A HYPOTHETICAL INTEGRAL MEMBRANE PROT EIN (97 aa) opt: 105, E(): 0.093; (25.3% identity in 87 aa overlap). >emb|CAC31302.1| (AL583920) possible membrane protein ML0921 [Mycobacterium leprae] E(): 5e-32 (76% identity in 96 aa overlap)" /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216662.1" /db_xref="GI:15609283" /db_xref="GeneID:888759" /translation="MVVFFQILGFALFIFWLLLIARVVVEFIRSFSRDWRPTGVTVVI LEIIMSITDPPVKVLRRLIPQLTIGAVRFDLSIMVLLLVAFIGMQLAFGAAA" gene complement(2406118..2406843) /locus_tag="Rv2147c" /db_xref="GeneID:887394" CDS complement(2406118..2406843) /locus_tag="Rv2147c" /function="UNKNOWN" /note="Rv2147c, (MTCY270.21), len: 241 aa. Conserved hypothetical protein, similar to conserved hypothetical proteins in Mycobacterium leprae ML0920 (210 aa) and Streptomyces coelicolor. FASTA scores: >emb|CAC31301.1| (AL583920) hypothetical protein ML0920 hypothetical protein (210 aa) opt: 1242, E(): 5.7e-74; 83.486% identity in 218 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216663.1" /db_xref="GI:15609284" /db_xref="GeneID:887394" /translation="MNSHCSHTFITDNRSPRARRGHAMSTLHKVKAYFGMAPMEDYDD EYYDDRAPSRGYARPRFDDDYGRYDGRDYDDARSDSRGDLRGEPADYPPPGYRGGYAD EPRFRPREFDRAEMTRPRFGSWLRNSTRGALAMDPRRMAMMFEDGHPLSKITTLRPKD YSEARTIGERFRDGSPVIMDLVSMDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLS PADVDVSPEERRRIAETGFYAYQ" gene complement(2406840..2407616) /locus_tag="Rv2148c" /db_xref="GeneID:888127" CDS complement(2406840..2407616) /locus_tag="Rv2148c" /function="UNKNOWN" /note="Rv2148c, (MTCY270.20), len: 258 aa. Conserved hypothetical protein; should belong to the YGGS/YBL036C/F09E5.8 family. FASTA best: AB003132|AB003132_5 Corynebacterium glutamicum gene (221 aa) opt: 440, E(): 2.3e-23; 42.8% identity in 236 aa overlap; and YPI1_VIBAL P52055 hypothetical protein in pilt-proc intergenic region in Vibrio alginolyticus. opt: 266, E(): 1.8e-11; 27.9% identity in 244 aa overlap. TBparse score is 0.940" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216664.1" /db_xref="GI:15609285" /db_xref="GeneID:888127" /translation="MAADLSAYPDRESELTHALAAMRSRLAAAAEAAGRNVGEIELLP ITKFFPATDVAILFRLGCRSVGESREQEASAKMAELNRLLAAAELGHSGGVHWHMVGR IQRNKAGSLARWAHTAHSVDSSRLVTALDRAVVAALAEHRRGERLRVYVQVSLDGDGS RGGVDSTTPGAVDRICAQVQESEGLELVGLMGIPPLDWDPDEAFDRLQSEHNRVRAMF PHAIGLSAGMSNDLEVAVKHGSTCVRVGTALLGPRRLRSP" gene complement(2407622..2408374) /gene="yfiH" /locus_tag="Rv2149c" /db_xref="GeneID:888121" CDS complement(2407622..2408374) /gene="yfiH" /locus_tag="Rv2149c" /function="UNKNOWN" /note="Rv2149c, (MTCY270.19), len: 250 aa. yfiH; corresponds to hypothetical 25.3 kDa YfiH protein in ftsZ 3' region of Streptomyces griseus, and to YfiH proteins in other bacteria. Belongs to UPF0124 Family. FASTA best: YFIH_STRGR P45496, (246 aa) opt: 722, E(): 1.9e-37; (49.4% identity in 245 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216665.1" /db_xref="GI:15609286" /db_xref="GeneID:888121" /translation="MLASTRHIARGDTGNVSVRIRRVTTTRAGGVSAPPFDTFNLGDH VGDDPAAVAANRARLAAAIGLPGNRVVWMNQVHGDRVELVDQPRNTALDDTDGLVTAT PRLALAVVTADCVPVLMADARAGIAAAVHAGRAGAQRGVVVRALEVMLSLGAQVRDIS ALLGPAVSGRNYEVPAAMADEVEAALPGSRTTTAAGTPGVDLRAGIACQLRDLGVESI DVDPRCTVADPTLFSHRRDAPTGRFASLVWME" gene complement(2408385..2409524) /gene="ftsZ" /locus_tag="Rv2150c" /db_xref="GeneID:888369" CDS complement(2408385..2409524) /gene="ftsZ" /locus_tag="Rv2150c" /function="Essential for cell division. It is thought that the intracellular concentration of FtsZ protein is critical for productive septum formation in mycobacteria." /experiment="experimental evidence, no additional details recorded" /note="GTPase; similar structure to tubulin; forms ring-shaped polymers at the site of cell division; other proteins such as FtsA, ZipA, and ZapA, interact with and regulate FtsZ function" /codon_start=1 /transl_table=11 /product="cell division protein FtsZ" /protein_id="NP_216666.1" /db_xref="GI:15609287" /db_xref="GeneID:888369" /translation="MTPPHNYLAVIKVVGIGGGGVNAVNRMIEQGLKGVEFIAINTDA QALLMSDADVKLDVGRDSTRGLGAGADPEVGRKAAEDAKDEIEELLRGADMVFVTAGE GGGTGTGGAPVVASIARKLGALTVGVVTRPFSFEGKRRSNQAENGIAALRESCDTLIV IPNDRLLQMGDAAVSLMDAFRSADEVLLNGVQGITDLITTPGLINVDFADVKGIMSGA GTALMGIGSARGEGRSLKAAEIAINSPLLEASMEGAQGVLMSIAGGSDLGLFEINEAA SLVQDAAHPDANIIFGTVIDDSLGDEVRVTVIAAGFDVSGPGRKPVMGETGGAHRIES AKAGKLTSTLFEPVDAVSVPLHTNGATLSIGGDDDDVDVPPFMRR" misc_feature complement(2409180..2409236) /gene="ftsZ" /locus_tag="Rv2150c" /note="PS01135 FtsZ protein signature 2" gene complement(2409697..2410641) /gene="ftsQ" /locus_tag="Rv2151c" /db_xref="GeneID:888367" CDS complement(2409697..2410641) /gene="ftsQ" /locus_tag="Rv2151c" /function="THIS PROTEIN MAY BE INVOLVED IN SEPTUM FORMATION (BY SIMILARITY)." /note="Rv2151c, (MTCY270.17), len: 314 aa. Possible ftsQ, cell division protein, with some homology to FTSQ_STRGR|P45503 cell division protein ftsq homolog from Streptomyces griseus (208 aa), FASTA scores: opt: 204, E(): 4e-05; (30.6% identity in 193 aa overlap)." /codon_start=1 /transl_table=11 /product="cell division protein FtsQ" /protein_id="NP_216667.1" /db_xref="GI:15609288" /db_xref="GeneID:888367" /translation="MTEHNEDPQIERVADDAADEEAVTEPLATESKDEPAEHPEFEGP RRRARRERAERRAAQARATAIEQARRAAKRRARGQIVSEQNPAKPAARGVVRGLKALL ATVVLAVVGIGLGLALYFTPAMSAREIVIIGIGAVSREEVLDAARVRPATPLLQIDTQ QVADRVATIRRVASARVQRQYPSALRITIVERVPVVVKDFSDGPHLFDRDGVDFATDP PPPALPYFDVDNPGPSDPTTKAALQVLTALHPEVASQVGRIAAPSVASITLTLADGRV VIWGTTDRCEEKAEKLAALLTQPGRTYDVSSPDLPTVK" gene complement(2410638..2412122) /gene="murC" /locus_tag="Rv2152c" /db_xref="GeneID:887983" CDS complement(2410638..2412122) /gene="murC" /locus_tag="Rv2152c" /EC_number="6.3.2.8" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS" /note="Catalyzes the formation of UDP-N-acetylmuramoyl-L-alanine from UDP-N-acetylmuramate and L-alanine in peptidoglycan synthesis" /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramate--L-alanine ligase" /protein_id="NP_216668.1" /db_xref="GI:15609289" /db_xref="GeneID:887983" /translation="MSTEQLPPDLRRVHMVGIGGAGMSGIARILLDRGGLVSGSDAKE SRGVHALRARGALIRIGHDASSLDLLPGGATAVVTTHAAIPKTNPELVEARRRGIPVV LRPAVLAKLMAGRTTLMVTGTHGKTTTTSMLIVALQHCGLDPSFAVGGELGEAGTNAH HGSGDCFVAEADESDGSLLQYTPHVAVITNIESDHLDFYGSVEAYVAVFDSFVERIVP GGALVVCTDDPGGAALAQRATELGIRVLRYGSVPGETMAATLVSWQQQGVGAVAHIRL ASELATAQGPRVMRLSVPGRHMALNALGALLAAVQIGAPADEVLDGLAGFEGVRRRFE LVGTCGVGKASVRVFDDYAHHPTEISATLAAARMVLEQGDGGRCMVVFQPHLYSRTKA FAAEFGRALNAADEVFVLDVYGAREQPLAGVSGASVAEHVTVPMRYVPDFSAVAQQVA AAASPGDVIVTMGAGDVTLLGPEILTALRVRANRSAPGRPGVLG" gene complement(2412119..2413351) /gene="murG" /locus_tag="Rv2153c" /db_xref="GeneID:888036" CDS complement(2412119..2413351) /gene="murG" /locus_tag="Rv2153c" /EC_number="2.4.1.227" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS." /note="UDP-N-acetylglucosamine--N-acetylmuramyl- (pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase; involved in cell wall formation; inner membrane-associated; last step of peptidoglycan synthesis" /codon_start=1 /transl_table=11 /product="undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase" /protein_id="NP_216669.1" /db_xref="GI:15609290" /db_xref="GeneID:888036" /translation="MKDTVSQPAGGRGATAPRPADAASPSCGSSPSADSVSVVLAGGG TAGHVEPAMAVADALVALDPRVRITALGTLRGLETRLVPQRGYHLELITAVPMPRKPG GDLARLPSRVWRAVREARDVLDDVDADVVVGFGGYVALPAYLAARGLPLPPRRRRRIP VVIHEANARAGLANRVGAHTADRVLSAVPDSGLRRAEVVGVPVRASIAALDRAVLRAE ARAHFGFPDDARVLLVFGGSQGAVSLNRAVSGAAADLAAAGVCVLHAHGPQNVLELRR RAQGDPPYVAVPYLDRMELAYAAADLVICRAGAMTVAEVSAVGLPAIYVPLPIGNGEQ RLNALPVVNAGGGMVVADAALTPELVARQVAGLLTDPARLAAMTAAAARVGHRDAAGQ VARAALAVATGAGARTTT" gene complement(2413348..2414922) /gene="ftsW" /locus_tag="Rv2154c" /db_xref="GeneID:887916" CDS complement(2413348..2414922) /gene="ftsW" /locus_tag="Rv2154c" /function="UNKNOWN" /note="Rv2154c, (MTCY270.14), len: 524 aa. Probable ftsW, cell division protein, related to MTCY10H4.17c, 3.2e-17. FASTA best: SP5E_BACSU P07373 stage V sporulation protein E (366 aa) opt: 755, E(): 1.6e-33; (38.4% identity in 357 aa overlap)" /codon_start=1 /transl_table=11 /product="FtsW-like protein FtsW" /protein_id="NP_216670.1" /db_xref="GI:15609291" /db_xref="GeneID:887916" /translation="MLTRLLRRGTSDTDGSQTRGAEPVEGQRTGPEEASNPGSARPRT RFGAWLGRPMTSFHLIIAVAALLTTLGLIMVLSASAVRSYDDDGSAWVIFGKQVLWTL VGLIGGYVCLRMSVRFMRRIAFSGFAITIVMLVLVLVPGIGKEANGSRGWFVVAGFSM QPSELAKMAFAIWGAHLLAARRMERASLREMLIPLVPAAVVALALIVAQPDLGQTVSM GIILLGLLWYAGLPLRVFLSSLAAVVVSAAILAVSAGYRSDRVRSWLNPENDPQDSGY QARQAKFALAQGGIFGDGLGQGVAKWNYLPNAHNDFIFAIIGEELGLVGALGLLGLFG LFAYTGMRIASRSADPFLRLLTATTTLWVLGQAFINIGYVIGLLPVTGLQLPLISAGG TSTAATLSLIGIIANAARHEPEAVAALRAGRDDKVNRLLRLPLPEPYLPPRLEAFRDR KRANPQPAQTQPARKTPRTAPGQPARQMGLPPRPGSPRTADPPVRRSVHHGAGQRYAG QRRTRRVRALEGQRYG" gene complement(2414934..2416394) /gene="murD" /locus_tag="Rv2155c" /db_xref="GeneID:888000" CDS complement(2414934..2416394) /gene="murD" /locus_tag="Rv2155c" /EC_number="6.3.2.9" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS." /note="UDP-N-acetylmuramoylalanine--D-glutamate ligase; involved in peptidoglycan biosynthesis; cytoplasmic; catalyzes the addition of glutamate to the nucleotide precursor UDP-N-acetylmuramoyl-L-alanine during cell wall formation" /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase" /protein_id="NP_216671.1" /db_xref="GI:15609292" /db_xref="GeneID:888000" /translation="MLDPLGPGAPVLVAGGRVTGQAVAAVLTRFGATPTVCDDDPVML RPHAERGLPTVSSSDAVQQITGYALVVASPGFSPATPLLAAAAAAGVPIWGDVELAWR LDAAGCYGPPRSWLVVTGTNGKTTTTSMLHAMLIAGGRRAVLCGNIGSAVLDVLDEPA ELLAVELSSFQLHWAPSLRPEAGAVLNIAEDHLDWHATMAEYTAAKARVLTGGVAVAG LDDSRAAALLDGSPAQVRVGFRLGEPAARELGVRDAHLVDRAFSDDLTLLPVASIPVP GPVGVLDALAAAALARSVGVPAGAIADAVTSFRVGRHRAEVVAVADGITYVDDSKATN PHAARASVLAYPRVVWIAGGLLKGASLHAEVAAMASRLVGAVLIGRDRAAVAEALSRH APDVPVVQVVAGEDTGMPATVEVPVACVLDVAKDDKAGETVGAAVMTAAVAAARRMAQ PGDTVLLAPAGASFDQFTGYADRGEAFATAVRAVIR" misc_feature complement(2415978..2416049) /gene="murD" /locus_tag="Rv2155c" /note="PS01011 Folylpolyglutamate synthase signature 1" gene complement(2416396..2417475) /gene="mraY" /locus_tag="Rv2156c" /db_xref="GeneID:888098" CDS complement(2416396..2417475) /gene="mraY" /locus_tag="Rv2156c" /EC_number="2.7.8.13" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS." /note="First step of the lipid cycle reactions in the biosynthesis of the cell wall peptidoglycan" /codon_start=1 /transl_table=11 /product="phospho-N-acetylmuramoyl-pentapeptide- transferase" /protein_id="NP_216672.1" /db_xref="GI:15609293" /db_xref="GeneID:888098" /translation="MRQILIAVAVAVTVSILLTPVLIRLFTKQGFGHQIREDGPPSHH TKRGTPSMGGVAILAGIWAGYLGAHLAGLAFDGEGIGASGLLVLGLATALGGVGFIDD LIKIRRSRNLGLNKTAKTVGQITSAVLFGVLVLQFRNAAGLTPGSADLSYVREIATVT LAPVLFVLFCVVIVSAWSNAVNFTDGLDGLAAGTMAMVTAAYVLITFWQYRNACVTAP GLGCYNVRDPLDLALIAAATAGACIGFLWWNAAPAKIFMGDTGSLALGGVIAGLSVTS RTEILAVVLGALFVAEITSVVLQILTFRTTGRRMFRMAPFHHHFELVGWAETTVIIRF WLLTAITCGLGVALFYGEWLAAVGA" gene complement(2417472..2419004) /gene="murF" /locus_tag="Rv2157c" /db_xref="GeneID:887826" CDS complement(2417472..2419004) /gene="murF" /locus_tag="Rv2157c" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS." /note="deleted EC_number 6.3.2.15; Rv2157c, (MTCY270.11), len: 510 aa. Probable murF, UDP-N-acetylmuramoylalanyl-D-glutamyl-2,6-diaminopimelate -D -alanyl-D-alanyl ligase (EC 6.3.2.15) (UDP-MURNAC-PENTAPEPTIDE SYNTHETASE) (see citation below), also related to other Mycobacterium tuberculosis mur gene products. FASTA best: MURF_ECOLI|P11880 (452 aa), opt: 515, E(): 2.6e-24, (31.9% identity in 511 aa overlap)." /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate- D-alanyl-D-alanyl ligase MurF" /protein_id="NP_216673.1" /db_xref="GI:15609294" /db_xref="GeneID:887826" /translation="MIELTVAQIAEIVGGAVADISPQDAAHRRVTGTVEFDSRAIGPG GLFLALPGARADGHDHAASAVAAGAAVVLAARPVGVPAIVVPPVAAPNVLAGVLEHDN DGSGAAVLAALAKLATAVAAQLVAGGLTIIGITGSSGKTSTKDLMAAVLAPLGEVVAP PGSFNNELGHPWTVLRATRRTDYLILEMAARHHGNIAALAEIAPPSIGVVLNVGTAHL GEFGSREVIAQTKAELPQAVPHSGAVVLNADDPAVAAMAKLTAARVVRVSRDNTGDVW AGPVSLDELARPRFTLHAHDAQAEVRLGVCGDHQVTNALCAAAVALECGASVEQVAAA LTAAPPVSRHRMQVTTRGDGVTVIDDAYNANPDSMRAGLQALAWIAHQPEATRRSWAV LGEMAELGEDAIAEHDRIGRLAVRLDVSRLVVVGTGRSISAMHHGAVLEGAWGSGEAT ADHGADRTAVNVADGDAALALLRAELRPGDVVLVKASNAAGLGAVADALVADDTCGSV RP" gene complement(2419001..2420608) /gene="murE" /locus_tag="Rv2158c" /db_xref="GeneID:887252" CDS complement(2419001..2420608) /gene="murE" /locus_tag="Rv2158c" /EC_number="6.3.2.13" /function="INVOLVED IN CELL WALL FORMATION; PEPTIDOGLYCAN BIOSYNTHESIS." /note="involved in cell wall formation; peptidoglycan synthesis; cytoplasmic enzyme; catalyzes the addition of meso-diaminopimelic acid to the nucleotide precursor UDP-N-aceylmuramoyl-l-alanyl-d-glutamate" /codon_start=1 /transl_table=11 /product="UDP-N-acetylmuramoylalanyl-D-glutamate--2, 6-diaminopimelate ligase" /protein_id="NP_216674.1" /db_xref="GI:15609295" /db_xref="GeneID:887252" /translation="MSSLARGISRRRTEVATQVEAAPTGLRPNAVVGVRLAALADQVG AALAEGPAQRAVTEDRTVTGVTLRAQDVSPGDLFAALTGSTTHGARHVGDAIARGAVA VLTDPAGVAEIAGRAAVPVLVHPAPRGVLGGLAATVYGHPSERLTVIGITGTSGKTTT TYLVEAGLRAAGRVAGLIGTIGIRVGGADLPSALTTPEAPTLQAMLAAMVERGVDTVV MEVSSHALALGRVDGTRFAVGAFTNLSRDHLDFHPSMADYFEAKASLFDPDSALRART AVVCIDDDAGRAMAARAADAITVSAADRPAHWRATDVAPTDAGGQQFTAIDPAGVGHH IGIRLPGRYNVANCLVALAILDTVGVSPEQAVPGLREIRVPGRLEQIDRGQGFLALVD YAHKPEALRSVLTTLAHPDRRLAVVFGAGGDRDPGKRAPMGRIAAQLADLVVVTDDNP RDEDPTAIRREILAGAAEVGGDAQVVEIADRRDAIRHAVAWARPGDVVLIAGKGHETG QRGGGRVRPFDDRVELAAALEALERRA" gene complement(2420631..2421665) /locus_tag="Rv2159c" /db_xref="GeneID:887236" CDS complement(2420631..2421665) /locus_tag="Rv2159c" /function="UNKNOWN" /note="Rv2159c, (MTCY270.09), len: 344 aa. Conserved hypothetical protein; some similarity to hypothetical protein from Streptomyces coelicolor SC1A6.09c (337 aa, 29% identity). Smith-Waterman scores: >pir||T28690 hypothetical protein - Streptomyces coelicolor >gi|3127841|emb|CAA18907.1| (AL023496) Expect = 2e-18" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216675.1" /db_xref="GI:15609296" /db_xref="GeneID:887236" /translation="MKFVNHIEPVAPRRAGGAVAEVYAEARREFGRLPEPLAMLSPDE GLLTAGWATLRETLLVGQVPRGRKEAVAAAVAASLRCPWCVDAHTTMLYAAGQTDTAA AILAGTAPAAGDPNAPYVAWAAGTGTPAGPPAPFGPDVAAEYLGTAVQFHFIARLVLV LLDETFLPGGPRAQQLMRRAGGLVFARKVRAEHRPGRSTRRLEPRTLPDDLAWATPSE PIATAFAALSHHLDTAPHLPPPTRQVVRRVVGSWHGEPMPMSSRWTNEHTAELPADLH APTRLALLTGLAPHQVTDDDVAAARSLLDTDAALVGALAWAAFTAARRIGTWIGAAAE GQVSRQNPTG" gene complement(2421643..2422278) /locus_tag="Rv2160A" /db_xref="GeneID:3205077" CDS complement(2421643..2422278) /locus_tag="Rv2160A" /function="UNKNOWN" /note="Rv2160A, len: 211 aa. Conserved hypothetical protein, possibly a tetR-family transcriptional regulator, similar to N-terminal half of AL512667_12|Q9AD73|SCK31.01c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (200 aa), FASTA scores: opt: 285, E(): 1.4e-08, (51.042% identity in 96 aa overlap). Next gene, Rv2160c, is similar to C-terminal half of 2SCK31.01c suggesting possible frameshift near 2421978 but sequence of this region has been checked and is also identical in strain CDC1551." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177660.1" /db_xref="GI:57116952" /db_xref="GeneID:3205077" /translation="MPSADVGRQTRAQILRAAMDIASVKGLSGLSIGELAGRLGMSKS GLFRHFGAKEQLQLATVEAAVSVFEAEVVAPAMAAPPGVDRVRALMHAWVGYLERDVP AAAFSRPRPPTWTHSLARCATASPRPGGPESPPSRPTSKRRNAGARSGRISKCANSRS SCTPTRWRPTGRCCCSTTTAPESGRERRSTRPWPESAPPRRESNHEICQPY" gene complement(2421662..2422003) /locus_tag="Rv2160c" /db_xref="GeneID:886254" CDS complement(2421662..2422003) /locus_tag="Rv2160c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2160c, (MTCY270.08), len: 113 aa. Conserved hypothetical protein, possibly a tetR-family transcriptional regulator, similar to C-terminal half of AL512667_12|Q9AD73|SCK31.01c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (200 aa), while Rv2160A is similar to the N-terminal half of 2SCK31.01c. This suggests possible frameshift near 2421978 but sequence of this region has been checked and is also identical in strain CDC1551." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216676.1" /db_xref="GI:15609297" /db_xref="GeneID:886254" /translation="MGRIPGTRRAGGCFFAAAAADVDSQPGPVRDRIAATGRAGIAAI TADVETAQRRGEIRADIEVRQLAFELHAYAMEANWALLLLDDDGAGERARTAIDAALA RVGTTQEGVES" gene complement(2422271..2423137) /locus_tag="Rv2161c" /db_xref="GeneID:887978" CDS complement(2422271..2423137) /locus_tag="Rv2161c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2161c, (MTCY270.07), len: 288 aa. Conserved hypothetical protein; shows some similarity to protein involved in lincomycin production and to other M. tuberculosis proteins e.g. Rv0953c, Rv0791c, Rv0132c, Rv2951c, Rv1855c. FASTA best: Q54379 (78-11) LINCOMYCIN PRODUCTION GENES (295 aa) opt: 243, E(): 2.4e-09; (29.5% identity in 285 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216677.1" /db_xref="GI:15609298" /db_xref="GeneID:887978" /translation="MLVSLMQFVTDLTPPPQLVAVWAEERGFAGLYVPEKTHVPISRS TPWPGGELPDWYRRCYDPVVALAAAAAVTTRLRVGTGACLVAVHDPILLAKQIASLCA MSGERFVLGVGFGWNVEELADHGVPFADRIAVTVDKLAAMRALWAAEPVHYEGTHASV PPSWAWPKPAVAPPVLFGCRPSARAFEVIARHGDGWQPIEGYGELLGALPMLHAAFER AGRDPATAQVCVYSSAGDPATLHEYRRAGVAEVALALPSAGRDQVLAALDRSAPLVDA FAGDDREVKSHA" gene complement(2423240..2424838) /gene="PE_PGRS38" /locus_tag="Rv2162c" /db_xref="GeneID:887300" CDS complement(2423240..2424838) /gene="PE_PGRS38" /locus_tag="Rv2162c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2162c, (MTCY270.06), len: 532 aa. Member of M. tuberculosis PE_PGRS family (see citations below). FASTA score: Y03A_MYCTU Q 10637 hypothetical glycine-rich 49.6 kDa protein (603 aa) op t: 1798 z-score: 1220.0 E(): 0; (55.4% identity in 590 aa overlap)" /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177865.1" /db_xref="GI:57116953" /db_xref="GeneID:887300" /translation="MSFVIAAPEVMAAAATDLANIGSSISAASAAAAGPTMGILAAGA DEVSVAISALFGSHAQGYQTLSAQLAAYHNQFVRALNAGAGSYASAEAANVQQTLLNA INAPTQTLLGRPLIGNGADGGPGQNGGPGGLLYGNGGNGGAGDTANPNGGNGGSAGLI GNGGAGGAGAATGAGGAGGNGGWLYGNGGPGGAAGLGTAGGVSPAGGAGGAAGLWGHG GAGGAGGSASGAPGAGGAGGDGGRGGLLYGDGGAGGAGGNGSNGVTGVHGGNGGAGGA AGLIGNGGAGGDGGNGGLSNTGASGGAGGAGGAALIGNGGDGGHGGNGGHGNSGGAGG AGGAGGAGGAGGHVGLIGNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFYGN GGVGGRGGNGGFSSAGTSGGDGGIGGAGGIGGLIGSGGGGGDGGNGGQAPTPGNAGDG GAGGNARLIGDGGRGGNGGEGGDGPPGVKGDGGNGGNGGNAVVIGNGGNGGAGGFGIP VGSGGAGGSRGVLFGTPGANGADG" gene complement(2425048..2427087) /gene="pbpB" /locus_tag="Rv2163c" /db_xref="GeneID:887949" CDS complement(2425048..2427087) /gene="pbpB" /locus_tag="Rv2163c" /function="Involved in peptidoglycan biosynthesis." /note="Rv2163c, (MTCY270.05), len: 679 aa. Probable pbpB, penicillin-binding membrane protein, similar to many bacterial PBP2 proteins e.g. P11882|PBP2_NEIME|PENA|NMA2072|NMB0413 penicillin-binding protein 2 (pbp-2) from Neisseria meningitidis (serogroups A and B) (581 aa), FASTA scores: opt: 665, E(): 1.6e-31, (33.2% identity in 591 aa overlap); etc. Also similar to Rv0016c and Rv2864c from Mycobacterium tuberculosis (2.8e-10). Contains PS00017 possible ATP/GTP-binding site motif A (P-loop) near C-terminus. FASTA best: PBP2_NEIME P11882 penicillin-binding protein 2 (pbp-2). (581 aa) opt: 665, E(): 1.6e-31; (33 .2% identity in 591 aa overlap)" /codon_start=1 /transl_table=11 /product="penicillin-binding membrane protein pbpB" /protein_id="NP_216679.1" /db_xref="GI:15609300" /db_xref="GOA:O06214" /db_xref="UniProtKB/TrEMBL:O06214" /db_xref="GeneID:887949" /translation="MSRAAPRRASQSQSTRPARGLRRPPGAQEVGQRKRPGKTQKARQ AQEATKSRPATRSDVAPAGRSTRARRTRQVVDVGTRGASFVFRHRTGNAVILVLMLVA ATQLFFLQVSHAAGLRAQAAGQLKVTDVQPAARGSIVDRNNDRLAFTIEARALTFQPK RIRRQLEEARKKTSAAPDPQQRLRDIAQEVAGKLNNKPDAAAVLKKLQSDETFVYLAR AVDPAVASAICAKYPEVGAERQDLRQYPGGSLAANVVGGIDWDGHGLLGLEDSLDAVL AGTDGSVTYDRGSDGVVIPGSYRNRHKAVHGSTVVLTLDNDIQFYVQQQVQQAKNLSG AHNVSAVVLDAKTGEVLAMANDNTFDPSQDIGRQGDKQLGNPAVSSPFEPGSVNKIVA ASAVIEHGLSSPDEVLQVPGSIQMGGVTVHDAWEHGVMPYTTTGVFGKSSNVGTLMLS QRVGPERYYDMLRKFGLGQRTGVGLPGESAGLVPPIDQWSGSTFANLPIGQGLSMTLL QMTGMYQAIANDGVRVPPRIIKATVAPDGSRTEEPRPDDIRVVSAQTAQTVRQMLRAV VQRDPMGYQQGTGPTAGVPGYQMAGKTGTAQQINPGCGCYFDDVYWITFAGIATADNP RYVIGIMLDNPARNSDGAPGHSAAPLFHNIAGWLMQRENVPLSPDPGPPLVLQAT" misc_feature complement(2425309..2425332) /gene="pbpB" /locus_tag="Rv2163c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(2427084..2428238) /locus_tag="Rv2164c" /db_xref="GeneID:887217" CDS complement(2427084..2428238) /locus_tag="Rv2164c" /function="UNKNOWN" /note="Rv2164c, (MTCY270.04), len: 384 aa. Probable pro- rich conserved membrane protein, equivalent to ML0907|AL022602 putative conserved membrane protein from Mycobacterium leprae (377 aa) (AL022602), FASTA scores: opt: 1495, E(): 1.7e-56, (62.217% identity in 397 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216680.1" /db_xref="GI:15609301" /db_xref="UniProtKB/TrEMBL:O06213" /db_xref="GeneID:887217" /translation="MRAKREAPKSRSSDRRRRADSPAAATRRTTTNSAPSRRIRSRAG KTSAPGRQARVSRPGPQTSPMLSPFDRPAPAKNTSQAKARAKARKAKAPKLVRPTPME RLAARLTSIDLRPRTLANKVPFVVLVIGSLGVGLGLTLWLSTDAAERSYQLSNARERT RMLQQHKEALERDVREAASAPALAEAARRQGMIPTRDTAHLVQDPDGNWVVVGTPKPA DGVPPPPLNTKLPEDPPPPPKPAAVPLEVPVRVTPGPDDPAPPARSGPEVLVRTPDGT ATLGGATHLPTQAGPQLPGPVPIPGAPGPMPAPPLGAVPSPAPAENPVPLQVGAAPPA GLPGPAPVAATPGLSGGSQPMVAPPAPVPANGEQFGPVTAPVPTAPGAPR" gene complement(2428235..2429269) /gene="mraW" /locus_tag="Rv2165c" /db_xref="GeneID:888462" CDS complement(2428235..2429269) /gene="mraW" /locus_tag="Rv2165c" /function="UNKNOWN" /note="Rv2165c, (MTCY270.03), len: 396 aa. Conserved hypothetical protein; shows strong similarity to several hypothetical bacterial proteins but has extra 80 aa residues at N-terminus FASTA best: YLXA_BACSU Q07876 hypothetical 35.3 kDa protein in ftsl (311 aa) opt: 781, E(): 0; (45.6% identity in 296 aa overlap), BELONGS TO THE YABC (E.COLI), YLXA (B.SUBTILIS) FAMILY" /codon_start=1 /transl_table=11 /product="S-adenosyl-methyltransferase MraW" /protein_id="NP_216681.2" /db_xref="GI:161352465" /db_xref="GOA:P65429" /db_xref="UniProtKB/Swiss-Prot:P65429" /db_xref="GeneID:888462" /translation="MADPGSGPTGFGHVPVLAQRCFELLTPALTRYYPDGSQAVLLDA TIGAGGHAERFLEGLPGLRLIGLDRDPTALDVARSRLVRFADRLTLVHTRYDCLGAAL AESGYAAVGSVDGILFDLGVSSMQLDRAERGFAYATDAPLDMRMDPTTPLTAADIVNT YDEAALADILRRYGEERFARRIAAGIVRRRAKTPFTSTAELVALLYQAIPAPARRVGG HPAKRTFQALRIAVNDELESLRTAVPAALDALAIGGRIAVLAYQSLEDRIVKRVFAEA VASATPAGLPVELPGHEPRFRSLTHGAERASVAEIERNPRSTPVRLRALQRVEHRAQS QQWATEKGDS" gene complement(2429427..2429858) /locus_tag="Rv2166c" /db_xref="GeneID:888261" CDS complement(2429427..2429858) /locus_tag="Rv2166c" /function="UNKNOWN" /note="MraZ; UPF0040; crystal structure shows similarity to AbrB" /codon_start=1 /transl_table=11 /product="cell division protein MraZ" /protein_id="NP_216682.1" /db_xref="GI:15609303" /db_xref="GOA:P65436" /db_xref="UniProtKB/Swiss-Prot:P65436" /db_xref="GeneID:888261" /translation="MFLGTYTPKLDDKGRLTLPAKFRDALAGGLMVTKSQDHSLAVYP RAAFEQLARRASKAPRSNPEARAFLRNLAAGTDEQHPDSQGRITLSADHRRYASLSKD CVVIGAVDYLEIWDAQAWQNYQQIHEENFSAASDEALGDIF" repeat_region complement(2430117..2431471) /note="IS6110-6, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-6" repeat_region complement(2430117..2430144) /note="28 bp Inverted repeat at the left end of IS6110; GAGTCTCCGGACTCACCGGGGCGGTTCA" gene complement(2430159..2431199) /locus_tag="Rv2167c" /db_xref="GeneID:888197" CDS complement(2430159..>2431199) /locus_tag="Rv2167c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2167c, (MTCY270.01), len: 346 aa. Probable IS6110 transposase. FASTA best: TRA9_MYCTU P19774 putative transposase for insertion sequence (identical)" /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216683.1" /db_xref="GI:15609304" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:888197" /translation="AEALAAGQRRIAKGERDFKDRVGFLRGRARPASTLITRFIADHQ GHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIADPATARP ADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVASTMATSMV LDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYD NALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA AYYAQRQRPAAG" gene complement(2431094..2431420) /locus_tag="Rv2168c" /db_xref="GeneID:888459" CDS complement(2431094..2431420) /locus_tag="Rv2168c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2168c, (MTV021.01c), len: 108 aa. Probable IS6110 transposase. FASTA scores: O08155|O08155 HYPOTHETICAL 12.0 kDa PROTEIN (108 aa) opt: 697, E(): 0, (100.0% identity in 108 aa overlap). TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216684.1" /db_xref="GI:15609305" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:888459" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region complement(2431444..2431471) /note="28 bp Inverted repeat at the right end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene complement(2431565..2431969) /locus_tag="Rv2169c" /db_xref="GeneID:888257" CDS complement(2431565..2431969) /locus_tag="Rv2169c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2169c, (MTV021.02c), len: 134 aa. Probable conserved transmembrane protein, with orthologs in M. leprae, ML0904 probable membrane protein (134 aa), and Streptomyces coelicolor. FASTA scores with ML0904, opt: 767, E(): 5.1e-43; 86.567% identity in 134 aa overlap. emb|CAA18678.1| (AL022602) >gi|13092974|emb|CAC31285.1| (AL583920). TBparse score is 0.934" /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216685.1" /db_xref="GI:15609306" /db_xref="UniProtKB/TrEMBL:O53503" /db_xref="GeneID:888257" /translation="MPLSDHEQRMLDQIESALYAEDPKFASSVRGGGFRAPTARRRLQ GAALFIIGLGMLVSGVAFKETMIGSFPILSVFGFVVMFGGVVYAITGPRLSGRMDRGG SAAGASRQRRTKGAGGSFTSRMEDRFRRRFDE" gene 2432235..2432855 /locus_tag="Rv2170" /db_xref="GeneID:888170" CDS 2432235..2432855 /locus_tag="Rv2170" /function="UNKNOWN" /note="Rv2170, (MTV021.03), len: 206 aa. Conserved hypothetical protein, equivalent to hypothetical protein ML0903 (210 aa) from Mycobacterium leprae. FASTA scores: ML0903 conserved hypothetical protein (210 aa) opt: 1045, E(): 9.1e-57; 77.143% identity in 210 aa overlap. >emb|CAA18679.1| (AL022602) >gi|13092973|emb|CAC31284.1| (AL583920). TBparse score is 0.905" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216686.1" /db_xref="GI:15609307" /db_xref="GOA:O53504" /db_xref="UniProtKB/TrEMBL:O53504" /db_xref="GeneID:888170" /translation="MAIFLIDLPPSDMERRLGDALTVYVDAMRYPRGTETLRAPMWLE HIRRRGWQAVAAVEVTAAEQAEAADTTALPSAAELSNAPMLGVAYGYPGAPGQWWQQQ VVLGLQRSGFPRLAIARLMTSYFELTELHILPRAQGRGLGEALARRLLAGRDEDNVLL STPETNGEDNRAWRLYRRLGFTDIIRGYHFAGDPRAFAILGRTLPL" gene 2432951..2433634 /gene="lppM" /locus_tag="Rv2171" /db_xref="GeneID:887522" CDS 2432951..2433634 /gene="lppM" /locus_tag="Rv2171" /function="UNKNOWN" /note="Rv2171, (MTV021.04), len: 227 aa. Probable lppM, conserved lipoprotein; contains putative signal peptide and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Has hydrophobic stretch at C-terminus and also contains PS00225 Crystallins beta and gamma 'Greek key' motif signature. Unknown but equivalent to Mycobacterium leprae lipoprotein ML0902 (239 aa). FASTA scores: opt: 1083, E(): 2.4e-56; 75.446% identity in 224 aa overlap (5-227:16-239) >emb|CAA18680.1| (AL022602) >gi|13092972|emb|CAC31283.1| (AL583920). TBparse score is 0.895" /codon_start=1 /transl_table=11 /product="lipoprotein lppM" /protein_id="NP_216687.1" /db_xref="GI:15609308" /db_xref="UniProtKB/TrEMBL:O53505" /db_xref="GeneID:887522" /translation="MARTRRRGMLAIAMLLMLVPLATGCLRVRASITISPDDLVSGEI IAAAKPKNSKDTGPALDGDVPFSQKVAVSNYDSDGYVGSQAVFSDLTFAELPQLANMN SDAAGVNLSLRRNGNIVILEGRADLTSVSDPDADVELTVAFPAAVTSTNGDRIEPEVV QWKLKPGVVSTMSAQARYTDPNTRSFTGAGIWLGIAAFAAAGVVAVLAWIDRDRSPRL TASGDPPTS" misc_feature 2432993..2433025 /gene="lppM" /locus_tag="Rv2171" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" misc_feature 2433164..2433211 /gene="lppM" /locus_tag="Rv2171" /note="PS00225 Crystallins beta and gamma 'Greek key' motif signature" gene complement(2433631..2434536) /locus_tag="Rv2172c" /db_xref="GeneID:888147" CDS complement(2433631..2434536) /locus_tag="Rv2172c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2172c, (MTV021.05c), len: 301 aa. Conserved hypothetical protein, equivalent to Mycobacterium leprae conserved hypothetical protein ML0901 (304 aa). FASTA scores: opt: 1656, E(): 7.7e-98; 81.271% identity in 299 aa overlap (1-299:1-299) >emb|CAA18681.1| (AL022602) >gi|13092971|emb|CAC31282.1| (AL583920) . TBparse score is 0.905" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216688.1" /db_xref="GI:15609309" /db_xref="UniProtKB/TrEMBL:O53506" /db_xref="GeneID:888147" /translation="MTLNTIALELVPPNLEGGKERAIEDARKVVQYSAASGLDGRIRH VMMPGMIAEDDDRPIPMQPKLDVLDFWSIIKPELAGVHGLCTQVTAFMDEPSLHRRLV DLSDAGMEGIVFVGVPRTMQDGEGSGVAPTDALSLYRQLVANRGVIVIPTRDGEQGRL NFKCSRGATYGMTQLLYSDAIVGFLREFARTTEHRPEILLSFGFVPKVETRIGLINWL IQDPGNAAVADEQAFVQKLAGSEPARRRRLMVDLYKRVLDGVADLGFPLSIHLEATYG VSAAAFETFAEMLAYWSPAEPGKPD" gene 2434847..2435905 /gene="idsA2" /locus_tag="Rv2173" /db_xref="GeneID:888334" CDS 2434847..2435905 /gene="idsA2" /locus_tag="Rv2173" /EC_number="2.5.1.-" /function="INVOLVED IN LIPID BIOSYNTHESIS." /note="Rv2173, (MTV021.06), len: 352 aa. Probable idsA2, geranylgeranyl pyrophosphate synthase (EC 2.5.1.-), similar to many e.g. Q54193 geranylgeranyl pyrophosphate synthase from Streptomyces griseus (425 aa). Contains PS00723 and PS00444Polyprenyl synthetases signature 1 and 2. FASTA scores: sptr|Q54193|Q54193 GERANYLGERANYL PYROPHOSPHATE SYNTHASE (425 aa) opt: 744, E(): 0; 39.2% identity in 352 aa overlap. TBparse score is 0.900" /codon_start=1 /transl_table=11 /product="geranylgeranyl pyrophosphate synthetase" /protein_id="NP_216689.1" /db_xref="GI:15609310" /db_xref="GOA:O53507" /db_xref="UniProtKB/TrEMBL:O53507" /db_xref="GeneID:888334" /translation="MAGAITDQLRRYLHGRRRAAAHMGSDYDGLIADLEDFVLGGGKR LRPLFAYWGWHAVASREPDPDVLLLFSALELLHAWALVHDDLIDRSATRRGRPTAQLR YAALHRDRDWRGSPDQFGMSAAILLGDLAQVWADDIVSKVCQSALAPDAQRRVHRVWA DIRNEVLGGQYLDIVAEASAAESIESAMNVATLKTACYTVSRPLQLGTAAAADRSDVA AIFEHFGADLGVAFQLRDDVLGVFGDPAVTGKPSGDDLKSGKRTVLVAEAVELADRSD PLAAKLLRTSIGTRLTDAQVRELRTVIEAVGARAAAESRIAALTQRALATLASAPINA TAKAGLSELAMMAANRSA" misc_feature 2435087..2435131 /gene="idsA2" /locus_tag="Rv2173" /note="PS00723 Polyprenyl synthetases signature 1" misc_feature 2435528..2435566 /gene="idsA2" /locus_tag="Rv2173" /note="PS00444 Polyprenyl synthetases signature 2" gene 2435909..2437459 /locus_tag="Rv2174" /db_xref="GeneID:887528" CDS 2435909..2437459 /locus_tag="Rv2174" /function="UNKNOWN" /note="Rv2174, (MTV021.07), len: 516 aa. Possible conserved integral membrane protein, similar to some hypothetical mycobacterial proteins e.g. Mycobacterium leprae ML0899 probable integral-membrane protein (505 aa) and MLCL536_26 (593 aa). FASTA scores: ML0899 opt: 2715; 78.884% identity in 502 aa overlap and gp|Z99125|MLCL536_26 Mycobacterium leprae cosmid L536. (593 aa) opt: 552, E(): 7.1e-30; 31.6% identity in 513 aa overlap. Also similar to Rv1459c. TBparse score is 0.912" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216690.1" /db_xref="GI:15609311" /db_xref="UniProtKB/TrEMBL:O53508" /db_xref="GeneID:887528" /translation="MTTPSHAPAVDLATAKDAVVQHLSRLFEFTTGPQGGPARLGFAG AVLITAGGLGAGSVRQHDPLLESIHMSWLRFGHGLVLSSILLWTGVGVMLLAWLGLGR RVLAGEATEFTMRATTVIWLAPLLLSVPVFSRDTYSYLAQGALLRDGLDPYAVGPVGN PNALLDDVSPIWTITTAPYGPAFILVAKFVTVIVGNNVVAGTMLLRLCMLPGLALLVW ATPRLASHLGTHGPTALWICVLNPLVLIHLMGGVHNEMLMVGLMTAGIALTVQGRNVA GIILITVAIAVKATAGIALPFLVWVWLRHLRERRGYRPVQAFLAAAAISLLIFVAVFA VLSAVAGVGLGWLTALAGSVKIINWLTVPTGAANVIHALGRGLFTVDFYTLLRITRLI GIVIIAVSLPLLWWRFRRDDRAALTGVAWSMLIVVLFVPAALPWYYSWPLAVAAPLAQ ARRAIAAIAGLSTWVMVIFKPDGSHGMYSWLHFWIATACALTAWYVLYRSPDRRGVQA ATPVVNTP" gene complement(2437446..2437886) /locus_tag="Rv2175c" /db_xref="GeneID:887852" CDS complement(2437446..2437886) /locus_tag="Rv2175c" /function="UNKNOWN" /note="Rv2175c, (MTV021.08c), len: 146 aa. Conserved hypothetical protein, possibly involved in regulation. Contains possible helix-turn-helix domain at aa 31-52 (Score 1042, +2.74 SD). Equivalent to Mycobacterium leprae ML0898 putative DNA-binding protein (134 aa). FASTA scores: opt: 747; 82.090% identity in 134 aa overlap (AL022602) >gi|13092969|emb|CAC31279.1| (AL583920)" /codon_start=1 /transl_table=11 /product="putative regulatory protein" /protein_id="NP_216691.1" /db_xref="GI:15609312" /db_xref="UniProtKB/TrEMBL:O53509" /db_xref="GeneID:887852" /translation="MPGRAPGSTLARVGSIPAGDDVLDPDEPTYDLPRVAELLGVPVS KVAQQLREGHLVAVRRAGGVVIPQVFFTNSGQVVKSLPGLLTILHDGGYRDTEIMRWL FTPDPSLTITRDGSRDAVSNARPVDALHAHQAREVVRRAQAMAY" gene 2437941..2439140 /gene="pknL" /locus_tag="Rv2176" /db_xref="GeneID:888340" CDS 2437941..2439140 /gene="pknL" /locus_tag="Rv2176" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). MAY BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2176, (MTV021.09), len: 399 aa. Probable pknL, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), similar to many e.g. MLCB1770_9 (622 aa). Lacks C-terminal domain and ends with putative transmembrane segment. Contains PS00108 Serine/Threonine protein kinases active-site signature. FASTA scores: Z70722|MLC B1770_9 Mycobacterium leprae cosmid B1770 (622 aa) opt: 732, E(): 5.9e-23; 44.4% identity in 266 aa overlap. Also similar to several Mycobacterium tuberculosis STPK proteins e.g. Rv0014c|PKNB, Rv0015c|PKNA, Rv1743|PKNE, Rv1266c|PKNH etc. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES.TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase L" /protein_id="NP_216692.1" /db_xref="GI:15609313" /db_xref="GOA:O53510" /db_xref="UniProtKB/Swiss-Prot:O53510" /db_xref="GeneID:888340" /translation="MVEAGTRDPLESALLDSRYLVQAKIASGGTSTVYRGLDVRLDRP VALKVMDSRYAGDEQFLTRFRLEARAVARLNNRALVAVYDQGKDGRHPFLVMELIEGG TLRELLIERGPMPPHAVVAVLRPVLGGLAAAHRAGLVHRDVKPENILISDDGDVKLAD FGLVRAVAAASITSTGVILGTAAYLSPEQVRDGNADPRSDVYSVGVLVYELLTGHTPF TGDSALSIAYQRLDADVPRASAVIDGVPPQFDELVACATARNPADRYADAIAMGADLE AIAEELALPEFRVPAPRNSAQHRSAALYRSRITQQGQLGAKPVHHPTRQLTRQPGDCS EPASGSEPEHEPITGQFAGIAIEEFIWARQHARRMVLVWVSVVLAITGLVASAAWTIG SNLSGLL" misc_feature 2438352..2438390 /gene="pknL" /locus_tag="Rv2176" /note="PS00108 Serine/Threonine protein kinases active-site signature" repeat_region complement(2439145..2439948) /note="IS1558-1, len: 804 bp. Insertion sequence IS1558, nearly identical to complement of region 24105 24908 in EM_BA:MTCY428 Z81451 Mycobacterium tuberculosis cosmid Y428." /mobile_element="insertion sequence:IS1558-1" gene complement(2439282..2439947) /locus_tag="Rv2177c" /db_xref="GeneID:888326" CDS complement(2439282..2439947) /locus_tag="Rv2177c" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT, POSSIBLY IS1558." /note="Rv2177c, (MTV021.10c), len: 221 aa. Possible IS1558 transposase (see citation below), similar to several IS element proteins and transposases but nearly identical to last 221 residues of MTCY428_23 (333 aa). FASTA scores: Z81451|MTCY428_23 Mycobacterium tuberculosis cosmid (333 aa) opt: 1491, E() : 0; 98.6% identity in 221 aa overlap. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216693.1" /db_xref="GI:15609314" /db_xref="GOA:O53511" /db_xref="UniProtKB/TrEMBL:O53511" /db_xref="GeneID:888326" /translation="MRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMIGALDEQ IEQLMHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLASWVRLCPGNH ESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFGGFRSPAANKK AIIAVAHKLIVIIWHVLATGRPYQDLGADYFTTRMDPDKERRRLVAKLEAQGLGVTLE PAA" gene complement(2440332..2441720) /gene="aroG" /locus_tag="Rv2178c" /db_xref="GeneID:888309" CDS complement(2440332..2441720) /gene="aroG" /locus_tag="Rv2178c" /EC_number="2.5.1.54" /function="chorismate biosynthesis" /experiment="experimental evidence, no additional details recorded" /note="Rv2178c, (MTV021.11c), len: 462 aa. Probable aroG, 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase similar to many, especially those from plants. FASTA scores: Y15113|M C3DDAH7P_1Morinda citrifolia mRNA for 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase (535 aa) opt: 1421, E(): 0; 48.3% identity in 443 aa overlap. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="3-deoxy-D-arabino-heptulosonate 7-phosphate synthase AroG" /protein_id="NP_216694.1" /db_xref="GI:15609315" /db_xref="GOA:O53512" /db_xref="UniProtKB/TrEMBL:O53512" /db_xref="GeneID:888309" /translation="MNWTVDIPIDQLPSLPPLPTDLRTRLDAALAKPAAQQPTWPADQ ALAMRTVLESVPPVTVPSEIVRLQEQLAQVAKGEAFLLQGGDCAETFMDNTEPHIRGN VRALLQMAVVLTYGASMPVVKVARIAGQYAKPRSADIDALGLRSYRGDMINGFAPDAA AREHDPSRLVRAYANASAAMNLVRALTSSGLASLHLVHDWNREFVRTSPAGARYEALA TEIDRGLRFMSACGVADRNLQTAEIYASHEALVLDYERAMLRLSDGDDGEPQLFDLSA HTVWIGERTRQIDGAHIAFAQVIANPVGVKLGPNMTPELAVEYVERLDPHNKPGRLTL VSRMGNHKVRDLLPPIVEKVQATGHQVIWQCDPMHGNTHESSTGFKTRHFDRIVDEVQ GFFEVHRALGTHPGGIHVEITGENVTECLGGAQDISETDLAGRYETACDPRLNTQQSL ELAFLVAEMLRD" gene complement(2441811..2442317) /locus_tag="Rv2179c" /db_xref="GeneID:887927" CDS complement(2441811..2442317) /locus_tag="Rv2179c" /function="UNKNOWN" /note="Rv2179c, (MTV021.12c), len: 168 aa. Conserved hypothetical protein, equivalent to conserved hypothetical protein from Mycobacterium leprae ML0895 conserved hypothetical protein (171 aa). FASTA scores: opt: 977, E(): 1.4e-58; 82.530% identity in 166 aa overlap (AL022602). TBparse score is 0.912" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216695.1" /db_xref="GI:15609316" /db_xref="UniProtKB/TrEMBL:O53513" /db_xref="GeneID:887927" /translation="MRYFYDTEFIEDGHTIELISIGVVAEDGREYYAVSTEFDPERAG SWVRTHVLPKLPPPASQLWRSRQQIRLDLEEFLRIDGTDSIELWAWVGAYDHVALCQL WGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQLRRFRLITSTD DAGRGAAR" gene complement(2442327..2443214) /locus_tag="Rv2180c" /db_xref="GeneID:887954" CDS complement(2442327..2443214) /locus_tag="Rv2180c" /function="UNKNOWN" /note="Rv2180c, (MTV021.13c), len: 295 aa. Probable conserved integral membrane protein, similar to pir||T35292 probable integral membrane protein from Streptomyces coelicolor >gi|5578858|emb|CAB51260.1| (AL096872) (246 aa) (36% identity in 249 aa overlap). TBparse score is 0.914" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216696.1" /db_xref="GI:15609317" /db_xref="UniProtKB/TrEMBL:O53514" /db_xref="GeneID:887954" /translation="MEVFHWLQHDIVDRGRLPLLCCLVAFVLTFLVTRSFVRFIHRRA ADGRPARWWQPRNVHIGSVHIHHVAFGVVLVMISGLTLVTLSVDGREPEFTIAASIFG VGAALVLDEYALILHLSDVYWEEDGRTSVDAVFAAVAVAGLLIMGLHPLIFFLPVRQG ANWVVLQTTLIAGLVLTLPLAVVVLLKGKVWTGLLGMFVVVLLVVGAVRLSRPHAPWA RWRYTRHPEKMRRALQRERTWRRPVVRIKLWLQYVIAGTPRMPDERAVDAQLDQDVRP APPPERTAPILISGSVWSD" gene 2443302..2444585 /locus_tag="Rv2181" /db_xref="GeneID:888269" CDS 2443302..2444585 /locus_tag="Rv2181" /function="UNKNOWN" /note="Rv2181, (MTV021.14), len: 427 aa. Probable conserved integral membrane protein, similar to others in Mycobacterium tuberculosis e.g. Rv1159 (MTCI65.26, 431 aa). Start uncertain. FASTA scores: Z95584|MTCI65_26 (431 aa) opt: 428, E(): 8e-22; 31.2% identity in 407 aa overlap. TBparse score is 0.921" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216697.1" /db_xref="GI:15609318" /db_xref="UniProtKB/TrEMBL:O53515" /db_xref="GeneID:888269" /translation="MSAWRAPEVGSRLGRRVLWCLLWLLAGVALGYVAWRLFGHTPYR IDIDIYQMGARAWLDGRPLYGGGVLFHTPIGLNLPFTYPPLAAVLFSPFAWLQMPAAS VAITVLTLVLLIASTAIVLTGLDAWPTSRLVPAPARLRRLWLAVLIVAPATIWLEPIS SNFAFGQINVVLMTLVIVDCFPRRTPWPRGLMLGLGIALKLTPAVFLLYFLLRRDGRA ALTALASFAVATLLGFVLAWRDSWEYWTHTLHHTDRIGAAALNTDQNIAGALARLTIG DDERFALWVAGSLLVLAATIWAMRRVLRAGEPTLAVICVALFGLVVSPVSWSHHWVWM LPAVLVIGLLGWRRRNVALAMLSLAGVVLMRWTPIDLLPQHRETTAVWWRQLAGMSYV WWALAVIVVAGLTVTARMTPQRSLTRGLTPAPTAS" gene complement(2444586..2445329) /locus_tag="Rv2182c" /db_xref="GeneID:888625" CDS complement(2444586..2445329) /locus_tag="Rv2182c" /EC_number="2.3.1.51" /function="transfer of fatty acyl groups" /note="Rv2182c, (MTV021.15c), len: 247 aa. Probable 1-acylglycerol-3-phosphate O-acyltransferase, similar to many e.g. in Streptomyces. Contains PS00017 ATP/GTP-binding site motif A (P-loop). FASTA scores: pir||T35503 1-acylglycerol-3-phosphate O-acyltransferase (EC 2.3.1.51) homolog SC6E10.16c - Streptomyces coelicolor >gi|5689932|emb|CAB51970.1| (AL109661) hypothetical protein [Streptomyces coelicolor A3(2)] Length = 262, Expect = 6e-61 (54% identity in 215 aa overlap). TBparse score is 0.926" /codon_start=1 /transl_table=11 /product="1-acylglycerol-3-phosphate O-acyltransferase" /protein_id="NP_216698.1" /db_xref="GI:15609319" /db_xref="GOA:O53516" /db_xref="UniProtKB/TrEMBL:O53516" /db_xref="GeneID:888625" /translation="MWYYLFKYIFMGPLFTLLGRPKVEGLEYIPSSGPAILASNHLAV ADSFYLPLVVRRRIWFLAKSEYFTGTGLKGWINRWFYSVSGQVPIDRTNADSAQGALQ TAVVLLGQGKLLGMYPEGTRSPDGRLYKGKTGLARLALHTGVPVIPVAMIGTNVVNPP GRKMLRFGRVTVRFGKPMDFSRFEGLAGNHFIERAVTDEVIYELMGLSGQEYVDIYAA SVKDGRNAGGAGANPNSTDAARIPETAAG" misc_feature complement(2444931..2444954) /locus_tag="Rv2182c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(2445415..2445810) /locus_tag="Rv2183c" /db_xref="GeneID:887650" CDS complement(2445415..2445810) /locus_tag="Rv2183c" /function="UNKNOWN" /note="Rv2183c, (MTV021.16c), len: 131 aa. Conserved hypothetical protein, equivalent to Mycobacterium leprae hypothetical protein ML0891 (MLCB268.25c, 130 aa). FASTA scores: opt: 558, E(): 8.3e-28; 61.832% identity in 131 aa overlap >gi|13092963|emb|CAC31272.1| (AL583920) (AL022602). TBparse score is 0.895" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216699.1" /db_xref="GI:15609320" /db_xref="UniProtKB/TrEMBL:O53517" /db_xref="GeneID:887650" /translation="MSGAHTDVRPELRKLAQAILDGIDPAVRVAAAMASGGGPGTGKC QQVWCPLCALAALVTGEQHPLLTVIADHSLALLEVIRAIVDDIDRSAKPPPEGPPGGG QTGASGGENTNGEGSMKSHYQAIPVTIEE" gene complement(2445807..2446946) /locus_tag="Rv2184c" /db_xref="GeneID:888152" CDS complement(2445807..2446946) /locus_tag="Rv2184c" /function="UNKNOWN" /note="Rv2184c, (MTV021.17c), len: 379 aa. Conserved hypothetical protein, equivalent to hypothetical protein ML0890 (415 aa) from Mycobacterium leprae and also shows some similarity to other hypothetical proteins. FASTA scores: ML0890 opt: 1949; 79.630% identity in 378 aa overlap >emb|CAA18692.1| (AL022602) >gi|13092962|emb|CAC31271.1| (AL583920) and sptr|Q55794|Q55794 HYPOTHETICAL 44.6 kDa PROTEIN. (396 aa) opt: 251, E(): 3.3e-09; 25.5% identity in 384 aa overlap. TBparse score is 0.920" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216700.1" /db_xref="GI:15609321" /db_xref="UniProtKB/TrEMBL:O53518" /db_xref="GeneID:888152" /translation="MVVSTDQAHSLGDVLGIAVPPTGQGDPVRVLAYDPEAGGGFLDA LALDTLALLEGRWLHVVETLDRRFPGSELSSIAPEELCALPGIQEVLGLHAVGELAAA RRWDRIVVDCASTADALRMLTLPATFGLYVERAWPRHRRLSIGADDGRSAVLAELLER IRASVERLSTLLTDGALVSAHLVLTPERVVAAEAVRTLGSLALMGVRVEELLVNQLLV QDENYEYRSLPDHPAFHWYAERIGEQRAVLDDLDATIGDVALVLVPHLAGEPIGPKAL GGLLDSARRRQGSAPPGPLQPIVDLESGSGLASIYRLRLALPQLDPGTLTLGRADDDL IVSAGGMRRRVRLASVLRRCTVLDAHLRGGELTVRFRPNPEVWPT" gene complement(2447066..2447500) /gene="TB16.3" /locus_tag="Rv2185c" /db_xref="GeneID:887239" CDS complement(2447066..2447500) /gene="TB16.3" /locus_tag="Rv2185c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2185c, (MTV021.18c), len: 144 aa. TB16.3, conserved hypothetical protein, similar to other hypothetical actinomycete proteins and equivalent to Mycobacterium leprae ML0889 (144 aa). Some similarity to Mycobacterium tuberculosis Rv0854, Rv0856, Rv0857, Rv0164 and other Mycobacterium leprae proteins. FASTA scores : ML0889 opt: 811; 85.417% identity in 144 aa overlap (AL022602). TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216701.1" /db_xref="GI:15609322" /db_xref="GOA:O53519" /db_xref="UniProtKB/TrEMBL:O53519" /db_xref="GeneID:887239" /translation="MADKTTQTIYIDADPGEVMKAIADIEAYPQWISEYKEVEILEAD DEGYPKRARMLMDAAIFKDTLIMSYEWPEDRQSLSWTLESSSLLKSLEGTYRLAPKGS GTEVTYELAVDLAVPMIGMLKRKAERRLIDGALKDLKKRVEG" gene complement(2447605..2447994) /locus_tag="Rv2186c" /db_xref="GeneID:887347" CDS complement(2447605..2447994) /locus_tag="Rv2186c" /function="UNKNOWN" /note="Rv2186c, (MTV021.19c), len: 129 aa. Conserved hypothetical protein, equivalent to hypothetical Mycobacterium leprae protein ML0888 (135 aa). FASTA scores: ML0888 opt: 704, E(): 2.9e-43; 80.000% identity in 130 aa overlap CAA18694.1| (AL022602). TBparse score is 0.927" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216702.1" /db_xref="GI:15609323" /db_xref="UniProtKB/TrEMBL:O53520" /db_xref="GeneID:887347" /translation="MNSIQIADETYVAADAARVSAAVADRCSWRRWWPDLRLQVTEDR ADKGIRWTVTGALTGTMEIWLEPSMDGVLLHYFLHAEPTGVAAWQLARMNLARMTHHR RVAGKKMAFEVKTVLERSRPIGVSPVT" gene 2448160..2449962 /gene="fadD15" /locus_tag="Rv2187" /db_xref="GeneID:887456" CDS 2448160..2449962 /gene="fadD15" /locus_tag="Rv2187" /EC_number="6.2.1.3" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv2187, (MTV021.20), len: 600 aa. Probable fadD15, long-chain-fatty-acid-CoA ligase (EC 6.2.1.3), similar to several e.g. P44446|LCFH_HAEIN PUTATIVE LONG-CHAIN-FATTY-ACID--CoA LIGASE from Haemophilus influenzae (607 aa), FASTA scores: (607 aa) opt: 992, E(): 0, (31.5% identity in 578 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid-CoA ligase fadD15 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)" /protein_id="NP_216703.1" /db_xref="GI:15609324" /db_xref="GOA:O53521" /db_xref="UniProtKB/TrEMBL:O53521" /db_xref="GeneID:887456" /translation="MREISVPAPFTVGEHDNVAAMVFEHERDDPDYVIYQRLIDGVWT DVTCAEAANQIRAAALGLISLGVQAGDRVVIFSATRYEWAILDFAILAVGAVTVPTYE TSSAEQVRWVLQDSEAVVLFAETDSHATMVAELSGSVPALREVLQIAGSGPNALDRLT EAGASVDPAELTARLAALRSTDPATLIYTSGTTGRPKGCQLTQSNLVHEIKGARAYHP TLLRKGERLLVFLPLAHVLARAISMAAFHSKVTVGFTSDIKNLLPMLAVFKPTVVVSV PRVFEKVYNTAEQNAANAGKGRIFAIAAQTAVDWSEACDRGGPGLLLRAKHAVFDRLV YRKLRAALGGNCRAAVSGGAPLGARLGHFYRGAGLTIYEGYGLSGTSGGVAISQFNDL KIGTVGKPVPGNSLRIADDGELLVRGGVVFSGYWRNEQATTEAFTDGWFKTGDLGAVD EDGFLTITGRKKEIIVTAGGKNVAPAVLEDQLRAHPLISQAVVVGDAKPFIGALITID PEAFEGWKQRNSKTAGASVGDLATDPDLIAEIDAAVKQANLAVSHAESIRKFRILPVD FTEDTGELTPTMKVKRKVVAEKFASDIEAIYNKE" misc_feature 2448715..2448750 /gene="fadD15" /locus_tag="Rv2187" /note="PS00455 Putative AMP-binding domain signature" gene complement(2449993..2451150) /locus_tag="Rv2188c" /db_xref="GeneID:887278" CDS complement(2449993..2451150) /locus_tag="Rv2188c" /function="UNKNOWN" /note="Rv2188c, (MTV021.21c), len: 385 aa. Conserved hypothetical protein, possibly glycosyl transferase similar to several putative glycosyl transferases and hypothetical proteins e.g. P73369. Equivalent to Mycobacterium leprae ML0886 putative glycosyl transferase (384 aa). FASTA scores: ML0886 (CAA18697.1| (AL022602) ) opt: 2113, E(): 1.8e-106; 81.462% identity in 383 aa overlap; sptr|P73369|P73369 HYPOTHETICAL 46.2 kDa PROTEIN (404 aa) opt: 379, E(): 2.2e-18; 27.5% identity in 397 aa overlap. Start changed since first submission, now 14 aa shorter. TBparse score is 0.913" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216704.2" /db_xref="GI:57116954" /db_xref="GOA:O53522" /db_xref="UniProtKB/TrEMBL:O53522" /db_xref="GeneID:887278" /translation="MSRVLLVTNDFPPRRGGIQSYLGEFVGRLVGSRAHAMTVYAPQW KGADAFDDAARAAGYRVVRHPSTVMLPGPTVDVRMRRLIAEHDIETVWFGAAAPLALL APRARLAGASRVLASTHGHEVGWSMLPVARSVLRRIGDGTDVVTFVSSYTRSRFASAF GPAASLEYLPPGVDTDRFRPDPAARAELRKRYRLGERPTVVCLSRLVPRKGQDTLVTA LPSIRRRVDGAALVIVGGGPYLETLRKLAHDCGVADHVTFTGGVATDELPAHHALADV FAMPCRTRGAGMDVEGLGIVFLEASAAGVPVIAGNSGGAPETVQHNKTGLVVDGRSVD RVADAVAELLIDRDRAVAMGAAGREWVTAQWRWDTLAAKLADFLRGDDAAR" gene complement(2451247..2452020) /locus_tag="Rv2189c" /db_xref="GeneID:887575" CDS complement(2451247..2452020) /locus_tag="Rv2189c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2189c, (MTV021.22c), len: 257 aa. Conserved hypothetical protein; some similarity to hypothetical protein SC6G10.07c (385 aa) from Streptomyces coelicolor A3(2). Smith-Waterman scores: pir||T35516 hypothetical protein SC6G10.07c - Streptomyces coelicolor >gi|4539203|emb|CAB39861.1| (AL049497) Expect = 2e-08; 30% identity in 245 aa overlap. TBparse score is 0.908" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216705.1" /db_xref="GI:15609326" /db_xref="UniProtKB/TrEMBL:O53523" /db_xref="GeneID:887575" /translation="MRDGPAAPAQVVAPADGFVALRVADDRTVRLLSLGGAATDRLLS RIAAGIDAAVDEVVAFWGTDWSHDIFVVAAGSDEQFHAAAGGGLASQWADIAAITVVD RVDPARRTVVGQRIVFAPGAAHMSPAALRIVLGHELFHYAARADTALDAPRWLAEGVA DFVARPKTPPPADAVSVALSLPSDTDLDTPGPQRSLAYDRAWWFARFVAAAYGTAKLR ELYLATCGVGHFDLATAAHDVLGIDAAGLLARWQRWLMG" gene complement(2452115..2453272) /locus_tag="Rv2190c" /db_xref="GeneID:888670" CDS complement(2452115..2453272) /locus_tag="Rv2190c" /function="UNKNOWN" /note="Rv2190c, (MTV021.23c, MTCY190.01c), len: 385 aa. Conserved hypothetical protein; similar to other hypothetical mycobacterial proteins, including Rv1477, Rv1478, Rv1566c, Rv0024, that are similar to protein p60 precursors from Listeria e.g. Q018 38|P60_LISSE protein p60 precursor (invasion-associated protein) (524 aa). FASTA scores: gp|Z80233|MTCY10H4_25 (281 aa) opt: 290, E(): 6.9e-05; 37.0% identity in 127 aa overlap and sp|Q01838|P60_LISSE PROTEIN P60 PRECURSOR (523 aa) opt: 268, E(): 0.00071; 38.5% identity in 104 aa overlap. TBparse score is 0.927" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216706.1" /db_xref="GI:15609327" /db_xref="UniProtKB/Swiss-Prot:P67473" /db_xref="GeneID:888670" /translation="MRLDQRWLIARVIMRSAIGFFASFTVSSGVLAANVLADPADDAL AKLNELSRQAEQTTEALHSAQLDLNEKLAAQRAADQKLADNRTALDAARARLATFQTA VNKVAAATYMGGRTHGMDAILTAESPQLLIDRLSVQRVMAHQMSTQMARFKAAGEQAV KAEQAAAKSAADARSAAEQAAAVRANLQHKQSQLQVQIAVVKSQYVALTPEERTALAD PGPVPAVAAIAPGAPPAALPPGAPPGDGPAPGVAPPPGGMPGLPFVQPDGAGGDRTAV VQAALTQVGAPYAWGGAAPGGFDCSGLVMWAFQQAGIALPHSSQALAHGGQPVALSDL QPGDVLTFYSDASHAGIYIGDGLMVHSSTYGVPVRVVPMDSSGPIYDARRY" gene 2453819..2455756 /locus_tag="Rv2191" /db_xref="GeneID:887265" CDS 2453819..2455756 /locus_tag="Rv2191" /function="UNKNOWN" /note="contains 3'-5'exonuclease domain" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216707.1" /db_xref="GI:15609328" /db_xref="GOA:Q10384" /db_xref="UniProtKB/Swiss-Prot:Q10384" /db_xref="GeneID:887265" /translation="MQGPNVAAMGATGGTQLSFADLAHAQGAAWTPADEMSLRETTFV VVDLETTGGRTTGNDATPPDAITEIGAVKVCGGAVLGEFATLVNPQHSIPPQIVRLTG ITTAMVGNAPTIDAVLPMFFEFAGDSVLVAHNAGFDIGFLRAAARRCDITWPQPQVLC TMRLARRVLSRDEAPSVRLAALARLFAVASNPTHRALDDARATVDVLHALIERVGNQG VHTYAELRSYLPNVTQAQRCKRVLAETLPHRPGVYLFRGPSGEVLYVGTAADLRRRVS QYFNGTDRRKRMTEMVMLASSIDHVECAHPLEAGVRELRMLSTHAPPYNRRSKFPYRW WWVALTDEAFPRLSVIRAPRHDRVVGPFRSRSKAAETAALLARCTGLRTCTTRLTRSA RHGPACPELEVSACPAARDVTAAQYAEAVLRAAALIGGLDNAALAAAVQQVTELAERR RYESAARLRDHLATAIEALWHGQRLRALAALPELIAAKPDGPREGGYQLAVIRHGQLA AAGRAPRGVPPMPVVDAIRRGAQAILPTPAPLGGALVEEIALIARWLAEPGVRIVGVS NDAAGLASPVRSAGPWAAWAATARSAQLAGEQLSRGWQSDLPTEPHPSREQLFGRTGV DCRTGPPQPLLPGRQPFSTAG" gene complement(2455631..2456743) /gene="trpD" /locus_tag="Rv2192c" /db_xref="GeneID:887681" CDS complement(2455631..2456743) /gene="trpD" /locus_tag="Rv2192c" /EC_number="2.4.2.18" /function="tryptophan biosynthesis" /note="Catalyzes the conversion of N-(5-phospho-D-ribosyl)-anthranilate and diphosphate to anthranilate and 5-phospho-alpha-D-ribose 1-diphosphate" /codon_start=1 /transl_table=11 /product="anthranilate phosphoribosyltransferase" /protein_id="NP_216708.1" /db_xref="GI:15609329" /db_xref="GOA:P66992" /db_xref="UniProtKB/Swiss-Prot:P66992" /db_xref="GeneID:887681" /translation="MALSAEGSSGGSRGGSPKAEAASVPSWPQILGRLTDNRDLARGQ AAWAMDQIMTGNARPAQIAAFAVAMTMKAPTADEVGELAGVMLSHAHPLPADTVPDDA VDVVGTGGDGVNTVNLSTMAAIVVAAAGVPVVKHGNRAASSLSGGADTLEALGVRIDL GPDLVARSLAEVGIGFCFAPRFHPSYRHAAAVRREIGVPTVFNLLGPLTNPARPRAGL IGCAFADLAEVMAGVFAARRSSVLVVHGDDGLDELTTTTTSTIWRVAAGSVDKLTFDP AGFGFARAQLDQLAGGDAQANAAAVRAVLGGARGPVRDAVVLNAAGAIVAHAGLSSRA EWLPAWEEGLRRASAAIDTGAAEQLLARWVRFGRQI" gene 2456901..2457512 /gene="ctaE" /locus_tag="Rv2193" /db_xref="GeneID:887425" CDS 2456901..2457512 /gene="ctaE" /locus_tag="Rv2193" /EC_number="1.9.3.1" /function="THOUGHT TO BE INVOLVED IN AEROBIC RESPIRATION." /experiment="experimental evidence, no additional details recorded" /note="Rv2193, (MTCY190.04), len: 203 aa. Probable ctaE, cytochrome c oxidase polypeptide III (cox3) (EC 1.9.3.1), with strong similarity to others e.g. COX3_SYNY3|Q06475 (29.8% identity in 225 aa overlap)." /codon_start=1 /transl_table=11 /product="cytochrome C oxidase subunit III" /protein_id="NP_216709.1" /db_xref="GI:15609330" /db_xref="GOA:P63856" /db_xref="UniProtKB/Swiss-Prot:P63856" /db_xref="GeneID:887425" /translation="MTSAVGTSGTAITSRVHSLNRPNMVSVGTIVWLSSELMFFAGLF AFYFSARAQAGGNWPPPPTELNLYQAVPVTLVLIASSFTCQMGVFAAERGDIFGLRRW YVITFLMGLFFVLGQAYEYRNLMSHGTSIPSSAYGSVFYLATGFHGLHVTGGLIAFIF LLVRTGMSKFTPAQATASIVVSYYWHFVDIVWIALFTVIYFIR" gene 2457553..2458395 /gene="qcrC" /locus_tag="Rv2194" /db_xref="GeneID:888737" CDS 2457553..2458395 /gene="qcrC" /locus_tag="Rv2194" /function="respiration" /note="Rv2194, (MTCY190.05), len: 280 aa. Probable qcrC, Ubiquinol-cytochrome C reductase cytochrome C subunit (cyoA), shows similarity to cytochrome c family; contains 2 X PS00190 Cytochrome c family heme-binding site signature." /codon_start=1 /transl_table=11 /product="ubiquinol-cytochrome C reductase QcrC(cytochrome C subunit)" /protein_id="NP_216710.1" /db_xref="GI:15609331" /db_xref="GOA:P63887" /db_xref="UniProtKB/Swiss-Prot:P63887" /db_xref="GeneID:888737" /translation="MTKLGFTRSGGSKSGRTRRRLRRRLSGGVLLLIALTIAGGLAAV LTPTPQVAVADESSSALLRTGKQLFDTSCVSCHGANLQGVPDHGPSLIGVGEAAVYFQ VSTGRMPAMRGEAQAPRKDPIFDEAQIDAIGAYVQANGGGPTVVRNPDGSIATQSLRG NDLGRGGDLFRLNCASCHNFTGKGGALSSGKYAPDLAPANEQQILTAMLTGPQNMPKF SNRQLSFEAKKDIIAYVKVATEARQPGGYLLGGFGPAPEGMAMWIIGMVAAIGLALWI GARS" gene 2458392..2459681 /gene="qcrA" /locus_tag="Rv2195" /db_xref="GeneID:888420" CDS 2458392..2459681 /gene="qcrA" /locus_tag="Rv2195" /function="respiration" /note="Rv2195, (MTCY190.06), len: 429 aa. Probable qcrA, Ubiquinol-cytochrome C reductase iron-sulfur subunit (cyoB), shows some similarity to cytochrome B6-F complex iron-sulphur subunits (Rieske iron-sulfur protein); contains PS00200 Rieske iron-sulfur protein signature 2" /codon_start=1 /transl_table=11 /product="Rieske iron-sulfur protein QcrA" /protein_id="NP_216711.1" /db_xref="GI:15609332" /db_xref="GOA:Q10387" /db_xref="UniProtKB/Swiss-Prot:Q10387" /db_xref="GeneID:888420" /translation="MSRADDDAVGVPPTCGGRSDEEERRIVPGPNPQDGAKDGAKATA VPREPDEAALAAMSNQELLALGGKLDGVRIAYKEPRWPVEGTKAEKRAERSVAVWLLL GGVFGLALLLIFLFWPWEFKAADGESDFIYSLTTPLYGLTFGLSILSIAIGAVLYQKR FIPEEISIQERHDGASREIDRKTVVANLTDAFEGSTIRRRKLIGLSFGVGMGAFGLGT LVAFAGGLIKNPWKPVVPTAEGKKAVLWTSGWTPRYQGETIYLARATGTEDGPPFIKM RPEDMDAGGMETVFPWRESDGDGTTVESHHKLQEIAMGIRNPVMLIRIKPSDLGRVVK RKGQESFNFGEFFAFTKVCSHLGCPSSLYEQQSYRILCPCHQSQFDALHFAKPIFGPA ARALAQLPITIDTDGYLVANGDFVEPVGPAFWERTTT" repeat_region 2458392..2458449 /note="58 bp Mycobacterial Interspersed Repetitive Unit, Class II I. Overlaps Rv2195 suggesting alternative GTG start at 2458 468 may be used" gene 2459678..2461327 /gene="qcrB" /locus_tag="Rv2196" /db_xref="GeneID:887400" CDS 2459678..2461327 /gene="qcrB" /locus_tag="Rv2196" /function="respiration" /note="Rv2196, (MTCY190.07), len: 549 aa. Probable qcrB, Ubiquinol-cytochrome C reductase cytochrome B subunit (cytB), integral membrane protein, low similarity in amino-terminal half to cytochrome b subunits, highly similar at C-terminus to SW:12KD_MYCLE P15878 12 KD protein PIR:S08427 (86.9% identity in 153 aa overlap). FASTA scores: sp|Q45658|QCRB_BACST MENAQUINOL-CYTOCHROME C REDUCTASE (224 aa) opt: 341, E(): 6.8e-15; 28.0% identity in 207 aa overlap" /codon_start=1 /transl_table=11 /product="ubiquinol-cytochrome C reductase QcrB (cytochrome B subunit)" /protein_id="NP_216712.1" /db_xref="GI:15609333" /db_xref="GOA:P63885" /db_xref="UniProtKB/Swiss-Prot:P63885" /db_xref="GeneID:887400" /translation="MSPKLSPPNIGEVLARQAEDIDTRYHPSAALRRQLNKVFPTHWS FLLGEIALYSFVVLLITGVYLTLFFDPSMVDVTYNGVYQPLRGVEMSRAYQSALDISF EVRGGLFVRQIHHWAALMFAAAIMVHLARIFFTGAFRRPRETNWVIGSLLLILAMFEG YFGYSLPDDLLSGLGLRAALSSITLGMPVIGTWLHWALFGGDFPGTILIPRLYALHIL LLPGIILALIGLHLALVWFQKHTQFPGPGRTEHNVVGVRVMPVFAFKSGAFFAAIVGV LGLMGGLLQINPIWNLGPYKPSQVSAGSQPDFYMMWTEGLARIWPPWEFYFWHHTIPA PVWVAVIMGLVFVLLPAYPFLEKRFTGDYAHHNLLQRPRDVPVRTAIGAMAIAFYMVL TLAAMNDIIALKFHISLNATTWIGRIGMVILPPFVYFITYRWCIGLQRSDRSVLEHGV ETGIIKRLPHGAYIELHQPLGPVDEHGHPIPLQYQGAPLPKRMNKLGSAGSPGSGSFL FADSAAEDAALREAGHAAEQRALAALREHQDSIMGSPDGEH" gene complement(2461504..2462148) /locus_tag="Rv2197c" /db_xref="GeneID:887213" CDS complement(2461504..2462148) /locus_tag="Rv2197c" /function="UNKNOWN" /note="Rv2197c, (MTCY190.08c), len: 214 aa. Probable conserved transmembrane protein, equivalent to ML0878 conserved hypothetical protein (212 aa) of Mycobacterium leprae. FASTA scores: opt: 858; 62.559% identity in 211 aa overlap CAC31259.1|(AL583920)" /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216713.1" /db_xref="GI:15609334" /db_xref="GOA:Q10389" /db_xref="UniProtKB/Swiss-Prot:Q10389" /db_xref="GeneID:887213" /translation="MVSRYSAYRRGPDVISPDVIDRILVGACAAVWLVFTGVSVAAAV ALMDLGRGFHEMAGNPHTTWVLYAVIVVSALVIVGAIPVLLRARRMAEAEPATRPTGA SVRGGRSIGSGHPAKRAVAESAPVQHADAFEVAAEWSSEAVDRIWLRGTVVLTSAIGI ALIAVAAATYLMAVGHDGPSWISYGLAGVVTAGMPVIEWLYARQLRRVVAPQSS" gene complement(2462148..2463047) /gene="mmpS3" /locus_tag="Rv2198c" /db_xref="GeneID:887471" CDS complement(2462148..2463047) /gene="mmpS3" /locus_tag="Rv2198c" /function="UNKNOWN" /note="Rv2198c, (MTCY190.09c), len: 301 aa. Probable mmpS3, conserved membrane protein (see citation below), equivalent to ML0877|mmpS3 putative membrane protein from Mycobacterium leprae (293 aa), FASTA scores: opt: 1089, E(): 1.2e-43, (69.80% identity in 308 aa overlap). Also similar to other proteins e.g. Rv3209 from Mycobacterium tuberculosis. Contains PS00499 C2 domain signature, a hydrophobic region, and a repetitive proline and threonine rich region. BELONGS TO THE MMPS FAMILY." /codon_start=1 /transl_table=11 /product="membrane protein" /protein_id="NP_216714.1" /db_xref="GI:15609335" /db_xref="GOA:P65378" /db_xref="UniProtKB/Swiss-Prot:P65378" /db_xref="GeneID:887471" /translation="MSGPNPPGREPDEPESEPVSDTGDERASGNHLPPVAGGGDKLPS DQTGETDAYSRAYSAPESEHVTGGPYVPADLRLYDYDDYEESSDLDDELAAPRWPWVV GVAAIIAAVALVVSVSLLVTRPHTSKLATGDTTSSAPPVQDEITTTKPAPPPPPPAPP PTTEIPTATETQTVTVTPPPPPPPATTTAPPPATTTTAAAPPPTTTTPTGPRQVTYSV TGTKAPGDIISVTYVDAAGRRRTQHNVYIPWSMTVTPISQSDVGSVEASSLFRVSKLN CSITTSDGTVLSSNSNDGPQTSC" gene complement(2463233..2463652) /locus_tag="Rv2199c" /db_xref="GeneID:888692" CDS complement(2463233..2463652) /locus_tag="Rv2199c" /function="UNKNOWN" /note="Rv2199c, (MTCY190.10c), len: 139 aa. Possible conserved integral membrane protein, similar to hypothetical membrane proteins in Actinomycetes and equivalent to Mycobacterium leprae, ML0876, putative membrane protein (139 aa) FASTA scores: opt: 866, E(): 1.1e-43; 91.367% identity in 139 aa overlap CAC31257.1| (AL583920)" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216715.1" /db_xref="GI:15609336" /db_xref="GOA:P64947" /db_xref="UniProtKB/Swiss-Prot:P64947" /db_xref="GeneID:888692" /translation="MHIEARLFEFVAAFFVVTAVLYGVLTSMFATGGVEWAGTTALAL TGGMALIVATFFRFVARRLDSRPEDYEGAEISDGAGELGFFSPHSWWPIMVALSGSVA AVGIALWLPWLIAAGVAFILASAAGLVFEYYVGPEKH" gene complement(2463660..2464751) /gene="ctaC" /locus_tag="Rv2200c" /db_xref="GeneID:888799" CDS complement(2463660..2464751) /gene="ctaC" /locus_tag="Rv2200c" /EC_number="1.9.3.1" /function="INVOLVED IN AEROBIC RESPIRATION. SUBUNIT I AND II FORM THE FUNCTIONAL CORE OF THE ENZYME COMPLEX. ELECTRONS ORIGINATING IN CYTOCHROME C ARE TRANSFERRED VIA HEME A AND CU(A) TO THE BINUCLEAR CENTER FORMED BY HEME A3 AND CU(B) (BY SIMILARITY)." /note="Rv2200c, (MTCY190.11c), len: 363 aa. Probable ctaC, transmembrane cytochrome C oxidase (subunit II), COX2, similar e.g. to JT0964 cytochrome-c oxidase chain II (23.0% identity in 317 aa overlap); etc. Contains PS00078 Cytochrome c oxidase subunit II, copper A binding region signature. BELONGS TO THE CYTOCHROME C OXIDASE SUBUNIT 2 FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane cytochrome C oxidase subunit II CtaC" /protein_id="NP_216716.1" /db_xref="GI:15609337" /db_xref="GOA:P63854" /db_xref="UniProtKB/Swiss-Prot:P63854" /db_xref="GeneID:888799" /translation="MTPRGPGRLQRLSQCRPQRGSGGPARGLRQLALAAMLGALAVTV SGCSWSEALGIGWPEGITPEAHLNRELWIGAVIASLAVGVIVWGLIFWSAVFHRKKNT DTELPRQFGYNMPLELVLTVIPFLIISVLFYFTVVVQEKMLQIAKDPEVVIDITSFQW NWKFGYQRVNFKDGTLTYDGADPERKRAMVSKPEGKDKYGEELVGPVRGLNTEDRTYL NFDKVETLGTSTEIPVLVLPSGKRIEFQMASADVIHAFWVPEFLFKRDVMPNPVANNS VNVFQIEEITKTGAFVGHCAEMCGTYHSMMNFEVRVVTPNDFKAYLQQRIDGKTNAEA LRAINQPPLAVTTHPFDTRRGELAPQPVG" gene 2464997..2466955 /gene="asnB" /locus_tag="Rv2201" /db_xref="GeneID:888472" CDS 2464997..2466955 /gene="asnB" /locus_tag="Rv2201" /EC_number="6.3.5.4" /function="asparagine biosynthesis" /note="Rv2201, (MTCY190.12), len: 652 aa. Probable asnB, asparagine synthetase, similar to e.g. SW:ASNH_BACSU P42113 putative asparagine synthetase (26.0% identity in 438 aa overlap)" /codon_start=1 /transl_table=11 /product="asparagine synthetase AsnB" /protein_id="NP_216717.1" /db_xref="GI:15609338" /db_xref="GOA:P64247" /db_xref="UniProtKB/Swiss-Prot:P64247" /db_xref="GeneID:888472" /translation="MCGLLAFVAAPAGAAGPEGADAASAIARASHLMRHRGPDESGTW HAVDGASGGVVFGFNRLSIIDIAHSHQPLRWGPPEAPDRYVLVFNGEIYNYLELRDEL RTQHGAVFATDGDGEAILAGYHHWGTEVLQRLRGMFAFALWDTVTRELFCARDPFGIK PLFIATGAGGTAVASEKKCLLDLVELVGFDTEIDHRALQHYTVLQYVPEPETLHRGVR RLESGCFARIRADQLAPVITRYFVPRFAASPITNDNDQARYDEITAVLEDSVAKHMRA DVTVGAFLSGGIDSTAIAALAIRHNPRLITFTTGFEREGFSEIDVAVASAEAIGARHI AKVVSADEFVAALPEIVWYLDEPVADPALVPLFFVAREARKHVKVVLSGEGADELFGG YTIYREPLSLRPFDYLPKPLRRSMGKVSKPLPEGMRGKSLLHRGSLTLEERYYGNARS FSGAQLREVLPGFRPDWTHTDVTAPVYAESAGWDPVARMQHIDLFTWLRGDILVKADK ITMANSLELRVPFLDPEVFAVASRLPAGAKITRTTTKYALRRALEPIVPAHVLHRPKL GFPVPIRHWLRAGELLEWAYATVGSSQAGHLVDIAAVYRMLDEHRCGSSDHSRRLWTM LIFMLWHAIFVEHSVVPQISEPQYPVQL" gene complement(2467053..2468027) /gene="cbhK" /locus_tag="Rv2202c" /db_xref="GeneID:888551" CDS complement(2467053..2468027) /gene="cbhK" /locus_tag="Rv2202c" /EC_number="2.7.-.-" /function="phosphorylation of carbohydrates" /note="Rv2202c, (MTCY190.13c), len: 324 aa. Probable cbhK, carbohydrate kinase (but not ribose) (EC 2.7.-.-), similar to several e.g. AE000915_1 Methanobacterium thermoautotrop (309 aa) FASTA score: opt: 370, E(): 3.3e-18; 31.2% identity in 276 aa overlap. Low similarity to carbohydrate kinases, e.g. SW:RBSK_BACSU P36945 ribokinase (23.9% identity in 272 aa overlap); contains PS00583 pfkB family of carbohydrate kinases signature 1" /codon_start=1 /transl_table=11 /product="carbohydrate kinase CbhK" /protein_id="NP_216718.1" /db_xref="GI:15609339" /db_xref="GOA:P83734" /db_xref="UniProtKB/Swiss-Prot:P83734" /db_xref="GeneID:888551" /translation="MTIAVTGSIATDHLMRFPGRFSEQLLPEHLHKVSLSFLVDDLVM HRGGVAGNMAFAIGVLGGEVALVGAAGADFADYRDWLKARGVNCDHVLISETAHTARF TCTTDVDMAQIASFYPGAMSEARNIKLADVVSAIGKPELVIIGANDPEAMFLHTEECR KLGLAFAADPSQQLARLSGEEIRRLVNGAAYLFTNDYEWDLLLSKTGWSEADVMAQID LRVTTLGPKGVDLVEPDGTTIHVGVVPETSQTDPTGVGDAFRAGFLTGRSAGLGLERS AQLGSLVAVLVLESTGTQEWQWDYEAAASRLAGAYGEHAAAEIVAVLA" gene 2468231..2468923 /locus_tag="Rv2203" /db_xref="GeneID:888318" CDS 2468231..2468923 /locus_tag="Rv2203" /function="UNKNOWN" /note="Rv2203, (MTCY190.14), len: 230 aa. Possible conserved membrane protein; has single hydrophobic stretch from aa 75 to 97 and is equivalent to Mycobacterium leprae ML0872 putative membrane protein (171 aa). FASTA scores: opt: 821, E(): 3.4e-42; 72.353% identity in 170 aa overlap - CAC31253.1| (AL583920). 2468411." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216719.1" /db_xref="GI:15609340" /db_xref="GOA:P64949" /db_xref="UniProtKB/Swiss-Prot:P64949" /db_xref="GeneID:888318" /translation="MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFA PGPADDAALPPAAYPGVPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTNGA NTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDA FRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVL VCSYVLRTAGSY" gene complement(2468931..2469287) /locus_tag="Rv2204c" /db_xref="GeneID:888428" CDS complement(2468931..2469287) /locus_tag="Rv2204c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2204c, (MTCY190.15c), len: 118 aa. Conserved hypothetical protein. Similar to conserved hypothetical proteins in Actinomycetes and equivalent to Mycobacterium leprae ML0871|ML0871 conserved hypothetical protein (118 aa) and to sp|P45344|YADR_HAEIN HYPOTHETICAL PROTEIN HI1723 (114 aa). FASTA score: ML0871 opt: 720, E(): 8.4e-45; 92.373% identity in 118 aa overlapCAC31252.1| (AL583920); and P45344 opt: 346, E(): 1.8e-18; 45.6% identity in 103 aa overlap. Contains PS01152 Hypothetical hesB/y yadR/yfhF family signature" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216720.1" /db_xref="GI:15609341" /db_xref="GOA:Q10393" /db_xref="UniProtKB/Swiss-Prot:Q10393" /db_xref="GeneID:888428" /translation="MTVQNEPSAKTHGVILTEAAAAKAKSLLDQEGRDDLALRIAVQP GGCAGLRYNLFFDDRTLDGDQTAEFGGVRLIVDRMSAPYVEGASIDFVDTIEKQGFTI DNPNATGSCACGDSFN" gene complement(2469387..2470463) /locus_tag="Rv2205c" /db_xref="GeneID:887277" CDS complement(2469387..2470463) /locus_tag="Rv2205c" /function="UNKNOWN" /note="Rv2205c, (MTCY190.16c), len: 358 aa. Conserved hypothetical protein. Very similar to YHAD_ECOLI|P23524 hypothetical protein (YHAD (E.coli) / YXAA (S14A) (B.subtilis) family) (41.6% identity in 154 aa overlap), and to other members of the glycerate kinase family. Start changed since first submission; protein now 122 aa shorter, owing to extension of Rv2206." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216721.2" /db_xref="GI:57116955" /db_xref="GOA:P64288" /db_xref="UniProtKB/Swiss-Prot:P64288" /db_xref="GeneID:887277" /translation="MRVLVAPDCYGDSLSAVEAAAAIATGWTRSRPGDSFIVAPQSDG GPGFVEVLGSRLGETRRLRVCGPLNTVVNAAWVFDPGSATAYLECAQACGLGLLGGPP TPETALAAHSKGVGQLIAAALRAGAARIVVGLGGSACTDGGKGMIAELGGLDAARRQL ADVEVIAASDVEYPLLGPWGTARVFAPQKGADMATVAVLEGRLAAWAIELDAAAGRGV SAEPGAGAAGGIGAGLLAVGGRYQSGAAIIAEHTHFADDLADAELIVTGEGRFDEQSL HGKVVGAIAAAARPLAIPVIVLAGQVSLDKSALRSAGIMAALSIAEYAGSVRLALADA ANQLMGLASQVAARLGNSGPSGYR" gene 2470622..2471332 /locus_tag="Rv2206" /db_xref="GeneID:888107" CDS 2470622..2471332 /locus_tag="Rv2206" /function="UNKNOWN" /note="Rv2206, (MTCY190.17), len: 236 aa. Probable conserved transmembrane protein. Equivalent to hypothetical protein ML0869 (247 aa) of Mycobacterium leprae gZ98741|MLCB22_2 (247 aa), FASTA scores: opt: 1052, (67.5% identity in 237 aa overlap). Two hydrophobic stretches in C-terminal part. Start changed since original submission (+112 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216722.2" /db_xref="GI:57116956" /db_xref="GOA:P64951" /db_xref="UniProtKB/Swiss-Prot:P64951" /db_xref="GeneID:888107" /translation="MKLLGHRKSHGHQRADASPDAGSKDGCRPDSGRTSGSDTSRGSQ TTGPKGRPTPKRNQSRRHTKKGPVAPAPMTAAQARARRKSLAGPKLSREERRAEKAAN RARMTERRERMMAGEEAYLLPRDRGPVRRYVRDVVDSRRNLLGLFMPSALTLLFVMFA VPQVQFYLSPAMLILLALMTIDAIILGRKVGRLVDTKFPSNTESRWRLGLYAAGRASQ IRRLRAPRPQVERGGDVG" gene 2471411..2472496 /gene="cobT" /locus_tag="Rv2207" /db_xref="GeneID:887781" CDS 2471411..2472496 /gene="cobT" /locus_tag="Rv2207" /EC_number="2.4.2.21" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS" /note="catalyzes the synthesis of alpha-ribazole-5'-phosphate from nicotinate mononucleotide and 5,6-dimethylbenzimidazole" /codon_start=1 /transl_table=11 /product="nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase" /protein_id="NP_216723.1" /db_xref="GI:15609344" /db_xref="GOA:P63841" /db_xref="UniProtKB/Swiss-Prot:P63841" /db_xref="GeneID:887781" /translation="MIGFAPVSTPDAAAEAAARARQDSLTKPRGALGSLEDLSVWVAS CQQRCPPRQFERARVVVFAGDHGVARSGVSAYPPEVTAQMVANIDAGGAAINALADVA GATVRVADLAVDADPLSERIGAHKVRRGSGNIATEDALTNDETAAAITAGQQIADEEV DAGADLLIAGDMGIGNTTAAAVLVAALTDAEPVAVVGFGTGIDDAGWARKTAAVRDAL FRVRPVLPDPVGLLRCAGGADLAAIAGFCAQAAVRRTPLLLDGVAVTAAALVAERLAP GAHRWWQAGHRSSEPGHGLALAALGLDPIVDLHMRLGEGTGAAVALMVLRAAVAALSS MATFTEAGVSTRSVDGVDRTAPPAVSP" gene 2472493..2473242 /gene="cobS" /locus_tag="Rv2208" /db_xref="GeneID:887572" CDS 2472493..2473242 /gene="cobS" /locus_tag="Rv2208" /EC_number="2.-.-.-" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS" /note="catalyzes the formation of adenosylcobalamin from Ado-cobinamide-GDP and alpha-ribazole" /codon_start=1 /transl_table=11 /product="cobalamin synthase" /protein_id="NP_216724.1" /db_xref="GI:15609345" /db_xref="GOA:Q10397" /db_xref="UniProtKB/Swiss-Prot:Q10397" /db_xref="GeneID:887572" /translation="MMRSLATAFAFATVIPTPGSATTPMGRGPMTALPVVGAALGALA AAIAWAGAQVFGPSSPLSGMLTVAVLLVVTRGLHIDGVADTADGLGCYGPPQRALAVM RDGSTGPFGVAAVVLVIALQGLAFATLTTVGIAGITLAVLSGRVTAVLVCRRLVPAAH GSTLGSRVAGTQPAPVVAAWLAVLLAVSVPAGPRPWQGPIAVLVAVTAGAALAAHCVH RFGGVTGDVLGSAIELSTTVSAVTLAGLARL" gene 2473400..2474938 /locus_tag="Rv2209" /db_xref="GeneID:887230" CDS 2473400..2474938 /locus_tag="Rv2209" /function="UNKNOWN" /note="Rv2209, (MTCY190.20), len: 512 aa. Probable conserved integral membrane protein, similar to but longer than Rv0246 gp|AL021929|MTV 034_12 Mycobacterium tuberculosis (436 aa). FASTA score: opt: 712, E(): 2.8e- 32; 33.4% identity in 422 aa overlap" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216725.1" /db_xref="GI:15609346" /db_xref="GOA:P64953" /db_xref="UniProtKB/Swiss-Prot:P64953" /db_xref="GeneID:887230" /translation="MPASRLVRQVSAPRNLFGRLVAQGGFYTAGLQLGSGAVVLPVIC AHQGLTWAAGLLYPAFCIGAILGNSLSPLILQRAGQLRHLLMAAISATAAALVVCNAA VPWTGVGVAAVFLATTGAGGVVTGVSSVAYTDMISSMLPAVRRGELLLTQGAAGSVLA TGVTLVIVPMLAHGNEMARYHDLLWLGAAGLVCSGIAALFVGPMRSVSVTTATRMPLR EIYWMGFAIARSQPWFRRYMTTYLLFVPISLGTTFFSLRAAQSNGSLHVLVILSSIGL VVGSMLWRQINRLFGVRGLLLGSALLNAAAALLCMVAESCGQWVHAWAYGTAFLLATV AAQTVVAASISWISVLAPERYRATLICVGSTLAAVEATVLGVALGGIAQKHATIWPVV VVLTLAVIAAVASLRAPTRIGVTADTSPQAATLQAYRPATPNPIHSDERSTPPDHLSV RRGQLRHVWDSRRPAPPLNRPSCRRAARRPAPGKPAAALPQPRHPAVGVREGAPLDAG QRIA" gene complement(2474864..2475970) /gene="ilvE" /locus_tag="Rv2210c" /db_xref="GeneID:888352" CDS complement(2474864..2475970) /gene="ilvE" /locus_tag="Rv2210c" /EC_number="2.6.1.42" /function="THOUGHT TO CATALYZE THE FIRST REACTION IN THE CATABOLISM OF THE ESSENTIAL BRANCHED CHAIN AMINO ACIDS LEUCINE, ISOLEUCINE, AND VALINE [CATALYTIC ACTIVITY: L-leucine + 2-oxoglutarate = 4-methyl-2-oxopentanoate + L-glutamate]." /note="catalyzes the transamination of the branched-chain amino acids to their respective alpha-keto acids" /codon_start=1 /transl_table=11 /product="branched-chain amino acid aminotransferase" /protein_id="NP_216726.1" /db_xref="GI:15609347" /db_xref="GOA:Q10399" /db_xref="UniProtKB/Swiss-Prot:Q10399" /db_xref="GeneID:888352" /translation="MTSGSLQFTVLRAVNPATDAQRESMLREPGFGKYHTDHMVSIDY AEGRGWHNARVIPYGPIELDPSAIVLHYAQEVFEGLKAYRWADGSIVSFRADANAARL RSSARRLAIPELPDAVFIESLRQLIAVDKAWVPGAGGEEALYLRPFIFATEPGLGVRP ATQYRYLLIASPAGAYFKGGIAPVSVWVSTEYVRACPGGTGAAKFGGNYAASLLAQAE AAENGCDQVVWLDAVERRYIEEMGGMNIFFVLGSGGSARLVTPELSGSLLPGITRDSL LQLAIDAGFAVEERRIDIDEWQKKAAAGEITEVFACGTAAVITPVARVRHGASEFRIA DGQPGEVTMALRDTLTGIQRGTFADTHGWMARLG" gene complement(2476042..2477181) /gene="gcvT" /locus_tag="Rv2211c" /db_xref="GeneID:887233" CDS complement(2476042..2477181) /gene="gcvT" /locus_tag="Rv2211c" /EC_number="2.1.2.10" /function="The glycine cleavage system catalyzes the degradation of glycine [CATALYTIC ACTIVITY: (6S)-tetrahydrofolate + S-aminomethyldihydrolipoylprotein = (6R)-5,10-methylenetetrahydrofolate + NH3 + dihydrolipoylprotein]." /note="catalyzes the transfer of a methylene carbon from the methylamine-loaded GcvH protein to tetrahydrofolate, causing the release of ammonia and the generation of reduced GcvH protein" /codon_start=1 /transl_table=11 /product="glycine cleavage system aminomethyltransferase T" /protein_id="NP_216727.1" /db_xref="GI:15609348" /db_xref="GOA:P64220" /db_xref="UniProtKB/Swiss-Prot:P64220" /db_xref="GeneID:887233" /translation="MCQQGRPLGWDAVSDVPELIHGPLEDRHRELGASFAEFGGWLMP VSYAGTVSEHNATRTAVGLFDVSHLGKALVRGPGAAQFVNSALTNDLGRIGPGKAQYT LCCTESGGVIDDLIAYYVSDDEIFLVPNAANTAAVVGALQAAAPGGLSITNLHRSYAV LAVQGPCSTDVLTALGLPTEMDYMGYADASYSGVPVRVCRTGYTGEHGYELLPPWESA GVVFDALLAAVSAAGGEPAGLGARDTLRTEMGYPLHGHELSLDISPLQARCGWAVGWR KDAFFGRAALLAEKAAGPRRLLRGLRMVGRGVLRPGLAVLVGDETVGVTTSGTFSPTL QVGIGLALIDSDAGIEDGQQINVDVRGRAVECQVVCPPFVAVKTR" gene 2477190..2478326 /locus_tag="Rv2212" /db_xref="GeneID:887979" CDS 2477190..2478326 /locus_tag="Rv2212" /function="UNKNOWN" /note="Rv2212, (MTCY190.23), len: 378 aa. Conserved hypothetical protein. Some similarity to adenylate cyclases, e.g. SW:CYAA_STRCO P40135 (29.2% identity in 291 aa overlap); ttg at 24614 in MTCY190 has a better rbs. Contains possible helix-turn-helix motif at aa 64- 85, (+2.72 SD). Also similar to Rv1264 and Rv1647" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216728.1" /db_xref="GI:15609349" /db_xref="GOA:P64265" /db_xref="UniProtKB/Swiss-Prot:P64265" /db_xref="GeneID:887979" /translation="MYDSLDFDALEAAGIANPRERAGLLTYLDELGFTVEEMVQAERR GRLFGLAGDVLLWSGPPIYTLATAADELGLSADDVARAWSLLGLTVAGPDVPTLSQAD VDALATWVALKALVGEDGAFGLLRVLGTAMARLAEAESTMIRAGSPNIQMTHTHDELA TARAYRAAAEFVPRIGALIDTVHRHHLASARTYFEGVIGDTSASVTCGIGFADLSSFT ALTQALTPAQLQDLLTEFDAAVTDVVHADGGRLVKFIGDAVMWVSSSPERLVRAAVDL VDHPGARAAELQVRAGLAYGTVLALNGDYFGNPVNLAARLVAAAAPGQILAAAQLRDM LPDWPALAHGPLTLKGFDAPVMAFELHDNPRARDADTPSPAASD" gene 2478338..2479885 /gene="pepB" /locus_tag="Rv2213" /db_xref="GeneID:888105" CDS 2478338..2479885 /gene="pepB" /locus_tag="Rv2213" /EC_number="3.4.11.1" /function="protein degradation" /note="catalyzes the removal of N-terminal amino acids preferably leucine from various peptides" /codon_start=1 /transl_table=11 /product="leucyl aminopeptidase" /protein_id="NP_216729.1" /db_xref="GI:15609350" /db_xref="GOA:Q10401" /db_xref="UniProtKB/Swiss-Prot:Q10401" /db_xref="GeneID:888105" /translation="MTTEPGYLSPSVAVATSMPKRGVGAAVLIVPVVSTGEEDRPGAV VASAEPFLRADTVAEIEAGLRALDATGASDQVHRLAVPSLPVGSVLTVGLGKPRREWP ADTIRCAAGVAARALNSSEAVITTLAELPGDGICSATVEGLILGSYRFSAFRSDKTAP KDAGLRKITVLCCAKDAKKRALHGAAVATAVATARDLVNTPPSHLFPAEFAKRAKTLS ESVGLDVEVIDEKALKKAGYGGVIGVGQGSSRPPRLVRLIHRGSRLAKNPQKAKKVAL VGKGITFDTGGISIKPAASMHHMTSDMGGAAAVIATVTLAARLRLPIDVIATVPMAEN MPSATAQRPGDVLTQYGGTTVEVLNTDAEGRLILADAIVRACEDKPDYLIETSTLTGA QTVALGTRIPGVMGSDEFRDRVAAISQRVGENGWPMPLPDDLKDDLKSTVADLANVSG QRFAGMLVAGVFLREFVAESVDWAHIDVAGPAYNTGSAWGYTPKGATGVPTRTMFAVL EDIAKNG" gene complement(2479923..2481701) /gene="ephD" /locus_tag="Rv2214c" /db_xref="GeneID:887472" CDS complement(2479923..2481701) /gene="ephD" /locus_tag="Rv2214c" /function="THOUGHT TO BE INVOLVED IN DETOXIFICATION REACTIONS FOLLOWING OXIDATIVE DAMAGE TO LIPIDS." /note="Rv2214c, (MTCY190.25c), len: 592 aa. Possible ephD, short-chain dehydrogenase (EC 1.-.-.-) (see citation below), equivalent to Z98741|MLCB22_8 Mycobacterium leprae cosmid B22; (596 aa), FASTA score: opt: 3262, E(): 0; 80.4% identity in 596 aa overlap. C-terminus similar to short-chain alcohol dehydrogenase family, similar to SW:LIGD_PSEPA Q01198 c alpha-dehydrogenase (30.7% identity in 241 aa overlap); contains PS00061 Short-chain alcohol dehydrogenase family signature, PS00697 ATP-dependent DNA ligase AMP-binding site. N-terminus corresponds to several epoxide hydrolases of plants and Mycobacterium tuberculosis e.g. MTCY9F925" /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216730.1" /db_xref="GI:15609351" /db_xref="GOA:P66777" /db_xref="UniProtKB/Swiss-Prot:P66777" /db_xref="GeneID:887472" /translation="MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVL WDGVVPLLAERFRIVRYDNRGVGRSSVPKPISAYTMAHFADDFDAVIGELSPGEPVHV LAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRPWRPRTFLRAI SQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIPVDQIHHSETLARDAAHSV KTYPANYFRSFSSSRRGRAIPIVDVPVQLIVNSQDPYVRPYGYDQTARWVPRLWRRDI KAGHFSPMSHPQVMAAAVHDFADLADGKQPSRALLRAQVGRPRGYFGDTLVSVTGAGS GIGRETALAFAREGAEIVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFA ERVSAEHGVPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVER GTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLTTICPGVID TNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVADAIVSAVKKKKPIRPVA PEAYALYGISRVLPQALRSTARLRVI" gene 2481965..2483626 /gene="dlaT" /locus_tag="Rv2215" /db_xref="GeneID:888777" CDS 2481965..2483626 /gene="dlaT" /locus_tag="Rv2215" /EC_number="2.3.1.61" /function="Involved in tricarboxylic acid cycle; converts 2-oxoglutarate to succinyl-CoA and CO2" /experiment="experimental evidence, no additional details recorded" /note="E2 component; part of pyruvate dehydrogenase; forms a complex with AceA and LpdC" /codon_start=1 /transl_table=11 /product="dihydrolipoamide acetyltransferase" /protein_id="NP_216731.1" /db_xref="GI:15609352" /db_xref="GOA:P65633" /db_xref="UniProtKB/Swiss-Prot:P65633" /db_xref="GeneID:888777" /translation="MAFSVQMPALGESVTEGTVTRWLKQEGDTVELDEPLVEVSTDKV DTEIPSPAAGVLTKIIAQEDDTVEVGGELAVIGDAKDAGEAAAPAPEKVPAAQPESKP APEPPPVQPTSGAPAGGDAKPVLMPELGESVTEGTVIRWLKKIGDSVQVDEPLVEVST DKVDTEIPSPVAGVLVSISADEDATVPVGGELARIGVAADIGAAPAPKPAPKPVPEPA PTPKAEPAPSPPAAQPAGAAEGAPYVTPLVRKLASENNIDLAGVTGTGVGGRIRKQDV LAAAEQKKRAKAPAPAAQAAAAPAPKAPPAPAPALAHLRGTTQKASRIRQITANKTRE SLQATAQLTQTHEVDMTKIVGLRARAKAAFAEREGVNLTFLPFFAKAVIDALKIHPNI NASYNEDTKEITYYDAEHLGFAVDTEQGLLSPVIHDAGDLSLAGLARAIADIAARARS GNLKPDELSGGTFTITNIGSQGALFDTPILVPPQAAMLGTGAIVKRPRVVVDASGNES IGVRSVCYLPLTYDHRLIDGADAGRFLTTIKHRLEEGAFEADLGL" gene 2483626..2484531 /locus_tag="Rv2216" /db_xref="GeneID:887640" CDS 2483626..2484531 /locus_tag="Rv2216" /function="UNKNOWN" /note="Rv2216, (MTCY190.27), len: 301 aa. Conserved hypothetical protein, equivalent to Mycobacterium leprae ML0860 (307 aa), Z98741|MLCB22_10 Mycobacterium leprae cosmid B22; H (307 aa). FASTA score: opt: 1656, E(): 0; 84.2% identity in 297 aa overlap. Also gp|AE000319|ECAE000319_8 Escherichia coli strain K12 MG1655 (297 aa) opt: 640, E(): 0; 39.5% identity in 294 aa overlap." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216732.1" /db_xref="GI:15609353" /db_xref="UniProtKB/Swiss-Prot:P67232" /db_xref="GeneID:887640" /translation="MANAVVAIAGSSGLIGSALTAALRAADHTVLRIVRRAPANSEEL HWNPESGEFDPHALTDVDAVVNLCGVNIAQRRWSGAFKQSLRDSRITPTEVLSAAVAD AGVATLINASAVGYYGNTKDRVVDENDSAGTGFLAQLCVDWETATRPAQQSGARVVLA RTGVVLSPAGGMLRRMRPLFSVGLGARLGSGRQYMSWISLEDEVRALQFAIAQPNLSG PVNLTGPAPVTNAEFTTAFGRAVNRPTPLMLPSVAVRAAFGEFADEGLLIGQRAIPSA LERAGFQFHHNTIGEALGYATTRPG" gene 2484584..2485276 /gene="lipB" /locus_tag="Rv2217" /db_xref="GeneID:887626" CDS 2484584..2485276 /gene="lipB" /locus_tag="Rv2217" /EC_number="6.-.-.-" /function="lipoate biosynthesis" /note="lipoyl/octanoyltransferase; catalyzes the transfer of the lipoyl/octanoyl moiety of lipoyl/octanoyl-ACP onto lipoate-dependent enzymes like pyruvate dehydrogenase and the glycine cleavage system H protein" /codon_start=1 /transl_table=11 /product="lipoate-protein ligase B" /protein_id="NP_216733.1" /db_xref="GI:15609354" /db_xref="GOA:Q10404" /db_xref="UniProtKB/Swiss-Prot:Q10404" /db_xref="GeneID:887626" /translation="MTGSIRSKLSAIDVRQLGTVDYRTAWQLQRELADARVAGGADTL LLLEHPAVYTAGRRTETHERPIDGTPVVDTDRGGKITWHGPGQLVGYPIIGLAEPLDV VNYVRRLEESLIQVCADLGLHAGRVDGRSGVWLPGRPARKVAAIGVRVSRATTLHGFA LNCDCDLAAFTAIVPCGISDAAVTSLSAELGRTVTVDEVRATVAAAVCAALDGVLPVG DRVPSHAVPSPL" gene 2485273..2486208 /gene="lipA" /locus_tag="Rv2218" /db_xref="GeneID:887922" CDS 2485273..2486208 /gene="lipA" /locus_tag="Rv2218" /function="lipoate biosynthesis" /note="catalyzes the radical-mediated insertion of two sulfur atoms into an acyl carrier protein (ACP) bound to an octanoyl group to produce a lipoyl group" /codon_start=1 /transl_table=11 /product="lipoyl synthase" /protein_id="NP_216734.1" /db_xref="GI:15609355" /db_xref="GOA:P65283" /db_xref="UniProtKB/Swiss-Prot:P65283" /db_xref="GeneID:887922" /translation="MSVAAEGRRLLRLEVRNAQTPIERKPPWIKTRARIGPEYTELKN LVRREGLHTVCEEAGCPNIFECWEDREATFLIGGDQCTRRCDFCQIDTGKPAELDRDE PRRVADSVRTMGLRYATVTGVARDDLPDGGAWLYAATVRAIKELNPSTGVELLIPDFN GEPTRLAEVFESGPEVLAHNVETVPRIFKRIRPAFTYRRSLGVLTAARDAGLVTKSNL ILGLGETSDEVRTALGDLRDAGCDIVTITQYLRPSARHHPVERWVKPEEFVQFARFAE GLGFAGVLAGPLVRSSYRAGRLYEQARNSRALASR" gene 2486235..2486987 /locus_tag="Rv2219" /db_xref="GeneID:888380" CDS 2486235..2486987 /locus_tag="Rv2219" /function="UNKNOWN" /note="Rv2219, (MTCY190.30), len: 250 aa. Probable conserved transmembrane protein. Equivalent to hypothetical membrane protein ML0857 (250 aa) from Mycobacterium leprae Z98741 |MLCB22_13 Mycobacterium leprae cosmid B22; H (250 aa) opt : 1328, E(): 0; 80.8% identity in 250 aa overlap." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216735.1" /db_xref="GI:15609356" /db_xref="GOA:Q10405" /db_xref="UniProtKB/Swiss-Prot:Q10405" /db_xref="GeneID:888380" /translation="MAKPRNAAESKAAKAQANAARKAAARQRRAQLWQAFTLQRKEDK RLLPYMIGAFLLIVGASVGVGVWAGGFTMFTMIPLGVLLGALVAFVIFGRRAQRTVYR KAEGQTGAAAWALDNLRGKWRVTPGVAATGNLDAVHRVIGRPGVIFVGEGSAARVKPL LAQEKKRTARLVGDVPIYDIIVGNGDGEVPLAKLERHLTRLPANITVKQMDTVESRLA ALGSRAGAGVMPKGPLPTTAKMRSVQRTVRRK" gene complement(2486994..2487416) /locus_tag="Rv2219A" /db_xref="GeneID:3205099" CDS complement(2486994..2487416) /locus_tag="Rv2219A" /function="UNKNOWN" /note="Rv2219A, len: 140 aa. Probable conserved membrane protein, similar to SC3H12.05c|AL355740_5 possible integral membrane protein from Streptomyces coelicolor (155 aa), FASTA scores: opt: 327, E(): 7.5e-14, (46.6% identity in 133 aa overlap), also linked to glnA." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177661.1" /db_xref="GI:57116957" /db_xref="UniProtKB/TrEMBL:Q79FG7" /db_xref="GeneID:3205099" /translation="MTAKSPPDYPGKTLGLPDTGPGSLAPMGRRLAALLIDWLIAYGL ALLGVEFGVWSTPMLSTVVLVIWLLLGVAAVRLFGFTPGQLMLGLVVVAVGGRRPVGI GRLVVRGLLIGLVVPPLFTDSDGRGLHDRLTATAVVRR" gene 2487615..2489051 /gene="glnA1" /locus_tag="Rv2220" /db_xref="GeneID:888383" CDS 2487615..2489051 /gene="glnA1" /locus_tag="Rv2220" /EC_number="6.3.1.2" /function="INVOLVED IN GLUTAMINE BIOSYNTHESIS [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE + NH(3) = ADP + GLUTAMINE + ORTHOPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv2220, (MTCY190.31, MTCY427.01), len: 478 aa. glnA1, glutamine synthetase class I (EC 6.3.1.2) (see Tullius et al., 2001), similar to many e.g. GLNA_STRCO|P15106 from Streptomyces coelicolor, FASTA score: (71.4% identity in 475 aa overlap); etc. Also similar to three other potential glutamine synthetases in Mycobacterium tuberculosis: Rv2222c|glnA2, Rv2860c|glnA4, and Rv1878|glnA3. Contains PS00180 Glutamine synthetase signature 1, PS00181 Glutamine synthetase putative ATP-binding region signature, and PS00182 Glutamine synthetase class-I adenylation site. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY.; glnA" /codon_start=1 /transl_table=11 /product="glutamine synthetase GLNA1 (glutamine synthase) (GS-I)" /protein_id="NP_216736.1" /db_xref="GI:15609357" /db_xref="GOA:Q10377" /db_xref="UniProtKB/Swiss-Prot:Q10377" /db_xref="GeneID:888383" /translation="MTEKTPDDVFKLAKDEKVEYVDVRFCDLPGIMQHFTIPASAFDK SVFDDGLAFDGSSIRGFQSIHESDMLLLPDPETARIDPFRAAKTLNINFFVHDPFTLE PYSRDPRNIARKAENYLISTGIADTAYFGAEAEFYIFDSVSFDSRANGSFYEVDAISG WWNTGAATEADGSPNRGYKVRHKGGYFPVAPNDQYVDLRDKMLTNLINSGFILEKGHH EVGSGGQAEINYQFNSLLHAADDMQLYKYIIKNTAWQNGKTVTFMPKPLFGDNGSGMH CHQSLWKDGAPLMYDETGYAGLSDTARHYIGGLLHHAPSLLAFTNPTVNSYKRLVPGY EAPINLVYSQRNRSACVRIPITGSNPKAKRLEFRSPDSSGNPYLAFSAMLMAGLDGIK NKIEPQAPVDKDLYELPPEEAASIPQTPTQLSDVIDRLEADHEYLTEGGVFTNDLIET WISFKRENEIEPVNIRPHPYEFALYYDV" misc_feature 2488407..2488454 /gene="glnA1" /locus_tag="Rv2220" /note="PS00181 Glutamine synthetase putative ATP-binding region signature" misc_feature 2488794..2488832 /gene="glnA1" /locus_tag="Rv2220" /note="PS00182 Glutamine synthetase class-I adenylation site" gene complement(2489369..2492353) /gene="glnE" /locus_tag="Rv2221c" /db_xref="GeneID:887995" CDS complement(2489369..2492353) /gene="glnE" /locus_tag="Rv2221c" /EC_number="2.7.7.42" /function="REGULATORY PROTEIN INVOLVED IN THE REGULATION OF GLUTAMINE SYNTHETASE ACTIVITY. ADENYLYLATION AND DEADENYLYLATION OF GLUTAMINE SYNTHETASE. POSSIBLY REGULATES GLNB|Rv2919c [CATALYTIC ACTIVITY: ATP + [L-GLUTAMATE:AMMONIA LIGASE (ADP-FORMING)] = PYROPHOSPHATE + ADENYLYL-[L-GLUTAMATE:AMMONIA LIGASE (ADP-FORMING)]]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the ATP-dependent addition of AMP to a subunit of glutamine synthetase; also catalyzes the reverse reaction - deadenylation; adenylation/deadenylation of glutamine synthetase subunits is important for the regulation of this enzyme" /codon_start=1 /transl_table=11 /product="bifunctional glutamine-synthetase adenylyltransferase/deadenyltransferase" /protein_id="NP_216737.1" /db_xref="GI:15609358" /db_xref="GOA:Q10379" /db_xref="UniProtKB/Swiss-Prot:Q10379" /db_xref="GeneID:887995" /translation="MVVTKLATQRPKLPSVGRLGLVDPPAGERLAQLGWDRHEDQAHV DLLWSLSRAPDADAALRALIRLSENPDTGWDELNAALLRERSLRGRLFSVLGSSLALG DHLVAHPQSWKLLRGKVTLPSHDQLQRSFVECVEESEGMPGSLVHRLRTQYRDYVLML AALDLAATVEDEPVLPFTVVAARLADAADAALAAALRVAEASVCGEHPPPRLAVIAMG KCGARELNYVSDVDVIFVAERSDPRNARVASEMMRVASAAFFEVDAALRPEGRNGELV RTLESHIAYYQRWAKTWEFQALLKARPVVGDAELGERYLTALMPMVWRACEREDFVVE VQAMRRRVEQLVPADVRGRELKLGSGGLRDVEFAVQLLQLVHARSDESLRVASTVDAL AALGEGGYIGREDAANMTASYEFLRLLEHRLQLQRLKRTHLLPDPEDEEAVRWLARAA HIRPDGRNDAAGVLREELKKQNVRVSKLHTKLFYQPLLESIGPTGLEIAHGMTLEAAG RRLAALGYEGPQTALKHMSALVNQSGRRGRVQSVLLPRLLDWMSYAPDPDGGLLAYRR LSEALATESWYLATLRDKPAVAKRLMHVLGTSAYVPDLLMRAPRVIQQYEDGPAGPKL LETEPAAVARALIASASRYPDPERAIAGARTLRRRELARIGSADLLGLLEVTEVCRAL TSVWVAVLQAALDVMIRASLPDDDRAPAAIAVIGMGRLGGAELGYGSDADVMFVCEPA TGVDDARAVKWSTSIAERVRALLGTPSVDPPLELDANLRPEGRNGPLVRTLGSYAAYY EQWAQPWEIQALLRAHAVAGDAELGQRFLRMVDKTRYPPDGVSADSVREIRRIKARIE SERLPRGADPNTHTKLGRGGLADIEWTVQLLQLQHAHQVPALHNTSTLQSLDVIAAAD LVPAADVELLRQAWLTATRARNALVLVRGKPTDQLPGPGRQLNAVAVAAGWRNDDGGE FLDNYLRVTRRAKAVVRKVFGS" gene complement(2492402..2493742) /gene="glnA2" /locus_tag="Rv2222c" /db_xref="GeneID:888238" CDS complement(2492402..2493742) /gene="glnA2" /locus_tag="Rv2222c" /EC_number="6.3.1.2" /function="INVOLVED IN GLUTAMINE BIOSYNTHESIS [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE + NH(3) = ADP + GLUTAMINE + ORTHOPHOSPHATE]." /note="Rv2222c, (MTCY427.03c), len: 446 aa. Probable glnA2, glutamine synthetase class II (EC 6.3.1.2), similar to others. Also similar to three other potential glutamine synthetases in Mycobacterium tuberculosis: Rv2220|glnA1, Rv2860c|glnA4, and Rv1878|glnA3. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY." /codon_start=1 /transl_table=11 /product="glutamine synthetase" /protein_id="NP_216738.1" /db_xref="GI:15609359" /db_xref="GOA:P64245" /db_xref="UniProtKB/Swiss-Prot:P64245" /db_xref="GeneID:888238" /translation="MDRQKEFVLRTLEERDIRFVRLWFTDVLGFLKSVAIAPAELEGA FEEGIGFDGSSIEGFARVSESDTVAHPDPSTFQVLPWATSSGHHHSARMFCDITMPDG SPSWADPRHVLRRQLTKAGELGFSCYVHPEIEFFLLKPGPEDGSVPVPVDNAGYFDQA VHDSALNFRRHAIDALEFMGISVEFSHHEGAPGQQEIDLRFADALSMADNVMTFRYVI KEVALEEGARASFMPKPFGQHPGSAMHTHMSLFEGDVNAFHSADDPLQLSEVGKSFIA GILEHACEISAVTNQWVNSYKRLVQGGEAPTAASWGAANRSALVRVPMYTPHKTSSRR VEVRSPDSACNPYLTFAVLLAAGLRGVEKGYVLGPQAEDNVWDLTPEERRAMGYRELP SSLDSALRAMEASELVAEALGEHVFDFFLRNKRTEWANYRSHVTPYELRTYLSL" repeat_region 2493801..2493818 /note="18 bp inverted repeat between 3' end of MTCY427.04c and 5' end of MTCY427.03c" gene complement(2493837..2495399) /locus_tag="Rv2223c" /db_xref="GeneID:888093" CDS complement(2493837..2495399) /locus_tag="Rv2223c" /EC_number="3.4.-.-" /function="function unknown; thought to hydrolyze peptides and/or proteins." /note="Rv2223c, (MTCY427.04c), len: 520 aa. Probable exported protease (EC 3.4.-.-); has signal sequence. Very similar to three proteases/peptidases from Streptomyces spp.: L42758, L42759, L27466. FASTA score: L42758|STMSLPD STMSLPD NID: g940302 - Streptomyces (539 aa) opt: 1032 E(): 0, (37.5% identity in 533 aa overlap). Also similar to hypothetical proteins YZZE _ECOLI|P34211 from Escherichia coli (25.4% identity in 406 aa overlap) and PIR:B36944 in ompP 3' region (27.5% identity in 218 aa overlap). Highly similar to Rv2224c and Rv2672 (49.3% identity in 507 aa overlap); contains PS00120 Lipases, serine active site" /codon_start=1 /transl_table=11 /product="exported protease" /protein_id="NP_216739.1" /db_xref="GI:15609360" /db_xref="GOA:P65821" /db_xref="UniProtKB/Swiss-Prot:P65821" /db_xref="GeneID:888093" /translation="MAAMWRRRPLSSALLSFGLLLGGLPLAAPPLAGATEEPGAGQTP GAPVVAPQQSWNSCREFIADTSEIRTARCATVSVPVDYDQPGGTQAKLAVIRVPATGQ RFGALLVNPGGPGASAVDMVAAMAPAIADTDILRHFDLVGFDPRGVGHSTPALRCRTD AEFDAYRRDPMADYSPAGVTHVEQVYRQLAQDCVDRMGFSFLANIGTASVARDMDMVR QALGDDQINYLGYSYGTELGTAYLERFGTHVRAMVLDGAIDPAVSPIEESISQMAGFQ TAFNDYAADCARSPACPLGTDSAQWVNRYHALVDPLVQKPGKTSDPRGLSYADATTGT INALYSPQRWKYLTSGLLGLQRGSDAGDLLVLADDYDGRDADGHYSNDQDAFNAVRCV DAPTPADPAAWVAADQRIRQVAPFLSYGQFTGSAPRDLCALWPVPATSTPHPAAPAGA GKVVVVSTTHDPATPYQSGVDLARQLGAPLITFDGTQHTAVFDGNQCVDSAVMHYFLD GTLPPTSLRCAP" misc_feature complement(2494695..2494724) /locus_tag="Rv2223c" /note="PS00120 Lipases, serine active site" gene complement(2495461..2497023) /locus_tag="Rv2224c" /db_xref="GeneID:887857" CDS complement(2495461..2497023) /locus_tag="Rv2224c" /EC_number="3.4.-.-" /function="function unknown; thought to hydrolyze peptides and/or proteins." /experiment="experimental evidence, no additional details recorded" /note="Rv2224c, (MTCY427.05c), len: 520 aa. Probable exported protease (EC 3.4.-.-); has signal sequence and lipoprotein motif at N-terminal end. Very similar to three proteases/peptidases from Streptomyces spp.: L42758, L42759, L27466. FASTA score: L4 2758|STMSLPD STMSLPD NID: g940302 - Streptomyces (539 aa) opt: 1032 E(): 0, (37.5% identity in 533 aa overlap). Similar to hypothetical protein SW:YZZE_ECOLI P34211 (27.7% identity in 412 aa overlap) and highly similar to Rv2224c and Rv2672 (49.3% identity in 507 aa overlap); contains PS00013, Prokaryotic membrane lipoprotein lipid attachment site, and PS00120 Lipases, serine active site." /codon_start=1 /transl_table=11 /product="exported protease" /protein_id="NP_216740.1" /db_xref="GI:15609361" /db_xref="UniProtKB/Swiss-Prot:P65823" /db_xref="GeneID:887857" /translation="MGMRLSRRDKIARMLLIWAALAAVALVLVGCIRVVGGRARMAEP KLGQPVEWTPCRSSNPQVKIPGGALCGKLAVPVDYDRPDGDVAALALIRFPATGDKIG SLVINPGGPGESGIEAALGVFQTLPKRVHERFDLVGFDPRGVASSRPAIWCNSDADND RLRAEPQVDYSREGVAHIENETKQFVGRCVDKMGKNFLAHVGTVNVAKDLDAIRAALG DDKLTYLGYSYGTRIGSAYAEEFPQRVRAMILDGAVDPNADPIEAELRQAKGFQDAFN NYAADCAKNAGCPLGADPAKAVEVYHSLVDPLVDPDNPRISRPARTKDPRGLSYSDAI VGTIMALYSPNLWQHLTDGLSELVDNRGDTLLALADMYMRRDSHGRYNNSGDARVAIN CVDQPPVTDRDKVIDEDRRAREIAPFMSYGKFTGDAPLGTCAFWPVPPTSQPHAVSAP GLVPTVVVSTTHDPATPYKAGVDLANQLRGSLLTFDGTQHTVVFQGDSCIDEYVTAYL IGGTTPPSGAKC" misc_feature complement(2496331..2496360) /locus_tag="Rv2224c" /note="PS00120 Lipases, serine active site" gene 2497742..2498587 /gene="panB" /locus_tag="Rv2225" /db_xref="GeneID:887440" CDS 2497742..2498587 /gene="panB" /locus_tag="Rv2225" /EC_number="2.1.2.11" /function="INVOLVED IN PANTOTHENATE BIOSYNTHESIS." /note="catalyzes the formation of tetrahydrofolate and 2-dehydropantoate from 5,10-methylenetetrahydrofolate and 3-methyl-2-oxobutanoate" /codon_start=1 /transl_table=11 /product="3-methyl-2-oxobutanoate hydroxymethyltransferase" /protein_id="NP_216741.1" /db_xref="GI:15609362" /db_xref="GOA:Q10505" /db_xref="UniProtKB/Swiss-Prot:Q10505" /db_xref="GeneID:887440" /translation="MSEQTIYGANTPGGSGPRTKIRTHHLQRWKADGHKWAMLTAYDY STARIFDEAGIPVLLVGDSAANVVYGYDTTVPISIDELIPLVRGVVRGAPHALVVADL PFGSYEAGPTAALAAATRFLKDGGAHAVKLEGGERVAEQIACLTAAGIPVMAHIGFTP QSVNTLGGFRVQGRGDAAEQTIADAIAVAEAGAFAVVMEMVPAELATQITGKLTIPTV GIGAGPNCDGQVLVWQDMAGFSGAKTARFVKRYADVGGELRRAAMQYAQEVAGGVFPA DEHSF" gene 2498832..2500373 /locus_tag="Rv2226" /db_xref="GeneID:888513" CDS 2498832..2500373 /locus_tag="Rv2226" /function="UNKNOWN" /note="Rv2226, (MTCY427.07), len: 513 aa. Conserved hypothetical protein, similar to hypothetical secreted protein (510 aa) from Streptomyces coelicolor A3(2) emb|CAB59601.1| (AL132662) hypothetical secreted protein [Streptomyces coelicolor. Smith-Waterman scores Expect = 5e-44 Identities = 166/506 (32%)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216742.1" /db_xref="GI:15609363" /db_xref="GOA:Q10510" /db_xref="UniProtKB/Swiss-Prot:Q10510" /db_xref="GeneID:888513" /translation="MPVEAPRPARHLEVERKFDVIESTVSPSFEGIAAVVRVEQSPTQ QLDAVYFDTPSHDLARNQITLRRRTGGADAGWHLKLPAGPDKRTEMRAPLSASGDAVP AELLDVVLAIVRDQPVQPVARISTHRESQILYGAGGDALAEFCNDDVTAWSAGAFHAA GAADNGPAEQQWREWELELVTTDGTADTKLLDRLANRLLDAGAAPAGHGSKLARVLGA TSPGELPNGPQPPADPVHRAVSEQVEQLLLWDRAVRADAYDAVHQMRVTTRKIRSLLT DSQESFGLKESAWVIDELRELADVLGVARDAEVLGDRYQRELDALAPELVRGRVRERL VDGARRRYQTGLRRSLIALRSQRYFRLLDALDALVSERAHATSGEESAPVTIDAAYRR VRKAAKAAKTAGDQAGDHHRDEALHLIRKRAKRLRYTAAATGADNVSQEAKVIQTLLG DHQDSVVSREHLIQQAIAANTAGEDTFTYGLLYQQEADLAERCREQLEAALRKLDKAV RKARD" gene complement(2500445..2500751) /gene="rnpB" /locus_tag="Rvns01" /db_xref="GeneID:2700434" misc_RNA complement(2500445..2500751) /gene="rnpB" /locus_tag="Rvns01" /product="ribonuclease P RNA" /note="rnpB, len: 307 nt. rna component of RNase P." /function="RNA COMPONENT OF RNase P: RNase P CATALYZES THE REMOVAL OF THE 5'-LEADER SEQUENCE FROM PRE-tRNA TO PRODUCE THE MATURE 5'TERMINUS. THIS PROTEIN PLAYS AN AUXILIARY BUT ESSENTIAL ROLE IN VIVO BY BINDING TO THE 5'-LEADER SEQUENCE AND BROADENING THE SUBSTRATE SPECIFICITY OF THE RIBOZYME." /db_xref="GeneID:2700434" gene 2500931..2501632 /locus_tag="Rv2227" /db_xref="GeneID:888581" CDS 2500931..2501632 /locus_tag="Rv2227" /function="UNKNOWN" /note="Rv2227, (MTCY427.08), len: 233 aa. Conserved hypothetical protein, similar to conserved hypothetical proteins from various bacteria e.g. gb|AAK22693.1| (AE005746) conserved hypothetical protein from Caulobacter crescentus (234 aa) Smith-Waterman score = 109 bits (429), Expect = 1e-41 Identities = 83/167 (49%)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216743.1" /db_xref="GI:15609364" /db_xref="UniProtKB/Swiss-Prot:Q10511" /db_xref="GeneID:888581" /translation="MGQTRRLRRLGRHRCRGQRVRWRTATSADHPRRGRPAAQAVRRR RPVSLDGRYGIQAVRRRAVSIFPCPLSRVIERLKQALYPKLLPIARNWWAKLGREAPW PDSLDDWLASCHAAGQTRSTALMLKYGTNDWNALHQDLYGELVFPLQVVINLSDPETD YTGGEFLLVEQRPRAQSRGTAMQLPQGHGYVFTTRDRPVRTSRGWSASPVRHGLSTIR SGERYAMGLIFHDAA" gene complement(2501644..2502738) /locus_tag="Rv2228c" /db_xref="GeneID:888108" CDS complement(2501644..2502738) /locus_tag="Rv2228c" /function="UNKNOWN" /note="Rv2228c, (MTCY427.09c), len: 364 aa. Conserved hypothetical protein. Some similarity to phosphoglycerate mutase and ribonuclease H. Similar to CAB88177.1|AL352972 putative bifunctional protein (ribonuclease H/phosphoglycerate mutase) from Streptomyces coelicolor A3(2) (497 aa); Smith-Waterman scores: 107 bits (424), Expect = 4e-41 Identities = 160/485 (32%). Also similar in C-terminal part to Rv2419c and Rv2135c." /codon_start=1 /transl_table=11 /product="bifunctional RNase H/acid phosphatase" /protein_id="NP_216744.1" /db_xref="GI:15609365" /db_xref="GOA:P64955" /db_xref="UniProtKB/Swiss-Prot:P64955" /db_xref="GeneID:888108" /translation="MKVVIEADGGSRGNPGPAGYGAVVWTADHSTVLAESKQAIGRAT NNVAEYRGLIAGLDDAVKLGATEAAVLMDSKLVVEQMSGRWKVKHPDLLKLYVQAQAL ASQFRRINYEWVPRARNTYADRLANDAMDAAAQSAAADADPAKIVATESPTSPGWTGA RGTPTRLLLLRHGQTELSEQRRYSGRGNPGLNEVGWRQVGAAAGYLARRGGIAAVVSS PLQRAYDTAVTAARALALDVVVDDDLVETDFGAWEGLTFAEAAERDPELHRRWLQDTS ITPPGGESFDDVLRRVRRGRDRIIVGYEGATVLVVSHVTPIKMLLRLALDAGSGVLYR LHLDLASLSIAEFYADGASSVRLVNQTGYL" gene complement(2502735..2503472) /locus_tag="Rv2229c" /db_xref="GeneID:888264" CDS complement(2502735..2503472) /locus_tag="Rv2229c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2229c, (MTCY427.10c), len: 245 aa. Conserved hypothetical protein; probable coiled-coil protein similar to conserved hypothetical proteins in Actinomycetes. Equivalent to Mycobacterium leprae ML1638 (232 aa), FASTA scores: opt: 868 E(): 4.4e-43; 60.870% identity in 230 aa overlap emb|CAC30589.1| (AL583922)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216745.1" /db_xref="GI:15609366" /db_xref="UniProtKB/Swiss-Prot:Q10513" /db_xref="GeneID:888264" /translation="MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEH NAANDRMAALRIAAEDLDGQVSRFESEIDAVRKRGDRDRSLLTSGATDAKQLADLQHE LDSLQRRQASLEDALLEVLERREELQAQQTAESRALQALRADLAAAQQALDEALAEID QARHQHSSQRDMLTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQ ISAAAEDEVVRCPECGAILLRLEGFEE" gene complement(2503469..2504608) /locus_tag="Rv2230c" /db_xref="GeneID:888231" CDS complement(2503469..2504608) /locus_tag="Rv2230c" /function="UNKNOWN" /note="Rv2230c, (MTCY427.11c), len: 379 aa. Conserved hypothetical protein. Equivalent to Mycobacterium leprae, ML1639, conserved hypothetical protein (385 aa). Similar to hypothetical proteins from B. subtilis, P54472, and L. monocytogenes, P53434. FASTA score: ML1639 (MLCB1243.36) opt: 2088, E(): 4e-107; 79.481% identity in 385 aa overlap same as >pir||T44719 hypothetical protein MLCB1243.36 [imported] - Mycobacterium leprae >gi|3150237|emb|CAA19217.1| (AL023635); P54472|YQFO_BACSU HYPOTHETICAL 30. 7 kDa PROTEIN IN (279 aa) opt: 604; E(): 2.2e-30; 38.8% identity in 258 aa overlap. P53434|YRP2_LISMO HYPOTHETICAL 41.4 kDa PROTEIN (373 aa) opt: 595, E(): 1e-29; 30.7% identity in 326 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216746.1" /db_xref="GI:15609367" /db_xref="UniProtKB/Swiss-Prot:Q10514" /db_xref="GeneID:888231" /translation="MSVRLADVIDVLDQAYPPRLAQSWDSVGLVCGDPDDVVDSVTVA VDATPAVVDQVPQAGLLLVHHPLLLRGVDTVAANTPKGVLVHRLIRTGRSLFTAHTNA DSASPGVSDALAHAVGLTVDAVLDPVPGAADLDKWVIYVPRENSEAVRAAVFEAGAGH IGDYSHCSWSVAGTGQFLAHDGASPAIGSVGTVERVAEDRVEVVAPARARAEVLAAMR AAHPYEEPAFDIFALVPPPVGSGLGRIGRLPKPEPLRTFVARLEAALPPTATGVRAAG DPDLLVSRVAVCGGAGDSLLATVAAADVQAYVTADLRHHPADEHCRASQVALIDVAHW ASEFPWCGQAAEVLRSHFGASLPVRVCTICTDPWNLDHETGRDQA" gene complement(2504605..2505699) /gene="cobC" /locus_tag="Rv2231c" /db_xref="GeneID:888337" CDS complement(2504605..2505699) /gene="cobC" /locus_tag="Rv2231c" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS" /note="Rv2231c, (MTCY427.12c), len: 364 aa. Possible cobC, aminotransferase. Note that initiation codon uncertain. Similar to CobC aminotransferases e.g. sp|P21633|COBC_PSEDE COBC PROTEIN (333 aa) opt: 277, E(): 1.7e-11; 28.8% identity in 313 aa overlap and also to e.g. SW:HIS8_ECOLI P06986 histidinol-phosphate aminotransferase (27.0% identity in 289 aa overlap), contains PS00105 aminotransferases class-I pyridoxal-phosphate attachment site. Real Mycobacterium tuberculosis histidinol-phosphate aminotransferase, hisC, is Rv1600 (MTCY336.04c)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216747.1" /db_xref="GI:15609368" /db_xref="GOA:P63500" /db_xref="UniProtKB/Swiss-Prot:P63500" /db_xref="GeneID:888337" /translation="MLWILGPHTGPLLFDAVASLDTSPLAAARYHGDQDVAPGVLDFA VNVRHDRPPEWLVRQLAALLPELARYPSTDDVHRAQDAVAERHGRTRDEVLPLVGAAE GFALLHNLSPVRAAIVVPAFTEPAIALSAAGITAHHVVLKPPFVLDTAHVPDDADLVV VGNPTNPTSVLHLREQLLELRRPGRILVVDEAFADWVPGEPQSLADDSLPDVLVLRSL TKTWSLAGLRVGYALGSPDVLARLTVQRAHWPLGTLQLTAIAACCAPRAVAAAAADAV RLTALRAEMVAGLRSVGAEVVDGAAPFVLFNIADADGLRNYLQSKGIAVRRGDTFVGL DARYLRAAVRPEWPVLVAAIAEWAKRGGRR" misc_feature complement(2505010..2505051) /gene="cobC" /locus_tag="Rv2231c" /note="PS00105 Aminotransferases class-I pyridoxal-phosphate attachment site" gene 2506278..2507153 /locus_tag="Rv2232" /db_xref="GeneID:887597" CDS 2506278..2507153 /locus_tag="Rv2232" /function="UNKNOWN" /note="Rv2232, (MTCY427.13), len: 291 aa. Conserved hypothetical protein, similar to members of haloacid dehalogenase-like family from several bacteria and to putative phosphatases e.g. Q9I767 and AAK78398. Contains N-terminal extension. FASTA scores: Q9I767 HYPOTHETICAL PROTEIN PA0065 (221 aa) opt: 439 E(): 3.2e-18; 38.679% identity (40.196% ungapped) in 212 aa overlap; >>tr|AAK78398 Predicted phosphatase, HAD family (216 aa) opt: 427, E(): 1.5e-17; 34.762% identity (35.437% ungapped) in 210 aa overlap. Replaces previous Rv2232 and Rv2233." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216748.2" /db_xref="GI:57116958" /db_xref="GOA:Q10515" /db_xref="UniProtKB/Swiss-Prot:Q10515" /db_xref="GeneID:887597" /translation="MSSPRERRPASQAPRLSRRPPAHQTSRSSPDTTAPTGSGLSNRF VNDNGIVTDTTASGTNCPPPPRAAARRASSPGESPQLVIFDLDGTLTDSARGIVSSFR HALNHIGAPVPEGDLATHIVGPPMHETLRAMGLGESAEEAIVAYRADYSARGWAMNSL FDGIGPLLADLRTAGVRLAVATSKAEPTARRILRHFGIEQHFEVIAGASTDGSRGSKV DVLAHALAQLRPLPERLVMVGDRSHDVDGAAAHGIDTVVVGWGYGRADFIDKTSTTVV THAATIDELREALGV" gene 2507146..2507637 /gene="ptpA" /locus_tag="Rv2234" /db_xref="GeneID:887373" CDS 2507146..2507637 /gene="ptpA" /locus_tag="Rv2234" /EC_number="3.1.3.48" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA DEPHOSPHORYLATION). CAN DEPHOSPHORYLATED IN VITRO THE PHOSPHOTYROSINE RESIDUE OF MYELIN BASIC PROTEIN (MBP) AT pH 7.0 [CATALYTIC ACTIVITY: Protein tyrosine phosphate + H(2)O = protein tyrosine + phosphate]." /experiment="experimental evidence, no additional details recorded" /note="Rv2234, (MTCY427.15), len: 163 aa. ptpA (alternate gene name: MPtpA), low molecular weight protein-tyrosine-phosphatase (see citations below) (EC 3.1.3.48), similar to other phosphotyrosine protein phosphatases e.g. P53433|PTPA_STRCO LOW MOLECULAR WEIGHT PROTEIN-TYROSINE PHOSPHATASE from Streptomyces coelicolor (164 aa), FASTA scores: opt: 455, E(): 3.3e -25, (49.7% identity in 155 aa overlap); PA1S_HUMAN|P24667 red cell acid phosphatase 1, FASTA score: (37.7% identity in 138 aa overlap); etc. Contains a phosphatase catalytic site domain located in N-terminal part. Activity proven biochemically. Supposed a secreted protein.; MPtpA" /codon_start=1 /transl_table=11 /product="phosphotyrosine protein phosphatase PTPA (protein-tyrosine-phosphatase) (PTPase) (LMW phosphatase)" /protein_id="NP_216750.1" /db_xref="GI:15609371" /db_xref="GOA:P65716" /db_xref="UniProtKB/Swiss-Prot:P65716" /db_xref="GeneID:887373" /translation="MSDPLHVTFVCTGNICRSPMAEKMFAQQLRHRGLGDAVRVTSAG TGNWHVGSCADERAAGVLRAHGYPTDHRAAQVGTEHLAADLLVALDRNHARLLRQLGV EAARVRMLRSFDPRSGTHALDVEDPYYGDHSDFEEVFAVIESALPGLHDWVDERLARN GPS" gene 2507637..2508452 /locus_tag="Rv2235" /db_xref="GeneID:887606" CDS 2507637..2508452 /locus_tag="Rv2235" /function="UNKNOWN: MAY BE INVOLVED IN THE ABILITY TO SURVIVE IN MACROPHAGES." /experiment="experimental evidence, no additional details recorded" /note="Rv2235, (MTCY427.16), len: 271 aa. Probable conserved transmembrane protein (see Miller & Shinnick 2001); hydrophobic regions near N- and C-terminus. Similar to conserved membrane proteins in other Actinomycetes. Equivalent to Mycobacterium leprae. ML1644 (270 aa). FASTA scores: opt: 1357, E(): 1.2e-72; 74.170% identity in 271 aa overlap T44717|3150235|CAA19213.1|AL023635 13093419|CAC30595.1|AL583922." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216751.1" /db_xref="GI:15609372" /db_xref="GOA:P66883" /db_xref="UniProtKB/Swiss-Prot:P66883" /db_xref="GeneID:887606" /translation="MPRLAFLLRPGWLALALVVVAFTYLCFTVLAPWQLGKNAKTSRE NQQIRYSLDTPPVPLKTLLPQQDSSAPDAQWRRVTATGQYLPDVQVLARLRVVEGDQA FEVLAPFVVDGGPTVLVDRGYVRPQVGSHVPPIPRLPVQTVTITARLRDSEPSVAGKD PFVRDGFQQVYSINTGQVAALTGVQLAGSYLQLIEDQPGGLGVLGVPHLDPGPFLSYG IQWISFGILAPIGLGYFAYAEIRARRREKAGSPPPDKPMTVEQKLADRYGRRR" gene complement(2508434..2509375) /gene="cobD" /locus_tag="Rv2236c" /db_xref="GeneID:887513" CDS complement(2508434..2509375) /gene="cobD" /locus_tag="Rv2236c" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS, IN THE CONVERSION OF COBYRIC ACID TO COBINAMIDE." /note="CobD; CbiD in Salmonella; converts cobyric acid to cobinamide by the addition of aminopropanol on the F carboxylic group" /codon_start=1 /transl_table=11 /product="cobalamin biosynthesis protein" /protein_id="NP_216752.1" /db_xref="GI:15609373" /db_xref="GOA:Q10518" /db_xref="UniProtKB/Swiss-Prot:Q10518" /db_xref="GeneID:887513" /translation="MFASTWQTRAVGVLIGCLLDVVFGDPKRGHPVALFGRAAAKLEQ ITYRDGRVAGAVHVGLLVGAVGLLGAALQRLPGRSWPVAATATATWAALGGTSLARTG RQISDLLERDDVEAARRLLPSLCGRDPAQLGGPGLTRAALESVAENTADAQVVPLLWA ASSGVPAVLGYRAINTLDSMIGYRSPRYLRFGWAAARLDDWANYVGARATAVLVVICA PVVGGSPRGAVRAWRRDAARHPSPNAGVVEAAFAGALDVRLGGPTRYHHELQIRPTLG DGRSPKVADLRRAVVLSRVVQAGAAVLAVMLVYRRRP" gene 2509489..2510256 /locus_tag="Rv2237" /db_xref="GeneID:887840" CDS 2509489..2510256 /locus_tag="Rv2237" /function="UNKNOWN" /note="Rv2237, (MTCY427.18), len: 255 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis hypothetical proteins Rv0276, Rv0826, Rv1645c. FASTA score: Rv0276 gp|AL021930|MTV035_4 (306 aa) opt: 874, E(): 0; 49.6% identity in 282 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216753.1" /db_xref="GI:15609374" /db_xref="GOA:P64957" /db_xref="UniProtKB/Swiss-Prot:P64957" /db_xref="GeneID:887840" /translation="MLLPAANVIMQLAVPGVGYGVLESPVDSGNVYKHPFKRARTTGT YLAVATIGTESDRALIRGAVDVAHRQVRSTASSPVSYNAFDPKLQLWVAACLYRYFVD QHEFLYGPLEDATADAVYQDAKRLGTTLQVPEGMWPPDRVAFDEYWKRSLDGLQIDAP VREHLRGVASVAFLPWPLRAVAGPFNLFATTGFLAPEFRAMMQLEWSQAQQRRFEWLL SVLRLADRLIPHRAWIFVYQLYLWDMRFRARHGRRIV" gene complement(2510598..2510669) /locus_tag="Rvnt23" /note="tRNA-Val(TAC)" /db_xref="GeneID:2700452" tRNA complement(2510598..2510669) /locus_tag="Rvnt23" /product="tRNA-Val" /note="codon recognized: GUA" /anticodon=(pos:2510635..2510637,aa:Val) /db_xref="GeneID:2700452" gene complement(2510715..2511176) /gene="ahpE" /locus_tag="Rv2238c" /db_xref="GeneID:887871" CDS complement(2510715..2511176) /gene="ahpE" /locus_tag="Rv2238c" /function="detoxification of organic peroxides." /note="Rv2238c, (MTCY427.19c), len: 153 aa. Probable ahpE, peroxiredoxin. Similarity to many members of AHPC/TSA family e.g. sp|Q96291|BAS1_ARATH 2-CYS PEROXIREDOXIN BAS1 PRECURSOR (265 aa). FASTA score: opt: 275, E(): 2.7e-12; 35.0% identity in 143 aa overlap." /codon_start=1 /transl_table=11 /product="peroxiredoxin AhpE" /protein_id="NP_216754.1" /db_xref="GI:15609375" /db_xref="UniProtKB/Swiss-Prot:P65688" /db_xref="GeneID:887871" /translation="MLNVGATAPDFTLRDQNQQLVTLRGYRGAKNVLLVFFPLAFTGI CQGELDQLRDHLPEFENDDSAALAISVGPPPTHKIWATQSGFTFPLLSDFWPHGAVSQ AYGVFNEQAGIANRGTFVVDRSGIIRFAEMKQPGEVRDQRLWTDALAALTA" gene complement(2511176..2511652) /locus_tag="Rv2239c" /db_xref="GeneID:888451" CDS complement(2511176..2511652) /locus_tag="Rv2239c" /function="UNKNOWN" /note="Rv2239c, (MTCY427.20c), len: 158 aa. Conserved hypothetical protein, similar to conserved hypothetical proteins from Mycobacterium leprae (ML1649, 140 aa) and Streptomyces coelicolor A3(2) (SCC8A.28c, 159 aa). Equivalent to ML1649 conserved hypothetical protein (140 aa). FASTA scores: ML1649 conserved hypothetical protein (140 aa) opt: 846, E(): 6.5e-45; 86.429% identity in 140 aa overlap (tr|O69479|O69479 HYPOTHETICAL 15.2 KDA PROTEIN (140 aa); and opt: 447, E(): 1.2e-21; 50.355% identity (51.825% ungapped) in 141 aa overlap. Similarity with ML1649 suggests alternative start at 251198." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216755.1" /db_xref="GI:15609376" /db_xref="UniProtKB/Swiss-Prot:P64959" /db_xref="GeneID:888451" /translation="MPIATVCTWPAETEGGSTVVAADHASNYARKLGIQRDQLIQEWG WDEDTDDDIRAAIEEACGGELLDEDTDEVIDVVLLWWRDGDGDLVDTLMDAIGPLAED GVIWVVTPKTGQPGHVLPAEIAEAAPTAGLMPTSSVNLGNWSASRLVQPKSRAGKR" gene complement(2511690..2512487) /locus_tag="Rv2240c" /db_xref="GeneID:888600" CDS complement(2511690..2512487) /locus_tag="Rv2240c" /function="UNKNOWN" /note="Rv2240c, (MTCY427.21c), len: 265 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216756.1" /db_xref="GI:15609377" /db_xref="GOA:Q10522" /db_xref="UniProtKB/Swiss-Prot:Q10522" /db_xref="GeneID:888600" /translation="MGQIVAGEIGGQRTTPVGGGLPLACCLDGRPPIVPHRRRRRIAA LRSVLRMRDTPRPARSRCDQVTSHAVLIGWRAVPRRHGGELPRRGALALGCIALLLMG IVGCTTVTDGTAMPDTNVAPAYRSSVSASVSASAATSSIRESQRQQSLTTKAIRTSCD ALAATSKDAIDKVNAYVAAFNQGRNTGPTEGPAIDALNNSASTVSGSLSAALSAQLGD ALNAYVDAARAVANAIGAHASTAEFNRRVDRLNDTKTKALTMCVAAF" gene 2512539..2515244 /gene="aceE" /locus_tag="Rv2241" /db_xref="GeneID:887246" CDS 2512539..2515244 /gene="aceE" /locus_tag="Rv2241" /EC_number="1.2.4.1" /function="Involved in energy metabolism; contributes to acetyl-CoA production as part of pyruvate dehydrogenase complex [CATALYTIC ACTIVITY: PYRUVATE + LIPOAMIDE = S-ACETYL-DIHYDRO-LIPOAMIDE + CO(2)]." /experiment="experimental evidence, no additional details recorded" /note="E1 component; part of pyruvate dehydrogenase; forms a complex with DlaT and LpdC" /codon_start=1 /transl_table=11 /product="pyruvate dehydrogenase subunit E1" /protein_id="NP_216757.1" /db_xref="GI:15609378" /db_xref="GOA:Q10504" /db_xref="UniProtKB/Swiss-Prot:Q10504" /db_xref="GeneID:887246" /translation="MASYLPDIDPEETSEWLESFDTLLQRCGPSRARYLMLRLLERAG EQRVAIPALTSTDYVNTIPTELEPWFPGDEDVERRYRAWIRWNAAIMVHRAQRPGVGV GGHISTYASSAALYEVGFNHFFRGKSHPGGGDQVFIQGHASPGIYARAFLEGRLTAEQ LDGFRQEHSHVGGGLPSYPHPRLMPDFWEFPTVSMGLGPLNAIYQARFNHYLHDRGIK DTSDQHVWCFLGDGEMDEPESRGLAHVGALEGLDNLTFVINCNLQRLDGPVRGNGKII QELESFFRGAGWNVIKVVWGREWDALLHADRDGALVNLMNTTPDGDYQTYKANDGGYV RDHFFGRDPRTKALVENMSDQDIWNLKRGGHDYRKVYAAYRAAVDHKGQPTVILAKTI KGYALGKHFEGRNATHQMKKLTLEDLKEFRDTQRIPVSDAQLEENPYLPPYYHPGLNA PEIRYMLDRRRALGGFVPERRTKSKALTLPGRDIYAPLKKGSGHQEVATTMATVRTFK EVLRDKQIGPRIVPIIPDEARTFGMDSWFPSLKIYNRNGQLYTAVDADLMLAYKESEV GQILHEGINEAGSVGSFIAAGTSYATHNEPMIPIYIFYSMFGFQRTGDSFWAAADQMA RGFVLGATAGRTTLTGEGLQHADGHSLLLAATNPAVVAYDPAFAYEIAYIVESGLARM CGENPENIFFYITVYNEPYVQPPEPENFDPEGVLRGIYRYHAATEQRTNKAQILASGV AMPAALRAAQMLAAEWDVAADVWSVTSWGELNRDGVAIETEKLRHPDRPAGVPYVTRA LENARGPVIAVSDWMRAVPEQIRPWVPGTYLTLGTDGFGFSDTRPAARRYFNTDAESQ VVAVLEALAGDGEIDPSVPVAAARQYRIDDVAAAPEQTTDPGPGA" gene 2515304..2516548 /locus_tag="Rv2242" /db_xref="GeneID:888624" CDS 2515304..2516548 /locus_tag="Rv2242" /function="UNKNOWN" /note="Rv2242, (MTCY427.23), len: 414 aa. Conserved hypothetical protein. Equivalent to ML1652 conserved hypothetical protein from Mycobacterium leprae (414 aa), and orthologue in Streptomyces coelicolor A3(2). FASTA scores: ML1652 opt: 2369, E(): 4.2e-128; 88.406% identity in 414 aa overlap (AL023635)(AL583922). some similarity at 3' end with S25203 srmR protein - Streptomyces ambofaciens (604 aa) opt: 188 E(): 9e-05; (26.4% identity in 277 aa overlap) and with SW:YAEG_HAEIN P44509 hypothetical protein HI0093 (42.3% identity in 52 aa overlap). Contains possible helix-turn-helix motif at aa 360-381 (+3.52 SD)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216758.1" /db_xref="GI:15609379" /db_xref="UniProtKB/Swiss-Prot:P63749" /db_xref="GeneID:888624" /translation="MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSA MQERLPFFADLEASQRASVALVVQTAVVNFVEWMHDPHSDVGYTAQAFELVPQDLTRR IALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFTAATAYADAAE ARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAPATVLVGTPAPGPNGSNSD GDSERASQDVRDTAARHGRAALTDVHGTWLVAIVSGQLSPTEKFLKDLLAAFADAPVV IGPTAPMLTAAHRSASEAISGMNAVAGWRGAPRPVLARELLPERALMGDASAIVALHT DVMRPLADAGPTLIETLDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQ PRDAYVLRVAATVGQLNYPTPH" gene 2516787..2517695 /gene="fabD" /locus_tag="Rv2243" /db_xref="GeneID:888769" CDS 2516787..2517695 /gene="fabD" /locus_tag="Rv2243" /EC_number="2.3.1.39" /function="CATALYZES MALONYL-COA-ACP TRANSACYLASE (MCAT) ACTIVITY USING HOLO-ACPM AS SUBSTRATE FOR TRANSACYLATION [CATALYTIC ACTIVITY: MALONYL-CoA + [ACYL-CARRIER PROTEIN] = CoA + MALONYL-[ACYL-CARRIER PROTEIN]]." /experiment="experimental evidence, no additional details recorded" /note="Rv2243, (MTCY427.24), len: 302 aa. fabD (alternate gene name: mtFabD), malonyl CoA-acyl carrier protein transacylase (EC 2.3.1.39) (see citations below), highly similar to e.g. A57356 acyl-CoA carrier protein malonyltransferase from Streptomyces coelicolor (316 aa), FASTA score: opt: 955, E(): 0, (52.6% identity in 304 aa overlap); FABD_HAEIN|P43712 malonyl CoA-acyl carrier protein transacylase from Haemophilus influenzae, FASTA score: (30.5% identity in 308 aa overlap); and FABD_ECOLI|P25715 from Escherichia coli, FASTA score: (31.4% identity in 309 aa overlap).; mtFabD" /codon_start=1 /transl_table=11 /product="acyl-carrier-protein S-malonyltransferase" /protein_id="NP_216759.1" /db_xref="GI:15609380" /db_xref="GOA:P63458" /db_xref="UniProtKB/Swiss-Prot:P63458" /db_xref="GeneID:888769" /translation="MIALLAPGQGSQTEGMLSPWLQLPGAADQIAAWSKAADLDLARL GTTASTEEITDTAVAQPLIVAATLLAHQELARRCVLAGKDVIVAGHSVGEIAAYAIAG VIAADDAVALAATRGAEMAKACATEPTGMSAVLGGDETEVLSRLEQLDLVPANRNAAG QIVAAGRLTALEKLAEDPPAKARVRALGVAGAFHTEFMAPALDGFAAAAANIATADPT ATLLSNRDGKPVTSAAAAMDTLVSQLTQPVRWDLCTATLREHTVTAIVEFPPAGTLSG IAKRELRGVPARAVKSPADLDELANL" gene 2517771..2518118 /gene="acpP" /locus_tag="Rv2244" /db_xref="GeneID:888272" CDS 2517771..2518118 /gene="acpP" /locus_tag="Rv2244" /function="INVOLVED IN FATTY ACID BIOSYNTHESIS (MYCOLIC ACIDS SYNTHESIS); INVOLVED IN MEROMYCOLATE EXTENSION." /experiment="experimental evidence, no additional details recorded" /note="carries the fatty acid chain in fatty acid biosynthesis" /codon_start=1 /transl_table=11 /product="acyl carrier protein" /protein_id="NP_216760.1" /db_xref="GI:15609381" /db_xref="GOA:Q10500" /db_xref="UniProtKB/Swiss-Prot:Q10500" /db_xref="GeneID:888272" /translation="MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSM VEIAVQTEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENPDA VANVQARLEAESK" gene 2518115..2519365 /gene="kasA" /locus_tag="Rv2245" /db_xref="GeneID:887269" CDS 2518115..2519365 /gene="kasA" /locus_tag="Rv2245" /EC_number="2.3.1.41" /function="INVOLVED IN FATTY ACID BIOSYNTHESIS (MYCOLIC ACIDS SYNTHESIS); INVOLVED IN MEROMYCOLATE EXTENSION. CATALYZES THE CONDENSATION REACTION OF FATTY ACID SYNTHESIS BY THE ADDITION TO AN ACYL ACCEPTOR OF TWO CARBONS FROM MALONYL-ACP [CATALYTIC ACTIVITY: ACYL-[ACYL-CARRIER PROTEIN] + MALONYL-[ACYL-CARRIER PROTEIN] = 3-OXOACYL-[ACYL-CARRIER PROTEIN] + [ACYL-CARRIER PROTEIN] + CO(2)]." /experiment="experimental evidence, no additional details recorded" /note="FabF; beta-ketoacyl-ACP synthase II, KASII; catalyzes a condensation reaction in fatty acid biosynthesis: addition of an acyl acceptor of two carbons from malonyl-ACP; required for the elongation of short-chain unsaturated acyl-ACP" /codon_start=1 /transl_table=11 /product="3-oxoacyl-(acyl carrier protein) synthase II" /protein_id="NP_216761.1" /db_xref="GI:15609382" /db_xref="GOA:P63454" /db_xref="UniProtKB/Swiss-Prot:P63454" /db_xref="GeneID:887269" /translation="MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIH ALEDEFVTKWDLAVKIGGHLKDPVDSHMGRLDMRRMSYVQRMGKLLGGQLWESAGSPE VDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGAAAVIGLQLGA RAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEGPIEALPIAAFSMMRAMST RNDEPERASRPFDKDRDGFVFGEAGALMLIETEEHAKARGAKPLARLLGAGITSDAFH MVAPAADGVRAGRAMTRSLELAGLSPADIDHVNAHGTATPIGDAAEANAIRVAGCDQA AVYAPKSALGHSIGAVGALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYG DYRYAVNNSFGFGGHNVALAFGRY" gene 2519396..2520712 /gene="kasB" /locus_tag="Rv2246" /db_xref="GeneID:887539" CDS 2519396..2520712 /gene="kasB" /locus_tag="Rv2246" /EC_number="2.3.1.179" /function="INVOLVED IN FATTY ACID BIOSYNTHESIS (MYCOLIC ACIDS SYNTHESIS); INVOLVED IN MEROMYCOLATE EXTENSION. CATALYZES THE CONDENSATION REACTION OF FATTY ACID SYNTHESIS BY THE ADDITION TO AN ACYL ACCEPTOR OF TWO CARBONS FROM MALONYL-ACP [CATALYTIC ACTIVITY: ACYL-[ACYL-CARRIER PROTEIN] + MALONYL-[ACYL-CARRIER PROTEIN] = 3-OXOACYL-[ACYL-CARRIER PROTEIN] + [ACYL-CARRIER PROTEIN] + CO(2)]." /experiment="experimental evidence, no additional details recorded" /note="FabF; beta-ketoacyl-ACP synthase II, KASII; catalyzes a condensation reaction in fatty acid biosynthesis: addition of an acyl acceptor of two carbons from malonyl-ACP; required for the elongation of short-chain unsaturated acyl-ACP" /codon_start=1 /transl_table=11 /product="3-oxoacyl-(acyl carrier protein) synthase II" /protein_id="NP_216762.1" /db_xref="GI:15609383" /db_xref="GOA:P63456" /db_xref="UniProtKB/Swiss-Prot:P63456" /db_xref="GeneID:887539" /translation="MGVPPLAGASRTDMEGTFARPMTELVTGKAFPYVVVTGIAMTTA LATDAETTWKLLLDRQSGIRTLDDPFVEEFDLPVRIGGHLLEEFDHQLTRIELRRMGY LQRMSTVLSRRLWENAGSPEVDTNRLMVSIGTGLGSAEELVFSYDDMRARGMKAVSPL TVQKYMPNGAAAAVGLERHAKAGVMTPVSACASGAEAIARAWQQIVLGEADAAICGGV ETRIEAVPIAGFAQMRIVMSTNNDDPAGACRPFDRDRDGFVFGEGGALLLIETEEHAK ARGANILARIMGASITSDGFHMVAPDPNGERAGHAITRAIQLAGLAPGDIDHVNAHAT GTQVGDLAEGRAINNALGGNRPAVYAPKSALGHSVGAVGAVESILTVLALRDQVIPPT LNLVNLDPEIDLDVVAGEPRPGNYRYAINNSFGFGGHNVAIAFGRY" gene 2520743..2522164 /gene="accD6" /locus_tag="Rv2247" /db_xref="GeneID:887671" CDS 2520743..2522164 /gene="accD6" /locus_tag="Rv2247" /EC_number="6.4.1.3" /function="INVOLVED IN FATTY ACID BIOSYNTHESIS (MYCOLIC ACIDS SYNTHESIS) [CATALYTIC ACTIVITY: ATP + PROPIONYL-COA + CO(2) + H(2)O = ADP + ORTHOPHOSPHATE + METHYLMALONYL-COA]." /experiment="experimental evidence, no additional details recorded" /note="Rv2247, (MTCY427.28), len: 473 aa. accD6, Acetyl/Propionyl CoA Carboxylase, beta subunit (EC 6.4.1.3) (see citations below), highly similar to e.g. PCCB_RHOSO|Q06101 propionyl-CoA carboxylase beta chain, FASTA score: (75.1% identity in 437 aa overlap). Similar to many other Acetyl/Propionyl CoA Carboxylases from Mycobacterium tuberculosis. BELONGS TO THE ACCD / PCCB FAMILY." /codon_start=1 /transl_table=11 /product="acetyl/propionyl-CoA carboxylase beta subunit AccD6" /protein_id="NP_216763.1" /db_xref="GI:15609384" /db_xref="GOA:P63407" /db_xref="UniProtKB/Swiss-Prot:P63407" /db_xref="GeneID:887671" /translation="MTIMAPEAVGESLDPRDPLLRLSNFFDDGSVELLHERDRSGVLA AAGTVNGVRTIAFCTDGTVMGGAMGVEGCTHIVNAYDTAIEDQSPIVGIWHSGGARLA EGVRALHAVGQVFEAMIRASGYIPQISVVVGFAAGGAAYGPALTDVVVMAPESRVFVT GPDVVRSVTGEDVDMASLGGPETHHKKSGVCHIVADDELDAYDRGRRLVGLFCQQGHF DRSKAEAGDTDIHALLPESSRRAYDVRPIVTAILDADTPFDEFQANWAPSMVVGLGRL SGRTVGVLANNPLRLGGCLNSESAEKAARFVRLCDAFGIPLVVVVDVPGYLPGVDQEW GGVVRRGAKLLHAFGECTVPRVTLVTRKTYGGAYIAMNSRSLNATKVFAWPDAEVAVM GAKAAVGILHKKKLAAAPEHEREALHDQLAAEHERIAGGVDSALDIGVVDEKIDPAHT RSKLTEALAQAPARRGRHKNIPL" repeat_region 2522173..2522230 /note="58 bp inverted repeat near 3'end of MTCY427.28" gene 2522360..2523175 /locus_tag="Rv2248" /db_xref="GeneID:888603" CDS 2522360..2523175 /locus_tag="Rv2248" /function="UNKNOWN" /note="Rv2248, (MTCY427.29), len: 271 aa. Conserved hypothetical protein. Very similar to hypothetical M. tuberculosis proteins Rv3517, Rv1482c, Rv3555c, Rv3714c, Rv1073. FASTA score: MTCY06G11.02c MTCY6G11 NID: g1877284 - (289 aa) opt: 366 E(): 5.3e-18; (32.1% identity in 249 aa overlap). Some similarity to Mycobacterium avium protein AF002133|AF0021 339 AF002133 NID: g2183254 (346 aa) opt: 308 E(): 5.2e-14; (28.3% identity in 254 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216764.1" /db_xref="GI:15609385" /db_xref="UniProtKB/Swiss-Prot:Q10526" /db_xref="GeneID:888603" /translation="MTRQQLDVQVKNGGLVRVWYGVYAAQEPDLLGRLAALDVFMGGH AVACLGTAAALYGFDTENTVAIHMLDPGVRMRPTVGLMVHQRVGARLQRVSGRLATAP AWTAVEVARQLRRPRALATLDAALRSMRCARSEIENAVAEQRGRRGIVAARELLPFAD GRAESAMESEARLVMIDHGLPLPELQYPIHGHGGEMWRVDFAWPDMRLAAEYESIEWH AGPAEMLRDKTRWAKLQELGWTIVPIVVDDVRREPGRLAARIARHLDRARMAG" repeat_region 2523184..2523236 /note="53 bp inverted repeat between 3' ends of MTCY427.29 and MT CY427.31c" gene complement(2523241..2524791) /gene="glpD1" /locus_tag="Rv2249c" /db_xref="GeneID:887276" CDS complement(2523241..2524791) /gene="glpD1" /locus_tag="Rv2249c" /EC_number="1.1.5.3" /function="INVOLVED IN AEROBIC RESPIRATION AND OXYDATION OF GLYCEROL. REDUCES AN ACCEPTOR AND GENERATES GLYCERONE PHOSPHATE FROM Sn-GLYCEROL 3-PHOSPHATE. POSSIBLY PLAY A ROLE IN METABOLISM OF RIBOFLAVIN, FAD,FMN [CATALYTIC ACTIVITY: SN-GLYCEROL 3-PHOSPHATE + ACCEPTOR = GLYCERONE PHOSPHATE + REDUCED ACCEPTOR]." /note="Rv2249c, (MTCY427.31c), len: 516 aa. Probable glpD1, glycerol-3-phosphate dehydrogenase (EC 1.1.99.5), similar to SW:GLPD_ECOLI P13035 aerobic glycerol-3-phosphate dehydrogenase (30.0% identity in 486 aa overlap) and SW:GLPA_ECOLI P13032 anaerobic glycerol-3-phosphate dehydrogenase (28.2% identity in 504 aa overlap). Also similar to Rv3302c|glpD2 glycerol-3-phosphate dehydrogenase. COFACTOR: FAD (BY SIMILARITY). BELONGS TO THE FAD-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="glycerol-3-phosphate dehydrogenase" /protein_id="NP_216765.1" /db_xref="GI:15609386" /db_xref="GOA:P64182" /db_xref="UniProtKB/Swiss-Prot:P64182" /db_xref="GeneID:887276" /translation="MLMPHSAALNAARRSADLTALADGGALDVIVIGGGITGVGIALD AATRGLTVALVEKHDLAFGTSRWSSKLVHGGLRYLASGNVGIARRSAVERGILMTRNA PHLVHAMPQLVPLLPSMGHTKRALVRAGFLAGDALRVLAGTPAATLPRSRRIPASRVV EIAPTVRRDGLDGGLLAYDGQLIDDARLVMAVARTAAQHGARILTYVGASNVTGTSVE LTDRRTRQSFALSARAVINAAGVWAGEIDPSLRLRPSRGTHLVFDAKSFANPTAALTI PIPGELNRFVFAMPEQLGRIYLGLTDEDAPGPIPDVPQPSSEEITFLLDTVNTALGTA VGTKDVIGAYAGLRPLIDTGGAGVQGRTADVSRDHAVFESPSGVISVVGGKLTEYRYM AEDVLNRAITLRHLRAAKCRTRNLPLIGAPANPGPAPGSGAGLPESLVARYGAEAANV AAAATCERPTEPVADGIDVTRAEFEYAVTHEGALDVDDILDRRTRIGLVPRDRERVVA VAKEFLSR" gene complement(2524785..2525354) /locus_tag="Rv2250c" /db_xref="GeneID:888018" CDS complement(2524785..2525354) /locus_tag="Rv2250c" /function="Possibly involved in transcriptional regulation" /note="Rv2250c, (MTCY427.32c), len: 189 aa. Possible transcriptional regulatory protein, TetR family. Start unclear; ORF has been shortened since first submission to avoid overlap with Rv2251 (-30 aa). Contains probable helix-turn-helix motif (Score 2243, +6.70 SD)" /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="NP_216766.2" /db_xref="GI:57116959" /db_xref="GOA:Q10528" /db_xref="UniProtKB/Swiss-Prot:Q10528" /db_xref="GeneID:888018" /translation="MLSMSNDRADTGGRILRAAASCVVDYGVDRVTLAEIARRAGVSR PTVYRRWPDTRSIMASMLTSHIADVLREVPLDGDDREALVKQIVAVADRLRGDDLIMS VMHSELARVYITERLGTSQQVLIEGLAARLTVAQRSGSVRSGDARRLATMVLLIAQST IQSADIVDSILDSAALATELTHALNGYLC" gene 2525402..2525821 /locus_tag="Rv2250A" /db_xref="GeneID:3205100" CDS 2525402..2525821 /locus_tag="Rv2250A" /function="ELECTRON ACCEPTOR" /note="Rv2250A, len: 139 aa. Conserved hypothetical protein, possibly flavoprotein. Similar to N-terminus of SCF91.28c|AL132973_28 possible flavoprotein from Streptomyces coelicolor (530 aa), FASTA scores: opt: 240, E(): 1.1e-07, (39.25% identity in 107 aa overlap). Possible frameshift between nt 2525723 to 2525727. The sequences of CDC 1551 and Mycobacterium bovis are missing a single G base." /codon_start=1 /transl_table=11 /product="flavoprotein" /protein_id="YP_177662.1" /db_xref="GI:57116960" /db_xref="UniProtKB/TrEMBL:Q79FG6" /db_xref="GeneID:3205100" /translation="MKWDAWGDPAAAKPLSDGVRSLLKQVVGLADSEQPELDPAQVQL RPSALSGADHDALARIVGTEYFRTADRDRLLHAGGKSTPDLLRRKDTGVQDAPDAVLL PGGPNGGGRRRRHLALLLRPRHCRGPVWWRHQRRWWA" gene 2525565..2526992 /locus_tag="Rv2251" /db_xref="GeneID:888706" CDS 2525565..2526992 /locus_tag="Rv2251" /function="ELECTRON ACCEPTOR" /note="Rv2251, (MTV022.01), len: 475 aa. Possible flavoprotein, probably continuation of Rv2250A, similar to MTCY164.18 from Mycobacterium tuberculosis and to several ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASES (e.g. O00116). Also some similarity to D-lactate dehydrogenases. FASTA scores: sptr|O05784|O05784 HYPOTHETICAL 56.5 kDa PROTEIN. (527 aa) opt: 1019 E(): 0; (38.6% identity in 487 aa overlap) and sp|O00116|ADAS_HUMAN ALKYLDIHYDROXYACETON EPHOSPHATE SYNTHASE PRECURSOR (EC 2.5.1.26) (658 aa) opt: 558 E(): 6.2e-27; (31.3% identity in 447 aa overlap). TBparse score is 0.902" /codon_start=1 /transl_table=11 /product="flavoprotein" /protein_id="NP_216767.1" /db_xref="GI:15609388" /db_xref="GOA:O53525" /db_xref="UniProtKB/TrEMBL:O53525" /db_xref="GeneID:888706" /translation="MRWRASSAPSISAPPIATGCCTPAASPPQTCCGAKTPVSRMRPT RCCCPAAPTGEDAVADILHYCSDHGIAVVPFGGGTSVVGGLDPVRNDFRAVISLDMRR FDRLHRIDEVSGEAELEAGVTGPEAERLLGEHGFSLGHFPQSFEFATIGGFAATRSSG QDSAGYGRFNDMILGLRMITPVGVLDLGRVPASAAGPDLRQLAIGSEGVFGVITRVRL RVHRIPESTRYEAWSFPDFATGVAALRTITQTGTGPTVVRLSDEAETGVNLATTEAIG ETQITGGCLGITVFEGTQEHTESRHAETRALLAARGGTSLGEGPARAWERGRFAAPYL RDSLLAAGALCETLETATVWSNTPVLKAAVTEALTTSLAASGTPALVMCHVSHVYPTG ASLYFTVVAGQRGDPIEQWLAAKKAASDAIMATGGTITHHHAVGSDHRPWMRAEVGDL GVTLLRTIKATLDPAGILNPGKLIP" gene 2526989..2527918 /locus_tag="Rv2252" /db_xref="GeneID:888429" CDS 2526989..2527918 /locus_tag="Rv2252" /function="UNKNOWN" /note="involved in the biosynthesis of phosphatidylinositol mannosides (PIMs); the enzyme from Mycobacterium tuberculosis can phosphorylate a variety of amphipathic lipids" /codon_start=1 /transl_table=11 /product="diacylglycerol kinase" /protein_id="NP_216768.1" /db_xref="GI:15609389" /db_xref="GOA:O53526" /db_xref="UniProtKB/TrEMBL:O53526" /db_xref="GeneID:888429" /translation="MSAGQLRRHEIGKVTALTNPLSGHGAAVKAAHGAIARLKHRGVD VVEIVGGDAHDARHLLAAAVAKGTDAVMVTGGDGVVSNALQVLAGTDIPLGIIPAGTG NDHAREFGLPTKNPKAAADIVVDGWTETIDLGRIQDDNGIEKWFGTVAATGFDSLVND RANRMRWPHGRMRYYIAMLAELSRLRPLPFRLVLDGTEEIVADLTLADFGNTRSYGGG LLICPNADHSDGLLDITMAQSDSRTKLLRLFPTIFKGAHVELDEVSTTRAKTVHVECP GINVYADGDFACPLPAEISAVPAALQVLRPRHG" gene 2527984..2528487 /locus_tag="Rv2253" /db_xref="GeneID:888287" CDS 2527984..2528487 /locus_tag="Rv2253" /function="UNKNOWN" /note="Rv2253, (MTV022.03), len: 167 aa. Possible secreted protein; has potential N-terminal signal peptide. TBparse score is 0.945." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216769.1" /db_xref="GI:15609390" /db_xref="UniProtKB/TrEMBL:O53527" /db_xref="GeneID:888287" /translation="MSGHRKKAMLALAAASLAATLAPNAVAAAEPSWNGQYLVTLSAN AKTGTSMAANRPEYPHKANYTFSSRCASDVCIATVVDAPPPKNEFIPRPIEYTWNGTQ WVREISWQWDCLLPDGTIEYAPAKSITAYTPGQYGILTGVFHTDIASGTCKGNVDMPV SAKPIVG" gene complement(2528520..2528975) /locus_tag="Rv2254c" /db_xref="GeneID:887182" CDS complement(2528520..2528975) /locus_tag="Rv2254c" /function="UNKNOWN" /note="Rv2254c, (MTV022.04c), len: 151 aa. Probable integral membrane protein. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216770.1" /db_xref="GI:15609391" /db_xref="GOA:O53528" /db_xref="UniProtKB/TrEMBL:O53528" /db_xref="GeneID:887182" /translation="MRYRDLETVAAPTINVLRVWPEIVGAIVLLVIAAMGIGHGLRPS PEPVPAPQKQLGCVRFALIFGLTAINPATFVYFTAVAVTLARALRATTAIAVVVGVAL ASLLWQLLLVSAGAFLRSRATARVRRMTVLAGNAVIAAFGAVLVVHAFA" gene complement(2528980..2529174) /locus_tag="Rv2255c" /db_xref="GeneID:885526" CDS complement(2528980..2529174) /locus_tag="Rv2255c" /function="UNKNOWN" /note="Rv2255c, (MTV022.05c), len: 64 aa. Hypothetical unknown protein. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216771.1" /db_xref="GI:15609392" /db_xref="UniProtKB/TrEMBL:O53529" /db_xref="GeneID:885526" /translation="MDGIVDRGVRARPCQKVVAVLRRSKSHIDKRLDAATGNAFLGKQ VLSAAGVVEYRPPRRSPLST" gene complement(2529341..2529874) /locus_tag="Rv2256c" /db_xref="GeneID:885865" CDS complement(2529341..2529874) /locus_tag="Rv2256c" /function="UNKNOWN" /note="Rv2256c, (MTV022.06c), len: 177 aa. Conserved hypothetical protein, similar to Streptomyces glaucescens ORF5 (164 aa) and Streptomyces coelicolor hypothetical protein SC4A7.19c (164 aa; emb|CAB62723.1|AL133423). FASTA scores: sptr|Q54209|Q54209 FABD, FABH, FABC, FABB, AND ORF5 (164 aa) opt: 504, E(): 3.9e-27; (44.4% identity in 162 aa overlap). TBparse score is 0.900" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216772.1" /db_xref="GI:15609393" /db_xref="UniProtKB/TrEMBL:O53530" /db_xref="GeneID:885865" /translation="MEPKEQQMRASNQFADVTSGVVYIHASPAAVCPHVEWALSSTLQ AKANLVWTPQPALPPQLRAVTNWVGPVGTGARLANALRSWSVLRFEVTEDPSPGVDGQ RFSHTPQLGLWSGAMSANGDIMVGEMRLRAMMAQGADTLAAELDSVLGTAWDQALEVY RDGGDAGEVTWLSRGVG" gene complement(2530004..2530822) /locus_tag="Rv2257c" /db_xref="GeneID:888069" CDS complement(2530004..2530822) /locus_tag="Rv2257c" /function="UNKNOWN" /note="Rv2257c, (MTV022.07c), len: 272 aa. Conserved hypothetical protein, similar to hypothetical protein SC4A7.08 from Streptomyces coelicolor (273 aa; 58% identity in 243 aa overlap). Also similar to several putative esterases and penicillin-binding proteins in M. tuberculosis e.g. Rv1923, Rv1497, Rv2463, Rv3775, Rv1922, Rv1730c. TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216773.1" /db_xref="GI:15609394" /db_xref="GOA:O53531" /db_xref="UniProtKB/TrEMBL:O53531" /db_xref="GeneID:888069" /translation="MTALEVLGGWPVPAAAAAVIGPAGVLATHGDTARVFALASVTKP LVARAAQVAVEEGVVNLDTPAGPPGSTVRHLLAHTSGLAMHSDQALARPGTRRMYSNY GFTVLAESVQRESGIEFGRYLTEAVCEPLGMVTTRLDGGPAAAGFGATSTVADLAVFA GDLLRPSTVSAQMHADATTVQFPGLDGVLPGYGVQRPNDWGLGFEIRNSKSPHWTGEC NSTRTFGHFGQSGGFIWVDPKADLALVVLTARDFGDWALDLWPAISDAVLAEYT" gene complement(2530836..2531897) /locus_tag="Rv2258c" /db_xref="GeneID:888755" CDS complement(2530836..2531897) /locus_tag="Rv2258c" /function="Possibly involved in transcriptional regulation" /note="Rv2258c, (MTV022.08c), len: 353 aa. Possible transcriptional regulatory protein, similar to several hypothetical proteins from C. elegans. FASTA scores: sptr|O01593|O01593 CODED FOR BY C. ELEGANS CDNA YK102 F (365 aa) opt: 577, E(): 6.4e-31; (30.5% identity in 341 aa overlap). Contains possible helix-turn helix motif at aa 47-68 (+3.65 SD)" /codon_start=1 /transl_table=11 /product="transcriptional regulator" /protein_id="NP_216774.1" /db_xref="GI:15609395" /db_xref="GOA:O53532" /db_xref="UniProtKB/TrEMBL:O53532" /db_xref="GeneID:888755" /translation="MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLP PATSMEIAEAAGLEERYVREWLGGMTTGQIVEYDAGSSTYSLPAHRAGMLTRAAGPDN LAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVFDAALIDVVLP LVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTGIDFSDEAVAAGTEEAARL GLANATFERHDLAELDKVGAYDVITVFDAIHDQAQPARVLQNIYRALRPGGVLLMVDI KASSQLEDNVGVPLSTYLYTTSLMHCMTVSLALDGAGLGTVWGRQLATSMLADAGFTD VTVAEIESDVLNNYYIARK" repeat_region complement(2531898..2531950) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(2531951..2532003) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(2532004..2532056) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(2532057..2532109) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(2532110..2532162) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(2532163..2532212) /note="50 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 2532245..2533330 /gene="adhE2" /locus_tag="Rv2259" /db_xref="GeneID:887215" CDS 2532245..2533330 /gene="adhE2" /locus_tag="Rv2259" /EC_number="1.2.1.-" /function="oxido-reduction" /note="Rv2259, (MTV022.09), len: 361 aa. Probable adhE2, zinc-containing alcohol dehydrogenase, similar to several, especially mycothiol-dependent formaldehyde dehydrogenase from Amycolatopsis methanolica P80094 (360 aa). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. FASTA scores: >sp|P80094|FADH_AMYME NAD/MYCOTHIOL-DEPENDENT FORMALDEHYDE DEHYDROGENASE (MD-FALDH) Length = 360, Expect = e-156, Identities = 268/358 (74%). TBparse score is 0.882. Also similar to Rv0162c, (MTCI28.02c, 35.0% identity in 371 aa overlap)." /codon_start=1 /transl_table=11 /product="zinc-dependent alcohol dehydrogenase AdhE2" /protein_id="NP_216775.1" /db_xref="GI:15609396" /db_xref="GOA:O53533" /db_xref="UniProtKB/TrEMBL:O53533" /db_xref="GeneID:887215" /translation="MSQTVRGVIARQKGEPVELVNIVVPDPGPGEAVVDVTACGVCHT DLTYREGGINDEYPFLLGHEAAGIIEAVGPGVTAVEPGDFVILNWRAVCGQCRACKRG RPRYCFDTFNAEQKMTLTDGTELTAALGIGAFADKTLVHSGQCTKVDPAADPAVAGLL GCGVMAGLGAAINTGGVTRDDTVAVIGCGGVGDAAIAGAALVGAKRIIAVDTDDTKLD WARTFGATHTVNAREVDVVQAIGGLTDGFGADVVIDAVGRPETYQQAFYARDLAGTVV LVGVPTPDMRLDMPLVDFFSHGGALKSSWYGDCLPESDFPTLIDLYLQGRLPLQRFVS ERIGLEDVEEAFHKMHGGKVLRSVVML" misc_feature 2532428..2532472 /gene="adhE2" /locus_tag="Rv2259" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene 2533330..2533965 /locus_tag="Rv2260" /db_xref="GeneID:888490" CDS 2533330..2533965 /locus_tag="Rv2260" /function="UNKNOWN" /note="Rv2260, (MTV022.10), len: 211 aa. Conserved hypothetical protein, similar to hypothetical proteins Rv0634c, Rv1637c, Rv3677c, Rv2581c from Mycobacterium tuberculosis and to various hydrolases. FASTA scores: sptr|O06154|O06154 HYPOTHETICAL 21.3 kDa PROTEIN (200 aa) opt: 355, E(): 4e- 15; (37.4% identity in 198 aa overlap). TBparse score is 0.901" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216776.1" /db_xref="GI:15609397" /db_xref="UniProtKB/TrEMBL:O53534" /db_xref="GeneID:888490" /translation="MAAIERVITHGTFELDGGSWEVDNNIWLVGDDSEVVVFDAAHHA APIIDAVGGRKVVAVICTHGHNDHVTVAPELGTALDAPVLMHPGDAVLWRMTHPDKSF RAVSDGDAVRVGGTELRALHTPGHSPGSVCWYAPELGPGTGTVFSGDTLFAGGPGATG RSYSDFPTILRSISGRLGALPGDTVVHTGHGDSTTIGDEIVHYEEWVARGH" gene complement(2534042..2534464) /locus_tag="Rv2261c" /db_xref="GeneID:887455" CDS complement(2534042..2534464) /locus_tag="Rv2261c" /function="Function unknown; thought to be involved in lipid metabolism." /note="Rv2261c, (MTV022.11c), len: 140 aa. Conserved hypothetical protein, with function unknown but some similarity to C-terminal end of PCC6803 apolipoprotein N-acyltransferase from Synechocystis sp. Note that next ORF shows similarity to N-terminal part of P74055 APOLIPOPROTEIN N-ACYLTRANSFERASE from Escherichia coli (519 aa), FASTA scores: opt: 142, E(): 0.007, (29.9% identity in 117 aa overlap), suggesting possible frameshift. Sequence of clones from two sources has been checked but no error found. TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216777.1" /db_xref="GI:15609398" /db_xref="GeneID:887455" /translation="MHIAPLISYEMTFSDLTRHAARLGAALLVYQSSTSTFQGSWAQP QLAAQPAVRAVEAGIPAVHASLSGDSSAFDTRGRRLAWCSAEFNGAIVVNVPLASNVT LYLRLGDWVPVTAFVVMGAGFAVFLRRSLARVSDCADK" gene complement(2534470..2535552) /locus_tag="Rv2262c" /db_xref="GeneID:887220" CDS complement(2534470..2535552) /locus_tag="Rv2262c" /function="Function unknown; thought to be involved in lipid metabolism." /note="Rv2262c, (MTV022.12c), len: 360 aa. Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|CUTE|B0657 APOLIPOPROTEIN N-ACYLTRANSFERASE (EC 2.3.1.-) from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity in 359 aa overlap). Note that neighboring ORF shows similarity to N -terminal part of PCC6803 apolipoprotein N-acyltransferase from Synechocystis sp., suggesting possibility of frameshift. Sequence of clones from two sources has been checked but no error found. Appear to be two extra bases at position 1876970 compared to CDC1551 strain. TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216778.1" /db_xref="GI:15609399" /db_xref="GOA:O53536" /db_xref="UniProtKB/TrEMBL:O53536" /db_xref="GeneID:887220" /translation="MALRAGARRQPVIGCAAALVFGGLPALAFPAPSWWWLAWFGLVP LLLVVRAAPTSWEGALRAWTGMGGFVLATQYWLVTSAGPMLVLLAAGLGVLWLPAGWL AHRLLSVPVTTCRVGAALVVVPSAWVAAEAVRSWQSLGGPWALLGASQWSQPVTLASA SLGGVWLTSFLLVATNTAIASVLVCRATGGRLVALGCVIGCAGLGPASYLLGSVPVGG PTVRVALVQAGDIADAAARLAAGEEFTAAVADQRPDLVVWGESSVGQDLTRHPDVLAR LAELSQRVGADLLVNVDAPAPDGGIYKSAVLVGAHEAVGSYRKTRLVPFGEYVLRCAR FSAGSPATARPPQRIGSAAPGRWCWR" gene 2535641..2536594 /locus_tag="Rv2263" /db_xref="GeneID:887788" CDS 2535641..2536594 /locus_tag="Rv2263" /function="oxidoreduction" /note="Rv2263, (MTV022.13), len: 317 aa. Possible oxidoreductase (EC 1.-.-.-), similar to several oxidoreductases. Similarity suggests alternative GTG start at 10154 but then no rbs. FASTA scores: sptr|Q544 05|Q54405 PROBABLY AN NADP-DEPENDENT OXIDOREDUCTASE (297 aa) opt: 487, E(): 1.1e-23; (36.1% identity in 299 aa overlap). Also similar to Mycobacterium tuberculosis Rv0068, and Rv0439c. TBparse score is 0.889" /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_216779.1" /db_xref="GI:15609400" /db_xref="GOA:O53537" /db_xref="UniProtKB/TrEMBL:O53537" /db_xref="GeneID:887788" /translation="MAKDLVATVPDLSGKLAIITGANSGLGFGLARRLSAAGADVIMA IRNRAKGEAAVEEIRTAVPDAKLTIKALDLSSLASVAALGEQLMADGRPIDLLINNAG VMTPPERVTTADGFELQFGSNHLGHFALTAHLLPLLRAAQRARVVSLSSLAARRGRIH FDDLQFERSYAPMTAYGQSKLAVLMFARELDRRSRAAGWGIISNAAHPGLTKTNLQIA GPSHGRDKPALMERLYKTSWRFAPFLWQEIEEGILPALYAAATPQADGGAFYGPRGRY EVAGGGVREAKVPAAARNDADSKRLWEVSEQLTGVSYPKSR" gene complement(2536572..2538350) /locus_tag="Rv2264c" /db_xref="GeneID:888040" CDS complement(2536572..2538350) /locus_tag="Rv2264c" /function="UNKNOWN" /note="Rv2264c, (MTV022.14c), len: 592 aa. Conserved hypothetical Pro-rich protein, similar to hypothetical proteins Rv0312 (MTCY63.17, 620 aa and Rv0350) that has highly Pro-, Thr-rich C-terminus. Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. FASTA scores: Z96800|MTCY63_17 Mycobacterium tuberculosis cosmid (620 aa) opt: 1075, E(): 8.8e-24; (38.9% identity in 627 aa overlap). TBparse score is 0.919" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216780.1" /db_xref="GI:15609401" /db_xref="GOA:O53538" /db_xref="UniProtKB/TrEMBL:O53538" /db_xref="GeneID:888040" /translation="MATGARPALGLSIGVTNLAAVAADHSITRKPVLTLYRQRPPEVG VPSENPRLDEPGLVITDFVDRVGDSVGIVAADGSVYRSEALVADALLALAYTATGGRA LPGSVTVTYPAHWGPAAVAALDSALRRASEWSHGTSSTAQPLSLLPDAAAALYAIRAD PGIPARGIVAVCDFGGSGTGITLVDAADEYRPVAATVRHQAFSGDLIDQSLLSYVMSE LPGTGAFDPAGTSAIGSLTKLRIECRKAKERLSSSTVTTLTDALGGDIRLTRNELEDT IRDSLDSVGRALEQTLARSGIRTAELVAIVSVGGGANIPAVTTTLSGRFCVPVVRTPR PQLTAAFGGALWAARRPGDTSATVLTAVTSATATAPADAPASVLQPALAWSEADEDSH IGPAPGYTAARPSLSFDHDAHAEPEPKSPPIPWYRLPAVIITGTTVAVLLVGAAVAIG LSTGDQPTAPGTPQRPGVTTTAAPPPSPAPASDGPTTEPAPPVQAPATGGPAPPLQQP LPPPPTTTNTQPAVTTDVITPAPTTPASAPPATTQPPATTQPPATTSPSPPPIPPIPP IPEIPQLPPGIPQVPGIGQFSAISGS" misc_feature complement(2537679..2537696) /locus_tag="Rv2264c" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene 2538700..2539929 /locus_tag="Rv2265" /db_xref="GeneID:887976" CDS 2538700..2539929 /locus_tag="Rv2265" /function="UNKNOWN" /note="Rv2265, (MTCY339.45c), len: 409 aa. Possible conserved integral membrane protein, with some similarity to others e.g. M. thermoauto. sp|O26855|O26855 CONSERVED PROTEIN (383 aa), FASTA score: opt: 898 z-score: 1023.5 E(): 0; 38.0% identity in 384 aa overlap; Q58713 HYPOTHETICAL 44.1 kDa PROTEIN 1 317 (398 aa), FASTA scores, opt: 305 E(): 1.2e-11; 22.8% identity in 382 aa overlap; also KGTP_ECOLI P17448 alpha-ketoglutarate permease (432 aa), FASTA scores, opt: 156, E(): 0.006, (24.8% identity in 416 aa overlap)" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216781.1" /db_xref="GI:15609402" /db_xref="GOA:P64961" /db_xref="UniProtKB/Swiss-Prot:P64961" /db_xref="GeneID:887976" /translation="MGANGDVALSRIGATRPALSAWRFVTVFGVVGLLADVVYEGARS ITGPLLASLGATGLVVGVVTGVGEAAALGLRLVSGPLADRSRRFWAWTIAGYTLTVVT VPLLGIAGALWVACALVIAERVGKAVRGPAKDTLLSHAASVTGRGRGFAVHEALDQVG AMIGPLTVAGMLAITGNAYAPALGVLTLPGGAALALLLWLQRRVPRPESYEDCPVVLG NPSAPRPWALPAQFWLYCGFTAITMLGFGTFGLLSFHMVSHGVLAAAMVPVVYAAAMA ADALTALASGFSYDRYGAKTLAVLPILSILVVLFAFTDNVTMVVIGTLVWGAAVGIQE STLRGVVADLVASPRRASAYGVFAAGLGAATAGGGALIGWLYDISIGTLVVVVIALEL MALVMMFAIRLPRVAPS" gene 2540104..2541390 /gene="cyp124" /locus_tag="Rv2266" /db_xref="GeneID:887763" CDS 2540104..2541390 /gene="cyp124" /locus_tag="Rv2266" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv2266, (MT2328, MTCY339.44c), len: 428 aa. Probable cyp124, cytochrome P450 (EC 1.14.-.-), similar to e.g. G405543 cytochrome P450 (406 aa), FASTA scores, opt: 763,E(): 0, (35.4% identity in 393 aa overlap), similar to e.g. MTCY50.26, 33.8% identity in 370 aa overlap" /codon_start=1 /transl_table=11 /product="cytochrome P450 124 CYP124" /protein_id="NP_216782.1" /db_xref="GI:15609403" /db_xref="GOA:Q50696" /db_xref="UniProtKB/Swiss-Prot:Q50696" /db_xref="GeneID:887763" /translation="MGLNTAIATRVNGTPPPEVPIADIELGSLDFWALDDDVRDGAFA TLRREAPISFWPTIELPGFVAGNGHWALTKYDDVFYASRHPDIFSSYPNITINDQTPE LAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVSSMIANNPDRQ ADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGFGDPDLATDFDEFMQVSAD IGAYATALAEDRRVNHHDDLTSSLVEAEVDGERLSSREIASFFILLVVAGNETTRNAI THGVLALSRYPEQRDRWWSDFDGLAPTAVEEIVRWASPVVYMRRTLTQDIELRGTKMA AGDKVSLWYCSANRDESKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVA FDELRRQMPDVVATEEPARLLSQFIHGIKTLPVTWS" gene complement(2541644..2542810) /locus_tag="Rv2267c" /db_xref="GeneID:888487" CDS complement(2541644..2542810) /locus_tag="Rv2267c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2267c, (MTCY339.43), len: 388 aa. Conserved hypothetical protein; some similarity to Mycobacterium tuberculosis Rv3529c; gp|Z82098|MTCY3C7_27 (384 aa) FASTA score: opt: 261, E(): 3.6e-10; 27.3% identity in 253 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216783.1" /db_xref="GI:15609404" /db_xref="GOA:P64963" /db_xref="UniProtKB/Swiss-Prot:P64963" /db_xref="GeneID:888487" /translation="MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSR WHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIADPPIFIVGHWRTGTTLLHELLVV DDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQ GLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKN PTHSFRIKVLLEVFPQAKFIHIVRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKV VSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQHLGLGDFECYLPRL RQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG" gene complement(2542807..2544276) /gene="cyp128" /locus_tag="Rv2268c" /db_xref="GeneID:888025" CDS complement(2542807..2544276) /gene="cyp128" /locus_tag="Rv2268c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /experiment="experimental evidence, no additional details recorded" /note="Rv2268c, (MT2330, MTCY339.42), len: 489 aa. Probable cyp128, cytochrome P450 (EC 1.14.-.-), similar to (but longer than) cytochrome p-450 e.g. CPXK_SACER P3 3271 cytochrome p-450 107b1 (405 aa), FASTA scores, opt: 620, E(): 8.3e-33, (31.8% identity in 406 aa overlap); contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature, similar to MTCY50.26, 32.7% identity in 382 aa overlap" /codon_start=1 /transl_table=11 /product="cytochrome P450 128" /protein_id="NP_216784.1" /db_xref="GI:15609405" /db_xref="GOA:P63713" /db_xref="UniProtKB/Swiss-Prot:P63713" /db_xref="GeneID:888025" /translation="MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTR RRASSGGIPVATSMWRDAQTVRTYGPAVAKALALRVAGKARSRLTGRHCRKFMQLTDF DPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNHDTLSSARGVT FSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMVDQLARELVGGLLTQTPAD VVSTVAAPMPMRAITSVLGVDGPDEAAFCRLSNQAVRITDVALSASGLISLVQGFAGF RRLRALFTHRRDNGLLRECTVLGKLATHAEQGRLSDDELFFFAVLLLVAGYESTAHMI STLFLTLADYPDQLTLLAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPA GSLVLLAWGAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILR EIVANIDRIEVVEPPTWTTNANLRGLTRLRVAVTPRVAP" misc_feature complement(2542966..2542995) /gene="cyp128" /locus_tag="Rv2268c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(2544289..2544621) /locus_tag="Rv2269c" /db_xref="GeneID:887623" CDS complement(2544289..2544621) /locus_tag="Rv2269c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2269c, (MTCY339.41), len: 110 aa. Unknown protein; questionable ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216785.1" /db_xref="GI:15609406" /db_xref="UniProtKB/Swiss-Prot:P64965" /db_xref="GeneID:887623" /translation="MANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRY GGRAGIGRSETVTDHGAVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCPPL PCDCSTPL" gene 2544698..2545225 /gene="lppN" /locus_tag="Rv2270" /db_xref="GeneID:887448" CDS 2544698..2545225 /gene="lppN" /locus_tag="Rv2270" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2270, (MTCY339.40c), len: 175 aa. Probable lppN, lipoprotein; has appropriately positioned prokaryotic membrane lipoprotein attachment site PS00013." /codon_start=1 /transl_table=11 /product="lipoprotein lppN" /protein_id="NP_216786.1" /db_xref="GI:15609407" /db_xref="GOA:Q50693" /db_xref="UniProtKB/Swiss-Prot:Q50693" /db_xref="GeneID:887448" /translation="MRLPGRHVLYALSAVTMLAACSSNGARGGIASTNMNPTNPPATA ETATVSPTPAPQSARTETWINLQVGDCLADLPPADLSRITVTIVDCATAHSAEVYLRA PVAVDAAVVSMANRDCAAGFAPYTGQSVDTSPYSVAYLIDSHQDRTGADPTPSTVICL LQPANGQLLTGSARR" gene 2545332..2545631 /locus_tag="Rv2271" /db_xref="GeneID:887223" CDS 2545332..2545631 /locus_tag="Rv2271" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2271, (MTCY339.39c), len: 99 aa. Conserved hypothetical protein; some similarity to hypothetical protein AAK01340.1|AF265275_3 (AF265275) from uncultured organism Pu8 (104 aa) E= 4e-10, (34% identity in 91 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216787.1" /db_xref="GI:15609408" /db_xref="UniProtKB/Swiss-Prot:P64967" /db_xref="GeneID:887223" /translation="MTTPPDKARRRFLRDAYKNAERVARTALLTIDQDQLEQLLDYVD ERLGEQPCDHTARHAQRWAQSHRIEWETLAEGLQEFGGYCDCEIVMNVEPEAIFG" gene 2545737..2546105 /locus_tag="Rv2272" /db_xref="GeneID:887459" CDS 2545737..2546105 /locus_tag="Rv2272" /function="UNKNOWN" /note="Rv2272, (MTCY339.38c), len: 122 aa. Probable conserved transmembrane PROTEIN, similar to YIDH_ECOLI P31445 hypothetical 12.8 kDa protein (115 aa), FASTA scores, opt: 291, E(): 2.9e-14, (45.6% identity in 103 aa overlap), similar to MTCY339.37c, (35.0% identity in 100 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216788.1" /db_xref="GI:15609409" /db_xref="GOA:P64969" /db_xref="UniProtKB/Swiss-Prot:P64969" /db_xref="GeneID:887459" /translation="MADDSNDTATDVEPDYRFTLANERTFLAWQRTALGLLAAAVALV QLVPELTIPGARQVLGVVLAILAILTSGMGLLRWQQADRAMRRHLPLPRHPTPGYLAV GLCVVGVVALALVVAKAITG" gene 2546102..2546431 /locus_tag="Rv2273" /db_xref="GeneID:888440" CDS 2546102..2546431 /locus_tag="Rv2273" /function="UNKNOWN" /note="Rv2273, (MTCY339.37c), len: 109 aa. Probable conserved transmembrane protein, similar to Rv2272 (MTCY339.38c), (35.0% identity in 100 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216789.1" /db_xref="GI:15609410" /db_xref="GOA:P64971" /db_xref="UniProtKB/Swiss-Prot:P64971" /db_xref="GeneID:888440" /translation="MNRHSTAASDRGLQAERTTLAWTRTAFALLVNGVLLTLKDTQGA DGPAGLIPAGLAGAAASCCYVIALQRQRALSHRPLPARITPRGQVHILATAVLVLMVV TAFAQLL" gene complement(2546488..2546805) /locus_tag="Rv2274c" /db_xref="GeneID:888067" CDS complement(2546488..2546805) /locus_tag="Rv2274c" /function="UNKNOWN" /note="Rv2274c, (MTCY339.36), len: 105 aa. Unknown protein; questionable ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216790.1" /db_xref="GI:15609411" /db_xref="UniProtKB/Swiss-Prot:Q50689" /db_xref="GeneID:888067" /translation="MSIARSAQPIGWISCPPKGGSSCCRCGGGYTHIFCVSAWTGLVV DLQAEQVRSVVTERLRRRIGRGAPILAGTLAPGVGLAAQNREFRQFTGRSAPPSATIA FGE" gene 2546883..2547752 /locus_tag="Rv2275" /db_xref="GeneID:888355" CDS 2546883..2547752 /locus_tag="Rv2275" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2275, (MTCY339.35c), len: 289 aa. Conserved hypothetical protein. Some similarity to Bacillus subtilis sp|O34351|O34351 YVMC (248 aa), FASTA score: opt: 280, E(): 2.7e -11; 28.2% identity in 227 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216791.1" /db_xref="GI:15609412" /db_xref="UniProtKB/TrEMBL:Q50688" /db_xref="GeneID:888355" /translation="MSYVAAEPGVLISPTDDLQSPRSAPAAHDENADGITGGTRDDSA PNSRFQLGRRIPEATAQEGFLVRPFTQQCQIIHTEGDHAVIGVSPGNSYFSRQRLRDL GLWGLTNFDRVDFVYTDVHVAESYEALGDSAIEARRKAVKNIRGVRAKITTTVNELDP AGARLCVRPMSEFQSNEAYRELHADLLTRLKDDEDLRAVCQDLVRRFLSTKVGPRQGA TATQEQVCMDYICAEAPLFLDTPAILGVPSSLNCYHQSLPLAEMLYARGSGLRASRNQ GHAIVTPDGSPAE" gene 2547749..2548939 /gene="cyp121" /locus_tag="Rv2276" /db_xref="GeneID:888373" CDS 2547749..2548939 /gene="cyp121" /locus_tag="Rv2276" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS. IT HAS BEEN SHOWN TO BIND TIGHLY TO A RANGE OF AZOLE-BASED ANTIFUNGAL DRUGS (e.g. MICONAZOLE, CLOTRIMAZOLE)." /experiment="experimental evidence, no additional details recorded" /note="Rv2276, (MT2336, MTCY339.34c), len: 396 aa. cyp121, cytochrome P450 (EC 1.14.-.-) (see citation below), similar to e.g. G303644 (397 aa) opt: 675, z-score: 776.4, E(): 2.7e-36, (33.7% identity in 407 aa overlap); contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature, similar to MTCY339.42, 29.2% identity in 298 aa overlap." /codon_start=1 /transl_table=11 /product="cytochrome P450 121 CYP121" /protein_id="NP_216792.1" /db_xref="GI:15609413" /db_xref="GOA:Q59571" /db_xref="UniProtKB/Swiss-Prot:Q59571" /db_xref="GeneID:888373" /translation="MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWL VSSYALCTQVLEDRRFSMKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMKAI TPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVLGIPQEDGPKL FRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITTGLMGELSRLRKDPAYSHV SDELFATIGVTFFGAGVISTGSFLTTALISLIQRPQLRNLLHEKPELIPAGVEELLRI NLSFADGLPRLATADIQVGDVLVRKGELVLVLLEGANFDPEHFPNPGSIELDRPNPTS HLAFGRGQHFCPGSALGRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERL PVLW" misc_feature 2548760..2548789 /gene="cyp121" /locus_tag="Rv2276" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(2549124..2550029) /locus_tag="Rv2277c" /db_xref="GeneID:888498" CDS complement(2549124..2550029) /locus_tag="Rv2277c" /function="UNKNOWN" /note="Rv2277c, (MTCY339.33), len: 301 aa. Possible glycerolphosphodiesterase, similar to e.g. UGPQ_ECOLI P10908 glycerophosphoryldiester phosphodiesterase (cytosolic) (247 aa), FASTA scores, opt: 149, E(): 0.0061, (27.2% identity in 195 aa overlap). Start of protein uncertain, encoded by neighbouring IS6110 as given, is intact in Mycobacterium tuberculosis CDC1551" /codon_start=1 /transl_table=11 /product="glycerolphosphodiesterase" /protein_id="NP_216793.1" /db_xref="GI:15609414" /db_xref="GOA:Q50687" /db_xref="UniProtKB/Swiss-Prot:Q50687" /db_xref="GeneID:888498" /translation="MPGRFTVALVIALGGTCGVADALPLGQTDDPMIVAHRAGTRDFP ENTVLAITNAVAAGVDGMWLTVQVSSDGVPVLYRPSDLATLTDGAGPVNSKTVQQLQQ LNAGWNFTTPGVEGHPYRQRATPIPTLEQAIGATPPDMTLFLDLKQTPPQPLVSAVAQ VLTRTGAAGRSIVYSTNADITAAASRQEGLQVAESRDVTRQRLFNMALNHHCDPQPDP GKWAGFELHRDVTVTEEFTLGSGISAVNAELWDEASVDCFRSQSGMKVMGFAVKTVDD YRLAHKIGLDAVLVDSPLAAQQWRH" repeat_region 2550011..2550013 /note="3 bp direct repeat, ccg, flanking IS6110" repeat_region 2550014..2551368 /note="IS6110-7, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-7" repeat_region 2550014..2550041 /note="28 bp inverted repeat at the left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 2550065..2550391 /locus_tag="Rv2278" /db_xref="GeneID:888602" CDS 2550065..2550391 /locus_tag="Rv2278" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2278, (MTCY339.32c), len: 108 aa. Probable IS6110 transposase nearly identical to SW:YI32_MYCTU P19772 insertion element IS986 hypothetical (96.6% identity in 59 aa overlap), similar to TR:G309867 IS401 transposase subunit (51.4% identity in 105 aa overlap), predicted region of coiled coil at C-terminus." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216794.1" /db_xref="GI:15609415" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:888602" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 2550388..2551326 /locus_tag="Rv2279" /db_xref="GeneID:887746" CDS <2550388..2551326 /locus_tag="Rv2279" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2279, (MTCY339.31c), len: 312 aa. Probable IS6110 transposase , nearly identical to TRA9_MYCTU P19774 putative transposase for insertion sequence IS986 orfB, (278 aa), FASTA scores, opt: 1859, E(): 0, (99.6% identity in 278 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216795.1" /db_xref="GI:15609416" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:887746" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region 2551341..2551368 /note="28 bp inverted repeat at the right end of IS6110, GAGTCTCCGGACTCACCGGGGCGGTTCA" repeat_region 2551369..2551371 /note="3 bp direct repeat, ccg, flanking IS6110" gene 2551560..2552939 /locus_tag="Rv2280" /db_xref="GeneID:887601" CDS 2551560..2552939 /locus_tag="Rv2280" /function="oxidoreduction" /note="Rv2280, (MTCY339.30c), len: 459 aa. Probable dehydrogenase. Similar to D-lactate dehydrogenase (cytochrome) precursor e.g. G1061264 (587 aa), FASTA scores, opt: 645,E(): 1.3e-31, (28.0% identity in 478 aa overlap), similar to MTCY50.25, 36.5% identity in 447 aa overlap" /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_216796.1" /db_xref="GI:15609417" /db_xref="GOA:Q50685" /db_xref="UniProtKB/TrEMBL:Q50685" /db_xref="GeneID:887601" /translation="MSEMTARFSEIVGNANLLTGDAIPEDYAHDEELTGPPQKPAYAA KPATPEEVAQLLKAASENGVPVTARGSGCGLSGAARPVEGGLLISFDRMNKVLEVDTA NQVAVVQPGVALTDLDAATADTGLRYTVYPGELSSSVGGNVGTNAGGMRAVKYGVARH NVLGLQAVLPTGEIIRTGGRMAKVSTGYDLTQLIIGSEGTLALVTEVIVKLHPRLDHN ASVLAPFADFDQVMAAVPKILASGLAPDILEYIDNTSMAALISTQNLELGIPDQIRDS CEAYLLVALENRIADRLFEDIQTVGEMLMELGAVDAYVLEGGSARKLIEAREKAFWAA KALGADDIIDTVVPRASMPKFLSTARGLAAAADGAAVGCGHAGDGNVHMAIACKDPEK KKKLMTDIFALAMELGGAISGEHGVGRAKTGYFLELEDPVKISLMRRIKQSFDPAGIL NPGVVFGDT" gene 2553173..2554831 /gene="pitB" /locus_tag="Rv2281" /db_xref="GeneID:887257" CDS 2553173..2554831 /gene="pitB" /locus_tag="Rv2281" /function="Involved in phosphate transport." /experiment="experimental evidence, no additional details recorded" /note="Rv2281, (MTCY339.29c), len: 552 aa. Putative pitB, phosphate-transport permease, integral membrane protein, similar to YG04_HAEIN P45268 putative phosphate permease hi1604 (420 aa). FASTA scores, opt: 484, E(): 5e-23, (33.5% identity in 498 aa overlap) also to G399598 amphotropic murine retrovirus receptor (656 aa) FASTA scores, opt: 453, E(): 5.8e-21, (26.8% identity in 645 aa overlap). Also similar to Rv0545c|pitA from M. tuberculosis. BELONGS TO THE PIT SUBFAMILY." /codon_start=1 /transl_table=11 /product="phosphate ABC transporter permease" /protein_id="NP_216797.1" /db_xref="GI:15609418" /db_xref="GOA:P65712" /db_xref="UniProtKB/Swiss-Prot:P65712" /db_xref="GeneID:887257" /translation="MSDNAKHHRDGHLVASGLQDRAARTPQHEGFLGPDRPWHLSFSL LLAGSFVLFSWWAFDYAGSGANKVILVLATVVGMFMAFNVGGNDVANSFGTSVGAGTL TMKQALLVAAIFEVSGAVIAGGDVTETIRSGIVDLSGVSVDPRDFMNIMLSALSAAAL WLLFANRMGYPVSTTHSIIGGIVGAAIALGMVSGQGGAALRMVQWDQIGQIVVSWVLS PVLGGLVSYLLYGVIKRHILLYNEQAERRLTEIKKERIAHRERHKAAFDRLTEIQQIA YTGALARDAVAANRKDFDPDELESDYYRELHEIDAKTSSVDAFRALQNWVPLVAAAGS MIIVAMLLFKGFKHMHLGLTTMNNYFIIAMVGAAVWMATFIFAKTLRGESLSRSTFLM FSWMQVFTASGFAFSHGSNDIANAIGPFAAILDVLRTGAIEGNAAVPAAAMVTFGVAL CAGLWFIGRRVIATVGHNLTTMHPASGFAAELSAAGVVMGATVLGLPVSSTHILIGAV LGVGIVNRSTNWGLMKPIVLAWVITLPSAAILASVGLVALRAIF" gene complement(2554938..2555876) /locus_tag="Rv2282c" /db_xref="GeneID:887253" CDS complement(2554938..2555876) /locus_tag="Rv2282c" /function="UNKNOWN" /note="Rv2282c, (MTCY339.28), len: 312 aa. Probable transcriptional regulator, lysR family, similar to others e.g. YC30_CYAPA|P48271 hypothetical transcriptional regulator YCF30 (324 aa), FASTA scores: opt: 292, E(): 4e-12, (27.6% identity in 286 aa overlap); etc. Also similar to Rv0377|MTCY39.34 from Mycobacterium tuberculosis, FASTA score: (25.4% identity in 268 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature, and contains helix-turn-helix motif at aa 24 -45 (+4.93 SD)." /codon_start=1 /transl_table=11 /product="LysR family transcriptional regulator" /protein_id="NP_216798.1" /db_xref="GI:15609419" /db_xref="GOA:P67667" /db_xref="UniProtKB/Swiss-Prot:P67667" /db_xref="GeneID:887253" /translation="MPLSSRMPGLTCFEIFLAIAEAGSLGGAARELGLTQQAVSRRLA SMEAQIGVRLAIRTTRGSQLTPAGIVVAEWAARLLEVADEIDAGLGSLRTEGRQRIRV VASQTIAEQLMPHWMLSLRAADMRRGGTVPEVILTATNSEHAIAAVRDGIADLGFIEN PCPPTGLGSVVVARDELVVVVPPGHKWARRSRVVSARELAQTPLVTREPNSGIRDSLT AALRDTLGEDMQQAPPVLELSSAAAVRAAVLAGAGPAAMSRLAIADDLAFGRLLAVDI PALNLRRQLRAIWVGGRTPPAGAIRDLLSHITSRST" misc_feature complement(2555727..2555804) /locus_tag="Rv2282c" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene 2555941..2556135 /locus_tag="Rv2283" /db_xref="GeneID:887644" CDS 2555941..2556135 /locus_tag="Rv2283" /function="UNKNOWN" /note="Rv2283, (MTCY339.27c), len: 64 aa. Unknown protein; questionable ORF." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216799.1" /db_xref="GI:15609420" /db_xref="UniProtKB/Swiss-Prot:P64973" /db_xref="GeneID:887644" /translation="MLEKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHA AGPLGSTSRAGHRSYGGVAS" gene 2556145..2557440 /gene="lipM" /locus_tag="Rv2284" /db_xref="GeneID:887794" CDS 2556145..2557440 /gene="lipM" /locus_tag="Rv2284" /EC_number="3.1.-.-" /function="Hydrolysis of lipids (bound ester)." /note="Rv2284, (MTCY339.26c), len: 431 aa. Probable lipM, esterase (EC 3.1.-.-), similar to others e.g. gp|Z95844|MTCY493_28 from Mycobacterium tuberculosis cosmid (420 aa), FASTA scores: opt: 1266, E(): 0, (50.1% identity in 411 aa overlap). Some similarity to G537514 arylacetamide deacetylase (399 aa), FASTA scores: opt: 190, E(): 5.9e-05, (30.4% identity in 138 aa overlap)." /codon_start=1 /transl_table=11 /product="esterase LipM" /protein_id="NP_216800.1" /db_xref="GI:15609421" /db_xref="GOA:Q50681" /db_xref="UniProtKB/TrEMBL:Q50681" /db_xref="GeneID:887794" /translation="MGAPRLIHVIRQIGALVVAAVTAAATINAYRPLARNGFASLWSW FIGLVVTEFPLPTLASQLGGLVLTAQRLTRPVRAVSWLVAAFSALGLLNLSRAGRQAD AQLTAALDSGLGPDRRTASAGLWRRPAGGGTAKTPGPLRMLRIYRDYAHDGDISYGEY GRANHLDIWRRPDLDLTGTAPVLFQIPGGAWTTGNKRGQAHPLMSHLAELGWICVAIN YRHSPRNTWPDHIIDVKRALAWVKAHISEYGGDPDFIAITGGSAGGHLSSLAALTPND PRFQPGFEEADTRVQAAVPFYGVYDFTRLQDAMHPMMLPLLERMVVKQPRTANMQSYL DASPVTHISADAPPFFVLHGRNDSLVPVQQARGFVDQLRQVSKQPVVYAELPFTQHAF DLLGSARAAHTAIAVEQFLAEVYATQHAGSEPGPAVAIP" gene 2557473..2558810 /locus_tag="Rv2285" /db_xref="GeneID:888632" CDS 2557473..2558810 /locus_tag="Rv2285" /function="UNKNOWN" /note="Rv2285, (MTCY339.25c), len: 445 aa. Conserved hypothetical protein, member of Mycobacterium tuberculosis 15-membered protein family including Rv3740c, Rv3734c, Rv1425, Rv1760, Rv0895, Rv3480c. FASTA scores: gp|Z95844|MTCY493_29 Mycobacterium tuberculosis cosmid (459 aa) opt: 640, E(): 0; 33.4% identity in 470 aa overlap." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216801.1" /db_xref="GI:15609422" /db_xref="GOA:P67206" /db_xref="UniProtKB/Swiss-Prot:P67206" /db_xref="GeneID:888632" /translation="MKLLSPLDQMFARMEAPRTPMHIGAFAVFDLPKGAPRRFIRDLY EAISQLAFLPFPFDSVIAGGASMAYWRQVQPDPSYHVRLSALPYPGTGRDLGALVERL HSTPLDMAKPLWELHLIEGLTGRQFAMYFKAHHCAVDGLGGVNLIKSWLTTDPEAPPG SGKPEPFGDDYDLASVLAAATTKRAVEGVSAVSELAGRLSSMVLGANSSVRAALTTPR TPFNTRVNRHRRLAVQVLKLPRLKAVAHATDCTVNDVILASVGGACRRYLQELGDLPT NTLTASVPVGFERDADTVNAASGFVAPLGTSIEDPVARLTTISASTTRGKAELLAMSP NALQHYSVFGLLPIAVGQKTGALGVIPPLFNFTVSNVVLSKDPLYLSGAKLDVIVPMS FLCDGYGLNVTLVGYTDKVVLGFLGCRDTLPHLQRLAQYTGAAFEELETAALP" gene complement(2558877..2559569) /locus_tag="Rv2286c" /db_xref="GeneID:887395" CDS complement(2558877..2559569) /locus_tag="Rv2286c" /function="UNKNOWN" /note="Rv2286c, (MTCY339.24), len: 230 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis hypothetical protein, Rv2466c, AL021246|MTV008_22 (207 aa). FASTA score: opt: 324, E(): 8.9e-15; 30.4% identity in 194 aa overlap" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216802.1" /db_xref="GI:15609423" /db_xref="GOA:Q50679" /db_xref="UniProtKB/Swiss-Prot:Q50679" /db_xref="GeneID:887395" /translation="MTTVDFHFDPLCPFAYQTSVWIRDVRAQLGITINWRFFSLEEIN LVAGKKHPWERDWSYGWSLMRIGALLRRTNMSLLDRWYAAIGHELHTLGGKPHDPAVA RRLLCDVGVNAAILDAALDDPTTHDDVRADHQRVVAAGGYGVPTLFLDGQCLFGPVLV DPPAGPAALNLWSVVTGMAGLPHVYELQRPKSPADVELIAQQLRPYLDGRDWVSINRG EIVDIDRLAGRS" gene 2559703..2561331 /gene="yjcE" /locus_tag="Rv2287" /db_xref="GeneID:887570" CDS 2559703..2561331 /gene="yjcE" /locus_tag="Rv2287" /function="possibly involved in transport of Na+/H+ across the membrane." /note="Rv2287, (MTCY339.23c), len: 542 aa. Probable yjcE, conserved integral membrane transport protein, similar to eukaryote NA+/H+ exchangers e.g. YJCE_ECOLI|P32703|B4065 Putative Na(+)/H(+) exchanger from Escherichia coli (549 aa), FASTA scores: opt: 436, E(): 5.6e-21, (29.4% identity in 555 aa overlap); etc. SEEMS TO BELONG TO CPA1 FAMILY (NA(+)/H(+) EXCHANGER FAMILY)." /codon_start=1 /transl_table=11 /product="integral membrane transport protein YjcE" /protein_id="NP_216803.1" /db_xref="GI:15609424" /db_xref="GOA:P65526" /db_xref="UniProtKB/Swiss-Prot:P65526" /db_xref="GeneID:887570" /translation="MNGRRTIGEDGLVFGLVVIVALVAAVVVGTVLGHRYRVGPPVLL ILSGSLLGLIPRFGDVQIDGEVVLLLFLPAILYWESMNTSFREIRWNLRVIVMFSIGL VIATAVAVSWTARALGMESHAAAVLGAVLSPTDAAAVAGLAKRLPRRALTVLRGESLI NDGTALVLFAVTVAVAEGAAGIGPAALVGRFVVSYLGGIMAGLLVGGLVTLLRRRIDA PLEEGALSLLTPFAAFLLAQSLKCSGVVAVLVSALVLTYVGPTVIRARSRLQAHAFWD IATFLINGSLWVFVGVQIPGAIDHIAGEDGGLPRATVLALAVTGVVIATRIAWVQATT VLGHTVDRVLKKPTRHVGFRQRCVTSWAGFRGAVSLAAALAVPMTTNSGAPFPDRNLI IFVVSVVILVTVLVQGTSLPTVVRWARMPEDVAHANELQLARTRSAQAALDALPTVAD ELGVAPDLVKHLEKEYEERAVLVMADGADSATSDLAERNDLVRRVRLGVLQHQRQAVT TLRNQNLIDDIVLRELQAAMDLEEVQLLDPADAE" gene 2561328..2561705 /locus_tag="Rv2288" /db_xref="GeneID:887702" CDS 2561328..2561705 /locus_tag="Rv2288" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2288, (MTCY339.22c), len: 125 aa. Unknown hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216804.1" /db_xref="GI:15609425" /db_xref="UniProtKB/Swiss-Prot:P64975" /db_xref="GeneID:887702" /translation="MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAP MRRWCDGDVDGRKLLPPARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPG WAPFGWLHEPSGARCPKADGQSV" gene 2561675..2562457 /gene="cdh" /locus_tag="Rv2289" /db_xref="GeneID:887342" CDS 2561675..2562457 /gene="cdh" /locus_tag="Rv2289" /EC_number="3.6.1.26" /function="Involved in phospholipid biosynthesis [CATALYTIC ACTIVITY: CDP-diacylglycerol + H(2)O = CMP + phosphatidate]." /experiment="experimental evidence, no additional details recorded" /note="Rv2289, (MTCY339.21c), len: 260 aa. Probable cdh, CDP-diacylglycerol pyrophosphatase (EC 3.6.1.26), similar to CDH_SALTY|P26219 cdp-diacylglycerol pyrophosphatase (251 aa), FASTA scores: opt: 395, E(): 5.9e-20, (33.5% identity in 221 aa overlap)." /codon_start=1 /transl_table=11 /product="CDP-diacylglycerol pyrophosphatase" /protein_id="NP_216805.1" /db_xref="GI:15609426" /db_xref="GOA:P63751" /db_xref="UniProtKB/Swiss-Prot:P63751" /db_xref="GeneID:887342" /translation="MPKSRRAVSLSVLIGAVIAALAGALIAVTVPARPNRPEADREAL WKIVHDRCEFGYRRTGAYAPCTFVDEQSGTALYKADFDPYQFLLIPLARITGIEDPAL RESAGRNYLYDAWAARFLVTARLNNSLPESDVVLTINPKNARTQDQLHIHISCSSPTT SAALRNVDTSEYVGWKQLPIDLGGRRFQGLAVDTKAFESRNLFRDIYLKVTADGKKME NASIAVANVAQDQFLLLLAEGTEDQPVAAETLQDHDCSITKS" gene 2562599..2563114 /gene="lppO" /locus_tag="Rv2290" /db_xref="GeneID:887203" CDS 2562599..2563114 /gene="lppO" /locus_tag="Rv2290" /function="UNKNOWN" /note="Rv2290, (MTCY339.20c), len: 171 aa. Probable lppO, conserved lipoprotein, similar to Rv3763, 19KD_MYCTU P11572 19 kDa lipoprotein antigen precursor (159 aa) FASTA scores, opt: 119, E (): 1.3, (25.6% identity in 164 aa overlap). Contains appropriately positioned PS00013 lipoprotein motif (with one mismatch)." /codon_start=1 /transl_table=11 /product="lipoprotein lppO" /protein_id="NP_216806.1" /db_xref="GI:15609427" /db_xref="GOA:Q50675" /db_xref="UniProtKB/Swiss-Prot:Q50675" /db_xref="GeneID:887203" /translation="MTDPRHTVRIAVGATALGVSALGATLPACSAHSGPGSPPSAPSA PAAATVMVEGHTHTISGVVECRTSPAVRTATPSESGTQTTRVNAHDDSASVTLSLSDS TPPDVNGFGISLKIGSVDYQMPYQPVQSPTQVEATRQGKSYTLTGTGHAVIPGQTGMR ELPFGVHVTCP" gene 2563174..2564028 /gene="sseB" /locus_tag="Rv2291" /db_xref="GeneID:887174" CDS 2563174..2564028 /gene="sseB" /locus_tag="Rv2291" /EC_number="2.-.-.-" /function="UNKNOWN" /note="Rv2291, (MTCY339.19c), len: 284 aa. Probable sseB, thiosulfate sulfurtransferase. Very similar to thiosulfate sulfurtransferas/rhodanese from Streptomyces coelicolor AL00920 4|SC9B10_21 (283 aa) opt: 765, E(): 0; Smith-Waterman score: 765; 46.9% identity in 286 aa overlap, similar to THTR_ECOLI P31142 putative thiosulfate sulfurtransferase (280 aa), FASTA scores, opt: 478, E(): 1e-23, (35.1% identity in 265 aa overlap)" /codon_start=1 /transl_table=11 /product="thiosulfate sulfurtransferase SseB" /protein_id="NP_216807.1" /db_xref="GI:15609428" /db_xref="GOA:Q59570" /db_xref="UniProtKB/Swiss-Prot:Q59570" /db_xref="GeneID:887174" /translation="MQARGQVLITAAELAGMIQAGDPVSILDVRWRLDEPDGHAAYLQ GHLPGAVFVSLEDELSDHTIAGRGRHPLPSGASLQATVRRCGIRHDVPVVVYDDWNRA GSARAWWVLTAAGIANVRILDGGLPAWRSAGGSIETGQVSPQLGNVTVLHDDLYAGQR LTLTAQQAGAGGVTLLDARVPERFRGDVEPVDAVAGHIPGAINVPSGSVLADDGTFLG NGALNALLSDHGIDHGGRVGVYCGSGVSAAVIVAALAVIGQDAELFPGSWSEWSSDPT RPVGRGTA" gene complement(2564029..2564253) /locus_tag="Rv2292c" /db_xref="GeneID:888805" CDS complement(2564029..2564253) /locus_tag="Rv2292c" /function="UNKNOWN" /note="Rv2292c, (MTCY339.18), len: 74 aa. Unknown hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216808.1" /db_xref="GI:15609429" /db_xref="UniProtKB/Swiss-Prot:P64977" /db_xref="GeneID:888805" /translation="MNPGFDAVDQETAAAQAVADAHGVPFLGIRGMSDGPGDPLHLPG FPVQFFVYKQIAANNAARVTEAFLQNWAGV" gene complement(2564292..2565032) /locus_tag="Rv2293c" /db_xref="GeneID:887283" CDS complement(2564292..2565032) /locus_tag="Rv2293c" /function="UNKNOWN" /note="Rv2293c, (MTCY339.17), len: 246 aa. Conserved hypothetical protein; some similarity to hypothetical protein (299 aa) AAK24237.1| (AE005897) belonging to phosphorylase family [Caulobacter crescentus] (33% identity in 131 aa overlap). Possible lipoprotein: signal peptide at N-terminus" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216809.1" /db_xref="GI:15609430" /db_xref="GOA:Q50673" /db_xref="UniProtKB/Swiss-Prot:Q50673" /db_xref="GeneID:887283" /translation="MGAPLRHCLLVAAALSLGCGVAAADPGYVANVIPCEQRTLVLSA FPAEADAVLAHTALDANPVVVADRRRYYLGSISGKKVIVAMTGIGLVNATNTTETAFA RFTCASSIAIAAVMFSGVAGGAGRTSIGDVAIPARWTLDNGATFRGVDPGMLATAQTL SVVLDNINTLGNPVCLCRNVPVVRLNHLGRQPQLFVGGDGSSSDKNNGQAFPCIPNGG SVFAANPVVHPIAHLAIPVTFSRRRDPG" gene 2565327..2566550 /locus_tag="Rv2294" /db_xref="GeneID:885868" CDS 2565327..2566550 /locus_tag="Rv2294" /function="UNKNOWN" /note="Rv2294, (MTCY339.16c), len: 407 aa. Probable aminotransferase (EC 2.6.1.-), similar to others in M. tuberculosis e.g. MTV030_19, also similar to PATB_BACSU|Q08432 putative aminotransferase b from Bacillus subtilis (387 aa), FASTA scores: opt: 563, E(): 2 .8e-29, (31.4% identity in 408 aa overlap); and to MALY_ECOLI|P23256 maly protein from Escherichia coli (390 aa), FASTA scores: opt: 530, E(): 3.6e-27, (31.3% identity in 384 aa overlap). BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES." /codon_start=1 /transl_table=11 /product="aminotransferase" /protein_id="NP_216810.1" /db_xref="GI:15609431" /db_xref="GOA:P63502" /db_xref="UniProtKB/Swiss-Prot:P63502" /db_xref="GeneID:885868" /translation="MIPNPLEELTLEQLRSQRTSMKWRAHPADVLPLWVAEMDVKLPP TVADALRRAIDDGDTGYPYGTEYAEAVREFACQRWQWHDLEVSRTAIVPDVMLGIVEV LRLITDRGDPVIVNSPVYAPFYAFVSHDGRRVIPAPLRGDGRIDLDALQEAFSSARAS SGSSGNVAYLLCNPHNPTGSVHTADELRGIAERAQRFGVRVVSDEIHAPLIPSGARFT PYLSVPGAENAFALMSASKAWNLGGLKAALAIAGREAAADLARMPEEVGHGPSHLGVI AHTAAFRTGGNWLDALLRGLDHNRTLLGALVDEHLPGVQYRWPQGTYLAWLDCRELGF DDAASDEMTEGLAVVSDLSGPARWFLDHARVALSSGHVFGIGGAGHVRINFATSRAIL IEAVSRMSRSLLERR" gene 2566772..2567410 /locus_tag="Rv2295" /db_xref="GeneID:888540" CDS 2566772..2567410 /locus_tag="Rv2295" /function="UNKNOWN" /note="Rv2295, (MTCY339.15c), len: 212 aa. Conserved hypothetical protein, cysteine-rich protein, similar to YIEJ_ECOLI P31469 hypothetical 22.5 kDa protein in tnab-bglb intergenic region (195 aa), opt: 270, E(): 3.4e-11, (36.4% identity in 198 aa overlap). Alternative start suggested by similarity 26 codons further downstream" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216811.1" /db_xref="GI:15609432" /db_xref="GOA:P67309" /db_xref="UniProtKB/Swiss-Prot:P67309" /db_xref="GeneID:888540" /translation="MDQSANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHP DPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATF TDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPD ALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADFA" gene 2567504..2568406 /locus_tag="Rv2296" /db_xref="GeneID:887796" CDS 2567504..2568406 /locus_tag="Rv2296" /EC_number="3.8.1.5" /function="Converts haloalkanes to corresponding alcohol and halides [CATALYTIC ACTIVITY: 1-haloalkane + H2O = a primary alcohol + halide]." /experiment="experimental evidence, no additional details recorded" /note="Rv2296, (MTCY339.14c), len: 300 aa. Probable haloalkane dehalogenase (EC 3.8.1.5), similar to e.g. HALO_XANAU P22643, haloalkane dehalogenase, (310 aa), opt: 510 z-score: 577.7 E(): 3.1e-25 (39.0% identity in 315 aa overlap)." /codon_start=1 /transl_table=11 /product="haloalkane dehalogenase" /protein_id="NP_216812.1" /db_xref="GI:15609433" /db_xref="GOA:P64301" /db_xref="UniProtKB/Swiss-Prot:P64301" /db_xref="GeneID:887796" /translation="MDVLRTPDSRFEHLVGYPFAPHYVDVTAGDTQPLRMHYVDEGPG DGPPIVLLHGEPTWSYLYRTMIPPLSAAGHRVLAPDLIGFGRSDKPTRIEDYTYLRHV EWVTSWFENLDLHDVTLFVQDWGSLIGLRIAAEHGDRIARLVVANGFLPAAQGRTPLP FYVWRAFARYSPVLPAGRLVNFGTVHRVPAGVRAGYDAPFPDKTYQAGARAFPRLVPT SPDDPAVPANRAAWEALGRWDKPFLAIFGYRDPILGQADGPLIKHIPGAAGQPHARIK ASHFIQEDSGTELAERMLSWQQAT" gene 2568438..2568890 /locus_tag="Rv2297" /db_xref="GeneID:887789" CDS 2568438..2568890 /locus_tag="Rv2297" /function="UNKNOWN" /note="Rv2297, (MTCY339.13c), len: 150 aa. Unknown protein; contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216813.1" /db_xref="GI:15609434" /db_xref="UniProtKB/Swiss-Prot:P64979" /db_xref="GeneID:887789" /translation="MAMEMAMMGLLGTVVGASAMGIGGIAKSIAEAYVPGVAAAKDRR QQMNVDLQARRYEAVRVWRSGLCSASNAYRQWEAGSRDTHAPNVVGDEWFEGLRPHLP TTGEAAKFRTAYEVRCDNPTLMVLSLEIGRIEKEWMVEASGRTPKHRG" misc_feature 2568738..2568755 /locus_tag="Rv2297" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene 2569082..2570053 /locus_tag="Rv2298" /db_xref="GeneID:887344" CDS 2569082..2570053 /locus_tag="Rv2298" /function="UNKNOWN" /note="Rv2298, (MTCY339.12c), len: 323 aa. Conserved hypothetical protein. Similar to SLR0545 Synechocystis sp, Q55493 hypothetical 34.6 kDa protein (314 aa), FASTA scores, opt: 427, E(): 1.7e-20, (39.3% identity in 303 aa overlap) and to YZAE_BACSU P46905 hypothetical protein in natb 3'region (268 aa) FASTA scores, opt: 370, E(): 6.1e-17, (31.4% identity in 264 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216814.1" /db_xref="GI:15609435" /db_xref="GOA:P63484" /db_xref="UniProtKB/Swiss-Prot:P63484" /db_xref="GeneID:887344" /translation="MKYLDVDGIGQVSRIGLGTWQFGSREWGYGDRYATGAARDIVKR ARALGVTLFDTAEIYGLGKSERILGEALGDDRTEVVVASKVFPVAPFPAVIKNRERAS ARRLQLNRIPLYQIHQPNPVVPDSVIMPGMRDLLDSGDIGAAGVSNYSLARWRKADAA LGRPVVSNQVHFSLAHPDALEDLVPFAELENRIVIAYSPLAQGLLGGKYGLENRPGGV RALNPLFGTENLRRIEPLLATLRAIAVDVDAKPAQVALAWLISLPGVVAIPGASSVEQ LEFNVAAADIELSAQSRDALTDAARAFRPVSTGRFLTDMVREKVSRR" gene complement(2570059..2572002) /gene="htpG" /locus_tag="Rv2299c" /db_xref="GeneID:887501" CDS complement(2570059..2572002) /gene="htpG" /locus_tag="Rv2299c" /function="MOLECULAR CHAPERONE INVOLVED IN PROTEIN FOLDING. HAS ATPASE ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="molecular chaperone" /codon_start=1 /transl_table=11 /product="heat shock protein 90" /protein_id="NP_216815.1" /db_xref="GI:15609436" /db_xref="GOA:P64411" /db_xref="UniProtKB/Swiss-Prot:P64411" /db_xref="GeneID:887501" /translation="MNAHVEQLEFQAEARQLLDLMVHSVYSNKDAFLRELISNASDAL DKLRIEALRNKDLEVDTSDLHIEIDADKAARTLTVRDNGIGMAREEVVDLIGTLAKSG TAELRAQLREAKNAAASEELIGQFGIGFYSSFMVADKVQLLTRKAGESAATRWESSGE GTYTIESVEDAPQGTSVTLHLKPEDAEDDLHDYTSEWKIRNLVKKYSDFIAWPIRMDV ERRTPASQEEGGEGGEETVTIETETLNSMKALWARPKEEVSEQEYKEFYKHVAHAWDD PLEIIAMKAEGTFEYQALLFIPSHAPFDLFDRDAHVGIQLYVKRVFIMGDCDQLMPEY LRFVKGVVDAQDMSLNVSREILQQDRQIKAIRRRLTKKVLSTIKDVQSSRPEDYRTFW TQFGRVLKEGLLSDIDNRETLLGISSFVSTYSEEEPTTLAEYVERMKDGQQQIFYATG ETRQQLLKSPHLEAFKAKGYEVLLLTDPVDEVWVGMVPEFDGKPLQSVAKGEVDLSSE EDTSEAEREERQKEFADLLTWLQETLSDHVKEVRLSTRLTESPACLITDAFGMTPALA RIYRASGQEVPVGKRILELNPSHPLVTGLRQAHQDRADDAEKSLAETAELLYGTALLA EGGALEDPARFAELLAERLARTL" gene complement(2572076..2573008) /locus_tag="Rv2300c" /db_xref="GeneID:887880" CDS complement(2572076..2573008) /locus_tag="Rv2300c" /function="UNKNOWN" /note="Rv2300c, (MTCY339.09), len: 310 aa (start uncertain). Conserved hypothetical protein, similar to others e.g. Q9RXY2|DR0172 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (271 aa), FASTA scores: opt: 306, E(): 1.3e-12, (34.6% identity in 229 aa overlap); Q9HZH1|PA3037 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (288 aa), FASTA scores: opt: 248, E(): 7.9e-09, (31.5% identity in 238 aa overlap); Q9PDL8|XF1361 HYPOTHETICAL PROTEIN from Xylella fastidiosa (279 aa), FASTA scores: opt: 236, E(): 4.6e-08, (29.7% identity in 249 aa overlap); U70053|XCU70053_3 GumP PROTEIN from Xanthomonas campestris (282 aa), FASTA scores: opt: 222, E(): 3.7e-07, (30.1% identity in 248 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216816.1" /db_xref="GI:15609437" /db_xref="UniProtKB/Swiss-Prot:P64981" /db_xref="GeneID:887880" /translation="MVATRGRPCPTNFSRPQRPRVAGNGTKSQRCRGRLTTSMLGVAP EAKGPPVKVHHLNCGTMNAFGIALLCHVLLVETDDGLVLVDTGFGIQDCLDPGRVGLF RHVLRPAFLQAETAARQIEQLGYRTSDVRHIVLTHFDFDHIGGIADFPEAHLHVTAAE ARGAIHAPSLRERLRYRRGQWAHGPKLVEHGPDGEPWRGFASAKPLDSIGTGVVLVPM PGHTRGHAAVAVDAGHRWVLHCGDAFYHRGTLDGRFRVPFVMRAEEKLLSYNRNQLRD NQARIVELHRRHDPDLLIVCAHDPDLYQLARDTA" gene 2573015..2573707 /gene="cut2" /locus_tag="Rv2301" /db_xref="GeneID:885371" CDS 2573015..2573707 /gene="cut2" /locus_tag="Rv2301" /EC_number="3.1.1.-" /function="HYDROLYSIS OF CUTIN (A POLYESTER THAT FORMS THE STRUCTURE OF PLANT CUTICLE)." /experiment="experimental evidence, no additional details recorded" /note="Rv2301, (MTCY339.08c), len: 230 aa. Probable cut2 (alternate gene name: cfp25), cutinase (EC 3.1.1.-), highly similar to others from Mycobacteria tuberculosis e.g. MTCY13E12.04|Rv3451|O06318|CUT3_MYCTU (247 aa), FASTA scores: opt: 569, E(): 2.3e-27, (45.3% identity in 223 aa overlap); MT2037|MTCY39.35|RV1984C|Q10837|CUT1_MYCTU (217 aa), FASTA scores: opt: 383, E(): 3.4e-16 (42.9% identity in 217 aa overlap); O69691|Rv3724|MTV025.072 PUTATIVE CUTINASE PRECURSOR (187 aa), FASTA scores: opt: 248, E(): 4.3e-08, (41.85% identity in 172 aa overlap); etc. Also similar to few others from other organisms e.g. Q9KK87 SERINE ESTERASE CUTINASE from Mycobacterium avium (220 aa), FASTA scores: opt: 391, E(): 1.1e-16, (39.15% identity in 235 aa overlap); etc. Contains PS00095 C-5 cytosine-specific DNA methylases C-terminal signature. BELONGS TO THE CUTINASE FAMILY. Start changed since first submission (+11 aa).; cfp25" /codon_start=1 /transl_table=11 /product="cutinase CUT2" /protein_id="NP_216817.2" /db_xref="GI:57116961" /db_xref="GOA:P63881" /db_xref="UniProtKB/Swiss-Prot:P63881" /db_xref="GeneID:885371" /translation="MNDLLTRRLLTMGAAAAMLAAVLLLTPITVPAGYPGAVAPATAA CPDAEVVFARGRFEPPGIGTVGNAFVSALRSKVNKNVGVYAVKYPADNQIDVGANDMS AHIQSMANSCPNTRLVPGGYSLGAAVTDVVLAVPTQMWGFTNPLPPGSDEHIAAVALF GNGSQWVGPITNFSPAYNDRTIELCHGDDPVCHPADPNTWEANWPQHLAGAYVSSGMV NQAADFVAGKLQ" misc_feature 2573201..2573257 /gene="cut2" /locus_tag="Rv2301" /note="PS00095 C-5 cytosine-specific DNA methylases C-terminal signature" gene 2573813..2574055 /locus_tag="Rv2302" /db_xref="GeneID:885154" CDS 2573813..2574055 /locus_tag="Rv2302" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2302, (MTCY339.07c), len: 80 aa. Conserved hypothetical protein, highly similar to others: O53766|AL021942|Rv0569|MTV039.07 HYPOTHETICAL 9.5 KDA PROTEIN from Mycobacterium tuberculosis (88 aa), FASTA scores: opt: 300, E(): 1.4e-14, (61.85% identity in 76 aa overlap); O88049|SCI35.11 HYPOTHETICAL 7.1 KDA PROTEIN from Streptomyces coelicolor (64 aa), FASTA scores: opt: 169, E(): 1.5e-05, (46.55% identity in 58 aa overlap) (has its C-terminus shorter); Q9XCD1 HYPOTHETICAL 12.0 KDA PROTEIN (FRAGMENT) from Thermomonospora fusca (106 aa), FASTA scores: opt: 126, E(): 0.023, (50.0% identity in 34 aa overlap) (similarity in part for this one). Also weakly similar to U650M|G699303|Q50105 HYPOTHETICAL 5.7 KDA PROTEIN from Mycobacterium leprae (53 aa), FASTA scores: opt: 89, E(): 0.66, (45.5% identity in 33 aa overlap); and weakly similar to N-terminus of Q9RIZ1|SCJ1.23c putative DNA-binding protein from Streptomyces coelicolor (323 aa), FASTA scores: opt: 182, E(): 7.3e-06, (42.25% identity in 71 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216818.1" /db_xref="GI:15609439" /db_xref="UniProtKB/Swiss-Prot:P64983" /db_xref="GeneID:885154" /translation="MHAKVGDYLVVKGTTTERHDQHAEIIEVRSADGSPPYVVRWLVN GHETTVYPGSDAVVVTATEHAEAEKRAAARAGHAAT" gene complement(2574096..2575019) /locus_tag="Rv2303c" /db_xref="GeneID:885282" CDS complement(2574096..2575019) /locus_tag="Rv2303c" /function="COULD BE INVOLVED IN ANTIBIOTIC-RESISTANCE." /note="Rv2303c, (MTCY339.06, MT2360), len: 307 aa. Probable antibiotic-resistance protein, with some similarity to Q54229|G153373 macrotetrolide antibiotic-resistance protein (NONR) from Streptomyces griseus (347 aa) (see the first citation below), FASTA scores: opt: 438, E(): 3.1e-21, (33.2% identity in 226 aa overlap); and other hypothetical proteins e.g. P95886 ORF C02006 from Sulfolobus solfataricus (269 aa), FASTA scores: opt: 252, E(): 3.5e-09, (25.5% identity in 286 aa overlap); etc. Also similar to Mycobacterium tuberculosis Rv3510c|O53555|MTV023.17. Note that the protein Q9XDF3|NONC from Streptomyces griseus subsp. griseus (317 aa) is equivalent to Q54229|G153373|NONR however the N-terminal end is shorter (30 aa) owing to a changed start codon (see the second citation below)." /codon_start=1 /transl_table=11 /product="antibiotic-resistance protein" /protein_id="NP_216819.1" /db_xref="GI:15609440" /db_xref="GOA:Q50662" /db_xref="UniProtKB/TrEMBL:Q50662" /db_xref="GeneID:885282" /translation="MTAPEPRVPVIDMWAPFVPSAEVIDDLREGFPVELLSYFEVFTK TTISAEQFGAYAESLRRTDDQILDSLDDAGITRSLITGFDERSTCGVTFVHNASVAAV AARYPDRFLPFAGADILAGDSAVDEFERWVVEHGFRGLSLRPFMIGRPASDPAYFPCY AKCVELGVPVSIHTSADWTRTRLSDLGHPRHIDDVACRFPELTILMSHGGYPWVLQAC LIAWKHPNVYLELAAHRPKYFASPGAGWEPLMRFGQTTIRNKIVYGTGGFLINRPYLQ LCDEMRALPVPREVLEDWLWRNATRVLRLDT" gene complement(2575016..2575225) /locus_tag="Rv2304c" /db_xref="GeneID:885102" CDS complement(2575016..2575225) /locus_tag="Rv2304c" /function="UNKNOWN" /note="Rv2304c, (MTCY339.05), len: 69 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216820.1" /db_xref="GI:15609441" /db_xref="UniProtKB/Swiss-Prot:P64985" /db_xref="GeneID:885102" /translation="MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFE QAFAVERDAGFDDFLHGPVGPRSTP" gene 2575809..2577098 /locus_tag="Rv2305" /db_xref="GeneID:885752" CDS 2575809..2577098 /locus_tag="Rv2305" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2305, (MTCY339.04c), len: 429 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216821.1" /db_xref="GI:15609442" /db_xref="GOA:Q50660" /db_xref="UniProtKB/Swiss-Prot:Q50660" /db_xref="GeneID:885752" /translation="MTQTLRLTALDEMFITDDIDIVPSVQIEARVSGRFDLDRLAAAL RAAVAKHALARARLGRASLTARTLYWEVPDRADHLAVEITDEPVGEVRSRFYARAPEL HRSPVFAVAVVRETVGDRLLLNFHHAAFDGMGGLRLLLSLARAYAGEPDEVGGPPIEE ARNLKGVAGSRDLFDVLIRARGLAKPAIDRKRTTRVAPDGGSPDGPRFVFAPLTIESD EMATAVARRPEGATVNDLAMAALALTILQWNRTHDVPAADSVSVNMPVNFRPTAWSTE VISNFASYLAIVLRVDEVTDLEKATAIVAGITGPLKQSGAAGWVVDLLEGGKVLPAML KRQLQLLLPLVEDRFVESVCLSNLGRVDVPAFGGEAGDTTEVWFSPTAAMSVMPIGVG LVGFGGTLRAMFRGDGRTIGGEALGRFAALYRDTLLT" gene 2577108..2577701 /locus_tag="Rv2306A" /db_xref="GeneID:3205062" CDS 2577108..2577701 /locus_tag="Rv2306A" /function="UNKNOWN" /note="Rv2306A, len: 197 aa. Possible conserved membrane protein, similar to several hypothetical membrane proteins from Mycobacterium tuberculosis and Streptomyces coelicolor, e.g. Rv0625c|P96915|Y625_MYCTU HYPOTHETICAL 25.2 KDA PROTEIN from Mycobacterium tuberculosis (246 aa), FASTA scores: opt: 410, E(): 2.7e-17, (53.25% identity in 139 aa overlap). First 140 aa show high similarity, this then decreases but continues in next ORF Rv2306B, suggesting a frameshift near nt 2577473. However the sequence has been checked and no error found. The sequence is identical in CDC1551 and Mycobacterium bovis. Replaces original Rv2306c on other strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177663.1" /db_xref="GI:57116962" /db_xref="UniProtKB/TrEMBL:Q79FG5" /db_xref="GeneID:3205062" /translation="MTDNECPADSRRRHVLRLALFAGILLGLFYLVAVARVIHVDGVR SAIVVATGPIAPLAYVVVSAALGALFVPGPILAAGSGVLFGPLLDTFVTLPAFSAGAQ AGMTPRRCWVSIAPIASMHRSNGADCGRWSVSASSPASRMRWPRTPSGRSEFRCGRWS LGRSSGRRHGCSSTPRWARRSPTCRRRWFTRRSRCGA" gene 2577488..2577922 /locus_tag="Rv2306B" /db_xref="GeneID:3205063" CDS 2577488..2577922 /locus_tag="Rv2306B" /function="UNKNOWN" /note="Rv2306B, len: 144 aa. Possible conserved membrane protein, similar to C-terminal part of several hypothetical membrane proteins from Mycobacterium tuberculosis and Streptomyces coelicolor e.g. P96915|Y625_MYCTU|RV0625c HYPOTHETICAL 25.2 KDA PROTEIN from Mycobacterium tuberculosis (246 aa), FASTA scores: opt: 480, E(): 5e-24, (77.15% identity in 92 aa overlap). Could be a continuation of Rv2306A suggesting there may be a frameshift near nt 2577473. The C-terminal part is longer than Rv0625c and the 3'-end of gene overlaps Rv2307c, so maybe a further framehift. However, sequence has been checked and no error found. Also same sequence as strain CDC1551 and Mycobacterium bovis. Replaces original Rv2306c on other strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177664.1" /db_xref="GI:57116963" /db_xref="UniProtKB/TrEMBL:Q79FG4" /db_xref="GeneID:3205063" /translation="MWAVVGQRFVPGISDALASYTFGAFGVPLWQMVVGSFIGSAPRV FVYTALGASITNLSSPLVYSAIAVWCVTAIIGAFAARRWYRKWRARPRRRCGLAQLTT GSQQRHTSHRTPAGVVMPGSLSEHRRLRQEAPDRIEHHPPIE" gene complement(2577851..2578696) /locus_tag="Rv2307c" /db_xref="GeneID:885277" CDS complement(2577851..2578696) /locus_tag="Rv2307c" /function="UNKNOWN" /note="Rv2307c, (MTCY339.02), len: 281 aa. Conserved hypothetical protein, similar to many other hypothetical proteins and BEM1/BUD5 suppressors e.g. P77538 HYPOTHETICAL PROTEIN from Escherichia coli (293 aa), FASTA scores: opt: 421, E(): 2.4e-18, (32.1% identity in 268 aa overlap) (alias AAG57647|Z3802|BAB36823|ECS3400 Putative enzyme (3.4.-) from Escherichia coli (293 aa), FASTA scores: opt: 425, E(): 1.7e-18, (32.1% identity in 268 aa overlap));P54069|BE46_SCHPO|BEM46|SPBC32H8.03|PI020 BEM46 PROTEIN from Schizosaccharomyces pombe (Fission yeast) (352 aa), FASTA scores: opt: 355, E(): 3.3e-14, (30.45% identity in 279 aa overlap); O76462|BEM46 BEM46 PROTEIN from Drosophila melanogaster (338 aa), FASTA scores: opt: 404, E(): 2.8e-17, (32.75% identity in 281 aa overlap); etc. Equivalent (but with few differences) to AAK46650|MT2364 protein from Mycobacterium tuberculosis strain CDC1551 (281 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216823.1" /db_xref="GI:15609444" /db_xref="GOA:Q50658" /db_xref="UniProtKB/Swiss-Prot:Q50658" /db_xref="GeneID:885277" /translation="MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPS ASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALH GLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAA VAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVL VIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTET AVLGQ" gene complement(2579228..2579419) /locus_tag="Rv2307A" /db_xref="GeneID:3205073" CDS complement(2579228..2579419) /locus_tag="Rv2307A" /function="UNKNOWN" /note="Rv2307A, len: 63 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="glycine rich protein" /protein_id="YP_177665.1" /db_xref="GI:57116964" /db_xref="UniProtKB/TrEMBL:Q8VJM8" /db_xref="GeneID:3205073" /translation="MAFVDLRYPWCRGDGWISPPVVAVALGWAMRRKPFSRFNEYVGS ASNTCWFARALELRTLLIR" gene complement(2579504..2579935) /locus_tag="Rv2307B" /db_xref="GeneID:3205074" CDS complement(2579504..2579935) /locus_tag="Rv2307B" /function="UNKNOWN" /note="Rv2307B, len: 143 aa. Hypothetical unknown Gly- rich protein. Equivalent to AAK46653 from Mycobacterium tuberculosis strain CDC1551 (133 aa) but longer 10 aa." /codon_start=1 /transl_table=11 /product="glycine rich protein" /protein_id="YP_177666.1" /db_xref="GI:57116965" /db_xref="UniProtKB/TrEMBL:Q79FG2" /db_xref="GeneID:3205074" /translation="MEEVPTGPPAMGHRACGGQKAAFPTRMNSGVEKMYKNSIAIAIG TLTMAVEFSMVSANAEPAPPPGQDPHMPNSAMGYCPGGGFGGITGWGYCDGIRYPDGS YWHQVRVPAPFVGTTLTLSCVIDDGSPVPPLAAPGSCGGGA" gene complement(2580028..2580210) /locus_tag="Rv2307D" /db_xref="GeneID:3205075" CDS complement(2580028..2580210) /locus_tag="Rv2307D" /function="UNKNOWN" /note="Rv2307D, len: 60 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177667.1" /db_xref="GI:57116966" /db_xref="UniProtKB/TrEMBL:Q8VJM6" /db_xref="GeneID:3205075" /translation="MWRHLWLMQPQRRYPRGSGTTRTARRDAGVAPLYGVSRVTVLAS TTATTAPPVKSFPDLL" gene 2580419..2581135 /locus_tag="Rv2308" /db_xref="GeneID:885290" CDS 2580419..2581135 /locus_tag="Rv2308" /function="UNKNOWN" /note="Rv2308, (MTCY339.01c), len: 238 aa. Conserved hypothetical protein, sharing similarity with O53464|Rv2018|MTV018.05 from Mycobacterium tuberculosis (239 aa), FASTA scores: opt: 142, E(): 0.034, (24.8% identity in 250 aa overlap). As contains possible helix-turn-helix motif at aa 16-37 (Sequence: YVYAEVDKLIGLPAGTAKRWIN) (Score 1169, +3.17 SD), may be a transcriptional regulator." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216824.1" /db_xref="GI:15609445" /db_xref="UniProtKB/Swiss-Prot:Q50657" /db_xref="GeneID:885290" /translation="MRADMSVTSMLDREVYVYAEVDKLIGLPAGTAKRWINGYERGGK DHPPILRVTPGATPWVTWGEFVETRMLAEYRDRRKVPIVRQRAAIEELRARFNLRYPL AHLRPFLSTHERDLTMGGEEIGLPDAEVTIRTGQALLGDARWLASIATPGRDEVGEAV IVELPVDKAFPEIVINPSRYSGQPTFVGRRVSPVTIAQMVDGGEEREDLAADYGLSLK QIQDAIDYTKKYRLARLVAA" gene complement(2581764..2581837) /locus_tag="Rvnt24" /note="tRNA-Met(CAT)" /db_xref="GeneID:2700432" tRNA complement(2581764..2581837) /locus_tag="Rvnt24" /product="tRNA-Met" /note="codon recognized: AUG" /anticodon=(pos:2581801..2581803,aa:Met) /db_xref="GeneID:2700432" gene complement(2581843..2582298) /locus_tag="Rv2309c" /db_xref="GeneID:885133" CDS complement(2581843..2582298) /locus_tag="Rv2309c" /function="USE FOR SEQUENCE INTEGRATION. INTEGRASE IS NECESSARY FOR INTEGRATION OF A PHAGE INTO THE HOST GENOME BY SITE-SPECIFIC RECOMBINATION. IN CONJUNCTION WITH EXCISIONASE, INTEGRASE IS ALSO NECESSARY FOR EXCISION OF THE PROPHAGE FROM THE HOST GENOME (BY SIMILARITY)." /note="Rv2309c, (MTCY3G12.25), len: 151 aa. Possible integrase (fragment), similar to others e.g. Q48908 INTEGRASE (FRAGMENT) from Mycobacterium paratuberculos (191 aa), FASTA scores: opt: 279, E(): 3.2e-11, (40.4% identity in 136 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. Rv1055|MTV017.08 INTEGRASE (FRAGMENT) (78 aa) (72.85% identity in 70 aa overlap); and Rv1054|MTV017.07 INTEGRASE (FRAGMENT). COULD BELONG TO THE 'PHAGE' INTEGRASE FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216825.1" /db_xref="GI:15609446" /db_xref="GOA:P71903" /db_xref="UniProtKB/TrEMBL:P71903" /db_xref="GeneID:885133" /translation="MTGAGIVETTTNRVRHVPVPEPVSERLRDELPTEPNALVFPSYR GGHLPIEEYRRAFDKGCKAVGIADLVPHGLRHTTASLAISAGANVKVVQRLLGHATAA MTLDRHGHLLSDDLAGVAGLLVQAIKSAAASLRYSDPDSVAVENISAAS" gene 2583045..2583332 /locus_tag="Rv2309A" /db_xref="GeneID:3205076" CDS 2583045..2583332 /locus_tag="Rv2309A" /function="UNKNOWN" /note="Rv2309A, len: 95 aa. Hypothetical unknown protein. Equivalent to AAK46663 from Mycobacterium tuberculosis strain CDC1551 (95 aa) but longer 13 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177668.1" /db_xref="GI:57116967" /db_xref="UniProtKB/TrEMBL:Q8VJL9" /db_xref="GeneID:3205076" /translation="MATSSDDITINRHPPLNCAVNRHDESRRSPLRRGLLANGLRERQ AGALFERYESQFDSFGYIEKVRYRGSGYRVEDVYARADSGPSAGAELPVGP" gene 2583435..2583779 /locus_tag="Rv2310" /db_xref="GeneID:885175" CDS 2583435..2583779 /locus_tag="Rv2310" /function="USE FOR SEQUENCE EXCISION." /note="Rv2310, (MT2372, MTCY3G12.24c), len: 148 aa. Possible excisionase, showing some similarity to others e.g. Q9LCU5 PUTATIVE EXCISIONASE from Arthrobacter sp. TM1 (174 aa) FASTA scores: opt: 341, E(): 6.6e-15, (48.2% identity in 110 aa overlap); O85865 PUTATIVE EXCISIONASE from Sphingomonas aromaticivorans (152 aa), FASTA scores: opt: 205, E(): 2.2e-06, (41.25% identity in 80 aa overlap); etc. Also similar to Rv3750c|O69717 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (130 aa), FASTA scores: opt: 228, E(): 6.9e-08, (43.9% identity in 82 aa overlap). Contains possible helix-turn-helix motif at aa 20-41 (Score 2181, +6.62 SD)." /codon_start=1 /transl_table=11 /product="excisionase" /protein_id="NP_216826.1" /db_xref="GI:15609447" /db_xref="GOA:P64987" /db_xref="UniProtKB/Swiss-Prot:P64987" /db_xref="GeneID:885175" /translation="MVAALHAGKAVTIAPQSMTLTTQQAADLLGVSRPTVVRLIKSGE LAAERIGNRHRLVLDDVLAYREARRQRQYDALAESAMDIDADEDPEVICEQLREARRV VAARRRTERRRA" gene 2583884..2584408 /locus_tag="Rv2311" /db_xref="GeneID:885168" CDS 2583884..2584408 /locus_tag="Rv2311" /function="UNKNOWN" /note="Rv2311, (MTCY3G12.23c), len: 174 aa. Conserved hypothetical protein, with similarity (in part) to transfer proteins homologous TRAA e.g. Q9EUN8|TRAA TRANSFER PROTEIN HOMOLOG TRAA from Corynebacterium glutamicum (1160 aa), FASTA scores: opt: 221, E(): 2.9e-07, (36.8% identity in 136 aa overlap); Q9ETQ3|TRAA CONJUGAL TRANSFER PROTEIN (TRAA-LIKE PROTEIN) from Corynebacterium equii (1367 aa), FASTA scores: opt: 188, E(): 5.5e-05, (33% identity in 106 aa overlap); P55418|TRAA_RHISN|Y4DS PROBABLE CONJUGAL TRANSFER PROTEIN from Rhizobium sp. strain NGR234 (1102 aa), FASTA scores: opt: 145, E(): 0.035, (29.08% identity in 141 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216827.1" /db_xref="GI:15609448" /db_xref="UniProtKB/Swiss-Prot:P64989" /db_xref="GeneID:885168" /translation="MAPTGQAVDVAVREGAGDVGYSVERENLPADDPVRNGNRWRVIA VDTEHHRIAARRLGDGARAAFSGDYLHEHITHGYAITVHASQGTTAHSTHAVLGDNTS RATLYVAMTPARESNTAYLCERTAGEGARVDLAGWDLWVSGKAEAMSDEKSASPVWCR VGARCDHRGKRSCW" gene 2584486..2584755 /locus_tag="Rv2312" /db_xref="GeneID:885152" CDS 2584486..2584755 /locus_tag="Rv2312" /function="UNKNOWN" /note="Rv2312, (MTCY3G12.22c), len: 89 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216828.1" /db_xref="GI:15609449" /db_xref="UniProtKB/Swiss-Prot:P64991" /db_xref="GeneID:885152" /translation="MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPIN TPGPGRTKQFMEELSQLASAPGPDIDGGIDLTDDEFQAFLQAARS" gene complement(2585052..2585906) /locus_tag="Rv2313c" /db_xref="GeneID:885268" CDS complement(2585052..2585906) /locus_tag="Rv2313c" /function="UNKNOWN" /note="Rv2313c, (MTCY3G12.21), len: 284 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216829.1" /db_xref="GI:15609450" /db_xref="UniProtKB/Swiss-Prot:P64993" /db_xref="GeneID:885268" /translation="MPAPVSVRDDLCRLVALSPGDGRIAGLVRQVCARALSLPSLPCE VAVNEPESPAEAVVAEFAEQFSVDVSAITGEQRSLLWTHLGEDAFGAVVAMYIADFVP RVRAGLEALGVGKEYLGWVTGPISWDHNTDLSAAVFNGFLPAVARMRALDPVTSELVR LRGAAQHNCRVCKSLREVSALDAGGSETLYGEIERFDTSVLLDVRAKAALRYADALIW TPAHLAVDVAVEVRSRFSDDEAVELTFDIMRNASNKVAVSLGADAPRVQQGTERYRIG LDGQTVFG" gene complement(2585917..2587290) /locus_tag="Rv2314c" /db_xref="GeneID:885130" CDS complement(2585917..2587290) /locus_tag="Rv2314c" /function="UNKNOWN" /note="Rv2314c, (MTCY3G12.20), len: 457 aa. Conserved hypothetical protein, highly similar to Q9RJ51|SCI8.02 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (464 aa) FASTA scores: opt: 1485, E(): 5.2e-83, (53.5% identity in 454 aa overlap); similar to AAK24788|CC2824 TldD/PmbA family protein from Caulobacter crescentus (441 aa), FASTA scores: opt: 364, E(): 8.3e-15, (29.8% identity in 460 aa overlap); and showing similarity with Q9HJZ6|TA0814 HYPOTHETICAL PROTEIN from Thermoplasma acidophilum (430 aa), FASTA scores: opt: 220, E(): 4.7e-06, (21.85% identity in 348 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216830.1" /db_xref="GI:15609451" /db_xref="UniProtKB/TrEMBL:P71898" /db_xref="GeneID:885130" /translation="MIEPQHAVNIVLKEAARSGRADETMVLVTEKVEATLRWAGNSMT TNGVSHSRNVTVISIVRRGDSAFVGSVVSAEVDPSVLPGLVVSSQDAARSAPEAGDAA PLLADTGEPDDWDAPVPGTGAGVFTGIAGSLSRGFRGADRLYGYAHRSVSTTFLASST GLRRRYTQPTGAIEINAKRGDASAWVGIGTPDFVEVPIDLMLERLSTRLRWAQRTVEL PAGRYQTIMPPSTVADMMIYLGWSMAGRGAQEGRTAFSAPGGGTRVGERLTELPLTLF TDPAAPGLACTPFVAVSNSSETQSVFDNGMEISQVDWIRSGVINALAYPRATAAKFDA PVAVAADNLIMTGGSADLADMIAGTERGLLLTTLWYIREVDPTTLLLTGLTRDGVYLV EDGEVSAAVNNFRFNESPLDLLRRATEAGVSEPTLPREWSDWVTRTAMPPLRIPDFHM SSVSQAQ" gene complement(2587287..2588804) /locus_tag="Rv2315c" /db_xref="GeneID:885260" CDS complement(2587287..2588804) /locus_tag="Rv2315c" /function="UNKNOWN" /note="Rv2315c, (MTCY3G12.19), len: 505 aa. Conserved hypothetical protein, highly similar to Q9S273|SCI28.10 HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (435 aa), FASTA scores: opt: 1768, E():5.6e-101, (63.2% identity in 432 overlap); and similar to others e.g. AAK24787|CC2823 hypothetical protein (TldD/PmbA family) from Caulobacter crescentus (543 aa), FASTA scores: opt: 876, E():3.1e-46, (42.8% identity in 505 overlap); O58578|PH0848 HYPOTHETICAL 54.4 KDA PROTEIN from Pyrococcus horikoshii (481 aa), FASTA scores: opt: 661, E(): 4.3e-33, (29.95% identity in 484 aa overlap); Q9UZ95|PAB1547 HYPOTHETICAL 53.6 KDA PROTEIN from Pyrococcus abyssi (473 aa), FASTA scores: opt: 656, E(): 8.6e-33, (29.1% identity in 481 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216831.1" /db_xref="GI:15609452" /db_xref="UniProtKB/TrEMBL:P71897" /db_xref="GeneID:885260" /translation="MTPNRGIDEDFLDLPRQQLADAALSAAATAGASHADLRVHRIST EIIQLRDGELETAVISRELGLAVRVIVAGTWGFASHAELAPDVAAATARHAVHVATVL AALNTERVRLAPEPVYTDAEWVSNYRIDPFGVPASEKIAVLRDYSGRLLDADGIDHVS ASLNAVKEQTFYADTFGSSITQQRVRLLPCLDAVAVDSAAGNFESMRTLAPPTARGWE VVAGDEIWNWTDELAQLPSLLAEKVRAPSVMPGPTDLVIDPTNLWLTIHESIGHATEY DRAIGYEAAYAGTSFATPDKLGTLRYGSPVMNVTADRTAEFGLATVGYDDEGVAAQSW DLVRDGVFVGYQLDRAFAPRLGEPRSNGCSYADSPHHVPIQRMANISLQPGIEDLSTA DLIGRVDDGIYIVGDKSWSIDMQRYNFQFTGQRFFRIRGGQLYGQLRDVAYQSSTTDF WNAMEAVGGPSTWRMGGAINCGKAQPGQVAAVSHGCPSALFRGVNVLNTRTEGGR" gene 2588838..2589710 /gene="uspA" /locus_tag="Rv2316" /db_xref="GeneID:885262" CDS 2588838..2589710 /gene="uspA" /locus_tag="Rv2316" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2316, (MTCY3G12.18c), len: 290 aa. Probable uspA, sugar-transport integral membrane protein ABC transporter (see citation below), most similar to Q9CBN8|USPA|ML1768 SUGAR TRANSPORT INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (328 aa), FASTA scores: opt: 1593, E(): 1.9e-93, (82.35% identity in 289 aa overlap); and similar to O32940|ML1426|MLCB2052.28 POSSIBLE SUGAR TRANSPORT PROTEIN (PROBABLE ABC-TRANSPORT PROTEIN, INNER MEMBRANE COMPONENT) from Mycobacterium leprae (319 aa), FASTA scores: opt: 600, E(): 9.2e-31, (34.25% identity in 295 aa overlap). Also similar to other proteins involved in transport e.g. Q9X860|SCE134.05c PUTATIVE BINDING PROTEIN DEPENDENT TRANSPORT PROTEIN from Streptomyces coelicolor (327 aa), FASTA scores: opt: 639, E(): 3.2e-33, (40.45% identity in 272 aa overlap); Q9K6N9|BH3689 SUGAR TRANSPORT SYSTEM (PERMEASE) from Bacillus halodurans (300 aa), FASTA scores: opt: 590, E(): 3.7e-30, (35.65% identity in 289 aa overlap); etc." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein ABC transporter UspA" /protein_id="NP_216832.1" /db_xref="GI:15609453" /db_xref="GOA:P71896" /db_xref="UniProtKB/TrEMBL:P71896" /db_xref="GeneID:885262" /translation="MRDAPRRRTALAYALLAPSLVGVVAFLLLPILVVVWLSLHRWDL LGPLRYVGLTNWRSVLTDSGFADSLVVTAVFVAIVVPAQTVLGLLAASLLARRLPGTG LFRTLYVLPWICAPLAIAVMWRWIVAPTDGAISTVLGHRIEWLTDPGLALPVVSAVVV WTNVGYVSLFFLAGLMAIPQDIHNAARTDGASAWQRFWRITLPMLRPTMFFVLVTGII SAAQVFDTVYALTGGGPQGSTDLVAHRIYAEAFGAAAIGRASVMAVVLFVILVGATVV QHLYFRRRISYELT" gene 2589697..2590521 /gene="uspB" /locus_tag="Rv2317" /db_xref="GeneID:885101" CDS 2589697..2590521 /gene="uspB" /locus_tag="Rv2317" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2317, (MTC3G12.17c), len: 274 aa. Probable uspB, sugar-transport integral membrane protein ABC transporter (see citation below), most similar to Q9CBN7|USPE|ML1769 SUGAR TRANSPORT INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (274 aa), FASTA scores: opt: 1522, E(): 3.4e-89, (85.0% identity in 274 aa overlap); and similar to O32941|ML1425|MLCB2052.29 PROBABLE ABC-TRANSPORT PROTEIN, INNER MEMBRANE COMPONENT from Mycobacterium leprae (283 aa), FASTA scores: opt: 630, E(): 8.4e-33, (36.55% identity in 268 aa overlap). Also similar to other integral membrane proteins e.g. P73854|LACG|SLR1723 LACTOSE TRANSPORT SYSTEM PERMEASE PROTEIN from Synechocystis sp. strain PCC 6803 (270 aa), FASTA scores: opt: 605, E(): 3.1e-31, (36.0% identity in 264 aa overlap); Q9F3B8|SC5F1.11 PUTATIVE SUGAR TRANSPORT INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (307 aa), FASTA scores: opt: 582, E(): 9.7e-30, (34.45% identity in 264 aa overlap); etc. Also similar to O53483|Rv2039c|MTV018.26c SUGAR TRANSPORT PROTEIN from Mycobacterium tuberculosis (280 aa), FASTA scores: opt: 630, E(): 8.3e-89, (37.7% identity in 268 aa overlap)." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein ABC transporter UspB" /protein_id="YP_177866.1" /db_xref="GI:57116968" /db_xref="GOA:Q7D7B8" /db_xref="UniProtKB/TrEMBL:Q7D7B8" /db_xref="GeneID:885101" /translation="MSSPSRVSNTAVYAVLTIGAVITLSPFLLGLLTSFTSAHQFATG TPLQLPRPPTLANYADIADAGFRRAAVVTALMTAVILLGQLTFSVLAAYAFARLQFRG RDALFWVYVATLMVPGTVTVVPLYLMMAQLGLRNTFWALVLPFMFGSPYAIFLLREHF RLIPDDLINAARLDGANTLDVIVHVVIPSSRPVLAALAMITVVSQWNNFMWPLVITSG HKWRVLTVATADLQSRFNDQWTLVMAATTVAIVPLIALFVTFQRHIVASIVVSGLK" gene 2590518..2591840 /gene="uspC" /locus_tag="Rv2318" /db_xref="GeneID:885143" CDS 2590518..2591840 /gene="uspC" /locus_tag="Rv2318" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF SUGAR ACROSS THE MEMBRANE (IMPORT)." /note="Rv2318, (MTCY3G12.16c), len: 440 aa. Probable uspC, sugar-binding lipoprotein component of sugar transport system (see citation below), most similar to Q9CBN6|USPC|ML1770 SUGAR TRANSPORT PERIPLASMIC BINDING PROTEIN from Mycobacterium leprae (446 aa), FASTA scores: opt: 2294, E(): 8.1e-135, (74.7% identity in 446 aa overlap). Also similar to other substrate-binding proteins e.g. Q9RK89|SCF1.15 PUTATIVE SUBSTRATE BINDING PROTEIN (EXTRACELLULAR) (BINDING-PROTEIN-DEPENDENT TRANSPORT) (FRAGMENT) from Streptomyces coelicolor (221 aa), FASTA scores: opt: 377, E(): 3e-16, (32.25% identity in 217 aa overlap); Q9K6N8|BH3690 SUGAR TRANSPORT SYSTEM (SUGAR-BINDING PROTEIN) from Bacillus halodurans (420 aa), FASTA scores: opt: 227, E(): 1e-06, (25.00% identity in 452 aa overlap); etc. Also similar to O53485|Rv2041c|MTV018.28C LIPOPROTEIN COMPONENT OF SUGAR TRANSPORT SYSTEM from Mycobacterium tuberculosis (439 aa), FASTA scores: opt: 246, E(): 7e-08, (26.75% identity in 325 aa overlap). Contains a hydrophobic stretch (possible signal peptide) at N-terminal end." /codon_start=1 /transl_table=11 /product="periplasmic sugar-binding lipoprotein UspC" /protein_id="NP_216834.1" /db_xref="GI:15609455" /db_xref="GOA:P71894" /db_xref="UniProtKB/TrEMBL:P71894" /db_xref="GeneID:885143" /translation="MTRPRQSTLVATALVLVAILLGVTAVLLGLSAEPRGGKIVVTVR LWDEPIAAAYRQSFAAFTRSHPDIEVRTNLVAYSTYFETLRTDVAGGSADDIFWLSNA YFAAYADSGRLMKIQTDAADWEPAVVDQFTRSGVLWGVPQLTDAGIAVFYNADLLAAA GVDPTQVDNLRWSRGDDDTLRPMLARLTVDADGRTANTPGFDARRVRQWGYNAANDPQ AIYLNYIGSAGGVFQRDGKFAFDNPGAIEAFRYLVGLINDDHVAPPASDTNDNGDFSR NQFLAGKMALFQSGTYSLAPVARDALFHWGVAMLPAGPAGRVSVTNGIAAAGNSASKH PDAVRQVLAWMGSTEGNSYLGRHGAAIPAVLSAQPVYFDYWSARGVDVTPFFAVLNGP RIAAPGGAGFAAGQQALEPYFDEMFLGRGDVTTTLRQAQAAANAATQR" gene complement(2591848..2592726) /locus_tag="Rv2319c" /db_xref="GeneID:885171" CDS complement(2591848..2592726) /locus_tag="Rv2319c" /function="UNKNOWN" /note="Rv2319c, (MTCY3G12.15), len: 292 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216835.1" /db_xref="GI:15609456" /db_xref="GOA:P64995" /db_xref="UniProtKB/Swiss-Prot:P64995" /db_xref="GeneID:885171" /translation="MTIVVGYLAGKVGPSALHLAVRVARMHKTSLTVATIVRRHWPTP SLARVDAEYELWSEQLAAASAREAQRYLRRLADGIEVSYHHRAHRSVSAGLLDVVEEL EAEVLVLGSFPSGRRARVLIGSTADRLLHSSPVPVAITPRRYRCYTDRLTRLSCGYSA TSGSVDVVRRCGHLASRYGVPMRVITFAVRGRTMYPPEVGLHAEASVLEAWAAQAREL LEKLRINGVVSEDVVLQVVTGNGWAQALDAADWQDGEILALGTSPFGDVARVFLGSWS GKIIRYSPVPVLVLPG" gene complement(2592723..2594153) /gene="rocE" /locus_tag="Rv2320c" /db_xref="GeneID:885084" CDS complement(2592723..2594153) /gene="rocE" /locus_tag="Rv2320c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF CATIONIC AMINO ACID (ESPECIALLY ARGININE AND ORNITHINE) ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2320c, (MTCY3G12.14), len: 476 aa. Probable rocE, cationic amino acid (especially arginine and ornithine) transporter (permease), highly similar to other amino acid transporters e.g. Q9L100|SCL6.16C PUTATIVE AMINO ACID TRANSPORTER from Streptomyces coelicolor (496 aa), FASTA scores: opt: 1485, E(): 9.4e-82, (48.4% identity in 477 aa overlap); O06479|YFNA PUTATIVE AMINO ACID TRANSPORTER from Bacillus subtilis (462 aa), FASTA scores: opt: 1271, E(): 6.1e-69, (41.9% identity in 463 aa overlap); Q9PG94|XF0408 AMINO ACID TRANSPORTER from Xylella fastidiosa (509 aa), FASTA scores: opt: 1128, E(): 2.5e-60, (39.5% identity in 481 aa overlap); etc. Also some similarity with Z99108.1|BSUB0005 from Bacillus subtilis (461 aa), FASTA scores: opt: 1271, E(): 0, (41.9% identity in 463 aa overlap); and G403170 ETHANOLAMINE PERMEASE (488 aa), FASTA scores: opt: 468, E(): 1e-23, (28.1% identity in 462 aa overlap). SEEMS TO BELONG TO THE APC FAMILY." /codon_start=1 /transl_table=11 /product="cationic amino acid transport integral membrane protein RocE" /protein_id="NP_216836.1" /db_xref="GI:15609457" /db_xref="GOA:P71892" /db_xref="UniProtKB/TrEMBL:P71892" /db_xref="GeneID:885084" /translation="MPTTSMSLRELMLRRRPVSGAPVASGASGNLKRSFGTFQLTMFG VGATIGTGIFFVLAQAVPEAGPGVIVSFIIAGIAAGLAAICYAELASAVPISGSAYSY AYTTLGEAVAMVVAACLLLEYGVATAAVAVGWSGYVNKLLSNLFGFQMPHVLSAAPWD THPGWVNLPAVILIGLCALLLIRGASESARVNAIMVLIKLGVLGMFMIIAFSAYSADH LKDFVPFGVAGIGSAAGTIFFSYIGLDAVSTAGDEVKDPQKTMPRALIAALVVVTGVY VLVALAALGTQPWQDFAEQETAGLAIILDNVTHGEWASTILAAGAVVSIFTVTLVTMY GQTRILFAMGRDGLLPARFAKVNPRTMTPVHNTVIVAIFASTLAAFIPLDSLADMVSI GTLTAFSVVAVGVIVLRVREPDLPRGFKVPGYPVTPVLSVLACGYILASLHWYTWLAF SGWVAVAVIFYLMWGRHHSALNEEVP" gene complement(2594154..2594699) /gene="rocD2" /locus_tag="Rv2321c" /db_xref="GeneID:885056" CDS complement(2594154..2594699) /gene="rocD2" /locus_tag="Rv2321c" /EC_number="2.6.1.13" /function="INVOLVED IN ARGININE METABOLISM [CATALYTIC ACTIVITY: L-ORNITHINE + A 2-OXO ACID = L-GLUTAMATE 5-SEMIALDEHYDE + AN L-AMINO ACID]." /note="Rv2321c, (MTCY3G12.13), len: 181 aa. Probable rocD2, ornithine aminotransferase (EC: 2.6.1.13), highly similar to C-terminal region of other ornithine aminotransferases, e.g. Q9FC90|ROCD from Streptomyces coelicolor (407 aa), FASTA scores: opt: 628, E(): 1.2e-32, (55.35% identity in 168 aa overlap); P3802|OAT_BACSU|ROCD from Bacillus subtilis (401 aa), FASTA scores: opt: 477, E(): 4.3e-23, (42.1% identity in 178 aa overlap); BAB42057|ROCD|SA0818 from Staphylococcus aureus subsp. aureus N315 (396 aa), FASTA scores: opt: 437, E(): 1.5e-20, (41.3% identity in 170 aa overlap); etc. Contains PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Rv2322c|MTCY3G12.12 (upstream ORF) and Rv2321c|MTCY3G12.13 appear to be an ornithine aminotransferase homologue but are frameshifted - we can find no sequence error in the cosmid to account for this." /codon_start=1 /transl_table=11 /product="ornithine aminotransferase" /protein_id="NP_216837.1" /db_xref="GI:15609458" /db_xref="GOA:P71891" /db_xref="UniProtKB/TrEMBL:P71891" /db_xref="GeneID:885056" /translation="MIADEIQSGLACTGYPFACDHGGVLPDIYLLGKTLGGGAVPLSA MVADREIFGVVHPGEHGSTFGGNPLAAAIGTPVVSMVVWGECQARSAKLGAHLHQRLA DLIGDGAVALRGLGWWADVDIERALAIGTDMSMRLADRGVLLKDTYGAALRFAPPLVI TAQEIDCAVRRFADALWEAGS" misc_feature complement(2594586..2594699) /gene="rocD2" /locus_tag="Rv2321c" /note="PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site" gene complement(2594699..2595364) /gene="rocD1" /locus_tag="Rv2322c" /db_xref="GeneID:885096" CDS complement(2594699..2595364) /gene="rocD1" /locus_tag="Rv2322c" /EC_number="2.6.1.13" /function="INVOLVED IN ARGININE METABOLISM [CATALYTIC ACTIVITY: L-ORNITHINE + A 2-OXO ACID = L-GLUTAMATE 5-SEMIALDEHYDE + AN L-AMINO ACID]." /experiment="experimental evidence, no additional details recorded" /note="Rv2322c, (MTCY3G12.12), len: 221 aa. Probable rocD1, ornithine aminotransferase (EC: 2.6.1.13), highly similar to N-terminal region of other ornithine aminotransferases, e.g. Q9FC90|ROCD from Streptomyces coelicolor (407 aa), FASTA scores: opt: 770, E(): 8.7e-40, (55.7% identity in 201 aa overlap); BAB42057|ROCD|SA0818 from Staphylococcus aureus subsp. aureus N315 (396 aa) FASTA scores: opt: 632, E(): 2.2e-31, (46.1% identity in 208 aa overlap); P38021|OAT_BACSU|ROCD from Bacillus subtilis (401 aa), FASTA scores: opt: 626, E(): 5.1e-31, (43.1% identity in 218 aa overlap); etc. BELONGS TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. Rv2322c|MTCY3G12.12 and Rv2321c|MTCY3G12.13 (upstream ORF) appear to be an ornithine aminotransferase homologue but are frameshifted - we can find no sequence error in the cosmid to account for this." /codon_start=1 /transl_table=11 /product="ornithine aminotransferase" /protein_id="NP_216838.1" /db_xref="GI:15609459" /db_xref="GOA:P71890" /db_xref="UniProtKB/TrEMBL:P71890" /db_xref="GeneID:885096" /translation="MTNLADATQATMALVERHAAHNYSPLPVVAASAEGAWIADIDGL RYLDWLAAYSAVNLGHRNPASTATAHAQVDTVTLLNRALHADRLGPLGAALAQLCGKD VVLPMNSDAEAVESGLRVARKWGADVNGLPAGRHDIILANNNFHGHTSSVVSFSSDPA AGSGVEPSTPGLRSVPFGDAAAPAQTIDDNTVADLLEPIPGQAGIIVPADDYLPAASS TTC" gene complement(2595361..2596269) /locus_tag="Rv2323c" /db_xref="GeneID:885480" CDS complement(2595361..2596269) /locus_tag="Rv2323c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2323c, (MTCY3G12.11), len: 302 aa. Conserved hypothetical protein, highly similar to others eg Q9FC91|2SCG58.22 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (288 aa), FASTA scores: opt: 561, E(): 7.3e-28, (46.95% identity in 279 aa overlap); P74535|SLL1336 HYPOTHETICAL 78.3 KDA PROTEIN from Synechocystis sp. (705 aa), FASTA scores: opt: 555, E(): 2.1e-27, (37.75% identity in 265 aa overlap); etc. Also similar to various hydrolases e.g. Q53797 BETA-HYDROXYLASE (BLEOMYCIN/PHLEOMYCIN BINDING PROTEIN, ANKYRIN HOMOLOGUE, BLEOMYCIN AND TRANSPORT PROTEIN) from Streptomyces verticillus (326 aa), FASTA scores: opt: 211, E(): 4.5e-06, (26.75% identity in 303 aa overlap); Q9X7M4|DDAH_STRCO|SC5F2A.01c NG,NG-dimethylarginine dimethylaminohydrolase (EC 3.5.3.18) (Dimethylargininase) (Dimethylarginine dimethylaminohydrolase) (258 aa), FASTA scores: opt: 209, E(): 4.9e-06, (27.15% identity in 243 aa overlap); G434715 beta-hydroxylase (bleomicin/phleomycin binding protein) from Streptomyces verticillus (326 aa), FASTA scores: opt: 211, E(): 4.5e-06, (26.75% identity in 303 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216839.1" /db_xref="GI:15609460" /db_xref="UniProtKB/TrEMBL:P71889" /db_xref="GeneID:885480" /translation="MENTQRPSFDCEIRAKYRWFMTDSYVAAARLGSPARRTPRTRRY AMTPPAFFAVAYAINPWMDVTAPVDVQVAQAQWEHLHQTYLRLGHSVDLIEPISGLPD MVYTANGGFIAHDIAVVARFRFPERAGESRAYASWMSSVGYRPVTTRHVNEGQGDLLM VGERVLAGYGFRTDQRAHAEIAAVLGLPVVSLELVDPRFYHLDTALAVLDDHTIAYYP PAFSTAAQEQLSALFPDAIVVGSADAFVFGLNAVSDGLNVVLPVAAMGFAAQLRAAGF EPVGVDLSELLKGGGSVKCCTLEIHP" gene 2596334..2596780 /locus_tag="Rv2324" /db_xref="GeneID:885060" CDS 2596334..2596780 /locus_tag="Rv2324" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2324, (MTCY3G12.10), len: 148 aa. Probable transcriptional regulatory protein, asnC-family, similar to other PUTATIVE ASNC-FAMILY REGULATORY PROTEINS e.g. Q9L101|SCL6.15C from Streptomyces coelicolor (150 aa) FASTA scores: opt: 466, E(): 2.4e-24, (52.8% identity in 142 aa overlap); Q9RKY4|SC6D7.14 PUTATIVE ASNC-FAMILY TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (165 aa), FASTA scores: opt: 266, E(): 5.5e-11, (32.4% identity in 145 aa overlap); Q9ZEP1|LRPA|SCE94.12c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (150 aa), FASTA scores: opt: 249, E(): 6.9e-10, (33.35% identity in 147 aa overlap); etc. Also similar to P96896|Rv3291c|MTCY71.31c from Mycobacterium tuberculosis (150 aa), FASTA scores: opt: 261, E(): 1.1e-10, (36.4% identity in 143 aa overlap)." /codon_start=1 /transl_table=11 /product="AsnC family transcriptional regulator" /protein_id="NP_216840.1" /db_xref="GI:15609461" /db_xref="GOA:P71888" /db_xref="UniProtKB/TrEMBL:P71888" /db_xref="GeneID:885060" /translation="MDRLDDTDERILAELAEHARATFAEIGHKVSLSAPAVKRRVDRM LESGVIKGFTTVVDRNALGWNTEAYVQIFCHGRIAPDQLRAAWVNIPEVVSAATVTGT SDAILHVLAHDMRHLEAALERIRSSADVERSESTVVLSNLIDRMPP" gene complement(2597009..2597857) /locus_tag="Rv2325c" /db_xref="GeneID:886271" CDS complement(2597009..2597857) /locus_tag="Rv2325c" /function="UNKNOWN" /note="Rv2325c, (MTCY3G12.09), len: 282 aa. Conserved hypothetical protein, equivalent to O32970|MLCB22.37c|ML0849 hypothetical protein from Mycobacterium leprae (283 aa), FASTA scores: opt: 1405, E(): 1.8e-78, (77.7% identity in 282 aa overlap). Also some similarity to other proteins e.g. Q9Z9J1|YBAF|BH0166 YBAF PROTEIN (BH0166 PROTEIN) (HYPOTHETICAL PROTEIN) from Bacillus halodurans (265 aa), FASTA scores: opt: 288, E(): 2.8e-10, (25.8% identity in 264 aa overlap); P70972|YBAF YBAF PROTEIN (HYPOTHETICAL PROTEIN) from Bacillus subtilis (265 aa), FASTA scores: opt: 259, E(): 1.5e-08, (25.45% identity in 224 aa overlap); AAK34821|SPY2193|Q99X13 Conserved hypothetical protein from Streptococcus pyogenes (266 aa), FASTA scores: opt: 232, E(): 6.5e-07, (25.1% identity in 267 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216841.1" /db_xref="GI:15609462" /db_xref="GOA:P64997" /db_xref="UniProtKB/Swiss-Prot:P64997" /db_xref="GeneID:886271" /translation="MTTTSAPARNGTRRPSRPIVLLIPVPGSSVIHDLWAGTKLLVVF GISVLLTFYPGWVTIGMMAALVLAAARIAHIPRGALPSVPRWLWIVLAIGFLTAALAG GTPVVAVGGVQLGLGGALHFLRITALSVVLLALGAMVSWTTNVAEISPAVATLGRPFR VLRIPVDEWAVALALALRAFPMLIDEFQVLYAARRLRPKRMPPSRKARRQRHARELID LLAAAITVTLRRADEMGDAITARGGTGQLSAHPGRPKLADWVTLAITAMASGTAVAIE SLILHS" gene complement(2597854..2599947) /locus_tag="Rv2326c" /db_xref="GeneID:888184" CDS complement(2597854..2599947) /locus_tag="Rv2326c" /function="PROBABLY INVOLVED IN ACTIVE TRANSPORT ACCROSS THE MEMBRANE. THOUGHT TO BE RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM AND THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2326c, (MTC3G12.08), len: 697 aa. Possible transmembrane ATP-binding protein ABC transporter (see citation below). Equivalent to Q9CCF9|ML0848 ABC TRANSPORTER from Mycobacterium leprae (724 aa), FASTA scores: opt: 3482, E(): 2.8e-182, (76.9% identity in 697 aa overlap) and also to O32971|MLCB22.38c ABC-TYPE TRANSPORTER from Mycobacterium leprae (726 aa), FASTA scores: opt: 3482, E(): 2.8e-182, (76.9% identity in 697 aa overlap). Similar in part to other ABC TRANSPORTERS e.g. Q9WY65|TM0222 from Thermotoga maritima (266 aa), FASTA scores: opt: 407, E(): 4.2e-15, (38.0% identity in 213 aa overlap); etc. Contains 2 X PS00017 ATP/GTP-binding site motif A (P-loop); and 2 x PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="transmembrane ATP-binding protein ABC transorter" /protein_id="NP_216842.1" /db_xref="GI:15609463" /db_xref="GOA:P63399" /db_xref="UniProtKB/Swiss-Prot:P63399" /db_xref="GeneID:888184" /translation="MCCAVCGPEPGRIGEVTPLGPCPAQHRGGPLRPSELAQASVMAA LCAVTAIISVVVPFAAGLALLGTVPTGLLAYRYRLRVLAAATVAAGMIAFLIAGLGGF MGVVHSAYIGGLTGIVKRRGRGTPTVVVSSLIGGFVFGAAMVGMLAAMVRLRHLIFKV MTANVDGIAATLARMHMQGAAADVKRYFAEGLQYWPWVLLGYFNIGIMIVSLIGWWAL SRLLERMRGIPDVHKLDPPPGDDVDALIGPVPVRLDKVRFRYPRAGQDALREVSLDVR AGEHLAIIGANGSGKTTLMLILAGRAPTSGTVDRPGTVGLGKLGGTAVVLQHPESQVL GTRVADDVVWGLPLGTTADVGRLLSEVGLEALAERDTGSLSGGELQRLALAAALAREP AMLIADEVTTMVDQQGRDALLAVLSGLTQRHRTALVHITHYDNEADSADRTLSLSDSP DNTDMVHTAAMPAPVIGVDQPQHAPALELVGVGHEYASGTPWAKTALRDINFVVEQGD GVLIHGGNGSGKSTLAWIMAGLTIPTTGACLLDGRPTHEQVGAVALSFQAARLQLMRS RVDLEVASAAGFSASEQDRVAAALTVVGLDPALGARRIDQLSGGQMRRVVLAGLLARA PRALILDEPLAGLDAASQRGLLRLLEDLRRARGLTVVVVSHDFAGMEELCPRTLHLRD GVLESAAASEAGGMS" misc_feature complement(2598085..2598129) /locus_tag="Rv2326c" /note="PS00211 ABC transporters family signature" misc_feature complement(2598385..2598408) /locus_tag="Rv2326c" /note="PS00017 ATP/GTP-binding site motif A" misc_feature complement(2598784..2598828) /locus_tag="Rv2326c" /note="PS00211 ABC transporters family signature" misc_feature complement(2599072..2599095) /locus_tag="Rv2326c" /note="PS00017 ATP/GTP-binding site motif A" gene 2599988..2600479 /locus_tag="Rv2327" /db_xref="GeneID:888124" CDS 2599988..2600479 /locus_tag="Rv2327" /function="UNKNOWN" /note="Rv2327, (MTCY3G12.07c), len: 163 aa. Conserved hypothetical protein, similar to Z80775|MTCY21D4.05c|Rv0042c from Mycobacterium tuberculosis (208 aa), FASTA scores: opt: 242, E(): 5e-08, (43.0% identity in 107 aa overlap). Also slight similarity to putative transcriptional regulatory proteins belonging to the MARR-FAMILY e.g. Q9CCY2/ML2696 from Mycobacterium leprae (243 aa), FASTA scores: opt: 245, E(): 3.7e-08, (35.35% identity in 150 aa overlap); Q9L135|SC6D11.20 from Streptomyces coelicolor (155 aa), FASTA scores: opt: 242, E(): 3.9e-08, (34.75% identity in 141 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216843.1" /db_xref="GI:15609464" /db_xref="GOA:P71885" /db_xref="UniProtKB/TrEMBL:P71885" /db_xref="GeneID:888124" /translation="MSPSPAAANRSEVGGPLPGLGADLLAVVARLNRLATQRIQMPLP AAQARLLATIEAQGEARIGDLAAVDHCSQPTMTTQVRRLEDAGLVTRTADPGDARAVR IRITPEGIRTLTAVRADRAAAIEPQLALLPPADRRVLADAVDVLRRLLDHAATTPGRA TRQ" gene 2600731..2601879 /gene="PE23" /locus_tag="Rv2328" /db_xref="GeneID:888111" CDS 2600731..2601879 /gene="PE23" /locus_tag="Rv2328" /function="UNKNOWN" /note="Rv2328, (MTCY3G12.06), len: 382 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), similar to others e.g. Q9L8K5|MAG24-1 PE-PGRS HOMOLOG from Mycobacterium marinum (638 aa), FASTA scores: opt: 495, E(): 6.6e-18, (34.65% identity in 401 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177867.1" /db_xref="GI:57116969" /db_xref="GOA:P71884" /db_xref="UniProtKB/Swiss-Prot:P71884" /db_xref="GeneID:888111" /translation="MQFLSVIPEQVESAAQDLAGIRSALSASYAAAAGPTTAVVSAAE DEVSTAIASIFGAYGRQCQVLSAQASAFHDEFVNLLKTGATAYRNTEFANAQSNVLNA VNAPARSLLGHPSAAESVQNSAPTLGGGHSTVTAGLAAQAGRAVATVEQQAAAAVAPL PSAGAGLAQVVNGVVTAGQGSAAKLATALQSAAPWLAKSGGEFIVAGQSALTGVALLQ PAVVGVVQAGGTFLTAGTSAATGLGLLTLAGVEFSQGVGNLALASGTAATGLGLLGSA GVQLFSPAFLLAVPTALGGVGSLAIAVVQLVQGVQHLSLVVPNVVAGIAALQTAGAQF AQGVNHTMLAAQLGAPGIAVLQTAGGHFAQGIGHLTTAGNAAVTVLIS" gene complement(2601914..2603461) /gene="narK1" /locus_tag="Rv2329c" /db_xref="GeneID:888116" CDS complement(2601914..2603461) /gene="narK1" /locus_tag="Rv2329c" /function="INVOLVED IN EXCRETION OF NITRITE PRODUCED BY THE DISSIMILATORY REDUCTION OF NITRATE." /note="Rv2329c, (MTCY3G12.05), len: 515 aa. Probable narK1, nitrite extrusion protein, possibly member of major facilitator superfamily (MFS). Equivalent to O32974|MLCB22.41c|NARK|ML0844 PUTATIVE NITRITE EXTRUSION PROTEIN from Mycobacterium leprae (517 aa), FASTA scores: opt: 2224, E(): 1.9e-129, (69.3% identity in 488 aa overlap). Also highly similar to others e.g. P94933 NITRITE EXTRUSION PROTEIN from Mycobacterium fortuitum (471 aa), FASTA scores: opt: 1969, E(): 8.6e-114, (62.1% identity in 459 aa overlap); P37758|NARU_ECOLI NITRITE EXTRUSION PROTEIN 2 from Escherichia coli strain K12 (462 aa), FASTA scores: opt: 792, E(): 2.3e-41, (36.95% identity in 476 aa overlap); P10903|NARK_ECOLI nitrite extrusion protein (nitrite facilitator 1) from Escherichia coli strain K12 (463 aa), FASTA scores: opt: 784, E(): 7e-41, (35.3% identity in 468 aa overlap); etc. Also similar to RV0261c|Z86089|MTCY6A4_5 from Mycobacterium tuberculosis (469 aa), FASTA scores: opt: 2000, E(): 1.1e-115, (62.6% identity in 470 aa overlap). BELONGS TO THE NARK/NASA FAMILY OF TRANSPORTERS." /codon_start=1 /transl_table=11 /product="nitrite extrusion protein 1 NarK1" /protein_id="NP_216845.1" /db_xref="GI:15609466" /db_xref="UniProtKB/TrEMBL:P71883" /db_xref="GeneID:888116" /translation="MEQHTLLQREESPRSPAAPSLRRLGGSRHITHWDPEDLGAWEAG NKGIARRNLLWSVVTVHLGYSVWTLWPVLELLMPQDVYGFSTSDKFLLGTIATLFGAF LRMPYALASAIFGGRNWATFSAIVLLIPAIGTTVLLTHPGLPLWPYLVCAALTGLGGG NFASSMSNANAFYPHRLKGSALGIAGGVGNLGVPAIQLVGLLAIATVGERKPYLVCAL YVVLVAIAVIGVSLFMNNVEQHRVQVNRLRPIVSAVLSTRDTWLLSLLYLGTFGSFIG FSFVFGQVLQTNFLACGQSPARATLHAVELAFVGPLLAAVARIYGGRLADRVGGSRLT LIVFVAMTLAAGLLISASTLEGRHVGQHRGATMVGYFVCFVALFVLSGLGNGSVYKMI PTIFEACSRSLDLSEAERRDWSRIISGVVIGFVAAFGALGGVGINMALRESYLSTGSG TDAFWIFMMCYAAAAVLTWKVYDRRTVTDMGMLQAALVRQPASTPAELIGPRTQSDRF SGCSISA" gene complement(2603695..2604222) /gene="lppP" /locus_tag="Rv2330c" /db_xref="GeneID:888072" CDS complement(2603695..2604222) /gene="lppP" /locus_tag="Rv2330c" /function="UNKNOWN" /note="Rv2330c, (MTCY3G12.04), len: 175 aa. Probable lppP, lipoprotein. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LppP" /protein_id="NP_216846.1" /db_xref="GI:15609467" /db_xref="GOA:P65302" /db_xref="UniProtKB/Swiss-Prot:P65302" /db_xref="GeneID:888072" /translation="MRRQRSAVPILALLALLALLALIVGLGASGCAWKPPTTRPSPPN TCKDSDGPTADTVRQAIAAVPIVVPGSKWVEITRGHTRNCRLHWVQIIPTIASQSTPQ QLLFFDRNIPLGSPTRNPKPYITVLPAGDDTVTVQYQWQIGSDQECCPTGIGTVRFHI GSDGKLEALGSIPHQ" gene 2604297..2604683 /locus_tag="Rv2331" /db_xref="GeneID:888099" CDS 2604297..2604683 /locus_tag="Rv2331" /function="UNKNOWN" /note="Rv2331, (MT2393, MTCY3G12.03c), len: 128 aa. Hypothetical unknown protein; shortened version of MTCY3G12.03c to eliminate overlap with MTCY3G12.04." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216847.1" /db_xref="GI:15609468" /db_xref="UniProtKB/Swiss-Prot:P71881" /db_xref="GeneID:888099" /translation="MPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIG QPNTMGDEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLH SISFIFKGRPGEVYLTVTGYNFRGVP" gene 2604740..2605078 /locus_tag="Rv2331A" /db_xref="GeneID:3205048" CDS 2604740..2605078 /locus_tag="Rv2331A" /function="UNKNOWN" /note="Rv2331A, len: 112 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177669.1" /db_xref="GI:57116970" /db_xref="UniProtKB/TrEMBL:Q79FF7" /db_xref="GeneID:3205048" /translation="MKGHLATFGHPALPTYRGSWLSREPGSPYRLPAGAGRDRGDACR RIPRRTGSGTLLRPGQRCTFAANADPMAKGVDRALCEIVAERRQLDLDLAKAQVRSAL ANQRYHRDVH" gene 2605108..2606754 /gene="mez" /locus_tag="Rv2332" /db_xref="GeneID:887962" CDS 2605108..2606754 /gene="mez" /locus_tag="Rv2332" /EC_number="1.1.1.38" /function="CATALIZES THE OXIDATIVE DECARBOXYLATION OF MALATE INTO PYRUVATE, IMPORTANT FOR A WIDE RANGE OF METABOLIC PATHWAYS [CATALYTIC ACTIVITY: (S)-MALATE + NAD(+) = PYRUVATE + CO(2) + NADH]." /note="malic enzyme; oxaloacetate-decarboxylating; NAD-dependent; catalyzes the formation of pyruvate form malate" /codon_start=1 /transl_table=11 /product="malate dehydrogenase" /protein_id="NP_216848.2" /db_xref="GI:57116971" /db_xref="GOA:P71880" /db_xref="UniProtKB/Swiss-Prot:P71880" /db_xref="GeneID:887962" /translation="MSDARVPRIPAALSAPSLNRGVGFTHAQRRRLGLTGRLPSAVLT LDQQAERVWHQLQSLATELGRNLLLEQLHYRHEVLYFKVLADHLPELMPVVYTPTVGE AIQRFSDEYRGQRGLFLSIDEPDEIEEAFNTLGLGPEDVDLIVCTDAEAILGIGDWGV GGIQIAVGKLALYTAGGGVDPRRCLAVSLDVGTDNEQLLADPFYLGNRHARRRGREYD EFVSRYIETAQRLFPRAILHFEDFGPANARKILDTYGTDYCVFNDDMQGTGAVVLAAV YSGLKVTGIPLRDQTIVVFGAGTAGMGIADQIRDAMVADGATLEQAVSQIWPIDRPGL LFDDMDDLRDFQVPYAKNRHQLGVAVGDRVGLSDAIKIASPTILLGCSTVYGAFTKEV VEAMTASCKHPMIFPLSNPTSRMEAIPADVLAWSNGRALLATGSPVAPVEFDETTYVI GQANNVLAFPGIGLGVIVAGARLITRRMLHAAAKAIAHQANPTNPGDSLLPDVQNLRA ISTTVAEAVYRAAVQDGVASRTHDDVRQAIVDTMWLPAYD" gene complement(2606708..2608321) /locus_tag="Rv2333c" /db_xref="GeneID:887274" CDS complement(2606708..2608321) /locus_tag="Rv2333c" /function="THOUGHT TO BE INVOLVED IN A TRANSPORT SYSTEM ACROSS THE MEMBRANE (PERHAPS DRUG TRANSPORT): RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2333c, (MTCY3G12.01), len: 537 aa. Probable conserved integral membrane transport protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug, highly similar to many e.g. Q9RL22|C5G9.04c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (489 aa), FASTA scores: opt: 1031, E(): 4e-55, (37.4% identity in 412 aa overlap); Q9L0L9|SCD82.12 PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (490 aa), FASTA scores: opt: 883, E(): 3.8e-46, (36.35% identity in 407 aa overlap); Q9ZBW5|SC4B5.03c PUTATIVE INTEGRAL MEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (504 aa), FASTA scores: opt: 899, E(): 4.1e-47, (37.4% identity in 415 aa overlap); P39886|TCMA_STRGA tetracenomycin C resistance and export protein from Streptomyces glaucescens (538 aa), FASTA scores: opt: 839, E(): 1.9e-43, (32.3% identity in 489 aa overlap); etc. Also highly similar to Rv2459|O53186|MTV008.15 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis strain H37Rv (508 aa), FASTA scores: opt: 1385, E(): 1.5e-76, (44.05% identity in 504 aa overlap); and AAK46834|MT2534 DRUG TRANSPORTER from Mycobacterium tuberculosis strain CDC1551 (523 aa), FASTA scores: opt: 1385, E(): 1.5e-76, (44.4% identity in 504 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_216849.1" /db_xref="GI:15609470" /db_xref="GOA:P71879" /db_xref="UniProtKB/TrEMBL:P71879" /db_xref="GeneID:887274" /translation="MNRTQLLTLIATGLGLFMIFLDALIVNVALPDIQRSFAVGEDGL QWVVASYSLGMAVFIMSAATLADLDGRRRWYLIGVSLFTLGSIACGLAPSIAVLTTAR GAQGLGAAAVSVTSLALVSAAFPEAKEKARAIGIWTAIASIGTTTGPTLGGLLVDQWG WRSIFYVNLPMGALVLFLTLCYVEESCNERARRFDLSGQLLFIVAVGALVYAVIEGPQ IGWTSVQTIVMLWTAAVGCALFVWLERRSSNPMMDLTLFRDTSYALAIATICTVFFAV YGMLLLTTQFLQNVRGYTPSVTGLMILPFSAAVAIVSPLVGHLVGRIGARVPILAGLC MLMLGLLMLIFSEHRSSALVLVGLGLCGSGVALCLTPITTVAMTAVPAERAGMASGIM SAQRAIGSTIGFAVLGSVLAAWLSATLEPHLERAVPDPVQRHVLAEIIIDSANPRAHV GGIVPRRHIEHRDPVAIAEEDFIEGIRVALLVATATLAVVFLAGWRWFPRDVHTAGSD LSERLPTAMTVECAVSHMPGATWCRLWPA" gene 2608796..2609728 /gene="cysK1" /locus_tag="Rv2334" /db_xref="GeneID:886016" CDS 2608796..2609728 /gene="cysK1" /locus_tag="Rv2334" /EC_number="2.5.1.47" /function="INVOLVED IN CYSTEINE BIOSYNTHESIS [CATALYTIC ACTIVITY: O3-ACETYL-L-SERINE + H(2)S = L-CYSTEINE + ACETATE]." /note="Rv2334, (MT2397, MTCY98.03), len: 310 aa. Probable cysK1, cysteine synthase A (EC 4.2.99.8), equivalent to O32978|CYSK_MYCLE|ML0839|MLCB22.47 CYSTEINE SYNTHASE A from Mycobacterium leprae (310 aa), FASTA scores: opt: 1756, E(): 8.6e-96, (85.8% identity in 310 aa overlap). Also highly similar to other CYSTEINE SYNTHASES e.g. Q9JQL6|CYSK|NMA0974|NMB0763 PUTATIVE CYSTEINE SYNTHASE from Neisseria meningitidis (serogroup A and B) (310 aa), FASTA scores: opt: 1368, E(): 4.6e-73, (66.45% identity in 310 aa overlap); P73410|CYSK_SYNY3|SLR1842 from Synechocystis sp (312 aa), FASTA scores: opt: 1310, E(): 1.2e-69, (64.65% identity in 311 aa overlap); Q43725|CYSM_ARATH|OASC|ACS1|AT3G59760|F24G16.30 CYSTEINE SYNTHASE (MITOCHONDRIAL PRECURSOR) from Arabidopsis thaliana (Mouse-ear cress) (424 aa), FASTA scores: opt: 1253, E(): 3.2e-66, (59.2% identity in 309 aa overlap) (has its N-terminus longer 104 aa); etc. Contains PS00901 Cysteine synthase/cystathionine beta-synthase P-phosphate attachment site. BELONGS TO THE CYSTEINE SYNTHASE/CYSTATHIONINE BETA-SYNTHASE FAMILY. Note that previously known as cysK.; cysK" /codon_start=1 /transl_table=11 /product="cysteine synthase A CysK1" /protein_id="YP_177868.1" /db_xref="GI:57116972" /db_xref="GOA:P95230" /db_xref="UniProtKB/Swiss-Prot:P95230" /db_xref="GeneID:886016" /translation="MSIAEDITQLIGRTPLVRLRRVTDGAVADIVAKLEFFNPANSVK DRIGVAMLQAAEQAGLIKPDTIILEPTSGNTGIALAMVCAARGYRCVLTMPETMSLER RMLLRAYGAELILTPGADGMSGAIAKAEELAKTDQRYFVPQQFENPANPAIHRVTTAE EVWRDTDGKVDIVVAGVGTGGTITGVAQVIKERKPSARFVAVEPAASPVLSGGQKGPH PIQGIGAGFVPPVLDQDLVDEIITVGNEDALNVARRLAREEGLLVGISSGAATVAALQ VARRPENAGKLIVVVLPDFGERYLSTPLFADVAD" misc_feature 2608892..2608948 /gene="cysK1" /locus_tag="Rv2334" /note="PS00901 Cysteine synthase/cystathionine beta-synthase P-phosphate attachment site" gene 2609732..2610421 /gene="cysE" /locus_tag="Rv2335" /db_xref="GeneID:886012" CDS 2609732..2610421 /gene="cysE" /locus_tag="Rv2335" /EC_number="2.3.1.30" /function="INVOLVED IN CYSTEINE BIOSYNTHESIS [CATALYTIC ACTIVITY: ACETYL-CoA + L-SERINE = CoA + O-ACETYL-L-SERINE]." /note="Rv2335, (MTCY98.04), len: 229 aa. Probable cysE, serine acetyltransferase (EC 2.3.1.30), equivalent to O32979|CYSE|ML0838 SERINE ACETYLTRANSFERASE from Mycobacterium leprae (227 aa), FASTA scores: opt: 1152, E(): 9.6e-62, (76.4% identity in 229 aa overlap). Also highly similar, except in C-terminal part, to others e.g. Q9HXI6|CYSE|PA3816 O-ACETYLSERINE SYNTHASE from Pseudomonas aeruginosa (258 aa), FASTA scores: opt: 737, E(): 6e-37, (61.3% identity in 168 aa overlap); P23145|NIFP_AZOCH PROBABLE SERINE ACETYLTRANSFERASE from Azotobacter chroococcum mcd 1 (269 aa), FASTA scores: opt: 718, E(): 8.4e-36, (55.45% identity in 220 aa overlap); Q06750|CYSE_BACSU SERINE ACETYLTRANSFERASE from Bacillus subtilis (217 aa), FASTA scores: opt: 640, E(): 3.1e-31, (48.0% identity in 200 aa overlap); etc. Contains PS00101 Bacterial hexapeptide-repeat containing-transferases signature. BELONGS TO THE CYSE/LACA/LPXA/NODL FAMILY OF ACETYLTRANSFERASES. COMPOSED OF MULTIPLE REPEATS OF [LIV]-G-X(4)." /codon_start=1 /transl_table=11 /product="serine acetyltransferase CysE" /protein_id="NP_216851.1" /db_xref="GI:15609472" /db_xref="GOA:P95231" /db_xref="UniProtKB/TrEMBL:P95231" /db_xref="GeneID:886012" /translation="MLTAMRGDIRAARERDPAAPTALEVIFCYPGVHAVWGHRLAHWL WQRGARLLARAAAEFTRILTGVDIHPGAVIGARVFIDHATGVVIGETAEVGDDVTIYH GVTLGGSGMVGGKRHPTVGDRVIIGAGAKVLGPIKIGEDSRIGANAVVVKPVPPSAVV VGVPGQVIGQSQPSPGGPFDWRLPDLVGASLDSLLTRVARLEALGGGPQAAGVIRPPE AGIWHGEDFSI" misc_feature 2610107..2610193 /gene="cysE" /locus_tag="Rv2335" /note="PS00101 Bacterial hexapeptide-repeat containing-transferases signature" gene 2610837..2611805 /locus_tag="Rv2336" /db_xref="GeneID:888958" CDS 2610837..2611805 /locus_tag="Rv2336" /function="UNKNOWN. MAY BE INVOLVED IN VIRULENCE." /experiment="experimental evidence, no additional details recorded" /note="Rv2336, (MTCY98.05), len: 322 aa. Hypothetical unknown protein (see Rindi et al., 2001)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216852.1" /db_xref="GI:15609473" /db_xref="UniProtKB/TrEMBL:P95232" /db_xref="GeneID:888958" /translation="MDVPHEQPALSSSKSNRFTSQRQTTGVGTTTVERLEPRLSPASR HITEAKAFGTECHVSSFTREQDPDRAVRVEQIHGEAYVAAGHVYESALDELGRLDNSN AEFILDKARGSTRETEVIYLHAVPAEPLSGSQGEGGLRIVGISAVGSIDDLSAFKAAK PSMGLAHQRKLYDAIEDLGHGGVKEIAALSVTADAPPTVSYSLIREVLRLYHRTGEKL IITFAMPAYAKMVMNFGRFAMPQVGEPFYAHRNNDPRTSNDLLLVPSIVEPSNFLENI SRGVVTADDGPTARRRFATLCYMTDGLDDYFMPLTRQVLSEGIQDI" gene complement(2611869..2612987) /locus_tag="Rv2337c" /db_xref="GeneID:885123" CDS complement(2611869..2612987) /locus_tag="Rv2337c" /function="UNKNOWN" /note="Rv2337c, (MTCY98.06c), len: 372 aa. Hypothetical unknown protein, sharing some similarity with Q9RI33|SCJ12.27c HYPOTHETICAL 37.2 KDA PROTEIN from Streptomyces coelicolor (335 aa), BLAST scores: 134 AND 46, (28% AND 33% identity, 52% AND 44% positive); FASTA scores: opt: 176, E(): 0.00042, (31.95% identity in 355 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216853.1" /db_xref="GI:15609474" /db_xref="UniProtKB/TrEMBL:P95233" /db_xref="GeneID:885123" /translation="MRAGRWGPGMTGLDPAEFLSLVEAAALAPSADNRREVQLEHAGR RVRLWGDQTWRSAPEHRRIMSLVAIGAAVENVKLRAGRLGFETKVCWFPDSGNPGLVA EIDVDRLPQTRVDPIEGAIERRRTNRRVRFRGPPLSQGELGALSAEATGIDGIQLHWF DSPETRKQILRLVRLAETERFRSRELHEELFSAVRFDIGWTASSDDGLPPGSLEVEAW MRPMFRGLRHWRVLRLLRTVGMHHALGLRAAYLPCRLAPHVGALTTSLDLASGALTAG AVFERIWLRTTLLGAELQPFAASAVLSLPACEWVAPHVRAALVGGWNLLAPGHWPMMV FRIGHARAPSVRTMRQSVEAYCYAPAERSGSDSESRFA" gene complement(2613107..2614063) /gene="moeW" /locus_tag="Rv2338c" /db_xref="GeneID:886018" CDS complement(2613107..2614063) /gene="moeW" /locus_tag="Rv2338c" /function="INVOLVED IN MOLYBDOPTENUM COFACTOR BIOSYNTHESIS; THOUGHT TO BE INVOLVED IN THE BIOSYNTHESIS OF A DEMOLYBDO-COFACTOR (MOLYBDOPTERIN), NECESSARY FOR MOLYBDO-ENZYMES (BY SIMILARITY)." /note="Rv2338c, (MTCY98.07c), len: 318 aa. Possible moeW, molybdoptenum biosynthesis protein, showing some similarity to several molybdopterin biosynthesis proteins e.g. O27613|MTH1571 MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEB HOMOLOG from Methanobacterium thermoautotrophicum (251 aa), FASTA scores: opt: 309, E(): 4.7e-14; (30.7% identity in 254 aa overlap); Q9KPQ5|VC2311 HESA/MOEB/THIF FAMILY PROTEIN from Vibrio cholerae (273 aa), FASTA scores: opt: 255, E(): 4e-09, (36.25% identity in 149 aa overlap); Q9PD34|XF1545 MOLYBDOPTERIN BIOSYNTHESIS PROTEIN from Xylella fastidiosa (276 aa), FASTA scores: opt: 233,E(): 1e-07, (33.6% identity in 128 aa overlap); etc. SEEMS TO BELONG TO THE HESA/MOEB/THIF FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216854.1" /db_xref="GI:15609475" /db_xref="GOA:P95234" /db_xref="UniProtKB/TrEMBL:P95234" /db_xref="GeneID:886018" /translation="MRAGADAPDSGRVKESAPWSYDEAFCRNLGLISPTEQQRLRNSR VAIAGMGGVGGIDMVALARMGIGKFTIADPDVFEIRNSNRQYGAMRSTNGQAKAEVMR NIVHDINPEAEIRAFCEPIGKENAATFLEGADVLVDGIDAFEIDLRRLLYREAQQRGI YALGAGPLGFSTAWVVFDPKGMTFDRYFDLSDAMNTVDKFVAFIAGIAPSATHRRSID LSYVDIENRTGPSVGLACHLASGVVAAEVLKILLGHGRVYAAPYFHQFDAYRSIYVRK RLRCGNRHPLQRVKRRLLARYINRRSAGVIPGLRYHRTEPSY" gene 2614693..2617581 /gene="mmpL9" /locus_tag="Rv2339" /db_xref="GeneID:888966" CDS 2614693..2617581 /gene="mmpL9" /locus_tag="Rv2339" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /note="Rv2339, (MTCY98.08), len: 962 aa. Probable mmpL9, conserved transmembrane transport protein (see citation below), with strong similarity to other Mycobacterial proteins e.g. P54881|YV34_MYCLE|MML4_MYCLE hypothetical 105.2 kDa protein from Mycobacterium leprae (959 aa), FASTA scores: opt: 3799, E(): 0, (59.3% identity in 937 aa overlap); G699237|U1740AB from Mycobacterium leprae; and MTCY20G9.34; MTCY48.08c; MTCY19G5.06 from Mycobacterium tuberculosis. BELONGS TO THE MMPL FAMILY. TBparse score is 0.956." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL9" /protein_id="NP_216855.1" /db_xref="GI:15609476" /db_xref="GOA:P95235" /db_xref="UniProtKB/Swiss-Prot:P95235" /db_xref="GeneID:888966" /translation="MVPGEVHMSDTPSGPHPIIPRTIRLAAIPILLCWLGFTVFVSVA VPPLEAIGETRAVAVAPDDAQSMRAMRRAGKVFNEFDSNSIAMVVLESDQPLGEKAHR YYDHLVDTLVLDQSHIQHIQDFWRDPLTAAGAVSADGKAAYVQLYLAGNMGEALANES VEAVRKIVANSTPPEGIRTYVTGPAALFADQIAAGDRSMKLITGLTFAVITVLLLLVY RSIATTLLILPMVFIGLGATRGTIAFLGYHGMVGLSTFVVNILTALAIAAGTDYAIFL VGRYQEARHIGQNREASFYTMYRGTANVILGSGLTIAGATYCLSFARLTLFHTMGPPL AIGMLVSVAAALTLAPAIIAIAGRFGLLDPKRRLKTRGWRRVGTAVVRWPGPILATSV ALALVGLLALPGYRPGYNDRYYLRAGTPVNRGYAAADRHFGPARMNPEMLLVESDQDM RNPAGMLVIDKIAKEVLHVSGVERVQAITRPQGVPLEHASIPFQISMMGATQTMSLPY MRERMADMLTMSDEMLVAINSMEQMLDLVQQLNDVTHEMAATTREIKATTSELRDHLA DIDDFVRPLRSYFYWEHHCFDIPLCSATRSLFDTLDGVDTLTDQLRALTDDMNKMEAL TPQFLALLPPMITTMKTMRTMMLTMRSTISGVQDQMADMQDHATAMGQAFDTAKSGDS FYLPPEAFDNAEFQQGMKLFLSPNGKAVRFVISHESDPASTEGIDRIEAIRAATKDAI KATPLQGAKIYIGGTAATYQDIRDGTKYDILIVGIAAVCLVFIVMLMITQSLIASLVI VGTVLLSLGTAFGLSVLIWQHFVGLQVHWTIVAMSVIVLLAVGSDYNLLLVSRFKEEV GAGLKTGIIRAMAGTGAVVTSAGLVFAFTMASMAVSELRVIGQVGTTIGLGLLFDTLV VRSFMTPSIAALLGRWFWWPNMIHSRPTVPEAHTRQGARRIQPHLHRG" gene complement(2617667..2618908) /gene="PE_PGRS39" /locus_tag="Rv2340c" /db_xref="GeneID:888961" CDS complement(2617667..2618908) /gene="PE_PGRS39" /locus_tag="Rv2340c" /function="UNKNOWN" /note="Rv2340c, (MTCY98.09c), len: 413 aa. Member of the Mycobacterium tuberculosis PE_family, PGRS subfamily of gly-rich proteins (see citations below), similar to others eg YI18_MYCTU|Q50615|Rv1818c|MTCY1A11.25 PE-PGRS FAMILY PROTEIN from Mycobacterium tuberculosis (498 aa), FASTA scores: opt: 710, E(): 1.4e-22, (41.0% identity in 368 aa overlap); O53884|Rv0872v|MTV043.65c PGRS-FAMILY PROTEIN from Mycobacterium tuberculosis (606 aa), FASTA scores: opt: 708, E(): 1.9e-22, (42.4% identity in 389 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177869.1" /db_xref="GI:57116973" /db_xref="UniProtKB/TrEMBL:Q7D7A7" /db_xref="GeneID:888961" /translation="MSHVTAAPNVLAASAGELAAIGSTMRAANAAAAAPTAGVLAAGG DDVSAGIAALFGARAQAYQAISAQAALFHDRFVQILQEGAAAYAMAEAANALPLQKAQ GVVSELAQDRTGGTGTGQSRGAGGFGGVGQAGGKGWDGGPIGNGQVGEQHGAGQLGST DGNPGVAGAAHGSGVSASHGSGATGAAGVADPGGSGAGVGSAAGNGTGAGSADAVGGA GTGRDIVGSVRGDGGVGMASGDGGLSTGAAGASAEGGLMPGFGGAPWVGGHWGLGGEG HSGAIGGVGEQVAPAVATAPAVSPATTSAVAAESGSTPATKAQAMHATTNPGNAAHQG NPADPGNSARRADGGRDEQLLLLPLTSLRGLRHTLKKLSGLRARNGLLTASGDNASGS GRPWDRDQLLRALGLRPPGHE" gene complement(2619407..2619479) /locus_tag="Rvnt25" /note="tRNA-Asn(GTT)" /db_xref="GeneID:2700449" tRNA complement(2619407..2619479) /locus_tag="Rvnt25" /product="tRNA-Asn" /note="codon recognized: AAC" /anticodon=(pos:2619444..2619446,aa:Asn) /db_xref="GeneID:2700449" gene 2619597..2620016 /gene="lppQ" /locus_tag="Rv2341" /db_xref="GeneID:886275" CDS 2619597..2620016 /gene="lppQ" /locus_tag="Rv2341" /function="UNKNOWN" /note="Rv2341, (MTCY98.10), len: 139 aa. Probable lppQ, conserved lipoprotein, showing some similarity with Rv1228|O33224|LPQX|MTCI61.11 from Mycobacterium tuberculosis (185 aa), FASTA scores: opt: 155; E(): 0.0073; (31.9% identity in 116 aa overlap). Also shows few similarity with P29228|VLPA_MYCHR variant surface antigen A precursor from Mycoplasma hyorhinis (157 aa), FASTA scores: opt: 96, E(): 7.3, (23.1% identity in 143 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="lipoprotein LppQ" /protein_id="NP_216857.1" /db_xref="GI:15609478" /db_xref="UniProtKB/TrEMBL:P95237" /db_xref="GeneID:886275" /translation="MPVGGRQHVFEKLASILGLVAAPLMLLGLSACGRSAGKTSEPTC PTEPIDAADSSTTPDPSCVVRATEINGNGSRIQTWTGSYDAAATQSGGVCGGTCNFHA TVRFTVDEGQISGSVDQVYQAAMVAIATRPTSPSLAP" misc_feature 2619660..2619692 /gene="lppQ" /locus_tag="Rv2341" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2620272..2620529 /locus_tag="Rv2342" /db_xref="GeneID:888951" CDS 2620272..2620529 /locus_tag="Rv2342" /function="UNKNOWN" /note="Rv2342, (MTCY98.11), len: 85 aa. Conserved hypothetical protein, highly similar to Q9CCG1|ML0834 HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 392, E(): 2.9e-20, (78.2% identity in 78 aa overlap). N-terminus highly similar to N-terminal part of Q9L085|SCC24.32 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (108 aa), FASTA scores: opt: 122, E(): 0.077, (39.15% identity in 46 aa overlap). TBparse score is 0.887." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216858.1" /db_xref="GI:15609479" /db_xref="UniProtKB/TrEMBL:P95238" /db_xref="GeneID:888951" /translation="MIGYVAVLGLGYVLGAKAGRRRYEQIASTYRALTGSPVARSMIE GGRRKIANRISPDAGFVTLAEIDNQTAVVQRGVERQPKTAR" gene complement(2620533..2622452) /gene="dnaG" /locus_tag="Rv2343c" /db_xref="GeneID:885996" CDS complement(2620533..2622452) /gene="dnaG" /locus_tag="Rv2343c" /EC_number="2.7.7.-" /function="DNA PRIMASE IS THE POLYMERASE THAT SYNTHESIZES SMALL RNA PRIMERS FOR THE OKAZAKI FRAGMENTS ON BOTH TEMPLATE STRANDS REPLICATION FORKS DURING CHROMOSOMAL DNA SYNTHESIS." /note="synthesizes RNA primers at the replication forks" /codon_start=1 /transl_table=11 /product="DNA primase" /protein_id="NP_216859.1" /db_xref="GI:15609480" /db_xref="GOA:P63962" /db_xref="UniProtKB/Swiss-Prot:P63962" /db_xref="GeneID:885996" /translation="MSGRISDRDIAAIREGARIEDVVGDYVQLRRAGADSLKGLCPFH NEKSPSFHVRPNHGHFHCFGCGEGGDVYAFIQKIEHVSFVEAVELLADRIGHTISYTG AATSVQRDRGSRSRLLAANAAAAAFYAQALQSDEAAPARQYLTERSFDAAAARKFGCG FAPSGWDSLTKHLQRKGFEFEELEAAGLSRQGRHGPMDRFHRRLLWPIRTSAGEVVGF GARRLFDDDAMEAKYVNTPETLLYKKSSVMFGIDLAKRDIAKGHQAVVVEGYTDVMAM HLAGVTTAVASCGTAFGGEHLAMLRRLMMDDSFFRGELIYVFDGDEAGRAAALKAFDG EQKLAGQSFVAVAPDGMDPCDLRLKCGDAALRDLVARRTPLFEFAIRAAIAEMDLDSA EGRVAALRRCVPMVGQIKDPTLRDEYARQLAGWVGWADVAQVIGRVRGEAKRTKHPRL GRLGSTTIARAAQRPTAGPPTELAVRPDPRDPTLWPQREALKSALQYPALAGPVFDAL TVEGFTHPEYAAVRAAIDTAGGTSAGLSGAQWLDMVRQQTTSTVTSALISELGVEAIQ VDDDKLPRYIAGVLARLQEVWLGRQIAEVKSKLQRMSPIEQGDEYHALFGDLVAMEAY RRSLLEQASGDDLTA" gene complement(2622457..2623752) /gene="dgt" /locus_tag="Rv2344c" /db_xref="GeneID:885421" CDS complement(2622457..2623752) /gene="dgt" /locus_tag="Rv2344c" /EC_number="3.1.5.1" /function="DGTPASE PREFERENTIALLY HYDROLYZES DGTP OVER THE OTHER CANONICAL NTPS [CATALYTIC ACTIVITY: DGTP + H(2)O = DEOXYGUANOSINE + TRIPHOSPHATE]." /note="dGTPase family type 2 subfamily; presumably hydrolyzes dGTP to deoxyguanosine and triphosphate" /codon_start=1 /transl_table=11 /product="deoxyguanosinetriphosphate triphosphohydrolase-like protein" /protein_id="NP_216860.1" /db_xref="GI:15609481" /db_xref="GOA:P95240" /db_xref="UniProtKB/Swiss-Prot:P95240" /db_xref="GeneID:885421" /translation="MSASEHDPYDDFDRQRRVAEAPKTAGLPGTEGQYRSDFARDRAR VLHSAALRRLADKTQVVGPREGDTPRTRLTHSLEVAQIGRGMAIGLGCDLDLVELAGL AHDIGHPPYGHNGERALDEVAASHGGFEGNAQNFRILTSLEPKVVDAQGLSAGLNLTR ASLDAVTKYPWMRGDGLGSQRRKFGFYDDDRESAVWVRQGAPPERACLEAQVMDWADD VAYSVHDVEDGVVSERIDLRVLAAEEDAAALARLGEREFSRVSADELMAAARRLSRLP VVAAVGKYDATLSASVALKRLTSELVGRFASAAIATTRAAAGPGPLVRFRADLQVPDL VRAEVAVLKILALQFIMSDPRHLETQARQRERIHRVAHRLYSGAPQTLDPVYAAAFNT AADDAARLRVVVDQIASYTEGRLERIDADQLGVSRNALD" gene 2623821..2625803 /locus_tag="Rv2345" /db_xref="GeneID:888960" CDS 2623821..2625803 /locus_tag="Rv2345" /function="UNKNOWN" /note="Rv2345, (MTCY98.14), len: 660 aa. Possible conserved transmembrane protein, with hydrophobic stretch at N-terminal end around position 180. Similar to O52198 HYPOTHETICAL 21.2 KDA PROTEIN (FRAGMENT) from Mycobacterium smegmatis (195 aa), FASTA scores: opt: 589, E(): 1.5e-23; (47.2% identity in 195 aa overlap). TBparse score is 0.895." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216861.1" /db_xref="GI:15609482" /db_xref="UniProtKB/TrEMBL:P95241" /db_xref="GeneID:888960" /translation="MRLVRLLGMVLTILAAGLLLGPPAGAQPPFRLSNYVTDNAGVLT SSGRTAVTAAVDRLYADRRIRLWVVYVENFSGQSALNWAQRTTRTSELGNYDALLAVA TTGREYAFLVPSAMPGVSEGQVDNVRRYQIEPALHDGDYSGAAVAAANGLNRSPSSSS RVVLLVTVGIIVIVVAVLLVVMRHRNRRRRADELAAARRVDPTNVMALAAVPLQALDD LSRSMVVDVDNAVRTSTNELALAIEEFGERRTAPFTQAVNNAKAALSQAFTVRQQLDD NTPETPAQRRELLTRVIVSAAHADRELASQTEAFEKLRDLVINAPARLDLLTQQYVEL TTRIGPTQQRLAELHTEFDAAAMTSIAGNVTTATERLAFADRNISAARDLADQAVSGR QAGLVDAVRAAESALGQARALLDAVDSAATDIRHAVASLPAVVADIQTGIKRANQHLQ QAQQPQTGRTGDLIAARDAAARALDRARGAADPLTAFDQLTKVDADLDRLLATLAEEQ ATADRLNRSLEQALFTAESRVRAVSEYIDTRRGSIGPEARTRLAEAKRQLEAAHDRKS SNPTEAIAYANAASTLAAHAQSLANADVQSAQRAYTRRGGNNAGAILGGIIIGDLLSG GTRGGLGGWIPTSFGGSSNAPGSSPDGGFLGGGGRF" gene complement(2625888..2626172) /gene="esxO" /locus_tag="Rv2346c" /db_xref="GeneID:888956" CDS complement(2625888..2626172) /gene="esxO" /locus_tag="Rv2346c" /function="UNKNOWN" /note="Rv2346c, (MT2411, MTCY98.15c), len: 94 aa. esxO, ESAT-6 like protein (see citation below), member of Mycobacterium tuberculosis protein family with O53942|Rv1793|MTV049.15, O05300|Rv1198|MTCI364.10, MTCY15C10.33, P96364|MTCY07H7B.03|Rv1037c|MTCY10G2.12, MTCI364.10, etc. BELONGS TO THE ESAT6 FAMILY.; ES6_6, Mtb9.9E" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXO (ESAT-6 like protein 6)" /protein_id="NP_216862.1" /db_xref="GI:15609483" /db_xref="UniProtKB/Swiss-Prot:P95242" /db_xref="GeneID:888956" /translation="MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIVRDVLAAGDFWGG AGSVACQEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" gene complement(2626223..2626519) /gene="esxP" /locus_tag="Rv2347c" /db_xref="GeneID:886002" CDS complement(2626223..2626519) /gene="esxP" /locus_tag="Rv2347c" /function="UNKNOWN" /note="Rv2347c, (MT2412, MTCY98.16c), len: 98 aa. esxP, ESAT-6 like protein (see citation below). Member of M. tuberculosis hypothetical QILSS protein family with Rv1197, Rv1792, Rv1038c and Rv3620c. BELONGS TO THE ESAT6 FAMILY. TBparse score is 0.896.; ES6_7, QILSS" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXP (ESAT-6 like protein 7)" /protein_id="NP_216863.1" /db_xref="GI:15609484" /db_xref="UniProtKB/Swiss-Prot:P95243" /db_xref="GeneID:886002" /translation="MATRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGMAEATSLDTMAQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" gene complement(2626654..2626980) /locus_tag="Rv2348c" /db_xref="GeneID:886006" CDS complement(2626654..2626980) /locus_tag="Rv2348c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2348c, (MTCY98.17c), len: 108 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216864.1" /db_xref="GI:15609485" /db_xref="UniProtKB/TrEMBL:P95244" /db_xref="GeneID:886006" /translation="MLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDH WAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIP HSPAAG" gene complement(2627172..2628698) /gene="plcC" /locus_tag="Rv2349c" /db_xref="GeneID:886000" CDS complement(2627172..2628698) /gene="plcC" /locus_tag="Rv2349c" /EC_number="3.1.4.3" /function="HYDROLYZES SPHINGOMYELIN IN ADDITION TO PHOSPHATIDYLCHOLINE. PROBABLE VIRULENCE FACTOR IMPLICATED IN THE PATHOGENESIS OF Mycobacterium tuberculosis AT THE LEVEL OF INTRACELLULAR SURVIVAL, BY THE ALTERATION OF CELL SIGNALING EVENTS OR BY DIRECT CYTOTOXICITY [CATALYTIC ACTIVITY: A PHOSPHATIDYLCHOLINE + H(2)O = 1,2- DIACYLGLYCEROL + CHOLINE PHOSPHATE]." /note="Rv2349c, (MT2414, MTCY98.18c), len: 508 aa. Probable plcC, phospolipase C 3 (EC 3.1.4.3) (see citations below), similar to other precursors of several phospolipases C e.g. P15713|PHLN_PSEAE|PA3319 NON-HEMOLYTIC PHOSPHOLIPASE C PRECURSOR from Pseudomonas aeruginosa (692 aa), FASTA scores: opt: 1013, E(): 9.3e-54, (38.85% identity in 525 aa overlap); P06200|PHLC_PSEAE HEMOLYTIC PHOSPHOLIPASE C PRECURSOR from Pseudomonas aeruginosa (730 aa), FASTA scores: opt: 630, E(): 1.5e-30, (35.15% identity in 535 aa overlap); Q9S816|T12J13.18|T21P5.4 PUTATIVE PHOSPHOLIPASE from Arabidopsis thaliana (Mouse-ear cress) (521 aa), FASTA scores: opt: 218, E(): 1e-05, (27.05% identity in 451 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Q9XB13|PLCD|Rv1755c|MT1799|MTCY28.21C PHOSPHOLIPASE C 4 (514 aa), FASTA scores: opt: 2497, E(): 9e-144, (68.35% identity in 509 aa overlap); Q50560|Rv2351c|PLCA|MTP40|MT2416|MTCY98.20c PHOSPHOLIPASE C 1 (520 aa), FASTA scores: opt: 2494, E(): 1.4e-143, (68.1% identity in 514 aa overlap); P95246|PLCB|MPCB|Rv2350c|MT2415|MTCY98.19c PHOSPHOLIPASE C 2 (512 aa), FASTA scores: opt: 2474, E(): 2.2e-142, (67.65% identity in 513 aa overlap); etc. BELONGS TO THE BACTERIAL PHOSPHOLIPASE C FAMILY. TBparse score is 0.938." /codon_start=1 /transl_table=11 /product="phospholipase C 3 PLCC" /protein_id="NP_216865.1" /db_xref="GI:15609486" /db_xref="GOA:P95245" /db_xref="UniProtKB/Swiss-Prot:P95245" /db_xref="GeneID:886000" /translation="MSRRAFLAKAAGAGAAAVLTDWAAPVIEKAYGAGPCSGHLTDIE HIVLCLQENRSFDHYFGTLSAVDGFDTPTPLFQQKGWNPETQALDPTGITLPYRINTT GGPNGVGECVNDPDHQWIAAHLSWNGGANDGWLPAQARTRSVANTPVVMGYYARPDIP IHYLLADTFTICDQYFSSLLGGTMPNRLYWISATVNPDGDQGGPQIVEPAIQPKLTFT WRIMPQNLSDAGISWKVYNSKLLGGLNDTSLSRNGYVGSFKQAADPRSDLARYGIAPA YPWDFIRDVINNTLPQVSWVVPLTVESEHPSFPVAVGAVTIVNLIRVLLRNPAVWEKT ALIIAYDEHGGFFDHVTPLTAPEGTPGEWIPNSVDIDKVDGSGGIRGPIGLGFRVPCF VISPYSRGGLMVHDRFDHTSQLQLIGKRFGVPVPNLTPWRASVTGDMTSAFNFAAPPD PSPPNLDHPVRQLPKVAKCVPNVVLGFLNEGLPYRVPYPQTTPVQESGPARPIPSGIC" gene complement(2628781..2630319) /gene="plcB" /locus_tag="Rv2350c" /db_xref="GeneID:885999" CDS complement(2628781..2630319) /gene="plcB" /locus_tag="Rv2350c" /EC_number="3.1.4.3" /function="HYDROLYZES SPHINGOMYELIN IN ADDITION TO PHOSPHATIDYLCHOLINE. PROBABLE VIRULENCE FACTOR IMPLICATED IN THE PATHOGENESIS OF Mycobacterium tuberculosis AT THE LEVEL OF INTRACELLULAR SURVIVAL, BY THE ALTERATION OF CELL SIGNALING EVENTS OR BY DIRECT CYTOTOXICITY [CATALYTIC ACTIVITY: A PHOSPHATIDYLCHOLINE + H(2)O = 1,2- DIACYLGLYCEROL + CHOLINE PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv2350c, (MT2415, MTCY98.19c), len: 512 aa. Probable plcB (alternate gene name: mpcB), membrane-associated phospolipase C 2 (EC 3.1.4.3) (see citations below), similar to other precursors of several phospolipases C e.g. P15713|PHLN_PSEAE|PA3319 NON-HEMOLYTIC PHOSPHOLIPASE C PRECURSOR from Pseudomonas aeruginosa (692 aa), FASTA scores: opt: 885, E(): 2.3e-44, (38.5% identity in 525 aa overlap); P06200|PHLC_PSEAE HEMOLYTIC PHOSPHOLIPASE C PRECURSOR from Pseudomonas aeruginosa (730 aa), FASTA scores: opt: 639, E(): 6.3e-30, (537 aa overlap); Q9RGS8 NON-HEMOLYTIC PHOSPHOLIPASE C from Pseudomonas aeruginosa (700 aa), FASTA scores: opt: 864, E(): 3.9e-43, (39.2% identity in 528 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Q50560|Rv2351c|PLCA|MTP40|MT2416|MTCY98.20c PHOSPHOLIPASE C 1 (520 aa), FASTA scores: opt: 2788, E(): 4.5e-156, (75.5% identity in 514 aa overlap); Q9XB13|PLCD|Rv1755c|MT1799|MTCY28.21C PHOSPHOLIPASE C 4 (514 aa), FASTA scores: opt: 2623, E(): 2.1e-146, (71.5% identity in 512 aa overlap); P95245|PLCC|Rv2349c|MT2414|MTCY98.18c PHOSPHOLIPASE C 3 (508 aa), FASTA scores: opt: 2474, E(): 1.1e-137, (67.65% identity in 513 aa overlap); etc. BELONGS TO THE BACTERIAL PHOSPHOLIPASE C FAMILY. SUPPOSED MEMBRANE-ASSOCIATED, AT THE EXTRACELLULAR SIDE. TBparse score is 0.931.; mpcB" /codon_start=1 /transl_table=11 /product="membrane-associated phospholipase C" /protein_id="NP_216866.1" /db_xref="GI:15609487" /db_xref="GOA:P95246" /db_xref="UniProtKB/Swiss-Prot:P95246" /db_xref="GeneID:885999" /translation="MTRRQFFAKAAAATTAGAFMSLAGPIIEKAYGAGPCPGHLTDIE HIVLLMQENRSFDHYFGTLSDTRGFDDTTPPVVFAQSGWNPMTQAVDPAGVTLPYRFD TTRGPLVAGECVNDPDHSWIGMHNSWNGGANDNWLPAQVPFSPLQGNVPVTMGFYTRR DLPIHYLLADTFTVCDGYFCSLLGGTTPNRLYWMSAWIDPDGTDGGPVLIEPNIQPLQ HYSWRIMPENLEDAGVSWKVYQNKLLGALNNTVVGYNGLVNDFKQAADPRSNLARFGI SPTYPLDFAADVRNNRLPKVSWVLPGFLLSEHPAFPVNVGAVAIVDALRILLSNPAVW EKTALIVNYDENGGFFDHVVPPTPPPGTPGEFVTVPDIDSVPGSGGIRGPIGLGFRVP CLVISPYSRGPLMVHDTFDHTSTLKLIRARFGVPVPNLTAWRDATVGDMTSTFNFAAP PNPSKPNLDHPRLNALPKLPQCVPNAVLGTVTKTAIPYRVPFPQSMPTQETAPTRGIP SGLC" gene complement(2630537..2632075) /gene="plcA" /locus_tag="Rv2351c" /db_xref="GeneID:885995" CDS complement(2630537..2632075) /gene="plcA" /locus_tag="Rv2351c" /EC_number="3.1.4.3" /function="HYDROLYZES SPHINGOMYELIN IN ADDITION TO PHOSPHATIDYLCHOLINE. PROBABLE VIRULENCE FACTOR IMPLICATED IN THE PATHOGENESIS OF Mycobacterium tuberculosis AT THE LEVEL OF INTRACELLULAR SURVIVAL, BY THE ALTERATION OF CELL SIGNALING EVENTS OR BY DIRECT CYTOTOXICITY [CATALYTIC ACTIVITY: A PHOSPHATIDYLCHOLINE + H(2)O = 1,2- DIACYLGLYCEROL + CHOLINE PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv2351c, (MTP40, MT2416, MTCY98.20c), len: 512 aa. Probable plcA (alternate gene name: mpcA), membrane-associated phospolipase C 1 (EC 3.1.4.3) (MTP40 antigen) (see citations below), similar to other precursors of several phospolipases C e.g. P15713|PHLN_PSEAE|PA3319 NON-HEMOLYTIC PHOSPHOLIPASE C PRECURSOR from Pseudomonas aeruginosa (692 aa), FASTA scores: opt: 1064, E(): 4.3e-55, (39.85% identity in 517 aa overlap); P06200|PHLC_PSEAE HEMOLYTIC PHOSPHOLIPASE C PRECURSOR from Pseudomonas aeruginosa (730 aa), FASTA scores: opt: 562, E(): 1.6e-25, (35.35% identity in 481 aa overlap); Q9RGS8|PLCN|PHLN_BURPS NON-HEMOLYTIC PHOSPHOLIPASE C from Burkholderia pseudomallei (Pseudomonas pseudomallei) (700 aa), FASTA scores: opt: 843, E(): 4.4e-42, (40.5% identity in 531 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. P95246|PLCB|MPCB|Rv2350c|MT2415|MTCY98.19c PHOSPHOLIPASE C 2 (512 aa), FASTA scores: opt: 2788, E(): 1.2e-156, (75.5% identity in 514 aa overlap) (alias Q50561|PLCB|MPCB|Rv2350c|MT2415|MTCY98.19c PHOSPHOLIPASE C 2 (521 aa), FASTA scores: opt: 2700, E(): 1.8e-151, (73.8% identity in 515 aa overlap)); Q9XB13|PLCD|Rv1755c|MT1799|MTCY28.21C PHOSPHOLIPASE C 4 (514 aa), FASTA scores: opt: 2643, E(): 4.1e-148, (71.6% identity in 511 aa overlap); etc. BELONGS TO THE BACTERIAL PHOSPHOLIPASE C FAMILY. SUPPOSED MEMBRANE-ASSOCIATED, AT THE EXTRACELLULAR SIDE.; mpcA" /codon_start=1 /transl_table=11 /product="membrane-associated phospholipase C" /protein_id="NP_216867.1" /db_xref="GI:15609488" /db_xref="GOA:Q04001" /db_xref="UniProtKB/Swiss-Prot:Q04001" /db_xref="GeneID:885995" /translation="MSRREFLTKLTGAGAAAFLMDWAAPVIEKAYGAGPCPGHLTDIE HIVLLMQENRSFDHYFGTLSSTNGFNAASPAFQQMGWNPMTQALDPAGVTIPFRLDTT RGPFLDGECVNDPEHQWVGMHLAWNGGANDNWLPAQATTRAGPYVPLTMGYYTRQDIP IHYLLADTFTICDGYHCSLLTGTLPNRLYWLSANIDPAGTDGGPQLVEPGFLPLQQFS WRIMPENLEDAGVSWKVYQNKGLGRFINTPISNNGLVQAFRQAADPRSNLARYGIAPT YPGDFAADVRANRLPKVSWLVPNILQSEHPALPVALGAVSMVTALRILLSNPAVWEKT ALIVSYDENGGFFDHVTPPTAPPGTPGEFVTVPNIDAVPGSGGIRGPLGLGFRVPCIV ISPYSRGPLMVSDTFDHTSQLKLIRARFGVPVPNMTAWRDGVVGDMTSAFNFATPPNS TRPNLSHPLLGALPKLPQCIPNVVLGTTDGALPSIPYRVPYPQVMPTQETTPVRGTPS GLCS" gene complement(2632923..2634098) /gene="PPE38" /locus_tag="Rv2352c" /db_xref="GeneID:888959" CDS complement(2632923..2634098) /gene="PPE38" /locus_tag="Rv2352c" /function="UNKNOWN" /note="Rv2352c, (MTCY98.21c), len: 391 aa. Member of Mycobacterium tuberculosis PPE_family, highly similar to many e.g. Q10778|MTCY48.17|Y04H_MYCTU (734 aa), FASTA scores: opt: 713, E(): 2.8e-27, (37.7% identity in 430 aa overlap); Q10540|MTCY31.06c, Q11031|MTCY02B10.25c, Q10813|MTCY274.23c, P42611|MTV037.06C, P71868|MTCY03C7.23, P95248|MTCY98.22c, P71869|MTCY03C7.24c, etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177870.1" /db_xref="GI:57116974" /db_xref="UniProtKB/TrEMBL:Q7D7A2" /db_xref="GeneID:888959" /translation="MILDFSWLPPEINSARIYAGAGSGPLFMAAAAWEGLAADLRASA SSFDAVIAGLAAGPWSGPASVAMAGAAAPYVGWLSAAAGQAELSAGQATAAATAFEAA LAATVHPAAVTANRVLLGALVATNILGQNTPAIAATEFDYVEMWAQDVGAMVGYHAGA AAVAETLTPFSVPPLDLAGLASQAGAQLTGMATSVSAALSPIAEGAVEGVPAVVAAAQ SVAAGLPVDAALQVGQAAAYPASMLIGPMMQLAQMGTTANTAGLAGAEAAGLAAADVP TFAGDIASGTGLGGAGGLGAGMSAELGKARLVGAMSVPPTWEGSVPARMASSAMAGLG AMPAEVPAAGGPMGMMPMPMGMGGAGAGMPAGMMGRGGANPHVVQARPSVVPRVGIG" gene complement(2634528..2635592) /gene="PPE39" /locus_tag="Rv2353c" /db_xref="GeneID:886003" CDS complement(2634528..2635592) /gene="PPE39" /locus_tag="Rv2353c" /function="UNKNOWN" /note="Rv2353c, (MTCY98.22c), len: 354 aa. Member of Mycobacterium tuberculosis PPE family, highly similar to many e.g. near ORF P95249|Rv2356c|MTCY98.25 from Mycobacterium tuberculosis (615 aa), FASTA scores: opt: 1566, E(): 3.2e-69, (66.1% identity in 349 aa overlap); Q10778|MTCY48.17, Q10540|MTCY31.06c, E241779|MTCY98, Q10813|MTCY274.23c, P71868|MTCY03C7.23, P71869|MTCY03C7.24c, P42611|MTV037.06C, E64997|MTCY98, Q10707|MTCY49.38C, P71657|MTCY02B10.25c, etc. TBparse score is 0.932. Note that he ATG and RBS appear to be provided by the IR of neighbouring IS6110." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177871.1" /db_xref="GI:57116975" /db_xref="UniProtKB/TrEMBL:Q79FF3" /db_xref="GeneID:886003" /translation="MPGRFRNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFG NGNNGNFNFGSGNTGSNNIGFGNTGSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGS GNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANA GAGNTGFFDAGNYNFGSLNAGNINSSFGNSGDGNSGFLNAGDVNSGVGNAGDVNTGLG NSGNINTGGFNPGTLNTGFFSAMTQAGPNSGFFNAGTGNSGFGHNDPAGSGNSGIQNS GFGNSGYVNTSTTSMFGGNSGVLNTGYGNSGFYNAAVNNTGIFVTGVMSSGFFNFGTG NSGLLVSGNGLSGFFKNLFG" repeat_region 2635577..2636931 /note="IS6110-8, len: 1355 bp. Insertion sequence IS6110 element that appears to have inserted in 5'-end of MTCY98.031c but is not flanked by expected 3 bp direct repeats of target sequence." /mobile_element="insertion sequence:IS6110-8" repeat_region 2635577..2635604 /note="28 bp Inverted repeat, TGAACCGCCCCGGCATGTCCGGAGACTC, at the left end of IS6110" gene 2635628..2635954 /locus_tag="Rv2354" /db_xref="GeneID:888963" CDS 2635628..2635954 /locus_tag="Rv2354" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="Rv2354, (MTCY98.23), len: 108 aa. Probable IS6110 transposase, highly similar to others. - 1 frameshift required to complete translation. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216870.1" /db_xref="GI:15609491" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:888963" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 2635951..2636889 /locus_tag="Rv2355" /db_xref="GeneID:888957" CDS <2635951..2636889 /locus_tag="Rv2355" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="Rv2355, (MTCY98.24), len: 312 aa. Probable IS6110 transposase, highly similar to others. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216871.1" /db_xref="GI:15609492" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:888957" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(2636904..2636931) /note="28 bp Inverted repeat, TGAACCGCCCCGGTGAGTCCGGAGACTC, at the right end of IS6110" gene complement(2637688..2639535) /gene="PPE40" /locus_tag="Rv2356c" /db_xref="GeneID:888950" CDS complement(2637688..2639535) /gene="PPE40" /locus_tag="Rv2356c" /function="UNKNOWN" /note="Rv2356c, (MTCY98.25), len: 615 aa. Member of Mycobacterium tuberculosis PPE_family, highly similar to others e.g. Q10778|MTCY48.17|YF48_MYCTU HYPOTHETICAL PPE-FAMILY PROTEIN (678 aa), FASTA scores: opt: 1888, E(): 1.9e-78, (54.4% identity in 667 aa overlap); Q10540|MTCY31.06c, E241779|MTCY98, P42611|MTV037.06c, Q10813|MTCY274.23c, P71657|MTCY02B10.25c, MTCY03C7.23, P71869|MTCY03C7.24c, etc. TBparse score is 0.929." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177872.1" /db_xref="GI:57116976" /db_xref="UniProtKB/TrEMBL:Q7D7A1" /db_xref="GeneID:888950" /translation="MVNFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAE SFGLVTSGLAGGSGQAWQGAAAAAMVVAAAPYAGWLAAAAARAGGAAVQAKAVAGAFE AARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHG GASAAAAALAPWQQAVPGLSGLLGGAANAPAAAAQGAAQGLAELTLNLGVGNIGSLNL GSGNIGGTNVGSGNVGGTNLGSGNYGSLNWGSGNTGTGNAGSGNTGDYNPGSGNFGSG NFGSGNIGSLNVGSGNFGTLNLANGNNGDVNFGGGNTGDFNFGGGNNGTLNFGFGNTG SGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGTGNIGFGNSGNNNIGFFNSGDGNIGF FNSGDGNTGFGNAGNINTGFWNAGNLNTGFGSAGNGNVGIFDGGNSNSGSFNVGFQNT GFGNSGAGNTGFFNAGDSNTGFANAGNVNTGFFNGGDINTGGFNGGNVNTGFGSALTQ AGANSGFGNLGTGNSGWGNSDPSGTGNSGFFNTGNGNSGFSNAGPAMLPGFNSGFANI GSFNAGIANSGNNLAGISNSGDDSSGAVNSGSQNSGAFNAGVGLSGFFR" gene complement(2639673..2641064) /gene="glyS" /locus_tag="Rv2357c" /db_xref="GeneID:888962" CDS complement(2639673..2641064) /gene="glyS" /locus_tag="Rv2357c" /EC_number="6.1.1.14" /function="INVOLVED IN TRANSLATION MECHANISNS [CATALYTIC ACTIVITY: ATP + L-GLYCINE + TRNA(GLY) = AMP + PYROPHOSPHATE + L-GLYCYL-TRNA(GLY)]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes a two-step reaction, first charging a glycine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="glycyl-tRNA synthetase" /protein_id="NP_216873.1" /db_xref="GI:15609494" /db_xref="GOA:P67032" /db_xref="UniProtKB/Swiss-Prot:P67032" /db_xref="GeneID:888962" /translation="MHHPVAPVIDTVVNLAKRRGFVYPSGEIYGGTKSAWDYGPLGVE LKENIKRQWWRSVVTGRDDVVGIDSSIILPREVWVASGHVDVFHDPLVESLITHKRYR ADHLIEAYEAKHGHPPPNGLADIRDPETGEPGQWTQPREFNMMLKTYLGPIETEEGLH YLRPETAQGIFVNFANVVTTARKKPPFGIGQIGKSFRNEITPGNFIFRTREFEQMEME FFVEPATAKEWHQYWIDNRLQWYIDLGIRRENLRLWEHPKDKLSHYSDRTVDIEYKFG FMGNPWGELEGVANRTDFDLSTHARHSGVDLSFYDQINDVRYTPYVIEPAAGLTRSFM AFLIDAYTEDEAPNTKGGMDKRTVLRLDPRLAPVKAAVLPLSRHADLSPKARDLGAEL RKCWNIDFDDAGAIGRRYRRQDEVGTPFCVTVDFDSLQDNAVTVRERDAMTQDRVAMS SVADYLAVRLKGS" misc_feature complement(2640417..2640479) /gene="glyS" /locus_tag="Rv2357c" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1" misc_feature complement(2640480..2640503) /gene="glyS" /locus_tag="Rv2357c" /note="PS00017 ATP/GTP-binding site motif A" gene 2641246..2641653 /locus_tag="Rv2358" /db_xref="GeneID:888965" CDS 2641246..2641653 /locus_tag="Rv2358" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2358, (MTCY27.22c), len: 135 aa. Probable transcriptional regulator, arsR family, equivalent to Q9CCG5|ML0825 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (140 aa), FASTA scores: opt: 647, E(): 2e-34, (72.9% identity in 140 aa overlap). Also similar to others e.g. BAB48273|MLR0745 Transcriptional regulator from Rhizobium loti (Mesorhizobium loti) (104 aa), FASTA scores: opt: 185, E(): 3.4e-05, (43.25% identity in 74 aa overlap) (has its N-terminus shorter); P15905|ARR1_ECOLI arsenical resistance operon repressor from Escherichia coli (117 aa), FASTA scores: opt: 164, E(): 8.1e-05, (39.1% identity in 69 aa overlap); etc. Also similar to O53838|Rv0827|MTV043.19c PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (130 aa), FASTA scores: opt: 201, E(): 4e-06, (35.7% identity in 98 aa overlap); and O69711|Rv3744|MTV025.092 PUTATIVE REGULATORY PROTEIN from Mycobacterium tuberculosis (120 aa), FASTA scores: opt: 209, E(): 1.2e-06, (35.5 % identity in 93 aa overlap). Contains possible helix-turn-helix motif at aa 72-93 (Score 1103, +2.94 SD). Belongs to the ARSR family of transciptional regulators." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="NP_216874.1" /db_xref="GI:15609495" /db_xref="GOA:O05840" /db_xref="UniProtKB/TrEMBL:O05840" /db_xref="GeneID:888965" /translation="MVTSPSTPTAAHEDVGADEVGGHQHPADRFAECPTFPAPPPREI LDAAGELLRALAAPVRIAIVLQLRESQRCVHELVDALHVPQPLVSQHLKILKAAGVVT GERSGREVLYRLADHHLAHIVLDAVAHAGEDAI" gene 2641650..2642042 /gene="furB" /locus_tag="Rv2359" /db_xref="GeneID:886009" CDS 2641650..2642042 /gene="furB" /locus_tag="Rv2359" /function="ACTS AS A GLOBAL NEGATIVE CONTROLLING ELEMENT, EMPLOYING FE(2+) AS A COFACTOR TO BIND THE OPERATOR OF THE REPRESSED GENES. REGULATES THE EXPRESSION OF SEVERAL OUTER-MEMBRANE PROTEINS INCLUDING THE IRON TRANSPORT OPERON (BY SIMILARITY)." /experiment="experimental evidence, no additional details recorded" /note="Rv2359, (MTCY27.21c), len: 130 aa. Probable furB, ferric uptake regulation protein, equivalent to FURB|ML0824|Q9CCG6 PUTATIVE FERRIC UPTAKE REGULATORY PROTEIN from Mycobacterium leprae (131 aa), FASTA scores: opt: 765, E(): 1.7e-43, (86.9% identity in 130 aa overlap). Also highly similar to FERRIC UPTAKE REGULATION PROTEINS e.g. Q9L2H5|SCC121.11 PUTATIVE METAL UPTAKE REGULATION PROTEIN from Streptomyces coelicolor (139 aa), FASTA scores: opt: 547, E(): 3.4e-29, (59.4% identity in 133 aa overlap); P06975|FUR_ECOLI from Escherichia coli (148 aa), FASTA scores: opt: 322, E(): 1.9e-14, (37.9% identity in 132 aa overlap); P45599|FUR_KLEPN FERRIC UPTAKE REGULATION PROTEIN from Klebsiella pneumoniae (155 aa), FASTA scores: opt: 314, E(): 6.7e-14, (36.35% identity in 132 aa overlap); etc. BELONGS TO THE FUR FAMILY." /codon_start=1 /transl_table=11 /product="ferric uptake regulation protein FURB" /protein_id="NP_216875.1" /db_xref="GI:15609496" /db_xref="GOA:O05839" /db_xref="UniProtKB/TrEMBL:O05839" /db_xref="GeneID:886009" /translation="MSAAGVRSTRQRAAISTLLETLDDFRSAQELHDELRRRGENIGL TTVYRTLQSMASSGLVDTLHTDTGESVYRRCSEHHHHHLVCRSCGSTIEVGDHEVEAW AAEVATKHGFSDVSHTIEIFGTCSDCRS" gene complement(2642150..2642578) /locus_tag="Rv2360c" /db_xref="GeneID:888952" CDS complement(2642150..2642578) /locus_tag="Rv2360c" /function="UNKNOWN" /note="Rv2360c, (MTCY27.20), len: 142 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216876.1" /db_xref="GI:15609497" /db_xref="UniProtKB/TrEMBL:O05838" /db_xref="GeneID:888952" /translation="MPSLPDRLASILRDVLPAEEEPDGALTVRHDGTFASLRVVSIAE DLELVSLTQILAWDLPLTKRLAEQVAKQARDINFGSVSLREKVSEKAARRSSGRPASN TADVMLRYNFPGTGLTDDALRTLILLVLETGATIRSALVG" gene complement(2642578..2643468) /locus_tag="Rv2361c" /db_xref="GeneID:888964" CDS complement(2642578..2643468) /locus_tag="Rv2361c" /EC_number="2.5.1.-" /function="INVOLVED IN THE SYNTHESIS OF DECAPRENYL DIPHOSPHATE, A MOLECULE WHICH HAS A CENTRAL ROLE IN THE BIOSYNTHESIS OF MOST FEATURES OF THE MYCOBACTERIAL CELL WALL. ADDS SEVEN MORE ISOPRENE UNITS TO OMEGA,E, Z-FARNESYL DIPHOSPHATE AND RELEASES DECAPRENYL DIPHOSPHATE." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of undecaprenyl pyrophosphate from isopentenyl pyrophosphate" /codon_start=1 /transl_table=11 /product="undecaprenyl pyrophosphate synthase" /protein_id="NP_216877.1" /db_xref="GI:15609498" /db_xref="GOA:P60479" /db_xref="UniProtKB/Swiss-Prot:P60479" /db_xref="GeneID:888964" /translation="MARDARKRTSSNFPQLPPAPDDYPTFPDTSTWPVVFPELPAAPY GGPCRPPQHTSKAAAPRIPADRLPNHVAIVMDGNGRWATQRGLARTEGHKMGEAVVID IACGAIELGIKWLSLYAFSTENWKRSPEEVRFLMGFNRDVVRRRRDTLKKLGVRIRWV GSRPRLWRSVINELAVAEEMTKSNDVITINYCVNYGGRTEITEATREIAREVAAGRLN PERITESTIARHLQRPDIPDVDLFLRTSGEQRSSNFMLWQAAYAEYIFQDKLWPDYDR RDLWAACEEYASRTRRFGSA" misc_feature complement(2642695..2642751) /locus_tag="Rv2361c" /note="PS01066 Hypothetical YBR002c family signature" gene complement(2643461..2644258) /gene="recO" /locus_tag="Rv2362c" /db_xref="GeneID:888954" CDS complement(2643461..2644258) /gene="recO" /locus_tag="Rv2362c" /function="UNKNOWN" /note="involved in DNA repair and RecFOR pathway recombination; RecFOR proteins displace ssDNA-binding protein and facilitate the production of RecA-coated ssDNA" /codon_start=1 /transl_table=11 /product="DNA repair protein RecO" /protein_id="NP_216878.1" /db_xref="GI:15609499" /db_xref="GOA:P65983" /db_xref="UniProtKB/Swiss-Prot:P65983" /db_xref="GeneID:888954" /translation="MRLYRDRAVVLRQHKLGEADRIVTLLTRDHGLVRAVAKGVRRTR SKFGARLEPFAHIEVQLHPGRNLDIVTQVVSVDAFATDIVADYGRYTCGCAILETAER LAGEERAPAPALHRLTVGALRAVADGQRPRDLLLDAYLLRAMGIAGWAPALTECARCA TPGPHRAFHIATGGSVCAHCRPAGSTTPPLGVVDLMSALYDGDWEAAEAAPQSARSHV SGLVAAHLQWHLERQLKTLPLVERFYQADRSVAERRAALIGQDIAGG" gene 2644320..2645774 /gene="amiA2" /locus_tag="Rv2363" /db_xref="GeneID:888955" CDS 2644320..2645774 /gene="amiA2" /locus_tag="Rv2363" /EC_number="3.5.1.4" /function="GENERATES MONOCARBOXYLATE FROM MONOCARBOXYLIC ACID AMIDE [CATALYTIC ACTIVITY: A MONOCARBOXYLIC ACID AMIDE + H(2)O = A MONOCARBOXYLATE + NH(3)]." /note="catalyzes the hydrolysis of a monocarboxylic acid amid to form a monocarboxylate and ammonia" /codon_start=1 /transl_table=11 /product="amidase" /protein_id="NP_216879.1" /db_xref="GI:15609500" /db_xref="GOA:P63490" /db_xref="UniProtKB/Swiss-Prot:P63490" /db_xref="GeneID:888955" /translation="MVGASGSDAGAISGSGNQRLPTLTDLLYQLATRAVTSEELVRRS LRAIDVSQPTLNAFRVVLTESALADAAAADKRRAAGDTAPLLGIPIAVKDDVDVAGVP TAFGTQGYVAPATDDCEVVRRLKAAGAVIVGKTNTCELGQWPFTSGPGFGHTRNPWSR RHTPGGSSGGSAAAVAAGLVTAAIGSDGAGSIRIPAAWTHLVGIKPQRGRISTWPLPE AFNGVTVNGVLARTVEDAALVLDAASGNVEGDRHQPPPVTVSDFVGIAPGPLKIALST HFPYTGFRAKLHPEILAATQRVGDQLELLGHTVVKGNPDYGLRLSWNFLARSTAGLWE WAERLGDEVTLDRRTVSNLRMGHVLSQAILRSARRHEAADQRRVGSIFDIVDVVLAPT TAQPPPMARAFDRLGSFGTDRAIIAACPSTWPWNLLGWPSINVPAGFTSDGLPIGVQL MGPANSEGMLISLAAELEAVSGWATKQPQVWWTS" misc_feature 2644701..2644724 /gene="amiA2" /locus_tag="Rv2363" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 2644812..2644907 /gene="amiA2" /locus_tag="Rv2363" /note="PS00571 Amidases signature" gene complement(2645771..2646673) /gene="era" /locus_tag="Rv2364c" /gene_synonym="bex" /gene_synonym="rbaA" /gene_synonym="sdgE" /gene_synonym="yqfH" /db_xref="GeneID:886027" CDS complement(2645771..2646673) /gene="era" /locus_tag="Rv2364c" /gene_synonym="bex" /gene_synonym="rbaA" /gene_synonym="sdgE" /gene_synonym="yqfH" /function="BINDS BOTH GDP AND GTP. HAS AN INTRINSIC GTPASE ACTIVITY AND IS ESSENTIAL FOR CELL GROWTH." /note="Era; Escherichia coli Ras-like protein; Bex; Bacillus Era-complementing segment; essential protein in Escherichia coli that is involved in many cellular processes; GTPase; binds the cell membrane through apparent C-terminal domain; mutants are arrested during the cell cycle; Streptococcus pneumoniae Era binds to RNA and Escherichia coli Era binds 16S rRNA and 30S ribosome" /codon_start=1 /transl_table=11 /product="GTP-binding protein Era" /protein_id="YP_177873.1" /db_xref="GI:57116977" /db_xref="GOA:O05834" /db_xref="UniProtKB/Swiss-Prot:O05834" /db_xref="GeneID:886027" /translation="MTEFHSGFVCLVGRPNTGKSTLTNALVGAKVAITSTRPQTTRHA IRGIVHSDDFQIILVDTPGLHRPRTLLGKRLNDLVRETYAAVDVIGLCIPADEAIGPG DRWIVEQLRSTGPANTTLVVIVTKIDKVPKEKVVAQLVAVSELVTNAAEIVPVSAMTG DRVDLLIDVLAAALPAGPAYYPDGELTDEPEEVLMAELIREAALQGVRDELPHSLAVV IDEVSPREGRDDLIDVHAALYVERDSQKGIVIGKGGARLREVGTAARSQIENLLGTKV YLDLRVKVAKNWQRDPKQLGRLGF" misc_feature complement(2646614..2646637) /gene="era" /locus_tag="Rv2364c" /gene_synonym="bex" /gene_synonym="rbaA" /gene_synonym="sdgE" /gene_synonym="yqfH" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2646747..2647088) /locus_tag="Rv2365c" /db_xref="GeneID:885259" CDS complement(2646747..2647088) /locus_tag="Rv2365c" /function="UNKNOWN" /note="Rv2365c, (MTCY27.15), len: 113 aa. Conserved hypothetical protein, highly similar to Q49767|ML0630|B1937_F3_101|CAC30138 Hypothetical protein from Mycobacterium leprae (108 aa), FASTA scores: opt: 426, E(): 1.4e-18, (67.9% identity in 106 aa overlap). Also highly similar to Q9RDF3|SCC77.05 from Streptomyces coelicolor (132 aa), FASTA scores: opt: 254, E(): 1.9e-18, (53.1% identity in 96 aa overlap). Equivalent to AAK46728 from Mycobacterium tuberculosis strain CDC1551 (93 aa) but longer 20 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216881.1" /db_xref="GI:15609502" /db_xref="UniProtKB/TrEMBL:O05833" /db_xref="GeneID:885259" /translation="MMRRPITLAEQLDAEDAKLVVLARAAMARAEAGAGAAVRDVDGR TYAAAPVALSALELTGLQAAVAAAVSSGATGLQAAVLVAGSVDDPGIAAVRELAPTAA IIVTDRAGNPL" gene complement(2647060..2648367) /locus_tag="Rv2366c" /db_xref="GeneID:885987" CDS complement(2647060..2648367) /locus_tag="Rv2366c" /function="UNKNOWN" /note="Rv2366c, (MTCY27.14), len: 435 aa. Probable conserved transmembrane protein, highly similar to Q9L2L3|SCC117.07 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (358 aa), FASTA scores: opt: 1159, E(): 5.5e-64, (53.0% identity in 353 aa overlap); ans similar to hypothetical proteins and hemolysin-related proteins e.g. Q9HN02|HLP|VNG2308G HEMOLYSIN PROTEIN from Halobacterium sp. strain NRC-1 (457 aa), FASTA scores: opt: 623, E(): 6.2e-31, (28.4% identity in 433 aa overlap); etc. Potential transmembrane protein with 2 CBS domains. BELONGS TO THE UPF0053 FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216882.1" /db_xref="GI:15609503" /db_xref="GOA:P67130" /db_xref="UniProtKB/Swiss-Prot:P67130" /db_xref="GeneID:885987" /translation="MTGYYQLLGSIVLIGLGGLFAAIDAAISTVSPARVDELVRDQRP GAGSLRKVMADRPRYVNLVVLLRTSCEITATALLVVFIRYHFSMVWGLYLAAGIMVLA SFVVVGVGPRTLGRQNAYSISLATALPLRLISWLLMPISRLLVLLGNALTPGRGFRNG PFASEIELREVVDLAQQRGVVAADERRMIESVFELGDTPAREVMVPRTEMIWIESDKT AGQAMTLAVRSGHSRIPVIGENVDDIVGVVYLKDLVEQTFCSTNGGRETTVARVMRPA VFVPDSKPLDALLREMQRDRNHMALLVDEYGAIAGLVSIEDVLEEIVGEIADEYDQAE TAPVEDLGDKRFRVSARLPIEDVGELYGVEFDDDLDVDTVGGLLALELGRVPLPGAEV ISHGLRLHAEGGTDHRGRVRIGTVLLSPAEPDGADDEEADHPG" gene complement(2648364..2648912) /locus_tag="Rv2367c" /db_xref="GeneID:885989" CDS complement(2648364..2648912) /locus_tag="Rv2367c" /function="CONSERVED HYPOTHETICAL PROTEIN" /note="Rv2367c, (MTCY27.13), len: 182 aa. Conserved hypothetical protein, equivalent to Q49752|YN67_MYCLE|ML0628|B1937_F1_21 HYPOTHETICAL 19.8 KDA PROTEIN from Mycobacterium leprae (178 aa), FASTA scores: opt: 1051, E(): 2e-59, (89.1% identity in 175 aa overlap). Also highly similar to others e.g. Q9L2L4|SCC117.06 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (165 aa), FASTA scores: opt: 599, E(): 6e-31, (56.5% identity in 154 aa overlap); Q9KD56|BH1363 HYPOTHETICAL PROTEIN from Bacillus halodurans (159 aa), FASTA scores: opt: 311, E(): 8.3e-13, (45.05% identity in 111 aa overlap); etc." /codon_start=1 /transl_table=11 /product="putative metalloprotease" /protein_id="NP_216883.1" /db_xref="GI:15609504" /db_xref="GOA:P67134" /db_xref="UniProtKB/Swiss-Prot:P67134" /db_xref="GeneID:885989" /translation="MREHLMSIEVANESGIDVSEAELVSVARFVIAKMDVNPCAELSM LLLDTAAMADLHMRWMDLPGPTDVMSFPMDELEPGGRPDAPEPGPSMLGDIVLCPEFA AEQAAAAGHSLGHELALLTIHGVLHLLGYDHAEPDEEKEMFALQDRLLEEWVADQVEA YQHDRQDEKDRRLLDKSRYFDL" gene complement(2648916..2649974) /gene="phoH1" /locus_tag="Rv2368c" /db_xref="GeneID:885998" CDS complement(2648916..2649974) /gene="phoH1" /locus_tag="Rv2368c" /function="FUNCTION NOT REALLY KNOWN." /note="Rv2368c, (MTCY27.12), len: 352 aa. Probable phoH1, phoH-like protein (phosphate starvation-induced protein), probably ATP-binding protein, equivalent to Q49751|PHOL_MYCLE| ML0627|B1937_F1_20 PHOH-LIKE PROTEIN from Mycobacterium leprae (349 aa), FASTA scores: opt: 1952, E(): 4.7e-107, (88.9% identity in 352 aa overlap). Also highly similar to Q9L2L5|SCC117.05 PHOH-LIKE PROTEIN from Streptomyces coelicolor (359 aa), FASTA scores: opt: 1407, E(): 3.6e-75, (63.6% identity in 349 aa overlap); Q9RSY1|DR1988 PHOH-RELATED PROTEIN from Deinococcus radiodurans (380 aa), FASTA scores: opt: 1053, E(): 1.9e-54, (53.3% identity in 349 aa overlap); Q9KD58|PHOH|BH1361 PHOSPHATE STARVATION-INDUCED PROTEIN from Bacillus halodurans (320 aa), FASTA scores: opt: 1019, E(): 1.6e-52, (54.35% identity in 300 aa overlap); P46343|PHOL_BACSU PHOH-LIKE PROTEIN from Bacillus subtilis (319 aa), FASTA scores: opt: 1014, E(): 3.2e-52, (50.8% identity in 303 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE PHOH FAMILY. Note that previously known as phoH.; phoH" /codon_start=1 /transl_table=11 /product="phosphate starvation-inducible protein PsiH" /protein_id="YP_177874.1" /db_xref="GI:57116978" /db_xref="GOA:O05830" /db_xref="UniProtKB/Swiss-Prot:O05830" /db_xref="GeneID:885998" /translation="MTSRETRAADAAGARQADAQVRSSIDVPPDLVVGLLGSADENLR ALERTLSADLHVRGNAVTLCGEPADVALAERVISELIAIVASGQSLTPEVVRHSVAML VGTGNESPAEVLTLDILSRRGKTIRPKTLNQKRYVDAIDANTIVFGIGPAGTGKTYLA MAKAVHALQTKQVTRIILTRPAVEAGERLGFLPGTLSEKIDPYLRPLYDALYDMMDPE LIPKLMSAGVIEVAPLAYMRGRTLNDAFIVLDEAQNTTAEQMKMFLTRLGFGSKVVVT GDVTQIDLPGGARSGLRAAVDILEDIDDIHIAELTSVDVVRHRLVSEIVDAYARYEEP GSGLNRAARRASGARGRR" misc_feature complement(2649504..2649527) /gene="phoH1" /locus_tag="Rv2368c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2649946..2650248) /locus_tag="Rv2369c" /db_xref="GeneID:885811" CDS complement(2649946..2650248) /locus_tag="Rv2369c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2369c, (MTCY27.11), len: 100 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216885.1" /db_xref="GI:15609506" /db_xref="UniProtKB/TrEMBL:O05829" /db_xref="GeneID:885811" /translation="MIVGLADRHGHGRDVAAHRQAQLAGPRVAAVRRHRTGGHRQASS RIKVSAHGLGVVRCAPTPSLTGVRMKLQHSSVRQVPVDRPESRHQKPGDVPRDPRC" gene complement(2650245..2651558) /locus_tag="Rv2370c" /db_xref="GeneID:886017" CDS complement(2650245..2651558) /locus_tag="Rv2370c" /function="UNKNOWN" /note="Rv2370c, (MTCY27.10), len: 437 aa. Conserved hypothetical protein, member of family proteins from Mycobacterium tuberculosis with Rv1453|MTCY493_01c|O06807 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (432 aa), FASTA scores: opt: 1943, E(): 9.4e-115, (69.9% identity in 409 aa overlap); Rv1194c|MTCI364.06c; etc. Also similar to AAK45764|MT1500 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (432 aa), FASTA scores: opt: 1934, E(): 9.4e-115, (69.9% identity in 409 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216886.1" /db_xref="GI:15609507" /db_xref="UniProtKB/TrEMBL:O05828" /db_xref="GeneID:886017" /translation="MVLPKPTPRGRELIRQAAKVALHPTPEWLDELDRATLAAHPSIA ADPALATVVSRANRSHLIHFATANLRKPGQPVPANLGPDPLRMARDLVRRGLDASALD VYRVGQNVAWQRWTEIAFGLTTDPQELHELLTLPFRSASEFIDATLAGLAAQMQLEYD ELTRDVHAEHRRIVELILDGAPISRQSAEAKLGYPLDRSHTAAIIWYDDPDDNQNHLD HTARAFGRALGCPQPLIAVASAATRWVWVSDAATLDTDRIHQVLDHAPHARIAVGTTA RGIDGFRRSHRDALATQRMLARLRSQQRLAFFADIHMIAVLTENPDSAADFITSTLGD LESASPQLLTTVLTYINEQCNASRAAHVLHTHRNTLLRRLETAQRLLPRPLDHTIIQV AVAISALQWRGSQTSDPVETPVEGITSPPPESLGRRRSRLAQLER" gene 2651753..2651938 /gene="PE_PGRS40" /locus_tag="Rv2371" /db_xref="GeneID:885141" CDS 2651753..2651938 /gene="PE_PGRS40" /locus_tag="Rv2371" /function="UNKNOWN" /note="Rv2371, (MTCY27.09c), len: 61 aa. Short protein, member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to N-terminal part of others e.g. AAK44356|MT0132 PE_PGRS FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (561 aa), FASTA scores: opt: 217, E( ): 4.9e-08, (69.65% identity in 56 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177875.1" /db_xref="GI:57116979" /db_xref="UniProtKB/TrEMBL:Q79FE9" /db_xref="GeneID:885141" /translation="MSLVSVAPELVVTAVPDVARIGSSIGAPDTAAAARPTTSVLAAG ADEVSADVVALFGWVAR" gene complement(2652037..2652825) /locus_tag="Rv2372c" /db_xref="GeneID:885926" CDS complement(2652037..2652825) /locus_tag="Rv2372c" /function="UNKNOWN" /note="in Escherichia coli RsmE methylates the N3 position of the U1498 base in 16S rRNA; cells lacking this function can grow, but are outcompeted by wild-type; SAM-dependent m(3)U1498 methyltransferase" /codon_start=1 /transl_table=11 /product="16S ribosomal RNA methyltransferase RsmE" /protein_id="NP_216888.1" /db_xref="GI:15609509" /db_xref="UniProtKB/Swiss-Prot:P67202" /db_xref="GeneID:885926" /translation="MVAMLFYVDTLPDTGAVAVVDGDEGFHAATVRRIRPGEQLVLGD GVGRLARCVVEQAGRGGLRARVLRRWSVPPVRPPVTVVQALPKSERSELAIELATEAG ADAFLAWQAARCVANWDGARVDKGLRRWRAVVRSAARQSRRARIPPVDGVLSTPMLVQ RVREEVAAGAAVLVLHEEATERIVDIAAAQAGSLMLVVGPEGGIAPDELAALTDAGAV AVRLGPTVLRTSTAAAVALGAVGVLTSRWDASASDCEYCDVTRR" gene complement(2652839..2653987) /gene="dnaJ2" /locus_tag="Rv2373c" /db_xref="GeneID:886023" CDS complement(2652839..2653987) /gene="dnaJ2" /locus_tag="Rv2373c" /function="ACTS AS A CO-CHAPERONE. STIMULATES, JOINTLY WITH GRPE, THE ATPASE ACTIVITY OF DNAK|Rv0350." /experiment="experimental evidence, no additional details recorded" /note="chaperone Hsp40; co-chaperone with DnaK; Participates actively in the response to hyperosmotic and heat shock by preventing the aggregation of stress-denatured proteins and by disaggregating proteins, also in an autonomous, dnaK-independent fashion" /codon_start=1 /transl_table=11 /product="chaperone protein DnaJ" /protein_id="NP_216889.1" /db_xref="GI:15609510" /db_xref="GOA:P63966" /db_xref="UniProtKB/Swiss-Prot:P63966" /db_xref="GeneID:886023" /translation="MARDYYGLLGVSKNASDADIKRAYRKLARELHPDVNPDEAAQAK FKEISVAYEVLSDPDKRRIVDLGGDPLESAAAGGNGFGGFGGLGDVFEAFFGGGFGGG AASRGPIGRVRPGSDSLLRMRLDLEECATGVTKQVTVDTAVLCDRCQGKGTNGDSVPI PCDTCGGRGEVQTVQRSLLGQMLTSRPCPTCRGVGVVIPDPCQQCMGDGRIRARREIS VKIPAGVGDGMRVRLAAQGEVGPGGGPAGDLYVEVHEQAHDVFVREGDHLHCTVSVPM VDAALGVTVTVDAILDGLSEITIPPGTQPGSVITLRGRGMPHLRSNTRGDLHVHVEVV VPTRLDHQDIELLRELKGRRDREVAEVRSTHAAAGGLFSRLRETFTGR" gene complement(2654062..2655093) /gene="hrcA" /locus_tag="Rv2374c" /db_xref="GeneID:885924" CDS complement(2654062..2655093) /gene="hrcA" /locus_tag="Rv2374c" /function="INVOLVED IN TRANSCRIPTIONAL REGULATION (REPRESSION) OF CLASS I HEAT SHOCK PROTEINS e.g. DNAK-GRPE-DNAJ1 AND GROELS OPERONS). PREVENTS HEAT-SHOCK INDUCTION OF THESE OPERONS." /note="Negative regulator of class I heat shock genes (grpE-dnaK-dnaJ and groELS operons). Prevents heat-shock induction of these operons" /codon_start=1 /transl_table=11 /product="heat-inducible transcription repressor" /protein_id="NP_216890.1" /db_xref="GI:15609511" /db_xref="GOA:P64398" /db_xref="UniProtKB/Swiss-Prot:P64398" /db_xref="GeneID:885924" /translation="MGSADERRFEVLRAIVADFVATQEPIGSKSLVERHNLGVSSATV RNDMAVLEAEGYITQPHTSSGRVPTEKGYREFVDRLEDVKPLSSAERRAIQSFLESGV DLDDVLRRAVRLLAQLTRQVAVVQYPTLSTSTVRHLEVIALTPARLLMVVITDSGRVD QRIVELGDVIDDHQLAQLREILGQALEGKKLSAASVAVADLASQLGGAGGLGDAVGRA ATVLLESLVEHTEERLLLGGTANLTRNAADFGGSLRSILEALEEQVVVLRLLAAQQEA GKVTVRIGHETASEQMVGTSMVSTAYGTAHTVYGGMGVVGPTRMDYPGTIASVAAVAL YIGDVLGAR" gene 2655265..2655582 /locus_tag="Rv2375" /db_xref="GeneID:885520" CDS 2655265..2655582 /locus_tag="Rv2375" /function="UNKNOWN" /note="Rv2375, (MTCY27.05c), len: 105 aa. Conserved hypothetical protein, highly similar to only CAC32314|2SCD60.09c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (98 aa), FASTA scores: opt: 425, E(): 5.7e-24, (63.25% identity in 98 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216891.1" /db_xref="GI:15609512" /db_xref="UniProtKB/TrEMBL:O05823" /db_xref="GeneID:885520" /translation="MIFKGVREGKPYPEHGLSYRDWSQIPPQQIRLDELVTTTTVLAL DRLLSEDSTFYGDLFPHAVKWRGTTYLEDGLHRAVRAALRNRTVLHARVFDMDASPGG RRS" gene complement(2655609..2656115) /gene="cfp2" /locus_tag="Rv2376c" /db_xref="GeneID:885515" CDS complement(2655609..2656115) /gene="cfp2" /locus_tag="Rv2376c" /function="FUNCTION NOT KNOWN (PUTATIVE SECRETED PROTEIN); MAY PLAY A ROLE IN THE DEVELOPMENT OF PROTECTIVE IMMUNE RESPONSES." /experiment="experimental evidence, no additional details recorded" /note="Rv2376c, (MT2445, MTCY27.04), len: 168 aa. cfp2 (alternate gene name: mtb12), low molecular weight antigen, secreted protein similar to Q49771|MB12_MYCLE|ML0620|B1937_F3_91 LOW MOLECULAR WEIGHT ANTIGEN MTB12 HOMOLOG PRECURSOR from Mycobacterium leprae (167 aa), FASTA scores: opt: 682, E(): 1.7e-32, (65.5% identity in 165 aa overlap). BELONGS TO THE MTB12 FAMILY.; mtb12" /codon_start=1 /transl_table=11 /product="low molecular weight antigen CFP2 (low molecular weight protein antigen 2) (CFP-2)" /protein_id="NP_216892.1" /db_xref="GI:15609513" /db_xref="UniProtKB/Swiss-Prot:O05822" /db_xref="GeneID:885515" /translation="MKMVKSIAAGLTAAAAIGAAAAGVTSIMAGGPVVYQMQPVVFGA PLPLDPASAPDVPTAAQLTSLLNSLADPNVSFANKGSLVEGGIGGTEARIADHKLKKA AEHGDLPLSFSVTNIQPAAAGSATADVSVSGPKLSSPVTQNVTFVNQGGWMLSRASAM ELLQAAGN" gene complement(2656215..2656430) /gene="mbtH" /locus_tag="Rv2377c" /db_xref="GeneID:885968" CDS complement(2656215..2656430) /gene="mbtH" /locus_tag="Rv2377c" /function="THOUGHT TO BE INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS." /note="Rv2377c, (MT2445.1, MTCY27.03), len: 71 aa. Putative mbtH, conserved protein with no function assigned (see Quadri et al ., 1998; De Voss et al., 1999), similar to hypothetical proteins or proteins found in several gene clusters for biosynthesis or transport of siderophores and other nonribosomally synthesized peptides e.g. Q9Z388|SCE8.11c PUTATIVE SMALL CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (71 aa), FASTA scores: opt: 345, E(): 1.4e-19, (68.2% identity in 66 aa overlap); Q9F8V3|CUMB COUY PROTEIN (probably involved in the biosynthesis of aminocoumarin antibiotic coumermycin A(1)) (see Wang et al., 2000) from Streptomyces rishiriensis (71 aa), FASTA scores: opt: 329, E(): 2.2e-18, (63.2% identity in 68 aa overlap); Q9F5J2|SIM-CB MBTH-LIKE PROTEIN (probably protein involved in the biosynthesis of aminocoumarin antibiotic coumermycin A(1)) from Streptomyces antibioticus (70 aa), FASTA scores: opt: 308, E(): 8.4e-17, (65.6% identity in 64 aa overlap); Q9FB14 MBTH-LIKE PROTEIN (involved in the biosynthesis of the antitumor drug bleomycin) (see Du et al., 2000) from Streptomyces verticillus FASTA scores: opt: 220, E(): 8.8e-10, (41.2% identity in 68 aa overlap); etc." /codon_start=1 /transl_table=11 /product="putative protein MbtH" /protein_id="NP_216893.1" /db_xref="GI:15609514" /db_xref="UniProtKB/Swiss-Prot:O05821" /db_xref="GeneID:885968" /translation="MSTNPFDDDNGAFFVLVNDEDQHSLWPVFADIPAGWRVVHGEAS RAACLDYVEKNWTDLRPKSLRDAMVED" gene complement(2656408..2657703) /gene="mbtG" /locus_tag="Rv2378c" /db_xref="GeneID:885648" CDS complement(2656408..2657703) /gene="mbtG" /locus_tag="Rv2378c" /EC_number="1.14.13.59" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS. THIS HYDROXYLASE IS POSSIBLY REQUIRED FOR N-HYDROXYLATION OF THE TWO LYSINE RESIDUES AT SOME STAGE DURING MYCOBACTIN ASSEMBLY [CATALYTIC ACTIVITY: L-LYSINE + O(2) = N6-HYDROXY-L-LYSINE + H(2)O. NO INFORMATION CAN BE FOUND IF THIS ENZYME IS NADPH DEPENDENT OR INDEPENDENT]." /note="deleted EC_number 1.13.12.10; Rv2378c, (MTCY27.02), len: 431 aa. mbtG, lysine-N-oxygenase (hydroxylase) (EC 1.13.12.10 or 1.14.13.59; depending if enzyme is NADPH dependent or independent) (see citations below), showing some similarity with various proteins including ornithine and lysine-N-oxygenases, e.g. Q9K6Q1|TRKA|BH3677 POTASSIUM UPTAKE PROTEIN from Bacillus halodurans (350 aa), FASTA scores: opt: 153, E(): 0.016, (25.2% identity in 246 aa overlap); P56584|SID1_USTMA L-ORNITHINE 5-MONOOXYGENASE (EC 1.13.12.-) from Ustilago maydis (Smut fungus) (570 aa), FASTA scores: opt: 136, E(): 0.31, (22.85% identity in 127 aa overlap); Q9HHV0|HXYA|VNG6214G MONOOXYGENASE from Halobacterium sp. strain NRC-1 (477 aa), FASTA scores: opt: 119, E(): 3.4, (40.0% identity in 70 aa overlap); O69828|SC1A6.23 PUTATIVE LYSINE N-HYDROXLASE (FRAGMENT) from Streptomyces coelicolor (134 aa), BLAST score: 76 (similarity in part for this one); etc. COFACTORS: FAD (BY SIMILARITY)." /codon_start=1 /transl_table=11 /product="lysine-N-oxygenase MBTG (L-lysine 6-monooxygenase) (lysine N6-hydroxylase)" /protein_id="NP_216894.1" /db_xref="GI:15609515" /db_xref="GOA:O05820" /db_xref="UniProtKB/TrEMBL:O05820" /db_xref="GeneID:885648" /translation="MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGA NWQASGGWTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATASF AEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWALCTHETTVQAD ALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAERVAVIGGGETAASMLNEL FRHRVSTITVISPQVTLFTRGEGFFENSLFSDPTDWAALTFDERRDALARTDRGVFSA TVQEALLADDRIHHLRGRVAHAVGRQGQIRLTLSTNRGSENFETVHGFDLVIDGSGAD PLWFTSLFSQHTLDLLELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGP GFPNLSCLGLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR" gene complement(2657700..2662085) /gene="mbtF" /locus_tag="Rv2379c" /db_xref="GeneID:885874" CDS complement(2657700..2662085) /gene="mbtF" /locus_tag="Rv2379c" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS. PROBABLY ACTIVATES THE TWO LYSINE RESIDUES THAT ARE INCORPORATED INTO MYCOBACTIN (LYSINE LIGATION)." /note="Rv2379c, (MTCY27.01), len: 1461 aa. mbtF, peptide synthetase (see citations below), similar in part to several synthases e.g. O52820|PCZA363.4 PROTEIN from Amycolatopsis orientalis (4077 aa), FASTA scores: opt: 1873, E(): 1.1e-99, (35.55% identity in 1522 aa overlap); O07944|SNBDE PRISTINAMYCIN I SYNTHASE 3 AND 4 from Streptomyces pristinaespiralis (4848 aa), FASTA scores: opt: 1817, E(): 2.1e-96, (33.65% identity in 1463 aa overlap); O52821 PROTEIN SIMILAR TO PEPTIDE SYNTHETASE from Amycolatopsis orientalis (1860 aa) FASTA scores: opt: 1705, E(): 2.9e-90, (34.75% identity in 1344 aa overlap); Q9XCF2|PSTB PUTATIVE PEPTIDE SYNTHETASE (similar to Mycobacterium tuberculosis nrp protein) from Mycobacterium avium (2552 aa), FASTA scores: opt: 1687, E(): 4e-89, (35.45% identity in 1058 aa overlap); Q9ZET7 PEPTIDE SYNTHETASE (FRAGMENT) from Mycobacterium smegmatis (1438 aa), FASTA scores: opt: 1479, E(): 2.5e-77, (30.45% identity in 1507 aa overlap); etc. Contains PS00455 putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY." /codon_start=1 /transl_table=11 /product="peptide synthetase MBTF (peptide synthase)" /protein_id="NP_216895.1" /db_xref="GI:15609516" /db_xref="GOA:O05819" /db_xref="UniProtKB/TrEMBL:O05819" /db_xref="GeneID:885874" /translation="MGPVAVTRADARGAIDDVMALSPLQQGLFSRATLVAAESGSEAA EADPYVIAMAADAAGPLDIALLRDCAAAMLTRHPNLRASFLHGNLSRPVQVIPSSAEV LWRHVRAHPSEVGALAAEERRRRFDVGRGPLIRFLLIELPDECWHLVIVAHHIVIDGW SLPLFVSELLALYRAGGHVAALPAAPRPYRDYIGWLAGRDQTASRAMWADHLNGLDGP TLLSPALADTPVQPGIPGRTEVRLDREATAELADAARTRGVTISTLVQMAWATTLSAF TGRGDVTFGVTVSGRPSELSGVETMIGLFINTVPLRVRLDARATVGGQCAVLQRQFAM LRDHSYLGFNEFRAIAGIGEMFDTLLVYENFPPGEVVGTAEFVANGVTFRPVALESLS HFPVTVAAHRSTGELTLLVEVLDGALGTMAPESLGRRVLAVLQRLVSRWDRPLRDVDI LLDGEHDPTAPGLPDVTTSAPAVHTRFAEIAAAQPDSVAVSWADGQLTYRELDALADR LATGLRRADVSRETPVAVALSRGPRYVAAMLAVLKAGGMIVPLDPAMPGERVAEILRQ TSAPVVIDEGVFAASVGADILEEDRAITVPVDQAAYVIFTSGTTGTPKGVIGTHRALS AYADDHIERVLRPAAQRLGRPLRIAHAWSFTFDAAWQPLVALLDGHAVHIVDDHRQRD AGALVEAIDRFGLDMIDTTPSMFAQLHNAGLLDRAPLAVLALGGEALGAATWRMIQQN CARTAMTAFNCYGPTETTVEAVVAAVAEHARPVIGRPTCTTRAYVMDSWLRPVPDGVA GELYLAGAQLTRGYLGRPAETAARFVAEPNGRGSRMYRTGDVVRRLPDGGLEFLGRSD DQVKIRGFRVEPGEIAAVLNGHHAVHGCHVTARGHASGPRLTAYVAGGPQPPPVAELR AMLLERLPRYLVPHHIVVLDELPLTPHGKIDENALAAINVTEGPATPPQTPTELVLAE AFADVMETSNVDVTAGFLQMGLDSIVALSVVQAARRRGIALRARLMVECDTIRELAAA IDSDAAWQAPANDAGEPIPVLPNTHWLYEYGDPRRLAQTEVIRLPDRITRERLDAVLA AVVDGHEVLRCRFDRDAMALVAQPKTDILSEVWVSGELVTAVAEQTLGALASLDPQAG RLLSAVWLREPDGPGVLVLTAHVLAMDPASWRIVLGELDAGLHALAAGRAPSPARENT SYRQWSRLLAQRAKALDSVDFWVAELEGADPPLGARRVAPQTDRVGELAITMSISDAD LTARLLSTGRSMTDLLATAAARMVTAWRRQRGQQTPAPLLALETHGRADVHVDKTADT SDTVGLLSAIYPLRIHCDGATDFARIPGSGIDYGLLRYLRADTAERLRAHREPQLLLN YLGSLHVGVGDLAVDRALLADVGQLPEPEQPVRHELTVLAALLGPADAPVLATRWRTL PDILSADDVATLQSLWQGALAEITA" misc_feature complement(2660244..2660279) /gene="mbtF" /locus_tag="Rv2379c" /note="PS00455 Putative AMP-binding domain signature" gene complement(2662067..2667115) /gene="mbtE" /locus_tag="Rv2380c" /db_xref="GeneID:885822" CDS complement(2662067..2667115) /gene="mbtE" /locus_tag="Rv2380c" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS. PROBABLY ACTIVATES THE TWO LYSINE RESIDUES THAT ARE INCORPORATED INTO MYCOBACTIN (LYSINE LIGATION)." /note="Rv2380c, (MTCY22H8.05), len: 1682 aa. mbtE, peptide synthetase (see citations below), similar in part to several synthases e.g. O07944|SNBDE PRISTINAMYCIN I SYNTHASE 3 AND 4 from Streptomyces pristinaespiralis (4848 aa), FASTA scores: opt: 2635, E(): 1.9e-146, (36.8% identity in 1657 aa overlap); O05647|SNBDE VIRGINIAMYCIN S SYNTHETASE (FRAGMENT) from Streptomyces virginiae (1997 aa) FASTA scores: opt: 2580, E(): 1.6e-143, (40.65% identity in 1163 aa overlap); Q9R9I2|DHBF PROTEIN INVOLVED IN SIDEROPHORE PRODUCTION from Bacillus subtilis (2378 aa), FASTA scores: opt: 2388, E(): 3.6e-132, (33.9% identity in 1579 aa overlap); O68487|ACMB ACTINOMYCIN SYNTHETASE II from Streptomyces chrysomallus (2611 aa), FASTA scores: opt: 2165, E(): 4.9e-119, (35.0% identity in 1634 aa overlap); etc. Equivalent to AAK46743 from Mycobacterium tuberculosis strain CDC1551 (1787 aa) but shorter 105 aa. Contains PS00455 putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY." /codon_start=1 /transl_table=11 /product="peptide synthetase MBTE (peptide synthase)" /protein_id="NP_216896.1" /db_xref="GI:15609517" /db_xref="GOA:O86329" /db_xref="UniProtKB/TrEMBL:O86329" /db_xref="GeneID:885822" /translation="MWFVQMADPSGALLNICVSYRITGDIDLARLRDAVNAVARRHRI LRTTYPVGDDGVAQPTVHADLRPGWTQYDLTDLSQRAQRLRLEVLAQREFCAPFELSR DAPLRITVVRTAADEHVLLLVAHHIAWDDGSWRVFFTDLTQAYSRADLGADLGPEHRP SAASGPDTTEADLNYWRAIMADPPEPLELPGPAGTCVPTSWRAARATLRLPADTAARV ATMAKNTGCTPYMVLLAAFGALVHRYTHSDDFLVAAPVLNRGAGTEDAIGYFGNTVAM RLRPQSAMSFRELLTATRDIASGAFAHQRINLDRVVRELNPDRRHGAERMTRVSFGFR EPDGGGFNPPGIECERYDLRSNITQLPLGFMVEFDRAGVLVEAEHLVEILEPALAKQM LRHFGVLLDNALAAPDNTLSGLALMDERDAARLREVSRGERFDTPVKTLVDLVNEQTT RTPDATAVVYEGQHFTYHDLNEASNRLGHWLIEQGIGSEDRVAVLLDKSPDLIVTALG VVKSGAVYVPVDPSYPQDRLDFILADCDAKLVLRTPVRELAGYRSDDPTDADRIRPLR PDNTAYLIYTSGTTGLPKGVAVPHRPVAEYFVWFKGEYDVDDTDRLLQVASPSFDVSI AEIFGTLACGARMVIPRPGGLTDIGYLTALLRDEGITAMHFVPSLLGLFLSLPGVSQW RTLQRVPIGGEPLPGEVADKFHATFDALLHNFYGPTETVINASRFKVVGPQGTRIVPI GRPKINTTMHLLDDSLQPVPTGVIGEIYIGGTHVAYGYHRRAGLTAERFVADPFNPGS RMYRSGDLARRNADGDIEFVGRADEQVKIRGFRIELGDVAAAIAVDPTVGQAVVVVSD LPRLGKSLVGYVTPAAGGDGPADVGVDLDRIRARVAAALPEYMLPAAYVVLDEIPITA HGKIDRAALPEPQIASDTEFRAPQTATERRLAQLFGELLGRDRVGADDSFFDLGGHSL LATKLVAAVRNAFGVDVGVREIFEFATVTALAGHIDTLDSDSARPRLTRVDHDGPVRL SSSQMRSWFNYRFDGPNAVNNIPFAAALHGPCDTNAFAAAITDVVARHEILRTVYREI GGVPHQIIQPPAEVPVRCAAGSDAAWLRAELNNERGYVFDLETDWPIRAALLSTPEQT VLSLVVHHIAGDHWSAGVLFTDLLTAYRARSTGQRPSWAPLPVQYADYSVWQSALLDD GAGIVGPQRDYWIRQLGGLAGETGLRPDFPRPALLSGAGDAVEFRLGAAIRDKLAAVS RDLGVTEFMLLQAAVAVVLHKAGGGVDVPIGAPVAGRSEANLDQLIGFFINIVVLRND LRGNPTLREVLQRTRQMALAAYAHQDLPFDQVVEAVNPQRSLSRNPLFDIVVHVREQM PQDHVIDTGPDGDTTLRVLEPTFDAAQADLSVNFFACGDEYRGHVIYRTELYERATAQ RFADWLVRVVEAFADRPDQPLREVEMVSAQARRRILDRSNAGAGTARVYLLDDALKPV PVGVVGDVYYGGGPAVGARLARPSETATRFVADPFAAQPGSRLYRNGERGVWKADGQL ELLAEIERLPTAQAAPVPAEPADTETERALAAILADVLEVGEVGRYDDFFNLGGDSIL ATQVAARARDGGIPLTARMVFEHPVLCELAAAVDAKPHVEAEPDDKHHAPMSTSGLSP DELSALTASWDQWP" misc_feature complement(2664173..2664220) /gene="mbtE" /locus_tag="Rv2380c" /note="PS00012 Phosphopantetheine attachment site" misc_feature complement(2665364..2665399) /gene="mbtE" /locus_tag="Rv2380c" /note="PS00455 Putative AMP-binding domain signature" gene complement(2667255..2670269) /gene="mbtD" /locus_tag="Rv2381c" /db_xref="GeneID:885850" CDS complement(2667255..2670269) /gene="mbtD" /locus_tag="Rv2381c" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS." /note="Rv2381c, (MTCY22H8.04), len: 1004 aa. mbtD, polyketide synthase (see citations below), similar in part to several synthases e.g. Q03132|ERY2_SACER|ERYA ERYTHRONOLIDE SYNTHASE, MODULES 3 AND 4 (EC 2.3.1.94) from Saccharopolyspora erythraea (Streptomyces erythraeus) (3567 aa), FASTA scores: opt: 971, E(): 1e-46, (29.35% identity in 1043 aa overlap); Q9F829|MEGAII MEGALOMICIN 6-DEOXYERYTHRONOLIDE B SYNTHASE 2 from Micromonospora megalomicea subsp. nigra (3562 aa), FASTA scores: opt: 787, E(): 2.4e-36, (29.35% identity in 1032 aa overlap); Q9L4W4|NYSB POLYKETIDE SYNTHASE from Streptomyces noursei (3192 aa), FASTA scores: opt: 761, E(): 6.6e-35, (29.55% identity in 1086 aa overlap); O30764|NIDA1 POLYKETIDE SYNTHASE MODULES 1 AND 2 from Streptomyces caelestis (4340 aa), FASTA scores: opt: 726, E(): 7.8e-33, (27.3% identity in 1052 aa overlap); etc. Contains PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="polyketide synthetase MBTD (polyketide synthase)" /protein_id="NP_216897.1" /db_xref="GI:15609518" /db_xref="GOA:P71719" /db_xref="UniProtKB/TrEMBL:P71719" /db_xref="GeneID:885850" /translation="MAPKQLPDGRVAVLLSAHAEELIGPDARAIADYLERFPATTVTE VARQLRKTRRVRRHRAVLRAADRLELAEGLRALAAGREHPLIARSSLGSAPRQAFVFP GQGGHWPGMGAVAYRELPTYRTATDTCAAAFAAAGVDSPLPYLIAPPGTDERQAFCEI EIEGAQFVHAVALAEVWRSCGVLPDLTVGHSLGEVAAAYLAGSITLSDAVAVVAARAN VVGRLPGRYAVAALGIGEQDASALIATTGGWLELSVVNASSTVAVSGERQAVAAIVDT VRSSGHFARGITVGFPVHTSVLESLRDELCEQLPDSEFMEAPVQFIGGTTGDVVAPGT TFGDYWYANLRHTVRFDRAVESAIRCGARAFIEISAHPALLFAIGQNCEGAANLPDGP AVLVGSARRGERFVDALSANIVSAAVADPGYPWGDLGGDPLDGDVDLSGFPNAPMRAV PMWAHPEPLPPVSGLTIAVERWERMVPSTPVAGRHRHLAVLDLGAHRALAQTLCAAID SHPDTELSAARDAELILVIAPDFEHTDAVRAAGALADLVGAGLLDYPMHIGARCQSVC LVTVGAEQVDAADAVPSAGQAALAAMHRSIGFEHPEQTFSHLDLPSWDLDPVLGVSVI TAVLRGFGETALRGSVNGYTLFERTLADAPAVPNWSLDSGVLDDVVVTGGAGAIGMHY ARYLAEHGARRIVLLSRRAADQATVAMLRKQHGTVIVSPPCDITDPTQLSAIAAEYGG VGASLIVHAAGSVISGTAPGVTSAAVVDNFAAKVLGLAQMIELWPLRPDVRTLLCSSV MGVWGGHGVVAYSAANRLLDVMAAQLRAQGRHCVAVKWGLWQAPKAGEPARGIADAVT IARVERSGLRQMAPQQAIEASLHEFTVDPLVFAADAARLQMLLDSRQFERYEGPTDPN LTIVDAVRTQLAAVLGIPQAGEVNLQESLFDLGVDSMLALDLRNRLKRSIGATVSLAT LMGDITGDGLVAKLEDADERSHTAQKVDISRD" misc_feature complement(2667390..2667437) /gene="mbtD" /locus_tag="Rv2381c" /note="PS00012 Phosphopantetheine attachment site" gene complement(2670269..2671603) /gene="mbtC" /locus_tag="Rv2382c" /db_xref="GeneID:885908" CDS complement(2670269..2671603) /gene="mbtC" /locus_tag="Rv2382c" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS." /note="Rv2382c, (MTCY22H8.03), len: 444 aa. mbtC, polyketide synthase (see citations below), similar in part to several synthases e.g. Q9F7T9 AVERMECTIN POLYKETIDE SYNTHASE (FRAGMENT) from Streptomyces avermitilis (3626 aa), FASTA scores: opt: 1458, E(): 7e-82, (50.65% identity in 446 aa overlap); AAG23264|SPNA POLYKETIDE SYNTHASE LOADING AND EXTENDER MODULE 1 from Saccharopolyspora spinosa (2595 aa) FASTA scores: opt: 1441, E(): 6e-81, (49.1% identity in 446 aa overlap); O33954|TYLG TYLACTONE SYNTHASE STARTER MODULE AND MODULES 1 & 2 from Streptomyces fradiae (4472 aa) FASTA scores: opt: 1439, E(): 1.2e-80, (51.0% identity in 447 aa overlap); O30764|NIDA1 POLYKETIDE SYNTHASE MODULES 1 AND 2 from Streptomyces caelestis (4340 aa) FASTA scores: opt: 1432, E(): 3.3e-80, (50.9% identity in 442 aa overlap); etc." /codon_start=1 /transl_table=11 /product="polyketide synthetase MBTC (polyketide synthase)" /protein_id="NP_216898.1" /db_xref="GI:15609519" /db_xref="GOA:P71718" /db_xref="UniProtKB/TrEMBL:P71718" /db_xref="GeneID:885908" /translation="MSDNDPVVIVGLAIEAPGGVETADDYWTLLSEQREGLGPFPTDR GWALRELFDGSRRNGFKPIHNLGGFLSSATTFDPEFFRISPREATAMDPQQRVGLRVA WRTLENSGINPDDLAGHDVGCYVGASALEYGPALTEFSHHSGHLITGTSLGVISGRIA YTLDLAGPALTVDTSCSSALAAFHTAVQAIRAGDCDLALAGGVCVMGTPGYFVEFSKQ HALSDDGHCRPYSAHASGTAWAEGAAMFLLQRRSRATADRRRVLAEVRASCLNSDGLS DGLTAPSGDAQTRLLRRAIAQAAVVPADVGMVEGHGTATRLGDRTELRSLAASYGTAP AGRGPLLGSVKSNIGHAQAAAGGLGLVKVILAAQHAAIPPTLHVDEPSREIDWEKQGL RLADKLTPWRAVDGWRTAAVSAFGMSGTNSHVIVSMPDTVSAPERGPECGEV" gene complement(2671593..2675837) /gene="mbtB" /locus_tag="Rv2383c" /db_xref="GeneID:885838" CDS complement(2671593..2675837) /gene="mbtB" /locus_tag="Rv2383c" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS. THIS PEPTIDE SYNTHASE FORMS AMIDE BOUND BETWEEN THE CARBOXYLIC ACID OF SALICYLATE AND THE ALPHA-AMINO GROUP OF SERINE (SERINE/THREONINE LIGATION)." /note="Rv2383c, (MTCY22H8.02), len: 1414 aa. mbtB, phenyloxazoline synthase (see citations below), similar to the N-terminal region of several synthetases e.g. Q9EWP5|SC4C2.17 PUTATIVE NON-RIBOSOMAL PEPTIDE SYNTHASE from Streptomyces coelicolor (2229 aa), FASTA scores: opt: 2878, E(): 4.1e-156, (46.85% identity in 1138 aa overlap); Q9Z399|IRP2 YERSINIABACTIN BIOSYNTHETIC from Yersinia pestis (2041 aa), FASTA scores: opt: 2297, E(): 5.3e-123, (38.55% identity in 1069 aa overlap); P48633|HMP2_YEREN|IRP2 HIGH-MOLECULAR-WEIGHT PROTEIN 2 (MAY BE INVOLVED IN THE NONRIBOSOMAL SYNTHESIS OF SMALL PEPTIDES) from Yersinia enterocolitica (2035 aa), FASTA scores: opt: 2275, E(): 9.4e-122, (38.45% identity in 1069 aa overlap); O85739|PCHE|PA4226 DIHYDROAERUGINOIC ACID SYNTHETASE from Pseudomonas aeruginosa (1438 aa) FASTA scores: opt: 2236, E(): 1.2e-119, (38.2% identity in 1330 aa overlap); Q9RFM8|PCHE PYOCHELIN SYNTHETASE from Pseudomonas aeruginosa (1438 aa), FASTA scores: opt: 2229, E(): 3e-119, (38.0% identity in 1329 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY." /codon_start=1 /transl_table=11 /product="phenyloxazoline synthase MBTB (phenyloxazoline synthetase)" /protein_id="NP_216899.1" /db_xref="GI:15609520" /db_xref="GOA:P71717" /db_xref="UniProtKB/TrEMBL:P71717" /db_xref="GeneID:885838" /translation="MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMS LVGRWRRKGIAVDFATLAATPTIEAWSQLVSAGTGVAPTAVAAPGDAGLSQEGEPFPL APMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALRHPMLRVQFLP DGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDAKSHQQLDGAVFELALTLL PGERTRLHVDLDMQAADAMSYRILLADLAALYDGREPPALGYTYREYRQAIEAEETLP QPVRDADRDWWAQRIPQLPDPPALPTRAGGERDRRRSTRRWHWLDPQTRDALFARARA RGITPAMTLAAAFANVLARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVD LTGARTAAARAQAVQEALRSAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGD LFCPDVTEQFGTPGWIISQGPQVLLDAQVTEFDGGVLVNWDVREGVFAPGVIDAMFTH QVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPSTEALHDGFFRQAQQQP DAPAVFASSGDLSYAQLRDQASAVAAALRAAGLRVGDTVAVLGPKTGEQVAAVLGILA AGGVYLPIGVDQPRDRAERILATGSVNLALVCGPPCQVRVPVPTLLLADVLAAAPAEF VPGPSDPTALAYVLFTSGSTGEPKGVEVAHDAAMNTVETFIRHFELGAADRWLALATL ECDMSVLDIFAALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEV GGGRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFEVQDAAN LPPDWASVPYGVPFPNNACRVVADSGDDCPDWVAGELWVSGRGIARGYRGRPELTAER FVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKISGYRVELGEIEAALQRLPGVHA AAATVLPGGSDVLAAAVCVDDAGVTAESIRQQLADLVPAHMIPRHVTLLDRIPFTDSG KIDRAEVGALLAAEVERSGDRSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFF ALGGDSVLATQVVAGIRRWLDSPSLMVADMFAARTIAALAQLLTGREANADRLELVAE VYLEIANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAAAAYR WLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDWHLTAPLTLFGHC MGAIVAFEFARLAERNGVPVRALWASSGQAPSTVAASGPLPTADRDVLADMVDLGGTD PVLLEDEEFVELLVPAVKADYRALSGYSCPPDVRIRANIHAVGGNRDHRISREMLTSW ETHTSGRFTLSHFDGGHFYLNDHLDAVARMVSADVR" misc_feature complement(2672526..2672573) /gene="mbtB" /locus_tag="Rv2383c" /note="PS00012 Phosphopantetheine attachment site" misc_feature complement(2673720..2673755) /gene="mbtB" /locus_tag="Rv2383c" /note="PS00455 Putative AMP-binding domain signature" gene 2675936..2677633 /gene="mbtA" /locus_tag="Rv2384" /db_xref="GeneID:885833" CDS 2675936..2677633 /gene="mbtA" /locus_tag="Rv2384" /EC_number="6.-.-.-" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS (INITIATION STEP OF MYCOBACTIN CHAIN GROWTH). ACTIVATES THE MYCOBACTIN ArCP IN TWO HALF-REACTIONS: ACTIVATES SALICYLIC ACID AS ACYLADENYLATE (ADENYLATION STEP) + TRANSFERS ACTIVATED SALICYLATE TO THE MBTA ArCP AS A THIOESTER (ARYLATION STEP)." /note="Rv2384, (MTCY22H8.01, MTCY253.37c), len: 565 aa. mbtA, bifunctional enzyme, including salicyl-AMP ligase (Sal-AMP ligase) (EC 6.-.-.-) and salicyl-S-ArCP synthetase (see Quadri et al ., 1998; De Voss et al., 1999), highly similar to other ligases e.g. Q9F638|MXCE from Stigmatella aurantiaca 2,3-DHBA-AMP ligase (protein involved in the biosynthesis of 2,3-dihydroxybenzoic acid, contains the AMP binding signature) (543 aa), FASTA scores: opt: 1683, E(): 2.8e-90, (48.25% identity in 545 aa overlap) (see Silakowski et al., 2000); P40871|DHBE_BACSU|ENTE 2,3-DIHYDROXYBENZOATE-AMP LIGASE (EC 6.3.2.-) from Bacillus subtilis (539 aa), FASTA scores: opt: 1569, E(): 1.2e-83, (44.9% identity in 532 aa overlap); O07899|VIBE_VIBCHVC0772 VIBRIOBACTIN-SPECIFIC 2,3-DIHYDROXYBENZOATE-AMP LIGASE from Vibrio cholerae (543 aa), FASTA scores: opt: 1457, E(): 3.7e-77, (44.6% identity in 545 aa overlap); etc. Also similar to P95819|SNBA PRISTINAMYCIN I SYNTHETASE I from Streptomyces pristinaespiralis (582 aa), FASTA scores: opt: 1532, E(): 1.7e-81, (46.35% identity in 548 aa overlap); and Q9RFM9|PCHD SALICYL-AMP LIGASE from Pseudomonas aeruginosa (547 aa), FASTA scores: opt: 1415, E(): 1e-74, (45.95% identity in 533 aa overlap). Contains PS00455 Putative AMP-binding domain signature. BELONGS TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY." /codon_start=1 /transl_table=11 /product="bifunctional salicyl-AMP ligase/salicyl-S-arcp synthetase" /protein_id="NP_216900.1" /db_xref="GI:15609521" /db_xref="GOA:P71716" /db_xref="UniProtKB/TrEMBL:P71716" /db_xref="GeneID:885833" /translation="MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDT VLSDAARRWPDRLAVADAGDRPGHGGLSYAELDQRADRAAAALHGLGITPGDRVLLQL PNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVADVASGFDYRPM ARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAPPADPGSPALLLVSGGTTG MPKLIPRTHDDYVFNATASAALCRLSADDVYLVVLAAGHNFPLACPGLLGAMTVGATA VFAPDPSPEAAFAAIERHGVTVTALVPALAKLWAQSCEWEPVTPKSLRLLQVGGSKLE PEDARRVRTALTPGLQQVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNAD GEPVGPGEEGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTG RVKDVICRAGETIAASDLEEQLLSHPAIFSAAAVGLPDQYLGEKICAAVVFAGAPITL AELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVRQLGIATGPVTTQRCH" misc_feature 2676563..2676598 /gene="mbtA" /locus_tag="Rv2384" /note="PS00455 Putative AMP-binding domain signature" gene 2677729..2678649 /gene="mbtJ" /locus_tag="Rv2385" /db_xref="GeneID:885927" CDS 2677729..2678649 /gene="mbtJ" /locus_tag="Rv2385" /EC_number="3.1.1.-" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS. POSSIBLY REQUIRED FOR N-HYDROXYLATION OF THE TWO LYSINE RESIDUES AT SOME STAGE DURING MYCOBACTIN ASSEMBLY." /note="Rv2385, (MTCY253.36c), len: 306 aa. Putative mbtJ, acetyl hydrolase (EC 3.1.1.-) (see citations below), showing some similarity with various hydrolases including acetyl hydrolases e.g. Q9ZBM4|MLCB1450.08|ML0314 PUTATIVE HYDROLASE/ESTERASE from Mycobacterium leprae (335 aa), FASTA scores: opt: 449, E(): 6.7e-21, (33.85% identity in 313 aa overlap); AAK47950|MT3591 Esterase from M. tuberculosis strain CDC1551 (327 aa), FASTA scores: opt: 469, E(): 3.6e-22, (35% identity in 283 aa overlap); Q9X8J4|SCE9.22 PUTATIVE ESTERASE from Streptomyces coelicolor (266 aa), FASTA scores: opt: 430,E(): 8.5e-20, (38% identity in 245 aa overlap); Q01109|BAH_STRHY ACETYL-HYDROLASE (EC 3.1.1.-) from Streptomyces hygroscopicus (299 aa), FASTA scores: opt: 420, E(): 4e-19, (35.1% identity in 265 aa overlap). Equivalent to AAK46748 from Mycobacterium tuberculosis strain CDC1551 (327 aa) but shorter 21 aa. Note that previously known as lipK.; lipK" /codon_start=1 /transl_table=11 /product="putative acetyl hydrolase MBTJ" /protein_id="YP_177876.1" /db_xref="GI:57116980" /db_xref="GOA:Q79FE8" /db_xref="UniProtKB/TrEMBL:Q79FE8" /db_xref="GeneID:885927" /translation="MVLRPITGAIPPDGPWGIWASRRIIAGLMGTFGPSLAGTRVEQV NSVLPDGRRVVGEWVYGPHNNAINAGPGGGAIYYVHGSGYTMCSPRTHRRLTSWLSSL TGLPVFSVDYRLAPRYRFPTAATDVRAAWDWLAHVCGLAAEHMVIAADSAGGHLTVDM LLQPEVAARPPAAVVLFSPLIDLTFRLGASRELQRPDPVVRADRAARSVALYYTGVDP AHHRLALDVAGGPPLPPTLIQVGGAEILEADARQLDADIRAAGGICELQVWPDQMHVF QALPRMTPEAAKAMTYVAQFIRSTTARGDL" gene complement(2678653..2680005) /gene="mbtI" /locus_tag="Rv2386c" /db_xref="GeneID:885823" CDS complement(2678653..2680005) /gene="mbtI" /locus_tag="Rv2386c" /function="INVOLVED IN THE BIOGENESIS OF THE HYDROXYPHENYLOXAZOLINE-CONTAINING SIDEROPHORE MYCOBACTINS. POSSIBLY PLAYS A ROLE IN THE CONVERSION OF CHORISMATE TO SALICILATE (THE STARTER UNIT FOR MYCOBACTIN SIDEROPHORE CONSTRUCTION)." /note="catalyzes conversion of chorismate to salicylate, in mycobactin siderophore construction; requires Mg(2+) for function" /codon_start=1 /transl_table=11 /product="salicylate synthase MbtI" /protein_id="YP_177877.1" /db_xref="GI:57116981" /db_xref="GOA:Q7D785" /db_xref="UniProtKB/TrEMBL:Q7D785" /db_xref="GeneID:885823" /translation="MSELSVATGAVSTASSSIPMPAGVNPADLAAELAAVVTESVDED YLLYECDGQWVLAAGVQAMVELDSDELRVIRDGVTRRQQWSGRPGAALGEAVDRLLLE TDQAFGWVAFEFGVHRYGLQQRLAPHTPLARVFSPRTRIMVSEKEIRLFDAGIRHREA IDRLLATGVREVPQSRSVDVSDDPSGFRRRVAVAVDEIAAGRYHKVILSRCVEVPFAI DFPLTYRLGRRHNTPVRSFLLQLGGIRALGYSPELVTAVRADGVVITEPLAGTRALGR GPAIDRLARDDLESNSKEIVEHAISVRSSLEEITDIAEPGSAAVIDFMTVRERGSVQH LGSTIRARLDPSSDRMAALEALFPAVTASGIPKAAGVEAIFRLDECPRGLYSGAVVML SADGGLDAALTLRAAYQVGGRTWLRAGAGIIEESEPEREFEETCEKLSTLTPYLVARQ" gene 2680765..2682018 /locus_tag="Rv2387" /db_xref="GeneID:885302" CDS 2680765..2682018 /locus_tag="Rv2387" /function="UNKNOWN" /note="Rv2387, (MTCY253.34c), len: 417 aa. Conserved hypothetical protein, showing some similarities with others e.g. Q9K663|BH3869 HYPOTHETICAL PROTEIN from Bacillus halodurans (337 aa), FASTA scores: opt: 343, E(): 4.8e-14, (29.0% identity in 400 aa overlap); AAK25471|CC3509 HYPOTHETICAL PROTEIN from Caulobacter crescentus (365 aa), FASTA scores: opt: 282, E(): 3.2e-10, (32.6% identity in 399 aa overlap); P73953|SLR1512 [D90911_21] CONSERVED HYPOTHETICAL PROTEIN from Synechocystis sp. strain PCC6803 (374 aa), FASTA scores: opt: 230, E(): 5.5e-07; (24.75% identity in 408 aa overlap); etc. Contains PS00213 Lipocalin signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216903.1" /db_xref="GI:15609524" /db_xref="UniProtKB/TrEMBL:P71757" /db_xref="GeneID:885302" /translation="MLHEFWVNFTHNLFKPLLLFFYFGFLIPIFKVRFEFPYVLYQGL TLYLLLAIGWHGGEELAKIKPSNVGAIVGFMVVGFALNFVIGTLAYFLLSKLTAMRRV DRATVAGYYGSDSAGTFATCVAVLTSVGMAFDAYMPVMLAVMEIPGCLVALYLVARLR HRGMNEAGYMADEPGYTTAAMIGAGPGTPARPAHSDSLTAQAERGIEEELELSLEKRE HPNWDEDGVKDSGTNASIFSRELLQEVFLNPGLVLLFGGIVIGLISGLQGQKVLHDDD NFFVAAFQGVLCLFLLEMGMTASRKLKDLASAGSGFVFFGLLAPNLFATLGIIVAHGY AYVTNNDFAPGTYVLFAVLCGAASYIAVPAVQRLAIPEASPTLPLAASLGLTFSYNVT IGIPLYIEIARIVGQWFPATGASIG" misc_feature 2681962..2681997 /locus_tag="Rv2387" /note="PS00213 Lipocalin signature" gene complement(2682015..2683142) /gene="hemN" /locus_tag="Rv2388c" /db_xref="GeneID:885300" CDS complement(2682015..2683142) /gene="hemN" /locus_tag="Rv2388c" /EC_number="1.3.3.-" /function="INVOLVED IN PORPHYRIN BIOSYNTHESIS. ANAEROBIC TRANSFORMATION OF COPROPORPHYRINOGEN-III INTO PROTOPORPHYRINOGEN-IX." /note="catalyzes the oxygen-independent formation of protoporphyrinogen-IX from coproporphyrinogen-III" /codon_start=1 /transl_table=11 /product="coproporphyrinogen III oxidase" /protein_id="NP_216904.1" /db_xref="GI:15609525" /db_xref="GOA:P71756" /db_xref="UniProtKB/Swiss-Prot:P71756" /db_xref="GeneID:885300" /translation="MPGQPFGVYLHVPFCLTRCGYCDFNTYTPAQLGGVSPDRWLLAL RAELELAAAKLDAPTVHTVYVGGGTPSLLGGERLATLLDMVRDHFVLAPDAEVSTEAN PESTWPEFFATIRAAGYTRVSLGMQSVAPRVLATLDRVHSPGRAAAAATEAIAEGFTH VNLDLIYGTPGESDDDLVRSVDAAVQAGVDHVSAYALVVEHGTALARRVRRGELAAPD DDVLAHRYELVDARLSAAGFAWYEVSNWCRPGGECRHNLGYWDGGQWWGAGPGAHGYI GVTRWWNVKHPNTYAEILAGATLPVAGFEQLGADALHTEDVLLKVRLRQGLPLARLGA AERERAEAVLADGLLDYHGDRLVLTGRGRLLADAVVRTLLG" gene complement(2683248..2683712) /gene="rpfD" /locus_tag="Rv2389c" /db_xref="GeneID:885246" CDS complement(2683248..2683712) /gene="rpfD" /locus_tag="Rv2389c" /function="PROMOTES THE RESUSCITATION AND GROWTH OF DORMANT, NONGROWING CELL. COULD ALSO STIMULATES THE GROWTH OF SEVERAL OTHER HIGH G+C GRAM+ ORGANISMS, e.g. Mycobacterium avium, Mycobacterium bovis (BCG), Mycobacterium kansasii, Mycobacterium smegmatis." /note="Rv2389c, (MTCY253.32), len: 154 aa. Probable rpfD, resuscitation-promoting factor. Possible autocrine and/or paracrine bacterial growth factor or cytokine (see citation below). Similar to others from Mycobacterium tuberculosis e.g. O07747|Rv1884c|MTCY180.34|RPFC PROBABLE RESUSCITATION-PROMOTING FACTOR from Mycobacterium tuberculosis (176 aa), FASTA scores: opt: 382, E(): 2.3e-17, (55.45% identity in 101 aa overlap); etc. Also similarity with Q9CBF8|ML2030 HYPOTHETICAL PROTEIN from Mycobacterium leprae (157 aa), FASTA scores: opt: 397, E(): 2.4e-18, (47.95% identity in 121 aa overlap); Q9F2Q2|SCE41.06c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 341, E(): 1.1e-14, (40.45% identity in 131 aa overlap); and O86308|Z96935|MLRPF_1 RPF PROTEIN PRECURSOR from Micrococcus luteus (220 aa), FASTA scores: opt: 301, E(): 3.6e-12, (39.4% identity in 132 aa overlap). Contains a secretory signal sequence in N-terminus. Supposed acts at very low concentration." /codon_start=1 /transl_table=11 /product="resuscitation-promoting factor RpfD" /protein_id="NP_216905.1" /db_xref="GI:15609526" /db_xref="UniProtKB/TrEMBL:P71755" /db_xref="GeneID:885246" /translation="MTPGLLTTAGAGRPRDRCARIVCTVFIETAVVATMFVALLGLST ISSKADDIDWDAIAQCESGGNWAANTGNGLYGGLQISQATWDSNGGVGSPAAASPQQQ IEVADNIMKTQGPGAWPKCSSCSQGDAPLGSLTHILTFLAAETGGCSGSRDD" gene complement(2683709..2684266) /locus_tag="Rv2390c" /db_xref="GeneID:885866" CDS complement(2683709..2684266) /locus_tag="Rv2390c" /function="UNKNOWN" /note="Rv2390c, (MTCY253.31), len: 185 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis proteins Q11032|YD62_MYCTU|MTCY02B10.26c|Rv1362c hypothetical 23.5 kDa protein (220 aa), FASTA scores: opt: 223, E(): 2.1e-07, (27.4% identity in 190 aa overlap); and Q11033|YD63_MYCTU|MTCY02B10.27c|Rv1363c hypothetical 28.3 kDa protein (261 aa), FASTA scores: opt: 238, E(): 2.7e-08, (27.6% identity in 163 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216906.1" /db_xref="GI:15609527" /db_xref="UniProtKB/TrEMBL:P71754" /db_xref="GeneID:885866" /translation="MAIFGRGHGASEPGGTGEPAETPGRGRLTRSVIGWVGAVAVVVS LAGSGWCGWVLFEKHQTDVAAGQALQAARSYVVKLATMDCERIDHNMRDILEGSTGEF KDKYGKSSAHLRQLLADNRVATHGTVVAASVKSATTNKVVVLMFIDQSVSNRNSPTPQ IDRSRIKVIMDKVNGRWLASKVELL" gene 2684679..2686370 /gene="nirA" /locus_tag="Rv2391" /db_xref="GeneID:885472" CDS 2684679..2686370 /gene="nirA" /locus_tag="Rv2391" /EC_number="1.7.7.1" /function="GENERATES NITRITE FROM AMMONIA USING OXIDIZED FERREDOXIN [CATALYTIC ACTIVITY: AMMONIA + H(2)O + OH(-) + 3 OXIDIZED FERREDOXIN = NITRITE + 3 REDUCED FERREDOXIN]." /note="Rv2391, (MTCY253.30c), len: 563 aa. Probable nirA, ferredoxin-dependent nitrite reductase (EC 1.7.7.1), similar to many nitrite/nitrate reductases e.g. CAC33947|SCBAC1A6.26c Putative nitrite/sulphite reductase from Streptomyces coelicolor (565 aa), FASTA scores: opt: 2335, E(): 1.2e-137, (60.1% identity in 567 aa overlap); Q9RZD6|DRA0013 FERREDOXIN-NITRITE REDUCTASE from Deinococcus radiodurans (563 aa), FASTA scores: opt: 1141, E(): 2.2e-63, (39.6% identity in 533 aa overlap); Q59656|NIRA (D31732|PEENIRNRT_1) ferredoxin-dependent NITRITE REDUCTASE from Plectonema boryanum (654 aa) (see Suzuki & Kikuchi 1995), FASTA scores: opt: 805, E(): 1.9e-42, (31.7% identity in 517 aa overlap); Q55366|NIRA|SLR0898 FERREDOXIN-NITRITE REDUCTASE from Synechocystis sp. strain PCC 6803 (502 aa), FASTA scores: opt: 799, E(): 3.7e-42, (32.3% identity in 517 aa overlap); etc. Highly similar (only in N-terminal part because shortened protein (fragment) owing to an IS900 insertion) to Q9K541|NIRA NITRATE REDUCTASE (FRAGMENT) from Mycobacterium paratuberculosis (198 aa), FASTA scores: opt: 798, E(): 2.1e-42, (65.4% identity in 182 aa overlap) (see Bull et al., 2000)." /codon_start=1 /transl_table=11 /product="ferredoxin-dependent nitrite reductase NIRA" /protein_id="NP_216907.1" /db_xref="GI:15609528" /db_xref="GOA:P71753" /db_xref="UniProtKB/TrEMBL:P71753" /db_xref="GeneID:885472" /translation="MSAKENPQMTTARPAKARNEGQWALGHREPLNANEELKKAGNPL DVRERIENIYAKQGFDSIDKTDLRGRFRWWGLYTQREQGYDGTWTGDDNIDKLEAKYF MMRVRCDGGALSAAALRTLGQISTEFARDTADISDRQNVQYHWIEVENVPEIWRRLDD VGLQTTEACGDCPRVVLGSPLAGESLDEVLDPTWAIEEIVRRYIGKPDFADLPRKYKT AISGLQDVAHEINDVAFIGVNHPEHGPGLDLWVGGGLSTNPMLAQRVGAWVPLGEVPE VWAAVTSVFRDYGYRRLRAKARLKFLIKDWGIAKFREVLETEYLKRPLIDGPAPEPVK HPIDHVGVQRLKNGLNAVGVAPIAGRVSGTILTAVADLMARAGSDRIRFTPYQKLVIL DIPDALLDDLIAGLDALGLQSRPSHWRRNLMACSGIEFCKLSFAETRVRAQHLVPELE RRLEDINSQLDVPITVNINGCPNSCARIQIADIGFKGQMIDDGHGGSVEGFQVHLGGH LGLDAGFGRKLRQHKVTSDELGDYIDRVVRNFVKHRSEGERFAQWVIRAEEDDLR" gene 2686367..2687131 /gene="cysH" /locus_tag="Rv2392" /db_xref="GeneID:885250" CDS 2686367..2687131 /gene="cysH" /locus_tag="Rv2392" /EC_number="1.8.4.8" /function="INVOLVED IN THE SULFATE ACTIVATION PATHWAY (AT THE THIRD STEP) IN THE REDUCTIVE BRANCH OF THE CYSTEINE BIOSYNTHETIC PATHWAY. REDUCES ACTIVATED SULFATE INTO SULFITE [CATALYTIC ACTIVITY: 5-PHOSPHOADENOSINE 3-PHOSPHOSULFATE + REDUCED THIOREDOXIN = PHOSPHOADENOSINE PHOSPHATE + OXIDIZED THIOREDOXIN + SULFITE]." /note="catalyzes the reduction of 3'-phosphoadenylyl sulfate into sulfite" /codon_start=1 /transl_table=11 /product="phosphoadenosine phosphosulfate reductase" /protein_id="NP_216908.1" /db_xref="GI:15609529" /db_xref="GOA:P65668" /db_xref="UniProtKB/Swiss-Prot:P65668" /db_xref="GeneID:885250" /translation="MSGETTRLTEPQLRELAARGAAELDGATATDMLRWTDETFGDIG GAGGGVSGHRGWTTCNYVVASNMADAVLVDLAAKVRPGVPVIFLDTGYHFVETIGTRD AIESVYDVRVLNVTPEHTVAEQDELLGKDLFARNPHECCRLRKVVPLGKTLRGYSAWV TGLRRVDAPTRANAPLVSFDETFKLVKVNPLAAWTDQDVQEYIADNDVLVNPLVREGY PSIGCAPCTAKPAEGADPRSGRWQGLAKTECGLHAS" gene 2687128..2687973 /locus_tag="Rv2393" /db_xref="GeneID:885508" CDS 2687128..2687973 /locus_tag="Rv2393" /function="UNKNOWN" /note="Rv2393, (MTCY253.28c), len: 281 aa. Conserved hypothetical protein, with some similarity to Q9L2E8|SC7A8.10c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (274 aa), FASTA scores: opt: 407, E(): 2.8e-18, (37% identity in 246 aa overlap); CAC38793|SCI39.05 Conserved hypothetical protein from Streptomyces coelicolor (305 aa), FASTA scores: opt: 394, E(): 2e-17, (35.0% identity in 251 aa overlap); AAK44492|MT0272 Chalcone/stilbene synthase family protein from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 350, E(): 9.2e-15, (34.0% identity in 235 aa overlap); P95216|Rv0259c|MTCY06A4.03c|Z86089 hypothetical protein from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 345, E(): 1.9e-14,(33.6% identity in 235 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216909.1" /db_xref="GI:15609530" /db_xref="GOA:P71751" /db_xref="UniProtKB/TrEMBL:P71751" /db_xref="GeneID:885508" /translation="MTAPATMQSAAMLRSGAIEAPPATMQSAAMRWGHLPLAEESGTI APQLVLTAHGSKDPRSAANARAIAGRLARMRPGLDVRVAFCELNSPNLVDVLNRCRGA AVVTPLLLADAYHARVDIPAQIASCRVGHRVRQASVLGEDIRLVSALHERLTELGVSP FDHTLGVVVLAIGSSHPAANARTSTVASRLAEGTQWAAVTTAFITRPEASLADATDRL RRHGARRMVIAPWLLAPGILSDRVRGYAREAGIAMAQPLGAHPMVAATMWDRYRQAVA GRIAA" repeat_region 2687128..2687179 /note="52 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region 2687180..2687257 /note="78 bp Mycobacterial Interspersed Repetitive Unit, Class I" gene 2688010..2689941 /gene="ggtB" /locus_tag="Rv2394" /db_xref="GeneID:885867" CDS 2688010..2689941 /gene="ggtB" /locus_tag="Rv2394" /EC_number="2.3.2.2" /function="PLAYS A KEY ROLE IN THE GAMMA-GLUTAMYL CYCLE, A PATHWAY FOR THE SYNTHESIS AND DEGRADATION OF GLUTATHIONE [CATALYTIC ACTIVITY: 5-L-GLUTAMYL)-PEPTIDE + AN AMINO ACID = PEPTIDE + 5-L-GLUTAMYL-AMINO ACID]." /note="Rv2394, (MTCY253.27c), len: 643 aa. Probable ggtB, gamma-glutamyltranspeptidase precursor (EC 2.3.2.2), similar to many e.g. Q9KVF2|VC0194 from Vibrio cholerae (588 aa), FASTA scores: opt: 943, E(): 7.5e-47, (40.0% identity in 597 aa overlap); O69935|SC3C8.26 from Streptomyces coelicolor (603 aa), FASTA scores: opt: 822, E(): 7.2e-40, (33.6% identity in 622 aa overlap); P54422|GGT_BACSU from Bacillus subtilis (587 aa) FASTA scores: opt: 491, E(): 8.2e-21, (33.4% identity in 574 aa overlap); etc. Has potential signal peptide and appropriately positioned prokaryotic lipoprotein attachment site (PS00013)." /codon_start=1 /transl_table=11 /product="gamma-glutamyltranspeptidase precursor GgtB" /protein_id="NP_216910.1" /db_xref="GI:15609531" /db_xref="GOA:P71750" /db_xref="UniProtKB/TrEMBL:P71750" /db_xref="GeneID:885867" /translation="MSVWLRAGALVAAVMLSLSGCGGFHAGAPSTAGPCEIVPNGTPA PKTPPATVPSSRNLATNPEIATGYRRDMTVVRTAHYAAATANPLATQVACRVLRDGGT AADAVVAAQAVLGLVEPQSSGIGGGGYLVYFDARTGSVQAYDGREVAPAAATENYLRW VSDVDRSAPRPNARASGRSIGVPGILRMLEMVHNEHGRTPWRDLFGPAVTLADGGFDI SARMGAAISDAAPQLRDDPEARKYFLNPDGSPKPAGTRLTNPAYSKTLSAIASAGANA FYSGDIAHDIVAAASDTSNGRTPGLLTIEDLAGYLAKRRQPLCTTYRGREICGMPSSG GVAVAATLGILEHFPMSDYAPSKVDLNGGRPTVMGVHLIAEAERLAYADRDQYIADVD FVRLPGGSLTTLVDPGYLAARAALISPQHSMGSARPGDFGAPTAVAPPVPEHGTSHLS VVDSYGNAATLTTTVESSFGSYHLVDGFILNNQLSDFSAEPHATDGSPVANRVEPGKR PRSSMAPTLVFDHSSAGRGALYAVLGSPGGSMIIQFVVKTLVAMLDWGLNPQQAVSLV DFGAANSPHTNLGGENPEINTSDDGDHDPLVQGLRALGHRVNLAEQSSGLSAITRSEA GWAGGADPRREGAVMGDDA" gene 2690072..2692075 /locus_tag="Rv2395" /db_xref="GeneID:885303" CDS 2690072..2692075 /locus_tag="Rv2395" /function="UNKNOWN (POSSIBLY INVOLVED IN TRANSPORT ACROSS THE MEMBRANE)." /note="Rv2395, (MTCY253.26c), len: 667 aa. Probable conserved integral membrane protein, similar to AAK24613|CC2646 OLIGOPEPTIDE TRANSPORTER/OPT FAMILY PROTEIN from Caulobacter crescentus (666 aa), FASTA scores: opt: 1638, E(): 4.8e-86, (51.0% identity in 658 aa overlap); Q9PIS5|CJ0204 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Campylobacter jejuni (665 aa), FASTA scores: opt: 1484, E(): 2.9e-77, (40.6% identity in 658 aa overlap); and P44016|Y561_HAEIN hypothetical integral membrane protein from Haemophilus influenzae (635 aa), FASTA scores: opt: 1449, E(): 2.8e-75, (42.15% identity in 624 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216911.1" /db_xref="GI:15609532" /db_xref="UniProtKB/TrEMBL:P71749" /db_xref="GeneID:885303" /translation="MSGATVGAREITIRGVVLGALITLVFTAANVYLGLRVGLTFATS IPAAVISMGVLRLFANHSVVENNIVQTIASAAGTLSSIIFVLPALLMIGWWSGFPYWT TAAVCALGGILGVMYSIPLRRALVTGSDLPYPEGVAGAEVLKIGDSAREMEHNRRGIG VIALGAAAAAGYALLASLRVINNSLSATFRVGSGATMIGASLSLALIGVGHLVGVTVG VAMIVGLAIAFGVMLPIRTAGQLPPDGDYAVAVARIFSTDVRFIGAGAIAVAAAWTFL KILGPILRGIADAAVSARTRRRGQAVGQTERDIPIHIVAMVVLLSLIPIGWLLADFTD GTPLDDRRPGAIAAGVLLVLVIGLMVAAVCGYMAGLIGSSNSPISGVGILVVVLAGLL IKTAYGPATGSQIPALVAYTVFTAALVFGVATISNDNLQDLKTGQLVGATPWKQQVAL IIGVLVGSVVMAPILQLMQAGFGFQGAPGATANALAAPQAALMSALAKGVFGGSLNWS LVGVGALTGVIAVALDETLAKTTTNLRLPPLAVGMGMYLSAALTLMIPIGAFLGRIYD SWARWSGDDDERKKRLGVMLATGLIVGESLYGVLFAVIVATTGKEEPLAMVGDGFRFA SQPLGAIVFAGLLAWLYQRTRVTASYRLAAPAGSSKPLPDLPG" gene 2692799..2693884 /gene="PE_PGRS41" /locus_tag="Rv2396" /db_xref="GeneID:885517" CDS 2692799..2693884 /gene="PE_PGRS41" /locus_tag="Rv2396" /function="UNKNOWN" /note="Rv2396, (MTCY253.25c), len: 361 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many e.g. AAK47132|MT2812 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1551 (454 aa), FASTA scores: opt: 1256, E(): 2.4e-44, (56.0% identity in 377 aa overlap); AAK46139|MT1866 PE_PGRS FAMILY PROTEIN from M. tuberculosis strain CDC1551 (491 aa), FASTA scores: opt: 1250, E(): 4.4e-44, (57.8% identity in 372 aa overlap); Y278_MYCTU|Rv0278C|MTV035.06c HYPOTHETICAL PE-PGRS FAMILY PROTEIN (957 aa), FASTA scores: opt: 1253, E(): 5.2e-44, (55.5% identity in 400 aa overlap); P71664|Rv1396c|MTCY21B4.13c HYPOTHETICAL GLYCINE-RICH 47.9 KDA PROTEIN (576 aa), FASTA scores: opt: 1236, E(): 1.8e-43, (55.55% identity in 402 aa overlap); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177878.1" /db_xref="GI:57116982" /db_xref="UniProtKB/TrEMBL:Q79FE6" /db_xref="GeneID:885517" /translation="MSFLIASPEALAATATYLTGIGSAISAANAVAAAPTTEILAAGT DEVSTAISALFGAHAQAYQALSAHVAAFHDQFVHTLTAGAGSYMAAEAAAASPLQALQ LELLNAINAPTLALLGRPLIGDGTDAAPGSGGAGGAGGILIGNGGTGGASDLAGTGRG GVGGAGGAGGLFGIGGAGGGCGSAVAIGGDGGAGGAGGVFSGGGAGGAGDAIGGSGGA GGTGGLLGGGGGAGGAGGAGGNGGGASNSASIGGDGGSGGAGGMLYGAGGVGGNGGAA VAIGGDGGAGGRAGAIGNGGDGGNGGTSNTPGGSGGDGGNGGNAGLIGNGGNGGNAEI VISGGSVAGTGGNGGLLLGFNGTNGLP" misc_feature 2693519..2693593 /gene="PE_PGRS41" /locus_tag="Rv2396" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(2693909..2694964) /gene="cysA1" /locus_tag="Rv2397c" /db_xref="GeneID:885663" CDS complement(2693909..2694964) /gene="cysA1" /locus_tag="Rv2397c" /function="INVOLVED IN THE ACTIVE TRANSPORT ACROSS THE MEMBRANE OF MULTIPLE SULFUR-CONTAINING COMPOUNDS, INCLUDING SULFATE AND THIOSULFATE (IMPORT). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv2397c, (MTCY253.24), len: 351 aa. Probable cysA1, sulfate-transport ATP-binding protein ABC transporter (see citations below), similar to OTHER SULFATE ABC TRANSPORTER ATP-BINDING PROTEINS e.g. P14788|CYSA_SYNP7 from Synechococcus sp. (344 aa), FASTA scores: opt: 1112, E(): 2.6e-56, (54.6% identity in 328 aa overlap); P74548|CYSA_SYNY3 from Synechocystis sp. (355 aa), FASTA scores: opt: 1063, E(): 1.7e-53, (51.9% identity in 343 aa overlap); Q9I6L0|CYSA|PA0280 from Pseudomonas aeruginosa (329 aa), FASTA scores: opt: 987, E(): 3.3e-49, (49.2% identity in 339 aa overlap); etc. Also similar to many ATP-binding proteins from Mycobacterium tuberculosis e.g. Rv2038c, Rv1238, Rv2832c, etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that previously known as cysA.; cysA" /codon_start=1 /transl_table=11 /product="sulfate-transport ATP-binding protein ABC transporter CysA1" /protein_id="YP_177879.1" /db_xref="GI:57116983" /db_xref="GOA:P71747" /db_xref="UniProtKB/Swiss-Prot:P71747" /db_xref="GeneID:885663" /translation="MTYAIVVADATKRYGDFVALDHVDFVVPTGSLTALLGPSGSGKS TLLRTIAGLDQPDTGTITINGRDVTRVPPQRRGIGFVFQHYAAFKHLTVRDNVAFGLK IRKRPKAEIKAKVDNLLQVVGLSGFQSRYPNQLSGGQRQRMALARALAVDPEVLLLDE PFGALDAKVREELRAWLRRLHDEVHVTTVLVTHDQAEALDVADRIAVLHKGRIEQVGS PTDVYDAPANAFVMSFLGAVSTLNGSLVRPHDIRVGRTPNMAVAAADGTAGSTGVLRA VVDRVVVLGFEVRVELTSAATGGAFTAQITRGDAEALALREGDTVYVRATRVPPIAGG VSGVDDAGVERVKVTST" misc_feature complement(2694518..2694562) /gene="cysA1" /locus_tag="Rv2397c" /note="PS00211 ABC transporters family signature" misc_feature complement(2694833..2694856) /gene="cysA1" /locus_tag="Rv2397c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2694981..2695799) /gene="cysW" /locus_tag="Rv2398c" /db_xref="GeneID:885305" CDS complement(2694981..2695799) /gene="cysW" /locus_tag="Rv2398c" /function="INVOLVED IN THE ACTIVE TRANSPORT ACROSS THE MEMBRANE OF MULTIPLE SULFUR-CONTAINING COMPOUNDS, INCLUDING SULFATE AND THIOSULFATE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv2398c, (MTCY253.23), len: 272 aa. Probable cysW, sulfate-transport integral membrane protein ABC transporter (see citations below), similar to others e.g. Q9K877|CYSW|BH3129 SULFATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (287 aa), FASTA scores: opt: 765, E(): 4.1e-40, (43.8% identity in 249 aa overlap); P27370|CYSW_SYNP7 sulfate transport system (permease) protein from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (286 aa), FASTA scores: opt: 757, E(): 1.3e-39, (44.3% identity in 264 aa overlap); Q9I6K9|CYSW|PA0281 SULFATE TRANSPORT PROTEIN from Pseudomonas aeruginosa (289 aa), FASTA scores: opt: 753, E(): 2.3e-39, (44.4% identity in 250 aa overlap); P16702|P76534|CYSW_ECOLI SULFATE TRANSPORT SYSTEM PERMEASE from Escherichia coli (291 aa), FASTA scores: opt: 633, E(): 5.7e-32, (38.2% identity in 267 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane component signature. SIMILARITY WITH INTEGRAL MEMBRANE COMPONENTS OF OTHER BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEMS and BELONGS TO THE CYSTW SUBFAMILY." /codon_start=1 /transl_table=11 /product="sulfate-transport integral membrane protein ABC transporter CysW" /protein_id="NP_216914.1" /db_xref="GI:15609535" /db_xref="GOA:P71746" /db_xref="UniProtKB/TrEMBL:P71746" /db_xref="GeneID:885305" /translation="MTSLPAARYLVRSVALGYVFVLLIVPVALILWRTFEPGFGQFYA WISTPAAISALNLSLLVVAIVVPLNVIFGVTTALVLARNRFRGKGVLQAIIDLPFAVS PVIVGVSLILLWGSAGALGFVEQDLGFKIIFGLPGIVLGSMFVTCPFVVREVEPVLHE LGTDQEQAAATLGSGWWQTFWRITLPSIRWGLTYGIVLTVARTLGEYGAVIIVSSNLP GTSQTLTLLVSDRYHRGAEYGAYALSTLLMAVSVVVLIVQMVLDARRARAVSEG" misc_feature complement(2695242..2695328) /gene="cysW" /locus_tag="Rv2398c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature." gene complement(2695796..2696647) /gene="cysT" /locus_tag="Rv2399c" /db_xref="GeneID:885301" CDS complement(2695796..2696647) /gene="cysT" /locus_tag="Rv2399c" /function="INVOLVED IN THE ACTIVE TRANSPORT ACROSS THE MEMBRANE OF MULTIPLE SULFUR-CONTAINING COMPOUNDS, INCLUDING SULFATE AND THIOSULFATE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv2399c, (MTCY253.22), len: 283 aa. Probable cysT, sulfate-transport integral membrane protein ABC transporter (see citations below), similar to others e.g. BAB48989|MLR1667 PERMEASE PROTEIN OF SULFATE ABC TRANSPORTER from Rhizobium loti (283 aa), FASTA scores: opt: 756, E(): 7.9e-40, (40.95% identity in 271 aa overlap); Q9K878|CYST|BH3128 SULFATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (279 aa), FASTA scores: opt: 750, E(): 1.8e-39, (44.55% identity in 258 aa overlap); P16701|CYST_ECOLI|CYSU|CYST|B2424 from Escherichia coli (277 aa), FASTA scores: opt: 669, E(): 1.9e-34, (40.0% identity in 260 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane component signature, and PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE CYSTW SUBFAMILY." /codon_start=1 /transl_table=11 /product="sulfate-transport integral membrane protein ABC transporter CysT" /protein_id="NP_216915.1" /db_xref="GI:15609536" /db_xref="GOA:P71745" /db_xref="UniProtKB/TrEMBL:P71745" /db_xref="GeneID:885301" /translation="MTESLVGERRAPQFRARLSGPAGPPSVRVGMAVVWLSVIVLLPL AAIVWQAAGGGWRAFWLAVSSHAAMESFRVTLTISTAVTVINLVFGLLIAWVLVRDDF AGKRIVDAIIDLPFALPTIVASLVMLALYGNNSPVGLHFQHTATGVGVALAFVTLPFV VRAVQPVLLEIDRETEEAAASLGANGAKIFTSVVLPSLTPALLSGAGLAFSRAIGEFG SVVLIGGAVPGKTEVSSQWIRTLIENDDRTGAAAISVVLLSISFIVLLILRVVGARAA KREEMAA" misc_feature complement(2695955..2695978) /gene="cysT" /locus_tag="Rv2399c" /note="PS00017 ATP/GTP-binding site motif A" misc_feature complement(2696060..2696146) /gene="cysT" /locus_tag="Rv2399c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene complement(2696644..2697714) /gene="subI" /locus_tag="Rv2400c" /db_xref="GeneID:885299" CDS complement(2696644..2697714) /gene="subI" /locus_tag="Rv2400c" /function="INVOLVED IN THE ACTIVE TRANSPORT ACROSS THE MEMBRANE OF MULTIPLE SULFUR-CONTAINING COMPOUNDS, INCLUDING SULFATE AND THIOSULFATE (IMPORT)." /experiment="experimental evidence, no additional details recorded" /note="Rv2400c, (MTCY253.21), len: 356 aa. Probable subI, sulfate-binding lipoprotein component of sulfate transport system (see citations below), equivalent to Q9CCN3|SUBI|ML0615 (alias Q49748|B1937_F1_11, 358 aa) PUTATIVE SULPHATE-BINDING PROTEIN from Mycobacterium leprae (348 aa), FASTA scores: opt: 1775, E(): 2.3e-102, (76.45% identity in 340 aa overlap). Also similar to others and other substrate-binding proteins e.g. P27366|SUBI_SYNP7|SBPA SULFATE-BINDING PROTEIN PRECURSOR from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (350 aa), FASTA scores: opt: 703, E(): 4.6e-36, (35.6% identity in 351 aa overlap); Q9I6K7|SBP|PA0283 SULFATE-BINDING PROTEIN PRECURSOR from Pseudomonas aeruginosa (332 aa), FASTA scores: opt: 591, E(): 3.7e-29, (36.9% identity in 317 aa overlap); CAC49112|SMB21133 PUTATIVE SULFATE UPTAKE ABC TRANSPORTER PERIPLASMIC SOLUTE-BINDING PROTEIN PRECURSOR from Rhizobium meliloti (Sinorhizobium meliloti) (341 aa), FASTA scores: opt: 569, E(): 8.8e-28, (36.15% identity in 321 aa overlap); etc. BELONGS TO THE PROKARYOTIC SULFATE BINDING PROTEIN FAMILY." /codon_start=1 /transl_table=11 /product="sulfate-binding lipoprotein" /protein_id="NP_216916.1" /db_xref="GI:15609537" /db_xref="GOA:P71744" /db_xref="UniProtKB/TrEMBL:P71744" /db_xref="GeneID:885299" /translation="MLSLTLSEASCIASASRWRHIIPAGVVCALIAGIGVGCHGGPSD VVGRAGPDRAHTSITLVAYAVPEPGWSAVIPAFNASEQGRGVQVITSYGASADQSRGV ADGKPADLVNFSVEPDIARLVKAGKVDKDWDADATKGIPFGSVVTFVVRAGNPKNIRD WDDLLRPGIEVITPSPLSSGSAKWNLLAPYAAKSDGGRNNQAGIDFVNTLVNEHVKLR PGSGREATDVFVQGSGDVLISYENEAIATERAGKPVQHVTPPQTFKIENPLAVVATST HLGAATAFRNFQYTVQAQKLWAQAGFRPVDPAVAADFADLFPVPAKLWTIADLGGWGS VDPQLFDKATGSITKIYLRATG" gene 2697728..2698057 /locus_tag="Rv2401" /db_xref="GeneID:885664" CDS 2697728..2698057 /locus_tag="Rv2401" /function="UNKNOWN" /note="Rv2401, (MTCY253.19c), len: 109 aa. Hypothetical unknown protein. Equivalent to AAK46768 from Mycobacterium tuberculosis strain CDC1551 (134 aa) but shorter 25 aa. N-terminus extended since first submission (previously 72 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216917.2" /db_xref="GI:57116984" /db_xref="UniProtKB/TrEMBL:O86326" /db_xref="GeneID:885664" /translation="MRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSA ANERADIAPRKTRCCVHVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVRPR HPGYLGA" gene complement(2698042..2698245) /locus_tag="Rv2401A" /db_xref="GeneID:3205070" CDS complement(2698042..2698245) /locus_tag="Rv2401A" /function="UNKNOWN" /note="Rv2401A, len 67 aa. Possible conserved membrane protein, highly similar, but with 29 aa shorter, to ML0614|AL583919_34|Q49760 from Mycobacterium leprae (95 aa), FASTA scores: opt: 297, E(): 3.6e-15, (67.7% identity in 65 aa overlap). Has hydrophobic stretch." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177670.1" /db_xref="GI:57116985" /db_xref="UniProtKB/TrEMBL:Q79FE4" /db_xref="GeneID:3205070" /translation="MGPMNGFLSWWDGVELWLSGLPFALQALAVMPVVLALAYFTAAL LDALLGRVIQLIRRARRPDQAPR" gene 2698529..2700457 /locus_tag="Rv2402" /db_xref="GeneID:885661" CDS 2698529..2700457 /locus_tag="Rv2402" /function="UNKNOWN" /note="Rv2402, (MTCY253.18c), len: 642 aa. Conserved hypothetical protein, highly similar to others e.g. 9X8C4|SCE36.11c CONSERVED HYPOTHETICAL PROTEIN (FRAGMENT) from Streptomyces coelicolor (612 aa), FASTA scores: opt: 1283, E(): 6.5e-75, (41.9% identity in 623 aa overlap); Q9RJ38|SCI8.15 HYPOTHETICAL 66.3 KDA PROTEIN from Streptomyces coelicolor (595 aa), FASTA scores: opt: 1152, E(): 1.7e-66, (39.9% identity in 622 aa overlap), Q9S223|CI51.17 HYPOTHETICAL 68.4 KDA PROTEIN from Streptomyces coelicolor (612 aa), FASTA scores: opt: 1146, E(): 4.2e-66, (40.6% identity in 623 aa overlap); YAY3_SCHPO|Q10211|c4h3.03c HYPOTHETICAL 74.5 kDa PROTEIN from Schizosaccharomyces pombe (Fission yeast) (649 aa) FASTA scores: opt: 999, E(): 1.3e-56, (35.0% identity in 642 aa overlap); etc. Contains possible helix-turn-helix motif, at aa 224-245 (+4.68 SD)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216918.1" /db_xref="GI:15609539" /db_xref="UniProtKB/TrEMBL:P71741" /db_xref="GeneID:885661" /translation="MALSSSSPLRNPFPPIADYAFLSDWETTCLISPAGSVEWLCVPR PDSPSVFGAILDRSAGHFRLGPYGVSVPSARRYLPGSLIMETTWQTHTGWLIVRDALV MGKWHDIERRSRTHRRTPMDWDAEHILLRTVRCVSGTVELMMSCEPAFDYHRLGATWE YSAEAYGEAIARANTEPDAHPTLRLTTNLRIGLEGREARARTRMKEGDDVFVALSWTK HPPPQTYDEAADKMWQTTECWRQWINIGNFPDHPWRAYLQRSALTLKGLTYSPTGALL AASTTSLPETPRGERNWDYRYAWIRDSTFALWGLYTLGLDREADDFFAFIADVSGANN NERHPLQVMYGVGGERSLVEAELHHLSGYDHARPVRIGNGAYNQRQHDIWGSILDSFY LHAKSREQVPENLWPVLKRQVEEAIKHWREPDRGIWEVRGEPQHFTSSKVMCWVALDR GAKLAERQGEKSYAQQWRAIADEIKADILEHGVDSRGVFTQRYGDEALDASLLLVVLT RFLPPDDPRVRNTVLAIADELTEDGLVLRYRVHETDDGLSGEEGTFTICSFWLVSALV EIGEVGRAKRLCERLLSFASPLLLYAEEIEPRSGRHLGNFPQAFTHLALINAVVHVIR AEEEADSSGMFQPANAPM" gene complement(2700535..2701290) /gene="lppR" /locus_tag="Rv2403c" /db_xref="GeneID:885513" CDS complement(2700535..2701290) /gene="lppR" /locus_tag="Rv2403c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2403c, (MTCY253.17), len: 251 aa. Probable lppR, conserved lipoprotein, with weak similarity with MYCOBACTERIAL SERINE/THREONINE PROTEIN KINASES (EC 2.7.1.-) e.g. AAK45563|MT1304 from Mycobacterium tuberculosis strain CDC1551 (626 aa), FASTA scores: opt: 186, E(): 0.00023, (24.4% identity in 238 aa overlap), and the C-terminal part of Q11053|Rv1266c|MTCY50.16|PKNH_MYCTU from Mycobacterium tuberculosis (626 aa), FASTA scores: opt: 185, E()= 0.00027, (24.35% identity in 238 aa overlap). Has signal peptide and appropriate positioned prokaryotic lipoprotein attachment site (PS00013). Could belong to the SER/THR FAMILY of protein kinases." /codon_start=1 /transl_table=11 /product="lipoprotein LppR" /protein_id="NP_216919.1" /db_xref="GI:15609540" /db_xref="UniProtKB/TrEMBL:P71740" /db_xref="GeneID:885513" /translation="MTNRWRWVVPLFAVFLAAGCTTTTTGKAGLAPNAVPRPLMGSLI QRVPLDGAALSTLLNQPFQALPPFPPVFGGSDSLGDSDVSARPADCVGVGYLTQRNVY RSVEVKSVARVSWRHDGSSVKVDDVDEGVVALPSAAAADDLFARFSAQWKECDGTTLT VPASAFGQRSITDVRVADSVVAATVSLRRGTHSILASVPQARAVGVRGNCVVEVAVTF FGITHPSDQGSADISTSAVDIAHAMMDRISELS" gene complement(2701287..2703248) /gene="lepA" /locus_tag="Rv2404c" /db_xref="GeneID:885475" CDS complement(2701287..2703248) /gene="lepA" /locus_tag="Rv2404c" /function="INVOLVED IN TRANSLATION." /note="binds to the ribosome on the universally-conserved alpha-sarcin loop" /codon_start=1 /transl_table=11 /product="GTP-binding protein LepA" /protein_id="NP_216920.1" /db_xref="GI:15609541" /db_xref="GOA:P65269" /db_xref="UniProtKB/Swiss-Prot:P65269" /db_xref="GeneID:885475" /translation="MRTPCSQHRRDRPSAIGSQLPDADTLDTRQPPLQEIPISSFADK TFTAPAQIRNFCIIAHIDHGKSTLADRMLQLTGVVDERSMRAQYLDRMDIERERGITI KAQNVRLPWRVDKTDYVLHLIDTPGHVDFTYEVSRALEACEGAVLLVDAAQGIEAQTL ANLYLALDRDLHIIPVLNKIDLPAADPDRYAAEMAHIIGCEPAEVLRVSGKTGEGVSD LLDEVVRQVPPPQGDAEAPTRAMIFDSVYDIYRGVVTYVRVVDGKISPRERIMMMSTG ATHELLEVGIVSPEPKPCEGLGVGEVGYLITGVKDVRQSKVGDTVTSLSRARGAAAEA LTGYREPKPMVYSGLYPVDGSDYPNLRDALDKLQLNDAALTYEPETSVALGFGFRCGF LGLLHMEITRERLEREFGLDLISTSPNVVYRVHKDDGTEIRVTNPSDWPEGKIRTVYE PVVKTTIIAPSEFIGTIMELCQSRRGELGGMDYLSPERVELRYTMPLGEIIFDFFDAL KSRTRGYASLDYEEAGEQEAALVKVDILLQGEAVDAFSAIVHKDTAYAYGNKMTTKLK ELIPRQQFEVPVQAAIGSKIIARENIRAIRKDVLSKCYGGDITRKRKLLEKQKEGKKR MKTIGRVEVPQEAFVAALSTDAAGDKGKK" misc_feature complement(2702934..2702981) /gene="lepA" /locus_tag="Rv2404c" /note="PS00301 GTP-binding elongation factors signature" misc_feature complement(2703051..2703074) /gene="lepA" /locus_tag="Rv2404c" /note="PS00017 ATP/GTP-binding site motif A" gene 2703269..2703838 /locus_tag="Rv2405" /db_xref="GeneID:885507" CDS 2703269..2703838 /locus_tag="Rv2405" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2405, (MTCY253.15c), len: 189 aa. Conserved hypothetical protein, identical (but N-terminus longer 40 residues) to AAK46773|MT2477 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551. Also highly similar, but N-terminus longer 38 residues, to Q9RD03|SCCM1.41 HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 451, E(): 2e-22, (48.7% identity in 154 aa overlap). Shows also similarity with hypothetical proteins from other species." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216921.1" /db_xref="GI:15609542" /db_xref="UniProtKB/TrEMBL:P71738" /db_xref="GeneID:885507" /translation="MQRFAENLVFTEAPKLVRHLQNTQETLRTIRQAVKITANIMTTA VPSPPAEIAAGRPVTSTSCPTAARARRLVYAPDLDGRADPGEIVWTWVAYEQDPTRGK DRPVLVVGRDRSVLLGLLVSSQERHAADRDWVGIGSGAWDYEGRESWVRLDRVLDVPE ESIRREGAILEREVFDVVAARLRADYAWR" gene complement(2704009..2704437) /locus_tag="Rv2406c" /db_xref="GeneID:885156" CDS complement(2704009..2704437) /locus_tag="Rv2406c" /function="UNKNOWN" /note="Rv2406c, (MTCY253.14), len: 142 aa. Conserved hypothetical protein. C-terminal region is identical with many CBS DOMAIN PROTEIN e.g. AAK46774|MT2478 CBS DOMAIN PROTEIN from Mycobacterium tuberculosis strain CDC1551 (aa 47-142), FASTA scores: opt: 594, E(): 1.9e-30, (98.97% identity in 97 aa overlap); etc. Also similar to other hypothetical proteins e.g. AAK24594|CC2626 CBS DOMAIN PROTEIN from Caulobacter crescentus (157 aa), FASTA scores: opt: 377, E(): 8.3e-17, (42.55% identity in 141 aa overlap); BAB47826|MLR0188 from Rhizobium loti; etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216922.1" /db_xref="GI:15609543" /db_xref="UniProtKB/TrEMBL:P71737" /db_xref="GeneID:885156" /translation="MRIADVLRNKGAAVVTINPDATVGELLAGLAEQNIGAMVVVGAE GVVGIVSERDVVRQLHTYGASVLSRPVAKIMSTTVATCTKSDTVDKISVLMTENRVRH VPVLDGKKLIGIVSIGDVVKSRMGELEAEQQQLQSYITQG" gene 2704697..2705518 /locus_tag="Rv2407" /db_xref="GeneID:885684" CDS 2704697..2705518 /locus_tag="Rv2407" /EC_number="3.1.26.11" /function="UNKNOWN" /note="member of metallo-beta-lactamase family; the purified enzyme from Escherichia coli forms dimeric zinc phosphodiesterase; in Bacillus subtilis this protein is a 3'-tRNA processing endoribonuclease and is essential while in Escherichia coli it is not; associates with two zinc ions" /codon_start=1 /transl_table=11 /product="ribonuclease Z" /protein_id="NP_216923.1" /db_xref="GI:15609544" /db_xref="GOA:P71736" /db_xref="UniProtKB/Swiss-Prot:P71736" /db_xref="GeneID:885684" /translation="MLEITLLGTGSPIPDPDRAGPSTLVRAGAQAFLVDCGRGVLQRA AAVGVGAAGLSAVLLTHLHGDVLITSWVTNFAADPAPLPIIGPPGTAEVVEATLKAFG HDIGYRIAHHADLTTPPPIEVHEYTAGPAWDRDGVTIRVAPTDHRPVTPTIGFRIESD GASVVLAGDTVPCDSLDQLAAGADALVHTVIRKDIVTQIPQQRVKDICDYHSSVQEAA ATANRAGVGTLVMTHYVPAIGPGQEEQWRALAATEFSGRIEVGNDLHRVEVHPRR" gene 2706017..2706736 /gene="PE24" /locus_tag="Rv2408" /db_xref="GeneID:885511" CDS 2706017..2706736 /gene="PE24" /locus_tag="Rv2408" /function="UNKNOWN" /note="Rv2408, (MTCY253.12c), len: 239 aa. Possibly a member of PE family (see citation below), similar to AAK46440|MT2159 from Mycobacterium tuberculosis strain CDC1551 (491 aa) FASTA scores: opt: 269, E(): 5.4e-08, (38.45% identity in 156 aa overlap) and AAK45466|MT1209 from Mycobacterium tuberculosis strain CDC1551 (308 aa), FASTA scores: opt: 265, E(): 6.3e-08, (36.0% identity in 197 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177880.1" /db_xref="GI:57116986" /db_xref="UniProtKB/TrEMBL:Q79FE3" /db_xref="GeneID:885511" /translation="MLIARPDILCSRGPEAMRAKAADLDLAAAAKTVGVQPAADQVAA AIAAILLSHAQIYQDISTQMAAFHDQLVENRTADSTSYASAEANAQQSLLNAMDAPSW QQRRETVGEVGLPADPAGSGTATAAVAAATTARAGSRSAAQATVAPIGGLKLRRESAL SQPGDLHHHVEVGDALPRVDPFQRGNVGVVAAYTHTDVLLGDLIVIGGVVVPPSTGPG LNPGMAAPVYRLSHHGITLRV" gene complement(2706494..2707333) /locus_tag="Rv2409c" /db_xref="GeneID:885674" CDS complement(2706494..2707333) /locus_tag="Rv2409c" /function="UNKNOWN" /note="Rv2409c, (MTCY253.11), len: 279 aa. Conserved hypothetical protein, equivalent to Q49757|YP69_MYCLE|G466976|B1937_F2_39 HYPOTHETICAL PROTEIN from Mycobacterium leprae (279 aa), FASTA scores: opt: 1564, E(): 4.6e-95, (82.1% identity in 279 aa overlap). Also similar to others e.g. Q9RSX6|DR1993 from Deinococcus radiodurans (274 aa), FASTA scores: opt: 494, E(): 4e-25, (35.1% identity in 282 aa overlap); BAB49898|Mll2875 from Rhizobium loti (Mesorhizobium loti) (294 aa), FASTA scores: opt: 382, E(): 8.9e-18, (29.75% identity in 269 aa overlap); Q9I305|PA1732 from Pseudomonas aeruginosa (266 aa), FASTA scores: opt: 326, E(): 3.7e-14, (31.25% identity in 275 aa overlap); etc. Also similar to Rv2569c|MTCY227.32 from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216925.1" /db_xref="GI:15609546" /db_xref="UniProtKB/TrEMBL:P71734" /db_xref="GeneID:885674" /translation="MWRTRVVHTTGYVYQSPVTASYNEARLTPRSSSRQNLVLNRVET IPATRSYRYIDYWGTAVTAFDLHAPHTELTVTSSSVVETERPEPLAAKATWADLQSTA VIDRFDEVLRPTPHTPASARVDAVGRRIRKCHEPSEAVVAAARWARSELDYIPGTTSV HSSGLDALEQGKGVCQDFVHLSLMVLRSMGIPCRYVSGYLHPKRDAVVGKTVDGRSHA WVQAWTGGWWHYDPTNDNEITEQYISVGVGRDYTDVSPLKGIYSGEGVTDLDVVVEIT RLA" gene complement(2707333..2708310) /locus_tag="Rv2410c" /db_xref="GeneID:885257" CDS complement(2707333..2708310) /locus_tag="Rv2410c" /function="UNKNOWN" /note="Rv2410c, (MTCY253.10), len: 325 aa. Conserved hypothetical protein, equivalent to Q49770|CAC30114|ML0606 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (325 aa), FASTA scores: opt: 1928, E(): 3.5e-117, (90.75% identity in 325 aa overlap). Also some similarity with other hypothetical proteins e.g. Q9RST2|DR2041 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (316 aa), FASTA scores: opt: 329, E(): 5.3e-14, (32.4% identity in 318 aa overlap); C-terminus of Q9HUN7|PA4927 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (830 aa), FASTA scores: opt: 297, E(): 1.5e-11, (27.6% identity in 315 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216926.1" /db_xref="GI:15609547" /db_xref="UniProtKB/TrEMBL:P71733" /db_xref="GeneID:885257" /translation="MLARNAEALYWIGRYVERADDTARILDVAVHQLLEDSSVDPDQA SRLLLRVLGIEPPDHELDVWSLTDLVAFSTNSQGGSSIVDAISAARENAKSAREVTSS ETWECLNTTYNALPERERAAKRLGPHEFLSFIEGRAAMFAGLADSTLLRDDGYRFMLL GRAIERVDMTVRLLLSRVGDSASSPAWVTLLRSAGAHDTYLRTYRGVLDAGRVVEFMM LDRLFPRSVFHSLKLAEHNLAELMHNPHSRIGATTEAQRLLGQARSELEFVQPGVLLE TLESRLAGLQTTCRDVGDALALQYFHAAPWVAWSDAGQRGQLVGSQEES" gene complement(2708310..2709965) /locus_tag="Rv2411c" /db_xref="GeneID:885681" CDS complement(2708310..2709965) /locus_tag="Rv2411c" /function="UNKNOWN" /note="Rv2411c, (MTCY253.09c), len: 551 aa. Hypothetical protein, highly similar to Q49755|YO11_MYCLE|ML0605|MLCL536.05c|U1937B|B1937_F1_4 HYPOTHETICAL 61.8 KDA PROTEIN from Mycobacterium leprae (561 aa), FASTA scores, opt: 3163, E(): 4.1e-178, (87.35% identity in 554 aa overlap). Also highly similar, except in N-terminus, to others e.g. Q55587|Y335_SYNY3|SLL0335 HYPOTHETICAL PROTEIN from Synechocystis sp. strain PCC 6803 (481 aa), FASTA scores: opt: 1620, E(): 1.2e-87, (52.8% identity in 468 aa overlap); Q9I307|PA1730 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (470 aa), FASTA scores: opt: 1574, E(): 5.8e-85, (52.7% identity in 467 aa overlap); Q9RST1|DR2042 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (655 aa), FASTA scores: opt: 1561, E(): 4.4e-84, (53.3% identity in 467 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216927.1" /db_xref="GI:15609548" /db_xref="UniProtKB/Swiss-Prot:P65001" /db_xref="GeneID:885681" /translation="MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDA QGIVRGPYKGIYAELAPSDASELKARADALGRAFIDQGITFSLSGQERPFPLDLVPRV ISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIV PPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRV RAVDDYASHLLRALRNSAATNEADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRD LFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGVAGLVNAARAGNVVL SSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVL KPVEGSGGYGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPR YVDLRPFAVNDGNEVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLAPRASAAARE LGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQQQQQQAFH" gene 2710075..2710335 /gene="rpsT" /locus_tag="Rv2412" /db_xref="GeneID:885676" CDS 2710075..2710335 /gene="rpsT" /locus_tag="Rv2412" /function="INVOLVED IN TRANSLATION MECHANISMS. BINDS DIRECTLY TO 16S RIBOSOMAL RNA (BY SIMILARITY)." /experiment="experimental evidence, no additional details recorded" /note="binds directly to the 16S rRNA and is involved in post-translational inhibition of arginine and ornithine decarboxylase" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S20" /protein_id="NP_216928.1" /db_xref="GI:15609549" /db_xref="GOA:P66505" /db_xref="UniProtKB/Swiss-Prot:P66505" /db_xref="GeneID:885676" /translation="MANIKSQQKRNRTNERARLRNKAVKSSLRTAVRAFREAAHAGDK AKAAELLASTNRKLDKAASKGVIHKNQAANKKSALAQALNKL" gene complement(2710351..2711301) /locus_tag="Rv2413c" /db_xref="GeneID:885666" CDS complement(2710351..2711301) /locus_tag="Rv2413c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2413c, (MTCY253.07), len: 316 aa. Conserved hypothetical protein, highly similar to O33133|MLCL536.07c|ML0603|Q49756|G466975|B1937_F2_36 hypothetical 39.1 KDA protein from Mycobacterium leprae (389 aa), FASTA scores: opt: 1683, E(): 1.8e-88, (83.9% identity in 316 aa overlap). ML0603 is a putative lipoprotein with an N-terminal signal sequence and appropriately positioned prokaryotic lipoprotein lipid attachment site that is not present in Rv2413c as this seems to be 73 aa shorter. Also some similarity with various proteins from other organisms e.g. Q9RDM2|SCC123.02c PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (336 aa), FASTA scores: opt: 792, E(): 6.1e-38, (42.4% identity in 316 aa overlap); Q9HX31|HOLA|PA3989 DNA POLYMERASE III, DELTA SUBUNIT from Pseudomonas aeruginosa (345 aa), FASTA scores: opt: 173, E(): 0.0084, (25.4% identity in 307 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216929.1" /db_xref="GI:15609550" /db_xref="GOA:P71730" /db_xref="UniProtKB/TrEMBL:P71730" /db_xref="GeneID:885666" /translation="MHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGA YELAELLSPSLFAEERIVVLGAAAEAGKDAAAVIESAAADLPAGTVLVVVHSGGGRAK SLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRVKVDDETVTALLDAVGSDVRE LASACSQLVADTGGAVDAAAVRRYHSGKAEVRGFDIADKAVAGDVAGAAEALRWAMMR GEPLVVLADALAEAVHTIGRVGPQSGDPYRLAAQLGMPPWRVQKAQKQARRWSRDTVA TAMRLVAELNANVKGAVADADYALESAVRQVAELVADRGR" gene complement(2711332..2712876) /locus_tag="Rv2414c" /db_xref="GeneID:885667" CDS complement(2711332..2712876) /locus_tag="Rv2414c" /function="UNKNOWN" /note="Rv2414c, (MTCY253.06), len: 514 aa. Conserved hypothetical protein, showing some similarity with COME OPERON PROTEINS 3 (COMEC OR COME3) e.g. Q9RTB1|DR1854 PUTATIVE COMPETENCE PROTEIN COMEC/REC2 from Deinococcus radiodurans (755 aa), FASTA scores: opt: 311, E(): 8.2e-11, (27.3% identity in 538 aa overlap); P73100|COME|SLL1929 COME PROTEIN from Synechocystis sp. strain PCC 6803 (709 aa), FASTA scores: opt: 302, E(): 2.6e-10, (26.3% identity in 323 aa overlap) (no similarity on N-terminus); P39695|CME3_BACSU COME OPERON PROTEIN 3 from Bacillus subtilis (776 aa), FASTA scores: opt: 273, E(): 1.4e-08, (25.2% identity in 282 aa overlap) (no similarity on N-terminus); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216930.1" /db_xref="GI:15609551" /db_xref="UniProtKB/TrEMBL:P71729" /db_xref="GeneID:885667" /translation="MGFGASRLDVRLVPAALVSWIVTAAGIVWPIGNVCALCCVVVAL GGGALWWCVARRSWHAPRLGSISAGLVAVGMVGAGYGLAVALRSEAVDRHPITVAFGT SALVTVTPSESPVSLGRGRLMFRATVQRLRDDETSGRVVVFARALDFGELMVGQPVQF RARISRPARHDLTVAVFNATGRPTVGRAGPVHRAAHIVRHRFAAAVREVLPADQATML PALVLGDTSTVTALTSREFRAAGLTHLTAVSGANVTIVCAAALVSARLIGPRAAVVCA AVALVAFVILVQPTASVLRAAVMGAIALVGMLSARRRQAIPALSGSVLVLLAAAPHLA VDIGFALSVAATGALVVIAPVWSRRLVDRGCPKVLADALAVAAAAQLVTAPLVAAISG RVSLVAVVANLAVAAVIAPITVLGSVAAVLVVPWPAGAQVLIRFTGPEVWWVLRVAHW ASGVPAATVPVAAGLPGVLLVGGATVFTVAQWRWRWFRAAMCKTMAVAVICLLAWSLS GLVGPS" gene complement(2712891..2713784) /locus_tag="Rv2415c" /db_xref="GeneID:885697" CDS complement(2712891..2713784) /locus_tag="Rv2415c" /function="UNKNOWN" /note="Rv2415c, (MTCY253.05), len: 297 aa. Hypothetical protein, with some similarity in C-terminal part to comE operon proteins 1 e.g. Q9EU10|COME|COME4|COME1|COME2|COME3 COME PROTEIN (a competence protein with DNA-binding activity) from Neisseria gonorrhoeae (99 aa), FASTA scores: opt: 190, E(): 0.0032, (49.2% identity in 61 aa overlap); Q9JYB8|NMB1657 from Neisseria meningitidis (205 aa) FASTA scores: opt: 191, E(): 0.0052, (49.2% identity in 61 aa overlap); CME1_BACSU|P39694 come operon protein 1 from Bacillus subtilis (205 aa), FASTA scores, opt: 181, E(): 0.017 (29.8% identity in 218 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216931.1" /db_xref="GI:15609552" /db_xref="GOA:P71728" /db_xref="UniProtKB/TrEMBL:P71728" /db_xref="GeneID:885697" /translation="MRTELPAERLQRRLGAVPDIDSHAASAHLDPEPHDPTDDGPDHD EPRDDPNSLLPRWLPDTSRGQGWADRIRADPGRAGAVALAVIAALAVLVTVFTLIRDR TEPVMSAKLPPVEPVSPTNPRSSASPGSPDRSGLPVVVSVVGLVHTPGLVTLAPGARI ADALQAAGGAVDGADTVGLNMARQLGDGEQIVVGLAPPSGQPRVLGSSVGAGTPGPAG TSGTATTGPKTAPKTAEVLDLNTATVEQLDALPGIGPVTAAAIVAWRQRNGRFTSVDQ LADVDGIGPARLDKRRNLVRV" gene complement(2714124..2715332) /gene="eis" /locus_tag="Rv2416c" /db_xref="GeneID:885903" CDS complement(2714124..2715332) /gene="eis" /locus_tag="Rv2416c" /function="SUPPOSED INVOLVED IN INTRACELLULAR SURVIVAL. POSSIBLY ASSOCIATED WITH THE CELL SURFACE AND SECRETED." /experiment="experimental evidence, no additional details recorded" /note="Rv2416c, (MTCY253.04), len: 402 aa. eis, enhanced intracellular survival gene (see citations below). Conserved hypothetical protein sharing similarity with Q9F309|SCC80.10 HYPOTHETICAL 44.7 KDA PROTEIN from Streptomyces coelicolor (413 aa), FASTA scores: opt: 382, E(): 1e-16, (31.45% identity in 407 aa overlap); Q9K4F4|SCD66.23 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (418 aa), FASTA scores: opt: 238, E(): 1.3e-07, (36.5% identity in 364 aa overlap): and Q54238|G1139577|ORF5 hypothetical protein from Streptomyces griseus (416 aa), FASTA scores: opt: 237, E(): 1.5e-07, (34.0 identity in 423 aa overlap). Start changed since first submission (- 6 aa) (see Dahl et al., 2001; Wei et al., 2000)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216932.2" /db_xref="GI:57116987" /db_xref="GOA:P71727" /db_xref="UniProtKB/Swiss-Prot:P71727" /db_xref="GeneID:885903" /translation="MTVTLCSPTEDDWPGMFLLAAASFTDFIGPESATAWRTLVPTDG AVVVRDGAGPGSEVVGMALYMDLRLTVPGEVVLPTAGLSFVAVAPTHRRRGLLRAMCA ELHRRIADSGYPVAALHASEGGIYGRFGYGPATTLHELTVDRRFARFHADAPGGGLGG SSVRLVRPTEHRGEFEAIYERWRQQVPGGLLRPQVLWDELLAECKAAPGGDRESFALL HPDGYALYRVDRTDLKLARVSELRAVTADAHCALWRALIGLDSMERISIITHPQDPLP HLLTDTRLARTTWRQDGLWLRIMNVPAALEARGYAHEVGEFSTVLEVSDGGRFALKIG DGRARCTPTDAAAEIEMDRDVLGSLYLGAHRASTLAAANRLRTKDSQLLRRLDAAFAS DVPVQTAFEF" gene complement(2715472..2716314) /locus_tag="Rv2417c" /db_xref="GeneID:885692" CDS complement(2715472..2716314) /locus_tag="Rv2417c" /function="UNKNOWN" /note="Rv2417c, (MTCY253.03), len: 280 aa. Conserved hypothetical protein, highly similar to Q9RDL7|SCC123.07c HYPOTHETICAL 29.2 KDA PROTEIN from Streptomyces coelicolor (281 aa), FASTA scores: opt: 579, E(): 3.6e-27, (38.3% identity in 274 aa overlap). Also some similarity with DEGV proteins or hypothetical proteins from other organisms, e.g. Q9RSY3|DR1986 from Deinococcus radiodurans (281 aa), FASTA scores: opt: 393, E(): 3.4e-16, (31.0% identity in 280 aa overlap); P32436|DEGV_BACSU from Bacillus subtilis (281 aa), FASTA scores: opt: 365, E(): 1.5e-14, (27.8% identity in 284 aa overlap); BAB41937|BAB46307|SA0704|SAV0749 Conserved hypothetical protein from Staphylococcus aureus strain Mu50 and N315 (288 aa), FASTA scores: opt: 371, E(): 7e-15, (28.85% identity in 281 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216933.1" /db_xref="GI:15609554" /db_xref="UniProtKB/Swiss-Prot:P67368" /db_xref="GeneID:885692" /translation="MTVVVVTDTSCRLPADLREQWSIRQVPLHILLDGLDLRDGVDEI PDDIHKRHATTAGATPVELSAAYQRALADSGGDGVVAVHISSALSGTFRAAELTAAEL GPAVRVIDSRSAAMGVGFAALAAGRAAAAGDELDTVARAAAAAVSRIHAFVAVARLDN LRRSGRISGAKAWLGTALALKPLLSVDDGKLVLVQRVRTVSNATAVMIDRVCQLVGDR PAALAVHHVADPAAANDVAAALAERLPACEPAMVTAMGPVLALHVGAGAVGVCVDVGA SPPA" repeat_region complement(2716315..2716391) /note="77 bp Mycobacterial Interspersed Repetitive Unit, Class I" gene complement(2716395..2717138) /locus_tag="Rv2418c" /db_xref="GeneID:885304" CDS complement(2716395..2717138) /locus_tag="Rv2418c" /function="UNKNOWN" /note="Rv2418c, (MTCY253.02), len: 247 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216934.1" /db_xref="GI:15609555" /db_xref="UniProtKB/TrEMBL:P71725" /db_xref="GeneID:885304" /translation="MSSRRGRRPALLVFADSLAYYGPTGGLPADDPRIWPNIVASQLD WDLELIGRIGWTCRDVWWAATQDPRAWAALPRAGAVIFATGGMDSLPSVLPTALRELI RYVRPSWLRRWVRDGYAWVQPRLSPVARAALPPHLTAEYLEKTRGAIDFNRPGIPIIA SLPSVHIAETYGKAHHGRAGTVAAITEWAQHHDIPLVDLKAAVAEQILSGYGNRDGIH WNFEAHQAVAELMLKALAEAGVPNEKSRG" gene complement(2717128..2717799) /locus_tag="Rv2419c" /db_xref="GeneID:885727" CDS complement(2717128..2717799) /locus_tag="Rv2419c" /EC_number="5.4.2.1" /function="INVOLVED IN GLYCOLYSIS [CATALYTIC ACTIVITY: 2-PHOSPHOGLYCERATE + 2,3-DIPHOSPHOGLYCERATE = 3-PHOSPHOGLYCERATE + 2,3-DIPHOSPHOGLYCERATE]." /note="Rv2419c, (MTCY428.28-MTCY253.01), len: 223 aa. Probable phosphoglycerate mutase (EC 5.4.2.1), equivalent to Q9CC00|ML1452 POSSIBLE PHOSPHOGLYCERATE MUTASE from Mycobacterium leprae (224 aa), FASTA scores: opt: 1206, E(): 8.8e-68, (80.35% identity in 224 aa overlap). Also highly similar to Q9RDL0|SCC123.14c PUTATIVE PHOSPHOGLYCERATE MUTASE from Streptomyces coelicolor (223 aa), FASTA scores: opt: 431, E(): 9.4e-20, (40.85% identity in 213 aa overlap); and similar to others e.g. Q9RVD2|DR1097 from Deinococcus radiodurans (232 aa), FASTA scores: opt: 291, E(): 4.6e-11, (39.3% identity in 173 aa overlap); etc. Some similarity to Q10512|Rv2228c|Y019_MYCTU|MT2287|MTcy427.09c hypothetical 39.2 kDa protein from Mycobacterium tuberculosis (364 aa) FASTA scores: opt: 196, E(): 2.8e-06, (45.6% identity in 79 aa overlap). Contains PS00175 Phosphoglycerate mutase family phosphohistidine signature. BELONGS TO THE PHOSPHOGLYCERATE MUTASE FAMILY." /codon_start=1 /transl_table=11 /product="phosphoglycerate mutase (phosphoglyceromutase)" /protein_id="NP_216935.1" /db_xref="GI:15609556" /db_xref="GOA:P71724" /db_xref="UniProtKB/TrEMBL:P71724" /db_xref="GeneID:885727" /translation="MRARRLVMLRHGQTDYNVGSRMQGQLDTELSELGRTQAVAAAEV LGKRQPLLIVSSDLRRAYDTAVKLGERTGLVVRVDTRLRETHLGDWQGLTHAQIDADA PGARLAWREDATWAPHGGESRVDVAARSRPLVAELVASEPEWGGADEPDRPVVLVAHG GLIAALSAALLKLPVANWPALGGMGNASWTQLSGHWAPGSDFESIRWRLDVWNASAQV SSDVL" misc_feature complement(2717749..2717778) /locus_tag="Rv2419c" /note="PS00175 Phosphoglycerate mutase family phosphohistidine signature" gene complement(2717796..2718176) /locus_tag="Rv2420c" /db_xref="GeneID:885677" CDS complement(2717796..2718176) /locus_tag="Rv2420c" /function="UNKNOWN" /note="Rv2420c, (MTCY428.27), len: 126 aa. Conserved hypothetical protein, equivalent to Q9CBZ9|ML1453 HYPOTHETICAL PROTEIN from Mycobacterium leprae (129 aa), FASTA scores: opt: 681, E(): 1.6e-38, (87.0% identity in 123 aa overlap). Also highly similar to Q9RDK9|SCC123.15c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (148 aa), FASTA scores: opt: 447, E(): 5.8e-23, (52.7% identity in 129 aa overlap); and similar to others e.g. P54457|YQEL_BACSU HYPOTHETICAL PROTEIN from Bacillus subtilis (118 aa), FASTA scores: opt: 318, E(): 1.8e-14, (37.3% identity in 110 aa overlap); Q9KD89|BH1328 HYPOTHETICAL PROTEIN from Bacillus halodurans (117 aa), FASTA scores: opt: 296, E(): 5.1e-13, (37.6% identity in 109 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216936.1" /db_xref="GI:15609557" /db_xref="GOA:O86327" /db_xref="UniProtKB/TrEMBL:O86327" /db_xref="GeneID:885677" /translation="MTANREAIDMARVAAGAAAAKLADDVVVIDVSGQLVITDCFVIA SGSNERQVNAIVDEVEEKMRQAGYRPARREGAREGRWTLLDYRDIVVHIQHQDDRNFY ALDRLWGDCPVVPVDLSANSAGAQ" gene complement(2718173..2718808) /gene="nadD" /locus_tag="Rv2421c" /db_xref="GeneID:885457" CDS complement(2718173..2718808) /gene="nadD" /locus_tag="Rv2421c" /EC_number="2.7.7.18" /function="INVOLVED IN NAD BIOSYNTHESIS; CATALYZES THE REVERSIBLE ADENYLATION OF NICOTINATE MONONUCLEOTIDE [CATALYTIC ACTIVITY: ATP + NICOTINATE RIBONUCLEOTIDE = DIPHOSPHATE + DEAMIDO-NAD(+)]." /note="transfers an adenyl group from ATP to NaMN to form nicotinic acid adenine dinucleotide (NaAD) which is then converted to the ubiquitous compound NAD by NAD synthetase; essential enzyme in bacteria" /codon_start=1 /transl_table=11 /product="nicotinic acid mononucleotide adenylyltransferase" /protein_id="NP_216937.1" /db_xref="GI:15609558" /db_xref="GOA:O86328" /db_xref="UniProtKB/Swiss-Prot:O86328" /db_xref="GeneID:885457" /translation="MGGTFDPIHYGHLVAASEVADLFDLDEVVFVPSGQPWQKGRQVS AAEHRYLMTVIATASNPRFSVSRVDIDRGGPTYTKDTLADLHALHPDSELYFTTGADA LASIMSWQGWEELFELARFVGVSRPGYELRNEHITSLLGQLAKDALTLVEIPALAISS TDCRQRAEQSRPLWYLMPDGVVQYVSKCRLYCGACDAGARSTTSLAAGNGL" gene 2719083..2719355 /locus_tag="Rv2422" /db_xref="GeneID:885531" CDS 2719083..2719355 /locus_tag="Rv2422" /function="UNKNOWN" /note="Rv2422, (MTCY428.25c), len: 90 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216938.1" /db_xref="GI:15609559" /db_xref="UniProtKB/TrEMBL:P71926" /db_xref="GeneID:885531" /translation="MPASVSTVLVDTSVAVAPVVADHDHHEDTFQALRGRTLGLAGHA AFERRTLATVAKLLAHTFPATRFLGAGAAMSLLPELAPAEIAGGAV" gene 2719597..2720643 /locus_tag="Rv2423" /db_xref="GeneID:885516" CDS 2719597..2720643 /locus_tag="Rv2423" /function="UNKNOWN" /note="Rv2423, (MTCY428.24c), len: 348 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216939.1" /db_xref="GI:15609560" /db_xref="GOA:P71925" /db_xref="UniProtKB/TrEMBL:P71925" /db_xref="GeneID:885516" /translation="MDNLPIESAESTRLAKAAMTRRFYTRSVVKGEITLPAVPSMIDE YVTMCAGLFAGVGRKFSDEELAHLRAVLQGQLAEAYAASQRSTIVISYNAPMGPTLHY QVRAQWRTVAQEYENWIATREPPLFGTEPDARVWALANEAADPTTHRVLEIGAGTGRN ALALARRGHPVDVVEMTPKFADIIRSDAERDSLDVRVIMRDVFSTMDDLRQDYQLMVL SEVVPDFRTTQQLRNLFELAAQCLAPGARLVFNAFLANGDYAPDQAAREFGQQMYTGM CTRAEMSAAAAGLPLELVADDSVYDYEKTHLPPGAWPPTSWYADWIRGLDVFTTNVES CPIEMRWLVFQRRR" repeat_region 2720644..2720656 /note="13 bp inverted repeat, GCAGTCG(C)AAAAG, at the left end of IS1558" gene complement(2720776..2721777) /locus_tag="Rv2424c" /db_xref="GeneID:885699" CDS complement(2720776..2721777) /locus_tag="Rv2424c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1558." /note="Rv2424c, (MTCY428.23), len: 333 aa. Probable transposase for IS1558, similar to IS element proteins e.g. AL021957|Rv2177c|MTV021_10 from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 1491, E(): 6.2e-87, (98.6% identity in 221 aa overlap); P19780|YIS1_STRCO HYPOTHETICAL INSERTION ELEMENT IS110 from Streptomyces coelicolor (45 aa), FASTA scores: opt: 203, E(): 1.7e-05; (27.3% identity in 238 aa overlap); etc. Contains PS01159 WW/rsp5/WWP domain signature." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216940.1" /db_xref="GI:15609561" /db_xref="GOA:P71924" /db_xref="UniProtKB/TrEMBL:P71924" /db_xref="GeneID:885699" /translation="MQCRAREERPGRKTDLLDAEWLVHLLECGLLRGWLIPPADIKAA RDVIRYRRKLVEHRTSKLQRLGNVLQDAGIKADSVASSVTPKSVRAMVEALIDGERRP AVLADLARGSMRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMIGALDEQIEQL MHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLASWVRLCPGNHESAG KRHHGARRTGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFGGFRSPAANKKAITT VAHKLIVIIWHVLATGRPHQDLGADYFTTRMDPDKERRRLVAKLEAQGLGVTLEPAA" repeat_region complement(2720779..2721777) /note="IS1558-2, len: 999 bp. Insertion sequence IS1558." /mobile_element="insertion sequence:IS1558-2" misc_feature complement(2720977..2721057) /locus_tag="Rv2424c" /note="PS01159 WW/rsp5/WWP domain signature" repeat_region complement(2721844..2721856) /note="13 bp inverted repeat, GCAGTCG(T)AAAAG, at the right end of IS1558" gene complement(2721866..2723308) /locus_tag="Rv2425c" /db_xref="GeneID:885673" CDS complement(2721866..2723308) /locus_tag="Rv2425c" /function="UNKNOWN" /note="Rv2425c, (MTCY428.22), len: 480 aa. Hypothetical protein; C-terminal half shares similarity to other unknown conserved proteins e.g. Q53065 HYPOTHETICAL 24.3 KDA PROTEIN from Rhodococcus erythropolis (219 aa), FASTA scores: opt: 398, E(): 9.9e-17, (34.15% identity in 202 aa overlap); C-terminus of O27843|MTH1815 CONSERVED PROTEIN from Methanothermobacter thermautotrophicus (346 aa), FASTA scores: opt: 341, E(): 3.7e-13, (31.35% identity in 233 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216941.1" /db_xref="GI:15609562" /db_xref="UniProtKB/TrEMBL:P71923" /db_xref="GeneID:885673" /translation="MAARRIRAARPLAPHGLPGHLVGFVEALRGSGISVGPSETVDAG RVMATLGLGDREVLREGIACAVLRRPDHRDTYDAMFDLWFPAALGARAVITTEDESAG SGGLPPDDVEAMRQLLLDLLANNQDLAGKDERLVEMIARIVEAYGKYSSSRGPSFSSY QALKAMALDELEGKLLAGLLAPYGDEPTATQEQIAKALAAQKIAQLRRMVDAETKRRT AEQLGREHVQMYGIPQLSENVEFLRASGEQLRQMRRVVAPLARTLATRLAARRRRARA GSIDLRKTLRKSMSTGGVPIDLVLHKPRPARPELVVLCDVSGSVAGFSHFTLLLVHAL RQQFSRVRVFAFIDSTDEVTHMFGPESDLAIAIQRITREAGVYARDGHSDYGNAFVSF MQGFPNVLSPRSSLLVLGDGRTNYRNPATDVLADMVTASRHAHWLNPEPKHLWGSGDS AVPRYQEVITMHECRSAKQLATVIDQLLPV" gene complement(2723308..2724183) /locus_tag="Rv2426c" /db_xref="GeneID:885909" CDS complement(2723308..2724183) /locus_tag="Rv2426c" /function="UNKNOWN" /note="Rv2426c, (MTCY428.21), len: 291 aa. Conserved hypothetical protein, highly similar to others e.g. Q51326|ORF4 from Pseudomonas carboxydovorans (295 aa), FASTA scores: opt: 853, E(): 3.7e-43, (48.75% identity in 277 aa overlap); BAB47746|MLR0088 from Rhizobium loti (309 aa), FASTA scores: opt :809, E(): 1.5e-40, (46.5% identity in 291 aa overlap); Q9Y9R8|APE2220 from Aeropyrum pernix (297 aa), FASTA scores: opt: 763, E(): 7.4e-38, (47.1% identity in 261 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216942.1" /db_xref="GI:15609563" /db_xref="GOA:P71922" /db_xref="UniProtKB/TrEMBL:P71922" /db_xref="GeneID:885909" /translation="MTVPARPTPLFADIADVSRRLAETGYLPDTATATAVFLADRLGK PLLVEGPAGVGKTELARAVAQATGSGLVRLQCYEGVDEARALYEWNHAKQILRIQAGS GDWEATKTDVFSEEFLLQRPLLTAIRRTEPTVLLIDETDKADIEIEGLLLEVLSDFAV TVPELGTLTATRAPFVLLTSNATRELSEALKRRCLYLHIDFPTPELERRILLSRVPEL PEHFAEELVRIIGVLRGMQLKKVPSIAETIDWGRTVLALGLDTIDDAVVAATLGVVLK HQSDQQRATGELRLN" misc_feature complement(2724013..2724036) /locus_tag="Rv2426c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2724230..2725477) /gene="proA" /locus_tag="Rv2427c" /db_xref="GeneID:885536" CDS complement(2724230..2725477) /gene="proA" /locus_tag="Rv2427c" /EC_number="1.2.1.41" /function="INVOLVED IN PROLINE BIOSYNTHESIS PATHWAY (AT THE SECOND STEP). CATALYZES THE NADPH DEPENDENT REDUCTION OF L-GAMMA- GLUTAMYL 5-PHOSPHATE INTO L-GLUTAMATE 5-SEMIALDEHYDE AND PHOSPHATE. THE PRODUCT SPONTANEOUSLY UNDERGOES CYCLIZATION TO FORM 1-PYRROLINE-5-CARBOXYLATE. [CATALYTIC ACTIVITY: L-GLUTAMATE 5-SEMIALDEHYDE + PHOSPHATE + NADP(+) = L-GAMMA-GLUTAMYL 5-PHOSPHATE + NADPH]." /note="Catalyzes the phosphorylation of L-glutamate during the proline biosynthesis pathway" /codon_start=1 /transl_table=11 /product="gamma-glutamyl phosphate reductase" /protein_id="NP_216943.1" /db_xref="GI:15609564" /db_xref="GOA:P65788" /db_xref="UniProtKB/Swiss-Prot:P65788" /db_xref="GeneID:885536" /translation="MTVPAPSQLDLRQEVHDAARRARVAARRLASLPTTVKDRALHAA ADELLAHRDQILAANAEDLNAAREADTPAAMLDRLSLNPQRVDGIAAGLRQVAGLRDP VGEVLRGYTLPNGLQLRQQRVPLGVVGMIYEGRPNVTVDAFGLTLKSGNAALLRGSSS AAKSNEALVAVLRTALVGLELPADAVQLLSAADRATVTHLIQARGLVDVVIPRGGAGL IEAVVRDAQVPTIETGVGNCHVYVHQAADLDVAERILLNSKTRRPSVCNAAETLLVDA AIAETALPRLLAALQHAGVTVHLDPDEADLRREYLSLDIAVAVVDGVDAAIAHINEYG TGHTEAIVTTNLDAAQRFTEQIDAAAVMVNASTAFTDGEQFGFGAEIGISTQKLHARG PMGLPELTSTKWIAWGAGHTRPA" gene complement(2725571..2726087) /gene="oxyR'" /locus_tag="Rv2427Ac" /pseudo /db_xref="GeneID:3205094" misc_feature complement(2725571..2726087) /gene="oxyR'" /locus_tag="Rv2427Ac" /note="Pseudogene oxyR', inactivated by multiple mutations; identical to sequence in u16243." /pseudo gene 2726193..2726780 /gene="ahpC" /locus_tag="Rv2428" /db_xref="GeneID:885717" CDS 2726193..2726780 /gene="ahpC" /locus_tag="Rv2428" /function="INVOLVED IN OXIDATIVE STRESS RESPONSE." /experiment="experimental evidence, no additional details recorded" /note="Rv2428, (MTCY428.18c), len: 195 aa. ahpC, alkyl hydroperoxide reductase C (EC 1.-.-.-) (see citations below), equivalent to other alkyl hydroperoxide reductases C mycobacterial proteins e.g. Q9CBF5|AHPC|ML2042 ALKYL HYDROPEROXIDE REDUCTASE from Mycobacterium leprae (195 aa) FASTA scores: opt: 1183, E(): 2.6e-72, (88.20% identity in 195 aa overlap); O87323|AHPC from Mycobacterium marinum (195 aa), FASTA scores: opt: 1215, E(): 1.9e-74, (90.8% identity in 195 aa overlap); Q57413|AHPC|AVI-3 from Mycobacterium avium (195 aa), FASTA scores: opt: 1201, E(): 1.6e-73, (90.25% identity in 195 aa overlap). Also highly similar to others from other organisms e.g. Q9FBP5|AHPC ALKYL HYDROPEROXIDE REDUCTASE from Streptomyces coelicolor (184 aa), FASTA scores: opt: 768, E(): 1.7e-44, (62.45% identity in 189 aa overlap); etc." /codon_start=1 /transl_table=11 /product="alkyl hydroperoxide reductase subunit C" /protein_id="NP_216944.1" /db_xref="GI:15609565" /db_xref="GOA:Q7BHK8" /db_xref="UniProtKB/TrEMBL:Q7BHK8" /db_xref="GeneID:885717" /translation="MPLLTIGDQFPAYQLTALIGGDLSKVDAKQPGDYFTTITSDEHP GKWRVVFFWPKDFTFVCPTEIAAFSKLNDEFEDRDAQILGVSIDSEFAHFQWRAQHND LKTLPFPMLSDIKRELSQAAGVLNADGVADRVTFIVDPNNEIQFVSATAGSVGRNVDE VLRVLDALQSDELCACNWRKGDPTLDAGELLKASA" gene 2726806..2727339 /gene="ahpD" /locus_tag="Rv2429" /db_xref="GeneID:885959" CDS 2726806..2727339 /gene="ahpD" /locus_tag="Rv2429" /function="INVOLVED IN OXIDATIVE STRESS RESPONSE." /experiment="experimental evidence, no additional details recorded" /note="Rv2429, (MTCY428.17c), 177 aa. ahpD, alkyl hydroperoxide reductase (EC 1.-.-.-), similar to other alkyl hydroperoxide reductases D proteins e.g. Q9RN73|AHPD from Streptomyces coelicolor (178 aa), FASTA scores: opt: 611, E(): 1.4e-33, (57.4% identity in 169 aa overlap); Q50441|AHPD_MYCSM AHPD PROTEIN (FRAGMENT) from Mycobacterium smegmatis (52 aa), FASTA score: opt:196." /codon_start=1 /transl_table=11 /product="alkyl hydroperoxide reductase subunit D" /protein_id="NP_216945.1" /db_xref="GI:15609566" /db_xref="GOA:Q57353" /db_xref="UniProtKB/Swiss-Prot:Q57353" /db_xref="GeneID:885959" /translation="MSIEKLKAALPEYAKDIKLNLSSITRSSVLDQEQLWGTLLASAA ATRNPQVLADIGAEATDHLSAAARHAALGAAAIMGMNNVFYRGRGFLEGRYDDLRPGL RMNIIANPGIPKANFELWSFAVSAINGCSHCLVAHEHTLRTVGVDREAIFEALKAAAI VSGVAQALATIEALSPS" gene complement(2727336..2727920) /gene="PPE41" /locus_tag="Rv2430c" /db_xref="GeneID:885945" CDS complement(2727336..2727920) /gene="PPE41" /locus_tag="Rv2430c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2430c, (MTCY428.16), len: 194 aa. Member of the Mycobacterium tuberculosis PPE family similar to others e.g. AAK46014|Rv1745|MT1745 from Mycobacterium tuberculosis (385 aa) FASTA scores: opt: 389, E(): 1.2e-17, (35.95% identity in 192 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177881.1" /db_xref="GI:57116988" /db_xref="UniProtKB/TrEMBL:Q79FE1" /db_xref="GeneID:885945" /translation="MHFEAYPPEVNSANIYAGPGPDSMLAAARAWRSLDVEMTAVQRS FNRTLLSLMDAWAGPVVMQLMEAAKPFVRWLTDLCVQLSEVERQIHEIVRAYEWAHHD MVPLAQIYNNRAERQILIDNNALGQFTAQIADLDQEYDDFWDEDGEVMRDYRLRVSDA LSKLTPWKAPPPIAHSTVLVAPVSPSTASSRTDT" gene complement(2727967..2728266) /gene="PE25" /locus_tag="Rv2431c" /db_xref="GeneID:885703" CDS complement(2727967..2728266) /gene="PE25" /locus_tag="Rv2431c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2431c, (MTCY428.15), len: 99 aa. Member of the Mycobacterium tuberculosis PE family (see Brennan & Delogu 2002), similar to others e.g. AAK47158|MT2839 from Mycobacterium tuberculosis (275 aa) FASTA scores: opt: 194, E(): 2.5e-06, (40.0% identity in 95 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177882.1" /db_xref="GI:57116989" /db_xref="UniProtKB/TrEMBL:Q7D756" /db_xref="GeneID:885703" /translation="MSFVITNPEALTVAATEVRRIRDRAIQSDAQVAPMTTAVRPPAA DLVSEKAATFLVEYARKYRQTIAAAAVVLEEFAHALTTGADKYATAEADNIKTFS" gene complement(2728437..2728847) /locus_tag="Rv2432c" /db_xref="GeneID:885075" CDS complement(2728437..2728847) /locus_tag="Rv2432c" /function="UNKNOWN" /note="Rv2432c, (MTCY428.14), len: 140 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216948.1" /db_xref="GI:15609569" /db_xref="UniProtKB/TrEMBL:P71917" /db_xref="GeneID:885075" /translation="MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEP GAMMGFPCRPALLPHLSRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCRHV RWWLASDGHWGMVSYIPTALNVSMGGIVGWRCVP" gene complement(2728844..2729134) /locus_tag="Rv2433c" /db_xref="GeneID:885579" CDS complement(2728844..2729134) /locus_tag="Rv2433c" /function="UNKNOWN" /note="Rv2433c, (MTCY428.13), len: 96 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216949.1" /db_xref="GI:15609570" /db_xref="UniProtKB/TrEMBL:P71916" /db_xref="GeneID:885579" /translation="MGLRDADERWDTVGQAIGLFLRGHTLRTAAPTALIVGTVLCAVN QGATLAEGAATIGTWVRMVINYLVPFLVASVGYLGARRGVRRASGRSDPSAQ" gene complement(2729115..2730560) /locus_tag="Rv2434c" /db_xref="GeneID:885885" CDS complement(2729115..2730560) /locus_tag="Rv2434c" /function="UNKNOWN" /note="Rv2434c, (MTCY428.12), len: 481 aa. Probable conserved transmembrane protein, with some similarity to BAB48444|MLR0973 PROBABLE INTEGRAL MEMBRANE PROTEIN from Rhizobium loti (410 aa), FASTA scores: opt: 298, E(): 4.1e-11, (27.25% identity in 389 aa overlap); and also similarity with other hypothetical proteins and/or putative integral membrane proteins." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_216950.1" /db_xref="GI:15609571" /db_xref="GOA:P71915" /db_xref="UniProtKB/TrEMBL:P71915" /db_xref="GeneID:885885" /translation="MNLLDSTWFYWAVGIAIGLPAGLIVLTELHNILVRRNSHLARQA SLLRNYLLPLGAVLLLLVKASEVPAEDPTVRVLTTAFGFLVLVLLLSLLNATLFQGAP QQSWRKRLPAIFVDVARFALIGIGLAVILSYIWGVRVGGLFAALGVTSVVIGLMLQNS VGQIVSGLFMLFEQPFRIDDWLETPTARGRVVEVNWRAVHIDTGSGLQIMPNSMLATT AFTNLSRPAGAHECSITTTFSTSDPPDKVCAMLNRAASALPHVKPGVVPATIARGAAE YRTTVRLTSPADEGPTQATFLRWVWYAARREGLHLDEADDEFSTAERVESALRTVVGP ELRLSSSDQQSLARYARLVRYGTDEIVQHAGVVPMGITFVIAGSVRLTVTTDDGSVVA IATLKKGTFLGLTALTRQPDPAGAVALEEVTALQIGREHLEQVVMNKPMLLQELGRVI DERQRKAQQAIRRDLHQSPAAAGEHRGPARR" gene complement(2730557..2732749) /locus_tag="Rv2435c" /db_xref="GeneID:885891" CDS complement(2730557..2732749) /locus_tag="Rv2435c" /EC_number="4.6.1.-" /function="GENERATES 3,'5'-CYCLIC (A/G)MP AND DIPHOSPHATE (OR PYROPHOSPHATE) FROM (A/G)TP." /note="Rv2435c, (MTCY428.11), len: 730 aa. Probable cyclase (adenylyl- or guanylyl-cyclase; EC 4.6.1.1 or 4.6.1.2 respectively); C-terminal domain (aa 500-730) similar to domain at C-terminus of a series of adenylate/guanylate cyclases (EC 4.6.-.-) e.g. O30820|CYA AAK45931|MT1661 from Mycobacterium tuberculosis (443 aa) FASTA scores: opt: 446, E(): 1.3e-19, (30.55% identity in 301 aa overlap); BAB50179|MLL3242 CYCLASE (ADENYLYL OR GUANYLYL) from Rhizobium loti (356 aa), FASTA scores: opt: 372, E(): 3.4e-15, (28.75% identity in 219 aa overlap); etc. BELONGS TO ADENYLYL CYCLASE CLASS-4/GUANYLYL CYCLASE FAMILY." /codon_start=1 /transl_table=11 /product="cyclase" /protein_id="NP_216951.1" /db_xref="GI:15609572" /db_xref="GOA:P71914" /db_xref="UniProtKB/TrEMBL:P71914" /db_xref="GeneID:885891" /translation="MTSGEALDSVAESESTPAKKRHKNVLRRRPRFRASIQSKLMVLL LLTSIVSVAAIAAIVYQSGRTSLRAAAYERLTQLRESQKRAVETLFSDLTNSLVIYER GLTVVDAVVRFTAGFDQLADATISPAQQQAIVNYYNNEFITPVERTTGDKLDITALLP TSPAQRYLQAYYTAPFTSDQDAMRLDDAGDGSAWSAANAQFNSYFREIVTRFDYDDAV LLDTRGNIVYTLSKDPDLGTNILTGPYRESNLRDAYLKALGANAVDFTWITDFKPYQP QLGVPTAWLVAPVEAGGKTQGVLALPLPIDKINKIMTADRQWQAAGMGSGTETYLAGP DSLMRSDSRLFLQDPEEYRKQVVAAGTSLDVVNRAIQFGGTTLLQPVATEGLRAAQRG QTGTVTSTDYTGSRELEAYAPLNVPDSDLHWSILATRNDSEAFAAVASFSRALVLVTV GIIVVICVASMLIAHAMVRPIRRLEVGTQKISAGDYEVNIPVKSRDEIGDLTAAFNEM SRNLQTKEELLNEQRKENDRLLLSMMPEPVVERYRLGEQTIAQEHQDVTVLFADILGV DEISSGLSGNELVKIVDELVRQFDSAAEHLGVERIRTLHNGYLAGCGVTTPRLDNIPR TVDFALEMRRIVDRFNCQTGNDLHLRVGINTGDVISGLVGRSSVVYDMWGAAVSLAYQ MHSGSPQPGIYVTSQVYEAMRDVWQFTAAGTISVGGLEEPIYRLSERS" gene 2733230..2734144 /gene="rbsK" /locus_tag="Rv2436" /db_xref="GeneID:885671" CDS 2733230..2734144 /gene="rbsK" /locus_tag="Rv2436" /EC_number="2.7.1.15" /function="INVOLVED IN RIBOSE METABOLISM (IN THE FIRST STEP) [CATALYTIC ACTIVITY: ATP + D-RIBOSE = ADP + D-RIBOSE 5-PHOSPHATE]." /note="Rv2436, (MTCY428.10c), len: 304 aa. Probable rbsK, ribokinase (EC 2.7.1.15), similar to others e.g. Q9RZ99|DRA0055 from Deinococcus radiodurans (300 aa) FASTA scores: opt: 485, E(): 9.1e-21, (44.55% identity in 301 aa overlap); P36945|P96733|RBSK_BACSU from Bacillus subtilis (293 aa), FASTA scores: opt: 398, E(): 8.5e-16, (36.35% identity in 297 aa overlap); P05054|RBSK_ECOLI|B3752|Z5253|ECS4694 from Escherichia coli strain K12 (309 aa), FASTA scores: opt: 387, E(): 3.8e-15, (34.7% identity in 314 aa overlap); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1. BELONGS TO THE PFKB FAMILY OF CARBOHYDRATE KINASES." /codon_start=1 /transl_table=11 /product="ribokinase RBSK" /protein_id="NP_216952.1" /db_xref="GI:15609573" /db_xref="GOA:P71913" /db_xref="UniProtKB/TrEMBL:P71913" /db_xref="GeneID:885671" /translation="MANASETNVGPMAPRVCVVGSVNMDLTFVVDALPRPGETVLAAS LTRTPGGKGANQAVAAARAGAQVQFSGAFGDDPAAAQLRAHLRANAVGLDRTVTVPGP SGTAIIVVDASAENTVLVAPGANAHLTPVPSAVANCDVLLTQLEIPVATALAAARAAQ SADAVVMVNASPAGQDRSSLQDLAAIADVVIANEHEANDWPSPPTHFVITLGVRGARY VGADGVFEVPAPTVTPVDTAGAGDVFAGVLAANWPRNPGSPAERLRALRRACAAGALA TLVSGVGDCAPAAAAIDAALRANRHNGS" misc_feature 2733377..2733451 /gene="rbsK" /locus_tag="Rv2436" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene 2734376..2734795 /locus_tag="Rv2437" /db_xref="GeneID:885906" CDS 2734376..2734795 /locus_tag="Rv2437" /function="UNKNOWN" /note="Rv2437, (MTCY428.09c), len: 139 aa. Conserved hypothetical protein, with some similarity to CONSERVED HYPOTHETICAL PROTEINS e.g. O06539|RV1139C|MTCI65.06c from Mycobacterium tuberculosis (166 aa); AAK45430|MT1172 from Mycobacterium tuberculosis (124 aa), FASTA scores: opt: 166, E(): 0.00013, (35.7% identity in 112 aa overlap); BAB48937|Mlr1600 from Rhizobium loti (222 aa), FASTA scores: opt: 163 ,E(): 0.00033, (28.1% identity in 121 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216953.1" /db_xref="GI:15609574" /db_xref="GOA:P71912" /db_xref="UniProtKB/TrEMBL:P71912" /db_xref="GeneID:885906" /translation="MLQRTNVVQPLNTLRMVWIQVAGIIPATAGIAATVYAQLAMGDS WRIGVDEQENTTLVRTGPFKWVRHPIYTAMMAFGLGLLLVTPNLVALAGFILLVATLE VHVRRVEEPYLLRTHSAVYRGYTASVGRFVPGVGLIR" gene complement(2734792..2736831) /gene="nadE" /locus_tag="Rv2438c" /db_xref="GeneID:885808" CDS complement(2734792..2736831) /gene="nadE" /locus_tag="Rv2438c" /EC_number="6.3.1.5" /function="INVOLVED IN BIOSYNTHESIS OF NAD. CAN USE BOTH GLUTAMINE OR AMMONIA AS A NITROGEN SOURCE [CATALYTIC ACTIVITY: ATP + DEAMIDO-NAD(+) + L-GLUTAMINE + H(2)O = AMP + DIPHOSPHATE + NAD(+) + L-GLUTAMATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of nicotinamide adenine dinucleotide (NAD) from nicotinic acid adenine dinucleotide (NAAD) using either ammonia or glutamine as the amide donor and ATP; ammonia-utilizing enzymes include the ones from Bacillus and Escherichia coli while glutamine-utilizing enzymes include the Mycobacterial one; forms homodimers" /codon_start=1 /transl_table=11 /product="NAD synthetase" /protein_id="NP_216954.2" /db_xref="GI:57116990" /db_xref="GOA:P71911" /db_xref="UniProtKB/Swiss-Prot:P71911" /db_xref="GeneID:885808" /translation="MNFYSAYQHGFVRVAACTHHTTIGDPAANAASVLDMARACHDDG AALAVFPELTLSGYSIEDVLLQDSLLDAVEDALLDLVTESADLLPVLVVGAPLRHRHR IYNTAVVIHRGAVLGVVPKSYLPTYREFYERRQMAPGDGERGTIRIGGADVAFGTDLL FAASDLPGFVLHVEICEDMFVPMPPSAEAALAGATVLANLSGSPITIGRAEDRRLLAR SASARCLAAYVYAAAGEGESTTDLAWDGQTMIWENGALLAESERFPKGVRRSVADVDT ELLRSERLRMGTFDDNRRHHRELTESFRRIDFALDPPAGDIGLLREVERFPFVPADPQ RLQQDCYEAYNIQVSGLEQRLRALDYPKVVIGVSGGLDSTHALIVATHAMDREGRPRS DILAFALPGFATGEHTKNNAIKLARALGVTFSEIDIGDTARLMLHTIGHPYSVGEKVY DVTFENVQAGLRTDYLFRIANQRGGIVLGTGDLSELALGWSTYGVGDQMSHYNVNAGV PKTLIQHLIRWVISAGEFGEKVGEVLQSVLDTEITPELIPTGEEELQSSEAKVGPFAL QDFSLFQVLRYGFRPSKIAFLAWHAWNDAERGNWPPGFPKSERPSYSLAEIRHWLQIF VQRFYSFSQFKRSALPNGPKVSHGGALSPRGDWRAPSDMSARIWLDQIDREVPKG" misc_feature complement(2735548..2735580) /gene="nadE" /locus_tag="Rv2438c" /note="PS00591 Glycosyl hydrolases family 10 active site" gene 2736709..2736987 /locus_tag="Rv2438A" /db_xref="GeneID:3205071" CDS 2736709..2736987 /locus_tag="Rv2438A" /function="UNKNOWN" /note="Rv2438A, len: 92 aa. Conserved hypothetical protein, showing few similarity with various enzymes e.g. part of O83441|VAA1_TREPA|ATPA1|TP0426 V-TYPE ATP SYNTHASE ALPHA CHAIN 1 (EC 3.6.1.34) from Treponema pallidum (589 aa), FASTA scores: opt: 110, E(): 1.5, (40.3% identity in 72 aa overlap); N-terminus of O95178|NIGM_HUMAN NADH-UBIQUINONE OXIDOREDUCTASE AGGG SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 1.6.99.3) from Homo sapiens (105 aa), FASTA scores: opt: 109, E(): 1.5, (35.5% identity in 62 aa overlap); N-terminus of Q9HJ76|TA1096 PROBABLE GLYCEROL KINASE from Thermoplasma acidophilum (488 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177671.1" /db_xref="GI:57116991" /db_xref="UniProtKB/TrEMBL:Q79FD9" /db_xref="GeneID:3205071" /translation="MARTGHVQYRRGVGRRVTDGGVVSAGGNAHEPVLVGGVKVHRPF IVAQRRQNARITRRVSTLDTVESPALLADGGIDRRGDATDWAAADPGP" gene complement(2737117..2738247) /gene="proB" /locus_tag="Rv2439c" /db_xref="GeneID:885266" CDS complement(2737117..2738247) /gene="proB" /locus_tag="Rv2439c" /EC_number="2.7.2.11" /function="INVOLVED IN PROLINE BIOSYNTHESIS PATHWAY (AT THE FIRST STEP). CATALYZES THE TRANSFER OF A PHOSPHATE GROUP TO GLUTAMATE TO FORM GLUTAMATE 5-PHOSPHATE WHICH RAPIDLY CYCLIZES TO 5-OXOPROLINE [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE = ADP + L-GLUTAMATE 5- PHOSPHATE]." /note="catalyzes the formation of glutamate 5-phosphate from glutamate in proline biosynthesis" /codon_start=1 /transl_table=11 /product="gamma-glutamyl kinase" /protein_id="NP_216955.1" /db_xref="GI:15609576" /db_xref="GOA:P71910" /db_xref="UniProtKB/Swiss-Prot:P71910" /db_xref="GeneID:885266" /translation="MRSPHRDAIRTARGLVVKVGTTALTTPSGMFDAGRLAGLAEAVE RRMKAGSDVVIVSSGAIAAGIEPLGLSRRPKDLATKQAAASVGQVALVNSWSAAFARY GRTVGQVLLTAHDISMRVQHTNAQRTLDRLRALHAVAIVNENDTVATNEIRFGDNDRL SALVAHLVGADALVLLSDIDGLYDCDPRKTADATFIPEVSGPADLDGVVAGRSSHLGT GGMASKVAAALLAADAGVPVLLAPAADAATALADASVGTVFAARPARLSARRFWVRYA AEATGALTLDAGAVRAVVRQRRSLLAAGITAVSGRFCGGDVVELRAPDAAMVARGVVA YDASELATMVGRSTSELPGELRRPVVHADDLVAVSAKQAKQV" misc_feature complement(2737543..2737608) /gene="proB" /locus_tag="Rv2439c" /note="PS00902 Glutamate 5-kinase signature" gene complement(2738247..2739686) /gene="obgE" /locus_tag="Rv2440c" /gene_synonym="cgtA" /gene_synonym="obg" /gene_synonym="yhbZ" /db_xref="GeneID:885900" CDS complement(2738247..2739686) /gene="obgE" /locus_tag="Rv2440c" /gene_synonym="cgtA" /gene_synonym="obg" /gene_synonym="yhbZ" /function="ESSENTIAL GTP-BINDING PROTEIN." /note="essential GTPase; exhibits high exchange rate for GTP/GDP; associates with 50S ribosomal subunit; involved in regulation of chromosomal replication" /codon_start=1 /transl_table=11 /product="GTPase ObgE" /protein_id="NP_216956.1" /db_xref="GI:15609577" /db_xref="GOA:P71909" /db_xref="UniProtKB/TrEMBL:P71909" /db_xref="GeneID:885900" /translation="MPRFVDRVVIHTRAGSGGNGCASVHREKFKPLGGPDGGNGGRGG SIVFVVDPQVHTLLDFHFRPHLTAASGKHGMGNNRDGAAGADLEVKVPEGTVVLDENG RLLADLVGAGTRFEAAAGGRGGLGNAALASRVRKAPGFALLGEKGQSRDLTLELKTVA DVGLVGFPSAGKSSLVSAISAAKPKIADYPFTTLVPNLGVVSAGEHAFTVADVPGLIP GASRGRGLGLDFLRHIERCAVLVHVVDCATAEPGRDPISDIDALETELACYTPTLQGD AALGDLAARPRAVVLNKIDVPEARELAEFVRDDIAQRGWPVFCVSTATRENLQPLIFG LSQMISDYNAARPVAVPRRPVIRPIPVDDSGFTVEPDGHGGFVVSGARPERWIDQTNF DNDEAVGYLADRLARLGVEEELLRLGARSGCAVTIGEMTFDWEPQTPAGEPVAMSGRG TDPRLDSNKRVGAAERKAARSRRREHGDG" misc_feature complement(2739168..2739191) /gene="obgE" /locus_tag="Rv2440c" /gene_synonym="cgtA" /gene_synonym="obg" /gene_synonym="yhbZ" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2739772..2740032) /gene="rpmA" /locus_tag="Rv2441c" /db_xref="GeneID:885919" CDS complement(2739772..2740032) /gene="rpmA" /locus_tag="Rv2441c" /function="INVOLVED IN TRANSLATION MECHANISMS." /note="involved in the peptidyltransferase reaction during translation" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L27" /protein_id="NP_216957.1" /db_xref="GI:15609578" /db_xref="GOA:P66127" /db_xref="UniProtKB/Swiss-Prot:P66127" /db_xref="GeneID:885919" /translation="MAHKKGASSSRNGRDSAAQRLGVKRYGGQVVKAGEILVRQRGTK FHPGVNVGRGGDDTLFAKTAGAVEFGIKRGRKTVSIVGSTTA" misc_feature complement(2739889..2739933) /gene="rpmA" /locus_tag="Rv2441c" /note="PS00831 Ribosomal protein L27 signature" gene complement(2740047..2740361) /gene="rplU" /locus_tag="Rv2442c" /db_xref="GeneID:885715" CDS complement(2740047..2740361) /gene="rplU" /locus_tag="Rv2442c" /function="INVOLVED IN TRANSLATION MECHANISMS." /experiment="experimental evidence, no additional details recorded" /note="Rv2442c, (MTCY428.04), len: 104 aa. Probable rplU, 50S RIBOSOMAL PROTEIN L21, equivalent to Q9CBZ2|RL21_MYCLE from Mycobacterium leprae (103 aa), FASTA scores: opt: 579, E(): 4.8e-31, (91.1% identity in 102 aa overlap). Also highly similar to others e.g. P95756|RL21_STRGR from Streptomyces griseus (106 aa), FASTA scores: opt: 362, E(): 5.4e-17, (56.0% identity in 100 aa overlap); etc." /codon_start=1 /transl_table=11 /product="50S ribosomal protein L21" /protein_id="NP_216958.1" /db_xref="GI:15609579" /db_xref="GOA:P66117" /db_xref="UniProtKB/Swiss-Prot:P66117" /db_xref="GeneID:885715" /translation="MMATYAIVKTGGKQYKVAVGDVVKVEKLESEQGEKVSLPVALVV DGATVTTDAKALAKVAVTGEVLGHTKGPKIRIHKFKNKTGYHKRQGHRQQLTVLKVTG IA" gene 2740709..2742184 /gene="dctA" /locus_tag="Rv2443" /db_xref="GeneID:885745" CDS 2740709..2742184 /gene="dctA" /locus_tag="Rv2443" /function="INVOLVED IN THE TRANSPORT OF DICARBOXYLATES SUCH AS SUCCINATE, FUMARATE, AND MALATE FROM THE PERIPLASM ACROSS THE INNER MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2443, (MTCY428.03c), len: 491 aa. Probable dctA, C4-dicarboxylate-transport transmembrane protein, similar to other C4-DICARBOXYLATE TRANSPORT PROTEINS e.g. AAK46817|MT2519 from Mycobacterium tuberculosis strain CDC1551 (491 aa); Q9L1K8|SC6A11.12 PUTATIVE SODIUM:DICARBOXYLATE SYMPORTER from Streptomyces coelicolor (466 aa), FASTA scores: opt: 1797, E(): 2.9e-98, (61.3% identity in 452 aa overlap); Q9RRG7|DR2525 from Deinococcus radiodurans (463 aa); P50334|DCTA_SALTY from Salmonella typhimurium (428 aa) FASTA scores: opt: 1241, E(): 1.3e-65, (47.2% identity in 415 aa overlap); etc. BELONGS TO THE SODIUM DICARBOXYLATE SYMPORTER FAMILY (SDF) (DAACS FAMILY)." /codon_start=1 /transl_table=11 /product="C4-dicarboxylate-transport transmembrane protein DctA" /protein_id="NP_216959.1" /db_xref="GI:15609580" /db_xref="GOA:P71906" /db_xref="UniProtKB/TrEMBL:P71906" /db_xref="GeneID:885745" /translation="MTAPLDRAPVTDLPANNKGRDRTHWLYLAVIFAVIAGVIVGLTA PSTGKSLTVLGTVFVNLIKMMIAPVIFCTIVLGIGSVRKAAAVGKVGGLALAYFLTMS SVALGIGLIVGNLLSPGRDLHLRPGAVGSGAALAGQAAESHGIAGFIQQIIPRSLPSA LTEGNVLQVLLVALLVGFAVQGLGPAGESILRAVENLQKLVFKVLVMVLWLAPIGAFG AIANIVATTGFNAVTNLLLLMAGFYLTCVVFVFGVLGVLLRIVSGLSIFRLLRYLARE YLLIFATSSSEVVLPRLITKMKHLGVQSSTVGVVVPTGYSFNLDGTAIYLTMASLFIA DAMGHRLTWGEQIALLAFMIIASKGAAGVSGAGLATLAGGLQAHRPELLDGVGLIVGI DRFMSEARSLTNFSGNAVATILVASWTKTIDLSKADEVLRGRDPFDESTMVDPHDEEP PAATPHGGGVPTNPALCDFEQVSLGGLVGRPAGPQRADVDG" gene complement(2742123..2744984) /gene="rne" /locus_tag="Rv2444c" /db_xref="GeneID:885911" CDS complement(2742123..2744984) /gene="rne" /locus_tag="Rv2444c" /EC_number="3.1.-.-" /function="THOUGHT TO BE INVOLVED IN SEVERAL CELLULAR PROCESS." /experiment="experimental evidence, no additional details recorded" /note="Rv2444c, (MTCY428.02), len: 953 aa. Possible rne, ribonuclease E (EC 3.1.-.-), highly similar to others e.g. Q9CBZ1|ML1468 POSSIBLE RIBONUCLEASE from Mycobacterium leprae (924 aa), FASTA scores: opt: 3713, E(): 2.4e-174, (74.2% identity in 966 aa overlap); Q9SI08|AT2G04270 PUTATIVE RIBONUCLEASE E from Arabidopsis thaliana (502 aa), FASTA scores: opt: 674, E(): 7.5e-26, (31.2% identity in 410 aa overlap); etc. Similar at C-terminal end to P21513|RNE_ECOLI|AMS|HMP1|B1084 ribonuclease E (EC 3.1.4.-) (RNASE E) from Escherichia coli strain K12 (1061 aa), FASTA scores: opt: 554, E(): 9.9e-20, (37.8% identity in 386 aa overlap). Also similar in medium part to several cytoplasmic axial filament proteins e.g. Q9HVU4|CAFA|PA4477 from Pseudomonas aeruginosa (485 aa), FASTA scores: opt: 664, E(): 2.3e-25, (42.8% identity in 418 aa overlap); etc. Equivalent to AAK46818 from Mycobacterium tuberculosis strain CDC1551 (621 aa) but longer 332 aa in N-terminal part. SEEMS TO BELONG TO THE RNE FAMILY." /codon_start=1 /transl_table=11 /product="ribonuclease E" /protein_id="NP_216960.1" /db_xref="GI:15609581" /db_xref="GOA:P71905" /db_xref="UniProtKB/TrEMBL:P71905" /db_xref="GeneID:885911" /translation="MIDGAPPSDPPEPSQHEELPDRLRVHSLARTLGTTSRRVLDALT ALDGRVRSAHSTVDRVDAVRVRDLLATHLETAGVLAASVHAPEASEEPESRLMLETQE TRNADVERPHYMPLFVAPQPIPEPLADDEDVDDGPDYVADDSDADDEGQLDRPANRRR RRGRRGRGRGRGEQGGSDGDPVDQQSEPRAQQFTSADAAETDDGDDRDSEDTEAGDNG EDENGSLEAGNRRRRRRRRRKSASGDDNDAALEGPLPDDPPNTVVHERVPRAGDKAGN SQDGGSGSTEIKGIDGSTRLEAKRQRRRDGRDAGRRRPPVLSEAEFLARREAVERVMV VRDRVRTEPPLPGTRYTQIAVLEDGIVVEHFVTSAASASLVGNIYLGIVQNVLPSMEA AFVDIGRGRNGVLYAGEVNWDAAGLGGADRKIEQALKPGDYVVVQVSKDPVGHKGARL TTQVSLAGRFLVYVPGASSTGISRKLPDTERQRLKEILREVVPSDAGVIIRTASEGVK EDDIRADVARLRERWEQIEAKAQETKEKAAGAAVALYEEPDVLVKVIRDLFNEDFVGL IVSGDEAWNTINEYVNSVAPELVSKLTKYESADGPDGQSAPDVFTVHRIDEQLAKAMD RKVWLPSGGTLVIDRTEAMTVIDVNTGKFTGAGGNLEQTVTKNNLEAAEEIVRQLRLR DIGGIVVIDFIDMVLESNRDLVLRRLTESLARDRTRHQVSEVTSLGLVQLTRKRLGTG LIEAFSTSCPNCSGRGILLHADPVDSAAATGRKSEPGARRGKRSKKSRSEESSDRSMV AKVPVHAPGEHPMFKAMAAGLSSLAGRGDEESGEPAAELAEQAGDQPPTDLDDTAQAD FEDTEDTDEDEDELDADEDLEDLDDEDLDEDLDVEDSDSDDEDSDEDAADADVDEEDA AGLDGSPGEVDVPGVTELAPTRPRRRVAGRPAGPPIRLD" gene complement(2745314..2745724) /gene="ndk" /locus_tag="Rv2445c" /db_xref="GeneID:885905" CDS complement(2745314..2745724) /gene="ndk" /locus_tag="Rv2445c" /EC_number="2.7.4.6" /function="MAJOR ROLE IN THE SYNTHESIS OF NUCLEOSIDE TRIPHOSPHATES OTHER THAN ATP [CATALYTIC ACTIVITY: ATP + NUCLEOSIDE DIPHOSPHATE = ADP + NUCLEOSIDE TRIPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of nucleoside triphosphate from ATP and nucleoside diphosphate" /codon_start=1 /transl_table=11 /product="nucleoside diphosphate kinase" /protein_id="NP_216961.1" /db_xref="GI:15609582" /db_xref="GOA:P71904" /db_xref="UniProtKB/Swiss-Prot:P71904" /db_xref="GeneID:885905" /translation="MTERTLVLIKPDGIERQLIGEIISRIERKGLTIAALQLRTVSAE LASQHYAEHEGKPFFGSLLEFITSGPVVAAIVEGTRAIAAVRQLAGGTDPVQAAAPGT IRGDFALETQFNLVHGSDSAESAQREIALWFPGA" gene complement(2745767..2746138) /locus_tag="Rv2446c" /db_xref="GeneID:885916" CDS complement(2745767..2746138) /locus_tag="Rv2446c" /function="UNKNOWN" /note="Rv2446c, (MTV008.02c), len: 123 aa. Probable conserved integral membrane protein, highly similar to Q9CBY9|ML1470 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (123 aa), FASTA scores: opt: 468, E(): 6.7e-23, (66.65% identity in 108 aa overlap). Also similar to Q9L1G5|SCC88.24c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (118 aa), FASTA scores: opt: 130, E(): 0.13, (37.2% identity in 86 aa overlap); and some similarity to O06852|Y13070 hypothetical Streptomyces coelicolor gene also between fpgs and ndk genes (see citation below) (117 aa), FASTA scores: opt: 128, E(): 0.17, (36.0% identity in 86 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_216962.1" /db_xref="GI:15609583" /db_xref="UniProtKB/TrEMBL:O53173" /db_xref="GeneID:885916" /translation="MTDRSREPADPWKGFSAVMAATLILEAIVVLLAIPVVDAVGGGL RPASLGYLVGLAVLLILLTGLQRRPWAIWVNLGAQPVLVAGFAVYPGVGFIGVLFAAL WVLIAYLRAEVRRRRDYRVSQ" gene complement(2746135..2747598) /gene="folC" /locus_tag="Rv2447c" /db_xref="GeneID:885902" CDS complement(2746135..2747598) /gene="folC" /locus_tag="Rv2447c" /EC_number="6.3.2.17" /function="CONVERSION OF FOLATES TO POLYGLUTAMATE DERIVATIVES. BACTERIA REQUIRE FOLATE FOR THE BIOSYNTHESIS OF GLYCINE, METHIONINE, FORMYL-MET-TRNA, THYMIDYLATES, PURINES, AND PANTOTHENATE [CATALYTIC ACTIVITY: ATP + {TETRAHYDROFOLYL-[GLU]}(N) + L-GLUTAMATE = ADP + PHOSPHATE + {TETRAHYDROFOLYL-[GLU]}(N+1)]." /note="Rv2447c, (MTV008.03c), len: 487 aa. Probable folC, folylpolyglutamate synthase (EC 6.3.2.17), equivalent to Q9CBY8|FOLC|ML1471 from Mycobacterium leprae (485 aa), FASTA scores: opt: 2425, E(): 2.2e-134, (78.7% identity in 483 aa overlap). Also highly similar to others e.g. Q9L1G4|FPGS|O08416|Y13070 from Streptomyces coelicolor (444 aa), FASTA scores: opt: 774, E(): 6.3e-38, (53.9% identity in 462 aa overlap); P15925|FOLC_LACCA|FGS from Lactobacillus casei (428 aa), FASTA scores: opt: 631, E(): 1.4e-29, (34.55% identity in 437 aa overlap); Q05865|FOLC_BACSU from Bacillus subtilis (430 aa), FASTA scores: opt: 421, E(): 2.6e-17, (32.9% identity in 383 aa overlap); etc. Contains PS01012 Folylpolyglutamate synthase signature 2. BELONGS TO THE FOLYLPOLYGLUTAMATE SYNTHASE FAMILY." /codon_start=1 /transl_table=11 /product="folylpolyglutamate synthase protein FolC" /protein_id="NP_216963.1" /db_xref="GI:15609584" /db_xref="GOA:O53174" /db_xref="UniProtKB/TrEMBL:O53174" /db_xref="GeneID:885902" /translation="MNSTNSGPPDSGSATGVVPTPDEIASLLQVEHLLDQRWPETRID PSLTRISALMDLLGSPQRSYPSIHIAGTNGKTSVARMVDALVTALHRRTGRTTSPHLQ SPVERISIDGKPISPAQYVATYREIEPLVALIDQQSQASAGKGGPAMSKFEVLTAMAF AAFADAPVDVAVVEVGMGGRWDATNVINAPVAVITPISIDHVDYLGADIAGIAGEKAG IITRAPDGSPDTVAVIGRQVPKVMEVLLAESVRADASVAREDSEFAVLRRQIAVGGQV LQLQGLGGVYSDIYLPLHGEHQAHNAVLALASVEAFFGAGAQRQLDGDAVRAGFAAVT SPGRLERMRSAPTVFIDAAHNPAGASALAQTLAHEFDFRFLVGVLSVLGDKDVDGILA ALEPVFDSVVVTHNGSPRALDVEALALAAGERFGPDRVRTAENLRDAIDVATSLVDDA AADPDVAGDAFSRTGIVITGSVVTAGAARTLFGRDPQ" misc_feature complement(2747038..2747085) /gene="folC" /locus_tag="Rv2447c" /note="PS01012 Folylpolyglutamate synthase signature 2" gene complement(2747595..2750225) /gene="valS" /locus_tag="Rv2448c" /db_xref="GeneID:885892" CDS complement(2747595..2750225) /gene="valS" /locus_tag="Rv2448c" /EC_number="6.1.1.9" /function="INVOLVED IN TRANSLATION MECHANISMS [CATALYIC ACTIVITY: ATP + L-VALINE + TRNA(VAL) = AMP + DIPHOSPHATE + L-VALYL-TRNA(VAL)]." /note="valine--tRNA ligase; ValRS; converts valine ATP and tRNA(Val) to AMP PPi and valyl-tRNA(Val); class-I aminoacyl-tRNA synthetase type 1 subfamily; has a posttransfer editing process to hydrolyze mischarged Thr-tRNA(Val) which is done by the editing domain" /codon_start=1 /transl_table=11 /product="valyl-tRNA synthetase" /protein_id="NP_216964.1" /db_xref="GI:15609585" /db_xref="GOA:P67599" /db_xref="UniProtKB/Swiss-Prot:P67599" /db_xref="GeneID:885892" /translation="MLPKSWDPAAMESAIYQKWLDAGYFTADPTSTKPAYSIVLPPPN VTGSLHMGHALEHTMMDALTRRKRMQGYEVLWQPGTDHAGIATQSVVEQQLAVDGKTK EDLGRELFVDKVWDWKRESGGAIGGQMRRLGDGVDWSRDRFTMDEGLSRAVRTIFKRL YDAGLIYRAERLVNWSPVLQTAISDLEVNYRDVEGELVSFRYGSLDDSQPHIVVATTR VETMLGDTAIAVHPDDERYRHLVGTSLAHPFVDRELAIVADEHVDPEFGTGAVKVTPA HDPNDFEIGVRHQLPMPSILDTKGRIVDTGTRFDGMDRFEARVAVRQALAAQGRVVEE KRPYLHSVGHSERSGEPIEPRLSLQWWVRVESLAKAAGDAVRNGDTVIHPASMEPRWF SWVDDMHDWCISRQLWWGHRIPIWYGPDGEQVCVGPDETPPQGWEQDPDVLDTWFSSA LWPFSTLGWPDKTAELEKFYPTSVLVTGYDILFFWVARMMMFGTFVGDDAAITLDGRR GPQVPFTDVFLHGLIRDESGRKMSKSKGNVIDPLDWVEMFGADALRFTLARGASPGGD LAVSEDAVRASRNFGTKLFNATRYALLNGAAPAPLPSPNELTDADRWILGRLEEVRAE VDSAFDGYEFSRACESLYHFAWDEFCDWYLELAKTQLAQGLTHTTAVLAAGLDTLLRL LHPVIPFLTEALWLALTGRESLVSADWPEPSGISVDLVAAQRINDMQKLVTEVRRFRS DQGLADRQKVPARMHGVRDSDLSNQVAAVTSLAWLTEPGPDFEPSVSLEVRLGPEMNR TVVVELDTSGTIDVAAERRRLEKELAGAQKELASTAAKLANADFLAKAPDAVIAKIRD RQRVAQQETERITTRLAALQ" misc_feature complement(2750064..2750099) /gene="valS" /locus_tag="Rv2448c" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene complement(2750313..2751572) /locus_tag="Rv2449c" /db_xref="GeneID:885894" CDS complement(2750313..2751572) /locus_tag="Rv2449c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2449c, (MTV008.05c), len: 419 aa. Conserved hypothetical protein, highly similar to hypothetical proteins e.g. P95139|Rv2953|MTCY349.37c from M. tuberculosis (418 aa), FASTA scores: opt: 1829, E(): 4.7e-103, (67.3% identity in 419 aa overlap); AAK47353|MT3027 from Mycobacterium tuberculosis strain CDC1551 (418 aa), FASTA score: opt: 1829, E(): 4.7e-103, (67.3 identity in 419 aa overlap); Q9CD87|ML0129 from Mycobacterium leprae (418 aa), FASTA scores: opt: 1727, E(): 6.8e-97, (65.45% identity in 414 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216965.1" /db_xref="GI:15609586" /db_xref="UniProtKB/TrEMBL:O53176" /db_xref="GeneID:885894" /translation="MTATPREFDIVLYGATGFVGKLTAEYLARAGGDARIALAGRSTQ RVLAVREALGESAQTWPILTADASLPSTLQAMAARAQVVVTTVGPYTRYGLPLVAACA AAGTDYADLTGEPMFMRNSIDLYHKQAADTGARIVHACGFDSVPSDLSVYALYHAARE DGAGELTDTNCVVRSFKGGFSGGTIASMLEVLSTASNDPDARRQLSDPYMLSPDRGAE PELGPQPDLPSRRGRRLAPELAGVWTAGFIMAPTNTRIVRRSNALLDWAYGRRFRYSE TMSVGSTVLAPVVSVVGGGVGNAMFGLASRYIRLLPRGLVKRVVPKPGTGPSAAARER GYYRIETYTTTTTGARYLARMAQDGDPGYKATSVLLGECGLALALDRDKLSDMRGVLT PAAAMGDALLERLPAAGVSLQTTRLAS" gene complement(2751662..2752180) /gene="rpfE" /locus_tag="Rv2450c" /db_xref="GeneID:885760" CDS complement(2751662..2752180) /gene="rpfE" /locus_tag="Rv2450c" /function="THOUGHT TO PROMOTE THE RESUSCITATION AND GROWTH OF DORMANT, NONGROWING CELLS. COULD ALSO STIMULATE THE GROWTH OF SEVERAL OTHER HIGH G+C GRAM+ ORGANISMS, e.g. Mycobacterium avium, Mycobacterium bovis (BCG), Mycobacterium kansasii, Mycobacterium smegmatis." /experiment="experimental evidence, no additional details recorded" /note="Rv2450c, (MTV008.06c), len: 172 aa. Probable rpfE, resuscitation-promoting factor (see Mukamolova et al., 1998), similar to O86308|Z96935|MLRPF_1 RPF PROTEIN PRECURSOR from Micrococcus luteus (220 aa), FASTA scores: opt: 291, E(): 3e-7, (48.75% identity in 80 aa overlap). C-terminus is similar to other Mycobacterial rpf proteins e.g. O05594|Rv1009|MTCI237.26|RPFB PROBABLE RESUSCITATION-PROMOTING FACTOR from Mycobacterium tuberculosis (362 aa), FASTA scores: opt: 344, E(): 1.4e-09, (42.85% identity in 147 aa overlap); etc. C-terminal region similar to N-terminal region of Q9F2Q2|SCE41.06c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 355, E(): 3.1e-10, (56.65% identity in 90 aa overlap). Also similar to Q9F2Q1|SCE41.07c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (near Q9F2Q2|SCE41.06c) (341 aa) FASTA scores: opt: 317, E(): 2.5e-08, (51.7% identity in 87 aa overlap). With Mycobacterium leprae, high similarity between the two corresponding C-terminal regions of two HYPOTHETICAL PROTEINS, Q9CD53|ML0240 (375 aa), FASTA scores: opt: 339, E(): 2.5e-09, (59.15% identity in 93 aa overlap) and O33049|MLCB57.05c|ML2151 (174 aa), FASTA scores: opt: 329, E(): 4e-09, (58.14% identity in 86 aa overlap). Contains a possible secretory signal sequence in N-terminus. Possible autocrine and/or paracrine bacterial growth factor or cytokine (see citations below)." /codon_start=1 /transl_table=11 /product="resuscitation-promoting factor RpfE" /protein_id="NP_216966.1" /db_xref="GI:15609587" /db_xref="UniProtKB/TrEMBL:O53177" /db_xref="GeneID:885760" /translation="MKNARTTLIAAAIAGTLVTTSPAGIANADDAGLDPNAAAGPDAV GFDPNLPPAPDAAPVDTPPAPEDAGFDPNLPPPLAPDFLSPPAEEAPPVPVAYSVNWD AIAQCESGGNWSINTGNGYYGGLRFTAGTWRANGGSGSAANASREEQIRVAENVLRSQ GIRAWPVCGRRG" gene 2752262..2752660 /locus_tag="Rv2451" /db_xref="GeneID:886025" CDS 2752262..2752660 /locus_tag="Rv2451" /function="UNKNOWN" /note="Rv2451, (MTV008.07), len: 132 aa. Hypothetical unknown pro-, ser-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216967.1" /db_xref="GI:15609588" /db_xref="UniProtKB/TrEMBL:O53178" /db_xref="GeneID:886025" /translation="MGRAVSVRHGSGALDLPGAAASRRLRVGQPIQPSPAPLARGSVD SIVEISCCPSAGPRGPYDNDLDSSSPANRDISSITSRSRRGGTIVVAGQKCGFGSAVS LRPRRYREPNHANIVTPDTDLSPSWPWSGI" gene complement(2752848..2752994) /locus_tag="Rv2452c" /db_xref="GeneID:885858" CDS complement(2752848..2752994) /locus_tag="Rv2452c" /function="UNKNOWN" /note="Rv2452c, (MTV008.08c), len: 48 aa. Hypothetical unknown protein (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216968.1" /db_xref="GI:15609589" /db_xref="UniProtKB/TrEMBL:O53179" /db_xref="GeneID:885858" /translation="MAFRDILVLFSMKTLLTLAMAAASSTALTTVGVSGARLITYCVG VEDI" gene complement(2753018..2753623) /gene="mobA" /locus_tag="Rv2453c" /db_xref="GeneID:885904" CDS complement(2753018..2753623) /gene="mobA" /locus_tag="Rv2453c" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS. LINKS A GUANOSINE 5'-PHOSPHATE TO MOLYDOPTERIN (MPT) FORMING MOLYBDOPTERIN GUANINE DINUCLEOTIDE (MGD)." /note="MobA; links a guanosine 5'-phosphate to molydopterin to form molybdopterin guanine dinucleotide; involved in molybdenum cofactor biosynthesis" /codon_start=1 /transl_table=11 /product="molybdopterin-guanine dinucleotide biosynthesis protein A" /protein_id="NP_216969.1" /db_xref="GI:15609590" /db_xref="GOA:P65402" /db_xref="UniProtKB/Swiss-Prot:P65402" /db_xref="GeneID:885904" /translation="MAELAPDTVPLAGVVLAGGESRRMGRDKATLPLPGGTTTLVEHM VGILGQRCAPVFVMAAPGQPLPTLPVPVLRDELPGLGPLPATGRGLRAAAEAGVRLAF VCAVDMPYLTVELIEDLARRAVQTDAEVVLPWDGRNHYLAAVYRTDLADRVDTLVGAG ERKMSALVDASDALRIVMADSRPLTNVNSAAGLHAPMQPGR" gene complement(2753625..2754746) /locus_tag="Rv2454c" /db_xref="GeneID:887435" CDS complement(2753625..2754746) /locus_tag="Rv2454c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="catalyzes the coenzyme A dependent formation of succinyl-CoA from 2-oxoglutarate and ferredoxin" /codon_start=1 /transl_table=11 /product="2-oxoglutarate ferredoxin oxidoreductase subunit beta" /protein_id="NP_216970.1" /db_xref="GI:15609591" /db_xref="UniProtKB/TrEMBL:O53181" /db_xref="GeneID:887435" /translation="MTRSGDEAQLMTGVTGDLAGTELGLTPSLTKNAGVPTTDQPQKG KDFTSDQEVRWCPGCGDYVILNTIRNFLPELGLRRENIVFISGIGCSSRFPYYLETYG FHSIHGRAPAIATGLALAREDLSVWVVTGDGDALSIGGNHLIHALRRNINVTILLFNN RIYGLTKGQYSPTSEVGKVTKSTPMGSLDHPFNPVSLALGAEATFVGRALDSDRNGLT EVLRAAAQHRGAALVEILQDCPIFNDGSFDALRKEGAEERVIKVRHGEPIVFGANGEY CVVKSGFGLEVAKTADVAIDEIIVHDAQVDDPAYAFALSRLSDQNLDHTVLGIFRHIS RPTYDDAARSQVVAARNAAPSGTAALQSLLHGRDTWTVD" gene complement(2754743..2756704) /locus_tag="Rv2455c" /db_xref="GeneID:887370" CDS complement(2754743..2756704) /locus_tag="Rv2455c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2455c, (MTV008.11c), len: 653 aa. Probable oxidoreductase, alpha subunit (EC 1.-.-.-), similar to others e.g. Q9F2W6|SCD20.13c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (645 aa), FASTA scores: opt: 2017, E(): 1e-111, (66.45% identity in 617 aa overlap) alias Q9RKS4|STAH10.35c PUTATIVE OXIDOREDUCTASE ALPHA-SUBUNIT from Streptomyces coelicolor (630 aa), FASTA scores: opt: 2008, E(): 3.4e-111, (66.45% identity in 614 aa overlap); Q9YA13|APE2126 LONG HYPOTHETICAL 2-OXOACID--FERREDOXIN OXIDOREDUCTASE ALPHA CHAIN from Aeropyrum pernix (644 aa) FASTA scores: opt: 687, E(): 4.6e-33, (33.35% identity in 441 aa overlap); etc. Note that the downstream ORF (MTV008.10c|Rv2454c) is possibly an oxidoreductase beta subunit." /codon_start=1 /transl_table=11 /product="oxidoreductase alpha subunit" /protein_id="NP_216971.1" /db_xref="GI:15609592" /db_xref="GOA:O53182" /db_xref="UniProtKB/TrEMBL:O53182" /db_xref="GeneID:887370" /translation="MDPNGSGAGPESHDAAFHAAPDRQRLENVVIRFAGDSGDGMQLT GDRFTSEAALFGNDLATQPNYPAEIRAPAGTLPGVSSFQIQIADYDILTAGDRPDVLV AMNPAALKANIGDLPLGGMVIVNSDEFTKRNLTKVGYVTNPLESGELSDYVVHTVAMT TLTLGAVEAIGASKKDGQRAKNMFALGLLSWMYGRELEHSEAFIREKFARKPEIAEAN VLALKAGWNYGETTEAFGTTYEIPPATLPPGEYRQISGNTALAYGIVVAGQLAGLPVV LGSYPITPASDILHELSKHKNFNVVTFQAEDEIGGICAALGAAYGGALGVTSTSGPGI SLKSEALGLGVMTELPLLVIDVQRGGPSTGLPTKTEQADLLQALYGRNGESPVAVLAP RSPADCFETALEAVRIAVSYHTPVILLSDGAIANGSEPWRIPDVNALPPIKHTFAKPG EPFQPYARDRETLARQFAIPGTPGLEHRIGGLEAANGSGDISYEPTNHDLMVRLRQAK IDGIHVPDLEVDDPTGDAELLLIGWGSSYGPIGEACRRARRRGTKVAHAHLRYLNPFP ANLGEVLRRYPKVVAPELNLGQLAQVLRGKYLVDVQSVTKVKGVSFLADEIGRFIRAA LAGRLAELEQDKTLVARLSAATAGAGANG" gene complement(2756936..2758192) /locus_tag="Rv2456c" /db_xref="GeneID:887340" CDS complement(2756936..2758192) /locus_tag="Rv2456c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY SUGAR) ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2456c, (MTV008.12c), len: 418 aa. Probable conserved integral membrane transport protein, involved in a efflux system, weakly similar to many e.g. Q9RUR0|YD22_DEIRA|DR1322 PUTATIVE SUGAR EFFLUX TRANSPORTER from Deinococcus radiodurans (389 aa), FASTA scores: opt: 224, E(): 8.4e-06, (24.45% identity in 409 aa overlap); Q9UYY0|PAB0913 MULTIDRUG RESISTANCE PROTEIN from Pyrococcus abyssi (410 aa), FASTA scores: opt: 210, E(): 5.6e-05, (21.8% identity in 408 aa overlap); etc. Contains PS00216 Sugar transport proteins signature 1." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_216972.1" /db_xref="GI:15609593" /db_xref="GOA:O53183" /db_xref="UniProtKB/TrEMBL:O53183" /db_xref="GeneID:887340" /translation="MSGTVVAVPPRVARALDLLNFSLADVRDGLGPYLSIYLLLIHDW DQASIGFVMAVGGIAAIVAQTPIGALVDRTTAKRALVVAGAVLVTAAAVAMPLFAGLY SISVLQAVTGIASSVFAPALAAITLGAVGPQFFARRIGRNEAFNHAGNASAAGATGAL AYFFGPVVVFWVLAGMALISVLATLRIPPDAVDHDLARGMDHAPGEPHPQPSRFTVLA HNRELVIFGAAVVAFHFANAAMLPLVGELLALHNRDEGTALMSSCIVAAQVVMVPVAY VVGTRADAWGRKPIFLVGFAVLTARGFLYTLSDNSYWLVGVQLLDGIGAGIFGALFPL VVQDVTHGTGHFNISLGAVTTATGIGAALSNLVAGWIVVVAGYDAAFMSLGALAGAGF LLYLVAMPETVDSDVRVRSRPTLGGK" misc_feature complement(2757311..2757361) /locus_tag="Rv2456c" /note="PS00216 Sugar transport proteins signature 1" gene complement(2758208..2759488) /gene="clpX" /locus_tag="Rv2457c" /db_xref="GeneID:888167" CDS complement(2758208..2759488) /gene="clpX" /locus_tag="Rv2457c" /EC_number="3.4.-.-" /function="ATP-DEPENDENT SPECIFICITY COMPONENT OF THE CLP PROTEASE. IT DIRECTS THE PROTEASE TO SPECIFIC SUBSTRATES. CAN PERFORM CHAPERONE FUNCTIONS IN THE ABSENCE OF CLPP)." /note="binds and unfolds substrates as part of the ClpXP protease" /codon_start=1 /transl_table=11 /product="ATP-dependent protease ATP-binding subunit ClpX" /protein_id="NP_216973.1" /db_xref="GI:15609594" /db_xref="GOA:O53184" /db_xref="UniProtKB/Swiss-Prot:O53184" /db_xref="GeneID:888167" /translation="MARIGDGGDLLKCSFCGKSQKQVKKLIAGPGVYICDECIDLCNE IIEEELADADDVKLDELPKPAEIREFLEGYVIGQDTAKRTLAVAVYNHYKRIQAGEKG RDSRCEPVELTKSNILMLGPTGCGKTYLAQTLAKMLNVPFAIADATALTEAGYVGEDV ENILLKLIQAADYDVKRAETGIIYIDEVDKIARKSENPSITRDVSGEGVQQALLKILE GTQASVPPQGGRKHPHQEFIQIDTTNVLFIVAGAFAGLEKIIYERVGKRGLGFGAEVR SKAEIDTTDHFADVMPEDLIKFGLIPEFIGRLPVVASVTNLDKESLVKILSEPKNALV KQYIRLFEMDGVELEFTDDALEAIADQAIHRGTGARGLRAIMEEVLLPVMYDIPSRDD VAKVVVTKETVQDNVLPTIVPRKPSRSERRDKSA" misc_feature complement(2759105..2759128) /gene="clpX" /locus_tag="Rv2457c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 2759779..2760687 /gene="mmuM" /locus_tag="Rv2458" /db_xref="GeneID:885871" CDS 2759779..2760687 /gene="mmuM" /locus_tag="Rv2458" /EC_number="2.1.1.10" /function="CATALYZES METHYL TRANSFER FROM S-METHYLMETHIONINE OR S-ADENOSYLMETHIONINE (LESS EFFICIENT) TO HOMOCYSTEINE, SELENOHOMOCYSTEINE AND LESS EFFICIENTLY SELENOCYSTEINE [CATALYTIC ACTIVITY: S-ADENOSYL-L-METHIONINE + L-HOMOCYSTEINE = S-ADENOSYL-L-HOMOCYSTEINE + L-METHIONINE]." /note="converts homocysteine and S-adenosyl-methionine to methionine and S-adenosyl-homocysteine or S-methyl-methionine and homocysteine to two methionines" /codon_start=1 /transl_table=11 /product="homocysteine methyltransferase" /protein_id="NP_216974.1" /db_xref="GI:15609595" /db_xref="GOA:O53185" /db_xref="UniProtKB/TrEMBL:O53185" /db_xref="GeneID:885871" /translation="MELVSDSVLISDGGLATELEARGHDLSDPLWSARLLVDAPHAIT AVHTAYFRAGAQIATTASYQASFEGFAARGIGHDDATVLLRRSVELAQAARDEVGVGG LSVAASVGPYGAALADGSEYRGYYGLSVAALMKWHLPRLEVLVDAGADMLALETIPDI DEAEALVNLVRRLATPAWLSYTINGTRTRAGQPLTDAFAVAAGVPEIVAVGVNCCAPD DVLPAIAFAVAHTGKPVIVYPNSGEGWDGRRRAWVGPRRFSGSSGQLAREWVAAGARI VGGCCRVRPIDIAEIGRALTTAPPRG" gene 2760854..2762380 /locus_tag="Rv2459" /db_xref="GeneID:888191" CDS 2760854..2762380 /locus_tag="Rv2459" /function="THOUGHT TO BE INVOLVED IN A TRANSPORT SYSTEM ACROSS THE MEMBRANE (PERHAPS DRUG TRANSPORT): RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2459, (MTV008.15), len: 508 aa. Probable conserved integral membrane transport protein, member of major facilitator superfamily (MFS) possibly involved in drug transport, highly similar to many efflux proteins e.g. Q9RL22|SC5G9.04c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (489 aa), FASTA scores: opt: 788, E(): 1.3e-38, (34.45% identity in 412 aa overlap); Q9I428|PA1316 PROBABLE MFS TRANSPORTER from Pseudomonas aeruginosa (513 aa), FASTA scores: opt: 782, E(): 3.1e-38, (32.75% identity in 519 aa overlap); P39886|TCMA_STRGA tetracenomycin C resistance and export protein from Streptomyces glaucescens (538 aa), FASTA scores: opt: 752, E(): 1.8e-36, (31.7% identity in 511 aa overlap); etc. Also highly similar to AAK46687|MT2395 DRUG TRANSPORTER from Mycobacterium tuberculosis strain CDC1551 (537 aa), FASTA scores: opt: 1396, E(): 5.6e-74, (44.45% identity in 504 aa overlap); and P71879|Rv2333c|MTCY3G12.01 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis strain H37Rv (537 aa), FASTA scores: opt: 1385, E(): 2.5e-73, (44.25% identity in 504 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_216975.1" /db_xref="GI:15609596" /db_xref="GOA:O53186" /db_xref="UniProtKB/TrEMBL:O53186" /db_xref="GeneID:888191" /translation="MTPRQRLTVLATGLGIFMVFVDVNIVNVALPSIQKVFHTGEQGL QWAVAGYSLGMAAVLMSCALLGDRYGRRRSFVFGVTLFVVSSIVCVLPVSLAVFTVAR VIQGLGAAFISVLSLALLSHSFPNPRMKARAISNWMAIGMVGAASAPALGGLMVDGLG WRSVFLVNVPLGAIVWLLTLVGVDESQDPEPTQLDWVGQLTLIPAVALIAYTIIEAPR FDRQSAGFVAALLLAAGVLLWLFVRHEHRAAFPLVDLKLFAEPLYRSVLIVYFVVMSC FFGTLMVITQHFQNVRDLSPLHAGLMMLPVPAGFGVASLLAGRAVNKWGPQLPVLTCL AAMFIGLAIFAISMDHAHPVALVGLTIFGAGAGGCATPLLHLGMTKVDDGRAGMAAGM LNLQRSLGGIFGVAFLGTIVAAWLGAALPNTMADEIPDPIARAIVVDVIVDSANPHAH AAFIGPGHRITAAQEDEIVLAADAVFVSGIKLALGGAAVLLTGAFVLGWTRFPRTPAS" gene complement(2762531..2763175) /gene="clpP2" /locus_tag="Rv2460c" /db_xref="GeneID:888174" CDS complement(2762531..2763175) /gene="clpP2" /locus_tag="Rv2460c" /EC_number="3.4.21.92" /function="CLP CLEAVES PEPTIDES IN VARIOUS PROTEINS IN A PROCESS THAT REQUIRES ATP HYDROLYSIS. CLP MAY BE RESPONSIBLE FOR A FAIRLY GENERAL AND CENTRAL HOUSEKEEPING FUNCTION RATHER THAN FOR THE DEGRADATION OF SPECIFIC SUBSTRATES." /note="hydrolyzes proteins to small peptides; with the ATPase subunits ClpA or ClpX, ClpP degrades specific substrates" /codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease proteolytic subunit" /protein_id="NP_216976.1" /db_xref="GI:15609597" /db_xref="GOA:P63783" /db_xref="UniProtKB/Swiss-Prot:P63783" /db_xref="GeneID:888174" /translation="MNSQNSQIQPQARYILPSFIEHSSFGVKESNPYNKLFEERIIFL GVQVDDASANDIMAQLLVLESLDPDRDITMYINSPGGGFTSLMAIYDTMQYVRADIQT VCLGQAASAAAVLLAAGTPGKRMALPNARVLIHQPSLSGVIQGQFSDLEIQAAEIERM RTLMETTLARHTGKDAGVIRKDTDRDKILTAEEAKDYGIIDTVLEYRKLSAQTA" repeat_region 2762762..2763061 /note="300 bp direct repeat copy 1" misc_feature complement(2762837..2762872) /gene="clpP2" /locus_tag="Rv2460c" /note="PS00381 Endopeptidase Clp serine active site" gene complement(2763172..2763774) /gene="clpP" /locus_tag="Rv2461c" /db_xref="GeneID:888176" CDS complement(2763172..2763774) /gene="clpP" /locus_tag="Rv2461c" /EC_number="3.4.21.92" /function="CLP CLEAVES PEPTIDES IN VARIOUS PROTEINS IN A PROCESS THAT REQUIRES ATP HYDROLYSIS. CLP MAY BE RESPONSIBLE FOR A FAIRLY GENERAL AND CENTRAL HOUSEKEEPING FUNCTION RATHER THAN FOR THE DEGRADATION OF SPECIFIC SUBSTRATES." /note="hydrolyzes proteins to small peptides; with the ATPase subunits ClpA or ClpX, ClpP degrades specific substrates" /codon_start=1 /transl_table=11 /product="ATP-dependent Clp protease proteolytic subunit" /protein_id="YP_177883.1" /db_xref="GI:57116992" /db_xref="GOA:O53188" /db_xref="UniProtKB/Swiss-Prot:O53188" /db_xref="GeneID:888176" /translation="MSQVTDMRSNSQGLSLTDSVYERLLSERIIFLGSEVNDEIANRL CAQILLLAAEDASKDISLYINSPGGSISAGMAIYDTMVLAPCDIATYAMGMAASMGEF LLAAGTKGKRYALPHARILMHQPLGGVTGSAADIAIQAEQFAVIKKEMFRLNAEFTGQ PIERIEADSDRDRWFTAAEALEYGFVDHIITRAHVNGEAQ" repeat_region 2763397..2763696 /note="300 bp direct repeat copy 2" gene complement(2763891..2765291) /gene="tig" /locus_tag="Rv2462c" /db_xref="GeneID:888615" CDS complement(2763891..2765291) /gene="tig" /locus_tag="Rv2462c" /function="INVOLVED IN PROTEIN EXPORT. ACTS AS A CHAPERONE BY MAINTAINING THE NEWLY SYNTHESIZED PROTEIN IN AN OPEN CONFORMATION." /experiment="experimental evidence, no additional details recorded" /note="Tig; RopA; peptidyl-prolyl cis/trans isomerase; promotes folding of newly synthesized proteins; binds ribosomal 50S subunit; forms a homodimer" /codon_start=1 /transl_table=11 /product="trigger factor" /protein_id="NP_216978.1" /db_xref="GI:15609599" /db_xref="GOA:O53189" /db_xref="UniProtKB/Swiss-Prot:O53189" /db_xref="GeneID:888615" /translation="MKSTVEQLSPTRVRINVEVPFAELEPDFQRAYKELAKQVRLPGF RPGKAPAKLLEARIGREAMLDQIVNDALPSRYGQAVAESDVQPLGRPNIEVTKKEYGQ DLQFTAEVDIRPKISPPDLSALTVSVDPIEIGEDDVDAELQSLRTRFGTLTAVDRPVA VGDVVSIDLSATVDGEDIPNAAAEGLSHEVGSGRLIAGLDDAVVGLSADESRVFTAKL AAGEHAGQEAQVTVTVRSVKERELPEPDDEFAQLASEFDSIDELRASLSDQVRQAKRA QQAEQIRNATIDALLEQVDVPLPESYVQAQFDSVLHSALSGLNHDEARFNELLVEQGS SRAAFDAEARTASEKDVKRQLLLDALADELQVQVGQDDLTERLVTTSRQYGIEPQQLF GYLQERNQLPTMFADVRRELAIRAAVEAATVTDSDGNTIDTSEFFGKRVSAGEAEEAE PADEGAARAASDEATT" gene complement(2765331..2765404) /locus_tag="Rvnt26" /note="tRNA-Pro(TGG)" /db_xref="GeneID:2700470" tRNA complement(2765331..2765404) /locus_tag="Rvnt26" /product="tRNA-Pro" /note="codon recognized: CCA" /anticodon=(pos:2765368..2765370,aa:Pro) /db_xref="GeneID:2700470" gene 2765541..2765611 /locus_tag="Rvnt27" /note="tRNA-Gly(TCC)" /db_xref="GeneID:2700450" tRNA 2765541..2765611 /locus_tag="Rvnt27" /product="tRNA-Gly" /note="codon recognized: GGA" /anticodon=(pos:2765573..2765575,aa:Gly) /db_xref="GeneID:2700450" gene 2765655..2766839 /gene="lipP" /locus_tag="Rv2463" /db_xref="GeneID:888572" CDS 2765655..2766839 /gene="lipP" /locus_tag="Rv2463" /EC_number="3.1.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /note="Rv2463, (MTV008.19), len: 394 aa. Probable lipP, esterase (EC 3.1.-.-), lipase similar to others eg O87861|ESTA ESTERASE A from Streptomyces chrysomallus (389 aa), FASTA scores: opt: 964, E(): 1.9e-53, (44.35% identity in 399 aa overlap); Q9I4S7|PA1047 PROBABLE ESTERASE from Pseudomonas aeruginosa (392 aa), FASTA scores: opt: 863, E(): 4.6e-47, (40.05% identity in 377 aa overlap); Q53403|ESTC ESTERASE III from Pseudomonas fluorescens (382 aa), FASTA scores: opt: 753, E(): 3.9e-40, (36.3% identity in 380 aa overlap); etc." /codon_start=1 /transl_table=11 /product="esterase/lipase LipP" /protein_id="NP_216979.1" /db_xref="GI:15609600" /db_xref="GOA:O53190" /db_xref="UniProtKB/TrEMBL:O53190" /db_xref="GeneID:888572" /translation="MNQPDIKGSCASEFTKVRDAFERNFVLRNEVGAAVAVWVDGDLV VNLWGGSADAGGTRPWQHDTLATVLSGTKALTATCVHQLVDRGELDLHAPVARYWPEF GQAGKQAITLAMVMSHRSGAIGPRGRLGWEQVADWDFVCEQLAAAEPWWQPGAAQGYH MTTFGFILGEVFRRVTGRTVGQYLRTEIAEPLGADVHIGLHPGEQLRCADLVDKPHIR QLLADVQAPGYPTSLNEHPKAALSVSMGFAPDDELGSNDLQLWRQIEFPGTNGQVSAL GLATFYNGLAQEKLLSREHMELVRVSQGGFDTDLVLGPRVADHGWGLGYMLNQRGVNG PNPRIFGHGGLGGSFGFVDLEHRIGYAYVMNRFDATKANADPRSVVLSNEVYAALGVN RS" gene complement(2766859..2767665) /locus_tag="Rv2464c" /db_xref="GeneID:888500" CDS complement(2766859..2767665) /locus_tag="Rv2464c" /EC_number="3.2.2.-" /function="HYDROLYSES DNA (THIS ENZYME MAY PLAY A SIGNIFICANT ROLE IN PROCESSES LEADING TO RECOVERY FROM MUTAGENESIS AND/OR CELL DEATH BY ALKYLATING AGENTS)." /note="Rv2464c, (MT2539, MTV008.20c), len: 268 aa. Possible DNA glycosylase (EC 3.2.2.-), showing some similarity to several other DNA glycosylases e.g. Q9F308|SCC80.11c PUTATIVE DNA REPAIR HYDROLASE (FRAGMENT) from Streptomyces coelicolor (306 aa), FASTA scores: opt: 894, E(): 6.1e-51, (51.05% identity in 282 aa overlap); O50606|MUTM|FPG_THETH FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE (EC 3.2.2.23) from Thermus aquaticus (267 aa), FASTA scores: opt: 342, E(): 4.6e-15, (32.4% identity in 250 aa overlap); Q9RCW5|SCM10.34c PUTATIVE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Streptomyces coelicolor (287 aa), FASTA scores: opt: 321, E(): 1.1e-13, (29.35% identity in 259 aa overlap); etc. Identical to AAK46839|MT2539 FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE from Mycobacterium tuberculosis strain CDC1551. Also similar to other Mycobacterium tuberculosis DNA glycosylases e.g. MTCY71.37 (32.9% identity in 277 aa overlap). BELONGS TO THE FPG FAMILY." /codon_start=1 /transl_table=11 /product="DNA glycosylase" /protein_id="NP_216980.1" /db_xref="GI:15609601" /db_xref="GOA:P64158" /db_xref="UniProtKB/Swiss-Prot:P64158" /db_xref="GeneID:888500" /translation="MPEGHTLHRLARLHQRRFAGAPVSVSSPQGRFADSASALNGRVL RRASAWGKHLFHHYVGGPVVHVHLGLYGTFTEWARPTDGWLPEPAGQVRMRMVGAEFG TDLRGPTVCESIDDGEVADVVARLGPDPLRSDANPSSAWSRITKSRRPIGALLMDQTV IAGVGNVYRNELLFRHRIDPQRPGRGIGEPEFDAAWNDLVSLMKVGLRRGKIIVVRPE HDHGLPSYLPDRPRTYVYRRAGEPCRVCGGVIRTALLEGRNVFWCPVCQT" gene complement(2767671..2768159) /locus_tag="Rv2465c" /db_xref="GeneID:887225" CDS complement(2767671..2768159) /locus_tag="Rv2465c" /EC_number="5.3.1.6" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the interconversion of ribose 5-phosphate to ribulose 5-phosphate; enzyme from E. coli shows allose 6-phosphate isomerase activity" /codon_start=1 /transl_table=11 /product="ribose-5-phosphate isomerase B" /protein_id="YP_177884.1" /db_xref="GI:57116993" /db_xref="GOA:Q79FD7" /db_xref="UniProtKB/TrEMBL:Q79FD7" /db_xref="GeneID:887225" /translation="MSGMRVYLGADHAGYELKQRIIEHLKQTGHEPIDCGALRYDADD DYPAFCIAAATRTVADPGSLGIVLGGSGNGEQIAANKVPGARCALAWSVQTAALAREH NNAQLIGIGGRMHTVAEALAIVDAFVTTPWSKAQRHQRRIDILAEYERTHEAPPVPGA PA" gene complement(2768261..2768884) /locus_tag="Rv2466c" /db_xref="GeneID:888214" CDS complement(2768261..2768884) /locus_tag="Rv2466c" /function="UNKNOWN. SEEMS REGULATED BY SIGH (Rv3223c PRODUCT)." /experiment="experimental evidence, no additional details recorded" /note="Rv2466c, (MTV008.22c), len: 207 aa. Conserved hypothetical protein (see citation below), equivalent to Q9CBY0|ML1485 HYPOTHETICAL PROTEIN from Mycobacterium leprae (207 aa), FASTA scores: opt: 1154, E(): 1.1e-67, (80.6% identity in 206 aa overlap). Also highly similar to Q9L201|SC8E4A.04c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (216 aa), FASTA scores: opt: 789, E(): 4.6e-44, (57.9% identity in 213 aa overlap). Also similar to AAK46628|MT2344 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (230 aa), FASTA scores: opt: 324, E(): 6.1e-14, (30.4% identity in 194 aa overlap). Contains PS00195 Glutaredoxin active site." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216982.1" /db_xref="GI:15609603" /db_xref="GOA:O53193" /db_xref="UniProtKB/TrEMBL:O53193" /db_xref="GeneID:888214" /translation="MLEKAPQKSVADFWFDPLCPWCWITSRWILEVAKVRDIEVNFHV MSLAILNENRDDLPEQYREGMARAWGPVRVAIAAEQAHGAKVLDPLYTAMGNRIHNQG NHELDEVITQSLADAGLPAELAKAATSDAYDNALRKSHHAGMDAVGEDVGTPTIHVNG VAFFGPVLSKIPRGEEAGKLWDASVTFASYPHFFELKRTRTEPPQFD" misc_feature complement(2768798..2768830) /locus_tag="Rv2466c" /note="PS00195 Glutaredoxin active site" gene 2768986..2771571 /gene="pepN" /locus_tag="Rv2467" /db_xref="GeneID:887403" CDS 2768986..2771571 /gene="pepN" /locus_tag="Rv2467" /EC_number="3.4.11.2" /function="AMINOPEPTIDASE WITH BROAD SUBSTRATE SPECIFICITY TO SEVERAL PEPTIDES (COULD PREFERENTIALLY CLEAVE LEUCINE, ARGININE AND LYSINE IN PEPTIDE-BOND-CONTAINING SUBSTRATES)." /note="Rv2467, (MTV008.23), len: 861 aa. Probable pepN, aminopeptidase N (EC 3.4.11.2), equivalent to Q9CBX9|ML1486 PROBABLE AMINOPEPTIDASE from Mycobacterium leprae (862 aa), FASTA scores: opt: 4751,E(): 0, (83.3% identity in 862 aa overlap). Also highly similar to others e.g. Q11010|AMPN_STRLI|PEPN from Streptomyces lividans (857 aa), FASTA scores: opt: 2839, E(): 1.8e-170, (53.25% identity in 864 aa overlap); Q9L1Z2|PEPN from Streptomyces coelicolor (857 aa), FASTA scores: opt: 2834, E(): 3.8e-170, (53.1% identity in 864 aa overlap); P37896|AMPN_LACDL|PEPN from Lactobacillus delbrueckii (subsp. lactis) (842 aa), FASTA scores: opt: 719, E(): 2.4e-37, (31.65% identity in 439 aa overlap); etc. Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature. BELONGS TO PEPTIDASE FAMILY M1 (ZINC METALLOPROTEASE), ALSO KNOWN AS THE PEPN SUBFAMILY. Note that previously known as pepD.; pepD" /codon_start=1 /transl_table=11 /product="aminopeptidase N" /protein_id="YP_177885.1" /db_xref="GI:57116994" /db_xref="GOA:Q7D736" /db_xref="UniProtKB/TrEMBL:Q7D736" /db_xref="GeneID:887403" /translation="MALPNLTRDQAVERAALITVDSYQIILDVTDGNGAPGERTFRST TTVVFDALPGADTVIDISAHTVRRASLNDQDLDVSGYDEAAGIPLRGLAQRNVVVVDA DCHYSNTGEGLHRFVDPVDGETYLYSQFETADAKRMFACFDQPDLKATFDVRVTAPAH WKVISNGAPLAAANGVHTFATTPRMSTYLVALIAGPYAAWTDTYIDDHGEIPLGIYCR ASLAEYMDAERLFTQTKQGFGFYHKHFGLPYAFGKYDQLFVPEFNAGAMENAGAVTFL EDYVFRSKVTRASYERRAETVLHEMAHMWFGDLVTMTWWDDLWLNESFATFASVLCQS EATEFTEAWTTFATVEKSWAYRQDQLPSTHPIAADIPDLAAVEVNFDGITYAKGASVL KQLVAYVGLERFLAGLRDYFRTHAFGNASFDDLLAALEKASGRDLSNWGEQWLKTTGL NTLRPDFEVDAEGRFTRFAVTQSGAAPGAGETRVHRLAVGIYDDDGSKSSGKLVRVHR EELDVSGPITNVPALVGVSRGKLILVNDDDLTYCSLRLDERSLQTALDRIADIAEPLP RTLVWSAAWEMTREAELRARDFVSLVSGGVHAETEVGVAQRLLLQAQTALGCYAEPGW ARERGWPQFADRLLELAREAEPGSDHQLAYINSLCSSVLSPRHVQTLGALLEGEPAAC GLAGLAVDTDLRWRIVTALATAGAIDADGPETPRIDAEVQRDPTAAGKRHAAQARAAR PQFVVKDEAFTTVVEDDTLANATGRAMIAGIAAPGQGELLKPFARRYFQAIPGVWARR SSEVAQSVVIGLYPHWDISEQGITAAEEFLSDPEVPPALRRLVLEGQAAVQRSLRARN FDADG" misc_feature 2769871..2769900 /gene="pepN" /locus_tag="Rv2467" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(2771644..2772147) /locus_tag="Rv2468c" /db_xref="GeneID:887767" CDS complement(2771644..2772147) /locus_tag="Rv2468c" /function="UNKNOWN" /note="Rv2468c, (MTV008.24c), len: 167 aa. Conserved hypothetical protein, highly similar to Mycobacterium leprae HYPOTHETICAL PROTEINS Q9CC58|ML1255 (163 aa), FASTA scores: opt: 859, E(): 1.6e-49, (81.2% identity in 165 aa overlap) and Q9X7B5|MLCB1610.16 (169 aa), FASTA scores: opt: 859, E(): 1.6e-49, (81.2% identity in 165 aa overlap). Also weak similarity with Q9X8D7|SCE39.14c PUTATIVE GNTR-FAMILY REGULATOR from Streptomyces coelicolor (243 aa), FASTA scores: opt: 116, E(): 1.3, (30.1% identity in 156 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216984.1" /db_xref="GI:15609605" /db_xref="UniProtKB/TrEMBL:O53195" /db_xref="GeneID:887767" /translation="MTHRSSRLEVGPVARGDVATIEHAELPPGWVLTTSGRISGVTEP GELSVHYPFPIADLVALDDALTYSSRACQVRFAIYLGDLGRDTAARAREILGKVPTPD NAVLLAVSPNQCAIEVVYGSQVRGRGAESAAPLGVAAASSAFEQGELVDGLISAIRVL SAGIAPG" gene complement(2772367..2773035) /locus_tag="Rv2469c" /db_xref="GeneID:888591" CDS complement(2772367..2773035) /locus_tag="Rv2469c" /function="UNKNOWN" /note="Rv2469c, (MTV008.25c), len: 222 aa. Conserved hypothetical protein, highly similar to other HYPOTHETICAL PROTEINS e.g. Q9X7B4|MLCB1610.15|ML1254 from Mycobacterium leprae (215 aa), FASTA scores: opt: 1183, E(): 3.3e-70, (77.9% identity in 222 aa overlap); Q9L1Y0|SC8E4A.25c from Streptomyces coelicolor (178 aa), FASTA scores: opt: 589, E(): 1.7e-31, (53.4% identity in 161 aa overlap) (N-terminal region is shorter 50 aa approximatively); Q9RRS6|DR2409 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (186 aa), FASTA scores: opt: 440, E(): 9.6e-22, (42.25% identity in 168 aa overlap) (N-terminal region is shorter 30 aa approximatively); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216985.1" /db_xref="GI:15609606" /db_xref="GOA:O53196" /db_xref="UniProtKB/TrEMBL:O53196" /db_xref="GeneID:888591" /translation="MAHGKKRRGHRSSGVAAGVTGPASCLHSVHSHRLASGVETHPPN RHESASIWNRRRVLLLNSTYEPLTALSMRRAIVMVICGKADVVHEDPSGPVIHSATRS ILVPSVIQLRSYVRVPYRARVPMTRAALMHRDRFCCAYCGGKADTVDHVVPRSRGGAH SWENCVACCSPCNHRKGDRLLTELGWALRRAPLPPTGPHWRLLSAVKELDPSWARYLG EGAA" gene 2773178..2773564 /gene="glbO" /locus_tag="Rv2470" /db_xref="GeneID:887743" CDS 2773178..2773564 /gene="glbO" /locus_tag="Rv2470" /function="OXYGEN CARRIER, INVOLVED IN OXYGEN TRANSPORT." /note="Rv2470, (MTV008.26), len: 128 aa. Possible glbO, globin-like protein, highly similar to Q9CC59|GLBO|ML1253 HEMOGLOBIN-LIKE (OXYGEN CARRIER) from Mycobacterium leprae (128 aa), FASTA scores: opt: 767, E(): 4e-47, (88.1% identity in 126 aa overlap); Q9X7B3|MLCB1610.14c PUTATIVE GLOBIN from Mycobacterium leprae (131 aa); Q9L250|SC6D10.14 PUTATIVE GLOBIN from Streptomyces coelicolor (137 aa), FASTA scores: opt: 466, E(): 5.7e-26, (53.6% identity in 125 aa overlap). Also similar to O31607 YJBI PROTEIN from Bacillus subtilis (132 aa), FASTA scores: opt: 294, E(): 6.6e-14; (39.85% identity in 128 aa overlap). COULD BELONG TO PROTOZOAN/CYANOBACTERIAL GLOBIN FAMILY PROTEIN." /codon_start=1 /transl_table=11 /product="globin GlbO" /protein_id="NP_216986.1" /db_xref="GI:15609607" /db_xref="GeneID:887743" /translation="MPKSFYDAVGGAKTFDAIVSRFYAQVAEDEVLRRVYPEDDLAGA EERLRMFLEQYWGGPRTYSEQRGHPRLRMRHAPFRISLIERDAWLRCMHTAVASIDSE TLDDEHRRELLDYLEMAAHSLVNSPF" gene 2773564..2775204 /gene="aglA" /locus_tag="Rv2471" /db_xref="GeneID:887393" CDS 2773564..2775204 /gene="aglA" /locus_tag="Rv2471" /EC_number="3.2.1.20" /function="INVOLVED IN SUGAR METABOLISM (HYDROLYSIS OF TERMINAL, NON-REDUCING 1,4-LINKED D-GLUCOSE RESIDUES WITH RELEASE OF D-GLUCOSE)." /note="Rv2471, (MTV008.27), len: 546 aa. Probable aglA, maltase (alpha-glucosidase) (EC 3.2.1.20), highly similar or similar to several e.g. Q60027|AGLA from Thermomonospora curvata (544 aa), FASTA scores: opt: 2071, E(): 4e-116, (57.7% identity in 525 aa overlap); Q9KZE3|AGLAE from Streptomyces coelicolor (534 aa), FASTA scores: opt: 1475, E(): 1.5e-80, (50.1% identity in 537 aa overlap); O86874|AGLA from Streptomyces lividans (534 aa), FASTA scores: opt: 1473, E(): 2e-80, (50.1% identity in 537 aa overlap); etc. SEEMS TO BELONG TO FAMILY 13 OF GLYCOSYL HYDROLASES, ALSO KNOWN AS THE ALPHA-AMYLASE FAMILY." /codon_start=1 /transl_table=11 /product="alpha-glucosidase AglA" /protein_id="NP_216987.1" /db_xref="GI:15609608" /db_xref="GeneID:887393" /translation="MDQHQRPDPMGPGSPRASARRPEPDPMGEPWWSRAVFYQVYPRS FADSNGDGVGDLDGLASRLDHLQQLGVDAIWINPVTVSPMADHGYDVADPRDIDPLFG GMPAFERLVAAAHRQGIKVTMDVVPNHTSSAHPWFQAALADLPGSPARDRYFFRDGRG PDGSLPPNNWESVFGGPAWTRVREPDGNPGQWYLHLFDTEQPDLNWDNPEILDDFEKT LRFWLDRGVDGFRIDVAHGMAKPPGLPDSPDLGIEVLHHRDDDPRFNHPNVHAIHRDI RTVIDEYPGAVTVGEVWVHDNARWAEYLRPDELHLGFNFRLARTEFDAAEIRDAVANS LAAAALQNATPTWTLANHDVGREVSRYGGGEIGLRRAKAMAVVMLALPGVVFLYNGQE LGLPDVDLPDEVLQDPTWERSGRTERGRDGCRVPIPWSGNIPPFGFSTCPDTWLPMPP EWAALTAEKQRADAGSTLSFFRLALRLRRERNEFDGDVDWLAAPDDALIFRRHGGGLV CALNAAERPLALPAGEPILASAPLTDATLPPNAAAWLV" gene 2775272..2775565 /locus_tag="Rv2472" /db_xref="GeneID:887255" CDS 2775272..2775565 /locus_tag="Rv2472" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2472, (MTV008.28), len: 97 aa. Conserved hypothetical protein, showing some similarity to O53451|Rv1103c|MTV017.56c from Mycobacterium tuberculosis strain H37Rv (106 aa), FASTA scores: opt: 135, E(): 0.026, (45.85% identity in 72 aa overlap); and AAK45393|MT1135 HYPOTHETICAL 11.4 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (78 aa) FASTA scores: opt: 139, E(): 0.011, (45.35% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216988.1" /db_xref="GI:15609609" /db_xref="GeneID:887255" /translation="MMMRIAVRLPGEVITFVDSEVSQIRIPSRRAAVVLRASNASDAA ILTATEPNHHLDALAGQAAKLAPTSIDAAHPARPARRDPCLYPRTGQALPRTG" gene 2775568..2776284 /locus_tag="Rv2473" /db_xref="GeneID:888163" CDS 2775568..2776284 /locus_tag="Rv2473" /function="UNKNOWN" /note="Rv2473, (MTV008.29), len: 238 aa. Possible pro-,ala-rich membrane protein, with possible transmembrane domain around aa 81-104." /codon_start=1 /transl_table=11 /product="alanine and proline rich membrane protein" /protein_id="NP_216989.1" /db_xref="GI:15609610" /db_xref="GeneID:888163" /translation="MAPTSSSVASELLMPWPSAAASGVVGWRTTATASQRYHRPMSDT PFAEPYPEQRPPWGVPPPGWDGSSRPAPSTTPRSPGRWSLVAALALAVVSLGVGIVGW FHRQPHDKPSPAPSAPTFTSQQISDAKENVCAAHRIVRQAAVLNTNQANPVPGDPTGD LAVAANARLALYSGGDYLLRRLTAEPATPAELRDAVRSLANALQELAVNYLAGAPDSV VTPLRLALERDTRAVDPLCV" gene complement(2776316..2776969) /locus_tag="Rv2474c" /db_xref="GeneID:888606" CDS complement(2776316..2776969) /locus_tag="Rv2474c" /function="UNKNOWN" /note="Rv2474c, (MTV008.30c), len: 217 aa. Hypothetical protein. Shows weak similarity with Q9L246|SC6D10.18c HYPOTHETICAL 24.9 KDA PROTEIN from Streptomyces coelicolor (238 aa), FASTA scores: opt: 111, E(): 5.6, (30% identity in 233 aa overlap), BLASTP scores: Score= 135, E= 3.5e-07, P= 3.5e-07, Identities= 55/182 (30%)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216990.1" /db_xref="GI:15609611" /db_xref="GeneID:888606" /translation="MVERGLWLPDPAHRADLATFVDHALRLDDAAVIRIRARSTGLLS AWVATGFDVLASRVVAGKVRPDDLSVAARSLAHGLATTDASGYVDPGYSMDSAWRGGL PPESGFTYLDDVPARVMLDLAHRGARLAKEHGSSAGPPVSLLDQEVIQVSSADVVVGL PMRCVFALTAMGFLPQSAETISADELIRVRISPAWLRLDARFGSVYRHRGHAALVLR" gene complement(2776975..2777391) /locus_tag="Rv2475c" /db_xref="GeneID:887793" CDS complement(2776975..2777391) /locus_tag="Rv2475c" /function="UNKNOWN" /note="Rv2475c, (MTV008.31c), len: 138 aa. Conserved hypothetical protein, showing similarity with Q9L245|SC6D10.19c HYPOTHETICAL 16.2 KDA PROTEIN from Streptomyces coelicolor (136 aa), FASTA scores: opt: 236, E(): 1.9e-09, (34.1% identity in 126 aa overlap). Also some similarity with AAK44393|Z97050|MTCI28_3 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis cosmid I (151 aa), FASTA scores: opt: 147, E(): 0.00025, (29.2% identity in 120 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216991.1" /db_xref="GI:15609612" /db_xref="GeneID:887793" /translation="MSVGFVTPVGVRWSDIDMYQHVNHATMVTILEEARVPFLKDAFG ADITSTGLLIADVRVTYKGQLRLSDSPLQVTIWTKRLRAVDFTLGYEVRSVNAEPDSR PAVIAESQLAAFHIEEQRLVRLSPHHREYLQRWFRG" gene complement(2777388..2782262) /gene="gdh" /locus_tag="Rv2476c" /db_xref="GeneID:887437" CDS complement(2777388..2782262) /gene="gdh" /locus_tag="Rv2476c" /EC_number="1.4.1.2" /function="CATABOLIC GLUTDH INVOLVED IN THE UTILIZATION OF GLUTAMATE AND OTHER AMINO ACIDS OF THE GLUTAMATE FAMILY. GENERATES 2-OXOGLUTARATE FROM L-GLUTAMATE [CATALYTIC ACTIVITY: L-GLUTAMATE + H(2)O + NAD(+) = 2-OXOGLUTARATE + NH(3) + NADH]." /note="Rv2476c, (MTV008.32c), len: 1624 aa. Probable gdh, glutamate dehydrogenase (EC 1.4.1.2). Highly similar to Q9X7B2|MLCB1610.10|ML1249 HYPOTHETICAL 177.9 KDA PROTEIN from Mycobacterium leprae (1622 aa), FASTA scores: opt: 8630,E(): 0, (81.45% identity in 1634 aa overlap). But highly similar to Q9F0J1|GDH NAD-GLUTAMATE DEHYDROGENASE from Streptomyces clavuligerus (1651 aa), FASTA scores: opt: 3833, E(): 0, (45.8% identity in 1600 aa overlap); (see Minambres et al., 2000). Also similar with others e.g. AAG53963|PA3068|GDHB HYPOTHETICAL (NAD(+)-DEPENDENT GLUTAMATE DEHYDROGENASE from Pseudomonas aeruginosa (1620 aa), FASTA scores: opt: 2214, E(): 1e-124, (40.1% identity in 1561 aa overlap) (see Lu & Abdelal 2001); and Q9Y8G5|GDHB NAD-SPECIFIC GLUTAMATE DEHYDROGENASE from Agaricus bisporus (1029 aa), FASTA scores: opt: 194, E(): 0.00099, (22.7% identity in 647 aa overlap) (see Kersten et al., 1999); etc. Contains possible Helix-turn-helix motif at aa 1568 to 1589 (score 1098, +2.93 SD)." /codon_start=1 /transl_table=11 /product="NAD-dependent glutamate dehydrogenase" /protein_id="NP_216992.1" /db_xref="GI:15609613" /db_xref="GeneID:887437" /translation="MTIDPGAKQDVEAWTTFTASADIPDWISKAYIDSYRGPRDDSSE ATKAAEASWLPASLLTPAMLGAHYRLGRHRAAGESCVAVYRADDPAGFGPALQVVAEH GGMLMDSVTVLLHRLGIAYAAILTPVFDVHRSPTGELLRIEPKAEGTSPHLGEAWMHV ALSPAVDHKGLAEVERLLPKVLADVQRVATDATALIATLSELAGEVESNAGGRFSAPD RQDVGELLRWLGDGNFLLLGYQRCRVADGMVYGEGSSGMGVLRGRTGSRPRLTDDDKL LVLAQARVGSYLRYGAYPYAIAVREYVDGSVVEHRFVGLFSVAAMNADVLEIPTISRR VREALAMAESDPSHPGQLLLDVIQTVPRPELFTLSAQRLLTMARAVVDLGSQRQALLF LRADRLQYFVSCLVYMPRDRYTTAVRMQFEDILVREFGGTRLEFTARVSESPWALMHF MVRLPEVGVAGEGAAAPPVDVSEANRIRIQGLLTEAARTWADRLIGAAAAAGSVGQAD AMHYAAAFSEAYKQAVTPADAIGDIAVITELTDDSVKLVFSERDEQGVAQLTWFLGGR TASLSQLLPMLQSMGVVVLEERPFSVTRPDGLPVWIYQFKISPHPTIPLAPTVAERAA TAHRFAEAVTAIWHGRVEIDRFNELVMRAGLTWQQVVLLRAYAKYLRQAGFPYSQSYI ESVLNEHPATVRSLVDLFEALFVPVPSGSASNRDAQAAAAAVAADIDALVSLDTDRIL RAFASLVQATLRTNYFVTRQGSARCRDVLALKLNAQLIDELPLPRPRYEIFVYSPRVE GVHLRFGPVARGGLRWSDRRDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVVKRPPLP TGDPAADRDATRAEGVACYQLFISGLLDVTDNVDHATASVNPPPEVVRRDGDDAYLVV AADKGTATFSDIANDVAKSYGFWLGDAFASGGSVGYDHKAMGITARGAWEAVKRHFRE IGIDTQTQDFTVVGIGDMSGDVFGNGMLLSKHIRLIAAFDHRHIFLDPNPDAAVSWAE RRRMFELPRSSWSDYDRSLISEGGGVYSREQKAIPLSAQVRAVLGIDGSVDGGAAEMA PPNLIRAILRAPVDLLFNGGIGTYIKAESESDADVGDRANDPVRVNANQVRAKVIGEG GNLGVTALGRVEFDLSGGRINTDALDNSAGVDCSDHEVNIKILIDSLVSAGTVKADER TQLLESMTDEVAQLVLADNEDQNDLMGTSRANAASLLPVHAMQIKYLVAERGVNRELE ALPSEKEIARRSEAGIGLTSPELATLMAHVKLGLKEEVLATELPDQDVFASRLPRYFP TALRERFTPEIRSHQLRREIVTTMLINDLVDTAGITYAFRIAEDVGVTPIDAVRTYVA TDAIFGVGHIWRRIRAANLPIALSDRLTLDTRRLIDRAGRWLLNYRPQPLAVGAEINR FAAMVKALTPRMSEWLRGDDKAIVEKTAAEFASQGVPEDLAYRVSTGLYRYSLLDIID IADIADIDAAEVADTYFALMDRLGTDGLLTAVSQLPRHDRWHSLARLAIRDDIYGALR SLCFDVLAVGEPGESSEQKIAEWEHLSASRVARARRTLDDIRASGQKDLATLSVAARQ IRRMTRTSGRGISG" gene complement(2782366..2784042) /locus_tag="Rv2477c" /db_xref="GeneID:887757" CDS complement(2782366..2784042) /locus_tag="Rv2477c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF MACROLIDE ACROSS THE MEMBRANE (EXPORT): MACROLIDE ANTIBIOTICS RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="ChvD; in Agrobacterium tumefaciens, mutations in both Walker boxes were found to affect virulence" /codon_start=1 /transl_table=11 /product="putative ABC transporter ATP-binding protein" /protein_id="NP_216993.1" /db_xref="GI:15609614" /db_xref="GeneID:887757" /translation="MAEFIYTMKKVRKAHGDKVILDDVTLSFYPGAKIGVVGPNGAGK SSVLRIMAGLDKPNNGDAFLATGATVGILQQEPPLNEDKTVRGNVEEGMGDIKIKLDR FNEVAELMATDYTDELMEEMGRLQEELDHADAWDLDAQLEQAMDALRCPPADEPVTNL SGGERRRVALCKLLLSKPDLLLLDEPTNHLDAESVQWLEQHLASYPGAILAVTHDRYF LDNVAEWILELDRGRAYPYEGNYSTYLEKKAERLAVQGRKDAKLQKRLTEELAWVRSG AKARQAKSKARLQRYEEMAAEAEKTRKLDFEEIQIPVGPRLGNVVVEVDHLDKGYDGR ALIKDLSFSLPRNGIVGVIGPNGVGKTTLFKTIVGLETPDSGSVKVGETVKLSYVDQA RAGIDPRKTVWEVVSDGLDYIQVGQTEVPSRAYVSAFGFKGPDQQKPAGVLSGGERNR LNLALTLKQGGNLILLDEPTNDLDVETLGSLENALLNFPGCAVVISHDRWFLDRTCTH ILAWEGDDDNEAKWFWFEGNFGAYEENKVERLGVDAARPHRVTHRKLTRG" misc_feature complement(2782672..2782716) /locus_tag="Rv2477c" /note="PS00211 ABC transporters family signature" misc_feature complement(2782960..2782983) /locus_tag="Rv2477c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(2783521..2783565) /locus_tag="Rv2477c" /note="PS00211 ABC transporters family signature" misc_feature complement(2783908..2783931) /locus_tag="Rv2477c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(2784123..2784608) /locus_tag="Rv2478c" /db_xref="GeneID:887304" CDS complement(2784123..2784608) /locus_tag="Rv2478c" /function="UNKNOWN" /note="Rv2478c, (MTV008.34c), len: 161 aa. Conserved hypothetical protein, with weak similarity with many single-strand binding proteins e.g. Q9X8U3|SCH24.29 PUTATIVE SINGLE-STRAND BINDING PROTEIN from Streptomyces coelicolor (199 aa), FASTA scores: opt: 246, E(): 4.5e-08, (31.5% identity in 162 aa overlap); P46390|SSB_MYCLE|ML2684|MLCB1913.20c SINGLE-STRAND BINDING PROTEIN (SSB) (HELIX-DESTABILIZING PROTEIN) from Mycobacterium leprae (168 aa), FASTA scores: opt: 239, E(): 1e-07, (30.8% identity in 146 aa overlap); P18310|SSBF_ECOLI SINGLE-STRAND BINDING PROTEIN from Escherichia coli (178 aa), FASTA scores: opt: 116, E(): 2.9, (25.7% identity in 140 aa overlap); etc. Also similarity with Rv0054|P71711|MTCY21D4.17|SSB_MYCTU PROBABLE SINGLE-STRAND BINDING PROTEIN from M. tuberculosis (164 aa), FASTA scores: opt: 234, E(): 2e-07, (31.75% identity in 148 aa overlap). N-terminus shorter 8 aa from AAK46855|MT2553 SINGLE-STRAND DNA BINDING PROTEIN from Mycobacterium tuberculosis strain CDC1551." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216994.1" /db_xref="GI:15609615" /db_xref="GeneID:887304" /translation="MVGHIVNDLQRRKVGDQEVVKFRVASNSRRRTSDGGWEPGNSLF ITVNCWGRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGIRRSSLEMRATSVGPDLSR VIVRIEKPAYTGPSAGDLPAATGTGAAGAADAPASAADSVSDVVVDDAITGHNPLPIS A" repeat_region complement(2784614..2785970) /note="IS6110-9, len: 1357 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-9" repeat_region 2784614..2784642 /note="29 bp Inverted repeat at the left end of IS6110, GTGAACCGCCCCGGTGAGTCCGGAGACTC" gene complement(2784657..2785697) /locus_tag="Rv2479c" /db_xref="GeneID:887201" CDS complement(2784657..>2785697) /locus_tag="Rv2479c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2479c, (MTV008.35c), len: 346 aa. Probable transposase for IS6110, identical to many, probably translated by frame shifting from the upstream ORF." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216995.1" /db_xref="GI:15609616" /db_xref="GeneID:887201" /translation="AEALAAGQRRIAKGERDFKDRVGFLRGRARPASTLITRFIADHQ GHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIADPATARP ADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVASTMATSMV LDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYD NALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA AYYAQRQRPAAG" gene complement(2785592..2785918) /locus_tag="Rv2480c" /db_xref="GeneID:887328" CDS complement(2785592..2785918) /locus_tag="Rv2480c" /function="THOUGHT TO BE REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2480c, (MTV008.36c), len: 108 aa. Possible transposase for IS6110, identical to many." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_216996.1" /db_xref="GI:15609617" /db_xref="GeneID:887328" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region complement(2785942..2785970) /note="29 bp Inverted repeat at the right end of IS6110, GTGAACCGCCCCGGCATGTCCGGAGACTC" gene complement(2786575..2786898) /locus_tag="Rv2481c" /db_xref="GeneID:887462" CDS complement(2786575..2786898) /locus_tag="Rv2481c" /function="UNKNOWN" /note="Rv2481c, (MTV008.37c), len: 107 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_216997.1" /db_xref="GI:15609618" /db_xref="GeneID:887462" /translation="MALRRRHEPDGWPFSQRSEKPNAVRHAVRCSAVSAAASTANGTP VNWVSGRVTRAMGVHRQTRGGVASVHADSLRGAVLVHGQLRNSIPISANVPASGANTK SSIAH" gene complement(2786914..2789283) /gene="plsB2" /locus_tag="Rv2482c" /db_xref="GeneID:887848" CDS complement(2786914..2789283) /gene="plsB2" /locus_tag="Rv2482c" /EC_number="2.3.1.15" /function="INVOLVED IN PHOSPHOLIPID BIOSYNTHESIS (AT THE FIRST STEP). MAY ALSO FUNCTION IN THE REGULATION OF MEMBRANE BIOGENESIS [CATALYTIC ACTIVITY: ACYL-CoA + SN-GLYCEROL 3-PHOSPHATE = CoA + 1-ACYL-SN-GLYCEROL 3-PHOSPHATE]." /note="PlsB; catalyzes the formation of 1-acyl-sn-glycerol 3-phosphate by transfering the acyl moiety from acyl-CoA" /codon_start=1 /transl_table=11 /product="glycerol-3-phosphate acyltransferase" /protein_id="NP_216998.1" /db_xref="GI:15609619" /db_xref="GeneID:887848" /translation="MTKPAADASAVLTAEDTLVLASTATPVEMELIMGWLGQQRARHP DSKFDILKLPPRNAPPAALTALVEQLEPGFASSPQSGEDRSIVPVRVIWLPPADRSRA GKVAALLPGRDPYHPSQRQQRRILRTDPRRARVVAGESAKVSELRQQWRDTTVAEHKR DFAQFVSRRALLALARAEYRILGPQYKSPRLVKPEMLASARFRAGLDRIPGATVEDAG KMLDELSTGWSQVSVDLVSVLGRLASRGFDPEFDYDEYQVAAMRAALEAHPAVLLFSH RSYIDGVVVPVAMQDNRLPPVHMFGGINLSFGLMGPLMRRSGMIFIRRNIGNDPLYKY VLKEYVGYVVEKRFNLSWSIEGTRSRTGKMLPPKLGLMSYVADAYLDGRSDDILLQGV SICFDQLHEITEYAAYARGAEKTPEGLRWLYNFIKAQGERNFGKIYVRFPEAVSMRQY LGAPHGELTQDPAAKRLALQKMSFEVAWRILQATPVTATGLVSALLLTTRGTALTLDQ LHHTLQDSLDYLERKQSPVSTSALRLRSREGVRAAADALSNGHPVTRVDSGREPVWYI APDDEHAAAFYRNSVIHAFLETSIVELALAHAKHAEGDRVAAFWAQAMRLRDLLKFDF YFADSTAFRANIAQEMAWHQDWEDHLGVGGNEIDAMLYAKRPLMSDAMLRVFFEAYEI VADVLRDAPPDIGPEELTELALGLGRQFVAQGRVRSSEPVSTLLFATARQVAVDQELI APAADLAERRVAFRRELRNILRDFDYVEQIARNQFVACEFKARQGRDRI" gene complement(2789280..2791022) /gene="plsC" /locus_tag="Rv2483c" /db_xref="GeneID:887744" CDS complement(2789280..2791022) /gene="plsC" /locus_tag="Rv2483c" /EC_number="3.1.3.3" /EC_number="2.3.1.51" /function="C-TERMINUS: INVOLVED IN PHOSPHOLIPID BIOSYNTHESIS (AT THE SECOND STEP); CONVERTS LYSOPHOSPHATIDIC ACID (LPA) INTO PHOSPHATIDIC ACID BY INCORPORATING ACYL MOIETY AT THE 2 POSITION [CATALYTIC ACTIVITY 2: ACYL-CoA + 1-ACYL-SN-GLYCEROL 3-PHOSPHATE = CoA + 1,2-DIACYL-SN-GLYCEROL 3-PHOSPHATE]. N-TERMINUS: COULD BE GENERATE SERINE AND PHOSPHATE FROM PHOSPHOSERINE; MAY CATALYZE THE LAST STEP IN THE BIOSYNTHESIS OF SERINE FROM CARBOHYDRATES (THE REACTION MECHANISM COULD BE PROCEED VIA THE FORMATION OF A PHOSPHORYL-ENZYME INTERMEDIATES) [CATALYTIC ACTIVITY 1: PHOSPHOSERINE + H(2)O = SERINE + PHOSPHATE]." /note="Rv2483c, (MTV008.39c), len: 580 aa. Possible plsC, a transmembrane phospholipid biosynthesis bifunctionnal enzyme, including L-3-phosphoserine phosphatase (EC 3.1.3.3) and 1-acyl-Sn-glycerol-3-phosphate acyltransferase (EC 2.3.1.51), equivalent to Q9X7A9|PLSC|ML1245 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (579 aa), FASTA scores: opt: 2835, E(): 9.2e-153, (77.15% identity in 573 aa overlap). C-terminal end is similar to many 1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASES (LYSOPHOSPHATIDIC ACIDACYLTRANSFERASES) e.g. Q9SDQ2 from Limnanthes floccosa (281 aa), FASTA scores: opt: 378, E(): 3.1e-14, (30.0% identity in 230 aa overlap) and Q42868|PLSC_LIMAL from Limnanthes alba (White meadowfoam) (281 aa), FASTA scores: opt: 374, E(): 5.2e-14, (30.55% identity in 221 aa overlap); and the N-terminal end is similar to many SERB FAMILY PROTEINS e.g. AAK44749|MT0526 from Mycobacterium tuberculosis strain CDC1551 (308 aa), FASTA scores: opt: 356, E(): 5.8e-13, (32.5% identity in 298 aa overlap) and Q49823|ML2424 from Mycobacterium leprae (300 aa), FASTA scores: opt: 346, E(): 2.1e-12, (32.0% identity in 278 aa overlap). So belongs to the 1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE FAMILY and may belong to the SERB FAMILY." /codon_start=1 /transl_table=11 /product="bifunctionnal putative L-3-phosphoserine phosphatase/1-acyl-SN-glycerol-3-phosphate acyltransferase" /protein_id="NP_216999.1" /db_xref="GI:15609620" /db_xref="GeneID:887744" /translation="MSAADEQGEERATRKSAPDLRLPGSVAEILASPAGPKVGAFFDL DGTLVAGFTAVILTQERLRRRDMGVGELLGMVQAGLNHTLGRIEFEDLIGKAAAALAG RLLTDLEEIGERLFAQRIESRIYPEMRELVRAHVARGHTVVLSSSALTIQVGPVARFL GINNMLTNKFETNEDGILTGGVLKPILWCPGKATAVQRFAAEHDIDLKDSYFYADGDE DVALMYLVGNPRPTNPEGKMAAVAKRRGWPILKFNSRGGVGIRRQLRTLAGLSTIVPV AAGAVGIGVLTGSRRRGVNFFTSTFSQLLLATSGVHLNVIGKENLTAQRPAVFIFNHR NQVDPVIAGALVRDNWVGVGKKELASDPIMGTLGKLLDGVFIDRDDPVAAVETLHTVE ERARNGLSIVIAPEGTRLDTTEVGSFKKGPFRIAMAAKIPIVPIVIRNAEIVASRNST TINPGTVDVAVFPPIPVDDWTLDALPDRIAEVRQLYLDTLADWPVDGLPAVDLYAEQK AARKARAQVAKATAKRVPAKKAPAKSAANKGAAATKAATKKASPKAKPSESKIAGKDG EASASPSSSAKGRS" gene complement(2791019..2792494) /locus_tag="Rv2484c" /db_xref="GeneID:888623" CDS complement(2791019..2792494) /locus_tag="Rv2484c" /function="UNKNOWN" /note="Rv2484c, (MTV008.40c), len: 491 aa. Conserved hypothetical protein, highly similar or similar to many Mycobacterial hypothetical proteins e.g. Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 2459, E(): 3e-138, (75.15% identity in 483 aa overlap); O53304|YU87_MYCTU|Rv3087|MTV013.08 from Mycobacterium tuberculosis (472 aa), FASTA scores: opt: 527, E(): 8.1e-24, (29.1% identity in 485 aa overlap); O53305|YU88_MYCTU|Rv3088|MT3173|MTV013.09 from Mycobacterium tuberculosis (474 aa), FASTA scores: opt: 370, E(): 1.6e-14, (26.05% identity in 422 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217000.1" /db_xref="GI:15609621" /db_xref="GeneID:888623" /translation="MAESGESPRLSDELGPVDYLMHRGEANPRTRSGIMALELLDGTP DWDRFRTRFENASRRVLRLRQKVVVPTLPTAAPRWVVDPDFNLDFHVRRVRVSGPATL REVLDLAEVILQSPLDISRPLWTATLVEGMADGRAAMLLHVSHAVTDGVGGVEMFAQI YDLERDPPPRSTPPQPIPEDLSPNDLMRRGINHLPIAVVGGVLDALSGAVSMAGRAVL EPVSTVSGILGYARSGIRVLNRAAEPSPLLRRRSLTTRTEAIDIRLADLHKAAKAGGG SINDAYLAGLCGALRRYHEALGVPISTLPMAVPVNLRAEGDAAGGNQFTGVNLAAPVG TIDPVARMKKIRAQMTQRRDEPAMNIIGSIAPVLSVLPTAVLEGITGSVIGSDVQASN VPVYPGDTYLAGAKILRQYGIGPLPGVAMMVVLISRGGWCTVTVRYDRASVRNDELFA QCLQAGFDEILALAGGPAPRVLPASFDTQGAGSVPRSVSGS" gene complement(2792723..2793988) /gene="lipQ" /locus_tag="Rv2485c" /db_xref="GeneID:887876" CDS complement(2792723..2793988) /gene="lipQ" /locus_tag="Rv2485c" /EC_number="3.1.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /note="Rv2485c, (MTV008.41c), len: 421 aa. Probable lipQ, carboxylesterase protein (lipase) (EC 3.1.-.-). Similar (greater at the C-terminal end) to AAK46626|MT2342 PUTATIVE CARBOXYLESTERASE from Mycobacterium tuberculosis strain CDC1551 (431 aa), FASTA scores: opt: 1134, E(): 4.3e-60, (46.25% identity in 428 aa overlap); and Q50681|Rv2284|MTCY339.26c HYPOTHETICAL PROTEIN from M. tuberculosis strain H37Rv (431 aa), FASTA scores: opt: 1134, E(): 4.3e-60, (46.25% identity in 428 aa overlap). Also similar in part to other putative lipases/esterases e.g. AAK44451|MT0230 from Mycobacterium tuberculosis strain CDC1551 (403 aa), FASTA scores: opt: 763, E(): 4.6e-38, (37.95% identity in 390 aa overlap); Q9RY19|DR0133 from Deinococcus radiodurans (296 aa), FASTA scores: opt: 392, E(): 4e-16, (33.7% identity in 276 aa overlap); Q9Z545|SC9B2.14 from Streptomyces coelicolor (502 aa) FASTA scores: opt: 279, E(): 3.2e-09, (31.15% identity in 292 aa overlap); etc." /codon_start=1 /transl_table=11 /product="carboxylesterase LipQ" /protein_id="NP_217001.1" /db_xref="GI:15609622" /db_xref="GeneID:887876" /translation="MHIASVTSRCSRAGAEALRQGAQLAADARDTCRAGALLLRGSPC AIGWVAGWLSAEFPARVVTGHALSRISPRSIGRFGTSWAAQRADQILHAALVDAFGPD FRDLVWHPTGEQSEAARRSGLLNLPHIPGPHRRYAAQTSDIPYGPGGRENLLDIWRRP DLAPGRRAPVLIQVPGGAWTINGKRPQAYPLMSRMVELGWICVSINYSKSPRCTWPAH IVDVKRAIAWVRENIADYGGDPDFITITGGSAGAHLAALAALSANDPALQPGFESADT AVQAAAPYYGVYDLTNAENMHEMMMPFLEHFVMRSRYVDNPGLFKAASPISYVHSEAP PFFVLHGEKDPMVPSAQSRAFSAALRDAGAATVSYAELPNAHHAFDLAATVRSRMVAE AVSDFLGVIYGRRMGARKGSLALSSPPAS" gene 2794176..2794249 /locus_tag="Rvnt28" /note="tRNA-Arg(TCT)" /db_xref="GeneID:2700423" tRNA 2794176..2794249 /locus_tag="Rvnt28" /product="tRNA-Arg" /note="codon recognized: AGA" /anticodon=(pos:2794210..2794212,aa:Arg) /db_xref="GeneID:2700423" gene 2794350..2795120 /gene="echA14" /locus_tag="Rv2486" /db_xref="GeneID:887894" CDS 2794350..2795120 /gene="echA14" /locus_tag="Rv2486" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_217002.1" /db_xref="GI:15609623" /db_xref="GeneID:887894" /translation="MAQYDPVLLSVDKHVALITVNDPDRRNAVTDEMSAQLRAAIQRA EGDPDVHAVVVTGAGKAFCAGADLSALGAGVGDPAEPRLLRLYDGFMAVSSCNLPTIA AVNGAAVGAGLNLALAADVRIAGPAALFDARFQKLGLHPGGGATWMLQRAVGPQVARA ALLFGMCFDAESAVRHGLALMVADDPVTAALELAAGPAAAPREVVLASKATMRATASP GSLDLEQHELAKRLELGPQAKSVQSPEFAARLAAAQHR" misc_feature 2794650..2794712 /gene="echA14" /locus_tag="Rv2486" /note="PS00166 Enoyl-CoA hydratase/isomerase signature" gene complement(2795301..2797385) /gene="PE_PGRS42" /locus_tag="Rv2487c" /db_xref="GeneID:887909" CDS complement(2795301..2797385) /gene="PE_PGRS42" /locus_tag="Rv2487c" /function="UNKNOWN" /note="Rv2487c, (MTV008.43c), len: 694 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of Gly-rich proteins (see citation below), similar to many e.g. AAK47245|MT2919 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1515 (663 aa), FASTA scores: opt: 2317, E(): 2.3e-84, (58.35% identity in 622 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177886.1" /db_xref="GI:57116995" /db_xref="GeneID:887909" /translation="MSLVIATPQLLATAALDLASIGSQVSAANAAAAMPTTEVVAAAA DEVSAAIAGLFGAHARQYQALSVQVAAFHEQFVQALTAAAGRYASTEAAVERSLLGAV NAPTEALLGRPLIGNGADGTAPGQPGAAGGLLFGNGGNGAAGGFGQTGGSGGAAGLIG NGGNGGAGGTGAAGGAGGNGGWLWGNGGNGGVGGTSVAAGIGGAGGNGGNAGLFGHGG AGGTGGAGLAGANGVNPTPGPAASTGDSPADVSGIGDQTGGDGGTGGHGTAGTPTGGT GGDGATATAGSGKATGGAGGDGGTAAAGGGGGNGGDGGVAQGDIASAFGGDGGNGSDG VAAGSGGGSGGAGGGAFVHIATATSTGGSGGFGGNGAASAASGADGGAGGAGGNGGAG GLLFGDGGNGGAGGAGGIGGDGATGGPGGSGGNAGIARFDSPDPEAEPDVVGGKGGDG GKGGSGLGVGGAGGTGGAGGNGGAGGLLFGNGGNGGNAGAGGDGGAGVAGGVGGNGGG GGTATFHEDPVAGVWAVGGVGGDGGSGGSSLGVGGVGGAGGVGGKGGASGMLIGNGGN GGSGGVGGAGGVGGAGGDGGNGGSGGNASTFGDENSIGGAGGTGGNGGNGANGGNGGA GGIAGGAGGSGGFLSGAAGVSGADGIGGAGGAGGAGGAGGSGGEAGAGGLTNGPGSPG VSGTEGMAGAPG" gene complement(2797467..2800880) /locus_tag="Rv2488c" /db_xref="GeneID:887997" CDS complement(2797467..2800880) /locus_tag="Rv2488c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2488c, (MTV008.44c), len: 1137 aa. Probable transcriptional regulatory protein, belonging to luxR family, similar to many in Mycobacterium tuberculosis e.g. AAK44621|MT0399 from strain CDC1551 (1092 aa) FASTA scores: opt: 3767, E(): 1.8e-211, (56.75% identity in 1093 aa overlap); O53720|Rv0386|MTV036.21 from strain H37Rv (1085 aa), FASTA scores: opt: 3756, E(): 7.6e-211, (56.75% identity in 1089 aa overlap); AAK45665|MT1402 from strain CDC1551 (1159 aa), FASTA scores: opt: 3395, E(): 8.2e-190, (52.0% identity in 1093 aa overlap); etc. Also similar to transcriptional regulatory proteins luxR-family from other organisms e.g. Q9CBP3|ML1753 from Mycobacterium leprae (1106 aa), FASTA scores: opt: 2823, E(): 1.5e-156, (50.35% identity in 1116 aa overlap); Q9KYF4|SCD72A.02 from Streptomyces coelicolor (1114 aa), FASTA scores: opt: 915, E(): 1.7e-45, (30.7% identity in 1143 aa overlap); etc. Some similarity with Q9KXP6|SC9C5.28 HYPOTHETICAL 81.8 KDA PROTEIN from Streptomyces coelicolor (750 aa), FASTA scores: opt: 1085, E(): 1.6e-55, (35.45% identity in 722 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00622 Bacterial regulatory proteins, luxR family signature, probable coiled-coil from aa 585 to 616 and probable helix-turn-helix motif at aa 1086 to 1107 (score 1206, +3.29 SD). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="LuxR family transcriptional regulator" /protein_id="NP_217004.1" /db_xref="GI:15609625" /db_xref="GeneID:887997" /translation="MDRRPRDFEQSRRRCRCNALRAGSMLASMSKIHPGVDVVPVDWS ADGVSELVPTGTVTLLLADIEGATHLPGSQLDTTAIAKLDRTLTELVREHRGVCPVEQ GEGDSFLVAFARASDAVACALGLQRAPLAPIRLRIGMHTGEVSSPDEGNCVGPTIDRT ARLRELAHGGQTVLSGTTSDLVADLLPKDAWLNDLGTYRLDDLPRPERVVQLCHPDLH NAFPPLRTRKVVGAHCLPAQLTRLVGRVDEVAQVRGLLDVKRWVTLTGVGGVGKTRLA TQVASAVADGYPDGVWYVNLAPITDPALVPIAAARVLGLPDQPGRSTVDTIVRRIGDR RMLVVLDNCEHLLDGCAALIVALLGACPALRVLATSREPIAVAGEQIWRVPPLGHGEA IELFTDRAREARPELEITADNLALVTEICHRLDGIPLAIELAASRVRALALTEIVDSL HDRFRLLTGGSRIAVRRQQTMRASVDWSHALLTGPEQVLFRRLAVFPSGFDLDGAQAA AAGGDVQRYEVVDLLSLLADKSLVVTDDSDGRTRYRLLETVRQYALEKLRESGDADAV RARHRDHYAAVAAGLDAPSVAGHERRLNQAELEIDNLRAAFAFSRENGDTGHALLLAS CLQPLWRARGRLQEGLAWFAAALADHDAHPAGADPGLYARALADRALIDAVAGITDRL DDAQKALAIARDIEDPALLARALTACGGVAAYNADLARPWLAEAVGLARAVGDKWRLA EVLAWQAYVGFAGEGDPGATRAAGEEARSLADEIGDAFLSRSCRWALAAANLWQGNLE AAVGLSREVIGESDAAHDMVSSCAGQACLAHALAHRGDTEAAAAAQASIDTAVGLSPV LSGSACSALVFATLAAGDVAAAEHARESATRFFGASAAAIINDPTSSAQISCARGDLN AAHRLADGAASITRGVHRARALTTRCRIEIAQGDRHRAERDAHDALGVAASIGAYLWV PDILECLASVMADAGSNREAVRLFGAADAARGRMGAVRFGIYQAGCNSSLATLRKSMG DSEFDDAWAEGTALSIDEAIAYAQRGRGARKRPTSGWGALTPTELEVALLVGEGLSNK EIGVRLFISPRTVHSHLTHVYTKLGLSSRLQLAQQAARRGESERGPSRP" misc_feature complement(2797548..2797631) /locus_tag="Rv2488c" /note="PS00622 Bacterial regulatory proteins, luxR family signature" misc_feature complement(2800062..2800085) /locus_tag="Rv2488c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" repeat_region 2800671..2800918 /note="248 bp direct repeat 2" gene complement(2800846..2801145) /locus_tag="Rv2489c" /db_xref="GeneID:888937" CDS complement(2800846..2801145) /locus_tag="Rv2489c" /function="UNKNOWN" /note="Rv2489c, (MTV008.45c), len: 99 aa. Hypothetical unknown ala-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217005.1" /db_xref="GI:15609626" /db_xref="GeneID:888937" /translation="MGVTAKAAEAAAPSSSFPSLRKPHRAGDSADRSAGDFDGTAHDA VVSVLAGDAASTGGLTIASGQHGHCRSAAMARRSPNASTKARRTHGPAAKRFRAI" gene complement(2801254..2806236) /gene="PE_PGRS43" /locus_tag="Rv2490c" /db_xref="GeneID:887941" CDS complement(2801254..2806236) /gene="PE_PGRS43" /locus_tag="Rv2490c" /function="UNKNOWN" /note="Rv2490c, (MTV008.46c), len: 1660 aa. Member of the Mycobacterium tuberculosis PE family, PGRS-subfamily of Gly-rich proteins (see citation below), similar to many e.g. AAK47971|MT3612.1 PE_PGRS family protein from Mycobacterium tuberculosis strain CDC1551 (1715 aa), FASTA scores: opt: 5161, E(): 1.5e-187, (51.7% identity in 1752 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177887.1" /db_xref="GI:57116996" /db_xref="GeneID:887941" /translation="MSYVIATPEMMATAAFDLARIGSQVSAASAVAAMPTTEVVAAGA DEVSAGIAALFSAHAQEYQALSAQAAAFHDQFVHTLTAAARWYTATEIANAAAMRVVL GAVNAPTQTLLGRPLIGDGAHGTAPGQPGGAGGLLFGNGGNGAAGAVGQVGGAGGAAG LFGIGGAGGAGGAGAPGGTGGTGGWLAGGGGVGGMGGAGGGAGGAGGNAGLFGNGGAG GAGGAGGGAGGAGGNAGWFGHGGAGGVGGVGAAGANGATPGQDGAAGVAGSDDGAGGD GLAGSDGGDGGAGGVGGNGGRGGWLLGNGGAGGVGGVGGAGGAGAAGGAGGAGATGIN GPAGISAAGGDGGAGGNGGAGGNGGVGGAGGAGGSAGLLGYVGRAGDGGAGGGGGLGG APGDGGAGGNGGSWLAAGDGGAGGHGGDPGLGGAGGAGGASGGAGARAGANGLAAGND GPVSGGNGGKGGNGAHAPVAGGHGGNGGAGGNGGLVGDGGAGGHGGDGAAGAGYADMT AIFLGSSGTPGEDGGNGGAGGAGGAGGAHAGDGGAGGAGGNGGAGGAGGNGAHGFNAV LVSDGGNGGDGGAGGRGGDGGAGGAGGDAPAGRAGSQGVGGDGGAGGAGGAPGNGGSG GRGDMAFKDGDGGAGGDGGDPGAGGKGGAGGAGATEGVTGATGATVHSGGNGGKGGNG ADATVAGANGGKGGAGGNGGLVGDGGAGGDGGSGAAGANGANVGEDGADGTLSGQPGE GSEANGGQGGVGGGGAGGAGGDGGAGSSALGSGGNGGRGDAGQAGGAGGAGGAGGAGG SVSGDGGPGGKGGAGGAGGAGASGGGGGKGASGADSAEAVGGAGGKGGDGGVGGVGGD GGPGGDGGAGGAAPAGQVGSHGVGGVGGDGGLGGAGGNGGDGGHGSDGGDGGDGGDPG AGGLGGLGGDSGNGTRAASGVDASDHGPGSGGNGGNGGNGAQASVAGGAGGNGGDGGN AGRVGDGGAGGNGGDGAAGANGANSGAPGSDALALGQPGGNGGQGDAGQAGGAGGAGG AGGAGGSVSGDGGAGGNGGAGGNGGVGASGGAGARGANGIDSIGGTGGAGGGGGDGGA GGVGGHGGDGGVGGAAPSGTVGSHGTGGVGGDGGLGGAGGVGGAGGNGGIGITVGGAG GAGGNGGDPGAGGRGGLGGDSGNGTSAANGVDASKHGPLTGGDGGVGGNGAKAAAAGG DGGQGGDGGNAGLFGDGGAGGDGADGTAAEALGGDGGAGGAGGKGGDAGDIGDGGDGG KGGDGAHGALGGLTVAGGNGGAGGAGGAGGAGGAFLGDGGNGGAGGQGGAGRGGSPGG GGGVGGHGGAGGDAGMNGGGGTGGQGGNGAAGGAGWSPDSDLKGFDGFDGGSGGAGGD GGAGGAGGTQTGDGGDGGAGGLGGAGGVGGNGVDGFDINETTGRDGGDGGDGGYGGWG GAGGNGGAGGSAPAGEVGNRGVGGDGGDGGSGGDAGNGGLGGDGFTYLADFDGEPGGD GGDGGDGGWGRPGGQGGFGSTSGAHGKAGFGAPGGDGGDGGNGGHGGDGNGSFADAGD GGPGGNGGNGGLGGAGRDGGAPGGDGGDGGTGGSGGFGAPPPRSIGGGDGGDGGRGGD GGRGAGGLTSGGVGSSGESGGSGNGRGDPGSGGSGGEGGEGGPSISVNVT" repeat_region 2806368..2806625 /note="258 bp direct repeat 2" gene 2806665..2807288 /locus_tag="Rv2491" /db_xref="GeneID:887780" CDS 2806665..2807288 /locus_tag="Rv2491" /function="UNKNOWN" /note="Rv2491, (MTV008.47), len: 207 aa. Conserved hypothetical protein, similar in part to other hypothetical proteins e.g. O29139|AF1126 from Archaeoglobus fulgidus (151 aa), FASTA scores: opt: 293, E(): 2.8e-11, (42.85% identity in 126 aa overlap); O66531|AQ_134 from Aquifex aeolicus (151 aa), FASTA scores: opt: 261, E(): 2.6e-09, (37.75% identity in 106 aa overlap); Q9HKU3|TA0501 from Thermoplasma acidophilum (161 aa), FASTA scores: opt: 260, E(): 3.2e-09, (35.9% identity in 117 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217007.1" /db_xref="GI:15609628" /db_xref="GeneID:887780" /translation="MVDTSAPASRLDTDPRRAHVSLSKHPYQIGVFGSGTIGPRVYEL AYQVGAEIAKQGHILISGGMTGTMEASSRGASDADGLVVGVLPGDKFTDGNAYSTIKI LSGMQFARNYITGLSCHGAIVVGGSSGAYEEARRVWEGRGPVVVLANSGSPTGASAQM LSMQEIFGVAFPEDKPKPWRVFSAATPAESVSLVIGLIRKGYAQHEP" gene 2807278..2808030 /locus_tag="Rv2492" /db_xref="GeneID:887436" CDS 2807278..2808030 /locus_tag="Rv2492" /function="UNKNOWN" /note="Rv2492, (MTV008.48), len: 250 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217008.1" /db_xref="GI:15609629" /db_xref="GeneID:887436" /translation="MSRRIINEFGVQIYGATIGDTWAGLVRAVLDLGSQCFDEDRERI ALSNVRIKSSVQNYPDLTIEEHCNSAQLKAMLDFMFNTDTMEDIDVVKSFSRGAKSYH RRIKEGRMIEFVIERLSLIPESKKAVVVFPTYEDYAAVMRNHRDDYLPCLVSIQFRLL PDGKDYVFHTTFYSRSMDAWQKGHGNLLSIAKLSDWVRENVSARIGRKIMLGPLDGMI CDVHIYKETYAEACKRLANLDLRRTQFDAVRN" gene 2808083..2808304 /locus_tag="Rv2493" /db_xref="GeneID:887480" CDS 2808083..2808304 /locus_tag="Rv2493" /function="UNKNOWN" /note="Rv2493, (MTV008.49), len: 73 aa. Conserved hypothetical protein, highly similar to AAK46916|MT2606 HYPOTHETICAL 8.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (74 aa), FASTA scores: opt: 234, E(): 4e-09, (56.95% identity in 74 aa overlap); and similar to O53373|Rv3321c|MTV016.21c HYPOTHETICAL 8.8 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (80 aa), FASTA scores: opt: 126, E(): 0.055, (30.75% identity in 78 aa overlap); and with weak similarity with other Mycobacterial hypothetical proteins e.g. Q9CCR7|ML0525 from Mycobacterium leprae (58 aa), FASTA scores: opt: 115, E(): 0.22, (47.75% identity in 44 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217009.1" /db_xref="GI:15609630" /db_xref="GeneID:887480" /translation="MRTTLDLDDDVIAAARELASSQRRSLGSVISELARRGLMPGRVE ADDGLPVIRVPAGTPPITPEMVRRALDED" gene 2808310..2808735 /locus_tag="Rv2494" /db_xref="GeneID:887700" CDS 2808310..2808735 /locus_tag="Rv2494" /function="UNKNOWN" /note="Rv2494, (MTV008.50), len: 141 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. P95023|EMBL:Z83863|MTCY159.26|Rv2530c (139 aa) FASTA scores: opt: 380 E(): 6.6e-19, (48.0% identity in 125 aa overlap); O53372|Rv3320c|MTV016.20c (142 aa), FASTA scores: opt: 287, E(): 1.3e-12, (41.6% identity in 125 aa overlap); AAK46915|MT2605 (strain CDC1551) (139 aa) FASTA scores: opt: 380, E(): 6.6e-19 (48.0% identity in 125 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217010.1" /db_xref="GI:15609631" /db_xref="GeneID:887700" /translation="MALLDVNALVALAWDSHIHHARIREWFTANATLGWATCPLTEAG FVRVSTNPKVLPSAIGIADARRVLVALRAVGGHRFLADDVSLVDDDVPLIVGYRQVTD AHLLTLARRRGVRLVTFDAGVFTLAQQRPKTPVELLTIL" gene complement(2808758..2809939) /gene="pdhC" /locus_tag="Rv2495c" /db_xref="GeneID:888237" CDS complement(2808758..2809939) /gene="pdhC" /locus_tag="Rv2495c" /EC_number="2.3.1.12" /function="INVOLVED IN ENERGY METABOLISM. THE PYRUVATE DEHYDROGENASE COMPLEX CATALYZES THE OVERALL CONVERSION OF PYRUVATE TO ACETYL-CoA & CO(2). IT CONTAINS MULTIPLE COPIES OF THREE ENZYMATIC COMPONENTS: PYRUVATE DEHYDROGENASE (E1), DIHYDROLIPOAMIDE ACETYLTRANSFERASE (E2) & LIPOAMIDE DEHYDROGENASE (E3) [CATALYTIC ACTIVITY: ACETYL-CoA + DIHYDROLIPOAMIDE = CoA + S-ACETYLDIHYDROLIPOAMIDE]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of acetyl from acetyldihydrolipoamide to coenzyme A to form acetyl CoA" /codon_start=1 /transl_table=11 /product="branched-chain alpha-keto acid dehydrogenase subunit E2" /protein_id="NP_217011.1" /db_xref="GI:15609632" /db_xref="GeneID:888237" /translation="MSGEDSIRSFPVPDLGEGLQEVTVTCWSVAVGDDVEINQTLCSV ETAKAEVEIPSPYAGRIVELGGAEGDVLKVGAELVRIDTGPTAVAQPNGEGAVPTLVG YGADTAIETSRRTSRPLAAPVVRKLAKELAVDLAALQRGSGAGGVITRADVLAAARGG VGAGPDVRPVHGVHARMAEKMTLSHKEIPTAKASVEVICAELLRLRDRFVSAAPEITP FALTLRLLVIALKHNVILNSTWVDSGEGPQVHVHRGVHLGFGAATERGLLVPVVTDAQ DKNTRELASRVAELITGAREGTLTPAELRGSTFTVSNFGALGVDDGVPVINHPEAAIL GLGAIKPRPVVVGGEVVARPTMTLTCVFDHRVVDGAQVAQFMCELRDLIESPETALLD L" gene complement(2809936..2810982) /gene="pdhB" /locus_tag="Rv2496c" /db_xref="GeneID:888571" CDS complement(2809936..2810982) /gene="pdhB" /locus_tag="Rv2496c" /EC_number="1.2.4.1" /function="INVOLVED IN ENERGY METABOLISM. THE PYRUVATE DEHYDROGENASE COMPLEX CATALYZES THE OVERALL CONVERSION OF PYRUVATE TO ACETYL-CoA & CO(2). IT CONTAINS MULTIPLE COPIES OF THREE ENZYMATIC COMPONENTS: PYRUVATE DEHYDROGENASE (E1), DIHYDROLIPOAMIDE ACETYLTRANSFERASE (E2) & LIPOAMIDE DEHYDROGENASE (E3). [CATALYTIC ACTIVITY: PYRUVATE + LIPOAMIDE = S-ACETYL-DIHYDRO-LIPOAMIDE + CO(2)]." /experiment="experimental evidence, no additional details recorded" /note="Rv2496c, (MTCY07A7.02c), len: 348 aa. Probable pdhB, pyruvate dehydrogenase e1 component, beta subunit (EC 1.2.4.1), similar to others e.g. Q9Y8I6||PDHB from Halobacterium volcanii (Haloferax volcanii) (327 aa) FASTA scores: opt: 1050, E(): 6.4e-60, (49.7% identity in 324 aa overlap); Q9KG98|BH0214 from Bacillus halodurans (328 aa), FASTA scores: opt: 987, E(): 6.9e-56, (45.7% identity in 324 aa overlap); Q9HN76|PDHB|VNG2218G from Halobacterium sp. strain NRC-1 (297 aa), FASTA scores: opt: 968, E(): 1.1e-54, (51.2% identity in 297 aa overlap); P21874|ODPB_BACST|PDHB PYRUVATE DEHYDROGENASE E1 COMPONENT from Bacillus stearothermophilus (324 aa), FASTA scores: opt: 951, E(): 1.4e-53, (47.6% identity in 321 aa overlap); etc. Also similar to Q9XA61|SCGD3.17c PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE E1, BETA SUBUNIT (2-oxoisovalerate dehydrogenase) (EC 1.2.4.4) from Streptomyces coelicolor, (326 aa), FASTA scores: opt: 1178, E(): 4.1e-68, (55.0% identity in 322 aa overlap); Q9XA48|SCGD3.31c PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE E1 BETA SUBUNIT from Streptomyces coelicolor (334 aa), FASTA scores: opt: 1173, E(): 8.8e-68, (55.6% identity in 320 aa overlap); Q53593|BKDB E1-BETA BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE from Streptomyces avermitilis (334 aa), FASTA scores: opt: 1132, E(): 3.7e-65, (55.0% identity in 320 aa overlap); etc." /codon_start=1 /transl_table=11 /product="pyruvate dehydrogenase E1 component beta subunit PdhB" /protein_id="NP_217012.1" /db_xref="GI:15609633" /db_xref="GeneID:888571" /translation="MTQIADRPARPDETLAVAVSDITQSLTMVQAINRALYDAMAADE RVLVFGEDVAVEGGVFRVTEGLADTFGADRCFDTPLAESAIIGIAVGLALRGFVPVPE IQFDGFSYPAFDQVVSHLAKYRTRTRGEVDMPVTVRIPSFGGIGAAEHHSDSTESYWV HTAGLKVVVPSTPGDAYWLLRHAIACPDPVMYLEPKRRYHGRGMVDTSRPEPPIGHAM VRRSGTDVTVVTYGNLVSTALSSADTAEQQHDWSLEVIDLRSLAPLDFDTIAASIQRT GRCVVMHEGPRSLGYGAGLAARIQEEMFYQLEAPVLRACGFDTPYPPARLEKLWLPGP DRLLDCVERVLRQP" gene complement(2810993..2812096) /gene="pdhA" /locus_tag="Rv2497c" /db_xref="GeneID:888583" CDS complement(2810993..2812096) /gene="pdhA" /locus_tag="Rv2497c" /EC_number="1.2.4.1" /function="INVOLVED IN ENERGY METABOLISM. THE PYRUVATE DEHYDROGENASE COMPLEX CATALYZES THE OVERALL CONVERSION OF PYRUVATE TO ACETYL-CoA & CO(2). IT CONTAINS MULTIPLE COPIES OF THREE ENZYMATIC COMPONENTS: PYRUVATE DEHYDROGENASE (E1), DIHYDROLIPOAMIDE ACETYLTRANSFERASE (E2) & LIPOAMIDE DEHYDROGENASE (E3) [CATALYTIC ACTIVITY: PYRUVATE + LIPOAMIDE = S-ACETYL-DIHYDRO-LIPOAMIDE + CO(2)]." /experiment="experimental evidence, no additional details recorded" /note="Rv2497c, (MTCY07A7.03c), len: 367 aa. Probable pdhA, pyruvate dehydrogenase e1 component, alpha subunit (EC 1.2.4.1), similar to many e.g. Q9Y8I5|PDHA from Halobacterium volcanii (Haloferax volcanii) (368 aa) FASTA scores: opt: 961, E(): 1.3e-52, (45.6% identity in 351 aa overlap); BAB40585 from Bacillus sp. UTB2301 (356 aa) FASTA scores: opt: 947, E(): 9.1e-52, (43.1% identity in 355 aa overlap); Q9KG99|BH0213 from Bacillus halodurans (367 aa), FASTA scores: opt: 896, E(): 1.4e-48, (42.65% identity in 340 aa overlap); etc. Also similar to several PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASES E1, BETA SUBUNIT (EC 1.2.4.4), alternate name : 2-oxoisovalerate dehydrogenase, e.g. Q53592|BKDA from Streptomyces avermitilis (381 aa), FASTA scores: opt: 980, E(): 8.5e-54, (45.65% identity in 370 aa overlap); etc." /codon_start=1 /transl_table=11 /product="pyruvate dehydrogenase E1 component alpha subunit PdhA" /protein_id="NP_217013.1" /db_xref="GI:15609634" /db_xref="GeneID:888583" /translation="MGEGSRRPSGMLMSVDLEPVQLVGPDGTPTAERRYHRDLPEETL RWLYEMMVVTRELDTEFVNLQRQGELALYTPCRGQEAAQVGAAACLRKTDWLFPQYRE LGVYLVRGIPPGHVGVAWRGTWHGGLQFTTKCCAPMSVPIGTQTLHAVGAAMAAQRLD EDSVTVAFLGDGATSEGDVHEALNFAAVFTTPCVFYVQNNQWAISMPVSRQTAAPSIA HKAIGYGMPGIRVDGNDVLACYAVMAEAAARARAGDGPTLIEAVTYRLGPHTTADDPT RYRSQEEVDRWATLDPIPRYRTYLQDQGLWSQRLEEQVTARAKHVRSELRDAVFDAPD FDVDEVFTTVYAEITPGLQAQREQLRAELARTD" gene complement(2812355..2813176) /gene="citE" /locus_tag="Rv2498c" /db_xref="GeneID:887466" CDS complement(2812355..2813176) /gene="citE" /locus_tag="Rv2498c" /EC_number="4.1.3.6" /function="INTERCONVERSION OF ACETATE AND OXALOACETATE FROM CITRATE [CATALYTIC ACTIVITY: CITRATE = ACETATE + OXALOACETATE]." /note="Rv2498c, (MTCY07A7.04c), len: 273 aa. Probable citE, citrate lyase, beta subunit (EC 4.1.3.6), similar to others e.g. Q9S3L3|CITE from Corynebacterium glutamicum (Brevibacterium flavum) (217 aa), FASTA scores: opt: 565, E(): 1.5e-28, (41.85% identity in 215 aa overlap); Q9HRM8|CITE|VNG0627G from Halobacterium sp. strain NRC-1 (303 aa), FASTA scores: opt: 535, E(): 1.5e-26, (41.65% identity in 276 aa overlap); Q9S2U9|SC4G6.02 from Streptomyces coelicolor (274 aa), FASTA scores: opt: 426, E(): 1e-19, (37.6% identity in 274 aa overlap); P77770|CILB_ECOLI from Escherichia coli (307 aa), FASTA scores: opt: 265, E(): 1.5e-10, (32.8% identity in 265 aa overlap); etc. Also similar to Rv3075c|MTCY22D7.06 from Mycobacterium tuberculosis, FASTA score: (35.2% identity in 264 aa overlap)." /codon_start=1 /transl_table=11 /product="citrate (Pro-3S)-lyase beta subunit" /protein_id="NP_217014.1" /db_xref="GI:15609635" /db_xref="GeneID:887466" /translation="MNLRAAGPGWLFCPADRPERFAKAAAAADVVILDLEDGVAEAQK PAARNALRDTPLDPERTVVRINAGGTADQARDLEALAGTAYTTVMLPKAESAAQVIEL APRDVIALVETARGAVCAAEIAAADPTVGMMWGAEDLIATLGGSSSRRADGAYRDVAR HVRSTILLAASAFGRLALDAVHLDILDVEGLQEEARDAAAVGFDVTVCIHPSQIPVVR KAYRPSHEKLAWARRVLAASRSERGAFAFEGQMVDSPVLTHAETMLRRAGEATSE" gene complement(2813173..2813730) /locus_tag="Rv2499c" /db_xref="GeneID:888584" CDS complement(2813173..2813730) /locus_tag="Rv2499c" /function="UNKNOWN" /note="Rv2499c, (MTCY07A7.05c), len: 185 aa. Possible oxidase regulatory-related protein, similar to many maoC MONOAMINE OXIDASE REGULATORY PROTEIN e.g. Q9RUZ1|DR1239 MAOC-RELATED PROTEIN from Deinococcus radiodurans (160 aa), FASTA scores: opt: 519, E(): 7.6e-28, (58.1% identity in 148 aa overlap); BAB48392|MLR0905 Probable monoamine oxidase regulatory protein from Rhizobium loti (Mesorhizobium loti) (150 aa), FASTA scores: opt: 480, E(): 2.9e-25, (49.0% identity in 149 aa overlap); Q9HN18|MAOC1|VNG2290G MONOAMINE OXIDASE REGULATORY-LIKE from Halobacterium sp. strain NRC-1 (208 aa), FASTA scores: opt: 419, E(): 4.6e-21, (45.6% identity in 158 aa overlap); P77455|MAOC_ECOLI|PAAZ|B1387 MaoC protein (Phenylacetic acid degradation protein paaZ) from Escherichia coli strain K12 (681 aa), FASTA scores: opt: 252, E(): 1.9e-09, (36.0% identity in 172 aa overlap); etc. But also similar to other proteins with different putative functions e.g. Q9HRM9|MAOC2|VNG0626G MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Halobacterium sp strain NRC-1 (157 aa), FASTA scores: opt: 380, E(): 1.5e-18, (45.75% identity in 153 aa overlap); Q9KIF1 FKBR2 from Streptomyces hygroscopicus var. ascomyceticus (175 aa), FASTA scores: opt: 355, E(): 7.6e-17, (42.0% identity in 150 aa overlap); CAC36828|Q99Q03|SAPE Spore associated protein from Streptomyces coelicolor (174 aa), FASTA scores: opt: 318, E(): 2.2e-14, (41.45% identity in 152 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidase regulatory-like protein" /protein_id="NP_217015.1" /db_xref="GI:15609636" /db_xref="GeneID:888584" /translation="MTKHAGDRESDDAVSACRVAGSTVGRRILQRGLWFEEFQIGTTY LHRPGRTVTEADNVLFTTLTMNTQSLHLDAAWAGQQPGFRGERLVNSMFTLSTMVGLS VAQLTLGTIVANLGFSEVSFPKPVFHGDTLYAETVCTGKRESKSRPGEGIVTLEHIAR NQHGEVVARAVRTTLVQKQSIKEAQ" gene complement(2813727..2814911) /gene="fadE19" /locus_tag="Rv2500c" /db_xref="GeneID:888541" CDS complement(2813727..2814911) /gene="fadE19" /locus_tag="Rv2500c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT SEEMS INVOLVED IN METABOLISM OF SMALL BRANCHED-CHAIN FATTY ACIDS AND MACROLIDE ANTIBIOTIC PRODUCTION. CATALYSES THE ALPHA, BETA-DEHYDROGENETION OF ACYL-CoA ESTERS AND TRANSFER ELECTRONS TO ETF, THE ELECTRON TRANSFER PROTEIN." /note="Rv2500c, (MTCY07A7.06c), len: 394 aa. Possible fadE19 (alternate gene name: , acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9XCG6|ACDH from Streptomyces coelicolor (386 aa), FASTA scores: opt: 1714, E(): 1.1e-98, (69.45% identity in 383 aa overlap); Q9XCG5|ACDH from Streptomyces avermitilis (386 aa), FASTA scores: opt: 1713, E(): 1.3e-98, (70.0% identity in 383 aa overlap); Q9L7W5|FENK from Bacillus subtilis (370 aa), FASTA scores: opt: 1094, E(): 2.3e-60, (48.4% identity in 372 aa overlap); etc. Contains PS00072 Acyl-CoA dehydrogenases signature 1, PS00073 Acyl-CoA dehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY.; mmgC" /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="NP_217016.1" /db_xref="GI:15609637" /db_xref="GeneID:888541" /translation="MTTTTTTISGGILPKEYQDLRDTVADFARTVVAPVSAKHDAEHS FPYEIVAKMGEMGLFGLPFPEEYGGMGGDYFALSLVLEELGKVDQSVAITLEAAVGLG AMPIYRFGTEEQKQKWLPDLTSGRALAGFGLTEPGAGSDAGSTRTTARLEGDEWIING SKQFITNSGTDITSLVTVTAVTGTTGTAADAKKEISTIIVPSGTPGFTVEPVYNKVGW NASDTHPLTFADARVPRENLLGARGSGYANFLSILDEGRIAIAALATGAAQGCVDESV KYANQRQSFGQPIGAYQAIGFKIARMEARAHVARTAYYDAAAKMLAGKPFKKEAAIAK MISSEAAMDNSRDATQIHGGYGFMNEYPVARHYRDSKVLEIGEGTTEVQLMLIARSLG LQ" misc_feature complement(2813805..2813864) /gene="fadE19" /locus_tag="Rv2500c" /note="PS00073 Acyl-CoA dehydrogenases signature 2" misc_feature complement(2814480..2814518) /gene="fadE19" /locus_tag="Rv2500c" /note="PS00072 Acyl-CoA dehydrogenases signature 1" gene complement(2814916..2816880) /gene="accA1" /locus_tag="Rv2501c" /db_xref="GeneID:887309" CDS complement(2814916..2816880) /gene="accA1" /locus_tag="Rv2501c" /EC_number="6.3.4.14" /function="THIS PROTEIN CARRIES TWO FUNCTIONS: BIOTIN CARBOXYL CARRIER PROTEIN AND BIOTIN CARBOXYLTRANSFERASE. INVOLVED IN THE FIRST STEP OF LONG-CHAIN FATTY ACID SYNTHESIS [CATALYTIC ACTIVITY: ATP + BIOTIN-CARBOXYL-CARRIER PROTEIN + CO(2) = ADP + PHOSPHATE + CARBOXYBIOTIN-CARBOXYL-CARRIER PROTEIN]." /note="Rv2501c, (MTCY07A7.07c, P46401), len: 654 aa. Probable accA1 (alternate gene name: bccA), acetyl-/propionyl-coenzyme A carboxylase (alpha subunit) [INCLUDES: BIOTIN CARBOXYLASE (EC 6.3.4.14); BIOTIN CARBOXYL CARRIER PROTEIN (BCCP)], similar to others eg Q9L076|FABG from Streptomyces coelicolor (646 aa), FASTA scores: opt: 2071, E(): 1e-113, (57.8% identity in 659 aa overlap); AAK24139|Q9A6C6|CC2168 from Caulobacter crescentus (654 aa), FASTA scores: opt: 1754, E(): 3.7e-95, (47.2% identity in 661 aa overlap); etc. Contains PS00188 Biotin-requiring enzymes attachment site, PS00866 Carbamoyl-phosphate synthase subdomain signature 1, and PS00867 Carbamoyl-phosphate synthase subdomain signature 2.; bccA" /codon_start=1 /transl_table=11 /product="acetyl-/propionyl-coenzyme A carboxylase subunit alpha" /protein_id="NP_217017.1" /db_xref="GI:15609638" /db_xref="GeneID:887309" /translation="MFDTVLVANRGEIAVRVIRTLRRLGIRSVAVYSDPDVDARHVLE ADAAVRLGPAPARESYLDIGKVLDAAARTGAQAIHPGYGFLAENADFAAACERARVVF LGPPARAIEVMGDKIAAKNAVAAFDVPVVPGVARAGLTDDALVTAAAEVGYPVLIKPS AGGGGKGMRLVQDPARLPEALVSARREAMSSFGDDTLFLERFVLRPRHIEVQVLADAH GNVVHLGERECSLQRRHQKVIEEAPSPLLDPQTRERIGVAACNTARCVDYVGAGTVEF IVSAQRPDEFFFMEMNTRLQVEHPVTEAITGLDLVEWQLRVGAGEKLGFAQNDIELRG HAIEARVYAEDPAREFLPTGGRVLAVFEPAGPGVRVDSSLLGGTVVGSDYDPLLTKVI AHGADREEALDRLDQALARTAVLGVQTNVEFLRFLLADERVRVGDLDTAVLDERSADF TARPAPDDVLAAGGLYRQWALARRAQGDLWAAPSGWRGGGHMAPVRTAMRTPLRSETV SVWGPPESAQVQVGDGEIDCASVQVTREQMSVTISGLRRDYRWAEADRHLWIADERGT WHLREAEEHKIHRAVGARPAEVVSPMPGSVIAVQVESGSQISAGDVVVVVEAMKMEHS LEAPVSGRVQVLVSVGDQVKVEQVLARIKD" misc_feature complement(2815000..2815053) /gene="accA1" /locus_tag="Rv2501c" /note="PS00188 Biotin-requiring enzymes attachment site" misc_feature complement(2815996..2816019) /gene="accA1" /locus_tag="Rv2501c" /note="PS00867 Carbamoyl-phosphate synthase subdomain signature 2" misc_feature complement(2816380..2816424) /gene="accA1" /locus_tag="Rv2501c" /note="PS00866 Carbamoyl-phosphate synthase subdomain signature 1" gene complement(2816885..2818474) /gene="accD1" /locus_tag="Rv2502c" /db_xref="GeneID:887168" CDS complement(2816885..2818474) /gene="accD1" /locus_tag="Rv2502c" /EC_number="6.4.1.-" /function="INVOLVED IN FATTY ACID METABOLISM." /note="Rv2502c, (MTCY07A7.08c), len: 529 aa. Probable accD1, acetyl-/propionyl-CoA carboxylase (beta subunit) (EC 6.4.1.-), similar, but with N-terminus shorter, to Q9L077|ACCD1 from Streptomyces coelicolor (538 aa), FASTA scores: opt: 2747, E(): 1.9e-159, (77.9% identity in 516 aa overlap). Also similar to others e.g. AAK24141|CC2170 from Caulobacter crescentus (530 aa), FASTA scores: opt: 2413, E(): 3.8e-139, (69.4% identity in 529 aa overlap); BAB54131|MLL7731 from Rhizobium loti (537 aa), FASTA scores: opt: 2399, E(): 2.7e-138, (67.4% identity in 527 aa overlap); etc. COULD BELONG TO THE ACCD/PCCB FAMILY." /codon_start=1 /transl_table=11 /product="acetyl-/propionyl-CoA carboxylase subunit beta" /protein_id="NP_217018.1" /db_xref="GI:15609639" /db_xref="GeneID:887168" /translation="MTTPSIAIAPSFADEHRRLVAELNNKLAAAALGGNERARKRHVS RGKLLPRERVDRLLDPGSPFLELAPLAAGGMYGDESPGAGIITGIGRVSGRQCVIVAN DATVKGGTYYPMTVKKHLRAQEVALQNMLPCIYLVDSGGAFLPRQDEVFPDREHFGRI FYNQATMSAKGIPQVAAVLGSCTAGGAYVPAMSDEAVIVREQGTIFLGGPPLVKAATG EIVSAEELGGGDLHSRTSGVTDHLADDDEDALRIVRAIADTFGPCEPAQWDVRRSVEP KYPQAELYDVVPPDPRVPYDVHEVVVRIVDGSEFSEFKAKYGKTLVTAFARVHGHPVG IVANNGVLFSESALKGAHFIELCDKRKIPLLFLQNIAGFMVGRDYEAGGIAKHGAKMV TAVACARVPKLTVVIGGSYGAGNYSMCGRAYSPRFLWMWPNARISVMGGEQAASVLAT VRGEQLSAAGTPWSPDEEEAFKAPIRAQYEDQGNPYYSTARLWDDGIIDPADTRTVVG LALSLCAHAPLDQVGYGVFRM" gene complement(2818471..2819127) /gene="scoB" /locus_tag="Rv2503c" /db_xref="GeneID:888502" CDS complement(2818471..2819127) /gene="scoB" /locus_tag="Rv2503c" /EC_number="2.8.3.5" /function="INVOLVED IN VARIOUS DEGRADATION AND SYNTHESIS [CATALYTIC ACTIVITY: SUCCINYL-CoA + A 3-OXO ACID = SUCCINATE + A 3-OXO-ACYL-COA]." /note="Rv2503c, (MTCY07A7.09c, MT2578), len: 218 aa. Probable scoB, 3-oxo acid:CoA transferase, beta subunit (succinyl-CoA:3-ketoacid-CoA transferase) (EC 2.8.3.5). Highly similar to others e.g. Q9XAM8|SC4C6.12c from Streptomyces coelicolor (217 aa), FASTA scores: opt: 1048, E(): 2.6e-60, (73.9% identity in 207 aa overlap); Q9XD82|PCAJ from Streptomyces sp. 2065 (214 aa), FASTA scores: opt: 1031, E(): 3.2e-59, (70.8% identity in 209 aa overlap); AAK53493|LPSJ from Xanthomonas campestris (pv. campestris) (212 aa), FASTA scores: opt: 886, E(): 6.6e-50, (62.5% identity in 208 aa overlap); P42316|SCOB_BACSU from Bacillus subtilis (216 aa), FASTA scores: opt: 820, E(): 1.2e-45, (58.2% identity in 201 aa overlap); etc. BELONGS TO THE 3-OXOACID COA-TRANSFERASE SUBUNIT B FAMILY." /codon_start=1 /transl_table=11 /product="succinyl-CoA:3-ketoacid-coenzyme A transferase subunit beta ScoB" /protein_id="NP_217019.1" /db_xref="GI:15609640" /db_xref="GeneID:888502" /translation="MSAPGWSRDEMAARVAAEFEDGQYVNLGIGMPTLIPNHIPDGVH VVLHSENGILGVGPYPRREDVDADLINAGKETVTTLPGAAFFSSSTSFGIIRGGHLDV AVLGAMQVSVTGDLANWMIPGKMVKGMGGAMDLVHGARKVIVMMEHTAKDGSPKILER CTLPLTGVGCVDRIVTELAVIDVCADGLHLVQTAPGVSVDEVVAKTQPPLVLRDLATQ" gene complement(2819124..2819870) /gene="scoA" /locus_tag="Rv2504c" /db_xref="GeneID:888503" CDS complement(2819124..2819870) /gene="scoA" /locus_tag="Rv2504c" /EC_number="2.8.3.6" /function="INVOLVED IN FATTY ACID DEGRADATION/SYNTHESIS [CATALYTIC ACTIVITY: SUCCINYL-CoA + A 3-OXO ACID = SUCCINATE + A 3-OXO-ACYL-COA]." /note="Rv2504c, (MT2579, MTCY07A7.10c), len: 248 aa. Probable scoA, succinyl-CoA:3-ketoacid-Coenzyme A transferase, alpha subunit (3-oxo acid:CoA transferase) (EC 2.8.3.6). Highly similar to others e.g. Q9XAM7|SC4C6.13c from Streptomyces coelicolor (260 aa), FASTA scores: opt: 1130, E(): 2.2e-64, (69.9% identity in 249 aa overlap); Q9XD83|PCAI from Streptomyces sp. 2065 (251 aa), FASTA scores: opt: 1121, E(): 8.1e-64, (69.5% identity in 249 aa overlap); etc. BELONGS TO THE 3-OXOACID COA-TRANSFERASE SUBUNIT A FAMILY." /codon_start=1 /transl_table=11 /product="succinyl-CoA:3-ketoacid-coenzyme A transferase subunit alpha ScoA" /protein_id="NP_217020.1" /db_xref="GI:15609641" /db_xref="GeneID:888503" /translation="MDKVVATAAEAVADIANGSSLAVGGFGLCGIPEALIAALVDSGV TDLETVSNNCGIDGVGLGLLLQHKRIRRTVSSYVGENKEFARQFLAGELEVELTPQGT LAERLRAGGMGIPAFYTPAGVGTQVADGGLPWRYDASGGVAVVSPAKETREFDGVTYV LERGIRTDFALVHAWQGDRHGNLMYRHAAANFNPECASAGRITIAEVEHLVEPGEIDP ATVHTPGVFVHRVVHVPNPAKKIERETVRQ" gene complement(2819953..2821596) /gene="fadD35" /locus_tag="Rv2505c" /db_xref="GeneID:887774" CDS complement(2819953..2821596) /gene="fadD35" /locus_tag="Rv2505c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="AMP-binding domain protein" /protein_id="NP_217021.1" /db_xref="GI:15609642" /db_xref="GeneID:887774" /translation="MAAAEVVDPNRLSYDRGPSAPSLLESTIGANLAATAARYGHREA LVDMVARRRFNYSELLTDVHRLATGLVRAGIGPGDRVGIWAPNRWEWVLVQYATAEIG AILVTINPAYRVREVEYALRQSGVAMVIAVASFKDADYAAMLAEVGPRCPDLADVILL ESDRWDALAGAEPDLPALQQTAARLDGSDPVNIQYTSGTTAYPKGVTLSHRNILNNGY LVGELLGYTAQDRICIPVPFYHCFGMVMGNLAATSHGAAMVIPAPGFDPAATLRAVQD ERCTSLYGVPTMFIAELGLPDFTDYELGSLRTGIMAGAACPVEVMRKVISRMHMPGVS ICYGMTETSPVSTQTRADDSVDRRVGTVGRVGPHLEIKVVDPATGETVPRGVVGEFCT RGYSVMAGYWNDPQKTAEVIDADGWMHTGDLAEMDPSGYVRIAGRIKDLVVRGGENIS PREIEELLHTHPDIVDGHVIGVPDAKYGEELMAVVKLRNDAPELTIERLREYCMGRIA RFKIPRYLWIVDEFPMTVTGKVRKVEMRQQALEYLRGQQ" gene 2821712..2822359 /locus_tag="Rv2506" /db_xref="GeneID:888516" CDS 2821712..2822359 /locus_tag="Rv2506" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2506, (MTCY07A7.12), len: 215 aa. Probable transcriptional regulator, tetR family, similar to many others e.g. Q9L078|SCC105.06c PUTATIVE TETR-FAMILY REGULATORY PROTEIN from Streptomyces coelicolor (208 aa), FASTA scores: opt: 333, E(): 1.5e-14, (48.75% identity in 197 aa overlap); Q9X7X6|SC6A5.30c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (404 aa), FASTA scores: opt: 267, E(): 4.8e-10, (30.45% identity in 207 aa overlap) (similarity only with C-terminus for this one); Q9FBI8|SCP8.33c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (213 aa), FASTA scores: opt: 239, E(): 1.8e-08, (29.9% identity in 184 aa overlap); etc. Also similar to transcriptional regulatory proteins from Mycobacterium tuberculosis e.g. O05858|Rv3208|MTCY07D11.18c (228 aa), FASTA scores: opt: 218, E(): 4.4e-07, (30.35% identity in 191 aa overlap); C-terminus of P95251|Rv1963c|MTV051.01c|MTCY09F9.01 (406 aa), FASTA scores: opt: 238, E(): 3.6e-08, (28.25% identity in 177 aa overlap); P96839|Rv3557c|MTCY06G11.04c (200 aa), FASTA scores: opt: 215, E(): 6.2e-07, (38.25% identity in 148 aa overlap); etc. Equivalent to AAK46885 from Mycobacterium tuberculosis strain CDC1551 (231 aa) but shorter 16 aa. Contains probable helix-turn-helix motif at aa 46-67, (Score 1660, +4.84 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217022.1" /db_xref="GI:15609643" /db_xref="GeneID:888516" /translation="MTASAPDGRPGQPEATNRRSQLKSDRRFQLLAAAERLFAERGFL AVRLEDIGAAAGVSGPAIYRHFPNKESLLVELLVGVSARLLAGARDVTTRSANLAAAL DGLIEFHLDFALGEADLIRIQDRDLAHLPAVAERQVRKAQRQYVEVWVGVLRELNPGL AEADARLMAHAVFGLLNSTPHSMKAADSKPARTVRARAVLRAMTVAALSAADRCL" gene 2822438..2823259 /locus_tag="Rv2507" /db_xref="GeneID:888244" CDS 2822438..2823259 /locus_tag="Rv2507" /function="UNKNOWN" /note="Rv2507, (MTCY07A7.13), len: 273 aa. Possible conserved pro-rich membrane protein (N-terminal half is Proline-rich), highly similar to Q9CCU3|ML0431 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (259 aa) (alias O07711|MLCL383.38c but longer 2 aa), FASTA scores: opt: 968, E(): 1.4e-31, (60.35% identity in 275 aa overlap). Contains potential membrane spanning region." /codon_start=1 /transl_table=11 /product="proline rich membrane protein" /protein_id="NP_217023.1" /db_xref="GI:15609644" /db_xref="GeneID:888244" /translation="MNDPRRPQRFGPPLSGYGPTGPQVPPNPPTADPAYADQSPYAST YGGYVSPPWSPGGPPPRPPQWPPGPHEASPTQQLPQYWQYDQPPPGGFPPDGLTPPPP QGPRTPRWLWFAAGSAVLLVVALVIALVIANGSVKKQTAIEPLPPMPGPSPTRPTTTT PTPPSPSAAPAPTTTTGTPSETVAGAMQTVVYDVTGEGRAISITYMDSGNVIQTEFNV ALPWRKEVSLSKSSLHPASVTIVNIGHNVTCSVTVAGVQVRQRTGAGLTICDAPS" gene complement(2823256..2824593) /locus_tag="Rv2508c" /db_xref="GeneID:888527" CDS complement(2823256..2824593) /locus_tag="Rv2508c" /function="UNKNOWN" /note="Rv2508c, (MTCY07A7.14c), len: 445 aa. Probable conserved integral membrane leu-, ala-rich protein, equivalent to Q9CCU4|ML0430 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (454 aa) (alias O07710|MLCL383.37 longer 10 aa), FASTA scores: opt: 2205, E(): 2.5e-124, (75.75% identity in 441 aa overlap). Also similar to hypothetical or membrane proteins e.g. BAB50841|MLL4103 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (458 aa), FASTA scores: opt: 396, E(): 2.4e-16, (27.75% identity in 447 aa overlap); Q9RKX9|SC6D7.19c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (486 aa), FASTA scores: opt: 323, E(): 5.7e-12, (28.95% identity in 428 aa overlap); P42306|YXIO_BACSU PROBABLE INTEGRAL MEMBRANE PROTEIN from Bacillus subtilis (428 aa), FASTA scores: opt: 220, E(): 7.2e-06, (20.35% identity in 413 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. Q10564|Y876_MYCTU|Rv0876c|MT0899|MTCY31.04c (548 aa), FASTA scores: opt: 184, E(): 0.0012, (24.7% identity in 466 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217024.1" /db_xref="GI:15609645" /db_xref="GeneID:888527" /translation="MNNPGSRAGTLLHFRVVAWAMWDCGSTGLNAIVTTFVFSVYLTS AVGQGLPGGTSPASWLGRAGAVAGLTIGVLAPVVGVWVESPHRRRVALSVLTGTAVAL TCAMFLIRDDPRYLWAGLVLLAATAASSDLSSVPYNAMLRQLSTPSTAGRISGFGWAS GYVGSVALLLVIYLGFMSGSGSQRGLLQLPVANGLNVRMAMLVAAAWLALLGLPLLLV AHRLPDSGAASHPSTGLLGGYRKLWTEISAEWRRDRNLVYFLVASAIFRDGLAAIFAF GAVLGVNAYGLTQADVLIFGAAASVVAAVGAVLGGFVDHRIGSKPVIVGSLAAIIAAA LTLLTLSGPTAFWACGLLLCVFIGPAQSSARALLLHMAQHGKEGVAFGLYTMTGRAVS FLGPWLFSVFVDVFHTVRAGLGGVCLVLTTGLLLMLRVQVSRHGGALTTAQSS" gene 2824678..2825484 /locus_tag="Rv2509" /db_xref="GeneID:888526" CDS 2824678..2825484 /locus_tag="Rv2509" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2509, (MTCY07A7.15), len: 268 aa. Probable ala-rich oxidoreductase, short-chain dehydrogenase/reductase (EC 1.-.-.-), equivalent to O07709|MLCL383.36c|ML0429 DEHYDROGENASE (PUTATIVE OXIDOREDUCTASE) from Mycobacterium leprae (268 aa), FASTA scores: opt: 1509, E(): 2.6e-84, (88.75% identity in 267 aa overlap). Also highly similar to others e.g. O86553|SC1F2.16c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (276 aa), FASTA scores: opt: 492, E(): 9.5e-23, (38.15% identity in 262 aa overlap); Q9I5R3|PA0658 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (266 aa), FASTA scores: opt: 472, E(): 1.5e-21, (37.8% identity in 246 aa overlap); AAK22120|CC0133 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Caulobacter crescentus (266 aa), FASTA scores: opt: 428, E(): 6.9e-19, (35.8% identity in 243 aa overlap); etc. Also highly similar or similar to oxidoreductases from Mycobacterium tuberculosis e.g. Q10782|Rv1544|MTCY48.21 PUTATIVE KETOACYL REDUCTASE (EC 1.3.1.-) (267 aa), FASTA scores: opt: 656, E(): 1.1e-32, (43.05% identity in 267 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short-chain type dehydrogenase/reductase" /protein_id="NP_217025.1" /db_xref="GI:15609646" /db_xref="GeneID:888526" /translation="MPIPAPSPDARAVVTGASQNIGAALATELAARGHHLIVTARRED VLTELAARLADKYRVTVDVRPADLADPQERSKLADELAARPISILCANAGTATFGPIA SLDLAGEKTQVQLNAVAVHDLTLAVLPGMIERKAGGILISGSAAGNSPIPYNATYAAT KAFVNTFSESLRGELRGSGVHVTVLAPGPVRTELPDASEASLVEKLVPDFLWISTEHT ARVSLNALERNKMRVVPGLTSKAMSVASQYAPRAIVAPIVGAFYKRLGGS" misc_feature 2825107..2825193 /locus_tag="Rv2509" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(2825488..2827089) /locus_tag="Rv2510c" /db_xref="GeneID:888613" CDS complement(2825488..2827089) /locus_tag="Rv2510c" /function="UNKNOWN" /note="Rv2510c, (MTCY07A7.16c), len: 533 aa. Hypothetical unknown protein, highly similar, but longer approximatively 20 aa, to others e.g. Q9ABY0|CC0090 HYPOTHETICAL PROTEIN from Caulobacter crescentus (516 aa), FASTA scores: opt: 1282, E(): 8.4e-63, (45.1% identity in 490 aa overlap); Q9A130|SPY0500 HYPOTHETICAL PROTEIN from Streptococcus pyogenes (500 aa), FASTA scores: opt: 1281, E(): 9.3e-63, (43.8% identity in 491 aa overlap); Q985L5|MLR7622 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (515 aa), FASTA scores: opt: 1259, E(): 1.5e-61, (44.1% identity in 510 aa overlap); P39342|YJGR_ECOLI|B4263 HYPOTHETICAL 54.3 KDA PROTEIN from Escherichia coli strain K12 (500 aa), FASTA scores: opt: 1257, E(): 1.9e-61, (42.7% identity in 501 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217026.1" /db_xref="GI:15609647" /db_xref="GeneID:888613" /translation="MGTESAAGGPGGPAQRIAAGYTVEGQALQLGTVVVDGEPDPSAQ IRIPLATVNRHGLVAGATGTGKTKTLQLIAEQLSAAGVAVLMADVKGDLSGLARPGEA ADKTAARAKDTGDDWVPTAFPVEFLSLGASGVGVPVRATISSFGPILLAKVLGLNATQ ESTLGLIFHWADQRGLPLLDLKDLRAVITHLTSDEGKVELKSLGAVSPTTAGVILRAL VNLEAEGADTFFGEPELRPEDLLRVDSQGRGIISLLEFGSQALRPAMFSTFLMWVLAD LFTFLPEVGDLDKPKLVFFFDEAHLLFTDASKAFLEQVEQTVKLIRSKGVGVFFCTQL PTDLPNDVLSQLGARIQHALRAFTPDDHKALRKTVRTYPKTDVYDLESALTSLGTGEA VVTVLSEKGAPTPVAWTRMRAPRSLMAAIGAEAIGAAAQASSLQAVYGQTIDRPSAHE ILSAKLAPAQEAPAQEAPAPRGQYDPLPWPDDFEVPPMPAPVEPQGPAVWEEILKNPT VKSVLNTTAREITRSIFGTGRRRRK" misc_feature complement(2826889..2826912) /locus_tag="Rv2510c" /note="PS00017 ATP/GTP-binding site motif A." gene 2827157..2827804 /gene="orn" /locus_tag="Rv2511" /db_xref="GeneID:888633" CDS 2827157..2827804 /gene="orn" /locus_tag="Rv2511" /EC_number="3.1.-.-" /function="INVOLVED IN RNA DEGRADATION: 3'-TO-5' EXORIBONUCLEASE SPECIFIC FOR SMALL OLIGORIBONUCLEOTIDES." /experiment="experimental evidence, no additional details recorded" /note="3'-5' exoribonuclease specific for small oligoribonuclotides" /codon_start=1 /transl_table=11 /product="oligoribonuclease" /protein_id="NP_217027.1" /db_xref="GI:15609648" /db_xref="GeneID:888633" /translation="MQDELVWIDCEMTGLDLGSDKLIEIAALVTDADLNILGDGVDVV MHADDAALSGMIDVVAEMHSRSGLIDEVKASTVDLATAEAMVLDYINEHVKQPKTAPL AGNSIATDRAFIARDMPTLDSFLHYRMIDVSSIKELCRRWYPRIYFGQPPKGLTHRAL ADIHESIRELRFYRRTAFVPQPGPSTSEIAAVVAELSDGAGAQEETDSAEAPQSG" gene 2827854..2827926 /locus_tag="Rvnt29" /note="tRNA-His(GTG)" /db_xref="GeneID:2700451" tRNA 2827854..2827926 /locus_tag="Rvnt29" /product="tRNA-His" /note="codon recognized: CAC" /anticodon=(pos:2827887..2827889,aa:His) /db_xref="GeneID:2700451" repeat_region complement(2828489..2829938) /note="IS1081-3, len: 1450 bp. Insertion sequence IS1081." /mobile_element="insertion sequence:IS1081-3" gene complement(2828556..2829803) /locus_tag="Rv2512c" /db_xref="GeneID:888515" CDS complement(2828556..2829803) /locus_tag="Rv2512c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1081." /note="Rv2512c, (MTCY07A7.18c), len: 415 aa. Transposase for IS1081, identical to P35882|TRA1_MYCBO transposase for insertion sequence element IS1081 from Mycobacterium bovis (415 aa), FASTA scores: opt: 2680, E(): 1.9e-162, (100.0% identity in 415 aa overlap). Also highly similar to others from Mycobacterium tuberculosis e.g. P96354|Rv1047|MTCY10G2.02c|Rv3115|MTCY164.25|Rv3023c|MTV0 12 .38c (415 aa), FASTA scores: opt: 2675, E(): 3.9e-162, (99.75% identity in 415 aa overlap). Contains PS00435 Peroxidases proximal heme-ligand signature, PS01007 Transposases, Mutator family, signature. BELONGS TO THE MUTATOR FAMILY OF TRANSPOSASE." /codon_start=1 /transl_table=11 /product="IS1081 transposase" /protein_id="NP_217028.1" /db_xref="GI:15609649" /db_xref="GeneID:888515" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" misc_feature complement(2829033..2829107) /locus_tag="Rv2512c" /note="PS01007 Transposases, Mutator family, signature" misc_feature complement(2829249..2829281) /locus_tag="Rv2512c" /note="PS00435 Peroxidases proximal heme-ligand signature" gene 2830161..2830583 /locus_tag="Rv2513" /db_xref="GeneID:887808" CDS 2830161..2830583 /locus_tag="Rv2513" /function="UNKNOWN" /note="Rv2513, (MTCY07A7.19), len: 140 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217029.1" /db_xref="GI:15609650" /db_xref="GeneID:887808" /translation="MDDIAAFKLDSLPDITFTVTRAISSGGENPAGFLNFAARREQPE ILGGGGRPGPVGPEAVDTPRIRGGKVPFVFRTLPGYTFYASQIEPRVGDPEGPTLLAG FGNIPETSQRSPGWIRITCTGPDDDEELEFFGFAGPES" gene complement(2830877..2831338) /locus_tag="Rv2514c" /db_xref="GeneID:887878" CDS complement(2830877..2831338) /locus_tag="Rv2514c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2514c, (MTCY07A7.20c), len: 153 aa. Conserved hypothetical protein, showing some similarity to Q9PG05|XF0497 HYPOTHETICAL PROTEIN from Xylella fastidiosa (155 aa), FASTA scores: opt: 215, E(): 1.4e-07, (30.6% identity in 160 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217030.1" /db_xref="GI:15609651" /db_xref="GeneID:887878" /translation="MLYSFDTSAILNGRRDLFRPAVFRSLWGRVEDAISAGQIRSVDE VQRELARRDDDAKRWADGQTGLFCPLDEQIQQAARHILRLHPNMVRQGGRRSAADPFV IALAMVNNATVVTQETASGNIEKPRIPDVCDALGVPWLTLMGYIEAQGWTF" gene complement(2831344..2832591) /locus_tag="Rv2515c" /db_xref="GeneID:887812" CDS complement(2831344..2832591) /locus_tag="Rv2515c" /function="UNKNOWN" /note="Rv2515c, (MTCY07A7.21c), len: 415 aa. Conserved hypothetical protein, showing some similarity to Q9PG06|XF0496 HYPOTHETICAL PROTEIN from Xylella fastidiosa (391 aa), FASTA scores: opt: 388, E(): 4.4e-18, (27.8% identity in 399 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217031.1" /db_xref="GI:15609652" /db_xref="GeneID:887812" /translation="MGIGHPMWVGWCIIIAMRSIPASVESSVLRWARESCGLTEVAAA RKLGLPDDRVAAWEVGEVVPTIAQLRKAAEVYKRSLAVFFLSEPPEGFDTLRDFRRLD GAASGQWTPGLHEEFRRAHTQRDFALELADAEDREIPGAWRLPLSGDEADADIAARIR KALIEVSPLPIPVASVDPYEHLNAWVSAIETSGVLVLATRGGKVAIDEMRGMCLYFDE LPVIVLNGSDHPRPRLFSLLHEFVHVVLHTEGLCDVIADAHPSTQDRSLEARCNAIAA AVLMPADVVRARPEVIVRSETPSSWDYESLRPVAAHFGVSAEAFLRRLSTLGIVPVEV YRQRRAEFIAAHEDEAERARSAGGGNWYRNTVRDLGKGYVRAVTDAHRRRVIDSNTAA IYLDAKVSQIPKLAESAELRSVV" misc_feature complement(2831857..2831886) /locus_tag="Rv2515c" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(2832710..2833513) /locus_tag="Rv2516c" /db_xref="GeneID:887186" CDS complement(2832710..2833513) /locus_tag="Rv2516c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2516c, (MTV009.01c), len: 267 aa. Hypothetical unknown protein. Contains probable helix-turn-helix motif at aa 98 to 119 (Score 1743, +5.12 SD). C-terminus extended since first submission (+ 18 aa). TBparse score is 0.964." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217032.2" /db_xref="GI:57116997" /db_xref="GeneID:887186" /translation="MTADWVVTFTFDADPSMETMDAWETQLEGFDALVSRVPGHGIDV TVYAPGDWSVFDALAKMAGEVMPVVQAKSPIAVQIISEPEHRLRAEAFTTPELMSAAE IADELGVSRQRVHQLRSTAGFPAPLADLRGGAVWDAAAVRRFAETWERKPGRPHTGTA KFAYSWAVGPAVGRSGKAPNVRWRVENPDKIRFVLRNIGDDIAEDVEIDLSRIDAITR NVPKKTVIRPGEGLNMVLIAAWGHPLPNQLYVRWAGQDEWAAVPLHPAH" gene complement(2833510..2833761) /locus_tag="Rv2517c" /db_xref="GeneID:887673" CDS complement(2833510..2833761) /locus_tag="Rv2517c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2517c, (MTV009.02c), len: 83 aa. Hypothetical unknown protein. Equivalent to AAK46899 from Mycobacterium tuberculosis strain CDC1551 (97 aa) but shorter 14 aa. Questionable orf." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217033.1" /db_xref="GI:15609654" /db_xref="GeneID:887673" /translation="MNSAIIKIAKWAQSQQWTVEDDASGYTRFYNPQGVYIARFPATP SNEYRRMRDLLGALKKAGLTWPPPSKKERRAQHRKEGAQ" gene complement(2834109..2835335) /gene="lppS" /locus_tag="Rv2518c" /db_xref="GeneID:888160" CDS complement(2834109..2835335) /gene="lppS" /locus_tag="Rv2518c" /function="UNKNOWN" /note="Rv2518c, (MTV009.03c), len: 408 aa. Probable lppS, conserved lipoprotein, highly similar to O07707|MLCL383.3 HYPOTHETICAL 43.6 KDA PROTEIN from Mycobacterium leprae (407 aa), FASTA scores: opt: 2300, E(): 1.2e-130, (82.5% identity in 406 aa overlap); Q9CCU5|LPPS|ML0426 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (404 aa), FASTA scores: opt: 2279, E(): 2.3e-129, (82.4% identity in 403 aa overlap); and Q9CB49|ML2446 POSSIBLE LIPOPROTEIN from Mycobacterium leprae (441 aa), FASTA scores: opt: 736, E(): 8.4e-37, (35.6% identity in 399 aa overlap). Also similar to other proteins from several organisms e.g. Q9X811|SC6G10.26c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (424 aa), FASTA scores: opt: 867, E(): 1.1e-44, (32.25% identity in 403 aa overlap); Q9L1E8|SC3D11.14 PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (416 aa), FASTA scores: opt: 737, E(): 7e-37, (32.95% identity in 413 aa overlap); Q9KYV1|SCE22.11 PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (407 aa), FASTA scores: opt: 721, E(): 6.2e-36, (33.5% identity in 400 aa overlap). And similar to several hypothetical mycobacterial proteins e.g. Q11149|Y483_MYCTU|Rv0483|MT0501|MTCY20G9.09 (451 aa), FASTA scores: opt: 763, E(): 2.1e-38, (34.85% identity in 402 aa overlap). Has very long signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="lipoprotein LppS" /protein_id="NP_217034.1" /db_xref="GI:15609655" /db_xref="GeneID:888160" /translation="MPKVGIAAQAGRTRVRRAWLTALMMTAVMIGAVACGSGRGPAPI KVIADKGTPFADLLVPKLTASVTDGAVGVTVDAPVSVTAADGVLAAVTMVNDNGRPVA GRLSPDGLRWSTTEQLGYNRRYTLNATALGLGGAATRQLTFQTSSPAHLTMPYVMPGD GEVVGVGEPVAIRFDENIADRGAAEKAIKITTNPPVEGAFYWLNNREVRWRPEHFWKP GTAVDVAVNTYGVDLGEGMFGEDNVQTHFTIGDEVIATADDNTKILTVRVNGEVVKSM PTSMGKDSTPTANGIYIVGSRYKHIIMDSSTYGVPVNSPNGYRTDVDWATQISYSGVF VHSAPWSVGAQGHTNTSHGCLNVSPSNAQWFYDHVKRGDIVEVVNTVGGTLPGIDGLG DWNIPWDQWRAGNAKA" misc_feature complement(2835231..2835263) /gene="lppS" /locus_tag="Rv2518c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2835494..2835566 /locus_tag="Rvnt30" /note="tRNA-Lys(CTT)" /db_xref="GeneID:2700436" tRNA 2835494..2835566 /locus_tag="Rvnt30" /product="tRNA-Lys" /note="codon recognized: AAG" /anticodon=(pos:2835527..2835529,aa:Lys) /db_xref="GeneID:2700436" gene 2835785..2837263 /gene="PE26" /locus_tag="Rv2519" /db_xref="GeneID:888172" CDS 2835785..2837263 /gene="PE26" /locus_tag="Rv2519" /function="UNKNOWN" /note="Rv2519, (MTV009.04), len: 492 aa. Member of the M. tuberculosis PE family (see citation below), highly similar to many e.g. Q50630|YP91_MYCTU|Rv2591|MT2668.1|MTCY227.10c (543 aa), FASTA scores: opt: 848, E(): 3e-30, (39.55% identity in 445 aa overlap). TBparse score is 0.91." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177888.1" /db_xref="GI:57116998" /db_xref="GeneID:888172" /translation="MSRLIVAPDWLASAAAEVQSIGSALSAANAAAAAPTTLLVAAAE DEVSAAAAALFANYGREYQTLSVRFASLDQQFAQALNSAAASYQTAEATGASLVQTAT QGVLGVINAPTEFMFGRSLIGDGADGTAASPIGEPGGILYGDGGNGYSQTTPGAVGGA GGSAGFIGNGGAGGAGGPGAGGGTGGLGGWLWGNNGAAGTGDPVNVAVPLRVENNFPL VNLLVNRGPTVPILLDTGSSSLVIPFWKIGWQNLGLPTGFDVVHYGNGVSIVYADVPT TVDFGGGAATTPTSVHVGILPYPRNLDSLVLIASGGAFGPNGNGILGIGPNVGSYAVS GPGNVVTTDLPGQLNEGTLIDIPGGYMQFGPNTGTPITSVTGAPITVLNVQIGGYDPN GGYWSLPSIFDSGGNHGTLPAVILGTGQTTGYAPPGTVISISIHDNQTLLYQYTTTAS NSPVVTADPRLNTGLTPFLLGPVYISNNPSGVGTVVFNYPPP" gene complement(2837388..2837615) /locus_tag="Rv2520c" /db_xref="GeneID:887804" CDS complement(2837388..2837615) /locus_tag="Rv2520c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2520c, (MTV009.05c), len: 75 aa. Possible conserved membrane protein, equivalent to O07706|MLCL383.32 HYPOTHETICAL 10.0 KDA PROTEIN from Mycobacterium leprae (91 aa), FASTA scores: opt: 290, E(): 4.1e-14, (58.65% identity in 75 aa overlap); and Q9CCU6|ML0425 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (75 aa), FASTA scores: opt: 286, E(): 6.6e-14, (57.35% identity in 75 aa overlap). TBparse score is 0.882." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217036.1" /db_xref="GI:15609657" /db_xref="GeneID:887804" /translation="MVDRDPNTIKQEIDQTRDQLAATIDSLAERANPRRLADDAKTRV IAFLRKPIVTVSLVGIGSVVVVVVIHKIRNR" gene 2837684..2838157 /gene="bcp" /locus_tag="Rv2521" /db_xref="GeneID:887694" CDS 2837684..2838157 /gene="bcp" /locus_tag="Rv2521" /function="PUTATIVE ANTIOXIDANT PROTEIN." /note="Rv2521, (MTV009.06), len: 157 aa. Probable bcp, bacterioferritin comigratory protein, equivalent to O07705|BCP|ML0424 from Mycobacterium leprae (161 aa), FASTA scores: opt: 829, E(): 6.8e-46, (79.6% identity in 157 aa overlap). Also highly similar to Q9KZQ2|SCE6.38 HYPOTHETICAL 16.8 KDA PROTEIN Streptomyces coelicolor (155 aa), FASTA scores: opt: 727, E(): 2e-39, (69.5% identity in 154 aa overlap); P23480|AAG57590|BCP_ECOLI|B2480|BAB36765|Z3739|ECS3342 BACTERIOFERRITIN COMIGRATORY PROTEIN from Escherichia coli strain K12 (156 aa), FASTA scores: opt: 513, E(): 8.3e-26, (48.3% identity in 149 aa overlap); Q9RW23|DR0846 BACTERIOFERRITIN COMIGRATORY PROTEIN from Deinococcus radiodurans (175 aa), FASTA scores: opt: 465, E(): 1e-22, (46.5% identity in 157 aa overlap); P44411|BCP_HAEIN|HI0254 BACTERIOFERRITIN COMIGRATORY PROTEIN from Haemophilus influenzae (155 aa), FASTA scores: opt: 453, E(): 5.3e-22, (47.5% identity in 139 aa overlap); etc. Also similar to Mycobacterium tuberculosis Rv1608c|MTV046.06|bcpB and Rv2238c|MTCY427.19c|hpE. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="bacterioferritin comigratory protein BCP" /protein_id="NP_217037.1" /db_xref="GI:15609658" /db_xref="GeneID:887694" /translation="MTKTTRLTPGDKAPAFTLPDADGNNVSLADYRGRRVIVYFYPAA STPGCTKQACDFRDNLGDFTTAGLNVVGISPDKPEKLATFRDAQGLTFPLLSDPDREV LTAWGAYGEKQMYGKTVQGVIRSTFVVDEDGKIVVAQYNVKATGHVAKLRRDLSV" gene complement(2838129..2839541) /locus_tag="Rv2522c" /db_xref="GeneID:887375" CDS complement(2838129..2839541) /locus_tag="Rv2522c" /function="UNKNOWN" /note="Rv2522c, (MTV009.07c), len: 470 aa. Conserved hypothetical protein, equivalent, but longer 20 aa, to Q9X7E4|ML1193|MLCB458.08 from HYPOTHETICAL 46.6 KDA PROTEIN Mycobacterium leprae (442 aa), FASTA scores: opt: 2521, E(): 4.1e-142, (86.35% identity in 440 aa overlap). Also similar to various proteins e.g. Q9K425|SCG22.20 PUTATIVE PEPTIDASE from Streptomyces coelicolor (451 aa), FASTA scores: opt: 1097, E(): 1.1e-57, (42.5% identity in 451 aa overlap); Q9FCK3|2SC3B6.09 PUTATIVE PEPTIDASE from Streptomyces coelicolor (470 aa), FASTA scores: opt: 669, E(): 2.8e-32, (34.2% identity in 462 aa overlap); Q98AF9|MLL6018 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (486 aa), FASTA scores: opt: 622, E(): 1.7e-29, (33.95% identity in 442 aa overlap); Q9RSU7|DR2025 ARGE/DAPE/ACY1 FAMILY PROTEIN from Deinococcus radiodurans (459 aa), FASTA scores: opt: 616, E(): 3.7e-29, (34.15% identity in 442 aa overlap); etc (include some similarity to hypothetical proteins from C. elegans and yeast). Alternative start possible at 6687 but then no RBS obvious. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217038.1" /db_xref="GI:15609659" /db_xref="GeneID:887375" /translation="MSASRRRIASKSGFSCDSASARELVERVREVLPSVRCDLEELVR IESVWADPDRRDEVHRSARAVADLLSQAGFDDVRIVSERGAPAVIARYPAPPGAPTVL LYAHHDVQPEGDRGQWVSPPFEPTERGGRLYGRGTADDKAGIATHVAAFWAHGGRPPV GVTVFVEGEEESGSPSLGRLLAAHRDALAADVIVIADSDNWSTDIPALTVSLRGMADC VVEVATLDHGLHSGLWGGVVPDALTVLVRLLASLHDDDGNVAVAGMHESTAARVDYPA GRVRAESGLLDGVSEIGTGSVPQRLWAKPAITVIGIDTTSVAAASNTLIPRARAKISI RVAPGGDATAHLDAVEAHLRRHAPWGAQVTVTRGEVGQPYAIEASGPVYDAARSAFRQ AWGADPIDMGMGGSIPFIAEFAAAFPQATILVTGVEDPGTQAHSVNESLHLGVLERAA TAEALLLAKLAAIPTGRAEA" gene complement(2839538..2839930) /gene="acpS" /locus_tag="Rv2523c" /db_xref="GeneID:888626" CDS complement(2839538..2839930) /gene="acpS" /locus_tag="Rv2523c" /EC_number="2.7.8.7" /function="BIOSYNTHESIS OF FATTY ACIDS AND LIPIDS. TRANSFERS THE 4'-PHOSPHOPANTETHEINE MOIETY FROM COENZYME A TO A SER OF ACYL-CARRIER PROTEIN. CATALYZES THE FORMATION OF HOLO-ACP, WHICH MEDIATES THE TRANSFER OF ACYL FATTY-ACID INTERMEDIATES DURING THE BIOSYNTHESIS OF FATTY ACIDS AND LIPIDS [CATALYTIC ACTIVITY: CoA + APO-[ACYL-CARRIER PROTEIN] = ADENOSINE 3',5'-BISPHOSPHATE + HOLO-[ACYL-CARRIER PROTEIN] ]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the formation of holo-ACP, which mediates the essential transfer of acyl fatty acid intermediates during the biosynthesis of fatty acids and lipids" /codon_start=1 /transl_table=11 /product="4'-phosphopantetheinyl transferase" /protein_id="NP_217039.1" /db_xref="GI:15609660" /db_xref="GeneID:888626" /translation="MGIVGVGIDLVSIPDFAEQVDQPGTVFAETFTPGERRDASDKSS SAARHLAARWAAKEAVIKAWSGSRFAQRPVLPEDIHRDIEVVTDMWGRPRVRLTGAIA EYLADVTIHVSLTHEGDTAAAVAILEAP" gene complement(2840123..2849332) /gene="fas" /locus_tag="Rv2524c" /db_xref="GeneID:887704" CDS complement(2840123..2849332) /gene="fas" /locus_tag="Rv2524c" /EC_number="2.3.1.-" /function="INVOLVED IN LIPID METABOLISM. FATTY ACID SYNTHETASE CATALYZES THE FORMATION OF LONG-CHAIN FATTY ACIDS FROM ACETYL-COA, MALONYL-CoA AND NADPH." /experiment="experimental evidence, no additional details recorded" /note="Rv2524c, (MTCY159.32, MTV009.09c), len: 3069 aa. Probable fas, Fatty Acid Synthase (EC 2.3.1.-), equivalent to Q9X7E2|FAS|ML1191 PUTATIVE TYPE I FATTY ACID SYNTHASE from Mycobacterium leprae (3076 aa), FASTA scores: opt: 17484, E(): 0, (85.8% identity in 3081 aa overlap). Also similar to others e.g. Q04846|FAS|Q59497 from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (3104 aa), FASTA scores: opt: 3981, E(): 5.5e-203, (49.8% identity in 3099 aa overlap); Q48926|FAS from Mycobacterium bovis (2796 aa), FASTA scores: opt: 2098, E(): 3.9e-103, (59.7% identity in 2862 aa overlap) (see Fernandes et al., 1996); P34731|FAS1_CANAL FATTY ACID SYNTHASE SUBUNIT BETA from Candida albicans (Yeast) (2037 aa), FASTA scores: opt: 955, E(): 1.3e-42, (27.4% identity in 1926 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00606 Beta-ketoacyl synthases active site." /codon_start=1 /transl_table=11 /product="fatty acid synthase" /protein_id="NP_217040.1" /db_xref="GI:15609661" /db_xref="GeneID:887704" /translation="MTIHEHDRVSADRGGDSPHTTHALVDRLMAGEPYAVAFGGQGSA WLETLEELVSATGIETELATLVGEAELLLDPVTDELIVVRPIGFEPLQWVRALAAEDP VPSDKHLTSAAVSVPGVLLTQIAATRALARQGMDLVATPPVAMAGHSQGVLAVEALKA GGARDVELFALAQLIGAAGTLVARRRGISVLGDRPPMVSVTNADPERIGRLLDEFAQD VRTVLPPVLSIRNGRRAVVITGTPEQLSRFELYCRQISEKEEADRKNKVRGGDVFSPV FEPVQVEVGFHTPRLSDGIDIVAGWAEKAGLDVALARELADAILIRKVDWVDEITRVH AAGARWILDLGPGDILTRLTAPVIRGLGIGIVPAATRGGQRNLFTVGATPEVARAWSS YAPTVVRLPDGRVKLSTKFTRLTGRSPILLAGMTPTTVDAKIVAAAANAGHWAELAGG GQVTEEIFGNRIEQMAGLLEPGRTYQFNALFLDPYLWKLQVGGKRLVQKARQSGAAID GVVISAGIPDLDEAVELIDELGDIGISHVVFKPGTIEQIRSVIRIATEVPTKPVIMHV EGGRAGGHHSWEDLDDLLLATYSELRSRANITVCVGGGIGTPRRAAEYLSGRWAQAYG FPLMPIDGILVGTAAMATKESTTSPSVKRMLVDTQGTDQWISAGKAQGGMASSRSQLG ADIHEIDNSASRCGRLLDEVAGDAEAVAERRDEIIAAMAKTAKPYFGDVADMTYLQWL RRYVELAIGEGNSTADTASVGSPWLADTWRDRFEQMLQRAEARLHPQDFGPIQTLFTD AGLLDNPQQAIAALLARYPDAETVQLHPADVPFFVTLCKTLGKPVNFVPVIDQDVRRW WRSDSLWQAHDARYDADAVCIIPGTASVAGITRMDEPVGELLDRFEQAAIDEVLGAGV EPKDVASRRLGRADVAGPLAVVLDAPDVRWAGRTVTNPVHRIADPAEWQVHDGPENPR ATHSSTGARLQTHGDDVALSVPVSGTWVDIRFTLPANTVDGGTPVIATEDATSAMRTV LAIAAGVDSPEFLPAVANGTATLTVDWHPERVADHTGVTATFGEPLAPSLTNVPDALV GPCWPAVFAAIGSAVTDTGEPVVEGLLSLVHLDHAARVVGQLPTVPAQLTVTATAANA TDTDMGRVVPVSVVVTGADGAVIATLEERFAILGRTGSAELADPARAGGAVSANATDT PRRRRRDVTITAPVDMRPFAVVSGDHNPIHTDRAAALLAGLESPIVHGMWLSAAAQHA VTATDGQARPPARLVGWTARFLGMVRPGDEVDFRVERVGIDQGAEIVDVAARVGSDLV MSASARLAAPKTVYAFPGQGIQHKGMGMEVRARSKAARKVWDTADKFTRDTLGFSVLH VVRDNPTSIIASGVHYHHPDGVLYLTQFTQVAMATVAAAQVAEMREQGAFVEGAIACG HSVGEYTALACVTGIYQLEALLEMVFHRGSKMHDIVPRDELGRSNYRLAAIRPSQIDL DDADVPAFVAGIAESTGEFLEIVNFNLRGSQYAIAGTVRGLEALEAEVERRRELTGGR RSFILVPGIDVPFHSRVLRVGVAEFRRSLDRVMPRDADPDLIIGRYIPNLVPRLFTLD RDFIQEIRDLVPAEPLDEILADYDTWLRERPREMARTVFIELLAWQFASPVRWIETQD LLFIEEAAGGLGVERFVEIGVKSSPTVAGLATNTLKLPEYAHSTVEVLNAERDAAVLF ATDTDPEPEPEEDEPVAESPAPDVVSEAAPVAPAASSAGPRPDDLVFDAADATLALIA LSAKMRIDQIEELDSIESITDGASSRRNQLLVDLGSELNLGAIDGAAESDLAGLRSQV TKLARTYKPYGPVLSDAINDQLRTVLGPSGKRPGAIAERVKKTWELGEGWAKHVTVEV ALGTREGSSVRGGAMGHLHEGALADAASVDKVIDAAVASVAARQGVSVALPSAGSGGG ATIDAAALSEFTDQITGREGVLASAARLVLGQLGLDDPVNALPAAPDSELIDLVTAEL GADWPRLVAPVFDPKKAVVFDDRWASAREDLVKLWLTDEGDIDADWPRLAERFEGAGH VVATQATWWQGKSLAAGRQIHASLYGRIAAGAENPEPGRYGGEVAVVTGASKGSIAAS VVARLLDGGATVIATTSKLDEERLAFYRTLYRDHARYGAALWLVAANMASYSDVDALV EWIGTEQTESLGPQSIHIKDAQTPTLLFPFAAPRVVGDLSEAGSRAEMEMKVLLWAVQ RLIGGLSTIGAERDIASRLHVVLPGSPNRGMFGGDGAYGEAKSALDAVVSRWHAESSW AARVSLAHALIGWTRGTGLMGHNDAIVAAVEEAGVTTYSTDEMAALLLDLCDAESKVA AARSPIKADLTGGLAEANLDMAELAAKAREQMSAAAAVDEDAEAPGAIAALPSPPRGF TPAPPPQWDDLDVDPADLVVIVGGAEIGPYGSSRTRFEMEVENELSAAGVLELAWTTG LIRWEDDPQPGWYDTESGEMVDESELVQRYHDAVVQRVGIREFVDDGAIDPDHASPLL VSVFLEKDFAFVVSSEADARAFVEFDPEHTVIRPVPDSTDWQVIRKAGTEIRVPRKTK LSRVVGGQIPTGFDPTVWGISADMAGSIDRLAVWNMVATVDAFLSSGFSPAEVMRYVH PSLVANTQGTGMGGGTSMQTMYHGNLLGRNKPNDIFQEVLPNIIAAHVVQSYVGSYGA MIHPVAACATAAVSVEEGVDKIRLGKAQLVVAGGLDDLTLEGIIGFGDMAATADTSMM CGRGIHDSKFSRPNDRRRLGFVEAQGGGTILLARGDLALRMGLPVLAVVAFAQSFGDG VHTSIPAPGLGALGAGRGGKDSPLARALAKLGVAADDVAVISKHDTSTLANDPNETEL HERLADALGRSEGAPLFVVSQKSLTGHAKGGAAVFQMMGLCQILRDGVIPPNRSLDCV DDELAGSAHFVWVRDTLRLGGKFPLKAGMLTSLGFGHVSGLVALVHPQAFIASLDPAQ RADYQRRADARLLAGQRRLASAIAGGAPMYQRPGDRRFDHHAPERPQEASMLLNPAAR LGDGEAYIG" misc_feature complement(2841152..2841202) /gene="fas" /locus_tag="Rv2524c" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature complement(2843072..2843095) /gene="fas" /locus_tag="Rv2524c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2849852..2850574) /locus_tag="Rv2525c" /db_xref="GeneID:888612" CDS complement(2849852..2850574) /locus_tag="Rv2525c" /function="UNKNOWN" /note="Rv2525c, (MTCY159.31), len: 240 aa. Conserved hypothetical protein, equivalent to Q9X7E1|ML1190|MLCB458.05 HYPOTHETICAL 25.3 KDA PROTEIN from Mycobacterium leprae (239 aa), FASTA scores: opt: 1358, E(): 1e-75, (82.15% identity in 241 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217041.1" /db_xref="GI:15609662" /db_xref="GeneID:888612" /translation="MSVSRRDVLKFAAATPGVLGLGVVASSLRAAPASAGSLGTLLDY AAGVIPASQIRAAGAVGAIRYVSDRRPGGAWMLGKPIQLSEARDLSGNGLKIVSCYQY GKGSTADWLGGASAGVQHARRGSELHAAAGGPTSAPIYASIDDNPSYEQYKNQIVPYL RSWESVIGHQRTGVYANSKTIDWAVNDGLGSYFWQHNWGSPKGYTHPAAHLHQVEIDK RKVGGVGVDVNQILKPQFGQWA" gene 2851091..2851318 /locus_tag="Rv2526" /db_xref="GeneID:888254" CDS 2851091..2851318 /locus_tag="Rv2526" /function="UNKNOWN" /note="Rv2526, (MTCY159.30c), len: 75 aa. Hypothetical unknown protein. TBparse score is 0.877." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217042.1" /db_xref="GI:15609663" /db_xref="GeneID:888254" /translation="MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQ AAARRRRIVDHLAHAGTHVDADVLLSEQAWR" gene 2851315..2851716 /locus_tag="Rv2527" /db_xref="GeneID:887266" CDS 2851315..2851716 /locus_tag="Rv2527" /function="UNKNOWN" /note="Rv2527, (MTCY159.29c), len: 133 aa. Hypothetical protein, showing some similarity to hypothetical proteins from Mycobacterium tuberculosis e.g. P95007|MTCY159.10c|Rv2546 (137 aa), FASTA scores: opt: 206, E(): 1.4e-07, (38.0% identity in 100 aa overlap); O33299|MTV002.22c|Rv2757c (138 aa), FASTA scores: opt: 201, E(): 3.1e-07, (35.7% identity in 126 aa overlap); and P96411|MTCY08D5.24c|Rv0229c (226 aa), FASTA scores: opt: 153, E(): 0.0011, (32.8% identity in 128 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217043.1" /db_xref="GI:15609664" /db_xref="GeneID:887266" /translation="MTTWILDKSAHVRLVAGATPPAGIDLTDLAICDIGELEWLYSAR SATDYDSQQTSLRAYQILRAPSDIFDRVRHLQRDLAHHRGMWHRTPLPDLFIAETALH HRAGVLHHDRDYKRIAVVRPGFQACELSRGR" gene complement(2851751..2852671) /gene="mrr" /locus_tag="Rv2528c" /db_xref="GeneID:887150" CDS complement(2851751..2852671) /gene="mrr" /locus_tag="Rv2528c" /function="INVOLVED IN THE ACCEPTANCE OF FOREIGN DNA WHICH IS MODIFIED. RESTRICTS BOTH ADENINE- AND CYTOSINE-METHYLATED DNA." /note="Rv2528c, (MTCY159.28), len: 306 aa. Probable mrr, restriction system protein, similar to other mrr proteins e.g. Q9RWS8|DR0587|MRR from Deinococcus radiodurans (306 aa), FASTA scores: opt: 776, E(): 4.2e-40, (40.45% identity in 309 aa overlap); P24202|MRR_ECOLI|B4351 from Escherichia coli strain K12 (304 aa), FASTA scores: opt: 647, E(): 2.9e-32, (35.25% identity in 309 aa overlap); Q9RX07|DR0508 from Deinococcus radiodurans (336 aa), FASTA scores: opt: 456, E(): 1.3e-20, (37.3% identity in 319 aa overlap); etc." /codon_start=1 /transl_table=11 /product="restriction system protein mrr" /protein_id="NP_217044.1" /db_xref="GI:15609665" /db_xref="GeneID:887150" /translation="MTIPDAQTLMRPILAYLADGQAKSAKDVIAAMSDEFGLSDDERA QMLPSGRQRTMYDRVHWSLTHMSQAGLLDRPTRGHVQVTDTGRQVLKAHPERVDMAVL REFPSYIAFRERTKAKQPVDATAKRPSGDDVQVSPEDLIDAALAENRAAVEGEILKKA LTLSPTGFEDLVIRLLEAMGYGRAGAVERTSASGDAGIDGIISQDPLGLDRIYVQAKR YAVDQTIGRPKIHEFAGALLGKQGDRGVYITTSSFSRGAREEAERINARIELIDGARL AELLVRYRVGVQAVQTVELLRLDEDFFDGL" gene 2852875..2854266 /locus_tag="Rv2529" /db_xref="GeneID:887153" CDS 2852875..2854266 /locus_tag="Rv2529" /function="UNKNOWN" /note="Rv2529, (MTCY159.27c), len: 463 aa. Hypothetical unknown protein. Note that C-terminal part is similar to short region of Q53609|MTS1_STRAL|SALIM MODIFICATION METHYLASE SALI from Streptomyces albus G (587 aa), FASTA scores: opt: 170, E(): 0.016, (59.45% identity in 37 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217045.1" /db_xref="GI:15609666" /db_xref="GeneID:887153" /translation="MHLAHRVASSRDTPSSSATPNAVSGSASNAADRPCLVRPPTAPP WAHGPRLRRDPTGGGSTPSIVLSRSTDRSKDGHRIVPAGARKSGVRASTGRLPSTRKT TRSPDCRPSASRTAFGTVTCPFDVTMGSSECLLHRCRTPPVPSHSVELLVAANPAEDS RLPYLIRLPVGAGLVFATSDVWPRTKALYCHRLDIADWPADPVVVDRVELRSCSRRGA AIDVVAARARENRSQLVHTMARGRQVVFWQSPKTRKQSRPGVRTPTARAAGIPELHIV VDAHERYPYTFADKPAKTTREALPCGDYGLKVAGQLVAAVERKALADLTSGVLNGNLK YQLTELAALPRAAVVVEDRYSEIFAHSFARPTAIADGLAELQIGFPNVPIVFCQTRKL AQEYTYRYLAAALTWFVDDADATTVFEPAAAEPEPSSAELRAWAKSVGLPVSDRGRLR PQILQAWRAAHPR" gene complement(2854267..2854686) /locus_tag="Rv2530c" /db_xref="GeneID:887192" CDS complement(2854267..2854686) /locus_tag="Rv2530c" /function="UNKNOWN" /note="Rv2530c, (MTCY159.26), len: 139 aa. Conserved hypothetical protein, highly similar to two HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis (strains H37Rv and CDC1551): O53219|Rv2494|MTV008.50 (141 aa), FASTA scores: opt: 380, E(): 3.6e-19, (48.0% identity in 125 aa overlap); and O53372|Rv3320c|MTV016.20c (142 aa), FASTA scores: opt: 286, E(): 9.3e-13, (41.35% identity in 133 aa overlap); and similar to others e.g. O07760|Rv0617|MTCY19H5.04c (133 aa), FASTA scores: opt: 158, E(): 0.00048, (39.55% identity in 129 aa overlap). Also some similarity with CAC48798|SMB20412 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (54 aa), FASTA scores: opt: 184, E(): 3.7e-06, (53.85% identity in 52 aa overlap); and CAC48797|SMB20411 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (82 aa), FASTA scores: opt: 170, E(): 4.8e-05, (44.45% identity in 63 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217046.1" /db_xref="GI:15609667" /db_xref="GeneID:887192" /translation="MTALLDVNVLIALGWPNHVHHAAAQRWFTQFSSNGWATTPITEA GYVRISSNRSVMQVSTTPAIAIAQLAAMTSLAGHTFWPDDVPLIVGSAGDRDAVSNHR RVTDCHLIALAARYGGRLVTFDAALADSASAGLVEVL" gene complement(2854683..2854907) /locus_tag="Rv2530A" /db_xref="GeneID:3205085" CDS complement(2854683..2854907) /locus_tag="Rv2530A" /function="UNKNOWN" /note="Rv2530A, len: 74 aa. Conserved hypothetical protein, similar to Q9CCR7|ML0525 HYPOTHETICAL PROTEIN from Mycobacterium leprae (58 aa), FASTA scores: opt: 179, E(): 1.8e-06, (63.65% identity in 44 aa overlap). Highly similar to O53218|Rv2493 from Mycobacterium tuberculosis (73 aa), FASTA scores: opt: 240, E(): 5.7e-11, (56.75% identity in 74 aa overlap); and Q92WE1|RB0399|SMB20413 HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti)p lasmid pSymB (megaplasmid 2) (75 aa), FASTA scores: opt: 226, E(): 6.5e-10, (56.00% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177672.1" /db_xref="GI:57116999" /db_xref="GeneID:3205085" /translation="MRTTLQIDDDVLEDARSIARSEGKSVGAVISELARRSLRPVGIV EVDGFPVFDVPPDAPTVTSEDVVRALEDDV" gene complement(2854938..2857781) /locus_tag="Rv2531c" /db_xref="GeneID:887216" CDS complement(2854938..2857781) /locus_tag="Rv2531c" /EC_number="4.1.1.-" /function="UNKNOWN. COULD BE AN ORNITHINE/ARGININE/LYSINE DECARBOXYLASE INVOLVED IN THE BIOSYNTHESIS OF SPERMIDINE FROM ARGININE." /note="Rv2531c, (MTCY159.25), len: 947 aa. Probable amino acid decarboxylase (EC 4.1.1.-), equivalent to Q9CCR8|ADI|ML0524 PUTATIVE AMINO ACID DECARBOXYLASE from Mycobacterium leprae (950 aa), FASTA scores: opt: 5426, E(): 0, (86.45% identity in 951 aa overlap). Also similar to other amino acid decarboxylases (but longer in N-terminus) e.g. Q9I2S7|PA1818 PROBABLE ORN/ARG/LYS AMINO ACID DECARBOXYLASE from Pseudomonas aeruginosa (751 aa), FASTA scores: opt: 434, E(): 2.5e-19, (29.15% identity in 738 aa overlap); Q9CML3|SPEF|PM0806 ORNITHINE DECARBOXYLASE from Pasteurella multocida (720 aa), FASTA scores: opt: 402, E(): 2.4e-17, (24.85% identity in 752 aa overlap); P21169|DCOR_ECOLI|SPEC|B2965|BAB37264|ECS3841|AAG58096 ORNITHINE DECARBOXYLASE ISOZYME (CONSTITUTIVE ENZYME) from Escherichia coli strain K12 (711 aa), FASTA scores: opt: 396, E(): 5.6e-17, (28.0% identity in 646 aa overlap); P44317|DCOR_HAEIN|SPEF|HI0591 ORNITHINE DECARBOXYLASE from Haemophilus influenzae (720 aa), FASTA scores: opt: 393, E(): 8.8e-17, (25.05% identity in 743 aa overlap) ; etc. SEEMS TO BELONG TO FAMILY 1 OF ORNITHINE, LYSINE, AND ARGININE DECARBOXYLASES. Note that previously known as adi.; adi" /codon_start=1 /transl_table=11 /product="amino acid decarboxylase" /protein_id="YP_177889.1" /db_xref="GI:57117000" /db_xref="GeneID:887216" /translation="MNPNSVRPRRLHVSALAAVANPSYTRLDTWNLLDDACRHLAEVD LAGLDTTHDVARAKRLMDRIGAYERYWLYPGAQNLATFRAHLDSHSTVRLTEEVSLAV RLLSEYGDRTALFDTSASLAEQELVAQAKQQQFYTVLLADDSPATAPDSLAECLRQLR NPADEVQFELLVVASIEDAITAVALNGEIQAAIIRHDLPLRSRDRVPLMTTLLGTDGD EAVANETHDWVECAEWIRELRPHIDLYLLTDESIAAETQDEPDVYDRTFYRLNDVTDL HSTVLAGLRNRYATPFFDALRAYAAAPVGQFHALPVARGASIFNSKSLHDMGEFYGRN IFMAETSTTSGGLDSLLDPHGNIKTAMDKAAVTWNANQTYFVTNGTSTANKIVVQALT RPGDIVLIDRNCHKSHHYGLVLAGAYPMYLDAYPLPQYAIYGAVPLRTIKQALLDLEA AGQLHRVRMLLLTNCTFDGVVYNPRRVMEEVLAIKPDICFLWDEAWYAFATAVPWARQ RTAMIAAERLEQMLSTAEYAEEYRNWCASMDGVDRSEWVDHRLLPDPNRARVRVYATH STHKSLSALRQASMIHVRDQDFKALTRDAFGEAFLTHTSTSPNQQLLASLDLARRQVD IEGFELVRHVYNMALVFRHRVRKDRLISKWFRILDESDLVPDAFRSSTVSSYRQVRQG ALADWNEAWRSDQFVLDPTRLTLFIGATGMNGYDFREKILMERFGIQINKTSINSVLL IFTIGVTWSSVHYLLDVLRRVAIDLDRSQKAASGADLALHRRHVEEITQDLPHLPDFS EFDLAFRPDDASSFGDMRSAFYAGYEEADREYVQIGLAGRRLAEGKTLVSTTFVVPYP PGFPVLVPGQLVSKEIIYFLAQLDVKEIHGYNPDLGLSVFTQAALARMEAARNAVATV GAALPAFEVPRDASALNGTVNGDSVLQGVAEDA" gene complement(2857853..2858254) /locus_tag="Rv2532c" /db_xref="GeneID:887152" CDS complement(2857853..2858254) /locus_tag="Rv2532c" /function="UNKNOWN" /note="Rv2532c, (MTCY159.24), len: 133 aa. Hypothetical unknown protein, equivalent to AAK46918 from Mycobacterium tuberculosis strain CDC1551 but shorter 157 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217048.1" /db_xref="GI:15609669" /db_xref="GeneID:887152" /translation="MTRLELRVVVAAVLAATVVLGAVVCAAYGLTIVASAMSIYALGV GAWLYHAIERLILARRISTVRTAAKPLQPLLPVMAAIMGLTQAVVRSLGDVTDLPARR RELSQLPVLRWVDNSGNRANRRIADSDDLAD" gene complement(2858254..2858724) /gene="nusB" /locus_tag="Rv2533c" /db_xref="GeneID:887359" CDS complement(2858254..2858724) /gene="nusB" /locus_tag="Rv2533c" /function="INVOLVED IN THE TRANSCRIPTION TERMINATION PROCESS. INTERACTS WITH RPSJ|NUSE|Rv0700." /experiment="experimental evidence, no additional details recorded" /note="Regulates rRNA biosynthesis by transcriptional antitermination" /codon_start=1 /transl_table=11 /product="transcription antitermination protein NusB" /protein_id="NP_217049.1" /db_xref="GI:15609670" /db_xref="GeneID:887359" /translation="MSDRKPVRGRHQARKRAVALLFEAEVRGISAAEVVDTRAALAEA KPDIARLHPYTAAVARGVSEHAAHIDDLITAHLRGWTLDRLPAVDRAILRVSVWELLH AADVPEPVVVDEAVQLAKELSTDDSPGFVNGVLGQVMLVTPQLRAAAQAVRGGA" gene complement(2858727..2859290) /gene="efp" /locus_tag="Rv2534c" /db_xref="GeneID:888437" CDS complement(2858727..2859290) /gene="efp" /locus_tag="Rv2534c" /function="INVOLVED IN PEPTIDE BOND SYNTHESIS. STIMULATE EFFICIENT TRANSLATION AND PEPTIDE-BOND SYNTHESIS ON NATIVE OR RECONSTITUTED 70S RIBOSOMES IN VITRO. PROBABLY FUNCTIONS INDIRECTLY BY ALTERING THE AFFINITY OF THE RIBOSOME FOR AMINOACYL-TRNA, THUS INCREASING THEIR REACTIVITY AS ACCEPTORS FOR PEPTIDYL TRANSFERASE." /experiment="experimental evidence, no additional details recorded" /note="Involved in peptide bond synthesis; alters the affinity of the ribosome for aminoacyl-tRNA" /codon_start=1 /transl_table=11 /product="elongation factor P" /protein_id="NP_217050.1" /db_xref="GI:15609671" /db_xref="GeneID:888437" /translation="MATTADFKNGLVLVIDGQLWTITEFQHVKPGKGPAFVRTKLKNV LSGKVVDKTFNAGVKVDTATVDRRDTTYLYRDGSDFVFMDSQDYEQHPLPEALVGDAA RFLLEGMPVQVAFHNGVPLYIELPVTVELEVTHTEPGLQGDRSSAGTKPATLQTGAQI NVPLFINTGDKLKVDSRDGSYLGRVNA" gene complement(2859300..2860418) /gene="pepQ" /locus_tag="Rv2535c" /db_xref="GeneID:888409" CDS complement(2859300..2860418) /gene="pepQ" /locus_tag="Rv2535c" /EC_number="3.4.-.-" /function="UNKNOWN; HYDROLYSES PEPTIDES." /note="Rv2535c, (MTCY159.21), len: 372 aa. Probable pepQ, cytoplasmic peptidase (EC 3.4.-.-), equivalent to Q9CCS1|PEPQ|ML0521 PUTATIVE CYTOPLASMIC PEPTIDASE from Mycobacterium leprae (376 aa), FASTA scores: opt: 1954, E(): 1.1e-105, (82.7% identity in 376 aa overlap). Also similar to other peptidases e.g. P54518|YQHT_BACSU PUTATIVE PEPTIDASE (BELONGS TO PEPTIDASE FAMILY M24B) from Bacillus subtilis (353 aa), FASTA scores: opt: 808, E(): 1.6e-39, (39.65% identity in 368 aa overlap); Q9KXQ8|SC9C5.16c PUTATIVE PEPTIDASE from Streptomyces coelicolor (368 aa), FASTA scores: opt: 803, E(): 3.2e-39, (43.15% identity in 380 aa overlap); Q9K950|BH2800 XAA-PRO DIPEPTIDASE from Bacillus halodurans (355 aa), FASTA scores: opt: 801, E(): 4.1e-39, (39.45% identity in 365 aa overlap); etc. Note that second part of protein is similar to second part of MTCY49.29c|Rv2089c|MT2150|MTCY49.29c PROBABLE DIPEPTIDASE (EC 3.4.13.-; BELONGS TO PEPTIDASE FAMILY M24B) from Mycobacterium tuberculosis (375 aa) (33.9% identity in 354 aa overlap) BLAST RESULTS: Score: 142 bits (359), E: 4e-33, Identities: 86/224 (38%), Positives: 119/224 (52%), Gaps: 4/224 (1%). COULD BE BELONG TO PEPTIDASE FAMILY M24B." /codon_start=1 /transl_table=11 /product="cytoplasmic peptidase PepQ" /protein_id="NP_217051.1" /db_xref="GI:15609672" /db_xref="GeneID:888409" /translation="MTHSQRRDKLKAQIAASGLDAMLISDLINVRYLSGFSGSNGALL VFADERDAVLATDGRYRTQAASQAPDLEVAIERAVGRYLAGRAGEAGVGKLGFESHVV TVDGLDALAGALEGKNTELVRASGTVESLREVKDAGELALLRLACEAADAALTDLVAR GGLRPGRTERQVSRELEALMLDHGADAVSFETIVAAGANSAIPHHRPTDAVLQVGDFV KIDFGALVAGYHSDMTRTFVLGKAADWQLEIYQLVAEAQQAGRQALLPGAELRGVDAA ARQLIADAGYGEHFGHGLGHGVGLQIHEAPGIGVTSAGTLLAGSVVTVEPGVYLPGRG GVRIEDTLVVAGGTPKMPETAGQTPELLTRFPKELAIL" gene 2860452..2861144 /locus_tag="Rv2536" /db_xref="GeneID:888386" CDS 2860452..2861144 /locus_tag="Rv2536" /function="UNKNOWN" /note="Rv2536, (MTCY159.20c), len: 230 aa. Probable conserved transmembrane protein, equivalent to Q9CCS2|ML0520 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (202 aa), FASTA scores: opt: 812, E(): 2e-41, (63.2% identity in 201 aa overlap). Also similar in part to Q9HMD5|VNG2594c from Halobacterium sp. strain NRC-1 (117 aa), FASTA scores: opt: 33.6, E(): 1.8, (33.6% identity in 116 aa overlap); and perhaps AAK65752|SMA1996 PUTATIVE ABC TRANSPORTER PERMEASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (323 aa), FASTA scores: opt: 117, E(): 6.1, (30.6% identity in 121 aa overlap). TBparse score is 0.876." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217052.1" /db_xref="GI:15609673" /db_xref="GeneID:888386" /translation="MTNWMLRGLAFAAAMVVLRLFQGALINAWQMLSGLISLVLLLLF AIGGVVWGVMDGRADAKASPDPDRRQDLAMTWLLAGLVAGALSGAVAWLISLFYKAIY TGGPINELTTFAAFTALIVFLVGIVGVAVGRWLVDRQLAKAPVRHHGLAAEHERAADT DVFSAVRADDSPTGEMQVAQPEAQTAAVATVEREAPTEVIRTTESDTPTEVIRTDTEA DQTKPGDEPKKD" gene complement(2861148..2861591) /gene="aroD" /locus_tag="Rv2537c" /db_xref="GeneID:888397" CDS complement(2861148..2861591) /gene="aroD" /locus_tag="Rv2537c" /EC_number="4.2.1.10" /function="INVOLVED AT THE THIRD STEP IN THE BIOSYNTHESIS OF CHORISMATE WITHIN THE BIOSYNTHESIS OF AROMATIC AMINO ACIDS (THE SHIKIMATE PATHWAY). CATALYZE A TRANS-DEHYDRATION VIA AN ENOLATE INTERMEDIATE [CATALYTIC ACTIVITY: 3-DEHYDROQUINATE = 3-DEHYDROSHIKIMATE + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 3-dehydroshikimate from 3-dehydroquinate in chorismate biosynthesis" /codon_start=1 /transl_table=11 /product="3-dehydroquinate dehydratase" /protein_id="NP_217053.1" /db_xref="GI:15609674" /db_xref="GeneID:888397" /translation="MSELIVNVINGPNLGRLGRREPAVYGGTTHDELVALIEREAAEL GLKAVVRQSDSEAQLLDWIHQAADAAEPVILNAGGLTHTSVALRDACAELSAPLIEVH ISNVHAREEFRRHSYLSPIATGVIVGLGIQGYLLALRYLAEHVGT" misc_feature complement(2861514..2861567) /gene="aroD" /locus_tag="Rv2537c" /note="PS01029 Dehydroquinase class II signature" gene complement(2861588..2862676) /gene="aroB" /locus_tag="Rv2538c" /db_xref="GeneID:888392" CDS complement(2861588..2862676) /gene="aroB" /locus_tag="Rv2538c" /EC_number="4.2.3.4" /function="INVOLVED AT THE SECOND STEP IN THE BIOSYNTHESIS OF CHORISMATE WITHIN THE BIOSYNTHESIS OF AROMATIC AMINO ACIDS (THE SHIKIMATE PATHWAY) [CATALYTIC ACTIVITY: 7-PHOSPHO-3-DEOXY-ARABINO-HEPTULOSONATE = 3-DEHYDROQUINATE + ORTHOPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 3-dehydroquinate from 3-deoxy-arabino-heptulonate 7-phosphate; functions in aromatic amino acid biosynthesis" /codon_start=1 /transl_table=11 /product="3-dehydroquinate synthase" /protein_id="NP_217054.1" /db_xref="GI:15609675" /db_xref="GeneID:888392" /translation="MTDIGAPVTVQVAVDPPYPVVIGTGLLDELEDLLADRHKVAVVH QPGLAETAEEIRKRLAGKGVDAHRIEIPDAEAGKDLPVVGFIWEVLGRIGIGRKDALV SLGGGAATDVAGFAAATWLRGVSIVHLPTTLLGMVDAAVGGKTGINTDAGKNLVGAFH QPLAVLVDLATLQTLPRDEMICGMAEVVKAGFIADPVILDLIEADPQAALDPAGDVLP ELIRRAITVKAEVVAADEKESELREILNYGHTLGHAIERRERYRWRHGAAVSVGLVFA AELARLAGRLDDATAQRHRTILSSLGLPVSYDPDALPQLLEIMAGDKKTRAGVLRFVV LDGLAKPGRMVGPDPGLLVTAYAGVCAP" gene complement(2862673..2863203) /gene="aroK" /locus_tag="Rv2539c" /db_xref="GeneID:887434" CDS complement(2862673..2863203) /gene="aroK" /locus_tag="Rv2539c" /EC_number="2.7.1.71" /function="INVOLVED AT THE FIFTH STEP IN THE BIOSYNTHESIS OF CHORISMATE WITHIN THE BIOSYNTHESIS OF AROMATIC AMINO ACIDS (THE SHIKIMATE PATHWAY) [CATALYTIC ACTIVITY: ATP + SHIKIMATE = ADP + SHIKIMATE 3-PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of shikimate 3-phosphate from shikimate in aromatic amino acid biosynthesis" /codon_start=1 /transl_table=11 /product="shikimate kinase" /protein_id="NP_217055.1" /db_xref="GI:15609676" /db_xref="GeneID:887434" /translation="MAPKAVLVGLPGSGKSTIGRRLAKALGVGLLDTDVAIEQRTGRS IADIFATDGEQEFRRIEEDVVRAALADHDGVLSLGGGAVTSPGVRAALAGHTVVYLEI SAAEGVRRTGGNTVRPLLAGPDRAEKYRALMAKRAPLYRRVATMRVDTNRRNPGAVVR HILSRLQVPSPSEAAT" misc_feature complement(2862955..2863032) /gene="aroK" /locus_tag="Rv2539c" /note="PS01128 Shikimate kinase signature" misc_feature complement(2863156..2863179) /gene="aroK" /locus_tag="Rv2539c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2863207..2864412) /gene="aroF" /locus_tag="Rv2540c" /db_xref="GeneID:887379" CDS complement(2863207..2864412) /gene="aroF" /locus_tag="Rv2540c" /EC_number="4.2.3.5" /function="INVOLVED AT THE SEVENTH STEP IN THE BIOSYNTHESIS OF CHORISMATE WITHIN THE BIOSYNTHESIS OF AROMATIC AMINO ACIDS (THE SHIKIMATE PATHWAY) [CATALYTIC ACTIVITY: 5-O-(1-CARBOXYVINYL)-3-PHOSPHOSHIKIMATE = CHORISMATE + ORTHOPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of chorismate from 5-O-(1-carboxyvinyl)-3-phosphoshikimate in aromatic amino acid biosynthesis" /codon_start=1 /transl_table=11 /product="chorismate synthase" /protein_id="NP_217056.1" /db_xref="GI:15609677" /db_xref="GeneID:887379" /translation="MLRWITAGESHGRALVAVVEGMVAGVHVTSADIADQLARRRLGY GRGARMTFERDAVTVLSGIRHGSTLGGPIAIEIGNTEWPKWETVMAADPVDPAELADV ARNAPLTRPRPGHADYAGMLKYGFDDARPVLERASARETAARVAAGTVARAFLRQALG VEVLSHVISIGASAPYEGPPPRAEDLPAIDASPVRAYDKAAEADMIAQIEAAKKDGDT LGGVVEAVALGLPVGLGSFTSGDHRLDSQLAAAVMGIQAIKGVEIGDGFQTARRRGSR AHDEMYPGPDGVVRSTNRAGGLEGGMTNGQPLRVRAAMKPISTVPRALATVDLATGDE AVAIHQRSDVCAVPAAGVVVETMVALVLARAALEKFGGDSLAETQRNIAAYQRSVADR EAPAARVSG" misc_feature complement(2863990..2864010) /gene="aroF" /locus_tag="Rv2540c" /note="PS00788 Chorismate synthase signature 2" gene 2864427..2864834 /locus_tag="Rv2541" /db_xref="GeneID:887831" CDS 2864427..2864834 /locus_tag="Rv2541" /function="UNKNOWN" /note="Rv2541, (MTCY159.15c), len: 135 aa. Hypothetical unknown ala-rich protein, equivalent to AAK46926|MT2615.1 HYPOTHETICAL 38.9 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 but AAK46926|MT2615.1 longer at C-terminus. Questionable ORF. Some similarity with Rv2077A from Mycobacterium tuberculosis (99 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217057.1" /db_xref="GI:15609678" /db_xref="GeneID:887831" /translation="MRRRRPPHVNAPTPCDRGDVRPPGCPASIPGVEVAGGTRARLRV TADGLQALAGRCATLAGELSAAVAPSGAVLSWQANAVAVNAAHARAGAAAAAVSARMR ATAAALGQAARRYAGQDTAAAAALGAVRPWGTH" gene 2865130..2866341 /locus_tag="Rv2542" /db_xref="GeneID:887261" CDS 2865130..2866341 /locus_tag="Rv2542" /function="UNKNOWN" /note="Rv2542, (MTCY159.14c), len: 403 aa. Conserved hypothetical protein, highly similar to AAK46927|MT2616 HYPOTHETICAL 28.0 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 1776, E(): 2.3e-94, (99.25% identity in 265 aa overlap). And similar to several hypothetical proteins from Mycobacterium tuberculosis (strain H37Rv and CDC1551) e.g. P71654|Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: opt: 537, E(): 2.6e-23, (40.75% identity in 292 aa overlap); P71547|Y963_MYCTU|Rv0963c|MT0992|MTCY10D7.11 (266 aa), FASTA scores: opt: 357, E(): 2.6e-13, (34.6% identity in 234 aa overlap); Q10685|YK77_MYCTU|Rv2077c|MT2137|MTCY49.16c (323 aa), FASTA scores: opt: 261, E(): 9.5e-08, (32.7% identity in 211 aa overlap); etc. Also similar to Q9RDQ9|SC4A7.03 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (406 aa), FASTA scores: opt: 247, E(): 7.3e-07, (30.35% identity in 303 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217058.1" /db_xref="GI:15609679" /db_xref="GeneID:887261" /translation="MLDAVSDARRDGFAVGEDYTVTDRSTGGSRQQRAARLGQAQGHA DFIRHRVGALLATDRDIATRVSAATQGLDELAFEDVPGVDTPAEDGVQAVDFRQAPPP GAPGGMSSGDIDAIDAANRALLQDMLAEYSRLPDGQVKTDRLADIAAIQEALRVPDSH LIYVARPDDPADMIPAVTAVGDPFTADHVSVTVPGVSGTTRQTIATMTQETRGLREEA RVIAHSVGESENVATIAWVGYQPPPVLASWNTVDDDLAQAGAPKLEAFLRDLQAGSHN PGHTTALFGHSYGSLLSGIALKDGASSLVDNAVLYGSPGFDATSPAKLGMNDHNFFVM TTPDDPIRYPARLAPLHGWGSDGADTIGTVGRQGTPARVGIRPQRDHRRIPGPLPLHP SADRRGIHSAG" gene 2866468..2867127 /gene="lppA" /locus_tag="Rv2543" /db_xref="GeneID:888052" CDS 2866468..2867127 /gene="lppA" /locus_tag="Rv2543" /function="UNKNOWN" /note="Rv2543, (MTCY159.13c), len: 219 aa. Probable lppA, conserved lipoprotein, highly similar to upstream ORF P95009|LPPB|Rv2544|MTCY159.12 PUTATIVE LIPOPROTEIN LPPB from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 1240, E(): 1.1e-73, (87.15% identity in 218 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LppA" /protein_id="NP_217059.1" /db_xref="GI:15609680" /db_xref="GeneID:888052" /translation="MIAPQPISRTLPRWQRIVALTMIGISTALIGGCTMDHNPDTSRR LTGEQKIQLIDSMRNKGSYEAARERLTATARIIADRVSAAIPGQTWKFDDDPNIQQSD RNGALCDKLTADIARRPIANSVMFGATFSAEDFKIAANIVREEAAKYGATTESSLFNE SAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDLPAGQLPPEPPIWPTTSTP H" misc_feature 2866534..2866566 /gene="lppA" /locus_tag="Rv2543" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2867124..2867786 /gene="lppB" /locus_tag="Rv2544" /db_xref="GeneID:888054" CDS 2867124..2867786 /gene="lppB" /locus_tag="Rv2544" /function="UNKNOWN" /note="Rv2544, (MTCY159.12c), len: 220 aa. Probable lppB, conserved lipoprotein, highly similar to downstream ORF P95010|MTCY159.13c|LPPA|Rv2543|MTCY159.13 PUTATIVE LIPOPROTEIN LPPA from Mycobacterium tuberculosis (219 aa), FASTA scores: opt: 1242, E(): 4.8e-72, (87.15% identity in 218 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="lipoprotein LppB" /protein_id="NP_217060.1" /db_xref="GI:15609681" /db_xref="GeneID:888054" /translation="MIAPQPIPRTLPRWQRIVALTMIGISTALIGGCTMGQNPDKSPH LTGEQKIQLIDSMRHKGSYEAARERLTATAQIIADRVSAAIPGQTWKFNDDSYGQDFY RNGSLCKELSADIARRPMAKPVDFGSTFSAEDFKIAANIVREEAAKYGVTTESSLFNE SAKRDYDVQGNGYEFNLGQIKFATLNITGDCFLLQKVLDLPAGQLPPEPPIWPTTSTP TP" misc_feature 2867190..2867222 /gene="lppB" /locus_tag="Rv2544" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 2867783..2868061 /locus_tag="Rv2545" /db_xref="GeneID:888038" CDS 2867783..2868061 /locus_tag="Rv2545" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2545, (MTY159.11c), len: 92 aa. Conserved hypothetical protein. C-terminus highly similar to O33300|Rv2758c|MTV002.23c PROTEIN from Mycobacterium tuberculosis (88 aa), FASTA scores: opt: 151, E(): 9.8e-05, (66.65% identity in 45 aa overlap); and Q10771|Rv1560|MT1611|MTCY48.05 PROTEIN from Mycobacterium tuberculosis (72 aa), FASTA scores: opt: 84, E(): 8.2, (46.5% identity in 43 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217061.1" /db_xref="GI:15609682" /db_xref="GeneID:888038" /translation="MSTTIVAGVIQGHLPVILPTRRRARDLGHTTALFRAQTLQCIYL SIEYLYVCSMSRRTTIDIDDILLARAQAALGTTGLKDRVDAALRAAVR" gene 2868154..2868567 /locus_tag="Rv2546" /db_xref="GeneID:887365" CDS 2868154..2868567 /locus_tag="Rv2546" /function="UNKNOWN" /note="Rv2546, (MTCY159.10c), len: 137 aa. Conserved hypothetical protein. Some similarity to several HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis (strain H37Rv and CDC1551) e.g. P96411|Rv0229c|MTCY08D5.24c (226 aa), FASTA scores: opt: 272, E(): 1.3e-11, (39.7% identity in 136 aa overlap); O33299|Rv2757c|MTV002.22c (138 aa), FASTA scores: opt: 265, E(): 2.5e-11, (38.5% identity in 135 aa overlap); P95026|Rv2527|MTCY159.29c (133 aa), FASTA scores: opt: 206, E(): 2.6e-07, (38.0% identity in 100 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217062.1" /db_xref="GI:15609683" /db_xref="GeneID:887365" /translation="MVFCVDTSAWHHAARPEVARRWLAALSADQIGICDHVRLEILYS ANSATDYDALADELDGLARIPVGAETFTRACQVQRELAHVAGLHHRSVKIADLVIAAA AELSGTIVWHYDENYDRVAAITGQPTEWIVPRGTL" gene 2868606..2868863 /locus_tag="Rv2547" /db_xref="GeneID:888452" CDS 2868606..2868863 /locus_tag="Rv2547" /function="UNKNOWN" /note="Rv2547, (MTCY159.09c), len: 85 aa. Conserved hypothetical protein. Some similarity to P71666|YD98_MYCTU|Rv1398c|MT1442|MTCY21B4.15c HYPOTHETICAL 9.4 KDA PROTEIN from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 108, E(): 0.33, (37.1% identity in 62 aa overlap); CAC45864|SMC01933 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (71 aa), FASTA scores: opt: 105, E(): 0.46, (28.4% identity in 74 aa overlap); Q97W38|SSO10342 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (58 aa), FASTA scores: opt: 94, E(): 2.3, (46.95% identity in 49 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217063.1" /db_xref="GI:15609684" /db_xref="GeneID:888452" /translation="MRTQVTLGKEELELLDRAAKASGASRSELIRRAIHRAYGTGSKQ ERLAALDHSRGSWRGRDFTGTEYVDAIRGDLNERLARLGLA" gene 2868860..2869237 /locus_tag="Rv2548" /db_xref="GeneID:888412" CDS 2868860..2869237 /locus_tag="Rv2548" /function="UNKNOWN" /note="Rv2548, (MTCY159.08c), len: 125 aa. Conserved hypothetical protein. Some similarity to various proteins e.g. P71665|Rv1397c|MTCY21B4.14c HYPOTHETICAL 15.0 KDA PROTEIN from Mycobacterium tuberculosis (133 aa), FASTA scores: opt: 265, E(): 7.1e-12, (42.3% identity in 123 aa overlap); Q97WY5|SSO1975 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (125 aa), FASTA scores: opt: 131, E(): 0.018, (30.0% identity in 110 aa overlap); O52285|YLE HYPOTHETICAL 14.9 KDA PROTEIN from Agrobacterium radiobacter (133 aa), FASTA scores: opt: 128, E(): 0.03, (32.8% identity in 125 aa overlap); etc. TBscore is 0.865." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217064.1" /db_xref="GI:15609685" /db_xref="GeneID:888412" /translation="MKLIDTTIAVDHLRGEPAAAVLLAELINNGEEIAASELVRFELL AGVRESELAALEAFFSAVVWTLVTEDIARIGGRLARRYRSSHRGIDDVDYLIAATAIV VDADLLTTNVRHFPMFPDLQPPY" gene complement(2869727..2870122) /locus_tag="Rv2549c" /db_xref="GeneID:887193" CDS complement(2869727..2870122) /locus_tag="Rv2549c" /function="UNKNOWN" /note="Rv2549c, (MTCY159.07), len: 131 aa. Conserved hypothetical protein, showing some similarity to P73415|SLL1715 from Synechocystis sp. strain PCC 6803 (157 aa), FASTA scores: opt: 167, E(): 4.2e-05, (29.45% identity in 129 aa overlap); Q9HHY6|VNG6166H from Halobacterium sp. plasmid pNRC200 strain NRC-1 (144 aa), FASTA scores: opt: 133, E(): 0.011, (29.6% identity in 125 aa overlap); and Q9HSU3|VNG0072H from Halobacterium sp. strain NRC-1 (144 aa), FASTA scores: opt: 113, E(): 0.29, (25.75% identity in 136 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217065.1" /db_xref="GI:15609686" /db_xref="GeneID:887193" /translation="MIFVDTSFWAALGNAGDARHGTAKRLWASKPPVVMTSNHVLGET WTLLNRRCGHRAAVAAAAIRLSTVVRVEHVTADLEEQAWEWLVRHDEREYSFVDATSF AVMRKKGIQNAYAFDGDFSAAGFVEVRPE" gene complement(2870119..2870364) /locus_tag="Rv2550c" /db_xref="GeneID:887353" CDS complement(2870119..2870364) /locus_tag="Rv2550c" /function="UNKNOWN" /note="Rv2550c, (MTCY159.06), len: 81 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217066.1" /db_xref="GI:15609687" /db_xref="GeneID:887353" /translation="MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVA EHLRQPGPDPVDAFVGSFVGEADLSASVDDVVYGKHE" gene complement(2870775..2871194) /locus_tag="Rv2551c" /db_xref="GeneID:887855" CDS complement(2870775..2871194) /locus_tag="Rv2551c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2551c, (MTCY159.05), len: 139 aa. Conserved hypothetical protein, similar to the second part of Q9XAP1|SC10A7.34c PUTATIVE TYPE IV PEPTIDASE from Streptomyces coelicolor (259 aa), FASTA scores: opt: 243, E(): 7.4e-08, (40.95% identity in 144 aa overlap). Also some similarity with other proteins e.g. AAK58497|GSPO GSPO PROTEIN from Acetobacter diazotrophicus (261 aa), FASTA scores: opt: 152, E(): 0.025, (33.35% identity in 135 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217067.1" /db_xref="GI:15609688" /db_xref="GeneID:887855" /translation="MLAAAVLAWMGVLCVCDVRQRRLPNWLTLPGAGVILLFAGLAGR GVPALAGAAALAGVYLLVHLALPAAMGAGDVKLAIGLGGLTGCFGVEVWFLAALAAPL LTAVCGVMVTPWGVRTLPHGPSMCVASLGAVGLALLG" gene complement(2871206..2872015) /gene="aroE" /locus_tag="Rv2552c" /db_xref="GeneID:887330" CDS complement(2871206..2872015) /gene="aroE" /locus_tag="Rv2552c" /EC_number="1.1.1.25" /function="POSSIBLY INVOLVED AT THE FOURTH STEP IN THE BIOSYNTHESIS OF CHORISMATE WITHIN THE BIOSYNTHESIS OF AROMATIC AMINO ACIDS (THE SHIKIMATE PATHWAY) [CATALYTIC ACTIVITY: SHIKIMATE + NADP(+) = 5-DEHYDROSHIKIMATE + NADPH]." /note="AroE; catalyzes the conversion of shikimate to 3-dehydroshikimate" /codon_start=1 /transl_table=11 /product="shikimate 5-dehydrogenase" /protein_id="NP_217068.1" /db_xref="GI:15609689" /db_xref="GeneID:887330" /translation="MSEGPKKAGVLGSPIAHSRSPQLHLAAYRALGLHDWTYERIECG AAELPVVVGGFGPEWVGVSVTMPGKFAALRFADERTARADLVGSANTLVRTPHGWRAD NTDIDGVAGALGAAAGHALVLGSGGTAPAAVVGLAELGVTDITVVARNSDKAARLVDL GTRVGVATRFCAFDSGGLADAVAAAEVLVSTIPAEVAAGYAGTLAAIPVLLDAIYDPW PTPLAAAVGSAGGRVISGLQMLLHQAFAQVEQFTGLPAPREAMTCALAALD" gene complement(2872012..2873265) /locus_tag="Rv2553c" /db_xref="GeneID:888228" CDS complement(2872012..2873265) /locus_tag="Rv2553c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2553c, (MTCY159.03), len: 417 aa. Probable conserved membrane protein, equivalent to Q9CCS8|ML0514 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (421 aa), FASTA scores: opt: 1955, E(): 1.1e-111, (72.7% identity in 414 aa overlap). Also similar in part to various proteins e.g. Q9L9G6|NOVB NOVB PROTEIN (aminodesoxychorismate lyase) from Streptomyces sphaeroides (284 aa), FASTA scores: opt: 451, E(): 2.9e-2, (37.95% identity in 203 aa overlap); Q9EWY3|2SCG38.36 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (253 aa), FASTA scores: opt: 419, E(): 2.3e-18, (39.2% identity in 171 aa overlap); Q9CHT3|YGCC HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (550 aa), FASTA scores: opt: 379, E(): 1.2e-15, (23.0% identity in 417 aa overlap); O25309|HP0587 AMINODEOXYCHORISMATE LYASE (PABC) from Helicobacter pylori (Campylobacter pylori) (329 aa), FASTA scores: opt: 290, E(): 2e-10, (31.65% identity in 180 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217069.1" /db_xref="GI:15609690" /db_xref="GeneID:888228" /translation="MPDGGHRHRAQPVSVRPNRHRRTRVSRAQRRHAQQIRRRRRVAG GFALSLLVVVVVVAVVVGAKLWQTMLGFGNDYTGPGKRDIVIQIRAGDSTTAVGETLL KHGVVATVRAFVDAAHGNTAISSIQPGFYRMRTEISAASAVARLTDPHNRVGKLVIPE GRQLDDTTDMKTNVVNPGIFALISRATCVDLDGTQRCVSVADLRAAASRSTPTMLSVP RWAVGPVMELGTDHRRIEGLIAPGTFNIDPSASAETILATLISAGAVEYMKSGLVDTA KSLGLSPYDILVVASLVQQEANTQDFPKVARVIYNRLHEHRTLEFDSTVNYPLDRREV ATSDTDRAQRTPWNTYMAQGLPATAICSPGVDALRAAEHPVPGDWLYFVTIDSQGTTL FTRDYQQHLANIELAKHNGVLDSAR" gene complement(2873258..2873770) /locus_tag="Rv2554c" /db_xref="GeneID:887490" CDS complement(2873258..2873770) /locus_tag="Rv2554c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="similar to RuvC resolvase with substantial differences; NMR structural information suggests this protein is monomeric; unknown cellular function" /codon_start=1 /transl_table=11 /product="Holliday junction resolvase-like protein" /protein_id="NP_217070.1" /db_xref="GI:15609691" /db_xref="GeneID:887490" /translation="MVPAQHRPPDRPGDPAHDPGRGRRLGIDVGAARIGVACSDPDAI LATPVETVRRDRSGKHLRRLAALAAELEAVEVIVGLPRTLADRIGRSAQDAIELAEAL ARRVSPTPVRLADERLTTVSAQRSLRQAGVRASEQRAVIDQAAAVAILQSWLDERLAA MAGTQEGSDA" gene complement(2873771..2876485) /gene="alaS" /locus_tag="Rv2555c" /db_xref="GeneID:887726" CDS complement(2873771..2876485) /gene="alaS" /locus_tag="Rv2555c" /EC_number="6.1.1.7" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-ALANINE + TRNA(ALA) = AMP + PYROPHOSPHATE + L-ALANYL-TRNA(ALA)]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes a two-step reaction, first charging an alanyl molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="alanyl-tRNA synthetase" /protein_id="NP_217071.1" /db_xref="GI:15609692" /db_xref="GeneID:887726" /translation="MQTHEIRKRFLDHFVKAGHTEVPSASVILDDPNLLFVNAGMVQF VPFFLGQRTPPYPTATSIQKCIRTPDIDEVGITTRHNTFFQMAGNFSFGDYFKRGAIE LAWALLTNSLAAGGYGLDPERIWTTVYFDDDEAVRLWQEVAGLPAERIQRRGMADNYW SMGIPGPCGPSSEIYYDRGPEFGPAGGPIVSEDRYLEVWNLVFMQNERGEGTTKEDYQ ILGPLPRKNIDTGMGVERIALVLQDVHNVYETDLLRPVIDTVARVAARAYDVGNHEDD VRYRIIADHSRTAAILIGDGVSPGNDGRGYVLRRLLRRVIRSAKLLGIDAAIVGDLMA TVRNAMGPSYPELVADFERISRIAVAEETAFNRTLASGSRLFEEVASSTKKSGATVLS GSDAFTLHDTYGFPIELTLEMAAETGLQVDEIGFRELMAEQRRRAKADAAARKHAHAD LSAYRELVDAGATEFTGFDELRSQARILGIFVDGKRVPVVAHGVAGGAGEGQRVELVL DRTPLYAESGGQIADEGTISGTGSSEAARAAVTDVQKIAKTLWVHRVNVESGEFVEGD TVIAAVDPGWRRGATQGHSGTHMVHAALRQVLGPNAVQAGSLNRPGYLRFDFNWQGPL TDDQRTQVEEVTNEAVQADFEVRTFTEQLDKAKAMGAIALFGESYPDEVRVVEMGGPF SLELCGGTHVSNTAQIGPVTILGESSIGSGVRRVEAYVGLDSFRHLAKERALMAGLAS SLKVPSEEVPARVANLVERLRAAEKELERVRMASARAAATNAAAGAQRIGNVRLVAQR MSGGMTAADLRSLIGDIRGKLGSEPAVVALIAEGESQTVPYAVAANPAAQDLGIRAND LVKQLAVAVEGRGGGKADLAQGSGKNPTGIDAALDAVRSEIAVIARVG" gene complement(2876576..2876965) /locus_tag="Rv2556c" /db_xref="GeneID:887451" CDS complement(2876576..2876965) /locus_tag="Rv2556c" /function="UNKNOWN" /note="Rv2556c, (MTCY09C4.12), len: 129 aa. Conserved hypothetical protein, highly similar to others e.g. Q9EWY5|2SCG38.34 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (140 aa), FASTA scores: opt: 488, E(): 8.2e-26, (58.8% identity in 131 aa overlap); Q9L9G4|NOVD NOVD PROTEIN from Streptomyces sphaeroides (143 aa), FASTA scores: opt: 474, E(): 7.2e-25, (60.85% identity in 120 aa overlap); Q9X2I5|TM1872 from Thermotoga maritima (132 aa), FASTA scores: opt: 270, E(): 2.7e-11, (39.55% identity in 129 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217072.1" /db_xref="GI:15609693" /db_xref="GeneID:887451" /translation="MLDVDTARRRIVDLTDAVRAFCTAHDDGLCNVFVPHATAGVAII ETGAGSDEDLVDTLVRLLPRDDRYRHAHGSYGHGADHLLPAFVAPSVTVPVSGGQPLL GTWQSIVLVDLNQDNPRRSVRLSFVEG" gene 2877072..2877746 /locus_tag="Rv2557" /db_xref="GeneID:887865" CDS 2877072..2877746 /locus_tag="Rv2557" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN THE PERSISTENCE IN THE HOST." /experiment="experimental evidence, no additional details recorded" /note="Rv2557, (MTCY9C4.11c), len: 224 aa. Conserved hypothetical protein, highly similar to upstream ORF Q50740|MTCY9C4.10c|Rv2558|MT2635 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (236 aa), FASTA scores: opt: 1007, E(): 6.9e-60, (69.2% identity in 224 aa overlap); and Mb2587 in Mycobacterium bovis (224 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217073.1" /db_xref="GI:15609694" /db_xref="GeneID:887865" /translation="MTGGATGALPRTMKEGWIVYARSTTIQAQSECIDTGIAHVRDVV MPALQGMDGCIGVSLLVDRQSGRCIATSAWETAEAMHASREQVTPIRDRCAEMFGGTP AVEEWEIAAMHRDHRSAEGACVRATWVKVPADQVDQGIEYYKSSVLPQIEGLDGFCSA SLLVDRTSGRAVSSATFDSFDAMERNRDQSNALKATSLREAGGEELDECEFELALAHL RVPELV" gene 2877831..2878541 /locus_tag="Rv2558" /db_xref="GeneID:887298" CDS 2877831..2878541 /locus_tag="Rv2558" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN THE PERSISTENCE IN THE HOST." /experiment="experimental evidence, no additional details recorded" /note="Rv2558, (MTCY9C4.10c), len: 236 aa. Conserved hypothetical protein, highly similar to downstream ORF Q50741|MTCY9C4.11c|Rv2557|MT2645 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (224 aa), FASTA scores: opt: 1007, E(): 4.7e-59, (69.2% identity in 224 aa overlap); and Mb2588 in Mycobacterium bovis (236 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217074.1" /db_xref="GI:15609695" /db_xref="GeneID:887298" /translation="MPGSAGWRKVFGGTGGATGALPRHGRGSIVYARSTTIEAQPLSV DIGIAHVRDVVMPALQEIDGCVGVSLLVDRQSGRCIATSAWETLEAMRASVERVAPIR DRAALMFAGSARVEEWDIALLHRDHPSHEGACVRATWLKVVPDQLGRSLEFYRTSVLP ELESLDGFCSASLMVDHPACRRAVSCSTFDSMDAMARNRDRASELRSRRVRELGAEVL DVAEFELAIAHLRVPELV" gene complement(2878571..2879929) /locus_tag="Rv2559c" /db_xref="GeneID:887368" CDS complement(2878571..2879929) /locus_tag="Rv2559c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2559c, (MTCY9C4.09), len: 452 aa. Conserved hypothetical ala-, leu-, val-rich protein, equivalent to Q9CCT1|ML0510 HYPOTHETICAL PROTEIN from Mycobacterium leprae (473 aa), FASTA scores: opt: 2411, E(): 3.9e-121, (83.4% identity in 452 aa overlap); O69490|O69490 HYPOTHETICAL 47.1 KDA PROTEIN from Mycobacterium leprae (447 aa), FASTA scores: opt: 2406, E(): 6.9e-121, (83.95% identity in 448 aa overlap). Also highly similar to Q9KXP4|SC9C5.30c CONSERVED ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (451 aa), FASTA scores: opt: 1742, E(): 1.5e-85, (64.4% identity in 430 aa overlap); Q9RT67|DR1898 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (434 aa), FASTA scores: opt: 1147, E(): 6.6e-54, (46.0% identity in 415 aa overlap); P45262|YCAJ_HAEIN|HI1590 HYPOTHETICAL PROTEIN from Haemophilus influenzae (446 aa), FASTA scores: opt: 1140, E(): 1.6e-53, (42.5% identity in 428 aa overlap); etc. Also similar to Q50629|MTCY227.09|RUVB|Rv2592c|MT2669|MTCY227.09 HOLLIDAY JUNCTION DNA HELICASE from Mycobacterium tuberculosis (344 aa), (30.1% identity in 296 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="recombination factor protein RarA" /protein_id="NP_217075.1" /db_xref="GI:15609696" /db_xref="GeneID:887368" /translation="MPEAVSDGLFDVPGVPMTSGHDLGASAGAPLAVRMRPASLDEVV GQDHLLAPGSPLRRLVEGSGVASVILYGPPGSGKTTLAALISQATGRRFEALSALSAG VKEVRAVIENSRKALLHGEQTVLFIDEVHRFSKTQQDALLSAVEHRVVLLVAATTENP SFSVVAPLLSRSLILQLRPLTAEDTRAVVQRAIDDPRGLGRAVAVAPEAVDLLVQLAA GDARRALTALEVAAEAAQAAGELVSVQTIERSVDKAAVRYDRDGDQHYDVVSAFIKSV RGSDVDAALHYLARMLVAGEDPRFIARRLMILASEDIGMAGPSALQVAVAAAQTVALI GMPEAQLTLAHATIHLATAPKSNAVTTALAAAMNDIKAGKAGLVPAHLRDGHYSGAAA LGNAQGYKYSHDDPDGVVAQQYPPDELVDVDYYRPTGRGGEREIAGRLDRLRAIIRKK RG" misc_feature complement(2879693..2879716) /locus_tag="Rv2559c" /note="PS00017 ATP/GTP-binding site motif A" gene 2880075..2881052 /locus_tag="Rv2560" /db_xref="GeneID:887363" CDS 2880075..2881052 /locus_tag="Rv2560" /function="UNKNOWN" /note="Rv2560, (MTCY9C4.08c), len: 325 aa. Probable transmembrane protein, pro-, gly-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217076.1" /db_xref="GI:15609697" /db_xref="GeneID:887363" /translation="MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGT YLPPGYNAPPPPPGYGPPPGPPPPGYPTHLQSSGFSVGDAISWSWNRFTQNAVTLVVP VLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYI ALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLC VIPGLIFGFVAQFAVAFAVDRSTSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGE LLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVRPAPPVGWPPGPQLA" gene 2881409..2881702 /locus_tag="Rv2561" /db_xref="GeneID:887164" CDS 2881409..2881702 /locus_tag="Rv2561" /function="UNKNOWN" /note="Rv2561, (MTCY9C4.07c), len: 97 aa. Conserved hypothetical protein, highly similar in part (and longer 33 aa) to upstream ORF AAK46951|RV2562|MT2638|MTCY9C4.06c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (212 aa), FASTA scores: opt: 205, E(): 2e-06, (76.1% identity in 46 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217077.1" /db_xref="GI:15609698" /db_xref="GeneID:887164" /translation="MGIQRAVLLIADIGGYTNYMHWNRKHLAHAQWTVAQLLESVIDA AKGMKLAKLEGDAAFFWAPGGQHQCPGMRPAPADAPEVPHAARADQKRPSLRL" gene 2881758..2882147 /locus_tag="Rv2562" /db_xref="GeneID:887329" CDS 2881758..2882147 /locus_tag="Rv2562" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2562, (MTCY9C4.06c), len: 129 aa. Conserved hypothetical protein, highly similar, but shorter 83 aa, to downstream ORF AAK46951|RV2561|MT2638|MTCY9C4.07c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (97 aa), FASTA scores: opt: 866, E(): 2.2e-54, (100.0% identity in 129 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217078.1" /db_xref="GI:15609699" /db_xref="GeneID:887329" /translation="MAEQKVKRNVELAGVDVILVHRMLKNEVPVSEYLFMTDVVAQCL DESVRKLATPLTHDFEGIGETSTHYIDLATSDMPPAVPDHSFFGLLWADVKFEWHALP YLLGFKKACAGFRSLGRGATEEPAEMG" gene 2882290..2883339 /locus_tag="Rv2563" /db_xref="GeneID:887516" CDS 2882290..2883339 /locus_tag="Rv2563" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF GLUTAMINE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2563, (MTCY9C4.05c), len: 349 aa. Probable glutamine-transport transmembrane protein ABC transporter (see citation below), highly similar to O53617|Rv0072|MTV030.16 PUTATIVE ABC-TRANSPORTER TRANSMEMBRANE SUBUNIT from Mycobacterium tuberculosis (349 aa), FASTA scores: opt: 1772, E(): 1.1e-89, (76.2% identity in 349 aa overlap). Also some similarity with various hypothetical proteins e.g. Q9RYN1|DRA0279 HYPOTHETICAL 37.1 KDA PROTEIN from Deinococcus radiodurans (353 aa), FASTA scores: opt: 347, E(): 6.6e-12, (24.35% identity in 357 aa overlap); BAB58522|SAV2360 CONSERVED HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (351 aa), FASTA scores: opt: 262, E(): 2.9e-07, (19.4% identity in 356 aa overlap); Q9AK94|SC10A9.10c PUTATIVE ABC TRANSPORT SYSTEM TRANSMEMBRANE PROTEIN from Streptomyces coelicolor (379 aa), FASTA scores: opt: 172, E(): 0.025, (26.85% identity in 387 aa overlap); etc." /codon_start=1 /transl_table=11 /product="glutamine-transport transmembrane protein ABC transporter" /protein_id="NP_217079.1" /db_xref="GI:15609700" /db_xref="GeneID:887516" /translation="MLFAALRDVQWRKRRLVIAIVSTGLVFAMTLVLTGLVNGFRVEA ERTVDSMGVDAFVVKAGAAGPFLGSTPFAQIDLPQVARAPGVLAAAPLATAPSTIRQG TSARNVTAFGAPEHGPGMPRVSDGRAPSTPDEVAVSSTLGRNLGDDLQVGARTLRIVG IVPESTALAKIPNIFLTTEGLQQLAYNGQPTISSIGIDGMPRQLPDGYQTVNRADAVS DLMRPLKVAVDAITVVAVLLWIVAALIVGSVVYLSALERLRDFAVFKAIGVPTRSILA GLALQAVVVALLAAVVGGILSLLLAPLFPMTVVVPLSAFVALPAIATVIGLLASVAGL RRVVAIDPALAFGGP" gene 2883342..2884334 /gene="glnQ" /locus_tag="Rv2564" /db_xref="GeneID:887898" CDS 2883342..2884334 /gene="glnQ" /locus_tag="Rv2564" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF GLUTAMINE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2564, (MTCY9C4.04c), len: 330 aa. Probable glnQ, glutamine-transport ATP-binding protein ABC transporter (see citation below), highly similar to many e.g. Q9L0J9|SCD40A.12c PUTATIVE ABC-TRANSPORTER ATP-BINDING PROTEIN from Streptomyces coelicolor (246 aa), FASTA scores: opt: 598, E(): 2.5e-26, (46.35% identity in 218 aa overlap); O54136|SC2E9.11 from Streptomyces coelicolor (230 aa), FASTA scores: opt: 592, E(): 5.1e-26, (46.55% identity in 219 aa overlap); O29244|AF1018 from Archaeoglobus fulgidus (228 aa), FASTA scores: opt: 580, E(): 2.4e-25, (42.4% identity in 210 aa overlap); P75831|YBJZ_ECOLI|B0879 from Escherichia coli strain K12 (648 aa), FASTA scores: opt: 555, E(): 1.3e-23, (39.65% identity in 232 aa overlap); etc. Also highly similar to O53618|Rv0073|MTV030.17 ABC-TRANSPORTER ATP-BINDING SUBUNIT from Mycobacterium tuberculosis (330 aa), FASTA scores: opt: 1782, E(): 4.7e-92, (83.65% identity in 330 aa overlap); etc. Shows some similarity to Q11040|YC81_MYCTU|MTCY50.01|Rv1281c|MT1318 HYPOTHETICAL ABC TRANSPORTER ATP-BINDING PROTEIN from Mycobacterium tuberculosis (612 aa) (32.9 % identity in 234 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), PS00211 ABC transporters family signature, and PS00889 Cyclic nucleotide-binding domain signature 2. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="glutamine-transport ATP-binding protein ABC transporter GlnQ" /protein_id="NP_217080.1" /db_xref="GI:15609701" /db_xref="GeneID:887898" /translation="MGGLTISDLVVEYSSGGYAVRPIDGLSLDVAPGSLVILLGPSGC GKTTLLSCLGGILRPKSGSIKFDDVDITTLEGAALAKYRRDKVGIVFQAFNLVSSLTA LENVMVPLRAAGVSRAAARKRAEDLLIRVNLGERMKHRPGDMSGGQQQRVAVARAIAL DPQLILADEPTAHLDFIQVEEVLRLIRSLAQGDRVVVVATHDSRMLPLADRVLELMPA QVSPNQPPETVHVKAGEVLFEQSTMGDLIYVVSEGEFEIVRELADGGEELVKTAAPGD YFGEIGVLFHLPRSATVRARSDATAVGYTAQAFRERLGVTRVADLIEHRELASE" misc_feature 2883459..2883482 /gene="glnQ" /locus_tag="Rv2564" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 2883771..2883815 /gene="glnQ" /locus_tag="Rv2564" /note="PS00211 ABC transporters family signature" misc_feature 2884173..2884226 /gene="glnQ" /locus_tag="Rv2564" /note="PS00889 Cyclic nucleotide-binding domain signature 2" gene 2884611..2886362 /locus_tag="Rv2565" /db_xref="GeneID:887288" CDS 2884611..2886362 /locus_tag="Rv2565" /function="UNKNOWN" /note="Rv2565, (MTCY9C4.03c), len: 583 aa. Conserved hypothetical protein, similar in part to Q9A6C3|CC2171 HYPOTHETICAL PROTEIN from Caulobacter crescentus (610 aa), FASTA scores: opt: 765, E(): 2.8e-37, (32.15% identity in 575 aa overlap). C-terminus also highly similar to various bacterial proteins e.g. O34731|YLBK_BACSU HYPOTHETICAL 28.3 KDA PROTEIN from Bacillus subtilis (260 aa), FASTA scores: opt: 386, E(): 2.2e-15, (33.05% identity in 245 aa overlap); CAC45997|SMC01003 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (321 aa), FASTA scores: opt: 352, E(): 2.5e-13, (29.65% identity in 280 aa overlap); Q9K9Q8|BH2587 HYPOTHETICAL PROTEIN from Bacillus halodurans (275 aa), FASTA scores: opt: 334, E(): 2.5e-12, (33.7% identity in 175 aa overlap); etc. And shows similarity to C-terminal half of some eukaryotic proteins e.g. Q9R114|NTE NEUROPATHY TARGET ESTERASE HOMOLOG from Mus musculus (Mouse) (1327 aa), FASTA scores: opt: 411, E(): 2.7e-16, (24.45% identity in 626 aa overlap); O60859 NEUROPATHY TARGET ESTERASE from Homo sapiens (Human) (1327 aa), FASTA scores: opt: 410, E(): 3.1e-16, (24.1% identity in 627 aa overlap); Q9U969|SWS|CG2212 SWISS CHEESE PROTEIN from Drosophila melanogaster (Fruit fly) (1425 aa), FASTA scores: opt: 401, E(): 1.1e-15, (27.75% identity in 544 aa overlap); etc. Also shows strong similarity to C-terminal half of O05884|Z95121|Rv3239c|MTY20B11.14c HYPOTHETICAL 110.2 KDA PROTEIN from Mycobacterium tuberculosis (1048 aa), FASTA scores: opt: 648, E(): 3e-30, (36.55% identity in 572 aa overlap); and O69695|Rv3728|MTV025.076 PUTATIVE TWO-DOMAIN MEMBRANE PROTEIN from Mycobacterium tuberculosis (1065 aa), FASTA scores: opt: 643, E(): 6e-30, (34.3% identity in 595 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217081.1" /db_xref="GI:15609702" /db_xref="GeneID:887288" /translation="MTTARRRPKRRGTDARTALRNVPILADIDDEQLERLATTVERRH VPANQWLFHAGEPADSIYIVDSGRFVAVAPEGHVFAEMASGDSIGDLGVIAGAARSAG VRALRDGVVWRIAAETFTDMLEATPLLQSAMLRAMARMLRQSRPAKTARRPRVIGVVS NGDTAAAPMVDAIATSLDSHGRTAVIAPPVETTSAVQEYDELVEAFSETLDRAERSND WVLVVADRGAGDLWRHYVSAQSDRLVVLVDQRYPPDAVDSLATQRPVHLITCLAEPDP SWWDRLAPVSHHPANSDGFGALARRIAGRSLGLVMAGGGARGLAHFGVYQELTEAGVV IDRFGGTSSGAIASAAFALGMDAGDAIAAAREFIAGSDPLGDYTIPISALTRGGRVDR LVQGFFGNTLIEHLPRGFFSVSADMITGDQIIHRRGSVSGAVRASISIPGLIPPVHNG EQLLVDGGLLNNLPANVMCADTDGEVICVDLRRTFVPSKGFGLLPPIVTPPGLLRRLL TGTDNALPPLQETLLRAFDLAASTANLRELPRVAAIIEPDVSKIGVLNFKQIDAALEA GRMAARAALQAQPDLVR" gene 2886373..2889795 /locus_tag="Rv2566" /db_xref="GeneID:887737" CDS 2886373..2889795 /locus_tag="Rv2566" /function="UNKNOWN" /note="Rv2566, (MTCY9C4.02c), len: 1140 aa. Long conserved hypothetical protein, equivalent to O53120|ML2678 OR MLCB1913.12 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1000 aa), FASTA scores: opt: 760, E(): 7.1e-38, (50.2% identity in 1128 aa overlap); and middle part equivalent to Q9ZB40 72.2 KDA PROTEIN (FRAGMENT) from Mycobacterium leprae (644 aa), FASTA scores: opt: 1017, E(): 1.5e-65, (45.65% identity in 655 aa overlap). Also highly similar to Q98HG6|MLL2877 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (1119 aa), FASTA scores: opt: 1413, E(): 3.7e-77, (52.4% identity in 1148 aa overlap); and N-terminus shows similarity with other proteins e.g. Q9HUN8|PA4926 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (311 aa), FASTA scores: opt: 278, E(): 3e-09, (29.95% identity in 284 aa overlap); and upstream ORF Q50652|YP69_MYCTU|Rv2569c|MT2645|MTCY227.32 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (314 aa), FASTA scores: opt: 252, E(): 1.1e-07, (28.9% identity in 315 aa overlap). Equivalent to AAK46955 from Mycobacterium tuberculosis strain CDC1551 (1156 aa) but shorter 16 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217082.1" /db_xref="GI:15609703" /db_xref="GeneID:887737" /translation="MPLRPTQVSGTGRTRCAGRSGVISSAAMSIKVALEHRTSYTFDR LVRVYPHIVRLRPAPHSRTSIEAYSLRIEPADHFINWQQDALGNFLARLVFPNPMRQL RITVGLIADLKVINPFDFFIEDWAEIWPCAGMAYPKALADDLRPYLRPVDEDGDGSGP GELTQAWVRNFTVPDGTRTIDFLVALNRAINADVGYCVRMEPGVQTPDFTLRTGVGSC RDSAWLLVSILRQFGLAARFVSGYLVQLASDIEALDGPSGPAADFTDLHAWAEAYIPG AGWIGLDPTSGLLAGEGHIPLAATPHPASAAPISGGTDVCDTVLEFSNTVTRVHEDPR VTLPYTDESWKTICEVGQRVDERLAAADVRLTVGGEPTFVSVDNQVAEEWRTAADGPH KRERASDLAARLKAVWAPQGLIHRGQGRWYPGEPLPRWQIALYWRTDGRPLWTNDALL ADPWGAPPADPVDDDAAYRVLAGIADGLGLPISQVRPAYEDPLSRLAAAVRMPAGDPV ESGDDLGCDTNPDTPTGRAALLARLDEAITSPAAYVLPLHRRDDGQGWASANWRLRRG RIVLLEGDSPAGLRLPLDSISWRPPRASFDADPVAVRSTLPAELHTDRAVVEDPETAP TTALVAEVRGGLVHIFLPPTDALEHFIDLVARVEAAATTANCPVVIEGYGPPPDPRLT STTITPDPGVIEVNIAPTASFAEQRQQLETLYQQARLARLTTEAFDVDGTHGGTGGGN HITLGGVTPADSPLLRRPDLLVSLLTYWQRHPSLSYLFAGRFVGTTSQAPRVDEGRAE ALYELEIAFAEILRLSPSSGGGRPQPWVTDRALRHLLTDITGNTHRAEFCIDKLYSPD SARGRLGLLELRGFEMPPHLHMAMVQSLLVRSLVAWFWDQPLRAPLIRHGANLHGRYL LPHFLIHDIADVAADLRAHGIAFETSWLDPFTEFRFPRIGTAVFDGIEIELRGAIEPW HTLGEEATAAGTARYVDSSVERIQVRIIGADRHRYVVTCNGYPMPLLATDNPDIHVGG VRFKAWQPPSALHPTITVDGPLRFELIDIATATSCGGCTYHVAHPGGRAYDEPPVNAV EAEARRARRFEATGFTPGKLDLSDIREKQARISTDIGAPGILDLRRVRTVQQ" gene 2889795..2892449 /locus_tag="Rv2567" /db_xref="GeneID:888578" CDS 2889795..2892449 /locus_tag="Rv2567" /function="UNKNOWN" /note="Rv2567, (MTCY227.34c, MTCY9C4.01c), len: 884 aa. Conserved hypothetical ala-, leu-rich protein, equivalent to O53121|ML2679|MLCB1913.13 HYPOTHETICAL PROTEIN from Mycobacterium leprae (893 aa), FASTA scores: opt: 4326, E(): 0, (75.2% identity in 883 aa overlap); and similar to Q49755|YO11_MYCLE|ML0605|MLCL536.05c|U1937B|B1937_F1_4 HYPOTHETICAL 61.8 KDA PROTEIN from Mycobacterium leprae (561 aa), FASTA scores: opt: 758, E(): 1.2e-38, (32.2% identity in 537 aa overlap). Also similar to others e.g. Q9HUN7|PA4927 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (830 aa), FASTA scores: opt: 1247, E(): 2.2e-68, (38.25% identity in 831 aa overlap); Q98HG7|MLL2876 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (803 aa), FASTA scores: opt: 937, E(): 1.9e-49, (32.15% identity in 828 aa overlap); CAC47419|SMC04057 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (802 aa), FASTA scores: opt: 900, E(): 3.4e-47, (30.85% identity in 852 aa overlap); etc. And similar to P71732|YO11_MYCTU|Rv2411c|MT2484|MTCY253.09 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (551 aa), FASTA scores: opt: 781, E(): 4.6e-40, (33.75% identity in 495 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217083.1" /db_xref="GI:15609704" /db_xref="GeneID:888578" /translation="MAPSASAATNGYDVDRLLAGYRTARAQETLFDLRDGPGAGYDEF VDDDGNVRPTWTELADAVAERGKAGLDRLRSVVHSLIDHDGITYTAIDAHRDALTGDH DLEPGPWRLDPLPLVISAADWEVLEAGLVQRSRLLDAILADLYGPRSMLTEGVLPPEM LFAHPGYVRAANGIQMPGRHQLFMHACDLSRLPDGTFQVNADWTQAPSGSGYAMADRR VVAHAVPDLYEELAPRPTTPFAQALRLALIDAAPDVAQDPVVVVLSPGIYSETAFDQA YLATLLGFPLVESADLVVRDGKLWMRSLGTLKRVDVVLRRVDAHYADPLDLRADSRLG VVGLVEAQHRGTVTVVNTLGSGILENPGLLRFLPQLSERLLDESPLLHTAPVYWGGIA SERSHLLANVSSLLIKSTVSGETLVGPTLSSAQLADLAVRIEAMPWQWVGQELPQFSS APTNHAGVLSSAGVGMRLFTVAQRSGYAPMIGGLGYVLAPGPAAYTLKTVAAKDIWVR PTERAHAEVITVPVLAPPAKTGAGTWAVSSPRVLSDLFWMGRYGERAENMARLLIVTR ERYHVFRHQQDTDESECVPVLMAALGKITGYDTATGAGSAYDRADMIAVAPSTLWSLT VDPDRPGSLVQSVEGLALAAQAVRDQLSNDTWMVLANVERAVEHKSDPPQSLAEADAV LASAQAETLAGMLTLSGVAGESMVHDVGWTMMDIGKRIERGLWLTALLQATLSTVRHP AAEQAIIEATLVACESSVIYRRRTVGKFSVAAVTELMLFDAQNPRSLVYQLERLRADL KDLPGSSGSSRPERMVDEMNTRLRRSHPEELEEVSADGLRAELAELLAGIHASLRDVA DVLTATQLALPGGMQPLWGPDQRRVMPA" gene complement(2892446..2893471) /locus_tag="Rv2568c" /db_xref="GeneID:887249" CDS complement(2892446..2893471) /locus_tag="Rv2568c" /function="UNKNOWN" /note="Rv2568c, (MTCY227.33), len: 341 aa. Conserved hypothetical protein, highly similar (but longer 60 aa) to Q98E75|MLR4376 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (308 aa), FASTA scores: opt: 566, E(): 4.1e-29, (40.2% identity in 291 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217084.1" /db_xref="GI:15609705" /db_xref="GeneID:887249" /translation="MRDFHCPNCGQRLAFENSACLSCGSALGFSLGRMALLVIADDAD VQLCANLHLAQCNWLVPSDQLGGLCSSCVLTIERPSDTNTAGLAEFARAEGAKRRLIA ELHELKLPIVGRDQDPDHGLAFRLLSSAHENVTTGHQNGVITLDLAEGDDVHREQLRV EMDEPYRTLLGHFRHEIGHYYFYRLIASSSDYLSRFNELFGDPDADYSQALDRHYRGG PPEGWQDSFVSSYATMHASEDWAETFAHYLHIRDALDTAAWCGLAPASATFDRPALGP SAFNTIIDKWLPLSWSLNMVNRSMGHDDLYPFVLPAAVLEKMRFIHTVVDEVAPDFEP AHSRRTV" gene complement(2893464..2894408) /locus_tag="Rv2569c" /db_xref="GeneID:887783" CDS complement(2893464..2894408) /locus_tag="Rv2569c" /function="UNKNOWN" /note="Rv2569c, (MTCY227.32), len: 314 aa. Conserved hypothetical protein, equivalent to Q9CCT2|ML0508 HYPOTHETICAL PROTEIN from Mycobacterium leprae (313 aa), FASTA scores: opt: 1723, E(): 1.9e-95, (84.4% identity in 301 aa overlap); and some similarity with Q49757|YP69_MYCLE|ML0607|MLCL536.03c|B1937_F2_39 HYPOTHETICAL 31.1 KDA PROTEIN from Mycobacterium leprae (279 aa), FASTA scores: opt: 305, E(): 4.5e-11, (33.0% identity in 300 aa overlap). Also similar to to other hypothetical proteins e.g. Q9HUN8|PA4926 from Pseudomonas aeruginosa (311 aa), FASTA scores: opt: 704, E(): 8.7e-35, (39.7% identity in 320 aa overlap); Q98HG8|MLL2875 from Rhizobium loti (Mesorhizobium loti) (294 aa), FASTA scores: opt: 521, E(): 6.5e-24, (35.05% identity in 294 aa overlap); Q9A7W9|CC1600 from Caulobacter crescentus (325 aa), FASTA scores: opt: 510, E(): 3.2e-23, (34.4% identity in 2588 aa overlap); etc. Also some similarity with proteins from Mycobacterium tuberculosis e.g. P71734|Rv2409c|MTCY253.11 CONSERVED HYPOTHETICAL PROTEIN (279 aa), FASTA scores: opt: 312, E(): 1.7e-11, (34.45% identity in 296 aa overlap); and Q50732|Rv2566|MTCY9C4.02 LONG CONSERVED HYPOTHETICAL PROTEIN (1140 aa), FASTA scores: opt: 252, E(): 2.2e-07, (28.9% identity in 315 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217085.1" /db_xref="GI:15609706" /db_xref="GeneID:887783" /translation="MSADSSLSLPLSGTHRYRVTHRTEYRYSDVVTSSYGRGFLTPRN SLRQRCVAHRLTIDPAPADRSTSRDGYGNISSYFHVTEPHRTLTITSDSIVDVSPPPP GLYTSGPALQPWEAARPAGLPGSLATEFTLDLNPPEITDAVREYAAPSFLPKRPLVEV LRDLASRIYTDFTYRSGSTTISTGVNEVLLAREGVCQDFARLAIACLRANGLAACYVS GYLATDPPPGKDRMIGIDATHAWASVWTPQQPGRFEWLGLDPTNDQLVDQRYIVVGRG RDYADVPPLRGIIYTNSENSVIDVSVDVVPFEGDALHA" gene 2894512..2894901 /locus_tag="Rv2570" /db_xref="GeneID:887377" CDS 2894512..2894901 /locus_tag="Rv2570" /function="UNKNOWN" /note="Rv2570, (MTCY227.31c), len: 129 aa. Conserved hypothetical protein, similar to Q98GQ7|MLR3218 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (133 aa), FASTA scores: opt: 174, E(): 9.6e-05, (32.25% identity in 124 aa overlap); Q9A390|CC3314 HYPOTHETICAL PROTEIN from Caulobacter crescentus (129 aa), FASTA scores: opt: 155, E(): 0.0017, (33.35% identity in 108 aa overlap); and Q9A2Y0|CC3426 HYPOTHETICAL PROTEIN from Caulobacter crescentus (120 aa), FASTA scores: opt: 144, E(): 0.0083, (32.95% identity in 91 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217086.1" /db_xref="GI:15609707" /db_xref="GeneID:887377" /translation="MATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDRE ALTRAGSEPPSGDIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVR DLEELITEAWLMQAPKQLVQAFLANSG" gene complement(2894893..2895960) /locus_tag="Rv2571c" /db_xref="GeneID:887382" CDS complement(2894893..2895960) /locus_tag="Rv2571c" /function="UNKNOWN" /note="Rv2571c, (MTCY227.30), len: 355 aa. Probable transmembrane ala-, val-, leu-rich protein, showing some similarity with other membrane proteins e.g. Q99340|YFDA_CORGL HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (359 aa), FASTA scores: opt: 338, E(): 2.5e-13, (29.4% identity in 255 aa overlap); Q9RD86|SCF43.02 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (379 aa), FASTA scores: opt: 208, E(): 2.1e-05, (26.05% identity in 303 aa overlap); Q9RD81|SCF43.07 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (419 aa), FASTA scores: opt: 205, E(): 3.5e-05, (25.15% identity in 362 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane alanine and valine and leucine rich protein" /protein_id="NP_217087.1" /db_xref="GI:15609708" /db_xref="GeneID:887382" /translation="MSASLLVRTACGGRAVAQRLRTVLWPITQTSVVAGLAWYLTHDV FNHPQAFFAPISAVVCMSATNVLRARRAQQMIVGVALGIVLGAGVHALLGSGPIAMGV VVFIALSVAVLCARGLVAQGLMFINQAAVSAVLVLVFASNGSVVFERLFDALVGGGLA IVFSILLFPPDPVVMLCSARADVLAAVRDILAELVNTVSDPTSAPPDWPMAAADRLHQ QLNGLIEVRANAAMVARRAPRRWGVRSTVRDLDQQAVYLALLVSSVLHLARTIAGPGG DKLPTPVHAVLTDLAAGTGLADADPTAANEHAAAARATASTLQSAACGSNEVVRADIV QACVTDLQRVIERPGPSGMSA" gene complement(2896013..2897803) /gene="aspS" /locus_tag="Rv2572c" /db_xref="GeneID:888532" CDS complement(2896013..2897803) /gene="aspS" /locus_tag="Rv2572c" /EC_number="6.1.1.12" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-ASPARTATE + TRNA(ASP) = AMP + PYROPHOSPHATE + L-ASPARTYL-TRNA(ASP)]." /note="catalyzes a two-step reaction, first charging an aspartate molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; contains discriminating and non-discriminating subtypes" /codon_start=1 /transl_table=11 /product="aspartyl-tRNA synthetase" /protein_id="NP_217088.1" /db_xref="GI:15609709" /db_xref="GeneID:888532" /translation="MFVLRSHAAGLLREGDAGQQVTLAGWVARRRDHGGVIFIDLRDA SGIAQVVFRDPQDTEVLAQAHRLRAEFCVSVAGVVEIRPEGNANPEIATGEIEVNATS LTVLGECAPLPFQLDEPAGEELRLKYRYLDLRRDDPAAAIRLRSRVNAAARAVLARHD FVEIETPTITRSTPEGARDFLVPARLHPGSFYALPQSPQLFKQLLMVAGMERYYQIAR CYRDEDFRADRQPEFTQLDMEMSFVDAEDIIAISEEVLTELWALIGYRIPTPIPRIGY AEAMRRFGTDKPDLRFGLELVECTDFFSDTTFRVFQAPYVGAVVMPGGASQPRRTLDG WQDWAKQRGHRGLAYVLVAEDGTLGGPVAKNLTEAERTGLADHVGAKPGDCIFFSAGP VKSSRALLGAARVEIANRLGLIDPDAWAFVWVVDPPLFEPADEATAAGEVAVGSGAWT AVHHAFTAPKPEWEDRIESDTGSVLADAYDIVCNGHEIGGGSVRIHRRDIQERVFAVM GLDKAEAEEKFGFLLEAFMFGAPPHGGIAFGWDRTTALLAGMDSIREVIAFPKTGGGV DPLTDAPAPITAQQRKESGIDAQPKRVQQA" misc_feature complement(2897093..2897146) /gene="aspS" /locus_tag="Rv2572c" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1" gene 2897956..2898783 /locus_tag="Rv2573" /db_xref="GeneID:888188" CDS 2897956..2898783 /locus_tag="Rv2573" /EC_number="1.1.1.169" /function="UNKNOWN" /note="ketopantoate reductase; catalyzes the NADPH reduction of ketopantoate to pantoate; functions in pantothenate (vitamin B5) biosynthesis" /codon_start=1 /transl_table=11 /product="2-dehydropantoate 2-reductase" /protein_id="NP_217089.2" /db_xref="GI:161352464" /db_xref="GeneID:888188" /translation="MHKAGYSPLLCGHTPRAGIELRRDGADPIVVPGPVHTSPREVAG PVDVLILAVKATQNDAARPWLTRLCDERTVVAVLQNGVEQVEQVQPHCPSSAVVPAIV WCSAETQPQGWVRLRGEAALVVPTGPAAEQFAGLLRGAGATVDCDPDFTTAAWRKLLV NALAGFMVLSGRRSAMFRRDDVAALSRRYVAECLAVARAEGARLDDDVVDEVVRLVRS APQDMGTSMLADRAAHRPLEWDLRNGVIVRKARAHGLATPISDVLVPLLAAASDGPG" gene 2898806..2899309 /locus_tag="Rv2574" /db_xref="GeneID:887232" CDS 2898806..2899309 /locus_tag="Rv2574" /function="UNKNOWN" /note="Rv2574, (MTCY227.27c), len: 167 aa. Conserved hypothetical protein, showing similarity with Q9K3N3|SCG20A.07 HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (157 aa), FASTA scores: opt: 218, E(): 2.8e-08, (30.65% identity in 150 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217090.1" /db_xref="GI:15609711" /db_xref="GeneID:887232" /translation="MYPCERVGLSFTETAPYLFRNTVDLAITPEQLFEVLADPQAWPR WATVITKVTWTSPEPFGAGTTRIVEMRGGIVGDEEFISWEPFTRMAFRFNECSTRAVG AFAEDYRVQAIPGGCRLTWTMAQKLAGPARPALFVFRPLLNLALRRFLRNLRRYTDAR FAAAQQS" gene 2899339..2900220 /locus_tag="Rv2575" /db_xref="GeneID:888178" CDS 2899339..2900220 /locus_tag="Rv2575" /function="UNKNOWN" /note="Rv2575, (MTCY227.26c), len: 293 aa. Possible conserved membrane gly-rich protein, highly similar to hypothetical proteins e.g. Q9RR98|DR2596 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (313 aa), FASTA scores: opt: 734, E(): 2.8e-38, (42.95% identity in 291 aa overlap); Q9HV81|PA4717 from Pseudomonas aeruginosa (297 aa), FASTA scores: opt: 641, E(): 1.5e-32, (43.35% identity in 300 aa overlap); Q98IA4|MLL2493 from Rhizobium loti (Mesorhizobium loti) (306 aa), FASTA scores: opt: 628, E(): 1e-31, (38.45% identity in 307 aa overlap); etc. Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217091.1" /db_xref="GI:15609712" /db_xref="GeneID:888178" /translation="MTFNEGVQIDTSTTSTSGSGGGRRLAIGGGLGGLLVVVVAMLLG VDPGGVLSQQPLDTRDHVAPGFDLSQCRTGADANRFVQCRVVATGNSVDAVWKPLLPG YTRPHMRLFSGQVGTGCGPASSEVGPFYCPVDKTAYFDTDFFQVLVTQFGSSGGPFAE EYVVAHEYGHHVQNLLGVLGRAQQGAQGAAGSGVRTELQADCYAGVWAYYASTVKQES TGVPYLEPLSDKDIQDALAAAAAVGDDRIQQQTTGRTNPETWTHGSAAQRQKWFTVGY QTGDPNICDTFSAADLG" misc_feature 2899825..2899854 /locus_tag="Rv2575" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(2900226..2900690) /locus_tag="Rv2576c" /db_xref="GeneID:887251" CDS complement(2900226..2900690) /locus_tag="Rv2576c" /function="UNKNOWN" /note="Rv2576c, (MTCY227.25), len: 154 aa. Possible conserved membrane protein, showing similarity with Q9ZFC2 HYPOTHETICAL 15.7 KDA PROTEIN from Mycobacterium sp. FM10 (146 aa), FASTA scores: opt: 235, E(): 4.1e-08, (31.35% identity in 150 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217092.1" /db_xref="GI:15609713" /db_xref="GeneID:887251" /translation="MPAGVGNASGSVLDMTSVRTVPSAVALVTFAGAALSGVIPAIAR ADPVGHQVTYTVTTTSDLMANIRYMSADPPSMAAFNADSSKYMITLHTPIAGGQPLVY TATLANPSQWAIVTASGGLRVNPEFHCEIVVDGQVVVSQDGGSGVQCSTRPW" gene 2900918..2902507 /locus_tag="Rv2577" /db_xref="GeneID:888207" CDS 2900918..2902507 /locus_tag="Rv2577" /function="UNKNOWN" /note="Rv2577, (MTCY227.24c), len: 529 aa. Conserved hypothetical protein, showing similarity with various proteins from eukaryotes, in particular phosphatases, e.g. Q9SE01|PAP PURPLE ACID PHOSPHATASE PRECURSOR (EC 3.1.3.2) from Glycine max (Soybean) (464 aa), FASTA scores: opt: 190, E(): 0.00026, (27.3% identity in 388 aa overlap); Q9SVP2|F18A5.90|AT4G13700 HYPOTHETICAL 53.4 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (474 aa), FASTA scores: opt: 280, E(): 6.6e-10, (27.2% identity in 331 aa overlap); Q9FK32 SIMILARITY TO UNKNOWN PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (529 aa), FASTA scores: opt: 249, E(): 6.2e-08, (25.3% identity in 435 aa overlap); Q12546|APHA ACID PHOSPHATASE PRECURSOR from Aspergillus ficuum (614 aa), FASTA scores: opt: 207, E(): 2.9e-05, (22.95% identity in 458 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217093.1" /db_xref="GI:15609714" /db_xref="GeneID:888207" /translation="MGADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALL SSHPRGPAVWYQRGRSGAPPVGGLHLQFGRNASTEMVVSWHTTDTVGNPRVMLGTPTS GFGSVVVAETRSYRDAKSNTEVRVNHAHLTNLTPDTDYVYAAVHDGTTPELGTARTAP SGRKPLRFTSFGDQSTPALGRLADGRYVSDNIGSPFAGDITIAIERIAPLFNLINGDL CYANLAQDRIRTWSDWFDNNTRSARYRPWMPAAGNHENEVGNGPIGYDAYQTYFAVPD SGSSPQLRGLWYSFTAGSVRVISLHNDDVCYQDGGNSYVRGYSGGEQRRWLQAELANA RRDSEIDWVVVCMHQTAISTADDNNGADLGIRQEWLPLFDQYQVDLVVCGHEHHYERS HPLRGALGTDTRTPIPVDTRSDLIDSTRGTVHLVIGGGGTSKPTNALLFPQPRCQVIT GVGDFDPAIRRKPSIFVLEDAPWSAFRDRDNPYGFVAFDVDPGQPGGTTSIKATYYAV TGPFGGLTVIDQFTLTKPRGG" gene complement(2902509..2903531) /locus_tag="Rv2578c" /db_xref="GeneID:887427" CDS complement(2902509..2903531) /locus_tag="Rv2578c" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv2578c, (MTCY227.23), len: 340 aa. Conserved hypothetical protein, highly similar to hypothetical proteins (conserved or not) e.g. Q9ZBJ3|SC9C7.17c from Streptomyces coelicolor (348 aa), FASTA scores: opt: 998, E(): 1.6e-55, (47.6% identity in 355 aa overlap); Q9I763|PA0069 from Pseudomonas aeruginosa (352 aa), FASTA scores: opt: 560, E(): 6e-28, (36.6% identity in 284 aa overlap); Q986C9|MLL7417 from Rhizobium loti (Mesorhizobium loti) (356 aa), FASTA scores: opt: 550, E(): 2.6e-27, (39.15% identity in 240 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217094.1" /db_xref="GI:15609715" /db_xref="GeneID:887427" /translation="MRWARQAVAVNGMPVDDGALPGLQRIGLVRSVRAPQFDGITFHE VLCKSALNKVPNAAALPFRYTVNGYRGCSHACRYCFARPTHEYLDFNPGTDFDTQVVV KTNVAAVLRHELRRPSWRRETVALGTNTDPYQRAEGRYALMPGIIGALAASGTPLSIL TKGTLLRRDLPLIAEAAQQVPVSVAVSLAVGDPELHRDVESGTPTPQARLALITAIRA AGLDCHVMVAPVLPQLTDSGEHLDQLLGQIAAAGATGVTVFGLHLRGSTRGWFMCWLA RAHPELVSRYRELYRRGPYLPPSYREMLRERVAPLIAKYRLAGDHRPAPPETEAALVP VQATLF" gene 2903639..2904541 /gene="dhaA" /locus_tag="Rv2579" /db_xref="GeneID:888599" CDS 2903639..2904541 /gene="dhaA" /locus_tag="Rv2579" /EC_number="3.8.1.5" /function="GENERATES A PRIMARY ALCOHOL AND HALIDE FROM 1-HALOALKANE AND H2O [CATALYTIC ACTIVITY: 1-HALOALKANE + H2O = A PRIMARY ALCOHOL + HALIDE]. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the cleavage of carbon-halogen bonds in aliphatic compounds forming a primary alcohol and a halide" /codon_start=1 /transl_table=11 /product="haloalkane dehalogenase" /protein_id="YP_177890.1" /db_xref="GI:57117001" /db_xref="GeneID:888599" /translation="MTAFGVEPYGQPKYLEIAGKRMAYIDEGKGDAIVFQHGNPTSSY LWRNIMPHLEGLGRLVACDLIGMGASDKLSPSGPDRYSYGEQRDFLFALWDALDLGDH VVLVLHDWGSALGFDWANQHRDRVQGIAFMEAIVTPMTWADWPPAVRGVFQGFRSPQG EPMALEHNIFVERVLPGAILRQLSDEEMNHYRRPFVNGGEDRRPTLSWPRNLPIDGEP AEVVALVNEYRSWLEETDMPKLFINAEPGAIITGRIRDYVRSWPNQTEITVPGVHFVQ EDSPEEIGAAIAQFVRRLRSAAGV" gene complement(2904821..2906092) /gene="hisS" /locus_tag="Rv2580c" /db_xref="GeneID:887479" CDS complement(2904821..2906092) /gene="hisS" /locus_tag="Rv2580c" /EC_number="6.1.1.21" /function="INVOLVED IN TRANSLATION MECHAMISM [CATALYTIC ACTIVITY: ATP + L-HISTIDINE + TRNA(HIS) = AMP + PYROPHOSPHATE + L-HISTIDYL-TRNA(HIS)]." /note="catalyzes a two-step reaction, first charging a histidine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; class II aminoacyl-tRNA synthetase; forms homodimers; some organisms have a paralogous gene, hisZ, that is similar to hisS and produces a protein that performs the first step in histidine biosynthesis along with HisG" /codon_start=1 /transl_table=11 /product="histidyl-tRNA synthetase" /protein_id="NP_217096.1" /db_xref="GI:15609717" /db_xref="GeneID:887479" /translation="MTEFSSFSAPKGVPDYVPPDSAQFVAVRDGLLAAARQAGYSHIE LPIFEDTALFARGVGESTDVVSKEMYTFADRGDRSVTLRPEGTAGVVRAVIEHGLDRG ALPVKLCYAGPFFRYERPQAGRYRQLQQVGVEAIGVDDPALDAEVIAIADAGFRSLGL DGFRLEITSLGDESCRPQYRELLQEFLFGLDLDEDTRRRAGINPLRVLDDKRPELRAM TASAPVLLDHLSDVAKQHFDTVLAHLDALGVPYVINPRMVRGLDYYTKTAFEFVHDGL GAQSGIGGGGRYDGLMHQLGGQDLSGIGFGLGVDRTVLALRAEGKTAGDSARCDVFGV PLGEAAKLRLAVLAGRLRAAGVRVDLAYGDRGLKGAMRAAARSGARVALVAGDRDIEA GTVAVKDLTTGEQVSVSMDSVVAEVISRLAG" misc_feature complement(2905127..2905150) /gene="hisS" /locus_tag="Rv2580c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2906089..2906763) /locus_tag="Rv2581c" /db_xref="GeneID:888217" CDS complement(2906089..2906763) /locus_tag="Rv2581c" /EC_number="3.1.2.6" /function="INVOLVED IN GLYOXAL PATHWAY. THIOLESTERASE THAT CATALYZES THE HYDROLYSIS OF S-D-LACTOYL-GLUTATHIONE TO FORM GLUTATHIONE AND D-LACTIC ACID [CATALYTIC ACTIVITY: (S)-(2-HYDROXYACYL)GLUTATHIONE + H(2)O = GLUTATHIONE + A 2-HYDROXY ACID ANION]." /experiment="experimental evidence, no additional details recorded" /note="Rv2581c, (MTCY227.20), len: 224 aa. Possible glyoxalase II (EC 3.1.2.6), equivalent to Q49649|YP81_MYCLE|ML0493|MLCB1259.11|B1177_C3_247 HYPOTHETICAL 23.9 KDA PROTEIN from Mycobacterium leprae (218 aa), FASTA scores: opt: 1264, E(): 7.8e-73, (82.0% identity in 222 aa overlap). Also highly similar to Q9KXP1|SC9C5.33c POSSIBLE HYDROLASE from Streptomyces coelicolor (235 aa), FASTA scores: opt: 654, E(): 2.9e-34, (46.8% identity in 220 aa overlap); and similar to Q9CI24|YFCI HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (210 aa), FASTA scores: opt: 360, E(): 9.9e-16, (35.0% identity in 217 aa overlap); AAK75726|SP1646 METALLO-BETA-LACTAMASE SUPERFAMILY PROTEIN from Streptococcus pneumoniae (209 aa), FASTA scores: opt: 320, E(): 3.3e-13, (35.85% identity in 198 aa overlap); AAK80229|CAC2272 PREDICTED ZN-DEPENDENT HYDROLASE OF METALLO-BETA-LACTAMASE SUPERFAMILY from Clostridium acetobutylicum (199 aa), FASTA scores: opt: 282, E(): 8e-11, (32.7% identity in 217 aa overlap); etc. Equivalent to AAK46971 from Mycobacterium tuberculosis strain CDC1551 (246 aa) but shorter 22 aa. BELONGS TO THE GLYOXALASE II FAMILY. COFACTOR: BINDS TWO ZINC IONS." /codon_start=1 /transl_table=11 /product="glyoxalase II" /protein_id="NP_217097.1" /db_xref="GI:15609718" /db_xref="GeneID:888217" /translation="MLITGFPAGLLACNCYVLAERPGTDAVIVDPGQGAMGTLRRILD KNRLTPAAVLLTHGHIDHIWSAQKVSDTFGCPTYVHPADRFMLTDPIYGLGPRIAQLV AGAFFREPKQVVELDRDGDKIDLGGISVNIDHTPGHTRGSVVFRVLQATNNDKDIVFT GDTLFERAIGRTDLAGGSGRDLLRSIVDKLLVLDDSTVVLPGHGNSTTIGAERRFNPF LEGLSR" gene 2906814..2907740 /gene="ppiB" /locus_tag="Rv2582" /db_xref="GeneID:887691" CDS 2906814..2907740 /gene="ppiB" /locus_tag="Rv2582" /EC_number="5.2.1.8" /function="PPIASES ACCELERATE THE FOLDING OF PROTEINS [CATALYTIC ACTIVITY: CIS-TRANS ISOMERIZATION OF PROLINE IMIDIC PEPTIDE BONDS INOLIGOPEPTIDES]." /note="Rv2582, (MTCY227.19c), len: 308 aa. Probable ppiB (alternate gene name: ppi), cyclophilin (peptidyl-prolyl cis-trans isomerase) (EC 5.2.1.8), equivalent to P46697|PPIB_MYCLE|PPI|ML0492|MLCB1259.10c|B1177_F3_97 PROBABLE PEPTIDYL-PROLYL CIS-TRANS ISOMERASE B from Mycobacterium leprae (295 aa), FASTA scores: opt: 1423, E(): 1.3e-66, (72.2% identity in 295 aa overlap). Aldo similar to others e.g. Q9KJG8|PPIB PEPTIDYL-PROLYL CIS-TRANS ISOMERASE from Streptomyces lividans (277 aa), FASTA scores: opt: 485, E(): 3.2e-18, (38.35% identity in 292 aa overlap); Q9KXP0|SC9C5.34 PEPTIDYL-PROLYL CIS-TRANS ISOMERASE from Streptomyces coelicolor (277 aa), FASTA scores: opt: 483, E(): 4.1e-18, (38.35% identity in 292 aa overlap); Q9RT72|DR1893 PEPTIDYL-PROLYL CIS-TRANS ISOMERASE from Deinococcus radiodurans (350 aa), FASTA scores: opt: 296, E(): 2.2e-08, (29.0% identity in 276 aa overlap); etc. BELONGS TO THE CYCLOPHILIN-TYPE PPIASE FAMILY.; ppi" /codon_start=1 /transl_table=11 /product="peptidyl-prolyl cis-trans isomerase B" /protein_id="NP_217098.1" /db_xref="GI:15609719" /db_xref="GeneID:887691" /translation="MGHLTPVAAPRLACAFVPTNAQRRATAKRKLERQLERRAKQAKR RRILTIVGGSLAAVAVIVAVVVTVVVNKDDHQSTTSATPTDSASTSPPQAATAPPLPP FKPSANLGANCQYPPSPDKAVKPVKLPRTGKVPTDPAQVSVSMVTNQGNIGLMLANNE SPCTVNSFVSLAQQGFFKGTTCHRLTTSPMLAVLQCGDPKGDGTGGPGYQFANEYPTD QYSANDPKLNEPVIYPRGTLAMANAGPNTNSSQFFMVYRDSKLPPQYTVFGTIQADGL TTLDKIAKAGVAGGGEDGKPATEVTITSVLLD" gene complement(2907826..2910198) /gene="relA" /locus_tag="Rv2583c" /db_xref="GeneID:887888" CDS complement(2907826..2910198) /gene="relA" /locus_tag="Rv2583c" /EC_number="2.7.6.5" /function="INVOLVED IN THE METABOLISM OF PPGPP (AT THE FIRST STEP). IN EUBACTERIA PPGPP (GUANOSINE 3'-DIPHOSPHATE 5-'DIPHOSPHATE) IS A MEDIATOR OF THE STRINGENT RESPONSE THAT COORDINATES A VARIETY OF CELLULAR ACTIVITIES IN RESPONSE TO CHANGES IN NUTRITIONAL ABUNDANCE. THIS ENZYME CATALYZES THE FORMATION OF PPPGPP WHICH IS THEN HYDROLYSED TO FORM PPGPP [CATALYTIC ACTIVITY: ATP + GTP = AMP + GUANOSINE 3'-DIPHOSPHATE 5-'TRIPHOSPHATE]." /note="Rv2583c, (MTCY227.18), len: 790 aa. Probable relA, GTP pyrophosphokinase (EC 2.7.6.5), equivalent to Q49640|RELA_MYCLE|ML0491|MLCB1259.09|B1177_C1_168 PROBABLE GTP PYROPHOSPHOKINASE from Mycobacterium leprae (787 aa), FASTA scores: opt: 4834, E(): 0, (93.4% identity in 790 aa overlap). Also highly similar to others e.g. O87331|RELA_CORGL|RELA|REL from Corynebacterium glutamicum (Brevibacterium flavum) (760 aa), FASTA scores: opt: 3375, E(): 1.6e-196, (67.0% identity in 758 aa overlap); O85709|RELA_STRAT from Streptomyces antibioticus (841 aa), FASTA scores: opt: 3209, E(): 1.9e-186, (63.85% identity in 786 aa overlap); Q9KDH1|RELA|BH1242 from Bacillus halodurans (728 aa), FASTA scores: opt: 2195,E(): 3.8e-125, (45.65% identity in 714 aa overlap); etc. BELONGS TO THE RELA / SPOT FAMILY." /codon_start=1 /transl_table=11 /product="GTP pyrophosphokinase" /protein_id="NP_217099.1" /db_xref="GI:15609720" /db_xref="GeneID:887888" /translation="MAEDQLTAQAVAPPTEASAALEPALETPESPVETLKTSISASRR VRARLARRMTAQRSTTNPVLEPLVAVHREIYPKADLSILQRAYEVADQRHASQLRQSG DPYITHPLAVANILAELGMDTTTLVAALLHDTVEDTGYTLEALTEEFGEEVGHLVDGV TKLDRVVLGSAAEGETIRKMITAMARDPRVLVIKVADRLHNMRTMRFLPPEKQARKAR ETLEVIAPLAHRLGMASVKWELEDLSFAILHPKKYEEIVRLVAGRAPSRDTYLAKVRA EIVNTLTASKIKATVEGRPKHYWSIYQKMIVKGRDFDDIHDLVGVRILCDEIRDCYAA VGVVHSLWQPMAGRFKDYIAQPRYGVYQSLHTTVVGPEGKPLEVQIRTRDMHRTAEYG IAAHWRYKEAKGRNGVLHPHAAAEIDDMAWMRQLLDWQREAADPGEFLESLRYDLAVQ EIFVFTPKGDVITLPTGSTPVDFAYAVHTEVGHRCIGARVNGRLVALERKLENGEVVE VFTSKAPNAGPSRDWQQFVVSPRAKTKIRQWFAKERREEALETGKDAMAREVRRGGLP LQRLVNGESMAAVARELHYADVSALYTAIGEGHVSAKHVVQRLLAELGGIDQAEEELA ERSTPATMPRRPRSTDDVGVSVPGAPGVLTKLAKCCTPVPGDVIMGFVTRGGGVSVHR TDCTNAASLQQQAERIIEVLWAPSPSSVFLVAIQVEALDRHRLLSDVTRALADEKVNI LSASVTTSGDRVAISRFTFEMGDPKHLGHLLNAVRNVEGVYDVYRVTSAA" gene complement(2910229..2910900) /gene="apt" /locus_tag="Rv2584c" /db_xref="GeneID:888579" CDS complement(2910229..2910900) /gene="apt" /locus_tag="Rv2584c" /EC_number="2.4.2.7" /function="INVOLVED IN PURINE SALVAGE. CATALYSES A SALVAGE REACTION RESULTING IN THE FORMATION OF AMP, THAT IS ENERGICALLY LESS COSTLY THAN DE NOVO SYNTHESIS [CATALYTIC ACTIVITY: AMP + PYROPHOSPHATE = ADENINE + 5-PHOSPHO-ALPHA-D-RIBOSE 1-DIPHOSPHATE]." /note="catalyzes a salvage reaction resulting in the formation of AMP which is metabolically less costly than a de novo synthesis" /codon_start=1 /transl_table=11 /product="adenine phosphoribosyltransferase" /protein_id="NP_217100.1" /db_xref="GI:15609721" /db_xref="GeneID:888579" /translation="MCHGGTWAGDYVLNVIATGLSLKARGKRRRQRWVDDGRVLALGE SRRSSAISVADVVASLTRDVADFPVPGVEFKDLTPLFADRRGLAAVTEALADRASGAD LVAGVDARGFLVAAAVATRLEVGVLAVRKGGKLPRPVLSEEYYRAYGAATLEILAEGI EVAGRRVVIIDDVLATGGTIGATRRLLERGGANVAGAAVVVELAGLSGRAALAPLPVH SLSRL" misc_feature complement(2910358..2910384) /gene="apt" /locus_tag="Rv2584c" /note="PS00144 Asparaginase / glutaminase active site signature 1" misc_feature complement(2910364..2910402) /gene="apt" /locus_tag="Rv2584c" /note="PS00103 Purine/pyrimidine phosphoribosyl transferases signature" gene complement(2911004..2912677) /locus_tag="Rv2585c" /db_xref="GeneID:887701" CDS complement(2911004..2912677) /locus_tag="Rv2585c" /function="UNKNOWN" /note="Rv2585c, (MT2662, MTCY227.16), len: 557 aa. Possible conserved lipoprotein precursor, possibly attached to the membrane by a lipid anchor and substrate-binding protein involved in transport, equivalent to Q49646|YP85_MYCLE|ML0489|MLCB1259.07|B1177_C2_197 HYPOTHETICAL LIPOPROTEIN PRECURSOR from Mycobacterium leprae (555 aa), FASTA scores: opt: 2812, E(): 9.8e-158, (78.95% identity in 546 aa overlap); and C-terminus highly similar to C-terminus of Q49638|DCIAE|B1177_C1_166 DCIAE PROTEIN from Mycobacterium leprae (344 aa), FASTA scores: opt: 1177, E(): 7.4e-62, (78.6% identity in 229 aa overlap). Also similar in part to various proteins, principally substrate-binding proteins, e.g. O87329|DCIAE DIPEPTIDE-BINDING PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (502 aa), FASTA scores: opt: 614, E(): 1.2e-28, (30.7% identity in 427 aa overlap); Q9AKR0|OPPA|CAC49261 PUTATIVE OLIGOPEPTIDE UPTAKE ABC TRANSPORTER PERIPLASMIC SOLUTE-BINDING PROTEIN PRECURSOR from Rhizobium meliloti (Sinorhizobium meliloti) (532 aa), FASTA scores: opt: 209, E(): 7.7e-05, (22.85% identity in 460 aa overlap); P76128|YDDS_ECOLI|B1487|P77769|P76874 PUTATIVE ABC TRANSPORTER PERIPLASMIC BINDING PROTEIN from Escherichia coli strain K12 (516 aa), FASTA scores: opt: 182, E(): 0.0029, (20.0% identity in 315 aa overlap); etc." /codon_start=1 /transl_table=11 /product="lipoprotein" /protein_id="NP_217101.1" /db_xref="GI:15609722" /db_xref="GeneID:887701" /translation="MAPRRRRHTRIAGLRVVGTATLVAATTLTACSGSAAAQIDYVVD GALVTYNTNTVIGAASAGAQAFARTLTGFGYHGPDGQVVADRDFGTVSVVEGSPLILD YQISDDAVYSDGRPVTCDDLVLAWAAQSGRFPGFDAATQAGYVDIANIECTAGQKKAR VSFIPDRSVVDHSQLFTATSLMPSHVIADQLHIDVTAALLSNNVSAVEQIARLWNSTW DLKPGRSHDEVRSRFPSSGPYKIESVLDDGAVVLVANDRWWGTKAITKRITVWPQGAD IQDRVNNRSVDVVDVAAGSSGSLVTPDSYQRTDYPSAGIEQLIFAPQGSLAQSRTRRA LALCVPRDAIARDAGVPIANSRLSPATDDALTDADGAAEARQFGRVDPAAARDALGGT PLTVRIGYGRPNARLAATIGTIADACAPAGITVSDVTVDTPGPQALRDGKIDVLLAST GGATGSGSSGSCAMDAYDLHSGNGNNLSGYANAQIDGIISALAVSADPAERARLLAEA APVLWDEMPTLPLYRQQRTLLMSTKMYAVSRNPTRWGAGWNMDRWALAR" gene complement(2912683..2914011) /gene="secF" /locus_tag="Rv2586c" /db_xref="GeneID:887229" CDS complement(2912683..2914011) /gene="secF" /locus_tag="Rv2586c" /function="INVOLVED IN PROTEIN EXPORT. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA, SECB, SECD, SECE, SECF, SECG AND SECY." /note="forms a complex with SecD and YajC; SecDFyajC stimulates the proton motive force-driven protein translocation; seems to modulate the cycling of SecA by stabilizing its membrane-inserted state and appears to be required for the release of mature proteins from the extracytoplasmic side of the membrane; in some organisms, such as Bacillus subtilis, SecD is fused to SecF" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecF" /protein_id="NP_217102.1" /db_xref="GI:15609723" /db_xref="GeneID:887229" /translation="MASKAKTGRDDEATSAVELTEATESAVARTDGDSTTDTASKLGH HSFLSRLYTGTGAFEVVGRRRLWFGVSGAIVAVAIASIVFRGFTFGIDFKGGTTVSFP RGSTQVAQVEDVYYRALGSEPQSVVIVGAGASATVQIRSETLTSDQTAKLRDALFEAF GPKGTDGQPSKQAISDSAVSETWGGQITKKAVIALVVFLVLVALYITVRYERYMTISA ITAMLFDLTVTAGVYSLVGFEVTPATVIGLLTILGFSLYDTVIVFDKVEENTHGFQHT TRRTFAEQANLAINQTFMRSINTSLIGVLPVLALMVVAVWLLGVGTLKDLALVQLIGI IIGTYSSIFFATPLLVTLRERTELVRNHTRRVLKRRNSGSPAGSEDASTDGGEQPAAA DEQSLVGITQASSQSAPRAAQGSSKPAPGARPVRPVGTRRPTGKRNAGRR" gene complement(2914015..2915736) /gene="secD" /locus_tag="Rv2587c" /db_xref="GeneID:888192" CDS complement(2914015..2915736) /gene="secD" /locus_tag="Rv2587c" /function="INVOLVED IN PROTEIN EXPORT. PART OF THE PROKARYOTIC PROTEIN TRANSLOCATION APPARATUS WHICH COMPRISE SECA, SECB, SECD, SECE, SECF, SECG AND SECY." /note="part of the preprotein secretory system; when complexed with proteins SecF and YajC, SecDFyajC stimulates the proton motive force-driven protein translocation, and appears to be required for the release of mature proteins from the extracytoplasmic side of the membrane" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecD" /protein_id="NP_217103.1" /db_xref="GI:15609724" /db_xref="GeneID:888192" /translation="MASSSAPVHPARYLSVFLVMLIGIYLLVFFTGDKHTAPKLGIDL QGGTRVTLTARTPDGSAPSREALAQAQQIISARVNGLGVSGSEVVVDGDNLVITVPGN DGSEARNLGQTARLYIRPVLNSMPAQPAAEEPQPAPSAEPQPPGQPAAPPPAQSGAPA SPQPGAQPRPYPQDPAPSPNPTSPASPPPAPPAEAPATDPRKDLAERIAQEKKLRQST NQYMQMVALQFQATRCESDDILAGNDDPKLPLVTCSTDHKTAYLLAPSIISGDQIQNA TSGMDQRGIGYVVDLQFKGPAANIWADYTAAHIGTQTAFTLDSQVVSAPQIQEAIPGG RTQISGGDPPFTAATARQLANVLKYGSLPLSFEPSEAQTVSATLGLSSLRAGMIAGAI GLLLVLVYSLLYYRVLGLLTALSLVASGSMVFAILVLLGRYINYTLDLAGIAGLIIGI GTTADSFVVFFERIKDEIREGRSFRSAVPRGWARARKTIVSGNAVTFLAAAVLYFLAI GQVKGFAFTLGLTTILDLVVVFLVTWPLVYLASKSSLLAKPAYNGLGAVQQVARERRA MARTGRG" gene complement(2915846..2916193) /gene="yajC" /locus_tag="Rv2588c" /db_xref="GeneID:887346" CDS complement(2915846..2916193) /gene="yajC" /locus_tag="Rv2588c" /function="THOUGHT TO BE INVOLVED IN SECRETION APPARATUS." /experiment="experimental evidence, no additional details recorded" /note="member of preprotein translocase; forms a heterotrimer with SecD and SecF; links the SecD/SecF/YajC/YidC complex with the SecY/SecE/SecG complex" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit YajC" /protein_id="NP_217104.1" /db_xref="GI:15609725" /db_xref="GeneID:887346" /translation="MESFVLFLPFLLIMGGFMYFASRRQRRAMQATIDLHDSLQPGER VHTTSGLEATIVAIADDTIDLEIAPGVVTTWMKLAIRDRILPDDDIDEELNEDLDKDV DDVAGERRVTNDS" gene 2916360..2917709 /gene="gabT" /locus_tag="Rv2589" /db_xref="GeneID:887915" CDS 2916360..2917709 /gene="gabT" /locus_tag="Rv2589" /EC_number="2.6.1.19" /function="INVOLVED IN 4-AMINOBUTYRATE (GABA) DEGRADATION PATHWAY [CATALYTIC ACTIVITY: 4-AMINOBUTANOATE + 2-OXOGLUTARATE = SUCCINATE SEMIALDEHYDE + L-GLUTAMATE]." /note="catalyzes the formation of succinate semialdehyde and glutamate from 4-aminobutanoate and 2-oxoglutarate" /codon_start=1 /transl_table=11 /product="4-aminobutyrate aminotransferase" /protein_id="NP_217105.1" /db_xref="GI:15609726" /db_xref="GeneID:887915" /translation="MASLQQSRRLVTEIPGPASQALTHRRAAAVSSGVGVTLPVFVAR AGGGIVEDVDGNRLIDLGSGIAVTTIGNSSPRVVDAVRTQVAEFTHTCFMVTPYEGYV AVAEQLNRITPGSGPKRSVLFNSGAEAVENAVKIARSYTGKPAVVAFDHAYHGRTNLT MALTAKSMPYKSGFGPFAPEIYRAPLSYPYRDGLLDKQLATNGELAAARAIGVIDKQV GANNLAALVIEPIQGEGGFIVPAEGFLPALLDWCRKNHVVFIADEVQTGFARTGAMFA CEHEGPDGLEPDLICTAKGIADGLPLSAVTGRAEIMNAPHVGGLGGTFGGNPVACAAA LATIATIESDGLIERARQIERLVTDRLTTLQAVDDRIGDVRGRGAMIAVELVKSGTTE PDAGLTERLATAAHAAGVIILTCGMFGNIIRLLPPLTIGDELLSEGLDIVCAILADL" misc_feature 2917134..2917256 /gene="gabT" /locus_tag="Rv2589" /note="PS00600 Aminotransferases class-III pyridoxal-phosphate attachment site" gene 2917871..2921377 /gene="fadD9" /locus_tag="Rv2590" /db_xref="GeneID:888574" CDS 2917871..2921377 /gene="fadD9" /locus_tag="Rv2590" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv2590, (MTCY227.11c), len: 1168 aa. Probable fadD9, fatty-acid-CoA synthetase (EC 6.2.1.-), highly similar to O69484|FADD9 (alias Q9CCT4|FADD9|ML0484 but longer 14 aa) PUTATIVE ACYL-CoA SYNTHETASE from Mycobacterium leprae (1174 aa), FASTA scores: opt: 5247, E(): 0, (68.0% identity in 1178 aa overlap); Q49651|LCLA|B1177_F1_23 PUTATIVE LONG-CHAIN-FATTY-ACID--CoA LIGASE from Mycobacterium leprae (827 aa), FASTA scores: opt: 3170, E(): 7.1e-181, (63.9% identity in 770 aa overlap). N-terminal (700 residues) similar to other long chain fatty acid ligases. And C-terminus highly similar to C-terminus of Q9XCF2|PSTB PSTB PROTEIN from Mycobacterium avium (2552 aa), FASTA scores: opt: 2083, E(): 8.4e-116, (40.8% identity in 1150 aa overlap) (and weak similarity on N-terminus); Q49653|POL1|B1177_F2_70 POL1 PROTEIN from Mycobacterium leprae (400 aa), FASTA scores: opt: 2066, E(): 2e-115, (76.25% identity in 404 aa overlap). C-terminal part highly similar to polyketide synthases and peptides synthases (weak similarity on N-terminus) e.g. Q10896|Rv0101|MTCY251.20|NRP PROBABLE PEPTIDE SYNTHETASE from Mycobacterium tuberculosis (2512 aa), FASTA scores: opt: 1988, E(): 3.7e-110, (40.2% identity in 1181 aa overlap); etc. Contains PS00455 putative AMP-binding domain signature, and PS00061 Short-chain alcohol dehydrogenase family signature. SEEMS TO BELONG TO THE ATP-DEPENDENT AMP-BINDING ENZYME FAMILY, AND TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="fatty-acid-CoA ligase" /protein_id="NP_217106.1" /db_xref="GI:15609727" /db_xref="GeneID:888574" /translation="MSINDQRLTRRVEDLYASDAQFAAASPNEAITQAIDQPGVALPQ LIRMVMEGYADRPALGQRALRFVTDPDSGRTMVELLPRFETITYRELWARAGTLATAL SAEPAIRPGDRVCVLGFNSVDYTTIDIALIRLGAVSVPLQTSAPVTGLRPIVTETEPT MIATSIDNLGDAVEVLAGHAPARLVVFDYHGKVDTHREAVEAARARLAGSVTIDTLAE LIERGRALPATPIADSADDALALLIYTSGSTGAPKGAMYRESQVMSFWRKSSGWFEPS GYPSITLNFMPMSHVGGRQVLYGTLSNGGTAYFVAKSDLSTLFEDLALVRPTELCFVP RIWDMVFAEFHSEVDRRLVDGADRAALEAQVKAELRENVLGGRFVMALTGSAPISAEM TAWVESLLADVHLVEGYGSTEAGMVLNDGMVRRPAVIDYKLVDVPELGYFGTDQPYPR GELLVKTQTMFPGYYQRPDVTAEVFDPDGFYRTGDIMAKVGPDQFVYLDRRNNVLKLS QGEFIAVSKLEAVFGDSPLVRQIFIYGNSARAYPLAVVVPSGDALSRHGIENLKPVIS ESLQEVARAAGLQSYEIPRDFIIETTPFTLENGLLTGIRKLARPQLKKFYGERLERLY TELADSQSNELRELRQSGPDAPVLPTLCRAAAALLGSTAADVRPDAHFADLGGDSLSA LSLANLLHEIFGVDVPVGVIVSPASDLRALADHIEAARTGVRRPSFASIHGRSATEVH ASDLTLDKFIDAATLAAAPNLPAPSAQVRTVLLTGATGFLGRYLALEWLDRMDLVNGK LICLVRARSDEEAQARLDATFDSGDPYLVRHYRELGAGRLEVLAGDKGEADLGLDRVT WQRLADTVDLIVDPAALVNHVLPYSQLFGPNAAGTAELLRLALTGKRKPYIYTSTIAV GEQIPPEAFTEDADIRAISPTRRIDDSYANGYANSKWAGEVLLREAHEQCGLPVTVFR CDMILADTSYTGQLNLPDMFTRLMLSLAATGIAPGSFYELDAHGNRQRAHYDGLPVEF VAEAICTLGTHSPDRFVTYHVMNPYDDGIGLDEFVDWLNSPTSGSGCTIQRIADYGEW LQRFETSLRALPDRQRHASLLPLLHNYREPAKPICGSIAPTDQFRAAVQEAKIGPDKD IPHLTAAIIAKYISNLRLLGLL" misc_feature 2918594..2918629 /gene="fadD9" /locus_tag="Rv2590" /note="PS00455 Putative AMP-binding domain signature" misc_feature 2920667..2920753 /gene="fadD9" /locus_tag="Rv2590" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 2921551..2923182 /gene="PE_PGRS44" /locus_tag="Rv2591" /db_xref="GeneID:887992" CDS 2921551..2923182 /gene="PE_PGRS44" /locus_tag="Rv2591" /function="UNKNOWN" /note="Rv2591, (MTCY227.10c), len: 543 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to others e.g. O53845|Rv0834c|MTV043.26c from Mycobacterium tuberculosis (882 aa), FASTA scores: opt: 1813, E(): 5.8e-66, (55.3% identity in 568 aa overlap). Equivalent to AAK46982 from Mycobacterium tuberculosis strain CDC1551 (505 aa) but longer 38 aa. Contains PS00583 pfkB family of carbohydrate kinases signature 1." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177891.1" /db_xref="GI:57117002" /db_xref="GeneID:887992" /translation="MSFVTAAPEMLATAAQNVANIGTSLSAANATAAASTTSVLAAGA DEVSQAIARLFSDYATHYQSLNAQAAAFHHSFVQTLNAAGGAYSSAEAANASAQALEQ NLLAVINAPAQALFGRPLIGNGANGTAASPNGGDGGILYGNGGNGFSQTTAGVAGGAG GSAGLIGNGGNGGAGGAGAAGGAGGAGGWLLGNGGAGGPGGPTDVPAGTGGAGGAGGD APLIGWGGNGGPGGFAAFGNGGAGGNGGASGSLFGVGGAGGVGGSSEDVGGTGGAGGA GRGLFLGLGGDGGAGGTSNNNGGDGGAGGTAGGRLFSLGGDGGNGGAGTAIGSNAGDG GAGGDSSALIGYAQGGSGGLGGFGESTGGDGGLGGAGAVLIGTGVGGFGGLGGGSNGT GGAGGAGGTGATLIGLGAGGGGGIGGFAVNVGNGVGGLGGQGGQGAALIGLGAGGAGG AGGATVVGLGGNGGDGGDGGGLFSIGVGGDGGNAGNGAMPANGGNGGNAGVIANGSFA PSFVGFGGNGGNGVNGGTGGSGGILFGANGANGPS" misc_feature 2922706..2922777 /gene="PE_PGRS44" /locus_tag="Rv2591" /note="PS00583 pfkB family of carbohydrate kinases signature 1" gene complement(2923199..2924233) /gene="ruvB" /locus_tag="Rv2592c" /db_xref="GeneID:888173" CDS complement(2923199..2924233) /gene="ruvB" /locus_tag="Rv2592c" /EC_number="3.1.22.4" /function="FORMS A COMPLEX WITH RUVA. RUVB COULD POSSESS WEAK ATPASE ACTIVITY, WHICH WILL BE STIMULATED BY THE RUVA PROTEIN IN THE PRESENCE OF DNA. THE RUVA-RUVB COMPLEX IN THE PRESENCE OF ATP RENATURES CRUCIFORM STRUCTURE IN SUPERCOILED DNA WITH PALINDROMIC SEQUENCE, INDICATING THAT IT MAY PROMOTE STRAND EXCHANGE REACTIONS IN HOMOLOGOUS RECOMBINATION. RUVAB IS AN HELICASE THAT MEDIATES THE HOLLIDAY JUNCTION MIGRATION BY LOCALIZED DENATURATION AND REANNELING." /experiment="experimental evidence, no additional details recorded" /note="promotes strand exchange during homologous recombination; RuvAB complex promotes branch migration; RuvABC complex scans the DNA during branch migration and resolves Holliday junctions at consensus sequences; forms hexameric rings around opposite DNA arms; requires ATP for branch migration and orientation of RuvAB complex determines direction of migration" /codon_start=1 /transl_table=11 /product="Holliday junction DNA helicase RuvB" /protein_id="NP_217108.1" /db_xref="GI:15609729" /db_xref="GeneID:888173" /translation="MTERSDRDVSPALTVGEGDIDVSLRPRSLREFIGQPRVREQLQL VIEGAKNRGGTPDHILLSGPPGLGKTSLAMIIAAELGSSLRVTSGPALERAGDLAAML SNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIPLEVAPFTLVG ATTRSGALTGPLRDRFGFTAHMDFYEPAELERVLARSAGILGIELGADAGAEIARRSR GTPRIANRLLRRVRDFAEVRADGVITRDVAKAALEVYDVDELGLDRLDRAVLSALTRS FGGGPVGVSTLAVAVGEEAATVEEVCEPFLVRAGMVARTPRGRVATALAWTHLGMTPP VGASQPGLFE" misc_feature complement(2924024..2924047) /gene="ruvB" /locus_tag="Rv2592c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2924230..2924820) /gene="ruvA" /locus_tag="Rv2593c" /db_xref="GeneID:887688" CDS complement(2924230..2924820) /gene="ruvA" /locus_tag="Rv2593c" /function="FORMS A COMPLEX WITH RUVB. RUVA STIMULATES, IN THE PRESENCE OF DNA, THE WEAK ATPASE ACTIVITY OF RUVB. THE RUVA-RUVB COMPLEX IN THE PRESENCE OF ATP RENATURES CRUCIFORM STRUCTURE IN SUPERCOILED DNA WITH PALINDROMIC SEQUENCE, INDICATING THAT IT MAY PROMOTE STRAND EXCHANGE REACTIONS IN HOMOLOGOUS RECOMBINATION. RUVAB IS AN HELICASE THAT MEDIATES THE HOLLIDAY JUNCTION MIGRATION BY LOCALIZED DENATURATION AND REANNELING." /experiment="experimental evidence, no additional details recorded" /note="plays an essential role in ATP-dependent branch migration of the Holliday junction" /codon_start=1 /transl_table=11 /product="Holliday junction DNA helicase RuvA" /protein_id="NP_217109.1" /db_xref="GI:15609730" /db_xref="GeneID:887688" /translation="MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEA RLITAMIVREDSMTLYGFPDGETRDLFLTLLSVSGVGPRLAMAALAVHDAPALRQVLA DGNVAALTRVPGIGKRGAERMVLELRDKVGVAATGGALSTNGHAVRSPVVEALVGLGF AAKQAEEATDTVLAANHDATTSSALRSALSLLGKAR" gene complement(2924817..2925383) /gene="ruvC" /locus_tag="Rv2594c" /db_xref="GeneID:887418" CDS complement(2924817..2925383) /gene="ruvC" /locus_tag="Rv2594c" /EC_number="3.1.22.4" /function="NUCLEASE THAT RESOLVES HOLLIDAY JUNCTION INTERMEDIATES IN GENETIC RECOMBINATION. CLEAVES THE CRUCIFORM STRUCTURE IN SUPERCOILED DNA BY NICKING TO STRANDS WITH THE SAME POLARITY AT SITES SYMMETRICALLY OPPOSED AT THE JUNCTION IN THE HOMOLOGOUS ARMS AND LEAVES A 5'TERMINAL PHOSPHATE AND A 3'TERMINAL HYDROXYL GROUP [CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE AT A JUNCTION SUCH AS A RECIPROCAL SINGLE-STRANDED CROSSOVER BETWEEN TWO HOMOLOGOUS DNA DUPLEXES (HOLLIDAY JUNCTION)]." /experiment="experimental evidence, no additional details recorded" /note="endonuclease; resolves Holliday structures; forms a complex of RuvABC; the junction binding protein RuvA forms a hexameric ring along with the RuvB helicase and catalyzes branch migration; RuvC then interacts with RuvAB to resolve the Holliday junction by nicking DNA strands of like polarity" /codon_start=1 /transl_table=11 /product="Holliday junction resolvase" /protein_id="NP_217110.1" /db_xref="GI:15609731" /db_xref="GeneID:887418" /translation="MRVMGVDPGLTRCGLSLIESGRGRQLTALDVDVVRTPSDAALAQ RLLAISDAVEHWLDTHHPEVVAIERVFSQLNVTTVMGTAQAGGVIALAAAKRGVDVHF HTPSEVKAAVTGNGSADKAQVTAMVTKILALQAKPTPADAADALALAICHCWRAPTIA RMAEATSRAEARAAQQRHAYLAKLKAAR" gene 2925492..2925737 /locus_tag="Rv2595" /db_xref="GeneID:887682" CDS 2925492..2925737 /locus_tag="Rv2595" /function="UNKNOWN" /note="Rv2595, (MTCY227.06c), len: 81 aa. Conserved hypothetical protein, showing similarity with various bacterial proteins e.g. O28268|AF2011 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (86 aa), FASTA scores: opt: 120, E(): 0.13, (34.35% identity in 67 aa overlap); CAC46196|SMC01176 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (79 aa), FASTA scores: opt: 119, E(): 0.14, (33.35% identity in 63 aa overlap); P37554|SP5T_BACSU|SPOVT STAGE V SPORULATION PROTEIN T from Bacillus subtilis (178 aa), FASTA scores: opt: 104, E(): 2.9, (51.45% identity in 35 aa overlap); etc. Also similar to O07779|Rv0599c|MTCY19H5.23 hypothetical protein from Mycobacterium tuberculosis (78 aa), FASTA scores: opt: 160, E(): 0.00026, (35.8% identity in 81 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217111.1" /db_xref="GI:15609732" /db_xref="GeneID:887682" /translation="MRTTIDVAGRLVIPKRIRERLGLRGNDQVEITERDGRIEIEPAP TGVELVREGSVLVARPERPLPPLTDEIVRETLDRTRR" gene 2925734..2926138 /locus_tag="Rv2596" /db_xref="GeneID:888218" CDS 2925734..2926138 /locus_tag="Rv2596" /function="UNKNOWN" /note="Rv2596, (MTCY227.05c), len: 134 aa. Conserved hypothetical protein, similar to O07780|Rv0598c|MTCY19H5.24 HYPOTHETICAL 14.8 KDA PROTEIN from Mycobacterium tuberculosis (137 aa), FASTA scores: opt: 254, E(): 8.8e-11, (41.55% identity in 130 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217112.1" /db_xref="GI:15609733" /db_xref="GeneID:888218" /translation="MIAPDTSVLVAGFATWHEGHEAAVRALNRGVHLIAHAAVETYSV LTRLPPPHRIAPVAVHAYLADITSSNYLALDACSYRGLTDHLAEHDVTGGATYDALVG FTAKAAGAKLLTRDLRAVETYERLRVEVELVT" gene 2926355..2926975 /locus_tag="Rv2597" /db_xref="GeneID:887679" CDS 2926355..2926975 /locus_tag="Rv2597" /function="UNKNOWN" /note="Rv2597, (MTCY227.04c), len: 206 aa. Probable membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217113.1" /db_xref="GI:15609734" /db_xref="GeneID:887679" /translation="MGNLLVVIAVALFIAAIVVLVVAIRRPKTPATPGGRRDPLAFDA MPQFGPRQLGPGAIVSHGGIDYVVRGSVTFREGPFVWWEHLLEGGDTPTWLSVQEDDG RLELAMWVKRTDLGLQPGGQHVIDGVTFQETERGHAGYTTEGTTGLPAGGEMDYVDCA SAGQGADESMLLSFERWAPDMGWEIATGKSVLAGELTVYPAPPVSA" gene 2926986..2927480 /locus_tag="Rv2598" /db_xref="GeneID:887689" CDS 2926986..2927480 /locus_tag="Rv2598" /function="UNKNOWN" /note="Rv2598, (MTCY227.03c), len: 164 aa. Conserved hypothetical protein, showing similarity with hypothetical proteins from Streptomyces coelicolor e.g. Q9X8S3|SCH10.34c (185 aa), FASTA scores: opt: 197, E(): 3.5e-06, (34.75% identity in 167 aa overlap); and Q9L088|SCC24.29c (172 aa), FASTA scores: opt: 149, E(): 0.0053, (37.65% identity in 146 aa overlap). Equivalent to AAK46988 from Mycobacterium tuberculosis strain CDC1551 (154 aa) but longer 10 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217114.1" /db_xref="GI:15609735" /db_xref="GeneID:887689" /translation="MPLHQLAIAPVDVSGALLGLVLNAPAPRPLATHRLAHTDGSALQ LGVLGASHVVTVEGRFCEEVSCVARSRGGDLPESTHAPGYHLQSHTETHDEAAFRRLA RHLRERCTRATGWLGGVFPGDDAALTALAAEPDGTGWRWRTWHLYPSASGGTVVHTTS RWRP" gene 2927477..2927908 /locus_tag="Rv2599" /db_xref="GeneID:887832" CDS 2927477..2927908 /locus_tag="Rv2599" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2599, (MTCY227.02c), len: 143 aa. Probable conserved membrane protein, equivalent to Q9K536|2599 HYPOTHETICAL 15.0 KDA PROTEIN (FRAGMENT) from Mycobacterium paratuberculosis (143 aa), FASTA scores: opt: 691, E(): 1.7e-33, (68.55% identity in 143 aa overlap). Shows weak similarity with Q9L089|SCC24.28c PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (131 aa), FASTA scores: opt: 130, E(): 0.52, (26.45% identity in 136 aa overlap). Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217115.1" /db_xref="GI:15609736" /db_xref="GeneID:887832" /translation="MSRNRLFLVAGSLAVAAAVSLISGITLLNRDVGSYIASHYRQES RDVNGTRYLCTGSPKQVATTLVKYQTPAARASHTDTEYLRYRNNIVTVGPDGTYPCII RVENLSAGYNHGAYVFLGPGFTPGSPSGGSGGSPGGPGGSK" misc_feature 2927795..2927827 /locus_tag="Rv2599" /note="PS00626 Regulator of chromosome condensation (RCC1) signature 2" gene 2927990..2928391 /locus_tag="Rv2600" /db_xref="GeneID:887869" CDS 2927990..2928391 /locus_tag="Rv2600" /function="UNKNOWN" /note="Rv2600, (MTCY277.01c, MTV001.01), len: 133 aa. Probable conserved integral membrane protein, equivalent (but shorter 18 aa) to Q9K537|YQ00_MYCPA HYPOTHETICAL PROTEIN RV2600 HOMOLOG from Mycobacterium paratuberculosis (151 aa), FASTA scores: opt: 543, E(): 4.2e-28, (62.9% identity in 132 aa overlap). Also some similarity with other hypothetical or membrane proteins e.g. Q9L090|SCC24.27c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (146 aa), FASTA scores: opt: 241, E(): 8.7e-09, (34.8% identity in 135 aa overlap); O58487|PH0773 HYPOTHETICAL 15.0 KDA PROTEIN from Pyrococcus horikoshii (138 aa), FASTA scores: opt: 116, E(): 0.84, (34.35% identity in 96 aa overlap); etc. Equivalent to AAK46990 from Mycobacterium tuberculosis strain CDC1551 (152 aa) but shorter 19 aa." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217116.1" /db_xref="GI:15609737" /db_xref="GeneID:887869" /translation="MVATVLYFLVGAAVLVAGFLMVNLLTPGDLRRLVFIDRRPNAVV LAATMYVALAIVTIAAIYASSNQLAQGLIGVAVYGIVGVALQGVALVILEIAVPGRFR EHIDAPALHPAVFATAVMLLAVAGVIAAALS" gene 2928388..2929959 /gene="speE" /locus_tag="Rv2601" /db_xref="GeneID:887676" CDS 2928388..2929959 /gene="speE" /locus_tag="Rv2601" /EC_number="2.5.1.16" /function="INVOLVED IN THE BIOSYNTHESIS OF SPERMIDINE FROM ARGININE (AT THE FIFTH, LAST STEP). THE ACTIVITY IS THOUGHT TO BE REGULATED MAINLY BY THE AVAILABILITY OF DECARBOXYLATED S-ADENOSYLMETHIONINE [CATALYTIC ACTIVITY: S-ADENOSYLMETHIONINAMINE + PUTRESCINE = 5'-METHYLTHIOADENOSINE + SPERMIDINE]." /note="catalyzes the formation of spermidine from putrescine and S-adenosylmethioninamine" /codon_start=1 /transl_table=11 /product="spermidine synthase" /protein_id="YP_177892.1" /db_xref="GI:57117003" /db_xref="GeneID:887676" /translation="MTSTRQAGEATEASVRWRAVLLAAVAACAACGLVYELALLTLAA SLNGGGIVATSLIVAGYIAALGAGALLIKPLLAHAAIAFIAVEAVLGIIGGLSAAALY AAFAFLDELDGSTLVLAVGTALIGGLVGAEVPLLMTLLQRGRVAGAADAGRTLANLNA ADYLGALVGGLAWPFLLLPQLGMIRGAAVTGIVNLAAAGVVSIFLLRHVVSGRQLVTA LCALAAALGLIATLLVHSHDIETTGRQQLYADPIIAYRHSAYQEIVVTRRGDDLRLYL DGGLQFCTRDEYRYTESLVYPAVSDGARSVLVLGGGDGLAARELLRQPGIEQIVQVEL DPAVIELARTTLRDVNAGSLDNPRVHVVIDDAMSWLRGAAVPPAGFDAVIVDLRDPDT PVLGRLYSTEFYALAARALAPGGLMVVQAGSPYSTPTAFWRIISTIRSAGYAVTPYHV HVPTFGDWGFALARLTDIAPTPAVPSTAPALRFLDQQVLEAATVFSGDIRPRTLDPST LDNPHIVEDMRHGWD" gene 2930070..2930357 /locus_tag="Rv2601A" /db_xref="GeneID:3205108" CDS 2930070..2930357 /locus_tag="Rv2601A" /function="UNKNOWN" /note="Rv2601A, 95 aa. Hypothetical protein, showing few similarity to O53811|Rv0748 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (88 aa), FASTA scores: opt: 132, E(): 0.017, (29.25% identity in 82 aa overlap); O53218|Rv2493 (73 aa), FASTA scores: opt: 107, E(): 0.97, (33.75% identity in 83 aa overlap); and Q10799|YS71_MYCTU|Rv2871 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 108, E(): 0.91, (41.00% identity in 39 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177673.1" /db_xref="GI:57117004" /db_xref="GeneID:3205108" /translation="MKTTLDLPDELMRAIKVRAAQQGRKMKDVVTELLRSGLSQTHSG APIPTPRRVQLPLVHCGGAATREQEMTPERVAAALLDQEAQWWSGHDDAAL" gene 2930344..2930784 /locus_tag="Rv2602" /db_xref="GeneID:888186" CDS 2930344..2930784 /locus_tag="Rv2602" /function="UNKNOWN" /note="Rv2602, (MTCI270A.03c), len: 146 aa. Conserved hypothetical protein, some weak similarity with proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O50457|Rv1242|MTV006.14 (143 aa), FASTA scores: opt: 147, E(): 0.0021, (26.25% identity in 141 aa overlap); P95023|Rv2530c|MTCY159.26 (139 aa), FASTA scores: opt: 131, E(): 0.027, (33.35% identity in 135 aa overlap); O53812|Rv0749|MTV041.23 (142 aa), FASTA scores: opt: 125, E(): 0.072, (26.45% identity in 140 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217118.1" /db_xref="GI:15609739" /db_xref="GeneID:888186" /translation="MLLCDTNIWLALALSGHVHHRASRAWLDTINAPGVIHFCRATQQ SLLRLLTNRTVLGAYGSPPLTNREAWAAYAAFLDDDRIVLAGAEPDGLEAQWRAFAVR QSPAPKVWMDAYLAAFALTGGFELVTTDTAFTQYGGIELRLLAK" gene complement(2930805..2931560) /locus_tag="Rv2603c" /db_xref="GeneID:887369" CDS complement(2930805..2931560) /locus_tag="Rv2603c" /function="UNKNOWN" /note="Rv2603c, (MTCI270A.02), len: 251 aa. Highly conserved hypothetical protein, equivalent to Q49645|YQ03_MYCLE|ML0475|U1177B|B1177_C2_181 HYPOTHETICAL 26.6 KDA PROTEIN from Mycobacterium leprae (251 aa), FASTA scores: opt: 1514, E(): 2.2e-84, (92.45% identity in 251 aa overlap). Also highly similar to Q9L288|SCL2.11c HYPOTHETICAL 26.8 KDA PROTEIN from Streptomyces coelicolor (250 aa), FASTA scores: opt: 1268, E(): 1.5e-69, (76.7% identity in 249 aa overlap); Q9AE12|YFCA HYPOTHETICAL STRUCTURAL PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (251 aa), FASTA scores: opt: 1231, E(): 2.6e-67, (72.9% identity in 251 aa overlap); O83487|Y474_TREPA|TP0474 HYPOTHETICAL PROTEIN from Treponema pallidum (245 aa), FASTA scores: opt: 780, E(): 4.4e-40, (47.75% identity in 245 aa overlap); P24237|YEBC_ECOLI|B1864 PROTEIN YEBC from Escherichia coli strain K12 (246 aa), FASTA scores: opt: 776, E(): 7.6e-40, (47.8% identity in 249 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217119.1" /db_xref="GI:15609740" /db_xref="GeneID:887369" /translation="MSGHSKWATTKHKKAVVDARRGKMFARLIKNIEVAARVGGGDPA GNPTLYDAIQKAKKSSVPNENIERARKRGAGEEAGGADWQTIMYEGYAPNGVAVLIEC LTDNRNRAASEVRVAMTRNGGTMADPGSVSYLFSRKGVVTLEKNGLTEDDVLAAVLEA GAEDVNDLGDSFEVISEPAELVAVRSALQDAGIDYESAEASFQPSVSVPVDLDGARKV FKLVDALEDSDDVQNVWTNVDVSDEVLAALDDE" gene complement(2931693..2932289) /locus_tag="Rv2604c" /db_xref="GeneID:887371" CDS complement(2931693..2932289) /locus_tag="Rv2604c" /function="UNKNOWN" /note="with PdxST is involved in the biosynthesis of pyridoxal 5'-phosphate; PdxT catalyzes the hydrolysis of glutamine to glutamate and ammonia; PdxS utilizes the ammonia to synthesize pyridoxal 5'-phosphate" /codon_start=1 /transl_table=11 /product="glutamine amidotransferase subunit PdxT" /protein_id="NP_217120.1" /db_xref="GI:15609741" /db_xref="GeneID:887371" /translation="MSVPRVGVLALQGDTREHLAALRECGAEPMTVRRRDELDAVDAL VIPGGESTTMSHLLLDLDLLGPLRARLADGLPAYGSCAGMILLASEILDAGAAGRQAL PLRAMNMTVRRNAFGSQVDSFEGDIEFAGLDDPVRAVFIRAPWVERVGDGVQVLARAA GHIVAVRQGAVLATAFHPEMTGDRRIHQLFVDIVTSAA" gene complement(2932297..2933142) /gene="tesB2" /locus_tag="Rv2605c" /db_xref="GeneID:887588" CDS complement(2932297..2933142) /gene="tesB2" /locus_tag="Rv2605c" /EC_number="3.1.2.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM. CAN HYDROLYZE A BROAD RANGE OF ACYL-CoA THIOESTERS." /experiment="experimental evidence, no additional details recorded" /note="Rv2605c, (MTCY01A10.28), len: 281 aa. Probable tesB2, acyl-CoA thioesterase II (EC 3.1.2.-), highly similar to others e.g. Q98EG9|MLL4250 from Rhizobium loti (Mesorhizobium loti) (286 aa), FASTA scores: opt: 563, E(): 3.9e-29, (47.75% identity in 287 aa overlap); CAC47767 from Rhizobium meliloti (Sinorhizobium meliloti) (294 aa), FASTA scores: opt: 553, E(): 1.8e-28, (49.3% identity in 280 aa overlap); P23911|TESB_ECOLI|B0452 from Escherichia coli strain K12 (285 aa), FASTA scores: opt: 487, E(): 3.1e-24, (41.9% identity in 277 aa overlap); etc. Also similar to O06135|TESB1|Rv1618|MTCY01B2.10 ACYL-CoA THIOESTERASE II from Mycobacterium tuberculosis (300 aa), FASTA scores: opt: 425, E(): 1.1e-21, (34.9% identity in 278 aa overlap). BELONGS TO THE C/M/P THIOESTER HYDROLASE FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA thioesterase II" /protein_id="NP_217121.1" /db_xref="GI:15609742" /db_xref="GeneID:887588" /translation="MSIEEILDLEQLEVNIYRGSVFSPESGFLQRTFGGHVAGQSLVS AVRTVDPRYMVHSLHGYFLRPGDAKERTVFLVERIRDGGSFCTRRVNAVQHGETIFSM AASFQTEQEGITHQDVMPAAPPPDGLPGLNSIKVFDDAGFRQFDEWDVCIVPRERLRL LPGKASQQQVWLRHRDPLPDDPVLHICALAYMSDLTLLGSAQVNHLDVRDQLQVASLD HAMWFMRPFRADEWLLYDQSSPSASGGRALTRGEIFTRSGEMVAAVMQEGLTRHRRGH RSVGQ" gene complement(2933171..2934070) /locus_tag="Rv2606c" /db_xref="GeneID:888592" CDS complement(2933171..2934070) /locus_tag="Rv2606c" /function="Possibly involved in the biosynthesis of pyridoxine/pyridoxal 5-phosphate biosynthesis" /note="with PdxT forms pyridoxal 5'-phosphate from glutamine, either ribose 5-phosphate or ribulose 5-phosphate, and either glyceraldehyde 3-phosphate or dihydroxyacetone phosphate" /codon_start=1 /transl_table=11 /product="pyridoxal biosynthesis lyase PdxS" /protein_id="NP_217122.1" /db_xref="GI:15609743" /db_xref="GeneID:888592" /translation="MDPAGNPATGTARVKRGMAEMLKGGVIMDVVTPEQARIAEGAGA VAVMALERVPADIRAQGGVSRMSDPDMIEGIIAAVTIPVMAKVRIGHFVEAQILQTLG VDYIDESEVLTPADYAHHIDKWNFTVPFVCGATNLGEALRRISEGAAMIRSKGEAGTG DVSNATTHMRAIGGEIRRLTSMSEDELFVAAKELQAPYELVAEVARAGKLPVTLFTAG GIATPADAAMMMQLGAEGVFVGSGIFKSGAPEHRAAAIVKATTFFDDPDVLAKVSRGL GEAMVGINVDEIAVGHRLAQRGW" gene 2934198..2934872 /gene="pdxH" /locus_tag="Rv2607" /db_xref="GeneID:888155" CDS 2934198..2934872 /gene="pdxH" /locus_tag="Rv2607" /EC_number="1.4.3.5" /function="INVOLVED IN BIOSYNTHESIS OF PYRIDOXINE (VITAMIN B6) AND PYRIDOXAL PHOSPHATE. OXIDIZE PNP AND PMP INTO PYRIDOXAL 5'-PHOSPHATE (PLP)[CATALYTIC ACTIVITY: PYRIDOXAMINE 5'-PHOSPHATE + H(2)O + O(2) = PYRIDOXAL 5'-PHOSPHATE + NH(3) + H(2)O(2)]." /note="catalyzes the formation of pyridoxal 5'-phosphate from pyridoxamine 5'-phosphate" /codon_start=1 /transl_table=11 /product="pyridoxamine 5'-phosphate oxidase" /protein_id="NP_217123.1" /db_xref="GI:15609744" /db_xref="GeneID:888155" /translation="MDDDAQMVAIDKDQLARMRGEYGPEKDGCGDLDFDWLDDGWLTL LRRWLNDAQRAGVSEPNAMVLATVADGKPVTRSVLCKILDESGVAFFTSYTSAKGEQL AVTPYASATFPWYQLGRQAHVQGPVSKVSTEEIFTYWSMRPRGAQLGAWASQQSRPVG SRAQLDNQLAEVTRRFADQDQIPVPPGWGGYRIAPEIVEFWQGRENRMHNRIRVANGR LERLQP" gene 2935046..2936788 /gene="PPE42" /locus_tag="Rv2608" /db_xref="GeneID:888204" CDS 2935046..2936788 /gene="PPE42" /locus_tag="Rv2608" /function="UNKNOWN" /note="Rv2608, (MTCY01A10.25c), len: 580 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O06828|Rv1430|MTCY493.24c from Mycobacterium tuberculosis (528 aa), FASTA scores: opt: 1004, E(): 5.9e-48, (56.05% identity in 307 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177893.1" /db_xref="GI:57117005" /db_xref="GeneID:888204" /translation="MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGS FASVTTGLAGDAWHGPASLAMTRAASPYVGWLNTAAGQAAQAAGQARLAASAFEATLA ATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASA VATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANI GIGNIGDRNLGIGNTGNWNIGIGITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGT DSLLPLPNIPLLEYAARFITPVHPGYTATFLETPSQFFPFTGLNSLTYDVSVAQGVTN LHTAIMAQLAAGNEVVVFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNR PDGGILTRFGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAI AGILFLHSGLIALPPDLASGVVQPVSSPDVLTTYILLPSQDLPLLVPLRAIPLLGNPL ADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAADLQQGAVQGVNDALSGL GLPPPWQPALPRLF" gene complement(2936810..2937865) /locus_tag="Rv2609c" /db_xref="GeneID:888208" CDS complement(2936810..2937865) /locus_tag="Rv2609c" /function="UNKNOWN" /note="Rv2609c, (MTCY01A10.24), len: 351 aa. Probable conserved membrane protein, equivalent to O07146|MLCL581.13c|ML0451 HYPOTHETICAL 37.9 KDA PROTEIN from Mycobacterium leprae (349 aa), FASTA scores: opt: 1675, E(): 1.4e-95, (77.85% identity in 334 aa overlap). Also similar to hypothetical proteins: O69888|SC2E1.17|MUTT HYPOTHETICAL 19.4 KDA PROTEIN from Streptomyces coelicolor and Streptomyces lividans (172 aa), FASTA scores: opt: 345, E(): 3.5e-14, (44.7% identity in 161 aa overlap); Q9L285|SCL2.14c HYPOTHETICAL 19.8 KDA PROTEIN from Streptomyces coelicolor (180 aa), FASTA scores: opt: 179, E(): 0.00056, (43.25% identity in 171 aa overlap); and Q9RYE5|DR0004 MUTT/NUDIX FAMILY PROTEIN from Deinococcus radiodurans (350 aa), FASTA scores: opt: 153, E(): 0.037, (33.35% identity in 123 aa overlap). Contains PS00893 mutT domain signature. BELONGS TO THE MUTT/NUDIX FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217125.1" /db_xref="GI:15609746" /db_xref="GeneID:888208" /translation="MTWLVLAGAVLLVVLVAFGAWGYQTANRLNRLNVRYDLSWQSLD SALARRAVVARAVAIDAYGGAPQGSRLAALADAAEGAPRHARENAENELSAALAMVNP ASLPAALIAELADAEARVLLARRFHNDAVRDTLALGERRLVRLLRLGGTAVLPTYFEI VERPHALVHGDQGASGRRTSARVVLLDDSGAVLLLCGSDPANPAFRDGAAPKWWFTVG GQVRPGERLAQAAARELAEETGLRVAPADMIGPIWRRDEVFEFNGSLIDSEEFYLVHR TRRFEPAVQGRTELERRYIRDARWCDANDIAQLVAAGERVYPLQLGELLPAANRLVDV ALDNGAARDAGVPQPIR" misc_feature complement(2937152..2937211) /locus_tag="Rv2609c" /note="PS00893 mutT domain signature" gene complement(2937865..2939001) /gene="pimA" /locus_tag="Rv2610c" /db_xref="GeneID:888627" CDS complement(2937865..2939001) /gene="pimA" /locus_tag="Rv2610c" /EC_number="2.4.1.-" /function="INVOLVED IN THE FIRST MANNOSYLATION STEP IN PHOSPHATIDYLINOSITOL MANNOSIDE BIOSYNTHESIS (TRANSFER OF MANNOSE RESIDUES ONTO PI, LEADING TO THE SYNTHESIS OF PHOSPHATIDYLINOSITOL MONOMANNOSIDE)." /experiment="experimental evidence, no additional details recorded" /note="Rv2610c, (MTCY01A10.23), len: 378 aa. pimA, alpha-mannosyltransferase (EC 2.4.1.-) (see citations below), equivalent to O07147|MLCL581.14c|ML0452 PUTATIVE GLYCOSYLTRANSFERASE from Mycobacterium leprae (374 aa), FASTA scores: opt: 2044, E(): 8.8e-118, (82.25% identity in 378 aa overlap). N-terminus (from aa 1 to 27) equivalent to Q9FY7 PUTATIVE ALPHA-MANNOSYL TRANSFERASE (FRAGMENT) from Mycobacterium smegmatis (27 aa), BLASTP scores: 57.4 bits (137), E(): 3e-8, Identities = 25/27 (92%), Positives = 27/27 (99%) (see citation below). Also highly similar to Q9L284|SCL2.15c PUTATIVE SUGAR TRANSFERASE from Streptomyces coelicolor (387 aa), FASTA scores: opt: 1222, E(): 1.8e-67, (52.95% identity in 376 aa overlap); and similar in part to various proteins e.g. Q9YA73|APE2066 LONG HYPOTHETICAL N-ACETYLGLUCOSAMINYL-PHOSPHATIDYLINOSITOL BIOSYNTHETIC PROTEIN from Aeropyrum pernix (392 aa), FASTA scores: opt: 434, E(): 3e-19, (31.5% identity in 378 aa overlap); Q9UZA1|PAB0827 GALACTOSYLTRANSFERASE OR LPS BIOSYNTHESIS RFBU RELATED PROTEIN from Pyrococcus abyssi (371 aa), FASTA scores: opt: 382, E(): 4.3e-16, (28.2% identity in 383 aa overlap); O26275|MTH173 LPS BIOSYNTHESIS RFBU RELATED PROTEIN from Methanothermobacter thermautotrophicus (382 aa), FASTA scores: opt: 372, E(): 1.8e-15, (28.4% identity in 391 aa overlap); etc. Shows also some similarity with O05313|Rv1212c|MTCI364.24c HYPOTHETICAL 41.5 KDA PROTEIN from Mycobacterium tuberculosis (387 aa), FASTA scores: opt: 232, E(): 1.1e -07, (28.4% identity in 402 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="alpha-mannosyltransferase PIMA" /protein_id="NP_217126.1" /db_xref="GI:15609747" /db_xref="GeneID:888627" /translation="MRIGMICPYSFDVPGGVQSHVLQLAEVMRTRGHLVSVLAPASPH AALPDYFVSGGRAVPIPYNGSVARLRFGPATHRKVKKWLAHGDFDVLHLHEPNAPSLS MLALNIAEGPIVATFHTSTTKSLTLTVFQGILRPMHEKIVGRIAVSDLARRWQMEALG SDAVEIPNGVDVDSFASAARLDGYPRQGKTVLFLGRYDEPRKGMAVLLDALPKVVQRF PDVQLLIVGHGDADQLRGQAGRLAAHLRFLGQVDDAGKASAMRSADVYCAPNTGGESF GIVLVEAMAAGTAVVASDLDAFRRVLRDGEVGHLVPVDPPDLQAAALADGLIAVLEND VLRERYVAAGNAAVRRYDWSVVASQIMRVYETVAGSGAKVQVAS" misc_feature complement(2938432..2938455) /gene="pimA" /locus_tag="Rv2610c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(2939012..2939962) /locus_tag="Rv2611c" /db_xref="GeneID:888618" CDS complement(2939012..2939962) /locus_tag="Rv2611c" /EC_number="2.3.1.-" /function="THE PRODUCT OT THIS PUTATIVE ORF SEEMS BE AN ACYLTRANSFERASE RESPONSIBLE FOR THE CLEAVAGE OF ONE FATTY ACID CHAIN OF PI (MOST LIKELY THE C16 FATTY ACID CHAIN FOUND AT POSITION sn-2) AND THUS INVOLVED IN THE SYNTHESIS OF LYSO-PIMan2, A POSSIBLE PRECURSOR OF LM AND LIPOARABINOMANNAN." /note="Acylates the intermediate (KDO)2-lipid IVA to form (KDO)2-(lauroyl)-lipid IVA" /codon_start=1 /transl_table=11 /product="lipid A biosynthesis lauroyl acyltransferase" /protein_id="NP_217127.1" /db_xref="GI:15609748" /db_xref="GeneID:888618" /translation="MIAGLKGLKLPKDPRSSVTRTATDWAYAAGWMAVRALPEFAVRN AFDTGARYFARHGGPEQLRKNLARVLGVPPAAVPDPLMCASLESYGRYWREVFRLPTI NHRKLARQLDRVIGGLDHLDAALAAGLGAVLALPHSGNWDMAGMWLVQRHGTFTTVAE RLKPESLYQRFIDYRESLGFEVLPLSGGERPPFEVLSERLRNNRVVCLMAERDLTRTG VEVDFFGEPTRMPVGPAKLAVETGAALLPTHCWFEGRGWGFQVYPALDCTSGDVAAIT QALADRFAQNIAAHPADWHMLQPQWLADLSESRRAQLRSR" gene complement(2939959..2940612) /gene="pgsA1" /locus_tag="Rv2612c" /db_xref="GeneID:888209" CDS complement(2939959..2940612) /gene="pgsA1" /locus_tag="Rv2612c" /EC_number="2.7.8.11" /function="CATALYZES THE TRANSFER OF A FREE ALCOHOL (INOSITOL) ONTO CDP-DIACYLGLYCEROL. THE PRODUCT OF THIS PUTATIVE ORF SEEMS BE ESSENTIAL TO MYCOBACTERIA [CATALYTIC ACTIVITY: CDP-DIACYLGLYCEROL + MYO-INOSITOL = CMP + PHOSPHATIDYL 1D-MYO-INOSITOL]." /inference="non-experimental evidence, no additional details recorded" /note="Rv2612c, (MTCY01A10.21), len: 217 aa. Probable pgsA1 (previously known as pgsA), PI synthase/CDP-diacylglyceride--inositol phosphatidyltransferase (EC 2.7.8.11), transmembrane protein, equivalent to O07149|MLCL581.16c|PGSA|ML0454 PUTATIVE PHOSPHATIDYLTRANSFERASE from Mycobacterium leprae (239 aa), FASTA scores: opt: 1141, E(): 4.1e-70, (79.35% identity in 213 aa overlap); and Q9F7Y9|PGSA PHOSPHATIDYLINOSITOL SYNTHASE from Mycobacterium smegmatis (222 aa), FASTA scores: opt: 981, E(): 2.7e-59, (67.3% identity in 217 aa overlap) (see citation below). Also similar to other proteins e.g. Q9L282|SCL2.17c PUTATIVE MEMBRANE TRANSFERASE from Streptomyces coelicolor (241 aa), FASTA scores: opt: 564, E(): 4.9e-31, (43.4% identity in 212 aa overlap); Q9UYD0|PGSA-LIKE|PAB1041 CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Pyrococcus abyssi (186 aa), FASTA scores: opt: 264, E(): 8.4e-11, (33.15% identity in 190 aa overlap); Q9HQS2|PGSA|VNG1030G CDP-DIACYLGLYCEROL-GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Halobacterium sp. strain NRC-1 (199 aa), FASTA scores: opt: 249, E(): 9.1e-10, (32.1% identity in 193 aa overlap); etc. Contains PS00379 CDP-alcohol phosphatidyltransferases signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY. Note that in Mycobacterium smegmatis, the psgA homologue is essential to the survival of the bacteria and seems cannot be compensated by any other enzyme of Mycobacterium smegmatis.; pgsA" /codon_start=1 /transl_table=11 /product="CDP-diacylglycerol--inositol 3-phosphatidyltransferase" /protein_id="YP_177894.1" /db_xref="GI:57117006" /db_xref="GeneID:888209" /translation="MSKLPFLSRAAFARITTPIARGLLRVGLTPDVVTILGTTASVAG ALTLFPMGKLFAGACVVWFFVLFDMLDGAMARERGGGTRFGAVLDATCDRISDGAVFC GLLWWIAFHMRDRPLVIATLICLVTSQVISYIKARAEASGLRGDGGFIERPERLIIVL TGAGVSDFPFVPWPPALSVGMWLLAVASVITCVQRLHTVWTSPGAIDRMAIPGKGDR" misc_feature complement(2940334..2940402) /gene="pgsA1" /locus_tag="Rv2612c" /note="PS00379 CDP-alcohol phosphatidyltransferases signature" gene complement(2940609..2941196) /locus_tag="Rv2613c" /db_xref="GeneID:888193" CDS complement(2940609..2941196) /locus_tag="Rv2613c" /function="UNKNOWN; BUT COULD BE INVOLVED IN LIPID METABOLISM." /note="Rv2613c, (MTCY01A10.20A), len: 195 aa. Conserved hypothetical protein, equivalent to Q9CCU0|ML0455 HYPOTHETICAL PROTEIN from Mycobacterium leprae (206 aa), FASTA scores: opt: 1074, E(): 7.4e-62, (84.7% identity in 196 aa overlap); and highly similar, but longer 18 aa, to O07150|MLCL581.17c HYPOTHETICAL 20.7 KDA PROTEIN from Mycobacterium leprae (186 aa), FASTA scores: opt: 1038, E(): 1.4e-59, (89.7% identity in 175 aa overlap). Also highly similar to other hypothetical proteins (often Hit family member) e.g. Q9F7Z0 from Mycobacterium smegmatis (see citation below) (205 aa), FASTA scores: opt: 975, E(): 1.6e-55, (79.35% identity in 184 aa overlap); Q9L279|SCL2.20 from Streptomyces coelicolor (186 aa), FASTA scores: opt: 638, E(): 5.8e-34, (52.85% identity in 176 aa overlap); Q9YFX8|APE0122 from Aeropyrum pernix (184 aa), FASTA scores: opt: 515, E(): 4.4e-26, (45.9% identity in 159 aa overlap); etc. It seems the Rv2613c and downstream ORF Rv2612c|psgA1 are expressed from the same promoter (see citation below) and that Rv2613c should be involved in lipid metabolism." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217129.1" /db_xref="GI:15609750" /db_xref="GeneID:888193" /translation="MSDEDRTDRATEDHTIFDRGVGQRDQLQRLWTPYRMNYLAEAPV KRDPNSSASPAQPFTEIPQLSDEEGLVVARGKLVYAVLNLYPYNPGHLMVVPYRRVSE LEDLTDLESAELMAFTQKAIRVIKNVSRPHGFNVGLNLGTSAGGSLAEHLHVHVVPRW GGDANFITIIGGSKVIPQLLRDTRRLLATEWARQP" gene complement(2941189..2943267) /gene="thrS" /locus_tag="Rv2614c" /db_xref="GeneID:888211" CDS complement(2941189..2943267) /gene="thrS" /locus_tag="Rv2614c" /EC_number="6.1.1.3" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-THREONINE + TRNA(THR) = AMP + PYROPHOSPHATE + L-THREONYL-TRNA(THR)]." /note="catalyzes a two-step reaction, first charging a threonine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; catalyzes the formation of threonyl-tRNA(Thr) from threonine and tRNA(Thr)" /codon_start=1 /transl_table=11 /product="threonyl-tRNA synthetase" /protein_id="NP_217130.1" /db_xref="GI:15609751" /db_xref="GeneID:888211" /translation="MSAPAQPAPGVDGGDPSQARIRVPAGTTAATAVGEAGLPRRGTP DAIVVVRDADGNLRDLSWVPDVDTDITPVAANTDDGRSVIRHSTAHVLAQAVQELFPQ AKLGIGPPITDGFYYDFDVPEPFTPEDLAALEKRMRQIVKEGQLFDRRVYESTEQARA ELANEPYKLELVDDKSGDAEIMEVGGDELTAYDNLNPRTRERVWGDLCRGPHIPTTKH IPAFKLTRSSAAYWRGDQKNASLQRIYGTAWESQEALDRHLEFIEEAQRRDHRKLGVE LDLFSFPDEIGSGLAVFHPKGGIVRRELEDYSRRKHTEAGYQFVNSPHITKAQLFHTS GHLDWYADGMFPPMHIDAEYNADGSLRKPGQDYYLKPMNCPMHCLIFRARGRSYRELP LRLFEFGTVYRYEKSGVVHGLTRVRGLTMDDAHIFCTRDQMRDELRSLLRFVLDLLAD YGLTDFYLELSTKDPEKFVGAEEVWEEATTVLAEVGAESGLELVPDPGGAAFYGPKIS VQVKDALGRTWQMSTIQLDFNFPERFGLEYTAADGTRHRPVMIHRALFGSIERFFGIL TEHYAGAFPAWLAPVQVVGIPVADEHVAYLEEVATQLKSHGVRAEVDASDDRMAKKIV HHTNHKVPFMVLAGDRDVAAGAVSFRFGDRTQINGVARDDAVAAIVAWIADRENAVPT AELVKVAGRE" misc_feature complement(2941573..2941602) /gene="thrS" /locus_tag="Rv2614c" /note="PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2" gene 2943376..2943603 /locus_tag="Rv2614A" /db_xref="GeneID:3205055" CDS 2943376..2943603 /locus_tag="Rv2614A" /function="UNKNOWN" /note="Rv2614A, len: 75 aa. Conserved hypothetical protein. The region from aa 10-35 is similar to part of C-terminal part of several TRIOSEPHOSPHATE ISOMERASES (EC 5.3.1.1) e.g. P46711|TPIS_MYCLE|TPIA|TPI|ML0572|B1496_C1_127 from Mycobacterium leprae (261 aa), FASTA scores: opt: 112, E(): 0.95, (60.0% identity in 25 aa overlap); and O08408|TPIS_MYCTU|TPIA|TPI|Rv1438|MT1482|MTCY493.16c from Mycobacterium tuberculosis (261 aa), FASTA scores: opt: 104, E(): 3.3, (60.0% identity in 25 aa overlap); P19583|TPIS_CORGL|TPIA|TPI from Corynebacterium glutamicum (Brevibacterium flavum) (259 aa), FASTA scores: opt: 100, E(): 6, (45.45% identity in 33 aa overlap); etc. TRIOSEPHOSPHATE ISOMERASES PLAY AN IMPORTANT ROLE IN SEVERAL METABOLIC PATHWAYS (CATALYTIC ACTIVITY: D-GLYCERALDEHYDE 3-PHOSPHATE = DIHYDROXY-ACETONE PHOSPHATE)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177674.1" /db_xref="GI:57117007" /db_xref="GeneID:3205055" /translation="MGDRYRAGDRVLYGGSMSPKDVDDLATQQDVDDGQSIERRWTGS GQRRWRRSPPTGRYRSNSQIQVWISGAGRLR" gene complement(2943600..2944985) /gene="PE_PGRS45" /locus_tag="Rv2615c" /db_xref="GeneID:888215" CDS complement(2943600..2944985) /gene="PE_PGRS45" /locus_tag="Rv2615c" /function="UNKNOWN" /note="Rv2615c, (MTCY01A10.19), len: 461 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many e.g. P71664|Rv1396c|MTCY21B4.13c from Mycobacterium tuberculosis (576 aa), FASTA scores: opt: 1629, E(): 4.8e-58, (56.65% identity in 482 aa overlap). Equivalent to AAK47006 from Mycobacterium tuberculosis strain CDC1551 (476 aa) but shorter 15 aa." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177895.1" /db_xref="GI:57117008" /db_xref="GeneID:888215" /translation="MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAQD EVSTAIAALFGSHGQHYQAISAQVAAYQQRFVLALSQAGSTYAVAEAASATPLQNVLD AINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAG LIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGAGGNGGIGGAGTNLAIGGHGG NGGNAGLIGAGGTGGAGGTGGGEPSAGASGGNGGNGGNGGLLIGNSGDGGAAGNGAGI SQNGPASGFGGNGGHAGTTGLIGNGGNGGAGGAGGDVSADFGGVGFGGQGGNGGAGGL LYGNGGAGGNGGAAGSPGSVTAFGGNGGSGGSGGNGGNALIGNAGAGGSAGAGGNGAS AGTAGGSGGDGGKGGNGGSVGLIGNGGNGGNGGAGSLFNGAPGFGGPGGSGGASLLGP PGLAGTNGADG" gene 2945330..2945830 /locus_tag="Rv2616" /db_xref="GeneID:887289" CDS 2945330..2945830 /locus_tag="Rv2616" /function="UNKNOWN" /note="Rv2616, (MTCY01A10.18c), len: 166 aa. Conserved hypothetical protein, highly similar to bacterial proteins: Q9L1G0|SC3D11.02c HYPOTHETICAL 20.3 KDA PROTEIN from Streptomyces coelicolor (188 aa), FASTA scores: opt: 407, E(): 2.3e-20, (44.0% identity in 159 aa overlap); Q9X945 A3(2) GLYCOGEN METABOLISM CLUSTER from Streptomyces coelicolor (134 aa), FASTA scores: opt: 330, E(): 2.5e-15, (46.65% identity in 120 aa overlap) (N-terminus shorter); Q9RST8|DR2035 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (198 aa), FASTA scores: opt: 228, E(): 2.4e-08, (35.1% identity in 168 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217132.1" /db_xref="GI:15609753" /db_xref="GeneID:887289" /translation="MDLNALADLPLTYPEVGATATGRLPAGYNHLDVSTQIGTGRQRF EQAADAVMHWGMQRNAGLRVRASSETAVVSAVVLVGIAFLRAPCRVVYVIDEPDVRGF GYGTLPGHPVSGEERFAVRCDPMTSVVFAEVLSFSRPATWASKAAGPLGAVTQRFIAQ RYLRAV" gene complement(2945847..2946287) /locus_tag="Rv2617c" /db_xref="GeneID:888610" CDS complement(2945847..2946287) /locus_tag="Rv2617c" /function="UNKNOWN" /note="Rv2617c, (MTCY01A10.17), len: 146 aa. Probable transmembrane protein, showing some similarity to hypothetical or membrane proteins e.g. CAC47207|SMC00744 PUTATIVE TRANSPORT PROTEIN TRANSMEMBRANE from Rhizobium meliloti (Sinorhizobium meliloti) (399 aa), FASTA scores: opt: 108, E(): 5.5, (29.15% identity in 144 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217133.1" /db_xref="GI:15609754" /db_xref="GeneID:888610" /translation="MSIRPTTSPALADQLKDPAYSAYVLLRTLFTVAPILFGLDKFFN LLTHPQHWNMYLAGWINDLVPGTADQCMYLVGAIEIVAGVLVAVAPRIGAWVVAAWLA GIILNLVTGPGFYDIALRDFGLLVGAIALARLAQGVHSGGIGRP" gene 2946434..2947111 /locus_tag="Rv2618" /db_xref="GeneID:887690" CDS 2946434..2947111 /locus_tag="Rv2618" /function="UNKNOWN" /note="Rv2618, (MTCY01A10.15c), len: 225 aa. Conserved hypothetical protein, similar in part to Q9EWQ9|SC4C2.03 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (159 aa), FASTA scores: opt: 235, E(): 1.3e-07, (43.7% identity in 103 aa overlap); Q9HLM6|TA0201 HYPOTHETICAL PROTEIN from Thermoplasma acidophilum (215 aa), FASTA scores: opt: 164, E(): 0.0038, (23.4% identity in 201 aa overlap); and to mycobacterial proteins e.g. O06191|Rv2621c|MTCY01A10.11 HYPOTHETICAL 24.2 KDA PROTEIN from Mycobacterium tuberculosis (224 aa), FASTA scores: opt: 149, E(): 0.033, (28.05% identity in 196 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217134.1" /db_xref="GI:15609755" /db_xref="GeneID:887690" /translation="MDPVRRQLYQFVCSQSMPVSRDQAADAVGIPRHQAKFHLDRLTA EGLLDTEYARLTGRSGPGAGRTAKLYRRAGRDIALSLPQREYELAGRLMAAAIVLSAT TGEPTVEVLNRIAHDYGQAMGAAATTRPPADPAAALELTLDVLRKYGYEPRRPAGPGD DEVELVNCPFHALAREQTELACNMNHALITGVADALAPHSPAVRLAPGPARCCVVLKR CSAHDPE" gene complement(2947096..2947449) /locus_tag="Rv2619c" /db_xref="GeneID:888601" CDS complement(2947096..2947449) /locus_tag="Rv2619c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2619c, (MTCY01A10.14), len: 117 aa. Conserved hypothetical protein, highly similar to Q9L0F3|SCD31.14 HYPOTHETICAL 11.6 KDA PROTEIN from Streptomyces coelicolor (110 aa) , FASTA scores: opt: 407, E(): 2.3e-21, (55.95% identity in 109 aa overlap). Also similarity with other short bacterial hypothetical proteins e.g. Q9F8B9 HYPOTHETICAL 12.4 KDA PROTEIN from Streptococcus agalactiae (112 aa), FASTA scores: opt: 143, E(): 0.0032, (32.45% identity in 74 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217135.1" /db_xref="GI:15609756" /db_xref="GeneID:888601" /translation="MESISLTSLAAEKLAEAQQTHSGRAAHTIHGGHTHELRQTVLAL LAGHDLSEHDSPGEATLQVLQGHVCLTAGEDAWNGRAGDYVAIPPTRHALHAVEDSVI MLTVLKSLPDAHSGS" gene complement(2947462..2947887) /locus_tag="Rv2620c" /db_xref="GeneID:888497" CDS complement(2947462..2947887) /locus_tag="Rv2620c" /function="UNKNOWN" /note="Rv2620c, (MTCY01A10.13), len: 141 aa. Probable conserved transmembrane protein, highly similar to O54184|SC7H1.25 HYPOTHETICAL 14.6 KDA PROTEIN from Streptomyces coelicolor (144 aa), FASTA scores: opt: 459, E(): 1.4e-22, (56.45% identity in 140 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217136.1" /db_xref="GI:15609757" /db_xref="GeneID:888497" /translation="MSAGPAIEVAVAFVWLGMVVAISFLEAPLKFRAAGVTLQIGLGI GRLVFRALNTVEVGFALVILAIVVVGSTPARIAAAFSVALAALAVQLIAVRPRLTRRS NQVLAGLQAPRSRGHHIYVGLEIVKVVALLVAGILLLNG" gene complement(2947884..2948558) /locus_tag="Rv2621c" /db_xref="GeneID:888565" CDS complement(2947884..2948558) /locus_tag="Rv2621c" /function="COULD BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2621c, (MTCY01A10.11), len: 224 aa. Possible transcriptional regulator, similar in part to Q49688|MLCL536.29c|ML0592 PUTATIVE DNA-BINDING PROTEIN from Mycobacterium leprae (254 aa), FASTA scores: opt: 168, E(): 0.0018, (29.75% identity in 222 aa overlap). Shows similarity with Q9XAD0|SCC22.08c PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (252 aa), FASTA scores: opt: 148, E(): 0.032, (29.4% identity in 204 aa overlap); and Q9RVM8|DR0999 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (225 aa), FASTA scores: opt: 195, E(): 3.3e-05, (29.6% identity in 213 aa overlap). Also some similarity with O06195|Rv2618|MTCY01A10.15c from Mycobacterium tuberculosis (225 aa), FASTA scores: opt: 149, E(): 0.025, (28.95% identity in 197 aa overlap). Contains helix-turn-helix motif at aa 31-52 (Score 1662, +4.85 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217137.1" /db_xref="GI:15609758" /db_xref="GeneID:888565" /translation="MGVSVIIRSLQEPVGRRRAVLRALCASRVPMSIAAIAGKLGVHP NTVRFHLDNLVADGQVERVEPGRGRPGRPPLMFRAVRRTDSTGTRRYRLLAEILASGL AAERDSRAMALSAGRAWGRQLEAPPAGADTEETIDHLVAVLDDLGFAPERRASNGRQQ VGLRHCPFLELAETQAGVVCPVHLGIMRGALQTWGAPVTVDRLDAFVEPDLCLAHFTP LEGAIR" gene 2948636..2949457 /locus_tag="Rv2622" /db_xref="GeneID:887825" CDS 2948636..2949457 /locus_tag="Rv2622" /EC_number="2.1.1.-" /function="CAUSES METHYLATION." /note="Rv2622, (MTCY01A10.10c), len: 273 aa. Possible methyltransferase (EC 2.1.1.-), similar in part to others e.g. AAK75664|SP1578 PUTATIVE METHYLTRANSFERASE from Streptococcus pneumoniae (252 aa), FASTA scores: opt: 406, E(): 6.6e-18, (32.65% identity in 251 aa overlap); Q9F8B8 METHYLTRANSFERASE from Streptococcus agalactiae (254 aa), FASTA scores: opt: 381, E(): 2.3e-16, (31.75% identity in 252 aa overlap); Q9RJB6|SCF91.08 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (231 aa), FASTA scores: opt: 159, E(): 0.0091, (33.1% identity in 151 aa overlap); etc. Also similar in part to several hypothetical proteins e.g. Q99YR0|SPY1582 HYPOTHETICAL PROTEIN from Streptococcus pyogenes (251 aa), FASTA scores: opt: 397, E(): 2.3e-17, (36.3% identity in 248 aa overlap)." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_217138.1" /db_xref="GI:15609759" /db_xref="GeneID:887825" /translation="MANKRGNAGQPLPLSDRDDDHMQGHWLLARLGKRVLRPGGVELT RTLLARAEVTDADVLELAPGLGRTAAEILARNPRSYVGAESDPNAANLVRHVLAGRGD VRVTDAADTGLSDASADVVIGEAMLTMQGNAAKHTIVAEAARVLRPGGRYAIHELALV PDDVAEQVRTDLRQSLARALKVNARPLTVAEWSHLLAGHGLVVEHVVTASMALLQPRR VIADEGLLGALRFAGNLLIHRAARRRVLLMRHTFRRHRERLTAVAIVAHKPHVDS" gene 2949593..2950486 /gene="TB31.7" /locus_tag="Rv2623" /db_xref="GeneID:887442" CDS 2949593..2950486 /gene="TB31.7" /locus_tag="Rv2623" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2623, (MTCY01A10.09c), len: 297 aa. TB31.7, conserved hypothetical protein, highly similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Q10851|YK05_MYCTU|Rv2005c|MT2061|MTCY39.12 (295 aa), FASTA scores: opt: 1076, E(): 1.4e-60, (55.25% identity in 295 aa overlap); O53472|Rv2026c|MTV018.13c (294 aa), FASTA scores: opt: 988, E(): 4.8e-55, (51.5% identity in 295 aa overlap); Q10862|YJ96_MYCTU|Rv1996|MT2052|MTCY39.23c (317 aa), FASTA scores: opt: 688, E(): 4.1e-36, (45.1% identity in 315 aa overlap); etc. Also similar to several Streptomyces proteins e.g. Q9RIZ8|SCJ1.16c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (294 aa), FASTA scores: opt: 407, E(): 2e-18, (32.65% identity in 303 aa overlap); and other bacterial hypothetical proteins e.g. Q9HPP5|VNG1536 from Halobacterium sp (147 aa), FASTA scores: opt: 180, E(): 0.00022, (31.65% identity in 139 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217139.1" /db_xref="GI:15609760" /db_xref="GeneID:887442" /translation="MSSGNSSLGIIVGIDDSPAAQVAVRWAARDAELRKIPLTLVHAV SPEVATWLEVPLPPGVLRWQQDHGRHLIDDALKVVEQASLRAGPPTVHSEIVPAAAVP TLVDMSKDAVLMVVGCLGSGRWPGRLLGSVSSGLLRHAHCPVVIIHDEDSVMPHPQQA PVLVGVDGSSASELATAIAFDEASRRNVDLVALHAWSDVDVSEWPGIDWPATQSMAEQ VLAERLAGWQERYPNVAITRVVVRDQPARQLVQRSEEAQLVVVGSRGRGGYAGMLVGS VGETVAQLARTPVIVARESLT" gene complement(2950489..2951307) /locus_tag="Rv2624c" /db_xref="GeneID:888939" CDS complement(2950489..2951307) /locus_tag="Rv2624c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2624c, (MTCY01A10.08), len: 272 aa. Conserved hypothetical protein, similar to several Streptomyces proteins e.g. Q9RIY5|SCJ1.29c HYPOTHETICAL 30.1 KDA PROTEIN from Streptomyces coelicolor (283 aa), FASTA scores: opt: 260, E(): 5e-09, (32.05% identity in 290 aa overlap). Also similar to Mycobacterium tuberculosis proteins O53474|Rv2028c|MTV018.15c (279 aa), FASTA scores: opt: 563, E(): 7e-28, (36.85% identity in 266 aa overlap); P95192|Rv3134c|MTCY03A2.240 (268 aa), FASTA scores: opt: 458, E(): 2.3e-21, (36.55% identity in 271 aa overlap); Q10851|YK05_MYCTU|Rv2005c|MT2061|MTCY39.12 (295 aa), FASTA scores: opt: 199, E(): 3.2e-05, (29.35% identity in 286 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217140.1" /db_xref="GI:15609761" /db_xref="GeneID:888939" /translation="MSGRGEPTMKTIIVGIDGSHAAITAALWGVDEAISRAVPLRLVS VIKPTHPSPDDYDRDLAHAERSLREAQSAVEAAGKLVKIETDIPRGPAGPVLVEASRD AEMICVGSVGIGRYASSILGSTATELAEKAHCPVAVMRSKVDQPASDINWIVVRMTDA PDNEAVLEYAAREAKLRQAPILALGGRPEELREIPDGEFERRVQDWHHRHPDVRVYPI TTHTGIARFLADHDERVQLAVIGGGEAGQLARLVGPSGHPVFRHAECSVLVVRR" gene complement(2951322..2952503) /locus_tag="Rv2625c" /db_xref="GeneID:887692" CDS complement(2951322..2952503) /locus_tag="Rv2625c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2625c, (MTCY01A10.07), len: 393 aa. Probable conserved transmembrane ala-, leu-rich protein, similar to many hypothetical or membrane proteins e.g. Q55518|Y528_SYNY3|SLL0528 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Synechocystis sp. strain PCC 6803 (379 aa), FASTA scores: opt: 552, E(): 5.6e-26, (30.75% identity in 374 aa overlap); Q9RJ56|SCI41.35c HYPOTHETICAL 39.8 KDA PROTEIN from Streptomyces coelicolor (374 aa), FASTA scores: opt: 419, E(): 5.7e-18, (31.6% identity in 383 aa overlap); CAC49448|SMB20925 CONSERVED HYPOTHETICAL MEMBRANE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (372 aa), FASTA scores: opt: 401, E(): 6.9e-17, (29.5% identity in 383 aa overlap); etc. Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217141.1" /db_xref="GI:15609762" /db_xref="GeneID:887692" /translation="MRDAIPLGRIAGFVVNVHWSVLVILWLFTWSLATMLPGTVGGYP AVVYWLLGAGGAVMLLASLLAHELAHAVVARRAGVSVESVTLWLFGGVTALGGEAKTP KAAFRIAFAGPATSLALSATFGALAITLAGVRTPAIVISVAWWLATVNLLLGLFNLLP GAPLDGGRLVRAYLWRRHGDSVRAGIGAARAGRVVALVLIALGLAEFVAGGLVGGVWL AFIGWFIFAAAREEETRISTQQLFAGVRVADAMTAQPHTAPGWINVEDFIQRYVLGER HSAYPVADRDGSITGLVALRQLRDVAPSRRSTTSVGDIALPLHSVPTARPQEPLTALL ERMAPLGPRSRALVTEGSAVVGIVTPSDVARLIDVYRLAQPEPTFTTSPQDADRFSDA G" misc_feature complement(2952288..2952317) /locus_tag="Rv2625c" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(2952562..2952993) /locus_tag="Rv2626c" /db_xref="GeneID:888576" CDS complement(2952562..2952993) /locus_tag="Rv2626c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2626c, (MTCY01A10.06), len: 143 aa. Conserved hypothetical protein, similar to CAC49670|SMB21441 PUTATIVE INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE PROTEIN (EC 1.1.1.205) from Rhizobium meliloti (Sinorhizobium meliloti) (120 aa), FASTA scores: opt: 287, E(): 6.6e-12, (43.75% identity in 112 aa overlap) (has its N-terminus shorter 27 aa); AAK78655|CAC0678 CBS DOMAINS from Clostridium acetobutylicum (142 aa), FASTA scores: opt: 276, E(): 3.9e-11, (35.65% identity in 115 aa overlap); Q9K9P0|BH2605 BH2605 PROTEIN from Bacillus halodurans (142 aa), FASTA scores: opt: 276, E(): 3.9e-11, (35.65% identity in 115 aa overlap); etc. Also some similarity to P71737|Rv2406c|MTCY253.14 HYPOTHETICAL 15.1 KDA PROTEIN from Mycobacterium tuberculosis (142 aa), FASTA scores: opt: 145, E(): 0.00012, (22.3% identity in 112 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217142.1" /db_xref="GI:15609763" /db_xref="GeneID:888576" /translation="MTTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDR LHGMLTDRDIVIKGLAAGLDPNTATAGELARDSIYYVDANASIQEMLNVMEEHQVRRV PVISEHRLVGIVTEADIARHLPEHAIVQFVKAICSPMALAS" gene complement(2953507..2954748) /locus_tag="Rv2627c" /db_xref="GeneID:888568" CDS complement(2953507..2954748) /locus_tag="Rv2627c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2627c, (MTCY01A10.05), len: 413 aa. Conserved hypothetical protein. Some similarity in C-terminal part of O53697|Rv0293c|MTV035.21c HYPOTHETICAL 44.0 KDA PROTEIN from Mycobacterium tuberculosis (400 aa), FASTA scores: opt: 392, E(): 1.9e-17, (31.1% identity in 299 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217143.1" /db_xref="GI:15609764" /db_xref="GeneID:888568" /translation="MASSASDGTHERSAFRLSPPVLSGAMGPFMHTGLYVAQSWRDYL GQQPDKLPIARPTIALAAQAFRDEIVLLGLKARRPVSNHRVFERISQEVAAGLEFYGN RRWLEKPSGFFAQPPPLTEVAVRKVKDRRRSFYRIFFDSGFTPHPGEPGSQRWLSYTA NNREYALLLRHPEPRPWLVCVHGTEMGRAPLDLAVFRAWKLHDELGLNIVMPVLPMHG PRGQGLPKGAVFPGEDVLDDVHGTAQAVWDIRRLLSWIRSQEEESLIGLNGLSLGGYI ASLVASLEEGLACAILGVPVADLIELLGRHCGLRHKDPRRHTVKMAEPIGRMISPLSL TPLVPMPGRFIYAGIADRLVHPREQVTRLWEHWGKPEIVWYPGGHTGFFQSRPVRRFV QAALEQSGLLDAPRTQRDRSA" gene 2955058..2955420 /locus_tag="Rv2628" /db_xref="GeneID:888566" CDS 2955058..2955420 /locus_tag="Rv2628" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2628, (MTCY01A10.04c), len: 120 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217144.1" /db_xref="GI:15609765" /db_xref="GeneID:888566" /translation="MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHP RKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIGDW PAAYAIGEHLSVEIAVAV" gene 2955767..2956891 /locus_tag="Rv2629" /db_xref="GeneID:888588" CDS 2955767..2956891 /locus_tag="Rv2629" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2629, (MTCY01A10.03c), len: 374 aa. Conserved hypothetical protein, similar to Q9ZC00|SC1E6.22c HYPOTHETICAL 40.7 KDA PROTEIN from Streptomyces coelicolor (373 aa), FASTA scores: opt: 425, E(): 2.5e-18, (30.2% identity in 371 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217145.1" /db_xref="GI:15609766" /db_xref="GeneID:888588" /translation="MRSERLRWLVAAEGPFASVYFDDSHDTLDAVERREATWRDVRKH LESRDAKQELIDSLEEAVRDSRPAVGQRGRALIATGEQVLVNEHLIGPPPATVIRLSD YPYVVPLIDLEMRRPTYVFAAVDHTGADVKLYQGATISSTKIDGVGYPVHKPVTAGWN GYGDFQHTTEEAIRMNCRAVADHLTRLVDAADPEVVFVSGEVRSRTDLLSTLPQRVAV RVSQLHAGPRKSALDEEEIWDLTSAEFTRRRYAEITNVAQQFEAEIGRGSGLAAQGLA EVCAALRDGDVDTLIVGELGEATVVTGKARTTVARDADMLSELGEPVDRVARADEALP FAAIAVGAALVRDDNRIAPLDGVGALLRYAATNRLGSHRS" gene 2956893..2957432 /locus_tag="Rv2630" /db_xref="GeneID:887426" CDS 2956893..2957432 /locus_tag="Rv2630" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2630, (MTCY01A10.02c), len: 179 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217146.1" /db_xref="GI:15609767" /db_xref="GeneID:887426" /translation="MLHRDDHINPPRPRGLDVPCARLRATNPLRALARCVQAGKPGTS SGHRSVPHTADLRIEAWAPTRDGCIRQAVLGTVESFLDLESAHAVHTRLRRLTADRDD DLLVAVLEEVIYLLDTVGETPVDLRLRDVDGGVDVTFATTDASTLVQVGAVPKAVSLN ELRFSQGRHGWRCAVTLDV" gene 2957572..2958870 /locus_tag="Rv2631" /db_xref="GeneID:887431" CDS 2957572..2958870 /locus_tag="Rv2631" /function="UNKNOWN" /note="Rv2631, (MTCY441.01, MTCY01A10.01c), len: 432 aa. Conserved hypothetical protein, highly similar to several conserved hypothetical proteins from various species e.g. O29399|AF0862 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (482 aa), FASTA scores: opt: 1496, E(): 2.1e-80, (52.3% identity in 432 aa overlap) (has its N-terminus longer 30 aa); O27634|MTH1597 CONSERVED PROTEIN from Methanothermobacter thermautotrophicus (488 aa), FASTA scores: opt: 1428, E(): 2.1e-76, (50.9% identity in 432 aa overlap); Q9YB37|APE1758 HYPOTHETICAL 53.7 KDA PROTEIN APE1758 from Aeropyrum pernix (483 aa), FASTA scores: opt: 1422, E(): 4.6e-76, (49.3% identity in 432 aa overlap) (has its N-terminus longer 30 aa); etc. Equivalent to AAK47022 from Mycobacterium tuberculosis strain CDC1551 (432 aa). 3' part extended since first submission (+175 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217147.2" /db_xref="GI:57117009" /db_xref="GeneID:887431" /translation="MQVVNVATLPGIVRASYAMPDVHWGYGFPIGGVAATDVDNDGVV SPGGVGFDISCGVRLLVGEGLDREELQPRLPAVMDRLDRAIPRGVGTAGVWRLPDRNT LQEVLTGGARFAVEQGHGVALDLERCEDGGVMTGADAAKISDRALQRGLGQIGSLGSG NHFLEVQAVDRVYDPVAAAPMGLAEGTVCVMIHTGSRGLGHQICTDHVRQMEQAMGRY GIAVPDRQLACVPVHSPDGQAYLAAMAAAANYGRANRQLLTEATRRVFADATGTPLDL LYDVSHNLAKIETHPIDGQLRSVCVHRKGATRSLPPHHHELPAELAAVGQPVLIPGTM GTASYVLAGVTGNPAFFSTAHGAGRVLSRHQAARHTSGEAIRASLAKRGIIVRGTSRR GIAEEKPEAYKDVDEVIEASHQSGLARKVARLVPLGCVKG" gene complement(2958909..2959190) /locus_tag="Rv2632c" /db_xref="GeneID:887231" CDS complement(2958909..2959190) /locus_tag="Rv2632c" /function="UNKNOWN" /note="Rv2632c, (MTCY441.02c), len: 93 aa. Conserved hypothetical protein, highly similar to conserved hypothetical proteins from Mycobacterium tuberculosis: P71996|YH38_MYCTU|Rv1738|MT1780|MTCY04C12.23 (94 aa), FASTA scores: opt: 319, E(): 4.2e-15, (53.95% identity in 89 aa overlap); and Q9KK61 from Mycobacterium bovis BCG (56 aa), FASTA scores: opt: 178, E(): 9.2e-06, (52.95% identity in 51 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217148.1" /db_xref="GI:15609769" /db_xref="GeneID:887231" /translation="MTDSEHVGKTCQIDVLIEEHDERTRAKARLSWAGRQMVGVGLAR LDPADEPVAQIGDELAIARALSDLANQLFALTSSDIEASTHQPVTGLHH" gene complement(2959335..2959820) /locus_tag="Rv2633c" /db_xref="GeneID:888597" CDS complement(2959335..2959820) /locus_tag="Rv2633c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2633c, (MTCY441.03c), len: 161 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217149.1" /db_xref="GI:15609770" /db_xref="GeneID:888597" /translation="MNAYDVLKRHHTVLKGLGRKVGEAPVNSEERHVLFDEMLIELDI HFRIEDDLYYPALSAAGKPITGTHAEHRQVVDQLATLLRTPQRAPGYEEEWNVFRTVL EAHADVEERDMIPAPTPVHITDAELEELGDKMAARIEQLRGSPLYTLRTKGKADLLKA I" gene complement(2960105..2962441) /gene="PE_PGRS46" /locus_tag="Rv2634c" /db_xref="GeneID:888573" CDS complement(2960105..2962441) /gene="PE_PGRS46" /locus_tag="Rv2634c" /function="UNKNOWN" /note="Rv2634c, (MTCY441.04c), len: 778 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many e.g. O53553|YZ08_MYCTU|Rv3508|MTV023.15 from Mycobacterium tuberculosis (1901 aa), FASTA scores: opt: 2553, E(): 2.2e-93, (53.8% identity in 866 aa overlap). Equivalent to AAK47026 from Mycobacterium tuberculosis strain CDC1551 (788 aa) but shorter 10 aa." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177896.1" /db_xref="GI:57117010" /db_xref="GeneID:888573" /translation="MSFVIAVPEALTMAASDLANIGSTINAANAAAALPTTGVVAAAA DEVSAAVAALFGSYAQSYQAFGAQLSAFHAQFVQSLTNGARSYVVAEATSAAPLQDLL GVVNAPAQALLGRPLIGNGANGADGTGAPGGPGGLLLGNGGNGGSGAPGQPGGAGGDA GLIGNGGTGGKGGDGLVGSGAAGGVGGRGGWLLGNGGTGGAGGAAGATLVGGTGGVGG ATGLIGSGGFGGAGGAAAGVGTTGGVGGSGGVGGVFGNGGFGGAGGLGAAGGVGGAAS YFGTGGGGGVGGDGAPGGDGGAGPLLIGNGGVGGLGGAGAAGGNGGAGGMLLGDGGAG GQGGPAVAGVLGGMPGAGGNGGNANWFGSGGAGGQGGTGLAGTNGVNPGSIANPNTGA NGTDNSGNGNQTGGNGGPGPAGGVGEAGGVGGQGGLGESLDGNDGTGGKGGAGGTAGT DGGAGGAGGAGGIGETDGSAGGVATGGEGGDGATGGVDGGVGGAGGKGGQGHNTGVGD AFGGDGGIGGDGNGALGAAGGNGGTGGAGGNGGRGGMLIGNGGAGGAGGTGGTGGGGA AGFAGGVGGAGGEGLTDGAGTAEGGTGGLGGLGGVGGTGGMGGSGGVGGNGGAAGSLI GLGGGGGAGGVGGTGGIGGIGGAGGNGGAGGAGTTTGGGATIGGGGGTGGVGGAGGTG GTGGAGGTTGGSGGAGGLIGWAGAAGGTGAGGTGGQGGLGGQGGNGGNGGTGATGGQG GDFALGGNGGAGGAGGSPGGSSGIQGNMGPPGTQGADG" gene 2962470..2962712 /locus_tag="Rv2635" /db_xref="GeneID:887813" CDS 2962470..2962712 /locus_tag="Rv2635" /function="UNKNOWN" /note="Rv2635, (MTCY441.05), len: 80 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217151.1" /db_xref="GI:15609772" /db_xref="GeneID:887813" /translation="MVAADHRALGSNKSYPASQTAEAIWPPARTLRYDRQSPWLATGF DRRMSQTVTGVGVQNCAVSKRRCSAVDHSSRTPYRR" gene 2962713..2963390 /locus_tag="Rv2636" /db_xref="GeneID:888196" CDS 2962713..2963390 /locus_tag="Rv2636" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2636, (MTCY441.06), len: 225 aa. Conserved hypothetical protein, showing some similarity with various proteins: Q98FG2|MLL3789 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (239 aa), FASTA scores: opt: 304, E(): 3.7e-13, (31.55% identity in 187 aa overlap); CAC46568|SMC04451 PUTATIVE CHLORAMPHENICOL PHOSPHOTRANSFERASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (220 aa), FASTA scores: opt: 175, E(): 0.00014, (28.0% identity in 225 aa overlap); Q56148|CPT_STRVL CHLORAMPHENICOL 3-O PHOSPHOTRANSFERASE (EC 2.7.1.-) from Streptomyces violaceus (Streptomyces venezuelae) (178 aa), FASTA scores: opt: 131, E(): 0.1, (31.75% identity in 170 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Translational start site uncertain, chosen by similarity." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217152.1" /db_xref="GI:15609773" /db_xref="GeneID:888196" /translation="MINPTRARRMRYRLAAMAGMPEGKLILLNGGSSAGKTSLALAFQ DLAAECWMHIGIDLFWFALPPEQLDLARVRPEYYTWDSAVEADGLEWFTVHPGPILDL AMHSRYRAIRAYLDNGMNVIADDVIWTREWLVDALRVFEGCRVWMVGVHVSDEEGARR ELERGDRHPGWNRGSARAAHADAEYDFELDTTATPVHELARELHESYQACPYPMAFNR LRKRFLS" misc_feature 2962800..2962823 /locus_tag="Rv2636" /note="PS00017 ATP/GTP-binding site motif A" gene 2963586..2964242 /gene="dedA" /locus_tag="Rv2637" /db_xref="GeneID:888616" CDS 2963586..2964242 /gene="dedA" /locus_tag="Rv2637" /function="UNKNOWN" /note="Rv2637, (MTCY441.07), len: 218 aa. Possible dedA, transmembrane protein, equivalent to Q49642|YQ37_MYCLE|ML0467|MLCL581.27|B1177_C2_172/B1177_C1 _1 40 HYPOTHETICAL 23.1 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN, BELONGS TO THE DEDA FAMILY) from Mycobacterium leprae (214 aa), FASTA scores: opt: 1160, E(): 4.4e-64, (82.75% identity in 209 aa overlap); and O69601|Y364_MYCLE|ML0287|MLCB4.30 HYPOTHETICAL PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) (222 aa), FASTA scores: opt: 292, E(): 6.6e-11, (32.25% identity in 189 aa overlap). Also highly similar to other membrane proteins e.g. CAC42863|SCBAC36F5.27c PUTATIVE INTEGRAL MEMBRANE from Streptomyces coelicolor (211 aa), FASTA scores: opt: 837, E(): 2.6e-44, (59.2% identity in 201 aa overlap); Q55705|Y232_SYNY3|SLR0232 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Synechocystis sp. strain PCC 6803 (218 aa), FASTA scores: opt: 415, E(): 1.9e-18, (37.85% identity in 206 aa overlap); Q9RV63|DR1167 DEDA PROTEIN from Deinococcus radiodurans (200 aa); P09548|DEDA_ECOLI|B2317|Z3579|ECS3201 DEDA PROTEIN (DSG-1 PROTEIN) from Escherichia coli strains K12 and O157:H7 (219 aa), BLAST scores: 178, E(): 1.8e-13, Identities = 53/175 (30%); etc. Also similar to O06314|Y364_MYCTU|Rv0364|MT0380|MTCY13E10.26 HYPOTHETICAL 24.5 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Mycobacterium tuberculosis (227 aa), FASTA scores: opt: 293, E(): 5.8e-11, (35.85% identity in 184 aa overlap). BELONGS TO THE DEDA FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane protein DedA" /protein_id="NP_217153.1" /db_xref="GI:15609774" /db_xref="GeneID:888616" /translation="MDVEALLQSIPPLMVYLVVGAVVGIESLGIPLPGEIVLVSAAVL SSHPELAVNPIGVGGAAVIGAVVGDSIGYSIGRRFGLPLFDRLGRRFPKHFGPGHVAL AERLFNRWGVRAVFLGRFIALLRIFAGPLAGALKMPYPRFLAANVTGGICWAGGTTAL VYFAGMAAQHWLERFSWIALVIAVIAGITAAILLRERTSRAIAELEAEHCRKAGTTAA" gene 2964405..2964851 /locus_tag="Rv2638" /db_xref="GeneID:888156" CDS 2964405..2964851 /locus_tag="Rv2638" /function="UNKNOWN" /note="Rv2638, (MTCY441.08), len: 147 aa. Conserved hypothetical protein, similar in part to Q9WVX8|RSBV_STRCO|BLDG|SCH5.12c ANTI-SIGMA B FACTOR ANTAGONIST from Streptomyces coelicolor (113 aa), FASTA scores: opt: 162, E(): 0.00066, (31.8% identity in 110 aa overlap); and showing weak similarity with various proteins e.g. O69205 HYPOTHETICAL 13.4 KDA PROTEIN from Actinosynnema pretiosum (subsp. auranticum) (128 aa), FASTA scores: opt: 157, E(): 0.0016, (29.8% identity in 114 aa overlap); Q9RJ93|SCF91.32 PUTATIVE ANTI-SIGMA FACTOR ANTAGONIST from Streptomyces coelicolor (183 aa), FASTA scores: opt: 148, E(): 0.0082, (30.85% identity in 107 aa overlap); etc. Also highly similar to hypothetical proteins from Mycobacterium tuberculosis: O07728|Rv1904|MTCY180.14c (143 aa), FASTA scores: opt: 456, E(): 3.9e-23, (52.8% identity in 125 aa overlap); and Q11035|YD65_MYCTU|Rv1365c|MT1411|MTCY02B10.29c (128 aa), FASTA scores: opt: 435, E(): 8.6e-22, (53.6% identity in 125 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217154.1" /db_xref="GI:15609775" /db_xref="GeneID:888156" /translation="MGLITTEPRSSPHPLSPRLVHELGDPHSTLRATTDGSGAALLIH AGGEIDGRNEHLWRQLVTEAAAGVTAPGPLIVDVTGLDFMGCCAFAALADEAQRCRCR GIDLRLVSHQPIVARIAEAGGLSRVLPIYPTVDTALGKGTAGPARC" gene complement(2965026..2965358) /locus_tag="Rv2639c" /db_xref="GeneID:887428" CDS complement(2965026..2965358) /locus_tag="Rv2639c" /function="UNKNOWN" /note="Rv2639c, (MTCY441.09c), len: 110 aa. Probable conserved integral membrane protein, highly similar to many bacterial hypothetical or membrane proteins e.g. Q9X889|YE14_STRCO|SCE15.14 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (112 aa), FASTA scores: opt: 597, E(): 3.1e-31, (73.15% identity in 108 aa overlap); Q55939|Y793_SYNY3|SLL0793 POTENTIAL INTEGRAL MEMBRANE PROTEIN from Synechocystis sp. strain PCC 6803 (108 aa), FASTA scores: opt: 341, E(): 4.9e-15, (51.4% identity in 109 aa overlap); O31553|YFJF_BACSU POTENTIAL INTEGRAL MEMBRANE PROTEIN from Bacillus subtilis (109 aa), FASTA scores: opt: 334, E(): 1.4e-14, (47.5% identity in 109 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217155.1" /db_xref="GI:15609776" /db_xref="GeneID:887428" /translation="MVVRSILLFVLAAVAEIGGAWLVWQGVREQRGWLWAGLGVIALG VYGFFATLQPDAHFGRVLAAYGGVFVAGSLAWGMALDGFRPDRWDVIGALGCMAGVAV IMYAPRGH" gene complement(2965478..2965837) /locus_tag="Rv2640c" /db_xref="GeneID:888594" CDS complement(2965478..2965837) /locus_tag="Rv2640c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2640c, (MTCY441.10c), len: 119 aa. Possible transcriptional regulator, arsR family, highly similar to many e.g. Q9L1V5|SC4A9.07 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (117 aa), FASTA scores: opt: 261, E(): 5.6e-10, (47.75% identity in 103 aa overlap); Q9X8X8|SCH35.28c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (122 aa), FASTA scores: opt: 252, E(): 2.2e-09, (37.05% identity in 116 aa overlap); Q9L220|SC1A2.21 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (119 aa), FASTA scores: opt: 252, E(): 2.2e-09, (37.05% identity in 116 aa overlap); P77295|YGAV_ECOLI|B2667 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Escherichia coli strain K12 (99 aa), FASTA scores: opt: 156, E(): 0.0023, (34.1% identity in 88 aa overlap); etc. Also similar to upstream ORF P71941|Rv2642|MTCY441.12 PUTATIVE TRANSCRIPTIONAL REGULATORY PROTEIN from Mycobacterium tuberculosis (126 aa), FASTA scores: opt: 237, E(): 2e-08, (38.55% identity in 109 aa overlap). Contains helix-turn-helix motif at aa 59-80 (Score 1166, +3.16 SD). BELONGS TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="NP_217156.1" /db_xref="GI:15609777" /db_xref="GeneID:888594" /translation="MPKSLPVIDISAPVCCAPVAAGPMSDGDALAVALRLKALADPAR VKIMSYLFSSPAGEQVSGQLAAALSLSDGTVSHHLAQLRKAGLVISDRRGMHVFHRVH PEALQALCTVLNPNCCA" gene 2965939..2966397 /gene="cadI" /locus_tag="Rv2641" /db_xref="GeneID:888629" CDS 2965939..2966397 /gene="cadI" /locus_tag="Rv2641" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2641, (MTCY441.11), len: 152 aa. cadI, conserved hypothetical protein. Gene induced by cadmium (see Hotter et al., 2001), highly similar to hypothetical proteins e.g. Q9L222|SC1A2.19c from Streptomyces coelicolor (152 aa), FASTA scores: opt: 509, E(): 2.3e-27, (55.05% identity in 149 aa overlap); P45945|YQCK_BACSU from Bacillus subtilis (146 aa), FASTA scores: opt: 295, E(): 5.4e-13, (33.55% identity in 146 aa overlap); and Q98CF8|MLL5167 from Rhizobium loti (Mesorhizobium loti) (124 aa), FASTA scores: opt: 110, E(): 1.3, (31.4% identity in 121 aa overlap). Some similarity with Q10548|Y887_MYCTU|Rv0887c|MT0910|MTCY31.15c from Mycobacterium tuberculosis (152 aa), FASTA scores: opt: 108, E(): 2.1, (25.7% identity in 148 aa overlap)." /codon_start=1 /transl_table=11 /product="cadmium inducible protein CADI" /protein_id="NP_217157.1" /db_xref="GI:15609778" /db_xref="GeneID:888629" /translation="MSRVQLALNVDDLEAAITFYSRLFNAEPAKRKPGYANFAIADPP LKLVLLENPGTGGTLNHLGVEVGSSNTVHAEIARLTEAGLVTEKEIGTTCCFATQDKV WVTGPGGERWEVYTVLADSETFGSGPRHNDTSDGEASMCCDGQVAVGASG" gene 2966533..2966913 /locus_tag="Rv2642" /db_xref="GeneID:887703" CDS 2966533..2966913 /locus_tag="Rv2642" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2642, (MTCY441.12), len: 126 aa. Possible transcriptional regulator, arsR family, highly similar to many e.g. Q9X8X8|SCH35.28c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (122 aa), FASTA scores: opt: 390, E(): 3.7e-19, (56.55% identity in 122 aa overlap); Q9L220|SC1A2.21 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (119 aa), FASTA scores: opt: 378, E(): 2.3e-18, (59.8% identity in 97 aa overlap); Q9L1V5|SC4A9.07 PUTATIVE ARSR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (117 aa), FASTA scores: opt: 359, E(): 4.1e-17, (56.9% identity in 116 aa overlap); P52144|ARR2_ECOLI|ARSR from Escherichia coli (117 aa), FASTA scores: opt: 202, E(): 1e-06, (39.8% identity in 88 aa overlap); etc. Also similar to downstream ORF P71939|Rv2640c|MTCY441.10c PUTATIVE TRANSCRIPTIONAL REGULATORY PROTEIN from Mycobacterium tuberculosis (119 aa), FASTA scores: opt: 237, E(): 5e-09, (38.55% identity in 109 aa overlap); and others from Mycobacterium tuberculosis e.g. O05840|Rv2358|MTCY27.22c. Contains PS00846 Bacterial regulatory proteins, arsR family signature. Contains helix-turn-helix motif at aa 58-79 (Score 1112, +2.97 SD). BELONGS TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="ArsR family transcriptional regulator" /protein_id="NP_217158.1" /db_xref="GI:15609779" /db_xref="GeneID:887703" /translation="MSNLHPLPEVASCVVAPLVREPLNPPAAAEMAARFKALADPVRL QLLSSVASRAGGEACVCDISAGVEVSQPTISHHLKVLRDAGLLTSRRRASWVYYAVVP EALTVLSNLLSVHADAAPALGAPA" misc_feature 2966707..2966763 /locus_tag="Rv2642" /note="PS00846 Bacterial regulatory proteins, arsR family signature" gene 2966910..2968406 /gene="arsC" /locus_tag="Rv2643" /db_xref="GeneID:887674" CDS 2966910..2968406 /gene="arsC" /locus_tag="Rv2643" /function="INVOLVED IN TRANSPORT OF ARSENIC COMPOUNDS ACROSS THE MEMBRANE (EXPORT): ARSENIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2643, (MTCY441.13), len: 498 aa. Probable arsC, arsenical resistance transport integral membrane protein, highly similar or similar to others e.g. Q9L1X4|SC3D9.05 POSSIBLE ARSENIC RESISTANCE MEMBRANE TRANSPORT PROTEIN from Streptomyces coelicolor (368 aa), FASTA scores: opt: 1729, E(): 2.2e-96, (74.3% identity in 358 aa overlap); Q9X8Y0|SCH35.26 PUTATIVE HEAVY METAL RESISTANCE MEMBRANE PROTEIN from Streptomyces coelicolor (369 aa), FASTA scores: opt: 1729, E(): 2.2e-96, (73.8% identity in 359 aa overlap); Q06598|ACR3_YEAST|ACR3|YPR201W|P9677.2 ARSENICAL-RESISTANCE PROTEIN from Saccharomyces cerevisiae (Baker's yeast) (404 aa), FASTA scores: opt: 591, E(): 4e-28, (36.6% identity in 380 aa overlap); etc. BELONGS TO THE ACR3 FAMILY." /codon_start=1 /transl_table=11 /product="arsenic-transport integral membrane protein ArsC" /protein_id="NP_217159.1" /db_xref="GI:15609780" /db_xref="GeneID:887674" /translation="MTETVTRTAAPAVVGKLSTLDRFLPVWIGSAMAAGLLLGRWIPG LHTALEGVQLDGISLPIALGLLIMMYPVLAKVRYDRLDTVTGDRKLLLSSLLLNWVLG PALMFALAWLLLADLPEYRTGLIIVGLARCIAMVIIWNDLACGDREAAAVLVALNSIF QVAMFAALGWFYLSVLPGWLGLEQTTIATSPWQIAKSVLIFLGIPLLAGYLSRRIGEK TKGRNWYESRFLPKVGPWALYGLLFTIVILFALQGDQITGRPLDVARIALPLLAYFAI MWVGGYLLGAALRLGYRRTTTLAFTAASNNFELAIAVAIATYGATSGQALAGVVGPLI EVPVLVGLVYVSLALRNRLAGPNATHDADKPSVLFVCVHNAGRSQMAAGLLTHLAGDR IEVRSAGTEPAGQVNPTAVAAMAEMGIDITANAPTLLTGGQVQSSDVVITMGCGDACP YFPGVSYRNWKLPDPAGQPLDVVRMIRDDIADRVQALIAELLATAKTR" gene complement(2968533..2968850) /locus_tag="Rv2644c" /db_xref="GeneID:887366" CDS complement(2968533..2968850) /locus_tag="Rv2644c" /function="UNKNOWN" /note="Rv2644c, (MTCY441.14c), len: 105 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217160.1" /db_xref="GI:15609781" /db_xref="GeneID:887366" /translation="MSPRRTSGGVVPVDRYRIDEGLIVVLVFAGRDERRRTVCFADKF GCVHIGNPDLYRPQTSLPQPLPISSHAISGSRFVETTNRADQQEPIGPNRAELFDQAL HAG" gene complement(2969497..2969568) /locus_tag="Rvnt31" /note="tRNA-Val(CAC)" /db_xref="GeneID:2700447" tRNA complement(2969497..2969568) /locus_tag="Rvnt31" /product="tRNA-Val" /note="codon recognized: GUG" /anticodon=(pos:2969534..2969536,aa:Val) /db_xref="GeneID:2700447" gene 2969753..2969825 /locus_tag="Rvnt32" /note="tRNA-Gly(GCC)" /db_xref="GeneID:2700439" tRNA 2969753..2969825 /locus_tag="Rvnt32" /product="tRNA-Gly" /note="codon recognized: GGC" /anticodon=(pos:2969786..2969788,aa:Gly) /db_xref="GeneID:2700439" gene 2969855..2969925 /locus_tag="Rvnt33" /note="tRNA-Cys(GCA)" /db_xref="GeneID:2700440" tRNA 2969855..2969925 /locus_tag="Rvnt33" /product="tRNA-Cys" /note="codon recognized: UGC" /anticodon=(pos:2969887..2969889,aa:Cys) /db_xref="GeneID:2700440" gene 2969942..2970013 /locus_tag="Rvnt34" /note="tRNA-Val(GAC)" /db_xref="GeneID:2700437" tRNA 2969942..2970013 /locus_tag="Rvnt34" /product="tRNA-Val" /note="codon recognized: GUC" /anticodon=(pos:2969974..2969976,aa:Val) /db_xref="GeneID:2700437" gene 2970123..2970554 /locus_tag="Rv2645" /db_xref="GeneID:887799" CDS 2970123..2970554 /locus_tag="Rv2645" /function="UNKNOWN" /note="Rv2645, (MTCY441.15), len: 143 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217161.1" /db_xref="GI:15609782" /db_xref="GeneID:887799" /translation="MTTTPRQPLFCAHADTNGDPGRCACGQQLADVGPATPPPPWCEP GTEPIWEQLTERYGGVTICQWTRYFPAGDPVAADVWIAADDRVVDGRVLRTQPAIHYT EPPVLGIGPAAARRLAAELLNAADTLDDGRRQLDDLGEHRR" gene 2970551..2971549 /locus_tag="Rv2646" /db_xref="GeneID:887706" CDS 2970551..2971549 /locus_tag="Rv2646" /function="SEQUENCE INTEGRATION. INTEGRASE IS NECESSARY FOR INTEGRATION OF A PHAGE INTO THE HOST GENOME BY SITE-SPECIFIC RECOMBINATION. IN CONJUNCTION WITH EXCISIONASE, INTEGRASE IS ALSO NECESSARY FOR EXCISION OF THE PROPHAGE FROM THE HOST GENOME." /note="Rv2646, (MTCY441.16), len: 332 aa. Probable integrase, similar to others e.g. P06723|VINT_BP186|INT INTEGRASE from Bacteriophage 186 (336 aa)s FASTA scores: opt: 198, E(): 6.3e-05, (30.45% identity in 138 aa overlap). COULD BE BELONG TO THE 'PHAGE' INTEGRASE FAMILY." /codon_start=1 /transl_table=11 /product="integrase" /protein_id="NP_217162.1" /db_xref="GI:15609783" /db_xref="GeneID:887706" /translation="MNTATRVRLARKRADRLNLKLIKNGHHFRLRDADEITLAVGHLG VVEAFLAAAKSQNKPPGPPPSLHAPPSWRRDIDDYLLNLNAAGQRPATIRLRKTVLCA AAHGLGRPPADVTAEHLLDWLGKQQHLSPEGRKTYRSTLRGFFVWAYEMDRVRDYVAD SLPKVRCPKQPPRPAGDDVWQAALAKADRRIELMIRLAGEAGLRRAEAAQAHTGDLMD GGLLLVHGKGGKRRIVPISDYLAALIRDTPHGYLFPNGTGGHLTAEHVGKLVSRALPG DATMHTLRHRYATRAYRGSHNLRAVQQLLGHASIVTTERYTALCDDEVRAAAAAAW" gene 2971659..2972027 /locus_tag="Rv2647" /db_xref="GeneID:885870" CDS 2971659..2972027 /locus_tag="Rv2647" /function="UNKNOWN" /note="Rv2647, (MTCY441.17), len: 122 aa (questionable ORF). Hypothetical protein, probably corresponds to conserved DNA sequence also found in MTCY336.29c and Rv1574|MTCY336.30c|O06616 HYPOTHETICAL 11.4 KDA PROTEIN from Mycobacterium tuberculosis (103 aa), FASTA scores: opt: 170, E(): 0.0002, (69.05% identity in 42 aa overlap). Shows weak similarity with Q9EUM1|RESB RESOLVASE PROTEIN HOMOLOG from Corynebacterium glutamicum (Brevibacterium flavum) (343 aa), FASTA scores: opt: 112, E(): 2.9, (31.05% identity in 87 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217163.1" /db_xref="GI:15609784" /db_xref="GeneID:885870" /translation="MHVCHTIADVVDRAKAERSENTLRKDFTPSELLAAGRRIAELER PKAKQRQREGGDHGRQARYSGLGSMEPKPESERDAHKADTAISEALGISRGHYQRLKR IDNATRSEAGYRDGLNGWSG" repeat_region 2972106..2972108 /note="3 bp direct repeat: TCG at 5'-end of IS6110" repeat_region 2972109..2973463 /note="IS6110-10, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-10" repeat_region 2972109..2972136 /note="28 bp inverted repeat: TGAACCGCCCCGGCATGTCCGGAGACTC at the left end of IS6110" gene 2972160..2972486 /locus_tag="Rv2648" /db_xref="GeneID:887828" CDS 2972160..2972486 /locus_tag="Rv2648" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2648, (MTCY441.17A), len: 108 aa. Probable transposase for IS6110." /codon_start=1 /transl_table=11 /product="transposase IS6110" /protein_id="NP_217164.1" /db_xref="GI:15609785" /db_xref="GeneID:887828" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 2972435..2973421 /locus_tag="Rv2649" /db_xref="GeneID:888553" CDS <2972435..2973421 /locus_tag="Rv2649" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2649, (MTCY441.18), len: 328 aa. Probable transposase for IS6110." /codon_start=1 /transl_table=11 /product="transposase IS6110" /protein_id="NP_217165.1" /db_xref="GI:15609786" /db_xref="GeneID:888553" /translation="KDRVGFLRGRARPASTLITRFIADHQGHREGPDGLRWGVESICT QLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNR EGIEVARCTVERLMTKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWV ADLTYVSTWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLD LKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKP GKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(2973436..2973463) /note="28 bp inverted repeat, TGAACCGCCCCGGTGAGTCCGGAGACTC, at the right end of IS6110." repeat_region 2973464..2973466 /note="3 bp direct repeat: TCG at 3'-end of IS6110" gene complement(2973795..2975234) /locus_tag="Rv2650c" /db_xref="GeneID:887478" CDS complement(2973795..2975234) /locus_tag="Rv2650c" /function="UNKNOWN" /note="Rv2650c, (MTCY441.19), len: 479 aa. Possible phiRv2 prophage protein (capsid subunit) (see citation below), highly similar to O06614|Rv1576c|MTCY336.28 PROBABLE phiRv1 PHAGE PROTEIN from Mycobacterium tuberculosis (473 aa), FASTA scores: opt: 2782, E(): 2.8e-159, (89.1% identity in 468 aa overlap)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217166.1" /db_xref="GI:15609787" /db_xref="GeneID:887478" /translation="MTNEQHFADDGDIKQLSLDETRSAAKQLLDSVEGDLTGDVAQRF QALTRHAEELRAEQRRRGREAEEALRRCRAGELRVVPGAPTGGDDGDAPPGNSLRDIA FRTLDVCVRDGLMSSRAAEAAETLCRTGPPQSTSWAQRWLAATGNRDYLGAFVKRVSN PVAGHTTWTDREAAAWREAAAVAAEQRAMGLVDTAGGFLIPAALDPAILLSGDGSTNP IRQVARVVQTTSEVWRGVTSEGAEAHWYSEAQEVSDDSPTLAQPAVPSYRGSCWIPFS LEIEGDAAGFVAEVGRVLADSVEQLQAAAFVSGSGNGEPTGFVSALTGTADYTVTGAG TEAVVAADVYALQSALPPRFQSNSAFAANLSTINVLRQAETANGALKFPSLHASPPML AGKHIWEVSNMDTVDAAVTATNYPLVLGDWKQFIITDRVGSTVELVPHVFGGNRRPTG QRGFFCWFRVGSDVLVDNAFRVLKVQTTA" gene complement(2975242..2975775) /locus_tag="Rv2651c" /db_xref="GeneID:887837" CDS complement(2975242..2975775) /locus_tag="Rv2651c" /EC_number="3.4.-.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS." /note="Rv2651c, (MTCY441.20c), len: 177 aa. Possible protease protein (EC 3.4.-.-), phiRv2 phage protein (prohead protease) (see citation below), showing some similarity with several proteases e.g. Q9A4P4|CC2786 PUTATIVE PROTEASE from Caulobacter crescentus (138 aa), FASTA scores: opt: 206, E(): 2e-06, (36.35% identity in 132 aa overlap); Q9RNH0 PUTATIVE PROHEAD PROTEASE from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (184 aa), FASTA scores: opt: 196, E(): 1.1e-05, (35.05% identity in 137 aa overlap); BAB35014|ECS1591 PUTATIVE PROHEAD PROTEASE from Escherichia coli strain O157:H7 (185 aa), FASTA scores: opt: 187, E(): 4.1e-05, (32.9% identity in 158 aa overlap); etc. And highly similar to O06613|Rv1577c|MTCY336.27 Probable phiRV1 phage protein from Mycobacterium tuberculosis (170 aa), FASTA scores: opt: 987, E(): 2.3e-56, (89.35% identity in 169 aa overlap)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protease" /protein_id="NP_217167.1" /db_xref="GI:15609788" /db_xref="GeneID:887837" /translation="MSSILFRTAELRPGEGRTVYGVIVPYGEVTTVRDLDGEFREMFA PGAFRRSIAERGHKVKLLVSHDARTRYPVGRAVELREEPHGLFGAFELANTPDGDEAL ANVKAGVVDAFSVGFRPIRDRREGDVIVRVEAALLEVSLTGVPAYLGAQIAGVRAESL AVVSRSLAEARLALMDW" gene complement(2975928..2976554) /locus_tag="Rv2652c" /db_xref="GeneID:888577" CDS complement(2975928..2976554) /locus_tag="Rv2652c" /function="UNKNOWN" /note="Rv2652c, (MTCY441.21c), len: 208 aa. Probable phiRv2 phage protein (terminase) (see citation below), showing some similarity with AAK79859|Q97HW1|CAC1896 PHAGE TERMINASE-LIKE PROTEIN (SMALL SUBUNIT) from Clostridium acetobutylicum (151 aa), FASTA scores: opt: 155, E(): 0.012, (24.7% identity in 158 aa overlap); and Q9B019 HYPOTHETICAL 17.8 KDA PROTEIN from Bacteriophage GMSE-1 (159 aa), FASTA scores: opt: 141, E(): 0.087, (27.65% identity in 159 aa overlap). Also highly similar to O06612|Rv1578c|MTCY336.26 Probable phiRV1 phage protein from Mycobacterium tuberculosis (156 aa), FASTA scores: opt: 448, E(): 1.2e-20, (48.1% identity in 156 aa overlap). Equivalent to AAK47043 from Mycobacterium tuberculosis strain CDC1551 but longer 45 aa." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217168.1" /db_xref="GI:15609789" /db_xref="GeneID:888577" /translation="MPSPATARPDTATVGERVRAQVLWGVFWHHGIRDPKPGKRRVVL KMGRRGPAPAPAQLKLLGGRSPGRDSGGRRVTPPAAFERVAPECPDWLPPGAKDMWGR VVPELAALNLLKESDLGVLTSFCVAWDQLMQAVTAYREQGFIATNARSRRVTVHPAVA AARAATRDVLVLARELGCTPSAEANLAAVLAAAGDPDDDEFNPFAPDR" gene complement(2976586..2976909) /locus_tag="Rv2653c" /db_xref="GeneID:887367" CDS complement(2976586..2976909) /locus_tag="Rv2653c" /function="UNKNOWN" /note="Rv2653c, (MTCY441.22c), len: 107 aa. Hypothetical unknown protein, possibly phiRv2 phage protein (see citation below)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217169.1" /db_xref="GI:15609790" /db_xref="GeneID:887367" /translation="MTHKRTKRQPAIAAGLNAPRRNRVGRQHGWPADVPSAEQRRAQR QRDLEAIRRAYAEMVATSHEIDDDTAELALLSMHLDDEQRRLEAGMKLGWHPYHFPDE PDSKQ" gene complement(2976989..2977234) /locus_tag="Rv2654c" /db_xref="GeneID:888154" CDS complement(2976989..2977234) /locus_tag="Rv2654c" /function="UNKNOWN" /note="Rv2654c, (MTCY441.23c), len: 81 aa. Hypothetical ala-rich protein, possibly phiRv2 phage protein (see citation below), similar to C-terminus of Q9HNI3|VNG2091H HYPOTHETICAL PROTEIN from Halobacterium sp. strain NRC-1 (212 aa), FASTA scores: opt: 122, E(): 0.46, (43.05% identity in 79 aa overlap)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217170.1" /db_xref="GI:15609791" /db_xref="GeneID:888154" /translation="MSGHALAARTLLAAADELVGGPPVEASAAALAGDAAGAWRTAAV ELARALVRAVAESHGVAAVLFAATAAAAAAVDRGDPP" gene complement(2977231..2978658) /locus_tag="Rv2655c" /db_xref="GeneID:887388" CDS complement(2977231..2978658) /locus_tag="Rv2655c" /function="UNKNOWN" /note="Rv2655c, (MTCY441.24c), len: 475 aa. Hypothetical protein, possibly phiRv2 phage protein (putative primase-like protein) (see citation below). C-terminus similar to P22875|YXIS_SACER HYPOTHETICAL 28.9 KDA PROTEIN (PROBABLY DOES NOT PLAY A DIRECT ROLE IN PLASMID INTEGRATION OR EXCISION) from Saccharopolyspora erythraea (Streptomyces erythraeus) plasmid pSE211 (263 aa), FASTA scores: opt: 389, E(): 2.7e-15, (33.45% identity in 269 aa overlap). Weak similarity in N-terminus to O06608|MTCY336.22|Rv1582c Probable phiRV1 phage protein from Mycobacterium tuberculosis (471 aa), FASTA scores: opt: 133, E(): 2.5, (36.0% identity in 75 aa overlap)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217171.1" /db_xref="GI:15609792" /db_xref="GeneID:887388" /translation="MADIPYGRDYPDPIWCDEDGQPMPPVGAELLDDIRAFLRRFVVY PSDHELIAHTLWIAHCWFMEAWDSTPRIAFLSPEPGSGKSRALEVTEPLVPRPVHAIN CTPAYLFRRVADPVGRPTVLYDECDTLFGPKAKEHEEIRGVINAGHRKGAVAGRCVIR GKIVETEELPAYCAVALAGLDDLPDTIMSRSIVVRMRRRAPTEPVEPWRPRVNGPEAE KLHDRLANWAAAINPLESGWPAMPDGVTDRRADVWESLVAVADTAGGHWPKTARATAE TDATANRGAKPSIGVLLLRDIRRVFSDRDRMRTSDILTGLNRMEEGPWGSIRRGDPLD ARGLATRLGRYGIGPKFQHSGGEPPYKGYSRTQFEDAWSRYLSADDETPEERDLSVSA VSAVSPPVGDPGDATGATDATDLPEAGDLPYEPPAPNGHPNGDAPLCSGPGCPNKLLS TEAKAAGKCRPCRGRAAASARDGAR" gene complement(2978660..2979052) /locus_tag="Rv2656c" /db_xref="GeneID:888179" CDS complement(2978660..2979052) /locus_tag="Rv2656c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2656c, (MTCY441.25c), len: 130 aa. Probable phiRv2 phage protein (see Hatfull 2000), highly similar to O06607|YF83_MYCTU|Rv1583c|MT3573.2|MTCY336.21 Probable phiRV1 phage protein from Mycobacterium tuberculosis (132 aa), FASTA scores: opt: 734, E(): 2.5e-39, (81.5% identity in 131 aa overlap); and some similarity with Q982T4|MLL8506 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (204 aa), FASTA scores: opt: 104, E(): 9.7, (31.85% identity in 113 aa overlap)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217172.1" /db_xref="GI:15609793" /db_xref="GeneID:888179" /translation="MTAVGGSPPTRRCPATEDRAPATVATPSSTDPTASRAVSWWSVH EYVAPTLAAAVEWPMAGTPAWCDLDDTDPVKWAAICDAARHWALRVETCQAASAEASR DVSAAADWPAVSREIQRRRDAYIRRVVV" gene complement(2979049..2979309) /locus_tag="Rv2657c" /db_xref="GeneID:887399" CDS complement(2979049..2979309) /locus_tag="Rv2657c" /function="UNKNOWN" /note="Rv2657c, (MTCY441.26c), len: 86 aa. Probable phiRv2 phage protein (excisionase) (see citation below), similar to O22001|VG36_BPMD2|36|G2 GENE 36 PROTEIN (GP36) from Mycobacteriophage D29 (56 aa), FASTA scores: opt: 171, E(): 9.6e-06, (48.0% identity in 50 aa overlap); and Q05246|VG36_BPML5|36 GENE 36 PROTEIN (GP36) from Mycobacteriophage L5 (56 aa), FASTA scores: opt: 169, E(): 1.3e-05, (50% identity in 50 aa overlap). Similarity suggests alternative start at 21737. Contains possible helix-turn-helix motif from aa 33 to 54 (Score 1655, +4.82 SD)." /codon_start=1 /transl_table=11 /product="phiRv2 prophage protein" /protein_id="NP_217173.1" /db_xref="GI:15609794" /db_xref="GeneID:887399" /translation="MCAFPSPSLGWTVSHETERPGMADAPPLSRRYITISEAAEYLAV TDRTVRQMIADGRLRGYRSGTRLVRLRRDEVDGAMHPFGGAA" gene complement(2979326..2979688) /locus_tag="Rv2658c" /db_xref="GeneID:888562" CDS complement(2979326..2979688) /locus_tag="Rv2658c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2658c, (MTCY441.27c), len: 120 aa. Hypothetical unknown protein, probably phage protein." /codon_start=1 /transl_table=11 /product="prophage protein" /protein_id="NP_217174.1" /db_xref="GI:15609795" /db_xref="GeneID:888562" /translation="MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFT ETQWGRHIEWKLECRACRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEA RHVIPFSALCLRLSQLGG" gene complement(2979691..2980818) /locus_tag="Rv2659c" /db_xref="GeneID:885098" CDS complement(2979691..2980818) /locus_tag="Rv2659c" /function="SEQUENCE INTEGRATION. INTEGRASE IS NECESSARY FOR INTEGRATION OF A PHAGE INTO THE HOST GENOME BY SITE-SPECIFIC RECOMBINATION. IN CONJUNCTION WITH EXCISIONASE, INTEGRASE IS ALSO NECESSARY FOR EXCISION OF THE PROPHAGE FROM THE HOST GENOME." /experiment="experimental evidence, no additional details recorded" /note="Rv2659c, (MTCY441.28c), len: 375 aa. Probable integrase, phiRv2 phage protein: putative member of the phage integrase family of tyrosine recombinases (see Hatfull 2000), highly similar to others e.g. P22884|VINT_BPML5|33|INT from Mycobacteriophage L5 (371 aa), FASTA scores: opt: 836, E(): 1.2e-44, (39.0% identity in 372 aa overlap); Q38361|VINT_BPMD2|33|INT from Mycobacteriophage D29 (333 aa), FASTA scores: opt: 786, E(): 1.4e-41, (40.55% identity in 338 aa overlap); etc. SEEMS BELONGS TO THE 'PHAGE' INTEGRASE FAMILY." /codon_start=1 /transl_table=11 /product="phiRv2 prophage integrase" /protein_id="NP_217175.1" /db_xref="GI:15609796" /db_xref="GeneID:885098" /translation="MTQTGKRQRRKFGRIRQFNSGRWQASYTGPDGRVYIAPKTFNAK IDAEAWLTDRRREIDRQLWSPASGQEDRPGAPFGEYAEGWLKQRGIKDRTRAHYRKLL DNHILATFADTDLRDITPAAVRRWYATTAVGTPTMRAHSYSLLRAIMQTALADDLIDS NPCRISGASTARRVHKIRPATLDELETITKAMPDPYQAFVLMAAWLAMRYGELTELRR KDIDLHGEVARVRRAVVRVGEGFKVTTPKSDAGVRDISIPPHLIPAIEDHLHKHVNPG RESLLFPSVNDPNRHLAPSALYRMFYKARKAAGRPDLRVHDLRHSGAVLAASTGATLA ELMQRLGHSTAGAALRYQHAAKGRDREIAALLSKLAENQEM" gene complement(2980963..2981190) /locus_tag="Rv2660c" /db_xref="GeneID:887222" CDS complement(2980963..2981190) /locus_tag="Rv2660c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2660c, (MTCY441.29c), len: 75 aa (questionable orf). Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217176.1" /db_xref="GI:15609797" /db_xref="GeneID:887222" /translation="MIAGVDQALAATGQASQRAAGASGGVTVGVGVGTEQRNLSVVAP SQFTFSSRSPDFVDETAGQSWCAILGLNQFH" gene complement(2981187..2981576) /locus_tag="Rv2661c" /db_xref="GeneID:888545" CDS complement(2981187..2981576) /locus_tag="Rv2661c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2661c, (MTCY441.30c), len: 129 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217177.1" /db_xref="GI:15609798" /db_xref="GeneID:888545" /translation="MRARSDAGGQSVKSRTSNRSRSSRRSRVRSSISALVDNPQARPR ELPVLCGWPVVRVEPVCEFVPEPVCGQAEVLGEPAAAHRVTSARRSPSTTVCSRSQKA SAVVISSVSSVARVRRASVSSVDATTA" gene 2981482..2981754 /locus_tag="Rv2662" /db_xref="GeneID:888589" CDS 2981482..2981754 /locus_tag="Rv2662" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2662, (MTCY441.31), len: 90 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217178.1" /db_xref="GI:15609799" /db_xref="GeneID:888589" /translation="MDDLTRLRRELLDRFDVRDFTDWPPASLRALIATYDPWIDMTAS PPQPVSPGGPRLRLVRLTTNPSARAAPIGNGGDSSVCAGEKQCRPP" gene 2981853..2982086 /locus_tag="Rv2663" /db_xref="GeneID:888561" CDS 2981853..2982086 /locus_tag="Rv2663" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2663, (MTCY441.32), len: 77 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217179.1" /db_xref="GI:15609800" /db_xref="GeneID:888561" /translation="MEVRASARKHGINDDAMLHAYRNALRYVELEYHGEVQLLVIGPD QTGRLLELVIPADEPPRIIHANVLRPKFYDYLR" gene 2982097..2982351 /locus_tag="Rv2664" /db_xref="GeneID:888501" CDS 2982097..2982351 /locus_tag="Rv2664" /function="UNKNOWN" /note="Rv2664, (MTCY441.33), len: 84 aa. Hypothetical protein. Some weak similarity to nearby P71964|Rv2667|clpX'|MT2741|MTCY441.36 POSSIBLE ATP-DEPENDENT PROTEASE ATP-BINDING SUBUNIT from Mycobacterium tuberculosis (252 aa), FASTA scores: opt: 134, E(): 0.027, (31.15% identity in 77 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217180.1" /db_xref="GI:15609801" /db_xref="GeneID:888501" /translation="MKHKTDIDEWLDTIEPNPADAHDASHLRRIIAAKEAVQTAESEL RAAVNAARAAGDTWAAIGVALGITRQAAFQRFGPHSTASP" gene 2982699..2982980 /locus_tag="Rv2665" /db_xref="GeneID:887748" CDS 2982699..2982980 /locus_tag="Rv2665" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2665, (MTCY441.34), len: 93 aa. Hypothetical arg-rich protein, showing some similarity to N-terminus of P71640|Rv2811|MTCY16B7.32c HYPOTHETICAL 21.1 KDA PROTEIN from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 157, E(): 0.0011, (37.5% identity in 72 aa overlap); and also to part of O35132|CP2B_RAT|CYP27B1|CYP27B 25-HYDROXYVITAMIN D-1 ALPHA HYDROXYLASE, MITOCHONDRIAL PRECURSOR from Rattus norvegicus (Rat) (501 aa), FASTA scores: opt: 106, E(): 5.4, (34.5% identity in 87 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217181.1" /db_xref="GI:15609802" /db_xref="GeneID:887748" /translation="MIVVRTAEAAEQALTEGQLVCPRRGCGDTLRRWRYGRRRHVRSL GSQVIDVRPQRVRCRRCESTHVLLPAALQPRLGRGGGGQLRPGVWCTGR" repeat_region 2982946..2983854 /note="IS1081'-4, len: 909 bp. Defective Insertion sequence IS1081 element; truncated at 3'-end." /mobile_element="insertion sequence:IS1081'-4" repeat_region 2983019..2983033 /note="15 bp Inverted repeat at the left end of IS1081:TCGCGTGATCCTTCG, right end copy is missing" gene 2983071..2983874 /locus_tag="Rv2666" /db_xref="GeneID:888904" CDS 2983071..2983874 /locus_tag="Rv2666" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1081." /note="Rv2666, (MTCY441.35), len: 267 aa. Probable transposase (fragment), identical in region of overlap to P35882|TRA1_MYCBO|TRA1_MYCTU TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1081 from Mycobacterium tuberculosis or bovis (415 aa). Last 4 codons not part of gene. Contains PS01007 Transposases, Mutator family, signature." /codon_start=1 /transl_table=11 /product="truncated IS1081 transposase" /protein_id="NP_217182.1" /db_xref="GI:15609803" /db_xref="GeneID:888904" /translation="MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANHGRHNA" misc_feature 2983767..2983841 /locus_tag="Rv2666" /note="PS01007 Transposases, Mutator family, signature" gene 2983896..2984654 /gene="clpC2" /locus_tag="Rv2667" /db_xref="GeneID:887415" CDS 2983896..2984654 /gene="clpC2" /locus_tag="Rv2667" /EC_number="3.4.-.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS IN PRESENCE OF ATP." /experiment="experimental evidence, no additional details recorded" /note="Rv2667, (MTCY441.36), len: 252 aa. Possible clpC2, ATP-dependent protease atp-binding subunit (EC 3.4.-.-), highly similar to Q9X8L2|SCE9.40 HYPOTHETICAL 27.3 KDA PROTEIN from Streptomyces coelicolor (258 aa), FASTA scores: opt: 877, E(): 2.2e-46, (57.25% identity in 255 aa overlap). The second half of the protein is highly similar to N-terminal of several CLP-FAMILY proteins e.g. P24428|CLPC_MYCLE|ML0235 PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT from Mycobacterium leprae (848 aa), FASTA scores: opt: 307, E(): 3.2e-11, (38.6% identity in 158 aa overlap); O06286|CLPC_MYCTU|Rv3596c|MT3703|MTCY07H7B.26 PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT from Mycobacterium tuberculosis (848 aa), FASTA scores: opt: 307, E(): 3.2e-11, (38.6% identity in 158 aa overlap); Q9S6T8|SCE94.24c PUTATIVE CLP-FAMILY ATP-BINDING PROTEASE from Streptomyces coelicolor (841 aa), FASTA scores: opt: 303, E(): 5.6e-11, (38.8% identity in 152 aa overlap); etc. Some weak similarity to nearby P71961|MTCY441.33|Rv2664 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (83 aa). Contain Pfam match to entry PF02861 Clp amino terminal domain. BELONGS TO THE CLPA/CLPB FAMILY. CLPC SUBFAMILY. Note that previously known as clpX'; clpX'" /codon_start=1 /transl_table=11 /product="ATP-dependent protease ATP-binding subunit ClpC2" /protein_id="YP_177897.1" /db_xref="GI:57117011" /db_xref="GeneID:887415" /translation="MPEPTPTAYPVRLDELINAIKRVHSDVLDQLSDAVLAAEHLGEI ADHLIGHFVDQARRSGASWSDIGKSMGVTKQAAQKRFVPRAEATTLDSNQGFRRFTPR ARNAVVAAQNAAHGAASSEITPDHLLLGVLTDPAALATALLQQQEIDIATLRTAVTLP PAVTEPPQPIPFSGPARKVLELTFREALRLGHNYIGTEHLLLALLELEDGDGPLHRSG VDKSRAEADLITTLASLTGANAAGATDAGATDAG" gene 2984733..2985254 /locus_tag="Rv2668" /db_xref="GeneID:887993" CDS 2984733..2985254 /locus_tag="Rv2668" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2668, (MTCY441.37), len: 173 aa. Hypothetical ala-, val-rich protein, possibly exported. Equivalent to AAK47057 from Mycobacterium tuberculosis strain CDC1551 (208 aa) but N-terminal part shorter 35 aa and with few differences. Has potential signal peptide sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217184.1" /db_xref="GI:15609805" /db_xref="GeneID:887993" /translation="MRHWLIVLATLLVAAAGVAAANDVPRAWAGDAPIGHIGDTLRVD TGTYVADVTVSSVVPVDPPPGFGYTRSGVPVKSFPDSSVTRADVTVRAVRVPNSFILA TNFSFTGVTPFADAYKPRPCDASDWLDAALGNAPQGSIVRGGVYWDAYRDPVSVVVLL DEKTGQHLAQWNL" gene 2985283..2985753 /locus_tag="Rv2669" /db_xref="GeneID:887699" CDS 2985283..2985753 /locus_tag="Rv2669" /function="UNKNOWN" /note="Rv2669, (MTCY441.38), len: 156 aa. Conserved hypothetical protein, showing some similarity to various proteins e.g. Q9A6M0|CC2073 ACETYLTRANSFERASE (GNAT FAMILY) from Caulobacter crescentus (178 aa), FASTA scores: opt: 242, E(): 1.2e-09, (30.9% identity in 165 aa overlap); Q99RQ8|SA2159 hypothetical protein similar to transcription repressor of sporulation, septation and degradation paiA from Staphylococcus aureus subsp. aureus N315 (171 aa), FASTA scores: opt: 214, E(): 9.8e-08, (27.5% identity in 160 aa overlap); BAB58531|SAV2369 HYPOTHETICAL 20.1 KDA PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (171 aa), FASTA scores: opt: 214, E(): 9.8e-08, (27.5% identity in 160 aa overlap); P21340|PAIA_BACSU|O32112 PROTEASE SYNTHASE AND SPORULATION from Bacillus subtilis (171 aa), FASTA scores: opt: 209, E(): 2.1e-07, (22.85% identity in 162 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217185.1" /db_xref="GI:15609806" /db_xref="GeneID:887699" /translation="MTDADELAAVAARTFPLACPPAVAPEHIASFVDANLSSARFAEY LTDPRRAILTARHDGRIVGYAMLIRGDDRDVELSKLYLLPGYHGTGAAAALMHKVLAT AADWGALRVWLGVNQKNQRAQRFYAKTGFKINGTRTFRLGAHHENDYVMVRELV" gene complement(2985731..2986840) /locus_tag="Rv2670c" /db_xref="GeneID:887407" CDS complement(2985731..2986840) /locus_tag="Rv2670c" /function="UNKNOWN" /note="Rv2670c, (MTCY441.39c), len: 369 aa. Conserved hypothetical protein, equivalent, but longer 164 aa, to O05683|MLC1351.22c HYPOTHETICAL 17.3 KDA PROTEIN from Mycobacterium leprae (160 aa), FASTA scores: opt: 847, E(): 1.2e-45, (82.4% identity in 159 aa overlap). And highly similar to Q9X824|SC9B1.04c PUTATIVE ATP/GTP-BINDING INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (350 aa), FASTA scores: opt: 1169, E(): 2e-65, (56.85% identity in 343 aa overlap); and Q9RWB0|DR0759 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (351 aa), FASTA scores: opt: 859, E(): 4e-46, (45.9% identity in 331 aa overlap). Also some similarity with other proteins e.g. P46442|YHCM_ECOLI|AAG58360|BAB37528 HYPOTHETICAL PROTEIN from Escherichia coli strains K12 and O157:H7 (375 aa), FASTA scores: opt: 237, E(): 2.1e-07, (28.0% identity in 325 aa overlap); Q9JRK2|NMA1520|NMB1306 PUTATIVE NUCLEOTIDE-BINDING PROTEIN from Neisseria meningitidis (serogroup A and B) (383 aa), FASTA scores: opt: 221, E(): 2.1e-06, (27.8% identity in 356 aa overlap); Q9HVX7|PA4438 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (364 aa), FASTA scores: opt: 211, E(): 8.5e-06, (28.9% identity in 353 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217186.1" /db_xref="GI:15609807" /db_xref="GeneID:887407" /translation="MTLIAARRYSATMHGSASEACGSVDHLVDRHPTVSPVRLIAQLR PPPTFAEVSFATYRPDPVEPTQAAAVVACQDFCRQAVERRAGRKKWFGKRDVLPGVGL YLDGGFGVGKTHLLASAYYQLPGTGPDAPTCPKAFATFGELTQLAGVFGFADCIDLLA NYTALCIDEFELDDPGNTTLISRLLSALVERGVSVAATSNTLPEQLGEGRFAAQDFLR EINTLASIFTTVRIEGPDYRHRDLPPAPAPLSDEEVAARAARVEGATLDDFDALCAHL ATMHPSRYLTLIEGVTAVFLTGVHGIDDQNVALRLVALVDRLYDAGIPVVASGAKLDT IFSEEMLAGGYRKKYLRATSRLLALTAGVIQAREP" misc_feature complement(2986502..2986525) /locus_tag="Rv2670c" /note="PS00017 ATP/GTP-binding site motif A" gene 2986839..2987615 /gene="ribD" /locus_tag="Rv2671" /db_xref="GeneID:887389" CDS 2986839..2987615 /gene="ribD" /locus_tag="Rv2671" /function="INVOLVED IN RIBOFLAVIN BIOSYNTHESIS (AT THE SECOND AND THIRD STEPS). CONVERTS 2,5-DIAMINO-6-(RIBOSYLAMINO)-4(3H)-PYRIMIDINONE 5'-PHOSPHATE INTO 5-AMINO-6-(RIBOSYLAMINO)-2,4(1H,3H)-PYRIMIDINEDIONE 5'-PHOSPHATE [CATALYTIC ACTIVITY 1: 2,5-DIAMINO-6-HYDROXY-4-(5-PHOSPHORIBOSYLAMINO)PYRIMIDINE + H(2)O = 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL + NH(3)] [CATALYTIC ACTIVITY 2: 5-AMINO-6-(5-PHOSPHORIBITYLAMINO)URACIL + NADP(+) = 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL + NADPH]." /note="Rv2671, (MTCY441.40), len: 258 aa. Possible ribD (alternate gene name: ribG), bifunctional riboflavin biosynthesis protein incuding diaminohydroxyphosphoribosylaminopyrimidine deaminase and 5-amino-6-(5-phosphoribosylamino) uracil reductase (EC 3.5.4.26 and 1.1.1.193), highly similar to O05684|MLC1351.23|ML1340 POSSIBLE REDUCTASE from Mycobacterium leprae (268 aa), FASTA scores: opt: 1211, E(): 3e-68, (72.9% identity in 251 aa overlap). Also weakly similar to others e.g. Q9HWX2|RIBD|PA4056 RIBOFLAVIN-SPECIFIC DEAMINASE/REDUCTASE from Pseudomonas aeruginosa (373 aa), FASTA scores: opt: 211, E(): 6.3e-06, (30.1% identity in 216 aa overlap); Q9HQA1|RIBG|VNG1256G RIBOFLAVIN-SPECIFIC DEAMINASE from Halobacterium sp. strain NRC-1 (220 aa), FASTA scores: opt: 202, E(): 1.5e-05, (27.0% identity in 174 aa overlap); O28272|RIB7_ARCFU|AF2007 PUTATIVE 5-AMINO-6-(5-PHOSPHORIBOSYLAMINO)URACIL REDUCTASE (HTP REDUCTASE) (EC 1.1.1.193) from Archaeoglobus fulgidus (219 aa), FASTA scores: opt: 209, E(): 5.4e-06, (24.15% identity in 211 aa overlap); P25539|RIBD_ECOLI|RIBG|B0414 from Escherichia coli strain K12 (367 aa), FASTA scores: opt: 185, E(): 0.00026, (26.7% identity in 221 aa overlap); etc. But also similar to several hydrolases e.g. Q9X825|SC9B1.05 PUTATIVE HYDROLASE from Streptomyces coelicolor (265 aa), FASTA scores: opt: 536, E(): 2.9e-26, (44.25% identity in 235 aa overlap); Q9RKM1|SCD17.10 PUTATIVE BIFUNCTIONAL ENZYME DEAMINASE/REDUCTASE from Streptomyces coelicolor (376 aa), FASTA scores: opt: 228, E(): 5.6e-07, (33.5% identity in 188 aa overlap); etc. Equivalent to AAK47060 from Mycobacterium tuberculosis strain CDC1551 (239 aa) but longer 19 aa. SUPPOSED BELONG TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY IN THE N-TERMINAL SECTION; and TO THE HTP REDUCTASE FAMILY IN THE C-TERMINAL SECTION.; ribG" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217187.1" /db_xref="GI:15609808" /db_xref="GeneID:887389" /translation="MPDSGQLGAADTPLRLLSSVHYLTDGELPQLYDYPDDGTWLRAN FISSLDGGATVDGTSGAMAGPGDRFVFNLLRELADVIVVGVGTVRIEGYSGVRMGVVQ RQHRQARGQSEVPQLAIVTRSGRLDRDMAVFTRTEMAPLVLTTTAVADDTRQRLAGLA EVIACSGDDPGTVDEAVLVSQLAARGLRRILTEGGPTLLGTFVERDVLDELCLTIAPY VVGGLARRIVTGPGQVLTRMRCAHVLTDDSGYLYTRYVKT" gene 2987682..2989268 /locus_tag="Rv2672" /db_xref="GeneID:887398" CDS 2987682..2989268 /locus_tag="Rv2672" /EC_number="3.4.-.-" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS." /note="Rv2672, (MTCY441.41), len: 528 aa. Possible secreted protease (EC 3.4.-.-), equivalent to O05685|MLC1351.24|ML1339 PUTATIVE SECRETED PROTEASE from Mycobacterium leprae (525 aa), FASTA scores: opt: 2722, E(): 9.4e-140, (74.45% identity in 528 aa overlap). Also similar to several exported proteinases from Streptomyces and Mycobacteria e.g. Q54399|SLPE PROTEINASE from Streptomyces lividans (513 aa), FASTA scores: opt: 429, E(): 6.8e-16, (26.2% identity in 538 aa overlap); Q9FCK9|2SC3B6.03c PEPTIDASE from Streptomyces coelicolor (513 aa), FASTA scores: opt: 421, E(): 1.8e-15, (26.45% identity in 541 aa overlap); Q10508|YM23_MYCTU from Mycobacterium tuberculosis (520 aa), FASTA scores: opt: 349, E(): 1.4e-11, (26.6% identity in 523 aa overlap); etc. Equivalent to AAK47061 from Mycobacterium tuberculosis strain CDC1551 (518 aa) but longer 10 aa." /codon_start=1 /transl_table=11 /product="secreted protease" /protein_id="NP_217188.1" /db_xref="GI:15609809" /db_xref="GeneID:887398" /translation="MATVVGMSRPMTSTAMLVALTCSATVLAACVPAFGADPRFATYS GAGPQGAATTTPPPAGPPPLAAPKNDLSWHDCTSRVYSNAGIPAAPGVKLECASYDTD LDPLVGGSTAVSIGVVRARSNQTPSDAGPLVFTTGSDLPSSTQLPVWLAHAGIDVLRS HPIVAVDRRGMGMSSPIDCRDHFDRDEMRDQAQFQAGDDPVANLSDISNTATTDCTDA IAPGESAYDNTHAASDIERLRKLWDVPALAFVGIGNGTQVALAYAASRPDNVARLILD SPIALGVSAEAAAEQQVQGQQAALDAFAAQCVAVNCALGSHPKGAVSALLSAARSGDG PGGASVAAVANAVATALGFPDSGRVDSTTKLADALAAARSGDMNLLSALINRADTTRD TDGQFISSCSDAVNRPTPDRVRELVVAWGKLYPQFGAVAALNLVKCVHWPSSSPPQPP KDLKVDVLLLGVQNDPIVGNEGVAATAATAINANAASKRVMWQGIGHGASIYSSCAVP PLVAYLDTGKLPDTDTYCPA" gene 2989291..2990592 /locus_tag="Rv2673" /db_xref="GeneID:887396" CDS 2989291..2990592 /locus_tag="Rv2673" /function="UNKNOWN" /note="Rv2673, (MTCY441.42), len: 433 aa. Possible conserved integral membrane protein, equivalent to MLC1351.25|ML1338 POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (440 aa), FASTA scores: opt: 2410, E(): 5.3e-143, (82.05% identity in 434 aa overlap); and showing some similarity with Q9CBX0|ML1504 PROBABLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (430 aa), FASTA scores: opt: 159, E(): 0.014, (24.4% identity in 340 aa overlap). Also similar to Q53873|SC6G4.11 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (411 aa), FASTA scores: opt: 383, E(): 1.4e-16, (29.6% identity in 422 aa overlap); and with weak similarity with P71061|YVFB HYPOTHETICAL PROTEIN from Bacillus subtilis (396 aa), FASTA scores: opt: 136, E(): 0.36, (24.35% identity in 279 aa overlap); and BAB60134|TVG1014811 HYPOTHETICAL PROTEIN from Thermoplasma volcanium (695 aa), FASTA scores: opt: 133, E(): 0.85, (26.45% identity in 280 aa overlap). Shows also some similarity with O06557|Rv1159|MTCI65.26 HYPOTHETICAL 47.1 KDA PROTEIN from Mycobacterium tuberculosis (431 aa), FASTA scores: opt: 149, E(): 0.059, (22.45% identity in 410 aa overlap); and O53515|Rv2181|MTV021.14 PUTATIVE MEMBRANE PROTEIN from Mycobacterium tuberculosis (427 aa), FASTA scores: opt: 129, E(): 1, (24.8% identity in 367 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217189.1" /db_xref="GI:15609810" /db_xref="GeneID:887396" /translation="MYGALVTAADSIRTGLGASLLAGFRPRTGAPSTATILRSALWPA AVLSVLHRSIVLTTNGNITDDFKPVYRAVLNFRRGWDIYNEHFDYVDPHYLYPPGGTL LMAPFGYLPFAPSRYLFISINTAAILVAAYLLLRMFNFTLTSVAAPALILAMFATETV TNTLVFTNINGCILLLEVLFLRWLLDGRASRQWCGGLAIGLTLVLKPLLGPLLLLPLL NRQWRALVAAVVVPVVVNVAALPLVSDPMSFFTRTLPYILGTRDYFNSSILGNGVYFG LPTWLILFLRILFTAITFGALWLLYRYYRTGDPLFWFTTSSGVLLLWSWLVMSLAQGY YSMMLFPFLMTVVLPNSVIRNWPAWLGVYGFMTLDRWLLFNWMRWGRALEYLKITYGW SLLLIVTFTVLYFRYLDAKADNRLDGGIDPAWLTPEREGQR" gene 2990706..2991116 /locus_tag="Rv2674" /db_xref="GeneID:886019" CDS 2990706..2991116 /locus_tag="Rv2674" /function="UNKNOWN" /note="Rv2674, (MTCY441.43), len: 136 aa. Conserved hypothetical protein, highly similar to various proteins e.g. Q9X828|SC9B1.08 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (135 aa), FASTA scores: opt: 653, E(): 1.8e-37, (71.1% identity in 128 aa overlap); O26807|MTH711 TRANSCRIPTIONAL REGULATOR from Methanothermobacter thermautotrophicus (151 aa), FASTA scores: opt: 533, E(): 2.7e-29, (58.15% identity in 129 aa overlap); Q9C5C8|AT4G21860 HYPOTHETICAL 22.0 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (202 aa), FASTA scores: opt: 490, E(): 2.8e-26, (54.05% identity in 124 aa overlap); P39903|YEAA_ECOLI|B1778|Z2817|ECS2487 HYPOTHETICAL PROTEIN from Escherichia coli strains K12 and O157:H7 (137 aa), FASTA scores: opt: 426, E(): 4.4e-22, (46.8% identity in 126 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217190.1" /db_xref="GI:15609811" /db_xref="GeneID:886019" /translation="MTRPKLELSDDEWRQKLTPQEFHVLRRAGTERPFTGEYTDTTTA GIYQCRACGAELFRSTEKFESHCGWPSFFDPKSSDAVTLRPDHSLGMTRTEVLCANCD SHLGHVFAGEGYPTPTDKRYCINSISLRLVPGSV" gene complement(2991184..2991936) /locus_tag="Rv2675c" /db_xref="GeneID:887760" CDS complement(2991184..2991936) /locus_tag="Rv2675c" /function="UNKNOWN" /note="Rv2675c, (MTCY441.44c), len: 250 aa. Conserved hypothetical protein. C-terminus highly similar to Q50010|U1764Z from Mycobacterium leprae (69 aa), FASTA scores: opt: 284, E(): 4.6e-11, (68.25% identity in 63 aa overlap). Shows some similarity with Q9P3V6|SPAC1348.04 (alias Q9P3E7|Q9P7U5) HYPOTHETICAL 16.6 KDA PROTEIN from Schizosaccharomyces pombe (Fission yeast) (145 aa), FASTA scores: opt: 203, E(): 9.5e-06, (33.05% identity in 118 aa overlap); Q9ZSZ7|BMCT METHYL CHLORIDE TRANSFERASE from Batis maritima (230 aa), FASTA scores: opt: 197, E(): 3.3e-05, (28.85% identity in 156 aa overlap); P72459|STSG METHYLTRANSFERASE from Streptomyces griseus (253 aa), FASTA scores: opt: 194, E(): 5.5e-05, (24.45% identity in 229 aa overlap); etc. Also similar to various proteins from Mycobacterium tuberculosis e.g. P71805|Rv1377c|MTCY02B12.11c HYPOTHETICAL 22.8 KDA PROTEIN (212 aa), FASTA scores: opt: 431, E(): 8.3e-20, (39.1% identity in 197 aa overlap); O06426|Rv0560c|MTCY25D10.39c HYPOTHETICAL 25.9 KDA PROTEIN (241 aa), FASTA scores: opt: 379, E(): 1.6e-16, (35.95% identity in 178 aa overlap); O69667|Rv3699|MTV025.047 PUTATIVE METHYLTRANSFERASE (233 aa), FASTA scores: opt: 297, E(): 2e-11, (30.55% identity in 193 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217191.1" /db_xref="GI:15609812" /db_xref="GeneID:887760" /translation="MTAQFDPADPTRFEEMYRDDRVAHGLPAATPWDIGGPQPVVQQL VALGAIRGEVLDPGTGPGHHAIYYAAKGYAATGIDGSVAAIERARDNARKAGVSVNFQ VGDATTLDGLDGRFDTVVDCAFYHTFSTAPELQRCYVRALRRASKPGARLYMFEFGEH NVNGFSMPRSLSEDDFRQVLPVGGWEITYLGTTTYQVNLSVEALELMAARNPDMADQV RCVLERFRAIKPWLVGGRVHAPFWEVHATRVD" gene complement(2991933..2992628) /locus_tag="Rv2676c" /db_xref="GeneID:887718" CDS complement(2991933..2992628) /locus_tag="Rv2676c" /function="UNKNOWN" /note="Rv2676c, (MTCY441.45c), len: 231 aa. Conserved hypothetical protein, equivalent to Q9CCB2|ML1045 (alias Q50009|U1764Y but longer 66 aa) HYPOTHETICAL PROTEIN from Mycobacterium leprae (231 aa), FASTA scores: opt: 1401, E(): 8.7e-88, (87.45% identity in 231 aa overlap). Also highly similar to O69830|SC1B5.02 HYPOTHETICAL 28.1 KDA PROTEIN from Streptomyces coelicolor (243 aa), FASTA scores: opt: 915, E(): 7.7e-55, (61.25% identity in 222 aa overlap); and similar to others e.g. Q9RUB0|DR1481 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (289 aa), FASTA scores: opt: 327, E(): 6.1e-15, (31.8% identity in 176 aa overlap); Q97WP2|SSO2169 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (223 aa), FASTA scores: opt: 285, E(): 3.4e-12, (31.3% identity in 163 aa overlap); BAB59947|TVG0805714 HYPOTHETICAL PROTEIN from Thermoplasma volcanium (223 aa), FASTA scores: opt: 206, E(): 7.7e-07, (25.0% identity in 176 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217192.1" /db_xref="GI:15609813" /db_xref="GeneID:887718" /translation="MARLDYDALNATLRYLMFSVFSVSPGALGDQRDAIIDDASTFFK QQEERGVVVRGLYDVAGLRADADFMVWTHAERVEALQATYADFRRTTTLGRACTPVWS GVGLHRPAEFNKSHIPAFLAGEEPGAYICVYPFVRSYEWYLLPDEERRRMLAEHGMAA RGYKDVRANTVPAFALGDYEWILAFEAPELDRIVDLMRELRATDARRHTRAETPFFTG PRVPVEQLVHSLP" gene complement(2992634..2993992) /gene="hemY" /locus_tag="Rv2677c" /db_xref="GeneID:887711" CDS complement(2992634..2993992) /gene="hemY" /locus_tag="Rv2677c" /EC_number="1.3.3.4" /function="INVOLVED IN HEME AND PORPHYRIN BIOSYNTHESIS (AT THE PENULTIMATE STEP). CATALYZES THE 6-ELECTRON OXIDATION OF PROTOPORPHYRINOGEN IX TO FORM PROTOPORPHYRIN IX [CATALYTIC ACTIVITY: PROTOPORPHYRINOGEN-IX + O(2) = PROTOPORPHYRIN-IX + H(2)O(2)]." /note="catalyzes the formation of protoporphyrin IX from protoporphyrinogen IX" /codon_start=1 /transl_table=11 /product="protoporphyrinogen oxidase" /protein_id="YP_177675.1" /db_xref="GI:57117012" /db_xref="GeneID:887711" /translation="MTPRSYCVVGGGISGLTSAYRLRQAVGDDATITLFEPADRLGGV LRTEHIGGQPMDLGAEAFVLRRPEMPALLAELGLSDRQLASTGARPLIYSQQRLHPLP PQTVVGIPSSAGSMAGLVDDATLARIDAEAARPFTWQVGSDPAVADLVADRFGDQVVA RSVDPLLSGVYAGSAATIGLRAAAPSVAAALDRGATSVTDAVRQALPPGSGGPVFGAL DGGYQVLLDGLVRRSRVHWVRARVVQLERGWVLRDETGGRWQADAVILAVPAPRLARL VDGIAPRTHAAARQIVSASSAVVALAVPGGTAFPHCSGVLVAGDESPHAKAITLSSRK WGQRGDVALLRLSFGRFGDEPALTASDDQLLAWAADDLVTVFGVAVDPVDVRVRRWIE AMPQYGPGHADVVAELRAGLPPTLAVAGSYLDGIGVPACVGAAGRAVTSVIEALDAQV AR" gene complement(2993989..2995062) /gene="hemE" /locus_tag="Rv2678c" /db_xref="GeneID:888934" CDS complement(2993989..2995062) /gene="hemE" /locus_tag="Rv2678c" /EC_number="4.1.1.37" /function="INVOLVED IN PORPHYRIN BIOSYNTHESIS [CATALYTIC ACTIVITY: UROPORPHYRINOGEN III = COPROPORPHYRINOGEN + 4 CO(2)]." /note="catalyzes the formation of coproporphyrinogen from uroporphyrinogen III" /codon_start=1 /transl_table=11 /product="uroporphyrinogen decarboxylase" /protein_id="NP_217194.1" /db_xref="GI:15609815" /db_xref="GeneID:888934" /translation="MSTRRDLPQSPYLAAVTGRKPSRVPVWFMRQAGRSLPEYRALRE RYSMLAACFEPDVACEITLQPIRRYDVDAAILFSDIVVPLRAAGVDLDIVADVGPVIA DPVRTAADVAAMKPLDPQAIQPVLVAASLLVAELGDVPLIGFAGAPFTLASYLVEGGP SRHHAHVKAMMLAEPASWHALMAKLTDLTIAFLVGQIDAGVDAIQVFDSWAGALSPID YRQYVLPHSARVFAALGEHGVPMTHFGVGTAELLGAMSEAVTAGERPGRGAVVGVDWR TPLTDAAARVVPGTALQGNLDPAVVLAGWPAVERAARAVVDDGRRAVDAGAAGHIFNL GHGVLPESDPAVLADLVSLVHSL" misc_feature complement(2994589..2994636) /gene="hemE" /locus_tag="Rv2678c" /note="PS00907 Uroporphyrinogen decarboxylase signature 2" gene 2995115..2995945 /gene="echA15" /locus_tag="Rv2679" /db_xref="GeneID:887693" CDS 2995115..2995945 /gene="echA15" /locus_tag="Rv2679" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_217195.1" /db_xref="GI:15609816" /db_xref="GeneID:887693" /translation="MPVTYDDFPSLRCEIHDQPGHEGVLELVLDSPGLNSVGPHMHRD LADIWPVIDRDPAVRVVLVRGEGKAFSSGGSFDLIAETIGDYQGRLRIMREARDLVLN LVNFDKPVVSAIRGPAVGAGLVVALLADISVAGRAAKIIDGHTKLGVAAGDHAAICWP LLVGMAKAKYYLLTCEPLSGEEAERIGLVSICVDDDDVLPTATRLAERLAAGAQNAIR WTKRSLNHWYRMFGPAFETSLGLEFIGFGGPDVREGLAAHREKRPARFGADPDPGAGS" misc_feature 2995445..2995507 /gene="echA15" /locus_tag="Rv2679" /note="PS00166 Enoyl-CoA hydratase/isomerase signature" repeat_region 2996003..2996053 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region 2996054..2996104 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 2996105..2996737 /locus_tag="Rv2680" /db_xref="GeneID:887734" CDS 2996105..2996737 /locus_tag="Rv2680" /function="UNKNOWN" /note="Rv2680, (MTV010.04), len: 210 aa. Conserved hypothetical protein, equivalent to Q50005|ML1041|U1764V HYPOTHETICAL PROTEIN from Mycobacterium leprae (196 aa), FASTA scores: opt: 1136, E(): 9.7e-66, (83.95% identity in 193 aa overlap). Also similar to O69860|SC1C3.18c HYPOTHETICAL 24.7 KDA PROTEIN from Streptomyces coelicolor (238 aa), FASTA scores: opt: 516, E(): 5.7e-26, (45.5% identity in 189 aa overlap); and similar in part to Q9I6V4|PA0178 PROBABLE TWO-COMPONENT SENSOR from Pseudomonas aeruginosa (639 aa), FASTA scores: opt: 120, E(): 3.1, (33.05% identity in 115 aa overlap); and a few other proteins. Equivalent to AAK47069 from Mycobacterium tuberculosis strain CDC1551 (178 aa) but longer 32 aa; and N-terminus highly similar to N-terminus of AAK48352|MT3984 HYPOTHETICAL 4.2 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (38 aa), FASTA scores: opt: 102, E(): 3.6, (62.05% identity in 29 aa overlap). TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217196.1" /db_xref="GI:15609817" /db_xref="GeneID:887734" /translation="MTSAGDDAERSDEEERRLTSAEPALFREAVAAMNAVTVRPEIEL GPIRPPQRLAPYSYALGAEIKHPELDVIPERSEGDAFGRLIMLYDPDGSDAWDGTIRL VAYVQADLDSSEAVDPLLPEVAWSWLVDALTARTDQVRALGGTVTATTSVRYGDISGP PRAHQLELRASWTATTPDLGAHVQAFCDVLEHAAGLPPAGVTDLGSRSRA" repeat_region 2996105..2996155 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 2996739..2998055 /locus_tag="Rv2681" /db_xref="GeneID:887708" CDS 2996739..2998055 /locus_tag="Rv2681" /function="UNKNOWN" /note="Rv2681, (MTCY05A6.02), len: 438 aa. Conserved hypothetical ala-rich protein, equivalent to Q50004|ML1040|U1764U HYPOTHETICAL PROTEIN from Mycobacterium leprae (429 aa), FASTA scores: opt: 2146, E(): 1.1e-119, (77.4% identity in 416 aa overlap). Also highly similar to O69858|SC1C3.16c HYPOTHETICAL 42.5 KDA PROTEIN from Streptomyces coelicolor (394 aa), FASTA scores: opt: 1336, E(): 9e-72, (51.6% identity in 405 aa overlap); and with some similarity to RIBONUCLEASES D e.g. Q983F2|MLL8354 from Rhizobium loti (Mesorhizobium loti) (383 aa), FASTA scores: opt: 379, E(): 3.9e-15, (31.6% identity in 323 aa overlap); Q9A7L8|CC1704 from Caulobacter crescentus (389 aa), FASTA scores: opt: 370, E(): 1.3e-14, (31.45% identity in 318 aa overlap); CAC45770 from Rhizobium meliloti (Sinorhizobium meliloti) (383 aa), FASTA scores: opt: 331, E(): 2.7e-12, (27.75% identity in 357 aa overlap); etc. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217197.1" /db_xref="GI:15609818" /db_xref="GeneID:887708" /translation="MCPEPSHAGAAESEGTESEPTPLLRPAGGIPDLCVTVGEIAAAA ELLDRGRGPFAVDAERASGFRYSGRAYLIQIRRAEAGTVLIDPVSHGGDPLTVLAPVA EVLSTNEWILHSADQDLPCLAEVGMRPPALYDTELAGRLAGFDRVNLAAMVERLLGLG LTKGHGAADWSKRPLPSAWLNYAALDVELLIELRAAISRVLAEQGKTDWAAQEFEHLR SFESRPPPAAARQDRWRRTSGIHKVHDRRGLAAVRELWTARDRIAQRRDIAPRRILPD SAIIDAAIADPKSVDDLVALPVFGGRNQRRSAAVWWAALAAARESPDPPEIAEPANGP PPPGRWVRRKPAAAARLDAARAALTEVSQRVRVPTENLVSPDLVRRLCWEWEDISQSS PDPIAAVEAYLRTGQARAWQLELVVPILTAALTGAPDAGAQGDDGS" gene complement(2998052..2999968) /gene="dxs1" /locus_tag="Rv2682c" /db_xref="GeneID:887461" CDS complement(2998052..2999968) /gene="dxs1" /locus_tag="Rv2682c" /EC_number="2.2.1.7" /function="INVOLVED IN THE DEOXYXYLULOSE-5-PHOSPHATE PATHWAY (DXP) OF ISOPRENOIDBIOSYNTHESIS (AT THE FIRST STEP), AND IN THE BIOSYNTHETIC PATHWAY TO THIAMINE AND PYRIDOXOL (AT THE FIRST STEP). CATALYZES THE ACYLOIN CONDENSATION REACTION BETWEEN ATOMS 2 AND 3 OF PYRUVATE AND GLYCERALDEHYDE 3-PHOSPHATE TO YIELD 1-DEOXY-D-XYLULOSE-5-PHOSPHATE (DXP)." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 1-deoxy-D-xylulose 5-phosphate from pyruvate and D-glyceraldehyde 3-phosphate" /codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose-5-phosphate synthase" /protein_id="YP_177898.1" /db_xref="GI:57117013" /db_xref="GeneID:887461" /translation="MLQQIRGPADLQHLSQAQLRELAAEIREFLIHKVAATGGHLGPN LGVVELTLALHRVFDSPHDPIIFDTGHQAYVHKMLTGRSQDFATLRKKGGLSGYPSRA ESEHDWVESSHASAALSYADGLAKAFELTGHRNRHVVAVVGDGALTGGMCWEALNNIA ASRRPVIIVVNDNGRSYAPTIGGVADHLATLRLQPAYEQALETGRDLVRAVPLVGGLW FRFLHSVKAGIKDSLSPQLLFTDLGLKYVGPVDGHDERAVEVALRSARRFGAPVIVHV VTRKGMGYPPAEADQAEQMHSTVPIDPATGQATKVAGPGWTATFSDALIGYAQKRRDI VAITAAMPGPTGLTAFGQRFPDRLFDVGIAEQHAMTSAAGLAMGGLHPVVAIYSTFLN RAFDQIMMDVALHKLPVTMVLDRAGITGSDGASHNGMWDLSMLGIVPGIRVAAPRDAT RLREELGEALDVDDGPTALRFPKGDVGEDISALERRGGVDVLAAPADGLNHDVLLVAI GAFAPMALAVAKRLHNQGIGVTVIDPRWVLPVSDGVRELAVQHKLLVTLEDNGVNGGA GSAVSAALRRAEIDVPCRDVGLPQEFYEHASRSEVLADLGLTDQDVARRITGWVAALG TGVCASDAIPEHLD" gene 3000112..3000609 /locus_tag="Rv2683" /db_xref="GeneID:887207" CDS 3000112..3000609 /locus_tag="Rv2683" /function="UNKNOWN" /note="Rv2683, (MTCY05A6.04), len: 165 aa. Conserved hypothetical protein, equivalent, but shorter 19 aa, to Q49999|ML1037|U1764Q HYPOTHETICAL PROTEIN from Mycobacterium leprae (184 aa), FASTA scores: opt: 750, E(): 1.2e-41, (73.8% identity in 164 aa overlap). Shows some similarity with other HYPOTHETICAL PROTEINS e.g. Q988S9|MLL6611 from Rhizobium loti (Mesorhizobium loti) (232 aa), FASTA scores: opt: 128, E(): 0.25, (25.5% identity in 149 aa overlap); Q9YFL5|APE0233 from Aeropyrum pernix (340 aa), FASTA scores: opt: 123, E(): 0.73, (29.1% identity in 141 aa overlap); BAB60477|TVG1377730 from Thermoplasma volcanium (174 aa), FASTA scores: opt: 118, E(): 0.86, (28.8% identity in 59 aa overlap); etc. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217199.1" /db_xref="GI:15609820" /db_xref="GeneID:887207" /translation="MKVNIDPTAPTFATYRRDMRAEQMAEDYPVVSIDSDALDAARML AEHRLPGLLVTAGAGKQYAVLPASQVVRFIVPRYVQDDPLLAGVLNESTADRCAERLS GKKVRDVLPDHLVEVPPANADDTIIEVAAVMARLRSPLLAVVKDGSLLGVVTASRLLA AALKT" gene 3000614..3001903 /gene="arsA" /locus_tag="Rv2684" /db_xref="GeneID:888366" CDS 3000614..3001903 /gene="arsA" /locus_tag="Rv2684" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF ARSENICAL COMPOUNDS ACROSS THE MEMBRANE (EXPORT): ARSENIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2684, (MTCY05A6.05), len: 429 aa. Probable arsA, arsenic-transport integral membrane protein, equivalent to P46838|AG45_MYCLE|ML1036 46 KDA PROBABLE INTEGRAL MEMBRANE PROTEIN (antigen 45, a transmembrane protein related to arsenical pumps) from Mycobacterium leprae (429 aa), FASTA scores: opt: 2067, E(): 9.9e-118, (74.05% identity in 428 aa overlap); and upstream orf O07187|YQ85_MYCTU|ARSB|Rv2685|MT2759|MTCY05A6.06 PROBABLE INTEGRAL MEMBRANE 45.2 KDA PROTEIN ARSB from Mycobacterium tuberculosis (428 aa), FASTA scores: opt: 2148, E(): 1.3e-122, (76.58% identity in 427 aa overlap). Also highly similar to other proteins e.g. Q9UY19|PAB1107 TRANSPORT PROTEIN from Pyrococcus abyssi (425 aa), FASTA scores: opt: 1109, E(): 8.3e-60, (41.45% identity in 427 aa overlap); O59575|PH1912 HYPOTHETICAL 46.0 KDA PROTEIN from Pyrococcus horikoshii (424 aa), FASTA scores: opt: 1101, E(): 2.5e-59, (41.95% identity in 429 aa overlap); Q9KDI2|BH1231 HYPOTHETICAL 46.0 KDA PROTEIN from Bacillus halodurans (428 aa), FASTA scores: opt: 1018, E(): 2.7e-54, (38.9% identity in 427 aa overlap); etc. BELONGS TO THE NADC/P/PHO87 FAMILY OF TRANSPORTERS, P SUBFAMILY (ARS FAMILY). TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="arsenic-transport integral membrane protein ArsA" /protein_id="NP_217200.1" /db_xref="GI:15609821" /db_xref="GeneID:888366" /translation="MSVVAVTIFVAAYVLIASDRVNKTMVALTGAAAVVVLPVITSHD IFYSHDTGIDWDVIFLLVGMMIIVGVLRQTGVFEYTAIWAAKRARGSPLRIMILLVLV SALASALLDNVTTVLLIAPVTLLVCDRLNINTTSFLMAEVFASNIGGAATLVGDPPNI IVASRAGLTFNDFMLHLTPLVVIVLIALIAVLPRLFGSITVEADRIADVMALDEGEAI RDRGLLVKCGAVLVLVFAAFVAHPVLHIQPSLVALLGAGMLIVVSGLTRSEYLSSVEW DTLLFFAGLFIMVGALVKTGVVNDLARAATQLTGGNIVATAFLILGVSAPISGIIDNI PYVATMTPLVAELVAVMGGQPSTDTPWWALALGADFGGNLTAIGASANVVMLGIARRA GAPISFWEFTRKGAVVTAVSIALAAIYLWLRYFVLLH" gene 3001983..3003269 /gene="arsB1" /locus_tag="Rv2685" /db_xref="GeneID:888150" CDS 3001983..3003269 /gene="arsB1" /locus_tag="Rv2685" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF ARSENICAL COMPOUNDS ACROSS THE MEMBRANE (EXPORT): ARSENIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2685, (MTCY05A6.06), len: 428 aa. Probable arsB1, arsenic-transport integral membrane protein, equivalent to P46838|AG45_MYCLE|ML1036 46 KDA PROBABLE INTEGRAL MEMBRANE PROTEIN (antigen 45, a transmembrane protein related to arsenical pumps) from Mycobacterium leprae (429 aa), FASTA scores: opt: 2048, E(): 7.3e-120, (74.25% identity in 427 aa overlap); and downstream ORF O07186|YQ84_MYCTU|ARSA|Rv2684|MT2758|MTCY05A6.05 PROBABLE INTEGRAL MEMBRANE PROTEIN ARSA from Mycobacterium tuberculosis (429 aa), FASTA scores: opt: 2154, E(): 1.9e-126, (76.8% identity in 427 aa overlap). Also highly similar to other proteins e.g. O59575|PH1912 HYPOTHETICAL 46.0 KDA PROTEIN from Pyrococcus horikoshii (424 aa), FASTA scores: opt: 1075, E(): 1.9e-59, (43.55% identity in 427 aa overlap); Q9UY19|PAB1107 TRANSPORT PROTEIN from Pyrococcus abyssi (425 aa), FASTA scores: opt: 1062, E(): 1.3e-58, (41.8% identity in 428 aa overlap); Q9KDI2|BH1231 HYPOTHETICAL 46.0 KDA PROTEIN from Bacillus halodurans (428 aa), FASTA scores: opt: 993, E(): 2.4e-54, (39.55% identity in 430 aa overlap); etc. BELONGS TO THE NADC/P/PHO87 FAMILY OF TRANSPORTERS, P SUBFAMILY. TBparse score is 0.881. Note that previously known as arsB.; arsB" /codon_start=1 /transl_table=11 /product="arsenic-transport integral membrane protein ArsB1" /protein_id="YP_177899.1" /db_xref="GI:57117014" /db_xref="GeneID:888150" /translation="MSIIAITVFVAGYALIASDRVSKTRVALTCAAIMVGAGIVGSDD VFYSHEAGIDWDVIFLLLGMMIIVSVLRHTGVFEYVAIWAVKRANAAPLRIMILLVLV TALGSALLDNVTTVLLIAPVTLLVCDRLGVNSTPFLVAEVFASNVGGAATLVGDPPNI IIASRAGLTFNDFLIHMAPAVLVVMIALIGLLPWLLGSVTAEPDRVADVLSLNEREAI HDRGLLIKCGVVLVLVFAAFIAHPVLHIQPSLVALLGAGVLVRFSGLERSDYLSSVEW DTLLFFAGLFVMVGALVKTGVVEQLARAATELTGGNELLTVGLILGISAPVSGIIDNI PYVATMTPIVTELVAAMPGHVHPDTFWWALALSADFGGNLTAVAASANVVMLGIARRS GTPISFWKFTRKGAVVTAVSLVLSAVYLWLRYFVFG" gene complement(3003280..3004038) /locus_tag="Rv2686c" /db_xref="GeneID:888360" CDS complement(3003280..3004038) /locus_tag="Rv2686c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNIDENTIFIED ANTIBIOTIC ACROSS THE MEMBRANE (EXPORT): ANTIBIOTIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2686c, (MTCY05A6.07c), len: 252 aa. Probable antibiotic-transport integral membrane leu-, ala-, val-rich protein ABC transporter (see citation below). The region from aa 115 to 160 is highly similar to N-terminus of Q49998|U1764P HYPOTHETICAL PROTEIN from Mycobacterium leprae (53 aa), FASTA scores: opt: 151, E(): 0.011, (58.15% identity in 43 aa overlap). Shows some similarity with membrane proteins e.g. AAK75541|SP1447 MEMBRANE PROTEIN from Streptococcus pneumoniae (298 aa), FASTA scores: opt: 139, E(): 0.21, (29.65% identity in 135 aa overlap); Q9K4C9|2SC6G5.26c PUTATIVE ABC TRANSPORTER INTEGRAL MEMBRANE SUBUNIT from Streptomyces coelicolor (249 aa), FASTA scores: opt: 138, E(): 0.21, (26.9% identity in 253 aa overlap); Q53627|MTRB MEMBRANE PROTEIN INVOLVED IN MITHRAMYCIN RESISTANCE from Streptomyces argillaceus (233 aa), FASTA scores: opt: 136, E(): 0.27, (26.7% identity in 191 aa overlap); etc. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="antibiotic ABC transporter transmembrane protein" /protein_id="NP_217202.1" /db_xref="GI:15609823" /db_xref="GeneID:888360" /translation="MRAISSLAGPRALAAFGRNDIRGTYRDPLLVMLVIAPVIWTTGV ALLTPLFTEMLARRYGFDLVGYYPLILTAFLLLTSIIVAGALAAFLVLDDVDAGTMTA LRVTPVPLSVFFGYRAATVMVVTTIYVVATMSCSGILEPGLVSSLIPIGLVAGLSAVV TLLLILAVANNKIQGLAMVRALGMLIAGLPCLPWFISSNWNLAFGVLPPYWAAKAFWV ASDHGTWWPYLVGGAVYNLAIVWVLFRRFRAKHA" gene complement(3004035..3004748) /locus_tag="Rv2687c" /db_xref="GeneID:888446" CDS complement(3004035..3004748) /locus_tag="Rv2687c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNIDENTIFIED ANTIBIOTIC ACROSS THE MEMBRANE (EXPORT): ANTIBIOTIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2687c, (MTCY05A6.08c), len: 237 aa. Probable antibiotic-transport integral membrane leu-, val-rich protein ABC transporter (see citation below), showing some similarity with two other hypothetical proteins, BAB59668|TVG0517148 from Thermoplasma volcanium (241 aa), FASTA scores: opt: 136, E(): 0.32, (23.1% identity in 208 aa overlap); and Q97U55|SSO3168 from Sulfolobus solfataricus (249 aa), FASTA scores: opt: 136, E(): 0.33, (25.15% identity in 195 aa overlap). Has some hydrophobic stretches and contains bacterial regulatory proteins, araC family signature (PS00041). TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="antibiotic ABC transporter transmembrane protein" /protein_id="NP_217203.1" /db_xref="GI:15609824" /db_xref="GeneID:888446" /translation="MTRLVPALRLELTLQVRQKFLHAAVFSGLIWLAVLLPMPVSLRP VAEPYVLVGDIAIIGFFFVGGTVFFEKQERTIGAIVSTPLRFWEYLAAKLTVLLAISL FVAVVVATIVHGLGYHLLPLVAGIVLGTLLMLLVGFSSSLPFASVTDWFLAAVIPLAI MLAPPVVHYSGLWPNPVLYLIPTQGPLLLLGAAFDQVSLAPWQVGYAVVYPIVCAAGL CRAAKALFGRYVVQRSGVL" misc_feature complement(3004353..3004472) /locus_tag="Rv2687c" /note="PS00041 Bacterial regulatory proteins, araC family signature" gene complement(3004745..3005650) /locus_tag="Rv2688c" /db_xref="GeneID:888463" CDS complement(3004745..3005650) /locus_tag="Rv2688c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNIDENTIFIED ANTIBIOTIC ACROSS THE MEMBRANE (EXPORT): ANTIBIOTIC RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv2688c, (MTCY05A6.09c), len: 301 aa. Probable antibiotic-transport ATP-binding protein ABC transporter (see citation below), highly similar to AAK47077|MT2762 ABC TRANSPORTER ATP-BINDING PROTEIN from Mycobacterium tuberculosis strain CDC1551 (317 aa), FASTA scores: opt: 1714, E(): 5.1e-93, (95.6% identity in 274 aa overlap). Also highly similar to other ATP-BINDING PROTEINS ABC TRANSPORTER e.g. Q9K639|BH3893 from Bacillus halodurans (282 aa), FASTA scores: opt: 644, E(): 1.4e-30, (38.% identity in 285 aa overlap); O58550|PH0820 from Pyrococcus horikoshii (312 aa), FASTA scores: opt: 574, E(): 1.8e-26, (39.1% identity in 307 aa overlap); Q9WYM0|TM0389 from Thermotoga maritima (301 aa), FASTA scores: opt: 536, E(): 2.9e-24, (36.1% identity in 291 aa overlap); etc. Has ATP/GTP-binding site motif A (P-loop) at N-terminus (PS00017). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.872." /codon_start=1 /transl_table=11 /product="antibiotic ABC transporter ATP-binding protein" /protein_id="NP_217204.1" /db_xref="GI:15609825" /db_xref="GeneID:888463" /translation="MTALNRAVASARVGTEVIRVRGLTFRYPKAAEPAVRGMEFTVGR GEIFGLLGPSGAGKSTTQKLLIGLLRDHGGQATVWDKEPAEWGPDYYERIGVSFELPN HYQKLTGYENLRFFASLYAGATADPMQLLAAVGLADDAHTLVGKYSKGMQMRLPFARS LINDPELLFLDEPTSGLDPVNARKIKDIIVDLKARGRTIFLTTHDMATADELCDRVAF VVDGRIVALDSPTELKIARSRRRVRVEYRGDGGGLETAEFGMDGLADDPAFHSVLRNH HVETIHSREASLDDVFVEVTGRQLT" misc_feature complement(3005474..3005497) /locus_tag="Rv2688c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3005845..3007062) /locus_tag="Rv2689c" /db_xref="GeneID:887219" CDS complement(3005845..3007062) /locus_tag="Rv2689c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2689c, (MTCY05A6.10c), len: 405 aa (other less probable starts possible). Conserved hypothetical ala-, val-, gly-rich protein, similar to O54099|SC10A5.06 HYPOTHETICAL 49.5 KDA PROTEIN from Streptomyces coelicolor (458 aa), FASTA scores: opt: 455, E(): 2.7e-20, (38.35% identity in 417 aa overlap); and shows weak similarity in part with several methyltransferases (EC 2.1.1.-) e.g. Q9X0H9|TM1094 PUTATIVE RNA METHYLTRANSFERASE from Thermotoga maritima (439 aa), FASTA scores: opt: 306, E(): 3e-11, (25.9% identity in 436 aa overlap); AK79403|CAC1435 S-ADENOSYLMETHIONINE-DEPENDENT METHYLTRANSFERASES from Clostridium acetobutylicum (456 aa), FASTA scores: opt: 294, E(): 1.6e-10, (23.4% identity in 449 aa overlap); Q9A8M7|CC1326 RNA METHYLTRANSFERASE from Caulobacter crescentus (415 aa), FASTA scores: opt: 247, E(): 1.1e-07, (28.4% identity in 433 aa overlap); etc. Equivalent to AAK47078 from Mycobacterium tuberculosis strain CDC1551 (434 aa) but shorter 29 aa. TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217205.1" /db_xref="GI:15609826" /db_xref="GeneID:887219" /translation="MTRAGDDAVNLTLVTGAPANGGSCVAHHEGRVVFVRYALPGERV RARVTAQRGSYWHAEAFEVIDPSPDRIGSLCSIAGADGAGCCDLAFAAPEAARTLKAQ VVANQLERLGRHSWQGEAQPLSDAGPTGWRIRVRLDVGADRRPGFHRYHSGELVTDLD CGQLPVGMLDGLVAADWPPEAQLYVALDDDGERHVVCSVRQGPRNRTRTVTNVVEGAY HAHQRVHRRSWRVPVTAFWQAHRDAAAVYSDLIADWAQPAPGMTAWDLYGGAGVFAAV LGEAVGESGRVLTVDTSRLASGAARAALVDLPQVEVVTGSVRRVLAVQPAGADLAVLD PPRSGAGREVVDLLAGAGVPRLIHIGCEAASFARDIGLYRGHGYAVEKIKVFDAFPLT HYVECVALLTRKV" repeat_region complement(3007063..3007115) /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(3007116..3007168) /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(3007169..3007221) /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene complement(3007236..3009209) /locus_tag="Rv2690c" /db_xref="GeneID:888011" CDS complement(3007236..3009209) /locus_tag="Rv2690c" /function="UNKNOWN" /note="Rv2690c, (MTCY05A6.11c), len: 657 aa. Probable conserved integral membrane ala-, val-, leu-rich protein, highly similar to others e.g. O54098|SC10A5.05 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (691 aa), FASTA scores: opt: 2007, E(): 1.6e-116, (62.35% identity in 669 aa overlap); O69917|SC3C8.04c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (644 aa), FASTA scores: opt: 923, E(): 1.7e-49, (35.3% identity in 669 aa overlap); AAK78253|CAC0272 AMINO ACID TRANSPORTER from Clostridium acetobutylicum (620 aa), FASTA scores: opt: 674, E(): 4.1e-34, (36.55% identity in 640 aa overlap); etc. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217206.1" /db_xref="GI:15609827" /db_xref="GeneID:888011" /translation="MSKLSTAARRLLIGRPFRSDRLSHTLLPKRIALPVFASDAMSSI AYAPEEIFLVLSVAGLAAYSMAPLIGLAVAAVLLVVVSSYRQNVHAYPSGGGDYEVVT TNLGATGGLVVASALMVDYVLTVAVSISSAASNIGSVSPFVYEHKVLFAVGAIVLIMA MNLRGVRESGLAFAIPTYAFIAGIGTMLVWGLFRIFVLGNPVRAESAAFEMHAEHGQI VGFALVFLVARSFSSGCAALTGVEAISNGVPAFQKPKSRNAATTLLMLGIIAVSMFMG MIVLAVETGVQVVDDPDTQLTGAPPGYQQKTLVAQLAQAVFGGFYLGFLLIAAVTALI LVLAANTAFNGFPVLGSVLAQHSYLPRQLHTRGDRLAFSNGILFLAAAAIGAVVAFRA ELTALIQLYIVGVFISFTMSQVGMVRHWTRLLSAETDPRARRAMLRSRAVNTVGFVST GTVLLIVLVTKFLAGAWIAIVAMGGFFMMMKLIHRHYDAVNRELAEQAEEAEITLPSR NHAVVLVSKLHLPTLRALTYARATRPDVLEAVTVNVDDAETRELVRQWQDSDVSVPLK VIASPYREITRPVLDYVKRVSKESPRTVVTVFIPEYVVGRWWEQLLHNQSALRLKGRL LFMPGVMVTSVPWQLTSSERIKTLQPHAAPGDT" gene 3009344..3010027 /gene="ceoB" /locus_tag="Rv2691" /db_xref="GeneID:887250" CDS 3009344..3010027 /gene="ceoB" /locus_tag="Rv2691" /function="PART OF A POTASSIUM TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv2691, (MTCY05A6.12), len: 227 aa. ceoB (alternate gene name: trkA), TRK system potassium uptake protein (see citation below), highly similar to others e.g. Q53949|TRKA_STRCO|SC2E9.17c from Streptomyces coelicolor (223 aa), FASTA scores: opt: 781, E(): 5.8e-42, (53.2% identity in 220 aa overlap); O27333|TRKA_METTH|MTH1265 from Methanobacterium thermoautotrophicum (216 aa), FASTA scores: opt: 287, E(): 5.3e-11, (27.0% identity in 211 aa overlap); O54141|SC2E9.16c from Streptomyces coelicolor (226 aa), FASTA scores: opt: 269, E(): 7.3e-10, (29.9% identity in 214 aa overlap); etc. Also similar to upstream orf O07194|CEOC|TRKA_MYCTU|TRKA|TRKB|Rv2692|MT2766|MTCY05A6.1 3 TRK SYSTEM POTASSIUM UPTAKE PROTEIN from Mycobacterium tuberculosis (220 aa), FASTA scores: opt: 259, E(): 3e-09, (26.55% identity in 226 aa overlap). Contains a motif common to NAD+ binding pockets (see citation below). BELONGS TO THE TRKA FAMILY.; trkA" /codon_start=1 /transl_table=11 /product="TRK system potassium uptake protein CEOB" /protein_id="YP_177900.1" /db_xref="GI:57117015" /db_xref="GeneID:887250" /translation="MRVVVMGCGRVGASVADGLSRIGHEVAIIDRDSAAFNRLSPQFA GERVLGQGFDRDVLLRAGIQGADAFAAVSSGDNSNIISARLARETFGVPRVVARIYDA KRAEVYERLGIPTITTVPWTTDRLLNALMQDTETAKWRDPTGTVAVAEVVLHEDWVGH RATDLEQATGARIAFLIRFGTGVLPEPKTVLQAGDKVYIAAISGRAAEAAAIAALPPS EDFESGARR" gene 3010024..3010686 /gene="ceoC" /locus_tag="Rv2692" /db_xref="GeneID:887493" CDS 3010024..3010686 /gene="ceoC" /locus_tag="Rv2692" /function="PART OF A POTASSIUM TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv2692, (MTCY05A6.13), len: 220 aa. ceoC (alternate gene names: trkA and trkB), TRK system potassium uptake protein (see citation below), highly similar to others e.g. O54141|SC2E9.16c from Streptomyces coelicolor (226 aa), FASTA scores: opt: 870, E(): 9.4e-48, (58.8% identity in 216 aa overlap); Q58505|TRKA_METJA|MJ1105 from Methanococcus jannaschii (218 aa), FASTA scores: opt: 361, E(): 9.7e-16, (29.8% identity in 218 aa overlap); O27333|TRKA_METTH|MTH1265 from Methanobacterium thermoautotrophicum (216 aa), FASTA scores: opt: 326, E(): 1.5e-13, (30.1% identity in 216 aa overlap); etc. Also similar to downstream orf O07193|CEOB|TRKA|Rv2691|MTCY05A6.12 TRK SYSTEM POTASSIUM UPTAKE PROTEIN from Mycobacterium tuberculosis (227 aa), FASTA scores: opt: 259, E(): 2.6e-09, (26.55% identity in 226 aa overlap). Contains a motif common to NAD+ binding pockets (see citation below). BELONGS TO THE TRKA FAMILY.; trkA; trkB" /codon_start=1 /transl_table=11 /product="TRK system potassium uptake protein CEOC" /protein_id="YP_177901.1" /db_xref="GI:57117016" /db_xref="GeneID:887493" /translation="MKVAVAGAGAVGRSVTRELVENGHDITLIERNPDHLDAAAIPEA HWRLGDACELSLLESIHLEEFDVVVAATGDDKVNVVLSLLAKTEFAVPRVVARVNDPR NEWLFNDAWGVDVAVSTPRMLASLIEEAVTIGDLVRLMEFRTGQANLVEITLPDNTPW GGKPVRKLQLPRDAALVTILRGPRVIVPEADEPLEGGDELLFVAVTEAEEELSRLLLP SM" gene complement(3010697..3011368) /locus_tag="Rv2693c" /db_xref="GeneID:887917" CDS complement(3010697..3011368) /locus_tag="Rv2693c" /function="UNKNOWN" /note="Rv2693c, (MTCY05A6.14c), len: 223 aa. Probable conserved integral membrane ala-, leu-rich protein, showing some similarity to O54140|SC2E9.15 HYPOTHETICAL 29.6 KDA PROTEIN from Streptomyces coelicolor (272 aa), FASTA scores: opt: 212, E(): 4.3e-06, (23.5% identity in 247 aa overlap). TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="integral membrane alanine and leucine rich protein" /protein_id="NP_217209.1" /db_xref="GI:15609830" /db_xref="GeneID:887917" /translation="MNANRTSAQRLLAQAGGVSGLVYSSLPVVTFVVASSAAGLLPAI GFALSMAGLILLWRLLRRESARPVVAGFCGVAVCALIAYLVGQSKGYFLLGIWMSLLW AVVFTLSILIRRPIVGYLWSWLSGRDRAWRDVSRAVFAFDVATLGWTLVFAARFIVQR HLYDADKTGWLGVARIGMGWPLTALAALATYAAIKAAQRAILASHDAAAVGGAAEFDA DAGRE" gene complement(3011399..3011767) /locus_tag="Rv2694c" /db_xref="GeneID:888911" CDS complement(3011399..3011767) /locus_tag="Rv2694c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2694c, (MTCY05A6.15c), len: 122 aa. Conserved hypothetical protein, highly similar in part to SC2E9.14 HYPOTHETICAL 16.9 KDA PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 299, E(): 1.9e-13, (41.05% identity in 117 aa overlap. Equivalent to AAK47083 from Mycobacterium tuberculosis strain CDC1551 (157 aa) but shorter 35 aa. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217210.1" /db_xref="GI:15609831" /db_xref="GeneID:888911" /translation="MGAQGYLRRLTRRLTEDLEQRDVEELSDEVLNAGAQRAIDCQRG QEVTVVGTLRSVETNGKGCSGGVRAELFDGSDTVTLVWLGQRRIPGIDTGRTLRVRGR LGKLENGTKAIYNPHYEIQR" gene 3011916..3012623 /locus_tag="Rv2695" /db_xref="GeneID:888223" CDS 3011916..3012623 /locus_tag="Rv2695" /function="UNKNOWN" /note="Rv2695, (MTCY05A6.16), len: 235 aa. Conserved hypothetical ala-rich protein, equivalent to Q49994|ML1030|U1764L HYPOTHETICAL PROTEIN from Mycobacterium leprae (232 aa), FASTA scores: opt: 1166, E(): 6.3e-63, (76.95% identity in 230 aa overlap). Also shows some similarity with other hypothetical proteins e.g. Q986S2|MLR7232 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (277 aa), FASTA scores: opt: 150, E(): 0.059, (33.55% identity in 173 aa overlap); CAC47772|SMC03810 HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (269 aa), FASTA scores: opt: 143, E(): 0.15, (28.05% identity in 228 aa overlap); Q9A5N6|CC2411 3-OXOADIPATE ENOL-LACTONE HYDROLASE/4-CARBOXYMUCONOLACTONE DECARBOXYLASE from Caulobacter crescentus (393 aa), FASTA scores: opt: 138, E(): 0.41, (26.45% identity in 238 aa overlap); etc. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217211.1" /db_xref="GI:15609832" /db_xref="GeneID:888223" /translation="MAVDLDGVTTVLLPGTGSDNDYVRRAFSAPLRRAGAVLVTPVPH PGRLIDGYRAALDDAARDGPVVVGGVSLGAAVAAAWALEHPDRAVAVLAALPAWTGEP ELAPAAQAARYTAARLRCDGLAATTTRMRASSPVWLAEELTRSWRVQWPELPDAMEEA AAYVAPSRAELARLVAPLAVAAAVDDPIHPLQVAADWVSVAPHAALRTVTLDEIGADA AALGSACLAALAEVSGA" misc_feature 3011952..3011969 /locus_tag="Rv2695" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene complement(3012829..3013608) /locus_tag="Rv2696c" /db_xref="GeneID:888482" CDS complement(3012829..3013608) /locus_tag="Rv2696c" /function="UNKNOWN" /note="Rv2696c, (MTCY05A6.17c), len: 259 aa. Conserved hypothetical ala-, gly-, val-rich protein, equivalent (but shorter 18 aa) to Q49993|ML1029|U1764K HYPOTHETICAL PROTEIN from Mycobacterium leprae (273 aa), FASTA scores: opt: 1174, E(): 2.1e-63, (70.6% identity in 262 aa overlap). Also similar to O54135|SC2E9.10 from Streptomyces coelicolor (250 aa), FASTA scores: opt: 213, E(): 9.8e-06, (28.25% identity in 255 aa overlap); and showing weak similarity with other proteins. TBparse score is 0.893." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217212.1" /db_xref="GI:15609833" /db_xref="GeneID:888482" /translation="MAFGRRTGKDGGKRKAGHAPVQPADEHVRPEDTVVASAAAASGV EDQEELQGPFDIDDFDDPSVAVLARLDLGSVLIPMPAAGQVQVELTESGVPSAVWVIT PNGRYSIAAYAAPKTGGLWREVAGELADSLRKDSAKVSIKDGPWGREVIGIAAGVVRF IGVDGYRWMIRCVVNGPQETVDALTEEAREALADTVVRRGDTPLPVRTPLPVHLPEPM AAQLREAAAAQADTQRQAAAGVARRGAQGSAMQQLRSTTGG" repeat_region complement(3013612..3013687) /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class I" gene complement(3013683..3014147) /gene="dut" /locus_tag="Rv2697c" /db_xref="GeneID:887290" CDS complement(3013683..3014147) /gene="dut" /locus_tag="Rv2697c" /EC_number="3.6.1.23" /function="INVOLVED IN BIOSYNTHESIS OF THYMIDYLATE. THIS ENZYME IS INVOLVED IN NUCLEOTIDE METABOLISM: IT PRODUCES DUMP, THE IMMEDIATE PRECURSOR OF THYMIDINE NUCLEOTIDES AND IT DECREASES THE INTRACELLULAR CONCENTRATION OF DUTP SO THAT URACIL CANNOT BE INCORPORATED INTO DNA [CATALYTIC ACTIVITY: DUTP + H(2)O = DUMP + PYROPHOSPHATE]." /note="catalyzes the formation of dUMP from dUTP" /codon_start=1 /transl_table=11 /product="deoxyuridine 5'-triphosphate nucleotidohydrolase" /protein_id="NP_217213.1" /db_xref="GI:15609834" /db_xref="GeneID:887290" /translation="MSTTLAIVRLDPGLPLPSRAHDGDAGVDLYSAEDVELAPGRRAL VRTGVAVAVPFGMVGLVHPRSGLATRVGLSIVNSPGTIDAGYRGEIKVALINLDPAAP IVVHRGDRIAQLLVQRVELVELVEVSSFDEAGLASTSRGDGGHGSSGGHASL" gene 3014173..3014658 /locus_tag="Rv2698" /db_xref="GeneID:888528" CDS 3014173..3014658 /locus_tag="Rv2698" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2698, (MTCY05A6.19), len: 161 aa. Probable conserved ala-rich transmembrane protein, equivalent to Q49991|ML1027|U1764I POSSIBLE MEMBRANE PROTEIN from Mycobacterium leprae (157 aa), FASTA scores: opt: 886, E(): 1.1e-49, (78.9% identity in 161 aa overlap). Also similar to O54132|SC2E9.07c HYPOTHETICAL 16.5 KDA PROTEIN from Streptomyces coelicolor (154 aa), FASTA scores: opt: 230, E(): 7.1e-08, (35.7% identity in 154 aa overlap)." /codon_start=1 /transl_table=11 /product="alanine rich transmembrane protein" /protein_id="NP_217214.1" /db_xref="GI:15609835" /db_xref="GeneID:888528" /translation="MSGTRLAPHSVRYRERLWVPWWWWPLAFALAALIAFEVNLGVAA LPDWVPFATLFTVAAGTLLWLGRVEIRVTAGSADGAGVKLWAGPAHLPVAVIARSAEI PATAKSAALGRQLDPAAYVLHRAWVGPMVLVVLDDPNDPTPYWLVSCRHPERVLSALR S" gene complement(3014663..3014965) /locus_tag="Rv2699c" /db_xref="GeneID:887218" CDS complement(3014663..3014965) /locus_tag="Rv2699c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2699c, (MTCY05A6.20c), len: 100 aa. Conserved hypothetical protein, very equivalent to Q49990|ML1026|U1764J HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 632, E(): 7.7e-36, (96.0% identity in 100 aa overlap). Also highly similar to O54130|SC2E9.05 HYPOTHETICAL 11.0 KDA PROTEIN from Streptomyces coelicolor (98 aa), FASTA scores: opt: 465, E(): 1.1e-24, (71.45% identity in 98 aa overlap). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217215.1" /db_xref="GI:15609836" /db_xref="GeneID:887218" /translation="MPTDYDAPRRTETDDVSEDSLEELKARRNEAASAVVDVDESESA ESFELPGADLSGEELSVRVVPKQADEFTCSSCFLVQHRSRLASEKNGVMICTDCAA" gene 3015203..3015853 /locus_tag="Rv2700" /db_xref="GeneID:887405" CDS 3015203..3015853 /locus_tag="Rv2700" /function="UNKNOWN" /note="Rv2700, (MTCY05A6.21), len: 216 aa. Possible secreted ala-rich protein, equivalent to Q4998|ML1025|U1764H POSSIBLE SECRETED PROTEIN from Mycobacterium leprae (216 aa), FASTA scores: opt: 1198, E(): 1.2e-65, (82.4% identity in 216 aa overlap). Also showing some similarity with Q9AK75|2SCD60.08c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (204 aa), FASTA scores: opt: 193, E(): 8.9e-05, (31.25% identity in 192 aa overlap). TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="secreted alanine rich protein" /protein_id="NP_217216.1" /db_xref="GI:15609837" /db_xref="GeneID:887405" /translation="MVAQITEGTAFDKHGRPFRRRNPRPAIVVVAFLVVVTCVMWTLA LTRPPDVREAAVCNPPPQPAGSAPTNLGEQVSRTDMTDVAPAKLSDTKVHVLNASGRG GQAADIAGALQDLGFAQPTAANDPIYAGTRLDCQGQIRFGTAGQATAAALWLVAPCTE LYHDSRADDSVDLALGTDFTTLAHNDDIDAVLANLRPGATEPSDPALLAKIHANSC" gene complement(3015863..3016735) /gene="suhB" /locus_tag="Rv2701c" /db_xref="GeneID:887210" CDS complement(3015863..3016735) /gene="suhB" /locus_tag="Rv2701c" /function="IN E. COLI, SUHB MUTATION (SUHB2) ENHANCES THE SYNTHESIS OF SIGMA(32) AND SUPPRESSES TEMPERATURE-SENSITIVE GROWTH OF THE RPOH15 MUTANT. MAY AFFECT SOME STEP(S) OF PROTEIN SYNTHESIS BY FACILITATING THE FUNCTION OF GROE OR OTHER HEAT SHOCK PROTEINS." /note="Rv2701c, (MTCY05A6.22c), len: 290 aa. Possible suhB, extragenic suppressor protein, equivalent to P46813|SUHB_MYCLE|SUHB|SSYA|ML1024 EXTRAGENIC SUPPRESSOR PROTEIN from Mycobacterium leprae (291 aa), FASTA scores: opt: 1424, E(): 4.9e-78, (77.55% identity in 294 aa overlap). Similar (except at N-terminus) to others e.g. O54128|SUHB from Streptomyces coelicolor (209 aa), FASTA scores: opt: 560, E(): 1.7e-26, (46.95% identity in 213 aa overlap); Q9CNV8|SUHB|PM0315 from Pasteurella multocida (267 aa), FASTA scores: opt: 479, E(): 1.5e-21, (39.3% identity in 234 aa overlap); P44333|SUHB_HAEIN|HI0937 from Haemophilus influenzae (267 aa), FASTA scores: opt: 438, E(): 4.1e-19, (34.7% identity in 248 aa overlap); P22783|SUHB_ECOLI|SSYA|B2533 from Escherichia coli strain K12 (267 aa), FASTA scores: opt: 419, E(): 5.7e-18, (34.45% identity in 267 aa overlap); etc. And also similar to putative myo-inositol-1(or 4)-monophosphatases e.g. Q9S1M1|SPCA from Streptoverticillium netropsis (Streptoverticillium flavopersicus) (266 aa), FASTA scores: opt: 556, E(): 3.6e-26, (45.4% identity in 240 aa overlap); Q9S3X5|SPCA from Streptomyces spectabilis (264 aa), FASTA scores: opt: 502, E(): 6.1e-23, (46.05% identity in 265 aa overlap); CAC47357 from Rhizobium meliloti (Sinorhizobium meliloti) (266 aa), FASTA scores: opt: 452, E(): 6e-20, (38.5% identity in 244 aa overlap); etc. Equivalent to AAK47090 from Mycobacterium tuberculosis strain CDC1551 (277 aa) but longer 13 aa. Contains PS00630 Inositol monophosphatase family signatures 1 and 2 (PS00629 and PS00630). BELONGS TO THE INOSITOL MONOPHOSPHATASE FAMILY. TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="extragenic suppressor protein SuhB" /protein_id="NP_217217.1" /db_xref="GI:15609838" /db_xref="GeneID:887210" /translation="MTRPDNEPARLRSVAENLAAEAAAFVRGRRAEVFGISRAGDGDG AVRAKSSPTDPVTVVDTDTERLLRDRLAQLRPGDPILGEEGGGPADVTATPSDRVTWV LDPIDGTVNFVYGIPAYAVSIGAQVGGITVAGAVADVAARTVYSAATGLGAHLTDERG RHVLRCTGVDELSMALLGTGFGYSVRCREKQAELLAHVVPLVRDVRRIGSAALDLCMV AAGRLDAYYEHGVQVWDCAAGALIAAEAGARVLLSTPRAGGAGLVVVAAAPGIADELL AALQRFNGLEPIPD" misc_feature complement(3015992..3016036) /gene="suhB" /locus_tag="Rv2701c" /note="PS00630 Inositol monophosphatase family signature 2" misc_feature complement(3016394..3016435) /gene="suhB" /locus_tag="Rv2701c" /note="PS00629 Inositol monophosphatase family signature 1" gene 3016858..3017655 /gene="ppgK" /locus_tag="Rv2702" /db_xref="GeneID:887313" CDS 3016858..3017655 /gene="ppgK" /locus_tag="Rv2702" /EC_number="2.7.1.63" /function="CATALYZES THE PHOSPHORYLATION OF GLUCOSE USING POLYPHOSPHATE OR ATP AS THE PHOSPHORYL DONOR. GTP, UTP AND CTP CAN REPLACE ATP AS PHOSPHORYL DONOR [CATALYTIC ACTIVITY: (PHOSPHATE)(N) + D-GLUCOSE = (PHOSPHATE)(N-1) + D-GLUCOSE 6-PHOSPHATE]." /note="Rv2702, (MTCY05A6.23), len: 265 aa. ppgK, polyphosphate glucokinase (EC 2.7.1.2) (see citations below), equivalent, but shorter 60 aa, to Q49988|PPGK_MYCLE|ML1023|U1764FG POLYPHOSPHATE GLUCOKINASE from Mycobacterium leprae (324 aa), FASTA scores: opt: 1411, E(): 5.6e-80, (82.8% identity in 262 aa overlap). Also highly similar (or just similar) to others e.g. Q9ADE8|PPGK from Streptomyces coelicolor (246 aa), FASTA scores: opt: 912, E(): 3e-49, (57.3% identity in 239 aa overlap); Q9AGV8|PPGK from Corynebacterium ammoniagenes (Brevibacterium ammoniagenes) (277 aa), FASTA scores: opt: 890, E(): 7.5e-48, (57.75% identity in 239 aa overlap); P40184|GLK_STRCO|SC6E10.20c from Streptomyces coelicolor (317 aa), FASTA scores: opt: 233, E(): 3.2e-07, (31.3% identity in 163 aa overlap); etc. TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="polyphosphate glucokinase PPGK (polyphosphate-glucose phosphotransferase)" /protein_id="NP_217218.1" /db_xref="GI:15609839" /db_xref="GeneID:887313" /translation="MTSTGPETSETPGATTQRHGFGIDVGGSGIKGGIVDLDTGQLIG DRIKLLTPQPATPLAVAKTIAEVVNGFGWRGPLGVTYPGVVTHGVVRTAANVDKSWIG TNARDTIGAELGGQQVTILNDADAAGLAETRYGAGKNNPGLVVLLTFGTGIGSAVIHN GTLIPNTEFGHLEVGGKEAEERAASSVKEKNDWTYPKWAKQVIRVLIAIENAIWPDLF IAGGGISRKADKWVPLLENRTPVVPAALQNTAGIVGAAMASVADTTH" gene 3017835..3019421 /gene="sigA" /locus_tag="Rv2703" /db_xref="GeneID:887477" CDS 3017835..3019421 /gene="sigA" /locus_tag="Rv2703" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED. THIS IS THE PRIMARY SIGMA-FACTOR OF THIS BACTERIA. SUPPOSED INVOLVED IN THE HOUSEKEEPING REGULONS." /experiment="experimental evidence, no additional details recorded" /note="sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor" /protein_id="NP_217219.1" /db_xref="GI:15609840" /db_xref="GeneID:887477" /translation="MAATKASTATDEPVKRTATKSPAASASGAKTGAKRTAAKSASGS PPAKRATKPAARSVKPASAPQDTTTSTIPKRKTRAAAKSAAAKAPSARGHATKPRAPK DAQHEAATDPEDALDSVEELDAEPDLDVEPGEDLDLDAADLNLDDLEDDVAPDADDDL DSGDDEDHEDLEAEAAVAPGQTADDDEEIAEPTEKDKASGDFVWDEDESEALRQARKD AELTASADSVRAYLKQIGKVALLNAEEEVELAKRIEAGLYATQLMTELSERGEKLPAA QRRDMMWICRDGDRAKNHLLEANLRLVVSLAKRYTGRGMAFLDLIQEGNLGLIRAVEK FDYTKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLGRIQRELLQDLGR EPTPEELAKEMDITPEKVLEIQQYAREPISLDQTIGDEGDSQLGDFIEDSEAVVAVDA VSFTLLQDQLQSVLDTLSEREAGVVRLRFGLTDGQPRTLDEIGQVYGVTRERIRQIES KTMSKLRHPSRSQVLRDYLD" misc_feature 3018789..3018830 /gene="sigA" /locus_tag="Rv2703" /note="PS00715 Sigma-70 factors family signature 1" gene 3019458..3019886 /locus_tag="Rv2704" /db_xref="GeneID:887675" CDS 3019458..3019886 /locus_tag="Rv2704" /function="UNKNOWN" /note="Rv2704, (MTCY05A6.25), len: 142 aa. Conserved hypothetical protein, highly similar (but shorter 25 aa) to Q9RYB7|DR0033 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (157 aa), FASTA scores: opt: 381, E(): 1.5e-17, (54.85% identity in 124 aa overlap); and highly similar to various proteins e.g. CAC47758|SMC03796 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (126 aa), FASTA scores: opt: 302, E(): 1.4e-12, (46.6% identity in 126 aa overlap); Q98E55|MLL4402 from Rhizobium loti (Mesorhizobium loti) (130 aa), FASTA scores: opt: 252, E(): 2.1e-09, (40.15% identity in 127 aa overlap); Q9K3V5|SCD10.21 PUTATIVE ACETYLTRANSFERASE from Streptomyces coelicolor (291 aa), FASTA scores: opt: 247, E(): 8.7e-09, (41.3% identity in 138 aa overlap) (homology only in N-terminal region); etc. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217220.1" /db_xref="GI:15609841" /db_xref="GeneID:887675" /translation="MSASRTMVSSGSEFESAVGYSRAVRIGPLVVVAGTTGSGDDIAA QTRDALRRIEIALGQAGATLADVVRTRIYVTDISRWREVGEVHAQAFGKIRPVTSMVE VTALIAPGLLVEIEADAYVGSAVADRNSGAGPKDPSPAGG" gene complement(3019814..3020203) /locus_tag="Rv2705c" /db_xref="GeneID:887302" CDS complement(3019814..3020203) /locus_tag="Rv2705c" /function="UNKNOWN" /note="Rv2705c, (MTCY05A6.26c), len: 129 aa (unlikely ORF). Conserved hypothetical protein, similar to others e.g. Q9RXR5|DR0242 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (112 aa), FASTA scores: opt: 259, E(): 9.4e-10, (40.5% identity in 116 aa overlap); CAC45122|SMC02246 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (115 aa), FASTA scores: opt: 208, E(): 1.6e-06, (38.3% identity in 107 aa overlap); Q98B88|MLL5682 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (116 aa), FASTA scores: opt: 173, E(): 0.00026, (34.95% identity in 103 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217221.1" /db_xref="GI:15609842" /db_xref="GeneID:887302" /translation="MRMTPDPAMLVHLCGVQEWSHARERGGIYPESDKTGYIHLSTLE QVHLPANRLYRGRADLVLLYIDPAALDSPVRWEPGVPTDPRSMLFPHLYGPLPVRAVI GAAAYPPAGDGSFGPAPEFRSATADPT" gene complement(3020200..3020457) /locus_tag="Rv2706c" /db_xref="GeneID:887484" CDS complement(3020200..3020457) /locus_tag="Rv2706c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2706c, (MTCY05A6.27c), len: 85 aa (unlikely ORF). Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217222.1" /db_xref="GI:15609843" /db_xref="GeneID:887484" /translation="MLVGVMLAEKKLGSGGQLGAHPSCSATAVAAVCSSQLRTGQSCV HGSPFSGIFTFSDVRGSRRVPRPLSGVSFLTTFAPANRAGW" gene 3020573..3021547 /locus_tag="Rv2707" /db_xref="GeneID:887750" CDS 3020573..3021547 /locus_tag="Rv2707" /function="UNKNOWN" /note="Rv2707, (MTCY05A6.28), len: 324 aa. Probable conserved transmembrane ala-, leu-rich protein, equivalent to Q49985|ML1017|U1764D POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (330 aa), FASTA scores: opt: 1617, E(): 2.5e-91, (75.4% identity in 325 aa overlap). Also similar to other membrane proteins e.g. Q9ADF6|SCBAC1A6.31 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (344 aa), FASTA scores: opt: 593, E(): 5.9e-29, (36.2% identity in 268 aa overlap); Q99SZ8|SA1699 HYPOTHETICAL PROTEIN (similar to transporter) from Staphylococcus aureus subsp. aureus N315 (405 aa), FASTA scores: opt: 318, E(): 3.7e-12, (27.9% identity in 265 aa overlap); O34437|YFKH HYPOTHETICAL PROTEIN (similar to transporter) from Bacillus subtilis (275 aa), FASTA scores: opt: 309, E(): 9.7e-12, (29.3% identity in 263 aa overlap); etc. TBparse score is 0.930." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217223.1" /db_xref="GI:15609844" /db_xref="GeneID:887750" /translation="MSDQVPKPHRHHIWRITRRTLSKSWDDSIFSESAQAAFWSALSL PPLLLGMLGSLAYVAPLFGPDTLPAIEKSALSTAHSFFSPSVVNEIIEPTIGDITNNA RGEVASLGFLISLWAGSSAISAFVDAVVEAHDQTPLRHPVRQRFFALFLYVVMLVFLV ATAPVMVVGPRKVSEHIPESLANLLRYGYYPALILGLTVGVILLYRVALPVPLPTHRL VLGAVLAIAVFLIATLGLRVYLAWITRTGYTYGALATPIAFLLFAFFGGFAIMLGAEL NAAVQEEWPAPATHAHRLGNWLKARIGVGTTTYSSTAQHSAVAAEPPS" gene complement(3021548..3021796) /locus_tag="Rv2708c" /db_xref="GeneID:887159" CDS complement(3021548..3021796) /locus_tag="Rv2708c" /function="UNKNOWN" /note="Rv2708c, (MTCY05A6.29), len: 82 aa. Conserved hypothetical protein, equivalent (but shorter 25 aa) to Q49984|ML1016|U1764C HYPOTHETICAL PROTEIN from Mycobacterium leprae (107 aa), FASTA scores: opt: 492, E(): 7.3e-27, (87.8% identity in 82 aa overlap). Also highly similar to Q9L1U7|SCE59.06c HYPOTHETICAL 10.4 KDA PROTEIN from Streptomyces coelicolor (97 aa), FASTA scores: opt: 200, E(): 4.4e-07, (51.6% identity in 62 aa overlap). TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217224.1" /db_xref="GI:15609845" /db_xref="GeneID:887159" /translation="MSGMQTQTIERTDADERVDDGTGSDTPKYFHYVKKDKIAESAVM GSHVVALCGEVFPVTRAPKPGSPVCPDCKRIYDTLKKG" gene 3021839..3022285 /locus_tag="Rv2709" /db_xref="GeneID:887282" CDS 3021839..3022285 /locus_tag="Rv2709" /function="UNKNOWN" /note="Rv2709, (MTCY05A6.30), len: 148 aa. Probable conserved transmembrane protein, equivalent to Q9CCB4|ML1015 (alias Q49983|U1764B but extended in N-terminus) POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (139 aa), FASTA scores: opt: 578, E(): 5.5e-31, (70.75% identity in 123 aa overlap). Shows also similarity with Q9RJ48|SCI8.05 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (159 aa), FASTA scores: opt: 119, E(): 0.57, (31.95% identity in 119 aa overlap). TBscore is 0.892." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217225.1" /db_xref="GI:15609846" /db_xref="GeneID:887282" /translation="MWDSRVMKHGLRLGFNGQFDDFDDFDDKGRPVLITAAAPSYEVE HRTRVRKYLTLMAFRVPALILAAIAYGAWHNGLISLLIVAASVPLPWMAVLIANDRPP RRADEPRRFDVARRRIPLFPTAERPALEPRRQPAERSAPRGFADHG" gene 3022461..3023432 /gene="sigB" /locus_tag="Rv2710" /db_xref="GeneID:888580" CDS 3022461..3023432 /gene="sigB" /locus_tag="Rv2710" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED. MAY CONTROL THE REGULONS OF STATIONARY PHASE AND GENERAL STRESS RESISTANCE. SEEMS TO BE REGULATED BY SIGH (Rv3223c PRODUCT) AND SIGE (Rv1221 PRODUCT). SEEMS TO REGULATE KATG|Rv1908c AND THE HEAT-SHOCK RESPONSE." /experiment="experimental evidence, no additional details recorded" /note="sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released; sigma factors in this cluster are active during stationary phase" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigB" /protein_id="NP_217226.1" /db_xref="GI:15609847" /db_xref="GeneID:888580" /translation="MADAPTRATTSRVDSDLDAQSPAADLVRVYLNGIGKTALLNAAG EVELAKRIEAGLYAEHLLETRKRLGENRKRDLAAVVRDGEAARRHLLEANLRLVVSLA KRYTGRGMPLLDLIQEGNLGLIRAMEKFDYTKGFKFSTYATWWIRQAITRGMADQSRT IRLPVHLVEQVNKLARIKREMHQHLGREATDEELAAESGIPIDKINDLLEHSRDPVSL DMPVGSEEEAPLGDFIEDAEAMSAENAVIAELLHTDIRSVLATLDEREHQVIRLRFGL DDGQPRTLDQIGKLFGLSRERVRQIERDVMSKLRHGERADRLRSYAS" misc_feature 3022800..3022841 /gene="sigB" /locus_tag="Rv2710" /note="PS00715 Sigma-70 factors family signature 1" gene 3023565..3024257 /gene="ideR" /locus_tag="Rv2711" /db_xref="GeneID:888590" CDS 3023565..3024257 /gene="ideR" /locus_tag="Rv2711" /function="TRANSCRIPTIONAL REGULATORY PROTEIN (REPRESSOR AND ACTIVATOR), IRON-BINDING REPRESSOR OF SIDEROPHORE BIOSYNTHESIS AND IRON UPTAKE. SEEMS TO REGULATE A VARIETY OF GENES ENCODING A VARIETY OF PROTEINS e.g. TRANSPORTERS, PROTEINS INVOLVED IN SIDEROPHORE SYNTHESIS AND IRON STORAGE, MEMBERS OF THE PE/PPE FAMILY, ENZYMES INVOLVED IN LIPID METABOLISM, TRANSCRIPTIONAL REGULATORY PROTEINS, ETC. ALSO ACTIVATOR OF BFRA|Rv1876 GENE." /experiment="experimental evidence, no additional details recorded" /note="Rv2711, (MTCY05A6.32), len: 230 aa. ideR (formerly known as dtxR), iron dependent repressor and activator (see citations below), equivalent to Q9CCB5|ML1013 IRON DEPENDENT REPRESSOR from Mycobacterium leprae (230 aa), FASTA scores: opt: 1365, E(): 3.8e-77, (90.0% identity in 230 aa overlap). Also highly similar to others e.g. Q50379|DTXR from Mycobacterium smegmatis (233 aa), FASTA scores: opt: 1291, E(): 1.4e-72, (86.1% identity in 230 aa overlap); Q9F7T3|IDER from Corynebacterium equii (Rhodococcus equi) (230 aa), FASTA scores: opt: 1130, E(): 1.2e-62, (74.8% identity in 230 aa overlap); P33120|DTXR_CORDI from Corynebacterium diphtheriae (226 aa), FASTA scores: opt: 803, E(): 1.6e-42, (57.85% identity in 230 aa overlap); etc. BELONGS TO THE FUR FAMILY. TBparse score is 0.876.; dtxR" /codon_start=1 /transl_table=11 /product="IRON-dependent repressor and activator IDER" /protein_id="NP_217227.1" /db_xref="GI:15609848" /db_xref="GeneID:888590" /translation="MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQT VSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEAC RWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLVELGVGPEPGADDANLVRLTELPAG SPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLP HEMAHAVKVEKV" misc_feature 3024210..3024239 /gene="ideR" /locus_tag="Rv2711" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature, [GSTALIVN].{2}HE[LIVMFYW][^DEHRKP]H.[LIV MFYWGSPQ], info count = 18.9" gene complement(3024270..3025328) /locus_tag="Rv2712c" /db_xref="GeneID:888586" CDS complement(3024270..3025328) /locus_tag="Rv2712c" /function="UNKNOWN" /note="Rv2712c, (MTCY05A6.33c), len: 352 aa. Hypothetical unknown ala-, leu-rich protein. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217228.1" /db_xref="GI:15609849" /db_xref="GeneID:888586" /translation="MTKYRGQFELNRPATLIAALPAILGFVPEKSLVLVSLAAGELGS VMRADLCDELADRVGHLAELVAAANPAAAIAVIVDANGAQCPRCNEEYRQLCAALAAA LSQRDIVLWAAHVVDRVAAGGRWHCVDGCGCSGVIDDPSASPLAMAAVLDGRQLYPRR SDLQAVIAVDDPVRSAELAVALGHQAADREIAHRADSVGCSRQDVENALAAAARVADG QSLSDTELARLGCALGDARVRDMLYALAVGENAGAAESLWALLARVLPEPWRVEALVL LAFSAYARGDGPLAGVSLQAALCCEPGHRMAGMLDTALQSGLRPEHIRDIAVTGYQRA EQLGIRLPPRRAFGQRAG" gene 3025441..3026847 /gene="sthA" /locus_tag="Rv2713" /db_xref="GeneID:887355" CDS 3025441..3026847 /gene="sthA" /locus_tag="Rv2713" /EC_number="1.6.1.1" /function="CONVERSION OF NADPH, GENERATED BY PERIPHERAL CATABOLIC PATHWAYS, TO NADH, WHICH CAN ENTER THE RESPIRATORY CHAIN FOR ENERGY GENERATION [CATALYTIC ACTIVITY: NADPH + NAD(+) = NADP(+) + NADH]." /note="catalyzes the conversion of NADPH to NADH" /codon_start=1 /transl_table=11 /product="soluble pyridine nucleotide transhydrogenase" /protein_id="NP_217229.1" /db_xref="GI:15609850" /db_xref="GeneID:887355" /translation="MREYDIVVIGSGPGGQKAAIASAKLGKSVAIVERGRMLGGVCVN TGTIPSKTLREAVLYLTGMNQRELYGASYRVKDRITPADLLARTQHVIGKEVDVVRNQ LMRNRVDLIVGHGRFIDPHTILVEDQARREKTTVTGDYIIIATGTRPARPSGVEFDEE RVLDSDGILDLKSLPSSMVVVGAGVIGIEYASMFAALGTKVTVVEKRDNMLDFCDPEV VEALKFHLRDLAVTFRFGEEVTAVDVGSAGTVTTLASGKQIPAETVMYSAGRQGQTDH LDLHNAGLEVQGRGRIFVDDRFQTKVDHIYAVGDVIGFPALAATSMEQGRLAAYHAFG EPTDGITELQPIGIYSIPEVSYVGATEVELTKSSIPYEVGVARYRELARGQIAGDSYG MLKLLVSTEDLKLLGVHIFGTSATEMVHIGQAVMGCGGSVEYLVDAVFNYPTFSEAYK NAALDVMNKMRALNQFRR" misc_feature 3025501..3025524 /gene="sthA" /locus_tag="Rv2713" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3027065..3028039 /locus_tag="Rv2714" /db_xref="GeneID:887653" CDS 3027065..3028039 /locus_tag="Rv2714" /function="UNKNOWN" /note="Rv2714, (MTCY05A6.35), len: 324 aa. Conserved hypothetical ala-, leu-rich protein, equivalent to Q49847|ML1009|B2235_F1_6 HYPOTHETICAL PROTEIN from Mycobacterium leprae (326 aa), FASTA scores: opt: 1881, E(): 5.8e-107, (89.7% identity in 320 aa overlap); and similar to Q49797|MLCB2533.03c|B2126_F1_36 HYPOTHETICAL PROTEIN from Mycobacterium leprae (317 aa), FASTA scores: opt: 376, E(): 1.2e-15, (30.1% identity in 279 aa overlap); and Q9CC38|ML1306 HYPOTHETICAL PROTEIN from Mycobacterium leprae (274 aa), FASTA scores: opt: 367, E(): 3.6e-15, (29.8% identity in 275 aa overlap). Also highly similar to Q9S2K6|SC7H2.11c HYPOTHETICAL 34.2 KDA PROTEIN from Streptomyces coelicolor (312 aa), FASTA scores: opt: 770, E(): 1.4e-39, (40.9% identity in 286 aa overlap); and similar to Q9ADA5|SCI52.04 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (333 aa), FASTA scores: opt: 386, E(): 3e-16, (29.05% identity in 296 aa overlap). Also similar to O33260|Rv2125|MTCY261.21 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (292 aa), FASTA scores: opt: 387, E(): 2.3e-16, (29.45% identity in 292 aa overlap). TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217230.1" /db_xref="GI:15609851" /db_xref="GeneID:887653" /translation="MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHAL EGFSDAGHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDDPE LSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGLGTVPMAVPHT RPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMAQHGHEVVGFTVHVPHYLT QTDYPAAAQALLEQVAKTGSLQLPLAVLAEAAAEVQAKIDEQVQASAEVAQVVAALER QYDAFIDAQENRSLLTRDEDLPSGDELGAEFERFLAQQAEKKSDDDPT" gene 3028098..3029123 /locus_tag="Rv2715" /db_xref="GeneID:887974" CDS 3028098..3029123 /locus_tag="Rv2715" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2715, (MTCY05A6.36), len: 341 aa. Possible hydrolase (EC 3.-.-.-), showing some similarity with other hydrolases e.g. Q9I5B0|PA0829 PROBABLE HYDROLASE from Pseudomonas aeruginosa (313 aa), FASTA scores: opt: 336, E(): 9.9e-14, (28.05% identity in 289 aa overlap); BAB55888 HYDROLASE (FRAGMENT) from Terrabacter sp. DBF63 (319 aa), FASTA scores: opt: 326, E(): 4.2e-13, (27.95% identity in 290 aa overlap); O52866|CEH|EH SOLUBLE EPOXIDE HYDROLASE from Corynebacterium SP (285 aa), FASTA scores: opt: 325, E(): 4.4e-13, (29.95% identity in 284 aa overlap); etc. Also shows some similarity to P96811|EPHF|Rv0134|MTCI5.08 HYPOTHETICAL 33.8 KDA PROTEINfrom Mycobacterium tuberculosis (300 aa), FASTA scores: E(): 1.8e-10, (27.7% identity in 271 aa overlap). Contains lipases, serine active site motif (PS00120). TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_217231.1" /db_xref="GI:15609852" /db_xref="GeneID:887974" /translation="MTERKRNLRPVRDVAPPTLQFRTVHGYRRAFRIAGSGPAILLIH GIGDNSTTWNGVHAKLAQRFTVIAPDLLGHGQSDKPRADYSVAAYANGMRDLLSVLDI ERVTIVGHSLGGGVAMQFAYQFPQLVDRLILVSAGGVTKDVNIVFRLASLPMGSEAMA LLRLPLVLPAVQIAGRIVGKAIGTTSLGHDLPNVLRILDDLPEPTASAAFGRTLRAVV DWRGQMVTMLDRCYLTEAIPVQIIWGTKDVVLPVRHAHMAHAAMPGSQLEIFEGSGHF PFHDDPARFIDIVERFMDTTEPAEYDQAALRALLRRGGGEATVTGSADTRVAVLNAIG SNERSAT" misc_feature 3028410..3028439 /locus_tag="Rv2715" /note="PS00120 Lipases, serine active site" gene 3029172..3029858 /locus_tag="Rv2716" /db_xref="GeneID:887775" CDS 3029172..3029858 /locus_tag="Rv2716" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2716, (MTCY05A6.37), len: 228 aa. Conserved hypothetical protein, similar to other proteins e.g. Q9RKR0|SCC75A.14 HYPOTHETICAL 23.3 KDA PROTEIN from Streptomyces coelicolor (214 aa), FASTA scores: opt: 447, E(): 4e-22, (44.1% identity in 220 aa overlap); Q9HHG6|PHZF|VNG6408G PHENAZINE BIOSYNTHETIC PROTEIN from Halobacterium sp. strain NRC-1 (299 aa), FASTA scores: opt: 201, E(): 6.1e-06, (30.4% identity in 148 aa overlap) (similarity only at N-terminus); P73125|SLR1019 HYPOTHETICAL 34.1 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (314 aa), FASTA scores: opt: 196, E(): 1.4e-05, (28.5% identity in 298 aa overlap); etc. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217232.1" /db_xref="GI:15609853" /db_xref="GeneID:887775" /translation="MAIEVSVLRVFTDSDGNFGNPLGVINASKVEHRDRQQLAAQSGY SETIFVDLPSPGSTTAHATIHTPRTEIPFAGHPTVGASWWLRERGTPINTLQVPAGIV QVSYHGDLTAISARSEWAPEFAIHDLDSLDALAAADPADFPDDIAHYLWTWTDRSAGS LRARMFAANLGVTEDEATGAAAIRITDYLSRDLTITQGKGSLIHTTWSPEGWVRVAGR VVSDGVAQLD" gene complement(3029867..3030361) /locus_tag="Rv2717c" /db_xref="GeneID:887284" CDS complement(3029867..3030361) /locus_tag="Rv2717c" /function="UNKNOWN" /note="Rv2717c, (MTCY05A6.38c), len: 164 aa. Conserved hypothetical protein, equivalent to Q9CCB8|ML1006 (alias Q49838 but shortened N-terminus) HYPOTHETICAL PROTEIN from Mycobacterium leprae (161 aa), FASTA scores: opt: 797, E(): 2.3e-46, (73.8% identity in 164 aa overlap). Also highly similar to other eukaryotic proteins e.g. O64527|YUP8H12R.14 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (166 aa), FASTA scores: opt: 393, E(): 2.3e-19, (42.4% identity in 158 aa overlap); Q9Y325 CGI-36 PROTEIN from Homo sapiens (Human) (165 aa), FASTA scores: opt: 294, E(): 9.5e-13, (33.95% identity in 159 aa overlap); etc. TBparse score is 0.937." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217233.1" /db_xref="GI:15609854" /db_xref="GeneID:887284" /translation="MTRDLAPALQALSPLLGSWAGRGAGKYPTIRPFEYLEEVVFAHV GKPFLTYTQQTRAVADGKPLHSETGYLRVCRPGCVELVLAHPSGITEIEVGTYSVTGD VIELELSTRADGSIGLAPTAKEVTALDRSYRIDGDELSYSLQMRAVGQPLQDHLAAVL HRQR" gene complement(3030413..3030877) /gene="nrdR" /locus_tag="Rv2718c" /db_xref="GeneID:887985" CDS complement(3030413..3030877) /gene="nrdR" /locus_tag="Rv2718c" /function="UNKNOWN" /note="Rv2718c, (MTCY05A6.39c), len: 154 aa. Conserved hypothetical protein, equivalent to Q49844|ML1005|U2235A|B2235_C2_209 HYPOTHETICAL 17.3 KDA PROTEIN from Mycobacterium leprae (154 aa), FASTA scores: opt: 937, E(): 1.5e-52, (92.7% identity in 151 aa overlap). Highly similar to O86848|NRDR_STRCL PUTATIVE REGULATORY PROTEIN from Streptomyces clavuligerus (172 aa), FASTA scores: opt: 750, E(): 1.1e-40, (73.65% identity in 148 aa overlap); O69980|SC4H2.25 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (182 aa), FASTA scores: opt: 725, E(): 4.6e-39, (73.1% identity in 145 aa overlap); Q9KPU0|VC2272 HYPOTHETICAL PROTEIN from Vibrio cholerae (156 aa), FASTA scores: opt: 462, E(): 1.8e-22, (47.3% identity in 148 aa overlap); etc. TBparse score is 0.933." /codon_start=1 /transl_table=11 /product="transcriptional regulator NrdR" /protein_id="NP_217234.1" /db_xref="GI:15609855" /db_xref="GeneID:887985" /translation="MHCPFCRHPDSRVIDSRETDEGQAIRRRRSCPECGRRFTTVETA VLAVVKRSGVTEPFSREKVISGVRRACQGRQVDDDALNLLAQQVEDSVRAAGSPEIPS HDVGLAILGPLRELDEVAYLRFASVYRSFSSADDFAREIEALRAHRNLSAHS" gene complement(3031040..3031537) /locus_tag="Rv2719c" /db_xref="GeneID:887959" CDS complement(3031040..3031537) /locus_tag="Rv2719c" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv2719c, (MTCY05A6.40c), len: 165 aa. Possible conserved membrane protein, equivalent to Q49846|ML1004|B2235_C3_243 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (164 aa), FASTA scores: opt: 486, E(): 4e-21, (55.2% identity in 163 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217235.1" /db_xref="GI:15609856" /db_xref="GeneID:887959" /translation="MTPVRPPHTPDPLNLRGPLDGPRWRRAEPAQSRRPGRSRPGGAP LRYHRTGVGMSRTGHGSRPVPPATTVGLALLAAAITLWLGLVAQFGQMITGGSADGSA DSTGRVPDRLAVVRVETGESLYDVAVRVAPNAPTRQVADRIRELNGLQTPALAVGQTL IAPVG" gene 3031845..3032498 /gene="lexA" /locus_tag="Rv2720" /db_xref="GeneID:888169" CDS 3031845..3032498 /gene="lexA" /locus_tag="Rv2720" /EC_number="3.4.21.88" /function="INVOLVED IN REGULATION OF NUCLEOTIDE EXCISION REPAIR AND SOS RESPONSE. REPRESSES A NUMBER OF GENES INVOLVED IN THE RESPONSE TO DNA DAMAGE (SOS RESPONSE), INCLUDING RECA AND LEXA. HAS BEEN SHOWN TO BIND TO THE 14 BP PALINDROMIC SEQUENCE 5'-CGAACNNNNGTTCG-3'. IN THE PRESENCE OF SINGLE-STRANDED DNA, RECA INTERACTS WITH LEXA CAUSING AN AUTOCATALYTIC CLEAVAGE WHICH DISRUPTS THE DNA-BINDING PART OF LEXA, LEADING TO DEREPRESSION OF THE SOS REGULON AND EVENTUALLY DNA REPAIR [CATALYTIC ACTIVITY: HYDROLYSIS OF ALA-|-GLY BOND IN REPRESSOR LEXA]." /experiment="experimental evidence, no additional details recorded" /note="Represses a number of genes involved in the response to DNA damage" /codon_start=1 /transl_table=11 /product="LexA repressor" /protein_id="NP_217236.1" /db_xref="GI:15609857" /db_xref="GeneID:888169" /translation="MLSADSALTERQRTILDVIRASVTSRGYPPSIREIGDAVGLTST SSVAHQLRTLERKGYLRRDPNRPRAVNVRGADDAALPPVTEVAGSDALPEPTFVPVLG RIAAGGPILAEEAVEDVFPLPRELVGEGTLFLLKVIGDSMVEAAICDGDWVVVRQQNV ADNGDIVAAMIDGEATVKTFKRAGGQVWLMPHNPAFDPIPGNDATVLGKVVTVIRKV" gene complement(3032520..3034619) /locus_tag="Rv2721c" /db_xref="GeneID:888291" CDS complement(3032520..3034619) /locus_tag="Rv2721c" /function="UNKNOWN" /note="Rv2721c, (MTCY05A6.42c, MTCY154.01c), len: 699 aa. Possible conserved transmembrane ala-, gly-rich protein, equivalent to Q49837|ML1002|U2235I POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (687 aa), FASTA scores: opt: 2703, E(): 6.6e-135, (60.3% identity in 713 aa overlap). Shows some similaity to Q01377|CSP1 PS1 PROTEIN PRECURSOR (SECRETED PROTEIN) from Corynebacterium glutamicum (Brevibacterium flavum) (657 aa), FASTA scores: opt: 276, E(): 3.8e-07, (29.4% identity in 272 aa overlap); and Q9KIJ0 Rv2721c-LIKE PROTEIN from Mycobacterium paratuberculosis (246 aa), FASTA scores: opt: 178, E(): 0.025, (37.5% identity in 120 aa overlap). TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217237.1" /db_xref="GI:15609858" /db_xref="GeneID:888291" /translation="MNGQRGQLSTLIGRTLLGLAATAVTAVLLAPTVAASPMGDAEDA MMAAWEKAGGDTSTLGVRKGDVYPIGDGFALDFAGGKMFFTPATGAKYLYGPLLDKYE SLGGAADSDLGFPTINEVPGLAGPDSRVSTFSAADNPVIFWTPEHGAFVVRGALNAAW DKLGSSGGVLGAPVGDETYDGEVTAQKFSGGEVSWNRATKEFTTVPAVLAEQLKGLQV AIDPSAAINMAWRAAGGAAGPLGAKKGGQYPIGGDGIAQDFVGGKVFFSPATGANAVE GEILAKYESLGGPVSSDLGFPIANETDGGFGPSSRIVRFSAADKPVIFWTPDHGAFVV RGAMVAAWDKLRGPNGKLGAPVGDQTVDGDVVSQKFTGGMISWNRAKNTFTTDPANLA PLLSGLQVSGQNQPSTSAMPPPGKKFTWHWWWLGAAALGVLLVVMVALVVFGLRRRRR GYDAAAYDDDRAGDVEYGTAADGDWPPDEDFGSEHFGFGDQFPPEPVAPDAGSTPRVS WPRGAGAAVGDAEHLPGEEGYGSDLLSGPSNVGVEEEDTDAVDTTPTPVVSQADLSEV GPDLIVPERVVPETFVPQAFVPEAVAPEAVPPDVHAADLADTGLPAAAVSAAEDRGGR HAAAEPPEPPSAGVRPAIHLPLEDPYQMPNGYPVKASVSFGLYYPPGSALYHDTLAEL WFASEEVAQVNGFIRAD" gene 3034635..3034883 /locus_tag="Rv2722" /db_xref="GeneID:888399" CDS 3034635..3034883 /locus_tag="Rv2722" /function="UNKNOWN" /note="Rv2722, (MTCY154.02), len: 82 aa. Conserved hypothetical protein, similar to Q9CCB9|ML1001 HYPOTHETICAL PROTEIN from Mycobacterium leprae (91 aa), FASTA scores: opt: 154, E(): 0.00053, (37.5% identity in 88 aa overlap). Equivalent to AAK47111 from Mycobacterium tuberculosis strain CDC1551 (94 aa) but shorter 12 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217238.1" /db_xref="GI:15609859" /db_xref="GeneID:888399" /translation="MPCLARQPVDLPPWAGPRCGPYCPRARITLLQRTTIAKSNRKYY ENGYPADVKLMPGHAAVVSNRAAARAGFALPCRKRQPD" gene 3034909..3036102 /locus_tag="Rv2723" /db_xref="GeneID:887768" CDS 3034909..3036102 /locus_tag="Rv2723" /function="UNKNOWN" /note="Rv2723, (MTCY154.03), len: 397 aa. Probable conserved integral membrane protein, highly similar to others e.g. Q9Z503|SCC54.23c PUTATIVE INTEGRAL MEMBRANE EXPORT PROTEIN from Streptomyces coelicolor (333 aa), FASTA scores: opt: 883, E(): 2.4e-48, (46.4% identity in 332 aa overlap); Q9RD18|SCM1.25c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (316 aa), FASTA scores: opt: 865, E(): 3.1e-47, (47.55% identity in 324 aa overlap); P96554|Y319_MYXXA INTEGRAL MEMBRANE PROTEIN (PROBABLE) from Myxococcus xanthus (319 aa), FASTA scores: opt: 626, E(): 3.4e-32, (34.65% identity in 323 aa overlap); P42601|YGJT_ECOLI|B3088 from Escherichia coli strain K12 INTEGRAL MEMBRANE PROTEIN (PROBABLE) (321 aa), FASTA scores: opt: 541, E(): 7.7e-27, (35.1% identity in 279 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217239.1" /db_xref="GI:15609860" /db_xref="GeneID:887768" /translation="MGASGLVWTLTIVLIAGLMLVDYVLHVRKTHVPTLRQAVIQSAT FVGIAILFGIAVVVFGGSELAVEYFACYLTDEALSVDNLFVFLVIISSFGVPRLAQQK VLLFGIAFALVTRTGFIFVGAALIENFNSAFYLFGLVLLVMAGNLARPTGLESRDAET LKRSVIIRLADRFLRTSQDYNGDRLFTVSNNKRMMTPLLLVMIAVGGTDILFAFDSIP ALFGLTQNVYLVFAATAFSLLGLRQLYFLIDGLLDRLVYLSYGLAVILGFIGVKLMLE ALHDNKIPFINGGKPVPTVEVSTTQSLTVIIIVLLITTAASFWSARGRAQNAMARARR YATAYLDLHYETESAERDKIFTALLAAERQINTLPTKYRMQPGQDDDLMTLLCRAHAA RDAHM" gene complement(3036131..3037291) /gene="fadE20" /locus_tag="Rv2724c" /db_xref="GeneID:887866" CDS complement(3036131..3037291) /gene="fadE20" /locus_tag="Rv2724c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv2724c, (MTCY154.04c), len: 386 aa. Probable fadE20, acyl-CoA dehydrogenase (EC 1.3.99.-), highly similar to many e.g. Q9X7Y2|SC6A5.36 from Streptomyces coelicolor (382 aa), FASTA scores: opt: 1583, E(): 6.9e-94, (62.7% identity in 378 aa overlap); Q9HVY0|PA4435 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 1468, E(): 1.6e-86, (57.65% identity in 380 aa overlap); Q9ABZ1|CC0079 from Caulobacter crescentus (391 aa), FASTA scores: opt: 1298, E(): 1.2e-75, (51.9% identity in 391 aa overlap); etc. Also similar to many other Mycobacterium tuberculosis proteins e.g. O06164|FADE19|Rv2500c|MTCY07A7.06c ACYL-CoA DEHYDROGENASE (394 aa) (34.3% identity in 382 aa overlap). Contains acyl-CoA dehydrogenases signature 2 (PS00073). BELONGS TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE20" /protein_id="NP_217240.1" /db_xref="GI:15609861" /db_xref="GeneID:887866" /translation="MGSATKYQRTLFEPEHELFRESYRAFLDRHVAPYHDEWEKTKIV DRGVWLEAGKQGFLGMAVPEEYGGGGNADFRYNTVITEETCAGRYSGIGFGLHNDIVA PYLLALATEEQKRRWFPNFCTGELITAIAMTEPGTGSDLQGITTRAVKHGDHYVLNGS KTFITNGINSDLVIVVAQTDPEKGAQGFSLLVVERGMAGFERGRQLDKIGLDAQDTAE LSFTDVAVPAENLLGQEGMGFIYLMQNLPQERISIAIMAAAGMESVLEQTLQYAKERK AFGRSIGSFQNSRFLLAELATEATVVRIMVDEFIKLHLAGKLTAEQAAMAKWYATEKQ VYLNDRCLQLHGGYGYMREYPVARAYLDSRVQTIYGGTTEIMKEIIGRGLGV" misc_feature complement(3036206..3036265) /gene="fadE20" /locus_tag="Rv2724c" /note="PS00073 Acyl-CoA dehydrogenases signature 2" gene complement(3037427..3038914) /gene="hflX" /locus_tag="Rv2725c" /db_xref="GeneID:888241" CDS complement(3037427..3038914) /gene="hflX" /locus_tag="Rv2725c" /EC_number="3.1.5.1" /function="POSSIBLY A PUTATIVE GTPase, MODULATING ACTIVITY OF HFLK AND HFLC PROTEINS." /note="Rv2725c, (MTCY154.05c), len: 495 aa. Probable hflX (hfl for high frequency of lysogenization), GTP-binding protein (EC 3.1.5.-),equivalent to Q9CCC0|ML0997 (alias Q49843|HFLX but longer) POSSIBLE ATP/GTP-BINDING PROTEIN from Mycobacterium leprae (488 aa), FASTA scores: opt: 2562, E(): 1.1e-133, (84.55% identity in 485 aa overlap). Also highly similar to many e.g. Q9XCC1 from Streptomyces fradiae (425 aa), FASTA scores: opt: 1280, E(): 3.2e-63, (57.7% identity in 423 aa overlap); P73965|HFLX|SLR1521 from Synechocystis sp. strain PCC 6803 (534 aa), FASTA scores: opt: 1028, E(): 2.8e-49, (44.7% identity in 414 aa overlap); P25519|HFLX_ECOLI|B4173 from Escherichia coli strain K12 (426 aa), FASTA scores: opt: 916, E(): 3.4e-43, (40.1% identity in 414 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="GTP-binding protein HflX" /protein_id="NP_217241.1" /db_xref="GI:15609862" /db_xref="GeneID:888241" /translation="MPANSDARPAATCHHRVLAMTYPDPPQTGLSDFTPSLGELALED RSALRRVAGLSTELADVSEVEYRQLRLERVVLVGVWTEGSAADNRASLAELAALAETA GSQVLEGLIQRRDKPDPSTYIGSGKAAELREVIVATGADTVICDGELSPAQLTALEKA VQVKVIDRTALILDIFAQHATSREGKAQVSLAQMEYMLPRLRGWGESMSRQAGGRAGG SGGGVGLRGPGETKIETDRRRIRERMAKLRRDIRAMKQVRDTQRSRRRHSDVPSIAIV GYTNAGKSSLLNALTGAGVLVQDALFATLEPTTRRAEFGDGRPVVLTDTVGFVRHLPT QLVEAFRSTLEEVVHADLLVHVVDGSDGHPLAQIDAVRQVISEVIADHDGDPPPELLV VNKVDVASDLMLAKLRHGLPGAVFVSARTGDGIDALRRRMAELVVPADTAVDVVIPYD RGDLVARVHADGRIQQAEHKPEGTRIKARVPEALAATLREFAPRA" misc_feature complement(3038063..3038086) /gene="hflX" /locus_tag="Rv2725c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3038931..3039800) /gene="dapF" /locus_tag="Rv2726c" /db_xref="GeneID:888614" CDS complement(3038931..3039800) /gene="dapF" /locus_tag="Rv2726c" /EC_number="5.1.1.7" /function="INVOLVED IN THE BIOSYNTHESIS OF LYSINE FROM ASPARTATE SEMIALDEHYDE (AT THE SIXTH STEP) [CATALYTIC ACTIVITY: LL-2,6-DIAMINOHEPTANEDIOATE = MESO-DIAMINOHEPTANEDIOATE]." /experiment="experimental evidence, no additional details recorded" /note="involved in lysine biosynthesis; DAP epimerase; produces DL-diaminopimelate from LL-diaminopimelate" /codon_start=1 /transl_table=11 /product="diaminopimelate epimerase" /protein_id="NP_217242.1" /db_xref="GI:15609863" /db_xref="GeneID:888614" /translation="MIFAKGHGTQNDFVLLPDVDAELVLTAARVAALCDRRKGLGADG VLRVTTAGAAQAVGVLDSLPEGVRVTDWYMDYRNADGSAAQMCGNGVRVFAHYLRASG LEVRDEFVVGSLAGPRPVTCHHVEAAYADVSVDMGKANRLGAGEAVVGGRRFHGLAVD VGNPHLACVDSQLTVDGLAALDVGAPVSFDGAQFPDGVNVEVLTAPVDGAVWMRVHER GVGETRSCGTGTVAAAVAALAAVGSPTGTLTVHVPGGEVVVTVTDATSFLRGPSVLVA RGDLADDWWNAMG" gene complement(3039825..3040769) /gene="miaA" /locus_tag="Rv2727c" /db_xref="GeneID:887242" CDS complement(3039825..3040769) /gene="miaA" /locus_tag="Rv2727c" /EC_number="2.5.1.75" /function="CATALYZES THE FIRST STEP IN THE BIOSYNTHESIS OF 2-METHYLTHIO-N6-(DELTA(2)-ISOPENTENYL)-ADENOSINE (MS[2]I[6]A]) ADJACENT TO THE ANTICODON OF SEVERAL TRNA SPECIES [CATALYTIC ACTIVITY: ISOPENTENYL DIPHOSPHATE + TRNA = PYROPHOSPHATE + TRNA CONTAINING 6-ISOPENTENYLADENOSINE]." /note="IPP transferase; isopentenyltransferase; involved in tRNA modification; in Escherichia coli this enzyme catalyzes the addition of a delta2-isopentenyl group from dimethylallyl diphosphate to the N6-nitrogen of adenosine adjacent to the anticodon of tRNA species that read codons starting with uracil; further tRNA modifications may occur; mutations in miaA result in defects in translation efficiency and fidelity" /codon_start=1 /transl_table=11 /product="tRNA delta(2)-isopentenylpyrophosphate transferase" /protein_id="NP_217243.1" /db_xref="GI:15609864" /db_xref="GeneID:887242" /translation="MRPLAIIGPTGAGKSQLALDVAARLGARVSVEIVNADAMQLYRG MDIGTAKLPVSERRGIPHHQLDVLDVTETATVARYQRAAAADIEAIAARGAVPVVVGG SMLYVQSLLDDWSFPATDPSVRARWERRLAEVGVDRLHAELARRDPAAAAAILPTDAR RTVRALEVVELTGQPFAASAPRIGAPRWDTVIVGLDCQTTILDERLARRTDLMFDQGL VEEVRTLLRNGLREGVTASRALGYAQVIAALDAGAGADMMRAAREQTYLGTRRYVRRQ RSWFRRDHRVHWLDAGVASSPDRARLVDDAVRLWRHVT" misc_feature complement(3040725..3040748) /gene="miaA" /locus_tag="Rv2727c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3040766..3041461) /locus_tag="Rv2728c" /db_xref="GeneID:888330" CDS complement(3040766..3041461) /locus_tag="Rv2728c" /function="UNKNOWN" /note="Rv2728c, (MTCY154.08c), len: 231 aa. Conserved hypothetical ala-rich protein, equivalent to Q49835|ML0994|B2235_C1_162 HYPOTHETICAL PROTEIN from Mycobacterium leprae (232 aa), FASTA scores: opt: 1037, E(): 1.2e-54, (68.55% identity in 232 aa overlap). Also similar to O69964|SC4H2.09 from Streptomyces coelicolor (237 aa), FASTA scores: opt: 300, E(): 7.7e-11, (32.8% identity in 241 aa overlap); and some similarity with other proteins e.g. Q14234|ELN ELASTIN from Homo sapiens (Human) (757 aa), FASTA scores: opt: 161, E(): 0.03, (30.6% identity in 242 aa overlap); P55488|Y4IE HYPOTHETICAL 15.4 KDA PROTEIN from Rhizobium sp. strain NGR234 (135 aa), FASTA scores: opt: 147, E(): 0.061, (34.95% identity in 123 aa overlap). Shows also some similarity with P71657|Rv1387|MTCY21B4.04 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (539 aa), FASTA scores: opt: 159, E(): 0.035, (34.8% identity in 135 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217244.1" /db_xref="GI:15609865" /db_xref="GeneID:888330" /translation="MLSAIGIVPSAPVLVPELAGAAAAELADLGAAVIAAASLLPKSW IAVGTGRADDVVRPTDVGTFAGFGADVRVGLAPQDGDGVAVPVELPLCALLTAWVRGQ ARPEARAQVHVYASDHGSDAAVARGRQLRADIDREPDPIGVLVVADGLNTLTPRAPGG YDPDGAGMQRALDDALASGDLAVLTRLPAQVLGRVAFQVLAGLAEPGPRSAKEFYRGA PHGVGYFAGVWQP" gene complement(3041570..3042475) /locus_tag="Rv2729c" /db_xref="GeneID:888333" CDS complement(3041570..3042475) /locus_tag="Rv2729c" /function="UNKNOWN" /note="Rv2729c, (MTCY154.09c), len: 301 aa. Probable conserved integral membrane ala-, val-, leu-rich protein, similar to P42459|YLEU_CORGL HYPOTHETICAL 29.6 KDA PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum)(270 aa), FASTA scores: opt: 365, E(): 4.7e-15, (30.75% identity in 221 aa overlap); and to other integral membrane proteins (principally from Streptomyces sp.) e.g. Q9EWZ8|2SCG38.21 from Streptomyces coelicolor (302 aa), FASTA scores: opt: 365, E(): 5.2e-15, (32.0% identity in 278 aa overlap); Q9S267|SCI30A.06 from Streptomyces coelicolor (297 aa), FASTA scores: opt: 356, E(): 1.8e-14, (31.5% identity in 289 aa overlap); AAK81278|CAC3346 from Clostridium acetobutylicum (472 aa), FASTA scores: opt: 154, E(): 0.038, (24.1% identity in 224 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217245.1" /db_xref="GI:15609866" /db_xref="GeneID:888333" /translation="MASVEFATILALGAALLAGIGYVTLQRSARQVTAEEYVGHFTLF HLSLRHALWWLGSLAAVASFTLQAIALTMGSVVLVQSLQATALLFALLIDARLTHHRC TPREWMWAVLLAGAVAVIVMSGNPAAGTTRAPFSTWAVVAVVVVPAVVLCVVGARIAS GSLSAVLLAVASSATLAVFTVLTKGVVTELGEGFATLIRTPALYAWILVLPIGLMLQQ SSLRVGALTASLPTITVARPVIASVLGITVLDEVLHTGRVALVALVAAVVVVVVATVA LARDEVAMMTVSAGELGAAGQLAVR" gene 3042542..3043018 /locus_tag="Rv2730" /db_xref="GeneID:888323" CDS 3042542..3043018 /locus_tag="Rv2730" /function="UNKNOWN" /note="Rv2730, (MTCY174.10), len: 158 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217246.1" /db_xref="GI:15609867" /db_xref="GeneID:888323" /translation="MMMNWRQTNITTKRCAQTRASSSASEFCGIFAAPGLMRNCHHGG SAPSAVGGSAVQLTVAYGPQRFHGRCASNSSVRPLTTGGSWTPTSISSTDGGKAQGHD THDRQISRRTVCQAASILASILLETVAGPGEGIGPTTSVPLRAADARHTREGLQGR" gene 3043026..3044378 /locus_tag="Rv2731" /db_xref="GeneID:888300" CDS 3043026..3044378 /locus_tag="Rv2731" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2731, (MTCY174.11), len: 450 aa. Conserved hypothetical ala-, arg-rich protein, highly similar in part to Q49849|B2235_F2_77 HYPOTHETICAL PROTEIN from Mycobacterium leprae (266 aa), FASTA scores: opt: 368, E(): 1e-10, (73.5% identity in 83 aa overlap); and Q9KXN9|SC9C5.35 HYPOTHETICAL 6.5 KDA PROTEIN (FRAGMENT) from Streptomyces coelicolor (58 aa), FASTA scores: opt: 214, E(): 0.00065, (51.7% identity in 58 aa overlap). Also similar to Q9L296|SCL2.01 HYPOTHETICAL 37.4 KDA PROTEIN (FRAGMENT) from Streptomyces coelicolor (328 aa), FASTA scores: opt: 843, E(): 3.7e-33, (45.95% identity in 296 aa overlap) (but N-terminus shorter); and shows some similarity with other proteins e.g. Q26938 KINETOPLAST-ASSOCIATED PROTEIN (KAP) from Trypanosoma cruzi (1052 aa), FASTA scores: opt: 223, E(): 0.0022, (30.3% identity in 297 aa overlap). Start site chosen by RBS and to avoid overlap, although there are several other possible start sites further upstream." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217247.1" /db_xref="GI:15609868" /db_xref="GeneID:888300" /translation="MTADEPRSDDSSGSAPQPAATPVPRPGPRPGPRPVPRPTSYPVG AHPPSDPHRFGRIDDDGTVWLVSASGERIVGSWQAGDPEAAFAHFGRRFDDLSTEIML MDERLASGTGDARKIKAHAIALAETLPTACVLGDVDALADRLTSIRDRAEVIAAADRS RREEHRAAQTARKEALAAEAEELAANATQWKVAGDRLRAILDEWKTISGVDRKVDDAL WKRYSTARDTFNRRRGSHFAELDRERSGVRQSKERLCERAEELSESTDWTATSAEFRK LLADWKAAGRASKDVDDALWRRFKAAQDSFFTARNAATAEKEAELRANADAKEALLAE AERLDTTNHEAARAALRSIAEKWDAIGKVSRERAAELERRLRAVEKKVREAGEADWSD PQARARAEQFRARAEQFEHQAEKAAAAGRTKEADEAKANAEQWRQWAEAAADALTRRP" gene complement(3044375..3044989) /locus_tag="Rv2732c" /db_xref="GeneID:888319" CDS complement(3044375..3044989) /locus_tag="Rv2732c" /function="UNKNOWN" /note="Rv2732c, (MTCY174.12c), len: 204 aa. Probable conserved transmembrane protein, similar to Q49834 hypothetical protein B2235_C1_155 from Mycobacterium leprae (209 aa), FASTA scores: opt: 932, E(): 0, (70.6% identity in 201 aa overlap). Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217248.1" /db_xref="GI:15609869" /db_xref="GeneID:888319" /translation="MMSHEHDAGDLDALRAEIEAAERRVAREIEPGARALVVAILVFV LLGSFILPHTGSVRGWDVLFSSHGAGRAAVALPSRVFAWLALVFGVGFSMLALLTRRW ALAWVALAGSAMASGTGLLAVWSRQTVAAGHPGPGIGLIVAWITAIVLTFHWAQVVWS RTIVQLAAEERRRRVVAQQQCKTLLDHVQTDSEAGTTPDRGTDR" misc_feature complement(3044822..3044839) /locus_tag="Rv2732c" /note="PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide" gene complement(3044986..3046524) /locus_tag="Rv2733c" /db_xref="GeneID:888266" CDS complement(3044986..3046524) /locus_tag="Rv2733c" /function="UNKNOWN" /note="catalyzes the formation of 2-methylthio-N6-(dimethylallyl)adenosine (ms(2)i(6)A) at position 37 in tRNAs that read codons beginning with uridine from N6-(dimethylallyl)adenosine (i(6)A)" /codon_start=1 /transl_table=11 /product="(dimethylallyl)adenosine tRNA methylthiotransferase" /protein_id="NP_217249.1" /db_xref="GI:15609870" /db_xref="GeneID:888266" /translation="MVAHDAAAGVTGEGAGPPVRRAPARTYQVRTYGCQMNVHDSERL AGLLEAAGYRRATDGSEADVVVFNTCAVRENADNRLYGNLSHLAPRKRANPDMQIAVG GCLAQKDRDAVLRRAPWVDVVFGTHNIGSLPTLLERARHNKVAQVEIAEALQQFPSSL PSSRESAYAAWVSISVGCNNSCTFCIVPSLRGREVDRSPADILAEVRSLVNDGVLEVT LLGQNVNAYGVSFADPALPRNRGAFAELLRACGDIDGLERVRFTSPHPAEFTDDVIEA MAQTRNVCPALHMPLQSGSDRILRAMRRSYRAERYLGIIERVRAAIPHAAITTDLIVG FPGETEEDFAATLDVVRRARFAAAFTFQYSKRPGTPAAQLDGQLPKAVVQERYERLIA LQEQISLEANRALVGQAVEVLVATGEGRKDTVTARMSGRARDGRLVHFTAGQPRVRPG DVITTKVTEAAPHHLIADAGVLTHRRTRAGDAHTAGQPGRAVGLGMPGVGLPVSAAKP GGCR" gene 3046821..3047675 /locus_tag="Rv2734" /db_xref="GeneID:888303" CDS 3046821..3047675 /locus_tag="Rv2734" /function="UNKNOWN" /note="Rv2734, (MTCY154.14), len: 284 aa. Conserved hypothetical protein, highly similar to various proteins e.g. Q984J2|MLR7981 ABC TRANSPORTER ATP-BINDING PROTEIN from Rhizobium loti (Mesorhizobium loti) (286 aa), FASTA scores: opt: 877, E(): 9e-50, (52.45% identity in 246 aa overlap) (N-terminus longer); Q98DH1|MLL4707 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (249 aa), FASTA scores: opt: 829, E(): 1.1e-46, (50.4% identity in 244 aa overlap); AAK65865|SMA2239 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (259 aa), FASTA scores: opt: 796, E(): 1.5e-44, (50.0% identity in 252 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217250.1" /db_xref="GI:15609871" /db_xref="GeneID:888303" /translation="MSDRSAIEWTGATWNPVTGCDRVSPGCDHCYAMTLAKRLKAMGS DKYQTDGDPRTSGPGFGVTIHPRSLDEPFRWRSPRTVFVNSMADLFHARVALWFIREV FEVMRATPQHTYQILTKRSLRLRRLAHKLEWPSNVWMGVSVENVDAFRRIEDLRQVPA AVRFLSCEPLLGPLDGINLGSIDWVIAGGESGPNFRPIDPQWVRHIRDTCTAADVPFF FKQWGGRTPKAFGRELDGRCWDEMPLIEIRNPDPRTTSRVHADPMLATAPTESAQRSN PGQLVRQR" gene complement(3047560..3048552) /locus_tag="Rv2735c" /db_xref="GeneID:888389" CDS complement(3047560..3048552) /locus_tag="Rv2735c" /function="UNKNOWN" /note="Rv2735c, (MTCY154.15c), len 330 aa. Conserved hypothetical protein, showing some similarity with Q98DH2|MLR4706 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (302 aa), FASTA scores: opt: 140, E(): 0.062, (27.0% identity in 200 aa overlap); and Q9PHA1|XF0043 HYPOTHETICAL PROTEIN from Xylella fastidiosa (293 aa), FASTA scores: opt: 120, E(): 1.2, (30.75% identity in 117 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217251.1" /db_xref="GI:15609872" /db_xref="GeneID:888389" /translation="MAREWSYWTRNKLEILAGYLPAFNRASQTSRERIYLDLMAGQPE NIDRDMGEKFDGSSLIAMKADPPFTRLRFCELNPLASELDVALRTRFPGDGRYRVVAG DSNVTIDETLAELGPWRWAPTFAFIDQQAAEVHWETINKVAAFRQNPRNLKTELWMLM SPTMIARGVKGTNAELFIEQVTRMYGDADWKRIQAARWRHHLTAPAYRAEMVNLMRVK LEYELGYKYSHRIPMQMHNKVTIFDMVFATDHWAGDAIMCHLYNRAAQKEPEMMRQAK SAKQQKESEDRGEMGLFSVGELAVQDSNAGQILWAPSPTWDPRARGWWSEDPGF" gene complement(3048562..3049086) /gene="recX" /locus_tag="Rv2736c" /db_xref="GeneID:888393" CDS complement(3048562..3049086) /gene="recX" /locus_tag="Rv2736c" /function="MAY PLAY A REGULATORY ROLE POSSIBLY BY INTERACTING WITH RECA, THE PRODUCT OF THE UPSTREAM ORF." /experiment="experimental evidence, no additional details recorded" /note="binds RecA and inhibits RecA-mediated DNA strand exchange and ATP hydrolysis and coprotease activities" /codon_start=1 /transl_table=11 /product="recombination regulator RecX" /protein_id="NP_217252.1" /db_xref="GI:15609873" /db_xref="GeneID:888393" /translation="MTVSCPPPSTSEREEQARALCLRLLTARSRTRAELAGQLAKRGY PEDIGNRVLDRLAAVGLVDDTDFAEQWVQSRRANAAKSKRALAAELHAKGVDDDVITT VLGGIDAGAERGRAEKLVRARLRREVLIDDGTDEARVSRRLVAMLARRGYGQTLACEV VIAELAAERERRRV" gene complement(3049052..3051424) /gene="recA" /locus_tag="Rv2737c" /db_xref="GeneID:888371" CDS complement(3049052..3051424) /gene="recA" /locus_tag="Rv2737c" /EC_number="3.1.-.-" /function="INVOLVED IN REGULATION OF NUCLEOTIDE EXCISION REPAIR, IN GENETIC RECOMBINATION, AND IN INDUCTION OF THE SOS RESPONSE. ENDONUCLEASE WHICH CAN CATALYZE THE HYDROLYSIS OF ATP IN THE PRESENCE OF SINGLE-STRANDED DNA, THE ATP-DEPENDENT UPTAKE OF SINGLE-STRANDED DNA BY DUPLEX DNA, AND THE ATP-DEPENDENT HYBRIDIZATION OF HOMOLOGOUS SINGLE-STRANDED DNAS. IT INTERACTS WITH LEXA|Rv2720 CAUSING ITS ACTIVATION AND LEADING TO ITS AUTOCATALYTIC CLEAVAGE." /experiment="experimental evidence, no additional details recorded" /note="these RecA proteins contain inteins; catalyzes the hydrolysis of ATP in the presence of single-stranded DNA, the ATP-dependent uptake of single-stranded DNA by duplex DNA, and the ATP-dependent hybridization of homologous single-stranded DNAs" /codon_start=1 /transl_table=11 /product="DNA recombination protein RecA" /protein_id="NP_217253.1" /db_xref="GI:15609874" /db_xref="GeneID:888371" /translation="MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTG SIALDVALGIGGLPRGRVIEIYGPESSGKTTVALHAVANAQAAGGVAAFIDAEHALDP DYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDIVVIDSVAALVPRAELEGEM GDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLRDKIGVMFGSPETTTGGKALKF YASVRMDVRRVETLKDGTNAVGNRTRVKVVKNKCLAEGTRIFDPVTGTTHRIEDVVDG RKPIHVVAAAKDGTLHARPVVSWFDQGTRDVIGLRIAGGAIVWATPDHKVLTEYGWRA AGELRKGDRVAQPRRFDGFGDSAPIPADHARLLGYLIGDGRDGWVGGKTPINFINVQR ALIDDVTRIAATLGCAAHPQGRISLAIAHRPGERNGVADLCQQAGIYGKLAWEKTIPN WFFEPDIAADIVGNLLFGLFESDGWVSREQTGALRVGYTTTSEQLAHQIHWLLLRFGV GSTVRDYDPTQKRPSIVNGRRIQSKRQVFEVRISGMDNVTAFAESVPMWGPRGAALIQ AIPEATQGRRRGSQATYLAAEMTDAVLNYLDERGVTAQEAAAMIGVASGDPRGGMKQV LGASRLRRDRVQALADALDDKFLHDMLAEELRYSVIREVLPTRRARTFDLEVEELHTL VAEGVVVHNCSPPFKQAEFDILYGKGISREGSLIDMGVDQGLIRKSGAWFTYEGEQLG QGKENARNFLVENADVADEIEKKIKEKLGIGAVVTDDPSNDGVLPAPVDF" misc_feature complement(3049349..3049366) /gene="recA" /locus_tag="Rv2737c" /note="PS00881 Protein splicing signature" misc_feature complement(3050756..3050782) /gene="recA" /locus_tag="Rv2737c" /note="PS00321 recA signature" misc_feature complement(3051203..3051226) /gene="recA" /locus_tag="Rv2737c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3051619..3051792 /locus_tag="Rv2737A" /db_xref="GeneID:3205037" CDS 3051619..3051792 /locus_tag="Rv2737A" /function="UNKNOWN" /note="Rv2737A, len 57 aa. Conserved hypothetical cys-rich protein (possibly gene fragment), similar to central part of AJ243803_1|glgA from Streptomyces coelicolor glgA (181 aa), FASTA scores: opt: 210, E(): 6.1e-09, (59.25% identity in 54 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177676.1" /db_xref="GI:57117017" /db_xref="GeneID:3205037" /translation="MRPDLRARLVRITDDLLNTASLAGSGVLTGPDLTFRRRSCCLFY RVPAGGKCGDCPL" gene complement(3051806..3052012) /locus_tag="Rv2738c" /db_xref="GeneID:888368" CDS complement(3051806..3052012) /locus_tag="Rv2738c" /function="UNKNOWN" /note="Rv2738c, (MTV002.03c), len: 68 aa. Conserved hypothetical protein, equivalent to Q9CCC1|ML0986 HYPOTHETICAL PROTEIN from Mycobacterium leprae (67 aa), FASTA scores: opt: 397, E(): 3.7e-22, (83.6% identity in 67 aa overlap). Also highly similar to O50484|SC4H8.05 HYPOTHETICAL 7.5 KDA PROTEIN from Streptomyces coelicolor (64 aa), FASTA scores: opt: 185, E(): 5.9e-07, (39.7% identity in 63 aa overlap). Second part of the protein is highly similar to C-terminus of upstream ORF O33285|Rv2742c|MTV002.07c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (277 aa), FASTA scores: opt: 200, E(): 1.7e-07, (78.4% identity in 37 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217254.1" /db_xref="GI:15609875" /db_xref="GeneID:888368" /translation="MLAGVRLTEFHERVALHFGAAYGSSVLLDHVLTGFDGRSAAQAI EDGVEPRDVWRALCADFDVPHDRW" gene complement(3052023..3053189) /locus_tag="Rv2739c" /db_xref="GeneID:888363" CDS complement(3052023..3053189) /locus_tag="Rv2739c" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2739c, (MTV002.04c), len: 388 aa. Possible ala-rich transferase (EC 2.-.-.-), equivalent to Q49841|ML0985|MLCB33.02c|U2235C POSSIBLE GLYCOSYLTRANSFERASE from Mycobacterium leprae (392 aa), FASTA scores: opt: 2112, E(): 5.1e-114, (80.95% identity in 388 aa overlap). Shows some similarity with other transferases e.g. Q9S1V2|SCJ4.21 PUTATIVE GLYCOSYL TRANSFERASE from Streptomyces coelicolor (407 aa), FASTA scores: opt: 290, E(): 2e-09, (27.75% identity in 382 aa overlap); Q9RYI3|DRA0329 PUTATIVE GLYCOSYLTRANSFERASE from Deinococcus radiodurans (418 aa), FASTA scores: opt: 267, E(): 4.3e-08, (29.05% identity in 396 aa overlap); P96560|GTFC GLYCOSYLTRANSFERASE from Amycolatopsis orientalis (409 aa), FASTA scores: opt: 253, E(): 2.7e-07, (27.75% identity in 418 aa overlap); etc. Equivalent to AAK47130 from Mycobacterium tuberculosis strain CDC1551 (420 aa) but shorter 32 aa." /codon_start=1 /transl_table=11 /product="alanine rich transferase" /protein_id="NP_217255.1" /db_xref="GI:15609876" /db_xref="GeneID:888363" /translation="MRVAVVAGPDPGHSFPAIALCQRFRAAADTPTLFTGVEWLEAAR AAGIDAVELDGLAATDRDLDAGARIHRRAAQMAVLNVPRLRALEPELVVSDVITACGG MAAELLGIPWVELNPHPLYLPSKGLPPIGSGLAAGTGIRGRLRDATMRALTGRSWRAG LRQRAAVRVEIGLPARDPGPLRRLIATLPALEVPRPDWPAEAVVVGPLHFEPTDRVLA IPAGTGPVVVVAPSTALTGTAGLTEVALQSLTPGETVPSGSRLVVSRLSGADLTVPPW AVAGLGSQAELLTRADLVICGGGHGMVAKTLLAGVPMVVVPGGGDQWEIANRVVRQGS AVLIRPLTADALVAAVNEVLSSPRFREAARRAAASVAGAADPVRVCHDALALAG" gene 3053233..3053682 /locus_tag="Rv2740" /db_xref="GeneID:888365" CDS 3053233..3053682 /locus_tag="Rv2740" /function="UNKNOWN" /note="Rv2740, (MTV002.05), len: 149 aa. Conserved hypothetical protein, equivalent, but shorter 17 aa, to Q9CCC2|ML0984 (alias Q49850 but longer) HYPOTHETICAL PROTEIN from Mycobacterium leprae (164 aa), FASTA scores: opt: 481, E(): 9.7e-26, (52.0% identity in 150 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217256.1" /db_xref="GI:15609877" /db_xref="GeneID:888365" /translation="MAELTETSPETPETTEAIRAVEAFLNALQNEDFDTVDAALGDDL VYENVGFSRIRGGRRTATLLRRMQGRVGFEVKIHRIGADGAAVLTERTDALIIGPLRV QFWVCGVFEVDDGRITLWRDYFDVYDMFKGLLRGLVALVVPSLKATL" gene 3053914..3055491 /gene="PE_PGRS47" /locus_tag="Rv2741" /db_xref="GeneID:888339" CDS 3053914..3055491 /gene="PE_PGRS47" /locus_tag="Rv2741" /function="UNKNOWN" /note="Rv2741, (MTV002.06), len: 525 aa. Member of the M. tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to others e.g. Q10637|YD25_MYCTU|Rv1325c|MT1367|MTCY130.10c HYPOTHETICAL PE-PGRS FAMILY PROTEIN (603 aa), FASTA scores: opt: 1936, E(): 1.1e-71, (56.95% identity in 611 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177902.1" /db_xref="GI:57117018" /db_xref="GeneID:888339" /translation="MSFVIAAPEFLTAAAMDLASIGSTVSAASAAASAPTVAILAAGA DEVSIAVAALFGMHGQAYQALSVQASAFHQQFVQALTAGAYSYASAEAAAVTPLQQLV DVINAPFRSALGRPLIGNGANGKPGTGQDGGAGGLLYGSGGNGGSGLAGSGQKGGNGG AAGLFGNGGAGGAGASNQAGNGGAGGNGGAGGLIWGTAGTGGNGGFTTFLDAAGGAGG AGGAGGLFGAGGAGGVGGAALGGGAQAAGGNGGAGGVGGLFGAGGAGGAGGFSDTGGT GGAGGAGGLFGPGGGSGGVGGFGDTGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGG AGGAGGTVFGSGGAGGAGGVATVAGHGGHGGNAGLLYGTGGAGGAGGFGGFGGDGGDG GIGGLVGSGGAGGSGGTGTLSGGRGGAGGNAGTFYGSGGAGGAGGESDNGDGGNGGVG GKAGLVGEGGNGGDGGATIAGKGGSGGNGGNAWLTGQGGNGGNAAFGKAGTGSVGVGG AGGLLEGQNGENGLLPS" gene complement(3055515..3056348) /locus_tag="Rv2742c" /db_xref="GeneID:888332" CDS complement(3055515..3056348) /locus_tag="Rv2742c" /function="UNKNOWN" /note="Rv2742c, (MTV002.07c), len: 277 aa (questionable ORF). Conserved hypothetical arg-rich protein. Extreme N-terminus is highly similar to the N-teminus of Q9CCC1ML0986 HYPOTHETICAL PROTEIN from Mycobacterium leprae (67 aa), FASTA scores: opt: 183, E(): 0.00052, (71.05% identity in 38 aa overlap); and to the downstream ORF O33281|Rv2738c|MTV002.03c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (68 aa), FASTA scores: opt: 200, E(): 5.5e-05, (78.4% identity in 37 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217258.1" /db_xref="GI:15609879" /db_xref="GeneID:888332" /translation="MLVDELGVKIVHAQHVPAPYLVQRMREIHERDENRQRHAQVDVQ RRRDQPERGQHQHRRNRDADHHPDGRTLAGQIVAHPVSHRVRQPRPVAIADVLPRVGP RADCVVAHSLQGSPRRRERRRGQTAHQRLGRRSGNAIACPLYLENAAGPEPDTKRAEG RRFGAFGGGDLRWMADRVPRQGSGRRGLGSRSGAGVPQGADARGWRHTADGVPRVGQP AIRRGVPGFWCWLDHVLTGFGGRNAICAIEDGVEPRVAWWALCTDFDVPRSMGRRTPG G" gene complement(3056420..3057232) /locus_tag="Rv2743c" /db_xref="GeneID:887779" CDS complement(3056420..3057232) /locus_tag="Rv2743c" /function="UNKNOWN" /note="Rv2743c, (MTV002.08c), len: 270 aa. Possible conserved transmembrane ala-rich protein, equivalent to Q49833|MLCB33.04c|B2235_C1_148 UNKNOWN PROTEIN from Mycobacterium leprae (123 aa), FASTA scores: opt: 639, E(): 3.3e-31, (74.8% identity in 123 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217259.1" /db_xref="GI:15609880" /db_xref="GeneID:887779" /translation="MAVKAGQRRPWRSLLQRGVDTAGDLADLVAQKISVAIDPRARLL RRRRRALRWGLVFTAGCLLWGLVTALLAAWGWFTSLLVITGTIAVTQAIPATLLLLRY RWLRSEPLPVRRPASVRRLPPPGSAARPAMSALGASERGFFSLLGVMERGAMLPADEI RDLTAAANQTSAAMVATAAEVVSMERAVQCSAASRSYLVPTINAFTAQLSTGVRQYNE MVTAAAQLVSSANGAGGAGPGQQRYREELAGATDRLVAWAQAFDELGGLPRR" gene complement(3057251..3058063) /gene="35kd_ag" /locus_tag="Rv2744c" /db_xref="GeneID:888304" CDS complement(3057251..3058063) /gene="35kd_ag" /locus_tag="Rv2744c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2744c, (MTV002.09c), len: 270 aa. 35kd_ag, conserved ala-rich protein 35-kd antigen (see O'Connor et al., 1990). N-terminal part is equivalent to Q49840|MLCB33.06c|B2235_C2_187 HYPOTHETICAL PROTEIN from Mycobacterium leprae (167 aa), FASTA scores: opt: 789, E(): 3.4e-35, (85.05% identity in 147 aa overlap); and C-terminal part equivalent to Q49845|MLCB33.05c|B2235_C3_214 HYPOTHETICAL PROTEIN from Mycobacterium leprae (114 aa), FASTA scores: opt: 465, E(): 3.6e-18, (65.8% identity in 114 aa overlap); note that these two proteins from Mycobacterium leprae are adjacent. Shows some similarity with Q55707||Y617_SYNY3|SLL0617 HYPOTHETICAL 28.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (267 aa), FASTA scores: opt: 155, E(): 0.19, (23.4% identity in 252 aa overlap); and C-terminus of Q9L4N1|EMM M PROTEIN from Streptococcus equisimilis (592 aa), FASTA scores: opt: 165, E(): 0.11, (23.45% identity in 260 aa overlap). C-terminus also similar to AAK45945|MT1676 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (85 aa), FASTA scores: opt: 159, E(): 0.047, (50.9% identity in 55 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177903.1" /db_xref="GI:57117019" /db_xref="GeneID:888304" /translation="MANPFVKAWKYLMALFSSKIDEHADPKVQIQQAIEEAQRTHQAL TQQAAQVIGNQRQLEMRLNRQLADIEKLQVNVRQALTLADQATAAGDAAKATEYNNAA EAFAAQLVTAEQSVEDLKTLHDQALSAAAQAKKAVERNAMVLQQKIAERTKLLSQLEQ AKMQEQVSASLRSMSELAAPGNTPSLDEVRDKIERRYANAIGSAELAESSVQGRMLEV EQAGIQMAGHSRLEQIRASMRGEALPAGGTTATPRPATETSGGAIAEQPYGQ" gene complement(3058193..3058531) /locus_tag="Rv2745c" /db_xref="GeneID:888315" CDS complement(3058193..3058531) /locus_tag="Rv2745c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2745c, (MTV002.10c), len: 112 aa. Possible transcriptional regulatory protein, highly similar to O86815|SC7C7.10 HYPOTHETICAL 13.6 KDA PROTEIN from Streptomyces coelicolor (126 aa), FASTA scores: opt: 300, E(): 2.4e-13, (60.45% identity in 86 aa overlap); and highly similar to other transcriptional regulators e.g. Q9X7S1|SC5H1.13c POSSIBLE DNA-BINDING PROTEIN from Streptomyces coelicolor (157 aa), FASTA scores: opt: 254, E(): 3.3e-10, (50.0% identity in 94 aa overlap) (N-terminus longer); Q9F885|POPR TRANSCRIPTIONAL REGULATOR from Streptomyces lividans (148 aa), FASTA scores: opt: 248, E(): 7.8e-10, (53.6% identity in 97 aa overlap) (N-terminus longer); Q9FCH1|2SCD46.12 PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (141 aa), FASTA scores: opt: 162, E(): 0.00038, (33.0% identity in 106 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217261.1" /db_xref="GI:15609882" /db_xref="GeneID:888315" /translation="MAALVREVVGDVLRGARMSQGRTLREVSDSARVSLGYLSEIERG RKEPSSELLSAICTALQLPLSVVLIDAGERMARQERLARATPAGRATGATIDASTKVV IAPVVSLAVA" gene complement(3058602..3059231) /gene="pgsA3" /locus_tag="Rv2746c" /db_xref="GeneID:888375" CDS complement(3058602..3059231) /gene="pgsA3" /locus_tag="Rv2746c" /EC_number="2.7.8.5" /function="INVOLVED IN ACIDIC PHOSPHOLIPID BIOSYNTHESIS. THIS PROTEIN PROBABLY CATALYZES THE COMMITTED STEP TO THE SYNTHESIS OF THE ACIDIC PHOSPHOLIPIDS (BY SIMILARITY) [CATALYTIC ACTIVITY: CDP-DIACYLGLYCEROL + GLYCEROL-3-PHOSPHATE = CMP + 3-(3-PHOSPHATIDYL)-GLYCEROL 1-PHOSPHATE]." /note="Rv2746c, (MTV002.11c), len: 209 aa. Probable pgsA3, PGP synthase (EC 2.7.8.5) (see citation below), transmembrane protein, equivalent, but longer 19 aa, to Q49839|O08087|PGSA|ML0979 PGSA from Mycobacterium leprae (193 aa), FASTA scores: opt: 925, E(): 3.7e-53, (77.15% identity in 188 aa overlap). Also highly similar to O86813|PGSA PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE from Streptomyces coelicolor (263 aa), FASTA scores: opt: 692, E(): 6.6e-38, (57.85% identity in 185 aa overlap) (has its N-terminus longer); and similar to others (generally with N-terminus shorter) e.g. Q99XI0|PGSA|SPY2196 PHOSPHATIDYLGLYCEROPHOSPHATE SYNTHASE from Streptococcus pyogenes (180 aa), FASTA scores: opt: 368, E(): 5.4e-17, (39.9% identity in 168 aa overlap); Q9ZE96|PGSA_RICPR|PGSA|RP049 CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3- PHOSPHATIDYLTRANSFERASE from Rickettsia prowazekii (181 aa), FASTA scores: opt: 343, E(): 2.3e-15, (40.1% identity in 172 aa overlap); P06978|PGSA_ECOLI|PGSA|B1912|Z3000|ECS2650 CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3- PHOSPHATIDYLTRANSFERASE from Escherichia coli strains K12 and O157:H7 (181 aa), FASTA scores: opt: 322, E(): 5.3e-14, (34.45% identity in 180 aa overlap); etc. Also some similarity to PGSA2|Rv1822|MTCY1A11.21c PROBABLE CDP-DIACYLGLYCEROL--GLYCEROL-3-PHOSPHATE 3-PHOSPHATIDYLTRANSFERASE from Mycobacterium tuberculosis (209 aa), FASTA score: (27.1% identity in 166 aa overlap). Contains PS00379 CDP-alcohol phosphatidyltransferases signature. BELONGS TO THE CDP-ALCOHOL PHOSPHATIDYLTRANSFERASE CLASS-I FAMILY." /codon_start=1 /transl_table=11 /product="CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase" /protein_id="NP_217262.1" /db_xref="GI:15609883" /db_xref="GeneID:888375" /translation="MSRSTRYSVAVSAQPETGQIAGRARIANLANILTLLRLVMVPVF LLALFYGGGHHSAARVVAWAIFATACITDRFDGLLARNYGMATEFGAFVDPIADKTLI GSALIGLSMLGDLPWWVTVLILTRELGVTVLRLAVIRRGVIPASWGGKLKTFVQAVAI GLFVLPLSGPLHVAAVVVMAAAILLTVITGVDYVARALRDIGGIRQTAS" misc_feature complement(3058938..3059006) /gene="pgsA3" /locus_tag="Rv2746c" /note="PS00379 CDP-alcohol phosphatidyltransferases signature" gene 3059262..3059786 /locus_tag="Rv2747" /db_xref="GeneID:888407" CDS 3059262..3059786 /locus_tag="Rv2747" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="catalyzes the conversion of l-glutamate to a-N-acetyl-l-glutamate in arginine biosynthesis" /codon_start=1 /transl_table=11 /product="N-acetylglutamate synthase" /protein_id="NP_217263.1" /db_xref="GI:15609884" /db_xref="GeneID:888407" /translation="MTERPRDCRPVVRRARTSDVPAIKQLVDTYAGKILLEKNLVTLY EAVQEFWVAEHPDLYGKVVGCGALHVLWSDLGEIRTVAVDPAMTGHGIGHAIVDRLLQ VARDLQLQRVFVLTFETEFFARHGFTEIEGTPVTAEVFDEMCRSYDIGVAEFLDLSYV KPNILGNSRMLLVL" gene complement(3059855..3062506) /gene="ftsK" /locus_tag="Rv2748c" /db_xref="GeneID:888408" CDS complement(3059855..3062506) /gene="ftsK" /locus_tag="Rv2748c" /function="POSSIBLY INVOLVED IN CELL DIVISION PROCESSES" /note="Rv2748c, (MTV002.13c), len: 883 aa. Possible ftsK, cell division transmembrane protein, equivalent to O05560|ML0977|FTSK|MLCB33.09c CELL DIVISION PROTEIN from Mycobacterium leprae (886 aa), FASTA scores: opt: 3147, E(): 7.9e-175, (78.1% identity in 885 aa overlap). Also similar to other members of the spoIIIE/ftsK family e.g. O86810|SC7C7.05 FTSK HOMOLOG from Streptomyces coelicolor (929 aa), FASTA scores: opt: 2256, E(): 3.8e-123, (49.05% identity in 924 aa overlap); Q9CF25|FTSK CELL DIVISION PROTEIN FTSK from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (763 aa), FASTA scores: opt: 1438, E(): 9.1e-76, (37.7% identity in 751 aa overlap); AAK75005|Q97RE4|SP0878 SPOE FAMILY PROTEIN from Streptococcus pneumoniae (767 aa), FASTA scores: opt: 1405, E(): 7.5e-74, (48.0% identity in 477 aa overlap); P46889|FTSK_ECOLI|B0890 from Escherichia coli strain K12 (1329 aa), FASTA scores: opt: 759, E(): 0, (44.5% identity in 537 aa overlap) (similarity in C-terminal half); etc. Equivalent to AAK47139 from Mycobacterium tuberculosis strain CDC1551 (968 aa) but shorter 85 aa. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE FTSK/SPOIIIE FAMILY." /codon_start=1 /transl_table=11 /product="cell division transmembrane protein FtsK" /protein_id="NP_217264.1" /db_xref="GI:15609885" /db_xref="GeneID:888408" /translation="MLGPPGTPRVGRRDAARSLVTLLRRPWQRGEQIAVTSVADGVDG VIATRLAVMSSKTVARSGTRTSRSKATSRGASRSARSAVPRKRSRPVKGVGRPSRRHH RSLLVSTGLACGRAMRAVWMMAAKGTGGAARSIGRARDIEPGHRRDGIALVLLGLAVV VAASSWFDAARPLGAWVDALLRTFIGSAVVMLPLVAAAVAVVLMRTSPNPDSRPRLIL GASLIGLSFLGLCHLWAGSPEAPESRLRAAGFIGFAIGGPLSDGLTAWIAAPLLFIGA LFGLLLLAGITIREVPDAMRAMFGTRLLPREYADDFEDFADFDGDDADTVEVARQDFS DGYYDEVPLCSDDGPPAWPSAEVPQDDTATIPEASAGRGSGRRGRRKDTQVLDRIVEG PYTLPSLDLLISGDPPKKRSAANTHMAGAIGEVLTQFKVDAAVTGCTRGPTVTRYEVE LGPGVKVEKITALQRNIAYAVATESVRMLAPIPGKSAVGIEVPNTDREMVRLADVLTA RETRRDHHPLVIGLGKDIEGDFISANLAKMPHLLVAGSTGSGKSSFVNSMLVSLLTRA TPEEVRMILIDPKMVELTPYEGIPHLITPIITQPKKAAAALAWLVDEMEQRYQDMQAS RVRHIDDFNDKVRSGAITAPLGSQREYRPYPYVVAIVDELADLMMTAPRDVEDAIVRI TQKARAAGIHLVLATQRPSVDVVTGLIKTNVPSRLAFATSSLTDSRVILDQAGAEKLI GMGDGLFLPMGASKPLRLQGAYVSDEEIHAVVTACKEQAEPEYTEGVTTAKPTAERTD VDPDIGDDMDVFLQAVELVVSSQFGSTSMLQRKLRVGFAKAGRLMDLMETRGIVGPSE GSKAREVLVKPDELAGTLAAIRGDGGE" misc_feature complement(3060851..3060874) /gene="ftsK" /locus_tag="Rv2748c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3062505..3062819 /locus_tag="Rv2749" /db_xref="GeneID:888390" CDS 3062505..3062819 /locus_tag="Rv2749" /function="UNKNOWN" /note="Rv2749, (MTV002.14), len: 104 aa. Conserved hypothetical protein, showing some similarity with Q9I1R9|PA2198 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (114 aa), FASTA scores: opt: 157, E(): 0.00081, (35.0% identity in 100 aa overlap); and O86332|Rv0793|MTV042.03 HYPOTHETICAL 11.2 KDA PROTEIN from Mycobacterium tuberculosis (101 aa), FASTA scores: opt: 143, E(): 0.0062, (26.9% identity in 93 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217265.1" /db_xref="GI:15609886" /db_xref="GeneID:888390" /translation="MPVVVVATLTAKPESVDTVRDILTRAVDDVHREPGCQLYALHET GETFIFVEQWADAEALKAHSGAPAVATMFTAAGEHLVGAPDIKLLQPVPAGDPSKGQL RR" gene 3062816..3063634 /locus_tag="Rv2750" /db_xref="GeneID:888384" CDS 3062816..3063634 /locus_tag="Rv2750" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Catalyzes the first of the two reduction steps in the elongation cycle of fatty acid synthesis" /codon_start=1 /transl_table=11 /product="3-ketoacyl-(acyl-carrier-protein) reductase" /protein_id="NP_217266.1" /db_xref="GI:15609887" /db_xref="GeneID:888384" /translation="MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQI ASVPYPLSTADDLAATVELVEDAGGGIVARQGDVRDRASLSVALQAGLDEFGRLDIVV ANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVLISSAAGLVGI GSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHPCGVDTPMINNEFFQQWLT TADMDAPHNLGNALPVELVQPTDIANAVAWLASEEARYVTGVTLPVDAGFVNKR" gene 3063638..3064528 /locus_tag="Rv2751" /db_xref="GeneID:887806" CDS 3063638..3064528 /locus_tag="Rv2751" /function="UNKNOWN" /note="Rv2751, (MTV002.16), len: 296 aa. Conserved hypothetical protein, similar in part to others e.g. Q98LR1|MLR0915 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (299 aa), FASTA scores: opt: 279, E(): 1.6e-11, (32.85% identity in 210 aa overlap); Q9FBX1|SC8E7.10 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (283 aa), FASTA scores: opt: 232, E(): 2.4e-08, (27.9% identity in 269 aa overlap); Q9FMY9 HYPOTHETICAL PROTEIN (GENOMIC DNA, CHROMOSOME 5, P1 CLONE:MJB21) from Arabidopsis thaliana (Mouse-ear cress) (370 aa), FASTA scores: opt: 205, E(): 2.1e-06, (28.9% identity in 211 aa overlap); etc. Also similar in part to several proteins from Mycobacterium tuberculosis: P72053|Rv3787c|MTCY13D12.21 HYPOTHETICAL 33.4 KDA PROTEIN (308 aa), FASTA scores: opt: 266, E(): 1.3e-10, (29.6% identity in 267 aa overlap); O53795|MBE50c|Rv0731c|MTV041.05c HYPOTHETICAL 34.9 KDA PROTEIN (318 aa), FASTA scores: opt: 266, E(): 1.3e-10, (32.05% identity in 281 aa overlap); O53841|Rv0830|MTV043.22 HYPOTHETICAL 33.4 KDA PROTEIN (301 aa), FASTA scores: opt: 263, E(): 2e-10, (31.3% identity in 262 aa overlap); etc. BELONGS TO THE MTCY13D12.21 / MTCY210.45C / MTCY78.29C FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217267.1" /db_xref="GI:15609888" /db_xref="GeneID:887806" /translation="MARNPAAQTAFGPMVLAAVEQNEPPGRRLVDDDLADLFLPRPLR WLAGATRSAVLRRLLISASEWSGRGLWANLACRKRFIGDKLDEALGDIDAVVILGAGL DTRAYRLTRRVRMPVFEVDLPVNIARKAKTVRRVLGELPLSVRLVALDFEHDDLLTAL AEHGYRTEYRVFFVCEGVTQYLTERAVRRTLEGLRAAAPGSRMVFTYVRRDFIDGTNR YGTRTLYHTVRQRRQLWHFGLDPEEVAGFLADYGWRLTEQAGPEELVQRYVEPTGRNL NASQIEWSAYAEKSEPVTPR" gene complement(3064515..3066191) /locus_tag="Rv2752c" /db_xref="GeneID:887802" CDS complement(3064515..3066191) /locus_tag="Rv2752c" /function="UNKNOWN" /note="Rv2752c, (MTV002.17c), len: 558 aa. Conserved hypothetical protein, equivalent to Q9CBW5|ML1512 HYPOTHETICAL PROTEIN from Mycobacterium leprae (558 aa), FASTA scores: opt: 3301, E(): 1.2e-195, (89.05% identity in 558 aa overlap). Also highly similar to other hypothetical proteins from a wide range of prokaryotes e.g. CAC19480|P54122|YOR4_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (718 aa), FASTA scores: opt: 2142, E(): 3.5e-124, (57.2% identity in 554 aa overlap) (N-terminus longer); O86842|SC9A10.09 from Streptomyces coelicolor (561 aa), FASTA scores: opt: 2077, E(): 2.9e-120, (55.95% identity in 556 aa overlap); Q9ZI80 from Streptomyces toyocaensis (528 aa), FASTA scores: opt: 1843, E(): 7.3e-106, (52.45% identity in 528 aa overlap) (N-terminus shorter 30 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217268.1" /db_xref="GI:15609889" /db_xref="GeneID:887802" /translation="MDVDLPPPGPLTSGGLRVTALGGINEIGRNMTVFEHLGRLLIID CGVLFPGHDEPGVDLILPDMRHVEDRLDDIEALVLTHGHEDHIGAIPFLLKLRPDIPV VGSKFTLALVAEKCREYRITPVFVEVREGQSTRHGVFECEYFAVNHSTPDALAIAVYT GAGTILHTGDIKFDQLPPDGRPTDLPGMSRLGDTGVDLLLCDSTNAEIPGVGPSESEV GPTLHRLIRGADGRVIVACFASNVDRVQQIIDAAVALGRRVSFVGRSMVRNMRVARQL GFLRVADSDLIDIAAAETMAPDQVVLITTGTQGEPMSALSRMSRGEHRSITLTAGDLI VLSSSLIPGNEEAVFGVIDALSKIGARVVTNAQARVHVSGHAYAGELLFLYNGVRPRN VMPVHGTWRMLRANAKLAASTGVPQESILLAENGVSVDLVAGKASISGAVPVGKMFVD GLIAGDVGDITLGERLILSSGFVAVTVVVRRGTGQPLAAPHLHSRGFSEDPKALEPAV RKVEAELESLVAANVTDPIRIAQGVRRTVGKWVGETYRRQPMIVPTVIEV" gene complement(3066222..3067124) /gene="dapA" /locus_tag="Rv2753c" /db_xref="GeneID:888289" CDS complement(3066222..3067124) /gene="dapA" /locus_tag="Rv2753c" /EC_number="4.2.1.52" /function="INVOLVED IN BIOSYNTHESIS OF DIAMINOPIMELATE AND LYSINE FROM ASPARTATE SEMIALDEHYDE (AT THE FIRST STEP) [CATALYTIC ACTIVITY: L-ASPARTATE 4-SEMIALDEHYDE + PYRUVATE = DIHYDRODIPICOLINATE + 2 H(2)O]." /note="catalyzes the formation of dihydrodipicolinate from L-aspartate 4-semialdehyde and pyruvate in lysine and diaminopimelate biosynthesis" /codon_start=1 /transl_table=11 /product="dihydrodipicolinate synthase" /protein_id="NP_217269.1" /db_xref="GI:15609890" /db_xref="GeneID:888289" /translation="MTTVGFDVAARLGTLLTAMVTPFSGDGSLDTATAARLANHLVDQ GCDGLVVSGTTGESPTTTDGEKIELLRAVLEAVGDRARVIAGAGTYDTAHSIRLAKAC AAEGAHGLLVVTPYYSKPPQRGLQAHFTAVADATELPMLLYDIPGRSAVPIEPDTIRA LASHPNIVGVKDAKADLHSGAQIMADTGLAYYSGDDALNLPWLAMGATGFISVIAHLA AGQLRELLSAFGSGDIATARKINIAVAPLCNAMSRLGGVTLSKAGLRLQGIDVGDPRL PQVAATPEQIDALAADMRAASVLR" misc_feature complement(3066606..3066698) /gene="dapA" /locus_tag="Rv2753c" /note="PS00666 Dihydrodipicolinate synthetase signature 2" misc_feature complement(3066930..3066983) /gene="dapA" /locus_tag="Rv2753c" /note="PS00665 Dihydrodipicolinate synthetase signature 1" gene complement(3067193..3067945) /gene="thyX" /locus_tag="Rv2754c" /db_xref="GeneID:887766" CDS complement(3067193..3067945) /gene="thyX" /locus_tag="Rv2754c" /EC_number="2.1.1.148" /function="CATALYZES THE FORMATION OF DTMP AND TETRAHYDROFOLATE FROM DUMP AND METHYLENETETRAHYDROFOLATE" /note="flavin dependent thymidylate synthase; ThyX; thymidylate synthase complementing protein; catalyzes the formation of dTMP and tetrahydrofolate from dUMP and methylenetetrahydrofolate; the enzyme from Mycobacterium tuberculosis forms homotetramers; uses FAD as a cofactor" /codon_start=1 /transl_table=11 /product="FAD-dependent thymidylate synthase" /protein_id="NP_217270.1" /db_xref="GI:15609891" /db_xref="GeneID:887766" /translation="MAETAPLRVQLIAKTDFLAPPDVPWTTDADGGPALVEFAGRACY QSWSKPNPKTATNAGYLRHIIDVGHFSVLEHASVSFYITGISRSCTHELIRHRHFSYS QLSQRYVPEKDSRVVVPPGMEDDADLRHILTEAADAARATYSELLAKLEAKFADQPNA ILRRKQARQAARAVLPNATETRIVVTGNYRAWRHFIAMRASEHADVEIRRLAIECLRQ LAAVAPAVFADFEVTTLADGTEVATSPLATEA" gene complement(3068189..3068464) /gene="hsdS.1" /locus_tag="Rv2755c" /db_xref="GeneID:887776" CDS complement(3068189..3068464) /gene="hsdS.1" /locus_tag="Rv2755c" /function="IMPLICATED IN RESTRICTION/MODIFICATION OF DNA. COMPONENT OF TYPE I RESTRICTION/MODIFICATION SYSTEM. IT'S POSSIBLE THAT THE M AND S SUBUNITS TOGETHER FORM A METHYLTRANSFERASE (MTASE) THAT METHYLATES TWO ADENINE RESIDUES IN COMPLEMENTARY STRANDS OF BIPARTITE DNA RECOGNITION SEQUENCE." /note="Rv2755c, (MTV002.20c), len: 91 aa. Possible hsdS.1, fragment of type I restriction/modification system specificity determinant (S protein), similar to the N-terminus of other hsdS proteins e.g. O34140|HSDS from Klebsiella pneumoniae (439 aa), FASTA scores: opt: 303, E(): 2.1e-13, (46.65% identity in 90 aa overlap); P72419|STY|SBLI from Salmonella typhimurium (434 aa), FASTA scores: opt: 278, E(): 1.1e-11, (47.65% identity in 86 aa overlap); and Q9P9X9|XF2741 from Xylella fastidiosa (412 aa), FASTA scores: opt: 144, E(): 0.015, (31.7% identity in 82 aa overlap). Also some similarity with O33303|Rv2761c|MTV002.26c|HSDS POSSIBLE TYPE I RESTRICTION/MODIFICATION SYSTEM SPECIFICITY DETERMINANT from Mycobacterium tuberculosis (364 aa), FASTA scores: opt: 145, E(): 0.012, (29.9% identity in 87 aa overlap). Note that previously known as hsdS'.; hsdS'" /codon_start=1 /transl_table=11 /product="type I restriction/modification system specificity determinant HsdS" /protein_id="YP_177904.1" /db_xref="GI:57117020" /db_xref="GeneID:887776" /translation="MSDGWKTLRFGEVLELQRGHDLPAASRGSGTVPVIGSFGVTGMH DTAAYDGPGVAIGRSGAAIGTATFVAGPIWPLDTCLFVRDFKGNDPR" gene complement(3068461..3070083) /gene="hsdM" /locus_tag="Rv2756c" /db_xref="GeneID:888278" CDS complement(3068461..3070083) /gene="hsdM" /locus_tag="Rv2756c" /EC_number="2.1.1.-" /function="IMPLICATED IN METHYLATION OF DNA. COMPONENT OF TYPE I RESTRICTION/MODIFICATION SYSTEM. IT IS POSSIBLE THAT THE M AND S SUBUNITS TOGETHER FORM A METHYLTRANSFERASE (MTASE) THAT METHYLATES TWO ADENINE RESIDUES IN COMPLEMENTARY STRANDS OF BIPARTITE DNA RECOGNITION SEQUENCE." /note="Rv2756c, (MTV002.21c), len: 540 aa. Possible hsdM, type I restriction/modification system DNA methylase (M protein) (EC 2.1.1.-), highly similar to others e.g. Q9P9X8|XF2742 from Xylella fastidiosa (519 aa), FASTA scores: opt: 1613, E(): 1.9e-96, (52.3% identity in 543 aa overlap); O34139|HSDM from Klebsiella pneumoniae (539 aa), FASTA scores: opt: 1267, E(): 4.4e-74, (45.9% identity in 549 aa overlap); P72418|STY|SBLI|HSDM from Salmonella typhimurium (539 aa), FASTA scores: opt: 1263, E(): 8e-74, (45.7% identity in 549 aa overlap); etc. Possible alternative start site (GTG) overlapping with termination codon of previous ORF 90 bp upstream. Note that the corresponding endonuclease (M protein) does not appear to be present in Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="type I restriction/modification system DNA methylase HsdM" /protein_id="NP_217272.1" /db_xref="GI:15609893" /db_xref="GeneID:888278" /translation="MPPRKKQAPQAPSTMKELKDTLWKAADKLRGSLSASQYKDVILG LVFLKYVSDAYDERREAIRAELAAEGMEESQIEDLIDDPEQYQGYGVFVVPVSARWKF LAENTKGKPAVGGEPAKNIGQLIDEAMDAVMKANPTLGGTLPRLYNKDNIDQRRLGEL IDLFNSARFSRQGEHRARDLMGEVYEYFLGNFARAEGKRGGEFFTPPSVVKVIVEVLE PSSGRVYDPCCGSGGMFVQTEKFIYEHDGDPKDVSIYGQESIEETWRMAKMNLAIHGI DNKGLGARWSDTFARDQHPDVQMDYVMANLPFNIKDWARNEEDPRWRFGVPPANNANY AWIQHILYKLAPGGRAGVVMANGSMSSNSNGEGDIRAQIVEADLVSCMVALPTQLFRS TGIPVCLWFFAKDKAAGKQGSIDRCGQVLFIDARELGDLVDRAERALTNEEIVRIGDT FHAWRGSKSAAVKGIMYEDVPGFCKSATLAEIKATDYALTPGRYVGTPAVEDDGEPID EKMARLSKALLEAFDESARLERVVREQLGRLR" gene complement(3070170..3070586) /locus_tag="Rv2757c" /db_xref="GeneID:888249" CDS complement(3070170..3070586) /locus_tag="Rv2757c" /function="UNKNOWN" /note="Rv2757c, (MTV002.22c), len: 138 aa. Conserved hypothetical protein, similar to several other M. tuberculosis hypothetical proteins e.g. P96411|Rv0229c| MTCY08D5.24c (226 aa), FASTA scores: opt: 354, E(): 4.6e-18, (45.25% identity in 137 aa overlap) (N-terminus longer 89 aa); P95007|RV2546|MTCY159.10c (137 aa), FASTA scores: opt: 265, E(): 7.5e-12, (38.5% identity in 135 aa overlap); O07228|Rv0301|MTCY63.06 (141 aa), FASTA scores: opt: 259, E(): 2.1e-11, (42.4% identity in 132 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217273.1" /db_xref="GI:15609894" /db_xref="GeneID:888249" /translation="MTTRYLLDKSAAYRAHLPAVRHRLEPLMERGLLARCGITDLEFG VSARSREDHRTLGTYRRDALEYVNTPDTVWVRAWEIQEALTDKGFHRSVKIPDLIIAA VAEHHGIPVMHYDQDFERIAAITRQPVEWVVAPGTA" gene complement(3070583..3070849) /locus_tag="Rv2758c" /db_xref="GeneID:888263" CDS complement(3070583..3070849) /locus_tag="Rv2758c" /function="UNKNOWN" /note="Rv2758c, (MTV002.23c), len: 88 aa. Conserved hypothetical protein, similar to several other M. tuberculosis hypothetical proteins e.g. P95008|Rv2545 (92 aa), FASTA scores: opt: 151, E(): 0.00028, (66.65% identity in 45 aa overlap); Q10771|YF60_MYCTU|RV1560|MT1611|MTCY48.05c (72 aa), FASTA scores: opt: 106, E(): 0.52, (39.15% identity in 46 aa overlap); O06565|Rv1113|MTCY22G8.02 (65 aa), FASTA scores: opt: 97, E(): 2.2, (33.35% identity in 69 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217274.1" /db_xref="GI:15609895" /db_xref="GeneID:888263" /translation="MHRGYALVVCSPGVTRTMIDIDDDLLARAAKELGTTTKKDTVHA ALRAALRASAARSLMNRMAENATGTQDEALVNAMWRDGHPENTA" misc_feature complement(3070709..3070795) /locus_tag="Rv2758c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene complement(3070875..3071270) /locus_tag="Rv2759c" /db_xref="GeneID:888293" CDS complement(3070875..3071270) /locus_tag="Rv2759c" /function="UNKNOWN" /note="Rv2759c, (MTV002.24c), len: 131 aa. Conserved hypothetical protein, highly similar to three M. tuberculosis hypothetical proteins O07769|Y609_MYCTU|Rv0609|MT0638|MTCY19H5.13c (133 aa), FASTA scores: opt: 364, E(): 5.1e-18, (49.6% identity in 131 aa overlap); P96914|Y624_MYCTU|Rv0624|MT0652|MTCY20H10.05 (131 aa), FASTA scores: opt: 324, E(): 2.9e-15, (42.85% identity in 126 aa overlap); and Q10874|YJ82_MYCTU|Rv1982c|MT2034|MTCY39.37 (139 aa), FASTA scores: opt: 271, E(): 1.4e-11, (38.6% identity in 127 aa overlap). Also similar to other hypothetical proteins from other bacteria e.g. CAC45376|SMC00900 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (128 aa), FASTA scores: opt: 286, E(): 1.2e-12, (39.55% identity in 129 aa overlap); Q981I7|MLL9357 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (131 aa), FASTA scores: opt: 257, E(): 1.2e-10, (36.35% identity in 132 aa overlap); Q9AAG1|CC0639 HYPOTHETICAL PROTEIN from Caulobacter crescentus (131 aa), FASTA scores: opt: 217, E(): 6.9e-08, (33.35% identity in 132 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217275.1" /db_xref="GI:15609896" /db_xref="GeneID:888293" /translation="MIVDTSAIVAIVSGESGAQVLKEALERSPNSRMSAPNYVELCAI MQRRDRPEISRLVDRLLDDYGIQVEAVDADQARVAAQAYRDYGRGSGHPARLNLGDTY SYALAQVTGEPLLFRGDDFTHTDIRPACT" gene complement(3071267..3071536) /locus_tag="Rv2760c" /db_xref="GeneID:887705" CDS complement(3071267..3071536) /locus_tag="Rv2760c" /function="UNKNOWN" /note="Rv2760c, (MTV002.25c), len: 89 aa. Conserved hypothetical protein, showing some similarity with two hypothetical proteins from Mycobacterium tuberculosis O07770|Rv0608|MTCY19H5.14c (81 aa), FASTA scores: opt: 128, E(): 0.057, (37.5% identity in 88 aa overlap); and P96913|Rv0623|MTCY20H10.04 (84 aa), FASTA scores: opt: 99, E(): 5.5, (37.1% identity in 89 aa overlap). Also showing some similarity with CAC45377|SMC00899 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (84 aa), FASTA scores: opt: 116, E(): 0.38, (36.25% identity in 91 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217276.1" /db_xref="GI:15609897" /db_xref="GeneID:887705" /translation="MSLNIKSQRTVALVRELAARTGTNQTAAVEDAVARRLSELDRED RARAEARRAAAEQTLRDLDKLLSDDDKRLIRRHEVDLYDDSGLPR" gene complement(3071546..3072640) /gene="hsdS" /locus_tag="Rv2761c" /db_xref="GeneID:887695" CDS complement(3071546..3072640) /gene="hsdS" /locus_tag="Rv2761c" /function="IMPLICATED IN RESTRICTION/MODIFICATION OF DNA. COMPONENT OF TYPE I RESTRICTION/MODIFICATION SYSTEM. IT IS THOUGHT THAT THE M AND S SUBUNITS TOGETHER FORM A METHYLTRANSFERASE (MTASE) THAT METHYLATES TWO ADENINE RESIDUES IN COMPLEMENTARY STRANDS OF BIPARTITE DNA RECOGNITION SEQUENCE." /experiment="experimental evidence, no additional details recorded" /note="Rv2761c, (MTV002.26c), len: 364 aa. Possible hsdS, type I restriction/modification system specificity determinant (S protein), similar in part to other hsdS protein (S PROTEINS) e.g. Q9P9X9|XF2741 from Xylella fastidiosa (412 aa), FASTA scores: opt: 252, E(): 7.4e-09, (24.95% identity in 401 aa overlap); N-terminus of Q9RC12 TYPE I S-SUBUNIT from Lactobacillus delbrueckii (subsp. lactis) (389 aa), FASTA scores: opt: 232, E(): 1.4e-07, (28.1% identity in 185 aa overlap); N-terminus of P72419|STY|SBLI from Salmonella typhimurium (434 aa), FASTA scores: opt: 221, E(): 8e-07, (28.45% identity in 130 aa overlap); C-terminus of P17222|PRRB_ECOLI from Escherichia coli strain CTR5X (401 aa), FASTA scores: opt: 197, E(): 2.8e-05, (27.05% identity in 148 aa overlap); etc. SEEMS TO BELONG TO TYPE-I RESTRICTION SYSTEM S METHYLASE FAMILY." /codon_start=1 /transl_table=11 /product="type I restriction/modification system specificity determinant HsdS" /protein_id="NP_217277.1" /db_xref="GI:15609898" /db_xref="GeneID:887695" /translation="MSRVEKVEKVRLGDHLDFSNGHTSGHTSPASEPGGRYPVYGANG VIGYSAQHNARGPLIVVGRVGSYCGSLRYCDSDVWVTDNALACRAKKPEETRYWYYAL LGFGLNRYRAGSGQPLLSQGVLRNVSVSAVAAPDRPRIGEILGAFDDKIAANDRVIEA AEALMLAIVGRLSAYVPLSSLASRSTACLDAQHFDSTVAHYSFAAFDGGAQPSRVGGR TIRSAKLVVSQPCVLFPKLNPRIPRIWNITSLPSEMALASTEFVVLRPVGVDTSALWA ALRQPDVLAELRQLVGGMTGSRQRIQPTQLLRVWVRDVRRLTPGHAAAIANLGALCNE RRIESARLASCRDALLPLLMSGIDGLPAGR" gene complement(3072637..3073056) /locus_tag="Rv2762c" /db_xref="GeneID:887698" CDS complement(3072637..3073056) /locus_tag="Rv2762c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2762c, (MTV002.27c), len: 139 aa. Conserved hypothetical protein, similar to C-terminus of hypothetical proteins: Q9A380|CC3324 from Caulobacter crescentus (409 aa), FASTA scores: opt: 181, E(): 9.8e-05, (43.55% identity in 101 aa overlap); Q98KQ4|MLR1373 from Rhizobium loti (Mesorhizobium loti) (399 aa), FASTA scores: opt: 174, E(): 0.00028, (46.35% identity in 82 aa overlap); and Q9HZZ9|PA2844 from Pseudomonas aeruginosa (402 aa), FASTA scores: opt: 158, E(): 0.0033, (40.0% identity in 80 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217278.1" /db_xref="GI:15609899" /db_xref="GeneID:887698" /translation="MSAATAAWDRRAAVVVGGVAEPGSAGPIAGADRKRLISRIQVRQ LDSAAVAAKRRHLYYVRPLDGHPVARVDRKTDRAADSLPVAGVLGELDIPPVTVAEGL AGELASMASWLGLGGIAVSTRGDLAGELCAATKRTNG" repeat_region 3073055..3073112 /note="51 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene complement(3073130..3073609) /gene="dfrA" /locus_tag="Rv2763c" /db_xref="GeneID:887777" CDS complement(3073130..3073609) /gene="dfrA" /locus_tag="Rv2763c" /EC_number="1.5.1.3" /function="ESSENTIAL STEP FOR DE NOVO GLYCINE AND PURINE SYNTHESIS, DNA PRECURSOR SYNTHESIS, AND FOR THE CONVERSION OF DUMP TO DTMP [CATALYTIC ACTIVITY: 5,6,7,8-TETRAHYDROFOLATE + NADP(+) = 7,8-DIHYDROFOLATE + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="Rv2763c, (MTV002.28c), len: 159 aa. Probable dfrA (alternate gene names: folA, dhfr), dihydrofolate reductase (EC 1.5.1.3), equivalent to O30463|FOLA DIHYDROFOLATE REDUCTASE from Mycobacterium avium (see citation below) (181 aa), FASTA scores: opt: 802, E(): 4.5e-48, (70.2% identity in 161 aa overlap); and Q9CBW1|FOLA|ML1518 DIHYDROFOLATE REDUCTASE from Mycobacterium leprae (165 aa), FASTA scores: opt: 782, E(): 1e-46, (70.55% identity in 163 aa overlap). Also highly similar to many e.g. Q9K168|DYR_NEIMB|FOLA|NMB0308 from Neisseria meningitidis (serogroup B) (162 aa), FASTA scores: opt: 469, E(): 3.8e-25, (46.65% identity in 163 aa overlap); P12833|DYR3_SALTY|DHFRIII from Salmonella typhimurium (162 aa), FASTA scores: opt: 367, E(): 4e-18, (45.4% identity in 141 aa overlap); Q59408|DYRC_ECOLI|DHFRXIII from Escherichia coli strain RA33.2 (165 aa), FASTA scores: opt: 313, E(): 2.2e-14, (41.9% identity in 136 aa overlap); etc. Contains PS00075 Dihydrofolate reductase signature. BELONGS TO THE DIHYDROFOLATE REDUCTASE FAMILY.; folA" /codon_start=1 /transl_table=11 /product="dihydrofolate reductase DFRA (DHFR) (tetrahydrofolate dehydrogenase)" /protein_id="NP_217279.1" /db_xref="GI:15609900" /db_xref="GeneID:887777" /translation="MVGLIWAQATSGVIGRGGDIPWRLPEDQAHFREITMGHTIVMGR RTWDSLPAKVRPLPGRRNVVLSRQADFMASGAEVVGSLEEALTSPETWVIGGGQVYAL ALPYATRCEVTEVDIGLPREAGDALAPVLDETWRGETGEWRFSRSGLRYRLYSYHRS" misc_feature complement(3073544..3073570) /gene="dfrA" /locus_tag="Rv2763c" /note="PS00075 Dihydrofolate reductase signature" gene complement(3073680..3074471) /gene="thyA" /locus_tag="Rv2764c" /db_xref="GeneID:887728" CDS complement(3073680..3074471) /gene="thyA" /locus_tag="Rv2764c" /EC_number="2.1.1.45" /function="INVOLVED IN DEOXYRIBONUCLEOTIDE BIOSYNTHESIS. PROVIDES THE SOLE DE NOVO SOURCE OF DTMP FOR DANA BIOSYNTHESIS [CATALYTIC ACTIVITY: 5,10-METHYLENETETRAHYDROFOLATE + DUMP = DIHYDROFOLATE + DTMP]." /experiment="experimental evidence, no additional details recorded" /note="ThyA; catalyzes formation of dTMP and 7,8-dihydrofolate from 5,10-methylenetetrahydrofolate and dUMP; involved in deoxyribonucleotide biosynthesis; there are 2 copies in some Bacilli, one of which appears to be phage-derived" /codon_start=1 /transl_table=11 /product="thymidylate synthase" /protein_id="NP_217280.1" /db_xref="GI:15609901" /db_xref="GeneID:887728" /translation="MTPYEDLLRFVLETGTPKSDRTGTGTRSLFGQQMRYDLSAGFPL LTTKKVHFKSVAYELLWFLRGDSNIGWLHEHGVTIWDEWASDTGELGPIYGVQWRSWP APSGEHIDQISAALDLLRTDPDSRRIIVSAWNVGEIERMALPPCHAFFQFYVADGRLS CQLYQRSADLFLGVPFNIASYALLTHMMAAQAGLSVGEFIWTGGDCHIYDNHVEQVRL QLSREPRPYPKLLLADRDSIFEYTYEDIVVKNYDPHPAIKAPVAV" misc_feature complement(3074010..3074096) /gene="thyA" /locus_tag="Rv2764c" /note="PS00091 Thymidylate synthase active site" gene 3074636..3075373 /locus_tag="Rv2765" /db_xref="GeneID:887714" CDS 3074636..3075373 /locus_tag="Rv2765" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2765, (MTV002.30), len: 245 aa. Probable ala-rich hydrolase (EC 3.-.-.-), similar to various hydrolases or hypothetical proteins e.g. Q9KYM6|SC9H11.13c PUTATIVE HYDROLASE from Streptomyces coelicolor (251 aa), FASTA scores: opt: 630, E(): 1.4e-33, (43.1% identity in 246 aa overlap); Q9A5T9|CC2358 DIENELACTONE HYDROLASE FAMILY PROTEIN from Caulobacter crescentus (286 aa), FASTA scores: opt: 592, E(): 4.5e-31, (38.45% identity in 242 aa overlap); Q9FCF1|2SCD46.33 PUTATIVE HYDROLASE (DIENELACTONE HYDROLASE FAMILY) from Streptomyces coelicolor (254 aa), FASTA scores: opt: 500, E(): 3.9e-25, (37.7% identity in 252 aa overlap); P73163|DLHH_SYNY3|SLL1298 PUTATIVE CARBOXYMETHYLENEBUTENOLIDASE (DIENELACTONE HYDROLASE) (EC 3.1.1.45) from Synechocystis sp. (strain PCC 6803) (246 aa), FASTA scores: opt: 276, E(): 1.3e-10, (26.95% identity in 230 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217281.1" /db_xref="GI:15609902" /db_xref="GeneID:887714" /translation="MPKTTDTAATPDGTCAVRLFTPDGPGRWPGVVMFPDAGGVRDTF DRMAAKLAGFGYVVLLPDVYYREGDWAPFDMKTAFGDPQERARIMFMIGTLTPDRVTR DADALLNYLASRPEVIGDRFGVCGYCMGGRMSVVVAGRLPDRVAAAAAFHPGGLVANS PDSPHLLADRISATVYIGGAENDPSFTADHAEKLDKAFSAAGVPHRIECYPAAHGFAV PDNPSYDAAADERHWAAMTETFGAALN" gene complement(3075588..3076370) /gene="fabG" /locus_tag="Rv2766c" /db_xref="GeneID:887727" CDS complement(3075588..3076370) /gene="fabG" /locus_tag="Rv2766c" /EC_number="1.1.1.100" /function="UNKNOWN, POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Catalyzes the first of the two reduction steps in the elongation cycle of fatty acid synthesis" /codon_start=1 /transl_table=11 /product="3-ketoacyl-(acyl-carrier-protein) reductase" /protein_id="YP_177905.1" /db_xref="GI:57117021" /db_xref="GeneID:887727" /translation="MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVVLTARRQEA ADEAAAQVGDRALGVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLLEQ DHARFAKIFDVNLWAPLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSPAMGMYNATKA ALIHVTKQLALELSPRIRVNAICPGVVRTRLAEALWKDHEDPLAATIALGRIGEPADI ASAVAFLVSDAASWITGETMIIDGGLLLGNALGFRAAPSTEH" misc_feature complement(3075861..3075947) /gene="fabG" /locus_tag="Rv2766c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(3076367..3076720) /locus_tag="Rv2767c" /db_xref="GeneID:887762" CDS complement(3076367..3076720) /locus_tag="Rv2767c" /function="UNKNOWN" /note="Rv2767c, (MTV002.32c), len: 117 aa (questionable ORF). Possible membrane protein, showing very weak similarity with Q9L2H7|SCC121.09 PUTATIVE METAL TRANSPORT ABC TRANSPORTER from Streptomyces coelicolor (256 aa), FASTA scores: opt: 110, E(): 1, (33.05% identity in 112 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217283.1" /db_xref="GI:15609904" /db_xref="GeneID:887762" /translation="MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRS RHLNHARDTPQMVAVAQVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYRRS PPAESGHHSNRRQAK" gene complement(3076894..3078078) /gene="PPE43" /locus_tag="Rv2768c" /db_xref="GeneID:887765" CDS complement(3076894..3078078) /gene="PPE43" /locus_tag="Rv2768c" /function="UNKNOWN" /note="Rv2768c, (MTV002.33c), len: 394 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. upstream ORF O33312|Rv2770c|MTV002.35c (402 aa), FASTA scores: opt: 1135, E(): 6.1e-51, (62.15% identity in 391 aa overlap); and P96362|Rv1039c|MTCY10G2.10 from M. tuberculosis (391 aa), FASTA scores: opt: 1721, E(): 6.8e-81, (70.35% identity in 398 aa overlap). Equivalent to AAK47157 from Mycobacterium tuberculosis strain CDC1551 (462 aa) but shorter 68 aa." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177906.1" /db_xref="GI:57117022" /db_xref="GeneID:887765" /translation="MDFGALPPEINSTRMYAGAGAAPLMAAGATWNGLAVELSTTASS VESVIMQLTTEQWLGPASMSMVVAAQPYLAWLTYTAESAAHAAAQAMASAAAFEAAFA MTVPPAEVAANRALLAALVATNVLGQNTPAIMATEAHYGEMWAQDALAMYGYAASSAA AGRLNPLITPSQTANMAGLAGQAAAVSHAAAASTVQQVGLGSLISNLPNAVMGFASPL TSAADAAGLGGIIQDIEELLGITFVQNAINGAVNTTAWFVMATIPNAVFLGHAFAALN PATVTAAADAVPAAAAAAGLAHTVTPVGVGGASLTASLGEASSVGGLSVPAGWSTAAP AMTSGTTALEGSGWAVPEEAGPVAAMPGMAGISGAAKGAGAYAGPRYGFKPIVMPKQV VV" gene complement(3078158..3078985) /gene="PE27" /locus_tag="Rv2769c" /db_xref="GeneID:888461" CDS complement(3078158..3078985) /gene="PE27" /locus_tag="Rv2769c" /function="UNKNOWN" /note="Rv2769c, (MTV002.34c), len: 275 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), highly similar to many (notably in N-terminal part) e.g. P96361|Rv1040c|MTCY10G2.09 from Mycobacterium tuberculosis (275 aa), FASTA scores: opt: 1111, E(): 5.9e-52, (68.55% identity in 283 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177907.1" /db_xref="GI:57117023" /db_xref="GeneID:888461" /translation="MSFLTTQPEELAAAAGKLETIGSAMVAQNAAAAAPTTTGVIPAA ADEISVLQAPLFTAYGTLYQQVSAEAAAVYDLFVKTLGVSAGTYAATEAANSSAAASP LSGIASILGSTPGKVPSWISDIANIFNIGAGNWASAASDLLGLASGGLLPAAEEAALE EGLEGAGLSELGAAEAAVGEAPIAAGLGAAPLAAGLSRASSIGALSVPPSWAGQANLV SSTSTLQGAGWTTAAPHGAAGTVIPGMPGLASATRSSAGFGAPRYGAKPIVVPKPAV" gene complement(3079309..3080457) /gene="PPE44" /locus_tag="Rv2770c" /db_xref="GeneID:888456" CDS complement(3079309..3080457) /gene="PPE44" /locus_tag="Rv2770c" /function="UNKNOWN. MAY BE INVOLVED IN VIRULENCE." /experiment="experimental evidence, no additional details recorded" /note="Rv2770c, (MTV002.35c), len: 382 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. downstream ORF O33310|Rv2768c|MTV002.33c from M. tuberculosis (394 aa), FASTA scores: opt: 1135, E(): 2.2e-53, (62.15% identity in 391 aa overlap); and P96362|Rv1039c|MTCY10G2.10 from Mycobacterium tuberculosis (391 aa), FASTA scores: opt: 1010, E(): 1e-46, (55.95% identity in 395 aa overlap). Equivalent to AAK47159 from Mycobacterium tuberculosis strain CDC1551 (402 aa) but shorter 20 aa. Start changed since first submission (-20 aa)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177677.1" /db_xref="GI:57117024" /db_xref="GeneID:888456" /translation="MDFGALPPEVNSARMYGGAGAADLLAAAAAWNGIAVEVSTAASS VGSVITRLSTEHWMGPASLSMAAAVQPYLVWLTCTAESSALAAAQAMASAAAFETAFA LTVPPAEVVANRALLAELTATNILGQNVSAIAATEARYGEMWAQDASAMYGYAAASAV AARLNPLTRPSHITNPAGLAHQAAAVGQAGASAFARQVGLSHLISDVADAVLSFASPV MSAADTGLEAVRQFLNLDVPLFVESAFHGLGGVADFATAAIGNMTLLADAMGTVGGAA PGGGAAAAVAHAVAPAGVGGTALTADLGNASVVGRLSVPASWSTAAPATAAGAALDGT GWAVPEEDGPIAVMPPAPGMVVAANSVGADSGPRYGVKPIVMPKHGLF" gene complement(3080581..3081033) /locus_tag="Rv2771c" /db_xref="GeneID:888434" CDS complement(3080581..3081033) /locus_tag="Rv2771c" /function="UNKNOWN" /note="Rv2771c, (MTV002.36c), len: 150 aa. Conserved hypothetical protein, equivalent to Q9CBV8|ML1525 HYPOTHETICAL PROTEIN from Mycobacterium leprae (151 aa), FASTA scores: opt: 489, E(): 1.7e-27, (52.7% identity in 148 aa overlap). Also highly similar to Q9RD46|SCF56.21 HYPOTHETICAL 15.7 KDA PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 671, E(): 2.2e-40, (67.8% identity in 146 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217287.1" /db_xref="GI:15609908" /db_xref="GeneID:888434" /translation="MRRLLIVHHTPSPHMQEMFEAVVSGATDPEIEGVEVVRRPALTV SPIEMLEADGYLLGTPANLGYISGALKHAFDVCYYLCLDTTRGRSFGAYIHGNEGTEG AERAVDAITTGLGWVQAAETVVVMGKPSKADIEACWNLGATVAAQLMG" gene complement(3081119..3081592) /locus_tag="Rv2772c" /db_xref="GeneID:888449" CDS complement(3081119..3081592) /locus_tag="Rv2772c" /function="UNKNOWN" /note="Rv2772c, (MTV002.37c), len: 157 aa. Probable conserved transmembrane protein, equivalent to Q9CBV7|ML1526 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (160 aa), FASTA scores: opt: 767, E(): 1.5e-43, (76.6% identity in 154 aa overlap); and similar to P46830|YDAB_MYCBO from Mycobacterium bovis (177 aa), FASTA scores: opt: 337, E(): 3.9e-15, (40.75% identity in 135 aa overlap). Also similar to O86837|SC9A10.04 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 338, E(): 3e-15, (43.75% identity in 144 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217288.1" /db_xref="GI:15609909" /db_xref="GeneID:888449" /translation="MTRRTLYVQLIIAFMCVAMVAYLVMLGRVAVAMIGSGRAAAAGL GLALLILPVIGLWAMIATLRAGFAYQRLARLIAEDGLDIDASALPRRASGRIQRDAAD ALFAAVRTELEDDADDWRRWYRLARAYDYAGDRRRAREAMKTALQLEGRARPGAR" gene complement(3081604..3082341) /gene="dapB" /locus_tag="Rv2773c" /db_xref="GeneID:888443" CDS complement(3081604..3082341) /gene="dapB" /locus_tag="Rv2773c" /EC_number="1.3.1.26" /function="INVOLVED IN BIOSYNTHESIS OF DIAMINOPIMELATE AND LYSINE FROM ASPARTATE SEMIALDEHYDE (AT THE SECOND STEP) [CATALYTIC ACTIVITY: 2,3,4,5-TETRAHYDRODIPICOLINATE + NAD(P)(+) = 2,3-DIHYDRODIPICOLINATE + NAD(P)H]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the reduction of 2,3-dihydrodipicolinate to 2,3,4,5-tetrahydrodipicolinate in lysine and diaminopimelate biosynthesis" /codon_start=1 /transl_table=11 /product="dihydrodipicolinate reductase" /protein_id="NP_217289.1" /db_xref="GI:15609910" /db_xref="GeneID:888443" /translation="MRVGVLGAKGKVGATMVRAVAAADDLTLSAELDAGDPLSLLTDG NTEVVIDFTHPDVVMGNLEFLIDNGIHAVVGTTGFTAERFQQVESWLVAKPNTSVLIA PNFAIGAVLSMHFAKQAARFFDSAEVIELHHPHKADAPSGTAARTAKLIAEARKGLPP NPDATSTSLPGARGADVDGIPVHAVRLAGLVAHQEVLFGTEGETLTIRHDSLDRTSFV PGVLLAVRRIAERPGLTVGLEPLLDLH" gene complement(3082352..3082756) /locus_tag="Rv2774c" /db_xref="GeneID:888439" CDS complement(3082352..3082756) /locus_tag="Rv2774c" /function="UNKNOWN" /note="Rv2774c, (MTV002.39c), len: 134 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217290.1" /db_xref="GI:15609911" /db_xref="GeneID:888439" /translation="MGTAVEVGWRDPCGLAVGELRCAPAVSDQPVVGCAGCPLVDMVD FAPVTGCVAVGSTMGAVPALLRVRFPWPPFEPDVRLSPYLALHGICRWGGSDSCDRTT VQVFHLHSINKRLTAHAGFGAAAVVGLEDGPV" gene 3082909..3083370 /locus_tag="Rv2775" /db_xref="GeneID:888432" CDS 3082909..3083370 /locus_tag="Rv2775" /function="UNKNOWN" /note="Rv2775, (MTV002.40), len: 153 aa. Hypothetical unknown protein, showing weak similarity with hypothetical proteins e.g. Q9ZBJ7|SC9C7.13c from Streptomyces coelicolor (179 aa), FASTA scores: opt: 167, E(): 0.00024, (29.05% identity in 148 aa overlap). Equivalent to AAK47164 from Mycobacterium tuberculosis strain CDC1551 (185 aa) but shorter 32 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217291.1" /db_xref="GI:15609912" /db_xref="GeneID:888432" /translation="MHYPVWRQSWTGILDPYLLDMIGSPKLWVEESYPQSLKRGGWSM WIAESGGQPIGMTMFGPDIAHPDRIQIDALYVAENSQRHGIGGRLLNRALHSHPSADM ILWCAEKNSKARGFYEKKDFHIDGRTFTWKPLSGVNVPHVGYRLYRSAPPG" gene complement(3083374..3084303) /locus_tag="Rv2776c" /db_xref="GeneID:888453" CDS complement(3083374..3084303) /locus_tag="Rv2776c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2776c, (MTV002.41c), len: 309 aa. Probable oxidoreductase (EC 1.-.-.-), similar to other oxidoreductases e.g. Q9KZ15|SC10B7.17 PUTATIVE IRON-SULFUR OXIDOREDUCTASE from Streptomyces coelicolor (364 aa), FASTA scores: opt: 846, E(): 1.2e-45, (46.75% identity in 308 aa overlap); O88034|SC5A7.28c IRON-SULFUR OXIDOREDUCTASE BETA SUBUNIT from Streptomyces coelicolor (313 aa), FASTA scores: opt: 745, E(): 2.3e-39, (41.45% identity in 316 aa overlap); P33164|PDR_BURCE|OPHA1 PHTHALATE DIOXYGENASE REDUCTASE from Burkholderia cepacia (Pseudomonas cepacia) (321 aa), FASTA scores: opt: 616, E(): 2.9e-31, (33.65% identity in 309 aa overlap); etc. Equivalent to AAK47165 from Mycobacterium tuberculosis strain CDC1551 (363 aa) but shorter 54 aa. Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature and PS00063 Aldo/keto reductase family putative active site signature. SEEMS TO BELONG TO THE 2FE2S PLANT-TYPE FERREDOXIN FAMILY IN THE C-TERMINAL SECTION." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217292.1" /db_xref="GI:15609913" /db_xref="GeneID:888453" /translation="MRRTNPAVVTKRELVAPDVVALTLADPGGGLLPAWSPGGHIDVQ LPSGRRRQYSLCGVPGRRTDYRIAIRRIADGGGGSIEMHEAFDVGDTCEFEGPRNAFH LGLAERDVLFVIGGIGVTPILPMIRAAEQRGIDWRAIYAGRGREYMPFLDEVVAVAPG RVTVWADDEHGRFASVDELLAGAGPTTAVYVCGPPGMLEAVRVARNQHADAPLHYERF SPPPVVDGVPFELELARSRRVLRVPANRSALDVMLDWDPTTAYSCQQGFCGTCKVRVL AGQVDRRGRIIEGDNEMLVCVSRAVSGRVVIDA" misc_feature complement(3083491..3083517) /locus_tag="Rv2776c" /note="PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature" misc_feature complement(3083560..3083607) /locus_tag="Rv2776c" /note="PS00063 Aldo/keto reductase family putative active site signature" gene complement(3084485..3085555) /locus_tag="Rv2777c" /db_xref="GeneID:887833" CDS complement(3084485..3085555) /locus_tag="Rv2777c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2777c, (MTV002.42c), len: 356 aa. Conserved hypothetical protein, highly similar (but longer in N-terminus) to hypothetical proteins Q9KZ16|SC10B7.16 from Streptomyces coelicolor (296 aa), FASTA scores: opt: 980, E(): 6.8e-57, (51.25% identity in 281 aa overlap); and Q9HYS0|PA3325 from Pseudomonas aeruginosa (295 aa), FASTA scores: opt: 816, E(): 4e-46, (43.75% identity in 288 aa overlap); and similar (but longer in N-terminus) to other hypothetical proteins e.g. Q9I3H1|PA1542 from Pseudomonas aeruginosa (278 aa), FASTA scores: opt: 234, E(): 6.3e-08, (31.8% identity in 258 aa overlap). Equivalent to AAK47166 from Mycobacterium tuberculosis strain CDC1551 (393 aa) but shorter 37 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217293.1" /db_xref="GI:15609914" /db_xref="GeneID:887833" /translation="MNVEVHSAPGWRAGSSPLGYAQLYLPTRDVYWGDMSGIYVNAVA TFSEGAAMVSVDDRATGPHSSESRAADHERLVLEPRDVEFDWTNLPFHYVPNEPMATH VLNVLHMLLPAGEEFFVRVFKKTLPLIKDDQLRLDVQGFIGQEAMHSQAHSGVVDHFD AQGVDVTAFTNQIRWLFEKLLGESPRRSPRRQYSWLLEQVSFIAAIEHYTAVMGEWIL NSPQLDAVGADPVMLDMLRWHGAEEVEHKAVAFDTMKHLRAGYWRQVRAQLTVTPVML LLWIRGVRFMYSVDPYLPPGTKPRWRDYFKAARRGLVPGLPRLLRVVGHYYKPGFHPS QLGGLGAAVDYLAVSPAARASH" gene complement(3085713..3086183) /locus_tag="Rv2778c" /db_xref="GeneID:887751" CDS complement(3085713..3086183) /locus_tag="Rv2778c" /function="UNKNOWN" /note="Rv2778c, (MTV002.43c), len: 156 aa. Conserved hypothetical protein, similar to Q9CBF7|ML2031 HYPOTHETICAL PROTEIN from Mycobacterium leprae (151 aa), FASTA scores: opt: 227, E(): 8.5e-09, (35.95% identity in 153 aa overlap). Also similar to AAK46204|MT1931.1 HYPOTHETICAL 17.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (158 aa), FASTA scores: opt: 238, E(): 1.5e-09, (35.75% identity in 151 aa overlap); or O07748|Rv1883c|MTCY180.35 HYPOTHETICAL 17.3 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (158 aa), FASTA scores: opt: 212, E(): 9.7e-08, (34.45% identity in 151 aa overlap); note that AAK46204|MT1931.1 and O07748|Rv1883c|MTCY180.35 are essentially the same protein except for a small (5 aa) gap." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217294.1" /db_xref="GI:15609915" /db_xref="GeneID:887751" /translation="MPDPDGPSVTVTVEIDANPDLVYGLITDLPTLASLAEEVVAMQL RKGDDVRKGAVFVGRNENGGRRWTTTCTVTDADPGRVFAFDVRSGIIPISRWQYGIVA TEHGCRVTESTWDRRPSWFRAVARMATGVKDRASVNTEHIRRTLQRLKDRAEAG" gene complement(3086215..3086754) /locus_tag="Rv2779c" /db_xref="GeneID:888479" CDS complement(3086215..3086754) /locus_tag="Rv2779c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2779c, (MTV002.44c), len: 179 aa. Possible transcriptional regulator, from the Lrp/AsnC family, similar (but longer 30 aa in N-terminus) to others e.g. CAC42842|SCBAC36F5.06 PUTATIVE ASNC-FAMILY TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (163 aa), FASTA scores: opt: 333, E(): 4.4e-16, (39.7% identity in 141 aa overlap); O07920|AZLB_BACSU TRANSCRIPTIONAL REGULATOR (ASNC FAMILY) from Bacillus subtilis; Q9I233|PA2082 PROBABLE TRANSCRIPTIONAL REGULATOR (ASNC FAMILY) from Pseudomonas aeruginosa (158 aa), FASTA scores: opt: 322, E(): 2.5e-15, (33.1% identity in 148 aa overlap); etc. Also similar to P96896|Rv3291c|MTCY71.31c from Mycobacterium tuberculosis (33.3% identity in 120 aa overlap). Equivalent to AAK47168 from Mycobacterium tuberculosis strain CDC1551 (181 aa). SEEMS TO BELONG TO THE ASNC FAMILY OF TRANSCRIPTIONAL REGULATORS. Start changed since first submission (+8 aa)." /codon_start=1 /transl_table=11 /product="LRP/AsnC family transcriptional regulator" /protein_id="NP_217295.2" /db_xref="GI:57117025" /db_xref="GeneID:888479" /translation="MIILFRGHMRDNSTEHKTRRAASSKDVRPAELDEVDRRILSLLH GDARMPNNALADTVGIAPSTCHGRVRRLVDLGVIRGFYTDIDPVAVGLPLQAMISVNL QSSARGKIRSFIQQIRRKRQVMDVYFLAGADDFILHVAARDTEDLRSFVVENLNADAD VAGTQTSLIFEHLRGAAPI" gene 3086820..3087935 /gene="ald" /locus_tag="Rv2780" /db_xref="GeneID:888493" CDS 3086820..3087935 /gene="ald" /locus_tag="Rv2780" /EC_number="1.4.1.1" /function="MAY PLAY A ROLE IN CELL WALL SYNTHESIS AS L-ALANINE IS AN IMPORTANT CONSTITUENT OF THE PEPTIDOGLYCAN LAYER [CATALYTIC ACTIVITY: L-ALANINE + H(2)O + NAD(+) = PYRUVATE + NH(3) + NADH]." /experiment="experimental evidence, no additional details recorded" /note="Rv2780, (MT2850, MTV002.45), len: 371 aa. ald, secreted L-alanine dehydrogenase (EC 1.4.1.1) (40 kd antigen); equivalent to Q9CBV6|ALD|ML1532 L-ALANINE DEHYDROGENASE from Mycobacterium leprae (371 aa), FASTA scores: opt: 2081, E(): 4e-115, (85.45% identity in 371 aa overlap). Also highly similar to others e.g. Q9S227|SCI51.13c from Streptomyces coelicolor (371 aa), FASTA scores: opt: 1575, E(): 2.3e-85, (66.05% identity in 371 aa overlap); Q9K827|BH3180 from Bacillus halodurans (371 aa), FASTA scores: opt: 1341, E(): 1.4e-71, (56.45% identity in 372 aa overlap); Q9RT70|DR1895 from Deinococcus radiodurans (390 aa), FASTA scores: opt: 1319, E(): 2.8e-70, (54.2% identity in 371 aa overlap); etc. Contains PS00836 and PS00837 Alanine dehydrogenase & pyridine nucleotide transhydrogenase signature 1 and 2." /codon_start=1 /transl_table=11 /product="secreted L-alanine dehydrogenase ALD (40 kDa antigen) (TB43)" /protein_id="NP_217296.1" /db_xref="GI:15609917" /db_xref="GeneID:888493" /translation="MRVGIPTETKNNEFRVAITPAGVAELTRRGHEVLIQAGAGEGSA ITDADFKAAGAQLVGTADQVWADADLLLKVKEPIAAEYGRLRHGQILFTFLHLAASRA CTDALLDSGTTSIAYETVQTADGALPLLAPMSEVAGRLAAQVGAYHLMRTQGGRGVLM GGVPGVEPADVVVIGAGTAGYNAARIANGMGATVTVLDINIDKLRQLDAEFCGRIHTR YSSAYELEGAVKRADLVIGAVLVPGAKAPKLVSNSLVAHMKPGAVLVDIAIDQGGCFE GSRPTTYDHPTFAVHDTLFYCVANMPASVPKTSTYALTNATMPYVLELADHGWRAACR SNPALAKGLSTHEGALLSERVATDLGVPFTEPASVLA" misc_feature 3086829..3086909 /gene="ald" /locus_tag="Rv2780" /note="PS00836 Alanine dehydrogenase & pyridine nucleotide transhydrogenase signature 1" misc_feature 3087336..3087413 /gene="ald" /locus_tag="Rv2780" /note="PS00837 Alanine dehydrogenase & pyridine nucleotide transhydrogenase signature 2" gene complement(3087950..3088984) /locus_tag="Rv2781c" /db_xref="GeneID:888488" CDS complement(3087950..3088984) /locus_tag="Rv2781c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2781c, (MTV002.46c), len: 344 aa. Possible ala-rich oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases or hypothetical proteins e.g. Q9RDD8|SCC77.20c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (364 aa), FASTA scores: opt: 912, E(): 5.3e-47, (45.55% identity in 336 aa overlap); Q9FDD4|2-NPDL PUTATIVE 2-NITROPROPANE DIOXYGENASE from Streptomyces ansochromogenes (363 aa), FASTA scores: opt: 869, E(): 1.9e-44, (44.2% identity in 337 aa overlap); O05413|YRPB 2-NITROPROPANE DIOXYGENASE from Bacillus subtilis (347 aa), FASTA scores: opt: 560, E(): 4.9e-26, (33.75% identity in 317 aa overlap); etc." /codon_start=1 /transl_table=11 /product="alanine rich oxidoreductase" /protein_id="NP_217297.1" /db_xref="GI:15609918" /db_xref="GeneID:888488" /translation="MVLGFWDIAVPIVGAPMAGGPSTPALAAAVSNAGGLGFVAGGYL SADRLADDIAAARAATTGPIGANLFVPQPSVADWAQLEYYADELEEVAEYYHTEVGQP VYGDDDDWVRKLEVVADVRPEVVSFTFGAPPPDVVQRLSALGLLVSITVTSVYEAGVA IAAGADSLVVQGPAAGGHRGTFAPDMEPGTESLHQLLDRIGSAHDVPLVAAGGLGTAE DVAAVLRRGAIAAQVGTALLLADEAGTNAAHRAALKNPEFDATLVTRAFSGRYARGLA NNFTRLLDHVAPLGYPEVHQMTKPIRAAAVQADDPHGTNLWAGSAHRKTRPGPAADII ASLTPDVCSA" gene complement(3089045..3090361) /gene="pepR" /locus_tag="Rv2782c" /db_xref="GeneID:888470" CDS complement(3089045..3090361) /gene="pepR" /locus_tag="Rv2782c" /function="UNKNOWN; POSSIBLY HYDROLYZES PEPTIDES AND/OR PROTEINS" /note="deleted EC_number 3.4.99.-; Rv2782c, (MTV002.47c), len: 438 aa. Probable pepR, protease/peptidase (EC 3.4.99.-), equivalent to O32965|YR82_MYCLE|ML0855|MLCB22.26c HYPOTHETICAL ZINC PROTEASE from Mycobacterium leprae (445 aa), FASTA scores: opt: 2346, E(): 4.3e-146, (84.3% identity in 421 aa overlap). Also highly similar to others e.g. O86835|YA12_STRCO|SC9A10.02 from Streptomyces coelicolor (459 aa), FASTA scores: opt: 1394, E(): 1.1e-83, (51.9% identity in 416 aa overlap); Q04805|YMXG_BACSU|YMXG from Bacillus subtilis (409 aa), FASTA scores: opt: 1014, E(): 7.9e-59, (37.55% identity in 410 aa overlap); Q9KA85|BH2405 from Bacillus halodurans (413 aa), FASTA scores: opt: 967, E(): 9.6e-56, (38.6% identity in 417 aa overlap); etc. Contains PS00143 Insulinase family, zinc-binding region signature. BELONGS TO PEPTIDASE FAMILY M16, ALSO KNOWN AS THE INSULINASE FAMILY. COFACTOR: REQUIRES DIVALENT CATIONS FOR ACTIVITY. BINDS ZINC." /codon_start=1 /transl_table=11 /product="zinc protease PEPR" /protein_id="NP_217298.1" /db_xref="GI:15609919" /db_xref="GeneID:888470" /translation="MPRRSPADPAAALAPRRTTLPGGLRVVTEFLPAVHSASVGVWVG VGSRDEGATVAGAAHFLEHLLFKSTPTRSAVDIAQAMDAVGGELNAFTAKEHTCYYAH VLGSDLPLAVDLVADVVLNGRCAADDVEVERDVVLEEIAMRDDDPEDALADMFLAALF GDHPVGRPVIGSAQSVSVMTRAQLQSFHLRRYTPERMVVAAAGNVDHDGLVALVREHF GSRLVRGRRPVAPRKGTGRVNGSPRLTLVSRDAEQTHVSLGIRTPGRGWEHRWALSVL HTALGGGLSSRLFQEVRETRGLAYSVYSALDLFADSGALSVYAACLPERFADVMRVTA DVLESVARDGITEAECGIAKGSLRGGLVLGLEDSSSRMSRLGRSELNYGKHRSIEHTL RQIEQVTVEEVNAVARHLLSRRYGAAVLGPHGSKRSLPQQLRAMVG" misc_feature complement(3090155..3090226) /gene="pepR" /locus_tag="Rv2782c" /note="PS00143 Insulinase family, zinc-binding region signature" gene complement(3090339..3092597) /gene="gpsI" /locus_tag="Rv2783c" /db_xref="GeneID:888467" CDS complement(3090339..3092597) /gene="gpsI" /locus_tag="Rv2783c" /EC_number="2.7.6.-" /EC_number="2.7.7.8" /function="INVOLVED IN mRNA DEGRADATION. HYDROLYSES SINGLE-STRANDED POLYRIBONUCLEOTIDES PROCESSIVELY IN THE 3' TO 5' DIRECTION. INVOLVED IN THE RNA DEGRADOSOME, A MULTI-ENZYME COMPLEX IMPORTANT IN RNA PROCESSING AND MESSENGER RNA DEGRADATION [CATALYTIC ACTIVITY: RNA(N+1) + PHOSPHATE = RNA(N) + A NUCLEOSIDE DIPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv2783c, (MTV002.48c), len: 752 aa. Probable gpsI, polyribonucleotide nucleotidyltransferase (EC 2.7.7.8; 2.7.6.-), equivalent to Q9CCF8|GPSI|ML0854 (alias O32966) PUTATIVE POLYRIBONUCLEOTIDE PHOSPHORYLASE / GUANOSINE PENTAPHOSPHATE SYNTHETASE from Mycobacterium leprae (773 aa), FASTA scores: opt: 4304, E(): 0, (89.95% identity in 757 aa overlap). Also highly similar to others e.g. O86656|GPSI GUANOSINE PENTAPHOSPHATE SYNTHETASE/ POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE (FRAGMENT) from Streptomyces coelicolor (716 aa), FASTA scores: opt: 3393, E(): 5.8e-192, (72.77% identity in 718 aa overlap); Q53597|GPSI GUANOSINE PENTAPHOSPHATE SYNTHETASE from Streptomyces antibioticus (740 aa), FASTA scores: opt: 3314, E(): 2.6e-187, (70.55% identity in 733 aa overlap); P72659|PNP|SLL1043 POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE from Synechocystis sp. strain PCC 6803 (718 aa), FASTA scores: opt: 1244, E(): 1.7e-65, (45.05% identity in 750 aa overlap); etc. Note that S. antibioticus guanosine pentaphosphate synthetase is a multifunctional enzyme that also acts as a polyribonucleotide nucleotidyltransferase. Start site chosen by homology from several alternatives." /codon_start=1 /transl_table=11 /product="polynucleotide phosphorylase/polyadenylase" /protein_id="NP_217299.1" /db_xref="GI:15609920" /db_xref="GeneID:888467" /translation="MSAAEIDEGVFETTATIDNGSFGTRTIRFETGRLALQAAGAVVA YLDDDNMLLSATTASKNPKEHFDFFPLTVDVEERMYAAGRIPGSFFRREGRPSTDAIL TCRLIDRPLRPSFVDGLRNEIQIVVTILSLDPGDLYDVLAINAASASTQLGGLPFSGP IGGVRVALIDGTWVGFPTVDQIERAVFDMVVAGRIVEGDVAIMMVEAEATENVVELVE GGAQAPTESVVAAGLEAAKPFIAALCTAQQELADAAGKSGKPTVDFPVFPDYGEDVYY SVSSVATDELAAALTIGGKAERDQRIDEIKTQVVQRLADTYEGREKEVGAALRALTKK LVRQRILTDHFRIDGRGITDIRALSAEVAVVPRAHGSALFERGETQILGVTTLDMIKM AQQIDSLGPETSKRYMHHYNFPPFSTGETGRVGSPKRREIGHGALAERALVPVLPSVE EFPYAIRQVSEALGSNGSTSMGSVCASTLALLNAGVPLKAPVAGIAMGLVSDDIQVEG AVDGVVERRFVTLTDILGAEDAFGDMDFKVAGTKDFVTALQLDTKLDGIPSQVLAGAL EQAKDARLTILEVMAEAIDRPDEMSPYAPRVTTIKVPVDKIGEVIGPKGKVINAITEE TGAQISIEDDGTVFVGATDGPSAQAAIDKINAIANPQLPTVGERFLGTVVKTTDFGAF VSLLPGRDGLVHISKLGKGKRIAKVEDVVNVGDKLRVEIADIDKRGKISLILVADEDS TAAATDAATVTS" gene complement(3092951..3093466) /gene="lppU" /locus_tag="Rv2784c" /db_xref="GeneID:888484" CDS complement(3092951..3093466) /gene="lppU" /locus_tag="Rv2784c" /function="UNKNOWN" /note="Rv2784c, (MTV002.49c), len: 171 aa. Probable lppU, lipoprotein, sharing no homology with other proteins. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LppU" /protein_id="NP_217300.1" /db_xref="GI:15609921" /db_xref="GeneID:888484" /translation="MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQ ATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGC MSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAV CVEDVTGGPRS" misc_feature complement(3093410..3093442) /gene="lppU" /locus_tag="Rv2784c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3093479..3093748) /gene="rpsO" /locus_tag="Rv2785c" /db_xref="GeneID:888455" CDS complement(3093479..3093748) /gene="rpsO" /locus_tag="Rv2785c" /function="INVOLVED IN TRANSLATION MECHANISM. THIS PROTEIN IS ONE OF THE 16S RIBOSOMAL RNA BINDING PROTEINS." /note="primary rRNA binding protein; helps nucleate assembly of 30S; binds directly to the 16S rRNA and an intersubunit bridge to the 23S rRNA; autoregulates translation through interactions with the mRNA leader sequence" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S15" /protein_id="NP_217301.1" /db_xref="GI:15609922" /db_xref="GeneID:888455" /translation="MALTAEQKKEILRSYGLHETDTGSPEAQIALLTKRIADLTEHLK VHKHDHHSRRGLLLLVGRRRRLIKYISQIDVERYRSLIERLGLRR" misc_feature complement(3093542..3093634) /gene="rpsO" /locus_tag="Rv2785c" /note="PS00362 Ribosomal protein S15 signature" gene complement(3093905..3094900) /gene="ribF" /locus_tag="Rv2786c" /gene_synonym="ribC" /db_xref="GeneID:888468" CDS complement(3093905..3094900) /gene="ribF" /locus_tag="Rv2786c" /gene_synonym="ribC" /EC_number="2.7.1.26" /EC_number="2.7.7.2" /function="INVOLVED IN FAD BIOSYNTHESIS [CATALYTIC ACTIVITY 1: ATP + RIBOFLAVIN = ADP + FMN] [CATALYTIC ACTIVITY 2: ATP + FMN = DIPHOSPHATE + FAD]." /note="catalyzes the formation of FMN from riboflavin and the formation of FAD from FMN; in Bacillus the ribC gene has both flavokinase and FAD synthetase activities" /codon_start=1 /transl_table=11 /product="bifunctional riboflavin kinase/FMN adenylyltransferase" /protein_id="NP_217302.1" /db_xref="GI:15609923" /db_xref="GeneID:888468" /translation="MRRRLAIVQRWRGQDEIPTDWGRCVLTIGVFDGVHRGHAELIAH AVKAGRARGVPAVLMTFDPHPMEVVYPGSHPAQLTTLTRRAELVQDLGIEVFLVMPFT TDFMKLTPDRFIHELLVEHLHVVEVVVGENFTFGKKAAGNVDTLRRAGERFGFAVESM SLVSEHHSNETVTFSSTYIRSCVDAGDMVAAMEALGRPHRVEGVVVRGEGRGAELGFP TANVAPPMYSAIPADGVYAAWFTVLGHGPVTGTVVPGERYQAAVSVGTNPTFSGRTRT VEAFVLDTTADLYGQHVALDFVGRIRGQKKFESVRQLVAAMGADTERARDLLSTG" gene 3095111..3096874 /locus_tag="Rv2787" /db_xref="GeneID:888288" CDS 3095111..3096874 /locus_tag="Rv2787" /function="UNKNOWN" /note="Rv2787, (MTV002.52), len: 587 aa. Conserved hypothetical ala-rich protein, equivalent to Q9CCI1|ML0798 HYPOTHETICAL PROTEIN from Mycobacterium leprae (592 aa), FASTA scores: opt: 2994, E(): 6.9e-179, (76.5% identity in 587 aa overlap); and similar in part to other proteins from Mycobacterium leprae e.g. O33082|MLCB628.11 HYPOTHETICAL 52.0 KDA PROTEIN (478 aa), FASTA scores: opt: 481, E(): 2.3e-22, (30.95% identity in 294 aa overlap). Also similar in part to O86637|SC3C3.03c HYPOTHETICAL 112.1 KDA PROTEIN from Streptomyces coelicolor (1083 aa), FASTA scores: opt: 488, E(): 1.5e-22, (28.95% identity in 297 aa overlap). And similar to other hypothetical proteins from Mycobacterium tuberculosis e.g. O06396|Rv0530|MTCY25D10.09 (405 aa), FASTA scores: opt: 625, E(): 2.2e-31, (34.05% identity in 320 aa overlap); O69740|Rv3876|MTV027.11 (666 aa), FASTA scores: opt: 453, E(): 1.6e-20, (29.2% identity in 370 aa overlap); P96217|Rv3860|MTCY01A6.08c (390 aa), FASTA scores: opt: 443, E(): 4.7e-20, (29.95% identity in 354 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217303.1" /db_xref="GI:15609924" /db_xref="GeneID:888288" /translation="MSTFRECRSMFDAAVKSYQSGDLANARAAFGRLTVENPDMSDGW LGLLACGDHHLDTLAGAHQHSEALYSETRRVGLTDGELSAVVMAPMYLGLRVWSRATI GLAYASALIIADRHDEAAATLDDPVITEDTGAAQYRQFVMATLFHKTRSWSNLLKVTE ISPPSGATDVRDEVADAVAALASTAAASLGQFQFALELAEQVSTTNPRVTADVTLTRA WCLRELGDDDAARVALSATTTGDAPRTNTTAEQAGSPQPKFRHPYDDGRDLLVARRRP PAGDGWRKAVTKMTFGRVNPEPSAKREQTDELIQRICAPLADVHKLAFVSAKGGVGKT TMTVLVGNAVARLRGDRVMAVDVDADLGDLSARFSERGGPQTNIEHFVSSQHTKRYAD VRVHTVMNKDRLEMLGAQNDPRSTYKFGPEDYGAAMQILETHCNVILLDCGTPVNGPL FSNILNDVTGLVVVASEDVRGVEGALVTLDWLGAHGFGRLLQHTVVVLNAIQKTRSLV DCGAAENQFRKRVPDFFRIPYDPHLATGLAVDFSSLKRRTRNAVLDLAGGLAQHYPAS RVRPRGEDSWKTWIETMRQVG" misc_feature 3096089..3096112 /locus_tag="Rv2787" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene 3096959..3097645 /gene="sirR" /locus_tag="Rv2788" /db_xref="GeneID:888253" CDS 3096959..3097645 /gene="sirR" /locus_tag="Rv2788" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2788, (MTV002.53), len: 228 aa. Probable sirR, transcriptional repressor, highly similar to others e.g. Q9RRF3|DR2539 PUTATIVE IRON DEPENDENT REPRESSOR from Deinococcus radiodurans (232 aa), FASTA scores: opt: 518, E(): 4.5e-26, (41.2% identity in 221 aa overlap); Q9HRU8|SIRR|VNG0536G from Halobacterium sp. strain NRC-1 (233 aa), FASTA scores: opt: 516, E(): 6.1e-26, (40.45% identity in 220 aa overlap); Q9KIJ2|SLOR REGULATOR SLOR from Streptococcus mutans (217 aa), FASTA scores: opt: 418, E(): 1.2e-19, (36.15% identity in 213 aa overlap); etc. Also some similarity to Q50495|IDER_MYCTU|MTCY05A6.32|IDER|DTXR|Rv2711|MT2784|MTC Y0 5A6.32 IRON-DEPENDENT REPRESSOR from Mycobacterium tuberculosis (230 aa), FASTA scores: opt: 266, E(): 7.1e-10, (27.6% identity in 221 aa overlap). Contains helix-turn-helix motif at aa 32-53 (Score 1327, +3.71 SD). COULD BELONG TO THE CRP/FNR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional repressor SIRR" /protein_id="NP_217304.1" /db_xref="GI:15609925" /db_xref="GeneID:888253" /translation="MRADEEPGDLSAVAQDYLKVIWTAQEWSQDKVSTKMLAERIGVS ASTASESIRKLAEQGLVDHEKYGAVTLTDSGRRAALAMVRRHRLLETFLVNELGYRWD EVHDEAEVLEHAVSDRLMARIDAKLGFPQRDPHGDPIPGADGQVPTPPARQLWACRDG DTGTVARISDADPQMLRYFASIGISLDSRLRVLARREFAGMISVAIDSADGATVDLGS PAAQAIWVVS" gene complement(3097706..3098938) /gene="fadE21" /locus_tag="Rv2789c" /db_xref="GeneID:888258" CDS complement(3097706..3098938) /gene="fadE21" /locus_tag="Rv2789c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv2789c, (MTV002.54c), len: 410 aa. Probable fadE21, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. P45857|ACDB_BACSU|MMGC from Bacillus subtilis (379 aa), FASTA scores: opt: 689, E(): 9.3e-37, (35.75% identity in 400 aa overlap); Q9K6D1|ACDA|BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 679, E(): 4.1e-36, (37.3% identity in 405 aa overlap); Q06319|ACDS_MEGEL from Megasphaera elsdenii (383 aa), FASTA scores: opt: 650, E(): 3e-34, (37.7% identity in 334 aa overlap); etc. Contains acyl-CoA dehydrogenases signature 1 (PS00072). BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE21" /protein_id="NP_217305.1" /db_xref="GI:15609926" /db_xref="GeneID:888258" /translation="MFEWSDTDLMVRDAVRQFIDKEIRPHQDALETGELSPYPIARKL FSQFGLDVLLAESVNQMLDGERAKREKRDSSGSFGLADQASMVAVLVSELAGVSIGLL STVAVSLGLGAATIMSRGTLAQQERWVPTLVTLEKIAAWAITEPDSGSDAFGGMKTHV TRDGEDYILNGHKTFITNGPYADVLVVYAKLADGEPASDWRNRPVLVFVLDAGMPGLT QGKPFKKMGMMSSPTGELFFDNVRLTPDRLLCAEGDGRDSARANFAVERLGVALMSLG IINECHRLCVDYAKTRTLWGRNIGQFQLIQLKLAKMEVARINVQNMVFQAIERLKAGK QLTLAEASAIKLYSSEAATDVAMEAVQLFGGNGYMAEYRVEQLARDAKSLMIYAGSNE VQVTHIAKGLLGEPASRA" misc_feature complement(3098477..3098515) /gene="fadE21" /locus_tag="Rv2789c" /note="PS00072 Acyl-CoA dehydrogenases signature 1" gene complement(3098964..3100169) /gene="ltp1" /locus_tag="Rv2790c" /db_xref="GeneID:888585" CDS complement(3098964..3100169) /gene="ltp1" /locus_tag="Rv2790c" /function="POSSIBLY CATALYZES THE TRANSFER OF A GREAT VARIETY OF LIPIDS BETWEEN MEMBRANES." /experiment="experimental evidence, no additional details recorded" /note="Rv2790c, (MTV002.55c), len: 401 aa. Probable ltp1, lipid-transfer protein, highly similar to many eukaryotic sterol-carrier proteins/lipid-transfer protein precursors (see Ossendorp & Wirtz 1993) e.g. O62742|SCP2 STEROL CARRIER PROTEIN X from Oryctolagus cuniculus (Rabbit) (547 aa), FASTA scores: opt: 1710, E(): 6e-102, (63.7% identity in 394 aa overlap); Q9QW19 3-OXOACYL-CoA THIOLASE HOMOLOG (FRAGMENT) from Rattus sp. (405 aa), FASTA scores: opt: 1696, E(): 3.8e-101, (63.2% identity in 394 aa overlap); P11915|NLTP_RAT|SCP2|SCP-2 NONSPECIFIC LIPID-TRANSFER PROTEIN PRECURSOR from Rattus norvegicus (Rat) (547 aa), FASTA scores: opt: 1696, E(): 4.8e-101, (63.2% identity in 394 aa overlap); P32020|NLTP_MOUSE|SCP2|SCP-2 NONSPECIFIC LIPID-TRANSFER PROTEIN PRECURSOR from Mus musculus (Mouse) (547 aa), FASTA scores: opt: 1681, E(): 4.3e-100, (62.7% identity in 394 aa overlap); etc. Contains PS00098 Thiolases acyl-enzyme intermediate signature and PS00737 Thiolases signature 2. Also similar to other M. tuberculosis proteins e.g. O06144|Rv1627c|MTCY01B2.19c (402 aa) (35.8% identity in 413 aa overlap)." /codon_start=1 /transl_table=11 /product="lipid-transfer protein" /protein_id="NP_217306.1" /db_xref="GI:15609927" /db_xref="GeneID:888585" /translation="MPNQGSSNKVYVIGVGMTKFEKPGRREGWDYPDMARESGTKALR DAGIDYREVEQGYVGYVYGESTSGQRALYELGMTGIPIVNVNNNCSTGSTALYLGAQA IRGGLADCVLALGFEKMQPGALGGGADDRESPLGRHVKALAEIDEFGFPVAPWMFGAA GREHMKKYGTTAEHFAKIGYKNHKHSVNNPYAQFQDEYTLDDILASKMISDPLTKLQC SPTSDGSAAVVLASEDYLANHNLAGRAVEIVGQAMTTDFASTFDGSARNIIGYDMTVQ AAQRVYQQSGLGPKDFGVIELHDCFSANELLLYEALGLCGPGEAPELIDDNQTTYGGR WVVNPSGGLISKGHPLGATGLAQCAELTWQLRGTAEARQVDNVTAALQHNIGLGGAAV VTAYQRAER" misc_feature complement(3099108..3099158) /gene="ltp1" /locus_tag="Rv2790c" /note="PS00737 Thiolases signature 2" misc_feature complement(3099861..3099917) /gene="ltp1" /locus_tag="Rv2790c" /note="PS00098 Thiolases acyl-enzyme intermediate signature" repeat_region 3100175..3102206 /note="IS1602, len: 2032 bp. Insertion sequence IS1602." /mobile_element="insertion sequence:IS1602" gene complement(3100202..3101581) /locus_tag="Rv2791c" /db_xref="GeneID:888281" CDS complement(3100202..3101581) /locus_tag="Rv2791c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1602." /experiment="experimental evidence, no additional details recorded" /note="Rv2791c, (MTV002.56c), len: 459 aa. Probable IS1602 transposase for IS1602 element, similar to many e.g. P95117|Rv2978c|MTCY349.09 from Mycobacterium tuberculosis (459 aa), FASTA scores: opt: 2718, E(): 6.3e-165, (86.05% identity in 459 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217307.1" /db_xref="GI:15609928" /db_xref="GeneID:888281" /translation="MAKFEIPEGWMVQAFRFTLDPTAEQARALARHFGARRKAYNWTV ATLKADIDAWQATGIQTAKPSLRVLRKRWNTVKNDVCVNIETGVVWWPECSKEAYADG IDGAVDAYWNWQNSRSGKRDGKRMGFPRFKKKGRDPDRVTFTTGAMRVEPDRRHLTLP VIGTVRTHENTRRVERLIAKGRSRVLAITVRRNGTRIDASVRVLVQRPQQPKVTDPGS RVGVDVGVRRLATVATADGAVLERVPNPRPLDAALNELRHVCRARSRCTKGSRRYRER TTEISRLHRRVNDVRTHHLHCLTTHLAKTHGRIVVEGLDAAGMLRQQGLSGARARRRG LSDAALGTPRRHLSYKTGWYGSQLVVADRWFPSSKTCHVCGHVQEIGWAEHWQCDSCS ASHQRDDCAAINLARYEDTSSVVGPVGAAVKRGADRKTRPGRAGGREARKGSSRKAAE QPRDGVQVA" gene complement(3101581..3102162) /locus_tag="Rv2792c" /db_xref="GeneID:888274" CDS complement(3101581..3102162) /locus_tag="Rv2792c" /function="PREVENTS THE COINTEGRATION OF FOREIGN DNA BEFORE INTEGRATION INTO THE CHROMOSOME." /note="Rv2792c, (MTV002.57c), len: 193 aa. Possible IS1602 resolvase, highly similar to many from Mycobacterium tuberculosis e.g. O07773|Rv0605|MTCY19H5.17c POSSIBLE RESOLVASE (202 aa), FASTA scores: opt: 1040, E(): 1.9e-62, (85.05% identity in 194 aa overlap). Contains PS00397 Site-specific recombinases active site and possible helix-turn-helix motif at aa 1-2 (Score 1687, +4.93 SD)." /codon_start=1 /transl_table=11 /product="resolvase" /protein_id="NP_217308.1" /db_xref="GI:15609929" /db_xref="GeneID:888274" /translation="MNLAVWAERNGVARVTAYRWFHAGLLPVPARKAGRLILVDDQPA DRSRRARTAVYARVSSADQKPDLDRQVARVTAWATTEQIAVDKVVTEVGSALNGHRRK FLALLRDPSVKRIVVEHRDRFCRFGSEYVEAALAAQGRELVVVDSAEVDDDLVRDMTE ILTSMCARLYGKRAAQNRAKRALAAAAEESEAA" misc_feature complement(3101974..3102000) /locus_tag="Rv2792c" /note="PS00397 Site-specific recombinases active site" gene complement(3102364..3103260) /gene="truB" /locus_tag="Rv2793c" /db_xref="GeneID:888587" CDS complement(3102364..3103260) /gene="truB" /locus_tag="Rv2793c" /EC_number="4.2.1.70" /function="FORMATION OF PSEUDOURIDINE AT POSITION 55 IN THE PSI GC LOOP OF TRANSFER RNAS [CATALYTIC ACTIVITY: URACIL + D-RIBOSE 5-PHOSPHATE = PSEUDOURIDINE 5'-PHOSPHATE + H(2)O]." /note="catalyzes isomerization of specific uridines in RNA to pseudouridine; responsible for residues in T loops of many tRNAs" /codon_start=1 /transl_table=11 /product="tRNA pseudouridine synthase B" /protein_id="NP_217309.1" /db_xref="GI:15609930" /db_xref="GeneID:888587" /translation="MSATGPGIVVIDKPAGMTSHDVVGRCRRIFATRRVGHAGTLDPM ATGVLVIGIERATKILGLLTAAPKSYAATIRLGQTTSTEDAEGQVLQSVPAKHLTIEA IDAAMERLRGEIRQVPSSVSAIKVGGRRAYRLARQGRSVQLEARPIRIDRFELLAARR RDQLIDIDVEIDCSSGTYIRALARDLGDALGVGGHVTALRRTRVGRFELDQARSLDDL AERPALSLSLDEACLLMFARRDLTAAEASAAANGRSLPAVGIDGVYAACDADGRVIAL LRDEGSRTRSVAVLRPATMHPG" gene complement(3103257..3103940) /locus_tag="Rv2794c" /db_xref="GeneID:888226" CDS complement(3103257..3103940) /locus_tag="Rv2794c" /function="UNKNOWN" /note="Rv2794c, (MTV002.59c), len: 227 aa. Conserved hypothetical protein, equivalent to Q9Z5I5|ML1547|MLCB596.23 PUTATIVE IRON-CHELATING COMPLEX SUBUNIT from Mycobacterium leprae (227 aa), FASTA scores: opt: 1248, E(): 9.1e-77, (79.75% identity in 227 aa overlap). Also highly similar to various proteins e.g. Q9F0Q6|PPTA PHOSPHOPANTETHEINYL TRANSFERASE from Streptomyces verticillus (246 aa), FASTA scores: opt: 692, E(): 2.8e-39, (46.65% identity in 225 aa overlap); O88029|SC5A7.23 HYPOTHETICAL 24.5 KDA PROTEIN from Streptomyces coelicolor (226 aa), FASTA scores: opt: 679, E(): 2e-38, (46.9% identity in 226 aa overlap); O24813 DNA FOR L-PROLINE 3-HYDROXYLASE from Streptomyces sp. (208 aa), FASTA scores: opt: 631, E(): 3.2e-35, (48.1% identity in 208 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217310.1" /db_xref="GI:15609931" /db_xref="GeneID:888226" /translation="MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARS VAKRRNEFITVRHCARIALDQLGVPPAPILKGDKGEPCWPDGMVGSLTHCAGYRGAVV GRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDRILFCAKEATY KAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGSTLSGPPLTTLRGRWSVER GLVLTAIVL" gene complement(3103937..3104911) /locus_tag="Rv2795c" /db_xref="GeneID:888492" CDS complement(3103937..3104911) /locus_tag="Rv2795c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2795c, (MTV002.60c), len: 324 aa. Conserved hypothetical protein, equivalent to Q9Z5I6|ML1548|MLCB596.22 HYPOTHETICAL 37.5 KDA PROTEIN from Mycobacterium leprae (321 aa), FASTA scores: opt: 2018, E(): 6.3e-128, (87.4% identity in 318 aa overlap). Also highly similar to O88028|SC5A7.22 HYPOTHETICAL 33.5 KDA PROTEIN from Streptomyces coelicolor (295 aa), FASTA scores: opt: 1202, E(): 3.4e-73, (57.2% identity in 285 aa overlap); and Q9AMH7|SIMX4 SIMX4 PROTEIN from Streptomyces antibioticus (293 aa), FASTA scores: opt: 1045, E(): 1.2e-62, (51.4% identity in 286 aa overlap). C-terminus highly similar to Q9F0Q7 HYPOTHETICAL 9.6 KDA PROTEIN (FRAGMENT) from Streptomyces verticillus (81 aa), FASTA scores: opt: 395, E(): 1.8e-19, (68.35% identity in 79 aa overlap). Also similar to other proteins e.g. Q9FWV7 HYPOTHETICAL 45.3 KDA PROTEIN from Oryza sativa (Rice) (402 aa), FASTA scores: opt: 294, E(): 3.6e-12, (26.45% identity in 340 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217311.1" /db_xref="GI:15609932" /db_xref="GeneID:888492" /translation="MTWKGSGQETVGAEPTLWAISDLHTGHLGNKPVAESLYPSSPDD WLIVAGDVAERTDEIRWSLDLLRRRFAKVIWVPGNHELWTTNRDPMQIFGRARYDYLV NMCDEMGVVTPEHPFPVWTERGGPATIVPMFLLYDYSFLPEGANSKAEGVAIAKERNV VATDEFLLSPEPYPTRDAWCHERVAATRARLEQLDWMQPTVLVNHFPLLRQPCDALFY PEFSLWCGTTKTADWHTRYNAVCSVYGHLHIPRTTWYDGVRFEEVSVGYPREWRRRKP YSWLRQVLPDPQYAPGYLNDFGGHFVITPEMRTQAAQFRERLRQRQSR" gene complement(3105056..3105619) /gene="lppV" /locus_tag="Rv2796c" /db_xref="GeneID:888915" CDS complement(3105056..3105619) /gene="lppV" /locus_tag="Rv2796c" /function="UNKNOWN" /note="Rv2796c, (MTV002.61c, MTCY16B7.47), len 187 aa. Probable lppV, conserved lipoprotein, similar to others from Mycobacterium tuberculosis e.g. P95009|LPPB|Rv2544|MTCY159.12c PROBABLE CONSERVED LIPOPROTEIN (220 aa), FASTA scores: opt: 168, E(): 0.00066, (22.45% identity in 196 aa overlap); and P95010|LPPA|RV2543|MTCY159.13c PROBABLE CONSERVED LIPOPROTEIN (219 aa), FASTA scores: opt: 165, E(): 0.001, (23.1% identity in 199 aa overlap)." /codon_start=1 /transl_table=11 /product="lipoprotein LppV" /protein_id="NP_217312.1" /db_xref="GI:15609933" /db_xref="GeneID:888915" /translation="MRWPTAWLLALVCVMATGCGPSGHGTRAGEEGPLSPEKVAELEN PLRAKPPLEDAKDQYRAAVTQLANAITALVPGLTWRTDMDTWTGCGGEYEWTRAKAAY FMIVFSGPIPDDKWLQAVQIVKDGVEQFGATGFGVMKNKPADHDVYFAGHGGVEFKCS TQKAAVLTAQSDCRISRTDTPKPSPTP" gene complement(3105619..3107307) /locus_tag="Rv2797c" /db_xref="GeneID:887713" CDS complement(3105619..3107307) /locus_tag="Rv2797c" /function="UNKNOWN" /note="Rv2797c, (MTCY16B7.46), len: 562 aa. Conserved hypothetical ala-rich protein. C-terminus highly similar to several mycobacterial proteins e.g. AAK46927|MT2616 HYPOTHETICAL 28.0 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 535, E(): 4.6e-22, (42.95% identity in 263 aa overlap); P95011|Rv2542|MTCY159.14c HYPOTHETICAL 42.4 KDA PROTEIN from Mycobacterium tuberculosis (403 aa), FASTA scores: opt: 537, E(): 5e-22, (40.75% identity in 292 aa overlap) (similarity in the second half of protein); P71547|Y963_MYCTU|Rv0963c|MT0992|MTCY10D7.11 HYPOTHETICAL 28.1 KDA PROTEIN (266 aa), FASTA scores: opt: 314, E(): 5.7e-10, (39.0% identity in 254 aa overlap); etc. Contains PS00120 Lipases, serine active site." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217313.1" /db_xref="GI:15609934" /db_xref="GeneID:887713" /translation="MPLTVADIDRWNAQAVREVFHAASARAEVTFEASRQLAALSIFA NSGGKTAEAAAHHNAGIRRDLDAHGNEALAVARAADRAADGIVKVQSELAALRHAAAA AELTIDALINRVVPIPGLRSTEAQWARTLAKQTELQAELDAIMAEANAVDEELASAVN MADGDAPIPADSGPPVGPEGLTPTQLASDANEERLREERARLQAHLERLQAEYDQLSV RAARDYHNGILDGDAVGRLAALTDELSAARGRLGELDAVDEALSRAPETYLTQLQIPE DPNQQVLAAVAVGNPDTAANVSVTVPGVGSTTRGALPGMVTEARDLRSEVIRQLNAAG KPASVATIAWMGYHPPPNPLDTGSAGDLWQTMTDGQAHAGAADLSRYLQQVRANNPSG HLTVLGHSYGSLTASLALQDLDAQSAHPVNDVVFYGSPGLELYSPAQLGLDHGHAYVM QAPHDLITNLVAPLAPLHGWGLDPYLTPGFTELSSQAGFDPGGIWRDGVYAHGDYPRS FLDAAGQPQLRMSGYNLAAIAAGLPDNTVGPPLLPPILGGGMPAAPGPALRGGR" misc_feature complement(3106099..3106128) /locus_tag="Rv2797c" /note="PS00120 Lipases, serine active site" gene complement(3107311..3107637) /locus_tag="Rv2798c" /db_xref="GeneID:888431" CDS complement(3107311..3107637) /locus_tag="Rv2798c" /function="UNKNOWN" /note="Rv2798c, (MTCY16B7.45), len: 108 aa. Conserved hypothetical ala-rich protein, similar to P71545|Y965_MYCTU|Rv0965c|MT0993|MTCY10D7.09 HYPOTHETICAL 14.5 KDA PROTEIN from Mycobacterium tuberculosis (139 aa), FASTA scores: opt: 198, E(): 8e-07, (38.9% identity in 90 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217314.1" /db_xref="GI:15609935" /db_xref="GeneID:888431" /translation="MFQISPEQWMHSAAQVTTQGEGLAVGHLSSDYRMQAAQFGWQGA SAMALNAKMDDWLDASRALLTRIGDHAFGLQEAAIQHAAAEAERAQALAQVGVSADVV AGPRGV" gene 3107768..3108397 /locus_tag="Rv2799" /db_xref="GeneID:887759" CDS 3107768..3108397 /locus_tag="Rv2799" /function="UNKNOWN" /note="Rv2799, (MTCY16B7.44c), len: 209 aa. Probable membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217315.1" /db_xref="GI:15609936" /db_xref="GeneID:887759" /translation="MYTPGKGPPRAGGVVFTRVRLIGGLGALTAAVVVVGTVGWQGIP PAPTGGDAVQLRSTAAPMSTTMKSPIVATTDPSPFDPCRDIPFDVIQRLGLAYTPPEA EEGLRCHFDAGNYQMAVEPIIWRTYAQTLPPDAIETTIAGHRAAQYWVRKPTYHNSFW YSSCMVTFKTSYGVIQQSLFYSTVYSEPDVDCPSTNLQRANDLVPYYRF" gene 3108416..3110065 /locus_tag="Rv2800" /db_xref="GeneID:888932" CDS 3108416..3110065 /locus_tag="Rv2800" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2800, (MTCY16B7.43c), len: 549 aa. Possible hydrolase (EC 3.-.-.-), an esterase (EC 3.1.1.-) or an acylase (EC 3.-.-.-). Similar, but longer in N-terminus, to esterases or acylases e.g. Q9L9D7|COCE COCAINE ESTERASE from Rhodococcus sp. MB1 'Bresler 1999' (574 aa), FASTA scores: opt: 510, E(): 3.1e-23, (33.6% identity in 571 aa overlap); Q9L3U2|STTE PUTATIVE ACYLASE from Streptomyces rochei (Streptomyces parvullus) (554 aa), FASTA scores: opt: 492, E(): 3.7e-22, (34.45% identity in 569 aa overlap); CAC49652|SMB21424 PUTATIVE ESTERASE OR ACYLASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (578 aa), FASTA scores: opt: 405, E(): 7.1e-17, (34.45% identity in 569 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_217316.1" /db_xref="GI:15609937" /db_xref="GeneID:888932" /translation="MSTTSARPERPKLRALTGRVGGQALGGLLGLPRATTRYTVGHVR VPMRDGVQLVADHYAPATSQPVGTLLVRGPYGRRFPFSLVFARIYAARGYHVVLQSVR GTFGSGGVFEPMVNEAADGADTVAWLREQPWFTGRFGTIGLPYLGFTQWALLHDPPPE LAAAVITVGPHDFRASVWGTGSFTVNDFLGWSDLVSHQEDPGRIRAGIRQLTAPRRVA RTAATLPLGESARTLLGTGAPWFESWVEHTDRDDPFWDRLRFPAALDRVQVPVLLVGG WQDIFLRQTLQQYRHLRDRGVHVALTVGPWTHTQMLTKGLATGARESLDWLDAHLGRA PALRPSPVRVFVTGQGWRHLPDWPPATTERAWYLQPGGRLGESAPASGTPPATFRYHP ADPTPTTGGPLLSSNGGYRDDSRLATRADVLCFTGAPLTHDLCVHGNPVVELVHSSDN PYVDVFVRVSEVDAKGRSRNVSDGYRRLGDAPELVRVELDAIAHRFRADSRIRVLIAG SWFPRYARNLGTPEPILTGRQLKPATHAVHFGRSRLLLPVG" gene complement(3110167..3110523) /locus_tag="Rv2801c" /db_xref="GeneID:888921" CDS complement(3110167..3110523) /locus_tag="Rv2801c" /function="UNKNOWN" /note="Rv2801c, (MTCY16B7.42), len: 118 aa. Conserved hypothetical protein, highly similar to Q9RWK4|DR0662 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (115 aa), FASTA scores: opt: 306, E(): 2e-15, (43.95% identity in 116 aa overlap); and similar to AAK78474|CAC0494 PEMK FAMILY OF DNA-BINDING PROTEINS from Clostridium acetobutylicum (122 aa), FASTA scores: opt: 217, E(): 7.3e-09, (33.35% identity in 117 aa overlap); P96622|YDCE YDCE PROTEIN from Bacillus subtilis (116 aa), FASTA scores: opt: 194, E(): 3.5e-07, (33.35% identity in 117 aa overlap); Q9PHH8|XFA0027 PLASMID MAINTENANCE PROTEIN from Xylella fastidiosa (108 aa), FASTA scores: opt: 188, E(): 9.1e-07, (40.85% identity in 115 aa overlap); etc. Also similar to Q10867|YJ91_MYCTU|Rv1991c|MT2046|MTCY39.28 HYPOTHETICAL 12.3 KDA PROTEIN from Mycobacterium tuberculosis (114 aa), FASTA scores: opt: 190, E(): 6.8e-07, (36.75% identity in 117 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217317.1" /db_xref="GI:15609938" /db_xref="GeneID:888921" /translation="MMRRGEIWQVDLDPARGSEANNQRPAVVVSNDRANATATRLGRG VITVVPVTSNIAKVYPFQVLLSATTTGLQVDCKAQAEQIRSIATERLLRPIGRVSAAE LAQLDEALKLHLDLWS" gene complement(3110780..3111823) /locus_tag="Rv2802c" /db_xref="GeneID:888931" CDS complement(3110780..3111823) /locus_tag="Rv2802c" /function="UNKNOWN" /note="Rv2802c, (MTCY16B7.41), len: 347 aa. Hypothetical unknown arg-, ala-rich protein. C-terminus shows some similarity with N-terminal part of hypothetical proteins Q98K84|MLR1592 from Rhizobium loti (Mesorhizobium loti) (104 aa), FASTA scores: opt: 138, E(): 0.12, (37.35% identity in 91 aa overlap); and CAC47718|SMC03294 from Rhizobium meliloti (Sinorhizobium meliloti) (114 aa), FASTA scores: opt: 128, E(): 0.53, (31.4% identity in 86 aa overlap). Equivalent to AAK47191 from Mycobacterium tuberculosis strain CDC1551 (357 aa) but shorter 10 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217318.1" /db_xref="GI:15609939" /db_xref="GeneID:888931" /translation="MARQPLEQRVARAAQAALARQRFVSAIDVLLGLGWLAPSHVDQW RQGRVDSLEQVVQANLSKITAVMAALRRWARDRGLNPSETDYVARTRDRRRLRFSVTG EDAIERAYRTHWVSPELSERAVARQSRRPDLVVIMPVNDWSCASCGGSGDLMFLEDAG PLCLDCADLGHLVFLPSGDAALTRRAKRASRLSAVVVRWSRARKRYERQGILVEAEAL ERAENECLADAEVRARRRERDEARRANEDLRLQAEFGAAIRTLFPNCPAGRAEAIARH AATRGSGRIGRSAAGRALDPEAVRLAVAASVRHIDTSFDELLMSGVDRETARHRVGEH VEEVLRDWRATSR" gene 3111822..3112289 /locus_tag="Rv2803" /db_xref="GeneID:888416" CDS 3111822..3112289 /locus_tag="Rv2803" /function="UNKNOWN" /note="Rv2803, len: 155 aa. Conserved hypothetical protein, similar to hypothetical proteins from other organisms, and with some similarity to C-terminal part of Rv0918|Z95210_12 hypothetical protein from Mycobacterium tuberculosis (158 aa), FASTA scores: opt: 204, E(): 9e-07, (42.35% identity in 85 aa overlap). Replaces original 2803c on other strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177678.1" /db_xref="GI:57117026" /db_xref="GeneID:888416" /translation="MTCPSLVGLRTEAAELSYSDQPDALGVAMRERREQQNLVRPPRR NASRRINTDQTSTKYVYITYMPETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLS AAVDHAHDLLARANRIELSEAAFRRFVAALDEPDEAAPELVRLARRKSRIPPH" gene complement(3112465..3113094) /locus_tag="Rv2804c" /db_xref="GeneID:888418" CDS complement(3112465..3113094) /locus_tag="Rv2804c" /function="UNKNOWN" /note="Rv2804c, (MTCY16B7.39), len: 209 aa. Hypothetical unknown protein, overlaps neighbouring orf Rv2805|MTCY16B7.38c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217320.1" /db_xref="GI:15609941" /db_xref="GeneID:888418" /translation="MHDHQVLAARHAHQGPHVLQQRPGFVAEAPRPKATPVDLLGRAR QPRAGQHLPRRRAAHPRGGHHRIQNLAVAPPHHRRQQQRGHSRRSIGSTSPSDDSASY SQRPRDVADPPVEASTLEGQEAVVTVELGGAVVDGVDDQGAGAVVPGTGHGSDEGIEE KIATETGALLLPVERQASEDEHWDRVGSGWPRPGRDGTRIRSMLPMASA" gene 3112867..3113271 /locus_tag="Rv2805" /db_xref="GeneID:888916" CDS 3112867..3113271 /locus_tag="Rv2805" /function="UNKNOWN" /note="Rv2805, (MTCY16B7.38c), len: 134 aa. Conserved hypothetical protein, highly similar to N-terminal region of downstream ORF P71644|Rv2807|MTCY16B7.36c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 525, E(): 6.4e-29, (78.2% identity in 101 aa overlap). Also highly similar to N-terminus of other proteins: Q9KK74 HYPOTHETICAL 47.4 KDA PROTEIN from Brevibacterium linens (418 aa), FASTA scores: opt: 480, E(): 8.8e-26, (64.15% identity in 106 aa overlap); AAK40065 Rv3128c-LIKE PROTEIN from Mycobacterium celatum (423 aa), FASTA scores: opt: 218, E(): 1.2e-07, (46.05% identity in 89 aa overlap); Q981U5|MLR9230 from Rhizobium loti (Mesorhizobium loti) (504 aa), FASTA scores: opt: 131, E(): 0.15, (29.4% identity in 126 aa overlap). Overlaps neighbouring ORF Rv2804c|MTCY16B7.39." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217321.1" /db_xref="GI:15609942" /db_xref="GeneID:888916" /translation="MGRGNGKILDPVVATTGMGRSTARQMLTGPRLPGPAEQVDGRSL RPRGFSDEARALLEHVWALMGMPCGKYLVVMHDLWLPLLTAAGDLDKPLVTEASVAEL KATALPGANRMPHWAAGTLPDGFPARAVRTRT" gene 3113268..3113459 /locus_tag="Rv2806" /db_xref="GeneID:888905" CDS 3113268..3113459 /locus_tag="Rv2806" /function="UNKNOWN" /note="Rv2806, (MTCY16B7.37c), len: 63 aa. Possible membrane protein, sharing no homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217322.1" /db_xref="GI:15609943" /db_xref="GeneID:888905" /translation="MKTNPRYGPAFYSVMTVLFLALFVLNVCTHGSTLGLISTGGLAV LMGYIGYRGWSGKRHINRQ" gene 3113658..3114812 /locus_tag="Rv2807" /db_xref="GeneID:888907" CDS 3113658..3114812 /locus_tag="Rv2807" /function="UNKNOWN" /note="Rv2807, (MTCY16B7.36c), len: 384 aa. Conserved hypothetical protein, highly similar, but shorter 35 aa, to Q9KK74 HYPOTHETICAL 47.4 KDA PROTEIN from Brevibacterium linens (418 aa), FASTA scores: opt: 1865, E(): 9.4e-116, (69.75% identity in 380 aa overlap); and with similarity with other hypothetical proteins or transposases e.g. Q981U5|MLR9230 PROTEIN from Rhizobium loti (Mesorhizobium loti) (504 aa), FASTA scores: opt: 636,, (36.05% identity in 377 aa overlap); CAC47689 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ISRM18 from Rhizobium meliloti (Sinorhizobium meliloti) (507 aa), FASTA scores: opt: 553, E(): 6.6e-29, (33.5% identity in 370 aa overlap); etc. Also similar to Rv3128c|MTCY164.38c (336 aa) (47.2% identity in 339 aa overlap); and high similarity at N-terminal region with Rv2805|MTCY16B7.38c (79.2% identity in 101 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217323.1" /db_xref="GI:15609944" /db_xref="GeneID:888907" /translation="MVSTTGMGRSTARRMLTGPGLPEPAEQVDGRRLRARGFSDDARA LLEHVWALMGMPCGKYLVVMLELWLPLEAAAGDLDKPFATEAAVAELKAMSAATVDRY LKPARERMRIKGISTTKPSPLLRNSITIHTCSDEAPKVPGVIEADTVAHCGPSLIGEF ARTLTMTDLVTGWTENASIRNNAAKWILEGIKECQQRFPFPMTVFDSDCGGEFINHDV AGWLQARDIAQTRSRPYQKNDQAHVESKNNHVVRKHAFYWRYDTGEELELLNRLWPLV SLRCNFFTPTKKPVGYTSTVNGRRKRIYDKPATPWQRLQASGVLDAQQLSTVAARIEG FNPADLTRQINAIQMQLLDLAKTKTEALATARHIDLQSLQPSINRLAKAK" gene 3115046..3115303 /locus_tag="Rv2808" /db_xref="GeneID:888944" CDS 3115046..3115303 /locus_tag="Rv2808" /function="UNKNOWN" /note="Rv2808, (MTCY16B7.35c), len: 85 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217324.1" /db_xref="GI:15609945" /db_xref="GeneID:888944" /translation="MSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDA VIDVLSDALMKTFGPDWVPNDYGLKIERAIDAYLETWPIYR" gene 3115408..3115719 /locus_tag="Rv2809" /db_xref="GeneID:888909" CDS 3115408..3115719 /locus_tag="Rv2809" /function="UNKNOWN" /note="Rv2809, (MTCY16B7.34c), len: 103 aa (questionable ORF). Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217325.1" /db_xref="GI:15609946" /db_xref="GeneID:888909" /translation="MTYAARDDTTLPKLLAQMRWVVLVDKRQLAVLLLENEGPVASAT DTLDTRGDSDYENQPVDAVERLCRRLADQAVRQWGFMQGLKQKLGPGVDVRMKLVEWN R" gene complement(3115741..3116142) /locus_tag="Rv2810c" /db_xref="GeneID:887784" CDS complement(3115741..>3116142) /locus_tag="Rv2810c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1555." /note="Rv2810c, (MTCY16B7.33), len: 133 aa. Probable transposase for IS1555, similar to C-terminal domain of transposases for defective IS1555 e.g. Q9LCS0|TNPA TRANSPOSASE from Arthrobacter sp. TM1 (435 aa), FASTA scores: opt: 294, E(): 1.8e-13, (55.1% identity in 98 aa overlap); Q50440|TNPA INSERTION ELEMENT TNPR AND TNPA GENE from Mycobacterium smegmatis (413 aa), FASTA scores: opt: 274, E(): 4.7e-12, (56.25% identity in 96 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217326.1" /db_xref="GI:15609947" /db_xref="GeneID:887784" /translation="PLRLQAHTGGPPVALRQETTGGPSPTNDLITEPPRHYKQQTRVR QAPALLTVSAGTGVPVVLEELAKLGRTLWRCRHDVLAYFDHHASNGPTEAINGRLEAL CRNALGFRNLTHYRIRSLLHCGNLAQLIHAL" repeat_region complement(3115744..3116142) /note="IS1555', len: 399 bp. Probable defective Insertion sequence element, IS1555." /mobile_element="insertion sequence:IS1555'" gene 3116139..3116747 /locus_tag="Rv2811" /db_xref="GeneID:887740" CDS 3116139..3116747 /locus_tag="Rv2811" /function="UNKNOWN" /note="Rv2811, (MTCY16B7.32c), len: 202 aa. Conserved hypothetical protein. C-terminus equivalent to C-terminus of AAK47198|MT2878 HYPOTHETICAL 17.7 KDA PROTEIN Mycobacterium tuberculosis strain CDC1551 (178 aa), FASTA scores: opt: 609, E(): 1.5e-32, (61.0% identity in 182 aa overlap); and C-terminus highly similar to P72038|Rv3771c|MTCY13D12.05c HYPOTHETICAL 11.3 KDA PROTEIN from Mycobacterium tuberculosis (108 aa), FASTA scores: opt: 465, E(): 2.8e-23, (73.6% identity in 106 aa overlap). Also some similarity with P71962|Rv2665|MTCY441.34 HYPOTHETICAL 10.5 KDA PROTEIN from Mycobacterium tuberculosis (93 aa), FASTA scores: opt: 153, E(): 0.0057, (39.05% identity in 64 aa overlap); and Q9A6W6|CC1966 HYPOTHETICAL PROTEIN CC1966 from Caulobacter crescentus (189 aa), FASTA scores: opt: 115, E(): 2.6, (39.4% identity in 104 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217327.1" /db_xref="GI:15609948" /db_xref="GeneID:887740" /translation="MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPA GPVELCPRRSRCTGCGVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIATDV ARPAETVRGWLRRFAERVEAVRSVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAA AIGRRFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHESTLP" repeat_region 3116817..3118225 /note="IS1604, len: 1409 bp. Insertion sequence IS1604." /mobile_element="insertion sequence:IS1604" gene 3116818..3118227 /locus_tag="Rv2812" /db_xref="GeneID:888942" CDS 3116818..3118227 /locus_tag="Rv2812" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1604." /note="Rv2812, (MTCY16B7.31c), len: 469 aa. Probable transposase for IS1604, similar to putative transposases and hypothetical proteins e.g. Q9EZM2|PUTATIVE TRANSPOSASE from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 329, E(): 3e-13, (27.05% identity in 362 aa overlap); CAC46499 PUTATIVE TRANSPOSASE PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (390 aa), FASTA scores: opt: 327, E(): 3.9e-13, (30.5% identity in 367 aa overlap); etc. Contains possible helix-turn-helix motif at aa 50-71 (Score 1140, +3.07 SD)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217328.1" /db_xref="GI:15609949" /db_xref="GeneID:888942" /translation="MAVGDDEEKVRAERARAIGLFRYQLIWEAADAAHSTKQRGKMVR ELASREHTDPFGRRVRISRQTIDRWIRGWRAGGFDALVPNPRQCTPRTPAEVLELAVA LRRENPQRTAAAIRRILRTQLGWAPDERTLQRNFHRLGLTGATTGSAPAVFGRFEAEH PNALWTGDVLHGIRIDLRKTYLFAFLDDHSRLVPGYRWGHAEDTVRLAAALRPALASR GVPNAVYVDNGSPYVDAWLLRACAKLGVRLVHSTPGRPQGRGKIERFFRTVREQFLVE ITGEPDVVGRHYVADLAELNRLFTAWVETVYHRSVHSETGQTPLARWSAGGPIPLPAP ETLTEAFLWEEHRRVTKTATVSLHGNRYEIDPALVGRKVELVFDPFDLTRIEVRLAGA PMRRAIPYHIGRHSHPKAKPETPTAPPKPSGIDYAQLIETAHAAELARGVNYTALTGA ADQIPGQLDLLTGQEAQPK" gene 3118224..3119036 /locus_tag="Rv2813" /db_xref="GeneID:888570" CDS 3118224..3119036 /locus_tag="Rv2813" /function="UNKNOWN" /note="Rv2813, (MTCY16B7.30c), len: 270 aa. Conserved hypothetical protein, similar to various proteins (notably secreted proteins) e.g. Q9ZFL2 HYPOTHETICAL 30.4 KDA PROTEIN from Bacillus stearothermophilus (266 aa), FASTA scores: opt: 518, E(): 1.4e-26, (33.85% identity in 266 aa overlap); P45754|GSPA_AERHY|EXEA GENERAL SECRETION PATHWAY PROTEIN from Aeromonas hydrophila (547 aa), FASTA scores: opt: 386, E(): 1.1e-17, (32.05% identity in 265 aa overlap); Q9KPC7|VC2445 GENERAL SECRETION PATHWAY PROTEIN A from Vibrio cholerae (529 aa), FASTA scores: opt: 366, E(): 2.2e-16, (31.1% identity in 270 aa overlap); Q56674|VC0403 MANNOSE-SENSITIVE HEMAGGLUTININ D from Vibrio cholerae (281 aa), FASTA scores: opt: 317, E(): 2.1e-13, (27.85% identity in 262 aa overlap); etc. Also highly similar to AAK40072 Rv2813-LIKE PROTEIN from Mycobacterium celatum (270 aa), FASTA scores: opt: 1628, E(): 2.8e-99, (90.75% identity in 270 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217329.1" /db_xref="GI:15609950" /db_xref="GeneID:888570" /translation="MMHKLISYYGFSRMPFGRDLAPGMLHRHSAHNEAVARIGWCIAD RRIGVITGEVGAGKTVAVRAALASLDRSRHTIIYLPDPTVGVQGIHHRIVASLGGQPL THHATLAPQAADALAAEQAERGRTPVVVVEEAHLLGYDQLEALRLLTNHDLDSSSPFA CLLIGQPTLRRRMKLGVLAALDQRIGLRYAMPPMTDTNTGSYLRHHLKLAGRDDALFS DDAIGLIHQTSRGYPRAVNNLALQALVAAFAADKAIVDESTTRTAIAEVTAD" misc_feature 3118377..3118400 /locus_tag="Rv2813" /note="PS00017 ATP/GTP-binding site motif A" repeat_region complement(3119185..3123576) /note="4392 bp direct repeat region" repeat_region complement(3119185..3119220) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119259..3119294) /note="36 bp direct repeat, 35 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119335..3119370) /note="36 bp direct repeat, 35 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119411..3119446) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119484..3119519) /note="36 bp direct repeat, 35 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119556..3119591) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119627..3119662) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119701..3119736) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119777..3119812) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119848..3119883) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119921..3119956) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3119995..3120030) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120068..3120103) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120141..3120176) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120213..3120248) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120285..3120320) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120359..3120394) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120433..3120468) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120504..3120523) /note="20 bp partial direct repeat, CCCCGAGAGGGGACGGAAAC, of sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3120523..3121897) /note="IS6110-11, len: 1375 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-11" gene complement(3120566..3121504) /locus_tag="Rv2814c" /db_xref="GeneID:887839" CDS complement(3120566..>3121504) /locus_tag="Rv2814c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2814c, (MTCY16B7.29), len: 312 aa. Probable transposase, highly similar to others e.g. P97137|Rv0796|MTV042.06 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS986/IS6110 from Mycobacterium tuberculosis (328 aa), FASTA scores: opt: 2103, E(): 6.1e-132, (100.0% identity in 312 aa overlap); etc. Start unlikely." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217330.1" /db_xref="GI:15609951" /db_xref="GeneID:887839" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" gene complement(3121501..3121827) /locus_tag="Rv2815c" /db_xref="GeneID:888511" CDS complement(3121501..3121827) /locus_tag="Rv2815c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS6110." /note="Rv2815c, (MTCY16B7.28), len: 108 aa. Probable transposase, identical from aa 51 with P19772|YIA2_MYCTU PUTATIVE TRANSPOSASE (INSERTION ELEMENT IS986) from Mycobacterium tuberculosis (59 aa), FASTA scores: opt: 365, E(): 1.1e-19, (96.6% identity in 59 aa overlap); and other transposases." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217331.1" /db_xref="GI:15609952" /db_xref="GeneID:888511" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region complement(3121882..3121897) /note="16 bp partial direct repeat, GTCGTCAGACCCAAAA, of sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3121938..3121973) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122013..3122048) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122086..3122121) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122158..3122193) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122230..3122265) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122303..3122338) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122375..3122410) /note="36 bp direct repeat, 32 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122436..3122471) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122513..3122548) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122585..3122620) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122661..3122696) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122738..3122773) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122811..3122846) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122882..3122917) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3122955..3122990) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123029..3123064) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123102..3123137) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123173..3123208) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123248..3123283) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123318..3123353) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123390..3123425) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123467..3123502) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" repeat_region complement(3123541..3123576) /note="36 bp direct repeat, 36 out of 36 bp identical to sequence GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC" gene complement(3123625..3123966) /locus_tag="Rv2816c" /db_xref="GeneID:888520" CDS complement(3123625..3123966) /locus_tag="Rv2816c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2816c, (MTCY16B7.27), len: 113 aa. Conserved hypothetical protein, highly similar in part to N-terminus of several proteins e.g. O28403|AF1876 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (94 aa), FASTA scores: opt: 137, E(): 0.0022, (47.55% identity in 61 aa overlap); Q97Y85|SSO8090 HYPOTHETICAL PROTEIN from Sulfolobus solfataricus (88 aa), FASTA scores: opt: 124, E(): 0.02, (37.3% identity in 59 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217332.1" /db_xref="GI:15609953" /db_xref="GeneID:888520" /translation="MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLA KILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYGRG RLVSAEEFVFF" gene complement(3123967..3124983) /locus_tag="Rv2817c" /db_xref="GeneID:888506" CDS complement(3123967..3124983) /locus_tag="Rv2817c" /function="UNKNOWN" /note="Rv2817c, (MTCY16B7.26), len: 338 aa. Conserved hypothetical protein, showing similarity with O30236|AF2435 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (322 aa), FASTA scores: opt: 397, E(): 2.4e-19, (28.2% identity in 298 aa overlap); Q9KFX9|BH0341 HYPOTHETICAL PROTEIN from Bacillus halodurans (343 aa), FASTA scores: opt: 337, E(): 2.8e-15, (27.35% identity in 300 aa overlap); Q9X2B7|TM1797 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (319 aa), FASTA scores: opt: 321, E(): 3.3e-14, (26.5% identity in 268 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217333.1" /db_xref="GI:15609954" /db_xref="GeneID:888506" /translation="MVQLYVSDSVSRISFADGRVIVWSEELGESQYPIETLDGITLFG RPTMTTPFIVEMLKRERDIQLFTTDGHYQGRISTPDVSYAPRLRQQVHRTDDPAFCLS LSKRIVSRKILNQQALIRAHTSGQDVAESIRTMKHSLAWVDRSGSLAELNGFEGNAAK AYFTALGHLVPQEFAFQGRSTRPPLDAFNSMVSLGYSLLYKNIIGAIERHSLNAYIGF LHQDSRGHATLASDLMEVWRAPIIDDTVLRLIADGVVDTRAFSKNSDTGAVFATREAT RSIARAFGNRIARTATYIKGDPHRYTFQYALDLQLQSLVRVIEAGHPSRLVDIDITSE PSGA" gene complement(3124996..3126144) /locus_tag="Rv2818c" /db_xref="GeneID:888510" CDS complement(3124996..3126144) /locus_tag="Rv2818c" /function="UNKNOWN" /note="Rv2818c, (MTCY16B7.25), len: 382 aa. Hypothetical unknown protein, equivalent to AAK47210 from Mycobacterium tuberculosis strain CDC1551 (430 aa) but shorter 48 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217334.1" /db_xref="GI:15609955" /db_xref="GeneID:888510" /translation="MLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRF DLFVPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPA RALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERANLKQLIVS YDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDTAFTYDPANKVAEYI SALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEI RCALKHPPKSPNAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSI SEDRITKDGGLLPEQLLKILARETGADLTLYDRLNDEIIRQIDMAPLG" gene complement(3126240..3127367) /locus_tag="Rv2819c" /db_xref="GeneID:888514" CDS complement(3126240..3127367) /locus_tag="Rv2819c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2819c, (MTCY16B7.23), len: 375 aa. Hypothetical unknown protein (see citations below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217335.1" /db_xref="GI:15609956" /db_xref="GeneID:888514" /translation="MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDME LLYADIPAHKRKSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSIEP RRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQ PVRVPGHQTREHRQYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDL LICQKMDMNVHGKPDGLPLFRECLAPGTSISHRVVVDTSPTARGGWREGERFLETLAE TAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGK VVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSIRRAE" gene complement(3127364..3128272) /locus_tag="Rv2820c" /db_xref="GeneID:888908" CDS complement(3127364..3128272) /locus_tag="Rv2820c" /function="UNKNOWN" /note="Rv2820c, (MTCY16B7.22), len: 302 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217336.1" /db_xref="GI:15609957" /db_xref="GeneID:888908" /translation="MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMG GQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPAAQ LGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLL ATGSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTS LPTDDELEAALAGATYRLVKRSGFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGIL DVSLGGNHPVYSYARPLFLALPESAA" gene complement(3128253..3128963) /locus_tag="Rv2821c" /db_xref="GeneID:887741" CDS complement(3128253..3128963) /locus_tag="Rv2821c" /function="UNKNOWN" /note="Rv2821c, (MTCY16B7.21), len: 236 aa. Conserved hypothetical protein, similar to several hypothetical proteins e.g. Q9X2C9|TM1809 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (247 aa), FASTA scores: opt: 318, E(): 8.2e-15, (39.45% identity in 213 aa overlap); O27152|MTH1080 CONSERVED HYPOTHETICAL PROTEIN from Methanothermobacter thermautotrophicus (245 aa), FASTA scores: opt: 294, E(): 3.9e-13, (34.8% identity in 224 aa overlap); BAB59251|TVG0114661 HYPOTHETICAL PROTEIN from Thermoplasma volcanium (229 aa), FASTA scores: opt: 252, E(): 3.3e-10, (33.8% identity in 225 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217337.1" /db_xref="GI:15609958" /db_xref="GeneID:887741" /translation="MTTSYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLS RLPMIPGTSLKGKVRTLLSRQYGADTETFYRKPNEDHAHIRRLFGDTEEYMTGRLVFR DTKLTNKDDLEARGAKTLTEVKFENAINRVTAKANLRQMERVIPGSEFAFSLVYEVSF GTPGEEQKASLPSSDEIIEDFNAIARGLKLLELDYLGGSGTRGYGQVKFSNLKARAAV GALDGSLLEKLNHELAAV" gene complement(3128973..3129347) /locus_tag="Rv2822c" /db_xref="GeneID:887778" CDS complement(3128973..3129347) /locus_tag="Rv2822c" /function="UNKNOWN" /note="Rv2822c, (MTCY16B7.20), len: 124 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217338.1" /db_xref="GI:15609959" /db_xref="GeneID:887778" /translation="MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFD EAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRDGL LRFCRYMEALAAYKKYLDPKDK" gene complement(3129344..3131773) /locus_tag="Rv2823c" /db_xref="GeneID:887735" CDS complement(3129344..3131773) /locus_tag="Rv2823c" /function="UNKNOWN" /note="Rv2823c, (MTCY16B7.19), len: 809 aa. Conserved hypothetical protein, similar in part to others e.g. Q9X2D1|TM1811Thermotoga maritima (717 aa), FASTA scores: opt: 401, E(): 3.6e-18, (27.15% identity in 773 aa overlap); O27154|MTH1082 CONSERVED HYPOTHETICAL PROTEIN from Methanothermobacter thermautotrophicus (822 aa), FASTA scores: opt: 306, E(): 6e-12, (25.55% identity in 872 aa overlap); Q59066|MJ1672 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (800 aa), FASTA scores: opt: 302, E(): 1.1e-11, (24.9% identity in 812 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217339.1" /db_xref="GI:15609960" /db_xref="GeneID:887735" /translation="MNPQLIEAIIGCLLHDIGKPVQRAALGYPGRHSAIGRAFMKKVW LRDSRNPSQFTDEVDEADIGVSDRRILDAISYHHSSALRTAAENGRLAADAPAYIAYN IAAGTDRRKADSDDGHGASTWDPDTPLYSMFNRFGSGTANLAFAPEMLDDRKPINIPS PRRIEFDKDRYAAIVNKLKAILVDLERSDTYLASLLNVLEATLSFVPSSTDASEVVDV SLFDHLKLTGALGACIWHYLQATGQSDFKSALFDKQDTFYNEKAFLLTTFDVSGIQDF IYTIHSSGAAKMLRARSFYLEMLTEHLIDELLARVGLSRANLNYSGGGHAYLLLPNTE SARKSVEQFEREANDWLLENFATRLFIATGSVPLAANDLMRRPNESASQASNRALRYS GLYRELSEQLSAKKLARYSADQLRELNSRDHDGQKGDRECSVCHTVNRTVSADDEPKC SLCQALTAASSQIQSESRRFLLISDGATKGLPLPFGATLTFCSRADADKALQQPQTRR RYAKNKFFAGECLGTGLWVGDYVAQMEFGDYVKRASGIARLGVLRLDVDNLGQAFTHG FMEQGNGKFNTISRTAAFSRMLSLFFRQHINYVLARPKLRPITGDDPARPREATIIYS GGDDVFVVGAWDDVIEFGIELRERFHEFTQGKLTVSAGIGMFPDKYPISVMAREVGDL EDAAKSLPGKNGVALFDREFTFGWDELLSKVIEEKYRHIADYFSGNEERGMAFIYKLL ELLAERDDRITKARWVYFLTRMRNPTGDTAPFQQFANRLHQWFQDPTDAKQLKTALHL YIYRTRKEESE" gene complement(3131770..3132714) /locus_tag="Rv2824c" /db_xref="GeneID:888945" CDS complement(3131770..3132714) /locus_tag="Rv2824c" /function="UNKNOWN" /note="Rv2824c, (MTCY16B7.18), len: 314 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217340.1" /db_xref="GI:15609961" /db_xref="GeneID:888945" /translation="MAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVG FSHRGDRRMTEHLSRLTLTLEVDAPLERARVATLGPHLHGVLMESIPADYVQTLHTVP VNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRS LEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAI VDGEEPDPGLIAEFGQSVRLSAFRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYI AALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP" gene complement(3132892..3133539) /locus_tag="Rv2825c" /db_xref="GeneID:887769" CDS complement(3132892..3133539) /locus_tag="Rv2825c" /function="UNKNOWN" /note="Rv2825c, (MTCY16B7.17), len: 215 aa. Conserved hypothetical protein, similar to Q9RY53|DR0097 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (189 aa), FASTA scores: opt: 261, E(): 8e-11, (33.5% identity in 176 aa overlap); and shows some similarity with N-terminus of O27278|MTH1210 MRR RESTRICTION SYSTEM RELATED PROTEIN from Methanothermobacter thermautotrophicus (340 aa), FASTA scores: opt: 133, E(): 0.091, (28.55% identity in 112 aa overlap). Equivalent to AAK47217 from Mycobacterium tuberculosis strain CDC1551 (246 aa) but shorter 31 aa; and equivalent to upstream ORF P71624|Rv2828c|MTCY16B7.14 from Mycobacterium tuberculosis strain H37Rv (alias AAK47221 from strain CDC1551) (181 aa), FASTA scores: opt: 1169, E(): 8.5e-74, (98.35% identity in 181 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217341.1" /db_xref="GI:15609962" /db_xref="GeneID:887769" /translation="MKLPGAKRLGDDRRPLGTLRCWRHSDIGPARGIVVTPALKEWSA AVHALLDGRQTVLLRKGGIGEKRFEVAAHEFLLFPTVAHSHAERVRPEHRDLLGPAAA DSTDECVLLRAAAKVVAALPVNRPEGLDAIEDLHIWTAESVRADRLDFRPKHKLAVLV VSAIPLAEPVRLARRPEYGGCTSWVQLPVTPTLAAPVHDEAALAEVAARVREAVG" gene complement(3133709..3134593) /locus_tag="Rv2826c" /db_xref="GeneID:887742" CDS complement(3133709..3134593) /locus_tag="Rv2826c" /function="UNKNOWN" /note="Rv2826c, (MTCY16B7.16), len: 294 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217342.1" /db_xref="GI:15609963" /db_xref="GeneID:887742" /translation="MAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGD NRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQST RGDGRHWQLRVRHTELGEPRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLP VVAEAEACAEKLARYRRVALARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRG TRPLRVEDVLAARSEHDFQPDSIGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAA CDERHRREVENALAVLRS" gene complement(3134596..3135483) /locus_tag="Rv2827c" /db_xref="GeneID:887707" CDS complement(3134596..3135483) /locus_tag="Rv2827c" /function="UNKNOWN" /note="Rv2827c, (MTCY16B7.15), len: 295 aa. Hypothetical unknown protein, equivalent to AAK47219 from Mycobacterium tuberculosis strain CDC1551 (315 aa) but shorter 20 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217343.1" /db_xref="GI:15609964" /db_xref="GeneID:887707" /translation="MVSPAGADRRIPTWASRVVSGLARDRPVVVTKEDLTQRLTEAGC GRDPDSAIRELRRIGWLVQLPVKGTWAFIPPGEAAISDPYLPLRSWLARDQNAGFMLA GASAAWHLGYLDRQPDGRIPIWLPPAKRLPDGLASYVSVVRIPWNAADTALLAPRPAL LVRRRLDLVAWATGLPALGPEALLVQIATRPASFGPWADLVPHLDDLVADCSDERLER LLSGRPTSAWQRASYLLDSGGEPARGQALLAKRHTEVMPVTRFTTAHSRDRGESVWAP EYQLVDELVVPLLRVIGKA" gene complement(3135788..3136333) /locus_tag="Rv2828c" /db_xref="GeneID:887749" CDS complement(3135788..3136333) /locus_tag="Rv2828c" /function="UNKNOWN" /note="Rv2828c, (MTCY16B7.14), len: 181 aa. Conserved hypothetical protein, similar to Q9RY53|DR0097 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (189 aa), FASTA scores: opt: 267, E(): 1.9e-11, (34.1% identity in 176 aa overlap); and shows some similarity with N-terminus of O27278|MTH1210 MRR RESTRICTION SYSTEM RELATED PROTEIN from Methanothermobacter thermautotrophicus (340 aa), FASTA scores: opt: 133, E(): 0.07, (28.55% identity in 112 aa overlap). Also equivalent to downstream ORF P71627|Rv2825c|MTCY16B7.17 from Mycobacterium tuberculosis strain H37Rv (alias AAK47217 from strain CDC1551, 246 aa) (215 aa), FASTA scores: opt: 1173, E(): 8.3e-75, (98.9% identity in 181 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217344.1" /db_xref="GI:15609965" /db_xref="GeneID:887749" /translation="MTPALKEWSAAVHALLDGRQTVLLRKGGIGEKRFEVAAHEFLLF PTVAHSHAERVRPEHRDLLGPAAADSTDECVLLRAAAKVVAALPVNRPEGLDAIEDLH IWTAESVRADRLDFRPKHRLAVLVVSAIPLAEPVRLARTPEYGGCTSWVQLPVTPTLA APVHDEAALAEVAARVREAVG" gene complement(3136620..3137012) /locus_tag="Rv2829c" /db_xref="GeneID:887730" CDS complement(3136620..3137012) /locus_tag="Rv2829c" /function="UNKNOWN" /note="Rv2829c, (MTCY16B7.13), len: 130 aa. Conserved hypothetical protein similar to AAK65872|SMA2253 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (125 aa), FASTA scores: opt: 171, E(): 7.7e-05, (34.9% identity in 129 aa overlap); and shows some similarity with other proteins e.g. Q9AH69 HYPOTHETICAL 14.7 KDA PROTEIN from Neisseria meningitidis (128 aa), FASTA scores: opt: 148, E(): 0.0031, (28.1% identity in 121 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217345.1" /db_xref="GI:15609966" /db_xref="GeneID:887730" /translation="MTTVLLDSHVAYWWSAEPQRLSMAASQAIEHADELAVAAISWFE LAWLAEQERIQLAIPVLSWLQQLAEHVRTVGITPSVAATAVALPSSFPGDPADRLIYA TAIEHGWRLVTKDRRLRSHRHPRPVTVW" gene complement(3137009..3137224) /locus_tag="Rv2830c" /db_xref="GeneID:888537" CDS complement(3137009..3137224) /locus_tag="Rv2830c" /function="UNKNOWN" /note="Rv2830c, (MTCY16B7.12), len: 71 aa. Hypothetical protein, some similarity to Z97182|MTCY19H5.26|Rv0596c Hypothetical protein from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 88, E(): 1.3, (41.7% identity in 36 aa overlap); and to PHD_BPP1|Q06253 bacteriophage P1 phd gene (73 aa), FASTA scores: opt: 79, E(): 3.8, (35.9% identity in 39 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217346.1" /db_xref="GI:15609967" /db_xref="GeneID:888537" /translation="MTATEVKAKILSLLDEVAQGEEIEITKHGRTVARLVAATGPHAL KGRFSGVAMAAADDDELFTTGVSWNVS" gene 3137271..3138020 /gene="echA16" /locus_tag="Rv2831" /db_xref="GeneID:888519" CDS 3137271..3138020 /gene="echA16" /locus_tag="Rv2831" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_217347.1" /db_xref="GI:15609968" /db_xref="GeneID:888519" /translation="MTDDILLIDTDERVRTLTLNRPQSRNALSAALRDRFFAALADAE ADDDIDVVILTGADPVFCAGLDLKELAGQTALPDISPRWPAMTKPVIGAINGAAVTGG LELALYCDILIASEHARFADTHARVGLLPTWGLSVRLPQKVGIGLARRMSLTGDYLSA TDALRAGLVTEVVAHDQLLPTARRVAASIVGNNQNAVRALLASYHRIDESQTAAGLWL EACAAKQFRTSGDTIAANREAVLQRGRAQVR" gene complement(3138099..3139181) /gene="ugpC" /locus_tag="Rv2832c" /db_xref="GeneID:888554" CDS complement(3138099..3139181) /gene="ugpC" /locus_tag="Rv2832c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF Sn-GLYCEROL-3-PHOSPHATE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv2832c, (MTCY16B7.10), len: 360 aa. Probable ugpC, Sn-glycerol-3-phosphate transport ATP-binding protein ABC transporter (see Braibant et al., 2000), similar to others: CAC48805 PROBABLE GLYCEROL-3-PHOSPHATE ABC TRANSPORTER ATP-BINDING PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (349 aa), FASTA scores: opt: 1018, E(): 4.1e-53, (48.6% identity in 356 aa overlap); Q98G42|MLL3499|UGPC SN-GLYCEROL-3-PHOSPHATE TRANSPORT ATP-BINDING PROTEIN from Rhizobium loti (Mesorhizobium loti) (366 aa), FASTA scores: opt: 1016, E(): 5.6e-53, (48.5% identity in 367 aa overlap). But also highly similar to many msiK proteins, ABC transporter ATP-binding proteins possibly involved in transport of cellolbiose and maltose (see Schlosser et al., 1997) e.g. P96483|MSIK MSIK PROTEIN from Streptomyces reticuli (377 aa), FASTA scores: opt: 1277, E(): 1.9e-68, (58.05% identity in 379 aa overlap); Q9L0Q1|MSIK ABC TRANSPORTER ATP-BINDING PROTEIN from Streptomyces coelicolor (378 aa), FASTA scores: opt: 1276, E(): 2.1e-68, (57.65% identity in 380 aa overlap); Q54333|MSIK from Streptomyces lividans (314 aa), FASTA scores: opt: 1217, E(): 5.9e-65, (63.7% identity in 292 aa overlap); and other ABC-TYPE SUGAR TRANSPORT PROTEINS. Also highly similar to O53482|Rv2038c|MTV018.25c ABC-TYPE SUGAR TRANSPORT PROTEIN from Mycobacterium tuberculosis (357 aa), FASTA scores: opt: 1248, E(): 9.4e-67, (56.8% identity in 354 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONG TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="sn-glycerol-3-phosphate transport ATP-binding protein ABC transporter UGPC" /protein_id="NP_217348.1" /db_xref="GI:15609969" /db_xref="GeneID:888554" /translation="MANVQYSAVTQRYPGADAPTVDNLDLDIADGEFLVLVGPSGCGK STTLRVLAGLEPIESGRISIGDVDVTHLPPRARDVAMVFQNYALYPNMTVAANMGFAL RNAGMSRADTRRRVLEVADMLELTDLLDRKPAKLSGGQRQRVAMGRAIVRRPRVFCMD EPLSNLDAKLRVSTRSQISGLQRRLGTTTVYVTHDQVEAMTMGDRVAVLKDGVLQQVD TPRALYDDPVNTFVATFIGAPAMNLIDAAVAHGVVRAPDLAIPVPDPAAERVLVGVRP ESWDVASIGTPGSLTVHVELVEELGFESFVYATPVDQRGWSSRAPRIVFRTDRRTAVR VGESLAIVPHSQEVRLFNSRTETRLR" misc_feature complement(3138732..3138776) /gene="ugpC" /locus_tag="Rv2832c" /note="PS00211 ABC transporters family signature" misc_feature complement(3139047..3139070) /gene="ugpC" /locus_tag="Rv2832c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3139174..3140484) /gene="ugpB" /locus_tag="Rv2833c" /db_xref="GeneID:888199" CDS complement(3139174..3140484) /gene="ugpB" /locus_tag="Rv2833c" /function="INVOLVED IN ACTIVE TRANSPORT OF Sn-GLYCEROL-3-PHOSPHATE AND GLYCEROPHOSPHORYL DIESTERS ACROSS THE MEMBRANE (IMPORT). Sn-GLYCEROL-3-PHOSPHATE AND GLYCEROPHOSPHORYL DIESTERS - BINDING PROTEIN INTERACTS WITH THE BINDING PROTEIN-DEPENDENT TRANSPORT SYSTEM UGPACE." /note="Rv2833c, (MTCY16B7.09), len: 436 aa. Probable ugpB, Sn-glycerol-3-phosphate binding lipoprotein component of Sn-glycerol-3-phosphate transport system (see citation below), similar to various transporters substrate-binding periplasmic proteins e.g. Q9KDY2|BH1079 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER (GLYCEROL-3-PHOSPHATE BINDING PROTEIN) from Bacillus halodurans (459 aa), FASTA scores: opt: 357, E(): 3.1e-14, (23.4% identity in 406 aa overlap); P72397|MALE PUTATIVE MALTOSE-BINDING PROTEIN from Streptomyces coelicolor (423 aa), FASTA scores: opt: 318, E(): 7e-12, (23.7% identity in 430 aa overlap); AAK78409|CAC0429 GLYCEROL-3-PHOSPHATE ABC-TRANSPORTER PERIPLASMIC COMPONENT from Clostridium acetobutylicum (447 aa), FASTA scores: opt: 305, E(): 4.5e-11, (27.15% identity in 438 aa overlap); P10904|UGPB_ECOLI|B3453 GLYCEROL-3-PHOSPHATE-BINDING PERIPLASMIC PROTEIN PRECURSOR from Escherichia coli strain K12 (438 aa); etc. Contains signal sequence and appropriately positioned prokaryotic lipoprotein attachment site (PS00013)." /codon_start=1 /transl_table=11 /product="sn-glycerol-3-phosphate-binding lipoprotein UGPB" /protein_id="NP_217349.1" /db_xref="GI:15609970" /db_xref="GeneID:888199" /translation="MDPLNRRQFLALAAAAAGVTAGCAGMGGGGSVKSGSGPIDFWSS HPGQSSAAERELIGRFQDRFPTLSVKLIDAGKDYDEVAQKFNAALIGTDVPDVVLLDD RWWFHFALSGVLTALDDLFGQVGVDTTDYVDSLLADYEFNGRHYAVPYARSTPLFYYN KAAWQQAGLPDRGPQSWSEFDEWGPELQRVVGAGRSAHGWANADLISWTFQGPNWAFG GAYSDKWTLTLTEPATIAAGNFYRNSIHGKGYAAVANDIANEFATGILASAVASTGSL AGITASARFDFGAAPLPTGPDAAPACPTGGAGLAIPAKLSEERKVNALKFIAFVTNPT NTAYFSQQTGYLPVRKSAVDDASERHYLADNPRARVALDQLPHTRTQDYARVFLPGGD RIISAGLESIGLRGADVTKTFTNIQKRLQVILDRQIMRKLAGHG" gene complement(3140487..3141314) /gene="ugpE" /locus_tag="Rv2834c" /db_xref="GeneID:888534" CDS complement(3140487..3141314) /gene="ugpE" /locus_tag="Rv2834c" /function="INVOLVED IN ACTIVE TRANSPORT OF Sn-GLYCEROL-3-PHOSPHATE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2834c, (MTCY16B7.08), len: 275 aa. Probable ugpE, Sn-glycerol-3-phosphate transport integral membrane protein ABC transporter (see citation below), similar to various permeases e.g. Q9KDY3|BH1078 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER from Bacillus halodurans (270 aa), FASTA scores: opt: 620, E(): 4.3e-32, (34.7% identity in 268 aa overlap); Q9X0K6|TM1122 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER PERMEASE PROTEIN from Thermotoga maritima (276 aa), FASTA scores: opt: 605, E(): 3.9e-31, (32.5% identity in 274 aa overlap); AAG58557|UGPE SN-GLYCEROL 3-PHOSPHATE TRANSPORT SYSTEM (INTEGRAL MEMBRANE PROTEIN) from Escherichia coli strain O157:H7 and EDL933 (281 aa), FASTA scores: opt: 574, E(): 3.7e-29, (32.95% identity in 264 aa overlap); P10906|UGPE_ECOLI|B3451 SN-GLYCEROL-3-PHOSPHATE TRANSPORT SYSTEM PERMEASE PROTEIN from Escherichia coli strain K12 (281 aa), FASTA scores: opt: 569, E(): 7.6e-29, (32.6% identity in 264 aa overlap); etc. Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="sn-glycerol-3-phosphate transport integral membrane protein ABC transporter UGPE" /protein_id="NP_217350.1" /db_xref="GI:15609971" /db_xref="GeneID:888534" /translation="MTPDRLRSSVGYAAMLLVVTLIAGPLLFVFFTSFKDQPDIYAQP TSWWPLRWYPQNYRTATEQIPFWTFLRNSLIITSVLAVVKFTLGVLSAFGLVFVRFPG RTAVFLVIIAALMVPNQITVISNYALISHLGLRNTFAGIILPLAGVAFGTFLMRNHFL SLPAEIIEAARMDGARWWQLLLRVVLPMSRPTMVAVGVITVVNEWNEYLWPFLMSDDE SVAPLPIGLTFLQQAEGVTNWGPVMAVTLLAMLPILLVFIALQRQMIKGLTSGAVKG" misc_feature complement(3140754..3140840) /gene="ugpE" /locus_tag="Rv2834c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene complement(3141311..3142222) /gene="ugpA" /locus_tag="Rv2835c" /db_xref="GeneID:888569" CDS complement(3141311..3142222) /gene="ugpA" /locus_tag="Rv2835c" /function="INVOLVED IN ACTIVE TRANSPORT OF Sn-GLYCEROL-3-PHOSPHATE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2835c, (MTCY1B7.07), len: 303 aa. Probable ugpA, Sn-glycerol-3-phosphate transport integral membrane protein ABC transporter (see citation below), similar to various permeases e.g. Q9RK71|SCF11.19 PROBABLE SUGAR TRANSPORTER INNER MEMBRANE PROTEIN from Streptomyces coelicolor (316 aa), FASTA scores: opt: 643, E(): 3.1e-35, (38.85% identity in 291 aa overlap); Q9KDY4|BH1077 GLYCEROL-3-PHOSPHATE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (315 aa), FASTA scores: opt: 548, E(): 6.2e-29, (31.5% identity in 295 aa overlap); AAK78407|CAC0427 GLYCEROL-3-PHOSPHATE ABC-TRANSPORTER, PERMEASE COMPONENT from Clostridium acetobutylicum (304 aa), FASTA scores: opt: 538, E(): 2.8e-28, (29.1% identity in 292 aa overlap); etc. Contains PS00062 Aldo/keto reductase family signature 2, and PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="sn-glycerol-3-phosphate transport integral membrane protein ABC transporter UGPA" /protein_id="NP_217351.1" /db_xref="GI:15609972" /db_xref="GeneID:888569" /translation="MAAPQRARLRSSKERVRDYALFVVLVGPNVALLLLFVYRPLADN IRLSFFDWNVSDPSARFVGLSNYTEWFTRSDTRQIVFNTAVFTGAAVVGSMVLGLALA MLLDRPLRGRNLVRSTVFAPFVISGAAVGLAAQFVFDPHFGLIQDLLRRIGVGVPDFY QDARWALFMVTITYVWKNLGYTFVIYLAALQGVRRDLLEAAEIDGASRWAVFRRVLLP QLRPTTFFLSITVLINSLQVFDVINVMTRGGPEGTGTTTMVYQVYVETFRNFRAGYGA TVATIMFLVLLAVTYYQVRVMDRGQRQ" misc_feature complement(3141569..3141655) /gene="ugpA" /locus_tag="Rv2835c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" misc_feature complement(3142022..3142075) /gene="ugpA" /locus_tag="Rv2835c" /note="PS00062 Aldo/keto reductase family signature 2" gene complement(3142309..3143628) /gene="dinF" /locus_tag="Rv2836c" /db_xref="GeneID:888547" CDS complement(3142309..3143628) /gene="dinF" /locus_tag="Rv2836c" /function="UNKNOWN; INDUCTION BY DNA DAMAGE." /note="Rv2836c, (MTCY16B7.06), len: 439 aa. Possible dinF, DNA-damage-inducible protein F, integral membrane protein, similar to others e.g. BAB38450|ECS5027|AAG59243 from Escherichia coli strain O157:H7 (459 aa), FASTA scores: opt: 501, E(): 2.7e-21, (29.55% identity in 443 aa overlap); P28303|DINF_ECOLI|B4044 from Escherichia coli strain K12 (459 aa), FASTA scores: opt: 491, E(): 1e-20, (29.35% identity in 443 aa overlap); Q98B90|MLR5680 from Rhizobium loti (Mesorhizobium loti) (471 aa), FASTA scores: opt: 466, E(): 2.7e-19, (30.7% identity in 433 aa overlap); etc. But also similar or highly similar to other hypothetical proteins e.g. Q9X8U6|SCH24.32c HYPOTHETICAL 46.3 KDA PROTEIN from Streptomyces coelicolor (448 aa), FASTA scores: opt: 981, E(): 1.1e-48, (42.35% identity in 437 aa overlap). Contains PS00213 Lipocalin signature." /codon_start=1 /transl_table=11 /product="DNA-damage-inducible protein F" /protein_id="NP_217352.1" /db_xref="GI:15609973" /db_xref="GeneID:888547" /translation="MSQVGHRAGGRQIAQLALPALGVLAAEPLYLLFDIAVVGRLGAI SLAGLAIGSLVLGLVGSQATFLSYGTTARAARRYGAGNRVAAVTEGVQATWLALGLGA LVVVVVEATATPLVSAIASGDGITAAALPWLRIAILGTPAILVSLAGNGWLRGVQDTV RPLRYVVAGFGSSALLCPLLVYGWLGLPRWGLTGSAVANLVGQWLAALLFAGALLAER VSLRPDRAVLGAQLMMARDLIVRTLAFQVCYVSAAAVAARFGAAALAAHQVVLQLWGL LALVLDSLAIAAQSLVGAALGAGDAGHAKAVAWRVTAFSLLAAGILAAALGLGSSVLP GLFTDDRSVLAAIGVPWWFMVVQLPFAGIVFAVDGVLLGAGDAAFMRTATVASALVGF LPLVWLSLAYGWGLAGIWSGLGTFIVLRLIFVGWRAYSGRWAVTGAA" misc_feature complement(3143008..3143049) /gene="dinF" /locus_tag="Rv2836c" /note="PS00213 Lipocalin signature" gene complement(3143635..3144645) /locus_tag="Rv2837c" /db_xref="GeneID:888920" CDS complement(3143635..3144645) /locus_tag="Rv2837c" /function="UNKNOWN" /note="Rv2837c, (MTCY16B7.05), len: 336 aa. Conserved hypothetical protein, showing some similarity with other proteins e.g. O67552|AQ_1630 HYPOTHETICAL 36.2 KDA PROTEIN from Aquifex aeolicus (325 aa), FASTA scores: opt: 498, E(): 3.6e-25, (32.8% identity in 314 aa overlap); Q9X1T1|TM1595 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (333 aa), FASTA scores: opt: 482, E(): 4.1e-24, (34.85% identity in 304 aa overlap); Q9RW43|DR0826 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (338 aa), FASTA scores: opt: 444, E(): 1.3e-21, (33.85% identity in 331 aa overlap); etc. Equivalent to AAK47229 from Mycobacterium tuberculosis strain CDC1551 (316 aa) but longer 20 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217353.1" /db_xref="GI:15609974" /db_xref="GeneID:888920" /translation="MTTIDPRSELVDGRRRAGARVDAVGAAALLSAAARVGVVCHVHP DADTIGAGLALALVLDGCGKRVEVSFAAPATLPESLRSLPGCHLLVRPEVMRRDVDLV VTVDIPSVDRLGALGDLTDSGRELLVIDHHASNDLFGTANFIDPSADSTTTMVAEILD AWGKPIDPRVAHCIYAGLATDTGSFRWASVRGYRLAARLVEIGVDNATVSRTLMDSHP FTWLPLLSRVLGSAQLVSEAVGGRGLVYVVVDNREWVAARSEEVESIVDIVRTTQQAE VAAVFKEVEPHRWSVSMRAKTVNLAAVASGFGGGGHRLAAGYTTTGSIDDAVASLRAA LG" gene complement(3144620..3145171) /gene="rbfA" /locus_tag="Rv2838c" /db_xref="GeneID:888923" CDS complement(3144620..3145171) /gene="rbfA" /locus_tag="Rv2838c" /function="ASSOCIATES WITH FREE 30S RIBOSOMAL SUBUNITS (BUT NOT WITH 30S SUBUNITS THAT ARE PART OF 70S RIBOSOMES OR POLYSOMES). ESSENTIAL FOR EFFICIENT PROCESSING OF 16S RRNA. MAY INTERACT WITH THE 5'TERMINAL HELIX REGION OF 16S SRNA." /note="associates with free 30S ribosomal subunits; essential for efficient processing of 16S rRNA; in Escherichia coli rbfA is induced by cold shock" /codon_start=1 /transl_table=11 /product="ribosome-binding factor A" /protein_id="NP_217354.1" /db_xref="GI:15609975" /db_xref="GeneID:888923" /translation="MADAARARRLAKRIAAIVASAIEYEIKDPGLAGVTITDAKVTAD LHDATVYYTVMGRTLHDEPNCAGAAAALERAKGVLRTKVGAGTGVRFTPTLTFTLDTI SDSVHRMDELLARARAADADLARVRVGAKPAGEADPYRDNGSVAQSPAPGGLGIRTSD GPEAVEAPLTCGGDTGDDDRPKE" gene complement(3145171..3147873) /gene="infB" /locus_tag="Rv2839c" /db_xref="GeneID:888157" CDS complement(3145171..3147873) /gene="infB" /locus_tag="Rv2839c" /function="IF-2, ONE OF THE ESSENTIAL COMPONENTS FOR THE INITIATION OF PROTEIN SYNTHESIS IN VITRO, PROTECTS FORMYLMETHIONYL-TRNA FROM SPONTANEOUS HYDROLYSIS AND PROMOTES ITS BINDING TO THE 30S RIBOSOMAL SUBUNITS. IT IS ALSO INVOLVED IN THE HYDROLYSIS OF GTP DURING THE FORMATION OF THE 70S RIBOSOMAL COMPLEX." /experiment="experimental evidence, no additional details recorded" /note="Protects formylmethionyl-tRNA from spontaneous hydrolysis and promotes its binding to the 30S ribosomal subunits during initiation of protein synthesis. Also involved in the hydrolysis of GTP during the formation of the 70S ribosomal complex" /codon_start=1 /transl_table=11 /product="translation initiation factor IF-2" /protein_id="NP_217355.1" /db_xref="GI:15609976" /db_xref="GeneID:888157" /translation="MAAGKARVHELAKELGVTSKEVLARLSEQGEFVKSASSTVEAPV ARRLRESFGGSKPAPAKGTAKSPGKGPDKSLDKALDAAIDMAAGNGKATAAPAKAADS GGAAIVSPTTPAAPEPPTAVPPSPQAPHPGMAPGARPGPVPKPGIRTPRVGNNPFSSA QPADRPIPRPPAPRPGTARPGVPRPGASPGSMPPRPGGAVGGARPPRPGAPRPGGRPG APGAGRSDAGGGNYRGGGVGAAPGTGFRGRPGGGGGGRPGQRGGAAGAFGRPGGAPRR GRKSKRQKRQEYDSMQAPVVGGVRLPHGNGETIRLARGASLSDFADKIDANPAALVQA LFNLGEMVTATQSVGDETLELLGSEMNYNVQVVSPEDEDRELLESFDLSYGEDEGGEE DLQVRPPVVTVMGHVDHGKTRLLDTIRKANVREAEAGGITQHIGAYQVAVDLDGSQRL ITFIDTPGHEAFTAMRARGAKATDIAILVVAADDGVMPQTVEAINHAQAADVPIVVAV NKIDKEGADPAKIRGQLTEYGLVPEEFGGDTMFVDISAKQGTNIEALEEAVLLTADAA LDLRANPDMEAQGVAIEAHLDRGRGPVATVLVQRGTLRVGDSVVAGDAYGRVRRMVDE HGEDVEVALPSRPVQVIGFTSVPGAGDNFLVVDEDRIARQIADRRSARKRNALAARSR KRISLEDLDSALKETSQLNLILKGDNAGTVEALEEALMGIQVDDEVVLRVIDRGVGGI TETNVNLASASDAVIIGFNVRAEGKATELASREGVEIRYYSVIYQAIDEIEQALRGLL KPIYEENQLGRAEIRALFRSSKVGLIAGCLVTSGVMRRNAKARLLRDNIVVAENLSIA SLRREKDDVTEVRDGFECGLTLGYADIKEGDVIESYELVQKERA" misc_feature complement(3146638..3146661) /gene="infB" /locus_tag="Rv2839c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3147959..3148258) /locus_tag="Rv2840c" /db_xref="GeneID:888189" CDS complement(3147959..3148258) /locus_tag="Rv2840c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2840c, (MTCY16B7.02), len: 99 aa. Conserved hypothetical protein, equivalent to Q9Z5J0|ML1557|MLCB596.13 HYPOTHETICAL 11.6 KDA PROTEIN from Mycobacterium leprae (106 aa), FASTA scores: opt: 501, E(): 2.3e-29, (501% identity in 96 aa overlap). Also highly similar to other hypothetical proteins e.g. Q9KYR0|SC5H4.29 from Streptomyces coelicolor (101 aa), FASTA scores: opt: 256, E(): 1.4e-11, (50.6% identity in 81 aa overlap); Q9APM9 from Myxococcus xanthus (111 aa), FASTA scores: opt: 174, E(): 1.3e-05, (42.25% identity in 97 aa overlap); and similar to to others e.g. N-terminus of CAC41675|SMC02913 from Rhizobium meliloti (Sinorhizobium meliloti) (230 aa), FASTA scores: opt: 172, E(): 3e-05, (42.4% identity in 66 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217356.1" /db_xref="GI:15609977" /db_xref="GeneID:888189" /translation="MRTCVGCRKRGLAVELLRVVAVSTGNGNYAVIVDTATSLPGRGA WLHPLRQCAQQAIRRRAFARALRIAGSPDTSAVVEYLESLGELEPPGNRTGSNRT" gene complement(3148385..3149428) /gene="nusA" /locus_tag="Rv2841c" /db_xref="GeneID:888213" CDS complement(3148385..3149428) /gene="nusA" /locus_tag="Rv2841c" /function="COULD PARTICIPATES IN BOTH THE TERMINATION AND ANTITERMINATION OF TRANSCRIPTION." /note="modifies transcription through interactions with RNA polymerase affecting elongation, readthrough, termination, and antitermination" /codon_start=1 /transl_table=11 /product="transcription elongation factor NusA" /protein_id="NP_217357.1" /db_xref="GI:15609978" /db_xref="GeneID:888213" /translation="MNIDMAALHAIEVDRGISVNELLETIKSALLTAYRHTQGHQTDA RIEIDRKTGVVRVIARETDEAGNLISEWDDTPEGFGRIAATTARQVMLQRFRDAENER TYGEFSTREGEIVAGVIQRDSRANARGLVVVRIGTETKASEGVIPAAEQVPGESYEHG NRLRCYVVGVTRGAREPLITLSRTHPNLVRKLFSLEVPEIADGSVEIVAVAREAGHRS KIAVRSNVAGLNAKGACIGPMGQRVRNVMSELSGEKIDIIDYDDDPARFVANALSPAK VVSVSVIDQTARAARVVVPDFQLSLAIGKEGQNARLAARLTGWRIDIRGDAPPPPPGQ PEPGVSRGMAHDR" gene complement(3149425..3149976) /locus_tag="Rv2842c" /db_xref="GeneID:888159" CDS complement(3149425..3149976) /locus_tag="Rv2842c" /function="UNKNOWN" /note="in Streptococcus pneumoniae this gene was found to be essential; structure determination of the Streptococcus protein shows that it is similar to a number of other proteins" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217358.1" /db_xref="GI:15609979" /db_xref="GeneID:888159" /translation="MTTGLPSQRQVIELLGADFACAGYEIEDVVIDARARPPRIAVIA DGDAPLDLDTIAALSRRASALLDGLDGANKIRGRYLLEVSSPGVERPLTSEKHFRRAR GRKVELVLSDGSRLTGRVGEMRAGTVALVIREDRGWAVREIPLAEIVKAVVQVEFSPP APAELELAQSSEMGLARGTEAGA" gene 3150171..3150716 /locus_tag="Rv2843" /db_xref="GeneID:888522" CDS 3150171..3150716 /locus_tag="Rv2843" /function="UNKNOWN" /note="Rv2843, (MTCY24A1.14c), len: 181 aa. Probable conserved transmembrane ala-rich protein, equivalent to Q9Z5J3|ML1560|MLCB596.10c HYPOTHETICAL 17.5 KDA PROTEIN from Mycobacterium leprae (178 aa), FASTA scores: opt: 707, E(): 1.4e-32, (70.25% identity in 168 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217359.1" /db_xref="GI:15609980" /db_xref="GeneID:888522" /translation="MLRAAPVINRLTNRPISRRGVLAGGAALAALGVVSACGESAPKA PAVEELRSPLDQARHDGALAAAAATAIGIPPQVAAALTVVATQRTSHARALATEIARA AGKLVSATSETSSSSPSPTDPAAPPPAVSDVIDSLRTSAGEASRLVATTSGYRAGLLA SIAASCTASYTVALVPSGPSI" gene 3150713..3151201 /locus_tag="Rv2844" /db_xref="GeneID:888536" CDS 3150713..3151201 /locus_tag="Rv2844" /function="UNKNOWN" /note="Rv2844, (MTCY24A1.13c), len: 162 aa. Conserved hypothetical ala-rich protein, equivalent to Q9Z5J4|ML1561|MLCB596.09c HYPOTHETICAL 17.5 KDA PROTEIN from Mycobacterium leprae (165 aa), FASTA scores: opt: 771, E(): 4.9e-46, (71.5% identity in 165 aa overlap). Also similar to Q9KYR4|SC5H4.25c HYPOTHETICAL 16.8 KDA PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 242, E(): 1.6e-09, (38.9% identity in 144 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217360.1" /db_xref="GI:15609981" /db_xref="GeneID:888536" /translation="MTSSEPAHGATPKRSPSEGSADNAALCDALAVEHATIYGYGIVS ALSPPGVNFLVADALKQHRHRRDDVIVMLSARGVTAPIAAAGYQLPMQVSSAADAARL AVRMENDGATAWRAVVEHAETADDRVFASTALTESAVMATRWNRVLGAWPITAAFPGG DE" gene complement(3151202..3152950) /gene="proS" /locus_tag="Rv2845c" /db_xref="GeneID:888539" CDS complement(3151202..3152950) /gene="proS" /locus_tag="Rv2845c" /EC_number="6.1.1.15" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-PROLINE + TRNA(PRO) = AMP + PYROPHOSPHATE + L-PROLYL-TRNA(PRO)]." /note="catalyzes the formation of prolyl-tRNA(Pro) from proline and tRNA(Pro)" /codon_start=1 /transl_table=11 /product="prolyl-tRNA synthetase" /protein_id="NP_217361.1" /db_xref="GI:15609982" /db_xref="GeneID:888539" /translation="MITRMSELFLRTLRDDPADAEVASHKLLIRAGYIRPVAPGLYSW LPLGLRVLRNIERVIRDEMNAIGGQEILFPALLPRAPYETTNRWTQYGDSVFRLKDRR GNDYLLGPTHEELFTLTVKGEYSSYKDFPLTLYQIQTKYRDEARPRAGILRAREFVMK DSYSFDIDAAGLKAAYHAHREAYQRIFDRLQVRYVIVSAVSGAMGGSASEEFLAESPS GEDAFVRCLESGYAANVEAVVTARPDTLPIDGLPEAVVHDTGDTPTIASLVAWANEAD LGRTVTAADTLKNVLIKVRQPGGDTELLAIGVPGDREVDDKRLGAALEPADYALLDDD DFAKHPFLVKGYIGPKALRENNVRYLVDPRIVDGTSWITGADQPGRHVVGLVAGRDFT ADGTIEAAEVREGDPSPDGAGPLVMARGIEIGHIFQLGSKYTDAFTADVLGEDGKPVR LTMGSYGIGVSRLVAVVAEQHHDELGLRWPSTVAPFDVHLVIANKDAQARAGATALAA DLDRLGVEVLLDDRQASPGVKFKDAELLGMPWIVVVGRGWADGVVELRDRFSGQTREL VAGASLATDIAAAVTG" misc_feature complement(3152468..3152530) /gene="proS" /locus_tag="Rv2845c" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1" gene complement(3153039..3154631) /gene="efpA" /locus_tag="Rv2846c" /db_xref="GeneID:888575" CDS complement(3153039..3154631) /gene="efpA" /locus_tag="Rv2846c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY DRUG) ACROSS THE MEMBRANE (EXPORT): SO RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv2846c, (MTCY24A1.11), len: 530 aa. Possible efpA, integral membrane efflux protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug (see citations below), equivalent to Q9Z5J5|ML1562|MLCB596.08 PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Mycobacterium leprae (534 aa), FASTA scores: opt: 2881, E(): 4.1e-160, (86.55% identity in 535 aa overlap). Also highly similar to several membrane proteins e.g. O69986|SC4H2.31c TRANSMEMBRANE EFFLUX PROTEIN (515 aa), FASTA scores: opt: 1063, E(): 2.2e-54, (39.65% identity in 406 aa overlap); Q9FBQ5|SCD86A.02c PUTATIVE TRANSPORT INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (503 aa), FASTA scores: opt: 918, E(): 5.8e-46, (33.7% identity in 469 aa overlap); Q9KYU0|SCE22.23c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (514 aa), FASTA scores: opt: 888, E(): 3.3e-44, (32.85% identity in 469 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane efflux protein EfpA" /protein_id="NP_217362.1" /db_xref="GI:15609983" /db_xref="GeneID:888575" /translation="MTALNDTERAVRNWTAGRPHRPAPMRPPRSEETASERPSRYYPT WLPSRSFIAAVIAIGGMQLLATMDSTVAIVALPKIQNELSLSDAGRSWVITAYVLTFG GLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEATLVIARLSQGVGSAIASP TGLALVATTFPKGPARNAATAVFAAMTAIGSVMGLVVGGALTEVSWRWAFLVNVPIGL VMIYLARTALRETNKERMKLDATGAILATLACTAAVFAFSIGPEKGWMSGITIGSGLV ALAAAVAFVIVERTAENPVVPFHLFRDRNRLVTFSAILLAGGVMFSLTVCIGLYVQDI LGYSALRAGVGFIPFVIAMGIGLGVSSQLVSRFSPRVLTIGGGYLLFGAMLYGSFFMH RGVPYFPNLVMPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAIALMLQSLGGPLV LAVIQAVITSRTLYLGGTTGPVKFMNDVQLAALDHAYTYGLLWVAGAAIIVGGMALFI GYTPQQVAHAQEVKEAIDAGEL" gene complement(3154654..3155871) /gene="cysG" /locus_tag="Rv2847c" /db_xref="GeneID:887481" CDS complement(3154654..3155871) /gene="cysG" /locus_tag="Rv2847c" /EC_number="2.1.1.107" /EC_number="4.99.1.-" /function="INVOLVED IN THE BIOSYNTHESIS OF SIROHEME AND COBALAMIN [CATALYTIC ACTIVITY: 2 S-ADENOSYL-L-METHIONINE + UROPORPHYRIN III = 2 S-ADENOSYL-L-HOMOCYSTEINE + SIROHYDROCHLORIN]. SAM-DEPENDENT METHYL TRANSFERASE THAT METHYLATES UROPORPHYRINOGEN III AT POSITION C-2 AND C-7 TO FORM PRECORRIN-2 AND THEN POSITION C-12 OR C-18 TO FORM TRIMETHYLPYRROCORPHIN 2. IT CATALYZES ALSO THE CONVERSION OF PRECORRIN-2 INTO SIROHEME (CONSISTING OF AN OXIDATION AND FE(2+) CHELATION)." /note="Rv2847c, (MTCY24A1.10), len: 405 aa. Possible cysG, multifunctional enzyme, siroheme synthase containing uroporphyrin-iii c-methyltransferase (EC 2.1.1.107), precorrin-2 oxidase (EC 1.-.-.-) and ferrochelatase (EC 4.99.1.-). C-terminus highly similar to many uroporphyrin-iii c-methyltransferases e.g. Q51720|COBA UROPORPHYRINOGEN III METHYLTRANSFERASE from Propionibacterium freudenreichii (257 aa), FASTA scores: opt: 776, E(): 1.5e-39, (48.95% identity in 243 aa overlap); Q9HMY4|UROM|VNG2331G S-ADENOSYL-L-METHIONINE:UROPORPHYRINOGEN III METHYLTRANSFERASE from Halobacterium sp. strain NRC-1 (246 aa), FASTA scores: opt: 704, E(): 3.1e-35, (49.4% identity in 245 aa overlap); P42437|NASF_BACSU|NASBE UROPORPHYRIN-III C-METHYLTRANSFERASE from Bacillus subtilis (483 aa), FASTA scores: opt: 610, E(): 2.4e-29, (42.1% identity in 240 aa overlap); etc. And highly similar over entire length to other proteins e.g. Q9L1C9|SCL11.09c UROPORPHYRINOGEN III METHYLTRANSFERASE from Streptomyces coelicolor (410 aa), FASTA scores: opt: 1481, E(): 5.6e-82, (58.45% identity in 409 aa overlap); Q9I0M7|CYSG|PA2611 SIROHEME SYNTHASE from Pseudomonas aeruginosa (465 aa), FASTA scores: opt: 609, E(): 2.7e-29, (34.7% identity in 444 aa overlap); P11098|CYSG_ECOLI|B3368|Z4729|ECS4219 SIROHEME SYNTHASE from Escherichia coli stains O157:H7 and K12 (457 aa), FASTA scores: opt: 543, E(): 9.1e-27, (31.3% identity in 450 aa overlap); etc. BELONGS TO A FAMILY THAT GROUPS SUMT, CYSG, CBIF/COBM AND CBIL/COBI. Note that previously known as cysG2.; cysG2" /codon_start=1 /transl_table=11 /product="multifunctional uroporphyrinogen III methylase/precorrin-2 oxidase/ferrochelatase" /protein_id="NP_215025.2" /db_xref="GI:57117027" /db_xref="GeneID:887481" /translation="MTENPYLVGLRLAGKKVVVVGGGTVAQRRLPLLIASGADVHVIA PSVTPAVEAMDQITLSVRDYRDGDLDGAWYAIAATDDARVNVAVVAEAERRRIFCVRA DIAVEGTAVTPASFSYAGLSVGVLAGGEHRRSAAIRSAIREALQQGVITAQSSDVLSG GVALVGGGPGDPELITVRGRRLLAQADVVVADRLAPPELLAELPPHVEVIDAAKIPYG RAMAQDAINAVLIERARSGNFVVRLKGGDPFVFARGYEEVLACAHAGIPVTVVPGVTS AIAVPAMAGVPVTHRAMTHEFVVVSGHLAPGHPESLVNWDALAALTGTIVLLMAVERI ELFVDVLLKGGRTADTPVLVVQHGTTAAQQTLRATLADTPEKVRAAGIRPPAIIVIGA VVGLSGVRGLNNS" repeat_region complement(3155874..3155927) /note="54 bp direct repeat 4, GGTGGCGACCCGCGGCGCCCGGTCCCCGCGCTTGCGATCGCCACTAGGCTTGGC" repeat_region complement(3155928..3155981) /note="54 bp direct repeat 3, GGTGGCGACCCGCGGCGCCCGGTCCCCGCGCTTGCGATCGCCACTGGCCCTGAT" repeat_region complement(3155982..3156035) /note="54 bp direct repeat 2, GGTGGCGACCCGCGGCGCCCGGTCCCCGCGCTTGCGATCGCCACTGGCCCTGAT" repeat_region complement(3156036..3156089) /note="54 bp direct repeat 1, GGTGGCGACCCGCGGCGCCCGGTCCCCGCGCTTGCGATCGCCACTGGCCCTGAT" gene complement(3156148..3157521) /gene="cobB" /locus_tag="Rv2848c" /db_xref="GeneID:888499" CDS complement(3156148..3157521) /gene="cobB" /locus_tag="Rv2848c" /function="INVOLVED IN THE COBALAMIN BIOSYNTHESIS. RESPONSIBLE FOR THE AMIDATION OF CARBOXYLIC GROUPS AT POSITION A AND C OF EITHER COBYRINIC ACID OR HYDROGENOBRYNIC ACID. NH(2) GROUPS ARE PROVIDED BY GLUTAMINE, AND ONE MOLECULE OF ATP IS HYDROGENOLYZED FOR EACH AMIDATION." /note="responsible for the amidation of carboxylic groups at position A and C of cobyrinic acid or hydrogenobrynic acid" /codon_start=1 /transl_table=11 /product="cobyrinic acid a,c-diamide synthase" /protein_id="NP_217364.1" /db_xref="GI:15609985" /db_xref="GeneID:888499" /translation="MRVSAVAVAAPASGSGKTTIATGLIGALRQAGHTVAPFKVGPDF IDPGYHALAAGRPGRNLDPVLVGERLIGPLYAHGVAGADIAVIEGVLGLFDGRIGPAG GAPAAGSTAHVAALLGAPVILVVDARGQSHSVAALLHGFSTFDTATRIAGVILNRVGS ARHEQVLRQACDQAGVAVLGAIPRTAELELPTRYLGLVTAVEYGRRARLAVQAMTAVV ARHVDLAAVIACAGSQAAHPPWDPVIAVGNTARQPATVAIAAGRAFTFGYAEHAEMLR AAGAEVVEFDPLSETLPEGTDAVVLPGGFPEQFTAELSANDTVRRQINELAAAGAPVH AECAGLLYLVSELDGHPMCGVVAGSARFTQHLKLGYRDAVAVVDSALYSVGERVVGHE FHRTAVTFADSYQPAWVYQGQDVDDVRDGAVHSGVHASYLHTHPAATPGAVARFVAHA ACNTPRA" gene complement(3157521..3158144) /gene="cobO" /locus_tag="Rv2849c" /db_xref="GeneID:888236" CDS complement(3157521..3158144) /gene="cobO" /locus_tag="Rv2849c" /EC_number="2.5.1.17" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS; TRANSFORMS COBYRINIC ACID INTO COBINAMIDE [CATALYTIC ACTIVITY: ATP + COB(I)ALAMIN + H(2)O = ORTHOPHOSPHATE + PYROPHOSPHATE + ADENOSYLCOBALAMIN]." /note="catalyzes the formation of adenosylcob(III)yrinic acid a,c-diamide from cob(I)yrinic acid a,c-diamide" /codon_start=1 /transl_table=11 /product="cob(I)yrinic acid a,c-diamide adenosyltransferase" /protein_id="YP_177908.1" /db_xref="GI:57117028" /db_xref="GeneID:888236" /translation="MPQGNPLAVPNDGLTTRARRNMPILAVHTGEGKGKSTAAFGMAL RAWNAGLDIAVFQFVKSAKWKVGEEAAFRQLGRLHDQHGIGGAVEWHKMGAGWSWTRT SRKAGTDVDRAAAAADGWAEIALRLATQRHDFYLLDEFTYPLKWGWLDVDEVVDVLRA RPGHQHVVITGRDAPQRLVAAADLVTEMTKVKHPMDAGRKGQKGIEW" gene complement(3158165..3160054) /locus_tag="Rv2850c" /db_xref="GeneID:887364" CDS complement(3158165..3160054) /locus_tag="Rv2850c" /EC_number="4.99.1.-" /function="UNKNOWN; POSSIBLY INTRODUCES A MAGNESIUM ION INTO SPECIFIC SUBSTRATE/COMPOUND." /note="Rv2850c, (MTCY24A1.07), len: 629 aa. Possible magnesium-chelatase (EC 4.99.1.-), highly similar (but with gaps) to magnesium-chelatases from notably photosynthetic organisms involved in chlorophyll biosynthesis e.g. Q9RJ18|SCI8.35c PUTATIVE CHELATASE from Streptomyces coelicolor (672 aa), FASTA scores: opt: 1941, E(): 2.1e-85, (54.65% identity in 675 aa overlap); Q9HZQ5|PA2942 PROBABLE MAGNESIUM CHELATASE from Pseudomonas aeruginosa (338 aa), FASTA scores: opt: 991, E(): 2.7e-40, (49.45% identity in 368 aa overlap); O33549|BCHI MG PROTOPORPHYRIN IX CHELATASE SUBUNIT from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (334 aa), FASTA scores: opt: 833, E(): 9.4e-33, (50.65% identity in 318 aa overlap); O30819|BCHI_RHOSH MAGNESIUM-CHELATASE 38 KDA SUBUNIT from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (334 aa), FASTA scores: opt: 828, E(): 1.6e-32, (50.3% identity in 318 aa overlap); etc. Equivalent to AAK47242 from Mycobacterium tuberculosis strain CDC1551 (610 aa) but longer 19 aa. COULB BELONG TO THE MG-CHELATASE SUBUNITS D/I FAMILY." /codon_start=1 /transl_table=11 /product="magnesium chelatase" /protein_id="NP_217366.1" /db_xref="GI:15609987" /db_xref="GeneID:887364" /translation="MKPYPFSAIVGHDRLRLALLLCAVRPEIGGALIRGEKGTAKSTA VRGLAALLSVATGSTETGLVELPLGATEDRVVGSLDLQRVMRDGEHAFSPGLLARAHG GVLYVDEVNLLHDHLVDILLDAAAMGRVHVERDGISHSHEARFVLIGTMNPEEGELRP QLLDRFGLTVDVQASRDIDVRVQVIRRRMAYEADPDAFVARYADADAELAHRIAAARA TVDDVVLGDNELRRIAALCAAFDVDGMRADLVVARTAAAHAAWRGVRTVEEQDIRAAA ELALPHRRRRDPFDDHGIDRDQLDEALALASVDPEPEPDPPGGGQSANEPASQPNSRS KSTEPGAPSSMGDDPPRPASPRLRSSPRPSAPPSKIFRTRALRVPGVGTGAPGRRSRA RNASGSVVAAAEVSDPDAHGLHLFATLLAAGERAFGAGPLRPWPDDVRRAIREGREGN LVIFVVDASGSMAARDRMAAVSGATLSLLRDAYQRRDKVAVITFRQHEATLLLSPTSS AHIAGRRLARFSTGGKTPLAEGLLAARALIIREKVRDRARRPLVVVLTDGRATAGPDP LGRSRTAAAGLVAEGAAAVVVDCETSYVRLGLAAQLARQLGAPVVRLEQLHADYLVHA VRGVA" gene complement(3160051..3160521) /locus_tag="Rv2851c" /db_xref="GeneID:887664" CDS complement(3160051..3160521) /locus_tag="Rv2851c" /function="UNKNOWN" /note="Rv2851c, (MTCY24A1.06), len: 156 aa. Conserved hypothetical protein, similar to various bacterial proteins e.g. Q9KP14|VC2565 ELAA PROTEIN from Vibrio cholerae (149 aa), FASTA scores: opt: 360, E(): 1e-18, (46.05% identity in 139 aa overlap); Q9I717|PA0115 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (150 aa), FASTA scores: opt: 341, E(): 2.4e-17, (43.65% identity in 142 aa overlap); Q9K8M4|BH2982 HYPOTHETICAL PROTEIN from Bacillus halodurans (155 aa), FASTA scores: opt: 320, E(): 8e-16, (40.85% identity in 142 aa overlap); P52077|ELAA_ECOLI|B2267 PROTEIN ELAA from Escherichia coli strain K12 (153 aa), FASTA scores: opt: 269, E(): 3.8e-12, (35.7% identity in 140 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217367.1" /db_xref="GI:15609988" /db_xref="GeneID:887664" /translation="MTEALRRVWAKDLDARALYELLKLRVEVFVVEQACPYPELDGRD LLAETRHFWLETPDGEVTCTLRLMEEHAGGEKVFRIGRLCTKRDARGQGHSNRLLCAA LAEVGDYPCRIDAQAYLTAMYAQHGFVRDGDEFLDDGIPHVPMLRPGSGQVERP" repeat_region complement(3160522..3160583) /note="62 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene complement(3160580..3162061) /gene="mqo" /locus_tag="Rv2852c" /db_xref="GeneID:888544" CDS complement(3160580..3162061) /gene="mqo" /locus_tag="Rv2852c" /EC_number="1.1.5.4" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE [CATALYTIC ACTIVITY: (S)-MALATE + ACCEPTOR = OXALOACETATE + REDUCED ACCEPTOR]." /note="malate dehydrogenase; catalyzes the oxidation of malate to oxaloacetate" /codon_start=1 /transl_table=11 /product="malate:quinone oxidoreductase" /protein_id="NP_217368.1" /db_xref="GI:15609989" /db_xref="GeneID:888544" /translation="MSDLARTDVVLIGAGIMSATLGVLLRRLEPNWSITLIERLDAVA AESSGPWNNAGTGHSALCEMNYTPEMPDGSIDITKAVRVNEQFQVTRQFWAYAAENGI LTDVRSFLNPVPHVSFVHGSRGVEYLRRRQKALAGNPLFAGTEFIESPDEFARRLPFM AAKRAFSEPVALNWAADGTDVDFGALAKQLIGYCVQNGTTALFGHEVRNLSRQSDGSW TVTMCNRRTGEKRKLNTKFVFVGAGGDTLPVLQKSGIKEVKGFAGFPIGGRFLRAGNP ALTASHRAKVYGFPAPGAPPLGALHLDLRFVNGKSWLVFGPYAGWSPKFLKHGQISDL PRSIRPDNLLSVLGVGLTERRLLNYLISQLRLSEPERVSALREFAPSAIDSDWELTIA GQRVQVIRRDERNGGVLEFGTTVIGDADGSIAGLLGGSPGASTAVAIMLDVLQKCFAN RYQSWLPTLKEMVPSLGVQLSNEPALFDEVWSWSTKALKLGAA" gene 3162268..3164115 /gene="PE_PGRS48" /locus_tag="Rv2853" /db_xref="GeneID:888171" CDS 3162268..3164115 /gene="PE_PGRS48" /locus_tag="Rv2853" /function="UNKNOWN" /note="Rv2853, (MTCY24A1.04c), len: 615 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many e.g. O53884|Rv0872c|MTV043.65c from Mycobacterium tuberculosis (606 aa), FASTA scores: opt: 1405, E(): 1.4e-97, (64.6% identity in 619 aa overlap). Equivalent to AAK47245 from Mycobacterium tuberculosis strain CDC1551 (663 aa) but shorter 48 aa." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177909.1" /db_xref="GI:57117029" /db_xref="GeneID:888171" /translation="MLYVVASPDLMTAAATNLAEIGSAISTANGAAALPTVEVVAAAA DEVSTQIAALFGAHARSYQTLSTQAAAFHSRFVQALTTAAASYASVEAANASPLQVAL DVINAPAQTLLGRPLIGNGADGSTPGQAGGPGGLLYGNGGNGAAGGPNQAGGAGGNAG LIGNGGAGGAGGVGAVGGKRGTGGLLFGNGGAGGQGGLGLAGINGGSGGQGGHGGNAI LFGQGGAGGPGGTGAMGVAGTNPTPIGTAAPGSDGVNQIGNGGNTDLTGGAGGDGNAG STTVNGGNGGTGGAARNSSGGTGNSFGGAGGAGGDGANGGDGGAGGEALTEGGATAVS GAGGKGGNAEASGGAGGNGGKGGFAQATTSVTGGNGGNGGNGHDSNAPGGAGGSGGVG GDGGRGGLLAGNGGTGGAGGNGGTGGAGAPGGAGGAGGKADIANSLGDNATVTGGNGG TGGDGGSALGTGGAGGAGGLGGHGGAGGLLIGNGGAGGAGGLGGAGGAGGAGGEGGAG GAGGEAIPGGASTNSAGGDGGAGGTGGNGGDGGAGGAPGLGGAGGAGGWLIGQSGSTG GGGAGGAGGAGGAGGAGGSGGAGGHGDTTSGKNGSSGTAGFDGNPGQPG" gene 3164152..3165192 /locus_tag="Rv2854" /db_xref="GeneID:888542" CDS 3164152..3165192 /locus_tag="Rv2854" /function="UNKNOWN" /note="Rv2854, (MTCY24A1.03c), len: 346 aa. Hypothetical unknown protein, showing similarity with Q9CD03|ML2603 HYPOTHETICAL PROTEIN from Mycobacterium leprae (279 aa), FASTA scores: opt: 154, E(): 0.0083, (33.35% identity in 87 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217370.1" /db_xref="GI:15609991" /db_xref="GeneID:888542" /translation="MTGWVPDVLPGYWQCTIPLGPDPDDEGDIVATLVGRGPQTGKAR GDTTGAHHTVLAVHGYTDYFFHTELADHFANRGFAFYALDLRKCGRSRAPGQTPHFIT DLARYDTELEHSLSIINEQNRSAKVLVYGHSAGGLIVSLWLDRLRQRGEITRAGVTGL VLNSPFLDLQGPAILRLPLTSAFFAAMARMRPKWVARPPKEGGYGCTLHRDYDGEFDY NLQWKPVGGFPVTFGWIHASRRGHARLHRGIDVGVPNLILCSDHTVREKADPATLHRG DAVLDVTHITRWAGCIGNRSTVIAVADAKHDVFLSLPQPRQMAYRRLDLWLDDYLGTH NDTDASASSGKG" gene 3165205..3166584 /gene="mtr" /locus_tag="Rv2855" /db_xref="GeneID:887773" CDS 3165205..3166584 /gene="mtr" /locus_tag="Rv2855" /EC_number="1.8.1.7" /function="INVOLVED IN REDUCTION OF MYCOTHIOL." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the reduction of mycothione or glutathione to mycothione or glutathione disulfide" /codon_start=1 /transl_table=11 /product="mycothione reductase" /protein_id="YP_177910.1" /db_xref="GI:57117030" /db_xref="GeneID:887773" /translation="METYDIAIIGTGSGNSILDERYASKRAAICEQGTFGGTCLNVGC IPTKMFVYAAEVAKTIRGASRYGIDAHIDRVRWDDVVSRVFGRIDPIALSGEDYRRCA PNIDVYRTHTRFGPVQADGRYLLRTDAGEEFTAEQVVIAAGSRPVIPPAILASGVDYH TSDTVMRIAELPEHIVIVGSGFIAAEFAHVFSALGVRVTLVIRGSCLLRHCDDTICER FTRIASTKWELRTHRNVVDGQQRGSGVALRLDDGCTINADLLLVATGRVSNADLLDAE QAGVDVEDGRVIVDEYQRTSARGVFALGDVSSPYLLKHVANHEARVVQHNLLCDWEDT QSMIVTDHRYVPAAVFTDPQIAAVGLTENQAVAKGLDISVKIQDYGDVAYGWAMEDTS GIVKLITERGSGRLLGAHIMGYQASSLIQPLIQAMSFGLTAAEMARGQYWIHPALPEV VENALLGLR" gene 3166684..3167802 /gene="nicT" /locus_tag="Rv2856" /db_xref="GeneID:888643" CDS 3166684..3167802 /gene="nicT" /locus_tag="Rv2856" /function="SEEMS INVOLVED IN NICKEL INCORPORATION. THOUGHT TO BE INVOLVED IN TRANSPORT OF NICKEL ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2856, (MTCY24A1.01c), len: 372 aa. Possible nicT, nickel-transport integral membrane protein, similar to transport proteins and hydrogenase cluster proteins e.g. BAB58860|SAV2698 HYPOTHETICAL 37.9 KDA PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (338 aa), FASTA scores: opt: 1082, E(): 7.1e-60, (48.05% identity in 335 aa overlap); Q97ZB2|HOXN HIGH-AFFINITY NICKEL-TRANSPORT PROTEIN from Sulfolobus solfataricus (373 aa), FASTA scores: opt: 922, E(): 6.6e-50, (42.2% identity in 372 aa overlap); P23516|HOXN_ALCEU HIGH-AFFINITY NICKEL TRANSPORT PROTEIN (INTEGRAL MEMBRANE PROTEIN) from Alcaligenes eutrophus (Ralstonia eutropha) (351 aa), FASTA scores: opt: 904, E(): 8.3e-49, (41.9% identity in 339 aa overlap); Q45247|HUPN_BRAJA HYDROGENASE NICKEL INCORPORATION PROTEIN from Bradyrhizobium japonicum (381 aa), FASTA scores: opt: 853, E(): 1.3e-45, (41.65% identity in 329 aa overlap); etc. SEEMS TO BELONG TO THE HOXN/HUPN/NIXA FAMILY OF NICKEL TRANSPORTERS (NiCoT FAMILY)." /codon_start=1 /transl_table=11 /product="nickel-transport integral membrane protein" /protein_id="NP_217372.1" /db_xref="GI:15609993" /db_xref="GeneID:888643" /translation="MASSQLDRQRSRSAKMNRALTAAEWWRLGLMFAVIVALHLVGWL TVTLLVEPARLSLGGKAFGIGVGLTAYTLGLRHAFDADHIAAIDNTTRKLMSDGHRPL AVGFFFSLGHSTVVFGLAVMLVTGLKAIVGPVENDSSTLHHYTGLIGTSISGAFLYLI GILNVIVLVGIVRVFAHLRRGDYDEAELEQQLDNRGLLIRFLGRFTKSLTKSWHMYPV GFLFGLGFDTATEIALLVLAGTSAAAGLPWYAILCLPVLFAAGMCLLDTIDGSFMNFA YGWAFSSPVRKIYYNITVTGLSVAVALLIGSVELLGLIANQLGWQGPFWDWLGGLDLN TVGFVVVAMFALTWAIALLVWHYGRVEERWTPAPDRTT" misc_feature 3168123..3168140 /note="PS00190 Cytochrome c family heme-binding site signature" gene complement(3168583..3169359) /locus_tag="Rv2857c" /db_xref="GeneID:888560" CDS complement(3168583..3169359) /locus_tag="Rv2857c" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2857c, (MTV003.03c), len: 258 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases e.g. O88068|SCI35.33c PROBABLE DEHYDROGENASE (SDR FAMILY) from Streptomyces coelicolor (260 aa), FASTA scores: opt: 1208, E(): 2e-68, (72.35% identity in 253 aa overlap); Q9I376|PA1649 from Pseudomonas aeruginosa PROBABLE SHORT-CHAIN DEHYDROGENASE (253 aa), FASTA scores: opt: 569, E(): 2.1e-28, (39.2% identity in 255 aa overlap); Q9EX74|MLHA SDR-LIKE ENZYME from Rhodococcus erythropolis (246 aa), FASTA scores: opt: 567, E(): 2.8e-28, (41.15% identity in 248 aa overlap); etc. Also similar to many Mycobacterium tuberculosis dehydrogenases e.g. FABG3|Rv2002|MT2058|MTCY39.16c PUTATIVE OXIDOREDUCTASE (260 aa), FASTA score: (38.3% identity in 248 aa overlap). BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_217373.1" /db_xref="GI:15609994" /db_xref="GeneID:888560" /translation="MMDLSQRLAGRVAVITGGGSGIGLAAGRRMRAEGATIVVGDVDV EAGGAAADELSGLFVPTDVCDEDAVNGLFDGAAETYGRIDIAFNNAGISPPEDNLIEN TELAAWQRVQDVNLKSVYLCCRAALRHMVLAGKGSIVNTASFVAVMGSATSQISYTAS KGGVLAMSRELGVQFARQGIRVNALCPGPVNTPLLQELFAKNPERAARRMVHVPLGRF AEPDEIAAAVAFLASDDASFITASTFLVDGGISSAYVTPL" gene complement(3169356..3170723) /gene="aldC" /locus_tag="Rv2858c" /db_xref="GeneID:888636" CDS complement(3169356..3170723) /gene="aldC" /locus_tag="Rv2858c" /EC_number="1.2.1.3" /function="OXIDIZES A WIDE VARIETY OF ALDEHYDES [CATALYTIC ACTIVITY: ALDEHYDE + NAD(+) + H(2)O = ACID + NADH]." /note="Rv2858c, (MTV003.04c), len: 455 aa. Probable aldC, aldehyde dehydrogenase (EC 1.2.1.3), similar to many e.g. O88069|SCI35.34c PUTATIVE ALDEHYDE DEHYDROGENASE from Streptomyces coelicolor (483 aa), FASTA scores: opt: 1872, E(): 6.4e-109, (64.5% identity in 448 aa overlap); Q9FAB1|ALDH|BT-ALDH ALDEHYDE DEHYDROGENASE from Bacillus thermoleovorans (497 aa), FASTA scores: opt: 1157, E(): 2.1e-64, (44.3% identity in 458 aa overlap); O33455|CYMC P-CUMIC ALDEHYDE DEHYDROGENASE from Pseudomonas putida (494 aa), FASTA scores: opt: 1149, E(): 6.5e-64, (43.15% identity in 452 aa overlap); P40047|DHA5_YEAST|ALD5|ALDH5|ALD3|YER073W ALDEHYDE DEHYDROGENASE from Saccharomyces cerevisiae (Baker's yeast) (519 aa), FASTA scores: opt: 1091, E(): 2.7e-60, (38.55% identity in 459 aa overlap); P80668|FEAB_ECOLI|PADA|MAOB|B1385 PHENYLACETALDEHYDE DEHYDROGENASE (EC 1.2.1.39) from Escherichia coli strain K12 (499 aa), FASTA scores: opt: 1074, E(): 3e-59, (42.2% identity in 462 aa overlap); etc. Also similar to many M. tuberculosis dehydrogenases e.g. P71823|Rv0768|MTCY369.13 (489 aa), FASTA score: (38.1% identity in 467 aa overlap). Contains PS00687 Aldehyde dehydrogenases glutamic acid active site and PS00070 Aldehyde dehydrogenases cysteine active site. BELONGS TO THE ALDEHYDE DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="aldehyde dehydrogenase" /protein_id="NP_217374.1" /db_xref="GI:15609995" /db_xref="GeneID:888636" /translation="MSTTQLINPATEEVLASVDHTDANAVDDAVQRARAAQRRWARLA PAQRAAGLRAFAAAVQAHLDELAALEVANSGHPIVSAEWEAGHVRDVLAFYAASPERL SGRQIPVAGGVDVTFNEPMGVVGVITPWNFPMVIASWAIAPALAAGNAVLVKPAELTP LTTMRLGELAVEAGLDEDLLQVLPGKGTVVGERFVTHPDIRKIVFTGSTEVGKRVMAG AAAQVKRVTLELGGKSANIVFHDCDLERAATTAPAGVFDNAGQDCCARSRILVQRSVY DRFMELLEPAVHSIVVGDPGSRATEMGPLVSRAHRDKVAGYVPDDAPVAFRGTAPAGR GFWFPPTVLTPKRGDRTVTDEIFGPVVVVLTFDDEADAISLANDTAYGLSGSIWTDDL SRALRVARAVESGNLSVNSHSSVRFNTPFGGFKQSGVGRELGPDAPLQFTETKNVFIA VGEEM" misc_feature complement(3169923..3169958) /gene="aldC" /locus_tag="Rv2858c" /note="PS00070 Aldehyde dehydrogenases cysteine active site" misc_feature complement(3170019..3170042) /gene="aldC" /locus_tag="Rv2858c" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" gene complement(3170720..3171646) /locus_tag="Rv2859c" /db_xref="GeneID:887495" CDS complement(3170720..3171646) /locus_tag="Rv2859c" /EC_number="6.3.5.-" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2859c, (MTV003.05c), len: 308 aa. Possible amidotransferase (EC 6.3.5.- or 2.-.-.-), equivalent (but longer 58 aa) to Q9CBU9|ML1573 POSSIBLE AMIDOTRANSFERASE from Mycobacterium leprae (249 aa), FASTA scores: opt: 1226, E(): 3e-64, (71.55% identity in 239 aa overlap). Also similar to other amidotransferases and hypothetical proteins, but shorter in N-terminus e.g. O88072|SCI35.37 HYPOTHETICAL 25.3 KDA PROTEIN from Streptomyces coelicolor (242 aa), FASTA scores: opt: 683, E(): 1.2e-32, (47.65% identity in 235 aa overlap); AAK79730|Q97I88|CAC1764 PREDICTED GLUTAMINE AMIDOTRANSFERASE from Clostridium acetobutylicum (241 aa), FASTA scores: opt: 458, E(): 1.6e-19, (32.95% identity in 246 aa overlap); AAK75201|Q97QV9|SP1089 GLUTAMINE AMIDOTRANSFERASE CLASS I from Streptococcus pneumoniae (229 aa), FASTA scores: opt: 431, E(): 5.6e-18, (34.75% identity in 236 aa overlap); etc. Contains three 17 aa repeats at the N-terminus very similar to those in other Mycobacterium tuberculosis proteins e.g. Q10699|YY30_MYCTU|Rv2090|MT2151|MTCY49.30 PUTATIVE 5'-3' EXONUCLEASE RV2090 (EC 3.1.11.-)." /codon_start=1 /transl_table=11 /product="amidotransferase" /protein_id="NP_217375.1" /db_xref="GI:15609996" /db_xref="GeneID:887495" /translation="MDLSASRSDGGDPLRPASPRLRSPVSDGGDPLRPASPRLRSPVS DGGDPLRPASPRLRSPLGASRPVVGLTAYLEQVRTGVWDIPAGYLPADYFEGITMAGG VAVLLPPQPVDPESVGCVLDSLHALVITGGYDLDPAAYGQEPHPATDHPRPGRDAWEF ALLRGALQRGMPVLGICRGTQVLNVALGGTLHQHLPDILGHSGHRAGNGVFTRLPVHT ASGTRLAELIGESADVPCYHHQAIDQVGEGLVVSAVDVDGVIEALELPGDTFVLAVQW HPEKSLDDLRLFKALVDAASGYAGRQSQAEPR" repeat_region complement(3171468..3171518) /note="51 bp direct repeat 1, GTCCGATGGTGGCGACCCGCTGCGCCCGGCTTCGCCGCGCTTGCGATCGCC" repeat_region complement(3171522..3171572) /note="51 bp direct repeat 2, GTCCGATGGTGGCGACCCGCTGCGCCCGGCTTCGCCGCGCTTGCGATCGCC" repeat_region complement(3171576..3171616) /note="(41 bp) part of 51 bp direct repeat 3, GGCGACCCGCTGCGCCCGGCTTCGCCGCGCTTGCGATCGCC" gene complement(3171627..3173000) /gene="glnA4" /locus_tag="Rv2860c" /db_xref="GeneID:887420" CDS complement(3171627..3173000) /gene="glnA4" /locus_tag="Rv2860c" /EC_number="6.3.1.2" /function="INVOLVED IN GLUTAMINE BIOSYNTHESIS [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE + NH(3) = ADP + GLUTAMINE + ORTHOPHOSPHATE]." /note="Rv2860c, (MTV003.06c), len: 457 aa. Probable glnA4, glutamine synthetase class II (EC 6.3.1.2), similar to many glutamine synthases e.g. O88070|SCI35.35c from Streptomyces coelicolor (462 aa), FASTA scores: opt: 1947, E(): 8.2e-120, (64.15% identity in 452 aa overlap); Q98H15|MLL3074 from Rhizobium loti (Mesorhizobium loti) (465 aa), FASTA scores: opt: 1321, E(): 7.8e-79, (46.7% identity in 452 aa overlap); Q98EM0|MLL4187 from Rhizobium loti (Mesorhizobium loti) (456 aa), FASTA scores: opt: 698, E(): 4.6e-38, (33.5% identity in 454 aa overlap); Q9CDL9|GLNA from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (446 aa), FASTA scores: opt: 633, E(): 8.2e-34, (32.45% identity in 456 aa overlap); etc. Also similar to three other potential glutamine synthases in Mycobacterium tuberculosis: Q10378|GLN2_MYCTU|GLNA2|Rv2222c|MT2280|MTCY190.33c|MTCY42 7. 03c PROBABLE GLUTAMINE SYNTHETASE (446 aa), FASTA score: (31.1% identity in 453 aa overlap); Rv1878|glnA3 and Rv2220|glnA1. BELONGS TO THE GLUTAMINE SYNTHETASE FAMILY." /codon_start=1 /transl_table=11 /product="glutamine synthetase" /protein_id="NP_217376.1" /db_xref="GI:15609997" /db_xref="GeneID:887420" /translation="MTGPGSPPLAWTELERLVAAGDVDTVIVAFTDMQGRLAGKRISG RHFVDDIATRGVECCSYLLAVDVDLNTVPGYAMASWDTGYGDMVMTPDLSTLRLIPWL PGTALVIADLVWADGSEVAVSPRSILRRQLDRLKARGLVADVATELEFIVFDQPYRQA WASGYRGLTPASDYNIDYAILASSRMEPLLRDIRLGMAGAGLRFEAVKGECNMGQQEI GFRYDEALVTCDNHAIYKNGAKEIADQHGKSLTFMAKYDEREGNSCHIHVSLRGTDGS AVFADSNGPHGMSSMFRSFVAGQLATLREFTLCYAPTINSYKRFADSSFAPTALAWGL DNRTCALRVVGHGQNIRVECRVPGGDVNQYLAVAALIAGGLYGIERGLQLPEPCVGNA YQGADVERLPVTLADAAVLFEDSALVREAFGEDVVAHYLNNARVELAAFNAAVTDWER IRGFERL" gene complement(3173160..3174017) /gene="mapB" /locus_tag="Rv2861c" /db_xref="GeneID:888596" CDS complement(3173160..3174017) /gene="mapB" /locus_tag="Rv2861c" /EC_number="3.4.11.18" /function="REMOVES THE AMINO-TERMINAL METHIONINE FROM NASCENT PROTEINS [CATALYTIC ACTIVITY: L-METHIONYLPEPTIDE + H(2)O = L-METHIONINE + PEPTIDE]." /note="catalyzes the removal of N-terminal amino acids from peptides and arylamides; generally Co(II) however activity has been shown for some methionine aminopeptidases with Zn, Fe, or Mnin Bacillus subtilis the protein in this cluster is considered non-essential" /codon_start=1 /transl_table=11 /product="methionine aminopeptidase" /protein_id="YP_177911.1" /db_xref="GI:57117031" /db_xref="GeneID:888596" /translation="MPSRTALSPGVLSPTRPVPNWIARPEYVGKPAAQEGSEPWVQTP EVIEKMRVAGRIAAGALAEAGKAVAPGVTTDELDRIAHEYLVDNGAYPSTLGYKGFPK SCCTSLNEVICHGIPDSTVITDGDIVNIDVTAYIGGVHGDTNATFPAGDVADEHRLLV DRTREATMRAINTVKPGRALSVIGRVIESYANRFGYNVVRDFTGHGIGTTFHNGLVVL HYDQPAVETIMQPGMTFTIEPMINLGALDYEIWDDGWTVVTKDRKWTAQFEHTLLVTD TGVEILTCL" gene complement(3174059..3174643) /locus_tag="Rv2862c" /db_xref="GeneID:887423" CDS complement(3174059..3174643) /locus_tag="Rv2862c" /function="UNKNOWN" /note="Rv2862c, (MTV003.08), len: 194 aa. Conserved hypothetical protein, showing some similarity with others e.g. Q9X8X5|SCH35.31c HYPOTHETICAL 19.6 KDA PROTEIN from Streptomyces coelicolor (180 aa), FASTA scores: opt: 266, E(): 2.2e-11, (34.65% identity in 179 aa overlap); Q9Z5H1|ML0169|MLCB373.19 HYPOTHETICAL 22.1 KDA PROTEIN from Mycobacterium leprae (200 aa), FASTA scores: opt: 195, E(): 2.3e-06, (30.15% identity in 189 aa overlap); etc. Also some similarity to P71544|Y966_MYCTU|Rv0966c|MT0994|MTCY10D7.08 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (230 aa), FASTA scores: opt: 209, E(): 2.6e-07, (31.5% identity in 184 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217378.1" /db_xref="GI:15609999" /db_xref="GeneID:887423" /translation="MTETGGDMVALRVSDADRNGTMRRLHNAVALGLINIDEFEQRSS RVSFACTRSELDGLVGDLPRPGAIVTSAADRVELRGWAGSLKRHGEWIVPTRLALVRR LGSIELDLVKARFAGPVVVIELDMMFGSLEVRLPNGASASIDDVEVYVGSASDRRKDA PAEGTPHVVLTGRMVCGSVVIKGPRRALLRRHRG" gene 3174992..3175372 /locus_tag="Rv2863" /db_xref="GeneID:888195" CDS 3174992..3175372 /locus_tag="Rv2863" /function="UNKNOWN" /note="Rv2863, (MTV003.09), len: 126 aa. Conserved hypothetical protein, similar to hypothetical proteins from Mycobacterium tuberculosis e.g. Q50595|YI38_MYCTU|Rv1838c|MT1886|MTCY1A11.05|MTCY359.35 CONSERVED HYPOTHETICAL PROTEIN (131 aa), FASTA scores: opt: 299, E(): 6.5e-15, (39.0% identity in 123 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217379.1" /db_xref="GI:15610000" /db_xref="GeneID:888195" /translation="MIFVDTNVFMYAVGRDHPLRMPAREFLEHSLEHQDRLVTSAEAM QELLNAYVPVGRNSTLDSALTLVRALTEIWPVEAADVAHARTLHHRHPGLGARDLLHL ACCQRRGVTRIKTFDHTLASAFRS" gene complement(3175454..3177265) /locus_tag="Rv2864c" /db_xref="GeneID:888496" CDS complement(3175454..3177265) /locus_tag="Rv2864c" /function="UNKNOWN, POSSIBLY INVOLVED IN CELL WALL BIOSYNTHESIS." /note="Rv2864c, (MTV003.10c), len: 603 aa. Possible penicillin-binding lipoprotein, probably located in periplasm, equivalent to Q9CBU6|ML1577 PROBABLE PENICILLIN BINDING PROTEIN from Mycobacterium leprae (608 aa), FASTA scores: opt: 3352, E(): 2.1e-193, (81.5% identity in 606 aa overlap). Also shows some similarity to others e.g. P72405|PCBR from Streptomyces clavuligerus (551 aa), FASTA scores: opt: 543, E(): 6.1e-25, (28.4% identity in 567 aa overlap); Q9F2L0|SCH63.18c from Streptomyces coelicolor (546 aa), FASTA scores: opt: 519, E(): 1.7e-23, (29.3% identity in 577 aa overlap); Q9RKD1|SCE87.07 from Streptomyces coelicolor (541 aa), FASTA scores: opt: 472, E(): 1.1e-20, (34.3% identity in 318 aa overlap); etc. Equivalent to AAK47258 from Mycobacterium tuberculosis strain CDC1551 (618 aa) but shorter 15 aa. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site, and PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="penicillin-binding lipoprotein" /protein_id="NP_217380.1" /db_xref="GI:15610001" /db_xref="GeneID:888496" /translation="MVTKTTLASATSGLLLLAVVAMSGCTPRPQGPGPAAEKFFAALA IGDTASAAQLSDNPNEAREALNAAWAGLQAAHLDAQVLSAKYAEDTGTVAYRFSWHLP KDRIWTYDGQLKMARDEGRWHVRWTTSGLHPKLGEHQTFALRADPPRRASVNEVGGTD VLVPGYLYHYSLDAGQAGRELFGTAHAVVGALHPFDDTLNDPQLLAEQASSSTQPLDL VTLHADDSNRVAAAIGQLPGVVITPQAELLPTDKHFAPAVLNDVKKAVVDELDGKAGW RVVSVNQNGVDVSVLHEVAPSPASSVSITLDRVVQNAAQHAVNTRGGKAMIVVIKPST GEILAIAQNAGADADGPVATTGLYPPGSTFKMITAGAAVERDLATPETLLGCPGEIDI GHRTIPNYGGFDLGVVPMSRAFASSCNTTFAELSSRLPPRGLTQAARRYGIGLDYQVD GITTVTGSVPPTVDLAERTEDGFGQGKVLASPFGMALVAATVAAGKTPVPQLIAGRPT AVEGDATPISQKMIDALRPMMRLVVTNGTAKEIAGCGEVFGKTGEAEFPGGSHSWFAG YRGDLAFASLIVGGGSSEYAVRMTKVMFESLPPGYLA" misc_feature complement(3175775..3175798) /locus_tag="Rv2864c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(3177191..3177223) /locus_tag="Rv2864c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 3177537..3177818 /locus_tag="Rv2865" /db_xref="GeneID:887458" CDS 3177537..3177818 /locus_tag="Rv2865" /function="UNKNOWN" /note="Rv2865, (MTV003.11), len: 93 aa. Conserved hypothetical protein, showing weak similarity with P58235|YR54_SYNY3|SSR2754 HYPOTHETICAL 9.7 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (87 aa), FASTA scores: opt: 134, E(): 0.007, (30.65% identity in 75 aa overlap); BAB58570|SAV2408 CONSERVED HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (83 aa), FASTA scores: opt: 124, E(): 0.037, (27.5% identity in 80 aa overlap). Also similar to Rv1247|MTV006.19c HYPOTHETICAL 9.8 KDA PROTEIN from Mycobacterium tuberculosis (89 aa), FASTA scores: opt: 249, E(): 2.6e-11, (44.2% identity in 86 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217381.1" /db_xref="GI:15610002" /db_xref="GeneID:887458" /translation="MRILPISTIKGKLNEFVDAVSSTQDQITITKNGAPAAVLVGADE WESLQETLYWLAQPGIRESIAEADADIASGRTYGEDEIRAEFGVPRRPH" gene 3177822..3178085 /locus_tag="Rv2866" /db_xref="GeneID:887450" CDS 3177822..3178085 /locus_tag="Rv2866" /function="UNKNOWN" /note="Rv2866, (MTV003.12), len: 87 aa. Conserved hypothetical protein, similar to O50461|Rv1246c|MTV006.18c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (97 aa), FASTA scores: opt: 290, E(): 3.6e-16, (54.1% identity in 85 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217382.1" /db_xref="GI:15610003" /db_xref="GeneID:887450" /translation="MPYTVRFTTTARRDLHKLPPRILAAVVEFAFGDLSREPLRVGKP LRRELAGTFSARRGTYRLLYRIDDEHTTVVILRVDHRADIYRR" gene complement(3178458..3179312) /locus_tag="Rv2867c" /db_xref="GeneID:887475" CDS complement(3178458..3179312) /locus_tag="Rv2867c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2867c, (MTV003.13c), len: 284 aa. Conserved hypothetical protein, similar to Q9KYR8|SC5H4.21 HYPOTHETICAL 31.3 KDA PROTEIN from Streptomyces coelicolor (287 aa), FASTA scores: opt: 798, E(): 2.4e-45, (47.95% identity in 269 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217383.1" /db_xref="GI:15610004" /db_xref="GeneID:887475" /translation="MSAPPISRLVGERQVSVVRDAAAVWRVLDDDPIESCMVAARVAD HGIDPNAIGGELWTRRGAHESLCFAGANLIPLRGGPIDLNAFADVAMSTPRRCSSLVG RADLVLPMWQRLEPVWGPARDVRDNQPLMALATHPSCAIDTGVRQVRPEELDSYLVAA VDMFIGEVGVDPRLGDGGRGYRRRVAGLIAAGRAWARFEHGQVIFKAEVGSQSPAVGQ IQGVWVHPEWRGIGLGTAGTATLAAVIVGSGRIASLYVNSFNTVARAAYARVGFKEIG TFATVLLD" gene complement(3179368..3180531) /gene="ispG" /locus_tag="Rv2868c" /gene_synonym="gcpE" /db_xref="GeneID:887463" CDS complement(3179368..3180531) /gene="ispG" /locus_tag="Rv2868c" /gene_synonym="gcpE" /function="NOT YET KNOWN. GCPE IS AN ESSENTIAL GENE." /note="catalyzes the conversion of 2C-methyl-D-erythritol 2,4-cyclodiphosphate into 4-hydroxy-3-methyl-2-en-1-yl diphosphate; involved in isoprenoid synthesis" /codon_start=1 /transl_table=11 /product="4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase" /protein_id="NP_217384.1" /db_xref="GI:15610005" /db_xref="GeneID:887463" /translation="MTVGLGMPQPPAPTLAPRRATRQLMVGNVGVGSDHPVSVQSMCT TKTHDVNSTLQQIAELTAAGCDIVRVACPRQEDADALAEIARHSQIPVVADIHFQPRY IFAAIDAGCAAVRVNPGNIKEFDGRVGEVAKAAGAAGIPIRIGVNAGSLDKRFMEKYG KATPEALVESALWEASLFEEHGFGDIKISVKHNDPVVMVAAYELLAARCDYPLHLGVT EAGPAFQGTIKSAVAFGALLSRGIGDTIRVSLSAPPVEEVKVGNQVLESLNLRPRSLE IVSCPSCGRAQVDVYTLANEVTAGLDGLDVPLRVAVMGCVVNGPGEAREADLGVASGN GKGQIFVRGEVIKTVPEAQIVETLIEEAMRLAAEMGEQDPGATPSGSPIVTVS" gene complement(3180548..3181762) /locus_tag="Rv2869c" /db_xref="GeneID:887449" CDS complement(3180548..3181762) /locus_tag="Rv2869c" /function="UNKNOWN" /note="Rv2869c, (MTV003.15c), len: 404 aa. Probable conserved transmembrane protein, equivalent to Q9CBU4|ML1582 PROBABLE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (404 aa), FASTA scores: opt: 2250, E(): 1.1e-128, (82.2% identity in 404 aa overlap). Also weakly similar to other membrane proteins or hypothetical proteins e.g. Q9A710|CC1916 PUTATIVE MEMBRANE-ASSOCIATED ZINC METALLOPROTEASE from Caulobacter crescentus (398 aa), FASTA scores: opt: 368, E(): 7.8e-15, (28.1% identity in 427 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217385.1" /db_xref="GI:15610006" /db_xref="GeneID:887449" /translation="MMFVTGIVLFALAILISVALHECGHMWVARRTGMKVRRYFVGFG PTLWSTRRGETEYGVKAVPLGGFCDIAGMTPVEELDPDERDRAMYKQATWKRVAVLFA GPGMNLAICLVLIYAIALVWGLPNLHPPTRAVIGETGCVAQEVSQGKLEQCTGPGPAA LAGIRSGDVVVKVGDTPVSSFDEMAAAVRKSHGSVPIVVERDGTAIVTYVDIESTQRW IPNGQGGELQPATVGAIGVGAARVGPVRYGVFSAMPATFAVTGDLTVEVGKALAALPT KVGALVRAIGGGQRDPQTPISVVGASIIGGDTVDHGLWVAFWFFLAQLNLILAAINLL PLLPFDGGHIAVAVFERIRNMVRSARGKVAAAPVNYLKLLPATYVVLVLVVGYMLLTV TADLVNPIRLFQ" gene complement(3181770..3183011) /gene="dxr" /locus_tag="Rv2870c" /db_xref="GeneID:887800" CDS complement(3181770..3183011) /gene="dxr" /locus_tag="Rv2870c" /EC_number="1.1.1.267" /function="INVOLVED IN THE DEOXYXYLULOSE-5-PHOSPHATE PATHWAY (DXP) OF ISOPRENOID BIOSYNTHESIS (AT THE SECOND STEP). CATALYZES THE NADP-DEPENDENT REARRANGEMENT AND REDUCTION OF 1-DEOXY-D-XYLULOSE-5-PHOSPHATE (DXP) TO 2-C-METHYL-D- ERYTHRITOL 4-PHOSPHATE (MEP)." /note="catalyzes the NADP-dependent rearrangement and reduction of 1-deoxy-D-xylulose-5-phosphate (DXP) to 2-C-methyl-D-erythritol 4-phosphate" /codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose 5-phosphate reductoisomerase" /protein_id="NP_217386.2" /db_xref="GI:57117032" /db_xref="GeneID:887800" /translation="MTNSTDGRADGRLRVVVLGSTGSIGTQALQVIADNPDRFEVVGL AAGGAHLDTLLRQRAQTGVTNIAVADEHAAQRVGDIPYHGSDAATRLVEQTEADVVLN ALVGALGLRPTLAALKTGARLALANKESLVAGGSLVLRAARPGQIVPVDSEHSALAQC LRGGTPDEVAKLVLTASGGPFRGWSAADLEHVTPEQAGAHPTWSMGPMNTLNSASLVN KGLEVIETHLLFGIPYDRIDVVVHPQSIIHSMVTFIDGSTIAQASPPDMKLPISLALG WPRRVSGAAAACDFHTASSWEFEPLDTDVFPAVELARQAGVAGGCMTAVYNAANEEAA AAFLAGRIGFPAIVGIIADVLHAADQWAVEPATVDDVLDAQRWARERAQRAVSGMASV AIASTAKPGAAGRHASTLERS" repeat_region 3181794..3181836 /note="(43 bp) part of 51 bp direct repeat, GTGTCGACCCGCTGCGCCCGGCTTCGCCGTGCTTGCGATCGCC" misc_feature complement(3182766..3182798) /gene="dxr" /locus_tag="Rv2870c" /note="PS00133 Zinc carboxypeptidases, zinc-binding region 2 signature" gene 3183138..3183395 /locus_tag="Rv2871" /db_xref="GeneID:887468" CDS 3183138..3183395 /locus_tag="Rv2871" /function="UNKNOWN" /note="Rv2871, (MTCY274.02), len: 85 aa. Conserved hypothetical protein (see citation below), similar to other CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O50456|Rv1241|MTV006.13 (86 aa), FASTA scores: opt: 172, E(): 2.9e-05, (37.2% identity in 86 aa overlap); O53811|Rv0748|MTV041.22 (85 aa), FASTA scores: opt: 170, E(): 4e-05, (35.3% identity in 85 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217387.1" /db_xref="GI:15610008" /db_xref="GeneID:887468" /translation="MRTTIRIDDELYREVKAKAARSGRTVAAVLEDAVRRGLNPPKPQ AAGRYRVQPSGKGGLRPGVDLSSNAALAEAMNDGVSVDAVR" gene 3183382..3183825 /locus_tag="Rv2872" /db_xref="GeneID:887361" CDS 3183382..3183825 /locus_tag="Rv2872" /function="UNKNOWN" /note="Rv2872, (MTCY274.03), len: 147 aa. Conserved hypothetical protein (see citation below), similar to other CONSERVED HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53683|Rv0277c|MTV035.05c (142 aa), FASTA scores: opt: 357, E(): 1.4e-17, (41.45% identity in 140 aa overlap); O53812|Rv0749|MTV041.23 (142 aa), FASTA scores: opt: 350, E(): 4.3e-17, (41.55% identity in 142 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217388.1" /db_xref="GI:15610009" /db_xref="GeneID:887361" /translation="MLCVDVNVLVYAHRADLREHADYRGLLERLANDDEPLGLPDSVL AGFIRVVTNRRVFTEPTSPQDAWQAVDALLAAPAAMRLRPGERHWMAFRQLASDVDAN GNDIADAHLAAYALENNATWLSADRGFARFRRLRWRHPLDGQTHL" gene 3183905..3184567 /gene="mpt83" /locus_tag="Rv2873" /db_xref="GeneID:887155" CDS 3183905..3184567 /gene="mpt83" /locus_tag="Rv2873" /function="NOT REALLY KNOWN." /experiment="experimental evidence, no additional details recorded" /note="Rv2873, (MTCY274.04), len: 220 aa. mpt83 (alternate gene name: mpb83), cell surface lipoprotein (see citations below). Also similar to upstream ORF Q50769|MP70_MYCTU|MPT70|MPB70|Rv2875|MT2943|MTCY274.06 which is also known as MAJOR SECRETED IMMUNOGENIC PROTEIN MPT70 PRECURSOR from Mycobacterium tuberculosis (193 aa), FASTA scores: opt: 806, E(): 2.7e-38, (70.25% identity in 185 aa overlap). BELONGS TO THE MPT70 / MPT83 FAMILY. ATTACHED TO THE MEMBRANE BY A LIPID ANCHOR.; mpb83" /codon_start=1 /transl_table=11 /product="cell surface lipoprotein mpt83 (lipoprotein P23)" /protein_id="NP_217389.1" /db_xref="GI:15610010" /db_xref="GeneID:887155" /translation="MINVQAKPAAAASLAAIAIAFLAGCSSTKPVSQDTSPKPATSPA APVTTAAMADPAADLIGRGCAQYAAQNPTGPGSVAGMAQDPVATAASNNPMLSTLTSA LSGKLNPDVNLVDTLNGGEYTVFAPTNAAFDKLPAATIDQLKTDAKLLSSILTYHVIA GQASPSRIDGTHQTLQGADLTVIGARDDLMVNNAGLVCGGVHTANATVYMIDTVLMPP AQ" gene 3184847..3186934 /gene="dipZ" /locus_tag="Rv2874" /db_xref="GeneID:888162" CDS 3184847..3186934 /gene="dipZ" /locus_tag="Rv2874" /function="COULD BE INVOLVED IN CYTOCHROME-C BIOGENESIS." /note="Rv2874, (MT2942, MTCY274.05), len: 695 aa. Possible dipZ, cytochrome c-type biogenesis protein (see citation below), probable integral membrane protein, similar in part to others or hypothetical proteins e.g. CAC48606|SMB20213 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (627 aa), FASTA scores: opt: 844, E(): 7.3e-43, (32.65% identity in 643 aa overlap); Q9ZMH0|CCDA OR JHP0250 PUTATIVE CYTOCHROME C-TYPE BIOGENESIS PROTEIN from Helicobacter pylori J99 (Campylobacter pylori J99) (239 aa), FASTA scores: opt: 250, E(): 1.4e-07, (27.3% identity in 227 aa overlap); Q9LA04|CCDA C-TYPE CYTOCHROME BIOGENESIS PROTEIN from Rhodobacter capsulatus (Rhodopseudomonas capsulata) (252 aa), FASTA scores: opt: 245, E(): 2.9e-07, (27.85% identity in 244 aa overlap); etc. Also similar to O06393|CCSA|Rv0527|MTCY25D10.06 CYTOCHROME C-TYPE BIOGENESIS PROTEIN from Mycobacterium tuberculosis (259 aa), FASTA scores: opt: 280, E(): 2.4e-09, (29.3% identity in 239 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane C-type cytochrome biogenesis protein DipZ" /protein_id="NP_217390.1" /db_xref="GI:15610011" /db_xref="GeneID:888162" /translation="MVESRRAAAAASAYASRCGIAPATSQRSLATPPTISVPSGEGRC RCHVARGAGRDPRRRLRRRRWCGRCGYHSHLTGGEFDVNRLCQQRSRERSCQLVAVPA DPRPKRQRITDVLTLALVGFLGGLITGISPCILPVLPVIFFSGAQSVDAAQVAKPEGA VAVRRKRALSATLRPYRVIGGLVLSFGMVTLLGSALLSVLHLPQDAIRWAALVALVAI GAGLIFPRFEQLLEKPFSRIPQKQIVTRSNGFGLGLALGVLYVPCAGPILAAIVVAGA TATIGLGTVVLTATFALGAALPLLFFALAGQRIAERVGAFRRRQREIRIATGSVTILL AVALVFDLPAALQRAIPDYTASLQQQISTGTEIREQLNLGGIVNAQNAQLSNCSDGAA QLESCGTAPDLKGITGWLNTPGNKPIDLKSLRGKVVLIDFWAYSCINCQRAIPHVVGW YQAYKDSGLAVIGVHTPEYAFEKVPGNVAKGAANLGISYPIALDNNYATWTNYRNRYW PAEYLIDATGTVRHIKFGEGDYNVTETLVRQLLNDAKPGVKLPQPSSTTTPDLTPRAA LTPETYFGVGKVVNYGGGGAYDEGSAVFDYPPSLAANSFALRGRWALDYQGATSDGND AAIKLNYHAKDVYIVVGGTGTLTVVRDGKPATLPISGPPTTHQVVAGYRLASETLEVR PSKGLQVFSFTYG" gene 3187030..3187611 /gene="mpt70" /locus_tag="Rv2875" /db_xref="GeneID:887724" CDS 3187030..3187611 /gene="mpt70" /locus_tag="Rv2875" /function="NOT REALLY KNOWN." /experiment="experimental evidence, no additional details recorded" /note="Rv2875, (MTCY274.06), len: 193 aa. mpt70 (alternate gene name: mpb70), major secreted immunogenic protein MPT70 precursor (see citations below). Also similar to downstream ORF Q10790|MP83_MYCTU|MPT83|MPB83|Rv2873|MT2940|MTCY274.04 CELL SURFACE LIPOPROTEIN MPT83 PRECURSOR (LIPOPROTEIN P23) (220 aa), FASTA scores: opt: 806, E(): 1.2e-40, (70.25% identity in 185 aa overlap). BELONGS TO THE MPT70 / MPT83 FAMILY. GENERALLY FOUND AS A MONOMER; HOMODIMER IN CULTURE FLUIDS.; mpb70" /codon_start=1 /transl_table=11 /product="major secreted immunogenic protein MPT70" /protein_id="NP_217391.1" /db_xref="GI:15610012" /db_xref="GeneID:887724" /translation="MKVKNTIAATSFAAAGLAALAVAVSPPAAAGDLVGPGCAEYAAA NPTGPASVQGMSQDPVAVAASNNPELTTLTAALSGQLNPQVNLVDTLNSGQYTVFAPT NAAFSKLPASTIDELKTNSSLLTSILTYHVVAGQTSPANVVGTRQTLQGASVTVTGQG NSLKVGNADVVCGGVSTANATVYMIDSVLMPPA" gene 3187663..3187977 /locus_tag="Rv2876" /db_xref="GeneID:887823" CDS 3187663..3187977 /locus_tag="Rv2876" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2876, (MTCY274.07), len: 104 aa. Possible conserved transmembrane protein, equivalent (but longer 16 aa) to Q9CBU2|ML1584 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (84 aa), FASTA scores: opt: 444, E(): 8.3e-26, (73.85% identity in 88 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217392.1" /db_xref="GI:15610013" /db_xref="GeneID:887823" /translation="MFGQWEFDVSPTGGIAVASTEVEHFAGSQHEVDTAEVPSAAWGW SRIDHRTWHIVGLCIFGFLLAMLRGNHVGHVEDWFLITFAAVVLFVLARDLWGRRRGW IR" gene complement(3188008..3188871) /locus_tag="Rv2877c" /db_xref="GeneID:887196" CDS complement(3188008..3188871) /locus_tag="Rv2877c" /function="UNKNOWN; POSSIBLY INVOLVED IN TRANSPORT OF MERCURY ACROSS THE MEMBRANE." /note="Rv2877c, (MTCY274.08c), len: 287 aa. Probable conserved integral membrane protein, Mer family possibly involved in transport of mercury, similar to others, and to the fourth protein of the mercury resistance operon of Streptomyces sp (or other organisms), and to putative cytochrome-c biogenesis proteins e.g. Q9XBD1|CZA382.20C PUTATIVE INTEGRAL MEMBRANE TRANSPORTER from Amycolatopsis orientalis (298 aa), FASTA scores: opt: 913, E(): 7.6e-46, (51.55% identity in 293 aa overlap); P30344|MER4_STRLI MERCURY RESISTANCE PROBABLE HG TRANSPORT PROTEIN from Streptomyces lividans (319 aa), FASTA scores: opt: 427, E(): 1.2e-17, (32.85% identity in 289 aa overlap); Q9M5P3 PUTATIVE CYTOCHROME C BIOGENESIS PROTEIN PRECURSOR from Arabidopsis thaliana (Mouse-ear cress) (354 aa), FASTA scores: opt: 229, E(): 4e-06, (29.85% identity in 221 aa overlap); etc. Contains PS00044 Bacterial regulatory proteins, lysR family signature. Note that previously known as merT.; merT" /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="YP_177912.1" /db_xref="GI:57117033" /db_xref="GeneID:887196" /translation="MNEALIGLAFAAGLVAALNPCGFAMLPAYLLLVVYGQDSAGRTG PLSAVGRAAAATVGMALGFLTVFGIFGALTISAATAVQRYLPYATVLIGLALIALGGW LLLGRGLTALTPRSLGVRWAPTVRLGSMYGYGISYAVASLSCTIGPFLAVTGAGLRGG SVVGSVAIYLAYVAGLTLVVGVLAVAAATASSALADRLRRILPFVNRISGALLVVVGL YVGYYGLYELRLIAGVGANPQDAVIAAAGRLQGALAGWVNQHGAWPWAVLLVVLVVGA FAGTWFRRVRR" misc_feature complement(3188341..3188418) /locus_tag="Rv2877c" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene complement(3188876..3189397) /gene="mpt53" /locus_tag="Rv2878c" /db_xref="GeneID:887184" CDS complement(3188876..3189397) /gene="mpt53" /locus_tag="Rv2878c" /function="NOT REALLY KNOWN. DESPITE A WEAK HOMOLOGY TO THIOREDOXIN THIS CANNOT SERVE AS A SUBSTRATE FOR THIOREDOXIN REDUCTASE. FURTHERMORE IT HAS NO DISULFIDE REDUCING ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="Rv2878c, (MT2946, MTCY274.09c), len: 173 aa. mpt53, secreted protein (contains N-terminal signal sequence) (see citations below). Shows some similarity with several disulfide bond interchange proteins e.g. P43787|THIX_HAEIN THIOREDOXIN-LIKE PROTEIN HI1115 from Haemophilus influenzae (167 aa), FASTA scores: opt: 200, E(): 1.4e-06, (28.9% identity in 135 aa overlap); P52237|TIPB_PSEFL THIOL:DISULFIDE INTERCHANGE PROTEIN TIPB PRECURSOR (CYTOCHROME C BIOGENESIS PROTEIN TIPB) (178 aa), FASTA scores: opt: 184, E(): 1.8e-05, (26.3% identity in 171 aa overlap); etc. Also highly similar to O53924|DSBF|Rv1677|MTV047.12 PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (182 aa), FASTA scores: opt: 482, E(): 5.7e-26, (52.8% identity in 142 aa overlap). COULD BE BELONG TO THE THIOREDOXIN FAMILY. Note that also previously known as dsbE.; dsbE" /codon_start=1 /transl_table=11 /product="soluble secreted antigen MPT53 precursor" /protein_id="NP_217394.1" /db_xref="GI:15610015" /db_xref="GeneID:887184" /translation="MSLRLVSPIKAFADGIVAVAIAVVLMFGLANTPRAVAADERLQF TATTLSGAPFDGASLQGKPAVLWFWTPWCPFCNAEAPSLSQVAAANPAVTFVGIATRA DVGAMQSFVSKYNLNFTNLNDADGVIWARYNVPWQPAFVFYRADGTSTFVNNPTAAMS QDELSGRVAALTS" gene complement(3189583..3190152) /locus_tag="Rv2879c" /db_xref="GeneID:887156" CDS complement(3189583..>3190152) /locus_tag="Rv2879c" /function="UNKNOWN" /note="Rv2879c, (MTCY274.10c), len: 189 aa. Conserved hypothetical protein, similar to others e.g. C-terminus of Q9RVT6|DR0936 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (346 aa), FASTA scores: opt: 505, E(): 1e-26, (46.5% identity in 185 aa overlap); O34617|YLON_BACSU HYPOTHETICAL 41.6 KDA PROTEIN from Bacillus subtilis (363 aa), FASTA scores: opt: 459, E(): 1.2e-24, (40.5% identity in 185 aa overlap); YFGB_ECOLI|P36979 hypothetical 43.1 kDa protein from Escherichia coli (384 aa), FASTA scores, opt: 410, E(): 2.8e-21, (41.7% identity in 187 aa overlap); etc. Appears to be a frame shift with respect to following ORF but we can detect no error in the cosmid sequence to account for this." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217395.1" /db_xref="GI:15610016" /db_xref="GeneID:887156" /translation="WGEPLANYARVLAAVQRITARPPSGFGISARAVTVSTVGLAPAI RNLADARLGVTLALSLHAPDDGLRDTLVPVNNRWRISEALDAARYYANVTGRRVSIEY ALIRDVNDQPWRADLLGKRLHRVLGPLAHVNLIPLNPTPGSDWDASPKPVEREFVKRV RAKGVSCTVRDTRGREISAACGQLAAVGG" gene complement(3189851..3190678) /locus_tag="Rv2880c" /db_xref="GeneID:887810" CDS complement(3189851..3190678) /locus_tag="Rv2880c" /function="UNKNOWN" /note="Rv2880c, (MTCY274.11c), len: 275 aa. Conserved hypothetical protein, highly similar in N-terminus to others e.g. O86754|SC6A9.22c HYPOTHETICAL 40.4 KDA PROTEIN from Streptomyces coelicolor (368 aa), FASTA scores: opt: 663, E(): 2.6e-33, (52.6% identity in 213 aa overlap); Q55880|Y098_SYNY3|SLL0098 HYPOTHETICAL 38.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (350 aa), FASTA scores: opt: 362, E(): 7.3e-15, (38.9% identity in 162 aa overlap); O66732|AQ_416 HYPOTHETICAL 40.2 KDA PROTEIN from Aquifex aeolicus (348 aa), FASTA scores: opt: 321, E(): 2.4e-12, (39.75% identity in 146 aa overlap); etc. Appears to be a frame shift with respect to preceding ORF but we can detect no error in the cosmid sequence to account for this." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217396.1" /db_xref="GI:15610017" /db_xref="GeneID:887810" /translation="MVPELMFDEPRPGRPPRHLADLDAAGRASAVAELGLPAFRAKQL AHQYYGRLIADPRQMTDLPAAVRDRIAGAMFPNLLTASADITCDAGQTRKTLWRAVDG TMFESVLMRYPRRNTVCISSQAGCGMACPFCATGQGGLTRNLSTAEILEQVRAGAAAL RDDFGDRLSNVVFMGMGGAAGQLRQGVGRSSAHYRAAAVRFRDFGPRGDGVDGGSGPC YPQPCRRAARRDPGAVAARPRRRVARYTSSGQQPVEDQRSARCGPVLRQCDRATGVY" gene complement(3190701..3191621) /gene="cdsA" /locus_tag="Rv2881c" /db_xref="GeneID:888910" CDS complement(3190701..3191621) /gene="cdsA" /locus_tag="Rv2881c" /EC_number="2.7.7.41" /function="INVOLVED IN THE PHOSPHOLIPID BIOSYNTHESIS [CATALYTIC ACTIVITY: CTP + PHOSPHATIDATE = PYROPHOSPHATE + CDP-DIACYLGLYCEROL]." /note="Rv2881c, (MTCY274.12c), len: 306 aa. Probable cdsA, phosphatidate cytidylyltransferase (EC 2.7.7.41), integral membrane protein, equivalent to Q9CBU1|CDSA_MYCLE|ML1589 PHOSPHATIDATE CYTIDYLYLTRANSFERASE from Mycobacterium leprae (312 aa), FASTA scores: opt: 1470, E(): 1.1e-84, (70.3% identity in 313 aa overlap). Also similar to others e.g. Q9KPV7|VC2255 from Vibrio cholerae (280 aa), FASTA scores: opt: 383, E(): 1.1e-16, (29.3% identity in 280 aa overlap); Q9CDT2|CDSA from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (267 aa), FASTA scores: opt: 361, E(): 2.6e-15, (29.05% identity in 265 aa overlap); P06466|CDSA_ECOLI|CDS|B0175|Z0186|ECS0177 from Escherichia coli strains K12 and O157:H7 (249 aa), FASTA scores: opt: 352, E(): 9.2e-15, (40.4% identity in 156 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE CDS FAMILY." /codon_start=1 /transl_table=11 /product="integral membrane phosphatidate cytidylyltransferase CdsA" /protein_id="NP_217397.1" /db_xref="GI:15610018" /db_xref="GeneID:888910" /translation="MTTNDAGTGNPAEQPARGAKQQPATETSRAGRDLRAAIVVGLSI GLVLIAVLVFVPRVWVAIVAVATLVATHEVVRRLREAGYLIPVIPLLIGGQAAVWLTW PFGAVGALAGFGGMVVVCMIWRLFMQDSVTRPTTGGAPSPGNYLSDVSATVFLAVWVP LFCSFGAMLVYPENGSGWVFCMMIAVIASDVGGYAVGVLFGKHPMVPTISPKKSWEGF AGSLVCGITATIITATFLVGKTPWIGALLGVLFVLTTALGDLVESQVKRDLGIKDMGR LLPGHGGLMDRLDGILPSAVAAWIVLTLLP" misc_feature complement(3190902..3190925) /gene="cdsA" /locus_tag="Rv2881c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3191644..3192201) /gene="frr" /locus_tag="Rv2882c" /db_xref="GeneID:887464" CDS complement(3191644..3192201) /gene="frr" /locus_tag="Rv2882c" /function="RESPONSIBLE FOR THE RELEASE OF RIBOSOMES FROM MESSENGER RNA AT THE TERMINATION OF PROTEIN BIOSYNTHESIS. MAY INCREASE THE EFFICIENCY OF TRANSLATION BY RECYCLING RIBOSOMES FROM ONE ROUND OF TRANSLATION TO ANOTHER." /experiment="experimental evidence, no additional details recorded" /note="Rrf; Frr; ribosome-recycling factor; release factor 4; RF4; recycles ribosomes upon translation termination along with release factor RF-3 and elongation factor EF-G; A GTPase-dependent process results in release of 50S from 70S; inhibited by release factor RF-1; essential for viability; structurally similar to tRNAs" /codon_start=1 /transl_table=11 /product="ribosome recycling factor" /protein_id="NP_217398.1" /db_xref="GI:15610019" /db_xref="GeneID:887464" /translation="MIDEALFDAEEKMEKAVAVARDDLSTIRTGRANPGMFSRITIDY YGAATPITQLASINVPEARLVVIKPYEANQLRAIETAIRNSDLGVNPTNDGALIRVAV PQLTEERRRELVKQAKHKGEEAKVSVRNIRRKAMEELHRIRKEGEAGEDEVGRAEKDL DKTTHQYVTQIDELVKHKEGELLEV" repeat_region complement(3192202..3192254) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(3192255..3192307) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region complement(3192308..3192360) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene complement(3192373..3193158) /gene="pyrH" /locus_tag="Rv2883c" /db_xref="GeneID:887709" CDS complement(3192373..3193158) /gene="pyrH" /locus_tag="Rv2883c" /EC_number="2.7.4.-" /function="URIDINE MONOPHOSPHATE KINASE [CATALYTIC ACTIVITY: ATP + UMP = ADP + UDP]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the phosphorylation of UMP to UDP" /codon_start=1 /transl_table=11 /product="uridylate kinase" /protein_id="NP_217399.1" /db_xref="GI:15610020" /db_xref="GeneID:887709" /translation="MTEPDVAGAPASKPEPASTGAASAAQLSGYSRVLLKLGGEMFGG GQVGLDPDVVAQVARQIADVVRGGVQIAVVIGGGNFFRGAQLQQLGMERTRSDYMGML GTVMNSLALQDFLEKEGIVTRVQTAITMGQVAEPYLPLRAVRHLEKGRVVIFGAGMGL PYFSTDTTAAQRALEIGADVVLMAKAVDGVFAEDPRVNPEAELLTAVSHREVLDRGLR VADATAFSLCMDNGMPILVFNLLTDGNIARAVRGEKIGTLVTT" gene 3193393..3194151 /locus_tag="Rv2884" /db_xref="GeneID:887443" CDS 3193393..3194151 /locus_tag="Rv2884" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv2884, (MTCY274.15), len: 252 aa. Probable transcriptional regulatory protein, highly similar to others e.g. Q05943|GLNR_STRCO|SCD84.26c TRANSCRIPTIONAL REGULATORY PROTEIN from Streptomyces coelicolor (267 aa), FASTA scores: opt: 609, E(): 2.7e-34, (46.4% identity in 224 aa overlap); Q55733|SLL0396 REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEM from Synechocystis sp. strain PCC 6803 (224 aa), FASTA scores: opt: 330, E(): 3e-15, (31.8% identity in 217 aa overlap); Q9A4S3|CC2757 DNA-BINDING RESPONSE REGULATOR from Caulobacter crescentus (223 aa), FASTA scores: opt: 311, E(): 6e-14, (30.3% identity in 221 aa overlap); etc. Also highly similar to O53830|Rv0818|MTV043.10 PUTATIVE REGULATORY PROTEIN from Mycobacterium tuberculosis (255 aa), FASTA scores: opt: 665, E(): 3.8e-38, (47.6% identity in 227 aa overlap). THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217400.1" /db_xref="GI:15610021" /db_xref="GeneID:887443" /translation="MPTGPTTGKWHPHEVWRYLLEVLLLTDEADLESALPELESFAQS VQRAPLDDPGAAKGADADVAIIDARADLAAARRVCRRLTTSAPALAVVAVVAPANFVA VDGDWIFDDVLLNAAGGAELQARLRLAITRRRSTLAGTLQFGDLVLHPASYTASLGDR DLGLTLTEFKLMNFLVQHAGRAFTRTRLMREVWGYECHGRIRTVDVHVRRLRAKLGAE HESMIDTVRGVGYMAVTPPQPRWIISESILNRCK" repeat_region complement(3194166..3196432) /note="IS1539, len: 2267 bp. Insertion sequence IS1539." /mobile_element="insertion sequence:IS1539" gene complement(3194166..3195548) /locus_tag="Rv2885c" /db_xref="GeneID:887173" CDS complement(3194166..3195548) /locus_tag="Rv2885c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1539." /note="Rv2885c, (MTCY274.16c), len: 460 aa. Probable transposase for IS1539. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217401.1" /db_xref="GI:15610022" /db_xref="GeneID:887173" /translation="MMARLKVPEGWCVQAFRFTLNPTQTQAASLARHFGARRKAFNWT VTALKADIKAWRADGTESAKPSLRVLRKRWNTVKDQVCVNAQTGQVWWPECSKEAYAD GIAGAVDAYWNWQSCRAGKRAGKTVGVPRFKKKGRDADRVCFTTGAMRVEPDRRHLTL PVIGTIRTYENTRRVERLIAKGRARVLAITVRRNGTRLDASVRVLVQRPQQRRVALPD SRVGVDVGVRRLATVADAEGTVLEQVPNPRPLDAALRGLRRVSRARSRCTKGSRRYCE RTTELSRLHRRVNDVRTHHLHVLTTRLAKTHGRIVVEGLDAAGMLRQKGLPGARARRR ALSDAALATPRRHLSYKTGWYGSSLVVADRWFPSSKTCHACRHVQDIGWDEKWQCDGC SITHQRDDNAAINLARYEEPPSVVGPVGAAVKRGADRKTGPGPAGGREARKATGHPAG EQPRDGVQVK" misc_feature complement(3195171..3195194) /locus_tag="Rv2885c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3195545..3196432) /locus_tag="Rv2886c" /db_xref="GeneID:887424" CDS complement(3195545..3196432) /locus_tag="Rv2886c" /function="PREVENTS THE COINTEGRATION OF FOREIGN DNA BEFORE INTEGRATION INTO THE CHROMOSOME." /note="Rv2886c, (MTCY274.17c), len: 295 aa. Probable resolvase for IS1539. Contains PS00213 Lipocalin signature." /codon_start=1 /transl_table=11 /product="resolvase" /protein_id="NP_217402.1" /db_xref="GI:15610023" /db_xref="GeneID:887424" /translation="MSRILTHVPGRTVNRSYALPALVGSAAGRLSGNHSHGREAYIAL PQWACSRQPSTPPLQTPGRINALWSLRPVLPMPGRGCQLLRLGGRWLSVVCCRNGSMN LVVWAEGNGVARVIAYRWLRVGRLPVPARRVGRVILVDEPAGQPGRWGRTAVCARLSS ADQKVDLDRQVVGVTAWATAEQIPVGKVVTEVGSALYGRRRTFLTLLGDPTVRRIVMK RRDRLGRFGFECVQAVLAADGRELVVVDSADVDDDVVGDITEILTSICARLYGKRAAG NRAARAVAAAARAGGHEAR" misc_feature complement(3196154..3196195) /locus_tag="Rv2886c" /note="PS00213 Lipocalin signature" gene 3196431..3196850 /locus_tag="Rv2887" /db_xref="GeneID:888563" CDS 3196431..3196850 /locus_tag="Rv2887" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv2887, (MTCY274.18), len: 139 aa. Probable transcriptional regulatory protein, highly similar to Q9EX59|SC1A4.04 PUTATIVE MARR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (151 aa), FASTA scores: opt: 354, E(): 6.6e-16, (42.95% identity in 135 aa overlap); and similar to others e.g. AAF97817|SLYA TRANSCRIPTIONAL REGULATOR SLYA from Escherichia coli strain EPEC 2348/69 (146 aa), FASTA scores: opt: 181, E(): 0.0001, (27.25% identity in 132 aa overlap); P55740|SLYA_ECOLI|AAG56631|B1642|Z2657|ECS2351 TRANSCRIPTIONAL REGULATOR SLYA from Escherichia coli strains K12 and O157:H7 (146 aa), FASTA scores: opt: 177, E(): 0.00018, (27.25% identity in 132 aa overlap) ; etc. Contains probable helix-turn-helix motif at aa 50-71 (Score 1182, +3.21 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217403.1" /db_xref="GI:15610024" /db_xref="GeneID:888563" /translation="MGLADDAPLGYLLYRVGAVLRPEVSAALSPLGLTLPEFVCLRML SQSPGLSSAELARHASVTPQAMNTVLRKLEDAGAVARPASVSSGRSLPATLTARGRAL AKRAEAVVRAADARVLARLTAPQQREFKRMLEKLGSD" gene complement(3196864..3198285) /gene="amiC" /locus_tag="Rv2888c" /db_xref="GeneID:887401" CDS complement(3196864..3198285) /gene="amiC" /locus_tag="Rv2888c" /EC_number="3.5.1.4" /function="HYDROLYZES A MONOCARBOXYLIC ACID AMIDE AND GENERATES A MONOCARBOXYLATE [CATALYTIC ACTIVITY: A MONOCARBOXYLIC ACID AMIDE + H(2)O = A MONOCARBOXYLATE + NH(3)]." /note="catalyzes the hydrolysis of a monocarboxylic acid amid to form a monocarboxylate and ammonia" /codon_start=1 /transl_table=11 /product="amidase" /protein_id="NP_217404.1" /db_xref="GI:15610025" /db_xref="GeneID:887401" /translation="MSRVHAFVDDALGDLDAVALADAIRSGRVGRADVVEAAIARAEA VNPALNALAYAAFDVARDAAAMGTGQEAFFSGVPTFIKDNVDVAGQPSMHGTDAWEPY AAVADSEITRVVLGTGLVSLGKTQLSEFGFSAVAEHPRLGPVRNPWNTDYTAGASSSG SGALVAAGVVPIAHANDGGGSIRIPAACNGLVGLKPSRGRLPLEPEYRRLPVGIVANG VLTRTVRDTAAFYREAERLWRNHQLPPVGDVTSPVKQRLRIAVVTRSVLREASPEVRQ LTLKLAGLLEELGHRVEHVDHPPAPASFVDDFVLYWGFLALAQVRSGRRTFGRTFDPT RLDELTLGLARHTGRNLHRLPLAIMRLRMLRRRSVRFFGTYDVLLTPTVAEATPQVGY LAPTDYQTVLDRLSSWVVFTPVQNVTGVPAISLPLAQSADGMPVGMMLSADTGREALL LELAYELEEARPWARIHAPNIAE" misc_feature complement(3197911..3197934) /gene="amiC" /locus_tag="Rv2888c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3198292..3199107) /gene="tsf" /locus_tag="Rv2889c" /db_xref="GeneID:888187" CDS complement(3198292..3199107) /gene="tsf" /locus_tag="Rv2889c" /function="ASSOCIATES WITH THE EF-TU.GDP COMPLEX AND INDUCES THE EXCHANGE OF GDP TO GTP, IT REMAINS BOUND TO THE AMINOACYL-TRNA. EF-TU.GTP COMPLEX UP TO THE GTP HYDROLYSIS STAGE ON THE RIBOSOME." /experiment="experimental evidence, no additional details recorded" /note="EF-Ts; functions during elongation stage of protein translation; forms a dimer; associates with EF-Tu-GDP complex and promotes exchange of GDP to GTP resulting in regeneration of the active form of EF-Tu" /codon_start=1 /transl_table=11 /product="elongation factor Ts" /protein_id="NP_217405.1" /db_xref="GI:15610026" /db_xref="GeneID:888187" /translation="MANFTAADVKRLRELTGAGMLACKNALAETDGDFDKAVEALRIK GAKDVGKRAERATAEGLVAAKDGALIELNCETDFVAKNAEFQTLADQVVAAAAAAKPA DVDALKGASIGDKTVEQAIAELSAKIGEKLELRRVAIFDGTVEAYLHRRSADLPPAVG VLVEYRGDDAAAAHAVALQIAALRARYLSRDDVPEDIVASERRIAEETARAEGKPEQA LPKIVEGRLNGFFKDAVLLEQASVSDNKKTVKALLDVAGVTVTRFVRFEVGQA" misc_feature complement(3198865..3198897) /gene="tsf" /locus_tag="Rv2889c" /note="PS01127 Elongation factor Ts signature 2" gene complement(3199119..3199982) /gene="rpsB" /locus_tag="Rv2890c" /db_xref="GeneID:887187" CDS complement(3199119..3199982) /gene="rpsB" /locus_tag="Rv2890c" /function="INVOLVED IN TRANSLATION MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="one of the last subunits in the assembly of the 30S subunit; absence of S2 does not inhibit assembly but results in an inactive subunit" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S2" /protein_id="NP_217406.1" /db_xref="GI:15610027" /db_xref="GeneID:887187" /translation="MAVVTMKQLLDSGTHFGHQTRRWNPKMKRFIFTDRNGIYIIDLQ QTLTFIDKAYEFVKETVAHGGSVLFVGTKKQAQESVAAEATRVGMPYVNQRWLGGMLT NFSTVHKRLQRLKELEAMEQTGGFEGRTKKEILGLTREKNKLERSLGGIRDMAKVPSA IWVVDTNKEHIAVGEARKLGIPVIAILDTNCDPDEVDYPIPGNDDAIRSAALLTRVIA SAVAEGLQARAGLGRADGKPEAEAAEPLAEWEQELLASATASATPSATASTTALTDAP AGATEPTTDAS" misc_feature complement(3199428..3199472) /gene="rpsB" /locus_tag="Rv2890c" /note="PS00211 ABC transporters family signature" misc_feature complement(3199932..3199967) /gene="rpsB" /locus_tag="Rv2890c" /note="PS00962 Ribosomal protein S2 signature 1" gene 3200266..3201015 /locus_tag="Rv2891" /db_xref="GeneID:888328" CDS 3200266..3201015 /locus_tag="Rv2891" /function="UNKNOWN" /note="Rv2891, (MTCY274.22), len: 249 aa (C-terminus overlaps neigbouring ORF). Conserved hypothetical protein, similar in N-terminus to O69910|SC2E1.40c HYPOTHETICAL 22.8 KDA PROTEIN from Streptomyces coelicolor (226 aa), FASTA scores: opt: 315, E(): 3.4e-11, (40.7% identity in 145 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217407.1" /db_xref="GI:15610028" /db_xref="GeneID:888328" /translation="MAKSPARRCTAKVRRVLSRSVLILCWSLLGAAPAHADDSRLGWP LRPPPAVVRQFDAASPNWNPGHRGVDLAGRPGQPVYAAGSATVVFAGLLAGRPVVSLA HPGGLRTSYEPVVAQVRVGQPVSAPTVIGALAAGHPGCQAAACLHWGAMWGPASGANY VDPLGLLKSTPIRLKPLSSEGRTLHYRQAEPVFVNEAAAGALAGAGHRKSPKQGVFRG AAQGGDIVARQPPGRWVCPSSAGGPIGWHRQ" gene complement(3200794..3202020) /gene="PPE45" /locus_tag="Rv2892c" /db_xref="GeneID:887824" CDS complement(3200794..3202020) /gene="PPE45" /locus_tag="Rv2892c" /function="UNKNOWN" /note="Rv2892c, (MTCY274.23c), len: 408 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O06386|Rv3621c|MTCY15C10.31|MTCY07H7B.01 from M. tuberculosis (413 aa), FASTA scores: opt: 957, E(): 6.2e-46, (44.7% identity in 423 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177913.1" /db_xref="GI:57117034" /db_xref="GeneID:887824" /translation="MDFGVLPPEINSGRMYAGPGSGPMMAAAAAWDSLAAELGLAAGG YRLAISELTGAYWAGPAAASMVAAVTPYVAWLSATAGQAEQAGMQARAAAAAYELAFA MTVPPPVVVANRALLVALVATNFFGQNTPAIAATEAQYAEMWAQDAAAMYAYAGSAAI ATELTPFTAAPVTTSPAALAGQAAATVSSTVPPLATTAAVPQLLQQLSSTSLIPWYSA LQQWLAENLLGLTPDNRMTIVRLLGISYFDEGLLQFEASLAQQAIPGTPGGAGDSGSS VLDSWGPTIFAGPRASPSVAGGGAVGGVQTPQPYWYWALDRESIGGSVSAALGKGSSA GSLSVPPDWAARARWANPAAWRLPGDDVTALRGTAENALLRGFPMASAGQSTGGGFVH KYGFRLAVMQRPPFAG" gene 3202420..3203397 /locus_tag="Rv2893" /db_xref="GeneID:887337" CDS 3202420..3203397 /locus_tag="Rv2893" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2893, (MTCY274.24), len: 325 aa. Possible oxidoreductase (EC 1.-.-.-), showing similarity with various proteins and/or oxidoreductases e.g. Q9AE05|RIF11 eleventh protein in the rif biosynthetic gene cluster from Amycolatopsis mediterranei (Nocardia mediterranei) (294 aa), FASTA scores: opt: 270, E(): 4.8e-10, (34.5% identity in 313 aa overlap); O52567 REDUCTASE from Amycolatopsis mediterranei (Nocardia mediterranei) (153 aa), FASTA scores: opt: 251, E(): 5e-09, (42.4% identity in 125 aa overlap); Q58929|MER|MJ1534 F420-DEPENDENT METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE (EC 1.5.99.-) from Methanococcus jannaschii (331 aa), FASTA scores: opt: 249, E(): 1.2e-08, (29.7% identity in 283 aa overlap); etc. Also some similarity with others proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. P71844|Rv0791c|MTCY369.35c PUTATIVE OXIDOREDUCTASE (347 aa), FASTA scores: opt: 264, E(): 1.3e-09, (29.05% identity in 272 aa overlap); and P96809|Rv0132|MTCI5.06c PUTATIVE OXIDOREDUCTASE (360 aa), FASTA scores: opt: 260, E(): 2.4e-09, (33.05% identity in 239 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217409.1" /db_xref="GI:15610030" /db_xref="GeneID:887337" /translation="MTVASTAHHTRRLRFGLAAPLPRAGTQMRAFAQAVEAAGFDVLA FPDHLVPSVSPFAGATAAAMATQRLHTGTLVLNNDFRHPVDTAREAAGVATLAEGRFE LGLGAGHRRSEYDAAGITFDSGATRVARLIESAHLIRALLDAEPVDFDGQHYRVHAEA GSLVAPPKVRVPLLVGGNGTEVLRLGGRIADIVGLAGISHNRDATQVRFTHFDADGLA DRIAVVRHAAGDRFEAIELNALIQAVVCTNDRNAAAAELAATLGGITPEQVLESPFLL LGTHEQMAEALAARQRRFGVSYWTVFDEWAGRASAMRDIAEVIALLRYG" gene complement(3203394..3204290) /gene="xerC" /locus_tag="Rv2894c" /db_xref="GeneID:887392" CDS complement(3203394..3204290) /gene="xerC" /locus_tag="Rv2894c" /function="PARTICIPATES IN THE SITE-SPECIFIC RECOMBINATION. ACTS BY CATALYZING THE CUTTING AND REJOINING OF THE RECOMBINATING DNA MOLECULES. ACTS JOINTLY WITH XERD." /note="site-specific tyrosine recombinase which cuts and rejoins DNA molecules; binds cooperatively to specific DNA consensus sites; forms a heterotetrameric complex with XerC; XerCD exhibit similar sequences; essential to convert chromosome dimers to monomers during cell division and functions during plasmid segregation; cell division protein FtsK may regulate the XerCD complex; enzyme from Streptococcus group has unusual active site motifs" /codon_start=1 /transl_table=11 /product="site-specific tyrosine recombinase XerC" /protein_id="NP_217410.1" /db_xref="GI:15610031" /db_xref="GeneID:887392" /translation="MQAILDEFDEYLALQCGRSVHTRRAYLGDLRSLFAFLADRGSSL DALTLSVLRSWLAATAGAGAARTTLARRTSAVKAFTAWAVRRGLLAGDPAARLQVPKA RRTLPAVLRQDQALRAMAAAESGAEQGDPLALRDRLIVELLYATGIRVSELCGLDVDD IDTGHRLVRVLGKGNKQRTVPFGQPAADALHAWLVDGRRALVTAESGHALLLGARGRR LDVRQARTAVHQTVAAVDGAPDMGPHGLRHSAATHLLEGGADLRVVQELLGHSSLATT QLYTHVAVARLRAVHERAHPRA" gene complement(3204381..3205232) /gene="viuB" /locus_tag="Rv2895c" /db_xref="GeneID:888180" CDS complement(3204381..3205232) /gene="viuB" /locus_tag="Rv2895c" /function="THOUGHT TO BE INVOLVED IN INTRACELLULAR REMOVAL OF IRON FROM IRON-MYCOBACTIN COMPLEX. MYCOBACTIN IS AN IRON-CHELATING COMPOUND INVOLVED IN THE TRANSPORT OF IRON FROM THE BACTERIAL ENVIRONMENT INTO THE CELL CYTOPLASM." /note="Rv2895c, (MT2963, MTCY274.26c), len: 283 aa. Possible viuB, mycobactin utilization protein, highly similar to Q9RJ78|SCI41.06 HYPOTHETICAL 31.5 KDA PROTEIN from Streptomyces coelicolor (280 aa), FASTA scores: opt: 639, E(): 5.1e-32, (46.3% identity in 285 aa overlap); and similar to other proteins e.g. Q9F641|MXCB protein of the biosynthetic gene cluster of the myxochelin-type iron chelator from Stigmatella aurantiaca (270 aa), FASTA scores: opt: 417, E(): 2.2e-18, (34.2% identity in 263 aa overlap); Q56646|VIUB_VIBCH|VC2210 VIBRIOBACTIN UTILIZATION PROTEIN from Vibrio cholerae (271 aa), FASTA scores: opt: 395, E(): 5.1e-17, (31.0% identity in 274 aa overlap); Q56743|VIUB_VIBVU VULNIBACTIN UTILIZATION PROTEIN V from Vibrio vulnificus (271 aa), FASTA scores: opt: 390, E(): 1e-16, (33.95% identity in 274 aa overlap); etc. Equivalent to AAK47289 from Mycobacterium tuberculosis strain CDC1551 (321 aa) but shorter 38 aa." /codon_start=1 /transl_table=11 /product="mycobactin utilization protein ViuB" /protein_id="NP_217411.1" /db_xref="GI:15610032" /db_xref="GeneID:888180" /translation="MAGRPLHAFEVVATRHLAPHMVRVVLGGSGFDTFVPSDFTDSYI KLVFVDDDVDVGRLPRPLTLDSFADLPTAKRPPVRTMTVRHVDAAAREIAVDIVLHGE HGVAGPWAAGAQRGQPIYLMGPGGAYAPDPAADWHLLAGDESAIPAIAAALEALPPDA IGRAFIEVAGPDDEIGLTAPDAVEVNWVYRGGRADLVPEDRAGDHAPLIEAVTTTAWL PGQVHVFIHGEAQAVMHNLRPYVRNERGVDAKWASSISGYWRRGRTEEMFRKWKKELA EAEAGTH" gene complement(3205265..3206434) /locus_tag="Rv2896c" /db_xref="GeneID:887177" CDS complement(3205265..3206434) /locus_tag="Rv2896c" /function="UNKNOWN" /note="Rv2896c, (MTCY274.27c), len: 389 aa. Conserved hypothetical protein, similar to others proteins e.g. Q9ZJ08|FIR2 from Rhodococcus fascians (293 aa), FASTA scores: opt: 663, E(): 3.3e-32, (43.7% identity in 286 aa overlap); O69892|SC2E1.21 HYPOTHETICAL 37.9 KDA PROTEIN from Streptomyces coelicolor (382 aa), FASTA scores: opt: 600, E(): 2.2e-28, (46.45% identity in 267 aa overlap); Q9JWZ4|DPRA|NMA0158 DPRA HOMOLOG from Neisseria meningitidis (serogroup A) (395 aa), FASTA scores: opt: 495, E(): 4.1e-22, (34.6% identity in 347 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217412.1" /db_xref="GI:15610033" /db_xref="GeneID:887177" /translation="MIDPTARAWAYLSRVAEPPCAQLAALVRCVGPVEAADRVRRGQV GNELAQHTGARREIDRAADDLELLMRRGGRLITPDDDEWPVLAFAAFSGAGARARPCG HSPLVLWALGPARLDEVAPRAAAVVGTRAATAYGEHVAADLAAGLAERDVSVVSGGAY GIDGAAHRAALDSEGITVAVLAGGFDIPYPAGHSALLHRIAQHGVLFTEYPPGVRPAR HRFLTRNRLVAAVARAAVVVEAGLRSGAANTAAWARALGRVVAAVPGPVTSSASAGCH TLLRHGAELVTRADDIVEFVGHIGELAGDEPRPGAALDVLSEAERQVYEALPGRGAAT IDEIAVGSGLLPAQVLGPLAILEVAGLAECRDGRWRILRAGAGQAAAKGAAARLV" gene complement(3206431..3207942) /locus_tag="Rv2897c" /db_xref="GeneID:888212" CDS complement(3206431..3207942) /locus_tag="Rv2897c" /function="UNKNOWN" /note="Rv2897c, (MTCY274.28c), len: 503 aa. Conserved hypothetical protein, possibly Mg-chelatase, highly similar to hypothetical proteins and chelatases e.g. Q9RTV0|DR1656 MG(2+) CHELATASE FAMILY PROTEIN from Deinococcus radiodurans (519 aa), FASTA scores: opt: 1333, E(): 3.6e-68, (46.55% identity in 505 aa overlap);Q55372|SLR0904 HYPOTHETICAL 55.1 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (509 aa), FASTA scores: opt: 1271, E(): 1.2e-64, (42.65% identity in 504 aa overlap); Q9HTR4|PA5290 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (497 aa), FASTA scores: opt: 1248, E(): 2.3e-63, (45.9% identity in 503 aa overlap); Q9K0Z6|COMM|NMB0405 COMPETENCE PROTEIN (MG-CHELATASE) from Neisseria meningitidis (serogroup B), FASTA scores: opt: 1229, E(): 2.8e-62, (43.2% identity in 509 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217413.1" /db_xref="GI:15610034" /db_xref="GeneID:888212" /translation="MALGRAFSVAVRGLDGEIVEIEADITSGLPGVHLVGLPDAALQE SRDRVRAAVTNCGNSWPMARLTLALSPATLPKMGSVYDIALAAAVLSAQQKKPWERLE NTLLLGELSLDGRVRPVRGVLPAVLAAKRDGWPAVVVPADNLPEASLVDGIDVRGVRT LGQLQSWLRGSTGLAGRITTADTTPESAADLADVVGQSQARFAVEVAAAGAHHLMLTG PPGVGKTMLAQRLPGLLPSLSGSESLEVTAIHSVAGLLSGDTPLITRPPFVAPHHSSS VAALVGGGSGMARPGAVSRAHRGVLFLDECAEISLSALEALRTPLEDGEIRLARRDGV ACYPARFQLVLAANPCPCAPADPQDCICAAATKRRYLGKLSGPLLDRVDLRVQMHRLR AGAFSAADGESTSQVRQRVALAREAAAQRWRPHGFRTNAEVSGPLLRRKFRPSSAAML PLRTALDRGLLSIRGVDRTLRVAWSLADLAGRTSPGIDEVAAALSFRQTGARR" misc_feature complement(3207268..3207291) /locus_tag="Rv2897c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3207942..3208328) /locus_tag="Rv2898c" /db_xref="GeneID:888507" CDS complement(3207942..3208328) /locus_tag="Rv2898c" /function="UNKNOWN" /note="Rv2898c, (MTCY274.29c), len: 128 aa. Conserved hypothetical protein, highly similar to O33024|YS98_MYCLE|ML1607|MLCB250.49 HYPOTHETICAL 11.0 KDA PROTEIN from Mycobacterium leprae (96 aa), FASTA scores: opt: 318, E(): 2.3e-16, (58.35% identity in 96 aa overlap). Also similar to other hypothetical proteins e.g. O69890|YE19_STRCO|SC2E1.19 from Streptomyces coelicolor (130 aa), FASTA scores: opt: 253, E(): 1.7e-11, (39.65% identity in 121 aa overlap); Q9HVZ1|PA4424 from Pseudomonas aeruginosa (125 aa), FASTA scores: opt: 234, E(): 4.2e-10, (40.85% identity in 115 aa overlap); O86871 from Streptomyces lividans (85 aa), FASTA scores: opt: 224, E(): 1.8e-09, (46.45% identity in 84 aa overlap); etc. Equivalent to AAK47292 from Mycobacterium tuberculosis strain CDC1551 (141 aa) but shorter 13 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217414.1" /db_xref="GI:15610035" /db_xref="GeneID:888507" /translation="MTTLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELD VIACDAATRTVVFVEVKTRTGDGYGGLAHAVTERKVRRLRRLAGLWLADQEERWAAVR IDVIGVRVGPKNSGRTPELTHLQGIG" gene complement(3208576..3209406) /gene="fdhD" /locus_tag="Rv2899c" /db_xref="GeneID:888201" CDS complement(3208576..3209406) /gene="fdhD" /locus_tag="Rv2899c" /function="NECESSARY FOR FORMATE DEHYDROGENASE ACTIVITY" /note="involved in the production or activity of formate dehydrogenase-H which is active when nitrate is not present during anaerobic growth" /codon_start=1 /transl_table=11 /product="formate dehydrogenase accessory protein" /protein_id="NP_217415.1" /db_xref="GI:15610036" /db_xref="GeneID:888201" /translation="MGYATAHRRVRHLSADQVITRPETLAVEEPLEIRVNGTPVTVTM RTPGSDFELVQGFLLAEGVVAHREDVLTVSYCGRRVEGNATGASTYNVLDVALAPGVK PPDVDVTRTFYTTSSCGVCGKASLQAVSQVSRFAPGGDPATVAADTLKAMPDQLRRAQ KVFARTGGLHAAALFGVDGAMLAVREDIGRHNAVDKVIGWAFERDRIPLGASVLLVSG RASFELTQKALMAGIPVLAAVSAPSSLAVSLADASGITLVAFLRGDSMNVYTRADRIT" gene complement(3209406..3211745) /gene="fdhF" /locus_tag="Rv2900c" /db_xref="GeneID:887987" CDS complement(3209406..3211745) /gene="fdhF" /locus_tag="Rv2900c" /EC_number="1.2.1.2" /function="DECOMPOSES FORMIC ACID TO HYDROGEN AND CARBON DIOXIDE UNDER ANAEROBIC CONDITIONS IN THE ABSENCE OF EXOGENOUS ELECTRON ACCEPTORS [CATALYTIC ACTIVITY: FORMATE + NAD(+) = CO(2) + NADH]." /note="Rv2900c, (MTCY274.31c), len: 779 aa. Possible fdhF, formate dehydrogenase (EC 1.2.1.2), highly similar to others formate dehydrogenases and prokaryotic molybdopterin-containing oxidoreductases e.g. Q9S2J9|SC7H2.18 PUTATIVE FORMATE DEHYDROGENASE from Streptomyces coelicolor (759 aa), FASTA scores: opt: 3038, E(): 2.7e-180, (59.7% identity in 767 aa overlap); Q9HU08|PA5181 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (773 aa), FASTA scores: opt: 2560, E(): 1.1e-150, (53.2% identity in 761 aa overlap); P78160 FORMATE DEHYDROGENASE A CHAIN (EC 1.2.1.2) (FRAGMENT) from Escherichia coli strain K12 (740 aa), FASTA scores: opt: 2002, E(): 3.7e-116, (43.1% identity in 733 aa overlap); P07658|FDHF_ECOLI|P78137|B4079 FORMATE DEHYDROGENASE from Escherichia coli strain K12 (715 aa), FASTA scores: opt: 305, E(): 5.6e-13, (25.5% identity in 748 aa overlap); etc. BELONGS TO THE PROKARYOTIC MOLYBDOPTERIN-CONTAINING OXIDOREDUCTASE FAMILY." /codon_start=1 /transl_table=11 /product="formate dehydrogenase H" /protein_id="NP_217416.1" /db_xref="GI:15610037" /db_xref="GeneID:887987" /translation="MYVEAVRWQRSAASRDVLADYDEQAVTVAPRKREAAGVRAVMVS LQRGMQQMGALRTAAALARLNQRNGFDCPGCAWPEEPGGRKLAEFCENGAKAVAEEAT KRTVTAEFFARHSVAELSAKPEYWLSQQGRLAHPMVLRPGDDHYRPISWDAAYQLIAE QLNGLDSPDRAVFYTSGRTSNEAAFCYQLLVRSFGTNNLPDCSNMCHESSGAALTDSI GIGKGSVTIGDVEHADLIVIAGQNPGTNHPRMLSVLGKAKANGAKIIAVNPLPEAGLI RFKDPQKVNGVVGHGIPIADEFVQIRLGGDMALFAGLGRLLLEAEERVPGSVVDRSFV DNHCAGFDGYRRRTLQVGLDTVMDATGIELAQLQRVAAMLMASQRTVICWAMGLTQHA HAVATIGEVTNVLLLRGMIGKPGAGVCPVRGHSNVQGDRTMGIWEKMPEQFLAALDRE FGITSPRAHGFDTVAAIRAMRDGRVSVFMGMGGNFASATPDTAVTEAALRRCALTVQV STKLNRSHLVHGATALILPTLGRTDRDTRNGRKQLVSVEDSMSMVHLSRGSLHPPSDQ VRSEVQIICQLARALFGPGHPVPWERFADDYDTIRDAIAAVVPGCDDYNHKVRVPDGF QLPHPPRDAREFRTSTGKANFAVNPLQWVPVPPGRLVLQTLRSHDQYNTTIYGLDDRY RGVKGGRRVVFINPADIETFGLTAGDRVDLVSEWTDGQGGLQERRAKDFLVVAYSTPV GNAAAYYPETNPLVPLDHTAAQSNTPVSKAIIVRLEPTA" gene complement(3211803..3212108) /locus_tag="Rv2901c" /db_xref="GeneID:888605" CDS complement(3211803..3212108) /locus_tag="Rv2901c" /function="UNKNOWN" /note="Rv2901c, (MTCY274.32c), len: 101 aa. Conserved hypothetical protein, very equivalent to O33023|ML1610|MLCB250.41 HYPOTHETICAL 12.3 KDA PROTEIN from Mycobacterium leprae (101 aa), FASTA scores: opt: 658, E(): 2.6e-43, (99.0% identity in 101 aa overlap). Also highly similar to O69889|SC2E1.18 HYPOTHETICAL PROTEIN from Streptomyces coelicolor and Streptomyces lividans (102 aa), FASTA scores: opt: 515, E(): 2.2e-32, (75.0% identity in 100 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217417.1" /db_xref="GI:15610038" /db_xref="GeneID:888605" /translation="MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSV EMVPRNTDGEVYFELRLADAWVWDMYRPARFVKQVRVVTFKDVNIEEVEKPELRLPE" gene complement(3212162..3212956) /gene="rnhB" /locus_tag="Rv2902c" /db_xref="GeneID:888504" CDS complement(3212162..3212956) /gene="rnhB" /locus_tag="Rv2902c" /EC_number="3.1.26.4" /function="DEGRADES THE RIBONUCLEOTIDE MOIETY ON RNA-DNA HYBRID MOLECULES [CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE TO 5'- PHOSPHOMONOESTER]." /note="RNH2; RNase HII; binds manganese; endonuclease which specifically degrades the RNA of RNA-DNA hybrids" /codon_start=1 /transl_table=11 /product="ribonuclease HII" /protein_id="NP_217418.1" /db_xref="GI:15610039" /db_xref="GeneID:888504" /translation="MTKTWPPRTVIRKSGGLRGMRTLESALHRGGLGPVAGVDEVGRG ACAGPLVVAACVLGPGRIASLAALDDSKKLSEQAREKLFPLICRYAVAYHVVFIPSAE VDRRGVHVANIEGMRRAVAGLAVRPGYVLSDGFRVPGLPMPSLPVIGGDAAAACIAAA SVLAKVSRDRVMVALDADHPGYGFAEHKGYSTPAHSRALARLGPCPQHRYSFINVRRV ASGSNTAEVADGQPDPRDGTAQTGEGRWSKSSHPATMRATGRAQGT" gene complement(3212970..3213854) /gene="lepB" /locus_tag="Rv2903c" /db_xref="GeneID:887157" CDS complement(3212970..3213854) /gene="lepB" /locus_tag="Rv2903c" /EC_number="3.4.21.89" /function="CLEAVAGE OF N-TERMINAL LEADER SEQUENCES FROM SECRETED PROTEIN PRECURSORS." /experiment="experimental evidence, no additional details recorded" /note="Rv2903c, (MTCY274.34c), len: 294 aa. Probable lepB, signal peptidase I (EC 3.4.21.89) (TYPE II MEMBRANE PROTEIN) (see Braunstein & Belisle 2000), equivalent to O33021|LEP_MYCLE|ML1612|MLCB250.39 PROBABLE SIGNAL PEPTIDASE I from Mycobacterium leprae (289 aa), FASTA scores: opt: 1335, E(): 1.8e-77, (69.75% identity in 301 aa overlap). Also similar to many e.g. O86869|SIPX SIGNAL PEPTIDASE I from Streptomyces lividans (320 aa), FASTA scores: opt: 474, E(): 1e-22, (43.55% identity in 248 aa overlap); O69884|SIP1|SIPW PUTATIVE SIGNAL PEPTIDASE I from Streptomyces coelicolor and Streptomyces lividans (259 aa), FASTA scores: opt: 226, E(): 5e-07, (36.0% identity in 214 aa overlap); P42668|LEP_BACLI|SIP SIGNAL PEPTIDASE I from Bacillus licheniformis (186 aa), FASTA scores: opt: 218, E(): 1.3e-06, (34.5% identity in 194 aa overlap); etc. Contains PS00501 Signal peptidases I serine active site,and PS00761 Signal peptidases I signature 3. BELONGS TO PEPTIDASE FAMILY S26; ALSO KNOWN AS TYPE I LEADER PEPTIDASE FAMILY." /codon_start=1 /transl_table=11 /product="signal peptidase I LepB" /protein_id="NP_217419.1" /db_xref="GI:15610040" /db_xref="GeneID:887157" /translation="MTETTDSPSERQPGPAEPELSSRDPDIAGQVFDAAPFDAAPDAD SEGDSKAAKTDEPRPAKRSTLREFAVLAVIAVVLYYVMLTFVARPYLIPSESMEPTLH GCSTCVGDRIMVDKLSYRFGSPQPGDVIVFRGPPSWNVGYKSIRSHNVAVRWVQNALS FIGFVPPDENDLVKRVIAVGGQTVQCRSDTGLTVNGRPLKEPYLDPATMMADPSIYPC LGSEFGPVTVPPGRVWVMGDNRTHSADSRAHCPLLCTDDPLPGTVPVANVIGKARLIV WPPSRWGVVRSVNPQQGR" misc_feature complement(3213117..3213158) /gene="lepB" /locus_tag="Rv2903c" /note="PS00761 Signal peptidases I signature 3" misc_feature complement(3213552..3213575) /gene="lepB" /locus_tag="Rv2903c" /note="PS00501 Signal peptidases I serine active site" gene complement(3213912..3214253) /gene="rplS" /locus_tag="Rv2904c" /db_xref="GeneID:887356" CDS complement(3213912..3214253) /gene="rplS" /locus_tag="Rv2904c" /function="THIS PROTEIN IS LOCATED AT THE 30S-50S RIBOSOMAL SUBUNIT INTERFACE AND MAY PLAY A ROLE IN THE STRUCTURE AND FUNCTION OF THE AMINOACYL-TRNA BINDING SITE." /experiment="experimental evidence, no additional details recorded" /note="this protein is located at the 30S-50S ribosomal subunit interface and may play a role in the structure and function of the aminoacyl-tRNA binding site" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L19" /protein_id="NP_217420.1" /db_xref="GI:15610041" /db_xref="GeneID:887356" /translation="MNRLDFVDKPSLRDDIPAFNPGDTINVHVKVIEGAKERLQVFKG VVIRRQGGGIRETFTVRKESYGVGVERTFPVHSPNIDHIEVVTRGDVRRAKLYYLREL RGKKAKIKEKR" gene 3214628..3215572 /gene="lppW" /locus_tag="Rv2905" /db_xref="GeneID:887421" CDS 3214628..3215572 /gene="lppW" /locus_tag="Rv2905" /function="UNKNOWN" /note="Rv2905, (MTCY274.36), len: 314 aa. Probable lppW, conserved ala-rich lipoprotein, with slight similarity to beta-lactamases and hypothetical proteins e.g. Q9S1P7|SCJ9A.23 HYPOTHETICAL 36.3 KDA PROTEIN from Streptomyces coelicolor (336 aa), FASTA scores: opt: 222, E(): 2.8e-06, (25.5% identity in 298 aa overlap); O69914|SC3C8.01 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (302 aa), FASTA scores: opt: 201, E(): 5.1e-05, (24.9% identity in 257 aa overlap); P14559|BLAC_STRAL BETA-LACTAMASE PRECURSOR from Streptomyces albus G (314 aa), FASTA scores: opt: 113, E(): 3.3, (25.2% identity in 278 aa overlap); etc. Has signal peptide and appropriately positioned prokaryotic lipoprotein lipid attachment site: ATTACHED TO THE MEMBRANE BY A LIPID ANCHOR (POTENTIAL)." /codon_start=1 /transl_table=11 /product="alanine rich lipoprotein LppW" /protein_id="NP_217421.1" /db_xref="GI:15610042" /db_xref="GeneID:887421" /translation="MRARPLTLLTALAAVTLVVVAGCEARVEAEAYSAADRISSRPQA RPQPQPVELLLRAITPPRAPAASPNVGFGELPTRVRQATDEAAAMGATLSVAVLDRAT GQLVSNGNTQIIATASVAKLFIADDLLLAEAEGKVTLSPEDHHALDVMLQSSDDGAAE RFWSQDGGNAVVTQVARRYGLRSTAPPSDGRWWNTISSAPDLIRYYDMLLDGSGGLPL DRAAVIIADLAQSTPTGIDGYPQRFGIPDGLYAEPVAVKQGWMCCIGSSWMHLSTGVI GPERRYIMVIESLQPADDATARATITQAVRTMFPNGRI" gene complement(3215665..3216357) /gene="trmD" /locus_tag="Rv2906c" /db_xref="GeneID:888495" CDS complement(3215665..3216357) /gene="trmD" /locus_tag="Rv2906c" /EC_number="2.1.1.31" /function="SPECIFICALLY METHYLATES GUANOSIME-37 IN VARIOUS TRNAS [CATALYTIC ACTIVITY S-ADENOSYL-L-METHIONINE + TRNA = S-ADENOSYL-L-HOMOCYSTEINE + TRNA CONTAINING N1-METHYLGUANINE]." /note="methylates guanosine-37 in various tRNAs; uses S-adenosyl-L-methionine to transfer methyl group to tRNA" /codon_start=1 /transl_table=11 /product="tRNA (guanine-N(1)-)-methyltransferase" /protein_id="NP_217422.1" /db_xref="GI:15610043" /db_xref="GeneID:888495" /translation="MRIDIVTIFPACLDPLRQSLPGKAIESGLVDLNVHDLRRWTHDV HHSVDDAPYGGGPGMVMKAPVWGEALDEICSSETLLIVPTPAGVLFTQATAQRWTTES HLVFACGRYEGIDQRVVQDAARRMRVEEVSIGDYVLPGGESAAVVMVEAVLRLLAGVL GNPASHQDDSHSTGLDGLLEGPSYTRPASWRGLDVPEVLLSGDHARIAAWRREVSLQR TRERRPDLSHPD" gene complement(3216361..3216891) /gene="rimM" /locus_tag="Rv2907c" /db_xref="GeneID:887188" CDS complement(3216361..3216891) /gene="rimM" /locus_tag="Rv2907c" /function="ESSENTIAL FOR EFFICIENT PROCESSING OF 16S RRNA. PROBABLY PART OF THE 30S SUBUNIT PRIOR TO OR DURING THE FINAL STEP IN THE PROCESSING OF 16S FREE 30S RIBOSOMAL SUBUNITS. IT COULD BE SOME ACCESSORY PROTEIN NEEDED FOR EFFICIENT ASSEMBLY OF THE 30S SUBUNIT. RIMM IS NEEDED IN A STEP PRIOR TO RBFA DURING THE MATURATION OF 16S RRNA. HAS AFFINITY FOR FREE RIBOSOMAL 30S SUBUNITS BUT NOT FOR 70S RIBOSOMES." /experiment="experimental evidence, no additional details recorded" /note="Essential for efficient processing of 16S rRNA" /codon_start=1 /transl_table=11 /product="16S rRNA-processing protein RimM" /protein_id="NP_217423.1" /db_xref="GI:15610044" /db_xref="GeneID:887188" /translation="MELVVGRVVKSHGVTGEVVVEIRTDDPADRFAPGTRLRAKGPFD GGAEGSAVSYVIESVRQHGGRLLVRLAGVADRDAADALRGSLFVIDADDLPPIDEPDT YYDHQLVGLMVQTATGEGVGVVTEVVHTAAGELLAVKRDSDEVLVPFVRAIVTSVSLD DGIVEIDPPHGLLNLE" gene complement(3216905..3217147) /locus_tag="Rv2908c" /db_xref="GeneID:887336" CDS complement(3216905..3217147) /locus_tag="Rv2908c" /function="UNKNOWN" /note="Rv2908c, (MTCY274.40c), len: 80 aa. Conserved hypothetical protein, equivalent to O33015|YT08_MYCLE from Mycobacterium leprae (80 aa), FASTA scores: opt: 492, E(): 3.1e-29, (93.75% identity in 80 aa overlap). Also highly similar to others e.g. O69880|YE09_STRCO from Streptomyces coelicolor (79 aa), FASTA scores: opt: 356, E(): 3e-19, (71.6% identity in 74 aa overlap); Q9KA12|BH2482 PROTEIN from Bacillus halodurans (76 aa), FASTA scores: opt: 220, E(): 2.9e-09, (48.6% identity in 72 aa overlap); O31738|YLQC_BACSU HYPOTHETICAL 9.1 KDA PROTEIN from Bacillus subtilis (81 aa), FASTA scores: opt: 172, E(): 1e-05, (39.2% identity in 74 aa overlap); etc. BELONGS TO THE UPF0109 FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217424.1" /db_xref="GI:15610045" /db_xref="GeneID:887336" /translation="MSAVVVDAVEHLVRGIVDNPDDVRVDLITSRRGRTVEVHVHPDD LGKVIGRGGRTATALRTLVAGIGGRGIRVDVVDTDQ" gene complement(3217155..3217643) /gene="rpsP" /locus_tag="Rv2909c" /db_xref="GeneID:888631" CDS complement(3217155..3217643) /gene="rpsP" /locus_tag="Rv2909c" /function="INVOLVED IN TRANSLATION MECHANISM." /note="binds to lower part of 30S body where it stabilizes two domains; required for efficient assembly of 30S; in Escherichia coli this protein has nuclease activity" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S16" /protein_id="NP_217425.1" /db_xref="GI:15610046" /db_xref="GeneID:888631" /translation="MAVKIKLTRLGKIRNPQYRVAVADARTRRDGRAIEVIGRYHPKE EPSLIEINSERAQYWLSVGAQPTEPVLKLLKITGDWQKFKGLPGAQGRLKVAAPKPSK LEVFNAALAAADGGPTTEATKPKKKSPAKKAAKAAEPAPQPEQPDTPALGGEQAELTA ES" gene complement(3217827..3218270) /locus_tag="Rv2910c" /db_xref="GeneID:887387" CDS complement(3217827..3218270) /locus_tag="Rv2910c" /function="UNKNOWN" /note="Rv2910c, (MTCY274.42c), len: 147 aa. Conserved hypothetical protein, showing some similarity with hypothetical proteins from other organisms e.g. Q9JN76|MMYY HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 164, E(): 0.00026, (35.05% identity in 129 aa overlap); etc. Also some similarity with protein from Mycobacterium tuberculosis e.g. O07237|Rv0310c|MTCY63.15c (163 aa), FASTA scores: opt: 165, E(): 0.00023, (26.3% identity in 137 aa overlap); P96815|Rv0138|MTCI5.12 (167 aa), FASTA scores: opt: 132, E(): 0.048, (30.25% identity in 109 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217426.1" /db_xref="GI:15610047" /db_xref="GeneID:887387" /translation="MCAVLDRSMLSVAEISDRLEIQQLLVDYSSAIDQRRFDDLDRVF TPDAYIDYRALGGIDGRYPKIKQWLSQVLGNFPVYAHMLGNFSVRVDGDTASSRVICF NPMVFAGDRQQVLFCGLWYDDDFVRTPDGWRIIRRVETKCFQKMM" gene 3218339..3219214 /gene="dacB2" /locus_tag="Rv2911" /db_xref="GeneID:887189" CDS 3218339..3219214 /gene="dacB2" /locus_tag="Rv2911" /EC_number="3.4.16.4" /function="INVOLVED IN PEPTIDOGLYCAN SYNTHESIS (AT FINAL STAGES). HYDROLYZES THE BOUND D-ALANYL-D-ALANINE [CATALYTIC ACTIVITY: D-ALANYL-D-ALANINE + H(2)O = 2 D-ALANINE]." /note="Rv2911, (MTCY274.43), len: 291 aa. Probable dacB2, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein) (EC 3.4.16.4), an ala-rich protein. Highly similar (except in N-terminus) to Q9CCM2|ML0691 PUTATIVE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE from Mycobacterium leprae (411 aa), FASTA scores: opt: 749, E(): 9.3e-39, (46.75% identity in 276 aa overlap). Also similar to penicillin binding proteins / D-alanyl-D-alanine carboxypeptidases e.g. Q9KCJ8|SC4G1.16c D-ALANYL-D-ALANINE CARBOXYPEPTIDASE from Streptomyces coelicolor (382 aa), FASTA scores: opt: 386, E(): 2.1e-16, (31.25% identity in 285 aa overlap); P35150|DACB_BACSU PENICILLIN-BINDING PROTEIN 5* PRECURSOR from Bacillus subtilis (382 aa), FASTA scores: opt: 384, E(): 3.6e-17, (30.7% identity in 244 aa overlap); Q9K8X5|DACB|BH2877 D-ALANYL-D-ALANINE CARBOXYPEPTIDASE (PENICILLIN-BINDING PROTEIN 5) from Bacillus halodurans (395 aa), FASTA scores: opt: 359, E(): 9.7e-15, (30.3% identity in 241 aa overlap); P33364|PBP7_ECOLI|PBPG|B2134 penicillin-binding protein 7 precursor from Escherichia coli strain K12 (313 aa), FASTA scores: opt: 273, E(): 7.5e-10, (27.8% identity in 263 aa overlap); etc. Also similar to O53380|Rv3330|MTV016.30 PENICILLIN-BINDING PROTEIN from Mycobacterium tuberculosis (405 aa), FASTA scores: opt: 746, E(): 1.4e-38, (47.0% identity in 266 aa overlap). Seems to contain PF00768 Peptidase_S11 domain PFAM. BELONGS TO PEPTIDASE FAMILY S11; ALSO KNOWN AS THE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE 1 FAMILY. Thought to be a membrane-bound protein. Note that previously known as dacB.; dacB" /codon_start=1 /transl_table=11 /product="D-alanyl-D-alanine carboxypeptidase" /protein_id="YP_177914.1" /db_xref="GI:57117035" /db_xref="GeneID:887189" /translation="MRKLMTATAALCACAVTVSAGAAWADADVQPAGSVPIPDGPAQT WIVADLDSGQVLAGRDQNVAHPPASTIKVLLALVALDELDLNSTVVADVADTQAECNC VGVKPGRSYTARQLLDGLLLVSGNDAANTLAHMLGGQDVTVAKMNAKAATLGATSTHA TTPSGLDGPGGSGASTAHDLVVIFRAAMANPVFAQITAEPSAMFPSDNGEQLIVNQDE LLQRYPGAIGGKTGYTNAARKTFVGAAARGGRRLVIAMMYGLVKEGGPTYWDQAATLF DWGFALNPQASVGSL" gene complement(3219274..3219861) /locus_tag="Rv2912c" /db_xref="GeneID:888017" CDS complement(3219274..3219861) /locus_tag="Rv2912c" /function="THOUGHT TO BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2912c, (MTCY274.44c), len: 195 aa. Probable transcription regulatory protein, tetR family, showing similarity with others e.g. Q9K3V9|SCD10.17 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (202 aa), FASTA scores: opt: 185, E(): 4.4e-05, (31.15% identity in 167 aa overlap); Q9KFQ0 TETR-FAMILY from Bacillus halodurans (185 aa), FASTA scores: opt: 164, E(): 0.001, (35.6% identity in 73 aa overlap); P17446|BETI_ECOLI|BETI|B0313 regulatory protein from Escherichia coli strain K12 (195 aa), FASTA scores: opt: 126, E(): 0.024, (24.5% identity in 196 aa overlap); etc. Contains possible helix-turn-helix motif at aa 33-54 (+2.71 SD). POSSIBLY BELONGS TO THE TETR/ACRR FAMILY." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217428.1" /db_xref="GI:15610049" /db_xref="GeneID:888017" /translation="MARTQQQRREETVARLLQASIDTIIEVGYARASAAVITKRAGVS VGALFRHFETMGDFMAATAYEVLRRQLETFTKQVAEIPADRPALPAALTILRDITAGS TNAVLYELMVAARTDEKLKETLQNVLGQYSAKIHDAARALPGAESFPEETFPVIVALM TNVFDGAAIVRGVLPQPELEEQRIPMLTALLTAGL" gene complement(3219863..3221698) /locus_tag="Rv2913c" /db_xref="GeneID:887809" CDS complement(3219863..3221698) /locus_tag="Rv2913c" /EC_number="3.5.1.-" /function="HYDROLIZES SPECIFIC D-AMINO ACID." /experiment="experimental evidence, no additional details recorded" /note="Rv2913c, (MTCY338.01c, MTCY274.45c), len: 611 aa. Possible D-amino acid aminohydrolase (EC 3.5.1.-), similar (principally in N-terminus) to D-amino acid aminohydrolases e.g. Q9V2D3|NDAD|PAB0090 D-AMINOACYLASE (ASPARTATE, GLUTAMATE ETC) from Pyrococcus abyssi (526 aa), FASTA scores: opt: 336, E(): 2.2e-13, (27.55% identity in 581 aa overlap); P94212|NDDD_ALCXX N-ACYL-D-ASPARTATE DEACYLASE (EC 3.5.1.83) (N-ACYL-D-ASPARTATE AMIDOHYDROLASE) from Alcaligenes xylosoxydans xylosoxydans (Achromobacter xylosoxidans) (498 aa), FASTA scores: opt: 221, E(): 3.4e-06, (25.95% identity in 532 aa overlap); Q9AGH8 D-AMINOACYLASE (EC 3.5.1.81) from Alcaligenes faecalis (484 aa), FASTA scores: opt: 218, E(): 5.1e-06, (28.35% identity in 434 aa overlap); etc." /codon_start=1 /transl_table=11 /product="D-amino acid aminohydrolase" /protein_id="NP_217429.1" /db_xref="GI:15610050" /db_xref="GeneID:887809" /translation="MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGV VATVAAGALDETGCPEVVDAAGKWVVPGFIDVHTHYDAEVLLDPGLRESVRHGVTTVL LGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYIEAIDALPLGP NVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLLDEALEAGMLGMSGMDAAI DKLDGDRFRSRALPSTFATWRERRKLISVLRHRGRILQSAPDVDNPVSALLFFLASSR IFNRRKGVRMSMLVSADAKSMPLAVHVFGLGTRVLNKLLGSQVRFQHLPVPFELYSDG IDLPVFEEFGAGTAALHLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDA VIVECPDKSLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNK LAAEPSVHMGFSDAGAHLRNMAFYNFGLRLLKRARDADRAGQPFLSIERAVYRLTGEL AEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYYGGLRRMVNRNDATVVA TGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRAGELGAALSRSA" gene complement(3221767..3223524) /gene="pknI" /locus_tag="Rv2914c" /db_xref="GeneID:887642" CDS complement(3221767..3223524) /gene="pknI" /locus_tag="Rv2914c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). THOUGHT TO BE INVOLVED IN CELL DIVISION/DIFFERENTIATION [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /note="Rv2914c, (MTCY338.02c), len: 585 aa. Probable pknI, transmembrane serine/threonine-protein kinase (EC 2.7.1.-) (see citation below), ala-rich protein, highly similar to many in Mycobacterium tuberculosis and other bacteria e.g. Q9RLQ7|MBK PUTATIVE SERINE/THREONINE PROTEIN KINASE from Mycobacterium bovis BCG (291 aa), FASTA scores: opt: 376, E(): 1.1e-10, (36.95% identity in 287 aa overlap); P33973|PKN1_MYXXA serine/threonine-protein kinase from Myxococcus xanthus (693 aa), FASTA scores: opt: 286, E(): 5.4e-10, (29.9% identity in 374 aa overlap); P72003|PKNF_MYCTU|Rv1746|MT1788|MTCY28.09 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium tuberculosis (476 aa), FASTA scores: opt: 675, E(): 1.7e-24, (39.75% identity in 468 aa overlap); Q10697|PKNJ_MYCTU|Rv2088|MT2149|MTCY49.28 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium tuberculosis (589 aa), FASTA scores: opt: 574, E(): 1e-19, (34.85% identity in 479 aa overlap); etc. Equivalent to AAK47308 from Mycobacterium tuberculosis strain CDC1551 (603 aa) but shorter 18 aa. Contains Hank's kinase subdomain. BELONGS TO THE SER/THR FAMILY OF PROTEIN KINASES." /codon_start=1 /transl_table=11 /product="transmembrane serine/threonine-protein kinase I" /protein_id="NP_217430.1" /db_xref="GI:15610051" /db_xref="GeneID:887642" /translation="MALASGVTFAGYTVVRMLGCSAMGEVYLVQHPGFPGWQALKVLS PAMAADDEFRRRFQRETEVAARLFHPHILEVHDRGEFDGQLWIAMDYVDGIDATQHMA DRFPAVLPVGEVLAIVTAVAGALDYAHQRGLLHRDVNPANVVLTSQSAGDQRILLADF GIASQPSYPAPELSAGADVDGRADQYALALTAIHLFAGAPPVDRSHTGPLQPPKLSAF RPDLARLDGVLSRALATAPADRFGSCREFADAMNEQAGVAIADQSSGGVDASEVTAAA GEEAYVVDYPAYGWPEAVDCKEPSARAPAPAAPTPQRRGSMLQSAAGVLARRLDNFST ATKAPASPTRRRPRRILVGAVAVLLLAGLFAVGIVIGRKTNTTATEVARPPTSGSAVP SAPTTTVAVTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLA AATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPSTQATTQLLA LRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPD TTSTATLTPPTTTAPGPGR" gene complement(3223568..3224680) /locus_tag="Rv2915c" /db_xref="GeneID:888083" CDS complement(3223568..3224680) /locus_tag="Rv2915c" /function="UNKNOWN" /note="Rv2915c, (MTCY338.03c), len: 370 aa. Conserved hypothetical protein, posssibly XAA-PRO dipeptidase (prolidase) (EC 3.4.13.9), highly similar to CAC38796|SCI39.08c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (363 aa), FASTA scores: opt: 1341, E(): 5.5e-76, (56.65% identity in 362 aa overlap); and similar to prolidases (XAA-PRO dipeptidase) e.g. Q9ABC9|CC0300 PUTATIVE XAA-PRO DIPEPTIDASE from Caulobacter crescentus (428 aa), FASTA scores: opt: 327, E(): 7.4e-13, (30.2% identity in 374 aa overlap); Q97XD4 PROLIDASE from Sulfolobus solfataricus (396 aa), FASTA scores: opt: 271, E(): 2.1e-09, (30.5% identity in 354 aa overlap); Q9WX55 PROLIDASE from Microbacterium esteraromaticum (393 aa), FASTA scores: opt: 256, E(): 1.8e-08, (27.95% identity in 365 aa overlap); etc. Also similar to O53619|Rv0074|MTV030.18 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (411 aa), FASTA scores: opt: 243, E(): 1.2e-07, (27.5% identity in 389 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217431.1" /db_xref="GI:15610052" /db_xref="GeneID:888083" /translation="MKRVDTIRPRSRAVRLHVRGLGLPDETAIQLWIVDGRISTEPVA GADTVFDGGWILPGLVDAHCHVGLGKHGNVELDEAIAQAETERDVGALLLRDCGSPTD TRGLDDHEDLPRIIRAGRHLARPKRYIAGFAVELEDESQLPAAVAEQARRGDGWVKLV GDWIDRQIGDLAPLWSDDVLKAAIDTAHAQGARVTAHVFSEDALPGLINAGIDCIEHG TGLTDDTIALMLEHGTALVPTLINLENFPGIADAAGRYPTYAAHMRDLYARGYGRVAA AREAGVPVYAGTDAGSTIEHGRIADEVAALQRIGMTAHEALGAACWDARRWLGRPGLD DRASADLLCYAQDPRQGPGVLQHPDLVILRGRTFGP" gene complement(3224708..3226285) /gene="ffh" /locus_tag="Rv2916c" /db_xref="GeneID:888242" CDS complement(3224708..3226285) /gene="ffh" /locus_tag="Rv2916c" /function="NECESSARY FOR EFFICIENT EXPORT OF EXTRA-CYTOPLASMIC PROTEINS. BINDS TO THE SIGNAL SEQUENCE WHEN IT EMERGES FROM THE RIBOSOMES." /note="Rv2916c, (MTCY338.04c), len: 525 aa. Probable ffh, signal recognition particle (SRP) protein (ala-, gly-, leu-rich protein) (see citation below), equivalent to O33013|SR54_MYCLE SIGNAL RECOGNITION PARTICLE from Mycobacterium leprae (521 aa), FASTA scores: opt: 2968, E(): 1.6e-145, (87.85% identity in 526 aa overlap). Also highly similar to others e.g. O69874|FFH from Streptomyces coelicolor (550 aa), FASTA scores: opt: 2025, E(): 6e-97, (63.8% identity in 519 aa overlap) (N-terminus longer 34 aa); P37105|SR54_BACSU from Bacillus subtilis (446 aa), FASTA scores: opt: 1451, E(): 1.9e-67, (51.5% identity in 435 aa overlap); BAB57399|FFH from Staphylococcus aureus subsp. aureus Mu50 (455 aa), FASTA scores: opt: 1418, E(): 9.4e-66, (48.65% identity in 448 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE SRP FAMILY OF GTP-BINDING PROTEINS. NOTE THAT SIGNAL RECOGNITION PARTICLE CONSISTS OF A SMALL CYTOPLASMIC RNA (SC-RNA) MOLECULE AND PROTEIN FFH. THE PROTEIN HAS A TWO DOMAIN STRUCTURE: THE G-DOMAIN BINDS GTP; THE M-DOMAIN BINDS THE RNA AND ALSO BINDS THE SIGNAL SEQUENCE." /codon_start=1 /transl_table=11 /product="signal recognition particle protein" /protein_id="NP_217432.1" /db_xref="GI:15610053" /db_xref="GeneID:888242" /translation="MFESLSDRLTAALQGLRGKGRLTDADIDATTREIRLALLEADVS LPVVRAFIHRIKERARGAEVSSALNPAQQVVKIVNEELISILGGETRELAFAKTPPTV VMLAGLQGSGKTTLAGKLAARLRGQGHTPLLVACDLQRPAAVNQLQVVGERAGVPVFA PHPGASPESGPGDPVAVAAAGLAEARAKHFDVVIVDTAGRLGIDEELMAQAAAIRDAI NPDEVLFVLDAMIGQDAVTTAAAFGEGVGFTGVALTKLDGDARGGAALSVREVTGVPI LFASTGEKLEDFDVFHPDRMASRILGMGDVLSLIEQAEQVFDAQQAEEAAAKIGAGEL TLEDFLEQMLAVRKMGPIGNLLGMLPGAAQMKDALAEVDDKQLDRVQAIIRGMTPQER ADPKIINASRRLRIANGSGVTVSEVNQLVERFFEARKMMSSMLGGMGIPGIGRKSATR KSKGAKGKSGKKSKKGTRGPTPPKVKSPFGVPGMPGLAGLPGGLPDLSQMPKGLDELP PGLADFDLSKLKFPGKK" misc_feature complement(3225944..3225967) /gene="ffh" /locus_tag="Rv2916c" /note="PS00017 ATP/GTP-binding site motif A" gene 3226363..3228243 /locus_tag="Rv2917" /db_xref="GeneID:887758" CDS 3226363..3228243 /locus_tag="Rv2917" /function="UNKNOWN" /note="Rv2917, (MTCY338.05), len: 626 aa. Conserved hypothetical ala-, arg-rich protein, highly similar (but longer 34 aa) to O33011|ML1624|MLCB250.18C HYPOTHETICAL 65.2 KDA PROTEIN from Mycobacterium leprae (596 aa), FASTA scores: opt: 3117, E(): 9e-183, (79.8% identity in 584 aa overlap). Also highly similar to Q9S2E8|SCE19A.36C HYPOTHETICAL 66.2 KDA PROTEIN from Streptomyces coelicolor (598 aa), FASTA scores: opt: 1921, E(): 1.1e-109, (56.08% identity in 567 aa overlap); and Q9S3Y6|SDRA SDRA PROTEIN from Streptomyces coelicolor (597 aa), FASTA scores: opt: 1896, E(): 3.6e-108, (55.75% identity in 567 aa overlap). And shows some similarity with others proteins from other organisms. Equivalent to AAK47311 putative RNA helicase from Mycobacterium tuberculosis strain CDC1551 (602 aa) but longer 24 aa. Contains PS00017 ATP/GTP-binding site motif (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217433.1" /db_xref="GI:15610054" /db_xref="GeneID:887758" /translation="MRVTRLVDAESTRCDVGPAPKSVAMLHFTAATSRFRLGRERANS VRSDGGWGVLQPVSATFNPPLRGWQRRALVQYLGTQPRDFLAVATPGSGKTSFALRIA AELLRYHTVEQVTVVVPTEHLKVQWAHAAAAHGLSLDPKFANSNPQTSPEYHGVMVTY AQVASHPTLHRVRTEARKTLVVFDEIHHGGDAKTWGDAIREAFGDATRRLALTGTPFR SDDSPIPFVSYQPDADGVLRSQADHTYGYAEALADGVVRPVVFLAYSGQARWRDSAGE EYEARLGEPLSAEQTARAWRTALDPEGEWMPAVITAADRRLRQLRAHVPDAGGMIIAS DRTTARAYARLLTTMTAEEPTVVLSDDPGSSARITEFAQGTSRWLVAVRMVSEGVDVP RLSVGVYATNASTPLFFAQAIGRFVRSRRPGETASIFVPSVPNLLQLASALEVQRNHV LGRPHRESAHDPLDGDPATRTQTERGGAERGFTALGADAELDQVIFDGSSFGTATPTG SDEEADYLGIPGLLDAEQMRALLHRRQDEQLRKRAQLQKGATQPATSGASASVHGQLR DLRRELHTLVSIAHHRTGKPHGWIHDERRRRCGGPPIAAATRAQIKARIDALRQLNSE RS" misc_feature 3226624..3226647 /locus_tag="Rv2917" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3228254..3230680) /gene="glnD" /locus_tag="Rv2918c" /db_xref="GeneID:888621" CDS complement(3228254..3230680) /gene="glnD" /locus_tag="Rv2918c" /EC_number="2.7.7.59" /function="MODIFIES, BY URIDYLYLATION OR DEURIDYLYLATION THE PII (GLNB|Rv2919c) REGULATORY PROTEIN [CATALYTIC ACTIVITY: UTP + [PROTEIN-PII] = DIPHOSPHATE + URIDYLYL-[PROTEIN-PII]]." /note="catalyzes the uridylylation or deuridylylation of the PII nitrogen regulatory protein; also involved in adenylylating and deadelnylyating GlnK" /codon_start=1 /transl_table=11 /product="PII uridylyl-transferase" /protein_id="NP_217434.1" /db_xref="GI:15610055" /db_xref="GeneID:888621" /translation="MEAESPCAASDLAVARRELLSGNHRELDPVGLRQTWLDLHESWL IDKADEIGIADASGFAIVGVGGLGRRELLPYSDLDVLLLHDGKPADILRPVADRLWYP LWDANIRLDHSVRTVSEALTIANSDLMAALGMLEARHIAGDQQLSFALIDGVRRQWRN GIRSRMGELVEMTYARWRRCGRIAQRAEPDLKLGRGGLRDVQLLDALALAQLIDRHGI GHTDLPAGSLDGAYRTLLDVRTELHRVSGRGRDHLLAQFADEISAALGFGDRFDLART LSSAGRTIGYHAEAGLRTAANALPRRGISALVRRPKRRPLDEGVVEYAGEIVLARDAE PEHDPGLVLRVAAASADTGLPIGAATLSRLAASVPDLPTPWPQEALDDLLVVLSAGPT TVATIEALDRTGLWGRLLPEWEPIRDLPPRDVAHKWTVDRHVVETAVHAAPLATRVAR PDLLALGALLHDIGKGRGTDHSVLGAELVIPVCTRLGLSPPDVRTLSKLVRHHLLLPI TATRRDLNDPKTIEAVSEALGGDPQLLEVLHALSEADSKATGPGVWSDWKASLVDDLV RRCRMVMAGESLPQAEPTAPHYLSLAADHGVHVEISPRDGERIDAVIVAPDERGLVSK AAAVLALNSLRVHSASVNVHQGVAITEFVVSPLFGSPPAAELVRQQFVGALNGDVDVL GMLQKRDSDAASLVSARAGDVQAGVPVTRTAAPPRILWLDTAAPAKLILEVRAMDRAG LLALLAGALEGAGAGIVWAKVNTFGSTAADVFCVTVPAELDARAAVEQHLLEVLGASV DVVVDEPVGD" gene complement(3230738..3231076) /gene="glnB" /locus_tag="Rv2919c" /db_xref="GeneID:887756" CDS complement(3230738..3231076) /gene="glnB" /locus_tag="Rv2919c" /function="IN NITROGEN-LIMITING CONDITIONS, WHEN THE RATIO OF GLN TO 2-KETOGLUTARATE DECREASES, P-II IS URIDYLYLATED TO P-II-UMP BY GLND|Rv2918c. P-II-UMP ALLOWS THE DEADENYLYLATION OF GLUTAMINE SYNTHETASE (GS), THUS ACTIVATING THE ENZYME. CONVERSERLY, IN NITROGEN EXCESS P-II IS DEURIDYLATED AND PROMOTES THE ADENYLATION OF GS. P-II INDIRECTLY CONTROLS THE TRANSCRIPTION OF THE GS GENE (GLNA: FOUR COPIES IN THE GENOME). P-II PREVENTS NR-II CATALYZED CONVERSION OF NR-I TO NR-I-PHOSPHATE, THE TRANSCRIPTIONAL ACTIVATOR OF GLNA. WHEN P-II IS URIDYLYLATED TO P-II-UMP, THESE EVENTS ARE REVERSED." /note="Rv2919c, (MTCY338.08c), len: 112 aa. Probable glnB, nitrogen regulatory protein, highly similar to others e.g. Q9X705|GLNB PII PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (112 aa), FASTA scores: opt: 531, E(): 4.5e-30, (68.75% identity in 112 aa overlap); P21193|GLNB_AZOBR NITROGEN REGULATORY PROTEIN P-II from Azospirillum brasilense (112 aa), FASTA scores: opt: 496, E(): 1.2e-27, (60.7% identity in 112 aa overlap); P05826|GLNB_ECOLI|B2553|Z3829|ECS3419|STY2808 NITROGEN REGULATORY PROTEIN P-II from Escherichia coli strains K12 and O157:H7 (112 aa), FASTA scores: opt: 487, E(): 5.3e-27, (61.6% identity in 112 aa overlap); etc. Contains PS00496 P-II protein urydylation site. BELONGS TO THE P(II) PROTEIN FAMILY." /codon_start=1 /transl_table=11 /product="nitrogen regulatory protein P-II GLNB" /protein_id="NP_217435.1" /db_xref="GI:15610056" /db_xref="GeneID:887756" /translation="MKLITAIVKPFTLDDVKTSLEDAGVLGMTVSEIQGYGRQKGHTE VYRGAEYSVDFVPKVRIEVVVDDSIVDKVVDSIVRAARTGKIGDGKVWVSPVDTIVRV RTGERGHDAL" misc_feature complement(3230924..3230941) /gene="glnB" /locus_tag="Rv2919c" /note="PS00496 P-II protein urydylation site" gene complement(3231073..3232506) /gene="amt" /locus_tag="Rv2920c" /db_xref="GeneID:887683" CDS complement(3231073..3232506) /gene="amt" /locus_tag="Rv2920c" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF AMMONIUM ACROSS THE MEMBRANE (EXPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv2920c, (MTCY338.09c), len: 477 aa. Probable amt, ammonium-transport integral membrane protein (ala-, gly-, leu-, val-rich protein), highly similar to others e.g. Q9ZBP6|SC7A1.27 AMMONIUM TRANSPORTER from Streptomyces coelicolor (448 aa), FASTA scores: opt: 1246, E(): 7.3e-67, (54.1% identity in 462 aa overlap); P54146|AMT_CORGL AMMONIUM TRANSPORT SYSTEM from Corynebacterium glutamicum (452 aa), FASTA scores: opt: 953, E(): 2.1e-49, (41.45% identity in 475 aa overlap); Q07429|NRGA_BACSU PROBABLE AMMONIUM TRANSPORTER (MEMBRANE PROTEIN NRGA) from Bacillus subtilis (404 aa), FASTA scores: opt: 721, E(): 0, (44.4% identity in 430 aa overlap); etc. BELONGS TO THE AMT1/MEP/NRGA FAMILY OF AMMONIUM TRANSPORTERS (TC 2.49)." /codon_start=1 /transl_table=11 /product="ammonium transporter" /protein_id="NP_217436.1" /db_xref="GI:15610057" /db_xref="GeneID:887683" /translation="MDQFPIMGVPDGGDTAWMLVSSALVLLMTPGLAFFYGGMVRSKS VLNMIMMSISAMGVVTVLWALYGYSIAFGDDVGNIAGNPSQYWGLKGLIGVNAVAADP STQTAAVNIPLAGTLPATVFVAFQLMFAIITVALISGAVADRLKFGAWLLFAGLWATF VYFPVAHWVFAFDGFAAEHGGWIANKLHAIDFAGGTAVHINAGVAALMLAIVLGKRRG WPATLFRPHNLPFVMLGAALLWFGWYGFNAGSATTANGVAGATFVTTTIATAAAMLGW LLTERVRDGKATTLGAASGIVAGLVAITPSCSSVNVLGALAVGVSAGVLCALAVGLKF KLGFDDSLDVVGVHLVGGLVGTLLVGLLAAPEAPAINGVAGVSKGLFYGGGFAQLERQ ALGACSVLVYSGIITLILALILKFTIGLRLDAEQESTGIDEAEHAESGYDFAVASGSV LPPRVTVEDSRNGIQERIGQKVEAEPK" gene complement(3232871..3234139) /gene="ftsY" /locus_tag="Rv2921c" /db_xref="GeneID:887205" CDS complement(3232871..3234139) /gene="ftsY" /locus_tag="Rv2921c" /function="PROBABLY INVOLVED IN THE RECEPTION AND INSERTION OF A SUBSET OF PROTEINS AT THE MEMBRANE: POSSIBLY MEMBRANE RECEPTOR FOR FFH|Rv2916c." /note="Rv2921c, (MTCY338.10c, MT2989), len: 422 aa. Probable ftsY, signal recognition particle (SRP) receptor, a membrane-associated cell division protein (see citation below), equivalent to O33010|FTSY_MYCLE CELL DIVISION PROTEIN FTSY HOMOLOG from Mycobacterium leprae (430 aa), FASTA scores: opt: 1760, E(): 1.1e-108, (81.35% identity in 429 aa overlap). Also similar to others e.g. Q9I6C1|FTSY|PA0373 SIGNAL RECOGNITION PARTICLE RECEPTOR FTSY from Pseudomonas aeruginosa (455 aa), FASTA scores: opt: 882, E(): 5.1e-40, (42.08% identity in 385 aa overlap); Q9KVJ6|FTSY CELL DIVISION PROTEIN from Vibrio cholerae (391 aa), FASTA scores: opt: 837, E(): 1.2e-37, (36.3% identity in 394 aa overlap); P10121|FTSY_ECOLI|FTSY|B3464 CELL DIVISION PROTEIN from Escherichia coli strain K12 (497 aa), FASTA scores: opt: 800, E(): 1.3e-35, (39.75% identity in 327 aa overlap); etc. Also similar to Q9ZBP9|SC7A1.24 PUTATIVE PROKARYOTIC DOCKING PROTEIN from Streptomyces coelicolor (412 aa), FASTA scores: opt: 1461, E(): 4.3e-71, (60.3% identity in 423 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00300 SRP54-type proteins GTP-binding domain signature. BELONGS TO THE SRP FAMILY OF GTP-BINDING PROTEINS." /codon_start=1 /transl_table=11 /product="cell division protein FtsY" /protein_id="NP_217437.1" /db_xref="GI:15610058" /db_xref="GeneID:887205" /translation="MWEGLWIATAVIAALVVIAALTLGLVLYRRRRISLSPRPERGVV DRSGGYTASSGITFSQTPTTQPAERIDTSGLPAVGDDATVPRDAPKRTIADVHLPEFE PEPQAPEVPEADAIAPPEGRLERLRGRLARSQNALGRGLLGLIGGGDLDEDSWQDVED TLLVADLGPAATASVVSQLRSRLASGNVRTEADARAVLRDVLINELQPGMDRSIRALP HAGHPSVLLVVGVNGTGKTTTVGKLARVLVADGRRVVLGAADTFRAAAADQLQTWAAR VGAAVVRGPEGADPASVAFDAVDKGIAAGADVVLIDTAGRLHTKVGLMDELDKVKRVV TRRASVDEVLLVLDATIGQNGLAQARVFAEVVDISGAVLTKLDGTAKGGIVFRVQQEL GVPVKLVGLGEGPDDLAPFEPAAFVDALLG" misc_feature complement(3232916..3232957) /gene="ftsY" /locus_tag="Rv2921c" /note="PS00300 SRP54-type proteins GTP-binding domain signature" misc_feature complement(3233429..3233452) /gene="ftsY" /locus_tag="Rv2921c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3234189..3237806) /gene="smc" /locus_tag="Rv2922c" /db_xref="GeneID:887179" CDS complement(3234189..3237806) /gene="smc" /locus_tag="Rv2922c" /function="PLAYS AN IMPORTANT ROLE IN CHROMOSOME STRUCTURE AND PARTITIONING. ESSENTIAL FOR CHROMOSOME PARTITION." /note="Rv2922c, (MT2990, MTCY338.11c), len: 1205 aa. Probable smc, chromosome partition protein (ala-, arg-, leu-, glu-rich protein, possibly coiled-coil protein) (see * below), equivalent (but longer 84 aa) to Q9CBT5|SMC|ML1629|MLCB250.01 POSSIBLE CELL DIVISION PROTEIN from Mycobacterium leprae (1203 aa), FASTA scores: opt: 5957, E(): 0, (79.15% identity in 1205 aa overlap). Also highly similar to other chromosome segregation proteins e.g. Q9ZBQ2|SC7A1.21 PUTATIVE CHROMOSOME ASSOCIATED PROTEIN from Streptomyces coelicolor (1186 aa), FASTA scores: opt: 2633, E(): 4.1e-120, (53.03% identity in 1205 aa overlap); P51834|SMC_BACSU CHROMOSOME PARTITION PROTEIN from Bacillus subtilis (1186 aa), FASTA scores: opt: 1009, E(): 2.1e-41, (30.75% identity in 1205 aa overlap); Q9CHC9|SMC CHROMOSOME SEGREGATION PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (924 aa), FASTA scores: opt: 996, E(): 7.5e-41, (29.75% identity in 874 aa overlap); etc. Equivalent to AAK47317 from Mycobacterium tuberculosis strain CDC1551 (1205 aa) but longer 84 aa. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE SMC FAMILY. N-terminus shortened since first submission. [* Note: Unpublished. Cobbe N., Heck M.M.S.- Phylogenetic analysis of SMC proteins (OCT-2001)]." /codon_start=1 /transl_table=11 /product="chromosome partition protein Smc" /protein_id="NP_217438.2" /db_xref="GI:57117036" /db_xref="GeneID:887179" /translation="MYLKSLTLKGFKSFAAPTTLRFEPGITAVVGPNGSGKSNVVDAL AWVMGEQGAKTLRGGKMEDVIFAGTSSRAPLGRAEVTVSIDNSDNALPIEYTEVSITR RMFRDGASEYEINGSSCRLMDVQELLSDSGIGREMHVIVGQGKLEEILQSRPEDRRAF IEEAAGVLKHRKRKEKALRKLDTMAANLARLTDLTTELRRQLKPLGRQAEAAQRAAAI QADLRDARLRLAADDLVSRRAEREAVFQAEAAMRREHDEAAARLAVASEELAAHESAV AELSTRAESIQHTWFGLSALAERVDATVRIASERAHHLDIEPVAVSDTDPRKPEELEA EAQQVAVAEQQLLAELDAARARLDAARAELADRERRAAEADRAHLAAVREEADRREGL ARLAGQVETMRARVESIDESVARLSERIEDAAMRAQQTRAEFETVQGRIGELDQGEVG LDEHHERTVAALRLADERVAELQSAERAAERQVASLRARIDALAVGLQRKDGAAWLAH NRSGAGLFGSIAQLVKVRSGYEAALAAALGPAADALAVDGLTAAGSAVSALKQADGGR AVLVLSDWPAPQAPQSASGEMLPSGAQWALDLVESPPQLVGAMIAMLSGVAVVNDLTE AMGLVEIRPELRAVTVDGDLVGAGWVSGGSDRKLSTLEVTSEIDKARSELAAAEALAA QLNAALAGALTEQSARQDAAEQALAALNESDTAISAMYEQLGRLGQEARAAEEEWNRL LQQRTEQEAVRTQTLDDVIQLETQLRKAQETQRVQVAQPIDRQAISAAADRARGVEVE ARLAVRTAEERANAVRGRADSLRRAAAAEREARVRAQQARAARLHAAAVAAAVADCGR LLAGRLHRAVDGASQLRDASAAQRQQRLAAMAAVRDEVNTLSARVGELTDSLHRDELA NAQAALRIEQLEQMVLEQFGMAPADLITEYGPHVALPPTELEMAEFEQARERGEQVIA PAPMPFDRVTQERRAKRAERALAELGRVNPLALEEFAALEERYNFLSTQLEDVKAARK DLLGVVADVDARILQVFNDAFVDVEREFRGVFTALFPGGEGRLRLTEPDDMLTTGIEV EARPPGKKITRLSLLSGGEKALTAVAMLVAIFRARPSPFYIMDEVEAALDDVNLRRLL SLFEQLREQSQIIIITHQKPTMEVADALYGVTMQNDGITAVISQRMRGQQVDQLVTNS S" misc_feature complement(3237693..3237716) /gene="smc" /locus_tag="Rv2922c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3237818..3238099) /gene="acyP" /locus_tag="Rv2922A" /db_xref="GeneID:3205043" CDS complement(3237818..3238099) /gene="acyP" /locus_tag="Rv2922A" /EC_number="3.6.1.7" /function="INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: AN ACYLPHOSPHATE + H(2)O = A FATTY ACID ANION + ORTHOPHOSPHATE]." /note="catalyzes the hydrolysis of acylphosphate" /codon_start=1 /transl_table=11 /product="acylphosphatase" /protein_id="YP_177679.1" /db_xref="GI:57117037" /db_xref="GeneID:3205043" /translation="MSAPDVRLTAWVHGWVQGVGFRWWTRCRALELGLTGYAANHADG RVLVVAQGPRAACQKLLQLLQGDTTPGRVAKVVADWSQSTEQITGFSER" gene complement(3238086..3238499) /locus_tag="Rv2923c" /db_xref="GeneID:887687" CDS complement(3238086..3238499) /locus_tag="Rv2923c" /function="UNKNOWN" /note="Rv2923c, (MTCY338.12c), len: 137 aa. Conserved hypothetical protein, showing similarity with other hypothetical proteins e.g. P24246|YHFA_ECOLI|B3356|Z4717|ECS4207 from Escherichia coli strains K12 and O157:H7 (134 aa), FASTA scores: opt: 110, E(): 1.9, (25.9% identity in 135 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217439.1" /db_xref="GI:15610060" /db_xref="GeneID:887687" /translation="MTQLWVERTGTRRYIGRSTRGAQVLVGSEDVDGVFTPGELLKIA LAACSGMASDQPLARRLGDDYQAVVKVSGAADRDQERYPLIEETMELDLSGLTEDEKE RLLVVINRAVELACTVGRTLKSGTTVNLEVVDVGA" gene complement(3238601..3239470) /gene="fpg" /locus_tag="Rv2924c" /db_xref="GeneID:887438" CDS complement(3238601..3239470) /gene="fpg" /locus_tag="Rv2924c" /EC_number="3.2.2.23" /function="INVOLVED IN BASE EXCISION REPAIR (REPAIR OF OXIDIZED PURINES). THIS ENZYME MAY PLAY A SIGNIFICANT ROLE IN PROCESSES LEADING TO RECOVERY FROM MUTAGENESIS AND/OR CELL DEATH BY ALKYLATING AGENTS [CATALYTIC ACTIVITY HYDROLYSIS OF DNA CONTAINING RING-OPENED N7-METHYLGUANINE RESIDUES, RELEASING 2,6-DIAMINO-4-HYDROXY-5-(N-METHYL)FORMAMIDOPYRIMIDE]." /note="Involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. Acts as DNA glycosylase that recognizes and removes damaged bases" /codon_start=1 /transl_table=11 /product="formamidopyrimidine-DNA glycosylase" /protein_id="NP_217440.1" /db_xref="GI:15610061" /db_xref="GeneID:887438" /translation="MPELPEVEVVRRGLQAHVTGRTITEVRVHHPRAVRRHDAGPADL TARLRGARINGTDRRGKYLWLTLNTAGVHRPTDTALVVHLGMSGQMLLGAVPCAAHVR ISALLDDGTVLSFADQRTFGGWLLADLVTVDGSVVPVPVAHLARDPLDPRFDCDAVVK VLRRKHSELKRQLLDQRVVSGIGNIYADEALWRAKVNGAHVAATLRCRRLGAVLHAAA DVMREALAKGGTSFDSLYVNVNGESGYFERSLDAYGREGENCRRCGAVIRRERFMNRS SFYCPRCQPRPRK" gene complement(3239829..3240551) /gene="rnc" /locus_tag="Rv2925c" /db_xref="GeneID:887873" CDS complement(3239829..3240551) /gene="rnc" /locus_tag="Rv2925c" /EC_number="3.1.26.3" /function="DIGESTS DOUBLE-STRANDED RNA. INVOLVED IN THE PROCESSING OF RIBOSOMAL RNA PRECURSORS AND OF SOME mRNAs [CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE TO 5'-PHOSPHOMONOESTER]." /note="cytoplasmic enzyme involved in processing rRNA and some mRNAs; substrates typically have dsRNA regions; forms a homodimer; have N-terminal nuclease and C-terminal RNA-binding domains; requires magnesium as preferred ion for activity" /codon_start=1 /transl_table=11 /product="ribonuclease III" /protein_id="NP_217441.1" /db_xref="GI:15610062" /db_xref="GeneID:887873" /translation="MIRSRQPLLDALGVDLPDELLSLALTHRSYAYENGGLPTNERLE FLGDAVLGLTITDALFHRHPDRSEGDLAKLRASVVNTQALADVARRLCAEGLGVHVLL GRGEANTGGADKSSILADGMESLLGAIYLQHGMEKAREVILRLFGPLLDAAPTLGAGL DWKTSLQELTAARGLGAPSYLVTSTGPDHDKEFTAVVVVMDSEYGSGVGRSKKEAEQK AAAAAWKALEVLDNAMPGKTSA" misc_feature complement(3240405..3240431) /gene="rnc" /locus_tag="Rv2925c" /note="PS00517 Ribonuclease III family signature" gene complement(3240548..3241171) /locus_tag="Rv2926c" /db_xref="GeneID:887487" CDS complement(3240548..3241171) /locus_tag="Rv2926c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2926c, (MTCY338.15c), len: 207 aa. Conserved hypothetical protein, equivalent to O69468|ML1660|MLCB1243.14 HYPOTHETICAL 23.5 KDA PROTEIN from Mycobacterium leprae (217 aa), FASTA scores: opt: 866, E(): 1.4e-48, (67.2% identity in 192 aa overlap). Also similar in part to other hypothetical proteins e.g. Q9WXZ8 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (182 aa), FASTA scores: opt: 254, E(): 3.4e-09, (31.45% identity in 143 aa overlap); Q9ZBQ9|SC7A1.14 HYPOTHETICAL 23.5 KDA PROTEIN from Streptomyces coelicolor (217 aa), FASTA scores: opt: 244, E(): 1.7e-08, (45.5% identity in 189 aa overlap); O65982 HYPOTHETICAL 26.2 KDA PROTEIN from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (228 aa), FASTA scores: opt: 220, E(): 6.1e-07, (32.45% identity in 148 aa overlap); etc. Equivalent to AAK47323 from Mycobacterium tuberculosis strain CDC1551 (195 aa) but longer 12 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217442.1" /db_xref="GI:15610063" /db_xref="GeneID:887487" /translation="MDLGGVRRRISLMARQHGPTAQRHVASPMTVDIARLGRRPGAMF ELHDTVHSPARIGLELIAIDQGALLDLDLRVESVSEGVLVTGTVAAPTVGECARCLSP VRGRVQVALTELFAYPDSATDETTEEDEVGRVVDETIDLEQPIIDAVGLELPFSPVCR PDCPGLCPQCGVPLASEPGHRHEQIDPRWAKLVEMLGPESDTLRGER" gene complement(3241222..3241959) /locus_tag="Rv2927c" /db_xref="GeneID:887317" CDS complement(3241222..3241959) /locus_tag="Rv2927c" /function="UNKNOWN" /note="Rv2927c, (MTCY338.16c), len: 245 aa. Conserved hypothetical protein, equivalent to Q9CBS6|ML1661|MLCB1243.13 (alias O69467) HYPOTHETICAL PROTEIN from Mycobacterium leprae (247 aa), FASTA scores: opt: 1440, E(): 4.9e-76, (90.6% identity in 245 aa overlap). Also similar to many hypothetical proteins from other organisms e.g. Q9ZBR0|SC7A1.13 HYPOTHETICAL 41.0 KDA PROTEIN from Streptomyces coelicolor (379 aa), FASTA scores: opt: 266, E(): 3.4e-08, (29.9% identity in 234 aa overlap); etc. Also some similarity with P46815|AG84_MYCLE|ML0922 ANTIGEN 84 from Mycobacterium leprae (266 aa), FASTA scores: opt: 193, E(): 0.00043, (28.7% identity in 136 aa overlap) (see citation below); and P46816|AG84_MYCTU|WAG31|Rv2145c|MT2204|MTCY270.23 ANTIGEN 84 from Mycobacterium tuberculosis (260 aa), FASTA scores: opt: 178, E(): 0.0031, (34.35% identity in 131 aa overlap) (see citation below). Contains potential coiled-coil region." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217443.1" /db_xref="GI:15610064" /db_xref="GeneID:887317" /translation="MYRVFEALDELSAIVEEARGVPMTAGCVVPRGDVLELIDDIKDA IPGELDDAQDVLDARDSMLQDAKTHADSMVSSATTEAESILNHARTEADRILSDAKAQ ADRMVSEARQHSERMVADAREEAIRIATAAKREYEASVSRAQAECDRLIENGNISYEK AVQEGIKEQQRLVSQNEVVAAANAESTRLVDTAHAEADRLRGECDIYVDNKLAEFEEF LNGTLRSVGRGRHQLRTAAGTHDYAVR" gene 3242198..3242983 /gene="tesA" /locus_tag="Rv2928" /db_xref="GeneID:887446" CDS 3242198..3242983 /gene="tesA" /locus_tag="Rv2928" /EC_number="3.1.2.-" /function="UNKNOWN; PROBABLY INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2928, (MTCY338.17), len: 261 aa. Probable tesA, thioesterase (EC 3.1.2.-), equivalent to Q9Z5K4|ML2359|MLCB12.04c PUTATIVE THIOESTERASE from Mycobacterium leprae (261 aa), FASTA scores: opt: 1326, E(): 3.7e-80, (73.2% identity in 261 aa overlap). Also similar to others e.g. Q9ZGI1 THIOESTERASE II PIKAV from Streptomyces venezuelae (281 aa), FASTA scores: opt: 535, E(): 6.6e-28, (38.05% identity in 234 aa overlap); Q9L4W2|NYSE thioesterase involved in synthesis of the polyene antifungal antibiotic nystatin from Streptomyces noursei (see Brautaset et al., 2000) (251 aa), FASTA scores: opt: 523, E(): 3.8e-27, (34.53% identity in 223 aa overlap); Q54145 THIOESTERASE from Streptomyces fradiae (253 aa), FASTA scores: opt: 495, E(): 2.7e-25, (37.85% identity in 230 aa overlap); etc." /codon_start=1 /transl_table=11 /product="thioesterase TESA" /protein_id="NP_217444.1" /db_xref="GI:15610065" /db_xref="GeneID:887446" /translation="MLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDY VAFSREFSADVKRIAVQYPGQHDRSGLPPLESIPTLADEIFAMMKPSARIDDPVAFFG HSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSDREMLDLFTRMTG MNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSCPIYAFIGDKDWIATQDDMDP WRDRTTEEFSIRVFPGDHFYLNDNLPELVSDIEDKTLQWHDRA" gene 3242970..3243281 /locus_tag="Rv2929" /db_xref="GeneID:888112" CDS 3242970..3243281 /locus_tag="Rv2929" /function="UNKNOWN" /note="Rv2929, (MTCY338.18), len: 103 aa. Hypothetical unknown protein; unlikely ORF but some weak similarity to C-terminal half of P18319|UREG_KLEAE urease accessory protein from klebsiella aerogenes (205 aa), FASTA scores: opt: 99, E(): 1.1, (38.6% identity in 57 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217445.1" /db_xref="GI:15610066" /db_xref="GeneID:888112" /translation="MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGL ERMASDTHGGGGGRPVTPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTSVL T" gene 3243697..3245448 /gene="fadD26" /locus_tag="Rv2930" /db_xref="GeneID:887603" CDS 3243697..3245448 /gene="fadD26" /locus_tag="Rv2930" /EC_number="2.3.1.86" /function="INVOLVED IN PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS, POSSIBLY BY ACTIVATING SUBSTRATES FOR THE PPS POLYKETIDES SYNTHASE." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_217446.2" /db_xref="GI:57117038" /db_xref="GeneID:887603" /translation="MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWS QVYSRACIIAEELKLCGLPGDRVAVLAPQGLEYVLAFLGALQAGFIAVPLSTPQYGIH DDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLDLDSPRQMPAF SRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYGYFGDPAKIPTGTVVSWLP LYHDMGLILGICAPLVARRRAMLMSPMSFLRRPARWMQLLATSGRCFSAAPNFAFELA VRRTSDQDMAGLDLRDVVGIVSGSERIHVATVRRFIERFAPYNLSPTAIRPSYGLAEA TLYVAAPEAGAAPKTVRFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPE TMVENPPGVVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLG VISDGELFIMGRIKDLLIVDGRNHYPDDIEATIQEITGGRAAAIAVPDDITEQLVAII EFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVSPGSIPITTSGKIRRSA CVERYRSDGFKRLDVAV" gene 3245445..3251075 /gene="ppsA" /locus_tag="Rv2931" /db_xref="GeneID:888183" CDS 3245445..3251075 /gene="ppsA" /locus_tag="Rv2931" /function="INVOLVED IN PHENOLPTHIOCEROL AND PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS: EXTENSION OF C18 WITH MALONY CoA (PARTIAL REDUCTION)." /experiment="experimental evidence, no additional details recorded" /note="Rv2931, (MTCY338.20), len: 1876 aa. ppsA, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q9Z5K6|ML2357|MLCB12.02c PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1871 aa), FASTA scores: opt: 7566, E(): 0, (76.1% identity in 1888 aa overlap); Q9S384|ML2356|MLCB12.01c PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1540 aa), FASTA scores: opt: 4026, E(): 9.8e-212, (45.7% identity in 1811 aa overlap); Q49932|PKSC|L518_F1_2 PUTATIVE POLYKETIDE SYNTHASE (1446 aa), FASTA scores: opt: 4026, E(): 9.4e-212, (70.6% identity in 885 aa overlap). Also similar to polyketide synthases from other bacteria e.g. C-terminus of Q9L8C7|EPOC POLYKETIDE SYNTHASE from Polyangium cellulosum (7257 aa), FASTA scores: opt: 2592, E(): 5.2e-133, (32.55% identity in 2245 aa overlap); P22367|MSAS_PENPA 6-methylsalicylic acid synthase from Penicillium patulum (Penicillium griseofulvum) (1774 aa), FASTA scores: opt: 2391, E(): 0, (34.2% identity in 1815 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. Q10978|PPSB_MYCTU|RV2932 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1538 aa), FASTA scores: opt: 4227, E(): 0, (46.8% identity in 1810 aa overlap) (gap in middle); etc. Contains PS00606 Beta-ketoacyl synthases active site, and PS00012 Phosphopantetheine attachment site. Note that Rv2931|ppsA belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="phenolpthiocerol synthesis type-I polyketide synthase PPSA" /protein_id="NP_217447.1" /db_xref="GI:15610068" /db_xref="GeneID:888183" /translation="MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSR DAVVLSGELSELLGRTVSPIDFWEHPTINALAAYLAAPEPSPDSDAAVKRGARNSLDE PIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPPEVAAALARTT RWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWEALEHAGIPPGTLRRSATG VFAGACLSEYGAMASADLSQVDGWSNSGGAMSIIANRLSYFLDLRGPSVAVDTACSSS LVAIHLACQSLRTQDCHLAIAAGVNLLLSPAVFRGFDQVGALSPTGQCRAFDATADGF VRGEGAGVVVLKRLTDAQRDGDRVLAVICGSAVNQDGRSNGLMAPNPAAQMAVLRAAY TNAGMQPSEVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTE AAAGIAGFIKTVLAVQHGQIPPNQHFETANPHIPFTDLRMKVVDTQTEWPATGHPRRA GVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGKTMQRVSATAGMLADWM EGPGADVALADVAHTLNHHRSRQPKFGTVVARDRTQAIAGLRALAAGQHAPGVVNPAD GSPGPGTVFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLHDVLANG EELVGIEQIQLGLIGMQLALTELWCSYGVRPDLVIGHSMGEVAAAVVAGALTPAEGLR VTATRSRLMAPLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQI DELIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYAD LHTQPVFDAEHWATNMRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIIDTLH SAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHPPHTPHPPEPHPPIPTTPWQHTR HWITTKYPAGSVGSAPRAGTLLGQHTTVATVSASPPSHLWQARLAPDAKPYQGGHRFH QVEVVPASVVLHTILSAATELGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSP AAGTPSDRWTRHVTAQLSSSPSDSASSLNEHHRANGQPPERAHRDLIPDLAELLAMRG IDGLPFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADSRLYV PASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPSLSMRSLRYRALD FGLDVGRAQPPASTGPVEAYCDATNFVHTIDWQPQTVPDATHPGAEQVTHPGPVAIIG DDGAALCETLEGAGYQPAVMSDGVSQARYVVYVADSDPAGADETDVDFAVRICTEITG LVRTLAERDADKPAALWILTRGVHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDL AINDDLGEFGPALAELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYL ITGGLGALGLLMADWLADRGAHRLVLTGRTPLPPRRDWQLDTLDTELRRRIDAIRALE MRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQLVTSMTGDAVR QVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQGSYAAANSYLDALARARR QQGCHTMSLDWVAWRGLGLAADAQLVSEELARMGSRDITPSEAFTAWEFVDGYDVAQA VVVPMPAPAGADGSGANAYLLPARNWSVMAATEVRSELEQGLRRIIAAELRVPEKELD TDRPFAELGLNSLMAMAIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHDVS QDNQISALSSSAGSVLDSLFDRIESAPPEAERSV" misc_feature 3246234..3246284 /gene="ppsA" /locus_tag="Rv2931" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 3250815..3250862 /gene="ppsA" /locus_tag="Rv2931" /note="PS00012 Phosphopantetheine attachment site" gene 3251072..3255688 /gene="ppsB" /locus_tag="Rv2932" /db_xref="GeneID:888023" CDS 3251072..3255688 /gene="ppsB" /locus_tag="Rv2932" /function="INVOLVED IN PHENOLPTHIOCEROL AND PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS: EXTENSION WITH MALONY CoA (PARTIAL REDUCTION)." /experiment="experimental evidence, no additional details recorded" /note="Rv2932, (MTV011.01, MTCY338.21, MT3002), len 1538 aa. ppsB, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q9S384|ML2356|MLCB12.01c PUTATIVE POLYKETIDE SYNTHASE (1540 aa), FASTA scores: opt: 7284, E(): 0, (76.3% identity in 1561 aa overlap); Q49932|PKSC|L518_F1_2 PUTATIVE POLYKETIDE SYNTHASE (1446 aa), FASTA scores: opt: 6811, E(): 0, (76.2% identity in 1462 aa overlap); etc. Also similar to polyketide synthases from other bacteria e.g. Q9KIZ6|EPOE EPOE PROTEIN from Polyangium cellulosum (3798 aa), FASTA scores: opt: 3052, E(): 3.3e-165, (38.35% identity in 1538 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. Q10977|PPSA_MYCTU|RV2931 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1876 aa), FASTA scores: opt: 4227, E(): 0, (46.9% identity in 1810 aa overlap); P96203|PPSD|Rv2934|MTCY19H9.02 PKSE PROTEIN (1827 aa), FASTA scores: opt: 3756, E(): 1.8e-205, (42.9% identity in 1808 aa overlap); etc. Overlaps and extends CDS from neighbouring cosmid MTCY338.21. Contains PS00606 Beta-ketoacyl synthases active site. Note that Rv2932|ppsB belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="phenolpthiocerol synthesis type-I polyketide synthase PPSB" /protein_id="NP_217448.1" /db_xref="GI:15610069" /db_xref="GeneID:888023" /translation="MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCR FPGDVDGPESFWDFLVAGRNAISTVPADRWDAEAFYHPDPLTPGRMTTKWGGFVPDVA GFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTAVMMGVYFNEY QSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVAVDTACSSSLVAVHLACQS LRLRETDLALAGGVSITLRPETQIAISAWGLLSPQGRCAAFDAAADGFVRGEGAGVVV LKRLTDAVRDGDQVLAVVRGSAVNQDGRSNGVTAPNTAAQCDVIADALRSGDVAPDSV NYVEAHGTGTVLGDPIEFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATL AVQRATIPPNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAH VIIEQGSELAPVSEGGEDTGVSTLVVTGKTAQRMAATAQVLADWMEGPGAEVAVADVA HTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVSHQDGSPGPGTVFVYSG RGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLRDVIATGKELVGIEQIQLGL IGMQLTLTELWRSYGVQPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRARLMAPLS GQGGMALLGLDAAATEALIADYPQVTVGIYNSPRQTVIAGPTEQIDELIARVRAQNRF ASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWA TNMRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTKSAAKYL SIGTLQRDADDTVTFRTNLYTADIAHPPHTCHPPEPHPTIPTTPWQHTHHWIATTHPS TAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPIRPAVSADPPSTAAWLVVADNEL CHELARAADSRVDSLSPPALAAGSDPAALLDALRGVDNVLYAPPVPGELLDIESAYQV FHATRRLAAAMVASSATAISPPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEH PEIWGGIIDLDDSMPAELAVRHVLTAAHGTDGEDQVVYRSGARHVPRLQRRTLPGKPV TLNADASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATGTDLI AVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTDDDVTTMFRPKLD ALALLHRRSLKSPVRHFVLFSSVSGLLGSRWLAHYTATSAFLDSFAGARRTMGLPATV VDWGLWKSLADVQKDATQISAESGLQPMADEVAIGALPLVMNPDAAVATVVVAADWPL LAAAYRTRGALRIVDDLLPAPEDVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATV MGMPPTEPLDPSAGFFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYL ATVLPELLEIGATAVATQQATDSYHELTEAELLEQLSERLRGTQ" misc_feature 3251657..3251707 /gene="ppsB" /locus_tag="Rv2932" /note="PS00606 Beta-ketoacyl synthases active site" gene 3255685..3262251 /gene="ppsC" /locus_tag="Rv2933" /db_xref="GeneID:887686" CDS 3255685..3262251 /gene="ppsC" /locus_tag="Rv2933" /function="INVOLVED IN PHENOLPTHIOCEROL AND PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS: EXTENSION WITH MALONY CoA (COMPLETE REDUCTION)." /experiment="experimental evidence, no additional details recorded" /note="Rv2933, (MTCY19H9.01, MTV011.02), len: 2188 aa. ppsC, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q49933|PKSD|ML2355|L518_F1_3 PUTATIVE POLYKETIDE SYNTHASE (2201 aa), FASTA scores: opt: 6973, E(): 0, (82.32% identity in 2217 aa overlap); Q49624|PKS3|MASA|ML1229|B1170_C2_209 PROBABLE MYCOCEROSIC ACID SYNTHASE (2118 aa), FASTA scores: opt: 4015, E(): 2.9e-208, (36.6% identity in 2184 aa overlap); etc. Also similar to polyketide synthases from other bacteria e.g. C-terminus of Q9L8C7 POLYKETIDE SYNTHASE from Polyangium cellulosum (7257 aa), FASTA scores: opt: 3909, E(): 3.6e-202, (40.15% identity in 2220 aa overlap); Q9KIZ7|EPOD EPOD PROTEIN from Polyangium cellulosum (7257 aa), FASTA scores: opt: 3886, E(): 6.2e-201, (40.05% identity in 2220 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. P96291|Rv2940c (2111 aa), FASTA scores: opt: 4204, E(): 0, (39.1% identity in 2176 aa overlap); Q10977|PPSA_MYCTU|RV2931 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1876 aa), FASTA scores: opt: 3793, E(): 2.4e-196, (46.65% identity in 1612 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, and PS00012 Phosphopantetheine attachment site. Note that Rv2933|ppsC belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="phenolpthiocerol synthesis type-I polyketide synthase PPSC" /protein_id="NP_217449.1" /db_xref="GI:15610070" /db_xref="GeneID:887686" /translation="MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGC RFPGGVNNPEQFWDLLCAGRSGIVRVPAQRWDADAYYCDDHTVPGTICSTEGGFLTSW QPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGTQTSVFVGVTA YDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGARGPAVVIDTACSSSLVAVH LACQSLRGRESDMALVGGTNLLLSPGPSIACSRWGMLSPEGRCKTFDASADGYVRGEG AAVVVLKRLDDAVRDGNRILAVVRGSAVNQDGASSGVTVPNGPAQQALLAKALTSSKL TAADIDYVEAHGTGTPLGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVA GLMKAVLAVHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFG VSGTNAHVVIEQAPDPMAAAGTEPQRGPVPAVSTLVVFGKTAPRVAATASVLADWLDG PGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRALAAGQSAPGVVAPREGS IGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAIAELEPEFVAQGGFSLRDVIAGGKE LVGIEQIQLGLIGMQLALTALWRSYGVTPDAVIGHSMGEVAAAVVAGALTPAQGLRVT AVRSRLMAPLSGQGTMALLELDAEATEALIADYPEVSLGIYASPRQTVISGPPLLIDE LIDKVRQQNGFATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLG ISLGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSISDTLRAS YDVDNYLSIGTLQRDAHDTLEFHTNLNTTHTTHPPQTPHPPEPHPVLPTTPWQHTQHW ITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELDPDLLWLADHVIDDLVVLPGAAY AEIALAAATDTFAVEQDQPWMISELDLRQMLHVTPGTVLVTTLTGDEQRCQVEIRTRS GSSGWTTHATATVARAEPLAPLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGP AFQGIVGLAVTQAGVARAQVRLPASARTGSREFMLHPVMMDIALQTLGATRTATDLAG GQDARQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTDANGQ PLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGALLLIGDPAAGDPL LPALQSSLRDRITDLELASAADEATLRAAISRTSWDGIVVVCPPRANDESMPDEAQLE LARTRTLLVASVVETVTRMGARKSPRLWIVTRGAAQFDAGESVTLAQTGLRGIARVLT FEHSELNTTLVDIEPDGTGSLAALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSG DLAAEARHQVVNLDSSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAA GLNFSDVLKAMGVYPGLDGAAPVIGGECVGYVTAIGDEVDGVEVGQRVIAFGPGTFGT HLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVLIHSATGGVGM AAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSVDFADEILELTDGYGVDVV LNSLAGEAIQRGVQILAPGGRFIELGKKDVYADASLGLAALAKSASFSVVDLDLNLKL QPARYRQLLQHILQHVADGKLEVLPVTAFSLHDAADAFRLMASGKHTGKIVISIPQHG SIEAIAAPPPLPLVSRDGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVA AAIAELNASGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNM TDSAARRVFAPKVTGSWRLHVATAARDVDWWLTFSSAAALLGTPGQGAYAAANSWVDG LVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQGLAAMQAVLTADRGRTG VFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSGQRRGGGAIRAQLDALDAAERPGHL ASAIADEIRAVLRSGDPIDHHRPLETLGLDSLMGLELRNRLEASLGITLPVALVWAYP TISDLATALCERMDYATPAAAQEISDTEPELSDEEMDLLADLVDASELEAATRGES" misc_feature 3256285..3256335 /gene="ppsC" /locus_tag="Rv2933" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 3261982..3262029 /gene="ppsC" /locus_tag="Rv2933" /note="PS00012 Phosphopantetheine attachment site" gene 3262248..3267731 /gene="ppsD" /locus_tag="Rv2934" /db_xref="GeneID:887172" CDS 3262248..3267731 /gene="ppsD" /locus_tag="Rv2934" /function="INVOLVED IN PHENOLPTHIOCEROL AND PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS: EXTENSION WITH METHYLMALONY CoA (PARTIAL REDUCTION)." /experiment="experimental evidence, no additional details recorded" /note="Rv2934, (MTCY19H9.02), len: 1827 aa. ppsD, type-I polyketide synthase (see citations below), highly similar to others from Mycobacterium leprae e.g. Q9CB70|ML2354 POLYKETIDE SYNTHASE (1822 aa), FASTA scores: opt: 9779, E(): 0, (80.35% identity in 1836 aa overlap); Q49940|L518_F3_67|PFSE (1815 aa), FASTA scores: opt: 9658, E(): 0, (79.85% identity in 1831 aa overlap); etc. Also similar to polyketide synthases from other bacteria e.g. C-terminus of Q9RNB2|MCYD|Q9FDU1 POLYKETIDE SYNTHASE (MCYD PROTEIN) from Microcystis aeruginosa (3906 aa), FASTA scores: opt: 2961, E(): 6e-159, (32.15% identity in 1827 aa overlap); etc. And also highly similar to others from Mycobacterium tuberculosis e.g. Q10978|PPSB_MYCTU|RV2932 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE (1538 aa), FASTA scores: opt: 3756, E(): 3.8e-204, (42.85% identity in 1808 aa overlap) (gaps in middle); P96202|PPSC|RV2933 POLYKETIDE SYNTHASE (2188 aa), FASTA scores: opt: 3463, E(): 1.7e-187, (39.2% identity in 2165 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site, PS00017 ATP/GTP-binding site motif A, PS00013 Prokaryotic membrane lipoprotein lipid attachment site, and PS00012 Phosphopantetheine attachment site. Note that Rv2934|ppsD belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="phenolpthiocerol synthesis type-I polyketide synthase PPSD" /protein_id="NP_217450.1" /db_xref="GI:15610071" /db_xref="GeneID:887172" /translation="MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIG CRFPGNVTGPESFWQLLADGVDTIEQVPPDRWDADAFYDPDPSASGRMTTKWGGFVSD VDAFDADFFGITPREAVAMDPQHRMLLEVAWEALEHAGIPPDSLSGTRTGVMMGLSSW DYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPAVAVDTACSSSLVAIHLAC QSLRLRETDVALAGGVQLTLSPFTAIALSKWSALSPTGRCNSFDANADGFVRGEGCGV VVLKRLADAVRDQDRVLAVVRGSATNSDGRSNGMTAPNALAQRDVITSALKLADVTPD SVNYVETHGTGTVLGDPIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAG FIKAVLAVQRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGL SGTNAHVVVEQAPDTAVAAAGGMPYVSALNVSGKTAARVASAAAVLADWMSGPGAAAP LADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPRVGVVDCDQHAGGPGRV FVYSGQGSQWASMGQQLLANEPAFAKAVAELDPIFVDQVGFSLQQTLIDGDEVVGIDR IQPVLVGMQLALTELWRSYGVIPDAVIGHSMGEVSAAVVAGALTPEQGLRVITTRSRL MARLSGQGAMALLELDADAAEALIAGYPQVTLAVHASPRQTVIAGPPEQVDTVIAAVA TQNRLARRVEVDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADAD YWSANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVMSTMNRE LDQTLYFHAQLAAVGVAASEHTTGRLVDLPPTPWHHQRFWVTDRSAMSELAATHPLLG AHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIMPAAGFAEIALAAASEALGTAAD AVAPNIVINQFEVEQMLPLDGHTPLTTQLIRGGDSQIRVEIYSRTRGGEFCRHATAKV EQSPRECAHAHPEAQGPATGTTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAE TEISIPDEAPRHPGYRLHPVVLDAALQSVGAAIPDGEIAGSAEASYLPVSFETIRVYR DIGRHVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLPLEQK IFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKSMADDFVQQWRSP MRRVHTADIHDESAVLAAFAETAGDPEHPPVGVVVFVGGASSRLDDELAAARDTVWSI TTVVRAVVGTWHGRSPRLWLVTGGGLSVADDEPGTPAAASLKGLVRVLAFEHPDMRTT LVDLDITQDPLTALSAELRNAGSGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVR QGASYVVTGGLGGLGLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVV RGDVASPGVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKATGA LRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRRASGLPAAVIN WGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIRTGVARLRADRALVAFPEI RSISYFTQVVEELDSAGDLGDWGGPDALADLDPGEARRAVTERMCARIAAVMGYTDQS TVEPAVPLDKPLTELGLDSLMAVRIRNGARADFGVEPPVALILQGASLHDLTADLMRQ LGLNDPDPALNNADTIRDRARQRAAARHGAAMRRRPKPEVQGG" misc_feature 3262839..3262889 /gene="ppsD" /locus_tag="Rv2934" /note="PS00606 Beta-ketoacyl synthases active site" misc_feature 3263679..3263702 /gene="ppsD" /locus_tag="Rv2934" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 3266976..3267008 /gene="ppsD" /locus_tag="Rv2934" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" misc_feature 3267465..3267512 /gene="ppsD" /locus_tag="Rv2934" /note="PS00012 Phosphopantetheine attachment site" gene 3267737..3272203 /gene="ppsE" /locus_tag="Rv2935" /db_xref="GeneID:888210" CDS 3267737..3272203 /gene="ppsE" /locus_tag="Rv2935" /function="INVOLVED IN PHENOLPTHIOCEROL AND PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS: EXTENSION WITH MALONY CoA (PARTIAL REDUCTION, DECARBOXYLATION)." /experiment="experimental evidence, no additional details recorded" /note="Rv2935, (MTCY19H9.03), len: 1488 aa. ppsE, type-I polyketide synthase (see citations below), equivalent to Q49934|PKSF|ML2353|L518_F1_8 PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1489 aa), FASTA scores: opt: 8156, E(): 0, (82.05% identity in 1493 aa overlap). Also similar to polyketide synthases from other bacteria e.g. Q9RAH3|NOSB NOSB PROTEIN from Nostoc sp. GSV224 (1244 aa), FASTA scores: opt: 2438, E(): 8.8e-137, (43.75% identity in 969 aa overlap); Q9KIZ8|EPOC EPOC PROTEIN from Polyangium cellulosum (1832 aa), FASTA scores: opt: 2272, E(): 8.6e-127, (39.95% identity in 1061 aa overlap); O54155|SC3F7.12 POLYKETIDE SYNTHASE from Streptomyces coelicolor (2297 aa), FASTA scores: opt: 1522, E(): 3.6e-82, (36.35% identity in 1057 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site. Note that Rv2935|ppsE belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="phenolpthiocerol synthesis type-I polyketide synthase PPSE" /protein_id="NP_217451.1" /db_xref="GI:15610072" /db_xref="GeneID:888210" /translation="MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQE LRDAGVSDKTLADPAYVRRAPLLDGIDEFDAGFFGFPPLAAQVLDPQHRLFLQCAWHA LEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFDQFSLFLQNDK DFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLSGECDMALAGGSSLCIPHR VGYFTSPGSMVSAVGHCRPFDVRADGTVFGSGVGLVVLKPLAAAIDAGDRIHAVIRGS AINNDGSAKMGYAAPNPAAQADVIAEAHAVSGIDSSTVSYVECHGTGTPLGDPIEIQG LRAAFEVSQTSRSAPCVLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSP NPELRLDQSPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHA EPAGPQVILLSAQTAAALGESRTALAAALETQDGPRLSDVAYTLARRRKHNVTMAAVV HDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRVVFLFPGQGAQHVGMAK GLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFDGTATDLERIDRSQPALFTVEYALA KLVDTFGVRAGAYIGYSTGEYIAATLAGVFDLQTAIKTVSLRARLMHESPPGAMVAVA LGPDDVTQYLPPEVELSAVNDPGNCVVAGPKDQIRALRQRLTEAGIPVRRVRATHAFH TSAMDPMLGQFQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFA DELDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDDRDTFLR ALGELWSAGVEVDWTPRRPAVPHLVSLPGYPFARQRHWVEPNHTVWAQAPGANNGSPA GTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVSSVDRNANFFDLGGDSLMAISIA MAAANEGLTITPQDLYEYPTLASLTAAVDASFASSGLAKPPEAQANPAVPPNVTYFLD RGLRDTGRCRVPLILRLDPKIGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAP AEFTGLSNRSVPNGVAAGSPEERAAVLGILAELLEDQTDPNAPLAAVHIAAAHGGPHY LCLAIHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAALATHP AALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLSVEQTSELDDGRR RFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSVLRPDVDLRRTVGWFTTYYPV PLACATGLGALAQLDAVHNTLKSVPHYGIGYGLLRYVYAPTGRVLGAQRTPDIHFRYA GVIPELPSGDAPVQFDSDMTLPVREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPA ATAEALERTFPLALSALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG" misc_feature 3268259..3268309 /gene="ppsE" /locus_tag="Rv2935" /note="PS00606 Beta-ketoacyl synthases active site" gene 3272214..3273209 /gene="drrA" /locus_tag="Rv2936" /db_xref="GeneID:888168" CDS 3272214..3273209 /gene="drrA" /locus_tag="Rv2936" /function="PROBABLY INVOLVED IN ACTIVE TRANSPORT OF ANTIBIOTIC AND PHTHIOCEROL DIMYCOCEROSATE (DIM) ACROSS THE MEMBRANE (EXPORT). DRRA, DRRB|Rv2936|MTCY19H9.05 AND DRRC|Rv2938|MTCY19H9.06 MAY ACT JOINTLY TO CONFER DAUNORUBICIN AND DOXORUBICIN RESISTANCE BY AN EXPORT MECHANISM. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv2936, (MTCY19H9.04), len: 331 aa. Probable drrA, daunorubicin-DIM-transport resistance ATP-binding protein ABC transporter, probably involved in daunorubicin resistance and phthiocerol dimycocerosate transport (see citations below), equivalent to Q49938|DRRA|ML2352|L518_F2_43|DRRA PROBABLE DAUNORUBICIN RESISTANCE ATP-BINDING PROTEIN from Mycobacterium leprae (331 aa), FASTA scores: opt: 1842, E(): 4.2e-103, (85.2% identity in 331 aa overlap). Also highly similar to others e.g. Q9XCF7 DRRA from Mycobacterium avium (315 aa), FASTA scores: opt: 1040, E(): 4.7e-55, (54.35% identity in 309 aa overlap); Q9X5J8 DAUNORUBICIN RESISTANCE PROTEIN A from Mycobacterium avium (315 aa), FASTA scores: opt: 1030, E(): 1.9e-54, (53.7% identity in 309 aa overlap); P32010|DRRA_STRPE DAUNORUBICIN RESISTANCE ATP-BINDING PROTEIN from Streptomyces peucetius (330 aa), FASTA scores: opt: 852, E(): 9e-44, (47.15% identity in 318 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00211 ABC transporters family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). Note that Rv2936|drrA belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="daunorubicin-DIM-transport ATP-binding protein ABC transporter DrrA" /protein_id="NP_217452.1" /db_xref="GI:15610073" /db_xref="GeneID:888168" /translation="MRNDDMAVVVNGVRKTYGKGKIVALDDVSFKVRRGEVIGLLGPN GAGKTTMVDILSTLTRPDAGSAIIAGYDVVSEPAGVRRSIMVTGQQVAVDDALSGEQN LVLFGRLWGLSKSAARKRAAELLEQFSLVHAGKRRVGTYSGGMRRRIDIACGLVVQPQ VAFLDEPTTGLDPRSRQAIWDLVASFKKLGIATLLTTQYLEEADALSDRIILIDHGII IAEGTANELKHRAGDTFCEIVPRDLKDLDAIVAALGSLLPEHHRAMLTPDSDRITMPA PDGIRMLVEAARRIDEARIELADIALRRPSLDHVFLAMTTDPTESLTHLVSGSAR" misc_feature 3272337..3272360 /gene="drrA" /locus_tag="Rv2936" /note="PS00017 ATP/GTP-binding site motif A" misc_feature 3272634..3272678 /gene="drrA" /locus_tag="Rv2936" /note="PS00211 ABC transporters family signature" gene 3273206..3274075 /gene="drrB" /locus_tag="Rv2937" /db_xref="GeneID:887968" CDS 3273206..3274075 /gene="drrB" /locus_tag="Rv2937" /function="PROBABLY INVOLVED IN ACTIVE TRANSPORT OF ANTIBIOTIC AND PHTHIOCEROL DIMYCOCEROSATE (DIM) ACROSS THE MEMBRANE (EXPORT). DRRA|Rv2934|MTCY19H9.04, DRRB AND DRRC|Rv2938|MTCY19H9.06 MAY ACT JOINTLY TO CONFER DAUNORUBICIN AND DOXORUBICIN RESISTANCE BY AN EXPORT MECHANISM. PROBABLY RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE AND LOCALIZATION OF DIM INTO THE CELL WALL." /experiment="experimental evidence, no additional details recorded" /note="Rv2937, (MTCY19H9.05), len: 289 aa. Probable drrB, daunorubicin-DIM-transport integral membrane protein ABC transporter, probably involved in daunorubicin resistance and phthiocerol dimycocerosate transport (see citations below), equivalent to Q49935|DRRB|ML2351|L518_F1_9 DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Mycobacterium leprae (288 aa), FASTA scores: opt: 1252, E(): 5.3e-72, (64.0% identity in 289 aa overlap). Also similar to others e.g. Q9XCF8 DRRB PROTEIN from Mycobacterium avium (246 aa), FASTA scores: opt: 423, E(): 1.5e-19, (30.85% identity in 243 aa overlap); Q9S6H4 DAUNORUBICIN RESISTANCE PROTEIN B from Mycobacterium avium (246 aa), FASTA scores: opt: 420, E(): 2.3e-19, (30.85% identity in 243 aa overlap); P32011|DRRB_STRPE DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Streptomyces peucetius (283 aa), FASTA scores: opt: 242, E(): 4.7e-08, (27.85% identity in 219 aa overlap); etc. Note that Rv293|drrB belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="daunorubicin-DIM-transport integral membrane protein ABC transporter DrrB" /protein_id="NP_217453.1" /db_xref="GI:15610074" /db_xref="GeneID:887968" /translation="MSGPAIDASPALTFNQSSASIQQRRLSTGRQMWVLYRRFAAPSL LNGEVLTTVGAPIIFMVGFYIPFAIPWNQFVGGASSGVASNLGQYITPLVTLQAVSFA AIGSGFRAATDSLLGVNRRFQSMPMAPLTPLLARVWVAVDRCFTGLVISLVCGYVIGF RFHRGALYIVGFCLLVIAIGAVLSFAADLVGTVTRNPDAMLPLLSLPILIFGLLSIGL MPLKLFPHWIHPFVRNQPISQFVAALRALAGDTTKTASQVSWPVMAPTLTWLFAFVVI LALSSTIVLARRP" gene 3274072..3274902 /gene="drrC" /locus_tag="Rv2938" /db_xref="GeneID:888491" CDS 3274072..3274902 /gene="drrC" /locus_tag="Rv2938" /function="PROBABLY INVOLVED IN ACTIVE TRANSPORT OF ANTIBIOTIC AND PHTHIOCEROL DIMYCOCEROSATE (DIM) ACROSS THE MEMBRANE (EXPORT). DRRA|Rv2934|MTCY19H9.04, DRRB|Rv2937|MTCY19H9.05 AND DRRC MAY ACT JOINTLY TO CONFER DAUNORUBICIN AND DOXORUBICIN RESISTANCE BY AN EXPORT MECHANISM. PROBABLY RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE AND LOCALIZATION OF DIM INTO THE CELL WALL." /experiment="experimental evidence, no additional details recorded" /note="Rv2938, (MTCY19H9.06), len: 276 aa. Probable drrC, daunorubicin-DIM-transport integral membrane protein ABC transporter, probably involved in daunorubicin resistance and phthiocerol dimycocerosate transport (see citations below), equivalent to Q9CB71|ML2350 PROBABLE ANTIBIOTIC RESISTANCE MEMBRANE PROTEIN from Mycobacterium leprae (276 aa), FASTA scores: opt: 1434, E(): 1.2e-81, (79.0% identity in 276 aa overlap); and Q49941|DRRC|L518_F3_76 PUTATIVE DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Mycobacterium leprae (244 aa), FASTA scores: opt: 1194, E(): 8.3e-67, (76.85% identity in 242 aa overlap). Also similar to others e.g. Q9XCF9 DRRC PROTEIN from Mycobacterium avium (263 aa), FASTA scores: opt: 538, E(): 3.7e-26, (32.65% identity in 251 aa overlap); Q9S6H3 DAUNORUBICIN RESISTANCE PROTEIN C from Mycobacterium avium (263 aa), FASTA scores: opt: 533, E(): 7.6e-26, (32.25% identity in 251 aa overlap); P32011|DRRB_STRPE DAUNORUBICIN RESISTANCE TRANSMEMBRANE PROTEIN from Streptomyces peucetius (283 aa), FASTA scores: opt: 276, E(): 6.6e-10, (21.07% identity in 261 aa overlap); etc. Note that Rv2938|drrC belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentaly)." /codon_start=1 /transl_table=11 /product="daunorubicin-DIM-transport integral membrane protein ABC transporter DrrC" /protein_id="NP_217454.1" /db_xref="GI:15610075" /db_xref="GeneID:888491" /translation="MITTTSQEIELAPTRLPGSQNAARLFVAQTLLQTNRLLTRWARD YITVIGAIVLPILFMVVLNIVLGNLAYVVTHDSGLYSIVPLIALGAAITGSTFVAIDL MRERSFGLLARLWVLPVHRASGLISRILANAIRTLVTTLVMLGTGVVLGFRFRQGLIP SLMWISVPVILGIAIAAMVTTVALYTAQTVVVEGVELVQAIAIFFSTGLVPLNSYPGW IQPFVAHQPVSYAIAAMRGFAMGGPVLSPMIGMLVWTAGICVVCAVPLAIGYRRASTH" gene 3274949..3276217 /gene="papA5" /locus_tag="Rv2939" /db_xref="GeneID:887327" CDS 3274949..3276217 /gene="papA5" /locus_tag="Rv2939" /function="THOUGHT TO BE INVOLVED IN PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS." /experiment="experimental evidence, no additional details recorded" /note="required for PDIM synthesis; phthiocerol and phthiodiolone dimycocerosate esters are scaffolds used for virulence-enhancing lipids; proposed to catalyze diesterification of phthiocerol and phthiodolone with mycocerosate; functions in polyketide synthesis" /codon_start=1 /transl_table=11 /product="acyltransferase PapA5" /protein_id="NP_217455.1" /db_xref="GI:15610076" /db_xref="GeneID:887327" /translation="MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFD ALLETHPVLASHLEQSSDGGWNLVADDLLHSGICVIDGTAATNGSPSGNAELRLDQSV SLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTGDPGPITPQPT PLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPAVLAHPGLPQAVPVTRLWL SKQQTSDLMAFGREHRLSLNAVVAAAILLTEWQLRNTPHVPIPYVYPVDLRFVLAPPV APTEATNLLGAASYLAEIGPNTDIVDLASDIVATLRADLANGVIQQSGLHFGTAFEGT PPGLPPLVFCTDATSFPTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEH HGHIAEPGKSLEAIRSLLCTVPSEYGWIME" gene complement(3276380..3282715) /gene="mas" /locus_tag="Rv2940c" /db_xref="GeneID:887982" CDS complement(3276380..3282715) /gene="mas" /locus_tag="Rv2940c" /function="CATALYZES THE ELONGATION OF N-FATTY ACYL-COA WITH METHYLAMALONYL-CoA (NOT MALONYL-COA) AS THE ELONGATING AGENT TO FORM MYCOCEROSYL LIPIDS." /experiment="experimental evidence, no additional details recorded" /note="Rv2940c, (MTCY24G1.09, MTCY19H9.08c), len: 2111 aa. Probable mas, mycocerosic acid synthase membrane associated, multifunctional enzyme (see citations below), almost identical to Q02251|MCAS_MYCBO|MAS MYCOCEROSIC ACID SYNTHASE from Mycobacterium bovis (2110 aa), FASTA scores: opt: 13226, E(): 0, (95.8% identity in 2115 aa overlap) (see Mathur & Kolattukudy 1992); and equivalent to Q9CD78|MAS|ML0139 PUTATIVE MYCOCEROSIC SYNTHASE from Mycobacterium leprae (2116 aa), FASTA scores: opt: 12142, E(): 0, (87.95% identity in 2119 aa overlap); and Q49624|PKS3|MASA|ML1229|B1170_C2_209 PROBABLE MYCOCEROSIC ACID SYNTHASE from Mycobacterium leprae (2118 aa), FASTA scores: opt: 8421, E(): 0, (60.8% identity in 2127 aa overlap). Also similar to other synthases e.g. C-terminus of Q9L8C7|EPOC POLYKETIDE SYNTHASE from Polyangium cellulosum (7257 aa), FASTA scores: opt: 4332, E(): 0, (40.85% identity in 2149 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. O53901|PKS5|Rv1527c|MTV045.01c|MTCY19G5.01 POLYKETIDE SYNTHASE (2108 aa), FASTA scores: opt: 5059, E(): 0, (65.9% identity in 2121 aa overlap); etc. Contains several domains, organized in the following order: beta-ketoacyl synthase (PS00606), acyl transferase, dehydratase-enoyl reductase, beta-ketoreductase, acyl carrier protein. Contains PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="multifunctional mycocerosic acid synthase membrane-associated MAS" /protein_id="NP_217456.1" /db_xref="GI:15610077" /db_xref="GeneID:887982" /translation="MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPD RWDADDYYDPEPGVPGRSVSRWGGFLDDVAGFDAEFFGISEREATSIDPQQRLLLETS WEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTGLNNSVASGRI AHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLALAGGCAVLLEPHASVAASA QGMLSSTGRCHSFDADADGFVRSEGCAMVLLKRLPDALRDGNRIFAVVRGTATNQDGR TETLTMPSEDAQVAVYRAALAAAGVQPETVGVVEAHGTGTPIGDPIEYRSLARVYGAG TPCALGSAKSNMGHSTASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFV PQAVTPWPNGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLF MLSSTSSDALRQTARQLATWVEEHQDCVAASDLAYTLARGRAHRPVRTAVVAANLPEL VEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQLLASEPVFAATIAKLE PVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAVQVALAATMEQTYGVRPGAVVGHSM GESAAAVVAGALSLEDAARVICRRSKLMTRIAGAGAMGSVELPAKQVNSELMARGIDD VVVSVVASPQSTVIGGTSDTVRDLIARWEQRDVMAREVAVDVASHSPQVDPILDDLAA ALADIAPMTPKVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVF AELSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGAALDYSA LYPAGRLVDAPLPAWTHARLFIDDDGQEQRAQGACTITVHPLLGSHVRLTEEPERHVW QGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAAAEVFGEAAEVRDITFEQMLLLD EQTPIDAVASIDAPGVVNFTVETNRDGETTRHATAALRAAEDDCPPPGYDITALLQAH PHAVNGTAMRESFAERGVTLGAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYR IHPALLDACFQSVGAGVQAGTATGGLLLPLGVRSLRAYGPTRNARYCYTRLTKAFNDG TRGGEADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRALPEVG DGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWSVQDTPPNDQAGL EKLGSQLRGRDGVVIVYGPRVGDPDEHSLLAGREQVRHLVRITRELAEFEGELPRLFV VTRQAQIVKPHDSGERANLEQAGLRGLLRVISSEHPMLRTTLIDVDEHTDVERVAQQL LSGSEEDETAWRNGDWYVARLTPSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEF VASDRVPPGPGQIEVAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEG VTGHQVGDRVGGFSEGGCWRTFLTCDANLAVTLPPGLTDEQAITAATAHATAWYGLND LAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLRDMGVEHVYDS RSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFGGRFVEIGKADVYGNTRLG LFPFRRGLTFYYLDLALMSVTQPDRVRELLATVFKLTADGVLTAPQCTHYPLAEAADA IRAMSNAEHTGKLVLDVPRSGRRSVAVTPEQAPLYRRDGSYIITGGLGGLGLFFASKL AAAGCGRIVLTARSQPNPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATATG LPLRGVLHSAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSG AALLGSPGQGAYAAANSWVDVFAHWRRAQGLPVSAIAWGAWGEVGRATFLAEGGEIMI TPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGEMFASTGQRSRGPSKFR MELLSLPQDEWAGRLRRLLVEQASVILRRTIDADRSFIEYGLDSLGMLEMRTHVETET GIRLTPKVIATNNTARALAQYLADTLAEEQAAAPAAS" misc_feature complement(3276506..3276553) /gene="mas" /locus_tag="Rv2940c" /note="PS00012 Phosphopantetheine attachment site" misc_feature complement(3282164..3282214) /gene="mas" /locus_tag="Rv2940c" /note="PS00606 Beta-ketoacyl synthases active site" gene 3283335..3285077 /gene="fadD28" /locus_tag="Rv2941" /db_xref="GeneID:887454" CDS 3283335..3285077 /gene="fadD28" /locus_tag="Rv2941" /EC_number="2.3.1.86" /function="INVOLVED IN PHTHIOCEROL DIMYCOCEROSATE (DIM) BIOSYNTHESIS. THOUGHT TO BE INVOLVED IN THE RELEASE AND TRANSFER OF MYCOSEROSIC ACID FROM MAS ONTO THE DIOLS." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_217457.1" /db_xref="GI:15610078" /db_xref="GeneID:887454" /translation="MSVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQL YRRTLNVAQELSRCGSTGDRVVISAPQGLEYVVAFLGALQAGRIAVPLSVPQGGVTDE RSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLDAPNGYTFKED EYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYFADTDGIPPPNSALVSWLP FYHDMGLVIGICAPILGGYPAVLTSPVSFLQRPARWMHLMASDFHAFSAAPNFAFELA ARRTTDDDMAGRDLGNILTILSGSERVQAATIKRFADRFARFNLQERVIRPSYGLAEA TVYVATSKPGQPPETVDFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTC IECPDGTVGEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFV TDGKMFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAISVPGDRSTEKLVAIIE LKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAPGSIPITTSGKVRRGAC VEQYRQDQFARLDA" misc_feature 3284823..3284861 /gene="fadD28" /locus_tag="Rv2941" /note="PS00018 EF-hand calcium-binding domain" gene 3285070..3287832 /gene="mmpL7" /locus_tag="Rv2942" /db_xref="GeneID:887548" CDS 3285070..3287832 /gene="mmpL7" /locus_tag="Rv2942" /function="INVOLVED IN TRANSLOCATION OF PHTHIOCEROL DIMYCOCEROSATE (DIM) IN THE CELL WALL." /experiment="experimental evidence, no additional details recorded" /note="Rv2942, (MTCY24G1.07c), len: 920 aa. mmpL7, conserved transmembrane transport protein (see citations below), member of RND superfamily, highly similar to Q9XB10 HYPOTHETICAL 99.5 KDA PROTEIN from Mycobacterium bovis BCG (945 aa), FASTA scores: opt: 488, E(): 4.9e-20, (29.5% identity in 918 aa overlap); and to others from Mycobacteria e.g. O53735|MML4_MYCTU from Mycobacterium tuberculosis (945 aa), FASTA scores: opt: 481, E(): 1.2e-19, (25.9% identity in 922 aa overlap); etc. Also similar to other membrane proteins e.g. O54101|MMLB_STRCO|SC10A5.10c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (847 aa), FASTA scores: opt: 256, E(): 7.2e-07, (25.15% identity in 545 aa overlap); etc. Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site, PS00079 Multicopper oxidases signature 1, and PS00044 Bacterial regulatory proteins, lysR family signature. BELONGS TO THE MMPL FAMILY. Note that Rv2941|fadD28 and Rv2942|mmpL7 are transcriptionally coupled (proven experimentaly)." /codon_start=1 /transl_table=11 /product="transmembrane transport protein MmpL7" /protein_id="NP_217458.1" /db_xref="GI:15610079" /db_xref="GeneID:887548" /translation="MPSPAGRLHRIRYIRLKKSSPDCRATITSGSADGQRRSPRLTNL LVVAAWVAAAVIANLLLTFTQAEPHDTSPALLPQDAKTAAATSRIAQAFPGTGSNAIA YLVVEGGSTLEPQDQPYYDAAVGALRADTRHVGSVLDWWSDPVTAPLGTSPDGRSATA MVWLRGEAGTTQAAESLDAVRSVLRQLPPSEGLRASIVVPAITNDMPMQITAWQSATI VTVAAVIAVLLLLRARLSVRAAAIVLLTADLSLAVAWPLAAVVRGHDWGTDSVFSWTL AAVLTIGTITAATMLAARLGSDAGHSAAPTYRDSLPAFALPGACVAIFTGPLLLARTP ALHGVGTAGLGVFVALAASLTVLPALIALAGASRQLPAPTTGAGWTGRLSLPVSSASA LGTAAVLAICMLPIIGMRWGVAENPTRQGGAQVLPGNALPDVVVIKSARDLRDPAALI AINQVSHRLVEVPGVRKVESAAWPAGVPWTDASLSSAAGRLADQLGQQAGSFVPAVTA IKSMKSIIEQMSGAVDQLDSTVNVTLAGARQAQQYLDPMLAAARNLKNKTTELSEYLE TIHTWIVGFTNCPDDVLCTAMRKVIEPYDIVVTGMNELSTGADRISAISTQTMSALSS APRMVAQMRSALAQVRSFVPKLETTIQDAMPQIAQASAMLKNLSADFADTGEGGFHLS RKDLADPSYRHVRESMFSSDGTATRLFLYSDGQLDLAAAARAQQLEIAAGKAMKYGSL VDSQVTVGGAAQIAAAVRDALIHDAVLLAVILLTVVALASMWRGAVHGAAVGVGVLAS YLAALGVSIALWQHLLDRELNALVPLVSFAVLASCGVPYLVAGIKAGRIADEATGARS KGAVSGRGAVAPLAALGGVFGAGLVLVSGGSFSVLSQIGTVVVLGLGVLITVQRAWLP TTPGRR" misc_feature 3286072..3286104 /gene="mmpL7" /locus_tag="Rv2942" /note="PS00639 Eukaryotic thiol (cysteine) proteases histidine active site" misc_feature 3287392..3287469 /gene="mmpL7" /locus_tag="Rv2942" /note="PS00044 Bacterial regulatory proteins, lysR family signature" misc_feature 3287725..3287787 /gene="mmpL7" /locus_tag="Rv2942" /note="PS00079 Multicopper oxidases signature 1" repeat_region 3288463..3290504 /note="IS1533, len: 2042 bp. Minimum region corresponding to Insertion sequence IS1533." /mobile_element="insertion sequence:IS1533" gene 3288464..3289705 /locus_tag="Rv2943" /db_xref="GeneID:887834" CDS 3288464..3289705 /locus_tag="Rv2943" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1533." /note="Rv2943, (MTCY24G1.06c), len: 413 aa. Probable transposase for insertion sequence IS1533, similar to other transposases e.g. P15025|ISTA_ECOLI ista protein (insertion sequence IS21) from Escherichia coli (390 aa), FASTA scores: opt: 268, E(): 5.1e-11, (24.1% identity in 378 aa overlap). Contains potential helix-turn-helix motif at aa 19-40 (Score 1611, +4.67 SD)." /codon_start=1 /transl_table=11 /product="IS1533 transposase" /protein_id="NP_217459.1" /db_xref="GI:15610080" /db_xref="GeneID:887834" /translation="MLTVEDWAEIRRLHRAEGLPIKMIARVLGISKNTVKSALESNQQ PKYERAPQGSIVDAVEPRIRELLQAYPTMPATVIAERIGWERSIRVLSARVAELRPVY LPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVCAYSRWLLAML LPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRGGRSELTTECQAFRGTLAA KVLICRPADPEAKGLIERAHDYLERSFLPGRVFASPADFNAQLGAWLALVNTRTRRAL GCAPTDRIGADRAAMLSLPPVAPATGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVL VRADLERVHVFCDGELVADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQV QVRSLSDYDDALGVDIDGGVA" gene 3289705..3290235 /locus_tag="Rv2943A" /db_xref="GeneID:3205061" CDS 3289705..3290235 /locus_tag="Rv2943A" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv2943A, len: 176 aa. Possible transposase, similar to many e.g. AJ238712|MBO238712_2 PUTATIVE TRANSPOSASE (IS21-l) from Mycobacterium bovis BCG (266 aa), FASTA scores: opt: 762, E(): 0, (100.0% identity in 118 aa overlap). Possible frameshift after codon 118 i.e. near position 3290056, to fuse with Rv2944." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="YP_177680.1" /db_xref="GI:57117039" /db_xref="GeneID:3205061" /translation="MPTTKATQRRDVSTEIAYLTRALKAPTLRESVSRLADRARAENW SHEEYLAACLQREVSARESHGGEGRIRAARFPARKSLEEFDFEHARGLKRDTIAHLGT LDFITARDNVVFLGPAWHREDSSCGRPGDTRVSGRSSGAVRHRRRMGSTARRGSPRRA HLRRTHPALPLSAPGG" gene 3289790..3290506 /locus_tag="Rv2944" /db_xref="GeneID:887636" CDS 3289790..3290506 /locus_tag="Rv2944" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1533" /note="Rv2944, (MTCY24G1.05c), len: 238 aa. Possible transposase for IS1533, similar to IS-element proteins e.g. P15026|ISTB_ECOLI istb protein from Escherichia coli (265 aa), FASTA scores: opt: 475, E (): 1.6e-21, (48.0% identity in 148 aa overlap); Z95436|MTY15C10_14 from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 784, E(): 0, (87.4% identity in 135 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="IS1533 transposase" /protein_id="NP_217460.1" /db_xref="GI:15610081" /db_xref="GeneID:887636" /translation="MSQCPGWPIAPAPRTGATKNTWPPACSGKCQPGSPMVVRAASAP PASRLGSRWKSSTLSMLVASNATPSHIWAPWISSPPAITSCFWAPPGTGKTHLAVGLA IRACQAGHRVLFATAAEWVARLAEAHHAGRIYAELTRLCRYPLLVVDEVGYIPFEPEA ANLFFQLVSSRYERASLIVTSNKAFGRWGEVFGGDDVVAAAMIDRLVHHAEVVALKGD SYRLKDRDLGRVPPAGTTEE" misc_feature 3290051..3290074 /locus_tag="Rv2943A" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3290624..3291325) /gene="lppX" /locus_tag="Rv2945c" /db_xref="GeneID:887956" CDS complement(3290624..3291325) /gene="lppX" /locus_tag="Rv2945c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2945c, (MTCY24G1.04), len: 233 aa. Probable lppX, conserved lipoprotein, equivalent to Q9CD80 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (233 aa), FASTA scores: opt: 1165, E(): 2.1e-65, (76.4% identity in 233 aa overlap); and similar to Q9CCP6|ML0557 from Mycobacterium leprae (238 aa), FASTA scores: opt: 338, E(): 7.4e-14, (30.75% identity in 231 aa overlap). Also similar to others from Mycobacterium tuberculosis e.g. P71679|LPRG_MYCTU LIPOPROTEIN (236 aa), FASTA scores: opt: 342, E(): 4.1e-14, (32.05% identity in 231 aa overlap); etc. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site, and has in its N-terminal a signal peptide. BELONGS TO THE LPPX/LPRAFG FAMILY OF LIPOPROTEINS." /codon_start=1 /transl_table=11 /product="lipoprotein LppX" /protein_id="NP_217461.1" /db_xref="GI:15610082" /db_xref="GeneID:887956" /translation="MNDGKRAVTSAVLVVLGACLALWLSGCSSPKPDAEEQGVPVSPT ASDPALLAEIRQSLDATKGLTSVHVAVRTTGKVDSLLGITSADVDVRANPLAAKGVCT YNDEQGVPFRVQGDNISVKLFDDWSNLGSISELSTSRVLDPAAGVTQLLSGVTNLQAQ GTEVIDGISTTKITGTIPASSVKMLDPGAKSARPATVWIAQDGSHHLVRASIDLGSGS IQLTQSKWNEPVNVD" misc_feature complement(3291245..3291277) /gene="lppX" /locus_tag="Rv2945c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3291503..3296353) /gene="pks1" /locus_tag="Rv2946c" /db_xref="GeneID:888122" CDS complement(3291503..3296353) /gene="pks1" /locus_tag="Rv2946c" /function="POLYKETIDE SYNTHASE POSSIBLY INVOLVED IN LIPID SYNTHESIS" /note="Rv2946c, (MTCY24G1.03), len: 1616 aa. Probable pks1, polyketide synthase, similar to many e.g. ML035|AL583917|Q9CD81 putative polyketide synthase from Mycobacterium leprae (2103 aa), Fasta scores: opt: 8761, E(): 0, (82.6% identity in 1620 aa overlap); etc. Almost identical in part to G560507|Q50470 PKS002C protein from Mycobacterium tuberculosis (fragment) (950 aa), Fasta scores: opt: 5685, E(): 0, (95.3% identity in 927 aa overlap). Also similar to Mycobacterium tuberculosis polyketide synthases pks7|Rv1661|P94996 (2126 aa) (54.6% identity in 1632 aa); pks12|Rv2048c|O53490 (4151 aa) (58.0% identity in 1606 aa); pks8|rv1662|O65933 (1602 aa) (59.7% identity in 1144 aa). Contains a PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="polyketide synthase PKS1" /protein_id="NP_217462.1" /db_xref="GI:15610083" /db_xref="GeneID:888122" /translation="MISARSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEH RAVVVGASREQLIAGLAGLAAGEPGAGVAVGQPGSVGKTVVVFPGQGAQRIGMGRELY GELPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEFAQPALFAVEVASFAVL RDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLVVARGRLMQALPAGGAMVAVAAS EDEVEPLLGEGVGIAAINAPESVVISGAQAAANAIADRFAAQGRRVHQLAVSHAFHSP LMEPMLEEFARVAARVQAREPQLGLVSNVTGELAGPDFGSAQYWVDHVRRPVRFADSA RHLQTLGATHFIEAGPGSGLTGSIEQSLAPAEAMVVSMLGKDRPELASALGAAGQVFT TGVPVQWSAVFAGSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHALLGA VVERPDSDEVVLTGRLSLADQPWLADHVVNGVVLFPGAGFVELVIRAGDEVGCALIEE LVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYSRGDQSQGWLLNAEGMLGVAAAE TPMDLSVWPPEGAESVDISDGYAQLAERGYAYGPAFQGLVAIWRRGSELFAEVVAPGE AGVAVDRMGMHPAVLDAVLHALGLAVEKTQASTETRLPFCWRGVSLHAGGAGRVRARF ASAGADAISVDVCDATGLPVLTVRSLVTRPITAEQLRAAVTAAGGASDQGPLEVVWSP ISVVSGGANGSAPPAPVSWADFCAGSDGDASVVVWELESAGGQASSVVGSVYAATHTA LEVLQSWLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQAENPGRIVLID TDAAVDASVLAGVGEPQLLVRGGTVHAPRLSPAPALLALPAAESAWRLAAGGGGTLED LVIQPCPEVQAPLQAGQVRVAVAAVGVNFRDVVAALGMYPGQAPPLGAEGAGVVLETG PEVTDLAVGDAVMGFLGGAGPLAVVDQQLVTRVPQGWSFAQAAAVPVVFLTAWYGLAD LAEIKAGESVLIHAGTGGVGMAAVQLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGD SRTCEFEEKFLAVTEGRGVDVVLDSLAGEFVDASLRLLVRGGRFLEMGKTDIRDAQEI AANYPGVQYRAFDLSEAGPARMQEMLAEVRELFDTRELHRLPVTTWDVRCAPAAFRFM SQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARHLVGAYGVRHLVLASRR GDRAEGAAELAADLTEAGAKVQVVACDVADRAAVAGLFAQLSREYPPVRGVIHAAGVL DDAVITSLTPDRIDTVLRAKVDAAWNLHQATSDLDLSMFALCSSIAATVGSPGQGNYS AANAFLDGLAAHRQAAGLAGISLAWGLWEQPGGMTAHLSSRDLARMSRSGLAPMSPAE AVELFDAALAIDHPLAVATLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDAT SSKSALAQRLHGLAADEQLELLVGLVCLQAAAVLGRPSAEDVDPDTEFGDLGFDSLTA VELRNRLKTATGLTLPPTVIFDHPTPTAVAEYVAQQMSGSRPTESGDPTSQVVEPAAA EVSVHA" misc_feature complement(3291677..3291724) /gene="pks1" /locus_tag="Rv2946c" /note="PS00012 Phosphopantetheine attachment site" gene complement(3296350..3297840) /gene="pks15" /locus_tag="Rv2947c" /db_xref="GeneID:887291" CDS complement(3296350..3297840) /gene="pks15" /locus_tag="Rv2947c" /function="POLYKETIDE SYNTHASE POSSIBLY INVOLVED IN LIPID SYNTHESIS" /experiment="experimental evidence, no additional details recorded" /note="Rv2947c, (MTCY24G1.02), len: 496 aa. Probable pks15, polyketide synthase. Almost identical to G560508|Q50469 PKS002B protein from Mycobacterium tuberculosis (495 aa), FASTA scores: opt: 3270, E(): 0, (99.6% identity in 496 a a overlap). Similar to Mycobacterium tuberculosis proteins MTCY338.20|RV2931|PPSA_MYCTU ppsA phenolpthiocerol synthesis (1876 aa) (49.9% identity in 465 aa overlap); MTCY24G1.09|RV2940C|P96291 Putative mas, mycocerosic acid synthase (2111 aa) (50.2% identity in 454 aa overlap); and MTCY22H8.03|RV2382C|P71718 hypothetical protein (444 aa) (47.6% identity in 437 aa overlap). Contains PS00606 Beta-ketoacyl synthases active site." /codon_start=1 /transl_table=11 /product="polyketide synthase PKS15" /protein_id="NP_217463.1" /db_xref="GI:15610084" /db_xref="GeneID:887291" /translation="MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQR ATEPVAVVGIGCRFPGGVDGPDGLWDVVSAGRDVVSEFPTDRGWDVEGLYDPDPDAEG KTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEHAGIDPLSLRG SATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVSYVLGLQGPAVSVDTACSS SLVAIHWAMSSLRSGECDLALAGGVTVMGLPSIFVGFSRQRGLAADGRCKAFAAAADG TGWGEGAGVVVLERLSDARRLGHSVLAVVRGSAVNQDGASNGLTAPNGLAQQRVIQVA LANAGLSAADVDVVEAHGTATTLGDPIEAQALLSTYGQGGPAEQPLWVGSIKSNMGHT QAAAGVAGVIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRR AAVSSFGISGTNAHLILEEAPVPAPAEAPVEASESTGGRGRRWCRG" misc_feature complement(3297172..3297222) /gene="pks15" /locus_tag="Rv2947c" /note="PS00606 Beta-ketoacyl synthases active site" gene complement(3297837..3299954) /gene="fadD22" /locus_tag="Rv2948c" /db_xref="GeneID:887295" CDS complement(3297837..3299954) /gene="fadD22" /locus_tag="Rv2948c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_217464.1" /db_xref="GI:15610085" /db_xref="GeneID:887295" /translation="MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLG EVLRNRGLSSGDRVLLCLPDSPDLVQLLLACLARGVMAFLANPELHRDDHALAARNTE PALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYATYTSGTTGPP KAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAYGLGNSVWFPLATGGSAVI NSAPVTPEAAAILSARFGPSVLYGVPNFFARVIDSCSPDSFRSLRCVVSAGEALELGL AERLMEFFGGIPILDGIGSTEVGQTFVSNRVDEWRLGTLGRVLPPYEIRVVAPDGTTA GPGVEGDLWVRGPAIAKGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEV IGGVNVDPREVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDL HRGLLNRLSAFKVPHRFAVVDRLPRTPNGKLVRGALRKQSPTKPIWELSLTEPGSGVR AQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAVCAEAAKMLGEPDPWSV DQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPETVGWDYGSISGLAQYLEAELAGGHG RLKSAGPVNSGATGLWAIEEQLNKVEELVAVIADGEKQRVADRLRALLGTIAGSEAGL GKLIQAASTPDEIFQLIDSELGK" gene complement(3299971..3300570) /locus_tag="Rv2949c" /db_xref="GeneID:887206" CDS complement(3299971..3300570) /locus_tag="Rv2949c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2949c, (MTCY349.41), len: 199 aa. Conserved hypothetical protein, equivalent to Q9CD83|ML0133 HYPOTHETICAL PROTEIN from Mycobacterium leprae (210 aa), FASTA scores: opt: 797, E(): 7.4e-47, (62.55% identity in 195 aa overlap). Equivalent to AAK47348 from Mycobacterium tuberculosis strain CDC1551 (212 aa) but shorter 13 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217465.1" /db_xref="GI:15610086" /db_xref="GeneID:887206" /translation="MTECFLSDQEIRKLNRDLRILIAANGTLTRVLNIVADDEVIVQI VKQRIHDVSPKLSEFEQLGQVGVGRVLQRYIILKGRNSEHLFVAAESLIAIDRLPAAI ITRLTQTNDPLGEVMAASHIETFKEEAKVWVGDLPGWLALHGYQNSRKRAVARRYRVI SGGQPIMVVTEHFLRSVFRDAPHEEPDRWQFSNAITLAR" gene complement(3300596..3302344) /gene="fadD29" /locus_tag="Rv2950c" /db_xref="GeneID:887202" CDS complement(3300596..3302344) /gene="fadD29" /locus_tag="Rv2950c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_217466.2" /db_xref="GI:161352463" /db_xref="GeneID:887202" /translation="MSESSLADLLQKAASQYPNRAAYKFIDYDTDPAGFTETVTWWQV HRRAMIVAEELWIYASSGDRVAILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIHDE RISSALRDSAPSIILTTSSVIDEVTTYAPHACAAQGQSAPIVVAVDALDLSSSRALDP TRFERPSTAYLQYTSGSTRAPAGVVLSHKNVITNCVQLMSDYIGDSEKVPSTPVSWLP FYHDMGLMLGIILPMINQDTAVLMSPMAFLQRPARWMQLLAKHRAQISSAPNFGFELA VRRTSDDDMAGLDLGHVRTIVTGAERVNVATLRRFTERFAPFNLSETAIRPSYGLAEA TVYVATAGPGRAPKSVCFDYQQLSVGQAKRAENGSEGANLVSYGAPRASTVRIVDPET RMENPAGTVGEIWVQGDNVGLGYWRNPQQTEATFRARLVTPSPGTSEGPWLRTGDLGV IFEGELFITGRIKELLVVDGANHYPEDIEATIQEITGGRVVAIAVPDDRTEKLVTIIE LMKRGRTDEEEKNRLRTVKREVASAISRSHRLRVADVVMVAPGSIPVTTSGKVRRSAS VERYLHHEFSRLDAMA" gene complement(3303103..3304248) /locus_tag="Rv2951c" /db_xref="GeneID:887887" CDS complement(3303103..3304248) /locus_tag="Rv2951c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2951c, (MTCY349.39), len: 381 aa. Possible oxidoreductase (EC 1.-.-.-), equivalent to Q9CD85 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (382 aa), FASTA scores: opt: 2225, E(): 7.6e-134, (84.8% identity in 382 aa overlap); and similar to O30260 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (363 aa), FASTA scores: opt: 652, E(): 6.1e-34, (32.55% identity in 344 aa overlap). Also similar to various oxidoreductases e.g. O29071|AF1196 N5,N10-METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE from Archaeoglobus fulgidus (348 aa), FASTA scores: opt: 381, E(): 9.7e-17, (27.7% identity in 354 aa overlap); Q58929|MER|MJ1534 F420-DEPENDENT METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE (EC 1.5.99.-) from Methanococcus jannaschii (331 aa), FASTA scores: opt: 372, E(): 3.5e-16, (30.85% identity in 295 aa overlap); Q9UXP0 PUTATIVE F420-DEPENDENT N5,N10-METHYLENE-TETRAHYDROMETHANOPTERIN REDUCTASE from Methanolobus tindarius (326 aa), FASTA scores: opt: 343, E(): 2.4e-14, (27.4% identity in 314 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217467.1" /db_xref="GI:15610088" /db_xref="GeneID:887887" /translation="MGGLRFGFVDALVHSRLPPTLPARSSMAAATVMGADSYWVGDHL NALVPRSIATSEYLGIAAKFVPKIDANYEPWTMLGNLAFGLPSRLRLGVCVTDAGRRN PAVTAQAAATLHLLTRGRAILGIGVGEREGNEPYGVEWTKPVARFEEALATIRALWNS NGELISRESPYFPLHNALFDLPPYRGKWPEIWVAAHGPRMLRATGRYADAWIPIVVVR PSDYSRALEAVRSAASDAGRDPMSITPAAVRGIITGRNRDDVEEALESVVVKMTALGV PGEAWARHGVEHPMGADFSGVQDIIPQTMDKQTVLSYAAKVPAALMKEVVFSGTPDEV IDQVAEWRDHGLRYVVLINGSLVNPSLRKTVTAVLPHAKVLRGLKKL" gene 3304441..3305253 /locus_tag="Rv2952" /db_xref="GeneID:887311" CDS 3304441..3305253 /locus_tag="Rv2952" /EC_number="2.1.1.-" /function="THOUGHT TO CAUSE METHYLATION." /experiment="experimental evidence, no additional details recorded" /note="Rv2952, (MTCY349.38), len: 270 aa. Probable methyltransferase (EC 2.1.1.-), equivalent to Q9CD86|ML0130 HYPOTHETICAL PROTEIN from Mycobacterium leprae (270 aa), FASTA scores: opt: 1584, E(): 6.1e-99, (83.7% identity in 270 aa overlap). Also highly similar to Q9RMN9|MTF2 PUTATIVE METHYLTRANSFERASE from Mycobacterium smegmatis (274 aa), FASTA scores: opt: 902, E(): 3.8e-53, (56.35% identity in 252 aa overlap). Also similar to other methyltransferases e.g. Q9ADL4|SORM O-METHYLTRANSFERASE from Polyangium cellulosum (346 aa), FASTA scores: opt: 390, E(): 1.1e-18, (36.25% identity in 251 aa overlap); Q54303|RAPM METHYLTRANSFERASE from Streptomyces hygroscopicus (317 aa), FASTA scores: opt: 315, E(): 1.1e-13, (40.75% identity in 135 aa overlap); etc. Very similar to C-terminal part of Q50584|Rv1523|MTCY19G5.05c HYPOTHETICAL 37.9 KDA PROTEIN from Mycobacterium tuberculosis (358 aa), FASTA score: opt: 965, E(): 2.7e-57, (60.3% identity in 247 aa overlap)." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_217468.1" /db_xref="GI:15610089" /db_xref="GeneID:887311" /translation="MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINW AYEEDPPMDLPLEASDEPNRAHINLYHRTATQVDLGGKQVLEVSCGHGGGASYLTRTL HPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVLNVEASHCYPH FRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATPLRQLSQRQINAEVLRGIG NNSQKSRDLVDRHLPAFLRFAGREFIGVQGTQLSRYLEGGELSYRMYCFTKD" gene 3305279..3306535 /locus_tag="Rv2953" /db_xref="GeneID:887772" CDS 3305279..3306535 /locus_tag="Rv2953" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2953, (MTCY349.37c), len: 418 aa. Conserved hypothetical protein, equivalent to Q9CD87|ML0129 HYPOTHETICAL PROTEIN from Mycobacterium leprae (418 aa), FASTA scores: opt: 2357, E(): 2.7e-143, (86.6% identity in 418 aa overlap). Also highly similar to Q9X7N5|SC5F2A.12c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (396 aa), FASTA scores: opt: 491, E(): 7e-24, (38.35% identity in 417 aa overlap); and similar to other hypothetical proteins e.g. Q9VG81 CG5167 PROTEIN from Drosophila melanogaster (Fruit fly) (431 aa), FASTA scores: opt: 393, E(): 1.4e-17, (26.55% identity in 433 aa overlap); Q9GZE9|F22F7.1 HYPOTHETICAL PROTEIN from Caenorhabditis elegans (426 aa), FASTA scores: opt: 338, E(): 4.6e-14, (27.05% identity in 425 aa overlap); P73855|SLL1601 HYPOTHETICAL 44.8 KDA PROTEIN from Synechocystis sp. (strain PCC 6803) (414 aa), FASTA scores: opt: 565, E(): 1.3e-28, (35.7% identity in 409 aa overlap); etc. Also highly similar to other proteins from Mycobacterium tuberculosis e.g. RV2449C|O53176|MTV008.05C HYPOTHETICAL 44.4 KDA PROTEIN (419 aa), FASTA scores: opt: 1835, E(): 7e-110, (67.55% identity in 419 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217469.1" /db_xref="GI:15610090" /db_xref="GeneID:887772" /translation="MSPAEREFDIVLYGATGFSGKLTAEHLAHSGSTARIALAGRSSE RLRGVRMMLGPNAADWPLILADASQPLTLEAMAARAQVVLTTVGPYTRYGLPLVAACA KAGTDYADLTGELMFCRNSIDLYHKQAADTGARIILACGFDSIPSDLNVYQLYRRSVE DGTGELCDTDLVLRSFSQRWVSGGSVATYSEAMRTASSDPEARRLVTDPYTLTTDRGA EPELGAQPDFLRRPGRDLAPELAGFWTGGFVQAPFNTRIVRRSNALQEWAYGRRFRYS ETMSLGKSMAAPILAAAVTGTVAGTIGLGNKYFDRLPRRLVERVTPKPGTGPSRKTQE RGHYTFETYTTTTTGARYRATFAHNVDAYKSTAVLLAQSGLALALDRDRLAELRGVLT PAAAMGDALLARLPGAGVVMGTTRLS" gene complement(3306666..3307391) /locus_tag="Rv2954c" /db_xref="GeneID:887214" CDS complement(3306666..3307391) /locus_tag="Rv2954c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2954c, (MTCY349.36), len: 241 aa. Hypothetical unknown protein. Equivalent to AAK47354 from Mycobacterium tuberculosis strain CDC1551 (199 aa) but longer 42 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217470.1" /db_xref="GI:15610091" /db_xref="GeneID:887214" /translation="MRLPGMLRPTAERHFHSIFYLRHNARRQEHLATLGLDLGNKSVL EVGAGIGDHTQFFLDRGCKVLCTEPRGENLDVIRQRFGSNPNVTVDHLDLDGDLPAEA HQYDVVYCYGVLYHLSRPAEALAWMCDRAVDLLLLETCVSYSGEDEPFLVSERASSPS QAITGTGCRPSRVWVMNRLREKMPHVYVTATQPRHRQFPLDWRANGPIASTGLARAVF VASRAPLNLPTLVEELPMVQRRC" gene complement(3307580..3308545) /locus_tag="Rv2955c" /db_xref="GeneID:887318" CDS complement(3307580..3308545) /locus_tag="Rv2955c" /function="UNKNOWN" /note="Rv2955c, (MTCY349.34), len: 321 aa. Conserved hypothetical protein, similar to others e.g. Q98NV5|MLL9724 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (284 aa), FASTA scores: opt: 231, E(): 6.5e-08, (34.6% identity in 182 aa overlap); Q9AGG2|NLPE1 NLPE1 from Rhizobium etli (249 aa), FASTA scores: opt: 212, E(): 1.1e-06, (27.85% identity in 255 aa overlap); Q9KXY2 HYPOTHETICAL 31.3 KDA PROTEIN from Streptomyces coelicolor(291 aa), FASTA scores: opt: 211, E(): 1.4e-06, (30.9% identity in 249 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217471.1" /db_xref="GI:15610092" /db_xref="GeneID:887318" /translation="MQFQDVRLMRVVVCRRLGPAKGQRRWRPLDLGTTGCFENLGAQR PTYRMRAIRMLECAMPNRLVRSLQRWRPFGLPPHRWRLAPWYWRGLQVTLEPGSAIAW IVRLTGGFEETEIDIAAALYSALYPDRCILDVGANVGIHSLAWARLAPVVALEPAPGT HSRLEANVAANGLQDRIRTLRTAAGDAVGEVDFFVAADSAFSSLNDTGRIRIRERTRV PCTTLDALAAELPLPVGLLKIDVEGLERAVIAGAAELLRRDRPVLLVEIYGGAASNPD PERTIADIRAYGYEPFVYADDAGLQPYQRHRDDRYCYFFIPSRKG" gene 3308668..3309399 /locus_tag="Rv2956" /db_xref="GeneID:887486" CDS 3308668..3309399 /locus_tag="Rv2956" /function="UNKNOWN" /note="Rv2956, (MTCY349.33c), len: 243 aa. Conserved hypothetical protein, highly similar to O86299|GSC GSC PROTEIN from Mycobacterium avium subsp. silvaticum Mycobacterium avium (240 aa), FASTA scores: opt: 1070, E(): 3.5e-63, (67.5% identity in 240 aa overlap); and O86294|GSC GSC PROTEIN from Mycobacterium paratuberculosis (240 aa), FASTA scores: opt: 1070, E(): 3.5e-63, (67.5% identity in 240 aa overlap). Also some similarity with other proteins from other organisms e.g. Q9L727 NODULATION PROTEIN NOEI from Rhizobium fredii (Sinorhizobium fredii) (241 aa), FASTA scores: opt: 205, E(): 3.5e-06, (27.25% identity in 198 aa overlap); Q9AGG1|LPEA LPEA PROTEIN from Rhizobium etli (286 aa), FASTA scores: opt: 201, E(): 7.2e-06, (28.85% identity in 208 aa overlap); P74191|SLL1173 HYPOTHETICAL 28.0 KDA PROTEIN Synechocystis sp. (strain PCC 6803) (244 aa), FASTA scores: opt: 274, E(): 1e-10, (30.65% identity in 225 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. P71792|RV1513|MTCY277.35 HYPOTHETICAL 26.7 KDA PROTEIN (243 aa), FASTA scores: opt: 1105, E(): 1.7e-65, (70.05% identity in 237 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217472.1" /db_xref="GI:15610093" /db_xref="GeneID:887486" /translation="MKSLKLARFIARSAAFEVSRRYSERDLKHQFVKQLKSRRVDVVF DVGANSGQYAAGLRRAAYKGRIVSFEPLSGPFTILESKASTDPLWDCRQHALGDSDGT VTINIAGNAGQSSSVLPMLKSHQNAFPPANYVGTQEASIHRLDSVAPEFLGMNGVAFL KVDVQGFEKQVLAGGKSTIDDHCVGMQLELSFLPLYEGGMLIPEALDLVYSLGFTLTG LLPCFIDANNGRMLQADGIFFREDD" gene 3309470..3310297 /locus_tag="Rv2957" /db_xref="GeneID:887258" CDS 3309470..3310297 /locus_tag="Rv2957" /EC_number="2.4.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2957, (MTCY349.31c), len: 275 aa. Possible glycosyl transferase (EC 2.4.1.-); possibly secreted protein. Highly similar to O88109|GSD|GTFD GSD PROTEIN from Mycobacterium avium subsp. silvaticum, Mycobacterium paratuberculosis, and Mycobacterium avium (266 aa), FASTA scores: opt: 1010, E(): 2.5e-62, (68.8% identity in 221 aa overlap). Also some similarity with other proteins and especially glycosyl transferases e.g. Q9AEE4 HYPOTHETICAL 31.4 KDA PROTEIN from Leptospira interrogans (265 aa), FASTA scores: opt: 371, E(): 3.3e-18, (34.43% identity in 212 aa overlap); Q9EXY4 PUTATIVE GLYCOSYL TRANSFERASE from Escherichia coli (248 aa), FASTA scores: opt: 339, E(): 5e-16, (32.4% identity in 210 aa overlap); Q9RCC4 GLYCOSYLTRANSFERASE-LIKE PROTEIN from Yersinia pestis (247 aa), FASTA scores: opt: 333, E(): 1.3e-15, (31.8% identity in 217 aa overlap); Q9EXY1 PUTATIVE GLYCOSYL TRANSFERASE from Escherichia coli (248 aa), FASTA scores: opt: 328, E(): 2.9e-15, (31.9% identity in 210 aa overlap); etc. Equivalent to AAK47357 from Mycobacterium tuberculosis strain CDC1551 (256 aa) but longer 19 aa." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="NP_217473.1" /db_xref="GI:15610094" /db_xref="GeneID:887258" /translation="MVQTKRYAGLTAANTKKVAMAAPMFSIIIPTLNVAAVLPACLDS IARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGVDLA TGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRL LFKRNICHQAIFYRRGLFGTIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASY NEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVISRAMVMRTVISWRRRR" gene complement(3310714..3312000) /locus_tag="Rv2958c" /db_xref="GeneID:887816" CDS complement(3310714..3312000) /locus_tag="Rv2958c" /EC_number="2.4.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM. POSSIBLY INVOLVED IN RESISTANCE TO KILLING BY HUMAN MACROPHAGES." /experiment="experimental evidence, no additional details recorded" /note="Rv2958c, (MTCY349.30), len: 428 aa. Possible glycosyl transferase (EC 2.4.1.-) (see citation below), highly similar to Q9CD88|ML0128 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (435 aa), FASTA scores: opt: 2116, E(): 5.8e-126, (75.05% identity in 417 aa overlap); and Q9CD91|ML0125 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (438 aa), FASTA scores: opt: 2104, E(): 3.3e-125, (74.65% identity in 418 aa overlap). Also shows some similarity to variety of glycosyl transferases e.g. Q9RYI3 PUTATIVE GLYCOSYLTRANSFERASE from Deinococcus radiodurans (418 aa), FASTA scores: opt: 317, E(): 1.9e-12, (31.0% identity in 297 aa overlap); Q9S1V2 PUTATIVE GLYCOSYL TRANSFERASE from Streptomyces coelicolor (407 aa), FASTA scores: opt: 264, E(): 4.1e-09, (27.2% identity in 342 aa overlap); P72650|CRTX|SLR1125 ZEAXANTHIN GLUCOSYL TRANSFERASE from Synechocystis sp. strain PCC 6803 (419 aa), FASTA scores: opt: 251, E(): 2.8e-08, (26.8% identity in 295 aa overlap); etc. Very similar to P95130|MTCY349.25 from Mycobacterium tuberculosis (449 aa), FASTA score: opt: 2215, E(): 3.3e-132, (77.25% identity in 422 aa overlap)." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="NP_217474.1" /db_xref="GI:15610095" /db_xref="GeneID:887816" /translation="MEETSVAGDPGPDAGTSTAPNAAPEPVARRQRILFVGEAATLAH VVRPFVLARSLDPSRYEVHFACDPRFNKLLGPLPFPHHPIHTVPSEEVLLKIAQGRLF YNTRTLRKYIAADRKILNEIAPDVVVGDNRLSLSVSARLAGIPYIAIANAYWSPQARR RFPLPDVPWTRFFGVRPVSILYRLYRPLIFALYCLPLNWLRRKHGLSSLGWDLCRIFT DGDYTLYADVPELVPTYNLPANHRYLGPVLWSPDVKPPTWWHSLPTDRPIIYATLGSS GGKNLLQVVLNALADLPVTVIAATAGRNHLKNVPANAFVADYLPGEAAAARSAVVLCN GGSPTTQQALAAGVPVIGLPSNMDQHLNMEALERAGAGVLLRTERLNTEGVAAAVKQV LSGAEFRQAARRLAEAFGPDFAGFPQHIESALRLVC" gene complement(3312101..3312838) /locus_tag="Rv2959c" /db_xref="GeneID:887862" CDS complement(3312101..3312838) /locus_tag="Rv2959c" /EC_number="2.1.1.-" /function="THOUGHT TO CAUSE METHYLATION." /note="Rv2959c, (MTCY349.29), len: 245 aa. Possible methyltransferase (EC 2.1.1.-), highly similar to Q9CD89|ML0127 from Mycobacterium leprae (229 aa), FASTA scores: opt: 1183, E(): 3.9e-69, (76.1% identity in 226 aa overlap). Also some similarity with other methyltransferases and other proteins e.g. Q51079 PUTATIVE METHYL TRANSFERASE from Nocardia lactamdurans (236 aa), FASTA scores: opt: 156, E(): 0.0086, (23.25% identity in 159 aa overlap); Q98ID5 CEPHALOSPORIN HYDROXYLASE from Rhizobium loti (Mesorhizobium loti) (217 aa), FASTA scores: opt: 275, E(): 1.7e-10, (29.65% identity in 199 aa overlap); etc. And also similar to P72897 HYPOTHETICAL 27.8 KDA PROTEIN from Mycobacterium tuberculosis (249 aa), FASTA scores: opt: 292, E(): 1.5e-11, (31.25% identity in 208 aa overlap)." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_217475.1" /db_xref="GI:15610096" /db_xref="GeneID:887862" /translation="MGLVWRSRTSLVGQLIGLVRLVASFAAQLFYRPSDAVAEEYHKW YYGNLVWTKTTYMGINCWKSVSDMWNYQEILSELQPSLVIEFGTRYGGSAVYFANIMR QIGQPFKVLTVDNSHKALDPRARREPDVLFVESSSTDPAIAEQIQRLKNEYPGKIFAI LDSDHSMNHVLAEMKLLRPLLSAGDYLVVEDSNINGHPVLPGFGPGPYEAIEAYEDEF PNDYKHDAERENKFGWTSAPNGFLIRN" gene complement(3312953..3313201) /locus_tag="Rv2960c" /db_xref="GeneID:887224" CDS complement(3312953..3313201) /locus_tag="Rv2960c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2960c, (MT3036, MTCY349.28), len: 82 aa. Hypothetical unknown protein, equivalent to AAK47362 from Mycobacterium tuberculosis strain CDC1551 (116 aa) but shorter 34 aa. Shortened version of MTCY349.28 avoiding overlap." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217476.1" /db_xref="GI:15610097" /db_xref="GeneID:887224" /translation="MGRNATAVVSLPVVALSPRAGQAGYLWQSITRGLRVTPICCYHP PCGGGVQKMLSRKLGRVCPAPSPKDAARGAHNVGANAV" gene 3313283..3313672 /locus_tag="Rv2961" /db_xref="GeneID:887316" CDS 3313283..3313672 /locus_tag="Rv2961" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv2961, (MTCY349.26c), len: 129 aa. Probable transposase, highly similar to C-terminus of O50414|Rv3387|MTV004.45 PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis (225 aa), FASTA scores: opt: 605, E(): 7.2e-34, (66.65% identity in 129 aa overlap); and similar to others e.g. CAC47401 PUTATIVE PARTIAL TRANSPOSASE FOR ISRM17 PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (174 aa), FASTA scores: opt: 183, E(): 2.6e-05, (30.25% identity in 129 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217477.1" /db_xref="GI:15610098" /db_xref="GeneID:887316" /translation="MEHGNPHDAPQLAPAVERITTRAGRPPGTVTADRGYGEKRVEDD LHDLGVRTVAIPRKGRPSQARRAEEQRPSFRRTVKWRTGSEGRISTLKRNYGWNRSCI DGTEGTRIWTRHGILTHNLIKISSLAA" gene complement(3313773..3315122) /locus_tag="Rv2962c" /db_xref="GeneID:887892" CDS complement(3313773..3315122) /locus_tag="Rv2962c" /EC_number="2.4.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM. POSSIBLY INVOLVED IN RESISTANCE TO KILLING BY HUMAN MACROPHAGES." /experiment="experimental evidence, no additional details recorded" /note="Rv2962c, (MTCY349.25), len: 508 aa. Possible glycosyl transferase (EC 2.4.1.-) (see citation below), highly similar or identical to Mycobacterium tuberculosis proteins G560522 U0002JA, G560521 U0002H, G560522 U0002JA, G560519 U0002KA. Equivalent (but longer 21 aa) to Q9CD91 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (438 aa), FASTA scores: opt: 2229, E(): 1.3e-133, (77.45% identity in 426 aa overlap); and highly similar to Q9CD88 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (435 aa), FASTA scores: opt: 2129, E(): 2.7e-127, (74.35% identity in 425 aa overlap); and others from Mycobacterium leprae. Also shows some similarity to variety of glycosyl transferases e.g. Q9RYI3|DRA0329 PUTATIVE GLYCOSYL TRANSFERASE from Deinococcus radiodurans (418 aa), FASTA scores: opt: 340, E(): 5.5e-14, (31.2% identity in 330 aa overlap); P72650 ZEAXANTHIN GLUCOSYL TRANSFERASE from Synechocystis sp. (strain PCC 6803) (419 aa), FASTA scores: opt: 244, E(): 6.6e-08, (26.2% identity in 294 aa overlap); etc. Also highly similar to P95134 HYPOTHETICAL 46.8 KDA PROTEIN from Mycobacterium tuberculosis (428 aa), FASTA scores: opt: 2215, E(): 9.6e-133, (77.25% identity in 422 aa overlap)." /codon_start=1 /transl_table=11 /product="glycosyl transferase" /protein_id="NP_217478.1" /db_xref="GI:15610099" /db_xref="GeneID:887892" /translation="MRVSCVYATASRWGGPPVASEVRGDAAISTTPDAAPGLAARRRR ILFVAEAVTLAHVVRPFALAQSLDPSRYEVHFACDPRYNQLLGPLPFRHHAIHTIPSE RFFGNLTQGRFYAMRTLRKYVEADLRVLDEIAPDLVVGDLRISLSVSARLAGIPYIAI ANAYWSPYAQRRFPLPDVIWTRLFGVRLVKLLYRLERPLLFALQCMPLNWVRRRHGLS SLGWNLCRIFTDGDHTLYADVPELMPTYDLPANHEYLGPVLWSPAGKPPTWWDSLPTD RPIVYATLGTSGGRNLLQLVLNALAELPVTVIAATAGRSDLKTVPANAFVADYLPGEA AAARSAVVVCNGGSLTTQQALVAGVPVIGVAGNLDQHLNMEAVERAGAGVLLRTERLK SQRVAGAVMQVISRSEYRQAAARLADAFGRDRVGFPQHVENALRLMPENRPRTWLAS" gene 3315236..3316456 /locus_tag="Rv2963" /db_xref="GeneID:887263" CDS 3315236..3316456 /locus_tag="Rv2963" /function="UNKNOWN" /note="Rv2963, (MTCY349.24c), len: 406 aa. Probable integral membrane protein." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217479.1" /db_xref="GI:15610100" /db_xref="GeneID:887263" /translation="MTSTKVEDRVTAAVLGAIGHALALTASMTWEILWALILGFALSA VVQAVVRRSTIVTLLGDDRPRTLVIATGLGAASSSCSYAAVALARSLFRKGANFTAAM AFEIGSTNLVVELGIILALLMGWQFTAAEFVGGPIMILVLAVLFRLFVGARLIDAARE QAERGLAGSMEGHAAMDMSIKREGSFWRRLLSPPGFTSIAHVFVMEWLAILRDLILGL LIAGAIAAWVPESFWQSFFLANHPAWSAVWGPIIGPIVAIVSFVCSIGNVPLAAVLWN GGISFGGVIAFIFADLLILPILNIYRKYYGARMMLVLLGTFYASMVVAGYLIELLFGT TNLIPSQRSATVMTAEISWNYTTWLNVIFLVIAAALVVRFITSGGLPMLRMMGGSPDA PHDHHDRHDDHLGH" gene 3316529..3317461 /gene="purU" /locus_tag="Rv2964" /db_xref="GeneID:887338" CDS 3316529..3317461 /gene="purU" /locus_tag="Rv2964" /EC_number="3.5.1.10" /function="INVOLVED IN DE NOVO PURINE BIOSYNTHESIS [CATALYTIC ACTIVITY: 10-FORMYLTETRAHYDROFOLATE + H(2)O = FORMATE + TETRAHYDROFOLATE]." /note="produces formate from formyl-tetrahydrofolate which is the major source of formate for PurT in de novo purine nucleotide biosynthesis; has a role in one-carbon metabolism; forms a homohexamer; activated by methionine and inhibited by glycine" /codon_start=1 /transl_table=11 /product="formyltetrahydrofolate deformylase" /protein_id="NP_217480.1" /db_xref="GI:15610101" /db_xref="GeneID:887338" /translation="MGKGSMTAHATPNEPDYPPPPGGPPPPADIGRLLLRCHDRPGII AAVSTFLARAGANIISLDQHSTAPEGGTFLQRAIFHLPGLTAAVDELQRDFGSTVADK FGIDYRFAEAAKPKRVAIMASTEDHCLLDLLWRNRRGELEMSVVMVIANHPDLAAHVR PFGVPFIHIPATRDTRTEAEQRQLQLLSGNVDLVVLARYMQILSPGFLEAIGCPLINI HHSFLPAFTGAAPYQRARERGVKLIGATAHYVTEVLDEGPIIEQDVVRVDHTHTVDDL VRVGADVERAVLSRAVLWHCQDRVIVHHNQTIVF" gene complement(3318330..3318815) /gene="coaD" /locus_tag="Rv2965c" /db_xref="GeneID:888423" CDS complement(3318330..3318815) /gene="coaD" /locus_tag="Rv2965c" /EC_number="2.7.7.3" /function="INVOLVED IN THE COENZYME A (CoA) BIOSYNTHESIS (AT THE FOURTH STEP). REVERSIBLY TRANSFERS AN ADENYLYL GROUP FROM ATP TO 4'-PHOSPHOPANTETHEINE, YIELDING DEPHOSPHO-CoA (DPCOA) AND PYROPHOSPHATE [CATALYTIC ACTIVITY: ATP + PANTETHEINE 4'-PHOSPHATE = DIPHOSPHATE + DEPHOSPHO-CoA]." /note="Catalyzes the conversion of ATP and pantetheine 4'-phosphate to diphosphate and 3'-dephospho-coA" /codon_start=1 /transl_table=11 /product="phosphopantetheine adenylyltransferase" /protein_id="NP_217481.1" /db_xref="GI:15610102" /db_xref="GeneID:888423" /translation="MTGAVCPGSFDPVTLGHVDIFERAAAQFDEVVVAILVNPAKTGM FDLDERIAMVKESTTHLPNLRVQVGHGLVVDFVRSCGMTAIVKGLRTGTDFEYELQMA QMNKHIAGVDTFFVATAPRYSFVSSSLAKEVAMLGGDVSELLPEPVNRRLRDRLNTER T" repeat_region complement(3318835..3318889) /note="55 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene complement(3318901..3319467) /locus_tag="Rv2966c" /db_xref="GeneID:888469" CDS complement(3318901..3319467) /locus_tag="Rv2966c" /EC_number="2.1.1.-" /function="THOUGHT TO CAUSE METHYLATION." /note="Rv2966c, (MTCY349.21), len: 188 aa. Possible methyltransferase (EC 2.1.1.-), equivalent (but shorter 36 aa) to O69465|MLCB1243.09 HYPOTHETICAL 23.0 KDA PROTEIN from Mycobacterium leprae (220 aa), FASTA scores: opt: 872, E(): 9.1e-50, (74.2% identity in 182 aa overlap). Also similar to others e.g. Q9ZBR2|SC7A1.11 PUTATIVE METHYLASE from Streptomyces coelicolor (195 aa), FASTA scores: opt: 510, E(): 3.7e-26, (47.5% identity in 179 aa overlap); Q9F842 HYPOTHETICAL METHYLTRANSFERASE (FRAGMENT) from Mycobacterium smegmatis (80 aa), FASTA scores: opt: 386, E(): 2.5e-18, (75.0% identity in 80 aa overlap); P10120|YHHF_ECOLI|YHHFZ|B3465 PUTATIVE METHYLASE from Escherichia colistrain K12 (198 aa), FASTA scores: opt: 319, E(): 1.1e-13, (35.5% identity in 183 aa overlap); etc. Contains PS00092 N-6 Adenine-specific DNA methylases signature." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_217482.1" /db_xref="GI:15610103" /db_xref="GeneID:888469" /translation="MTRIIGGVAGGRRIAVPPRGTRPTTDRVRESLFNIVTARRDLTG LAVLDLYAGSGALGLEALSRGAASVLFVESDQRSAAVIARNIEALGLSGATLRRGAVA AVVAAGTTSPVDLVLADPPYNVDSADVDAILAALGTNGWTREGTVAVVERATTCAPLT WPEGWRRWPQRVYGDTRLELAERLFANV" misc_feature complement(3319102..3319122) /locus_tag="Rv2966c" /note="PS00092 N-6 Adenine-specific DNA methylases signature" repeat_region complement(3319468..3319568) /note="101 bp Mycobacterial Interspersed Repetitive Unit, Class III" repeat_region complement(3319569..3319666) /note="98 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene complement(3319663..3323046) /gene="pca" /locus_tag="Rv2967c" /db_xref="GeneID:887299" CDS complement(3319663..3323046) /gene="pca" /locus_tag="Rv2967c" /EC_number="6.4.1.1" /function="INVOLVED IN GLUCONEOGENESIS AND LIPOGENESIS. PYRUVATE CARBOXYLASE CATALYZES A 2-STEP REACTION, INVOLVING THE ATP-DEPENDENT CARBOXYLATION OF THE COVALENTLY ATTACHED BIOTIN IN THE FIRST STEP AND THE TRANSFER OF THE CARBOXYL GROUP TO PYRUVATE IN THE SECOND [CATALYTIC ACTIVITY: ATP + PYRUVATE + HCO(3)(-) = ADP + PHOSPHATE + OXALOACETATE]." /note="biotin-containing enzyme that catalyzes a two step carboxylation of pyruvate to oxaloacetate" /codon_start=1 /transl_table=11 /product="pyruvate carboxylase" /protein_id="NP_217483.1" /db_xref="GI:15610104" /db_xref="GeneID:887299" /translation="MFSKVLVANRGEIAIRAFRAAYELGVGTVAVYPYEDRNSQHRLK ADESYQIGDIGHPVHAYLSVDEIVATARRAGADAIYPGYGFLSENPDLAAACAAAGIS FVGPSAEVLELAGNKSRAIAAAREAGLPVLMSSAPSASVDELLSVAAGMPFPLFVKAV AGGGGRGMRRVGDIAALPEAIEAASREAESAFGDPTVYLEQAVINPRHIEVQILADNL GDVIHLYERDCSVQRRHQKVIELAPAPHLDAELRYKMCVDAVAFARHIGYSCAGTVEF LLDERGEYVFIEMNPRVQVEHTVTEEITDVDLVASQLRIAAGETLEQLGLRQEDIAPH GAALQCRITTEDPANGFRPDTGRISALRTAGGAGVRLDGSTNLGAEISPYFDSMLVKL TCRGRDLPTAVSRARRAIAEFRIRGVSTNIPFLQAVLDDPDFRAGRVTTSFIDERPQL LTARASADRGTKILNFLADVTVNNPYGSRPSTIYPDDKLPDLDLRAAPPAGSKQRLVK LGPEGFARWLRESAAVGVTDTTFRDAHQSLLATRVRTSGLSRVAPYLARTMPQLLSVE CWGGATYDVALRFLKEDPWERLATLRAAMPNICLQMLLRGRNTVGYTPYPEIVTSAFV QEATATGIDIFRIFDALNNIESMRPAIDAVRETGSAIAEVAMCYTGDLTDPGEQLYTL DYYLKLAEQIVDAGAHVLAIKDMAGLLRPPAAQRLVSALRSRFDLPVHLHTHDTPGGQ LASYVAAWHAGADAVDGAAAPLAGTTSQPALSSIVAAAAHTEYDTGLSLSAVCALEPY WEALRKVYAPFESGLPGPTGRVYHHEIPGGQLSNLRQQAIALGLGDRFEEIEEAYAGA DRVLGRLVKVTPTSKVVGDLALALVGAGVSADEFASDPARFGIPESVLGFLRGELGDP PGGWPEPLRTAALAGRGAARPTAQLAADDEIALSSVGAKRQATLNRLLFPSPTKEFNE HREAYGDTSQLSANQFFYGLRQGEEHRVKLERGVELLIGLEAISEPDERGMRTVMCIL NGQLRPVLVRDRSIASAVPAAEKADRGNPGHIAAPFAGVVTVGVCVGERVGAGQTIAT IEAMKMEAPITAPVAGTVERVAVSDTAQVEGGDLLVVVS" misc_feature complement(3319747..3319800) /gene="pca" /locus_tag="Rv2967c" /note="PS00188 Biotin-requiring enzymes attachment site" misc_feature complement(3320926..3320967) /gene="pca" /locus_tag="Rv2967c" /note="PS00165 Serine/threonine dehydratases pyridoxal-phosphate attachment site" misc_feature complement(3322168..3322191) /gene="pca" /locus_tag="Rv2967c" /note="PS00867 Carbamoyl-phosphate synthase subdomain signature 2" gene complement(3323071..3323703) /locus_tag="Rv2968c" /db_xref="GeneID:888476" CDS complement(3323071..3323703) /locus_tag="Rv2968c" /function="UNKNOWN" /note="Rv2968c, (MTCY349.19), len: 210 aa. Probable conserved integral membrane protein, equivalent to O69464 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (214 aa), FASTA scores: opt: 1060, E(): 1.4e-58, (71.95% identity in 214 aa overlap). Also highly similar to others e.g. Q9F844 HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN from Mycobacterium smegmatis (187 aa), FASTA scores: opt: 883, E(): 1.2e-47, (62.8% identity in 190 aa overlap); Q9KXP3 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (240 aa), FASTA scores: opt: 503, E(): 4.6e-24, (38.0% identity in 192 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217484.1" /db_xref="GI:15610105" /db_xref="GeneID:888476" /translation="MVAARPAERSGDPAAVRVPVPSAWWVLIGGVIGLFASMTLTVEK VRILLDPIYVPSCNVNPIVSCGSVMTTPQASLLGFPNPLLGIAGFTVVVVTGVLAVAK VPLPRWYWIGLAVGILVGVAFVHWLIFQSLYRIGALCPYCMVVWAVIATLLVVVASIV FGPMRENRGSQERVGARLLYQWRWSLATLWFTTVFLLIMVRFWDYWSTLI" gene complement(3323709..3324476) /locus_tag="Rv2969c" /db_xref="GeneID:888481" CDS complement(3323709..3324476) /locus_tag="Rv2969c" /function="UNKNOWN" /note="Rv2969c, (MTCY349.18), len: 255 aa. Possible conserved membrane or exported protein, equivalent to Q9CBS4|ML1667 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (264 aa), FASTA scores: opt: 1101, E(): 9.9e-68, (65.9% identity in 258 aa overlap); and highly similar to O69463 PUTATIVE TRANSMEMBRANE PROTEIN from Mycobacterium leprae (258 aa), FASTA scores: opt: 1097, E(): 1.8e-67, (65.5% identity in 258 aa overlap). C-terminus also highly similar to Q9KK65|996A160 EXPORTED PROTEIN (FRAGMENT) from Mycobacterium avium (85 aa), FASTA scores: opt: 418, E(): 2e-21, (72.95% identity in 85 aa overlap). Also weakly similar to membrane or exported proteins e.g. Q9S2U7|SC4G6.04c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (275 aa), FASTA scores: opt: 312, E(): 7.6e-14, (28.25% identity in 230 aa overlap); Q9XAB6|SCC22.22C PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (255 aa), FASTA scores: opt: 181, E(): 6.4e-05, (27.0% identity in 226 aa overlap); etc. Also some similarity with P72001|PKNE_MYCTU from Mycobacterium tuberculosis (566 aa), FASTA scores: opt: 264, E(): 2.3e-10, (30.5% identity in 177 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217485.1" /db_xref="GI:15610106" /db_xref="GeneID:888481" /translation="MADKSKRPPRFDLKSADGSFGRLVQIGGTTIVVVFAVVLVFYIV TSRDDKKDGVAGPGDAVRVTSSKLVTQPGTSNPKAVVSFYEDFLCPACGIFERGFGPT VSKLVDIGAVAADYTMVAILDSASNQHYSSRAAAAAYCVADESIEAFRRFHAALFSKD IQPAELGKDFPDNARLIELAREAGVVGKVPDCINSGKYIEKVDGLAAAVNVHATPTVR VNGTEYEWSTPAALVAKIKEIVGDVPGIDSAAATATS" gene complement(3324573..3325703) /gene="lipN" /locus_tag="Rv2970c" /db_xref="GeneID:887194" CDS complement(3324573..3325703) /gene="lipN" /locus_tag="Rv2970c" /EC_number="3.1.1.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2970c, (MTCY349.17), len: 376 aa. Probable lipN, lipase/esterase (EC 3.1.1.-), similar to others e.g. Q9AA37|CC0771 PUTATIVE ESTERASE from Caulobacter crescentus (380 aa), FASTA scores: opt: 822, E(): 8e-46, (42.15% identity in 318 aa overlap); Q9XDR4 ESTERASE HDE from petroleum-degrading bacterium HD-1 (317 aa), FASTA scores: opt: 738, E(): 2e-40, (48.85% identity in 262 aa overlap); O52270 LIPASE from Pseudomonas sp. (strain B11-1) (308 aa), FASTA scores: opt: 683, E(): 7.3e-37, (41.3% identity in 288 aa overlap); etc. Also similar to P71668 HYPOTHETICAL 34.1 KDA PROTEIN from Mycobacterium tuberculosis (320 aa), FASTA scores: opt: 715, E(): 6.3e-39, (42.3% identity in 298 aa overlap). Equivalent to AAK47374 from Mycobacterium tuberculosis strain CDC1551 (309 aa) but longer 67 aa." /codon_start=1 /transl_table=11 /product="lipase/esterase LipN" /protein_id="NP_217486.1" /db_xref="GI:15610107" /db_xref="GeneID:887194" /translation="MTKSLPGVADLRLGANHPRMWTRRVQGTVVNVGVKVLPWIPTPA KRILSAGRSVIIDGNTLDPTLQLMLSTSRIFGVDGLAVDDDIVASRAHMRAICEAMPG PQIHVDVTDLSIPGPAGEIPARHYRPSGGGATPLLVFYHGGGWTLGDLDTHDALCRLT CRDADIQVLSIDYRLAPEHPAPAAVEDAYAAFVWAHEHASDEFGALPGRVAVGGDSAG GNLSAVVCQLARDKARYEGGPTPVLQWLLYPRTDFTAQTRSMGLFGNGFLLTKRDIDW FHTQYLRDSDVDPADPRLSPLLAESLSGLAPALIAVAGFDPLRDEGESYAKALRAAGT AVDLRYLGSLTHGFLNLFQLGGGSAAGTNELISALRAHLSRV" gene 3325934..3326104 /locus_tag="Rv2970A" /db_xref="GeneID:3205038" CDS 3325934..3326104 /locus_tag="Rv2970A" /function="UNKNOWN" /note="Rv2970A, len: 56 aa. Conserved hypothetical protein, similar to C-terminal part of several oxidoreductases e.g. Rv2971|Z83018|MTCY349_22 from Mycobacterium tuberculosis (282 aa), FASTA scores: opt: 158, E(): 3.6e-06, (45.0% identity in 60 aa overlap). May represent a gene fragment." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177681.1" /db_xref="GI:57117040" /db_xref="GeneID:3205038" /translation="MLIRWHIQLGNIVIPKSVNPMRIASNFDAFDFPRSMTEPGLVRI RKPSISQAGEMT" gene 3326101..3326949 /locus_tag="Rv2971" /db_xref="GeneID:887275" CDS 3326101..3326949 /locus_tag="Rv2971" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2971, (MTCY349.16c), len: 282 aa. Probable oxidoreductase (EC 1.-.-.-), possibly aldo/keto reductase, equivalent to O69462 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (282 aa), FASTA scores: opt: 1495, E(): 4.9e-93, (82.35% identity in 272 aa overlap). Also similar to others e.g. Q9KYM9|SC9H11.10C OXIDOREDUCTASE from Streptomyces coelicolor (276 aa), FASTA scores: opt: 849, E(): 1.2e-49, (51.7% identity in 267 aa overlap); Q9ZBW7|SC4B5.01C PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (277 aa), FASTA scores: opt: 847, E(): 1.7e-49, (49.1% identity in 271 aa overlap); Q46857|YQHE_ECOLI|YQHE|B3012 HYPOTHETICAL OXIDOREDUCTASE from Escherichia coli strain K12 (275 aa), FASTA scores: opt: 827, E(): 3.7e-48, (47.45% identity in 276 aa overlap); etc. Contains PS00063 Aldo /keto reductase family putative active site signature; and PS00062 Aldo/keto reductase family signature 2." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217487.1" /db_xref="GI:15610108" /db_xref="GeneID:887275" /translation="MTGESGAAAAPSITLNDEHTMPVLGLGVAELSDDETERAVSAAL EIGCRLIDTAYAYGNEAAVGRAIAASGVAREELFVTTKLATPDQGFTRSQEACRASLD RLGLDYVDLYLIHWPAPPVGKYVDAWGGMIQSRGEGHARSIGVSNFTAENIENLIDLT FVTPAVNQIELHPLLNQDELRKANAQHTVVTQSYCPLALGRLLDNPTVTSIASEYVKT PAQVLLRWNLQLGNAVVVRSARPERIASNFDVFDFELAAEHMDALGGLNDGTRVREDP LTYAGT" misc_feature 3326491..3326544 /locus_tag="Rv2971" /note="PS00062 Aldo/keto reductase family signature 2" misc_feature 3326803..3326850 /locus_tag="Rv2971" /note="PS00063 Aldo/keto reductase family putative active site signature" gene complement(3327023..3327736) /locus_tag="Rv2972c" /db_xref="GeneID:887191" CDS complement(3327023..3327736) /locus_tag="Rv2972c" /function="UNKNOWN" /note="Rv2972c, (MTCY349.15), len: 237 aa. Possible conserved membrane or exported protein, equivalent (but longer 52 aa) to O69461|MLCB1243.02 HYPOTHETICAL 20.5 KDA PROTEIN from Mycobacterium leprae (180 aa), FASTA scores: opt: 581, E(): 8.2e-32, (55.75% identity in 174 aa overlap). Also similar to membrane or exported proteins e.g. Q9F2P3|SCE41.16C PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (258 aa), FASTA scores: opt: 498, E(): 4.1e-26, (44.08% identity in 186 aa overlap); Q99QB5|SCP1.323C PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (219 aa), FASTA scores: opt: 329, E(): 8.5e-15, (36.35% identity in 176 aa overlap); Q9ACQ1|SCP1.267 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (219 aa), FASTA scores: opt: 286, E(): 6.6e-12, (32.03% identity in 231 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217488.1" /db_xref="GI:15610109" /db_xref="GeneID:887191" /translation="MNRRTLLWLSAIAALALVVAYQTLGSSAGRHADEFAARAGVPTV QPGADVLAGIAVLPKRIHRYDYRRSAFGHPWDDRNDAPGGHNGCDTRDDILDRDLVDK TYVSIKRCPNAVATGTLRDPYTNTTVAFQRGASVGQSVQIDHIVPLSYAWDMGAYRWP NSERMRFANDPANLLAVQGQANQDKGDSPPAQWMPPNKAFACQYAMQFIAVLRGYSLP VDQPSSDVLRQAAATCPTG" gene complement(3327733..3329946) /gene="recG" /locus_tag="Rv2973c" /db_xref="GeneID:887439" CDS complement(3327733..3329946) /gene="recG" /locus_tag="Rv2973c" /EC_number="3.6.1.-" /function="CRITICAL ROLE IN RECOMBINATION AND DNA REPAIR. HELP PROCESS HOLLIDAY JUNCTION INTERMEDIATES TO MATURE PRODUCTS BY CATALYSING BRANCH MIGRATION. HAS A DNA UNWINDING ACTIVITY CHARACTERISTIC OF A DNA HELICASE WITH A 3' TO 5' POLARITY. RECG UNWIND BRANCHED DUPLEX DNA (Y-DNA)." /note="catalyzes branch migration in Holliday junction intermediates" /codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase RecG" /protein_id="NP_217489.1" /db_xref="GI:15610110" /db_xref="GeneID:887439" /translation="MASLSDRLDRVLGATAADALDEQFGMRTVDDLLRHYPRSYVEGA ARVGIGDARPEAGEHITIVDVITDTYSFPMKKKPNRKCLRITVGGGRNKVTATFFNAD YIMRDLTKHTKVMLSGEVGYYKGAMQLTHPAFLILDSPDGKNHGTRSLKSIADASKAI SGELVVEEFERRFFPIYPASTKVQSWDIFKCVRQVLDVLDRVDDPLPAELRAKHGLIP EDEALRAIHLAESQSLRERARERLTFDEAVGLQWALVARRHGELSESGPSAAWKSNGL AAELLRRLPFELTAGQREVLDVLSDGLAANRPLNRLLQGEVGSGKTIVAVLAMLQMVD AGYQCALLAPTEVLAAQHLRSIRDVLGPLAMGGQLGGAENATRVALLTGSMTAGQKKQ VRAEIASGQVGIVIGTHALLQEAVDFHNLGMVVVDEQHRFGVEQRDQLRAKAPAGITP HLLVMTATPIPRTVALTVYGDLETSTLRELPLGRQPIATNVIFVKDKPAWLDRAWRRI IEEAAAGRQAYVVAPRIDESDDTDVQGGVRPSATAEGLFSRLRSAELAELRLALMHGR LSADDKDAAMAAFRAGEVDVLVCTTVIEVGVDVPNATVMLVMDADRFGISQLHQLRGR IGRGEHPSVCLLASWVPPDTPAGQRLRAVAGTMDGFALADLDLKERKEGDVLGRNQSG KAITLRLLSLAEHEEYIVAARDFCIEAYKNPTDPALALMAARFTSTDRIEYLDKS" misc_feature complement(3328981..3329004) /gene="recG" /locus_tag="Rv2973c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3329949..3331361) /locus_tag="Rv2974c" /db_xref="GeneID:887383" CDS complement(3329949..3331361) /locus_tag="Rv2974c" /function="UNKNOWN" /note="Rv2974c, (MTCY349.13), len: 470 aa. Conserved hypothetical ala-rich protein, highly similar to others e.g. C-terminus of Q9ZBR4|SC7A1.09 HYPOTHETICAL 59.5 KDA PROTEIN from Streptomyces coelicolor (589 aa), FASTA scores: opt: 774, E(): 1.3e-36, (41.0% identity in 495 aa overlap); Q9K9Z6|BH2498 HYPOTHETICAL PROTEIN from Bacillus halodurans (557 aa), FASTA scores: opt: 268, E(): 8e-08, (27.7% identity in 502 aa overlap) (N-terminus longer 76 aa); Q9X293 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (497 aa), FASTA scores: opt: 265, E(): 1.1e-07, (24.9% identity in 470 aa overlap) (N-terminus longer 43 aa); etc. Also some similarity with P47609|Y369_MYCGE|MG369 HYPOTHETICAL PROTEIN from Mycoplasma genitalium (557 aa), FASTA scores: opt: 154, E(): 0.25, (20.25% identity in 489 aa overlap); this, and following ORF, are similar to Y369_MYCGE but no cosmid sequence error was identified." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217490.1" /db_xref="GI:15610111" /db_xref="GeneID:887383" /translation="MNGARGNSGVILSQILRGIAEVTATAAAASGAVLRAVDANALGA ALWRGVELVVASMGGVEVPGTIVSVLRAAAGAVDQCAHEGLAGAVTAAGDAAVIALEK TPEQLDVLADAGAVDAGGRGLLVLLDALRSTICGQAPARAVYEPSPRALPTDTATQRP APQFEVMYLLAVCDAAAADQLRDRLKELGESVAIAAAPPDSYSVHVHTDDAGAAVEAG LAVGRVSRIVISALGSGTSGLPAGGWTRGRAVLAVVDGDGAAELFAGEGACVLRPGPD AVTPAADISAHQLVRAVVDTGAAHVMVLPNGYVAAEELVAGCTAAIGWGVDVVPVPTG SMVQGLAALAVHDAARQAVDDGYSMARAAGASRHGSVRIATQKALTWAGTCKPGDGLG IAGDEVLIVADDVAAAAIGLVDLLLASGGDLVTVLIGAGVTEDVAVVLERHVHDHHPG TELVSYRTGHRGDALLIGVE" gene complement(3331358..3331612) /locus_tag="Rv2975c" /db_xref="GeneID:887512" CDS complement(3331358..3331612) /locus_tag="Rv2975c" /function="UNKNOWN" /note="Rv2975c, (MTCY349.12), len: 84 aa. Conserved hypothetical protein, similar to N-terminus of others e.g. Q9ZBR4|SC7A1.09 HYPOTHETICAL 59.5 KDA PROTEIN from Streptomyces coelicolor (589 aa), FASTA scores: opt: 141, E(): 0.0019, (41.25% identity in 80 aa overlap); Q98R49|MYPU_1610 HYPOTHETICAL PROTEIN from Mycoplasma pulmonis (545 aa), FASTA scores: opt: 127, E(): 0.023, (48.0% identity in 50 aa overlap); Q9K9Z6|BH2498 HYPOTHETICAL PROTEIN from Bacillus halodurans (557 aa), FASTA scores: opt: 126, E(): 0.028, (34.55% identity in 81 aa overlap); etc. Also some similarity with N-terminus of P47609|Y369_MYCGE|MG369 HYPOTHETICAL PROTEIN from Mycoplasma genitalium (557 aa), FASTA scores: opt: 108, E(): 0.7, (36.75% identity in 49 aa overlap); this, and preceding ORF, are similar to Y369_MYCGE and YLOV PROTEIN but no cosmid sequence error was identified." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217491.1" /db_xref="GI:15610112" /db_xref="GeneID:887512" /translation="MGTADRPLDASALRDWAHAVVSDLILHIDEINRLNVFPVADSDT GVNMLFTMRAAVVEADLHANSQADAEDVARVAAALAAGAR" gene complement(3332071..3332754) /gene="ung" /locus_tag="Rv2976c" /db_xref="GeneID:887410" CDS complement(3332071..3332754) /gene="ung" /locus_tag="Rv2976c" /EC_number="3.2.2.-" /function="INVOLVED IN BASE EXCISION REPAIR.EXCISES URACIL RESIDUES FROM THE DNA WHICH CAN ARISE AS A RESULT OF MISINCORPORATION OF DUMP RESIDUES BY DNA POLYMERASE OR DUE TO DEAMINATION OF CYTOSINE." /note="Excises uracil residues from the DNA which can arise as a result of misincorporation of dUMP residues by DNA polymerase or due to deamination of cytosine" /codon_start=1 /transl_table=11 /product="uracil-DNA glycosylase" /protein_id="NP_217492.1" /db_xref="GI:15610113" /db_xref="GeneID:887410" /translation="MTARPLSELVERGWAAALEPVADQVAHMGQFLRAEIAAGRRYLP AGSNVLRAFTFPFDNVRVLIVGQDPYPTPGHAVGLSFSVAPDVRPWPRSLANIFDEYT ADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEAVTECAIRALAARA APLVAILWGRDASTLKPMLAAGNCVAIESPHPSPLSASRGFFGSRPFSRANELLVGMG AEPIDWRLP" gene complement(3332787..3333788) /gene="thiL" /locus_tag="Rv2977c" /db_xref="GeneID:888342" CDS complement(3332787..3333788) /gene="thiL" /locus_tag="Rv2977c" /EC_number="2.7.4.16" /function="INVOLVED IN THIAMINE BIOSYNTHESIS [CATALYTIC ACTIVITY: ATP + THIAMINE PHOSPHATE = ADP + THIAMINE DIPHOSPHATE]." /note="catalyzes the formation of thiamine diphosphate from thiamine phosphate ant ATP" /codon_start=1 /transl_table=11 /product="thiamine monophosphate kinase" /protein_id="NP_217493.1" /db_xref="GI:15610114" /db_xref="GeneID:888342" /translation="MTTKDHSLATESPTLQQLGEFAVIDRLVRGRRQPATVLLGPGDD AALVSAGDGRTVVSTDMLVQDSHFRLDWSTPQDVGRKAIAQNAADIEAMGARATAFVV GFGAPAETPAAQASALVDGMWEEAGRIGAGIVGGDLVSCRQWVVSVTAIGDLDGRAPV LRSGAKAGSVLAVVGELGRSAAGYALWCNGIEDFAELRRRHLVPQPPYGHGAAAAAVG AQAMIDVSDGLLADLRHIAEASGVRIDLSAAALAADRDALTAAATALGTDPWPWVLSG GEDHALVACFVGPVPAGWRTIGRVLDGPARVLVDGEEWTGYAGWQSFGEPDNQGSLG" repeat_region complement(3333768..3335792) /note="IS1538, len: 2025 bp. Similar to other Insertion sequence elements in M. tuberculosis e.g. IS1535, IS1536, IS1537, & IS1539 (EM_NEW:MTCY274 Z74024 Mycobacterium tuberculosis cosmid Y274)" /mobile_element="insertion sequence:IS1538" repeat_region 3333768..3333773 /note="6 bp inverted repeat at the left end of IS1538, TGAGTG" gene complement(3333785..3335164) /locus_tag="Rv2978c" /db_xref="GeneID:887390" CDS complement(3333785..3335164) /locus_tag="Rv2978c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION ELEMENT IS1538." /note="Rv2978c, (MTCY349.09), len: 459 aa. Probable transposase for IS1538, very similar to several other putative transposases from Mycobacterium tuberculosis e.g. YX16_MYCTU|Q10809 (460 aa), FASTA scores: opt: 2613, E(): 0, (83.0% identity in 458 aa overlap); etc. Low level matches to other tranposases." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217494.1" /db_xref="GI:15610115" /db_xref="GeneID:887390" /translation="MPKFEVPDGWTVQAFRFTLDPTEDQAKALARHFGARRKAYNWTV ATLKADIQAWHASGTVTAKPSLRVLRKRWNTVKDDVCVNTETGVAWWPECSKEAYADG IAGAVEAYWNWQTSRAGKRAGKRVGFPRFKRKGRDQDRVSFTTGAMRVEPDRRHLTLP VIGTVRTHENTRRIERLIKAGRARVLAISVRRNGTRLDASVRVLVQRPQQPKVVHPGS RVGVDVGVRRLATVATADGTAIEQVENPRPLGAALRELRHVCRARSRCTKGSRRYRER TTQISRLHRRVNDVRTHHLHVLTTRLAQTHGRIVVEGLDATEMLRQKGLPGARARRRG LSDAALGTPRRHLSYKTVWYGSALVVADRWFPSSKTCHACRHVQDIGWDEQWQCDRCS VVHQRDDCAAINLARYEETSSIVGPVGAAVKRGADRKTGPRPAGGCEARKGSSPKAAE QPRDGVQVA" gene complement(3335164..3335748) /locus_tag="Rv2979c" /db_xref="GeneID:888325" CDS complement(3335164..3335748) /locus_tag="Rv2979c" /function="PREVENTS THE COINTEGRATION OF FOREIGN DNA BEFORE INTEGRATION INTO THE CHROMOSOME." /note="Rv2979c, (MTCY349.08), len: 194 aa. Probable resolvase for IS1538, with low level matches to transposon resolvases; highly similar from aa 101 to YX1C_MYCTU|Q10831 from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 809, E(): 0, (69.1% identity in 194 aa overlap). Contains PS00397 Site-specific recombinases active site, and possible helix-turn-helix motiv at aa 2-23." /codon_start=1 /transl_table=11 /product="resolvase" /protein_id="NP_217495.1" /db_xref="GI:15610116" /db_xref="GeneID:888325" /translation="MNLATWAERNGVAPGTAYRWFRAGLLSVMARRVGRLILVDEPAG DAGMRSPTAVYARVSSADQKADLDRQVARVTAWATAQQMPVDKVVTEVGSAFNEHRRK FLSLLRDPSVHRIVVEHRDRFCRLGSKYVQAAFAAQGRELVVVDSAEVDDDLVRDMTE ILTSMCARLYGKRAAENRTKRALAAAAGEDHEAA" misc_feature complement(3335560..3335586) /locus_tag="Rv2979c" /note="PS00397 Site-specific recombinases active site" repeat_region complement(3335787..3335792) /note="6 bp inverted repeat at the right end of IS1538, TGAGTG." gene 3335960..3336505 /locus_tag="Rv2980" /db_xref="GeneID:887506" CDS 3335960..3336505 /locus_tag="Rv2980" /function="UNKNOWN" /note="Rv2980, (MTCY349.07c), len: 181 aa. Possible conserved secreted protein, equivalent to Q9CBS1 POSSIBLE SECRETED PROTEIN from Mycobacterium leprae (191 aa), FASTA scores: opt: 794, E(): 2.3e-40, (67.25% identity in 177 aa overlap). Also some weak similarity with other hypothetical proteins or secreted proteins e.g. C-terminus of Q98F98|MLL3872 MLL3872 PROTEIN from Rhizobium loti (Mesorhizobium loti) (575 aa), FASTA scores: opt: 148, E(): 0.16, (28.35% identity in 194 aa overlap); Q9L0W9|SCH22A.13C PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 114, E(): 7.5, (40.0% identity in 80 aa overlap); etc. Equivalent to AAK47385 from Mycobacterium tuberculosis strain CDC1551 (214 aa) but shorter 33 aa. Has hydrophobic stretch near N-terminus." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217496.1" /db_xref="GI:15610117" /db_xref="GeneID:887506" /translation="MTGESDGPPRAVLIAAAALAAAVIGVILVVAANRQPPERPVVIP AVPAPQATGPGCKALLAALPQRLGEYRRAPVAEPTTAGATAWRTGPNSTPVILRCGLD RPAEFVVGSAIQVVDRVQWFQVAAQNPDEPGRSTWYTVDRPVYVALTLPSGSGPTAIQ ELSDVIDHTIPAVPIDPAPAR" gene complement(3336796..3337917) /gene="ddl" /locus_tag="Rv2981c" /db_xref="GeneID:888415" CDS complement(3336796..3337917) /gene="ddl" /locus_tag="Rv2981c" /EC_number="6.3.2.4" /function="INVOLVED IN CELL WALL FORMATION. ALONG WITH ALANINE RACEMASE, IT MAKES UP THE D-ALANINE BRANCH OF THE PEPTIDOGLYCAN BIOSYNTHETIC ROUTE. [CATALYTIC ACTIVITY: ATP + D-ALANINE + D-ALANINE = ADP + PHOSPHATE + D-ALANYL-D-ALANINE]." /note="D-alanine--D-alanine ligase; DdlA; DdlB; cytoplasmic; catalyzes the formation of D-alanyl-D-alanine from two D-alanines in peptidoglycan synthesis; there are two forms of this enzyme in Escherichia coli" /codon_start=1 /transl_table=11 /product="D-alanyl-alanine synthetase A" /protein_id="NP_217497.1" /db_xref="GI:15610118" /db_xref="GeneID:888415" /translation="MSANDRRDRRVRVAVVFGGRSNEHAISCVSAGSILRNLDSRRFD VIAVGITPAGSWVLTDANPDALTITNRELPQVKSGSGTELALPADPRRGGQLVSLPPG AGEVLESVDVVFPVLHGPYGEDGTIQGLLELAGVPYVGAGVLASAVGMDKEFTKKLLA ADGLPVGAYAVLRPPRSTLHRQECERLGLPVFVKPARGGSSIGVSRVSSWDQLPAAVA RARRHDPKVIVEAAISGRELECGVLEMPDGTLEASTLGEIRVAGVRGREDSFYDFATK YLDDAAELDVPAKVDDQVAEAIRQLAIRAFAAIDCRGLARVDFFLTDDGPVINEINTM PGFTTISMYPRMWAASGVDYPTLLATMIETTLARGVGLH" misc_feature complement(3337531..3337566) /gene="ddl" /locus_tag="Rv2981c" /note="PS00843 D-alanine--D-alanine ligase signature 1" gene complement(3337995..3338999) /gene="gpsA" /locus_tag="Rv2982c" /db_xref="GeneID:887864" CDS complement(3337995..3338999) /gene="gpsA" /locus_tag="Rv2982c" /EC_number="1.1.1.94" /function="INVOLVED IN DE NOVO PHOSPHOLIPID BIOSYNTHESIS; GLYCEROL-3 PHOSPHATE FORMATION [CATALYTIC ACTIVITY: SN-GLYCEROL 3-PHOSPHATE + NAD(P)(+) = GLYCERONE PHOSPHATE + NAD(P)H]." /note="catalyzes the NAD(P)H-dependent reduction of glycerol 3-phosphate to glycerone phosphate" /codon_start=1 /transl_table=11 /product="NAD(P)H-dependent glycerol-3-phosphate dehydrogenase" /protein_id="NP_217498.1" /db_xref="GI:15610119" /db_xref="GeneID:887864" /translation="MAGIASTVAVMGAGAWGTALAKVLADAGGEVTLWARRAEVADQI NTTRYNPDYLPGALLPPSIHATADAEEALGGASTVLLGVPAQTMRANLERWAPLLPEG ATLVSLAKGIELGTLMRMSQVIISVTGAEPPQVAVISGPNLASEIAECQPAATVVACS DSGRAVALQRALNSGYFRPYTNADVVGTEIGGACKNIIALACGMAVGIGLGENTAAAI ITRGLAEIIRLGTALGANGATLAGLAGVGDLVATCTSPRSRNRSFGERLGRGETLQSA GKACHVVEGVTSCESVLALASSYDVEMPLTDAVHRVCHKGLSVDEAITLLLGRRTKPE" gene 3339118..3339762 /locus_tag="Rv2983" /db_xref="GeneID:888161" CDS 3339118..3339762 /locus_tag="Rv2983" /function="UNKNOWN" /note="Rv2983, (MTCY349.04c), len: 214 aa. Conserved hypothetical ala-rich protein, equivalent to O33128|ML1680|MLCB637.37c HYPOTHETICAL 22.0 KDA PROTEIN from Mycobacterium leprae (216 aa), FASTA scores: opt: 1080, E(): 9e-61, (79.05% identity in 215 aa overlap). Also similar to other hypothetical proteins e.g. Q9ZBS2|SC7A1.01C from Streptomyces coelicolor (212 aa), FASTA scores: opt: 420, E(): 2.9e-19, (43.5% identity in 207 aa overlap); O26710|MTH613 from Methanothermobacter thermautotrophicus (223 aa), FASTA scores: opt: 193, E(): 5.8e-05, (30.0% identity in 190 aa overlap); Q9RKG8|SCE46.21 from Streptomyces coelicolor (210 aa), FASTA scores: opt: 139, E(): 0.14, (27.65% identity in 206 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217499.1" /db_xref="GI:15610120" /db_xref="GeneID:888161" /translation="MSGTPDDGDIGLIIAVKRLAAAKTRLAPVFSAQTRENVVLAMLV DTLTAAAGVGSLRSITVITPDEAAAAAAAGLGADVLADPTPEDDPDPLNTAITAAERV VAEGASNIVVLQGDLPALQTQELAEAISAARHHRRSFVADRLGTGTAVLCAFGTALHP RFGPDSSARHRRSGAVELTGAWPGLRCDVDTPADLTAARQLGVGPATARAVAHR" gene 3339854..3342082 /gene="ppk" /locus_tag="Rv2984" /db_xref="GeneID:887482" CDS 3339854..3342082 /gene="ppk" /locus_tag="Rv2984" /EC_number="2.7.4.1" /function="CATALYZES THE REVERSIBLE TRANSFER OF THE TERMINAL PHOSPHATE OF ATP TO FORM A LONG-CHAIN POLYPHOSPHATE (POLYP) [CATALYTIC ACTIVITY: ATP + {PHOSPHATE}(N) = ADP + {PHOSPHATE}(N+1)]." /note="catalyzes the reversible transfer of the terminal phosphate of ATP to form a long chain polyphosphate" /codon_start=1 /transl_table=11 /product="polyphosphate kinase" /protein_id="NP_217500.1" /db_xref="GI:15610121" /db_xref="GeneID:887482" /translation="MMSNDRKVTEIENSPVTEVRPEEHAWYPDDSALAAPPAATPAAI SDQLPSDRYLNRELSWLDFNARVLALAADKSMPLLERAKFLAIFASNLDEFYMVRVAG LKRRDEMGLSVRSADGLTPREQLGRIGEQTQQLASRHARVFLDSVLPALGEEGIYIVT WADLDQAERDRLSTYFNEQVFPVLTPLAVDPAHPFPFVSGLSLNLAVTVRQPEDGTQH FARVKVPDNVDRFVELAAREASEEAAGTEGRTALRFLPMEELIAAFLPVLFPGMEIVE HHAFRITRNADFEVEEDRDEDLLQALERELARRRFGSPVRLEIADDMTESMLELLLRE LDVHPGDVIEVPGLLDLSSLWQIYAVDRPTLKDRTFVPATHPAFAERETPKSIFATLR EGDVLVHHPYDSFSTSVQRFIEQAAADPNVLAIKQTLYRTSGDSPIVRALIDAAEAGK QVVALVEIKARFDEQANIAWARALEQAGVHVAYGLVGLKTHCKTALVVRREGPTIRRY CHVGTGNYNSKTARLYEDVGLLTAAPDIGADLTDLFNSLTGYSRKLSYRNLLVAPHGI RAGIIDRVEREVAAHRAEGAHNGKGRIRLKMNALVDEQVIDALYRASRAGVRIEVVVR GICALRPGAQGISENIIVRSILGRFLEHSRILHFRAIDEFWIGSADMMHRNLDRRVEV MAQVKNPRLTAQLDELFESALDPCTRCWELGPDGQWTASPQEGHSVRDHQESLMERHR SP" gene 3342165..3343118 /gene="mutT1" /locus_tag="Rv2985" /db_xref="GeneID:888165" CDS 3342165..3343118 /gene="mutT1" /locus_tag="Rv2985" /EC_number="3.-.-.-" /function="UNKNOWN; HYDROLYTIC ENZYME. POSSIBLY INVOLVED IN REMOVAL OF DAMAGED NUCLEOTIDE." /note="Rv2985, (MTCY349.02c), len: 317 aa. Possible mutT1, long MutT protein (hydrolase) (EC 3.-.-.-) (see citation below), highly similar to O33126|MLCB637.35 HYPOTHETICAL 34.5 KDA PROTEIN from Mycobacterium leprae (312 aa), FASTA scores: opt: 1514, E(): 5.1e-91, (71.85% identity in 316 aa overlap); and Q9CBR8|ML1682 HYPOTHETICAL PROTEIN from Mycobacterium leprae (311 aa), FASTA scores: opt: 1510, E(): 9.2e-91, (71.5% identity in 316 aa overlap). Also similar to Q50195|L222-ORF6|ML2698 HYPOTHETICAL PROTEIN from Mycobacterium leprae (251 aa), FASTA scores: opt: 231, E(): 1.1e-07, (36.7% identity in 128 aa overlap). Also similar to shorter mutt proteins and related hypothetical protein e.g. Q9EUS6 HYPOTHETICAL 16.6 KDA PROTEIN from Streptomyces griseus subsp. griseus (152 aa), FASTA scores: opt: 380, E(): 1.7e-17, (50.75% identity in 130 aa overlap); Q9KZV8|SCD84.10C PUTATIVE MUTT-LIKE PROTEIN from Streptomyces coelicolor (142 aa), FASTA scores: opt: 376, E(): 2.9e-17, (46.1% identity in 128 aa overlap); P96590|MUTT MUTT PROTEIN from Bacillus subtilis (149 aa), FASTA scores: opt: 180, E(): 0.00017, (35.25% identity in 122 aa overlap); etc. Also similar to O05437 HYPOTHETICAL 27.1 KDA PROTEIN from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 224, E(): 3.2e-07, (34.03% identity in 144 aa overlap). Contains PS00893 mutT domain signature. SEEMS TO BELONG TO THE MUTT/NUDIX FAMILY PROTEIN." /codon_start=1 /transl_table=11 /product="hydrolase MutT1" /protein_id="NP_217501.1" /db_xref="GI:15610122" /db_xref="GeneID:888165" /translation="MSIQNSSARRRSAGRIVYAAGAVLWRPGSADSEGPVEIAVIHRP RYDDWSLPKGKVDPGETAPVGAVREILEETGHRANLGRRLLTVTYPTDSPFRGVKKVH YWAARSTGGEFTPGSEVDELIWLPVPDAMNKLDYAQDRKVLCRFAKHPADTQTVLVVR HGTAGSKAHFSGDDSKRPLDKRGRAQAEALVPQLLAFGATDVYAADRVRCHQTMEPLA AELNVTIHNEPTLTEESYANNPKRGRHRVLQIVEQVGTPVICTQGKVIPDLITWWCER DGVHPDKSRNRKGSTWVLSLSAGRLVTADHIGGALAANVRA" misc_feature 3342324..3342383 /gene="mutT1" /locus_tag="Rv2985" /note="PS00893 mutT domain signature" gene complement(3343176..3343820) /gene="hupB" /locus_tag="Rv2986c" /db_xref="GeneID:888166" CDS complement(3343176..3343820) /gene="hupB" /locus_tag="Rv2986c" /function="THIS PROTEIN BELONGS TO THE HISTONE LIKE FAMILY OF PROKARYOTIC DNA-BINDING PROTEINS WHICH ARE CAPABLE OF WRAPPING DNA TO STABILIZE IT, AND PREVENT ITS DENATURATION UNDER EXTREME ENVIRONMENTAL CONDITIONS." /experiment="experimental evidence, no additional details recorded" /note="Rv2986c, (MTCY349.01), len: 214 aa. Probable hupB (alternate gene names: hup, hlp, lbp21), DNA-binding protein HU homolog (resembles fusion between HU and histone) (see Pethe et al., 2002), equivalent to others from Mycobacteria e.g. Q9XB18|DBH_MYCBO from Mycobacterium bovis (205 aa), FASTA scores: opt: 1050, E(): 5.6e-45, (95.35% identity in 214 aa overlap); Q9ZHC5|DBH_MYCSM from Mycobacterium smegmatis (208 aa), FASTA scores: opt: 1035, E(): 3.1e-44, (80.2% identity in 217 aa overlap); and O33125|DBH_MYCLE from Mycobacterium leprae (200 aa), FASTA scores: opt: 914, E(): 2.7e-38, (80.1% identity in 216 aa overlap). Also highly similar to others from other organisms e.g. O86537|DBH2_STRCO from Streptomyces coelicolor (218 aa), FASTA scores: opt: 569, E(): 2.6e-21, (51.35% identity in 220 aa overlap); P08821|DBH1_BACSU from Bacillus subtilis (92 aa), FASTA scores: opt: 280, E(): 2.5e-07, (45.05% identity in 91 aa overlap) (C-terminus shorter); etc. Contains PS00045 Bacterial histone-like DNA-binding proteins signature. BELONGS TO THE BACTERIAL HISTONE-LIKE PROTEIN FAMILY. Note that its C-terminal domain is very rich in lysine and alanine.; hup; hlp; lbp21" /codon_start=1 /transl_table=11 /product="DNA-binding protein HU" /protein_id="NP_217502.1" /db_xref="GI:15610123" /db_xref="GeneID:888166" /translation="MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTI TGFGVFEQRRRAARVARNPRTGETVKVKPTSVPAFRPGAQFKAVVSGAQRLPAEGPAV KRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAATKAPAKKAVKA TKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPATKAPAKKATARRGRK" misc_feature complement(3343626..3343685) /gene="hupB" /locus_tag="Rv2986c" /note="PS00045 Bacterial histone-like DNA-binding proteins signature" gene complement(3344033..3344629) /gene="leuD" /locus_tag="Rv2987c" /db_xref="GeneID:888225" CDS complement(3344033..3344629) /gene="leuD" /locus_tag="Rv2987c" /EC_number="4.2.1.33" /function="INVOLVED IN LEUCINE BIOSYNTHESIS (AT THE SECOND STEP) [CATALYTIC ACTIVITY: 3-ISOPROPYLMALATE = 2-ISOPROPYLMALEATE + H(2)O (ALSO CATALYSES 2-ISOPROPYLMALEATE + H(2)O = 3-HYDROXY-4-METHYL-3-CARBOXYPENTANONE)]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the isomerization between 2-isopropylmalate and 3-isopropylmalate in leucine biosynthesis; forms a heterodimer of LeuC/D" /codon_start=1 /transl_table=11 /product="isopropylmalate isomerase small subunit" /protein_id="NP_217503.1" /db_xref="GI:15610124" /db_xref="GeneID:888225" /translation="MEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFA GWRSDPAFVLNLSPFDRGSVLVAGPDFGTGSSREHAVWALMDYGFRVVISSRFGDIFR GNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVVLPFKIDDHSA WRLLEGLDDIALTLRKLDEIEAFEGACAYWKPRTLPAP" gene complement(3344654..3346075) /gene="leuC" /locus_tag="Rv2988c" /db_xref="GeneID:887875" CDS complement(3344654..3346075) /gene="leuC" /locus_tag="Rv2988c" /EC_number="4.2.1.33" /function="INVOLVED IN LEUCINE BIOSYNTHESIS (AT THE SECOND STEP) [CATALYTIC ACTIVITY: 3-ISOPROPYLMALATE = 2-ISOPROPYLMALEATE + H(2)O (ALSO CATALYSES 2-ISOPROPYLMALEATE + H(2)O = 3-HYDROXY-4-METHYL-3-CARBOXYPENTANONE)]." /experiment="experimental evidence, no additional details recorded" /note="dehydratase component, catalyzes the isomerization between 2-isopropylmalate and 3-isopropylmalate" /codon_start=1 /transl_table=11 /product="isopropylmalate isomerase large subunit" /protein_id="NP_217504.1" /db_xref="GI:15610125" /db_xref="GeneID:887875" /translation="MALQTGEPRTLAEKIWDDHIVVSGGGCAPDLIYIDLHLVHEVTS PQAFDGLRLAGRRVRRPELTLATEDHNVPTVDIDQPIADPVSRTQVETLRRNCAEFGI RLHSMGDIEQGIVHVVGPQLGLTQPGMTIVCGDSHTSTHGAFGALAMGIGTSEVEHVL ATQTLPLRPFKTMAVNVDGRLPDGVSAKDIILALIAKIGTGGGQGHVIEYRGSAIESL SMEGRMTICNMSIEAGARAGMVAPDETTYAFLRGRPHAPTGAQWDTALVYWQRLRTDV GAVFDTEVYLDAASLSPFVTWGTNPGQGVPLAAAVPDPQLMTDDAERQAAEKALAYMD LRPGTAMRDIAVDAVFVGSCTNGRIEDLRVVAEVLRGRKVADGVRMLIVPGSMRVRAQ AEAEGLGEIFTDAGAQWRQAGCSMCLGMNPDQLASGERCAATSNRNFEGRQGAGGRTH LVSPAVAAATAVRGTLSSPADLN" misc_feature complement(3345005..3345040) /gene="leuC" /locus_tag="Rv2988c" /note="PS00450 Aconitase family signature" gene 3346147..3346848 /locus_tag="Rv2989" /db_xref="GeneID:887723" CDS 3346147..3346848 /locus_tag="Rv2989" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv2989, (MTV012.03), len: 233 aa. Probable transcriptional regulator (ala-rich protein), highly similar to O86533|SC1C2.33c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (238 aa), FASTA scores: opt: 711, E(): 2.3e-38, (53.05% identity in 230 aa overlap); and similar to others e.g. Q9KND6 PUTATIVE TRANSCRIPTIONAL REGULATOR from Vibrio cholerae (244 aa), FASTA scores: opt: 232, E(): 1.2e-07, (29.75% identity in 232 aa overlap); Q9R9U0|SRPS EFFLUX PUMP REGULATOR from Pseudomonas putida (259 aa), FASTA scores: opt: 224, E(): 4.1e-07, (28.35% identity in 247 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. O06806|Rv1773c|MTCY28.39 HYPOTHETICAL 26.6 KDA PROTEIN (248 aa), FASTA scores: opt: 239, E(): 4.4e-08, (29.85% identity in 231 aa overlap); P71977|RV1719|MTCY04C12.04 HYPOTHETICAL 27.9 KDA PROTEIN (259 aa), FASTA scores: opt: 215, E(): 1.6e-06, (31.85% identity in 223 aa overlap); etc. Equivalent to AAK47396 from Mycobacterium tuberculosis strain CDC1551 (267 aa) but shorter 34 aa. Contains possible helix-turn-helix motif at aa 25-46 (Score 1005, +2.61 SD). TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217505.1" /db_xref="GI:15610126" /db_xref="GeneID:887723" /translation="MRQHSGIGVLDKAVGVLHAVAESPCGLAELCDRTDLPRATAYRL AAALEVHRLLGRGQDGHWRLGPAITELATHVDDPLLVACAAVLPQLRDATGESVQVYR REGTSRVCVAALEPAAGLRDTVPVGARLPMTAGSGAKVLLAHTDAATQAAVLPKAVFS ARALAEVCRRGWAQSVAEREPGVASVSAPVRDGRGVVIAAISVSGPIDRMGRRPGVRW AADLLSAADALTRRL" gene complement(3346859..3347719) /locus_tag="Rv2990c" /db_xref="GeneID:887176" CDS complement(3346859..3347719) /locus_tag="Rv2990c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv2990c, (MTV012.04c), len: 286 aa. Hypothetical unknown protein. TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217506.1" /db_xref="GI:15610127" /db_xref="GeneID:887176" /translation="MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEG VHGERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRL LVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGL EPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEAR RFPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHG NDYVIAVEPM" gene 3347982..3348473 /locus_tag="Rv2991" /db_xref="GeneID:887792" CDS 3347982..3348473 /locus_tag="Rv2991" /function="UNKNOWN" /note="Rv2991, (MTV012.05), len: 163 aa. Conserved hypothetical protein, similar to others e.g. Q9K3X7|2SCG61.39. HYPOTHETICAL 17.6 KDA PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 266, E(): 2.1e-11, (34.85% identity in 155 aa overlap); Q9CNX3|PM0299 HYPOTHETICAL PROTEIN from Pasteurella multocida (171 aa), FASTA scores: opt: 175, E(): 5.1e-05, (31.3% identity in 131 aa overlap); Q9KZI9|SCG8A.10 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (142 aa), FASTA scores: opt: 163, E(): 0.00031, (32.4% identity in 108 aa overlap); etc. Also some similarity to O06553|MTCI65.22|Rv1155 hypothetical protein from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 127, E(): 0.1, (32.9% identity in 73 aa overlap); and to several proteins of similar size that confer resistance to 5-Nitroimidazole antibiotics in Bacteroides. TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217507.1" /db_xref="GI:15610128" /db_xref="GeneID:887792" /translation="MGTKQRADIVMSEAEIADFVNSSRTGTLATIGPDGQPHLTAMWY AVIDGEIWLETKAKSQKAVNLRRDPRVSFLLEDGDTYDTLRGVSFEGVAEIVEEPEAL HRVGVSVWERYTGPYTDECKPMVDQMMNKRVGVRIVARRTRSWDHRKLGLPHMSVGGS TAP" gene complement(3348547..3348619) /locus_tag="Rvnt35" /note="tRNA-Glu(CTC)" /db_xref="GeneID:2700433" tRNA complement(3348547..3348619) /locus_tag="Rvnt35" /product="tRNA-Glu" /note="codon recognized: GAG" /anticodon=(pos:3348583..3348585,aa:Glu) /db_xref="GeneID:2700433" gene complement(3348659..3348730) /locus_tag="Rvnt36" /note="tRNA-Gln(CTG)" /db_xref="GeneID:2700442" tRNA complement(3348659..3348730) /locus_tag="Rvnt36" /product="tRNA-Gln" /note="codon recognized: CAG" /anticodon=(pos:3348695..3348697,aa:Gln) /db_xref="GeneID:2700442" gene complement(3348805..3350277) /gene="gltX" /locus_tag="Rv2992c" /db_xref="GeneID:887850" CDS complement(3348805..3350277) /gene="gltX" /locus_tag="Rv2992c" /EC_number="6.1.1.17" /function="INVOLVED IN TRANSLATION MECHANISMS [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE + TRNA(GLU) = AMP + DIPHOSPHATE + L-GLUTAMYL-TRNA(GLU)]." /experiment="experimental evidence, no additional details recorded" /note="Charges one glutamine molecule and pairs it to its corresponding RNA trinucleotide during protein translation" /codon_start=1 /transl_table=11 /product="glutamyl-tRNA synthetase" /protein_id="YP_177915.1" /db_xref="GI:57117041" /db_xref="GeneID:887850" /translation="MTATETVRVRFCPSPTGTPHVGLVRTALFNWAYARHTGGTFVFR IEDTDAQRDSEESYLALLDALRWLGLDWDEGPEVGGPYGPYRQSQRAEIYRDVLARLL AAGEAYHAFSTPEEVEARHVAAGRNPKLGYDNFDRHLTDAQRAAYLAEGRQPVVRLRM PDDDLAWNDLVRGPVTFAAGSVPDFALTRASGDPLYTLVNPCDDALMKITHVLRGEDL LPSTPRQLALHQALIRIGVAERIPKFAHLPTVLGEGTKKLSKRDPQSNLFAHRDRGFI PEGLLNYLALLGWSIADDHDLFGLDEMVAAFDVADVNSSPARFDQKKADALNAEHIRM LDVGDFTVRLRDHLDTHGHHIALDEAAFAAAAELVQTRIVVLGDAWELLKFFNDDQYV IDPKAAAKELGPDGAAVLDAALAALTSVTDWTAPLIEAALKDALIEGLALKPRKAFSP IRVAATGTTVSPPLFESLELLGRDRSMQRLRAARQLVGHA" gene complement(3350274..3350993) /locus_tag="Rv2993c" /db_xref="GeneID:887867" CDS complement(3350274..3350993) /locus_tag="Rv2993c" /EC_number="5.3.3.-" /function="ISOMERIZES HHDD (2-HYDROXY-HEPT-2,4-DIENE-1,7-DIOATE) TO OHED (2-OXO-HEPT-3-ENE-1,7-DIOATE)." /experiment="experimental evidence, no additional details recorded" /note="Rv2993c, (MTV012.07c), len: 239 aa. Possible 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (EC 5.3.3.-), equivalent to O33119|ML1689|MLCB637.28 POSSIBLE 2-HYDROXYHEPTA-2,4-DIENE- 1,7-DIOATE ISOMERASE from Mycobacterium leprae (242 aa), FASTA scores: opt: 1427, E(): 4.4e-86, (85.9% identity in 241 aa overlap). Also similar to others e.g. Q9LBE3|DR1609 from Deinococcus radiodurans (250 aa), FASTA scores: opt: 723, E(): 5.5e-40, (49.05% identity in 216 aa overlap); O27551|MTH1507 from Methanothermobacter thermautotrophicus (260 aa), FASTA scores: opt: 708, E(): 5.4e-39, (52.1% identity in 213 aa overlap); Q9HQR6|VNG1037G|HPCE from Halobacterium sp. (strain NRC-1) (244 aa), FASTA scores: opt: 590, E(): 2.7e-31, (43.65% identity in 220 aa overlap); etc. Start chosen by homology, but ORF could continue upstream. TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="2-hydroxyhepta-2,4-diene-1,7-dioate isomerase" /protein_id="NP_217509.1" /db_xref="GI:15610130" /db_xref="GeneID:887867" /translation="MTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNY ADHIAEMGGRPPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACKDV PAAQAVDNILGYTIGNDVSARDQQQSDGQWTRAKGHDTFCPVGPWIVTDLAPFDPADL ELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGDLILTGTPAGVGPIEDGDT VSITIEGIGTLTNPVVRKGKP" gene 3351269..3352606 /locus_tag="Rv2994" /db_xref="GeneID:888200" CDS 3351269..3352606 /locus_tag="Rv2994" /function="UNKNOWN; COULB BE INVOLVED IN EFFLUX SYSTEM (POSSIBLY DRUG)." /note="Rv2994, (MTV012.08), len: 445 aa. Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of drug. C-terminal part highly similar to O33118|MLCB637.27c HYPOTHETICAL 14.7 KDA PROTEIN (probable pseudogene product) from Mycobacterium leprae (134 aa), FASTA scores: opt: 483, E(): 2.7e-21, (60.9% identity in 138 aa overlap). Also similar to various transporters e.g. Q9I5C8|PA0811 PROBABLE MFS TRANSPORTER from Pseudomonas aeruginosa (415 aa), FASTA scores: opt: 289, E(): 1.3e-09, (26.05% identity in 399 aa overlap); O30210|AF0025 CYANATE TRANSPORT PROTEIN from Archaeoglobus fulgidus (393 aa), FASTA scores: opt: 281, E(): 3.7e-09, (24.05% identity in 399 aa overlap); Q9RI35|SCJ12.25C PUTATIVE NITRATE/NITRITE TRANSPORTER from Streptomyces coelicolor (412 aa), FASTA scores: opt: 264, E(): 3.8e-08, (24.95% identity in 409 aa overlap); Q9A5N5|CC2412 MAJOR FACILITATOR FAMILY TRANSPORTER from Caulobacter crescentus (405 aa), FASTA scores: opt: 263, E(): 4.3e-08, (27.55% identity in 399 aa overlap); etc. First start taken; similarity to P21191|NORA_STAAU QUINOLONE RESISTANCE PROTEIN from Staphylococcus aureus (388 aa) suggests alternative start at 7319 but then no positively charged aa before first transmembrane segment. TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217510.1" /db_xref="GI:15610131" /db_xref="GeneID:888200" /translation="MSRDPTGVGARWAIMIVSLGVTASSFLFINGVAFLIPRLENARG TPLSHAGLLASMPSWGLVVTMFAWGYLLDHVGERMVMAVGSALTAAAAYAAASVHSLL WIGVFLFLGGMAAGGCNSAGGRLVSGWFPPQQRGLAMGIRQTAQPLGIASGALVIPEL AERGVHAGLMFPAVVCTLAAVASVLGIVDPPRKSRTKASEQELASPYRGSSILWRIHA ASALLMMPQTVTVTFMLVWLINHHGWSVAQAGVLVTISQLLGALGRVAVGRWSDHVGS RMRPVRLIAAAAAATLFLLAAVDNEGSRYDVLLMIAISVIAVLDNGLEATAITEYAGP YWSGRALGIQNTTQRLMAAAGPPLFGSLITTAAYPTAWALCGVFPLAAVPLVPVRLLP PGLETRARRQSVRRHRWWQAVRCHAWPNGPRRPGPPGQPRRVRQGGTAITPPT" gene complement(3352458..3353468) /gene="leuB" /locus_tag="Rv2995c" /db_xref="GeneID:888182" CDS complement(3352458..3353468) /gene="leuB" /locus_tag="Rv2995c" /EC_number="1.1.1.85" /function="INVOLVED IN LEUCINE BIOSYNTHESIS (AT THE THIRD STEP) [CATALYTIC ACTIVITY: 3-CARBOXY-2-HYDROXY-4-METHYLPENTANOATE + NAD(+) = 3-CARBOXY-4-METHYL-2-OXOPENTANOATE + NADH (THE PRODUCT DECARBOXYLATES TO 4-METHYL-2-OXOPENTANOATE)]." /note="catalyzes the oxidation of 3-isopropylmalate to 3-carboxy-4-methyl-2-oxopentanoate in leucine biosynthesis" /codon_start=1 /transl_table=11 /product="3-isopropylmalate dehydrogenase" /protein_id="NP_217511.1" /db_xref="GI:15610132" /db_xref="GeneID:888182" /translation="MKLAIIAGDGIGPEVTAEAVKVLDAVVPGVQKTSYDLGARRFHA TGEVLPDSVVAELRNHDAILLGAIGDPSVPSGVLERGLLLRLRFELDHHINLRPARLY PGVASPLSGNPGIDFVVVREGTEGPYTGNGGAIRVGTPNEVATEVSVNTAFGVRRVVA DAFERARRRRKHLTLVHKTNVLTFAGGLWLRTVDEVGECYPDVEVAYQHVDAATIHMI TDPGRFDVIVTDNLFGDIITDLAAAVCGGIGLAASGNIDATRANPSMFEPVHGSAPDI AGQGIADPTAAIMSVALLLSHLGEHDAAARVDRAVEAHLATRGSERLATSDVGERIAA AL" gene complement(3353483..3355069) /gene="serA1" /locus_tag="Rv2996c" /db_xref="GeneID:887154" CDS complement(3353483..3355069) /gene="serA1" /locus_tag="Rv2996c" /EC_number="1.1.1.95" /function="INVOLVED AT THE FIRST COMMITTED STEP IN THE 'PHOSPHORYLATED' PATHWAY OF L-SERINE BIOSYNTHESIS [CATALYTIC ACTIVITY: 3-PHOSPHOGLYCERATE + NAD(+) = 3-PHOSPHOHYDROXYPYRUVATE + NADH]." /note="catalyzes the formation of 3-phosphonooxypyruvate from 3-phospho-D-glycerate in serine biosynthesis; can also reduce alpha ketoglutarate to form 2-hydroxyglutarate" /codon_start=1 /transl_table=11 /product="D-3-phosphoglycerate dehydrogenase" /protein_id="YP_177916.1" /db_xref="GI:57117042" /db_xref="GeneID:887154" /translation="MSLPVVLIADKLAPSTVAALGDQVEVRWVDGPDRDKLLAAVPEA DALLVRSATTVDAEVLAAAPKLKIVARAGVGLDNVDVDAATARGVLVVNAPTSNIHSA AEHALALLLAASRQIPAADASLREHTWKRSSFSGTEIFGKTVGVVGLGRIGQLVAQRI AAFGAYVVAYDPYVSPARAAQLGIELLSLDDLLARADFISVHLPKTPETAGLIDKEAL AKTKPGVIIVNAARGGLVDEAALADAITGGHVRAAGLDVFATEPCTDSPLFELAQVVV TPHLGASTAEAQDRAGTDVAESVRLALAGEFVPDAVNVGGGVVNEEVAPWLDLVRKLG VLAGVLSDELPVSLSVQVRGELAAEEVEVLRLSALRGLFSAVIEDAVTFVNAPALAAE RGVTAEICKASESPNHRSVVDVRAVGADGSVVTVSGTLYGPQLSQKIVQINGRHFDLR AQGINLIIHYVDRPGALGKIGTLLGTAGVNIQAAQLSEDAEGPGATILLRLDQDVPDD VRTAIAAAVDAYKLEVVDLS" misc_feature complement(3354458..3354499) /gene="serA1" /locus_tag="Rv2996c" /note="PS00670 D-isomer specific 2-hydroxyacid dehydrogenases signature 2" misc_feature complement(3354557..3354640) /gene="serA1" /locus_tag="Rv2996c" /note="PS00065 D-isomer specific 2-hydroxyacid dehydrogenases NAD-binding signature" misc_feature complement(3354641..3354664) /gene="serA1" /locus_tag="Rv2996c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3355099..3356541 /locus_tag="Rv2997" /db_xref="GeneID:888637" CDS 3355099..3356541 /locus_tag="Rv2997" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv2997, (MTV012.11), len: 480 aa. Possible ala-rich dehydrogenase (EC 1.-.-.-), similar to others dehydrogenases and hypothetical proteins e.g. Q9EYI5 PUTATIVE DEHYDROGENASE from Streptomyces nogalater (472 aa), FASTA scores: opt: 1131, E(): 1.7e-61, (41.0% identity in 471 aa overlap); Q9ZBG4|SC9B5.16 PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (472 aa), FASTA scores: opt: 1064, E(): 2e-57, (39.05% identity in 471 aa overlap); Q98BS8 PROBABLE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (524 aa), FASTA scores: opt: 196, E(): 0.00021, (25.1% identity in 526 aa overlap); etc. Shows strong simlarity throughout its length to O06826|MTCY493.22c|Rv1432 HYPOTHETICAL 50.5 KDA PROTEIN from Mycobacterium tuberculosis (473 aa), FASTA scores: opt: 1220, E(): 6.1e-67, (42.35% identity in 465 aa overlap). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="alanine rich dehydrogenase" /protein_id="NP_217513.1" /db_xref="GI:15610134" /db_xref="GeneID:888637" /translation="MDVTVVGSGPNGLATAVICARAGLNVQVVEAQATFGGGARSAAD FEFPEVLHDVCSAVHPLALASPFFAEFDLPARGVTLTVPDIAYANPLPGRPAAIAYHD LAHTCAKLDDGASWRRLLGPLVAHSETVVEFMLSDKRSLPTALGSVLRLGLRMLAQGT PAWRSLAGEDARALFTGVAAHAISPLPSLVSAGAGLMLATLAHSVGWPIPVGGTQAIA DALIADLRAHGGRLAAGVEITEPQRSVVVFDTAPTALLRVYRDKLPHRYAKALRRYRF RAGIAKVDFVLSDEIPWSDPRLRRAATLHLGGTRDQMARAEADVAAGRHADWPMVLAA CPHVADPGRIDETGRRPFWTYAHVPSGSTLDATETVTSVLERFAPGFRDIVVAARAVP AARMADHNANYVGGDITVGANSTWRAIAGPTPRLNPWRTPIPKVYLCSAATPPGAGVH GMCGWYAARTLLRTEFGITRMPPLGHELRP" gene 3356815..3357276 /locus_tag="Rv2998" /db_xref="GeneID:888646" CDS 3356815..3357276 /locus_tag="Rv2998" /function="UNKNOWN" /note="Rv2998, (MTV012.12), len: 153 aa. Hypothetical unknown protein. Note that equivalent to AAK47405 Hypothetical 19.4 kDa protein from Mycobacterium tuberculosis strain CDC1551 (186 aa) but sequence differs in N-terminus. TBparse score is 0.938." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217514.1" /db_xref="GI:15610135" /db_xref="GeneID:888646" /translation="MDVIWSATIATTVATGMRKPRMHGMPPITSGSMVTRVTRMSIRL AGDSTLGRFSTSRLGLSSAKSKPEGDFGTACGAVSGGDAGVVALAEGVDDGQSKPGAA GGARGVGGFRESRADCGEQFGVASWTPQGEFEFGGQEAKGVRSSWPASLTN" gene complement(3357225..3357428) /locus_tag="Rv2998A" /db_xref="GeneID:3205090" CDS complement(3357225..3357428) /locus_tag="Rv2998A" /function="UNKNOWN" /note="Rv2998A, len: 67 aa. Probable conserved hypothetical protein, (possibly gene fragment), highly similar to central part of two-component sensor proteins e.g. O07777|Rv0601c|MTCY19H5.21 TWO COMPONENT SENSOR (FRAGMENT) from Mycobacterium tuberculosis (156 aa), FASTA scores: opt: 212, E(): 3.7e-09, (58.2% identity in 67 aa overlap); Q9L2B6|SC8F4.08 PROBABLE TWO-COMPONENT SENSOR KINASE from Streptomyces coelicolor (478 aa), FASTA scores: opt: 193, E(): 2.6e-07, (47.05% identity in 68 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177682.1" /db_xref="GI:57117043" /db_xref="GeneID:3205090" /translation="MERMRIRAAGISATDPHARLPLPLARDEIRYLGTTFNDLLQRLQ DALERERQFVSDAGHELRTPLAS" gene 3357602..3358567 /gene="lppY" /locus_tag="Rv2999" /db_xref="GeneID:887357" CDS 3357602..3358567 /gene="lppY" /locus_tag="Rv2999" /function="UNKNOWN" /note="Rv2999, (MTV012.13), len: 321 aa. Probable lppY, conserved lipoprotein, highly similar to O07774|LPQO|Rv0604|MTCY19H5.18c PUTATIVE LIPOPROTEIN from Mycobacterium tuberculosis (316 aa), FASTA scores: opt: 1153, E(): 5e-62, (53.2% identity in 312 aa overlap); and showing similarity with AAK80743|CAC2799 UNCHARACTERIZED CONSERVED PROTEIN SIMILAR TO LPPY/LPQO OF Mycobacterium tuberculosis from Clostridium acetobutylicum (152 aa), FASTA scores: opt: 165, E(): 0.0077, (26.08% identity in 138 aa overlap); and Q9F2T1|SCD65.01c PUTATIVE LIPOPROTEIN (FRAGMENT) from Streptomyces coelicolor (146 aa), FASTA scores: opt: 126, E(): 1.6, (% identity in aa overlap). Equivalent to AAK47407 from Mycobacterium tuberculosis strain CDC1551 (329 aa) but shorter 8 aa. Contains probable N-terminal signal sequence and PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="lipoprotein LppY" /protein_id="NP_217515.1" /db_xref="GI:15610136" /db_xref="GeneID:887357" /translation="MAGAKHAGRIVAITTAAAVILAACSSGSKGGAGSGHAGKARSAV TTTDADWKPVADALGRSGKLGDNNTAYRINLPRNDLHITSYGVDIKPGLSLGGYAAFA RYDNNETLLMGDLVITEEELPKVTDALQAHGIAQTALHKHLLQQDPPVWWTHIHGMGD AARLAQGLKAALDATTIGPPTPPPARQPPVDIDVAGVDQALGRKGTQDGGLMKYSIPR KDTIIEDGHVLPAVSLNLTTVINFQPVGRGRAAINGDFILIAPEVQEVIRAMRAGNIT IVELHNHGLTEEPRLFYMHYWAVDDAVTLARALRPAMDATNLQSS" misc_feature 3357641..3357673 /gene="lppY" /locus_tag="Rv2999" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 3358612..3359271 /locus_tag="Rv3000" /db_xref="GeneID:887271" CDS 3358612..3359271 /locus_tag="Rv3000" /function="UNKNOWN" /note="Rv3000, (MTV012.14), len: 219 aa. Possible conserved transmembrane protein, similar to various membrane proteins e.g. P77307|YBBM_ECOLI|B0491 HYPOTHETICAL 28.2 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Escherichia coli strain K12 (259 aa), FASTA scores: opt: 292, E(): 3.1e-11, (30.25% identity in 218 aa overlap); N-terminus of Q9BJF3 PUTATIVE ABC TRANSPORTER (FRAGMENT) from Sterkiella histriomuscorum (1319 aa), FASTA scores: opt: 274, E(): 1.3e-09, (39.6% identity in 101 aa overlap); Q9C9W0|T23K23.21 PUTATIVE ABC TRANSPORTER from Arabidopsis thaliana (Mouse-ear cress) (263 aa), FASTA scores: opt: 258, E(): 4.4e-09, (30.1% identity in 196 aa overlap); P74369|YG47_SYNY3|SLR1647 HYPOTHETICAL 28.1 KDA PROTEIN (POTENTIAL INTEGRAL MEMBRANE PROTEIN) from Synechocystis sp. strain PCC 6803 (259 aa), FASTA scores: opt: 257, E(): 5.1e-09, (37.75% identity in 98 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217516.1" /db_xref="GI:15610137" /db_xref="GeneID:887271" /translation="MAVHGFLLERVSVVRDEATVLRQVSAHFPAGRCSAVRGASGSGK TTLLRLLNRLIDPTSGKVWLDGVPLTDLDVLVLRRRVGLVAQAPVVLTDAVLNEVRVG RPDLPEGRVTELLARLCLGQSAREAFLPHQRSALRTALIPAIDSTKVVGLISLPGAMS GLILAGVDPLTAIRYQIVVMYLLLAATAVAALTCARLAERALFDRAHRLVSLPAATRR A" misc_feature 3358723..3358746 /locus_tag="Rv3000" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3359585..3360586) /gene="ilvC" /locus_tag="Rv3001c" /db_xref="GeneID:887483" CDS complement(3359585..3360586) /gene="ilvC" /locus_tag="Rv3001c" /EC_number="1.1.1.86" /function="INVOLVED IN VALINE AND ISOLEUCINE BIOSYNTHESIS (AT THE SECOND STEP) [CATALYTIC ACTIVITY: (R)-2,3-DIHYDROXY-3-METHYLBUTANOATE + NADP(+) = (S)-2-HYDROXY-2-METHYL-3-OXOBUTANOATE + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of (R)-2,3-dihydroxy-3-methylbutanoate from (S)-2-hydroxy-2-methyl-3-oxobutanoate in valine and isoleucine biosynthesis" /codon_start=1 /transl_table=11 /product="ketol-acid reductoisomerase" /protein_id="NP_217517.1" /db_xref="GI:15610138" /db_xref="GeneID:887483" /translation="MFYDDDADLSIIQGRKVGVIGYGSQGHAHSLSLRDSGVQVRVGL KQGSRSRPKVEEQGLDVDTPAEVAKWADVVMVLAPDTAQAEIFAGDIEPNLKPGDALF FGHGLNVHFGLIKPPADVAVAMVAPKGPGHLVRRQFVDGKGVPCLVAVEQDPRGDGLA LALSYAKAIGGTRAGVIKTTFKDETETDLFGEQTVLCGGTEELVKAGFEVMVEAGYPA ELAYFEVLHELKLIVDLMYEGGLARMYYSVSDTAEFGGYLSGPRVIDAGTKERMRDIL REIQDGSFVHKLVADVEGGNKQLEELRRQNAEHPIEVVGKKLRDLMSWVDRPITETA" gene complement(3360624..3361130) /gene="ilvH" /locus_tag="Rv3002c" /db_xref="GeneID:887226" CDS complement(3360624..3361130) /gene="ilvH" /locus_tag="Rv3002c" /EC_number="2.2.1.6" /function="INVOLVED IN VALINE AND ISOLEUCINE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 2-ACETOLACTATE + CO(2) = 2-PYRUVATE]." /experiment="experimental evidence, no additional details recorded" /note="with IlvI catalyzes the formation of 2-acetolactate from pyruvate, the small subunit is required for full activity and valine sensitivity; E.coli produces 3 isoenzymes of acetolactate synthase which differ in specificity to substrates, valine sensitivity and affinity for cofactors; also known as acetolactate synthase 3 small subunit" /codon_start=1 /transl_table=11 /product="acetolactate synthase 3 regulatory subunit" /protein_id="NP_217518.1" /db_xref="GI:15610139" /db_xref="GeneID:887226" /translation="MSPKTHTLSVLVEDKPGVLARVAALFSRRGFNIESLAVGATECK DRSRMTIVVSAEDTPLEQITKQLNKLINVIKIVEQDDEHSVSRELALIKVQADAGSRS QVIEAVNLFRANVIDVSPESLTVEATGNRGKLEALLRVLEPFGIREIAQSGMVSLSRG PRGIGTAK" gene complement(3361130..3362986) /gene="ilvB1" /locus_tag="Rv3003c" /db_xref="GeneID:887286" CDS complement(3361130..3362986) /gene="ilvB1" /locus_tag="Rv3003c" /EC_number="2.2.1.6" /function="INVOLVED IN VALINE AND ISOLEUCINE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 2-ACETOLACTATE + CO(2) = 2 PYRUVATE]." /experiment="experimental evidence, no additional details recorded" /note="acetolactate synthase large subunit; catalyzes the formation of 2-acetolactate from pyruvate" /codon_start=1 /transl_table=11 /product="acetolactate synthase 1 catalytic subunit" /protein_id="YP_177917.1" /db_xref="GI:57117044" /db_xref="GeneID:887286" /translation="MSAPTKPHSPTFKPEPHSAANEPKHPAARPKHVALQQLTGAQAV IRSLEELGVDVIFGIPGGAVLPVYDPLFDSKKLRHVLVRHEQGAGHAASGYAHVTGRV GVCMATSGPGATNLVTPLADAQMDSIPVVAITGQVGRGLIGTDAFQEADISGITMPIT KHNFLVRSGDDIPRVLAEAFHIAASGRPGAVLVDIPKDVLQGQCTFSWPPRMELPGYK PNTKPHSRQVREAAKLIAAARKPVLYVGGGVIRGEATEQLRELAELTGIPVVTTLMAR GAFPDSHRQNLGMPGMHGTVAAVAALQRSDLLIALGTRFDDRVTGKLDSFAPEAKVIH ADIDPAEIGKNRHADVPIVGDVKAVITELIAMLRHHHIPGTIEMADWWAYLNGVRKTY PLSYGPQSDGSLSPEYVIEKLGEIAGPDAVFVAGVGQHQMWAAQFIRYEKPRSWLNSG GLGTMGFAIPAAMGAKIALPGTEVWAIDGDGCFQMTNQELATCAVEGIPVKVALINNG NLGMVRQWQSLFYAERYSQTDLATHSHRIPDFVKLAEALGCVGLRCEREEDVVDVINQ ARAINDCPVVIDFIVGADAQVWPMVAAGTSNDEIQAARGIRPLFDDITEGHA" misc_feature complement(3361541..3361600) /gene="ilvB1" /locus_tag="Rv3003c" /note="PS00187 Thiamine pyrophosphate enzymes signature" gene 3363348..3363686 /gene="cfp6" /locus_tag="Rv3004" /db_xref="GeneID:887891" CDS 3363348..3363686 /gene="cfp6" /locus_tag="Rv3004" /function="UNKNOWN FUNCTION (PUTATIVE SECRETED PROTEIN)." /experiment="experimental evidence, no additional details recorded" /note="Rv3004, (MT3084.1, MTV012.18), len: 112 aa. cfp6, low molecular weight protein antigen 6 (CFP-6) (cf note * below). Weak homology with Q9RKZ5|SC6D7.02 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (156 aa), FASTA scores: opt: 109, E(): 0.78, (39.4% identity in 122 aa overlap). CAUTION: THE INITIATOR METHIONINE MAY BE FURTHER UPSTREAM MAKING THE SEQUENCE A PRECURSOR. TBparse score is 0.873. [* Note: Bhaskar S., Mukherjee R.: Isolation, purification and immunological characterization of low molecular weight protein antigens from culture filtrate of Mycobacterium tuberculosis H37Rv. Unpublished. Submitted (NOV-1998) to the SWISS-PROT data bank]." /codon_start=1 /transl_table=11 /product="low molecular weight protein antigen 6 (CFP-6)" /protein_id="NP_217520.1" /db_xref="GI:15610141" /db_xref="GeneID:887891" /translation="MAHFAVGFLTLGLLVPVLTWPVSAPLLVIPVALSASIIRLRTLA DERGVTVRTLVGSRAVRWDDIDGLRFHRGSWARATLKDGTELRLPAVTFATLPHLTEA SSGRVPNPYR" gene complement(3363693..3364532) /locus_tag="Rv3005c" /db_xref="GeneID:887305" CDS complement(3363693..3364532) /locus_tag="Rv3005c" /function="UNKNOWN" /note="Rv3005c, (MTV012.19c), len: 279 aa. Conserved hypothetical protein, equivalent to O33110|MLCB637.18|ML1698 HYPOTHETICAL 29.5 KDA PROTEIN from Mycobacterium leprae (277 aa), FASTA scores: opt: 1245, E(): 1.2e-65, (70.5% identity in 278 aa overlap). Also similar, but longer 100 aa in N-terminus, to other hypothetical proteins, few membrane proteins, e.g. Q9RKN9|SCC75A.35 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (180 aa), FASTA scores: opt: 326, E(): 3.9e-12, (44.2% identity in 138 aa overlap); P96694|YDFP|AB001488 HYPOTHETICAL PROTEIN from Bacillus subtilis (129 aa), FASTA scores: opt:273, E(): 3.7e-09, (33.1% identity in 130 aa overlap); Q9KKT1|VCA1019 HYPOTHETICAL PROTEIN from Vibrio cholerae (148 aa), FASTA scores: opt: 258, E(): 3.1e-08, (34.9% identity in 126 aa overlap); etc. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217521.1" /db_xref="GI:15610142" /db_xref="GeneID:887305" /translation="MTSSNDSHWQRPDDSPGPMPGRPVSASLVDPEDDLTPARYAGDF GSGTTTVIPPYDAASSGVGNSGYSLIEAAEPLPYVQPQPGRQVPAGSAGIDMDDDERV RAAGRRGTQNLGLLILRVGLGAVLIAHGLQKLFGWWDGQGLAGFQNSLSDIGYQHAEI LAYVSAGGEIVAGVLLVLGLFTPLAAAGALAFLINGLLAGISAQHSRPVAYFLQDGHE YQITLVVMAVAVILSGPGRYGLDAARGWAHRPFIGSFVALLGGIAAGIAVWVLLNGAN PLA" gene 3364709..3365830 /gene="lppZ" /locus_tag="Rv3006" /db_xref="GeneID:887308" CDS 3364709..3365830 /gene="lppZ" /locus_tag="Rv3006" /function="UNKNOWN" /note="Rv3006, (MTV012.20), len: 373 aa. Probable lppZ, conserved lipoprotein, equivalent to O33109|MLCB637.17C|ML1699 putative lipoprotein from M. leprae (372 aa), FASTA scores: opt: 2211, E(): 4.3e-100, (87.1% identity in 373 aa overlap). Shows also similarity (in part) with Q9Z571|SC8D9.20c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (447 aa), FASTA scores: opt: 185, E(): 0.051, (31.6% identity in 300 aa overlap); Q9Z9R3|BH2090 GLUCOSE DEHYDROGENASE-B from Bacillus halodurans (371 aa), FASTA scores: opt: 206, E(): 0.0043, (28.3% identity in 205 aa overlap); and other GLUCOSE DEHYDROGENASES B. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site, followed by a proline-rich domain. TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="lipoprotein LppZ" /protein_id="NP_217522.1" /db_xref="GI:15610143" /db_xref="GeneID:887308" /translation="MWTTRLVRSGLAALCAAVLVSSGCARFNDAQSQPFTTEPELRPQ PSSTPPPPPPLPPVPFPKECPAPGVMQGCLESTSGLIMGIDSKTALVAERITGAVEEI SISAEPKVKTVIPVDPAGDGGLMDIVLSPTYSQDRLMYAYISTPTDNRVVRVADGDIP KDILTGIPKGAAGNTGALIFTSPTTLVVMTGDAGDPALAADPQSLAGKVLRIEQPTTI GQTPPTTALSGIGSGGGLCIDPVDGSLYVADRTPTADRLQRITKNSEVSTVWTWPDKP GVAGCAAMDGTVLVNLINTKLTVAVRLAPSTGAVTGEPDVVRKDTHAHAWALRMSPDG NVWGATVNKTAGDAEKLDDVVFPLFPQGGGFPRNNDDKT" misc_feature 3364748..3364780 /gene="lppZ" /locus_tag="Rv3006" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3365836..3366450) /locus_tag="Rv3007c" /db_xref="GeneID:888938" CDS complement(3365836..3366450) /locus_tag="Rv3007c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3007c, (MTV012.21c), len: 204 aa. Possible oxidoreductase (EC 1.-.-.-), similar to Q9EWU5|3SC5B7.04c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (162 aa), FASTA scores: opt: 376, E(): 1.5e-18, (41.35% identity in 150 aa overlap); Q9K416|SCG22.29c PUTATIVE FLAVIN-DEPENDENT REDUCTASE PROTEIN from Streptomyces coelicolor (169 aa), FASTA scores: opt: 246, E(): 1e-09, (34.1% identity in 135 aa overlap); and some similarity to coupling proteins of 4-hydroxyphenylacetic hydroxylase/monooxygenase e.g. Q9HWT6|HPAC|PA4092 Pseudomonas aeruginosa (170 aa), FASTA score: opt: 214; O68232|HPAC Photorhabdus luminescens (Xenorhabdus luminescens) (172 aa), FASTA score: opt: 198; Q9RPU2|HPAC Salmonella dublin (170 aa), FASTA score: opt: 197; etc. Equivalent to AAK47416 from Mycobacterium tuberculosis strain CDC1551 (236 aa) but shorter 32 aa. Start chosen by similarity. TBparse score is 0.930." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217523.1" /db_xref="GI:15610144" /db_xref="GeneID:888938" /translation="MSEDVARIHDGDVIDESFDELMGMLDHPVFVVTTQADGHPAGCL VSFATQTSVQPPSFMVGLPRSTGTSEVASRSEHLAVHVLSQRQHVLAELFGSQTEEEV NKFARCSWRAGPCGMPILDDAAAWFIGRTASRSDVGDYVAYLLEPVSVWAPECSEDLL YLSDLDFDVDDIDPGKEASPRFYERERGDETRRYGVVRFTLDVP" gene 3366644..3367267 /locus_tag="Rv3008" /db_xref="GeneID:888530" CDS 3366644..3367267 /locus_tag="Rv3008" /function="UNKNOWN" /note="Rv3008, (MTV012.22), len: 207 aa (start uncertain). Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217524.1" /db_xref="GI:15610145" /db_xref="GeneID:888530" /translation="MLTVVAVIGILECGLVLHMPDNDLWYCGPWTLWVMAGRGVASGA GVWRGDRVATPLAVAITAAGLVSGARIGPGAAAKRDPQLAQWNEIRSHYQEIAEWIDH DTATAHPAVAATQISAAGSFGRANMVDYLGLLDSRADETVRRDEFSRWLSAKPDYLVT TEQSVDAATIALPEFRHAYDRAATIGTLNVYRRNSPDGDEPLPADGN" gene complement(3367264..3368793) /gene="gatB" /locus_tag="Rv3009c" /db_xref="GeneID:888919" CDS complement(3367264..3368793) /gene="gatB" /locus_tag="Rv3009c" /EC_number="6.3.5.-" /function="COMPONENT OF THE TRANSLATIONAL APPARATUS. FURNISHES A MEANS FOR FORMATION OF CORRECTLY CHARGED GLN-TRNA(GLN) THROUGH THE TRANSAMIDATION OF MISACYLATED GLU- TRNA(GLN) IN ORGANISMS WHICH LACK GLUTAMINYL-TRNA SYNTHETASE. THE REACTION TAKES PLACE IN THE PRESENCE OF GLUTAMINE AND ATP THROUGH AN ACTIVATED GAMMA-PHOSPHO-GLU-TRNA(GLN) [CATALYTIC ACTIVITY: ATP + L-GLUTAMYL-TRNA(GLN) + L-GLUTAMINE = ADP + PHOSPHATE + L-GLUTAMINYL-TRNA(GLN) + L-GLUTAMATE]." /note="allows the formation of correctly charged Asn-tRNA(Asn) or Gln-tRNA(Gln) through the transamidation of misacylated Asp-tRNA(Asn) or Glu-tRNA(Gln) in organisms which lack either or both of asparaginyl-tRNA or glutaminyl-tRNA synthetases; reaction takes place in the presence of glutamine and ATP through an activated phospho-Asp-tRNA(Asn) or phospho-Glu-tRNA" /codon_start=1 /transl_table=11 /product="aspartyl/glutamyl-tRNA amidotransferase subunit B" /protein_id="NP_217525.1" /db_xref="GI:15610146" /db_xref="GeneID:888919" /translation="MTVAAGAAKAAGAELLDYDEVVARFQPVLGLEVHVELSTATKMF CGCTTTFGGEPNTQVCPVCLGLPGSLPVLNRAAVESAIRIGLALNCEIVPWCRFARKN YFYPDMPKNYQISQYDEPIAINGYLDAPLEDGTTWRVEIERAHMEEDTGKLTHIGSET GRIHGATGSLIDYNRAGVPLIEIVTKPIVGAGARAPQIARSYVTALRDLLRALDVSDV RMDQGSMRCDANVSLKPAGTTEFGTRTETKNVNSLKSVEVAVRYEMQRQGAILASGGR ITQETRHFHEAGYTSAGRTKETAEDYRYFPEPDLEPVAPSRELVERLRQTIPELPWLS RRRIQQEWGVSDEVMRDLVNAGAVELVAATVEHGASSEAARAWWGNFLAQKANEAGIG LDELAITPAQVAAVVALVDEGKLSNSLARQVVEGVLAGEGEPEQVMTARGLALVRDDS LTQAAVDEALAANPDVADKIRGGKVAAAGAIVGAVMKATRGQADAARVRELVLEACGQ G" misc_feature complement(3368494..3368622) /gene="gatB" /locus_tag="Rv3009c" /note="PS00041 Bacterial regulatory proteins, araC family signature" gene complement(3368823..3369854) /gene="pfkA" /locus_tag="Rv3010c" /db_xref="GeneID:888509" CDS complement(3368823..3369854) /gene="pfkA" /locus_tag="Rv3010c" /EC_number="2.7.1.11" /function="INVOLVED IN GLYCOLYSIS; CONVERTS SUGAR-1-P TO SUGAR-1,6-P [CATALYTIC ACTIVITY: ATP + D-FRUCTOSE 6-PHOSPHATE = ADP + D-FRUCTOSE 1,6-BISPHOSPHATE]." /note="catalyzes the formation of D-fructose 1,6-bisphosphate from D-fructose 6-phosphate in glycolysis" /codon_start=1 /transl_table=11 /product="6-phosphofructokinase" /protein_id="NP_217526.1" /db_xref="GI:15610147" /db_xref="GeneID:888509" /translation="MRIGVLTGGGDCPGLNAVIRAVVRTCHARYGSSVVGFQNGFRGL LENRRVQLHNDDRNDRLLAKGGTMLGTARVHPDKLRAGLPQIMQTLDDNGIDVLIPIG GEGTLTAASWLSEENVPVVGVPKTIDNDIDCTDVTFGHDTALTVATEAIDRLHSTAES HERVMLVEVMGRHAGWIALNAGLASGAHMTLIPEQPFDIEEVCRLVKGRFQRGDSHFI CVVAEGAKPAPGTIMLREGGLDEFGHERFTGVAAQLAVEVEKRINKDVRVTVLGHIQR GGTPTAYDRVLATRFGVNAADAAHAGEYGQMVTLRGQDIGRVPLADAVRKLKLVPQSR YDDAAAFFG" misc_feature complement(3369000..3369056) /gene="pfkA" /locus_tag="Rv3010c" /note="PS00433 Phosphofructokinase signature" gene complement(3369950..3371434) /gene="gatA" /locus_tag="Rv3011c" /db_xref="GeneID:887262" CDS complement(3369950..3371434) /gene="gatA" /locus_tag="Rv3011c" /EC_number="6.3.5.-" /function="COMPONENT OF THE TRANSLATIONAL APPARATUS. FURNISHES A MEANS FOR FORMATION OF CORRECTLY CHARGED GLN-TRNA(GLN) THROUGH THE TRANSAMIDATION OF MISACYLATED GLU-TRNA(GLN) IN ORGANISMS WHICH LACK GLUTAMINYL-TRNA SYNTHETASE. THE REACTION TAKES PLACE IN THE PRESENCE OF GLUTAMINE AND ATP THROUGH AN ACTIVATED GAMMA-PHOSPHO-GLU-TRNA(GLN) [CATALYTIC ACTIVITY: ATP + L-GLUTAMYL-TRNA(GLN) + L-GLUTAMINE = ADP + PHOSPHATE + L-GLUTAMINYL-TRNA(GLN) + L-GLUTAMATE]." /note="allows the formation of correctly charged Asn-tRNA(Asn) or Gln-tRNA(Gln) through the transamidation of misacylated Asp-tRNA(Asn) or Glu-tRNA(Gln) in organisms which lack either or both of asparaginyl-tRNA or glutaminyl-tRNA synthetases; reaction takes place in the presence of glutamine and ATP through an activated phospho-Asp-tRNA(Asn) or phospho-Glu-tRNA" /codon_start=1 /transl_table=11 /product="aspartyl/glutamyl-tRNA amidotransferase subunit A" /protein_id="NP_217527.1" /db_xref="GI:15610148" /db_xref="GeneID:887262" /translation="MTDIIRSDAATLAAKIAIKEVSSAEITRACLDQIEATDETYHAF LHVAADEALAAAAAIDKQVAAGEPLPSALAGVPLALKDVFTTSDMPTTCGSKILEGWR SPYDATLTARLRAAGIPILGKTNMDEFAMGSSTENSAYGPTRNPWNLDRVPGGSGGGS AAALAAFQAPLAIGSDTGGSIRQPAALTATVGVKPTYGTVSRYGLVACASSLDQGGPC ARTVLDTALLHQVIAGHDPRDSTSVDAEVPDVVGAARAGAVGDLRGVRVGVVRQLHGG EGYQPGVLASFEAAVEQLTALGAEVSEVDCPHFDHALAAYYLILPSEVSSNLARFDAM RYGLRVGDDGTRSAEEVMAMTRAAGFGPEVKRRIMIGTYALSAGYYDAYYNQAQKVRT LIARDLDAAYRSVDVLVSPTTPTTAFRMGEKVDDPLAMYLFDLCTLPLNLAGHCGMSV PSGLSPDDGLPVGLQIMAPALADDRLYRVGAAYEAARGPLLSAI" misc_feature complement(3371063..3371086) /gene="gatA" /locus_tag="Rv3011c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3371431..3371730) /gene="gatC" /locus_tag="Rv3012c" /db_xref="GeneID:887335" CDS complement(3371431..3371730) /gene="gatC" /locus_tag="Rv3012c" /EC_number="6.3.5.-" /function="COMPONENT OF THE TRANSLATIONAL APPARATUS. FURNISHES A MEANS FOR FORMATION OF CORRECTLY CHARGED GLN-TRNA(GLN) THROUGH THE TRANSAMIDATION OF MISACYLATED GLU- TRNA(GLN) IN ORGANISMS WHICH LACK GLUTAMINYL-TRNA SYNTHETASE. THE REACTION TAKES PLACE IN THE PRESENCE OF GLUTAMINE AND ATP THROUGH AN ACTIVATED GAMMA-PHOSPHO-GLU-TRNA(GLN) [CATALYTIC ACTIVITY: ATP + L-GLUTAMYL-TRNA(GLN) + L-GLUTAMINE = ADP + PHOSPHATE + L-GLUTAMINYL-TRNA(GLN) + L-GLUTAMATE]." /note="allows the formation of correctly charged Asn-tRNA(Asn) or Gln-tRNA(Gln) through the transamidation of misacylated Asp-tRNA(Asn) or Glu-tRNA(Gln) in organisms which lack either or both of asparaginyl-tRNA or glutaminyl-tRNA synthetases; reaction takes place in the presence of glutamine and ATP through an activated phospho-Asp-tRNA(Asn) or phospho-Glu-tRNA; some Mycoplasma proteins contain an N-terminal fusion to an unknown domain" /codon_start=1 /transl_table=11 /product="aspartyl/glutamyl-tRNA amidotransferase subunit C" /protein_id="NP_217528.1" /db_xref="GI:15610149" /db_xref="GeneID:887335" /translation="MSQISRDEVAHLARLARLALTETELDSFAGQLDAILTHVSQIQA VDVTGVQATDNPLKDVNVTRPDETVPCLTQRQVLDQAPDAVDGRFAVPQILGDEQ" gene 3371815..3372471 /locus_tag="Rv3013" /db_xref="GeneID:888642" CDS 3371815..3372471 /locus_tag="Rv3013" /function="UNKNOWN" /note="Rv3013, (MTV012.27), len: 218 aa. Conserved hypothetical protein, equivalent to O33103|MLCB637_11c HYPOTHETICAL 24.4 KDA PROTEIN from Mycobacterium leprae (230 aa), FASTA scores: opt: 1188, E(): 2.6e-67, (83.95% identity in 218 aa overlap). Equivalent to AAK47422 from Mycobacterium tuberculosis strain CDC1551 (240 aa) but shorter 22 aa. TBparse score is 0.879." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217529.1" /db_xref="GI:15610150" /db_xref="GeneID:888642" /translation="MRSYLLRIELADRPGSLGSLAVALGSVGADILSLDVVERGNGYA IDDLVVELPPGAMPDTLITAAEALNGVRVDSVRPHTGLLEAHRELELLDHVAAAEGAT ARLQVLVNEAPRVLRVSWCTVLRSSGGELHRLAGSPGAPETRANSAPWLPIERAAALD GGADWVPQAWRDMDTTMVAAPLGDTHTAVVLGRPGPEFRPSEVARLGYLAGIVATMLR" gene complement(3372545..3374620) /gene="ligA" /locus_tag="Rv3014c" /db_xref="GeneID:887354" CDS complement(3372545..3374620) /gene="ligA" /locus_tag="Rv3014c" /EC_number="6.5.1.2" /function="THIS PROTEIN CATALYZES THE FORMATION OF PHOSPHODIESTER LINKAGES BETWEEN 5'-PHOSPHORYL AND 3'-HYDROXYL GROUPS IN DOUBLE-STRANDED DNA USING NAD AS A COENZYME AND AS THE ENERGY SOURCE FOR THE REACTION. IT IS ESSENTIAL FOR DNA REPLICATION AND REPAIR OF DAMAGED DNA [CATALYTIC ACTIVITY: NAD(+) + (DEOXYRIBONUCLEOTIDE)(N) + (DEOXYRIBONUCLEOTIDE)(M) = AMP + NICOTINAMIDE NUCLEOTIDE + (DEOXYRIBONUCLEOTIDE)(N+M)]." /note="this protein catalyzes the formation of phosphodiester linkages between 5'-phosphoryl and 3'-hydroxyl groups in double-stranded DNA using NAD as a coenzyme and as the energy source for the reaction; essential for DNA replication and repair of damaged DNA; similar to ligase LigB" /codon_start=1 /transl_table=11 /product="NAD-dependent DNA ligase LigA" /protein_id="NP_217530.1" /db_xref="GI:15610151" /db_xref="GeneID:887354" /translation="MSSPDADQTAPEVLRQWQALAEEVREHQFRYYVRDAPIISDAEF DELLRRLEALEEQHPELRTPDSPTQLVGGAGFATDFEPVDHLERMLSLDNAFTADELA AWAGRIHAEVGDAAHYLCELKIDGVALSLVYREGRLTRASTRGDGRTGEDVTLNARTI ADVPERLTPGDDYPVPEVLEVRGEVFFRLDDFQALNASLVEEGKAPFANPRNSAAGSL RQKDPAVTARRRLRMICHGLGHVEGFRPATLHQAYLALRAWGLPVSEHTTLATDLAGV RERIDYWGEHRHEVDHEIDGVVVKVDEVALQRRLGSTSRAPRWAIAYKYPPEEAQTKL LDIRVNVGRTGRITPFAFMTPVKVAGSTVGQATLHNASEIKRKGVLIGDTVVIRKAGD VIPEVLGPVVELRDGSEREFIMPTTCPECGSPLAPEKEGDADIRCPNARGCPGQLRER VFHVASRNGLDIEVLGYEAGVALLQAKVIADEGELFALTERDLLRTDLFRTKAGELSA NGKRLLVNLDKAKAAPLWRVLVALSIRHVGPTAARALATEFGSLDAIAAASTDQLAAV EGVGPTIAAAVTEWFAVDWHREIVDKWRAAGVRMVDERDESVPRTLAGLTIVVTGSLT GFSRDDAKEAIVARGGKAAGSVSKKTNYVVAGDSPGSKYDKAVELGVPILDEDGFRRL LADGPASRT" gene complement(3374651..3375664) /locus_tag="Rv3015c" /db_xref="GeneID:887181" CDS complement(3374651..3375664) /locus_tag="Rv3015c" /function="UNKNOWN" /note="Rv3015c, (MTV012.29c), len: 337 aa. Conserved hypothetical protein, equivalent to Q9CBR6|ML1706 HYPOTHETICAL PROTEIN from Mycobacterium leprae (337 aa), FASTA scores: opt: 1703, E(): 3.1e-92, (78.05% identity in 337 aa overlap); and (but longer 47 aa) O33101|MLCB637.09 HYPOTHETICAL 30.0 KDA PROTEIN from Mycobacterium leprae (290 aa), FASTA scores: opt: 1564, E(): 2.4e-78, (78.6% identity in 290 aa overlap). Also similar to Q9Z586|SC8D9.05 HYPOTHETICAL 35.0 KDA PROTEIN from Streptomyces coelicolor (331 aa), FASTA scores: opt: 774, E(): 4.7e-38, (43.4% identity in 334 aa overlap); and showing similarity with other proteins e.g. Q39586|METE_CHLRE 5-METHYLTETRAHYDROPTEROYLTRIGLUTAMATE--HOMOCYSTEINE METHYLTRANSFERASE from Chlamydomonas reinhardtii (814 aa), FASTA scores: opt: 162, E(): 0.048, (27.05% identity in 355 aa overlap). TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217531.1" /db_xref="GI:15610152" /db_xref="GeneID:887181" /translation="MSVFATATGIGSWPGTAAREAAQVVVGELAGALAYLTELPARGV GADMLGRAGGLLVDVAIDTVPRGYRIAARPGAVTRRAASLLDEDMDALEEAWETAGLR GCGRAVKVQAPGPVTLVAGLELANGHRAITDPGAVRDLAASLAEGVAAHRAALARRLD TPVVVQFDEPSLPAALGGRLTGVTALSPVAPLDETVAEALLDTCIAAVDADVALHSCS PDLPWDLLQRSRISAVSVDASTLQAADLDAVAAFVESGRTVVLGLVPVTAPERAPSME EVAAAAVAVTDRLGVPRSALRDRLGVSPACGLANATGQWARTAVGLARDVAEAFARDP EAI" gene 3375758..3376387 /gene="lpqA" /locus_tag="Rv3016" /db_xref="GeneID:888557" CDS 3375758..3376387 /gene="lpqA" /locus_tag="Rv3016" /function="UNKNOWN" /note="Rv3016, (MTV012.30), len: 209 aa. Probable lpqA, lipoprotein. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="lipoprotein LpqA" /protein_id="NP_217532.1" /db_xref="GI:15610153" /db_xref="GeneID:888557" /translation="MVGLTRPLLLCGATLLIAACTRVVGGTASATFGGDRQGMLDVAT ILLDQSRMQAITGSGDDLTIIPTMDTTYPVDVDDFAQPIPRECRFIYAETAVFGSEIE AFHKTTFQDRPDGSLISEAAAAYRDAGTARRAFDTLAVTVHDCAASPAGWLFVSRWTA GGNSLHIRAGDCGRDYRVLSAALLEVTFCGFPESVSDIVMTNIAANVPG" misc_feature 3375785..3375817 /gene="lpqA" /locus_tag="Rv3016" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3376490..3376852) /gene="esxQ" /locus_tag="Rv3017c" /db_xref="GeneID:888508" CDS complement(3376490..3376852) /gene="esxQ" /locus_tag="Rv3017c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3017c, (MT3097, MTV012.31c), len: 120 aa. esxQ, ESAT-6 like protein (see citation below), possibly secreted protein, very similar to AAK47433|MT3104 PUTATIVE SECRETED ESAT-6 LIKE PROTEIN 9 from Mycobacterium tuberculosis strain CDC1551 (96 aa), FASTA scores: opt: 315, E(): 1.2e-14, (65.7% identity in 70 aa overlap); Rv3019c|O53266|MTV012.33c PUTATIVE SECRETED ESAT-6 LIKE PROTEIN 9 from Mycobacterium tuberculosis (96 aa), FASTA scores: opt: 315, E(): 1.2e-14, (65.7% identity in 70 aa overlap) and Rv0288|O53693|CFP7|MT0301|MTV035.16 10 KDA ANTIGEN CFP7 (LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 7) (CFP-7) from Mycobacterium tuberculosis (95 aa), FASTA scores: opt: 303, E(): 7.4e-14, (66.2% identity in 68 aa overlap). BELONGS TO THE ESAT6 FAMILY. Note previously known as TB12.9.; TB12.9, ES6_8" /codon_start=1 /transl_table=11 /product="Esat-6 like protein esxQ (TB12.9) (Esat-6 like protein 8)" /protein_id="NP_217533.1" /db_xref="GI:15610154" /db_xref="GeneID:888508" /translation="MSQSMYSYPAMTANVGDMAGYTGTTQSLGADIASERTAPSRACQ GDLGMSHQDWQAQWNQAMEALARAYRRCRRALRQIGVLERPVGDSSDCGTIRVGSFRG RWLDPRHAGPATAADAGD" gene complement(3376939..3378243) /gene="PPE46" /locus_tag="Rv3018c" /db_xref="GeneID:888940" CDS complement(3376939..3378243) /gene="PPE46" /locus_tag="Rv3018c" /function="UNKNOWN" /note="Rv3018c, (MTV012.32c), len: 434 aa. Member of PPE family but lacks Gly, Ala rich repeats at C-terminal domain, closest to MTCY261.19. See citation below. Also very similar to following ORF MTV012.35c. Nearly identical in parts to Mycobacterium tuberculosis protein erroneously described as DIHYDROFOLATE REDUCTASE (X59271|MTFOLA_1) P31500|DYR_MYCTU (214 aa), FASTA scores: opt: 972, E(): 4.4e-42, (80.0% identity in 195 aa overlap); and Z97559|MTCY261_19 from Mycobacterium tuberculosis cosmid (473 aa), FASTA scores: opt: 806, E(): 0; (38.8% identity in 479 aa overlap); and O53268|MTV012.35c from Mycobacterium tuberculosis (358 aa), FASTA scores: opt: 1714, E(): 3.3e-79, (78.3% identity in 355 aa overlap). TBparse score is 0.945." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177918.1" /db_xref="GI:57117045" /db_xref="GeneID:888940" /translation="MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAV AQELSVVVAAVGAGVWQGPSAELFVAAYVPYVAWLVQASADSAAAAGEHEAAAAGYVC ALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQAATVMSAYEAV VGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPWHEIVQFLEETFAAYDQYLS ALLSELPAVAWVWFQLFVDILGFNIIGFIITLASNAQLLTEFAINASYVAVGLLYAIA GVIDIVVEWVIGNLFGVVPLLGGPLLGALAAAVVPGVAGLAGVAGLAALPAVGAAAGA PAALVGSVAPVSGGVVSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESV GQPAGLTVLADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV" misc_feature complement(3378329..3378415) /note="similar to PE family protein; Rv3018A, len: 28 aa. Member of Mycobacterium tuberculosis PE family (see citation below), most similar to Rv0285 (102 aa), FASTA scores: opt: 147, E(): 3.5e-05, (92.85% identity in 28 aa overlap); etc." gene complement(3378711..3379001) /gene="esxR" /locus_tag="Rv3019c" /db_xref="GeneID:888926" CDS complement(3378711..3379001) /gene="esxR" /locus_tag="Rv3019c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3019c, (MT3104, MTV012.33c), len: 96 aa. esxR, secreted ESAT-6 like protein (see citations below), most similar to O53693|AAK44525|Rv0288|CFP7|MT0301|MTV035.16 10 KDA ANTIGEN CFP7 (LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 7) (CFP-7) from Mycobacterium tuberculosis (95 aa), FASTA scores: opt: 566, E(): 5.1e-31, (84.3% identity in 95 aa overlap). Also similar to Q9CD33|ML2531 POSSIBLE CELL SURFACE PROTEIN from Mycobacterium leprae (96 aa), FASTA scores: opt: 472, E(): 8.3e-25, (66.6% identity in 96 aa overlap); O53264|Rv3017c|MTV012.31c PUTATIVE SECRETED ANTIGEN from Mycobacterium tuberculosis (120 aa), FASTA scores: opt: 321, E(): 9.6e-15, (67.15% identity in 70 aa overlap); Q57165|AAK48357|O84901|X79562|ESAT6|Rv3875|MT3989|MTV027. 10 esat6 gene from Mycobacterium tuberculosis strain Erdman (94 aa), FASTA scores: opt: 131, E(): 0.028, (26.1% identity in 88 aa overlap). BELONGS TO THE ESAT6 FAMILY. TBparse score is 0.906. Note previously known as TB10.3.; ES6_9, TB10.3" /codon_start=1 /transl_table=11 /product="secreted ESAT-6 like protein ESXR (TB10.3) (ESAT-6 like protein 9)" /protein_id="NP_217535.1" /db_xref="GI:15610156" /db_xref="GeneID:888926" /translation="MSQIMYNYPAMMAHAGDMAGYAGTLQSLGADIASEQAVLSSAWQ GDTGITYQGWQTQWNQALEDLVRAYQSMSGTHESNTMAMLARDGAEAAKWGG" gene complement(3379036..3379329) /gene="esxS" /locus_tag="Rv3020c" /db_xref="GeneID:888946" CDS complement(3379036..3379329) /gene="esxS" /locus_tag="Rv3020c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3020c, (MTV012.34c), len: 97 aa. esxS, ESAT-6 like protein. PE-family related protein; distant member of the Mycobacterium tuberculosis PE family, similar to AAK44524|MT0300 PE FAMILY PROTEIN from M. tuberculosis strain CDC1551 (97 aa), FASTA scores: opt: 564, E(): 5.9e-30, (91.75% identity in 97 aa overlap). Has potential helix-turn-helix motif at positions 14-35. TBparse score is 0.912. SEEMS TO BELONG TO THE ESAT6 FAMILY (see Betts et al., 2002). Note that previously known as PE28.; PE28" /codon_start=1 /transl_table=11 /product="Esat-6 like protein esxS" /protein_id="YP_177919.1" /db_xref="GI:57117047" /db_xref="GeneID:888946" /translation="MSLLDAHIPQLIASHTAFAAKAGLMRHTIGQAEQQAMSAQAFHQ GESAAAFQGAHARFVAAAAKVNTLLDIAQANLGEAAGTYVAADAAAASSYTGF" gene complement(3379376..3380452) /gene="PPE47" /locus_tag="Rv3021c" /db_xref="GeneID:888924" CDS complement(3379376..3380452) /gene="PPE47" /locus_tag="Rv3021c" /function="UNKNOWN" /note="Rv3021c, (MTV012.35c), len: 358 aa. Member of Mycobacterium tuberculosis PPE family. Should be continuation of upstream ORF MTV012.36c but is frameshifted due to missing base at 36448 in v012. Sequence has been checked but no error apparent. Very similar to neighbouring ORF O53265|MTV012.32c|Rv3018c from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 1714, E(): 6.6e-770, (78.3% identity in 355 aa overlap) and AAK47430|MT3101 (strongly in the N-terminal part) (310 aa), FASTA scores: opt: 897, E(): 4.5e-37, (66.95% identity in 227 aa overlap). TBparse score is 0.943." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177920.1" /db_xref="GI:57117048" /db_xref="GeneID:888924" /translation="MVGAASADSAAAAGEHEAAAAGYVCALAEMPTLPELAANHLTHA VLVATNFFGINTIPIALNEADYVRMWVQAATVMSAYEAVVGAALVATPHTGPAPVIVK PGANEASNAVAAATITPFPFGELAKFLEMAAQAFTEVGELIMKSAEAWAVGFVELITG LVNFEPWLVLTGMIDMFFATVGFALGVFVLVPLLEFAVVLELAILSIGWIISNIFGAI PVLGGPLLGALAAAVVPGVAGLAGVAGLAALPAVGAAAGAPAALVGSVAPVSGGVVSP QARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESVGQPAGLTVLADEFGDGAP VPMLPGSWGPDLVGVAGDGGLVSV" gene complement(3380440..3380682) /gene="PPE48" /locus_tag="Rv3022c" /db_xref="GeneID:888512" CDS complement(<3380440..>3380682) /gene="PPE48" /locus_tag="Rv3022c" /function="UNKNOWN" /note="Rv3022c, (MTV012.36c), len: 81 aa. Member of M. tuberculosis PPE family with frameshift due to missing bp in codon 82. The ORF continues in downstream MTV012.35c. The sequence has been checked and no errors were detected. Identical to neigbouring ORF O53265|Rv3018c|MTV012.32c (434 aa), FASTA scores: opt: 526, E(): 6.2e-26, (100.0% identity in 81 aa overlap); and O69706|Rv739c|MTV025.087c (77 aa), FASTA scores: opt: 392, E(): 3.4e-18, (72.7% identity in 77 aa overlap). TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177684.1" /db_xref="GI:57117049" /db_xref="GeneID:888512" /translation="VTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAV AQELSVVVAAVGAGVWQGPSAELFVAAYVPYVAWLVQ" gene complement(3380679..3380993) /gene="PE29" /locus_tag="Rv3022A" /db_xref="GeneID:3205088" CDS complement(3380679..3380993) /gene="PE29" /locus_tag="Rv3022A" /function="UNKNOWN" /note="Rv3022A, len: 104 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), similar to many others e.g. Rv0285|AL021930_12 from Mycobacterium tuberculosis (102 aa), FASTA scores: opt: 497, E(): 3e-21, (80.39% identity in 102 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177685.1" /db_xref="GI:57117050" /db_xref="GeneID:3205088" /translation="MTLRVVPEGLAAASAAVEALTARLAAAHAGAAPAITAVVAPAAD PVSLQSAVGFSALGSEHAAIAGEGVEELGRSGVAVGESGIGYAAGDAVAAATYLVSGG SL" repeat_region 3381351..3382674 /note="IS1081-5, len: 1324 bp. Insertion sequence IS1081." /mobile_element="insertion sequence:IS1081-5" repeat_region 3381351..3381365 /note="15 bp Inverted repeat at left end of IS1081:TCGCGTGATCCTTCG" gene complement(3381375..3382622) /locus_tag="Rv3023c" /db_xref="GeneID:888525" CDS complement(3381375..3382622) /locus_tag="Rv3023c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1081." /experiment="experimental evidence, no additional details recorded" /note="Rv3023c, (MTV012.38c), len: 415 aa. Probable IS1081 transposase. Contains PS01007 Transposases, Mutator family, signature. Similars to P35882|TRA1_MYCTU|Rv1199c|MTCI364.11c and Rv2512c|MTCY07A7.18c TRANSPOSASES FOR INSERTION SEQUENCE ELEMENT IS1081 (415 aa), FASTA scores: opt: 2675, E(): 1.8e-162, (100.0% identity in 415 aa overlap). TBparse score is 0.894. BELONGS TO THE MUTATOR FAMILY OF TRANSPOSASE." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217539.1" /db_xref="GI:15610160" /db_xref="GeneID:888525" /translation="MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" misc_feature complement(3381852..3381926) /locus_tag="Rv3023c" /note="PS01007 Transposases, Mutator family, signature" repeat_region complement(3382660..3382674) /note="15 bp Inverted repeat at the right end of IS1081:TCGCGTGATCCTTCG" gene complement(3382785..3383888) /gene="mnmA" /locus_tag="Rv3024c" /db_xref="GeneID:888524" CDS complement(3382785..3383888) /gene="mnmA" /locus_tag="Rv3024c" /EC_number="2.1.1.61" /function="INVOLVED IN TRANSLATION MECHANISMS [CATALYTIC ACTIVITY: S-ADENOSYL-L-METHIONINE + TRNA = S-ADENOSYL-L-HOMOCYSTEINE + TRNA CONTAINING 5-METHYLAMINOMETHYL-2-THIOURIDYLATE]." /note="catalyzes a sulfuration reaction to synthesize 2-thiouridine at the U34 position of tRNAs" /codon_start=1 /transl_table=11 /product="tRNA-specific 2-thiouridylase MnmA" /protein_id="NP_217540.1" /db_xref="GI:15610161" /db_xref="GeneID:888524" /translation="MKVLAAMSGGVDSSVAAARMVDAGHEVVGVHMALSTAPGTLRTG SRGCCSKEDAADARRVADVLGIPFYVWDFAEKFKEDVINDFVSSYARGETPNPCVRCN QQIKFAALSARAVALGFDTVATGHYARLSGGRLRRAVDRDKDQSYVLAVLTAQQLRHA AFPIGDTPKRQIRAEAARRGLAVANKPDSHDICFIPSGNTKAFLGERIGVRRGVVVDA DGVVLASHDGVHGFTIGQRRGLGIAGPGPNGRPRYVTAIDADTATVHVGDVTDLDVQT LTGRAPVFTAGAAPSGPVDCVVQVRAHGETVSAVAELIGDALFVQLHAPLRGVARGQT LVLYRPDPAGDEVLGSATIAGASGLSTGGNPGA" gene complement(3383885..3385066) /gene="iscS" /locus_tag="Rv3025c" /db_xref="GeneID:887677" CDS complement(3383885..3385066) /gene="iscS" /locus_tag="Rv3025c" /EC_number="4.4.1.-" /function="CATALYZES THE REMOVAL OF ELEMENTAL SULFUR FROM CYSTEINE TO PRODUCE ALANINE." /note="Rv3025c, (MTV012.40c), len: 393 aa. Probable iscS (alternate gene name: nifS), cysteine desulfurase (NifS-like protein) (EC 4.4.1.-), equivalent to MLCB637.06|O33098 NIFS-LIKE PROTEIN from Mycobacterium leprae (396 aa), FASTA scores: opt: 2186, E(): 2.7e-122, (84.9% identity in 391 aa overlap). Also highly similar to many e.g. O86581|SC2A11.20 PUTATIVE PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASE from Streptomyces coelicolor (389 aa), FASTA scores: opt: 1568, E(): 1.1e-85, (61.7% identity in 389 aa overlap); P57795|ISCS|NIFS CYSTEINE DESULFURASE (NIFS PROTEIN HOMOLOG) from Methanosarcina thermophila (404 aa), FASTA scores: opt: 1059, E(): 1.6e-55, (46.2% identity in 381 aa overlap); O54055|ISCS_RUMFL|ISCS|NIFS CYSTEINE DESULFURASE from Ruminococcus flavefaciens (396 aa), FASTA scores: opt: 973, E(): 2e-50, (43.3% identity in 381 aa overlap); P57794|NIFS_ACEDI CYSTEINE DESULFURASE from Acetobacter diazotrophicus (400 aa), FASTA scores: opt: 958, E(): 1.6e-49, (41.1% identity in 392 aa overlap); etc. Also similar to Rv1464|MTV007.11 from Mycobacterium tuberculosis. Contains PS00595 Aminotransferases class-V pyridoxal-phosphate attachment site. BELONGS TO CLASS-V OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES, NIFS/ISCS SUBFAMILY. COFACTOR: PYRIDOXAL PHOSPHATE (BY SIMILARITY).; nifS" /codon_start=1 /transl_table=11 /product="cysteine desulfurase IscS" /protein_id="NP_217541.1" /db_xref="GI:15610162" /db_xref="GeneID:887677" /translation="MAYLDHAATTPMHPAAIEAMAAVQRTIGNASSLHTSGRSARRRI EEARELIADKLGARPSEVIFTAGGTESDNLAVKGIYWARRDAEPHRRRIVTTEVEHHA VLDSVNWLVEHEGAHVTWLPTAADGSVSATALREALQSHDDVALVSVMWANNEVGTIL PIAEMSVVAMEFGVPMHSDAIQAVGQLPLDFGASGLSAMSVAGHKFGGPPGVGALLLR RDVTCVPLMHGGGQERDIRSGTPDVASAVGMATAAQIAVDGLEENSARLRLLRDRLVE GVLAEIDDVCLNGADDPMRLAGNAHFTFRGCEGDALLMLLDANGIECSTGSACTAGVA QPSHVLIAMGVDAASARGSLRLSLGHTSVEADVDAALEVLPGAVARARRAALAAAGAS R" misc_feature complement(3384422..3384475) /gene="iscS" /locus_tag="Rv3025c" /note="PS00595 Aminotransferases class-V pyridoxal-phosphate attachment site" gene complement(3385163..3386077) /locus_tag="Rv3026c" /db_xref="GeneID:888489" CDS complement(3385163..3386077) /locus_tag="Rv3026c" /function="UNKNOWN" /note="Rv3026c, (MTV012.41c), len: 304 aa. Conserved hypothetical protein, similar to Q9RCZ0|SCM10.08C PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (275 aa), FASTA scores: opt: 393, E(): 2.2e-17, (41.4% identity in 299 aa overlap). Similar in part to other hypothetical proteins and acyltransferases e.g. BAB51968|MLR5533 from Rhizobium loti (266 aa), FASTA scores: opt: 280, E(): 2.4e-10, (29.45% identity in 258 aa overlap); Q9KIH9 PUTATIVE ACYLTRANSFERASE (PUTATIVE ACYLTRANSFERASE TRANSMEMBRANE PROTEIN) (EC 2.3.1.) from Rhizobium meliloti (Sinorhizobium meliloti) (292 aa), FASTA scores: opt: 252, E(): 1.4e-08, (30.5% identity in 210 aa overlap); O69114|PLSC PUTATIVE 1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE from Burkholderia pseudomallei (Pseudomonas pseudomallei) (289 aa), FASTA scores: opt: 216, E(): 2.4e-06, (30.85% identity in 269 aa overlap); etc. So may be a member of acyltransferase family protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217542.1" /db_xref="GI:15610163" /db_xref="GeneID:888489" /translation="MSAPAVTEHSWLPRATCGVSCVSVGDAAQVRRPLVVLRVALRVM LALLLVPGVPLVVMPLPGRTRVQRIYCRLVLRLFGVRITVSGSPVRNLRGVLVVSGHV SWLDVFCIGSVLPGSFVARADMFTGRTIGIVARILKIIPIERASLRRLPGVVDTIARR LRAGQTVVAFPEGTTWCGRPGDDAGRPAARAGAGCSHRGCGAFYPAMFQAAIDAGRPV QPLRLTYHHVDGTVSTAPAFVGDDTLVRSVCRLLTVRRTLAWVRVESLQLPGTDRRNL ARRCQSAVLAGALGQSGQRPGRRHVPAT" gene complement(3386074..3386814) /locus_tag="Rv3027c" /db_xref="GeneID:887851" CDS complement(3386074..3386814) /locus_tag="Rv3027c" /function="UNKNOWN" /note="Rv3027c, (MTV012.42c), len: 246 aa. Conserved hypothetical protein, similar, but shorter 30 aa in N-terminus, to others e.g. Q9RCY9|SCM10.09c from Streptomyces coelicolor (256 aa), FASTA scores: opt: 498, E(): 7.8e-24, (47.7% identity in 237 aa overlap); BAB50158|MLR3216 from Rhizobium loti (291 aa), FASTA scores: opt: 359, E(): 3.7e-15, (33.35% identity in 246 aa overlap); etc. Equivalent to AAK47441 from Mycobacterium tuberculosis strain CDC1551 (281 aa) but shorter 35 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217543.1" /db_xref="GI:15610164" /db_xref="GeneID:887851" /translation="MVEAAQRLRYDVFSTTPGFALPAAADTRRDGDRFDEYCDHLLVR DDDTGELVGCYRMLAPAGAIAAGGLYTATEFDVCAFDPLRPSLVEMGRAVVREGHRNG GVVLLMWAGILAYLDRYGYDYVTGCVSVPIGGDGETPGSRLRGVRDFILNRHAAPPQC QVYPYRPVRVDGRSLDDILPPPRPAVPPLMRGYLRLGARACGEPAHDPDFGVGDFCLL LDKDHADTRYLRRLRSVAAASEMVNDAR" gene complement(3387075..3388031) /gene="fixB" /locus_tag="Rv3028c" /db_xref="GeneID:888549" CDS complement(3387075..3388031) /gene="fixB" /locus_tag="Rv3028c" /function="THE ELECTRON TRANSFER FLAVOPROTEIN SERVES AS A SPECIFIC ELECTRON ACCEPTOR FOR OTHER DEHYDROGENASES. IT TRANSFERS THE ELECTRONS TO THE MAIN RESPIRATORY CHAIN VIA ETF-UBIQUINONE OXIDOREDUCTASE (ETF DEHYDROGENASE)." /experiment="experimental evidence, no additional details recorded" /note="Rv3028c, (MTV012.43c), len: 318 aa. Probable fixB (alternate gene name: etfA), electron transfer flavoprotein (alpha subunit) for various dehydrogenases. Equivalent to O33096|ETFA_MYCLE|FIXB|ML1711|MLCB637.04 ELECTRON TRANSFER FLAVOPROTEIN from Mycobacterium leprae (318 aa), FASTA scores: opt: 1788, E(): 1.1e-87, (89.3% identity in 318 aa overlap). Also highly similar to many e.g. Q9K418|SCG22.27c from Streptomyces coelicolor (320 aa), FASTA scores: opt: 1161, E(): 1.6e-54, (59.45% identity in 323 aa overlap); AAK08137|etfa from Rhodobacter sphaeroides (308 aa), FASTA scores: opt: 792, E(): 5.1e-35, (45.95% identity in 309 aa overlap); P38974|ETFA_PARDE ELECTRON TRANSFER FLAVOPROTEIN from Paracoccus denitrificans (307 aa), FASTA scores: opt: 789, E(): 7.4e-35, (45.95% identity in 309 aa overlap); etc. BELONGS TO THE ETF ALPHA-SUBUNIT / FIXB FAMILY.; etfA" /codon_start=1 /transl_table=11 /product="electron transfer flavoprotein subunit alpha" /protein_id="NP_217544.1" /db_xref="GI:15610165" /db_xref="GeneID:888549" /translation="MAEVLVLVEHAEGALKKVSAELITAARALGEPAAVVVGVPGTAA PLVDGLKAAGAAKIYVAESDLVDKYLITPAVDVLAGLAESSAPAGVLIAATADGKEIA GRLAARIGSGLLVDVVDVREGGVGVHSIFGGAFTVEAQANGDTPVITVRAGAVEAEPA AGAGEQVSVEVPAAAENAARITAREPAVAGDRPELTEATIVVAGGRGVGSAENFSVVE ALADSLGAAVGASRAAVDSGYYPGQFQVGQTGKTVSPQLYIALGISGAIQHRAGMQTS KTIVAVNKDEEAPIFEIADYGVVGDLFKVAPQLTEAIKARKG" gene complement(3388070..3388870) /gene="fixA" /locus_tag="Rv3029c" /db_xref="GeneID:887670" CDS complement(3388070..3388870) /gene="fixA" /locus_tag="Rv3029c" /function="THE ELECTRON TRANSFER FLAVOPROTEIN SERVES AS A SPECIFIC ELECTRON ACCEPTOR FOR OTHER DEHYDROGENASES. IT TRANSFERS THE ELECTRONS TO THE MAIN RESPIRATORY CHAIN VIA ETF-UBIQUINONE OXIDOREDUCTASE (ETF DEHYDROGENASE)." /experiment="experimental evidence, no additional details recorded" /note="Rv3029c, (MTV012.44c), len: 266 aa. Probable fixA (alternate gene name: etfB), electron transfer flavoprotein (beta-subunit). Equivalent of O33095|ETFB_MYCLE|FixA|MLCB637.03 ELECTRON TRANSFER FLAVOPROTEIN from Mycobacterium leprae (266 aa), FASTA scores: opt: 1603, E(): 7.6e-87, (95.1% identity in 266 aa overlap). Also highly similar to others e.g. Q9K417|SCG22.28c from Streptomyces coelicolor (262 aa), FASTA scores: opt: 860, E(): 2.3e-43, (52.4% identity in 263 aa overlap); O85691|ETFB_MEGEL from Megasphaera elsdenii (270 aa), FASTA scores: opt: 548, E(): 4.2e-25, (35.15% identity in 273 aa overlap); etc. Also highly similar in particular to Q9KHD0|NONH FLAVOPROTEIN REDUCTASE from Streptomyces griseus subsp. griseus (this one is required for macrotetrolide biosynthesis in Streptomyces griseus) (261 aa), FASTA scores: opt: 867, E(): 8.8e-44, (54.0% identity in 263 aa overlap). BELONGS TO THE ETF BETA-SUBUNIT / FIXA FAMILY.; etfB" /codon_start=1 /transl_table=11 /product="electron transfer flavoprotein subunit beta" /protein_id="NP_217545.1" /db_xref="GI:15610166" /db_xref="GeneID:887670" /translation="MTNIVVLIKQVPDTWSERKLTDGDFTLDREAADAVLDEINERAV EEALQIREKEAADGIEGSVTVLTAGPERATEAIRKALSMGADKAVHLKDDGMHGSDVI QTGWALARALGTIEGTELVIAGNESTDGVGGAVPAIIAEYLGLPQLTHLRKVSIEGGK ITGERETDEGVFTLEATLPAVISVNEKINEPRFPSFKGIMAAKKKEVTVLTLAEIGVE SDEVGLANAGSTVLASTPKPAKTAGEKVTDEGEGGNQIVQYLVAQKII" gene 3389101..3389925 /locus_tag="Rv3030" /db_xref="GeneID:888604" CDS 3389101..3389925 /locus_tag="Rv3030" /function="UNKNOWN" /note="Rv3030, (MTV012.45), len: 274 aa. Conserved hypothetical protein, equivalent to O33094|MLCB637.02c|ML1713 hypothetical 30.8 KDa protein from Mycobacterium leprae (280 aa), FASTA scores: opt: 1388, E(): 5.5e-83, (78.2% identity in 280 aa overlap). N-terminus has similarity to hypothetical proteins from a number of organisms and to Q54303|EMBL:X86780|RAPM methyltransferase from Streptomyces hygroscopicus (317 aa), FASTA scores: opt: 191, E(): 3.6e-05, (35.65% identity in 101 aa overlap). TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217546.1" /db_xref="GI:15610167" /db_xref="GeneID:888604" /translation="MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENY WFRRHQVVYQRLAPRCTARDVLEAGCGEGYGADLIACVARQVIAVDYDETAVAHVRSR YPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLRGSGLLMVSTP NRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAMCGLFHGPRLRDMDARHGG SIIDAQIMRAVAGAPWPPELAADVAAVTTADFEMVAAGHDRDIDDSLDLIAIAVRP" gene 3389922..3391502 /locus_tag="Rv3031" /db_xref="GeneID:888543" CDS 3389922..3391502 /locus_tag="Rv3031" /function="UNKNOWN" /note="Rv3031, (MTV012.46), len: 526 aa. Conserved hypothetical protein, equivalent to Q9CBR4|ML1714 HYPOTHETICAL PROTEIN from Mycobacterium leprae (522 aa), FASTA scores: opt: 3167, E(): 4.4e-190, (86.15% identity in 526 aa overlap); and highly similar to truncated O33093|MLCB637.01c HYPOTHETICAL 37.2 KDA PROTEIN (FRAGMENT) from Mycobacterium leprae (338 aa), FASTA scores: opt: 2041, E(): 5.7e-120, (84.8% identity in 342 aa overlap). Also some similarity to hypothetical proteins Q9V0M7|PAB1857 from Pyrococcus abyssi (602 aa), FASTA scores: opt: 477, E(): 3.5e-22, (31.2% identity in 556 aa overlap); and Synechocystis P74630|D90916|SLL0735 from Synechocystis sp. strain PCC 6803 (529 aa), FASTA scores: opt: 282, E(): 4.7e-10, (28.6% identity in 560 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217547.1" /db_xref="GI:15610168" /db_xref="GeneID:888543" /translation="MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAY LPLLQVLAALADENRHRLITLGMTPVVNAQLDDPYCLNGVHHWLANWQLRAEEAASVR YARQSKSADYPSCTPEALRAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVE LLGGPLAHPFQPLLAPRLREFALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYAT AGVSHFMVDGPSLHGDTALGRPVGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFH TYDHLTGLKPARVTGRNVPSEQKAPYDPERADRAVDVHVADFVDVVRNRLLSESERIG RPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELP PSSWGSGKDWQVWSGAKVADLVQLNSEVVDTALTTIDKALAQTASLDGPLPRDHVADQ ILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATREIAGALAAGRRDTARRLAEG WNRADGLFGALDARRLPK" gene 3391534..3392778 /locus_tag="Rv3032" /db_xref="GeneID:888185" CDS 3391534..3392778 /locus_tag="Rv3032" /EC_number="2.-.-.-" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3032, (MTV012.47), len: 414 aa. Possible transferase (EC 2.-.-.-), equivalent to Q9CBR3|ML1715 PUTATIVE TRANSFERASE from Mycobacterium leprae (438 aa), FASTA scores: opt: 2456, E(): 7.3e-145, (87.9% identity in 414 aa overlap). Also similar to hypothetical proteins and various transferases e.g. P73369|SLL1971 HYPOTHETICAL 46.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (404 aa), FASTA scores: opt: 584, E(): 7.3e-29, (34.5% identity in 400 aa overlap); Q9Z5B7|SC2G5.06 PUTATIVE TRANSFERASE from Streptomyces coelicolor (406 aa), FASTA scores: opt: 509, E(): 3.3e-24, (35.9% identity in 413 aa overlap); Q9UZA1|PAB0827 GALACTOSYLTRANSFERASE (LPS BIOSYNTHESIS RFBU RELATED PROTEIN) from Pyrococcus abyssi (371 aa), FASTA scores: opt: 381, E(): 2.6e-16, (26.75% identity in 404 aa overlap); etc. TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="NP_217548.1" /db_xref="GI:15610169" /db_xref="GeneID:888185" /translation="MRILMVSWEYPPVVIGGLGRHVHHLSTALAAAGHDVVVLSRCPS GTDPSTHPSSDEVTEGVRVIAAAQDPHEFTFGNDMMAWTLAMGHAMIRAGLRLKKLGT DRSWRPDVVHAHDWLVAHPAIALAQFYDVPMVSTIHATEAGRHSGWVSGALSRQVHAV ESWLVRESDSLITCSASMNDEITELFGPGLAEITVIRNGIDAARWPFAARRPRTGPAE LLYVGRLEYEKGVHDAIAALPRLRRTHPGTTLTIAGEGTQQDWLIDQARKHRVLRATR FVGHLDHTELLALLHRADAAVLPSHYEPFGLVALEAAAAGTPLVTSNIGGLGEAVING QTGVSCAPRDVAGLAAAVRSVLDDPAAAQRRARAARQRLTSDFDWQTVATATAQVYLA AKRGERQPQPRLPIVEHALPDR" gene 3393380..3393928 /locus_tag="Rv3033" /db_xref="GeneID:888865" CDS 3393380..3393928 /locus_tag="Rv3033" /function="UNKNOWN" /note="Rv3033, (MTV012.48), len: 182 aa. Hypothetical unknown protein. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217549.1" /db_xref="GI:15610170" /db_xref="GeneID:888865" /translation="MAHSIVRTLLASGAATALIAIPTACSFSIGTSHSHSVSKAEVAR QITAKMTDAAGNKPESVTCPSDLPAEVGAELNCEMKIKDRTFNVNVTVTSVDGSDVKF DMVETVDKNQVANIISDKLFQRVGARPDSVTCPDNLKGVEGAKLRCRLTDGSKTYGIS VIVTSVDAGDVNFDFKVDDHPE" gene complement(3394019..3394921) /locus_tag="Rv3034c" /db_xref="GeneID:887470" CDS complement(3394019..3394921) /locus_tag="Rv3034c" /EC_number="2.-.-.-" /function="UNKNOWN; POSSIBLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3034c, (MTV012.49c), len: 300 aa. Possible transferase (2.-.-.-), equivalent to AAK47449|MT3119 Hexapeptide transferase family protein from M. tuberculosis strain CDC1551 but N-terminus shorter 39 residues (262 aa), FASTA scores: opt: 1773, E(): 4.7e-105, (100.0% identity in 262 aa overlap). Similar to Q9CBR1|ML1719 from Mycobacterium leprae but also shorter in N-terminus (245 aa), FASTA scores: opt: 1549, E(): 6.6e-91, (90.6% identity in 244 aa overlap). Some weakly similarity with other transferases (C-terminal part shows some similarity to acetyltransferase from Methanococcus jannaschii (214 aa)). Alternative start possible at 3395077 but codon usage not as good. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="NP_217550.1" /db_xref="GI:15610171" /db_xref="GeneID:887470" /translation="MNVLSLGSSSGVVWGRVPITAPAGAATGVTSRADAHSQMRRYAQ TGPTAKLSSAPMTTMWGAPLHRRWRGSRLRDPRQAKFLTLASLKWVLANRAYTPWYLV RYWRLLRFKLANPHIITRGMVFLGKGVEIHATPELAQLEIGRWVHIGDKNTIRAHEGS LRFGDKVVLGRDNVINTYLDIEIGDSVLMADWCYICDFDHRMDDITLPIKDQGIIKSP VRIGPDTWIGVKVSVLRGTTIGRGCVLGSHAVVRGAIPDYSIAVGAPAKVVKNRQLSW EASAAQRAELAAALADIERKKAAR" gene 3395379..3396461 /locus_tag="Rv3035" /db_xref="GeneID:887842" CDS 3395379..3396461 /locus_tag="Rv3035" /function="UNKNOWN" /note="Rv3035, (MTV012.50), len: 360 aa. Conserved hypothetical protein, equivalent to Q9CBR0|ML1720 HYPOTHETICAL PROTEIN from Mycobacterium leprae (364 aa), FASTA scores: opt: 1963, E(): 1.4e-108, (75.8% identity in 363 aa overlap). TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217551.1" /db_xref="GI:15610172" /db_xref="GeneID:887842" /translation="MAAGPALSARGYLALNGQTPAGCSLMEWQNDNNGRQRWCVRLVQ GGGFAGPLFDGFDNLYVGQPGAIISFPPTQWTRWRQPVIGMPSTPRFLGHGRLLVSTH LGQLLVFDTRRGMVVGSPVDLVDGIDPTDATRGLADCAPARPGCPVAAAPAFSSVNGT VVVSVWQPGEPAAKLVGLKYHAEQLVREWTSDAVSAGVLASPVLSADGSTVYVNGRDH RLWALNAADGKAKWSAPLGFLAQTPPALTPHGLIVSGGGPDTALAAFRDAGDHAEGAW RRDDVTALSTASLAGTGVGYTVISGPNHDGTPGLSLLVFDPANGHTVNSYPLPGATGY PVGVSVGNDRRVVTATSDGQVYSFAP" gene complement(3396458..3397141) /gene="TB22.2" /locus_tag="Rv3036c" /db_xref="GeneID:887320" CDS complement(3396458..3397141) /gene="TB22.2" /locus_tag="Rv3036c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3036c, (MTV012.51c), len: 227 aa. Probable TB22.2, conserved secreted protein, with putative N-terminal signal peptide, highly similar to secreted immunogenic protein MPT64/MPB64 P19996|Rv1980c|MTCY39.39 from Mycobacterium tuberculosis and Mycobacterium bovis (228 aa), FASTA scores: opt: 681, E(): 2.5e-35, (45.8% identity in 227 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217552.1" /db_xref="GI:15610173" /db_xref="GeneID:887320" /translation="MRYLIATAVLVAVVLVGWPAAGAPPSCAGLGGTVQAGQICHVHA SGPKYMLDMTFPVDYPDQQALTDYITQNRDGFVNVAQGSPLRDQPYQMDATSEQHSSG QPPQATRSVVLKFFQDLGGAHPSTWYKAFNYNLATSQPITFDTLFVPGTTPLDSIYPI VQRELARQTGFGAAILPSTGLDPAHYQNFAITDDSLIFYFAQGELLPSFVGACQAQVP RSAIPPLAI" gene complement(3397214..3398290) /locus_tag="Rv3037c" /db_xref="GeneID:888640" CDS complement(3397214..3398290) /locus_tag="Rv3037c" /function="UNKNOWN" /note="Rv3037c, (MTV012.52c), len: 358 aa. Conserved hypothetical protein, similar in part to others e.g. O86799|SC6G4.36c from Streptomyces coelicolor (426 aa), FASTA scores: opt: 545, E(): 5.5e-27, (36.15% identity in 354 aa overlap); Q9UZW6|PAB0687 from Pyrococcus abyssi (386 aa), FASTA scores: opt: 262, E(): 3.5e-09, (31.0% identity in 200 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217553.1" /db_xref="GI:15610174" /db_xref="GeneID:888640" /translation="MRARFGDRAPWLVETTLLRRRAAGKLGELCPNVGVSQWLFTDEA LQQATAAPVARHRARRLAGRVVHDATCSIGTELAALRELAVRAVGSDIDPVRLAMARH NLAALGMEADLCRADVLHPVTRDAVVVIDPARRSNGRRRFHLADYQPGLGPLLDRYRG RDVVVKCAPGIDFEEVGRLGFEGEIEVISYRGGVREACLWSAGLAGSGIRRRASILDS GEQIGDDEPDDCGVRPAGKWIVDPDGAVVRAGLVRNYGARHGLWQLDPQIAYLSGDRL PPALRGFEVLEQLAFDERRLRQVLSALDCGAAEILVRGVAIDPDALRRRLRLRGSRPL AVVITRIGAGSLSHVTAYVCRPSR" gene complement(3398425..3399408) /locus_tag="Rv3038c" /db_xref="GeneID:888517" CDS complement(3398425..3399408) /locus_tag="Rv3038c" /function="UNKNOWN" /note="Rv3038c, (MTV012.53c), len: 327 aa. Conserved hypothetical protein, equivalent to Q9CBQ9|ML1723 HYPOTHETICAL PROTEIN from Mycobacterium leprae (327 aa), FASTA scores: opt: 1843, E(): 6.1e-108, (80.75% identity in 327 aa overlap). Weak similarity with e.g. Q9KZI3|SCG8A.16 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (199 aa), FASTA scores: opt: 227, E(): 3.9e-07, (31.95% identity in 191 aa overlap) and O52570 METHYLTRANSFERASE from Amycolatopsis mediterranei (272 aa), FASTA scores: opt: 228, E(): 4.3e-07, (31.7% identity in 164 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature but shows no similarity to known LysR family members. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217554.1" /db_xref="GI:15610175" /db_xref="GeneID:888517" /translation="MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENY DEKWSISYDQRCVDYARGRFDAIVPDEVIAQLPYDRALELGCGTGFFLLNLIQAGVAR RGSVTDLSPGMVKVATRNGQALGLDIDGRVADAEGIPYDDDAFDLVVGHAVLHHIPDV ELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWRVVTNATKLPGLRGWRRPQ GELDESSRAAALEALVDLHTFTPQDLQRIAHNAGAVEVQTATEEFTAAMLGWPLRTFE CTVPPGRLGWGWARFAFTSWKTLGWVDANVWRHVVPKGWFYNVMITGVKPS" misc_feature complement(3399049..3399126) /locus_tag="Rv3038c" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene complement(3399419..3400183) /gene="echA17" /locus_tag="Rv3039c" /db_xref="GeneID:888216" CDS complement(3399419..3400183) /gene="echA17" /locus_tag="Rv3039c" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_217555.1" /db_xref="GI:15610176" /db_xref="GeneID:888216" /translation="MPEFVNVVVSDGSQDAGLAMLLLSRPPTNAMTRQVYREVVAAAN ELGRRDDVAAVILYGGHEIFSAGDDMPELRTLSAQEADTAARIRQQAVDAVAAIPKPT VAAITGYALGAGLTLALAADWRVSGDNVKFGATEILAGLIPSGDGMARLTRAAGPSRA KELVFSGRFFDAEEALALGLIDDMVAPDDVYDAAAAWARRFLDGPPHALAAAKAGISD VYELAPAERIAAERRRYVEVFAAGQGGGSKGDRGGR" gene complement(3400192..3401058) /locus_tag="Rv3040c" /db_xref="GeneID:887672" CDS complement(3400192..3401058) /locus_tag="Rv3040c" /function="UNKNOWN" /note="Rv3040c, (MTV012.55c), len: 288 aa. Conserved hypothetical protein, highly similar to Q9XA40|SCH17.07c hypothetical protein from Streptomyces coelicolor (312 aa), FASTA scores: opt: 648, E(): 5.2e-34, (50.0% identity in 260 aa overlap). Also similar to Q9F7R7 PREDICTED MUTT SUPERFAMILY HYDROLASE from uncultured proteobacterium EBAC31A08 (264 aa), FASTA scores: opt: 295, E(): 1.3e-11, (27.2% identity in 257 aa overlap); AAK24293|CC2322 hypothetical protein from Caulobacter crescentus (254 aa), BLAST scores: 185 (32% identity) AND 131 (37% identity), etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217556.1" /db_xref="GI:15610177" /db_xref="GeneID:887672" /translation="MNSPREPLVPPPTPRPAATVMLVRDPDAGSASGLAVFLMRRHAA MDFAAGVMVFPGGGVDDRDRDADLGRLGAWAGPPPQWWAQRFGIEPDLAEALVCAAAR ETFEESGVLFAGPVDQDHSAPNSIVSDASVYGDARRALADRTLSFADFLQREKLVLRS DLLRPWANWVTPEAELTRRYDTYFFVGALPEGQRADGENTESDRAGWVLPADAIADFA AGRNFLLPPTWTQLDSLAGHTVADVLAVERQIVPVQPQLARNGDNWEIEFFDSDRYNQ ARRSGGSTGWPL" gene complement(3401055..3401918) /locus_tag="Rv3041c" /db_xref="GeneID:887197" CDS complement(3401055..3401918) /locus_tag="Rv3041c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY IRON) ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv3041c, (MTV012.56c), len: 287 aa. Probable conserved ATP-binding protein ABC transporter (see citation below), equivalent to Q9CBQ7|ML1726 PUTATIVE ABC TRANSPORTER PROTEIN ATP-BINDING PROTEIN from Mycobacterium leprae (305 aa), FASTA scores: opt: 1576, E(): 8.6e-85, (83.4% identity in 289 aa overlap). Also similar to other putative ATP-binding proteins ABC transporters e.g. Q9X9Z4|SCI5.06C from Streptomyces coelicolor (265 aa), FASTA scores: opt: 893, E(): 4.8e-45, (53.3% identity in 257 aa overlap); Q9L156|SC5C11.16c from Streptomyces coelicolor (279 aa), FASTA scores: opt: 680, E(): 1.3e-32, (45.4% identity in 271 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="NP_217557.1" /db_xref="GI:15610178" /db_xref="GeneID:887197" /translation="MRHDSRVLDNGGPDAADPDLLIDFRNVSLRRNGRTLVGPLDWAV ELDERWVIVGPNGAGKTSLLRIAAAAEHPSSGVAFVLGERLGRVDVSELRARVGLSSS ALAERVPGDERVRDLVVSAGYAVLGRWRERYEAVDYHRAIDMLESLGAEHLANRTYGT LSEGERKRVLIARALMTDPELLLLDEPAAGLDLGGREELVARLADLAADPDAPALVLV THHVEEIPPGFSHCLLLSEARVVAAGLLPDALTAENLSTAFGQEITLEVADGRYFARR RRSRAAHRRQS" misc_feature complement(3401736..3401759) /locus_tag="Rv3041c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3401933..3403162) /gene="serB2" /locus_tag="Rv3042c" /db_xref="GeneID:887815" CDS complement(3401933..3403162) /gene="serB2" /locus_tag="Rv3042c" /EC_number="3.1.3.3" /function="GENERATES SERINE FROM PHOSPHOSERINE [CATALYTIC ACTIVITY: PHOSPHOSERINE + H(2)O = SERINE + PHOSPHATE]." /note="Rv3042c, (MTV012.57c), len: 409 aa. Probable serB2, Phosphoserine phosphatase (EC 3.1.3.3), equivalent to Q9CBQ6|ML1727 PUTATIVE PHOSPHOSERINE PHOSPHATASE from Mycobacterium leprae (411 aa), FASTA scores: opt: 2173, E(): 1.3e-117, (86.3% identity in 408 aa overlap). Also similar to other e.g. Q9S281|SCI28.02 from Streptomyces coelicolor (410 aa), FASTA scores: opt: 1209, E(): 3e-62, (51.75% identity in 400 aa overlap); Q9HUK|PA4960 from Pseudomonas aeruginosa (429 aa), FASTA scores: opt: 704, E(): 3.1e-33, (40.95% identity in 393 aa overlap); O28142|SERB_ARCTU|AF2138 from Archaeoglobus fulgidus (344 aa), FASTA scores: opt: 671, E(): 2e-31, (37.25% identity in 325 aa overlap); and P06862|SERB_ECOLI (322 aa), FASTA scores: opt: 628, E(): 5.7e-29, (46.8% identity in 235 aa overlap). BELONGS TO THE SERB FAMILY. TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="phosphoserine phosphatase" /protein_id="NP_217558.1" /db_xref="GI:15610179" /db_xref="GeneID:887815" /translation="MPAKVSVLITVTGMDQPGVTSALFEVLAQHGVELLNVEQVVIRG RLTLGVLVSCPLDVADGTALRDDVAAAIHGVGLDVAIERSDDLPIIRQPSTHTIFVLG RPITAGAFSAVARGVAALGVNIDFIRGISDYPVTGLELRVSVPPGCVGPLQIALTKVA AEEHVDVAVEDYGLAWRTKRLIVFDVDSTLVQGEVIEMLAARAGAQGQVAAITEAAMR GELDFAESLQRRVATLAGLPATVIDDVAEQLELMPGARTTIRTLRRLGFRCGVVSGGF RRIIEPLARELMLDFVASNELEIVDGILTGRVVGPIVDRPGKAKALRDFASQYGVPME QTVAVGDGANDIDMLGAAGLGIAFNAKPALREVADASLSHPYLDTVLFLLGVTRGEIE AADAGDCGVRRVEIPAD" gene complement(3403200..3404921) /gene="ctaD" /locus_tag="Rv3043c" /db_xref="GeneID:887881" CDS complement(3403200..3404921) /gene="ctaD" /locus_tag="Rv3043c" /EC_number="1.9.3.1" /function="CYTOCHROME C OXIDASE IS THE COMPONENT OF THE RESPIRATORY CHAIN THAT CATALYZES THE REDUCTION OF OXYGEN TO WATER. SUBUNITS 1-3 FORM THE FUNCTIONAL CORE OF THE ENZYME COMPLEX. CO I IS THE CATALYTIC SUBUNIT OF THE ENZYME. ELECTRONS ORIGINATING IN CYTOCHROME C ARE TRANSFERRED VIA THE COPPER A CENTER OF SUBUNIT 2 AND HEME A OF SUBUNIT 1 TO THE BIMETALLIC CENTER FORMED BY HEME A3 AND COPPER B [CATALYTIC ACTIVITY: 4 FERROCYTOCHROME C + O(2) = 2 H(2)O + 4 FERRICYTOCHROME C]." /note="Rv3043c, (MTV012.58c), len: 573 aa. Probable ctaD, integral membrane cytochrome C oxidase polypeptide I (EC 1.9.3.1), equivalent to Q9CBQ5|ML1728 from Mycobacterium leprae (574 aa), FASTA scores: opt: 3738, E(): 3.8e-216, (95.4% identity in 566 aa overlap). Also similar to other CYTOCHROME C OXIDASES POLYPEPTIDE I e.g. Q9AEL9|CTAD from Corynebacterium glutamicum (Brevibacterium flavum) (584 aa), FASTA scores: opt: 3065, E(): 6.8e-176, (72.65% identity in 567 aa overlap); Q9X813|SC6G10.28c from Streptomyces coelicolor (578 aa), FASTA scores: opt: 2888, E(): 2.6e-165, (71.7% identity in 544 aa overlap); Q9K451|CTAD from Streptomyces coelicolor (573 aa), FASTA scores: opt: 2757, E(): 1.8e-157, (70.2% identity in 537 aa overlap). Contains PS00077 Cytochrome c oxidase subunit I, copper B binding region signature. BELONGS TO THE HEME-COPPER RESPIRATORY OXIDASE FAMILY. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="cytochrome C oxidase polypeptide I" /protein_id="NP_217559.1" /db_xref="GI:15610180" /db_xref="GeneID:887881" /translation="MTAEAPPLGELEAIRPYPARTGPKGSLVYKLITTTDHKMIGIMY CVACISFFFIGGLLALLMRTELAAPGLQFLSNEQFNQLFTMHGTIMLLFYATPIVFGF ANLVLPLQIGAPDVAFPRLNAFSFWLFVFGATIGAAGFITPGGAADFGWTAYTPLTDA IHSPGAGGDLWIMGLIVAGLGTILGAVNMITTVVCMRAPGMTMFRMPIFTWNIMVTSI LILIAFPLLTAALFGLAADRHLGAHIYDAANGGVLLWQHLFWFFGHPEVYIIALPFFG IVSEIFPVFSRKPIFGYTTLVYATLSIAALSVAVWAHHMFATGAVLLPFFSFMTYLIA VPTGIKFFNWIGTMWKGQLTFETPMLFSVGFMVTFLLGGLTGVLLASPPLDFHVTDSY FVVAHFHYVLFGTIVFATFAGIYFWFPKMTGRLLDERLGKLHFWLTFIGFHTTFLVQH WLGDEGMPRRYADYLPTDGFQGLNVVSTIGAFILGASMFPFVWNVFKSWRYGEVVTVD DPWGYGNSLEWATSCPPPRHNFTELPRIRSERPAFELHYPHMVERLRAEAHVGRHHDE PAMVTSS" misc_feature complement(3403977..3403991) /gene="ctaD" /locus_tag="Rv3043c" /note="PS00077 Cytochrome c oxidase subunit I, copper B binding region signature" gene 3405136..3406215 /gene="fecB" /locus_tag="Rv3044" /db_xref="GeneID:888745" CDS 3405136..3406215 /gene="fecB" /locus_tag="Rv3044" /function="MAY BE INVOLVED IN ACTIVE TRANSPORT OF FeIII-DECITRATE ACROSS THE MEMBRANE (IMPORT)." /note="Rv3044, (MTV012.59), len: 359 aa. Probable fecB, FeIII dicitrate-binding periplasmic lipoprotein (see citation below), equivalent to Q9CBQ4|FECB|ML1729 PUTATIVE FEIII-DICITRATE TRANSPORTER LIPOPROTEIN from Mycobacterium leprae (364 aa), FASTA scores: opt: 1816, E(): 1.1e-96, (75.65% identity in 357 aa overlap); and Q9LA57|FECB from Mycobacterium avium (364 aa), FASTA scores: opt: 1769, E(): 5.1e-94. Similar to many periplasmic FeIII-dicitrate transporters e.g. P72593|FECB|SLR1319 from Synechocystis sp. strain PCC 6803 (315 aa), FASTA scores: opt: 459, E(): 3.6e-19, (31.35% identity in 303 aa overlap); and P72611|FECB|SLR1492 from Synechocystis sp. strain PCC 6803. N-terminus longer (approximatively 30 aa) to AAK47459 from Mycobacterium tuberculosis strain CDC1551 (327 aa). Has signal peptide and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.883." /codon_start=1 /transl_table=11 /product="FEIII-dicitrate-binding periplasmic lipoprotein" /protein_id="NP_217560.1" /db_xref="GI:15610181" /db_xref="GeneID:888745" /translation="MRSTVAVAVAAAVIAASSGCGSDQPAHKASQSMITPTTQIAGAG VLGNDRKPDESCARAAAAADPGPPTRPAHNAAGVSPEMVQVPAEAQRIVVLSGDQLDA LCALGLQSRIVAAALPNSSSSQPSYLGTTVHDLPGVGTRSAPDLRAIAAAHPDLILGS QGLTPQLYPQLAAIAPTVFTAAPGADWENNLRGVGAATARIAAVDALITGFAEHATQV GTKHDATHFQASIVQLTANTMRVYGANNFPASVLSAVGVDRPPSQRFTDKAYIEIGTT AADLAKSPDFSAADADIVYLSCASEAAAERAAVILDSDPWRKLSANRDNRVFVVNDQV WQTGEGMVAARGIVDDLRWVDAPIN" misc_feature 3405163..3405195 /gene="fecB" /locus_tag="Rv3044" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 3406285..3407325 /gene="adhC" /locus_tag="Rv3045" /db_xref="GeneID:888888" CDS 3406285..3407325 /gene="adhC" /locus_tag="Rv3045" /EC_number="1.1.1.2" /function="GENERATES ALDEHYDE OR KETONE FROM ALCOHOL [CATALYTIC ACTIVITY: ALCOHOL + NADP(+) = ALDEHYDE OR KETONE + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="Rv3045, (MTV012.60), len: 346 aa. Probable adhC, NADP-dependent alcohol dehydrogenase (EC 1.1.1.2), equivalent to Q9CBQ3|ADHA|ML1730 ALCOHOL DEHYDROGENASES from Mycobacterium leprae (362 aa), FASTA scores: opt: 1982, E(): 1.3e-111, (85.85% identity in 346 aa overlap); Q9AE96|ADHC from Mycobacterium smegmatis (348 aa), FASTA scores: opt: 1808, E(): 3.4e-101, (78.95% identity in 347 aa overlap); Q9EWF1|SCK13.33c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (346 aa), FASTA scores: opt: 1508, E(): 3.3e-83, (64.45% identity in 346 aa overlap); O06007|ADHA from Bacillus subtilis (349 aa), FASTA scores: opt: 1412, E(): 1.9e-77, (61.8% identity in 335 aa overlap); etc. Contains PS00059 Zinc-containing alcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY. HIGH SIMILARITY WITH OTHER BACTERIAL ADH'S. TBparse score is 0.878." /codon_start=1 /transl_table=11 /product="NADP-dependent alcohol dehydrogenase ADHC" /protein_id="NP_217561.1" /db_xref="GI:15610182" /db_xref="GeneID:888888" /translation="MSTVAAYAAMSATEPLTKTTITRRDPGPHDVAIDIKFAGICHSD IHTVKAEWGQPNYPVVPGHEIAGVVTAVGSEVTKYRQGDRVGVGCFVDSCRECNSCTR GIEQYCKPGANFTYNSIGKDGQPTQGGYSEAIVVDENYVLRIPDVLPLDVAAPLLCAG ITLYSPLRHWNAGANTRVAIIGLGGLGHMGVKLGAAMGADVTVLSQSLKKMEDGLRLG AKSYYATADPDTFRKLRGGFDLILNTVSANLDLGQYLNLLDVDGTLVELGIPEHPMAV PAFALALMRRSLAGSNIGGIAETQEMLNFCAEHGVTPEIELIEPDYINDAYERVLASD VRYRFVIDISAL" misc_feature 3406468..3406512 /gene="adhC" /locus_tag="Rv3045" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene complement(3407314..3407688) /locus_tag="Rv3046c" /db_xref="GeneID:888870" CDS complement(3407314..3407688) /locus_tag="Rv3046c" /function="UNKNOWN" /note="Rv3046c, (MTV012.61c), len: 124 aa. Conserved hypothetical protein, similar to several hypothetical mycobacterial proteins e.g. Q50171|ML2258 U296W HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 194, E(): 7.6e-06, (35.9% identity in 103 aa overlap); and O06409|Rv0543c|MTCY25D10.22c from Mycobacterium tuberculosis (100 aa), FASTA scores: opt: 192, E(): 1e-05, (34.7% identity in 98 aa overlap). TBparse score is 0.873." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217562.1" /db_xref="GI:15610183" /db_xref="GeneID:888870" /translation="MTKTFSHPHFFRSVLRWLQVGYPEGVPGPDRVALLSLLRSTPLT EEQIGEVVRHFTENGSPAVADRVIDRDEIAEFISEVTHHDAGPENIQRVAGILAAAGW PLAGVDVGESESGSDRAPASQG" gene complement(3408022..3408306) /locus_tag="Rv3047c" /db_xref="GeneID:888898" CDS complement(3408022..3408306) /locus_tag="Rv3047c" /function="UNKNOWN" /note="Rv3047c, (MTV012.62c), len: 94 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217563.1" /db_xref="GI:15610184" /db_xref="GeneID:888898" /translation="MGGPFDADAEAHFDEVAEAFAKLTNVDRDVGVDLEKELCMTVEA DDRSDALVTRRLLPRVPRCIPLAARLAPGTIGCPSFWNPIATGGASRQAL" gene complement(3408404..3409378) /gene="nrdF2" /locus_tag="Rv3048c" /db_xref="GeneID:888886" CDS complement(3408404..3409378) /gene="nrdF2" /locus_tag="Rv3048c" /EC_number="1.17.4.1" /function="INVOLVED IN THE DNA REPLICATION PATHWAY. CATALYZES THE BIOSYNTHESIS OF DEOXYRIBONUCLEOTIDES FROM THE CORRESPONDING RIBONUCLEOTIDES, PRECURSORS THAT ARE NECESSARY FOR DNA SYNTHESIS [CATALYTIC ACTIVITY: 2'-DEOXYRIBONUCLEOSIDE DIPHOSPHATE + OXIDIZED THIOREDOXIN + H(2)O = RIBONUCLEOSIDE DIPHOSPHATE + REDUCED THIOREDOXIN]." /experiment="experimental evidence, no additional details recorded" /note="B2 or R2 protein; type 1b enzyme; catalyzes the rate-limiting step in dNTP synthesis; converts nucleotides to deoxynucleotides; forms a homodimer and then a multimeric complex with NrdE" /codon_start=1 /transl_table=11 /product="ribonucleotide-diphosphate reductase subunit beta" /protein_id="YP_177921.1" /db_xref="GI:57117051" /db_xref="GeneID:888886" /translation="MTGNAKLIDRVSAINWNRLQDEKDAEVWDRLTGNFWLPEKVPVS NDIPSWGTLTAGEKQLTMRVFTGLTMLDTIQGTVGAVSLIPDALTPHEEAVLTNIAFM ESVHAKSYSQIFSTLCSTAEIDDAFRWSEENRNLQRKAEIVLQYYRGDEPLKRKVAST LLESFLFYSGFYLPMYWSSRAKLTNTADMIRLIIRDEAVHGYYIGYKFQRGLALVDDV TRAELKDYTYELLFELYDNEVEYTQDLYDEVGLTEDVKKFLRYNANKALMNLGYEALF PRDETDVNPAILSALSPNADENHDFFSGSGSSYVIGKAVVTEDDDWDF" misc_feature complement(3409025..3409072) /gene="nrdF2" /locus_tag="Rv3048c" /note="PS00368 Ribonucleotide reductase small subunit signature" gene complement(3409509..3411083) /locus_tag="Rv3049c" /db_xref="GeneID:888894" CDS complement(3409509..3411083) /locus_tag="Rv3049c" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /note="Rv3049c, (MTV012.64c), len: 524 aa. Probable monooxygenase (EC 1.-.-.-), similar to several monooxygenases e.g. Q9I3H5|PA1538 PROBABLE FLAVIN-CONTAINING MONOOXYGENASE from Pseudomonas aeruginosa (527 aa), FASTA scores: opt: 1577, E(): 3.9e-90, (47.3% identity in 501 aa overlap); Q9RKB5|SCE87.23c MONOOXYGENASE from Streptomyces coelicolor (519 aa), FASTA scores: opt: 1522, E(): 9.8e-87, (47.4% identity in 485 aa overlap); Q9I218|PA2097 PROBABLE FLAVIN-BINDING MONOOXYGENASE from Pseudomonas aeruginosa (491 aa), FASTA scores: opt: 1366, E(): 4.2e-77, (43.75% identity in 489 aa overlap); etc. Also similar to Q10532|Rv0892|Y892_MYCTU|MT0916|MTCY31.20 PROBABLE MONOOXYGENASE (EC 1.14.13.-) from Mycobacterium tuberculosis strain H37Rv (495 aa), FASTA scores: opt: 1147, E(): 1.5e-63, (38.0% identity in 479 aa overlap). TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_217565.1" /db_xref="GI:15610186" /db_xref="GeneID:888894" /translation="MSIADTAAKPSTPSPANQPPVRTRAVIIGTGFSGLGMAIALQKQ GVDFVILEKADDVGGTWRDNTYPGCACDIPSHLYSFSFEPKADWKHLFSYWDEILGYL KGVTDKYGLRRYIEFNSLVDRGYWDDDECRWHVFTADGREYVAQFLISGAGALHIPSF PEIAGRDEFAGPAFHSAQWDHSIDLTGKRVAIVGTGASAIQIVPEIVGQVAELQLYQR TPPWVVPRTNEELPVSLRRALRTVPGLRALLRLGIYWAQEALAYGMTKRPNTLKIIEA YAKYNIRRSVKDRELRRKLTPRYRIGCKRILNSSTYYPAVADPKTELITDRIDRITHD GIVTADGTGREVFREADVIVYATGFHVTDSYTYVQIKGRHGEDLVDRWNREGIGAHRG ITVANMPNLFFLLGPNTGLGHNSVVFMIESQIHYVADAIAKCDRMGVQALAPTREAQD RFNQELQRRLAGSVWNSGGCRSWYLDEHGKNTVLWCGYTWQYWLTTRSVNPAEYRFFG IGNGLSSDRATVAAAN" gene complement(3411217..3411957) /locus_tag="Rv3050c" /db_xref="GeneID:888866" CDS complement(3411217..3411957) /locus_tag="Rv3050c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM" /note="Rv3050c, (MTV012.65c), len: 246 aa. Probable transcriptional regulatory protein tetR-family, equivalent but shorter to Q9CBQ1|ML1733 from Mycobacterium leprae (275 aa), FASTA scores: opt: 1381,(E): 2.7e-79, (86.25% identity in 240 aa overlap); AAK44712|MT0489 from Mycobacterium tuberculosis strain CDC1551 (256 aa), FASTA scores: opt: 328,(E): 1.8e-13, (30.75% identity in 234 aa overlap); etc. Also some similarity to O53757|Rv0472c|MTV038.16c. Alternative starts possible at 68052 or 67923. Has potential helix-turn-helix motif at positons 51-72." /codon_start=1 /transl_table=11 /product="AsnC family transcriptional regulator" /protein_id="NP_217566.1" /db_xref="GI:15610187" /db_xref="GeneID:888866" /translation="MVRIPRPHPSAKPGVKVDARSERWREHRKKVRNEIVDAAFRAID RLGPELSVRQIAEEAGTAKPKIYRHFTDKSDLLEAIGMRLRDMLWAAIFPSLDLATDS AREVIRRSVEEYVNLVDQHPNVLRVFIQGRSAKQSEATVRTLNEGREITLAMAEMFNN ELREMELNRAALELAAFAAFGSAASATEWWLGPEPDSPRRMPREQFVAHLTTIMMGVI VGTAEALGIAVDPDQPIHDAVPNNPAVR" gene complement(3412085..3414166) /gene="nrdE" /locus_tag="Rv3051c" /db_xref="GeneID:888869" CDS complement(3412085..3414166) /gene="nrdE" /locus_tag="Rv3051c" /EC_number="1.17.4.1" /function="INVOLVED IN THE DNA REPLICATION PATHWAY. CATALYZES THE BIOSYNTHESIS OF DEOXYRIBONUCLEOTIDES FROM THE CORRESPONDING RIBONUCLEOTIDES, PRECURSORS THAT ARE NECESSARY FOR DNA SYNTHESIS. [CATALYTIC ACTIVITY: 2'-DEOXYRIBONUCLEOSIDE DIPHOSPHATE + OXIDIZED THIOREDOXIN + H(2)O = RIBONUCLEOSIDE DIPHOSPHATE + REDUCED THIOREDOXIN]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the rate-limiting step in dNTP synthesis" /codon_start=1 /transl_table=11 /product="ribonucleotide-diphosphate reductase subunit alpha" /protein_id="NP_217567.1" /db_xref="GI:15610188" /db_xref="GeneID:888869" /translation="MLNLYDADGKIQFDKDREAAHQYFLQHVNQNTVFFHNQDEKLDY LIRENYYEREVLDQYSRNFVKTLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYL ERFEDRVVMVALTLAAGDTALAELLVDEIIDGRFQPATPTFLNSGKKQRGEPVSCFLL RVEDNMESIGRSINSALQLSKRGGGVALLLTNIREHGAPIKNIENQSSGVIPIMKLLE DAFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITF ELAKRNDDMYLFSPYDVERVYGVPFADISVTEKYYEMVDDARIRKTKIKAREFFQTLA ELQFESGYPYIMFEDTVNRANPIDGKITHSNLCSEILQVSTPSLFNEDLSYAKVGKDI SCNLGSLNIAKTMDSPDFAQTIEVAIRALTAVSDQTHIKSVPSIEQGNNDSHAIGLGQ MNLHGYLARERIFYGSDEGIDFTNIYFYTVLYHALRASNRIAIERGTHFKGFERSKYA SGEFFDKYTDQIWEPKTQKVRQLFADAGIRIPTQDDWRRLKESVQAHGIYNQNLQAVP PTGSISYINHSTSSIHPIVSKVEIRKEGKIGRVYYPAPYMTNDNLEYYEDAYEIGYEK IIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEG TEVEGCVSCML" misc_feature complement(3412466..3412534) /gene="nrdE" /locus_tag="Rv3051c" /note="PS00089 Ribonucleotide reductase large subunit signature" gene complement(3414232..3414684) /gene="nrdI" /locus_tag="Rv3052c" /db_xref="GeneID:888885" CDS complement(3414232..3414684) /gene="nrdI" /locus_tag="Rv3052c" /function="NOT KNOWN; PROBABLY INVOLVED IN RIBONUCLEOTIDE REDUCTASE FUNCTION." /note="in Salmonella NrdI has a stimulatory effect on the ribonucleotide reductase activity of NrdH with NrdEF" /codon_start=1 /transl_table=11 /product="ribonucleotide reductase stimulatory protein" /protein_id="NP_217568.1" /db_xref="GI:15610189" /db_xref="GeneID:888885" /translation="MDIAGRSLVYFSSVSENTHRFVQKLGIPATRIPLHGRIEVDEPY VLILPTYGGGRANPGLDAGGYVPKQVIAFLNNDHNRAQLRGVIAAGNTNFGAEFCYAG DVVSRKCSVPYLYRFELMGTEDDVAAVRTGLAEFWKEQTCHQPSLQSL" gene complement(3414719..3414958) /gene="nrdH" /locus_tag="Rv3053c" /db_xref="GeneID:888884" CDS complement(3414719..3414958) /gene="nrdH" /locus_tag="Rv3053c" /function="INVOLVED IN ELECTRON TRANSFER SYSTEM FOR RIBONUCLEOTIDE REDUCTASE SYSTEM NRDEF." /note="Rv3053c, (MTCY22D7.29), len: 79 aa. Probable nrdH, glutaredoxin-like protein, equivalent to Q9CBP8|NRDH|ML1736 from Mycobacterium leprae (80 aa), FASTA scores: opt: 478, E(): 2.7e-27, (91.15% identity in 79 aa overlap), and similar to many glutaredoxin-like proteins e.g. Q9XD65|NRDH from Corynebacterium glutamicum (Brevibacterium flavum) (77 aa), FASTA scores: opt: 382, E(): 1.5e-20, (72.35% identity in 76 aa overlap); and Q56108|NRDH_SALTY from Salmonella typhimurium (81 aa), FASTA scores: opt: 243, E(): 9.9e-11, (45.85% identity in 72 aa overlap). BELONGS TO THE GLUTAREDOXIN FAMILY." /codon_start=1 /transl_table=11 /product="glutaredoxin electron transport protein NrdH" /protein_id="NP_217569.1" /db_xref="GI:15610190" /db_xref="GeneID:888884" /translation="MTVTVYTKPACVQCSATSKALDKQGIAYQKVDISLDSEARDYVM ALGYLQAPVVVAGNDHWSGFRPDRIKALAGAALTA" gene complement(3415435..3415989) /locus_tag="Rv3054c" /db_xref="GeneID:887963" CDS complement(3415435..3415989) /locus_tag="Rv3054c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3054c, (MTCY22D7.28), len: 184 aa. Conserved hypothetical protein, similar to Q9RD22|SCM1.21 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (187 aa), FASTA scores: opt: 651, E(): 1.5e-33, (56.8% identity in 175 aa overlap). Also shares similarity with other hypothetical proteins and Chromate reductases e.g. AAK56853|CHRR from Pseudomonas putida (186 aa), FASTA scores: opt: 339, E(): 3.3e-14, (38.75% identity in 160 aa overlap). Contains aminotransferases class-II pyridoxal-phosphate attachment site (PS00599) near C-terminus. TBparse score is 0.873." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217570.1" /db_xref="GI:15610191" /db_xref="GeneID:887963" /translation="MSDTKSDIKILALVGSLRAASFNRQIAELAAKVAPDGVTVTMFE GLGDLPFYNEDIDTATEVPAPVSALREAASDAHAALVVTPEYNGSIPAVIKNAIDWLS RPFGDGALKDKPLAVIGGSMGRYGGVWAHDETRKSFSIAGTRVVDAIKLSVPFQTLGK SVADDAGLAANVRDAVGNLAAEVG" misc_feature complement(3415492..3415521) /locus_tag="Rv3054c" /note="PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site" gene 3416081..3416695 /locus_tag="Rv3055" /db_xref="GeneID:888882" CDS 3416081..3416695 /locus_tag="Rv3055" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3055, (MTCY22D7.26c), len: 204 aa. Possible transcriptional regulatory protein, similar to Q9RD23|SCM1.20c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (234 aa), FASTA scores: opt: 471, E(): 4.6e-23, (44.9% identity in 187 aa overlap); and with low similarity to other e.g. Q9ADK8|2SCK31.12 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (198 aa), FASTA scores: opt: 208, 2.5e-06, (32.9% identity in 155 aa overlap); Q9ADD9|SCBAC20F6.11c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (199 aa), FASTA scores: opt: 182, E(): 0.00012, (31.0% identity in 184 aa overlap). Contains potential helix-turn-helix motif from aa 48 to 69 (+3.42 SD). SO MAY BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217571.1" /db_xref="GI:15610192" /db_xref="GeneID:888882" /translation="MSGAERLGDLPVFARQEPVPERGDAARNRALLLEAARRLIARSG ADAITMDDVAAAAGVGKGTLFRRFGSRAGLMMVLLDEDERASQQAFLFGPPPLGPDAP PLDRLIAFGRERMRFVHAHHQLLSEANRDPQTRHSAALSVLRTHLRVLLASAPTTGDL DAQTDALLALLDVDYVEHQLNAGGHTLQTLGDAWESLARKLCGR" gene 3416705..3417745 /gene="dinP" /locus_tag="Rv3056" /db_xref="GeneID:887519" CDS 3416705..3417745 /gene="dinP" /locus_tag="Rv3056" /EC_number="2.7.7.7" /function="THOUGHT TO BE INVOLVED IN DNA METABOLISM AND MUTAGENESIS [CATALYTIC ACTIVITY: N DEOXYNUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {DNA}N]." /experiment="experimental evidence, no additional details recorded" /note="involved in translesion DNA polymerization with beta clamp of polymerase III; belongs to Y family of polymerases; does not contain proofreading function" /codon_start=1 /transl_table=11 /product="DNA polymerase IV" /protein_id="NP_217572.1" /db_xref="GI:15610193" /db_xref="GeneID:887519" /translation="MPTAAPRWILHVDLDQFLASVELLRHPELAGLPVIVGGNGDPTE PRKVVTCASYEARAYGVRAGMPLRTAARRCPEATFLPSNPAAYNAASEEVVALLRDLG YPVEVWGWDEAYLAVAPGTPDDPIEVAEEIRKVILSQTGLSCSIGISDNKQRAKIATG LAKPAGIYQLTDANWMAIMGDRTVEALWGVGPKTTKRLAKLGINTVYQLAHTDSGLLM STFGPRTALWLLLAKGGGDTEVSAQAWVPRSRSHAVTFPRDLTCRSEMESAVTELAQR TLNEVVASSRTVTRVAVTVRTATFYTRTKIRKLQAPSTDPDVITAAARHVLDLFELDR PVRLLGVRLELA" gene complement(3417799..3418662) /locus_tag="Rv3057c" /db_xref="GeneID:888617" CDS complement(3417799..3418662) /locus_tag="Rv3057c" /EC_number="1.1.-.-" /function="UNKNOWN, BUT SIMILAR TO VARIOUS OXIDOREDUCTASES AND ENZYMES INVOLVED IN POLYKETIDES SYNTHESIS" /note="Rv3057c, (MTCY22D7.24), len: 287 aa. Probable oxidoreductase, probably short-chain alcohol dehydrogenase/reductase (EC 1.1.-.-). Equivalent to Q9CBP7|ML1740 POSSIBLE SHORT CHAIN DEHYDROGENASES/REDUCTASE from Mycobacterium leprae (312 aa), FASTA scores: opt: 1563, E(): 6e-89, (81.8% identity in 280 aa overlap). Also similar to many oxidoreductases e.g. Q9ZBX8|SCD78.21c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (585 aa), FASTA scores: opt: 541, E(): 6.7e-26, (37.25% identity in 263 aa overlap); AAK47506|MT3170 OXIDOREDUCTASE, SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY from Mycobacterium tuberculosis strain CDC1551 (276 aa), FASTA scores: opt: 521, E(): 6.1e-25, (36.25% identity in 276 aa overlap); AAK45541|MT1283 OXIDOREDUCTASE, SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY from Mycobacterium tuberculosis strain CDC1551 (276 aa), FASTA scores: opt: 471, E(): 7.2e-22, (32.4% identity in 281 aa overlap). Also similar to O50460|Rv1245c|MTV006.17C DEHYDROGENASE (276 aa). Contains short-chain alcohol dehydrogenase family signature (PS00061). MAY BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_217573.1" /db_xref="GI:15610194" /db_xref="GeneID:888617" /translation="MLQRGAGQYFAGKRCFVTGAASGIGRATALRLAAQGAELYLTDR DRDGLAQTVCDARALGAQVPEHRVLDVSDYQDVAAFAADIHARHPSMDVVLNIAGVSA WGTVDQLTHDQWSRMVAINLMGPIHVIETLVPPMVAAGRGGHLVNVSSAAGLVGLPWH AAYSASKYGLRGLSEVLRFDLARHGIGVSVVVPGAVKTPLVNTVEIAGVDRDDPRVNR WVERFSGHAVTPEKAADKILAGVTRNRYLVYTSADIRALYAFKRYAWWPYTLVMRRVN VFFTRALRPGP" misc_feature complement(3418129..3418215) /locus_tag="Rv3057c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene complement(3418726..3419376) /locus_tag="Rv3058c" /db_xref="GeneID:888878" CDS complement(3418726..3419376) /locus_tag="Rv3058c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3058c, (MTCY22D7.23), len: 216 aa. Possible transcriptional regulatory protein, tetR-family, showing reasonable similarity to others e.g. AAK48337|MT3970 from Mycobacterium tuberculosis strain CDC1551 (216 aa), FASTA scores: opt: 261, E(): 2.8e-10, (31.7% identity in 221 aa overlap); Q49962|ML1070|U1756B from Mycobacterium leprae (217 aa), FASTA scores: opt: 234, E(): 1.8e-08, (27.2% identity in 195 aa overlap); Q9CDD3|ML0064 from Mycobacterium leprae (214 aa), FASTA scores: opt: 199, E(): 3.6e-06, (25.65% identity in 195 aa overlap); O66121|CPRS from Streptomyces coelicolor (215 aa), FASTA scores: opt: 183, E(): 4.2e-05, (26.0% identity in 196 aa overlap). Equivalent to AAK47476|MT3144 from Mycobacterium tuberculosis strain CDC1551 (237 aa) but N-terminus shorter 21 residues. Start was predicted by TBparse but alternatives (ATG) are possible. COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217574.1" /db_xref="GI:15610195" /db_xref="GeneID:888878" /translation="MTSHAADEKQAAPPMRRRGDRHRQAILRAARELLEETPFAELSV RAISLRAGVARSGFYFYFDSKYSVLAQILAEATEELEEASQHFSARQPGESPEQFVNR MIGSVAAVYANNDPVLRACNAARQSDMEIRDILERQFQVLLRETIGVFEAEVKAGTAH PISEDLPTLVRTLAATTALMLTGDALLVGPDSDAARRVRVLEQMWLNALWGGGKAP" gene 3419492..3420970 /gene="cyp136" /locus_tag="Rv3059" /db_xref="GeneID:888883" CDS 3419492..3420970 /gene="cyp136" /locus_tag="Rv3059" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv3059, (MTCY22D7.22c), len: 492 aa. Probable cyp136, cytochrome P450 136 (EC 1.14.-.-), similar to other cytochrome P450-dependent oxidases e.g. Q59990|CYP120|CYP|SLR0574 PUTATIVE CYTOCHROME P450 120 from Synechocystis sp. strain PCC 6803 (444 aa), FASTA scores: opt: 579, E(): 1.5e-29, (27.3% identity in 443 aa overlap); Q64654|CYP51|CP51_RAT CYTOCHROME P450 51 (EC 1.14.14.-) (LANOSTEROL 14-ALPHA DEMETHYLASE) from Rattus norvegicus (Rat) (503 aa), FASTA scores: opt: 549, E(): 1.4e-27, (26.2% identity in 458 aa overlap); Q9JIY3|CYP51 LANOSTEROL 14-ALPHA-DEMETHYLASE from Mus musculus (Mouse) (486 aa), FASTA scores: opt: 546, E(): 2.1e-27, (25.75% identity in 458 aa overlap). Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="cytochrome P450 136" /protein_id="NP_217575.1" /db_xref="GI:15610196" /db_xref="GeneID:888883" /translation="MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKK LAEPPPGSGLKPVVGDAGLPILGHMIEMLRGGPDYLMFLYKTKGPVVFGDSAVLPGVA ALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRRIMQEAFVRSR LAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDIASMVFMGHEPGTDHELVT KVNKAFTITTRAGNAVIRTSVPPFTWWRGLRARELLENYFTARVKERREASGNDLLTV LCQTEDDDGNRFSDADIVNHMIFLMMAAHDTSTSTATTMAYQLAAHPEWQQRCRDESD RHGDGPLDIESLEQLESLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIA YPGMNHRLPEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFDQLEIK TILHRLLRRYRLELSRPDYQPRWDYSAMPIPMDGMPIVLRPR" misc_feature 3420785..3420814 /gene="cyp136" /locus_tag="Rv3059" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene complement(3421741..3423213) /locus_tag="Rv3060c" /db_xref="GeneID:888855" CDS complement(3421741..3423213) /locus_tag="Rv3060c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3060c, (MTCY22D7.21), len: 490 aa. Probable transcriptional regulatory protein, showing reasonable similarity to several members of the GntR family e.g. BAB54431|MLL8575 from Rhizobium loti (Mesorhizobium loti) (247 aa), FASTA scores: opt: 274, E(): 3.5e-10, (30.35% identity in 224 aa overlap); P96570|ESMR from Burkholderia cepacia (Pseudomonas cepacia) (277 aa), FASTA scores: opt: 229, E(): 2.8e-07, (25.85% identity in 240 aa overlap); Q9S276|SCI28.07 from Streptomyces coelicolor (230 aa), FASTA scores: opt: 211, E(): 3.4e-06, (27.25% identity in 220 aa overlap); etc. Seems to have two domains: residues 1-260 resemble UxuR, and 260-490 resemble PdhR, ExuR, etc. Contains bacterial regulatory proteins, GntR family signature (PS00043). Helix-turn-helix motif (+3.13 SD) at aa 38-59. SEEMS TO BELONG TO THE GNTR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="GntR family transcriptional regulator" /protein_id="NP_217576.1" /db_xref="GI:15610197" /db_xref="GeneID:888855" /translation="MSTEPDAVWTDKRASKIARRIEADIVRRGWPIGASLGSESALQQ RFCVSRSVLREAVRLVEHHQVARMRRGPNGGLFICEPNAGPATRAVVIYLEYLGTTIG DLLGARLVLEPLAASLAAEHIDEPGIERLRAVLRAEERWRPGLPPPPEQFYRVLAEQS KNPVLQLFIDILMRLTKRYVQKSGTQSAGEAVEAAGQVHNEHSDIVAAVTAGDSAWAK TLSERHVEAVAGWLQQHQRGNDAAVRNGGRAREPRRAQQLILGAPRGKLAEVLAATIG DDIAASGWQVGSVFGTETALLERYQVSRAVLREAVRLLEYHAIAHMRRGPGGGLVVTT PQPQASIDTIALYLQYRKPSREDLRCVRDAIEIDNVAKVVKRRSEPEVASFLDTLGRP RLDNPTDDVRAAAVEEFRFHVGLARAAGNTMLDLFLLILVELFRRHLSSTEQALPTWS DVVAVGHAHVRILEAIGSGDDSLARCRTRRHLDAAASWWL" misc_feature complement(3422272..3422337) /locus_tag="Rv3060c" /note="PS00043 Bacterial regulatory proteins, gntR family signature" gene complement(3423262..3425427) /gene="fadE22" /locus_tag="Rv3061c" /db_xref="GeneID:887617" CDS complement(3423262..3425427) /gene="fadE22" /locus_tag="Rv3061c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3061c, (MTCY22D7.20), len: 721 aa. Probable fadE22, Acyl-CoA Dehydrogenase (EC 1.3.99.-), similar to many e.g. AAK44503|MT0284 from Mycobacterium tuberculosis strain CDC1551 (731 aa), FASTA scores: opt: 1804, E(): 1.1e-101, (43.45% identity in 743 aa overlap); AAK48037|MT3678 from Mycobacterium tuberculosis strain CDC1551 (711 aa), FASTA scores: opt: 1630, E(): 3.9e-91, (42.55% identity in 733 aa overlap); and extensive similarity in C-terminal part to many acyl-CoA dehydrogenases e.g. Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 767, E(): 4.8e-39, (36.7% identity in 376 aa overlap). Also similar to many hypothetical proteins. COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE22" /protein_id="NP_217577.1" /db_xref="GI:15610198" /db_xref="GeneID:887617" /translation="MGIALTDDHRELSGVARAFLTSQKVRWAARASLDAAGDARPPFW QNLAELGWLGLHIDERHGGSGYGLSELVVVIEELGRAVAPGLFVPTVIASAVVAKEGT DDQRARLLPALIDGTLTAGVGLDSQVQVTDGVADGEAGIVLGAGLAELLLVAAGDDVL VLERGRKGVSVDVPENFDPTRRSGRVRLDNVRVTTDDILLGAYESALARARTLLAAEA VGGAADCVDSAVAYAKVRQQFGRTIATFQAVKHHCANMLVAAESAIAAVWDAARAAAE DEEQFRLAAAVAAALAFPAYARNAELNIQVHGGIGFTWEHDAHLHLRRALVTVGLFGG DAPVRDVFERTAAGVTRAISLDLPAQAEELRARIRSDAAEIAALEKDAQRDKLIETGY VMPHWPRPWGRAAGAVEQLVIEEEFSAAGIERPDYSITGWVILTLIQHGTPWQIERFV EKALRQQEIWCQLFSEPDAGSDAASVKTRATRVEGGWKINGQKVWTSGAQYCARGLAT VRTDPDAPKHAGITTVIIDMLAPGVEVRPLRQITGDSEFNEVFFNDVFVPDEDVVGAP NSGWTVARATLGNERVSIGGSGSYYEAMAAKLVQLVQRRSDAFAGAPIRVGAFLAEDH ALRLLNLRRAARSVEGAGPGPEGNITKLKVAEHMIEGAAIAAALWGPEIALLDGPGRV IGRTVMGARGMAIAGGTSEVTRNQIAERILGMPRDPLIS" gene 3425584..3427107 /gene="ligB" /locus_tag="Rv3062" /db_xref="GeneID:887553" CDS 3425584..3427107 /gene="ligB" /locus_tag="Rv3062" /EC_number="6.5.1.1" /function="THIS PROTEIN SEALS DURING DNA REPLICATION, DNA RECOMBINATION AND DNA REPAIR NICKS IN DOUBLE-STRANDED DNA [CATALYTIC ACTIVITY: ATP + (DEOXYRIBONUCLEOTIDE)(N) + (DEOXYRIBONUCLEOTIDE)(M) = AMP + PYROPHOSPHATE + (DEOXYRIBONUCLEOTIDE)(N+M)]." /note="catalyzes the ATP-dependent formation of a phosphodiester at the site of a single-strand break in duplex DNA" /codon_start=1 /transl_table=11 /product="ATP-dependent DNA ligase" /protein_id="NP_217578.1" /db_xref="GI:15610199" /db_xref="GeneID:887553" /translation="MLLHDVAITSMDVAATSSRLTKVARIAALLHRAAPDTQLVTIIV SWLSGELPQRHIGVGWAALRSLPPPAPQPALTVTGVDATLSKIGTLPGKGSQAQRAAL VAELFSAATEAEQTFLLRLLGGELRQGAKGGIMADAVAQAAGLPAATVQRAAMLGGDL AAAAAAGLSGAALDTFTLRVGRPIGPMLAQTATSVHDALERHGGTTIFEAKLDGARVQ IHRANDQVRIYTRSLDDVTARLPEVVEATLALPVRDLVADGEAIALCPDNRPQRFQVT ASRFGRSVDVAAARATQPLSVFFFDILHRDGTDLLEAPTTERLAALDALVPARHRVDR LITSDPTDAANFLDATLAAGHEGVMAKAPAARYLAGRRGAGWLKVKPVHTLDLVVLAV EWGSGRRRGKLSNIHLGARDPATGGFVMVGKTFKGMTDAMLDWQTTRFHEIAVGPTDG YVVQLRPEQVVEVALDGVQRSSRYPGGLALRFARVVRYRADKDPAEADTIDAVRALY" misc_feature 3426208..3426234 /gene="ligB" /locus_tag="Rv3062" /note="PS00697 ATP-dependent DNA ligase AMP-binding site" misc_feature 3426832..3426855 /gene="ligB" /locus_tag="Rv3062" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3427243..3429519 /gene="cstA" /locus_tag="Rv3063" /db_xref="GeneID:887948" CDS 3427243..3429519 /gene="cstA" /locus_tag="Rv3063" /function="PEPTIDE UTILIZATION DURING CARBON STARVATION." /note="Rv3063, (MTCY22D7.18c), len: 758 aa. Probable cstA, integral membrane starvation-induced stress response protein, similar to other e.g. P15078|CSTA_ECOLI|B0598 from Escherichia coli strain K12 (701 aa), FASTA scores: opt: 2357, E(): 9.5e-137, (51.25% identity in 712 aa overlap); AAG54933|CSTA from Escherichia coli strain O157:H7 EDL933 (701 aa), FASTA scores: opt: 2356, E(): 1.1e-136, (51.1% identity in 712 aa overlap); etc. Predicted to be membrane associated. Similarity suggests start at GTG at 16801 in Y22D7 but no RBS obvious so TBparse-predicted start at 16881 taken. BELONGS TO THE CSTA FAMILY. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="carbon starvation protein A CstA" /protein_id="NP_217579.1" /db_xref="GI:15610200" /db_xref="GeneID:887948" /translation="MAAPTPSNRIEERSGHASCVRADADLPPVAILGRSPITLRHKIF FVAVAVIGALAWTVVAFFRNEPVNAVWIVVAAGCTYIIGFRFYARLIEMKVVRPRDDH ATPAEILDDGTDYVPTDRRVVFGHHFAAIAGAGPLVGPVLATQMGYLPSSIWIVVGAV LAGCVQDYLVLWISVRRRGRSLGQMVRDELGATAGVAALVGIPVIITIVIAVLALVVV RALAKSPWGVFSIAMTIPIAIFMGCYLRFLRPGRVSEVSLIGIGLLLLAVVSGDWVAH TSWGAAWFSLSPVTLCWLLISYGFAASVLPVWLLLAPRDYLSTFMKVGTIALLAIGVC AAHPIIEAPAVSKFAGSGNGPVFAGSLFPFLFITIACGALSGFHALICSGTTPKMLEK EGQMRVIGYGGMMTESFVAVIALLTAAILDQHLYFTLNAPSLHTHDSAATAAKYVNGL GLTGSPVTPDHISQAAASVGEQTIVSRTGGAPTLAFGMAEMLHRVVGGVGLKAFWYHF AIMFEALFILTTVDAGTRAARFMISDALGNFGGVLRKLQNPSWRPGAWACRLVVVAAW GSILLLGVTDPLGGINTLFPLFGIANQLLAGIALTVITVVVIKKGRLKWAWIPGIPLL WDLAVTLTASWQKIFSADPSVGYWTQHAHYAAAQHAGETAFGSATNADEINDVVRNTF VQGTLSIVFVVVVVLVVVAGVIVALKTIRGRGIPLAEDDPAPSTLFAPAGLIPTAAER KLQRRLGAPASASVAAPD" gene complement(3429825..3430250) /locus_tag="Rv3064c" /db_xref="GeneID:888879" CDS complement(3429825..3430250) /locus_tag="Rv3064c" /function="UNKNOWN" /note="Rv3064c, (MTCY22D7.17), len: 141 aa. Probable conserved integral membrane protein, similar to many e.g. Q9KY40|SCC8A.08 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 391, E(): 2.4e-18, (48.45% identity in 130 aa overlap); Q9K461|SC2H12.23c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 339, E(): 5.1e-15, (46.7% identity in 124 aa overlap); BAB48975|MLR1652 hypothetical protein from Rhizobium loti (Mesorhizobium loti) (130 aa), FASTA scores: opt: 319, E(): 8.7e-14, (41.45% identity in 123 aa overlap); Q9JR31|NMA2196|NMB0291 CONSERVED HYPOTHETICAL INNER MEMBRANE PROTEIN from Neisseria meningitidis serogroup A and B (132 aa), FASTA scores: opt: 303, E(): 9.4e-13, (43.65% identity in 126 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217580.1" /db_xref="GI:15610201" /db_xref="GeneID:888879" /translation="MVKDLDRRLAGCLPAVLSLFRLVYGLLFAGYGSMILFGWPVTSA QPVEFGSWPGWYAGVIELVAGLLIATGLFTRAVAFVASGEMAVAYFWMHQPYALWPIG GPPDGNGGTPAILFCFGFFLLVFTGGGIYSIDARRTVTA" gene 3430387..3430710 /gene="mmr" /locus_tag="Rv3065" /db_xref="GeneID:887550" CDS 3430387..3430710 /gene="mmr" /locus_tag="Rv3065" /function="INVOLVED IN TRANSPORT OF MULTIDRUGS (TETRAPHENYLPHOSPHONIUM, ERYTHROMYCIN, ETHIDIUM BROMIDE, ACRIFLAVINE, SAFRANIN O, PYRONIN Y, etc) ACROSS THE MEMBRANE (EXPORT): MULTIDRUGS RESISTANCE BY AN EXPORT MECHANISM (CONFERES RESISTANCE TO TOXIC COMPOUNDS BY REMOVING THEM FOR THE CELLS). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv3065, (MT3150.1, MTCY22D7.17c), len: 107 aa. mmr, integral membrane multidrugs resistance transporter (see citation below), equivalent to Q9CBP1|ML1756 PROBABLE MULTIDRUG RESISTANCE PROTEIN from Mycobacterium leprae (107 aa), FASTA scores: opt: 534, E(): 3.3e-28, (77.55% identity in 107 aa overlap). Also highly similar to bacterial proteins involved in resistance to ethidium bromide or methyl viologen e.g. O87866|QACG_STASP QUATERNARY AMMONIUM COMPOUND-RESISTANCE PROTEIN QACG (QUARTERNARY AMMONIUM DETERMINANT G) from Staphylococcus sp. strain ST94 (107 aa), FASTA scores: opt: 307, E(): 1.8e-13, (39.8% identity in 103 aa overlap); P96460|QAC QUATERNARY AMMONIUM COMPOUNDS RESISTANCE PROTEIN QAC from Staphylococcus aureus (107 aa), FASTA scores: opt: 304, E(): 2.8e-13, (40.4% identity in 104 aa overlap); Q57225|QACE_ECOLI QUATERNARY AMMONIUM COMPOUND-RESISTANCE PROTEIN QACE (QUARTERNARY AMMONIUM DETERMINANT E) from Escherichia coli (110 aa), FASTA scores: opt: 300, E(): 5.2e-13, (48.15% identity in 108 aa overlap); AAG55967|Z1870 METHYLVIOLOGEN RESISTANCE PROTEIN ENCODED WITHIN PROPHAGE CP-933X from Escherichia coli strain O157:H7 EDL933 (110 aa); P23895|EMRE|MVRC|EB|B0543 EMRE PROTEIN from Escherichia coli (110 aa), FASTA scores: opt: 290, E(): 2.3e-12, (43.55% identity in 101 aa overlap); etc. Also similar to the SugE protein of enteric bacteria. BELONGS TO THE SMALL MULTIDRUG RESISTANCE (SMR) PROTEIN FAMILY. Note that previously known as emrE.; emrE" /codon_start=1 /transl_table=11 /product="multidrugs-transport integral membrane protein MMR" /protein_id="YP_177922.1" /db_xref="GI:57117052" /db_xref="GeneID:887550" /translation="MIYLYLLCAIFAEVVATSLLKSTEGFTRLWPTVGCLVGYGIAFA LLALSISHGMQTDVAYALWSAIGTAAIVLVAVLFLGSPISVMKVVGVGLIVVGVVTLN LAGAH" gene 3430707..3431315 /locus_tag="Rv3066" /db_xref="GeneID:888847" CDS 3430707..3431315 /locus_tag="Rv3066" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3066, (MTCY22D7.15c), len: 202 aa. Probable transcriptional regulatory protein deoR-family, with some similarity to transcriptional regulators and hypothetical proteins, e.g. Q9X9V5|SCI7.35c HYPOTHETICAL 21.1 KDA PROTEIN from Streptomyces coelicolor (197 aa), FASTA scores: opt: 398, E(): 5.7e-19, (40.3% identity in 191 aa overlap); AAG55222|Z1073 PUTATIVE DEOR-TYPE TRANSCRIPTIONAL REGULATOR from Escherichia coli strain O157:H7 EDL933 (178 aa), FASTA scores: opt: 257, E(): 7.9e-10, (28.4% identity in 176 aa overlap); Q9HXU1|PA3699 PROBABLE TRANSCRIPTIONAL REGULATOR (TETR/ACRR FAMILY) from Pseudomonas aeruginosa (237 aa), FASTA scores: opt: 229, E(): 6.7e-08, (32.1% identity in 187 aa overlap); etc." /codon_start=1 /transl_table=11 /product="DeoR family transcriptional regulator" /protein_id="NP_217582.1" /db_xref="GI:15610203" /db_xref="GeneID:888847" /translation="MTAGSDRRPRDPAGRRQAIVEAAERVIARQGLGGLSHRRVAAEA NVPVGSTTYYFNDLDALREAALAHAANASADLLAQWRSDLDKDRDLAATLARLTTVYL ADQDRYRTLNELYMAAAHRPELQRLARLWPDGLLALLEPRIGRRAANAVTVFFDGATL HALITGTPLSTDELTDAIARLVADGPEQREVGQSAHAGRTPD" gene 3431428..3431838 /locus_tag="Rv3067" /db_xref="GeneID:888822" CDS 3431428..3431838 /locus_tag="Rv3067" /function="UNKNOWN" /note="Rv3067, (MTCY22D7.14c), len: 136 aa. Conserved hypothetical protein, weakly similar to other mycobacterium proteins e.g. O53953|Rv1804c|MTV049.26c (108 aa), FASTA scores: opt: 183, E(): 0.00053, (36.6% identity in 82 aa overlap); O07222|Rv1810|MTCY16F9.04c (118 aa), FASTA scores: opt: 149, E(): 0.05, (30.95% identity in 84 aa overlap). Has hydrophobic stretch at N-terminus. Start chosen on basis of codon usage but upstream ATG also possible." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217583.1" /db_xref="GI:15610204" /db_xref="GeneID:888822" /translation="MLTVGVGIGAAILLGWFTLAHRHPDQPGAAATPPPAGLTTRSAP TAAPPSTLQSPDLDSVFLGNLHDRGISFTNPDAAVYNGKMVCTNLGGGMTVQQVVEAL QSSSPALGDRTTAYVAVSIRTYCPKYDAVLPPGS" gene complement(3431840..3431912) /locus_tag="Rvnt37" /note="tRNA-Ala(GGC)" /db_xref="GeneID:2700438" tRNA complement(3431840..3431912) /locus_tag="Rvnt37" /product="tRNA-Ala" /note="codon recognized: GCC" /anticodon=(pos:3431877..3431879,aa:Ala) /db_xref="GeneID:2700438" gene complement(3431979..3433622) /gene="pgmA" /locus_tag="Rv3068c" /db_xref="GeneID:888826" CDS complement(3431979..3433622) /gene="pgmA" /locus_tag="Rv3068c" /EC_number="5.4.2.2" /function="THIS ENZYME PARTICIPATES IN BOTH THE BREAKDOWN AND SYNTHESIS OF GLUCOSE [CATALYTIC ACTIVITY: ALPHA-D-GLUCOSE 1-PHOSPHATE = ALPHA-D-GLUCOSE 6-PHOSPHATE]." /note="catalyzes the interconversion of alpha-D-glucose 1-phosphate to alpha-D-glucose 6-phosphate" /codon_start=1 /transl_table=11 /product="phosphoglucomutase" /protein_id="NP_217584.1" /db_xref="GI:15610205" /db_xref="GeneID:888826" /translation="MVANPRAGQPAQPEDLVDLPHLVTAYYSIEPDPDDLAQQVAFGT SGHRGSALTGTFNELHILAITQAIVEYRAAQGTTGPLFIGRDTHGLSEPAWVSALEVL AANQVVAVVDSRDRYTPTPAISHAILTYNRGRTEALADGIVVTPSHNPPSDGGIKYNP PNGGPADTAATTAIAKRANEILLARSMVKRLPLARALRTAQRHDYLGHYVDDLPNVVD IAAIREAGVRIGADPLGGASVDYWGEIAHRHGLDLTVVNPLVDATWRFMTLDTDGKIR MDCSSPDAMAGLIRTMFGNRERYQIATGNDADADRHGIVTPDEGLLNPNHYLAVAIEY LYTHRPSWPAGIAVGKTVVSSSIIDRVVAGIGRQLVEVPVGFKWFVDGLIGATLGFGG EESAGASFLRRDGSVWTTDKDGIIMALLAAEILAVTGATPSQRYHALAGEYGGPCYAR IDAPADREQKARLARLSADQVSATELAGEPITAKLTTAPGNGAALGGLKVTTANAWFA ARPSGTEDVYKIYAESFRGPQHLVEVQQTAREVVDRVIG" misc_feature complement(3432570..3432593) /gene="pgmA" /locus_tag="Rv3068c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(3433158..3433202) /gene="pgmA" /locus_tag="Rv3068c" /note="PS00710 Phosphoglucomutase and phosphomannomutase phosphoserine signature" gene 3433692..3434090 /gene="ccrB" /locus_tag="Rv3069" /db_xref="GeneID:888859" CDS 3433692..3434090 /gene="ccrB" /locus_tag="Rv3069" /function="UNKNOWN" /note="may be involved in chromosome condensation; overexpression in Escherichia coli protects against decondensation by camphor; overexpressing the protein results in an increase in supercoiling" /codon_start=1 /transl_table=11 /product="camphor resistance protein CrcB" /protein_id="NP_217585.1" /db_xref="GI:15610206" /db_xref="GeneID:888859" /translation="MPNHDYRELAAVFAGGALGALARAALSALAIPDPARWPWPTFTV NVVGAFLVGYFTTRLLERLPLSSYRRPLLGTGLCGGLTTFSTMQVETISMIEHGHWGL AAAYSVVSITLGLLAVHLATVLVRRVRIRR" gene 3434087..3434467 /gene="ccrB" /locus_tag="Rv3070" /db_xref="GeneID:888651" CDS 3434087..3434467 /gene="ccrB" /locus_tag="Rv3070" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="may be involved in chromosome condensation; overexpression in Escherichia coli protects against decondensation by camphor; overexpressing the protein results in an increase in supercoiling" /codon_start=1 /transl_table=11 /product="camphor resistance protein CrcB" /protein_id="NP_217586.1" /db_xref="GI:15610207" /db_xref="GeneID:888651" /translation="MTASTALTVAIWIGVMLIGGIGSVLRFLVDRSVARRLARTFPYG TLTVNITGAALLGFLAGLALPKDAALLAGTGFVGAYTTFSTWMLETQRLGEDRQMVSA LANIVVSVVLGLAAALLGQWIAQI" gene 3434464..3435573 /locus_tag="Rv3071" /db_xref="GeneID:888649" CDS 3434464..3435573 /locus_tag="Rv3071" /function="UNKNOWN" /note="Rv3071, (MTCY22D7.10c), len: 369 aa. Conserved hypothetical protein, weakly similar in N-terminus of Q9A4V0|CC2725 HYPOTHETICAL PROTEIN CC2725 from Caulobacter crescentus (113 aa), FASTA scores: opt: 141, E(): 0.031, (27.6% identity in 105 aa overlap). C-terminal region also weakly similar to other hypothetical proteins e.g. Q9FC38|YG11_STRCO from Streptomyces coelicolor (114 aa), FASTA scores: opt: 151, E(): 0.007, (31.65% identity in 98 aa overlap). TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217587.1" /db_xref="GI:15610208" /db_xref="GeneID:888649" /translation="MNEQCLKLTAYFGERQRAVGGAGRFLADAMLDLFGSHNVATSVM LRGTTSFGPKHEFRCDQSLSLSEDPPVTVAAVDIESKIRSLVDDVTAMTDRGLVTLER ARLVTRHSGAEEFGDIDSRNGDAAKLTIYAGRQVRVAGAPAYYTICELLHRHGFAGAT VLLGVDGTAHGRRRRARFFGRNVNVPLMIIAVGTPAQVAVAAMELTAALPNPLLTIER VRLCKRDGELFARPQQLPQTDDQGRTLWQKLMVHTAEATHHEGLPIHRALVHRLMQSE TARGATALRGIWGFYGDHKPHGDKLFQLVRRVPVTTIIVDTPQAIARSFDIVDELTNW HGLVTSEMVPAAVSLTGSRDGTQKTGETPLARYDY" gene complement(3435798..3436322) /locus_tag="Rv3072c" /db_xref="GeneID:887175" CDS complement(3435798..3436322) /locus_tag="Rv3072c" /function="UNKNOWN" /note="Rv3072c, (MTCY22D7.09), len: 174 aa. Hypothetical protein, similar in part to O87779 HYPOTHETICAL 18.1 KDA PROTEIN (FRAGMENT) from Mycobacterium paratuberculosis (166 aa), FASTA scores: opt: 238, E(): 2.5e-08, (42.6% identity in 108 aa overlap); Q9AH10 PUTATIVE F420-DEPENDENT DEHYDROGENASE from Rhodococcus erythropolis (295 aa), FASTA scores: opt: 228, E(): 1.7e-07, (34.25% identity in 111 aa overlap); P71557|Y953_MYCTU|Rv0953c|MTCY10D7.21 POSSIBLE OXIDOREDUCTASE from Mycobacterium tuberculosis strain H37Rv (304 aa), FASTA scores: opt: 208, E(): 3.2e-06, (38.9% identity in 108 aa overlap); etc. N-terminal region similar to several proteins from Mycobacterium tuberculosis (see MAST results on the web site http://www.genolist.pasteur.fr/TubercuList/mast/P18.1.htm l) . TBparse score is 0.976." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217588.1" /db_xref="GI:15610209" /db_xref="GeneID:887175" /translation="MACVRRSCDVTGTARAGIGAGADPAVVDAVAVAADDCGFATLWV GEHVVMVDRPASRYPYSRDGVIAVPAQADWLDPMIALSFAAAASSRVDVATGVLLLPE HNPVIVAKEAASLDRLSGRRLTLGVASDGPRRSSTRSECHSSGAQSAPPNTSLQCAHY GATTSHRSTATVGS" gene complement(3436329..3436685) /locus_tag="Rv3073c" /db_xref="GeneID:888647" CDS complement(3436329..3436685) /locus_tag="Rv3073c" /function="UNKNOWN" /note="Rv3073c, (MTCY22D7.08), len: 118 aa. Conserved hypothetical protein, highly similar to other e.g. Q9F3D7|SC2H2.18 from Streptomyces coelicolor (119 aa), FASTA scores: opt: 399, E(): 2.5e-20, (53.05% identity in 115 aa overlap); Q9K4K9|SC5F8.15c from Streptomyces coelicolor (117 aa), FASTA scores: opt: 334, E(): 6e-16, (49.1% identity in 112 aa overlap); Q9HKD5|TA0666 from Thermoplasma acidophilum (134 aa), FASTA scores: opt: 334, E(): 6.7e-16, (42.35% identity in 111 aa overlap); BAB53507|MLL7394 from Rhizobium loti (Mesorhizobium loti) (120 aa), FASTA scores: opt: 309, E(): 3e-14, (43.65% identity in 110 aa overlap); etc. TBparse score is 0.885." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217589.1" /db_xref="GI:15610210" /db_xref="GeneID:888647" /translation="MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIW CKDVAPSKELREWYHHQPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRH VARSHAAVLAQLLNGR" gene 3436779..3438053 /locus_tag="Rv3074" /db_xref="GeneID:888027" CDS 3436779..3438053 /locus_tag="Rv3074" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv3074, (MTCY22D7.07c), len: 424 aa. Conserved hypothetical protein, highly similar but shorter (46 aa) to P71806|Rv1378c|MTCY02B12.12c HYPOTHETICAL 51.3 KDA PROTEIN from Mycobacterium tuberculosis (475 aa), FASTA scores: opt: 2009, E(): 5.8e-113, (72.95% identity in 429 aa overlap); and also similar to other hypothetical mycobacterium proteins e.g. O33266|Rv0336|MTCY279.03 (503 aa), FASTA scores: opt: 337, E(): 7.5e-13, (28.6% identity in 381 aa overlap); O33360|Rv0515|MTCY20G10.05 (503 aa), FASTA scores: opt: 337, E(): 7.5e-13, (28.6% identity in 381 aa overlap); etc. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217590.1" /db_xref="GI:15610211" /db_xref="GeneID:888027" /translation="MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDA ARRAAEGAAGVPAARRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDC GALSEWRATLIVRESACLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDP QAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRG QVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMV ASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAP IRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTG SRHRSGAPPHLPAVTVSELEVRIGIALARYAA" gene complement(3438050..3438973) /locus_tag="Rv3075c" /db_xref="GeneID:887178" CDS complement(3438050..3438973) /locus_tag="Rv3075c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3075c, (MTCY22D7.06), len: 307 aa. Conserved hypopthetical protein, with some similarity to Q9I562|PA0883 PROBABLE ACYL-CoA LYASE BETA CHAIN from Pseudomonas aeruginosa (275 aa), FASTA scores: opt: 408, E(): 9.2e-19, (35.15% identity in 273 aa overlap); Q9S2U9|SC4G6.02 PUTATIVE CITRATE LYASE BETA CHAIN from Streptomyces coelicolor (274 aa), FASTA scores: opt: 384, E(): 3.1e-17, (34.7% identity in 265 aa overlap); O06162|CITE|Rv2498c|MTCY07A7.04c from Mycobacterium tuberculosis (273 aa), FASTA scores: opt: 349, E(): 5.1e-15, (35.2% identity in 264 aa overlap); etc. Several initiation codons possible, first one chosen. TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217591.1" /db_xref="GI:15610212" /db_xref="GeneID:887178" /translation="MTSMYEQVDTNTADPVAGSRIDPVLARSWLLVNGAHGDRFESAA HSRADIVVLDIEDAVAPKDKHAARDNAVRWFGDGNADWVRINGFGTPWWADDLAMLAD SPVGGVMLAMVESVDHVTETAKRLPNVPIVALVETARGLERINEIAAAKGTFRLAFGI GDFRRDTGFGEDPATLAYARSRFTIAARAAGLPSAIDGPTIGSNALKLIEATAVSAEF GMTGKICLSPDQCPVVNEGLSPSQDEIVWAKEFFAEFARDGGEIRNGSDLPRIARATK ILDLARAYGIEVSDFEDEPVHMPAPTDTYHY" gene 3439072..3439548 /locus_tag="Rv3076" /db_xref="GeneID:888026" CDS 3439072..3439548 /locus_tag="Rv3076" /function="UNKNOWN" /note="Rv3076, (MTCY22D7.05c), len: 158 aa. Conserved hypothetical protein, weakly similar to Q9AK12|SC8D11.07 HYPOTHETICAL 17.0 KDA PROTEIN from Streptomyces coelicolor (151 aa), FASTA scores: opt: 110, E(): 1.5, (25.5% identity in 145 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217592.1" /db_xref="GI:15610213" /db_xref="GeneID:888026" /translation="MVLDGVVSDTRRSRTIAARQQTIWDVLADFGSLSSWVEGVDHSC VLNHGPDGGALGSTRRVQVGRNTLVERVIEFDPPTTLAYRIEGLPARLRKVTNRWTLR PADPVGAVTVVTLTSTIEIGGNPLARLAELVVGRAMAKRSNTMLAGLAQRLEDKHG" gene 3439541..3441352 /locus_tag="Rv3077" /db_xref="GeneID:887419" CDS 3439541..3441352 /locus_tag="Rv3077" /EC_number="3.1.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3077, (MTCY22D7.04c), len: 603 aa. Possible hydrolase (EC 3.1.-.-), with some similarity to variety of hydrolases (aryl- and steryl sulfatases principaly) e.g. Q45087|PEHA PHOSPHONATE MONOESTER HYDROLASE from Burkholderia caryophylli (514 aa), FASTA scores: opt: 239, E(): 7.2e-07, (23.95% identity in 413 aa overlap); Q9I1E5|PA2333 PROBABLE SULFATASE from Pseudomonas aeruginosa (538 aa), FASTA scores: opt: 231, E(): 2.3e-06, (28.1% identity in 516 aa overlap); P31447|YIDJ_ECOLI|B3678 PUTATIVE SULFATASE (EC 3.1.6.-) from Escherichia coli (497 aa), FASTA scores: opt: 222, E(): 7.4e-06, (27.7% identity in 390 aa overlap); etc. TBparse score is 1.002. Note that previously known as atsF.; atsF" /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="YP_177923.1" /db_xref="GI:57117053" /db_xref="GeneID:887419" /translation="MANRPDIIIVMTDEERAVPPYESAEVLAWRQRSLTGRRWFDEHG ISFTRHYTGSLACVPSRPTIFTGQYPDLHGVTQTDGIGKRFDDSRLRWLRAGEVPTLG NWFRAAGYDTHYDGKWHISHADLEDPATGAPLATNDNEGVVDSAAVRRYLDADPLGPY GFSGWVGPEPHGAGLANSGFRRDPLVADRVVAWLTERYARRRAGDTAAMRPFLLVASF VNPHDIVLFPAWVWRSPLKPSPLDPPHVPAAPTADEDLSTKPAAQVAYREAYYSGYGL TRMVSRNYARNAQRYRDLYYRLHAEVDGPIDRVGRAVTEGGSEDAMLVRTSDHGDLLG AHGGLHQKWFNLYDEATRVPFVIARIGEKATQPRTVSAPTSHVDLVPTLLSAAGVDVD VVAAALAESFSEVHPLPGRDLMPVVDGASADEGRAIYLMTRDNVLEGDTGASLLSRQL GRIVNPPAPLRIKVPAHVAANFEGLVVRVDDTDAAGGAGHLWKLVRTFDDPATWTEPG VRHLATNGMGGDAYRTDPLDDQWELYDLTADPIEAYNRWTDPQLHELRQHLRMLLKQQ RAVSVPERNQPWPYAHRLPPSGASNGLVRRVLGRFVR" gene 3441353..3441754 /gene="hab" /locus_tag="Rv3078" /db_xref="GeneID:888891" CDS 3441353..3441754 /gene="hab" /locus_tag="Rv3078" /EC_number="5.-.-.-" /function="ENZYME INVOLVED IN THE SECOND STEP OF NITROBENZENE DEGRADATION: REARRANGED THE INTERMEDIATE HYDROXYLAMINOBENZENE TO 2-AMINOPHENOL." /note="Rv3078, (MTCY22D7.03c), len: 133 aa. Probable hab, hydroxylaminobenzene mutase (5.-.-.-) (see Davis et al., 2000), highly similar to two hydroxylaminobenzene mutases from Pseudomonas pseudoalcaligenes O52214|HABA (135 aa), FASTA scores: opt: 495, E(): 6.8e-25, (51.1% identity in 133 aa overlap); and O52216|HABB (164 aa), FASTA scores: opt: 479, E(): 8.2e-24, (51.9% identity in 133 aa overlap) (see Davis et al., 2000); and to Q9AH35|NBZB HYDROXYLAMINOBENZENE MUTASE from Pseudomonas putida (164 aa), FASTA scores: opt: 476, E(): 1.3e-23, (51.8% identity in 133 aa overlap) (see Park & Kim 2000). Gene name according to Pseudomonas pseudoalcaligenes nomenclature. Also similarity with putative different membrane proteins involved in transport (protein predicted to be a transmembrane protein). TBparse score is 0.870." /codon_start=1 /transl_table=11 /product="hydroxylaminobenzene mutase HAB" /protein_id="NP_217594.1" /db_xref="GI:15610215" /db_xref="GeneID:888891" /translation="MQKLLFTIGLALFLIGLLTGLVIPALKNPRMALSSHLEGVLNGM FLVVLGLLWPHIDLPEAWQVIAVALIVYSAYANWLATLLAAAWGAGRKFAPIATGDHK APAAKEGFVSFLLLSLSVAIVIGVVIVIIGL" gene complement(3441770..3442597) /locus_tag="Rv3079c" /db_xref="GeneID:888660" CDS complement(3441770..3442597) /locus_tag="Rv3079c" /function="UNKNOWN" /note="Rv3079c, (MTCY22D7.02), len: 275 aa. Conserved hypothetical protein, similar to other hypothetical mycobacterium proteins e.g. P71557|Y953_MYCTU|Rv0953c|MTCY10D7.21 POSSIBLE OXIDOREDUCTASE from Mycobacterium tuberculosis strain H37Rv (282 aa), FASTA scores: opt: 668, E(): 2.4e-34, (40.55% identity in 281 aa overlap); O06216|Rv2161c|MTCY270.07 from Mycobacterium tuberculosis strain H37Rv (288 aa), FASTA scores: opt: 595, E(): 8.5e-30, (40.9% identity in 274 aa overlap); O87779 from Mycobacterium paratuberculosis (166 aa), FASTA scores: opt: 464, E(): 7.2e-22, (41.55% identity in 166 aa overlap); etc. Also some similarity to other proteins e.g. Q9AH10 PUTATIVE F420-DEPENDENT DEHYDROGENASE from Rhodococcus erythropolis (295 aa), FASTA scores: opt: 401, E(): 9.6e-18, (30.2% identity in 288 aa overlap); Q9AE04|RIF17 RIF17 PROTEIN from Amycolatopsis mediterranei (356 aa), FASTA scores: opt: 298, E(): 2.8e-11, (35.0% identity in 203 aa overlap); AAK48081|MT3720 LUCIFERASE-RELATED PROTEIN from Mycobacterium tuberculosis strain CDC1551 (395 aa), FASTA scores: opt: 223, E(): 1.4e-06, (29.4% identity in 211 aa overlap). TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217595.1" /db_xref="GI:15610216" /db_xref="GeneID:888660" /translation="MQFGVLTFVTDEGIGPAELGAALEHRGFESLFLAEHTHIPVNTQ SPYPGGGPIPEKYYRTLDPFVALAAAAATTQSLVLGTGIALIPERDPIVTAKEVASLD LVSQGRFRFGVGVGWLREEVANHGVDPAVRGRVIDERLRAIIEIWTQEQAEFHGTYVD FDPIYCWPKPVTKPYPPLYVGGGPANFPRIARLNAGWIAISPSPQRLSGPLQRLRAMA GGDVPVTVCQWGEAAAKDLEGYRHLGVERVLLELPTEPRDPTLRYLDKLQAELARLA" gene complement(3442656..3445988) /gene="pknK" /locus_tag="Rv3080c" /db_xref="GeneID:888659" CDS complement(3442656..3445988) /gene="pknK" /locus_tag="Rv3080c" /EC_number="2.7.1.-" /function="INVOLVED IN SIGNAL TRANSDUCTION (VIA PHOSPHORYLATION). INVOLVED IN TRANSCRIPTIONAL REGULATORY MECHANISM AND IN THE REGULATION OF SECONDARY METABOLITES [CATALYTIC ACTIVITY: ATP + A PROTEIN = ADP + A PHOSPHOPROTEIN]." /note="Rv3080c, (MTV013.01c-MTCY22D7.01), len: 1110 aa. Probable pknK, serine/threonine protein kinase involved in transcriptional regulatory function (EC 2.7.1.-) (see citation below). Similar but shorter in N-terminus (approximatively 300 residues) to others e.g. Q48411|ACOK TRANSCRIPTIONAL REGULATORY PROTEIN OF aco ABCD operon from Klebsiella pneumoniae (921 aa), FASTA scores: opt: 886, E(): 7.6e-37, (27.75% identity in 829 aa overlap); Q9HX92|PA3921 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS) (906 aa), FASTA scores: opt: 760, E(): 1.5e-30, (29.55% identity in 822 aa overlap); Q9I2X9|PA1760 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS) (907 aa), FASTA scores: opt: 696, E(): 2.3e-27, (25.85% identity in 685 aa overlap); P06993|MALT (alias BAB37683|ECS4260 and AAG58520|MALT) POSITIVE REGULATOR OF MAL REGULON from Escherichia coli strain O157:H7 (901 aa), FASTA scores: opt: 660, E(): 1.4e-25, (29.25% identity in 530 aa overlap); Q9KNF3|VCA0011 MALT REGULATORY PROTEIN from Vibrio cholerae (BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS) (921 aa), FASTA scores: opt: 626, E(): 7.2e-24, (25.8% identity in 659 aa overlap); etc. N-terminal region similar to N-terminus of serine/threonine kinases e.g. Q9KK90|PKMA SERINE/THREONINE KINASE (SIMILAR TO THE SER/THR FAMILY OF PROTEIN KINASES) from Amycolatopsis mediterranei (589 aa), FASTA scores: opt: 545, E(): 5.7e-20, (34.45% identity in 334 aa overlap); Q9RPT5|AMK SERINE/THREONINE PROTEIN KINASE HOMOLOG (SIMILAR TO THE SER/THR FAMILY OF PROTEIN KINASES) from Amycolatopsis mediterranei (606 aa), FASTA scores: opt: 537, E(): 1.5e-19, (35.55% identity in 346 aa overlap); Q9L0I0|PKAD PROTEIN SERINE/THREONINE KINASE from Streptomyces coelicolor (599 aa), FASTA scores: opt: 520, E(): 1e-18, (36.1% identity in 324 aa overlap); etc. N-terminal part also similar to O53510|PKNL_MYCTU|Rv2176|MT2232|MTV021.09 PROBABLE SERINE/THREONINE-PROTEIN KINASE from Mycobacterium tuberculosis strain H37Rv (399 aa), FASTA scores: opt: 511, E(): 2.1e-18, (35.15% identity in 313 aa overlap). Contains PS00107 Protein kinases ATP-binding region signature and PS00017 ATP/GTP-binding site motif A (P-loop). Contains Hank's kinase subdomain. FIRST PART OF THE PROTEIN SEEMS BELONG TO THE SER/THR FAMILY OF PROTEIN KINASES, AND SECOND PARTS SEEMS BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="serine/threonine-protein kinase transcriptional regulatory protein" /protein_id="NP_217596.1" /db_xref="GI:15610217" /db_xref="GeneID:888659" /translation="MTDVDPHATRRDLVPNIPAELLEAGFDNVEEIGRGGFGVVYRCV QPSLDRAVAVKVLSTDLDRDNLERFLREQRAMGRLSGHPHIVTVLQVGVLAGGRPFIV MPYHAKNSLETLIRRHGPLDWRETLSIGVKLAGALEAAHRVGTLHRDVKPGNILLTDY GEPQLTDFGIARIAGGFETATGVIAGSPAFTAPEVLEGASPTPASDVYSLGATLFCAL TGHAAYERRSGERVIAQFLRITSQPIPDLRKQGLPADVAAAIERAMARHPADRPATAA DVGEELRDVQRRNGVSVDEMPLPVELGVERRRSPEAHAAHRHTGGGTPTVPTPPTPAT KYRPSVPTGSLVTRSRLTDILRAGGRRRLILIHAPSGFGKSTLAAQWREELSRDGAAV AWLTIDNDDNNEVWFLSHLLESIRRVRPTLAESLGHVLEEHGDDAGRYVLTSLIDEIH ENDDRIAVVIDDWHRVSDSRTQAALGFLLDNGCHHLQLIVTSWSRAGLPVGRLRIGDE LAEIDSAALRFDTDEAAALLNDAGGLRLPRADVQALTTSTDGWAAALRLAALSLRGGG DATQLLRGLSGASDVIHEFLSENVLDTLEPELREFLLVASVTERTCGGLASALAGITN GRAMLEEAEHRGLFLQRTEDDPNWFRFHQMFADFLHRRLERGGSHRVAELHRRASAWF AENGYLHEAVDHALAAGDPARAVDLVEQDETNLPEQSKMTTLLAIVQKLPTSMVVSRA RLQLAIAWANILLQRPAPATGALNRFETALGRAELPEATQADLRAEADVLRAVAEVFA DRVERVDDLLAEAMSRPDTLPPRVPGTAGNTAALAAICRFEFAEVYPLLDWAAPYQEM MGPFGTVYAQCLRGMAARNRLDIVAALQNFRTAFEVGTAVGAHSHAARLAGSLLAELL YETGDLAGAGRLMDESYLLGSEGGAVDYLAARYVIGARVKAAQGDHEGAADRLSTGGD TAVQLGLPRLAARINNERIRLGIALPAAVAADLLAPRTIPRDNGIATMTAELDEDSAV RLLSAGDSADRDQACQRAGALAAAIDGTRRPLAALQAQILHIETLAATGRESDARNEL APVATKCAELGLSRLLVDAGLA" misc_feature complement(3444864..3444887) /gene="pknK" /locus_tag="Rv3080c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(3445824..3445895) /gene="pknK" /locus_tag="Rv3080c" /note="PS00107 Protein kinases ATP-binding region signature" gene 3446040..3447278 /locus_tag="Rv3081" /db_xref="GeneID:888650" CDS 3446040..3447278 /locus_tag="Rv3081" /function="UNKNOWN" /note="Rv3081, (MTV013.02), len: 412 aa. Conserved hypothetical protein. Second part of the protein (approximatively residues 250-412) shares weak similarity with other hypothetical proteins e.g. Q9YEU3|APE0488 from Aeropyrum pernix (188 aa), FASTA scores: opt: 149, E(): 0.019, E(): 0.019, (29.5% identity in 173 aa overlap); and first part shares weak similarity with C-terminal part of Q9RVT9|DR0933 ALPHA-AMLYASE from Deinococcus radiodurans (644 aa), FASTA scores: opt: 127, E(): 1.4, (27.25% identity in 198 aa overlap). Equivalent to AAK47502|MT3166 HYPOTHETICAL 48.3 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (436 aa) but shorter 24 aa in N-terminus. Contains PS00850 Glycine radical signature and possible helix-turn-helix motif at aa 53-74. TBparse score is 0.940." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217597.1" /db_xref="GI:15610218" /db_xref="GeneID:888650" /translation="MTPHYRQAAASRLDTHRTQKLRSQTNGGKDRHQLTYEQFARMLT LMGPSDLWTVERAARHWGVSASRARAILSSRHIHRVSGYPAQAIKAVTLRQGARTDLK TANHLVPAAQAFTMAETGAAIGETEDERARLRIFFEFLRGADETGTSALDLIVDEPAL IGEHRFDALLAAAAEYISARWGRPGPLWSVSIERFLDTAWWVSDLPSARAFAAVWTPA PFRRRGIYLDRHDLTSDGVCVMPEPVFNRTELQRAFTALAAKLERRGVVGQVHVVGGA AMLLAYNSRVTTRDIDALFSTDGPMLEAIREVADEMGWPRTWLNNQASGYVSRTPGEG APVFDHPFLHVVATPAQHLLAMKVVAARGVRDGEDIRLLLDRLRITSAAGVWEIVARY FPAETITDRSRLLVEDLLNQ" misc_feature 3446268..3446294 /locus_tag="Rv3081" /note="PS00850 Glycine radical signature" gene complement(3447404..3448426) /gene="virS" /locus_tag="Rv3082c" /db_xref="GeneID:888657" CDS complement(3447404..3448426) /gene="virS" /locus_tag="Rv3082c" /function="MAY HAVE A ROLE IN THE REGULATION OF PROTEINS NECESSARY FOR VIRULENCE." /experiment="experimental evidence, no additional details recorded" /note="Rv3082c, (MT3167, MTV013.03c), len: 340 aa. virS, transcriptional regulatory protein araC/xylS family, probably involved in virulence (see citations below). Similar to many transcriptional regulators araC/xylS family e.g. Q9HZ25|PA3215 PROBABLE TRANSCRIPTIONAL REGULATOR (ARAC/XYLS FAMILY) from Pseudomonas aeruginosa (337 aa), FASTA scores: opt: 379, E(): 3e-17, (30.4% identity in 306 aa overlap); Q9Z3Y6|PHBR POLYHYDROXYBUTYRATE TRANSCRIPTIONAL ACTIVATOR from Pseudomonas sp. 61-3 (379 aa), FASTA scores: opt: 336, E(): 2e-14, (26.35% identity in 334 aa overlap); P72171|ORUR|PA0831 ORNITHINE UTILIZATION TRANSCRIPTIONAL REGULATOR oruR from Pseudomonas aeruginosa (339 aa), FASTA scores: opt: 274, E(): 1.9e-10, (23.7% identity in 321 aa overlap); Q9ZFW7 VIRULENCE REGULATING HOMOLOG from Pseudomonas alcaligenes (346 aa), FASTA scores: opt: 262, E(): 1.2e-09, (24.5% identity in 339 aa overlap); etc. Also similar to O69703|Rv3736|MTV025.084 PUTATIVE REGULATORY PROTEIN (ARAC/XYLS FAMILY) from Mycobacterium tuberculosis strain H37Rv (353 aa), FASTA scores: opt: 656, E(): 3.5e-35, (36.95% identity in 333 aa overlap). Has potential helix-turn-helix motif at positions 252-273. BELONGS TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="virulence-regulating transcriptional regulator VirS" /protein_id="NP_217598.1" /db_xref="GI:15610219" /db_xref="GeneID:888657" /translation="MELGSLIRATNLWGYTDLMRELGADPLPFLRRFDIPPGIEHQED AFMSLAGFVRMLEASAAELDCPDFGLRLARWQGLGILGPVAVIARNAATLFGGLEAIG RYLYVHSPALTLTVSSTTARSNVRFGYEVTEPGIPYPLQGYELSMANAARMIRLLGGP QARARVFSFRHAQLGTDAAYREALGCTVRFGRTWCGFEVDHRLAGRPIDHADPETKRI ATKYLESQYLPSDATLSERVVGLARRLLPTGQCSAEAIADQLDMHPRTLQRRLAAEGL RCHDLIERERRAQAARYLAQPGLYLSQIAVLLGYSEQSALNRSCRRWFGMTPRQYRAY GGVSGR" gene 3448504..3449991 /locus_tag="Rv3083" /db_xref="GeneID:888655" CDS 3448504..3449991 /locus_tag="Rv3083" /function="UNKNOWN; POSSIBLE OXIDOREDUCTASE INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3083, (MTV013.04), len: 495 aa. Probable monooxygenase (EC 1.-.-.-), highly similar to other putative monooxygenases flavin-binding family e.g. AAK48336|MT3969 from Mycobacterium tuberculosis strain CDC1551 (489 aa), FASTA scores: opt: 1692, E(): 4.9e-98, (49.7% identity in 489 aa overlap); Q9A588|CC2569 from Caulobacter crescentus (498 aa), FASTA scores: opt: 1684, E(): 1.6e-97, (52.25% identity in 484 aa overlap); Q9APW3 from Pseudomonas aeruginosa (508 aa), FASTA scores: opt: 1603, E(): 1.8e-92, (49.8% identity in 484 aa overlap); etc. TBparse score is 0.882." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_217599.1" /db_xref="GI:15610220" /db_xref="GeneID:888655" /translation="MNQHFDVLIIGAGLSGIGTACHVTAEFPDKTIALLERRERLGGT WDLFRYPGVRSDSDMFTFGYKFRPWRDVKVLADGASIRQYIADTATEFGVDEKIHYGL KVNTAEWSSRQCRWTVAGVHEATGETRTYTCDYLISCTGYYNYDAGYLPDFPGVHRFG GRCVHPQHWPEDLDYSGKKVVVIGSGATAVTLVPAMAGSNPGSAAHVTMLQRSPSYIF SLPAVDKISEVLGRFLPDRWVYEFGRRRNIAIQRKLYQACRRWPKLMRRLLLWEVRRR LGRSVDMSNFTPNYLPWDERLCAVPNGDLFKTLASGAASVVTDQIETFTEKGILCKSG REIEADIIVTATGLNIQMLGGMRLIVDGAEYQLPEKMTYKGVLLENAPNLAWIIGYTN ASWTLKSDIAGAYLCRLLRHMADNGYTVATPRDAQDCALDVGMFDQLNSGYVKRGQDI MPRQGSKHPWRVLMHYEKDAKILLEDPIDDGVLHFAAAAQDHAAA" gene 3449997..3450923 /gene="lipR" /locus_tag="Rv3084" /db_xref="GeneID:888652" CDS 3449997..3450923 /gene="lipR" /locus_tag="Rv3084" /EC_number="3.1.1.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3084, (MTV013.05), len: 308 aa. Probable lipR, N-Acetyl-hydrolase/esterase (EC 3.1.1.-), similar to other e.g. Q01109|BAH_STRH from Streptomyces hygroscopicus (299 aa), FASTA scores: opt: 558, E(): 4.1e-26, (40.25% identity in 246 aa overlap); Q9X8J4|SCE9.22 from Streptomyces coelicolor (266 aa), FASTA scores: opt: 544, E(): 2.5e-25, (36.95% identity in 257 aa overlap); Q56171|DEA from Streptomyces viridochromogenes (299 aa), FASTA scores: opt: 532, E(): 1.4e-24, (38.6% identity in 254 aa overlap); etc. Also similar to O06350|LIPF|Rv3487c|MTCY13E12.41c (277 aa), FASTA score: opt: 291, E(): 8.5e-10, (28.5% identity in 239 aa overlap). MAY BE BELONG TO THE 'GDXG' FAMILY OF LIPOLYTIC ENZYMES. TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="acetyl-hydrolase/esterase LipR" /protein_id="NP_217600.1" /db_xref="GI:15610221" /db_xref="GeneID:888652" /translation="MNLRKNVIRSVLRGARPLFASRRLGIAGRRVLLATLTAGARAPK GTRFQRVSIAGVPVQRVQPPHAATSGTLIYLHGGAYALGSARGYRGLAAQLAAAAGMT ALVPDYTRAPHAHYPVALEEMAAVYTRLLDDGLDPKTTVIAGDSAGGGLTLALAMALR DRGIQAPAALGLICPWADLAVDIEATRPALRDPLILPSMCTEWAPRYVGSSDPRLPGI SPVYGDMSGLPPIVMQTAGDDPICVDADKIETACAASKTSIEHRRFAGMWHDFHLQVS LLPEARDAIADLGARLRGHLHQSQGQPRGVVK" gene 3450920..3451750 /locus_tag="Rv3085" /db_xref="GeneID:888656" CDS 3450920..3451750 /locus_tag="Rv3085" /function="UNKNOWN; SUPPOSED INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3085, (MTV013.06), len: 276 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various oxidoreductases in the short chain dehydrogenases/reductases family e.g. Q9CC98|ML1094 SHORT CHAIN ALCOHOL DEHYDROGENASE from Mycobacterium leprae (277 aa), FASTA scores: opt: 1059, E(): 4.8e-56, (61.65% identity in 266 aa overlap); Q9I3H6|PA1537 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginos (295 aa), FASTA scores: opt: 858, E(): 4.7e-44, (48.4% identity in 285 aa overlap); Q9CBP7|ML1740 POSSIBLE SHORT CHAIN REDUCTASE from Mycobacterium leprae (312 aa), FASTA scores: opt: 500, E(): 1e-22, (36.6% identity in 257 aa overlap); etc. Also similar to mycobacterium proteins O50460|Rv1245c|MTV006.17c DEHYDROGENASE SIMILAR TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES FAMILY (276 aa), FASTA scores: opt: 1200, E(): 1.9e-64, (65.2% identity in 273 aa overlap); and P95101|Rv3057c|MTCY22D7.24 HYPOTHETICAL DEHYDROGENASE (287 aa). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. TBparse score is 0.860." /codon_start=1 /transl_table=11 /product="short-chain type dehydrogenase/reductase" /protein_id="NP_217601.1" /db_xref="GI:15610222" /db_xref="GeneID:888656" /translation="MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLA KTVRLAQALGAQVKSDRLDVAEREAVLAHADAVVAHFGTVHQVYNNAGIAYNGNVDKS EFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVPGQSAYNAAKF AVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATVADGEDQQTFAEFFDRRLA LHSPEMAAKTIVNGVAKGQARVVVGLEAKAVDVLARIMGSSYQRLVAAGVAKFFPWAK" misc_feature 3451343..3451429 /locus_tag="Rv3085" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 3451781..3452887 /gene="adhD" /locus_tag="Rv3086" /db_xref="GeneID:888654" CDS 3451781..3452887 /gene="adhD" /locus_tag="Rv3086" /EC_number="1.1.1.-" /function="UNKNOWN; GENERATES AN ALDEHYDE (OR PERHAPS A KETONE) FROM AN ALCOHOL." /experiment="experimental evidence, no additional details recorded" /note="Rv3086, (MTV013.07), len: 368 aa. Probable adhD, zinc-type alcohol dehydrogenase (EC 1.1.1.-), highly similar to many e.g. O69045 HYPOTHETICAL ALCOHOL DEHYDROGENASE from Rhodococcus rhodochrous (370 aa), FASTA scores: opt: 1255, E(): 8.7e-68, (50.4% identity in 367 aa overlap); P25406|ADHB_UROHA ALCOHOL DEHYDROGENASE I-B from Uromastyx hardwickii (Indian spiny-tailed lizard) (375 aa), FASTA scores: opt: 787, E(): 8.2e-40, (35.9% identity in 373 aa overlap); P72324||ADHI_RHOSH ALCOHOL DEHYDROGENASE CLASS III from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (376 aa), FASTA scores: opt: 787, E(): 8.3e-40, (35.1% identity in 379 aa overlap). Also highly similar to P71818|Rv0761c|MTCY369.06c HYPOTHETICAL ZINC-TYPE ALCOHOL DEHYDROGENASE-LIKE PROTEIN from Mycobacterium tuberculosis strain H37Rv (375 aa), FASTA scores: opt: 1186, E(): 1.2e-63, (47.3% identity in 368 aa overlap). Contains PS00059 Zinc-containing alcohol dehydrogenases signature. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE. TBparse score is 0.871. POSSIBLY REQUIRES ZINC FOR ITS ACTIVITY." /codon_start=1 /transl_table=11 /product="zinc-type alcohol dehydrogenase AdhD" /protein_id="NP_217602.1" /db_xref="GI:15610223" /db_xref="GeneID:888654" /translation="MKTTAAVLFEAGKPFELMELDLDGPGPGEVLVKYTAAGLCHSDL HLTDGDLPPRFPIVGGHEGSGVIEEVGAGVTRVKPGDHVVCSFIPNCGTCRYCCTGRQ NLCDMGATILEGCMPDGSFRFHSQGTDFGAMCMLGTFAERATVSQHSVVKVDDWLPLE TAVLVGCGVPSGWGTAVNAGNLRAGDTAVIYGVGGLGINAVQGATAAGCKYVVVVDPV AFKRETALKFGATHAFADAASAAAKVDELTWGQGADAALILVGTVDDEVVSAATAVIG KGGTVVITGLADPAKLTVHVSGTDLTLHEKTIKGSLFGSCNPQYDIVRLLRLYDAGQL MLDELVTTTYNLEQVNQGYQDLRDGKNIRGVIVH" misc_feature 3451958..3452002 /gene="adhD" /locus_tag="Rv3086" /note="PS00059 Zinc-containing alcohol dehydrogenases signature" gene 3452925..3454343 /locus_tag="Rv3087" /db_xref="GeneID:888653" CDS 3452925..3454343 /locus_tag="Rv3087" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3087, (MTV013.08), len: 472 aa. Hypothetical protein, similar to several Mycobacterium tuberculosis proteins e.g. MTCY08D5.16, MTCY28.26, MTCY493.29c. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa). TBparse score is 0.880." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217603.1" /db_xref="GI:15610224" /db_xref="GeneID:888653" /translation="MRRLNGVDALMLYLDGGSAYNHTLKISVLDPSTDPDGWSWPKAR QMFEERAHLLPVFRLRYLPTPLGLHHPIWVEDPEFDLDAHVRRVVCPAPGGMAEFCAL VEQIYAHPLDRDRPLWQTWVVEGLDGGRVALVTLLHHAYSDGVGVLDMLAAFYNDTPD EAPVVAPPWEPPPLPSTRQRLGWALRDLPSRLGKIAPTVRAVRDRVRIEREFAKDGDR RVPPTFDRSAPPGPFQRGLSRSRRFSCESFPLAEVREVSKTLGVTINDVFLACVAGAV RRYLERCGSPPTDAMVATMPLAVTPAAERAHPGNYSSVDYVWLRADIADPLERLHATH LAAEATKQHFAQTKDADVGAVVELLPERLISGLARANARTKGRFDTFKNVVVSNVPGP REPRYLGRWRVDQWFSTGQISHGATLNMTVWSYCDQFNLCVMADAVAVRNTWELLGGF RASHEELLAAARAQATPKEMAT" gene 3454340..3455764 /locus_tag="Rv3088" /db_xref="GeneID:888669" CDS 3454340..3455764 /locus_tag="Rv3088" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3088, (MTV013.09), len: 474 aa. Hypothetical protein, similar to several Mycobacterium tuberculosis proteins e.g. MTCY31.23 (505 aa), MTCY13E12.34c (497 aa) and MTCY493.29c (459 aa). Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa). TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217604.1" /db_xref="GI:15610225" /db_xref="GeneID:888669" /translation="MTRINPIDLSFLLLERANRPNHMAAYTIFEKPKGQKSSFGPRLF DAYRHSQAAKPFNHKLKWLGTDVAAWETVEPDMGYHIRHLALPAPGSMQQFHETVSFL NTGLLDRGHPMWECYIIDGIERGRIAILLKVHHALIDGEGGLRAMRNFLSDSPDDTTL AGPWMSAQGADRPRRTPATVSRRAQLQGQLQGMIKGLTKLPSGLFGVSADAADLGAQA LSLKARKASLPFTARRTLFNNTAKSAARAYGNVELPLADVKALAKATGTSVNDVVMTV IDDALHHYLAEHQASTDRPLVAFMPMSLREKSGEGGGNRVSAELVPMGAPKASPVERL KEINAATTRAKDKGRGMQTTSRQAYALLLLGSLTVADALPLLGKLPSANVVISNMKGP TEQLYLAGAPLVAFSGLPIVPPGAGLNVTFASINTALCIAIGAAPEAVHEPSRLAELM QRAFTELQTEAGTTSPTTSKSRTP" gene 3455761..3457272 /gene="fadD13" /locus_tag="Rv3089" /db_xref="GeneID:888666" CDS 3455761..3457272 /gene="fadD13" /locus_tag="Rv3089" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3089, (MTV013.10), len: 503 aa. Probable fadD13, Acyl-CoA Synthetase (EC 6.2.1.-), similar to many e.g. MTCI28.06, MTCY08D5.09, MTCY06G11.08 from Mycobacterium tuberculosis strain H37Rv; and to Q9F7P5 PREDICTED ACID--CoA LIGASE FADD13 from uncultured proteobacterium EBAC31A08 (504 aa), FASTA scores: opt: 1126, E(): 2.4e-62, (38.85% identity in 502 aa overlap); Q9EY88|FCS FERULOYL-CoA SYNTHETASE from Amycolatopsis sp. strain HR167 (491 aa), FASTA scores: opt: 1073, E(): 4.5e-59, (38.5% identity in 504 aa overlap); BAB49118|MLR1843 PROBABLE ACID-CoA LIGASE from Rhizobium loti (Mesorhizobium loti) (495 aa), FASTA scores: opt: 937, E(): 1.2e-50, (36.2% identity in 503 aa overlap); Q9KZC1|SC6F7.21 PROBABLE LONG-CHAIN-FATTY-ACID-CoA LIGASE from Streptomyces coelicolor (511 aa), FASTA scores: opt: 899, E(): 2.8e-48, (36.1% identity in 510 aa overlap); Q9A5P7|CC2400 PUTATIVE ACID-CoA LIGASE from Caulobacter crescentus (496 aa), FASTA scores: opt: 874, E(): 9.8e-47, (35.1% identity in 507 aa overlap); etc. Contains PS00455 Putative AMP-binding domain signature and PS00061 Short-chain alcohol dehydrogenase family signature. TBparse score is 0.877." /codon_start=1 /transl_table=11 /product="chain-fatty-acid-CoA ligase" /protein_id="NP_217605.1" /db_xref="GI:15610226" /db_xref="GeneID:888666" /translation="MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCAD VLTALGIAKGDRVALLMPNSVEFCCLFYGAAKLGAVAVPINTRLAAPEVSFILSDSGS KVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPAVECGGDDNLF IMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRLLLPLPMFHVAALTTVIFS AMRGVTLISMPQFDATKVWSLIVEERVCIGGAVPAILNFMRQVPEFAELDAPDFRYFI TGGAPMPEALIKIYAAKNIEVVQGYALTESCGGGTLLLSEDALRKAGSAGRATMFTDV AVRGDDGVIREHGEGEVVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYL YIKDRLKDMIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQN EVSEQQIVEYCGTRLARYKLPKKVIFAEAIPRNPTGKILKTVLREQYSATVPK" misc_feature 3455935..3456021 /gene="fadD13" /locus_tag="Rv3089" /note="PS00061 Short-chain alcohol dehydrogenase family signature" misc_feature 3456241..3456276 /gene="fadD13" /locus_tag="Rv3089" /note="PS00455 Putative AMP-binding domain signature" gene 3458211..3459098 /locus_tag="Rv3090" /db_xref="GeneID:888668" CDS 3458211..3459098 /locus_tag="Rv3090" /function="UNKNOWN" /note="Rv3090, (MTCY164.01), len: 295 aa. Hypothetical unknown Ala-, Val-rich protein. Hydrophobic stretch at N-terminus." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217606.1" /db_xref="GI:15610227" /db_xref="GeneID:888668" /translation="MTWQIVFVVICVIVAGVAALFWRLPSDDTTRSRAKTVTIAAVAA AAVFFFLGCFTIVGTRQFAIMTTFGRPTGVSLNNGFHGKWPWQMTHPMDGAVQIDKYV KEGNTDQRITVRLGNQSTALADVSIRWQLKQAAAPELFQQYKTFDNVRVNLIERNLSV ALNEVFAGFNPLDPRNLDVSPLPSLAKRAADILRQDVGGQVDIFDVNVPTIQYDQSTE DKINQLNQQRAQTSIALEAQRTAEAQAKANEILSRSISDDPNVVVQNCITAAINKGIS PLGCWPGSSALPTIAVPGR" gene 3459116..3460807 /locus_tag="Rv3091" /db_xref="GeneID:888665" CDS 3459116..3460807 /locus_tag="Rv3091" /function="UNKNOWN" /note="Rv3091, (MTCY164.02), len: 563 aa. Hypothetical protein, similar in part to O60859 NEUROPATHY TARGET ESTERASE from Homo sapiens (Human) (1327 aa), FASTA scores: opt: 177, E(): 0.0062, (30.65% identity in 173 aa overlap); and Q9I385|PA1640 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (345 aa), FASTA scores: opt: 152, E(): 0.069, (27.8% identity in 180 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217607.1" /db_xref="GI:15610228" /db_xref="GeneID:888665" /translation="MPIPFADGMLSRLGRRGAALDLIEEFEDESGEPPASLSPADLLA AEPALLLQKMENRLVRHHLANPDVLSGEQLRKLRYILNFARLADFEPGAAGPGGSRGR GDISVGGQVAPWRSRVVDALYAPLREEPDPVTALEGAKDVLATLVDDQDDQRRVLIER HGSDFSATELDAEVGYKKLVTVLGGGGGAGFVYIGGMQRLLAAGQVPDYMIGSSFGSI IGSLVARELPVPIDEYAEWAKTVSYRAILGPERRRSRHGLAGMFTLRFDQFAHTLLSR ADGERMRMSDLAIPFDVVVAGVRRQPYAALPSRFRHRERSTLTLRSLPFLPIGIGPWV AARMWQVAAFIDLRVVKPIVISADGATRDVNVVDAASFSSAIPGVLHHETSDPRMLPI LDELCADQDVAAMVDGGAASNVPVELAWERVRDGRLGTRNACYLAFDCFHPHWDPRHL WLVPITQAVQLQMVRNLPYADHLVRFEPTLSPVNLAPSAAAIDRACRWGRDSVEPAIA VTSALLEPTWWEGDRPPAAEPKERTKSAASSMSAVMAAIQAPTGRFRRWRSRHLT" gene complement(3460814..3461734) /locus_tag="Rv3092c" /db_xref="GeneID:888667" CDS complement(3460814..3461734) /locus_tag="Rv3092c" /function="UNKNOWN" /note="Rv3092c, (MTCY164.03c), len: 306 aa. Probable conserved integral membrane protein, highly similar to Q9RUT5|DR1297 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (311 aa), FASTA scores: opt: 941, E(): 9.8e-51, (55.65% identity in 309 aa overlap); Q9A8B8|CC1436 HYPOTHETICAL PROTEIN from Caulobacter crescentus (314 aa), FASTA scores: opt: 791, E(): 1.6e-41, (46.9% identity in 305 aa overlap); and also highly similar to Q9I2N8|PA1857 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (307 aa), FASTA scores: opt: 373, E(): 8.1e-16, (40.8% identity in 321 aa overlap); BAB36119|ECS2696 PUTATIVE METHYL-INDEPENDENT MISMATCH REPAIR PROTEIN from Escherichia coli strain O157:H7 (305 aa), FASTA scores: opt: 335, E(): 1.7e-13, (39.75% identity in 307 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217608.1" /db_xref="GI:15610229" /db_xref="GeneID:888667" /translation="MSGGLFGLLDHVAVLARLAAASIDDIGAAAGRATAKAAGVVIDD TAVTPQYVHRITAERELPIIKRIAIGSVRNKLLLILPGALLLSQLVPWLLTPLLMLGA TYLCYEGAEKVCGVIGGRGHDAAPQVAERELVAGAIRTDFILSAEIMVIALNEVADQP FVPRLIVLVIVALVITAAVYGVVAVIVQMDDVGLRLTQTASRFGQRIGGGLVAGMPKL LSALSAVGMGAMLWVGGHIVLVGSDHLGWHAPYRLVHHLDDHLVGSAGGALTWLVSTA ACAATGLVIGIVVVALVHLVCFRPPRSRSL" gene complement(3461760..3462764) /locus_tag="Rv3093c" /db_xref="GeneID:888663" CDS complement(3461760..3462764) /locus_tag="Rv3093c" /function="UNKNOWN; COULD BE INVOLVED IN CELLULAR METABOLISM." /note="Rv3093c, (MTCY164.04c), len: 334 aa. Hypothetical oxidoreductase (EC 1.-.-.-), with some similarity with various oxidoreductases e.g. Q58929|MER|MJ1534 N5,N10-METHYLENE TETRAHYDROMETHANOPTERIN REDUCTASE (EC 1.5.99.-) from Methanococcus jannaschii (331 aa), FASTA scores: opt: 300, E(): 1.1e-10, (24.1% identity in 324 aa overlap); and Q9ZA30|GRA-ORF29 PUTATIVE FMN-DEPENDENT MONOOXYGENASE from Streptomyces violaceoruber (343 aa), FASTA scores: opt: 264, E(): 1.5e-08, (30.45% identity in 335 aa overlap); Q9CCV8|ML0348 POSSIBLE COENZYME F420-DEPENDENT OXIDOREDUCTASE from Mycobacterium leprae (350 aa), FASTA scores: opt: 220, E(): 6.4e-06, (26.5% identity in 328 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217609.1" /db_xref="GI:15610230" /db_xref="GeneID:888663" /translation="MTDIEVALPFWLDRPDHEATDVALAAADTGFAALWIGEMATYDA FALATSIGLRTPNMTLKVGPLAVGVRGPVGLALGVSSVASLTGCRVDLALGASSPAIV AGWHGRPWAHHVPVMRETIECLRSIFTGARVEYSGRHVNSRGFRLRGAAPDTRIALGA FGPGMIRLAAQHADEVVLNLASPFRVGRVRAAIDSAAAAAGRAAPRLTVCVPVAVNPG AAAHSQLAAQLAVYLAPPGYGEMFSALGFDGLVRSARSRATRRELAVAVPSELLDRVC ALGSPDRVAARLRAYADAGADCVAVVPATAEDPGGRVALRALRPGGLYGTAGDNDGRR" gene complement(3462761..3463891) /locus_tag="Rv3094c" /db_xref="GeneID:888661" CDS complement(3462761..3463891) /locus_tag="Rv3094c" /function="UNKNOWN" /note="Rv3094c, (MTCY164.05c), len: 376 aa. Conserved hypothetical protein, some similarity with various proteins e.g. Q9RMR9|NRGC NRGC PROTEIN (corresponding gene seems regulated by NifA) from Bradyrhizobium japonicum (388 aa), FASTA scores: opt: 677, E(): 5.8e-35, (34.55% identity in 353 aa overlap); P26698|PIGM_RHOSO PIGMENT PROTEIN from Rhodococcus sp. strain ATCC 21145 (387 aa), FASTA scores: opt: 480, E(): 1.2e-22, (28.7% identity in 376 aa overlap); Q9F0J3|NCNH HYDROXYLASE from Streptomyces arenae (405 aa), FASTA scores: opt: 441, E(): 3.3e-20, (29.25% identity in 352 aa overlap); etc. Equivalent to AAK47516 from Mycobacterium tuberculosis strain CDC1551 (395 aa) but N-terminus shorter 19 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217610.1" /db_xref="GI:15610231" /db_xref="GeneID:888661" /translation="MNQSETEIEILAEKIARWARARSAEIERDRRLPDELVTRLREAG LLRATMPREVAAPELAPGRALRCAEAVARGDASAGWCVSIAITSALLVAYLPARSREE MFGGGRGVAAGVWAPRGTARSVDGGVVVSGRWPFCSGINHADIMFAGCFVDDRQVPSV VALNKDELQVLDTWHTLGLRGTGSHDCVADDVFVPADRVFSVFDGPIVDRPLYRFPVF GFFALSIGAAALGNARAAIDDLVELAGGKKGLGSTRTLAERSATQAAAATAESALGAA RALFYEVIEAAWQVSHDAEAVPVTMRNRLRLAATHAVRTSADVVRSMYDLAGGTAIYD NAPLQRRFRDAFTATAHFQVNEASRELPGRVLLDQPADVSML" gene 3463973..3464449 /locus_tag="Rv3095" /db_xref="GeneID:888679" CDS 3463973..3464449 /locus_tag="Rv3095" /function="UNKNOWN. COULD BE INVOLVED IN REGULATORY MECHANISM." /note="Rv3095, (MTCY164.06), len: 158 aa. Possible regulatory protein, because contains possible helix-turn-helix motif at aa 39-61 (+4.83 SD). Similar to hypothetical proteins e.g. Q9I0C9|PA2713 from Pseudomonas aeruginosa (159 aa), FASTA scores: opt: 486, E(): 1.6e-25, (45.95% identity in 148 aa overlap); Q9AAF6|CC0645 from Caulobacter crescentus (188 aa), FASTA scores: opt: 479, E(): 5.3e-25, (45.75% identity in 153 aa overlap); Q9K408|2SCG61.07 from Streptomyces coelicolor (157 aa), FASTA scores: opt: 407, E(): 2.8e-20, (43.9% identity in 139 aa overlap); etc." /codon_start=1 /transl_table=11 /product="putative transcriptional regulatory protein" /protein_id="NP_217611.1" /db_xref="GI:15610232" /db_xref="GeneID:888679" /translation="MAVSDLSHRFEGESVGRALELVGERWTLLILREAFFGVRRFGQL ARNLGIPRPTLSSRLRMLVEVGLFDRVPYSSDPERHEYRLTEAGRDLFAAIVVLMQWG DEYLPRPEGPPIKLRHHTCGEHADPRLICTHCGEEITARNVTPEPGPGFKAKLASS" gene 3464547..3465686 /locus_tag="Rv3096" /db_xref="GeneID:888678" CDS 3464547..3465686 /locus_tag="Rv3096" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3096, (MTCY164.07), len: 379 aa. Hypothetical protein, with slight similarity to several proteins e.g. Q09671|OYEB_SCHPO|SPAC5H10.10 PUTATIVE NADPH DEHYDROGENASE C5H10.10 (EC 1.6.99.1) (OLD YELLOW ENZYME HOMOLOG) from Schizosaccharomyces pombe (Fission yeast) (392 aa), FASTA scores: opt: 125, E(): 1.1, (25.45% identity in 165 aa overlap); and Q12603|XYNA_DICTH BETA-1,4-XYLANASE (EC 3.2.1.8) (ENDO-1,4-BETA-XYLANASE) from Dictyoglomus thermophilum (352 aa), FASTA scores: opt: 124, E(): 1.2, (25.65% identity in 195 aa overlap); etc. Contains glycosyl hydrolases family 5 signature (PS00659). TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217612.1" /db_xref="GI:15610233" /db_xref="GeneID:888678" /translation="MHRRTALKLPLLLAAGTVLGQAPRAAAEEPGRWSADRAHRWYQA HGWLVGANYITSNAINQLEMFQPGTYDPRRIDNELGLARFHGFNTVRVFLHDLLWAQD APGFQTRLAQFVAIAARYHIKPLFVLFDSCWDPLPRPGRQRAPRAGVHNSGWVQSPGA ERLDDRRYASTLYNYVTGVLGQFRNDDRVLGWDLWNEPDNPARVYRKVERKDKLERVA ELLPQVFRWARTVDPVQPLTSGVWQGNWGDPGRRSTISAIQLDNADVITFHSYAAPAE FEGRIAELAPLQRPILCTEYLARSQGSTVEGILPIAKRHNVGAFNWGLVAGKTQTYLP WDSWDHPYRAPPKVWFHDLLHPNGRPYRDGEVQTIRKLNGMPSQD" misc_feature 3465114..3465143 /locus_tag="Rv3096" /note="PS00659 Glycosyl hydrolases family 5 signature" gene complement(3465778..3467091) /gene="lipY" /locus_tag="Rv3097c" /db_xref="GeneID:888677" CDS complement(3465778..3467091) /gene="lipY" /locus_tag="Rv3097c" /EC_number="3.1.1.3" /function="hydrolyzes long chain triacylglycerol" /codon_start=1 /transl_table=11 /product="triacylglycerol lipase" /protein_id="YP_177924.1" /db_xref="GI:57117054" /db_xref="GeneID:888677" /translation="MVSYVVALPEVMSAAATDVASIGSVVATASQGVAGATTTVLAAA EDEVSAAIAALFSGHGQDYQALSAQLAVFHERFVQALTGAAKGYAAAELANASLLQSE FASGIGNGFATIHQEIQRAPTALAAGFTQVPPFAAAQAGIFTGTPSGAAGFDIASLWP VKPLLSLSALETHFAIPNNPLLALIASDIPPLSWFLGNSPPPLLNSLLGQTVQYTTYD GMSVVQITPAHPTGEYVVAIHGGAFILPPSIFHWLNYSVTAYQTGATVQVPIYPLVQE GGTAGTVVPAMAGLISTQIAQHGVSNVSVVGDSAGGNLALAAAQYMVSQGNPVPSSMV LLSPWLDVGTWQISQAWAGNLAVNDPLVSPLYGSLNGLPPTYVYSGSLDPLAQQAVVL EHTAVVQGAPFSFVLAPWQIHDWILLTPWGLLSWPQINQQLGIAA" gene complement(3467210..3467662) /locus_tag="Rv3098c" /db_xref="GeneID:888676" CDS complement(3467210..3467662) /locus_tag="Rv3098c" /function="UNKNOWN" /note="Rv3098c, (MTCY164.09c), len: 150 aa. Hypothetical unknown protein (shorter version of MTCY164.09c)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217614.1" /db_xref="GI:15610235" /db_xref="GeneID:888676" /translation="MASLRIAEVDPVDRSPNHHASGSVETSSSRSRSASVRACLIHTS RSSSCSARRMTSLLRSPLRIAALMICSSFSVGRKPMVAVMSTTIADVAQSYSNCSTHS GTPTPAFAASFLLDAINAPRVIAGRFASESVRFPAAAPHGSVPSRLPV" gene complement(3467813..3468419) /gene="ssr" /locus_tag="Rvns02" /db_xref="GeneID:2700458" misc_RNA complement(3467813..3468419) /gene="ssr" /locus_tag="Rvns02" /product="10Sa RNA" /note="ssr, len: 607 nt. Match to EM_BA:MT10SARNA X60301 M.tuberculosis gene for 10Sa RNA." /function="INVOLVED IN DEGRADATION OF PROTEINS ENCODED BY ABNORMAL MESSENGER RNA." /db_xref="GeneID:2700458" gene complement(3468413..3469264) /locus_tag="Rv3099c" /db_xref="GeneID:888674" CDS complement(3468413..3469264) /locus_tag="Rv3099c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3099c, (MTCY164.10c), len: 283 aa. Conserved hypothetical protein, some similarity with hypothetical proteins e.g. Q9XA69|SCGD3.09 from Streptomyces coelicolor (274 aa), FASTA scores: opt: 384, E(): 1.8e-17, (32.7% identity in 269 aa overlap); and P71606|Y036_MYCTU|Rv0036c from Mycobacterium tuberculosis strain H37Rv (257 aa), FASTA scores: opt: 179, E(): 0.00024, (25.85% identity in 205 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217615.1" /db_xref="GI:15610236" /db_xref="UniProtKB/TrEMBL:O05777" /db_xref="GeneID:888674" /translation="MTTPGRPLTTLDKSDVLAGLFAVWHSLDALLDGLLETDWQATSP LPGWDVKAVVSHIIGTESFLLGIAAPEPDTDVSALAHVRNPIGVMNECWVRHLGTESG VGLLERFRAVTSQRRKVLASLSDDEWNAPTTTPSGPDSYGRFMRIRIFDCWMHEQDIR AAVQRPSSDDELGGPASPLVLDEIAATMGFVVGKLAKAPDGSRVLLELTGPLSRSIRV SVDGRARVVDDFGGPAPTATIRLDGLQFTRLAGGRPMSPARSQDVELGGDKELAGHIL ERLNFVI" gene complement(3469301..3469783) /gene="smpB" /locus_tag="Rv3100c" /db_xref="GeneID:888675" CDS complement(3469301..3469783) /gene="smpB" /locus_tag="Rv3100c" /function="BINDS SPECIFICALLY TO THE SSRA RNA (TMRNA) AND IS REQUIRED FOR STABLE ASSOCIATION OF SSRA WITH RIBOSOMES. THOUGHT TO BE IMPLICATED IN THE SURVIVAL OF BACTERIUM WITHIN MACROPHAGES." /note="binds to ssrA RNA (tmRNA) and is required for its successful binding to ribosomes; also appears to function in the trans-translation step by promoting accommodation of tmRNA into the ribosomal A site; SmpB protects the tmRNA from RNase R degradation in Caulobacter crescentus; both the tmRNA and SmpB are regulated in cell cycle-dependent manner; functions in release of stalled ribosomes from damaged mRNAs and targeting proteins for degradation" /codon_start=1 /transl_table=11 /product="SsrA-binding protein" /protein_id="NP_217616.1" /db_xref="GI:15610237" /db_xref="GOA:P96294" /db_xref="UniProtKB/Swiss-Prot:P96294" /db_xref="GeneID:888675" /translation="MSKSSRGGRQIVASNRKARHNYSIIEVFEAGVALQGTEVKSLRE GQASLADSFATIDDGEVWLRNAHIPEYRHGSWTNHEPRRNRKLLLHRRQIDTLVGKIR EGNFALVPLSLYFAEGKVKVELALARGKQARDKRQDMARRDAQREVLRELGRRAKGMT" gene complement(3469786..3470679) /gene="ftsX" /locus_tag="Rv3101c" /db_xref="GeneID:888673" CDS complement(3469786..3470679) /gene="ftsX" /locus_tag="Rv3101c" /function="INVOLVED IN GROWTH (PRINCIPALLY DURING LOG PHASE CELLS). THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF SEPTATION COMPONENT ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE. IS CODED IN AN OPERON ESSENTIAL FOR CELL DIVISION." /experiment="experimental evidence, no additional details recorded" /note="Rv3101c, (MTCY164.12c), len: 297 aa. Putative ftsX, cell division protein, septation component transport integral membrane protein ABC transporter (see citations below), equivalent to O32882|FTSX_MYCLE|ML0670|MLCB1779.20c CELL DIVISION PROTEIN from Mycobacterium leprae (297 aa), FASTA scores: opt: 1597, E(): 9.2e-93, (80.8% identity in 297 aa overlap); and similar to others e.g. Q9L1S7|SCE59.27c from Streptomyces coelicolor (305 aa), FASTA scores: opt: 585, E(): 1.9e-29, (34.55% identity in 304 aa overlap); O34876|FTSX_BACSU from Bacillus subtilis (296 aa), FASTA scores: opt: 318, E(): 9.1e-13, (24.65% identity in 300 aa overlap); Q9K6X3|FTSX|BH3601 from Bacillus halodurans (298 aa), FASTA scores: opt: 290, E(): 5.2e-11, (22.75% identity in 299 aa overlap); etc. BELONGS TO THE FTSX FAMILY." /codon_start=1 /transl_table=11 /product="putative cell division protein FTSX (septation component-transport integral membrane protein ABC transporter)" /protein_id="NP_217617.1" /db_xref="GI:15610238" /db_xref="GOA:P96293" /db_xref="UniProtKB/Swiss-Prot:P96293" /db_xref="GeneID:888673" /translation="MRFGFLLNEVLTGFRRNVTMTIAMILTTAISVGLFGGGMLVVRL ADSSRAIYLDRVESQVFLTEDVSANDSSCDTTACKALREKIETRSDVKAVRFLNRQQA YDDAIRKFPQFKDVAGKDSFPASFIVKLENPEQHKDFDTAMKGQPGVLDVLNQKELID RLFAVLDGLSNAAFAVALVQAIGAILLIANMVQVAAYTRRTEIGIMRLVGASRWYTQL PFLVEAMLAATMGVGIAVAGLMVVRALFLENALNQFYQANLIAKVDYADILFITPWLL LLGVAMSGLTAYLTLRLYVRR" gene complement(3470680..3471369) /gene="ftsE" /locus_tag="Rv3102c" /db_xref="GeneID:888672" CDS complement(3470680..3471369) /gene="ftsE" /locus_tag="Rv3102c" /function="INVOLVED IN GROWTH. THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF SEPTATION COMPONENT ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM. IS CODED IN AN OPERON ESSENTIAL FOR CELL DIVISION." /experiment="experimental evidence, no additional details recorded" /note="Rv3102c, (MTCY164.13_2c), len: 229 aa. Putative ftsE, cell division protein, septation component transport ATP-binding protein ABC transporter (see citations below), equivalent to O32883|FTSE|ML0669 CELL DIVISION ATP-BINDING PROTEIN from Mycobacterium leprae (229 aa), FASTA scores: opt: 1384, E(): 2.4e-74, (91.7% identity in 229 aa overlap); and similar to Q9L1S6|FTSE from Streptomyces coelicolor (229 aa), FASTA scores: opt: 914, E(): 8.7e-47, (62.85% identity in 226 aa overlap); Q9A0S4|FTSE|SPY0644 from Streptococcus pyogenes (230 aa), FASTA scores: opt: 866, E(): 5.7e-44, (57.9% identity in 228 aa overlap); Q9CGX0|FTSE from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (230 aa), FASTA scores: opt: 792, E(): 1.3e-39, (52.2% identity in 228 aa overlap); etc. Other relatives from Mycobacterium tuberculosis include: MTCY253.24; MTCY16B7.10; MTCY9C4.04c; MTCY50.01; MTCY05A6.09c; MTCY04C12.31. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and ABC transporters family signature (PS00211). BELONG TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="putative cell division ATP-binding protein FTSE (septation component-transport ATP-binding protein ABC transporter)" /protein_id="NP_217618.1" /db_xref="GI:15610239" /db_xref="GOA:O05779" /db_xref="UniProtKB/TrEMBL:O05779" /db_xref="GeneID:888672" /translation="MITLDHVTKQYKSSARPALDDINVKIDKGEFVFLIGPSGSGKST FMRLLLAAETPTSGDVRVSKFHVNKLRGRHVPKLRQVIGCVFQDFRLLQQKTVYDNVA FALEVIGKRTDAINRVVPEVLETVGLSGKANRLPDELSGGEQQRVAIARAFVNRPLVL LADEPTGNLDPETSRDIMDLLERINRTGTTVLMATHDHHIVDSMRQRVVELSLGRLVR DEQRGVYGMDR" misc_feature complement(3470911..3470955) /gene="ftsE" /locus_tag="Rv3102c" /note="PS00211 ABC transporters family signature" misc_feature complement(3471241..3471264) /gene="ftsE" /locus_tag="Rv3102c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3471413..3471850) /locus_tag="Rv3103c" /db_xref="GeneID:888798" CDS complement(3471413..3471850) /locus_tag="Rv3103c" /function="UNKNOWN" /note="Rv3103c, (MTCY164.13c), len: 145 aa. Hypothetical unknown pro-rich protein, with some similarity to Proline-rich proteins e.g. Q39789 PROLINE-RICH CELL WALL PROTEIN from Gossypium hirsutum (Upland cotton) (214 aa), FASTA scores: opt: 267, E(): 0.00014, (40% identity in 110 aa overlap). Equivalent to AAK47525 from M. mycobacterium strain CDC1551 (158 aa) but shorter 13 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217619.1" /db_xref="GI:15610240" /db_xref="UniProtKB/TrEMBL:O05780" /db_xref="GeneID:888798" /translation="MKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAP GPGDSPPTQVVPPGFVPDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAV PPPFELPPPFGPGTTTPTPPAPLPQPGPGPTAGTYPKSEPPTR" gene complement(3471852..3472778) /locus_tag="Rv3104c" /db_xref="GeneID:888685" CDS complement(3471852..3472778) /locus_tag="Rv3104c" /function="UNKNOWN" /note="Rv3104c, (MTCY164.14c), len: 308 aa. Possible conserved transmembrane protein, with some similarity to hypthetical proteins e.g. Q9L1X9|SC8E4A.26 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (408 aa), FASTA scores: opt: 514, E(): 4.3e-25, (35.2% identity in 287 aa overlap); Q9XA89|CF43A.26c HYPOTHETICAL 36.1 KDA PROTEIN from Streptomyces coelicolor (333 aa), FASTA scores: opt: 482, E(): 3.7e-23, (34.9% identity in 301 aa overlap); Q55987|SLR0765 HYPOTHETICAL 68.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (617 aa), FASTA scores: opt: 429, E(): 1.3e-19, (30.6% identity in 278 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217620.1" /db_xref="GI:15610241" /db_xref="GOA:O05781" /db_xref="UniProtKB/TrEMBL:O05781" /db_xref="GeneID:888685" /translation="MTTSGTVLATSIAQHWHNFWRGEIGDWILNRGLRIVMLLIAAVL AARFVTWLANRVTRRLDLGFTESDALVRSEATKHRQAVASVISWVSIVLIYVVVVYEV IDVLPVPVGALVGPAAVLGAALGFGAQRLVQDLLAGFFIIVEKQYGFGDLVELSMVGS PENAAGTVEDVTLRVTKLRSSEGEVFTVPNGNIVKSVNLSKDWARAVVDIPVPTSADL GRVNEVLHQECEHARHDSLLGELLLDEPTVMGVERIEVDTVTLRLVARTLPGKQFEAG RQLRVLVIRALTRAGIVTAADARAAVAESPEQ" gene complement(3472768..3473904) /gene="prfB" /locus_tag="Rv3105c" /db_xref="GeneID:888820" CDS complement(3472768..3473904) /gene="prfB" /locus_tag="Rv3105c" /function="PEPTIDE CHAIN RELEASE FACTOR 2 DIRECTS THE TERMINATION OF TRANSLATION IN RESPONSE TO THE PEPTIDE CHAIN TERMINATION CODONS UGA AND UAA." /note="recognizes the termination signals UGA and UAA during protein translation a specificity which is dependent on amino acid residues residing in loops of the L-shaped tRNA-like molecule of RF2; in some organisms control of PrfB protein levels is maintained through a +1 ribosomal frameshifting mechanism; this protein is similar to release factor 1" /codon_start=1 /transl_table=11 /product="peptide chain release factor 2" /protein_id="NP_217621.1" /db_xref="GI:15610242" /db_xref="GOA:P66026" /db_xref="UniProtKB/Swiss-Prot:P66026" /db_xref="GeneID:888820" /translation="MPVTLAAVDPDRQADIAALDCTLTTVERVLDVEGLRSRIEKLEH EASDPHLWDDQTRAQRVTSELSHTQGELRRVEELRRRLDDLPVLYELAAEEAGAAAAD AVAEADAELKSLRADIEATEVRTLLSGEYDEREALVTIRSGAGGVDAADWAEMLMRMY IRWAEQHKYPVEVFDTSYAEEAGIKSATFAVHAPFAYGTLSVEQGTHRLVRISPFDNQ SRRQTSFAEVEVLPVVETTDHIDIPEGDVRVDVYRSSGPGGQSVNTTDSAVRLTHIPS GIVVTCQNEKSQLQNKIAAMRVLQAKLLERKRLEERAELDALKADGGSSWGNQMRSYV LHPYQMVKDLRTEYEVGNPAAVLDGDLDGFLEAGIRWRNRRNDD" misc_feature complement(3473098..3473148) /gene="prfB" /locus_tag="Rv3105c" /note="PS00745 Prokaryotic-type class I peptide chain release factors signature" gene 3474007..3475377 /gene="fprA" /locus_tag="Rv3106" /db_xref="GeneID:888839" CDS 3474007..3475377 /gene="fprA" /locus_tag="Rv3106" /EC_number="1.18.1.2" /function="GENERATES OXIDIZED FERREDOXIN FROM FERREDOXIN [CATALYTIC ACTIVITY: REDUCED FERREDOXIN + NADP(+) = OXIDIZED FERREDOXIN + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="Rv3106, (MTCY164.16), len: 456 aa. fprA, NADPH:adrenodoxin oxidoreductase (NADPH-ferredoxin reductase) (EC 1.18.1.2) (see citations below), equivalent to O32886|MLCB1779.25|FPRA|ML0666 from Mycobacterium leprae (456 aa), FASTA scores: opt: 2505, E(): 1.2e-142, (81,05% identity in 459 aa overlap); also similar to other NADPH:adrenodoxin oxidoreductases e.g. Q9RX19|DR0496 from Deinococcus radiodurans (479 aa), FASTA scores: opt: 1331, E(): 2.6e-72, (48.9% identity in 454 aa overlap); Q9RK35|SCF15.02 from Streptomyces coelicolor (454 aa), FASTA scores: opt: 1102, E(): 1.3e-58, (41.35% identity in 462 aa overlap); P82861 from Salvelinus fontinalis (Brook trout) (498 aa), FASTA scores: opt: 827, E(): 4e-42, (41.3% identity in 460 aa overlap); Q9V3T9|ADRO_DROME from Drosophila melanogaster (Fruit fly) (466 aa), FASTA scores: opt: 790, E(): 6.3e-40, (39.45% identity in 459 aa overlap); etc. Also similar to Q10547|FPRB|Rv0886|MT0909|MTCY31.14 from Mycobacterium tuberculosis strain H37Rv (575 aa), FASTA scores: opt: 894, E(): 4.4e-46, (42.05% identity in 459 aa overlap)." /codon_start=1 /transl_table=11 /product="NADPH:adrenodoxin oxidoreductase FPRA (NADPH-ferredoxin reductase)" /protein_id="NP_217622.1" /db_xref="GI:15610243" /db_xref="GOA:O05783" /db_xref="UniProtKB/Swiss-Prot:O05783" /db_xref="GeneID:888839" /translation="MRPYYIAIVGSGPSAFFAAASLLKAADTTEDLDMAVDMLEMLPT PWGLVRSGVAPDHPKIKSISKQFEKTAEDPRFRFFGNVVVGEHVQPGELSERYDAVIY AVGAQSDRMLNIPGEDLPGSIAAVDFVGWYNAHPHFEQVSPDLSGARAVVIGNGNVAL DVARILLTDPDVLARTDIADHALESLRPRGIQEVVIVGRRGPLQAAFTTLELRELADL DGVDVVIDPAELDGITDEDAAAVGKVCKQNIKVLRGYADREPRPGHRRMVFRFLTSPI EIKGKRKVERIVLGRNELVSDGSGRVAAKDTGEREELPAQLVVRSVGYRGVPTPGLPF DDQSGTIPNVGGRINGSPNEYVVGWIKRGPTGVIGTNKKDAQDTVDTLIKNLGNAKEG AECKSFPEDHADQVADWLAARQPKLVTSAHWQVIDAFERAAGEPHGRPRVKLASLAEL LRIGLG" gene complement(3475378..3476961) /gene="agpS" /locus_tag="Rv3107c" /db_xref="GeneID:887657" CDS complement(3475378..3476961) /gene="agpS" /locus_tag="Rv3107c" /EC_number="2.5.1.26" /function="INVOLVED IN ETHER LIPID BIOSYNTHESIS [CATALYTIC ACTIVITY: 1-ACYL-GLYCERONE 3-PHOSPHATE + A LONG-CHAIN ALCOHOL = 1-ALKYL-GLYCERONE 3-PHOSPHATE + A LONG-CHAIN ACID ANION]." /note="Rv3107c, (MTCY164.17c), len: 527 aa. Possible agpS, alkyl-dihydroxyacetonephosphate synthase (EC 2.5.1.26), similar to others and some various enzymes e.g. AAK46595|MT2311 PUTATIVE ALKYL-DIHYDROXYACETONEPHOSPHATE SYNTHASE from Mycobacterium tuberculosis strain CDC1551 (529 aa), FASTA scores: opt: 1052, E(): 2.1e-58, (37.1% identity in 542 aa overlap); Q9RJ97|SCF91.28c PUTATIVE FLAVOPROTEIN from Streptomyces coelicolor (530 aa), FASTA scores: opt: 972, E(): 2.2e-53, (36.2% identity in 544 aa overlap); O96759|ADAS_DICDI ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASE (EC 2.5.1.26) from Dictyostelium discoideum (Slime mold) (611 aa), FASTA scores: opt: 617, E(): 4.5e-31, (33.95% identity in 480 aa overlap); O97157|ADAS_TRYBB ALKYLDIHYDROXYACETONEPHOSPHATE SYNTHASE from Trypanosoma brucei (613 aa), FASTA scores: opt: 567, E(): 6.2e-28, (29.15% identity in 521 aa overlap); etc. Also similar to O53525|Rv2251|MTV022.01 HYPOTHETICAL 49.8 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (475 aa), FASTA scores: opt: 1019, E(): 2.3e-56, (38.6% identity in 487 aa overlap). BELONGS TO THE FAD-BINDING OXIDOREDUCTASE/TRANSFERASE FAMILY 4. COFACTOR: FAD (BY SIMILARITY)." /codon_start=1 /transl_table=11 /product="alkyldihydroxyacetonephosphate synthase AgpS" /protein_id="NP_217623.1" /db_xref="GI:15610244" /db_xref="GOA:O05784" /db_xref="UniProtKB/TrEMBL:O05784" /db_xref="GeneID:887657" /translation="MRSWWGWGTVEDALSDQETQALQSRVAALVSGHDLSDHPPPDLT ALGLAAPRVSPPASLAALCSSDLVDRAGHARGKAYRDIARNLQGQLDHLPDLIARPRS EQDVIDVLDWCAREGIAVIPYGGGSSVVGGVEPRFDEPVVTVDVTAMSAVLEIDRVSR AARIQAGAFGPSIEHQLRPHDLTLRHFPQSFGFSTLGGWLATRSGGHFATLYTHIDDL TESLRIVTPVGISESRRLPGSGAGPSPDRLFLGSEGTLGIITEAWMRLQHRPRWQVTV SVVFDDWAAAVAATRTIAQAGLYPANCRLLDPAEALLNAGTSVGGGLLVLAFESADHP IDPWLHRAVAITAEHGGTVTAQRSRGTTSDATEHNAAANWRSAFLRMPYQRDALVRRG VIAETFETACTWDGFDTLHAAVTDAARTAIWKVCGTGVVTCRFTHVYPDGPAPYYGIY AGGRWGSLDAQWDEIKAAVSEAISASGGTITHHHAVGRDHRAWYDRQRPDPFAAALRA AKSALDPAGILNPGVLLGR" gene 3477060..3477500 /locus_tag="Rv3108" /db_xref="GeneID:888686" CDS 3477060..3477500 /locus_tag="Rv3108" /function="UNKNOWN" /note="Rv3108, (MTCY164.18), len: 146 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217624.1" /db_xref="GI:15610245" /db_xref="UniProtKB/TrEMBL:O05785" /db_xref="GeneID:888686" /translation="MTPNAASTGDSAKNTITGCCLITARALVARTRSISLPGMPFRMP ADYHNASSDEPTNRHPWPAPARCCRHEWRTMRRTNACDRRRFGLSLTIHEDACRIISV VPVVLEVRRAEPAHPATPYPEPLARCSRSPGLNESSHMSGRIPP" gene 3477649..3478728 /gene="moaA1" /locus_tag="Rv3109" /db_xref="GeneID:888836" CDS 3477649..3478728 /gene="moaA1" /locus_tag="Rv3109" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS; INVOLVED IN THE BIOSYNTHESIS OF MOLYBDOPTERIN PRECURSOR Z FROM GUANOSINE." /note="Rv3109, (MTCY164.19), len: 359 aa. Probable moaA1, molybdenum cofactor biosynthesis protein, highly similar to others e.g. P39757|MOAA_BACSU|NARA|NARAB from Bacillus subtilis (341 aa), FASTA scores: opt: 810, E(): 6.2e-44, (39.75% identity in 327 aa overlap); O67929|MOAA_AQUAE|AQ_2183 from Aquifex aeolicus (320 aa), FASTA scores: opt: 794, E(): 6e-43, (40.55% identity in 323 aa overlap); Q9ZIM6|MOAA_STACA from Staphylococcus carnosus (340 aa), FASTA scores: opt: 783, E(): 3.2e-42, (38.65% identity in 326 aa overlap); etc. Also highly similar to O53143|MOAA3|MOA3_MYCTU|MT3427 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A 3 from Mycobacterium tuberculosis strain F4 (378 aa), FASTA scores: opt: 1762, E(): 4.7e-104, (74.3% identity in 350 aa overlap); and similar to O53881|MOA2_MYCTU|MOAA2|Rv0869c|MT0892|MTV043.62 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN A 2 from Mycobacterium tuberculosis strain H37Rv (360 aa), FASTA scores: opt: 657, E(): 3e-34, (36.55% identity in 309 aa overlap). BELONGS TO THE MOAA / NIFB / PQQE FAMILY. Note that previously known as moaA.; moaA" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein A" /protein_id="YP_177925.1" /db_xref="GI:57117055" /db_xref="GOA:O05786" /db_xref="UniProtKB/Swiss-Prot:O05786" /db_xref="GeneID:888836" /translation="MSTPTLPDMVAPSPRVRVKDRCRRMMGDLRLSVIDQCNLRCRYC MPEEHYTWLPRQDLLSVKEISAIVDVFLSVGVSKVRITGGEPLIRPDLPEIVRTLSAK VGEDSGLRDLAITTNGVLLADRVDGLKAAGMKRITVSLDTLQPERFKAISQRNSHDKV IAGIKAVAAAGFTDTKIDTTVMRGANHDELADLIEFARTVNAEVRFIEYMDVGGATHW AWEKVFTKANMLESLEKRYGRIEPLPKHDTAPANRYALPDGTTFGIIASTTEPFCATC DRSRLTADGLWLHCLYAISGINLREPLRAGATHDDLVETVTTGWRRRTDRGAEQRLAQ RERGVFLPLSTLKADPHLEMHTRGG" gene 3478779..3479174 /gene="moaB1" /locus_tag="Rv3110" /db_xref="GeneID:888683" CDS 3478779..3479174 /gene="moaB1" /locus_tag="Rv3110" /EC_number="4.2.1.96" /function="THOUGHT TO BE INVOLVED IN MOLYBDOPTERIN BIOSYNTHESIS. CATALYZES THE DEHYDRATATION OF 4A-HYDROXYTETRAHYDROPTERINS [CATALYTIC ACTIVITY: (6R)-6-(L-ERYTHRO-1,2-DIHYDROXYPROPYL)-5,6,7,8-TETRAHYDRO -4 A-HYDROXYPTERIN = (6R)-6-(L-ERYTHRO-1,2- DIHYDROXYPROPYL)-7,8-DIHYDRO-6H-PTERIN + H(2)O]." /note="Rv3110, (MTCY164.20), len: 131 aa. Probable moaB1, pterin-4-alpha-carbinolamine dehydratase (EC 4.2.1.96), similar to others e.g. P73790|SSL2296 from Synechocystis sp. strain PCC 6803 (96 aa), FASTA scores: opt: 195, E(): 6.2e-07, (35.4% identity in 96 aa overlap); Q9PAB4|PHS_XYLFA|XF2604 from Xylella fastidiosa (116 aa), FASTA scores: opt: 187, E(): 2.6e-06, (36.25% identity in 102 aa overlap); AAK42360|Q97WM6|PHS_SULSO|SSO2187 from Sulfolobus solfataricus (114 aa), FASTA scores: opt: 177, E(): 1.3e-05, (34.6% identity in 78 aa overlap); etc. Also highly similar to AAK47768|MT3426 PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium tuberculosis CDC1551 (124 aa), FASTA scores: opt: 383, E(): 7.7e-20, (50.0% identity in 110 aa overlap). BELONGS TO THE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE FAMILY. Note that previously known as moaB.; moaB" /codon_start=1 /transl_table=11 /product="pterin-4-alpha-carbinolamine dehydratase MoaB1" /protein_id="YP_177926.1" /db_xref="GI:57117056" /db_xref="GOA:Q6MX13" /db_xref="UniProtKB/TrEMBL:Q6MX13" /db_xref="GeneID:888683" /translation="MTVSTPEQHEQRASHDASEGKHNVCQGRLAALADAAVSEKLGAL PGWQLLDMRLSRAFQCTNFDQSIDFMNRVASIANDINHHPDIAVLDKRSVRVTAWTRK LGYLTDIDFDLAASVEAMYATEFADRPAR" gene 3479171..3479683 /gene="moaC" /locus_tag="Rv3111" /db_xref="GeneID:888680" CDS 3479171..3479683 /gene="moaC" /locus_tag="Rv3111" /function="INVOLVED IN THE BIOSYNTHESIS OF MOLYBDOPTERIN." /note="MoaC; along with MoaA is involved in conversion of a guanosine derivative into molybdopterin precursor Z; involved in molybdenum cofactor biosynthesis" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein MoaC" /protein_id="YP_177927.1" /db_xref="GI:57117057" /db_xref="GOA:O05788" /db_xref="UniProtKB/Swiss-Prot:O05788" /db_xref="GeneID:888680" /translation="MIDHALALTHIDERGAARMVDVSEKPVTLRVAKASGLVIMKPST LRMISDGAAAKGDVMAAARIAGIAAAKRTGDLIPLCHPLGLDAVSVTITPCEPDRVKI LATTTTLGRTGVEMEALTAVSVAALTIYDMCKAVDRAMEISQIVLQEKSGGRSGVYRR SASDLACQSR" gene 3479700..3479951 /gene="moaD1" /locus_tag="Rv3112" /db_xref="GeneID:888897" CDS 3479700..3479951 /gene="moaD1" /locus_tag="Rv3112" /function="INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS." /note="Rv3112, (MTCY164.22), len: 83 aa. Probable moaD1, molybdenum cofactor biosynthesis protein (molybdopterin converting factor (subunit 1)), similar to others e.g. Q9HJF0|TA1019 from Thermoplasma acidophilum (85 aa), FASTA scores: opt: 144, E(): 0.0012, (31.7% identity in 82 aa overlap); BAB59710|TVG0556526 from Thermoplasma volcanium (90 aa), FASTA scores: opt: 144, E(): 0.0012, (31.7% identity in 82 aa overlap); P30748|MOAD_ECOLI|CHLA4|CHLM|B0784 from Escherichia coli strain K12 (81 aa), FASTA scores: opt: 116, E(): 0.11, (36.9% identity in 84 aa overlap); etc. N-terminus also highly similar to to O53375|GPHA|Rv3323c|MTV016.23c MOAD-MOAE FUSION PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 333, E(): 2e-16, (65.05% identity in 83 aa overlap); and some similarity with Rv0868c|MTV043.61c|MOAD2 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D 2 (92 aa). Note that previously known as moaD.; moaD" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein D" /protein_id="YP_177928.1" /db_xref="GI:57117058" /db_xref="GOA:Q7D640" /db_xref="UniProtKB/TrEMBL:Q7D640" /db_xref="GeneID:888897" /translation="MIKVNVLYFGAVREACDETPREEVEVQNGTDVGNLVDQLQQKYP RLRDHCQRVQMAVNQFIAPLSTVLGDGDEVAFIPQVAGG" gene 3480074..3480742 /locus_tag="Rv3113" /db_xref="GeneID:888781" CDS 3480074..3480742 /locus_tag="Rv3113" /EC_number="3.1.3.-" /function="UNKNOWN" /note="Rv3113, (MTCY164.23), len: 222 aa. Possible phosphatase (EC 3.1.3.-), with weak similarity to other phosphatases e.g. Q9KYY0|SCE33.02c from Streptomyces coelicolor (223 aa), FASTA scores: opt: 368, E(): 1.2e-16, (32.9% identity in 222 aa overlap); and Q55039|GPH_SYNP7|CBBZ PHOSPHOGLYCOLATE PHOSPHATASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (212 aa), FASTA scores: opt: 176, E(): 0.00025, (24.7% identity in 182 aa overlap)." /codon_start=1 /transl_table=11 /product="phosphatase" /protein_id="NP_217629.1" /db_xref="GI:15610250" /db_xref="GOA:O05790" /db_xref="UniProtKB/TrEMBL:O05790" /db_xref="GeneID:888781" /translation="MTSRDGFTIVWDWNGTLCDDRTILLDAVGQTLVNEGFEPLSQQQ LIQRFARPLRTFFENACGRDLLTSEWERVQSTFRRIYRSREAEVTLVEDAYDVLAQGN RSAAGQFLLSLAPHDELMHFVQKYGIAKWFNGIRGRTRPDQEKPMMLAELIMQRSLNP TRVVHIGDSLEDAAAASAVGAISVLVTGASLQPPDRVMLKQLQPFVASSLKQALQYAG GDGD" gene 3480759..3481289 /locus_tag="Rv3114" /db_xref="GeneID:888779" CDS 3480759..3481289 /locus_tag="Rv3114" /function="UNKNOWN" /note="Rv3114, (MTCY164.24), len: 176 aa. Conserved hypothetical protein, with some similarity to Q9F9W7 CYTOSINE DEAMINASE from Bifidobacterium longum (143 aa), FASTA scores: opt: 207, E(): 2.2e-07, (37.05% identity in 108 aa overlap); and Q9RV23|DR1207 CELL CYCLE PROTEIN MESJ, PUTATIVE/CYTOSINE DEAMINASE-RELATED PROTEIN from Deinococcus radiodurans (600 aa), FASTA scores: opt: 212, E(): 3.5e-07, (33.35% identity in 177 aa overlap). Equivalent to AAK47536|MT3196 CYTIDINE AND DEOXYCYTIDYLATE DEAMINASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (187 aa) but shorter 11 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217630.1" /db_xref="GI:15610251" /db_xref="GOA:O05791" /db_xref="UniProtKB/TrEMBL:O05791" /db_xref="GeneID:888779" /translation="MVAARLPFGWSADSGVTADIIEAAMELAIDTARHATAPFGAALL DVTTLRAFSGGNTYFESGDRFAHAETNVLRAAMSTLPELSNHVLISTAEPCPMCAAAS VLSGVRAIIFGTSIETLIQCGWFQIRISASDVVAASTRPTRPSVYSGFLSHKTDLLYR NSENRRAMNPWTDPSH" repeat_region 3481399..3482722 /note="IS1081-6, len: 1324 bp. Insertion sequence IS1081." /mobile_element="insertion sequence:IS1081-6" repeat_region 3481399..3481413 /note="15 bp inverted repeat at left end of IS1081: TCGCGTGATCCTTCG" gene 3481451..3482698 /locus_tag="Rv3115" /db_xref="GeneID:888790" CDS 3481451..3482698 /locus_tag="Rv3115" /function="INVOLVED IN THE TRANSPOSITION IN THE INSERTION SEQUENCE IS1081." /experiment="experimental evidence, no additional details recorded" /note="Rv3115, (MTCY164.25), len: 415 aa. Probable IS1081 transposase, similar to others. Has transposases, mutator family, signature (PS01007). Other copies are MTCY10G2.02c, MTCY441.35, MTCY77.03c. TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217631.1" /db_xref="GI:15610252" /db_xref="GOA:P96354" /db_xref="UniProtKB/TrEMBL:P96354" /db_xref="GeneID:888790" /translation="MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADAL CGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAERA LTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEAFRTRPLDAGP YTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRD LVARGLSGVALVTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRT LLHSIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIW SNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRA RAALTSTEEPAKQQTTNTPALTT" misc_feature 3482147..3482221 /locus_tag="Rv3115" /note="PS01007 Transposases, Mutator family, signature" repeat_region complement(3482708..3482722) /note="15 bp inverted repeat at right end of IS1081: TCGCGTGATCCTTCG" gene 3482776..3483945 /gene="moeB2" /locus_tag="Rv3116" /db_xref="GeneID:888808" CDS 3482776..3483945 /gene="moeB2" /locus_tag="Rv3116" /function="POSSIBLY INVOLVED IN MOLYBDOPTERIN METABOLISM (SYNTHESIS)." /note="Rv3116, (MTCY164.26), len: 389 aa. Probable moeB2, molybdopterin cofactor biosynthesis protein, equivalent to Q9CCG8|MOEZ|ML0817 PROTEIN PROBABLY INVOLVED IN MOLYBDOPTERIN BIOSYNTHESIS from Mycobacterium leprae (395 aa), FASTA scores: opt: 1433, E(): 8e-80, (57.8% identity in 384 aa overlap). Very similar to members of the HESA/MOEB/THIF family e.g. Q9FCL0|2SC3B6.02 PUTATIVE SULFURYLASE from Streptomyces coelicolor (392 aa), FASTA scores: opt: 1562, E(): 1.1e-87, (58.15% identity in 380 aa overlap); Q9XC37|PDTORFF MOEB-LIKE PROTEIN (PUTATIVE SULFURYLASE) from Pseudomonas stutzeri (Pseudomonas perfectomarina) (391 aa), FASTA scores: opt: 1311, E(): 2.1e-72, (52.4% identity in 395 aa overlap); O54307|MPT|MOEB MPT-SYNTHASE SULFURYLASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (391 aa), FASTA scores: opt: 1238, E(): 5.7e-68, (51.4% identity in 393 aa overlap); P74344|MOEB|SLL1536 MOLYBDOPTERIN BIOSYNTHESIS MOEB PROTEIN from Synechocystis sp. strain PCC 6803 (392 aa), FASTA scores: opt: 1212, E(): 2.2e-66, (46.5% identity in 398 aa overlap); etc. Also highly similar to O05860|MTCY07D11.20|MOEB1|Rv3206c PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Mycobacterium tuberculosis strain H37Rv (392 aa), FASTA scores: opt: 1445, E(): 1.5e-80, (56.25% identity in 400 aa overlap). BELONGS TO THE HesA /MoeB/ThiF FAMILY. Note that previously known as moeB.; moeB" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein MoeB" /protein_id="YP_177929.1" /db_xref="GI:57117059" /db_xref="GOA:Q7D637" /db_xref="UniProtKB/TrEMBL:Q7D637" /db_xref="GeneID:888808" /translation="MTEALIPAPSQISLTRDEVRRYSRHLIIPDIGVNGQQRLKDARV LCIGAGGLGSPALLYLAAAGVGTIGIIDGDHVDESNLQRQIIHGTSDVGRPKVESAAE AVAEINPHVRVTQYREMLTHDNALEIFGDHDLIVDGTDNFTTRYLINDAAVLAGKPYV WGSIYRFNGQTSVFWPGRGPCYRCLHPAPPPPGLVPSCAEGGVLGAICATIASIQVTE VLKLLTGVGTPLVGRLLMYEALDATYHQIRIAKNPDCAICGDAPTITELVDDSVSCAS TQSVDPELVISCDELRTKQQSDQNFLLVDVREPAEFDIAHIPGSILIPKGEIGSAAGL AQLPLDKEIVLYCKSGIRSAQALTTLKAAGLHNVKHLDGGIAEWTRTIDSSLLVY" gene 3483974..3484807 /gene="cysA3" /locus_tag="Rv3117" /db_xref="GeneID:888802" CDS 3483974..3484807 /gene="cysA3" /locus_tag="Rv3117" /EC_number="2.8.1.1" /function="MAY BE A SULFOTRANSFERASE INVOLVED IN THE FORMATION OF THIOSULFATE [CATALYTIC ACTIVITY: THIOSULFATE + CYANIDE = SULFITE + THIOCYANATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3117, (MTCY164.27, MT3199, O05793), len: 277 aa. Probable cysA3 (alternate gene name: sseC3), thiosulfate sulfurtransferase (EC 2.8.1.1) (see Wooff et al., 2002), equivalent to Q50036|CYSA|CYSA3|ML2198|THTR_MYCLE PUTATIVE SULFURTRANSFERASE THIOSULFATE from Mycobacterium leprae (277 aa). Also highly similar to other putative thiosulfate sulfurtransferases e.g. P16385|THTR_SACER|CYSA from Saccharopolyspora erythraea (Streptomyces erythraeus) (281 aa), FASTA scores: opt: 1442, E(): 1.7e-84, (75.55% identity in 274 aa overlap); Q9RXT9DR0217|DR0217 from Deinococcus radiodurans (286 aa), FASTA scores: opt: 1046, E(): 2.6e-59, (53.8% identity in 275 aa overlap); Q9HMT7|TSSA|VNG2393G from Halobacterium sp. strain NRC-1 (293 aa), FASTA scores: opt: 1030, E(): 2.7e-58, (56.1% identity in 278 aa overlap); Q9Y8N8|APE2595 from Aeropyrum pernix (218 aa), FASTA scores: opt: 808, E(): 2.7e-44, (53.5% identity in 215 aa overlap); etc. Identical second copy present as Rv0815c|AL022004|MTV043.07c|MT0837|O05793|cysA2 (277 aa) (100.0% identity in 277 aa overlap). Also shows some similarity to P96888|THT2_MYCTU|SSEA|Rv3283|MT3382|MTCY71.23 PUTATIVE THIOSULFATE SULFURTRANSFERASE from Mycobacterium tuberculosis (297 aa), FASTA scores: opt: 955, E(): 1.6e-53, (50.2% identity in 271 aa overlap); and Q59570|THT3_MYCTU|SSEB|Rv2291|MT2348|MTCY339.19c PUTATIVE THIOSULFATE SULFURTRANSFERASE from Mycobacterium tuberculosis (284 aa), FASTA scores: E(): 1.4e-14, (26.7% identity in 292 aa overlap). Contains rhodanese active site and C-terminal signatures (PS00380, PS00683). BELONGS TO THE RHODANESE FAMILY. TBparse score is 0.901.; sseC3" /codon_start=1 /transl_table=11 /product="thiosulfate sulfurtransferase CysA3" /protein_id="NP_217633.1" /db_xref="GI:15610254" /db_xref="GOA:O05793" /db_xref="UniProtKB/Swiss-Prot:O05793" /db_xref="GeneID:888802" /translation="MARCDVLVSADWAESNLHAPKVVFVEVDEDTSAYDRDHIAGAIK LDWRTDLQDPVKRDFVDAQQFSKLLSERGIANEDTVILYGGNNNWFAAYAYWYFKLYG HEKVKLLDGGRKKWELDGRPLSSDPVSRPVTSYTASPPDNTIRAFRDEVLAAINVKNL IDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSRAANEDGTFKSDEELAKLY ADAGLDNSKETIAYCRIGERSSHTWFVLRELLGHQNVKNYDGSWTEYGSLVGAPIELG S" misc_feature 3484598..3484690 /gene="cysA3" /locus_tag="Rv3117" /note="PS00380 Rhodanese active site" misc_feature 3484745..3484768 /gene="cysA3" /locus_tag="Rv3117" /note="PS00683 Rhodanese C-terminal signature" gene 3484809..3485111 /gene="sseC1" /locus_tag="Rv3118" /db_xref="GeneID:888809" CDS 3484809..3485111 /gene="sseC1" /locus_tag="Rv3118" /function="THOUGHT TO BE INVOLVED IN SULPHUR METABOLISM." /note="Rv3118, (MTCY164.28, O05794), len: 100 aa. sseC1, conserved hypothetical protein, equivalent to Q9CBC7|ML2199 HYPOTHETICAL PROTEIN from Mycobacterium leprae (100 aa), FASTA scores: opt: 545, E(): 3.1e-30, (84.0% identity in 10 aa overlap). Also similar to hypothetical proteins e.g. Q50035 from Saccharopolyspora erythraea (Streptomyces erythraeus) (101 aa), FASTA scores: opt: 345, E(): 9.7e-17, (57.15% identity in 98 aa overlap); and Q9K4H3|SCD66.02 from Streptomyces coelicolor (95 aa), FASTA scores: opt: 249, E(): 2.8e-10, (48.5% identity in 99 aa overlap). Some weak similarity with Q9ZB84|PCAG PROTOCATECHUATE 3,4-DIOXYGENASE ALPHA-SUBUNIT from Pseudomonas marginata (196 aa), FASTA scores: opt: 109, E(): 1.4, (31.3% identity in 83 aa overlap); and other bacterial proteins. Identical second copy present as Rv0814c|AL022004|MTV043.06c|SSEC2 from Mycobacterium tuberculosis (100 aa) (100.0% identity in 100 aa overlap). Note that previously known as sseC.; sseC" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177930.1" /db_xref="GI:57117060" /db_xref="GOA:Q6MX10" /db_xref="UniProtKB/TrEMBL:Q7D986" /db_xref="GeneID:888809" /translation="MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLD SSDEFTAEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT" gene 3485132..3485575 /gene="moaE1" /locus_tag="Rv3119" /db_xref="GeneID:888811" CDS 3485132..3485575 /gene="moaE1" /locus_tag="Rv3119" /function="POSSIBLY A MOLYBDENUM BIOSYNTHESIS COFACTOR. CONVERSION OF MOLYBDOPTERIN PRECURSOR Z INTO MOLYBDOPTERIN REQUIRES TRANSFER OF TWO SULFUR ATOMS TO PRECURSOR Z (TO GENERATE THE DITHIOLENE GROUP). THIS IS CATALYZED BY THE CONVERTING FACTOR COMPOSED OF A SMALL AND LARGE SUBUNIT." /note="Rv3119, (MTCY164.29), len: 147 aa. Probable moaE1, molybdopterin converting factor E (molybdopterin converting factor (subunit 2)), highly similar to others e.g. O31705|MOAE from Bacillus subtilis (157 aa), FASTA scores: opt: 390, E(): 8.6e-19, (43.95% identity in 132 aa overlap); Q9K8I7|MOAE|BH3019 from Bacillus halodurans (156 aa), FASTA scores: opt: 369, E(): 2e-17, (42.4% identity in 132 aa overlap); P30749|MOAE_ECOLI|CHLA5|B0785 from Escherichia coli strain K12 (149 aa), FASTA scores: opt: 312, E(): 1.1e-13, (38.45% identity in 130 aa overlap); etc. Also highly similar (but shorter 74 aa) to O53375|GPHA|Rv3323c|MTV016.23c MOAD-MOAE FUSION PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 733, E(): 3.9e-41, (76.2% identity in 143 aa overlap); and highly similar to O53878|MOAE2|Rv0866|MTV043.59 PUTATIVE MOLYBDOPTERIN SYNTHASE LARGE SUBUNIT from Mycobacterium tuberculosis (141 aa), FASTA scores: opt: 321, E(): 2.6e-14, (40.9% identity in 132 aa overlap). Note that previously known as moaE.; moaE" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein E" /protein_id="YP_177931.1" /db_xref="GI:57117061" /db_xref="GOA:O05795" /db_xref="UniProtKB/Swiss-Prot:O05795" /db_xref="GeneID:888811" /translation="MANVVAEGAYPYCRLTDQPLSVDEVLAAVSGPEQGGIVIFVGNV RDHNAGHDVTRLFYEAYPPMVIRTLMSIIGRCEDKAEGVRVAVAHRTGELQIGDAAVV IGASAPHRAEAFDAARMCIELLKQEVPIWKKEFSSTGAEWVGDRP" gene 3485572..3486174 /locus_tag="Rv3120" /db_xref="GeneID:888828" CDS 3485572..3486174 /locus_tag="Rv3120" /function="UNKNOWN" /note="Rv3120, (MTCY164.30), len: 200 aa. Conserved hypothetical protein, with weak similarity to several hypothetical proteins and many N-methyl transferases e.g. Q9X9V1|ORF8 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor A3(2) (208 aa), FASTA scores: opt: 177, E(): 0.00011, (34.6% identity in 130 aa overlap); Q9XA90|SCF43A.25c PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (215 aa), FASTA scores: opt: 147, E(): 0.011, (31.3% identity in 166 aa overlap); BAB52127|MLL5735 PROBABLE METHYLTRANSFERASE from Rhizobium loti (Mesorhizobium loti) (247 aa), FASTA scores: opt: 133, E(): 0.11, (29.75% identity in 158 aa overlap). Highly similar to O53374|Rv3322c|MTV016.22c POSSIBLE METHYLTRANSFERASE from Mycobacterium tuberculosis strain H37Rv (204 aa), FASTA scores: opt: 691, E(): 1.1e-38, (57.0% identity in 200 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217636.1" /db_xref="GI:15610257" /db_xref="GOA:O05796" /db_xref="UniProtKB/TrEMBL:O05796" /db_xref="GeneID:888828" /translation="MSPSPSALLADHPDRIRWNAKYECADPTEAVFAPISWLGDVLQF GVPEGPVLELACGRSGTALGLAAAGRCVTAIDVSDTALVQLELEATRRELADRLTLVH ADLCSWQSGDGRFALVLCRLFWHPPTFRQACEAVAPGGVVAWEAWRRPIDVARDTRRA EWCLKPGQPESELPAGFTVIRVVDTDGSEPSRRIIAQRSL" gene 3486509..3487711 /gene="cyp141" /locus_tag="Rv3121" /db_xref="GeneID:887409" CDS 3486509..3487711 /gene="cyp141" /locus_tag="Rv3121" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv3121, (MTCY164.31), len: 400 aa. Probable cyp141, cytochrome P-450 integral membrane protein (EC 1.14.-.-), similar to other cytochrome P450-dependent oxidases e.g. Q9X5P9|CYP107N1 from Streptomyces lavendulae (410 aa), FASTA scores: opt: 825, E(): 3.1e-42, (33.35% identity in 393 aa overlap); Q59819|OLEP|CYP107D1 from Streptomyces antibioticus (407 aa), FASTA scores: opt: 812, E(): 1.9e-41, (34.85% identity in 396 aa overlap); O32460|CYP107M1 from Actinomadura hibisca (411 aa), FASTA scores: opt: 713, E(): 1.6e-35, (31.05% identity in 396 aa overlap); P55544|CPXP_RHISN|CYP112A|Y4LD from Rhizobium sp. strain NGR234 (400 aa), FASTA scores: opt: 688, E(): 5.1e-34, (33.0% identity in 406 aa overlap); etc. Also similar to MTCY339.44c, MTCY369.22, MTCY50.26, MTCY03C7.11, MTCY339.34c, MTCY339.42, MTCY369.11c. Contains cytochrome P450 cysteine heme-iron ligand signature (PS00086). BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 141" /protein_id="NP_217637.1" /db_xref="GI:15610258" /db_xref="GOA:O08362" /db_xref="UniProtKB/Swiss-Prot:O08362" /db_xref="GeneID:887409" /translation="MTSTSIPTFPFDRPVPTEPSPMLSELRNSCPVAPIELPSGHTAW LVTRFDDVKGVLSDKRFSCRAAAHPSSPPFVPFVQLCPSLLSIDGPQHTAARRLLAQG LNPGFIARMRPVVQQIVDNALDDLAAAEPPVDFQEIVSVPIGEQLMAKLLGVEPKTVH ELAAHVDAAMSVCEIGDEEVSRRWSALCTMVIDILHRKLAEPGDDLLSTIAQANRQQS TMTDEQVVGMLLTVVIGGVDTPIAVITNGLASLLHHRDQYERLVEDPGRVARAVEEIV RFNPATEIEHLRVVTEDVVIAGTALSAGSPAFTSITSANRDSDQFLDPDEFDVERNPN EHIAFGYGPHACPASAYSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIK ELLVTWPT" misc_feature 3487523..3487552 /gene="cyp141" /locus_tag="Rv3121" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature" gene 3488089..3488559 /locus_tag="Rv3122" /db_xref="GeneID:888890" CDS 3488089..3488559 /locus_tag="Rv3122" /function="UNKNOWN" /note="Rv3122, (MTCY164.32), len: 156 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217638.1" /db_xref="GI:15610259" /db_xref="UniProtKB/TrEMBL:O07033" /db_xref="GeneID:888890" /translation="MYSGCWINNQNGETRVGEDSLEDLEQRRARLYDQLAATGDFRRG SISENYRRCGKPNCVCAQEGHPGHGPRYLWTRTVAGRGTKGRQLSVEEVDKVRAELAN YHRFAQVSEQIVAVNEAICEARPPNPAATAPPAGTTGHKKGGSATRSRRSSPPR" gene 3488569..3489063 /locus_tag="Rv3123" /db_xref="GeneID:888823" CDS 3488569..3489063 /locus_tag="Rv3123" /function="UNKNOWN" /note="Rv3123, (MTCY164.33), len: 162 aa. Hypothetical unknown protein, but N-terminus shares weak similarity with N-terminal part of O93439|CMESO-1 BHLH TRANSCRIPTION FACTOR from Gallus gallus (Chicken) (287 aa), FASTA scores: opt: 129, E(): 0.81, (38.75% identity in 80 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217639.1" /db_xref="GI:15610260" /db_xref="UniProtKB/TrEMBL:O07034" /db_xref="GeneID:888823" /translation="MRSRSVRWDPRCRPGRSGVGDPHCDDPAGLLAAGAAAGRRHRAP GPAHRLRARALRVVRRLPRQEPRYRAGPGPVAPRLLPLPHLRAWDGAPWIWNLATAIL PEATPIVDLYHARQHVHDLAGQLAPALGEHHSDWLTARLVDLDSGDIETLVQQPIGQH TGHT" gene 3489506..3490375 /locus_tag="Rv3124" /db_xref="GeneID:888825" CDS 3489506..3490375 /locus_tag="Rv3124" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3124, (MTCY164.34), len: 289 aa. Probable transcriptional regulatory protein, similar to many Streptomyces and Mycobacterium tuberculosis regulatory proteins e.g. Q11052|YC67_MYCTU|Rv1267c|MT1305|MTCY50.15 from Mycobacterium tuberculosis strain H37Rv (388 aa), FASTA scores: opt: 963, E(): 2e-56, (55.15% identity in 252 aa overlap); O53145 from Mycobacterium tuberculosis (381 aa); P71484|EMBR from Mycobacterium avium (384 aa), FASTA scores: opt: 859, E(): 1.5e-49, (52.2% identity in 249 aa overlap); Q9XCC3|TYLT from Streptomyces fradiae (404 aa), FASTA scores: opt: 462, E(): 3.1e-23, (35.05% identity in 254 aa overlap); Q9XCC4|TYLS from Streptomyces fradiae (277 aa), FASTA scores: opt: 456, E(): 5.6e-23, (33.45% identity in 269 aa overlap); etc. Start chosen by similarity, alternative possible (see AAK47548 from Mycobacterium tuberculosis strain CDC1551, longer N-terminus (311 aa))." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217640.1" /db_xref="GI:15610261" /db_xref="GOA:O05797" /db_xref="UniProtKB/TrEMBL:O05797" /db_xref="GeneID:888825" /translation="MQFNVLGPLELNLRGTKLPLGTPKQRAVLAMLLLSRNQVVAADA LVQAIWEKSPPARARRTVHTYICNLRRTLSDAGVDSRNILVSEPPGYRLLIGDRQQCD LDRFVAAKESGLRASAKGYFSEAIRYLDSALQNWRGPVLGDLRSFMFVQMFSRALTED ELLVHTKLAEAAIACGRADVVIPKLERLVAMHPYRESLWKQLMLGYYVNEYQSAAIDA YHRLKSTLAEELGVEPAPTIRALYHKILRQLPMDDLVGRVTRGRVDLRGGNGAKVEEL TESDKDLLPIGLA" gene complement(3490476..3491651) /gene="PPE49" /locus_tag="Rv3125c" /db_xref="GeneID:888892" CDS complement(3490476..3491651) /gene="PPE49" /locus_tag="Rv3125c" /function="UNKNOWN" /note="Rv3125c, (MTCY164.35c), len: 391 aa. Member of the Mycobacterium tuberculosis PPE family, similar to other e.g. P95247|Rv2352c|MTCY98.21c (391 aa), FASTA scores: opt: 1576, E(): 3.8e-72, (62.55% identity in 398 aa overlap), MTCY98.0029c, MTCY03A2.22c, MTCY10G2.10, MTCY02B10.25c, MTCI364.08, M TCY21C12.09c, MTCY48.17." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177932.1" /db_xref="GI:57117062" /db_xref="UniProtKB/TrEMBL:Q7D631" /db_xref="GeneID:888892" /translation="MVLGFSWLPPEINSARMFAGAGSGPLFAAASAWEGLAADLWASA SSFESVLAALTTGPWTGPASMSMAAAASPYVGWLSTVASQAQLAAIQARAAATAFEAA LAATVHPTAVTANRVSLASLIAANVLGQNTPAIAATEFDYLEMWAQDVAAMVGYHAGA KSVAATLAPFSLPPVSLAGLAAQVGTQVAGMATTASAAVTPVVEGAMASVPTVMSGMQ SLVSQLPLQHASMLFLPVRILTSPITTLASMARESATRLGPPAGGLAAANTPNPSGAA IPAFKPLGGRELGAGMSAGLGQAQLVGSMSVPPTWQGSIPISMASSAMSGLGVPPNPV ALTQAAGAAGGGMPMMLMPMSISGAGAGMPGGLMDRDGAGWHVTQARLTVIPRTGVG" gene complement(3491808..3492122) /locus_tag="Rv3126c" /db_xref="GeneID:888806" CDS complement(3491808..3492122) /locus_tag="Rv3126c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3126c, (MTCY164.36c), unknown, len: 104 aa. Hypothetical unknown protein. Shortened version of MTCY164.36c, avoiding overlap." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217642.1" /db_xref="GI:15610263" /db_xref="UniProtKB/TrEMBL:O05799" /db_xref="GeneID:888806" /translation="MVIRFDQIGSLVLSMKSLASLSFQRCLRENSSLVAALDRLDAAV DELSALSFDALTTPERDRARRDRDHHPWSRSRSQLSPRMAHGAVHQCQWPKAVWAVID NP" gene 3492147..3493181 /locus_tag="Rv3127" /db_xref="GeneID:888850" CDS 3492147..3493181 /locus_tag="Rv3127" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3127, (MTCY164.37), len: 344 aa. Hypothetical protein, highly similar to Mycobacterium tuberculosis protein O53476|Rv2032|MTV018.19 (331 aa), FASTA scores: opt: 1212, E(): 6e-69, (56.7% identity in 321 aa overlap), and also similar to P95195|MTCY03A2.27c (332 aa), FASTA scores: opt: 521, E(): 1.6e-25; (35.0% identity in 326 aa overlap). Some similarity to C-terminal half of hypothetical Mycobacterium tuberculosis proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217643.1" /db_xref="GI:15610264" /db_xref="UniProtKB/TrEMBL:O05800" /db_xref="GeneID:888850" /translation="MLKNAVLLACRAPSVHNSQPWRWVAESGSEHTTVHLFVNRHRTV PATDHSGRQAIISCGAVLDHLRIAMTAAHWQANITRFPQPNQPDQLATVEFSPIDHVT AGQRNRAQAILQRRTDRLPFDSPMYWHLFEPALRDAVDKDVAMLDVVSDDQRTRLVVA SQLSEVLRRDDPYYHAELEWWTSPFVLAHGVPPDTLASDAERLRVDLGRDFPVRSYQN RRAELADDRSKVLVLSTPSDTRADALRCGEVLSTILLECTMAGMATCTLTHLIESSDS RDIVRGLTRQRGEPQALIRVGIAPPLAAVPAPTPRRPLDSVLQIRQTPEKGRNASDRN ARETGWFSPP" gene complement(3493168..3494181) /locus_tag="Rv3128c" /pseudo /db_xref="GeneID:886263" misc_feature complement(3493168..3494181) /locus_tag="Rv3128c" /experiment="experimental evidence, no additional details recorded" /note="Rv3128c, (MTCY164.38c), len: 337 aa. Conserved hypothetical protein, similar to other conserved hypothetical proteins. This ORF corresponds to a fusion of MTCY164.38 and MTCY164.39c. Has in-frame amber stop codon but is similar throughout its length to Rv2807|MTCY16B7.36c|Z81331 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (384 aa), FASTA scores: opt: 954, E(): 0, (47.2% identity in 339 aa overlap).;CONSERVED HYPOTHETICAL PROTEIN" /pseudo gene 3494660..3494992 /locus_tag="Rv3129" /db_xref="GeneID:888901" CDS 3494660..3494992 /locus_tag="Rv3129" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3129, (MTCY164.40), len: 110 aa. Conserved hypothetical protein, with some similarity to various hypothetical proteins from Streptomyces coelicolor e.g. Q9RI34|SCJ12.26 HYPOTHETICAL 14.5 KDA PROTEIN (137 aa), FASTA scores: opt: 141, E(): 0.0016, (39.3% identity in 84 aa overlap); Q9RI49|SCJ12.09c HYPOTHETICAL 15.8 KDA PROTEIN (146 aa), FASTA scores: opt: 141, E(): 0.0017, (38.05% identity in 92 aa overlap); Q9RJ05|SCJ1.09C POSSIBLE DNA-BINDING PROTEIN (233 aa), FASTA scores: opt: 140, E(): 0.0029, (34.85% identity in 89 aa overlap); Q9XA48|SCGD3.31c PUTATIVE BRANCHED-CHAIN ALPHA KETO ACID DEHYDROGENASE E1 BETA SUBUNIT (334 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177933.1" /db_xref="GI:57117063" /db_xref="UniProtKB/TrEMBL:Q7D628" /db_xref="GeneID:888901" /translation="MVQGRTVLFRTAEGAKLFSAVAKCAVAFEADDHNVAEGWSVIVK VRAQVLTTDAGVREAERAQLLPWTATLKRHCVRVIPWEITGRHFRFGPEPDRSQTFAC EASSHNQR" gene complement(3494975..3496366) /gene="tgs1" /locus_tag="Rv3130c" /db_xref="GeneID:888841" CDS complement(3494975..3496366) /gene="tgs1" /locus_tag="Rv3130c" /experiment="experimental evidence, no additional details recorded" /codon_start=1 /transl_table=11 /product="triacylglycerol synthase" /protein_id="NP_217646.1" /db_xref="GI:15610266" /db_xref="GOA:O07035" /db_xref="UniProtKB/Swiss-Prot:O07035" /db_xref="GeneID:888841" /translation="MNHLTTLDAGFLKAEDVDRHVSLAIGALAVIEGPAPDQEAFLSS LAQRLRPCTRFGQRLRLRPFDLGAPKWVDDPDFDLGRHVWRIALPRPGNEDQLFELIA DLMARRLDRGRPLWEVWVIEGLADSKWAILTKLHHCMADGIAATHLLAGLSDESMSDS FASNIHTTMQSQSASVRRGGFRVNPSEALTASTAVMAGIVRAAKGASEIAAGVLSPAA SSLNGPISDLRRYSAAKVPLADVEQVCRKFDVTINDVALAAITESYRNVLIQRGERPR FDSLRTLVPVSTRSNSALSKTDNRVSLMLPNLPVDQENPLQRLRIVHSRLTRAKAGGQ RQFGNTLMAIANRLPFPMTAWAVGLLMRLPQRGVVTVATNVPGPRRPLQIMGRRVLDL YPVSPIAMQLRTSVAMLSYADDLYFGILADYDVVADAGQLARGIEDAVARLVAISKRR KVTRRRGALSLVV" gene 3496551..3497549 /locus_tag="Rv3131" /db_xref="GeneID:888838" CDS 3496551..3497549 /locus_tag="Rv3131" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3131, (MTCY03A2.27c), len: 332 aa. Hypothetical protein, similar to other hypothetical bacterial proteins e.g. O53476|Rv2032|MTV018.19 (331 aa), FASTA scores: opt: 568, E(): 2.5e-27, (36.7% identity in 321 aa overlap); O05800|Rv3127|MTCY164.37 (344 aa), FASTA scores: opt: 521, E(): 1.9e-24, (34.95% identity in 326 aa overlap); Q9RI33|SCJ12.27c from Streptomyces coelicolor (335 aa), FASTA scores: opt: 441, E(): 1.3e-19, (35.75% identity in 319 aa overlap); Q9RI44|SCJ12.14 from Streptomyces coelicolor (309 aa), FASTA scores: opt: 328, E(): 9.3e-13, (27.9% identity in 308 aa overlap); Q9CBP5|ML1751 from Mycobacterium leprae (721 aa), FASTA scores: opt: 137, E(): 0.78, (26.15% identity in 298 aa overlap); etc. Equivalent to AAK47555 from Mycobacterium tuberculosis strain CDC1551 but shorter 12 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217647.1" /db_xref="GI:15610267" /db_xref="UniProtKB/TrEMBL:P95195" /db_xref="GeneID:888838" /translation="MNTHFPDAETVRTVLTLAVRAPSIHNTQPWRWRVCPTSLELFSR PDMQLRSTDPDGRELILSCGVALHHCVVALASLGWQAKVNRFPDPKDRCHLATIGVQP LVPDQADVALAAAIPRRRTDRRAYSCWPVPGGDIALMAARAARGGVMLRQVSALDRMK AIVAQAVLDHVTDEEYLRELTIWSGRYGSVAGVPARNEPPSDPSAPIPGRLFAGPGLS QPSDVLPADDGAAILALGTETDDRLARLRAGEAASIVLLTATAMGLACCPITEPLEIA KTRDAVRAEVFGAGGYPQMLLRVGWAPINADPLPPTPRRELSQVVEWPEELLRQRC" gene complement(3497529..3499265) /gene="devS" /locus_tag="Rv3132c" /db_xref="GeneID:888829" CDS complement(3497529..3499265) /gene="devS" /locus_tag="Rv3132c" /EC_number="2.7.3.-" /function="SENSOR PART OF THE TWO COMPONENT REGULATORY SYSTEM DEVR/DEVS. THOUGHT TO CONTROL HSPX|Rv2031|ACR EXPRESSION." /experiment="experimental evidence, no additional details recorded" /note="Rv3132c, (MTCY03A2.26), len: 578 aa. devS, membrane-bound two component sensor histidine kinase (EC 2.7.3.-) (see citations below; dev for Differentially Expressed in Virulent strain), similar to others two component sensors e.g. Q9RI43|SCJ12.15c PUTATIVE TWO-COMPONENT SENSOR from Streptomyces coelicolor (585 aa), FASTA scores: opt: 1305, E(): 2.5e-69, (41.35% identity in 573 aa overlap); Q9ZBY4|SCD78.15 PUTATIVE TWO COMPONENT SENSOR from Streptomyces coelicolor (560 aa), FASTA scores: opt: 1194, E(): 8.1e-63, (41.05% identity in 558 aa overlap); O85371|CPRS TWO COMPONENT REGULATOR from Rhodococcus sp (563 aa), FASTA scores: opt: 803, E(): 8.3e-40, (38.4% identity in 552 aa overlap); Q9L094|SCC24.23 PUTATIVE TWO-COMPONENT SENSOR HISTIDINE KINASE from Streptomyces coelicolor (similarity only in C-terminus for this one); etc. Also highly similar to mycobacterium O53473|Rv2027c|MTV018.14c PUTATIVE MEMBRANE PROTEIN (573 aa), FASTA scores: opt: 2333, E(): 7.6e-130, (61.45% identity in 576 aa overlap). TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="two component sensor histidine kinase DEVS" /protein_id="NP_217648.1" /db_xref="GI:15610268" /db_xref="GOA:P95194" /db_xref="UniProtKB/TrEMBL:P95194" /db_xref="GeneID:888829" /translation="MTTGGLVDENDGAAMRPLRHTLSQLRLHELLVEVQDRVEQIVEG RDRLDGLVEAMLVVTAGLDLEATLRAIVHSATSLVDARYGAMEVHDRQHRVLHFVYEG IDEETVRRIGHLPKGLGVIGLLIEDPKPLRLDDVSAHPASIGFPPYHPPMRTFLGVPV RVRDESFGTLYLTDKTNGQPFSDDDEVLVQALAAAAGIAVANARLYQQAKARQSWIEA TRDIATELLSGTEPATVFRLVAAEALKLTAADAALVAVPVDEDMPAADVGELLVIETV GSAVASIVGRTIPVAGAVLREVFVNGIPRRVDRVDLEGLDELADAGPALLLPLRARGT VAGVVVVLSQGGPGAFTDEQLEMMAAFADQAALAWQLATSQRRMRELDVLTDRDRIAR DLHDHVIQRLFAIGLALQGAVPHERNPEVQQRLSDVVDDLQDVIQEIRTTIYDLHGAS QGITRLRQRIDAAVAQFADSGLRTSVQFVGPLSVVDSALADQAEAVVREAVSNAVRHA KASTLTVRVKVDDDLCIEVTDNGRGLPDEFTGSGLTNLRQRAEQAGGEFTLASVPGAS GTVLRWSAPLSQ" gene complement(3499262..3499915) /gene="devR" /locus_tag="Rv3133c" /db_xref="GeneID:888842" CDS complement(3499262..3499915) /gene="devR" /locus_tag="Rv3133c" /function="REGULATOR PART OF THE TWO COMPONENT REGULATORY SYSTEM DEVR/DEVS. CONTROLS HSPX|Rv2031|ACR EXPRESSION." /experiment="experimental evidence, no additional details recorded" /note="Rv3133c, (MTCY03A2.25), len: 217 aa. devR, two component transcriptional regulator (see Dasgupta et al., 2000; dev for Differentially Expressed in Virulent strain), highly similar to several e.g. O85372|CPRR TWO COMPONENT REGULATOR from Rhodococcus sp. (212 aa), FASTA scores: opt: 868, E(): 6.2e-46, (65.05% identity in 206 aa overlap); Q9RI42|SCJ12.16c PUTATIVE LUXR FAMILY TWO-COMPONENT RESPONSE REGULATOR from Streptomyces coelicolor (233 aa), FASTA scores: opt: 849, E(): 9.7e-45, (60.55% identity in 218 aa overlap); Q9XA59|SCGD3.19 PUTATIVE TWO-COMPONENT SYSTEM RESPONSE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (218 aa), FASTA scores: opt: 835, E(): 6.5e-44, (61.55% identity in 208 aa overlap); and similar to others. Contains bacterial regulatory proteins, LuxR family signature (PS00622) near C-terminus as seen in bvgA, comA, dctR, degU, evgA, fimZ, fixJ, gacA, glpR, narL, narP, nodW, rcsB and uhpA. Helix-turn-helix motif at 166-187 (+3.15 SD). BELONGS TO THE LUXR/UHPA FAMILY OF TRANSCRIPTIONAL REGULATORS. THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS." /codon_start=1 /transl_table=11 /product="two component transcriptional regulatory protein DevR" /protein_id="NP_217649.1" /db_xref="GI:15610269" /db_xref="GOA:P95193" /db_xref="UniProtKB/TrEMBL:P95193" /db_xref="GeneID:888842" /translation="MVKVFLVDDHEVVRRGLVDLLGADPELDVVGEAGSVAEAMARVP AARPDVAVLDVRLPDGNGIELCRDLLSRMPDLRCLILTSYTSDEAMLDAILAGASGYV VKDIKGMELARAVKDVGAGRSLLDNRAAAALMAKLRGAAEKQDPLSGLTDQERTLLGL LSEGLTNKQIADRMFLAEKTVKNYVSRLLAKLGMERRTQAAVFATELKRSRPPGDGP" misc_feature complement(3499343..3499426) /gene="devR" /locus_tag="Rv3133c" /note="PS00622 Bacterial regulatory proteins, luxR family signature" gene complement(3499943..3500749) /locus_tag="Rv3134c" /db_xref="GeneID:887558" CDS complement(3499943..3500749) /locus_tag="Rv3134c" /function="UNKNOWN. COULD PLAY A ROLE IN THE ADAPTATION TO HYPOXIA, PARTICIPATING IN THE PHOSPHORELAY IN THE TWO COMPONENT REGULATORY SYSTEM DEVR|Rv3133c/DEVS|Rv3132c." /experiment="experimental evidence, no additional details recorded" /note="Rv3134c, (MTCY03A2.240, len: 268 aa. Ala-, Val- rich protein (see citations below), related to other hypothetical Mycobacterium tuberculosis proteins e.g. O53474|Rv2028c|MTV018.15c (279 aa), FASTA scores: opt: 562, E(): 3.2e-28, (40.65% identity in 273 aa overlap); O06188|Rv2624c|MTCY01A10.08 (272 aa), FASTA scores: opt: 458, E(): 1.1e-21, (36.55% identity in 271 aa overlap); O53472|R2026c|MTV018.13c (294 aa), FASTA scores: opt: 232, E(): 1.9e-07, (30.45% identity in 276 aa overlap); etc. Shares some similarity with other hypothetical proteins from Streptomyces coelicolor e.g. Q9RIZ8|SCJ1.16c (294 aa), FASTA scores: opt: 207, E(): 6.9e-06, (28.9% identity in 263 aa overlap); Q9K4L5|SC5F8.09 PUTATIVE STRESS-INDUCIBLE PROTEIN (312 aa), FASTA scores: opt: 204, E(): 1.1e-05, (28.4% identity in 271 aa overlap); etc. Equivalent to AAK47558|MT3220 Universal stress protein family from Mycobacterium tuberculosis strain CDC1551 (268 aa). Rv3134c seems cotranscribed with devR-devS (see Sherman et al., 2001)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217650.1" /db_xref="GI:15610270" /db_xref="GOA:P95192" /db_xref="UniProtKB/TrEMBL:P95192" /db_xref="GeneID:887558" /translation="MSDPRPARAVVVGIDGSRAATHAALWAVDEAVNRDIPLRLVYVI DPSQLSAAGEGGGQSAARAALHDASRKVEATGQPVKIETEVLCGRPLTKLMQESRSAA MLCVGSVGLDHVRGRRGSVAATLAGSALCPVAVIHPSPAEPATTSQVSAVVAEVDNGV VLRHAFEEARLRGVPLRAVAVHAAETPDDVEQGSRLAHVHLSRRLAHWTRLYPEVRVD RAIAGGSACRHLAANAKPGQLFVADSHSAHELCGAYQPGCAVLTVRSANL" gene 3501334..3501732 /gene="PPE50" /locus_tag="Rv3135" /db_xref="GeneID:888153" CDS 3501334..3501732 /gene="PPE50" /locus_tag="Rv3135" /function="UNKNOWN" /note="Rv3135, (MTCY03A2.23c), len: 132 aa. Member of the Mycobacterium tuberculosis Ala-, Gly-rich PPE family, similar to P95190|Rv3136|MTCY03A2.22c (380 aa), FASTA scores: opt: 494, E(): 6.7e-25, (57.25% identity in 131 aa overlap) (next ORF downstream), MTY21C12_9, MTCY3C7_24, MTCI125_27, MTV049_12, MTV049_9, MTV049_11 , MTCY274_24 etc. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177934.1" /db_xref="GI:57117064" /db_xref="UniProtKB/TrEMBL:Q6MX07" /db_xref="GeneID:888153" /translation="MDYAFLPPEINSARMYSGPGPNSMLVAAASWDALAAELASAAEN YGSVIARLTGMHWWGPASTSMLAMSAPYVEWLERTAAQTKQTATQARAAAAAFEQAHA MTVPPALVTGIRGAIVVETASASNTAGTPP" gene 3501794..3502936 /gene="PPE51" /locus_tag="Rv3136" /db_xref="GeneID:888835" CDS 3501794..3502936 /gene="PPE51" /locus_tag="Rv3136" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3136, (MTCY03A2.22c), len: 380 aa. Member of the Mycobacterium tuberculosis Ala-, Gly-rich PPE family, similar to Q9AGF0|Ov2770c Rv2770c-LIKE PROTEIN from M. microti (397 aa), FASTA scores: opt: 917, E(): 9e-41, (46.15% identity in 388 aa overlap); O33312|Rv2770c|MTV002.35c, MTV002_36, MTCI125_26, MTCY10G2_10, MTCI364_8, MTV049_28, MTV049_29, etc. TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177935.1" /db_xref="GI:57117065" /db_xref="UniProtKB/TrEMBL:Q7D623" /db_xref="GeneID:888835" /translation="MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEA YGSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQAYA MTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMYGYATASAA AALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLIETVTQALQALTIPSFIPEDFT FLDAIFAGYATVGVTQDVESFVAGTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSA TSPGGGVSASGAGGAASVGNTVLASVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLT TLPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG" gene 3503393..3504175 /locus_tag="Rv3137" /db_xref="GeneID:888827" CDS 3503393..3504175 /locus_tag="Rv3137" /EC_number="3.1.3.-" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /note="Rv3137, (MTCY03A2.21c), len: 260 aa. Probable monophosphatase (EC 3.1.3.-), equivalent to O32889|MLCB1779_19|ML0662 PUTATIVE MONOPHOSPHATASE from Mycobacterium leprae (255 aa), FASTA scores: opt: 1403, E(): 1.2e-81, (81.8% identity in 253 aa overlap). Also similar to Q9K4B1|SC7E4.05c from Streptomyces coelicolor (266 aa), FASTA scores: opt: 969, E(): 3.5e-54, (57.9% identity in 259 aa overlap); Q53743|PUR3 MONO-PHOSPHATASE from Streptomyces lipmanii (Streptomyces alboniger) (273 aa), FASTA scores: opt: 862, E(): 2.1e-47, (55.25% identity in 257 aa overlap); BAB50023|MLL3039 MONO-PHOSPHATASE from Rhizobium loti (Mesorhizobium loti) (262 aa), FASTA scores: opt: 448, E(): 3.2e-21, (31.37% identity in 255 aa overlap); etc. Contains inositol monophosphatase family signature 1 (PS00629). TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="monophosphatase" /protein_id="NP_217653.1" /db_xref="GI:15610273" /db_xref="GOA:P95189" /db_xref="UniProtKB/TrEMBL:P95189" /db_xref="GeneID:888827" /translation="MSHDDLMLALALADRADELTRVRFGALDLRIDTKPDLTPVTDAD RAVESDVRQTLGRDRPGDGVLGEEFGGSTTFTGRQWIVDPIDGTKNFVRGVPVWASLI ALLEDGVPSVGVVSAPALQRRWWAARGRGAFASVDGARPHRLSVSSVAELHSASLSFS SLSGWARPGLRERFIGLTDTVWRVRAYGDFLSYCLVAEGAVDIAAEPQVSVWDLAALD IVVREAGGRLTSLDGVAGPHGGSAVATNGLLHDEVLTRLNAG" misc_feature 3503630..3503671 /locus_tag="Rv3137" /note="PS00629 Inositol monophosphatase family signature 1" gene 3504195..3505283 /gene="pflA" /locus_tag="Rv3138" /db_xref="GeneID:887973" CDS 3504195..3505283 /gene="pflA" /locus_tag="Rv3138" /EC_number="1.97.1.4" /function="INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: S-adenosyl-L-methionine + dihydroflavodoxin + [formate acetyltransferase]-glycine = 5'-deoxyadenosine + methionine + flavodoxin + [formate acetyltransferase]-glycine-2-yl radical]." /experiment="experimental evidence, no additional details recorded" /note="Rv3138, (MTCY03A2.20c), len: 362 aa. Probable pflA, pyruvate formate lyase activating protein (EC 1.97.1.4), similar to other e.g. Q9V0N1|PAB1859 from Pyrococcus abyssi (348 aa), FASTA scores: opt: 926, E(): 1.1e-52, (39.95% identity in 343 aa overlap); O27446|MTH1395 from Methanobacterium thermoautotrophicum (335 aa), FASTA scores: opt: 909, E(): 1.3e-51, (42.2% identity in 327 aa overlap); O28939|AF1330 from Archaeoglobus fulgidus (336 aa), FASTA scores: opt: 884, E(): 5.6e-50, (42.0% identity in 319 aa overlap); etc. Also similar to O50099|PH1391 HYPOTHETICAL 40.2 KDA PROTEIN from Pyrococcus horikoshii (348 aa), FASTA scores: opt: 934, E(): 3.3e-53, (40.5% identity in 343 aa overlap); and other hypothetical proteins. TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="pyruvate formate lyase activating protein PflA" /protein_id="NP_217654.1" /db_xref="GI:15610274" /db_xref="GOA:P95188" /db_xref="UniProtKB/TrEMBL:P95188" /db_xref="GeneID:887973" /translation="MSDPFTIATKHWHRLHDSRIQCDVCPRACKLHEGQRGLCFVRGR FDDQVKLTSYGRSSGFCVDPIEKKPLNHFLPGSATLSFGTAGCNLACKFCQNWDISKS REIDVLASRAAPADIARTAHELGCRSVAFTYNDPTIFWEYAADVADACHDQGIKAVAV TAGYMCPEPRAEFYRRVDAANVDLKAFTEDFYRKVCVSHLRNVLDTLAYLRHQTNVWL EITTLLIPGRNDSDAEVAAECRWIRENLGVDVPVHFTAFHPDYKMMDTPATPTATLTR AREIGIGEGLRFVYTGNVHDAVGGSTSCPGCRATVIVRDWYSIRHYALTEDGRCQACG YQMPGVYDGPAGHWGQRRLPLLTSLSRM" gene 3505363..3506769 /gene="fadE24" /locus_tag="Rv3139" /db_xref="GeneID:887971" CDS 3505363..3506769 /gene="fadE24" /locus_tag="Rv3139" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3139, (MTCY03A2.19c), len: 468 aa. Probable fadE24, acyl-CoA dehydrogenase (1.3.99.-), equivalent to O32890|MLCB1779.30|FADE24|ML0661 PUTATIVE ACYL-CoA DEHYDROGENASE from Mycobacterium leprae (465 aa), FASTA scores: opt: 2587, E(): 4e-153, (83.6% identity in 464 aa overlap). Similar to other e.g. Q9HUH0|PA4995 from Pseudomonas aeruginosa (429 aa), FASTA scores: opt: 1139, E(): 2.8e-63, (45.3% identity in 426 aa overlap); Q9K6D0|MMGC|BH3799 from Bacillus halodurans (379 aa), FASTA scores: opt: 603, E(): 4.7e-30, (30.3% identity in 366 aa overlap); Q9K6D1|ACDA|BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 601, E(): 6.3e-30, (32.25% identity in 363 aa overlap); etc. Contains acyl-CoA dehydrogenases signature 2 (PS00073) near C-terminus. BELONGS TO THE ACYL-CoA DEHYDROGENASES FAMILY. TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE24" /protein_id="NP_217655.1" /db_xref="GI:15610275" /db_xref="GOA:P95187" /db_xref="UniProtKB/TrEMBL:P95187" /db_xref="GeneID:887971" /translation="MTNTTSAANAAKPSGARTDRRGRTTGVGLAPHKRTGIDVALALL TPIVGQEFLDKYRLRDPLNRSLRYGVKTMFATAGAATRQFQRVQGLRGGPTRLKSSGR DYFDLTPDDDQKLIIETVDEFAEEVLRPAAHDADDAATYPSDLTAKAAELGITAINIP EDFDGIAEHRSSVTNVLVAEALAYGDMGLALPILAPGGVASALTHWGSADQQATYLKE FAGENVPQACVAITEPQPLFDPTRLKTTAVRTPSGYRLDGVKSLIPAAADAELFIVGA QLGGKPALFIVESAASGLTVKADPSMGIRGAALGQVELCGVSVPLNARLGEDEASDND YSEALALARLGWAALAVGTSHAVLDYVVPYVKQRQAFGEPIAHRQAVAFMCANIAIEL DGLRLITWRGASRAEQGLPFAREAALAKRLGSDKGMQIGLDGVQLLGGHGYTKEHPVE RWYRDLRAIGVAEGVVVI" misc_feature 3506668..3506727 /gene="fadE24" /locus_tag="Rv3139" /note="PS00073 Acyl-CoA dehydrogenases signature 2" gene 3506790..3507995 /gene="fadE23" /locus_tag="Rv3140" /db_xref="GeneID:887417" CDS 3506790..3507995 /gene="fadE23" /locus_tag="Rv3140" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3140, (MTCY03A2.18c), len: 401 aa. Probable fadE23, acyl-CoA dehydrogenase (1.3.99.-) (see citation below), equivalent to O32891|MLCB1779.31|FADE23|ML0660 PUTATIVE ACYL-CoA DEHYDROGENASE from Mycobacterium leprae (400 aa), FASTA scores: opt: 2307, E(): 3e-136, (89.5% identity in 401 aa overlap). Also similar to others e.g. Q9HUH1|PA4994 from Pseudomonas aeruginosa (402 aa), FASTA scores: opt: 1558, E(): 1.2e-89, (61.0% identity in 400 aa overlap); O31251 from Acinetobacter sp. ADP1 (401 aa), FASTA scores: opt: 1509, E(): 1.3e-86, (58.2% identity in 402 aa overlap); Q9K6D1|ACDA OR BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 612, E(): 8.4e-31, (38.2% identity in 293 aa overlap); Q9AHX9|FADFX from Pseudomonas putida (375 aa), FASTA scores: opt: 584, E(): 4.6e-29, (32.7% identity in 379 aa overlap); etc. COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY. TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE23" /protein_id="NP_217656.1" /db_xref="GI:15610276" /db_xref="GOA:P95186" /db_xref="UniProtKB/TrEMBL:P95186" /db_xref="GeneID:887417" /translation="MAINLELPRKLQAIIVKTHQGAAEMMRPIARKYDLKEHAYPVEL DTLINLFEGAAESFNFAGAHSLRDEDEGKDENHNGANMAAVVQTMEASWGDVAMMLSL PYQGLGNAAISAVATDEQLERLGKVWAAMAITEPEFGSDSAAVSTTATLDGDEYVING EKIFVTAGSRATHIVVWATLDKSLGRPAIKSFIVPREHPGVTVERLEHKLGIKGSDTA VIRFDNARIPKGNLLGNPEIEVGKGFAGVMETFDNTRPIVAAMAVGIGRAALEEIRSV LTGAGVEISYDKPSHTQSAAAAEFLRMEADWEASYLLSLRAAWQADNNIPNSKEASMS KAKAGRMASDVTCKTVELAGTTGYSEQSLLEKWARDSKILDIFEGTQQIQQLVVARRL LGLSSSELK" gene 3508095..3509066 /gene="fadB4" /locus_tag="Rv3141" /db_xref="GeneID:888051" CDS 3508095..3509066 /gene="fadB4" /locus_tag="Rv3141" /EC_number="1.6.5.5" /function="INVOLVED IN LIPID DEGRADATION [CATALYTIC ACTIVITY: NADPH + QUINONE = NADP(+) + SEMIQUINONE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3141, (MTCY03A2.17c), len: 323 aa. Probable fadB4, quinone oxidoreductase (EC 1.6.5.5), showing strong similarity to variety of quinone oxidoreductases and domains in polyketide and fatty acid synthases e.g. Q9HTV6|PA5234 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (325 aa), FASTA scores: opt: 737, E(): 1.4e-35, (39.65% identity in 328 aa overlap); Q9RYQ7|DRA0251 PUTATIVE NADPH QUINONE OXIDOREDUCTASE from Deinococcus radiodurans (336 aa), FASTA scores: opt: 688, E(): 1e-32, (40.6% identity in 325 aa overlap); Q9RVG8|DR1061 PUTATIVE NADPH QUINONE OXIDOREDUCTASE from Deinococcus radiodurans (388 aa), FASTA scores: opt: 559, E(): 3.3e-25, (36.3% identity in 325 aa overlap); BAB49685|MLL2594 PROBABLE QUINONE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (326 aa), FASTA scores: opt: 519, E(): 5.9e-23, (34.25% identity in 330 aa overlap); Q9LXZ4|T5P19_110 QUINONE REDUCTASE-LIKE PROTEIN from Arabidopsis thaliana (348 aa), FASTA scores: opt: 517, E(): 8.1e-23, (33.55% identity in 322 aa overlap); etc. Also similar to Q9AA38|CC0770 ZINC-CONTAINING ALCOHOL DEHYDROGENASE from Caulobacter crescentus (325 aa), FASTA scores: opt: 673, E(): 7.2e-32, (40.2% identity in 326 aa overlap); and Q9ABX4|CC0096 ZINC-CONTAINING ALCOHOL DEHYDROGENASE from Caulobacter crescentus (332 aa), FASTA scores: opt: 623, E(): 5.7e-29, (40.7% identity in 334 aa overlap). Also resembles Mycobacterium tuberculosis proteins P96826|Rv0149|MTCI5_23, MTCY13D12.11, MTCY24G1.03, MTCY19H9.01. BELONGS TO THE ZINC-CONTAINING ALCOHOL DEHYDROGENASE FAMILY, QUINONE OXIDOREDUCTASE SUBFAMILY. TBparse score is 0.904. Thought to be differentially expressed within host cells (see Triccas et al., 1999)." /codon_start=1 /transl_table=11 /product="NADPH quinone oxidoreductase" /protein_id="NP_217657.1" /db_xref="GI:15610277" /db_xref="GOA:P95185" /db_xref="UniProtKB/TrEMBL:P95185" /db_xref="GeneID:888051" /translation="MRAVRVTRLEGPDAVEVAEVEEPTSAGVVIEVHAAGVAFPDALL TRGRYQYRPEPPFVLGAEIAGVVRSAPDNSQVRSGDRVVGLTMLTGGMAEVAVLSPER VFKLPDNMTFEAGAGVLFNDLTVYFALAVRGRLQAGETVLVHGAAGGIGTSTLRLAPA LGASRTVAVVSTQEKAELATVAGATDVVLAEGFKDAVQELTNGRGVDIVVDPVGGDRF TDSLRSLAAGGRLLVIGFTGGEIPTVKVNRLLLNNIDVVGVGWGAWSLTHPDALAQQW SQLERLLRSGKLPPPEPVVYPLDQAAAAIASLENRTAKGKVVLRVRD" gene complement(3509118..3509546) /locus_tag="Rv3142c" /db_xref="GeneID:887521" CDS complement(3509118..3509546) /locus_tag="Rv3142c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3142c, (MTCY03A2.16), len: 142 aa. Hypothetical unknown protein. Equivalent to AAK47569 from Mycobacterium tuberculosis strain CDC1551 but shorter 33 aa. TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217658.1" /db_xref="GI:15610278" /db_xref="UniProtKB/TrEMBL:P95184" /db_xref="GeneID:887521" /translation="MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLT LPAIETSPAEVVAIDPNDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVH PDDRVTAWELYGKYHGYAACLAPGKLRVVRHDVADANGDQ" gene 3509654..3510055 /locus_tag="Rv3143" /db_xref="GeneID:887576" CDS 3509654..3510055 /locus_tag="Rv3143" /function="UNKNOWN, BUT COULD BE INVOLVED IN REGULATORY MECHANISM" /note="Rv3143, (MTCY03A2.15c), len: 133 aa. Probable response regulator, similar to other sensory transduction regulatory proteins e.g. Q9X810|SC6G10.25 from Streptomyces coelicolor (133 aa), FASTA scores: opt: 474, E(): 2.8e-24, (54.15% identity in 120 aa overlap); Q9KZ82|SCE25.04c from Streptomyces coelicolor (225 aa), FASTA scores: opt: 144, E(): 0.016, (32.3% identity in 127 aa overlap); Q9RZT4|DRB0029 from Deinococcus radiodurans (416 aa), FASTA scores: opt: 145, E(): 0.024, (30.65% identity in 124 aa overlap). SIMILAR TO OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="NP_217659.1" /db_xref="GI:15610279" /db_xref="GOA:P95183" /db_xref="UniProtKB/TrEMBL:P95183" /db_xref="GeneID:887576" /translation="MPDSSTALRILVYSDNVQTRERVMRALGKRLHPDLPDLTYVEVA TGPMVIRQMDRGGIDLAILDGEATPTGGMGIAKQLKDELASCPPILVLTGRPDDTWLA SWSRAEAAVPHPVDPIVLGRTVLSLLRAPAH" gene complement(3510088..3511317) /gene="PPE52" /locus_tag="Rv3144c" /db_xref="GeneID:887930" CDS complement(3510088..3511317) /gene="PPE52" /locus_tag="Rv3144c" /function="UNKNOWN" /note="Rv3144c, (MTCY03A2.14), len: 409 aa. Member of the Mycobacterium tuberculosis PPE family, Gly-, Ala-rich, similar to others e.g. P71868|Rv3533c|MTCY03C7.23 (582 aa), FASTA scores: opt: 1007, E(): 5.2e-35, (56.2% identity in 306 aa overlap); and MTV014_3, MTCY6G11_5, MTCY98.0034c, MTCY31.06c, MTCY48.17, MTCY98.0029c, MTCY03C7.17c, etc. TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177936.1" /db_xref="GI:57117066" /db_xref="UniProtKB/TrEMBL:Q6MX05" /db_xref="GeneID:887930" /translation="MSFVVLPPEINSLRMFIGAGTAPMLAAAAAWDGLAEELGTAAQS FASVTAGLAGQAWQGPAALAMAAAAAPYAGWLTAAAAQSAGAAGQARAVASIFEAAQA ATVLPAAVAANRDAFVQLVMTNLFGQNAPLIAAAEGVYEEMWAADVAAMSGYYSGASA IAAQVVPWASLLQRFPGLGAGATGATGGESVGTGATGGESVGTGGGESVGTGGATASG GGVGYVGSGVASAGLAAGDPAHGSVGQGNFGGGDVGAGDVVASSATSAHAGVVSPGFI GAPLALAALGQMARGGTNSAPGTATESARAPEPAASAPPEAVVEVPELEVPAMGVLPT VDPKVAAKAAPLSTTRVGQSAGSGIPESTLRTAQGQQASETSAAEETAPSLRPEAAAG QLRPRVRKDPKIQMRGG" gene 3511682..3512068 /gene="nuoA" /locus_tag="Rv3145" /db_xref="GeneID:887397" CDS 3511682..3512068 /gene="nuoA" /locus_tag="Rv3145" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit A" /protein_id="NP_217661.1" /db_xref="GI:15610281" /db_xref="GOA:P65563" /db_xref="UniProtKB/Swiss-Prot:P65563" /db_xref="GeneID:887397" /translation="MNVYIPILVLAALAAAFAVVSVVIASLVGPSRFNRSKQAAYECG IEPASTGARTSIGPGAASGQRFPIKYYLTAMLFIVFDIEIVFLYPWAVSYDSLGTFAL VEMAIFMLTVFVAYAYVWRRGGLTWD" gene 3512077..3512631 /gene="nuoB" /locus_tag="Rv3146" /db_xref="GeneID:888791" CDS 3512077..3512631 /gene="nuoB" /locus_tag="Rv3146" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="The point of entry for the majority of electrons that traverse the respiratory chain eventually resulting in the reduction of oxygen" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit B" /protein_id="NP_217662.1" /db_xref="GI:15610282" /db_xref="GOA:P65575" /db_xref="UniProtKB/Swiss-Prot:P65575" /db_xref="GeneID:888791" /translation="MGLEEQLPGGILLSTVEKVAGYVRKNSLWPATFGLACCAIEMMA TAGPRFDIARFGMERFSATPRQADLMIVAGRVSQKMAPVLRQIYDQMAEPKWVLAMGV CASSGGMFNNYAIVQGVDHVVPVDIYLPGCPPRPEMLLHAILKLHEKIQQMPLGINRE RAIAEAEEAALLARPTIEMRGLLR" gene 3512628..3513338 /gene="nuoC" /locus_tag="Rv3147" /db_xref="GeneID:888816" CDS 3512628..3513338 /gene="nuoC" /locus_tag="Rv3147" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit C" /protein_id="NP_217663.1" /db_xref="GI:15610283" /db_xref="GOA:P65571" /db_xref="UniProtKB/Swiss-Prot:P65571" /db_xref="GeneID:888816" /translation="MSPPNQDAQEGRPDSPTAEVVDVRRGMFGVSGTGDTSGYGRLVR QVVLPGSSPRPYGGYFDDIVDRLAEALRHERVEFEDAVEKVVVYRDELTLHVRRDLLP RVAQRLRDEPELRFELCLGVSGVHYPHETGRELHAVYPLQSITHNRRLRLEVSAPDSD PHIPSLFAIYPTNDWHERETYDFFGIIFDGHPALTRIEMPDDWQGHPQRKDYPLGGIP VEYKGAQIPPPDERRGYN" gene 3513338..3514660 /gene="nuoD" /locus_tag="Rv3148" /db_xref="GeneID:888851" CDS 3513338..3514660 /gene="nuoD" /locus_tag="Rv3148" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit D" /protein_id="NP_217664.1" /db_xref="GI:15610284" /db_xref="GOA:P65569" /db_xref="UniProtKB/Swiss-Prot:P65569" /db_xref="GeneID:888851" /translation="MTAIADSAGGAGETVLVAGGQDWQQVVDAARSADPGERIVVNMG PQHPSTHGVLRLILEIEGETVVEARCGIGYLHTGIEKNLEYRYWTQGVTFVTRMDYLS PFFNETAYCLGVEKLLGITDEIPERVNVIRVLMMELNRISSHLVALATGGMELGAMTP MFVGFRAREIVLTLFEKITGLRMNSAYIRPGGVAQDLPPNAATEIAEALKQLRQPLRE MGELLNENAIWKARTQGVGYLDLTGCMALGITGPILRSTGLPHDLRKSEPYCGYQHYE FDVITDDSCDAYGRYMIRVKEMWESMKIVEQCLDKLRPGPTMISDRKLAWPADLQVGP DGLGNSPKHIAKIMGSSMEALIHHFKLVTEGIRVPAGQVYVAVESPRGELGVHMVSDG GTRPYRVHYRDPSFTNLQSVAAMCEGGMVADLIAAVASIDPVMGGVDR" gene 3514657..3515415 /gene="nuoE" /locus_tag="Rv3149" /db_xref="GeneID:887903" CDS 3514657..3515415 /gene="nuoE" /locus_tag="Rv3149" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit E" /protein_id="NP_217665.1" /db_xref="GI:15610285" /db_xref="GOA:P65573" /db_xref="UniProtKB/Swiss-Prot:P65573" /db_xref="GeneID:887903" /translation="MTQPPGQPVFIRLGPPPDEPNQFVVEGAPRSYPPDVLARLEVDA KEIIGRYPDRRSALLPLLHLVQGEDSYLTPAGLRFCADQLGLTGAEVSAVASFYTMYR RRPTGEYLVGVCTNTLCAVMGGDAIFDRLKEHLGVGHDETTSDGVVTLQHIECNAACD YAPVVMVNWEFFDNQTPESARELVDSLRSDTPKAPTRGAPLCGFRQTSRILAGLPDQR PDEGQGGPGAPTLAGLQVARKNDMQAPPTPGADE" gene 3515412..3516749 /gene="nuoF" /locus_tag="Rv3150" /db_xref="GeneID:888854" CDS 3515412..3516749 /gene="nuoF" /locus_tag="Rv3150" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Rv3150, (MTCY03A2.08c), len: 445 aa. Probable nuoF, NADH dehydrogenase, chain F (EC 1.6.5.3), similar to others e.g. Q9XAQ9|NUOF_STRCO from Streptomyces coelicolor (449 aa), FASTA scores: opt: 2314, E(): 3.5e-139, (76.25% identity in 434 aa overlap); NUF2_RHIME from Rhizobium meliloti (421 aa), FASTA scores: opt: 1545, E(): 1.8e-90, (53.1% identity in 424 aa overlap); Q9RU92|DR1500 from Deinococcus radiodurans (444 aa), FASTA scores: opt: 1445, E(): 4.1e-84, (52.9% identity in 427 aa overlap); etc. Contains respiratory-chain NADH dehydrogenase 51 Kd subunit signature 2 (PS00645). BELONGS TO THE COMPLEX I 51 KDA SUBUNIT FAMILY. COFACTOR: FMN AND ONE 4FE-4S CLUSTER (PROBABLE). TBparse score is 0.889." /codon_start=1 /transl_table=11 /product="NADH dehydrogenase I chain F" /protein_id="NP_217666.1" /db_xref="GI:15610286" /db_xref="GOA:P65567" /db_xref="UniProtKB/Swiss-Prot:P65567" /db_xref="GeneID:888854" /translation="MTTQATPLTPVISRHWDDPESWTLATYQRHDRYRGYQALQKALT MPPDDVISIVKDSGLRGRGGAGFATGTKWSFIPQGDTGAAAKPHYLVVNADESEPGTC KDIPLMLATPHVLIEGVIIAAYAIRAHHAFVYVRGEVVPVLRRLHNAVAEAYAAGFLG RNIGGSGFDLELVVHAGAGAYICGEETALLDSLEGRRGQPRLRPPFPAVAGLYGCPTV INNVETIASVPSIILGGIDWFRSMGSEKSPGFTLYSLSGHVTRPGQYEAPLGITLREL LDYAGGVRAGHRLKFWTPGGSSTPLLTDEHLDVPLDYEGVGAAGSMLGTKALEIFDET TCVVRAVRRWTEFYKHESCGKCTPCREGTFWLDKIYERLETGRGSHEDIDKLLDISDS ILGKSFCALGDGAASPVMSSIKHFRDEYLAHVEGGGCPFDPRDSMLVANGVDA" misc_feature 3516462..3516497 /gene="nuoF" /locus_tag="Rv3150" /note="PS00645 Respiratory-chain NADH dehydrogenase 51 Kd subunit signature 2" gene 3516746..3519166 /gene="nuoG" /locus_tag="Rv3151" /db_xref="GeneID:887540" CDS 3516746..3519166 /gene="nuoG" /locus_tag="Rv3151" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit G" /protein_id="NP_217667.1" /db_xref="GI:15610287" /db_xref="GOA:P95175" /db_xref="UniProtKB/Swiss-Prot:P95175" /db_xref="GeneID:887540" /translation="MTQAADTDIRVGQPEMVTLTIDGVEISVPKGTLVIRAAELMGIQ IPRFCDHPLLEPVGACRQCLVEVEGQRKPLASCTTVATDDMVVRTQLTSEIADKAQHG VMELLLINHPLDCPMCDKGGECPLQNQAMSNGRTDSRFTEAKRTFAKPINISAQVLLD RERCILCARCTRFSDQIAGDPFIDMQERGALQQVGIYADEPFESYFSGNTVQICPVGA LTGTAYRFRARPFDLVSSPSVCEHCASGCAQRTDHRRGKVLRRLAGDDPEVNEEWNCD KGRWAFTYATQPDVITTPLIRDGGDPKGALVPTSWSHAMAVAAQGLAAARGRTGVLVG GRVTWEDAYAYAKFARITLGTNDIDFRARPHSAEEADFLAARIAGRHMAVSYADLESA PVVLLVGFEPEDESPIVFLRLRKAARRHRVPVYTIAPFATGGLHKMSGRLIKTVPGGE PAALDDLATGAVGDLLATPGAVIIVGERLATVPGGLSAAARLADTTGARLAWVPRRAG ERGALEAGALPTLLPGGRPLADEVARAQVCAAWHIAELPAAAGRDADGILAAAADETL AALLVGGIEPADFADPDAVLAALDATGFVVSLELRHSTVTERADVVFPVAPTTQKAGA FVNWEGRYRTFEPALRGSTLQAGQSDHRVLDALADDMGVHLGVPTVEAAREELAALGI WDGKHAAGPHIAATGPTQPEAGEAILTGWRMLLDEGRLQDGEPYLAGTARTPVVRLSP DTAAEIGAADGEAVTVSTSRGSITLPCSVTDMPDRVVWLPLNSAGSTVHRQLRVTIGS IVKIGAGS" misc_feature 3517088..3517126 /gene="nuoG" /locus_tag="Rv3151" /note="PS00642 Respiratory-chain NADH dehydrogenase 75 Kd subunit signature 2" gene 3519282..3520514 /gene="nuoH" /locus_tag="Rv3152" /db_xref="GeneID:887531" CDS 3519282..3520514 /gene="nuoH" /locus_tag="Rv3152" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit H" /protein_id="NP_217668.1" /db_xref="GI:15610288" /db_xref="GOA:P65561" /db_xref="UniProtKB/Swiss-Prot:P65561" /db_xref="GeneID:887531" /translation="MTTFGHDTWWLVAAKAIAVFVFLMLTVLVAILAERKLLGRMQLR PGPNRVGPKGALQSLADGIKLALKESITPGGIDRFVYFVAPIISVIPAFTAFAFIPFG PEVSVFGHRTPLQITDLPVAVLFILGLSAIGVYGIVLGGWASGSTYPLLGGVRSTAQV ISYEVAMGLSFATVFLMAGTMSTSQIVAAQDGVWYAFLLLPSFVIYLISMVGETNRAP FDLPEAEGELVAGFHTEYSSLKFAMFMLAEYVNMTTVSALAATLFFGGWHAPWPLNMW ASANTGWWPLIWFTAKVWGFLFIYFWLRATLPRLRYDQFMALGWKLLIPVSLVWVMVA AIIRSLRNQGYQYWTPTLVFSSIVVAAAMVLLLRKPLSAPGARASARQRGDEGTSPEP AFPTPPLLAGATKENAGG" misc_feature 3519933..3519974 /gene="nuoH" /locus_tag="Rv3152" /note="PS00668 Respiratory-chain NADH dehydrogenase subunit 1 signature 2" gene 3520507..3521142 /gene="nuoI" /locus_tag="Rv3153" /db_xref="GeneID:887530" CDS 3520507..3521142 /gene="nuoI" /locus_tag="Rv3153" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit I" /protein_id="NP_217669.1" /db_xref="GI:15610289" /db_xref="GOA:P95173" /db_xref="UniProtKB/Swiss-Prot:P95173" /db_xref="GeneID:887530" /translation="MANTDRPALPHKRAVPPSRADSGPRRRRTKLLDAVAGFGVTLGS MFKKTVTEEYPERPGPVAARYHGRHQLNRYPDGLEKCIGCELCAWACPADAIYVEGAD NTEEERFSPGERYGRVYQINYLRCIGCGLCIEACPTRALTMTYDYELADDNRADLIYE KDRLLAPLLPEMAAPPHPRTPGATDKDYYLGNVTAEGLRGVRESQTTGDSR" misc_feature 3520747..3520782 /gene="nuoI" /locus_tag="Rv3153" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" misc_feature 3520882..3520917 /gene="nuoI" /locus_tag="Rv3153" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" gene 3521139..3521927 /gene="nuoJ" /locus_tag="Rv3154" /db_xref="GeneID:888762" CDS 3521139..3521927 /gene="nuoJ" /locus_tag="Rv3154" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit J" /protein_id="NP_217670.1" /db_xref="GI:15610290" /db_xref="GOA:P95172" /db_xref="UniProtKB/TrEMBL:P95172" /db_xref="GeneID:888762" /translation="MTAVLASDVIVRTSTGEAVMFWVLSALALLGAVGVVLAVNAVYS AMFLAMTMIILAVFYMAQDALFLGVVQVVVYTGAVMMLFLFVLMLIGVDSAESLKETL RGQRVAAVLTGVGFGVLLISTIGQVATRGFAGLTVANANGNVEGLAALIFSRYLWAFE LTSALLITAAVGAMVLAHRERFERRKTQRELSQERFRPGGHPTPLPNPGVYARHNAVD VAALLPDGSYSELSVPRMLRTRGADGLQTPSPGAVSGSLEGGAS" gene 3521924..3522223 /gene="nuoK" /locus_tag="Rv3155" /db_xref="GeneID:888764" CDS 3521924..3522223 /gene="nuoK" /locus_tag="Rv3155" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit K" /protein_id="NP_217671.1" /db_xref="GI:15610291" /db_xref="GOA:P65565" /db_xref="UniProtKB/Swiss-Prot:P65565" /db_xref="GeneID:888764" /translation="MNPANYLYLSVLLFTIGASGVLLRRNAIVMFMCVELMLNAVNLA FVTFARMHGHLDAQMIAFFTMVVAACEVVVGLAIIMTIFRTRKSASVDDANLLKG" gene 3522234..3524135 /gene="nuoL" /locus_tag="Rv3156" /db_xref="GeneID:888063" CDS 3522234..3524135 /gene="nuoL" /locus_tag="Rv3156" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to ubiquinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit L" /protein_id="NP_217672.1" /db_xref="GI:15610292" /db_xref="GOA:O86350" /db_xref="UniProtKB/Swiss-Prot:O86350" /db_xref="GeneID:888063" /translation="MTTSLGTHYTWLLVALPLAGAAILLFGGRRTDAWGHLLGCAAAL AAFGVGAMLLADMLGRDGLERAIHQQVFTWIPAGGLQVDFGLQIDQLSMCFVLLISGV GSLIHIYSVGYMAEDPDRRRFFGYLNLFLASMLLLVVADNYVLLYVGWEGVGLASYLL IGFWYHKPSAATAAKKAFVMNRVGDAGLAVGMFLTFSTFGTLSYAGVFAGVPAASRAV LTAIGLLMLLGACAKSAQVPLQAWLGDAMEGPTPVSALIHAATMVTAGVYLIVRSGPL YNLAPTAQLAVVIVGAVTLLFGAIIGCAKDDIKRALAASTISQIGYMVLAAGLGPAGY AFAIMHLLTHGFFKAGLFLGSGAVIHAMHEEQDMRRYGGLRAALPVTFATFGLAYLAI IGVPPFAGFFSKDAIIEAALGAGGIRGSLLGGAALLGAGVTAFYMTRVMLMTFFGEKR WTPGAHPHEAPAVMTWPMILLAVGSVFSGGLLAVGGTLRHWLQPVVGSHEEATHALPT WVATTLALGVVAVGIAVAYRMYGTAPIPRVAPVRVSALTAAARADLYGDAFNEEVFMR PGAQLTNAVVAVDDAGVDGSVNALATLVSQTSNRLRQMQTGFARNYALSMLVGAVLVA AALLVVQLW" gene 3524132..3525793 /gene="nuoM" /locus_tag="Rv3157" /db_xref="GeneID:888765" CDS 3524132..3525793 /gene="nuoM" /locus_tag="Rv3157" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit M" /protein_id="NP_217673.1" /db_xref="GI:15610293" /db_xref="GOA:O53307" /db_xref="UniProtKB/Swiss-Prot:O53307" /db_xref="GeneID:888765" /translation="MNNVPWLSVLWLVPLAGAVLIILLPPGRRRLAKWAGMVVSVLTL AVSIVVAAEFKPSAEPYQFVEKHSWIPAFGAGYTLGVDGIAVVLVLLTTVLIPLLLVA GWNDATDADDLSPASGRYPQRPAPPRLRSSGGERTRGVHAYVALTLAIESMVLMSVIA LDVLLFYVFFEAMLIPMYFLIGGFGQGAGRSRAAVKFLLYNLFGGLIMLAAVIGLYVV TAQYDSGTFDFREIVAGVAAGRYGADPAVFKALFLGFMFAFAIKAPLWPFHRWLPDAA VESTPATAVLMMAVMDKVGTFGMLRYCLQLFPDPSTYFRPLIVTLAIIGVIYGAIVAI GQTDMMRLIAYTSISHFGFIIAGIFVMTTQGQSGSTLYMLNHGLSTAAVFLIAGFLIA RRGSRSIADYGGVQKVAPILAGTFMVSAMATVSLPGLAPFISEFLVLLGTFSRYWLAA AFGVTALVLSAVYMLWLYQRVMTGPVAEGNERIGDLVGREMIVVAPLIALLLVLGVYP KPVLDIINPAVENTMTTIGQHDPAPSVAHPVPAVGASRTAEGPHP" gene 3525790..3527385 /gene="nuoN" /locus_tag="Rv3158" /db_xref="GeneID:888780" CDS 3525790..3527385 /gene="nuoN" /locus_tag="Rv3158" /EC_number="1.6.5.3" /function="INVOLVED IN AEROBIC|ANAEROBIC RESPIRATION [CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the transfer of electrons from NADH to quinone" /codon_start=1 /transl_table=11 /product="NADH dehydrogenase subunit N" /protein_id="NP_217674.1" /db_xref="GI:15610294" /db_xref="GOA:O53308" /db_xref="UniProtKB/Swiss-Prot:O53308" /db_xref="GeneID:888780" /translation="MILPAPHVEYFLLAPMLIVFSVAVAGVLAEAFLPRRWRYGAQVT LALGGSAVALIAVIVVARSIHGSGHAAVLGAIAVDRATLFLQGTVLLVTIMAVVFMAE RSARVSPQRQNTLAVARLPGLDSFTPQASAVPGSDAERQAERAGATQTELFPLAMLSV GGMMVFPASNDLLTMFVALEVLSLPLYLMCGLARNRRLLSQEAAMKYFLLGAFSSAFF LYGVALLYGATGTLTLPGIRDALAARTDDSMALAGVALLAVGLLFKVGAVPFHSWIPD VYQGAPTPITGFMAAATKVAAFGALLRVVYVALPPLHDQWRPVLWAIAILTMTVGTVT AVNQTNVKRMLAYSSVAHVGFILTGVIADNPAGLSATLFYLVAYSFSTMGAFAIVGLV RGADGSAGSEDADLSHWAGLGQRSPIVGVMLSMFLLAFAGIPLTSGFVSKFAVFRAAA SAGAVPLVIVGVISSGVAAYFYVRVIVSMFFTEESGDTPHVAAPGVLSKAAIAVCTVV TVVLGIAPQPVLDLADQAAQLLR" gene complement(3527391..3529163) /gene="PPE53" /locus_tag="Rv3159c" /db_xref="GeneID:888794" CDS complement(3527391..3529163) /gene="PPE53" /locus_tag="Rv3159c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3159c, (MTV014.03c), len: 590 aa. Member of the Mycobacterium tuberculosis PPE_family of Gly-, Asn-rich proteins. Highly similar to P71868|Rv3533c|MTCY03C7.23 (582 aa), FASTA scores: opt: 2289, E(): 3.2e-98, (63.5% identity in 600 aa overlap); and also similar to MTCY48_17, MTV041_29, MTCY6G11_5, MTCY98_24, etc. TBparse score is 0.921." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177937.1" /db_xref="GI:57117067" /db_xref="UniProtKB/TrEMBL:Q6MX04" /db_xref="GeneID:888794" /translation="MNYSVLPPEINSLRMFTGAGSAPMLAASVAWDRLAAELAVAASS FGSVTSGLAGQSWQGAAAAAMAAAAAPYAGWLAAAAARAAGASAQAKAVASAFEAARA ATVHPMLVAANRNAFVQLVLSNLFGQNAPAIAAAEAMYEQMWAADVAAMVGYHGGASA AAAQLSSWSIGLQQALPAAPSALAAAIGLGNIGVGNLGGGNTGDYNLGSGNSGNANVG SGNSGNANVGSGNDGATNLGSGNIGNTNLGSGNVGNVNLGSGNRGFGNLGNGNFGSGN LGSGNTGSTNFGGGNLGSFNLGSGNIGSSNIGFGNNGDNNLGLGNNGNNNIGFGLTGD NLVGIGALNSGIGNLGFGNSGNNNIGFFNSGNNNVGFFNSGNNNFGFGNAGDINTGFG NAGDTNTGFGNAGFFNMGIGNAGNEDMGVGNGGSFNVGVGNAGNQSVGFGNAGTLNVG FANAGSINTGFANSGSINTGGFDSGDRNTGFGSSVDQSVSSSGFGNTGMNSSGFFNTG NVSAGYGNNGDVQSGINNTNSGGFNVGFYNSGAGTVGIANSGLQTTGIANSGTLNTGV ANTGDHSSGGFNQGSDQSGFFGQP" gene complement(3529338..3529979) /locus_tag="Rv3160c" /db_xref="GeneID:888797" CDS complement(3529338..3529979) /locus_tag="Rv3160c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3160c, (MTV014.04c), len: 213 aa. Possible transcriptional regulator, with some similarity to others e.g. Q9S3L4|AMTR AMTR PROTEIN (global repressor in the nitrogen regulation system; see Jakoby et al., 2000) (222 aa), FASTA scores: opt: 182, E(): 7.3e-05, (27.9% identity in 208 aa overlap); Q9X7X9|SC6A5.33c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (223 aa), FASTA scores: opt: 176, E(): 0.00018, (26.5% identity in 185 aa overlap); Q9XA31|SCH69.03c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (209 aa), FASTA scores: opt: 173, E(): 0.00027, (27.25% identity in 176 aa overlap); BAB54133|MLL7734 TRANSCRIPTIONAL REGULATOR from Rhizobium loti (Mesorhizobium loti) (213 aa), FASTA scores: opt: 172, E(): 0.00031, (23.55% identity in 204 aa overlap); etc. Also similar to hypothetical proteins from Mycobacterium tuberculosis strain H37Rv e.g. P96839|Rv3557v|MTCY06G11.04c (200 aa), FASTA scores: opt: 169, E(): 0.00046, (26.75% identity in 157 aa overlap). Contains probable helix-turn-helix motif from aa 31 to 52 (Score 1857, +5.51 SD). SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.901." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217676.1" /db_xref="GI:15610296" /db_xref="GOA:O53310" /db_xref="UniProtKB/TrEMBL:O53310" /db_xref="GeneID:888797" /translation="MPRQAGRWSPTALRILGAAAELIALRGYSSTSTRDIAAAVGVEQ PAIYKHFSAKRDILAALVRLAVEWPLELFGHITAMPVPAVVKLHRWLTESLDHLHASP YVLVSILITPDLHQESFVAERELVAEMERALVGLIETGQGEGDVRAMHPLSAARLVQA LFDALALPEFAVSPDEIVEFAMTALLSDPDRLAEIRAAADALEIQTAPPDRGL" gene complement(3529990..3531138) /locus_tag="Rv3161c" /db_xref="GeneID:888800" CDS complement(3529990..3531138) /locus_tag="Rv3161c" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3161c, (MTV014.05c), len: 382 aa. Possible dioxygenase (EC 1.-.-.-), similar to subunit of several dioxygenases and related proteins e.g. BAB50510|MLR3662 DIOXYGENASE, ALPHA SUBUNIT from Rhizobium loti (Mesorhizobium loti) (400 aa), FASTA scores: opt: 413, E(): 6.2e-20, (28.4% identity in 331 aa overlap); Q9A3T0|CC3122 RIESKE 2FE-2S FAMILY PROTEIN from Caulobacter crescentus (404 aa), FASTA scores: opt: 405, E(): 2.1e-19, (27.95% identity in 372 aa overlap); Q9HTF4|PA5410 PROBABLE RING HYDROXYLATING DIOXYGENASE, ALPHA-SUBUNIT from Pseudomonas aeruginosa (429 aa), FASTA scores: opt: 392, E(): 1.6e-18, (25.8% identity in 399 aa overlap); Q9AGK6|PHTAA PHTHALATE DIOXYGENASE LARGE SUBUNIT from Arthrobacter keyseri (473 aa), FASTA scores: opt: 385, E(): 5.2e-18, (34.0% identity in 206 aa overlap); P76253|YEAW_ECOLI PUTATIVE DIOXYGENASE, ALPHA SUBUNIT from Escherichia coli (374 aa), FASTA scores: opt: 376, E(): 1.7e-17, (27.05% identity in 344 aa overlap); etc. TBparse score is 0.932." /codon_start=1 /transl_table=11 /product="dioxygenase" /protein_id="NP_217677.1" /db_xref="GI:15610297" /db_xref="GOA:O53311" /db_xref="UniProtKB/TrEMBL:O53311" /db_xref="GeneID:888800" /translation="MLSTDNRAELGDILTDIGDYLDDNPPALSLPPAAYTSSELWQLE RERIFNRSWMLVAHVDQVAKTGDYVTVSVAGEPVMVVRDVDGQLHALSPICRHRLMLM VEPGAGRIDTLTCQYHLWRYGLDGRLRGAPHMAANLDFNRRECRLPQFAVATWNGLVW INLDADAEPIAAHLDLTDDEFAGYRLGEMVQVESWSHEWRANWKVAAENGHENYHVLG LHRQTLEPFVPGGGDLDVRQYSRWALRLRVPFTVPVEAKSLQLNEVQKSNLVVLWTFP NSALAIAGERVVWFGFIPQSIDRVQVLGGVLTTPELAADAAATAQTSQFVMAMINDED RLGLEAVQVGAGSRFAERGHLSSKEWPGMLAFYRNLAMALVGDHPGAS" gene complement(3531208..3531645) /locus_tag="Rv3162c" /db_xref="GeneID:888756" CDS complement(3531208..3531645) /locus_tag="Rv3162c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3162c, (MTV014.06c), len: 145 aa. Possible integral membrane protein, with some similarity to C-terminal part of Q10803|Rv2877c|MTCY274.08c hypothetical protein from Mycobacterium tuberculosis (287 aa), FASTA scores: opt: 112, E(): 6.9, (29.65% identity in 135 aa overlap); and other hypothetical proteins from other organisms. TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217678.1" /db_xref="GI:15610298" /db_xref="UniProtKB/TrEMBL:O53312" /db_xref="GeneID:888756" /translation="MTSFAHPGTRGLSTVFGLMMVGSAAVGSHGLAVVVGLAAVIAVG VAAVFRLAATLAVVLSVVMIVVSGPTHVLAALSGFCAAVYLVCRYGAGVVAGSWPTTV AAVGFTFAGLAATSFPLQVPWLPLAAPLAVLATYVLATRPFSR" gene complement(3531642..3532913) /locus_tag="Rv3163c" /db_xref="GeneID:888789" CDS complement(3531642..3532913) /locus_tag="Rv3163c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3163c, (MTV014.07c), len: 423 aa. Possible conserved secreted protein, with some similarity to other hypothetical bacterial proteins e.g. Q9Z539|SC9B2.20c from Streptomyces coelicolor (460 aa), FASTA scores: opt: 666, E(): 1.5e-33, (33.55% identity in 417 aa overlap); O58486|PH0774 from Pyrococcus horikoshii (410 aa), FASTA scores: opt: 329, E(): 6.9e-13, (23.8% identity in 424 aa overlap); Q9UZ66|PAB0849 from Pyrococcus abyssi (410 aa), FASTA scores: opt: 322, E(): 1.9e-12, (24.15% identity in 389 aa overlap); etc. Also some similarity with P71761|Rv1480|MTV007.27|MTCY277.01 from Mycobacterium tuberculosis (317 aa), FASTA scores: opt: 198, E(): 6.3e-05, (26.75% identity in 269 aa overlap). Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217679.1" /db_xref="GI:15610299" /db_xref="UniProtKB/TrEMBL:O53313" /db_xref="GeneID:888789" /translation="MIQTCEVELRWRASQLTLAIATCAGVALAAAVVAGRWQLIAFAA PLLGVLCSISWQRPVPVIQVHGDPDSQRCFENEHVRVTVWVTTESVDAAVELTVSALA GMQFEALESVSRRTTTVSAVAQRWGRYPIRARVAVVARGGLLMGAGTVDAAEIVVFPL TPPQSTPLPQTELLDRLGAHLTRHVGPGVEYADIRPYVPGDQLRAVNWVVSARRGRLH VTRRLTDRAADVVVLIDMYRQPAGPATEATERVVRGAAQVVQTALRNGDRAGIVALGG NRPRWLGADIGQRQFYRVLDTVLGAGEGFENTTGTLAPRAAVPAGAVVIAFSTLLDTE FALALIDLRKRGHVVVAVDVLDSCPLQDQLDPLVVRMWALQRSAMYRDMATIGVDVLS WPADHSLQQSMGALPNRRRRGRGRASRARLP" misc_feature complement(3532437..3532523) /locus_tag="Rv3163c" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene complement(3532943..3533905) /gene="moxR3" /locus_tag="Rv3164c" /db_xref="GeneID:887658" CDS complement(3532943..3533905) /gene="moxR3" /locus_tag="Rv3164c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM; REGULATES METHANOL DEHYDROGENASE." /note="Rv3164c, (MTV014.08c), len: 320 aa. Probable moxR3, methanol dehydrogenase regulatory protein, highly similar to Q9Z538|SC9B2.21c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (332 aa), FASTA scores: opt: 1227, E(): 1.7e-67, (60.25% identity in 302 aa overlap); Q9UZ67|MOXR-3|PAB0848 METHANOL DEHYDROGENASE REGULATORY PROTEIN from Pyrococcus abyssi (314 aa), FASTA scores: opt: 1126, E(): 2.3e-61, (54.1% identity in 305 aa overlap); Q9HSH7|MOXR|VNG0223G METHANOL DEHYDROGENASE REGULATORY PROTEIN from Halobacterium sp. strain NRC-1 (318 aa), FASTA scores: opt: 1072, E(): 4.5e-58, (51.45% identity in 315 aa overlap); Q9RVV4|DR0918 MOXR-RELATED PROTEIN from Deinococcus radiodurans (354 aa), FASTA scores: opt: 1000, E(): 1.2e-53, (50.95% identity in 318 aa overlap); etc. Also high similarity with several hypothetical bacterial proteins. TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="methanol dehydrogenase transcriptional regulatory protein MoxR3" /protein_id="NP_217680.1" /db_xref="GI:15610300" /db_xref="UniProtKB/TrEMBL:O53314" /db_xref="GeneID:887658" /translation="MIMPAATTTAHCEAVLDEIERVVVGKRSALTLILTAVLARGHVL IEDLPGLGKTLIARSFAAALGLDFTRVQFTPDLLPADLLGSTIYDMQSGRFEFRAGPI FTNLLLADEINRTPPKTQAALLEAMAEGQVSIDGQTHKLAMPFIVLATDNPIEYEGTY PLPEAQLDRFAIRLELRYLSERDETSMLRRRLERGSADPTVNQVVDCHDLLAMRESVE QVTVHEDVLHYVVSLANATRHHPQVAVGASPRAELDLVQLSRARALLLGRDYVIPEDV KELATAAVAHRITLRPEMWVRKIAGADVVSELLRRLPVPRISGT" gene complement(3533913..3534395) /locus_tag="Rv3165c" /db_xref="GeneID:887661" CDS complement(3533913..3534395) /locus_tag="Rv3165c" /function="UNKNOWN" /note="Rv3165c, (MTV014.09)c, len: 160 aa. Hypothetical unknown protein. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217681.1" /db_xref="GI:15610301" /db_xref="UniProtKB/TrEMBL:O53315" /db_xref="GeneID:887661" /translation="MKRLIALGIFLIVGIELLALILHDRRLVLAGSGLALALVLLNVR RMLGNRDELTAAPDSDDLGEGLRRWLSNTETTIRWSESTRADWDRHLRPMLARRFEIA TGHRQAKDPVAFAATGRMLFGDELWEWVNPNNVTHTGDRQPGPGRAALEEILQKLEQV" gene complement(3534392..3535351) /locus_tag="Rv3166c" /db_xref="GeneID:888750" CDS complement(3534392..3535351) /locus_tag="Rv3166c" /function="UNKNOWN" /note="Rv3166c, (MTV014.10c), len: 319 aa. Probable transmembrane protein, similar but longer (52 aa) to O32895|MLCB1779.35c hypothetical protein from Mycobacterium leprae (119 aa), FASTA scores: opt: 289, E(): 3.7e-10, (44.25% identity in 122 aa overlap). Also some similarity to Q9Z536|SC9B2.23c PUTATIVE TRANSMEMBRANE PROTEIN from Streptomyces coelicolor (339 aa), FASTA scores: opt: 247, E(): 2.5e-07, (28.2% identity in 326 aa overlap); and in N-terminus to Q9RS20|DR2307 PUTATIVE MULTIDRUG-EFFLUX TRANSPORTER from Deinococcus radiodurans (410 aa), FASTA scores: opt: 135,E(): 1, (32.35% identity in 136 aa overlap). TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217682.1" /db_xref="GI:15610302" /db_xref="UniProtKB/TrEMBL:O53316" /db_xref="GeneID:888750" /translation="MPGTKPGSDKPTGRVVVVIVLLMLAGAALRGHLPADDGAPLAAA GGSRAALMFIVAALAATLALIALAIITRLRHPLPVAPSAGELSAMLGGAAGRPNWRVL LLGLGTILAWLLIAILLARLFVPDDVGPAAPIPDSTATPDASSTTPSRPQPPQDNNDD VLGILFASTIGLFLMVVAGSLITSRRQRKSAPARISGDRIESPAPSARSESLARAAEI GLAEMADLRREPREAIIACYVAMERELSHVPGVAPQDFDTPTEVLARAVEHRALHGAS AAALVSLFAEARFSPHVMNEEHREVAMRLLRLVLDELSTRTAI" gene complement(3535431..3536057) /locus_tag="Rv3167c" /db_xref="GeneID:888763" CDS complement(3535431..3536057) /locus_tag="Rv3167c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3167c, (MTV014.11c), len: 208 aa. Probable transcriptional regulator, tetR family, similar to several transcriptional regulators e.g. Q9L2A4|SC8F4.22c (TETR/ACRR FAMILY) from Streptomyces coelicolor (234 aa), FASTA scores: opt: 317, E(): 7.5e-13, (33.35% identity in 210 aa overlap); Q9RK47|SCF12.11 (TETR/ACRR FAMILY) from Streptomyces coelicolor (206 aa), FASTA scores: opt: 293, E(): 2.1e-11, (32.65% identity in 199 aa overlap); Q54288 REGULATOR OF ANTIBIOTIC TRANSPORT COMPLEXES (TETR/ACRR FAMILY) (204 aa), FASTA scores: opt: 260, E(): 2.4e-09, (30.75% identity in 205 aa overlap); etc. Equivalent to AAK47595 from Mycobacterium tuberculosis strain CDC1551 but shorter 21 aa. Contains probable helix-turn-helix motif from aa 42 to 63 (Score 1727, +5.07 SD). MAY BE BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217683.1" /db_xref="GI:15610303" /db_xref="GOA:O53317" /db_xref="UniProtKB/TrEMBL:O53317" /db_xref="GeneID:888763" /translation="MKADLPSLDKAPGAGRPRDPRIDSAILSATAELLVQIGYSNLSL AAVAERAGTTKSALYRRWSSKAELVHEAAFPAAPTALQAAAGDIAADIRMMIAATRDV FTTPVVRAALPGLVADMTADAELNARVLARFADLFAAVRMRLREAVDRGEAHPDVDPD RLIELIGGATMLRMLLYPDDMLDDAWVDQTTAIVVRGVHRAAPGGSVV" gene 3536102..3537238 /locus_tag="Rv3168" /db_xref="GeneID:888778" CDS 3536102..3537238 /locus_tag="Rv3168" /function="UNKNOWN" /note="Rv3168, (MTV014.12), len: 378 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. Q9M7Y6|F3E22.6 from Arabidopsis thaliana (Mouse-ear cress) (314 aa), FASTA scores: opt: 236, E(): 1.1e-07, (27.35% identity in 234 aa overlap); Q9RYW2|DRA0194 from Deinococcus radiodurans (386 aa), FASTA scores: opt: 207, E(): 9.1e-06, (23.45% identity in 320 aa overlap); etc. Also some similarity with O69727|Rc3761c|MTV025.109c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (351 aa), FASTA scores: opt: 193, E(): 6.4e-05, (29.4% identity in 242 aa overlap). TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217684.1" /db_xref="GI:15610304" /db_xref="UniProtKB/TrEMBL:O53318" /db_xref="GeneID:888778" /translation="MANEPAIGAIDRLQRSSRDVTTLPAVISRWLSSVLPGGAAPEVT VESGVDSTGMSSETIILTARWQQDGRSIQQKLVARVAPAAEDVPVFPTYRLDHQFEVI RLVGELTDVPVPRVRWIETTGDVLGTPFFLMDYVEGVVPPDVMPYTFGDNWFADAPAE RQRQLQDATVAALATLHSIPNAQNTFSFLTQGRTSDTTLHRHFNWVRSWYDFAVEGIG RSPLLERTFEWLQSHWPDDAAAREPVLLWGDARVGNVLYRDFQPVAVLDWEMVALGPR ELDVAWMIFAHRVFQELAGLATLPGLPEVMREDDVRATYQALTGVELGDLHWFYVYSG VMWACVFMRTGARRVHFGEIEKPDDVESLFYHAGLMKHLLGEEH" gene 3537238..3538362 /locus_tag="Rv3169" /db_xref="GeneID:888774" CDS 3537238..3538362 /locus_tag="Rv3169" /function="UNKNOWN" /note="Rv3169, (MTV014.13), len: 374 aa. Conserved hypothetical protein, with similarity to other hypothetical proteins: Q9A8W6|CC1232 from Caulobacter crescentus (368 aa), FASTA scores: opt: 669, E(): 3.3e-34, (34.05% identity in 376 aa overlap); and O32901|MLCB1779.41 from Mycobacterium leprae (127 aa), FASTA scores: opt: 179, E(): 0.00034, (29.0% identity in 131 aa overlap). Also weak similarity with P95149|Rv1866|MTCY359.07c (804 aa), FASTA scores: opt: 121, E(): 6.4, (37.0% identity in 119 aa overlap). Equivalent to AAK47597 from Mycobacterium tuberculosis strain CDC1551 but shorter 43 aa. TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217685.1" /db_xref="GI:15610305" /db_xref="UniProtKB/TrEMBL:O53319" /db_xref="GeneID:888774" /translation="MPQMLGPLDEYPLHQLPQPIAWPGSSDRNFYDRSYFNAHDRTGN IFLITGIGYYPNLGVKDAFVLIRRADIQTAVHLSDAIDSDRLHQHVNGYRVEVVEPLR KLRIVLDETEGVAADLTWEGLFDVVQEQPHVLRSGNRVTLDAQRFAQLGTWSGRIVVD GERIAVDPATWLGSRDRSWGIRPVGEPEPAGRPADPPFEGMWWLYVPLAFDDFAVVLI IQEEPDGFRSLNDCTRIWRDGHVEQLGWPRVRIHYRSGTRIPTGATIEASTPDGAPVH FDVESKLAVPTHVGGGYGGDSDWSHGMWKGEKFVERRTYDMTDPTIIARAGFGVIDHV GRALCRDGDGNPVQGWGLFEHGALGRHDPSGFADWSTLAP" misc_feature 3538502..3538522 /note="PS00092 N-6 Adenine-specific DNA methylases signature" gene 3538505..3539851 /gene="aofH" /locus_tag="Rv3170" /db_xref="GeneID:888754" CDS 3538505..3539851 /gene="aofH" /locus_tag="Rv3170" /EC_number="1.4.3.4" /function="POSSIBLY CATALYZES THE OXIDATIVE DEAMINATION: OXIDIZE ON PRIMARY AMINES, AND PERHAPS ON SECONDARY AND TERTIARY AMINES [CATALYTIC ACTIVITY: RCH(2)NH(2) + H(2)O + O(2) = RCHO + NH(3) + H(2)O(2)]. MUST HAVE IMPORTANT FUNCTION IN METABOLISM. SUPPOSED INVOLVED IN STATIONARY-PHASE SURVIVAL." /experiment="experimental evidence, no additional details recorded" /note="Rv3170, (MT3259, MTV014.14), len: 448 aa. Probable aofH, flavin-containing (mono)amine oxidase (EC 1.4.3.4), equivalent to a predicted homologous protein from Mycobacterium smegmatis (see citation below), and similar to many eukaryotic monoamine oxidases e.g. P49253|AOF_ONCMY from Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) (522 aa), FASTA scores: opt: 869, E(): 5.3e-44, (37.7% identity in 448 aa overlap); P21396|AOFA_RAT|MAOA from Rattus norvegicus (Rat) (526 aa), FASTA scores: opt: 839, E(): 3.2e-42, (37.45% identity in 446 aa overlap); Q99NA8|MAO-A from Cavia porcellus (Guinea pig) (506 aa), FASTA scores: opt: 836, E(): 4.6e-42, (37.0% identity in 446 aa overlap); P21398|AOFA_BOVIN from Bos taurus (Bovine) (527 aa), FASTA scores: opt: 806, E(): 2.8e-40, (37.0% identity in 446 aa overlap); P21397|AOFA_HUMAN (527 aa), FASTA scores: opt: 801, E(): 5.6e-40, (37.2% identity in 446 aa overlap); etc. Alternative start possible at position 3538487. BELONGS TO THE FLAVIN MONOAMINE OXIDASE FAMILY. COFACTOR: FAD (POTENTIAL). TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="flavin-containing monoamine oxidase" /protein_id="NP_217686.1" /db_xref="GI:15610306" /db_xref="GOA:P63533" /db_xref="UniProtKB/Swiss-Prot:P63533" /db_xref="GeneID:888754" /translation="MTNPPWTVDVVVVGAGFAGLAAARELTRQGHEVLVFEGRDRVGG RSLTGRVAGVPADMGGSFIGPTQDAVLALATELGIPTTPTHRDGRNVIQWRGSARSYR GTIPKLSLTGLIDIGRLRWQFERIARGVPVAAPWDARRARELDDVSLGEWLRLVRATS SSRNLMAIMTRVTWGCEPDDVSMLHAARYVRAAGGLDRLLDVKNGAQQDRVPGGTQQI AQAAAAQLGARVLLNAAVRRIDRHGAGVTVTSDQGQAEAGFVIVAIPPAHRVAIEFDP PLPPEYQQLAHHWPQGRLSKAYAAYSTPFWRASGYSGQALSDEAPVFITFDVSPHADG PGILMGFVDARGFDSLPIEERRRDALRCFASLFGDEALDPLDYVDYRWGTEEFAPGGP TAAVPPGSWTKYGHWLREPVGPIHWASTETADEWTGYFDGAVRSGQRAAAEVAALL" gene complement(3539846..3540745) /gene="hpx" /locus_tag="Rv3171c" /db_xref="GeneID:887293" CDS complement(3539846..3540745) /gene="hpx" /locus_tag="Rv3171c" /EC_number="1.11.1.-" /function="SUPPOSED INVOLVED IN DETOXIFICATION REACTIONS." /note="Rv3171c, (MTV014.15c), len: 299 aa. Possible hpx, non-heme haloperoxidase (EC 1.11.1.-), similar to other hydrolases (principaly epoxide hydrolases) and non-heme chloroperoxidases e.g. Q9RKB6|SCE87.22c PUTATIVE HYDROLASE from Streptomyces coelicolor (314 aa), FASTA scores: opt: 431, E(): 6e-20, (38.05% identity in 297 aa overlap); Q9HZ14|PA3226 PROBABLE HYDROLASE (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Pseudomonas aeruginosa (275 aa), FASTA scores: opt: 236, E(): 1e-07, (29.6% identity in 277 aa overlap); Q9DBL9|1300003 D03RIK PROTEIN SIMILAR TO ALPHA/BETA HYDROLASE FOLD from Mus musculus (Mouse) (351 aa), FASTA scores: opt: 223, E(): 8.3e-07, (24.35% identity in 304 aa overlap); AAK46260|MT1988 EPOXIDE HYDROLASE from Mycobacterium tuberculosis strain CDC1551 (356 aa), FASTA scores: opt: 223, E(): 8.4e-07, (40.7% identity in 113 aa overlap); P49323|PRXC_STRLI|CPO|CPOL NON-HEME CHLOROPEROXIDASE (EC 1.11.1.10) (CHLORIDE PEROXIDASE) from Streptomyces lividans (275 aa), FASTA scores: opt: 220, E(): 1e-06, (29.5% identity in 305 aa overlap); etc. Equivalent to AAK47599 Hydrolase, alpha/beta hydrolase family from Mycobacterium tuberculosis strain CDC1551 but shorter 24 aa. Start chosen by similarity, alternative with good RBS possible. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="non-Heme haloperoxidase Hpx" /protein_id="NP_217687.1" /db_xref="GI:15610307" /db_xref="GOA:O53321" /db_xref="UniProtKB/TrEMBL:O53321" /db_xref="GeneID:887293" /translation="MTVRAADGTPLHTQVFGPPHGYPIVLTHGFVCAIRAWAYQIADL AGDYRVIAFDHRGHGRSGVPRRGAYSLNHLAADLDSVLDATLAPRERAVVAGHSMGGI TIAAWSDRYRHKVRRRTDAVALINTTTGDLVRKVKLLSVPRELSPVRVLAGRSLVNTF GGFPLPGAARALSRHVISTLAVAADADPSATRLVYELFTQTSAAGRGGCAKMLVEEVG SAHLNLDGLTVPTLVIGGVRDRLTPISQSRRIARTAPNVVGLVELPGGHCSMLERHQE VNSHLRALAESVTRHVRDRRISS" gene complement(3540882..3541364) /locus_tag="Rv3172c" /db_xref="GeneID:888848" CDS complement(3540882..3541364) /locus_tag="Rv3172c" /function="UNKNOWN" /note="Rv3172c, (MTV014.16c), len: 160 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217688.1" /db_xref="GI:15610308" /db_xref="UniProtKB/TrEMBL:O53322" /db_xref="GeneID:888848" /translation="MSVALLREMFDRMVVAKNAELIEHYYDPDFLMYSDGLSQSFAKF RDSHRKLYATAISYAVEYDEHAWVEAQTRLPGGCGSPRRDLARSRPASRWYSLPPTAT AEFTGSGRRRGRVGATWPPSTITETTTDRLAMRNQLRAGAATLLFCDPMLQRFPATRK" gene complement(3541443..3542045) /locus_tag="Rv3173c" /db_xref="GeneID:888844" CDS complement(3541443..3542045) /locus_tag="Rv3173c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM (PROBABLY REPRESSION)." /note="Rv3173c, (MTV014.17c), len: 200 aa. Probable transcriptional regulatory protein tetR family, similar to several bacterial putative regulatory proteins e.g. Q9EWI2|SC7H9.14 from Streptomyces coelicolor (195 aa), FASTA scores: opt: 319, E(): 1.7e-13, (34.55% identity in 195 aa overlap); O85695|3SCF60.04 from Streptomyces lividans and Streptomyces coelicolor (192 aa), FASTA scores: opt: 297, E(): 4.3e-12, (37.45% identity in 187 aa overlap); BAB50853|MLR4117 from Rhizobium loti (Mesorhizobium loti) (205 aa), FASTA scores: opt: 280, E(): 5.5e-11, (31.45% identity in 194 aa overlap); BAB53760|MLL8133 from Rhizobium loti (Mesorhizobium loti) (194 aa), FASTA scores: opt: 270, E(): 2.3e-10, (34.05% identity in 185 aa overlap); etc. Also similar to other regulators from Mycobacterium tuberculosis e.g. P96839|Rv3557c|MTCY06G11.04c (200 aa), FASTA scores: opt: 154, E(): 0.0013, (38.8% identity in 80 aa overlap). Contains probable helix-turn-helix motif from aa 39 to 60 (Score 1251, +3.45 SD). SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="TetR/ACRR family transcriptional regulator" /protein_id="NP_217689.1" /db_xref="GI:15610309" /db_xref="GOA:O53323" /db_xref="UniProtKB/TrEMBL:O53323" /db_xref="GeneID:888844" /translation="MPPVTRTTEPPRRGGRGARQRILKAAAELFYCEGINATGVELIA NKASVSKRTLYQHFPSKSALVEEYLRGLRQAAGEADKMPKASNATPRERLLALFDRPN RGDGRMRGCPFHNAAVEAAGEMPGVERIVHSHKRDYIKGLARLAREAGAAHPRSLGNQ LAVLFEGAAALSTSLDDAGPWAHARAAAEVLIDQATARPV" gene 3542138..3542845 /locus_tag="Rv3174" /db_xref="GeneID:887989" CDS 3542138..3542845 /locus_tag="Rv3174" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /note="Rv3174, (MTV014.18), len: 235 aa. Probable oxidoreductase short-chain dehyrogenase/reductase (EC 1.-.-.-), similar to others e.g. Q9RPT7|SITS from Streptomyces albus (223 aa), FASTA scores: opt: 654, E(): 6.1e-32, (49.3% identity in 215 aa overlap); Q9RI61|SCJ11.46 from Streptomyces coelicolor (230 aa), FASTA scores: opt: 626, E(): 2.9e-30, (50.9% identity in 224 aa overlap); Q9A5Z1|CC2306 from Caulobacter crescentus (252 aa), FASTA scores: opt: 430, E(): 1.3e-18, (39.45% identity in 228 aa overlap); Q51641 INSECT-TYPE DEHYDROGENASE (249 aa), FASTA scores: opt: 301, E(): 5.7e-11, (38.3% identity in 188 aa overlap); Q9HXC9|PA3883 from Pseudomonas aeruginosa (276 aa), FASTA scores: opt: 296, E(): 1.2e-10, (29.55% identity in 247 aa overlap); etc. MAY BE BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_217690.1" /db_xref="GI:15610310" /db_xref="GOA:O53324" /db_xref="UniProtKB/TrEMBL:O53324" /db_xref="GeneID:887989" /translation="MTSLAERTVLVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAI DVSDPRVIPLQLDVTDAVSVAEAADLATDVGILINNAGISRASSVLDKDTSALRGELE TNLFGPLALASAFADRIAERSGAIVNVSSVLAWLPLGMSYGVSKAAMWSATESMRIEL APRGVQVVGVYVGLVDTDMGRFADAPKSDPADVVRQVLDGIEAGKEDVLADEMSRQVR ASLNVPARERIARLMGN" gene 3542860..3544347 /locus_tag="Rv3175" /db_xref="GeneID:887294" CDS 3542860..3544347 /locus_tag="Rv3175" /EC_number="3.5.1.4" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /note="catalyzes the hydrolysis of a monocarboxylic acid amid to form a monocarboxylate and ammonia" /codon_start=1 /transl_table=11 /product="amidase" /protein_id="NP_217691.1" /db_xref="GI:15610311" /db_xref="GOA:O53325" /db_xref="UniProtKB/TrEMBL:O53325" /db_xref="GeneID:887294" /translation="MAMSAKASDDIAWLPATAQLAVLAAKKVSSAELVELYLSRIDTY NASLNAIVTVDPDAARRVAKRSDAARARGDELGPLHGLPITVKDSYETAGMRTTCGRR DLADYVPTQDAEAVARLRRAGAIIMGKTNMPTGNQDVQASNPVFGRTNNPWDAARTSG GSAGGGAAATAAGLTSFDYGSEIGGSTRIPAHYCGLYGHKSTWRSVPLVGHIPSAPGN PGRWGQADMACAGVQVRGARDIIPALEATVGPMRADGGFSYALAPPRAGALKDFRVAV WAEDPHCPIDADVRRAMDDAVAALRAAGAHVVEQPATIPVDMAVSHNIFQSLVFGAFA VDRSTLSPASAAALGLRAVRHPRGEAANALGATLQSHRAWLFADAARHEMRDRWAGFF NEFDVLLLPVTPTPAPLHHNKDHDRLGRTIDVDGVSRSYWDQLKWNALANIAGTPATT MPITTTATGLPIGIQAMGPAGGDRTTVEFAALLTEVLGGFRVPPL" misc_feature 3543226..3543249 /locus_tag="Rv3175" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3544344..3545300) /gene="mesT" /locus_tag="Rv3176c" /db_xref="GeneID:888846" CDS complement(3544344..3545300) /gene="mesT" /locus_tag="Rv3176c" /function="BIOTRANSFORMATION ENZYME THAT ACTS ON A VARIETY OF EPOXIDES AND ARENE OXIDES. CATALYZES THE HYDROLYSIS OF ARENE AND EPOXIDES TO LESS REACTIVE AND MORE WATER SOLUBLE DIHYDRODIOLS BY THE TRANS ADDITION OF WATER [CATALYTIC ACTIVITY: AN EPOXIDE + H(2)O = A GLYCOL]." /note="Rv3176c, (MTV014.20c), len: 318 aa. Probable mesT, epoxide hydrolase (EC 3.3.2.3), similar to others e.g. O15007|PEG1|MEST|Q92571|O14973 MEST PROTEIN (MESODERM SPECIFIC TRANSCRIPT (MOUSE) HOMOLOG) (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Homo sapiens (Human) (335 aa), FASTA scores: opt: 348, E(): 6e-15, (32.15% identity in 280 aa overlap); AAH06639|Q07646 MEST PROTEIN from Mus musculus (Mouse) (335 aa), FASTA scores: opt: 342, E(): 1.4e-14, (31.45% identity in 280 aa overlap); Q9I8E7|MEST EPOXIDE HYDROLASE (EC 3.3.2.3) from Fugu rubripes (Japanese pufferfish) (Takifugu rubripes) (326 aa), FASTA scores: opt: 322, E(): 2.7e-13, (29.55% identity in 301 aa overlap); Q9PUC9|PEG1|MEST EPOXIDE HYDROLASE from Brachydanio rerio (Zebrafish) (Zebra danio) (344 aa), FASTA scores: opt: 322, E(): 2.8e-13, (32.35% identity in 207 aa overlap); Q9HYH6|PA3429 PROBABLE EPOXIDE HYDROLASE from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 258, E(): 3e-09, (29.85% identity in 288 aa overlap); O31243|ECHA EPOXIDE HYDROLASE from Agrobacterium radiobacter (294 aa), FASTA scores: opt: 202, E(): 1.1e-05, (27.0% identity in 278 aa overlap); etc. Also similar to Q50599|Rv1834|MT1882|MTCY1A11.09c HYPOTHETICAL 31.7 KDA PROTEIN from Mycobacterium tuberculosis (288 aa), FASTA scores: opt: 294, E(): 1.5e-11, (29.95% identity in 287 aa overlap). Equivalent to AAK47604 from Mycobacterium tuberculosis strain CDC1551 (339 aa) but shorter 21 aa. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. MAY BE BELONG TO PEPTIDASE FAMILY S33. Note that previously known as lipS. TBparse score is 0.911.; lipS" /codon_start=1 /transl_table=11 /product="epoxide hydrolase MesT" /protein_id="YP_177938.1" /db_xref="GI:57117068" /db_xref="GOA:Q6MX03" /db_xref="UniProtKB/TrEMBL:Q6MX03" /db_xref="GeneID:888846" /translation="MTHRASALISAQEWFSAGERVGYDAERPGINPRSPLRAFIRRAA GTGVTRTFLPGWPDGSYGWAKVEAFLSSRFHFPRIYLDYIGHGDSDKPRDYPYSTFER ADLVEALWHAEGIAQTVVVAFDYSCIVSLELLARRIDRERAGNDQRTRITACLLANGG IFADGHTHAWYTTPLLTSPLGAAITPIGQRSWRMFAPFLRPVFSRGYPLSAAEMKELH DAISRRDGVRVLPATAGFVDEHREHAARWDLARIISALGDEVAFGVVGSAEDPFEGEQ LRLARERLADSVEITELAGGHLTTAEQPDRLAEVIAALPERS" gene 3545447..3546307 /locus_tag="Rv3177" /db_xref="GeneID:888766" CDS 3545447..3546307 /locus_tag="Rv3177" /EC_number="1.11.1.-" /function="SUPPOSED INVOLVED IN DETOXIFICATION REACTIONS." /note="Rv3177, (MTV014.21), len: 286 aa. Possible peroxidase (non-haem peroxidase) (EC 1.11.1.-), highly similar to Q9KJF9|W78 CULTIVAR SPECIFICITY PROTEIN (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) W78 from Rhizobium leguminosarum (287 aa), FASTA scores: opt: 1059, E(): 2.3e-59, (61.4% identity in 272 aa overlap); BAB48728|MLL1328 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (286 aa), FASTA scores: opt: 746, E(): 1.1e-39, (43.25% identity in 282 aa overlap). Similar to nonheme chloroperoxidases and related esterases e.g. O73957|SAL LIPOLYTIC ENZYME from Sulfolobus acidocaldarius (314 aa), FASTA scores: opt: 408, E(): 1.9e-18, (32.4% identity in 287 aa overlap); Q9AJM9|BIOH PROTEIN INVOLVED IN BIOTIN SYNTHESIS from Kurthia sp. 538-KA26 (267 aa), FASTA scores: opt: 324 ,E(): 3.2e-13, (30.0% identity in 250 aa overlap); Q9CBB1|ML2269 PUTATIVE HYDROLASE (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Mycobacterium leprae (265 aa); O05691|THCF_RHOER NON-HEME HALOPEROXIDASE (EC 1.11.1.-) from Rhodococcus erythropolis (SIMILAR TO OTHER BACTERIAL NON-HEME BROMO- AND CHLORO-PEROXIDASES) (274 aa), FASTA scores: opt: 279, E(): 2.2e-10, (29.0% identity in 276 aa overlap); Q53540|EST ESTERASE (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Pseudomonas putida (276 aa), FASTA scores: opt: 271, E(): 7.1e-10, (29.65% identity in 280 aa overlap); etc. Also similar to O06420|BPOC|Rv0554|MTCY25D10.33 HYPOTHETICAL 28.3 KDA PROTEIN (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from M. tuberculosis (262 aa), FASTA scores: opt: 280 ,E(): 1.8e-10, (28.0% identity in 257 aa overlap). Equivalent to AAK47605 from Mycobacterium tuberculosis strain CDC1551 (300 aa) but shorter 14 aa. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="peroxidase" /protein_id="NP_217693.1" /db_xref="GI:15610313" /db_xref="GOA:O53327" /db_xref="UniProtKB/TrEMBL:O53327" /db_xref="GeneID:888766" /translation="MPQRQAGDIGATYQDAPTKSINVGGTRFVYRRLGADAGVPVIFL HHLGAVLDNWDPRVVDGIAAKHPVVTFDNRGVGASEGQTPDTVTTMADDAIAFVRALG FDQVDLLGFSLGGFVAQVIAQQEPQLVRKIILAGTGPAGGVGIGKVTFGTIRESIKAT LTFRDPKELRFFTRTDSGKSAARQFVKRLKERKDNRDKSITVRAFRSQLKAIHAWGTQ KPSDLTSIGHPVLIANGDDDTMVPTSNSLDLADRLPDATLRIYPDAGHGGIFQHHAQF VDDALQFLES" gene 3546438..3546797 /locus_tag="Rv3178" /db_xref="GeneID:888786" CDS 3546438..3546797 /locus_tag="Rv3178" /function="UNKNOWN" /note="Rv3178, (MTV014.22), len: 119 aa. Hypothetical protein, with some similarity to other hypothetical bacterial proteins (principaly mycobacterium and streptomyces proteins) e.g. P71854|Rv3547|MTCY03C7.09c from Mycobacterium tuberculosis strain H37Rv (151 aa), FASTA scores: opt: 310, E(): 2e-14, (40.5% identity in 116 aa overlap); Q9ZH81 from M. paratuberculosis (144 aa), FASTA scores: opt: 274, E(): 5.6e-12, (38.9% identity in 108 aa overlap); O85698|3SCF60.07 from Streptomyces lividans and Streptomyces coelicolor (149 aa), FASTA scores: opt: 235, E(): 2.7e-09, (35.2% identity in 108 aa overlap); Q10772|YF58_MYCTU|Rv1558|MT1609|MTCY48.07c (148 aa); Q9WX21|SCE68.11 from Streptomyces coelicolor (305 aa); etc. Equivalent to AAK47606 from Mycobacterium tuberculosis strain CDC1551 (171 aa) but shorter 52 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217694.1" /db_xref="GI:15610314" /db_xref="GOA:O53328" /db_xref="UniProtKB/TrEMBL:O53328" /db_xref="GeneID:888786" /translation="MRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVA SALGQAENPQWYRNLPPNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDS CQSWTERGIPVIILRPR" gene 3547618..3548907 /locus_tag="Rv3179" /db_xref="GeneID:887947" CDS 3547618..3548907 /locus_tag="Rv3179" /function="UNKNOWN" /note="Rv3179, (MTV014.23), len: 429 aa. Conserved hypothetical protein, highly similar to Q9KH61 PUTATIVE ATP/GTP BINDING PROTEIN from Mycobacterium smegmatis (428 aa), FASTA scores: opt: 2466, E(): 1.5e-148, (89.7% identity in 428 aa overlap) (no article found on the NCBI web site (July 2001)); and to other hypothetical bacterial proteins e.g. O07781|Rv0597c|MTCY19H5.25 from M. tuberculosis (411 aa), FASTA scores: opt: 1031, E(): 8e-58, (41.5% identity in 417 aa overlap); BAB54715|MLR9349 from Rhizobium loti (Mesorhizobium loti) (435 aa), FASTA scores: opt: 365, E(): 1.1e-15, (31.75% identity in 416 aa overlap); etc. Equivalent to AAK47609 from Mycobacterium tuberculosis strain CDC1551 (454 aa) but shorter 25 aa. Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217695.1" /db_xref="GI:15610315" /db_xref="GOA:O53329" /db_xref="UniProtKB/TrEMBL:O53329" /db_xref="GeneID:887947" /translation="MVHDEAGHELIERHMLEQLREVAEYTRVVLINGPRQAGKTTLLQ QLHAELGGWLRSLDVDVERASARADPEGYIMSAPRPTFLDEVQCAGDPLILAIKTATD RDRRPRQFFLSGSTRFLTVPTLSESLAGRVAILDLWPLSVAERSGVRPEIIAQLFTEP QVVLGTEPAPVTRHEYLQLACAGGFPEVVQRPAGRARSRWFSDYLRTVTQRDVRELKR IEQTDRLPRFMRYLAAITAQELNVAEAARVIGVDAGTIRSDLALFETVYLVHRLPAWS RNLTAKIKKRSKIHVVDSGFAAWLRGQSADSLARPTAEGAGPIMETFVINELMKLRAA TELEVDLYHFRDRDGREIDCILQTPDSRVVGVEVKASATVNVHDFRHLSFARDRLGDE FITGVLFYTGARALPFGDRLMALPINLLWNGQSVSSL" misc_feature 3547714..3547737 /locus_tag="Rv3179" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3549254..3549688) /locus_tag="Rv3180c" /db_xref="GeneID:887946" CDS complement(3549254..3549688) /locus_tag="Rv3180c" /function="UNKNOWN" /note="Rv3180c, (MTV014.24c), len: 144 aa. Hypothetical unknown ala-rich protein. Contains probable coiled-coil domain from aa 40 to 70." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217696.1" /db_xref="GI:15610316" /db_xref="UniProtKB/TrEMBL:O53330" /db_xref="GeneID:887946" /translation="MPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEV RAALAAAARNHDLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGAD AVHLASALAVGDPGLVVAVWDRRLHTGAHAAGCRVAPAQLDP" gene complement(3549691..3550143) /locus_tag="Rv3181c" /db_xref="GeneID:888787" CDS complement(3549691..3550143) /locus_tag="Rv3181c" /function="UNKNOWN" /note="Rv3181c, (MTV014.25c), len: 150 aa. Hypothetical protein, with some similarity to other mycobacterium proteins e.g. Q50718|YY07_MYCTU|Rv3407|MT3515|MTCY78.21c (99 aa), FASTA scores: opt: 123, E(): 0.25, (33.7% identity in 89 aa overlap); and O50412|Rv3385c|MTV004.43c (102 aa), FASTA scores: opt: 123, E(): 0.26, (39.7% identity in 68 aa overlap). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217697.1" /db_xref="GI:15610317" /db_xref="UniProtKB/TrEMBL:O53331" /db_xref="GeneID:888787" /translation="MQLGRKVTSHHDIDRFGVASTADESVYRPLPPRLRLAQVNLSRR RCRTQSDMYKSRFSECTVQSVDVSVTELRAHLSDWLDRARAGGEVVITERGIPIARLA ALDSTDTLERLTAEGVIGKATAQRPVAAGRPRPRPQRPVSDRVSDQRR" gene 3550374..3550718 /locus_tag="Rv3182" /db_xref="GeneID:888795" CDS 3550374..3550718 /locus_tag="Rv3182" /function="UNKNOWN" /note="Rv3182, (MTV014.26), len: 114 aa. Hypothetical protein, with some similarity to other hypothetical bacterial proteins e.g. O53468|Rv2022c|MTV018.09c from M. tuberculosis (201 aa), FASTA scores: opt: 335, E(): 3.6e-16, (51.9% identity in 104 aa overlap); and Q9L3R6|ORF119 from Anabaena sp. strain PCC 7120 (119 aa), FASTA scores: opt: 250, E(): 1.6e-10, (42.1% identity in 95 aa overlap). Equivalent to AAK47614 from Mycobacterium tuberculosis strain CDC1551 (94 aa) but longer 20 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217698.1" /db_xref="GI:15610318" /db_xref="UniProtKB/TrEMBL:O53332" /db_xref="GeneID:888795" /translation="MAVILLPQVERWFFALNRDAMASVTGAIDLLEMEGPTLGRPVVD KVNDSTFHNMKELRPAGTSIRILFAFDPARQAILLLGGDKAGNWKRWYDNNIPIADQR SENWLASEHGGG" gene 3550715..3551044 /locus_tag="Rv3183" /db_xref="GeneID:888059" CDS 3550715..3551044 /locus_tag="Rv3183" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3183, (MTV014.27), len: 109 aa. Possible transcriptional regulator, similar to others e.g. Q9S1D9|YPPCP1.08c from Yersinia pestis (99 aa), FASTA scores: opt: 119, E(): 0.47, (40.55% identity in 74 aa overlap); Q9X153|TM1330 from Thermotoga maritima (111 aa), FASTA scores: opt: 115, E(): 0.91, (40.35% identity in 57 aa overlap); P95258|Rv1956|MTCY09F9.08c (alias AAK46277 putative DNA-binding protein from strain CDC1551) (149 aa), FASTA scores: opt: 116, E(): 1, (42.25% identity in 71 aa overlap). Also similar to O53467|Rv2021c|MTV018.08c from Mycobacterium tuberculosis (101 aa), FASTA scores: opt: 214, E(): 5.8e-07, (43.0% identity in 107 aa overlap). Contains probable helix-turn-helix motif from aa 51 to 72 (Score 1803, +5.33 SD). TBparse score is 0.852." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217699.1" /db_xref="GI:15610319" /db_xref="GOA:O53333" /db_xref="UniProtKB/TrEMBL:O53333" /db_xref="GeneID:888059" /translation="MTMARNWRDIRADAVAQGRVDLQRAAVAREEMRDAVLAHRLAEI RKALGHARQADVAALMGVSQARVSKLESGDLSHTELGTLQAYVAALGGHLRIVAEFGE NTVELTA" repeat_region 3551227..3551229 /note="3 bp direct repeat, cga, at 5'-end of IS6110" repeat_region 3551230..3552584 /note="IS6110-12, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-12" repeat_region 3551230..3551257 /note="28 bp inverted repeat at left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 3551281..3551607 /locus_tag="Rv3184" /db_xref="GeneID:888796" CDS 3551281..3551607 /locus_tag="Rv3184" /function="INVOLVED IN THE TRANSPOSITION IN THE INSERTION SEQUENCE IS6110." /note="Rv3184, (MTV014.28), len: 108 aa. Probable IS6110 transposase. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217700.1" /db_xref="GI:15610320" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:888796" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 3551604..3552542 /locus_tag="Rv3185" /db_xref="GeneID:887441" CDS <3551604..3552542 /locus_tag="Rv3185" /function="INVOLVED IN THE TRANSPOSITION IN THE INSERTION SEQUENCE IS6110." /note="Rv3185, (MTV014.29), len: 312 aa. Probable IS6110 transposase. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217701.1" /db_xref="GI:15610321" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:887441" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(3552557..3552584) /note="28 bp inverted repeat at right end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" repeat_region 3552585..3552587 /note="3 bp direct repeat, cga, at 3'-end of IS6110" repeat_region 3552710..3552712 /note="3 bp direct repeat, att, at 5'-end of IS6110" repeat_region 3552713..3554067 /note="IS6110-13, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-13" repeat_region 3552713..3552740 /note="28 bp inverted repeat at left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 3552764..3553090 /locus_tag="Rv3186" /db_xref="GeneID:888024" CDS 3552764..3553090 /locus_tag="Rv3186" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="Rv3186, (MTV014.30), len: 108 aa. Probable IS6110 transposase. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217702.1" /db_xref="GI:15610322" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:888024" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 3553087..3554025 /locus_tag="Rv3187" /db_xref="GeneID:887604" CDS <3553087..3554025 /locus_tag="Rv3187" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="Rv3187, (MTV014.31), len: 312 aa. Probable IS6110 transposase." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217703.1" /db_xref="GI:15610323" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:887604" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(3554040..3554067) /note="28 bp inverted repeat at right end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" repeat_region 3554068..3554070 /note="3 bp direct repeat, att, at 5'-end of IS6110" gene 3554298..3554645 /locus_tag="Rv3188" /db_xref="GeneID:888103" CDS 3554298..3554645 /locus_tag="Rv3188" /function="UNKNOWN" /note="Rv3188, (MTV014.32), len: 115 aa. Conserved hypothetical protein, with similarity to other proteins from Mycobacterium tuberculosis: Q10868|YJ90_MYCTU|Rv1990c|MT2044|MTCY39.29 HYPOTHETICAL PROTEIN (113 aa), FASTA scores: opt: 184, E(): 8.1e-06, (28.45% identity in 109 aa overlap); and O06299|Rv0348|MTCY13E10.08 HYPOTHETICAL PROTEIN (217 aa), FASTA scores: opt: 129, E(): 0.074, (30.0% identity in 100 aa overlap). Also some similarity with C-terminus of Q9XA59|SCGD3.19 PUTATIVE TWO-COMPONENT SYSTEM RESPONSE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (218 aa), FASTA scores: opt: 114, E(): 0.76, (30.0% identity in 110 aa overlap) (for this one, no similarity exists in the N-terminal region with the N-terminus of other regulatory components of sensory transduction systems). TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217704.1" /db_xref="GI:15610324" /db_xref="UniProtKB/TrEMBL:O53334" /db_xref="GeneID:888103" /translation="MAVTLDRAVEASEIVDALKPFGVTQVDVAAVIQVSDRAVRGWRT GDIRPERYDRLAQLRDLVLLLSDSLTPRGVGQWLHAKNRLLDGQRPVDLLAKDRYEDV RSAAESFIDGAYV" gene 3554642..3555262 /locus_tag="Rv3189" /db_xref="GeneID:888034" CDS 3554642..3555262 /locus_tag="Rv3189" /function="UNKNOWN" /note="Rv3189, (MTV014.33), len: 206 aa. Conserved hypothetical protein, weakly similar to other proteins from Mycobacterium tuberculosis e.g. O86329|MBTE|Rv2380c|MTCY22H8.05 (1682 aa), FASTA scores: opt: 135, E(): 0.79, (27.8% identity in 187 aa overlap); and Q10869|YJ89_MYCTU|Rv1989c|MT2043MTCY39.30 (186 aa), FASTA scores: opt: 122, E(): 0.85, (32.25% identity in 93 aa overlap). TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217705.1" /db_xref="GI:15610325" /db_xref="UniProtKB/TrEMBL:O53335" /db_xref="GeneID:888034" /translation="MKLADAIATAPRRTLKGTYWHQGPTRHPVTSCADPARGPGRYHR TGEPGVWYASNKEQGAWAELFRHFVDDGVDPFEVRRRVGRVAVTLQVLDLTDERTRSH LGVDETDLLSDDYTTTQAIAAARDANFDAVLAPAAALPGCQTLAVFVHALPNIEPERS EVRQPPPRLANLLPLIRPHEHMPDSVRRLLATLTRAGAEAIRRRRR" gene complement(3555422..3556687) /locus_tag="Rv3190c" /db_xref="GeneID:888119" CDS complement(3555422..3556687) /locus_tag="Rv3190c" /function="UNKNOWN" /note="Rv3190c, (MTV014.34c), len: 421 aa. Hypothetical unknown protein. TBparse score is 0.937." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217706.1" /db_xref="GI:15610326" /db_xref="UniProtKB/TrEMBL:O53336" /db_xref="GeneID:888119" /translation="MEYVQLFSKGRLNDLAGSLAGFLGKASQATAQRLQSWDADDLLN TPVDDVVEQLVELGSVECPDLRVDDAFMLPATEVDQQYRDWGEQRTRRVTRLVLVVPF EGHKDIFNLRPDQFTTMPPQVLRLQGHEIHLAIDNLSNDAAAINAAFHKQIANIEKYL GWSRRQIDLHNQGLRNELPGMVARRREQLLATRNLQAEIGFPVRRRKDADTYAAPISR KSVRPRPHRPAGARAAFKPEPAMQDEDYQSALRVLRNQRNALERTPSVAAKLDGEEIR DMLLVGLNAQFEGDAGGELFNGAGKTDILIRVDDRNIFIGECKVWSGPRTMDDVLKQL FGYLVWRDTKAAILLFIRNKDVTAVIDNAIAKIKEHPNHKRCPAHRAGADQYEFTMHA DGDPEREIHLTLIPFALRPTAEVPTTTIP" gene complement(3557311..3558345) /locus_tag="Rv3191c" /db_xref="GeneID:887628" CDS complement(3557311..3558345) /locus_tag="Rv3191c" /function="INVOLVED IN THE TRANSPOSITION OF AN INSERTION SEQUENCE." /note="Rv3191c, (MTV014.35c), len: 344 aa. Probable transposase, similar to many especially Q9K2N8 PUTATIVE TRANSPOSASE from Pseudomonas aeruginosa (338 aa), FASTA scores: opt: 837, E(): 1.3e-43, (42.55% identity in 336 aa overlap); Q9RBF4 INSERTION SEQUENCE IS1088 from Alcaligenes eutrophus (Ralstonia eutropha) (342 aa), FASTA scores: opt: 823, E(): 9.2e-43, (43.05% identity in 337 aa overlap); and Q51379 PUTATIVE TRANSPOSASE from Pseudomonas alcaligenes (338 aa), FASTA scores: opt: 818, E(): 1.8e-42, (42.35% identity in 333 aa overlap). Contains probable helix-turn-helix motif from aa 25 to 46 (Score 1968, +5.89 SD)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217707.1" /db_xref="GI:15610327" /db_xref="GOA:O53337" /db_xref="UniProtKB/TrEMBL:O53337" /db_xref="GeneID:887628" /translation="MRQISSRYLSEEERINIADLRRSGLSIRKIADQLGRAPSTVSRE LRRNSRRDGQYRPFEAHRWAVQRRVRRHRRRIDKNPDLCELIAELLAQRWSPQQIARH LRRKYPDDRSMWLCHESIYQAVYQPQSRLIRPPQVKSPHRGPLRTGRTHRRAHLRPGR RRPRFAQPMLSIHQRPFDPADRSEPGHWEGDLIVGKNQGSAIGTLVERQTRLIRLLHL PTHDAYCLRIAITETMSDLPVTLVRSITWDQGIEMARHIDITADLGAPVYFCDSRSPW QRASNENSNGLLRQYFPKGTSLSTYTPDHLRAVEYEINNRPRQVLGHRSPAELFTALL TSPDHQLLRR" repeat_region complement(3557314..3558345) /note="IS1603, len: 1032 bp. Insertion sequence IS1603." /mobile_element="insertion sequence:IS1603" gene complement(3559370..3559443) /locus_tag="Rvnt38" /note="tRNA-Met(CAT)" /db_xref="GeneID:2700465" tRNA complement(3559370..3559443) /locus_tag="Rvnt38" /product="tRNA-Met" /note="codon recognized: AUG" /anticodon=(pos:3559407..3559409,aa:Met) /db_xref="GeneID:2700465" gene 3559563..3560024 /locus_tag="Rv3192" /db_xref="GeneID:887899" CDS 3559563..3560024 /locus_tag="Rv3192" /function="UNKNOWN" /note="Rv3192, (MTV014.36), len: 153 aa. Conserved hypothetical ala- and pro-rich protein, with weak similarity to N-terminal half of several proteins e.g. Q11030|YD60_MYCTU|Rv1360|MT1405|MTCY02B10.24 HYPOTHETICAL 37.3 KDA PROTEIN from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 245, E(): 3.7e-08, (33.1% identity in 157 aa overlap); O30260|AF2411 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (363 aa), FASTA scores: opt: 144, E(): 0.072, (32.6% identity in 92 aa overlap); Q9ZA30|GRA-ORF29 PUTATIVE FMN-DEPENDENT MONOOXYGENASE from Streptomyces violaceoruber (343 aa), FASTA scores: opt: 133, E(): 0.33, (25.15% identity in 159 aa overlap). TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217708.1" /db_xref="GI:15610328" /db_xref="UniProtKB/TrEMBL:O53338" /db_xref="GeneID:887899" /translation="MIPQPLSQLGDLARRPGRRVLCSPKTAAPSISNATVASPAAPGL ELSTGIALAFPRGPFVPAAAAWELQEATSGKFQLGLGTQVRKNVVHRYGMAFHRPGPR LRYLLAVKACFAVFQTGTPDHHGEFDNPDFITAQWSPARIDPPGPSPAGPR" gene complement(3560194..3563172) /locus_tag="Rv3193c" /db_xref="GeneID:888012" CDS complement(3560194..3563172) /locus_tag="Rv3193c" /function="UNKNOWN" /note="Rv3193c, (MTV014.37c), len: 992 aa. Probable conserved transmembrane protein, with hydrophobic N-terminal domain ( 1-340 aa), highly similar to Q9CCM6|ML0644 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Mycobacterium leprae (983 aa), FASTA scores: opt: 5421, E(): 0, (86.15% identity in 989 aa overlap); and O53609|Rv0064|MTV030.07 PUTATIVE MEMBRANE PROTEIN from Mycobacterium tuberculosis strain H37Rv (979 aa), FASTA scores: opt: 3204, E(): 2.1e-142, (50.25% identity in 985 aa overlap). C-terminal part (709-990 aa) highly similar to O32904|MLCB1779.46 HYPOTHETICAL 29.1 KDA PROTEIN from Mycobacterium leprae (277 aa), FASTA scores: opt: 1521, E(): 3.4e-64, (82.6% identity in 282 aa overlap). Also some similarity to hypothetical proteins generally transmembrane e.g. Q9FCI4|2SC3B6.28 from Streptomyces coelicolor (815 aa), FASTA scores: opt: 951, E(): 3.4e-37, (39.2% identity in 826 aa overlap); P72637|SLL1060 from Synechocystis sp. strain PCC 6803 (1032 aa), FASTA scores: opt: 938, E(): 1.6e-36, (29.95% identity in 855 aa overlap); O28851|AF1421 from Archaeoglobus fulgidus (880 aa), FASTA scores: opt: 526, E(): 2.6e-17, (28.05% identity in 970 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217709.1" /db_xref="GI:15610329" /db_xref="GOA:O53339" /db_xref="UniProtKB/Swiss-Prot:O53339" /db_xref="GeneID:888012" /translation="MGMRSAARMPKLTRRSRILIMIALGVIVLLLAGPRLIDAYVDWL WFGELGYRSVFTTMLATRIVVCLVAGVVVGGIVFGGLALAYRTRPVFVPDADNDPVAR YRAVVLARLRLVGIGIPAAIGLLAGIVAQSYWARIQLFLHGGDFGVRDPQFGRDLGFY AFELPFYRLMLSYMLVSVFLAFVANLVAHYIFGGIRLSGRTGALSRSARVQLVSLVGV LVLLKAVAYWLDRYELLSHTRGGKPFTGAGYTDINAVLPAKLILMAIALICAAAVFSA IALRDLRIPAIGLVLLLLSSLIVGAGWPLIVEQISVKPNAAQKESEYISRSITATRQA YGLTSDVVTYRNYSGDSPATAQQVAADRATTSNIRLLDPTIVSPAFTQFQQGKNFYYF PDQLSIDRYLDRNGNLRDYVVAARELNPDRLIDNQRDWINRHTVYTHGNGFIASPANT VRGIANDPNQNGGYPEFLVNVVGANGTVVSDGPAPLDQPRIYFGPVISNTSADYAIVG RNGDDREYDYETNIDTKRYTYTGSGGVPLGGWLARSVFAAKFAERNFLFSNVIGSNSK ILFNRDPAQRVEAVAPWLTTDSAVYPAIVNKRLVWIVDGYTTLDNYPYSELTSLSSAT ADSNEVAFNRLVPDKKVSYIRNSVKATVDAYDGTVTLYQQDEKDPVLKAWMQVFPGTV KPKSDIAPELAEHLRYPEDLFKVQRMLLAKYHVNDPVTFFSTSDFWDVPLDPNPTASS YQPPYYIVAKNIAKDDNSASYQLISAMNRFKRDYLAAYISASSDPATYGNLTVLTIPG QVNGPKLANNAITTDPAVSQDLGVIGRDNQNRIRWGNLLTLPVARGGLLYVEPVYASP GASDAASSYPRLIRVAMMYNDKVGYGPTVRDALTGLFGPGAGATATGIAPTEAAVPPS PAANPPPPASGPQPPPVTAAPPVPVGAVTLSPAKVAALQEIQAAIGAARDAQKKGDFA AYGSALQRLDEAITKFNDAG" gene complement(3563264..3564286) /locus_tag="Rv3194c" /db_xref="GeneID:888062" CDS complement(3563264..3564286) /locus_tag="Rv3194c" /function="UNKNOWN" /note="Rv3194c, (MTV014.38c), len: 340 aa. Possible conserved secreted protein (N-terminal stretch hydrophobic), equivalent to Q9CCM7|ML0643 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (340 aa), FASTA scores: opt: 1822, E(): 1.6e-102, (80.3% identity in 340 aa overlap). Also similar to other proteins e.g. Q9FCI6|2SC3B6.26 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (364 aa), FASTA scores: opt: 430, E(): 1.1e-18, (40.95% identity in 359 aa overlap); Q9S3Y5|SDRC SDRC PROTEIN from Streptomyces coelicolor (241 aa), FASTA scores: opt: 396, E(): 8.9e-17, (35.2% identity in 318 aa overlap) (similarity in part for this one); O34470|YLBL YLBL PROTEIN from Bacillus subtilis (350 aa), FASTA scores: opt: 385, E(): 5.6e-16, (27.7% identity in 350 aa overlap); etc. TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217710.1" /db_xref="GI:15610330" /db_xref="GOA:O53340" /db_xref="UniProtKB/TrEMBL:O53340" /db_xref="GeneID:888062" /translation="MNRRILTLMVALVPIVVFGVLLAVVTVPFVALGPGPTFDTLGEI DGKQVVQIVGTQTYPTSGHLNMTTVSQRDGLTLGEALALWLSGQEQLMPRDLVYPPGK SREEIENDNAADFKRSEAAAEYAALGYLKYPKAVTVASVMDPGPSVDKLQAGDAIDAV DGTPVGNLDQFTALLKNTKPGQEVTIDFRRKNEPPGIAQITLGKNKDRDQGVLGIEVV DAPWAPFAVDFHLANVGGPSAGLMFSLAVVDKLTSGHLVGSTFVAGTGTIAVDGKVGQ IGGITHKMAAARAAGATVFLVPAKNCYEASSDSPPGLKLVKVETLSQAVDALHAMTSG SPTPSC" gene 3564364..3565782 /locus_tag="Rv3195" /db_xref="GeneID:888900" CDS 3564364..3565782 /locus_tag="Rv3195" /function="UNKNOWN" /note="Rv3195, (MTV014.39), len: 472 aa. Hypothetical protein, equivalent to Q49746|ML0642|B1937_C3_231 HYPOTHETICAL 50.3 KDA PROTEIN from Mycobacterium leprae (479 aa), FASTA scores: opt: 2503, E(): 1e-138, (79.35% identity in 475 aa overlap). Similar in part to Q9FCI9|2SC3B6.23c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (487 aa), FASTA scores: opt: 1382, E(): 2.7e-73, (46.4% identity in 489 aa overlap); Q9X8I7|SCE9.14 HYPOTHETICAL 41.2 KDA PROTEIN from Streptomyces coelicolor (375 aa), FASTA scores: opt: 319, E(): 2.4e-11, (25.6% identity in 383 aa overlap); etc. TBparse score is 0.893." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217711.1" /db_xref="GI:15610331" /db_xref="UniProtKB/TrEMBL:O53341" /db_xref="GeneID:888900" /translation="MSTGEVMGDLPFGFSSGDDPPEDPSGRDKRGKDGADSGSGANPL GAFGIGGEFNMADLGQIFTRLGEMFGGVGTAMAAGKTSGPVNYDLARQVASSSIGFIA PIPAATNSAIADAVHLADTWLDGATSLPAGATKAVGWSPTDWVDNTLATWKRLCDPMA QQISTVWASSLPEEAKSMAGPLLSIMSQMGGIAFGSQLGQALGRLSREVLTSTDIGLP LGPKGVAAILPGAVESFAAGLEQPRSEILTFLATREAAHHRLFSHVPWLASQLLGAVE AYAMGMKIDMTGIEELARDINPTSLADPAAMEQLLSQGVFEPKATPAQTQALERLETL LALIEGWVQTVVTAALGERIPGEAALSETLRRRRASGGPAEQTFATLVGLELRPRKLR EAGALWERLTRAVGMDARDAVWQHPDLLPATDDLDDPAAFIDRVIGGDTSGIDEAIAE LERDQQARGADDSGHDGGPVDN" gene 3565788..3566687 /locus_tag="Rv3196" /db_xref="GeneID:888903" CDS 3565788..3566687 /locus_tag="Rv3196" /function="UNKNOWN" /note="Rv3196, (MTV014.40), len: 299 aa. Hypothetical protein, with some similarity to other hypothetical proteins e.g. Q9FCJ5|2SC3B6.17c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (442 aa), FASTA scores: opt: 233, E(): 3.5e-07, (29.9% identity in 261 aa overlap). TBparse score is 0.936." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217712.1" /db_xref="GI:15610332" /db_xref="UniProtKB/TrEMBL:O53342" /db_xref="GeneID:888903" /translation="MSARSVAPSQVMRRAASALYSLNPAMPVLLRPDGAVQVGWDPRR AVLVRPPRGLTATGLAALLRSMRSPIPITELQRQAAERGLVDGDAMANLVAQLVGAGV ATPLANPGNLDSRRRAASIRVHGRGPLSDLLVQALRCSGARIRHSSQPHAAVTPAGVD LVVLSDYLVADPHMVRDLHTERVPHLPVRVRDGTGMVGPLVVPGVTSCLGCADLHRSD RDAAWPAIAAQLRDTVGVADRATLLATAALALSQVNRVIAAVRGQEATPEPPSALNTT LEFDLNAGSIVARQWTRHPRCFC" gene complement(3566696..3566896) /locus_tag="Rv3196A" /db_xref="GeneID:3205082" CDS complement(3566696..3566896) /locus_tag="Rv3196A" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3196A, len: 66 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177939.1" /db_xref="GI:57117069" /db_xref="UniProtKB/TrEMBL:Q8VJ55" /db_xref="GeneID:3205082" /translation="MQEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDV LDTLARAYASISTNVPEQGRLG" gene 3567024..3568367 /locus_tag="Rv3197" /db_xref="GeneID:887858" CDS 3567024..3568367 /locus_tag="Rv3197" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT ACROSS THE MEMBRANE. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv3197, (MTV014.41), len: 447 aa. Probable conserved ATP-binding protein ABC transporter, highly similar to Mycobacterium leprae proteins: Q9CCM8|ML0640 HYPOTHETICAL PROTEIN (473 aa), FASTA scores: opt: 2512, E(): 2.1e-140, (83.0% identity in 447 aa overlap). Interestingly, the N-terminal half (1-219 aa) corresponds to Q49747|ABC1|B1937_C3_233 ABC1 PROTEIN from Mycobacterium leprae (267 aa), FASTA scores: opt: 1276, E(): 6.3e-68, (88.6% identity in 219 aa overlap); and the C-terminal half (239-447 aa) corresponds to Q49745|B1937_C2_179 HYPOTHETICAL 23.1 KDA PROTEIN (206 aa), FASTA scores: opt: 1138, E(): 6.5e-60, (77.05% identity in 209 aa overlap); two adjacent orfs from Mycobacterium leprae. Also highly similar to other proteins (generally ABC transporters) e.g. Q9FCJ6|2SC3B6.16c HYPOTHETICAL 51.3 KDA PROTEIN from Streptomyces coelicolor (469 aa), FASTA scores: opt: 1340, E(): 1.8e-71, (45.9% identity in 449 aa overlap); O65576|ABC1AT ABC1 PROTEIN (alias Q9SBB2|T15B16.14|AT4G01660 PUTATIVE ABC TRANSPORTER) from Arabidopsis thaliana (Mouse-ear cress) (623 aa), FASTA scores: opt: 543, E(): 1.7e-24, (28.4% identity in 405 aa overlap); O27682|MTH1645 ABC TRANSPORTER from Methanobacterium thermoautotrophicum (623 aa), FASTA scores: opt: 497, E(): 7.8e-22, (33.0% identity in 309 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.892." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="NP_217713.1" /db_xref="GI:15610333" /db_xref="GOA:O53343" /db_xref="UniProtKB/TrEMBL:O53343" /db_xref="GeneID:887858" /translation="MDDGSVSDIKRGRAARNAKLASIPVGFAGRAALGLGKRLTGKSK DEVTAELMEKAANQLFTVLGELKGGAMKVGQALSVMEAAIPDEFGEPYREALTKLQKD APPLPASKVHRVLDGQLGTKWRERFSSFNDTPVASASIGQVHKAIWSDGREVAVKIQY PGADEALRADLKTMQRMVGVLKQLSPGADVQGVVDELVERTEMELDYRLEAANQRAFA KAYHDHPRFQVPHVVASAPKVVIQEWIEGVPMAEIIRHGTTEQRDLIGTLLAELTFDA PRRLGLMHGDAHPGNFMLLPDGRMGIIDFGAVAPMPGGFPIELGMTIRLAREKNYDLL LPTMEKAGLIQRGRQVSVREIDEMLRQYVEPIQVEVFHYTRKWLQKMTVSQIDRSVAQ IRTARQMDLPAKLAIPMRVIASVGAILCQLDAHVPIKALSEELIPGFAEPDAIVV" misc_feature 3567129..3567152 /locus_tag="Rv3197" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3568401..3568679) /gene="whiB7" /locus_tag="Rv3197A" /db_xref="GeneID:3205083" CDS complement(3568401..3568679) /gene="whiB7" /locus_tag="Rv3197A" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3197A, len: 92 aa. Probable whiB7 (alternate gene name: whmC), WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to Q49765|WHIB7|ML0639|B1937_F2_68 PUTATIVE TRANSCRIPTIONAL REGULATOR WHIB7 from Mycobacterium leprae (89 aa), FASTA scores: opt: 441, E(): 6.3e-24, (69.3% identity in 88 aa overlap). Similar to Q9FCJ8|2SC3B6.14 PUTATIVE DNA-BINDING PROTEIN from Streptomyces coelicolor (122 aa), FASTA scores: opt: 348, E(): 2.2e-17, (57.7% identity in 78 aa overlap); Q9AD55|SCP1.95 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (102 aa), FASTA scores: opt: 166, E(): 7.1e-05, (39.4% identity in 76 aa overlap); etc.; whmC" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB7" /protein_id="YP_177940.1" /db_xref="GI:57117070" /db_xref="GOA:Q6MX01" /db_xref="UniProtKB/TrEMBL:Q6MX01" /db_xref="GeneID:3205083" /translation="MSVLTVPRQTPRQRLPVLPCHVGDPDLWFADTPAGLEVAKTLCV SCPIRRQCLAAALQRAEPWGVWGGEIFDQGSIVSHKRPRGRPRKDAVA" gene complement(3569109..3571211) /gene="uvrD2" /locus_tag="Rv3198c" /db_xref="GeneID:888902" CDS complement(3569109..3571211) /gene="uvrD2" /locus_tag="Rv3198c" /EC_number="3.6.1.-" /function="INVOLVED IN NUCLEOTIDE EXCISION REPAIR. HAS BOTH ATPASE AND HELICASE ACTIVITIES. UNWINDS DNA DUPLEXES WITH 3' TO 5' POLARITY WITH RESPECT TO THE BOUND STRAND AND INITIATES UNWINDING MOST EFFECTIVELY WHEN A SINGLE-STRANDED REGION IS PRESENT. INVOLVED IN THE POSTINCISION EVENTS OF NUCLEOTIDE EXCISION REPAIR AND METHYL-DIRECTED MISMATCH REPAIR." /note="Rv3198c, (MTV014.42c), len: 700 aa. Probable UvrD2, ATP dependent DNA helicase II (EC 3.6.1.-) (see citation below), equivalent to P53528|UVRD_MYCLE|VRD|UVRD2|ML0637|B1937_F1_27 PROBABLE DNA HELICASE II HOMOLOG from Mycobacterium leprae (714 aa), FASTA scores: opt: 3749, E(): 0, (82.85% identity in 706 aa overlap); and C-terminal half (466-700 aa) corresponds to Q49764|RECQ|B1937_F2_66 PUTATIVE DNA HELICASE RECQ (EC 3.6.1.-) (242 aa), FASTA scores: opt: 1267, E(): 1.4e-69, (82.5% identity in 234 aa overlap); products of two adjacent ORFS in Mycobacterium leprae. Also similar to other DNA helicases e.g. Q9FCK0|2SC3B6.12 from Streptomyces coelicolor (785 aa), FASTA scores: opt: 1687, E(): 1.2e-94, (52.05% identity in 728 aa overlap); P71561|CRA|IVRD|Rv0949|MT0976|MTCY10D7.25c ATP-DEPENDENT DNA HELICASE PCRA from Mycobacterium tuberculosis (771 aa), FASTA scores: opt: 715, E(): 1e-35, (34.1% identity in 710 aa overlap); Q9CD72|PCRA_MYCLE|UVRD|ML0153 ATP-DEPENDENT DNA HELICASE PCRA from Mycobacterium leprae (778 aa), FASTA scores: opt: 687, E(): 5.1e-34, (32.0% identity in 719 aa overlap); O83991|TP1028 DNA HELICASE II (UVRD) from Treponema pallidum (670 aa), FASTA scores: opt: 652, E(): 6e-32, (30.25% identity in 671 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE UVRD SUBFAMILY OF HELICASES. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase II UVRD2" /protein_id="NP_217714.1" /db_xref="GI:15610334" /db_xref="GOA:P64320" /db_xref="UniProtKB/Swiss-Prot:P64320" /db_xref="GeneID:888902" /translation="MSIASDPLIAGLDDQQREAVLAPRGPVCVLAGAGTGKTRTITHR IASLVASGHVAAGQVLAVTFTQRAAGEMRSRLRALDAAARTGSGVGAVQALTFHAAAY RQLRYFWSRVIADTGWQLLDSKFAVVARAASRTRLHASTDDVRDLAGEIEWAKASLIG PEEYVTAVAAARRDPPLDAAQIAAVYSEYEALKARGDGVTLLDFDDLLLHTAAAIEND AAVAEEFQDRYRCFVVDEYQDVTPLQQRVLSAWLGDRDDLTVVGDANQTIYSFTGASP RFLLDFSRRFPDAAVVRLERDYRSTPQVVSLANRVIAAARGRVAGSKLRLSGQREPGP VPSFHEHSDEPAEAATVAASIARLIASGTPPSEVAILYRVNAQSEVYEEALTQAGIAY QVRGGEGFFNRQEIKQALLALQRVSERDTDAALSDVVRAVLAPLGLTAQPPVGTRARE RWEALTALAELVDDELAQRPALQLPGLLAELRRRAEARHPPVVQGVTLASLHAAKGLE WDAVFLVGLADGTLPISHALAHGPNSEPVEEERRLLYVGITRARVHLALSWALSRSPG GRQSRKPSRFLNGIAPQTRADPVPGTSRRNRGAAARCRICNNELNTSAAVMLRRCETC AADVDEELLLQLKSWRLSTAKEQNVPAYVVFTDNTLIAIAELLPTDDAALIAIPGIGA RKLEQYGSDVLQLVRGRT" misc_feature complement(3571098..3571121) /gene="uvrD2" /locus_tag="Rv3198c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3571335..3571589 /locus_tag="Rv3198A" /db_xref="GeneID:3205084" CDS 3571335..3571589 /locus_tag="Rv3198A" /function="UNKNOWN" /note="Rv3198A, len: 84 aa. Possible glutaredoxin protein (EC 1.-.-.-), highly similar to Q9FCK1|2SC3B6.11c PUTATIVE GLUTAREDOXIN-LIKE PROTEIN from Streptomyces coelicolor (80 aa), FASTA scores: opt: 293, E(): 2.2e-14, (55.15% identity in 78 aa overlap); and Q9RSN9|DR2085 PUTATIVE GLUTAREDOXIN from Deinococcus radiodurans (81 aa), FASTA scores: opt: 198, E(): 1.2e-07, (53.55% identity in 56 aa overlap). Also similar to several hypothetical bacterial proteins e.g. Q9X8C2|SCE36.09 HYPOTHETICAL 13.0 KDA PROTEIN from Streptomyces coelicolor (114 aa), FASTA scores: opt: 181, E(): 2.6e-06, (44.45% identity in 72 aa overlap)." /codon_start=1 /transl_table=11 /product="glutaredoxin protein" /protein_id="YP_177941.1" /db_xref="GI:57117071" /db_xref="GOA:Q8VJ51" /db_xref="UniProtKB/TrEMBL:Q8VJ51" /db_xref="GeneID:3205084" /translation="MITAALTIYTTSWCGYCLRLKTALTANRIAYDEVDIEHNRAAAE FVGSVNGGNRTVPTVKFADGSTLTNPSADEVKAKLVKIAG" gene complement(3571602..3572543) /gene="nudC" /locus_tag="Rv3199c" /db_xref="GeneID:887860" CDS complement(3571602..3572543) /gene="nudC" /locus_tag="Rv3199c" /EC_number="3.6.1.22" /function="INVOLVED IN NICOTINATE AND NICOTINAMIDE METABOLISM. GENERATES AMP AND NMN FROM NAD(+) AND H(2)O. ACTING ON ACID ANHYDRIDES, IN PHOSPHORUS-CONTAINING ANHYDRIDES. ALSO ACTS ON NADP+, 3-ACETYLPYRIDINE AND THE THIONICOTINAMIDE ANALOGUES OF NAD+ AND NADP+ [CATALYTIC ACTIVITY: NADH + H(2)O = AMP + NMNH]." /note="can catalyze hydrolysis of broad range of dinucleotide pyrophosphates but prefers reduced form of NADH; requires divalent metal ions such as magnesium and manganese and produces two mononucleoside 5'-phosphates" /codon_start=1 /transl_table=11 /product="NADH pyrophosphatase" /protein_id="NP_217715.1" /db_xref="GI:15610335" /db_xref="GOA:O53345" /db_xref="UniProtKB/Swiss-Prot:O53345" /db_xref="GeneID:887860" /translation="MTNVSGVDFQLRSVPLLSRVGADRADRLRTDMEAAAAGWPGAAL LRVDSRNRVLVANGRVLLGAAIELADKPPPEAVFLGRVEGGRHVWAVRAALQPIADPD IPAEAVDLRGLGRIMDDTSSQLVSSASALLNWHDNARFSALDGAPTKPARAGWSRVNP ITGHEEFPRIDPAVICLVHDGADRAVLARQAAWPERMFSLLAGFVEAGESFEVCVARE IREEIGLTVRDVRYLGSQQWPFPRSLMVGFHALGDPDEEFSFSDGEIAEAAWFTRDEV RAALAAGDWSSASESKLLLPGSISIARVIIESWAACE" misc_feature complement(3571878..3571937) /gene="nudC" /locus_tag="Rv3199c" /note="PS00893 mutT domain signature" gene complement(3572602..3573669) /locus_tag="Rv3200c" /db_xref="GeneID:887543" CDS complement(3572602..3573669) /locus_tag="Rv3200c" /function="THOUGHT TO BE INVOLVED IN CATION TRANSPORT ACROSS THE MEMBRANE." /note="Rv3200c, (MTV014.44c), len: 355 aa. Possible transmembrane cation transporter, similar to many transmembrane proteins and putative potassium channels e.g. Q9XA52|SCGD3.27C PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (365 aa), FASTA scores: opt: 1022, E(): 2.6e-53, (49.85% identity in 325 aa overlap); Q9RRZ3|DR2336 PUTATIVE POTASSIUM CHANNEL from Deinococcus radiodurans (320 aa), FASTA scores: opt: 436, E(): 1e-18, (30.9% identity in 304 aa overlap); O28600|AF1673 PUTATIVE POTASSIUM CHANNEL from Archaeoglobus fulgidus (314 aa), FASTA scores: opt: 363, E(): 2.1e-14, (27.2% identity in 309 aa overlap); Q57604|Y13B_METJAMJ0138.1|MJ0138.1 PUTATIVE POTASSIUM CHANNEL from Methanococcus jannaschii (333 aa), FASTA scores: opt: 356, E(): 5.7e-14, (26.0% identity in 281 aa overlap); P73132|SLL0993 POTASSIUM CHANNEL from Synechocystis sp. strain PCC 6803 (365 aa), FASTA scores: opt: 330, E(): 2.1e-12, (27.8% identity in 324 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.904." /codon_start=1 /transl_table=11 /product="transmembrane cation transporter" /protein_id="NP_217716.1" /db_xref="GI:15610336" /db_xref="GOA:O53346" /db_xref="UniProtKB/TrEMBL:O53346" /db_xref="GeneID:887543" /translation="MAGSWRRLRGLNEKLTAQPGYALVGVLRIPQRRASPARVISRRV VVAVVALLLTAGIVYVDRDGYLDAQGDRLTFLDCLYYAAVTLSTTGYGDITPISEFAR AINIFVITPLRIAFLILLVGTTLEVLTETSRQAYKIQRWRSRVRNHTVVIGYGTKGKT AVAAMVSDELVPGEIVVVDTDSGVLERAAAAGLVTVHGDATKSDVLRLAGTQHASSII VATSRDDTAVLVTLTAREIAPKAKIVASIREAENQHLLRQSGADTVVVSSETAGRLLG IATTTPSVVEMIEDLLTPEAGLAVAEREVEQAEVGGSPRHLRDIVLGVVRDGQLLRIG APEVDAIEASDRLLYIRQVGR" misc_feature complement(3573190..3573213) /locus_tag="Rv3200c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3573731..3577036) /locus_tag="Rv3201c" /db_xref="GeneID:887663" CDS complement(3573731..3577036) /locus_tag="Rv3201c" /EC_number="3.6.1.-" /function="HAS BOTH ATPASE AND HELICASE ACTIVITIES" /note="Rv3201c, (MTV014.45c), len: 1101 aa. Probable ATP-dependent DNA helicase (EC 3.6.1.-), similar to others e.g. Q9FCK4|2SC3B6.08 from Streptomyces coelicolor (1222 aa), FASTA scores: opt: 1209, E(): 5.4e-63, (38.45% identity in 1199 aa overlap); P71561|PCRA_MYCTU|CRA|IVRD|Rv0949|MT0976|MTCY10D7.25c from Mycobacterium tuberculosis (771 aa), FASTA scores: opt: 403, E(): 6.5e-16, (28.15% identity in 717 aa overlap); Q9FCK5|2SC3B6.07 from Streptomyces coelicolor (1159 aa), FASTA scores: opt: 349, E(): 1.3e-12, (29.2% identity in 1144 aa overlap); Q9L3M1|UVRD from Prochlorococcus sp. (512 aa; fragment), FASTA scores: opt: 290, E(): 2e-09, (27.95% identity in 479 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase" /protein_id="NP_217717.1" /db_xref="GI:15610337" /db_xref="GOA:O53347" /db_xref="UniProtKB/TrEMBL:O53347" /db_xref="GeneID:887663" /translation="MTQTAAPARYSPAELACALGLFPPTAEQAAVIAAPPGPLVVIAG AGAGKTETMAARVVWLVANGYAEPGQVLGLTFTRKAAGQLLRRVRSRLARLAGIGLGC GDPAACAPVVSTYHAFAGSLLRDYGLLLPLEPDTRLLSETELWQLAFDVVSGYDGVLC TDKSPAAVTSIVVRLWGQLGEHLVDTRALRDTHVELERLVHALPAGRYQRDRGPSQWL LRMLATQTQRAELVPLLDALGERMHAGKVMDFAMQMASAARLAATSPQVGQDLRRRYR VVLLDEYQDTGHAQRVVLSSLFGGGVDDGLALTAVGDPIQSIYGWRGASATNLPRFTT DFPLSDGTPAPVLELLTSWRNPPQALRVANGISAEARRRSVAVRALRPRPDAPPGAVR CALLPDVQAEREWIADHLRMRYQRAEADGVKPPTAAVLVRRNADAAAIADTLRARGIP AEVVGLAGLLSIPEVAEVVAMLRLVADPTAGAAAMRVLTGPRWRLGARDLAALWRRAL TLSGESPSTASPESIAMAASADADNPCLADAISDPGSAEGYSVAGYGRIGALAGELSA LRGRLGHSLPDLVAEVRRVLGVDCEVRASAPVSGGWAGPEHLDAFADVVAGYAERASA RSSEASVAGLLAYLDVAEVVENGLPPAELTVACDRVQVLTVHAAKGLEWQVVAVAHLS RGVFPSTVSRSSWLTDPAELPPLLRGDRASAGAHGIPVLDTSAVADRKQLSDKISEHR RLLDRRRVDEERRLLYVAVTRAEDTLLVSGHHWGPTGTKPRGPSEFLCELKDIIDRSA AAGDPCGVVEQWASAPAGDERNPLCDNAIEAVWPADPLAARRGDVERGAALVAAAMSA DLPGSTTDIDHPPRPGDAPWSTDVDALLAERAHAARGAPARGLPNHLSVSSLVELVGD PVGARQRLMCRLPKRPDPHAWLGDAFHAWVQQFYGAELLFDLGDLPGAADREVGDPEE LAALQRAFTASSWAARTPAAVEVPFEMPIGDTVVRGRIDAVFVDPDGGATVVDWKTGK PPHGPAAMRQAAVQLAVYRLAWAALRGCPTSSVRTAFYYVRSGITVVPDELPAPGELA MLLTDCAGRRSDT" misc_feature complement(3576887..3576910) /locus_tag="Rv3201c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3577033..3580200) /locus_tag="Rv3202c" /db_xref="GeneID:887574" CDS complement(3577033..3580200) /locus_tag="Rv3202c" /EC_number="3.6.1.-" /function="HAS BOTH ATPASE AND HELICASE ACTIVITIES" /note="Rv3202c, (MTCY07D11.24, MTV014.46c), len: 1055 aa. Possible ATP-dependent DNA helicase (EC 3.6.1.-), showing some similarity to UvrD proteins e.g. Q9FCK5|2SC3B6.07 PUTATIVE ATP-DEPENDENT DNA HELICASE from Streptomyces coelicolor (1159 aa), FASTA scores: opt: 666, E(): 1e-29, (34.5% identity in 1154 aa overlap); Q9L7T3|UVRD|PA5443 MISMATCH REPAIR PROTEIN MUTU (DNA HELICASE II) from Pseudomonas aeruginosa (728 aa), FASTA scores: opt: 239, E(): 7.3e-06, (23.8% identity in 677 aa overlap) (no similarity in C-terminal part for this one); etc. C-terminal region similar to Q9FDU2|ORF3 ORF3 PROTEIN (FRAGMENT) from Streptomyces griseus (551 aa), FASTA scores: opt: 800, E(): 1.7e-37, (36.2% identity in 525 aa overlap); and Q9ZG15 HYPOTHETICAL 35.5 KDA PROTEIN from Rhodococcus erythropolis (323 aa), FASTA scores: opt: 232, E(): 9.7e-06, (28.55% identity in 266 aa overlap)." /codon_start=1 /transl_table=11 /product="ATP-dependent DNA helicase" /protein_id="NP_217718.1" /db_xref="GI:15610338" /db_xref="GOA:O53348" /db_xref="UniProtKB/TrEMBL:O53348" /db_xref="GeneID:887574" /translation="MSHIWGVEAGAALAPGLRGPVLVLGGPGTGKSTLLVEAAVAHIG AGTDPESVLLLTGSGRMGMRARSALTTALLRSRTNGPCRAAIREPVVRTVHSYAYAVL RKAAQRAGDALPRLLTSAEQDAIIRELLAGDAEDGPAATTTWPAHLRPALTTAGFATE LRNLLARCAERGLDPLELQQLGRRRGRPEWIAAGQFAQRYEQVMLLRGAVGLAAPQAT APALSAAELVGAALEAFAVDPELLAAERARVRTLLVDDAQQLDPQAARLVRMLAAGTE LALIAGDPNQAVFGFRGGEPTGLLADDPPPAGGAPIPSVTLTVSHRCAPAVARAVTGI ARRLPGRSVGRRIEGTGTEVGSVTVRLAGSAHAEAAMIADALRRAHLIDGVPWSQMAV IVRSVPRAVRLPRALAAAGVPVAPPAVGGPLSAEPAVRALLTVLEATADGLDGDQALL LLTGPIGGVDPVSLRQLRRTLQRARPGQTSRKFGDLLVEVLGGDAPPSGPGSRALRRV RAVLTAAARCHRSGSLGGQDPRHTLWAAWQRSGLQRRWLAASEHGGAAAVQATRDLET VTALFDITDHYVSRTSGASLRGLVEHVTALQLPVVRPEPAAPTEQVMVLSAHAALGHE WDLVVIAGLQDGLWPNTVPRGGVLGTQRLLDELDGVTKDASMRAPLLAEERRLLVTAM GRARRRLLVTAVDSDAGGGGHEAVLPSAFFFEIAQWADGDGEPVAMQPVSAPRVLSAA AVVGRLRVVVCAPACAVDDADRDCAATQLARLAKAGVPGADPSEWHGLAPVSTSDPLC DSDDLVTLTPSTLQALNDCPLRWLAERHGGTNTRELPSAVGSVLHALFAEPGRSESQL LAELDRVWGHLPFGAQWYSANELARHRAMIQAFVQWRAQSRSELTEVGVEVDIDGALE DGSGQARKIRLRGRADRLERDPAGRLVIVDIKTGKTPVSKDDAQQHAQLAMYQLAVAE GLVRAGDEPGGARLVYVGKSGAAGVAERKQDPLTPAARDEWRNLVRQLAAATAGPQFI ARRNDGCTHCPLRPGCPAHVRGSAP" gene 3580638..3581312 /gene="lipV" /locus_tag="Rv3203" /db_xref="GeneID:888133" CDS 3580638..3581312 /gene="lipV" /locus_tag="Rv3203" /EC_number="3.1.-.-" /function="UNKNOWN; PRESUMED LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /note="Rv3203, (MTCY07D11.23c), len: 224 aa. Possible lipV, hydrolase lipase (EC 3.1.-.-), showing some similarity to other lipases e.g. Q9JSN0|NMA2216 PUTATIVE HYDROLASE from Neisseria meningitidis (serogroup A) (312 aa), FASTA scores: opt: 192, E(): 0.00016, (45.2% identity in 73 aa overlap); Q9RK95|SCF1.09 PUTATIVE HYDROLASE from Streptomyces coelicolor (258 aa), FASTA scores: opt: 188, E(): 0.00024, (30.1% identity in 226 aa overlap); Q9KZC3|SC6F7.19c PUTATIVE LIPASE from Streptomyces coelicolor (269 aa), FASTA scores: opt: 179, E(): 0.00086, (36.35% identity in 121 aa overlap); etc. Equivalent to AAK47641 Hydrolase, alpha/beta hydrolase family from Mycobacterium tuberculosis strain CDC1551 (261 aa) but shorter 37 aa. Contains serine active site signature of lipases (PS00120)." /codon_start=1 /transl_table=11 /product="lipase LipV" /protein_id="NP_217719.1" /db_xref="GI:15610339" /db_xref="GOA:O05863" /db_xref="UniProtKB/TrEMBL:O05863" /db_xref="GeneID:888133" /translation="MPEIPIAAPDLLGHGRSPWAAPWTIDANVSALAALLDNQGDGPV VVVGHSFGGAVAMHLAAARPDQVAALVLLDPAVALDGSRVREVVDAMLASPDYLDPAE ARAEKATGAWADVDPPVLDAELDEHLVALPNGRYGWRISLPAMVCYWSELARDIVLPP VGTATTLVRAVRASPAYVSDQLLAALDKRLGADFELLDFDCGHMVPQAKPTEVAAVIR SRLGPR" misc_feature 3580767..3580796 /gene="lipV" /locus_tag="Rv3203" /note="PS00120 Lipases, serine active site" gene 3581315..3581620 /locus_tag="Rv3204" /db_xref="GeneID:888132" CDS 3581315..3581620 /locus_tag="Rv3204" /EC_number="2.1.1.-" /function="CAUSES METHYLATION." /note="Rv3204, (MTCY07D11.22c), len: 101 aa. Possible DNA methyltransferase (EC 2.1.1.-), similar to many hypothetical bacteriel proteins and methyltransferases e.g. Q9KT40|VC1065 METHYLATED-DNA--PROTEIN-CYSTEINE METHYLTRANSFERASE-RELATED PROTEIN from Vibrio cholerae (100 aa), FASTA scores: opt: 170, E(): 2.8e-05, (34.35% identity in 99 aa overlap); Q9UTN9|SPAC1250.04c PUTATIVE METHYLTRANSFERASE from Schizosaccharomyces pombe (Fission yeast) (108 aa), FASTA scores: opt: 161, E(): 0.00013, (36.65% identity in 101 aa overlap); Q9YDF4|APE0959 175 AA LONG HYPOTHETICAL METHYLATED-DNA--PROTEIN-CYSTEINE METHYLTRANSFERASE from Aeropyrum pernix (175 aa), FASTA scores: opt: 144, E(): 0.003, (37.95% identity in 87 aa overlap); Q50855 PUTATIVE METHYLGUANINE-DNA METHYLTRANSFERASE from Myxococcus xanthus (147 aa), FASTA scores: opt: 141, E(): 0.0041, (37.65% identity in 93 aa overlap); etc." /codon_start=1 /transl_table=11 /product="DNA-methyltransferase (modification methylase)" /protein_id="NP_217720.1" /db_xref="GI:15610340" /db_xref="GOA:O05862" /db_xref="UniProtKB/TrEMBL:O05862" /db_xref="GeneID:888132" /translation="MAPVTDEQVELVRSLVAAIPLGRVSTYGDIAALTGLSSPRIVGW IMRTDSSDLPWHRVIRASGRPAQHLATRQLELLRAEGVLSVDGRVALSEIRYEFPPG" gene complement(3581627..3582505) /locus_tag="Rv3205c" /db_xref="GeneID:888877" CDS complement(3581627..3582505) /locus_tag="Rv3205c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3205c, (MTCY07D11.21), len: 292 aa. Hypothetical protein, highly similar to Q9CCG7|ML0818 HYPOTHETICAL PROTEIN from Mycobacterium leprae (297 aa), FASTA scores: opt: 1745, E(): 9.1e-98, (87.3% identity in 291 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217721.1" /db_xref="GI:15610341" /db_xref="UniProtKB/TrEMBL:O05861" /db_xref="GeneID:888877" /translation="MGSTRLTGVNVEPPPEHVLVAFGLAGAQPILLGAGWEGGWRCGE VVLSMVADNARAAWSARVRETLFVDGVRLARPVRSTDGRYVVSGWRADTFVAGAPEPR HDEVVSAAVRLHEATGKLERPRFLTQGPAAPWAEIDVFVAADRAGWEERPLQSVPPGV PTAPPAADPQRSIDLINQLAGLRKPTKSPNQLVHGDLYGTVLFAGTAPPGITDITPYW RPASWAAGVAVVDALSWGAADDGLIERWNALPEWPQMLLRALMFRLAVYALHPRSTAE AFPGLAHTAALVRLVL" gene complement(3582532..3583710) /gene="moeB1" /locus_tag="Rv3206c" /db_xref="GeneID:888871" CDS complement(3582532..3583710) /gene="moeB1" /locus_tag="Rv3206c" /function="POSSIBLY INVOLVED IN MOLYBDOPTERIN METABOLISM (SYNTHESIS)" /experiment="experimental evidence, no additional details recorded" /note="The proteins in this cluster have high sequence similarity to MoeB and are possibly involved in the synthesis of molybdopterin, but there has been no biochemical or physiological characterization. There is also no genetic linkage to other molybdopterin cofactor synthesis proteins. These proteins are similar to a Pseudomonas stutzeri protein which is essential to pyridine-2,6-bis(thiocarboxylic acid) synthesis that possibly activates a substrate by adenylation" /codon_start=1 /transl_table=11 /product="molybdopterin biosynthesis-like protein MoeZ" /protein_id="YP_177942.1" /db_xref="GI:57117072" /db_xref="GOA:Q7D5X9" /db_xref="UniProtKB/TrEMBL:Q7D5X9" /db_xref="GeneID:888871" /translation="MSTSLPPLVEPASALSREEVARYSRHLIIPDLGVDGQKRLKNAR VLVIGAGGLGAPTLLYLAAAGVGTIGIVDFDVVDESNLQRQVIHGVADVGRSKAQSAR DSIVAINPLIRVRLHELRLAPSNAVDLFKQYDLILDGTDNFATRYLVNDAAVLAGKPY VWGSIYRFEGQASVFWEDAPDGLGVNYRDLYPEPPPPGMVPSCAEGGVLGIICASVAS VMGTEAIKLITGIGETLLGRLLVYDALEMSYRTITIRKDPSTPKITELVDYEQFCGVV ADDAAQAAKGSTITPRELRDWLDSGRKLALIDVRDPVEWDIVHIDGAQLIPKSLINSG EGLAKLPQDRTAVLYCKTGVRSAEALAAVKKAGFSDAVHLQGGIVAWAKQMQPDMVMY" gene complement(3583801..3584658) /locus_tag="Rv3207c" /db_xref="GeneID:888874" CDS complement(3583801..3584658) /locus_tag="Rv3207c" /function="UNKNOWN" /note="Rv3207c, (MTCY07D11.19), len: 285 aa. Hypothetical protein, highly similar but shorter (57 aa) to Q9CCG9|ML0816 HYPOTHETICAL PROTEIN from Mycobacterium leprae (341 aa), FASTA scores: opt: 1676, E(): 9.7e-96, (81.0% identity in 284 aa overlap). Also similar to C-terminus of Q9FBI6|SCP8.36 HYPOTHETICAL PROTEIN from Streptomyces coelicolor (559 aa), FASTA scores: opt: 426, E(): 8.4e-19, (37.35% identity in 281 aa overlap); and similar to other hypothetical proteins (generally membrane proteins) e.g. Q9K456|SC2H12.28C PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (314 aa), FASTA scores: opt: 341, E(): 8.8e-14, (29.75% identity in 296 aa overlap). Contains neutral zinc metallopeptidases, zinc-binding region signature (PS00142)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217723.1" /db_xref="GI:15610343" /db_xref="GOA:O05859" /db_xref="UniProtKB/TrEMBL:O05859" /db_xref="GeneID:888874" /translation="MSTYGWRAYALPVLMVLTTVVVYQTVTGTSTPRPAAAQTVRDSP AIGVVGTAILDAPPRGLAVFDANLPAGTLPDGGPFTEAGDKTWRVVPGTTPQVGQGTV KVFRYTVEIENGLDPTMYGGDNAFAQMVDQTLTNPKGWTHNPQFAFVRIDSGKPDFRI SLVSPTTVRGGCGYEFRLETSCYNPSFGGMDRQSRVFINEARWVRGAVPFEGDVGSYR QYVINHEVGHAIGYLRHEPCDQQGGLAPVMMQQTFSTSNDDAAKFDPDFVKADGKTCR FNPWPYPIP" misc_feature complement(3583969..3583998) /locus_tag="Rv3207c" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene 3585004..3585690 /locus_tag="Rv3208" /db_xref="GeneID:887905" CDS 3585004..3585690 /locus_tag="Rv3208" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3208, (MTCY07D11.18c), len: 228 aa. Probable transcriptional regulator, tetR family, equivalent to Q9CCH0|ML0815 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (228 aa), FASTA scores: opt: 1248, E(): 1.4e-74, (82.4% identity in 227 aa overlap). Also highly similar to Q9FBI8|SCP8.33c PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (213 aa), FASTA scores: opt: 629, E(): 4e-34, (45.8% identity in 203 aa overlap); Q9KIL9|F58R F58R (FRAGMENT) from Streptomyces coelicolor A3(2) (149 aa), FASTA scores: opt: 497, E(): 1.3e-25, (50.35% identity in 147 aa overlap); Q9K3T5|SCE66.08 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (225 aa), FASTA scores: opt: 344, E(): 1.8e-15, (31.15% identity in 212 aa overlap); Q9RYK4|DRA0308 TRANSCRIPTIONAL REGULATOR, TETR FAMILY from Deinococcus radiodurans (239 aa), FASTA scores: opt: 290, E(): 6.5e-12, (30.5% identity in 223 aa overlap); etc. And also similar to Mycobacterium tuberculosis proteins P96381|Rv1019|MTCY10G2.30c HYPOTHETICAL 21.7 KDA PROTEIN (197 aa), FASTA scores: opt: 356, E(): 2.7e-16, (34.4% identity in 189 aa overlap); MTV034_4; MTY07A7A_3; MTV032_1; MTCY07A7_12; etc. Contains probable helix-turn-helix motif at aa 60-81 (Score 1517, +4.35 SD). SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217724.1" /db_xref="GI:15610344" /db_xref="GOA:O05858" /db_xref="UniProtKB/TrEMBL:O05858" /db_xref="GeneID:887905" /translation="MSDLAKTAQRRALRSSGSARPDEDVPAPNRRGNRLPRDERRGQL LVVASDVFVDRGYHAAGMDEIADRAGVSKPVLYQHFSSKLELYLAVLHRHVENLVSGV HQALSTTTDNRQRLHVAVQAFFDFIEHDSQGYRLIFENDFVTEPEVAAQVRVATESCI DAVFALISADSGLDPHRARMIAVGLVGMSVDCARYWLDADKPISKSDAVEGTVQFAWG GLSHVPLTRS" gene complement(3585677..3585949) /gene="TB9.4" /locus_tag="Rv3208A" /db_xref="GeneID:3205107" CDS complement(3585677..3585949) /gene="TB9.4" /locus_tag="Rv3208A" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3208A, len: 90 aa. TB9.4, conserved hypothetical protein (see citations below), equivalent to Q9CCH1|ML0814 HYPOTHETICAL PROTEIN from Mycobacterium leprae (82 aa), FASTA scores: opt: 411, E(): 1.8e-22, (81.0% identity in 79 aa overlap). Also similar, but shorter in N-terminus, to Q9FBI9|SCP8.32c PUTATIVE ATP-BINDING PROTEIN from Streptomyces coelicolor (94 aa), FASTA scores: opt: 246, E(): 8.1e-11, (53.4% identity in 73 aa overlap); Q9DGP6 (alias Q9DGP4) GLUTAMATE DECARBOXYLASE 67 KDA ISOFORM (FRAGMENT) from Alepocephalus bairdii (182 aa), FASTA scores: opt: 100, E(): 2.6, (35.3% identity in 85 aa overlap). Corresponds to Statens Serum Institute antigen, CYP10 TB9.4. Has N-terminal sequence, VEVKIGITDSPRELV." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177943.1" /db_xref="GI:57117073" /db_xref="UniProtKB/TrEMBL:Q6MWZ8" /db_xref="GeneID:3205107" /translation="MEVKIGITDSPRELVFSSAQTPSEVEELVSNALRDDSGLLTLTD ERGRRFLIHTARIAYVEIGVADARRVGFGVGVDAAAGSAGKVATSG" gene 3586274..3586834 /locus_tag="Rv3209" /db_xref="GeneID:888875" CDS 3586274..3586834 /locus_tag="Rv3209" /function="UNKNOWN" /note="Rv3209, (MTCY07D11.17c), len: 186 aa. Conserved hypothetical thr-, pro-rich protein, equivalent (but shorter 36 aa in N-terminus) to Q9CCH2|ML0813 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (195 aa), FASTA scores: opt: 508, E(): 1.4e-15, (58.4% identity in 185 aa overlap). Also some similarity with Q10390|MMS3_MYCTU|MMPS3|Rv2198c|MT2254|MTCY190.09c PROBABLE CONSERVED TRANSMEMBRANE TRANSPORT PROTEIN from M. tuberculosis (299 aa), FASTA scores: opt: 339, E(): 3.7e-08, (35.0% identity in 180 aa overlap); and Q9CCE9|MMPS3|ML0877 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (293 aa), FASTA scores: opt: 272, E(): 2.8e-05, (36.4% identity in 173 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217725.1" /db_xref="GI:15610345" /db_xref="UniProtKB/TrEMBL:O05857" /db_xref="GeneID:888875" /translation="MALGAVATAVIINSGDSTSTKAIVGAPAPRTVISTSPRPTAPTS TSPHPSPSTLRPQLPPETVTTVAPPGTGPTTVPTRTPTAAPPQTAVPPPAPLNPRTVV YRVTGTKQLFDLVNVVYTDARGFPVTDFNVSLPWTKMVVLNPGVQTESVVATSLYSRL NCSIVNTGAQTVVASTNNAIIATCTR" gene complement(3586844..3587539) /locus_tag="Rv3210c" /db_xref="GeneID:888876" CDS complement(3586844..3587539) /locus_tag="Rv3210c" /function="UNKNOWN" /note="Rv3210c, (MTCY07D11.16), len: 231 aa. Conserved hypothetical protein, similar (but N-terminus shorter) to Q9FBJ1|SCP8.30 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (260 aa), FASTA scores: opt: 599, E(): 1.1e-30, (42.5% identity in 233 aa overlap); and some similarity to Q9RRV1|DR2384 PHENYLACETIC ACID DEGRADATION PROTEIN PAAC from Deinococcus radiodurans (263 aa), FASTA scores: opt: 129, E(): 0.43, (27.9% identity in 172 aa overlap); and Q9F621 FLGK PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (472 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217726.1" /db_xref="GI:15610346" /db_xref="GOA:O05856" /db_xref="UniProtKB/TrEMBL:O05856" /db_xref="GeneID:888876" /translation="MPSPSSADQVADSPRPRLPADHPGVNELFALLAYGEVAAFYRLT DEARMAPDLRGRISMASMAAAEMGHYELLRNALERRGVDVVSAMSKYTSALENYHRLT TPSTWLEALVKTYVADALAADLYLEIADGLPDEVADVVRAALSETGHSQFVVAEVRAA VTASGKQRSRLALWSRRLLGEAITQAQLVLADHDELVDLVVSGSGGLSQLGAFFDRLQ QTHDQRMRELGLS" gene 3587798..3589381 /gene="rhlE" /locus_tag="Rv3211" /db_xref="GeneID:888840" CDS 3587798..3589381 /gene="rhlE" /locus_tag="Rv3211" /function="HAS A HELIX-DESTABILIZING ACTIVITY" /note="Rv3211, (MTCY07D11.15c), len: 527 aa. Probable rhlE, ATP-dependent RNA helicase, equivalent (but shorter 22 aa) to Q9CCH3|RHLE|ML0811 PUTATIVE ATP-DEPENDENT RNA HELICASE from Mycobacterium leprae (544 aa), FASTA scores: opt: 2497, E(): 8.7e-131, (74.75% identity in 531 aa overlap). Also highly similar to other RNA helicases e.g. Q9FBJ2|SCP8.29c from Streptomyces coelicolor (879 aa), FASTA scores: opt: 1458, E(): 3.6e-73, (52.5% identity in 522 aa overlap); Q9DF36 from Xenopus laevis (African clawed frog) (800 aa), FASTA scores: opt: 792, E(): 2.3e-36, (37.15% identity in 385 aa overlap); Q99Z38|DEAD|SPY1415 from Streptococcus pyogenes (759 aa), FASTA scores: opt: 779, E(): 1.1e-35, (37.1% identity in 380 aa overlap); P33906|DEAD|CSDA from Klebsiella pneumoniae (642 aa), FASTA scores: opt: 768, E(): 4e-35, (43.4% identity in 387 aa overlap); etc. Contains ATP/GTP-binding site motif A (PS00017) and DEAD-box subfamily ATP-dependent helicases signature (PS00039). SIMILAR TO DEAD/DEAH BOX HELICASE FAMILY AND SIMILAR TO HELICASE C-TERMINAL DOMAIN." /codon_start=1 /transl_table=11 /product="ATP-dependent RNA helicase RhlE" /protein_id="NP_217727.1" /db_xref="GI:15610347" /db_xref="GOA:O05855" /db_xref="UniProtKB/TrEMBL:O05855" /db_xref="GeneID:888840" /translation="MTAVKHTTESTFAKLGVRDEIVRALGEEGIKRPFAIQELTLPLA LDGEDVIGQARTGMGKTFAFGVPLLQRITSGDGTRPLTGAPRALVVVPTRELCLQVTD DLATAGKYLTAGPDTDDAAAVRRRLSVVSIYGGRPYEPQIEALRAGADVVVGTPGRLL DLCQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQIPADRQSMLFSATMPDPII TLARTFMVRPTHIRAEAPHSSAVHDATEQFVYRAHALDKVELVSRVLQARDRGATMIF TRTKRTAQKVADELTERGFAVGAVHGDLGQLAREKALKAFRTGGIDVLVATDVAARGI DIDDVTHVINYQCPEDEKMYVHRIGRTGRAGRTGVAVTLVDWDELPRWSMIDQALGLG SPDPAETYSNSPHLYAELAIPATAGGTVGPARKSQGRRRDTDCDGQKTAQHARNTPRR RRTRGGKPVTGHPGTNPISSPIVGGDATSEPGSGTASDSGSDVVSGSRSGNGEAARRR RRRRRRPTHAQDGFAARAN" misc_feature 3587957..3587980 /gene="rhlE" /locus_tag="Rv3211" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 3588326..3588352 /gene="rhlE" /locus_tag="Rv3211" /note="PS00039 DEAD-box subfamily ATP-dependent helicases signature" gene 3589394..3590617 /locus_tag="Rv3212" /db_xref="GeneID:887931" CDS 3589394..3590617 /locus_tag="Rv3212" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3212, (MTCY07D11.14c), len: 407 aa. Hypothetical ala-, val-rich protein, equivalent to Q9CCH4|ML0810 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (407 aa), FASTA scores: opt: 2158, E(): 5.3e-119, (79.85% identity in 407 aa overlap). Weak similarity to several eukaryotic transcription factors e.g. P08393|ICP0_HSV11|ICP0|IE110 TRANS-ACTING TRANSCRIPTIONAL PROTEIN from Herpes simplex virus (type 1 / strain 17) (775 aa), FASTA scores: opt: 115, E(): 2, (26.9% identity in 334 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217728.1" /db_xref="GI:15610348" /db_xref="UniProtKB/TrEMBL:O05854" /db_xref="GeneID:887931" /translation="MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAA VAVPTPAPAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESLWS YARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYADPRVRLFSDG TTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQSGCTLESAAASSAAVSVLE ACTNQADLRLVLLRPGKEDDEPIQRIVPEPGVRPGSGARVLVVSQNNTAVYLPARSGA QPRVDVIDETGATVSSTLLAKPPSTSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTI AAGETTAPVGPGVMMAGQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSG SRVIEQRGDTLVALG" gene complement(3590692..3591492) /locus_tag="Rv3213c" /db_xref="GeneID:888896" CDS complement(3590692..3591492) /locus_tag="Rv3213c" /function="UNKNOWN, BUT POSSIBLY INVOLVED IN REGULATION OF PARTITIONING." /experiment="experimental evidence, no additional details recorded" /note="Rv3213c, (MTCY07D11.13), len: 266 aa. Possible soj/parA-related protein, very similar in particular to Soj/ParA proteins (and relatives) from Bacillus subtilis that inhibit the initiation of sporulation by preventing phosphorylation of Spo0A (see Quisel & Grossman 2000) e.g. Q9S228|SCI51.12c from Streptomyces coelicolor (340 aa), FASTA scores: opt: 746, E(): 1.6e-40, (48.2% identity in 249 aa overlap); Q9HT11|SOJ|PA5563 from Pseudomonas aeruginosa (262 aa), FASTA scores: opt: 649, E(): 2.1e-34, (42.2% identity in 256 aa overlap); Q9PB62|XF2282 from Xylella fastidiosa (264 aa), FASTA scores: opt: 624, E(): 8.3e-33, (42.25% identity in 251 aa overlap); Q9K5N0|SOJ_BACHD|SOJ|BH4058 from Bacillus halodurans (253 aa), FASTA scores: opt: 621, E(): 1.2e-32, (41.55% identity in 248 aa overlap); P37522|SOJ_BACSU (253 aa), FASTA scores: opt: 620, E(): 1.4e-32, (41.65% identity in 245; etc. Also similar to various mycobacterial proteins: U00021_10 from Mycobacterium leprae, MTCI125_29 from Mycobacterium tuberculosis, MLCB1351_6 from Mycobacterium leprae, MTV028_9c|Rv3918c|PARA PROBABLE CHROMOSOME PARTITIONING PROTEIN from Mycobacterium tuberculosis, MSGDNAB_18 from Mycobacterium leprae. SEEMS TO BELONG TO THE PARA FAMILY." /codon_start=1 /transl_table=11 /product="SOJ/PARA-like protein" /protein_id="NP_217729.1" /db_xref="GI:15610349" /db_xref="GOA:O05853" /db_xref="UniProtKB/TrEMBL:O05853" /db_xref="GeneID:888896" /translation="MTDTRVLAVANQKGGVAKTTTVASLGAAMVEKGRRVLLVDLDPQ GCLTFSLGQDPDKLPVSVHEVLLGEVEPNAVLVTTMEGMTLLPANIDLAGAEAMLLMR AGREYALKRALAKFSDRFDVVIIDCPPSLGVLTLNGLTAADKAIVPLQCEMLAHRGVG QFLRTVADVQQITNPNLRLLGALPTLYDSRTTHTRDVLLDVADRYDLQVLAPPIPRTV RFAEASASGSSVMAGRKNKGAVAYRELAQALLKHWKTGRPLPTFTVDL" repeat_region complement(3591493..3591569) /note="77 bp Mycobacterial Interspersed Repetitive Unit, Class I" gene 3591646..3592257 /gene="gpm2" /locus_tag="Rv3214" /db_xref="GeneID:888830" CDS 3591646..3592257 /gene="gpm2" /locus_tag="Rv3214" /EC_number="5.4.2.1" /function="INVOLVED IN GLYCOLYSIS [CATALYTIC ACTIVITY: 1,3-DIPHOSPHOGLYCERATE + 3-PHOSPHOGLYCERATE = 2,3-DIPHOSPHOGLYCERATE + 3-PHOSPHOGLYCERATE]." /note="forms a homodimer in Mycobacterium tuberculosis; belongs to the dPGM superfamily" /codon_start=1 /transl_table=11 /product="acid phosphatase" /protein_id="YP_177944.1" /db_xref="GI:57117074" /db_xref="GOA:Q6MWZ7" /db_xref="UniProtKB/TrEMBL:Q6MWZ7" /db_xref="GeneID:888830" /translation="MGVRNHRLLLLRHGETAWSTLGRHTGGTEVELTDTGRTQAELAG QLLGELELDDPIVICSPRRRTLDTAKLAGLTVNEVTGLLAEWDYGSYEGLTTPQIRES EPDWLVWTHGCPAGESVAQVNDRADSAVALALEHMSSRDVLFVSHGHFSRAVITRWVQ LPLAEGSRFAMPTASIGICGFEHGVRQLAVLGLTGHPQPIAAG" gene 3592254..3593372 /gene="entC" /locus_tag="Rv3215" /db_xref="GeneID:888824" CDS 3592254..3593372 /gene="entC" /locus_tag="Rv3215" /EC_number="5.4.4.2" /function="COULD BE INVOLVED IN ENTEROBACTIN BIOSYNTHESIS. ENTEROBACTIN IS AN IRON-CHELATING COMPOUND INVOLVED IN TRANSPORTING IRON FROM THE BACTERIAL ENVIRONMENT INTO THE CELL CYTOPLASM. COULD BE ALSO INVOLVED IN 2,3-DIHYDROXYBENZOATE OR ENTEROCHELIN OR MENAQUINONE BIOSYNTHESIS [CATALYTIC ACTIVITY: CHORISMATE = ISOCHORISMATE]." /note="synthesizes isochorismate acid from chorismate" /codon_start=1 /transl_table=11 /product="isochorismate synthase" /protein_id="NP_217731.1" /db_xref="GI:15610351" /db_xref="GOA:O05851" /db_xref="UniProtKB/TrEMBL:O05851" /db_xref="GeneID:888824" /translation="MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRS GTAPILLGALPFDVSRPAALMVPDGVLRARKLPDWPTGPLPKVRVAAALPPPADYLTR IGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTAYGYLVDLTSA GNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADPKLDAANAAALASSAKNRH EHQLVVDTMRVALEPLCEDLTIPAQPQLNRTAAVWHLCTAITGRLRNISTTAIDLALA LHPTPAVGGVPTKAATELIAELEGDRGFYAGAVGWCDGRGDGHWVVSIRCAQLSADRR AALAHAGGGIVAESDPDDELEETTTKFATILTALGVEQ" gene 3593520..3593852 /locus_tag="Rv3216" /db_xref="GeneID:888845" CDS 3593520..3593852 /locus_tag="Rv3216" /EC_number="2.3.1.-" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /note="Rv3216, (MTCY07D11.10c), len: 110 aa. Possible acetyltransferase (2.3.1.-), similar but shorter to many e.g. Q9AB32|CC0402 ACETYLTRANSFERASE (GNAT FAMILY) from Caulobacter crescentus (159 aa), FASTA scores: opt: 325, E(): 3.8e-17, (45.65% identity in 103 aa overlap); P79081|ATS1 PUTATIVE ACETYLTRANSFERASE ATS1 from Schizosaccharomyces pombe (Fission yeast) (168 aa), FASTA scores: opt: 313, E(): 3.1e-16, (47.6% identity in 105 aa overlap); Q9I640|PA0478 PROBABLE N-ACETYLTRANSFERASE from Pseudomonas aeruginosa (158 aa), FASTA scores: opt: 308, E(): 6.9e-16, (50.0% identity in 98 aa overlap); Q9KHE3 PUTATIVE ACETYLTRANSFERASE from Anabaena sp. strain PCC 7120 (164 aa), FASTA scores: opt: 269, E(): 5.4e-13, (41.75% identity in 103 aa overlap); etc. Also some similarity to diamine acetyltransferases (EC 2.3.1.57) e.g. Q28999|ATDA_PIG|SAT from Sus scrofa (Pig) (171 aa), FASTA scores: opt: 152, E(): 0.00025, (23.15% identity in 108 aa overlap)." /codon_start=1 /transl_table=11 /product="acetyltransferase" /protein_id="NP_217732.1" /db_xref="GI:15610352" /db_xref="GOA:O05850" /db_xref="UniProtKB/TrEMBL:O05850" /db_xref="GeneID:888845" /translation="MRGHVAEVNGGVAAMALWFLNFSTWDGVAGIYVEDLFVWPRFRR RGLARGLLSTLARECVDNRYTRLAWSVLNWNSDAIALYDRIGGQPQHEWTIYRLSGPR LAALAAPR" gene complement(3593804..3594235) /locus_tag="Rv3217c" /db_xref="GeneID:888843" CDS complement(3593804..3594235) /locus_tag="Rv3217c" /function="UNKNOWN" /note="Rv3217c, (MTCY07D11.09), len: 143 aa. Probable conserved integral membrane protein, equivalent (highly similar but shorter 30 aa) to Q9CCH6|ML0806 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (173 aa). Also similar to others e.g. Q9F3L9|2SC7G11.04 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (152 aa), FASTA scores: opt: 177, E(): 0.00024, (33.8% identity in 136 aa overlap). And shows similarity to O34238|MVIN|VC0680 VIRULENCE FACTOR MVIN HOMOLOG from Vibrio (525 aa), FASTA scores: opt: 126, E(): 0.97, (30.9% identity in 68 aa overlap). First GTG taken." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217733.1" /db_xref="GI:15610353" /db_xref="UniProtKB/TrEMBL:O05849" /db_xref="GeneID:888843" /translation="MPVRAPAAVRGAGLIVAVQGGAALVVAAALLVRGLAGADQHIVN GLGTAGWFVLVGGAVLAAGCRLAVGKLWGRGLAVFAQLLLLPVAWYLIVGSHQPAIGI PVGIIALGVLVLLFSPPSIRWAAGRDQRGAASAANRGPDSR" gene 3594468..3595433 /locus_tag="Rv3218" /db_xref="GeneID:888906" CDS 3594468..3595433 /locus_tag="Rv3218" /function="UNKNOWN" /note="Rv3218, (MTCY07D11.08c), len: 321 aa. Conserved hypothetical protein, similar to several hypothetical bacterial proteins e.g. Q9F3M0|2SC7G11.03c from Streptomyces coelicolor (322 aa), FASTA scores: opt: 694, E(): 4.2e-35, (39.95% identity in 328 aa overlap); Q9A0J4|SPY0752 from Streptomyces pyogenes (340 aa), FASTA scores: opt: 187, E(): 0.00033, (30.5% identity in 141 aa overlap); O31502|YERQ from Bacillus subtilis (303 aa), FASTA scores: opt: 184, E(): 0.00045, (34.15% identity in 126 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217734.1" /db_xref="GI:15610354" /db_xref="GOA:O05848" /db_xref="UniProtKB/TrEMBL:O05848" /db_xref="GeneID:888906" /translation="MRAVLIVNPTATATTPAGRDLLAHALESRLQLTVEHTNHRGHGT ELGQAAVADGVDLVVVHGGDGTVSAVVNGMLGRPGTTPVRPVPAVAVVPGGSANVLAR ALGISADPIAATNQLIQLLDDYGRHQQWRRIGLIDCGERWAVFNAGMGVDAEVVAAVE AERDKGGKVTAWRYIRAAVRAVLACTRREPALTLQLPNRDPITGVHFVFVSNSSPWTY ANNRPVWTNPDCRFESGLGVFATTSMKVVPTLRVVRQMFAKQPKFEFNHVINNDDVAC LRVTSMGPPIASQFDGDYLGVRETMTFRAVPDALAVVAPPARKRI" gene 3595713..3595967 /gene="whiB1" /locus_tag="Rv3219" /db_xref="GeneID:887980" CDS 3595713..3595967 /gene="whiB1" /locus_tag="Rv3219" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3219, (MTCY07D11.07c), len: 84 aa. Probable whiB1 (alternate gene name: whmE), WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor. Equivalent to Q9CCH7|WHIB1|ML0804 PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (84 aa), FASTA scores: opt: 580, E(): 3.5e-35, (95.25% identity in 84 aa overlap). Highly similar to several e.g. Q9X952|WBLE DEVELOPMENTAL REGULATORY PROTEIN WHIB-PARALOG from Streptomyces coelicolor (85 aa), FASTA scores: opt: 477, E(): 9.2e-28, (75.3% identity in 81 aa overlap); Q9AD55|SCP1.95 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (102 aa), FASTA scores: opt: 383, E(): 6.1e-21, (60.75% identity in 79 aa overlap); Q9K4K8|SC5F8.16c from Streptomyces coelicolor (83 aa), FASTA scores: opt: 346, E(): 2.5e-18, (54.75% identity in 84 aa overlap); etc.; whmE" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB1" /protein_id="NP_217735.1" /db_xref="GI:15610355" /db_xref="GOA:O05847" /db_xref="UniProtKB/TrEMBL:O05847" /db_xref="GeneID:887980" /translation="MDWRHKAVCRDEDPELFFPVGNSGPALAQIADAKLVCNRCPVTT ECLSWALNTGQDSGVWGGMSEDERRALKRRNARTKARTGV" gene complement(3596029..3597534) /locus_tag="Rv3220c" /db_xref="GeneID:888801" CDS complement(3596029..3597534) /locus_tag="Rv3220c" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv3220c, (MTCY07D11.06), len: 501 aa. Probable sensor (probably histidine kinase), equivalent to Q9CCH8|ML0803 PUTATIVE TWO-COMPONENT SYSTEM SENSOR KINASE from Mycobacterium leprae (500 aa). Similar to others e.g. Q9F3M1|2SC7G11.01 PUTATIVE HISTIDINE KINASE (FRAGMENT) from Streptomyces coelicolor (372 aa), FASTA scores: opt: 1038, E(): 7.4e-56, (48.95% identity in 380 aa overlap); Q9A3K5|CC3198 SENSOR HISTIDINE KINASE from Caulobacter crescentus (327 aa), FASTA scores: opt: 311, E(): 1.2e-11, (33.35% identity in 201 aa overlap) (similarity only in C-terminal part for this one); Q9A2T2|CC3474 PUTATIVE SENSOR HISTIDINE KINASE from Caulobacter crescentus (547 aa); etc. C-terminal half shows similarity to many sensor proteins, that respond to various stimuli from Methanobacterium thermoautotrophicum e.g. O26568|MTH468 SENSORY TRANSDUCTION HISTIDINE KINASE (554 aa), FASTA scores: opt: 425, E(): 2.1e-18, (34.0% identity in 244 aa overlap); O26546|MTH446 SENSORY TRANSDUCTION REGULATORY PROTEIN (583 aa), FASTA scores: opt: 380, E(): 1.2e-15, (37.15% identity in 202 aa overlap); O26913|MTH823 SENSORY TRANSDUCTION REGULATORY PROTEIN (677 aa), FASTA scores: opt: 375, E(): 2.7e-15, (35.4% identity in 195 aa overlap); etc. SEEMS SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES." /codon_start=1 /transl_table=11 /product="two component sensor kinase" /protein_id="NP_217736.1" /db_xref="GI:15610356" /db_xref="GOA:O05846" /db_xref="UniProtKB/TrEMBL:O05846" /db_xref="GeneID:888801" /translation="MSTLGDLLAEHTVLPGSAVDHLHAVVGEWQLLADLSFADYLMWV RRDDGVLVCVAQCRPNTGPTVVHTDAVGTVVAANSMPLVAATFSGGVPGREGAVGQQN SCQHDGHSVEVSPVRFGDQVVAVLTRHQPELAARRRSGHLETAYRLCATDLLRMLAEG TFPDAGDVAMSRSSPRAGDGFIRLDVDGVVSYASPNALSAYHRMGLTTELEGVNLIDA TRPLISDPFEAHEVDEHVQDLLAGDGKGMRMEVDAGGATVLLRTLPLVVAGRNVGAAI LIRDVTEVKRRDRALISKDATIREIHHRVKNNLQTVAALLRLQARRTSNAEGREALIE SVRRVSSIALVHDALSMSVDEQVNLDEVIDRILPIMNDVASVDRPIRINRVGDLGVLD SDRATALIMVITELVQNAIEHAFDPAAAEGSVTIRAERSARWLDVVVHDDGLGLPQGF SLEKSDSLGLQIVRTLVSAELDGSLGMRDARERGTDVVLRVPVGRRGRLML" gene complement(3597551..3597766) /gene="TB7.3" /locus_tag="Rv3221c" /db_xref="GeneID:888096" CDS complement(3597551..3597766) /gene="TB7.3" /locus_tag="Rv3221c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3221c, (MTCY07D11.05), len: 71 aa. TB7.3, Biotinylated protein (see citations below), equivalent (appears to have one additional residue) to Q9CCH9|ML0802|BTB7_MYCLE BIOTINYLATED PROTEIN TB7.3 HOMOLOG from Mycobacterium leprae (70 aa), FASTA scores: opt: 367, E(): 4e-18, (90.0% identity in 70 aa overlap); Q9XCD6|BTB7_MYCSM BIOTINYLATED PROTEIN TB7.3 HOMOLOG from Mycobacterium smegmatis (70 aa), FASTA scores: opt: 341, E(): 2.1e-16, (84.05% identity in 69 aa overlap). Similar to C-terminal part of various proteins e.g. Q9HPP8|ACC|VNG1532G BIOTIN CARBOXYLASE from Halobacterium sp. strain NRC-1 (610 aa), FASTA scores: opt: 212, E(): 4e-07, (50.0% identity in 68 aa overlap); Q58628|PYCB_METJA|MJ1231 PYRUVATE CARBOXYLASE SUBUNIT B from Methanococcus jannaschii (567 aa), FASTA scores: opt: 192, E(): 7.8e-06, (44.8% identity in 58 aa overlap); Q9ZAA7|GCDC GLUTACONYL-CoA DECARBOXYLASE GAMMA SUBUNIT from Acidaminococcus fermentans (145 aa), FASTA scores: opt: 184, E(): 8.9e-06, (39.4% identity in 66 aa overlap); etc." /codon_start=1 /transl_table=11 /product="putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit" /protein_id="NP_217737.1" /db_xref="GI:15610357" /db_xref="GOA:O05845" /db_xref="UniProtKB/Swiss-Prot:O05845" /db_xref="GeneID:888096" /translation="MAEDVRAEIVASVLEVVVNEGDQIDKGDVVVLLESMKMEIPVLA EAAGTVSKVAVSVGDVIQAGDLIAVIS" gene complement(3598051..3598356) /locus_tag="Rv3221A" /db_xref="GeneID:3205091" CDS complement(3598051..3598356) /locus_tag="Rv3221A" /function="BINDS SIGMA FACTOR AND INHIBITS IT. PROBABLY INVOLVED IN SURVIVAL FOLLOWING HEAT SHOCK AND OXIDATIVE STRESS." /note="Rv3221A, len: 101 aa. Possible anti-sigma factor, similar to Q9XCD7|AAD41811.1 unknown protein from Mycobacterium smegmatis, linked to sigma factor sigH (see Fernandes et al., 1999) (101 aa), FASTA scores: opt: 422, E(): 3.4e-22, (64.9% identity in 94 aa overlap); and to Q9RL96|RsrA anti-sigma factor from Streptomyces coelicolor (see Kang et al., 1999) (105 aa), FASTA scores: opt: 163, E(): 0.00016, (32.05% identity in 78 aa overlap)." /codon_start=1 /transl_table=11 /product="anti-sigma factor" /protein_id="YP_177945.1" /db_xref="GI:57117075" /db_xref="UniProtKB/TrEMBL:Q8VJ46" /db_xref="GeneID:3205091" /translation="MSENCGPTDAHADHDDSHGGMGCAEVIAEVWTLLDGECTPETRE RLRRHLEACPGCLRHYGLEERIKALIGTKCRGDRAPEGLRERLRLEIRRTTIIRGGP" gene complement(3598353..3598904) /locus_tag="Rv3222c" /db_xref="GeneID:887655" CDS complement(3598353..3598904) /locus_tag="Rv3222c" /function="UNKNOWN" /note="Rv3222c, (MTCY07D11.04), len: 183 aa. Hypothetical protein, with some similarity to Q9SZD2|F19B15.50|AT4G29020 GLYCINE-RICH PROTEIN LIKE from Arabidopsis thaliana (Mouse-ear cress) (158 aa), FASTA scores: opt: 131, E(): 0.77, (33.35% identity in 126 aa overlap); Q9S222|SCI51.18 PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (548 aa), FASTA scores: opt: 133, E(): 1.6, (36.25% identity in 149 aa overlap); etc. Also some similarity to other hypothetical Mycobacterium tuberculosis proteins e.g. O06292|Rv0341|MTCY13E10.01 (479 aa), FASTA scores: opt: 141, E(): 0.5, (31.2% identity in 170 aa overlap); AAK45760|MT1497.1 PE_PGRS FAMILY PROTEIN from strain CDC1551 (1408 aa), FASTA scores: opt: 137, E(): 2, (31.75% identity in 148 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217738.1" /db_xref="GI:15610358" /db_xref="UniProtKB/TrEMBL:O05844" /db_xref="GeneID:887655" /translation="MSSPVSSRRLANLVKESLQGSVLGGVVSDAVLPAVSDDVKPGAG EDAYRVPVVVAAGSGAVVQVGGLEVGSAAVAGEVADTVAELFVCRPTEPDVGDFVGLA GGAGDAGQAGQQFGLGVGVRGESFGARRRLALSTVGASGATAGLRKTHDGHHGCQARG ALTQRRLYIGNPSEITDTRMVHQ" gene complement(3598901..3599551) /gene="sigH" /locus_tag="Rv3223c" /db_xref="GeneID:888094" CDS complement(3598901..3599551) /gene="sigH" /locus_tag="Rv3223c" /function="ALTERNATIVE SIGMA FACTOR THAT PLAYS A ROLE IN THE OXIDATIVE-STRESS RESPONSE (REGULATION OF THIOREDOXIN RECYCLING). THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED. THIS SIGMA FACTOR IS INVOLVED IN HEAT SHOCK AND OXIDATIVE STRESS RESPONSE; IT IS BELIEVED TO CONTROL PROTEIN PROCESSING IN THE EXTRACYTOPLASMIC COMPARTMENT. REGULATES POSITIVELY DNAK AND CLPB GENES. REGULATES TRXB2, TRXC, Rv2466c AND SIGB GENES, AND PROBABLY SIG B GENE. SIGH MAY MEDIATE THE TRANSCRIPTION OF AT LEAST 31 GENES DIRECTLY AND MODULATES THE EXPRESSION OF ABOUT 150 OTHERS." /experiment="experimental evidence, no additional details recorded" /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription; this sigma factor is involved in heat shock and oxidative stress response" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor RpoE" /protein_id="NP_217739.1" /db_xref="GI:15610359" /db_xref="GOA:P66807" /db_xref="UniProtKB/Swiss-Prot:P66807" /db_xref="GeneID:888094" /translation="MADIDGVTGSAGLQPGPSEETDEELTARFERDAIPLLDQLYGGA LRMTRNPADAEDLLQETMVKAYAGFRSFRHGTNLKAWLYRILTNTYINSYRKKQRQPA EYPTEQITDWQLASNAEHSSTGLRSAEVEALEALPDTEIKEALQALPEEFRMAVYYAD VEGFPYKEIAEIMDTPIGTVMSRLHRGRRQLRGLLADVARDRGFARGEQAHEGVSS" misc_feature complement(3599354..3599392) /gene="sigH" /locus_tag="Rv3223c" /note="PS01063 Sigma-70 factors ECF subfamily signature" gene 3599851..3600699 /locus_tag="Rv3224" /db_xref="GeneID:888852" CDS 3599851..3600699 /locus_tag="Rv3224" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3224, (MTCY07D11.02c), len: 282 aa. Probable iron-regulated oxidoreductase, possible short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to BAB49551|MLL2413 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (288 aa), FASTA scores: opt: 1053, E(): 6.4e-59, (57.95% identity in 276 aa overlap); Q9AB34|CC0400 SHORT CHAIN DEHYDROGENASE FAMILY PROTEIN from Caulobacter crescentus (285 aa), FASTA scores: opt: 1051, E(): 8.5e-59, (55.9% identity in 281 aa overlap); and Q9VB10|CG5590 HYPOTHETICAL PROTEIN (SIMILAR TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY) from Drosophila melanogaster (Fruit fly) (412 aa), FASTA scores: opt: 966, E(): 2.5e-53, (52.15% identity in 278 aa overlap). Similar to various proteins (principaly oxidoreductases) e.g. Q18639|C45B11.3 HYPOTHETICAL PROTEIN (SIMILAR TO THE SDR FAMILY) from Caenorhabditis elegans (293 aa), FASTA scores: opt: 921, E(): 1.2e-50, (51.3% identity in 271 aa overlap); Q9HZV5|PA2892 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (274 aa), FASTA scores: opt: 847, E(): 5.1e-46, (49.25% identity in 274 aa overlap); Q9I6V0|PA0182 PROBABLE SHORT-CHAIN DEHYDROGENASE (SIMILAR TO THE SDR FAMILY) from Pseudomonas aeruginosa (250 aa), FASTA scores: opt: 333, E(): 8.3e-14, (29.8% identity in 245 aa overlap); Q9HY98|PA3511 PROBABLE SHORT-CHAIN DEHYDROGENASE from Pseudomonas aeruginosa (253 aa), FASTA scores: opt: 330, E(): 1.3e-13, (31.2% identity in 250 aa overlap); etc. Related proteins in Mycobacterium tuberculosis include MTCY02B10.14, MTCY369.14, and MTCY09F9.36. Has ATP/GTP-binding site motif A, (PS00017) near C-terminus. MAY BE BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_217740.1" /db_xref="GI:15610360" /db_xref="GOA:O05842" /db_xref="UniProtKB/TrEMBL:O05842" /db_xref="GeneID:888852" /translation="MSLNGKTMFISGASRGIGLAIAKRAARDGANIALIAKTAEPHPK LPGTVFTAAKELEEAGGQALPIVGDIRDPDAVASAVATTVEQFGGIDICVNNASAINL GSITEVPMKRFDLMNGIQVRGTYAVSQACIPHMKGRENPHILTLSPPILLEKKWLRPT AYMMAKYGMTLCALGIAEEMRADGIASNTLWPRTMVATAAVQNLLGGDEAMARSRKPE VYADAAYVIVNKPATEYTGKTLLCEDVLVESGVTDLSVYDCVPGATLGVDLWVEDANP PGYLPA" misc_feature 3600544..3600567 /locus_tag="Rv3224" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 3600635..3600823 /locus_tag="Rv3224A" /db_xref="GeneID:3205092" CDS 3600635..3600823 /locus_tag="Rv3224A" /function="UNKNOWN" /note="Rv3224A, len: 62 aa. Conserved hypothetical protein (possibly gene fragment), overlaps Rv3224. Similar to N-terminus of ML0799|AL583919_131 conserved hypothetical protein from Mycobacterium leprae (135 aa), FASTA scores: opt: 104, E(): 0.78, (59.37% identity in 32 aa overlap). Note that upstream ORF Rv3224B is similar to C-terminus of ML0799. There appears to be no frameshift as sequence is identical in strain CDC1551 and in Mycobacterium bovis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177946.1" /db_xref="GI:57117076" /db_xref="UniProtKB/TrEMBL:Q6MWZ5" /db_xref="GeneID:3205092" /translation="MRRSASTCGWKTPTRRGTSRPSDSKTLILELPDERAVAIVPVPS KLSLKAAGGPRGAQSGHG" gene 3600801..3601019 /locus_tag="Rv3224B" /db_xref="GeneID:3205093" CDS 3600801..3601019 /locus_tag="Rv3224B" /function="UNKNOWN" /note="Rv3224B, len: 72 aa. Conserved hypothetical protein (possibly gene fragment), similar to C-terminal part of ML0799|AL583919_131 conserved hypothetical protein from Mycobacterium leprae (135 aa), FASTA scores: opt: 229, E(): 2e-09, (60.00% identity in 70 aa overlap). Note that downstream ORF Rv3224A is similar to N-terminus of ML0799. There appears to be no frameshift as sequence is identical in strain CDC1551 and in Mycobacterium bovis." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177947.1" /db_xref="GI:57117077" /db_xref="UniProtKB/TrEMBL:Q6MWZ4" /db_xref="GeneID:3205093" /translation="MPKAAMAKPAAAEQATGYVVGGISPFGQRKRLRTVVDVSALSWD RVLRCRQTALGRHGGPAGPDHLDQRDHR" gene complement(3601016..3602440) /locus_tag="Rv3225c" /db_xref="GeneID:888804" CDS complement(3601016..3602440) /locus_tag="Rv3225c" /EC_number="2.-.-.-" /function="UNKNOWN" /note="Rv3225c, (MTCY07D11.01), len: 474 aa (start uncertain). Possible transferase (EC 2.-.-.-). C-terminal part shows some similarity to various bacterial proteins e.g. BAB49093|MLL1809 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (298 aa), FASTA scores: opt: 557, E(): 2.8e-26, (34.55% identity in 295 aa overlap); P14509|KKA8_ECOLI|APHA AMINOGLYCOSIDE 3'-PHOSPHOTRANSFERASE from Escherichia coli (271 aa), FASTA scores: opt: 194, E(): 0.00018, (27.75% identity in 227 aa overlap); Q53826|CPH CAPREOMYCIN PHOSPHOTRANSFERASE from Streptomyces capreolus (281 aa), FASTA scores: opt: 178, E(): 0.0017, (30.5% identity in 269 aa overlap); Q9CDM4|YWIA UNKNOWN PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (213 aa), FASTA scores: opt: 167, E(): 0.0061, (2705% identity in 149 aa overlap); Q9X843|SC9B1.24 PUTATIVE TRANSFERASE (FRAGMENT) from Streptomyces coelicolor (317 aa), FASTA scores: opt: 165, E(): 0.011, (26.05% identity in 280 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="NP_217742.1" /db_xref="GI:15610361" /db_xref="GOA:O05841" /db_xref="UniProtKB/TrEMBL:O05841" /db_xref="GeneID:888804" /translation="MRFAKLSDGLSDGIVTLSPLCLDDVDAHLAGGDERLVRWLSGMP STRASVEAYIRHCREQWVTGGPLRSFGIRTVAETIVGTIDLRFDGEGLASGQVNVAYG LYPSWRGRGLATRAVDLVCQYAAEHGATEAVIKVEPENSASARVALRAGFAFVRRICE QDGTVFDRYERVLRAKMHADEVDIDEDLVRRLLRAQFPQWADLPIAPVRSAGTDNAMY RLGEDLAVRIPRIGWAIESLRTEQQWLPRIAAHLGVASPVPVGLGSPAEGFGWPWSVC RWVAGENPSAAEFVEPNRAVEDLADFITALRATDPMGGPPAKRGAPLGEQDAEVRAAL AALDGIIDVHAATAAWESALRVPPYAGPPMWFHGDLSRFNILTAQGRLTGVIDFGLMG VGDPSVDLIIAWNLLSAPARAQFRVAVGAADDDWMRGRGRALAIALIALPYYQDTNPP LAASARYAIGEVLADFRYGARPGC" gene complement(3602564..3603322) /locus_tag="Rv3226c" /db_xref="GeneID:888220" CDS complement(3602564..3603322) /locus_tag="Rv3226c" /function="UNKNOWN" /note="Rv3226c, (MTCY20B11.01c), len: 252 aa. Conserved hypothetical protein, similar to various hypothetical bacterial proteins e.g. Q9CCI2|ML0793 PUTATIVE BACTERIOPHAGE PROTEIN from Mycobacterium leprae (252 aa), FASTA scores: opt: 1183, E(): 3.8e-68, (70.65% identity in 252 aa overlap); BAB54183|MLR7795 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (369 aa), FASTA scores: opt: 417, E(): 2.9e-19, (33.75% identity in 252 aa overlap); O64131 YOQW PROTEIN from Bacteriophage SPBc2 (224 aa), FASTA scores: opt: 413, E(): 3.4e-19, (38.5% identity in 244 aa overlap); O31916 YOQW PROTEIN from Bacillus subtilis (224 aa), FASTA scores: opt: 413, E(): 3.4e-19, (38.5% identity in 244 aa overlap); O34906 YOAM PROTEIN from Bacillus subtilis (227 aa), FASTA scores: opt: 401, E(): 2e-18, (37.7% identity in 244 aa overlap); Q9K4A5|SC7E4.11 HYPOTHETICAL 30.8 KDA PROTEIN from Streptomyces coelicolor (271 aa), FASTA scores: opt: 383, E(): 3.3e-17, (39.6% identity in 283 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217743.1" /db_xref="GI:15610362" /db_xref="UniProtKB/TrEMBL:O05872" /db_xref="GeneID:888220" /translation="MCGRFAVTTDPAQLAEKITAIDEATGCGGGKTSYNVAPTDTIAT VVSRHSEPDDEPTRRVRLMRWGLIPSWIKAGPGGAPDAKGPPLINARADKVATSPAFR SAVRSKRCLVPMDGWYEWRVDPDATPGRPNAKTPFFLHRHDGALLFTAGLWSVWKSYR SAPPLLSCTVITTDAVGELAEIHDRMPLLLAEEDWDDWLNPDAPPDPELLARPPDVRD IALRQVSTLVNNVRNNGPELLEPARSQPEQIQLL" gene 3603377..3604729 /gene="aroA" /locus_tag="Rv3227" /db_xref="GeneID:888753" CDS 3603377..3604729 /gene="aroA" /locus_tag="Rv3227" /EC_number="2.5.1.19" /function="INVOLVED IN THE BIOSYNTHESIS OF CHORISMATE WITHIN THE BIOSYNTHESIS OF AROMATIC AMINO ACIDS (THE SHIKIMATE PATHWAY). ACTS IN THE SIXTH STEP OF THIS PATHWAY. [CATALYTIC ACTIVITY: PHOSPHOENOLPYRUVATE + 3-PHOSPHOSHIKIMATE = ORTHOPHOSPHATE + O(5)-(1-CARBOXYVINYL)-3-PHOSPHOSHIKIMATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 5-O-(1-carboxyvinyl)-3-phosphoshikimate from phosphoenolpyruvate and 3-phosphoshikimate in tryptophan biosynthesis" /codon_start=1 /transl_table=11 /product="3-phosphoshikimate 1-carboxyvinyltransferase" /protein_id="NP_217744.1" /db_xref="GI:15610363" /db_xref="GOA:P22487" /db_xref="UniProtKB/Swiss-Prot:P22487" /db_xref="GeneID:888753" /translation="MKTWPAPTAPTPVRATVTVPGSKSQTNRALVLAALAAAQGRGAS TISGALRSRDTELMLDALQTLGLRVDGVGSELTVSGRIEPGPGARVDCGLAGTVLRFV PPLAALGSVPVTFDGDQQARGRPIAPLLDALRELGVAVDGTGLPFRVRGNGSLAGGTV AIDASASSQFVSGLLLSAASFTDGLTVQHTGSSLPSAPHIAMTAAMLRQAGVDIDDST PNRWQVRPGPVAARRWDIEPDLTNAVAFLSAAVVSGGTVRITGWPRVSVQPADHILAI LRQLNAVVIHADSSLEVRGPTGYDGFDVDLRAVGELTPSVAALAALASPGSVSRLSGI AHLRGHETDRLAALSTEINRLGGTCRETPDGLVITATPLRPGIWRAYADHRMAMAGAI IGLRVAGVEVDDIAATTKTLPEFPRLWAEMVGPGQGWGYPQPRSGQRARRATGQGSGG" misc_feature 3604388..3604444 /gene="aroA" /locus_tag="Rv3227" /note="PS00885 EPSP synthase signature 2" gene 3604726..3605718 /locus_tag="Rv3228" /db_xref="GeneID:888785" CDS 3604726..3605718 /locus_tag="Rv3228" /function="UNKNOWN" /note="Rv3228, (MTCY20B11.03), len: 330 aa. Conserved hypothetical protein, equivalent to Q9CCI4|ML0791 HYPOTHETICAL PROTEIN from Mycobacterium leprae (327 aa), FASTA scores: opt: 1828, E(): 1e-98, (84.0% identity in 331 aa overlap). Also similar to several hypothetical bacterial proteins e.g. Q9K4A8|SC7E4.08c from Streptomyces coelicolor (337 aa), FASTA scores: opt: 1051, E(): 1e-53, (52.65% identity in 338 aa overlap); Q9HUL3|PA4952 from Pseudomonas aeruginosa (339 aa), FASTA scores: opt: 392 ,E(): 1.4e-15, (34.85% identity in 281 aa overlap); Q9PFV1|XF0556 from Xylella fastidiosa (341 aa), FASTA scores: opt: 367, E(): 4e-14, (36.85% identity in 247 aa overlap); P45339|YJEQ_HAEIN|HI1714 from Haemophilus influenzae (346 aa), FASTA scores: opt: 355, E(): 2e-13, (31.65% identity in 281 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217745.1" /db_xref="GI:15610364" /db_xref="GOA:O05873" /db_xref="UniProtKB/TrEMBL:O05873" /db_xref="GeneID:888785" /translation="MRPGDYDESDVKVRSGRSSRPRTKTRPEHADAEAAMVVSVDRGR WGCVLGGRPDRRITAMRARELGRTPIVVGDDVDVVGDLSGRPDTLARIVRRAPRRTVL RRTADDTDPTERVVVANADQLLIVVALADPPPRTGLVDRALIAAYAGGLTPILCLTKT DLAPAEPFGKQFADLELTVTAAGVDDPLLAVADLLAGKITVLLGHSGVGKSTLVNRLV PEADRAVGEVTEIGRGRHTSTRSVALPLGDTLSGSGWVIDTPGIRSFGLAHIQPDNVL LAFSDLAEATRECPRGCGHMGPPADPECALDTLSGPAARRAAAARRLLAVLSQT" misc_feature 3605335..3605358 /locus_tag="Rv3228" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3605751..3607034) /locus_tag="Rv3229c" /db_xref="GeneID:888821" CDS complement(3605751..3607034) /locus_tag="Rv3229c" /EC_number="1.14.19.3" /function="THOUGHT TO BE INVOLVED IN LIPID METABOLISM [CATALYTIC ACTIVITY: LINOLEOYL-CoA + AH(2) + O(2) = GAMMA-LINOLENOYL-CoA + A + 2 H(2)O]" /experiment="experimental evidence, no additional details recorded" /note="Rv3229c, (MTCY20B11.04c), len: 427 aa. Possible linoleoyl-CoA desaturase (EC 1.14.99.25), showing similarity with desaturases and other proteins e.g. Q08871|DES6|SLL0262 LINOLEOYL-CoA DESATURASE from Synechocystis sp. strain PCC 6803 (359 aa), FASTA scores: opt: 319, E(): 4e-13, (25.1% identity in 295 aa overlap); Q54795|DESD DELTA 6 DESATURASE from Spirulina platensis (368 aa), FASTA scores: opt: 268, E(): 7.7e-10, (25.0% identity in 300 aa overlap); Q9ZTU8|S276 PROTEIN WITH SIMILARITY TO CYTOCHROME B5 DOMAIN from Triticum aestivum (Wheat) (469 aa), FASTA scores: opt: 240, E(): 5.9e-08, (27.05% identity in 266 aa overlap); etc. Note that previously known as desA3.; desA3" /codon_start=1 /transl_table=11 /product="linoleoyl-CoA desaturase" /protein_id="YP_177948.1" /db_xref="GI:57117078" /db_xref="GOA:Q7D5W1" /db_xref="UniProtKB/TrEMBL:Q7D5W1" /db_xref="GeneID:888821" /translation="MAITDVDVFAHLTDADIENLAAELDAIRRDVEESRGERDARYIR RTIAAQRALEVSGRLLLAGSSRRLAWWTGALTLGVAKIIENMEIGHNVMHGQWDWMND PEIHSSTWEWDMSGSSKHWRYTHNFVHHKYTNILGMDDDVGYGMLRVTRDQRWKRYNI FNVVWNTILAIGFEWGVALQHLEIGKIFKGRADREAAKTRLREFSAKAGRQVFKDYVA FPALTSLSPGATYRSTLTANVVANVIRNVWSNAVIFCGHFPDGAEKFTKTDMIGEPKG QWYLRQMLGSANFNAGPALRFMSGNLCHQIEHHLYPDLPSNRLHEISVRVREVCDRYD LPYTTGSFLVQYGKTWRTLAKLSLPDKYLRDNADDAPETRSERMFAGLGPGFAGADPV TGRRRGLKTAIAAVRGRRRSKRMAKSVTEPDDLAA" gene complement(3607112..3608254) /locus_tag="Rv3230c" /db_xref="GeneID:888748" CDS complement(3607112..3608254) /locus_tag="Rv3230c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3230c, (MTCY20B11.05c), len: 380 aa. Putative oxidoreductase (EC 1.-.-.-), with some similarity to various proteins, especially reductases e.g. Q9HUS4|PA4889 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (366 aa), FASTA scores: opt: 516, E(): 1.8e-24, (33.8% identity in 367 aa overlap); P95533|TDNB ELECTRON TRANSFER PROTEIN from Pseudomonas putida (337 aa), FASTA scores: opt: 380, E(): 4e-16, (30.7% identity in 277 aa overlap); BAB34381|ECS0958 NADH OXIDOREDUCTASE FOR THE HCP from Escherichia coli strain O157:H7 (322 aa), FASTA scores: opt: 369, E(): 1.8e-15, (28.65% identity in 328 aa overlap); Q44253|ATDA5 ANILINE DIOXYGENASE REDUCTASE COMPONENT from Acinetobacter sp. (336 aa), FASTA scores: opt: 305, E(): 1.6e-11, (27.4% identity in 303 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217747.1" /db_xref="GI:15610366" /db_xref="GOA:O05875" /db_xref="UniProtKB/TrEMBL:O05875" /db_xref="GeneID:888748" /translation="MSKKHTTLNASIIDTRRPTVAGADRHPGWHALRKIAARITTPLL PDDYLHLANPLWSARELRGRILGVRRETEDSATLFIKPGWGFSFDYQPGQYIGIGLLV DGRWRWRSYSLTSSPAASGSARMVTVTVKAMPEGFLSTHLVAGVKPGTIVRLAAPQGN FVLPDPAPPLILFLTAGSGITPVMSMLRTLVRRNQITDVVHLHSAPTAADVMFGAELA ALAADHPGYRLSVRETRAQGRLDLTRIGQQVPDWRERQTWACGPEGVLNQADKVWSSA GASDRLHLERFAVSKTAPAGAGGTVTFARSGKSVAADAATSLMDAGEGAGVQLPFGCR MGICQSCVVDLVEGHVRDLRTGQRHEPGTRVQTCVSAASGDCVLDI" gene complement(3608364..3608873) /locus_tag="Rv3231c" /db_xref="GeneID:888749" CDS complement(3608364..3608873) /locus_tag="Rv3231c" /function="UNKNOWN" /note="Rv3231c, (MTCY20B11.06c), len: 169 aa. Hypothetical protein, similar to Q9KYX9|SCE33.03c HYPOTHETICAL 17.4 KDA PROTEIN from Streptomyces coelicolor (167 aa), FASTA scores: opt: 415, E(): 6.6e-19, (49.1% identity in 171 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217748.1" /db_xref="GI:15610367" /db_xref="UniProtKB/TrEMBL:O05876" /db_xref="GeneID:888749" /translation="MTQVYIPATLAMLQRLVADGALWPVNGTAFAVTPTLRESYAEGD DEELAEVALREAALASLRLLAADIGATADALPPRRAVLAAEVDDATYRPDLDDAVVRL AGPITIDQVVAAYVDNAGAEPAVMAAIAVIDAADLGDEDAELVVGDAQDHDLAWYANQ ELPFLLDLL" gene complement(3608870..3609757) /gene="pvdS" /locus_tag="Rv3232c" /db_xref="GeneID:888760" CDS complement(3608870..3609757) /gene="pvdS" /locus_tag="Rv3232c" /function="POSSIBLY INVOLVED IN TRANSCRIPTIONAL MECHANISM (PROBABLY SIGMA FACTOR PROMOTING ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES)." /note="Rv3232c, (MTCY20B11.07c), len: 295 aa (start uncertain). Possible pvdS, an alternative RNA polymerase sigma factor, highly similar (but N-terminus longer 25-50 residues approximatively) to Q9RIZ9|SCJ1.15 PUTATIVE REGULATOR from Streptomyces coelicolor (267 aa), FASTA scores: opt: 1189, E(): 1.4e-70, (65.65% identity in 262 aa overlap); Q9KU02|VC0728 HYPOTHETICAL PROTEIN from Vibrio cholerae (258 aa), FASTA scores: opt: 1074, E(): 4.5e-63, (62.6% identity in 254 aa overlap); P72119|PVDS PAO SUBSTRAIN OT684 PYOVERDINE GENE TRANSCRIPTIONAL REGULATOR PVDS (FRAGMENT) from Pseudomonas aeruginosa (see citations below) (237 aa), FASTA scores: opt: 988, E(): 1.8e-57, (60.8% identity in 227 aa overlap). Also highly similar to Q9I154|PA2428 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (304 aa), FASTA scores: opt: 1057, E(): 6.8e-62, (60.7% identity in 252 aa overlap); Q9I6Z1|PA0141 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (298 aa), FASTA scores: opt: 990, E(): 1.6e-57, (54.6% identity in 249 aa overlap); and other hypothetical bacterial proteins. Could be a member of a subfamily of RNA polymerase sigma factors which direct the synthesis of extracellular products by bacteria." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein PvdS" /protein_id="NP_217749.1" /db_xref="GI:15610368" /db_xref="UniProtKB/TrEMBL:O05877" /db_xref="GeneID:888760" /translation="MDIPSVDVSTATNDGASSRAKGHRSAAPGRRKISDAVYQAELFR LQTEFVKLQEWARHSGARLVVIFEGRDGAGKGGAIKRITEYLNPRVARIAALPAPTDR ERGQWYYQRYIAHLPAKGEIVLFDRSWYNRAGVEKVMGFCTPQEYVLFLRQTPIFEQM LIDDGILLRKYWFSVSDAEQLRRFKARRNDPVRQWKLSPMDLESVYRWEDYSRAKDEM MVHTDTPVSPWYVVESDIKKHARLNMMAHLLSTIDYADVEKPKVKLPPRPLVSGNYRR PPRELSTYVDDYVATLIAR" gene complement(3609781..3610371) /locus_tag="Rv3233c" /db_xref="GeneID:888773" CDS complement(3609781..3610371) /locus_tag="Rv3233c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3233c, (MTCY20B11.08c), len: 196 aa. Hypothetical protein, similar to C-terminus of Q9RIU8|SCM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 308, E(): 1.2e-12, (32.0% identity in 200 aa overlap); and several hypothetical M. tuberculosis proteins e.g. O06343|YY80_MYCTU|Rv3480c|MTCY13E12.33c (497 aa), FASTA scores: opt: 248, E(): 9.8e-09, (27.5% identity in 200 aa overlap); MTCY28_26; MTCY493_29; MTCY31_25; MTCY31_25." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217750.1" /db_xref="GI:15610369" /db_xref="UniProtKB/TrEMBL:O05878" /db_xref="GeneID:888773" /translation="MIAGALGNWLMSRGEAVAPTATVRAMAPLSVYADDQLDSTGPGQ AISQVTPFLVDLPVGEGNAVVRLSQIAHATESNPTAASLVDARTIVTLSGLAPATLHA MGVRVATSFSARLFNLLITNAPGTQSQMYIAGTKLLETYSVPPLLHNQALAISVTSYN GMLYFGINADRDAMSDVDLLPGLLSQALDELLEASR" gene complement(3610374..3611189) /locus_tag="Rv3234c" /db_xref="GeneID:888767" CDS complement(3610374..3611189) /locus_tag="Rv3234c" /function="UNKNOWN" /note="Rv3234c, (MTCY20B11.09c), len: 271 aa. Hypothetical protein, similar to C-terminus of Mycobacterium tuberculosis hypothetical proteins e.g. P71694|Rv1425|MTCY21B4.43|MTCY493.29c (459 aa), FASTA scores: opt: 498, E(): 5.2e-24, (36.8% identity in 261 aa overlap); MTCY03A2.28; MTCY31.23; MTCY493_29; MTCY28_26; MTV013_8; MTY13E12_33; etc. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 309, E(): 4.3e-12, (33.35% identity in 189 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217751.1" /db_xref="GI:15610370" /db_xref="GOA:O05879" /db_xref="UniProtKB/Swiss-Prot:O05879" /db_xref="GeneID:888767" /translation="MVTRLSASDASFYQLENTATPMYVGLLLILRRPRAGLSYEALLE TVEQRLPQIPRYRQKVQEVKLGLARPVWIDDRDFDITYHVRRSALPSPGSDEQLHELI ARLAARPLDKSRPLWEMYLVEGLEKNRIALYTKSHQALINGVTALAIGHVIADRTRRP PAFPEDIWVPERDPGTTRLLLRAVGDWLVRPGAQLQAVGSAVAGLVTNSGQLVETGRK VLDIARTVARGTAPSSPLNATVSRNRRFTVARASLDDYRTVRARYDCDSTTWC" gene 3611300..3611941 /locus_tag="Rv3235" /db_xref="GeneID:888862" CDS 3611300..3611941 /locus_tag="Rv3235" /function="UNKNOWN" /note="Rv3235, (MTCY20B11.10), len: 213 aa. Hypothetical unknown ala-, arg-, pro-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217752.1" /db_xref="GI:15610371" /db_xref="UniProtKB/TrEMBL:O05880" /db_xref="GeneID:888862" /translation="MMASNQTAAQHSSATLQQAPRSIDDAGGCPLTISPIANSPGDTF AVTPVVEYEPPPRNIPPCGQSSHAARRPHTPQLARRQPIRPSGRAPAAVTSTAKSPRL RQAGTFADAALRRVLEVIDRRRPVGQLRPLLAPGLVDSVLAVSRTAAGHQQGAAMLRR IRLTPAGPDTADTAAEVFGTYSRGDRIHAIACRVEQRPAGNETRWLMVALHIG" gene complement(3611959..3613116) /locus_tag="Rv3236c" /db_xref="GeneID:888861" CDS complement(3611959..3613116) /locus_tag="Rv3236c" /function="PROBABLY INVOLVED IN TRANSPORT OF UNDETERMINATED SUBSTRATE (POSSIBLY CATIONS Na/H) ACROSS THE MEMBRANE. THOUGHT TO BE RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3236c, (MTCY20B11.11c), len: 385 aa. Probable conserved integral membrane transport protein, possibly cation (Na/H) transporter, equivalent to Q9CCI5|ML0782 putative transmembrane transport protein from Mycobacterium leprae (385 aa), FASTA scores: opt: 1975, E(): 2.4e-108, (81.55% identity in 385 aa overlap). Highly similar to others e.g. O69958|SC4H2.03c putative transmembrane transport protein from Streptomyces coelicolor (411 aa), FASTA scores: opt: 1226, E(): 1.6e-64, (53.5% identity in 372 aa overlap); Q9XAKO|SC66T3.13c putative transmembrane transport protein from Streptomyces coelicolor (403 aa), FASTA scores: opt: 1198, E(): 6.8e-63, (53.25% identity in 370 aa overlap); Q9RV80|DR1149 putative Na+/H+ antiporter from Deinococcus radiodurans (383 aa), FASTA scores: opt: 1069, E(): 2.3e-55, (47.35% identity in 376 aa overlap); Q9L191|SC10G8.11 putative transmembrane transport protein from Streptomyces coelicolor (446 aa), FASTA scores: opt: 695, E(): 1.9e-33, (38.05% identity in 384 aa overlap); Q9RRW8|DR2367 putative glutathione-regulated potassium-efflux system protein KEFB from Deinococcus radiodurans (575 aa), FASTA scores: opt: 414, E(): 6.2e-17, (30.25% identity in 380 aa overlap); etc. SEEMS TO BELONG TO THE CPA2 FAMILY. Note that previously known as kefB.; kefB" /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="YP_177949.1" /db_xref="GI:57117079" /db_xref="UniProtKB/TrEMBL:Q7D5V5" /db_xref="GeneID:888861" /translation="MEVSRALLFELGVLLAVLAVLGAVARRFALSPIPVYLLAGLSLG NGGILGVAAAGEFIATGAPIGVVLLLLALGLEFSATEFASSLRHHLPSAGVDIVLNAT PGAVAGWLLGLDGVAILGLAGVTYISSSGVIARLLEDLRRLGNRETPAVLSVLVLEDF AMAAYLPLFAVLATDGSWLEAVVGMTVAIAALLGAFAASYRWGHHVGRLVTHPDSEQL LLRVLGITLIVAAVAESLHASAAVGAFLVGLTLTGETADRARMVLTPLRDLFATIFFL GIGLSVDPGKLVSMLPVALALAAVTAATKVATGMFAARREGVARRGQLRAGTALVARG EFSLIIIGLAGASIPGVAALATAYVFVMAIVGPILARYTGGGLPAAAVASN" gene complement(3613121..3613603) /locus_tag="Rv3237c" /db_xref="GeneID:888803" CDS complement(3613121..3613603) /locus_tag="Rv3237c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3237c, (MTCY20B11.12c), len: 160 aa. Conserved hypothetical protein, equivalent to Q9CCI6|ML0781 HYPOTHETICAL PROTEIN from Mycobacterium leprae (160 aa), FASTA scores: opt: 828, E(): 1.5e-45, (80.6% identity in 160 aa overlap); and similar to other hypothetical bacterial proteins and more weakly to putative potassium channels e.g. Q9RV81|DR1148 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (175 aa), FASTA scores: opt: 420, E(): 9.5e-20, (37.95% identity in 158 aa overlap); O69959|SC4H2.04c HYPOTHETICAL 17.1 KDA PROTEIN from Streptomyces coelicolor (161 aa), FASTA scores: opt: 315, E(): 3.8e-13, (40.0% identity in 150 aa overlap); Q9HNH3|PCHB|VNG2104G POTASSIUM CHANNEL HOMOLOG from Halobacterium sp. strain NRC-1 (418 aa), FASTA scores: opt: 158, E(): 0.007, (31.45% identity in 124 aa overlap); Q58752|YD57_METJA|MJ1357 PUTATIVE POTASSIUM CHANNEL PROTEIN from Methanococcus jannaschii (343 aa), FASTA scores: opt: 143, E(): 0.053, (33.8% identity in 68 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217754.1" /db_xref="GI:15610373" /db_xref="GOA:O05882" /db_xref="UniProtKB/TrEMBL:O05882" /db_xref="GeneID:888803" /translation="MDVKEVLLPGVGLRYEFTSYRGDRIGIVARRSGGFDVVLYGRDD PDEARPVLRLTDEEAEAVAQILGAPRIAERFTELTREVPGLKAGQIHIRAGSLFVDRP LGDTRARTRTGASIVAIVRDEDVLASPGPTDVLRAGDVLIVIGTEDGIAGVEQIVEKG" gene complement(3613664..3614398) /locus_tag="Rv3238c" /db_xref="GeneID:888858" CDS complement(3613664..3614398) /locus_tag="Rv3238c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3238c, (MTCY20B11.13c), len: 244 aa. Probable conserved integral membrane protein, similar to several hypothetical proteins and transmembrane proteins e.g. Q9UN92|NRM29 MULTISPANNING NUCLEAR ENVELOPE MEMBRANE PROTEIN NURIM (FRAGMENT) from Homo sapiens (Human) (261 aa), FASTA scores: opt: 281, E(): 3.3e-11, (30.7% identity in 189 aa overlap); Q9VEG9|CG7655 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (253 aa), FASTA scores: opt: 242, E(): 1.1e-08, (27.7% identity in 242 aa overlap); BAB48937|MLR1600 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (222 aa), FASTA scores: opt: 137, E(): 0.066, (28.1% identity in 185 aa overlap); BAB57936|SAV1774 AESENICAL PUMP MEMBRANE PROTEIN HOMOLOG from Staphylococcus aureus subsp. aureus Mu50 (430 aa), FASTA scores: opt: 125, E(): 0.68, (25.7% identity in 144 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217755.1" /db_xref="GI:15610374" /db_xref="UniProtKB/TrEMBL:O05883" /db_xref="GeneID:888858" /translation="MKRYLTIIYGAASYLVFLVAFGYAIGFVGDVVVPRTVDHAIAAP IGQAVVVNLVLLGVFAVQHSVMARQGFKRWWTRFVPPSIERSTYVLLASVALLLLYWQ WRTMPAVIWDVRQPAGRVALWALFWLGWATVLTSTFMINHFELFGLRQVYLAWRGKPY TEIGFQAHLLYRWVRHPIMLGFVVAFWATPMMTAGHLLFAIGATGYILVALQFEERDL LAALGDQYRDYRREVSMLLPWPHRHT" gene complement(3614457..3617603) /locus_tag="Rv3239c" /db_xref="GeneID:888856" CDS complement(3614457..3617603) /locus_tag="Rv3239c" /function="UNKNOWN, BUT SEEMS INVOLVED IN EFFLUX SYSTEM (PROBABLY SUGAR OR DRUG TRANSPORT)." /note="Rv3239c, (MTCY20B11.14c), len: 1048 aa. Probable conserved transmembrane protein, organised in two domains. Domain comprising first 500 aa residues is similar to various antibiotic resistance and efflux proteins and contains sugar transport proteins signature 1 (PS00216); e.g. Q9RL22|SC5G9.04c PUTATIVE TRANSMEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (489 aa), FASTA scores: opt: 905, E(): 3.1e-41, (36.95% identity in 482 aa overlap); and O68912|FRNF PUTATIVE ANTIBIOTIC ANTIPORTER from Streptomyces roseofulvus (517 aa), FASTA scores: opt: 866, E(): 4.1e-39, (37.1% identity in 512 aa overlap). Second part, corresponding to last 550 aa residues, is very similar to Q50733|Rv2565|MTCY9C4.03c hypothetical 62.1 kDa protein from Mycobacterium tuberculosis (583 aa), FASTA scores: E(): 2.1e-28, (36.5% identity in 572 aa overlap). Also equivalent to Rv3728|MTV025.076 PUTATIVE TWO-DOMAIN MEMBRANE PROTEIN (SIMILAR TO SUGAR TRANSPORTER FAMILY) from Mycobacterium tuberculosis (1065 aa), FASTA scores: opt: 4328, E(): 0, (64.15% identity in 1046 aa overlap); and similar to other Mycobacterium tuberculosis proteins: MTCY3G12.01, E(): 6.3e-32; MTCY98.02c, E(): 6.3e-32; MTCY9C4.03c, E(): 1.5e-26; MTCY369.27c, E(): 2.5e-26. Equivalent to AAK47679 Drug transporter from Mycobacterium tuberculosis strain CDC1551 (1065 aa) but shorter 20 aa. Contains cyclic nucleotide-binding domain signature 2 (PS00889). Probably member of major facilitator superfamily (MFS)." /codon_start=1 /transl_table=11 /product="transmembrane transport protein" /protein_id="NP_217756.1" /db_xref="GI:15610375" /db_xref="GOA:O05884" /db_xref="UniProtKB/TrEMBL:O05884" /db_xref="GeneID:888856" /translation="MHISLHGGKGFANLTRRRRPSSASVLLVAGFGAFLAFLDSTIVN IAFPDIQRSFPSYDIGSLSWILNGYNIVFAAFMVAAGRLADLLGRRRTFLSGVLVFTI ASGLCAVAGSVEQLVAFRVLQGIGAAILVPASLALVVEGFDAARRAHAIGLWGAAAAI AAGLGPPIGGLLVEWAGWRWVLLVNVPLGIVAAIATKRMLVESRASGRRRMPDLRGAL LLAVTLGLVTLGLVKGPDWGWLSVATVGSFLASVLTSVGFVHSSRSHPAPLVEPALLR SRSFVAGNLLTLVAAAGFYCYGLTHVLYLNYVWHYSLLKAGFAIAPAAVVAAVVAAAL GRVAGRHGHRVIVLVGALVWAGSLVWYLQRVGSEPDFLRVWLPGQLLQGIGVGATLPV LSSAALAEVAKGGSYATSSAVVSTTRQLGAVLGVAVMVILIGKPEHGTAEEALRRGWA MAAICFIAVAVAAAVLGRTNRNPVQMPAPEPAIAPRLEPPIPQPAAAPIEHWAAGDAD PLGNLPLFAGLDAATLAQLGEHVEDVELEAGCYLFHEGDPSDSLYVIRTGRVQVLQDS IVLKELGRGEVLGELGLLIDAPRSATVRALRDTKLVRLTKAQFDEIADHGALAALVKV LATRLREAPPPATDSTSPEVVVSVIGVSGDAPVPAVAAGLLTALSARLRAVDPGRVDR DGLDRAERVADKVVLHAAVEDAGWRDFCLRVADRIVLVAGDPNPQAARLPARARGADL VLAGPAASREHRRQWEELITPRSVHVVHYRRILENVRPLAARIAGRSIGLVLGGGGAR GFAHLGVLDELERVGVTIDRFAGTSMGAVIAVFGACGMDAATADAYAYEYFIRHNPLS DYAFPVRGLVRGRRTLTLLEAAFGDRLVEELPKEFRCVSVDLLARRPVVHRRGRLVDV IGCSLRLPGIYPPQVYNGRLHVDGGVLDNLPVSTRASPDGPLIAVSIGLGGGGPGSAR QDGSPKVPGIGDTLMRTMTIGSQRGADAALSLAQVVIRPDTGAVGLLEFHQIDAAREA GRVAAREAMPHIMALLNR" misc_feature complement(3615819..3615872) /locus_tag="Rv3239c" /note="PS00889 Cyclic nucleotide-binding domain signature 2" misc_feature complement(3617316..3617366) /locus_tag="Rv3239c" /note="PS00216 Sugar transport proteins signature 1" gene complement(3617682..3620531) /gene="secA1" /locus_tag="Rv3240c" /gene_synonym="azi" /gene_synonym="div" /db_xref="GeneID:888860" CDS complement(3617682..3620531) /gene="secA1" /locus_tag="Rv3240c" /gene_synonym="azi" /gene_synonym="div" /function="INVOLVED IN PROTEIN EXPORT. INTERACTS WITH THE SECY/SECE SUBUNITS. SECA HAS A CENTRAL ROLE IN COUPLING THE HYDROLYSIS OF ATP TO THE TRANSFER OF PRE-SECRETORY PERIPLASMIC AND OUTER MEMBRANE PROTEINS ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="functions in protein export; can interact with acidic membrane phospholipids and the SecYEG protein complex; binds to preproteins; binds to ATP and undergoes a conformational change to promote membrane insertion of SecA/bound preprotein; ATP hydrolysis appears to drive release of the preprotein from SecA and deinsertion of SecA from the membrane; additional proteins SecD/F/YajC aid SecA recycling; exists in an equilibrium between monomers and dimers; may possibly form higher order oligomers; proteins in this cluster correspond SecA1; SecA2 is not essential and seems to play a role in secretion of a subset of proteins" /codon_start=1 /transl_table=11 /product="preprotein translocase subunit SecA" /protein_id="YP_177950.1" /db_xref="GI:57117080" /db_xref="GOA:O05885" /db_xref="UniProtKB/Swiss-Prot:O05885" /db_xref="GeneID:888860" /translation="MLSKLLRLGEGRMVKRLKKVADYVGTLSDDVEKLTDAELRAKTD EFKRRLADQKNPETLDDLLPEAFAVAREAAWRVLDQRPFDVQVMGAAALHLGNVAEMK TGEGKTLTCVLPAYLNALAGNGVHIVTVNDYLAKRDSEWMGRVHRFLGLQVGVILATM TPDERRVAYNADITYGTNNEFGFDYLRDNMAHSLDDLVQRGHHYAIVDEVDSILIDEA RTPLIISGPADGASNWYTEFARLAPLMEKDVHYEVDLRKRTVGVHEKGVEFVEDQLGI DNLYEAANSPLVSYLNNALKAKELFSRDKDYIVRDGEVLIVDEFTGRVLIGRRYNEGM HQAIEAKEHVEIKAENQTLATITLQNYFRLYDKLAGMTGTAQTEAAELHEIYKLGVVS IPTNMPMIREDQSDLIYKTEEAKYIAVVDDVAERYAKGQPVLIGTTSVERSEYLSRQF TKRRIPHNVLNAKYHEQEATIIAVAGRRGGVTVATNMAGRGTDIVLGGNVDFLTDQRL RERGLDPVETPEEYEAAWHSELPIVKEEASKEAKEVIEAGGLYVLGTERHESRRIDNQ LRGRSGRQGDPGESRFYLSLGDELMRRFNGAALETLLTRLNLPDDVPIEAKMVTRAIK SAQTQVEQQNFEVRKNVLKYDEVMNQQRKVIYAERRRILEGENLKDQALDMVRDVITA YVDGATGEGYAEDWDLDALWTALKTLYPVGITADSLTRKDHEFERDDLTREELLEALL KDAERAYAAREAELEEIAGEGAMRQLERNVLLNVIDRKWREHLYEMDYLKEGIGLRAM AQRDPLVEYQREGYDMFMAMLDGMKEESVGFLFNVTVEAVPAPPVAPAAEPAELAEFA AAAAAAAQQRSAVDGGARERAPSALRAKGVASESPALTYSGPAEDGSAQVQRNGGGAH KTPAGVPAGASRRERREAARRQGRGAKPPKSVKKR" gene complement(3620610..3621254) /locus_tag="Rv3241c" /db_xref="GeneID:888849" CDS complement(3620610..3621254) /locus_tag="Rv3241c" /function="UNKNOWN, BUT MAY BE INVOLVED IN TRANSDUCTION MECHANISM" /note="Rv3241c, (MTCY20B11.16c), len: 213 aa. Conserved hypothetical protein, similar to many hypothetical proteins and to some putative ribosomal proteins e.g. Q9CCI7|ML0778 HYPOTHETICAL PROTEIN from Mycobacterium leprae (229 aa), FASTA scores: opt: 1234, E(): 1.3e-72, (89.3% identity in 206 aa overlap); Q9KYX2|SCE33.11c HYPOTHETICAL 27.9 KDA PROTEIN from Streptomyces coelicolor (254 aa), FASTA scores: opt: 487, E(): 2.2e-24, (47.6% identity in 210 aa overlap); Q9FLV3 PROTEIN SIMILAR TO RIBOSOMAL PROTEIN 30S SUBUNIT from Arabidopsis thaliana (Mouse-ear cress) (365 aa), FASTA scores: opt: 264, E(): 7e-10, (26.4% identity in 212 aa overlap); P19954|RR30_SPIOL|RPS22 PLASTID-SPECIFIC 30S RIBOSOMAL PROTEIN 1, chloroplast, from Spinacia oleracea (Spinach) (302 aa), FASTA scores: opt: 261, E(): 9.3e-10, (26.15% identity in 214 aa overlap); P47995|YSEA_STACA HYPOTHETICAL PROTEIN IN SECA 5'REGION (ORF1) (FRAGMENT) (BELONGS TO THE S30AE FAMILY OF RIBOSOMAL PROTEINS) from Staphylococcus carnosus (165 aa), FASTA scores: opt: 201, E(): 4.2e-06, (33.35% identity in 147 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217758.1" /db_xref="GI:15610377" /db_xref="UniProtKB/TrEMBL:O05886" /db_xref="GeneID:888849" /translation="MDSGQVLAEPKSNAEIVFKGRNVEIPDHFRIYVSQKLARLERFD RTIYLFDVELDHERNRRQRKSCQRVEITARGRGPVVRGEACADSFYAALESAVVKLES RLRRGKDRRKVHYGDKTPVSLAEATAVVPAPENGFNTRPAEAHDHDGAVVEREPGRIV RTKEHPAKPMSVDDALYQMELVGHDFFLFYDKDTERPSVVYRRHAYDYGLIRLA" gene complement(3621570..3622211) /locus_tag="Rv3242c" /db_xref="GeneID:888757" CDS complement(3621570..3622211) /locus_tag="Rv3242c" /function="UNKNOWN" /note="Rv3242c, (MTCY20B11.17c), len: 213 aa. Conserved hypothetical protein, highly similar in N-terminus to Q9CCI9|ML0776 HYPOTHETICAL PROTEIN from Mycobacterium leprae (85 aa), FASTA scores: opt: 324, E(): 1.7e-13, (78.1% identity in 64 aa overlap). Also similar to Q9RUJ7|DR1389 PUTATIVE COMPETENCE PROTEIN COMF from Deinococcus radiodurans (219 aa), FASTA scores: opt: 223, E(): 6.3e-07, (35.8% identity in 215 aa overlap); BAB50338|MLL3453 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (240 aa), FASTA scores: opt: 218, E(): 1.4e-06, (28.5% identity in 224 aa overlap); Q9A9Y1|CC0830 COMPETENCE PROTEIN F from Caulobacter crescentus (265 aa), FASTA scores: opt: 182, E(): 0.00026, (30.15% identity in 219 aa overlap); etc. Equivalent to AAK47682 from Mycobacterium tuberculosis strain CDC1551 (241 aa) but shorter 29 aa. Contains purine/pyrimidine phosphoribosyl transferases signature (PS00103). SEEMS TO BELONG TO PURINE/PYRIMIDINE PHOSPHORIBOSYL TRANSFERASE FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217759.1" /db_xref="GI:15610378" /db_xref="GOA:O05887" /db_xref="UniProtKB/TrEMBL:O05887" /db_xref="GeneID:888757" /translation="MLDLVLPLECGGCGAPATRWCAACAAELSVAAGEPHVVSPRVDP QVPVFALGRYAGVRRQAILAMKEHGRRDLVAPLACALIVGVDHLLSWGMLENPLTMVP APTRRWAARRRGGDPVSRMARIAGATLGRHHDVTVVPALRMRALARDSVGLGASARER NITGRVLLRGQRPRNEVVLVDDIITTGATARESVRVLQAAGVRVGAVLAVAAA" misc_feature complement(3621645..3621683) /locus_tag="Rv3242c" /note="PS00103 Purine/pyrimidine phosphoribosyl transferases signature" gene complement(3622249..3623091) /locus_tag="Rv3243c" /db_xref="GeneID:888810" CDS complement(3622249..3623091) /locus_tag="Rv3243c" /function="UNKNOWN" /note="Rv3243c, (MTCY20B11.18c), len: 280 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217760.1" /db_xref="GI:15610379" /db_xref="UniProtKB/TrEMBL:O05888" /db_xref="GeneID:888810" /translation="MSPRVPRLRWDDPFRALDMLASLWSSTGMSLVSAGAAQAVAAPY RTLFTTLQQLLIGKEVTVRIGDHDVVLTVTELDSALEPQGLAVGQLGEVRVAARGISW DQHHLHSAVAVLRNVHIRPGVPPLVIAAPVELSSALPTEIFDDVLRQATPQLRGELSE SGAARLRWARRPDWGGLEVDVDVAGTTSQTTLWLRPRTVITGQRRWTLPARTPAYRVP LPELPHGLRITDVSLAADCLQLSALLPEWRTELPLRYLESVITQLSQGALSFVWPPLR SGAD" gene complement(3623159..3624910) /gene="lpqB" /locus_tag="Rv3244c" /db_xref="GeneID:888815" CDS complement(3623159..3624910) /gene="lpqB" /locus_tag="Rv3244c" /function="UNKNOWN" /note="Rv3244c, (MTCY20B11.19c), len: 583 aa. Probable lpqB, conserved lipoprotein; contains appropriately placed lipoprotein signature (PS00013). Equivalent to Q9CCJ0|LPQB|ML0775 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (589 aa), FASTA scores: opt: 3375, E(): 1.4e-186, (87.9% identity in 579 aa overlap). Also similar to various proteins (in particular transferases) e.g. Q9KYX0|SCE33.13c PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (615 aa), FASTA scores: opt: 228, E(): 1.3e-05, (25.5% identity in 624 aa overlap); O87992|BBLPS1.19c PUTATIVE GLUTAMINE AMIDOTRANSFERASE from Bordetella bronchiseptica (Alcaligenes bronchisepticus) (628 aa), FASTA scores: opt: 162, E(): 0.079, (28.05% identity in 171 aa overlap); Q9L2F4|SC7A8.01 PUTATIVE SUGAR KINASE (FRAGMENT) from Streptomyces coelicolor (434 aa), FASTA scores: opt: 143, E(): 0.72, (27.65% identity in 293 aa overlap); etc." /codon_start=1 /transl_table=11 /product="lipoprotein LpqB" /protein_id="NP_217761.1" /db_xref="GI:15610380" /db_xref="UniProtKB/TrEMBL:O05889" /db_xref="GeneID:888815" /translation="MRLTILLFLGAVLAGCASVPSTSAPQAIGTVERPVPSNLPKPSP GMDPDVLLREFLKATADPANRHLAARQFLTESASNAWDDAGSALLIDHVVFVETRSAE KVSVTMRADILGSLSDVGVFETAEGQLPDPGPIELVKTSDGWRIDRLPNGVFLDWQQF QETYKRNTLYFADPTGKTVVPDPRYVAVSDRDQLATELVSKLLAGPRPEMARTVRNLL APPLRLRGPVTRADGGKSGIGRGYGGARVDMEKLSTTDPHSRQLLAAQIIWTLARADI RGPYVINADGAPLEDRFAEGWTTSDVAATDPGVADGAAAGLHALVNGSLVAMDAQRVT PVPGAFGRMPEQTAAAVSRSGRQVASVVTLGRGAPDEAASLWVGDLGGEAVQSADGHS LSRPSWSLDDAVWVVVDTNVVLRAIQDPASGQPARIPVDSTAVASRFPGAINDLQLSR DGTRAAMVIGGQVILAGVEQTQAGQFALTYPRRLGFGLGSSVVSLSWRTGDDIVVTRT DAAHPVSYVNLDGVNSDAPSRGLQTPLTAIAANPSTVYVAGPQGVLMYSASVESRPGW ADVPGLMVPGAAPVLPG" misc_feature complement(3624863..3624895) /gene="lpqB" /locus_tag="Rv3244c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3624910..3626613) /gene="mtrB" /locus_tag="Rv3245c" /db_xref="GeneID:888719" CDS complement(3624910..3626613) /gene="mtrB" /locus_tag="Rv3245c" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv3245c, (MTCY20B11.20c), len: 567 aa. mtrB, sensor-like histidine kinase (EC 2.7.3.-) (see citations below), equivalent to Q9CCJ1|MTRB OR ML0774 PUTATIVE TWO-COMPONENT SYSTEM SENSOR KINASE from Mycobacterium leprae (562 aa), FASTA scores: opt: 3208, E(): 7.4e-173, (88.7% identity in 566 aa overlap). Also similar to others e.g. Q9KYW9|SCE33.14c PUTATIVE TWO-COMPONENT SYSTEM HISTIDINE KINASE from Streptomyces coelicolor (688 aa), FASTA scores: opt: 1355, E(): 1.1e-68, (48.95% identity in 515 aa overlap); etc. Relatives in Mycobacterium tuberculosis are: MTCY369.03, E(): 1.5e-22; MTCY20G9.16, E(): 1.9e-17. SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES." /codon_start=1 /transl_table=11 /product="two component sensory transduction histidine kinase MTRB" /protein_id="NP_217762.1" /db_xref="GI:15610381" /db_xref="GOA:Q50496" /db_xref="UniProtKB/Swiss-Prot:Q50496" /db_xref="GeneID:888719" /translation="MIFGSRRRIRGRRGRSGPMTRGLSALSRAVAVAWRRSLQLRVVA LTLGLSLAVILALGFVLTSQVTNRVLDIKVRAAIDQIERARTTVSGIVNGEETRSLDS SLQLARNTLTSKTDPASGAGLAGAFDAVLMVPGDGPRAASTAGPVDQVPNALRGFVKA GQAAYQYATVQTEGFSGPALIIGTPTLSRVANLELYLIFPLASEQATITLVRGTMATG GLVLLVLLAGIALLVSRQVVVPVRSASRIAERFAEGHLSERMPVRGEDDMARLAVSFN DMAESLSRQIAQLEEFGNLQRRFTSDVSHELRTPLTTVRMAADLIYDHSADLDPTLRR STELMVSELDRFETLLNDLLEISRHDAGVAELSVEAVDLRTTVNNALGNVGHLAEEAG IELLVDLPAEQVIAEVDARRVERILRNLIANAIDHAEHKPVRIRMAADEDTVAVTVRD YGVGLRPGEEKLVFSRFWRSDPSRVRRSGGTGLGLAISVEDARLHQGRLEAWGEPGEG ACFRLTLPMVRGHKVTTSPLPMKPIPQPVLQPVAQPNPQPMPPEYKERQRPREHAEWS G" repeat_region complement(3626614..3626666) /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene complement(3626663..3627349) /gene="mtrA" /locus_tag="Rv3246c" /db_xref="GeneID:888743" CDS complement(3626663..3627349) /gene="mtrA" /locus_tag="Rv3246c" /function="TRANSCRIPTIONAL ACTIVATOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /experiment="experimental evidence, no additional details recorded" /note="Rv3246c, (MTCY20B11.21c), len: 228 aa. mtrA, transcriptional activator, response regulator (see citations below), equivalent to Q9CCJ2|MTRA|ML0773 PUTATIVE TWO-COMPONENT RESPONSE REGULATOR from Mycobacterium leprae (228 aa), FASTA scores: opt: 1458, E(): 1.4e-85, (98.7% identity in 228 aa overlap). Also highly similar to others e.g. Q9F9J5|SCRA PUTATIVE RESPONSE REGULATOR from Streptomyces coelicolor (228 aa), FASTA scores: opt: 1141, E(): 1.9e-65, (74.9% identity in 227 aa overlap); Q9KYW8|SCE33.15c PUTATIVE TWO-COMPONENT SYSTEM RESPONSE REGULATOR from Streptomyces coelicolor (229 aa), FASTA scores: opt: 1141, E(): 1.9e-65, (74.9% identity in 227 aa overlap); Q9F868|REGX3 RESPONSE REGULATOR REGX3 from Mycobacterium smegmatis (228 aa), FASTA scores: opt: 730, E(): 2.3e-39, (50.90% identity in 222 aa overlap); etc. Relatives in Mycobacterium tuberculosis are: U01971|MTU01971_1; Q11156|RGX3_MYCTU; MTCY20G9.17, E(): 0; MTCY31.31c, E(): 3.4e-29; MTCY369.02, E(): 5.7e-28. SIMILAR TO BACTERIAL REGULATORY PROTEINS INVOLVED IN SIGNAL TRANSDUCTION. THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. Experiments showed mtrA is differentially expressed in virulent and avirulent strains during growth in macrophages." /codon_start=1 /transl_table=11 /product="two component sensory transduction transcriptional regulatory protein MTRA" /protein_id="NP_217763.1" /db_xref="GI:15610382" /db_xref="GOA:Q50447" /db_xref="UniProtKB/Swiss-Prot:Q50447" /db_xref="GeneID:888743" /translation="MDTMRQRILVVDDDASLAEMLTIVLRGEGFDTAVIGDGTQALTA VRELRPDLVLLDLMLPGMNGIDVCRVLRADSGVPIVMLTAKTDTVDVVLGLESGADDY IMKPFKPKELVARVRARLRRNDDEPAEMLSIADVEIDVPAHKVTRNGEQISLTPLEFD LLVALARKPRQVFTRDVLLEQVWGYRHPADTRLVNVHVQRLRAKVEKDPENPTVVLTV RGVGYKAGPP" gene complement(3627419..3628063) /gene="tmk" /locus_tag="Rv3247c" /db_xref="GeneID:888740" CDS complement(3627419..3628063) /gene="tmk" /locus_tag="Rv3247c" /EC_number="2.7.4.9" /function="PHOSPHORYLATION OF DTMP TO FORM DTDP IN BOTH DE NOVO AND SALVAGE PATHWAYS OF DTTP SYNTHESIS [CATALYTIC ACTIVITY: ATP + THYMIDINE 5'-PHOSPHATE = ADP + THYMIDINE 5'-DIPHOSPHATE]." /note="catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to thymidine monophosphate (dTMP) to form thymidine diphosphate (dTDP)" /codon_start=1 /transl_table=11 /product="thymidylate kinase" /protein_id="NP_217764.1" /db_xref="GI:15610383" /db_xref="GOA:O05891" /db_xref="UniProtKB/TrEMBL:O05891" /db_xref="GeneID:888740" /translation="MLIAIEGVDGAGKRTLVEKLSGAFRAAGRSVATLAFPRYGQSVA ADIAAEALHGEHGDLASSVYAMATLFALDRAGAVHTIQGLCRGYDVVILDRYVASNAA YSAARLHENAAGKAAAWVQRIEFARLGLPKPDWQVLLAVSAELAGERSRGRAQRDPGR ARDNYERDAELQQRTGAVYAELAAQGWGGRWLVVGADVDPGRLAATLAPPDVPS" gene complement(3628160..3629647) /gene="sahH" /locus_tag="Rv3248c" /db_xref="GeneID:888746" CDS complement(3628160..3629647) /gene="sahH" /locus_tag="Rv3248c" /EC_number="3.3.1.1" /function="THIOESTER HYDROLASE WHICH ACTING ON ETHER BOUNDS. COULD BE INVOLVED IN METHIONINE AND SELENOAMINO ACID METABOLISMS. ALSO INVOLVED IN ACTIVATED METHYL. CYCLE ADENOSYLHOMOCYSTEINE IS A COMPETITIVE INHIBITOR OF S-ADENOSYL-L-METHIONINE-DEPENDENT METHYL TRANSFERASE REACTIONS; THEREFORE ADENOSYLHOMOCYSTEINASE MAY PLAY A KEY ROLE IN THE CONTROL OF METHYLATIONS VIA REGULATION OF THE INTRACELLULAR CONCENTRATION OF ADENOSYLHOMOCYSTEINE [CATALYTIC ACTIVITY: S-ADENOSYL-L-HOMOCYSTEINE + H(2)O = ADENOSINE + L-HOMOCYSTEINE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of L-homocysteine from S-adenosyl-L-homocysteine" /codon_start=1 /transl_table=11 /product="S-adenosyl-L-homocysteine hydrolase" /protein_id="NP_217765.1" /db_xref="GI:15610384" /db_xref="GOA:P60176" /db_xref="UniProtKB/Swiss-Prot:P60176" /db_xref="GeneID:888746" /translation="MTGNLVTKNSLTPDVRNGIDFKIADLSLADFGRKELRIAEHEMP GLMSLRREYAEVQPLKGARISGSLHMTVQTAVLIETLTALGAEVRWASCNIFSTQDHA AAAVVVGPHGTPDEPKGVPVFAWKGETLEEYWWAAEQMLTWPDPDKPANMILDDGGDA TMLVLRGMQYEKAGVVPPAEEDDPAEWKVFLNLLRTRFETDKDKWTKIAESVKGVTEE TTTGVLRLYQFAAAGDLAFPAINVNDSVTKSKFDNKYGTRHSLIDGINRGTDALIGGK KVLICGYGDVGKGCAEAMKGQGARVSVTEIDPINALQAMMEGFDVVTVEEAIGDADIV VTATGNKDIIMLEHIKAMKDHAILGNIGHFDNEIDMAGLERSGATRVNVKPQVDLWTF GDTGRSIIVLSEGRLLNLGNATGHPSFVMSNSFANQTIAQIELWTKNDEYDNEVYRLP KHLDEKVARIHVEALGGHLTKLTKEQAEYLGVDVEGPYKPDHYRY" misc_feature complement(3628781..3628825) /gene="sahH" /locus_tag="Rv3248c" /note="PS00739 S-adenosyl-L-homocysteine hydrolase signature" gene complement(3629752..3630387) /locus_tag="Rv3249c" /db_xref="GeneID:888741" CDS complement(3629752..3630387) /locus_tag="Rv3249c" /function="PROBABLY INVOLVED IN A TRANSCRIPTIONAL MECHANISM" /experiment="experimental evidence, no additional details recorded" /note="Rv3249c, (MTCY20B11.24c), len: 211 aa. Possible transcriptional regulatory protein, tetR family, with similarity to several e.g. Q9AE61|ALKB1 PUTATIVE TETR-REGULATORY from Rhodococcus erythropolis (208 aa), FASTA scores: opt: 503, E(): 7.7e-26, (40.6% identity in 192 aa overlap); CAC37620 PUTATIVE TETR-REGULATORY PROTEIN from Prauserella rugosa (212 aa), FASTA scores: opt: 246, E(): 4.4e-09, (27.95% identity in 186 aa overlap); Q9K4B0|SC7E4.06 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL from Streptomyces coelicolor (203 aa), FASTA scores: opt: 224, E(): 1.1e-07, (34.5% identity in 197 aa overlap); Q11063|YC55_MYCTU|Rv1255c|MT1294|MTCY50.27 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 191, E(): 1.6e-05, (28.35% identity in 180 aa overlap); etc. Equivalent to AAK47689 from Mycobacterium tuberculosis strain CDC1551 (230 aa) but shorter 19 aa. COULD BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Possible helix-turn helix motif at aa 44-65 (+6.66 SD)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217766.1" /db_xref="GI:15610385" /db_xref="GOA:O05892" /db_xref="UniProtKB/TrEMBL:O05892" /db_xref="GeneID:888741" /translation="MSTPSATVAPVKRIPYAEASRALLRDSVLDAMRDLLLTRDWSAI TLSDVARAAGISRQTIYNEFGSRQGLAQGYALRLADRLVDNVHASLDANVGNFYEAFL QGFRSFFAESAADPLVISLLTGVAKPDLLQLITTDSAPIITRASARLAPAFTDTWVAT TDNDANVLSRAIVRLCLSYVSMPPEADHDVAADLARLITPFAERHGVINVP" gene complement(3630384..3630566) /gene="rubB" /locus_tag="Rv3250c" /db_xref="GeneID:888744" CDS complement(3630384..3630566) /gene="rubB" /locus_tag="Rv3250c" /function="INVOLVED IN THE HYDROCARBON HYDROXYLATING SYSTEM TO CONVERT CONVERSION OF DODECANE TO LAURIC ACID, WHICH TRANSFERS ELECTRONS FROM NADH TO RUBREDOXIN REDUCTASE AND THEN THROUGH RUBREDOXIN TO ALKANE 1 MONOOXYGENASE." /experiment="experimental evidence, no additional details recorded" /note="Rv3250c, (MTCY20B11.25c), len: 60 aa. Probable rubB, rubredoxin, highly similar to other rubredoxins e.g. Q9AE66|RUBA4 from Rhodococcus erythropolis (60 aa), FASTA scores: opt: 391, E(): 2.2e-21, (83.05% identity in 59 aa overlap); Q9AE63|RUBA2 from Rhodococcus erythropolis (63 aa), FASTA scores: opt: 380, E(): 1.4e-20, (83.9% identity in 56 aa overlap); P42453|RUBR_ACICA|RUBA from Acinetobacter calcoaceticus (54 aa), FASTA scores: opt: 315, E(): 4.9e-16, (69.8% identity in 53 aa overlap); Q9HTK7|PA5351 from Pseudomonas aeruginosa (55 aa), FASTA scores: opt: 298, E(): 8e-15, (64.15% identity in 53 aa overlap); Q9PGC3|XF0379 from Xylella fastidiosa (57 aa), FASTA scores: opt: 263, E(): 2.5e-12, (59.25% identity in 54 aa overlap); etc. Also similar to neighbouring ORF M. tuberculosis RubA (MTCY20B11.26c). Contains rubredoxin signature (PS00202). BELONGS TO THE RUBREDOXIN FAMILY." /codon_start=1 /transl_table=11 /product="rubredoxin RubB" /protein_id="NP_217767.1" /db_xref="GI:15610386" /db_xref="GOA:O05893" /db_xref="UniProtKB/TrEMBL:O05893" /db_xref="GeneID:888744" /translation="MNDYKLFRCIQCGFEYDEALGWPEDGIAAGTRWDDIPDDWSCPD CGAAKSDFEMVEVARS" misc_feature complement(3630429..3630461) /gene="rubB" /locus_tag="Rv3250c" /note="PS00202 Rubredoxin signature" gene complement(3630571..3630738) /gene="rubA" /locus_tag="Rv3251c" /db_xref="GeneID:888747" CDS complement(3630571..3630738) /gene="rubA" /locus_tag="Rv3251c" /function="INVOLVED IN THE HYDROCARBON HYDROXYLATING SYSTEM, WHICH TRANSFERS ELECTRONS FROM NADH TO RUBREDOXIN REDUCTASE AND THEN THROUGH RUBREDOXIN TO ALKANE 1 MONOOXYGENASE." /experiment="experimental evidence, no additional details recorded" /note="Rv3251c, (MTCY20B11.26c), len: 55 aa. Probable rubA, rubredoxin, highly similar to other rubredoxins (but sometimes shorter) e.g. Q9AE67|RUBA3 from Rhodococcus erythropolis (61 aa), FASTA scores: opt: 335, E(): 1e-17, (73.6% identity in 53 aa overlap); P00272|RUB2_PSEOL|ALKG from Pseudomonas oleovorans (172 aa), FASTA scores: opt: 278, E(): 2.7e-13, (65.3% identity in 49 aa overlap); CAC38028|ALKG from Alcanivorax borkumensis (174 aa), FASTA scores: opt: 271, E(): 8.6e-13, (62.0% identity in 50 aa overlap); Q9WWW4|ALKG from Pseudomonas putida (175 aa), FASTA scores: opt: 270, E(): 1e-12, (61.8% identity in 55 aa overlap); etc. Also highly similar to C-terminus of Q9XBM1|ALKB ALKANE 1-MONOOXYGENASE (EC 1.14.15.3) from Prauserella rugosa (490 aa), FASTA scores: opt: 296, E(): 2.9e-14, (75.5% identity in 49 aa overlap). Also similar to neighbouring ORF Mycobacterium tuberculosis rubB (MTCY20B11.25c). Contains rubredoxin signature (PS00202). BELONGS TO THE RUBREDOXIN FAMILY." /codon_start=1 /transl_table=11 /product="rubredoxin RUBA" /protein_id="NP_217768.1" /db_xref="GI:15610387" /db_xref="GOA:O05894" /db_xref="UniProtKB/TrEMBL:O05894" /db_xref="GeneID:888747" /translation="MAAYRCPVCDYVYDEANGDAREGFPAGTGWDQIPDDWCCPDCAV REKVDFEKIGG" misc_feature complement(3630610..3630642) /gene="rubA" /locus_tag="Rv3251c" /note="PS00202 Rubredoxin signature" gene complement(3630738..3631988) /gene="alkB" /locus_tag="Rv3252c" /db_xref="GeneID:888690" CDS complement(3630738..3631988) /gene="alkB" /locus_tag="Rv3252c" /EC_number="1.14.15.3" /function="THOUGHT TO BE INVOLVED IN FATTY ACID METABOLISM. GENERATES OCTANOL AND OXIDIZED RUBREDOXIN FROM OCTANE AND REDUCED RUBREDOXIN. ALSO HYDROXYLATES FATTY ACIDS IN THE OMEGA-POSITION [CATALYTIC ACTIVITY: OCTANE + REDUCED RUBREDOXIN + (O)2 = 1-OCTANOL + OXIDIZED RUBREDOXIN + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Rv3252c, (MTCY20B11.27c), len: 416 aa. Probable alkB, transmembrane alkane-1-monooxygenase (EC 1.14.15.3), highly similar to many (see Marin et al., 2001) e.g. Q9AE68|ALKB2 from Rhodococcus erythropolis (408 aa), FASTA scores: opt: 2018, E(): 9.6e-122, (68.6% identity in 415 aa overlap); Q9AFD5|ALKB from Nocardioides sp. CF8 (483 aa), FASTA scores: opt: 1485, E(): 1.4e-87, (56.55% identity in 405 aa overlap); Q9XAU0|ALKB1 from Rhodococcus erythropolis (391 aa), FASTA scores: opt: 1400, E(): 3.3e-82, (62.6% identity in 396 aa overlap); Q9XBM1|ALKB from Prauserella rugosa (490 aa), FASTA scores: opt: 1266, E(): 1.5e-73, (57.55% identity in 410 aa overlap); CAC40954|ALKB4 from Rhodococcus erythropolis (386 aa), FASTA scores: opt: 1190, E(): 9.1e-69, (54.3% identity in 383 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane alkane 1-monooxygenase AlkB" /protein_id="NP_217769.1" /db_xref="GI:15610388" /db_xref="GOA:O05895" /db_xref="UniProtKB/TrEMBL:O05895" /db_xref="GeneID:888690" /translation="MTTQIGSGGPEAPRPPEVEEWRDKKRYLWLMGLIAPTALVVMLP LIWGMNQLGWHAAAQVPLWIGPILLYVLLPLLDLRFGPDGQNPPDEVTDRLENDKYYR YCTYIYIPFQYLSVVLGAYLFTAANLSWLGFDGALSWAGKLGVALSVGVLGGVGINTA HEMGHKKDSLERWLSKITLAQTCYGHFYIEHNRGHHVRVSTPEDPASARFGETLWEFL PRSVIGGLRSAVHLEAQRLRRLGVSPWNPMTYLRNDVLNAWLMSVVLWGGLIAVFGPA LIPFVIIQAVFGFSLLEAVNYLEHYGLLRQKSANGRYERCAPVHSWNSDHIVTNLFLY HLQRHSDHHANPTRRYQTLRSMAGAPNLPSGYASMISLTYFPPLWRKVMDHRVLEHYG GDITRVNLHPRVREKALARYGASA" gene complement(3632097..3633584) /locus_tag="Rv3253c" /db_xref="GeneID:888739" CDS complement(3632097..3633584) /locus_tag="Rv3253c" /function="THOUGHT TO BE INVOLVED IN CATIONIC AMINO ACID TRANSPORT ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3253c, (MTCY20B11.28c), len: 495 aa. Possible cationic amino acid transporter, integral membrane protein, similar to many e.g. O69844|SC1C3.02 PUTATIVE CATIONIC AMINO ACID TRANSPORTER from Streptomyces coelicolor (503 aa), FASTA scores: opt: 1649, E(): 5.8e-92, (52.6% identity in 485 aa overlap); Q9AE69 PUTATIVE TRANSPORTER (FRAGMENT) from Rhodococcus erythropolis (385 aa), FASTA scores: opt: 1594, E(): 9.7e-89, (62.0% identity in 387 aa overlap); Q9PBD7|XF2207 CATIONIC AMINO ACID TRANSPORTER from Xylella fastidiosa (483 aa), FASTA scores: opt: 1079, E(): 1.2e-57, (40.55% identity in 493 aa overlap); Q9SRU9|F20H23.25 PUTATIVE CATIONIC AMINO ACID TRANSPORTER from Arabidopsis thaliana (Mouse-ear cress) (614 aa), FASTA scores: opt: 802, E(): 6.7e-41, (36.4% identity in 445 aa overlap); P30823|CTR1_RAT|SLC7A1|ATRC1 HIGH-AFFINITY CATIONIC AMINO ACID TRANSPORTER-1 from Rattus norvegicus (Rat) (624 aa), FASTA scores: opt: 782, E(): 1.1e-39, (36.1% identity in 432 aa overlap); etc. Relatives in Mycobacterium tuberculosis include: MTCY3G12.14, E(): 5.6e-31; MTCY39.19, E(): 1.6e-14. SEEMS TO BELONG TO THE APC FAMILY." /codon_start=1 /transl_table=11 /product="cationic amino acid transport integral membrane protein" /protein_id="NP_217770.1" /db_xref="GI:15610389" /db_xref="GOA:O05896" /db_xref="UniProtKB/TrEMBL:O05896" /db_xref="GeneID:888739" /translation="MAGRRRMKSVEQSIADTDEPTTRLRKDLTWWDLVVFGVSVVIGA GIFTVTASTAGDITGPAIWISFLIAAATCALAALCYAEFASTLPVAGSAYTFSYATFG EFLAWVIGWNLVLELAMGAAVVAKGWSSYLGTVFGFGNGTGHLGSLQLDWGALVIVTL VATLIALGTKLSSRFSAVVTAIKVSVVVLVVVVGAFYIRAANYSPFIPEPEVQHHGGG LDQSVFSLLTGAQGSHYGWYGVLAGASIVFFAFIGFDIVATMAEETKRPQRDVPRGIL ASLGVVTLLYVAVSVVLSGMVPYTQLRTVPGRGPANLATAFQANGVYWASGIISVGAL AGLTTVVMVLMLGQCRVLFAMARDGLVPRQLAKTGSRGTPVRVTVLVAVLVATTASVF PITKLEEMVNVGTLFAFILVSAGVVVLRRTRPDLQRGFTAPWVPLLPIAAVCACLWLM LNLTALTWIRFGIWLVAGTAIYVGYGRRHSAQGLRQARESATRRC" gene 3633675..3635063 /locus_tag="Rv3254" /db_xref="GeneID:887237" CDS 3633675..3635063 /locus_tag="Rv3254" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3254, (MTCY20B11.29), len: 462 aa. Conserved hypothetical protein, similar to CAC37877|SC1G7.02 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (440 aa), FASTA scores: opt: 606, E(): 6.2e-31, (31.7% identity in 445 aa overlap); O86550|SC1F2.13c HYPOTHETICAL 50.7 KDA PROTEIN from Streptomyces coelicolor (476 aa), FASTA scores: opt: 577, E(): 4.5e-29, (32.5% identity in 400 aa overlap); Q9L0A8|SCC24.09 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (468 aa), FASTA scores: opt: 380, E(): 1.3e-16, (30.7% identity in 391 aa overlap); BAB48792|MLL1411 PROBABLE FAD-DEPENDENT MONOOXYGENASE from Rhizobium loti (Mesorhizobium loti) (421 aa), FASTA scores: opt: 128, E(): 1.1, (25.2% identity in 397 aa overlap); Q9L7X9|BENF BENZOATE-SPECIFIC PORIN-LIKE PROTEIN from Pseudomonas putida (397 aa), FASTA scores: opt: 119, E(): 4, (24.85% identity in 157 aa overlap); etc. Also similar to N-terminus of AAK46259|MT1987 PUTATIVE FERREDOXIN REDUCTASE, ELECTRON TRANSFER COMPONENT from Mycobacterium tuberculosis strain CDC1551 (839 aa), FASTA scores: opt: 493, E(): 1.5e-23, (30.65% identity in 382 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217771.1" /db_xref="GI:15610390" /db_xref="UniProtKB/TrEMBL:O05897" /db_xref="GeneID:887237" /translation="MVIGASIAGLCAARVLSDFYSTVTVFERDELPEAPANRATVPQD RHLHMLMARGAQEFDSLFPGLLHDMVAAGVPMLENRPDCIYLGAAGHVLGTGHTLRKE FTAYVPSRPHLEWQLRRRVLQLSNVQIVRRLVTEPQFERRQQRVVGVLLDSPGSGQDR EREEFIAADLVVDAAGRGTRLPVWLTQWGYRRPAEDTVDIGISYASHQFRIPDGLIAE KVVVAGASHDQSLGLGMLCYEDGTWVLTTFGVADAKPPPTFDEMRALADKLLPARFTA ALAQAQPIGCPAFHAFPASRWRRYDKLERFPRGIVPFGDAVASFNPTFGQGMTMTSLQ AGHLRRALKARNSAMKGDLAAELNRATAKTTYPVWMMNAIGDISFHHATAEPLPRWWR PAGSLFDQFLGAAETDPVLAEWFLRRFSLLDSLYMVPSVPIIGRAIAHNLRLWLKEQR ERRQPVTTRRSP" gene complement(3635041..3636267) /gene="manA" /locus_tag="Rv3255c" /db_xref="GeneID:888698" CDS complement(3635041..3636267) /gene="manA" /locus_tag="Rv3255c" /EC_number="5.3.1.8" /function="THIS ENZYME CONVERTS D-MANNOSE 6-PHOSPHATE TO D-FRUCTOSE 6-PHOSPHATE [CATALYTIC ACTIVITY: D-MANNOSE 6-PHOSPHATE = D-FRUCTOSE 6-PHOSPHATE]." /note="Rv3255c, (MTCY20B11.30c), len: 408 aa. Probable manA, mannose-6-phosphate isomerase (EC 5.3.1.8), equivalent to Q9CCJ5|MANA|ML0765 PUTATIVE MANNOSE-6-PHOSPHATE ISOMERASE from Mycobacterium leprae (410 aa), FASTA scores: opt: 2271, E(): 1.6e-133, (84.45% identity in 411 aa overlap). Also similar to many others e.g. Q9KZL9|MANA from Streptomyces coelicolor (383 aa), FASTA scores: opt: 946, E(): 2.4e-51, (44.4% identity in 403 aa overlap); Q9KV87|VC0269 from Vibrio cholerae (399 aa), FASTA scores: opt: 726, E(): 1.1e-37, (34.15% identity in 404 aa overlap); Q9CMJ5|PMI|PM0829 from Pasteurella multocida (400 aa), FASTA scores: opt: 640, E(): 2.4e-32, (32.5% identity in 391 aa overlap); etc. SIMILAR TO FAMILY 1 OF MANNOSE-6-PHOSPHATE ISOMERASES." /codon_start=1 /transl_table=11 /product="mannose-6-phosphate isomerase" /protein_id="NP_217772.1" /db_xref="GI:15610391" /db_xref="GOA:O05898" /db_xref="UniProtKB/TrEMBL:O05898" /db_xref="GeneID:888698" /translation="MELLRGALRTYAWGSRTAIAEFTGRPVPAAHPEAELWFGAHPGD PAWLQTPHGQTSLLEALVADPEGQLGSASRARFGDVLPFLVKVLAADEPLSLQAHPSA EQAVEGYLREERMGIPVSSPVRNYRDTSHKPELLVALQPFEALAGFREAARTTELLRA LAVSDLDPFIDLLSEGSDADGLRALFTTWITAPQPDIDVLVPAVLDGAIQYVSSGATE FGAEAKTVLELGERYPGDAGVLAALLLNRISLAPGEAIFLPAGNLHAYVRGFGVEVMA NSDNVLRGGLTPKHVDVPELLRVLDFAPTPKARLRPPIRREGLGLVFETPTDEFAATL LVLDGDHLGHEVDASSGHDGPQILLCTEGSATVHGKCGSLTLQRGTAAWVAADDGPIR LTAGQPAKLFRATVGL" gene complement(3636275..3637315) /locus_tag="Rv3256c" /db_xref="GeneID:888696" CDS complement(3636275..3637315) /locus_tag="Rv3256c" /function="UNKNOWN" /note="Rv3256c, (MTV015.01c-MTCY20B11.31c), len: 346 aa. Conserved hypothetical protein, equivalent to Q9CCJ6|ML0764 HYPOTHETICAL PROTEIN from Mycobacterium leprae (365 aa), FASTA scores: opt: 1574, E(): 1.4e-82, (75.35% identity in 365 aa overlap). Also similar to other hypothetical bacterial proteins e.g. Q9KZL8|SCE34.07c from Streptomyces coelicolor (375 aa), FASTA scores: opt: 171, E(): 0.012, (31.1% identity in 376 aa overlap); P55709|Y4YA_RHISN from Rhizobium sp. strain NGR234 (457 aa), FASTA scores: opt: 140, E(): 0.84, (28.75% identity in 233 aa overlap). TBparse score is 0.878." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217773.1" /db_xref="GI:15610392" /db_xref="UniProtKB/TrEMBL:O05899" /db_xref="GeneID:888696" /translation="MNVARAIDLEDTEGLIAADRGALLRAASMAGAQVRAIAAAADEG ELDLLRGSDRPRSVIWVTGRGTAETAGTILASTLGAGAAEPIVLASAAPPWVGPLDVL IVAGDDPGDPALVGAAAIGVRRGARVVVVAPYEGPLRDSTAGRVAVLEPRLRVPDEFG LSRYLAAGLAALQTVDPKLRIDLASLADELDAEALRNSAGREVFTNPAKALAARVSGC QLALAGDNAATLALARHGSSVMLRIANQVVAATRLSDAVVALRAGTPPDALFHDEEID GPAPQRLRVLALALAGERTVVAARVAGLDDAYLVAAEDVPELLDAPVGSGGAVLAVRL EMAAVYLRLVRG" gene complement(3637312..3638709) /gene="manB" /locus_tag="Rv3257c" /db_xref="GeneID:888699" CDS complement(3637312..3638709) /gene="manB" /locus_tag="Rv3257c" /EC_number="5.4.2.8" /function="THIS ENZYME CONVERSES D-MANNOSE 1-PHOSPHATE IN D-MANNOSE 6-PHOSPHATE [CATALYTIC ACTIVITY: D-MANNOSE 1-PHOSPHATE = D-MANNOSE 6-PHOSPHATE]." /note="converts mannose-6-phosphate to mannose-1-phosphate; the resulting product is then converted to GDP-mannose by ManC which is then used in the synthesis of mannose-containing glycoconjugates that are important for mediating entry into host cells" /codon_start=1 /transl_table=11 /product="phosphomannomutase/phosphoglucomutase" /protein_id="NP_217774.1" /db_xref="GI:15610393" /db_xref="GOA:O86374" /db_xref="UniProtKB/TrEMBL:O86374" /db_xref="GeneID:888699" /translation="MSWPAAAVDRVIKAYDVRGLVGEEIDESLVTDLGAAFARLMRTE DARPVVIGHDMRDSSPSLADAFAAGVTGQGLDVVRVGLASTDQLYFASGLLDCPGAMF TASHNPAAYNGIKMCRAAAKPVGADTGLTAIRDDLIAGVARYDGTPGTIADQDVLVDY GAFLRSLVDTSGLRPLRVAVDAGNGMAGHTAPAVLGVIDSITLLPSYFELDGSFPNHE ANPLDPANLVDLQAYVRDTGADIGLAFDGDADRCFVVDERGQPVSPSTVTALVAAREL NREIGATIIHNVITSRAVPELVAERGGTPLRSRVGHSYIKALMAETGAIFGGEHSAHY YFRDFWGADSGMLAALHVLAALGEQSRPLSELTADYQRYESSGEINFTVVDSSACVEA VLKSFGNRIVSIDHLDGVTVDLGDDSWFNLRSSNTEPLLRLNVEGRSVGDVDAVVRQV SAEIAAQSAHAKAGP" gene complement(3638811..3639302) /locus_tag="Rv3258c" /db_xref="GeneID:888694" CDS complement(3638811..3639302) /locus_tag="Rv3258c" /function="UNKNOWN" /note="Rv3258c, (MTV015.03c), len: 163 aa. Conserved hypothetical protein, equivalent to Q9CCJ8|ML0762 HYPOTHETICAL PROTEIN from Mycobacterium leprae (165 aa), FASTA scores: opt: 840, E(): 9.9e-42, (76.9% identity in 169 aa overlap). Also similar to Q9KZL4|SCE34.11c HYPOTHETICAL 15.0 KDA PROTEIN from Streptomyces coelicolor (140 aa), FASTA scores: opt: 353, E(): 1.1e-13, (48.3% identity in 147 aa overlap); and shows really weak similarity to other bacterial proteins. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217775.1" /db_xref="GI:15610394" /db_xref="UniProtKB/TrEMBL:O53351" /db_xref="GeneID:888694" /translation="MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDS TAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVR EGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPD PAD" gene 3639425..3639844 /locus_tag="Rv3259" /db_xref="GeneID:888697" CDS 3639425..3639844 /locus_tag="Rv3259" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3259, (MTV015.04), len: 139 aa. Conserved hypothetical protein, equivalent, but shorter 29 aa, to Q9CCJ9|ML0761 HYPOTHETICAL PROTEIN from Mycobacterium leprae (167 aa), FASTA scores: opt: 846, E(): 2.2e-47, (89.2% identity in 139 aa overlap). C-terminus highly similar to Q9S425 HYPOTHETICAL 6.0 KDA PROTEIN (FRAGMENT) from Mycobacterium smegmatis (54 aa), FASTA scores: opt: 275, E(): 2.7e-11, (81.15% identity in 53 aa overlap). Also similar to Q9KZL3|SCE34.12 from Streptomyces coelicolor (117 aa), FASTA scores: opt: 152, E(): 0.004, (34.15% identity in 126 aa overlap). Equivalent to AAK47699 from Mycobacterium tuberculosis strain CDC1551 (175 aa) but shorter 36 aa. TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217776.1" /db_xref="GI:15610395" /db_xref="UniProtKB/TrEMBL:O53352" /db_xref="GeneID:888697" /translation="MRGPLLPPTVPGWRSRAERFDMAVLEAYEPIERRWQERVSQLDI AVDEIPRIAAKDPESVQWPPEVIADGPIALARLIPAGVDVRGNATRARIVLFRKPIER RAKDTEELGELLHEILVAQVAIYLDVDPSVIDPTIDD" gene complement(3639872..3640141) /gene="whiB2" /locus_tag="Rv3260c" /db_xref="GeneID:888687" CDS complement(3639872..3640141) /gene="whiB2" /locus_tag="Rv3260c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3260c, (MTV015.05c), len: 89 aa. Probable whiB2 (alternate gene name: whmD), WhiB-like regulatory protein (see Hutter & Dick 1999), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to Q9CCK0|WHIB2|ML0760 PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (89 aa), FASTA scores: opt: 550, E(): 6.1e-31, (85.4% identity in 89 aa overlap). Also similar to others e.g. Q9S426 WHMD REGULATORY PROTEIN (see Gomez & Bishai 2000) from Mycobacterium smegmatis (129 aa), FASTA scores: opt: 488, E(): 1.4e-26, (83.55% identity in 85 aa overlap); Q06387|WHIB-STV WHIB-STV PROTEIN from Streptomyces griseocarneus (87 aa), FASTA scores: opt: 443, E(): 1.2e-23, (74.7% identity in 83 aa overlap); Q05429|WHIB|WHIB1 TRANSCRIPTION-LIKE FACTOR WHIB from Streptomyces aureofaciens (87 aa), FASTA scores: opt: 442, E(): 1.3e-23, (74.7% identity in 83 aa overlap); etc. Equivalent to AAK47700 WhiB-related protein from Mycobacterium tuberculosis strain CDC1551 (123 aa) but shorter 34 aa. Also similar to other Mycobacterium tuberculosis proteins: MTCY07D11.07c (45.1% identity in 71 aa overlap) and MTCY78.13c (37.4% identity in 91 aa overlap). Start chosen by homology but ORF continues to ATG upstream at 3754.; whmD" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB2" /protein_id="NP_217777.1" /db_xref="GI:15610396" /db_xref="GOA:O53353" /db_xref="UniProtKB/TrEMBL:O53353" /db_xref="GeneID:888687" /translation="MVPEAPAPFEEPLPPEATDQWQDRALCAQTDPEAFFPEKGGSTR EAKKICMGCEVRHECLEYALAHDERFGIWGGLSERERRRLKRGII" gene 3640543..3641538 /gene="fbiA" /locus_tag="Rv3261" /db_xref="GeneID:888701" CDS 3640543..3641538 /gene="fbiA" /locus_tag="Rv3261" /function="REQUIRED FOR COENZYME F420 PRODUCTION: INVOLVED IN THE CONVERSION OF FO INTO F420." /note="catalyzes the formation of the L-lactyl phosphodiester of 7,8-didemethyl-8-hydroxy-5-deazariboflavin (F420-0) and GMP from actyl (2) diphospho-(5')guanosine (LPPG) to 7,8-didemethyl-8-hydroxy-5-deazariboflavin (FO)" /codon_start=1 /transl_table=11 /product="LPPG:FO 2-phospho-L-lactate transferase" /protein_id="NP_217778.1" /db_xref="GI:15610397" /db_xref="UniProtKB/TrEMBL:P96866" /db_xref="GeneID:888701" /translation="MKVTVLAGGVGGARFLLGVQQLLGLGQFAANSAHSDADHQLSAV VNVGDDAWIHGLRVCPDLDTCMYTLGGGVDPQRGWGQRDETWHAMQELVRYGVQPDWF ELGDRDLATHLVRTQMLQAGYPLSQITEALCDRWQPGARLLPATDDRCETHVVITDPV DESRKAIHFQEWWVRYRAQVPTHSFAFVGAEKSSAATEAIAALADADIIMLAPSNPVV SIGAILAVPGIRAALREATAPIVGYSPIIGEKPLRGMADTCLSVIGVDSTAAAVGRHY GARCATGILDCWLVHDGDHAEIDGVTVRSVPLLMTDPNATAEMVRAGCDLAGVVA" gene 3641535..3642881 /gene="fbiB" /locus_tag="Rv3262" /db_xref="GeneID:888693" CDS 3641535..3642881 /gene="fbiB" /locus_tag="Rv3262" /function="REQUIRED FOR COENZYME F420 PRODUCTION: INVOLVED IN THE CONVERSION OF FO INTO F420." /note="catalyzes the addition of gamma linked glutamate to 7,8-didemethyl-8-hydroxy-5-deazariboflavin coenzyme F420-0)" /codon_start=1 /transl_table=11 /product="F420-0--gamma-glutamyl ligase" /protein_id="NP_217779.1" /db_xref="GI:15610398" /db_xref="GOA:P96867" /db_xref="UniProtKB/TrEMBL:P96867" /db_xref="GeneID:888693" /translation="MTGPEHGSASTIEILPVIGLPEFRPGDDLSAAVAAAAPWLRDGD VVVVTSKVVSKCEGRLVPAPEDPEQRDRLRRKLIEDEAVRVLARKDRTLITENRLGLV QAAAGVDGSNVGRSELALLPVDPDASAATLRAGLRERLGVTVAVVITDTMGRAWRNGQ TDAAVGAAGLAVLRNYAGVRDPYGNELVVTEVAVADEIAAAADLVKGKLTATPVAVVR GFGVSDDGSTARQLLRPGANDLFWLGTAEALELGRQQAQLLRRSVRRFSTDPVPGDLV EAAVAEALTAPAPHHTRPTRFVWLQTPAIRARLLDRMKDKWRSDLTSDGLPADAIERR VARGQILYDAPEVVIPMLVPDGAHSYPDAARTDAEHTMFTVAVGAAVQALLVALAVRG LGSCWIGSTIFAADLVRDELDLPVDWEPLGAIAIGYADEPSGLRDPVPAADLLILK" gene 3643177..3644838 /locus_tag="Rv3263" /db_xref="GeneID:888691" CDS 3643177..3644838 /locus_tag="Rv3263" /EC_number="2.1.1.-" /function="CAUSES DNA METHYLATION." /note="Rv3263, (MTCY71.03), len: 553 aa. Probable DNA methylase (EC 2.1.1.-), equivalent to Q9CCK4|ML0756 PROBABLE DNA METHYLASE from Mycobacterium leprae (555 aa), FASTA scores: opt: 2980, E(): 2.1e-184, (81.9% identity in 541 aa overlap). Also similar to others e.g. P25240|MT57_ECOLI|ECO57IM MODIFICATION METHYLASE from Escherichia coli (544 aa), FASTA scores: opt: 595, E(): 1e-30, (30.35% identity in 507 aa overlap); P25201|MTA1_ACICA|ACCIM MODIFICATION METHYLASE ACCI from Acinetobacter calcoaceticus (540 aa), FASTA scores: opt: 366, E(): 5.7e-16, (23.35% identity in 467 aa overlap); Q56752|M-ACCI ACCI METHYLASE from Bergeyella zoohelcum (541 aa), FASTA scores: opt: 365, E(): 6.6e-16, (22.95% identity in 466 aa overlap); etc. Contains PS00092 N-6 Adenine-specific DNA methylases signature. Alternative start site at aa 25." /codon_start=1 /transl_table=11 /product="DNA methylase (modification methylase) (methyltransferase)" /protein_id="NP_217780.1" /db_xref="GI:15610399" /db_xref="GOA:P96868" /db_xref="REBASE:MtuHORF3263P" /db_xref="UniProtKB/TrEMBL:P96868" /db_xref="GeneID:888691" /translation="MQPSHPTRPGAVIRYVGSSLDTCPMTTFAGKTAASADKVRGGYY TPPAVARFLAHWVHQAGPKILEPSCGDGRILRELSAITDHAHGVELVAREAKKSRDFA SVDTENLFTWLHKTQLGSWDGVAGNPPYIRFGNWASEQRDPALELMRRVGLRPTKLTN AWVPFVVASTTLARDGGRVGLVVPAELLQVTYAAQLREFLLSRYREITLVTFERLVFD GILQEVVLFCGVVGPGPAHIRTVRLGDANDLNALGDKDFTNESAPALLHEKEKWTKYF LDPAQIRLLRGLKQSATMIRLGELADVDVGIVTGRNSFFTFTDAKAQALGLRAHCVPL VSRSAQLSGLIYDEDCRACDVAGNHRTWLLDAADYPTDPALVAHITAGEAAGVHLGYK CSIRKPWWSTPSLWMPDLFMLRQIHFAPRLTVNAAAATSTDTVHRVRLDPNVDPATLA AVFHNSATFAFAEIMGRSYGGGILELEPREAEQLPMPPPAYGSAELAQDVDLLLKANE IDKALDVVDRHVLIDGLGLSPRLVAGCRAAWLTLRDRRTKRGSRR" misc_feature 3643546..3643566 /locus_tag="Rv3263" /note="PS00092 N-6 Adenine-specific DNA methylases signature" gene complement(3644898..3645977) /gene="manB" /locus_tag="Rv3264c" /db_xref="GeneID:888715" CDS complement(3644898..3645977) /gene="manB" /locus_tag="Rv3264c" /EC_number="2.7.7.-" /function="INVOLVED IN GDP-MANNOSE BIOSYNTHESIS AND BIOSYNTHESIS OF NUCLEOTIDE-ACTIVATED GLYCERO-MANNO-HEPTOSE (D-ALPHA-D PATHWAY): GENERATES GDP-MANNOSE AND PHOSPHATE FROM GTP AND ALPHA-D-MANNOSE 1-PHOSPHATE. MANB PRODUCT IS NEEDED FOR ALL MANNOSYL GLYCOLIPIDS AND POLYSACCHARIDES WHICH, LIKE RHAMNOSYL RESIDUES, ARE AN IMPORTANT PART OF THE MYCOBACTERIUM ENVELOPE [CATALYTIC ACTIVITY: ALPHA-D-MANNOSE 1-PHOSPHATE + GTP = GDP-MANNOSE + PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3264c, (MTCY71.04c), len: 359 aa. manB (alternate gene name: hddC), D-alpha-D-mannose-1-phosphate guanylyltransferase (EC 2.7.7.-) (see citations below), equivalent to Q9CCK6|RMLA2|ML0753 PUTATIVE SUGAR-PHOSPHATE NUCLEOTIDYL TRANSFERASE from Mycobacterium leprae (358 aa), FASTA scores: opt: 2075, E(): 2.7e-115, (86.9% identity in 359 aa overlap). Also similar to others e.g. Q9KZK6|SCE34.20c PUTATIVE NUCLEOTIDE PHOSPHORYLASE from Streptomyces coelicolor (360 aa), FASTA scores: opt: 1314, E(): 2.2e-70, (57.0% identity in 358 aa overlap); Q9KZP4|SC1A8A.08 PUTATIVE MANNOSE-1-PHOSPHATE GUANYLTRANSFERASE from Streptomyces coelicolor (831 aa), FASTA scores: opt: 699, E(): 8.6e-34, (34.45% identity in 354 aa overlap) (only similarity in N-terminus for this one); P74589|SLL1496 MANNOSE-1-PHOSPHATE GUANYLTRANSFERASE from Synechocystis sp. strain PCC 6803 (843 aa), FASTA scores: opt: 692, E(): 2.3e-33, (35.1% identity in 342 aa overlap) (only similarity in N-terminus for this one too); BAB59222|TVG0079558 MANNOSE-1-PHOSPHATE GUANYLTRANSFERASE from Thermoplasma volcanium (359 aa), FASTA scores: opt: 664, E(): 5.2e-32, (34.6% identity in 338 aa overlap); Q9ZTW5|GMP GDP-MANNOSE PYROPHOSPHORYLASE from Solanum tuberosum (Potato) (361 aa), FASTA scores: opt: 636, E(): 2.3e-30, (34.65% identity in 361 aa overlap); etc. BELONGS TO FAMILY 2 OF MANNOSE-6-PHOSPHATE ISOMERASES. Note that previously known as rmlA2.; hddC" /codon_start=1 /transl_table=11 /product="D-alpha-D-mannose-1-phosphate guanylyltransferase MANB (D-alpha-D-heptose-1-phosphate guanylyltransferase)" /protein_id="YP_177951.1" /db_xref="GI:57117081" /db_xref="GOA:Q7D5T3" /db_xref="UniProtKB/TrEMBL:Q7D5T3" /db_xref="GeneID:888715" /translation="MATHQVDAVVLVGGKGTRLRPLTLSAPKPMLPTAGLPFLTHLLS RIAAAGIEHVILGTSYKPAVFEAEFGDGSALGLQIEYVTEEHPLGTGGGIANVAGKLR NDTAMVFNGDVLSGADLAQLLDFHRSNRADVTLQLVRVGDPRAFGCVPTDEEDRVVAF LEKTEDPPTDQINAGCYVFERNVIDRIPQGREVSVEREVFPALLADGDCKIYGYVDAS YWRDMGTPEDFVRGSADLVRGIAPSPALRGHRGEQLVHDGAAVSPGALLIGGTVVGRG AEIGPGTRLDGAVIFDGVRVEAGCVIERSIIGFGARIGPRALIRDGVIGDGADIGARC ELLSGARVWPGVFLPDGGIRYSSDV" gene complement(3645979..3646884) /gene="wbbL1" /locus_tag="Rv3265c" /db_xref="GeneID:888714" CDS complement(3645979..3646884) /gene="wbbL1" /locus_tag="Rv3265c" /EC_number="2.-.-.-" /function="PROBABLY INVOLVED IN CELL WALL ARABINOGALACTAN LINKER FORMATION: USES DTDP-L-RHAMNOSE AS SUBSTRATE TO INSERT THE RHAMNOSYL RESIDUE INTO THE CELL WALL. SEEMS TO BE ESSENTIAL FOR MYCOBACTERIAL VIABILITY." /experiment="experimental evidence, no additional details recorded" /note="Rv3265c, (MTCY71.05c), len: 301 aa. Probable wbbL1, dTDP-RHA:A-D-GLCNAC-DIPHOSPHORYL POLYPRENOL A-3-L-RHAMNOSYL TRANSFERASE (EC 2.-.-.-) (see citations below), equivalent to Q9CCK7|WBBL|ML0752 PUTATIVE DTDP-RHAMNOSYL TRANSFERASE from Mycobacterium leprae (308 aa), FASTA scores: opt: 1788, E(): 3e-104, (85.05% identity in 301 aa overlap); and Q9RN50|WBBL|Q9RN49 (see note * below) DTDP-RHA:A-D-GLCNAC-DIPHOSPHORYL POLYPRENOL, A-3-L-RHAMNOSYL TRANSFERASE from Mycobacterium smegmatis (296 aa), FASTA scores: opt: 1494, E(): 6.1e-86, (72.35% identity in 293 aa overlap). Note that previously known as wbbL. [* Note: UNPUBLISHED (experimental study on Mycobacterium smegmatis). Submitted (SEP-1999) to the EMBL/GenBank/DDBJ databases - The cell wall arabinogalactan linker formation enzyme, dTDP-Rha:a-D-GlcNAc-diphosphoryl polyprenol, a-3-L-rhamnosyl transferase is essential for mycobacterial viability - Mills J.A., Motichka K., Jucker M., Wu H.P., Uhlic B.C., Stern R.J., Scherman M.S., Vissa V.D., Yan W., Pan F., Kimbrel S., Kundu M., McNeil M.].; wbbL" /codon_start=1 /transl_table=11 /product="dTDP-RHA:A-D-GlcNAc-diphosphoryl polyprenol" /protein_id="YP_177952.1" /db_xref="GI:57117082" /db_xref="GOA:Q7D5T2" /db_xref="UniProtKB/TrEMBL:Q7D5T2" /db_xref="GeneID:888714" /translation="MVAVTYSPGPHLERFLASLSLATERPVSVLLADNGSTDGTPQAA VQRYPNVRLLPTGANLGYGTAVNRTIAQLGEMAGDAGEPWVDDWVIVANPDVQWGPGS IDALLDAASRWPRAGALGPLIRDPDGSVYPSARQMPSLIRGGMHAVLGPFWPRNPWTT AYRQERLEPSERPVGWLSGSCLLVRRSAFGQVGGFDERYFMYMEDVDLGDRLGKAGWL SVYVPSAEVLHHKAHSTGRDPASHLAAHHKSTYIFLADRHSGWWRAPLRWTLRGSLAL RSHLMVRSSLRRSRRRKLKLVEGRH" gene complement(3646895..3647809) /gene="rmlD" /locus_tag="Rv3266c" /db_xref="GeneID:888704" CDS complement(3646895..3647809) /gene="rmlD" /locus_tag="Rv3266c" /function="INVOLVED IN dTDP-L-RHAMNOSE BIOSYNTHESIS: CONVERTS dTDP-6-DEOXY-L-LYXO-4-HEXULOSE TO dTDP-L-RHAMNOSE WITH THE CONCOMITANT OXIDATION OF NADPH TO NADP+ [CATALYTIC ACTIVITY: dTDP-6-DEOXY-L-LYXO-4-HEXULOSE + NADPH = dTDP-L-RHAMNOSE + NADP+]." /experiment="experimental evidence, no additional details recorded" /note="Rv3266c, (MTCY71.06c), len: 304 aa. rmlD, dTDP-6-deoxy-L-lyxo-4-hexulose reductase (dTDP-rhamnose modification protein) (EC 1.-.-.-)(see citations below), highly similar to Q9CCK8 putative dTDP-rhamnose modification protein from Mycobacterium leprae (311 aa), FASTA scores, opt: 1440, E(): 1.1e-78, (74.7% identity in 312 aa overlap); and similar to several dTDP-4-dehydrorhamnose reductase (EC 1.1.1.133) e.g. STRL_STRGR|P29781 from Streptomyces griseus (304 aa), FASTA scores, opt: 788, E(): 0, (47.4% identity in 304 aa overlap)." /codon_start=1 /transl_table=11 /product="dTDP-6-deoxy-L-lyxo-4-hexulose reductase RmlD" /protein_id="NP_217783.1" /db_xref="GI:15610402" /db_xref="GOA:P96871" /db_xref="UniProtKB/TrEMBL:P96871" /db_xref="GeneID:888704" /translation="MAGRSERLVITGAGGQLGSHLTAQAAREGRDMLALTSSQWDITD PAAAERIIRHGDVVINCAAYTDVDGAESNEAVAYAVNATGPQHLARACARVGARLIHV STDYVFDGDFGGAEPRPYEPTDETAPQGVYARSKLAGEQAVLAAFPEAAVVRTAWVYT GGTGKDFVAVMRRLAAGHGRVDVVDDQTGSPTYVADLAEALLALADAGVRGRVLHAAN EGVVSRFGQARAVFEECGADPQRVRPVSSAQFPRPAPRSSYSALSSRQWALAGLTPLR HWRSALATALAAPANSTSIDRRLPSTRD" gene 3647885..3649381 /locus_tag="Rv3267" /db_xref="GeneID:888713" CDS 3647885..3649381 /locus_tag="Rv3267" /function="UNKNOWN" /note="Rv3267, (MTCY71.07), len: 498 aa. Conserved hypothetical protein, CPSA-related protein, equivalent to Q9CCK9|ML0750 HYPOTHETICAL PROTEIN from Mycobacterium leprae (489 aa), FASTA scores: opt: 2523, E(): 5e-138, (78.9% identity in 498 aa overlap); and Q50160|CPSA (HYPOTHETICAL PROTEIN CPSA) from Mycobacterium leprae (516 aa), FASTA scores: opt: 868, E(): 1.2e-42, (34.7% identity in 507 aa overlap). Also similar to O06347|CPSA|Rv3484|MTCY13E12.37 CPSA from Mycobacterium tuberculosis (512 aa), FASTA scores: opt: 928, E(): 4.2e-46, (37.35% identity in 498 aa overlap); and O53834|Rv0822c|MTV043.14c HYPOTHETICAL 72.9 KDA PROTEIN from Mycobacterium tuberculosis (684 aa), FASTA scores: opt: 434, E(): 1.5e-17, (30.9% identity in 541 aa overlap). Also similar to Q9KZK0|SCE34.26 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (507 aa), FASTA scores: opt: 437, E(): 8.1e-18, (28.55% identity in 469 aa overlap); O68907 FRNA PROTEIN from Streptomyces roseofulvus (770 aa), FASTA scores: opt: 388, E(): 7.6e-15, (32.6% identity in 267 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217784.1" /db_xref="GI:15610403" /db_xref="UniProtKB/TrEMBL:P96872" /db_xref="GeneID:888713" /translation="MMSAQRVVRTVRTARAISTALAVAIVLGTGVAWSSVRSFEDGIF HMSAPSLGHGGDDGAIDILLVGLDSRTDAHGNPLSAEELATLHAGDEEATNTDTIILI RVPNNGKSATAISIPRDSYVAAPGLGKTKINGVYGQTRETKRAGLVQAGASPTEAAAA GTEAGREALIKTVADLTGVTVDHYAEIGLLGFALIADALGGVDVCLKEPVYEPLSGAD FPAGRQKLNGPQALSFVRQRHDLPRGDLDRVVRQQAVMAALAHRVISGQTLSSPATLK RLEQAVQRSVVLSSGWDIMDFVRQLQKLAGGNVAFATIPVLDGAGWSDDGMQSVVRVD PRQVQDWVVGLLHEQDQGKTDELAYTPAKTTANVVNDTDINGLAAAVSKVLSSKGFTT GSVGNNDGDHVPGSQVRAAKADDLGAQQVAKELGGLPVVADASIAPGSVRVVLANDYS GPGSGLGGSDPNGVVSPARAFNLGSADDTTPPPSPILTAGSDAPECIN" misc_feature 3648251..3648274 /locus_tag="Rv3267" /note="PS00017 ATP/GTP-binding site motif A" gene 3649420..3650109 /locus_tag="Rv3268" /db_xref="GeneID:888703" CDS 3649420..3650109 /locus_tag="Rv3268" /function="UNKNOWN" /note="Rv3268, (MTCY71.08), len: 229 aa. Conserved hypothetical protein, similar to Q9KZK4|SCE34.22 HYPOTHETICAL 27.1 KDA PROTEIN from Streptomyces coelicolor (263 aa), FASTA scores: opt: 442, E(): 5.9e-20, (40.1% identity in 242 aa overlap). Also weak similarity to N-terminal part (approximatively 1530 to 1740 residues) of O07944|SNBDE PRISTINAMYCIN I SYNTHASE 3 AND 4 from Streptomyces pristinaespiralis (4848 aa), FASTA scores: opt: 159, E(): 0.11, (30.35% identity in 224 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217785.1" /db_xref="GI:15610404" /db_xref="GOA:P96873" /db_xref="UniProtKB/TrEMBL:P96873" /db_xref="GeneID:888703" /translation="MLRADPVGPRITYYDDATGERIELSAVTLANWAAKTGNLLRDEL AAGPASRVAILLPAHWQTAAVLFGVWWIGAQAILDDSPADVALCTADRLAEADAVVNS AAVAGEVAVLSLDPFGRPATGLPVGVTDYATAVRVHGDQIVPEHNPGPVLAGRSVEQI LRDCAASAAARGLTAADRVLSTASWAGPDELVDGLLAILAAGASLVQVANPDPAMLQR RIATEKVTRVL" gene 3650234..3650515 /locus_tag="Rv3269" /db_xref="GeneID:888709" CDS 3650234..3650515 /locus_tag="Rv3269" /function="UNKNOWN. MAY BE INVOLVED IN A CHAPERONING PROCESS." /experiment="experimental evidence, no additional details recorded" /note="Rv3269, (MTCY71.09), len: 93 aa. Conserved hypothetical protein, similar to many Mycobacterium proteins and chaperonins/heat shock proteins e.g. Q9CCL0|ML0748 HYPOTHETICAL PROTEIN from Mycobacterium leprae (92 aa), FASTA scores: opt: 427, E(): 6.8e-21, (73.65% identity in 91 aa overlap); Q10865|Rv1993c|MT2049|MTCY39.26c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (90 aa), FASTA scores: opt: 313, E(): 1.2e-13, (60.7% identity in 84 aa overlap); P71542|Y968_MYCTU|Rv0968|MTCY10D7.06c (98 aa), FASTA scores: opt: 294, E(): 2.2e-12, (55.1% identity in 98 aa overlap); Q50827|MOPA|GROEL|CH60_MYCVA CHAPERONIN (PROTEIN CPN60) from Mycobacterium vaccae (120 aa), FASTA scores: opt: 107, E(): 2.1, (39.5% identity in 81 aa overlap); Q9AEB3|HSP65 HEAT SHOCK PROTEIN (FRAGMENT) from Mycobacterium gadium (122 aa), FASTA scores: opt: 102, E(): 4.4, (38.25% identity in 81 aa overlap); Q49374|CH60_MYCGN|MOPA|GROEL CHAPERONIN (PROTEIN CPN60) from Mycobacterium genavense (120 aa), FASTA scores: opt: 99, E(): 6.8, (40.25% identity in 82 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217786.1" /db_xref="GI:15610405" /db_xref="UniProtKB/TrEMBL:P96874" /db_xref="GeneID:888709" /translation="MAIQVFLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAA ALGLRGTRKAEEAAESARLKVADVMAEARERIGEESPTPAISDLHDHDH" gene 3650526..3652682 /gene="ctpC" /locus_tag="Rv3270" /db_xref="GeneID:888705" CDS 3650526..3652682 /gene="ctpC" /locus_tag="Rv3270" /EC_number="3.6.3.-" /function="METAL CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF UNDETERMINED METAL CATION WITH THE HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED METAL CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED METAL CATION(OUT)]." /note="Rv3270, (MT3370, MTCY71.10), len: 718 aa. Probable ctpC, metal cation-transport ATPase P-type (EC 3.6.3.-), integral membrane protein, equivalent to Q9CCL1|CTPC|ML0747 PUTATIVE CATION TRANSPORT ATPASE from Mycobacterium leprae (725 aa), FASTA scores: opt: 3908, E(): 0, (85.95% identity in 713 aa overlap). Also similar to O66027|MTAA METAL TRANSPORTING ATPASE MTA72 from Mycobacterium tuberculosis (680 aa), FASTA scores: opt: 3756, E(): 5.5e-213, (91.45% identity in 679 aa overlap); and to other ATPases e.g. Q9ZHC7|SILP_SALTY PUTATIVE CATION TRANSPORTING P-TYPE ATPASE from Salmonella typhimurium (824 aa), FASTA scores: opt: 1145, E(): 1.3e-59, (36.55% identity in 643 aa overlap); Q9HX93|PA3920 PROBABLE METAL TRANSPORTING P-TYPE ATPASE from Pseudomonas aeruginosa (792 aa), FASTA scores: opt: 1140, E(): 2.4e-59, (35.95% identity in 745 aa overlap); etc. Contains PS00154 E1-E2 ATPases phosphorylation site. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES), SUBFAMILY IB." /codon_start=1 /transl_table=11 /product="metal cation-transporting P-type ATPase C CtpC" /protein_id="NP_217787.1" /db_xref="GI:15610406" /db_xref="GOA:P96875" /db_xref="UniProtKB/Swiss-Prot:P96875" /db_xref="GeneID:888705" /translation="MTLEVVSDAAGRMRVKVDWVRCDSRRAVAVEEAVAKQNGVRVVH AYPRTGSVVVWYSPRRADRAAVLAAIKGAAHVAAELIPARAPHSAEIRNTDVLRMVIG GVALALLGVRRYVFARPPLLGTTGRTVATGVTIFTGYPFLRGALRSLRSGKAGTDALV SAATVASLILRENVVALTVLWLLNIGEYLQDLTLRRTRRAISELLRGNQDTAWVRLTD PSAGSDAATEIQVPIDTVQIGDEVVVHEHVAIPVDGEVVDGEAIVNQSAITGENLPVS VVVGTRVHAGSVVVRGRVVVRAHAVGNQTTIGRIISRVEEAQLDRAPIQTVGENFSRR FVPTSFIVSAIALLITGDVRRAMTMLLIACPCAVGLSTPTAISAAIGNGARRGILIKG GSHLEQAGRVDAIVFDKTGTLTVGRPVVTNIVAMHKDWEPEQVLAYAASSEIHSRHPL AEAVIRSTEERRISIPPHEECEVLVGLGMRTWADGRTLLLGSPSLLRAEKVRVSKKAS EWVDKLRRQAETPLLLAVDGTLVGLISLRDEVRPEAAQVLTKLRANGIRRIVMLTGDH PEIAQVVADELGIDEWRAEVMPEDKLAAVRELQDDGYVVGMVGDGINDAPALAAADIG IAMGLAGTDVAVETADVALANDDLHRLLDVGDLGERAVDVIRQNYGMSIAVNAAGLLI GAGGALSPVLAAILHNASSVAVVANSSRLIRYRLDR" misc_feature 3651747..3651767 /gene="ctpC" /locus_tag="Rv3270" /note="PS00154 E1-E2 ATPases phosphorylation site" gene complement(3652679..3653347) /locus_tag="Rv3271c" /db_xref="GeneID:888716" CDS complement(3652679..3653347) /locus_tag="Rv3271c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3271c, (MTCY71.11c), len: 222 aa. Probable conserved integral membrane protein, similar to others e.g. Q9RD35|SCM1.07c from Streptomyces coelicolor (230 aa), FASTA scores: opt: 360, E(): 4.7e-16, (33.85% identity in 195 aa overlap); Q9X897|SCE2.02c from Streptomyces coelicolor (234 aa), FASTA scores: opt: 357, E(): 7.3e-16, (33.85% identity in 195 aa overlap); Q9D0E0 2610024A01RIK PROTEIN from Mus musculus (Mouse) (288 aa), FASTA scores: opt: 191, E(): 3.7e-05, (23.65% identity in 207 aa overlap)." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217788.1" /db_xref="GI:15610407" /db_xref="GOA:P96876" /db_xref="UniProtKB/TrEMBL:P96876" /db_xref="GeneID:888716" /translation="METTTEHRDESTLDSPVSVAREAEWQRNVRWARWLAWVSLAVLL TEGAVGLWQGIAVGSVALTGWALGGGSEGLASAMVLWRFTGDRTWSATAEHRAQRGVA VSFWLTAPYLVAESIRHLAGEHRAETSVIGIGLTAIALLLMPVLGWANHRVGERLGSG ATAGEGTQNYLCAAQAAAVLLGLAITAVWSNGWWIDPAIGLAIAGIAVWQGIRTWRGH GCGC" gene 3653448..3654632 /locus_tag="Rv3272" /db_xref="GeneID:888702" CDS 3653448..3654632 /locus_tag="Rv3272" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3272, (MTCY71.12), len: 394 aa. Conserved hypothetical protein, similar to various proteins e.g. Q9I672|PA0446 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (407 aa), FASTA scores: opt: 643, E(): 6.8e-32, (33.15% identity in 389 aa overlap); Q9RJU8|SCF41.21 PUTATIVE RACEMASE from Streptomyces coelicolor (403 aa), FASTA scores: opt: 541, E(): 1.1e-25, (31.95% identity in 385 aa overlap); O87838|SC8A6.04c PUTATIVE TRANSFERASE from Streptomyces coelicolor (410 aa), FASTA scores: opt: 539, E(): 1.5e-25, (29.95% identity in 395 aa overlap); Q9I563|PA0882 from Pseudomonas aeruginosa (400 aa), FASTA scores: opt: 530, E(): 5.2e-25, (28.8% identity in 396 aa overlap); BAB60328|TVG1215416 L-CARNITINE DEHYDRATASE from Thermoplasma volcanium (399 aa), FASTA scores: opt: 529, E(): 6e-25, (32.9% identity in 383 aa overlap); etc. C-terminus is similar to Q49678|U00012_27|B1308_C3_195 from Mycobacterium leprae (130 aa) (60.0% identity in 115 aa overlap). Also partially similar to MTCY359_7 from M. tuberculosis (778 aa) (29.9% identity in 388 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217789.1" /db_xref="GI:15610408" /db_xref="GOA:P96877" /db_xref="UniProtKB/TrEMBL:P96877" /db_xref="GeneID:888702" /translation="MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAP GGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDLTTEQAKQQMLRLADTADVVLEA FRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMP TPEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQL MMHLNRAASDQPKPEPAPKAKRRKGVGFATQPSDAFRTADGYIVISAYVPKHWQKLCY LIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQLLQANGLMACLAHT WKQVVDTPLFAENDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLA RP" gene 3654637..3656931 /locus_tag="Rv3273" /db_xref="GeneID:888700" CDS 3654637..3656931 /locus_tag="Rv3273" /EC_number="4.2.1.1" /function="GENERATES CO(2) AND H(2)O FROM H(2)CO(3), AND POSSIBLY INVOLVED IN TRANSPORT OF SULFATE ACROSS THE MEMBRANE." /experiment="experimental evidence, no additional details recorded" /note="Rv3273, (MTCY71.13), len: 764 aa. Probable transmembrane protein (N-terminal part is hydrophobic) with probable carbonic anhydrase activity (in C-terminal part) (EC 4.2.1.1). Possibly involved in transport of sulfate. Equivalent to Q9CBA3|ML2279 PUTATIVE TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium leprae (496 aa), FASTA scores: opt: 1637, E(): 1.8e-89, (59.15% identity in 487 aa overlap). Similar to various proteins (principally sulfate transporters) e.g. Q9X927|SCH5.25 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (830 aa), FASTA scores: opt: 1325, E(): 8e-71, (40.85% identity in 788 aa overlap); Q9I729|PA0103 PROBABLE SULFATE TRANSPORTER from Pseudomonas aeruginosa (523 aa), FASTA scores: opt: 1015, E(): 1.3e-52, (39.95% identity in 488 aa overlap); Q9KN88|VCA0077 SULFATE PERMEASE FAMILY PROTEIN from Vibrio cholerae (553 aa), FASTA scores: opt: 629, E(): 9.6e-30, (30.95% identity in 423 aa overlap); etc. C-terminal part (aa 550-764) shows similarity to carbonic anhydrase e.g. P27134|CYNT_SYNP7 CARBONIC ANHYDRASE (EC 4.2.1.1) (272 aa), FASTA scores: opt: 350, E(): 8.1e-15, (33.8% identity in 201 aa overlap). Contains PS00704 Prokaryotic-type carbonic anhydrases signature 1. SEEMS TO BELONG TO THE SULP FAMILY." /codon_start=1 /transl_table=11 /product="transmembrane carbonic anhydrase" /protein_id="NP_217790.1" /db_xref="GI:15610409" /db_xref="GOA:P96878" /db_xref="UniProtKB/TrEMBL:P96878" /db_xref="GeneID:888700" /translation="MTIPRSQHMSTAVNSCTEAPASRSQWMLANLRHDVPASLVVFLV ALPLSLGIAIASGAPIIAGVIAAVVGGIVAGAVGGSPVQVSGPAAGLTVVVAELIDEL GWPMLCLMTIAAGALQIVFGLSRMARAALAIAPVVVHAMLAGIGITIALQQIHVLLGG TSHSSAWRNIVALPDGILHHELHEVIVGGTVIAILLMWSKLPAKVRIIPGPLVAIAGA TVLALLPVLQTERIDLQGNFFDAIGLPKLAEMSPGGQPWSHEISAIALGVLTIALIAS VESLLSAVGVDKLHHGPRTDFNREMVGQGSANVVSGLLGGLPITGVIVRSSANVAAGA RTRMSTILHGVWILLFASLFTNLVELIPKAALAGLLIVIGAQLVKLAHIKLAWRTGNF VIYAITIVCVVFLNLLEGVAIGLVVAIVFLLVRVVRAPVEVKPVGGEQSKRWRVDIDG TLSFLLLPRLTTVLSKLPEGSEVTLNLNADYIDDSVSEAISDWRRAHETRGGVVAIVE TSPAKLHHAHARPPKRHFASDPIGLVPWRSARGKDRGSASVLDRIDEYHRNGAAVLHP HIAGLTDSQDPYELFLTCADSRILPNVITASGPGDLYTVRNLGNLVPTDPDDRSVDAA LDFAVNQLGVSSVVVCGHSSCAAMTALLEDDPANTTTPMMRWLENAHDSLVVFRNHHP ARRSAESAGYPEADQLSIVNVAVQVERLTRHPILATAVAAADLQVIGIFFDISTARVY EVGPNGIICPDEPADRPVDHESAQ" misc_feature 3656386..3656409 /locus_tag="Rv3273" /note="PS00704 Prokaryotic-type carbonic anhydrases signature 1" gene complement(3656920..3658089) /gene="fadE25" /locus_tag="Rv3274c" /db_xref="GeneID:888731" CDS complement(3656920..3658089) /gene="fadE25" /locus_tag="Rv3274c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION [CATALYTIC ACTIVITY: ACYL-CoA + ETF = 2,3-DEHYDROACYL-CoA + REDUCED ETF]." /experiment="experimental evidence, no additional details recorded" /note="Rv3274c, (MTCY71.14c), len: 389 aa. Probable fadE25, Acyl-CoA Dehydrogenase (EC 1.3.99.-), equivalent to P46703|ACDP_MYCLE|FADE25|ACD|ML0737|B1308_F1_34 PROBABLE ACYL-CoA DEHYDROGENASE FADE25 from Mycobacterium leprae (389 aa), FASTA scores: opt: 2394, E(): 3.8e-143, (92.05% identity in 389 aa overlap). Also similar to many e.g. Q9RIQ5|FADE FATTY ACID ACYL-CoA DEHYDROGENASE from Streptomyces lividans (385 aa), FASTA scores: opt: 1692, E(): 4.9e-99, (67.35% identity in 383 aa overlap); P45867|ACDA_BACSU|ACD from Bacillus subtilis (379 aa), FASTA scores: opt: 1212, E(): 7.2e-69, (51.85% identity in 376 aa overlap); Q9K6D1|ACDA|BH3798 from Bacillus halodurans (380 aa), FASTA scores: opt: 1209, E(): 1.1e-68, (51.7% identity in 377 aa overlap); P52042|ACDS_CLOAB|BCD from Clostridium acetobutylicum (379 aa), FASTA scores: opt: 1056, E(): 4.6e-59, (44.6% identity in 379 aa overlap); etc. Contains PS00072 Acyl-CoA dehydrogenases signature 1, PS00073 Acyl-CoA dehydrogenases signature 2. BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE25" /protein_id="NP_217791.1" /db_xref="GI:15610410" /db_xref="GOA:P63427" /db_xref="UniProtKB/Swiss-Prot:P63427" /db_xref="GeneID:888731" /translation="MVGWAGNPSFDLFKLPEEHDEMRSAIRALAEKEIAPHAAEVDEK ARFPEEALVALNSSGFNAVHIPEEYGGQGADSVATCIVIEEVARVDASASLIPAVNKL GTMGLILRGSEELKKQVLPALAAEGAMASYALSEREAGSDAASMRTRAKADGDHWILN GAKCWITNGGKSTWYTVMAVTDPDRGANGISAFMVHKDDEGFTVGPKERKLGIKGSPT TELYFENCRIPGDRIIGEPGTGFKTALATLDHTRPTIGAQAVGIAQGALDAAIAYTKD RKQFGESISTFQAVQFMLADMAMKVEAARLMVYSAAARAERGEPDLGFISAASKCFAS DVAMEVTTDAVQLFGGAGYTTDFPVERFMRDAKITQIYEGTNQIQRVVMSRALLR" misc_feature complement(3656995..3657054) /gene="fadE25" /locus_tag="Rv3274c" /note="PS00073 Acyl-CoA dehydrogenases signature 2" misc_feature complement(3657655..3657693) /gene="fadE25" /locus_tag="Rv3274c" /note="PS00072 Acyl-CoA dehydrogenases signature 1" gene complement(3658114..3658638) /gene="purE" /locus_tag="Rv3275c" /db_xref="GeneID:888721" CDS complement(3658114..3658638) /gene="purE" /locus_tag="Rv3275c" /EC_number="4.1.1.21" /function="INVOLVED IN PURINE BIOSYNTHESIS (SIXTH STEP). THIS SUBUNIT CAN ALONE TRANSFORM AIR TO CAIR, BUT IN ASSOCIATION WITH PURK, WHICH POSSESSES AN ATPASE ACTIVITY, AN ENZYME COMPLEX IS PRODUCED WHICH IS CAPABLE OF CONVERTING AIR TO CAIR EFFICIENTLY UNDER PHYSIOLOGICAL CONDITION [CATALYTIC ACTIVITY: 1-(5-PHOSPHORIBOSYL)-5-AMINO-4-IMIDAZOLE-CARBOXYLATE = 1-(5-PHOSPHORIBOSYL)-5-AMINOIMIDAZOLE + CO(2)]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes a step in the de novo purine nucleotide biosynthetic pathway" /codon_start=1 /transl_table=11 /product="phosphoribosylaminoimidazole carboxylase catalytic subunit" /protein_id="NP_217792.1" /db_xref="GI:15610411" /db_xref="GOA:P96880" /db_xref="UniProtKB/Swiss-Prot:P96880" /db_xref="GeneID:888721" /translation="MTPAGERPRVGVIMGSDSDWPVMADAAAALAEFDIPAEVRVVSA HRTPEAMFSYARGAAERGLEVIIAGAGGAAHLPGMVAAATPLPVIGVPVPLGRLDGLD SLLSIVQMPAGVPVATVSIGGAGNAGLLAVRMLGAANPQLRARIVAFQDRLADVVAAK DAELQRLAGKLTRD" gene complement(3658635..3659924) /gene="purK" /locus_tag="Rv3276c" /db_xref="GeneID:888720" CDS complement(3658635..3659924) /gene="purK" /locus_tag="Rv3276c" /EC_number="4.1.1.21" /function="INVOLVED IN PURINE BIOSYNTHESIS (SIXTH STEP). POSSESSES AN ATPASE ACTIVITY THAT IS DEPENDENT ON THE PRESENCE OF AIR (AMINOIMIDAZOLE RIBONUCLEOTIDE). THE ASSOCIATION OF PURK AND PURE PRODUCES AN ENZYME COMPLEX CAPABLE OF CONVERTING AIR TO CAIR EFFICIENTLY UNDER PHYSIOLOGICAL CONDITION [CATALYTIC ACTIVITY: 1-(5-PHOSPHORIBOSYL)-5-AMINO-4-IMIDAZOLE-CARBOXYLATE = 1-(5-PHOSPHORIBOSYL)-5-AMINOIMIDAZOLE + CO(2)]." /note="With PurE catalyzes the conversion of aminoimidazole ribonucleotide to carboxyaminoimidazole ribonucleotide in the de novo purine nucleotide biosynthetic pathway" /codon_start=1 /transl_table=11 /product="phosphoribosylaminoimidazole carboxylase ATPase subunit" /protein_id="NP_217793.1" /db_xref="GI:15610412" /db_xref="GOA:P65898" /db_xref="UniProtKB/Swiss-Prot:P65898" /db_xref="GeneID:888720" /translation="MMAVASSRTPAVTSFIAPLVAMVGGGQLARMTHQAAIALGQNLR VLVTSADDPAAQVTPNVVIGSHTDLAALRRVAAGADVLTFDHEHVPNELLEKLVADGV NVAPSPQALVHAQDKLVMRQRLAAAGVAVPRYAGIKDPDEIDVFAARVDAPIVVKAVR GGYDGRGVRMARDVADARDFARECLADGVAVLVEERVDLRRELSALVARSPFGQGAAW PVVQTVQRDGTCVLVIAPAPALPDDLATAAQRLALQLADELGVVGVLAVELFETTDGA LLVNELAMRPHNSGHWTIDGARTSQFEQHLRAVLDYPLGDSDAVVPVTVMANVLGAAQ PPAMSVDERLHHLFARMPDARVHLYGKAERPGRKVGHINFLGSDVAQLCERAELAAHW LSHGRWTDGWDPHRASDDAVGVPPACGGRSDEEERRL" repeat_region complement(3658658..3658715) /note="58 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene 3659878..3660696 /locus_tag="Rv3277" /db_xref="GeneID:888729" CDS 3659878..3660696 /locus_tag="Rv3277" /function="UNKNOWN" /note="Rv3277, (MTCY71.17), len: 272 aa. Probable conserved transmembrane protein, equivalent, but longer 49 aa, to Q49673|B1308_C1_121|ML0734 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (228 aa), FASTA scores: opt: 1266, E(): 6.1e-78, (84.2% identity in 228 aa overlap). Also similar to various proteins (principally unknowns) e.g. Q9KZ84|SCE25.02 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (190 aa), FASTA scores: opt: 197, E(): 3.6e-06, (32.0% identity in 150 aa overlap); BAB50058|MLL3086 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (136 aa), FASTA scores: opt: 176, E(): 6.9e-05, (34.7% identity in 147 aa overlap); O29640|AF0615 HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (129 aa), FASTA scores: opt: 120, E(): 0.38, (23.35% identity in 120 aa overlap); Q9KJU8|GTCA TEICHOIC ACID GLYCOSYLATION PROTEIN from Listeria innocua (145 aa), FASTA scores: opt: 117, E(): 0.67, (23.85% identity in 151 aa overlap); etc. Equivalent to AAK47718 from Mycobacterium tuberculosis strain CDC1551 (256 aa) but longer 16 aa. Contains PS00044 Bacterial regulatory proteins, lysR family signature." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217794.1" /db_xref="GI:15610413" /db_xref="GOA:P96882" /db_xref="UniProtKB/TrEMBL:P96882" /db_xref="GeneID:888729" /translation="MNEVTAGVRELATAIMVSRHLTGVLAGHGSQTVTYHFASILCSS VHSLVVSFADATIARLPGVVQPYAQRHHELIKFAIVGGTTFIIDTAIFYTLKLTVLEP KPVTAKVIAGIVAVIASYVLNREWSFRDRGGRERHHEALLFFAFSGVGVLLSMAPLWF SSYILQLRVPTVSLTMENIADFISAYIIGNLLQMAFRFWAFRRWVFPDEFARNPDKAL ESALTAGGIAEVFEDVLEGGFEDGNVTLLRAWRNRANRFAQLGDSSEPRVSKTS" misc_feature 3660313..3660390 /locus_tag="Rv3277" /note="PS00044 Bacterial regulatory proteins, lysR family signature" gene complement(3660651..3661169) /locus_tag="Rv3278c" /db_xref="GeneID:888724" CDS complement(3660651..3661169) /locus_tag="Rv3278c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3278c, (MTCY71.18c), len: 172 aa. Probable conserved transmembrane protein, equivalent to Q9CCL2|ML0733 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (172 aa), FASTA scores: opt: 1024, E(): 6e-61, (83.15% identity in 172 aa overlap); and Q49672|B1308_F2_67 HYPOTHETICAL PROTEIN from Mycobacterium leprae (181 aa), FASTA scores: opt: 1024, E(): 6.3e-61, (83.15% identity in 172 aa overlap) (this is certainly the same putative protein but with N-terminus longer). Also some similarity to other hypothetical proteins (generally membrane proteins) e.g. O26822|MTH726 HYPOTHETICAL PROTEIN from Methanobacterium thermoautotrophicum (204 aa), FASTA scores: opt: 147, E(): 0.0079, (24.6% identity in 187 aa overlap); Q9X8H4|SCE9.01 HYPOTHETICAL 47.7 KDA PROTEIN (FRAGMENT) from Streptomyces coelicolor (436 aa), FASTA scores: opt: 151, E(): 0.0079, (28.1% identity in 153 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217795.1" /db_xref="GI:15610414" /db_xref="GOA:P96883" /db_xref="UniProtKB/TrEMBL:P96883" /db_xref="GeneID:888724" /translation="MSYPENVLAAGEQVVLHRHPHWNRLIWPVVVLVLLTGLAAFGSG FVNSTPWQQIAKNVIHAVIWGIWLVIVGWLTLWPFLSWLTTHFVVTNRRVMFRHGVLT RSGIDIPLARINSVEFRDRIFERIFRTGTLIIESASQDPLEFYNIPRLREVHALLYHE VFDTLGSDESPS" gene complement(3661212..3662012) /gene="birA" /locus_tag="Rv3279c" /db_xref="GeneID:888726" CDS complement(3661212..3662012) /gene="birA" /locus_tag="Rv3279c" /EC_number="6.3.4.15" /function="BIRA ACTS BOTH AS A BIOTIN-OPERON REPRESSOR AND AS THE ENZYME THAT SYNTHESIZES THE COREPRESSOR, ACETYL-COA:CARBON-DIOXIDE LIGASE. THIS PROTEIN ALSO ACTIVATES BIOTIN TO FORM BIOTINYL-5'-ADENYLATE AND TRANSFERS THE BIOTIN MOIETY TO BIOTIN-ACCEPTING PROTEINS [CATALYTIC ACTIVITY: ATP + BIOTIN + APO-[ACETYL-COA:CARBON-DIOXIDE LIGASE (ADP FORMING)] = AMP + PYROPHOSPHATE + [ACETYL-COA:CARBON-DIOXIDE LIGASE (ADP FORMING)]]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of biotinyl-5'-AMP, also acts as a transcriptional repressor of the biotin operon" /codon_start=1 /transl_table=11 /product="biotin--protein ligase" /protein_id="NP_217796.1" /db_xref="GI:15610415" /db_xref="GOA:P96884" /db_xref="UniProtKB/TrEMBL:P96884" /db_xref="GeneID:888726" /translation="MTDRDRLRPPLDERSLRDQLIGAGSGWRQLDVVAQTGSTNADLL ARAASGADIDGVVLIAEHQTAGRGRHGRGWAATARAQIILSVGVRVVDVPVQAWGWLS LAAGLAVLDSVAPLIAVPPAETGLKWPNDVLARGGKLAGILAEVAQPFVVLGVGLNVT QAPEEVDPDATSLLDLGVAAPDRNRIASRLLRELEARIIQWRNANPQLAADYRARSLT IGSRVRVELPGGQDVVGIARDIDDQGRLCLDVGGRTVVVSAGDVVHLR" gene 3662062..3663708 /gene="accD5" /locus_tag="Rv3280" /db_xref="GeneID:888725" CDS 3662062..3663708 /gene="accD5" /locus_tag="Rv3280" /EC_number="6.4.1.3" /function="KEY ENZYME IN THE CATABOLIC PATHWAY OF ODD-CHAIN FATTY ACIDS, ISOLEUCINE, THREONINE, METHIONINE, AND VALINE [CATALYTIC ACTIVITY: ATP + PROPIONYL-CoA + CO(2) + H(2)O = ADP + ORTHOPHOSPHATE + METHYLMALONYL-COA.]" /experiment="experimental evidence, no additional details recorded" /note="Rv3280, (MTCY71.20, pccB), len: 548 aa. Probable accD5, propyonyl-CoA carboxylase beta chain 5 (EC 6.4.1.3), equivalent to P53002|PCCB_MYCLE|ACCD5|ML0731|B1308_C1_125 PROBABLE PROPIONYL-CoA CARBOXYLASE BETA CHAIN 5 from Mycobacterium leprae (549 aa), FASTA scores: opt: 3241, E(): 4e-192, (88.7% identity in 549 aa overlap). Also similar to many e.g. O87201|DTSR2 DTSR2 PROTEIN INVOLVED IN GLUTAMATE PRODUCTION from orynebacterium glutamicum (Brevibacterium flavum) (537 aa), FASTA scores: opt: 2604, E(): 6.9e-153, (74.1% identity in 529 aa overlap) (see Kimura et al., 1996); P53003|PCCB_SACER from Saccharopolyspora erythraea (Streptomyces erythraeus) (546 aa), FASTA scores: opt: 2466, E(): 2.2e-144, (70.2% identity in 530 aa overlap); O88155|DTSR1 DTSR1 PROTEIN from Corynebacterium glutamicum (Brevibacterium flavum) (543 aa), FASTA scores: opt: 2375, E(): 8.8e-139, (67.1% identity in 529 aa overlap; Q9X4K7|PCCB from Streptomyces coelicolor (530 aa), FASTA scores: opt: 2360, E(): 7.3e-138, (67.9% identity in 533 aa overlap); O24789|MXPCCB from Myxococcus xanthus (524 aa), FASTA scores: opt: 1868, E(): 1.5e-107, (56.85% identity in 524 aa overlap); etc. Also similar with METHYLMALONYL-CoA DECARBOXYLASES e.g. O59018|PH1287 from Pyrococcus horikoshii (522 aa), FASTA scores: opt: 1841, E(): 6.7e-106, (54.15% identity in 528 aa overlap). Also similarity with MTCY427.28 (43.8% identity in 434 aa overlap). BELONGS TO THE ACCD/PCCB FAMILY." /codon_start=1 /transl_table=11 /product="propionyl-CoA carboxylase beta chain" /protein_id="NP_217797.1" /db_xref="GI:15610416" /db_xref="GOA:P96885" /db_xref="UniProtKB/Swiss-Prot:P96885" /db_xref="GeneID:888725" /translation="MTSVTDRSAHSAERSTEHTIDIHTTAGKLAELHKRREESLHPVG EDAVEKVHAKGKLTARERIYALLDEDSFVELDALAKHRSTNFNLGEKRPLGDGVVTGY GTIDGRDVCIFSQDATVFGGSLGEVYGEKIVKVQELAIKTGRPLIGINDGAGARIQEG VVSLGLYSRIFRNNILASGVIPQISLIMGAAAGGHVYSPALTDFVIMVDQTSQMFITG PDVIKTVTGEEVTMEELGGAHTHMAKSGTAHYAASGEQDAFDYVRELLSYLPPNNSTD APRYQAAAPTGPIEENLTDEDLELDTLIPDSPNQPYDMHEVITRLLDDEFLEIQAGYA QNIVVGFGRIDGRPVGIVANQPTHFAGCLDINASEKAARFVRTCDCFNIPIVMLVDVP GFLPGTDQEYNGIIRRGAKLLYAYGEATVPKITVITRKAYGGAYCVMGSKDMGCDVNL AWPTAQIAVMGASGAVGFVYRQQLAEAAANGEDIDKLRLRLQQEYEDTLVNPYVAAER GYVDAVIPPSHTRGYIGTALRLLERKIAQLPPKKHGNVPL" gene 3663689..3664222 /locus_tag="Rv3281" /db_xref="GeneID:888732" CDS 3663689..3664222 /locus_tag="Rv3281" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3281, (MTCY71.21), len: 177 aa. Conserved hypothetical protein, equivalent (but longer 14 aa and with a gap between aa 82-102) to AAK47723|MT3380 from Mycobacterium tuberculosis strain CDC1551 (142 aa), FASTA scores: opt: 830, E(): 3.1e-40, (86.5% identity in 163 aa overlap). C-terminus highly similar to Q49671|B1308_C3_211|ML0730 from Mycobacterium leprae (84 aa), FASTA scores: opt: 393, E(): 7.6e-16, (68.95% identity in 87 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217798.1" /db_xref="GI:15610417" /db_xref="UniProtKB/TrEMBL:P96886" /db_xref="GeneID:888732" /translation="MGTCPCESSERNEPVSRVSGTNEVSDGNETNNPAEVSDGNETNN PAEVSDGNETNNPAPVSRVSGTNEVSDGNETNNPAPVSRVSGTNEVSDGNETNNPAPV TEKPLHPHEPHIEILRGQPTDQELAALIAVLGSISGSTPPAQPEPTRWGLPVDQLRYP VFSWQRITLQEMTHMRR" gene 3664219..3664887 /gene="maf" /locus_tag="Rv3282" /db_xref="GeneID:888718" CDS 3664219..3664887 /gene="maf" /locus_tag="Rv3282" /function="UNKNOWN" /note="Maf; overexpression in Bacillus subtilis inhibits septation in the dividing cell" /codon_start=1 /transl_table=11 /product="Maf-like protein" /protein_id="NP_217799.1" /db_xref="GI:15610418" /db_xref="GOA:P96887" /db_xref="UniProtKB/Swiss-Prot:P96887" /db_xref="GeneID:888718" /translation="MTRLVLGSASPGRLKVLRDAGIEPLVIASHVDEDVVIAALGPDA VPSDVVCVLAAAKAAQVATTLTGTQRIVAADCVVVACDSMLYIEGRLLGKPASIDEAR EQWRSMAGRAGQLYTGHGVIRLQDNKTVYRAAETAITTVYFGTPSASDLEAYLASGES LRVAGGFTLDGLGGWFIDGVQGNPSNVIGLSLPLLRSLVQRCGLSVAALWAGNAGGPA HKQQ" gene 3664928..3665821 /gene="sseA" /locus_tag="Rv3283" /db_xref="GeneID:888717" CDS 3664928..3665821 /gene="sseA" /locus_tag="Rv3283" /EC_number="2.8.1.1" /function="POSSIBLY A SULFOTRANSFERASE INVOLVED IN THE FORMATION OF THIOSULFATE [CATALYTIC ACTIVITY: THIOSULFATE + CYANIDE = SULFITE + THIOCYANATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3283, (MTCY71.23), len: 297 aa. Probable sseA, thiosulfate sulfurtransferase (EC 2.8.1.1), equivalent P46700|THT2_MYCLE|SSEA|ML0728|B1308_C1_127 PUTATIVE THIOSULFATE SULFURTRANSFERASE SSEA from Mycobacterium leprae (296 aa), FASTA scores: opt: 1742, E(): 5.5e-108, (83.45% identity in 296 aa overlap). Also highly similar to others e.g. Q9RXT9|DR0217 from Deinococcus radiodurans (286 aa), FASTA scores: opt: 1057, E(): 1.2e-62, (53.86% identity in 273 aa overlap); P16385|THTR_SACER|CYSA from Saccharopolyspora erythraea (Streptomyces erythraeus) (281 aa), FASTA scores: opt: 1006, E(): 2.7e-59, (51.25% identity in 277 aa overlap); P71121|THTR_CORGL from Corynebacterium glutamicum (Brevibacterium flavum) (225 aa), FASTA scores: opt: 897, E(): 3.6e-52, (59.05% identity in 215 aa overlap); etc. Also highly similar to O05793|CYSA1|CYSA|Rv3117|MT3199|MTCY164.27|CYSA2|RV0815c| MT 0837|MTV043.07c|THTR_MYCTU PUTATIVE THIOSULFATE SULFURTRANSFERASE (EC 2.8.1.1) from Mycobacterium tuberculosis (277 aa), FASTA scores: opt: 955, E(): 6.3e-56, (50.2% identity in 271 aa overlap); and Q50036|THTR_MYCLE|CYSA|CYSA3|ML2198 PUTATIVE THIOSULFATE SULFURTRANSFERASE from Mycobacterium leprae (277 aa), FASTA scores: opt: 931, E(): 2.5e-54, (48.9% identity in 276 aa overlap). Shows some similarity to MTCY339.19c (30.3% identity in 254 aa overlap). Contains PS00683 Rhodanese C-terminal signature. BELONGS TO THE RHODANESE FAMILY. Thought to be differentially expressed within host cells (see Triccas et al., 1999)." /codon_start=1 /transl_table=11 /product="thiosulfate sulfurtransferase SseA" /protein_id="NP_217800.1" /db_xref="GI:15610419" /db_xref="GOA:P96888" /db_xref="UniProtKB/Swiss-Prot:P96888" /db_xref="GeneID:888717" /translation="MPLPADPSPTLSAYAHPERLVTADWLSAHMGAPGLAIVESDEDV LLYDVGHIPGAVKIDWHTDLNDPRVRDYINGEQFAELMDRKGIARDDTVVIYGDKSNW WAAYALWVFTLFGHADVRLLNGGRDLWLAERRETTLDVPTKTCTGYPVVQRNDAPIRA FRDDVLAILGAQPLIDVRSPEEYTGKRTHMPDYPEEGALRAGHIPTAVHIPWGKAADE SGRFRSREELERLYDFINPDDQTVVYCRIGERSSHTWFVLTHLLGKADVRNYDGSWTE WGNAVRVPIVAGEEPGVVPVV" misc_feature 3665735..3665758 /gene="sseA" /locus_tag="Rv3283" /note="PS00683 Rhodanese C-terminal signature" gene 3665818..3666249 /locus_tag="Rv3284" /db_xref="GeneID:887966" CDS 3665818..3666249 /locus_tag="Rv3284" /function="UNKNOWN" /note="Rv3284, (MTCY71.24, unknown), len: 143 aa. Conserved hypothetical protein, with similarity to other bacterial hypothetical proteins e.g. Q9RXU0|DR0216 from Deinococcus radiodurans (147 aa), FASTA scores: opt: 425, E(): 9.1e-21, (46.55% identity in 146 aa overlap); BAB37094|ECS3671 from Escherichia coli strain O157:H7 (147 aa), FASTA scores: opt: 187, E(): 2.2e-05, (29.5% identity in 139 aa overlap); AAG57925|YGDK from Escherichia coli strain O157:H7 EDL933 (147 aa), FASTA scores: opt: 187, E(): 2.2e-05, (32.05% identity in 139 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217801.1" /db_xref="GI:15610420" /db_xref="UniProtKB/Swiss-Prot:P67123" /db_xref="GeneID:887966" /translation="MTAPASLPAPLAEVVSDFAEVQGQDKLRLLLEFANELPALPSHL AESAMEPVPECQSPLFLHVDASDPNRVRLHFSAPAEAPTTRGFASILAAGLDEQPAAD ILAVPEDFYTELGLAALISPLRLRGMSAMLARIKRRLREAD" gene 3666357..3668159 /gene="accA3" /locus_tag="Rv3285" /db_xref="GeneID:887912" CDS 3666357..3668159 /gene="accA3" /locus_tag="Rv3285" /EC_number="6.3.4.14" /function="INVOLVED IN LONG-CHAIN FATTY ACID SYNTHESIS (AT THE FIRST STEP). CARRIES TWO FUNCTIONS: BIOTIN CARBOXYL CARRIER PROTEIN AND BIOTIN CARBOXYLTRANSFERASE [CATALYTIC ACTIVITY: ATP + BIOTIN-CARBOXYL-CARRIER PROTEIN + CO(2) = ADP + ORTHOPHOSPHATE + CARBOXYBIOTIN-CARBOXYL-CARRIER PROTEIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv3285, (MTCY71.25), len: 600 aa. Probable accA3, bifunctional protein acetyl-/propionyl-coenzyme A carboxylase, alpha chain (EC 6.3.4.14) (see citations below) equivalent to P46392|BCCA_MYCLE|BCCA|ML0726|B1308_C1_129 ACETYL-/PROPIONYL-COENZYME A CARBOXYLASE ALPHA CHAIN from Mycobacterium leprae (598 aa), FASTA scores: opt: 3510, E(): 1.1e-196, (89.3% identity in 601 aa overlap). Also highly similar to other proteins e.g. P71122|ACCBC ACYL COENZYME A CARBOXYLASE from Corynebacterium glutamicum (Brevibacterium flavum) (591 aa), FASTA scores: opt: 2776, E(): 5.6e-154, (71.95% identity in 592 aa overlap); Q54119|BCPA2 BIOTIN CARBOXYLASE AND BIOTIN CARBOXYL CARRIER PROTEIN from Saccharopolyspora erythraea (Streptomyces erythraeus) (591 aa), FASTA scores: opt: 2723, E(): 6.7e-151, (70.5% identity in 590 aa overlap); Q54105|BCPA BIOTIN CARBOXYLASE AND BIOTIN CARBOXYL CARRIER PROTEIN from Saccharopolyspora erythraea (Streptomyces erythraeus) (597 aa), FASTA scores: opt: 2721, E(): 8.9e-151, (70.05% identity in 594 aa overlap); Q9EWV4|2SCK31.20 PUTATIVE ACYL-CoA CARBOXYLASE COMPLEX A SUBUNIT from Streptomyces coelicolor (590 aa), FASTA scores: opt: 2626, E(): 2.9e-145, (68.25% identity in 595 aa overlap); etc. Contains PS00867 Carbamoyl-phosphate synthase subdomain signature 2, PS00188 Biotin-requiring enzymes attachment site. SIMILAR TO OTHER BIOTIN-DEPENDENT ENZYMES AND CARBAMOYL-PHOSPHATE SYNTHETASES." /codon_start=1 /transl_table=11 /product="bifunctional acetyl-/propionyl-coenzyme A carboxylase subunit alpha" /protein_id="NP_217802.1" /db_xref="GI:15610421" /db_xref="GOA:P96890" /db_xref="UniProtKB/TrEMBL:P96890" /db_xref="GeneID:887912" /translation="MASHAGSRIARISKVLVANRGEIAVRVIRAARDAGLPSVAVYAE PDAESPHVRLADEAFALGGQTSAESYLDFAKILDAAAKSGANAIHPGYGFLAENADFA QAVIDAGLIWIGPSPQSIRDLGDKVTARHIAARAQAPLVPGTPDPVKGADEVVAFAEE YGLPIAIKAAHGGGGKGMKVARTIDEIPELYESAVREATAAFGRGECYVERYLDKPRH VEAQVIADQHGNVVVAGTRDCSLQRRYQKLVEEAPAPFLTDFQRKEIHDSAKRICKEA HYHGAGTVEYLVGQDGLISFLEVNTRLQVEHPVTEETAGIDLVLQQFRIANGEKLDIT EDPTPRGHAIEFRINGEDAGRNFLPAPGPVTKFHPPSGPGVRVDSGVETGSVIGGQFD SMLAKLIVHGADRAEALARARRALNEFGVEGLATVIPFHRAVVSDPAFIGDANGFSVH TRWIETEWNNTIEPFTDGEPLDEDARPRQKVVVEIDGRRVEVSLPADLALSNGGGCDP VGVIRRKPKPRKRGAHTGAAASGDAVTAPMQGTVVKFAVEEGQEVVAGDLVVVLEAMK MENPVTAHKDGTITGLAVEAGAAITQGTVLAEIK" misc_feature 3667242..3667265 /gene="accA3" /locus_tag="Rv3285" /note="PS00867 Carbamoyl-phosphate synthase subdomain signature 2" misc_feature 3668022..3668075 /gene="accA3" /locus_tag="Rv3285" /note="PS00188 Biotin-requiring enzymes attachment site" gene complement(3668169..3668954) /gene="sigF" /locus_tag="Rv3286c" /db_xref="GeneID:888727" CDS complement(3668169..3668954) /gene="sigF" /locus_tag="Rv3286c" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED. THOUGHT TO BE INVOLVED IN SURVIVAL AND PROLIFERATION IN LUNG GRANULOMAS DURING INFECTION. THOUGHT TO BE INVOLVED IN VIRULENCE AND PERSISTENCE PROCESSES. MODULATES EXPRESSION OF THE 16 KDa ALPHA-CRYSTALLIN HOMOLOGUE/Rv2031c. NEGATIVELY REGULATED BY Rv3287c|RSBW|USFX." /experiment="experimental evidence, no additional details recorded" /note="Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released; this sigma factor is a general stress response regulator; expressed in stationary phase and under nitrogen depletion and cold shock" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigF" /protein_id="NP_217803.1" /db_xref="GI:15610422" /db_xref="GOA:Q7D5S2" /db_xref="UniProtKB/TrEMBL:Q7D5S2" /db_xref="GeneID:888727" /translation="MTARAAGGSASRANEYADVPEMFRELVGLPAGSPEFQRHRDKIV QRCLPLADHIARRFEGRGEPRDDLIQVARVGLVNAAVRFDVKTGSDFVSFAVPTIMGE VRRHFRDNSWSVKVPRRLKELHLRLGTATADLSQRLGRAPSASELAAELGMDRAEVIE GLLAGSSYHTLSIDSGGGSDDDARAITDTLGDVDAGLDQIENREVLRPLLEALPERER TVLVLRFFDSMTQTQIAERVGISQMHVSRLLAKSLARLRDQLE" gene complement(3668951..3669388) /gene="rsbW" /locus_tag="Rv3287c" /db_xref="GeneID:887977" CDS complement(3668951..3669388) /gene="rsbW" /locus_tag="Rv3287c" /function="BINDS TO SIGMA AND BLOCKS ITS ABILITY TO FORM AN RNA POLYMERASE HOLOENZYME. REGULATES NEGATIVELY SIGF|Rv3286c, AND NEGATIVELY REGULATED BY Rv1365c|RSFA AND Rv3687C|RSFB." /experiment="experimental evidence, no additional details recorded" /note="Rv3287c, (MTCY71.27c), len: 145 aa. rsbW (alternate gene name: usfX), anti-sigma factor (see citations below), similar to Q49667|B1308_F3_89 from Mycobacterium leprae (75 aa), FASTA scores: opt: 308, E(): 2.5e-15, (72.2% identity in 72 aa overlap); Q9R3X8|PRS1|USHX|PRS PRS1 PROTEIN (ANTI-SIGMA FACTOR) from Streptomyces coelicolor (137 aa), FASTA scores: opt: 184, E(): 3.7e-06, (36.8% identity in 106 aa overlap); O50231 PUTATIVE SIGMA-B REGULATOR from Bacillus licheniformis (160 aa), FASTA scores: opt: 122, E(): 0.13, (23.9% identity in 92 aa overlap); and P17904|RSBW_BACSU ANTI-SIGMA B FACTOR (SIGMA-B NEGATIVE EFFECTOR RSBW) from Bacillus subtilis (160 aa), FASTA scores: opt: 108, E(): 1.3, (21.25% identity in 127 aa overlap). Equivalent to AAK47729 from Mycobacterium tuberculosis strain CDC1551 (145 aa) but longer 99 aa. INDUCTION BY HEAT SHOCK, SALT STRESS, OXIDATIVE STRESS, GLUCOSE LIMITATION AND OXYGEN LIMITATION. N-terminus shortened since first submission (previously 242 aa).; usfX" /codon_start=1 /transl_table=11 /product="anti-sigma factor rsbW (sigma negative effector)" /protein_id="NP_217804.2" /db_xref="GI:57117083" /db_xref="UniProtKB/TrEMBL:Q7D5S1" /db_xref="GeneID:887977" /translation="MADSDLPTKGRQRGVRAVELNVAARLENLALLRTLVGAIGTFED LDFDAVADLRLAVDEVCTRLIRSALPDATLRLVVDPRKDEVVVEASAACDTHDVVAPG SFSWHVLTALADDVQTFHDGRQPDVAGSVFGITLTARRAASSR" gene complement(3669586..3669999) /gene="usfY" /locus_tag="Rv3288c" /db_xref="GeneID:888735" CDS complement(3669586..3669999) /gene="usfY" /locus_tag="Rv3288c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3288c, (MTCY71.28c), len: 137 aa. usfY, putative protein (see citation below). Has no significant homologues. May not be contranscribed with the usfX and sigF proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217805.1" /db_xref="GI:15610424" /db_xref="UniProtKB/TrEMBL:Q7D5S0" /db_xref="GeneID:888735" /translation="MGQIPPQPVRRVLPLMVVPGNGQKWRNRTETEEAMGDTYRDPVD HLRTTRPLAGESLIDVVHWPGYLLIVAGVVGGVGALAAFGTGHHAEGMTFGVVAIVVT VVGLAWLAFEHRRIRKIADRWYTEHPEVRRQRLAG" gene complement(3670034..3670411) /locus_tag="Rv3289c" /db_xref="GeneID:888736" CDS complement(3670034..3670411) /locus_tag="Rv3289c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3289c, (MTCY71.29c), len: 125 aa. Possible transmembrane protein, showing slight similarity to other membrane proteins or glycoproteins." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217806.1" /db_xref="GI:15610425" /db_xref="UniProtKB/TrEMBL:P96894" /db_xref="GeneID:888736" /translation="MHEVGGPSRGDRLGRDDSEVHSAIRFAVVAAVVGVGFLIMGALL VSTCSGVDTAACGPPQRILLALGGPLILCAAGLWAFLRTYRVWRAEGTWWGWHGAGWF LLTLMVLTLCIGVPPIAGPVMAP" gene complement(3670445..3671794) /gene="lat" /locus_tag="Rv3290c" /db_xref="GeneID:887904" CDS complement(3670445..3671794) /gene="lat" /locus_tag="Rv3290c" /EC_number="2.6.1.36" /function="POSSIBLY INVOLVED IN L-ALPHA-AMINOADIPIC ACID (L-AAA) BIOSYNTHESIS. CATALYZES THE TRANSFER OF THE TERMINAL AMINO GROUP OF L-LYSINE OR L-ORNITHINE TO ALPHA-KETOGLUTARATE [CATALYTIC ACTIVITY: L-LYSINE + 2-OXOGLUTARATE = 2-AMINOADIPATE 6-SEMIALDEHYDE + L-GLUTAMATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 2-aminoadipate 6-semiladehyde and glutamate from lysine and 2-oxoglutarate" /codon_start=1 /transl_table=11 /product="L-lysine aminotransferase" /protein_id="NP_217807.1" /db_xref="GI:15610426" /db_xref="GOA:P63509" /db_xref="UniProtKB/Swiss-Prot:P63509" /db_xref="GeneID:887904" /translation="MAAVVKSVALAGRPTTPDRVHEVLGRSMLVDGLDIVLDLTRSGG SYLVDAITGRRYLDMFTFVASSALGMNPPALVDDREFHAELMQAALNKPSNSDVYSVA MARFVETFARVLGDPALPHLFFVEGGALAVENALKAAFDWKSRHNQAHGIDPALGTQV LHLRGAFHGRSGYTLSLTNTKPTITARFPKFDWPRIDAPYMRPGLDEPAMAALEAEAL RQARAAFETRPHDIACFVAEPIQGEGGDRHFRPEFFAAMRELCDEFDALLIFDEVQTG CGLTGTAWAYQQLDVAPDIVAFGKKTQVCGVMAGRRVDEVADNVFAVPSRLNSTWGGN LTDMVRARRILEVIEAEGLFERAVQHGKYLRARLDELAADFPAVVLDPRGRGLMCAFS LPTTADRDELIRQLWQRAVIVLPAGADTVRFRPPLTVSTAEIDAAIAAVRSALPVVT" gene complement(3671845..3672297) /locus_tag="Rv3291c" /db_xref="GeneID:888728" CDS complement(3671845..3672297) /locus_tag="Rv3291c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3291c, (MTCY71.31c), len: 150 aa. Probable transcriptional regulator asnC-family, similar to other regulatory proteins e.g. Q9RKY4|SC6D7.14 from Streptomyces coelicolor (165 aa), FASTA scores: opt: 503, E(): 9.1e-26, (50.35% identity in 143 aa overlap); Q9KYP0|SCD69.13 from Streptomyces coelicolor (167 aa), FASTA scores: opt: 310, E(): 2.7e-13, (37.2% identity in 129 aa overlap); BAB50701|MLL3910 from Rhizobium loti (Mesorhizobium loti) (152 aa), FASTA scores: opt: 282, E(): 1.6e-11, (39.55% identity in 129 aa overlap); O87635|LRP_KLEAE from Klebsiella aerogenes (163 aa), FASTA scores: opt: 279, E(): 2.5e-11, (38.1% identity in 147 aa overlap); etc. Contains helix-turn-helix motif at aa 22-43 (+3.94 SD). COULD BELONG TO THE ASNC FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="AsnC family transcriptional regulator" /protein_id="NP_217808.1" /db_xref="GI:15610427" /db_xref="GOA:P96896" /db_xref="UniProtKB/TrEMBL:P96896" /db_xref="GeneID:888728" /translation="MNEALDDIDRILVRELAADGRATLSELATRAGLSVSAVQSRVRR LESRGVVQGYSARINPEAVGHLLSAFVAITPLDPSQPDDAPARLEHIEEVESCYSVAG EESYVLLVRVASARALEDLLQRIRTTANVRTRSTIILNTFYSDRQHIP" gene 3672328..3673575 /locus_tag="Rv3292" /db_xref="GeneID:888734" CDS 3672328..3673575 /locus_tag="Rv3292" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3292, (MTCY71.32), len: 415 aa. Conserved hypothetical protein, similar to P76097|YDCJ_ECOLI|B1423 HYPOTHETICAL 51.0 KDA PROTEIN from Escherichia coli strain K12 (447 aa), FASTA scores: opt: 747, E(): 5.6e-39, (38.55% identity in 449 aa overlap); BAB35451|ECS2028 HYPOTHETICAL 51.0 KDA PROTEIN from Escherichia coli strain O157:H7 (447 aa), FASTA scores: opt: 744, E(): 8.6e-39, (38.3% identity in 449 aa overlap); AAG56352|Z2297 PROTEIN from Escherichia coli O157:H7 EDL933 (212 aa), FASTA scores: opt: 454, E(): 4.6e-21, (41.75% identity in 206 aa overlap); and similar in part with Q49664|B1308_C1_136 from Mycobacterium leprae (71 aa), FASTA scores: opt: 305, E(): 3.2e-12, (70.0% identity in 70 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217809.1" /db_xref="GI:15610428" /db_xref="UniProtKB/Swiss-Prot:P65065" /db_xref="GeneID:888734" /translation="MSRSKRLQTGQLRARFAAGLSAMYAAEVPAYGTLVEVCAQVNSD YLTRHRRAERLGSLQRVTAERHGAIRVGNPAELAAVADLFAAFGMLPVGYYDLRTAES PIPVVSTAFRPIDANELAHNPFRVFTSMLAIEDRRYFDADLRTRVQTFLARRQLFDPA LLAQARAIAADGGCDADDAPAFVAAAVAAFALSREPVEKSWYDELSRVSAVAADIAGV GSTHINHLTPRVLDIDDLYRRMTERGITMIDTIQGPPRTDGPDVLLRQTSFRALAEPR MFRDEDGTVTPGILRVRFGEVEARGVALTPRGRERYEAAMAAADPAAVWATHFPSTDA EMAAQGLAYYRGGDPSAPIVYEDFLPASAAGIFRSNLDRDSQTGDGPDDAGYNVDWLA GAIGRHIHDPYALYDALAQEERR" gene 3673602..3675086 /gene="pcd" /locus_tag="Rv3293" /db_xref="GeneID:888733" CDS 3673602..3675086 /gene="pcd" /locus_tag="Rv3293" /EC_number="1.5.-.-" /function="INVOLVED IN L-ALPHA-AMINOADIPIC ACID (L-AAA) BIOSYNTHESIS (IN THE SECOND STEP; THE FIRST STEP IS PROMOTED BY LAT ENZYME." /experiment="experimental evidence, no additional details recorded" /note="Rv3293, (MTCY71.33), len: 494 aa. Probable pcd, piperideine-6-carboxylic acid dehydrogenase (EC 1.5.-.-), highly similar to others e.g. O85725|PCD SEMIALDEHYDE DEHYDROGENASE from Streptomyces clavuligerus (512 aa), FASTA scores: opt: 2214, E(): 6.7e-121, (68.75% identity in 496 aa overlap) (see Alexander & Jensen 1998); Q9I4U7|PA1027 PROBABLE ALDEHYDE DEHYDROGENASE from Pseudomonas aeruginosa (529 aa), FASTA scores: opt: 1984, E(): 1.4e-107, (64.5% identity in 493 aa overlap); BAB49892|MLL2867 ALDEHYDE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (504 aa), FASTA scores: opt: 1964, E(): 2e-106, (62.8% identity in 476 aa overlap); Q9A8Y1|CC1216 ALDEHYDE DEHYDROGENASE from Caulobacter crescentus (507 aa), FASTA scores: opt: 1909, E(): 3.1e-103, (59.95% identity in 497 aa overlap); O54199|PCD PIPERIDEINE-6-CARBOXILIC ACID DEHYDROGENASE from Streptomyces clavuligerus (496 aa), FASTA scores: opt: 1748, E(): 6.4e-94, (60.6% identity in 467 aa overlap); and Q9F1U8|PCD PIPERIDEINE-6-CARBOXYLATE DEHYDROGENASE from 'Flavobacterium' lutescens (510 aa), FASTA scores: opt: 1656, E(): 1.4e-88, (54.05% identity in 481 aa overlap) (see Fujii et al., 2000); etc. Contains PS00687 Aldehyde dehydrogenases glutamic acid active site. Note that ORF Rv3290c seems to encoded the putative lat enzyme. Note that previously known as aldB.; aldB" /codon_start=1 /transl_table=11 /product="piperideine-6-carboxilic acid dehydrogenase" /protein_id="YP_177953.1" /db_xref="GI:57117084" /db_xref="GOA:Q7D5R7" /db_xref="UniProtKB/TrEMBL:Q7D5R7" /db_xref="GeneID:888733" /translation="MLEACQAIGVTAALGEPGEHSLPASTPITGDVLFSIAPTTPEQA DHAIAAAAATFTAWRSTPAPVRGALVARLGELLTAHQQDLATLVTVEVGKITAEARGE VQEMIDVCQFSVGLSRQLYGRTIASERAGHRLLETWHPLGVVGVITAFNFPVAVWAWN TAVALVCGDTVVWKPSELTPLTALACQALLSRAAADVGAPAAVGGLLLGGAERGAQLV DDPRVALLSATGSVRMGQQVGPRVARRFGRVLLELGGNNAAIVAPSADLELAVRGIVF AAAGTAGQRCTSLRRLIVHRSVADDVVARVVGAYRQLAIGDPSAPDTLVGPLIHEAAY RDMVAALERARTDGGEVIGGDRREVGSPGAYYVAPAVVRMPSQTAIVATETFAPILYV LTYDDLDEAIALNNAVPQGLSSSIFTTDLREAEHFLDQSDCGIANVNIGTSGAEIGGA FGGEKQTGGGRESGSDAWKAYMRRATNTVNYSSELPLAQGVKFG" misc_feature 3674352..3674375 /gene="pcd" /locus_tag="Rv3293" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" gene complement(3675186..3675995) /locus_tag="Rv3294c" /db_xref="GeneID:3205051" CDS complement(3675186..3675995) /locus_tag="Rv3294c" /function="UNKNOWN" /note="Rv3294c, 269 aa. Conserved hypothetical protein, similar to several conserved hypothetical proteins from Mycobacterium tuberculosis: O07781|Rv0597c (411 aa), FASTA scores: opt: 682, E(): 3.6e-37, (44.85% identity in 243 aa overlap); O53329|Rv3179 (454 aa), FASTA scores: opt: 561, E(): 3.3e-29, (42.20% identity in 218 aa overlap); Q10849|YK08_MYCTU|Rv2008c (441 aa), FASTA scores: opt: 194, E(): 3.9e-05, (30.10% identity in 239 aa overlap). Also some similarity with proteins from other organisms. Replace previous Rv3294 on opposite strand." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177954.1" /db_xref="GI:57117085" /db_xref="UniProtKB/TrEMBL:Q8VJ37" /db_xref="GeneID:3205051" /translation="MGLPRRPCCDTTGSARYRESVRRYPRIGEDSAAYRRRLCRESAK ARNVDRVVKRDAADVSNLQRIADLPRLIRLLAARSASELNLSSLATDAEIPVRTLPPY LDLLETLYLIDRIPAWSTNLSKRVVDRPKVLLLDSGLAARLVNVSPTGAGPHANPNAA GAIIETFVIAELRRQLGWSQQAPRLFHYRDRDGAEVDLILETADGLIAAIEIKSAATL RGRDTRSISRLRDKVGARFAGGVILHTGPQAQPFGDRLAAVPIDILWSPSG" gene 3676066..3676731 /locus_tag="Rv3295" /db_xref="GeneID:887180" CDS 3676066..3676731 /locus_tag="Rv3295" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3295, (MTCY71.35), len: 221 aa. Probable transcriptional regulator tetR-family, equivalent to Q9CCL4|ML0717 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (223 aa), FASTA scores: opt: 1260, E(): 7.2e-75, (85.45% identity in 220 aa overlap). Also highly similar to other streptomyces regulators e.g. Q9RD77|SCF43.11 from Streptomyces coelicolor (205 aa), FASTA scores: opt: 442, E(): 9.8e-22, (38.6% identity in 202 aa overlap); Q9RKY8|SC6D7.09 from Streptomyces coelicolor (220 aa), FASTA scores: opt: 215, E(): 5.9e-07, (31.85% identity in 135 aa overlap); Q9L0U5|SCD35.06 from Streptomyces coelicolor (240 aa), FASTA scores: opt: 214, E(): 7.4e-07, (28.2% identity in 156 aa overlap); etc. SIMILAR TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS. Contains potential helix-turn-helix motif at aa 33-54 (+4.42 SD)." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_217812.1" /db_xref="GI:15610431" /db_xref="GOA:P96900" /db_xref="UniProtKB/TrEMBL:P96900" /db_xref="GeneID:887180" /translation="MATARRRLSPQDRRAELLALGAEVFGKRPYDEVRIDEIAERAGV SRALMYHYFPDKRAFFAAVVKDEADRLYAATNKAPAPGMTMFEEIRTGVLAYMAYHQQ NPEAAWAAYVGLGRSDPVLLGIDDEAKNRQMEHIMSRIAEVVSGIDRDNTLDPEVERD LRVIIHGWLAFTFELCRQRIMDPSTDAERLADACAHALLDAISRLPQIPAELADAMAT ARM" gene 3676775..3681316 /gene="lhr" /locus_tag="Rv3296" /db_xref="GeneID:887503" CDS 3676775..3681316 /gene="lhr" /locus_tag="Rv3296" /EC_number="3.6.1.-" /function="HAS BOTH ATPASE AND HELICASE ACTIVITIES." /note="Rv3296, (MTCY71.36), len: 1512 aa. Probable lhr, ATP-dependent helicase (EC 3.6.1.-), similar to others e.g. P30015|LHR_ECOLI|RHLF|B1653 from Escherichia coli stain K12 (1538 aa), FASTA scores: opt: 2930, E(): 1.5e-159, (47.55% identity in 1569 aa overlap); AAG56642|LHR from Escherichia coli stain O157:H7 EDL933 (1538 aa), FASTA scores: opt: 2930, E(): 1.5e-159, (47.6% identity in 1561 aa overlap); O86821|SC7C7.16c from Streptomyces coelicolor (1690 aa), FASTA scores: opt: 2919, E(): 7e-159, (53.55% identity in 1703 aa overlap); Q9HYW9|PA3272 from Pseudomonas aeruginosa (1448 aa), FASTA scores: opt: 907, E(): 6.2e-44, (35.85% identity in 1512 aa overlap); etc. SIMILAR TO DEAD/DEAH BOX HELICASE FAMILY AND TO HELICASE C-TERMINAL DOMAIN. Contains PS00017 ATP/GTP-binding site motif A and possible helix-turn-helix motif." /codon_start=1 /transl_table=11 /product="ATP-dependent helicase" /protein_id="NP_217813.1" /db_xref="GI:15610432" /db_xref="GOA:P96901" /db_xref="UniProtKB/TrEMBL:P96901" /db_xref="GeneID:887503" /translation="MRFAQPSALSRFSALTRDWFTSTFAAPTAAQASAWAAIADGDNT LVIAPTGSGKTLAAFLWALDSLAGSEPMSERPAATRVLYVSPLKALAVDVERNLRTPL AGLTRLAERQGLPAPQIRVGVRSGDTPPALRRQLVSQPPDVLITTPESLFLMLTSAAR QTLTGVQTVIIDEIHAIAATKRGAHLALSLERLDDLSSRRRAQRIGLSATVRPPEELA RFLSGQSPTTIVAPPAAKTVELSVQVPVPDMANLTDNTIWPDVEARLVDLIESHNSTI VFANSRRLAERLTARLNEIHAARCGIELAPDTNQQVAGGAPAHIMGSGQTFGAPPVLA RAHHGSISKEQRAVVEEDLKRGQLKAVVATSSLELGIDMGAVDLVIQVQAPPSVASGL QRIGRAGHQVGEISRGVLFPKHRTDLLGCAVSVQRMLAGEIETMRVPANPLDILAQHT VAAAALEPLDADAWFDTVRRAAPFATLPRSLFEATLDLLSGKYPSTEFAELRPRLVYD RDTGTLTARPGAQRLAVTSGGAIPDRGLFAVYLATERPSRVGELDEEMVYESRPGDVI SLGATSWRITEITHDRVLVIPAPGQPARLPFWRGDDAGRPAELGAALGALTGELAALD RTAFGTRCAGLGFDDYATDNLWRLLDDQRTATAVVPTDSTLLVERFRDELGDWRVILH SPYGLRVHGPLALAVGRRLRDRYGIDEKPTASDNGIVVRLPDTVSAGEDSPPGAELFV FDADEIDPIVTTEVAGSALFASRFRESAARALLLPRRHPGRRSPLWQQRQRAARLLEV ARKYPDFPIVLETVRECLQDVYDVPILVELMARIAQRRVRVAEAETAKPSPFAASLLF GYVGAFMYEGDTPLAERRAAALALDGTLLAELLGRVELRELLDPDVIAATSRQLQHLA ADRVARDAEGVADLLRLLGPLTEDEIAARAGAPEVSGWLDGLRAAKRALVVSFAGRSW WVAVEDMGRLRDGVGAAVPVGLPASFTEAVADPLGELLGRYARTHTPFTTAAAAARFG LGLRVTADVLGRLASDGRLVRGEFVAAAKGSAGGEQWCDAEVLRILRRRSLAALRAQA EPVSTAAYGRFLPAWQHVSAGNSGIDGLAAVIDQLAGVRIPASAIEPLVLAPRIRDYS PAMLDELLASGDVTWSGAGSISGSDGWIALHPADSAPMTLAEPAEIDFTDAHRAILAS LGTGGAYFFRQLTHDGLTEAELKAALWELIWAGRVTGDTFAPVRAVLGGAGTRKRAAP AHGGHRPPRLSRYRLTHAQARNADPTVAGRWSALPLPEPDSTLRAHYQAELLLNRHGV LTKDAVAAEGVAGGFATLYKVLSAFEDAGRCQRGYFIESLGGAQFAVASTVDRLRSYL DGVDPEQPDYHAVVLAAADPANPYGAALPWPASSADGTARPGRKAGALVVLVDGELAW FLERGGRSLLTFTDDPEANHAAAIGLADLVTAGRVASILVERADGMPVLQPGGRASAA LTALLAAGFVRTPRGLRRR" misc_feature 3676916..3676939 /gene="lhr" /locus_tag="Rv3296" /note="PS00017 ATP/GTP-binding site motif A" gene 3681320..3682087 /gene="nei" /locus_tag="Rv3297" /db_xref="GeneID:887937" CDS 3681320..3682087 /gene="nei" /locus_tag="Rv3297" /EC_number="3.2.-.-" /function="INVOLVED IN DAMAGE REVERSAL. DNA N-GLYCOSYLASE WITH AN AP LYASE ACTIVITY. REQUIRED FOR THE REPAIR OF OXIDATIVE DNA DAMAGE (OXIDIZED PYRIMIDINES)." /note="Rv3297, (MTCY71.37, MT3396), len: 255 aa. Probable nei, endonuclease VIII (EC 3.2.-.-) (see citation below), similar to others e.g. O86820|END8_STRCO|NEI|SC7C7.15c from Streptomyces coelicolor (276 aa), FASTA scores: opt: 770, E(): 1.2e-42, (50.35% identity in 268 aa overlap); P50465|END8_ECOLI|NEI|B0714 from Escherichia coli strain K12 (262 aa), FASTA scores: opt: 310, E(): 6.3e-13, (28.1% identity in 267 aa overlap); AAG55037|NEI from Escherichia coli strain O157:H7 EDL933 (263 aa), FASTA scores: opt: 301, E(): 2.4e-12, (27.7% identity in 267 aa overlap); etc. BELONGS TO THE FPG FAMILY." /codon_start=1 /transl_table=11 /product="endonuclease VIII" /protein_id="NP_217814.1" /db_xref="GI:15610433" /db_xref="GOA:P64156" /db_xref="UniProtKB/Swiss-Prot:P64156" /db_xref="GeneID:887937" /translation="MPEGDTVWHTAATLRRHLAGRTLTRCDIRVPRFAAVDLTGEVVD EVISRGKHLFIRTGTASIHSHLQMDGSWRVGNRPVRVDHRARIILEANQQEQAIRVVG VDLGLLEVIDRHNDGAVVAHLGPDLLADDWDPQRAAANLIVAPDRPIAEALLDQRVLA GIGNVYCNELCFVSGVLPTAPVSAVADPRRLVTRARDMLWVNRFRWNRCTTGDTRAGR RLWVYGRAGQGCRRCGTLIAYDTTDERVRYWCPACQR" gene complement(3682110..3683024) /gene="lpqC" /locus_tag="Rv3298c" /db_xref="GeneID:887984" CDS complement(3682110..3683024) /gene="lpqC" /locus_tag="Rv3298c" /EC_number="3.1.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /note="Rv3298c, (MTCY71.38c), len: 304 aa. Possible lpqC, esterase lipoprotein (EC 3.1.-.-), equivalent to Q9CCL5|LPQC|ML0715 PUTATIVE SECRETED HYDROLASE from Mycobacterium leprae (304 aa), FASTA scores: opt: 1543, E(): 1.3e-87, (71.6% identity in 303 aa overlap); and Q49658|B1308_F2_43 TUBULIN FAMILY PROTEIN from Mycobacterium leprae (302 aa), FASTA scores: opt: 1541, E(): 1.7e-87, (72.0% identity in 300 aa overlap). Also similar to Q9I5Z3|PA0543 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (322 aa), FASTA scores: opt: 439, E(): 8.9e-20, (32.3% identity in 319 aa overlap); Q9F2K9|SCH63.19c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (348 aa), FASTA scores: opt: 394, E(): 5.5e-17, (30.25% identity in 334 aa overlap); etc. And similar to O86367|LPQP|Rv0671|MTCI376.03c from Mycobacterium tuberculosis strain H37Rv (280 aa), FASTA scores: opt: 519, E(): 9.8e-25, (39.25% identity in 275 aa overlap). Probably lipoprotein, esterase membrane-bound, with 18 aa signal sequence as it contains appropriately positioned (PS00013) Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="esterase lipoprotein LpqC" /protein_id="NP_217815.1" /db_xref="GI:15610434" /db_xref="GOA:P96903" /db_xref="UniProtKB/TrEMBL:P96903" /db_xref="GeneID:887984" /translation="MPWARMLSLIVLMVCLAGCGGDQLLARHASSVATFQFGGLTRSY RLHVPPAEPSGLVISLHGGGGTGAGQEALTDFDAVADAADLLVVYPDGYDKSWADGRG ASPADRRHLDDVGFLVALAAKLVHDFDIAPGHVFATGMSNGGFMSNRLACDRADIFAA VAPVAGTLGVGVTCNPSRPVSVLEAHGTADPLVPFNGGAVRGRGGLSHSISVASLVDR WRAVDGCQGDPSAAELPDVGDGTMVHLFDSSSCAAGTEVISYQIDNGGHTWPGGRQYL PKAVIGATTRAFDGSQVIAQFFATHGRD" gene complement(3683051..3685963) /gene="atsB" /locus_tag="Rv3299c" /db_xref="GeneID:887500" CDS complement(3683051..3685963) /gene="atsB" /locus_tag="Rv3299c" /EC_number="3.1.6.1" /function="GENERATES SULFATE AND PHENOL FROM PHENOL SULFATE [CATALYTIC ACTIVITY: A PHENOL SULFATE + H(2)O = A PHENOL + SULFATE]." /note="Rv3299c, (MTCI418A.01c, MTCY71.39c), len: 970 aa. Probable atsB, arylsulfatase (EC 3.1.6.1), similar to P51691|ARS_PSEAE|ATSA|PA0183 (alias CAA88421|ATSA) from Pseudomonas aeruginosa (535 aa), FASTA scores: opt: 645, E(): 5.8e-31, (32.0% identity in 550 aa overlap); Q9L4Y2|ATSA from Klebsiella pneumoniae (577 aa), FASTA scores: opt: 504, E(): 1.7e-22, (26.3% identity in 566 aa overlap); and P20713|ATSA|ARS_KLEAE (precursor) from Klebsiella pneumoniae (464 aa), FASTA scores: opt: 502, E(): 1.8e-22, (26.85% identity in 451 aa overlap). Also similar to Mycobacterium tuberculosis proteins O06776|MTI376.13c|ATSD|Rv0663 (787 aa) (43.6% identity in 796 aa overlap) and P95059|MTCY210.30|ATSA|R0711 (787 aa) (38.4% identity in 797 aa overlap). Equivalent to AAK47741 from Mycobacterium tuberculosis strain CDC1551 (992 aa) but shorter 22 aa. Contains PS00523 Sulfatases signature 1 and PS01095 Chitinases family 18 active site signature. BELONGS TO THE SULFATASE FAMILY." /codon_start=1 /transl_table=11 /product="arylsulfatase AtsB" /protein_id="NP_217816.1" /db_xref="GI:15610435" /db_xref="GOA:O65931" /db_xref="UniProtKB/TrEMBL:O65931" /db_xref="GeneID:887500" /translation="MMSEDNALVLVAGYQDLDSARHDFQTLVDAAKDKSIPLQGAVLI GKDAEGSPVLVDTGNRLGRRGAAWGAGVGLAIGLFSPALLASAALGAATGALAGTFAH HRIKTGLADKIGQALAAGRAVVIAVTEAQGRLEAGQALASSPMKSVAELSRSTLRSLG AALREAMGKFNPDRTRLPLPQRRFGGVVGRTMAESVGDWSIVPGPFPPDDAPNVLIVL IDDAGFGGPDTFGGAIRTPTLSRLAQNGLIYNRFHVTAVCSPTRAALLTGRNHHRVGF GSVCEFPGPYPGYSAVRPRSCAALPRILRDNGYVTGAFGKWHLTPDNVQGAAGPFDNW PLGWGFDHFWGFPSGAAGQYDPIISQDNSVIGIPEGSGEDGRPYYFPDDLTDKAIEWL HTVRAQNATKPWMLYYATGATHAPHHVFKEWADKYRGEFDDGWDVYRQKTFERQKRLG IIPPDAELTERPDLFPAWDSMSEAQKRLFARQMEVFAGFSENADWNVGRLLDAIEDLG ESDNTLVFYIWGDNGASMEGTNTGSFNEMTFLNGLDLDAERQLELIEQYGGIAALGDE FTAPHFASAWAHASNTPLQWGKQMASHLGGTRDPLVVAWPARIRPDGRVRSQFTHCID IAPTVLAAIGLPEPTHVDGFEQEPMDGTSFVRTFDDAEAEDRHTVQYFENFGSRAIYK DGWWACARLDKAPWDLSPETMRRFAPGTYDPDQDVWELYYLPDDFSQAKNLAAEHPDK VAELTQLWWQEAERNRVLPLLGGLAVMFGDLPPLPTTARFSFKGDVQNIQRGMVPRIC GRSYAIEARLHIPDGGAQGVIVANADFMGGFALWVDEQRHLHHTYSFLGVETYRQVSS EPLPTGDVTVRMLFDSHQPVAASGGRVTLWADDRLIGEGELPQTVPLAFTSYAGMDIG RDNGLVVDRGYEDKAPYAFTGTVTEVIFDLKPVHPEAARALHEHASVQAVGQGAAG" misc_feature complement(3684320..3684346) /gene="atsB" /locus_tag="Rv3299c" /note="PS01095 Chitinases family 18 active site signature" misc_feature complement(3685160..3685198) /gene="atsB" /locus_tag="Rv3299c" /note="PS00523 Sulfatases signature 1" gene complement(3685983..3686900) /locus_tag="Rv3300c" /db_xref="GeneID:887958" CDS complement(3685983..3686900) /locus_tag="Rv3300c" /function="UNKNOWN" /note="Rv3300c, (MTCI418A.02c), len: 305 aa. Conserved hypothetical protein, similar to various proteins (notably pseudoridine synthase family proteins) e.g. Q9RJ76|SCI41.08 PUTATIVE RIBOSOMAL PSEUDOURIDINE SYNTHASE from Streptomyces coelicolor (324 aa), FASTA scores: opt: 876, E(): 4.5e-48, (52.1% identity in 313 aa overlap); Q9I272|PA2043 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (300 aa), FASTA scores: opt: 676, E(): 1.8e-35, (42.55% identity in 268 aa overlap); Q9JZW8|NMB0867 YABO/YCEC/SFHB FAMILY PROTEIN from Neisseria meningitidis (serogroup B) (307 aa), FASTA scores: opt: 597, E(): 1.8e-30, (42.9% identity in 282 aa overlap); Q9JUY2|NMA1085 HYPOTHETICAL PROTEIN from Neisseria meningitidis (serogroup A) (307 aa), FASTA scores: opt: 597, E(): 1.8e-30, (42.9% identity in 282 aa overlap); Q12362|RIB2_YEAST|RIB2|YOL066C DRAP DEAMINASE (PSEUDOURIDINE SYNTHASE FAMILY PROTEIN) from Saccharomyces cerevisiae (Baker's yeast) (591 aa), FASTA scores: opt: 338, E(): 6.9e-14, (32.95% identity in 246 aa overlap); Q9RTS2|DR1684 PUTATIVE PSEUDOURIDINE SYNTHASE from Deinococcus radiodurans (321 aa), FASTA scores: opt: 319, E(): 6.5e-13, (32.75% identity in 235 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical protein Q10786|Y04P_MYCTU|MTCY48.25c|Rv1540|MT1592 (308 aa) (28.8% identity in 299 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217817.1" /db_xref="GI:15610436" /db_xref="GOA:O07166" /db_xref="UniProtKB/TrEMBL:O07166" /db_xref="GeneID:887958" /translation="MALRPEDRLLSVHDVLGPVRVRLLGGSVLAELTARFGVAARAKV LAGEVVDDDGAVVDSGTVLPPGSVVHLYRDLPDEVPVPFDVPVLHQDADIVVVDKPHF LATMPRGRHVAQTALVRLRRELGLPELSPAHRLDRLTAGVLLFTTRREVRGSYQTMFA RGLVRKTYLARAPVAPGLALPRLVRSRIVKRRGHLQAVCEPGVPNAETLVERIARDGL YRLTPTTGRTHQLRVHMAALGIPIMGDPLYPNVISVAAHDFSTPLQLLAQRIEFDDPL TGSHREFASTRTLTGATLPTWSAAADCRP" gene complement(3686912..3687577) /gene="phoY1" /locus_tag="Rv3301c" /db_xref="GeneID:887212" CDS complement(3686912..3687577) /gene="phoY1" /locus_tag="Rv3301c" /function="INVOLVED IN TRANSCRIPTIONAL REGULATION OF ACTIVE TRANSPORT OF INORGANIC PHOSPHATE ACROSS THE MEMBRANE." /note="Rv3301c, (MTCI418A.03c), len: 221 aa. Probable phoY1, phosphate-transport system regulatory protein, highly similar to Q50047|phoY|PHOU1|PHOY1|ML2188 PHOSPHATE TRANSPORT SYSTEM PROTEIN PHOU HOMOLOG 1 from Mycobacterium leprae (222 aa), FASTA scores: opt: 929, E(): 7.8e-51, (61.45% identity in 218 aa overlap). Also highly similar to Q9FCE2|2SCD46.42c PUTATIVE REGULATORY PROTEIN (FRAGMENT) from Streptomyces coelicolor (123 aa), FASTA scores: opt: 324, E(): 1.8e-13, (43.65% identity in 103 aa overlap); Q9L0R3|SCD8A.01c PUTATIVE PHOSPHATE TRANSPORT SYSTEM REGULATORY PROTEIN (FRAGMENT) from Streptomyces coelicolor (139 aa), FASTA scores: opt: 309, E(): 1.7e-12, (36.7% identity in 139 aa overlap); Q52989|PHOU_RHIME PHOSPHATE TRANSPORT SYSTEM PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (237 aa), FASTA scores: opt: 292, E(): 3.1e-11, (26.3% identity in 213 aa overlap); etc. And highly similar to Mycobacterium tuberculosis O53833|PHU2_MYCTU|MTV043_13c|PHOU2|PHOY2|Rv0821c|MT0843 PHOSPHATE TRANSPORT SYSTEM PROTEIN PHOU HOMOLOG 2 (213 aa) (63.4% identity in 213 aa overlap). BELONGS TO THE PHOU FAMILY." /codon_start=1 /transl_table=11 /product="phosphate transporter PhoU" /protein_id="NP_217818.1" /db_xref="GI:15610437" /db_xref="GOA:P65718" /db_xref="UniProtKB/Swiss-Prot:P65718" /db_xref="GeneID:887212" /translation="MRTVYHQRLTELAGRLGEMCSLAGIAMKRATQALLEADIGAAEQ VIRDHERIVAMRAQVEKEAFALLALQHPVAGELREIFSAVQIIADTERMGALAVHIAK ITRREYPNQVLPEEVRNCFADMAKVAIALGDSARQVLVNRDPQEAAQLHDRDDAMDDL HRHLLSVLIDREWRHGVRVGVETALLGRFFERFADHAVEVGRRVIFMVTGVLPTEDEI STY" gene complement(3687685..3689442) /gene="glpD2" /locus_tag="Rv3302c" /db_xref="GeneID:887211" CDS complement(3687685..3689442) /gene="glpD2" /locus_tag="Rv3302c" /EC_number="1.1.5.3" /function="INVOLVED IN AEROBIC RESPIRATION AND OXYDATION OF GLYCEROL. REDUCES AN ACCEPTOR AND GENERATES GLYCERONE PHOSPHATE FROM Sn-GLYCEROL 3-PHOSPHATE. POSSIBLY PLAY A ROLE IN METABOLISM OF RIBOFLAVIN, FAD,FMN [CATALYTIC ACTIVITY: SN-GLYCEROL 3-PHOSPHATE + ACCEPTOR = GLYCERONE PHOSPHATE + REDUCED ACCEPTOR]." /note="Rv3302c, (MTCI418A.04c, MTV016.01c), len: 585 aa. Probable glpd2, glycerol-3-phosphate dehydrogenase (EC 1.1.99.5), equivalent to P53435|GLPD_MYCLE|ML0713|L308_C1_179 GLYCEROL-3-PHOSPHATE DEHYDROGENASE (EC 1.1.99.5) from Mycobacterium leprae (585 aa), FASTA scores: opt: 3489, E(): 2.2e-198, (90.75% identity in 584 aa overlap). Also highly similar to many e.g. Q9L0I3|SCD63.06 from Streptomyces coelicolor (568 aa), FASTA scores: opt: 2203, E(): 1.6e-122, (59.95% identity in 564 aa overlap); Q9RVK8|DR1019 from Deinococcus radiodurans (522 aa), FASTA scores: opt: 949, E(): 1.4e-48, (37.0% identity in 538 aa overlap); BAB53412|MLR7270 from Rhizobium loti (Mesorhizobium loti) (505 aa), FASTA scores: opt: 861, E(): 2.2e-43, (37.3% identity in 488 aa overlap); P18158|GLPD_BACSU from B. subtilis (555 aa), FASTA scores: opt: 768, E(): 7.2e-38, (32.85% identity in 484 aa overlap); etc. Also similar to Mycobacterium tuberculosis protein Q10502|GLPD_MYCTU|MTCY427_31c|Rv2249c GLYCEROL-3-PHOSPHATE DEHYDROGENASE (516 aa), FASTA scores: opt: 843, E(): 2.6e-42, (36.5% identity in 515 aa overlap). Contains PS00978 FAD-dependent glycerol-3-phosphate dehydrogenase signature 2. COFACTOR: FAD (BY SIMILARITY). BELONGS TO THE FAD-DEPENDENT GLYCEROL-3-PHOSPHATE DEHYDROGENASE FAMILY." /codon_start=1 /transl_table=11 /product="glycerol-3-phosphate dehydrogenase" /protein_id="NP_217819.1" /db_xref="GI:15610438" /db_xref="GOA:P64184" /db_xref="UniProtKB/Swiss-Prot:P64184" /db_xref="GeneID:887211" /translation="MSNPIQAPDGGQGWPAAALGPAQRAVAWKRLGTEQFDVVVIGGG VVGSGCALDAATRGLKVALVEARDLASGTSSRSSKMFHGGLRYLEQLEFGLVREALYE RELSLTTLAPHLVKPLPFLFPLTKRWWERPYIAAGIFLYDRLGGAKSVPAQRHFTRAG ALRLSPGLKRSSLIGGIRYYDTVVDDARHTMTVARTAAHYGAVVRCSTQVVALLREGD RVIGVGVRDSENGAVAEVRGHVVVNATGVWTDEIQALSKQRGRFQVRASKGVHVVVPR DRIVSDVAMILRTEKSVMFVIPWGSHWIIGTTDTDWNLDLAHPAATKADIDYILGTVN AVLATPLTHADIDGVYAGLRPLLAGESDDTSKLSREHAVAVPAAGLVAIAGGKYTTYR VMAADAIDAAVQFIPARVAPSITEKVSLLGADGYFALVNQAEHVGALQGLHPYRVRHL LDRYGSLISDVLAMAASDPSLLSPITEAPGYLKVEAAYAAAAEGALHLEDILARRMRI SIEYPHRGVDCAREVAEVVAPVLGWTAADIDREVANYMARVEAEVLSQAQPDDVSADM LRASAPEARAEILEPVPLD" misc_feature complement(3688258..3688290) /gene="glpD2" /locus_tag="Rv3302c" /note="PS00978 FAD-dependent glycerol-3-phosphate dehydrogenase signature 2" gene complement(3689457..3690938) /gene="lpdA" /locus_tag="Rv3303c" /db_xref="GeneID:887659" CDS complement(3689457..3690938) /gene="lpdA" /locus_tag="Rv3303c" /EC_number="1.8.1.4" /function="INVOLVED IN ENERGY METABOLISM. LIPOAMIDE DEHYDROGENASE IS GENERALLY A COMPONENT OF THE MULTIENZYME PYRUVATE DEHYDROGENASE AND/OR ALPHA-KETOACID DEHYDROGENASE AND/OR 2-OXOGLUTARATE DEHYDROGENASE COMPLEXES [CATALYTIC ACTIVITY: DIHYDROLIPOAMIDE + NAD(+) = LIPOAMIDE + NADH]." /note="catalyzes the reduction of nonspecific electron acceptors such as 2,6-dimethyl-1,4-benzoquinone and 5-hydroxy-1,4-naphthaquinone; does not have lipoamide dehydrogenase activity" /codon_start=1 /transl_table=11 /product="flavoprotein disulfide reductase" /protein_id="NP_217820.1" /db_xref="GI:15610439" /db_xref="GOA:O53355" /db_xref="UniProtKB/TrEMBL:O53355" /db_xref="GeneID:887659" /translation="MVTRIVILGGGPAGYEAALVAATSHPETTQVTVIDCDGIGGAAV LDDCVPSKTFIASTGLRTELRRAPHLGFHIDFDDAKISLPQIHARVKTLAAAQSADIT AQLLSMGVQVIAGRGELIDSTPGLARHRIKATAADGSTSEHEADVVLVATGASPRILP SAQPDGERILTWRQLYDLDALPDHLIVVGSGVTGAEFVDAYTELGVPVTVVASQDHVL PYEDADAALVLEESFAERGVRLFKNARAASVTRTGAGVLVTMTDGRTVEGSHALMTIG SVPNTSGLGLERVGIQLGRGNYLTVDRVSRTLATGIYAAGDCTGLLPLASVAAMQGRI AMYHALGEGVSPIRLRTVAATVFTRPEIAAVGVPQSVIDAGSVAARTIMLPLRTNARA KMSEMRHGFVKIFCRRSTGVVIGGVVVAPIASELILPIAVAVQNRITVNELAQTLAVY PSLSGSITEAARRLMAHDDLDCTAAQDAAEQLALVPHHLPTSN" gene 3691141..3691620 /locus_tag="Rv3304" /db_xref="GeneID:887605" CDS 3691141..3691620 /locus_tag="Rv3304" /function="UNKNOWN" /note="Rv3304, (MTV016.03), len: 159 aa. Hypothetical conserved protein, very similar to Q9CCL6|ML0711 HYPOTHETICAL PROTEIN from Mycobacterium leprae (159 aa), FASTA scores: opt: 1041, E(): 6.1e-62, (91.8% identity in 159 aa overlap); and Q49927|L308_F3_97 from M. leprae (174 aa), FASTA scores: opt: 974, E(): 1.8e-57, (91.2% identity in 149 aa overlap) . Also highly similar to Q9AD81|SCK13.10c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (145 aa), FASTA scores: opt: 615, E(): 7.8e-34, (60.55% identity in 147 aa overlap); and shows some similarity to other various hypotheticals proteins. ORF continues upstream with possible start at 2198 (equivalent to AAK47746 from Mycobacterium tuberculosis strain CDC1551 (212 aa) but shorter 53 aa). TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217821.1" /db_xref="GI:15610440" /db_xref="UniProtKB/TrEMBL:O53356" /db_xref="GeneID:887605" /translation="MPLYAAYGSNMHPEQMLERAPHSPMAGTGWLPGWRLTFGGEDIG WEGALATVVEDPDSKVFVVLYDMTPADEKNLDRWEGSEFGIHQKIRCRVERISSDTTT DPVLAWLYVLDAWEGGLPSARYLGVMADAAEIAGAPSDYVHDLRTRPARNIGPGTIA" gene complement(3691639..3692808) /gene="amiA1" /locus_tag="Rv3305c" /db_xref="GeneID:887545" CDS complement(3691639..3692808) /gene="amiA1" /locus_tag="Rv3305c" /EC_number="3.5.1.-" /function="UNKNOWN; CERTAINLY HYDROLYSES L-AMINO ACID." /note="Rv3305c, (MTV016.04c), len: 389 aa. Possible amiA1, N-acyl-L-amino acid amidohydrolase (or peptidase) (EC 3.5.1.-), similar to many proteins e.g. Q9AK43|2SCK8.09 PUTATIVE PEPTIDASE from Streptomyces coelicolor (410 aa), FASTA scores: opt: 1015, E(): 3.9e-54, (50.8% identity in 374 aa overlap); Q9UZ30|PAB0873 AMINO ACID AMIDOHYDROLASE from Pyrococcus abyssi (383 aa), FASTA scores: opt: 823, E(): 1.6e-42, (38.2% identity in 369 aa overlap); O58453|PH0722 LONG HYPOTHETICAL AMINO ACID AMIDOHYDROLASE from Pyrococcus horikoshii (388 aa), FASTA scores: opt: 815, E(): 4.8e-42, (38.75% identity in 369 aa overlap); O34980|YTNL_BACSU HYPOTHETICAL 45.2 KDA PROTEIN from B. subtilis (416 aa), FASTA scores: opt: 805, E(): 2.1e-41, (37.85% identity in 367 aa overlap); Q9KCF8|BH1613 N-ACYL-L-AMINO ACID AMIDOHYDROLASE from Bacillus halodurans (404 aa), FASTA scores: opt: 795, E(): 8.1e-41, (37.7% identity in 382 aa overlap); BAB50445|MLR3583 HYPOTHETICAL HIPPURATE HYDROLASE from Rhizobium loti (Mesorhizobium loti) (387 aa), FASTA scores: opt: 761, E(): 8.9e-39, (37.65% identity in 385 aa overlap); Q9RXH4|DR0339 PUTATIVE N-ACYL-L-AMINO ACID AMIDOHYDROLASE from Deinococcus radiodurans (392 aa), FASTA scores: opt: 745, E(): 8.4e-38, (36.15% identity in 379 aa overlap); etc. Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. TBparse score is 0.905. Note that previously known as amiA.; amiA" /codon_start=1 /transl_table=11 /product="N-acyl-L-amino acid amidohydrolase" /protein_id="YP_177955.1" /db_xref="GI:57117086" /db_xref="GOA:Q7D5R0" /db_xref="UniProtKB/TrEMBL:Q7D5R0" /db_xref="GeneID:887545" /translation="MSLADAAESWLAAHHDDLVGWRRHIHRYPELGRQEYATTQFVAE RLADAGLNPKVLPGGTGLTCDFGPQHQPRIALRADMDALPMAERTGAPYASTMPNVAH ACGHDAHTAILLGAALALASVPELPVGVRLIFQAAEELMPGGAIDAIAAGALAGVSRI FALHCDPRLEVGKVAVRQGPITSAADSIEITLYSPGGHTSRPHLTADLVYGLGTLVTG LPGVLSRRIDPRNSTVLVWGAVNAGMAANAIPQTGVLSGTVRTASRQTWVDLEELVRQ AISALLLPLAIEHTLQYRRGVPPVVNEEISTRILAHAIEAIGPGVLADTRQSGGGEDF SWYLEEVPGAMARLGVWSGDGLQLDLHQPTFDIDERALAIGLRVMVNIIEQAAAH" misc_feature complement(3691849..3691881) /gene="amiA1" /locus_tag="Rv3305c" /note="PS00639 Eukaryotic thiol (cysteine) proteases histidine active site" gene complement(3692805..3693989) /gene="amiB1" /locus_tag="Rv3306c" /db_xref="GeneID:887645" CDS complement(3692805..3693989) /gene="amiB1" /locus_tag="Rv3306c" /EC_number="3.5.1.-" /function="INVOLVED IN CELLULAR METABOLISM, ACTIVE ON CARBON ALIPHATIC AMIDES AND/OR ON MANY AROMATIC AMIDES [CATALYTIC ACTIVITY : A MONOCARBOXYLIC ACID AMIDE + H(2)O = A MONOCARBOXYLATE + NH(3)]." /note="Rv3306c, (MTV016.05c), len: 394 aa. Probable amiB1, aminohydrolase (EC 3.5.1.-), similar to several belonging to peptidase family M40 (and to hypothetical proteins) e.g. P54983|AMHX_BACSU AMIDOHYDROLASE AMHX from Bacillus subtilis (EC 3.5.1.-) (389 aa), FASTA scores: opt: 286, E(): 9.9e-10, (26.6% identity in 351 aa overlap); P76052|ABGB_ECOLI Aminobenzoyl-glutamate utilizatio from Escherichia coli (481 aa), FASTA scores: opt: 383, E(): 2.1e-15, (30.5% identity in 328 aa overlap); P44765|YDAJ_HAEIN HYPOTHETICAL PROTEIN HI0584 from Haemophilus influenzae (423 aa), FASTA scores: opt: 297, E(): 2.4e-10, (29.6% identity in 274 aa overlap). TBparse score is 0.897. Note that previously known as amiB.; amiB" /codon_start=1 /transl_table=11 /product="amidohydrolase AmiB1" /protein_id="YP_177956.1" /db_xref="GI:57117087" /db_xref="GOA:Q7D5Q9" /db_xref="UniProtKB/TrEMBL:Q7D5Q9" /db_xref="GeneID:887645" /translation="MPAASASDRVEELVRRRGGELVELSHAIHAEPELAFAEHRSCAK AQALVAERGFEITTAAGGLDTAFRADYGSGPLVVGVCAEYDALPGIGHACGHNIIAAS AVGTALALAEVADDLGLTVALLGTPAEESGGGKALMLQAGTFDDVAVAVMVHPGPTDI AGARSLALSEVTVRYRGKESHAAVAPHLGVNAADAVTVAQVAIGVLRQQLAPGQMVHG IVTDGGQAVNVIPGQARLQYAMRAVESDSLRELQTRMFACFAAGALAAGCEYEIDEAA PAYAELKPDPWLADVCREEMQRLGREPLLPALEAELPLGSTDMGNVTQVLPGIHPVIG LDAGAATVHQRAFTVASAGASADRAVVDGAIMLARTVVRLAQTPDERDRVLAAQQRRA AR" gene 3694054..3694860 /gene="deoD" /locus_tag="Rv3307" /db_xref="GeneID:887542" CDS 3694054..3694860 /gene="deoD" /locus_tag="Rv3307" /EC_number="2.4.2.1" /function="INVOLVED IN PURINE NUCLEOSIDE SALVAGE. CLEAVAGE OF GUANOSINE OR INOSINE TO RESPECTIVE BASES AND SUGAR-1-PHOSPHATE MOLECULES [CATALYTIC ACTIVITY: PURINE NUCLEOSIDE + ORTHOPHOSPHATE = PURINE + ALPHA-D-RIBOSE 1-PHOSPHATE]." /note="catalyzes the formation of a purine and ribose phosphate from a purine nucleoside; in E. coli this enzyme functions in xanthosine degradation" /codon_start=1 /transl_table=11 /product="purine nucleoside phosphorylase" /protein_id="NP_217824.1" /db_xref="GI:15610443" /db_xref="GOA:O53359" /db_xref="UniProtKB/Swiss-Prot:O53359" /db_xref="GeneID:887542" /translation="MADPRPDPDELARRAAQVIADRTGIGEHDVAVVLGSGWLPAVAA LGSPTTVLPQAELPGFVPPTAAGHAGELLSVPIGAHRVLVLAGRIHAYEGHDLRYVVH PVRAARAAGAQIMVLTNAAGGLRADLQVGQPVLISDHLNLTARSPLVGGEFVDLTDAY SPRLRELARQSDPQLAEGVYAGLPGPHYETPAEIRMLQTLGADLVGMSTVHETIAARA AGAEVLGVSLVTNLAAGITGEPLSHAEVLAAGAASATRMGALLADVIARF" gene 3694864..3696468 /gene="pmmB" /locus_tag="Rv3308" /db_xref="GeneID:887541" CDS 3694864..3696468 /gene="pmmB" /locus_tag="Rv3308" /EC_number="5.4.2.8" /function="CONVERTES D-MANNOSE 1-PHOSPHATE TO D-MANNOSE 6-PHOSPHATE." /note="Rv3308, (MTV016.07), len: 534 aa. Probable pmmB, phosphomannomutase (EC 5.4.2.8), equivalent to Q9CCL7|PMMB|ML0706 PUTATIVE PHOSPHO-SUGAR MUTASE from Mycobacterium leprae (538 aa), FASTA scores: opt: 2681, E(): 1.4e-150, (76.95% identity in 538 aa overlap). Also similar to others e.g. Q9AD82|SCK13.08c from Streptomyces coelicolor (549 aa), FASTA scores: opt: 1378, E(): 8.9e-74, (46.7% identity in 529 aa overlap); Q9ZHL4|PMM (FRAGMENT so no homology at N-terminus for this one) from Haemophilus ducreyi (443 aa), FASTA scores: opt: 935, E(): 9.6e-48, (39.4% identity in 449 aa overlap); P18159|YHXB_BACSU from Bacillus subtilis (565 aa), FASTA scores: opt: 776, E(): 2.7e-38, (31.7% identity in 574 aa overlap); etc. Contains PS00710 Phosphoglucomutase and phosphomannomutase phosphoserine signature. BELONGS TO THE PHOSPHOHEXOSE MUTASES FAMILY. TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="phosphomannomutase" /protein_id="NP_217825.1" /db_xref="GI:15610444" /db_xref="GOA:O53360" /db_xref="UniProtKB/TrEMBL:O53360" /db_xref="GeneID:887541" /translation="MTPENWIAHDPDPQTAAELAACGPDELKARFSRPLAFGTAGLRG HLRGGPDAMNLAVVLRATWAVARVLTDRGLAGSPVIVGRDARHGSPAFAAAAAEVLAA AGFSVLLLPDPAPTPVVAFAVRHTGAAAGIQITASHNPATDNGYKVYVDGGLQLLAPT DRQIEAAMATAPPADQIARKTVNPSENRASDLIDRYIQRAAGVRRCAGSVRVALTPLH GVGGAMAVETLRRAGFTEVHTVATQFAPNPDFPTVTLPNPEEPGATDALLTLATDVDA DVAIALDPDADRCAVGIPTVSGWRMLSGDETGWLLGDYILSQTDDRASPPETRVVAST VVSSRMLAAIAAHHAAVHVETLTGFKWLARADANLPGTLVYAYEEAIGHCVDPTAVRD KDGISAAVLVCDLVAALKGQGRSVTDALDELARCYGVHEVAALSRPVSGAVETTDLMR RLREDPPRRLAGFPATVTDIGDTLILTGGDDNMLVRVAVRPSGTEPKLKCYLEIRCAV TGDLPAARQLVRARIDELSASVRRWW" misc_feature 3695254..3695298 /gene="pmmB" /locus_tag="Rv3308" /note="PS00710 Phosphoglucomutase and phosphomannomutase phosphoserine signature" gene complement(3696470..3697093) /gene="upp" /locus_tag="Rv3309c" /db_xref="GeneID:887944" CDS complement(3696470..3697093) /gene="upp" /locus_tag="Rv3309c" /EC_number="2.4.2.9" /function="INVOLVED IN PYRIMIDINE SALVAGE PATHWAY [CATALYTIC ACTIVITY: UMP + PYROPHOSPHATE = URACIL + 5-PHOSPHO-ALPHA-D-RIBOSE 1-DIPHOSPHATE]." /note="Catalyzes the formation of uracil and 5-phospho-alpha-D-ribosy 1-diphosphate from UMP and diphosphate" /codon_start=1 /transl_table=11 /product="uracil phosphoribosyltransferase" /protein_id="NP_217826.1" /db_xref="GI:15610445" /db_xref="GOA:P94928" /db_xref="UniProtKB/Swiss-Prot:P94928" /db_xref="GeneID:887944" /translation="MQVHVVDHPLAAARLTTLRDERTDNAGFRAALRELTLLLIYEAT RDAPCEPVPIRTPLAETVGSRLTKPPLLVPVLRAGLGMVDEAHAALPEAHVGFVGVAR DEQTHQPVPYLDSLPDDLTDVPVMVLDPMVATGGSMTHTLGLLISRGAADITVLCVVA APEGIAALQKAAPNVRLFTAAIDEGLNEVAYIVPGLGDAGDRQFGPR" gene 3697198..3698097 /locus_tag="Rv3310" /db_xref="GeneID:887988" CDS 3697198..3698097 /locus_tag="Rv3310" /EC_number="3.1.3.2" /function="INVOLVED IN CELLULAR METABOLISM: ACTING ON ESTER BONDS [CATALYTIC ACTIVITY: AN ORTHOPHOSPHORIC MONOESTER + H(2)O = AN ALCOHOL + ORTHOPHOSPHATE]." /note="Rv3310, (MTV016.09), len: 299 aa. Possible acid phosphatase (EC 3.1.3.2), similar to several fungal or bacterial acid phosphatases e.g. BAB50846|MLR4110 from Rhizobium loti (Mesorhizobium loti) (292 aa), FASTA scores: opt: 460, E(): 4.8e-22, (38.65% identity in 295 aa overlap); P34724|PHOA_ASPNG from Aspergillus niger (417 aa), FASTA scores: opt: 172, E(): 0.0013, (29.1% identity in 306 aa overlap); P08540|PHOX_KLULA from Kluyveromyces lactis (Yeast) (421 aa), FASTA scores: opt: 170, E(): 0.0018, (27.8% identity in 266 aa overlap); P37274|PHOA_PENCH from Penicillium chrysogenum (412 aa), FASTA scores: opt: 163, E(): 0.0049, (29.05% identity in 303 aa overlap); etc. TBparse score is 0.914." /codon_start=1 /transl_table=11 /product="acid phosphatase" /protein_id="NP_217827.1" /db_xref="GI:15610446" /db_xref="GOA:O53361" /db_xref="UniProtKB/TrEMBL:O53361" /db_xref="GeneID:887988" /translation="MLRGIQALSRPLTRVYRALAVIGVLAASLLASWVGAVPQVGLAA SALPTFAHVVIVVEENRSQAAIIGNKSAPFINSLAANGAMMAQAFAETHPSEPNYLAL FAGNTFGLTKNTCPVNGGALPNLGSELLSAGYTFMGFAEDLPAVGSTVCSAGKYARKH VPWVNFSNVPTTLSVPFSAFPKPQNYPGLPTVSFVIPNADNDMHDGSIAQGDAWLNRH LSAYANWAKTNNSLLVVTWDEDDGSSRNQIPTVFYGAHVRPGTYNETISHYNVLSTLE QIYGLPKTGYATNAPPITDIWGD" gene 3698121..3699383 /locus_tag="Rv3311" /db_xref="GeneID:887533" CDS 3698121..3699383 /locus_tag="Rv3311" /function="UNKNOWN" /note="Rv3311, (MTV016.10), len: 420 aa. Conserved hypothetical protein, equivalent to Mycobacterium leprae hypothetical proteins Q9CCL8|ML0703 (423 aa), FASTA scores: opt: 2185, E(): 5.5e-120, (77.55% identity in 423 aa overlap); Q49918|L308_F2_61 (167 aa), FASTA scores: opt: 929, E(): 3.5e-47, (84.4% identity in 167 aa overlap) (similarity at C-terminus for this one); and Q49914|L308_F1_17 (166 aa), FASTA scores: opt: 900, E(): 1.7e-45, (79.0% identity in 162 aa overlap) (similarity at N-terminus for this one); Q49923|U0308N (86 aa) FASTA scores: opt: 149, E(): 0.052, (48.35% identity in 60 aa overlap); etc. Note that the Rv3311 corresponding protein in Mycobacterium leprae is similar to products of two adjacent ORFs. Also some similarity to Q9XI61|F9L1.1 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (523 aa), FASTA scores: opt: 134, E(): 1.8, (25.1% identity in 203 aa overlap). Equivalent to AAK47753 from Mycobacterium tuberculosis strain CDC1551 (431 aa) but shorter 12 aa. TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217828.1" /db_xref="GI:15610447" /db_xref="UniProtKB/TrEMBL:O53362" /db_xref="GeneID:887533" /translation="MVADLVPIRLSLSAGDRYTLWAPRWRDAGDEWEAFLGKDDDLYG FESVSDLVAFVRTDTENDLVDHPAWQDLTGAHAHNLNPAEDNQFDLVVVEELLAEKPT AESVAALAASLAIVSAIGSVCELAAVSKFFNGNPILGTVSGGLEHFTGKAGNKRWNSI AEVIGRSWDDVLAAIDEIISTPEVDAELSEKVAEELAEEPEGAEEVAAEVEATQDTQE AAESDDEEADAPGDSVVLGGDRDFWLQVGIDPIQIMTGTATFYTLRCYLDDRPIFLGR NGRISVFGSERALARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGL VDDFADGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSV GKPTAPYAAAVREWEKLERFVESRLRRE" gene complement(3699404..3700330) /locus_tag="Rv3312c" /db_xref="GeneID:887939" CDS complement(3699404..3700330) /locus_tag="Rv3312c" /function="UNKNOWN" /note="Rv3312c, (MTV016.11), len: 308 aa. Hypothetical protein, similar to various proteins (principally hypothetical unknowns or hydrolases) e.g. Q9M9P2|T17B22.7 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (326 aa), FASTA scores: opt: 261, E(): 2.6e-09, (27.55% identity in 323 aa overlap); Q9FWB6 PUTATIVE ALPHA/BETA HYDROLASE from Oryza sativa (Rice) (354 aa), FASTA scores: opt: 241, E(): 4.9e-08, (28.9% identity in 301 aa overlap) (note that Q9FWB6 correspond to Q9FWB5 PUTATIVE ALPHA/BETA HYDROLASE (353 aa) but longer 1 aa; and to Q9AUW9 HYPOTHETICAL PROTEIN (332 aa) but longer 22 aa); Q9M382|F24B22.200 HYPOTHETICAL PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (342 aa), FASTA scores: opt: 222, E(): 8e-07, (27.6% identity in 319 aa overlap); Q9HWM9|PA4152 PROBABLE HYDROLASE from Pseudomonas aeruginosa (370 aa), FASTA scores: opt: 176, E(): 0.00071, (29.2% identity in 209 aa overlap); Q9L3R2 HYDROLASE from Rhizobium leguminosarum (261 aa), FASTA scores: opt: 174, E(): 0.00071, (28.9% identity in 173 aa overlap); P49323|PRXC_STRLI|CPO|CPOL NON-HEME CHLOROPEROXIDASE (EC 1.11.1.10) from Streptomyces lividans (275 aa), FASTA scores: opt: 172, E(): 0.001, (30.9% identity in 194 aa overlap) (similarity only at N-terminus for this one); etc. Some similarity in N-terminal part to non-heme chloroperoxidases. Also similar to O05293|Rv1191|MTCI364.03 HYPOTHETICAL PROTEIN from M. tuberculosis (304 aa), FASTA scores: opt: 417, E(): 3.1e-19, (32.6% identity in 279 aa overlap) (note that Rv1191 is equivalent to AAK45485 from Mycobacterium tuberculosis strain CDC1551 but shorter 14 aa, and that AAK45485 is annoted Hydrolase, alpha/beta hydrolase family). TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217829.1" /db_xref="GI:15610448" /db_xref="GOA:O53363" /db_xref="UniProtKB/TrEMBL:O53363" /db_xref="GeneID:887939" /translation="MTGPPPSLPERIRTDEADVLMLPDGRALAYLEWGDSTGYPAFYF HGTPSSRLEGAFADGAARRTGFRLIAIDRPGYGRSTFQAGRNFRDWPADVCALADAFE LEEFGVVGHSGAGPHLFACGAVIPRTRLAFVGALGPWGPLATPDIMRSLNAADRCYAR LARSGPRLFGALFAPLGWCAKYTPGLFSTLLAAAVPAADKHLLSDERFGRHLRAIQLE AFRQGSRGAAYESFLQFRPWGFDLAEVAVPTHIWLGDRDSFVPRAMGEYLQRAIPHVD LHWAHGKGHFNIEDWDAILAACALDIGKRRGG" gene complement(3700705..3701016) /locus_tag="Rv3312A" /db_xref="GeneID:3205113" CDS complement(3700705..3701016) /locus_tag="Rv3312A" /function="UNKNOWN" /note="Rv3312A, len: 103 aa. Secreted protein antigen, described in Corixa patent as having N-terminal sequence YYWCPGQPFDPAWGP. Equivalent to AAK47756 from Mycobacterium tuberculosis strain CDC1551 (114 aa) but shorter 11 aa." /codon_start=1 /transl_table=11 /product="secreted protein antigen" /protein_id="YP_177957.1" /db_xref="GI:57117088" /db_xref="UniProtKB/TrEMBL:Q6MWY5" /db_xref="GeneID:3205113" /translation="MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPG QPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGG A" gene complement(3701087..3702184) /gene="add" /locus_tag="Rv3313c" /db_xref="GeneID:887994" CDS complement(3701087..3702184) /gene="add" /locus_tag="Rv3313c" /EC_number="3.5.4.4" /function="CATALYZES HYDROLYTIC DEAMINATION OF ADENOSINE AND GENERATES INOSINE [CATALYTIC ACTIVITY: ADENOSINE + H(2)O = INOSINE + NH(3) (ALSO MAY ACT ON DEOXYADENOSINE)]." /note="catalyzes the formation of inosine from adenosine" /codon_start=1 /transl_table=11 /product="adenosine deaminase" /protein_id="NP_217830.1" /db_xref="GI:15610449" /db_xref="GOA:P63907" /db_xref="UniProtKB/Swiss-Prot:P63907" /db_xref="GeneID:887994" /translation="MTAAPTLQTIRLAPKALLHDHLDGGLRPATVLDIAGQVGYDDLP ATDVDALASWFRTQSHSGSLERYLEPFSHTVAVMQTPEALYRVAFECAQDLAADSVVY AEVRFAPELHISCGLSFDDVVDTVLTGFAAGEKACAADGQPITVRCLVTAMRHAAMSR EIAELAIRFRDKGVVGFDIAGAEAGHPPTRHLDAFEYMRDHNARFTIHAGEAFGLPSI HEAIAFCGADRLGHGVRIVDDIDVDADGGFQLGRLAAILRDKRIPLELCPSSNVQTGA VASIAEHPFDLLARARFRVTVNTDNRLMSDTSMSLEMHRLVEAFGYGWSDLARFTVNA MKSAFIPFDQRLAIIDEVIKPRFAALMGHSE" gene complement(3702184..3703467) /gene="deoA" /locus_tag="Rv3314c" /db_xref="GeneID:887929" CDS complement(3702184..3703467) /gene="deoA" /locus_tag="Rv3314c" /EC_number="2.4.2.4" /function="THE ENZYMES WHICH CATALYZE THE REVERSIBLE PHOSPHORYLOSIS OF PYRIMIDINE NUCLEOSIDES ARE INVOLVED IN THE DEGRADATION OF THESE COMPOUNDS AND IN THEIR UTILIZATION AS CARBON AND ENERGY SOURCES, OR IN THE RESCUE OF PYRIMIDINE BASES FOR NUCLEOTIDE SYNTHESIS [CATALYTIC ACTIVITY: THYMIDINE + PHOSPHATE = THYMINE + 2-DEOXY-D-RIBOSE 1-PHOSPHATE]." /note="Catalyzes the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate" /codon_start=1 /transl_table=11 /product="thymidine phosphorylase" /protein_id="NP_217831.1" /db_xref="GI:15610450" /db_xref="GOA:O53366" /db_xref="UniProtKB/Swiss-Prot:O53366" /db_xref="GeneID:887929" /translation="MTDFAFDAPTVIRTKRDGGRLSDAAIDWVVKAYTDGRVADEQMS ALLMAIVWRGMDRGEIARWTAAMLASGARLDFTDLPLATVDKHSTGGVGDKITLPLVP VVAACGGAVPQASGRGLGHTGGTLDKLESITGFTANLSNQRVREQLCDVGAAIFAAGQ LAPADAKLYALRDITGTVESLPLIASSIMSKKLAEGAGALVLDVKVGSGAFMRSPVQA RELAHTMVELGAAHGVPTRALLTEMNCPLGRTVGNALEVAEALEVLAGGGPPDVVELT LRLAGEMLELAGIHGRDPAQTLRDGTAMDRFRRLVAAQGGDLSKPLPIGSHSETVTAG ASGTMGDIDAMAVGLAAWRLGAGRSRPGARVQHGAGVRIHRRPGEPVVVGEPLFTLYT NAPERFGAARAELAGGWSIRDSPPQVRPLIVDRIV" misc_feature complement(3703072..3703125) /gene="deoA" /locus_tag="Rv3314c" /note="PS00647 Thymidine and pyrimidine-nucleoside phosphorylases signature" gene complement(3703464..3703865) /gene="cdd" /locus_tag="Rv3315c" /db_xref="GeneID:887975" CDS complement(3703464..3703865) /gene="cdd" /locus_tag="Rv3315c" /EC_number="3.5.4.5" /function="THIS ENZYME SCAVENGE EXOGENOUS AND ENDOGENOUS CYTIDINE AND 2'-DEOXYCYTIDINE FOR UMP SYNTHESIS [CATALYTIC ACTIVITY: CYTIDINE + H(2)O = URIDINE + NH(3)]." /note="Reclaims exogenous and endogenous cytidine and 2'-deoxycytidine molecules for UMP synthesis" /codon_start=1 /transl_table=11 /product="cytidine deaminase" /protein_id="NP_217832.1" /db_xref="GI:15610451" /db_xref="GOA:O53367" /db_xref="UniProtKB/TrEMBL:O53367" /db_xref="GeneID:887975" /translation="MPDVDWNMLRGNATQAAAGAYVPYSRFAVGAAALVDDGRVVTGC NVENVSYGLTLCAECAVVCALHSTGGGRLLALACVDGHGSVLMPCGRCRQVLLEHGGS ELLIDHPVRPRRLGDLLPDAFGLDDLPRERR" misc_feature complement(3703578..3703700) /gene="cdd" /locus_tag="Rv3315c" /note="PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature" gene 3704102..3704440 /gene="sdhC" /locus_tag="Rv3316" /db_xref="GeneID:887969" CDS 3704102..3704440 /gene="sdhC" /locus_tag="Rv3316" /EC_number="1.3.99.1" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE. MONO-HEME CYTOCHROME OF THE SUCCINATE DEHYDROGENASE COMPLEX." /note="Rv3316, (MTV016.16), len: 112 aa. Probable sdhC, cytochrome B-556 of succinate dehydrogenase SdhC subunit (EC 1.3.99.1), transmembrane protein, equivalent (but shorter 35 aa) to Q9CCM0|SDHC|ML0699 PUTATIVE SUCCINATE DEHYDROGENASE CYTOCHROME B-556 SUBUNIT from Mycobacterium leprae (153 aa), FASTA scores: opt: 692, E(): 1.2e-39, (88.4% identity in 112 aa overlap). Also similar to others e.g. Q9KZ88|SC5G8.26c from Streptomyces coelicolor (126 aa), FASTA scores: opt: 484, E(): 8.3e-26, (65.65% identity in 99 aa overlap); Q9RVR8|DR0954 from Deinococcus radiodurans (118 aa), FASTA scores: opt: 195, E(): 1.7e-06, (36.8% identity in 87 aa overlap); Q9HQ63|DHSD_HALN1|SDHD|SDHC|VNG1310G from Halobacterium sp. strain NRC-1 (130 aa), FASTA scores: opt: 192, E(): 2.9e-06, (37.85% identity in 74 aa overlap); P72109|DHSD_NATPH|SDHD|SDHC from Natronomonas pharaonis (Natronobacterium pharaonis) (130 aa), FASTA scores: opt: 183, E(): 1.1e-05, (35.15% identity in 74 aa overlap); etc. PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN, AN IRON-SULFUR, CYTOCHROME B-556, AND AN HYDROPHOBIC ANCHOR PROTEIN. BELONGS TO THE CYTOCHROME B560 FAMILY. TBparse score is 0.893" /codon_start=1 /transl_table=11 /product="succinate dehydrogenase cytochrome B-556 subunit" /protein_id="NP_217833.1" /db_xref="GI:15610452" /db_xref="GOA:O53368" /db_xref="UniProtKB/TrEMBL:O53368" /db_xref="GeneID:887969" /translation="MWSWVCHRISGATIFFFLFVHVLDAAMLRVSPQTYNAVLATYKT PIVGLMEYGLVAAVLFHALNGIRVILIDFWSEGPRYQRLMLWIIGSVFLLLMVPAGVV VGIHMWEHFR" gene 3704437..3704871 /gene="sdhD" /locus_tag="Rv3317" /db_xref="GeneID:887845" CDS 3704437..3704871 /gene="sdhD" /locus_tag="Rv3317" /EC_number="1.3.99.1" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE. PUTATIVE HYDROPHOBIC COMPONENT OF THE SUCCINATE DEHYDROGENASE COMPLEX. COULD BE REQUIRED TO ANCHOR THE CATALYTIC COMPONENTS TO THE CYTOPLASMIC MEMBRANE" /note="Rv3317, (MTV016.17), len: 144 aa. Probable sdhD, membrane anchor of succinate dehydrogenase SdhD subunit (EC 1.3.99.1), equivalent (but shorter 19 aa) to Q49915|SDHD|ML0698|L308_F1_25 PUTATIVE SUCCINATE DEHYDROGENASE HYDROPHOBIC MEMBRANE ANCHOR PROTEIN from Mycobacterium leprae (163 aa), FASTA scores: opt: 878, E(): 1.9e-51, (85.2% identity in 142 aa overlap). Also similar to others e.g. Q9KZ89|SC5G8.25c from Streptomyces coelicolor (160 aa), FASTA scores: opt: 553, E(): 6.6e-30, (58.85% identity in 141 aa overlap); Q9RVR9|DR0953 from Deinococcus radiodurans (125 aa), FASTA scores: opt: 251, E(): 5.5e-10, (37.15% identity in 113 aa overlap); O29573|DHSD_ARCFU|SDHD|AF0684 from Archaeoglobus fulgidus (117 aa), FASTA scores: opt: 160, E(): 0.00056, (25.95% identity in 108 aa overlap); etc. PART OF AN ENZYME COMPLEX CONTAINING FOUR SUBUNITS: A FLAVOPROTEIN, AN IRON-SULFUR, CYTOCHROME B-556, AND AN HYDROPHOBIC ANCHOR PROTEIN. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="succinate dehydrogenase hydrophobic membrane anchor subunit SdhD" /protein_id="NP_217834.1" /db_xref="GI:15610453" /db_xref="GOA:O53369" /db_xref="UniProtKB/TrEMBL:O53369" /db_xref="GeneID:887845" /translation="MSAPVRQRSHDRPASLDNPRSPRRRAGMPNFEKFAWLFMRFSGV VLVFLAIGHVFIMLMWDNGVYRLDFNFVAQRWASPFWQTWDLLLLWLAQLHGGNGLRT IIDDYSRKDTTRFWLNSLLVLSMLFTLMLGTYVIVTFDPNIS" repeat_region complement(3704895..3705004) /note="110 bp Mycobacterial Interspersed Repetitive Unit, Class III" gene 3705000..3706772 /gene="sdhA" /locus_tag="Rv3318" /db_xref="GeneID:887639" CDS 3705000..3706772 /gene="sdhA" /locus_tag="Rv3318" /EC_number="1.3.5.1" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE. MEMBRANE-BOUND FAD-CONTAINING ENZYME WHICH IS RESPONSIBLE FOR SUCCINATE INTERCONVERSION [CATALYTIC ACTIVITY: SUCCINATE + ACCEPTOR = FUMARATE + REDUCED ACCEPTOR]." /note="part of four member succinate dehydrogenase enzyme complex that forms a trimeric complex (trimer of tetramers); SdhA/B are the catalytic subcomplex and can exhibit succinate dehydrogenase activity in the absence of SdhC/D which are the membrane components and form cytochrome b556; SdhC binds ubiquinone; oxidizes succinate to fumarate while reducing ubiquinone to ubiquinol" /codon_start=1 /transl_table=11 /product="succinate dehydrogenase flavoprotein subunit" /protein_id="NP_217835.1" /db_xref="GI:15610454" /db_xref="GOA:O53370" /db_xref="UniProtKB/TrEMBL:O53370" /db_xref="GeneID:887639" /translation="MICQHRYDVVIVGAGGAGMRAAVEAGPRVRTAVLTKLYPTRSHT GAAQGGMCAALANVEDDNWEWHTFDTVKGGDYLADQDAVEIMCKEAIDAVLDLEKMGM PFNRTPEGRIDQRRFGGHTRDHGKAPVRRACYAADRTGHMILQTLYQNCVKHDVEFFN EFYALDLALTQTPSGPVATGVIAYELATGDIHVFHAKAVVIATGGSGRMYKTTSNAHT LTGDGIGIVFRKGLPLEDMEFHQFHPTGLAGLGILISEAVRGEGGRLLNGEGERFMER YAPTIVDLAPRDIVARSMVLEVLEGRGAGPLKDYVYIDVRHLGEEVLEAKLPDITEFA RTYLGVDPVTELVPVYPTCHYLMGGIPTTVTGQVLRDNTSVVPGLYAAGECACVSVHG ANRLGTNSLLDINVFGRRAGIAAASYAQGHDFVDMPPNPEAMVVGWVSDILSEHGNER VADIRGALQQSMDNNAAVFRTEETLKQALTDIHALKERYSRITVHDKGKRFNTDLLEA IELGFLLELAEVTVVGALNRKESRGGHAREDYPNRDDVNYMRHTMAYKEIGADKEGPE LRSDVRLDFKPVVQTRYEPKERKY" misc_feature 3705120..3705149 /gene="sdhA" /locus_tag="Rv3318" /note="PS00504 Fumarate reductase / succinate dehydrogenase FAD-binding site" gene 3706772..3707563 /gene="sdhB" /locus_tag="Rv3319" /db_xref="GeneID:887562" CDS 3706772..3707563 /gene="sdhB" /locus_tag="Rv3319" /EC_number="1.3.99.1" /function="INVOLVED IN TRICARBOXYLIC ACID CYCLE. MEMBRANE-BOUND FAD-CONTAINING ENZYME WHICH IS RESPONSIBLE FOR SUCCINATE INTERCONVERSION [CATALYTIC ACTIVITY: SUCCINATE + ACCEPTOR = FUMARATE + REDUCED ACCEPTOR]." /note="part of four member succinate dehydrogenase enzyme complex that forms a trimeric complex (trimer of tetramers); SdhA/B are the catalytic subcomplex and can exhibit succinate dehydrogenase activity in the absence of SdhC/D which are the membrane components and form cytochrome b556; SdhC binds ubiquinone; oxidizes succinate to fumarate while reducing ubiquinone to ubiquinol; the catalytic subunits are similar to fumarate reductase" /codon_start=1 /transl_table=11 /product="succinate dehydrogenase iron-sulfur subunit" /protein_id="NP_217836.1" /db_xref="GI:15610455" /db_xref="GOA:O53371" /db_xref="UniProtKB/TrEMBL:O53371" /db_xref="GeneID:887562" /translation="MSVEPDVETLDPPLPPVPDGAVMVTVKIARFNPDDPDAFAATGG WQSFRVPCLPSDRLLNLLIYIKGYLDGTLTFRRSCAHGVCGSDAMRINGVNRLACKVL MRDLLPKKKGKSLTVTVEPIRGLPVEKDLVVDMEPFFDAYRAIKPYLITSGNPPTRER IQSPTDRARYDDTTKCILCACCTTSCPVFWHEGSYFGPAAIVNAHRFIFDSRDEAAAE RLDILNEVDGVWRCRTTFNCTESCPRGIEVTKAIQEVKRALMFTR" misc_feature 3707297..3707332 /gene="sdhB" /locus_tag="Rv3319" /note="PS00198 4Fe-4S ferredoxins, iron-sulfur binding region signature" gene complement(3707642..3708070) /locus_tag="Rv3320c" /db_xref="GeneID:887245" CDS complement(3707642..3708070) /locus_tag="Rv3320c" /function="UNKNOWN" /note="Rv3320c, (MTV016.20c), len: 142 aa. Conserved hypothetical protein, similar to several hypothetical proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. P95023|Rv2530c|MTCY159.26 (139 aa), FASTA scores: opt: 292, E(): 4.8e-14, (41.5% identity in 135 aa overlap); O53219|Rv2494|MTV008.50 (141 aa), FASTA scores: opt: 287, E(): 1.1e-13, (41.6% identity in 125 aa overlap); O07760|Rv0617|MTCY19H5.04c (133 aa), FASTA scores: opt: 252, E(): 3.3e-11, (37.8% identity in 127 aa overlap); etc. TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217837.1" /db_xref="GI:15610456" /db_xref="UniProtKB/TrEMBL:O53372" /db_xref="GeneID:887245" /translation="MRALLDVNVLLALLDRDHVDHERARAWITGQIERGWASCAITQN GFVRVISQPRYPSPISVAHAIDLLARATHTRYHEFWSCTVSILDSKVIDRSRLHSPKQ VTDAYLLALAVAHDGRFVTFDQSIALTAVPGATKQHLATL" gene complement(3708074..3708316) /locus_tag="Rv3321c" /db_xref="GeneID:887577" CDS complement(3708074..3708316) /locus_tag="Rv3321c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3321c, (MTV016.21c), len: 80 aa. Conserved hypothetical protein, similar at N-terminal region to several proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. AAK48167|MT3800 DNA-BINDING PROTEIN (COPG FAMILY) from strain CDC1551 (74 aa), FASTA scores: opt: 142, E(): 0.0016, (48.85% identity in 43 aa overlap); AAK46916|MT2606 HYPOTHETICAL 8.0 KDA PROTEIN from strain CDC1551 (74 aa), FASTA scores: opt: 139, E(): 0.0026, (37.2% identity in 78 aa overlap); O50456|Rv1241|MTV006.13 HYPOTHETICAL 9.9 KDA PROTEIN from strain H37Rv (86 aa), FASTA scores: opt: 134, E(): 0.0066, (39.0% identity in 82 aa overlap); etc. TBparse score is 0.906." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217838.1" /db_xref="GI:15610457" /db_xref="GOA:O53373" /db_xref="UniProtKB/TrEMBL:O53373" /db_xref="GeneID:887577" /translation="MRTTLSIDDDVLLAVKERARREKRTAGEILSDLARQALTNQNPQ PAASQEDAFHGFEPLPHRGGAVSNALIDRLRDEEAV" gene complement(3708438..3709052) /locus_tag="Rv3322c" /db_xref="GeneID:887520" CDS complement(3708438..3709052) /locus_tag="Rv3322c" /EC_number="2.1.1.-" /function="COULD CAUSE METHYLATION." /note="Rv3322c, (MTV016.22c), len: 204 aa. Conserved hypothetical protein, showing weak similarity to proteins including several methyltransferases (EC 2.1.1.-) e.g. Q9X9V1|ORF8 PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (208 aa), FASTA scores: opt: 193, E(): 1e-05, (36.35% identity in 132 aa overlap); and Q9XA90|SCF43A.25c PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (215 aa), FASTA scores: opt: 161, E(): 0.0014, (32.05% identity in 131 aa overlap); P74712|SLR1183 HYPOTHETICAL 21.3 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (194 aa), FASTA scores: opt: 155, E(): 0.0032, (27.35% identity in 150 aa overlap); Q9ABW8|CC0102 RRNA METHYLTRANSFERASE RSMB from Caulobacter crescentus (429 aa), FASTA scores: opt: 148, E(): 0.018, (31.5% identity in 162 aa overlap); etc. Also highly similar to O05796|Rv3120|MTCY164.30 HYPOTHETICAL 21.8 KDA PROTEIN from Mycobacterium tuberculosis (200 aa), FASTA scores: opt: 691, E(): 1.2e-38, (56.5% identity in 200 aa overlap); and shows weak similarity to O69667|Rv3699|MTV025.047 PUTATIVE METHYLTRANSFERASE from Mycobacterium tuberculosis (233 aa), FASTA scores: opt: 155, E(): 0.0037, (29.15% identity in 168 aa overlap). TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="methyltransferase" /protein_id="YP_177958.1" /db_xref="GI:57117089" /db_xref="GOA:O53374" /db_xref="UniProtKB/TrEMBL:O53374" /db_xref="GeneID:887520" /translation="MSVQTDPALREHPNRVDWNARYERAGSAHAPFAPVPWLADVLRA GVPDGPVLELASGRSGTALALAAHGRQVTAIDVSDVALLQLDSEAVRRGVADRLNLVQ ADLGCWEPGETRFALVLSRLFWDAAIFHRACEAVMPGGVLAWESLALSGAEAGTASAK RRVKPGEPACLLPADFTVVHEGQGNCDSAPSRIMIARRSPLPGA" gene complement(3709049..3709714) /gene="moaX" /locus_tag="Rv3323c" /db_xref="GeneID:887578" CDS complement(3709049..3709714) /gene="moaX" /locus_tag="Rv3323c" /function="THOUGHT TO BE INVOLVED IN MOLYBDENUM COFACTOR BIOSYNTHESIS." /note="Rv3323c, (MTV016.23c), len: 221 aa. Probable moaX, MoaD-MoaE fusion protein, similar (whole or partial) to several MoaD and MoaE proteins e.g. Q9RR88|DR2607 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D/E from Deinococcus radiodurans (229 aa), FASTA scores: opt: 407, E(): 1.8e-18, (32.75% identity in 223 aa overlap); Q9K8I7|MOAE|BH3019 MOLYBDOPTERIN CONVERTING FACTOR (SUBUNIT 2) from Bacillus halodurans (156 aa), FASTA scores: opt: 375, E(): 1.3e-16, (41.65% identity in 132 aa overlap); O31705|MOAE MOLYBDOPTERIN CONVERTING FACTOR (SUBUNIT 2) from Bacillus subtilis (157 aa), FASTA scores: opt: 368, E(): 3.6e-16, (41.65% identity in 132 aa overlap); etc. C-terminus highly similar to O05795|MOAE_MYCTU|Rv3119|MT3201|MTCY164.29|MOAE1 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN E from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 733, E(): 5.4e-39, (76.2% identity in 143 aa overlap); and N-terminus highly similar to O05789|MOAD1|Rv3112|MTCY164.22 PUTATIVE MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN D from Mycobacterium tuberculosis (83 aa), FASTA scores: opt: 333, E(): 3.2e-14, (65.05% identity in 83 aa overlap). TBparse score is 0.941." /codon_start=1 /transl_table=11 /product="MOAD-MOAE fusion protein MOAX" /protein_id="YP_177959.1" /db_xref="GI:57117090" /db_xref="GOA:Q6MWY3" /db_xref="UniProtKB/TrEMBL:Q6MWY3" /db_xref="GeneID:887578" /translation="MITVNVLYFGAVREACKVAHEKISLESGTTVDGLVDQLQIDYPP LADFRKRVRMAVNESIAPASTILDDGDTVAFIPQVAGGSDVYCRLTDEPLSVDEVLNA ISGPSQGGAVIFVGTVRNNNNGHEVTKLYYEAYPAMVHRTLMDIIEECERQADGVRVA VAHRTGELRIGDAAVVIGASAPHRAAAFDAARMCIERLKQDVPIWKKEFALDGVEWVA NRP" gene complement(3709715..3710269) /gene="moaC" /locus_tag="Rv3324c" /db_xref="GeneID:887981" CDS complement(3709715..3710269) /gene="moaC" /locus_tag="Rv3324c" /function="THOUGHT TO BE INVOLVED IN THE BIOSYNTHESIS OF MOLYBDOPTERIN." /note="MoaC; along with MoaA is involved in conversion of a guanosine derivative into molybdopterin precursor Z; involved in molybdenum cofactor biosynthesis" /codon_start=1 /transl_table=11 /product="molybdenum cofactor biosynthesis protein MoaC" /protein_id="NP_217841.2" /db_xref="GI:161352462" /db_xref="GOA:P65392" /db_xref="UniProtKB/Swiss-Prot:P65392" /db_xref="GeneID:887981" /translation="MQPAGGTVNDHDGVLTHLDEQGAARMVDVSAKAVTLRRARASGA VLMKPSTLDMICHGTAAKGDVIATARIAGIMAAKRTGELIPLCHPLGIEAVTVTLEPQ GADRLSIAATVTTVARTGVEMEALTAVTVTALTVYDMCKAVDRAMTITDIRLDEKSGG RSGHYRRHDADVKPSDGGSTEDGC" gene complement(3710248..3710379) /locus_tag="Rv3324A" /pseudo /db_xref="GeneID:3205081" misc_feature complement(3710248..3710379) /locus_tag="Rv3324A" /note="Rv3324A, 44 aa. Probable pseudogene moaB3, fragment of pterin-4-alpha-carbinolamine dehydratase (EC 4.2.1.96), equivalent to C-terminus of MT3426|Q8VJ32 PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium tuberculosis strain CDC1551 (124 aa), FASTA scores: opt: 309, E(): 1.1e-20, (100.000% identity in 44 aa overlap), and C-terminus of Mb3354c|moaB3 PROBABLE PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE from Mycobacterium bovis (124 aa). Note that a deletion of DNA (RvD5 region) in Mycobacterium tuberculosis strain H37Rv resulted in a truncated CDS comparatively to Mycobacterium bovis or Mycobacterium tuberculosis strain CDC1551 genomes (see citations below).;PROBABLE FRAGMENT OF PTERIN-4-ALPHA-CARBINOLAMINE DEHYDRATASE MOAB3 (PHS) (4-ALPHA-HYDROXY-TETRAHYDROPTERIN DEHYDRATASE) (PTERIN-4-A-CARBINOLAMINE DEHYDRATASE) (PHENYLALANINE HYDROXYLASE-STIMULATING PROTEIN) (PHS) (PTERIN CARBINOLAMINE DEHYDRATASE) (PCD)" /pseudo repeat_region 3710382..3711736 /note="IS6110-14, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-14" repeat_region 3710382..3710409 /note="28 bp inverted repeat at left end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene 3710433..3710759 /locus_tag="Rv3325" /db_xref="GeneID:887314" CDS 3710433..3710759 /locus_tag="Rv3325" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE ELEMENT IS6110." /note="Rv3325, (MTV016.25), len: 108 aa. Probable transposase for insertion element IS6110. BELONGS TO THE TRANSPOSASE FAMILY 8. TBparse score is 0.928." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217842.1" /db_xref="GI:15610461" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:887314" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 3710756..3711694 /locus_tag="Rv3326" /db_xref="GeneID:887563" CDS <3710756..3711694 /locus_tag="Rv3326" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE ELEMENT IS6110." /note="Rv3326, (MTV016.26), len: 312 aa. Probable transposase for insertion element IS6110. TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217843.1" /db_xref="GI:15610462" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:887563" /translation="LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYD HINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTK LGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVA FVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATAR WVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG" repeat_region complement(3711709..3711736) /note="28 bp inverted repeat at right end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" repeat_region 3711737..3712822 /note="IS1547-2, len: 1086 bp. Region corresponding to Insertion sequence IS1547, positions 1982 3067 in EM_NEW:MTY13470." /mobile_element="insertion sequence:IS1547-2" gene 3711749..3713461 /locus_tag="Rv3327" /db_xref="GeneID:887965" CDS 3711749..3713461 /locus_tag="Rv3327" /function="INVOLVED IN THE TRANSPOSITION IN THE INSERTION SEQUENCE ELEMENT IS1547." /note="Rv3327, (MTV016.27), len: 570 aa. Probable fusion protein. Indeed, N-terminal part corresponds to entire O07269 transposase of IS1547 (383 aa), and C-terminal part identical to MTCI249B.03c (210 aa). N-terminal part is identical to MTV042_7 (188 aa); C-terminal part (aa 378-570) is similar to hypothetical 20.5 kDa protein from Escherichia coli P76222|YNJA_ECOLI (182 aa), FASTA scores: opt: 292, E(): 5.3e-11, (32.6% identity in 181 aa overlap). TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217844.1" /db_xref="GI:15610463" /db_xref="GOA:O53377" /db_xref="UniProtKB/TrEMBL:O53377" /db_xref="GeneID:887965" /translation="MVVVGTDAHKYSHTFVATDEVGRQLGEKTVKATTAGHATAIMWA REQFGLELIWGIEDCRNMSARLERDLLAAGQQVVRVPTKLMAQTRKSARSRGKSDPID ALAVARAVLRETDLPLATHDETSRELKLLTDRRDVLVAQRTSAINRLRWLVHELDPER APAARSLDAAKHQQALRTWLDTQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQ VAPALLEIPGCAELTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMRLS RSGNRQLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLKRRLARTVFQA LRTVHQPSSEHTQPAAACHRSYCSSHLGEPPRLTDMTQKTRIQPLPPKRAGLLIRALY RIAKRRFGEVPEPFTVTAHHRRLLIANVVHEALLQRASRKLPPSVRELAVFWTARSIG CSWCVDFGAMLQRLDGLDVDRLTDIDNYATSSKFSDDERAAIAYAEAMTADPHSVTDE QVADLRARFGEAGVIELTYQIGVENMRARMNSALGITEQGFNSGDACRVPWAAPDVPS AESR" gene complement(3713394..3714332) /gene="sigJ" /locus_tag="Rv3328c" /db_xref="GeneID:887964" CDS complement(3713394..3714332) /gene="sigJ" /locus_tag="Rv3328c" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED." /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigJ" /protein_id="NP_217845.1" /db_xref="GI:15610464" /db_xref="GOA:O53378" /db_xref="UniProtKB/TrEMBL:O53378" /db_xref="GeneID:887964" /translation="MEVSEFEALRQHLMSVAYRLTGTVADAEDIVQEAWLRWDSPDTV IADPRAWLTTVVSRLGLDKLRSAAHRRETYTGTWLPEPVVTGLDATDPLAAVVAAEDA RFAAMVVLERLRPDQRVAFVLHDGFAVPFAEVAEVLGTSEAAARQLASRARKAVTAQP ALISGDPDPAHNEVVGRLMAAMAAGDLDTVVSLLHPDVTFTGDSNGKAPTAVRAVRGS DKVVRFILGLVQRYGPGLFGANQLALVNGELGAYTAGLPGVDGYRAMAPRITAITVRD GKVCALWDIANPDKFTGSPLKERRAQPTGRGRHHRN" gene 3714392..3715708 /locus_tag="Rv3329" /db_xref="GeneID:888028" CDS 3714392..3715708 /locus_tag="Rv3329" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3329, (MTV016.29), len: 438 aa (start uncertain). Probable aminotransferase (EC 2.6.1.-), similar to many e.g. O86744|SC6A9.12 from Streptomyces coelicolor (457 aa), FASTA scores: opt: 2120, E(): 5.1e-125, (70.1% identity in 438 aa overlap); Q9I6J2|PA0299 from Pseudomonas aeruginosa (456 aa), FASTA scores: opt: 983, E(): 5.7e-54, (38.1% identity in 425 aa overlap); Q53196|Y4UB_RHISN from Rhizobium sp. strain NGR234 plasmid sym pNGR234a (467 aa), FASTA scores: opt: 971, E(): 3.3e-53, (39.25% identity in 438 aa overlap); P33189|YHXA_BACSU from Bacillus subtilis (450 aa), FASTA scores: opt: 933, E(): 7.5e-51, (40.25% identity in 435 aa overlap); etc. Equivalent to AAK47775 from Mycobacterium tuberculosis strain CDC1551 (466 aa) but shorter 28 aa. COFACTOR: PYRIDOXAL PHOSPHATE. COULD BELONG TO CLASS-III OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217846.1" /db_xref="GI:15610465" /db_xref="GOA:O53379" /db_xref="UniProtKB/TrEMBL:O53379" /db_xref="GeneID:888028" /translation="MHFARHGAGIQHPVIVRGDGVTIFDDRGKSYLDALSGLFVVQVG YGRAELAEAAARQAGTLGYFPLWGYATPPAIELAERLARYAPGDLNRVFFTSGGTEAV ETAWKVAKQYFKLTGKPGKQKVISRSIAYHGTTQGALAITGLPLFKAPFEPLTPGGFR VPNTNFYRAPLHTDLKEFGRWAADRIAEAIEFEGPDTVAAVFLEPVQNAGGCIPAPPG YFERVREICDRYDVLLVSDEVICAFGRIGSMFACEDLGYVPDMITCAKGLTSGYSPLG AMIASDRLFEPFNDGETMFAHGYTFGGHPVSAAVGLANLDIFEREGLSDHVKRNSPAL RATLEKLYDLPIVGDIRGEGYFFGIELVKDQATKQTFTDDERARLLGQVSAALFEAGL YCRTDDRGDPVVQVAPPLISGQPEFDTIETILRSVLTDTGRKYLHL" gene 3715777..3716994 /gene="dacB1" /locus_tag="Rv3330" /db_xref="GeneID:887607" CDS 3715777..3716994 /gene="dacB1" /locus_tag="Rv3330" /EC_number="3.4.16.4" /function="INVOLVED IN PEPTIDOGLYCAN SYNTHESIS (AT FINAL STAGES). HYDROLYZES THE BOUND D-ALANYL-D-ALANINE [CATALYTIC ACTIVITY: D-ALANYL-D-ALANINE + H(2)O = 2 D-ALANINE]." /note="Rv3330, (MTV016.30), len: 405 aa. Probable dacB1, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein) (EC 3.4.16.4), equivalent to Mycobacterium leprae proteins Q9CCM2|ML0691 PUTATIVE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE (411 aa), FASTA scores: opt: 2066, E(): 2.5e-102, (77.15% identity in 416 aa overlap); Q49917|L308_F1_36 (228 aa), FASTA scores: opt: 1241, E(): 7.9e-59, (78.9% identity in 232 aa overlap) (note that this protein corresponds to C-terminal part of the putative protein encoded by Rv3330, aa 174-405); and Q49921|PBPC (182 aa), FASTA scores: opt: 736, E(): 3.7e-32, (73.95% identity in 169 aa overlap) (note that this protein corresponds to N-terminal part of the putative protein encoded by Rv3330, aa 1-158); note L308_F1_36 (228 aa) and PBPC (182 aa) are two consecutive Mycobacterium leprae ORFs . Also similar to others e.g. Q9FC34|SC4G1.16c PUTATIVE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE from Streptomyces coelicolor (413 aa), FASTA scores: opt: 572, E(): 3.4e-23, (33.75% identity in 382 aa overlap); P35150|DACB_BACSU PENICILLIN-BINDING PROTEIN 5* PRECURSOR (D-ALANYL-D-ALANINE CARBOXYPEPTIDASE) from Bacillus subtilis (382 aa), FASTA scores: opt: 422, E(): 2.8e-15, (31.3% identity in 249 aa overlap); Q9K8X5|DACB|BH2877 D-ALANYL-D-ALANINE CARBOXYPEPTIDASE (PENICILLIN-BINDING PROTEIN) from Bacillus halodurans (395 aa), FASTA scores: opt: 421, E(): 3.2e-15, (31.95% identity in 241 aa overlap); etc. Also similar to Mycobacterium tuberculosis Q10828|Rv2911|MTCY274.43 PROBABLE PENICILLIN-BINDING PROTEIN (BELONGS TO PEPTIDASE FAMILY S11; ALSO KNOWN AS THE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE 1 FAMILY) (291 aa), FASTA scores: opt: 746, E(): 1.6e-32, (47.0% identity in 266 aa overlap). Has hydrophobic stretches at both N- and C-termini. Certainly membrane-bound protein. BELONGS TO PEPTIDASE FAMILY S11; ALSO KNOWN AS THE D-ALANYL-D-ALANINE CARBOXYPEPTIDASE 1 FAMILY. TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="penicillin-binding protein DacB1" /protein_id="NP_217847.1" /db_xref="GI:15610466" /db_xref="GOA:O53380" /db_xref="UniProtKB/TrEMBL:O53380" /db_xref="GeneID:887607" /translation="MAFLRSVSCLAAAVFAVGTGIGLPTAAGEPNAAPAACPYKVSTP PAVDSSEVPAAGEPPLPLVVPPTPVGGNALGGCGIITAPGSAPAPGDVSAEAWLVADL DSGAVIAARDPHGRHRPASVIKVLVAMASINTLTLNKSVAGTADDAAVEGTKVGVNTG GTYTVNQLLHGLLMHSGNDAAYALARQLGGMPAALEKINLLAAKLGGRDTRVATPSGL DGPGMSTSAYDIGLFYRYAWQNPVFADIVATRTFDFPGHGDHPGYELENDNQLLYNYP GALGGKTGYTDDAGQTFVGAANRDGRRLMTVLLHGTRQPIPPWEQAAHLLDYGFNTPA GTQIGTLIEPDPSLMSTDRNPADRQRVDPQAAARISAADALPVRVGVAVIGALIVFGL IMVARAMNRRPQH" gene 3717090..3718598 /gene="sugI" /locus_tag="Rv3331" /db_xref="GeneID:887504" CDS 3717090..3718598 /gene="sugI" /locus_tag="Rv3331" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF SUGAR ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3331, (MTV016.31), len: 502 aa (start uncertain). Probable sugI, sugar-transport integral membrane protein, possibly member of major facilitator superfamily (MFS), similar to several transporters e.g. P37021|GALP_ECOLI|B2943 GALACTOSE-PROTON SYMPORTER (GALACTOSE TRANSPORTER) from Escherichia coli strain K12 (464 aa), FASTA scores: opt: 818, E(): 1.8e-39, (31.85% identity in 446 aa overlap); P96742|YWTG METABOLITE-TRANSPORT-RELATED PROTEIN from Bacillus subtilis (457 aa), FASTA scores: opt: 810, E(): 5e-39, (33.2% identity in 428 aa overlap); AAG58074|GALP (alias BAB37242|ECS3819) GALACTOSE-PROTON SYMPORT OF TRANSPORT SYSTEM from Escherichia coli strain O157:H7 EDL933 (464 aa), FASTA scores: opt: 810, E(): 5.1e-39, (32.2% identity in 432 aa overlap); P46333|CSBC_BACSU|SS92BR PROBABLE METABOLITE TRANSPORT PROTEIN from Bacillus subtilis (461 aa), FASTA scores: opt: 792, E(): 5.4e-38, (33.7% identity in 442 aa overlap); etc. Equivalent to AAK47777|MT343 from Mycobacterium tuberculosis strain CDC1551 (500 aa) but with some divergence between residues 229 and 254. Contains PS00216 Sugar transport proteins signature 1 and PS00217 Sugar transport proteins signature 2. BELONGS TO THE SUGAR TRANSPORTER FAMILY. TBparse score is 0.869." /codon_start=1 /transl_table=11 /product="sugar-transport integral membrane protein SugI" /protein_id="NP_217848.1" /db_xref="GI:15610467" /db_xref="GOA:O53381" /db_xref="UniProtKB/TrEMBL:O53381" /db_xref="GeneID:887504" /translation="MTTLWQPHRNDYSPIPGRGVHARRGARRPRPRGGRAERPGTGQL TRSGRRALLVGLTAASVGVLYGYDLSAIAGALLSLSEEFELTTREQELLTTTAVLGQI AGALGGGILANAIGRKKSVVLIVAGYAVFALLGATSVSVPMLVVARLLLGVTIGLSVV VVPVYVAESAPAAVRGSLVTAYQLATLSGIVVGYLVGYLLAGSHGWRAMFGLAAAPAT LLLPLLWRMPDTARWYLLKGRIADARSALRRIQPEADIDAELADMAAAVDERGGGIGE MVRRPYLRATLFVIALGFLVQITGINAIIYYSPRLFAAMGFAGYFAMLALPAMVQVAG LAAVCASLFLVDRLGRRPILLSGIATMITADAVLITVFANDSDGGTGLVLGFAGVLLF IIGFNFGFGSLVWVYAAESFPSRLRSMGSSPMLTSTLTANAIVAAFSLTMLRVLGGAG VFAVFGTFAVVAFVVVYRFAPETKGRKLEEIRHFWENGGRWPAERSPAADEP" misc_feature 3717537..3717614 /gene="sugI" /locus_tag="Rv3331" /note="PS00217 Sugar transport proteins signature 2" misc_feature 3718110..3718160 /gene="sugI" /locus_tag="Rv3331" /note="PS00216 Sugar transport proteins signature 1" gene 3718595..3719746 /gene="nagA" /locus_tag="Rv3332" /db_xref="GeneID:887518" CDS 3718595..3719746 /gene="nagA" /locus_tag="Rv3332" /EC_number="3.5.1.25" /function="INVOLVED IN N-ACETYL GLUCOSAMINE UTILIZATION PATHWAY [CATALYTIC ACTIVITY: N-ACETYL-D-GLUCOSAMINE 6-PHOSPHATE + H(2)O = D-GLUCOSAMINE 6-PHOSPHATE + ACETATE]." /note="Rv3332, (MTV016.32), len: 383 aa. Probable nagA, N-acetylglucosamine-6-phosphate deacetylase (EC 3.5.1.25), similar to many e.g. Q9KXV7|SCD95A.17c PUTATIVE DEACETYLASE from Streptomyces coelicolor (381 aa), FASTA scores: opt: 1090, E(): 1.6e-55, (47.8% identity in 385 aa overlap); Q9PDB4|XF1465 N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE from Xylella fastidiosa (386 aa), FASTA scores: opt: 667, E(): 3.5e-31, (38.3% identity in 394 aa overlap); Q9AAZ9|CC0443 N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE from Caulobacter crescentus (378 aa), FASTA scores: opt: 661, E(): 7.5e-31, (38.9% identity in 383 aa overlap); O34450||NAGA_BACSU N-ACETYLGLUCOSAMINE-6-PHOSPHATE DEACETYLASE from Bacillus subtilis (396 aa), FASTA scores: opt: 571, E(): 1.2e-25, (32.45% identity in 376 aa overlap); etc. Equivalent to AAK47778 from Mycobacterium tuberculosis strain CDC1551 (346 aa) but longer 37 aa. BELONGS TO THE NAGA FAMILY. TBparse score is 0.881." /codon_start=1 /transl_table=11 /product="N-acetylglucosamine-6-phosphate deacetylase" /protein_id="NP_217849.1" /db_xref="GI:15610468" /db_xref="GOA:O53382" /db_xref="UniProtKB/TrEMBL:O53382" /db_xref="GeneID:887518" /translation="MTVLGADAVVIDGRICRPGWVHTADGRILSGGAGAPPMPADAEF PDAIVVPGFVDMHVHGGGGASFADGNAADIARAAEFHLRHGTTTTLASLVTAGPAELL SAVGALAEATRDGVVAGIHLEGPWLSPARCGAHDHTRMRAPDPAEIESVLAAADGAVR MVTLAPELPGSDAAIRRFRDAEVVVAVGHTDATYTQTRHAIDLGATVGTHLFNAMPPL DHRAPGPVLALLCDPRVTVEIIADGVHVHPAVVHAVIEAVGPDRVAVVTDAIAAAGCG DGAFRLGTMPIEVESSVARVAGASTLAGSTTTMDQLFRTVAGLGSKSDSAGDVALAAA VQVTSATPARALGLTGVGRLAAGYAANLVVLDRDLRVTAVMVNDDWRVG" gene complement(3719937..3720782) /locus_tag="Rv3333c" /db_xref="GeneID:887632" CDS complement(3719937..3720782) /locus_tag="Rv3333c" /function="UNKNOWN" /note="Rv3333c, (MTV016.33c), len: 281 aa. Hypothetical unknown pro-rich protein. Equivalent to AAK47780 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (265 aa) but longer 16 aa. TBparse score is 0.927." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217850.1" /db_xref="GI:15610469" /db_xref="UniProtKB/TrEMBL:O53383" /db_xref="GeneID:887632" /translation="MFTGIASHAGALGAALVVLIGAAILHDGPAAADPNQDDRFLALL EKKEIPAVANVPRVIDAAHKVCRKLDGGMPVNDIVDGLRNDAYNIDPVMRLYPVRLTT TMTRFISAAVEIYCPNHHSKMAFAMANFEPGSNEPTHRVAASTRSAVNSGSDLRASVS DMTIMSPGWREPTGAMLASVLGAVRAGDPLIPNPPPIPVPPPAAQTLIPPPPIVAPPP PRPAPPQQPPPPPPEVEPPAGVPQSGGAAGSGGAGSGGGGGGDGPVEPSPARPMPPGF IRLAP" gene 3721257..3721697 /locus_tag="Rv3334" /db_xref="GeneID:887593" CDS 3721257..3721697 /locus_tag="Rv3334" /function="INVOLVED IN A TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3334, (MTV016.34), len: 146 aa. Probable transcriptional regulator, similar to many regulatory proteins (notably mercury resistance operon regulators) e.g. Q9HXV1|PA3689 PROBABLE TRANSCRIPTIONAL REGULATOR MERR FAMILY from Pseudomonas aeruginosa (156 aa), FASTA scores: opt: 275, E(): 1.6e-11, (35.95% identity in 139 aa overlap); Q9AKR6|PBRR LEAD RESISTANCE OPERON REGULATOR from Ralstonia metallidurans strain CH34 (plasmid pMOL30) (145 aa), FASTA scores: opt: 267, E(): 5.2e-11, (35.8% identity in 134 aa overlap); P95838|MERR MERCURIC RESISTANCE OPERON REGULATOR from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (144 aa), FASTA scores: opt: 266, E(): 6e-11, (31.35% identity in 118 aa overlap); P22853|MERR_BACSR MERCURIC RESISTANCE OPERON REGULATOR from Bacillus sp. strain RC607 (132 aa), FASTA scores: opt: 262, E(): 1e-10, (34.6% identity in 130 aa overlap); etc. Contains probable helix-turn-helix motif at aa 1-22 (Score 1478, +4.22 SD). SEEMS TO BELONG TO THE MERR FAMILY OF TRANSCRIPTIONAL REGULATORS. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="MerR family transcriptional regulator" /protein_id="NP_217851.1" /db_xref="GI:15610470" /db_xref="GOA:O53384" /db_xref="UniProtKB/TrEMBL:O53384" /db_xref="GeneID:887593" /translation="MKISEVAALTNTSTKTLRFYENSGLLPPPARTASGYRNYGPEIV DRLRFIHRGQAAGLALQEVRQILAIHDRGEAPCAHVRQLLSTRIDEVRAQIAELIALE GHLQTLLDHASYGPPTEHDHSTVCWILESDLDEPTAIEVSDIHA" gene complement(3721731..3722600) /locus_tag="Rv3335c" /db_xref="GeneID:888029" CDS complement(3721731..3722600) /locus_tag="Rv3335c" /function="UNKNOWN" /note="Rv3335c, (MTV016.35c), len: 289 aa. Probable conserved integral membrane protein, equivalent to Q49909|ML0687 PUTATIVE MEMBRANE PROTEIN U0308AA from Mycobacterium leprae (313 aa), FASTA scores: opt: 1299, E(): 8.9e-75, (68.75% identity in 288 aa overlap). Also similar to other hypothetical bacterial proteins e.g. BAB37825|ECS4402 from Escherichia coli strain O157:H7 (alias P37642|YHJD_ECOLI|B3522 strain K12) (337 aa), FASTA scores: opt: 591, E(): 4.2e-30, (35.15% identity in 273 aa overlap); P45417|YHJD_ERWCH from Erwinia chrysanthemi (328 aa), FASTA scores: opt: 500, E(): 2.2e-24, (34.9% identity in 275 aa overlap); Q9KZA0|SC5G8.14 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (321 aa), FASTA scores: opt: 321, E(): 4.3e-13, (27.3% identity in 271 aa overlap); etc. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217852.1" /db_xref="GI:15610471" /db_xref="GOA:O53385" /db_xref="UniProtKB/TrEMBL:O53385" /db_xref="GeneID:888029" /translation="MGELAEPGVLDRLRARFGWLDHVVRAFTRFNDRNGSLFAAGLTY YTIFAIFPLLMVGFGVGGFALSRRPELLTTLEERIRTSVSGAVGQQLVDLMNSAIDAR ASVGVIGLATAAWVGLGWMWHLREALSQMWAHPVAPAGYLRTKLSDLAAMVGTFVVIV ATIALTVLGHARPMAAVLRWLEIPQFSVFDEIFRGISVLVSVLVSWVLFTWMIGRLPR EPVGLVTAARAGLMAAVGFELFKQVGAIYLQIVLRSPAGAVFGPVLGLMVFAFVTAWL ILFATAWAATASA" gene complement(3722621..3723631) /gene="trpS" /locus_tag="Rv3336c" /db_xref="GeneID:887559" CDS complement(3722621..3723631) /gene="trpS" /locus_tag="Rv3336c" /EC_number="6.1.1.2" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-TRYPTOPHAN + TRNA(TRP) = AMP + PYROPHOSPHATE + L-TRYPTOPHANYL-TRNA(TRP)]." /note="catalyzes a two-step reaction, first charging a tryptophan molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="tryptophanyl-tRNA synthetase" /protein_id="NP_217853.1" /db_xref="GI:15610472" /db_xref="GOA:P67590" /db_xref="UniProtKB/Swiss-Prot:P67590" /db_xref="GeneID:887559" /translation="MSTPTGSRRIFSGVQPTSDSLHLGNALGAVAQWVGLQDDHDAFF CVVDLHAITIPQDPEALRRRTLITAAQYLALGIDPGRATIFVQSQVPAHTQLAWVLGC FTGFGQASRMTQFKDKSARQGSEATTVGLFTYPVLQAADVLAYDTELVPVGEDQRQHL ELARDVAQRFNSRFPGTLVVPDVLIPKMTAKIYDLQDPTSKMSKSAGTDAGLINLLDD PALSAKKIRSAVTDSERDIRYDPDVKPGVSNLLNIQSAVTGTDIDVLVDGYAGHGYGD LKKDTAEAVVEFVNPIQARVDELTADPAELEAVLAAGAQRAHDVASKTVQRVYDRLGF LL" misc_feature complement(3723554..3723586) /gene="trpS" /locus_tag="Rv3336c" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature" gene 3723656..3724042 /locus_tag="Rv3337" /db_xref="GeneID:887556" CDS 3723656..3724042 /locus_tag="Rv3337" /function="UNKNOWN" /note="Rv3337, (MTV016.37), len: 128 aa. Conserved hypothetical protein, equivalent to N-terminus of Q49926|ML0685 TPEA (PUTATIVE HYDROLASE) from Mycobacterium leprae (303 aa), FASTA scores: opt: 362, E(): 5.7e-17, (74.3% identity in 70 aa overlap). Also weak similarity in N-terminus to Q98JT7|BAB49078|MLR1789 PROBABLE EPOXIDE HYDROLASE from Rhizobium loti (Mesorhizobium loti) (300 aa), FASTA scores: opt: 122, E(): 0.74, (31.95% identity in 97 aa overlap). Homology suggests this ORF should be in frame with the following ORF MTV016.38 but no sequence error could be found. Short distance to start of trpS suggests region may not be protein-coding. TBparse score is 0.941. C-terminus extended since first submission (+47 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217854.2" /db_xref="GI:57117091" /db_xref="UniProtKB/TrEMBL:O53387" /db_xref="GeneID:887556" /translation="MPSPSTTGHHAACGTGGTGFSVGSMRSPIRVGSGEPVLLLHPFL MSQTVWEKVAQQLADTGRFEVFAPTMAGHNGGPASGTRFCPRRCWPTTSNASSTNWAG KPAISSATRWAAGSRSNSNDVAGHAA" gene 3723904..3724548 /locus_tag="Rv3338" /db_xref="GeneID:888020" CDS 3723904..3724548 /locus_tag="Rv3338" /function="UNKNOWN" /note="Rv3338, (MTV016.38), len: 214 aa. Hypothetical protein, equivalent to C-termini of Q49926|ML0685 TPEA (PUTATIVE HYDROLASE) from Mycobacterium leprae (303 aa), FASTA scores: opt: 984, E(): 2.6e-56, (65.4% identity in 214 aa overlap); and O32873|MLCB1779.02 HYPOTHETICAL 31.8 KDA PROTEIN (SIMILAR TO ALPHA/BETA HYDROLASE FOLD) from Mycobacterium leprae (292 aa), FASTA scores: opt: 984, E(): 2.5e-56, (65.4% identity in 214 aa overlap). Also similar to C-termini of several hypothetical proteins (generally hydrolases) e.g. Q9K3H6|2SCG18.11 PUTATIVE HYDROLASE from Streptomyces coelicolor (316 aa), FASTA scores: opt: 213, E(): 1.4e-06, (29.75% identity in 185 aa overlap). Homology suggests that this ORF should be in frame with the previous ORF MTV016.37 but no sequence error could be found. TBparse score is 0.887." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217855.1" /db_xref="GI:15610474" /db_xref="UniProtKB/TrEMBL:O53388" /db_xref="GeneID:888020" /translation="MSSAVLADHVERQLDELGWETSHIVGNSLGGWVAFELERRGRAR SVTGIAPAGGWTRWSPVKFEVIAKFIAGAPILAVAHILGQRALRLPFSRLLATLPISA TPDGVSERELSGIIDDAAHCPAYFQLLVKALVLPGLQELEHTAVPSHVVLCEQDRVVP PSRFSRHFTDSLPAGHRLTVLDGVGHVPMFEAPGRITELITSFIEECCPHVRAS" gene complement(3724615..3725844) /gene="icd1" /locus_tag="Rv3339c" /db_xref="GeneID:888013" CDS complement(3724615..3725844) /gene="icd1" /locus_tag="Rv3339c" /EC_number="1.1.1.41" /function="INVOLVED IN THE KREBS CYCLE [CATALYTIC ACTIVITY: ISOCITRATE + NADP(+) = 2-OXOGLUTARATE + CO(2) + NADPH]." /note="Converts isocitrate to alpha ketoglutarate" /codon_start=1 /transl_table=11 /product="isocitrate dehydrogenase" /protein_id="NP_217856.1" /db_xref="GI:15610475" /db_xref="GOA:P65097" /db_xref="UniProtKB/Swiss-Prot:P65097" /db_xref="GeneID:888013" /translation="MSNAPKIKVSGPVVELDGDEMTRVIWKLIKDMLILPYLDIRLDY YDLGIEHRDATDDQVTIDAAYAIKKHGVGVKCATITPDEARVEEFNLKKMWLSPNGTI RNILGGTIFREPIVISNVPRLVPGWTKPIVIGRHAFGDQYRATNFKVDQPGTVTLTFT PADGSAPIVHEMVSIPEDGGVVLGMYNFKESIRDFARASFSYGLNAKWPVYLSTKNTI LKAYDGMFKDEFERVYEEEFKAQFEAAGLTYEHRLIDDMVAACLKWEGGYVWACKNYD GDVQSDTVAQGYGSLGLMTSVLMTADGKTVEAEAAHGTVTRHYRQYQAGKPTSTNPIA SIFAWTRGLQHRGKLDGTPEVIDFAHKLESVVIATVESGKMTKDLAILIGPEQDWLNS EEFLDAIADNLEKELAN" misc_feature complement(3724966..3725025) /gene="icd1" /locus_tag="Rv3339c" /note="PS00470 Isocitrate and isopropylmalate dehydrogenases signature" gene 3726127..3727476 /gene="metC" /locus_tag="Rv3340" /db_xref="GeneID:888037" CDS 3726127..3727476 /gene="metC" /locus_tag="Rv3340" /EC_number="2.5.1.49" /function="TRANSFORMS O-ACETYLHOMOSERINE INTO L-METHIONINE [CATALYTIC ACTIVITY: O-ACETYL-L-HOMOSERINE + METHANETHIOL = L-METHIONINE + ACETATE]." /note="catalyzes the formation of L-methionine and acetate from O-acetyl-L-homoserine and methanethiol" /codon_start=1 /transl_table=11 /product="O-acetylhomoserine aminocarboxypropyltransferase" /protein_id="NP_217857.1" /db_xref="GI:15610476" /db_xref="GOA:O53390" /db_xref="UniProtKB/TrEMBL:O53390" /db_xref="GeneID:888037" /translation="MSADSNSTDADPTAHWSFETKQIHAGQHPDPTTNARALPIYATT SYTFDDTAHAAALFGLEIPGNIYTRIGNPTTDVVEQRIAALEGGVAALFLSSGQAAET FAILNLAGAGDHIVSSPRLYGGTYNLFHYSLAKLGIEVSFVDDPDDLDTWQAAVRPNT KAFFAETISNPQIDLLDTPAVSEVAHRNGVPLIVDNTIATPYLIQPLAQGADIVVHSA TKYLGGHGAAIAGVIVDGGNFDWTQGRFPGFTTPDPSYHGVVFAELGPPAFALKARVQ LLRDYGSAASPFNAFLVAQGLETLSLRIERHVANAQRVAEFLAARDDVLSVNYAGLPS SPWHERAKRLAPKGTGAVLSFELAGGIEAGKAFVNALKLHSHVANIGDVRSLVIHPAS TTHAQLSPAEQLATGVSPGLVRLAVGIEGIDDILADLELGFAAARRFSADPQSVAAF" misc_feature 3726760..3726804 /gene="metC" /locus_tag="Rv3340" /note="PS00868 Cys/Met metabolism enzymes pyridoxal-phosphate attachment site" gene 3727488..3728627 /gene="metX" /locus_tag="Rv3341" /db_xref="GeneID:888030" CDS 3727488..3728627 /gene="metX" /locus_tag="Rv3341" /EC_number="2.3.1.31" /function="CATALYZES ACYLATION OF L-HOMOSERINE. INVOLVED IN BIOSYNTHESIS OF METHIONINE; HTA VARIANT; FIRST STEP [CATALYTIC ACTIVITY: ACETYL-CoA + L-HOMOSERINE = CoA + O-ACETYL-L-HOMOSERINE]." /note="Catalyzes the conversion of acetyl-CoA and L-homoserine to CoA and O-acetyl-L-homoserine" /codon_start=1 /transl_table=11 /product="homoserine O-acetyltransferase" /protein_id="NP_217858.1" /db_xref="GI:15610477" /db_xref="GOA:O53391" /db_xref="UniProtKB/Swiss-Prot:O53391" /db_xref="GeneID:888030" /translation="MTISDVPTQTLPAEGEIGLIDVGSLQLESGAVIDDVCIAVQRWG KLSPARDNVVVVLHALTGDSHITGPAGPGHPTPGWWDGVAGPGAPIDTTRWCAVATNV LGGCRGSTGPSSLARDGKPWGSRFPLISIRDQVQADVAALAALGITEVAAVVGGSMGG ARALEWVVGYPDRVRAGLLLAVGARATADQIGTQTTQIAAIKADPDWQSGDYHETGRA PDAGLRLARRFAHLTYRGEIELDTRFANHNQGNEDPTAGGRYAVQSYLEHQGDKLLSR FDAGSYVILTEALNSHDVGRGRGGVSAALRACPVPVVVGGITSDRLYPLRLQQELADL LPGCAGLRVVESVYGHDGFLVETEAVGELIRQTLGLADREGACRR" gene 3728624..3729355 /locus_tag="Rv3342" /db_xref="GeneID:888046" CDS 3728624..3729355 /locus_tag="Rv3342" /EC_number="2.1.1.-" /function="CAUSES METHYLATION" /note="Rv3342, (MTV016.42), len: 243 aa. Possible methyltransferase (EC 2.1.1.-), similar to various proteins e.g. Q9I5X8|PA0558 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (255 aa), FASTA scores: opt: 496, E(): 4.4e-24, (39.85% identity in 236 aa overlap); Q9XBC9|CZA382.22c PUTATIVE RRNA METHYLASE from Amycolatopsis orientalis (259 aa), FASTA scores: opt: 473, E(): 1.2e-22, (42.45% identity in 245 aa overlap); Q9UTA8|SPAC25B8.10 PUTATIVE METHYLTRANSFERASE from Schizosaccharomyces pombe (Fission yeast) (256 aa), FASTA scores: opt: 470, E(): 1.9e-22, (35.7% identity in 238 aa overlap); and Q9UTA9|SPAC25B8.09 PUTATIVE METHYLTRANSFERASE from Schizosaccharomyces pombe (Fission yeast) (251 aa), FASTA scores: opt: 418, E(): 3.4e-19, (31.2% identity in 237 aa overlap); etc. Start uncertain. BELONGS TO THE METHYLTRANSFERASE SUPERFAMILY. TBparse score is 0.878." /codon_start=1 /transl_table=11 /product="methyltransferase (methylase)" /protein_id="NP_217859.1" /db_xref="GI:15610478" /db_xref="GOA:P65348" /db_xref="UniProtKB/Swiss-Prot:P65348" /db_xref="GeneID:888046" /translation="MTCSRRDMSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLD LGAGTGKLTTRLVERGLDVVAVDPIPEMLDVLRAALPQTVALLGTAEEIPLDDNSVDA VLVAQAWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGEIIGRDGDPVR DRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCITSPAQVRTKTLDRVRQLLA THPALANSNGLALPYVTVCVRATLA" gene complement(3729364..3736935) /gene="PPE54" /locus_tag="Rv3343c" /db_xref="GeneID:888033" CDS complement(3729364..3736935) /gene="PPE54" /locus_tag="Rv3343c" /function="UNKNOWN" /note="Rv3343c, (MTV016.43c), len: 2523 aa. Member of the Mycobacterium tuberculosis PPE family, MPTR subgroup of Gly-, Asn-rich proteins. Most similar to O50379|Rv3350c|MTV004.07c|MTV004_5 from Mycobacterium tuberculosis strain H37Rv (3716 aa), FASTA scores: opt: 4672, E(): 4e-211, (44.2% identity in 3174 aa overlap); and also similar to MTV004_3, MTCY63_9, MTY13E10_17, MTY13E10_16, MTCY180_1, MTV050_1, MTCY3C7_23, MTV014_3, MTCY63_10; etc. TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177960.1" /db_xref="GI:57117092" /db_xref="GOA:Q6MWY2" /db_xref="UniProtKB/TrEMBL:Q6MWY2" /db_xref="GeneID:888033" /translation="MSFVVMPPEINSLLIYTGAGPGPLLAAAAAWDELAAELGSAAAA FGSVTSGLVGGIWQGPSSVAMAAAAAPYAGWLSAAAASAESAAGQARAVVGVFEAALA ETVDPFVIAANRSRLVSLALSNLFGQNTPAIAAAEFDYELMWAQDVAAMLGYHTGASA AAEALAPFGSPLASLAAAAEPAKSLAVNLGLANVGLFNAGSGNVGSYNVGAGNVGSYN VGGGNIGGNNVGLGNVGWGNFGLGNSGLTPGLMGLGNIGFGNAGSYNFGLANMGVGNI GFANTGSGNFGIGLTGDNLTGFGGFNTGSGNVGLFNSGTGNVGFFNSGTGNWGVFNSG SYNTGIGNSGIASTGLFNAGGFNTGVVNAGSYNTGSFNAGEANTGGFNPGSVNTGWLN TGDINTGVANSGDVNTGAFISGNYSNGVLWRGDYQGLLGFSSGANVLPVIPLSLDING GVGAITIEPIHILPDIPININETLYLGPLVVPPINVPAISLGVGIPNISIGPIKINPI TLWPAQNFNQTITLAWPVSSITIPQIQQVALSPSPIPTTLIGPIHINTGFSIPVTFSY STPALTLFPVGLSIPTGGPLTLTLGVTAGTEAFTIPGFSIPEQPLPLAINVIGHINAL STPAITIDNIPLNLHAIGGVGPVDIVGGNVPASPGFGNSTTAPSSGFFNTGAGGVSGF GNVGAHTSGWFNQSTQAMQVLPGTVSGYFNSGTLMSGIGNVGTQLSGMLSGGALGGNN FGLGNIGFDNVGFGNAGSSNFGLANMGIGNIGLANTGNGNIGIGLSGDNLTGFGGFNS GSENVGLFNSGTGNVGFFNSGTGNLGVFNSGSHNTGFFLTGNNINVLAPFTPGTLFTI SEIPIDLQVIGGIGPIHVQPIDIPAFDIQITGGFIGIREFTLPEITIPAIPIHVTGTV GLEGFHVNPAFVLFGQTAMAEITADPVVLPDPFITIDHYGPPLGPPGAKFPSGSFYLS ISDLQINGPIIGSYGGPGTIPGPFGATFNLSTSSLALFPAGLTVPDQTPVTVNLTGGL DSITLFPGGLAFPENPVVSLTNFSVGTGGFTVFPQGFTVDRIPVDLHTTLSIGPFPFR WDYIPPTPANGPIPAVPGGFGLTSGLFPFHFTLNGGIGPISIPTTTVVDALNPLLTVT GNLEVGPFTVPDIPIPAINFGLDGNVNVSFNAPATTLLSGLGITGSIDISGIQITNIQ TQPAQLFMSVGQTLFLFDFRDGIELNPIVIPGSSIPITMAGLSIPLPTVSESIPLNFS FGSPASTVKSMILHEILPIDVSINLEDAVFIPATVLPAIPLNVDVTIPVGPINIPIIT EPGSGNSTTTTSDPFSGLAVPGLGVGLLGLFDGSIANNLISGFNSAVGIVGPNVGLSN LGGGNVGLGNVGDFNLGAGNVGGFNVGGGNIGGNNVGLGNVGFGNVGLANSGLTPGLM GLGNIGFGNAGSYNFGLANMGVGNIGFANTGSGNFGIGLTGDNLTGFGGFNTGSGNVG LFNSGTGNVGFFNSGTGNWGVFNSGSYNTGIGNSGIASTGLFNAGGFNTGVVNAGSYN TGSFNAGQANTGGFNPGSVNTGWLNTGDINTGVANSGDVNTGAFISGNYSNGAFWRGD YQGLLGFSYRPAVLPQTPFLDLTLTGGLGSVVIPAIDIPAIRPEFSANVAIDSFTVPS IPIPQIDLAATTVSVGLGPITVPHLDIPRVPVTLNYLFGSQPGGPLKIGPITGLFNTP IGLTPLALSQIVIGASSSQGTITAFLANLPFSTPVVTIDEIPLLASITGHSEPVDIFP GGLTIPAMNPLSINLSGGTGAVTIPAITIGEIPFDLVAHSTLGPVHILIDLPAVPGFG NTTGAPSSGFFNSGAGGVSGFGNVGAMVSGGWNQAPSALLGGGSGVFNAGTLHSGVLN FGSGMSGLFNTSVLGLGAPALVSGLGSVGQQLSGLLASGTALHQGLVLNFGLADVGLG NVGLGNVGDFNLGAGNVGGFNVGGGNIGGNNVGLGNVGWGNFGLGNSGLTPGLMGLGN IGFGNAGSYNFGLANMGVGNIGFANTGSGNFGIGLTGDNLTGFGGFNTGSGNVGLFNS GTGNVGFFNSGTGNWGVFNSGSYNTGIGNSGIASTGLFNAGGFNTGVVNAGSYNTGSF NAGQANTGGFNPGSVNTGWLNTGDINTGVANSGDVNTGAFISGNYSNGAFWRGDYQGL LGFSYTSTIIPEFTVANIHASGGAGPIIVPSIQFPAIPLDLSATGHIGGFTIPPVSIS PITVRIDPVFDLGPITVQDITIPALGLDPATGVTVGPIFSSGSIIDPFSLTLLGFINV NVPAIQTAPSEILPFTVLLSSLGVTHLTPEITIPGFHIPVDPIHVELPLSVTIGPFVS PEITIPQLPLGLALSGATPAFAFPLEITIDRIPVVLDVNALLGPINAGLVIPPVPGFG NTTAVPSSGFFNIGGGGGLSGFHNLGAGMSGVLNAISDPLLGSASGFANFGTQLSGIL NRGADISGVYNTGALGLITSALVSGFGNVGQQLAGLIYTGTGP" gene complement(3736984..3738438) /gene="PE_PGRS49" /locus_tag="Rv3344c" /db_xref="GeneID:888115" CDS complement(3736984..>3738438) /gene="PE_PGRS49" /locus_tag="Rv3344c" /function="UNKNOWN" /note="Rv3344c, (MTV016.44c), len: 484 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-, ala-rich proteins (see citation below). Appears to be a gene fragment, should be in-frame with following ORF, MTV016.45c, frameshift required around 49595 but could not be found on checking BAC and cosmid clones. Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53557|Rv3512|MTV023.19 (1079 aa), FASTA scores: opt: 1595, E(): 1.8e-54, (52.0% identity in 544 aa overlap). TBparse score is 0.831." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177961.1" /db_xref="GI:57117093" /db_xref="UniProtKB/TrEMBL:Q6MWY1" /db_xref="GeneID:888115" /translation="AQASPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGG DAGNAGSGGNGGKGGDGVGPGSTGGAGGKGGAGANGGSSNGNARGGNAGNGGHGGAGG SGDTGGAGGAGGQGGFGGTGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDH GGPATNPGSGSRGGAGGSGGNGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIGVT GAPGGNGGKGGAGGSNPNGSGGDGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGA GGNGSLSSGEGGKGGDGGHGGDGVGGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGG DGGQGGPNGGGTVGTVAGGGGNGGVGGRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNG GLGGAGGGGGNAPDGGFGGNGGKGGQGGIGGGTQSATGLGGDGGDGGDGGNGGNSGAK AGGAGGKGQAGQPNSGTEPGFGGDGGLGGAGATP" gene complement(3738158..3742774) /gene="PE_PGRS50" /locus_tag="Rv3345c" /db_xref="GeneID:888114" CDS complement(3738158..3742774) /gene="PE_PGRS50" /locus_tag="Rv3345c" /function="UNKNOWN" /note="Rv3345c, (MTV004.01c-MTV016.45c), 1538 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Similar to AAK47791 from strain CDC1551 but with some big gaps (after residues 501 and 1419; and for AAK47791 after residue 991). Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 4508, E(): 7e-161, (52.1% identity in 1529 aa overlap); MTV004_1, MTV023_21, MTV023_15, MTCY493_4, MTV039_16, MTV008_46, MTV023_14, MTV023_19, MTV043_26, MTCY493_2, MTCY441_4; etc." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177962.1" /db_xref="GI:57117094" /db_xref="UniProtKB/TrEMBL:Q6MWY0" /db_xref="GeneID:888114" /translation="MVMSLMVAPELVAAAAADLTGIGQAISAANAAAAGPTTQVLAAA GDEVSAAIAALFGTHAQEYQALSARVATFHEQFVRSLTAAGSAYATAEAANASPLQAL EQQVLGAINAPTQLWLGRPLIGDGVHGAPGTGQPGGAGGLLWGNGGNGGSGAAGQVGG PGGAAGLFGNGGSGGSGGAGAAGGVGGSGGWLNGNGGAGGAGGTGANGGAGGNAWLFG AGGSGGAGTNGGVGGSGGFVYGNGGAGGIGGIGGIGGNGGDAGLFGNGGAGGAGAAGL PGAAGLNGGDGSDGGNGGTGGNGGRGGLLVGNGGAGGAGGVGGDGGKGGAGDPSFAVN NGAGGNGGHGGNPGVGGAGGAGGLLAGAHGAAGATPTSGGNGGDGGIGATANSPLQAG GAGGNGGHGGLVGNGGTGGAGGAGHAGSTGATGTALQPTGGNGTNGGAGGHGGNGGNG GAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADGGMGGNGGKGGDGGKAGDGGA GAAGDVTLAVNQGAGGDGGNGGEVGVGGKGGAGGVSANPALNGSAGANGTAPTSGGNG GNGGAGATPTVAGENGGAGGNGGHGGSVGNGGAGGAGGNGVAGTGLALNGGNGGNGGI GGNGGSAAGTGGDGGKGGNGGAGANGQDFSASANGANGGQGGNGGNGGIGGKGGDAFA TFAKAGNGGAGGNGGNVGVAGQGGAGGKGAIPAMKGATGADGTAPTSGGDGGNGGNGA SPTVAGGNGGDGGKGGSGGNVGNGGNGGAGGNGAAGQAGTPGPTSGDSGTSGTDGGAG GNGGAGGAGGTLAGHGGNGGKGGNGGQGGIGGAGERGADGAGPNANGANGENGGSGGN GGDGGAGGNGGAGGKAQAAGYTDGATGTGGDGGNGGDGGKAGDGGAGENGLNSGAMLP GGGTVGNPGTGGNGGNGGNAGVGGTGGKAGTGSLTGLDGTDGITPNGGNGGNGGNGGK GGTAGNGSGAAGGNGGNGGSGLNGGDAGNGGNGGGALNQAGFFGTGGKGGNGGNGGAG MINGGLGGFGGAGGGGAVDVAATTGGAGGNGGAGGFASTGLGGPGGAGGPGGAGDFAS GVGGVGGAGGDGGAGGVGGFGGQGGIGGEGRTGGNGGSGGDGGGGISLGGNGGLGGNG GVSETGFGGAGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVSTTGGDGGAGGKGGNGGD GGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSGGAGGNG GTGLNGAGGAGGAGGNAGVAGVSFGNAVGGDGGNGGNGGHGGDGTTGGAGGKGGNGSS GAASGSGVVNVTAGHGGNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGGATGGDGGN GGNGGNSGNSTGVAGLAGGAAGAGGNGGGTSSAAGHGGSGGSGGSGTTGGAGAAGGNG GAGAGGGSLSTGQSGGPRRQRWCRWQRRRWLGRQRRRRWCRWQRRCRRQRWRWRCRQR RLRRQWRQGRRRCRPWLHRRRGRQGRRWRQRRFQQRQRSRWQRR" gene complement(3743198..3743455) /locus_tag="Rv3346c" /db_xref="GeneID:888010" CDS complement(3743198..3743455) /locus_tag="Rv3346c" /function="UNKNOWN" /note="Rv3346c, (MTV004.02c), len: 85 aa. Conserved hypothetical protein, highly similar to mycobacterium hypothetical proteins O50384|Rv3355c|MTV004.12c from strain H37Rv (97 aa), FASTA scores: opt: 413, E(): 4.6e-23, (85.55% identity in 97 aa overlap); O32878|MLCB1779.16c|ML0675 from Mycobacterium leprae (91 aa), FASTA scores: opt: 349, E(): 1.7e-18, (67.35% identity in 95 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217863.1" /db_xref="GI:15610482" /db_xref="UniProtKB/TrEMBL:O50377" /db_xref="GeneID:888010" /translation="MTVRAVLRRTVGAQWPILAGVNFWRRGALLIGIGVGVAAVLRLV LSEERAGLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG" repeat_region 3743198..3743404 /note="207 bp imperfect direct repeat 1, 199/207 bp identical to second copy at 3769514..3769720" repeat_region 3743402..3743510 /note="109 bp imperfect direct repeat 1, 95/109 bp identical to second copy at 3769754..3769862" repeat_region 3743508..3743605 /note="98 bp imperfect direct repeat 1, 82/98 bp identical to the second copy at 3770994..3771091" gene complement(3743711..3753184) /gene="PPE55" /locus_tag="Rv3347c" /db_xref="GeneID:888120" CDS complement(3743711..3753184) /gene="PPE55" /locus_tag="Rv3347c" /function="UNKNOWN" /note="Rv3347c, (MTV004.03c), len: 3157 aa. Member of the Mycobacterium tuberculosis PPE family, Gly-, Ala-, Asn-rich protein. Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551, e.g. O50379|Rv3350c|MTV004.07c (3716 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); and other upstream ORFs MTV004_5, MTY13E10_15, MTCY28_16, MTCY63_9, MTY13E10_17, MTCY180_1; etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177963.1" /db_xref="GI:57117095" /db_xref="UniProtKB/TrEMBL:Q6MWX9" /db_xref="GeneID:888120" /translation="MNFPVLPPEINSVLMYSGAGSSPLLAAAAAWDGLAEELGSAAVS FGQVTSGLTAGVWQGAAAAAMAAAAAPYAGWLGSVAAQAVAVAGQARAAVAAFEAALA ATVDPAAVAVNRMAMRALAMSNLLGQNAAAIAAVEAEYELMWAADVAAMAGYHSGASA AAAALPAFSPPAQALGGGVGAFLNALFAGPAKMLRLNAGLGNVGNYNVGLGNVGIFNL GAANVGAQNLGAANAGSGNFGFGNIGNANFGFGNSGLGLPPGMGNIGLGNAGSSNYGL ANLGVGNIGFANTGSNNIGIGLTGDNLTGIGGLNSGTGNLGLFNSGTGNIGFFNSGTG NFGVFNSGSYNTGVGNAGTASTGLFNVGGFNTGVANVGSYNTGSFNAGNTNTGGFNPG NVNTGWLNTGNTNTGIANSGNVNTGAFISGNFSNGVLWRGDYEGLWGLSGGSTIPAIP IGLELNGGVGPITVLPIQILPTIPLNIHQTFSLGPLVVPDIVIPAFGGGTAIPISVGP ITISPITLFPAQNFNTTFPVGPFFGLGVVNISGIEIKDLAGNVTLQLGNLNIDTRINQ SFPVTVNWSTPAVTIFPNGISIPNNPLALLASASIGTLGFTIPGFTIPAAPLPLTIDI DGQIDGFSTPPITIDRIPLNLGASVTVGPILINGVNIPATPGFGNTTTAPSSGFFNSG DGGVSGFGNFGAGSSGWWNQAQTEVAGAGSGFANFGSLGSGVLNFGSGVSGLYNTGGL PPGTPAVVSGIGNVGEQLSGLSSAGTALNQSLIINLGLADVGSVNVGFGNVGDFNLGA ANIGDLNVGLGNVGGGNVGFGNIGDANFGLGNAGLAAGLAGVGNIGLGNAGSGNVGFG NMGVGNIGFGNTGTNNLGIGLTGDNQTGIGGLNSGAGNIGLFNSGTGNVGLFNSGTGN FGLFNSGSFNTGIGNGGTGSTGLFNAGNFNTGVANPGSYNTGSFNVGDTNTGGFNPGS INTGWFNTGNANTGVANSGNVDTGALMSGNFSNGILWRGNFEGLFGLNVGITIPEFPI HWTSTGGIGPIIIPDTTILPPIHLGLTGQANYGFAVPDIPIPAIHIDFDGAADAGFTA PATTLLSALGITGQFRFGPITVSNVQLNPFNVNLKLQFLHDAFPNEFPDPTISVQIQV AIPLTSATLGGLALPLQQTIDAIELPAISFSQSIPIDIPPIDIPASTINGISMSEVVP IDVSVDIPAVTITGTRIDPIPLNFDVLSSAGPINISIIDIPALPGFGNSTELPSSGFF NTGGGGGSGIANFGAGVSGLLNQASSPMVGTLSGLGNAGSLASGVLNSGVDISGMFNV STLGSAPAVISGFGNLGNHVSGVSIDGLLAMLTSGGSGGSGQPSIIDAAIAELRHLNP LNIVNLGNVGSYNLGFANVGDVNLGAGNLGNLNLGGGNLGGQNLGLGNLGDGNVGFGN LGHGNVGFGNSGLGALPGIGNIGLGNAGSNNVGFGNMGLGNIGFGNTGTNNLGIGLTG DNQTGFGGLNSGAGNLGLFNSGTGNIGFFNTGTGNWGLFNSGSYNTGIGNSGTGSTGL FNAGSFNTGLANAGSYNTGSLNAGNTNTGGFNPGNVNTGWFNAGHTNTGGFNTGNVNT GAFNSGSFNNGALWTGDHHGLVGFSYSIEITGSTLVDINETLNLGPVHIDQIDIPGMS LFDIHELVNIGPFRIEPIDVPAVVLDIHETMVIPPIVFLPSMTIGGQTYTIPLDTPPA PAPPPFRLPLLFVNALGDNWIVGASNSTGMSGGFVTAPTQGILIHTGPSSATTGSLAL TLPTVTIPTITTSPIPLKIDVSGGLPAFTLFPGGLNIPQNAIPLTIDASGVLDPITIF PGGFTIDPLPLSLALNISVPDSSVPIIIVPPTPGFGNATATPSSGFFNSGAGGVSGFG NFGAGSSGWWNQAHAALAGAGSGVLNVGTLNSGVLNVGSGISGLYNTAIVGLGTPALV SGAGNVGQQLSGVLAAGTALTQSPIINLGLADVGNYNLGLGNVGDFNLGAANLGDLNL GLGNIGNANVGFGNIGHGNVGFGNSGLGAALGIGNIGLGNAGSTNVGLANMGVGNIGF ANTGTNNLGIGLTGDNQTGIGGLNSGAGNIGLFNSGTGNIGFFNSGTGNWGLFNSGSF NTGIGNSGTGSTGLFNAGGFTTGLANAGSYNTGSFNVGDTNTGGFNPGSINTGWFNTG NANTGIANSGNVDTGALMSGNFSNGILWRGNYEGLFSYSYSLDVPRITILDAHFTGAF GPVVVPPIPVLAINAHLTGNAAMGAFTIPQIDIPALNPNVTGSVGFGPIAVPSVTIPA LTAARAVLDMAASVGATSEIEPFIVWTSSGAIGPTWYSVGRIYNAGDLFVGGNIISGI PTLSTTGPVHAVFNAASQAFNTPALNIHQIPLGFQVPGSIDAITLFPGGLTFPANSLL NLDVFVGTPGATIPAITFPEIPANADGELYVIAGDIPLINIPPTPGIGNTTTVPSSGF FNTGAGGGSGFGNFGANMSGWWNQAHTALAGAGSGIANVGTLHSGVLNLGSGLSGIYN TSTLPLGTPALVSGLGNVGDHLSGLLASNVGQNPITIVNIGLANVGNGNVGLGNIGNL NLGAANIGDVNLGFGNIGDVNLGFGNIGGGNVGFGNIGDANFGFGNSGLAAGLAGMGN IGLGNAGSGNVGWANMGLGNIGFGNTGTNNLGIGLTGDNQSGIGGLNSGTGNIGLFNS GTGNIGFFNSGTANFGLFNSGSYNTGIGNSGVASTGLVNAGGFNTGVANAGSYNTGSF NAGDTNTGGFNPGSTNTGWFNTGNANTGVANAGNVNTGALITGNFSNGILWRGNYEGL AGFSFGYPIPLFPAVGADVTGDIGPATIIPPIHIPSIPLGFAAIGHIGPISIPNIAIP SIHLGIDPTFDVGPITVDPITLTIPGLSLDAAVSEIRMTSGSSSGFKVRPSFSFFAVG PDGMPGGEVSILQPFTVAPINLNPTTLHFPGFTIPTGPIHIGLPLSLTIPGFTIPGGT LIPQLPLGLGLSGGTPPFDLPTVVIDRIPVELHASTTIGPVSLPIFGFGGAPGFGNDT TAPSSGFFNTGGGGGSGFSNSGSGMSGVLNAISDPLLGSASGFANFGTQLSGILNRGA GISGVYNTGTLGLVTSAFVSGFMNVGQQLSGLLFAGTGP" gene 3753765..3754256 /locus_tag="Rv3348" /db_xref="GeneID:888110" CDS 3753765..3754256 /locus_tag="Rv3348" /function="POSSIBLY INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1608'." /note="Rv3348, (MTV004.04), len: 163 aa. Probable transposase, partially similar to several insertion elements e.g. P19834|YI11_STRCL INSERTION ELEMENT IS116 HYPOTHETICAL 44.8 KDA PROTEIN (SIMILAR TO IS900 OF MYCOBACTERIUM PARATUBERCULOSIS) from Streptomyces clavuligerus (399 aa), FASTA scores: opt: 146, E(): 0.016, (29.1% identity in 158 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217865.1" /db_xref="GI:15610484" /db_xref="GOA:P96234" /db_xref="UniProtKB/TrEMBL:P96234" /db_xref="GeneID:888110" /translation="MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPT LAGLRTLTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGAIV GKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVIDANRSWRRLMS LAR" repeat_region 3753765..3754253 /note="IS1608', len: 489 bp. Insertion sequence IS1608'." /mobile_element="insertion sequence:IS1608'" gene complement(3754293..3755033) /locus_tag="Rv3349c" /db_xref="GeneID:888126" CDS complement(3754293..3755033) /locus_tag="Rv3349c" /function="POSSIBLY INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1561'." /note="Rv3349c, (MTV004.05c), len: 246 aa. Probable transposase pseudogene fragment, similar to part of Q50911|U10634 IS204 PUTATIVE TRANSPOSASE from NOCARDIA ASTEROIDES (377 aa), FASTA scores: opt: 288, E(): 8.3e-11, (48.5% identity in 97 aa overlap); and others." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217866.1" /db_xref="GI:15610485" /db_xref="GOA:Q93IG7" /db_xref="UniProtKB/TrEMBL:Q93IG7" /db_xref="GeneID:888126" /translation="MAIDPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRR RVTWAFHDRRGRKIDPQWANRRRLLTARERLSDKSFAKMRNRINAVDPRAQILSAWIA KEELRTLLSTVRTGGDPHLARHHLHRFLPGASTRRSPNCSPWPPPLTSHPRSTPSWSP ASPTRASVVGEVAEMLGDIDGQCVQVEVPVPERGPAGCGGLDGLGRAGVSATPRVCAA MTAVNVAGRCAGQQADVGPTPQHRCRGR" repeat_region complement(3754296..3755033) /note="IS1561', len: 738 bp. Insertion sequence IS1561'." /mobile_element="insertion sequence:IS1561'" gene complement(3755952..3767102) /gene="PPE56" /locus_tag="Rv3350c" /db_xref="GeneID:888113" CDS complement(3755952..3767102) /gene="PPE56" /locus_tag="Rv3350c" /function="UNKNOWN" /note="Rv3350c, (MTV004.07c), len: 3716 aa. Member of the Mycobacterium tuberculosis PPE family of Gly-, Ala-, Asn-rich proteins, similar to many Mycobacterium tuberculosis proteins from strains H37Rv and CDC1551, e.g. O50378|Rv3347c|MTV004.03c (3157 aa), FASTA scores: opt: 6497, E(): 0, (61.65% identity in 3756 aa overlap); MTCY28_16, MTV050_2, MTY13E10_17, MTCY63_10, MTCY180_1, MTCY63_9, MTV050_1, MTV014_3, MTY13E10_15; etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177964.1" /db_xref="GI:57117096" /db_xref="UniProtKB/TrEMBL:Q6MWX8" /db_xref="GeneID:888113" /translation="MEFPVLPPEINSVLMYSGAGSSPLLAAAAAWDGLAEELGSAAVS FGQVTSGLTAGVWQGAAAAAMAAAAAPYAGWLGSVAAAAEAVAGQARVVVGVFEAALA ATVDPALVAANRARLVALAVSNLLGQNTPAIAAAEAEYELMWAADVAAMAGYHSGASA AAAALPAFSPPAQALGGGVGAFLTALFASPAKALSLNAGLGNVGNYNVGLGNVGVFNL GAGNVGGQNLGFGNAGGTNVGFGNLGNGNVGFGNSGLGAGLAGLGNIGLGNAGSSNYG FANLGVGNIGFGNTGTNNVGVGLTGNHLTGIGGLNSGTGNIGLFNSGTGNVGFFNSGT GNFGVFNSGNYNTGVGNAGTASTGLFNAGNFNTGVVNVGSYNTGSFNAGDTNTGGFNP GGVNTGWLNTGNTNTGIANSGNVNTGAFISGNFNNGVLWVGDYQGLFGVSAGSSIPAI PIGLVLNGDIGPITIQPIPILPTIPLSIHQTVNLGPLVVPDIVIPAFGGGIGIPINIG PLTITPITLFAQQTFVNQLPFPTFSLGKITIPQIQTFDSNGQLVSFIGPIVIDTTIPG PTNPQIDLTIRWDTPPITLFPNGISAPDNPLGLLVSVSISNPGFTIPGFSVPAQPLPL SIDIEGQIDGFSTPPITIDRIPLTVGGGVTIGPITIQGLHIPAAPGVGNTTTAPSSGF FNSGAGGVSGFGNVGAGSSGWWNQAPSALLGAGSGVGNVGTLGSGVLNLGSGISGFYN TSVLPFGTPAAVSGIGNLGQQLSGVSAAGTTLRSMLAGNLGLANVGNFNTGFGNVGDV NLGAANIGGHNLGLGNVGDGNLGLGNIGHGNLGFANLGLTAGAAGVGNVGFGNAGINN YGLANMGVGNIGFANTGTGNIGIGLVGDHRTGIGGLNSGIGNIGLFNSGTGNVGFFNS GTGNFGIGNSGRFNTGIGNSGTASTGLFNAGSFSTGIANTGDYNTGSFNAGDTNTGGF NPGGINTGWFNTGHANTGLANAGTFGTGAFMTGDYSNGLLWRGGYEGLVGVRVGPTIS QFPVTVHAIGGVGPLHVAPVPVPAVHVEITDATVGLGPFTVPPISIPSLPIASITGSV DLAANTISPIRALDPLAGSIGLFLEPFRLSDPFITIDAFQVVAGVLFLENIIVPGLTV SGQILVTPTPIPLTLNLDTTPWTLFPNGFTIPAQTPVTVGMEVANDGFTFFPGGLTFP RASAGVTGLSVGLDAFTLLPDGFTLDTVPATFDGTILIGDIPIPIIDVPAVPGFGNTT TAPSSGFFNTGGGGGSGFANVGAGTSGWWNQGHDVLAGAGSGVANAGTLSSGVLNVGS GISGWYNTSTLGAGTPAVVSGIGNLGQQLSGFLANGTVLNRSPIVNIGWADVGAFNTG LGNVGDLNWGAANIGAQNLGLGNLGSGNVGFGNIGAGNVGFANSGPAVGLAGLGNVGL SNAGSNNWGLANLGVGNIGLANTGTGNIGIGLVGDYQTGIGGLNSGSGNIGLFNSGTG NVGFFNTGTGNFGLFNSGSFNTGIGNSGTGSTGLFNAGNFNTGIANPGSYNTGSFNVG DTNTGGFNPGDINTGWFNTGIMNTGTRNTGALMSGTDSNGMLWRGDHEGLFGLSYGIT IPQFPIRITTTGGIGPIVIPDTTILPPLHLQITGDADYSFTVPDIPIPAIHIGINGVV TVGFTAPEATLLSALKNNGSFISFGPITLSNIDIPPMDFTLGLPVLGPITGQLGPIHL EPIVVAGIGVPLEIEPIPLDAISLSESIPIRIPVDIPASVIDGISMSEVVPIDASVDI PAVTITGTTISAIPLGFDIRTSAGPLNIPIIDIPAAPGFGNSTQMPSSGFFNTGAGGG SGIGNLGAGVSGLLNQAGAGSLVGTLSGLGNAGTLASGVLNSGTAISGLFNVSTLDAT TPAVISGFSNLGDHMSGVSIDGLIAILTFPPAESVFDQIIDAAIAELQHLDIGNALAL GNVGGVNLGLANVGEFNLGAGNVGNINVGAGNLGGSNLGLGNVGTGNLGFGNIGAGNF GFGNAGLTAGAGGLGNVGLGNAGSGSWGLANVGVGNIGLANTGTGNIGIGLTGDYRTG IGGLNSGTGNLGLFNSGTGNIGFFNTGTGNFGLFNSGSYSTGVGNAGTASTGLFNAGN FNTGLANAGSYNTGSLNVGSFNTGGVNPGTVNTGWFNTGHTNTGLFNTGNVNTGAFNS GSFNNGALWTGDYHGLVGFSFSIDIAGSTLLDLNETLNLGPIHIEQIDIPGMSLFDVH EIVEIGPFTIPQVDVPAIPLEIHESIHMDPIVLVPATTIPAQTRTIPLDIPASPGSTM TLPLISMRFEGEDWILGSTAAIPNFGDPFPAPTQGITIHTGPGPGTTGELKISIPGFE IPQIATTRFLLDVNISGGLPAFTLFAGGLTIPTNAIPLTIDASGALDPITIFPGGYTI DPLPLHLALNLTVPDSSIPIIDVPPTPGFGNTTATPSSGFFNSGAGGVSGFGNVGSNL SGWWNQAASALAGSGSGVLNVGTLGSGVLNVGSGVSGIYNTSVLPLGTPAVLSGLGNV GHQLSGVSAAGTALNQIPILNIGLADVGNFNVGFGNVGDVNLGAANLGAQNLGLGNVG TGNLGFANVGHGNIGFGNSGLTAGAAGLGNTGFGNAGSANYGFANQGVRNIGLANTGT GNIGIGLVGDNLTGIGGLNSGAGNIGLFNSGTGNIGFFNSGTGNFGIGNSGSFNTGIG NSGTGSTGLFNAGSFNTGVANAGSYNTGSFNAGDTNTGGFNPGTINTGWFNTGHTNTG IANSGNVGTGAFMSGNFSNGLLWRGDHEGLFSLFYSLDVPRITIVDAHLDGGFGPVVL PPIPVPAVNAHLTGNVAMGAFTIPQIDIPALTPNITGSAAFRIVVGSVRIPPVSVIVE QIINASVGAEMRIDPFEMWTQGTNGLGITFYSFGSADGSPYATGPLVFGAGTSDGSHL TISASSGAFTTPQLETGPITLGFQVPGSVNAITLFPGGLTFPATSLLNLDVTAGAGGV DIPAITWPEIAASADGSVYVLASSIPLINIPPTPGIGNSTITPSSGFFNAGAGGGSGF GNFGAGTSGWWNQAHTALAGAGSGFANVGTLHSGVLNLGSGVSGIYNTSTLGVGTPAL VSGLGNVGHQLSGLLSGGSAVNPVTVLNIGLANVGSHNAGFGNVGEVNLGAANLGAHN LGFGNIGAGNLGFGNIGHGNVGVGNSGLTAGVPGLGNVGLGNAGGNNWGLANVGVGNI GLANTGTGNIGIGLTGDYQTGIGGLNSGAGNLGLFNSGAGNVGFFNTGTGNFGLFNSG SFNTGVGNSGTGSTGLFNAGSFNTGVANAGSYNTGSFNVGDTNTGGFNPGSINTGWLN AGNANTGVANAGNVNTGAFVTGNFSNGILWRGDYQGLAGFAVGYTLPLFPAVGADVSG GIGPITVLPPIHIPPIPVGFAAVGGIGPIAIPDISVPSIHLGLDPAVHVGSITVNPIT VRTPPVLVSYSQGAVTSTSGPTSEIWVKPSFFPGIRIAPSSGGGATSTQGAYFVGPIS IPSGTVTFPGFTIPLDPIDIGLPVSLTIPGFTIPGGTLIPTLPLGLALSNGIPPVDIP AIVLDRILLDLHADTTIGPINVPIAGFGGAPGFGNSTTLPSSGFFNTGAGGGSGFSNT GAGMSGLLNAMSDPLLGSASGFANFGTQLSGILNRGAGISGVYNTGALGVVTAAVVSG FGNVGQQLSGLLFTGVGP" gene complement(3767346..3768140) /locus_tag="Rv3351c" /db_xref="GeneID:888109" CDS complement(3767346..3768140) /locus_tag="Rv3351c" /function="UNKNOWN" /note="Rv3351c, (MTV004.08c), len: 264 aa. Hypothetical protein, highly similar to C-terminal region (aa 292-479) of O53608|Rv0063|MTV030.06 OXIDOREDUCTASE from Mycobacterium tuberculosis (479 aa), FASTA scores: opt: 699, E(): 1.7e-36, (54.75% identity in 190 aa overlap). Shows some similarity to Q9KYD6|SCD72A.20 PUTATIVE LIPOPROTEIN (FRAGMENT) from Streptomyces coelicolor (403 aa), FASTA scores: opt: 192, E(): 9.1e-05, (27.9% identity in 154 aa overlap); and P71091|YGAK HYPOTHETICAL 54.4 KDA PROTEIN from Bacillus subtilis (480 aa), FASTA scores: opt: 174, E(): 0.0014, (26.5% identity in 166 aa overlap). Note that the two upstream ORFs Rv3352c and Rv3353c also show similarity to Rv0063 (MTV030_7). Sequence was checked but no errors found." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217868.1" /db_xref="GI:15610487" /db_xref="UniProtKB/TrEMBL:O50380" /db_xref="GeneID:888109" /translation="MLASCPARSGAAVADAIKSAVGVQPSGVEHKTLRRMDLVRYLAG GHTTYPPEGFVAGSDVIGTTNPAAAQAIVAAIGTWPPAAGRASALIDSLGGAVGDMDP EGSAFPWCRQSAVVQWYVNTPSDGQVATANKWLSDAHHAVQHFSVGGYVNYLEANAAA SQYFGANLSRLTTVRRKYDPDRIMYSGLDFSTRQVAERLLPALGFRVRFGVLVIRCAL CTDTVKRLGTLPNLTWSRLKVNVAVTQEQAGVMDLPALPVRRTPRR" gene complement(3768222..3768593) /locus_tag="Rv3352c" /db_xref="GeneID:888104" CDS complement(3768222..3768593) /locus_tag="Rv3352c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3352c, (MTV004.09c), len: 123 aa. Possible oxidoreductase (EC 1.-.-.-), similar to part of several oxidoreductases (and hypothetical proteins) from diverse organisms e.g. Q9KYD6|SCD72A.20 PUTATIVE LIPOPROTEIN (FRAGMENT) from Streptomyces coelicolor (403 aa), FASTA scores: opt: 348, E(): 7.9e-15, (51.0% identity in 102 aa overlap); BAB53081|MLR6875 PROBABLE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (479 aa), FASTA scores: opt: 262, E(): 2.3e-09, (53.85% identity in 78 aa overlap); O94206|OX1 OXIDOREDUCTASE from Claviceps purpurea (Ergot fungus) (483 aa), FASTA scores: opt: 245, E(): 2.7e-08, (42.6% identity in 115 aa overlap); Q9KHK2|ENCM PUTATIVE FAD-DEPENDENT OXYGENASE ENCM from Streptomyces maritimus (464 aa), FASTA scores: opt: 238, E(): 7.2e-08, (43.95% identity in 91 aa overlap); etc. Also highly similar to part of O53608|Rv0063|MTV030.06 OXIDOREDUCTASE (479 aa), FASTA scores: opt: 599, E(): 1.6e-30, (71.55% identity in 123 aa overlap); and to other Mycobacterium tuberculosis proteins e.g. Rv3353c and Rv3351c. All show similarity to a family of oxidoreductases in Mycobacterium tuberculosis, suggesting that frameshift mutations may have occurred. Sequence has been checked but no errors were found." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217869.1" /db_xref="GI:15610488" /db_xref="GOA:O50381" /db_xref="UniProtKB/TrEMBL:O50381" /db_xref="GeneID:888104" /translation="MSAATDLYAVHQALAGESRAIPTGSCPTVGVAGLTLGGGLGADS RHAGLTCDALKSATVVLPGGDAVSASADDHAELFWALRGGGGGNFGVTTSMTFARFPT ADCDVVRVDFAPSAAAQVLVG" gene complement(3768736..3768996) /locus_tag="Rv3353c" /db_xref="GeneID:888095" CDS complement(3768736..3768996) /locus_tag="Rv3353c" /function="UNKNOWN" /note="Rv3353c, (MTV004.10c), len: 86 aa. Hypothetical protein, showing some similarity to Q9X5Q4|MITR MITR PROTEIN from Streptomyces lavendulae (514 aa), FASTA scores: opt: 134, E(): 0.09, (29.5% identity in 78 aa overlap); and weak to Q49720|B1549_C3_218 from Mycobacterium leprae (222 aa), FASTA scores: opt: 99, E(): 8.8, (32.9% identity in 76 aa overlap). But highly similar to N-terminal part of O53608|Rv0063|MTV030.06 OXIDOREDUCTASE from Mycobacterium tuberculosis (479 aa), FASTA scores: opt: 305, E(): 4.9e-13, (52.9% identity in 87 aa overlap); and some similarity can be found with Rv3352c and Rv3351c. All show similarity to a family of oxidoreductases in Mycobacterium tuberculosis, suggesting that frameshift mutations may have occurred. Sequence has been checked but no errors were found. Start changed since original submission." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217870.2" /db_xref="GI:57117097" /db_xref="UniProtKB/TrEMBL:O50382" /db_xref="GeneID:888095" /translation="MSRQTFLRGAVGAPATSAVFPTILARATPGDGWASLASSIGGQV LLPANGRAFTSGKQIFNSNYSGLNPAAVVTVASQADVRKAVS" gene 3769111..3769500 /locus_tag="Rv3354" /db_xref="GeneID:887643" CDS 3769111..3769500 /locus_tag="Rv3354" /function="UNKNOWN" /note="Rv3354, (MTV004.11), len: 129 aa. Conserved hypothetical protein, equivalent (but shorter 29 aa) to Q9CCM4|ML0676 HYPOTHETICAL PROTEIN from Mycobacterium leprae (158 aa), FASTA scores: opt: 467, E(): 3.3e-21, (55.9% identity in 127 aa overlaps). Highly similar to O33192|LPRJ|Rv1690|MTCI125.12 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (127 aa), FASTA scores: opt: 329, E(): 4.7e-13, (46.95% identity in 115 aa overlap); and also similar to other Mycobacterium tuberculosis hypothetical proteins e.g. O07222|Rv1810|MTCY16F9.04c (118 aa), FASTA scores: opt: 195, E(): 4.2e-05, (37.15% identity in 113 aa overlap); MTCI125_11, MTCY16F9_4, MTV049_25." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217871.1" /db_xref="GI:15610490" /db_xref="UniProtKB/TrEMBL:O50383" /db_xref="GeneID:887643" /translation="MNLRRHQTLTLRLLAASAGILSAAAFAAPAQANPVDDAFIAALN NAGVNYGDPVDAKALGQSVCPILAEPGGSFNTAVASVVARAQGMSQDMAQTFTSIAIS MYCPSVMADVASGNLPALPDMPGLPGS" gene complement(3769514..3769807) /locus_tag="Rv3355c" /db_xref="GeneID:887660" CDS complement(3769514..3769807) /locus_tag="Rv3355c" /function="UNKNOWN" /note="Rv3355c, (MTV004.12c), len: 97 aa. Hypothetical protein, equivalent to O32878|MLCB1779.16c|ML0675 HYPOTHETICAL 9.6 KDA PROTEIN from Mycobacterium leprae (91 aa), FASTA scores: opt: 439, E(): 3.9e-23, (78.9% identity in 90 aa overlap). Identical, but with a gap, to O50377|Rv3346c|MTV004.02c HYPOTHETICAL 8.9 KDA PROTEIN from Mycobacterium tuberculosis (85 aa), FASTA scores: opt: 413, E(): 2.1e-21, (85.55% identity in 97 aa overlap). Also some similarity to other proteins e.g. Q9K3J5|SC2A6.10 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (178 aa), FASTA scores: opt: 147, E(): 0.003, (31.25% identity in 80 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217872.1" /db_xref="GI:15610491" /db_xref="UniProtKB/TrEMBL:O50384" /db_xref="GeneID:887660" /translation="MTVRAVFRRTVGAQWPILLVGSIFAVGFVLAGANFWRRGALLIG IGVGVAAVLRLVLSEERAGLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG" repeat_region 3769514..3769720 /note="207 bp imperfect direct repeat 2, 199/207 bp identical to first copy at 3743198..3743404" repeat_region 3769754..3769862 /note="109 bp imperfect direct repeat 2, 95/109 bp identical to first copy at 3743402..3743510" gene complement(3769804..3770649) /gene="folD" /locus_tag="Rv3356c" /db_xref="GeneID:888145" CDS complement(3769804..3770649) /gene="folD" /locus_tag="Rv3356c" /EC_number="1.5.1.5" /EC_number="3.5.4.9" /function="NECESSARY FOR THE BIOSYNTHESIS OF PURINES, THYMYDYLATE, METHIONINE, HISTIDINE, PANTOTHENATE, AND FORMYL TRNA-MET [CATALYTIC ACTIVITY: 5,10-METHYLENETETRAHYDROFOLATE + NADP(+) = 5,10-METHENYLTETRAHYDROFOLATE + NADPH] [CATALYTIC ACTIVITY: 5,10-METHENYLTETRAHYDROFOLATE + H(2)O = 10-FORMYLTETRAHYDROFOLATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 5,10-methenyltetrahydrofolate from 5,10-methylenetetrahydrofolate and subsequent formation of 10-formyltetrahydrofolate from 5,10-methenyltetrahydrofolate" /codon_start=1 /transl_table=11 /product="bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase" /protein_id="NP_217873.1" /db_xref="GI:15610492" /db_xref="GOA:O50385" /db_xref="UniProtKB/TrEMBL:O50385" /db_xref="GeneID:888145" /translation="MGAIMLDGKATRDEIFGDLKQRVAALDAAGRTPGLGTILVGDDP GSQAYVRGKHADCAKVGITSIRRDLPADISTATLNETIDELNANPDCTGYIVQLPLPK HLDENAALERVDPAKDADGLHPTNLGRLVLGTPAPLPCTPRGIVHLLRRYDISIAGAH VVVIGRGVTVGRPLGLLLTRRSENATVTLCHTGTRDLPALTRQADIVVAAVGVAHLLT ADMVRPGAAVIDVGVSRTDDGLVGDVHPDVWELAGHVSPNPGGVGPLTRAFLLTNVVE LAERR" gene 3770773..3771048 /locus_tag="Rv3357" /db_xref="GeneID:888135" CDS 3770773..3771048 /locus_tag="Rv3357" /function="UNKNOWN" /note="Rv3357, (MTV004.14), len: 91 aa. Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. Q9Z4V7|YU1E_STRCO (alias CAC37261|SCBAC17D6.02) ORFU1E (BELONGS TO THE PHD/YEFM FAMILY) from Streptomyces coelicolor (87 aa), FASTA scores: opt: 344, E(): 1.9e-17, (62.05% identity in 87 aa overlap); P46147|YEFM_ECOLI|B2017 from Escherichia coli strain K12 (83 aa), FASTA scores: opt: 215, E(): 1.6e-08, (50.0% identity in 72 aa overlap); BAB58570|SAV2408 from Staphylococcus aureus subsp. aureus Mu50 (83 aa), FASTA scores: opt: 161, E(): 8.8e-05, (39.95% identity in 77 aa overlap); Q9Z5W8 PUTATIVE PHD PROTEIN from Francisella novicid (85 aa), FASTA scores: opt: 143, E(): 0.0016, (28.9% identity in 83 aa overlap); etc. Also similar to Rv1247c|MTV006.19c (89 aa) (36.9% identity in 84 aa overlap). SEEMS TO BELONG TO THE PHD/YEFM FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217874.1" /db_xref="GI:15610493" /db_xref="UniProtKB/Swiss-Prot:P65067" /db_xref="GeneID:888135" /translation="MSISASEARQRLFPLIEQVNTDHQPVRITSRAGDAVLMSADDYD AWQETVYLLRSPENARRLMEAVARDKAGHSAFTKSVDELREMAGGEE" repeat_region 3770994..3771091 /note="98 bp imperfect direct repeat 2, 82/98 bp identical to the first copy at 3743508..3743605" gene 3771045..3771302 /locus_tag="Rv3358" /db_xref="GeneID:888139" CDS 3771045..3771302 /locus_tag="Rv3358" /function="UNKNOWN" /note="Rv3358, (MTV004.15), len: 85 aa. Conserved hypohetical protein, highly similar to other hypohetical proteins e.g. Q9Z4V8|SCBAC17D6.03 from Streptomyces coelicolor (84 aa), FASTA scores: opt: 393, E(): 1.1e-21, (59.75% identity in 82 aa overlap); P56605|YOEB_ECOLI from Escherichia coli (84 aa), FASTA scores: opt: 305, E(): 2.2e-15, (49.35% identity in 77 aa overlap); Q9Z5W7 PUTATIVE DOC PROTEIN from Francisella novicida (68 aa), FASTA scores: opt: 253, E(): 9.6e-12, (51.6% identity in 62 aa overlap); BAB58569|SAV2407 from Staphylococcus aureus subsp. aureus Mu50 (88 aa), FASTA scores: opt: 250, E(): 2e-11, (40.5% identity in 84 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217875.1" /db_xref="GI:15610494" /db_xref="UniProtKB/Swiss-Prot:P64528" /db_xref="GeneID:888139" /translation="MRSVNFDPDAWEDFLFWLAADRKTARRITRLIGEIQRDPFSGIG KPEPLQGELSGYWSRRIDDEHRLVYRAGDDEVTMLKARYHY" gene 3771344..3772534 /locus_tag="Rv3359" /db_xref="GeneID:887668" CDS 3771344..3772534 /locus_tag="Rv3359" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3359, (MTV004.16), len: 396 aa. Possible oxidoreductase (EC 1.-.-.-), similar to N-terminal part of various proteins (hypothetical unknowns or oxidoreductases) e.g. Q9ZB94 HYPOTHETICAL 69.3 KDA PROTEIN from Rhodococcus erythropolis (649 aa), FASTA scores: opt: 509, E(): 3e-24, (30.0% identity in 380 aa overlap); O29991|AF0248 NADH-DEPENDENT FLAVIN OXIDOREDUCTASE from Archaeoglobus fulgidus (378 aa), FASTA scores: opt: 478, E(): 1.6e-22, (32.45% identity in 379 aa overlap); Q9HUH9|PA4986 PROBABLE OXIDOREDUCTASE from Pseudomonas aeruginosa (648 aa), FASTA scores: opt: 412, E(): 3.3e-18, (30.45% identity in 384 aa overlap); Q9KCT8|BH1481 NADH OXIDASE from Bacillus halodurans (338 aa), FASTA scores: opt: 404, E(): 6.1e-18, (30.2% identity in 275 aa overlap); etc. Some weak similarity to Mycobacterium leprae MLCB1779_10." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217876.1" /db_xref="GI:15610495" /db_xref="GOA:O50388" /db_xref="UniProtKB/TrEMBL:O50388" /db_xref="GeneID:887668" /translation="MAPGSCEAPDVFNPAKLGPLTLRNRVIKAATFEARTPDALVTDD LIEYHRLPAAGGVAMTTVAYCAVSPGGRTGGNQIWMRPHAVPGLRRLTEAIHAEGAAI SAQIGHAGPVADARSNQATALAPVRFFNPIAMRFAQKATREDIDDVLAAHAHAARLAV DAGFDAVEIHLGHNYLASAFLSPLLNRRDDEFGGSLQNRAKVARGLVMAVRRAVRQQV AVTAKLNMTDGIRGGITVDEALTTARWLQDDGGLDAIELTAGSSLVNPMYLFRGDAPV KEFAAAFKPPLRWGIRMTGHRFFREYPYRDAYLLREARLFRAELTIPLILLGGITNRT TMDLAMAEGFEFVAMARALLAEPDLVNRIAAEGSQVRSACTHCNQCMATIYRRTHCVV TGAP" gene 3772651..3773019 /locus_tag="Rv3360" /db_xref="GeneID:888136" CDS 3772651..3773019 /locus_tag="Rv3360" /function="UNKNOWN" /note="Rv3360, (MTV004.17), len: 122 aa. Hypothetical protein, highly similar to the N-terminus of O65934|Rv1747|MTCY28.10|MTCY04C12.31 probable ABC-transporter ATP-binding protein from Mycobacterium tuberculosis (865 aa), FASTA scores: opt: 480, E(): 4.7e-25, (61.0% identity in 118 aa overlap); and some similarity with the N-terminus of P96214|Rv3863|MTCY01A6.05c HYPOTHETICAL 41.1 KDA PROTEIN from Mycobacterium tuberculosis (392 aa), FASTA scores: opt: 138, E(): 0.033, (31.95% identity in 97 aa overlap). Some weak similarity with the N-terminus of other hypothetical proteins e.g. P73823|CYAA|SLR1991 ADENYLATE CYCLASE from Synechocystis sp. strain PCC 6803 (337 aa), FASTA scores: opt: 127, E(): 0.16, (28.55% identity in 112 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217877.1" /db_xref="GI:15610496" /db_xref="UniProtKB/TrEMBL:O50389" /db_xref="GeneID:888136" /translation="MSRPHPPVLTVRSDRSQQCFAAGRDVVVGSDLRADMRVAHPLIA RAHLLLRFDRGNWIAIDNDSQSGMFVDGQRVSEVDIYDGLTINIGKPTGPWITFEVGH HQGIIGRLSRTPSSRPGSPI" gene complement(3773016..3773567) /locus_tag="Rv3361c" /db_xref="GeneID:888130" CDS complement(3773016..3773567) /locus_tag="Rv3361c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3361c, (MTV004.18c), len: 183 aa. Conserved hypothetical protein, with some similarity to various proteins e.g. P74221|YB52_SYNY3|SLR1152 HYPOTHETICAL 36.2 KDA PROTEIN SLR (CONTAINS 5 PENTAPEPTIDE REPEAT DOMAINS) from Synechocystis sp. strain PCC 6803 (331 aa), FASTA scores: opt: 252, E(): 3.9e-10, (30.55% identity in 167 aa overlap); Q9SE95 FH PROTEIN INTERACTING PROTEIN FIP2 from Arabidopsis thaliana (Mouse-ear cress) (298 aa), FASTA scores: opt: 207, E(): 4.4e-07, (30.35% identity in 168 aa overlap); Q9A735|CC1891 PENTAPEPTIDE REPEAT FAMILY PROTEIN from Caulobacter crescentus (250 aa), FASTA scores: opt: 181, E(): 2.3e-05, (24.05% identity in 187 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217878.1" /db_xref="GI:15610497" /db_xref="UniProtKB/TrEMBL:O50390" /db_xref="GeneID:888130" /translation="MQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQH RGSAFRNCTFERTTLWHSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGL NLTGCRLRETSLVDTDLRKCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLV GARVDVDQAVAFAAAHGLCLAGG" gene complement(3773574..3774155) /locus_tag="Rv3362c" /db_xref="GeneID:888088" CDS complement(3773574..3774155) /locus_tag="Rv3362c" /function="UNKNOWN" /note="Rv3362c, (MTV004.19c), len: 193 aa. Probable ATP/GTP-binding protein, similar to others from Streptomyces coelicolor e.g. O86519|SC1C2.18c (174 aa), FASTA scores: opt: 731, E(): 9.8e-41, (66.85% identity in 169 aa overlap); Q9XAE1|SC6G9.41c (191 aa), FASTA scores: opt: 730, E(): 1.2e-40, (63.55% identity in 173 aa overlap); Q9L235|SC1A2.06 (184 aa), FASTA scores: opt: 650, E(): 1.9e-35, (55.95% identity in 177 aa overlap); Q9RJ74|SCI41.10c (176 aa), FASTA scores: opt: 618, E(): 2.3e-33, (55.9% identity in 161 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="ATP/GTP-binding protein" /protein_id="NP_217879.1" /db_xref="GI:15610498" /db_xref="GOA:O50391" /db_xref="UniProtKB/TrEMBL:O50391" /db_xref="GeneID:888088" /translation="MALKHSEASGTASTKIVIAGGFGSGKTTFVGAVSEIMPLRTEAM VTDASAGVDMLEATPDKRSTTVAMDFGRITLGEDLVLYLFGTPGQRRFWFMWDDLVRG AIGAIVLVDCRRLQDSFAAVDFFEHRNLPFLIAINEFDSAPRYPVSAVRDALTLPAHI PVINVDARNRRSATDALIAVSEYALATLSPAGG" misc_feature complement(3774075..3774098) /locus_tag="Rv3362c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3774136..3774504) /locus_tag="Rv3363c" /db_xref="GeneID:888090" CDS complement(3774136..3774504) /locus_tag="Rv3363c" /function="UNKNOWN" /note="Rv3363c, (MTV004.20c), len: 122 aa. Conserved hypothetical protein, similar to others from Streptomyces coelicolor e.g. O86523|SC1C2.23c (132 aa), FASTA scores: opt: 236, E(): 9e-09, (38.5% identity in 122 aa overlap); O86520|SC1C2.19c (190 aa), FASTA scores: opt: 231, E(): 2.7e-08, (41.0% identity in 122 aa overlap); Q9X834|SC9B1.14c (119 aa), FASTA scores: opt: 188, E(): 1.1e-05, (37.5% identity in 120 aa overlap); Q9ADJ4|SCBAC14E8.05 (113 aa), FASTA scores: opt: 167, E(): 0.00025, (33.05% identity in 109 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217880.1" /db_xref="GI:15610499" /db_xref="UniProtKB/TrEMBL:O50392" /db_xref="GeneID:888090" /translation="MFNPAGDRPKAGLVRPYTLTAGRTGTDVDLPLQAPVQTLPAGPA GRWPAYDMRRRILQLCIGSPSVAEISARLDLPVGVARVLVGDLVTSGYLRVHATLTDR STRDERHELIGRTLRGLKAL" gene complement(3774482..3774874) /locus_tag="Rv3364c" /db_xref="GeneID:888085" CDS complement(3774482..3774874) /locus_tag="Rv3364c" /function="UNKNOWN" /note="Rv3364c, (MTV004.21c), len: 130 aa. Conserved hypothetical protein, highly similar to others from Streptomyces coelicolor e.g. O86524|SC1C2.24c (137 aa), FASTA scores: opt: 466, E(): 1.3e-22, (58.6% identity in 116 aa overlap); O86521|SC1C2.20c (140 aa), FASTA scores: opt: 445, E(): 2.7e-21, (56.9% identity in 116 aa overlap); Q9KZI6|SCG8A.13c (145 aa), FASTA scores: opt: 341, E(): 9.5e-15, (51.3% identity in 113 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217881.1" /db_xref="GI:15610500" /db_xref="GOA:O50393" /db_xref="UniProtKB/TrEMBL:O50393" /db_xref="GeneID:888085" /translation="MKARLPDSPLDWLVSKFAREVPGVAHALLVSVDGLPVAASEHLP RERADQLAAVTSGLASLAGGAAQLFDGGQVLQSVVEMQNGYLLLMQVGDGSALAALAA TGCDIGQIGYEMAILVERVGGVVQSCRR" gene complement(3774871..3777501) /locus_tag="Rv3365c" /db_xref="GeneID:887652" CDS complement(3774871..3777501) /locus_tag="Rv3365c" /function="UNKNOWN" /note="Rv3365c, (MTV004.22c), len: 876 aa. Conserved hypothetical protein, similar to various proteins from Streptomyces coelicolor e.g. O86525|SC1C2.25c HYPOTHETICAL 139.7 KDA PROTEIN (SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES) (1329 aa), FASTA scores: opt: 879, E(): 5.4e-32, (29.9% identity in 924 aa overlap) (similarity in N-terminal part for this one); O86522|SC1C2.21c HYPOTHETICAL 119.9 KDA PROTEIN (SIMILAR TO OTHER PROKARYOTIC SENSORY TRANSDUCTION HISTIDINE KINASES) (1111 aa), FASTA scores: opt: 855, E(): 5.6e-31, (28.9% identity in 892 aa overlap) (similarity in N-terminal part for this one); Q9KZI5|SCG8A.14c PUTATIVE MEMBRANE PROTEIN (862 aa), FASTA scores: opt: 791, E(): 3.3e-28, (30.8% identity in 828 aa overlap); Q9KZN0|SC1A8A.22c (943 aa), FASTA scores: opt: 660, E(): 2.5e-22, (27.65% identity in 893 aa overlap); etc. Similar in part to two consecutive Mycobacterium leprae hypothetical ORFs, probably representing a pseudogene: O07701|MLCL383.27 (118 aa), FASTA scores: opt: 430, E(): 1e-12, (58.25% identity in 115 aa overlap); and O07700|MLCL383.26 (111 aa), FASTA scores: opt: 271, E(): 1.3e-05, (50.4% identity in 121 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217882.1" /db_xref="GI:15610501" /db_xref="GOA:Q93IG6" /db_xref="UniProtKB/TrEMBL:Q93IG6" /db_xref="GeneID:887652" /translation="MTMFARPTIPVAAAASDISAPAQPARGKPQQRPPSWSPRNWPVR WKVFTIALLPLVVAMVLAGLRVEAAMASTSGLRLVAARAEMIPAITKYMSALDVAVLA SSTGHDVEGAQKNFTARKYELQTRLADTDVIADVRSGVNTLLNGGQALLDKVLADSIG LRDRVTAYAPLLLTAQNVIDASVRVDSEQIRTQVQGLSRAVGARGQMTMQEILVTRGA DLAEPQLRSAMVTLAGTEPSTLFGMSAALGAGSPDTKNLQQQMVTRMAIMSDPAVALV NNPELLHSIQITRDIAEQVITDTTEAVTKSVQSQATDRRDAAIRDAVLVLAAIATAIV VVLVVARTLVGPMRVLRDGALKVAHTDLDGEIAAVRAGDEPIPEPLAVYTTEEIGQVA HAVDELHTRALLLAGEETRLRLLVNEMFETMSRRSRSLVDQQLSVIDQLERNEEDPAR LDSLFRLDHLAARLRRNSANLLVLAGAQITRDHREPVPLSTVISAAVSEVEDYRRVDI ARVPDCAVVGAAAGGVIHLLAELIDNALRYSSPTTPVRVAAAIGSEGSVLLRISDSGL GMTDADRRMANMRLRAGGEVTPDSARHMGLFVVGRLAGRHGIRVGLRGPVTGEQGTGT TAEVYLPLAVLEGTAPAQPPKPRVFAIKPPCPEPAAADPTDVPAAIGPLPPVTLLPRR TPGSSGIADVPAQPMQQRRRELKTPWWEDRFQQEPKQPPAPEPRPAPPPAKPAPPAGP VDDDVIYRRMLSEMVGDPHELAHSPDLDWKSVWDHGWSAAAEAADKPVQSRTDYGLPV REPGARLVPGAAVPEGPDREHPGAALASNGGLHPGRAPRHAAAVRDPDAVRASISSHF GGVRTGRSHARESSQGPNQQ" misc_feature complement(3775207..3775236) /locus_tag="Rv3365c" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene 3777737..3778201 /gene="spoU" /locus_tag="Rv3366" /db_xref="GeneID:887648" CDS 3777737..3778201 /gene="spoU" /locus_tag="Rv3366" /EC_number="2.1.1.-" /function="CAUSES METHYLATION." /note="Rv3366, (MTV004.23), len: 154 aa. Probable spoU, tRNA/rRNA methylase (EC 2.1.1.-), equivalent to Q9CCU7|ML0419 PUTATIVE tRNA/rRNA METHYLTRANSFERASE from Mycobacterium leprae (158 aa), FASTA scores: opt: 861, E(): 1.2e-50, (83.75% identity in 154 aa overlap); and O07698|MLCL383.24c rRNA METHYLASE from Mycobacterium leprae (169 aa), FASTA scores: opt: 861, E(): 1.3e-50, (83.75% identity in 154 aa overlap). Also highly similar to many members of the spoU family of rRNA methylases e.g. Q9K199|NMB0268 RNA METHYLTRANSFERASE (TRMH FAMILY) from Neisseria meningitidis (serogroup B) (154 aa), FASTA scores: opt: 534, E(): 7.6e-29, (50.0% identity in 154 aa overlap); and Q9JSM8|NMA2218 from Neisseria meningitidis (serogroup A) (154 aa), FASTA scores: opt: 526, E(): 2.6e-28, (49.35% identity in 154 aa overlap); Q9HU57|PA5127 from Pseudomonas aeruginosa (153 aa), FASTA scores: opt: 531, E(): 1.2e-28, (52.95% identity in 151 aa overlap); P33899|YIBK_ECOLI|B3606 from Escherichia coli strain K12 (157 aa), FASTA scores: opt: 511, E(): 2.6e-27, (49.35% identity in 154 aa overlap); etc. BELONGS TO THE RNA METHYLTRANSFERASE TRMH FAMILY." /codon_start=1 /transl_table=11 /product="tRNA/rRNA methylase SpoU" /protein_id="NP_217883.1" /db_xref="GI:15610502" /db_xref="GOA:O50394" /db_xref="UniProtKB/TrEMBL:O50394" /db_xref="GeneID:887648" /translation="MFRLLFVSPRIAPNTGNAIRTCAATGCELHLVEPLGFDLSEPKL RRAGLDYHDLASVTVHASLAHAWEALSPARVFAFTAQATTLFTNVGYRAGDVLMFGPE PTGLDEATLADTHITGQVRIPMLAGRRSLNLSNAAAVAVYEAWRQHGFAGAV" gene 3778568..3780334 /gene="PE_PGRS51" /locus_tag="Rv3367" /db_xref="GeneID:887404" CDS 3778568..3780334 /gene="PE_PGRS51" /locus_tag="Rv3367" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3367, (MTV004.25), len: 588 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002). Similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O50415|Rv3388|MTV004.46 (731 aa), FASTA scores: opt: 1999, E(): 7.2e-72, (55.0% identity in 620 aa overlap); and MTV004_44, MTV043_65, MTV006_15, MTCY63_2, MTCY21B4_13, MTV023_21, MTV008_43, MTCY24A1_4, MTV023_15; etc. Equivalent to AAK47814 from Mycobacterium tuberculosis strain CDC1551 (628 aa) but shorter 37 aa." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177965.1" /db_xref="GI:57117098" /db_xref="UniProtKB/TrEMBL:Q6MWX7" /db_xref="GeneID:887404" /translation="MSFVVAVPEALAAAASDVANIGSALSAANAAAAAGTTGLLAAGA DEVSAALASLFSGHAVSYQQVAAQATALHDQFVQALTGAGGSYALTEAANVQQNLLNA INAPTQALLGRPLIGDGAVGTASSPDGQDGGLLFGNGGAGYNSAATPGMAGGNGGNAG LIGNGGTGGSGGAGAAGGAGGSGGWLYGNGGNGGIGGNAIVAGGAGGNGGAGGAAGLW GSGGSGGQGGNGLTGNDGVNPAPVTNPALNGAAGDSNIEPQTSVLIGTQGGDGTPGGA GVNGGNGGAGGDANGNPANTSIANAGAGGNGAAGGDGGANGGAGGAGGQAASAGSSVG GDGGNGGAGGTGTNGHAGGAGGAGGAGGRGGWLVGNGGNGGNGAAGGNGAIGGTGGAG GVPANQGGNSALGTQPVGGDGGDGGNGGTGGTGGRGGDGGSGGAGGASGWLMGNGGNG GNGGTGGSGGVGGNGGIGGDGAGGGNATSTSSIPFDAHGGNGGAGGDAGHGGTGGDGG DGGHAGTGGRGGLLAGQHANSGNGGGGGTGGAGGTHGTPGSGNAGGTGTGNADSTNGG PGSDGLGGDAFNGSRGTDGNPG" gene complement(3780335..3780979) /locus_tag="Rv3368c" /db_xref="GeneID:887641" CDS complement(3780335..3780979) /locus_tag="Rv3368c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM" /note="Rv3368c, (MTV004.26c), len: 214 aa. Possible oxidoreductase (EC 1.-.-.-), equivalent to O07697|MLCL383.23|ML0418 HYPOTHETICAL 23.6 KDA PROTEIN (PUTATIVE OXIDOREDUCTASE) from Mycobacterium leprae (210 aa), FASTA scores: opt: 1215, E(): 1.5e-74, (81.4% identity in 210 aa overlap). Also similar to O30106|AF0131 PUTATIVE NAD(P)H-FLAVIN OXIDOREDUCTASE from Archaeoglobus fulgidus (194 aa), FASTA scores: opt: 139, E(): 0.028, (29.0% identity in 207 aa overlap); Q60049|NOX_THETH NADH DEHYDROGENASE from Thermus aquaticus (subsp. thermophilus) (205 aa), FASTA scores: opt: 169, E(): 0.00028, (28.3% identity in 212 aa overlap); and shows some similarity to other hypothetical proteins (unknowns or oxidoreductases)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_217885.1" /db_xref="GI:15610504" /db_xref="GOA:O50397" /db_xref="UniProtKB/TrEMBL:O50397" /db_xref="GeneID:887641" /translation="MTLNLSVDEVLTTTRSVRKRLDFDKPVPRDVLMECLELALQAPT GSNSQGWQWVFVEDAAKKKAIADVYLANARGYLSGPAPEYPDGDTRGERMGRVRDSAT YLAEHMHRAPVLLIPCLKGREDESAVGGVSFWASLFPAVWSFCLALRSRGLGSCWTTL HLLDNGEHKVADVLGIPYDEYSQGGLLPIAYTQGIDFRPAKRLPAESVTHWNGW" gene 3780978..3781412 /locus_tag="Rv3369" /db_xref="GeneID:887669" CDS 3780978..3781412 /locus_tag="Rv3369" /function="UNKNOWN" /note="Rv3369, (MTV004.27), len: 144 aa. Conserved hypothetical protein. C-terminus is similar to N-terminus of O07696|MLCL383.22c HYPOTHETICAL 14.7 KDA PROTEIN from Mycobacterium leprae (131 aa), FASTA scores: opt: 174, E(): 6e-05, (67.55% identity in 37 aa overlap). Also some slight similarity to Q9EWU1|3SC5B7.08c from Streptomyces coelicolor (153 aa), FASTA scores: opt: 125, E(): 0.13, (31.05% identity in 116 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217886.1" /db_xref="GI:15610505" /db_xref="UniProtKB/TrEMBL:O50398" /db_xref="GeneID:887669" /translation="MWAGYRWAMSVELTQEVSARLTSDLYGWLTTVARSGQPVPRLVW FYFDGTDLTVYSMPQAAKVAHITAHPQVSLNLDSDGNGAGIIVVGGTAAVVATDVDCR DDAPYWAKYREDAAKFGLTEAIAAYSTRLKITPTRVWTTPTG" gene complement(3781501..3784776) /gene="dnaE2" /locus_tag="Rv3370c" /db_xref="GeneID:887259" CDS complement(3781501..3784776) /gene="dnaE2" /locus_tag="Rv3370c" /EC_number="2.7.7.7" /function="DNA POLYMERASE III IS A COMPLEX, MULTICHAIN ENZYME RESPONSIBLE FOR MOST OF THE REPLICATIVE SYNTHESIS IN BACTERIA. THE ALPHA CHAIN IS THE DNA POLYMERASE. THOUGHT TO BE REGULATED BY Rv2720|LEXA [CATALYTIC ACTIVITY: N DEOXYNUCLEOSIDE TRIPHOSPHATE = N PYROPHOSPHATE + DNA(N)]." /experiment="experimental evidence, no additional details recorded" /note="DNA polymerase involved in damage-induced mutagenesis and translesion synthesis. It is not the major replicative DNA polymerase." /codon_start=1 /transl_table=11 /product="error-prone DNA polymerase" /protein_id="NP_217887.2" /db_xref="GI:161352461" /db_xref="GOA:O50399" /db_xref="UniProtKB/TrEMBL:O50399" /db_xref="GeneID:887259" /translation="MGWSNGPPSWAEMERVLNGKPRHAGVPAFDADGDVPRSRKRGAY QPPGRERVGSSVAYAELHAHSAYSFLDGASTPEELVEEAARLGLCALALTDHDGLYGA VRFAEAAAELDVRTVFGAELSLGATARTERPDPPGPHLLVLARGPEGYRRLSRQLAAA HLAGGEKGKPRYDFDALTEAAGGHWHILTGCRKGHVRQALSQGGPAAAQRALADLVDR FTPSRVSIELTHHGHPLDDERNAALAGLAPRFGVGIVATTGAHFADPSRGRLAMAMAA IRARRSLDSAAGWLAPLGGAHLRSGEEMARLFAWCPEAVTAAAELGERCAFGLQLIAP RLPPFDVPDGHTEDSWLRSLVMAGARERYGPPKSAPRAYSQIEHELKVIAQLRFPGYF LVVHDITRFCRDNDILCQGRGSAANSAVCYALGVTAVDPVANELLFERFLSPARDGPP DIDIDIESDQREKVIQYVYHKYGRDYAAQVANVITYRGRSAVRDMARALGFSPGQQDA WSKQVSHWTGQADDVDGIPEQVIDLATQIRNLPRHLGIHSGGMVICDRPIADVCPVEW ARMANRSVLQWDKDDCAAIGLVKFDLLGLGMLSALHYAKDLVAEHKGIEVDLARLDLS EPAVYEMLARADSVGVFQVESRAQMATLPRLKPRVFYDLVVEVALIRPGPIQGGSVHP YIRRRNGVDPVIYEHPSMAPALRKTLGVPLFQEQLMQLAVDCAGFSAAEADQLRRAMG SKRSTERMRRLRGRFYDGMRALHGAPDEVIDRIYEKLEAFANFGFPESHALSFASLVF YSAWFKLHHPAAFCAALLRAQPMGFYSPQSLVADARRHGVAVHGPCVNASLAHATCEN AGTEVRLGLGAVRYLGAELAEKLVAERTANGPFTSLPDLTSRVQLSVPQVEALATAGA LGCFGMSRREALWAAGAAATGRPDRLPGVGSSSHIPALPGMSELELAAADVWATGVSP DSYPTQFLRADLDAMGVLPAERLGSVSDGDRVLIAGAVTHRQRPATAQGVTFINLEDE TGMVNVLCTPGVWARHRKLAHTAPALLIRGQVQNASGAITVVAERMGRLTLAVGARSR DFR" gene 3784932..3786272 /locus_tag="Rv3371" /db_xref="GeneID:888053" CDS 3784932..3786272 /locus_tag="Rv3371" /function="UNKNOWN" /note="Rv3371, (MTV004.29), len: 446 aa. Hypothetical protein, similar to many Mycobacterium tuberculosis (strains H37Rv and CDC1551) hypothetical proteins e.g. O07035|YV30_MYCTU|Rv3130c|MTCY03A2.28|MTCY164.41c (463 aa), FASTA scores: opt: 556, E(): 7.7e-28, (44.95% identity in 447 aa overlap); MTY20B11_9, MTCY28_26, MTV013_8, MTCY21B4_43, MTCY493_29; etc. Also similar to O07692|MLCL383_9|MLCL383.18c HYPOTHETICAL 14.1 KDA PROTEIN from Mycobacterium leprae (129 aa), FASTA scores: opt: 293, E(): 1.3e-11, (47.85% identity in 117 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217888.1" /db_xref="GI:15610507" /db_xref="GOA:O50400" /db_xref="UniProtKB/Swiss-Prot:O50400" /db_xref="GeneID:888053" /translation="MAQLTALDAGFLKSRDPERHPGLAIGAVAVVNGAAPSYDQLKTV LTERIKSIPRCTQVLATEWIDYPGFDLTQHVRRVALPRPGDEAELFRAIALALERPLD PDRPLWECWIIEGLNGNRWAILIKIHHCMAGAMSAAHLLARLCDDADGSAFANNVDIK QIPPYGDARSWAETLWRMSVSIAGAVCTAAARAVSWPAVTSPAGPVTTRRRYQAVRVP RDAVDAVCHKFGVTANDVALAAITEGFRTVLLHRGQQPRADSLRTLEKTDGSSAMLPY LPVEYDDPVRRLRTVHNRSQQSGRRQPDSLSDYTPLMLCAKMIHALARLPQQGIVTLA TSAPRPRHQLRLMGQKMDQVLPIPPTALQLSTGIAVLSYGDELVFGITADYDAASEMQ QLVNGIELGVARLVALSDDSVLLFTKDRRKRSSRALPSAARRGRPSVPTARARH" gene 3786314..3787489 /gene="otsB2" /locus_tag="Rv3372" /db_xref="GeneID:888137" CDS 3786314..3787489 /gene="otsB2" /locus_tag="Rv3372" /EC_number="3.1.3.12" /function="INVOLVED IN OSMOREGULATORY TREHALOSE BIOSYNTHESIS. Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway) [CATALYTIC ACTIVITY: TREHALOSE 6-PHOSPHATE + H(2)O = TREHALOSE + ORTHOPHOSPHATE]." /note="Rv3372, (MTV004.30),len: 391 aa. Possible otsB2, trehalose-6-phosphate phosphatase (EC 3.1.3.12), equivalent to Q49734|OTSB2|OTSP|B1620_F1_1|MLCL383.17c PUTATIVE TREHALOSE-PHOSPHATASE from Mycobacterium leprae (429 aa), FASTA scores: opt: 1675, E(): 2.4e-91, (67.05% identity in 425 aa overlap). Also weakly similar to several trehalose phosphatases e.g. Q9C8B3|F10O5.8 from Arabidopsis thaliana (Mouse-ear cress) (366 aa), FASTA scores: opt: 432, E(): 3.1e-18, (36.65% identity in 281 aa overlap); O27788|MTH1760 from Methanobacterium thermoautotrophicum (264 aa), FASTA scores: opt: 347, E(): 2.5e-13, (30.75% identity in 221 aa overlap); Q9FWQ2 from Oryza sativa (Rice) (382 aa), FASTA scores: opt: 338, E(): 1.1e-12, (32.5% identity in 320 aa overlap); etc. Also similar to part of Mycobacterium tuberculosis Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c (1327 aa), FASTA scores: opt: 1192, E(): 1.6e-62, (56.65% identity in 339 aa overlap)." /codon_start=1 /transl_table=11 /product="trehalose 6-phosphate phosphatase" /protein_id="NP_217889.1" /db_xref="GI:15610508" /db_xref="GOA:O50401" /db_xref="UniProtKB/TrEMBL:O50401" /db_xref="GeneID:888137" /translation="MRKLGPVTIDPRRHDAVLFDTTLDATQELVRQLQEVGVGTGVFG SGLDVPIVAAGRLAVRPGRCVVVSAHSAGVTAARESGFALIIGVDRTGCRDALRRDGA DTVVTDLSEVSVRTGDRRMSQLPDALQALGLADGLVARQPAVFFDFDGTLSDIVEDPD AAWLAPGALEALQKLAARCPIAVLSGRDLADVTQRVGLPGIWYAGSHGFELTAPDGTH HQNDAAAAAIPVLKQAAAELRQQLGPFPGVVVEHKRFGVAVHYRNAARDRVGEVAAAV RTAEQRHALRVTTGREVIELRPDVDWDKGKTLLWVLDHLPHSGSAPLVPIYLGDDITD EDAFDVVGPHGVPIVVRHTDDGDRATAALFALDSPARVAEFTDRLARQLREAPLRAT" gene 3787726..3788367 /gene="echA18" /locus_tag="Rv3373" /db_xref="GeneID:888123" CDS 3787726..3788367 /gene="echA18" /locus_tag="Rv3373" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Rv3373, (MTV004.31), len: 213 aa. Probable echA18, enoyl-CoA hydratase (EC 4.2.1.17), similar to others e.g. P97087|CRT from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (259 aa), FASTA scores: opt: 423, E(): 3.4e-20, (37.95% identity in 174 aa overlap); Q9X7Q4|SC5F2A.31c from Streptomyces coelicolor (257 aa), FASTA scores: opt: 399, E(): 1.2e-18, (45.05% identity in 171 aa overlap); BAB52005|MLL5584 from Rhizobium loti (Mesorhizobium loti) (257 aa), FASTA scores: opt: 385, E(): 9.6e-18, (41.95% identity in 174 aa overlap); etc. Also some similarity to 3-HYDROXYBUTYRYL-CoA DEHYDRATASES (EC 4.2.1.55) e.g. P52046|CRT_CLOAB from Clostridium acetobutylicum (261 aa), FASTA scores: opt: 414, E(): 1.3e-19, (38.3% identity in 175 aa overlap). And similar to other hydratases from Mycobacterium tuberculosis e.g. O53418|ECH8_MYCTU|Rv1070c|MT1100|MTV017.23c PROBABLE ENOYL-CoA HYDRATASE (257 aa), FASTA scores: opt: 365, E(): 1.9e-16, (39.1% identity in 174 aa overlap). BELONGS TO THE ENOYL-CoA HYDRATASE/ISOMERASE FAMILY. Note that this homology extends across the stop codon and directly into the next ORF MTV004.29, suggesting a possible readthrough of the TGA stop codon." /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_217890.1" /db_xref="GI:15610509" /db_xref="GOA:O50402" /db_xref="UniProtKB/TrEMBL:O50402" /db_xref="GeneID:888123" /translation="MRRRAMTKMDEASNPCGGDIEAEMCQLMREQPPAEGVVDRVALQ RHRNVALITLSHPQAQNALNLASWRRLKRLLDDLAGESGLRAVVLRGAGDKAFAAGAD IKEFPNTRMSAADAAEYNESLAVCLRALTTMPIPVIAAVRGLAVGGGCELATACDVCI ATDDARFGIPLGKLGVTTGFTEADTVARLIGPAALKYLLFSGELIGIEEAARW" gene 3788368..3788616 /gene="echA18.1" /locus_tag="Rv3374" /db_xref="GeneID:888100" CDS 3788368..3788616 /gene="echA18.1" /locus_tag="Rv3374" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Rv3374, (MTV004.32), len: 82 aa. Probable echA18.1, enoyl-CoA hydratase C-terminus (EC 4.2.1.17), similar to the C-terminus of several enoyl-CoA hydratases e.g. Q9I5I4|PA0745 from Pseudomonas aeruginosa (272 aa), FASTA scores: opt: 123, E(): 0.13, (34.55% identity in 81 aa overlap); P97087|CRT from Clostridium thermosaccharolyticum (Thermoanaerobacterium thermosaccharolyticum) (259 aa), FASTA scores: opt: 115, E(): 0.45, (32.95% identity in 82 aa overlap); Q9I002|PA2841 from Pseudomonas aeruginosa (263 aa), FASTA scores: opt: 108, E(): 1.4, (30.95% identity in 84 aa overlap); etc. Also some similarity to C-terminus of O29956|AF0285 3-HYDROXYACYL-CoA DEHYDROGENASE from Archaeoglobus fulgidus (658 aa), FASTA scores: opt: 116, E(): 0.81, (34.15% identity in 82 aa overlap); and other enzymes. And similar to other hydratases from Mycobacterium tuberculosis e.g. O53418|ECH8_MYCTU|Rv1070c|MT1100|MTV017.23c PROBABLE ENOYL-CoA HYDRATASE (257 aa), FASTA scores: opt: 111, E(): 0.83, (36.05% identity in 86 aa overlap). This homology extends across the upstream TGA stop codon into the upstream ORF MTV004.28, suggesting possible readthrough of the previous stop codon. Note that previously known as echA18'.; echA18'" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="YP_177966.1" /db_xref="GI:57117099" /db_xref="GOA:Q6MWX6" /db_xref="UniProtKB/TrEMBL:Q6MWX6" /db_xref="GeneID:888100" /translation="MVQKVVAPQDLAAATAKLVGQVCRQSAVTMRAAKVVANMHGRAL TGADTDALIRFGVEAYEGADLREGVAAFSQGRPPKFDD" gene 3788621..3790048 /gene="amiD" /locus_tag="Rv3375" /db_xref="GeneID:888064" CDS 3788621..3790048 /gene="amiD" /locus_tag="Rv3375" /EC_number="3.5.1.4" /function="INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: A MONOCARBOXYLIC ACID AMIDE + H(2)O = A MONOCARBOXYLATE + NH(3)]." /note="Rv3375, (MTV004.33), len: 475 aa. Probable amiD, amidase (EC 3.5.1.4), similar to various amidases e.g. Q53116|AMDA ENANTIOMERASE-SELECTIVE AMIDASE from Rhodococcus sp. (462 aa), FASTA scores: opt: 1036, E(): 1.6e-54, (38.6% identity in 464 aa overlap); Q9ZHK8|PZAA NICOTINAMIDASE/PYRAZINAMIDASE from Mycobacterium smegmatis (468 aa), FASTA scores: opt: 930, E(): 3.4e-48, (36.3% identity in 463 aa overlap); Q9A551|CC2613 PYRAZINAMIDASE/NICOTINAMIDASE from Caulobacter crescentus (464 aa), FASTA scores: opt: 841, E(): 7.1e-43, (39.45% identity in 469 aa overlap); O69768|AMID_PSEPU AMIDASE from Pseudomonas putida (466 aa), FASTA scores: opt: 800, E(): 2e-40, (33.6% identity in 467 aa overlap); O28325|YJ54_ARCFU|AF1954 PUTATIVE AMIDASE from Archaeoglobus fulgidu (453 aa), FASTA scores: opt: 669, E(): 1.3e-32, (30.4% identity in 467 aa overlap); etc. Also some similarity to AMIB2|Rv1263|MT1301|MTCY50.19c putative amidase from Mycobacterium tuberculosis (462 aa), (31.5% identity in 466 aa overlap). SEEMS BELONG TO THE AMIDASE FAMILY." /codon_start=1 /transl_table=11 /product="amidase AmiD" /protein_id="NP_217892.1" /db_xref="GI:15610511" /db_xref="GOA:P63496" /db_xref="UniProtKB/Swiss-Prot:P63496" /db_xref="GeneID:888064" /translation="MTDADSAVPPRLDEDAISKLELTEVADLIRTRQLTSAEVTESTL RRIERLDPQLKSYAFVMPETALAAARAADADIARGHYEGVLHGVPIGVKDLCYTVDAP TAAGTTIFRDFRPAYDATVVARLRAAGAVIIGKLAMTEGAYLGYHPSLPTPVNPWDPT AWAGVSSSGCGVATAAGLCFGSIGSDTGGSIRFPTSMCGVTGIKPTWGRVSRHGVVEL AASYDHVGPITRSAHDAAVLLSVIAGSDIHDPSCSAEPVPDYAADLALTRIPRVGVDW SQTTSFDEDTTAMLADVVKTLDDIGWPVIDVKLPALAPMVAAFGKMRAVETAIAHADT YPARADEYGPIMRAMIDAGHRLAAVEYQTLTERRLEFTRSLRRVFHDVDILLMPSAGI ASPTLETMRGLGQDPELTARLAMPTAPFNVSGNPAICLPAGTTARGTPLGVQFIGREF DEHLLVRAGHAFQQVTGYHRRRPPV" gene 3790156..3790809 /locus_tag="Rv3376" /db_xref="GeneID:888066" CDS 3790156..3790809 /locus_tag="Rv3376" /function="UNKNOWN" /note="Rv3376, (MTV004.34), len: 217 aa. Hypothetical protein, similar to various bacterial proteins (notably hydrolases) e.g. Q9RUP0|DR1344 HYDROLASE from Deinococcus radiodurans (222 aa), FASTA scores: opt: 348, E(): 1.8e-15, (36.75% identity in 215 aa overlap); Q9RXA1|DR0414 HYDROLASE (CBBY/CBBZ/GPH/YIEH FAMILY) from Deinococcus radiodurans (155 aa), FASTA scores: opt: 233, E(): 3.5e-08, (36.4% identity in 151 aa overlap); Q9X0Q9|TM1177 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (225 aa), FASTA scores: opt: 231, E(): 6.6e-08, (27.6% identity in 221 aa overlap); Q9ABI3|CC0244 HYDROLASE, HALOACID DEHALOGENASE-LIKE from Caulobacter crescentus (213 aa), FASTA scores: opt: 213, E(): 9.1e-07, (28.95% identity in 221 aa overlap); BAB38231|ECS4808 PUTATIVE PHOSPHATASE from Escherichia coli strain O157:H7 (206 aa), FASTA scores: opt: 210, E(): 1.4e-06, (26.95% identity in 193 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217893.1" /db_xref="GI:15610512" /db_xref="GOA:O50405" /db_xref="UniProtKB/TrEMBL:O50405" /db_xref="GeneID:888066" /translation="MSISAVVFDRDGVLTSFDWTRAEEDVRRITGLPLEEIERRWGGW LNGLTIDDAFVETQPISEFLSSLARELELGSKARDELVRLDYMAFAQGYPDARPALEE ARRRGLKVGVLTNNSLLVSARSLLQCAALHDLVDVVLSSQMIGAAKPDPRAYQAIAEA LGVSTTSCLFFDDIADWVEGARCAGMRAYLVDRSGQTRDGVVRDLSSLGAILDGAGP" gene complement(3790848..3792353) /locus_tag="Rv3377c" /db_xref="GeneID:888073" CDS complement(3790848..3792353) /locus_tag="Rv3377c" /function="UNKNOWN" /note="Rv3377c, (MTV004.35c), len: 501 aa. Possible cyclase; similarity with various proteins, notably cyclases involved in steroid biosynthesis in plants and bacteria e.g. BAB52679|MLR6369 from Rhizobium loti (Mesorhizobium loti) (516 aa), FASTA scores: opt: 533, E(): 5.6e-27, (30.45% identity in 522 aa overlap); Q9ZTN8 COPALYL DIPHOSPHATE SYNTHASE 1 from Cucurbita maxima (Pumpkin) (Winter squash) (823 aa), FASTA scores: opt: 484, E(): 1.2e-23, (28.35% identity in 388 aa overlap); Q38710|AC22 ABIETADIENE CYCLASE from Abies grandis (868 aa), FASTA scores: opt: 382, E(): 5.2e-17, (25.55% identity in 462 aa overlap); Q41771|AN1 KAURENE SYNTHASE A from Zea mays (Maize) (823 aa), FASTA scores: opt: 377, E(): 1.1e-16, (29.75% identity in 390 aa overlap); Q9AJE4 DITERPENE CYCLASE-1 from Kitasatospora griseola (Streptomyces griseolosporeus) (499 aa), FASTA scores: opt: 336, E(): 3.2e-14, (27.5% identity in 513 aa overlap); Q9SAU6 E-ALPHA-BISABOLENE SYNTHASE (FRAGMENT) from Abies grandis (782 aa), FASTA scores: opt: 317, E(): 7.8e-13, (25.25% identity in 479 aa overlap); etc. Note that this and the upstream ORF MTV004.36c have a significantly lower GC bias than the rest of the genome." /codon_start=1 /transl_table=11 /product="cyclase" /protein_id="NP_217894.1" /db_xref="GI:15610513" /db_xref="GOA:O50406" /db_xref="UniProtKB/TrEMBL:O50406" /db_xref="GeneID:888073" /translation="METFRTLLAKAALGNGISSTAYDTAWVAKLGQLDDELSDLALNW LCERQLPDGSWGAEFPFCYEDRLLSTLAAMISLTSNKHRRRRAAQVEKGLLALKNLTS GAFEGPQLDIKDATVGFELIAPTLMAEAARLGLAICHEESILGELVGVREQKLRKLGG SKINKHITAAFSVELAGQDGVGMLDVDNLQETNGSVKYSPSASAYFALHVKPGDKRAL AYISSIIQAGDGGAPAFYQAEIFEIVWSLWNLSRTDIDLSDPEIVRTYLPYLDHVEQH WVRGRGVGWTGNSTLEDCDTTSVAYDVLSKFGRSPDIGAVLQFEDADWFRTYFHEVGP SISTNVHVLGALKQAGYDKCHPRVRKVLEFIRSSKEPGRFCWRDKWHRSAYYTTAHLI CAASNYDDALCSDAIGWILNTQRPDGSWGFFDGQATAEETAYCIQALAHWQRHSGTSL SAQISRAGGWLSQHCEPPYAPLWIAKTLYCSATVVKAAILSALRLVDESNQ" gene complement(3792358..3793248) /locus_tag="Rv3378c" /db_xref="GeneID:888075" CDS complement(3792358..3793248) /locus_tag="Rv3378c" /function="UNKNOWN" /note="Rv3378c, (MTV004.36c), len: 296 aa. Hypothetical unknown protein. Note that this ORF and the downstream ORF MTV004.35c have a significantly lower GC bias than the rest of the genome." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217895.1" /db_xref="GI:15610514" /db_xref="UniProtKB/TrEMBL:O50407" /db_xref="GeneID:888075" /translation="MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLEC NPQYDDYQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLA NDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGV FGNDAAESVAQFSISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLL SSGKTSLYFTVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRA QPDRVFGVGCVHDGIWFAEG" gene complement(3793257..3794867) /gene="dxs2" /locus_tag="Rv3379c" /db_xref="GeneID:888080" CDS complement(3793257..3794867) /gene="dxs2" /locus_tag="Rv3379c" /EC_number="2.2.1.7" /function="CATALYZES THE ACYLOIN CONDENSATION REACTION BETWEEN C ATOMS 2 AND 3 OF PYRUVATE AND GLYCERALDEHYDE 3-PHOSPHATE TO YIELD 1-DEOXY-D-XYLULOSE-5-PHOSPHATE (DXP). POSSIBLY INVOLVED IN DEOXYXYLULOSE-5-PHOSPHATE PATHWAY (DXP) OF ISOPRENOID BIOSYNTHESIS (AT THE FIRST STEP), AND BIOSYNTHETIC PATHWAY TO THIAMINE ANDPYRIDOXOL (AT THE FIRST STEP)." /note="Catalyzes the formation of 1-deoxy-D-xylulose 5-phosphate from pyruvate and D-glyceraldehyde 3-phosphate" /codon_start=1 /transl_table=11 /product="1-deoxy-D-xylulose-5-phosphate synthase" /protein_id="NP_217896.1" /db_xref="GI:15610515" /db_xref="GOA:O50408" /db_xref="UniProtKB/TrEMBL:O50408" /db_xref="GeneID:888080" /translation="MFDTGHQTYPHKLLTGRGKDFATLRQADGLSGYPNRHESPHDWV ENSHASVSLAWVDGIAKALALQGQCDRRVIAVIGDGALTGGVAWEGLNNLGAATRPVI VVLNDNGRSYDPTAGALAAHLEELRVGTPRGPNLFENMGFTYIGPVDGHNIPDTCAVL RKAAAAARPVVVHAVTSKGRGYPPAEADERDHMHACGVVDIATGLASTPSQRSWTDVF EDEIARIADDRSDVVGLTAAMRLPTGLGALSRRYPHRVFDSGIAEQHLLASAAGLAAA GTHPVVAVYSTFLHRAFDQLLFDIGLHRLPVTLVLDRAGVTGPDGPSHHGLWDLALLA CVPGFQIACPRDAPRLRQQLRTAIATAAPTAVRFPKGAPGEPITAEHTIGGLDVLHTP PPHWRPDVLLVAVGAMSRPCMDAARCLSEEQIGVTVVDPQWVWPISPALTELAGRHRI TVCVEDAIADVGIGAHLSHHIGRTHPRTRTYTLGLPPAYIPHASRDHILSSHGLTGPA IRIRCKSLLNALHEVPGPEDHPDSGDSY" repeat_region 3795058..3796412 /note="IS6110-15, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-15" repeat_region 3795058..3795085 /note="28 bp inverted repeat at the left end of IS6110, TGAACCGCCCCGGTGAGTCCGGAGACTC" gene complement(3795100..3795984) /locus_tag="Rv3380c" /db_xref="GeneID:887411" CDS complement(3795100..3795984) /locus_tag="Rv3380c" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="Rv3380c, (MTV004.38c), len: 294 aa. Probable transposase (IS6110 ORF II), identical to many. May be expressed by frameshifting from the upstream ORF MTV004.39c." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217897.1" /db_xref="GI:15610516" /db_xref="GOA:P19774" /db_xref="UniProtKB/Swiss-Prot:P19774" /db_xref="GeneID:887411" /translation="MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKE HISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIAD PATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVAST MATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGA VGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVP PVELEAAYYAQRQRPAAG" gene complement(3796035..3796361) /locus_tag="Rv3381c" /db_xref="GeneID:887646" CDS complement(3796035..3796361) /locus_tag="Rv3381c" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="Rv3381c, (MTV004.39c), len: 108 aa. Probable transposase (IS6110 ORF I), identical to many." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217898.1" /db_xref="GI:15610517" /db_xref="GOA:Q50686" /db_xref="UniProtKB/Swiss-Prot:Q50686" /db_xref="GeneID:887646" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" repeat_region complement(3796385..3796412) /note="28 bp inverted repeat at the right end of IS6110, TGAACCGCCCCGGCATGTCCGGAGACTC" gene complement(3796448..3797437) /gene="lytB1" /locus_tag="Rv3382c" /db_xref="GeneID:887953" CDS complement(3796448..3797437) /gene="lytB1" /locus_tag="Rv3382c" /function="NOT KNOW. POSSIBLY INVOLVED IN DRUG/ANTIBIOTIC TOLERANCE. IN OTHER ORGANISMS, LYTB PRODUCT IS INVOLVED IN PENICILLIN TOLERANCE AND CONTROL OF THE STRINGENT RESPONSE." /note="Rv3382c, (MTV004.40c), len: 329 aa. Probable lytB1, lytB-related protein, highly similar to many e.g. Q9HVM7|LYTB_PSEAE|PA4557 from Pseudomonas aeruginosa (314 aa), FASTA scores: opt: 1048, E(): 2e-55, (53.2% identity in 314 aa overlap); Q9JR39|LYTB|NMA0624|NMB1831 from Neisseria meningitidis (serogroup A and B) (322 aa), FASTA scores: opt: 1041, E(): 5.4e-55, (52.25% identity in 312 aa overlap); P22565|LYTB_ECOLI|B0029 from Escherichia coli strain K12 (316 aa), FASTA scores: opt: 1013, E(): 2.5e-53, (51.45% identity in 311 aa overlap) (for more information about lytB protein, see citation below); Q9X781|LYTB_MYCLE|LYTB2|ML1938|MLCB1222.06c from Mycobacterium leprae (332 aa), FASTA scores: opt: 979, E(): 2.8e-51, (51.3% identity in 312 aa overlap); etc. Also similar to Q9PAS9|XF2416 DRUG TOLERANCE PROTEIN from Xylella fastidiosa (316 aa), FASTA scores: opt: 1043, E(): 4.1e-55, (53.65% identity in 315 aa overlap). And similar to O53458|Rv1110|LYTB2|MTV017.63 from Mycobacterium tuberculosis (335 aa), FASTA scores: opt: 975, E(): 4.9e-51, (51.3% identity in 312 aa overlap). BELONGS TO THE LYTB FAMILY." /codon_start=1 /transl_table=11 /product="LYTB-like protein LYTB1" /protein_id="YP_177967.1" /db_xref="GI:57117100" /db_xref="GOA:O50409" /db_xref="UniProtKB/Swiss-Prot:O50409" /db_xref="GeneID:887953" /translation="MAEVFVGPVAQGYASGEVTVLLASPRSFCAGVERAIETVKRVLD VAEGPVYVRKQIVHNTVVVAELRDRGAVFVEDLDEIPDPPPPGAVVVFSAHGVSPAVR AGADERGLQVVDATCPLVAKVHAEAARFAARGDTVVFIGHAGHEETEGTLGVAPRSTL LVQTPADVAALNLPEGTQLSYLTQTTLALDETADVIDALRARFPTLGQPPSEDICYAT TNRQRALQSMVGECDVVLVIGSCNSSNSRRLVELAQRSGTPAYLIDGPDDIEPEWLSS VSTIGVTAGASAPPRLVGQVIDALRGYASITVVERSIATETVRFGLPKQVRAQ" gene complement(3797437..3798489) /gene="idsB" /locus_tag="Rv3383c" /db_xref="GeneID:887680" CDS complement(3797437..3798489) /gene="idsB" /locus_tag="Rv3383c" /EC_number="2.5.1.-" /function="INVOLVED IN BIOSYNTHESIS OF MEMBRANE ETHER-LINKED LIPIDS. CATALYZES THE TRANS-ADDITION OF THE THREE MOLECULES OF IPP ONTO DMAPP TO FORM GERANYLGERANYL PYROPHOSPHATE WHICH IS A PRECURSOR OF THE ETHER-LINKED LIPIDS. catalyze the consecutive condensation of homoallylic diphosphate of isopentenyl diphosphates (IPP, C5) with allylic diphosphates to synthesize prenyl diphosphates of various chain lengths." /note="Rv3383c, (MTV004.41c), len: 350 aa. Possible idsB, polyprenyl transferase (polyprenyl diphosphate synthase) (EC 2.5.1.-), similar to many prenyltransferases involved in lipid biosynthesis e.g. Q9RGW1|GTR GERANYL TRANSFERASE from Streptomyces coelicolor (386 aa), FASTA scores: opt: 908, E(): 3.7e-50, (48.8/% identity in 334 aa overlap); Q9KWG0|GGDPS GERANYL GERANYL DIPHOSPHATE SYNTHASE from Kitasatospora griseola (Streptomyces griseolosporeus) (348 aa), FASTA scores: opt: 801, E(): 2e-43, (41.5% identity in 347 aa overlap); Q9X7V8|SC6A5.12 PUTATIVE POLYPRENYL SYNTHETASE from Streptomyces coelicolor (378 aa), FASTA scores: opt: 779, E(): 5.3e-42, (44.45% identity in 324 aa overlap); Q9S5E9 FARNESYL, GERANYLGERANYL, GERANYLFARNESYL, HEXAPRENYL, HEPTAPRENYL DIPHOSPHATE SYNTHASE (SELF-HEPPS) from Synechococcus elongatus (324 aa), FASTA scores: opt: 563, E(): 2.3e-28, (39.85% identity in 241 aa overlap) (see citation below); O26156|IDSA_METTH|MTH50 BIFUNCTIONAL SHORT CHAIN ISOPRENYL DIPHOSPHATE SYNTHASE [INCLUDES: FARNESYL PYROPHOSPHATE SYNTHETASE (EC 2.5.1.1) (FPP SYNTHETASE) (DIMETHYLALLYLTRANSFERASE) AND GERANYLTRANSTRANSFERASE (EC 2.5.1.10)] from Methanobacterium thermoautotrophicum (325 aa), FASTA scores: opt: 540, E(): 6.5e-27, (35.75% identity in 319 aa overlap); P95999|GGPP_SULSO|GDS|GDS-1|SSO0061|C05010|C05_049 GERANYLGERANYL PYROPHOSPHATE SYNTHETASE (GGPP SYNTHETASE) (GGPS) [INCLUDES: DIMETHYLALLYLTRANSFERASE (EC 2.5.1.1)AND GERANYLTRANSTRANSFERASE (EC 2.5.1.10) AND FARNESYLTRANSTRANSFERASE (EC 2.5.1.29)] from Sulfolobus solfataricus (332 aa), FASTA scores: opt: 511, E(): 4.5e-25 (36.9% identity in 244 aa overlap); etc. Also similar to Q50727|GGPP_MYCTU|Rv3398c|MT3506|MTCY78.30 PROBABLE MULTIFUNCTIONAL GERANYLGERANYL PYROPHOSPHATE SYNTHETASE [INCLUDES: DIMETHYLALLYLTRANSFERASE (EC 2.5.1.1); GERANYLTRANSTRANSFERASE (EC 2.5.1.10); FARNESYLTRANSTRANSFERASE (EC 2.5.1.29)] from Mycobacterium tuberculosis (359 aa), FASTA scores: opt: 687, E(): 3.4e-36, (39.1% identity in 325 aa overlap). Contains PS00723 Polyprenyl synthetases signature 1. BELONGS TO THE FPP/GGPP SYNTHETASES FAMILY." /codon_start=1 /transl_table=11 /product="polyprenyl synthetase IdsB" /protein_id="NP_217900.1" /db_xref="GI:15610519" /db_xref="GOA:O50410" /db_xref="UniProtKB/TrEMBL:O50410" /db_xref="GeneID:887680" /translation="MGGVLTLDAAFLGSVPADLGKALLERARADCGPVLHRAIESMRE PLATMAGYHLGWWNADRSTAAGSSGKYFRAALVYAAAAACGGDVGDATPVSAAVELVH NFTLLHDDVMDGDATRRGRPTVWSVWGVGVAILLGDALHATAVRILTGLTDECVAVRA IRRLQMSCLDLCIGQFEDCLLEGQPEVTVDDYLRMAAGKTAALTGCCCALGALVANAD DATIAALERFGHELGLAFQCVDDLIGIWGDPGVTGKPVGNDLARRKATLPVVAALNSR SEAATELAALYQAPAAMTASDVERATALVKVAGGGHVAQRCADERIQAAIAALPDAVR SPDLIALSQLICRREC" misc_feature complement(3798130..3798174) /gene="idsB" /locus_tag="Rv3383c" /note="PS00723 Polyprenyl synthetases signature 1" gene complement(3799243..3799635) /locus_tag="Rv3384c" /db_xref="GeneID:887432" CDS complement(3799243..3799635) /locus_tag="Rv3384c" /function="UNKNOWN" /note="Rv3384c, (MTV004.42c), len: 130 aa. Hypothetical protein, similar to Mycobacterium tuberculosis hypothetical proteins P95252|Rv1962c|MTCY09F9.02 (135 aa), FASTA scores: opt: 266, E(): 1.6e-10, (43.1% identity in 130 aa overlap); and Q50717|YY08_MYCTU|Rv3408|MTCY78.20c (136 aa), FASTA scores: opt: 243, E(): 4.8e-09, (35.1% identity in 131 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217901.1" /db_xref="GI:15610520" /db_xref="UniProtKB/TrEMBL:O50411" /db_xref="GeneID:887432" /translation="MAAIYLDSSAIVKLAVREPESDALRRYLRTRHPRVSSALARAEV MRALLDKGESARKAGRRALAHLDLLRVDKRVLDLAGGLLPFELRTLDAIHLATAQRLG VDLGRLCTYDDRMRDAAKTLGMAVIAPS" gene complement(3799635..3799943) /locus_tag="Rv3385c" /db_xref="GeneID:887429" CDS complement(3799635..3799943) /locus_tag="Rv3385c" /function="UNKNOWN" /note="Rv3385c, (MTV004.43c), len: 102 aa. Hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. Q50718|Y09M_MYCTU|MTCY78.21c|Rv3407|MT3515 (99 aa), FASTA scores: opt: 155, E(): 0.001, (41.05% identity in 78 aa overlap); O07782|Rv0596c|MTCY19H5.26 (85 aa), FASTA scores: opt: 136, E(): 0.016, (39.45% identity in 71 aa overlap); P96916|Rv0626|MTCY20H10.07 (86 aa), FASTA scores: opt: 130, E(): 0.04, (51.2% identity in 41 aa overlap); etc. Also similar to PREVENT HOST DEATH (PHD) PROTEINS e.g. CAA66834|PHD from Escherichia coli (73 aa), FASTA scores: opt: 113, E(): 0.45, (39.4% identity in 66 aa overlap); and Q06253|PHD_BPP1 from Bacteriophage P1 (73 aa), FASTA scores: opt: 113, E(): 0.45, (39.4% identity in 66 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217902.1" /db_xref="GI:15610521" /db_xref="UniProtKB/TrEMBL:O50412" /db_xref="GeneID:887429" /translation="MTPTACATVSTMTSVGVRALRQRASELLRRVEAGETIEITDRGR PVALLSPLPQGGPYEQLLASGEIERATLDVVDLPEPLDLDAGVELPSVTLARLREHER" repeat_region 3799987..3801554 /note="IS1560-2, len: 1568 bp. Possible Insertion sequence element IS_1560. Second copy in MTCY10G2 from 11273 to 12919." /mobile_element="insertion sequence:IS1560-2" repeat_region 3799987..3800011 /note="25 bp inverted repeat at the right end of putative IS1560, TAATTACTAGGACCTGAAAAAGTCG" gene 3800092..3800796 /locus_tag="Rv3386" /db_xref="GeneID:888044" CDS 3800092..3800796 /locus_tag="Rv3386" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1560." /note="Rv3386, (MTV004.44), len: 234 aa. Possible transposase, showing very weak similarity to several IS element transposases. Highly similar (but shorter) to P963659|MTCY10G2_13|Rv1036c from Mycobacterium tuberculosis (112 aa), FASTA scores: opt: 507, E(): 8.3e-25, (83.9% identity in 87 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217903.1" /db_xref="GI:15610522" /db_xref="UniProtKB/TrEMBL:O50413" /db_xref="GeneID:888044" /translation="MFRTVGDQASLWESVLPEELRRLPEELARVDALLDDSAFFCPFV PFFDPRMGRPSIPMETYLRLMFLKFRYRLGYESLCREVTDSITWRRFCRIPLEGSVPH PTTLMKLTTRCGEDAVAGLNEALLAKAASEKLLRTNKVRADTTVVEGDVGYPTDTGLL AKAVGSMARTVARIKAADAGSAPLGGSSGPRDRLQAAVTRRAATRSGAGLRAPDHRGA SRDRRAGADRGCRGGT" gene 3800786..3801463 /locus_tag="Rv3387" /db_xref="GeneID:887820" CDS 3800786..3801463 /locus_tag="Rv3387" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1560." /note="Rv3387, (MTV004.45), len: 225aa. Possible transposase, showing very weak similarity to other IS element proteins, and similar to various hypothetical proteins." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217904.1" /db_xref="GI:15610523" /db_xref="GOA:O50414" /db_xref="UniProtKB/TrEMBL:O50414" /db_xref="GeneID:887820" /translation="MVRNAQRAVRRASGRRKAWLRQAINHLEKLIGRTERVVDQARSR LAGVMPDSSSRLVSLHDADARPIRKGRLGKPVEFGYKAQVVDNADGVILDHSVELGNP ADAPQLAPAIERISRRTGRPPRAVTADRGCGDASVEDDLHQLGVRNVAIPRKSKPSAT RRAFEHRRAFRDKIKWRTGSEGRINHLKRSYGWNRTELTGITGARTWCGHGVFAHNLV KISTLAA" repeat_region complement(3801530..3801554) /note="25 bp inverted repeat at the right end of putative IS1560, TAATTACTAAGACCTGAAAAAGTCG" gene 3801653..3803848 /gene="PE_PGRS52" /locus_tag="Rv3388" /db_xref="GeneID:888151" CDS 3801653..3803848 /gene="PE_PGRS52" /locus_tag="Rv3388" /function="UNKNOWN" /note="Rv3388, (MTV004.46), len: 731 aa. Member of the M. tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to many PE-family proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O53553|YZ08_MYCTU|RV3508|MTV023.15 (1901 aa), FASTA scores: opt: 2380, E(): 3.6e-87, (53.8% identity in 773 aa overlap); and MTV023_21, MTV023_18, MTV023_14, MTV039_16, MTCY441_4." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177968.1" /db_xref="GI:57117101" /db_xref="UniProtKB/TrEMBL:Q6MWX5" /db_xref="GeneID:888151" /translation="MSFVIANPEMLAAAATDLAGIRSAISAATAAAAAPTIQVAAAGA DEVSLAISALFGQHAQAYQALSAQATIFHDQFVQALTSGGNLYAAAESHTVEQMVLNA INAPTQTLFGRPLIGDGANGTAENPDGQNGGLLFGNGGNGFTQTTAGVAGGNGGSAGL IGNGGAGGGGGAGAAGGLGGNGGWLYGNGGAGGIGGAGTGTGGHGGAGGAGGRAWLWG TGGAGGAGGDGGWLFGDGGAGGTGGNGGSGFNSLTSSVGGAGGAGGHAGLFGAGGTGG TGGIGGQNTETGPAASNGGAGGAGGGGGYLVGDGGAGGTGGAGGKNSSGGATLTGGTG GTGGAGGAAGWLYGSGGAGGAGGAGGLNNAGGATGGTGGTGGAGGSGAWLYGNGGAAG AGGNGGNNTSAGTGGVGASGGTGGNAGLIGAGGHGGAGGAGGNQTGGVGNGGAGGNGG AGGAGGQLYGNGGDGGNGGAGGANIAGGNGSDGGAAGHGGAGGSARLIGAGGHGGDGG AGGNTAGRRADAIAGTGGDGGNGGNGGLLSGNAGAGGHGGAGGSSTATTTTGTPPTGA TGGNGGNGGAGGTAGFTGSGGIGGNGGAGGTGGNAGVALSVGSTGGLGGNGGSGGLGG GGGSLFGNGGAGGVGATGGNGGSGIGPASVGGNGGKGGVGAAGGLAGQIGNGGSGGSG GAGGNGGTGDTAGNGGNGGAGAVGGNAQLIGNGGNGGGGGNGGTGADGT" gene complement(3803919..3804791) /locus_tag="Rv3389c" /db_xref="GeneID:887923" CDS complement(3803919..3804791) /locus_tag="Rv3389c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3389c, (MTV004.47c), len: 290 aa. Possible dehydrogenase (EC 1.-.-.-), similar to parts of several bacterial dehydrogenases and eukaryotic short-chain dehydrogenases involved in steroid biosynthesis e.g. Q9UVH9|FOX2 FOX2 PROTEIN (a multifunctional protein of the peroxisomal beta-oxidation) (SIMILAR TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY) from Glomus mosseae (1015 aa), FASTA scores: opt: 649, E(): 7.5e-33, (40.9% identity in 269 aa overlap); Q9L009|SCC30.12c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (333 aa), FASTA scores: opt: 602, E(): 2.7e-30, (40.35% identity in 305 aa overlap); AAH03098 HYDROXYSTEROID (17-BETA) DEHYDROGENASE 4 from Homo sapiens (Human) (736 aa), FASTA scores: opt: 592, E(): 2.1e-29, (41.55% identity in 272 aa overlap); P51659|DHB4_HUMAN ESTRADIOL 17 BETA-DEHYDROGENASE 4 from Homo sapiens (Human) (736 aa), FASTA scores: opt: 592, E(): 2.1e-29, (41.55% identity in 272 aa overlap); Q19058|E04F6.3 HYDRATASE-DEHYDROGENASE-EPIMERASE from Caenorhabditis elegans (298 aa), FASTA scores: opt: 573, E(): 1.6e-28, (41.0% identity in 266 aa overlap); O42484 17-BETA-HYDROXYSTEROID DEHYDROGENASE TYPE IV from Gallus gallus (Chicken) (735 aa), FASTA scores: opt: 573, E(): 3.2e-28, (39.8% identity in 279 aa overlap); etc. And also similar in part to Q9LBK1|PHAJ2|PA1018 (R)-SPECIFIC ENOYL-CoA HYDRATASE from Pseudomonas aeruginosa (288 aa), FASTA scores: opt: 601, E(): 2.7e-30, (40.5% identity in 294 aa overlap). And similar to P71863|UFAA2|Rv3538|MTCY03C7.18c HYPOTHETICAL 30.2 KDA PROTEIN from Mycobacterium tuberculosis (286 aa), FASTA scores: opt: 609, E(): 8.7e-31, (39.65% identity in 285 aa overlap). HAS SOME SIMILARITY TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_217906.1" /db_xref="GI:15610525" /db_xref="GOA:Q11198" /db_xref="UniProtKB/TrEMBL:Q11198" /db_xref="GeneID:887923" /translation="MAIDPNSIGAVTEPMLFEWTDRDTLLYAIGVGAGTGDLAFTTEN SHGIDQQVLPTYAVICCPAFGAAAKVGTFNPAALLHGSQGIRLHAPLPAAGKLSVVTE VADIQDKGEGKNAIVVLRGRGCDPESGSLVAETLTTLVLRGQGGFGGARGERPAAPEF PDRHPDARIDMPTREDQALIYRLSGDRNPLHSDPWFATQLAGFPKPILHGLCTYGVAG RALVAELGGGVAANITSIAARFTKPVFPGETLSTVIWRTEPGRAVFRTEVAGSDGAEA RVVLDDGAVEYVAG" gene 3804865..3805575 /gene="lpqD" /locus_tag="Rv3390" /db_xref="GeneID:887627" CDS 3804865..3805575 /gene="lpqD" /locus_tag="Rv3390" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3390, (MTV004.48), len: 236 aa. Probable lpqD, a conserved lipoprotein with some similarity to various bacterial proteins e.g. Q9F3Q7|SC10F4.03 PUTATIVE ISOMERASE from Streptomyces coelicolor (224 aa), FASTA scores: opt: 416, E(): 2.5e-18, (33.0% identity in 197 aa overlap); Q9ZAX0|PGM 2,3-PDG DEPENDENT PHOSPHOGLYCERATE MUTASE from Amycolatopsis methanolica (205 aa), FASTA scores: opt: 314, E(): 3.7e-12, (28.55% identity in 203 aa overlap); P73454|SLR1748 HYPOTHETICAL 24.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (214 aa), FASTA scores: opt: 201, E(): 2.8e-05, (23.8% identity in 189 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical proteins e.g. O53817|Rv0754|MTV041.28 PGRS-FAMILY PROTEIN (584 aa), FASTA scores: opt: 219, E(): 5.1e-06, (39.8% identity in 226 aa overlap). Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqD" /protein_id="NP_217907.1" /db_xref="GI:15610526" /db_xref="GOA:O50416" /db_xref="UniProtKB/TrEMBL:O50416" /db_xref="GeneID:887627" /translation="MAKRTPVRKACTVLAVLAATLLLGACGGPTQPRSITLTFIRNAQ SQANADGIIDTDMPGSGLSADGKAEAQQVAHQVSRRDVDSIYSSPMAADQQTAGPLAG ELGKQVEILPGLQAINAGWFNGKPESMANSTYMLAPADWLAGDVHNTIPGSISGTEFN SQFSAAVRKIYDSGHNTPVVFSQGVAIMIWTLMNARNSRDSLLTTHPLPNIGRVVITG NPVTGWRLVEWDGIRNFT" misc_feature 3804910..3804942 /gene="lpqD" /locus_tag="Rv3390" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 3805621..3807573 /gene="acrA1" /locus_tag="Rv3391" /db_xref="GeneID:887950" CDS 3805621..3807573 /gene="acrA1" /locus_tag="Rv3391" /EC_number="1.2.1.-" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM" /experiment="experimental evidence, no additional details recorded" /note="Rv3391, (MTV004.49), len: 650 aa. Possible acrA1, multi functional protein with fatty acyl-CoA reductase activity in C-terminal part (EC 1.2.1.-). Indeed C-terminal part highly similar to P94129|ACR1 FATTY ACYL-CoA REDUCTASE from Acinetobacter calcoaceticus (295 aa), FASTA scores: opt: 767, E(): 1.4e-36, (45.4% identity in 260 aa overlap); and similar to other oxidoreductases dehydrogenases/reductases e.g. Q9Y3A1 CGI-93 PROTEIN (SIMILARITY WITH SDR FAMILY) from Homo sapiens (Human) (291 aa), FASTA scores: opt: 363, E(): 1.5e-13, (38.65% identity in 194 aa overlap); Q9L146|SC6D11.09 PUTATIVE OXIDOREDUCTASE (SIMILARITY WITH SDR FAMILY) from Streptomyces coelicolor (343 aa), FASTA scores: opt: 346, E(): 1.6e-12, (30.4% identity in 283 aa overlap); Q9HSR4|YUSZ1|VNG0115G OXIDOREDUCTASE from Halobacterium sp. strain NRC-1 (260 aa), FASTA scores: opt: 338, E(): 3.7e-12, (33.85% identity in 248 aa overlap); etc. C-terminus also similar to Mycobacterium tuberculosis proteins Q10783|YF43_MYCTU|Rv1543|MTCY48.22c PUTATIVE OXIDOREDUCTASE (341 aa), FASTA scores: opt: 787, E(): 1.2e-37, (39.8% identity in 319 aa overlap); O06413|Rv0547c|MTCY25D10.26c HYPOTHETICAL 31.8 KDA PROTEIN (294 aa), FASTA scores: opt: 565, E(): 4.7e-25, (36.8% identity in 242 aa overlap); O53398|Rv1050|MTV017.03 OXIDOREDUCTASE (SDR FAMILY) (301 aa), FASTA scores: opt: 436, E(): 1.1e-17, (32.2% identity in 292 aa overlap). N-terminus (aa 1-320) is similar to P37693|HETM_ANASP polyketide synthase hetM from Anabaena sp. (506 aa), FASTA scores: opt: 188, E(): 1.3e-07, (27.7% identity in 361 aa overlap); so certainly a multi-domain enzyme. SEEMS TO BELONG TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY. Note that this ORF corresponds to the gene ORF2|Q11197 (see Yuan et al., 1995), but longer 266 aa, due to use of a more upstream start site." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_217908.1" /db_xref="GI:15610527" /db_xref="GOA:O50417" /db_xref="UniProtKB/TrEMBL:O50417" /db_xref="GeneID:887950" /translation="MRYVVTGGTGFIGRHVVSRLLDGRPEARLWALVRRQSLSRFERL AGQWGDRVRPLVGDLTELELSERTIAELGDIDHVLHCAAVHDTTWADATRAVIELAAR LDATFHHVSSIAVAGDFAGHYTEADFDVGQRLPTPYHRMTFEAERLVRSTPGLRYRIY RPAVVVGDSRTGEMDTIDGPYYLFGVLAKLAVLPSFTPMLLPDIGRTNIVPVDYVADA LVALMHADGRDGQTFHLTAPTAIGLRGIYRGIAGAAGLPPLLGTLPGFVAAPVLNARG RAKVLRNMAATQLGIPAEIFDVVGCAPTFTSDTTREALRGTGIHVPEFATYAPGLWRY WAEHLDPDRARRNDPLLGRHVIITGASSGIGRASAIAVAKRGATVFALARNGNALDEL VTEIRAHGGQAHAFTCDVTDSASVEHTVKDILGRFDHVDYLVNNAGRSIRRSVVNSTD RLHDYERVMAVNYFGAVRMVLALLPHWRERRFGHVVNVSSAGVQARNPKYSSYLPTKA ALDAFADVVASETLSDHITFTNIHMPLVATPMIVPSRRLNPVRAISAERAAAMVIRGL VEKPARIDTPLGTLAEAGNYVAPRLSRRILHQLYLGYPDSAAAQGISRPDADRPPAPR RPRRSARAGVPRPLRRLGRLVPGVHW" gene complement(3807574..3808437) /gene="cmaA1" /locus_tag="Rv3392c" /db_xref="GeneID:887961" CDS complement(3807574..3808437) /gene="cmaA1" /locus_tag="Rv3392c" /EC_number="2.1.1.79" /function="HAS CYCLOPROPANE FUNCTION. TRANSFERS A METHYLENE GROUP FROM S-ADENOSYL-L-METHIONINE TO THE CIS DOUBLE BOND OF AN UNSATURATED FATTY ACID CHAIN RESULTING IN THE REPLACEMENT OF THE DOUBLE BOND WITH A METHYLENE BRIDGE. MYCOLIC ACIDS, WHICH REPRESENT THE MAJOR CONSTITUENT OF MYCOBACTERIAL CELL WALL COMPLEX, ACT AS SUBSTRATES [CATALYTIC ACTIVITY: S-ADENOSYL-L-METHIONINE + PHOSPHOLIPIDOLEFINIC FATTY ACID = S-ADENOSYL-L-HOMOCYSTEINE + PHOSPHOLIPID CYCLOPROPANE FATTY ACID]." /experiment="experimental evidence, no additional details recorded" /note="Rv3392c, (MTV004.50), len: 287 aa. cmaA1, cyclopropane mycolic acid synthase 1 (EC 2.1.1.79), characterized in 1995 as CFA1_MYCTU|Q11195|CMAA1|CMA1 cyclopropane-fatty-acyl-phospholipid synthase 1 (see citations below). Highly similar to Mycobacterium tuberculosis proteins MTCY20H10.23c (58.7% identity in 286 aa overlap); MTCY20H10.24c (68.6% identity); MTCY20H10.25c (73.5% identity); MTCY20H10.26c (57.0% identity); and MTCY20G9.30c (55.7% identity). Also highly similar to Q9CBK3|MMAA4|ML1903 METHYL MYCOLIC ACID SYNTHASES from Mycobacterium leprae (298 aa), FASTA scores: opt: 1098, E(): 1e-63, (57.0% identity in 286 aa overlap). Equivalent to AAK44898|MT0672 from Mycobacterium tuberculosis strain CDC1551 (317 aa) but shorter 30 aa and with some differences in residues between the proteins." /codon_start=1 /transl_table=11 /product="cyclopropane-fatty-acyl-phospholipid synthase 1" /protein_id="NP_217909.1" /db_xref="GI:15610528" /db_xref="GOA:Q11195" /db_xref="UniProtKB/Swiss-Prot:Q11195" /db_xref="GeneID:887961" /translation="MPDELKPHFANVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMT LQEAQIAKIDLALGKLGLQPGMTLLDVGCGWGATMMRAVEKYDVNVVGLTLSKNQANH VQQLVANSENLRSKRVLLAGWEQFDEPVDRIVSIGAFEHFGHERYDAFFSLAHRLLPA DGVMLLHTITGLHPKEIHERGLPMSFTFARFLKFIVTEIFPGGRLPSIPMVQECASAN GFTVTRVQSLQPHYAKTLDLWSAALQANKGQAIALQSEEVYERYMKYLTGCAEMFRIG YIDVNQFTCQK" gene 3808461..3809387 /gene="iunH" /locus_tag="Rv3393" /db_xref="GeneID:887625" CDS 3808461..3809387 /gene="iunH" /locus_tag="Rv3393" /EC_number="3.2.2.-" /function="INVOLVED IN PURINE SALVAGE. CATALYZES THE HYDROLYSIS OF ALL OF THE COMMONLY OCCURRING PURINE AND PYRIMIDINE NUCLEOSIDES INTO RIBOSE AND THE ASSOCIATED BASE, AND COULD HAVE A PREFERENCE FOR INOSINE AND URIDINE AS SUBSTRATES [CATALYTIC ACTIVITY: A N-D-RIBOSYLPURINE + H(2)O = A PURINE + D- RIBOSE]." /note="Rv3393, (MTV004.51), len: 308 aa. Probable iunH, nucleoside hydrolase (EC 3.2.2.-), similar to others e.g. Q9RXB2|DR0403 from Deinococcus radiodurans (314 aa), FASTA scores: opt: 497, E(): 6e-24, (34.3% identity in 312 aa overlap); Q27546|IUNH_CRIFA from Crithidia fasciculata (314 aa), FASTA scores: opt: 475, E(): 1.4e-22, (31.45% identity in 318 aa overlap); Q9CK67|IUNH from Pasteurella multocida (310 aa), FASTA scores: opt: 464, E(): 6.9e-22, (30.9% identity in 314 aa overlap); Q9A549|CC2615 from Caulobacter crescentus (323 aa), FASTA scores: opt: 464, E(): 7.2e-22, (37.85% identity in 280 aa overlap); etc. Note that also similar to BAB34113|ECS0690 (alias AAG54985|YBEK) PUTATIVE TRNA SYNTHETASE from Escherichia coli strain O157:H7 (311 aa), FASTA scores: opt: 483, E(): 4.5e-23, (33.0% identity in 315 aa overlap). The active site histidine is conserved." /codon_start=1 /transl_table=11 /product="nucleoside hydrolase IunH" /protein_id="NP_217910.1" /db_xref="GI:15610529" /db_xref="GOA:O50418" /db_xref="UniProtKB/TrEMBL:O50418" /db_xref="GeneID:887625" /translation="MSVVFADVDTGIDDALAVIYLLASPDADLVGIASTGGNIAVGQV CANNLSLLELCGAADIPVSKGADEPLGGRWPDHPKFHGPKGIGYAELPASNRRLTDYD ATTAWIAAAHSHAGDLIGLVTGPLTNLALALRAEPALPRLLRRLVIMGGMFDGQPITE WNIRVDPEAASEVFTAWAGQRQLPIVCGLDLTRRVAMTPDILARLASVCGSSPVMRVI EDALRFYFESHEARGHGYLAYMHDPLAAAVAMDPELLTTRTATVDVDPTGATVTDWSG KRNPNARIGMSVDPAVFFDRFVERIGRFARRT" gene complement(3809442..3811025) /locus_tag="Rv3394c" /db_xref="GeneID:887945" CDS complement(3809442..3811025) /locus_tag="Rv3394c" /function="UNKNOWN" /note="Rv3394c, (MTV004.52c), len: 527 aa. Hypothetical protein, with some similarity to various bacterial proteins e.g. BAB51085|MLR4427 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (545 aa), FASTA scores: opt: 267, E(): 2.8e-08, (26.5% identity in 509 aa overlap); BAB48362|MLR0866 DNA DAMAGE INDUCIBLE PROTEIN P from Rhizobium loti (Mesorhizobium loti) (438 aa), FASTA scores: opt: 245, E(): 4.6e-07, (25.5% identity in 290 aa overlap); Q9S292|SCI11.27c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (322 aa), FASTA scores: opt: 202, E(): 0.00012, (28.5% identity in 323 aa overlap); etc. Also similarity with P95102|DINP|RV3056|MTCY22D7.25c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (346 aa), FASTA scores: opt: 211, E(): 3.9e-05, (26.45% identity in 306 aa overlap). Equivalent to AAK47838 from Mycobacterium tuberculosis strain CDC1551 (492 aa) but longer 35 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217911.1" /db_xref="GI:15610530" /db_xref="GOA:O50419" /db_xref="UniProtKB/TrEMBL:O50419" /db_xref="GeneID:887945" /translation="MMASARVLAIWCMDWPAVAAAAAAGLSATAPVAVTLANRVIACS ATARAAGVRRGLRRREAAARCPQLFIATADADRDARLFEGVIAAVDDLVPRAELLRPG LLVLPVRGPARFFGSEQMAAERLIDAVAAAGAECQVGIADRLSTAVFAARAGRIVEPG GDARFLSLLSIRQLATEPSLSGPGRDDLTDLLWRMGIRTIGQFAALSRTDVASRFGAD AVAAHRFARGEPERAPCGREPPPDLAAELACDPPIDRVDAAAFAGRSLAAELHRALMA AGVGCTRLAIHAVTANGEERSRVWRCAEPLTEDATADRVRWQLDGWLNNRNARDRPTA AVTLLRLQAVETVSASEGLQLPLWGGLGEQDRLRARRALVRVQGLLGPEAVRVPVLSG GHGPAERITLTVLGLVAPEPVPQADPGQPWPGRLPDPSPAVLFDDPVDLLDAQGNPIR VTSRGMFSADPARLRVRGRDDRLRWWAGPWPDDERWWDPDRASGRTARAQVLLDGDPG TALLLCYRQRRWYLEGSYE" gene complement(3811022..3811636) /locus_tag="Rv3395c" /db_xref="GeneID:887960" CDS complement(3811022..3811636) /locus_tag="Rv3395c" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv3395c, (MTCY78.33), len: 204 aa. Conserved hypothetical protein, with some similarity with RECA PROTEINS (RECOMBINASES A) e.g. P16238|RECA_THIFE from Thiobacillus ferrooxidans (346 aa), FASTA scores: opt: 131, E(): 1.1, (31.45% identity in 140 aa overlap); Q59560|RECA_MYCSM from Mycobacterium smegmatis (349 aa), FASTA scores: opt: 121, E(): 4.4, (30.25% identity in 129 aa overlap); etc. Note that shortened since first submission to avoid overlap with Rv3395A. Equivalent to AAK47839 from Mycobacterium tuberculosis strain CDC1551 (227 aa) but shorter 23 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217912.2" /db_xref="GI:57117102" /db_xref="GOA:Q50730" /db_xref="UniProtKB/Swiss-Prot:Q50730" /db_xref="GeneID:887960" /translation="MTAAFASDQRLENGAEQLESLRRQMALLSEKVSGGPSRSGDLVP AGPVSLPPGTVGVLSGARSLLLSMVASVTAAGGNAAIVGQPDIGLLAAVEMGADLSRL AVIPDPGTDPVEVAAVLIDGMDLVVLGLGGRRVTRARARAVVARARQKGCTLLVTDGD WQGVSTRLAARVCGYEITPALRGVPTPGLGRISGVRLQINGRGR" gene 3811719..3812345 /locus_tag="Rv3395A" /db_xref="GeneID:3205044" CDS 3811719..3812345 /locus_tag="Rv3395A" /function="UNKNOWN" /note="Rv3395A, len: 208 aa. Probable membrane protein, with potential transmembrane stretches from aa 7..25 and 55..77. Weak similarity to Q9F2P3|SCE41.16C PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (258 aa), FASTA scores: opt: 107, E(): 7.4, (34.05% identity in 94 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177969.1" /db_xref="GI:57117103" /db_xref="UniProtKB/TrEMBL:Q6MWX4" /db_xref="GeneID:3205044" /translation="MQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATD NTTDGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLH NAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLR GGSVTTADHTLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFVN" gene complement(3812501..3814078) /gene="guaA" /locus_tag="Rv3396c" /db_xref="GeneID:887412" CDS complement(3812501..3814078) /gene="guaA" /locus_tag="Rv3396c" /EC_number="6.3.5.2" /function="converts ATP and xanthosine 5'-phosphate and L-glutamine and water to AMP and pyrophosphate diphosphate and GMP and L-glutamate" /experiment="experimental evidence, no additional details recorded" /note="contains glutamine-hydrolyzing domain and glutamine amidotransferase; GMP-binding domain; functions to produce GMP from XMP in the IMP pathway" /codon_start=1 /transl_table=11 /product="GMP synthase" /protein_id="NP_217913.1" /db_xref="GI:15610532" /db_xref="GOA:Q50729" /db_xref="UniProtKB/Swiss-Prot:Q50729" /db_xref="GeneID:887412" /translation="MVQPADIDVPETPARPVLVVDFGAQYAQLIARRVREARVFSEVI PHTASIEEIRARQPVALVLSGGPASVYADGAPKLDPALLDLGVPVLGICYGFQAMAQA LGGIVAHTGTREYGRTELKVLGGKLHSDLPEVQPVWMSHGDAVTAAPDGFDVVASSAG APVAAFEAFDRRLAGVQYHPEVMHTPHGQQVLSRFLHDFAGLGAQWTPANIANALIEQ VRTQIGDGHAICGLSGGVDSAVAAALVQRAIGDRLTCVFVDHGLLRAGERAQVQRDFV AATGANLVTVDAAETFLEALSGVSAPEGKRKIIGRQFIRAFEGAVRDVLDGKTAEFLV QGTLYPDVVESGGGSGTANIKSHHNVGGLPDDLKFTLVEPLRLLFKDEVRAVGRELGL PEEIVARQPFPGPGLGIRIVGEVTAKRLDTLRHADSIVREELTAAGLDNQIWQCPVVL LADVRSVGVQGDGRTYGHPIVLRPVSSEDAMTADWTRVPYEVLERISTRITNEVAEVN RVVLDITSKPPATIEWE" misc_feature complement(3813782..3813817) /gene="guaA" /locus_tag="Rv3396c" /note="PS00442 Glutamine amidotransferases class-I active site" gene complement(3814090..3814998) /gene="phyA" /locus_tag="Rv3397c" /db_xref="GeneID:887911" CDS complement(3814090..3814998) /gene="phyA" /locus_tag="Rv3397c" /EC_number="2.5.1.-" /function="INVOLVED IN CAROTENOID BIOSYNTHESIS AND IN ASTAXANTHIN BIOSYNTHETIC PATHWAY. CATALYSES THE REACTION FROM PREPHYTOENE DIPHOSPHATE TO PHYTOENE [CATALYTIC ACTIVITY 1: 2 GERANYLGERANYL DIPHOSPHATE = PYROPHOSPHATE + PREPHYTOENE DIPHOSPHATE] [CATALYTIC ACTIVITY 2: PREPHYTOENE DIPHOSPHATE = PYROPHOSPHATE + PHYTOENE]." /note="Rv3397c, (MTCY78.31), len: 302 aa. Probable phyA (alternate gene name: crtB), phytoene synthase (EC 2.5.1.-), similar to many others e.g. Q9X7V5|SC6A5.09 from Streptomyces coelicolor (312 aa), FASTA scores: opt: 791, E(): 2.8e-43, (48.25% identity in 286 aa overlap); Q9RW07|DR0862 from Deinococcus radiodurans (325 aa), FASTA scores: opt: 482, E(): 1.5e-23, (35.25% identity in 292 aa overlap); Q9JRU9|NMB1168|NMB1130 from Neisseria meningitidis (serogroup B) (290 aa), FASTA scores: opt: 446, E(): 2.8e-21, (34.25% identity in 260 aa overlap); P37272|PSY_CAPAN from Capsicum annuum (Bell pepper) (419 aa), FASTA scores: opt: 431, E(): 3.4e-20, (33.0% identity in 288 aa overlap); etc. Also similar to Q9JUF5|NMA1339 PUTATIVE POLY-ISOPRENYL TRANSFERASE (EC 2.5.1.) from Neisseria meningitidis (serogroup A) (290 aa), FASTA scores: opt: 450, E(): 1.6e-21, (34.6% identity in 260 aa overlap). And similar to crtB|O05424 PHYTOENE SYNTHASE from Mycobacterium marinum (319 aa), BLASTP scores: 113, E= 6e-24, Identities = 89/283 (31%) (see citation below). Contains PS01045 Squalene and phytoene synthases signature 2. BELONGS TO THE PHYTOENE/SQUALENE SYNTHETASE FAMILY.; crtB" /codon_start=1 /transl_table=11 /product="phytoene synthase" /protein_id="NP_217914.1" /db_xref="GI:15610533" /db_xref="GOA:P65860" /db_xref="UniProtKB/Swiss-Prot:P65860" /db_xref="GeneID:887911" /translation="MTEIEQAYRITESITRTAARNFYYGIRLLPREKRAALSAVYALG RRIDDVADGELAPETKITELDAIRKSLDNIDDSSDPVLVALADAARRFPVPIAMFAEL IDGARMEIDWTGCRDFDELIVYCRRGAGTIGKLCLSIFGPVSTATSRYAEQLGIALQQ TNILRDVREDFLNGRIYLPRDELDRLGVRLRLDDTGALDDPDGRLAALLRFSADRAAD WYSLGLRLIPHLDRRSAACCAAMSGIYRRQLALIRASPAVVYDRRISLSGLKKAQVAA AALASSVTCGPAHGPLPADLGSHPSH" misc_feature complement(3814462..3814539) /gene="phyA" /locus_tag="Rv3397c" /note="PS01045 Squalene and phytoene synthases signature 2" gene complement(3815027..3816106) /gene="idsA1" /locus_tag="Rv3398c" /db_xref="GeneID:887919" CDS complement(3815027..3816106) /gene="idsA1" /locus_tag="Rv3398c" /EC_number="2.5.1.1" /EC_number="2.5.1.29" /EC_number="2.5.1.10" /function="INVOLVED IN THE BIOSYNTHESIS OF MEMBRANE ETHER-LINKED LIPIDS. CATALYZES THE TRANS-ADDITION OF THE THREE MOLECULES OF IPP ONTO DMAPP TO FORM GERANYLGERANYL PYROPHOSPHATE WHICH IS A PRECURSOR OF THE ETHER-LINKED LIPIDS [CATALYTIC ACTIVITY1: Dimethylallyl diphosphate + isopentenyl diphosphate = diphosphate + geranyl diphosphate] [CATALYTIC ACTIVITY2: Geranyl diphosphate + isopentenyl diphosphate = diphosphate + trans,trans-farnesyl diphosphate] [CATALYTIC ACTIVITY3: Trans-trans-farnesyl diphosphate + isopentenyl diphosphate = diphosphate + geranylgeranyl diphosphate]" /note="Rv3398c, (MTCY78.30), len: 359 aa. Probable idsA1, geranylgeranyl pyrophosphate synthetase (GGPP synthetase) including: dimethylallyltransferase (EC 2.5.1.1), geranyltranstransferase (EC 2.5.1.10), and farnesyltranstransferase (EC 2.5.1.29). Most similar to AE000797_3|O26156|Q53479 bifunctional short chain isoprenyl diphosphate synthase from Methanobacterium thermoautotrop (325 aa), FASTA scores: opt: 605, E(): 0, (37.1% identity in 329 aa overlap); homology suggests ATG at 30121 or TTG at 30145 to be the initiation codon. Contains PS00444 Polyprenyl synthetases signature 2. BELONGS TO THE FPP/GGPP SYNTHETASES FAMILY; BELONGS TO A FAMILY THAT GROUPS TOGETHER FPP SYNTHETASE, GGPP SYNTHETASE AND HEXAPRENYL PYROPHOSPHATE SYNTHETASE. Note that previously known as idsA.; idsA" /codon_start=1 /transl_table=11 /product="multifunctional dimethylallyltransferase/farnesyl diphosphate synthetase/ farnesyltranstransferase" /protein_id="YP_177970.1" /db_xref="GI:57117104" /db_xref="GOA:Q50727" /db_xref="UniProtKB/Swiss-Prot:Q50727" /db_xref="GeneID:887919" /translation="MRGTDEKYGLPPQPDSDRMTRRTLPVLGLAHELITPTLRQMADR LDPHMRPVVSYHLGWSDERGRPVNNNCGKAIRPALVFVAAEAAGADPHSAIPGAVSVE LVHNFSLVHDDLMDRDEHRRHRPTVWALWGDAMALLAGDAMLSLAHEVLLDCDSPHVG AALRAISEATRELIRGQAADTAFESRTDVALDECLKMAEGKTAALMAASAEVGALLAG APRSVREALVAYGRHIGLAFQLVDDLLGIWGRPEITGKPVYSDLRSRKKTLPVTWTVA HGGSAGRRLAAWLVDETGSQTASDDELAAVAELIECGGGRRWASAEARRHVTQGIDMV ARIGIPDRPAAELQDLAHYIVDRQA" misc_feature complement(3815369..3815407) /gene="idsA1" /locus_tag="Rv3398c" /note="PS00444 Polyprenyl synthetases signature 2" gene 3816129..3817175 /locus_tag="Rv3399" /db_xref="GeneID:887938" CDS 3816129..3817175 /locus_tag="Rv3399" /function="UNKNOWN" /note="Rv3399, (MTCY78.29c), len: 348 aa. Hypothetical protein, similar to other Mycobacterium tuberculosis (strains H37Rv and CDC1551) hypothetical proteins e.g. P95074|Rv0726c|MTCY210.45c (367 aa), FASTA scores: opt: 1188, E(): 7.7e-69, (60.05% identity in 308 aa overlap); MTCY31.21c (38.0% identity in 308 aa overlap), MTV041_5, MTCY4C12_14, MTY13D12_21, MTV043_22, MTCY210_44, MTCI5_19, MTCI5_20, MTV035_9, MTCY180_22, MTCY31_23, MTY13D12_1, MTCY180_29; etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217916.1" /db_xref="GI:15610535" /db_xref="UniProtKB/Swiss-Prot:Q50726" /db_xref="GeneID:887938" /translation="MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMT RTDNDTWDLASSVGATATMIATARALASRAENPLINDPFAEPLVRAVGIDLFTRLASG ELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAAGLDTRAYRLP WPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVDLRNDWPTALKNAGFDPAR PTAFSAEGLLSYLPPQGQDRLLDAITALSAPDSRLATQSPLVLDLAEEDEKKMRMKSA AEAWRERGFDLDLTELIYFDQRNDVADYLAGSGWQVTTSTGKELFAAQGLPPFADDHI TRFADRRYISAVLK" gene 3817239..3818027 /locus_tag="Rv3400" /db_xref="GeneID:887918" CDS 3817239..3818027 /locus_tag="Rv3400" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3400, (MTCY78.28c), len: 262 aa. Probable hydrolase (EC 3.-.-.-), strongly equivalent to Q49741|YY00_MYCLE|ML0393|B1620_F3_119 HYPOTHETICAL 28.6 KDA PROTEIN from Mycobacterium leprae (261 aa), FASTA scores: opt: 1293, E(): 2.2e-71, (74.45% identity in 262 aa overlap). Similar to several various proteins (notably hydrolases) e.g. Q9L2I7|SCF42.32 PUTATIVE HYDROLASE from Streptomyces coelicolor (246 aa), FASTA scores: opt: 888, E(): 7.7e-47, (56.35% identity in 245 aa overlap); Q9EX06|2SCG38.13 PUTATIVE HYDROLASE from Streptomyces coelicolor (238 aa), FASTA scores: opt: 195, E(): 8.1e-05, (29.5% identity in 234 aa overlap); Q9I5X4|PA0562 PROBABLE HYDROLASE from Pseudomonas aeruginosa (224 aa), FASTA scores: opt: 190, E(): 0.00015, (27.8% identity in 248 aa overlap); O06995|PGMB_BACSU|YVDM PUTATIVE BETA-PHOSPHOGLUCOMUTASE from Bacillus subtilis (226 aa), FASTA scores: opt: 190, E(): 0.00016, (33.9% identity in 245 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical protein Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c (1327 aa), FASTA scores: opt: 413, E(): 2e-17, (34.9% identity in 238 aa overlap). Interestingly, note that Rv3400 and Rv3401 are similar to beginning and end of Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c with approx. 270 aa missing from the middle." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_217917.1" /db_xref="GI:15610536" /db_xref="GeneID:887918" /translation="MANWYRPNYPEVRSRVLGLPEKVRACLFDLDGVLTDTASLHTKA WKAMFDAYLAERAERTGEKFVPFDPAADYHTYVDGKKREDGVRSFLSSRAIEIPDGSP DDPGAAETVYGLGNRKNDMLHKLLRDDGAQVFDGSRRYLEAVTAAGLGVAVVSSSANT RDVLATTGLDRFVQQRVDGVTLREEHIAGKPAPDSFLRAAELLGVTPDAAAVFEDALS GVAAGRAGNFAVVVGINRTGRAAQAAQLRRHGADVVVTDLAELL" gene 3818042..3820402 /locus_tag="Rv3401" /db_xref="GeneID:887928" CDS 3818042..3820402 /locus_tag="Rv3401" /function="UNKNOWN; PROBABLY ENZYME INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3401, (MTCY78.27c), len: 786 aa. Hypothetical conserved protein, may be an hydrolase or a transferase, equivalent to Q49736|ML0392|B1620_F1_30 HYPOTHETICAL 88.1 KDA PROTEIN from Mycobacterium leprae (792 aa), FASTA scores: opt: 4820, E(): 0, (91.45% identity in 782 aa overlap). Also highly similar to Q9L2I8|SCF42.31c PUTATIVE GLYCOSYL TRANSFERASE from Streptomyces coelicolor (792 aa), FASTA scores: opt: 3060, E(): 2.9e-179, (59.25% identity in 785 aa overlap); and similar to others e.g. Q9K109|NMB0390 MALTOSE PHOSPHORYLASE from Neisseria meningitidis (serogroup B) (752 aa), FASTA scores: opt: 980, E(): 3.5e-52, (29.2% identity in 774 aa overlap); Q9JSW8|MAPA|NMA2098 PUTATIVE MALTOSE PHOSPHORYLASE (EC 2.4.1.8) from Neisseria meningitidis (serogroup A) (752 aa), FASTA scores: opt: 956, E(): 1e-50, (28.4% identity in 764 aa overlap); O06993|YVDK_BACSU HYPOTHETICAL 88.3 KDA PROTEIN (BELONGS TO FAMILY 65 OF GLYCOSYL HYDROLASES) from Bacillus subtilis (757 aa), FASTA scores: opt: 926, E(): 6.9e-49, (28.5% identity in 754 aa overlap); Q9CF04|MAPA MALTOSEPHOSPHORYLASE from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (751 aa), FASTA scores: opt: 907, E(): 1e-47, (26.95% identity in 753 aa overlap); P77154|YCJT_ECOLI|B1316 HYPOTHETICAL 84.9 KDA PROTEIN (BELONGS TO FAMILY 65 OF GLYCOSYL HYDROLASES) from Escherichia coli strain K12 (755 aa), FASTA scores: opt: 392, E(): 2.9e-16, (27.5% identity in 774 aa overlap); etc. Also similar to Mycobacterium tuberculosis hypothetical protein Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c (1327 aa), (27.2% identity in 802 aa overlap); note that Rv3400 and Rv3401 are similar to beginning and end of Q10850|YK06_MYCTU|Rv2006|MT2062|MTCY39.11c with approx. 270 aa missing from the middle." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217918.1" /db_xref="GI:15610537" /db_xref="GeneID:887928" /translation="MITEDAFPVEPWQVRETKLNLNLLAQSESLFALSNGHIGLRGNL DEGEPFGLPGTYLNSFYEIRPLPYAEAGYGYPEAGQTVVDVTNGKIFRLLVGDEPFDV RYGELISHERILDLRAGTLTRRAHWRSPAGKQVKVTSTRLVSLAHRSVAAIEYVVEAI EEFVRVTVQSELVTNEDVPETSADPRVSAILDRPLQAVEHERTERGALLMHRTRASAL MMAAGMEHEVEVPGRVEITTDARPDLARTTVICGLRPGQKLRIVKYLAYGWSSLRSRP ALRDQAAGALHGARYSGWQGLLDAQRAYLDDFWDSADVEVEGDPECQQAVRFGLFHLL QASARAERRAIPSKGLTGTGYDGHAFWDTEGFVLPVLTYTAPHAVADALRWRASTLDL AKERAAELGLEGAAFPWRTIRGQESSAYWPAGTAAWHINADIAMAFERYRIVTGDGSL EEECGLAVLIETARLWLSLGHHDRHGVWHLDGVTGPDEYTAVVRDNVFTNLMAAHNLH TAADACLRHPEAAEAMGVTTEEMAAWRDAADAANIPYDEELGVHQQCEGFTTLAEWDF EANTTYPLLLHEAYVRLYPAQVIKQADLVLAMQWQSHAFTPEQKARNVDYYERRMVRD SSLSACTQAVMCAEVGHLELAHDYAYEAALIDLRDLHRNTRDGLHMASLAGAWTALVV GFGGLRDDEGILSIDPQLPDGISRLRFRLRWRGFRLIVDANHTDVTFILGDGPGTQLT MRHAGQDLTLHTDTPSTIAVRTRKPLLPPPPQPPGREPVHRRALAR" gene complement(3820653..3821891) /locus_tag="Rv3402c" /db_xref="GeneID:887910" CDS complement(3820653..3821891) /locus_tag="Rv3402c" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN CELL PROCESS." /note="Rv3402c, (MTCY78.26), len: 412 aa. Conserved hypothetical protein, probably involved in cell process, similar to various proteins generally involved in extracellular compounds (lipopolysaccharide O-antigen) biosynthesis e.g. O68392|RFBE PEROSAMINE SYNTHETASE from Brucella melitensis (367 aa), FASTA scores: opt: 420, E(): 1.2e-19, (26.15% identity in 375 aa overlap); Q9L6C1 3,4-DEHYDRATASE-LIKE PROTEIN from Streptomyces antibioticus (393 aa), FASTA scores: opt: 419, E(): 1.5e-19, (30.65% identity in 385 aa overlap); Q9RR26|OLENI DEHYDRATASE from Streptomyces antibioticus (393 aa), FASTA scores: opt: 416, E(): 2.3e-19, (30.65% identity in 385 aa overlap); O33942 ERYCIV PROTEIN from Saccharopolyspora erythraea (Streptomyces erythraeus) (401 aa), FASTA scores: opt: 410, E(): 5.6e-19, (31.75% identity in 362 aa overlap); Q9UZI4|ASPB-LIKE1|PAB0774 ASPARTATE AMINOTRANSFERASE (ASPB-LIKE1) from Pyrococcus abyssi (366 aa), FASTA scores: opt: 402, E(): 1.7e-18, (27.05% identity in 377 aa overlap); O88001|WLBC PUTATIVE AMINO-SUGAR BIOSYNTHESIS PROTEIN from Bordetella bronchiseptica (Alcaligenes bronchisepticus) (366 aa), FASTA scores: opt: 394, E(): 5.6e-18, (26.8% identity in 347 aa overlap); Q45378|BPLC DNA FOR LIPOPOLYSACCHARIDE BIOSYNTHESIS from Bordetella pertussis (366 aa), FASTA scores: opt: 393, E(): 6.5e-18, (26.8% identity in 347 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217919.1" /db_xref="GI:15610538" /db_xref="GeneID:887910" /translation="MKIRTLSGSVLEPPSAVRATPGTSMLKLEPGGSTIPKIPFIRPS FPGPAELAEDFVQIAQANWYTNFGPNERRFARALRDYLGPHLHVATLANGTLALLAAL HVSFGAGTRDRYLLMPSFTFVGVAQAALWTGYRPWFIDIDANTWQPCVHSARAVIERF RDRIAGILLANVFGVGNPQISVWEELAAEWELPIVLDSAAGFGSTYADGERLGGRGAC EIFSFHATKPFAVGEGGALVSRDPRLVEHAYKFQNFGLVQTRESIQLGMNGKLSEISA AIGLRQLVGLDRRLASRRKVLECYRTGMADAGVRFQDNANVASLCFASACCTSADHKA AVLGSLRRHAIEARDYYNPPQHRHPYFVTNAELVESTDLAVTADICSRIVSLPVHDHM APDDVARVVAAVQEAEVRGE" gene complement(3822262..3823863) /locus_tag="Rv3403c" /db_xref="GeneID:887907" CDS complement(3822262..3823863) /locus_tag="Rv3403c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3403c, (MTCY78.25), len: 533 aa. Hypothetical unknown protein, but some weak similarity to Q9KJP2 HYPOTHETICAL 54.9 KDA PROTEIN from Myxococcus xanthus (504 aa), FASTA scores: opt: 157, E(): 0.011, (24.1% identity in 548 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217920.1" /db_xref="GI:15610539" /db_xref="GeneID:887907" /translation="MLAFPYLMTMITPPTFDVAFIGSGAACSMTLLEMADALLSSPSA SPKLRIAVVERDEQFWCGIPYGQRSSIGSLAIQKLDDFADEPEKAAYRIWLEQNKQRW LAFFQAEGGAAAARWICDNRDALDGNQWGELYLPRFLFGVFLSEQMIAAIAALGERDL AEIVTIRAEAMSAHSADGHYRIGLRPSGNGPTAIAAGKVVVAIGSPPTKAILASDSEP AFTYINDFYSPGGESNVARLRDSLDRVESWEKRNVLVVGSNATSLEALYLMRHDARIR ARVRSITVISRSGVLPYMICNQPPEFDFPRLRTLLCTEAIAAADLMSAIRDDLATAEE RSLNLADLYDAVAALFGQALHKMDLVQQEEFFCVHGMNFTKLVRRAGRDCRQASEELA ADGTLSLLAGEVLRVDACASGQPFATMTYRAAGAEHTHPVPFAAVVNCGGFEELDTCS SPFLVSAMQNGLCRPNRTNRGLLVNDDFEASPGFCVIGPLVGGNFTPKIRFWHVESAP RVRSLAKSLAASLLASLQPVALAPC" gene complement(3823880..3824584) /locus_tag="Rv3404c" /db_xref="GeneID:887902" CDS complement(3823880..3824584) /locus_tag="Rv3404c" /function="UNKNOWN" /note="Rv3404c, (MTCY78.24), len: 234 aa. Conserved hypothetical protein, some similarity to several METHIONYL-TRNA FORMYLTRANSFERASES e.g. BAB51418|MLL4854 from Rhizobium loti (Mesorhizobium loti) (317 aa), FASTA scores: opt: 210, E(): 1.7e-06, (27.55% identity in 178 aa overlap); P94463|FMT_BACSU from Bacillus subtilis (317 aa), FASTA scores: opt: 199 ,E(): 8.8e-06, (28.25% identity in 177 aa overlap); O51091||FMT_BORBU|BB0064 from Borrelia burgdorferi (Lyme disease spirochete) (312 aa), FASTA scores: opt: 187, E(): 5.2e-05, (30.2% identity in 192 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217921.1" /db_xref="GI:15610540" /db_xref="GeneID:887902" /translation="MTILILTDNVHAHALAVDLQARHGDMDVYQSPIGQLPGVPRCDV AERVAEIVERYDLVLSFHCKQRFPAALIDGVRCVNVHPGFNPYNRGWFPQVFSIIDGQ KVGVTIHEIDDQLDHGPIIAQRECAIESWDSSGSVYARLMDIERELVLEHFDAIRDGS YTAKSPATEGNLNLKKDFEQLRRLDLNERGTFGHFLNRLRALTHDDFRNAWFVDASGR KVFVRVVLEPEKPAEA" gene complement(3824702..3825268) /locus_tag="Rv3405c" /db_xref="GeneID:887940" CDS complement(3824702..3825268) /locus_tag="Rv3405c" /function="MAY BE INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3405c, (MTCY78.23), len: 188 aa. Possible transcriptional regulator, showing weak similarity to other bacterial regulatory proteins e.g. Q9KE70|BH0987 from Bacillus halodurans (203 aa), FASTA scores: opt: 168, E(): 0.0016, (34.8% identity in 92 aa overlap); Q9A5F7|CC2493 Caulobacter crescentus (204 aa), FASTA scores: opt: 160, E(): 0.0051, (32.6% identity in 89 aa overlap); Q9RDR0|SC4A7.02 from Streptomyces coelicolor (227 aa), FASTA scores: opt: 159, E(): 0.0064, (37.0% identity in 189 aa overlap); etc. Also some similarity to hypothetical Mycobacterium tuberculosis regulatory proteins e.g. O05858|Rv3208|MTCY07D11.18c, MTCI125_6, MTCY7D11_18, MTCY10G2_30; etc. Contains potential helix-turn-helix motif from aa 39-60 (+2.97 SD)." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_217922.1" /db_xref="GI:15610541" /db_xref="GeneID:887940" /translation="MTTRPATDRRKMPTGREEVAAAILQAATDLFAERGPAATSIRDI AARSKVNHGLVFRHFGTKDQLVGAVLDHLGTKLTRLLHSEAPADIIERALDRHGRVLA RALLDGYPVGQLQQRFPNVAELLDAVRPRYDSDLGARLAVAHALALQFGWRLFAPMLR SATGIDELTGDELRLSVNDAVARILEPH" gene 3825330..3826217 /locus_tag="Rv3406" /db_xref="GeneID:887955" CDS 3825330..3826217 /locus_tag="Rv3406" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3406, (MTCY78.22c), len: 295 aa. Probable dioxygenase (EC 1.-.-.-), highly similar to Q9WWU|ATSK PUTATIVE ALPHA-KETOGLUTARATE DEPENDENT DIOXYGENASE from Pseudomonas putida (301 aa), FASTA scores: opt: 994, E(): 3.9e-57, (53.7% identity in 283 aa overlap); Q9I6U1|PA0193 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (300 aa), FASTA scores: opt: 1024, E(): 4.4e-59, (53.65% identity in 287 aa overlap); Q9HX81|TAUD|PA3935 TAURINE DIOXYGENASE from Pseudomonas aeruginosa (277 aa), FASTA scores: opt: 599, E(): 1.4e-31, (39.35% identity in 277 aa overlap); and similar to other dioxygenases e.g. AAG54718|TAUD (alias BAB33845|ECS0422) TAURINE DIOXYGENASE 2-OXOGLUTARATE-DEPENDENT from Escherichia coli strain O157:H7 (283 aa), FASTA scores: opt: 595, E(): 2.5e-31, (38.1% identity in 281 aa overlap); etc. BELONGS TO THE TFDA FAMILY OF DIOXYGENASES." /codon_start=1 /transl_table=11 /product="dioxygenase" /protein_id="NP_217923.1" /db_xref="GI:15610542" /db_xref="GeneID:887955" /translation="MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKV VFFRGQHQLDDAEQLAFAGLLGTPIGHPAAIALADDAPIITPINSEFGKANRWHTDVT FAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWALHTNRYDYVT TKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLAGDFVRSFVGLDSHESRVL FEVLQRRITMPENTIRWNWAPGDVAIWDNRATQHRAIDDYDDQHRLMHRVTLMGDVPV DVYGQASRVISGAPMEIAG" gene 3826252..3826551 /locus_tag="Rv3407" /db_xref="GeneID:887505" CDS 3826252..3826551 /locus_tag="Rv3407" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3407, (MTCY78.21c), len: 99 aa. Hypothetical protein, similar to other hypothetical proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK46285|MT2013 (90 aa), FASTA scores: opt: 160, E(): 0.00021, (37.1% identity in 89 aa overlap); O50412|Rv3385c|MTV004.43c (102 aa), FASTA scores: opt: 155, E(): 0.00051, (41.05% identity in 78 aa overlap), MTCY19H5.26, MTCY20H10.07, MTI376.09c, MTCY427.21, etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217924.1" /db_xref="GI:15610543" /db_xref="GeneID:887505" /translation="MRATVGLVEAIGIRELRQHASRYLARVEAGEELGVTNKGRLVAR LIPVQAAERSREALIESGVLIPARRPQNLLDVTAEPARGRKRTLSDVLNEMRDEQ" gene 3826548..3826958 /locus_tag="Rv3408" /db_xref="GeneID:887900" CDS 3826548..3826958 /locus_tag="Rv3408" /function="UNKNOWN" /note="Rv3408, (MTCY78.20c), len: 136 aa. Hypothetical protein, similar to other hypothetical proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O50411|Rv3384c|MTV004.42c (130 aa), FASTA scores: opt: 243, E(): 1.7e-09, (35.1% identity in 131 aa overlap); P95252|Rv1962c|MTCY09F9.02 (135 aa), FASTA scores: opt: 191, E(): 5e-06, (35.5% identity in 138 aa overlap), etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217925.1" /db_xref="GI:15610544" /db_xref="GeneID:887900" /translation="MIYMDTSALTKLLISEPETTELRTWLTAQSGQGEDAATSTLGRV ESMRVVARYGQPGQTERARYLLDGLDILPLTEPVIGLAETIGPATLRSLDAIHLAAAA QIKRELTAFVTYDHRLLSGCREVGFVTASPGAVR" gene complement(3826991..3828727) /gene="choD" /locus_tag="Rv3409c" /db_xref="GeneID:887502" CDS complement(3826991..3828727) /gene="choD" /locus_tag="Rv3409c" /EC_number="1.1.3.6" /function="INVOLVED IN CHOLESTEROL METABOLISM [CATALYTIC ACTIVITY: CHOLESTEROL + O(2) = CHOLEST-4-EN-3-ONE + H(2)O(2)]." /experiment="experimental evidence, no additional details recorded" /note="Rv3409c, (MTCY78.19), len: 578 aa. Probable choD, cholesterol oxidase precursor (EC 1.1.3.6), equivalent to Q9CCV1|CHOD|ML0389 (alias Q59530|CHOD|B1620_C3_240) PUTATIVE CHOLESTEROL OXIDASE from Mycobacterium leprae (569 aa), FASTA scores: opt: 3510, E(): 3.8e-198, (88.6% identity in 569 aa overlap). Also highly similar to Q9L0H6|SCD63.13 PUTATIVE CHOLESTEROL OXIDASE from Streptomyces coelicolor (602 aa), FASTA scores: opt: 1101, E(): 5.2e-57, (60.05% identity in 586 aa overlap); and similar to other oxidoreductases e.g. Q9A7T6|CC1634 OXIDOREDUCTASE (GMC FAMILY) from Caulobacter crescentus (579 aa), FASTA scores: opt: 221, E(): 1.8e-05, (25.2% identity in 583 aa overlap). BELONGS TO THE GMC OXIDOREDUCTASES FAMILY. COFACTOR: FAD FLAVOPROTEIN. Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="cholesterol oxidase precursor" /protein_id="NP_217926.1" /db_xref="GI:15610545" /db_xref="GeneID:887502" /translation="MKPDYDVLIIGSGFGGSVTALRLTEKGYRVGVLEAGRRFSDEEF AKTSWDLRKFLWAPRLGCYGIQRIHPLRNVMILAGAGVGGGSLNYANTLYVPPEPFFA DQQWSHITDWRGELMPHYQQAQRMLGVVQNPTFTDADRIVKEVADEMGFGDTWVPTPV GVFFGPDGTKTPGKTVPDPYFGGAGPARTGCLECGCCMTGCRHGAKNTLVKNYLGLAE SAGAQVIPMTTVKGFERRSDGLWEVRTVRTGSWLRRDRRTFTATQLVLAAGTWGTQHL LFKMRDRGRLPGLSKRLGVLTRTNSESIVGAATLKVNPDLDLTHGVAITSSIHPTADT HIEPVRYGKGSNAMGLLQTLMTDGSGPQGTDVPRWRQLLQTASQDPRGTIRMLNPRQW SERTVIALVMQHLDNSITTFTKRGKLGIRWYSSKQGHGEPNPTWIPIGNQVTRRIAAK IDGVAGGTWGELFNIPLTAHFLGGAVIGDDPEHGVIDPYHRVYGYPTLYVVDGAAISA NLGVNPSLSIAAQAERAASLWPNKGETDRRPPQGEPYRRLAPIQPAHPVVPADAPGAL RWLPIDPVSNAG" misc_feature complement(3828203..3828226) /gene="choD" /locus_tag="Rv3409c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3828783..3829910) /gene="guaB3" /locus_tag="Rv3410c" /db_xref="GeneID:887510" CDS complement(3828783..3829910) /gene="guaB3" /locus_tag="Rv3410c" /EC_number="1.1.1.205" /function="CATALYSES THE FIRST REACTION UNIQUE TO GMP BIOSYNTHESIS [CATALYTIC ACTIVITY: INOSINE 5'-PHOSPHATE + NAD(+) + H(2)O = XANTHOSINE 5'-PHOSPHATE + NADH]." /note="catalyzes the synthesis of xanthosine monophosphate by the NAD+ dependent oxidation of inosine monophosphate" /codon_start=1 /transl_table=11 /product="inosine 5-monophosphate dehydrogenase" /protein_id="NP_217927.1" /db_xref="GI:15610546" /db_xref="GeneID:887510" /translation="MVEIGMGRTARRTYELSEISIVPSRRTRSSKDVSTAWQLDAYRF EIPVVAHPTDALVSPEFAIELGRLGGLGVLNGEGLIGRHLDVEAKIAQLLEAAAADPE PSTAIRLLQELHAAPLNPDLLGAAVARIREAGVTTAVRVSPQNAQWLTPVLVAAGIDL LVIQGTIVSAERVASDGEPLNLKTFISELDIPVVAGGVLDHRTALHLMRTGAAGVIVG YGSTQGVTTTDEVLGISVPMATAIADAAAARRDYLDETGGRYVHVLADGDIHTSGELA KAIACGADAVVLGTPLAESAEALGEGWFWPAAAAHPSLPRGALLQIAVGERPPLARVL GGPSDDPFGGLNLVGGLRRSMAKAGYCDLKEFQKVGLTVGG" gene complement(3829930..3831519) /gene="guaB2" /locus_tag="Rv3411c" /db_xref="GeneID:887498" CDS complement(3829930..3831519) /gene="guaB2" /locus_tag="Rv3411c" /EC_number="1.1.1.205" /function="CATALYSES THE FIRST REACTION UNIQUE TO GMP BIOSYNTHESIS [CATALYTIC ACTIVITY: INOSINE 5'-PHOSPHATE + NAD(+) + H(2)O = XANTHOSINE 5'-PHOSPHATE + NADH]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the synthesis of xanthosine monophosphate by the NAD+ dependent oxidation of inosine monophosphate" /codon_start=1 /transl_table=11 /product="inosine 5'-monophosphate dehydrogenase" /protein_id="NP_217928.1" /db_xref="GI:15610547" /db_xref="GeneID:887498" /translation="MSRGMSGLEDSSDLVVSPYVRMGGLTTDPVPTGGDDPHKVAMLG LTFDDVLLLPAASDVVPATADTSSQLTKKIRLKVPLVSSAMDTVTESRMAIAMARAGG MGVLHRNLPVAEQAGQVEMVKRSEAGMVTDPVTCRPDNTLAQVDALCARFRISGLPVV DDDGALVGIITNRDMRFEVDQSKQVAEVMTKAPLITAQEGVSASAALGLLRRNKIEKL PVVDGRGRLTGLITVKDFVKTEQHPLATKDSDGRLLVGAAVGVGGDAWVRAMMLVDAG VDVLVVDTAHAHNRLVLDMVGKLKSEVGDRVEVVGGNVATRSAAAALVDAGADAVKVG VGPGSICTTRVVAGVGAPQITAILEAVAACRPAGVPVIADGGLQYSGDIAKALAAGAS TAMLGSLLAGTAEAPGELIFVNGKQYKSYRGMGSLGAMRGRGGATSYSKDRYFADDAL SEDKLVPEGIEGRVPFRGPLSSVIHQLTGGLRAAMGYTGSPTIEVLQQAQFVRITPAG LKESHPHDVAMTVEAPNYYAR" misc_feature complement(3830491..3830529) /gene="guaB2" /locus_tag="Rv3411c" /note="PS00487 IMP dehydrogenase / GMP reductase signature" gene 3831726..3832136 /locus_tag="Rv3412" /db_xref="GeneID:887499" CDS 3831726..3832136 /locus_tag="Rv3412" /function="UNKNOWN" /note="Rv3412, (MTCY78.16c), len: 136 aa. Hypothetical protein, strongly similar to Q49742|YY12_MYCLE|ML0386|B1620_F3_131 HYPOTHETICAL 15.3 KDA PROTEIN from Mycobacterium leprae (137 aa), FASTA scores: opt: 933, E(): 6.3e-52, (93.4% identity in 136 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217929.1" /db_xref="GI:15610548" /db_xref="GeneID:887499" /translation="MRDHLPPGLPPDPFADDPCDPSAALEAVEPGQPLDQQERMAVEA DLADLAVYEALLAHKGIRGLVVCCDECQQDHYHDWDMLRSNLLQLLIDGTVRPHEPAY DPEPDSYVTWDYCRGYADASLNEAAPDADRFRRR" gene complement(3832146..3833045) /locus_tag="Rv3413c" /db_xref="GeneID:887925" CDS complement(3832146..3833045) /locus_tag="Rv3413c" /function="UNKNOWN" /note="Rv3413c, (MTCY78.16), len: 299 aa. Hypothetical unknown ala-, pro-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217930.1" /db_xref="GI:15610549" /db_xref="GeneID:887925" /translation="MREFGNPLGDRPPLDELARTDLLLDALAEREEVDFADPRDDALA ALLGQWRDDLRWPPASALVSQDEAVAALRAGVAQRRRARRSLAAVGSVAAALLVLSGF GAVVADARPGDLLYGLHAMMFNRSRVSDDQIVLSAKANLAKVEQMIAQGQWAEAQDEL AEVSSTVQAVTDGSRRQDLINEVNLLNTKVETRDPNATLRPGSPSNPAAPGSVGNSWT PLAPVVEPPTPPTPASAAEPSMSAGVSESPMPNSTSTVAASPSTPSSKPEPGSIDPSL EPADEATNPAGQPAPETPVSPTH" gene complement(3833038..3833676) /gene="sigD" /locus_tag="Rv3414c" /db_xref="GeneID:887594" CDS complement(3833038..3833676) /gene="sigD" /locus_tag="Rv3414c" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED." /experiment="experimental evidence, no additional details recorded" /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription; this protein is involved in expression of ribosome-associated gene products in stationary phase" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigD" /protein_id="NP_217931.1" /db_xref="GI:15610550" /db_xref="GeneID:887594" /translation="MVDPGVSPGCVRFVTLEISPSMTMQGERLDAVVAEAVAGDRNAL REVLETIRPIVVRYCRARVGTVERSGLSADDVAQEVCLATITALPRYRDRGRPFLAFL YGIAAHKVADAHRAAGRDRAYPAETLPERWSADAGPEQMAIEADSVTRMNELLEILPA KQREILILRVVVGLSAEETAAAVGSTTGAVRVAQHRALQRLKDEIVAAGDYA" gene complement(3833694..3834521) /locus_tag="Rv3415c" /db_xref="GeneID:887599" CDS complement(3833694..3834521) /locus_tag="Rv3415c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3415c, (MTCY78.14), len: 275 aa. Conserved hypothetical protein, equivalent to Q9CCV3|ML0383 HYPOTHETICAL PROTEIN from Mycobacterium leprae (281 aa), FASTA scores: opt: 1278, E(): 4.2e-71, (73.5% identity in 279 aa overlap). Also some similarity with P71677|RIBD_MYCTU|RIBG|Rv1409|MT1453|MTCY21B4.26 RIBOFLAVIN BIOSYNTHESIS PROTEIN R (339 aa), FASTA scores: opt: 143, E(): 0.13, (28.25% identity in 184 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217932.1" /db_xref="GI:15610551" /db_xref="GeneID:887599" /translation="MNETPHAPVVEQVLVAAAFGNQPGSWPLPTAITPHHLWLRAVAA GGQGRYAHAYGDLSVLRRLVPAGPLASLAHSTQGSLLRQLGWHTLARGWDGRALALAG ADREAGADALIGLAADALGVGRFAAAGALLDRADPLVVSPLVADRLAVRRRWVAAELA MATGDGATAVRHAEEAVELTQAMAVASARHRVKSDVVLAAALCSAGAVARARAVGEEA LDATARFGLLPLRWALACLLIDIGTVTFSAQQLRELTKIRNICAGQVRRAGGCWRTA" gene 3834892..3835200 /gene="whiB3" /locus_tag="Rv3416" /db_xref="GeneID:887598" CDS 3834892..3835200 /gene="whiB3" /locus_tag="Rv3416" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM (GROWTH PHASE-DEPENDENT)." /experiment="experimental evidence, no additional details recorded" /note="Rv3416, (MTCY78.13c), len: 102 aa. whiB3 (alternate gene name: whmB), WhiB-like regulatory protein (see citations below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to Q49871|WHIB3|WHIB|ML0382|B229_F1_2|B1620_F3_137 PROBABLE TRANSCRIPTION FACTOR WHIB3 from Mycobacterium leprae (102 aa), FASTA scores: opt: 657, E(): 7.9e-39, (86.25% identity in 102 aa overlap). Also highly similar to Q9Z6E9|WHIB3 from Mycobacterium smegmatis (96 aa), FASTA scores: opt: 604, E(): 3.5e-35, (80.4% identity in 102 aa overlap); and O88103|WHID|SC6G4.45c|WBLB from Streptomyces coelicolor (112 aa), FASTA scores: opt: 437, E(): 1.4e-23, (62.5% identity in 96 aa overlap). Also similar to O05847|WHIB1|Rv3219|MTCY07D11.07c from Mycobacterium tuberculosis (84 aa), FASTA scores: opt: 215, E(): 2.5e-08, (44.45% identity in 81 aa overlap). Note that primer extension analysis revealed three transcriptional start sites and that expression from the three potential promoters is growth phase-dependent (see Mulder et al., 1999). Moreover, the transcription of this CDS seems to be activated in macrophages (see Ramakrishnan et al., 2000).; whmB" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB3" /protein_id="NP_217933.1" /db_xref="GI:15610552" /db_xref="GeneID:887598" /translation="MPQPEQLPGPNADIWNWQLQGLCRGMDSSMFFHPDGERGRARTQ REQRAKEMCRRCPVIEACRSHALEVGEPYGVWGGLSESERDLLLKGTMGRTRGIRRTA" gene complement(3835272..3836891) /gene="groEL" /locus_tag="Rv3417c" /db_xref="GeneID:887877" CDS complement(3835272..3836891) /gene="groEL" /locus_tag="Rv3417c" /function="PREVENTS MISFOLDING AND PROMOTES THE REFOLDING AND PROPER ASSEMBLY OF UNFOLDED POLYPEPTIDES GENERATED UNDER STRESS CONDITIONS." /experiment="experimental evidence, no additional details recorded" /note="60 kDa chaperone family; promotes refolding of misfolded polypeptides especially under stressful conditions; forms two stacked rings of heptamers to form a barrel-shaped 14mer; ends can be capped by GroES; misfolded proteins enter the barrel where they are refolded when GroES binds; many bacteria have multiple copies of the groEL gene which are active under different environmental conditions; the B.japonicum protein in this cluster is expressed constitutively; in Rhodobacter, Corynebacterium and Rhizobium this protein is not essential for growth" /codon_start=1 /transl_table=11 /product="chaperonin GroEL" /protein_id="NP_217934.1" /db_xref="GI:15610553" /db_xref="GeneID:887877" /translation="MSKLIEYDETARRAMEVGMDKLADTVRVTLGPRGRHVVLAKAFG GPTVTNDGVTVAREIELEDPFEDLGAQLVKSVATKTNDVAGDGTTTATILAQALIKGG LRLVAAGVNPIALGVGIGKAADAVSEALLASATPVSGKTGIAQVATVSSRDEQIGDLV GEAMSKVGHDGVVSVEESSTLGTELEFTEGIGFDKGFLSAYFVTDFDNQQAVLEDALI LLHQDKISSLPDLLPLLEKVAGTGKPLLIVAEDVEGEALATLVVNAIRKTLKAVAVKG PYFGDRRKAFLEDLAVVTGGQVVNPDAGMVLREVGLEVLGSARRVVVSKDDTVIVDGG GTAEAVANRAKHLRAEIDKSDSDWDREKLGERLAKLAGGVAVIKVGAATETALKERKE SVEDAVAAAKAAVEEGIVPGGGASLIHQARKALTELRASLTGDEVLGVDVFSEALAAP LFWIAANAGLDGSVVVNKVSELPAGHGLNVNTLSYGDLAADGVIDPVKVTRSAVLNAS SVARMVLTTETVVVDKPAKAEDHDHHHGHAH" misc_feature complement(3835650..3835685) /gene="groEL" /locus_tag="Rv3417c" /note="PS00296 Chaperonins cpn60 signature" misc_feature complement(3836469..3836492) /gene="groEL" /locus_tag="Rv3417c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3836986..3837288) /gene="groES" /locus_tag="Rv3418c" /db_xref="GeneID:887583" CDS complement(3836986..3837288) /gene="groES" /locus_tag="Rv3418c" /function="BINDS TO CPN60 IN THE PRESENCE OF MG-ATP AND SUPPRESSES THE ATPASE ACTIVITY OF THE LATTER." /experiment="experimental evidence, no additional details recorded" /note="10 kDa chaperonin; Cpn10; GroES; forms homoheptameric ring; binds to one or both ends of the GroEL double barrel in the presence of adenine nucleotides capping it; folding of unfolded substrates initiates in a GroEL-substrate bound and capped by GroES; release of the folded substrate is dependent on ATP binding and hydrolysis in the trans ring" /codon_start=1 /transl_table=11 /product="co-chaperonin GroES" /protein_id="NP_217935.1" /db_xref="GI:15610554" /db_xref="GeneID:887583" /translation="MAKVNIKPLEDKILVQANEAETTTASGLVIPDTAKEKPQEGTVV AVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGGTEIKYNGEEYLILSARDVLAVVSK" misc_feature complement(3837199..3837273) /gene="groES" /locus_tag="Rv3418c" /note="PS00681 Chaperonins cpn10 signature" gene complement(3837555..3838589) /gene="gcp" /locus_tag="Rv3419c" /db_xref="GeneID:887595" CDS complement(3837555..3838589) /gene="gcp" /locus_tag="Rv3419c" /EC_number="3.4.24.57" /function="HYDROLYSIS OF O-SIALOGLYCOPROTEINS; CLEAVES 31-ARG-|-ASP-32 BOND IN GLYCOPHORIN A. DOES NOT CLEAVE UNGLYCOSYLATED PROTEINS, DESIALYLATED GLYCOPROTEINS OR GLYCOPROTEINS THAT ARE ONLY N-GLYCOSYLATED. COULD BE A METALLOPROTEASE." /note="in most organisms, only the N-terminal domain is present in a single polypeptide; in some archaea this domain is fused to a kinase domain; this gene is essential for growth in Escherichia coli and Bacillus subtilis; the secreted glycoprotease from Pasteurella haemolytica showed specificity for O-sialoglycosylated proteins; the Pyrococcus structure shows DNA-binding properties, iron-binding, ATP-binding, and AP endonuclease activity" /codon_start=1 /transl_table=11 /product="putative DNA-binding/iron metalloprotein/AP endonuclease" /protein_id="NP_217936.1" /db_xref="GI:15610555" /db_xref="GeneID:887595" /translation="MTTVLGIETSCDETGVGIARLDPDGTVTLLADEVASSVDEHVRF GGVVPEIASRAHLEALGPAMRRALAAAGLKQPDIVAATIGPGLAGALLVGVAAAKAYS AAWGVPFYAVNHLGGHLAADVYEHGPLPECVALLVSGGHTHLLHVRSLGEPIIELGST VDDAAGEAYDKVARLLGLGYPGGKALDDLARTGDRDAIVFPRGMSGPADDRYAFSFSG LKTAVARYVESHAADPGFRTADIAAGFQEAVADVLTMKAVRAATALGVSTLLIAGGVA ANSRLRELATQRCGEAGRTLRIPSPRLCTDNGAMIAAFAAQLVAAGAPPSPLDVPSDP GLPVMQGQVR" misc_feature complement(3838233..3838295) /gene="gcp" /locus_tag="Rv3419c" /note="PS01016 Glycoprotease family signature" gene complement(3838586..3839062) /gene="rimI" /locus_tag="Rv3420c" /db_xref="GeneID:887555" CDS complement(3838586..3839062) /gene="rimI" /locus_tag="Rv3420c" /EC_number="2.3.1.128" /function="THIS ENZYME ACETYLATES THE N-TERMINAL ALANINE OF RIBOSOMAL PROTEIN S18 [CATALYTIC ACTIVITY: ACETYL-COA + RIBOSOMAL-PROTEIN L-ALANINE = CoA + RIBOSOMAL-PROTEIN N-ACETYL-L-ALANINE]." /note="Rv3420c, (MTCY78.09), len: 158 aa. Probable rimI, ribosomal-protein-alanine acetyltransferase (EC 2.3.1.128), equivalent to C-terminal part of Q49857|YY21_MYCLE|ML0378|B229_C1_170 HYPOTHETICAL 38.0 KDA PROTEIN from Mycobacterium leprae (359 aa), FASTA scores: opt: 772, E(): 2.7e-44, (72.1% identity in 154 aa overlap). Similar notably to ribosomal-protein-alanine acetyltransferases e.g. Q9AC11|CC0058 from Caulobacter crescentus (150 aa), FASTA scores: opt: 223, E(): 4.9e-08, (37.5% identity in 136 aa overlap); Q9KFD4|BH0547 from Bacillus halodurans (151 aa), FASTA scores: opt: 222, E(): 5.8e-08, (35.2% identity in 142 aa overlap); Q9PG61|XF0441 from Xylella fastidiosa (156 aa), FASTA scores: opt: 207, E(): 5.9e-07, (32.2% identity in 149 aa overlap); Q9HVB7|RIMI|PA4678 from Pseudomonas aeruginosa (150 aa), FASTA scores: opt: 203, E(): 1.1e-06, (32.45% identity in 151 aa overlap); P09453|RIMI_ECOLI|B4373 from Escherichia coli strain K12 (148 aa), FASTA scores: opt: 196, E(): 3.1e-06, (33.55% identity in 149 aa overlap); etc. BELONGS TO THE ACETYLTRANSFERASE FAMILY, RIMI SUBFAMILY." /codon_start=1 /transl_table=11 /product="ribosomal-protein-alanine acetyltransferase" /protein_id="NP_217937.1" /db_xref="GI:15610556" /db_xref="GeneID:887555" /translation="MTADTEPVTIGALTRADAQRCAELEAQLFVGDDPWPPAAFNREL ASPHNHYVGARSGGTLVGYAGISRLGRTPPFEYEVHTIGVDPAYQGRGIGRRLLRELL DFARGGVVYLEVRTDNDAALALYRSVGFQRVGLRRRYYRVSGADAYTMRRDSGDPS" gene complement(3839059..3839694) /locus_tag="Rv3421c" /db_xref="GeneID:887532" CDS complement(3839059..3839694) /locus_tag="Rv3421c" /function="UNKNOWN" /note="Rv3421c, (MTCY78.08), len: 211 aa. Conserved hypothetical protein, equivalent to Q49857|YY21_MYCLE|ML0378|B229_C1_170 HYPOTHETICAL 38.0 KDA PROTEIN from Mycobacterium leprae (359 aa), FASTA scores: opt: 1000, E(): 1.8e-50, (75.6% identity in 205 aa overlap). Also similar to other hypothetical bacterial proteins e.g. O86791|SC6G4.28 from Streptomyces coelicolor (217 aa), FASTA scores: opt: 453, E(): 3.3e-19, (48.1% identity in 212 aa overlap); Q9AC10|CC0059 (GLYCOPROTEASE FAMILY PROTEIN) from Caulobacter crescentus (211 aa), FASTA scores: opt: 248, E(): 2e-07, (34.3% identity in 210 aa overlap); Q9KQK9|VC1989 from Vibrio cholerae (237 aa), FASTA scores: opt: 238, E(): 8.2e-07, (28.85% identity in 208 aa overlap); BAB51966|Mlr5530 from Rhizobium loti (Mesorhizobium loti) (225 aa), FASTA scores: opt: 237, E(): 9e-07, (35.0% identity in 220 aa overlap); etc. Some similarity to upstream Q50709|GCP_MYCTU|Rv3419c|MT3528|MTCY78.10 from Mycobacterium tuberculosis (344 aa), (33.9% identity in 127 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217938.1" /db_xref="GI:15610557" /db_xref="GeneID:887532" /translation="MSRVQISTVLAIDTATPAVTAGIVRRHDLVVLGERVTVDARAHA ERLTPNVLAALADAALTMADLDAVVVGCGPGPFTGLRAGMASAAAYGHALGIPVYGVC SLDAIGGQTIGDTLVVTDARRREVYWARYCDGIRTVGPAVNAAADVDPGPALAVAGAP EHAALFALPCVEPSRPSPAGLVAAVNWADKPAPLVPLYLRRPDAKPLAVCT" gene complement(3839691..3840197) /locus_tag="Rv3422c" /db_xref="GeneID:887557" CDS complement(3839691..3840197) /locus_tag="Rv3422c" /function="UNKNOWN" /note="Rv3422c, (MTCY78.07), len: 168 aa. Conserved hypothetical protein, equivalent to Q49864|YY22_MYCLE|ML0377|U229F|B229_C2_205 HYPOTHETICAL 17.6 KDA PROTEIN from Mycobacterium leprae (161 aa), FASTA scores: opt: 752, E(): 8.3e-38, (77.4% identity in 146 aa overlap). Also similar to other hypothetical bacterial proteins e.g. O86788|YJEE_STRCO|SC6G4.25 from Streptomyces coelicolor (148 aa), FASTA scores: opt: 377, E(): 1.2e-15, (50.85% identity in 120 aa overlap); Q9X1W7|TM1632 from Thermotoga maritima (161 aa), FASTA scores: opt: 247, E(): 6.2e-08, (39.4% identity in 137 aa overlap); Q9RRY1|DR2351 from Deinococcus radiodurans (148 aa), FASTA scores: opt: 236, E(): 2.6e-07, (38.6% identity in 127 aa overlap); etc. Contains PS00017 ATP /GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217939.1" /db_xref="GI:15610558" /db_xref="GeneID:887557" /translation="MSREGIRRRPKARAGLTGGGTATLPRVEDTLTLGSRLGEQLCAG DVVVLSGPLGAGKTVLAKGIAMAMDVEGPITSPTFVLARMHRPRRPGTPAMVHVDVYR LLDHNSADLLSELDSLDLDTDLEDAVVVVEWGEGLAERLSQRHLDVRLERVSHSDTRI ATWSWGRS" misc_feature complement(3840024..3840047) /locus_tag="Rv3422c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3840194..3841420) /gene="alr" /locus_tag="Rv3423c" /db_xref="GeneID:887634" CDS complement(3840194..3841420) /gene="alr" /locus_tag="Rv3423c" /EC_number="5.1.1.1" /function="PROVIDES THE D-ALANINE REQUIRED FOR CELL WALL BIOSYNTHESIS. TRANSFORMS L-ALANINE to D-ALANINE [CATALYTIC ACTIVITY: L-ALANINE = D-ALANINE]" /experiment="experimental evidence, no additional details recorded" /note="converts L-alanine to D-alanine which is used in cell wall biosynthesis; binds one pyridoxal phosphate per monomer; forms a homodimer" /codon_start=1 /transl_table=11 /product="alanine racemase" /protein_id="NP_217940.1" /db_xref="GI:15610559" /db_xref="GeneID:887634" /translation="MKRFWENVGKPNDTTDGRGTTSLAMTPISQTPGLLAEAMVDLGA IEHNVRVLREHAGHAQLMAVVKADGYGHGATRVAQTALGAGAAELGVATVDEALALRA DGITAPVLAWLHPPGIDFGPALLADVQVAVSSLRQLDELLHAVRRTGRTATVTVKVDT GLNRNGVGPAQFPAMLTALRQAMAEDAVRLRGLMSHMVYADKPDDSINDVQAQRFTAF LAQAREQGVRFEVAHLSNSSATMARPDLTFDLVRPGIAVYGLSPVPALGDMGLVPAMT VKCAVALVKSIRAGEGVSYGHTWIAPRDTNLALLPIGYADGVFRSLGGRLEVLINGRR CPGVGRICMDQFMVDLGPGPLDVAEGDEAILFGPGIRGEPTAQDWADLVGTIHYEVVT SPRGRITRTYREAENR" gene complement(3841714..3842076) /locus_tag="Rv3424c" /db_xref="GeneID:887618" CDS complement(3841714..3842076) /locus_tag="Rv3424c" /function="UNKNOWN" /note="Rv3424c, (MTCY78.05), len: 120 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217941.1" /db_xref="GI:15610560" /db_xref="GeneID:887618" /translation="MPNPVTMLYGRKADLVILPHVLAEERPHPYSTPGRKRGAQIALT TGIDALASFAPQIVNPRHGLSRVVQCLGGCENKRHAYFRSISKTPHIRARGVPSVCAV RTVGVDGAKRPPKPIPVQ" gene 3842239..3842769 /gene="PPE57" /locus_tag="Rv3425" /db_xref="GeneID:887635" CDS 3842239..3842769 /gene="PPE57" /locus_tag="Rv3425" /function="UNKNOWN" /note="Rv3425, (MTCY78.04c), len: 176 aa. Member of the M. tuberculosis PPE family, similar to many e.g. O06246|Rv3429|MTCY77.01 (178 aa), FASTA scores: opt: 781, E(): 7e-44, (69.9% identity in 176 aa overlap); and downstream Q50702|YY26_MYCTU|Rv3426|MTCY78.03c (232 aa), FASTA scores: opt: 517, E(): 1.2e-26, (68.0% identity in 125 aa overlap); MTV049_11, MTCY428_16, MTV049_22, MTV049_30, MTCY261_4; etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177971.1" /db_xref="GI:57117105" /db_xref="GeneID:887635" /translation="MHPMIPAEYISNIIYEGPGADSLFFASGQLRELAYSVETTAESL EDELDELDENWKGSSSDLLADAVERYLQWLSKHSSQLKHAAWVINGLANAYNDTRRKV VPPEEIAANREERRRLIASNVAGVNTPAIADLDAQYDQYRARNVAVMNAYVSWTRSAL SDLPRWREPPQIYRGG" gene 3843036..3843734 /gene="PPE58" /locus_tag="Rv3426" /db_xref="GeneID:887622" CDS 3843036..3843734 /gene="PPE58" /locus_tag="Rv3426" /function="UNKNOWN" /note="Rv3426, (MTCY78.03c), len: 232 aa. Member of the M. tuberculosis PPE family, similar to many e.g. the downstream O06246|Rv3429|MTCY77.01 (178 aa), FASTA scores: opt: 555, E(): 6.5e-26, (72.0% identity in 125 aa overlap); and upstream Q50703|YY25_MYCTU|Rv3425|MTCY78.04c (176 aa), FASTA scores: opt: 517, E(): 1.1e-23, (68.0% identity in 125 aa overlap); MTV049_30, MTCY3C7_24, MTCY428_16, MTCY3A2_22; etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177972.1" /db_xref="GI:57117106" /db_xref="GeneID:887622" /translation="MHLMIPAEYISNVIYEGPRADSLYAADQRLRQLADSVRTTAESL NTTLDELHENWKGSSSEWMADAALRYLDWLSKHSRQILRTARVIESLVMAYEETLLRV VPPATIANNREEVRRLIASNVAGGKHSSNRRPRGTIRAVPGRKYPSNGPLSKLDPICA IEAAPMAGAAADPQERVGPRGRRGLAGQQQCRGRPGPSLRCSHDTPRFQMNQAFHTMV NMLLTCFACQEKPR" gene complement(3843885..3844640) /locus_tag="Rv3427c" /db_xref="GeneID:887631" CDS complement(3843885..3844640) /locus_tag="Rv3427c" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1532." /note="Rv3427c, (MTCY78.02), len: 251 aa. Possible transposase, similar to other e.g. Q9APG8|ORF2 PUTATIVE TRANSPOSASE SUBUNIT 2 from Pseudomonas putida (251 aa), FASTA scores: opt: 479, E(): 1.8e-21, (34.85% identity in 238 aa overlap). Contains PS00017 ATP/GTP-binding site motif A." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217944.1" /db_xref="GI:15610563" /db_xref="GeneID:887631" /translation="MSICDPALRNALRTLKLSGMLDTLDARLAQTRNGDLGHLEFLQA LREDEIARRESAALTRRLRRAKFEAQATFEDFDFTANPKLPGAMLRDLAALRWLDAGE SVILHGPVGVGKTHVAQALVHAVARRGGDVRFAKTSRMLSDLAGGHADRSWGQRIREY TKPLVLILDDFAMREHTAMHADDLYELISDRAITGKPLILTSNRAPNNWYGLFPNPVV AESLLDRLINTSHQILMDGPSYRPRKRPGRTTS" repeat_region complement(3843888..3845970) /note="IS1532, len: 2083 bp. Insertion sequence IS1532." /mobile_element="insertion sequence:IS1532" misc_feature complement(3844296..3844319) /locus_tag="Rv3427c" /note="PS00017 ATP/GTP-binding site motif A" gene complement(3844738..3845970) /locus_tag="Rv3428c" /db_xref="GeneID:887621" CDS complement(3844738..3845970) /locus_tag="Rv3428c" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1532." /note="Rv3428c, (MTCY78.01, len: 410 aa. Possible transposase INSERTION SEQUENCE, similar to others e.g. Q9APG9|ORF1 from Pseudomonas putida (509 aa), FASTA scores: opt: 578, E(): 1.1e-29, (32.45% identity in 376 aa overlap); P55379|Y4BL_RHISN from Rhizobium sp. strain NGR234 (516 aa), FASTA scores: opt: 665, E(): 2.7e-35, (35.3% identity in 391 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217945.1" /db_xref="GI:15610564" /db_xref="GeneID:887621" /translation="MATIAQRLRDDHGVAASESSVRRWIATHFAEEVARERVTVPRGP VDAGSEAQIDYGRLGMWFDPATARRVAVWAFVMVLAFSRHLFVRPVIRMDQTAWCACH VAAFEFFDGVPARLVCDNLRTGVDKPDLYDPQINRSYAELASHYATLVDPARARKPKD KPRVERPMTYVRDSFWKGREFDSLAQMQQAAVTWSTEVAGLRYLRALEGAQPLRMFEA VEQQALIALPPRAFELTSWSIGTVGVDTHLKVGKALYSVPWRLIGQRLHARTAGDVVQ IFAGNDVVATHVRRPSGRSTDFSHYPPEKIAFHMRTPTWCRHTAELVGPASQQVIAEF MRDNAIHHLRSAQGVLGLRDKHGCDRLEAACARAIEVGDPSYRTIKGILVAGTEHAAN EPTTSSPASTAGGVPARP" gene 3847165..3847701 /gene="PPE59" /locus_tag="Rv3429" /db_xref="GeneID:887630" CDS 3847165..3847701 /gene="PPE59" /locus_tag="Rv3429" /function="UNKNOWN" /note="Rv3429, (MTCY77.01), len: 178 aa. Member of the M. tuberculosis PPE family, similar to many e.g. the upstream Q50703|YY25_MYCTU|Rv3425|MTCY78.04c (176 aa), FASTA scores: opt: 781, E(): 1.9e-44, (69.9% identity in 176 aa overlap); and Q50702|YY26_MYCTU|Rv3426|MTCY78.03c (232 aa), FASTA scores: opt: 555, E(): 1.7e-29, (72.0% identity in 125 aa overlap) (but diverges at 3' end)); etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177973.1" /db_xref="GI:57117107" /db_xref="GeneID:887630" /translation="MHPMIPAEYISNIIYEGPGADSLSAAAEQLRLMYNSANMTAKSL TDRLGELQENWKGSSSDLMADAAGRYLDWLTKHSRQILETAYVIDFLAYVYEETRHKV VPPATIANNREEVHRLIASNVAGVNTPAIAGLDAQYQQYRAQNIAVMNDYQSTARFIL AYLPRWQEPPQIYGGGGG" gene complement(3847642..3848805) /locus_tag="Rv3430c" /db_xref="GeneID:887615" CDS complement(3847642..3848805) /locus_tag="Rv3430c" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1540." /experiment="experimental evidence, no additional details recorded" /note="Rv3430c, (MTCY77.02c), len: 387 aa. Possible IS1540 transposase, similar to several e.g. Q49592 transposase from Mycobacterium intracellulare (340 aa), FASTA scores: opt: 1377, E(): 1.6e-81, (64.2% identity in 338 aa overlap); similarity is lost at C-terminus due to possible frameshift after aa 297." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_217947.1" /db_xref="GI:15610566" /db_xref="GeneID:887615" /translation="MIDTAIEEMIPLIGVRAACAATGRAPASYYRAHSKRLSAQSDTF TSTAVTDPSGPRESAQPRALSAAEREHVLAVLNSQRFADMAPAVVYATLLDEGIYLCS ESTMYRLLRERGQTGDRRRQATHPAAVKPELVAHQPNSVWSWDITKLRGPAKWSYYYL YVILDIFSRYVVGWMVASRESKVLAERLIAQTLAAQHISADQLTLHADRGSSMSSKPV ALLLADLGVTKSHSRPHTSNDNPLSEAQFKTLKYRPDFPKRFESIEAARVHCDRFFGW YNHEHKHSGIGLHTPADVHYGRADQIRRHRATVLDTAYRDHLERIRSQTTRATRATGL QRDQPTTEGGPADSINPRKSCLRNVDRFRPGLLDLPAPAPVDLRRLLPSGQIR" repeat_region complement(3847644..3848806) /note="IS1540, len: 1163 bp. Insertion sequence IS1540." /mobile_element="insertion sequence:IS1540" gene complement(3849294..3850139) /locus_tag="Rv3431c" /db_xref="GeneID:887608" CDS complement(3849294..3850139) /locus_tag="Rv3431c" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1552" /note="Rv3431c, (MTCY77.03c), len: 281 aa. Possible truncated transposase for IS1552, similar to, but shorter than other transposases e.g. P72303 from Rhodococcus opacus (418 aa), FASTA scores: opt: 1509, E(): 1.2e-91, (80.95% identity in 278 aa overlap); Q9AKV5 from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 1115, E(): 7.8e-66, (63.45% identity in 268 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217948.1" /db_xref="GI:15610567" /db_xref="GeneID:887608" /translation="MFAELIRAGLQALIEAEATEAIGAGRYERSDGRIVHRNGHRPKT VSTTAGDIEVQIPKLRAGSFFPSLLERRRRIDKALHAVIMEAYVHGVSTRSVDDLVAA MGVQAGVSKSEVSRICAGLDTEIEAFRTRSLTHTEFPYVFCDATFCKVRVGAHVVSQA LVVATGVSIDGTREVLGTAVGDSESYEFWREFLASLKARGLTGVHLVISDAHAGLKAA VAQQFSGASWQRCRVHFMRNLYTAVAAKHAPAVTVAVKTIFAHTDPEEVGAQWDRVAD PLCQP" repeat_region complement(3849296..3850140) /note="IS1552, len: 845 bp. Insertion sequence IS1552." /mobile_element="insertion sequence:IS1552" gene complement(3850372..3851754) /gene="gadB" /locus_tag="Rv3432c" /db_xref="GeneID:887580" CDS complement(3850372..3851754) /gene="gadB" /locus_tag="Rv3432c" /EC_number="4.1.1.15" /function="CATALYZES THE PRODUCTION OF GABA [CATALYTIC ACTIVITY: L-GLUTAMATE = 4-AMINOBUTANOATE + CO(2)]." /note="Rv3432c, (MTCY77.04c), len: 460 aa. Probable gadB, glutamate decarboxylase (EC 4.1.1.15), similar to many e.g. P73043|GAD|SLL1641 from Synechocystis sp. strain PCC 6803 (467 aa), FASTA scores: opt: 1684, E(): 6.2e-99, (55.35% identity in 457 aa overlap); Q9X8J5|SCE9.23 from Streptomyces coelicolor (475 aa), FASTA scores: opt: 1650, E(): 8.9e-97, (57.4% identity in 446 aa overlap); Q9AQU4|GAD from Oryza sativa (Rice) (501 aa), FASTA scores: opt: 1498, E(): 3.7e-87, (51.6% identity in 432 aa overlap); Q07346|DCE_PETHY from Petunia hybrida (Petunia) (500 aa), FASTA scores: opt: 1485, E(): 2.5e-86, (51.15% identity in 437 aa overlap); etc. BELONGS TO GROUP II DECARBOXYLASES (DDC, GAD, HDC AND TYRDC)." /codon_start=1 /transl_table=11 /product="glutamate decarboxylase GadB" /protein_id="NP_217949.1" /db_xref="GI:15610568" /db_xref="GeneID:887580" /translation="MSRSHPSVPAHSIAPAYTGRMFTAPVPALRMPDESMDPEAAYRF IHDELMLDGSSRLNLATFVTTWMDPEAEKLMAETFDKNMIDKDEYPATAAIEARCVSM VADLFHAEGLRDHDPTSATGVSTIGSSEAVMLGGLALKWRWRQRVGSWKGRMPNLVMG SNVQVVWEKFCRYFDVEPRYLPMERGRYVITPEQVLAAVDENTIGVVAILGTTYTGEL EPIAEICAALDKLAAGGGVDVPVHVDAASGGFVVPFLHPDLVWDFRLPRVVSINVSGH KYGLTYPGVGFVVWRGPEHLPEDLVFRVNYLGGDMPTFTLNFSRPGNQVVGQYYNFLR LGRDGYTKVMQALSHTARWLGDQLREVDHCEVISDGSAIPVVSFRLAGDRGYTEFDVS HELRTFGWQVPAYTMPDNATDVAVLRIVVREGLSADLARALHDDAVTALAALDKVKPG GHFDAQHFAH" gene complement(3851792..3853213) /locus_tag="Rv3433c" /db_xref="GeneID:887571" CDS complement(3851792..3853213) /locus_tag="Rv3433c" /function="UNKNOWN" /note="Rv3433c, (MTCY77.05), len: 473 aa. Hypothetical protein, member of YKL151c/yjeF family, equivalent to P37391|YY33_MYCLE|ML0373|U229G|B229_C2_201 HYPOTHETICAL 47.2 KDA PROTEIN from Mycobacterium leprae (473 aa), FASTA scores: opt: 2650, E(): 5e-136, (84.55% identity in 473 aa overlap). Also similar to other hypothetical bacterial proteins e.g. Q9X3W3 from Zymomonas mobilis (484 aa), FASTA scores: opt: 700, E(): 1.2e-30, (33.7% identity in 484 aa overlap); O86783|SC6G4.20c from Streptomyces coelicolor (485 aa), FASTA scores: opt: 563, E(): 3.2e-23, (48.45% identity in 489 aa overlap); Q9LC81 from Arthrobacter sp. Q36 (313 aa), FASTA scores: opt: 553, E(): 7.9e-23, (44.2% identity in 303 aa overlap); etc. Contains Pfam match to entry PF01256 hypothetical UPFOO31 family signature and PF03853 YjeF-related protein N-terminus. BELONGS TO THE UPF0031 FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217950.1" /db_xref="GI:15610569" /db_xref="GeneID:887571" /translation="MRHYYSVDTIRAAEAPLLASLPDGALMRRAAFGLATEIGRELTA RTGGVVGRRVCAVVGSGDNGGDALWAATFLRRRGAAADAVLLNPDRTHRKALAAFTKS GGRLVESVSAATDLVIDGVVGISGSGPLRPAAAQVFAAVQAAAIPVVAVDIPSGIDVA TGAITGPAVHAALTVTFGGLKPVHALADCGRVVLVDIGLDLAHTDVLGFEATDVAARW PVPGPRDDKYTQGVTGVLAGSSTYPGAAVLCTGAAVAATSGMVRYAGTAHAEVLAHWP EVIASPTPAAAGRVQAWVVGPGLGTDEAGAAALWFALDTDLPVLVDADGLTMLADHPD LVAGRNAPTVLTPHAGEFARLAGAPPGDDRVGACRQLADALGATVLLKGNVTVIADPG GPVYLNPAGQSWAATAGSGDVLSGMIGALLASGLPSGEAAAAAAFVHARASAAAAADP GPGDAPTSASRISGHIRAALAAL" misc_feature complement(3852236..3852262) /locus_tag="Rv3433c" /note="PS01050 Hypothetical YKL151c/yjeF family signature 2" misc_feature complement(3852308..3852337) /locus_tag="Rv3433c" /note="PS01049 Hypothetical YKL151c/yjeF family signature 1" gene complement(3853215..3853928) /locus_tag="Rv3434c" /db_xref="GeneID:887573" CDS complement(3853215..3853928) /locus_tag="Rv3434c" /function="UNKNOWN" /note="Rv3434c, (MTCY77.06c), len: 237 aa. Possible conserved transmembrane protein, showing some similarity with Q9CGH7|YLDB HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (258 aa), FASTA scores: opt: 248, E(): 1.6e-09, (28.8% identity in 198 aa overlap); and P94983|Rv1648|MTCY06H11.13 from Mycobacterium tuberculosis (268 aa), FASTA scores: opt: 205, E(): 1.2e-06, (31.45% identity in 194 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217951.1" /db_xref="GI:15610570" /db_xref="GeneID:887573" /translation="MADASVVARLRSWALAVWHFVSNAPLTYAWLVVLVITTIIQNNL TGSQLHFVLLHRSTNIAELGRDPLEVLFSSLLWIDGRNLEPYLLLFTLFLAPAEHWLG HLRWLTVGLTAHIGATYLSEGLLYLAIQHRDASERMVHARDIGVSYFLVGVMAVLTYH IAKPWRWGYLGVLLVIFGFPLIAMDKAELDFTAVGHFASILIGLLFYPMARERDGRLW NPARIKSLLHRRGTRGRRA" gene complement(3853939..3854793) /locus_tag="Rv3435c" /db_xref="GeneID:887564" CDS complement(3853939..3854793) /locus_tag="Rv3435c" /function="UNKNOWN" /note="Rv3435c, (MTCY77.07c), len: 284 aa. Probable conserved transmembrane protein, showing some similarity with P95061|Rv0713|MTCY210.32 HYPOTHETICAL 33.9 KDA PROTEIN from Mycobacterium tuberculosis (313 aa), FASTA scores: opt: 557, E(): 1.3e-26, (35.8% identity in 282 aa overlap); and O32991|MLCB2492.12 from Mycobacterium leprae (95 aa), FASTA scores: opt: 150, E(): 0.022, (35.3% identity in 85 aa overlap). Equivalent to AAK47881 from Mycobacterium tuberculosis strain CDC1551 (312 aa) but shorter 28 aa." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217952.1" /db_xref="GI:15610571" /db_xref="GeneID:887564" /translation="MGRILRVVVGLVLVIAAYVTVIALYHSTGLGRPHEVAHGRPTAD GTTVTLHVEQLQTIKGVLVANLAVSPGTELLDSQTQGLKDDLTVTVTSVVTPTKRTWS SGSLPGVFPVPLTISGDPANWPFDHYRSGPITVQLYRGAAHAPERVSVTFVDRLPGWN VDISGVGDANVPAPYRVGLHRSPSSVAFGTVIVGVLIALAGVGLFVAVQTARGRRQFQ PPMTTWYAAMLFAVIPLRNALPDAPPIGFWIDVTVVLWVVVALVTSMVLYILCWWWHL KPDVDETM" gene complement(3855015..3856889) /gene="glmS" /locus_tag="Rv3436c" /db_xref="GeneID:887568" CDS complement(3855015..3856889) /gene="glmS" /locus_tag="Rv3436c" /EC_number="2.6.1.16" /function="CATALYZES THE FIRST STEP IN HEXOSAMINE METABOLISM, CONVERTING FRUCTOSE-6P INTO GLUCOSAMINE-6P USING GLUTAMINE AS A NITROGEN SOURCE [CATALYTIC ACTIVITY: L-GLUTAMINE + D-FRUCTOSE 6-PHOSPHATE = L-GLUTAMATE + D-GLUCOSAMINE 6-PHOSPHATE]." /note="Catalyzes the first step in hexosamine metabolism, converting fructose-6P into glucosamine-6P using glutamine as a nitrogen source" /codon_start=1 /transl_table=11 /product="glucosamine--fructose-6-phosphate aminotransferase" /protein_id="NP_217953.1" /db_xref="GI:15610572" /db_xref="GeneID:887568" /translation="MCGIVGYVGRRPAYVVVMDALRRMEYRGYDSSGIALVDGGTLTV RRRAGRLANLEEAVAEMPSTALSGTTGLGHTRWATHGRPTDRNAHPHRDAAGKIAVVH NGIIENFAVLRRELETAGVEFASDTDTEVAAHLVARAYRHGETADDFVGSVLAVLRRL EGHFTLVFANADDPGTLVAARRSTPLVLGIGDNEMFVGSDVAAFIEHTREAVELGQDQ AVVITADGYRISDFDGNDGLQAGRDFRPFHIDWDLAAAEKGGYEYFMLKEIAEQPAAV ADTLLGHFVGGRIVLDEQRLSDQELREIDKVFVVACGTAYHSGLLAKYAIEHWTRLPV EVELASEFRYRDPVLDRSTLVVAISQSGETADTLEAVRHAKEQKAKVLAICNTNGSQI PRECDAVLYTRAGPEIGVASTKTFLAQIAANYLLGLALAQARGTKYPDEVEREYHELE AMPDLVARVIAATGPVAELAHRFAQSSTVLFLGRHVGYPVALEGALKLKELAYMHAEG FAAGELKHGPIALIEDGLPVIVVMPSPKGSATLHAKLLSNIREIQTRGAVTIVIAEEG DETVRPYADHLIEIPAVSTLLQPLLSTIPLQVFAASVARARGYDVDKPRNLAKSVTVE" gene 3856911..3857387 /locus_tag="Rv3437" /db_xref="GeneID:887567" CDS 3856911..3857387 /locus_tag="Rv3437" /function="UNKNOWN" /note="Rv3437, (MTCY77.09), len: 158 aa. Questionable ORF. Possible conserved transmenbrane protein, C-terminus similar to N-terminal part of O06345|Rv3482c|MTCY13E12.35c HYPOTHETICAL 28.5 KDA PROTEIN from Mycobacterium tuberculosis (260 aa), FASTA scores: opt: 140, E(): 0.1, (58.8% identity in 34 aa overlap); and Q9XAN5|SC4C6.05c PUTATIVE MEMBRANE PROTEIN from Streptomyces (347 aa), coelicolor FASTA scores: opt: 112, E(): 6.8, (50.0% identity in 32 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217954.1" /db_xref="GI:15610573" /db_xref="GeneID:887567" /translation="MVGRAVPSPNRRYRRVWPPRTKGQHLSNPYAQHQLKLIRHTGAL ILWQQRTYVVSGTREQCEAAYKSAQTYNLLVGWWSLVSLLAMNWIALISNFNAIRRVR AAADGASVPHGPHAIAHPAVPRGPIPAGWYPDPSGAGLRYWDGATWTHWTHPPRHR" gene 3857397..3858239 /locus_tag="Rv3438" /db_xref="GeneID:887561" CDS 3857397..3858239 /locus_tag="Rv3438" /function="UNKNOWN" /note="Rv3438, (MTCY77.10), len: 280 aa. Conserved hypothetical protein, equivalent to Q9CCV6|ML0370 HYPOTHETICAL PROTEIN from Mycobacterium leprae (289 aa), FASTA scores: opt: 1491, E(): 9.2e-81, (79.85% identity in 283 aa overlap); and highly similar (but shorter 41 aa) to Q49872|B229_F1_20 HYPOTHETICAL 34.0 KDA PROTEIN from Mycobacterium leprae (324 aa), FASTA scores: opt: 1491, E(): 1e-80, (79.85% identity in 283 aa overlap). Shows some similarity to Q9KIU3|LIPA LIPASE from plasmid pAH114 uncultured bacterium (281 aa), FASTA scores: opt: 168, E(): 0.0081, (29.3% identity in 140 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217955.1" /db_xref="GI:15610574" /db_xref="GeneID:887561" /translation="MPRIRKLVAALHRRGPHRVLRGDLAFAGLPGVVYTPEAGLHLPG VAFGHDWLTGTSRYSGLLEHLASWGIVAAAPDSERGLAPSVLNLAFDLGVALDIVAGV RLGPGKISVHPAKLGLVGHGFGGSAAVFAAAGLTGTHVKSVAAIFPTVTNPAAEQPAA TLDVPGLILTAPGDPKTLTSNALGLSRAWDKATLRIVSKARAGGLVEGRRLTKVLGLP GPHRRTQRSVRALLTGYLLYTLGGDKTYRRFADPDLQLPKTDPIDPEAPPITPGEKIV TLLK" gene complement(3858259..3859662) /locus_tag="Rv3439c" /db_xref="GeneID:887565" CDS complement(3858259..3859662) /locus_tag="Rv3439c" /function="UNKNOWN" /note="Rv3439c, (MTCY77.11c), len: 467 aa. Conserved hypothetical ala-, pro-rich protein, similar in part to N-terminal part of Q49853|B229_C1_154 HYPOTHETICAL 11.2 KDA PROTEIN from Mycobacterium leprae (103 aa), FASTA scores: opt: 265, E(): 0.0013, (51.1% identity in 90 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217956.1" /db_xref="GI:15610575" /db_xref="GeneID:887565" /translation="MADRLNVAERLAEGRPAAEHTQSYVRACHLVGYQHPDLTAYPAQ IHDWYGSEDGLDLHALDADCAQLRAAASVLMEALRMERSQVAVLAAAWTGSGADAAVH FVQRHCETGNSVVTEVRAAAQRCESLRDNLWQLVDSKVATAIAIDERALAQRPAWLAA AEALTTEGADRPTAVEVVRQQIQPYVDDDVRNDWLTTMRSTTAGVAASYDAVTDQLAS APRAHFEIPDDLGPGRQPSPASVPAQPSATAAITPAAALPPPDPVPAVTSRPVTPSDF GSAPGDGSATPAGVGSAGGFGDAGGTGGLGGFAGLAGLANRIVDAVDSLLGSVAEQLG DPLAADNPPGAVDPFAEDAADNADDGDDAHPEEADEAAEPKEATEPDEADEVDDADES VPAERAQDVAEEATLPPVAEPPPPAAPPVAEPPPPVAAPAPPGAPEPANGPSPEALSE GATPCEIAADELPQAGP" gene complement(3859665..3859976) /locus_tag="Rv3440c" /db_xref="GeneID:887592" CDS complement(3859665..3859976) /locus_tag="Rv3440c" /function="UNKNOWN" /note="Rv3440c, (MTCY77.12c), len: 103 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217957.1" /db_xref="GI:15610576" /db_xref="GeneID:887592" /translation="MRPDSVNSAGIDIAAVYAVADRFSAAAELIDDAIGNHLTRLAFG GACAGRGHASRGDALRCRLDRLAGELSVWSRAAVQIAFALRAGANRYAEADLCAAARI G" gene complement(3860024..3861370) /gene="glmM" /locus_tag="Rv3441c" /db_xref="GeneID:887589" CDS complement(3860024..3861370) /gene="glmM" /locus_tag="Rv3441c" /EC_number="5.4.2.-" /function="UNKNOWN; INVOLVED IN CELLULAR METABOLISM." /note="catalyzes the conversion of glucosamine-6-phosphate to glucosamine-1-phosphate" /codon_start=1 /transl_table=11 /product="phosphoglucosamine mutase" /protein_id="NP_217958.1" /db_xref="GI:15610577" /db_xref="GeneID:887589" /translation="MGRLFGTDGVRGVANRELTAELALALGAAAARRLSRSGAPGRRV AVLGRDPRASGEMLEAAVIAGLTSEGVDALRVGVLPTPAVAYLTGAYDADFGVMISAS HNPMPDNGIKIFGPGGHKLDDDTEDQIEDLVLGVSRGPGLRPAGAGIGRVIDAEDATE RYLRHVAKAATARLDDLAVVVDCAHGAASSAAPRAYRAAGARVIAINAEPNGRNINDG CGSTHLDPLRAAVLAHRADLGLAHDGDADRCLAVDANGDLVDGDAIMVVLALAMKEAG ELACNTLVATVMSNLGLHLAMRSAGVTVRTTAVGDRYVLEELRAGDYSLGGEQSGHIV MPALGSTGDGIVTGLRLMTRMVQTGSSLSDLASAMRTLPQVLINVEVVDKATAAAAPS VRTAVEQAAAELGDTGRILLRPSGTEPMIRVMVEAADEGVAQRLAATVADAVSTAR" misc_feature complement(3861041..3861085) /gene="glmM" /locus_tag="Rv3441c" /note="PS00710 Phosphoglucomutase and phosphomannomutase phosphoserine signature" gene complement(3861495..3861950) /gene="rpsI" /locus_tag="Rv3442c" /db_xref="GeneID:887488" CDS complement(3861495..3861950) /gene="rpsI" /locus_tag="Rv3442c" /function="INVOLVED IN TRANSLATION MECHANISM. THIS PROTEIN IS ONE OF THE ASSEMBLY PROTEINS OF THE 50S RIBOSOMAL SUBUNIT." /note="forms a direct contact with the tRNA during translation" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S9" /protein_id="NP_217959.1" /db_xref="GI:15610578" /db_xref="GeneID:887488" /translation="MTETTPAPQTPAAPAGPAQSFVLERPIQTVGRRKEAVVRVRLVP GTGKFDLNGRSLEDYFPNKVHQQLIKAPLVTVDRVESFDIFAHLGGGGPSGQAGALRL GIARALILVSPEDRPALKKAGFLTRDPRATERKKYGLKKARKAPQYSKR" misc_feature complement(3861627..3861683) /gene="rpsI" /locus_tag="Rv3442c" /note="PS00360 Ribosomal protein S9 signature" gene complement(3861947..3862390) /gene="rplM" /locus_tag="Rv3443c" /db_xref="GeneID:887584" CDS complement(3861947..3862390) /gene="rplM" /locus_tag="Rv3443c" /function="INVOLVED IN TRANSLATION MECHANISM. THIS PROTEIN IS ONE OF THE EARLY ASSEMBLY PROTEINS OF THE 50S RIBOSOMAL SUBUNIT." /experiment="experimental evidence, no additional details recorded" /note="in Escherichia coli this protein is one of the earliest assembly proteins in the large subunit" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L13" /protein_id="NP_217960.1" /db_xref="GI:15610579" /db_xref="GeneID:887584" /translation="MPTYAPKAGDTTRSWYVIDATDVVLGRLAVAAANLLRGKHKPTF APNVDGGDFVIVINADKVAISGDKLQHKMVYRHSGYPGGLHKRTIGELMQRHPDRVVE KAILGMLPKNRLSRQIQRKLRVYAGPEHPHSAQQPVPYELKQVAQ" gene complement(3862624..3862926) /gene="esxT" /locus_tag="Rv3444c" /db_xref="GeneID:887587" CDS complement(3862624..3862926) /gene="esxT" /locus_tag="Rv3444c" /function="UNKNOWN" /note="Rv3444c, (MTCY77.16c), len: 100 aa. esxT, ESAT-6 like protein (see citation below), equivalent to Q9CCV7|ML0363 POSSIBLE SECRETED PROTEIN from Mycobacterium leprae (104 aa), FASTA scores: opt: 362, E(): 1.1e-18, (71.25% identity in 73 aa overlap). C-terminal part highly similar to Q49852|B229_C1_150 HYPOTHETICAL 5.3 KDA PROTEIN from Mycobacterium leprae (49 aa), FASTA scores: opt: 227, E(): 1.4e-09, (68.9% identity in 45 aa overlap). SEEMS TO BELONG TO THE ESAT6 FAMILY." /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXT" /protein_id="NP_217961.1" /db_xref="GI:15610580" /db_xref="GeneID:887587" /translation="MNADPVLSYNFDAIEYSVRQEIHTTAARFNAALQELRSQIAPLQ QLWTREAAAAYHAEQLKWHQAASALNEILIDLGNAVRHGADDVAHADRRAAGAWAR" gene complement(3862947..3863324) /gene="esxU" /locus_tag="Rv3445c" /db_xref="GeneID:887585" CDS complement(3862947..3863324) /gene="esxU" /locus_tag="Rv3445c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3445c, (MTCY77.17c), len: 125 aa. esxU, ESAT-6 like protein (see citations below), showing weak similarity to O30373|VCD|PA2257 PYOVERDINE BIOSYNTHESIS PROTEIN from Pseudomonas aeruginosa (215 aa), FASTA scores: opt: 103, E(): 5.6, (32.35% identity in 133 aa overlap). SEEMS TO BELONG TO THE ESAT6 FAMILY." /codon_start=1 /transl_table=11 /product="ESAT-6 like protein ESXU" /protein_id="NP_217962.1" /db_xref="GI:15610581" /db_xref="GeneID:887585" /translation="MVEPGRIGGNQTRLAAVLLDVSTPNTLNADFDLMRSVAGITDAR NEEIRAMLQAFIGRMSGVPPSVWGGLAAARFQDVVDRWNAESTRLYHVLHAIADTIRH NEAALREAGQIHARHIAAAGGDL" gene complement(3863317..3864531) /locus_tag="Rv3446c" /db_xref="GeneID:887586" CDS complement(3863317..3864531) /locus_tag="Rv3446c" /function="UNKNOWN" /note="Rv3446c, (MTCY77.18c), len: 404 aa. Hypothetical unknown ala-, val-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217963.1" /db_xref="GI:15610582" /db_xref="GeneID:887586" /translation="MSPHRAVIEAGPGAIRRLCCGADVVADTAVSAAALAAIDDQVAL LDERPVAVDSLWFDALRSVAVDHRDGPVVVHPSWWSAARVEVVTAAARTLTRDVVVHP RSWLLRQASSGVSAATVVVEIAERLVLVAGAEVAAVARRTDAESVAGQVGSVIARMTR GITAVVLIDVPSTVAGAAALAAAIAGAVRGTGSSVVEIDGVRLARLARAALPPSDEPA DPAARPATRSRVPTLARVAAAGVALALLAPAAVVRHGATTLQRPPTTLLVEGRVALTI PADWSTQRVVSGPGSARVQVTSPADPEVALHVTQSPVPGETLPGTAQRLKRAIDASPA GVFVDFNPSDIRAGRPAVTYREVRAGHQVRWTILLDGAVRISVGCQSGPGHEDLLREV CAQAVRSVHAVG" gene complement(3864528..3868238) /locus_tag="Rv3447c" /db_xref="GeneID:887581" CDS complement(3864528..3868238) /locus_tag="Rv3447c" /function="UNKNOWN, BUT COULD HYDDROLYSE ATP/GTP." /note="Rv3447c, (MTCY77.19c), len: 1236 aa. Probable conserved membrane protein, similar to various bacterial proteins e.g. O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 1186, E(): 1.9e-60, (42.9% identity in 1312 aa overlap); Q9L0T6|SCD35.15c from Streptomyces coelicolor (1525 aa), FASTA scores: opt: 932, E(): 9.2e-46, (27.2% identity in 1374 aa overlap); Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 910, E(): 1.5e-44, (34.4% identity in 1319 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 805, E(): 1.9e-38, (25.85% identity in 1292 aa overlap); etc. The C-terminal region is similar to Q9CDD7|ML0052 (alias O33086|MLCB628.15c) HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 850, E(): 2.3e-41, (35.2% identity in 588 aa overlap); and O6973|Rv3871|MTV027.06 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (591 aa), FASTA scores: opt: 845, E(): 4.3e-41, (35.3% identity in 586 aa overlap). N-terminal part shows similarity with HYPOTHETICAL PROTEINS from Mycobacterium tuberculosis e.g. O69735|Rv3870|MTV027.05 (747 aa), FASTA scores: opt: 761, E(): 3.6e-36, (38.2% identity in 746 aa overlap). Equivalent to AAK47893 from Mycobacterium tuberculosis strain CDC1551 (1200 aa) but longer 36 aa. Contains three of PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217964.1" /db_xref="GI:15610583" /db_xref="GeneID:887581" /translation="MNSGPACATADILVAPPPELRRSEPSSLLIRLLPVVMSVATVGV MVTVFLPGSPATRHPTFLAFPMMMLVSLVVTAVTGRGRRHVSGIHNDRVDYLGYLSVL RTSVTQTAAAQHVSLNWTHPDPATLWTLIGGPRMWERRPGAADFCRIRVGVGSAPLAT RLVVGQLPPAQRADPVTRAALRCFLAAHATIADAPIAIPLRVGGPIAIDGDPTKVRGL LRAMICQLAVWHSPEELLIAGVVSDRNRAHWDWLKWLPHNQHPNACDALGPAPMVYST LAEMQNALAATVLAHVVAIVDTAERGNGAITGVITIEVGARRDGAPPVVRCAGEVTAL ACPDQLEPQDALVCARRLAAHRVGHSGRTFIRGSGWAELVGIGDVAAFDPSTLWRNVN QHDRLRVPIGVTPDGTAVQLDIKEAAEQGMGPHGLCVGATGSGKSELLRTIALGMMAR NSPEVLNLLLVDFKGGATFLDLAGAPHVAAVITNLAEEAPLVARMQDALAGEMSRRQQ LLRMAGHLVSVTAYQRARQTGAQLPCLPILFIVVDEFSELLSQHPEFVDVFLAIGRVG RSLGMHLLLASQRLDEGRLRGLETHLSYRMCLKTWSASESRNVLGTQDAYQLPNTPGA GLLQTGTGELIRFQTAFVSGPLRRASPSAVHPVAPPSVRPFTTHAAAPVTAGPVGGTA EVPTPTVLHAVLDRLVGHGPAAHQVWLPPLDEPPMLGALLRDAEPAQAELAVPIGIVD RPFEQSRVPLTIDLSGAAGNVAVVGAPQTGKSTALRTLIMALAATHDAGRVQFYCLDF GGGALAQVDELPHVGAVAGRAQPQLASRMLAELESAVRFREAFFRDHGIDSVARYRQL RAKSAAESFADIFLVIDGWASLRQEFAALEESIVALAAQGLSFGVHVALSAARWAEIR PSLRDQIGSRIELRLADPADSELDRRQAQRVPVDRPGRGLSRDGMHMVIALPDLDGVA LRRRSGDPVAPPIPLLPARVDYDSVVARAGDELGAHILLGLEERRGQPVAVDFGRHPH LLVLGDNECGKTAALRTLCREIVRTHTAARAQLLIVDFRHTLLDVIESEHMSGYVSSP AALGAKLSSLVDLLQARMPAPDVSQAQLRARSWWSGPDIYVVVDDYDLVAVSSGNPLM VLLEYLPHARDLGLHLVVARRSGGAARALFEPVLASLRDLGCRALLMSGRPDEGALFG SSRPMPLPPGRGILVTGAGDEQLVQVAWSPPP" misc_feature complement(3865113..3865136) /locus_tag="Rv3447c" /note="PS00017 ATP/GTP-binding site motif A" misc_feature complement(3865923..3865946) /locus_tag="Rv3447c" /note="PS00017 ATP/GTP-binding site motif A" misc_feature complement(3866928..3866951) /locus_tag="Rv3447c" /note="PS00017 ATP/GTP-binding site motif A" gene 3868352..3869755 /locus_tag="Rv3448" /db_xref="GeneID:887624" CDS 3868352..3869755 /locus_tag="Rv3448" /function="UNKNOWN. POSSIBLY INVOLVED IN TRANSPORT ACROSS THE MEMBRANE." /note="Rv3448, (MTCY77.20), len: 467 aa. Probable conserved integral membrane protein, showing some similarity with Q9CD35|ML2529 from Mycobacterium leprae (485 aa), FASTA scores: opt: 371, E(): 3.6e-14, (27.25% identity in 481 aa overlap); and two proteins from Mycobacterium tuberculosis O86362|Rv0290|MTV035.18 (472 aa), FASTA scores: opt: 429, E(): 1.6e-17, (28.6% identity in 479 aa overlap); and O05457|Rv3887c|MTCY15F10.25 (509 aa), FASTA scores: opt: 203, E(): 0.00019, (25.6% identity in 492 aa overlap). Contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217965.1" /db_xref="GI:15610584" /db_xref="GeneID:887624" /translation="MPTSDPGLRRVTVHAGAQAVDLTLPAAVPVATLIPSIVDILGDR GASPATAARYQLSALGAPALPNATTLAQCGIRDGAVLVLHKSSAQPPTPRCDDVAEAV AAALDTTARPQCQRTTRLSGALAASCITAGGGLMLVRNALGTNVTRYSDATAGVVAAA GLAALLFAVIACRTYRDPIAGLTLSVIATIFGAVAGLLAVPGVPGVHSVLVAAMAAAA TSVLAMRITGCGGITLTAVACCAVVVAAATLVGAITAAPVPAIGSLATLASFGLLEVS ARMAVLLAGLSPRLPPALNPDDADALPTTDRLTTRANRADAWLTSLLAAFAASATIGA IGTAVATHGIHRSSMGGIALAAVTGALLLLRARSADTRRSLVFAICGITTVATAFTVA ADRALEHGPWIAALTAMLAAVAMFLGFVAPALSLSPVTYRTIELLECLALIAMVPLTA WLCGAYSAVRHLDLTWT" misc_feature 3868352..3868438 /locus_tag="Rv3448" /note="PS00402 Binding-protein-dependent transport systems inner membrane comp signature" gene 3869752..3871119 /gene="mycP4" /locus_tag="Rv3449" /db_xref="GeneID:887602" CDS 3869752..3871119 /gene="mycP4" /locus_tag="Rv3449" /EC_number="3.4.21.-" /function="THOUGHT TO HAVE PROTEOLYTIC ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="Rv3449, (MTCY13E12.02), len: 455 aa. Probable mycP4, membrane-anchored serine protease (mycosin) (EC 3.4.21.-) (see citation below), similar to hypothetical unknowns or proteases from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK48366|MT3998 SUBTILASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (411 aa), FASTA scores: opt: 747, E(): 3.5e-33, (45.65% identity in 416 aa overlap); O05461|Rv3883c|MTCY15F10.29 MEMBRANE-ANCHORED MYCOSIN MYCP1 (446 aa), FASTA scores: opt: 747, E(): 3.8e-33, (45.45% identity in 451 aa overlap); O53695|Rv0291|MTV035.19 PROBABLE MEMBRANE-ANCHORED MYCOSIN MYCP2 (461 aa), FASTA scores: opt: 660, E(): 1.9e-28, (44.0% identity in 457 aa overlap); etc. And similar to hypothetical proteases from Mycobacterium leprae e.g. O33076|MLCB628.04|ML0041 HYPOTHETICAL 45.7 KDA PROTEIN (PROBABLE SECRETED PROTEASE) (446 aa), FASTA scores: opt: 683, E(): 1.1e-29, (43.8% identity in 450 aa overlap); Q9CD36|ML2528 PUTATIVE PROTEASE (475 aa), FASTA scores: opt: 608, E(): 1.3e-25, (43.0% identity in 451 aa overlap); Q9CBV3|ML1538 POSSIBLE PROTEASE (567 aa), FASTA scores: opt: 389, E(): 9.7e-14, (33.8% identity in 562 aa overlap); etc. Also some similarity to other proteases from several organisms e.g. O31788|APRX ALKALINE SERINE PROTEASE from Bacillus subtilis (442 aa), FASTA scores: opt: 296, E(): 8.3e-09, (29.4% identity in 313 aa overlap); O86650|SC3C3.17c PUTATIVE SECRETED SERINE PROTEASE from Streptomyces coelicolor (450 aa), FASTA scores: opt: 279, E(): 7e-08, (33.55% identity in 343 aa overlap); Q9KBJ7|APRX|BH193 INTRACELLULAR ALKALINE SERINE PROTEASE from Bacillus halodurans (444 aa), FASTA scores: opt: 257, E(): 1.1e-06, (28.65% identity in 335 aa overlap); O86642|SC3C3.08 SERINE PROTEASE from Streptomyces coelicolor (413 aa), FASTA scores: opt: 243, E(): 5.7e-06, (38.25% identity in 387 aa overlap); etc. Has putative signal peptide at N-terminus and hydrophobic stretch at C-terminus. Contains three signatures typical of subtilase family: aspartic acid active site (PS00136), histidine active site (PS00137), and serine active site (PS00138). BELONGS TO PEPTIDASE FAMILY S8 (ALSO KNOWN AS THE SUBTILASE FAMILY), PYROLYSIN SUBFAMILY." /codon_start=1 /transl_table=11 /product="membrane-anchored mycosin" /protein_id="NP_217966.1" /db_xref="GI:15610585" /db_xref="GeneID:887602" /translation="MTTSRTLRLLVVSALATLSGLGTPVAHAVSPPPIDERWLPESAL PAPPRPTVQREVCTEVTAESGRAFGRAERSAQLADLDQVWRLTRGAGQRVAVIDTGVA RHRRLPKVVAGGDYVFTGDGTADCDAHGTLVAGIIAAAPDAQSDNFSGVAPDVTLISI RQSSSKFAPVGDPSSTGVGDVDTMAKAVRTAADLGASVINISSIACVPAAAAPDDRAL GAALAYAVDVKNAVIVAAAGNTGGAAQCPPQAPGVTRDSVTVAVSPAWYDDYVLTVGS VNAQGEPSAFTLAGPWVDVAATGEAVTSLSPFGDGTVNRLGGQHGSIPISGTSYAAPV VSGLAALIRARFPTLTARQVMQRIESTAHHPPAGWDPLVGNGTVDALAAVSSDSIPQA GTATSDPAPVAVPVPRRSTPGPSDRRALHTAFAGAAICLLALMATLATASRRLRPGRN GIAGD" misc_feature 3870031..3870063 /gene="mycP4" /locus_tag="Rv3449" /note="PS00136 Serine proteases, subtilase family, aspartic acid active site" misc_feature 3870136..3870168 /gene="mycP4" /locus_tag="Rv3449" /note="PS00137 Serine proteases, subtilase family, histidine active site" misc_feature 3870730..3870762 /gene="mycP4" /locus_tag="Rv3449" /note="PS00138 Serine proteases, subtilase family, serine active site" gene complement(3871084..3872496) /locus_tag="Rv3450c" /db_xref="GeneID:887619" CDS complement(3871084..3872496) /locus_tag="Rv3450c" /function="UNKNOWN" /note="Rv3450c, (MTCY13E12.03c), len: 470 aa. Probable conserved membrane protein (possible membrane spanning region near N-terminus). Similar to hypothetical unknowns proteins from Mycobacterium leprae e.g. O33088|MLCB628.17C|ML0054 HYPOTHETICAL 51.9 KDA PROTEIN (PUTATIVE MEMBRANE PROTEIN)(481 aa), FASTA scores: opt: 708, E(): 6.4e-32, (32.9% identity in 480 aa overlap); Q9CD29|ML2536 (552 aa), FASTA scores: opt: 394, E(): 1.7e-14, (33.6% identity in 503 aa overlap); etc. Also similar to other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O69734|Rv3869|MTV027.04 (480 aa), FASTA scores: opt: 717, E(): 2e-32, (32.55% identity in 479 aa overlap); O05449|Rv3895c|MTCY15F10.17 (495 aa), FASTA scores: opt: 670, E(): 8.3e-30, (36.4% identity in 475 aa overlap); O5368|Rv0283|MTV035.11 (538 aa), FASTA scores: opt: 467, E(): 1.5e-18, (36.3% identity in 493 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217967.1" /db_xref="GI:15610586" /db_xref="GeneID:887619" /translation="MPSPATTWLHVSGYRFLLRRIECALLFGDVCAATGALRARTTSL ALGCVLAIVAAMGCAFVALLRPQSALGQAPIVMGRESGALYVRVDDVWHPVLNLASAR LIAATNANPQPVSESELGHTKRGPLLGIPGAPQLLDQPLAGAESAWAICDSDNGGSTT VVVGPAEDSSAQVLTAEQMILVATESGSPTYLLYGGRRAVVDLADPAVVWALRLQGRV PHVVAQSLLNAVPEAPRITAPRIRGGGRASVGLPGFLVGGVVRITRASGDEYYVVLED GVQRIGQVAADLLRFGDSQGSVNVPTVAPDVIRVAPIVNTLPVSAFPDRPPTPVDGSP GRAVTTLCVTWTPAQPGAARVAFLAGSGPPVPLGGVPVTLAQADGRGPALDAVYLPPG RSAYVAARSLSGGGTGTRYLVTDTGVRFAIHDDDVAHDLGLPTAAIPAPWPVLATLPS GPELSRANASVARDTVAPGP" misc_feature complement(3872323..3872355) /locus_tag="Rv3450c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene 3872617..3873405 /gene="cut3" /locus_tag="Rv3451" /db_xref="GeneID:887611" CDS 3872617..3873405 /gene="cut3" /locus_tag="Rv3451" /EC_number="3.1.1.-" /function="HYDROLYSIS OF CUTIN (A POLYESTER THAT FORMS THE STRUCTURE OF PLANT CUTICLE)." /experiment="experimental evidence, no additional details recorded" /note="Rv3451, (MTCY13E12.04), len: 262 aa. Probable cut3, cutinase precursor (EC 3.1.1.-), similar to others e.g. Q9KK87 from Mycobacterium avium (220 aa), FASTA scores: opt: 540, E(): 3.5e-24, (43.4% identity in 219 aa overlap); Q00298|CUTI_BOTCI|CUTA from Botrytis cinerea (Botryotinia fuckeliana) (202 aa), FASTA scores: opt: 214, E(): 2e-05, (31.45% identity in 210 aa overlap); Q9Y7G8 from Pyrenopeziza brassicae (203 aa), FASTA scores: opt: 203, E(): 8.5e-05, (31.05% identity in 190 aa overlap); P29292|CUTI_ASCRA from Ascochyta rabiei (223 aa), FASTA scores: opt: 155, E(): 0.054, (31.65% identity in 120 aa overlap). Similar to other proteins from Mycobacterium tuberculosis e.g. the downstream ORF O06319|Rv3452|MTCY13E12.05 HYPOTHETICAL 23.1 KDA PROTEIN (226 aa), FASTA scores: opt: 775, E(): 1e-37, (58.65% identity in 220 aa overlap); Q50664|CUT2_MYCTU|Rv2301|MT2358|MTCY339.08c PROBABLE CUTINASE PRECURSOR (219 aa), FASTA scores: opt: 565, E(): 1.3e-25, (44.85% identity in 223 aa overlap); Q10837|CUT1_MYCTU|Rv1984c|MT2037|MTCY39.35 PROBABLE CUTINASE PRECURSOR (217 aa), FASTA scores: opt: 489, E(): 3e-21, (47.05% identity in 221 aa overlap); etc. Equivalent to AAK47897 from Mycobacterium tuberculosis strain CDC1551 (247 aa) but longer 15 aa. Contains cutinase, serine active site motif (PS00155). BELONGS TO THE CUTINASE FAMILY. Alternative start possible at 3733. Start changed since first submission (+15 aa)." /codon_start=1 /transl_table=11 /product="cutinase precursor CUT3" /protein_id="NP_217968.2" /db_xref="GI:57117108" /db_xref="GeneID:887611" /translation="MNNRPIRLLTSGRAGLGAGALITAVVLLIALGAVWTPVAFADGC PDAEVTFARGTGEPPGIGRVGQAFVDSLRQQTGMEIGVYPVNYAASRLQLHGGDGAND AISHIKSMASSCPNTKLVLGGYSQGATVIDIVAGVPLGSISFGSPLPAAYADNVAAVA VFGNPSNRAGGSLSSLSPLFGSKAIDLCNPTDPICHVGPGNEFSGHIDGYIPTYTTQA ASFVVQRLRAGSVPHLPGSVPQLPGSVLQMPGTAAPAPESLHGR" misc_feature 3872980..3872997 /gene="cut3" /locus_tag="Rv3451" /note="PS00155 Cutinase, serine active site, GGYSQG" gene 3873452..3874132 /gene="cut4" /locus_tag="Rv3452" /db_xref="GeneID:887610" CDS 3873452..3874132 /gene="cut4" /locus_tag="Rv3452" /EC_number="3.1.1.-" /function="HYDROLYSIS OF CUTIN (A POLYESTER THAT FORMS THE STRUCTURE OF PLANT CUTICLE)." /note="Rv3452, (MTCY13E12.05), len: 226 aa. Probable cut4, cutinase precursor (EC 3.1.1.-), similar to other e.g. Q9KK87 from Mycobacterium avium (220 aa), FASTA scores: opt: 522, E(): 7.3e-24, (46.6% identity in 221 aa overlap); P30272|CUTI_MAGGR|CUT1 from Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) (228 aa), FASTA scores: opt: 205, E(): 3.8e-05, (29.25% identity in 164 aa overlap); Q00298|CUTI_BOTCI|CUTA from Botrytis cinerea (Botryotinia fuckeliana) (202 aa), FASTA scores: opt: 204, E(): 3.9e-05, (33.5% identity in 209 aa overlap); etc. Similar to other proteins from Mycobacterium tuberculosis e.g. upstream ORF O06318|CUT3_MYCTU|Rv3451|MT3557|MTCY13E12.04 PROBABLE CUTINASE PRECURSOR (247 aa), FASTA scores: opt: 773, E(): 1.3e-38, (59.35% identity in 209 aa overlap); Q50664|CUT2_MYCTU|Rv2301|MT2358|MTCY339.08c PROBABLE CUTINASE PRECURSOR (219 aa), FASTA scores: opt: 704, E(): 1.3e-34, (53.4% identity in 219 aa overlap); etc. Contains PS00155 Cutinase, serine active site. BELONGS TO THE CUTINASE FAMILY. Alternative start possible at 4553 in cSCY13E12 but no RBS." /codon_start=1 /transl_table=11 /product="cutinase precursor CUT4" /protein_id="NP_217969.1" /db_xref="GI:15610588" /db_xref="GeneID:887610" /translation="MIPRPQPHSGRWRAGAARRLTSLVAAAFAAATLLLTPALAPPAS AGCPDAEVVFARGTGEPPGLGRVGQAFVSSLRQQTNKSIGTYGVNYPANGDFLAAADG ANDASDHIQQMASACRATRLVLGGYSQGAAVIDIVTAAPLPGLGFTQPLPPAADDHIA AIALFGNPSGRAGGLMSALTPQFGSKTINLCNNGDPICSDGNRWRAHLGYVPGMTNQA ARFVASRI" misc_feature 3873824..3873841 /gene="cut4" /locus_tag="Rv3452" /note="PS00155 Cutinase, serine active site, GGYSQG" gene 3874404..3874736 /locus_tag="Rv3453" /db_xref="GeneID:887596" CDS 3874404..3874736 /locus_tag="Rv3453" /function="UNKNOWN" /note="Rv3453, (MTCY13E12.06), len: 110 aa. Possible conserved transmembrane protein, showing weak similarity with other proteins e.g. Q9F6C3 PUTATIVE ABC TRANSPORTER from Propionibacterium thoenii (424 aa), FASTA scores: opt: 104, E(): 6.8, (40.6% identity in 69 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_217970.1" /db_xref="GI:15610589" /db_xref="GeneID:887596" /translation="MPGVITNSESPTAADHDRITATRETLEDYTLRLAPRSYRRWPPA VVGISALGGIAYLADFAIGANVGITWGTANALCGIAIFALVVFVTGLPLAYYAARYNI DLDLIYPR" gene 3874822..3876090 /locus_tag="Rv3454" /db_xref="GeneID:887613" CDS 3874822..3876090 /locus_tag="Rv3454" /function="UNKNOWN" /note="Rv3454, (MTCY13E12.07), len: 422 aa. Probable conserved integral membrane protein, showing some similarity to various proteins (generally transporters) e.g. Q9I5C8|PA0811 PROBABLE MFS TRANSPORTER from Pseudomonas aeruginosa (415 aa), FASTA scores: opt: 145, E(): 0.13, (28.2% identity in 188 aa overlap); Q01266|YHYC_PSESN HYPOTHETICAL PROTEIN IN HYUC 3'REGION (ORF 5) (FRAGMENT) from Pseudomonas sp. strain NS671 (245 aa), FASTA scores: opt: 130, E(): 0.75, (24.65% identity in 134 aa overlap); Q9I242|PA2073 PROBABLE TRANSPORTER (MEMBRANE SUBUNIT) from Pseudomonas aeruginosa (476 aa), FASTA scores: opt: 125, E(): 2.5, (24.6% identity in 252 aa overlap); etc. Equivalent to AAK47900 from Mycobacterium tuberculosis strain CDC1551 (562 aa) but shorter 140 aa. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217971.1" /db_xref="GI:15610590" /db_xref="GeneID:887613" /translation="MAQGLKLGLHIPLWAGYACSTLIIFPLVVYGMKVLSQLQLWTTP LWLILMAAPFGYLVVSHPDSIGQFFSYAGKDGHGGLSFGSVLLAAGVCLSLIAQIAEQ IDYLRFMPPRTPENANRWWTWTLLAGPGWVAFGATKQIIGLFLAVYLMANIPGSSTIA NQPVHQFMQIYRTFVPGWLALTLAVILVVLSQIKINVTNAYSGSLAWTNSFTRLTKHY PGRVVFLGVNLAIALILMEANMFDFLNTILGCYANCGMAWVVAVASDIGFNKYLLGLS PKTPEFRRGMLYAINPVGFGSLLLAAGLSIVTFFGGLGAALQPYSPLVAIVTALVMPP ILAAATKGKYYLRRTHDGIDLPMYDEHGNPSAAVLTCHVCHQDFERPDMLACQTHGAH VCSLCLSTDKQAEHVLPGLARAHIPGDQVP" misc_feature 3874846..3874878 /locus_tag="Rv3454" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3876052..3876945) /gene="truA" /locus_tag="Rv3455c" /gene_synonym="hisT" /db_xref="GeneID:887590" CDS complement(3876052..3876945) /gene="truA" /locus_tag="Rv3455c" /gene_synonym="hisT" /EC_number="5.4.99.12" /function="FORMATION OF PSEUDOURIDINE AT POSITIONS 38, 39 AND 40 IN THE ANTICODON STEM AND LOOP OF TRANSFER RNAS [CATALYTIC ACTIVITY: URACIL + D-RIBOSE 5-PHOSPHATE = PSEUDOURIDINE 5'-PHOSPHATE + H(2)O]." /note="mediates pseudouridylation (positions 38, 39, 40) at the tRNA anticodon region which contributes to the structural stability" /codon_start=1 /transl_table=11 /product="tRNA pseudouridine synthase A" /protein_id="NP_217972.2" /db_xref="GI:161352460" /db_xref="GeneID:887590" /translation="MSLTRRPPKSPPQRPPRISGVVRLRLDIAYDGTDFAGWAAQVGQ RTVAGDLDAALTTIFRTPVRLRAAGRTDAGVHASGQVAHVDVPADALPNAYPRAGHVG DPEFLPLLRRLGRFLPADVRILDITRAPAGFDARFSALRRHYVYRLSTAPYGVEPQQA RYITAWPRELDLDAMTAASRDLMGLHDFAAFCRHREGATTIRDLQRLDWSRAGTLVTA HVTADAFCWSMVRSLVGALLAVGEHRRATTWCRELLTATGRSSDFAVAPAHGLTLIQV DYPPDDQLASRNLVTRDVRSG" gene complement(3876890..3877432) /gene="rplQ" /locus_tag="Rv3456c" /db_xref="GeneID:887523" CDS complement(3876890..3877432) /gene="rplQ" /locus_tag="Rv3456c" /function="INVOLVED IN TRANSLATION MECHANISM." /note="is a component of the macrolide binding site in the peptidyl transferase center" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L17" /protein_id="NP_217973.1" /db_xref="GI:15610592" /db_xref="GeneID:887523" /translation="MPKPTKGPRLGGSSSHQKAILANLATSLFEHGRITTTEPKARAL RPYAEKLITHAKKGALHNRREVLKKLRDKDVVHTLFAEIGPFFADRDGGYTRIIKIEA RKGDNAPMAVIELVREKTVTSEANRARRVAAAQAKAKKAAAMPTEESEAKPAEEGDVV GASEPDAKAPEEPPAEAPEN" gene complement(3877464..3878507) /gene="rpoA" /locus_tag="Rv3457c" /db_xref="GeneID:887629" CDS complement(3877464..3878507) /gene="rpoA" /locus_tag="Rv3457c" /EC_number="2.7.7.6" /function="DNA-DEPENDENT RNA POLYMERASE CATALYZES THE TRANSCRIPTION OF DNA INTO RNA USING THE FOUR RIBONUCLEOSIDE TRIPHOSPHATES AS SUBSTRATES. THE AMINO-TERMINAL PORTION IS INVOLVED IN THE ASSEMBLY OF CORE RNAP, WHEREAS THE C-TERMINAL IS INVOLVED IN INTERACTION WITH TRANSCRIPTIONAL REGULATORS [CATALYTIC ACTIVITY: N NUCLEOSIDE TRIPHOSPHATE = N PYROPHOSPHATE + RNA(N)]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Dimerization of the alpha subunit is the first step in the sequential assembly of subunits to form the holoenzyme" /codon_start=1 /transl_table=11 /product="DNA-directed RNA polymerase subunit alpha" /protein_id="NP_217974.1" /db_xref="GI:15610593" /db_xref="GeneID:887629" /translation="MLISQRPTLSEDVLTDNRSQFVIEPLEPGFGYTLGNSLRRTLLS SIPGAAVTSIRIDGVLHEFTTVPGVKEDVTEIILNLKSLVVSSEEDEPVTMYLRKQGP GEVTAGDIVPPAGVTVHNPGMHIATLNDKGKLEVELVVERGRGYVPAVQNRASGAEIG RIPVDSIYSPVLKVTYKVDATRVEQRTDFDKLILDVETKNSISPRDALASAGKTLVEL FGLARELNVEAEGIEIGPSPAEADHIASFALPIDDLDLTVRSYNCLKREGVHTVGELV ARTESDLLDIRNFGQKSIDEVKIKLHQLGLSLKDSPPSFDPSEVAGYDVATGTWSTEG AYDEQDYAETEQL" misc_feature complement(3877866..3877889) /gene="rpoA" /locus_tag="Rv3457c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3878659..3879264) /gene="rpsD" /locus_tag="Rv3458c" /db_xref="GeneID:887620" CDS complement(3878659..3879264) /gene="rpsD" /locus_tag="Rv3458c" /function="THIS PROTEIN BINDS DIRECTLY TO 16S RIBOSOMAL RNA." /note="primary rRNA binding protein; nucleates 30S assembly; involved in translational accuracy with proteins S5 and S12; interacts with protein S5; involved in autogeneously regulating ribosomal proteins by binding to pseudoknot structures in the polycistronic mRNA; interacts with transcription complex and functions similar to protein NusA in antitermination" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S4" /protein_id="NP_217975.1" /db_xref="GI:15610594" /db_xref="GeneID:887620" /translation="MARYTGPVTRKSRRLRTDLVGGDQAFEKRPYPPGQHGRARIKES EYLLQLQEKQKARFTYGVMEKQFRRYYEEAVRQPGKTGEELLKILESRLDNVIYRAGL ARTRRMARQLVSHGHFNVNGVHVNVPSYRVSQYDIVDVRDKSLNTVPFQIARETAGER PIPSWLQVVGERQRVLIHQLPERAQIDVPLTEQLIVEYYSK" misc_feature complement(3878926..3879000) /gene="rpsD" /locus_tag="Rv3458c" /note="PS00632 Ribosomal protein S4 signature" misc_feature complement(3879022..3879045) /gene="rpsD" /locus_tag="Rv3458c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(3879273..3879692) /gene="rpsK" /locus_tag="Rv3459c" /db_xref="GeneID:887524" CDS complement(3879273..3879692) /gene="rpsK" /locus_tag="Rv3459c" /function="S11 PLAYS AN ESSENTIAL ROLE FOR THE SELECTION OF THE CORRECT TRNA IN PROTEIN BIOSYNTHESIS. IT IS LOCATED ON THE LARGE LOBE OF THE SMALL SUBUNIT." /experiment="experimental evidence, no additional details recorded" /note="located on the platform of the 30S subunit, it bridges several disparate RNA helices of the 16S rRNA; forms part of the Shine-Dalgarno cleft in the 70S ribosome; interacts with S7 and S18 and IF-3" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S11" /protein_id="NP_217976.1" /db_xref="GI:15610595" /db_xref="GeneID:887524" /translation="MPPAKKGPATSARKGQKTRRREKKNVPHGAAHIKSTFNNTIVTI TDPQGNVIAWASSGHVGFKGSRKSTPFAAQLAAENAARKAQDHGVRKVDVFVKGPGSG RETAIRSLQAAGLEVGAISDVTPQPHNGVRPPKRRRV" misc_feature complement(3879306..3879329) /gene="rpsK" /locus_tag="Rv3459c" /note="PS00054 Ribosomal protein S11 signature" gene complement(3879696..3880070) /gene="rpsM" /locus_tag="Rv3460c" /db_xref="GeneID:887514" CDS complement(3879696..3880070) /gene="rpsM" /locus_tag="Rv3460c" /function="INVOLVED IN THE BINDING OF FMET-TRNA AND, HENCE, IN THE INITIATION OF TRANSLATION." /experiment="experimental evidence, no additional details recorded" /note="located at the top of the head of the 30S subunit, it contacts several helices of the 16S rRNA; makes contact with the large subunit via RNA-protein interactions and via protein-protein interactions with L5; contacts P-site tRNA" /codon_start=1 /transl_table=11 /product="30S ribosomal protein S13" /protein_id="NP_217977.1" /db_xref="GI:15610596" /db_xref="GeneID:887514" /translation="MARLVGVDLPRDKRMEVALTYIFGIGRTRSNEILAATGIDRDLR TRDLTEEQLIHLRDYIEANLKVEGDLRREVQADIRRKIEIGCYQGLRHRRGMPVRGQR TKTNARTRKGPKRTIAGKKKAR" misc_feature complement(3879768..3879809) /gene="rpsM" /locus_tag="Rv3460c" /note="PS00646 Ribosomal protein S13 signature" gene complement(3880286..3880399) /gene="rpmJ" /locus_tag="Rv3461c" /db_xref="GeneID:887272" CDS complement(3880286..3880399) /gene="rpmJ" /locus_tag="Rv3461c" /function="INVOLVED IN TRANSLATION MECHANISM." /note="smallest protein in the large subunit; similar to what is found with protein L31 and L33 several bacterial genomes contain paralogs which may be regulated by zinc; the protein from Thermus thermophilus has a zinc-binding motif and contains a bound zinc ion; the proteins in this group have the motif" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L36" /protein_id="NP_217978.1" /db_xref="GI:15610597" /db_xref="GeneID:887272" /translation="MKVNPSVKPICDKCRLIRRHGRVMVICSDPRHKQRQG" misc_feature complement(3880292..3880369) /gene="rpmJ" /locus_tag="Rv3461c" /note="PS00828 Ribosomal protein L36 signature" gene complement(3880432..3880653) /gene="infA" /locus_tag="Rv3462c" /db_xref="GeneID:887325" CDS complement(3880432..3880653) /gene="infA" /locus_tag="Rv3462c" /function="NO SPECIFIC FUNCTION HAS SO FAR BEEN ATTRIBUTED TO THIS INITIATION FACTOR; HOWEVER, IT SEEMS TO STIMULATE MORE OR LESS ALL THE ACTIVITIES OF THE OTHER TWO INITIATION FACTORS, IF-2 AND IF-3." /note="stimulates the activities of the other two initiation factors, IF-2 and IF-3" /codon_start=1 /transl_table=11 /product="translation initiation factor IF-1" /protein_id="NP_217979.1" /db_xref="GI:15610598" /db_xref="GeneID:887325" /translation="MAKKDGAIEVEGRVVEPLPNAMFRIELENGHKVLAHISGKMRQH YIRILPEDRVVVELSPYDLSRGRIVYRYK" gene 3880907..3881764 /locus_tag="Rv3463" /db_xref="GeneID:888286" CDS 3880907..3881764 /locus_tag="Rv3463" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3463, (MTCY13E12.16), len: 285 aa. Conserved hypothetical protein, similar to Q9RDA2|SCE20.23 HYPOTHETICAL 31.4 KDA PROTEIN from Streptomyces coelicolor (290 aa), FASTA scores: opt: 770, E(): 2.2e-41, (48.6% identity in 247 aa overlap); and Q9X7Y1|SC6A5.35 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (341 aa), (see BLASTP results), FASTA scores: opt: 119, E(): 2.9, (24.1% identity in 274 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217980.1" /db_xref="GI:15610599" /db_xref="GeneID:888286" /translation="MTNCAAGKPSSGPNLGRFGSFGRGVTPQQATEIEALGYGAVWVG GSPPAALSWVEPILQATTTLCVATGIVNIWSAPAQRVAESFHRIEAAYPGRFLLGIGV GHAEMISEYRKPYNALVEYLDRLDDYGVPANRRVVAALGPRVLGLSARRSAGAHPYLT TPEHTARARELIGPSAFLAPEHKVVLTTDSARARTVGRQALDMYFNLANYRNNWKRLG FTDDEVSRPGSDRLVDAVVAYGTPDAIAARLNEHLLAGADHVPIQVLTEDDNLVSALT ELAKPLRLT" gene 3881837..3882832 /gene="rmlB" /locus_tag="Rv3464" /db_xref="GeneID:887332" CDS 3881837..3882832 /gene="rmlB" /locus_tag="Rv3464" /EC_number="4.2.1.46" /function="INVOLVED IN DTDP-L-RHAMNOSE BIOSYNTHESIS [CATALYTIC ACTIVITY: DTDP-GLUCOSE = DTDP-4-DEHYDRO-6-DEOXY-D-GLUCOSE + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Rv3464, (MTCY13E12.17), len: 331 aa. rmlB (alternate gene name: rfbB), dTDP-glucose-4,6-dehydratase (EC 4.2.1.46) (see citations below), nearly identical to Q50556|RMLB rhamnose biosynthesis protein (EC 4.2.1.46) from Mycobacterium tuberculosis (329 aa) (previously rfbB, now known as rmlB). Equivalent to Q9CBH7|RMLB|ML1964 DTDP-GLUCOSE 4,6-DEHYDRATASE (alias Q9X7A3|RMLB PUTATIVE DTDP-(GLUCOSE OR RHAMNOSE)-4,6-DEHYDRATASE (331 aa)) from Mycobacterium leprae (333 aa), FASTA scores: opt: 1925, E(): 1.9e-112, (84.0% identity in 331 aa overlap). Also highly similar to others e.g. Q9UZH2|RFBB|PAB0785 from Pyrococcus abyssi (333 aa), FASTA scores: opt: 1115, E(): 4.2e-62, (51.55% identity in 322 aa overlap); O27817|MTH1789 from Methanobacterium thermoautotrophicum (336 aa), FASTA scores: opt: 1104, E(): 2.1e-61, (51.65% identity in 331 aa overlap); BAB60064|TVG0950610 from Thermoplasma volcanium (318 aa), FASTA scores: opt: 1102, E(): 2.6e-61, (49.65% identity in 310 aa overlap); etc. Also related to P72050|MTCY13D12.18|RV3784 HYPOTHETICAL 36.3 KDA PROTEIN (SIMILAR TO GALACTOWALDENASES FROM EUKARYOTIC AND PROKARYOTIC ORIGIN) from Mycobacterium tuberculosis (326 aa), FASTA scores: E(): 1.4e-26, (33.8% identity in 320 aa overlap).; rfbB" /codon_start=1 /transl_table=11 /product="dTDP-glucose 4,6-dehydratase RMLB" /protein_id="NP_217981.1" /db_xref="GI:15610600" /db_xref="GeneID:887332" /translation="MRLLVTGGAGFIGTNFVHSAVREHPDDAVTVLDALTYAGRRESL ADVEDAIRLVQGDITDAELVSQLVAESDAVVHFAAESHVDNALDNPEPFLHTNVIGTF TILEAVRRHGVRLHHISTDEVYGDLELDDRARFTESTPYNPSSPYSATKAGADMLVRA WVRSYGVRATISNCSNNYGPYQHVEKFIPRQITNVLTGRRPKLYGAGANVRDWIHVDD HNSAVRRILDRGRIGRTYLISSEGERDNLTVLRTLLRLMDRDPDDFDHVTDRVGHDLR YAIDPSTLYDELCWAPKHTDFEEGLRTTIDWYRDNESWWRPLKDATEARYQERGQ" gene 3882834..3883442 /gene="rmlC" /locus_tag="Rv3465" /db_xref="GeneID:887352" CDS 3882834..3883442 /gene="rmlC" /locus_tag="Rv3465" /EC_number="5.1.3.13" /function="INVOLVED IN dTDP-L-RHAMNOSE BIOSYNTHESIS, WITHIN THE O ANTIGEN BIOSYNTHESIS PATHWAY OF LIPOPOLYSACCHARIDE BIOSYNTHESIS: CONVERSION OF dTDP-4-KETO-6-DEOXY-D-GLUCOSE TO DTDP-4-KETO-RHAMNOSE [CATALYTIC ACTIVITY: dTDP-4-DEHYDRO-6-DEOXY-D-GLUCOSE = dTDP-4-DEHYDRO-6-DEOXY-L-MANNOSE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3465, (MTCY13E12.18), len: 202 aa. rmlC (alternate gene name: rfbC), dTDP-4-dehydrorhamnose 3,5-epimerase (EC 5.1.3.13) (see citations below), nearly identical to O33170|RMLC RMLC PROTEIN from Mycobacterium tuberculosis (203 aa), FASTA scores: opt: 1171, E(): 2.6e-71, (89.5% identity in 200 aa overlap) (previously known as rfbC). Equivalent to Q9X7A4|RMLC|ML1965 PUTATIVE DTDP-4-DEHYDRORHAMNOSE 3,5-EPIMERASE from Mycobacterium leprae (202 aa), FASTA scores: opt: 1072, E(): 1.1e-64, (75.4% identity in 199 aa overlap). Also highly similar to others e.g. Q9F8S7|CUMY from Streptomyces rishiriensis (198 aa), FASTA scores: opt: 671, E(): 7e-38, (51.3% identity in 193 aa overlap); Q9L6C5 from Streptomyces antibioticus (202 aa), FASTA scores: opt: 665, E(): 1.8e-37, (49.25% identity in 197 aa overlap); P29783|STRM_STRGR from Streptomyces griseus (200 aa), FASTA scores: opt: 608, E(): 1.2e-33, (49.25% identity in 201 aa overlap); Q54265|STRM from Streptomyces glaucescens (200 aa), FASTA scores: opt: 603, E(): 2.5e-33, (46.7% identity in 197 aa overlap); etc. Also highly similar to Q9S4D4|TYLJ PUTATIVE NDP-HEXOSE 3-EPIMERASE from Streptomyces fradiae (205 aa), FASTA scores: opt: 625, E(): 8.6e-35, (45.9% identity in 194 aa overlap).; rfbC" /codon_start=1 /transl_table=11 /product="dTDP-4-dehydrorhamnose 3,5-epimerase RmlC" /protein_id="NP_217982.1" /db_xref="GI:15610601" /db_xref="GeneID:887352" /translation="MKARELDVPGAWEITPTIHVDSRGLFFEWLTDHGFRAFAGHSLD VRQVNCSVSSAGVLRGLHFAQLPPSQAKYVTCVSGSVFDVVVDIREGSPTFGRWDSVL LDDQDRRTIYVSEGLAHGFLALQDNSTVMYLCSAEYNPQREHTICATDPTLAVDWPLV DGAAPSLSDRDAAAPSFEDVRASGLLPRWEQTQRFIGEMRGT" gene 3883525..3884193 /locus_tag="Rv3466" /db_xref="GeneID:887830" CDS 3883525..3884193 /locus_tag="Rv3466" /function="UNKNOWN" /note="Rv3466, (MTCY13E12.19), len: 222 aa. Conserved hypothetical ORF in REP13E12 repeat, but extending 5' of repeat. Has segment of identity to other REP13E12 ORF's e.g. MTCY336.16, MTCI65.15c, MTCY09F9.19, cMTCY251.14c." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217983.1" /db_xref="GI:15610602" /db_xref="GeneID:887830" /translation="MGSGSRERIVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLE CLVRRLPAVGHALINQLDAQASEEELGGTLCCALANRLRITKPDAARRIADAADLGPR RALTGEPLAPQLTATATAQRQGLIGEAHVKVIRALFRPPARRGGCVHPPGRRSRPGRQ SRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPEQPAIRRHVTAKWLPDPPS AGHL" repeat_region 3883550..3884921 /note="REP-8, len: 1372 bp. REP13E12, copies in Mycobacterium tuberculosis cosmids: cY336 from 14471 to 15821 (approx. 100% identity); cY251 from 11693 to 13109 (approx. 100% identity); cI65 from 14515 to 15905 (approx 75% identity); cI125 from 27240 to 28597 (approx. 65% Identity); cY22G8 from 13352 to 14689 (approx. 65% identity); and cY9F9 from 9019 to 10451 (approx. 65% identity); also nearly identical to EM_BA :MB35021 U35021 Mycobacterium bovis BCG DNA flanking deletion region 3 from 56 to 1466.; REP-8" /rpt_type=DIRECT gene 3883964..3884917 /locus_tag="Rv3467" /db_xref="GeneID:888022" CDS 3883964..3884917 /locus_tag="Rv3467" /function="UNKNOWN" /note="Rv3467, (MTCY13E12.20), len: 317 aa. Conserved hypothetical ORF in REP13E12 repeat, identical to ORF's from other REP13E12 copies e.g. MTCY251.13c, MTCI65.15c, MTCY09F9.19, cMTCY336.17. Also identical to Mycobacterium bovis Q50655 HYPOTHETICAL 34.6 kDa PROTEIN (317 aa) in identical repeat." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217984.1" /db_xref="GI:15610603" /db_xref="GeneID:888022" /translation="MSTRQAAEADLAGKAAQYRPDELARYAQRVMDWLHPDGDLTDTE RARKRGITLSNQQYDGMSRLSGYLTPQARATFEAVLAKLAAPGATNPDDHTPVIDTTP DAAAIDRDTRSQAQRNHDGLLAGLRALIASGKLGQHNGLPVSIVVTTTLTDLQTGAGK GFTGGGTLLPMADVIRMTSHAHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIM LFANDRGCTKPGCDAPAYHSQAHHVTAWTSTGRTDITELTLACGPDNRLAEKGWTTHN NTHGHTEWLPPPHLDHGQPRTNTFHHPERFLHNQDDDDKPD" gene complement(3884975..3886069) /locus_tag="Rv3468c" /db_xref="GeneID:888007" CDS complement(3884975..3886069) /locus_tag="Rv3468c" /EC_number="4.2.1.46" /function="POSSIBLY INVOLVED IN dTDP-L-RHAMNOSE BIOSYNTHESIS [CATALYTIC ACTIVITY: dTDP-GLUCOSE = dTDP-4-DEHYDRO-6-DEOXY-D-GLUCOSE + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Rv3468c, (MTCY13E12.21c), len: 364 aa. Possible dTDP-glucose-4,6-dehydratase (EC 4.2.1.46), but experimental study shown that the purified protein didn't have dTDP-glucose dehydratase (rmlB) activity (see citation below). Similar to others e.g. O08246|MTME from Streptomyces argillaceus (331 aa), FASTA scores: opt: 238, E(): 1.2e-07, (29.65% identity in 344 aa overlap); Q9LFG7|F4P12_220 from Arabidopsis thaliana (Mouse-ear cress) (433 aa), FASTA scores: opt: 237, E(): 1.8e-07, (27.25% identity in 308 aa overlap); Q9LZI2|F26K9_260 from Arabidopsis thaliana (Mouse-ear cress) (445 aa), FASTA scores: opt: 225, E(): 1e-06, (25.95% identity in 335 aa overlap); etc. Also similar to various enzymes and hypothetical unknowns proteins e.g. BAB48655|MLL1234 UDP-GLUCOSE 4-EPIMERASE from Rhizobium loti (Mesorhizobium loti) (307 aa), FASTA scores: opt: 757, E(): 4.6e-40, (43.4% identity in 302 aa overlap). First start taken, alternative at 17080 in cSCYY13E12 suggested by similarity. Note that previously known as rmlB3 (see citation below).; rmlB3" /codon_start=1 /transl_table=11 /product="dTDP-glucose 4,6-dehydratase" /protein_id="YP_177974.1" /db_xref="GI:57117109" /db_xref="GeneID:888007" /translation="MGTHAATMRVRAGVRSSPLLLHAGTPPTAAAAESGMRTLVTGSS GHLGEALVRTLRARGADIVSLDSRPSRYTNIVGCVSDRALLRDVMAGVEVVFHAAAHH KPQLAFLPRQAFLDTNIIGTQTVLDAAVAANVRAFVMTSSTTVFGDALTPPADQPAAW IDESVTPIPKNIYGVTKASSEDLCQLAHRNDGLACVVLRVARFFVEGDDMPDLYDGRS QDNIKANEYACRRVALEDAVDAHLNAAQRAPQLGFGRYLVSATTPFTRDDLTQLRTDA ASVFARRVPLAAAVWTQRGWRFPDRLDRVYVNSRARRDLNWRPRFDLNAVAARLARGQ SVHTPLSQLVGSKAYAHSSYHRGVFAPARP" gene complement(3886073..3887083) /gene="mhpE" /locus_tag="Rv3469c" /db_xref="GeneID:888074" CDS complement(3886073..3887083) /gene="mhpE" /locus_tag="Rv3469c" /EC_number="4.1.3.-" /function="INVOLVED IN AROMATIC HYDROCARBONS DEGRADATION [CATALYTIC ACTIVITY: 4-HYDROXY-2-OXOVALERATE = PYRUVATE + ACETALDEHYDE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of pyruvate and acetaldehyde from 4-hydroxy-2-ketovaleric acid; involved in the degradation of phenylpropionate" /codon_start=1 /transl_table=11 /product="4-hydroxy-2-ketovalerate aldolase" /protein_id="NP_217986.1" /db_xref="GI:15610605" /db_xref="GeneID:888074" /translation="MLMTATHREPIVLDTTVRDGSYAVNFQYTDDDVRRIVGDLDAAG IPYIEIGHGVTIGAAAAQGPAAHTDEEYFRAARSVVRNARLGAVIVPALARIETVDLA GDYLDFLRICVIATEFELVMPFVERAQSKGLEVSIQLVKSHLFEPDVLAAAGKRARDV GVRIVYVVDTTGTFLPEDARRYVEALRGASDVSVGFHGHNNLAMAVANTLEAFDAGAD FLDGTLMGFGRGAGNCQIECLVAALQRRGHLAAVDLDRIFDAARSDMLGRSPQSYGID PWEISFGFHGLDSLQVEHLRAAAQQAGLSVSHVIRQTAKSHAGQWLSPQDIDRVVVGM RA" gene complement(3887144..3888802) /gene="ilvB2" /locus_tag="Rv3470c" /db_xref="GeneID:888041" CDS complement(3887144..3888802) /gene="ilvB2" /locus_tag="Rv3470c" /EC_number="2.2.1.6" /function="INVOLVED IN VALINE AND ISOLEUCINE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 2-ACETOLACTATE + CO(2) = 2-PYRUVATE]." /note="Rv3470c, (MTCY13E12.23c), len: 552 aa. Probable ilvB2, acetolactate synthase large subunit (EC 4.1.3.18), similar to others e.g. P73913|ILVG|SLR2088 from Synechocystis sp. strain PCC 6803 (621 aa), FASTA scores: opt: 779, E(): 4.5e-39, (30.7% identity in 567 aa overlap); O78518|ILVB_GUITH from Guillardia theta (Cryptomonas phi) (575 aa), FASTA scores: opt: 742, E(): 6.9e-37, (28.8% identity in 566 aa overlap); Q59950|ILVX from Spirulina platensis (612 aa), FASTA scores: opt: 715, E(): 3e-35, (28.45% identity in 569 aa overlap); etc. Contains thiamine pyrophosphate enzymes signature (PS00187)." /codon_start=1 /transl_table=11 /product="acetolactate synthase large subunit" /protein_id="NP_217987.1" /db_xref="GI:15610606" /db_xref="GeneID:888041" /translation="MTVGDHLVARMRAAGISVVCGLPTSRLDSLLVRLSRDAGFQIVL ARHEGGAGYLADGFARASGKSAAVFVAGPGATNVISAVANASVNQVPMLILTGEVAVG EFGLHSQQDTSDDGLGLGATFRRFCRCSVSIESIANARSKIDSAFRALASIPRGPVHI ALPRDLVDERLPAHQLGTAAAGLGGLRTLAPCGPDVADEVIGRLDRSRAPMLVLGNGC RLDGIGEQIVAFCEKAGLPFATTPNGRGIVAETHPLSLGVLGIFGDGRADEYLFDTPC DLLIAVGVSFGGLVTRSFSPRWRGLKADVVHVDPDPSAVGRFVATSLGITTSGRAFVN ALNCGRPPRFCRRVGVRPPAPAALPGTPQARGESIHPLELMHELDRELAPNATICADV GTCISWTFRGIPVRRPGRFFATVDFSPMGCGIAGAIGVALARPEEHVICIAGDGAFLM HGTEISTAVAHGIRVTWAVLNDGQMSASAGPVSGRMDPSPVARIGANDLAAMARALGA EGIRVDTRCELRAGVQKALAATGPCVLDIAIDPEINKPDIGLGR" misc_feature complement(3887462..3887521) /gene="ilvB2" /locus_tag="Rv3470c" /note="PS00187 Thiamine pyrophosphate enzymes signature" gene complement(3888808..3889341) /locus_tag="Rv3471c" /db_xref="GeneID:888050" CDS complement(3888808..3889341) /locus_tag="Rv3471c" /function="UNKNOWN" /note="Rv3471c, (MTCY13E12.24c), len: 177 aa. Conserved hypothetical protein, similar to Q59013|MJ1618 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (125 aa), FASTA scores: opt: 262, E(): 1.2e-09, (39.05% identity in 105 aa overlap); and O26452|MTH352 CONSERVED PROTEIN from Methanobacterium thermoautotrophicum (131 aa), FASTA scores: opt: 222, E(): 3.8e-07, (35.05% identity in 117 aa overlap). Equivalent to AAK47934 from Mycobacterium tuberculosis strain CDC1551 (184 aa) but shorter 7 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217988.1" /db_xref="GI:15610607" /db_xref="GeneID:888050" /translation="MSTRPERERASTSTDAVLQATVALSAGHKPAFRGFVKDPPRARA HAAAMFVSNAREAEPFVAPDLSEIRVLVDRATVGVASVSLAHATVAAGAETVWHRLQA TDEIYFVLSGRGLVSVGDESGEVGPGDAVWIPAGVPQKIRALGSVPLTFLCACGPAYL PERDQRMGEAAVIGAWP" gene 3889362..3889868 /locus_tag="Rv3472" /db_xref="GeneID:888006" CDS 3889362..3889868 /locus_tag="Rv3472" /function="UNKNOWN" /note="Rv3472, (MTCY13E12.25), len: 168 aa. Conserved hypothetical protein, showing some similarity to other proteins e.g. Q9ZAT9|DPSH DAUNORUBICIN BIOSYNTHESIS ENZYME from Streptomyces peucetius (194 aa), FASTA scores: opt: 181, E(): 6.8e-05, (30.7% identity in 127 aa overlap); Q53879 DAUH/E from Streptomyces sp. C5 (151 aa), FASTA scores: opt: 168, E(): 0.00038, (29.25% identity in 127 aa overlap); and Q9L4U3|AKNV from Streptomyces galilaeus (144 aa), FASTA scores: opt: 122, E(): 0.36, (31.25% identity in 129 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217989.1" /db_xref="GI:15610608" /db_xref="GeneID:888006" /translation="MRPVDEQWIEILRIQALCARYCLTIDTQDGEGWAGCFTEDGAFE FDGWVIRGRPALREYADAHARVVRGRHLTTDLLYEVDGDVATGRSASVVTLATAAGYK ILGSGEYQDRLIKQDGQWRIAYRRLRNDRLVSDPSVAVNVADADVAAVVGHLLAAARR LGTQMSDT" gene complement(3889948..3890733) /gene="bpoA" /locus_tag="Rv3473c" /db_xref="GeneID:888101" CDS complement(3889948..3890733) /gene="bpoA" /locus_tag="Rv3473c" /EC_number="1.11.1.-" /function="SUPPOSED INVOLVED IN DETOXIFICATION REACTIONS." /note="Rv3473c, (MTCY13E12.26c), len: 261 aa. Possible bpoA, peroxidase (non-haem peroxidase) (EC 1.11.1.-), similar to various enzymes or hypothetical unknown proteins e.g. O85849 HYPOTHETICAL 26.2 KDA PROTEIN from Sphingomonas aromaticivorans (247 aa), FASTA scores: opt: 684, E(): 4.9e-34, (43.8% identity in 242 aa overlap); AAK45412|MT1155 HYDROLASE, ALPHA/BETA HYDROLASE FOLD FAMILY from Mycobacterium tuberculosis strain CDC1551 (311 aa), FASTA scores: opt: 675, E(): 2e-33, (39.45% identity in 256 aa overlap); Q9K3V0|SCD10.27 PUTATIVE HYDROLASE from Streptomyces coelicolor (352 aa), FASTA scores: opt: 248, E(): 9.7e-08, (26.05% identity in 261 aa overlap); P29715|BPA2_STRAU|BPOA2 NON-HAEM BROMOPEROXIDASE (EC 1.11.1.-) (BROMIDE PEROXIDASE) (277 aa), FASTA scores: opt: 237, E(): 3.6e-07, (29.45% identity in 265 aa overlap); O31168|PRXC_STRAU|CPO|CPOT NON-HEME CHLOROPEROXIDASE (EC 1.11.1.10) (278 aa), FASTA scores: opt: 236, E(): 4.2e-07, (29.45% identity in 265 aa overlap); AAK62388|T5L19.180 LIPASE-LIKE PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (350 aa), FASTA scores: opt: 236, E(): 5.1e-07, (26.65% identity in 274 aa overlap); etc. Also similar to O06575|BPOB|Rv1123c|MTCY22G8.12c HYPOTHETICAL 32.5 KDA PROTEIN from Mycobacterium tuberculosis (302 aa), FASTA scores: opt: 675, E(): 2e-33, (39.45% identity in 256 aa overlap). Equivalent to AAK47936 from Mycobacterium tuberculosis strain CDC1551 (294 aa) but shorter 33 aa. May have been inactivated or truncated by neighbouring IS6110." /codon_start=1 /transl_table=11 /product="peroxidase BpoA" /protein_id="NP_217990.1" /db_xref="GI:15610609" /db_xref="GeneID:888101" /translation="MVFLHGGGQTRRSWGRAAAAVAERGWQAVTIDLRGHGESDWSSE GDYRLVSFAGDIQEVLRNLPGQPALVGASLGGFAAMLLAGELSPGIASAVVLVDIVPN MDLAGASRIHAFMAERVESGFGSLDEVADVIANYNPHRPRPSDPDGLVANLRRRGDRW YWHWDPQFIGGIAAFPPVEVTDVDRMNAAVATILRDEVPVLLVRGQVSDIVRQESADQ FLSRFPQVEFTDVRGAGHMVAGDRNDAFAGAVLDFLARHVGVR" repeat_region 3890779..3892133 /note="IS6110-16, len: 1355 bp. Insertion sequence IS6110." /mobile_element="insertion sequence:IS6110-16" repeat_region 3890779..3890806 /note="28 bp inverted repeat at left end of IS6110 :TGAACCGCCCCGGCATGTCCGGAGACTC" gene 3890830..3891156 /locus_tag="Rv3474" /db_xref="GeneID:888097" CDS 3890830..3891156 /locus_tag="Rv3474" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="first part; Rv3474, (MTCY13E12.27), len: 108 aa. Possible transposase (first part), probably frameshifts (-1) with MTCY13E12.28|Rv3475 to make full length product. Identical to Q50686|YIA4_MYCTU INSERTION ELEMENT IS6110 HYPOTHETICAL 12.0 kDa PROTEIN (108 aa). BELONGS TO THE TRANSPOSASE FAMILY 8" /codon_start=1 /transl_table=11 /product="transposase IS6110" /protein_id="NP_217991.1" /db_xref="GI:15610610" /db_xref="GeneID:888097" /translation="MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGV GCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFAAE LDRPAR" gene 3891051..3892091 /locus_tag="Rv3475" /db_xref="GeneID:888055" CDS <3891051..3892091 /locus_tag="Rv3475" /function="INVOLVED IN THE TRANSPOSITION OF THE INSERTION SEQUENCE IS6110." /note="[SECOND PART]; Rv3475, (MTCY13E12.28), len: 346 aa. Possible IS6110 transposase (second part), probably made by a frameshift (-1) with MTCY13E12.27|Rv3474. Identical to P19774|TRA9_MYCTU PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS986/IS6110 (278 aa)" /codon_start=1 /transl_table=11 /product="transposase IS6110" /protein_id="NP_217992.1" /db_xref="GI:15610611" /db_xref="GeneID:888055" /translation="AEALAAGQRRIAKGERDFKDRVGFLRGRARPASTLITRFIADHQ GHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTIADPATARP ADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVASTMATSMV LDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYD NALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA AYYAQRQRPAAG" repeat_region complement(3892106..3892133) /note="28 bp inverted repeat at right end of IS6110 :TGAACCGCCCCGGTGAGTCCGGAGACTC" gene complement(3892371..3893720) /gene="kgtP" /locus_tag="Rv3476c" /db_xref="GeneID:888002" CDS complement(3892371..3893720) /gene="kgtP" /locus_tag="Rv3476c" /function="INVOLVED IN ACTIVE TRANSPORT OF DICARBOXYLIC ACID ACROSS THE MEMBRANE. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3476c, (MTCY13E12.29c), len: 449 aa. Probable kgtP, dicarboxylate-transport integral membrane protein, possibly member of major facilitator superfamily (MFS), highly similar to others e.g. Q9HT43|PA5530 from Pseudomonas aeruginosa (435 aa), FASTA scores: opt: 1209, E(): 2.3e-68, (47.05% identity in 425 aa overlap); Q9I6Q9|PCAT|PA0229 from Pseudomonas aeruginosa (432 aa), FASTA scores: opt: 1131, E(): 1.8e-63, (40.4% identity in 438 aa overlap); Q9WWZ2 from Pseudomonas putida (429 aa), FASTA scores: opt: 1090, E(): 6.5e-61, (41.2% identity in 425 aa overlap); P17448|KGTP_ECOLI|WITA|B2587 from Escherichia coli strain K12 (432 aa), FASTA scores: opt: 1083, E(): 1.8e-60, (40.05% identity in 422 aa overlap); etc. Also similar to O05301|MTCI364.12|Rv1200 HYPOTHETICAL 44.6 KDA PROTEIN from Mycobacterium tuberculosis (425 aa), FASTA scores: E(): 5.2e-25, (28.5% identity in 382 aa overlap). Contains sugar transport protein signatures 1 and 2 (PS00216, PS00217). BELONG TO THE SUGAR TRANSPORTER FAMILY." /codon_start=1 /transl_table=11 /product="dicarboxylic acid transport integral membrane protein KgtP" /protein_id="NP_217993.1" /db_xref="GI:15610612" /db_xref="GeneID:888002" /translation="MTVSIAPPSRPSQAETRRAIWNTIRGSSGNLVEWYDVYVYTVFA TYFEDQFFDRADRNSTVYVYAIFAVTFVTRPVGSWFLGRFADRRGRRAALTFSVSLMA ACSLIVALVPSRSSIGVAAPILLILCRLVQGFATGGEYGTSATYMSEAATRERRGYFS SFQYVTLVGGHVLAQFTLLVILAVFTREQVHEFGWRIGFAVGGGAAIVVFWLRRTMDE SLSQERLTAIKAGRDHDSGSLRELATHYWKPLLLCFLVTLGGTVAFYTYSVNAPAIVK SVYGSQAMTATWINLVGLILLMMLQPIGGMISDKIGRKPLLLWFGVGGLIYTYVLVTY LPETRSPTMSFLLVAVGYVILTGYCSINALVKSELFPAHVRALGVGVGYALANSVFGG TAPLIYQALKERDQVPMFIAYVTACIAVSLIVYVFFIKNKADTYLDREQGFAFYGHA" misc_feature complement(3892758..3892811) /gene="kgtP" /locus_tag="Rv3476c" /note="PS00216 Sugar transport proteins signature 1" misc_feature complement(3893253..3893330) /gene="kgtP" /locus_tag="Rv3476c" /note="PS00217 Sugar transport proteins signature 2" gene 3894093..3894389 /gene="PE31" /locus_tag="Rv3477" /db_xref="GeneID:888474" CDS 3894093..3894389 /gene="PE31" /locus_tag="Rv3477" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3477, (MTCY13E12.30), len: 98 aa. Member of the Mycobacterium tuberculosis PE family (see Brennan & Delogu 2002), similar to O53941|Rv1791|MTV049.13 (99 aa), FASTA scores: opt: 373, E(): 4.3e-18, (64.65% identity in 99 aa overlap); MTCI364.07; MTCY21C12.10c; MTCY1A11.25c; MTC1A11.04; MTCY359.33; etc." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177975.1" /db_xref="GI:57117110" /db_xref="GeneID:888474" /translation="MSFTAQPEMLAAAAGELRSLGATLKASNAAAAVPTTGVVPPAAD EVSLLLATQFRTHAATYQTASAKAAVIHEQFVTTLATSASSYADTEAANAVVTG" gene 3894426..3895607 /gene="PPE60" /locus_tag="Rv3478" /db_xref="GeneID:888047" CDS 3894426..3895607 /gene="PPE60" /locus_tag="Rv3478" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3478, (MTCY13E12.31), len: 393 aa. PPE60 (alternate gene name: mtb39c). Member of the M. tuberculosis PPE family, highly similar to others e.g. Q11031|YD61_MYCTU|Rv1361c|MT1406|MTCY02B10.25c (396 aa), FASTA scores: opt: 2165, E(): 1.1e-109, (85.35% identity in 396 aa overlap); MTCI364.08; MTCY10G2.10; MTCY03A2.22c; MTCY274.23c; MTCY164.34c; MTCY98.0029c; etc. Note that expression of Rv3478 was demonstrated in lysates by immunodetection (see Dillon et al., 1999).; mtb39c" /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177976.1" /db_xref="GI:57117111" /db_xref="GeneID:888047" /translation="MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAAS AFQSVVWGLTVGSWIGSSAGLMAAAASPYVAWMSVTAGQAQLTAAQVRVAAAAYETAY RLTVPPPVIAENRTELMTLTATNLLGQNTPAIEANQAAYSQMWGQDAEAMYGYAATAA TATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQLMNNVPQALQQLAQPAQGV VPSSKLGGLWTAVSPHLSPLSNVSSIANNHMSMMGTGVSMTNTLHSMLKGLAPAAAQA VETAAENGVWAMSSLGSQLGSSLGSSGLGAGVAANLGRAASVGSLSVPPAWAAANQAV TPAARALPLTSLTSAAQTAPGHMLGGLPLGHSVNAGSGINNALRVPARAYAIPRTPAA G" gene 3895820..3898885 /locus_tag="Rv3479" /db_xref="GeneID:888478" CDS 3895820..3898885 /locus_tag="Rv3479" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3479, (MTCY13E12.32), len: 1021 aa. Possible transmembrane protein, with hydrophobic stretches at C-terminus. Start changed since first submission (-54 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217996.2" /db_xref="GI:57117112" /db_xref="GeneID:888478" /translation="MAGVTREINLLAQASQWRRLGGTFPTNSQLTNESAASLRLYAQL IDLLDMVVDVDILSGTSAGGINAALLASSRVTGSDLGGIRDLWLDLGALTELLRDPRD KKTPSLLYGDERIFAALAKRLPKLATGPFPPTTFPEAARTPSTTLYITTTLLAGETSR FTDSFGTLVQDVDLRGLFTFTETDLARPDTAPALALAARSSASFPLAFEPSFLPFTKG TAKKGEVPARPAMAPFTSLTRPHWVSDGGLLDNRPIGVLFKRIFDRPARRPVRRVLLF VVPSSGPAPDPMHEPPPDNVDEPLGLIDGLLKGLAAVTTQSIAADLRAIRAHQDCMEA RTDAKLRLAELAATLRNGTRLLTPSLLTDYRTREATKQAQTLTSALLRRLSTCPPESG PATESLPKSWSAELTVGGDADKVCRQQITATILLSWSQPTAQPLPQSPAELARFGQPA YDLAKGCALTVIRAAFQLARSDADIAALAEVTEAIHRAWRPTASSDLSVLVRTMCSRP AIRQGSLENAADQLAADYLQQSTVPGDAWERLGAALVNAYPTLTQLAASASADSGAPT DSLLARDHVAAGQLETYLSYLGTYPGRADDSRDAPTMAWKLFDLATTQRAMLPADAEI EQGLELVQVSADTRSLLAPDWQTAQQKLTGMRLHHFGAFYKRSWRANDWMWGRLDGAG WLVHVLLDPRRVRWIVGERADTNGPQSGAQWFLGKLKELGAPDFPSPGYPLPAVGGGP AQHLTEDMLLDELGFLDDPAKPLPASIPWTALWLSQAWQQRVLEEELDGLANTVLDPQ PGKLPDWSPTSSRTWATKVLAAHPGDAKYALLNENPIAGETFASDKGSPLMAHTVAKA AATAAGAAGSVRQLPSVLKPPLITLRTLTLSGYRVVSLTKGIARSTIIAGALLLVLGV AAAIQSVTVFGVTGLIAAGTGGLLVVLGTWQVSGRLLFALLSFSVVGAVLALATPVVR EWLFGTQQQPGWVGTHAYWLGAQWWHPLVVVGLIALVAIMIAAATPGRR" gene complement(3898909..3900402) /locus_tag="Rv3480c" /db_xref="GeneID:888473" CDS complement(3898909..3900402) /locus_tag="Rv3480c" /function="UNKNOWN" /note="Rv3480c, (MTCY13E12.33c), len: 497 aa. Conserved hypothetical protein, similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O69701|Y1D4_MYCTU|Rv3734c|MT3839|MTV025.082c (454 aa), FASTA scores: opt: 520, E(): 2e-23, (39.95% identity in 488 aa overlap); Q10554|Y895_MYCTU|Rv0895|MTCY31.23 (505 aa), FASTA scores: opt: 434, E(): 2.7e-18, (34.2% identity in 497 aa overlap); AAK45165|MT0919 (520 aa), FASTA scores: opt: 434, E(): 2.7e-18, (34.2% identity in 497 aa overlap); etc. Also similar to Q9X7A8|MLCB1610.05|ML1244 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 272, E(): 1e-08, (28.85% identity in 485 aa overlap); and Q9RIU8|CM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 254, E(): 1.1e-07, (30.4% identity in 497 aa overlap). SEEMS TO BELONG TO THE UPF0089 FAMILY. TBparse score is 0.917." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217997.1" /db_xref="GI:15610616" /db_xref="GeneID:888473" /translation="MSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPDLL RQLVDESKASEVVEPWSLRLSHPELLYHPTQSWVVDDNFDLDYHVRRSALASPGDERE LGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMHHSLIDGYTGQKMLARSLS TDPHDTTHPLFFNIPTPGRSPADTQDSVGGGLIAGAGNVLDGLGDVVRGLGGLVSGVG SVLGSVAGAGRSTFELTKALVNAQLRSDHEYRNLVGSVQAPHCILNTRISRNRRFATQ QYPLDRLKAIGAQYDATINDVALAIIGGGLRRFLDELGELPNKSLIVVLPVNVRPKDD EGGGNAVATILATLGTDVADPVQRLAAVTASTRAAKAQLRSMDKDAILAYSAALMAPY GVQLASTLSGVKPPWPYTFNLCVSNVPGPEDVLYLRGSRMEASYPVSLVAHSQALNVT LQSYAGTLNFGFIGCRDTLPHLQRLAVYTGEALDQLAAADGAAGLGS" gene complement(3900493..3901182) /locus_tag="Rv3481c" /db_xref="GeneID:888466" CDS complement(3900493..3901182) /locus_tag="Rv3481c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3481c, (MTCY13E12.34c), len: 229 aa. Probable integral membrane protein. No real similarity with others." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_217998.1" /db_xref="GI:15610617" /db_xref="GeneID:888466" /translation="MRGLLPVAGHWVSVLTGLVPLALVIALSPLSVIPAVLVVHSPQP RPSSLAFLGGWLLGLAVVTAVFVAASGALGGLSTTSPAWASWLRVVLGSALIVFGVLR WLTRHRHTEMPGWMRAFASFTPARAGLVGAVLVVVRPEVLIICAAAGLAIGSGGHGAA GSWIYTAFFAMLAASTVAIPILAYVAAGDRLDDSLERLKDWMEKNHAGMVAAILVVIG LLLLYNGVHAM" gene complement(3901324..3902106) /locus_tag="Rv3482c" /db_xref="GeneID:888480" CDS complement(3901324..3902106) /locus_tag="Rv3482c" /function="UNKNOWN" /note="Rv3482c, (MTCY13E12.35c), len: 260 aa. Probable conserved membrane protein. N-terminal region shares some similarity with N-terminus of O88067|SCI35.32c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (319 aa), FASTA scores: opt: 155, E(): 0.023, (54.55% identity in 33 aa overlap); and with C-terminus of O06254|Rv3437|MTCY77.09 HYPOTHETICAL 17.9 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (alias AAK47883|MT3542.1 from strain CDC1551) (158 aa), FASTA scores: opt: 140, E(): 0.11, (58.8% identity in 34 aa overlap). Some similarity to others e.g. Q9XAN5|SC4C6.05c PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (347 aa), FASTA scores: opt: 131, E(): 0.75, (29.4% identity in 221 aa overlap). First start taken." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_217999.1" /db_xref="GI:15610618" /db_xref="GeneID:888480" /translation="MEHDVATSPPAGWYTDPDGSAGQRYWDGDRWTRHRRPNPSAPRS PLALRVDGLRSRWLGMPAGLRLTVPVAAVLTMVGVAVYAWIRPLPDDWSQLPKRLSCQ LRPGPTPPATITVASVDVSHPRGAVLRLVVRFAEPLPPSPSGSFASGFAGYLLTYTIA NNGKEFAELGPQQDTDELAIRKPGESRGTEPNMRPDRNTNARRTAPDTVEINLETKRL GLDQAPVDPQLTFAAQFRTPSTVTVDFGSQFCQGERLAGQRR" gene complement(3902150..3902812) /locus_tag="Rv3483c" /db_xref="GeneID:888448" CDS complement(3902150..3902812) /locus_tag="Rv3483c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3483c, (MTCY13E12.36c), len: 220 aa. Conserved hypothetical protein (see citation below), similar to Q9CC94|ML1099 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (202 aa), FASTA scores: opt: 276, E(): 1.4e-08, (33.1% identity in 148 aa overlap). Also showing similarity with Mycobacterium tuberculosis proteins Q11065|LPRE_MYCTU|LPRE|Rv1252c|MT1291|MTCY50.30. PUTATIVE LIPOPROTEIN PRECURSOR (202 aa), FASTA scores: opt: 276, E(): 1.4e-08, (29.5% identity in 200 aa overlap); O53445|Rv1097c|MTV017.50c HYPOTHETICAL 29.9 KDA PROTEIN (293 aa), FASTA scores: opt: 161, E(): 0.047, (25.4% identity in 118 aa overlap); P71882|LPPP_MYCTU|Rv2330c|MT2392|MTCY3G12.04 PUTATIVE LIPOPROTEIN PRECURSOR (175 aa), FASTA scores: opt: 146, E(): 0.21, (28.25% identity in 184 aa overlap); and O06170|Rv2507|MTCY07A7.13 HYPOTHETICAL 28.5 KDA PROTEIN (273 aa), FASTA scores: opt: 148, E(): 0.23, (25.15% identity in 191 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218000.1" /db_xref="GI:15610619" /db_xref="GeneID:888448" /translation="MSDEIDPDWPAPAYQPSDDVDTTPPAPGGSWPTAWLVALVVLAC VAAAVVAYAGMHRVRPGANQAAPATTSAPARPTSPASQVGPCGPDEATAVRAALAQLA PDSKTGRPWNSTPEDSNYDPCADLSAVLVTVQDATNSSPDQALMFHRGTFVGTATPRA YPFTNLIGPASTNDIVVLSYRTRQSCDGCQDGILTIVGFAWRGDHVQILDSLPELFDA PP" gene 3903078..3904616 /gene="cpsA" /locus_tag="Rv3484" /db_xref="GeneID:888445" CDS 3903078..3904616 /gene="cpsA" /locus_tag="Rv3484" /function="NOT KNOW." /note="Rv3484, (MTCY13E12.37), len: 512 aa. Possible cpsA, hypothetical protein, equivalent to Q50160|CPSA|ML2247 HYPOTHETICAL PROTEIN CPSA from Mycobacterium leprae (516 aa), FASTA scores: opt: 2557, E(): 1.6e-143, (74.9% identity in 518 aa overlap); and with good similarity to Q9CCK9|ML0750 HYPOTHETICAL PROTEIN from Mycobacterium leprae (489 aa), FASTA scores: opt: 855, E(): 4.6e-43, (34.45% identity in 502 aa overlap). Also similar (or with similarity) to hypothetical proteins from Mycobacterium tuberculosis: P96872|Rv3267|MTCY71.07 (498 aa), FASTA scores: opt: 928, E(): 2.3e-47, (37.35% identity in 498 aa overlap); and O53834|Rv0822c|MTV043.14c (684 aa), FASTA scores: opt: 425, E(): 1.5e-17, (26.15% identity in 524 aa overlap). Shows also similarity with various bacterial proteins e.g. Q9KZK0|SCE34.26 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (507 aa), FASTA scores: opt: 329, E(): 5.3e-12, (28.85% identity in 478 aa overlap); Q9K4E6|2SC6G5.02 CONSERVED HYPOTHETICAL PROTEIN, POSSIBLE MEMBRANE PROTEIN, from Streptomyces coelicolor (382 aa), FASTA scores: opt: 305, E(): 1.1e-10, (29.8% identity in 386 aa overlap); O69850|SC1C3.08c PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (366 aa), FASTA scores: opt: 304, E(): 1.2e-10, (29.6% identity in 395 aa overlap); Q9KZK3|SCE34.23 PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (396 aa), FASTA scores: opt: 296, E(): 3.8e-10, (31.25% identity in 349 aa overlap); AAK43602|CPSA CPSA PROTEIN from Streptococcus agalactiae (485 aa), FASTA scores: opt: 250, E(): 2.4e-07, (30.25% identity in 162 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218001.1" /db_xref="GI:15610620" /db_xref="GeneID:888445" /translation="MARSEGNRPRHRAVPQPSRIRKRLSRGVMTLVSVVALLMTGAGY WVAHGALGGITISQALTPEDPRSSGNNMNILLIGLDSRKDQEGNDLPWSVLKQLHAGD SDDGGYNTNTLILVHVGADGKVVAFSIPRDDWVPFTGVPGYNHIKIKEAYGLTKQYVA EQLANQGVSDRKELETRGREAARAATLRAVRSLTGVPIDYFAEINLAGFYDLAQTLGG VDVCLNHAVYDSYSGADFPAGRQRLNAAQALAFVRQRHGLDNGDLDRTHRQQAFLSSV MRELQDSGTFTNLDRLDNLMAVARKDVVLSAGWDEDLFRRMGDLAGGNVEFRTLPVVR YDNIDGQDVNIIDPTAIRAEVAAAFGSAPPTSQTAAAAKPNPSTVVDVVNAGSISGLA SQVSGALLKRGYTAGQVRDRESGDPFTTAIEYGAGAETDAQNVADLLGIDAPNHPDPA VAPGHIRVTVDTNFSLPAPDEATAAATSTETSTYPLYGGGTTTDPTPDQGAPIDGGGV PCVN" gene complement(3904622..3905566) /locus_tag="Rv3485c" /db_xref="GeneID:888427" CDS complement(3904622..3905566) /locus_tag="Rv3485c" /function="UNKNOWN; SUPPOSED TO BE INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3485c, (MTCY13E12.38c), len: 314 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar, but longer 41 aa, to P71824|Rv0769|MTCY369.14 PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE CY369.14 from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 462, E(): 1.8e-19, (34.0% identity in 253 aa overlap). Also similar to various dehydrogenases e.g. P25529|HDHA_ECOLI|HSDH|B1619 NAD-DEPENDENT 7 ALPHA-HYDROXYSTEROID DEHYDROGENASE (SDR FAMILY) (EC 1.1.1.159) from Escherichia coli strain K12 (alias BAB35750|ECS2327 or AAG56608|HDHA for strain O157:H7) (255 aa), FASTA scores: opt: 462, E(): 1.8e-19, (34.7% identity in 248 aa overlap); Q9FD15|RUBG PUTATIVE REDUCTASE (SDR FAMILY) from Streptomyces collinus (249 aa), FASTA scores: opt: 446, E(): 1.5e-18, (36.1% identity in 255 aa overlap); BAB51974|MLL5540 PUTATIVE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (253 aa), FASTA scores: opt: 442, E(): 2.5e-18, (36.25% identity in 251 aa overlap); Q08632|SDR1_PICAB SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE (SDR FAMILY) from Picea abies (Norway spruce) (Picea excelsa) (271 aa), FASTA scores: opt: 441, E(): 3.1e-18, (32.3% identity in 260 aa overlap); Q9A326|CC3380 2-DEOXY-D-GLUCONATE 3-DEHYDROGENASE from Caulobacter crescentus (260 aa), FASTA scores: opt: 436, E(): 5.7e-18, (32.8% identity in 253 aa overlap); Q16698|DECR_HUMAN 2,4-DIENOYL-COA REDUCTASE, MITOCHONDRIAL PRECURSOR (EC 1.3.1.34) from Homo sapiens (Human) (335 aa), FASTA scores: opt: 430, E(): 1.5e-17, (30.4% identity in 306 aa overlap); etc. Contains short-chain alcohol dehydrogenase family signature (PS00061). BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES FAMILY (SDR)." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_218002.1" /db_xref="GI:15610621" /db_xref="GeneID:888427" /translation="MNSRAPRNLAVSSPSAQVTGRMVQNGENLFQFRREGPQVQLSFQ DRTYLVTGGGSGIGKGVAAGLVAAGAAVMIVGRNPDKLAAAVKDIEALKTGAIGYEPA DITDEEQTLRVVDAATAWHGRLHGVVHCAGGSQTIGPITQIDSQAWRRTVDLNVNGTM YVLKHAARELVRGGGGSFVGISSIAASNTHRWFGAYGVTKSAVDHMMKLAADELGPSW VRVNSIRPGLIRTDLVVPVTESPELSADYRVCTPLPRVGEVEDVANLAMFLLSDAASW ITGQVINVDGGHMLRRGPDFSPMLEPVFGADGLRGVVG" misc_feature complement(3904934..3905020) /locus_tag="Rv3485c" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 3905772..3906221 /locus_tag="Rv3486" /db_xref="GeneID:888276" CDS 3905772..3906221 /locus_tag="Rv3486" /function="UNKNOWN" /note="Rv3486, (MTCY13E12.39), len: 149 aa. Conserved hypothetical protein, similar to Q9RC47|YFID|BH3304 HYPOTHETICAL PROTEIN from Bacillus halodurans (129 aa), FASTA scores: opt: 186, E(): 2.1e-05, (40.0% identity in 95 aa overlap); and Q9KKT1|VCA1019 HYPOTHETICAL PROTEIN from Vibrio cholerae (148 aa), FASTA scores: opt: 128, E(): 0.15, (35.25% identity in 139 aa overlap). Some similarity to other proteins e.g. P54720|YFID_BACSU HYPOTHETICAL PROTEIN from Bacillus subtilis (134 aa), FASTA scores: opt: 165, E(): 0.00052, (31.75% identity in 126 aa overlap). Equivalent to AAK47949 from Mycobacterium tuberculosis strain CDC1551 (163 aa) but shorter 14 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218003.1" /db_xref="GI:15610622" /db_xref="GeneID:888276" /translation="MHAEGPPSVICIRLLVGLVFLSEGIQKFMYPDQLGPGRFERIGI PAATFFADLDGVVEIVCGTLVLLGLLTRVAAVPLLIDMVGAIVLTKLRALQPGGFLGV EGFWGMAHAARTDLSMLLGLIFLLWSGPGRWSLDRRLSKRATACGAR" gene complement(3906174..3907007) /gene="lipF" /locus_tag="Rv3487c" /db_xref="GeneID:888430" CDS complement(3906174..3907007) /gene="lipF" /locus_tag="Rv3487c" /EC_number="3.-.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME INVOLVED IN CELLULAR METABOLISM." /note="Rv3487c, (MTCY13E12.41c), len: 277 aa. Probable lipF, esterase/lipase (EC 3.-.-.-) (see citation below), highly similar, but shorter 50 aa, to O53424|LIPU|Rv1076|MTV017.29 PUTATIVE ESTERASE/LIPASE from Mycobacterium tuberculosis (297 aa), FASTA scores: opt: 1229, E(): 3.3e-71, (76.4% identity in 246 aa overlap); and similar to other putative lipases from Mycobacterium tuberculosis e.g. P71759|LIPK|RV2385|MTCY253.36c (306 aa), FASTA scores: opt: 468, E(): 1.2e-22, (36.2% identity in 254 aa overlap). Equivalent, but shorter 79 aa, to Q9ZBM4|MLCB1450.08|ML0314 PUTATIVE HYDROLASE (PUTATIVE ESTERASE) from Mycobacterium leprae (335 aa), FASTA scores: opt: 1225, E(): 6.6e-71, (73.6% identity in 250 aa overlap). Also similar to esterases and lipases of around 300 aa e.g. Q44087|EST ESTERASE PRECURSOR from Acinetobacter lwoffii (303 aa), FASTA scores: opt: 428, E(): 4.3e-20, (31.85% identity in 251 aa overlap); P18773|EST_ACICA ESTERASE (EC 3.1.1.-) from Acinetobacter calcoaceticus (303 aa), FASTA scores: opt: 420, E(): 1.4e-19, (31.5% identity in 251 aa overlap); Q9KIU1 ESTERASE from uncultured bacterium Plasmid pAH116 (308 aa), FASTA scores: opt: 405, E(): 1.3e-18, (35.1% identity in 242 aa overlap); Q9X8J4|SCE9.22 PUTATIVE ESTERASE from Streptomyces coelicolor (266 aa), FASTA scores: opt: 390, E(): 1e-17, (35.85% identity in 237 aa overlap); etc. Equivalent to AAK47950 from Mycobacterium tuberculosis strain CDC1551 (327 aa) but shorter 50 aa. TBparse score is 0.940." /codon_start=1 /transl_table=11 /product="esterase/lipase LipF" /protein_id="NP_218004.1" /db_xref="GI:15610623" /db_xref="GeneID:888430" /translation="MRAPGVRAADGAGRVVLYLHGGAFVMCGPNSHSRIVNALSGFAE SPVLIVDYRLIPKHSLGMALDDCHDAYQWLRARGYRPEQIVLAGDSAGGYLALALAQR LQCDDEKPAAIVAISPLLQLAKGPKQDHPNIGTDAMFPARAFDALAAWVRAAAAKNMV DGRPEDLYEPLDHIESSLPPTLIHVSGSEVLLHDAQLGAGKLAAAGVCAEVRVWPGQA HLFQLATPLVPEATRSLRQIGQFIRDATADSSLSPVHRSRYVAGSPRAASRGAFGQSP I" gene 3907667..3907990 /locus_tag="Rv3488" /db_xref="GeneID:888417" CDS 3907667..3907990 /locus_tag="Rv3488" /function="UNKNOWN" /note="Rv3488, (MTCY13E12.41), len: 107 aa. Hypothetical protein, similar to various bacterial proteins e.g. O28730|AF1542 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (101 aa), FASTA scores: opt: 321, E(): 6.4e-15, (50.55% identity in 87 aa overlap); O50207 SQ1_IV (FRAGMENT) from Rhodococcus erythropolis (59 aa), FASTA scores: opt: 298, E(): 1.4e-13, (71.2% identity in 59 aa overlap); Q9KFB0|BH0575 BH0575 PROTEIN from Bacillus halodurans (102 aa), FASTA scores: opt: 294, E(): 4.1e-13, (43.15% identity in 95 aa overlap); etc. Also similar to Mycobacterium tuberculosis P71704|Rv0047c|MTCY21D4.10c (180 aa) (37.8% identity in 82 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218005.1" /db_xref="GI:15610624" /db_xref="GeneID:888417" /translation="MREFQRAAVRLHILHHAADNEVHGAWLTQELSRHGYRVSPGTLY PTLHRLEADGLLVSEQRVVDGRARRVYRATPAGRAALTEDRRALEELAREVLGGQSHT AGNGT" gene 3908072..3908236 /locus_tag="Rv3489" /db_xref="GeneID:888410" CDS 3908072..3908236 /locus_tag="Rv3489" /function="UNKNOWN" /note="Rv3489, (MTCY13E12.42), len: 54 aa. Hypothetical unknown protein. No similarity with other proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218006.1" /db_xref="GI:15610625" /db_xref="GeneID:888410" /translation="MSTKSDHGEIGDVEPLADSTASQARRVVAAYANDADECRIFLSM LGIGPAKLES" gene 3908236..3909738 /gene="otsA" /locus_tag="Rv3490" /db_xref="GeneID:888404" CDS 3908236..3909738 /gene="otsA" /locus_tag="Rv3490" /EC_number="2.4.1.15" /function="INVOLVED IN OSMOREGULATORY TREHALOSE BIOSYNTHESIS. Mycobacteria can produce trehalose from glucose 6-phosphate and UDP-glucose (the OtsA-OtsB pathway) from glycogen-like alpha(1-->4)-linked glucose polymers (the TreY-TreZ pathway) and from maltose (the TreS pathway) [CATALYTIC ACTIVITY: UDP-GLUCOSE + D-GLUCOSE 6-PHOSPHATE = UDP + ALPHA,ALPHA-TREHALOSE 6-PHOSPHATE]." /note="Rv3490, (MTCY13E12.43), len: 500 aa. Probable otsA, alpha, alpha-trehalose-phosphate synthase (EC 2.4.1.15) (see citations below), equivalent to Q50167|OTSA|ML2254 PROBABLE TREHALOSE-PHOSPHATE SYNTHASE from Mycobacterium leprae (498 aa), FASTA scores: opt: 2706, E(): 1.6e-166, (80.3% identity in 497 aa overlap). Also similar to others e.g. Q92410|TPS1_CANAL from Candida albicans (Yeast) (478 aa), FASTA scores: opt: 895, E(): 4.9e-50, (37.15% identity in 479 aa overlap); Q00764|TPS1_YEASTTPS1|CIF1|BYP1|FDP1|GGS1|GLC6|YBR126c|YB R0 922 from Saccharomyces cerevisiae (Baker's yeast) (495 aa), FASTA scores: opt: 847, E(): 6.2e-47, (36.1% identity in 490 aa overlap); BAB48232|MLL0691 from Rhizobium loti (Mesorhizobium loti) (520 aa), FASTA scores: opt: 884, E(): 2.7e-49, (36.2% identity in 478 aa overlap); etc. Equivalent to AAK47953 from Mycobacterium tuberculosis strain CDC1551 (478 aa) but longer 22 aa." /codon_start=1 /transl_table=11 /product="alpha,alpha-trehalose-phosphate synthase" /protein_id="NP_218007.1" /db_xref="GI:15610626" /db_xref="GeneID:888404" /translation="MAPSGGQEAQICDSETFGDSDFVVVANRLPVDLERLPDGSTTWK RSPGGLVTALEPVLRRRRGAWVGWPGVNDDGAEPDLHVLDGPIIQDELELHPVRLSTT DIAQYYEGFSNATLWPLYHDVIVKPLYHREWWDRYVDVNQRFAEAASRAAAHGATVWV QDYQLQLVPKMLRMLRPDLTIGFFLHIPFPPVELFMQMPWRTEIIQGLLGADLVGFHL PGGAQNFLILSRRLVGTDTSRGTVGVRSRFGAAVLGSRTIRVGAFPISVDSGALDHAA RDRNIRRRAREIRTELGNPRKILLGVDRLDYTKGIDVRLKAFSELLAEGRVKRDDTVV VQLATPSRERVESYQTLRNDIERQVGHINGEYGEVGHPVVHYLHRPAPRDELIAFFVA SDVMLVTPLRDGMNLVAKEYVACRSDLGGALVLSEFTGAAAELRHAYLVNPHDLEGVK DGIEEALNQTEEAGRRRMRSLRRQVLAHDVDRWAQSFLDALAGAHPRGQG" gene 3909890..3910468 /locus_tag="Rv3491" /db_xref="GeneID:888382" CDS 3909890..3910468 /locus_tag="Rv3491" /function="UNKNOWN" /note="Rv3491, (MTCY13E12.44), len: 192 aa. Hypothetical unknown protein. No significant homology with other proteins." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218008.1" /db_xref="GI:15610627" /db_xref="GeneID:888382" /translation="MNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQ AGVSGVTWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFS GRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPK LSKQPFSLQLIGPPPSPVQRYPLYCNNIAMCY" gene complement(3910465..3910947) /locus_tag="Rv3492c" /db_xref="GeneID:888381" CDS complement(3910465..3910947) /locus_tag="Rv3492c" /function="UNKNOWN" /note="Rv3492c, (MTCY13E12.45c), len: 160 aa. Conserved hypothetical Mce-associated protein, showing some similarity to hypothetical Mycobacterium tuberculosis proteins e.g. O53974|Rv1973|MTV051.11 (near Mce operon 3) (160 aa), FASTA scores: opt: 214, E(): 2.6e-07, (25.3% identity in 154 aa overlap); and Q11032|YD62_MYCTU|Rv1362c|MT1407|MTCY02B10.26c (220 aa), FASTA scores: opt: 187, E(): 2e-05, (23.4% identity in 154 aa overlap). Contains lipocalin signature at C-terminus (PS00213)." /codon_start=1 /transl_table=11 /product="Mce associated protein" /protein_id="NP_218009.1" /db_xref="GI:15610628" /db_xref="GeneID:888381" /translation="MRRLISVAYALMVATIVGLSAAGGWFYWDRVQTGGEASARALLP KLAMQEIPQVFGYDYQTVERSLTAVYPLLTPDYRQEFQKSANAQIIPEAKKREVVVQA NVVGVGVMDAKRDCASVMVYLNRTVTDKTRQPLYDGSRLRVDFQRIDGKWLIAYITPI" misc_feature complement(3910483..3910518) /locus_tag="Rv3492c" /note="PS00213 Lipocalin signature" gene complement(3910947..3911675) /locus_tag="Rv3493c" /db_xref="GeneID:888379" CDS complement(3910947..3911675) /locus_tag="Rv3493c" /function="UNKNOWN" /note="Rv3493c, (MTCY13E12.46c), len: 242 aa. Conserved hypothetical Mce-associated ala-, val-rich protein, showing weak similarity to O07422|Z97050|Rv0178|MTCI28.18 HYPOTHETICAL 25.9 KDA PROTEIN (near Mce operon1) from Mycobacterium tuberculosis (244 aa), FASTA scores: opt: 163, E(): 0.046, (24.65% identity in 211 aa overlap)." /codon_start=1 /transl_table=11 /product="Mce associated alanine and valine rich protein" /protein_id="NP_218010.1" /db_xref="GI:15610629" /db_xref="GeneID:888379" /translation="MAADTGVAGGQQSTTRRARRKASRPAGPAEGESSRPAQGAATVR AAARTESKPAKAAKPALRPVKPPPRRPAHRVLVGWLSLAAGLLAIAALAWGVTALVMQ NRDADARQARNQRFVDAATQTVVNMFSYTPDTIDESVNRFVNGTSGPLRGMLNANNNV DNLKGLFRATNATSEAVVNGAALEGIDEISDNASVLVSVRVTVADIDGVNKPSMPYRL RVIVHEDENGRMTGYDLKYPDGGN" gene complement(3911675..3913369) /gene="mce4F" /locus_tag="Rv3494c" /db_xref="GeneID:888376" CDS complement(3911675..3913369) /gene="mce4F" /locus_tag="Rv3494c" /function="UNKNOWN, BUT THOUGHT INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv3494c, (MTV023.01c), len: 564 aa. mce4F; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), similar to Mycobacterium tuberculosis proteins O07418|Rv0174|MTCI28.14|mce1F (515 aa); O07784|Rv0594|MTCY19H5.28c|mce2F (516 aa); and O53972|Rv1971|MTV051.09|mce3F (437 aa). Also similar to others e.g. Q9CD09|MCE1F|ML2594 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (516 aa), FASTA scores: opt: 1040, E(): 3.6e-31, (35.9% identity in 529 aa overlap); Q9F361|SC8A2.02c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (433 aa), FASTA scores: opt: 570, E(): 3.7e-14, (30.8% identity in 458 aa overlap); etc. Has hydrophobic stretch, possibly a signal peptide at the N-terminus. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE4F" /protein_id="NP_218011.1" /db_xref="GI:15610630" /db_xref="GeneID:888376" /translation="MIDRLAKIQLSIFAVITVITLSVMAIFYLRLPATFGIGTYGVSA DFVAGGGLYKNANVTYRGVAVGRVESVGLNPNGVTAHMRLNSGTAIPSNVTATVRSVS AIGEQYIDLVPPENPSSTKLRNGFRIQRQNTRIGQDVADLLRQAETLLGSLGDTRLRE LLHEAFIATNGAGPELARLIESARLLVDEANANYPQVSQLIDQAGPFLQAQIRAGGDI KSLADGLARFTWQLRAADPRLRDTLADAPDAIDEANTAFSGIRPSFPALAASLANLGR VGVIYHKSIEQLLVVFPALFAAIITSAGGVPQDEGAKLDFKIDLHDPPPCMTGFLPPP LVRSPADESVREIPRDMYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPRGYV PVGTNPWRGPPIPYGTEVTDGRNILPPNKFPYIPPGADPDPGVPIVGPPPPGQVAGPG PAPHQPAQPAPPPNDNGPPPPFTSWMPPGYPPEPPQVPYPATIPPPPPPEGTGPPPGP APGPQPQASGPAYTIYDQLSGAFADPAGGTGIFAPGMTGASSAENWVDLMRDPRQL" gene complement(3913380..3914534) /gene="lprN" /locus_tag="Rv3495c" /db_xref="GeneID:888364" CDS complement(3913380..3914534) /gene="lprN" /locus_tag="Rv3495c" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv3495c, (MTV023.02c), len: 384 aa. Possible lprN (alternate gene name: mce4E), lipoprotein which belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07417|LPRK|Rv0173|MTCI28.13|mce1E (390 aa); O07785|LPRL|Rv0593|MTCY19H5.29|mce2E (402 aa); and O53971|LPRM|Rv1970|MTV051.08|mce3E (377 aa). Also similar to others e.g. Q9F360|SC8A2.03c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (413 aa), FASTA scores: opt: 656, E(): 2.2e-32, (37.55% identity in 317 aa overlap); Q9CD10|LPRK|ML2593 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (392 aa), FASTA scores: opt: 616, E(): 5.5e-30, (28.95% identity in 373 aa overlap); etc. Contains possible signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.897.; mce4E" /codon_start=1 /transl_table=11 /product="MCE-family lipoprotein LprN" /protein_id="NP_218012.1" /db_xref="GI:15610631" /db_xref="GeneID:888364" /translation="MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSV TVEMADVATLPQNSPVMVDDVTVGSVAGIVAVQRPDGSFYAAVKLDLDKNVLLPANAV AKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEVFSALGVVVNK GNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNRQVHDIIDALDGLNRVSAI LARDKDNLGRALDTLPDAVRVLNQNRDHIVDAFAALKRLTMVTSHVLAETKVDFGEDL KDLYSIVKALNDDRKDFVTSLQLLLTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRIGE TFFTTAYFDPNMAHMDEILNPPDFLIGELANLSGQAADPFKIPPGTASGQ" misc_feature complement(3914472..3914504) /gene="lprN" /locus_tag="Rv3495c" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(3914531..3915886) /gene="mce4D" /locus_tag="Rv3496c" /db_xref="GeneID:888361" CDS complement(3914531..3915886) /gene="mce4D" /locus_tag="Rv3496c" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv3496c, (MTV023.03c), len: 451 aa. mce4D; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07416|Rv0172|MTCI28.12|mce1D (530 aa); O07786|Rv0592|MTCY19H5.30c|mce2D (508 aa); and O53970|Rv1969|MTV051.07|mce3D (423 aa). Also similar to others e.g. Q9CD11|MCE1D|ML2592 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (531 aa), FASTA scores: opt: 837, E(): 2.6e-34, (34.55% identity in 446 aa overlap); Q9F359|SC8A2.04c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (337 aa), FASTA scores: opt: 606, E(): 4.9e-23, (32.35% identity in 300 aa overlap); etc. Hydrophobic region at N-terminus. TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE4D" /protein_id="NP_218013.1" /db_xref="GI:15610632" /db_xref="GeneID:888361" /translation="MMGRVAMLTGSRGLRYATVIALVAALVGGVYVLSSTGNKRTIVG YFTSAVGLYPGDQVRVLGVPVGEIDMIEPRSSDVKITMSVSKDVKVPVDVQAVIMSPN LVAARFIQLTPVYTGGAVLPDNGRIDLDRTAVPVEWDEVKEGLTRLAADLSPAAGELQ GPLGAAINQAADTLDGNGDSLHNALRELAQVAGRLGDSRGDIFGTVKNLQVLVDALSE SDEQIVQFAGHVASVSQVLADSSANLDQTLGTLNQALSDIRGFLRENNSTLIETVNQL NDFAQTLSDQSENIEQVLHVAGPGITNFYNIYDPAQGTLNGLLSIPNFANPVQFICGG SFDTAAGPSAPDYYRRAEICRERLGPVLRRLTVNYPPIMFHPLNTITAYKGQIIYDTP ATEAKSETPVPELTWVPAGGGAPVGNPADLQSLLVPPAPGPAPAPPAPGAGPGEHGGG G" gene complement(3915883..3916956) /gene="mce4C" /locus_tag="Rv3497c" /db_xref="GeneID:888354" CDS complement(3915883..3916956) /gene="mce4C" /locus_tag="Rv3497c" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv3497c, (MTV023.04c), len: 357 aa. mce4C; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07415|R0171|MTCI28.11|mce1C (515 aa); O07787|Rv0591|MTCY19H5.31|mce2C (481 aa); and O53969|Rv1968|MTV051.06|mce3C (410 aa). Also similar to others e.g. Q9F358|SC8A2.05c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (351 aa), FASTA scores: opt: 658, E(): 1.1e-30, (33.95% identity in 318 aa overlap); Q9CD12|MCE1C|ML2591 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (519 aa), FASTA scores: opt: 555, E(): 1.2e-24, (28.35% identity in 328 aa overlap); etc. Hydrophobic region at N-terminus. TBparse score is 0.889." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE4C" /protein_id="NP_218014.1" /db_xref="GI:15610633" /db_xref="GeneID:888354" /translation="MLNRKPSSKHERDPLRTGIFGLVLVICVVLIAFGYSGLPFWPQG KTYDAYFTDAGGITPGNSVYVSGLKVGAVSAVSLAGNSAKVTFSVDRSIVVGDQSLAA IRTDTILGERSIAVSPAGSGKSTTIPLSRTTTPYTLNGVLQDLGRNANDLNRPQFEQA LNVFTQALHDATPQVRGAVDGLTSLSRALNRRDEALQGLLAHAKSVTSVLSERAEQVN KLVEDGNQLFAALDARRAALSALISGIDDVAAQISGFVADNRKEFGPALSKLNLVLAN LNERRDYITEALKRLPTYATTLGEVVGSGPGFNVNVYSVLPGPLVATVFDLVFQPGKL PDSLADYLRGFIQERWIIRPKSP" gene complement(3916946..3917998) /gene="mce4B" /locus_tag="Rv3498c" /db_xref="GeneID:888349" CDS complement(3916946..3917998) /gene="mce4B" /locus_tag="Rv3498c" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /note="Rv3498c, (MTV023.05c), len: 350 aa. mce4B; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07414|Rv0170|MTCI28.10|mce1B (346 aa); O07788|Rv0590|MTCY19H5.32c|mce2B (275 aa); and O53968|Rv1967|MTV051.05|mce3B (342 aa). Also similar to others e.g. Q9CD13|MCE1B|ML2590 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (346 aa), FASTA scores: opt: 803, E(): 6.1e-41, (41.05% identity in 346 aa overlap); Q9F357|SC8A2.06c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (354 aa), FASTA scores: opt: 624, E(): 3.4e-30, (32.55% identity in 338 aa overlap); etc. Hydrophobic region at N-terminus. TBparse score is 0.878." /codon_start=1 /transl_table=11 /product="MCE-family protein MCE4B" /protein_id="NP_218015.1" /db_xref="GI:15610634" /db_xref="GeneID:888349" /translation="MAGSGVPSHRSMVIKVSVFAVVMLLVAAGLVVVFGDFRFGPTTV YHATFTDASRLKAGQKVRIAGVPVGSVKAVKLNPDHSIDVAFAIDRSYTLYSSTRAVI RYENLVGDRFLEITSGPGELRKLPPGGTINVAHTQPALDLDALLGGLRPVLKGFDADK INTITSAVIELLQGQGGPLANVLADTGAFSAALGARDQLIGEVITNLNAVLATVDAKS AQFSASVDQLQQLVSGLAKNRDPIAGAISPLASTTTDLTELLRNSRRPLQGILENARP LATELDNRKAEVNNDIEQLGEDYLRLSALGSYGAFFNIYFCSVTIKINGPAGSDILLP IGGQPDPSKGRCAFAK" gene complement(3917998..3919200) /gene="mce4A" /locus_tag="Rv3499c" /db_xref="GeneID:888344" CDS complement(3917998..3919200) /gene="mce4A" /locus_tag="Rv3499c" /function="UNKNOWN, BUT THOUGHT TO BE INVOLVED IN HOST CELL INVASION." /experiment="experimental evidence, no additional details recorded" /note="Rv3499c, (MTV023.06c), len: 400 aa. mce4A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins P72013|MCE1|Rv0169|MTCI28.09|mce1A (454 aa); O07789|MCE2|Rv0589|MTCY19H5.33c|mce2A (404 aa); and O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa). Also similar to others e.g. Q9F356|SC8A2.07c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (418 aa), FASTA scores: opt: 619, E(): 7.8e-30, (32.4% identity in 352 aa overlap); Q9S4U5|MCE1 MYCOBACTERIAL CELL ENTRY PROTEIN from Mycobacterium bovis BCG (454 aa), FASTA scores: opt: 529, E(): 2.1e-24, (30.35% identity in 448 aa overlap); Q9CD14|MCE1A|ML2589 from Mycobacterium leprae (441 aa), FASTA scores: opt: 515, E(): 1.4e-23, (28.35% identity in 430 aa overlap); etc. Contains a possible N-terminal signal sequence. TBparse score is 0.914. Note that previously known as mce4.; mce4" /codon_start=1 /transl_table=11 /product="MCE-family protein MCE4A" /protein_id="YP_177977.1" /db_xref="GI:57117113" /db_xref="GeneID:888344" /translation="MSGGGSRRTSVRVAAALLAGLMVGSAVLTYLSYTAAFTSTDTVT VSSPRAGLVMEKGAKVKYRGIQVGKVTDISYSGNQARLKLAIDSGEMGFIPSNATVRI AGNTIFGAKSVEFIPPKTPSPKPLSPNAHVAASQVQLEVNTLFQSLIDLLHKIDPLET NATLSALSEGLRGHGDDLGALLSGLNTLTRQANPKLPALQEDFRKAAVVANVYADAAG DLNTVFDNLPTINKTIVDQKDNLNDTLLATIGLSNNAYETLAPAEQNFIDAINRLRAP LKVTSDYSPVFGCLFKGIARGVKEFAPLIGVRKAGLFTSSSFVLGAPSYTYPESLPIV NASGGPNCRGLPDIPTKQTGGSFYRAPFLVTDNALIPYQPFTELQVDAPSTLQFLFNG AFAERDDF" gene complement(3919220..3920062) /gene="yrbE4B" /locus_tag="Rv3500c" /db_xref="GeneID:888336" CDS complement(3919220..3920062) /gene="yrbE4B" /locus_tag="Rv3500c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3500c, (MTV023.07c), len: 280 aa. yrbE4B, hypothetical unknown integral membrane protein, part of mce4 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07413|Rv0168|MTCI28.08|yrbE1B (289 aa); O07790|Rv0588|MTCY19H5.34|yrbE2B (295 aa); and O53966|Rv1965|MTV051.03|yrbE3B (271 aa). Also highly similar to conserved hypothetical integral membrane proteins of the P45030|YRBE_HAEIN (261 aa) type, e.g. Q9CD15|YRBE1B|ML2588 from Mycobacterium leprae (289 aa), FASTA scores: opt: 973, E(): 1.5e-50, (50.2% identity in 269 aa overlap); P45030|YRBE_HAEIN|HI1086 from Haemophilus influenzae (261 aa), FASTA scores: opt: 270, E(): 6e-11, (25.4% identity in 264 aa overlap); etc. TBparse score is 0.887." /codon_start=1 /transl_table=11 /product="integral membrane protein YrbE4b" /protein_id="NP_218017.1" /db_xref="GI:15610636" /db_xref="GeneID:888336" /translation="MSYDVTIRFRRFFSRLQRPVDNFGEQALFYGETMRYVPNAITRY RKETVRLVAEMTLGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGDIGIEALTGF LSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAVHSVSYLVSTR LIAGLVAIIPLYSLSVLAAFFAARFTTVFVNGQSAGLYDHYFNTFLIPSDLLWSFMQA IAMSIAVMLVHTYYGYNASGGSVGVGVAVGQAVRTSLIVVVVITLFISLAVYGASGNF NLSG" gene complement(3920097..3920861) /gene="yrbE4A" /locus_tag="Rv3501c" /db_xref="GeneID:888320" CDS complement(3920097..3920861) /gene="yrbE4A" /locus_tag="Rv3501c" /function="UNKNOWN" /note="Rv3501c, (MTV023.08c), len: 254 aa. yrbE4A, hypothetical unknown integral membrane protein, part of mce4 operon and member of YrbE family (see citations below), highly similar to Mycobacterium tuberculosis proteins O07412|Rv0167|MTCI28.07|yrbE1A (265 aa); O07791|Rv0587|MTCY19H5.35|yrbE2A (265 aa); and O53965|Rv1964|MTV051.02|yrbE3A (265 aa). Also highly similar to conserved hypothetical integral membrane proteins of the P45030|YRBE_HAEIN (261 aa) type, e.g. Q9CD16|YRBE1A|ML2587 from Mycobacterium leprae (267 aa), FASTA scores: opt: 1059, E(): 1e-57, (64.75% identity in 247 aa overlap); P45030|YRBE_HAEIN|HI1086 from Haemophilus influenzae (261 aa), FASTA scores: opt: 313, E(): 3e-14, (25.7% identity in 241 aa overlap); etc. TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="integral membrane protein YrbE4a" /protein_id="NP_218018.1" /db_xref="GI:15610637" /db_xref="GeneID:888320" /translation="MIQQLAVPARAVGGFFEMSMDTARAAFRRPFQFREFLDQTWMVA RVSLVPTLLVSIPFTVLVAFTLNILLREIGAADLSGAGTAFGTITQLGPVVTVLVVAG AGATAICADLGARTIREEIDAMRVLGIDPIQRLVVPRVLASTLVALLLNGLVCAIGLS GGYAFSVFLQGVNPGAFINGLTVLTGLRELILAEIKALLFGVMAGLVGCYRGLTVKGG PKGVGNAVNETVVYAFICLFVINVVMTAIGVRISAQ" gene complement(3921087..3922040) /gene="fabG" /locus_tag="Rv3502c" /db_xref="GeneID:887697" CDS complement(3921087..3922040) /gene="fabG" /locus_tag="Rv3502c" /EC_number="1.1.1.100" /function="UNKNOWN; SUPPOSED INVOLVEMENT IN CELLULAR METABOLISM." /note="Catalyzes the first of the two reduction steps in the elongation cycle of fatty acid synthesis" /codon_start=1 /transl_table=11 /product="3-ketoacyl-(acyl-carrier-protein) reductase" /protein_id="NP_218019.1" /db_xref="GI:15610638" /db_xref="GeneID:887697" /translation="MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLG ATVVVNDVASALDASDVVDEIGAAAADAGAKAVAVAGDISQRATADELLASAVGLGGL DIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDKAKDAEGGSVF GRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALGRYGVCANVICPRARTAMT ADVFGAAPDVEAGQIDPLSPQHVVSLVQFLASPAAAEVNGQVFIVYGPQVTLVSPPHM ERRFSADGTSWDPTELTATLRDYFAGRDPEQSFSATDLMRQ" misc_feature complement(3921453..3921539) /gene="fabG" /locus_tag="Rv3502c" /note="PS00061 Short-chain dehydrogenases/reductases family signature." gene complement(3922065..3922256) /gene="fdxD" /locus_tag="Rv3503c" /db_xref="GeneID:888311" CDS complement(3922065..3922256) /gene="fdxD" /locus_tag="Rv3503c" /function="FERREDOXINS ARE IRON-SULFUR PROTEINS THAT TRANSFER ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS." /note="Rv3503c, (MTV023.10c), len: 63 aa. Probable fdxD, ferredoxin, equivalent to Q9R6Z5|B229_C3_226 HYPOTHETICAL 9.3 KDA PROTEIN from Mycobacterium leprae (83 aa) FASTA scores: opt: 276, E(): 1.8e-13, (75.9% identity in 54 aa overlap). Also similar to several e.g. Q9R6Z5|PHDC from Nocardioides sp. strain KP7 (69 aa), FASTA scores: opt: 177, E(): 2.1e-06, (43.35% identity in 60 aa overlap); Q9X4X8|DITA3 DIOXYGENASE DITA FERREDOXIN COMPONENT from Pseudomonas abietaniphila (78 aa), FASTA scores: opt: 166, E(): 1.4e-05, (36.2% identity in 58 aa overlap); P00203|FER_MOOTH from Moorella thermoacetica (Clostridium thermoaceticum) (63 aa), FASTA scores: opt: 157, E(): 5.4e-05, (36.65% identity in 60 aa overlap); P18325|FER2_STRGO|SUBB from Streptomyces griseolus (64 aa) FASTA scores: opt: 157, E(): 5.5e-05, (39.35% identity in 61 aa overlap); etc. BELONGS TO THE BACTERIAL TYPE FERREDOXIN FAMILY. TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="ferredoxin FdxD" /protein_id="NP_218020.1" /db_xref="GI:15610639" /db_xref="GeneID:888311" /translation="MRVIVDRDRCEGNAVCLGIAPDIFDLDDEDYAVVKTDPIPVDQE DLAEQAIAECPRAALSRGE" gene 3922471..3923673 /gene="fadE26" /locus_tag="Rv3504" /db_xref="GeneID:887722" CDS 3922471..3923673 /gene="fadE26" /locus_tag="Rv3504" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3504, (MTV023.11), len: 400 aa. Probable fadE26, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to other ACYL-CoA DEHYDROGENASES from Mycobacterium tuberculosis e.g. P71858|FADE29|Rv3543c|MTCY03C7.13 (387 aa) FASTA scores: opt: 1031, E(): 7.5e-59, (46.25% identity in 402 aa overlap); and P95280|FADE17|Rv1934c|MTCY09F9.30 (409 aa), FASTA scores: opt: 617, E(): 3.1e-32, (32.6% identity in 423 aa overlap); etc. Also similar to others e.g. Q9A6G3|CC2131 from Caulobacter crescentus (403 aa) FASTA scores: opt: 710, E(): 3.2e-38, (33.4% identity in 413 aa overlap); Q9I4V2|PA1022 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 522, E(): 3.7e-26, (34.1% identity in 358 aa overlap); Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa), FASTA scores: opt: 509, E(): 2.6e-25, (34.45% identity in 363 aa overlap); etc. COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY. TBparse score is 0.885." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE26" /protein_id="NP_218021.1" /db_xref="GI:15610640" /db_xref="GeneID:887722" /translation="MRISYTPQQEELRRELRSYFATLMTPERREALSSVQGEYGVGNV YRETIAQMGRDGWLALGWPKEYGGQGRSAMDQLIFTDEAAIAGAPVPFLTINSVAPTI MAYGTDEQKRFFLPRIAAGDLHFSIGYSEPGAGTDLANLRTTAVRDGDDYVVNGQKMW TSLIQYADYVWLAVRTNPESSGAKKHRGISVLIVPTTAEGFSWTPVHTMAGPDTSATY YSDVRVPVANRVGEENAGWKLVTNQLNHERVALVSPAPIFGCLREVREWAQNTKDAGG TRLIDSEWVQLNLARVHAKAEVLKLINWELASSQSGPKDAGPSPADASAAKVFGTELA TEAYRLLMEVLGTAATLRQNSPGALLRGRVERMHRACLILTFGGGTNEVQRDIIGMVA LGLPRANR" gene 3923698..3924819 /gene="fadE27" /locus_tag="Rv3505" /db_xref="GeneID:888248" CDS 3923698..3924819 /gene="fadE27" /locus_tag="Rv3505" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3505, (MTV023.12), len: 373 aa. Probable fadE27, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to other ACYL-CoA DEHYDROGENASES from Mycobacterium tuberculosis e.g. P71857|FADE28|Rv3544c|MTCY03C7.12 (339 aa) FASTA scores: opt: 497, E(): 1.8e-22, (30.3% identity in 343 aa overlap); and P95281|FADE18|Rv1933c|MTCY09F9.31 (363 aa) FASTA scores: opt: 421, E(): 6.4e-18, (32.35% identity in 334 aa overlap). Also similar to other e.g. Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 425, E(): 3.5e-18, (30.75% identity in 351 aa overlap); Q9RJX3|SCF37.28c from Streptomyces coelicolor (362 aa) FASTA scores: opt: 317, E(): 1e-11, (32.8% identity in 372 aa overlap); Q9L8Q3|PDTORFO from Pseudomonas stutzeri (Pseudomonas perfectomarina) (513 aa), FASTA scores: opt: 301, E(): 1.2e-10, (25.9% identity in 394 aa overlap); etc. COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY. TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE27" /protein_id="NP_218022.1" /db_xref="GI:15610641" /db_xref="GeneID:888248" /translation="MDFTTTEAAQDLGGLVDTIVDAVCTPEHQRELDKLEQRFDRELW RKLIDAGILSSAAPESLGGDGFGVLEQVAVLVALGHQLAAVPYLESVVLAAGALARFG SPELQQGWGVSAVSGDRILTVALDGEMGEGPVQAAGTGHGYRLTGTRTQVGYGPVADA FLVPAETDSGAAVFLVAAGDPGVAVTALATTGLGSVGHLELNGAKVDAARRVGGTDVA VWLGTLSTLSRTAFQLGVLERGLQMTAEYARTREQFDRPIGSFQAVGQRLADGYIDVK GLRLTLTQAAWRVAEDSLASRECPQPADIDVATAGFWAAEAGHRVAHTIVHVHGGVGV DTDHPVHRYFLAAKQTEFALGGATGQLRRIGRELAETPA" gene 3924890..3926398 /gene="fadD17" /locus_tag="Rv3506" /db_xref="GeneID:888251" CDS 3924890..3926398 /gene="fadD17" /locus_tag="Rv3506" /EC_number="2.3.1.86" /function="UNKNOWN, BUT SUPPOSED INVOLVEMENT IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_218023.1" /db_xref="GI:15610642" /db_xref="GeneID:888251" /translation="MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAA ALRERLDPARPPHVGVLLQNTPFFSATLVAGALSGIVPVGLNPVRRGAALAGDIAKAD CQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLADLFMLIFTSGT SGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLFHSNAVLVGWAVAAACQGS MALRRKFSASQFLADVRRYGATYANYVGKPLSYVLATPELPDDADNPLRAVYGNEGVP GDIDRFGRRFGCVVMDGFGSTEGGVAITRTLDTPAGALGPLPGGIQIVDPDTGEPCPT GVVGELVNTAGPGGFEGYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWM RVDGENLGTAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRA FLTEQPDLGHKQWPSYVRVSAGLPRTMTFKVIKRQLSAEGVACADPVWPIRR" misc_feature 3925349..3925384 /gene="fadD17" /locus_tag="Rv3506" /note="PS00455 Putative AMP-binding domain signature." gene 3926569..3930714 /gene="PE_PGRS53" /locus_tag="Rv3507" /db_xref="GeneID:888256" CDS 3926569..3930714 /gene="PE_PGRS53" /locus_tag="Rv3507" /function="UNKNOWN" /note="Rv3507, (MTV023.14), len: 1381 aa. Member of the Mycobacterium tuberculosis PE protein family, PGRS subfamily of gly-rich proteins (see citation below), similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O06810|Rv1450c|MTCY493.04 (1329 aa), FASTA scores: opt: 2173, E(): 1.4e-135, (51.15% identity in 1412 aa overlap). Equivalent to AAK47970 from Mycobacterium tuberculosis strain CDC1551 (1384 aa) but with some minor differences between the proteins. Contains two PS00583 pfkB family of carbohydrate kinases signatures 1." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177978.1" /db_xref="GI:57117114" /db_xref="GeneID:888256" /translation="MSFVLVSPETVAAVATDLKRIGASLAHENASAAASTTAVVSAAA DEVSTAVAALFSQHAQGYQAAAAQVAAFHSRFVQALTAGAGAYAFAEAANASPLQSAM GAVSASAQTLLSRPLIGNGANATTPGGNGGDGGWLFGSGGNGAPGAAGQSGGNGGSAG LWGNGGAGGAGGSGGAAGGNGGNGGWLFGAGGTGGIGGTGAPGAMGGTGGNGGNGALL IGGGGLGGAGGMGGTGGGTGGTGGNGGNGALLIGAGGVGGAGGIGGQGTGAGGAAGAG GTGGNGGAGGLFMNGGDGGAGGQGGDGAAGDAAASAGGTGGKGGQGGDGGTGGAGGAG PVLFGHGGAGGMGGQGGTGGMGGAGGDGTTVIAAGTGGEGGTGGAAGAGGAAGARGAL TSGGLAGGVGAGGTGGTGGTGGNGADAAAVVGFGANGDPGFAGGKGGNGGIGGAAVTG GVAGDGGTGGKGGTGGAGGAGNDAGSTGNPGGKGGDGGIGGAGGAGGAAGTGNGGHAG NTGDGGDGGTGGNGGNGTGGVNGADNTLNPDTPGGAGEPGGAGGAGGAGGAAGGPGGT GGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTNAPVGGEGGAGGDGGAGGAGGAANG GTAGSQGTGGVGGDGGAGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGGVGGAAGA NGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAGGNAG GAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDAGGSGGDGGKGGQGGSGGTGGSG APIGGGAGGTGGSGGHAGKGGAGGIGAQGTTITVPGNGGNAGDGGNGGNAGAGGNGGS GDFGGNTTSGASGSGGNGGNAGTAGSGGAGGTGGTGLSGGNGGNGGNGGNGGDGGNGA HGTVGAQFVPATSLPTPNGGAGGNGGTGSNGGAPGPAGAPGPTTGGNAGSQGIGGDGG NGGDGGKGGDGADAVNVVFMPTEPQAATGTAGSAGDPTGGNGGPGTPGSPMVAPPPPT PITQVQQGGDGGAGGTGSTNANDGTATGGKGGEGGVGSILGGPGGNGGTGGNASATGT NGVANAGNGGKGGDGGQFGAGGNGGAGGSVTDGSAGSTAGNGGNGGNATNGTIAGQPA GGNGSAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGAN GGDGGAGGAGGAGGRGGKGIDGGFGGDGGNGGSNNGTGAGGNGGNGGTGGVGSVGAAG GDGGNGGTGGFAGFGGTAGNGGSGGTGGAGGDGGTGGDGGNGVIAGGGGTGGNGGASG AGGAGGTGGFAGNGNAGGNGGTGGASEDGDNGNAGSGATGGTGGNGGTGGDGGAAGLG GVA" misc_feature 3927967..3928041 /gene="PE_PGRS53" /locus_tag="Rv3507" /note="PS00583 pfkB family of carbohydrate kinases signature 1." misc_feature 3929965..3930039 /gene="PE_PGRS53" /locus_tag="Rv3507" /note="PS00583 pfkB family of carbohydrate kinases signature 1." gene 3931005..3936710 /gene="PE_PGRS54" /locus_tag="Rv3508" /db_xref="GeneID:888270" CDS 3931005..3936710 /gene="PE_PGRS54" /locus_tag="Rv3508" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3508, (MTV023.15), len: 1901 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see Brennan & Delogu 2002), similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. downstream O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 6598, E(): 0, (71.05% identity in 1533 aa overlap). Equivalent to AAK47971 from Mycobacterium tuberculosis strain CDC1551 (1384 aa) but shorter 13 aa and with some minor differences between the proteins. Contains five PS00583 pfkB family of carbohydrate kinases signatures 1." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177979.1" /db_xref="GI:57117115" /db_xref="GeneID:888270" /translation="MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGA DEVSARIAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVLGV INAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLW GNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAG GVGGAGGGTGGAGGRAELLFGAGGAGGAGTDGGPGATGGTGGHGGVGGDGGWLAPGGA GGAGGQGGAGGAGSDGGALGGTGGTGGTGGAGGAGGRGALLLGAGGQGGLGGAGGQGG TGGAGGDGVLGGVGGTGGKGGVGGVAGLGGAGGAAGQLFSAGGAAGAVGVGGTGGQGG AGGAGAAGADAPASTGLTGGTGFAGGAGGVGGQGGNAIAGGINGSGGAGGTGGQGGAG GMGGSGADNASGIGADGGAGGTGGNAGAGGAGGAAGTGGTGGVVGAAGKAGIGGTGGQ GGAGGAGSAGTDATATGATGGTGFSGGAGGAGGAGGNTGVGGTNGSGGQGGTGGAGGA GGAGGVGADNPTGIGGTGGTGGKGGAGGAGGQGGSSGAGGTNGSGGAGGTGGQGGAGG AGGAGADNPTGIGGAGGTGGTGGAAGAGGAGGAIGTGGTGGAVGSVGNAGIGGTGGTG GVGGAGGAGAAAAAGSSATGGAGFAGGAGGEGGAGGNSGVGGTNGSGGAGGAGGKGGT GGAGGSGADNPTGAGFAGGAGGTGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGG RGGDGGDGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGNGGDGGDGATGA AGLGDNGGVGGDGGAGGAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAGGAGGAGDNNF NGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGG DGGDAGSGGGGGFGGAAGKAGGGGNGGRGGDGGDGASGLGLGLSGFDGGQGGQGGAGG SAGAGGINGAGGAGGNGGDGGDGATGAAGLGDNGGVGGDGGAGGAAGNGGNAGVGLTA KAGDGGAAGNGGNGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAG GNGGTGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGGVGGD GGEGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGTGGAGGDGAPATLIGG PDGGDGGQGGIGGDGGNAGFGAGVPGDGGDGGNAGFGAGVPGDGGIGGTGGAGGAGGA GADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGDGDGF IGGSGGTGGTGGDAGVGGLANTGGTAGNAGIGGAGGRGGDGGAGDSGALSQDGNGFAG GQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGI GGAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDGGQGGAGGHGGQGGKGGL NSTGLASAASGDGGNGGAGGAGGNGGDGDGFIGGSGGTGGTGGDAGVGGLANTGGTAG NAGIGGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTG GAGGDGQNGTTGVASEGGAGGQGGDGGQGGIGGAGGNAGFGAGVPGDGGIGGTGGAGG AGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGA GGLGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGGSA TGVGNGGNGGDGGNGGDGGNGAPGGFGGGAGAGGLGGSGAGGGTDGDDGNGGSPGTDG S" misc_feature 3932295..3932366 /gene="PE_PGRS54" /locus_tag="Rv3508" /note="PS00583 pfkB family of carbohydrate kinases signature 1." misc_feature 3932814..3932885 /gene="PE_PGRS54" /locus_tag="Rv3508" /note="PS00583 pfkB family of carbohydrate kinases signature 1." misc_feature 3935400..3935474 /gene="PE_PGRS54" /locus_tag="Rv3508" /note="PS00583 pfkB family of carbohydrate kinases signature 1." misc_feature 3936015..3936089 /gene="PE_PGRS54" /locus_tag="Rv3508" /note="PS00583 pfkB family of carbohydrate kinases signature 1." misc_feature 3936573..3936647 /gene="PE_PGRS54" /locus_tag="Rv3508" /note="PS00583 pfkB family of carbohydrate kinases signature 1." gene complement(3936877..3938424) /gene="ilvX" /locus_tag="Rv3509c" /db_xref="GeneID:888267" CDS complement(3936877..3938424) /gene="ilvX" /locus_tag="Rv3509c" /function="COULD BE INVOLVED IN VALINE AND ISOLEUCINE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: 2-ACETOLACTATE + CO(2) = 2 PYRUVATE]." /experiment="experimental evidence, no additional details recorded" /note="thiamine-pyrophosphate requiring enzyme" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218026.1" /db_xref="GI:15610645" /db_xref="GeneID:888267" /translation="MNGAQALINTLVDGGVDVCFANPGTSEMHFVAALDAVPRMRGML TLFEGVATGAADGYARIAGRPAAVLLHLGPGLGNGLANLHNARRARVPMVVVVGDHAT YHKKYDAPLESDIDAVAGTVSGWVRRTEAAADVGADAEAAIAASRSGSQIATLILPAD VCWSDGAHAAAGVPAQAAAAPVDVGPVAGVLRSGEPAMMLIGGDATRGPGLTAAARIV QATGARWLCETFPTCLERGAGIPAVERLAYFAEGAAAQLDGVKHLVLAGARSPVSFFA YPGMPSDLVPAGCEVHVLAEPGGAADALAALADEVAPGTVAPVAGASRPQLPTGDLTS VSAADVVGALLPERAIVVDESNTCGVLLPQATAGAPAHDWLTLTGGAIGYGIPAAVGA AVAAPDRPVLCLESDGSAMYTISGLWSQARENLDVTTVIYNNGAYDILRIELQRVGAG SDPGPKALDLLDISRPTMDFVKIAEGMGVPARRVTTCEEFADALRAAFAEPGPHLIDV VVPSLVG" gene complement(3938421..3939257) /locus_tag="Rv3510c" /db_xref="GeneID:888298" CDS complement(3938421..3939257) /locus_tag="Rv3510c" /function="UNKNOWN" /note="Rv3510c, (MTV023.17), len: 278 aa. Conserved hypothetical protein, similar to Q50662|Rv2303c|MTCY339.06 HYPOTHETICAL 34.6 KDA PROTEIN from Mycobacterium tuberculosis (307 aa), FASTA scores: opt: 416, E(): 1.2e-19, (35.7% identity in 255 aa overlap). Middle of the putative protein highly similar to N-terminal end of Q49860|B229_C2_182 HYPOTHETICAL 11.0 KDA PROTEIN from Mycobacterium leprae (95 aa), FASTA scores: opt: 304, E(): 7.9e-13, (83.65% identity in 55 aa overlap). Also some similarity with other bacterial proteins e.g. P95886 ORF C02006 from Sulfolobus solfataricus (269 aa), FASTA scores: opt: 293, E(): 9.6e-12, (31.3% identity in 198 aa overlap); Q9XDF3|NONC NONC PROTEIN from Streptomyces griseus subsp. griseus (317 aa), FASTA scores: opt: 270, E(): 3.4e-10, (29.95% identity in 227 aa overlap); Q54229|NONR MACROTETROLIDE ANTIBIOTIC-RESISTANCE PROTEIN from Streptomyces griseus (347 aa), FASTA scores: opt: 270, E(): 3.6e-10, (29.95% identity in 227 aa overlap); etc. TBparse score is 0.907." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218027.1" /db_xref="GI:15610646" /db_xref="GeneID:888298" /translation="MTIDVWMQHPTQRFLHGDMFASLRRWTGGSIPETDIPIEATVSS MDAGGVTLGLLSAWRGPNGQDLISNDAVAEWVRLYPNRFAGLAAVDLDRPMAAVRELR RRVGEGFVGLRVVPWLWGAPPTDRRYYPLFAECVQSAVPFCTQVGHTGPLRPSETGRP IPYIDQVALDFPELVIVCGHVGYPWTEEMVAVARKHENVYIDTSAYTIKRLPGKLVRF MKTDTGQRKVLFGTNYPMIAHTHALTGLDELGLSDEARRDFLHGNAVRVFKLDPRGKV QT" gene 3939617..3941761 /gene="PE_PGRS55" /locus_tag="Rv3511" /db_xref="GeneID:888273" CDS 3939617..3941761 /gene="PE_PGRS55" /locus_tag="Rv3511" /function="UNKNOWN" /note="Rv3511, (MTV023.18), len: 714 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK47974|MT3615.3 (1217 aa) FASTA scores: opt: 2563, E(): 1.5e-94, (59.65% identity in 773 aa overlap); and upstream O53553|Rv3508|MTV023.15 (1901 aa), FASTA scores: opt: 2455, E(): 3.9e-90, (60.4% identity in 737 aa overlap); etc. Contains PS00583 pfkB family of carbohydrate kinases signature 1." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177980.1" /db_xref="GI:57117116" /db_xref="GeneID:888273" /translation="MSFVLISPEVVSAAAGDLANVGSTISAANKAAAAATTQVLAAGA DEVSARIAALFGMYGLEYQAISAQVAAYHQQFVQTLRTGAASYMLAEATNVEQNLLNL INAPTQTLLGRPLIGDGANATTPGGAGGDGGLLFGSGGNGAPGAPGQAGGAGGSAGLL GNGGSGGAGGTGAPGGNGGNAGWLYGRGGVGGAGGIGGGTGGAGGHAWLFGHGGTGGI GGGPGGNGGWLLGNGGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGG NAAWLLGGGGTGGAGGIGGGNGGHGGNGGWLLGNGGNGGLGGDGDGGTGGGHGGNGGN PGWLLGTAGGGGNGGAGSTGTAGGGSGGTGGDGGTGGRGGLLMGAGAGGHGGTGGAGG AGVNGGGAGGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMAGLG GTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTGGNGGAGSAGADAP AGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGNGGTGGKGGTGGAGMNSLDPLL AAQDGGQGGTGGTGGNAGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGT TGGAGGAGGAGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLAAQD GGQGGTGGTGGNAGAGGTGFTPRRRRQRRQRR" misc_feature 3940430..3940504 /gene="PE_PGRS55" /locus_tag="Rv3511" /note="PS00583 pfkB family of carbohydrate kinases signature 1." gene 3941724..3944963 /gene="PE_PGRS56" /locus_tag="Rv3512" /db_xref="GeneID:888306" CDS <3941724..3944963 /gene="PE_PGRS56" /locus_tag="Rv3512" /function="UNKNOWN" /note="Rv3512, (MTV023.19), len: 1059 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK47974|MT3615.3 (1217 aa) FASTA scores: opt: 3688, E(): 4.5e-130, (53.95% identity in 1136 aa overlap); and downstream O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 3611, E(): 3.6e-127, (53.15% identity in 1195 aa overlap); etc. Frameshifted PGRS protein, could be continuation of upstream MTV023.18, but no error could be found." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177981.1" /db_xref="GI:57117117" /db_xref="GeneID:888306" /translation="PQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGGAGGAGGA GGTGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGDGALAGSSGGAGGKGGNGGDA GKAGTGSAPGTAGTGGDGGKGGNGGIGAAGTTGPVGTGASGGTGGSGGAGGTGGDGGA ANGGTAGAGGAGGNGGKGGDGGAGVTSSTAGNSGGAGGSGGKGGDAGAGGAGATPGAN GIAGNGGDGGDGAAGAVGISGATGAGDGGHGGTGAAGGNGGTGGAGGSGIDGVGGGTG GTGGNGGNGAIGGAGGDAGGSGNSGGNGGIGGKGGNAGAGGAAGSNGGTVGANGTGGD GGNGGAAGAATAGSNGGAGTGSAGGNGGTGGRGGSGGAGGDGIGGVGGGKGGNGADGE VGGAGGAGGSGPNTSPGGNGGQGGQGGSGGAGGAAGAGGAGGGANGTAGNGGQGGAGG TGGAGAASSATNGGSGGAGGTGGDGGSGGAGGTGGAGGTGGAAGDGGQGGQGGAGGGA GGQGGAGGAGGTGGNGGNITGGTAGTAGAAGNGGAAGKGGAGGQGGTGGGTGGQGGAG GDGGAGGTGGDRTVGGGTVPAGSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNG GNGGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGGGTGGNGGAGGDAGDAGNGGNGNG TGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGN GGAAGTGGTGGDGGLTGTGGTGGSGGTGGDGGNGGNGADNTANMTAQAGGDGGNGGDG GFGGGAGAGGGGLTAGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTGGNGGTG GTGGAGIGSLGGGTGGDGGNGGNGGTGGEGGEVGGAGGTGGAAGNGGDGGTGGTGGGD GGAGGTGGTGGTGGLGDPRVGGSGGDGGTGGSGGAAGNGGNGGNAGAGGNGNGGTGGA GGIGGTGGNGGDAEPGVPPGAGGAGGAGTTGGKGGTGGNGSGTGSGGTGGDGGTGGGG GNGGTGWNGGKGDTGSGGGAGDGGKAPAGGTGGAGGDGGAGGKGGSGGV" gene complement(3945092..3945748) /gene="fadD18" /locus_tag="Rv3513c" /db_xref="GeneID:888277" CDS complement(3945092..3945748) /gene="fadD18" /locus_tag="Rv3513c" /EC_number="6.2.1.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3513c, (MTV023.20c), len: 218 aa (Start uncertain). Probable fadD18, fatty-acid-CoA synthetase (C-terminal fragment) (EC 6.2.1.-), almost identical to C-terminal end of downstream O53560|FADD19|Rv3515c|MTV023.22c, probably result of partial gene duplication. Also similar at the C-terminus to other fatty-acid-CoA synthetases e.g. Q9EXL2|FADD from Streptomyces griseus (540 aa), FASTA scores: opt: 586, E(): 1.2e-28, (52.45% identity in 185 aa overlap); AAB87139|MIG MEDIUM CHAIN ACYL-CoA SYNTHETASE PRECURSOR from Mycobacterium avium (550 aa), FASTA scores: opt: 506, E(): 9.5e-24, (50.0% identity in 150 aa overlap); Q9A7C3|CC1801 PUTATIVE 4-COUMARATE--CoA LIGASE from Caulobacter crescentus (561 aa), FASTA scores: opt: 430, E(): 4.4e-19, (45.75% identity in 153 aa overlap); Q9KDT0|BH1131 ACID-CoA LIGASE from Bacillus halodurans (546 aa), FASTA scores: opt: 338, E(): 1.9e-13, (38.05% identity in 142 aa overlap); Q9RTR4|DR1692 LONG-CHAIN FATTY ACID--CoA LIGASE from Deinococcus radiodurans (584 aa), FASTA scores: opt: 331, E(): 5.3e-13, (35.15% identity in 145 aa overlap); etc." /codon_start=1 /transl_table=11 /product="fatty-acid-CoA ligase" /protein_id="NP_218030.1" /db_xref="GI:15610649" /db_xref="GeneID:888277" /translation="MAASLSENLSCHSSNMCRLSGNAATNLERPGEEPPGDRCTRRQA VRPARTLAKKGNIPVGYYKDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGRGS VSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAE LDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGS" repeat_region complement(3945098..3945597) /note="500 bp perfect direct repeat 2; second copy at 3950830..3951329." gene 3945794..3950263 /gene="PE_PGRS57" /locus_tag="Rv3514" /db_xref="GeneID:888294" CDS 3945794..3950263 /gene="PE_PGRS57" /locus_tag="Rv3514" /function="UNKNOWN" /note="Rv3514, (MTV023.21), len: 1489 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK47971 (1715 aa) FASTA scores: opt: 6940, E(): 0, (67.0% identity in 1713 aa overlap); and upstream O53553|YZ08_MYCTU|Rv3508|MTV023.15 (1901 aa), FASTA scores: opt: 6598,E(): 0, (71.05% identity in 1533 aa overlap). Contains two PS00583 pfkB family of carbohydrate kinases signatures 1. TBparse score is 0.838." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177982.1" /db_xref="GI:57117118" /db_xref="GeneID:888294" /translation="MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGA DEVSARIAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVLGV INAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLW GNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAG GVGGAGGGTGGAGGRAELLFGAGGAGGAGTDGGPGATGGTGGHGGVGGDGGWLAPGGA GGAGGQGGAGGAGSDGGALGGTGGTGGTGGAGGAGGRGALLLGAGGQGGLGGAGGQGG TGGAGGDGVLGGVGGTGGKGGVGGVAGLGGAGGAAGQLFSASGAAGNAGVGGAGGQGG DGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGQGGAGGAGG AGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGQGGDG GAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQGGAGGAG GAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGQGGD GGAGGAGADADQPGATGGTGFAGGAGGAGKAGGSSSAGGTNSSGSAGGTGRQSGTGGA GGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGSSG AGGTNGSGGAGGTDGQGGAGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTG GTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGSGGSS CAGGTNGSGGAGGTCGQVVAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDP GKGGTGGTGGTGGSGGAGGSGGANFNGGTGGTGGTGGKGGLNTDGLSSATSGTGGTGG TGGKGGTGGAGDDSAGGTGGTGGAGGNAGAGGLANTGGTAGNAGIGGDGGQGGNGGQG DSGSGLGGQPGFAGGAGGKGGAGGSSGAGGTNGSGGAGGAGGQGGAGGAGISFSNGSN GGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGGANFNGGTGGT GGTGGTGGKGGMGGIAGDGGPGGDGGNAGVGGKGGTNGNGGSGGTGGTGGAGGNAGAG GLANTGGTAGNAGIGGDGGQGGNGGQGDSGSGLGGQPGFAGGPGGKGGAGGNAGTGGT NGSGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTG GTGGTGGSGGAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAGDGGPGGDGGNAGVGG KGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDGGDGADGGAATGVGDGGD GGNGGNGGNGGTGVGSPGGLGGAGGTGGLGGAGAGGGADGDDGDDGQPGNNGS" misc_feature 3947423..3947494 /gene="PE_PGRS57" /locus_tag="Rv3514" /note="PS00583 pfkB family of carbohydrate kinases signature 1." misc_feature 3947774..3947845 /gene="PE_PGRS57" /locus_tag="Rv3514" /note="PS00583 pfkB family of carbohydrate kinases signature 1." gene complement(3950824..3952470) /gene="fadD19" /locus_tag="Rv3515c" /db_xref="GeneID:888275" CDS complement(3950824..3952470) /gene="fadD19" /locus_tag="Rv3515c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A; in Mycobacterium may be involved in virulence" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="YP_177983.1" /db_xref="GI:57117119" /db_xref="GeneID:888275" /translation="MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLA HHLIDQGVQKDDKVGLYCRNRIEIVIAMLGIVKAGAILVNVNFRYVEGELRYLFDNSD MVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYSAIAAGSPERD FGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTDFATGEFVKDEYDLAKAAA ANPPMIRYPIPPMIHGATQSATWMALFSGQTTVLAPEFNADEVWRTIHKHKVNLLFFT GDAMARPLVDALVKGNDYDLSSLFLLASTAALFSPSIKEKLLELLPNRVITDSIGSSE TGFGGTSVVAAGQAHGGGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYY KDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAA LKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAELDSFVRSEIAGYKVPRSL WFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG" repeat_region complement(3950830..3951329) /note="500 bp perfect direct repeat 1; second copy at 3945098..3945597." misc_feature complement(3951925..3951960) /gene="fadD19" /locus_tag="Rv3515c" /note="PS00455 Putative AMP-binding domain signature." gene 3952544..3953335 /gene="echA19" /locus_tag="Rv3516" /db_xref="GeneID:888301" CDS 3952544..3953335 /gene="echA19" /locus_tag="Rv3516" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_218033.1" /db_xref="GI:15610652" /db_xref="GeneID:888301" /translation="MESGPDALVERRGHTLIVTMNRPAARNALSTEMMRIMVQAWDRV DNDPDIRCCILTGAGGYFCAGMDLKAATQKPPGDSFKDGSYGPSRIDALLKGRRLTKP LIAAVEGPAIAGGTEILQGTDIRVAGESAKFGISEAKWSLYPMGGSAVRLVRQIPYTL ACDLLLTGRHITAAEAKEMGLIGHVVPDGQALTKALELADAISANGPLAVQAILRSIR ETECMPENEAFKIDTQIGIKVFLSDDAKEGPRAFAEKRAPNFQNR" gene 3953431..3954270 /locus_tag="Rv3517" /db_xref="GeneID:888284" CDS 3953431..3954270 /locus_tag="Rv3517" /function="UNKNOWN" /note="Rv3517, (MTV023.24), len: 279 aa. Hypothetical protein, similar to several hypothetical mycobacterial proteins e.g. P71763|Rv1482c|MTCY277.03c from Mycobacterium tuberculosis strain H37Rv (339 aa) (alias AAK45794|MT1529 from Mycobacterium tuberculosis strain CDC1551 (292 aa) but longer) FASTA scores: opt: 1040, E(): 3.7e-60, (59.0% identity in 273 aa overlap); O07396|MAV346 from Mycobacterium avium (346 aa) FASTA scores: opt: 1018, E(): 1e-58, (57.2% identity in 278 aa overlap); O53421|Rv1073|MTV017.26 from Mycobacterium tuberculosis strain H37Rv (283 aa), FASTA scores: opt: 903, E(): 2.4e-51, (48.0% identity in 277 aa overlap); Q50134|U650AG|MLCB57.67c from Mycobacterium leprae (75 aa) FASTA scores: opt: 158, E(): 0.0015, (41.8% identity in 55 aa overlap); etc. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218034.1" /db_xref="GI:15610653" /db_xref="GeneID:888284" /translation="MIEPFLGSEAIASGALTRHRLRSAYATIHPDVYVSPGADLTAWS RAQAAWLWSRRRGVIAGQSAAAMHGAKWVDARQAAELLYDHRRPPAGIHTWSDRVADD EIQPISGMNTTTPARTALDLARRYPVGKAVAAIDALARATDLKLADVEMLAERYRGSR GIRNARIALDLVDPGAESPRETWLRLLLIRAGFPRPQTQIPVYDEYGQLVAVIDMGWA GIKVGVDYEGDHHRTDRRTFNKDIKRAEALTELGWTDVRVTVEDTEGGIIWRVSAAWQ RRT" gene complement(3954325..3955521) /gene="cyp142" /locus_tag="Rv3518c" /db_xref="GeneID:888282" CDS complement(3954325..3955521) /gene="cyp142" /locus_tag="Rv3518c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv3518c, (MTV023.25c), len: 398 aa. Probable cyp142, cytochrome P450 monoxygenase (EC 1.14.-.-), member of Cytochrome P450 family and similar to many e.g. Q9L465|CYP162A1|NIKQ from Streptomyces tendae (396 aa) FASTA scores: opt: 798, E(): 2e-43, (36.7% identity in 403 aa overlap); P33271|CPXK_SACER|CYP107B1 from Saccharopolyspora erythraea (Streptomyces erythraeus) (405 aa), FASTA scores: opt: 725, E(): 9.1e-39, (37.1% identity in 407 aa overlap); Q9X8Q3|CYP107P1|SCH10.14c from Streptomyces coelicolor (411 aa), FASTA scores: opt: 691, E(): 1.3e-36, (37.2% identity in 317 aa overlap); etc. Also similar to Q50696|C124_MYCTU|CYP124|Rv2266|MT2328|MTCY339.44c from Mycobacterium tuberculosis strain H37Rv (428 aa) FASTA scores: opt: 692, E(): 1.2e-36, (36.8% identity in 402 aa overlap). Equivalent to AAK47979 from Mycobacterium tuberculosis strain CDC1551 (372 aa) but longer 26 aa. Contains PS00086 Cytochrome P450 cysteine heme-iron ligand signature. BELONGS TO THE CYTOCHROME P450 FAMILY. TBparse score is 0.891." /codon_start=1 /transl_table=11 /product="cytochrome P450 monooxygenase 142" /protein_id="NP_218035.1" /db_xref="GI:15610654" /db_xref="GeneID:888282" /translation="MTEAPDVDLADGNFYASREARAAYRWMRANQPVFRDRNGLAAAS TYQAVIDAERQPELFSNAGGIRPDQPALPMMIDMDDPAHLLRRKLVNAGFTRKRVKDK EASIAALCDTLIDAVCERGECDFVRDLAAPLPMAVIGDMLGVRPEQRDMFLRWSDDLV TFLSSHVSQEDFQITMDAFAAYNDFTRATIAARRADPTDDLVSVLVSSEVDGERLSDD ELVMETLLILIGGDETTRHTLSGGTEQLLRNRDQWDLLQRDPSLLPGAIEEMLRWTAP VKNMCRVLTADTEFHGTALCAGEKMMLLFESANFDEAVFCEPEKFDVQRNPNSHLAFG FGTHFCLGNQLARLELSLMTERVLRRLPDLRLVADDSVLPLRPANFVSGLESMPVVFT PSPPLG" misc_feature complement(3954496..3954525) /gene="cyp142" /locus_tag="Rv3518c" /note="PS00086 Cytochrome P450 cysteine heme-iron ligand signature." gene 3955550..3956260 /locus_tag="Rv3519" /db_xref="GeneID:887247" CDS 3955550..3956260 /locus_tag="Rv3519" /function="UNKNOWN" /note="Rv3519, (MTV023.26), len: 236 aa (start uncertain). Hypothetical unknown protein. The C-terminal end is highly similar to N-terminal end of AAK47980|MT3620 HYPOTHETICAL 7.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (73 aa), FASTA scores: opt: 279, E(): 9.4e-12, (95.65% identity in 46 aa overlap). TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218036.1" /db_xref="GI:15610655" /db_xref="GeneID:887247" /translation="MPVSQHTIAGTVLTMPVRIRTANLHSAMFSVPADPAQRLIDYSG LRVCEYLPGKAIVMQMLVRYVDGDLGRYHEYGTAIMVNPPGTQRRGPRALTRAAAFIH HLPVDQVFTLEAGRTIWGFPKIMADFNVTDGRRFGFDVSADGRLIAGIEFSTGLPVPT LGWQMLKTYSHHDGVTREIPWEMKVSGLRARLGGARLRLGDHPYAKELASLGLPKRAL LSQSAANVEMTFGDGHPI" gene complement(3956325..3957368) /locus_tag="Rv3520c" /db_xref="GeneID:887310" CDS complement(3956325..3957368) /locus_tag="Rv3520c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3520c, (MTV023.27c), len: 347 aa. Possible coenzyme F420-dependent oxidoreductase (EC 1.-.-.-), equivalent to Q9CCV8|ML0348 POSSIBLE COENZYME F420-DEPENDENT OXIDOREDUCTASE from Mycobacterium leprae (350 aa), FASTA scores: opt: 2029, E(): 9.1e-120, (86.85% identity in 342 aa overlap). Similar to many coenzyme F420-dependent enzymes (and other proteins) e.g. Q9AD98|SCI52.11c PUTATIVE ATP/GTP-BINDING PROTEIN from Streptomyces coelicolor (351 aa), FASTA scores: opt: 859, E(): 1.6e-46, (41.9% identity in 346 aa overlap); Q9X7Y1|SC6A5.35 PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (341 aa), FASTA scores: opt: 800, E(): 7.9e-43, (38.95% identity in 339 aa overlap); Q9ZA30|GRA-ORF29 PUTATIVE FMN-DEPENDENT MONOOXYGENASE from Streptomyces violaceoruber (343 aa), FASTA scores: opt: 354, E(): 6.7e-15, (34.2% identity in 336 aa overlap); Q49598|MER COENZYME F420-DEPENDENT N5,N10-METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE from Methanopyrus kandleri (349 aa), FASTA scores: opt: 283, E(): 1.9e-10, (26.75% identity in 329 aa overlap); Q58929|MER|MJ1534 F420-DEPENDENT METHYLENETETRAHYDROMETHANOPTERIN REDUCTASE from Methanococcus jannaschii (331 aa), FASTA scores: opt: 227, E(): 5.8e-07, (26.35% identity in 334 aa overlap); O27784|MTH1752 COENZYME F420-DEPENDENT N5,N10-METHYLENE TETRAHYDROMETHANOPTERIN REDUCTASE from Methanobacterium thermoautotrophicum (321 aa), FASTA scores: opt: 207, E(): 1e-05, (27.4% identity in 336 aa overlap); etc. Also similar to Q11030|YD60_MYCTU|Rv1360|MT1405|MTCY02B10.24 HYPOTHETICAL 37.3 KDA PROTEIN from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 313, E(): 2.5e-12, (28.0% identity in 311 aa overlap). TBparse score is 0.890." /codon_start=1 /transl_table=11 /product="coenzyme F420-dependent oxidoreductase" /protein_id="NP_218037.1" /db_xref="GI:15610656" /db_xref="GeneID:887310" /translation="MEAGMKLGLQLGYWGAQPPQNHAELVAAAEDAGFDTVFTAEAWG SDAYTPLAWWGSSTQRVRLGTSVIQLSARTPTACAMAALTLDHLSGGRHILGLGVSGP QVVEGWYGQRFPKPLARTREYIDIVRQVWARESPVTSAGPHYRLPLTGEGTTGLGKAL KPITHPLRADIPIMLGAEGPKNVALAAEICDGWLPIFYSPRMAGMYNEWLDEGFARPG ARRSREDFEICATAQVVITDDRAAAFAGIKPFLALYMGGMGAEETNFHADVYRRMGYT QVVDEVTKLFRSGRKDEAAEIIPDELVDDAVIVGDIDHVRKQMAVWEAAGVTMMVVTA GSAEQVRDLAALV" gene 3957521..3958432 /locus_tag="Rv3521" /db_xref="GeneID:887209" CDS 3957521..3958432 /locus_tag="Rv3521" /function="UNKNOWN" /note="Rv3521, (MTV023.28), len: 303 aa. Conserved hypothetical protein, similar to (although longer than) other conserved hypothetical proteins e.g. O29296|AF0966 from Archaeoglobus fulgidus (176 aa), FASTA scores: opt: 286, E(): 5.4e-11, (31.15% identity in 170 aa overlap); O30036|AF0203 from Archaeoglobus fulgidus (149 aa) FASTA scores: opt: 259, E(): 2.3e-09, (33.8% identity in 142 aa overlap); O29297|AF0965 from Archaeoglobus fulgidus (154 aa), FASTA scores: opt: 241, E(): 3.2e-08, (31.4% identity in 137 aa overlap); Q9Y995|APE2390 from Aeropyrum pernix (157 aa), FASTA scores: opt: 204, E(): 6.8e-06, (27.45% identity in 153 aa overlap); BAB60424|TVG1322512 from Thermoplasma volcanium (164 aa), FASTA scores: opt: 183, E(): 0.00015, (29.75% identity in 148 aa overlap); etc. Equivalent to AAK47982 from Mycobacterium tuberculosis strain CDC1551 (334 aa) but shorter 31 aa. TBparse score is 0.884." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218038.1" /db_xref="GI:15610657" /db_xref="GeneID:887209" /translation="MGPTLSRFFTALRARRIVGVRGSDGRVHVPPVEYDPVTYEPLSE MVPVSSVGTVASWTWQPEPLAGQPLDRPFAWALIKLDGADTLLMHAVDVGTAGPSAIH TGARVHAHWADQPVGAITDIACFALGETAEPVAAHKTEDARDPVTMIVTPIQLEIQHT ASHEESAYLRAIAQGKLVGARTGKTGKVYFPPHGADPATGKPTSEFVELPDKGTVTTF AIVNIPFLGQRIKPPYVAAYVLLDGADIPFLHLVSDVDAHQVRMGMRVEAVWKPRERW GLGIDNIEYFRPTGEPDANYDTYKHHL" gene 3958448..3959512 /gene="ltp4" /locus_tag="Rv3522" /db_xref="GeneID:887885" CDS 3958448..3959512 /gene="ltp4" /locus_tag="Rv3522" /EC_number="2.3.1.16" /function="UNKNOWN; PROBABLY INVOLVED IN LIPID METABOLISM." /note="Rv3522, (MTV023.29), len: 354 aa. Possible ltp4, lipid carrier protein or keto acyl-CoA thiolase (EC 2.3.1.16), similar to several e.g. O30103|AF0134 3-KETOACYL-CoA THIOLASE (ACAB-4) from Archaeoglobus fulgidus (398 aa) FASTA scores: opt: 352, E(): 5.3e-15, (30.45% identity in 381 aa overlap); O29295|AF0967 3-KETOACYL-CoA THIOLASE (ACAB-9) from Archaeoglobus fulgidus (400 aa) FASTA scores: opt: 312, E(): 1.8e-12, (28.05% identity in 367 aa overlap); O29294|AF0968 3-KETOACYL-CoA THIOLASE (ACAB-10) from Archaeoglobus fulgidus (388 aa), FASTA scores: opt: 293, E(): 2.9e-11, (25.9% identity in 309 aa overlap); O58409|PH0676 LONG HYPOTHETICAL NONSPECIFIC LIPID-TRANSFER PROTEIN (ACETHYL CoA SYNTHETASE) (EC 6.2.1.-) from Pyrococcus horikoshii (389 aa), FASTA scores: opt: 292, E(): 3.3e-11, (25.8% identity in 368 aa overlap); Q9Y9A3|APE2382 LONG HYPOTHETICAL NON SPECIFIC LIPID-TRANSFER PROTEIN from Aeropyrum pernix (360 aa) FASTA scores: opt: 270, E(): 7.8e-10, (27.25% identity in 363 aa overlap); Q9YDI4|APE0929 LONG HYPOTHETICAL NONSPECIFIC LIPID-TRANSFER PROTEIN from Aeropyrum pernix (400 aa), FASTA scores: opt: 258, E(): 4.9e-09, (26.45% identity in 306 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="lipid-transfer protein" /protein_id="NP_218039.1" /db_xref="GI:15610658" /db_xref="GeneID:887885" /translation="MSVRDIAVVGFAHAPHVRRTDGTTNGVEMLMPCFAQLYDELGIT KADIGFWCSGSSDYLAGRAFSFISAIDSIGAVPPINESHVEMDAAWALYEAYIKLLTG EVDTALVYGFGKSSAGTLRRVLSRQTDPYTVAPLWPDSVSMAGLQARLGLDSGKWTHE QMARVAFDSFTNARRVDSVEPPITVGELLARPFFADPLRRHDIAPITDGAAAVVLAAD NRARELRENPAWITGIEHRIESPALGARDITESPSTKLAAKIATGGHTGDIDVAEIHG PFTHQHLIVAEAIRIPGKTKVNPSGGPLAANPMFAAGLERIGFAAQHTWDGSARRVLA HATSGPALQQNLVAVMEGRG" misc_feature 3959309..3959332 /gene="ltp4" /locus_tag="Rv3522" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene 3959529..3960713 /gene="ltp3" /locus_tag="Rv3523" /db_xref="GeneID:888247" CDS 3959529..3960713 /gene="ltp3" /locus_tag="Rv3523" /EC_number="2.3.1.9" /function="UNKNOWN; PROBABLY INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_218040.1" /db_xref="GI:15610659" /db_xref="GeneID:888247" /translation="MAGKLAAVLGTGQTKYVAKRQDVSMNGLVREAIDRALADSGSTF DDIDAVVVGKAPDFFEGVMMPELFMADAMGATGKPLIRVHTAGSVGGSTGVVAASLVQ SGKYRRVLALAWEKQSESNAMWALSIPVPFTKPVGAGAGGYFAPHVRAYIRRSGAPAH IGAMVAVKDRLNGSRNPLAHLQQPDITLEKVMASQMLWDPIRFDETCPSSDGACAVVV GDEEIADARLAQGHPVAWIHGTALRTEPLAFAGRDQVNPQAGRDAAAALWKAAGITSP IDEIDAAEIYVPFSWFEPMWLENLGFAREGEGWKLTEAGETAIGGRLPVNPSGGVLSA NPIGASGLIRFAEAAIQVMGKAEARQVPGARKALGHAYGGGSQYFSMWVVGCEKPKQA AA" gene 3960755..3961786 /locus_tag="Rv3524" /db_xref="GeneID:888250" CDS 3960755..3961786 /locus_tag="Rv3524" /function="UNKNOWN" /note="Rv3524, (MTCY03C7.32c), len: 343 aa. Probable conserved membrane protein, showing some similarity to C-terminal part of putative Mycobacterium tuberculosis proteins O05871|P95308|PKND_MYCTU|Rv0931c|MT0958|MTCY08C9.08 serine-threonine protein kinase PknD (EC 2.7.1.-) (664 aa) FASTA scores: opt: 727, E(): 8.3e-36, (45.3% identity in 298 aa overlap); O53893|Rv0980c|MTV044.08c PGRS-FAMILY PROTEIN (457 aa), FASTA scores: opt: 208, E(): 4.4e-05, (33.75% identity in 166 aa overlap); and O53891|Rv0978c|MTV044.06c PGRS-FAMILY PROTEIN (331 aa) FASTA scores: opt: 153, E(): 0.062, (30.75% identity in 117 aa overlap). Contains PS00237 G-protein coupled receptors signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218041.1" /db_xref="GI:15610660" /db_xref="GeneID:888250" /translation="MVKFTPDSQTSVLRAGKCSGTLSPSRSRLQRGSWPVDSERRRYG WPRNRRTLAITGAAVVVVVTLAAIGYLIFEPKISGSSTSRQAASPTTPSPPSQVVVPI DLWNPDGVTVDLADAVYVADSGHKRLLKLPAGSNTPTTLPFTDTIGPGGVAVNSNRDV YVIDEDSHHVLKLAAGIEPPVELPFGSLGDAHGLAVDRSDSVYVVDYDNAKVLKLPPG ADTPTELPFVGLDHPYDVAVDGAGTVYVTDSGHNRVVALTAGSATPVHLPFADLSFPA GVTVDRDDSVYVADLNNNRVLKLAAGSNAQSQLPFTGLFSPTDVAVDNDGAVYVIDFY NRMLKLPTA" misc_feature 3961310..3961360 /locus_tag="Rv3524" /note="PS00237 G-protein coupled receptors signature." gene complement(3961800..3962324) /locus_tag="Rv3525c" /db_xref="GeneID:888255" CDS complement(3961800..3962324) /locus_tag="Rv3525c" /function="UNKNOWN" /note="Rv3525c, (MTCY3C7.31), len: 174 aa. Possible siderophore-binding protein, similar to ferripyochelin binding proteins (and related) e.g. Q9RSN5|DR2089 FERRIPYOCHELIN-BINDING PROTEIN from Deinococcus radiodurans (240 aa), FASTA scores: opt: 472, E(): 3.3e-21, (46.9% identity in 162 aa overlap); O59257|PH1591 LONG HYPOTHETICAL FERRIPYOCHELIN BINDING PROTEIN from Pyrococcus horikoshii (173 aa), FASTA scores: opt: 431, E(): 6.7e-19, (40.0% identity in 170 aa overlap); Q9V158|FBP|PAB0393 FERRIPYOCHELIN BINDING PROTEIN from Pyrococcus abyssi (173 aa), FASTA scores: opt: 429, E(): 8.9e-19, (39.4% identity in 170 aa overlap); BAB47820|MLR0180 FERRIPYOCHELIN BINDING PROTEIN-LIKE from Rhizobium loti (Mesorhizobium loti) (175 aa), FASTA scores: opt: 415, E(): 6.1e-18, (42.55% identity in 141 aa overlap); etc." /codon_start=1 /transl_table=11 /product="siderophore-binding protein" /protein_id="NP_218042.1" /db_xref="GI:15610661" /db_xref="GeneID:888255" /translation="MPLFSFEGRSPRIDPTAFVAPTATLIGDVTIEAGASVWFNAVLR GDYAPVVVREGANVQDGAVLHAPPGIPVDIGPGATVAHLCVIHGVHVGSEALIANHAT VLDGAVIGARCMIAAGALVVAGTQIPAGMLVTGAPAKVKGPIEGTGAEMWVNVNPQAY RDLAARHLAGLEPM" gene 3962439..3963599 /locus_tag="Rv3526" /db_xref="GeneID:888268" CDS 3962439..3963599 /locus_tag="Rv3526" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3526, (MTCY03C7.30c), len: 386 aa. Hypothetical oxidoreductase (EC 1.-.-.-), highly similar, except in C-terminus (also longer 69 aa), to O69348|ORF12 PROTEIN (function unknown) from Rhodococcus erythropolis (316 aa) FASTA scores: opt: 1137, E(): 6.9e-65, (59.6% identity in 250 aa overlap). Also some similarity with several aminopyrrolnitrin oxidases (PRND proteins, involved in the pathway for pyrrolnitrin biosynthesis, a secondary metabolite derived from tryptophan which has strong anti-fungal activity) e.g. Q9RPG0|PRND from Myxococcus fulvus (379 aa), FASTA scores: opt: 322, E(): 4.4e-13, (25.85% identity in 352 aa overlap); Q9RPG4|PRND from Burkholderia cepacia (Pseudomonas cepacia) (373 aa) FASTA scores: opt: 306, E(): 4.5e-12, (25.2% identity in 373 aa overlap); P95483|PRND from Pseudomonas fluorescens (363 aa), FASTA scores: opt: 305, E(): 5.1e-12, (25.0% identity in 372 aa overlap); etc. And also some similarity to other putative enzymes like dioxygenases, oxidases, vanillate O-demethyl oxygenase, etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218043.1" /db_xref="GI:15610662" /db_xref="GeneID:888268" /translation="MSTDTSGVGVREIDAGALPTRYARGWHCLGVAKDYLEGKPHGVE AFGTKLVVFADSHGDLKVLDGYCRHMGGDLSEGTVKGDEVACPFHDWRWGGDGRCKLV PYARRTPRMARTRSWTTDVRSGLLFVWHDHEGNPPDPAVRIPEIPEAASDEWTDWRWN RILIEGSNCRDIIDNVTDMAHFFYIHFGLPTYFKNVFEGHIASQYLHNVGRPDVDDLG TSYGEAHLDSEASYFGPSFMINWLHNRYGNYKSESILINCHYPVTQNSFVLQWGVIVE KPKGMSEEMTDKLSRVFTEGVSKGFLQDVEIWKHKTRIDNPLLVEEDGAVYQLRRWYE QFYVDVADIKPEMVERFEIEVDTKRANEFWNAEVEKNLKSREVSDDVPAEQH" gene 3963605..3964054 /locus_tag="Rv3527" /db_xref="GeneID:888295" CDS 3963605..3964054 /locus_tag="Rv3527" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3527, (MTCY03C7.29c), len: 149 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218044.1" /db_xref="GI:15610663" /db_xref="GeneID:888295" /translation="MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKS RDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTE ASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPNR" gene complement(3964479..3965192) /locus_tag="Rv3528c" /db_xref="GeneID:888297" CDS complement(3964479..3965192) /locus_tag="Rv3528c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3528c, (MTCY03C7.28), len: 237 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218045.1" /db_xref="GI:15610664" /db_xref="GeneID:888297" /translation="MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEG AYTFRALDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDAL FLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPH SKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDC RGFGWLPNIQNRAFLFARQ" gene complement(3965884..3967038) /locus_tag="Rv3529c" /db_xref="GeneID:888324" CDS complement(3965884..3967038) /locus_tag="Rv3529c" /function="UNKNOWN" /note="Rv3529c, (MTCY03C7.27), len: 384 aa. Conserved hypothetical protein, showing some similarity to Q50695|YM67_MYCTU|Rv2267c|MT2329|MTCY339.43 HYPOTHETICAL 46.1 KDA PROTEIN from Mycobacterium tuberculosis (388 aa) FASTA scores: opt: 261, E(): 1.6e-09, (27.25% identity in 253 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218046.1" /db_xref="GI:15610665" /db_xref="GeneID:888324" /translation="MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLL DAYQGEAGLTVLGSKMNRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLVRT GTTALHRLLGADPAHQGLHMWLAEYPQPRPPRETWESNPLYRQLDAQFTQHHAENPGY TGLHFMAAYELEECWQLLRQSLHSVSYEALAHVPSYADWLSRQDWTPSYCRHRRNLQL IGLNDAEKRWVLKNPSHLFALDALMATYPDALVVQTHRPVETIMASMCSLAQHTTEGW STKFVGAQIGADAMDTWSRGLERFNAARAKYDSAQFYDVDYHDLIADPLGTVADIYRH FGLTLSDEARQAMTTVHAESQSGARAPKHSYSLADYGLTVEMVKERFAGL" gene complement(3967038..3967820) /locus_tag="Rv3530c" /db_xref="GeneID:888338" CDS complement(3967038..3967820) /locus_tag="Rv3530c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3530c, (MTCY03C7.26), len: 260 aa. Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases and hypothetical proteins e.g. BAB53258|Q987E5|MLL7083 PROBABLE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (258 aa), FASTA scores: opt: 405, E(): 5.3e-18, (33.45% identity in 263 aa overlap); Q9VNF3|CG12171 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (257 aa), FASTA scores: opt: 404, E(): 6.1e-18, (32.8% identity in 256 aa overlap); Q9A3X5|CC3076 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Caulobacter crescentus (254 aa), FASTA scores: opt: 400, E(): 1.1e-17, (31.0% identity in 255 aa overlap); BAB50080|MLR3115 DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (259 aa), FASTA scores: opt: 393, E(): 3e-17, (31.9% identity in 254 aa overlap); Q9F5J1|SIM-NJ1|SIMD2 PUTATIVE 3-KETO-ACYL-REDUCTASE from Streptomyces antibioticus (273 aa), FASTA scores: opt: 388, E(): 6.3e-17, (31.6% identity in 250 aa overlap); etc." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_218047.1" /db_xref="GI:15610666" /db_xref="GeneID:888338" /translation="MTGMLKRKVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERL DDVAKQIIDTGRRAVAVRTDITDDDDVSNLVQATLAAYGKADVLINNAFRVPSMKPLA GTTFEHIRDAIELSALGTLRLIQAFTPALAQSHGAIVNVNSMVIRHSQPKYGTYKMAK SVLLAMSHSLATELGEQGIRVNSVAPGYIWGDTLKSYFDHQAGKYGTTVDQIYQATAA NSDLKRLPTEDEVASAILFLASDLASGITGQTLDVNCGEYHT" gene complement(3967817..3968944) /locus_tag="Rv3531c" /db_xref="GeneID:888348" CDS complement(3967817..3968944) /locus_tag="Rv3531c" /function="UNKNOWN" /note="Rv3531c, (MTCY03C7.25), len: 375 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218048.1" /db_xref="GI:15610667" /db_xref="GeneID:888348" /translation="MYSDPLREAIAEAEQLVAAAPHIETEADLLEGLQYLAGCIAGCM HLAFDYERDHPFLQSGTGPFTKMGLDNPDTLYFGTRLQANRDYVVSGRRGTTTDLSFQ LLGGEYTDYNVPASQAAFDDRELDIAADGSFEWRLRPSAPGQLVIREVYGDWSQQRGT LAIARLDTVGTAPPPLTRELMEKRYATAGSQLVNRVKTWLQFPQWFYLNIPVNTMVAP RLTPGGLATQYSSAGHFELRPGQALVITVPVSDAPYLGFQLGSMWYISLDYINHQTSL NASQAQADPDGKVRIVVAEQNPGVTNWVETLGHRRGFLQFRWQRVSRELTEADGPTVE LVDFDAIPAALPHYQHNKISEDDWRARIALRQRQIATRMLG" gene 3969343..3970563 /gene="PPE61" /locus_tag="Rv3532" /db_xref="GeneID:888370" CDS 3969343..3970563 /gene="PPE61" /locus_tag="Rv3532" /function="UNKNOWN" /note="Rv3532, (MTCY03C7.24c), len: 406 aa. Member of the Mycobacterium tuberculosis PPE protein family, similar to many, e.g. O53956|Rv1807|MTV049.29 (403 aa), FASTA scores: opt: 954, E(): 1.1e-43, (44.1% identity in 417 aa overlap)" /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177984.1" /db_xref="GI:57117120" /db_xref="GeneID:888370" /translation="MFMDFAMLPPEVNSTRMYSGPGAGSLWAAAAAWDQVSAELQSAA ETYRSVIASLTGWQWLGPSSVRMGAAVTPYVEWLTTTAAQARQTATQITAAATGFEQA FAMTVPPPAIMANRAQVLSLIATNFFGQNTAAIAALETQYAEMWEQDATAMYDYAATS AAARTLTPFTSPQQDTNSAGLPAQSAEVSRATANAGAADGNWLGNLLEEIGILLLPIA PELTPFFLEAGEIVNAIPFPSIVGDEFCLLDGLLAWYATIGSINNINSMGTGIIGAEK NLGILPELGSAAAAAAPPPADIAPAFLAPLTSMAKSLSDGALRGPGEVSAAMRGAGTI GQMSVPPAWKAPAVTTVRAFDATPMTTLPGGDAPAAGVPGLPGMPASGAGRAGVVPRY GVRLTVMTRPLSGG" gene complement(3970705..3972453) /gene="PPE62" /locus_tag="Rv3533c" /db_xref="GeneID:888385" CDS complement(3970705..3972453) /gene="PPE62" /locus_tag="Rv3533c" /function="UNKNOWN" /note="Rv3533c, (MTCY03C7.23), len: 582 aa. Member of the Mycobacterium tuberculosis PPE protein family, similar to many, e.g. O53309|Rv3159c|MTV014.03c (590 aa) FASTA scores: opt: 2289, E(): 2.3e-95, (63.5% identity in 600 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177985.1" /db_xref="GI:57117121" /db_xref="GeneID:888385" /translation="MNYAVLPPELNSLRMFTGAGSAPMLAAAVAWDGLAAELGSAASS FGSVTSDLASQAWQGPAAAAMAAAAAPYAGWLSAAAARAAGAAAQAKAVASAFEAARA ATVHPLLVAANRNAFAQLVMSNWFGLNAPLIAAVEGAYEQMWAADVAAMVGYHSGASA AAEQLVPFQQALQQLPNLGIGNIGNANLGGGNTGDLNTGNGNIGNTNLGSGNRGDANL GSGNIGNSNVGGGNVGNGNFGSGNGRAGLPGSGNVGNGNLGNSNLGSGNTGNSNVGFG NTGNNNVGTGNAGSGNIGAGNTGSSNWGFGNNGIGNIGFGNTGNGNIGFGLTGNNQVG IGGLNSGSGNIGLFNSGTNNVGFFNSGNGNLGIGNSSDANVGIGNSGATVGPFVAGHN TGFGNSGSLNTGMGNAGGVNTGFGNGGAINLGFGNSGQLNAGSFNAGSINTGNFNSGQ GNTGDFNAGVRNTGWSNSGLTNTGAFNAGSLNTGFGAVGTGSGPNSGFGNAGTNNSGF FNTGVGSSGFQNGGSNNSGLQNAVGTVIAAGFGNTGAQTVGIANSGVLNSGFFNSGVH NSGGFNSENQRSGFGN" gene complement(3972552..3973592) /locus_tag="Rv3534c" /db_xref="GeneID:888387" CDS complement(3972552..3973592) /locus_tag="Rv3534c" /EC_number="4.1.3.-" /function="SUPPOSED INVOLVEMENT IN ONE, OR SEVERAL, CATABOLIC PATHWAYS [CATALYTIC ACTIVITY: 4-HYDROXY-2-OXOVALERATE = PYRUVATE + ACETALDEHYDE]." /note="catalyzes the formation of pyruvate and acetaldehyde from 4-hydroxy-2-ketovaleric acid; involved in the degradation of phenylpropionate" /codon_start=1 /transl_table=11 /product="4-hydroxy-2-ketovalerate aldolase" /protein_id="NP_218051.1" /db_xref="GI:15610670" /db_xref="GeneID:888387" /translation="MTDMWDVRITDTSLRDGSHHKRHQFTKDEVGAIVAALDAAGVPV IEVTHGDGLGGSSFNYGFSKTPEQELIKLAAATAKEARIAFLMLPGVGTKDDIKEARD NGGSICRIATHCTEADVSIQHFGLARELGLETVGFLMMAHTIAPEKLAAQARIMADAG CQCVYVVDSAGALVLDGVADRVSALVAELGEDAQVGFHGHENLGLGVANSVAAVRAGA KQIDGSCRRFGAGAGNAPVEALIGVFDKIGVKTGIDFFDIADAAEDVVRPAMPAECLL DRNALIMGYSGVYSSFLKHAVRQAERYGVPASALLHRAGQRKLIGGQEDQLIDIALEI KRELDSGAAVTH" gene complement(3973589..3974500) /locus_tag="Rv3535c" /db_xref="GeneID:888396" CDS complement(3973589..3974500) /locus_tag="Rv3535c" /EC_number="1.2.1.10" /function="SUPPOSED INVOLVEMENT IN ONE, OR SEVERAL, CATABOLIC PATHWAY [CATALYTIC ACTIVITY: ACETALDEHYDE + CoA + NAD(+) = ACETYL-CoA + NADH]." /note="catalyzes the formation of acetyl-CoA from acetalaldehyde" /codon_start=1 /transl_table=11 /product="acetaldehyde dehydrogenase" /protein_id="NP_218052.1" /db_xref="GI:15610671" /db_xref="GeneID:888396" /translation="MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGL ARAAKLGLETTHEGVDWLLAQPDKPDLVFEATSAYVHRDAAPKYAEAGIRAIDLTPAA VGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEIVASVASVSAG PGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPMIMRDTIFCAIPTDADREA IAASIHDVVKEVQTYVPGYRLLNEPQFDEPSINSGGQALVTTFVEVEGAGDYLPPYAG NLDIMTAAATKVGEEIAKETLVVGGAR" gene complement(3974511..3975296) /locus_tag="Rv3536c" /db_xref="GeneID:888406" CDS complement(3974511..3975296) /locus_tag="Rv3536c" /EC_number="4.2.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3536c, (MTCY03C7.20), len: 261 aa. Probable hydratase, 2-oxo-hepta-3-ene-1,7-dioate hydratase (EC 4.2.1.-) or 2-keto-4-pentenoate hydratase (EC 4.2.1.-). Indeed, highly similar to many 2-oxo-hepta-3-ene-1,7-dioate hydratases e.g. Q9CKS2|HPAH|PM1534 from Pasteurella multocida (267 aa) FASTA scores: opt: 743, E(): 1.5e-39, (45.5% identity in 266 aa overlap) Q9RZ31|DRA0122 from Deinococcus radiodurans (268 aa), FASTA scores: opt: 709, E(): 2e-37, (45.5% identity in 266 aa overlap); Q9HWQ4|HPCG|PA4127 from Pseudomonas aeruginosa (267 aa), FASTA scores: opt: 703, E(): 4.8e-37, (45.1% identity in 266 aa overlap); Q46982|HPAH|HPCG from Escherichia colis strain ATCC 11105 (267 aa), FASTA scores: opt: 679, E(): 1.6e-35, (41.35% identity in 266 aa overlap); etc. But also highly similar to many 2-keto-4-pentenoate hydratases (2-hydroxypentadienoic acidhydratases) e.g. Q9LAF7|PHED from Bacillus thermoglucosidasius (258 aa), FASTA scores: opt: 698, E(): 9.7e-37, (42.45% identity in 252 aa overlap); Q52442|BPHH from Pseudomonas sp (260 aa) FASTA scores: opt: 675, E(): 2.7e-35, (41.4% identity in 251 aa overlap); P77608|MHPD_ECOLI|B0350 from Escherichia coli strain K12 (269 aa), FASTA scores: opt: 674, E(): 3.2e-35, (42.75% identity in 255 aa overlap); Q52038|BPHX1 from Pseudomonas pseudoalcaligenes (260 aa), FASTA scores: opt: 663, E(): 1.5e-34, (40.6% identity in 251 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hydratase" /protein_id="NP_218053.1" /db_xref="GI:15610672" /db_xref="GeneID:888406" /translation="MLRDATRDELAADLAQAERSRDPIGQLTAAHPEIDVVDAYEIQL INIRQRVAEGARVVGHKVGLSSPIMQQMMGVDEPDYGHLLDDMQVFEDTPVQASRYLS PRVEVEVGFILAADLPGAGCTEDDVLAATEALVPAIELIDTRIKDWQIKICDTIADNA SAAGFVLGAARVPPADLDVRAIDAKLTRNGEVVAEGRSDAVLGNPATAVAWLAGKVES FGVRLRKGDIVLPGSCTFAVEARAGDEFVADFTGLGLVRLSFE" gene 3975369..3977060 /locus_tag="Rv3537" /db_xref="GeneID:888422" CDS 3975369..3977060 /locus_tag="Rv3537" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="initiates steroid ring degradation; catalyzes the transhydrogenation of 3-keto-4-ene-steroid to 3-keto-1,4-diene-steroid e.g., progesterone to 1,4-androstadiene-3,17-dione" /codon_start=1 /transl_table=11 /product="3-ketosteroid-delta-1-dehydrogenase" /protein_id="NP_218054.1" /db_xref="GI:15610673" /db_xref="GeneID:888422" /translation="MTVQEFDVVVVGSGAAGMVAALVAAHRGLSTVVVEKAPHYGGST ARSGGGVWIPNNEVLKRRGVRDTPEAARTYLHGIVGEIVEPERIDAYLDRGPEMLSFV LKHTPLKMCWVPGYSDYYPEAPGGRPGGRSIEPKPFNARKLGADMAGLEPAYGKVPLN VVVMQQDYVRLNQLKRHPRGVLRSMKVGARTMWAKATGKNLVGMGRALIGPLRIGLQR AGVPVELNTAFTDLFVENGVVSGVYVRDSHEAESAEPQLIRARRGVILACGGFEHNEQ MRIKYQRAPITTEWTVGASANTGDGILAAEKLGAALDLMDDAWWGPTVPLVGKPWFAL SERNSPGSIIVNMSGKRFMNESMPYVEACHHMYGGEHGQGPGPGENIPAWLVFDQRYR DRYIFAGLQPGQRIPSRWLDSGVIVQADTLAELAGKAGLPADELTATVQRFNAFARSG VDEDYHRGESAYDRYYGDPSNKPNPNLGEVGHPPYYGAKMVPGDLGTKGGIRTDVNGR ALRDDGSIIDGLYAAGNVSAPVMGHTYPGPGGTIGPAMTFGYLAALHIADQAGKR" gene 3977062..3977922 /locus_tag="Rv3538" /db_xref="GeneID:888426" CDS 3977062..3977922 /locus_tag="Rv3538" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3538, (MTCY03C7.18c), len: 286 aa. Probable dehydrogenase (EC 1.-.-.-), similar to Q9L009|SCC30.12c PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (333 aa), FASTA scores: opt: 842, E(): 3.6e-44, (48.4% identity in 285 aa overlap); and similar to C-terminal part of other (principally ESTRADIOL 17 BETA-DEHYDROGENASES/17-BETA-HYDROXYSTEROID DEHYDROGENASES) e.g. P70540 PEROXISOMAL MULTIFUNCTIONAL ENZYME TYPE II (SDR FAMILY) from Rattus norvegicus (Rat) (735 aa) FASTA scores: opt: 622, E(): 1.9e-30, (37.45% identity in 283 aa overlap); or P70523|MPF-2 MULTIFUNCTIONAL PROTEIN 2 (SDR FAMILY) (beta-oxidation protein displaying 2-enoyl-CoA hydratase and D-3-hydroxyacyl-CoA dehydrogenase activity) from Rattus norvegicus (Rat) (734 aa), FASTA scores: opt: 616, E(): 4.3e-30, (37.1% identity in 283 aa overlap); P51659|DHB4_HUMAN|HSD17B4|EDH17B4 ESTRADIOL 17 BETA-DEHYDROGENASE (EC 1.1.1.62) from Homo sapiens (Human) (736 aa), FASTA scores: opt: 614, E(): 5.7e-30, (35.9% identity in 284 aa overlap); P97852|DHB4_RAT|HSD17B4|EDH17B4 ESTRADIOL 17 BETA-DEHYDROGENASE from Rattus norvegicus (Rat) (735 aa) FASTA scores: opt: 613, E(): 6.6e-30, (37.1% identity in 283 aa overlap); Q9DBM3|HSD17B4 ESTRADIOL 17 BETA-DEHYDROGENASE from Mus musculus (Mouse) (735 aa) FASTA scores: opt: 611, E(): 8.7e-30, (36.5% identity in 285 aa overlap); etc. Also similar to Q11198|Rv3389c|MTV004.47c HYPOTHETICAL 30.3 KDA PROTEIN from Mycobacterium tuberculosis (290 aa), FASTA scores: opt: 609, E(): 5.3e-30, (39.65% identity in 285 aa overlap). Note that previously known as ufaA2.; ufaA2" /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="YP_177986.1" /db_xref="GI:57117122" /db_xref="GeneID:888426" /translation="MPIDLDVALGAQLPPVEFSWTSTDVQLYQLGLGAGSDPMNPREL SYLADDTPQVLPTFGNVAATFHLTTPPTVQFPGIDIELSKVLHASERVEVPAPLPPSG SARAVTRFTDIWDKGKAAVICSETTATTPDGLLLWTQKRSIYARGEGGFGGKRGPSGS DVAPERAPDLQVAMPILPQQALLYRLCGDRNPLHSDPEFAAAAGFPRPILHGLCTYGM TCKAIVDALLDSDATAVAGYGARFAGVAYPGETLTVNVWKDGRRLVASVVAPTRDNAV VLSGVELVPA" gene 3978059..3979498 /gene="PPE63" /locus_tag="Rv3539" /db_xref="GeneID:888438" CDS 3978059..3979498 /gene="PPE63" /locus_tag="Rv3539" /function="UNKNOWN" /note="Rv3539, (MTCY03C7.17c), len: 479 aa. Member of the Mycobacterium tuberculosis PPE protein family, similar to many e.g. O53949|Rv1800|MTV049.22 (655 aa), FASTA scores: opt: 914, E(): 7.3e-47, (37.55% identity in 490 aa overlap); etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177987.1" /db_xref="GI:57117123" /db_xref="GeneID:888438" /translation="MADFLTLSPEVNSARMYAGGGPGSLSAAAAAWDELAAELWLAAA SFESVCSGLADRWWQGPSSRMMAAQAARHTGWLAAAATQAEGAASQAQTMALAYEAAF AATVHPALVAANRALVAWLAGSNVFGQNTPAIAAAEAIYEQMWAQDVVAMLNYHAVAS AVGARLRPWQQLLHELPRRLGGEHSDSTNTELANPSSTTTRITVPGASPVHAATLLPF IGRLLAARYAELNTAIGTNWFPGTTPEVVSYPATIGVLSGSLGAVDANQSIAIGQQML HNEILAATASGQPVTVAGLSMGSMVIDRELAYLAIDPNAPPSSALTFVELAGPERGLA QTYLPVGTTIPIAGYTVGNAPESQYNTSVVYSQYDIWADPPDRPWNLLAGANALMGAA YFHDLTAYAAPQQGIEIAAVTSSLGGTTTTYMIPSPTLPLLLPLKQIGVPDWIVGGLN NVLKPLVDAGYSQYAPTAGPYFSHGNLVW" gene complement(3979499..3980659) /gene="ltp2" /locus_tag="Rv3540c" /db_xref="GeneID:888450" CDS complement(3979499..3980659) /gene="ltp2" /locus_tag="Rv3540c" /EC_number="2.3.1.16" /function="UNKNOWN, SUPPOSED INVOLVEMENT IN LIPID METABOLISM." /note="Rv3540c, (MTCY03C7.16), len: 386 aa. Probable ltp2, lipid-transfer protein or keto acyl-CoA thiolase (EC 2.3.1.16), similar to several e.g. Q9X4X2|DITF DITF PROTEIN (hypothetical protein, similar to non-specific lipid-transfer protein and 3-ketoacyl-CoA thiolase) from Pseudomonas abietaniphila (397 aa), FASTA scores: opt: 665, E(): 5.3e-34, (33.4% identity in 392 aa overlap); O30255|AF2416 3-KETOACYL-CoA THIOLASE (ACAB-12) from Archaeoglobus fulgidus (384 aa), FASTA scores: opt: 496, E(): 1.6e-23, (30.35% identity in 389 aa overlap); O28978|AF1291 3-KETOACYL-CoA THIOLASE (ACAB-11) from Archaeoglobus fulgidus (392 aa), FASTA scores: opt: 494, E(): 2.2e-23, (30.6% identity in 379 aa overlap); O26884|MTH793 LIPID-TRANSFER PROTEIN (STEROL OR NONSPECIFIC) from Methanobacterium thermoautotrophicum (383 aa), FASTA scores: opt: 487, E(): 5.9e-23, (30.4% identity in 388 aa overlap); etc." /codon_start=1 /transl_table=11 /product="lipid-transfer protein" /protein_id="NP_218057.1" /db_xref="GI:15610676" /db_xref="GeneID:888450" /translation="MLSGQAAIVGIGATDFSKNSGRSELRLAAEAVLDALADAGLSPT DVDGLTTFTMDTNTEIAVARAAGIGELTFFSKIHYGGGAACATVQHAAMAVATGVADV VVAYRAFNERSGMRFGQVQTRLTENADSTGVDNSFSYPHGLSTPAAQVAMIARRYMHL SGATSRDFGAVSVADRKHAANNPKAYFYGKPITIEDHQNSRWIAEPLRLLDCCQETDG AVAIVVTSAARARDLKQRPVVIEAAAQGCSPDQYTMVSYYRPELDGLPEMGLVGRQLW AQSGLTPADVQTAVLYDHFTPFTLIQLEELGFCGKGEAKDFIADGAIEVGGRLPINTH GGQLGEAYIHGMNGIAEGVRQLRGTSVNPVAGVEHVLVTAGTGVPTSGLILG" gene complement(3980659..3981048) /locus_tag="Rv3541c" /db_xref="GeneID:888475" CDS complement(3980659..3981048) /locus_tag="Rv3541c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3541c, (MTCY03C7.15), len: 129 aa. Hypothetical protein, showing some similarity to Q9CBJ7|ML1909 HYPOTHETICAL PROTEIN from Mycobacterium leprae (142 aa) FASTA scores: opt: 110, E(): 1.2, (27.95% identity in 118 aa overlap); and other (see also BLASTP results) e.g. Q9L0M3|SCD82.08 HYPOTHETICAL 15.2 KDA PROTEIN from Streptomyces coelicolor (142 aa), FASTA scores: opt: 127, E(): 0.086, (27.65% identity in 123 aa overlap). Contains PS00075 Dihydrofolate reductase signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218058.1" /db_xref="GI:15610677" /db_xref="GeneID:888475" /translation="MTVVGAVLPELKLYGDPTFIVSTALATRDFQDVHHDRDKAVAQG SKDIFVNILTDTGLVQRYVTDWAGPSALIKSIGLRLGVPWYAYDTVTFSGEVTAVNDG LITVKVVGRNTLGDHVTATVELSMRDS" misc_feature complement(3980797..3980823) /locus_tag="Rv3541c" /note="PS00075 Dihydrofolate reductase signature." gene complement(3981045..3981980) /locus_tag="Rv3542c" /db_xref="GeneID:888486" CDS complement(3981045..3981980) /locus_tag="Rv3542c" /function="UNKNOWN" /note="Rv3542c, (MTCY03C7.14), len: 311 aa. Hypothetical protein, showing some similarity to other e.g. Q58947|MJ1552 from Methanococcus jannaschii (141 aa) FASTA scores: opt: 177, E(): 0.00065, (46.65% identity in 60 aa overlap); BAB59276|TVG0142586 from Thermoplasma volcanium (135 aa), FASTA scores: opt: 175, E(): 0.00083, (35.65% identity in 87 aa overlap); Q9HI85|TA1457 from Thermoplasma acidophilum (135 aa), FASTA scores: opt: 162, E(): 0.0052, (31.8% identity in 107 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218059.1" /db_xref="GI:15610678" /db_xref="GeneID:888486" /translation="MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDR NPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGV VATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEM NWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRH PPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLG ELRGADPARVAIGMPVRATYIDFPDWSLYAWEPDE" gene complement(3981977..3983140) /gene="fadE29" /locus_tag="Rv3543c" /db_xref="GeneID:888131" CDS complement(3981977..3983140) /gene="fadE29" /locus_tag="Rv3543c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3543c, (MTCY03C7.13), len: 387 aa. Probable fadE29, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9A8P3|CC1310 from Caulobacter crescentus (404 aa), FASTA scores: opt: 624, E(): 9.4e-32, (32.75% identity in 400 aa overlap); Q9I4V2|PA1022 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 550, E(): 3.9e-27, (33.7% identity in 350 aa overlap); O28976|AF1293 from Archaeoglobus fulgidus (384 aa), FASTA scores: opt: 529, E(): 8.1e-26, (30.0% identity in 393 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. O53549|FADE26|Rv3504|MTV023.11 (400 aa), FASTA scores: opt: 1031, E(): 2.8e-57, (46.0% identity in 402 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE29" /protein_id="NP_218060.1" /db_xref="GI:15610679" /db_xref="GeneID:888131" /translation="MFIDLTPEQRQLQAEIRQYFSNLISPDERTEMEKDRHGPAYRAV IRRMGRDGRLGVGWPKEFGGLGFGPIEQQIFVNEAHRADVPLPAVTLQTVGPTLQAHG SELQKKKFLPAILAGEAHFAIGYTEPEAGTDLASLRTTAVRDGDHYIVNGQKVFTTGA HDADYIWLACRTDPNAAKHKGISILIVDTKDPGYSWTPIILADGAHHTNATYYNDVRV PVDMLVGKENDGWRLITTQLNNERVMLGPAGRFASIYDRVHAWASVPGGNGVTPIDHD DVKRALGEIRAIWRINELLNWQVASAGEDINMADAAATKVFGTERVQRAGRLAEEIVG KYGNPAEPDTAELLRWLDAQTKRNLVITFGGGVNEVMREMIAASGLKVPRVPR" gene complement(3983125..3984144) /gene="fadE28" /locus_tag="Rv3544c" /db_xref="GeneID:888091" CDS complement(3983125..3984144) /gene="fadE28" /locus_tag="Rv3544c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3544c, (MTCY03C7.12), len: 339 aa. Probable fadE28, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9RJX3|SCF37.28c from Streptomyces coelicolor (362 aa), FASTA scores: opt: 334, E(): 5.1e-13, (27.65% identity in 329 aa overlap); Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 278, E(): 1.2e-09, (26.95% identity in 319 aa overlap); O29813|AF0436 from Archaeoglobus fulgidus (382 aa) FASTA scores: opt: 205, E(): 3.5e-05, (24.75% identity in 384 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. O53550|FADE27|Rv3505|MTV023.12 (373 aa) FASTA scores: opt: 497, E(): 7e-23, (30.3% identity in 343 aa overlap); and to P46703|ACDP_MYCLE|FADE25|ACD|ML0737|B1308_F1_34 PROBABLE ACYL-CoA DEHYDROGENASE from Mycobacterium leprae (389 aa) FASTA scores: opt: 165, E(): 0.0012, (25.2% identity in 345 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE28" /protein_id="NP_218061.1" /db_xref="GI:15610680" /db_xref="GeneID:888091" /translation="MDFDPTAEQQAVADVVTSVLERDISWEALVCGGVTALPVPERLG GDGVGLFEVGALLTEVGRHGAVTPALATLGLGVVPLLELASAEQQDRFLAGVAKGGVL TAALNEPGAALPDRPATSFVGGRLSGTKVGVGYAEQADWMLVTADNAVVVVSPTADGV RMVRTPTSNGSDEYVMTMDGVAVADCDILADVAAHRVNQLALAVMGAYADGLVAGALR LTADYVANRKQFGKPLSTFQTVAAQLAEVYIASRTIDLVAKSVIWRLAEDLDAGDDLG VLGYWVTSQAPPAMQICHHLHGGMGMDVTYPMHRYYSTIKDLTRLLGGPSHRLELLGA RCSLT" gene complement(3984144..3985445) /gene="cyp125" /locus_tag="Rv3545c" /db_xref="GeneID:887782" CDS complement(3984144..3985445) /gene="cyp125" /locus_tag="Rv3545c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv3545c, (MT3649, MTCY03C7.11), len: 433 aa. Probable cyp125, cytochrome P-450 (EC 1.14.-.-), similar to others e.g. Q59723|LINC|CYP111 from Pseudomonas incognita (406 aa), FASTA scores: opt: 831, E(): 8e-45, (34.75% identity in 406 aa overlap); Q9X8Q3|CYP107P1|SCH10.14c from Streptomyces coelicolor (411 aa), FASTA scores: opt: 694, E(): 3.3e-36, (32.35% identity in 417 aa overlap); Q9L465|CYP162A1|NIKQ from Streptomyces tendae (396 aa) FASTA scores: opt: 664, E(): 2.5e-34, (34.15% identity in 413 aa overlap); O08469|CPXY_BACSU|CYPA|CYP107J1 from Bacillus subtilis (410 aa), FASTA scores: opt: 579, E(): 5.6e-29, (30.05% identity in 366 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. Q50696|CYP124|Rv2266|MT2328|MTCY339.44c (428 aa) FASTA scores: opt: 1040, E(): 6.1e-58, (40.75% identity in 432 aa overlap). BELONGS TO THE CYTOCHROME P450 FAMILY." /codon_start=1 /transl_table=11 /product="cytochrome P450 125" /protein_id="NP_218062.1" /db_xref="GI:15610681" /db_xref="GeneID:887782" /translation="MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFA ELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRFKN DIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAA AGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSA ELIGYAMKMAEEKAKNPADDIVTQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNS ITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTALRDYELSGVQIKKG QRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFN AVADHMPDLKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH" gene 3985557..3986732 /gene="fadA5" /locus_tag="Rv3546" /db_xref="GeneID:887360" CDS 3985557..3986732 /gene="fadA5" /locus_tag="Rv3546" /EC_number="2.3.1.9" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION [CATALYTIC ACTIVITY: 2 ACETYL-CoA = CoA + ACETOACETYL-COA]." /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_218063.1" /db_xref="GI:15610682" /db_xref="GeneID:887360" /translation="MGYPVIVEATRSPIGKRNGWLSGLHATELLGAVQKAVVDKAGIQ SGLHAGDVEQVIGGCVTQFGEQSNNISRVAWLTAGLPEHVGATTVDCQCGSGQQANHL IAGLIAAGAIDVGIACGIEAMSRVGLGANAGPDRSLIRAQSWDIDLPNQFEAAERIAK RRGITREDVDVFGLESQRRAQRAWAEGRFDREISPIQAPVLDEQNQPTGERRLVFRDQ GLRETTMAGLGELKPVLEGGIHTAGTSSQISDGAAAVLWMDEAVARAHGLTPRARIVA QALVGAEPYYHLDGPVQSTAKVLEKAGMKIGDIDIVEINEAFASVVLSWARVHEPDMD RVNVNGGAIALGHPVGCTGSRLITTALHELERTDQSLALITMCAGGALSTGTIIERI" misc_feature 3986565..3986615 /gene="fadA5" /locus_tag="Rv3546" /note="PS00737 Thiolases signature 2." gene 3986844..3987299 /locus_tag="Rv3547" /db_xref="GeneID:887496" CDS 3986844..3987299 /locus_tag="Rv3547" /function="UNKNOWN" /note="Rv3547, (MTCY03C7.09c), len: 151 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. O85698|3SCF60.07 from Streptomyces lividans and Streptomyces coelicolor (149 aa), FASTA scores: opt: 353, E(): 6.3e-17, (42.55% identity in 134 aa overlap); Q9WX21|SCE68.11 from Streptomyces coelicolor (305 aa) FASTA scores: opt: 290, E(): 2.1e-12, (38.5% identity in 122 aa overlap) (similarity in N-terminus for this protein); BAB52932|Q988L5|MLL6688 from Rhizobium loti (Mesorhizobium loti) (148 aa), FASTA scores: opt: 105, E(): 3, (26.75% identity in 86 aa overlap). Also similar to mycobacterial hypothetical proteins e.g. Q9ZH81 from Mycobacterium paratuberculosis (144 aa), FASTA scores: opt: 366, E(): 8.2e-18, (43.9% identity in 123 aa overlap); and Q10772|YF58_MYCTU|Rv1558|MT1609|MTCY48.07c from Mycobacterium tuberculosis (148 aa), FASTA scores: opt: 330, E(): 2.2e-15, (39.75% identity in 151 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218064.1" /db_xref="GI:15610683" /db_xref="GeneID:887496" /translation="MPKSPPRFLNSPLSDFFIKWMSRINTWMYRRNDGEGLGGTFQKI PVALLTTTGRKTGQPRVNPLYFLRDGGRVIVAASKGGAEKNPMWYLNLKANPKVQVQI KKEVLDLTARDATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCEP" gene complement(3987382..3988296) /locus_tag="Rv3548c" /db_xref="GeneID:887452" CDS complement(3987382..3988296) /locus_tag="Rv3548c" /function="UNKNOWN; SUPPOSED INVOLVEMENT IN CELLULAR METABOLISM." /note="Rv3548c, (MTCY03C7.08), len: 304 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), highly similar to various dehydrogenases/reductases (generally belonging to the SDR FAMILY) e.g. Q9I4V1|PA1023 from Pseudomonas aeruginosa (305 aa), FASTA scores: opt: 446, E(): 1.7e-17, (43.75% identity in 256 aa overlap); Q9A6K0|CC2093 from Caulobacter crescentus (301 aa) FASTA scores: opt: 437, E(): 5.3e-17, (42.8% identity in 257 aa overlap); Q9HYH8|PA3427 from Pseudomonas aeruginosa (303 aa), FASTA scores: opt: 399, E(): 6.5e-15, (45.5% identity in 257 aa overlap); Q9VXJ0|CG3415 from Drosophila melanogaster (Fruit fly) (598 aa), FASTA scores: opt: 402, E(): 7.5e-15, (40.7% identity in 285 aa overlap); etc. Also highly similar to O53547|Rv3502c|MTV023.09c PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from (317 aa) FASTA scores: opt: 739, E(): 1.6e-33, (45.15% identity in 310 aa overlap); and other proteins from Mycobacterium tuberculosis. Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_218065.1" /db_xref="GI:15610684" /db_xref="GeneID:887452" /translation="MGLVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDIGVGLDG SPASGGSAAQDVVDEILAAGGQAVADGSDISDWDQAANLIQAAVETYGGVDVLVNNAG IVRDRMIANTSEEEFDAVIAVHLKGHFATMRHAASHWRGLSKAGKAPKDIDARIINTS SGAGLQGSVGQGNYSAAKAGIAALTLVGAAEMRRYGVTVNAIAPAARTRMTETVFAEM MAKPQEGFDAMAPENVSPLVVWLGSAESRDVTGKVFEVEGGIIRVAEGWAHGPQVDKG VKWDPAELGPVVSDLLAKSRPPVPVYGA" misc_feature complement(3987730..3987816) /locus_tag="Rv3548c" /note="PS00061 Short-chain alcohol dehydrogenase family signature." gene complement(3988319..3989098) /locus_tag="Rv3549c" /db_xref="GeneID:887849" CDS complement(3988319..3989098) /locus_tag="Rv3549c" /function="UNKNOWN; SUPPOSED INVOLVEMENT IN CELLULAR METABOLISM." /note="Rv3549c, (MTCY03C7.07), len: 259 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), similar to various dehydrogenases/reductases (generally belong to the SDR FAMILY) e.g. Q9UKU3 from Homo sapiens (Human) (270 aa), FASTA scores: opt: 451, E(): 4.8e-21, (38.05% identity in 247 aa overlap); Q9S274|SCI28.09c from Streptomyces coelicolor (234 aa), FASTA scores: opt: 439, E(): 2.4e-20, (36.8% identity in 231 aa overlap); Q9PFI6|XF0671 from Xylella fastidiosa (247 aa), FASTA scores: opt: 437, E(): 3.4e-20, (37.7% identity in 252 aa overlap); etc. Also highly similar to O33308|FABG5|Rv2766c|MTV002.31c ALCOHOL DEHYDROGENASE (SDR FAMILY) from Mycobacterium tuberculosis (260 aa), FASTA scores: opt: 504, E(): 2.3e-24, (38.5% identity in 244 aa overlap). Contains PS00061 Short-chain alcohol dehydrogenase family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_218066.1" /db_xref="GI:15610685" /db_xref="GeneID:887849" /translation="MTLAEAADAINFGLAGRVVLVTGGVRGVGAGISSVFAEQGATVI TCARRAVDGQPYEFHRCDIRDEDSVKRLVGEIGERHGRLDMLVNNAGGSPYALAAEAT HNFHRKIVELNVLAPLLVSQHANVLMQAQPNGGSIVNICSVSGRRPTPGTAAYGAAKA GLENLTTTLAVEWAPKVRVNAVVVGMVETERSELFYGDAESIARVAATVPLGRLARPA DIGWAAAFLASDAASYISGATLEVHGGGEPPPYLGASSANK" misc_feature complement(3988589..3988675) /locus_tag="Rv3549c" /note="PS00061 Short-chain alcohol dehydrogenase family signature." gene 3989153..3989896 /gene="echA20" /locus_tag="Rv3550" /db_xref="GeneID:888232" CDS 3989153..3989896 /gene="echA20" /locus_tag="Rv3550" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_218067.1" /db_xref="GI:15610686" /db_xref="GeneID:888232" /translation="MPITSTTPEPGIVAVTVDYPPVNAIPSKAWFDLADAVTAAGANS DTRAVILRAEGRGFNAGVDIKEMQRTEGFTALIDANRGCFAAFRAVYECAVPVIAAVN GFCVGGGIGLVGNSDVIVASEDATFGLPEVERGALGAATHLSRLVPQHLMRRLFFTAA TVDAATLQHFGSVHEVVSRDQLDEAALRVARDIAAKDTRVIRAAKEALNFIDVQRVNA SYRMEQGFTFELNLAGVADEHRDAFVKKS" gene 3989896..3990774 /locus_tag="Rv3551" /db_xref="GeneID:887235" CDS 3989896..3990774 /locus_tag="Rv3551" /EC_number="2.8.3.-" /function="UNKNOWN. PROBABLE SUBUNIT OF A COA-TRANSFERASE COMPOSED OF RV3551|MTCY03C7.05c AND RV3552|MTCY03C7.03c." /note="Rv3551, (MTCY03C7.05c), len: 292 aa. Possible CoA-transferase, alpha subunit (EC 2.8.3.-), similar in part to other CoA-transferases e.g. Q59111|GCTA_ACIFE|GCTA GLUTACONATE COA-TRANSFERASE SUBUNIT A (EC 2.8.3.12) (GCT LARGE SUBUNIT) from Acidaminococcus fermentans (319 aa) FASTA scores: opt: 247, E(): 6.3e-09, (27.35% identity in 307 aa overlap); Q9XD83|PCAI from Streptomyces sp. 2065 (251 aa), FASTA scores: opt: 222, E(): 2.3e-07, (27.55% identity in 243 aa overlap); BAB50895|MLL4183 from Rhizobium loti (Mesorhizobium loti) (285 aa), FASTA scores: opt: 206, E(): 2.8e-06, (27.4% identity in 281 aa overlap); etc. Also some similarity with O06167|SCOA_MYCTU|RVv504c|MT2579|MTCY07A7.10c PROBABLE SUCCINYL-COA:3-KETOACID-COENZYME A TRANSFERASE SUBUNIT A from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 210, E(): 1.4e-06, (25.5% identity in 247 aa overlap). BELONGS TO THE GLUTACONATE COA-TRANSFERASE SUBUNIT A FAMILY. Note that this putative protein may combine with the putative protein encoded by the downstream ORF Rv3552 to form a CoA-transferase that comprises two subunits." /codon_start=1 /transl_table=11 /product="CoA-transferase subunit alpha" /protein_id="NP_218068.1" /db_xref="GI:15610687" /db_xref="GeneID:887235" /translation="MPDKRTALDDAVAQLRSGMTIGIAGWGSRRKPMAFVRAILRSDV TDLTVVTYGGPDLGLLCSAGKVKRVYYGFVSLDSPPFYDPWFAHARTSGAIEAREMDE GMLRCGLQAAAQRLPFLPIRAGLGSSVPQFWAGELQTVTSPYPAPGGGYETLIAMPAL RLDAAFAHLNLGDSHGNAAYTGIDPYFDDLFLMAAERRFLSVERIVATEELVKSVPPQ ALLVNRMMVDAIVEAPGGAHFTTAAPDYGRDEQFQRHYAEAASTQVGWQQFVHTYLSG TEADYQAAVHNFGASR" gene 3990771..3991523 /locus_tag="Rv3552" /db_xref="GeneID:887453" CDS 3990771..3991523 /locus_tag="Rv3552" /EC_number="2.8.3.-" /function="UNKNOWN. PROBABLE SUBUNIT OF A COA-TRANSFERASE COMPOSED OF RV3551|MTCY03C7.05c AND RV3552|MTCY03C7.03c." /note="Rv3552, (MTCY03C7.03c), len: 250 aa. Possible CoA-transferase, beta subunit (EC 2.8.3.-), similar in part to other CoA-transferases e.g. Q9I6R1|PA0227 from Pseudomonas aeruginosa (260 aa), FASTA scores: opt: 233, E(): 8.6e-08, (24.8% identity in 238 aa overlap); BAB50894|MLL4181 from Rhizobium loti (Mesorhizobium loti) (264 aa), FASTA scores: opt: 210, E(): 2.6e-06, (24.15% identity in 203 aa overlap); and AAK41345|Q97Z51|GCTB from Sulfolobus solfataricus (245 aa), FASTA scores: opt: 122, E(): 1.1, (25.5% identity in 243 aa overlap). POSSIBLY BELONGS TO THE GLUTACONATE COA-TRANSFERASE SUBUNIT B FAMILY. Note that this putative protein may combine with the putative protein encoded by the upstream ORF Rv3551 to form a CoA-transferase that comprises two subunits." /codon_start=1 /transl_table=11 /product="CoA-transferase subunit beta" /protein_id="NP_218069.1" /db_xref="GI:15610688" /db_xref="GeneID:887453" /translation="MSTRAEVCAVACAELFRDAGEIMISPMTNMASVGARLARLTFAP DILLTDGEAQLLADTPALGKTGAPNRIEGWMPFGRVFETLAWGRRHVVMGANQVDRYG NQNISAFGPLQRPTRQMFGVRGSPGNTINHATSYWVGNHCKRVFVEAVDVVSGIGYDK VDPDNPAFRFVNVYRVVSNLGVFDFGGPDHSMRAVSLHPGVTPGDVRDATSFEVHDLD AAEQTRLPTDDELHLIRAVIDPKSLRDREIRS" repeat_region complement(3991568..3991625) /note="58 bp Mycobacterial Interspersed Repetitive Unit, Class III." gene 3991621..3992688 /locus_tag="Rv3553" /db_xref="GeneID:887190" CDS 3991621..3992688 /locus_tag="Rv3553" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3553, (MTCY03C7.02c), len: 355 aa. Possible oxidoreductase (EC 1.-.-.-), highly similar (except in C-terminus) to Q9A327|CC3379 HYPOTHETICAL PROTEIN from Caulobacter crescentus (321 aa), FASTA scores: opt: 639, E(): 4.6e-29, (46.35% identity in 248 aa overlap); and Q9WZQ7|TM0800 CONSERVED HYPOTHETICAL PROTEIN from Thermotoga maritima (314 aa), FASTA scores: opt: 622, E(): 4.1e-28, (37.95% identity in 340 aa overlap). Also similar to two TRANS-2-ENOYL-ACP REDUCTASES; Q99YD4|FABK|SPY1751 from Streptococcus pyogenes (323 aa), FASTA scores: opt: 604, E(): 4.4e-27, (33.25% identity in 346 aa overlap); and Q9FBC5|FABK from Streptococcus pneumoniae (324 aa), FASTA scores: opt: 553, E(): 3.3e-24, (32.1% identity in 346 aa overlap); and similar with several 2-NITROPROPANE DIOXYGENASES, e.g. Q9F7P8 from uncultured proteobacterium EBAC31A08 (322 aa), FASTA scores: opt: 505, E(): 1.7e-21, (33.6% identity in 348 aa overlap); Q9FMG0 (alias AAK44141) from Arabidopsis thaliana (Mouse-ear cress) (333 aa), FASTA scores: opt: 489, E(): 1.4e-20, (33.15% identity in 341 aa overlap); O28109|AF2173 (NCD2) from Archaeoglobus fulgidus (274 aa), FASTA scores: opt: 456, E(): 8.9e-19, (36.3% identity in 237 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218070.1" /db_xref="GI:15610689" /db_xref="GeneID:887190" /translation="MRLRTPLTELIGIEHPVVQTGMGWVAGARLVSATANAGGLGILA SATMTLDELAAAITKVKAVTDKPFGVNIRADAADAGDRVELMIREGVRVASFALAPKQ QLIARLKEAGAVVIPSIGAAKHARKVAAWGADAMIVQGGEGGGHTGPVATTLLLPSVL DAVAGTGIPVIAAGGFFDGRGLAAALCYGAAGVAMGTRFLLTSDSTVPDAVKRRYLQA GLDGTVVTTRVDGMPHRVLRTELVEKLESGSRARGFAAALRNAGKFRRMSQMTWRSMI RDGLTMRHGKELTWSQVLMAANTPMLLKAGLVDGNTEAGVLASGQVAGILDDLPSCKE LIESIVLDAITHLQTASALVE" gene 3992685..3994742 /gene="fdxB" /locus_tag="Rv3554" /db_xref="GeneID:887221" CDS 3992685..3994742 /gene="fdxB" /locus_tag="Rv3554" /function="UNKNOWN; C-TERMINUS PROBABLY INVOLVED IN ELECTRON TRANSFER IN ONE OR SEVERAL METABOLIC REACTIONS." /note="Rv3554, (MTCY06G11.01, MTCY03C7.01c), len: 685 aa. Possible fdxB, two-domain protein, with ferredoxin reductase electron transfer component in C-terminal part (EC 1.-.-.-) and unknown function in N-terminal part. Indeed, N-terminal end is similar to O85832 HYPOTHETICAL 36.1 KDA PROTEIN from Sphingomonas aromaticivorans strain F199 (catabolic plasmid pNL1) (309 aa), FASTA scores: opt: 615, E(): 2.5e-30, (33.1% identity in 311 aa overlap); and P73428|SLL1468 HYPOTHETICAL 36.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (312 aa), FASTA scores: opt: 317, E(): 4.5e-12, (30.2% identity in 268 aa overlap). And C-terminal end is similar to Q9F9U6|PAAE protein involved in aerobic phenylacetate metabolism from Azoarcus evansii (360 aa), FASTA scores: opt: 935, E(): 7e-50, (43.85% identity in 351 aa overlap); CAC44653|PAAE|SCBAC17A6.08 PUTATIVE PHENYLACETIC ACID DEGRADATION NADH OXIDOREDUCTASE from Streptomyces coelicolor (368 aa), FASTA scores: opt: 93, E(): 9.5e-50, (41.95% identity in 372 aa overlap); Q9FA57|PACI FERREDOXIN from Azoarcus evansii (360 aa), FASTA scores: opt: 925, E(): 2.9e-49, (43.3% identity in 351 aa overlap); P76081|PAAE_ECOLI|B1392 PROBABLE PHENYLACETIC ACID DEGRADATION NADH OXIDOREDUCTASE from Escherichia coli strains K12 and W (356 aa), FASTA scores: opt: 910, E(): 2.4e-48, (43.05% identity in 353 aa overlap); Q9APJ6|PAAE ELECTRON TRANSFER PROTEIN (FRAGMENT) from Hyphomicrobium chloromethanicum (241 aa), FASTA scores: opt: 404, E(): 1.7e-17, (35.45% identity in 234 aa overlap); BAB51608|MLL5100 FERREDOXIN from Rhizobium loti (Mesorhizobium loti) (365 aa), FASTA scores: opt: 316, E(): 5.8e-12, (28.95% identity in 349 aa overlap); etc. C-terminus also similar to P96853|Rv3571|MTCY06G11.18 PUTATIVE ELECTRON TRANSFER PROTEIN from Mycobacterium tuberculosis (358 aa), FASTA scores: opt: 450, E(): 3.6e-20, (32.95% identity in 358 aa overlap). Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature. BELONGS TO THE 2FE2S PLANT-TYPE FERREDOXIN FAMILY. COFACTOR: BINDS A 2FE-2S CLUSTER (BY SIMILARITY)." /codon_start=1 /transl_table=11 /product="electron transfer protein FdxB" /protein_id="NP_218071.1" /db_xref="GI:15610690" /db_xref="GeneID:887221" /translation="MTDACQAEYAIAAMSTVEMDQAAPESAAHHPLPDPGESVPRLAL PTIGIFLATLTAFVGSTTAYISGWIPFWVTIPVNAAVTFVMFTVVHDASHYAISSIRW VNGLFGRLAWLFVGPVVAFPAFGYIHIQHHRHSNDDEQDPDTFASHGSLWVLPLRWSM VEYFYIKYYLPRGRSRPVIEVAETLVMMTLFLTGLIVAIVTGNFWTLAIVFLIPQRIG LTVLAWWFDWLPHHGLEDTQRSNRYRATRNRVGAEWLFTPVLLSQNYHLVHHLHPSVP FYRYLRTWRRNEEAYLERNAAISTVFGQQLNPDEYRQWKELNGRLARLLPVRMPARSS SPHAVLHRIPVASVDPITADATLVTFAVPEALRDAFRFEPGQHVTVRTDLGGQGIRRN YSICAPATRAQLRIAVKHIPGGAFSTFVANELKAGDVLELMTPTGRFGTPLDPLHRKH YVGLVAGSGITPVLSILATTLEIETESRFTLIYGNRTKESTMFRAELDRLESRYADRL EILHVLSSEPLHTPELRGRIDRDKLTRWLTSTLRPAGVDEWFICGPLAMATAVRETLI EHGVDSERIHLELFYGFDTPPATRPSYAGATVTFTLSGQRAIFDLVPGDSILEGALGL RSDAPYACMGGACGTCRAKLIEGNVEMDHNFALRKAELDAGYILTCQSHPTTPFVAVD YDA" misc_feature 3994578..3994604 /gene="fdxB" /locus_tag="Rv3554" /note="PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature." gene complement(3994830..3995699) /locus_tag="Rv3555c" /db_xref="GeneID:887952" CDS complement(3994830..3995699) /locus_tag="Rv3555c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3555c, (MTCY06G11.02c), len: 289 aa. Hypothetical protein, highly similar to others from Mycobacterium tuberculosis e.g. O53562|AL022022|Rv3517|MTV023.24 (279 aa), FASTA scores: opt: 874, E(): 8.3e-48, (49.45% identity in 275 aa overlap); P71763|Rv1482c|MTCY277.03c (339 aa), FASTA scores: opt: 755, E(): 3e-40, (45.75% identity in 260 aa overlap); O69681|Rv3714c|MTV025.062c (296 aa), FASTA scores: opt: 733, E(): 6.4e-39, (44.1% identity in 281 aa overlap); etc. Also highly similar to other mycobacterial hypothetical proteins e.g. O07396|MAV346 from Mycobacterium avium (346 aa), FASTA scores: opt: 714, E(): 1.1e-37, (44.6% identity in 260 aa overlap); and Q50134|U650AG|MLCB57.67c from Mycobacterium leprae (75 aa), FASTA scores: opt: 130, E(): 0.17, (35.1% identity in 57 aa overlap) (only partial homology with this protein). Shows some similarity to P52392|NHSR_STRAS PUTATIVE NOSIHEPTIDE RESISTANCE REGULATORY PROTEIN (ORF699) from Streptomyces actuosus (233 aa), FASTA scores: opt: 120, E(): 1.9, (25.25% identity in 194 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218072.1" /db_xref="GI:15610691" /db_xref="GeneID:887952" /translation="MDELPWPVLGSEVLAAKAIPERAMRQLYEPVYPGVYAPAGVELT ARQRAHAAWLWSRRRAVVAGNSAAALLGAKWVNPALDAELVHANRKPPPRIVVHTDRL APHETVAVDGVAVTTPARTAFDIGRRTPSRLQAVQRLDALANSTDVKVADVQAVIAEH TGARGLVRLRAVLPLIDGGAESPQETWTRLVLIDAGLPKPQTQIRVFDDYGDFVARID LGYEQLRVGVEYDGPQHWTDPAQRARDIERSTALLDLGWTIIRVTSELLWYRRGTFVG RVDAAMRAAGWRP" gene complement(3995804..3996964) /gene="fadA6" /locus_tag="Rv3556c" /db_xref="GeneID:887285" CDS complement(3995804..3996964) /gene="fadA6" /locus_tag="Rv3556c" /EC_number="2.3.1.9" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION [CATALYTIC ACTIVITY: 2 ACETYL-CoA = CoA + ACETOACETYL-COA]." /experiment="experimental evidence, no additional details recorded" /note="Catalyzes the synthesis of acetoacetyl coenzyme A from two molecules of acetyl coenzyme A. It can also act as a thiolase, catalyzing the reverse reaction and generating two-carbon units from the four-carbon product of fatty acid oxidation" /codon_start=1 /transl_table=11 /product="acetyl-CoA acetyltransferase" /protein_id="NP_218073.1" /db_xref="GI:15610692" /db_xref="GeneID:887285" /translation="MTEAYVIDAVRTAVGKRGGALAGIHPVDLGALAWRGLLDRTDID PAAVDDVIAGCVDAIGGQAGNIARLSWLAAGYPEEVPGVTVDRQCGSSQQAISFGAQA IMSGTADVIVAGGVQNMSQIPISSAMTVGEQFGFTSPTNESKQWLHRYGDQEISQFRG SELIAEKWNLSREEMERYSLTSHERAFAAIRAGHFENEIITVETESGPFRVDEGPRES SLEKMAGLQPLVEGGRLTAAMASQISDGASAVLLASERAVKDHGLRPRARIHHISARA ADPVFMLTGPIPATRYALDKTGLAIDDIDTVEINEAFAPVVMAWLKEIKADPAKVNPN GGAIALGHPLGATGAKLFTTMLGELERIGGRYGLQTMCEGGGTANVTIIERL" misc_feature complement(3995921..3995971) /gene="fadA6" /locus_tag="Rv3556c" /note="PS00737 Thiolases signature 2." gene complement(3997029..3997631) /locus_tag="Rv3557c" /db_xref="GeneID:887467" CDS complement(3997029..3997631) /locus_tag="Rv3557c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3557c, (MTCY06G11.04c), len: 200 aa. Probable transcriptional regulator, tetR family, similar to other e.g. Q9RRV9|DR2376 from Deinococcus radiodurans (197 aa) FASTA scores: opt: 326, E(): 2.3e-14, (31.2% identity in 189 aa overlap); Q9HZW2|PA2885 from Pseudomonas aeruginosa (198 aa), FASTA scores: opt: 308, E(): 3.5e-13, (31.55% identity in 187 aa overlap); Q9RFR4 from Pseudomonas fluorescens (207 aa), FASTA scores: opt: 291, E(): 4.7e-12, (29.75% identity in 195 aa overlap); Q9K8P5|BH2958 from Bacillus halodurans (215 aa), FASTA scores: opt: 271, E(): 9.9e-11, (23.95% identity in 192 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. O53641|Rv0158|MTV032.01 (214 aa), FASTA scores: opt: 232, E(): 3.5e-08, (25.5% identity in 192 aa overlap); and O06169|Rv2506|MTCY07A7.12 (215 aa), FASTA scores: opt: 215, E(): 4.5e-07, (35.15% identity in 148 aa overlap); etc. SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="TetR family transcriptional regulator" /protein_id="NP_218074.1" /db_xref="GI:15610693" /db_xref="GeneID:887467" /translation="MDRVAGQVNSRRGELLELAAAMFAERGLRATTVRDIADGAGILS GSLYHHFASKEEMVDELLRGFLDWLFARYRDIVDSTANPLERLQGLFMASFEAIEHHH AQVVIYQDEAQRLASQPRFSYIEDRNKQQRKMWVDVLNQGIEEGYFRPDLDVDLVYRF IRDTTWVSVRWYRPGGPLTAQQVGQQYLAIVLGGITKEGV" gene 3997980..3999638 /gene="PPE64" /locus_tag="Rv3558" /db_xref="GeneID:887822" CDS 3997980..3999638 /gene="PPE64" /locus_tag="Rv3558" /function="UNKNOWN" /note="Rv3558, (MTCY06G11.05), len: 552 aa. Member of the Mycobacterium tuberculosis PPE family of glycine-rich proteins, similar to many e.g. P71868|Rv3533c|MTCY03C7.23 (582 aa), FASTA scores: opt: 1908, E(): 1.7e-83, (58.5% identity in 583 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177988.1" /db_xref="GI:57117124" /db_xref="GeneID:887822" /translation="MAHFSVLPPEINSLRMYLGAGSAPMLQAAAAWDGLAAELGTAAS SFSSVTTGLTGQAWQGPASAAMAAAAAPYAGFLTTASAQAQLAAGQAKAVASVFEAAK AAIVPPAAVAANREAFLALIRSNWLGLNAPWIAAVESLYEEYWAADVAAMTGYHAGAS QAAAQLPLPAGLQQFLNTLPNLGIGNQGNANLGGGNTGSGNIGNGNKGSSNLGGGNIG NNNIGSGNRGSDNFGAGNVGTGNIGFGNQGPIDVNLLATPGQNNVGLGNIGNNNMGFG NTGDANTGGGNTGNGNIGGGNTGNNNFGFGNTGNNNIGIGLTGNNQMGINLAGLLNSG SGNIGIGNSGTNNIGLFNSGSGNIGVFNTGANTLVPGDLNNLGVGNSGNANIGFGNAG VLNTGFGNASILNTGLGNAGELNTGFGNAGFVNTGFDNSGNVNTGNGNSGNINTGSWN AGNVNTGFGIITDSGLTNSGFGNTGTDVSGFFNTPTGPLAVDVSGFFNTASGGTVING QTSGIGNIGVPGTLFGSVRSGLNTGLFNMGTAISGLFNLRQLLG" gene complement(3999647..4000435) /locus_tag="Rv3559c" /db_xref="GeneID:887897" CDS complement(3999647..4000435) /locus_tag="Rv3559c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3559c, (MTCY06G11.06c), len: 262 aa. Probable oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases e.g. Q9F5J1|SIM-NJ1|SIMD2 PUTATIVE 3-KETO-ACYL-REDUCTASE (SDR FAMILY) from Streptomyces antibioticus (273 aa), FASTA scores: opt: 510, E(): 2.8e-24, (40.15% identity in 249 aa overlap);Q9L2C9|SC7A8.29 PUTATIVE DEHYDROGENASE from Streptomyces coelicolor (255 aa), FASTA scores: opt: 500, E(): 1.1e-23, (41.4% identity in 239 aa overlap); Q9HQ41|FABG|VNG1341G 3-OXOACYL-[ACYL-CARRIER-PROTEIN] REDUCTASE from Halobacterium sp. strain NRC-1 (255 aa) FASTA scores: opt: 500, E(): 1.1e-23, (40.0% identity in 250 aa overlap); etc. Also similar to oxidoreductases from Mycobacterium tuberculosis eg Q11020|YD50_MYCTU|FABG2|Rv1350|MT1393|MTCY02B10.14 PUTATIVE OXIDOREDUCTASE (247 aa), FASTA scores: opt: 497, E(): 1.6e-23, (39.2% identity in 245 aa overlap)." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_218076.1" /db_xref="GI:15610695" /db_xref="GeneID:887897" /translation="MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGA DVVISDHHERRLGETAAELSALGLGRVEHVVCDVTSTAQVDALIDSTTARMGRLDVLV NNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGGVIVNNASVLG WRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSPSIARHKFLDKTASAELLD RLAAGEAFGRAAEPWEVAATIAFLASDYSSYLTGEVISVSCQHP" gene complement(4000432..4001589) /gene="fadE30" /locus_tag="Rv3560c" /db_xref="GeneID:887838" CDS complement(4000432..4001589) /gene="fadE30" /locus_tag="Rv3560c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3560c, (MTCY06G11.07c), len: 385 aa. Probable fadE30, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9I4V2|PA1022 from Pseudomonas aeruginosa (381 aa), FASTA scores: opt: 845, E(): 1.6e-47, (39.2% identity in 388 aa overlap); Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 734, E(): 2.8e-40, (35.5% identity in 386 aa overlap); Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa), FASTA scores: opt: 656, E(): 3.2e-35, (37.9% identity in 351 aa overlap); etc. Also similar to acyl-CoA dehydrogenases from Mycobacterium tuberculosis e.g. P95280|FADE17|Rv1934c|MTCY09F9.30 (409 aa), FASTA scores: opt: 939, E(): 1.4e-53, (43.8% identity in 404 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE30" /protein_id="NP_218077.1" /db_xref="GI:15610696" /db_xref="GeneID:887838" /translation="MQDVEEFRAQVRGWLADNLAGEFAALKGLGGPGREHEAFEERRA WNQRLAAAGLTCLGWPEEHGGRGLSTAHRVAFYEEYARADAPDKVNHFGEELLGPTLI AFGTPQQQRRFLPRIRDVTELWCQGYSEPGAGSDLASVATTAELDGDQWVINGQKVWT SLAHLSQWCFVLARTEKGSQRHAGLSYLLVPLDQPGVQIRPIVQITGTAEFNEVFFDD ARTDADLVVGAPGDGWRVAMATLTFERGVSTLGQQIVYARELSNLVELARRTAAADDP LIRERLTRAWTGLRAMRSYALATMEGPAVEQPGQDNVSKLLWANWHRNLGELAMDVIG KPGMTMPDGEFDEWQRLYLFTRADTIYGGSNEIQRNIIAERVLGLPREAKG" gene 4001637..4003160 /gene="fadD3" /locus_tag="Rv3561" /db_xref="GeneID:887244" CDS 4001637..4003160 /gene="fadD3" /locus_tag="Rv3561" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--CoA ligase" /protein_id="NP_218078.1" /db_xref="GI:15610697" /db_xref="GeneID:887244" /translation="MINDLRTVPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGA AAALIALGVEPADRVAIWSPNTWHWVVACLAIHHAGAAVVPLNTRYTATEATDILDRA GAPVLFAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIATGAGALDAVA ARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASASWAANGKITSDDRYLCINP FFHNFGYKAGILACLQTGATLIPHVTFDPLHALRAIERHRITVLPGPPTIYQSLLDHP ARKDFDLSSLRFAVTGAATVPVVLVERMQSELDIDIVLTAYGLTEANGMGTMCRPEDD AVTVATTCGRPFADFELRIADDGEVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDI GAVDQAGNLRITDRLKDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVG RAFVVARPGTGLDEASVIAYTREHLANFKTPRSVRFVDVLPRNAAGKVSKPQLRELG" misc_feature 4002156..4002191 /gene="fadD3" /locus_tag="Rv3561" /note="PS00455 Putative AMP-binding domain signature." gene 4003161..4004294 /gene="fadE31" /locus_tag="Rv3562" /db_xref="GeneID:887884" CDS 4003161..4004294 /gene="fadE31" /locus_tag="Rv3562" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3562, (MTCY06G11.09), len: 377 aa. Probable fadE31, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa), FASTA scores: opt: 657, E(): 1.7e-34, (36.45% identity in 351 aa overlap); Q9A5G9|CC2478 from Caulobacter crescentus (407 aa), FASTA scores: opt: 653, E(): 3.2e-34, (33.95% identity in 392 aa overlap); Q9EX72|MLHC from Rhodococcus erythropolis (324 aa) FASTA scores: opt: 631, E(): 6.5e-33, (36.95% identity in 330 aa overlap); P45867|ACDA_BACSU|ACD from Bacillus subtilis (379 aa), FASTA scores: opt: 347, E(): 1e-15, (28.6% identity in 385 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis e.g. P96842|FADE30|Rv3560c|MTCY06G11.07c (385 aa), FASTA scores: opt: 843, E(): 2.3e-46, (38.95% identity in 380 aa overlap). COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE31" /protein_id="NP_218079.1" /db_xref="GI:15610698" /db_xref="GeneID:887884" /translation="MDLNFDDETLAFQAEVREFLAANAASIPTKSYDNAEGFAQHRYW DRVLFDAGLSVITWPAKYGGRDAPLLHWIVFEEEYFRAGAPGRASANGTSMLAPTLFA HGTAEQLDRILPKMASGEQIWAQAWSEPESGSDLASLRSTASKVDGGWLLNGQKIWSS RAPFADMGFGLFRSDPAVERHRGLTYFMFDLKAKGVTVRPIAQLGGDTGFGEIFLDDV FVPDRDVIGAPNDGWRAAMSTSSNERGMSLRSPARFLASAERLVQLWKDRGSPPEFAD RVADAWIKAQAYRLQTFGTVTRLAAGGELGAESSVTKVFWSELDVHLHQTALDLRGAD GELAGPWTEGLLFALGGPIYAGTNEIQRNIIAERLLGLPREKT" gene 4004291..4005250 /gene="fadE32" /locus_tag="Rv3563" /db_xref="GeneID:887818" CDS 4004291..4005250 /gene="fadE32" /locus_tag="Rv3563" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3563, (MTCY06G11.10), len: 319 aa. Probable fadE32, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9I4V4|PA1020 from Pseudomonas aeruginosa (370 aa), FASTA scores: opt: 347, E(): 7.6e-14, (35.15% identity in 333 aa overlap); Q9RJX3|SCF37.28c from Streptomyces coelicolor (362 aa), FASTA scores: opt: 300, E(): 5.3e-11, (32.4% identity in 349 aa overlap); Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 285, E(): 4.1e-10, (30.4% identity in 329 aa overlap); P45857|ACDB_BACSU|MMGC from Bacillus subtilis (379 aa), FASTA scores: opt: 230, E(): 1.1e-07, (25.5% identity in 357 aa overlap); etc. Also similar to other from Mycobacterium tuberculosis eg P96846|FADE33|Rv3564|MTCY06G11.11 (318 aa), FASTA scores: opt: 478, E(): 7.6e-22, (32.9% identity in 292 aa overlap). COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE32" /protein_id="NP_218080.1" /db_xref="GI:15610699" /db_xref="GeneID:887818" /translation="MTMEFALNEQQRDFAASIDAALGAADLPGVVRAWAAGDVAPGRK VWQQLANLGVTALGVAEKFDGLGASPVDLVVALERLGRWCVPGPVTESIAVAPILLAH DDQAERSHGLASGELIATVAMPPRVPRAVDADTAGLVLLAGDGSVTEGTPGDCHRSVD PSRRLYEVAASGQAWRAPKDVVARAYEFGALATAAQLVGAGQALLEAAVNYAKQRTQF GRAIGSYQAIKHKLADVHIAIELACPLVYGAAVSLEPRDVSAAKAAASEAALLAARWA LQTHGAIGFTCEHDLSLWLLRVQALHSAWGTPQEHRRRVLEAL" gene 4005247..4006203 /gene="fadE33" /locus_tag="Rv3564" /db_xref="GeneID:888458" CDS 4005247..4006203 /gene="fadE33" /locus_tag="Rv3564" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3564, (MTCY06G11.11), len: 318 aa. Probable fadE33, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to others e.g. Q9A5G8|CC2479 from Caulobacter crescentus (344 aa), FASTA scores: opt: 373, E(): 1.9e-15, (34.3% identity in 338 aa overlap); Q9I4V4|PA1020 from Pseudomonas aeruginosa (370 aa), FASTA scores: opt: 277, E(): 1.4e-09, (31.95% identity in 335 aa overlap); Q9X7Y6|SC6A5.40c from Streptomyces coelicolor (395 aa), FASTA scores: opt: 273, E(): 2.5e-09, (30.1% identity in 352 aa overlap); P45857|ACDB_BACSU|MMGC from Bacillus subtilis (379 aa), FASTA scores: opt: 478, E(): 7.9e-22, (32.9% identity in 292 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. P96845|FADE32|Rv3563|MTCY06G11.10 (319 aa), FASTA scores: opt: 478, E(): 7.9e-22, (32.9% identity in 292 aa overlap). COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE33" /protein_id="NP_218081.1" /db_xref="GI:15610700" /db_xref="GeneID:888458" /translation="MTPPEERQMLRETVASLVAKHAGPAAVRAAMASDRGYDESLWRL LCEQVGAAALVIPEELGGAGGELADAAIVVQELGRALVPSPLLGTTLAELALLAAAKP DAQALTELAQGSAIGALVLDPDYVVNGDIADIVVAATSGQLTRWTRFSAQPVATMDPT RRLARLQSEETEPLCPDPGIADTAAILLAAEQIGAAERCLQLTVEYAKSRVQFGRPIG SFQALKHRMADLYVTIAAARAVVADACHAPTPTNAATARLAASEALSTAAAEGIQLHG GIAITWEHDMHLYFKRAHGSAQLLESPREVLRRLESEVWESP" gene 4006200..4007366 /gene="aspB" /locus_tag="Rv3565" /db_xref="GeneID:888305" CDS 4006200..4007366 /gene="aspB" /locus_tag="Rv3565" /EC_number="2.6.1.1" /function="THOUGHT TO BE INVOLVED IN GLUTAMATE BIOSYNTHESIS [CATALYTIC ACTIVITY: L-ASPARTATE + 2-OXOGLUTARATE = OXALOACETATE + L-GLUTAMATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of oxalozcetate and L-glutamate from L-aspartate and 2-oxoglutarate" /codon_start=1 /transl_table=11 /product="aspartate aminotransferase" /protein_id="NP_218082.1" /db_xref="GI:15610701" /db_xref="GeneID:888305" /translation="MTDRVALRAGVPPFYVMDVWLAAAERQRTHGDLVNLSAGQPSAG APEPVRAAAAAALHLNQLGYSVALGIPELRDAIAADYQRRHGITVEPDAVVITTGSSG GFLLAFLACFDAGDRVAMASPGYPCYRNILSALGCEVVEIPCGPQTRFQPTAQMLAEI DPPLRGVVVASPANPTGTVIPPEELAAIASWCDASDVRLISDEVYHGLVYQGAPQTSC AWQTSRNAVVVNSFSKYYAMTGWRLGWLLVPTVLRRAVDCLTGNFTICPPVLSQIAAV SAFTPEATAEADGNLASYAINRSLLLDGLRRIGIDRLAPTDGAFYVYADVSDFTSDSL AFCSKLLADTGVAIAPGIDFDTARGGSFVRISFAGPSGDIEEALRRIGSWLPSQ" misc_feature 4006890..4006931 /gene="aspB" /locus_tag="Rv3565" /note="PS00105 Aminotransferases class-I pyridoxal-phosphate attachment site." gene complement(4007331..4008182) /gene="nat" /locus_tag="Rv3566c" /db_xref="GeneID:888005" CDS complement(4007331..4008182) /gene="nat" /locus_tag="Rv3566c" /EC_number="2.3.1.5" /function="COULD HAVE A ROLE IN ACETYLATING, AND HENCE INACTIVATING, THE ANTITUBERCULAR DRUG ISONIAZID [CATALYTIC ACTIVITY: ACETYL-CoA + ARYLAMINE = CoA + N-ACETYLARYLAMINE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3566c, (MT3671, MTCY06G11.13c), len: 283 aa. nat (alternate gene name: nhoA), arylamine N-acetyltransferase (EC 2.3.1.5) (see citations below), highly similar to O86309|NAT_MYCSM ARYLAMINE N-ACETYLTRANSFERASE from Mycobacterium smegmatis (see citation below) (275 aa), FASTA scores: opt: 1114, E(): 3e-66, (60.95% identity in 274 aa overlap). Also highly similar to others e.g. Q98D42|BAB51429|MLR4870 from Rhizobium loti (Mesorhizobium loti) (278 aa), FASTA scores: opt: 697, E(): 1.1e-38, (44.1% identity in 272 aa overlap); P77567|NHOA_ECOLI|B1463 from Escherichia coli strain K12 (281 aa), FASTA scores: opt: 537, E(): 4.4e-28, (38.85% identity in 273 aa overlap); Q00267|NHOA_SALTY from Salmonella typhimurium (281 aa), FASTA scores: opt: 507, E(): 4.3e-26, (34.8% identity in 273 aa overlap); etc. BELONGS TO THE ARYLAMINE N-ACETYLTRANSFERASE FAMILY. Note that previously known as nhoA (332 aa) and that nucleotide 4007874 has been changed since first submission (G deleted).; nhoA" /codon_start=1 /transl_table=11 /product="arylamine n-acetyltransferase nat (arylamine acetylase)" /protein_id="YP_177989.1" /db_xref="GI:57117125" /db_xref="GeneID:888005" /translation="MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPL LGVPVDDLSPQALADKLVLRRRGGYCFEHNGLMGYVLAELGYRVRRFAARVVWKLAPD APLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTTHEPYRLEDRV DGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHPASKFVTGLTAAVITDDAR WNLSGRDLAVHRAGGTEKIRLADAAAVVDTLSERFGINVADIGERGALETRIDELLAR QPGADAP" gene complement(4008167..4008433) /locus_tag="Rv3566A" /db_xref="GeneID:3205089" CDS complement(4008167..4008433) /locus_tag="Rv3566A" /function="UNKNOWN" /note="Rv3566A, len: 88 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_177990.1" /db_xref="GI:57117126" /db_xref="GeneID:3205089" /translation="MSGADPPTRRAFGQMARAATGWVSVSGQFAVAADTCRCEGTLFA VDPETHVANHNRCDIVGRLRDERPNTLRSVRRGDEVRMATWHWI" gene complement(4008719..4009282) /locus_tag="Rv3567c" /db_xref="GeneID:887525" CDS complement(4008719..4009282) /locus_tag="Rv3567c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3567c, (MTCY06G11.14c), len: 187 aa. Possible oxidoreductase (EC 1.-.-.-), similar to various oxidoreductases and hypothetical proteins e.g. O69360 ORF61 PROTEIN from Rhodococcus erythropolis (194 aa) FASTA scores: opt: 974, E(): 3e-59, (77.05% identity in 183 aa overlap); Q9JN75|MMYF PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (174 aa), FASTA scores: opt: 451, E(): 1e-23, (43.65% identity in 158 aa overlap); P54990|NTAB_CHEHE|NMOB NITRILOTRIACETATE MONOOXYGENASE COMPONENT B (EC 1.14.13.-) from Chelatobacter heintzii (322 aa), FASTA scores: opt: 409, E(): 1.3e-20, (38.3% identity in 167 aa overlap)Chelatobacter heintzii; AAK62356 PUTATIVE NADH:FMN OXIDOREDUCTASE from Burkholderia sp. DBT1 (177 aa), FASTA scores: opt: 360, E(): 1.6e-17, (36.15% identity in 155 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218084.1" /db_xref="GI:15610703" /db_xref="GeneID:887525" /translation="MSAQIDPRTFRSVLGQFCTGITVITTVHDDVPVGFACQSFAALS LEPPLVLFCPTKVSRSWQAIEASGRFCVNVLTEKQKDVSARFGSKEPDKFAGIDWRPS ELGSPIIEGSLAYIDCTVASVHDGGDHFVVFGAVESLSEVPAVKPRPLLFYRGDYTGI EPEKTTPAHWRDDLEAFLITTTQDTWL" gene complement(4009297..4010199) /gene="bphC" /locus_tag="Rv3568c" /db_xref="GeneID:887886" CDS complement(4009297..4010199) /gene="bphC" /locus_tag="Rv3568c" /EC_number="1.13.11.39" /function="INVOLVED IN THE DEGRADATION OF BIPHENYL [CATALYTIC ACTIVITY: BIPHENYL-2,3-DIOL + O(2) = 2-HYDROXY-6-OXO-6-PHENYLHEXA-2,4-DIENOATE + H(2)O]." /note="Rv3568c, (MTCY06G11.15c), len: 300 aa. Probable bphC, 2,3-dihydroxybiphenyl 1,2-dioxygenase (EC 1.13.11.39), highly similar to other e.g. Q9KWQ5|BPHC5 from Rhodococcus sp. RHA1 (300 aa), FASTA scores: opt: 1715, E(): 3.8e-103, (82.15% identity in 297 aa overlap); O50479|EDOB from Rhodococcus rhodochrous (300 aa) FASTA scores: opt: 1714, E(): 4.4e-103, (82.5% identity in 297 aa overlap); O69359|BPHC6 from Rhodococcus erythropolis (300 aa), FASTA scores: opt: 1647, E(): 9.1e-99, (78.25% identity in 299 aa overlap); Q9RBT2|BPHC1 from Pseudomonas sp. SY5 (301 aa) Pseudomonas sp. SY5 (298 aa) FASTA scores: opt: 767, E(): 3.9e-42, (42.8% identity in 299 aa overlap); P47228|BPHC_BURCE from Burkholderia cepacia (Pseudomonas cepacia) (297 aa), FASTA scores: opt: 670, E(): 6.8e-36, (40.55% identity in 296 aa overlap); etc. Contains PS00082 Extradiol ring-cleavage dioxygenases signature. BELONGS TO THE EXTRADIOL RING-CLEAVAGE DIOXYGENASE FAMILY." /codon_start=1 /transl_table=11 /product="biphenyl-2,3-diol 1,2-dioxygenase" /protein_id="NP_218085.1" /db_xref="GI:15610704" /db_xref="GeneID:887886" /translation="MSIRSLGYLRIEATDMAAWREYGLKVLGMVEGKGAPEGALYLRM DDFPARLVVVPGEHDRLLEAGWECANAEGLQEIRNRLDLEGTPYKEATAAELADRRVD EMIRFADPSGNCLEVFHGTALEHRRVVSPYGHRFVTGEQGMGHVVLSTRDDAEALHFY RDVLGFRLRDSMRLPPQMVGRPADGPPAWLRFFGCNPRHHSLAFLPMPTSSGIVHLMV EVEQADDVGLCLDRALRRKVPMSATLGRHVNDLMLSFYMKTPGGFDIEFGCEGRQVDD RDWIARESTAVSLWGHDFTVGARG" misc_feature complement(4009402..4009467) /gene="bphC" /locus_tag="Rv3568c" /note="PS00082 Extradiol ring-cleavage dioxygenases signature." gene complement(4010196..4011071) /gene="bphD" /locus_tag="Rv3569c" /db_xref="GeneID:887378" CDS complement(4010196..4011071) /gene="bphD" /locus_tag="Rv3569c" /EC_number="3.7.1.-" /function="INVOLVED IN THE DEGRADATION OF BIPHENYL." /note="Rv3569c, (MTCY06G11.16c), len: 291 aa. Probable bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase (EC 3.7.1.-), highly similar to others e.g. Q9KWQ6|BPHD2 from Rhodococcus sp. RHA1 (292 aa), FASTA scores: opt: 1468, E(): 1.3e-85, (75.5% identity in 294 aa overlap); Q52036 from Pseudomonas putida (286 aa), FASTA scores: opt: 785, E(): 1.9e-42, (45.1% identity in 295 aa overlap); Q52011|BPHD from Pseudomonas pseudoalcaligenes (286 aa), FASTA scores: opt: 774, E(): 9.3e-42, (44.05% identity in 295 aa overlap); P47229|BPHD_BURCE from Burkholderia cepacia (Pseudomonas cepacia) (286 aa) FASTA scores: opt: 772, E(): 1.2e-41, (44.5% identity in 295 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A. SIMILAR TO ALPHA/BETA HYDROLASE FOLD." /codon_start=1 /transl_table=11 /product="2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase BphD" /protein_id="NP_218086.1" /db_xref="GI:15610705" /db_xref="GeneID:887378" /translation="MTATEELTFESTSRFAEVDVDGPLKLHYHEAGVGNDQTVVLLHG GGPGAASWTNFSRNIAVLARHFHVLAVDQPGYGHSDKRAEHGQFNRYAAMALKGLFDQ LGLGRVPLVGNSLGGGTAVRFALDYPARAGRLVLMGPGGLSINLFAPDPTEGVKRLSK FSVAPTRENLEAFLRVMVYDKNLITPELVDQRFALASTPESLTATRAMGKSFAGADFE AGMMWREVYRLRQPVLLIWGREDRVNPLDGALVALKTIPRAQLHVFGQCGHWVQVEKF DEFNKLTIEFLGGGR" misc_feature complement(4010439..4010462) /gene="bphD" /locus_tag="Rv3569c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene complement(4011086..4012270) /locus_tag="Rv3570c" /db_xref="GeneID:887241" CDS complement(4011086..4012270) /locus_tag="Rv3570c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3570c, (MTCY06G11.17c), len: 394 aa. Possible oxidoreductase (EC 1.-.-.-), most similar to hydroxylases and oxygenases (and also some similarity to acyl-coa dehydrogenases) e.g. O69349 HYDROXYLASE from Rhodococcus erythropolis (393 aa), FASTA scores: opt: 958, E(): 1.1e-53, (39.95% identity in 383 aa overlap); P26698|PIGM_RHOSO PIGMENT PROTEIN from Rhodococcus sp. strain ATCC 21145 (387 aa), FASTA scores: opt: 665, E(): 5.4e-35, (32.2% identity in 382 aa overlap); Q9ZGA9|LANZ5 OXYGENASE HOMOLOG from Streptomyces cyanogenus (397 aa) FASTA scores: opt: 588, E(): 4.5e-30, (30.55% identity in 386 aa overlap); Q9F0J3|NCNH HYDROXYLASE from Streptomyces arenae (405 aa), FASTA scores: opt: 580, E(): 1.5e-29, (31.25% identity in 336 aa overlap); O69789|BPFA INDOLE DIOXYGENASE from Rhodococcus opacus (399 aa), FASTA scores: opt: 558, E(): 3.7e-28, (31.8% identity in 387 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218087.1" /db_xref="GI:15610706" /db_xref="GeneID:887241" /translation="MTSIQQRDAQSVLAAIDNLLPEIRDRAQATEDLRRLPDETVKAL DDVGFFTLLQPQQWGGLQCDPALFFEATRRLASVCGSTGWVSSIVGVHNWHLALFDQR AQEEVWGEDPSTRISSSYAPMGAGVVVDGGYLVNGSWNWSSGCDHASWTFVGGPVIKD GRPVDFGSFLIPRSEYEIKDVWYVVGLRGTGSNTLVVKDVFVPRHRFLSYKAMNDHTA GGLATNSAPVYKMPWGTMHPTTISAPIVGMAYGAYAAHVEHQGKRVRAAFAGEKAKDD PFAKVRIAEAASDIDAAWRQLIGNVSDEYALLAAGKEIPFELRARARRDQVRATGRSI ASIDRLFEASGATALSNEAPIQRFWRDAHAGRVHAANDPERAYVIFGNHEFGLPPGDT MV" gene 4012417..4013493 /gene="hmp" /locus_tag="Rv3571" /db_xref="GeneID:887315" CDS 4012417..4013493 /gene="hmp" /locus_tag="Rv3571" /function="MAY PLAY A ROLE IN PROTECTION FROM OXIDATIVE (NITRIC OXIDE) AND NITROSATIVE STRESS. MAY ALSO BE INVOLVED IN ANAEROBIC METABOLISM. COULD HAVE NITRIC OXIDE DIOXYGENASE ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="Rv3571, (MTCY06G11.18), len: 358 aa. Possible hmp, oxidoreductase, hemoglobine-related protein (see citation below) (EC 1.-.-.-), similar to several e.g. Q44253|ATDA5 ANILINE DIOXYGENASE REDUCTASE COMPONENT from Acinetobacter sp (336 aa) FASTA scores: opt: 748, E(): 1.5e-38, (34.95% identity in 346 aa overlap); P95533|TDNB ELECTRON TRANSFER PROTEIN from Pseudomonas putida (337 aa), FASTA scores: opt: 723, E(): 5.2e-37, (36.35% identity in 341 aa overlap); AAK65059|SMA0752 POSSIBLE DIOXYGENASE REDUCTASE SUBUNIT from Rhizobium meliloti (Sinorhizobium meliloti) (353 aa) FASTA scores: opt: 495, E(): 4.9e-23, (31.9% identity in 345 aa overlap); P76081|PAAE_ECOLI|B1392 PROBABLE PHENYLACETIC ACID DEGRADATION NADH OXIDOREDUCTASE (356 aa), FASTA scores: opt: 364, E(): 5.1e-15, (34.45% identity in 357 aa overlap); Q9L131|HMPA FLAVOHEMOPROTEIN from Streptomyces coelicolor (398 aa), FASTA scores: opt: 352, E(): 3e-14, (32.8% identity in 247 aa overlap); etc. Contains PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature. Note that it has been shown hmp transcription increased at early stationary phase and is lower at late stationary phase and during exponential growth." /codon_start=1 /transl_table=11 /product="hemoglobine-like protein" /protein_id="NP_218088.1" /db_xref="GI:15610707" /db_xref="GeneID:887315" /translation="MTEAIGDEPLGDHVLELQIAEVVDETDEARSLVFAVPDGSDDPE IPPRRLRYAPGQFLTLRVPSERTGSVARCYSLCSSPYTDDALAVTVKRTADGYASNWL CDHAQVGMRIHVLAPSGNFVPTTLDADFLLLAAGSGITPIMSICKSALAEGGGQVTLL YANRDDRSVIFGDALRELAAKYPDRLTVLHWLESLQGLPSASALAKLVAPYTDRPVFI CGPGPFMQAARDALAALKVPAQQVHIEVFKSLESDPFAAVKVDDSGDEAPATAVVELD GQTHTVSWPRTAKLLDVLLAAGLDAPFSCREGHCGACACTLRAGKVNMGVNDVLEQQD LDEGLILACQSRPESDSVEVTYDE" misc_feature 4013329..4013355 /gene="hmp" /locus_tag="Rv3571" /note="PS00197 2Fe-2S ferredoxins, iron-sulfur binding region signature." gene 4013511..4014041 /locus_tag="Rv3572" /db_xref="GeneID:887227" CDS 4013511..4014041 /locus_tag="Rv3572" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3572, (MTCY06G11.19), len: 176 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218089.1" /db_xref="GI:15610708" /db_xref="GeneID:887227" /translation="MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLET HTFLDCHPNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQF ERVMKAGGSDVITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYP GVNLTSPSTCAQANFS" gene complement(4014077..4016212) /gene="fadE34" /locus_tag="Rv3573c" /db_xref="GeneID:887843" CDS complement(4014077..4016212) /gene="fadE34" /locus_tag="Rv3573c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3573c, (MTCY06G11.20c), len: 711 aa. Probable fadE34, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to others, especially in C-terminal half, e.g. Q9RJX2|SCF37.29c from Streptomyces coelicolor (393 aa) FASTA scores: opt: 780, E(): 2.8e-39, (44.1% identity in 347 aa overlap); Q9A6N8|CC2049 from Caulobacter crescentus (401 aa), FASTA scores: opt: 705, E(): 8.7e-35, (41.5% identity in 342 aa overlap); Q9EX72|MLHC from Rhodococcus erythropolis (324 aa), FASTA scores: opt: 673, E(): 6.1e-33, (42.05% identity in 283 aa overlap); P41367|ACDM_PIG|ACADM from Sus scrofa (Pig)(421 aa) FASTA scores: opt: 325, E(): 4.9e- 13, (28.5% identity in 368 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. P95097|FADE22|Rv3061c|MTCY22D7.20 (721 aa), FASTA scores: opt: 1635, E(): 2.7e-90, (42.65% identity in 729 aa overlap). COULD BELONG TO THE ACYL-COA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE34" /protein_id="NP_218090.1" /db_xref="GI:15610709" /db_xref="GeneID:887843" /translation="MVATVTDEQSAARELVRGWARTAASGAAATAAVRDMEYGFEEGN ADAWRPVFAGLAGLGLFGVAVPEDCGGAGGSIEDLCAMVDEAARALVPGPVATTAVAT LVVSDPKLRSALASGERFAGVAIDGGVQVDPKTSTASGTVGRVLGGAPGGVVLLPADG NWLLVDTACDEVVVEPLRATDFSLPLARMVLTSAPVTVLEVSGERVEDLAATVLAAEA AGVARWTLDTAVAYAKVREQFGKPIGSFQAVKHLCAQMLCRAEQADVAAADAARAAAD SDGTQLSIAAAVAASIGIDAAKANAKDCIQVLGGIGCTWEHDAHLYLRRAHGIGGFLG GSGRWLRRVTALTQAGVRRRLGVDLAEVAGLRPEIAAAVAEVAALPEEKRQVALADTG LLAPHWPAPYGRGASPAEQLLIDQELAAAKVERPDLVIGWWAAPTILEHGTPEQIERF VPATMRGEFLWCQLFSEPGAGSDLASLRTKAVRADGGWLLTGQKVWTSAAHKARWGVC LARTDPDAPKHKGITYFLVDMTTPGIEIRPLREITGDSLFNEVFLDNVFVPDEMVVGA VNDGWRLARTTLANERVAMATGTALGNPMEELLKVLGDMELDVAQQDRLGRLILLAQA GALLDRRIAELAVGGQDPGAQSSVRKLIGVRYRQALAEYLMEVSDGGGLVENRAVYDF LNTRCLTIAGGTEQILLTVAAERLLGLPR" misc_feature complement(4014836..4014889) /gene="fadE34" /locus_tag="Rv3573c" /note="PS01156 TonB-dependent receptor proteins signature 2." gene 4016484..4017083 /locus_tag="Rv3574" /db_xref="GeneID:887204" CDS 4016484..4017083 /locus_tag="Rv3574" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3574, (MTCY06G11.21), len: 199 aa. Probable transcriptional regulator tetR family, similar to others e.g. Q9KXK1|SCC53.10 from Streptomyces coelicolor (250 aa) FASTA scores: opt: 492, E(): 4.8e-25, (44.8% identity in 183 aa overlap); Q9RA03|KSTR from Rhodococcus erythropolis (208 aa), FASTA scores: opt: 294, E(): 3.1e-12, (28.9% identity in 187 aa overlap); BAB54261|MLR7895 from Rhizobium loti (Mesorhizobium loti) (193 aa), FASTA scores: opt: 166, E(): 0.00062, (32.05% identity in 78 aa overlap); P17446|BETI_ECOLI|B0313 from Escherichia coli strain K12 (195 aa), FASTA scores: opt: 142, E(): 0.0034, (25. 6% identity in 168 aa overlap); etc. Equivalent to AAK48038 from Mycobacterium tuberculosis strain CDC1551 (243 aa) but shorter 44 aa. Contains possible helix-turn-helix motif from aa 37-58 (+3.70 SD). POSSIBLY BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein TetR-family" /protein_id="NP_218091.1" /db_xref="GI:15610710" /db_xref="GeneID:887204" /translation="MAVLAESELGSEAQRERRKRILDATMAIASKGGYEAVQMRAVAD RADVAVGTLYRYFPSKVHLLVSALGREFSRIDAKTDRSAVAGATPFQRLNFMVGKLNR AMQRNPLLTEAMTRAYVFADASAASEVDQVEKLIDSMFARAMANGEPTEDQYHIARVI SDVWLSNLLAWLTRRASATDVSKRLDLAVRLLIGDQDSA" gene complement(4017089..4018168) /locus_tag="Rv3575c" /db_xref="GeneID:888084" CDS complement(4017089..4018168) /locus_tag="Rv3575c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3575c, (MTCY06G11.22c), len: 359 aa. Probable transcriptional regulator belonging to lacI family, similar to others e.g. BAB53947|MLL8376 from Rhizobium loti (Mesorhizobium loti) (358 aa), FASTA scores: opt: 707, E(): 2.6e-35, (35.5% identity in 355 aa overlap); Q9RRI9|DR2501 from Deinococcus radiodurans (359 aa) FASTA scores: opt: 544, E(): 1.6e-25, (40.35% identity in 347 aa overlap); Q9RL31|SCF51A.34 from Streptomyces coelicolor (347 aa), FASTA scores: opt: 307, E(): 2.9e-11, (30.0% identity in 330 aa overlap); O87590|CELR_THEFU from Thermomonospora fusca (340 aa), FASTA scores: opt: 280, E(): 1.2e-09, (32.3% identity in 353 aa overlap); P21867|RAFR_ECOLI from Escherichia coli (335 aa) FASTA scores: opt: 241, E(): 2.6e-07, (27.15% identity in 269 aa overlap); etc. Equivalent to AAK48039 from Mycobacterium tuberculosis strain CDC1551 (404 aa) but shorter 45 aa. Contains possible helix-turn-helix motif, at aa 9-30 (+5.86 SD). COULD BELONG TO THE LACI FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein LacI-family" /protein_id="NP_218092.1" /db_xref="GI:15610711" /db_xref="GeneID:888084" /translation="MSPTPRRRATLASLAAELKVSRTTVSNAFNRPDQLSADLRERVL ATAKRLGYAGPDPVARSLRTRKAGAVGLVMAEPLTYFFSDPAARDFVAGVAQSCEELG QGLQLVSVGSSRSLADGTAAVLGAGVDGFVVYSVGDDDPYLQVVLQRRLPVVVVDQPK DLSGVSRVGIDDRAAMRELAGYVLGLGHRELGLLTMRLGRDRRQDLVDAERLRSPTFD VQRERIVGVWEAMTAAGVDPDSLTVVESYEHLPTSGGTAAKVALQANPRLTALMCTAD ILALSAMDYLRAHGIYVPGQMTVTGFDGVPEALSRGLTTVAQPSLHKGHRAGELLLKP PRSGLPVIEVLDTELVRGRTAGPPA" gene 4018358..4019071 /gene="lppH" /locus_tag="Rv3576" /db_xref="GeneID:888444" CDS 4018358..4019071 /gene="lppH" /locus_tag="Rv3576" /function="UNKNOWN" /note="Rv3576, (MTCY06G11.23), len: 237 aa. Possible lppH, conserved lipoprotein, similar in part with proteins from Mycobacterium tuberculosis; C-terminus of Q11053|PKNH_MYCTU|PKNH|Rv1266c|MT1304|MTCY50.16 PROBABLE SERINE/THREONINE-PROTEIN KINASE (EC 2.7.1.-) (626 aa) FASTA scores: opt: 396, E(): 6.5e-19, (36.0% identity in 200 aa overlap); and with P71740|LPPR|Rv2403c|MTCY253.17 PROBABLE LIPOPROTEIN PROTEIN (251 aa), FASTA scores: opt: 134, E(): 0.087, (22.7% identity in 207 aa overlap). Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site. Note that previously known as pknM.; pknM" /codon_start=1 /transl_table=11 /product="lipoprotein LppH" /protein_id="YP_177991.1" /db_xref="GI:57117127" /db_xref="GeneID:888444" /translation="MGKQLAALAALVGACMLAAGCTNVVDGTAVAADKSGPLHQDPIP VSALEGLLLDLSQINAALGATSMKVWFNAKAMWDWSKSVADKNCLAIDGPAQEKVYAG TGWTAMRGQRLDDSIDDSKKRDHYAIQAVVGFPTAHDAEEFYSSSVQSWSSCSNRRFV EVTPGQDDAAWTVADVVNDNGMLSSSQVQEGGDGWTCQRALTARNNVTIDIVTCAYSQ PDLVAIGIANQIAAKVAKQ" misc_feature 4018388..4018420 /gene="lppH" /locus_tag="Rv3576" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site." gene 4019262..4020128 /locus_tag="Rv3577" /db_xref="GeneID:888442" CDS 4019262..4020128 /locus_tag="Rv3577" /function="UNKNOWN" /note="Rv3577, (MTCY06G11.24), len: 288 aa (other start sites possible upstream; equivalent to AAK48041 from Mycobacterium tuberculosis strain CDC1551 (379 aa) but shorter 91 aa). Hypothetical protein, showing some similarity to Q9RI88|SCJ11.16c HYPOTHETICAL 37.9 KDA PROTEIN from Streptomyces coelicolor (349 aa) FASTA scores: opt: 285, E(): 1.5e-10, (27.45% identity in 266 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218094.1" /db_xref="GI:15610713" /db_xref="GeneID:888442" /translation="MPTARSDAPLSVTWMGVATLLVDDGSSALMTDGYFSRPGLARVA AGKVSPSAERVDGCLARANVSRLTAVIPVHTHIDHAMDSALVADRTGAQLVGGESAAN VGRGYGLPEESLVVAVPGEPIQLGAFDVTLVESHHCPPDRFPGVISAPLTPPVKASAY RCGEAWSTLVHHRPSGRRLLIQDSAGFVSGALAGYRADAAYLSVGQLGLQPPSYLLEY WTETVRTVGVRRVILIHWDDFFRPLSKPLRALPYAADDLDLSIRILDELAAQDGVALQ MPTVWRREDPWM" gene 4020142..4021383 /gene="arsB2" /locus_tag="Rv3578" /db_xref="GeneID:888329" CDS 4020142..4021383 /gene="arsB2" /locus_tag="Rv3578" /function="THOUGHT TO BE INVOLVED IN TRANSPORT OF ARSENIC ACROSS THE MEMBRANE (EXPORT): ARSENIC RESISTANCE BY AN EXPORT MECHANISM. FORM THE CHANNEL OF AN ARSENITE PUMP RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3578, (MTCY06G11.25), len: 413 aa. Possible arsB2, arsenical pump integral membrane protein, similar to many e.g. Q9I1J6|ARSB|PA2278 from Pseudomonas aeruginosa (427 aa), FASTA scores: opt: 375, E(): 3.1e-15, (32.15% identity in 429 aa overlap); Q9K8K7|ARSB|BH2999 from Bacillus halodurans (436 aa), FASTA scores: opt: 360, E(): 2.5e-14, (28.7% identity in 432 aa overlap); P52146|ARB2_ECOLI from Escherichia coli (plasmid R46) (429 aa), FASTA scores: opt: 345, E(): 2e-13, (29.8% identity in 426 aa overlap); etc. Also highly similar to Q9KYM0|SC9H11.21c PROBABLE MEMBRANE EFFLUX PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 730, E(): 1.7e-36, (53.95% identity in 443 aa overlap). SEEMS TO BELONG TO THE ARS FAMILY." /codon_start=1 /transl_table=11 /product="arsenical PUMP integral membrane protein ArsB2" /protein_id="NP_218095.1" /db_xref="GI:15610714" /db_xref="GeneID:888329" /translation="MTLAVALILLAVVLGFAVARPRGWPEAAAAVPAAVILLAIGAIS PQQAMAQVSGLARVVAFLGAVLVLAKLCDDEGLFEAAGAAMARASAESHRLLRQVFAV SAAITAALCLDATVVLLTPVVLATVRRLRTPVRPYAYATAHLANAASLLLPVSNLTNL LAYHGAGISFTKFTLLMALPWLSAVAAVYVVFRWFFARDLRVVPDRQQLKPAPRLPMF VLVVVALTLGGFAVAESVGLAPTWAALAGAAVLALRSLRRGHTSVLRIARAVNVSFLV FVLALGVVVHAVMLNGMAARMSAVLPTGSGLPALLGIAALAAVLANVVNNLPATLVLV PLVAAGGPAAVLAVLLGVNIGPNLTYAGSLSNLLWRGVLRRHNVDASVGEYTRLGLCT VPAALAMAVLALWASAQVLGI" gene complement(4021425..4022393) /locus_tag="Rv3579c" /db_xref="GeneID:888317" CDS complement(4021425..4022393) /locus_tag="Rv3579c" /EC_number="2.1.1.-" /function="CAUSES METHYLATION." /note="Rv3579c, (MTCY06G11.26c), len: 322 aa. Possible tRNA/rRNA methyltransferase (EC 2.1.1.-), equivalent, but longer 31 aa, to Q9CCW4|ML0324 PUTATIVE METHYLTRANSFERASE from Mycobacterium leprae (278 aa), FASTA scores: opt: 1517, E(): 3.4e-79, (83.75% identity in 277 aa overlap). Also highly similar to Q9L0Q5|SCD8A.09 from Streptomyces coelicolor (314 aa), FASTA scores: opt: 937, E(): 3.4e-46, (56.75% identity in 319 aa overlap); and similar to others e.g. Q06753|YACO_BACSU from Bacillus subtilis (249 aa), FASTA scores: opt: 616, E(): 4.9e-28, (41.05% identity in 246 aa overlap); Q9KGF2|BH0113 from Bacillus halodurans (249 aa), FASTA scores: opt: 596, E(): 6.7e-27, (38.5% identity in 244 aa overlap); P74328|Y955_SYNY3|SLR0955 from Synechocystis sp. strain PCC 6803 (384 aa), FASTA scores: opt: 585, E(): 4e-26, (35.85% identity in 304 aa overlap); P39290|YJFH_ECOLI|B4180 from Escherichia coli strain K12 (243 aa), FASTA scores: opt: 521, E(): 1.2e-22, (38.1% identity in 244 aa overlap); etc. Equivalent to AAK48043 from Mycobacterium tuberculosis strain CDC1551 (253 aa) but longer 69 aa. POSSIBLY BELONGS TO THE RNA METHYLTRANSFERASE TRMH FAMILY." /codon_start=1 /transl_table=11 /product="tRNA/rRNA methyltransferase" /protein_id="NP_218096.1" /db_xref="GI:15610715" /db_xref="GeneID:888317" /translation="MPGNSRRRGAVRKSGTKKGAGVGSGGQRRRGLEGRGPTPPAHLR PHHPAAKRARAQPRRPVKRADETETVLGRNPVLECLRAGVPATALYVALGTEADERLT ECVARAADSGIAIVELLRADLDRMTANHLHQGIALQVPPYNYAHPDDLLAAALDQPPA LLVALDNLSDPRNLGAIVRSVAAFGGHGVLIPQRRSASVTAVAWRTSAGAAARIPVAR ATNLTRTLKGWADRGVRVIGLDAGGGTALDDVDGTDSLVVVVGSEGKGLSRLVRQNCD EVVSIPMAAQAESLNASVAAGVVLAEIARQRRRPREPREQTQNRMI" gene complement(4022394..4023803) /gene="cysS" /locus_tag="Rv3580c" /db_xref="GeneID:888628" CDS complement(4022394..4023803) /gene="cysS" /locus_tag="Rv3580c" /EC_number="6.1.1.16" /function="INVOLVED IN TRANSLATION [CATALYTIC ACTIVITY: ATP + L-CYSTEINE + TRNA(CYS) = AMP + PYROPHOSPHATE + L-CYSTEINYL-TRNA(CYS)]." /note="catalyzes a two-step reaction; charges a cysteine by linking its carboxyl group to the alpha-phosphate of ATP then transfers the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="cysteinyl-tRNA synthetase" /protein_id="YP_177992.1" /db_xref="GI:57117128" /db_xref="GeneID:888628" /translation="MTDRARLRLHDTAAGVVRDFVPLRPGHVSIYLCGATVQGLPHIG HVRSGVAFDILRRWLLARGYDVAFIRNVTDIEDKILAKAAAAGRPWWEWAATHERAFT AAYDALDVLPPSAEPRATGHITQMIEMIERLIQAGHAYTGGGDVYFDVLSYPEYGQLS GHKIDDVHQGEGVAAGKRDQRDFTLWKGEKPGEPSWPTPWGRGRPGWHLECSAMARSY LGPEFDIHCGGMDLVFPHHENEIAQSRAAGDGFARYWLHNGWVTMGGEKMSKSLGNVL SMPAMLQRVRPAELRYYLGSAHYRSMLEFSETAMQDAVKAYVGLEDFLHRVRTRVGAV CPGDPTPRFAEALDDDLSVPIALAEIHHVRAEGNRALDAGDHDGALRSASAIRAMMGI LGCDPLDQRWESRDETSAALAAVDVLVQAELQNREKAREQRNWALADEIRGRLKRAGI EVTDTADGPQWSLLGGDTK" gene complement(4023868..4024347) /gene="ispF" /locus_tag="Rv3581c" /db_xref="GeneID:888221" CDS complement(4023868..4024347) /gene="ispF" /locus_tag="Rv3581c" /EC_number="4.6.1.12" /function="INVOLVED IN THE DEOXYXYLULOSE-5-PHOSPHATE PATHWAY (DXP) OF ISOPRENOID BIOSYNTHESIS (AT THE FIFTH STEP). CONVERTS 4-DIPHOSPHOCYTIDYL-2C-METHYL-D-ERYTHRITOL 2-PHOSPHATE INTO 2C-METHYL-D-ERYTHRITOL 2,4-CYCLODIPHOSPHATE AND CMP. ALSO CONVERTS 4-DIPHOSPHOCYTIDYL-2C-METHYL-D-ERYTHRITOL INTO 2C-METHYL-D-ERYTHRITOL 3,4-CYCLOPHOSPHATE AND CMP." /note="catalyzes the conversion of 4-diphosphocytidyl-2-C-methyl-D-erythritol 2-phosphate into 2-C-methyl-D-erythritol 2,4-cyclodiphosphate" /codon_start=1 /transl_table=11 /product="2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase" /protein_id="NP_218098.1" /db_xref="GI:15610717" /db_xref="GeneID:888221" /translation="MNQLPRVGLGTDVHPIEPGRPCWLVGLLFPSADGCAGHSDGDVA VHALCDAVLSAAGLGDIGEVFGVDDPRWQGVSGADMLRHVVVLITQHGYRVGNAVVQV IGNRPKIGWRRLEAQAVLSRLLNAPVSVSATTTDGLGLTGRGEGLAAIATALVVSLR" gene complement(4024344..4025039) /gene="ispD" /locus_tag="Rv3582c" /db_xref="GeneID:887787" CDS complement(4024344..4025039) /gene="ispD" /locus_tag="Rv3582c" /EC_number="2.7.7.60" /function="INVOLVED IN THE DEOXYXYLULOSE-5-PHOSPHATE PATHWAY (DXP) OF ISOPRENOID BIOSYNTHESIS (AT THE THIRD STEP). CATALYZES THE FORMATION OF 4-DIPHOSPHOCYTIDYL-2C-METHYL-D-ERYTHRITOL FROM CTP AND 2C-METHYL-D-ERYTHRITOL 4-PHOSPHATE." /note="4-diphosphocytidyl-2C-methyl-D-erythritol synthase; MEP cytidylyltransferase; MCT; catalyzes the formation of 4-diphosphocytidyl-2-C-methyl-D-erythritol from CTP and 2-C-methyl-D-erythritol 4-phosphate; involved in isoprenoid and isopentenyl-PP biosynthesis; forms homodimers" /codon_start=1 /transl_table=11 /product="2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase" /protein_id="NP_218099.1" /db_xref="GI:15610718" /db_xref="GeneID:887787" /translation="MVREAGEVVAIVPAAGSGERLAVGVPKAFYQLDGQTLIERAVDG LLDSGVVDTVVVAVPADRTDEARQILGHRAMIVAGGSNRTDTVNLALTVLSGTAEPEF VLVHDAARALTPPALVARVVEALRDGYAAVVPVLPLSDTIKAVDANGVVLGTPERAGL RAVQTPQGFTTDLLLRSYQRGSLDLPAAEYTDDASLVEHIGGQVQVVDGDPLAFKITT KLDLLLAQAIVRG" gene complement(4025056..4025544) /locus_tag="Rv3583c" /db_xref="GeneID:887854" CDS complement(4025056..4025544) /locus_tag="Rv3583c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3583c, (MTV024.01c, MTCY06G11.30c), len: 162 aa. Possible transcriptional factor, identical to Q9CCW7|ML0320 PUTATIVE TRANSCRIPTION FACTOR from Mycobacterium leprae (165 aa), FASTA scores: opt: 1004, E(): 6.1e-56, (97.55% identity in 162 aa overlap); and Q9ZBM8|MLCB1450.01c PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (94 aa), FASTA scores: opt: 600, E(): 6e-31, (97.85% identity in 94 aa overlap). Also highly similar to others e.g. Q9L0Q9|SCD8A.05 from Streptomyces coelicolor (160 aa), FASTA scores: opt: 878, E(): 4.3e-48, (85.0% identity in 160 aa overlap); Q9K600|BH3935 from Bacillus halodurans (153 aa) FASTA scores: opt: 383, E(): 3.1e-17, (36.4% identity in 151 aa overlap); Q9KD36|BH1383 from Bacillus halodurans (164 aa) FASTA scores: opt: 305, E(): 2.4e-12, (33.55% identity in 164 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transcription factor" /protein_id="NP_218100.1" /db_xref="GI:15610719" /db_xref="GeneID:887854" /translation="MIFKVGDTVVYPHHGAALVEAIETRTIKGEQKEYLVLKVAQGDL TVRVPAENAEYVGVRDVVGQEGLDKVFQVLRAPHTEEPTNWSRRYKANLEKLASGDVN KVAEVVRDLWRRDQERGLSAGEKRMLAKARQILVGELALAESTDDAKAETILDEVLAA AS" gene 4025830..4026378 /gene="lpqE" /locus_tag="Rv3584" /db_xref="GeneID:887254" CDS 4025830..4026378 /gene="lpqE" /locus_tag="Rv3584" /function="UNKNOWN" /note="Rv3584, (MTV024.02), len: 182 aa. Possible lpqE, conserved lipoprotein, equivalent to Q9ZBM7|MLCB1450.02|LPQE|ML0319 PUTATIVE LIPOPROTEIN from Mycobacterium leprae (183 aa), FASTA scores: opt: 722, E(): 6.2e-37, (63.45% identity in 175 aa overlap). Also similar in part to Q9KK69 EXPORTED PROTEIN 996A010 (FRAGMENT) from Mycobacterium avium (41 aa), FASTA scores: opt: 180, E(): 0.00012, (69.25% identity in 39 aa overlap); and Q9L0R0|SCD8A.04c PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (241 aa), FASTA scores: opt: 127, E(): 0.86, (27.15% identity in 173 aa overlap). Equivalent to AAK48048 from Mycobacterium tuberculosis strain CDC1551 (238 aa) but shorter 56 aa. Contains probable N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. TBparse score is 0.895." /codon_start=1 /transl_table=11 /product="lipoprotein LpqE" /protein_id="NP_218101.1" /db_xref="GI:15610720" /db_xref="GeneID:887254" /translation="MNRCNIRLRLAGMTTWVASIALLAAALSGCGAGQISQTANQKPA VNGNRLTINNVLLRDIRIQAVQTSDFIQPGKAVDLVLVAVNQSPDVSDRLVGITSDIG SVTVAGDARLPASGMLFVGTPDGQIVAPGPLPSNQAAKATVNLTKPIANGLTYNFTFK FEKAGQGSVMVPISAGLATPHE" misc_feature 4025887..4025919 /gene="lpqE" /locus_tag="Rv3584" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site." gene 4026444..4027886 /gene="radA" /locus_tag="Rv3585" /db_xref="GeneID:887287" CDS 4026444..4027886 /gene="radA" /locus_tag="Rv3585" /function="INVOLVED IN GENETIC RECOMBINATION. MAY PLAY A ROLE IN THE REPAIR OF ENDOGENOUS ALKYLATION DAMAGE." /note="Sms; stabilizes the strand-invasion intermediate during the DNA repair; involved in recombination of donor DNA and plays an important role in DNA damage repair after exposure to mutagenic agents" /codon_start=1 /transl_table=11 /product="DNA repair protein RadA" /protein_id="NP_218102.1" /db_xref="GI:15610721" /db_xref="GeneID:887287" /translation="MANARSQYRCSECRHVSAKWVGRCLECGRWGTVDEVAVLSAVGG TRRRSVAPASGAVPISAVDAHRTRPCPTGIDELDRVLGGGIVPGSVTLLAGDPGVGKS TLLLEVAHRWAQSGRRALYVSGEESAGQIRLRADRIGCGTEVEEIYLAAQSDVHTVLD QIETVQPALVIVDSVQTMSTSEADGVTGGVTQVRAVTAALTAAAKANEVALILVGHVT KDGAIAGPRSLEHLVDVVLHFEGDRNGALRMVRGVKNRFGAADEVGCFLLHDNGIDGI VDPSNLFLDQRPTPVAGTAITVTLDGKRPLVGEVQALLATPCGGSPRRAVSGIHQARA AMIAAVLEKHARLAIAVNDIYLSTVGGMRLTEPSADLAVAIALASAYANLPLPTTAVM IGEVGLAGDIRRVNGMARRLSEAARQGFTIALVPPSDDPVPPGMHALRASTIVAALQY MVDIADHRGTTLATPPSHSGTGHVPLGRGT" misc_feature 4026726..4026749 /gene="radA" /locus_tag="Rv3585" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene 4027891..4028967 /locus_tag="Rv3586" /db_xref="GeneID:887485" CDS 4027891..4028967 /locus_tag="Rv3586" /function="UNKNOWN" /note="non-specific DNA-binding; scans chromosomes during sporulation for DNA-damage; delays initiation of sporulation; participates in a checkpoint signaling cascade for cell-cycle progression and DNA repair" /codon_start=1 /transl_table=11 /product="DNA integrity scanning protein DisA" /protein_id="NP_218103.1" /db_xref="GI:15610722" /db_xref="GeneID:887485" /translation="MHAVTRPTLREAVARLAPGTGLRDGLERILRGRTGALIVLGHDE NVEAICDGGFSLDVRYAATRLRELCKMDGAVVLSTDGSRIVRANVQLVPDPSIPTDES GTRHRSAERAAIQTGYPVISVSHSMNIVTVYVRGERHVLTDSATILSRANQAIATLER YKTRLDEVSRQLSRAEIEDFVTLRDVMTVVQRLELVRRIGLVIDYDVVELGTDGRQLR LQLDELLGGNDTARELIVRDYHANPEPPSTGQINATLDELDALSDGDLLDFTALAKVF GYPTTTEAQDSTLSPRGYRAMAGIPRLQFAHADLLVRAFGTLQGLLAASAGDLQSVDG IGAMWARHVREGLSQLAESTISDQ" gene complement(4028968..4029762) /locus_tag="Rv3587c" /db_xref="GeneID:888057" CDS complement(4028968..4029762) /locus_tag="Rv3587c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3587c, (MTV024.05c), len: 264 aa. Probable conserved membrane protein, equivalent to Q9CBJ2|ML1918 HYPOTHETICAL MEMBRANE PROTEIN from Mycobacterium leprae (263 aa), FASTA scores: opt: 1438, E(): 2.4e-57, (77.55% identity in 267 aa overlap). Contains hydrophobic stretch in N-terminus; possible signal sequence. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218104.1" /db_xref="GI:15610723" /db_xref="GeneID:888057" /translation="MLDLEPRGPLPTEIYWRRRGLALGIAVVVVGIAVAIVIAFVDSS AGAKPVSADKPASAQSHPGSPAPQAPQPAGQTEGNAAAAPPQGQNPETPTPTAAVQPP PVLKEGDDCPDSTLAVKGLTNAPQYYVGDQPKFTMVVTNIGLVSCKRDVGAAVLAAYV YSLDNKRLWSNLDCAPSNETLVKTFSPGEQVTTAVTWTGMGSAPRCPLPRPAIGPGTY NLVVQLGNLRSLPVPFILNQPPPPPGPVPAPGPAQAPPPESPAQGG" gene complement(4029871..4030494) /locus_tag="Rv3588c" /db_xref="GeneID:887836" CDS complement(4029871..4030494) /locus_tag="Rv3588c" /EC_number="4.2.1.1" /function="CATALYZES THE REVERSIBLE HYDRATATION OF CARBON DIOXIDE [CATALYTIC ACTIVITY: H(2)CO(3) = CO(2) + H(2)O]." /note="Rv3588c, (MTV024.06c), len: 207 aa. Probable carbonic anhydrase (EC 4.2.1.1), equivalent to Q9CBJ1|ML1919 PUTATIVE CARBONIC ANHYDRASE from Mycobacterium leprae (213 aa), FASTA scores: opt: 1160, E(): 3.1e-66, (84.55% identity in 207 aa overlap). Also similar to many e.g. Q9X903|SCH35.03 from Streptomyces coelicolor (207 aa), FASTA scores: opt: 689, E(): 1.6e-36, (53.85% identity in 195 aa overlap); Q9RS89|DR2238 from Deinococcus radiodurans (264 aa), FASTA scores: opt: 451, E(): 2e-21, (39.7% identity in 189 aa overlap); Q39589|BETA-CA1 from Chlamydomonas reinhardtii (267 aa) FASTA scores: opt: 419, E(): 2.1e-19, (36.55% identity in 197 aa overlap); etc. Contains PS00704 and PS00705 Prokaryotic-type carbonic anhydrases signature 1 and 2. BELONGS TO THE PLANT AND PROKARYOTIC CARBONIC ANHYDRASE FAMILY. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="carbonic anhydrase" /protein_id="NP_218105.1" /db_xref="GI:15610724" /db_xref="GeneID:887836" /translation="MPNTNPVAAWKALKEGNERFVAGRPQHPSQSVDHRAGLAAGQKP TAVIFGCADSRVAAEIIFDQGLGDMFVVRTAGHVIDSAVLGSIEYAVTVLNVPLIVVL GHDSCGAVNAALAAINDGTLPGGYVRDVVERVAPSVLLGRRDGLSRVDEFEQRHVHET VAILMARSSAISERIAGGSLAIVGVTYQLDDGRAVLRDHIGNIGEEV" misc_feature complement(4030171..4030233) /locus_tag="Rv3588c" /note="PS00705 Prokaryotic-type carbonic anhydrases signature 2." misc_feature complement(4030321..4030344) /locus_tag="Rv3588c" /note="PS00704 Prokaryotic-type carbonic anhydrases signature 1." gene 4030493..4031407 /gene="mutY" /locus_tag="Rv3589" /db_xref="GeneID:886639" CDS 4030493..4031407 /gene="mutY" /locus_tag="Rv3589" /EC_number="3.2.2.-" /function="INVOLVED IN BASE EXCISION REPAIR. REMOVES ADENINE MISPAIRED WITH 8-OXOG. MAY REPAIR A.G AND A.C MISMATCHES BY ADENINE EXCISION." /note="Rv3589, (MTV024.07), len: 304 aa. Probable mutY, adenine glycosylase (EC 3.2.2.-) (see citation below), equivalent to Q9CBJ0|MUTY|ML1920 PROBABLE DNA GLYCOSYLASE from Mycobacterium leprae (297 aa), FASTA scores: opt: 1592, E(): 2.6e-94, (74.9% identity in 303 aa overlap). Also similar to many DNA glycosylases (generally adenine glycosylases) e.g. Q9S6T7|SCE94.06 from Streptomyces coelicolor (308 aa), FASTA scores: opt: 965, E(): 2.6e-54, (50.5% identity in 297 aa overlap); Q9S6G1|MUTY from Streptomyces antibioticus (307 aa), FASTA scores: opt: 901, E(): 3.1e-50, (48.5% identity in 303 aa overlap); Q9HPQ6|MUTY|VNG1520G from Halobacterium sp. strain NRC-1 (312 aa), FASTA scores: opt: 566, E(): 7.2e-29, (39.85% identity in 296 aa overlap); BAB53965|MLL7523 from Rhizobium loti (Mesorhizobium loti) (396 aa), FASTA scores: opt: 511, E(): 2.8e-25, (39.65% identity in 237 aa overlap); Q05869|MUTY_SALTY|MUTB from Salmonella typhimurium (350 aa), FASTA scores: opt: 421, E(): 3.8e-20, (35.2% identity in 227 aa overlap); etc. COULD BELONG TO THE NTH/MUTY FAMILY. TBparse score is 0.905." /codon_start=1 /transl_table=11 /product="adenine glycosylase MutY" /protein_id="NP_218106.1" /db_xref="GI:15610725" /db_xref="GeneID:886639" /translation="MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQI LVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHE CATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAV HGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWC AWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQR DRALESLLADALVTRTVDGRFALPGEGF" gene complement(4031404..4033158) /gene="PE_PGRS58" /locus_tag="Rv3590c" /db_xref="GeneID:887874" CDS complement(4031404..4033158) /gene="PE_PGRS58" /locus_tag="Rv3590c" /function="UNKNOWN" /note="Rv3590c, (MTV024.08c, MTCY6F7.04), len: 584 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to e.g. O53439|Rv1091|MTV017.44 (853 aa), FASTA scores: opt: 2005, E(): 1.4e-70, (54.95% identity in 646 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177993.1" /db_xref="GI:57117129" /db_xref="GeneID:887874" /translation="MSFVIVAPEALMSVASEVAGIGSALNAANAAAAAPTTGVLAAAA DEVSAAMAALFGAHAQEYQRLSAQAAGFHAQFVQALNAGVNSYASAEAANASPLQAVE QQVLGLINGPAQTLLGRPLIGNGADGAPGTGQPGGPGGLLWGNGGNGGSGVAGVGGPG GSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGNGGAGGFGGVGTTVSGNGG AGGAAGAFGNGGVGGAGGAAVIGGLPGNGGAGGNAGLIGAGGDGGVGGVGAPGTNGMN PPPNQTSQAANGSPGANNGAGSGGAGLPGNPGAVPGRAGGAGGLGGSGSDTSEGPVTG GNGGNGGDGGPGAPGGNGAPGGIGVNTGTGWAYGGNGGNGGDGGAGARGGDGGNGGNG LALNGGNGIGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAG TGGVGGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGAGGKGGSGLVGGDGGNGG AGGAGGNGGKGGAGGAGGGAGMFSQPGVHGAGGTGGQGGAGGAGGAGGAAGAGTVVAG NPGDPGGFGAAGADGLPG" gene complement(4033269..4034042) /locus_tag="Rv3591c" /db_xref="GeneID:886316" CDS complement(4033269..4034042) /locus_tag="Rv3591c" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3591c, (MTCY6F7.03), len: 257 aa. Possible hydrolase (EC 3.-.-.-), equivalent to Q9CBI9|ML1921 HYPOTHETICAL PROTEIN from Mycobacterium leprae (256 aa) FASTA scores: opt: 1421, E(): 5.6e-83, (78.5% identity in 251 aa overlap). Also similar to others e.g. Q9K3V0|SCD10.27 PUTATIVE HYDROLASE from Streptomyces coelicolor (352 aa), FASTA scores: opt: 193, E(): 5.2e-05, (33.35% identity in 270 aa overlap); O33745|STTC THIOESTERASE (EC 3.1.2.-) from Streptomyces sp (308 aa) FASTA scores: opt: 242, E(): 3.6e-08, (30.35% identity in 270 aa overlap); Q9RK95|SCF1.09 PUTATIVE HYDROLASE from Streptomyces coelicolor (258 aa), FASTA scores: opt: 239, E(): 4.9e-08, (30.75% identity in 247 aa overlap); Q9HZ14|PA3226 PROBABLE HYDROLASE from Pseudomonas aeruginosa (275 aa), FASTA scores: opt: 226, E(): 3.4e-07, (26.6% identity in 252 aa overlap); Q9HPT9|EST|VNG1474G CARBOXYLESTERASE from Halobacterium sp. strain NRC-1 (274 aa), FASTA scores: opt: 215, E(): 1.7e-06, (26.95% identity in 256 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_218108.1" /db_xref="GI:15610727" /db_xref="GeneID:886316" /translation="MPRMPANLLTHRGGRGEPLVLVHGLMGRGSTWARQLPWLTLLGA VYTYDAPWHRGRDVADPHPISTERFVADLGDAVSALGAPTRMVGHSMGALHSWCLAAE RPELVSALVVEDMAPDFRGRTTGPWEPWLRALPVEFDSAEQVFAEFGPVAGRYFLDAF DRTATGWRLHGRTARWIEIAAEWGTRDYWAQWRAVRSPALLIEAGDGVTPPGQMRAMA ERDYPTAYLRVPDAGHLVHDEAPQVYRRAVESFLAGLTP" gene 4034057..4034374 /gene="TB11.2" /locus_tag="Rv3592" /db_xref="GeneID:886278" CDS 4034057..4034374 /gene="TB11.2" /locus_tag="Rv3592" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3592, (MTCY6F7.02c), len: 105 aa. TB11.2, conserved hypothetical protein (see citations from 2000 below), equivalent to Q9CBI8|ML1922 HYPOTHETICAL PROTEIN from Mycobacterium leprae (105 aa) FASTA scores: opt: 591, E(): 2.5e-34, (84.6% identity in 104 aa overlap). Shows some similarity with other bacterial hypothetical proteins e.g. Q9RXN8|DR0272 from Deinococcus radiodurans (109 aa), FASTA scores: opt: 178, E(): 1e-05, (34.3% identity in 102 aa overlap); P38049|YHGC_BACSU from Bacillus subtilis (166 aa) FASTA scores: opt: 175, E(): 2.4e-05, (40.85% identity in 71 aa overlap); Q9K649|BH3883 from Bacillus halodurans (102 aa) FASTA scores: opt: 162, E(): 0.00012, (33.75% identity in 80 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218109.1" /db_xref="GI:15610728" /db_xref="GeneID:886278" /translation="MPVVKINAIEVPAGAGPELEKRFAHRAHAVENSPGFLGFQLLRP VKGEERYFVVTHWESDEAFQAWANGPAIAAHAGHRANPVATGASLLEFEVVLDVGGTG KTA" gene 4034352..4035710 /gene="lpqF" /locus_tag="Rv3593" /db_xref="GeneID:886290" CDS 4034352..4035710 /gene="lpqF" /locus_tag="Rv3593" /function="UNKNOWN" /note="Rv3593, (MTCY6F7.01c), len: 452 aa. Probable lpqF, conserved lipoprotein, equivalent to Q9CBI7|MPQF|ML1923 PROBALE SECRETED PROTEIN from Mycobacterium leprae (454 aa), FASTA scores: opt: 2465, E(): 5.7e-144, (79.15% identity in 451 aa overlap). Also similar to Q9KJ91 HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces clavuligerus (430 aa), FASTA scores: opt: 609, E(): 5.2e-30, (30.3% identity in 350 aa overlap); and some similarity with putative beta-lactamases e.g. Q9RYR7|DRA0241 BETA LACTAMASE-RELATED PROTEIN from Deinococcus radiodurans (499 aa), FASTA scores: opt: 322, E(): 2.5e-12, (28.25% identity in 322 aa overlap). Equivalent to AAK48057 from Mycobacterium tuberculosis strain CDC1551 (438 aa) but longer 14 aa. Contains N-terminal signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="lipoprotein LpqF" /protein_id="NP_218110.1" /db_xref="GI:15610729" /db_xref="GeneID:886290" /translation="MGPARLHNRRAGRRMLALSAAAALIVALASGCSSAPTPSANAAN HGHRIDTRTPPGLRAQQTMDMLNSDWPIGEIGVGTLAAPGQVDTVKTTMEALWWDRPF ALAGVDIGASVAALHLISSYGAQQDIRIHTDDDGWVDRFDVETQAPSIASWRDVDAAL SKTGARYSFQVAKVDNGRCDPVAGTNTGESLPLASIFKLYVLHALAGAVQHNTVSWDD LLTVTAKSKAVGSSGLELPVGARVSVRTAAEKMIATSDNMATDLLIERLGTRAIEEAL ASAGHHDPASMTPFPTMYELFSVGWGKPDLRDQWKHATQQVRAQILRQTNSTPYQPDP TRAHTPASNYGAEWYGSAEDICRVHAALRADAVGPASPVRQIMSAVPGIQLDRSVWPY IGAKAGGLPGDLTFSWYAVDKTGQPWVVSFQLNWPRDHGPTVTGWMLQVARQVFALIA PQ" misc_feature 4034415..4034447 /gene="lpqF" /locus_tag="Rv3593" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site." gene 4035857..4036684 /locus_tag="Rv3594" /db_xref="GeneID:885659" CDS 4035857..4036684 /locus_tag="Rv3594" /function="UNKNOWN" /note="Rv3594, (MTCY07H7B.28c), len: 275 aa. Hypothetical protein, highly similar in part with Q9ZX49|GP29 from Mycobacteriophage TM4 (547 aa), FASTA scores: opt: 526, E(): 1.3e-25, (46.25% identity in 186 aa overlap); and Q9FZS0|LYSA|GP2 from Mycobacterium phage Ms6 (384 aa) FASTA scores: opt: 147, E(): 0.064, (33.35% identity in 84 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218111.1" /db_xref="GI:15610730" /db_xref="GeneID:885659" /translation="MGWIGDPIWLEEVLRPALGERLRVLDGWRERGHGDFRDIRGVMW HHTGNSRETAKSIARGRPDLPGPLANLHIAHSGVVTIVAVGVCWHAGRGSYPWLPTDN ANWHMIGVECAWPTIRRDGSYDAGERWPDAQIVSMRDVAAALTLKLGYGPERNIGHKE YAGAAQGKWDPGNLSMDWFRAEVAKDTRGEFDHPLTPPPAVIARPPILPKPRNPRDDR ILLEEVWDQLRGIEGRGWPVLGDKTIVDYLAELGNKVDALAAKLDAREGLDRPSDTR" gene complement(4036731..4038050) /gene="PE_PGRS59" /locus_tag="Rv3595c" /db_xref="GeneID:885464" CDS complement(4036731..4038050) /gene="PE_PGRS59" /locus_tag="Rv3595c" /function="UNKNOWN" /note="Rv3595c, (MTCY07H7B.27), len: 439 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar to many e.g. O53439|Rv1091|MTV017.44 (853 aa), FASTA scores: opt: 1644, E(): 1.2e-57, (58.75% identity in 492 aa overlap)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_177994.1" /db_xref="GI:57117130" /db_xref="GeneID:885464" /translation="MSFVIAVPEFLSAAATDLANLGSTISAANAAASIPTTGVLAAGA DDVSAAIAALFGAHAQAYQTISAQAATFHAQFVQTLSAGAGAYANAEAANVQQSLLNA INAPTQALLGRPLIGDGADGTAPGQNGGAGGLLYGNGGNGAAGVNAGIAGGSGGAAGL IGNGGSGGAGGAGAAGGSGGQGGLLYGNGGAGGNGGAATIPGGNGGAGGAGGNAWLFG NGGAGGLGAAGAAGAAGVNPLTVPAGQGSMGNNGEPGGPGQPGTEFGQTGGTGGTGGT GLSVGGTGGTGGTGGTGGAGGSGGRGGLLVGDGGAGGIGGTGGEGGIGARGGTGGQGG MGGAGQPGVGGDAGDGGNGGIGGDGGAGGDGGAGGAGGAGGLFGVSGSSGLGGAAGSG GNGGGGGEPGVAGSPGVGPAGRGGDGNLGQFGPEGAPGQPGQPGQPG" gene complement(4038158..4040704) /gene="clpC1" /locus_tag="Rv3596c" /db_xref="GeneID:885104" CDS complement(4038158..4040704) /gene="clpC1" /locus_tag="Rv3596c" /EC_number="3.4.-.-" /function="HYDROLYSES PROTEINS IN PRESENCE OF ATP. MAY INTERACT WITH A CLPP-LIKE PROTEASE INVOLVED IN DEGRADATION OF DENATURED PROTEINS." /experiment="experimental evidence, no additional details recorded" /note="Rv3596c, (MTCY07H7B.26), len: 848 aa. Probable clpC1, ATP-dependent protease ATP-binding subunit (EC 3.4.-.-), equivalent to P24428|CLPC_MYCLE PROBABLE ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT from Mycobacterium leprae (848 aa) (see Misra et al., 1996), FASTA scores: opt: 5286, E(): 0, (97.15% identity in 845 aa overlap). Also highly similar to members of the clpA/clpB family e.g. Q9S6T8|SCE94.24c from Streptomyces coelicolor (841 aa) FASTA scores: opt: 4399, E(): 0, (81.0% identity in 848 aa overlap); Q9KGG2|CLPC|BH0103 from Bacillus halodurans (813 aa), FASTA scores: opt: 3279, E(): 3.8e-173, (61.9% identity in 808 aa overlap); Q55662|CLPC|SLL0020 from Synechocystis sp. strain PCC 6803 (821 aa), FASTA scores: opt: 3201, E(): 7.6e-169, (60.5% identity in 820 aa overlap); P51332|CLPC_PORPU from Porphyra purpurea (821 aa), FASTA scores: opt: 3045, E(): 3e-160, (57.65% identity in 817 aa overlap); P37571|CLPC_BACSU|MECB from Bacillus subtilis (810 aa), FASTA scores: opt: 2969, E(): 4.6e-156, (61.15% identity in 811 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Note that previously known as clpC. BELONGS TO THE CLPA/CLPB FAMILY, CLPC SUBFAMILY.; clpC" /codon_start=1 /transl_table=11 /product="ATP-dependent protease ATP-binding subunit ClpC1" /protein_id="YP_177995.1" /db_xref="GI:57117131" /db_xref="GeneID:885104" /translation="MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGV AAKSLESLGISLEGVRSQVEEIIGQGQQAPSGHIPFTPRAKKVLELSLREALQLGHNY IGTEHILLGLIREGEGVAAQVLVKLGAELTRVRQQVIQLLSGYQGKEAAEAGTGGRGG ESGSPSTSLVLDQFGRNLTAAAMEGKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEP GVGKTAVVEGLAQAIVHGEVPETLKDKQLYTLDLGSLVAGSRYRGDFEERLKKVLKEI NTRGDIILFIDELHTLVGAGAAEGAIDAASILKPKLARGELQTIGATTLDEYRKYIEK DAALERRFQPVQVGEPTVEHTIEILKGLRDRYEAHHRVSITDAAMVAAATLADRYIND RFLPDKAIDLIDEAGARMRIRRMTAPPDLREFDEKIAEARREKESAIDAQDFEKAASL RDREKTLVAQRAEREKQWRSGDLDVVAEVDDEQIAEVLGNWTGIPVFKLTEAETTRLL RMEEELHKRIIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKTELSKAL ANFLFGDDDALIQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKPFSVVLF DEIEKAHQEIYNSLLQVLEDGRLTDGQGRTVDFKNTVLIFTSNLGTSDISKPVGLGFS KGGGENDYERMKQKVNDELKKHFRPEFLNRIDDIIVFHQLTREEIIRMVDLMISRVAG QLKSKDMALVLTDAAKALLAKRGFDPVLGARPLRRTIQREIEDQLSEKILFEEVGPGQ VVTVDVDNWDGEGPGEDAVFTFTGTRKPPAEPDLAKAGAHSAGGPEPAAR" misc_feature complement(4039025..4039048) /gene="clpC1" /locus_tag="Rv3596c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." misc_feature complement(4040036..4040059) /gene="clpC1" /locus_tag="Rv3596c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene complement(4040981..4041319) /gene="lsr2" /locus_tag="Rv3597c" /db_xref="GeneID:885580" CDS complement(4040981..4041319) /gene="lsr2" /locus_tag="Rv3597c" /function="DOMINANT T-CELL ANTIGEN AND POSSIBLY STIMULATES LYMPHOPROLIFERATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3597c, (MTCY07H7B.25), len: 112 aa. Probable lsr2, identical to P24094|LSR2_MYCLE|ML0234 LSR2 PROTEIN PRECURSOR (15 KDA ANTIGEN) (A15) from Mycobacterium leprae (112 aa), FASTA scores: opt: 698, E(): 6.7e-37, (92.85% identity in 112 aa overlap). Also highly similar to others e.g. Q9X8N1|SCE94.26c from Streptomyces coelicolor (111 aa), FASTA scores: opt: 379, E(): 4.4e-17, (58.05% identity in 112 aa overlap); Q9ETI2|LSR2 from Corynebacterium equii (Rhodococcus equi) (119 aa), FASTA scores: opt: 328, E(): 6.9e-14, (47.5% identity in 120 aa overlap); and Q9RKK8|SCD25.12c from Streptomyces coelicolor (105 aa), FASTA scores: opt: 293, E(): 9.4e-12, (47.75% identity in 111 aa overlap)." /codon_start=1 /transl_table=11 /product="iron-regulated LSR2 protein precursor" /protein_id="NP_218114.1" /db_xref="GI:15610733" /db_xref="GeneID:885580" /translation="MAKKVTVTLVDDFDGSGAADETVEFGLDGVTYEIDLSTKNATKL RGDLKQWVAAGRRVGGRRRGRSGSGRGRGAIDREQSAAIREWARRNGHNVSTRGRIPA DVIDAYHAAT" gene complement(4041423..4042940) /gene="lysS" /locus_tag="Rv3598c" /db_xref="GeneID:885574" CDS complement(4041423..4042940) /gene="lysS" /locus_tag="Rv3598c" /EC_number="6.1.1.6" /function="INVOLVED IN TRANSLATION [CATALYTIC ACTIVITY: ATP + L-LYSINE + TRNA(LYS) = AMP + PYROPHOSPHATE + L-LYSYL-TRNA(LYS)]." /note="class II; LysRS2; catalyzes a two-step reaction, first charging a lysine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA; in Methanosarcina barkeri, LysRS2 charges both tRNA molecules for lysine that exist in this organism and in addition can charge the tRNAPyl with lysine in the presence of LysRS1" /codon_start=1 /transl_table=11 /product="lysyl-tRNA synthetase" /protein_id="NP_218115.1" /db_xref="GI:15610734" /db_xref="GeneID:885574" /translation="MSAADTAEDLPEQFRIRRDKRARLLAQGRDPYPVAVPRTHTLAE VRAAHPDLPIDTATEDIVGVAGRVIFARNSGKLCFATLQDGDGTQLQVMISLDKVGQA ALDAWKADVDLGDIVYVHGAVISSRRGELSVLADCWRIAAKSLRPLPVAHKEMSEESR VRQRYVDLIVRPEARAVARLRIAVVRAIRTALQRRGFLEVETPVLQTLAGGAAARPFA THSNALDIDLYLRIAPELFLKRCIVGGFDKVFELNRVFRNEGADSTHSPEFSMLETYQ TYGTYDDSAVVTRELIQEVADEAIGTRQLPLPDGSVYDIDGEWATIQMYPSLSVALGE EITPQTTVDRLRGIADSLGLEKDPAIHDNRGFGHGKLIEELWERTVGKSLSAPTFVKD FPVQTTPLTRQHRSIPGVTEKWDLYLRGIELATGYSELSDPVVQRERFADQARAAAAG DDEAMVLDEDFLAALEYGMPPCTGTGMGIDRLLMSLTGLSIRETVLFPIVRPHSN" misc_feature complement(4042122..4042175) /gene="lysS" /locus_tag="Rv3598c" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1." gene complement(4042952..4043035) /locus_tag="Rv3599c" /db_xref="GeneID:887149" CDS complement(4042952..4043035) /locus_tag="Rv3599c" /function="UNKNOWN" /note="Rv3599c, (MTCY07H7B.23), len: 27 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218116.1" /db_xref="GI:15610735" /db_xref="GeneID:887149" /translation="MPASSLGTGSPAADRLDATHERRREVI" gene complement(4043041..4043859) /locus_tag="Rv3600c" /db_xref="GeneID:885572" CDS complement(4043041..4043859) /locus_tag="Rv3600c" /EC_number="2.7.1.33" /function="UNKNOWN" /note="type III; catalyzes the formation of (R)-4'-phosphopantothenate from (R)-pantothenate in coenzyme A biosynthesis; type III pantothenate kinases are not subject to feedback inhibition from coenzyme A and have a high Km for ATP" /codon_start=1 /transl_table=11 /product="pantothenate kinase" /protein_id="NP_218117.1" /db_xref="GI:15610736" /db_xref="GeneID:885572" /translation="MLLAIDVRNTHTVVGLLSGMKEHAKVVQQWRIRTESEVTADELA LTIDGLIGEDSERLTGTAALSTVPSVLHEVRIMLDQYWPSVPHVLIEPGVRTGIPLLV DNPKEVGADRIVNCLAAYDRFRKAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSS DAAAARSAALRRVELARPRSVVGKNTVECMQAGAVFGFAGLVDGLVGRIREDVSGFSV DHDVAIVATGHTAPLLLPELHTVDHYDQHLTLQGLRLVFERNLEVQRGRLKTAR" gene complement(4043862..4044281) /gene="panD" /locus_tag="Rv3601c" /db_xref="GeneID:885596" CDS complement(4043862..4044281) /gene="panD" /locus_tag="Rv3601c" /EC_number="4.1.1.11" /function="INVOLVED IN PANTOTHENATE BIOSYNTHESIS [CATALYTIC ACTIVITY: L-ASPARTATE = BETA-ALANINE + CO(2)]." /note="Converts L-aspartate to beta-alanine and provides the major route of beta-alanine production in bacteria. Beta-alanine is essential for the biosynthesis of pantothenate (vitamin B5)" /codon_start=1 /transl_table=11 /product="aspartate alpha-decarboxylase" /protein_id="NP_218118.1" /db_xref="GI:15610737" /db_xref="GeneID:885596" /translation="MLRTMLKSKIHRATVTCADLHYVGSVTIDADLMDAADLLEGEQV TIVDIDNGARLVTYAITGERGSGVIGINGAAAHLVHPGDLVILIAYATMDDARARTYQ PRIVFVDAYNKPIDMGHDPAFVPENAGELLDPRLGVG" gene complement(4044281..4045210) /gene="panC" /locus_tag="Rv3602c" /db_xref="GeneID:885459" CDS complement(4044281..4045210) /gene="panC" /locus_tag="Rv3602c" /EC_number="6.3.2.1" /function="INVOLVED IN PANTOTHENATE BIOSYNTHESIS [CATALYTIC ACTIVITY: ATP + (R)-PANTOATE + BETA-ALANINE = AMP + PYROPHOSPHATE + (R)-PANTOTHENATE]." /note="catalyzes the formation of (R)-pantothenate from pantoate and beta-alanine" /codon_start=1 /transl_table=11 /product="pantoate--beta-alanine ligase" /protein_id="NP_218119.1" /db_xref="GI:15610738" /db_xref="GeneID:885459" /translation="MTIPAFHPGELNVYSAPGDVADVSRALRLTGRRVMLVPTMGALH EGHLALVRAAKRVPGSVVVVSIFVNPMQFGAGEDLDAYPRTPDDDLAQLRAEGVEIAF TPTTAAMYPDGLRTTVQPGPLAAELEGGPRPTHFAGVLTVVLKLLQIVRPDRVFFGEK DYQQLVLIRQLVADFNLDVAVVGVPTVREADGLAMSSRNRYLDPAQRAAAVALSAALT AAAHAATAGAQAALDAARAVLDAAPGVAVDYLELRDIGLGPMPLNGSGRLLVAARLGT TRLLDNIAIEIGTFAGTDRPDGYRAILESHWRN" gene complement(4045207..4046118) /locus_tag="Rv3603c" /db_xref="GeneID:885583" CDS complement(4045207..4046118) /locus_tag="Rv3603c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3603c, (MTCY07H7B.19), len: 303 aa. Conserved hypothetical ala-, leu-rich protein, identical except at N-terminus (really different) to AAK48066|MT3708 CHALCONE/STILBENE SYNTHASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (361 aa) FASTA scores: opt: 1742, E(): 8.3e-95, (100.0% identity in 275 aa overlap). Equivalent to O69525|MLCB2548.02c|ML0229 HYPOTHETICAL 32.7 KDA PROTEIN from Mycobacterium leprae (309 aa), FASTA scores: opt: 947, E(): 2.4e-48, (67.85% identity in 311 aa overlap). Also highly similar to Q9X845|SCE126.02c HYPOTHETICAL 42.2 KDA PROTEIN from Streptomyces coelicolor (420 aa), FASTA scores: opt: 683, E(): 8.5e-33, (49.3% identity in 284 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218120.1" /db_xref="GI:15610739" /db_xref="GeneID:885583" /translation="MERFDGLRPARLKVGIISAGRVGTALGVALQRADHVVVACSAIS HASRRRAQRRLPDTPVLPPLDVAASAELLLLAVTDSELAGLVSGLAATSAVRPQTIVA HTSGANGIGILAPLAQQGCIPLAIHPAMTFTGSDEDISRLPDTCFGITAADDVGYAIG QSLVLEMGGEPFCVREDARILYHAALAHASNHIVTVLADALEALRAALSGGELLGQQT VDDQPGGIVERIVGPLARAALENTLQRGQAALTGPVARGDAAAVADHLAALADVDAAL AQAYRINALRTAQRAHAPADVVEVLTA" gene complement(4046303..4047496) /locus_tag="Rv3604c" /db_xref="GeneID:885232" CDS complement(4046303..4047496) /locus_tag="Rv3604c" /function="UNKNOWN" /note="Rv3604c, (MTCY07H7B.18), len: 397 aa. Probable conserved ala-, arg-, pro-rich transmembrane protein, equivalent to O69526|MLCB2548.03c|ML0228 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (432 aa), FASTA scores: opt: 869, E(): 2.9e-31, (59.7% identity in 432 aa overlap). Contains two possible membrane-spanning domains. N-terminus shortened since first submission (previously 462 aa)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218121.2" /db_xref="GI:57117132" /db_xref="GeneID:885232" /translation="MTVLSRGARVRRGGRRPGWVLLTALLVLAIGASSALVFTDRVEL LKLAVLLALWAAVAGAFVSVLYRRQSDVDQARVRDLKLVYDLQLDREISARREYELTL ESQLRRELASELRAPAADEVAALRAELAALRTSLEILFDADLEHRPALGTVEKEARAA RALDGESPPADWVSSDRVMAVRGGDGASRTDEASIIDVPEVGVPPVSGGPRHYEAPPP PQPEPLFEPRHRPPPLPPQQERPVWQPVTSHGQWLPAETPGSQWASVEPETTPAAPPP GRRRRARHASPADQAYNPPAYVELAAQYGESGRRSRHSAEHRDHDIGGSGAGTGERPP SPPMAPPPPAEPTRRHRTADTPPDDSGGLHARDPLTGGQSVADLMARLQVESTGGGRR RRRGE" gene complement(4047705..4048181) /locus_tag="Rv3605c" /db_xref="GeneID:885844" CDS complement(4047705..4048181) /locus_tag="Rv3605c" /function="UNKNOWN" /note="Rv3605c, (MTCY07H7B.17), len: 158 aa. Probable conserved secreted or membrane protein, identical to O69527|MLCB2548.04c|ML0227 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (158 aa), FASTA scores: opt: 944, E(): 2.6e-56, (85.45% identity in 158 aa overlap). Also similar to other proteins e.g. Q9X8I2|SCE9.09 POSSIBLE SECRETED PROTEIN from Streptomyces coelicolor (162 aa), FASTA scores: opt: 174, E(): 9.2e-05, (31.25% identity in 128 aa overlap); etc. Contains possible N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218122.1" /db_xref="GI:15610741" /db_xref="GeneID:885844" /translation="MGPTRKRDLTAAVVGAAAVGYLLVAVLYRWFPPITVWTGLSLLA VAVAEALWARYVRVKISDGEIGDGPGWLHPLVVARSLMVAKASAWVGALVTGWWIGVL AYFLPRRSWLRAAAEDTTGTVVAAGSALALVVAALWLQHCCKSPQDPTEHADGAES" gene complement(4048181..4048747) /gene="folK" /locus_tag="Rv3606c" /db_xref="GeneID:885848" CDS complement(4048181..4048747) /gene="folK" /locus_tag="Rv3606c" /EC_number="2.7.6.3" /function="INVOLVED IN DIHYDROFOLATE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: ATP + 2-AMINO-4-HYDROXY-6-HYDROXYMETHYL-7,8-DIHYDROPTERIDINE = AMP + 2-AMINO-7,8-DIHYDRO-4-HYDROXY-6-(DIPHOSPHOOXYMETHYL)PTERI DI NE]." /note="Rv3606c, (MTCY07H7B.16), len: 188 aa. Probable folK, 2-amino-4-hydroxy-6-hydroxymethyldihydropterine pyrophosphokinase (EC 2.7.6.3), equivalent to O69528|HPPK_MYCLE|FOLK|ML0226\MLCB2548.05c 2-AMINO-4-HYDROXY-6-HYDROXYMETHYLDIHYDROPTERIDINE PYROPHOSPHOKINASE from Mycobacterium leprae (191 aa) FASTA scores: opt: 772, E(): 1.2e-44, (63.15% identity in 190 aa overlap). Also similar to many e.g. P71512|HPPK_METEX|FOLK|FOLA from Methylobacterium extorquens (158 aa), FASTA scores: opt: 292, E(): 1.4e-12, (36.85% identity in 171 aa overlap); O33726|HPPK_STRPY|FOLK|SPY1100 from Streptococcus pyogenes (166 aa), FASTA scores: opt: 234, E(): 1.1e-08, (34.3% identity in 175 aa overlap); Q9X8I1|SCE9.08 from Streptomyces coelicolor (203 aa), FASTA scores: opt: 232, E(): 1.7e-08, (43.25% identity in 185 aa overlap); P26281|HPPK_ECOLI|FOLK|B0142 from Escherichia coli strain K12 (158 aa), FASTA scores: opt: 198, E(): 2.6e-06, (32.85% identity in 143 aa overlap); etc. BELONGS TO THE HPPK FAMILY." /codon_start=1 /transl_table=11 /product="2-amino-4-hydroxy-6-hydroxymethyldihydropteridi ne pyrophosphokinase FolK" /protein_id="NP_218123.1" /db_xref="GI:15610742" /db_xref="GeneID:885848" /translation="MTRVVLSVGSNLGDRLARLRSVADGLGDALIAASPIYEADPWGG VEQGQFLNAVLIADDPTCEPREWLRRAQEFERAAGRVRGQRWGPRNLDVDLIACYQTS ATEALVEVTARENHLTLPHPLAHLRAFVLIPWIAVDPTAQLTVAGCPRPVTRLLAELE PADRDSVRLFRPSFDLNSRHPVSRAPES" gene complement(4048744..4049145) /gene="folB" /locus_tag="Rv3607c" /db_xref="GeneID:885345" CDS complement(4048744..4049145) /gene="folB" /locus_tag="Rv3607c" /EC_number="4.1.2.25" /function="INVOLVED IN FOLATE BIOSYNTHESIS. CATALYZES THE CONVERSION OF 7,8-DIHYDRONEOPTERIN TO 6- HYDROXYMETHYL-7,8-DIHYDROPTERIN (BY SIMILARITY) [CATALYTIC ACTIVITY: 2-AMINO-4-HYDROXY-6-(D-ERYTHRO-1,2,3-TRIHYDROXYPROPYL)-7, 8- DIHYDROPTERIDINE = 2-AMINO-4-HYDROXY-6-HYDROXYMETHYL-7,8-DIHYDROPTERIDINE + GLYCOLALDEHYDE]." /note="Rv3607c, (MTCY07H7B.15), len: 133 aa. Probable folB, dihydroneopterin aldolase (EC 4.1.2.25), equivalent to O69529|FOLB_MYCLE|ML0225|MLCB2548.06c PROBABLE DIHYDRONEOPTERIN ALDOLASE from Mycobacterium leprae (132 aa), FASTA scores: opt: 673, E(): 5.1e-37, (74.8% identity in 131 aa overlap). Also similar to many e.g. Q9X8I0|FOLB_STRCO|SCE9.07 from Streptomyces coelicolor (119 aa), FASTA scores: opt: 334, E(): 4.5e-15, (46.15% identity in 117 aa overlap); P74342|FOLB_SYNY3|SLR1626 from Synechocystis sp. strain PCC 6803 (118 aa) FASTA scores: opt: 287, E(): 5e-12, (38.45% identity in 117 aa overlap); P28823|FOLB_BACSU|FOLA from Bacillus subtilis (120 aa), FASTA scores: opt: 283, E(): 9.2e-12, (39.0% identity in 118 aa overlap); etc. BELONGS TO THE DHNA FAMILY. Note that previously known as folX.; folX" /codon_start=1 /transl_table=11 /product="dihydroneopterin aldolase FolB" /protein_id="YP_177996.1" /db_xref="GI:57117133" /db_xref="GeneID:885345" /translation="MADRIELRGLTVHGRHGVYDHERVAGQRFVIDVTVWIDLAEAAN SDDLADTYDYVRLASRAAEIVAGPPRKLIETVGAEIADHVMDDQRVHAVEVAVHKPQA PIPQTFDDVAVVIRRSRRGGRGWVVPAGGAV" gene complement(4049138..4049980) /gene="folP1" /locus_tag="Rv3608c" /db_xref="GeneID:885831" CDS complement(4049138..4049980) /gene="folP1" /locus_tag="Rv3608c" /EC_number="2.5.1.15" /function="INVOLVED IN DIHYDROFOLATE BIOSYNTHESIS (AT THE SECOND STEP). CATALYZES THE FORMATION OF THE IMMEDIATE PRECURSOR OF FOLIC ACID. IT IS IMPLICATED IN RESISTANCE TO SULFONAMIDE [CATALYTIC ACTIVITY: 2-AMINO-4-HYDROXY-6-HYDROXYMETHYL-7,8-DIHYDROPTERIDINE DIPHOSPHATE + 4-AMINOBENZOATE = PYROPHOSPHATE + DIHYDROPTEROATE]." /note="Rv3608c, (MTCY07H7B.14), len: 280 aa. Probable folP1, dihydropteroate synthase 1 (EC 2.5.1.15), equivalent to O69530|FOLP (alias Q9S0T0|FOLP and Q9R2U9|FOLP) DIHYDRONEOPTERIN ALDOLASE from Mycobacterium leprae (284 aa), FASTA scores: opt: 1418, E(): 7.2e-77, (76.75% identity in 284 aa overlap). Also highly similar to many e.g. Q9X8H8|SCE9.05 from Streptomyces coelicolor (288 aa), FASTA scores: opt: 953, E(): 2.4e-49, (56.0% identity in 266 aa overlap); Q9A3I0|CC3224 from Caulobacter crescentus (274 aa), FASTA scores: opt: 682, E(): 2.6e-33, (45.5% identity in 268 aa overlap); P73248|DHPS_SYNY3|FOLP|SLR2026 from Synechocystis sp. strain PCC 6803 (289 aa), FASTA scores: opt: 665, E(): 2.7e-32, (44.55% identity in 265 aa overlap); P26282|DHPS_ECOLI|FOLP|B3177 from Escherichia coli strain K12 (282 aa), FASTA scores: opt: 642, E(): 6.1e-31, (41.95% identity in 274 aa overlap); etc. Contains PS00792 Dihydropteroate synthase signature 1, PS00793 Dihydropteroate synthase signature 2. SIMILAR TO OTHER SPECIES DHPS." /codon_start=1 /transl_table=11 /product="dihydropteroate synthase" /protein_id="YP_177997.1" /db_xref="GI:57117134" /db_xref="GeneID:885831" /translation="MSPAPVQVMGVLNVTDDSFSDGGCYLDLDDAVKHGLAMAAAGAG IVDVGGESSRPGATRVDPAVETSRVIPVVKELAAQGITVSIDTMRADVARAALQNGAQ MVNDVSGGRADPAMGPLLAEADVPWVLMHWRAVSADTPHVPVRYGNVVAEVRADLLAS VADAVAAGVDPARLVLDPGLGFAKTAQHNWAILHALPELVATGIPVLVGASRKRFLGA LLAGPDGVMRPTDGRDTATAVISALAALHGAWGVRVHDVRASVDAIKVVEAWMGAERI ERDG" misc_feature complement(4049816..4049857) /gene="folP1" /locus_tag="Rv3608c" /note="PS00793 Dihydropteroate synthase signature 2." misc_feature complement(4049912..4049959) /gene="folP1" /locus_tag="Rv3608c" /note="PS00792 Dihydropteroate synthase signature 1." gene complement(4049977..4050585) /gene="folE" /locus_tag="Rv3609c" /db_xref="GeneID:885346" CDS complement(4049977..4050585) /gene="folE" /locus_tag="Rv3609c" /EC_number="3.5.4.16" /function="INVOLVED IN THE BIOSYNTHESIS OF TETRAHYDROFOLATE (AT THE FIRST STEP) [CATALYTIC ACTIVITY: GTP + 2 H(2)O = FORMATE + 2-AMINO-4-HYDROXY-6-(ERYTHRO-1,2,3-TRIHYDROXYPROPYL)DIHYD RO PTERIDINE TRIPHOSPHATE]." /note="involved in the first step of tetrahydrofolate biosynthesis; catalyzes the formation of formate and 2-amino-4-hydroxy-6-(erythro-1,2, 3-trihydroxypropyl)dihydropteridine triphosphate from GTP and water; forms a homopolymer" /codon_start=1 /transl_table=11 /product="GTP cyclohydrolase I" /protein_id="NP_218126.1" /db_xref="GI:15610745" /db_xref="GeneID:885346" /translation="MSQLDSRSASARIRVFDQQRAEAAVRELLYAIGEDPDRDGLVAT PSRVARSYREMFAGLYTDPDSVLNTMFDEDHDELVLVKEIPMYSTCEHHLVAFHGVAH VGYIPGDDGRVTGLSKIARLVDLYAKRPQVQERLTSQIADALMKKLDPRGVIVVIEAE HLCMAMRGVRKPGSVTTTSAVRGLFKTNAASRAEALDLILRK" misc_feature complement(4050175..4050201) /gene="folE" /locus_tag="Rv3609c" /note="PS00860 GTP cyclohydrolase I signature 2." gene complement(4050601..4052883) /gene="ftsH" /locus_tag="Rv3610c" /db_xref="GeneID:885732" CDS complement(4050601..4052883) /gene="ftsH" /locus_tag="Rv3610c" /EC_number="3.4.24.-" /function="THOUGHT TO ACT AS AN ATP-DEPENDENT ZINC METALLOPEPTIDASE, WITH ATPase AND PROTEOLYTIC ACTIVITIES. PROBABLY HAS A REGULATORY ROLE IN STRESS RESPONSE AND SPECIFIC PROTEINS SECRETION FOR ADAPTATION TO HOST ENVIRONMENT." /experiment="experimental evidence, no additional details recorded" /note="Rv3610c, (MT3714, MTCY07H7B.12), len: 760 aa. ftsH, membrane-bound protease (cell division protein) (EC 3.4.24.-) (see citations below), equivalent to Q9CD58|FTSH_MYCLE|ML0222 (alias O69532|FTSH) CELL DIVISION PROTEIN FTSH HOMOLOG from Mycobacterium leprae (787 aa), FASTA scores: opt: 4388, E(): 9.6e-205, (87.2% identity in 790 aa overlap). Also highly similar to many FTSH proteins e.g. O52395|FTSH from Mycobacterium smegmatis (769 aa), FASTA scores: opt: 3976, E(): 7.6e-185, (82.4% identity in 761 aa overlap); Q9X8I4|SCE9.11c from Streptomyces coelicolor (668 aa), FASTA scores: opt: 2417, E(): 1.4e-109, (57.2% identity in 668 aa overlap); P72991|FTH4_SYNY3|SLR1604 from Synechocystis sp. strain PCC 6803 (616 aa), FASTA scores: opt: 1926, E(): 7.2e-86, (49.35% identity in 612 aa overlap); P28691|FTSH_ECOLI|HFLB|MRSC|TOLZ|B3178 from Escherichia coli strain K12 (644 aa), FASTA scores: opt: 1859, E(): 1.3e-82, (48.95% identity in 605 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), and PS00674 AAA-protein family signature. BELONGS TO THE AAA FAMILY OF ATPASES AND PEPTIDASE FAMILY M41 (ZINC METALLOPROTEASE). COFACTOR: BINDS ONE ZINC ION (POTENTIAL)." /codon_start=1 /transl_table=11 /product="membrane-bound protease FTSH (cell division protein)" /protein_id="NP_218127.1" /db_xref="GI:15610746" /db_xref="GeneID:885732" /translation="MNRKNVTRTITAIAVVVLLGWSFFYFSDDTRGYKPVDTSVAITQ INGDNVKSAQIDDREQQLRLILKKGNNETDGSEKVITKYPTGYAVDLFNALSAKNAKV STVVNQGSILGELLVYVLPLLLLVGLFVMFSRMQGGARMGFGFGKSRAKQLSKDMPKT TFADVAGVDEAVEELYEIKDFLQNPSRYQALGAKIPKGVLLYGPPGTGKTLLARAVAG EAGVPFFTISGSDFVEMFVGVGASRVRDLFEQAKQNSPCIIFVDEIDAVGRQRGAGLG GGHDEREQTLNQLLVEMDGFGDRAGVILIAATNRPDILDPALLRPGRFDRQIPVSNPD LAGRRAVLRVHSKGKPMAADADLDGLAKRTVGMTGADLANVINEAALLTARENGTVIT GPALEEAVDRVIGGPRRKGRIISEQEKKITAYHEGGHTLAAWAMPDIEPIYKVTILAR GRTGGHAVAVPEEDKGLRTRSEMIAQLVFAMGGRAAEELVFREPTTGAVSDIEQATKI ARSMVTEFGMSSKLGAVKYGSEHGDPFLGRTMGTQPDYSHEVAREIDEEVRKLIEAAH TEAWEILTEYRDVLDTLAGELLEKETLHRPELESIFADVEKRPRLTMFDDFGGRIPSD KPPIKTPGELAIERGEPWPQPVPEPAFKAAIAQATQAAEAARSDAGQTGHGANGSPAG THRSGDRQYGSTQPDYGAPAGWHAPGWPPRSSHRPSYSGEPAPTYPGQPYPTGQADPG SDESSAEQDDEVSRTKPAHG" misc_feature complement(4051924..4051980) /gene="ftsH" /locus_tag="Rv3610c" /note="PS00674 AAA-protein family signature." misc_feature complement(4052254..4052277) /gene="ftsH" /locus_tag="Rv3610c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." repeat_region complement(4052949..4052966) /note="18 bp direct repeat 2, GGGTTTGCGATCGCCACG" gene 4052950..4053603 /locus_tag="Rv3611" /db_xref="GeneID:885469" CDS 4052950..4053603 /locus_tag="Rv3611" /function="UNKNOWN" /note="Rv3611, (MTCY07H7B.11c), len: 217 aa. Hypothetical unknown arg-, pro-rich protein. Possible ORF containing several direct repeats." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218128.1" /db_xref="GI:15610747" /db_xref="GeneID:885469" /translation="MAIANPAEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITP EPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAAGRHHQPRGDRKPRAWR QCGPQNGPRRSQAITPEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAA GRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAAGRHWLDQRPVVPDGVGKSDS" repeat_region complement(4052971..4052994) /note="(24 bp) part of 111 bp direct repeat unit 6, GTGGCGACCCGCTGCACCCGGCTC" repeat_region complement(4052995..4053105) /note="111 bp direct repeat unit 5, GTGGCGACCCGCTGCACCCGGCTCTGGGGTGATTGCCTGGCTCCTCCTCGGCCCGTT TTGCGGGCCGCATTGTCGCCAGGCGCGGGGTTTGCGATCGCCACGGGGCTGATG" repeat_region complement(4053004..4053021) /note="18 bp direct repeat 1, GGGTTTGCGATCGCCACG" repeat_region complement(4053106..4053216) /note="111 bp direct repeat unit 4, GTGGCGACCCGCTGCACCCGGCTCTGGGGTGATTGCCTGGCTCCTCCTCGGCCCGTT TTGCGGGCCGCATTGTCGCCAGGCGCGGGGTTTGCGATCGCCACGGGGCTGATG" repeat_region complement(4053217..4053327) /note="111 bp direct repeat unit 3, GTGGCGACCCGCTGCACCCGGCTCTGGGGTGATTGCCTGGCTCCTCCTCGGCCCGTT TTGCGGGCCGCATTGTCGCCAGGCGCGGGGTTTGCGATCGCCACGGGGCTGATG" repeat_region complement(4053328..4053438) /note="111 bp direct repeat unit 2, GTGGCGACCCGCTGCACCCGGCTCTGGGGTGATTGCCTGGCTCCTCCTCGGCCCGTT TTGCGGGCCGCATTGTCGCCAGGCGCGGGGTTTGCGATCGCCACGGGGCTGATG" repeat_region complement(4053439..4053549) /note="111 bp direct repeat unit 1, GTGGCGACCCGCTGCACCCGGCTCTGGGGTGATTGCCTGGCTCCTCCTCGGCCCGTT TTGCGGGCCGCATTGTCGCCAGGCGCGGGGTTTGCGATCGCCACGGGGCTGATG" gene complement(4053518..4053847) /locus_tag="Rv3612c" /db_xref="GeneID:885381" CDS complement(4053518..4053847) /locus_tag="Rv3612c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3612c, (MTCY07H7B.10), len: 109 aa. Conserved hypothetical protein. Residues 58 to 81 highly similar to N-terminal part of AAK46718|MT2424 HYPOTHETICAL 3.9 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (36 aa), FASTA scores: opt: 108, E(): 0.38, (69.25% identity in 26 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218129.1" /db_xref="GI:15610748" /db_xref="GeneID:885381" /translation="MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWAD RVSPGAVTHATGAMCPTLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLASGD PLHPALG" gene complement(4053881..4054042) /locus_tag="Rv3613c" /db_xref="GeneID:885124" CDS complement(4053881..4054042) /locus_tag="Rv3613c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3613c, (MTCY07H7B.09), len: 53 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218130.1" /db_xref="GI:15610749" /db_xref="GeneID:885124" /translation="MCTMPKLWRAFMAGRPLGSTFTPRQPTGAAPNHVRALDDSIDPS SAPAARAAL" gene complement(4054142..4054696) /locus_tag="Rv3614c" /db_xref="GeneID:885777" CDS complement(4054142..4054696) /locus_tag="Rv3614c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3614c, (MTCY07H7B.08), len: 184 aa. Conserved hypothetical protein, equivalent to Q49730|ML0407|B1620_C3_264|MLCL383.03 HYPOTHETICAL 24.2 KDA PROTEIN from Mycobacterium leprae (216 aa) FASTA scores: opt: 899, E(): 1.7e-51, (71.3% identity in 188 aa overlap); and similar to two hypothetical proteins from Mycobacterium leprae: Q9CDD6|ML0056 (169 aa), FASTA scores: opt: 285, E(): 1.2e-11, (38.35% identity in 172 aa overlap); and O33090|MLCB628.19c (338 aa), FASTA scores: opt: 289, E(): 1.2e-11, (38.95% identity in 172 aa overlap). Also highly similar to O69732|Rv3867|MTV027.02 HYPOTHETICAL 19.9 KDA PROTEIN from Mycobacterium tuberculosis (183 aa), FASTA scores: opt: 563, E(): 1e-29, (54.9% identity in 173 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218131.1" /db_xref="GI:15610750" /db_xref="GeneID:885777" /translation="MDLPGNDFDSNDFDAVDLWGADGAEGWTADPIIGVGSAATPDTG PDLDNAHGQAETDTEQEIALFTVTNPPRTVSVSTLMDGRIDHVELSARVAWMSESQLA SEILVIADLARQKAQSAQYAFILDRMSQQVDADEHRVALLRKTVGETWGLPSPEEAAA AEAEVFATRYSDDCPAPDDESDPW" gene complement(4054812..4055123) /locus_tag="Rv3615c" /db_xref="GeneID:885770" CDS complement(4054812..4055123) /locus_tag="Rv3615c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3615c, (MTCY07H7B.07), len: 103 aa. Conserved hypothetical protein, equivalent to Q49723|ML0406|B1620_C2_214|MLCL383 HYPOTHETICAL 11.1 KDA PROTEIN from Mycobacterium leprae (106 aa), FASTA scores: opt: 364, E(): 4.1e-18, (60.85% identity in 92 aa overlap). Also shows similarity to P96212|Rv3865|MTCY01A6.03 HYPOTHETICAL 10.6 KDA PROTEIN from Mycobacterium tuberculosis (103 aa), FASTA scores: opt: 198, E(): 6.8e-07, (36.25% identity in 102 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218132.1" /db_xref="GI:15610751" /db_xref="GeneID:885770" /translation="MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITH GPYCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF T" gene complement(4055197..4056375) /locus_tag="Rv3616c" /db_xref="GeneID:885377" CDS complement(4055197..4056375) /locus_tag="Rv3616c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3616c, (MTCY07H7B.06), len: 392 aa. Conserved hypothetical ala-, gly-rich protein, equivalent to Q49722|ML0405|B1620_C2_213|MLCL383.01 HYPOTHETICAL 40.8 KDA PROTEIN from Mycobacterium leprae (394 aa) FASTA scores: opt: 1620, E(): 5.3e-75, (62.7% identity in 394 aa overlap). Also similar to P96213|Rv3864|MTCY01A6.04c HYPOTHETICAL 42.1 KDA PROTEIN from Mycobacterium tuberculosis (402 aa), FASTA scores: opt: 389, E(): 1.1e-12, (31.75% identity in 400 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218133.1" /db_xref="GI:15610752" /db_xref="GeneID:885377" /translation="MSRAFIIDPTISAIDGLYDLLGIGIPNQGGILYSSLEYFEKALE ELAAAFPGDGWLGSAADKYAGKNRNHVNFFQELADLDRQLISLIHDQANAVQTTRDIL EGAKKGLEFVRPVAVDLTYIPVVGHALSAAFQAPFCAGAMAVVGGALAYLVVKTLINA TQLLKLLAKLAELVAAAIADIISDVADIIKGTLGEVWEFITNALNGLKELWDKLTGWV TGLFSRGWSNLESFFAGVPGLTGATSGLSQVTGLFGAAGLSASSGLAHADSLASSASL PALAGIGGGSGFGGLPSLAQVHAASTRQALRPRADGPVGAAAEQVGGQSQLVSAQGSQ GMGGPVGMGGMHPSSGASKGTTTKKYSEGAAAGTEDAERAPVEADAGGGQKVLVRNVV" gene 4057733..4058701 /gene="ephA" /locus_tag="Rv3617" /db_xref="GeneID:885769" CDS 4057733..4058701 /gene="ephA" /locus_tag="Rv3617" /function="BIOTRANSFORMATION ENZYME THAT CATALYZES THE HYDROLYSIS OF EPOXIDES (ALKENE OXIDES, OXIRANES) AND ARENE OXIDES TO LESS REACTIVE AND MORE WATER SOLUBLE DIHYDRODIOLS BY THE TRANS ADDITION OF WATER. THOUGHT TO BE INVOLVED IN DETOXIFICATION REACTIONS FOLLOWING OXIDATIVE DAMAGE TO LIPIDS [CATALYTIC ACTIVITY: AN EPOXIDE + H(2)O = A GLYCOL]." /note="Rv3617, (MTCY07H7B.05c, MTCY15C10.35c), len: 322 aa. Probable ephA, epoxide hydrolase (EC 3.3.2.3) (see citation below), similar to many e.g. Q9A8W9|CC1229 from Caulobacter crescentus (330 aa), FASTA scores: opt: 965, E(): 1.8e-51, (46.15% identity in 323 aa overlap); Q9M9W5|F18C1.13 from Arabidopsis thaliana (Mouse-ear cress) (331 aa), FASTA scores: opt: 778, E(): 4.3e-40, (40.35% identity in 332 aa overlap); Q9S7P1 from Oryza sativa (Rice) (322 aa), FASTA scores: opt: 774, E(): 7.4e-40, (41.1% identity in 321 aa overlap); P80299|HYES_RAT|EPHX2 from Rattus norvegicus (Rat) (554 aa), FASTA scores: opt: 759, E(): 9.5e-39, (40.5% identity in 306 aa overlap) (similarity only with the C-terminal part for this one); etc. SIMILAR TO ALPHA/BETA HYDROLASE FOLD. Contains PS00888 Cyclic nucleotide-binding domain signature 1." /codon_start=1 /transl_table=11 /product="epoxide hydrolase EphA" /protein_id="NP_218134.1" /db_xref="GI:15610753" /db_xref="GeneID:885769" /translation="MGAPTERLVDTNGVRLRVVEAGEPGAPVVILAHGFPELAYSWRH QIPALADAGYHVLAPDQRGYGGSSRPEAIEAYDIHRLTADLVGLLDDVGAERAVWVGH DWGAVVVWNAPLLHADRVAAVAALSVPALPRAQVPPTQAFRSRFGENFFYILYFQEPG IADAELNGDPARTMRRMIGGLRPPGDQSAAMRMLAPGPDGFIDRLPEPAGLPAWISQE ELDHYIGEFTRTGFTGGLNWYRNFDRNWETTADLAGKTISVPSLFIAGTADPVLTFTR TDRAAEVISGPYREVLIDGAGHWLQQERPGEVTAALLEFLTGLELR" misc_feature 4057784..4057834 /gene="ephA" /locus_tag="Rv3617" /note="PS00888 Cyclic nucleotide-binding domain signature 1." gene 4058698..4059885 /locus_tag="Rv3618" /db_xref="GeneID:885276" CDS 4058698..4059885 /locus_tag="Rv3618" /function="UNKNOWN" /note="Rv3618, (MTCY15C10.34c, MTCY07H7B.04c), len: 395 aa. Possible monooxygenase (EC 1.-.-.-), similar to others (principally bacterial luciferases alpha chain) e.g. Q9JN87|MMYO PUTATIVE ALKANAL MONOOXYGENASE from Streptomyces coelicolor (373 aa), FASTA scores: opt: 949, E(): 8.9e-54, (41.7% identity in 374 aa overlap); Q9EUT9|LIMB LIMONENE MONOOXYGENASE from Rhodococcus erythropolis (387 aa), FASTA scores: opt: 856, E(): 9.1e-48, (42.0% identity in 388 aa overlap); AAK72698 LUXA-LIKE PROTEIN from Bradyrhizobium japonicum (458 aa) FASTA scores: opt: 350, E(): 4.4e-15, (29.7% identity in 347 aa overlap); Q9K4C1|2SC6G5.34c PUTATIVE ALKANAL MONOOXYGENASE (LUCIFERASE) from Streptomyces coelicolor (342 aa), FASTA scores: opt: 291, E(): 2.2e-11, (26.5% identity in 362 aa overlap); etc. Also similar to P95278|Rv1936|MTCY09F9.28c HYPOTHETICAL 41.8 KDA PROTEIN from Mycobacterium tuberculosis (369 aa), FASTA scores: opt: 473, E(): 4.3e-23, (32.55% identity in 378 aa overlap)." /codon_start=1 /transl_table=11 /product="monooxygenase" /protein_id="NP_218135.1" /db_xref="GI:15610754" /db_xref="GeneID:885276" /translation="MKAPLRFGVFITPFHPTGQSPTVALQYDMERVVALDRLGYDEAW FGEHHSGGYELIACPEVFIAAAAERTTHIRLGTGVVSLPYHHPLMVADRWVLLDHLTR GRVMFGTGPGALPSDAYMMGIDPVEQRRMMQESLEAILALFRAAPDERIDRHSDWFTL REAQLHIRPYTWPYPEIATAAMISPSGPRLAGALGTSLLSLSMSVPGGYAALETAWGV VREQAAKAGRGEPDRADWRVLSIMHLSDSRDQAIDDCTYGLPDFSRYFGAAGFVPLAN TVEGTQSSREFVEQYAAKGNCCIGTPDDAIAHIEDLLHRSGGFGTLLLLGHDWAPPPA TFHSYELFARAVIPYFKGQLAAPRASHEWARGKRDQLIGRAGEAVVKAITEHVAEQGE AGS" gene complement(4059984..4060268) /gene="esxV" /locus_tag="Rv3619c" /db_xref="GeneID:885328" CDS complement(4059984..4060268) /gene="esxV" /locus_tag="Rv3619c" /function="UNKNOWN" /note="Rv3619c, (MTCY15C10.33, MTCY07H7B.03, MT3721), len: 94 aa. esxV, ESAT-6 like protein (see citations below), highly similar to many Mycobacterial ESAT-6 like proteins e.g. O53942|ES65_MYCTU PUTATIVE ESAT-6 LIKE PROTEIN 5 from Mycobacterium tuberculosis (94 aa), FASTA scores: opt: 582, E(): 4.4e-33, (92.55% identity in 94 aa overlap); Q49946|ES6X_MYCLE|U1756D PUTATIVE ESAT-6 LIKE PROTEIN X from Mycobacterium leprae (95 aa), FASTA scores: opt: 409, E(): 2.5e-21, (64.15% identity in 92 aa overlap); etc. Strictly identical to P96364|ES61_MYCTU|Rv1037c|MT1066|MTCY10G2.12 PUTATIVE ESAT-6 LIKE PROTEIN 1 (94 aa). BELONGS TO THE ESAT6 FAMILY.; ES6_1, Mtb9.9D" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXV (ESAT-6 like protein 1)" /protein_id="NP_218136.1" /db_xref="GI:15610755" /db_xref="GeneID:885328" /translation="MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGG AGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA" gene complement(4060295..4060591) /gene="esxW" /locus_tag="Rv3620c" /db_xref="GeneID:885787" CDS complement(4060295..4060591) /gene="esxW" /locus_tag="Rv3620c" /function="UNKNOWN" /note="Rv3620c, (MTCY15C10.32, MTCY07H7B.02, MT3722), len: 98 aa. esxW, ESAT-6 like protein (see citation below). Member of the M. tuberculosis hypothetical QILSS protein family with Rv1038c, Rv1792, Rv2347c and Rv1197|O05299|ES63_MYCTU|MT1235|MTCI364.09 PUTATIVE ESAT-6 LIKE PROTEIN 3 from Mycobacterium tuberculosis (98 aa), FASTA scores: opt: 638, E(): 2.3e-36, (97.95% identity in 98 aa overlap). Also similar to Q49945|ES6Y_MYCLE PUTATIVE ESAT-6 LIKE PROTEIN Y from Mycobacterium leprae (100 aa), FASTA scores: opt: 370, E(): 2.1e-18, (57.9% identity in 95 aa overlap); etc. BELONGS TO THE ESAT6 FAMILY.; ES6_10, QILSS" /codon_start=1 /transl_table=11 /product="putative ESAT-6 like protein ESXW (ESAT-6 like protein 10)" /protein_id="NP_218137.1" /db_xref="GI:15610756" /db_xref="GeneID:885787" /translation="MTSRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAG WSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS" gene complement(4060648..4061889) /gene="PPE65" /locus_tag="Rv3621c" /db_xref="GeneID:885097" CDS complement(4060648..4061889) /gene="PPE65" /locus_tag="Rv3621c" /function="UNKNOWN" /note="Rv3621c, (MTCY15C10.31, MTCY07H7B.01), len: 413 aa. Member of the Mycobacterium tuberculosis PPE family, ala-, gly-rich proteins, similar to many e.g. Q10813|YS92_MYCTU|Rv2892c|MT2959|MTCY274.23c (408 aa) FASTA scores: opt: 955, E(): 1.8e-42, (44.45% identity in 423 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_177998.1" /db_xref="GI:57117135" /db_xref="GeneID:885097" /translation="MLDFAQLPPEVNSALMYAGPGSGPMLAAAAAWEALAAELQTTAS TYDALITGLADGPWQGSSAASMVAAATPQVAWLRSTAGQAEQAGSQAVAAASAYEAAF FATVPPPEIAANRALLMALLATNFLGQNTAAIAATEAQYAEMWAQDAAAMYGYAGASA AATQLSPFNPAAQTINPAGLASQAASVGQAVSGAANAQALTDIPKALFGLSGIFTNEP PWLTDLGKALGLTGHTWSSDGSGLIVGGVLGDFVQGVTGSAELDASVAMDTFGKWVSP ARLMVTQFKDYFGLAHDLPKWASEGAKAAGEAAKALPAAVPAIPSAGLSGVAGAVGQA ASVGGLKVPAVWTATTPAASPAVLAASNGLGAAAAAEGSTHAFGGMPLMGSGAGRAFN NFAAPRYGFKPTVIAQPPAGG" gene complement(4061899..4062198) /gene="PE32" /locus_tag="Rv3622c" /db_xref="GeneID:885712" CDS complement(4061899..4062198) /gene="PE32" /locus_tag="Rv3622c" /function="UNKNOWN" /note="Rv3622c, (MTCY15C10.30), len: 99 aa. Member of the Mycobacterium tuberculosis PE family (see citation below), but no glycine rich C-terminus present. Similar to others e.g. O53938|Rv1788|MTV049.10 (99 aa), FASTA scores: opt: 376, E(): 7.1e-17, (65.6% identity in 96 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_177999.1" /db_xref="GI:57117136" /db_xref="GeneID:885712" /translation="MSIMHAEPEMLAATAGELQSINAVARAGNAAVAGPTTGVVPAAA DLVSLLTASQFAAHAQLYQAISAEAMAVQEQLATTLGISAGSYAATEAANAATIA" gene 4062527..4063249 /gene="lpqG" /locus_tag="Rv3623" /db_xref="GeneID:885233" CDS 4062527..4063249 /gene="lpqG" /locus_tag="Rv3623" /function="UNKNOWN" /note="Rv3623, (MTCY15C10.29c), len: 240 aa. Probable lpqG, conserved lipoprotein, showing some similarity with hypothetical proteins e.g. Q57432 from Methanosarcina barkeri (251 aa), FASTA scores: opt: 319, E(): 6.8e-12, (31.2% identity in 218 aa overlap); Q9PEA5|XF1123 OUTER MEMBRANE PROTEIN from Xylella fastidiosa (242 aa) FASTA scores: opt: 312, E(): 1.7e-11, (28.25% identity in 237 aa overlap); BAB49547|MLR2408 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (236 aa), FASTA scores: opt: 304, E(): 5e-11, (27.05% identity in 244 aa overlap); etc. Has suitable signal peptide and prokaryotic membrane lipoprotein lipid attachment site (PS00013)." /codon_start=1 /transl_table=11 /product="lipoprotein LpqG" /protein_id="NP_218140.1" /db_xref="GI:15610759" /db_xref="GeneID:885233" /translation="MIRLVRHSIALVAAGLAAALSGCDSHNSGSLGADPRQVTVFGSG QVQGVPDTLIADVGIQVTAADVTSAMNQTNDRQQAVIDALVGAGLDRKDIRTTRVTVA PQYSNPEPAGTATITGYRADNDIEVKIHPTDAASRLLALVVSTGGDATRISSVSYSIG DDSQLVKDARARAFQDAKNRADQYAQLSGLRLGKVISISEASGAAPTHEAPAPPRGLS AVPLEPGQQTVGFSVTVVWELT" misc_feature 4062563..4062595 /gene="lpqG" /locus_tag="Rv3623" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site." gene complement(4063254..4063904) /gene="hpt" /locus_tag="Rv3624c" /db_xref="GeneID:885398" CDS complement(4063254..4063904) /gene="hpt" /locus_tag="Rv3624c" /EC_number="2.4.2.8" /function="INVOLVED IN PURINE SALVAGE [CATALYTIC ACTIVITY: IMP + PYROPHOSPHATE = HYPOXANTHINE + 5-PHOSPHO-ALPHA-D-RIBOSE 1-DIPHOSPHATE (GUANINE CAN REPLACE HYPOXANTHINE TO PRODUCE GMP)]." /note="Catalyzes the salvage synthesis of inosine-5'-monophosphate (IMP) and guanosine-5'-monophosphate (GMP) from the purine bases hypoxanthine and guanine, respectively" /codon_start=1 /transl_table=11 /product="hypoxanthine-guanine phosphoribosyltransferase" /protein_id="NP_218141.1" /db_xref="GI:15610760" /db_xref="GeneID:885398" /translation="MTPALVVGPAAWHAVHVTQSSSAITPGQTAELYPGDIKSVLLTA EQIQARIAELGEQIGNDYRELSATTGQDLLLITVLKGAVLFVTDLARAIPVPTQFEFM AVSSYGSSTSSSGVVRILKDLDRDIHGRDVLIVEDVVDSGLTLSWLSRNLTSRNPRSL RVCTLLRKPDAVHANVEIAYVGFDIPNDFVVGYGLDYDERYRDLSYIGTLDPRVYQ" misc_feature complement(4063473..4063511) /gene="hpt" /locus_tag="Rv3624c" /note="PS00103 Purine/pyrimidine phosphoribosyl transferases signature." gene complement(4063901..4064872) /gene="mesJ" /locus_tag="Rv3625c" /db_xref="GeneID:885409" CDS complement(4063901..4064872) /gene="mesJ" /locus_tag="Rv3625c" /function="THOUGHT TO BE INVOLVED IN A CELL CYCLE PROCESS." /note="Rv3625c, (MT3727, MTCY15C10.27), len: 323 aa. Possible mesJ, cell cycle protein, equivalent to O69538|Y0C5_MYCLE|ML0213|MLCB2548.18c HYPOTHETICAL 34.1 KDA PROTEIN from Mycobacterium leprae (323 aa) FASTA scores: opt: 1592, E(): 9e-92, (78.0% identity in 327 aa overlap). Similar to bacterial hypothetical proteins Q9X8I6|SCE9.13c from Streptomyces coelicolor (352 aa) FASTA scores: opt: 705, E(): 1.4e-36, (47.85% identity in 305 aa overlap); and Q9HXZ3|PA3638 from Pseudomonas aeruginosa (442 aa), FASTA scores: opt: 382, E(): 2e-16, (40.6% identity in 271 aa overlap). But also similar (or with similarity) to bacterial cell cycle proteins (MESJ) e.g. Q9KPX0|VC2242 MESJ PROTEIN from Vibrio cholerae (440 aa), FASTA scores: opt: 363, E(): 3e-15, (34.8% identity in 253 aa overlap); Q9RV23|DR1207 (600 aa) CELL CYCLE PROTEIN MESJ (PUTATIVE/CYTOSINE DEAMINASE-RELATED PROTEIN) from Deinococcus radiodurans (600 aa), FASTA scores: opt: 310, E(): 7.6e-12, (36.6% identity in 265 aa overlap) (similar only at the N-terminal end); Q9PFJ8|XF0659 CELL CYCLE PROTEIN from Xylella fastidiosa (437 aa), FASTA scores: opt: 301, E(): 2.1e-11, (35.05% identity in 271 aa overlap); P52097|MESJ_ECOLI|B0188 PUTATIVE CELL CYCLE PROTEIN MESJ from Escherichia coli strain K12(432 aa) FASTA scores: opt: 299, E(): 2.8e-11, (34.65% identity in 277 aa overlap); etc. BELONGS TO THE UPF0072 (MESJ/YCF62) FAMILY." /codon_start=1 /transl_table=11 /product="cell cycle protein MESJ" /protein_id="NP_218142.1" /db_xref="GI:15610761" /db_xref="GeneID:885409" /translation="MDRQSAVAQLRAAAEQFARVHLDACDRWSVGLSGGPDSLALTAV AARLWPTTALIVDHGLQPGSATVAETARIQAISLGCVDARVLCVQVGAAGGREAAARS ARYSALEEHRDGPVLLAHTLDDQAETVLLGLGRGSGARSIAGMRPYDPPWCRPLLGVR RSVTHAACRELGLTAWQDPHNTDRRFTRTRLRTEVLPLLEDVLGGGVAEALARTATAL REDTDLIDTIAAQALPGAAVAGSRGQELSTSALTALPDAVRRRVIRGWLLAGGATGLT DRQIRGVDRLVTAWRGQGGVAVGSTLRGQRLVAGRRDGVLVLRREPV" gene complement(4064851..4065903) /locus_tag="Rv3626c" /db_xref="GeneID:885792" CDS complement(4064851..4065903) /locus_tag="Rv3626c" /function="UNKNOWN" /note="Rv3626c, (MTCY15C10.26), len: 350 aa. Conserved hypothetical protein, similar to Q9X8I7|SCE9.14c HYPOTHETICAL PROTEIN from Streptomyces coelicolor (375 aa) FASTA scores: opt: 720, E(): 2.2e-38, (41.55% identity in 361 aa overlap); and shows some similarity to Q9HPS0|VNG1497C HYPOTHETICAL PROTEIN (317 aa) FASTA scores: opt: 226, E(): 4.5e-07, (29.7% identity in 347 aa overlap). Contains neutral zinc metallopeptidases, zinc-binding region signature (PS00142)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218143.1" /db_xref="GI:15610762" /db_xref="GeneID:885792" /translation="MTGASELTLGNTVDWEFAASVGERLARPAPPSTEYTRRQVIDEL TVAAEKAEPPVRDVTGLIADGVVPPARVVDRPAWIRSAAESMRAMTHGSAKPRGFLTG RITGAQTGAVLAFVASGILGQYDPFGAAGEGCLLLVYPNVIAVERQLRVEPSDFRLWV CLHEVTHRVQFTANPWLSGYMSQALNLLTFEPVDDIGRVVSRLADFIRSRGHGTDDSE VNPSGILGLVRAVQSEPQRKALDQLLVLGTLLEGHAEHVMDAVGPMVVPSVATIRRRF DDRRHHKQPPLQRLVRALLGFDAKLSQYTRGKAFVDHVVDRAGMKLFNTIWSGPETLP LPAEIENPQRWIDRVL" misc_feature complement(4065397..4065426) /locus_tag="Rv3626c" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." gene complement(4065900..4067285) /locus_tag="Rv3627c" /db_xref="GeneID:885728" CDS complement(4065900..4067285) /locus_tag="Rv3627c" /function="UNKNOWN (POSSIBLY INVOLVED IN CELL WALL BIOSYNTHESIS)." /note="Rv3627c, (MTCY15C10.25), len: 461 aa. Hypothetical ala-rich protein which may have cleavable signal peptide at N-terminal end. Equivalent to O69539|MLCB2548.20c|ML0211 HYPOTHETICAL 47.2 KDA PROTEIN from Mycobacterium leprae (461 aa), FASTA scores: opt: 2295, E(): 3.5e-116, (76.2% identity in 462 aa overlap); and C-terminal end shows similarity with O05758|MLCB5.28c HYPOTHETICAL 24.1 KDA PROTEIN from Mycobacterium leprae (225 aa), FASTA scores: opt: 268, E(): 1.8e-07, (32.25% identity in 220 aa overlap). Also similar (or with similarity) to various proteins (notably penicillin binding proteins) e.g. Q9X8I8|SCE9.15c HYPOTHETICAL 45.9 KDA PROTEIN from Streptomyces coelicolor (459 aa) FASTA scores: opt: 707, E(): 8.3e-31, (35.75% identity in 439 aa overlap); Q9Z541|SC9B2.18c PUTATIVE CARBOXYPEPTIDASE from Streptomyces coelicolor (451 aa), FASTA scores: opt: 450, E(): 5.3e-17, (31.75% identity in 469 aa overlap); Q9JVV4|NMA0665 PUTATIVE PEPTIDASE from Neisseria meningitidis (serogroup A) (or Q9JY10|NMB1797 from serogroup B) (469 aa), FASTA scores: opt: 269, E(): 3e-07, (26.15% identity in 463 aa overlap); O85665|PBP3 PENICILLIN BINDING PROTEIN 3 from Neisseria gonorrhoeae (469 aa), FASTA scores: opt: 265, E(): 4.9e-07, (31.85% identity in 201 aa overlap); P45161|PBP4_HAEIN|DACB|HI1330 PENICILLIN-BINDING PROTEIN 4 PRECURSOR/PEPTIDASE (479 aa) FASTA scores: opt: 230, E(): 3.8e-05, (27.9% identity in 394 aa overlap); P24228|PBP4_ECOLI|DACB|B3182 PENICILLIN-BINDING PROTEIN 4 PRECURSOR from Escherichia coli strain K12 (477 aa), FASTA scores: opt: 166, E(): 0.1, (28.2% identity in 408 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218144.1" /db_xref="GI:15610763" /db_xref="GeneID:885728" /translation="MGPTRWRKSTHVVVGAAVLAFVAVVVAAAALVTTGGHRAGVRAP APPPRPPTVKAGVVPVADTAATPSAAGVTAALAVVAADPDLGKLAGRITDALTGQELW QRLDDVPLVPASTNKILTAAAALLTLDRQARISTRVVAGGQNPQGPVVLVGAGDPTLS AAPPGQDTWYHGAARIGDLVEQIRRSGVTPTAVQVDASAFSGPTMAPGWDPADIDNGD IAPIEAAMIDAGRIQPTTVNSRRSRTPALDAGRELAKALGLDPAAVTIASAPAGARQL AVVQSAPLIQRLSQMMNASDNVMAECIGREVAVAINRPQSFSGAVDAVTSRLNTAHID TAGAALVDSSGLSLDNRLTARTLDATMQAAAGPDQPALRPLLDLLPIAGGSGTLGERF LDAATDQGPAGWLRAKTGSLTAINSLVGVLTDRSGRVLTFAFISNEAGPNGRNAMDAL ATKLWFCGCTT" gene 4067423..4067911 /gene="ppa" /locus_tag="Rv3628" /db_xref="GeneID:885775" CDS 4067423..4067911 /gene="ppa" /locus_tag="Rv3628" /EC_number="3.6.1.1" /function="INVOLVED IN THE FUNCTION OF CELLULAR BIOENERGETICS [CATALYTIC ACTIVITY: PYROPHOSPHATE + H(2)O = 2 ORTHOPHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the hydrolysis of pyrophosphate" /codon_start=1 /transl_table=11 /product="inorganic pyrophosphatase" /protein_id="NP_218145.1" /db_xref="GI:15610764" /db_xref="GeneID:885775" /translation="MQFDVTIEIPKGQRNKYEVDHETGRVRLDRYLYTPMAYPTDYGF IEDTLGDDGDPLDALVLLPQPVFPGVLVAARPVGMFRMVDEHGGDDKVLCVPAGDPRW DHVQDIGDVPAFELDAIKHFFVHYKDLEPGKFVKAADWVDRAEAEAEVQRSVERFKAG TH" gene complement(4067957..4069054) /locus_tag="Rv3629c" /db_xref="GeneID:885351" CDS complement(4067957..4069054) /locus_tag="Rv3629c" /function="UNKNOWN" /note="Rv3629c, (MTCY15C10.23), len: 365 aa. Probable conserved integral membrane protein, equivalent to O69543|MLCB2548.26|ML0205 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (356 aa), FASTA scores: opt: 1547, E(): 3e-89, (66.2% identity in 361 aa overlap). Also similar to other membrane and hypothetical proteins e.g. CAC37534|SCIF3.15c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (363 aa), FASTA scores: opt: 819, E(): 7.7e-44, (51.55% identity in 351 aa overlap); Q9CGK3|YKJK HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (339 aa) FASTA scores: opt: 683, E(): 2.2e-35, (48.3% identity in 350 aa overlap); Q9KY24|SCC8A.24c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (380 aa) FASTA scores: opt: 528, E(): 1.1e-25, (50.25% identity in 372 aa overlap); Q9RJH8|SCF73.09 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (370 aa) FASTA scores: opt: 439, E(): 3.9e-20, (50.2% identity in 384 aa overlap); Q9PE36|XF1192 INTEGRAL MEMBRANE PROTEIN from Xylella fastidiosa (341 aa), FASTA scores: opt: 337, E(): 8.3e-14, (47.65% identity in 361 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218146.1" /db_xref="GI:15610765" /db_xref="GeneID:885351" /translation="MSTFRIFGFSLLMTVVALVTGYLHGGPTALFLLAVLALLEVSLS FDNAIINAAILQRMSPFWQRMFLTIGILIAVFGMRLVFPLAIIWTTAGLDPVRAMELA LRPPAHGALEFADGSPSYEKLITAAHPQIAAFGGMFLLMLFLDFVVHDRDIKWLKWIE VPFARIGRLGQVPVIVASVGLVLAGALLTHSSDQRGTVLIAGLLGMVTYLVVNGISRA FRPAGLGEATPGVQARQAAGKAGCALFLYLEVLDAAFSFDGVTGAFAITTDPIIIALG LGVVGAMFVRSITIYLVRQDTLDRYVYLEHGAHWAIGALAIILLLSIDHRFAVPEWVT ASVGVVFIGAAFTESVRRNRLTVRSPTKFGS" gene 4069175..4070470 /locus_tag="Rv3630" /db_xref="GeneID:885802" CDS 4069175..4070470 /locus_tag="Rv3630" /function="UNKNOWN" /note="Rv3630, (MTCY15C10.22c), len: 431 aa. Probable conserved integral membrane, highly similar to P71789|YF10_MYCTU|Rv1510|MTCY277.32 HYPOTHETICAL 44.3 KDA PROTEIN from Mycobacterium tuberculosis (432 aa) FASTA scores: opt: 1940, E(): 2.3e-103, (70.75% identity in 424 aa overlap). Note that N-terminal end is highly similar to AAK45825|MT1558 HYPOTHETICAL 18.1 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (172 aa) FASTA scores: opt: 649, E(): 4.2e-30, (61.65% identity in 167 aa overlap); and C-terminal end is highly similar to AAK45826|MT1560 HYPOTHETICAL 25.8 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (256 aa), FASTA scores: opt: 1269, E(): 2.6e-65, (76.7% identity in 253 aa overlap). Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site, so could be a protease." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_218147.1" /db_xref="GI:15610766" /db_xref="GeneID:885802" /translation="MAVGAAAVTEVGDTASPVGSSGASGGAIASGSVARVGTAAAVTA LCGYAVIYLAARNLAPNGFSVFGVFWGAFGLVTGAANGLLQETTREVRSLGYLDVSAD GRRTHPLRVSGMVGLGSLVVIAGSSPLWSGRVFAEARWLSVALLSIGLAGFCLHATLL GMLAGTNRWTQYGALMVADAVIRVVVAAATFVIGWQLVGFIWATVAGSVAWLIMLMTS PPTRAAARLMTPGATATFLRGAAHSIIAAGASAILVMGFPVLLKLTSNELGAQGGVVI LAVTLTRAPLLVPLTAMQGNLIAHFVDERTERIRALIAPAALIGGVGAVGMLAAGVVG PWIMRVAFGSEYQSSSALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYSLGWVGATVG SGLLLLLPLSLETRTVVALLCGPLVGIGVHLVALARTDE" misc_feature 4069892..4069924 /locus_tag="Rv3630" /note="PS00639 Eukaryotic thiol (cysteine) proteases histidine active site." gene 4070514..4071239 /locus_tag="Rv3631" /db_xref="GeneID:885314" CDS 4070514..4071239 /locus_tag="Rv3631" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3631, (MTCY15C10.21c), len: 241 aa. Possible transferase (EC 2.-.-.-), more specifically a glycosyltransferase (EC 2.4.-.-), equivalent to O69542|MLCB2548.24c|ML0207 PUTATIVE TRANSFERASE (PUTATIVE GLYCOSYLTRANSFERASE) from Mycobacterium leprae (239 aa) FASTA scores: opt: 1303, E(): 2.8e-72, (81.2% identity in 239 aa overlap). Also similar to many dolichyl-phosphate mannose synthases and hypothetical proteins e.g. O59263|PH1585 HYPOTHETICAL 34.6 KDA PROTEIN from Pyrococcus horikoshii (313 aa), FASTA scores: opt: 472, E(): 1.2e-21, (36.65% identity in 232 aa overlap); Q9V152|PAB1971 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE from Pyrococcus abyssi (287 aa), FASTA scores: opt: 467, E(): 2.3e-21, (35.85% identity in 223 aa overlap); Q58619|YC22_METJA|MJ1222 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (243 aa), FASTA scores: opt: 400, E(): 2.4e-17, (33.35% identity in 228 aa overlap); O26474|MTH374 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE RELATED PROTEIN from Methanobacterium thermoautotrophicum (291 aa) FASTA scores: opt: 354, E(): 1.7e-14, (33.5% identity in 218 aa overlap); O26239|MTH136 DOLICHYL-PHOSPHATE MANNOSE SYNTHASE from Methanobacterium thermoautotrophicum (220 aa), FASTA scores: opt: 345, E(): 4.8e-14, (33.5% identity in 221 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="NP_218148.1" /db_xref="GI:15610767" /db_xref="GeneID:885314" /translation="MASKMDTETHYSDVWVVIPAFNEAAVIGKVVTDVRSVFDHVVCV DDGSTDGTGDIARRSGAHLVRHPINLGQGAAIQTGIEYARKQPGAQVFATFDGDGQHR VKDVAAMVDRLGAGDVDVVIGTRFGRPVGKASASRPPLMKRIVLQTGARLSRRGRRLG LTDTNNGLRVFNKTVADGLNITMSGMSHATEFIMLIAENHWRVAEEPVEVLYTEYSKS KGQPLLNGVNIIFDGFLRGRMPR" gene 4071236..4071580 /locus_tag="Rv3632" /db_xref="GeneID:885711" CDS 4071236..4071580 /locus_tag="Rv3632" /function="UNKNOWN" /note="Rv3632, (MTCY15C10.20c), len: 114 aa. Possible conserved membrane protein, equivalent to O69541|MLCB2548.23c|ML0208 HYPOTHETICAL 12.9 KDA PROTEIN (PUTATIVE MEMBRANE PROTEIN) from Mycobacterium leprae (113 aa), FASTA scores: opt: 594, E(): 7.1e-35, (82.0% identity in 111 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218149.1" /db_xref="GI:15610768" /db_xref="GeneID:885711" /translation="MNWIQVLLIASIIGLLFYLLRSRRSARSRAWVKVGYVLFVLAGI YAVLRPDDTTVVANWFGVRRGTDLMLYALVMAFSFTTLSTYMRFKDLELRYARIARAL ALEGAQAPEQCR" gene 4071791..4072666 /locus_tag="Rv3633" /db_xref="GeneID:885252" CDS 4071791..4072666 /locus_tag="Rv3633" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3633, (MTCY15C10.19c), len: 291 aa. Conserved hypothetical protein, similar to Q9X5S6|MMCH from Streptomyces lavendulae (254 aa), FASTA scores: opt: 368, E(): 3.2e-16, (35.05% identity in 194 aa overlap); Q9APW1 HYPOTHETICAL 32.7 KDA PROTEIN from Pseudomonas aeruginosa (295 aa), FASTA scores: opt: 359, E(): 1.3e-15, (37.65% identity in 170 aa overlap); Q9APV4 HYPOTHETICAL 34.1 KDA PROTEIN from Pseudomonas aeruginosa (309 aa), FASTA scores: opt: 316, E(): 7.6e-13, (28.65% identity in 262 aa overlap). And some similarity to Q9HGD7|FUM9 FUM9P from Gibberella moniliformis (300 aa), FASTA scores: opt: 254, E(): 6.5e-09, (29.95% identity in 157 aa overlap); and P47181|YJ9S_YEAST|YJR154W|J2240 HYPOTHETICAL 39.0 KDA PROTEIN from Saccharomyces cerevisiae (Baker's yeast) (346 aa), FASTA scores: opt: 190, E(): 8.5e-05, (26.75% identity in 127 aa overlap). Also similar to P71782|YF01_MYCTU|Rv1501|MT1550|MTCY277.23 from Mycobacterium tuberculosis (273 aa), FASTA scores: opt: 286, E(): 5.5e-11, (27.5% identity in 280 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218150.1" /db_xref="GI:15610769" /db_xref="GeneID:885252" /translation="MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERE LPTVIANSTTVVKGLARPGHVPVDRVDHDWVRIDNLLLHGTRYEALPVHPKLLPVIEG VLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIALCDFTADNGA TQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWHTAAANRTDAPRPALTINF CVGFVRQQVNQQLSIPRELVRCFEPRLQELIGYGLYAGKMGRIDWRPPADYLDADRHP FLDAVADRLQTSVRL" gene complement(4072667..4073611) /gene="galE1" /locus_tag="Rv3634c" /db_xref="GeneID:885765" CDS complement(4072667..4073611) /gene="galE1" /locus_tag="Rv3634c" /EC_number="5.1.3.2" /function="INVOLVED IN GALACTOFURANOSYL BIOSYNTHESIS: CONVERTS UDO-GlcP TO UDP-GalP [CATALYTIC ACTIVITY: UDP-GLUCOPYRANOSE = UDP-GALACTOPYRANOSE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3634c, (MTCY15C10.18), len: 314 aa. galE1, UDP-glucose 4-epimerase (EC 5.1.3.2) (see citations below), equivalent to O69544|ML0204|RMLB2|MLCB2548.27c PUTATIVE SUGAR DEHYDRATASE (PUTATIVE SUGAR-NUCLEOTIDE DEHYDRATASE) from Mycobacterium leprae (319 aa), FASTA scores: opt: 1798, E(): 8.2e-100, (86.4% identity in 309 aa overlap). Also similar to other UDP-GLUCOSE 4-EPIMERASES e.g. Q9WYX9|TM0509 from Thermotoga maritima (309 aa) FASTA scores: opt: 877, E(): 4.8e-45, (45.8% identity in 308 aa overlap); Q57664|GALE_METJA|MJ0211 from Methanococcus jannaschii (305 aa), FASTA scores: opt: 792, E(): 5.4e-40, (42.05% identity in 309 aa overlap); Q9K6S7|BH3649 from Bacillus halodurans (311 aa), FASTA scores: opt: 723, E(): 7e-36, (40.5% identity in 316 aa overlap); Q9HSV1|GALE2|VNG0063G from Halobacterium sp. strain NRC-1 (328 aa), FASTA scores: opt: 597, E(): 2.3e-28, (36.35% identity in 322 aa overlap); etc. Contains short-chain alcohol dehydrogenase family signature (PS00061) but this maynot be significant. BELONGS TO THE SUGAR EPIMERASE FAMILY. Note that previously known as rmlB2, a DTDP-glucose 4,6-dehydratase (EC 4.2.1.46) (see Ma et al., 2001).; rmlB2" /codon_start=1 /transl_table=11 /product="UDP-glucose 4-epimerase GALE1 (galactowaldenase) (UDP-galactose 4-epimerase) (uridine diphosphate galactose 4-epimerase) (uridine diphospho-galactose 4-epimerase)" /protein_id="NP_215015.2" /db_xref="GI:57117137" /db_xref="GeneID:885765" /translation="MRALVTGAAGFIGSTLVDRLLADGHSVVGLDNFATGRATNLEHL ADNSAHVFVEADIVTADLHAILEQHRPEVVFHLAAQIDVRRSVADPQFDAAVNVIGTV RLAEAARQTGVRKIVHTSSGGSIYGTPPEYPTPETAPTDPASPYAAGKVAGEIYLNTF RHLYGLDCSHIAPANVYGPRQDPHGEAGVVAIFAQALLSGKPTRVFGDGTNTRDYVFV DDVVDAFVRVSADVGGGLRFNIGTGKETSDRQLHSAVAAAVGGPDDPEFHPPRLGDLK RSCLDIGLAERVLGWRPQIELADGVRRTVEYFRHKHTD" gene 4073634..4075409 /locus_tag="Rv3635" /db_xref="GeneID:885675" CDS 4073634..4075409 /locus_tag="Rv3635" /function="UNKNOWN" /note="Rv3635, (MTCY15C10.17c), len: 591 aa (start unclear). Probable conserved transmembrane protein, equivalent, but longer 25 aa, to O69545|ML0203|MLCB2548.28 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (569 aa), FASTA scores: opt: 2933, E(): 4.6e-173, (77.0% identity in 569 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218152.1" /db_xref="GI:15610771" /db_xref="GeneID:885675" /translation="MPAPRMPRVALVAVLLITVQLVVRVVLAFGGYFYWDDLILVGRA GTGGLLSPSYLFDDHDGHVMPGAFLVAGAIIRVAPLVWTGPAISLVVLQLLESLALLR ALYVISSWRPVLLIPLTFALFTPLAVPGFAWWAAALNSLPMLAALAWVCADAILLVRT GNHRYAVTGVLVYLGGLLFFEKAAVIPFVSFAVAALQCHVRGDRSALATVWRAGVRLW TPSLALTVGWVALYLAVVDQRRWSSDLSMTWDLLCRSVTHGIVPALAGGPWDWARWAP ASPWATPPAVVMVLGWLVLIAVLALSLVRKRRIGPVWLTAAGYAVACQVPIFLMRSSP FTALELAQTLRYFPDLVVVLALLAAVALQAPNRAGTRWLDASPARAVATVASAVLFLT SSLYSTATFLASWRDNPTEGYLKNAQASLAAAASGAPLLDQEVDPLVLQRVAWPENLA SHMFALLRVRPEFATTTTQLRMFTSTGRLVDAKVTWVRTIIAGPVPQCGYFVQPDRPE RLILDGPLLPGDWTVELNYLANSDGSMALALSDGPERKVPVHPGLNRVYARLPGAGDA ITVRANTTALSLCIGAAPVGFLAPA" repeat_region 4075615..4077750 /note="IS1534, len: 2136 bp. Putative Insertion sequence element, IS1534 (IS15C10.2), that resembles IS21; possibly defective." /mobile_element="insertion sequence:IS1534" repeat_region 4075615..4075630 /note="16 bp inverted repeat at the left end of putative IS element IS1534; GAAAATTGACCAGCTT." gene 4075752..4076099 /locus_tag="Rv3636" /db_xref="GeneID:885274" CDS 4075752..4076099 /locus_tag="Rv3636" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv3636, (MTCY15C10.16c), len: 115 aa. Possible transposase, weakly similar to others e.g. O69924|SC3C8.12 PUTATIVE TRANSPOSASE from Streptomyces coelicolor (487 aa) FASTA scores: opt: 132, E(): 0.12, (33.05% identity in 112 aa overlap); O96916 TC1-LIKE TRANSPOSASE from Anopheles gambiae (African malaria mosquito) (332 aa), FASTA scores: opt: 117, E(): 0.84, (30.75% identity in 91 aa overlap); Q9R2U5|IS466A|IS466A-ORF|TNPA|IS469|SCP1.276 TRANSPOSASE (INSERTION ELEMENT IS466S TRANSPOSASE) from Streptomyces coelicolor (513 aa), FASTA scores: opt: 114, E(): 2, (30.5% identity in 82 aa overlap); etc. Similar in part to P96288|Rv2943|MTCY24G1.06c HYPOTHETICAL 45.8 KDA PROTEIN from Mycobacterium tuberculosis (413 aa), FASTA scores: opt: 533, E(): 1.4e-28, (74.55% identity in 110 aa overlap). Contains possible helix-turn-helix motif from aa 19-40 (+4.98 SD)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218153.1" /db_xref="GI:15610772" /db_xref="GeneID:885274" /translation="MLSVEDWAEIRRLRRSERLPISEIARVLKISRNTVKSALASDGP PKYQRAAKGSVADEAEPRIRELLAAYPRMPATVIAERIGWWYSIRTLSGRVRELRPLY LPPDPASRDICGR" gene 4076484..4076984 /locus_tag="Rv3637" /db_xref="GeneID:885496" CDS 4076484..4076984 /locus_tag="Rv3637" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv3637, (MTCY15C10.15c), len: 166 aa. Possible transposase. C-terminal end highly similar to Q9RLQ9|ISTA PUTATIVE TRANSPOSASE A (FRAGMENT) from Mycobacterium bovis (102 aa), FASTA scores: opt: 397, E(): 1.4e-19, (58.8% identity in 102 aa overlap). Weakly similar to others e.g. Q9KJ02 PUTATIVE TRANSPOSASE (FRAGMENT) from Polyangium cellulosum (329 aa), FASTA scores: opt: 191, E(): 1.6e-05, (32.1% identity in 134 aa overlap); Q9LCU2|ISTA COINTEGRASE from Pseudomonas aeruginosa (382 aa) FASTA scores: opt: 144, E(): 0.024, (26.8% identity in 123 aa overlap); P15025|ISTA_PSEAE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS21 from Pseudomonas aeruginosa (390 aa), FASTA scores: opt: 144, E(): 0.025, (26.85% identity in 123 aa overlap); etc. Also highly similar to C-terminal end of P96288|Rv2943|MTCY24G1.06c HYPOTHETICAL 45.8 KDA PROTEIN from Mycobacterium tuberculosis (413 aa) FASTA scores: opt: 722, E(): 1.5e-40, (63.7% identity in 168 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218154.1" /db_xref="GI:15610773" /db_xref="GeneID:885496" /translation="MPGRVFASPADFNTQLQAWLVRANHRQHRVLGCRPADRIEADTA AMLTLPPVGPSIGWRTSTRLPRDHYVRLDGNDYSVHPVAIGRRIEITADLSRVRVWCG GTLVADHDRIWAKHQTISDPEHVVAAKLLRRKRFDIVGPPHHVEVEQRLLTTYDTVLG LDGPVA" gene 4076984..4077730 /locus_tag="Rv3638" /db_xref="GeneID:885803" CDS 4076984..4077730 /locus_tag="Rv3638" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv3638, (MTCY15C10.14c), len: 248 aa. Possible transposase, highly similar to Q9RLQ8|ISTB ISTB PROTEIN from Mycobacterium bovis (266 aa), FASTA scores: opt: 784, E(): 4e-46, (78.0% identity in 259 aa overlap); and similar to others e.g. P15026|ISTB_PSEAE INSERTION SEQUENCE IS21 PUTATIVE ATP-BINDING PROTEIN from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 420, E(): 2.2e-21, (38.8% identity in 255 aa overlap); Q45619|ISTB_BACST INSERTION SEQUENCE IS5376 PUTATIVE ATP-BINDING PROTEIN from Bacillus stearothermophilus (251 aa), FASTA scores: opt: 402, E(): 3.6e-20, (34.5% identity in 232 aa overlap); P15026|ISTB_ECOLI ISTB PROTEIN from Escherichia coli (265 aa), FASTA scores: opt: 419, E(): 8e-23, (38.8% identity in 255 aa overlap); etc. C-terminus highly similar to C-terminus of P96287|Rv2944|MTCY24G1.05 HYPOTHETICAL 25.5 KDA PROTEIN from Mycobacterium tuberculosis strain H37Rv (alias AAK47343|MT3016 IS1533, ORFB from Mycobacterium tuberculosis strain CDC1551) (238 aa), FASTA scores: opt: 784, E(): 3.6e-46, (87.4% identity in 135 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218155.1" /db_xref="GI:15610774" /db_xref="GeneID:885803" /translation="MAAKTATNSRDVAAELAYLTRALKAPTLRGAIEQLADRARTKTW SYEEFLAACLQREVSARESHGGEGRIRAARFPSRKSLEEFDFDHARGLKRDTIAHLGT LDFVTLAIGIAIRACQAGHRVLFATASQWVDRLAAAHHSGTLQSELIRLARYPLLVVD EVGYIPFEPEAANLFFQLVSSRYERASLIVTSNKPFGRWGEVFGDDVVAAAMIDRLVH HAEVIALKGDSYRIKDRDLGRVPTVTADDQ" repeat_region complement(4077735..4077750) /note="16 bp inverted repeat at the right end of putative IS element IS1534; GAAAATTGACCAGCTT." gene complement(4077884..4078450) /locus_tag="Rv3639c" /db_xref="GeneID:885244" CDS complement(4077884..4078450) /locus_tag="Rv3639c" /function="UNKNOWN" /note="Rv3639c, (MTCY15C10.13), len: 188 aa. Hypothetical protein, with C-terminus highly similar to N-terminus of P95044|Rv0698|MTCY210.15 HYPOTHETICAL 22.3 KDA PROTEIN from Mycobacterium tuberculosis (203 aa), FASTA scores: opt: 224, E(): 4.5e-07, (54.8% identity in 73 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218156.1" /db_xref="GI:15610775" /db_xref="GeneID:885244" /translation="MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSAT TCNYPPAAKDSAQDGFRHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFGGP TPAPRGLATRQCPPRTVHVDRVRPNGAERALRARFRPILRPQFTLGDGANGLPLAACT KTGAYVPHLPYSPIAVDPQPSAGQQGPS" repeat_region complement(4078506..4079798) /note="IS1553, len: 1293 bp. Putative Insertion sequence element, IS1553." /mobile_element="insertion sequence:IS1553" repeat_region 4078506..4078518 /note="13 bp inverted repeat at the right end of putative IS element IS1553; GAGTTCGTCGGTG." gene complement(4078520..4079749) /locus_tag="Rv3640c" /db_xref="GeneID:885324" CDS complement(4078520..4079749) /locus_tag="Rv3640c" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION SEQUENCE." /note="Rv3640c, (MTCY15C10.12), len: 409 aa. Probable transposase, highly similar to others e.g. Q48882 TRANSPOSASE from Mycobacterium avium (411 aa) FASTA scores: opt: 1574, E(): 6.2e-93, (59.75% identity in 400 aa overlap); Q9AKV5 PUTATIVE TRANSPOSASE (FRAGMENT) from Mycobacterium paratuberculosis (395 aa), FASTA scores: opt: 1566, E(): 1.9e-92, (60.0% identity in 395 aa overlap); Q48368 TRANSPOSASE from Mycobacterium avium (410 aa), FASTA scores: opt: 1561, E(): 4.1e-92, (59.4% identity in 404 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218157.1" /db_xref="GI:15610776" /db_xref="GeneID:885324" /translation="MALPQSALSELLDAFRTGDGVDLIRDAVRLVLQELSELEATERI GAARYERSDTRVTDRNGARSRVLSTQAGDVELRIPKLRKGSFFPAILEPRRRIDQALY AVVMEAYVHGISTRAVDDLVEAMGVETGISKSEVSRICAGLDEIVGAFRTRTLGHIEF PYVYLDATYLNVRNGTGQVVSMAVIVASGIAADGSREILGLDVGDSEDETFWRGFLTS LKGRGLGGVRLVISDQHAGLVKALKRCFQGAGHQRCRVHFARNLLAHVPKDKADMVAS MFRMIFSAPDAEAVHATWEGVRDRLAASFPKIGPLMDDARAEVLAFTAFPKAHWQKIW STNPLERINKEIKRRSRVVGIFPNPAAVIRLVGAVLADMHDEWQASERRYLSEASMAL LYPDSDNAVVAAISGGQ" repeat_region complement(4079786..4079798) /note="13 bp inverted repeat at the left end of putative IS element IS1553; GAGATCGTCGGTG." gene complement(4079925..4080560) /gene="fic" /locus_tag="Rv3641c" /db_xref="GeneID:885540" CDS complement(4079925..4080560) /gene="fic" /locus_tag="Rv3641c" /function="COULD BE INVOLVED IN CELL FILAMENTATION INDUCED BY CYCLIC AMP AND MAY HAVE SOME ROLE IN CELL DIVISION." /note="Rv3641c, (MTCY15C10.11), len: 211 aa. Possible fic, cell filamentation protein, similar to others e.g. Q9PCU8|XF1657 CELL FILAMENTATION PROTEIN from Xylella fastidiosa (203 aa), FASTA scores: opt: 324, E(): 2.2e-14, (32.8% identity in 189 aa overlap); P20605|FIC_ECOLI|B3361 from Escherichia coli strain K12 (200 aa), FASTA scores: opt: 323, E(): 2.5e-14, (31.0% identity in 187 aa overlap); P20751|FIC_SALTY from Salmonella typhimurium (200 aa), FASTA scores: opt: 322, E(): 2.9e-14, (32.65% identity in 193 aa overlap); etc." /codon_start=1 /transl_table=11 /product="cell filamentation protein FIC" /protein_id="NP_218158.1" /db_xref="GI:15610777" /db_xref="GeneID:885540" /translation="MPHPWDTGDHERNWQGYFIPAMSVLRNRVGARTHAELRDAENDL VEARVIELREDPNLLGDRTDLAYLRAIHRQLFQDIYVWAGDLRTVGIEKEDESFCAPG GISRPMEHVAAEIYQLDRLRAVGEGDLAGQVAYRYDYVNYAHPFREGNGRSTREFFDL LLSERGSGLDWGKTDLEELHGACHVARANSDLTGLVAMFKGILDAEPTYDF" gene complement(4080571..4080765) /locus_tag="Rv3642c" /db_xref="GeneID:885220" CDS complement(4080571..4080765) /locus_tag="Rv3642c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3642c, (MTCY15C10.10), len: 64 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218159.1" /db_xref="GI:15610778" /db_xref="GeneID:885220" /translation="MFVQATELQKVKRRFRNVRATRRNTELEGTRSTAATRADQNDYA RGKITAAELGERVRRRYNIQ" gene 4081160..4081351 /locus_tag="Rv3643" /db_xref="GeneID:885600" CDS 4081160..4081351 /locus_tag="Rv3643" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3643, (MTCY15C10.09c), len: 63 aa (questionable ORF). Identical to AAK48106 from Mycobacterium tuberculosis strain CDC1551 (33 aa) but longer 30 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218160.1" /db_xref="GI:15610779" /db_xref="GeneID:885600" /translation="MERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYS RPIRAFRPLKSNRPGDIPT" gene complement(4081365..4081437) /locus_tag="Rvnt39" /note="tRNA-Thr(CGT)" /db_xref="GeneID:2700461" tRNA complement(4081365..4081437) /locus_tag="Rvnt39" /product="tRNA-Thr" /note="codon recognized: ACG" /anticodon=(pos:4081402..4081404,aa:Thr) /db_xref="GeneID:2700461" gene complement(4081516..4082721) /locus_tag="Rv3644c" /db_xref="GeneID:885553" CDS complement(4081516..4082721) /locus_tag="Rv3644c" /EC_number="2.7.7.7" /function="DNA POLYMERASE IS A COMPLEX, MULTICHAIN ENZYME RESPONSIBLE FOR MOST OF THE REPLICATIVE SYNTHESIS IN BACTERIA." /note="catalyzes the DNA-template-directed extension of the 3'-end of a DNA strand; the delta' subunit seems to interact with the gamma subunit to transfer the beta subunit on the DNA" /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit delta'" /protein_id="NP_218161.1" /db_xref="GI:15610780" /db_xref="GeneID:885553" /translation="MSGVFTRLVGQQAVEAELLATAKAARRDSAHSAGGGGTMTHAWL LTGPPGSGRSVAALCFAAALQCTSGGEPGCGRCRACTTTLAGTHADVRRVIPEGLSIG VDEMRAIVQIAARRPTTGHWQIVVIEDADRLTEGAANALLKVVEEPPPSTVFLLCAPS VDPEDIAVTLRSRCRHVALVTPSTHAIAQVLSDGDGLDPDTANWAASVSGGHVGRARR LATDPQARQRRERALGLARDAATPSRAYAAAEELVAGAEAEALALTAQRIEAETEELR TALGAGGTGKGTGAALRGATGAMKDLERRQKSRQTRASRDALDRALIDLATYFRDALL VAAHAGGVRANHPDMADRVAALAAHAPPERLLRCIEAVLACREALAVNVKPKFAVDAM VATIGQELR" gene 4082807..4084456 /locus_tag="Rv3645" /db_xref="GeneID:885620" CDS 4082807..4084456 /locus_tag="Rv3645" /function="UNKNOWN" /note="Rv3645, (MTCY15C10.07c), len: 549 aa. Probable conserved transmembrane protein, equivalent, but longer 20 aa, to O69547|ML0201|MLCB2548.30 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (530 aa), FASTA scores: opt: 2958, E(): 1.5e-168, (85.5% identity in 530 aa overlap). Also closely related to several other hypothetical M. tuberculosis proteins, e.g. Q10631|YD18_MYCTU|Rv1318c|MT1359|MTCY130.03c (541 aa) FASTA scores: opt: 1105, E(): 2.7e-58, (39.35% identity in 506 aa overlap); Q10633|YD20_MYCTU|Rv1320c|MT1362|MTCY130.05c (567 aa) FASTA scores: opt: 1031, E(): 7.1e-54, (38.1% identity in 509 aa overlap); Q10632|YD19_MYCTU|Rv1319c|MTCY130.04c (535 aa), FASTA scores: opt: 1016, E(): 5.3e-53, (37.1% identity in 531 aa overlap); etc. Also similar at C-terminal end to many adenylate cyclases (EC 4.6.1.1) e.g. O83498|TP0485 from Treponema pallidum (614 aa) FASTA scores: opt: 365, E(): 3.2e-14, (31.55% identity in 317 aa overlap); P94180|CYAA from Anabaena sp. strain PCC 7120 (735 aa), FASTA scores: opt: 364, E(): 4.2e-14, (32.75% identity in 229 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218162.1" /db_xref="GI:15610781" /db_xref="GeneID:885620" /translation="MDAEAFVGFRQVPAARYGGLMATTAALPRRIHAFVRWVVRTPWP LFSLSMLQSDIIGALFVLGFLRYGLPPQDNIQLQDLPPVNLLIFVSTVIILFLAGAVV NLKLLMPVFRWQRRDNLLTEPDPAATELARSRALRMPLYRTLISLAVWATGGGVFILA SWSVAKHAAPVVAVATALGATATAIIGYLQSERVLRPVAVAALRSGVPENVNAPGVIL RLMLAWIPSTGVPLLAIVLAVAADKIALLHATPEALFNPILMMALAALGIGSVSTLLV AMSIADPLRQLRWALSEVQRGNYNAHMQIYDASELGLLQAGFNDMVRELSERQRLRDL FGRYVGEDVARRALERGTELGGQERDVAVLFVDLVGSTQLAATRPPAEVVQLLNEFFR VVVETVARHGGFVNKFQGDAALAIFGAPIEHPDGAGAALSAARELHDELIPVLGSAEF GIGVSAGRAIAGHIGAQARFEYTVIGDPVNEAARLTELAKLEDGHVLASAIAVSGALD AEALCWDVGEVVELRGRAAPTQLARPMNLAAPEEVSSEVRG" gene complement(4084453..4087257) /gene="topA" /locus_tag="Rv3646c" /db_xref="GeneID:885608" CDS complement(4084453..4087257) /gene="topA" /locus_tag="Rv3646c" /EC_number="5.99.1.2" /function="THE REACTION CATALYZED BY TOPOISOMERASES LEADS TO THE CONVERSION OF ONE TOPOLOGICAL ISOMER OF DNA TO ANOTHER. [CATALYTIC ACTIVITY: ATP-INDEPENDENT BREAKAGE OF SINGLE-STRANDED DNA, FOLLOWED BY PASSAGE AND REJOINING]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the ATP-dependent breakage of single-stranded DNA followed by passage and rejoining, maintains net negative superhelicity" /codon_start=1 /transl_table=11 /product="DNA topoisomerase I" /protein_id="NP_218163.1" /db_xref="GI:15610782" /db_xref="GeneID:885608" /translation="MADPKTKGRGSGGNGSGRRLVIVESPTKARKLASYLGSGYIVES SRGHIRDLPRAASDVPAKYKSQPWARLGVNVDADFEPLYIISPEKRSTVSELRGLLKD VDELYLATDGDREGEAIAWHLLETLKPRIPVKRMVFHEITEPAIRAAAEHPRDLDIDL VDAQETRRILDRLYGYEVSPVLWKKVAPKLSAGRVQSVATRIIVARERDRMAFRSAAY WDILAKLDASVSDPDAAPPTFSARLTAVAGRRVATGRDFDSLGTLRKGDEVIVLDEGS ATALAAGLDGTQLTVASAEEKPYARRPYPPFMTSTLQQEASRKLRFSAERTMSIAQRL YENGYITYMRTDSTTLSESAINAARTQARQLYGDEYVAPAPRQYTRKVKNAQEAHEAI RPAGETFATPDAVRRELDGPNIDDFRLYELIWQRTVASQMADARGMTLSLRITGMSGH QEVVFSATGRTLTFPGFLKAYVETVDELVGGEADDAERRLPHLTPGQRLDIVELTPDG HATNPPARYTEASLVKALEELGIGRPSTYSSIIKTIQDRGYVHKKGSALVPSWVAFAV TGLLEQHFGRLVDYDFTAAMEDELDEIAAGNERRTNWLNNFYFGGDHGVPDSVARSGG LKKLVGINLEGIDAREVNSIKLFDDTHGRPIYVRVGKNGPYLERLVAGDTGEPTPQRA NLSDSITPDELTLQVAEELFATPQQGRTLGLDPETGHEIVAREGRFGPYVTEILPEPA ADAAAAAQGVKKRQKAAGPKPRTGSLLRSMDLQTVTLEDALRLLSLPRVVGVDPASGE EITAQNGRYGPYLKRGNDSRSLVTEDQIFTITLDEALKIYAEPKRRGRQSASAPPLRE LGTDPASGKPMVIKDGRFGPYVTDGETNASLRKGDDVASITDERAAELLADRRARGPA KRPARKAARKVPAKKAAKRD" misc_feature complement(4086220..4086264) /gene="topA" /locus_tag="Rv3646c" /note="PS00396 Prokaryotic DNA topoisomerase I active site." gene complement(4087610..4088188) /locus_tag="Rv3647c" /db_xref="GeneID:885217" CDS complement(4087610..4088188) /locus_tag="Rv3647c" /function="UNKNOWN" /note="Rv3647c, (MTCY15C10.05), len: 192 aa. Conserved hpothetical protein, equivalent to O69549|MLCB2548.32c|ML0199 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium leprae (200 aa), FASTA scores: opt: 1029, E(): 9e-58, (80.4% identity in 199 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218164.1" /db_xref="GI:15610783" /db_xref="GeneID:885217" /translation="MSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAE SWRASALAEMIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPRWL PGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPTLIGTRGTRPA LRISGRRRLSRLVENVGEPPDGAEAWVQWPRT" gene complement(4088328..4088531) /gene="cspA" /locus_tag="Rv3648c" /db_xref="GeneID:885837" CDS complement(4088328..4088531) /gene="cspA" /locus_tag="Rv3648c" /function="POSSIBLY INVOLVED IN COLD ACCLIMATION PROCESSES (THE PRODUCTION OF THE PROTEIN IS SUPPOSED PREDOMINANTLY INDUCED AT LOW TEMPERATURES)." /experiment="experimental evidence, no additional details recorded" /note="Rv3648c, (MTCY15C10.04), len: 67 aa. Probable cspA, cold shock protein A, identical to O69550|CSPB|CSPA|ML0198 SMALL COLD-SHOCK PROTEIN from Mycobacterium leprae (67 aa) FASTA scores: opt: 451, E(): 3.7e-27, (97.0% identity in 67 aa overlap). Also highly similar to many e.g. Q9KGW0|CSPA from Mycobacterium smegmatis (67 aa) FASTA scores: opt: 439, E(): 2.9e-26, (92.55% identity in 67 aa overlap); P54584|CSP_ARTGO from Arthrobacter globiformis (67 aa), FASTA scores: opt: 335, E(): 1.5e-18, (73.45% identity in 64 aa overlap); O30875|CSPA_MICLU from Micrococcus luteus (Micrococcus lysodeikticus); Q9Z5R4|CSPA_BORPE from Bordetella pertussis (67 aa) FASTA scores: opt: 294, E(): 1.7e-15, (59.7% identity in 67 aa overlap); etc. Contains 'cold-shock' DNA-binding domain signature (PS00352) at N-terminal end. BELONGS TO THE COLD-SHOCK DOMAIN (CSD) FAMILY." /codon_start=1 /transl_table=11 /product="cold shock protein A" /protein_id="NP_218165.1" /db_xref="GI:15610784" /db_xref="GeneID:885837" /translation="MPQGTVKWFNAEKGFGFIAPEDGSADVFVHYTEIQGTGFRTLEE NQKVEFEIGHSPKGPQATGVRSL" misc_feature complement(4088430..4088489) /gene="cspA" /locus_tag="Rv3648c" /note="PS00352 'Cold-shock' DNA-binding domain signature." gene 4088781..4091096 /locus_tag="Rv3649" /db_xref="GeneID:885841" CDS 4088781..4091096 /locus_tag="Rv3649" /EC_number="3.6.-.-" /function="POSSIBLY HAS HELICASE ACTIVITY." /note="Rv3649, (MTCY15C10.03c), len: 771 aa. Probable helicase (EC 3.6.-.-), similar to many (known or hypothetical) ATP-dependent helicases e.g. Q9X915|SCH5.13 PUTATIVE HELICASE from Streptomyces coelicolor (815 aa) FASTA scores: opt: 2550, E(): 9.6e-139, (52.45% identity in 774 aa overlap); Q05549|YDR291W|D9819.1 PROTEIN SIMILAR TO SEVERAL DNA HELICASES from Saccharomyces cerevisiae (Baker's yeast) (1077 aa), FASTA scores: opt: 1161, E(): 5.9e-59, (31.05% identity in 780 aa overlap); P50830|YPRA_BACSU HYPOTHETICAL HELICASE from Bacillus subtilis (749 aa), FASTA scores: opt: 1154, E(): 1.1e-58, (34.05% identity in 734 aa overlap); Q9KC10|BH1764 ATP-DEPENDENT RNA HELICASE from Bacillus halodurans (764 aa), FASTA scores: opt: 1122, E(): 8e-57, (32.3% identity in 759 aa overlap); etc. SEEMS SIMILAR TO DEAD/DEAH BOX HELICASE FAMILY, AND TO HELICASE C-TERMINAL DOMAIN." /codon_start=1 /transl_table=11 /product="helicase" /protein_id="NP_218166.1" /db_xref="GI:15610785" /db_xref="GeneID:885841" /translation="MASFGSHLLAAAVAGTPPGERPLRHVAELPPQAGRPRGWPEWAE PDVVDAFADRGISSPWSHQAEAAELAYAGRHVVIGTGPASGKSLAYQLLVLNALATDS RARALYLSPTKALGHDQLRAAHALAAAVPRLADVAPTAYDGDSPDEVRRFARERSRWL FSNPEMTHLSVLRNHARWAVLLRNLRFVIVDECHYYRGVFGSNVAMVLRRLLRLCARY SAHPTVIFASATTASPGATAADLIGQPVVEVTEDGSPRGARTVALWEPALRSDVIGEH GAPVRRSAGAEAARVMADLIVEGAQTLTFVRSRRAAELTALGARARLVDIAPELSDTV ASYRAGYLAEDRSALHQALAEGQLRGLATTNALELGVDIAGLDAVVLAGFPGTVASFW QQAGRSGRRGQGALVVLIARDDPLDTYLVHHPAALLDKPVERVVIDPVNPHLLGPQLL CAATELPLDDAEVRSWGAVEVAESLVDDGLLRRRNGRYFPAPGVKPHAAVDVRGAIGG QIVIVEAGTGRLLGSVGVGQAPAAAHPGAVYLHQGETYVVDSLDFQDGIAFVHAEDPG YATFAREVTDIAVTGTGERLVFGPVALGLVPVTVTNHVVGYLRRQLSGEVLDFVELDM PEHTLPTTAVMYTITSDALVRSGIEATRIPGSLHAAEHAAIGLLPLVASCDRGDIGGM STATGPEGLPSVFVYDGYPGGAGFAERGFRRARTWLGATAEAIEACECPSGCPSCVQS PKCGNGNDPLDKAGAVRVLRLVLAELSEESP" gene 4091233..4091517 /gene="PE33" /locus_tag="Rv3650" /db_xref="GeneID:885832" CDS 4091233..4091517 /gene="PE33" /locus_tag="Rv3650" /function="UNKNOWN" /note="Rv3650, (MTCY15C10.02c), len: 94 aa. Short protein, member of the Mycobacterium tuberculosis PE family (see citation below), but without the repetitive gly-rich region, similar to the N-terminal part of many e.g. O53809|Rv0746|MTV041.20 PGRS-FAMILY PROTEIN (783 aa), FASTA scores: opt: 363, E(): 2.1e-15, (76.55% identity in 81 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_178000.1" /db_xref="GI:57117138" /db_xref="GeneID:885832" /translation="MSFVIAAPEALDSAATDLVVLGSTLGAATAAAAAQTTGIVAAAH DEVSAAIAALFSAHGQAYQAASAQAAAFHTRFIRARSRHPQQETTCRRVR" gene 4091841..4092878 /locus_tag="Rv3651" /db_xref="GeneID:885296" CDS 4091841..4092878 /locus_tag="Rv3651" /function="UNKNOWN" /note="Rv3651, (MTCY15C10.01c), len: 345 aa. Hypothetical protein, with some similarity to Q9ZHK1 HYPOTHETICAL 36.5 KDA PROTEIN from Rhodococcus sp. X309 (329 aa) FASTA scores: opt: 332, E(): 3.4e-13, (27.4% identity in 321 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218168.1" /db_xref="GI:15610787" /db_xref="GeneID:885296" /translation="MTHDWLLVETLGDEPAVVARGRELKKLVPITTFLRRSPYLAAVR TAIAETLQTGQSLTSITPKHDRVIRTEPVIMTDGRMHGVQVWSGPTDAEPPDRPIPGP LKWDLTRGVATDTPESLTNSGKNPEVEITYGRAFAEDLPARELNPNETQVLAMAVKAK PGKTLCSIWDLTDWQGTPIRIGFVARSALEPGPNGRDHLVARAMNWRAETKAPAVPVD DLAQRILIGLAQAGVHRALVDLKTWTLLKWLDQPCSFYDWRRSAADGPRLHPDDQHVI DAMTRDLANGSASHVLRLPGHDVDWVPVHVTVNRIELEPDTFAGLVALRLPTDEELAD AGLPKATDVTT" gene 4093632..4093946 /gene="PE_PGRS60" /locus_tag="Rv3652" /db_xref="GeneID:886260" CDS 4093632..4093946 /gene="PE_PGRS60" /locus_tag="Rv3652" /function="UNKNOWN" /note="Rv3652, (MTV025.001A), len: 104 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), similar at N-terminal end with many e.g. P56877|Y278_MYCTU|Rv0278c|MTV035.06c (957 aa) FASTA scores: opt: 242, E(): 3e-09, (77.35% identity in 53 aa overlap). Originally annotated as the first part of a PE-PGRS family protein (Rv3653/PE_PGRS61 being the second part) but more similar to a PE family protein. Length extended since first submission (+50 aa)." /codon_start=1 /transl_table=11 /product="PE-PGRS family-related protein" /protein_id="YP_178001.1" /db_xref="GI:57117139" /db_xref="GeneID:886260" /translation="MSYVIAAPEALVAAATDLATLGSTIGAANAAAAGSTTALLTAGA DEVSAAIAAYSECTARPIRHSVRGRRRSMSGSCRPWPQVGAPMRPPRPPASRRCRARS IC" gene 4093940..4094527 /gene="PE_PGRS61" /locus_tag="Rv3653" /db_xref="GeneID:886259" CDS 4093940..4094527 /gene="PE_PGRS61" /locus_tag="Rv3653" /function="UNKNOWN" /note="Rv3653, (MTV025.001B), len: 195 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to the C-termini of members of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, e.g. MTCY1A11_25, MTCY28_25, MTCY130_10, MTCY1A10_19, MTCY21B4_13, MTCI418B_6,MTCY28_34, MTV004_1, MTCY441_4; etc. Originally annotated as the second part of a PE-PGRS family protein (Rv3652/PE_PGRS60 being the first part). Start shortened since first submission (-50 aa). TBparse score is 0.886." /codon_start=1 /transl_table=11 /product="PE-PGRS family-related protein" /protein_id="YP_178002.1" /db_xref="GI:57117140" /db_xref="GeneID:886259" /translation="MLNAPTQALLGRPLVGNGANGAPGTGANGGDGGILFGSGGAGGS GAAGMAGGNGGAAGLFGNGGAGGAGGSATAGAAGAGGNGGAGGLLFGTAGAGGNGGLS LGLGVAGGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGEDGTTPGGNGGAGGVAGLFGD GGNGGNAGVGTPAGNVGAGGTGGLLLGQDGMTGLT" gene complement(4094660..4094914) /locus_tag="Rv3654c" /db_xref="GeneID:885612" CDS complement(4094660..4094914) /locus_tag="Rv3654c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3654c, (MTV025.002c), len: 84 aa. Hypothetical protein, similar to C-terminus of Q9X916|SCH5.14c MEMBRANE SPANNING PROTEIN from Streptomyces coelicolor (230 aa) FASTA scores: opt: 176, E(): 2.4e-05, (47.0% identity in 83 aa overlap). Equivalent to AAK48118 from Mycobacterium tuberculosis strain CDC1551 but shorter 18 aa. TBparse score is 0.872." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218171.1" /db_xref="GI:15610790" /db_xref="GeneID:885612" /translation="MVARHRAQAAADLASLAAAARLPSGLAAACARATLVARAMRVEH AQCRVVDLDVVVTVEVAVAFAGVATATARAGPAKVPTTPG" gene complement(4094923..4095300) /locus_tag="Rv3655c" /db_xref="GeneID:885614" CDS complement(4094923..4095300) /locus_tag="Rv3655c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3655c, (MTV025.003c), len: 125 aa. Hypothetical protein, with similarity to Q9X917|SCH5.15c HYPOTHETICAL 15.2 KDA PROTEIN from Streptomyces coelicolor (150 aa) FASTA scores: opt: 211, E(): 7.7e-07, (39.65% identity in 111 aa overlap). Equivalent to AAK48119 from Mycobacterium tuberculosis strain CDC1551 (99 aa) but longer 26 aa at the C-terminus. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218172.1" /db_xref="GI:15610791" /db_xref="GeneID:885614" /translation="MEAALAIATLVLVLVLCLAGVTAVSMQVRCIDAAREAARLAARG DVRSATDVARSIAPRAALVQVHRDGEFVVATVTAHSNLLPTLDIAARAISVAEPGSTA ARPPCLPSRWSRCCCASPVRVHI" gene complement(4095324..4095530) /locus_tag="Rv3656c" /db_xref="GeneID:885624" CDS complement(4095324..4095530) /locus_tag="Rv3656c" /function="UNKNOWN" /note="Rv3656c, (MTV025.004c), len: 68 aa. Conserved hypothetical protein, similar to Q9X918|SCH5.16c SMALL HYPOTHETICAL PROTEIN from Streptomyces coelicolor (75 aa), FASTA scores: opt: 129, E(): 0.0039, (40.0% identity in 60 aa overlap). Equivalent to AAK48120 from Mycobacterium tuberculosis strain CDC1551 (42 aa) but longer 26 aa. TBparse score is 0.869." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218173.1" /db_xref="GI:15610792" /db_xref="GeneID:885624" /translation="MLVITMFRVLVARMTALAVDESGMSTVEYAIGTIAAAAFGAILY TVVTGDSIVSALNRIIGRALSTKV" gene complement(4095540..4096115) /locus_tag="Rv3657c" /db_xref="GeneID:885619" CDS complement(4095540..4096115) /locus_tag="Rv3657c" /function="UNKNOWN" /note="Rv3657c, (MTV025.005c), len: 191 aa. Possible conserved membrane protein, rich in ala residues, similar to Q9X919|SCH5.17c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (267 aa), FASTA scores: opt: 324, E(): 4.7e-12, (40.9% identity in 154 aa overlap). TBparse score is 0.893." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218174.1" /db_xref="GI:15610793" /db_xref="GeneID:885619" /translation="MALWLGAGPSVVRARAGRPPRAHRPHQGLLLGRTDVADPLAVAA SLDVLAVCLAAGMAVSTAAAATAAVAPPRLARVLRRAADLLALGADPNIAWSRPPDLP PGTHDAQTDAVLRLARRSAASGAALADGIVELAVQVRHDAAQAAAAAAERAGVLIAGP LGLCFLPAFLCVGIVPLVVGLAGDVLQFGLV" gene complement(4096139..4096939) /locus_tag="Rv3658c" /db_xref="GeneID:885618" CDS complement(4096139..4096939) /locus_tag="Rv3658c" /function="UNKNOWN" /note="Rv3658c, (MTV025.006c), len: 266 aa. Probable conserved transmembrane protein, similar to Q9X920|SCH5.18c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (321 aa), FASTA scores: opt: 335, E(): 4.1e-13, (38.05% identity in 247 aa overlap). TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218175.1" /db_xref="GI:15610794" /db_xref="GeneID:885618" /translation="MSGIASAALILSLALVVLPGSPRCRLTPDDTGRRVLLVGARRVA WGVGCVAVGVAALLPLPTVVAVAVLGATLGLRYRRRRRYLRRSREGQALEAALELVVG ELRAGAHPVRAFSIAADETGGPVAVALRAVAARARLGADVTAGLLAAARSSALPAYWE RLAVCWQLGSDHGLAIASLMRAAQRDVAERQRFSARVSAGMAGARASAAILAILPLLG VLLGQLIGARPLSFLLTGRVGGWLLVVGLTLACAGLLWSDRITDRPVL" gene complement(4096936..4097994) /locus_tag="Rv3659c" /db_xref="GeneID:885627" CDS complement(4096936..4097994) /locus_tag="Rv3659c" /function="UNKNOWN" /note="Rv3659c, (MTV025.007c), len: 352 aa. Conserved hypothetical protein, highly similar, but always shorter (various lengths) at N-terminus, to Q9X921|SCH5.19c PUTATIVE SECRETORY PROTEIN from Streptomyces coelicolor (523 aa), FASTA scores: opt: 1287, E(): 5.3e-66, (59.85% identity in 351 aa overlap); Q9HW98|PA4302 PROBABLE TYPE II SECRETION SYSTEM PROTEIN from Pseudomonas aeruginosa (421 aa), FASTA scores: opt: 776, E(): 5.4e-37, (42.8% identity in 320 aa overlap); AAK65510|CPAF2 PROBABLE CPAF2 PILUS ASSEMBLY PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (497 aa) FASTA scores: opt: 769, E(): 1.5e-36, (40.45% identity in 309 aa overlap); Q9KY93|SCK15.11 PUTATIVE SECRETORY PROTEIN from Streptomyces coelicolor (445 aa), FASTA scores: opt: 751, E(): 1.5e-35, (38.15% identity in 333 aa overlap); etc. Contains PS00017 ATP/GTP binding site motif A (P-loop). Note that previously known as trbB. TBparse score is 0.906.; trbB" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_178003.1" /db_xref="GI:57117141" /db_xref="GeneID:885627" /translation="MLGDTEVLANLRVLQTELTGAGILEPLLSADGTTDVLVTAPDSV WVDDGNGLRRSQIRFADESAVRRLAQRLALAAGRRLDDAQPWVDGQLTGIGVGGFAVR LHAVLPPVATQGTCLSLRVLRPATQDLAALAAAGAIDPAAAALVADIVTARLAFLVCG GTGAGKTTLLAAMLGAVSPDERIVCVEDAAELAPRHPHLVKLVARRANVEGIGEVTVR QLVRQALRMRPDRIVVGEVRGAEVVDLLAALNTGHEGGAGTVHANNPGEVPARMEALG ALGGLDRAALHSQLAAAVQVLLHVARDRAGRRRLAEIAVLRQAEGRVQAVTVWHADRG MSDDAAALHDLLRSRASA" misc_feature complement(4097494..4097517) /locus_tag="Rv3659c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene complement(4098096..4099148) /locus_tag="Rv3660c" /db_xref="GeneID:885319" CDS complement(4098096..4099148) /locus_tag="Rv3660c" /function="POSSIBLY PLAYS A REGULATORY ROLE IN CELULAR DIFFERENTIATION." /note="Rv3660c, (MTV025.008c), len: 350 aa. Conserved hypothetical protein, similar to O33612 PROTEIN CONCERNED IN INHIBITION OF MORPHOLOGICAL DIFFERENTIATION IN Streptomyces azureus from Streptomyces cyaneus (Streptomyces curacoi) (370 aa), FASTA scores: opt: 655, E(): 5.9e-31, (42.2% identity in 315 aa overlap); Q9X922|SCH5.20c PUTATIVE SEPTUM SITE DETERMINING PROTEIN from Streptomyces coelicolor (396 aa), FASTA scores: opt: 592, E(): 2.9e-27, (43.25% identity in 275 aa overlap). And shows some similarity to AAK65513|CPAE2 PROBABLE CPAE2 PILUS ASSEMBLY PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (586 aa) FASTA scores: opt: 212, E(): 5.1e-05, (25.75% identity in 295 aa overlap); and several cell division inhibitors or septum site-determining proteins. Equivalent to AAK48124 from Mycobacterium tuberculosis strain CDC1551 (261 aa) but longer 89 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218177.1" /db_xref="GI:15610796" /db_xref="GeneID:885319" /translation="MLTDPGLRDELDRVAAAVGVRVVHLGGRHPVSRKTWSAAAAVVL DHAAADRCGRLALPRRTHVSVLTGTEAATATWAAAITVGAQHVLRMPEQEGELVRELA EAAESARDDGICGAVVAVIGGRGGAGASLFAVALAQAAADALLVDLDPWAGGIDLLVG GETAPGLRWPDLALQGGRLNWSAVRAALPRPRGISVLSGTRRGYELDAGPVDAVIDAG RRGGVTVVCDLPRRLTDATQAALDAADLVVLVSPCDVRACAAAATMAPVLTAINPNLG LVVRGPSPGGLRAAEVADVAGVPLLASMRAQPRLAEQLEHGGLRLRRRSVLASAARRV LGVLPRAGSGRHGRAA" gene 4099647..4100510 /locus_tag="Rv3661" /db_xref="GeneID:885316" CDS 4099647..4100510 /locus_tag="Rv3661" /function="POSSIBLY PLAYS A REGULATORY ROLE IN CELULAR DIFFERENTIATION." /note="Rv3661, (MTV025.009), len: 287 aa. Conserved hypothetical protein, highly similar to O33611|IMD_STRCN from Streptomyces cyaneus (Streptomyces curacoi) protein involved in inhibition of morphological differentiation in Streptomyces azureus (BELONGS TO THE SERB FAMILY) (277 aa) FASTA scores: opt: 1073, E(): 3.5e-61, (61.45% identity in 262 aa overlap); and Q9X923|SCH5.21 PUTATIVE MORPHOLOGICAL DIFFERENTIATION-ASSOCIATED PROTEIN from Streptomyces coelicolor (268 aa), FASTA scores: opt: 1057, E(): 3.6e-60, (61.45% identity in 262 aa overlap). Also similar to various bacterial proteins (principally serB-related proteins) e.g. Q49823|ML2424 HYPOTHETICAL SERB PROTEIN from Mycobacterium leprae (300 aa), FASTA scores: opt: 452, E(): 1.4e-21, (35.8% identity in 257 aa overlap); Q9WX12|SCE68.20 HYPOTHETICAL 32.0 KDA PROTEIN from Streptomyces coelicolor (298 aa), FASTA scores: opt: 415, E(): 3.1e-19, (33.55% identity in 280 aa overlap); Q9RIT2|SERB PHOSPHOSERINE PHOSPHATASE (FRAGMENT) from Streptomyces coelicolor (266 aa), FASTA scores: opt: 405, E(): 1.2e-18, (34.1% identity in 261 aa overlap); etc. Also similar to Q11169|Y505_MYCTU|Rv0505c|MTCY20G9.32c HYPOTHETICAL 39.5 KDA PROTEIN from Mycobacterium tuberculosis (373 aa), FASTA scores: opt: 454, E(): 1.2e-21, (35.15% identity in 276 aa overlap). BELONGS TO THE SERB FAMILY." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218178.1" /db_xref="GI:15610797" /db_xref="GeneID:885316" /translation="MTVSDSPAQRQTPPQTPGGTAPRARTAAFFDLDKTIIAKSSTLA FSKPFFAQGLLNRRAVLKSSYAQFIFLLSGADHDQMDRMRTHLTNMCAGWDVAQVRSI VNETLHDIVTPLVFAEAADLIAAHKLCGRDVVVVSASGEEIVGPIARALGATHAMATR MIVEDGKYTGEVAFYCYGEGKAQAIRELAASEGYPLEHCYAYSDSITDLPMLEAVGHA SVVNPDRGLRKEASVRGWPVLSFSRPVSLRDRIPAPSAAAIATTAAVGISALAAGAVT YALLRRFAFQP" gene complement(4101265..4102035) /locus_tag="Rv3662c" /db_xref="GeneID:885178" CDS complement(4101265..4102035) /locus_tag="Rv3662c" /function="UNKNOWN" /note="Rv3662c, (MTV025.010c), len: 256 aa. Conserved hypothetical protein, equivalent to Q9CB99|ML2289 HYPOTHETICAL PROTEIN from Mycobacterium leprae (256 aa) FASTA scores: opt: 1255, E(): 3.3e-69, (78.05% identity in 255 aa overlap). Also similar to Q9X924|SCH5.22c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (274 aa), FASTA scores: opt: 289, E(): 1.8e-10, (39.25% identity in 270 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218179.1" /db_xref="GI:15610798" /db_xref="GeneID:885178" /translation="MTVDPLAPLMELPGVAAASDRVRDALSRVHRHRANLRGWPVAAA EASLRAARASSVLDGGPARLHDAGAPTSGKPALSDPVFAGALRVGQALEGGAGPVVGV WRRAPLQALARLHMLAAADQVDDDRLGRPRSDADVGPRLELLADVVTHPTLASAPVVA AVAHGELLTLRPFGCADGVVARAVSRLVTIATGLDPHGLGVPEVIWMRQPAEYHDAAR RFAGGTPDGVAGWLLLCCGAMLDGAREALSIAESLSPG" gene complement(4102032..4103678) /gene="dppD" /locus_tag="Rv3663c" /db_xref="GeneID:885039" CDS complement(4102032..4103678) /gene="dppD" /locus_tag="Rv3663c" /function="INVOLVED IN ACTIVE TRANSPORT OF DIPEPTIDE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv3663c, (MTV025.011c), len: 548 aa. Probable dppD, dipeptide-transport ATP-binding protein ABC-transporter (see citation below), similar to many ATP-binding proteins e.g. AAK65441|SMA1434 PROBABLE ABC TRANSPORTER ATP-BINDING PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymA (550 aa), FASTA scores: opt: 1528, E(): 1e-78, (46.25% identity in 545 aa overlap); O50270|MOAD MOAD PROTEIN from Agrobacterium radiobacter (588 aa), FASTA scores: opt: 1354, E(): 6.7e-69, (42.9% identity in 541 aa overlap); Q9KM01|VCA0588 PUTATIVE PEPTIDE ABC TRANSPORTER ATP-BINDING PROTEIN from Vibrio cholerae (530 aa), FASTA scores: opt: 951, E(): 3.1e-46, (44.0% identity in 534 aa overlap); BAB49448|MLR2279 ATP-BINDING PROTEIN OF PEPTIDE ABC TRANSPORTER from Rhizobium loti (Mesorhizobium loti) (604 aa), FASTA scores: opt: 949, E(): 4.4e-46, (41.55% identity in 544 aa overlap); etc. Contains 2 PS00211 ABC transporters family signature, and 2 PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="peptide ABC transporter ATP-binding protein" /protein_id="NP_218180.1" /db_xref="GI:15610799" /db_xref="GeneID:885039" /translation="MSVPAAPLLSVEGLEVTFGTDAPAVCGVDLAVRSGQTVAVVGES GSGKSTTAAAILGLLPAGGRITAGRVVFDGRDITGADAKRLRSIRGREIGYVPQDPMT NLNPVWKVGFQVTEALRANTDGRAARRRAVELLAEAGLPDPAKQAGRYPHQLSGGMCQ RALIAIGLAGRPRLLIADEPTSALDVTVQRQVLDHLQGLTDELGTALLLITHDLALAA QRAEAVVVVRRGVVVESGAAQSILQSPQHEYTRRLVAAAPSLTARSRRPPESRSRATT QAGDILVVSELTKIYRESRGAPWRRVESRAVDGVSFRLPRASTLAIVGESGSGKSTLA RMVLGLLQPTSGTVVFDGTYDVGALARDQVLAFRRRVQPVFQNPYSSLDPMYSVFRAI EEPLRVHHVGDRRQRQRAVRELVDQVALPSSILGRRPRELSGGQRQRVAIARALALRP EVLVCDEAVSALDVLVQAQILDLLADLQADLGLTYLFISHDLAVIRQIADDVLVMRAG RVVEHASTEEVFSRPRHEYTRQLLQAIPGAPSAPRKVGNL" misc_feature complement(4102341..4102385) /gene="dppD" /locus_tag="Rv3663c" /note="PS00211 ABC transporters family signature." misc_feature complement(4102686..4102709) /gene="dppD" /locus_tag="Rv3663c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." misc_feature complement(4103175..4103219) /gene="dppD" /locus_tag="Rv3663c" /note="PS00211 ABC transporters family signature." misc_feature complement(4103532..4103555) /gene="dppD" /locus_tag="Rv3663c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene complement(4103675..4104475) /gene="dppC" /locus_tag="Rv3664c" /db_xref="GeneID:885483" CDS complement(4103675..4104475) /gene="dppC" /locus_tag="Rv3664c" /function="INVOLVED IN ACTIVE TRANSPORT OF DIPEPTIDE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3664c, (MTV025.012c), len: 266 aa. Probable dppC, dipeptide-transport integral membrane protein ABC-transporter (see citation below), similar to many peptide permeases e.g. Q9F351|SC9E12.04 PUTATIVE PEPTIDE TRANSPORT SYSTEM INTEGRAL MEMBRANE from Streptomyces coelicolor (305 aa), FASTA scores: opt: 901, E(): 1.1e-47, (51.15% identity in 262 aa overlap); Q9KFX1|APPC|BH0349 OLIGOPEPTIDE ABC TRANSPORTER (PERMEASE) from Bacillus halodurans (305 aa), FASTA scores: opt: 652, E(): 1.5e-32, (35.55% identity in 270 aa overlap); P94312|DPPC_BACFI DIPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus firmus (304 aa), FASTA scores: opt: 642, E(): 5.9e-32, (35.75% identity in 263 aa overlap); P24139|OPPC_BACSU|SPO0KC OLIGOPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (305 aa), FASTA scores: opt: 637, E(): 1.2e-31, (37.4% identity in 262 aa overlap); P26904|DPPC_BACSU|DCIAC DIPEPTIDE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (320 aa), FASTA scores: opt: 621, E(): 1.2e-30, (39.9% identity in 263 aa overlap); etc. HAS SIMILARITY WITH INTEGRAL MEMBRANE COMPONENTS OF OTHER BINDING-PROTEIN-DEPENDENT TRANSPORT SYSTEMS. BELONGS TO THE OPPBC SUBFAMILY." /codon_start=1 /transl_table=11 /product="peptide ABC transporter transmembrane protein" /protein_id="NP_218181.1" /db_xref="GI:15610800" /db_xref="GeneID:885483" /translation="MIAAALILLILVVAAFPSLFTAADPTYADPSQSMLAPSAAHWFG TDLQGHDIYSRTVYGARASVTVGLGATLAVFVVGGALGALAGFYGSWIDAVVSRVTDV FLGLPLLLAAIVLMQVMHHRTVWTVIAILALFGWPQVARIARGAVLEVRASDYVLAAK ALGLNRFQILLRHALPNAVGPVIAVATVALGIFIVTEATLSYLGVGLPTSVVSWGGDI NVAQTRLRSGSPILFYPAGALAITVLAFMMMGDALRDALDPASRAWRA" gene complement(4104531..4105457) /gene="dppB" /locus_tag="Rv3665c" /db_xref="GeneID:885474" CDS complement(4104531..4105457) /gene="dppB" /locus_tag="Rv3665c" /function="INVOLVED IN ACTIVE TRANSPORT OF DIPEPTIDE ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3665c, (MTV025.013c), len: 308 aa. Probable dppB, dipeptide-transport integral membrane protein ABC-transporter (see citation below), similar to many peptide permeases e.g. Q9F352|SC9E12.03 PUTATIVE PEPTIDE TRANSPORT SYSTEM INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (307 aa), FASTA scores: opt: 1145, E(): 1.8e-61, (57.65% identity in 307 aa overlap); Q53191|Y4TP_RHISN PROBABLE PEPTIDE ABC TRANSPORTER PERMEASE PROTEIN Rhizobium sp. strain NGR234 (313 aa), FASTA scores: opt: 653, E(): 5.2e-32, (31.2% identity in 314 aa overlap); P24138|OPPB_BACSU OLIGOPEPTIDE TRANSPORT SYSTEM PERMEASE from Bacillus subtilis (311 aa), FASTA scores: opt: 643, E(): 2.1e-31, (33.45% identity in 305 aa overlap); etc. BELONGS TO THE OPPBC SUBFAMILY." /codon_start=1 /transl_table=11 /product="peptide ABC transporter transmembrane protein" /protein_id="NP_218182.1" /db_xref="GI:15610801" /db_xref="GeneID:885474" /translation="MGWYVARRVAVMVPVFLGATLLIYGMVFLLPGDPVAALAGDRPL TPAVAAQLRSHYHLDDPFLVQYLRYLGGILHGDLGRAYSGLPVSAVLAHAFPVTIRLA LIALAVEAVLGIGFGVIAGLRQGGIFDSAVLVTGLVIIAIPIFVLGFLAQFLFGVQLE IAPVTVGERASVGRLLLPGIVLGAMSFAYVVRLTRSAVAANAHADYVRTATAKGLSRP RVVTVHILRNSLIPVVTFLGADLGALMGGAIVTEGIFNIHGVGGVLYQAVTRQETPTV VSIVTVLVLIYLITNLLVDLLYAALDPRIRYG" gene complement(4105459..4107084) /gene="dppA" /locus_tag="Rv3666c" /db_xref="GeneID:885315" CDS complement(4105459..4107084) /gene="dppA" /locus_tag="Rv3666c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF DIPEPTIDE ACROSS THE MEMBRANE (IMPORT)." /note="Rv3666c, (MTV025.014c), len: 541 aa. Probable dppA, dipeptide-binding lipoprotein component of dipeptide transport system (see citation below), similar to many substrate-binding proteins e.g. Q9F353|SC9E12.02 PUTATIVE PEPTIDE TRANSPORT SYSTEM SECRETED PEPTIDE-BINDING PROTEIN from Streptomyces coelicolor (544 aa), FASTA scores: opt: 1200, E(): 9e-67, (39.2% identity in 538 aa overlap); P24141|OPPA_BACSU OLIGOPEPTIDE-BINDING PROTEIN from Bacillus subtilis (545 aa), FASTA scores: opt: 523, E(): 7.9e-25, (26.15% identity in 516 aa overlap); P23843|OPPA_ECOLI PERIPLASMIC OLIGOPEPTIDE-BINDING PROTEIN from Escherichia coli (543 aa), FASTA scores: opt: 452, E(): 2e-20, (25.9% identity in 529 aa overlap); etc. Contains probable N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="periplasmic dipeptide-binding lipoprotein DppA" /protein_id="NP_218183.1" /db_xref="GI:15610802" /db_xref="GeneID:885315" /translation="MVRQMRAALAALATGLLVLAPVAGCGGGVLSPDVVLVNGGEPPN PLIPTGTNDSNGGRIIDRLFAGLMSYDAVGKPSLEVAQSIESADNVNYRITVKPGWKF TDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGFDDVAGAPGDKSRTTMSGLRVV NDLEFTVRLKAPTIDFTLRLGHSSFYPLPDSAFRDMAAFGRNPIGNGPYKLADGPAGP AWEHNVRIDLVPNPDYHGNRKPRNKGLRFEFYANLDTAYADLLSGNLDVLDTIPPSAL TVYQRDLGDHATSGPAAINQTLDTPLRLPHFGGEEGRLRRLALSAAINRPQICQQIFA GTRSPARDFTARSLPGFDPNLPGNEVLDYDPQRARRLWAQADAISPWSGRYAIAYNAD AGHRDWVDAVANSIKNVLGIDAVAAPQPTFAGFRTQITNRAIDSAFRAGWRGDYPSMI EFLAPLFTAGAGSNDVGYINPEFDAALAAAEAAPTLTESHELVNDAQRILFHDMPVVP LWDYISVVGWSSQVSNVTVTWNGLPDYENIVKA" gene 4107792..4109747 /gene="acs" /locus_tag="Rv3667" /db_xref="GeneID:885479" CDS 4107792..4109747 /gene="acs" /locus_tag="Rv3667" /EC_number="6.2.1.1" /function="ACTIVATES ACETATE TO ACETYL-COENZYME A [CATALYTIC ACTIVITY: ATP + ACETATE + CoA = AMP + PYROPHOSPHATE + ACETYL-COA]." /note="Acs; catalyzes the conversion of acetate and CoA to acetyl-CoA" /codon_start=1 /transl_table=11 /product="acetyl-CoA synthetase" /protein_id="NP_218184.1" /db_xref="GI:15610803" /db_xref="GeneID:885479" /translation="MSESTPEVSSSYPPPAHFAEHANARAELYREAEEDRLAFWAKQA NRLSWTTPFTEVLDWSGAPFAKWFVGGELNVAYNCVDRHVEAGHGDRVAIHWEGEPVG DRRTLTYSDLLAEVSKAANALTDLGLVAGDRVAIYLPLIPEAVIAMLACARLGIMHSV VFGGFTAAALQARIVDAQAKLLITADGQFRRGKPSPLKAAADEALAAIPDCSVEHVLV VRRTGIEMAWSEGRDLWWHHVVGSASPAHTPEPFDSEHPLFLLYTSGTTGKPKGIMHT SGGYLTQCCYTMRTIFDVKPDSDVFWCTADIGWVTGHTYGVYGPLCNGVTEVLYEGTP DTPDRHRHFQIIEKYGVTIYYTAPTLIRMFMKWGREIPDSHDLSSLRLLGSVGEPINP EAWRWYRDVIGGGRTPLVDTWWQTETGSAMISPLPGIAAAKPGSAMTPLPGISAKIVD DHGDPLPPHTEGAQHVTGYLVLDQPWPSMLRGIWGDPARYWHSYWSKFSDKGYYFAGD GARIDPDGAIWVLGRIDDVMNVSGHRISTAEVESALVAHSGVAEAAVVGVTDETTTQA ICAFVVLRANYAPHDRTAEELRTEVARVISPIARPRDVHVVPELPKTRSGKIMRRLLR DVAENRELGDTSTLLDPTVFDAIRAAK" misc_feature 4108569..4108604 /gene="acs" /locus_tag="Rv3667" /note="PS00455 Putative AMP-binding domain signature." gene complement(4109783..4110481) /locus_tag="Rv3668c" /db_xref="GeneID:885609" CDS complement(4109783..4110481) /locus_tag="Rv3668c" /EC_number="3.4.-.-" /function="UNKNOWN; HYDROLYSES PEPTIDES AND/OR PROTEINS." /note="Rv3668c, (MTV025.016c), len: 232 aa. Possible protease (EC 3.4.-.-) (and more specifically a putative alkaline serine protease (EC 3.4.21.-), equivalent to Q9CB98|ML2295 HYPOTHETICAL PROTEIN from Mycobacterium leprae (234 aa), FASTA scores: opt: 1249, E(): 7.4e-66, (77.5% identity in 231 aa overlap). Also similar at C-terminal end with many proteases e.g. O86984 ALKALINE SERINE PROTEASE PRECURSOR from Thermomonospora fusca (368 aa), FASTA scores: opt: 190, E(): 0.00056, (28.9% identity in 173 aa overlap); Q55353|SAPII ALKALINE SERINE PROTEASE II from Streptomyces sp (382 aa), FASTA scores: opt: 160, E(): 0.032, (27.15% identity in 199 aa overlap); O54109|SC10A5.18 PUTATIVE SECRETED PROTEASE from Streptomyces coelicolor (411 aa), FASTA scores: opt: 155, E(): 0.066, (26.4% identity in 163 aa overlap); Q54392|SAL|SCI11.35C SERINE PROTEASE SAL PRECURSOR (300 aa), FASTA scores: opt: 153, E(): 0.068, (28.1% identity in 185 aa overlap); P00778|PRLA_LYSEN|ALPHA-LP ALPHA-LYTIC PROTEASE PRECURSOR (397 aa), FASTA scores: opt: 154, E(): 0.074, (26.75% identity in 172 aa overlap); etc. Also similar with Q50618|YI15_MYCTU|Rv1815|MT1863|MTCY1A11.28c HYPOTHETICAL 22.8 KDA PROTEIN from Mycobacterium tuberculosis (221 aa), FASTA scores: opt: 134, E(): 0.69, (30.95% identity in 181 aa overlap)." /codon_start=1 /transl_table=11 /product="protease" /protein_id="NP_218185.1" /db_xref="GI:15610804" /db_xref="GeneID:885609" /translation="MQTAHRRFAAAFAAVLLAVVCLPANTAAADDKLPLGGGAGIVVN GDTMCTLTTIGHDKNGDLIGFTSAHCGGPGAQIAAEGAENAGPVGIMVAGNDGLDYAV IKFDPAKVTPVAVFNGFAINGIGPDPSFGQIACKQGRTTGNSCGVTWGPGESPGTLVM QVCGGPGDSGAPVTVDNLLVGMIHGAFSDNLPSCITKYIPLHTPAVVMSINADLADIN AKNRPGAGFVPVPA" gene 4110827..4111345 /locus_tag="Rv3669" /db_xref="GeneID:885626" CDS 4110827..4111345 /locus_tag="Rv3669" /function="UNKNOWN" /note="Rv3669, (MTV025.017), len: 172 aa. Probable conserved transmembrane protein, equivalent to Q9CB97|ML2296 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (181 aa), FASTA scores: opt: 863, E(): 1.4e-47, (77.35% identity in 181 aa overlap). Also similar to two PUTATIVE INTEGRAL MEMBRANE TRANSPORT PROTEINS from Streptomyces coelicolor; Q9X930|SCH5.28 (162 aa) FASTA scores: opt: 265, E(): 6.3e-10, (37.4% identity in 155 aa overlap); and Q9X9W1|SCI7.29c (165 aa), FASTA scores: opt: 194, E(): 1.9e-05, (30.6% identity in 134 aa overlap). Contains two hydrophobic stretches in centre. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218186.1" /db_xref="GI:15610805" /db_xref="GeneID:885626" /translation="MSKIDRKNGVPSTLTTIPLADPHAGPAEPSIGDLIKDATTQMST LVRAEVELARAEITRDVKKGLTGSVFFISSLVVGFYSTFFFFFFVAELLDTWIWRWVA FLLVFAIMVVVTAVLALLGFLKVRRIRGPRQTIASVKETRTALTPGHDKTPVTPKPVT SDRATPVDPSGW" gene 4111346..4112329 /gene="ephE" /locus_tag="Rv3670" /db_xref="GeneID:885577" CDS 4111346..4112329 /gene="ephE" /locus_tag="Rv3670" /function="THOUGHT TO BE INVOLVED IN DETOXIFICATION REACTIONS FOLLOWING OXIDATIVE DAMAGE TO LIPIDS. BIOTRANSFORMATION ENZYME THAT CATALYZES THE HYDROLYSIS OF EPOXIDES: AROMATIC HYDROCARBONS CATABOLISM [CATALYTIC ACTIVITY: AN EPOXIDE + H(2)O = A GLYCOL]." /note="Rv3670, (MTV025.018), len: 327 aa. Possible ephE, epoxide hydrolase (EC 3.3.2.3) (see citation below), equivalent to Q9CB96|ML2297 PUTATIVE HYDROLASE from Mycobacterium leprae (324 aa), FASTA scores: opt: 1799, E(): 7.2e-105, (80.55% identity in 324 aa overlap). Also similar to many hydrolases (epoxide hydrolases) and hypothetical proteins e.g. Q9X931|SCH5.29 PUTATIVE HYDROLASE from Streptomyces coelicolor (324 aa), FASTA scores: opt: 687, E(): 1.4e-35, (40.65% identity in 327 aa overlap); Q9RRE3|DR2549 EPOXIDE HYDROLASE-RELATED PROTEIN from Deinococcus radiodurans (278 aa), FASTA scores: opt: 321, E(): 8.2e-13, (32.15% identity in 311 aa overlap); Q9K3Q1|2SCG4.13 PUTATIVE HYDROLASE from Streptomyces coelicolor (292 aa), FASTA scores: opt: 295, E(): 3.5e-11, (30.18% identity in 275 aa overlap); Q9S7P1 EPOXIDE HYDROLASE from Oryza sativa (Rice) (322 aa), FASTA scores: opt: 289, E(): 9.1e-11, (28.7% identity in 338 aa overlap); O23227|C7A10.830|AT4G36530 EPOXIDE HYDROLASE from Arabidopsis thaliana (Mouse-ear cress) (378 aa) FASTA scores: opt: 287, E(): 1.4e-10, (26.1% identity in 272 aa overlap); Q21147|K02F3.6 EPOXIDE HYDROLASE from Caenorhabditis elegans (386 aa), FASTA scores: opt: 283, E(): 2.5e-10, (33.35% identity in 156 aa overlap); etc. Also similar to P95276|EPHB|Rv1938|MTCY09F9.26c from Mycobacterium tuberculosis (356 aa), FASTA scores: opt: 296, E(): 3.6e-11, (29.7% identity in 340 aa overlap). Contains PS00213 Lipocalin signature. SIMILAR TO ALPHA/BETA HYDROLASE FOLD." /codon_start=1 /transl_table=11 /product="epoxide hydrolase EphE" /protein_id="NP_218187.1" /db_xref="GI:15610806" /db_xref="GeneID:885577" /translation="MAAPDPSMTRIAGPWRHLDVHANGIRFHVVEAVPSGQPEGPDAA TPPMQPALARPLVILLHGFGSFWWSWRHQLCGLTGARVVAVDLRGYGGSDKPPRGYDG WTLAGDTAGLIRALGHPSATLVGHADGGLACWTTALLHSRLVRAIALISSPHPAALRR STLTRRDQRHALLPTLLRYQLPIWPERLLTRNNAAEIERLVRARGCAKWLASEDFSQA IDHLRQAIQIPAAAHCALEYQRWAVRSQLRSEGRRFIRAMTQQLGMPLLHLRGDADPY VLADPVERTQRYAPHGRYISIAGAGHFSHEEAPEEVNRHLMRFLEQVHQLS" misc_feature 4111358..4111399 /gene="ephE" /locus_tag="Rv3670" /note="PS00213 Lipocalin signature." gene complement(4112322..4113515) /locus_tag="Rv3671c" /db_xref="GeneID:885176" CDS complement(4112322..4113515) /locus_tag="Rv3671c" /EC_number="3.4.21.-" /function="UNKNOWN; HYDROLYSES OF PEPTIDES AND/OR PROTEINS (POSSIBLY CLEAVED PREFERENTIALLY AFTER SERINE RESIDUE)." /note="Rv3671c, (MTV025.019c), len: 397 aa. Possible serine protease membrane protein (EC 3.4.21.-), equivalent to Q9CB95|ML2298 PUTATIVE MEMBRANE-ASSOCIATED SERINE PROTEASE from Mycobacterium leprae (401 aa), FASTA scores: opt: 2061, E(): 2.3e-108, (80.9% identity in 398 aa overlap). Also similar to many serine proteases, but generally with extended N-terminus, e.g. Q9X932|SCH5.30c PUTATIVE SERINE PROTEASE (FRAGMENT) from Streptomyces coelicolor (385 aa), FASTA scores: opt: 835, E(): 1.2e-39, (39.9% identity in 386 aa overlap); Q9Z6T0|DEGP_CHLPN|HTRA|CPN0979|CP0877 PROBABLE SERINE PROTEASE DO-LIKE PRECURSOR from Chlamydia pneumoniae (Chlamydophila pneumoniae) (488 aa), FASTA scores: opt: 285, E(): 1e-08, (29.05% identity in 296 aa overlap); P73354|HTRA|SLR1204 SERINE PROTEASE from Synechocystis sp. strain PCC 6803 (452 aa), FASTA scores: opt: 284, E(): 1.1e-08, (29.55% identity in 308 aa overlap); Q9RWC4|DR0745 PERIPLASMIC SERINE PROTEASE, HTRA/DEGQ/DEGS FAMILY from Deinococcus radiodurans (366 aa), FASTA scores: opt: 271, E(): 4.9e-08, (35.45% identity in 206 aa overlap); etc. Also similar, but longer 114 aa at the N-terminus, to Q9S2P8|SC5F7.13 PUTATIVE PEPTIDASE from Streptomyces coelicolor (282 aa), FASTA scores: opt: 594, E(): 3.1e-26, (38.95% identity in 285 aa overlap). And similar, but longer 146 aa at the N-terminus, to O07175|PEPA|Rv0125|MTCI418B.07 from Mycobacterium tuberculosis (355 aa), FASTA scores: opt: 295, E(): 2.2e-09, (29.55% identity in 254 aa overlap); and Q9CCY9|ML2659 PROBABLE SECRETED SERINE PROTEASE from Mycobacterium leprae FASTA scores: opt: 286, E(): 6.9e-09, (30.6% identity in 255 aa overlap). Contains PS00135 Serine proteases, trypsin family, serine active site." /codon_start=1 /transl_table=11 /product="membrane-associated serine protease" /protein_id="NP_218188.1" /db_xref="GI:15610807" /db_xref="GeneID:885176" /translation="MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAG VLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAGVVLGRAVRGAIRNRPIRLIDSV IGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRL SALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVL EGTGFVISPDRVMTNAHVVAGSNNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPP PLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDIYGDPEPVTRDVYTI RADVEQGDSGGPLIDLNGQVLGVVFGAAIDDAETGFVLTAGEVAGQLAKIGATQPVGT GACVS" misc_feature complement(4112472..4112507) /locus_tag="Rv3671c" /note="PS00135 Serine proteases, trypsin family, serine active site." gene complement(4113521..4114342) /locus_tag="Rv3672c" /db_xref="GeneID:885463" CDS complement(4113521..4114342) /locus_tag="Rv3672c" /function="UNKNOWN" /note="Rv3672c, (MTV025.020c), len: 273 aa. Conserved hypothetical protein, equivalent to Q9CB94|ML2299 HYPOTHETICAL PROTEIN from Mycobacterium leprae (266 aa) FASTA scores: opt: 1358, E(): 5.2e-75, (76.4% identity in 267 aa overlap). Also similar to others (generally in C-terminal end) e.g. Q9XA45|SCH17.02c HYPOTHETICAL 26.5 KDA PROTEIN from Streptomyces coelicolor (247 aa) FASTA scores: opt: 524, E(): 1.3e-24, (42.65% identity in 251 aa overlap); Q9AB27|CC0407 MUTT/NUDIX FAMILY PROTEIN from Caulobacter crescentus (216 aa), FASTA scores: opt: 285, E(): 3.2e-10, (36.2% identity in 174 aa overlap); BAB49788|MLL2727|Q98HS8 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (204 aa), FASTA scores: opt: 278, E(): 8.1e-10, (31.45% identity in 151 aa overlap); P43337|YEAB_ECOLI|B1813 HYPOTHETICAL 21.4 KDA PROTEIN from Escherichia coli strain K12 (192 aa) FASTA scores: opt: 252, E(): 2.9e-08, (35.9% identity in 170 aa overlap); etc. Contains PS01293 Uncharacterized protein family UPF0036 signature, LLT." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218189.1" /db_xref="GI:15610808" /db_xref="GeneID:885463" /translation="MSAGGTPLQAGATPTGSRGTVALRPDAGPSWLRPLVDNVGQIPD AYRRRLPADVLAMVTAAGAVSAMTSSRRDHREAAVLVLFSGPEAGPGDGGVPDDADLL LTVRASTLRHHAGQAAFPGGVVDPADDGPVATALREANEETGIDPSRLHPLATMERTF IAPSRFHVVPVLAYSPDPGPVAVVNEAETAIVARVPVRAFINPANRLMVYRRPHTRRW AGPAFLLNQMLVWGFTGQVISAVLDVAGWAQPWDTGDIRELDAAMVLIDDESDPR" misc_feature complement(4113977..4114039) /locus_tag="Rv3672c" /note="PS01293 Uncharacterized protein family UPF0036 signature, LLT." gene complement(4114474..4115157) /locus_tag="Rv3673c" /db_xref="GeneID:885190" CDS complement(4114474..4115157) /locus_tag="Rv3673c" /function="FUNCTION NOT KNOW: POSSIBLY ACTS ON THIOREDOXIN." /note="Rv3673c, (MTV025.021c), len: 227 aa. Possible membrane protein, thioredoxin-like protein (thiol-disulfide interchange protein) (EC 1.-.-.-), equivalent to Q9CB93|ML2300 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (215 aa), FASTA scores: opt: 978, E(): 2.5e-52, (71.15% identity in 215 aa overlap). Some similarity with thioredoxin-related proteins e.g. P35160|RESA_BACSU RESA PROTEIN from Bacillus subtilis (181 aa), FASTA scores: opt: 212, E(): 5.7e-06, (30.55% identity in 108 aa overlap); Q9RXW6|DR0189 THIOL:DISULFIDE INTERCHANGE PROTEIN from Deinococcus radiodurans (185 aa) FASTA scores: opt: 206, E(): 1.3e-05, (33.8% identity in 139 aa overlap); Q9I505|PA0953 PROBABLE THIOREDOXIN from Pseudomonas aeruginosa (154 aa), FASTA scores: opt: 180, E(): 0.00044, (34.85% identity in 109 aa overlap); Q9KCP7|BH1522 THIOREDOXIN (THIOL:DISULFIDE INTERCHANGE PROTEIN) from Bacillus halodurans (177 aa), FASTA scores: opt: 178, E(): 0.00064, (31.75% identity in 107 aa overlap); P43221|TLPA_BRAJA THIOL:DISULFIDE INTERCHANGE PROTEIN (CYTOCHROME C BIOGENESIS PROTEIN) from Bradyrhizobium japonicum (221 aa), FASTA scores: opt: 189, E(): 0.00017, (26.85% identity in 227 aa overlap); etc. Also similar to O06392|Rv0526|MTCY25D10.05 HYPOTHETICAL 23.2 KDA PROTEIN from Mycobacterium tuberculosis (216 aa) FASTA scores: opt: 160, E(): 0.0093, (27.45% identity in 142 aa overlap). Contains PS00194 Thioredoxin family active site. POSSIBLY BELONGS TO THE THIOREDOXIN FAMILY." /codon_start=1 /transl_table=11 /product="membrane-anchored thioredoxin-like protein" /protein_id="NP_218190.1" /db_xref="GI:15610809" /db_xref="GeneID:885190" /translation="MPSLPTTPAETAMTTLTGKTRWTIAILAVVAALMAALVAQLHDY SASSTISQRPAPREHRDGDTPEALAWSRQRANLPPCPAAGNGPGAAALRGVVVVCAGD GSAVDVARALAGRRVVINLWAHWCAPCMTELPVMAEYQRRVGPAVLVVTVHQGQNEAA ALSRLADLGVRLPTLQDDRRRVAAALRVANVMPATVVLRPDGSVAQTLPRAFGSADEI VAAVGNDAG" misc_feature complement(4114750..4114806) /locus_tag="Rv3673c" /note="PS00194 Thioredoxin family active site." gene complement(4115157..4115894) /gene="nth" /locus_tag="Rv3674c" /db_xref="GeneID:885058" CDS complement(4115157..4115894) /gene="nth" /locus_tag="Rv3674c" /EC_number="4.2.99.18" /function="HAS BOTH AN APURINIC AND/OR APYRIMIDINIC ENDONUCLEASE ACTIVITY AND A DNA N-GLYCOSYLASE ACTIVITY. INCISES DAMAGED DNA AT CYTOSINES, THYMINES AND GUANINES. ACTS ON A DAMAGED STRAND (OXIDIZED PYRIMIDINES), 5' FROM THE DAMAGED SITE [CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE NEAR APURINIC OR APYRIMIDINIC SITES TO PRODUCTS WITH 5'-PHOSPHATE]." /note="Rv3674c, (MT3775, MTV025.022c), len: 245 aa. Probable nth, endonuclease III (EC 4.2.99.18) (see citation below), equivalent to Q9CB92|NTH|ML2301 PUTATIVE ENDONUCLEASE III from Mycobacterium leprae (272 aa), FASTA scores: opt: 1363, E(): 3.6e-81, (89.4% identity in 226 aa overlap). Also similar to many e.g. Q9XA44|SCH17.03c from Streptomyces coelicolor (250 aa), FASTA scores: opt: 937, E(): 2.2e-55, (61.65% identity in 219 aa overlap); P46303|UVEN_MICLU from Micrococcus luteus (Micrococcus lysodeikticus) (279 aa), FASTA scores: opt: 899, E(): 8.1e-53, (58.45% identity in 248 aa overlap); P73715|END3_SYNY3|NTH|SLR1822 from Synechocystis sp. strain PCC 6803 (219 aa), FASTA scores: opt: 684, E(): 1.7e-38, (52.2% identity in 203 aa overlap); P39788|END3_BACSU|NTH|JOOB from Bacillus subtilis (219 aa), FASTA scores: opt: 552, E(): 1.2e-29, (43.3% identity in 194 aa overlap); etc. Equivalent to AAK48142 from Mycobacterium tuberculosis strain CDC1551 (262 aa) but shorter 17 aa. Contains PS00764 Endonuclease III iron-sulfur binding region signature, and PS01155 Endonuclease III family signature. BELONGS TO THE NTH/MUTY FAMILY. COFACTOR: BINDS A 4FE-4S CLUSTER WHICH IS NOT IMPORTANT FOR THE CATALYTIC ACTIVITY, BUT WHICH IS PROBABLY INVOLVED IN THE PROPER POSITIONING OF THE ENZYME ALONG THE DNA STRAND (BY SIMILARITY). N-terminus extended since first submission (previously 226 aa)." /codon_start=1 /transl_table=11 /product="endonuclease III" /protein_id="NP_218191.2" /db_xref="GI:57117142" /db_xref="GeneID:885058" /translation="MPGRWSAETRLALVRRARRMNRALAQAFPHVYCELDFTTPLELA VATILSAQSTDKRVNLTTPALFARYRTARDYAQADRTELESLIRPTGFYRNKAASLIG LGQALVERFGGEVPATMDKLVTLPGVGRKTANVILGNAFGIPGITVDTHFGRLVRRWR WTTAEDPVKVEQAVGELIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFG LGPTEPLLAAPLVQGPETDHLLALAGL" misc_feature complement(4115253..4115303) /gene="nth" /locus_tag="Rv3674c" /note="PS00764 Endonuclease III iron-sulfur binding region signature." misc_feature complement(4115469..4115558) /gene="nth" /locus_tag="Rv3674c" /note="PS01155 Endonuclease III family signature." gene 4116002..4116379 /locus_tag="Rv3675" /db_xref="GeneID:885155" CDS 4116002..4116379 /locus_tag="Rv3675" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3675, (MTV025.023), len: 125 aa. Possible membrane protein, with some similarity to Q9YCZ2|APE1120 HYPOTHETICAL 11.7 KDA PROTEIN from Aeropyrum pernix (103 aa), FASTA scores: opt: 100, E(): 9, (40.0% identity in 55 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218192.1" /db_xref="GI:15610811" /db_xref="GeneID:885155" /translation="MFTLLVSWLLVACVPGLLMLATLGLGRLERFLARDTVTATDVAE FLEQAEAVDVHTLARNGMPEALDYLHRRQARRITDSPPLGSGAGPRYAGPLFVTDLDS PVEPPRHGQPNPQFRTARHANHV" gene 4116478..4117152 /locus_tag="Rv3676" /db_xref="GeneID:885502" CDS 4116478..4117152 /locus_tag="Rv3676" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3676, (MTV025.024), len: 224 aa. Probable transcriptional regulator belonging to crp/fnr family, identical to Q9CB91|ML2302 PUTATIVE CRP/FNR-FAMILY TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (224 aa), FASTA scores: opt: 1408, E(): 8.8e-81, (95.95% identity in 224 aa overlap). Also highly similar to transcriptional regulators AAK58838 from Corynebacterium glutamicum (Brevibacterium flavum) (227 aa), FASTA scores: opt: 1178, E(): 1.9e-66, (79.9% identity in 224 aa overlap); and Q9XA42|SCH17.05 from Streptomyces coelicolor (224 aa), FASTA scores: opt: 869, E(): 3.4e-47, (54.45% identity in 224 aa overlap); and similar to others e.g. Q9RRX0|DR2362 from Deinococcus radiodurans (231 aa) FASTA scores: opt: 344, E(): 1.8e-14, (30.8% identity in 211 aa overlap); P29281|CRP_HAEIN from Haemophilus influenzae (224 aa), FASTA scores: opt: 330, E(): 1.3e-13, (32.25% identity in 189 aa overlap); P03020|CRP_ECOLI|CAP|CSM|B3357 from Escherichia coli strain K12 and Shigella flexneri (210 aa), FASTA scores: opt: 323, E(): 3.5e-13, (32.25% identity in 189 aa overlap); etc. Contains helix-turn-helix motif at aa 175-196 (Score 1990, +5.96 SD). BELONGS TO THE CRP/FNR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="CRP/FNR family transcriptional regulator" /protein_id="NP_218193.1" /db_xref="GI:15610812" /db_xref="GeneID:885502" /translation="MDEILARAGIFQGVEPSAIAALTKQLQPVDFPRGHTVFAEGEPG DRLYIIISGKVKIGRRAPDGRENLLTIMGPSDMFGELSIFDPGPRTSSATTITEVRAV SMDRDALRSWIADRPEISEQLLRVLARRLRRTNNNLADLIFTDVPGRVAKQLLQLAQR FGTQEGGALRVTHDLTQEEIAQLVGASRETVNKALADFAHRGWIRLEGKSVLISDSER LARRAR" gene complement(4117258..4118052) /locus_tag="Rv3677c" /db_xref="GeneID:885079" CDS complement(4117258..4118052) /locus_tag="Rv3677c" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3677c, (MTV025.025c), len: 264 aa. Possible hydrolase (EC 3.-.-.-), equivalent to Q9CB90|ML2303 PUTATIVE HYDROLASE from Mycobacterium leprae (262 aa) FASTA scores: opt: 1400, E(): 8.5e-81, (82.05% identity in 262 aa overlap). Also similar to other hydrolases and hypothetical proteins e.g. Q9XA41|SCH17.06c PUTATIVE HYDROLASE from Streptomyces coelicolor (256 aa) FASTA scores: opt: 609, E(): 3.9e-31, (54.65% identity in 247 aa overlap); Q9A9Q1|CC0923 METALLO-BETA-LACTAMASE FAMILY PROTEIN from Caulobacter crescentus (297 aa), FASTA scores: opt: 306, E(): 4.7e-12, (35.45% identity in 268 aa overlap); Q9Y392 CGI-83 PROTEIN from Homo sapiens (Human) (288 aa), FASTA scores: opt: 281, E(): 1.7e-10, (33.2% identity in 259 aa overlap); Q9F7R6 PREDICTED METALLOBETA LACTAMASE FOLD PROTEIN from uncultured proteobacterium EBAC31A08 (265 aa), FASTA scores: opt: 257, E(): 5.1e-09, (32.55% identity in 252 aa overlap); Q9PBI4|XF2160 HYDROXYACYLGLUTATHIONE HYDROLASE from Xylella fastidiosa (258 aa), FASTA scores: opt: 232, E(): 1.9e-07, (30.3% identity in 165 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_218194.1" /db_xref="GI:15610813" /db_xref="GeneID:885079" /translation="MSKTAESLTHPAYGQLRAVTDTASVLLADNPGLLTLDGTNTWVL RGPLSDELVVVDPGPDDDEHLARVAALGRIALVLISHRHGDHTSGIDKLVALTGAPVR AADPQFLRRDGETLTDGEVIDVAGLTITVLATPGHTADSLSFVLDDAVLTADTVLGCG TTVIDKEDGSLADYLESLHRLRGLGRRTVLPGHGPDLLDLEAIASGYLLHRHERLEQI RAALRDLGDDATVREVVEHVYLDVDEKLWNAAEWSVQAQLDYLRTR" gene complement(4118059..4118514) /locus_tag="Rv3678c" /db_xref="GeneID:885495" CDS complement(4118059..4118514) /locus_tag="Rv3678c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3678c, (MTV025.026c), len: 151 aa. Conserved hypothetical protein, equivalent, but shorter 23 aa, to Q9CB89|ML2304 HYPOTHETICAL PROTEIN from Mycobacterium leprae (174 aa), FASTA scores: opt: 746, E(): 2.1e-40, (78.15% identity in 151 aa overlap). Also highly similar to many hypothetical proteins or transcription regulators e.g. Q9XA38|SCH17.09c from Streptomyces coelicolor (155 aa), FASTA scores: opt: 637, E(): 1.5e-33, (69.1% identity in 152 aa overlap); BAB48205|MLR0658 from Rhizobium loti (Mesorhizobium loti) (154 aa), FASTA scores: opt: 500, E(): 6.8e-25, (55.35% identity in 150 aa overlap); BAB50615|MLR3802 TRANSCRIPTION REGULATOR from Rhizobium loti (Mesorhizobium loti) (153 aa), FASTA scores: opt: 425,E(): 3.8e-20, (44.35% identity in 151 aa overlap); Q9U0W7|L7276.02 from Leishmania major (163 aa) FASTA scores: opt: 404, E(): 8.5e-19, (47.7% identity in 151 aa overlap); Q9UZA3|PAB0825 PUTATIVE TRANSLATION INITIATION INHIBITOR from Pyrococcus abyssi (127 aa), FASTA scores: opt: 108, E(): 3.7, (30.75% identity in 130 aa overlap); etc. Contains PS00044 Bacterial regulatory proteins, lysR family signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218195.1" /db_xref="GI:15610814" /db_xref="GeneID:885495" /translation="MSAKARLGQLGVTLPQVAAPLAAYVPAVRTGNLVYTAGQLPLEA GKLVRTGKLGADVNPEEGKTLARICALNALAAVDSLVDLDAVTRVVKVVGFVASAPGF HGQPSVINGASDLLAEVFGDSGAHARSAVGVSELPLDAPVEVELIVEVG" misc_feature complement(4118407..4118499) /locus_tag="Rv3678c" /note="PS00044 Bacterial regulatory proteins, lysR family signature." gene complement(4118530..4118691) /locus_tag="Rv3678A" /db_xref="GeneID:3205049" CDS complement(4118530..4118691) /locus_tag="Rv3678A" /function="UNKNOWN" /note="Rv3678A, len: 53 aa. Conserved hypothetical protein, similar to SCH17.10|AL079353_10 conserved hypothetical protein from Streptomyces coelicolor (53 aa), FASTA scores: opt: 259, E(): 1.5e-13, (78.0% identity in 50 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_178004.1" /db_xref="GI:57117143" /db_xref="GeneID:3205049" /translation="MTQPTAWEYATVPLLTHATKQILDQWGADGWELVAVLPGPTGEQ HVAYLKRPK" gene 4118776..4119798 /locus_tag="Rv3679" /db_xref="GeneID:885809" CDS 4118776..4119798 /locus_tag="Rv3679" /EC_number="3.6.1.-" /function="ANION-TRANSPORTING ATPASE; SUPPOSED CATALYZES THE EXTRUSION OF UNDETERMINATED ANIONS [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED ANION(IN) = ADP + PHOSPHATE + UNDETERMINATED ANION(OUT)]." /experiment="experimental evidence, no additional details recorded" /note="Rv3679, (MTV025.027), len: 340 aa. Probable anion transporting ATPase (EC 3.6.1.-), equivalent to Q9CB88|ML2305 PROBABLE ANION TRANSPORTER PROTEIN from Mycobacterium leprae (341 aa), FASTA scores: opt: 1810, E(): 2.1e-98, (84.15% identity in 341 aa overlap). Also highly similar to Q9XA36|SCH17.11 PUTATIVE ION-TRANSPORTING ATPASE from Streptomyces coelicolor (325 aa), FASTA scores: opt: 989, E(): 1.4e-50, (52.15% identity in 328 aa overlap); and similar to many anion transporting ATPases (principally arsenite transporters) e.g. O50593|ARSA_ACIMU ARSENICAL PUMP-DRIVING ATPASE (ARSENITE-TRANSLOCATING ATPASE) from Acidiphilium multivorum (583 aa), FASTA scores: opt: 225, E(): 8.1e-06, (25.1% identity in 319 aa overlap); AAG43231|ARSA ARSENITE ACITVATED ATPASE from Salmonella typhimurium plasmid R46 FASTA scores: opt: 211, E(): 5.3e-05, (26.95% identity in 267 aa overlap); P52145|ARA2_ECOLI|ARSA ARSENICAL PUMP-DRIVING ATPASE from Escherichia coli plasmid IncN R46 (583 aa), FASTA scores: opt: 211, E(): 5.3e-05, (26.95% identity in 267 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). SOME SIMILARITY TO THE ARSA ATPASE FAMILY." /codon_start=1 /transl_table=11 /product="anion transporter ATPase" /protein_id="NP_218196.1" /db_xref="GI:15610815" /db_xref="GeneID:885809" /translation="MVATTSSGGSSVGWPSRLSGVRLHLVTGKGGTGKSTIAAALALT LAAGGRKVLLVEVEGRQGIAQLFDVPPLPYQELKIATAERGGQVNALAIDIEAAFLEY LDMFYNLGIAGRAMRRIGAVEFATTIAPGLRDVLLTGKIKETVVRLDKNKLPVYDAIV VDAPPTGRIARFLDVTKAVSDLAKGGPVHAQSEGVVKLLHSNQTAIHLVTLLEALPVQ ETLEAIEELAQMELPIGSVIVNRNIPAHLEPQDLAKAAEGEVDADSVRAGLLTAGVKL PDADFAGLLTETIQHATRITARAEIAQQLDALQVPRLELPTVSDGVDLGSLYELSESL AQQGVR" misc_feature 4118857..4118880 /locus_tag="Rv3679" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene 4119795..4120955 /locus_tag="Rv3680" /db_xref="GeneID:885317" CDS 4119795..4120955 /locus_tag="Rv3680" /EC_number="3.6.1.-" /function="ANION-TRANSPORTING ATPASE; SUPPOSED CATALYZES THE EXTRUSION OF UNDETERMINATED ANIONS [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED ANION(IN) = ADP + PHOSPHATE + UNDETERMINATED ANION(OUT)]." /experiment="experimental evidence, no additional details recorded" /note="Rv3680, (MTV025.028), len: 386 aa. Probable anion transporting ATPase (EC 3.6.1.-), equivalent to Q9CB87|ML2306 PROBABLE ANION TRANSPORTER PROTEIN from Mycobacterium leprae (381 aa), FASTA scores: opt: 2131, E(): 6.5e-120, (88.1% identity in 370 aa overlap). Also highly similar, but shorter 29 aa, to Q9XA35|SCH17.12 PUTATIVE ION-TRANSPORTING ATPASE from Streptomyces coelicolor (481 aa), FASTA scores: opt: 1190, E(): 1.1e-63, (51.25% identity in 441 aa overlap); and similar to many anion transporting ATPases e.g. Q9UZA6|PAB1555 ANION TRANSPORTING ATPASE from Pyrococcus abyssi (330 aa) FASTA scores: opt: 242, E(): 3e-07, (24.6% identity in 297 aa overlap); Q9P7F8|SPAC1142.06 PUTATIVE ARSENITE-TRANSLOCATING from Schizosaccharomyces pombe (Fission yeast) (329 aa), FASTA scores: opt: 239, E(): 4.5e-07, (27.9% identity in 197 aa overlap); Q9HS79|ARSA1|VNG0365G ARSENICAL PUMP-DRIVING ATPASE from Halobacterium sp. strain NRC-1 (347 aa), FASTA scores: opt: 238, E(): 5.4e-07, (29.35% identity in 358 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="anion transporter ATPase" /protein_id="NP_218197.1" /db_xref="GI:15610816" /db_xref="GeneID:885317" /translation="MSVTPKTLDMGAILADTSNRVVVCCGAGGVGKTTTAAALALRAA EYGRTVVVLTIDPAKRLAQALGINDLGNTPQRVPLAPEVPGELHAMMLDMRRTFDEMV MQYSGPERAQSILDNQFYQTVATSLAGTQEYMAMEKLGQLLSQDRWDLIVVDTPPSRN ALDFLDAPKRLGSFMDSRLWRLLLAPGRGIGRLITGVMGLAMKALSTVLGSQMLADAA AFVQSLDATFGGFREKADRTYALLKRRGTQFVVVSAAEPDALREASFFVDRLSQESMP LAGLVFNRTHPMLCALPIERAIDAAETLDAETTDSDATSLAAAVLRIHAERGQTAKRE IRLLSRFTGANPTVPVVGVPSLPFDVSDLEALRALADQLTTVGNDAGRAAGR" misc_feature 4119870..4119893 /locus_tag="Rv3680" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." gene complement(4121198..4121554) /gene="whiB4" /locus_tag="Rv3681c" /db_xref="GeneID:885320" CDS complement(4121198..4121554) /gene="whiB4" /locus_tag="Rv3681c" /function="INVOLVED IN A TRANSCRIPTIONAL MECHANISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3681c, (MTV025.029c), len: 118 aa. Probable whiB4 (alternate gene name: whmA), WhiB-like regulatory protein (see Hutter & Dick 1999), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Equivalent to ML2307 HYPOTHETICAL PROTEIN from Mycobacterium leprae (116 aa). Also highly similar to Q9S2B9|SCH17.13c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (112 aa), FASTA scores: opt: 392, E(): 1e-20, (67.95% identity in 78 aa overlap); Q9X951|WBLA HYPOTHETICAL 14.3 KDA PROTEIN from Streptomyces coelicolor (129 aa), FASTA scores: opt: 392, E(): 1.1e-20, (67.95% identity in 78 aa overlap); Q9ACZ0|SCP1.161c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (268 aa), FASTA scores: opt: 273, E(): 4.4e-12, (50.0% identity in 78 aa overlap); Q06387|WHIB-STV from Streptomyces griseocarneus (87 aa) FASTA scores: opt: 231, E(): 1.5e-09, (43.85% identity in 73 aa overlap); etc. Also similar to several putative regulator proteins from Mycobacterium tuberculosis e.g. MTCY7D11_7; MTCY78_13; MTCY10H4_23; MTCY1A6_6; and U00016_29 from Mycobacterium leprae. N-terminus shortened since first submission.; whmA" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB4" /protein_id="NP_218198.2" /db_xref="GI:57117144" /db_xref="GeneID:885320" /translation="MSGTRPAARRTNLTAAQNVVRSVDAEERIAWVSKALCRTTDPDE LFVRGAAQRKAAVICRHCPVMQECAADALDNKVEFGVWGGMTERQRRALLKQHPEVVS WSDYLEKRKRRTGTAG" gene 4121916..4124348 /gene="ponA2" /locus_tag="Rv3682" /db_xref="GeneID:885751" CDS 4121916..4124348 /gene="ponA2" /locus_tag="Rv3682" /EC_number="2.4.2.-" /EC_number="3.4.-.-" /function="INVOLVED IN PEPTIDOGLYCAN SYNTHESIS (AT THE FINAL STAGES), CELL WALL FORMATION. SYNTHESIS OF CROSS-LINKED PEPTIDOGLYCAN FROM THE LIPID INTERMEDIATES. THE ENZYME HAS A PENICILLIN-INSENSITIVE TRANSGLYCOSYLASE N-TERMINAL DOMAIN (FORMATION OF LINEAR GLYCAN STRANDS) AND A PENICILLIN-SENSITIVE TRANSPEPTIDASE C-TERMINAL DOMAIN (CROSS-LINKING OF THE PEPTIDE SUBUNITS). SUPPOSED INVOLVED IN STATIONARY-PHASE SURVIVAL." /experiment="experimental evidence, no additional details recorded" /note="(MUREIN POLYMERASE) [INCLUDES: PENICILLIN-INSENSITIVE TRANSGLYCOSYLASE (PEPTIDOGLYCAN TGASE) + PENICILLIN-SENSITIVE TRANSPEPTIDASE (DD-TRANSPEPTIDASE)]; Rv3682, (MTV025.030), len: 810 aa. Probable ponA2, penicillin-binding protein (class A), bienzymatic membrane-associated protein with transglycosylase (EC 2.4.2.-) and transpeptidase (EC 3.4.-.-) activities. Almost identical to Q9CB85|PON1|ML2308 PENICILLIN BINDING PROTEIN (CLASS A) from Mycobacterium leprae (803 aa) FASTA scores: opt: 4743, E(): 3.3e-217, (87.7% identity in 806 aa overlap); or P72351|PON1|PBP1 HIGH-MOLECULAR-MASS CLASS A PENICILLIN BINDING PROTEIN from Mycobacterium leprae Cosmid B577 (821 aa), FASTA scores: opt: 4547, E(): 6.3e-208, (88.05% identity in 769 aa overlap) (see Basu et al., 1996). Also equivalent to a predicted homologous protein from Mycobacterium smegmatis. Also similar to others e.g. Q9XA34|SCH17.14 from Streptomyces coelicolor (428 aa; fragment), FASTA scores: opt: 727, E(): 2.3e-27, (36.55% identity in 413 aa overlap); Q9F9V7|PONA from Mycobacterium smegmatis (715 aa), FASTA scores: opt: 446, E(): 6.6e-14, (27.65% identity in 771 aa overlap) (see Billman-Jacobe et al., 1999); Q9CCY4|PONA|ML2688 from Mycobacterium leprae (708 aa), FASTA scores: opt: 413, E(): 2.4e-12, (26.8% identity in 660 aa overlap); Q9X6W0|PONB|MRCB|PA4700 from Pseudomonas aeruginosa (774 aa), FASTA scores: opt: 398, E(): 1.3e-11, (27.2% identity in 666 aa overlap); P45345|PBPB_HAEIN|MRCB|PONB|HI1725 (781 aa), FASTA scores: opt: 380, E(): 9.4e-11, (28.6% identity in 601 aa overlap); etc. Also similar to P71707|PONA1|Rv0050|MTCY21.13 PROBABLE BIFUNCTIONAL PENICILLIN-BINDING PROTEIN 1A/1B (PBP1) from Mycobacterium tuberculosis (678 aa) FASTA scores: opt: 372, E(): 2e-10, (28.35% identity in 769 aa overlap). SEEMS TO BELONG TO THE TRANSGLYCOSYLASE FAMILY IN THE N-TERMINAL SECTION, AND TO THE TRANSPEPTIDASE FAMILY IN THE C-TERMINAL SECTION" /codon_start=1 /transl_table=11 /product="bifunctional membrane-associated penicillin-binding protein 1A/1B" /protein_id="YP_178005.1" /db_xref="GI:57117145" /db_xref="GeneID:885751" /translation="MPERLPAAITVLKLAGCCLLASVVATALTFPFAGGLGLMSNRAS EVVANGSAQLLEGQVPAVSTMVDAKGNTIAWLYSQRRFEVPSDKIANTMKLAIVSIED KRFADHSGVDWKGTLTGLAGYASGDLDTRGGSTLEQQYVKNYQLLVTAQTDAEKRAAV ETTPARKLREIRMALTLDKTFTKSEILTRYLNLVSFGNNSFGVQDAAQTYFGINASDL NWQQAALLAGMVQSTSTLNPYTNPDGALARRNVVLDTMIENLPGEAEALRAAKAEPLG VLPQPNELPRGCIAAGDRAFFCDYVQEYLSRAGISKEQVATGGYLIRTTLDPEVQAPV KAAIDKYASPNLAGISSVMSVIKPGKDAHKVLAMASNRKYGLDLEAGETMRPQPFSLV GDGAGSIFKIFTTAAALDMGMGINAQLDVPPRFQAKGLGSGGAKGCPKETWCVVNAGN YRGSMNVTDALATSPNTAFAKLISQVGVGRAVDMAIKLGLRSYANPGTARDYNPDSNE SLADFVKRQNLGSFTLGPIELNALELSNVAATLASGGVWCPPNPIDQLIDRNGNEVAV TTETCDQVVPAGLANTLANAMSKDAVGSGTAAGSAGAAGWDLPMSGKTGTTEAHRSAG FVGFTNRYAAANYIYDDSSSPTDLCSGPLRHCGSGDLYGGNEPSRTWFAAMKPIANNF GEVQLPPTDPRYVDGAPGSRVPSVAGLDVDAARQRLKDAGFQVADQTNSVNSSAKYGE VVGTSPSGQTIPGSIVTIQISNGIPPAPPPPPLPEDGGPPPPVGSQVVEIPGLPPITI PLLAPPPPAPPP" gene 4124417..4125376 /locus_tag="Rv3683" /db_xref="GeneID:885780" CDS 4124417..4125376 /locus_tag="Rv3683" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3683, (MTV025.031), len: 319 aa. Conserved hypothetical protein, equivalent to Q9CB84|ML2309 HYPOTHETICAL PROTEIN from Mycobacterium leprae (330 aa) FASTA scores: opt: 1791, E(): 9e-107, (85.45% identity in 296 aa overlap). Also similar to Q9X935|SCH66.03 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (309 aa) FASTA scores: opt: 610, E(): 1.4e-31, (51.45% identity in 307 aa overlap); and Q9RRY7|YN45_DEIRA|DR2345 HYPOTHETICAL PROTEIN from Deinococcus radiodurans (305 aa) FASTA scores: opt: 243, E(): 3.2e-08, (31.1% identity in 315 aa overlap) and some similarity to other hypothetical bacterial proteins e.g. Q9CF81|YQED from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (278 aa) FASTA scores: opt: 200, E(): 1.6e-05, (26.85% identity in 287 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218200.1" /db_xref="GI:15610819" /db_xref="GeneID:885780" /translation="MAAVLPTLIRTGAVALGSAIAGIGYAALVERNAFVLREVTMPVL TPGSTPLRVLHISDLHMLPNQHRKQAWLRELASWEPDLVVNTGDNLAHPKAVPAVVQT LSDLLSRPGVFVFGSNDYFGPRLKNPMNYLTSPDHRVRGAALPWQDLRAAFTERGWLD LTHTRREFEVAGLHIAAAGVDDPHIDRDRYDTIAGPASPAANLRLGLTHSPEPRVLDR FAADGYQLVLAGHTHGGQLCLPLYGALVTNCGLDRSRAKGASHWGANMRLHVSAGIGT SPFAPVRFCCRPEATLLTLIATPMGGRDSSSNLGRSQPTVSVR" gene 4125439..4126479 /locus_tag="Rv3684" /db_xref="GeneID:885598" CDS 4125439..4126479 /locus_tag="Rv3684" /EC_number="4.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3684, (MTV025.032), len: 346 aa. Probable lyase (EC 4.-.-.-), and more specifically a cysteine synthase (EC 4.2.99.8), highly similar to many lyases e.g. Q9K3N2|SCG20A.08c PUTATIVE LYASE from Streptomyces coelicolor (374 aa), FASTA scores: opt: 1469, E(): 3.7e-85, (63.35% identity in 341 aa overlap) (shorter 31 aa at N-terminus); Q9KT44|VC1061 CYSTEINE SYNTHASE (EC 4.2.99.8)/CYSTATHIONINE BETA-SYNTHASE FAMILY PROTEIN from Vibrio cholerae (355 aa), FASTA scores: opt: 1366, E(): 1.1e-78, (63.25% identity in 321 aa overlap); Q9I4R3|PA1061 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (365 aa), FASTA scores: opt: 1311, E(): 3.2e-75, (59.8% identity in 341 aa overlap); Q9PH18|XF0128 CYSTEINE SYNTHASE from Xylella fastidiosa (390 aa), FASTA scores: opt: 1288, E(): 9.5e-74, (58.55% identity in 333 aa overlap) (shorter 34 aa at N-terminus); P55708|Y4XP_RHISN PUTATIVE CYSTEINE SYNTHASE from Rhizobium sp. strain NGR234 plasmid sym pNGR234a (336 aa), FASTA scores: opt: 376, E(): 2.1e-16, (29.2% identity in 315 aa overlap); etc. Equivalent to AAK48153 from Mycobacterium tuberculosis strain CDC1551 (368 aa) but shorter 22 aa." /codon_start=1 /transl_table=11 /product="lyase" /protein_id="NP_218201.1" /db_xref="GI:15610820" /db_xref="GeneID:885598" /translation="MIEADARRSADTHLLRYPLPAAWCTDVDVELYLKDETTHITGSL KHRLARSLFLYALCNGWINENTTVVEASSGSTAVSEAYFAALLGLPFIAVMPAATSAS KIALIESQGGRCHFVQNSSQVYAEAERVAKETGGHYLDQFTNAERATDWRGNNNIAES IYVQMREEKHPTPEWIVVGAGTGGTSATIGRYIRYRRHATRLCVVDPENSAFFPAYSE GRYDIVMPTSSRIEGIGRPRVEPSFLPGVVDRMVAVPDAASIAAARHVSAVLGRRVGP STGTNLWGAFGLLAEMVKQGRSGSVVTLLADSGDRYADTYFSDEWVSAQGLDPAGPAA ALVEFERSCRWT" gene 4126541..4126614 /locus_tag="Rvnt40" /note="tRNA-Pro(CGG)" /db_xref="GeneID:2700460" tRNA 4126541..4126614 /locus_tag="Rvnt40" /product="tRNA-Pro" /note="codon recognized: CCG" /anticodon=(pos:4126575..4126577,aa:Pro) /db_xref="GeneID:2700460" gene complement(4127295..4128725) /gene="cyp137" /locus_tag="Rv3685c" /db_xref="GeneID:885625" CDS complement(4127295..4128725) /gene="cyp137" /locus_tag="Rv3685c" /EC_number="1.14.-.-" /function="CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS." /note="Rv3685c, (MTV025.033c), len: 476 aa. Probable cyp137, cytochrome P-450 (EC 1.14.-.-), similar to many e.g. Q9VXY0|C4S3_DROME|CYP4S3|CG9081 from Drosophila melanogaster (Fruit fly) (495 aa), FASTA scores: opt: 376, E(): 1.2e-15, (28.35% identity in 413 aa overlap); Q59163|CYP110A2 from Anabaena variabilis (459 aa) FASTA scores: opt: 320, E(): 3.1e-12, (31.4% identity in 411 aa overlap); O23051|C883_ARATH from Arabidopsis thaliana (Mouse-ear cress) (490 aa), FASTA scores: opt: 313, E(): 8.8e-12, (28.25% identity in 425 aa overlap); etc. Also similar to many from Mycobacterium tuberculosis e.g. O53765|C13B_MYCTU|CYP135B1|Rv0568|MT0594|MTV039.06 (472 aa), FASTA scores: opt: 920, E(): 4.6e-49, (36.25% identity in 447 aa overlap); P96813|C138_MYCTU|CYP138|Rv0136|MT0144|MTCI5.10 (441 aa) FASTA scores: opt: 886, E(): 5.3e-47, (35.5% identity in 445 aa overlap); etc. BELONGS TO THE CYTOCHROME P450 FAMILY. TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="cytochrome P450 137" /protein_id="NP_218202.1" /db_xref="GI:15610821" /db_xref="GeneID:885625" /translation="MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGL PAPRGFRAAFAAAYAVAYLAGGERRMLRLIRRYGPIMTMPILSLGDVAIVSDSALAKE VFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPEHLRRRKLLTPPLHGAALDRYVPII ENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDDPEEVRRLGRPFERLLNLG VSEQLTVRYALRRLGALRVWPARARANTEIDDVVMALIAQRRADPRLGERHDVLSLLV SARGESGEQLSDSEIRDDLITLVLAGHETTATTLAWAFDLLLHHPDALRRVRAEAVGG GEAFTTAVINETLRVRPPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYE HPHEFRPERFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDD EPERIVRRSIMLVPRRGTRVRFRPAR" gene complement(4128751..4129083) /locus_tag="Rv3686c" /db_xref="GeneID:885306" CDS complement(4128751..4129083) /locus_tag="Rv3686c" /function="UNKNOWN" /note="Rv3686c, (MTV025.034c), len: 110 aa. Hypothetical protein, similar to P96893|Rv3288c|MTCY71.28c HYPOTHETICAL 15.2 KDA PROTEIN from Mycobacterium tuberculosis (and Mycobacterium bovis) (137 aa) FASTA scores: opt: 106, E(): 5.6, (29.1% identity in 79 aa overlap); and a few hypothetical proteins e.g. Q9GUV6|L2259.2 from Leishmania major (360 aa) FASTA scores: opt: 118, E(): 2.1, (28.7% identity in 101 aa overlap). Equivalent to AAK48155 from Mycobacterium tuberculosis strain CDC1551 (166 aa) but shorter 56 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218203.1" /db_xref="GI:15610822" /db_xref="GeneID:885306" /translation="MVYTGSDAGDHASAPQPSGSGSVPASVNVPGLVVAAVWAVGLVA GLVALTIGHLAVAAAALVVAVMAPWCRVAYIAHGQHRVCGETLRGTPAGETASFPTGW RGLRFSTR" gene complement(4129323..4129691) /gene="rsfB" /locus_tag="Rv3687c" /db_xref="GeneID:885599" CDS complement(4129323..4129691) /gene="rsfB" /locus_tag="Rv3687c" /function="REGULATES NEGATIVELY Rv3287c|RSBW|USFX. POSSIBLY REGULATED BY PHOSPHORYLATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3687c, (MTV025.035c), len: 122 aa. rsfB, anti-anti-sigma factor (see citation below), showing some similarity to sporulation proteins and sigma-factor genes e.g. Q9WVX8|RSBV_STRCO|BLDG|SCH5.12c ANTI-SIGMA B FACTOR ANTAGONIST from Streptomyces coelicolor (113 aa) FASTA scores: opt: 163, E(): 0.0007, (31.15% identity in 106 aa overlap); Q9F3A2|SC5F1.27c PUTATIVE ANTI-SIGMA FACTOR ANTAGONIST from Streptomyces coelicolor (114 aa) FASTA scores: opt: 159, E(): 0.0013, (29.8% identity in 104 aa overlap); P73609|SLR1859 HYPOTHETICAL 12.0 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (108 aa) FASTA scores: opt: 152, E(): 0.0034, (32.2% identity in 90 aa overlap); L47358|BACSPOI_1 spoIIA A from Paenibacillus polymyxa (117 aa), FASTA scores: opt: 107, E(): 0.23, (24.8% identity in 113 aa overlap); SQSIGB_4 rsbU, rsbV, rsbW & sigB genes from Steptomyces aureus (108 aa) (28.3% identity in 60 aa overlap); etc. Also similar to hypothetical proteins from Mycobacterium tuberculosis e.g. MTCY180_14 and MTCY441 _8." /codon_start=1 /transl_table=11 /product="anti-anti-sigma factor RSFB (anti-sigma factor antagonist) (regulator of sigma F B)" /protein_id="NP_218204.1" /db_xref="GI:15610823" /db_xref="GeneID:885599" /translation="MSAPDSITVTVADHNGVAVLSIGGEIDLITAAALEEAIGEVVAD NPTALVIDLSAVEFLGSVGLKILAATSEKIGQSVKFGVVARGSVTRRPIHLMGLDKTF RLFSTLHDALTGVRGGRIDR" gene complement(4129893..4130357) /locus_tag="Rv3688c" /db_xref="GeneID:885563" CDS complement(4129893..4130357) /locus_tag="Rv3688c" /function="UNKNOWN" /note="Rv3688c, (MTV025.036c), len: 139 aa. Hypothetical protein, similar to other bacterial hypothetical proteins e.g. Q9X934|SCH66.02c from Streptomyces coelicolor (154 aa), FASTA scores: opt: 425, E(): 3.4e-19, (46.1% identity in 154 aa overlap); Q9WZF4|TM0690 from Thermotoga maritima (149 aa), FASTA scores: opt: 326, E(): 3.4e-13, (40.4% identity in 151 aa overlap); Q9PHU3|CJ0573 from Campylobacter jejuni (147 aa), FASTA scores: opt:290 , E(): 5.1e-11, (36.4% identity in 151 aa overlap); etc. Also some similarity to upstream O69654|Rv3686c|MTV025.034c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis. TBparse score is 0.880." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218205.1" /db_xref="GI:15610824" /db_xref="GeneID:885563" /translation="MAELKSQLRSDLTQAMKTQDKLRTATIRMLLAAIQTEEVSGKQA RELSDDEVIKVLARESRKRGEAAEIYTQNGRGELAATEHAEARIIDEYLPTPLTEGEL ADVADTAIAEVAEELGHRPSMKQMGLVMKAATVIAAGKADGARLSAAVKERL" gene 4130357..4131712 /locus_tag="Rv3689" /db_xref="GeneID:885107" CDS 4130357..4131712 /locus_tag="Rv3689" /function="UNKNOWN" /note="Rv3689, (MTV025.037), len: 451 aa. Probable conserved transmembrane protein, with Proline rich N-terminus, similar to Q9KYW6|SCE33.17 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (462 aa) FASTA scores: opt: 730, E(): 2.7e-21, (38.1% identity in 412 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218206.1" /db_xref="GI:15610825" /db_xref="GeneID:885107" /translation="MHKRYAPQRPKPDTETYIEKCTDRRQDGGHDERRQLLRPVSMLP PGYPVEPPPVAPGYAPAGYPPYPATPPGYGPPGYGAPPSYGPPPGYGPPLGYPAAPPG CGPPPGYGPPLGYGPPVAPGAVKPGIIPLRPLTLSDIFNGAVGYIRANPKATLGLTAM VVVTLQIISLVALFGPMTAFGDIVTGEPDELTGAVVGGWSASFGASLLVSWLAGVLLS GMLTVIVGRAVFGSPITVGEAWAKVRGRLLALFGLALLEAAGVVAVLGLAVVILSGVA AAANEAAAALLGFPLLLVVGVSLAYLYVVLLFAPVLIVLERLPIVEAITRSFALVRHG FWRVLGIRLLTVLVVGVVGNAIAAPFMIVGEIVTAVTASDGSVTMRLVGATLSAIGVT IGQIVTAPFSAGVVVLLYTDRRIRAEAFDLVLQTGLEAGPAGGPAPVESTDNLWLTRP F" gene 4131739..4132392 /locus_tag="Rv3690" /db_xref="GeneID:885114" CDS 4131739..4132392 /locus_tag="Rv3690" /function="UNKNOWN" /note="Rv3690, (MTV025.038), len: 217 aa. Probable conserved membrane protein, similar to Q9KYW5|SCE33.18 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (231 aa), FASTA scores: opt: 419, E(): 1.5e-19, (36.0% identity in 211 aa overlap). Equivalent to AAK48159 from Mycobacterium tuberculosis strain CDC1551 (233 aa) but shorter 16 aa. TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218207.1" /db_xref="GI:15610826" /db_xref="GeneID:885114" /translation="MPSIDIDREAAHQAAQRELDKPIYPKDSLTKELTDWIDEQLYRI LEKGSSIPGGWFTITVLLILLMIAVTAAVQIARRTMRTNRGGDYQLFDAGQLTAAQHR STAESYAAEGNWAAAIRHRLQAVARELEETGMLNPAAGRTANELASDAGEVLPHLAGE LTQAATAFNDVTYGERPGTQGAYQMIADLDDHLRSRSPAVVSAVQHPAVFDSWAQVR" gene 4132518..4133519 /locus_tag="Rv3691" /db_xref="GeneID:885623" CDS 4132518..4133519 /locus_tag="Rv3691" /function="UNKNOWN" /note="Rv3691, (MTV025.039), len: 333 aa. Conserved hypothetical protein, similar to Q9KYW4|SCE33.19 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (387 aa) FASTA scores: opt: 481, E(): 6e-23, (36.6% identity in 358 aa overlap). Equivalent to AAK48160 from Mycobacterium tuberculosis strain CDC1551 (381 aa) but shorter 48 aa. TBparse score is 0.931." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218208.1" /db_xref="GI:15610827" /db_xref="GeneID:885623" /translation="MAPASTSSTGGHALATLLGNHGVEVVVADSIADVEAAARPDSLL LVAQTQYLVDNALLDRLAKAPGDLLLVAPTSRTRTALTPQLRIAAASPFNSQPNCTLR EANRAGSVQWGPSDTYQATGDLVLTSCYGGALVRFRAEGRTITVVGSSNFMTNGGLLP AGNAALAMNLAGNRPRLVWYAPDHIEGEMSSPSSLSDLIPENVHWTIWQLWLVVLLVA LWKGRRIGPLVAEELPVVIRASETVEGRGRLYRSRRARDRAADALRTATLQRLRPRLG VGAGAPAPAVVTTIAQRSKADPPFVAYHLFGPAPATDNDLLQLARALDDIERQVTHS" gene 4133516..4134592 /gene="moxR2" /locus_tag="Rv3692" /db_xref="GeneID:885323" CDS 4133516..4134592 /gene="moxR2" /locus_tag="Rv3692" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM; REGULATES METHANOL DEHYDROGENASE." /note="Rv3692, (MTV025.040), len: 358 aa. Probable moxR2, methanol dehydrogenase regulatory protein, highly similar (generally longer at N-terminus) to Q9KYW3|SCE33.20 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (329 aa), FASTA scores: opt: 1523, E(): 4.2e-74, (70.9% identity in 330 aa overlap); Q9Z538|SC9B2.21c PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (332 aa) FASTA scores: opt: 1008, E(): 1.1e-46, (50.8% identity in 313 aa overlap); Q9UZ67|MOXR-3|PAB0848 METHANOL DEHYDROGENASE REGULATORY PROTEIN from Pyrococcus abyssi (314 aa), FASTA scores: opt: 989, E(): 1.1e-45, (50.65% identity in 302 aa overlap); Q9AAN1|CC0566 MOXR PROTEIN from Caulobacter crescentus (323 aa), FASTA scores: opt: 988, E(): 1.3e-45, (52.3% identity in 306 aa overlap); etc. Also similar to O53170|MTV007.26|MOXR|Rv1479 from Mycobacterium tuberculosis (377 aa); and O07392|AF002133_6|MOXR from Mycobacterium avium (309 aa). Also high similarity with several hypothetical bacterial proteins. TBparse score is 0.912." /codon_start=1 /transl_table=11 /product="methanol dehydrogenase transcriptional regulatory protein MOXR2" /protein_id="NP_218209.1" /db_xref="GI:15610828" /db_xref="GeneID:885323" /translation="MTQSASNPQAPPTQTPGAELPGYPPQAGGAPTAAPSGPHPHRAE AESARDALLALRAEVAKAVVGQDGVISGLVIALLCRGHVLLEGVPGVAKTLIVRAMSA ALQLEFKRVQFTPDLMPGDVTGSLVYDARTAEFVFRPGPVFTNLLLADEINRTPPKTQ AALLEAMEERQVSVEGEPKPLPNPFIVAATQNPIEYEGTYQLPEAQLDRFLLKLNVTL PARDSEIAILDRHAHGFDPRDLSAINPVAGPAELAAGREAVRHVLVANEVLGYIVDIV GATRSSPALQLGVSPRGATALLGTARSWAWLSGRDYVTPDDVKAMARPTLRHRVMLRP EAELEGATPDGVLDGILASVPVPR" repeat_region 4134601..4134725 /note="125 bp Mycobacterial Interspersed Repetitive Unit, Class III." gene 4134726..4136048 /locus_tag="Rv3693" /db_xref="GeneID:885059" CDS 4134726..4136048 /locus_tag="Rv3693" /function="UNKNOWN" /note="Rv3693, (MTV025.041), len: 440 aa (alternative start at 41910). Possible conserved membrane protein, similar to Q9KYW2|SCE33.21 PUTATIVE LIPOPROTEIN from Streptomyces coelicolor (436 aa), FASTA scores: opt: 875, E(): 3.3e-46, (56.25% identity in 448 aa overlap); Q9AAN0|CC0567 HYPOTHETICAL PROTEIN from Caulobacter crescentus (437 aa), FASTA scores: opt: 355, E(): 2.3e-14, (30.9% identity in 450 aa overlap); P73233|SLR2013 HYPOTHETICAL 48.5 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (435 aa), FASTA scores: opt: 340, E(): 1.9e-13, (29.7% identity in 438 aa overlap); etc. Equivalent to AAK48162 from Mycobacterium tuberculosis strain CDC1551 (475 aa) but shorter 35 aa. Also similar to other hypothetical proteins from Mycobacterium tuberculosis; MTV014_7; MTV007_27; and MTCY71_36 M." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218210.1" /db_xref="GI:15610829" /db_xref="GeneID:885059" /translation="MILTGRTGLLALICVLPIALSPWPARAFVMLLVALAVAVTVDTL LAASTRKLRFTRSPYTSARLGQPVDASLLLCNGGRRRFRGQVRDAWPPSARAQPHTHD VDVAAGQRQQVHTALRPVRRGDQRAAMVTARSIGPLGLAGRQSSQSVPGLVRVLPPFL SRKHLPSRLAKLREIDGLLPTLIRGQGTEFDSLREYVVGDDVRSIDWRASARRADVMV RTWRPERDRRVVIVLDTGRMAAGRVGVDPTAADPAGWPRLDWSMDAALLLAALASRAG DHVDFLAHDRISRAGVFGASRSELLAQLVDAMAPLRPALIESDWHAMIATILRRTRRR SLVVLLTDLNATALDEGLLPVLPQLSARHHVLVAAVADPRVDQLAAGRSDAAAVYDAA AAERARNDRRAIASQLRRGGVDVIDAPPAEIAPGLADRYLAMKATGRL" gene complement(4136122..4137114) /locus_tag="Rv3694c" /db_xref="GeneID:885578" CDS complement(4136122..4137114) /locus_tag="Rv3694c" /function="UNKNOWN" /note="Rv3694c, (MTV025.042c), len: 330 aa. Possible conserved transmembrane protein, highly similar to Q9KZM4|SCE34.01c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (335 aa), FASTA scores: opt: 1113, E(): 2.5e-60, (51.5% identity in 334 aa overlap); and similar to Q9KEW6|BH0733 HYPOTHETICAL PROTEIN from Bacillus halodurans (355 aa), FASTA scores: opt: 381, E(): 6.1e-16, (24.15% identity in 331 aa overlap); Q9AAM9|CC0568 HYPOTHETICAL PROTEIN from Caulobacter crescentus (332 aa), FASTA scores: opt: 352, E(): 3.3e-14, (30.3% identity in 310 aa overlap); P74166|SLR1478 HYPOTHETICAL 35.4 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (317 aa), FASTA scores: opt: 330, E(): 6.8e-13, (25.65% identity in 308 aa overlap); etc. C-terminal end shows similarity to O29631|AF0624|AE001061_10 CONSERVED HYPOTHETICAL PROTEIN (putative nifU protein) from Archaeoglobus fulgidus (185 aa), FASTA scores: opt: 154, E(): 0.021, (29.0% identity in 131 aa overlap). Equivalent to AAK48163 from Mycobacterium tuberculosis strain CDC1551 (395 aa) but shorter 65 aa. Also some similarity to MTCY428_20 HYPOTHETICAL 43.7 KDA PROTEIN from Mycobacterium tuberculosis." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218211.1" /db_xref="GI:15610830" /db_xref="GeneID:885578" /translation="MDVDAFLLTNRGTWDRLDHLIKKRHSLSGAEIDELVELYQRVST HLSMLRSASSDQLMTGRLSSLVARARSAVTGAHAPLTRTFIRFWTVSFPVVAYRTWRW WLATAVAFFAVVVLIGFWVAGSHEVQSAIGTPTEIDELVSHDVQSYYSEHPAASFALQ VWVNNSWVATTCIAMSVVLGLPIPLVLFDNAANVGLIAGLMFQAGKGDFLLGLLLPHG LLELTAVFLAAAIGMRLGWSVISAGNRPRGQVLAEQGRGVVSVAVGLVGVFLVAGLIE AVVTPSPLPTFVRIAVGIIAEAVFLSYIGYFGRRAAQAGETGDMEDAPDVVPTG" gene 4137206..4138138 /locus_tag="Rv3695" /db_xref="GeneID:885538" CDS 4137206..4138138 /locus_tag="Rv3695" /function="UNKNOWN" /note="Rv3695, (MTV025.043), len: 310 aa. Possible conserved membrane protein, equivalent, but longer 88 aa, to Q9CB83|ML2312 POSSIBLE MEMBRANE PROTEIN from Mycobacterium leprae (196 aa), FASTA scores: opt: 898, E(): 5.2e-36, (71.05% identity in 190 aa overlap). Also highly similar to Q9KZM3|SCE34.02 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (318 aa), FASTA scores: opt: 740,E(): 2.4e-28, (43.25% identity in 319 aa overlap); and similar to P72718|SLR0254 HYPOTHETICAL 30.4 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (266 aa), FASTA scores: opt: 287, E(): 6.1e-07, (29.6% identity in 260 aa overlap); Q9HW83|PA4318 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 250, E(): 3.5e-05, (32.0% identity in 203 aa overlap); Q9KEW5|BH0734 HYPOTHETICAL PROTEIN from Bacillus halodurans (266 aa), FASTA scores: opt: 168, E(): 0.0047, (25.95% identity in 231 aa overlap); etc. C-terminal end shows some similarity to proline-rich proteins e.g. Q62106 PROLINE-RICH SALIVARY PROTEIN (FRAGMENT) from Mus musculus (Mouse) (188 aa) (36.1% identity in 97 aa overlap). Equivalent to AAK48164 from Mycobacterium tuberculosis strain CDC1551 (269 aa) but longer 41 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218212.1" /db_xref="GI:15610831" /db_xref="GeneID:885538" /translation="MSEVVTGDAVVLDVQIAQLPVRAVSAVIDITIIFIGYILGLMLW ATALTQFDEALTTAFLIIFTVLALVGYPLVWETATRGRSVGKIVMGLRVVSDDGGPER FRQALFRALASVVEIWMLLGSPAVICSMLSPKAKRVGDVFAGTVVVSERGPRLGPPPV MPPSLAWWASSLQLSGLTAGQAEVARQFLVRAPQLDPALREQMAYRIAGDVVARIAPP PPPGVPPQLVLAAVLAERHRRELLRLRPTLPPAGQAPWAQMAPHRGWPPGLSGATPWS PQQPVIPWPEPDPPPQAAPWPQQAPDGPGFSPPG" gene complement(4138202..4139755) /gene="glpK" /locus_tag="Rv3696c" /db_xref="GeneID:885280" CDS complement(4138202..4139755) /gene="glpK" /locus_tag="Rv3696c" /EC_number="2.7.1.30" /function="ACTS IN RATE-LIMITING STEP IN GLYCEROL UTILIZATION. KEY ENZYME IN THE REGULATION OF GLYCEROL UPTAKE AND METABOLISM [CATALYTIC ACTIVITY: ATP + GLYCEROL = ADP + GLYCEROL 3-PHOSPHATE]." /note="Converts glycerol and ADP to glycerol-3-phosphate and ADP" /codon_start=1 /transl_table=11 /product="glycerol kinase" /protein_id="NP_218213.1" /db_xref="GI:15610832" /db_xref="GeneID:885280" /translation="MSDAILGEQLAESSDFIAAIDQGTTSTRCMIFDHHGAEVARHQL EHEQILPRAGWVEHNPVEIWERTASVLISVLNATNLSPKDIAALGITNQRETTLVWNR HTGRPYYNAIVWQDTRTDRIASALDRDGRGNLIRRKAGLPPATYFSGGKLQWILENVD GVRAAAENGDALFGTPDTWVLWNLTGGPRGGVHVTDVTNASRTMLMDLETLDWDDELL SLFSIPRAMLPEIASSAPSEPYGVTLATGPVGGEVPITGVLGDQHAAMVGQVCLAPGE AKNTYGTGNFLLLNTGETIVRSNNGLLTTVCYQFGNAKPVYALEGSIAVTGSAVQWLR DQLGIISGAAQSEALARQVPDNGGMYFVPAFSGLFAPYWRSDARGAIVGLSRFNTNAH LARATLEAICYQSRDVVDAMEADSGVRLQVLKVDGGITGNDLCMQIQADVLGVDVVRP VVAETTALGVAYAAGLAVGFWAAPSDLRANWREDKRWTPTWDDDERAAGYAGWRKAVQ RTLDWVDVS" misc_feature complement(4138559..4138621) /gene="glpK" /locus_tag="Rv3696c" /note="PS00445 FGGY family of carbohydrate kinases signature 2." misc_feature complement(4138931..4138966) /gene="glpK" /locus_tag="Rv3696c" /note="PS00070 Aldehyde dehydrogenases cysteine active site." misc_feature complement(4139282..4139320) /gene="glpK" /locus_tag="Rv3696c" /note="PS00933 FGGY family of carbohydrate kinases signature 1." gene complement(4139805..4140242) /locus_tag="Rv3697c" /db_xref="GeneID:885492" CDS complement(4139805..4140242) /locus_tag="Rv3697c" /function="UNKNOWN" /note="Rv3697c, (MTV025.045c), len: 145 aa. Possible conserved membrane protein, similar to many proteins from Mycobacterium tuberculosis e.g. Q10800|YS72_MYCTU|Rv2872|MT2939|MTCY274.03 (147 aa) FASTA scores: opt: 223, E(): 7.3e-08, (32.6% identity in 141 aa overlap); O53501|Rv2103c|MTV020.03 (144 aa), FASTA scores: opt: 215, E(): 2.4e-07, (31.4% identity in 137 aa overlap); O53812|Rv0749|MTV041.23 (142 aa), FASTA scores: opt: 192, E(): 7.6e-06, (31.25% identity in 144 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218214.1" /db_xref="GI:15610833" /db_xref="GeneID:885492" /translation="MSETFDVDVLVHATHRASPFHDKAKTLVERFLAGPGLVYLLWPV ALGYLRVVTHPTLLGAPLAPEVAVENIEQFTSRPHVRQVGEANGFWPVYRRVADPVKP RGNLVPDAHLVALMRHHGIATIWSHDRDFRKFEGIRIRDPFSG" gene 4140493..4142022 /locus_tag="Rv3698" /db_xref="GeneID:885565" CDS 4140493..4142022 /locus_tag="Rv3698" /function="UNKNOWN" /note="Rv3698, (MTV025.046), len: 509 aa. Conserved hypothetical protein, highly similar to Q9AK89|SC10A9.15c CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (505 aa), FASTA scores: opt: 1720, E(): 9e-103, (53.65% identity in 494 aa overlap). N-terminal end highly similar to CAC42136|SCBAC25F8.01 CONSERVED HYPOTHETICAL PROTEIN (FRAGMENT) from Streptomyces coelicolor (291 aa), FASTA scores: opt: 1078, E(): 8.7e-62, (52.6% identity in 291 aa overlap); and C-terminus highly similar to CAC44687|SCBAC17A6.42c (235 aa), FASTA scores: opt: 911, E(): 3.8e-51, (57.25% identity in 234 aa overlap). TBparse score is 0.934." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218215.1" /db_xref="GI:15610834" /db_xref="GeneID:885565" /translation="MRTISPFLRCRHETCCISNVGEEVTRTTYSREHQREYRRKVRLC LDVFETMLAQTRFEADRPLTGMEIECNLVDADYQPAMSNRYVLDAIADPAYQTELGAY NIEFNVPPRPLPGRTCLELEDEVRASLNDAETKASCSGAHIVMIGILPTLMPEHLTDG WMSASARYAALNESIFKARGEDIPINIAGPEPLSCHAGSIAPESACTSVQLHLQLAPA DFPANWNAAQVLAGPQLALGANSPYFFGHQLWSETRIELFTQSTDARPEELKSRGVRP RVWFGERWITSVLDLFQENIRYFPTLLPEVSDEDPLAELSAGRIPHLSELRLHNGTVY RWNRPVYDVVDGRPHLRLENRVLPAGPTVVDMLANHAFYYGALRGLSEADPPLWTQMN FAAAQANFLAAARYGMDAQLDWPGLGEVTTRELVLGTLLPMAHEGLRRWGVDAEVRDR FLGVIGGRAQTGRNGARWQVATVAALQDGGLTRPAALAEMLRRYCEHMHSNEPVHTWD T" gene 4142044..4142745 /locus_tag="Rv3699" /db_xref="GeneID:885779" CDS 4142044..4142745 /locus_tag="Rv3699" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3699, (MTV025.047), len: 233 aa. Conserved hypothetical protein, showing similarity with hypothetical proteins e.g. Q9P3V6|SPAC1348.04 (alias Q9P3E7|SPAC750.03c or Q9P7U5|SPAC977.03) from Schizosaccharomyces pombe (Fission yeast) (145 aa), FASTA scores: opt: 188, E(): 7.5e-05, (31.65% identity in 120 aa overlap); and Q9KB70|BH2058 from Bacillus halodurans (241 aa) FASTA scores: opt: 185, E(): 0.00018, (27.8% identity in 162 aa overlap); Q9XA90|SCF43A.25c PUTATIVE METHYLTRANSFERASE from Streptomyces coelicolor (215 aa), FASTA scores: opt: 166, E(): 0.0025, (29.95% identity in 147 aa overlap); etc. Also highly similar to O06426|Rv0560c|MTCY25D10.39c HYPOTHETICAL 25.9 KDA PROTEIN from Mycobacterium tuberculosis (241 aa), FASTA scores: opt: 690, E(): 6.5e-36, (53.4% identity in 234 aa overlap); and similar to other hypothetical proteins from Mycobacterium tuberculosis e.g. P71805|Rv1377c|MTCY02B12.11c (212 aa) FASTA scores: opt: 378, E(): 1.5e-16, (35.4% identity in 192 aa overlap); P71972|Rv2675c|MTCY441.44c (250 aa) FASTA scores: opt: 297, E(): 2e-11, (31.1% identity in 193 aa overlap); etc. TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218216.1" /db_xref="GI:15610835" /db_xref="GeneID:885779" /translation="MTDEVMDWDSAYREQGAFEGPPPWNIGEPQPELATLIAAGKVRS DVLDAGCGYAELSLALAADGYTVVGIDLTPTAVAAATKAAEERGLTTASFVQADITEF AAYPAGSAGRFSTVIDSTLFHSLPVDSRDRYLSSVHRAAAPGASYYVLVFAKGAFPAE LEVKPNEVDEDELRAAVSKYWKIDEIRPAFIHVNPVTIPPQLAGAPVEFPPYDHDEKG RVKFPAYLLTAHKAG" gene complement(4142748..4143920) /locus_tag="Rv3700c" /db_xref="GeneID:885161" CDS complement(4142748..4143920) /locus_tag="Rv3700c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3700c, (MTV025.048c), len: 390 aa. Conserved hypothetical protein; could be a transferase (EC 2.-.-.-) or a lyase (EC 4.-.-.-). Indeed, similar to various enzymes e.g. Q53824|CAC CAPREOMYCIN ACETYLTRANSFERASE from Streptomyces capreolus (359 aa), FASTA scores: opt: 338, E(): 1.1e-12, (33.35% identity in 363 aa overlap); Q9HXX3|CSD_PSEAE|PA3667 PROBABLE CYSTEINE DESULFURASE (EC 4.4.1.-) from Pseudomonas aeruginosa (401 aa) FASTA scores: opt: 260, E(): 4.8e-08, (30.2% identity in 404 aa overlap); Q9X815|SC6G10.30 PUTATIVE AMINOTRANSFERASE from Streptomyces coelicolor (460 aa), FASTA scores: opt: 243, E(): 5.4e-07, (29.15% identity in 374 aa overlap); Q9A761|CC1865 AMINOTRANSFERASE CLASS V from Caulobacter crescentus (379 aa), FASTA scores: opt: 234, E(): 1.6e-06, (27.95% identity in 383 aa overlap); O74351|NFS1_SCHPO|SPBC21D10.11c PROBABLE CYSTEINE DESULFURASE from Schizosaccharomyces pombe (Fission yeast) (498 aa), FASTA scores: opt: 232, E(): 2.5e-06, (29.1% identity in 285 aa overlap); Q9RME8|NIFS NIFS PROTEIN (CYSTEINE DESULFURASE, TRNA SPLICING PROTEIN) from Zymomonas mobilis (370 aa), FASTA scores: opt: 230, E(): 2.6e-06, (32.85% identity in 201 aa overlap); etc. Contains PS00626 Regulator of chromosome condensation (RCC1) signature 2. TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218217.1" /db_xref="GI:15610836" /db_xref="GeneID:885161" /translation="MRRSGANSPAGDSLADRWRAARPPVAGLHLDSAACSRQSFAALD AAAQHARHEAEVGGYVAAEAAAAVLDAGRAAVAALSGLPDAEVVFTTGSLHALDLLLG SWPGENRTLACLPGEYGPNLAVMAAHGFDVRPLPTLQDGRVALDDAAFMLADDPPDLV HLTVVASHRGVAQPLAMVAQLCTELKLPLVVDAAQGLGHVDCAVGADVTYASSRKWIA GPRGVGVLAVRPELMERLRARLPAPDWMPPLTVAQQLGFGEANVAARVGFSVALGEHL ACGPQAIRARLAELGDIARTVLADVSGWRVVEAVDEPSAITTLAPIDGADPAAVRAWL LSQRRIVTTYAGVERAPLELPAPVLRISPHVDNTADDLDAFAEALVAATAATSGER" misc_feature complement(4143624..4143656) /locus_tag="Rv3700c" /note="PS00626 Regulator of chromosome condensation (RCC1) signature 2" gene complement(4143951..4144916) /locus_tag="Rv3701c" /db_xref="GeneID:885521" CDS complement(4143951..4144916) /locus_tag="Rv3701c" /function="UNKNOWN" /note="Rv3701c, (MTV025.049c), len: 321 aa. Conserved hypothetical protein, highly similar to other hypothetical proteins e.g. Q9RCZ8|SCM1.46 from Streptomyces coelicolor (251 aa), FASTA scores: opt: 897, E(): 1.1e-50, (59.9% identity in 242 aa overlap); P73759|SLR0865 from Synechocystis sp. strain PCC 6803 (337 aa), FASTA scores: opt: 779, E(): 5.7e-43, (40.35% identity in 327 aa overlap); Q9GWA1|LM12.997 from Leishmania major (383 aa) FASTA scores: opt: 616, E(): 2.1e-32, (39.05% identity in 297 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218218.1" /db_xref="GI:15610837" /db_xref="GeneID:885521" /translation="MRVSVANHLGEDAGHLALRRDVYSGLQKTPKSLPPKWFYDTVGS ELFDQITRLPEYYPTRAEAEILRARSAEVASACRADTLVELGSGTSEKTRMLLDALRH RGSLRRFVPFDVDASVLSATATAIQREYSGVEINAVCGDFEEHLTEIPRGGRRLFVFL GSTIGNLTPGPRAQFLTALAGVMRPGDSLLLGTDLVKDAARLVRAYDDPGGVTAQFNR NVLAVINRELEADFDVDAFQHVARWNSAEERIEMWLRADGRQRVRVGALDLTVDFDAG EEMLTEVSCKFRPQAVGAELAAAGLHRIRWWTDEAGDFGLSLAAK" gene complement(4144913..4145614) /locus_tag="Rv3702c" /db_xref="GeneID:885224" CDS complement(4144913..4145614) /locus_tag="Rv3702c" /function="UNKNOWN" /note="Rv3702c, (MTV025.050c), len: 233 aa. Conserved hypothetical protein, highly similar to other hypothetical proteins Q9RCZ9|SCM1.45 from Streptomyces coelicolor (271 aa), FASTA scores: opt: 383, E(): 2.3e-17, (44.85% identity in 252 aa overlap); and P54004|Y199_SYNY3|SLR0199 from Synechocystis sp. strain PCC 6803 (304 aa), FASTA scores: opt: 292, E(): 1.7e-11, (30.05% identity in 263 aa overlap); and similar to others e.g. Q9KMU4|VCA0225 from Vibrio cholerae (254 aa), FASTA scores: opt: 260, E(): 1.6e-09, (29.8% identity in 245 aa overlap). Equivalent to AAK48172 from Mycobacterium tuberculosis strain CDC1551 (194 aa) but longer 39 aa. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218219.1" /db_xref="GI:15610838" /db_xref="GeneID:885224" /translation="MCRHLGWLGAQVAVSSLVLDPPQGLRVQSYAPRRQKHGLMNADG WGVGFFDGAIPRRWRSPAPLWGDTSFHSVAPALRSHCILAAVRSATVGMPIEVSATPP FTDGHWLLAHNGVVDRAVLPAGPAAESVCDSAILAATIFAHGLDALGDTIVKVGAADP NARLNILAANGSRLIATTWGDTLSILRRADGVVLASEPYDDDSGWGDVPDRHLVEVTQ KGVTLTALDRAKGPR" gene complement(4145614..4146891) /locus_tag="Rv3703c" /db_xref="GeneID:885128" CDS complement(4145614..4146891) /locus_tag="Rv3703c" /function="UNKNOWN" /note="Rv3703c, (MTV025.051c), len: 425 aa. Conserved hypothetical protein, similar to other hypothetical proteins e.g. Q9RD00|SCM1.44 from Streptomyces coelicolor (446 aa), FASTA scores: opt: 1480, E(): 1.4e-85, (53.9% identity in 421 aa overlap); P72841|SLR1303 from Synechocystis sp. strain PCC 6803 (410 aa), FASTA scores: opt: 533, E(): 4.5e-26, (36.6% identity in 429 aa overlap); Q9KYH7|SCC61A.16 from Streptomyces coelicolor (256 aa), FASTA scores: opt: 266, E(): 1.9e-09, (32.25% identity in 248 aa overlap); etc. Also similar to P95060|Rv0712|MTCY210.31 HYPOTHETICAL 32.7 KDA PROTEIN from Mycobacterium tuberculosis (299 aa), FASTA scores: opt: 243, E(): 5.9e-08, (30.6% identity in 304 aa overlap). TBparse score is 0.908." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218220.1" /db_xref="GI:15610839" /db_xref="GeneID:885128" /translation="MTSPEQLACHLARARARTLRLVDFDDAELCCQYDPLMSPLVWDL AHIGQQEELWLLRGGDPGQPGLLPPAVEGLYDAFEHSRASRVELPLLSPARARSYCAT VRSAALDALAALPEDGDSFVFAMVISHENQHDETMLQALNLRTGSPLLAATSALPAGR PRMAGTSVLVAGGPFVLGVDAADEPCSLDNERPAHVVDVPAFRIGRVPVTNGEWQDFI DDGGYTQSRWWSERGWQHRQRAGLTAPQFWRSGGRTRTRFGHVEDIPADEPVQHVSYF EAEAYAAWAGARLPTEVEWEKACAWDPATGSRRRYPWGTEEPTDTYANLGGQTLRPAP VGAYPAGASACGAEQMLGDVWEWTTSPLRPWPGFVPMVYERYSQPFFGGDYRVLRGGS WAVEPAILRPSFRNWDHPYRRQIFAGVRLAWDI" gene complement(4146888..4148186) /gene="gshA" /locus_tag="Rv3704c" /db_xref="GeneID:885053" CDS complement(4146888..4148186) /gene="gshA" /locus_tag="Rv3704c" /EC_number="6.3.2.2" /function="INVOLVED IN GLUTATHIONE BIOSYNTHESIS (AT THE FIRST STEP) [CATALYTIC ACTIVITY: ATP + L-GLUTAMATE + L-CYSTEINE = ADP + ORTHOPHOSPHATE + GAMMA-L-GLUTAMYL-L-CYSTEINE]." /note="Rv3704c, (MTV025.052c), len: 432 aa. Possible gshA, glutamate--cysteine ligase (EC 6.3.2.2), similar to many e.g. Q9A2Z2|CC3414 GLUTAMATE--CYSTEINE LIGASE from Caulobacter crescentus (453 aa), FASTA scores: opt: 404, E(): 5.9e-17, (30.45% identity in 312 aa overlap); Q9SEH0|GSH1 GAMMA-GLUTAMYLCYSTEINYL SYNTHETASE PRECURSOR from Pisum sativum (Garden pea) (499 aa), FASTA scores: opt: 400, E(): 1.1e-16, (26.4% identity in 439 aa overlap); Q9RH09|GSH GAMMA-GLUTAMYLCYSTEINE SYNTHETASE from Zymomonas mobilis (462 aa), FASTA scores: opt: 397, E(): 1.6e-16, (28.95% identity in 304 aa overlap); P46309|GSH1_ARATH|GSH1|AT4G23100|F7H19.290 GLUTAMATE--CYSTEINE LIGASE from Arabidopsis thaliana (Mouse-ear cress) (522 aa), FASTA scores: opt: 395, E(): 2.3e-16, (27.25% identity in 385 aa overlap); etc. But note that this putative protein is also similar to Q9JMV4|GSHA PUTATIVE GLUTATHIONE SYNTHETASE (FRAGMENT) from Bradyrhizobium japonicum (460 aa), FASTA scores: opt: 498, E(): 1.3e-22, (33.35% identity in 333 aa overlap) (no significant publications found (August 2001)). TBparse score is 0.898." /codon_start=1 /transl_table=11 /product="glutamate--cysteine ligase gshA (gamma-glutamylcysteine synthetase) (gamma-ECS) (GCS) (gamma-glutamyl-L-cysteine synthetase)" /protein_id="NP_218221.1" /db_xref="GI:15610840" /db_xref="GeneID:885053" /translation="MTLAAMTAAASQLDNAAPDDVEITDSSAAAEYIADGCLVDGPLG RVGLEMEAHCFDPADPFRRPSWEEITEVLEWLSPLPGGSVVSVEPGGAVELSGPPADG VLAAIGAMTRDQAVLRSALANAGLGLVFLGADPLRSPVRVNPGARYRAMEQFFAASHS GVPGAAMMTSTAAIQVNLDAGPQEGWAERVRLAHALGPTMIAIAANSPMLGGRFSGWQ STRQRVWGQMDSARCGPILGASGDHPGIDWAKYALKAPVMMVRSPDTQDTRAVTDYVP FTDWVDGRVLLDGRRATVADLVYHLTTLFPPVRPRQWLEIRYLDSVPDEVWPAVVFTL VTLLDDPVAADLAVDAVEPVATAWDTAARIGLADRRLYLAANRCLAIAARRVPTELIG AMQRLVDHVDRGVCPADDFSDRVIAGGIASAVTGMMHGAS" gene complement(4148318..4148962) /locus_tag="Rv3705c" /db_xref="GeneID:885229" CDS complement(4148318..4148962) /locus_tag="Rv3705c" /function="UNKNOWN" /note="Rv3705c, (MTV025.053c), len: 214 aa. Conserved hypothetical protein, equivalent to Q9CB80|ML2320 HYPOTHETICAL PROTEIN from Mycobacterium leprae (215 aa) FASTA scores: opt: 1145, E(): 5.9e-68, (79.45% identity in 214 aa overlap). Some similarity to the C-terminal end of Q11053|PKNH_MYCTU|Rv1266c|MT1304|MTCY50.16 PROBABLE SERINE/THREONINE-PROTEIN from Mycobacterium tuberculosis (626 aa), FASTA scores: opt: 175, E(): 0.0005, (24.9% identity in 201 aa overlap); and to the N-terminal end of P23903|E13B_BACCI|GLCA GLUCAN ENDO-1,3-BETA-GLUCOSIDASE A1 PRECURSOR from Bacillus circulans (682 aa), FASTA scores: opt: 122, E(): 1.6, (25.6% identity in 164 aa overlap). TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218222.1" /db_xref="GI:15610841" /db_xref="GeneID:885229" /translation="MRIAAAVVSIGLAVIAGFAVPVADAHPSEPGVVSYAVLGKGSVG NIVGAPMGWEAVFTRPFQAFWVELPACNNWVDIGLPEVYDDPDLASFNGATTQTSATD QTHLVKQAVGVFASNDAADRAFHRVVDRTVGCSGQTTAIHLDDGTTQVWSFAGGPSTG TDEAWTKQEAGTDRRCFVQTRLRENVLLQAKVCQSGNAGPAVNVLAGAMQNTLG" gene complement(4149091..4149480) /locus_tag="Rv3705A" /db_xref="GeneID:3205111" CDS complement(4149091..4149480) /locus_tag="Rv3705A" /function="UNKNOWN" /note="Rv3705A, len: 129 aa. Conserved hypothetical protein, similar to downstream ORF O69674|Rv3706c|MTV025.054c CONSERVED HYPOTHETICAL PROLINE RICH PROTEIN from Mycobacterium tuberculosis (106 aa), FASTA scores: opt: 245, E(): 0.00013, (40.7% identity in 113 aa overlap)." /codon_start=1 /transl_table=11 /product="proline rich protein" /protein_id="YP_178006.1" /db_xref="GI:57117146" /db_xref="GeneID:3205111" /translation="MTETPQPAAPPPSAATTSPPPSPQQEKPPRLYRAAAWVVIVAGI VFTVAVIFFSGALVLGQGKCPYHRYYHHGMFRPVGPVAPGPGMGWVFGFPGGPPPPGM GPGFPGGPGGPAVGPTGPGPTTAPARP" gene complement(4149591..4149911) /locus_tag="Rv3706c" /db_xref="GeneID:885222" CDS complement(4149591..4149911) /locus_tag="Rv3706c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3706c, (MTV025.054c), len: 106 aa. Conserved ypothetical pro-rich protein, similar to upstream ORF Rv3705A (129 aa), and AAK48176|MT3808.1 HYPOTHETICAL 13.0 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (129 aa), FASTA scores: opt: 245, E(): 4.4e-06, (40.7% identity in 113 aa overlap)." /codon_start=1 /transl_table=11 /product="proline rich protein" /protein_id="NP_218223.1" /db_xref="GI:15610842" /db_xref="GeneID:885222" /translation="MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFT GYILGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPA TPAP" gene complement(4150030..4151040) /locus_tag="Rv3707c" /db_xref="GeneID:885587" CDS complement(4150030..4151040) /locus_tag="Rv3707c" /function="UNKNOWN" /note="Rv3707c, (MTV025.055c), len: 336 aa. Equivalent to Q9CB79|ML2321 HYPOTHETICAL PROTEIN from Mycobacterium leprae (336 aa), FASTA scores: opt: 1948, E(): 6.7e-110, (81.95% identity in 332 aa overlap); and P41402|YASD_MYCSM HYPOTHETICAL 35.9 KDA PROTEIN IN THE ASPARTOKINASE GENE CLUSTER from Mycobacterium smegmatis (333 aa), FASTA scores: opt: 1731, E(): 7.4e-97, (70.85% identity in 333 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218224.1" /db_xref="GI:15610843" /db_xref="GeneID:885587" /translation="MLRIGPTAGTGTPTGDYGIGATDLCEFVEFPSQLLQVCGDSFAG QGVGFGGWYAPVALHVDTESIDDPAGVRYTGVTGVGTPLLADPTPPGDSQLPAGVVQI NRRNYLMVTTTKDLQPQNSRLVRAEAARGGWQTVSGSRRNAAYQDGRQTQISGYYDPV PTPDSPTGWVYIVADSFTRGEPAVLYRATPESFTDRSRWQGWAGGPDGGWNKPPTPLW PDQLGEMSIRQIDGQTVLSYFNASTGNMEVRVAHHPTSLGAAPVTTVVRHDEWPEPAE SLPPPYDNRLAQPYGGYISPGSTIDELRIFVSQWDTRARQNGPYRVIQFAVNPFKPWS DP" gene complement(4151180..4152217) /gene="asd" /locus_tag="Rv3708c" /db_xref="GeneID:885118" CDS complement(4151180..4152217) /gene="asd" /locus_tag="Rv3708c" /EC_number="1.2.1.11" /function="INVOLVED AT THE SECOND STEP IN THE COMMON BIOSYNTHETIC PATHWAY LEADING FROM ASP TO THE CELL WALL PRECURSOR MESO-DIAMINOPIMELATE, TO LYS, TO MET, TO ILE AND TO THR [CATALYTIC ACTIVITY: L-ASPARTATE-SEMIALDEHYDE + ORTHOPHOSPHATE + NADP(+) = L-ASPARTYL PHOSPHATE + NADPH]." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 4-aspartyl phosphate from aspartate 4-semialdehyde" /codon_start=1 /transl_table=11 /product="aspartate-semialdehyde dehydrogenase" /protein_id="NP_218225.1" /db_xref="GI:15610844" /db_xref="GeneID:885118" /translation="MGLSIGIVGATGQVGQVMRTLLDERDFPASAVRFFASARSQGRK LAFRGQEIEVEDAETADPSGLDIALFSAGSAMSKVQAPRFAAAGVTVIDNSSAWRKDP DVPLVVSEVNFERDAHRRPKGIIANPNCTTMAAMPVLKVLHDEARLVRLVVSSYQAVS GSGLAGVAELAEQARAVIGGAEQLVYDGGALEFPPPNTYVAPIAFNVVPLAGSLVDDG SGETDEDQKLRFESRKILGIPDLLVSGTCVRVPVFTGHSLSINAEFAQPLSPERAREL LDGATGVQLVDVPTPLAAAGVDESLVGRIRRDPGVPDGRGLALFVSGDNLRKGAALNT IQIAELLTADL" misc_feature complement(4151447..4151491) /gene="asd" /locus_tag="Rv3708c" /note="PS01103 Aspartate-semialdehyde dehydrogenase signature" gene complement(4152218..4153483) /gene="ask" /locus_tag="Rv3709c" /db_xref="GeneID:885223" CDS complement(4152218..4153483) /gene="ask" /locus_tag="Rv3709c" /EC_number="2.7.2.4" /function="INVOLVED AT THE FIRST STEP IN THE COMMON BIOSYNTHETIC PATHWAY LEADING FROM ASP TO THE CELL WALL PRECURSOR MESO-DIAMINOPIMELATE, TO LYS, TO MET, TO ILE AND TO THR [CATALYTIC ACTIVITY: ATP + L-ASPARTATE = ADP + 4-PHOSPHO-L-ASPARTATE]. POSSIBLY ACTS IN TETRAMER CONFIGURATION, TETRAMER CONSISTING OF TWO ALPHA (CATALYTIC ACTIVITY) AND TWO BETA (FUNCTION NOT KNOWN) CHAINS." /experiment="experimental evidence, no additional details recorded" /note="catalyzes the formation of 4-phospho-L-aspartate from L-aspartate and ATP, in Bacillus, lysine sensitive; regulated by response to starvation." /codon_start=1 /transl_table=11 /product="aspartate kinase" /protein_id="NP_218226.1" /db_xref="GI:15610845" /db_xref="GeneID:885223" /translation="MALVVQKYGGSSVADAERIRRVAERIVATKKQGNDVVVVVSAMG DTTDDLLDLAQQVCPAPPPRELDMLLTAGERISNALVAMAIESLGAHARSFTGSQAGV ITTGTHGNAKIIDVTPGRLQTALEEGRVVLVAGFQGVSQDTKDVTTLGRGGSDTTAVA MAAALGADVCEIYTDVDGIFSADPRIVRNARKLDTVTFEEMLEMAACGAKVLMLRCVE YARRHNIPVHVRSSYSDRPGTVVVGSIKDVPMEDPILTGVAHDRSEAKVTIVGLPDIP GYAAKVFRAVADADVNIDMVLQNVSKVEDGKTDITFTCSRDVGPAAVEKLDSLRNEIG FSQLLYDDHIGKVSLIGAGMRSHPGVTATFCEALAAVGVNIELISTSEIRISVLCRDT ELDKAVVALHEAFGLGGDEEATVYAGTGR" misc_feature complement(4153445..4153471) /gene="ask" /locus_tag="Rv3709c" /note="PS00324 Aspartokinase signature" gene 4153860..4155674 /gene="leuA" /locus_tag="Rv3710" /db_xref="GeneID:885092" CDS 4153860..4155674 /gene="leuA" /locus_tag="Rv3710" /EC_number="2.3.3.13" /function="INVOLVED IN LEUCINE BIOSYNTHESIS (AT THE FIRST STEP). CATALYZES CONDENSATION OF ACETYL-CoA AND 2-OXOISOVALERATE TO FORM 2-ISOPROPYLMALATE SYNTHASE [CATALYTIC ACTIVITY: 3-CARBOXY-3-HYDROXY-4-METHYLPENTANOATE + CoA = ACETYL-COA + 3-METHYL-2-OXOBUTANOATE + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Rv3710, (MTV025.058), len: 644 aa. leuA, alpha-isopropylmalate synthase (EC 4.1.3.12) (see citations below), equivalent to Q9CB76|LEUA|ML2324 2-ISOPROPYLMALATE SYNTHASE from Mycobacterium leprae (607 aa), FASTA scores: opt: 3291, E(): 3.7e-192, (80.7% identity in 642 aa overlap). Also highly similar to many e.g. P42455|LEU1_CORGL|LEUA from Corynebacterium glutamicum (Brevibacterium flavum) (616 aa), FASTA scores: opt: 2547, E(): 5.3e-147, (63.25% identity in 645 aa overlap); O31046|LEU1_STRCO|LEUA from Streptomyces coelicolor (573 aa), FASTA scores: opt: 2226, E(): 1.5e-127, (57.8% identity in 616 aa overlap); BAB49833|Q98HN3|MLR2792 from Rhizobium loti (Mesorhizobium loti) (588 aa), FASTA scores: opt: 1849, E(): 1.1e-104, (58.0% identity in 536 aa overlap); etc. Equivalent to AAK48181 from Mycobacterium tuberculosis strain CDC1551 (659 aa) but shorter 15 aa. Contains PS00815 and PS00816 Alpha-isopropylmalate and homocitrate synthases signatures 1 and 2. BELONGS TO THE ALPHA-IPM SYNTHETASE / HOMOCITRATE SYNTHASE FAMILY." /codon_start=1 /transl_table=11 /product="2-isopropylmalate synthase" /protein_id="NP_218227.2" /db_xref="GI:161352459" /db_xref="GeneID:885092" /translation="MPVNRYRPFAEEVEPIRLRNRTWPDRVIDRAPLWCAVDLRDGNQ ALIDPMSPARKRRMFDLLVRMGYKEIEVGFPSASQTDFDFVREIIEQGAIPDDVTIQV LTQCRPELIERTFQACSGAPRAIVHFYNSTSILQRRVVFRANRAEVQAIATDGARKCV EQAAKYPGTQWRFEYSPESYTGTELEYAKQVCDAVGEVIAPTPERPIIFNLPATVEMT TPNVYADSIEWMSRNLANRESVILSLHPHNDRGTAVAAAELGFAAGADRIEGCLFGNG ERTGNVCLVTLGLNLFSRGVDPQIDFSNIDEIRRTVEYCNQLPVHERHPYGGDLVYTA FSGSHQDAINKGLDAMKLDADAADCDVDDMLWQVPYLPIDPRDVGRTYEAVIRVNSQS GKGGVAYIMKTDHGLSLPRRLQIEFSQVIQKIAEGTAGEGGEVSPKEMWDAFAEEYLA PVRPLERIRQHVDAADDDGGTTSITATVKINGVETEISGSGNGPLAAFVHALADVGFD VAVLDYYEHAMSAGDDAQAAAYVEASVTIASPAQPGEAGRHASDPVTIASPAQPGEAG RHASDPVTSKTVWGVGIAPSITTASLRAVVSAVNRAAR" misc_feature 4153974..4154024 /gene="leuA" /locus_tag="Rv3710" /note="PS00815 Alpha-isopropylmalate and homocitrate synthases signature 1" misc_feature 4154583..4154624 /gene="leuA" /locus_tag="Rv3710" /note="PS00816 Alpha-isopropylmalate and homocitrate synthases signature 2" gene complement(4155740..4156729) /gene="dnaQ" /locus_tag="Rv3711c" /db_xref="GeneID:885088" CDS complement(4155740..4156729) /gene="dnaQ" /locus_tag="Rv3711c" /EC_number="2.7.7.7" /function="DNA POLYMERASE III IS A COMPLEX, MULTICHAIN ENZYME RESPONSIBLE FOR MOST OF THE REPLICATIVE SYNTHESIS IN BACTERIA. THE EPSILON SUBUNIT CONTAIN THE EDITING FUNCTION AND IS A PROOFREADING 3'-5' EXONUCLEASE [CATALYTIC ACTIVITY: N DEOXYNUCLEOSIDE TRIPHOSPHATE = N DIPHOSPHATE + {DNA}(N)]." /note="3'-5' exonuclease of DNA polymerase III" /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit epsilon" /protein_id="NP_218228.1" /db_xref="GI:15610847" /db_xref="GeneID:885088" /translation="MSHTWGRPASHQDRGWAVIDVETSGFRPGQARIISLAVLGLDAA GRLEQSVVSLLNPKVDPGPTHVHGLTAAMLDGQPQFADIAGEVVDVLRGRTLVAHNVA FDYAFLAAEAEIAEAELPVDFVMCTVELARRLQLGVDNLRLETLAAHWGVPQQRPHDA FDDVRVLTGILAAALESARELDVWLPVHPVTRRRWPNGRVTHDELRPLKAVAARMACP YLNPGRYVQGRPLVQGMRVGLAAEVKRTHEELVERILHAGLAYSDVVDRDTSLVVCNA TAPEHGKGYHALQLGVPVMPEARFMECIGAVVGGASVEDFTDVAPVEKQLALF" gene 4156981..4158222 /locus_tag="Rv3712" /db_xref="GeneID:885228" CDS 4156981..4158222 /locus_tag="Rv3712" /EC_number="6.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3712, (MTV025.060), len: 413 aa. Possible ligase (EC 6.-.-.-), equivalent to O69522|ML2326|MLCB2407.24c HYPOTHETICAL 43.8 KDA PROTEIN (POSSIBLE LIGASE) from Mycobacterium leprae (411 aa), FASTA scores: opt: 2265, E(): 8e-129, (84.25% identity in 413 aa overlap). Also similar to ligases or hypothetical proteins e.g. Q9FCA1|2SCG58.12 PUTATIVE LIGASE from Streptomyces coelicolor (412 aa), FASTA scores: opt: 1168, E(): 6.7e-63, (45.8% identity in 406 aa overlap); P74303|SLR0938 HYPOTHETICAL 50.2 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (459 aa), FASTA scores: opt: 392, E(): 3.1e-16, (28.45% identity in 397 aa overlap); Q99ZX1|SPY1035 PUTATIVE UDP-N-ACETYLMURAMYL TRIPEPTIDE SYNTHETASE (EC 6.3.2.13) from Streptococcus pyogenes (445 aa), FASTA scores: opt: 335, E(): 8.1e-13, (29.2% identity in 438 aa overlap); Q9CGJ0|YLBD HYPOTHETICAL PROTEIN from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (449 aa), FASTA scores: opt: 324, E(): 3.8e-12, (28.75% identity in 445 aa overlap); Q9ZGG7|MURC UDP-N-ACETYLMURAMYL TRIPEPTIDE SYNTHETASE from Heliobacillus mobilis (455 aa), FASTA scores: opt: 292, E(): 3.2e-10, (30.75% identity in 449 aa overlap); etc. TBparse score is 0.874." /codon_start=1 /transl_table=11 /product="ligase" /protein_id="NP_218229.1" /db_xref="GI:15610848" /db_xref="GeneID:885228" /translation="MVTTRARLALAAGAGARWASRVTGRGAGAMIGGLVAMTLDRSIL RQLGMGRRTVVVTGTNGKSTTTRMTAAALGTLGAVATNAEGANMDAGLVAALAAHRDA ELAVLEVDEMHVPHISDAVDPAVVVLLNLSRDQLDRVGEINVIERTLRAGLARHPDAV VVANCDDVLMTSAAYDSPNVVWVAAGGAWSNDSVSCPRSGEVIVRKAPSQEDHWYSTG ADFKRPAPHWWFDDATLYGPDGLALPMRLALPGSVNRGNAAQAVAAAVALGADPAVAV AAVCQVDEVAGRYRTVRIGAHQARILLAKNPAGWQEALAMVDKHADGVVIAVNGRVPD GEDLSWLWDVRFEHFEKTRVVAAGERGTDLAVRLGYAGVEHTLVHDTVAAIASCPPGR VEVVANYTAFLQLQRALARRG" gene 4158227..4158922 /gene="cobQ2" /locus_tag="Rv3713" /db_xref="GeneID:885584" CDS 4158227..4158922 /gene="cobQ2" /locus_tag="Rv3713" /function="INVOLVED IN COBALAMIN BIOSYNTHESIS. CATALYZES AMIDATIONS AT POSITIONS B, D, E, AND G ON ADENOSYLCOBYRINIC A,C-DIAMIDE. NH(2) GROUPS ARE PROVIDED BY GLUTAMINE, AND ONE MOLECULE OF ATP IS HYDROGENOLYZED FOR EACH AMIDATION (BY SIMILARITY)." /note="Rv3713, (MTV025.061), len: 231 aa. Possible cobQ2, cobyric acid synthase (EC undetermined), equivalent to O69521|ML2327|MLCB2407.23c HYPOTHETICAL 24.5 KDA PROTEIN from Mycobacterium leprae (230 aa), FASTA scores: opt: 1313, E(): 4.7e-73, (86.1% identity in 230 aa overlap). Also partially similar to several cobyric acid synthases and hypothetical proteins e.g. Q9FCA0|2SCG58.13 HYPOTHETICAL 26.2 KDA PROTEIN from Streptomyces coelicolor (242 aa), FASTA scores: opt: 639, E(): 6.2e-32, (46.6% identity in 234 aa overlap); Q9ZGG8|COBQ COBYRIC ACID SYNTHASE from Heliobacillus mobilis (252 aa), FASTA scores: opt: 501, E(): 1.7e-23, (40.75% identity in 206 aa overlap); BAB58053|SAV1891 HYPOTHETICAL 27.4 KDA PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (243 aa), FASTA scores: opt: 400, E(): 2.3e-17, (35.95% identity in 217 aa overlap); Q9CGJ1|COBQ COBYRIC ACID SYNTHASE from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (261 aa), FASTA scores: opt: 353, E(): 1.8e-14, (35.3% identity in 201 aa overlap); O26880|COBQ_METTH|MTH787 PROBABLE COBYRIC ACID SYNTHASE from Methanobacterium thermoautotrophicum (504 aa), FASTA scores: opt: 201, E(): 5.6e-05, (33.35% identity in 171 aa overlap); etc. Also similar to hypothetical mycobacterial proteins O05811|COBB_MYCTU|Rv2848c|MT2914|MTCY24A1.09 (457 aa) and P71842|Rv0789c|MTCY369.33c (199 aa). SEEMS TO BELONG TO THE COBB/COBQ FAMILY, COBQ SUBFAMILY." /codon_start=1 /transl_table=11 /product="cobyric acid synthase CobQ2" /protein_id="NP_218230.1" /db_xref="GI:15610849" /db_xref="GeneID:885584" /translation="MVRIGLVLPDVMGTYGDGGNAVVLRQRLLLRGIAAEIVEITLAD PVPDSLDLYTLGGAEDYAQRLATRHLRRYPGLQRAAGRGAPVLAICAAIQVLGHWYET SSGDRVDGVGLLDVTTSPQDARTIGELVSKPLLAGLTQPLTGFENHRGGTVLGPGTSP LGAVVKGAGNRAGDGFDGAVAGSVVATYMHGPCLARNPELADLLLSKVVGELAPLDLP EVDLLRRERLSAR" gene complement(4158931..4159821) /locus_tag="Rv3714c" /db_xref="GeneID:885456" CDS complement(4158931..4159821) /locus_tag="Rv3714c" /function="UNKNOWN" /note="Rv3714c, (MTV025.062c), len: 296 aa. Conserved hypothetical protein, highly similar to O07396|MAV346 MAV346 PROTEIN from Mycobacterium avium (346 aa) FASTA scores: opt: 834, E(): 2.2e-46, (50.0% identity in 286 aa overlap); and also highly similar to several proteins from Mycobacterium tuberculosis e.g. O53421|Rv1073|MTV017.26 (283 aa), FASTA scores: opt: 869, E(): 1e-48, (51.1% identity in 270 aa overlap); P71763|Rv1482c|MTCY277.03c (339 aa), FASTA scores: opt: 775, E(): 1.3e-42, (46.35% identity in 289 aa overlap); P96837|Rv3555c|MTCY06G11.02c (289 aa), FASTA scores: opt: 733, E(): 5.9e-40, (44.15% identity in 281 aa overlap); etc. Partially similar to Q9Z512|UVRC_STRCO|SCC54.13c EXCINUCLEASE ABC SUBUNIT C from Streptomyces coelicolor (728 aa), FASTA scores: opt: 122, E(): 2.5, (27.0% identity in 174 aa overlap). Equivalent to AAK48186 from Mycobacterium tuberculosis strain CDC1551 (341 aa) but shorter 45 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218231.1" /db_xref="GI:15610850" /db_xref="GeneID:885456" /translation="MLISRMSVRSASMSVMGDVFIGSEAITAGRLTRHELQRWYQPMF RGVYVSRRSVPTLWDRTVGAWLATRRHGVIAGNAASALHGAQWVDVDVAIELISPTTR PQHGLVIRRETLCDDEITRVVGLPVTTLARTAYDLGRHLSRGEAVARLDALMRATPFS RDDVLLLAKRHAGARGVRRLRDVLPLVDGGAASPKETWLRLLLIDAGLPVPTTQIPVV HRWRNVGVLDMGWEKYMVAAEYDGDQHRSDRGRYVKDQRRLRKLAELGWIVIRVIAED NPDDVVNRVRAALLARGWRP" gene complement(4159889..4160500) /gene="recR" /locus_tag="Rv3715c" /db_xref="GeneID:885307" CDS complement(4159889..4160500) /gene="recR" /locus_tag="Rv3715c" /function="MAY PLAY A ROLE IN DNA REPAIR. IT SEEMS TO BE INVOLVED IN AN RECBC-INDEPENDENT RECOMBINATIONAL PROCESS OF DNA REPAIR. IT MAY ACT WITH RECF|Rv0003 (AND RECO|Rv2362c ?) FOR MODULATING ASSEMBLY OF DISASSEMBLY OF RECA FILAMENTS." /note="involved in a recombinational process of DNA repair, independent of the recBC complex" /codon_start=1 /transl_table=11 /product="recombination protein RecR" /protein_id="NP_218232.1" /db_xref="GI:15610851" /db_xref="GeneID:885307" /translation="MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPSDIDRLTG VLAKVRDGVRFCAVCGNVSDNERCRICSDIRRDASVVCIVEEPKDIQAVERTREFRGR YHVLGGALDPLSGIGPDQLRIRELLSRIGERVDDVDVTEVIIATDPNTEGEATATYLV RMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRRVLA" gene complement(4160512..4160913) /locus_tag="Rv3716c" /db_xref="GeneID:885595" CDS complement(4160512..4160913) /locus_tag="Rv3716c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3716c, (MTV025.064c), len: 133 aa. Conserved hypothetical protein, equivalent to O69519|Y1B6_MYCLE|ML2330|MLCB2407.20 HYPOTHETICAL 11.9 KDA PROTEIN from Mycobacterium leprae (116 aa), FASTA scores: opt: 616, E(): 2.6e-21, (84.55% identity in 110 aa overlap). Also highly similar to hypothetical 12 kDa proteins in the vicinity of recR from other bacteria e.g. Q9XAI3|YT3D_STRCO|SC66T3.30c HYPOTHETICAL 11.7 KDA PROTEIN from Streptomyces coelicolor (115 aa), FASTA scores: opt: 379, E(): 9.5e-11, (50.8% identity in 122 aa overlap); BAB56641|SAV0479 CONSERVED HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus Mu50 (105 aa) FASTA scores: opt: 295, E(): 4.9e-07, (41.75% identity in 103 aa overlap); Q99WC4P24281|YAAK_BACSU HYPOTHETICAL 11.8 KDA PROTEIN IN DNAZ-RECR INTERGENIC REGION from Bacillus subtilis (107 aa), FASTA scores: opt: 272, E(): 5.3e-06, (39.4% identity in 104 aa overlap); P17577|YBAB_ECOLI|B0471|Z0588|ECS0524 from Escherichia coli strain K and O157:H7 (109 aa), FASTA scores: opt: 256, E(): 2.8e-05, (38.0% identity in 100 aa overlap); etc. Contains probable coiled-coil domain from aa 1-40. SEEMS TO BELONG TO THE UPF0133 FAMILY. TBparse score is 0.888." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218233.1" /db_xref="GI:15610852" /db_xref="GeneID:885595" /translation="MQPGGDMSALLAQAQQMQQKLLEAQQQLANSEVHGQAGGGLVKV VVKGSGEVIGVTIDPKVVDPDDIETLQDLIVGAMRDASQQVTKMAQERLGALAGAMRP PAPPAAPPGAPGMPGMPGMPGAPGAPPVPGI" gene 4161048..4161773 /locus_tag="Rv3717" /db_xref="GeneID:885602" CDS 4161048..4161773 /locus_tag="Rv3717" /function="UNKNOWN" /note="Rv3717, (MTV025.065), len: 241 aa. Conserved hypothetical protein, equivalent to O69518|MLCB2407.19c (alias Q9CB75|ML2331 256 aa) HYPOTHETICAL 25.1 KDA PROTEIN from Mycobacterium leprae (244 aa), FASTA scores: opt: 1325, E(): 5.7e-74, (81.95% identity in 244 aa overlap). Also similar to Q9KXK7|SCC53.04 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (336 aa), FASTA scores: opt: 536, E(): 1.2e-25, (41.2% identity in 233 aa overlap); and shows similarity with C-terminal end of other proteins e.g. Q9RMZ0|PXO2-42 PXO2-42 PROTEIN from Bacillus anthracis (531 aa), FASTA scores: opt: 191, E(): 0.00022, (26.6% identity in 222 aa overlap); Q9RTX0 PUTATIVE N-ACETYLMURAMOYL-L-ALANINE AMIDASE (603 aa); Q9LCR4|CWLU CWLU PROTEIN from Paenibacillus polymyxa (Bacillus polymyxa) (524 aa), FASTA scores: opt: 141, E(): 0.24, (29.2% identity in 219 aa overlap); etc. Shows similarity with C-terminal end of O53593|CWLM|Rv3915|MTV028.06 PUTATIVE HYDROLASE from Mycobacterium tuberculosis (406 aa), FASTA scores: opt: 176, E(): 0.0014, (25.7% identity in 218 aa overlap). TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218234.1" /db_xref="GI:15610853" /db_xref="GeneID:885602" /translation="MIVGVLVAAATPIISSASATPANIAGMVVFIDPGHNGANDASIG RQVPTGRGGTKNCQASGTSTNSGYPEHTFTWETGLRLRAALNALGVRTALSRGNDNAL GPCVDERANMANALRPNAIVSLHADGGPASGRGFHVNYSAPPLNAIQAGPSVQFARIM RDQLQASGIPKANYIGQDGLYGRSDLAGLNLAQYPSILVELGNMKNPADSALMESAEG RQKYANALVRGVAGFLATQGQAR" gene complement(4161815..4162258) /locus_tag="Rv3718c" /db_xref="GeneID:885582" CDS complement(4161815..4162258) /locus_tag="Rv3718c" /function="UNKNOWN" /note="Rv3718c, (MTV025.066c), len: 147 aa. Conserved hypothetical protein, equivalent to O69517|ML2332|MLCB2407.18 HYPOTHETICAL 15.5 KDA PROTEIN from Mycobacterium leprae (145 aa), FASTA scores: opt: 780, E(): 1.4e-44, (81.95% identity in 144 aa overlap). Also highly similar to Q9ZBJ2|SC9C7.18 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (147 aa) FASTA scores: opt: 475, E(): 1.7e-24, (52.05% identity in 146 aa overlap); and showing some similarity to various proteins e.g. P27538|PR2_PETCR PATHOGENESIS-RELATED PROTEIN 2 from Petroselinum crispum (Parsley) (Petroselinum hortense) (158 aa); P92918|ALL2_APIGR MAJOR ALLERGEN API G 2 from Apium graveolens (Celery) (159 aa); etc. TBparse score is 0.891. Thought to be differentially expressed within host cells (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218235.1" /db_xref="GI:15610854" /db_xref="GeneID:885582" /translation="MGQVSAASTILINAEPTATLDALADYETVRPKILSPHYSEYQVL EGGKGRGTVAKWRLQATQSRVRDVQVNVDVAGHTVIEKDMNSSMVTNWTVAPAGPGSS VTVKTTWTGAGGVKGFFEKTFAPLGLKKIQAEVLSNLKTELEGDA" gene 4162306..4163718 /locus_tag="Rv3719" /db_xref="GeneID:885855" CDS 4162306..4163718 /locus_tag="Rv3719" /function="UNKNOWN" /note="Rv3719, (MTV025.067), len: 470 aa. Conserved hypothetical protein, equivalent to O69516|ML2333|MLCB2407.17c HYPOTHETICAL 51.8 KDA PROTEIN from Mycobacterium leprae (459 aa), FASTA scores: opt: 2593, E(): 7.8e-161, (82.75% identity in 458 aa overlap). Also some similarity to Q9CU63|5830417J06RIK HYPOTHETICAL PROTEIN (FRAGMENT) from Mus musculus (Mouse) (479 aa) FASTA scores: opt: 454, E(): 6.1e-22, (27.1% identity in 413 aa overlap); Q9HBA8 SELADIN-1 (UNKNOWN) from Homo sapiens (Human) (516 aa), FASTA scores: opt: 444, E(): 2.9e-21, (26.7% identity in 412 aa overlap); O17397|DIMH_CAEEL|F52H2.6 DIMINUTO-LIKE PROTEIN from Caenorhabditis elegans (525 aa), FASTA scores: opt: 419, E(): 1.2e-19, (24.4% identity in 434 aa overlap); Q39085|DIM_ARATH|DWF1 CELL ELONGATION PROTEIN DIMINUTO from Arabidopsis thaliana (Mouse-ear cress) (561 aa) FASTA scores: opt: 318, E(): 4.8e-13, (24.6% identity in 455 aa overlap); etc. Also some similarity to Mycobacterium tuberculosis hypothetical proteins P72056|Rv3790|MTCY13D12.24 (461 aa) FASTA scores: opt: 174, E(): 0.00016; (25.1% identity in 426 aa overlap); and Q50685|Rv2280|MTCY339_30c (459 aa). TBparse score is 0.936." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218236.1" /db_xref="GI:15610855" /db_xref="GeneID:885855" /translation="MQGQLSRTRVYTVPVPGSAQSAYACGVERLLASYRSIPATASIR LAKPTSNLFRARVKHDARGLDASGLTGVIGIDPEARTADVAGMCTYEDLIAATLHYGL SPLVVPQLRTITLGGAVTGLGIESASFRNGLPHESVLEMDILTGAGELLTVSPGQHSD LYRAFPNSYGTLGYSTRLRIQLEPVRPFVALRHIRFSSLTAMVAAMERIIDTGGLDGE SVDYLDGVVFSADESYLCIGMQTSVPGPVSDYTGQDIYYRSIQHEAGIKEDRLTIHDY FWRWDTDWFWCSRSFGAQNPRLRRWWPRRYRRSSVYWRLMALDQRFGIADRFENSRGR PARERVVQDIEVPIERTCEFLEWFGENVPISPIWLCPLRLRDHAGWPLYPIRPDRSYV NIGFWSSVPVGATEGATNRKIENKVSALDGHKSLYSDSFYTREEFDELYGGETYNTVK KAYDPDSRLLDLYAKAVQRR" gene 4163736..4164998 /locus_tag="Rv3720" /db_xref="GeneID:885219" CDS 4163736..4164998 /locus_tag="Rv3720" /EC_number="2.1.1.-" /function="UNKNOWN, BUT INVOLVED IN LIPID METABOLISM." /note="Rv3720, (MTV025.068), len: 420 aa. Possible fatty-acyl-phospholipid synthase (EC 2.1.1.-), equivalent to Q9CB74|ML2334 (alias O69515|MLCB2407.16c, 439 aa) HYPOTHETICAL PROTEIN from Mycobacterium leprae (420 aa) FASTA scores: opt: 2508, E(): 4.7e-153, (86.45% identity in 420 aa overlap). Also similar (especially at the C-terminus) to various fatty-acid synthases (principally cyclopropane-fatty-acyl-phospholipid synthases (EC 2.1.1.79)) and hypothetical proteins e.g. Q9KZ58|SCE25.32c PUTATIVE FATTY ACID SYNTHASE from Streptomyces coelicolor (438 aa), FASTA scores: opt: 1101, E(): 5.5e-63, (46.1% identity in 425 aa overlap); P31049|YLP3_PSEPU HYPOTHETICAL 44.7 KDA PROTEIN from Pseudomonas putida (394 aa), FASTA scores: opt: 810, E(): 2.1e-44, (46.4% identity in 293 aa overlap); Q9HT28|PA5546 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (394 aa), FASTA scores: opt: 804, E(): 5.2e-44, (40.7% identity in 371 aa overlap); Q9RSD7|DR2187 PUTATIVE CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE from Deinococcus radiodurans (462 aa), FASTA scores: opt: 747, E(): 2.6e-40, (35.95% identity in 409 aa overlap); BAB50831|Q98ET6|MLL4091 CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIPID SYNTHASE from Rhizobium loti (Mesorhizobium loti) (422 aa), FASTA scores: opt: 674, E(): 1.1e-35, (39.1% identity in 284 aa overlap); P30010|CFA_ECOLI|CDFA|B1661 CYCLOPROPANE-FATTY-ACYL-PHOSPHOLIP SYNTHASE from Escherichia coli strain K12 (381 aa), FASTA scores: opt: 530, E(): 1.7e-26, (33.65% identity in 312 aa overlap); etc. Also similar to other proteins from Mycobacterium tuberculosis e.g. CMA2|Rv0503c|MTCY20G9.30c (302 aa); P96911|Rv0621|MTCY20H10 (354 aa); O50416|LPQD|Rv3390|MTV004.48 (236 aa); etc." /codon_start=1 /transl_table=11 /product="fatty acid synthase" /protein_id="NP_218237.1" /db_xref="GI:15610856" /db_xref="GeneID:885219" /translation="MAEILEIFTATGQHPLKFTAYDGSTAGQDDATLGLDLRTPRGAT YLATAPGELGLARAYVSGDLQAHGVHPGDPYELLKTLTERVDFKRPSARVLANVVRSI GVEHILPIAPPPQEARPRWRRMANGLLHSKTRDAEAIHHHYDVSNNFYEWVLGPSMTY TCAVFPNAEASLEQAQENKYRLIFEKLRLEPGDRLLDVGCGWGGMVRYAARRGVRVIG ATLSAEQAKWGQKAVEDEGLSDLAQVRHSDYRDVAETGFDAVSSIGLTEHIGVKNYPF YFGFLKSKLRTGGLLLNHCITRHDNRSTSFAGGFTDRYVFPDGELTGSGRITTEIQQV GLEVLHEENFRHHYAMTLRDWCGNLVEHWDDAVAEVGLPTAKVWGLYMAASRVAFERN NLQLHHVLATKVDPRGDDSLPLRPWWQP" gene complement(4164995..4166731) /gene="dnaZX" /locus_tag="Rv3721c" /db_xref="GeneID:885361" CDS complement(4164995..4166731) /gene="dnaZX" /locus_tag="Rv3721c" /EC_number="2.7.7.7" /function="DNA POLYMERASE III IS A COMPLEX, MULTICHAIN ENZYME RESPONSIBLE FOR MOST OF THE REPLICATIVE SYNTHESIS IN BACTERIA. THIS DNA POLYMERASE ALSO EXHIBITS 3' TO 5' EXONUCLEASE ACTIVITY [CATALYTIC ACTIVITY: N DEOXYNUCLEOSIDE TRIPHOSPHATE = N PYROPHOSPHATE + DNA(N)]." /note="catalyzes the DNA-template-directed extension of the 3'-end of a DNA strand; the tau chain serves as a scaffold to help in the dimerizaton of the alpha,epsilon and theta core complex; the gamma chain seems to interact with the delta and delta' subunits to transfer the beta subunit on the DNA" /codon_start=1 /transl_table=11 /product="DNA polymerase III subunits gamma and tau" /protein_id="NP_218238.1" /db_xref="GI:15610857" /db_xref="GeneID:885361" /translation="MALYRKYRPASFAEVVGQEHVTAPLSVALDAGRINHAYLFSGPR GCGKTSSARILARSLNCAQGPTANPCGVCESCVSLAPNAPGSIDVVELDAASHGGVDD TRELRDRAFYAPVQSRYRVFIVDEAHMVTTAGFNALLKIVEEPPEHLIFIFATTEPEK VLPTIRSRTHHYPFRLLPPRTMRALLARICEQEGVVVDDAVYPLVIRAGGGSPRDTLS VLDQLLAGAADTHVTYTRALGLLGVTDVALIDDAVDALAACDAAALFGAIESVIDGGH DPRRFATDLLERFRDLIVLQSVPDAASRGVVDAPEDALDRMREQAARIGRATLTRYAE VVQAGLGEMRGATAPRLLLEVVCARLLLPSASDAESALLQRVERIETRLDMSIPAPQA VPRPSAAAAEPKHQPAREPRPVLAPTPASSEPTVAAVRSMWPTVRDKVRLRSRTTEVM LAGATVRALEDNTLVLTHESAPLARRLSEQRNADVLAEALKDALGVNWRVRCETGEPA AAASPVGGGANVATAKAVNPAPTANSTQRDEEEHMLAEAGRGDPSPRRDPEEVALELL QNELGARRIDNA" misc_feature complement(4166585..4166608) /gene="dnaZX" /locus_tag="Rv3721c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(4166821..4168128) /locus_tag="Rv3722c" /db_xref="GeneID:885321" CDS complement(4166821..4168128) /locus_tag="Rv3722c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3722c, (MTV025.070c), len: 435 aa. Conserved hypothetical protein, equivalent to O69513|MLCB2407.14 (alias Q9CB73|ML2336, 463 aa) HYPOTHETICAL 46.8 KDA PROTEIN from Mycobacterium leprae (426 aa), FASTA scores: opt: 2505, E(): 8.3e-154, (87.25% identity in 424 aa overlap). Also highly similar to Q9RU17|DR1579 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (452 aa), FASTA scores: opt: 1162, E(): 3.1e-67, (44.8% identity in 422 aa overlap); and partially similar to Q9I371|PA1654 PROBABLE AMINOTRANSFERASE from Pseudomonas aeruginosa (388 aa) FASTA scores: opt: 162, E(): 0.0078, (25.85% identity in 348 aa overlap) and other aminotransferases. TBparse score is 0.900. N-terminus extended since first submission (previously 408 aa)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218239.2" /db_xref="GI:57117147" /db_xref="GeneID:885321" /translation="MSFDSLSPQELAALHARHQQDYAALQGMKLALDLTRGKPSAEQL DLSNQLLSLPGDDYRDPEGTDTRNYGGQHGLPGLRAIFAELLGIAVPNLIAGNNSSLE LMHDIVAFSMLYGGVDSPRPWIQEQDGIKFLCPVPGYDRHFAITETMGIEMIPIPMLQ DGPDVDLIEELVAVDPAIKGMWTVPVFGNPSGVTYSWETVRRLVQMRTAAPDFRLFWD NAYAVHTLTLDFPRQVDVLGLAAKAGNPNRPYVFASTSKITFAGGGVSFFGGSLGNIA WYLQYAGKKSIGPDKVNQLRHLRFFGDADGVRLHMLRHQQILAPKFALVAEVLDQRLS ESKIASWTEPKGGYFISLDVLPGTARRTVALAKDVGIAVTEAGASFPYRKDPDDKNIR IAPSFPSVPDLRNAVDGLATCALLAATETLLNQGLASSAPNVR" gene 4168345..4168430 /locus_tag="Rvnt41" /note="tRNA-Ser(GGA)" /db_xref="GeneID:2700443" tRNA 4168345..4168430 /locus_tag="Rvnt41" /product="tRNA-Ser" /note="codon recognized: UCC" /anticodon=(pos:4168379..4168381,aa:Ser) /db_xref="GeneID:2700443" gene 4168536..4169300 /locus_tag="Rv3723" /db_xref="GeneID:885791" CDS 4168536..4169300 /locus_tag="Rv3723" /function="UNKNOWN" /note="Rv3723, (MTV025.071), len: 254 aa. Probable conserved transmembrane protein, with hydrophobic stretches at the N-terminus, and equivalent to O69512|ML2337|MLCB2407.13c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (250 aa), FASTA scores: opt: 1029, E(): 1.2e-44, (64.45% identity in 253 aa overlap). TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218240.1" /db_xref="GI:15610859" /db_xref="GeneID:885791" /translation="MGRKVAVLWHASFSIGAGVLYFYFVLPRWPELMGDTGHSLGTGL RIATGALVGLAALPVVFTLLRTRKPELGTPQLALSMRIWSIMAHVLAGALIVGTAISE VWLSLDAAGQWLFGIYGAAAAIAVLGFFGFYLSFVAELPPPPPKPLKPKKPKQRRLRR KKTAKGDEAEPEAAEEAENTELAAQEDEEAVEAPPESIESPGGEPESATREAPAAETA TAEEPRGGLRNRRPTGKTSHRRRRTRSGVQVAKVDE" gene 4169467..4169709 /gene="cut5a" /locus_tag="Rv3724A" /db_xref="GeneID:3205112" CDS 4169467..4169709 /gene="cut5a" /locus_tag="Rv3724A" /EC_number="3.1.1.-" /function="HYDROLYSIS OF CUTIN (A POLYESTER THAT FORMS THE STRUCTURE OF PLANT CUTICLE)." /note="Rv3724A, (MTV025.072), len: 80 aa. Probable cut5a, truncated cutinase precursor (EC 3.1.1.-), similar to N-terminal end of others e.g. Q9KK87 SERINE ESTERASE CUTINASE from Mycobacterium avium (220 aa), FASTA scores: opt: 202, E(): 1.5e-06, (56.45% identity in 62 aa overlap); Q9XB09|RVD2-RV1758 PROTEIN (FRAGMENT) from Mycobacterium bovis BCG (143 aa), FASTA scores: opt: 200, E(): 1.5e-06, (61.4% identity in 57 aa overlap); and Q00298|CUTI_BOTCI|CUTA CUTINASE PRECURSOR from Botrytis cinerea (Botryotinia fuckeliana) (202 aa), FASTA scores: opt: 108, E(): 2.2, (40.4% identity in 52 aa overlap). Also highly similar to others from Mycobacterium tuberculosis e.g. O06318|CUT3_MYCTU|Rv3451|MT3557|MTCY13E12.04 PROBABLE CUTINASE PRECURSOR (247 aa), FASTA scores: opt: 189, E(): 1.2e-05, (58.0% identity in 50 aa overlap); Q50664|CUT2_MYCTU|Rv2301|MT2358|MTCY339.08c PROBABLE CUTINASE PRECURSOR (219 aa), FASTA scores: opt: 172, E(): 0.00015, (59.2% identity in 49 aa overlap); O06793|Rv1758|MTCY28.24|Z95890 HYPOTHETICAL 17.9 KDA PROTEIN (174 aa), FASTA scores: opt: 641, E(): 2.7e-29, (57.2% identity in 166 aa overlap); O06319|Rv3452|MTY13E12.05; and U00015_11 from Mycobaterium leprae. BELONGS TO THE CUTINASE FAMILY. Rest of cutinase ORF continues as Rv3724B|CUT5B, frameshifting could occur near position 4169668. Sequence has been checked but no errors found." /codon_start=1 /transl_table=11 /product="cutinase precursor" /protein_id="YP_178007.1" /db_xref="GI:57117148" /db_xref="GeneID:3205112" /translation="MDVIRWARRLAVVAGTAAAVTTPGLLSAHVPMVSAEPCPDVEVV FARGTGEPPGIGSVGGLFVDALRFPGWRQVTRGLRR" gene 4169606..4170169 /gene="cut5b" /locus_tag="Rv3724B" /db_xref="GeneID:885390" CDS 4169606..4170169 /gene="cut5b" /locus_tag="Rv3724B" /EC_number="3.1.1.-" /function="HYDROLYSIS OF CUTIN (A POLYESTER THAT FORMS THE STRUCTURE OF PLANT CUTICLE)." /note="Rv3724B, (MTV025.072), len: 187 aa. Probable cut5b, truncated cutinase (EC 3.1.1.-), similar to C-terminal end of others e.g. Q9XB09|RVD2-RV1758 PROTEIN (FRAGMENT) from Mycobacterium bovis BCG (143 aa) FASTA scores: opt: 335, E(): 3.4e-12, (53.25% identity in 92 aa overlap); Q9KK87 SERINE ESTERASE CUTINASE from Mycobacterium avium (220 aa), FASTA scores: opt: 251, E(): 2.5e-07, (44.05% identity in 168 aa overlap). Also similar to proteins from Mycobacterium tuberculosis e.g. O06793|Rv1758|MTCY28.24 HYPOTHETICAL 17.9 KDA PROTEIN (174 aa), FASTA scores: opt: 641, E(): 2.5e-29, (57.25% identity in 166 aa overlap); O06319|Rv3452|MTCY13E12.05 HYPOTHETICAL 23.1 KDA PROTEIN (226 aa), FASTA scores: opt: 385, E(): 7.5e-15, (46.65% identity in 165 aa overlap); O06318|CUT3_MYCTU|Rv3451|MT3557|MTCY13E12.04 PROBABLE CUTINASE PRECURSOR (247 aa), FASTA scores: opt: 307, E(): 1.9e-10, (40.7% identity in 167 aa overlap); Q10837|CUT1_MYCTU|Rv1984c|MT2037|MTCY39.35 PROBABLE CUTINASE PRECURSOR (217 aa), FASTA scores: opt: 261, E(): 6.7e-08, (50.9% identity in 169 aa overlap); etc; and U00015_11 from Mycobacterium lepra. 5'-end of gene is Rv3724A|CUT5A; frameshifting may occur near position 4169668. TBparse score is 0.918." /codon_start=1 /transl_table=11 /product="cutinase" /protein_id="YP_178008.1" /db_xref="GI:57117149" /db_xref="GeneID:885390" /translation="MAPGSHLVLAASEDCSSTHCVSQVGAKSLGVYAVNYPASNDFAS SDFPKTVIDGIRDAGSHIQSMAMSCPQTRQVLGGYSQGAAVAGYVTSAVVPPAVPVQA VPAPMAPEVANHVAAVTLFGAPSAQFLGQYGAPPIAIGPLYQPKTLQLCADGDSICGD GNSPVAHGLYAVNGMVGQGANFAASRL" gene 4170214..4171143 /locus_tag="Rv3725" /db_xref="GeneID:885887" CDS 4170214..4171143 /locus_tag="Rv3725" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3725, (MTV025.073), len: 309 aa. Possible reductase (EC 1.-.-.-), similar to various oxidoreductases and hypothetical proteins e.g. O34285|HPNA HPNA PROTEIN from Zymomonas mobilis (337 aa), FASTA scores: opt: 317, E(): 6.1e-11, (30.5% identity in 272 aa overlap); Q9SZB3|F17M5.120|AT4G33360|AAK49584 HYPOTHETICAL 37.9 KDA PROTEIN from Arabidopsis thaliana (Mouse-ear cress) (344 aa), FASTA scores: opt: 314, E(): 9.1e-11, (30.35% identity in 267 aa overlap); AAK59445|AT4G33360 PUTATIVE DIHYDROKAEMPFEROL 4-REDUCTASE from Arabidopsis thaliana (Mouse-ear cress) (332 aa), FASTA scores: opt: 313, E(): 1e-10, (30.8% identity in 263 aa overlap); Q9FSC6|CCR CINNAMOYL-CoA REDUCTASE (EC 1.2.1.44) from Populus trichocarpa (Western balsam poplar) (338 aa), FASTA scores: opt: 305, E(): 2.9e-10, (30.3% identity in 274 aa overlap); Q9M631 CINNAMOYL CoA REDUCTASE from Populus tremuloides (Quaking aspen) (337 aa), FASTA scores: opt: 291, E(): 1.8e-09, (30.15% identity in 272 aa overlap); P73212|DFRA_SYNY3|LR1706 PUTATIVE DIHYDROFLAVONOL-4-REDUCTASE (EC 1.1.1.219) (DIHYDROKAEMPFEROL 4-REDUCTASE) from Synechocystis sp. strain PCC 6803 (343 aa), FASTA scores: opt: 278, E(): 1e-08, (29.35% identity in 259 aa overlap); etc. Also some similarity to proteins from Mycobacterium tuberculosis e.g. P96816|Rv0139|MTCI5.13 HYPOTHETICAL PROTEIN (340 aa) FASTA scores: opt: 234, E(): 3.2e-06, (28.25% identity in 269 aa overlap); and O06373|galE1|Rv3634c|MTCY15C10.18 PROBABLE UDP-GLUCOSE 4-EPIMERASE (314 aa) (27.3% identity in 194 aa overlap). TBparse score is 0.960." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218242.1" /db_xref="GI:15610861" /db_xref="GeneID:885887" /translation="MQNATMRVLVTGGTGFVGGWTAKAIADAGHSVRFLVRNPARLKT SVAKLGVDVSDFAVADISDRDSVREALNGCDAVVHSAALVATDPRETSRMLSTNMAGA QNVLGQAVELGMDPIVHVSSFTALFRPNLATLSADLPVAGGTDGYGQSKAQIEIYARG LQDAGAPVNITYPGMVLGPPVGDQFGEAGEGVRSALWMHVIPGRGAAWLIVDVRDVAA LHAALLESGRGPRRYTAGGHRIPVPELAKILGGSPAPRCWPSRCPIPRCVSRDRCWIK PGPICLSILRSPRQVCSTTHRCRSPTIRRAKKN" gene 4171421..4172614 /locus_tag="Rv3726" /db_xref="GeneID:885801" CDS 4171421..4172614 /locus_tag="Rv3726" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3726, (MTV025.074), len: 397 aa. Possible dehydrogenase (EC 1.-.-.-), similar to many e.g. O34788|YDJL DEHYDROGENASE from Bacillus subtilis (346 aa) FASTA scores: opt: 401, E(): 3.4e-17, (29.6% identity in 395 aa overlap); Q59696|ADH 2,3-BUTANEDIOL DEHYDROGENASE (EC 1.1.1.4) from seudomonas putida (362 aa), FASTA scores: opt: 326, E(): 1.3e-12, (29.45% identity in 387 aa overlap); AAG59541|YJJN PUTATIVE OXIDOREDUCTASE from Escherichia coli strain EDL933 (345 aa), FASTA scores: opt: 325, E(): 1.5e-12, (30.85% identity in 256 aa overlap); Q9HWM8|PA4153 2,3-BUTANEDIOL DEHYDROGENASE from Pseudomonas aeruginosa (363 aa), FASTA scores: opt: 324, E(): 1.8e-12, (30.5% identity in 387 aa overlap); etc. TBparse score is 0.922." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_218243.1" /db_xref="GI:15610862" /db_xref="GeneID:885801" /translation="MKAVTCTNAKLEVVDRPSPAPAKGQLLLDVLRCGICGSDLHARL HCDELADVMAESGYHAFMRSNQQVVFGHEFCGEVVDYGPGTRRTPRRGTPVVAMPLLR RGNKEVHGIGLSTMAPGAYAERLVVEQSLTFPVPNGLAPEIAALTEPMAVGWHAVRRG EVGKGDVAIVIGCGPIGLAVICMLKSRGVHTVIASDFSPGRRALATACGADSVVDPVQ DSPYAVAAGLGQGNRHLQSILDAFDLAVGTVERLQRLRLPWWHLWRAAEAAGAATPKR PVIFECVGVPGIIDGIIASAPLFSRVVVVGVCMGSDHIRPAMAINKEINLRFVLGYTP LEFRDTLHMLADGKVNAAPLITGTVGLPGVAAAFDALGDPEAHAKIMIDPKSNAASPQ PFRVE" gene 4172955..4174763 /locus_tag="Rv3727" /db_xref="GeneID:885766" CDS 4172955..4174763 /locus_tag="Rv3727" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3727, (MTV025.075), len: 602 aa. Possible oxidoreductase (EC 1.-.-.-), similar to several plants phytoene dehydrogenases/desaturases (EC 1.3.-.-) e.g. Q9HSE1|CRTI3|VNG0277G PHYTOENE DEHYDROGENASE from Halobacterium sp. strain NRC-1 (541 aa), FASTA scores: opt: 299, E(): 1.1e-10, (29.85% identity in 576 aa overlap); Q9FZL6|CITPDS1 PHYTOENE DESATURASE from Citrus unshiu (Satsuma orange) (553 aa), FASTA scores: opt: 164, E(): 0.018, (24.2% identity in 434 aa overlap); Q07356|CRTI_ARATH|PDS|AT4G14210|DL3145c PHYTOENE DEHYDROGENASE PRECURSOR from Arabidopsis thaliana (Mouse-ear cress) (566 aa), FASTA scores: opt: 163, E(): 0.021, (23.95% identity in 434 aa overlap); etc. N-terminal end similar to O69871|SC1C3.29 PUTATIVE PROTOPORPHYRINOGEN OXIDASE (FRAGMENT) from Streptomyces coelicolor (61 aa), FASTA scores: opt: 154, E(): 0.012, (60.45% identity in 43 aa overlap). The region between aa 155-310 is highly similar to Q49778|B2126_C1_169 from Mycobacterium leprae (159 aa), FASTA scores: opt: 437, E(): 1.5e-19, (46.6% identity in 161 aa overlap). And the region between aa 462-546 is highly similar to the N-terminal end of Q50003|U1764T from Mycobacterium leprae (155 aa), FASTA scores: opt: 277, E(): 8.3e-10, (57.65% identity in 85 aa overlap). TBparse score is 0.965." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218244.1" /db_xref="GI:15610863" /db_xref="GeneID:885766" /translation="MKPSPADTHVVIAGAGIAGLAAAMILAEAGVRVTLCEAASEAGG KAKSLRLADGHPTEHSLRVYTDTYQTLLTLFSRIPTEHDRTVLDNLVGVSMVSATAQG VIGRIAAPVALQRRRPTFARIIGKVVEPPRQLVRILLRGPMVIVGLAQRGVPATDVLH YLYAHLRLLWMCRERLLAELGDISYADYLQLGCKSAQAQEFFSAVPRIYVAARTSAEA AAIAPIVLKGLFRLKSNCPSALNDAKLPAIMMMDGPTSERMVDPWIRHLTRLGVDIHF NTRVGDLEFDDGRVTALISSDGRRFACDYALLAVPYLTLRELAKSAHVKRYLPQLTQQ HALALEASNGIQCFLRDLPATWPPFIRPGVVTTHLQSQWSLVCVLQGEGFWKNVRLPE GTRYVLSITWSDVETPGPVFDRPLSECTPDEILTECLTQCGLDKSNVLGWRIDHELKH LDEAEYEKVASELPPHLVSAPARGQRMVNFSPLTVLMPGARHRSPGICTSVPNLLLAG EVIYSPDLTLFVPTMEKAACSGYLAARQIMNMVASHAAPLRIDFRDPAPFAVLRRVDR WFWSRRRRPPDRSTFATPPTAMPAPSHLTDVDRSAS" gene 4174873..4178070 /locus_tag="Rv3728" /db_xref="GeneID:885271" CDS 4174873..4178070 /locus_tag="Rv3728" /function="UNKNOWN, BUT SEEMS INVOLVED IN EFFLUX SYSTEM (PROBABLY SUGAR OR DRUG TRANSPORT)." /note="Rv3728, (MTV025.076), len: 1065 aa. Probable conserved transmembrane protein organised into two domains. Domain comprising the first 510 aa residues is similar to various multidrug resistance and efflux proteins and contains sugar transport protein signature 1 (PS00216). Domain corresponding to the last 550 aa residues contains cyclic nucleotide-binding domain signature 2 (PS00889) and is very similar to Q50733|YP65_MYCTU|Rv2565|MT2641|MTCY9C4.03c hypothetical 62.1 kDa protein from Mycobacterium tuberculosis (31.0% identity in 546 aa overlap). Highly similar to O05884|Rv3239c|MTCY20B11.14c PROBABLE TRANSMEMBRANE TRANSPORT PROTEIN from Mycobacterium tuberculosis (1048 aa) FASTA scores: opt: 4328, E(): 5e-201, (64.15% identity in 1046 aa overlap). N-terminal end similar to P71879|Rv2333c|MTCY3G12.01|MTCY98.02c (537 aa); P71836|Rv0783c|MTCY369.27c (540 aa); and O07753|Rv1877|MTCY180.41c (687 aa). SEEMS BELONG TO THE SUGAR TRANSPORTER FAMILY. Possibly member of major facilitator superfamily (MFS)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218245.1" /db_xref="GI:15610864" /db_xref="GeneID:885271" /translation="MHTVATNNAAPVIAAGPVGPSRRRRRVHAPLTRRRQPSSSAVLL VAAFGAFLAFLDSTIVNVAFPDIQRHFHSDISDLSWMLNAYNIVFAAFLVAAGRLADL MGRKRVFILGVALFTVASGLCAIAESVGELVAFRVLQGIGAAVLVPASLGLVVEAFPA ERRAHGVNLWGAAGAIAAGLGPPIGGALIEADGWRWVFLVNLPLGVFAVLAARRALVE NRAAGRRRVPDVRGAVLLAFALGLLTLGLIKGPDWGWASLPTSGSLLAAAVAMVGFVM SSRHHPAPMVEPTLLRIQSFVAGTGLTAVASAGFYAYLLTHVLFLNYVWGYTLLEAGM AVAPAALVAAVVAAVLGRVADRHGYRFIVGIGALIWAASLLWYLKVVGSQPDFLGEWL PGQILQGIGVGATFPLLGSAALARLAKGGSYATASAVTGTIRQVGAVIGVAVLVILVG TPAPGAAEEALRHGWALAAICFVAVGIGALSLGRIRPVPAAVEPPPGPPVAPLGARRP PRPAPVASPAAAVAPTPKTSREVNLLEALRFARPDTQQIELQAGSYLFHAGDVSDALY VVRSGRLQVLAGDGAKDEVVAELGRGQVVGELGVLLDAPRSASVRAVRDSSLMRVTKA EFAKIADAGVLGALAGVLAKRQHQTRVASQRTTPEVVVAVVGVDANAPVAMVATELCR ALSTRLRAVAPGRVDCDGLERAEQTADRVVLHAAVGDARWREFCLRVADRVVLVASNP AVPVAPLPTRATGADLVLAGRPAGREHRRAWEQLITPRSMHVVRREFVADDLRVLATR IAGRSVGLVLSGGAARACAHLGVLEELEAAGVTVDRFAGTSMGAIIAALAASGLDAAG VDAQIYEHFVRKSHGDYTLPSKGLIRGKRTQSTLRTIFGDHLVEELPKHFRCVSVDLL ARRPVVHRQGPLADVVGCSMRLPFLYAPLPYGGTLHVDGGVLDNVPVTTLVGKDGPLI AVNVASGGNPSPASGGHRRGKPRVPGLTDTLLRTMTISSAMASEKVLAQADLVIKPNP IGVGLMEYHQIDRAREAGRIAAREALPQIMELVHG" misc_feature 4175158..4175208 /locus_tag="Rv3728" /note="PS00216 Sugar transport proteins signature 1" misc_feature 4176655..4176708 /locus_tag="Rv3728" /note="PS00889 Cyclic nucleotide-binding domain signature 2" gene 4178285..4180615 /locus_tag="Rv3729" /db_xref="GeneID:885706" CDS 4178285..4180615 /locus_tag="Rv3729" /EC_number="2.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3729, (MTV025.077), len: 776 aa. Conserved hypothetical protein, possible transferase (EC 2.-.-.-), similar to several hypothetical proteins and various transferases e.g. O26919|MTH831 MOLYBDENUM COFACTOR BIOSYNTHESIS MOAA HOMOLOG from Methanobacterium thermoautotrophicum (497 aa), FASTA scores: opt: 697, E(): 4.8e-34, (30.7% identity in 492 aa overlap); Q58036|Y619_METJA|MJ0619 HYPOTHETICAL PROTEIN from Methanococcus jannaschii (506 aa), FASTA scores: opt: 670, E(): 2e-32, (30.6% identity in 497 aa overlap); O27968|AF2316 CONSERVED HYPOTHETICAL PROTEIN from Archaeoglobus fulgidus (518 aa), FASTA scores: opt: 477, E(): 6.4e-21, (29.4% identity in 500 aa overlap); BAB60102|TVG0985801 MOLYBDENUM COFACTOR BIOSYNTHESIS PROTEIN from Thermoplasma volcanium (606 aa), FASTA scores: opt: 402, E(): 2.1e-16, (28.1% identity in 509 aa overlap); etc. C-terminus similar to methyltransferases e.g. Q9S0N6|AVED C5-O-METHYLTRANSFERASE from Streptomyces avermitilis (283 aa), FASTA scores: opt: 298, E(): 1.9e-10, (31.5% identity in 292 aa overlap). Also similar to the Mycobacterium tuberculosis proteins P71673|YE05_MYCTU|Rv1405c|MT1449|MTCY21B4.22c (274 aa); and Q50584|Rv1523|MTCY19G5.05c. TBparse score is 0.909." /codon_start=1 /transl_table=11 /product="transferase" /protein_id="NP_218246.1" /db_xref="GI:15610865" /db_xref="GeneID:885706" /translation="MFVEYTKSICPVCKVVVDAQVNIRHDKVYLRKRCREHGSFEALV YGDAQMYLESARFNKPGTFPLRFQTEVRDGCPSDCGLCPDHKQHACLGLIEVNTHCNL DCPICFADSGHQPDGYAITAAQCERMLDTLVAAEGEPEVVMFSGGEPTIHKQLLEFVD AAQARPVKTVIINTNGIRLASDRRFVDQLATRNRPGHPVHIYLQFDGLDEATHRRIRG HDLRDVKQRALDNCAAAGLTVSLVAAVERGLNEHELGAVIRHGMAQPGVQPVVFQPVT HAGRHVQFDPLTRLTNSDIIACITAQLPEWFRPGDFFPVPCCFPSCRSITYLLTDGEH VVPIPRLLNVEDYLDYVSNRVIPDLAIREALENLWSASAVPGTDTMTAQLQRATAALN CAEGCGINLPEALTHLTDRVFAIVIQDFQDPYTLNVKQLMKCCVQQITPDGRLIPFCA YNSVGYREQVREQLTGVPVPDIVPNAIPLAGLLADAPHGSKQANTGGSIARLAGPTRG APMALPPQQIKACCADAYSRDIVALLLGDSFHPGGATLTRRLADQLGLRSTGDPRRVA DIAAGPGASARLLASDYGVAVDGVDISEINVKRAQAAVAQTGLTERVRFHLGDAESVP LPDDTFDALVCECAFCTFPDKNAAAQQFARILRPGGLAGITDVTVGDGGLPAELTPLA AWVACIADARTVTDYTDILEGAGLRTRHIESHDESLLDMIDRIDARITALHVAAPEIL ADNGIRHDSVRDFTALARAAVQTGRIGYTLMIAEKP" gene complement(4180680..4181720) /locus_tag="Rv3730c" /db_xref="GeneID:885201" CDS complement(4180680..4181720) /locus_tag="Rv3730c" /function="UNKNOWN" /note="Rv3730c, (MTV025.078c), len: 346 aa. Conserved hypothetical protein, highly similar to Q9XAM1|SC4C6.19 HYPOTHETICAL 38.5 KDA PROTEIN from Streptomyces coelicolor (341 aa), FASTA scores: opt: 1313, E(): 2.2e-75, (59.25% identity in 336 aa overlap); and similar to C-terminal end of PUTATIVE ATP-DEPENDENT DNA LIGASES e.g. BAB49297|MLL2077 from Rhizobium loti (Mesorhizobium loti) (833 aa), FASTA scores: opt: 550, E(): 5.3e-27, (31.3% identity in 294 aa overlap); and BAB54816|MLL9625 from Rhizobium loti (Mesorhizobium loti) plasmid pMLb (883 aa) FASTA scores: opt: 492, E(): 2.5e-23, (33.7% identity in 291 aa overlap); etc. Also similar to the hypothetical proteins e.g. Q9ZC15|SC1E6.07 HYPOTHETICAL 34.9 KDA PROTEIN from Streptomyces coelicolor (319 aa) FASTA scores: opt: 537, E(): 1.5e-26, (34.95% identity in 292 aa overlap); Q9XAF7|SC6G9.25 HYPOTHETICAL 32.1 KDA PROTEIN from Streptomyces coelicolor (293 aa), FASTA scores: opt: 474, E(): 1.3e-22, (33.75% identity in 302 aa overlap); etc. Also highly similar to P95226|Rv0269c|MTCY06A4.13c HYPOTHETICAL 44.0 KDA PROTEIN from Mycobacterium tuberculosis (397 aa), FASTA scores: opt: 940, E(): 7.7e-52, (50.3% identity in 312 aa overlap). TBparse score is 0.895." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218247.1" /db_xref="GI:15610866" /db_xref="GeneID:885201" /translation="MAAAAEELDVDGIAVRLTSPDRMYFPKLGSHGTKRRLVEYYFAV AGGPMLTALRDRPTHLQRFPDGVDGEQIYQKRIPRHRPDYLQTCRVTFPSGRMADALK VTHPAAIVWAAQMGTITLHPWQVRCPDTEHPDELRIDLDPQPGTGFVEARTVAVDVLR SVLDDLGLVGYPKTSGGRGIHVFLRIATDWDFVEVRRAGIALAREVERRAPDAVTTSW WKEERGARIFIDFNQNARDRTMASAYSVRPTPIATVSMPLTWEELAGADPDDYTMTTV PELVKIRDDPWAGMDDVAQSIAPLLDLAAADEERGLGDMPYPPNYPKMPGEPKRVQPS RDTDLKGGNTSK" gene 4181758..4182834 /gene="ligC" /locus_tag="Rv3731" /db_xref="GeneID:885771" CDS 4181758..4182834 /gene="ligC" /locus_tag="Rv3731" /EC_number="6.5.1.1" /function="THIS PROTEIN SEALS DURING DNA REPLICATION, DNA RECOMBINATION AND DNA REPAIR NICKS IN DOUBLE-STRANDED DNA [CATALYTIC ACTIVITY: ATP + (DEOXYRIBONUCLEOTIDE)(N) + (DEOXYRIBONUCLEOTIDE)(M) = AMP + PYROPHOSPHATE + (DEOXYRIBONUCLEOTIDE)(N+M)]." /note="catalyzes the ATP-dependent formation of a phosphodiester at the site of a single-strand break in duplex DNA; in mycobacteria LigC has weak intrinsic nick joining activities and is not essential for growth" /codon_start=1 /transl_table=11 /product="ATP-dependent DNA ligase" /protein_id="NP_218248.1" /db_xref="GI:15610867" /db_xref="GeneID:885771" /translation="MQLPVMPPVSPMLAKSVTAIPPDASYEPKWDGFRSICFRDGDQV ELGSRNERPMTRYFPELVAAIRAELPHRCVIDGEIIIATDHGLDFEALQQRIHPAESR VRMLADRTPASFIAFDLLALGDDDYTGRPFSERRAALVDAVTGSGADADLSIHVTPAT TDMATAQRWFSEFEGAGLDGVIAKPPHITYQPDKRVMFKIKHLRTADCVVAGYRVHKS GSDAIGSLLLGLYQEDGQLASVGVIGAFPMAERRRLLTELQPLVTSFDDHPWNWAAHV AGQRTPRKNEFSRWNVGKDLSFVPLRPERVVEVRYDRMEGARFRHTAQFNRWRPDRDP RSCSYAQLERPLTVSLSDIVPGLR" gene 4182934..4183992 /locus_tag="Rv3732" /db_xref="GeneID:885795" CDS 4182934..4183992 /locus_tag="Rv3732" /function="UNKNOWN" /note="Rv3732, (MTV025.080), len: 352 aa. Conserved hypothetical protein. The region between aa 175-352 is highly similar to the region between aa 72-257 of Q9KH39 HYPOTHETICAL 55.5 KDA PROTEIN from Mycobacterium smegmatis (511 aa), FASTA scores: opt: 1122, E(): 7.3e-63, (98.85% identity in 176 aa overlap). Also shows some similarity with Q55304 HYPOTHETICALK PROTEIN from Synechocystis sp. strain PCC 6803 (387 aa), FASTA scores: opt: 201, E(): 2.7e-05, (27.1% identity in 251 aa overlap); and P74254|SLR1173 HYPOTHETICAL 52.5 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (463 aa), FASTA scores: opt: 201, E(): 3.1e-05, (27.1% identity in 251 aa overlap). Also slightly similar to MTCY01B2_21 and DPO1_MYCTU DNA POLYMERASE I. TBparse score is 0.913." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218249.1" /db_xref="GI:15610868" /db_xref="GeneID:885795" /translation="MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGS QATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDT LSAPLIEHQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTW LSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMR LSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHG SYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGA AGGAVVVVLRRRRRAHTG" gene complement(4184012..4184512) /locus_tag="Rv3733c" /db_xref="GeneID:885721" CDS complement(4184012..4184512) /locus_tag="Rv3733c" /function="UNKNOWN" /note="Rv3733c, (MTV025.081c), len: 166 aa. Conserved hypothetical protein, highly similar to Q9FCB0|2SCG58.03 PUTATIVE MUTT-LIKE PROTEIN from Streptomyces coelicolor (153 aa), FASTA scores: opt: 541, E(): 7.2e-29, (52.7% identity in 148 aa overlap); and BAB49143|MLR1881 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (156 aa), FASTA scores: opt: 526, E(): 7.2e-28, (52.65% identity in 150 aa overlap). TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218250.1" /db_xref="GI:15610869" /db_xref="GeneID:885721" /translation="MPKLSAGVLLYRARAGVVDVLLAHPGGPFWAGKDDGAWSIPKGE YTGGEDPWLAARREFSEEIGLCVPDGPRIDFGSLKQSGGKVVTVFGVRADLDITDARS STFELDWPKGSGKMRKFPEVDRVSWFPVARARTKLLKGQRGFLDRLMAHPAVAGLSEG PESLPR" gene complement(4184526..4185890) /locus_tag="Rv3734c" /db_xref="GeneID:885335" CDS complement(4184526..4185890) /locus_tag="Rv3734c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3734c, (MTV025.082c), len: 454 aa. Hypothetical protein, highly similar to O69707|Y1E0_MYCTU|Rv3740c|MT3848|MTV025.088c HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (448 aa), FASTA scores: opt: 1917, E(): 1.3e-111, (61.4% identity in 451 aa overlap); and similar to many other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. P71694|YE43_MYCTU|Rv1425|MT1468|MTCY21B4.43|MTCY493.29c (459 aa), FASTA scores: opt: 824, E(): 1.1e-43, (36.5% identity in 460 aa overlap); Q50680|YM85_MYCTU|Rv2285|MT2343|MTCY339.25c (445 aa) FASTA scores: opt: 766, E(): 4.1e-40, (36.4% identity in 453 aa overlap); etc. Also similar to Q9RIU8|SCM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 331, E(): 4.3e-13, (32.9% identity in 468 aa overlap); and Q9X7A8|ML1244|MLCB1610.05 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (491 aa), FASTA scores: opt: 296, E(): 7e-11, (28.35% identity in 413 aa overlap). Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. Start site chosen by homology, but may extend further upstream to 93257. TBparse score is 0.923." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218251.1" /db_xref="GI:15610870" /db_xref="GeneID:885335" /translation="MDLMMPNDSMFLFIESREHPMHVGGLSLFEPPQGAGPEFVREFT ERLVANDEFQPMFRKHPATIGGGIARVAWAYDDDIDIDYHVRRSALPSPGRVRDLLEL TSRLHTSLLDRHRPLWELHVVEGLNDGRFAMYTKMHHALIDGVSAMKLAQRTLSADPD DAEVRAIWNLPPRPRTRPPSDGSSLLDALFKMAGSVVGLAPSTLKLARAALLEQQLTL PFAAPHSMFNVKVGGARRCAAQSWSLDRIKSVKQAAGVTVNDAVLAMCAGALRYYLIE RNALPDRPLIAMVPVSLRSKEDADAGGNLVGSVLCNLATHVDDPAQRIQTISASMDGN KKVLSELPQLQVLALSALNMAPLTLAGVPGFLSAVPPPFNIVISNVPGPVDPLYYGTA RLDGSYPLSNIPDGQALNITLVNNAGNLDFGLVGCRRSVPHLQRLLAHLESSLKDLEQ AVGI" misc_feature complement(4185141..4185170) /locus_tag="Rv3734c" /note="PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2" gene 4186089..4186577 /locus_tag="Rv3735" /db_xref="GeneID:885318" CDS 4186089..4186577 /locus_tag="Rv3735" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3735, (MTV025.083), len: 162 aa. Conserved hypothetical protein, highly similar to several bacterial hypothetical proteins e.g. Q9UX41|ORF-C09_016|SSO0651|AAK40956 from Sulfolobus solfataricus (163 aa), FASTA scores: opt: 627, E(): 1.2e-34, (55.9% identity in 161 aa overlap); O26795|MTH699 from Methanobacterium thermoautotrophicum (168 aa), FASTA scores: opt: 616, E(): 6.7e-34, (56.1% identity in 155 aa overlap); |Q9Y9J9|APE2289 from Aeropyrum pernix (191 aa), FASTA scores: opt: 591, E(): 3.4e-32, (54.65% identity in 161 aa overlap) ; etc. Contains PS00435 Peroxidases proximal heme-ligand signature. TBparse score is 0.902." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218252.1" /db_xref="GI:15610871" /db_xref="GeneID:885318" /translation="MSLAWDVVSVDKPDDVNVVIGQAHFIKAVEDLHEAMVGVSPSLR FGLAFCEASGPRLVRHTGNDGDLVELATRTALAIAAGHSFVIFLREGFPINILNPVQA VPEVCTIYCATANPVDVVVAVTPHGRGIVGVVDGQTPLGVETDRDIAQRRDLLRAIGY KL" misc_feature 4186308..4186340 /locus_tag="Rv3735" /note="PS00435 Peroxidases proximal heme-ligand signature" gene 4186634..4187695 /locus_tag="Rv3736" /db_xref="GeneID:885389" CDS 4186634..4187695 /locus_tag="Rv3736" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3736, (MTV025.084), len: 353 aa. Probable transcriptional regulator, araC/xylS family, similar to many transcriptional regulators and hypothetical proteins e.g. CAC38740 HYPOTHETICAL 35.4 KDA PROTEIN from Bradyrhizobium japonicum (318 aa), FASTA scores: opt: 438, E(): 2e-20, (29.4% identity in 306 aa overlap); Q9HZ25|PA3215 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (337 aa), FASTA scores: opt: 395, E(): 1.1e-17, (30.3% identity in 320 aa overlap); Q9HTN1|PA5324 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (356 aa), FASTA scores: opt: 313, E(): 1.8e-12, (25.85% identity in 329 aa overlap); Q9Z3Y6|PHBR TRANSCRIPTIONAL REGULATOR PHBR from Pseudomonas sp. 61-3 (379 aa), FASTA scores: opt: 271, E(): 8.3e-10, (22.95% identity in 357 aa overlap); etc. Also highly similar to Q06861|VIRS_MYCTU|Rv3082c|MTV013.03c POSSIBLE VIRULENCE-REGULATING PROTEIN from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 656, E(): 3.7e-34, (36.95% identity in 333 aa overlap); and similar to other hypothetical mycobacterial proteins e.g. P71663|YD95_MYCTU|Rv1395|MT1440|MTCY21B4.12 (344 aa). Contains helix-turn-helix motif at aa 245-266 (Score 1140, +3.07 SD). SEEMS BELONG TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS.TBparse score is 0.926." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein AraC/XylS-family" /protein_id="NP_218253.1" /db_xref="GI:15610872" /db_xref="GeneID:885389" /translation="MSVVRGTALANYPSLVAGLGGDPATLLRAAGVRDQDVGNYDAFI SIRAAIRAIESAAAVTATMDFGRRLAQRQGIEILGPVGVAARTAATVGDALAIFNTFM AAYSPVIAIRITPLAGQRSFIALEFLLDEPASYPQTMELALGVALGVIRLLLGADYAP LAVHLPHDPLTPEAFYLQYFGCRPYFAERVGGFTMRTADLSRPLNRDDVAHRVVVDYL SSITPLGEGIVESVRTIVRQLLPTGAATLNVVAEQFHLHPKTLQRRLAEENTTFVILV DRVRKDVADRYLRTTGIGLTHLARELGYAEQSVLTRSCKRWFGTGPAAYRNQARLQTT VSAPGSGRGPNPGNVSVSC" gene 4187699..4189288 /locus_tag="Rv3737" /db_xref="GeneID:885794" CDS 4187699..4189288 /locus_tag="Rv3737" /function="UNKNOWN" /note="Rv3737, (MTV025.085), len: 529 aa. Probable conserved transmembrane protein, similar to others and also some hypothetical proteins e.g. AAK61331|THRE THREONINE EXPORT CARRIER from Corynebacterium glutamicum (Brevibacterium flavum) (489 aa), FASTA scores: opt: 773, E(): 1.8e-36, (37.25% identity in 424 aa overlap); Q9X8J0|SCE9.17 PUTATIVE MEMBRANE PROTEIN from Streptomyces coelicolor (578 aa), FASTA scores: opt: 642, E(): 5.4e-29, (31.6% identity in 481 aa overlap) (shorter 119 aa at N-terminus); Q9CJU6|PM1895 HYPOTHETICAL PROTEIN from Pasteurella multocida (262 aa), FASTA scores: opt: 233, E(): 4.1e-06, (25.0% identity in 256 aa overlap); Q9S267|SCI30A.06 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (297 aa), FASTA scores: opt: 163, E(): 0.042, (29.65% identity in 263 aa overlap); etc. Also partially similar to O05435|Rv3910|MTCY15F10.01c|MTV028.01 HYPOTHETICAL 123.6 KDA PROTEIN from Mycobacterium tuberculosis (1184 aa) (34.4% identity in 125 aa overlap). TBparse score is 0.891" /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218254.1" /db_xref="GI:15610873" /db_xref="GeneID:885794" /translation="MDQDRSDNTALRRGLRIALRGRRDPLPVAGRRSRTSGGIDDLHT RKVLDLTIRLAEVMLSSGSGTADVVATAQDVAQAYQLTDCVVDITVTTIIVSALATTD TPPVTIMRSVRTRSTDYSRLAELDRLVQRITSGGVAVDQAHEAMDELTERPHPYPRWL ATAGAAGFALGVAMLLGGTWLTCVLAAVTSGVIDRLGRLLNRIGTPLFFQRVFGAGIA TLVAVAAYLIAGQDPTALVATGIVVLLSGMTLVGSMQDAVTGYMLTALARLGDALFLT AGIVVGILISLRGVTNAGIQIELHVDATTTLATPGMPLPILVAVSGAALSGVCLTIAS YAPLRSVATAGLSAGLAELVLIGLGAAGFGRVVATWTAAIGVGFLATLISIRRQAPAL VTATAGIMPMLPGLAVFRAVFAFAVNDTPDGGLTQLLEAAATALALGSGVVLGEFLAS PLRYGAGRIGDLFRIEGPPGLRRAVGRVVRLQPAKSQQPTGTGGQRWRSVALEPTTAD DVDAGYRGDWPATCTSATEVR" gene complement(4189285..4190232) /gene="PPE66" /locus_tag="Rv3738c" /db_xref="GeneID:886262" CDS complement(4189285..4190232) /gene="PPE66" /locus_tag="Rv3738c" /function="UNKNOWN" /note="Rv3738c, (MTV025.086c), len: 315 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O53265|Rv3018c|MTV012.32c (434 aa), FASTA scores: opt: 464, E(): 2.2e-17, (47.05% identity in 338 aa overlap). Probably a continuation of the upstream ORF MTV025.87c|Rv3739c|PPE67. At position 97470-72 a stop codon is present which interrupts a possibly longer ORF, observed in related ORFs MTV012_32 or MTCY21B4_4. The sequence has been checked and no errors were detected. A similar situation, but with a frameshift separating the ORFs is found in MTV012_36/MTV012_35. Sequence similarity is also seen with MTCY251_15; MTCY261_19; MLCB2492_30 from Mycobacterium leprae; MTCY10G2_10; MTY21C12_9; MTCI125_26; MTCY164_36; MTCY6A4_1. TBparse score is 0.920." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_178009.1" /db_xref="GI:57117150" /db_xref="GeneID:886262" /translation="MTTAYASALAAMPTLTELAANHTSHAVLLGTNFFGINTIPIALN EADYARMWIQAATTMSIYEGTSDAALASAPQTTPAPVLFNGGAGVASALPAISAATLD PASIIGIIIEILIQLFLISLEILFAIVAYTIIIVLILPLVIFAYAIVFAVLAIIFGPP LLVIASPFVLTGSVIAVPTSLSTSLSTAVPIGVGQYLADLASADAQAIEVGLKTADVA PVAVRPAAAPPLRESAAVRPEARLVSAVAPAPAGTSASVLASDRGAGVLGFAGTAGKE SVGRPAGLTTLAGGEFGGSPSVPMVPASWEQLVGAGEAG" gene complement(4190284..4190517) /gene="PPE67" /locus_tag="Rv3739c" /db_xref="GeneID:886257" CDS complement(4190284..4190517) /gene="PPE67" /locus_tag="Rv3739c" /function="UNKNOWN" /note="Rv3739c, (MTV025.087c), len: 77 aa. Member of the Mycobacterium tuberculosis PE family, showing high homology with O53269|Rv3022c|MTV012.36c (82 aa) FASTA scores: opt: 398, E(): 1.2e-19, (74.0% identity in 77 aa overlap); and similar to the N-termini of other PPE proteins e.g. O53265|Rv3018c|MTV012.32c (434 aa) FASTA scores: opt: 398, E(): 4.8e-19, (74.0% identity in 77 aa overlap). ORF ends at the stop codon at position 97470, which is not present in similar ORFs: MTV012_32, or MTCY21B4_4. Sequence homology with MTV012_32, and MTCY21B4_4 continues in the downstream ORF MTV025.086c|Rv3738c|PPE66. Sequence was checked, but no errors were detected. A similar situation, but with a frameshift separating the ORFs, is found in MTV012_36/MTV012_35. Also ORF MTV025.87c shows similarity to MTV03 _14; MTCY6A4_1; MTV035_8; MTV037_17; MLCB2492_30; MTCY261_19; MTCY251_15; MTCY3A2_23; MTCY28_16; etc." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_178010.1" /db_xref="GI:57117151" /db_xref="GeneID:886257" /translation="MTAPIWFASPPEVHSALLSAGPGPASLQAAAAEWTSLSAEYASA AQELTAVLAAVQGGAWEGPSAEAYVAAHLPYLA" gene complement(4190833..4192179) /locus_tag="Rv3740c" /db_xref="GeneID:885781" CDS complement(4190833..4192179) /locus_tag="Rv3740c" /function="UNKNOWN" /note="Rv3740c, (MTV025.088c), len: 448 aa. Conserved hypothetical protein, highly similar to several other Mycobacterium tuberculosis hypothetical proteins e.g. O69701|Y1D4_MYCTU|Rv3734c|MT3839|MTV025.082c (454 aa) FASTA scores: opt: 1917, E(): 2.3e-112, (61.4% identity in 451 aa overlap); Q50680|YM85_MYCTU|Rv2285|MT2343|MTCY339.25c (445 aa) FASTA scores: opt: 858, E(): 3.4e-46, (37.4% identity in 460 aa overlap); Q10554|Y895_MYCTU|Rv0895|MT0919|MTCY31.23 (505 aa), FASTA scores: opt: 767, E(): 1.9e-40, (44.3% identity in 467 aa overlap); MTCY31_25; MTCY28_26; MTCY493_29; MTCY21B4_43; MTCY8D5_16; MTCY3A2_28; MTV013_8; MTY13E12_33; MTV013_9; MTY20B11_9; etc. Also similar to Q9RIU8|SCM11.13c HYPOTHETICAL 47.1 KDA PROTEIN from Streptomyces coelicolor (446 aa), FASTA scores: opt: 319, E(): 1.7e-12, (30.9% identity in 453 aa overlap). TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218257.1" /db_xref="GI:15610876" /db_xref="GeneID:885781" /translation="MSPIDALFLSAESREHPLHVGALQLFEPPAGAGRGFVRETYQAM LQCREIAPLFRKRPTSLHGALINLGWSTDADVDLGYHARRSALPAPGRVRELLELTSR LHSNLLDRHRPLWETHVIEGLRDGRFAIYSKMHHALVDGVSGLTLMRQPMTTDPIEGK LRTAWSPATQHTAIKRRRGRLQQLGGMLGSVAGLAPSTLRLARSALIEQQLTLPFGAP HTMLNVAVGGARRCAAQSWPLDRVKAVKDAAGVSLNDVVLAMCAGALREYLDDNDALP DTPLVAMVPVSLRTDRDSVGGNMVGAVLCNLATHLDDPADRLNAIHASMRGNKNVLSQ LPRAQALAVSLLLLSPAALNTLPGLAKATPPPFNVCISNVPGAREPLYFNGARMVGNY PMSLVLDGQALNITLTSTADSLDFGVVGCRRSVPHVQRVLSHLETSLKELERAVGL" gene complement(4192179..4192853) /locus_tag="Rv3741c" /db_xref="GeneID:885129" CDS complement(4192179..4192853) /locus_tag="Rv3741c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3741c, (MTV025.089c), len: 224 aa. Possible oxidoreductase, probably combines with product of upstream ORF MTV025.090c to form a functional monooxygenase (EC 1.-.-.-), highly similar to C-terminal end of various oxidoreductases e.g. Q9APW3 AROMATIC-RING HYROXYLASE from Pseudomonas aeruginosa (508 aa), FASTA scores: opt: 549, E(): 5.9e-28, (56.1% identity in 155 aa overlap); Q9A588|CC2569 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (498 aa), FASTA scores: opt: 487, E(): 5.6e-24, (39.55% identity in 225 aa overlap); Q9RZT0|DRB0033 ARYLESTERASE/MONOXYGENASE from Deinococcus radiodurans (833 aa), FASTA scores: opt: 460, E(): 4.7e-22, (38.5% identity in 226 aa overlap); etc. Also similar to C-terminal end of Mycobacterium tuberculosis proteins (generally monooxygenases) e.g. P96223|Rv3854c|MTCY01A6.14 HYPOTHETICAL 55.3 KDA PROTEIN (489 aa), FASTA scores: opt: 542, E(): 1.6e-27, (50.0% identity in 162 aa overlap); O53762|Rv0565c|MTV039.03c PUTATIVE MONOXYGENASE (486 aa), FASTA scores: opt: 462, E(): 2.2e-22, (37.15% identity in 226 aa overlap); O53300|Rv3083|MTV013.04 MONOXYGENASE (495 aa), FASTA scores: opt: 462, E(): 2.2e-22, (45.65% identity in 173 aa overlap); etc. Note similarity to MTCY01A6.14 and MTV013.04 continue in upstream ORF (MTV025.090c) after a gap of 100 aa. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218258.1" /db_xref="GI:15610877" /db_xref="GeneID:885129" /translation="MIGRDRAYAVTRRKDIAKQRLVWRLCQRYPRAARRLIRHLNAKQ LAAGYPADEHFKPVYNPWDQRLCAVPDADMFKAIRDGRASVVTEAIDTFTENGIRLQS GRELAADISITATGLNLLAFGGINLSVDGVAVDVAEKVAFKGFLLSDVSNFAGPHGRT RAHHLLSAAARSHADPAAAGRRSPLADLKVLREGPVDDDHLRFTTSASASRLTVKRIT RSTPWN" gene complement(4192850..4193245) /locus_tag="Rv3742c" /db_xref="GeneID:885263" CDS complement(4192850..4193245) /locus_tag="Rv3742c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3742c, (MTV025.090c), len: 131 aa. Possible oxidoreductase, probably combines with product of downstream ORF MTV025.090c to form a functional monooxygenase (EC 1.-.-.-), highly similar to N-terminal end of various oxidoreductases e.g. Q9A588|CC2569 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (498 aa), FASTA scores: opt: 170, E(): 0.00048, (47.55% identity in 103 aa overlap); Q9APW3 AROMATIC-RING HYROXYLASE from Pseudomonas aeruginosa (508 aa) FASTA scores: opt: 160, E(): 0.0022, (50.55% identity in 87 aa overlap); Q9RZT0|DRB0033 ARYLESTERASE/MONOXYGENASE from Deinococcus radiodurans (833 aa), FASTA scores: opt: 153, E(): 0.0097, (45.45% identity in 88 aa overlap); etc. Also similar to C-terminal end of Mycobacterium tuberculosis proteins (generally monooxygenases) e.g. P96223|Rv3854c|MTCY01A6.14 HYPOTHETICAL 55.3 KDA PROTEIN (489 aa), FASTA scores: opt: 140, E(): 0.044, (37.1% identity in 132 aa overlap); O53300|Rv3083|MTV013.04 MONOXYGENASE (495 aa) FASTA scores: opt: 133, E(): 0.13, (43.05% identity in 79 aa overlap); O53762|Rv0565c|MTV039.03c PUTATIVE MONOXYGENASE (486 aa), FASTA scores: opt: 110, E(): 4.1, (42.85% identity in 77 aa overlap); etc. Note similarity to MTCY01A6.14 and MTV013.04 continue in downstream ORF (MTV025.089c) after a gap of 100 aa. TBparse score is 0.915." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218259.1" /db_xref="GI:15610878" /db_xref="GeneID:885263" /translation="MHSEQSASIEHVDVLIVGAGISGTGAAYYLKTMQPAKTFAIVEA RYPAIRSDSDLHTFSYEFKPWQHEKATASADAIMVHRGRSLAGGDRTLRHRRTRHHEL RMVIIGSGATAVTLVPAMAQTAGAVTMPK" gene complement(4193391..4195373) /gene="ctpJ" /locus_tag="Rv3743c" /db_xref="GeneID:885106" CDS complement(4193391..4195373) /gene="ctpJ" /locus_tag="Rv3743c" /EC_number="3.6.1.-" /function="CATION-TRANSPORTING ATPASE; POSSIBLY CATALYZES THE TRANSPORT OF A UNDETERMINATED CATION (POSSIBLY CADMIUM) WITH HYDROLYSE OF ATP [CATALYTIC ACTIVITY: ATP + H(2)O + UNDETERMINATED CATION(IN) = ADP + PHOSPHATE + UNDETERMINATED CATION(OUT)]." /note="Rv3743c, (MTV025.091c), len: 660. Probable ctpJ, cation-transporting P-type ATPase (EC 3.6.1.-), transmembrane protein highly similar to others e.g. Q9ZBF3|SC9B5.27 PUTATIVE CATION-TRANSPORTING ATPASE from Streptomyces coelicolor (638 aa), FASTA scores: opt: 1635, E(): 2.5e-86, (62.25% identity in 63.95 aa overlap); Q59997|CADA|SLR0797 CADMIUM-TRANSPORTING ATPASE from Synechocystis sp. strain PCC 6803 (642 aa), FASTA scores: opt: 1474, E(): 4.3e-77, (42.4% identity in 604 aa overlap); P30336|CADA_BACFI PROBABLE CADMIUM-TRANSPORTING ATPASE from Bacillus firmus (723 aa), FASTA scores: opt: 1327, E(): 1.3e-68, (36.6% identity in 626 aa overlap); etc. Also highly similar to O53160|CTPD_MYCTU|Rv1469|MT1515|MTV007.16 PROBABLE CATION-TRANSPORTING P-TYPE ATPASE D from Mycobacterium tuberculosis (657 aa), FASTA scores: opt: 1845, E(): 2.3e-98, (55.85% identity in 650 aa overlap). Contains PS00154 E1-E2 ATPases phosphorylation site and PS01229 Hypothetical family signature 2. BELONGS TO THE CATION TRANSPORT ATPASES FAMILY (E1-E2 ATPASES). TBparse score is 0.903." /codon_start=1 /transl_table=11 /product="cation transporter P-type ATPase CtpJ" /protein_id="NP_218260.1" /db_xref="GI:15610879" /db_xref="GeneID:885106" /translation="MAVRELSPARCTSASPLVLARRTKLFALSEMRWAALALGLFSAG LLTQLCGAPQWVRWALFLACYATGGWEPGLAGLQALQRRTLDVDLLMVVAAIGAAAIG QIAEGALLIVIFATSGALEALVTARTADSVRGLMGLAPGTATRVGAGGGEETVNAADL RIGDIVLVRPGERISADATVLAGGSEVDQATVTGEPLPVDKSIGDQVFAGTVNGTGAL RIRVDRLARDSVVARIATLVEQASQTKARTQLFIEKVEQRYSIGMVAVTLAVFAVPPL WGETLQRALLRAMTFMIVASPCAVVLATMPPLLAAIANAGRHGVLAKSAIVMEQLGTT TRIAFDKTGTLTRGTPELAGIWVYERRFTDDELLRLAAAAEYPSEHPLGAAIVKAAQS RRIRLPTVGEFTAHPGCRVTARVDGHVIAVGSATALLGTAGAAALEASMITAVDFLQG EGYTVVVVVCDSHPVGLLAITDQLRPEAAAAISAATKLTGAKPVLLTGDNRATADRLG VQVGIDDVRAGLLPDDKVAAVRQLQAGGARLTVVGDGINDAPALAAAHVGIAMGSARS ELTLQTADAVVVRDDLTTIPTVIAMSRRARRIVVANLIVAVTFIAGLVVWDLAFTLPL PLGVARHEGSTIIVGLNGLRLLRHTAWRRAAGTAHR" misc_feature complement(4193682..4193750) /gene="ctpJ" /locus_tag="Rv3743c" /note="PS01229 Hypothetical cof family signature 2" misc_feature complement(4194336..4194356) /gene="ctpJ" /locus_tag="Rv3743c" /note="PS00154 E1-E2 ATPases phosphorylation site" gene 4195440..4195802 /locus_tag="Rv3744" /db_xref="GeneID:885418" CDS 4195440..4195802 /locus_tag="Rv3744" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3744, (MTV025.092), len: 120 aa. Probable transcriptional regulator, possible arsR family, highly similar to many e.g. Q9ZBF4|SC9B5.26c from Streptomyces coelicolor (120 aa), FASTA scores: opt: 480, E(): 2.4e-24, (63.25% identity in 117 aa overlap); O31844|YOZA YOZA REGULATOR from Bacillus subtilis (107 aa), FASTA scores: opt: 249, E(): 1.6e-09, (44.8% identity in 96 aa overlap); P30340|SMTB_SYNP7|SMTB from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (122 aa), FASTA scores: opt: 230, E(): 2.9e-08, (46.0% identity in 87 aa overlap); etc. Equivalent to AAK48216 from Mycobacterium tuberculosis strain CDC1551 (135 aa) but shorter 15 aa. Also similar to MTCY27_22; MTCY39_25; and MTCY441_12. Contains helix-turn-helix motif at aa 47-68 (Score 1815, +5.37 SD). SEEMS TO BELONG TO THE ARSR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein ArsR-family" /protein_id="NP_218261.1" /db_xref="GI:15610880" /db_xref="GeneID:885418" /translation="MGHGVEGRNRPSAPLDSQAAAQVASTLQALATPSRLMILTQLRN GPLPVTDLAEAIGMEQSAVSHQLRVLRNLGLVVGDRAGRSIVYSLYDTHVAQLLDEAI YHSEHLHLGLSDRHPSAG" gene complement(4195886..4196098) /locus_tag="Rv3745c" /db_xref="GeneID:885597" CDS complement(4195886..4196098) /locus_tag="Rv3745c" /function="UNKNOWN" /note="Rv3745c, (MTV025.093c), len: 70 aa. Conserved hypothetical protein, highly similar to others e.g. N-terminus of Q9X4E6 HYPOTHETICAL 13.4 KDA PROTEIN from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (124 aa), FASTA scores: opt: 279, E(): 4.4e-14, (59.4% identity in 69 aa overlap); N-terminus of Q9A2A6|CC3660 HYPOTHETICAL PROTEIN from Caulobacter crescentus (172 aa) FASTA scores: opt: 272, E(): 1.9e-13, (63.35% identity in 60 aa overlap); N-terminus of P74345|SLR1628 HYPOTHETICAL 14.5 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (134 aa), FASTA scores: opt: 233, E(): 1.3e-10, (54.85% identity in 62 aa overlap); etc. TBparse score is 0.894." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218262.1" /db_xref="GI:15610881" /db_xref="GeneID:885597" /translation="MSDCNVLGGALEQGGTDPLTGFYRDGCCATGPEDLGWHTICAVM TTEFLAHQRSVGNDLSIARPPRWLRP" gene complement(4196171..4196506) /gene="PE34" /locus_tag="Rv3746c" /db_xref="GeneID:885764" CDS complement(4196171..4196506) /gene="PE34" /locus_tag="Rv3746c" /function="UNKNOWN" /note="Rv3746c, (MTV025.094c), len: 111 aa. Probable member of the Mycobacterium tuberculosis PE family (see citation below), but without the glycine-rich C-terminal part, similar to N-termini of many e.g. O69737|Rv3872|MTV027.07 (99 aa) FASTA scores: opt: 306, E(): 1e-13, (50.5% identity in 99 aa overlap); O53215|Rv2490c|MTV008.46 (1660 aa) FASTA scores: opt: 125, E(): 0.99, (34.25% identity in 111 aa overlap). Also weakly similar to MTV008_46; MTCI418B_6; MTCY130_1; MTY25D10_11; MTCY1A11_25; MTCY21B4_13; MTCY21B4_27; MTCY493_2; MTCY28_25; etc. TBparse score is 0.900." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_178011.1" /db_xref="GI:57117152" /db_xref="GeneID:885764" /translation="MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAE EVSAWAVTAFTTAATGLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIP RPGQTLARE" gene 4196724..4197107 /locus_tag="Rv3747" /db_xref="GeneID:885778" CDS 4196724..4197107 /locus_tag="Rv3747" /function="UNKNOWN" /note="Rv3747, (MTV025.095), len: 127 aa. Hypothetical protein, highly similar to downstream ORF O69715|Rv3748|MTV025.096 CONSERVED HYPOTHETICAL PROTEIN (119 aa), FASTA scores: opt: 494, E(): 6e-27, (64.4% identity in 118 aa overlap). TBparse score is 0.924." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218264.1" /db_xref="GI:15610883" /db_xref="GeneID:885778" /translation="MILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVL TQAEPDSSDRDITVEMRPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWV LVVTGGTGAISLPVLVSDMPATIGF" gene 4197236..4197595 /locus_tag="Rv3748" /db_xref="GeneID:885776" CDS 4197236..4197595 /locus_tag="Rv3748" /function="UNKNOWN" /note="Rv3748, (MTV025.096), len: 119 aa. Hypothetical protein, highly similar to upstream ORF O69714|Rv3747|MTV025.095 CONSERVED HYPOTHETICAL PROTEIN (127 aa), FASTA scores: opt: 496, E(): 2.5e-28, (64.4% identity in 118 aa overlap). TBparse score is 0.871." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218265.1" /db_xref="GI:15610884" /db_xref="GeneID:885776" /translation="MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLT QAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVL VVTGGAGTISLPLIVTG" gene complement(4197628..4198137) /locus_tag="Rv3749c" /db_xref="GeneID:885384" CDS complement(4197628..4198137) /locus_tag="Rv3749c" /function="UNKNOWN" /note="Rv3749c, (MTV025.097c), len: 169 aa. Hypothetical protein, showing some similarity with O85864 HYPOTHETICAL 21.4 KDA PROTEIN from Sphingomonas aromaticivorans plasmid pNL1 (196 aa), FASTA scores: opt: 148, E(): 0.011, (32.7% identity in 104 aa overlap); Q9LCU6 HYPOTHETICAL 21.2 KDA PROTEIN from Arthrobacter sp. TM1 (192 aa), FASTA scores: opt: 125, E(): 0.35, (31.5% identity in 92 aa overlap); Q9L631|SPCB MYO-INOSITOL-2-DEHYDROGENASE from Streptomyces spectabilis (374 aa); Q9WJP8|PRE-S1 PRE-S1 PROTEIN (FRAGMENT) from Hepatitis B virus (88 aa); etc. Contains PS00092 N-6 Adenine-specific DNA methylases signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218266.1" /db_xref="GI:15610885" /db_xref="GeneID:885384" /translation="MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFA FGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFL LDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFG LGVLLPFDQ" misc_feature complement(4197751..4197771) /locus_tag="Rv3749c" /note="PS00092 N-6 Adenine-specific DNA methylases signature" gene complement(4198205..4198597) /locus_tag="Rv3750c" /db_xref="GeneID:885807" CDS complement(4198205..4198597) /locus_tag="Rv3750c" /function="SEQUENCE EXCISION." /experiment="experimental evidence, no additional details recorded" /note="Rv3750c, (MTV025.098c), len: 130 aa. Possible excisionase, similar to others e.g. Q9LCU5 PUTATIVE EXCISIONASE from Arthrobacter sp. TM1 (174 aa) FASTA scores: opt: 297, E(): 1.2e-12, (40.35% identity in 114 aa overlap); O85865 PUTATIVE EXCISIONASE from Sphingomonas aromaticivorans plasmid pNL1 (152 aa), FASTA scores: opt: 223, E(): 7.3e-08, (39.15% identity in 97 aa overlap); Q9XBH1|XIS EXCISIONASE from Bacteroides fragilis (124 aa) FASTA scores: opt: 128, E(): 0.1, (30.7% identity in 88 aa overlap); etc. Also some similarity to transcriptional regulators. Also similar to Mycobacterium tuberculosis hypothetical proteins e.g. P71902|YN10_MYCTU|Rv2310|MT2372|MTCY3G12.24c (114 aa) FASTA scores: opt: 224, E(): 4.9e-08, (42.7% identity in 82 aa overlap). Contains helix-turn-helix motif at aa 55-76 (Score 1925,+5.74 SD)." /codon_start=1 /transl_table=11 /product="excisionase" /protein_id="NP_218267.1" /db_xref="GI:15610886" /db_xref="GeneID:885807" /translation="MTSLLEVLGAPEVSVCGNAGQPMTLPEPVRDALYNVVLALSQGK GISLVPRHLKLTTQEAADLLNISRPTLVRLLEDGRIPFEKPGRHRRVSLDALLEYQQE TRSNRRAALGELSRDALGELQAALAEKK" gene 4198874..4199089 /locus_tag="Rv3751" /db_xref="GeneID:885857" CDS 4198874..4199089 /locus_tag="Rv3751" /function="SEQUENCE INTEGRATION." /note="Rv3751, (MTV025.099), len: 71 aa. Probable integrase (fragment), similar to part of many e.g. Q48908 INTEGRASE (FRAGMENT) from Mycobacterium paratuberculosis (191 aa), FASTA scores: opt: 206, E(): 5.5e-08, (57.65% identity in 59 aa overlap); Q9ZWV7|INT INTEGRASE from Corynephage 304L (395 aa), FASTA scores: opt: 156, E(): 0.00036, (45.75% identity in 59 aa overlap); Q9K722|BH3551 INTEGRASE (PHAGE-RELATED PROTEIN) from Bacillus halodurans (378 aa), FASTA scores: opt: 151, E(): 0.00079, (46.15% identity in 52 aa overlap); etc. Also similarity with various conjugative transposons. Also similar to Mycobacterium tuberculosis hypothetical proteins e.g. P71903|Rv2309c|MTCY3G12.25 (151 aa), FASTA scores: opt: 193, E(): 3.8e-07, (50.85% identity in 59 aa overlap); O53403|Rv1055|MTV017.08 (78 aa), FASTA scores: opt: 171, E(): 7.8e-06, (54.15% identity in 48 aa overlap); etc." /codon_start=1 /transl_table=11 /product="integrase" /protein_id="NP_218268.1" /db_xref="GI:15610887" /db_xref="GeneID:885857" /translation="MKRAKVQQITPHDLRHTAASLAVSAGVNVLALQRILGHKSAKVT LDTYADLFDADLDAVAVTLGKDADQQT" gene complement(4199131..4199217) /locus_tag="Rvnt42" /note="tRNA-Ser(CGA)" /db_xref="GeneID:2700471" tRNA complement(4199131..4199217) /locus_tag="Rvnt42" /product="tRNA-Ser" /note="codon recognized: UCG" /anticodon=(pos:4199181..4199183,aa:Ser) /db_xref="GeneID:2700471" gene complement(4199247..4199705) /locus_tag="Rv3752c" /db_xref="GeneID:885586" CDS complement(4199247..4199705) /locus_tag="Rv3752c" /EC_number="3.5.4.-" /function="UNKNOWN; PROBABLY INVOLVED IN DEAMINATION OF SPECIFIC SUBSTRATE." /note="Rv3752c, (MTV025.100c), len: 152 aa. Probable cytidine/deoxycytidylate deaminase (EC 3.5.4.-), equivalent to Q9CB32|ML2474 POSSIBLE CYTIDINE/DEOXYCYTIDYLATE DEAMINASE from Mycobacterium leprae (171 aa), FASTA scores: opt: 890, E(): 1.6e-50, (88.1% identity in 151 aa overlap). Also highly similar to other deaminases and hypothetical proteins e.g. Q9AK79|2SCD60.04c PUTATIVE DEAMINASE from Streptomyces coelicolor (143 aa), FASTA scores: opt: 559, E(): 2.9e-29, (66.45% identity in 146 aa overlap); Q9F9W7 CYTOSINE DEAMINASE from Bifidobacterium longum (143 aa) FASTA scores: opt: 512, E(): 3.1e-26, (54.85% identity in 144 aa overlap); P21335|YAAJ_BACSU HYPOTHETICAL 17.8 KDA PROTEIN from Bacillus subtilis (161 aa), FASTA scores: opt: 425, E(): 1.4e-20, (47.7% identity in 151 aa overlap); AAK74212|SP0020 CYTIDINE/DEOXYCYTIDYLATE DEAMINASE FAMILY PROTEIN from Streptococcus pneumoniae (155 aa), FASTA scores: opt: 401, E(): 4.7e-19, (46.25% identity in 147 aa overlap); P30134|YFHC_ECOLI|B2559 HYPOTHETICAL 20.0 KDA PROTEIN from Escherichia coli strain K12 (178 aa), FASTA scores: opt: 397, E(): 9.5e-19, (47.0% identity in 149 aa overlap); etc. Contains PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature. BELONGS TO THE CYTIDINE AND DEOXYCYTIDYLATE DEAMINASES FAMILY. TBparse score is 0.866." /codon_start=1 /transl_table=11 /product="cytidine/deoxycytidylate deaminase" /protein_id="NP_218269.1" /db_xref="GI:15610888" /db_xref="GeneID:885586" /translation="MTTDEDLIRAALAVAATAGPRDVPVGAVVVGADGTELARAVNAR EALGDPTAHAEILAMRLAAGVLGDGWRLEGTTLAVTVEPCTMCAGALVLARVARLVFG AWEPKTGAVGSLWDVVRDRRLNHRPEVRGGVLARECAAPLEAFFARQRLG" misc_feature complement(4199433..4199549) /locus_tag="Rv3752c" /note="PS00903 Cytidine and deoxycytidylate deaminases zinc-binding region signature" gene complement(4199721..4200221) /locus_tag="Rv3753c" /db_xref="GeneID:885505" CDS complement(4199721..4200221) /locus_tag="Rv3753c" /function="UNKNOWN" /note="Rv3753c, (MTV025.101c), len: 166 aa. Conserved hypothetical protein, only equivalent to Q9CB33|ML2473 HYPOTHETICAL PROTEIN from Mycobacterium leprae (159 aa) FASTA scores: opt: 920 E(): 1.4e-52,, (88.6% identity in 158 aa overlap). TBparse score is 0.877." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218270.1" /db_xref="GI:15610889" /db_xref="GeneID:885505" /translation="MQRPAADTPDGFGVAVVREEGRWRCSPMGPKALTSLRAAETELR ELRSAGAVFGLLDVDDEFFVIVRPAPSGTRLLLSDATAALDYDIAAEVLDNLDAEIDP EDLEDADPFEEGDLGLLSDIGLPEAVLGVILDETDLYADEQLGRIAREMGFADQLSAV IDRLGR" gene 4200421..4201326 /gene="tyrA" /locus_tag="Rv3754" /db_xref="GeneID:885559" CDS 4200421..4201326 /gene="tyrA" /locus_tag="Rv3754" /EC_number="1.3.1.12" /function="INVOLVED IN TYROSINE BIOSYNTHESIS [CATALYTIC ACTIVITY: PREPHENATE + NAD(+) = 4-HYDROXYPHENYLPYRUVATE + CO(2) + NADH]." /note="catalyzes the formation of 4-hydroxyphenylpyruvate from prephenate" /codon_start=1 /transl_table=11 /product="prephenate dehydrogenase" /protein_id="NP_218271.1" /db_xref="GI:15610890" /db_xref="GeneID:885559" /translation="MRAAAAAGREVFGYNRSVEGAHGARSDGFDAITDLNQTLTRAAA TEALIVLAVPMPALPGMLAHIRKSAPGCPLTDVTSVKCAVLDEVTAAGLQARYVGGHP MTGTAHSGWTAGHGGLFNRAPWVVSVDDHVDPTVWSMVMTLALDCGAMVVPAKSDEHD AAAAAVSHLPHLLAEALAVTAAEVPLAFALAAGSFRDATRVAATAPDLVRAMCEANTG QLAPAADRIIDLLSRARDSLQSHGSIADLADAGHAARTRYDSFPRSDIVTVVIGADKW REQLAAAGRAGGVITSALPSLDSPQ" gene complement(4201289..4201888) /locus_tag="Rv3755c" /db_xref="GeneID:885298" CDS complement(4201289..4201888) /locus_tag="Rv3755c" /function="UNKNOWN" /note="Rv3755c, (MTV025.103c), len: 199 aa. Conserved hypothetical protein showing similarity to CAC47343|SMC03980 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (196 aa) FASTA scores: opt: 244, E(): 4.1e-09, (30.9% identity in 191 aa overlap); Q9I2B5|PA1994 from Pseudomonas aeruginosa (187 aa), FASTA scores: opt: 226, E(): 6e-08, (29.9% identity in 194 aa overlap); and Q98N73|MLR0268 HYPOTHETICAL PROTEIN (183 aa), FASTA scores: opt: 234, E(): 1.8e-08, (27.05% identity in 185 aa overlap). TBparse score is 0.925." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218272.1" /db_xref="GI:15610891" /db_xref="GeneID:885298" /translation="MNAVPSDLTPRVWPAMLTWRAQDISRMESVRVQLSGKRIRANGR IVAAATANNPAFGAHYDLQTDETGATKRFGLTVTLAERERQLAIARDEENMWLVTDHQ GERRAAYNGALDIDLVFSPFFNALPIRRLGLHERAESIALPVVYVNVPEMSVDAATVS YTSEGRLDGIKLRSPVADTTVTVDSDGFIVDYPGLAERM" gene complement(4201894..4202613) /gene="proZ" /locus_tag="Rv3756c" /db_xref="GeneID:885534" CDS complement(4201894..4202613) /gene="proZ" /locus_tag="Rv3756c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3756c, (MTV025.104c), len: 239 aa. Possible proZ, osmoprotectant transport integral membrane protein ABC transporter (see citation below), similar to osmoprotection proteins (proW, proZ) involved in glycine betaine/L-proline/choline transport, e.g. BAB58609|Q99RI4|OPUCB|SA2236|SAV2447 OPUCB PROTEIN (PROBABLE GLYCINE BETAINE/CARNITINE/CHOLINE ABC TRANSPORTER) from Staphylococcus aureus (211 aa) FASTA scores: opt: 434, E(): 2.5e-18, (36.6% identity in 194 aa overlap); Q45461|OPBB_BACSU|OPUBB|PROW CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN (mediate the uptake of choline for synthesis of the osmoprotectant glycine betaine) from Bacillus subtilis (217 aa), FASTA scores: opt: 402, E(): 1.9e-16, (32.0% identity in 203 aa overlap); O34878|OPCB_BACSU|OPUCB GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (217 aa), FASTA scores: opt: 385, E(): 1.8e-15, (30.2% identity in 222 aa overlap); P39775|O34657|OPUBD|PROZ|OPBD_BACSU CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (226 aa) FASTA scores: opt: 350, E(): 2e-13, (31.75% identity in 208 aa overlap); etc. COULD BELONG TO THE CYSTW SUBFAMILY. TBparse score is 0.911." /codon_start=1 /transl_table=11 /product="osmoprotectant (glycine betaine/carnitine/choline/L-proline) transport integral membrane protein ABC transporter PROZ" /protein_id="NP_218273.1" /db_xref="GI:15610892" /db_xref="GeneID:885534" /translation="MNFLQQALSYLLTASNWTGPVGLAVRTCEHLEYTAVAVAASALI AVPVGLLIGHTGRGTLLVVGAVNGLRALPTLGVLLLGVLLFGLGLGPPLVALMLLGIP SLLASTYAGIASVDPLVVDAARAMGMTESQVLLRVEVPNALPLMLGGLRSATLQVVAT ATVAAYASLGGLGGYLIDGIKERRFHIALVGAMMVAALALTLDGLLALAGWVSVPGTG RMRKLAAVVDKPAAGGGHALR" gene complement(4202610..4203299) /gene="proW" /locus_tag="Rv3757c" /db_xref="GeneID:885581" CDS complement(4202610..4203299) /gene="proW" /locus_tag="Rv3757c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3757c, (MTV025.105c), len: 225 aa. Possible proW, osmoprotectant transport integral membrane protein ABC transporter (see citation below), similar to osmoprotection proteins (proW, proZ) involved in glycine betaine/L-proline/choline transport, e.g. BAB58607|Q99RI6|OPUCD|SA2234|SAV2445 OPUCD PROTEIN (PROBABLE GLYCINE BETAINE/CARNITINE/CHOLINE ABC TRANSPORTER) from Staphylococcus aureus (231 aa) FASTA scores: opt: 364, E(): 7.1e-15, (30.0% identity in 220 aa overlap); Q45461|OPBB_BACSU|OPUBB|PROW CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN (mediate the uptake of choline for synthesis of the osmoprotectant glycine betaine) from Bacillus subtilis (217 aa), FASTA scores: opt: 348, E(): 6.2e-14, (31.05% identity in 206 aa overlap); O34878|OPCB_BACSU|OPUCB GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (217 aa), FASTA scores: opt: 343, E(): 1.2e-13, (30.1% identity in 206 aa overlap); O34742|OPCD_BACSU|OPUCD GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT SYSTEM PERMEASE PROTEIN from Bacillus subtilis (229 aa) FASTA scores: opt: 337, E(): 2.9e-13, (31.1% identity in 193 aa overlap); etc. COULD BELONG TO THE CYSTW SUBFAMILY." /codon_start=1 /transl_table=11 /product="osmoprotectant (glycine betaine/carnitine/choline/L-proline) transport integral membrane protein ABC transporter PROW" /protein_id="NP_218274.1" /db_xref="GI:15610893" /db_xref="GeneID:885581" /translation="MHYLMTHPGAAWALTVVHLRLSLLPVLIGLMSAVPLGLLVQRAP LLRRLTTATASVIFTIPSLALFVVLPLIIGTRILDEANVIVALAAYTTALLVRAVLEA LDAVPAQVHDAATAIGYSRIAQMLKVELPLSIPVLVAGLRVVAVTNIAMVSVGSVIGI GGLGTWFTAGYQTNKSDQIVAGVVAMFLLAIVVDVVINLAGRLATPWERAPRAARRRR QVAAPITGGAR" gene complement(4203287..4204417) /gene="proV" /locus_tag="Rv3758c" /db_xref="GeneID:886293" CDS complement(4203287..4204417) /gene="proV" /locus_tag="Rv3758c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) ACROSS THE MEMBRANE (IMPORT). RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv3758c, (MTV025.106c), len: 376 aa. Possible proV, osmoprotectant transport ATP-binding protein ABC transporter (see citation below), highly similar to osmoprotection proteins (proV) involved in glycine betaine/L-proline/choline transport, e.g. BAB58610|Q99RI3|OPUCA|SA2237|SAV2448 GLYCINE BETAINE/CARNITINE/CHOLINE ABC TRANSPORTER (ATP-BINDING) from Staphylococcus aureus (410 aa), FASTA scores: opt: 816, E(): 8.4e-39, (39.5% identity in 362 aa overlap); O34992|OPCA_BACSU|OPUCA GLYCINE BETAINE/CARNITINE/CHOLINE TRANSPORT ATP-BINDING PROTEIN from Bacillus subtilis (380 aa), FASTA scores: opt: 807, E(): 2.5e-38, (40.55% identity in 333 aa overlap); Q45460|OPBA_BACSU|OPUBA|PROV CHOLINE TRANSPORT ATP-BINDING PROTEIN from Bacillus subtilis (381 aa), FASTA scores: opt: 801, E(): 5.6e-38, (40.65% identity in 337 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop) and PS00211 ABC transporter family signature. BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS). TBparse score is 0.896." /codon_start=1 /transl_table=11 /product="osmoprotectant (glycine betaine/carnitine/choline/L-proline) transport ATP-binding protein ABC transporter PROV" /protein_id="NP_218275.1" /db_xref="GI:15610894" /db_xref="GeneID:886293" /translation="MICFDDVSKVYAHGATAVDRLTLEVPNGMLTVFVGPSGCGKTTA LRMINRMVDPTSGTITVDGTDVSTVNAVKLRLGIGYVIQNAGLMPHQRVIDNVATVPV LKGQPRRAARKAGYEVLERVGLDPKVATRYPAQLSGGEQQRVGVARALAADPPILLMD EPFSAVDPVVRHELQNEILRLQAELHKTIVFVTHDIDEALKLADLVAVFAPGGALAQY DETARLLSSPANDFVSKFIGLGRGYRWLQLFDAAGLPVRDIEQVSVNGLSDARDRQVR DGWVLVVDGAGAPLGWIDADGRRRHRGGAALSDAMTVGGSVFRPNGNLSQALDAALSS PSGVGVAVDGGGKVIGGILAADVLAEFQKGKKAGGGAKPCTT" misc_feature complement(4203968..4204012) /gene="proV" /locus_tag="Rv3758c" /note="PS00211 ABC transporters family signature" misc_feature complement(4204292..4204315) /gene="proV" /locus_tag="Rv3758c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(4204426..4205373) /gene="proX" /locus_tag="Rv3759c" /db_xref="GeneID:886270" CDS complement(4204426..4205373) /gene="proX" /locus_tag="Rv3759c" /function="THOUGHT TO BE INVOLVED IN ACTIVE TRANSPORT OF OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) ACROSS THE MEMBRANE (IMPORT)." /note="Rv3759c, (MTV025.107c), len: 315 aa. Possible proX, osmoprotectant-binding lipoprotein component of osmoprotectant transport system (see citation below), similar to osmoprotection proteins (proX) involved in glycine betaine/L-proline/choline transport, e.g. AAK79442|CAC1474 PROLINE/GLYCINE BETAINE ABC TRANSPORT SYSTEM PERIPLASMIC COMPONENT from Clostridium acetobutylicum (303 aa), FASTA scores: opt: 308, E(): 1.2e-11, (27.4% identity in 314 aa overlap); Q9X4J2|PROXL|SCE19A.33 PROXL PROTEIN from Streptomyces coelicolor (322 aa), FASTA scores: opt: 302, E(): 3e-11, (27.2% identity in 327 aa overlap); O29280|AF0982 OSMOPROTECTION PROTEIN (PROX) from Archaeoglobus fulgidus (292 aa), FASTA scores: opt: 235, E(): 3.4e-07, (23.15% identity in 285 aa overlap); etc. Also similar to MTV006_16 HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis, and MLU15180_43 HYPOTHETICAL PROTEIN from Mycobacterium leprae. Equivalent to AAK48230 from Mycobacterium tuberculosis strain CDC1551 (343 aa) but shorter 28 aa. Contains probable N-terminal signal sequence." /codon_start=1 /transl_table=11 /product="osmoprotectant (glycine betaine/carnitine/choline/L-proline) binding lipoprotein PROX" /protein_id="NP_218276.1" /db_xref="GI:15610895" /db_xref="GeneID:886270" /translation="MRMLRRLRRATVAAAVWLATVCLVASCANADPLGSATGSVKSIV VGSGDFPESQVIAEIYAQVLQANGFDVGRRLGIGSRETYILALKDHSIDLVPEYIGNL LLYFQPDATVTMLDAVELELYKRLPGDLSILTPSPASDTDTVTVTAATAARWNLKTIA DLAPHSADVKFAAPSAFQTRPSGLPGLRHKYSLDIAPGNFVTINDGGGAVTVRALVEG TATAANLFSTSAAIPQNHLVVLEDPEHNFLAGNIVPLVNSRKKSDHLKDVLDAVSAKL TTAGLAELNAAVSGNSGVDPDQAARKWVRDNGFDHPVRQ" gene 4205538..4205840 /locus_tag="Rv3760" /db_xref="GeneID:886093" CDS 4205538..4205840 /locus_tag="Rv3760" /function="UNKNOWN" /note="Rv3760, (MTV025.108), len: 100 aa. Possible conserved membrane protein, equivalent to Q50094|ML2366|MLCB12.11c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (113 aa), FASTA scores: opt: 423, E(): 1.2e-20, (67.7% identity in 99 aa overlap). Also similar with Q9JST1|NMA2149 PUTATIVE INNER MEMBRANE HYPOTHETICAL PROTEIN from Neisseria meningitidis (serogroup A) (104 aa), FASTA scores: opt: 113, E(): 0.95, (33.85% identity in 62 aa overlap); and showing similarity with Q9ZAX7 ABC TRANSPORTER MEMBRANE PROTEIN SUBUNIT from Streptococcus mutans (498 aa), FASTA scores: opt: 108, E(): 6.7, (42.35% identity in 85 aa overlap) (similarity at C-terminus); and P33108|SECY_MICLU PREPROTEIN TRANSLOCASE SECY SUBUNIT from Micrococcus luteus (Micrococcus lysodeikticus) (436 aa), FASTA scores: opt: 106, E(): 8.2, (29.05% identity in 86 aa overlap). Equivalent to AAK48231 from Mycobacterium tuberculosis strain CDC1551 (117 aa) but shorter 17 aa. TBparse score is 0.880." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218277.1" /db_xref="GI:15610896" /db_xref="GeneID:886093" /translation="MPGSVPGKAPEEPPVKFTRAAAVWSALIVGFLILILLLIFIAQN TASAQFAFFGWRWSLPLGVAILLAAVGGGLITVFAGTARILQLRRAAKKTHAAALR" gene complement(4205862..4206917) /gene="fadE36" /locus_tag="Rv3761c" /db_xref="GeneID:886098" CDS complement(4205862..4206917) /gene="fadE36" /locus_tag="Rv3761c" /EC_number="1.3.99.-" /function="UNKNOWN, BUT POSSIBLY INVOLVEMENT IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="Rv3761c, (MTV025.109c), 351 aa. Possible fadE36, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many conserved hypothetical proteins and showing some similarity with few acyl-CoA dehydrogenases, e.g. Q9APX7|FADE36 FADE36 PROTEIN from Pseudomonas aeruginosa (360 aa), FASTA scores: opt: 147, E(): 0.046, (26.15% identity in 214 aa overlap); part of AAB52261.2|U97002 protein similar to acyl-CoA dehydrogenases and epoxide hydrolases from Caenorhabditis elegans (985 aa), FASTA score: (31.2% identity in 324 aa overlap). C-terminal part is highly similar to Q50095|U1740AK|MLU15183_45 hypothetical protein from Mycobacterium leprae cosmid B174 (122 aa), FASTA scores: opt: 341, E(): 7.3e-15, (57.6% identity in 99 aa overlap). Contains PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2. TBparse score is 0.910." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase" /protein_id="NP_218278.1" /db_xref="GI:15610897" /db_xref="GeneID:886098" /translation="MTSVDRLDGLDLGALDRYLRSLGIGRDGELRGELISGGRSNLTF RVYDDASSWLVRRPPLHGLTPSAHDMAREYRVVAALGDTPVPVARTISLCQDDSVLGA PFQVVEFVAGQVVRRRAELEALGSRSVIEGCVDALIRVLVDLHSIDPKAVGLSDFGKP DGYLERQVRRWGSQWELVRLPDDHRDADISRLHLALQQAIPQQSRTSIVHGDYRIDNT ILDTDDPCHVRAVVDWELSTLGDPLSDAALMCVYRDPALDLIVHAQAAWTSPLLPAAD ELADRYSLVSGQPLGHWEFYMALAYFKLAIIAAGIDYRRRMSEQAEGKDTAAESVPDV VAPLIARGLAEIAKKSG" misc_feature complement(4206888..4206917) /gene="fadE36" /locus_tag="Rv3761c" /note="PS00339 Aminoacyl-transfer RNA synthetases class-II signature 2" gene complement(4206996..4208876) /locus_tag="Rv3762c" /db_xref="GeneID:886096" CDS complement(4206996..4208876) /locus_tag="Rv3762c" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3762c, (MTV025.110c), len: 626 aa. Possible hydrolase (EC 3.-.-.-), highly similar to hypothetical proteins and beta-lactamases (EC 3.5.2.6) e.g. Q9RL04|SC5G9.23 HYPOTHETICAL 70.3 KDA PROTEIN from Streptomyces coelicolor (648 aa), FASTA scores: opt: 2088, E(): 3.7e-124, (52.9% identity in 624 aa overlap); P32717|YJCS_ECOLI|B4083 HYPOTHETICAL 73.2 KDA PROTEIN from Escherichia coli strain K12 (661 aa), FASTA scores: opt: 1911, E(): 5.7e-113, (46.9% identity in 631 aa overlap); Q9A824|CC1540 METALLO-BETA-LACTAMASE FAMILY PROTEIN from Caulobacter crescentus (647 aa), FASTA scores: opt: 1891, E(): 1e-111, (48.55% identity in 628 aa overlap); Q08347|YOL164W CHROMOSOME XV READING FRAME ORF from Saccharomyces cerevisiae (Baker's yeast) (646 aa) FASTA scores: opt: 1829, E(): 8.4e-108, (45.7% identity in 615 aa overlap); Q9I5I9|PA0740 PROBABLE BETA-LACTAMASE from Pseudomonas aeruginosa (658 aa), FASTA scores: opt: 1699, E(): 1.4e-99, (43.15% identity in 630 aa overlap); Q52556|SDSA ALKYL SULFATASE (protein involved in the degradation of sulfate esters of long-chain primaryal cohols e.g. SDS sodium dodecyl sulfate) from Pseudomonas sp (528 aa), FASTA scores: opt: 841, E(): 1.7e-45, (33.7% identity in 534 aa overlap); etc. N-terminual end also highly similar to Q48790|SEPA SEPA PROTEIN (protein implicated in cell separation) from Listeria monocytogenes (391 aa), FASTA scores: opt: 1256, E(): 8.3e-72, (49.6% identity in 363 aa overlap). Also slight similarity to P96253|Rv0407|MTCY22G10.03 HYPOTHETICAL 37.0 KDA PROTEIN from Mycobacterium tuberculosis (336 aa). TBparse score is 0.897." /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="NP_218279.1" /db_xref="GI:15610898" /db_xref="GeneID:886096" /translation="MPMEHKPPTAVIQAAHGEHSLPLHDTTDFDDADRGFIAALSPCV IKAADGRVVWDNDAYSFLDGAAPTSVHPSLWRQSQLTAKQGLYQVVPGIYQVRGFDIS NISFVEGDTGLIVIDPLVSTEVAAAALDLYRAHRGADRPVVAVIYTHSHVDHFGGVLG VTTQADVDAGKVAVLAPEGFTAHAVQENIYAGSAMMRRAGYMYGTVLARGLRGHVGCG LGQTLSTGEVSLVVPTVDITETGETHTIDGVEIEFQMAPGTEAPAEMHFYFPRFRALC MAENATHNLHNLLTLRGALVRDPRAWSGYLTEAIDTFADRTDVVFASHHWPTWGREKI VEFLSQQRDMYSYLHDQTLRLLNQGYTGVEIAEMFQLPPALQRAWHTHGYYGSVSHNV KAIYQRYMGWFDGNPGWLWPHPPEALAPRYVDALGGIDRVLELAREAFDAGDFRWAAT LLDHAVFADSEHAAARGLYADTLEQLAYGAECATWRNFFLTGAAELRDGNPGSSGQVP APTFFAQLTPDQIFDVLAISINGPRAWDLDLAIDFTFTEPDVNYRLTLRNGVLIHRKL PADPATANATVTVGDKVRLVAAALGDISSPGFEVFGDRTVLQTFLSVLDRPDSAFNIV TP" gene 4209047..4209526 /gene="lpqH" /locus_tag="Rv3763" /db_xref="GeneID:886097" CDS 4209047..4209526 /gene="lpqH" /locus_tag="Rv3763" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3763, (MTV025.111), len: 159 aa. lpqH, conserved 19 KDa lipoprotein antigen precursor (see citations below), equivalent to P31502|19KD_MYCIT|MI22 19 KDA LIPOPROTEIN ANTIGEN PRECURSOR (MI22 ANTIGEN) from Mycobacterium intracellulare (162 aa), FASTA scores: opt: 773, E(): 6.2e-35, 75.95(% identity in 162 aa overlap); P46733|19KD_MYCAV 19 KDA LIPOPROTEIN ANTIGEN PRECURSOR from Mycobacterium avium (161 aa), FASTA scores: opt: 743, E(): 2.5e-33, (72.5% identity in 160 aa overlap); and Q9X7A5|LPQH|ML1966 POSSIBLE LIPOPROTEIN from Mycobacterium leprae FASTA scores: opt: 371, E(): 2.2e-13, (42.6% identity in 162 aa overlap). POSSIBLY ATTACHED TO THE MEMBRANE BY A LIPID ANCHOR. SIMILAR TO OTHER MYCOBACTERIUM 19 KDA ANTIGEN. Contains PS00013 Prokaryotic membrane lipoprotein lipid attachment site." /codon_start=1 /transl_table=11 /product="19 kDa lipoprotein antigen precursor LPQH" /protein_id="NP_218280.1" /db_xref="GI:15610899" /db_xref="GeneID:886097" /translation="MKRGLTVAVAGAAILVAGLSGCSSNKSTTGSGETTTAAGTTASP GAASGPKVVIDGKDQNVTGSVVCTTAAGNVNIAIGGAATGIAAVLTDGNPPEVKSVGL GNVNGVTLGYTSGTGQGNASATKDGSHYKITGTATGVDMANPMSPVNKSFEIEVTCS" misc_feature 4209080..4209112 /gene="lpqH" /locus_tag="Rv3763" /note="PS00013 Prokaryotic membrane lipoprotein lipid attachment site" gene complement(4209582..4211009) /locus_tag="Rv3764c" /db_xref="GeneID:886094" CDS complement(4209582..4211009) /locus_tag="Rv3764c" /EC_number="2.7.3.-" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv3764c, (MTV025.112c), len: 475 aa. Possible histidine protein kinase (EC 2.7.3.-), part of a two-component regulatory system, similar to others e.g. Q9ADN6|2SC10A7.25 PUTATIVE TWO COMPONENT SYSTEM HISTIDINE KINASE from Streptomyces coelicolor (524 aa), FASTA scores: opt: 1332, E(): 5.4e-70, (49.9% identity in 477 aa overlap); Q9L3C1|KB|CAC42479 PUTATIVE HISTIDINE KINASE from Amycolatopsis mediterranei (469 aa), FASTA scores: opt: 515, E(): 1.4e-22, (36.1% identity in 313 aa overlap); P72560 HISTIDINE PROTEIN KINASE from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (438 aa), FASTA scores: opt: 480, E(): 1.4e-20, (40.1% identity in 232 aa overlap); P30847|P76401|BAES_ECOLI|B2078 SENSOR PROTEIN from Escherichia coli strain K12 (467 aa); etc. Also similar to others from Mycobacterium tuberculosis e.g. P96368|Rv1032c|MTCY10G2.17 (509 aa), FASTA scores: opt: 1007, E(): 4e-51, (43.5% identity in 416 aa overlap); and P71815|Rv0758|MTCY369.03 (485 aa), FASTA scores: opt: 738, E(): 1.6e-35, (28.6% identity in 438 aa overlap). Equivalent to AAK48235 from Mycobacterium tuberculosis strain CDC1551 (506 aa) but shorter 31 aa. TBparse score is 0.916." /codon_start=1 /transl_table=11 /product="two component sensor kinase" /protein_id="NP_218281.1" /db_xref="GI:15610900" /db_xref="GeneID:886094" /translation="MGITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWR HETHNYIRSGPGPRFLDAPGQPAGMVAAVVSDGTTVAAGYLTGSGSRAALTSTGRSQL ERIAGSRTPLTLDLDGLGRYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVT VIALVAATTAGIVIIKRALAPLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTE VGQLGSALNRMLDHIAAALSARQASETCVRQFVADASHELRTPLAAIRGYTELTQRIG DDPEAVAHAMSRVASETERITRLVEDLLLLARLDSGRPLERGPVDMSRLAVDAVSDAH VAGPDHQWALDLPPEPVVIPGDAARLHQVVTNLLANARVHTGPGTIVTTRLSTGPTHV VLQVIDNGPGIPAALQSEVFERFARGDTSRSRQAGSTGLGLAIVSAVVKAHNGTITVS SSPGYTEFAVRLPLDGWQPLESSPR" gene complement(4211080..4211784) /locus_tag="Rv3765c" /db_xref="GeneID:886100" CDS complement(4211080..4211784) /locus_tag="Rv3765c" /function="SENSOR PART OF A TWO COMPONENT REGULATORY SYSTEM." /note="Rv3765c, (MTV025.113c), len: 234 aa. Probable response regulator of a two-component regulatory system, highly similar to others e.g. Q9ADN7|2SC10A7.24 PUTATIVE TWO COMPONENT SYSTEM RESPONSE REGULATOR from Streptomyces coelicolor (271 aa), FASTA scores: opt: 1111, E(): 4.8e-63, (72.3% identity in 231 aa overlap); Q9F161 RESPONSE REGULATOR from Corynebacterium glutamicum (Brevibacterium flavum) (232 aa), FASTA scores: opt: 692, E(): 1.2e-36, (46.0% identity in 226 aa overlap); Q9KZU5|SCD84.23c PUTATIVE TWO-COMPONENT SYSTEN RESPONSE REGULATOR from Streptomyces coelicolor (248 aa), FASTA scores: opt: 674, E(): 1.7e-35, (44.05% identity in 236 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. Q50806|Rv1033c|MTCY10G2.16 RESPONSE REGULATOR HOMOLOG (257 aa), FASTA scores: opt: 947, E(): 1e-52, (59.5% identity in 232 aa overlap); P71814|Rv0757|MTCY369.02 PHOP-LIKE PROTEIN (247 aa) FASTA scores: opt: 829, E(): 2.8e-45, (54.65% identity in 225 aa overlap); O53894|Rv0981|MTV044.09 (230 aa), FASTA scores: opt: 662, E(): 9e-35, (44.65% identity in 224 aa overlap); and also similar to MTCY31_34; MTCY19H5_20; MTY13628_5; MTCY20G9_17; and to MLCB57_27 from Mycobacterium leprae; and MBY13627_3 from Mycobacterium bovis BCG. Equivalent to AAK48236 from Mycobacterium tuberculosis strain CDC1551 (286 aa) but shorter 52 aa. THE N-TERMINAL REGION IS SIMILAR TO THAT OF OTHER REGULATORY COMPONENTS OF SENSORY TRANSDUCTION SYSTEMS. SIMILAR TO BACTERIAL REGULATORY PROTEINS INVOLVED IN SIGNAL TRANSDUCTION. TBparse score is 0.899." /codon_start=1 /transl_table=11 /product="two component transcriptional regulatory protein" /protein_id="NP_218282.1" /db_xref="GI:15610901" /db_xref="GeneID:886100" /translation="MRRADGQPVTVLVVDDEPVLAEMVSMALRYEGWNITTAGDGSSA IAAARRQRPDVVVLDVMLPDMSGLDVLHKLRSENPGLPVLLLTAKDAVEDRIAGLTAG GDDYVTKPFSIEEVVLRLRALLRRTGVTTVDSGAQLVVGDLVLDEDSHEVMRAGEPVS LTSTEFELLRFMMHNSKRVLSKAQILDRVWSYDFGGRSNIVELYISYLRKKIDNGREP MIHTLRGAGYVLKPAR" gene 4212293..4212982 /locus_tag="Rv3766" /db_xref="GeneID:886099" CDS 4212293..4212982 /locus_tag="Rv3766" /function="UNKNOWN" /note="Rv3766, (MTV025.114), len: 229 aa. Hypothetical unknown protein. Segment 183 to 229 highly similar to C-terminal part of O06288|Rv3594|MTCY07H7B.28c CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis (275 aa), FASTA scores: opt: 128, E(): 0.92, (46.8% identity in 47 aa overlap). TBparse score is 0.943." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218283.1" /db_xref="GI:15610902" /db_xref="GeneID:886099" /translation="MRSAFDSGRLTFGIVYTYARPNWWANANTVRSMIDAAGGLHPRV ALMLDVESGGNPPGDGSSWINRLYWNLADYAGSPVRIIGYANAYDFFNMWRVRPAGLR VIGAGYGSNPNLPGQVAHQYTDGSGYSPNLPQGAPPFGRCDMNSANGLTPQQFAAACG VTTTGGPLMALTDEEQTELLTKVREIWDQLRGPNGAGWPQLGQNEQGQDLTPVDAIAV IKNDVAAMLAE" gene complement(4212996..4213940) /locus_tag="Rv3767c" /db_xref="GeneID:886101" CDS complement(4212996..4213940) /locus_tag="Rv3767c" /function="UNKNOWN" /note="Rv3767c, (MTV025.115c, MTCY13D12.01), len: 314 aa. Conserved hypothetical protein, similar to other Mycobacterium tuberculosis hypothetical proteins e.g. P96823|Rv0146|MTCI5.20 HYPOTHETICAL 34.0 KDA PROTEIN (310 aa), FASTA scores: opt: 909, E(): 5.3e-50, (48.1% identity in 316 aa overlap); O53686|Rv0281|MTV035.09 (302 aa), FASTA scores: opt: 802, E(): 2.8e-43, (45.2% identity in 314 aa overlap); Q50726|YX99_MYCTU|Rv3399|MT3507|MTCY78.29c (348 aa), FASTA scores: opt: 796, E(): 7.6e-43, (45.35% identity in 302 aa overlap); MTCY78_30; MTCY31_23; MTCY210_45; MTCY4C12_14; MTY13D12_21, MTCI5_19; MTCY180_22; etc. Contains probable N-terminal signal sequence" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218284.1" /db_xref="GI:15610903" /db_xref="GeneID:886101" /translation="MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARI FVDAAGDGIWSMYTNRTLLAGATDLDPDLRAPIQQMIDFMAARTAFFDEYFLATADAG VRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQPASQLVNVPI DLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLLFERIDALSRPGSWLASNV PGAGFLDPERMRRQRADMRRMRAAAAKLVETEISDVDDLWYAEQRTAVAEWLRERGWD VSTATLPELLARYGRSIPHSGEDSIPPNLFVSAQRATS" gene 4214070..4214429 /locus_tag="Rv3768" /db_xref="GeneID:886102" CDS 4214070..4214429 /locus_tag="Rv3768" /function="UNKNOWN" /note="Rv3768, (MTCY13D12.02), len: 119 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218285.1" /db_xref="GI:15610904" /db_xref="GeneID:886102" /translation="MGSTPPRTPQEVFAHHGQALAAGDLDEIVADYADDSFVITPAGI ARGKEGIRQLFVKLLDDIPNALWDLKTQIFEGDILFLEWTANSAVSRVDDGVDTFVFR DGTIWAHTVRYTPHPKT" gene 4214615..4214887 /locus_tag="Rv3769" /db_xref="GeneID:885239" CDS 4214615..4214887 /locus_tag="Rv3769" /function="UNKNOWN" /note="Rv3769, (MTCY13D12.03), len: 90 aa. Hypothetical unknown protein, possible coiled-coil protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218286.1" /db_xref="GI:15610905" /db_xref="GeneID:885239" /translation="MTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVRE HTGRLDRVTTKVGQLAAKSDDTNARVRSLEEGQAEIKDLLLRALDK" gene complement(4215200..4215775) /locus_tag="Rv3770c" /db_xref="GeneID:886104" CDS complement(4215200..4215775) /locus_tag="Rv3770c" /function="UNKNOWN" /note="Rv3770c, (MTCY13D12.04c), len: 191 aa. Hypothetical unknown leu-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218287.1" /db_xref="GI:15610906" /db_xref="GeneID:886104" /translation="MLSGIQQNTLMDNDPLAHGYYVADLLVALAVVVLMLRARRTRPE LARMLLLGTLIGLVWELPVFGLSAWTNTPIIEWATPLPLPTVVFLLAHSVWDGPLLTM GWLLARALTGEPAGALGLTVQVLWGQLTALAVELSAILAGTWSYVDDLWFNPVMFWFR GHPVTAAMQLTWLLAPLCFAALVRRLALTAR" gene complement(4215881..4216063) /locus_tag="Rv3770A" /db_xref="GeneID:3205079" CDS complement(4215881..4216063) /locus_tag="Rv3770A" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv3770A, len: 60 aa. Probable remnant of a transposase, similar to many e.g. Rv2812|MTCY16B7.31c|Z81331_17 IS1604 putative transposase from Mycobacterium tuberculosis (469 aa), FASTA scores: opt: 204, E(): 1e-07, (80.5% identity in 41 aa overlap). Continuation of Rv3770B." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_178012.1" /db_xref="GI:57117153" /db_xref="GeneID:3205079" /translation="MGSTPWCPNPCQCTLRTPVEVLELAVALRPENPDRTAGAIQRIL RAQLAGDRIALRGRGS" gene complement(4216078..4216269) /locus_tag="Rv3770B" /db_xref="GeneID:3205080" CDS complement(4216078..4216269) /locus_tag="Rv3770B" /function="POSSIBLY REQUIRED FOR THE TRANSPOSITION OF AN INSERTION ELEMENT." /note="Rv3770B, len: 63 aa. Probable remnant of a transposase, similar to many e.g. Rv2812|MTCY16B7.31c|Z81331_17 IS1604 putative transposase from Mycobacterium tuberculosis (469 aa), FASTA scores: opt: 379, E(): 1.6e-21, (93.55% identity in 62 aa overlap). Continues as Rv3770A." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_178013.1" /db_xref="GI:57117154" /db_xref="GeneID:3205080" /translation="MRAERARAIGLFRYQLIREAADAAHSTKERGKMVRELASREHTD PFGRKVRISRHTIDRWIRN" gene complement(4216404..4216730) /locus_tag="Rv3771c" /db_xref="GeneID:886103" CDS complement(4216404..4216730) /locus_tag="Rv3771c" /function="UNKNOWN" /note="Rv3771c, (MTCY13D12.05c), len: 108 aa. Hypothetical protein, highly similar, but shorter 81 aa, to P71640|Rv2811|MTCY16B7.32c HYPOTHETICAL 21.1 KDA PROTEIN from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 469, E(): 2.7e-25, (73.15% identity in 108 aa overlap)" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218288.1" /db_xref="GI:15610907" /db_xref="GeneID:886103" /translation="MPAPAEKALSQVGFRRIAADLARPAETVRGWLRRFAERAEAVRS VFTVMLRAVDPDPVMPDAAVGVFAYAVTVIAAVVTVIECQFALSTVSLAETAVAVSGG RLVAPG" gene complement(4216865..4216937) /locus_tag="Rvnt43" /note="tRNA-Arg(ACG)" /db_xref="GeneID:2700457" tRNA complement(4216865..4216937) /locus_tag="Rvnt43" /product="tRNA-Arg" /note="codon recognized: CGU" /anticodon=(pos:4216902..4216904,aa:Arg) /db_xref="GeneID:2700457" gene complement(4216968..4217056) /locus_tag="Rvnt44" /note="tRNA-Ser(GCT)" /db_xref="GeneID:2700421" tRNA complement(4216968..4217056) /locus_tag="Rvnt44" /product="tRNA-Ser" /note="codon recognized: AGC" /anticodon=(pos:4217020..4217022,aa:Ser) /db_xref="GeneID:2700421" gene 4217134..4218195 /gene="hisC2" /locus_tag="Rv3772" /db_xref="GeneID:886105" CDS 4217134..4218195 /gene="hisC2" /locus_tag="Rv3772" /EC_number="2.6.1.9" /function="INVOLVED IN HISTIDINE BIOSYNTHETIC PATHWAY (AT THE EIGHTH STEP) [CATALYTIC ACTIVITY: L-HISTIDINOL-PHOSPHATE + 2-OXOGLUTARATE = 3-(IMIDAZOL-4-YL)-2-OXOPROPYL PHOSPHATE + GLUTAMATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3772, (MTCY13D12.06), len: 353 aa. Probable hisC2, histidinol-phosphate aminotransferase (EC 2.6.1.9), highly similar to Q9ZBY8|SCD78.11 PUTATIVE HISTIDINOL-PHOPHATE AMINOTRANSFERASE from Streptomyces coelicolor (359 aa), FASTA scores: opt: 1165, E(): 7.1e-64, (52.55% identity in 356 aa overlap); and similar to many e.g. Q9EYX2 from Gardnerella vaginalis (317 aa) FASTA scores: opt: 814, E(): 1.7e-42, (45.15% identity in 308 aa overlap); Q9CMI7|HISH_1PM0838|HISH from Pasteurella multocida (365 aa), FASTA scores: opt: 701, E(): 1.5e-35, (35.05% identity in 351 aa overlap); O07131|HIS8_METFL|HISC|HISH from Methylobacillus flagellatum (368 aa), FASTA scores: opt: 645, E(): 4e-32, (34.5% identity in 345 aa overlap); etc. Contains PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site. BELONGS TO CLASS-II OF PYRIDOXAL-PHOSPHATE-DEPENDENT AMINOTRANSFERASES. COFACTOR: PYRIDOXAL PHOSPHATE." /codon_start=1 /transl_table=11 /product="putative aminotransferase" /protein_id="NP_218289.1" /db_xref="GI:15610908" /db_xref="GeneID:886105" /translation="MTARLRPELAGLPVYVPGKTVPGAIKLASNETVFGPLPSVRAAI DRATDTVNRYPDNGCVQLKAALARHLGPDFAPEHVAVGCGSVSLCQQLVQVTASVGDE VVFGWRSFELYPPQVRVAGAIPIQVPLTDHTFDLYAMLATVTDRTRLIFVCNPNNPTS TVVGPDALARFVEAVPAHILIAIDEAYVEYIRDGMRPDSLGLVRAHNNVVVLRTFSKA YGLAGLRIGYAIGHPDVITALDKVYVPFTVSSIGQAAAIASLDAADELLARTDTVVAE RARVSAELRAAGFTLPPSQANFVWLPLGSRTQDFVEQAADARIVVRPYGTDGVRVTVA APEENDAFLRFARRWRSDQ" misc_feature 4217773..4217802 /gene="hisC2" /locus_tag="Rv3772" /note="PS00599 Aminotransferases class-II pyridoxal-phosphate attachment site" gene complement(4218241..4218825) /locus_tag="Rv3773c" /db_xref="GeneID:886095" CDS complement(4218241..4218825) /locus_tag="Rv3773c" /function="UNKNOWN" /note="Rv3773c, (MTCY13D12.07c), len: 194 aa. Hypothetical protein, highly similar to C-terminal end of O53773|Rv0576|MTV039.14 POSSIBLE TRANSCRIPTIONAL REGULATOR from Mycobacterium tuberculosis (434 aa), FASTA scores: opt: 575, E(): 8.3e-30, (47.4% identity in 192 aa overlap); and some similarity with other proteins from Mycobacterium tuberculosis e.g. P71985|Rv1727|MTCY04C12.12 (189 aa) FASTA scores: opt: 176, E(): 0.00022, (31.1% identity in 180 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218290.1" /db_xref="GI:15610909" /db_xref="GeneID:886095" /translation="MPPESRPGPDSPPTDELACAEAALQVLQQVLHTIGRQDKAKQTP CPGYDVKKLTEHLLNSIMVLGGMVGAEFSLRADIDSVERLVSGAARSALDAWHRHGLE GDVSLGPGSMSAKVAVSVFSVEFLVHAWDYAVAVGSELKAADSLAEYVLELARKLIKP EERSVAGFNEPVDVPEDGGALERLIAFTGRNPAR" gene 4218849..4219673 /gene="echA21" /locus_tag="Rv3774" /db_xref="GeneID:886106" CDS 4218849..4219673 /gene="echA21" /locus_tag="Rv3774" /EC_number="4.2.1.17" /function="COULD POSSIBLY OXIDIZES FATTY ACIDS USING SPECIFIC COMPONENTS (BY SIMILARITY) [CATALYTIC ACTIVITY: (3S)-3-HYDROXYACYL-CoA = TRANS-2(OR 3)-ENOYL-CoA + H(2)O]." /note="Catalyzes the reversible hydration of unsaturated fatty acyl-CoA to beta-hydroxyacyl-CoA" /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase" /protein_id="NP_218291.1" /db_xref="GI:15610910" /db_xref="GeneID:886106" /translation="MGETYESVTVETKDQVAQVTLIGPGKGNAMGPAFWSEMPEVFHA LDADREVRAIVITGSGKNFSYGLDVPAMGGMFAPLIADGALARPRTDFHTEILRMQKA INAVADCRTPTIAAVQGWCIGGAVDLISAVDIRYASADAKFSVREVKLAIVADMGSLA RLPLILSDGHLRELALTGKNIDAARAEKIGLVNDVYDDADQTLAAAHATAAEIAANPP LAVYGIKDVLDQQRTSAVSENLRYVAAWNAAFLPSKDLTEGISATFAKRPPQFTGE" gene 4219685..4220932 /gene="lipE" /locus_tag="Rv3775" /db_xref="GeneID:886269" CDS 4219685..4220932 /gene="lipE" /locus_tag="Rv3775" /EC_number="3.1.-.-" /function="UNKNOWN; LIPOLYTIC ENZYME PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3775, (MTCY13D12.09), len: 415 aa. Probable lipE, hydrolase lipase (EC 3.1.-.-), equivalent to Q9CD95|LIPE|ML0119 PROBABLE HYDROLASE from Mycobacterium leprae (411 aa), FASTA scores: opt: 2418, E(): 6.4e-144, (84.75% identity in 406 aa overlap). Also similar to other esterases e.g. Q9ABH2|CC0255 ESTERASE A from Caulobacter crescentus (374 aa), FASTA scores: opt: 427, E(): 2.4e-19, (28.9% identity in 391 aa overlap); O87861|ESTA ESTERASE A from Streptomyces chrysomallus (389 aa), FASTA scores: opt: 417, E(): 1e-18, (31.0% identity in 361 aa overlap); Q9RK50|SCF12.08 PUTATIVE ESTERASE from Streptomyces coelicolor (376 aa), FASTA scores: opt: 385, E(): 1e-16, (31.35% identity in 373 aa overlap); etc. Also similar to proteins from Mycobacterium tuberculosis e.g. P71778|Rv1497|MTCY277.19 HYPOTHETICAL 45.8 KDA PROTEIN (429 aa), FASTA scores: opt: 457, E(): 3.5e-21, (30.4% identity in 395 aa overlap)." /codon_start=1 /transl_table=11 /product="lipase LipE" /protein_id="NP_218292.1" /db_xref="GI:15610911" /db_xref="GeneID:886269" /translation="MRAGDGKIRVPADLDAVTATGEEDHSEIDGAAVDRIWRAARHWY RAGMHPAIQLCIRHHGRVVLNRAIGHGWGNAPTDEADAEKIPVTTDTPFCVYSAAKAI TATVVHMLVERGHFALDDRVCEYLPSYTSHGKHRTTIRHVLTHSAGVPFPTGPRPDVR RADDHEYAVERLGELRPLYRPGLVHIYHALTWGPLMREIVYAATGKEIREILATEILD PLGFRWTNFGVAERDVPLVAPSHATGRQLPPVIAAVFRKAIGGTVHEIIPYTNTPFFL STILPSSNTVSTANELSRFMEILRRGGELDGVRVLSPETLRGAVTECRRLRPDFATGL MPLRWGTGFMLGSAKYGPFGRNAPAAFGHLGLVNIAVWADPERALSGGLISSGKPGRD PEAGRYGALLNAITAEIPRASSG" gene 4221089..4222648 /locus_tag="Rv3776" /db_xref="GeneID:888953" CDS 4221089..4222648 /locus_tag="Rv3776" /function="UNKNOWN. THOUGHT TO BE REGULATED BY Rv2720|LEXA." /experiment="experimental evidence, no additional details recorded" /note="Rv3776, (MTCY13D12.10), len: 519 aa. Conserved hypothetical protein, highly similar to Q10709|YL00_MYCTU|Rv2100|MTCY49.40 HYPOTHETICAL 58.9 KDA PROTEIN from Mycobacterium tuberculosis (550 aa) FASTA scores: opt: 1646, E(): 1.2e-83, (77.85% identity in 510 aa overlap) (homology from potential start at 7744); and similar to other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O33266|Rv0336|MTCY279.03 (503 aa) FASTA scores: opt: 682, E(): 2.2e-30, (41.65% identity in 497 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218293.1" /db_xref="GI:15610912" /db_xref="GeneID:888953" /translation="MFEISLSDPVELRDADDAALLAAIEDCARAEVAAGARRLSAIAE LTSRRTGNDQRADWACDGWDCAAAEVAAALTVSHRKASGQMHLSLTLNRLPQVAALFL AGQLSARLVSIIAWRTYLVRDPEALSLLDAALAKHATAWGPLSAPKLEKAIDSWIDRY DPAALRRTRISARSRDLCIGDPDEDAGTAALWGRLFATDAAMLDKRLTQLAHGVCDDD PRTIAQRRADALGALAAGADRLTCGCGNSDCPSSAGNHRQATGVVIHVVADAAALGAA PDPRLSGPEPALAPEAPATPAVKPPAALISGGGVVPAPLLAELIRGGAALSRMRHPGD LRSEPHYRPSAKLAEFVRIRDMTCRFPGCDQPTEFCDIDHTLPYPLGPTHPSNLKCLC RKHHLLKTFWTGWRDVQLPDGTIIWTAPNGHTYTTHPDSRIFLPSWHTTTAALPPAPS PPAIGPTHTLLMPRRRRTRAAELAHRIKRERAHVTQRNKPPPSGGDTAVAEGFEPPDG VSRLSLSRRVH" gene complement(4222581..4222667) /locus_tag="Rvnt45" /note="tRNA-Ser(TGA)" /db_xref="GeneID:2700428" tRNA complement(4222581..4222667) /locus_tag="Rvnt45" /product="tRNA-Ser" /note="codon recognized: UCA" /anticodon=(pos:4222631..4222633,aa:Ser) /db_xref="GeneID:2700428" gene 4222694..4223680 /locus_tag="Rv3777" /db_xref="GeneID:886110" CDS 4222694..4223680 /locus_tag="Rv3777" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3777, (MTCY13D12.11), len: 328 aa. Probable oxidoreductase (EC 1.-.-.-), equivalent to Q9CD96|ML0118 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (336 aa) FASTA scores: opt: 1661, E(): 1.1e-87, (76.0% identity in 325 aa overlap). Also highly similar to many e.g. Q9XA55|SCGD3.24c PUTATIVE QUINONE OXIDOREDUCTASE (EC 1.6.5.5) from Streptomyces coelicolor (326 aa) FASTA scores: opt: 1118, E(): 1.3e-64, (59.6% identity in 312 aa overlap); O65423|F18E5.200|F17L22.40|AT4G21580 PUTATIVE NADPH QUINONE OXIDOREDUCTASE from Arabidopsis thaliana (Mouse-ear cress) (325 aa), FASTA scores: opt: 1110, E(): 3e-56, (52.15% identity in 326 aa overlap); Q98FI0|MLL3767 NADPH QUINONE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (326 aa), FASTA scores: opt: 980, E(): 7.9e-49, (47.85% identity in 324 aa overlap); etc." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218294.1" /db_xref="GI:15610913" /db_xref="GeneID:886110" /translation="MTIMRAVVAESSDRLVWQEVPDVSAGPGEVLIKVAASGVNRADV LQAAGKYPPPPGVSDIIGLEVSGIVAAVGPGVTEWSAGQEVCALLAGGGYAEYVAVPA DQVLPIPPSVNLVDSAALPEVACTVWSNLVMTAHLRPGQLVLIHGGASGIGSHAIQVV RALAARVAITAGSPEKLELCRDLGAQITINYRDEDFVARLKQETDGSGADIILDIMGA SYLDRNIDALATDGQLIVIGMQGGVKAELNLGKLLTKRARVIGTTLRARPVSGPHGKA AIAQAVAASVWPMIAANRVRPVIGTRLPIQQAAQAHELMLSGKTFGKILLTV" gene complement(4223699..4224895) /locus_tag="Rv3778c" /db_xref="GeneID:886116" CDS complement(4223699..4224895) /locus_tag="Rv3778c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3778c, (MTCY13D12.12c), len: 398 aa. Possible aminotransferase (EC 2.6.1.-), equivalent to Q9CD97|ML0117 HYPOTHETICAL PROTEIN from Mycobacterium leprae (398 aa) FASTA scores: opt: 2141, E(): 1.2e-123, (83.4% identity in 398 aa overlap). Also similar to other aminotransferases and cysteine desulfurases e.g. Q9K3K6|SCG20A.34 PUTATIVE AMINOTRANSFERASE from Streptomyces coelicolor (400 aa), FASTA scores: opt: 723, E(): 6.5e-37, (36.3% identity in 402 aa overlap); Q9KSS2|VC1184 NIFS-RELATED PROTEIN (AMINOTRANSFERASE-RELATED) from Vibrio cholerae (416 aa) FASTA scores: opt: 595, E(): 4.5e-29, (31.35% identity in 405 aa overlap); Q98NK4|MLR0102 AMINOTRANSFERASE from Rhizobium loti (Mesorhizobium loti) (425 aa), FASTA scores: opt: 563, E(): 4.2e-27, (29.4% identity in 408 aa overlap); Q9RY03|DR0151 NIFS-RELATED PROTEIN from Deinococcus radiodurans (401 aa), FASTA scores: opt: 484, E(): 2.7e-22, (32.35% identity in 399 aa overlap); Q9A766|CC1860 AMINOTRANSFERASE CLASS V from Caulobacter crescentus (408 aa), FASTA scores: opt: 390, E(): 1.5e-16, (27.85% identity in 413 aa overlap); etc." /codon_start=1 /transl_table=11 /product="aminotransferase" /protein_id="NP_218295.1" /db_xref="GI:15610914" /db_xref="GeneID:886116" /translation="MAYDVARVRGLHPSLGDGWVHFDAPAGMLIPDSVATTVSTAFRR SGASTVGAHPSARRSAAVLDAAREAVADLVNADPGGVVLGADRAVLLSLLAEASSSRA GLGYEVIVSRLDDEANIAPWLRAAHRYGAKVKWAEVDIETGELPTWQWESLISKSTRL VAVNSASGTLGGVTDLRAMTKLVHDVGALVVVDHSAAAPYRLLDIRETDADVVTVNAH AWGGPPIGAMVFRDPSVMNSFGSVSTNPYATGPARLEIGVHQFGLLAGVVASIEYLAA LDESARGSRRERLAVSMQSADAYLNRVFDYLMVSLRSLPLVMLIGRPEAQIPVVSFAV HKVPADRVVQRLADNGILAIANTGSRVLDVLGVNDVGGAVTVGLAHYSTMAEVDQLVR ALASLG" gene 4224985..4226985 /locus_tag="Rv3779" /db_xref="GeneID:886107" CDS 4224985..4226985 /locus_tag="Rv3779" /function="UNKNOWN" /note="Rv3779, (MTCY13D12.13), len: 666 aa. Probable conserved transmembrane ala-, leu-rich protein, equivalent to Q9CD98|ML0116 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (654 aa), FASTA scores: opt: 1991, E(): 2e-112, (66.5% identity in 666 aa overlap). Shows some similarity with Q9RRU0|DR2395 PUTATIVE NA+/H+ ANTIPORTER from Deinococcus radiodurans (458 aa), FASTA scores: opt: 138, E(): 0.69, (31.9% identity in 138 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein alanine and leucine rich" /protein_id="NP_218296.1" /db_xref="GI:15610915" /db_xref="GeneID:886107" /translation="MGLWFGTLIALILLIAPGAMVARIAQLRWPVAIAVGPALTYGVV ALAIIPYGALGIPWNGWTALAALAVTCAVATGLQLLLARFRDLDAEALAVSRWPAVTV AAGVLLGALLIGWAAYRGIPHWQSIPSTWDAVWHANTVRFILDTGQASSTHMGELRNV ETHAPLYYPSVFHGLVAVFCQLTGAAPTTGYTLSSLAASVWLFPVSAAVLTWRAVRSH PGALWSASCASAEWRAAGAAGTAAALSASFTAVPYVEFDTAAMPNLAAYGIAVPTMVL ITSTLRHRDRIPVAVLALVGVFSLHITGGIVVALLVSAWWLFEALRHPVRSRLADLLT LAGVAAMAGLVMLPQFLSVRQQEDIIAGHAFPTYLSKKRGLFDAVFQHSRHLNDFPVQ YALIVLAAIGGLILLVKKIWWPLAVWLLLIVMNVDAGTPLGGPIGGVAGALGEFFYHD PRRIAAATTLLLMLMAGVALFATVMLLVAAAKRLTDRFRPQPVSVWASATATLLIGAT LVSAWHYFPRHRFLFGDKYDSVMIDQKDLDAMAYLASLPGARDTLIGNANTDGTAWMY AVAGLHPLWTHYDYPLQQGPGYHRFIFWAYGRNGESDPRVLEAIQVLRIRYILTSTPT VRGFAVPDGLVSLETSRSWAKIYDNGEARIYEWRGTAAATHS" gene 4226989..4227525 /locus_tag="Rv3780" /db_xref="GeneID:886115" CDS 4226989..4227525 /locus_tag="Rv3780" /function="UNKNOWN" /note="Rv3780, (MTCY13D12.14), len: 178 aa. Conserved hypothetical protein, equivalent to Q9CD99|ML0115 HYPOTHETICAL 19.1 KDA PROTEIN from Mycobacterium leprae (174 aa), FASTA scores: opt: 903, E(): 2.3e-48, (82.95% identity in 170 aa overlap). Also highly similar to Q9XA56|SCGD3.23c HYPOTHETICAL 19.5 KDA PROTEIN from Streptomyces coelicolor (179 aa), FASTA scores: opt: 692, E(): 1.8e-35, (65.9% identity in 170 aa overlap). Note that this putative protein is 4 aa longer at the N-terminus compared to previous annotation (in Nature 393: 537-544 (1998))." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218297.1" /db_xref="GI:15610916" /db_xref="GeneID:886115" /translation="MRKRMVIGLSTGSDDDDVEVIGGVDPRLIAVQENDSDESSLTDL VEQPAKVMRIGTMIKQLLEEVRAAPLDEASRNRLRDIHATSIRELEDGLAPELREELD RLTLPFNEDAVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLQQMRQGAL PPGVGKSGQHGHGTGQYL" gene 4227529..4228350 /gene="rfbE" /locus_tag="Rv3781" /db_xref="GeneID:886113" CDS 4227529..4228350 /gene="rfbE" /locus_tag="Rv3781" /function="MAY FORM AN ATP-DRIVEN O-ANTIGEN/LIPOPOLYSACCHARIDE EXPORT APPARATUS, IN ASSOCIATION WITH RFBD|Rv3783. RESPONSIBLE FOR ENERGY COUPLING TO THE TRANSPORT SYSTEM." /note="Rv3781, (MTCY13D12.15), len: 273 aa. Probable rfbE, polysaccharide-transport ATP-binding protein ABC transporter, involved in O-antigen/lipopolysaccharides (LPS) transport (see Braibant et al., 2000), equivalent to Q9CDA0|ML0114 PUTATIVE ABC TRANSPORTER ATP-BINDING COMPONENT from Mycobacterium leprae (272 aa), FASTA scores: opt: 1581, E(): 3e-83, (91.4% identity in 267 aa overlap). Also highly similar to AAK71283 LPS/O-ANTIGEN EXPORT PERMEASE from Coxiella burnetii (258 aa), FASTA scores: opt: 793, E(): 2.5e-38, (45.45% identity in 253 aa overlap); Q9PAF0|XF2568 ABC TRANSPORTER ATP-BINDING PROTEIN from Xylella fastidiosa (246 aa), FASTA scores: opt: 758, E(): 2.4e-36, (47.75% identity in 243 aa overlap); Q56903|RFBE_YEREN O-ANTIGEN EXPORT SYSTEM ATP-BINDING PROTEIN from Yersinia enterocolitica (239 aa) (see Zhang et al., 1993), FASTA scores: opt: 697, E(): 7e-33, (48.65% identity in 224 aa overlap); Q50863|RFBB_MYXXA O-ANTIGEN EXPORT SYSTEM ATP-BINDING from Myxococcus xanthus (437 aa), FASTA scores: opt: 605, E(): 2e-27, (42.05% identity in 207 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN FAMILY (ABC TRANSPORTERS)." /codon_start=1 /transl_table=11 /product="o-antigen/lipopolysaccharide transport ATP-binding protein ABC transporter RfbE" /protein_id="NP_218298.1" /db_xref="GI:15610917" /db_xref="GeneID:886113" /translation="MSDPHHPHIQTHNAWVEFPIFDAKSRSLKKAVLGKAGGTIGRNN SNVVVIEALRDITMELNLGDRVGLVGHNGAGKSTLLRLLSGIYEPTRGWAKVTGRVAP VFDLGIGMDPEISGYENIIIRGLFLGQTRKQMQAKVDEIAEFTELGEYLSMPLRTYST GMRVRLAMGVVTSIDPEILLLDEGIGAVDADFLRKAQSRLQNLVERSGILVFASHSNE FLARLCKTAIWIDHGVIRLAGGIEEVVRAYEGEDAARHVREVLAETQADRQNVQG" misc_feature 4227736..4227759 /gene="rfbE" /locus_tag="Rv3781" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 4228347..4229261 /locus_tag="Rv3782" /db_xref="GeneID:886114" CDS 4228347..4229261 /locus_tag="Rv3782" /EC_number="2.4.1.-" /function="POSSIBLY INVOLVED IN dTDP-L-RHAMNOSE BIOSYNTHESIS WITHIN THE O ANTIGEN BIOSYNTHESIS PATHWAY OF LIPOPOLYSACCHARIDE BIOSYNTHESIS." /note="Rv3782, (MTCY13D12.16), len: 304 aa. Possible L-rhamnosyltransferase (EC 2.4.1.-), equivalent to Q9CDA1|RFBE|ML0113 PUTATIVE GLYCOSYL TRANSFERASE from Mycobacterium leprae (283 aa), FASTA scores: opt: 1583, E(): 9.3e-96, (81.6% identity in 277 aa overlap). Also some similarity with AAK68916|WCFN PUTATIVE GLYCOSYLTRANSFERASE from Bacteroides fragilis (291 aa) FASTA scores: opt: 241, E(): 2.1e-08, (30.75% identity in 195 aa overlap); O58161|PH0424 HYPOTHETICAL 40.5 KDA PROTEIN from Pyrococcus horikoshii (348 aa), FASTA scores: opt: 194, E(): 2.8e-05, (23.85% identity in 302 aa overlap); O26448|MTH348 RHAMNOSYL TRANSFERASE from Methanothermobacter thermautotrophicus (313 aa), FASTA scores: opt: 177, E(): 0.00033, (28.2% identity in 333 aa overlap); O07868|CPS19BQ PUTATIVE RHAMNOSYL TRANSFERASE FASTA from Streptococcus pneumoniae (300 aa), FASTA scores: opt: 156, E(): 0.0074, (25.45% identity in 232 aa overlap); and other putative transferases. Note that C-terminal end shows some similarity with part of Q05161|RFB O-ANTIGEN BIOSYNTHESIS PROTEIN B from Escherichia coli strain 0101. Note that previously known as rfbE.; rfbE" /codon_start=1 /transl_table=11 /product="L-rhamnosyltransferase" /protein_id="YP_178014.1" /db_xref="GI:57117155" /db_xref="GeneID:886114" /translation="MTESVFAVVVTHRRPDELAKSLDVLTAQTRLPDHLIVVDNDGCG DSPVRELVAGQPIATTYLGSRRNLGGAGGFALGMLHALAQGADWVWLADDDGHAQDAR VLATLLACAEKYSLAEVSPMVCNIDDPTRLAFPLRRGLVWRRRASELRTEAGQELLPG IASLFNGALFRASTLAAIGVPDLRLFIRGDEVEMHRRLIRSGLPFGTCLDAAYLHPCG SDEFKPILCGRMHAQYPDDPGKRFFTYRNRGYVLSQPGLRKLLAQEWLRFGWFFLVTR RDPKGLWEWIRLRRLGRREKFGKPGGSA" gene 4229258..4230100 /gene="rfbD" /locus_tag="Rv3783" /db_xref="GeneID:886111" CDS 4229258..4230100 /gene="rfbD" /locus_tag="Rv3783" /function="MAY FORM AN ATP-DRIVEN O-ANTIGEN/LIPOPOLYSACCHARIDE EXPORT APPARATUS, IN ASSOCIATION WITH RFBE|Rv3781. RESPONSIBLE FOR THE TRANSLOCATION OF THE SUBSTRATE ACROSS THE MEMBRANE." /note="Rv3783, (MTCY13D12.17), len: 280 aa. Probable rfbD, polysaccharide-transport integral membrane protein ABC transporter (see Braibant et al., 2000), involved in O-antigen/lipopolysaccharides (LPS) transport, equivalent to Q9CDA2|ML0112 PUTATIVE ABC TRANSPORTER COMPONENT from Mycobacterium leprae (276 aa), FASTA scores: opt: 1646, E(): 4e-102, (84.3% identity in 280 aa overlap). Also highly similar to Q9PAF1|XF2567 ABC TRANSPORTER PERMEASE PROTEIN from Xylella fastidiosa (267 aa), FASTA scores: opt: 723, E(): 7.6e-41, (41.3% identity in 259 aa overlap); and similar to others e.g. Q56902|RFBD_YEREN O-ANTIGEN EXPORT SYSTEM PERMEASE PROTEIN from Yersinia enterocolitica (259 aa) (see Zhang et al., 1993), FASTA scores: opt: 566, E(): 2e-30, (28.05% identity in 264 aa overlap); Q06955|RFBH RFBH PROTEIN (involved in the export of lipopolysaccharide) (alias Q9KVA3|VC0246) LIPOPOLYSACCHARIDE/O-ANTIGEN TRANSPORT PROTEIN from Vibrio cholerae (257 aa), FASTA scores: opt: 358, E(): 1.3e-16, (24.4% identity in 258 aa overlap); Q9HTB8|WZM|PA5451 MEMBRANE SUBUNIT OF A-BAND LPS EFFLUX TRANSPORTER from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 263, E(): 2.7e-10, (25.45% identity in 263 aa overlap); etc. BELONGS TO THE ABC-2 SUBFAMILY OF INTEGRAL MEMBRANE PROTEINS." /codon_start=1 /transl_table=11 /product="O-antigen/lipopolysaccharide transport integral membrane protein ABC transporter RfbD" /protein_id="NP_218300.1" /db_xref="GI:15610919" /db_xref="GeneID:886111" /translation="MTFMDAQASFQTQSRTLARVRGDLVDGFRRHELWLHLGWQDIKQ RYRRSVLGPFWITIATGTTAVAMGGLYSKLFRLELSEHLPYVTLGLIVWNLINAAILD GAEVFVANEGLIKQLPAPLSVHVYRLVWRQMIFFAHNIVIYFVIAIIFPKPWSWADLS FLPALALIFLNCVWVSLCFGILATRYRDIGPLLFSVVQLLFFMTPIIWNDETLRRQGA GRWSSIVELNPLLHYLDIVRAPLLGAHQELRHWLVVLVLTVVGWMLAAFAMRQYRARV PYWV" gene 4230256..4231236 /locus_tag="Rv3784" /db_xref="GeneID:886117" CDS 4230256..4231236 /locus_tag="Rv3784" /EC_number="4.2.1.46" /function="POSSIBLY INVOLVED IN dTDP-L-RHAMNOSE BIOSYNTHESIS WITHIN THE O ANTIGEN BIOSYNTHESIS PATHWAY OF LIPOPOLYSACCHARIDE BIOSYNTHESIS [CATALYTIC ACTIVITY: dTDP-GLUCOSE = dTDP-4-DEHYDRO-6-DEOXY-D-GLUCOSE + H(2)O]." /experiment="experimental evidence, no additional details recorded" /note="Rv3784, (MTCY13D12.18), len: 326 aa. Possible dTDP-glucose 4,6-dehydratase (EC 4.2.1.46), but experimental study shown that the purified protein didn't have dTDP-glucose dehydratase (rmlB) activity (see citation below). Similar to others e.g. Q9YCT1|APE1180 LONG HYPOTHETICAL DTDP-GLUCOSE 4,6-DEHYDRATASE from Aeropyrum pernix (330 aa) FASTA scores: opt: 598, E(): 3.7e-30, (34.9% identity in 315 aa overlap); O27817|MTH1789 DTDP-GLUCOSE 4,6-DEHYDRATASE from Methanothermobacter thermautotrophicus (336 aa) FASTA scores: opt: 587, E(): 1.8e-29, (34.9% identity in 315 aa overlap); Q9X5W0|GRSE TDP-GLUCOSE-4,6-DEHYDRATASE HOMOLOG from Streptomyces griseus (324 aa), FASTA scores: opt: 583, E(): 3.2e-29, (35.7% identity in 325 aa overlap); Q9K7J7|SPSJ|BH3364 SPORE COAT POLYSACCHARIDE SYNTHESIS (DTDP GLUCOSE 4, 6-DEHYDRATASE) from Bacillus halodurans (321 aa), FASTA scores: opt: 562, E(): 6.5e-28, (33.0% identity in 318 aa overlap); Q9UZH2|RFBB|PAB0785 DTDP-GLUCOSE 4,6-DEHYDRATASE from Pyrococcus abyssi (333 aa), FASTA scores: opt: 552, E(): 2.8e-27, (33.95% identity in 318 aa overlap); P27830|RFFG_ECOLI|B3788 DTDP-GLUCOSE 4,6-DEHYDRATASE from Escherichia coli strain K12 (355 aa), FASTA scores: opt: 401, E(): 7.5e-28, (31.3% identity in 348 aa overlap); etc. But also similar to several UDP-glucose 4-epimerases (EC 5.1.3.2) and other proteins e.g. O59375|PH1742 LONG HYPOTHETICAL UDP-GLUCOSE 4-EPIMERASE from Pyrococcus horikoshii (306 aa) FASTA scores: opt: 600, E(): 2.6e-30, (34.5% identity in 313 aa overlap); Q9ZGC7|LANH14 NDP-HEXOSE 4,6-DEHYDRATASE HOMOLOGfrom Streptomyces cyanogenus (326 aa), FASTA scores: opt: 593, E(): 7.6e-30, (36.45% identity in 321 aa overlap); Q57664|GALE_METJA|MJ0211 PUTATIVE UDP-GLUCOSE 4-EPIMERASE from Methanococcus jannaschii (305 aa) FASTA scores: opt: 575, E(): 9.6e-29, (32.6% identity in 313 aa overlap); etc. SEEMS TO BELONG TO THE SUGAR EPIMERASE FAMILY, DTDP-GLUCOSE DEHYDRATASE SUBFAMILY. Note that previously known as epiB.; epiB" /codon_start=1 /transl_table=11 /product="dTDP-glucose 4,6-dehydratase" /protein_id="YP_178015.1" /db_xref="GI:57117156" /db_xref="GeneID:886117" /translation="MEILVTGGAGFQGSHLTESLLANGHWVTVLDKSSRNAVRNMQGF RSHDRAAFISGSVTDGQTIDRAVRDHHVVFHLAAHVNVDQSLGDPESFLETNVMGTYR VLEAVRRYRNRLIYVSTCEVYGDGHNLKEGERLDEHAELKPNSPYGASKAAADRLCYS YFRSYGLDVTIVRPFNIFGVRQKAGRFGALIPRLVRQGINGEGLTIFGAGSATRDYLY VSDIVGAYNLVLRTPTLRGQAINFASGKDTRVRDIVEYVADKFGARIEHRDARPGEVQ RFPADISLAKSIGFQPQVEIWDGIDRYINWAKDQPQYPYEQDGFSGSSVL" gene 4231320..4232393 /locus_tag="Rv3785" /db_xref="GeneID:886119" CDS 4231320..4232393 /locus_tag="Rv3785" /function="UNKNOWN" /note="Rv3785, (MTCY13D12.19), len: 357 aa. Hypothetical unknown protein. Note that this putative protein is equivalent to AAK48258|MT3893 NAD-DEPENDENT EPIMERASE/DEHYDRATASE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (712 aa), but shorter 355 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218302.1" /db_xref="GI:15610921" /db_xref="GeneID:886119" /translation="MVTVARRPVCPVTLTPGDPALASVRDLVDAWSAHDALAELVTMF GGAFPQTDHLEARLASLDKFSTAWDYRARARAARALHGEPVRCQDSGGGARWLIPRLD LPAKKRDAIVGLAQQLGLTLESTPQGTTFDHVLVIGTGRHSNLIRARWARELAKGRQV GHIVLAAASRRLLPSEDDAVAVCAPGARTEFELLAAAARDAFGLDVHPAVRYVRQRDD NPHRDSMVWRFAADTNDLGVPITLLEAPSPEPDSSRATSADTFTFTAHTLGMQDSTCL LVTGQPFVPYQNFDALRTLALPFGIQVETVGFGIDRYDGLGELDQQHPAKLLQEVRST IRAARALLERIEAGERMATDPRR" gene complement(4232374..4233597) /locus_tag="Rv3786c" /db_xref="GeneID:886108" CDS complement(4232374..4233597) /locus_tag="Rv3786c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3786c, (MTCY13D12.20), len: 407 aa. Hypothetical unknown protein. Segment between aa 265-300 (approximatively) is highly similar to part of O03937|RORF1608 MINOR CAPSID PROTEIN from Bacteriophage phig1e (1608 aa), FASTA scores: opt: 242, E(): 8.4e-07, (26.85% identity in 272 aa overlap); Q9ETT9|ORF36 PUTATIVE PEPTIDASE from Corynebacterium equii (Rhodococcus equi) plasmid pREAT701 (p33701) and Plasmid virulence (546 aa), FASTA scores: opt: 231, E(): 1.6e-06, (34.15% identity in 167 aa overlap); O69910|SC2E1.40c HYPOTHETICAL 22.8 KDA PROTEIN. from Streptomyces coelicolor (226 aa) FASTA scores: opt: 218, E(): 4.6e-06, (34.15% identity in 164 aa overlap); and others." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218303.1" /db_xref="GI:15610922" /db_xref="GeneID:886108" /translation="MRILAMTRAHNAGRTLAATLDSLAVFSDDIYVIDDRSTDDTAEI LANHPAVTNVVRARPDLPPTPWLIPESAGLELLYRMADFCRPDWVMMVDADWLVETDI DLRAVLARTPDDIVALMCPMVSRWDDPEYPDLIPVMGTAEALRGPLWRWYPGLRAGGK LMHNPHWPANITDHGRIGQLPGVRLVHSGWSTLAERILRVEHYLRLDPDYRFNFGVAY DRSLLFGYALDEVDLLKADYRRRIRGDFDPLEPGGRLPIDREPRAIGRGYGPHAGGFH PGVDFATDPGTPVYAVASGAVSAIDEVDGLVSLTIARCELDVVYVFRPGDEGRLVLGD RIAAGAQLGTIGAQGESADGYLHFEVRTQDGHVNPVRYLANMGLRPWPPPGRLRAVSG SYPPATPCTITAEDR" gene complement(4233610..4234536) /locus_tag="Rv3787c" /db_xref="GeneID:886118" CDS complement(4233610..4234536) /locus_tag="Rv3787c" /function="UNKNOWN" /note="Rv3787c, (MTCY13D12.21), len: 308 aa. Conserved hypothetical protein, highly similar to several mycobacterial hypothetical proteins e.g. P95074|Rv0726c|MTCY210.45c from Mycobacterium tuberculosis (367 aa), FASTA scores: opt: 1038, E(): 1.6e-58, (55.85% identity in 283 aa overlap); O53795|MBE50c|Rv0731c|MTV041.05c from Mycobacterium tuberculosis (318 aa), FASTA scores: opt: 1030, E(): 4.5e-58, (56.15% identity in 292 aa overlap); Q9CCZ4|ML2640 from Mycobacterium leprae (310 aa) FASTA scores: opt: 709, E(): 9.9e-38, (43.75% identity in 279 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218304.1" /db_xref="GI:15610923" /db_xref="GeneID:886118" /translation="MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEP LVRAVGVEFLTRWATGELDAADVDDPDAAWGLQRMTTELVVRTRYFDQFFLDAAAAGV RQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQPTADLRMVPAD LRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLLDHVTDLSAPGSRLALEAF LGSADRDSARVEEMIRTATRGWREHGFHLDIWALNYAGPRHEVSGYLDNHGWRSVGTT TAQLLAAHDLPAAPALPAGLADRPNYWTCVLG" gene 4234780..4235265 /locus_tag="Rv3788" /db_xref="GeneID:886120" CDS 4234780..4235265 /locus_tag="Rv3788" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Regulates the synthesis of nucleoside triphosphates for nucleic acid synthesis, CTP for lipid synthesis, and GTP for protein elongation" /codon_start=1 /transl_table=11 /product="nucleoside diphosphate kinase regulator" /protein_id="NP_218305.1" /db_xref="GI:15610924" /db_xref="GeneID:886120" /translation="MSEKVESKGLADAARDHLAAELARLRQRRDRLEVEVKNDRGMIG DHGDAAEAIQRADELAILGDRINELDRRLRTGPTPWSGSETLPGGTEVTLRFPDGEVV TMHVISVVEETPVGREAETLTARSPLGQALAGHQPGDTVTYSTPQGPNQVQLLAVKLP S" gene 4235374..4235739 /locus_tag="Rv3789" /db_xref="GeneID:886109" CDS 4235374..4235739 /locus_tag="Rv3789" /function="UNKNOWN" /note="Rv3789, (MTCY13D12.23), len: 121 aa. Possible conserved integral membrane protein, equivalent to Q9CDA3|ML0110 HYPOTHETICAL 13.9 KDA PROTEIN from Mycobacterium leprae (123 aa) FASTA scores: opt: 587, E(): 7.3e-34, (72.95% identity in 122 aa overlap). Also equivalent to AAK48262 from Mycobacterium tuberculosis strain CDC1551 (142 aa) but shorter 21 aa." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_218306.1" /db_xref="GI:15610925" /db_xref="GeneID:886109" /translation="MRFVVTGGLAGIVDFGLYVVLYKVAGLQVDLSKAISFIVGTITA YLINRRWTFQAEPSTARFVAVMLLYGITFAVQVGLNHLCLALLHYRAWAIPVAFVIAQ GTATVINFIVQRAVIFRIR" gene 4235779..4237164 /locus_tag="Rv3790" /db_xref="GeneID:886125" CDS 4235779..4237164 /locus_tag="Rv3790" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3790, (MTCY13D12.24), len: 461 aa. Probable oxidoreductase (EC 1.-.-.-), equivalent to Q9CDA4|ML0109 PUTATIVE FAD-LINKED OXIDOREDUCTASE from Mycobacterium leprae (460 aa), FASTA scores: opt: 2722, E(): 1.4e-161, (86.55% identity in 461 aa overlap). Also highly similar to others e.g. Q9KZA4|SC5G8.10c PUTATIVE OXIDOREDUCTASE from Streptomyces coelicolor (457 aa), FASTA scores: opt: 1336, E(): 1.7e-75, (47.1% identity in 452 aa overlap); Q98KY4|MLL1265 PROBABLE OXIDOREDUCTASE from Rhizobium loti (Mesorhizobium loti) (449 aa), FASTA scores: opt: 636, E(): 4.9e-32, (36.0% identity in 439 aa overlap); Q9HDX8|SPAPB1A10.12c PUTATIVE D-ARABINONO-1,4-LACTONE OXIDASE from Schizosaccharomyces pombe (Fission yeast) (461 aa), FASTA scores: opt: 297, E(): 5.6e-11, (23.55% identity in 467 aa overlap); etc. C-terminal end has a high similarity to Q9AQD0 PUTATIVE OXIDOREDUCTASE (FRAGMENT) from Mycobacterium smegmatis (149 aa) FASTA scores: opt: 901, E(): 6.5e-49, (86.6% identity in 149 aa overlap)." /codon_start=1 /transl_table=11 /product="oxidoreductase" /protein_id="NP_218307.1" /db_xref="GI:15610926" /db_xref="GeneID:886125" /translation="MLSVGATTTATRLTGWGRTAPSVANVLRTPDAEMIVKAVARVAE SGGGRGAIARGLGRSYGDNAQNGGGLVIDMTPLNTIHSIDADTKLVDIDAGVNLDQLM KAALPFGLWVPVLPGTRQVTVGGAIACDIHGKNHHSAGSFGNHVRSMDLLTADGEIRH LTPTGEDAELFWATVGGNGLTGIIMRATIEMTPTSTAYFIADGDVTASLDETIALHSD GSEARYTYSSAWFDAISAPPKLGRAAVSRGRLATVEQLPAKLRSEPLKFDAPQLLTLP DVFPNGLANKYTFGPIGELWYRKSGTYRGKVQNLTQFYHPLDMFGEWNRAYGPAGFLQ YQFVIPTEAVDEFKKIIGVIQASGHYSFLNVFKLFGPRNQAPLSFPIPGWNICVDFPI KDGLGKFVSELDRRVLEFGGRLYTAKDSRTTAETFHAMYPRVDEWISVRRKVDPLRVF ASDMARRLELL" gene 4237165..4237929 /locus_tag="Rv3791" /db_xref="GeneID:886124" CDS 4237165..4237929 /locus_tag="Rv3791" /function="UNKNOWN; SUPPOSED INVOLVED IN CELLULAR METABOLISM." /note="Rv3791, (MTCY13D12.25), len: 254 aa. Probable short-chain dehydrogenase/reductase (EC 1.-.-.-), equivalent to Q9CDA5|ML0108 PUTATIVE OXIDOREDUCTASE from Mycobacterium leprae (254 aa), FASTA scores: opt: 1458, E(): 1.6e-83, (89.0% identity in 254 aa overlap); and O05764 PUTATIVE PROTEIN BELONGING TO THE SHORT-CHAIN ALCOHOL DEHYDROGENASE from Mycobacterium smegmatis (254 aa), FASTA scores: opt: 1412, E(): 1.2e-80, (85.05% identity in 254 aa overlap). Also highly similar to Q9KZA5|SC5G8.09c PUTATIVE SHORT-CHAIN DEHYDROGENASE from Streptomyces coelicolor (256 aa), FASTA scores: opt: 733, E(): 1.8e-38, (45.3% identity in 254 aa overlap); and P43168|YMP3_STRCO HYPOTHETICAL OXIDOREDUCTASE from Streptomyces coelicolor (251 aa), FASTA scores: opt: 623, E(): 1.2e-31, (42.15% identity in 254 aa overlap); and similar to various oxidoreductases (principally acetoacetyl-CoA reductases) e.g. P14697|PHBB_ALCEU ACETOACETYL-CoA REDUCTASE (EC 1.1.1.36) (246 aa) from Alcaligenes eutrophus (Ralstonia eutropha) (246 aa) FASTA scores: opt: 264, E(): 2.3e-09, (29.9% identity in 204 aa overlap); P45375|PHBB_CHRVI ACETOACETYL-CoA REDUCTASE from Chromatium vinosum (246 aa), FASTA scores: opt: 261, E(): 3.5e-09, (27.45% identity in 226 aa overlap); Q9RT30|DR1938 OXIDOREDUCTASE (SHORT-CHAIN DEHYDROGENASE/REDUCTASE FAMILY) from Deinococcus radiodurans (283 aa), FASTA scores: opt: 251, E(): 1.7e-08, (27.55% identity in 236 aa overlap); etc. Also similar to Q10681|YK73_MYCTU|Rv2073c|MT2133|MTCY49.12 PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE from Mycobacterium tuberculosis (249 aa), FASTA scores: opt: 589, E(): 1.5e-29, (41.25% identity in 252 aa overlap). Contains PS00061 Short-chain dehydrogenases/reductases family signature. BELONGS TO THE SHORT-CHAIN DEHYDROGENASES/REDUCTASES (SDR) FAMILY." /codon_start=1 /transl_table=11 /product="short chain dehydrogenase" /protein_id="NP_218308.1" /db_xref="GI:15610927" /db_xref="GeneID:886124" /translation="MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPD DPRREDAAAAMKQAGARSVELIDFDALDTDSHPKMIEAAFSGGDVDVAIVAFGLLGDA EELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAGERVRRANFVY GSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAHLKEAPLTVDKEYVANLAV TASAKGKELVWAPAAFRYVMMVLRHIPRSIFRKLPI" misc_feature 4237603..4237689 /locus_tag="Rv3791" /note="PS00061 Short-chain alcohol dehydrogenase family signature" gene 4237932..4239863 /locus_tag="Rv3792" /db_xref="GeneID:886127" CDS 4237932..4239863 /locus_tag="Rv3792" /function="UNKNOWN" /note="Rv3792, (MTCY13D12.26), len: 643 aa. Probable conserved transmembrane protein, equivalent, but longer 21 aa, to Q9CDA6|ML0107 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (632 aa), FASTA scores: opt: 1981, E(): 2.1e-110, (77.5% identity in 631 aa overlap). C-terminal end highly similar to C-terminus of O05765 PUTATIVE PRODUCT ORF 3 from Mycobacterium smegmatis (603 aa), FASTA scores: opt: 1261, E(): 1.4e-67, (70.7% identity in 266 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218309.1" /db_xref="GI:15610928" /db_xref="GeneID:886127" /translation="MPSRRKSPQFGHEMGAFTSARAREVLVALGQLAAAVVVAVGVAV VSLLAIARVEWPAFPSSNQLHALTTVGQVGCLAGLVGIGWLWRHGRFRRLARLGGLVL VSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDTAALRDMTYIGLPPFYPPG WFWIGGRAAALTGTPAWEMFKPWAITSMAIAVAVALVLWWRMIRFEYALLVTVATAAV MLAYSSPEPYAAMITVLLPPMLVLTWSGLGARDRQGWAAVVGAGVFLGFAATWYTLLV AYGAFTVVLMALLLAGSRLQSGIKAAVDPLCRLAVVGAIAAAIGSTTWLPYLLRAARD PVSDTGSAQHYLPADGAALTFPMLQFSLLGAICLLGTLWLVMRARSSAPAGALAIGVL AVYLWSLLSMLATLARTTLLSFRLQPTLSVLLVAAGAFGFVEAVQALGKRGRGVIPMA AAIGLAGAIAFSQDIPDVLRPDLTIAYTDTDGYGQRGDRRPPGSEKYYPAIDAAIRRV TGKRRDRTVVLTADYSFLSYYPYWGFQGLTPHYANPLAQFDKRATQIDSWSGLSTADE FIAALDKLPWQPPTVFLMRHGAHNSYTLRLAQDVYPNQPNVRRYTVDLRTALFADPRF VVEDIGPFVLAIRKPQESA" gene 4239863..4243147 /gene="embC" /locus_tag="Rv3793" /db_xref="GeneID:886112" CDS 4239863..4243147 /gene="embC" /locus_tag="Rv3793" /EC_number="2.4.2.34" /function="INVOLVED IN THE BIOSYNTHESIS OF THE MYCOBACTERIAL CELL WALL ARABINAN AND RESISTANCE TO ETHAMBUTOL (Emb; dextro-2,2'-(ethylenediimino)-di-1-butanol). POLYMERIZES ARABINOSE INTO THE ARABINAN OF ARABIOGALACTAN [CATALYTIC ACTIVITY: UDP-L-ARABINOSE + INDOL-3-YLACETYL-MYO-INOSITOL = UDP + INDOL-3-YLACETYL-MYO-INOSITOL L-ARABINOSIDE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3793, (MTCY13D12.27), len: 1094 aa. embC, integral membrane protein, indolylacetylinositol arabinosyltransferase (EC 2.4.2.34) (see citations below), equivalent to Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1070 aa) FASTA scores: opt: 6078,E(): 0, (82.95% identity in 1072 aa overlap); Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1074 aa), FASTA scores: opt: 5523, E(): 0, (75.35% identity in 1072 aa overlap). Also similar to Q9CDA9|EMBB| ML0104 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1083 aa), FASTA scores: opt: 2789, E(): 1.9e-156, (44.0% identity in 1095 aa overlap); O30406|EMBB PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1082 aa), FASTA scores: opt: 2746, E(): 6.4e-154, (44.6% identity in 1096 aa overlap); etc. Also similar to to P72030|EMBB|Rv3795|MTCY13D12.29 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1098 aa), FASTA scores: opt: 2276, E(): 3.1e-126, (44.45% identity in 1118 aa overlap); and P72060|EMBA|Rv3794|MTCY13D12.28 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 1974, E(): 1.9e-108, (41.0% identity in 1110 aa overlap). Contains PS00044 Bacterial regulatory proteins, lysR family signature; and PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="integral membrane indolylacetylinositol arabinosyltransferase EmbC (arabinosylindolylacetylinositol synthase)" /protein_id="NP_218310.1" /db_xref="GI:15610929" /db_xref="GeneID:886112" /translation="MATEAAPPRIAVRLPSTSVRDAGANYRIARYVAVVAGLLGAVLA IATPLLPVNQTTAQLNWPQNGTFASVEAPLIGYVATDLNITVPCQAAAGLAGSQNTGK TVLLSTVPKQAPKAVDRGLLLQRANDDLVLVVRNVPLVTAPLSQVLGPTCQRLTFTAH ADRVAAEFVGLVQGPNAEHPGAPLRGERSGYDFRPQIVGVFTDLAGPAPPGLSFSASV DTRYSSSPTPLKMAAMILGVALTGAALVALHILDTADGMRHRRFLPARWWSTGGLDTL VIAVLVWWHFVGANTSDDGYILTMARVSEHAGYMANYYRWFGTPEAPFGWYYDLLALW AHVSTASIWMRLPTLAMALTCWWVISREVIPRLGHAVKTSRAAAWTAAGMFLAVWLPL DNGLRPEPIIALGILLTWCSVERAVATSRLLPVAIACIIGALTLFSGPTGIASIGALL VAIGPLRTILHRRSRRFGVLPLVAPILAAATVTAIPIFRDQTFAGEIQANLLKRAVGP SLKWFDEHIRYERLFMASPDGSIARRFAVLALVLALAVSVAMSLRKGRIPGTAAGPSR RIIGITIISFLAMMFTPTKWTHHFGVFAGLAGSLGALAAVAVTGAAMRSRRNRTVFAA VVVFVLALSFASVNGWWYVSNFGVPWSNSFPKWRWSLTTALLELTVLVLLLAAWFHFV ANGDGRRTARPTRFRARLAGIVQSPLAIATWLLVLFEVVSLTQAMISQYPAWSVGRSN LQALAGKTCGLAEDVLVELDPNAGMLAPVTAPLADALGAGLSEAFTPNGIPADVTADP VMERPGDRSFLNDDGLITGSEPGTEGGTTAAPGINGSRARLPYNLDPARTPVLGSWRA GVQVPAMLRSGWYRLPTNEQRDRAPLLVVTAAGRFDSREVRLQWATDEQAAAGHHGGS MEFADVGAAPAWRNLRAPLSAIPSTATQVRLVADDQDLAPQHWIALTPPRIPRVRTLQ NVVGAADPVFLDWLVGLAFPCQRPFGHQYGVDETPKWRILPDRFGAEANSPVMDHNGG GPLGITELLMRATTVASYLKDDWFRDWGALQRLTPYYPDAQPADLNLGTVTRSGLWSP APLRRG" misc_feature 4239902..4239979 /gene="embC" /locus_tag="Rv3793" /note="PS00044 Bacterial regulatory proteins, lysR family signature" misc_feature 4240148..4240171 /gene="embC" /locus_tag="Rv3793" /note="PS00017 ATP/GTP-binding site motif A" gene 4243233..4246517 /gene="embA" /locus_tag="Rv3794" /db_xref="GeneID:886123" CDS 4243233..4246517 /gene="embA" /locus_tag="Rv3794" /EC_number="2.4.2.34" /function="INVOLVED IN THE BIOSYNTHESIS OF THE MYCOBACTERIAL CELL WALL ARABINAN AND RESISTANCE TO ETHAMBUTOL (Emb; dextro-2,2'-(ethylenediimino)-di-1-butanol). POLYMERIZES ARABINOSE INTO THE ARABINAN OF ARABIOGALACTAN [CATALYTIC ACTIVITY: UDP-L-ARABINOSE + INDOL-3-YLACETYL-MYO-INOSITOL = UDP + INDOL-3-YLACETYL-MYO-INOSITOL L-ARABINOSIDE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3794, (MTCY13D12.28), len: 1094 aa. embA, integral membrane protein, indolylacetylinositol arabinosyltransferase (EC 2.4.2.34) (see citations below), equivalent to P71485|EMBA ARABINOSYL TRANSFERASE from Mycobacterium avium (1108 aa), FASTA scores: opt: 5024, E(): 0, (81.9% identity in 1109 aa overlap); Q9CDA8|EMBA|ML0105 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1111 aa), FASTA scores: opt: 4782, E(): 0, (78.6% identity in 1111 aa overlap); Q50394|EMBA PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1092 aa), FASTA scores: opt: 4100, E(): 0, (67.4% identity in 1092 aa overlap). Also similar to Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1070 aa), FASTA scores: opt: 1933, E(): 1.5e-100, (40.6% identity in 1108 aa overlap); Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1074 aa), FASTA scores: opt: 1870, E(): 5.1e-97, (41.4% identity in 1113 aa overlap); etc. Also similar to P72059|EMBC|Rv3793|MTCY13D12.27 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 1974, E(): 7.7e-103, (40.9% identity in 1110 aa overlap); and P72030|EMBB|Rv3795|MTCY13D12.29 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1098 aa), FASTA scores: opt: 1288, E(): 2.1e-64, (42.5% identity in 1114 aa overlap). Supposed regulated by embR|Rv1267c." /codon_start=1 /transl_table=11 /product="integral membrane indolylacetylinositol arabinosyltransferase EMBA (arabinosylindolylacetylinositol synthase)" /protein_id="NP_218311.1" /db_xref="GI:15610930" /db_xref="GeneID:886123" /translation="MPHDGNERSHRIARLAAVVSGIAGLLLCGIVPLLPVNQTTATIF WPQGSTADGNITQITAPLVSGAPRALDISIPCSAIATLPANGGLVLSTLPAGGVDTGK AGLFVRANQDTVVVAFRDSVAAVAARSTIAAGGCSALHIWADTGGAGADFMGIPGGAG TLPPEKKPQVGGIFTDLKVGAQPGLSARVDIDTRFITTPGALKKAVMLLGVLAVLVAM VGLAALDRLSRGRTLRDWLTRYRPRVRVGFASRLADAAVIATLLLWHVIGATSSDDGY LLTVARVAPKAGYVANYYRYFGTTEAPFDWYTSVLAQLAAVSTAGVWMRLPATLAGIA CWLIVSRFVLRRLGPGPGGLASNRVAVFTAGAVFLSAWLPFNNGLRPEPLIALGVLVT WVLVERSIALGRLAPAAVAIIVATLTATLAPQGLIALAPLLTGARAIAQRIRRRRATD GLLAPLAVLAAALSLITVVVFRDQTLATVAESARIKYKVGPTIAWYQDFLRYYFLTVE SNVEGSMSRRFAVLVLLFCLFGVLFVLLRRGRVAGLASGPAWRLIGTTAVGLLLLTFT PTKWAVQFGAFAGLAGVLGAVTAFTFARIGLHSRRNLTLYVTALLFVLAWATSGINGW FYVGNYGVPWYDIQPVIASHPVTSMFLTLSILTGLLAAWYHFRMDYAGHTEVKDNRRN RILASTPLLVVAVIMVAGEVGSMAKAAVFRYPLYTTAKANLTALSTGLSSCAMADDVL AEPDPNAGMLQPVPGQAFGPDGPLGGISPVGFKPEGVGEDLKSDPVVSKPGLVNSDAS PNKPNAAITDSAGTAGGKGPVGINGSHAALPFGLDPARTPVMGSYGENNLAATATSAW YQLPPRSPDRPLVVVSAAGAIWSYKEDGDFIYGQSLKLQWGVTGPDGRIQPLGQVFPI DIGPQPAWRNLRFPLAWAPPEADVARIVAYDPNLSPEQWFAFTPPRVPVLESLQRLIG SATPVLMDIATAANFPCQRPFSEHLGIAELPQYRILPDHKQTAASSNLWQSSSTGGPF LFTQALLRTSTIATYLRGDWYRDWGSVEQYHRLVPADQAPDAVVEEGVITVPGWGRPG PIRALP" gene 4246514..4249810 /gene="embB" /locus_tag="Rv3795" /db_xref="GeneID:886126" CDS 4246514..4249810 /gene="embB" /locus_tag="Rv3795" /EC_number="2.4.2.34" /function="INVOLVED IN THE BIOSYNTHESIS OF THE MYCOBACTERIAL CELL WALL ARABINAN AND RESISTANCE TO ETHAMBUTOL (Emb; dextro-2,2'-(ethylenediimino)-di-1-butanol). POLYMERIZES ARABINOSE INTO THE ARABINAN OF ARABIOGALACTAN [CATALYTIC ACTIVITY: UDP-L-ARABINOSE + INDOL-3-YLACETYL-MYO-INOSITOL = UDP + INDOL-3-YLACETYL-MYO-INOSITOL L-ARABINOSIDE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3795, (MTCY13D12.29), len: 1098 aa. embB, integral membrane protein, indolylacetylinositol arabinosyltransferase (EC 2.4.2.34) (see citations below), equivalent to P71486|EMBB ARABINOSYL TRANSFERASE from Mycobacterium avium (1065 aa), FASTA scores: opt: 4998, E(): 0, (83.25% identity in 1076 aa overlap); Q9CDA9|EMBB|ML0104 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1083 aa), FASTA scores: opt: 4706, E(): 0, (78.0% identity in 1101 aa overlap); O30406|EMBB (alias Q50395) PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1082 aa), FASTA scores: opt: 4163, E(): 0, (68.4% identity in 1091 aa overlap); etc. Also similar to Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium smegmatis (1074 aa), FASTA scores: opt: 2482, E(): 5e-135, (44.7% identity in 1101 aa overlap); Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL TRANSFERASE from Mycobacterium leprae (1070 aa), FASTA scores: opt: 2259, E(): 3.4e-122, (43.4% identity in 1104 aa overlap); etc. Also similar to P72059|EMBC|Rv3793|MTCY13D12.27 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 2276, E(): 3.6e-123, (44.45% identity in 1118 aa overlap); and P72060|EMBA|Rv3794|MTCY13D12.28 INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis (1094 aa), FASTA scores: opt: 1288, E(): 2.5e-66, (42.35% identity in 1114 aa overlap). Supposed regulated by embR|Rv1267c." /codon_start=1 /transl_table=11 /product="integral membrane indolylacetylinositol arabinosyltransferase EMBB (arabinosylindolylacetylinositol synthase)" /protein_id="NP_218312.1" /db_xref="GI:15610931" /db_xref="GeneID:886126" /translation="MTQCASRRKSTPNRAILGAFASARGTRWVATIAGLIGFVLSVAT PLLPVVQTTAMLDWPQRGQLGSVTAPLISLTPVDFTATVPCDVVRAMPPAGGVVLGTA PKQGKDANLQALFVVVSAQRVDVTDRNVVILSVPREQVTSPQCQRIEVTSTHAGTFAN FVGLKDPSGAPLRSGFPDPNLRPQIVGVFTDLTGPAPPGLAVSATIDTRFSTRPTTLK LLAIIGAIVATVVALIALWRLDQLDGRGSIAQLLLRPFRPASSPGGMRRLIPASWRTF TLTDAVVIFGFLLWHVIGANSSDDGYILGMARVADHAGYMSNYFRWFGSPEDPFGWYY NLLALMTHVSDASLWMRLPDLAAGLVCWLLLSREVLPRLGPAVEASKPAYWAAAMVLL TAWMPFNNGLRPEGIIALGSLVTYVLIERSMRYSRLTPAALAVVTAAFTLGVQPTGLI AVAALVAGGRPMLRILVRRHRLVGTLPLVSPMLAAGTVILTVVFADQTLSTVLEATRV RAKIGPSQAWYTENLRYYYLILPTVDGSLSRRFGFLITALCLFTAVFIMLRRKRIPSV ARGPAWRLMGVIFGTMFFLMFTPTKWVHHFGLFAAVGAAMAALTTVLVSPSVLRWSRN RMAFLAALFFLLALCWATTNGWWYVSSYGVPFNSAMPKIDGITVSTIFFALFAIAAGY AAWLHFAPRGAGEGRLIRALTTAPVPIVAGFMAAVFVASMVAGIVRQYPTYSNGWSNV RAFVGGCGLADDVLVEPDTNAGFMKPLDGDSGSWGPLGPLGGVNPVGFTPNGVPEHTV AEAIVMKPNQPGTDYDWDAPTKLTSPGINGSTVPLPYGLDPARVPLAGTYTTGAQQQS TLVSAWYLLPKPDDGHPLVVVTAAGKIAGNSVLHGYTPGQTVVLEYAMPGPGALVPAG RMVPDDLYGEQPKAWRNLRFARAKMPADAVAVRVVAEDLSLTPEDWIAVTPPRVPDLR SLQEYVGSTQPVLLDWAVGLAFPCQQPMLHANGIAEIPKFRITPDYSAKKLDTDTWED GTNGGLLGITDLLLRAHVMATYLSRDWARDWGSLRKFDTLVDAPPAQLELGTATRSGL WSPGKIRIGP" gene 4249878..4251005 /locus_tag="Rv3796" /db_xref="GeneID:886128" CDS 4249878..4251005 /locus_tag="Rv3796" /function="UNKNOWN" /note="Rv3796, (MTV026.01-MTCY13D12.30), len: 375 aa. Conserved hypothetical protein. C-terminal end similar in part to Q983J3|MLR8305 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (227 aa), FASTA scores: opt: 288, E(): 4e-09, (38.95% identity in 154 aa overlap). Similar to P54548|YQJK_BACSU HYPOTHETICAL PROTEIN (BELONGS TO THE ATSA/ELAC FAMILY) from Bacillus subtilis (307 aa) FASTA scores: opt: 263, E(): 1.3e-07, (26.1% identity in 295 aa overlap); and some similarity to other proteins e.g. AAK46775|MT2479 PUTATIVE ARYLSULFATASE from Mycobacterium tuberculosis strain CDC1551 (224 aa), FASTA scores: opt: 194, E(): 0.00072, (25.85% identity in 259 aa overlap). Equivalent to AAK48269 from Mycobacterium tuberculosis strain CDC1551 (338 aa) but longer 37 aa. SOME SIMILARITY TO THE A. CARRAGEENOVORA ATSA / E. COLI ELAC FAMILY. Note that previously known as atsH.; atsH" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_178016.1" /db_xref="GI:57117157" /db_xref="GeneID:886128" /translation="MLLGMHQAGHVGTHERRAAATRRSALTAAGLAVVGAGVLGASAC SPQKSPQPSSPRLPDNALITLGVAAGPPPTPSRVGISSVLKIGRDLYVIDCGLGSLNA FTNAGLQFDDLKAMFITHLHTDHIVDYYNFFLSGGFLAPPGRAPVLVYGPGPAGGLPP SEVGNPNPATVNPANPTPGLAAATEALHRAFAYTSNIFIRDYGIDNVADLVKVTEIGL PPGSDYRNRAPKMSPFSVASDDNVSVTATLVSHYDVYPAFGFRFDLKKSGVSVTFSGD TTKSDNLITLAQGTDILVHEAVFSLDTAYFGNAFPPNYLVNSHTSAEQVGEVAAAAKP KQLILSHYAPDDLPDSQWLDKIKKNYSGMTTIARDGQVFAL" gene 4251085..4252866 /gene="fadE35" /locus_tag="Rv3797" /db_xref="GeneID:886122" CDS 4251085..4252866 /gene="fadE35" /locus_tag="Rv3797" /EC_number="1.3.99.-" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="Rv3797, (MTV026.02), len: 593 aa. Probable fadE35, acyl-CoA dehydrogenase (EC 1.3.99.-), similar to many e.g. Q9HY33|PA3593 from Pseudomonas aeruginosa (575 aa) FASTA scores: opt: 838, E(): 2.1e-46, (35.3% identity in 569 aa overlap); Q9ANZ8|AIDB from Burkholderia pseudomallei (Pseudomonas pseudomallei) (554 aa), FASTA scores: opt: 633, E(): 3.4e-33, (33.1% identity in 480 aa overlap); Q9HX44|PA3972 from Pseudomonas aeruginosa (549 aa) FASTA scores: opt: 560, E(): 1.7e-28, (29.9% identity in 569 aa overlap); P33224|AIDB_ECOLI|B4187 from Escherichia coli strain K12 (541 aa), FASTA scores: opt: 455, E(): 1e-21, (31.15% identity in 514 aa overlap); etc. Also similar to O86368|FADE8|Rv0672|MTCI376.02c ACYL-CoA DEHYDROGENASE from Mycobacterium tuberculosis (542 aa), FASTA scores: opt: 479, E(): 2.9e-23, (32.2% identity in 460 aa overlap). COULD BELONG TO THE ACYL-CoA DEHYDROGENASES FAMILY." /codon_start=1 /transl_table=11 /product="acyl-CoA dehydrogenase FADE35" /protein_id="NP_218314.1" /db_xref="GI:15610933" /db_xref="GeneID:886122" /translation="MPEYDLEAVDKLPFSTPEKAQRYQTENYRGAMGLNWYLTDPTLQ FIMAYYLRPDELAFAEPHLTRIGELTGGPVTRWAEETDRNPPRLERYDRWGHDISRVV LPESFIQSKRAVIEARQAVRDDAARAGVKPSLALFAADYLLNQADIGMACALATGGNM VRSLVTAYAPPDVREFVLGKLNSGEWDGEAAQLLTERAGGSDLGALETTATRSGDVWL LNGFKWFASNCAGEAFVVLAKPEGAPDSTRGVATFLVLRTRRDGSRNGVRIRRLKDKL GTRSVASGEIEFVDAEAFLLSGEPSADAGPSDGKGLTRMMELTNRLRLGTASFALGNA RRALVESLCYAGQRRAFGGALIDKPLMRRKLAEMVVDVEAALAMVFDGFGAANHRQPR CLPQRIAVPVTKLKTCRLGITVASDAIEIHGGNGYIETWPVARLLRDAQVNTIWEGPD NILCLDVRRGIEQTRAHETLLARLRDAVSVSDDDDTTRLVSRRIEDLDAAITAWTKLD RQLAEARLFPLAQFMGDVYAGALLTEQAAWERATRGTDRKALVARLYARRYLADQGPL RGIDADCDEALQRFDELVAGAFTAEQT" gene 4252993..4254327 /locus_tag="Rv3798" /db_xref="GeneID:886268" CDS 4252993..4254327 /locus_tag="Rv3798" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION SEQUENCE ELEMENT IS1557." /note="Rv3798, (MTV026.03), len: 444 aa. Probable transposase for insertion sequence element IS1557, highly similar to Q60255 SIMILAR TO TRANSPOSASE OF ISAE1 FROM ALCALIGENES EUTROPHUS H1-4 (FRAGMENT) from dibenzofuran-degrading bacterium DPO360 (163 aa) FASTA scores: opt: 767, E(): 3.2e-42, (67.25% identity in 168 aa overlap); and similar to P74920 TRANSPOSASE from Thiobacillus ferrooxidans (404 aa), FASTA scores: opt: 375, E(): 1.1e-16, (27.55% identity in 439 aa overlap); Q48349 TRANSPOSASE from Alcaligenes eutrophus (Ralstonia eutropha) (408 aa), FASTA scores: opt: 324, E(): 2e-13, (3.9% identity in 369 aa overlap); Q9FDC1|TNP TRANSPOSASE from Burkholderia mallei (Pseudomonas mallei) (386 aa) FASTA scores: opt: 282, E(): 9.8e-11, (25.85% identity in 391 aa overlap); etc. C-terminal end identical to O53804|Rv0741|MTV041.15 TRANSPOSASE from Mycobacterium tuberculosis (104 aa), FASTA scores: opt: 582, E(): 1.8e-30, (85.6% identity in 104 aa overlap). BELONGS TO THE TRANSPOSASE FAMILY 12." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218315.1" /db_xref="GI:15610934" /db_xref="GeneID:886268" /translation="MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSA VLRRCGRCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVPWA RHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADTEKRIDRFANL RRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATLGLFFDALGAERAAQITHV SADAADWIADVVTERCPDAIQCADPFHVVAWATEALDVERRRAWNDARAIARTEPKWG RGRPGKNAAPRPGRERARRLKGARYALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLL KESLRHVFSVKGEEGKQALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQ GLIESTNTKIRLLTRIAFGFRSPQALIALAMLTLAGHRPTLPGRHNHPQISQ" repeat_region 4252993..4254324 /note="IS1557-3, len: 1332 bp. Insertion sequence IS1557." /mobile_element="insertion sequence:IS1557-3" gene complement(4254380..4255948) /gene="accD4" /locus_tag="Rv3799c" /db_xref="GeneID:886131" CDS complement(4254380..4255948) /gene="accD4" /locus_tag="Rv3799c" /EC_number="6.4.1.3" /function="KEY ENZYME IN THE CATABOLIC PATHWAY OF ODD-CHAIN FATTY ACIDS, ISOLEUCINE, THREONINE, METHIONINE, AND VALINE [CATALYTIC ACTIVITY: ATP + PROPIONYL-CoA + CO(2) + H(2)O = ADP + ORTHOPHOSPHATE + METHYLMALONYL-COA.]" /experiment="experimental evidence, no additional details recorded" /note="Rv3799c, (MTV026.04c), len: 522 aa. Probable accD4, propyonyl-CoA carboxylase beta chain 4 (EC 6.4.1.3), equivalent to Q9CDB0|ACCD4|ML0102 PUTATIVE ACYL COA CARBOXYLASE from Mycobacterium leprae (517 aa) FASTA scores: opt: 3154, E(): 8e-187, (91.2% identity in 511 aa overlap) . Also similar to many e.g. Q9X4K7|PCCB from Streptomyces coelicolor (530 aa), FASTA scores: opt: 1714, E(): 4.4e-98, (50.0% identity in 510 aa overlap); P53003|PCCB_SACER from Saccharopolyspora erythraea (Streptomyces erythraeus) (546 aa), FASTA scores: opt: 1549, E(): 6.6e-88, (50.65% identity in 519 aa overlap); Q9WZH5|TM0716 from Thermotoga maritima (515 aa) FASTA scores: opt: 1529, E(): 1.1e-86, (46.7% identity in 512 aa overlap); etc. Also similar to P53002|PCCB_MYCLE|ACCD5|PCCB|ML0731|B1308_C1_125 PROBABLE PROPIONYL-CoA CARBOXYLASE BETA CHAIN 5 from Mycobacterium leprae (549 aa), FASTA scores: opt: 1493, E(): 1.9e-84, (49.8% identity in 514 aa overlap); and P96885|PCC5_MYCTU|ACCD5|PCCB|Rv3280|MT3379.1|MTCY71.20 PROBABLE PROPIONYL-CoA CARBOXYLASE BETA CHAIN 5 from Mycobacterium tuberculosis (548 aa), FASTA scores: opt: 1471, E(): 4.2e-83, (49.15% identity in 515 aa overlap). BELONGS TO THE ACCD/PCCB FAMILY. Length extended since first submission (+5 aa)." /codon_start=1 /transl_table=11 /product="propionyl-CoA carboxylase beta chain" /protein_id="NP_218316.2" /db_xref="GI:57117158" /db_xref="GeneID:886131" /translation="MTVTEPVLHTTAEKLAELRERLELAKEPGGEKAAAKRDKKGIPS ARARIYELVDPGSFMEIGALCRTPGDPNALYGDGVVTGHGLINGRPVGVFSHDQTVFG GTVGEMFGRKVARLMEWCAMVGCPIVGINDSGGARIQDAVTSLAWYAELGRRHELLSG LVPQISIILGKCAGGAVYSPIQTDLVVAVRDQGYMFVTGPDVIKDVTGEDVSLDELGG ADHQASYGNIHQVVESEAAAYQYVRDFLSFLPSNCFDKPPVVNPGLEPEITGHDLELD SIVPDSDNMAYDMHEVLLRIFDDGDFLDVAAQAGQAIITGYARVDGRTVGVVANQPMH MSGAIDNEASDKAARFIRFSDAFDIPLVFVVDTPGFLPGVEQEKNGIIKRGGRFLYAV VEADVPKVTITIRKSYGGAYAVMGSKQLTADLNFAWPTARIAVIGADGAAQLLMKRFP DPNAPEAQAIRKSFVENYNLNMAIPWIAAERGFIDAVIDPHETRLLLRKSMHLLRDKQ LWWRVGRKHGLIPV" gene complement(4255945..4261146) /gene="pks13" /locus_tag="Rv3800c" /db_xref="GeneID:886133" CDS complement(4255945..4261146) /gene="pks13" /locus_tag="Rv3800c" /function="UNKNOWN; SUPPOSED INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3800c, (MTV026.05c), len: 1733 aa. Probable pks13, polyketide synthase (EC undetermined), equivalent to Q9CDB1|PKS13|ML0101 POLYKETIDE SYNTHASE from Mycobacterium leprae (1784 aa), FASTA scores: opt: 7454, E(): 0, (83.6% identity in 1748 aa overlap); and similar to Q9Z5K6|ML2357|MLCB12.02c PUTATIVE POLYKETIDE SYNTHASE from Mycobacterium leprae (1871 aa), FASTA scores: opt: 1682, E(): 1.2e-85, (38.3% identity in 1096 aa overlap). Also similar in part to many e.g. Q9ADL6|SORA SORAPHEN POLYKETIDE SYNTHASE A from Polyangium cellulosum (6315 aa) FASTA scores: opt: 1422, E(): 1e-70, (31.45% identity in 1616 aa overlap); AAK73501|AMPHI AMPHI PROTEIN (involved in amphotericin biosynthesis) from Streptomyces nodosus (9510 aa), FASTA scores: opt: 1441, E(): 1.2e-71, (30.45% identity in 1662 aa overlap); Q9RFL0|MTAB MTAB PROTEIN (involved in myxothiazol biosynthesis) from Stigmatella aurantiaca (4003 aa), FASTA scores: opt: 1429, E(): 2.8e-71, (33.8% identity in 1089 aa overlap); Q9L4X2|NYSJ from Streptomyces noursei (5435 aa), FASTA scores: opt: 1407, E(): 6.1e-70, (30.5% identity in 1764 aa overlap); CAC37876|SC1G7.01c from Streptomyces coelicolor (3489 aa) FASTA scores: opt: 1382, E(): 1e-68, (31.05% identity in 1489 aa overlap); etc. Also highly similar to Q10977|PPSA_MYCTU|Rv2931|MT3000|MTCY338.20 PHENOLPTHIOCEROL SYNTHESIS POLYKETIDE SYNTHASE from Mycobacterium tuberculosis (1876 aa), FASTA scores: opt: 1728, E(): 3.4e-88, (36.95% identity in 1269 aa overlap); and P96203|PPSD|Rv2934|MTCY19H9.02. Contains PS00606 Beta-ketoacyl synthases active site." /codon_start=1 /transl_table=11 /product="polyketide synthase PKS13" /protein_id="NP_218317.1" /db_xref="GI:15610936" /db_xref="GeneID:886133" /translation="MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDE SVPMVELGLSSRDAVAMAADIEDLTGVTLSVAVAFAHPTIESLATRIIEGEPETDLAG DDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGITDLPDGRWSEF LEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADNIDPQQRMALELTWEALEH ARIPASSLRGQAVGVYIGSSTNDYSFLAVSDPTVAHPYAITGTSSSIIANRVSYFYDF HGPSVTIDTACSSSLVAIHQGVQALRNGEADVVVAGGVNALITPMVTLGFDEIGAVLA PDGRIKSFSADADGYTRSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLI APNQDAQADVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADR PALLGAVKTNVGHLESAAGAASMAKVVLALQHDKLPPSINFAGPSPYIDFDAMRLKMI TTPTDWPRYGGYALAGVSSFGFGGANAHVVVREVLPRDVVEKEPEPEPEPKAAAEPAE APTLAGHALRFDEFGNIITDSAVAEEPEPELPGVTEEALRLKEAALEELAAQEVTAPL VPLAVSAFLTSRKKAAAAELADWMQSPEGQASSLESIGRSLSRRNHGRSRAVVLAHDH DEAIKGLRAVAAGKQAPNVFSVDGPVTTGPVWVLAGFGAQHRKMGKSLYLRNEVFAAW IEKVDALVQDELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVI GQSLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEYSADEIR EVFSDFPDLEVCVYAAPTQTVIGGPPEQVDAILARAEAEGKFARKFATKGASHTSQMD PLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGGEPIHDVEYWKKGLRHSVYFTHG IRNAVDSGHTTFLELAPNPVALMQVALTTADAGLHDAQLIPTLARKQDEVSSMVSTMA QLYVYGHDLDIRTLFSRASGPQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVAL PDGRHVWEYAPRDGNVDLAALVRAAAAHVLPDAQLTAAEQRAVPGDGARLVTTMTRHP GGASVQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPETPAED ADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEPEDLPWEVPLIEL GLDSLMAVRIKNRVEYDFDLPPIQLTAVRDANLYNVEKLIEYAVEHRDEVQQLHEHQK TQTAEEIARAQAELLHGKVGKTEPVDSEAGVALPSPQNGEQPNPTGPALNVDVPPRDA AERVTFATWAIVTGKSPGGIFNELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNI EALADKVRTYLEAGQIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPAD TPMYGFERVEGSIEERAQQYVPKLIEMQGDGPYVLVGWSLGGVLAYACAIGLRRLGKD VRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYEQLEELDDEGQ VRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQPYDGHVTLYMADRYHDDAI MFEPRYAVRQPDGGWGEYVSDLEVVPIGGEHIQAIDEPIIAKVGEHMSRALGQIEADR TSEVGKQ" misc_feature complement(4260265..4260315) /gene="pks13" /locus_tag="Rv3800c" /note="PS00606 Beta-ketoacyl synthases active site" gene complement(4261153..4263066) /gene="fadD32" /locus_tag="Rv3801c" /db_xref="GeneID:886130" CDS complement(4261153..4263066) /gene="fadD32" /locus_tag="Rv3801c" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /experiment="experimental evidence, no additional details recorded" /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="long-chain-fatty-acid--CoA ligase" /protein_id="NP_218318.1" /db_xref="GI:15610937" /db_xref="GeneID:886130" /translation="MFVTGESGMAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKL AYRFLDFSTERDGVARDILWSDFSARNRAVGARLQQVTQPGDRVAILCPQNLDYLISF FGALYSGRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVRKFIRARSAKE RPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRIPSGVQITHLNLPTNVVQV LNALEGQEGDRGVSWLPFFHDMGLITVLLASVLGHSFTFMTPAAFVRRPGRWIRELAR KPGETGGTFSAAPNFAFEHAAVRGVPRDDEPPLDLSNVKGILNGSEPVSPASMRKFFE AFAPYGLKQTAVKPSYGLAEATLFVSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPN AVAQVSAGKVGVSEWAVIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFK NILKSRISESRAEGAPDDALWVRTGDYGTYFKDHLYIAGRIKDLVIIDGRNHYPQDLE CTAQESTKALRVGYAAAFSVPANQLPQTVFDDSHAGLKFDPEDTSEQLVIVGERAAGT HKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGTIPRTSSGKIGRRACRAAYLDGSLR SGVGSPTVFATSD" gene complement(4263355..4264365) /locus_tag="Rv3802c" /db_xref="GeneID:886135" CDS complement(4263355..4264365) /locus_tag="Rv3802c" /function="UNKNOWN" /note="Rv3802c, (MTV026.07c), len: 336 aa. Probable conserved membrane protein, with a N-terminal signal sequence followed by Pro-rich region. Equivalent to Q9CDB3|ML0099 HYPOTHETICAL PROTEIN from Mycobacterium leprae (336 aa) FASTA scores: opt: 1759, E(): 1.1e-85, (75.5% identity in 335 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218319.1" /db_xref="GI:15610938" /db_xref="GeneID:886135" /translation="MAKNSRRKRHRILAWIAAGAMASVVALVIVAVVIMLRGAESPPS AVPPGVLPPGPTPAHPHKPRPAFQDASCPDVQMISVPGTWESSPQQNPLNPVQFPKAL LLKVTGPIAQQFAPARVQTYTVAYTAQFHNPLTTDNQMSYNDSRAEGTRAMVAAMTDM NNRCPLTSYVLIGFSQGAVIAGDVASDIGNGRGPVDEDLVLGVTLIADGRRQQGVGNQ VPPSPRGEGAEITLHEVPVLSGLGLTMTGPRPGGFGALDGRTNEICAQGDLICAAPAQ AFSPANLPTTLNTLAGGAGQPVHAMYATPEFWNSDGEPATEWTLNWAHQLIENAPHPK HR" gene complement(4264563..4265462) /gene="fbpD" /locus_tag="Rv3803c" /db_xref="GeneID:886121" CDS complement(4264563..4265462) /gene="fbpD" /locus_tag="Rv3803c" /EC_number="2.3.1.-" /function="INVOLVED IN CELL WALL MYCOLOYLATION. PROTEINS OF THE ANTIGEN 85 COMPLEX ARE RESPONSIBLE FOR THE HIGH AFFINITY OF MYCOBACTERIA TO FIBRONECTIN. POSSESSES A MYCOLYLTRANSFERASE ACTIVITY REQUIRED FOR THE BIOGENESIS OF TREHALOSE DIMYCOLATE (CORD FACTOR), A DOMINANT STRUCTURE NECESSARY FOR MAINTAINING CELL WALL INTEGRITY." /experiment="experimental evidence, no additional details recorded" /note="Rv3803c, (MT3910, MTV026.08c), len: 299 aa. fbpD (alternate gene names: mpt51, mpb51, fbpC1), secreted MPB51/MPT51 antigen protein (fibronectin-binding protein C) (mycolyl transferase 85C) (EC 2.3.1.-) (see citations below), identical to Q48923|MPT51|MPB51 ANTIGEN PRECURSOR from Mycobacterium bovis (299 aa), FASTA scores: opt: 2093, E(): 1.5e-112, (100.0% identity in 299 aa overlap) (see Ohara et al., 1995); and highly similar to other Mycobacterial antigen precursors e.g. Q05868|MPT5_MYCLE|MPT51|ML0098 MPT51 ANTIGEN PRECURSOR from Mycobacterium leprae (301 aa), FASTA scores: opt: 1624, E(): 9.8e-86, (77.8% identity in 302 aa overlap); O52972|A85C_MYCAV|FBPC ANTIGEN 85-C PRECURSOR (FIBRONECTIN-BINDING PROTEIN C) from Mycobacterium avium (352 aa), FASTA scores: opt: 753, E(): 6.6e-36, (41.5% identity in 315 aa overlap); P21160|A85B_MYCKA ANTIGEN 85-B PRECURSOR (FIBRONECTIN-BINDING PROTEIN B) from Mycobacterium kansasii (325 aa), FASTA scores: opt: 574, E(): 1.1e-25, (37.55% identity in 309 aa overlap); P12942|A85B_MYCBO ANTIGEN 85-B PRECURSOR from Mycobacterium bovis (323 aa), FASTA scores: opt: 572, E(): 1.4e-25, (39.85% identity in 291 aa overlap); etc. Also similar to P31953|A85C_MYCTU|FBPC|MPT45|Rv0129c|MTCI5.03c|FBPC2 SECRETED ANTIGEN 85-C (MYCOLYL TRANSFERASE 85C) (FIBRONECTIN-BINDING PROTEIN C) from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 751, E(): 8.4e-36, (40.65% identity in 310 aa overlap); P17944|A85A_MYCTU|FBPA|MPT44|Rv3804c|MT3911|MTV026.09c SECRETED ANTIGEN 85-A (MYCOLYL TRANSFERASE 85A) (FIBRONECTIN-BINDING PROTEIN A) from Mycobacterium tuberculosis (338 aa), FASTA scores: opt: 592, E(): 1e-26, (39.05% identity in 302 aa overlap); etc. Contains PS00178 Aminoacyl-transfer RNA synthetases class-I signature. Note that the secreted protein MPB51 is one of the major proteins in the culture filtrate of Mycobacterium bovis BCG.; mpt51; mpb51; fbpC1" /codon_start=1 /transl_table=11 /product="secreted MPT51/MPB51 antigen protein FBPD (MPT51/MPB51 antigen 85 complex C) (AG58C) (Mycolyl transferase 85C) (fibronectin-binding protein C) (85C)" /protein_id="YP_178017.1" /db_xref="GI:57117159" /db_xref="GeneID:886121" /translation="MKGRSALLRALWIAALSFGLGGVAVAAEPTAKAAPYENLMVPSP SMGRDIPVAFLAGGPHAVYLLDAFNAGPDVSNWVTAGNAMNTLAGKGISVVAPAGGAY SMYTNWEQDGSKQWDTFLSAELPDWLAANRGLAPGGHAAVGAAQGGYGAMALAAFHPD RFGFAGSMSGFLYPSNTTTNGAIAAGMQQFGGVDTNGMWGAPQLGRWKWHDPWVHASL LAQNNTRVWVWSPTNPGASDPAAMIGQAAEAMGNSRMFYNQYRSVGGHNGHFDFPASG DNGWGSWAPQLGAMSGDIVGAIR" misc_feature complement(4265214..4265249) /gene="fbpD" /locus_tag="Rv3803c" /note="PS00178 Aminoacyl-transfer RNA synthetases class-I signature, P." gene complement(4265642..4266658) /gene="fbpA" /locus_tag="Rv3804c" /db_xref="GeneID:886132" CDS complement(4265642..4266658) /gene="fbpA" /locus_tag="Rv3804c" /EC_number="2.3.1.-" /function="INVOLVED IN CELL WALL MYCOLOYLATION. PROTEINS OF THE ANTIGEN 85 COMPLEX ARE RESPONSIBLE FOR THE HIGH AFFINITY OF MYCOBACTERIA TO FIBRONECTIN. POSSESSES A MYCOLYLTRANSFERASE ACTIVITY REQUIRED FOR THE BIOGENESIS OF TREHALOSE DIMYCOLATE (CORD FACTOR), A DOMINANT STRUCTURE NECESSARY FOR MAINTAINING CELL WALL INTEGRITY." /experiment="experimental evidence, no additional details recorded" /note="Rv3804c, (MT3911, MTV026.09c), len: 338 aa. fbpA (alternate gene names: mpt44, 85A), precursor of the 85-A antigen (fibronectin-binding protein A) (mycolyl transferase 85A) (EC 2.3.1.-) (see citations below), identical to P17944|P17996|FBPA|MPT44 ANTIGEN 85-A PRECURSOR from Mycobacterium bovis (338 aa), FASTA scores: opt: 2341, E(): 1.2e-132, (100.0% identity in 338 aa overlap); and highly similar to other Mycobacterial antigen precursors e.g. O52956|A85A_MYCAV|FBPA ANTIGEN 85-A PRECURSOR (85A) from Mycobacterium avium (347 aa), FASTA scores: opt: 1987, E(): 1.7e-111, (82.55% identity in 338 aa overlap); Q05861|A85A_MYCLE|FBPA|ML0097 ANTIGEN 85-A PRECURSOR (85A) from Mycobacterium leprae (330 aa), FASTA scores: opt: 1936, E(): 1.9e-108, (83.0% identity in 329 aa overlap); O06052|A85A_MYCGO|FBPA ANTIGEN 85-A PRECURSOR (85A) from Mycobacterium gordonae (339 aa), FASTA scores: opt: 1932, E(): 3.3e-108, (80.45% identity in 338 aa overlap); etc. Also highly similar to P31952|A85B_MYCTU|FBPB|Rv1886c|MT1934|MTCY180.32 SECRETED ANTIGEN 85-B from Mycobacterium tuberculosis (325 aa), FASTA scores: opt: 1830, E(): 3.9e-102, (78.85% identity in 317 aa overlap); P31953|A85C_MYCTU|FBPC|MPT45|Rv0129c|MTCI5.03c|FBPC2 SECRETED ANTIGEN 85-C from Mycobacterium tuberculosis (340 aa), FASTA scores: opt: 1597, E(): 3.4e-88, (67.25% identity in 336 aa overlap).; mpt44; 85A" /codon_start=1 /transl_table=11 /product="secreted antigen 85-A FBPA (Mycolyl transferase 85A) (fibronectin-binding protein A) (antigen 85 complex A)" /protein_id="NP_218321.1" /db_xref="GI:15610940" /db_xref="GeneID:886132" /translation="MQLVDRVRGAVTGMSRRLVVGAVGAALVSGLVGAVGGTATAGAF SRPGLPVEYLQVPSPSMGRDIKVQFQSGGANSPALYLLDGLRAQDDFSGWDINTPAFE WYDQSGLSVVMPVGGQSSFYSDWYQPACGKAGCQTYKWETFLTSELPGWLQANRHVKP TGSAVVGLSMAASSALTLAIYHPQQFVYAGAMSGLLDPSQAMGPTLIGLAMGDAGGYK ASDMWGPKEDPAWQRNDPLLNVGKLIANNTRVWVYCGNGKPSDLGGNNLPAKFLEGFV RTSNIKFQDAYNAGGGHNGVFDFPDSGTHSWEYWGAQLNAMKPDLQRALGATPNTGPA PQGA" gene complement(4266953..4268836) /locus_tag="Rv3805c" /db_xref="GeneID:886138" CDS complement(4266953..4268836) /locus_tag="Rv3805c" /function="UNKNOWN" /note="Rv3805c, (MTV026.10c), len: 627 aa. Probable conserved transmembrane protein, equivalent, but shorter 19 aa, to Q9CDB4|ML0096 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (649 aa), FASTA scores: opt: 3511, E(): 1.1e-204, (80.9% identity in 629 aa overlap). Equivalent to AAK48278 from Mycobacterium tuberculosis strain CDC1551 (641 aa) but shorter 14 aa." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218322.1" /db_xref="GI:15610941" /db_xref="GeneID:886138" /translation="MVRVSLWLSVTAVAVLFGWGSWQRRWIADDGLIVLRTVRNLLAG NGPVFNQGERVEANTSTAWTYLLYVGGWVGGPMRLEYVALALAMVLSLLGMVLLMLGT GRLYAPSLRGRRAIMLPAGALVYIAVPPARDFATSGLESGLVLAYLGLLWWMMVCWSQ PLRARPDSQMFLGALAFVAGCSVLVRPEFALIGGLALIMMLIAARTWRRRVLIVLAGG FLPVAYQIFRMGYYGLLVPSTALAKDAAGDKWSQGMIYVSNFNRPYALWVPLVLSVPL GLLLMTARRRPSFLRPVLAPDYGRVARAVQSPPAVVAFIVGSGVLQALYWIRQGGDFM HGRVLLAPLFCLLAPVGVIPILLPDGKDFSRETGRWLVGALSGLWLGIAGWSLWAANS PGMGDDATRVTYSGIVDERRFYAQATGHAHPLTAADYLDYPRMAAVLTALNNTPEGAL LLPSGNYNQWDLVPMIRPSSGTAPGGKPAPKPQHAVFFTNMGMLGMNVGLDVRVIDQI GLVNPLAAHTERLKHARIGHDKNLFPDWVIADGPWVKWYPGIPGYIDQQWVTQAEAAL QCPATRAVLNSVRAPITLHRFLSNVLHSYEFTRYRIDRVPRYELVRCGLDVPDGPGPP PRE" gene complement(4268925..4269833) /locus_tag="Rv3806c" /db_xref="GeneID:886129" CDS complement(4268925..4269833) /locus_tag="Rv3806c" /function="UNKNOWN" /note="catalyzes the formation of decaprenylphosphoryl-5-phosphoribose from phosphoribose diphosphate and decaprenyl phosphate" /codon_start=1 /transl_table=11 /product="phosphoribose diphosphate:decaprenyl-phosphate phosphoribosyltransferase" /protein_id="NP_218323.1" /db_xref="GI:15610942" /db_xref="GeneID:886129" /translation="MSEDVVTQPPANLVAGVVKAIRPRQWVKNVLVLAAPLAALGGGV RYDYVEVLSKVSMAFVVFSLAASAVYLVNDVRDVEADREHPTKRFRPIAAGVVPEWLA YTVAVVLGVTSLAGAWMLTPNLALVMVVYLAMQLAYCFGLKHQAVVEICVVSSAYLIR AIAGGVATKIPLSKWFLLIMAFGSLFMVAGKRYAELHLAERTGAAIRKSLESYTSTYL RFVWTLSATAVVLCYGLWAFERDGYSGSWFAVSMIPFTIAILRYAVDVDGGLAGEPED IALRDRVLQLLALAWIATVGAAVAFG" misc_feature complement(4269084..4269131) /locus_tag="Rv3806c" /note="PS00225 Crystallins beta and gamma 'Greek key' motif signature" gene complement(4269840..4270337) /locus_tag="Rv3807c" /db_xref="GeneID:886134" CDS complement(4269840..4270337) /locus_tag="Rv3807c" /function="UNKNOWN" /note="Rv3807c, (MTV026.12), len: 165 aa. Possible conserved transmembrane protein, equivalent to Q9CDB6|ML0094 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (192 aa), FASTA scores: opt: 714, E(): 2.4e-38, (72.85% identity in 151 aa overlap). Also highly similar to Q9KZA3|SC5G8.11 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (169 aa), FASTA scores: opt: 324, E(): 1.1e-13, (41.5% identity in 159 aa overlap); and similar in part to others e.g. Q9K3L3|SCG20A.27 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (230 aa), FASTA scores: opt: 277, E(): 1.3e-10, (41.65% identity in 168 aa overlap); P72269|ORF8 HYPOTHETICAL PROTEIN from Rhodococcus erythropolis (487 aa) FASTA scores: opt: 229, E(): 2.7e-07, (36.25% identity in 149 aa overlap); O86625|SC3A7.24c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (201 aa) FASTA scores: opt: 200, E(): 9.1e-06, (34.95% identity in 146 aa overlap); Q9KYD7|SCD72A.19 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (238 aa) FASTA scores: opt: 178, E(): 0.00026, (35.7% identity in 112 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218324.1" /db_xref="GI:15610943" /db_xref="GeneID:886134" /translation="MVAVQSALVDRPGMLATARGLSHFGEHCIGWLILALLGAIALPR RRREWLVAGAGAFVAHAIAVLIKRLVRRQRPDHPAIAVNVDTPSQLSFPSAHATSTTA AALLMGRATGLPLPVVLVPPMALSRILLGVHYPSDVAVGVALGATVGAIVDSVGGGRQ RARKR" gene complement(4270366..4272279) /gene="glfT" /locus_tag="Rv3808c" /db_xref="GeneID:886136" CDS complement(4270366..4272279) /gene="glfT" /locus_tag="Rv3808c" /EC_number="2.-.-.-" /function="CONVERTS UDP-GALACTOFURANOSE IN CELL WALL GALACTAN POLYMERIZATION. HAS UDP-Galf:beta-D-(1->5) AND UDP-Galf:beta-D-(1->6) GALACTOFURANOSYLTRANSFERASE ACTIVITIES." /experiment="experimental evidence, no additional details recorded" /note="Rv3808c, (MTV026.13c), len: 637 aa. glfT, bifunctional UDP-galactofuranosyl transferase (EC 2.-.-.-) (see citations below). Equivalent to Q9CDB7|ML0093 HYPOTHETICAL PROTEIN from Mycobacterium leprae (643 aa), FASTA scores: opt: 3751, E(): 0, (85.4% identity in 643 aa overlap). Contains a beta-glycosyltransferase domain A." /codon_start=1 /transl_table=11 /product="bifunctional UDP-galactofuranosyl transferase GLFT" /protein_id="NP_218325.1" /db_xref="GI:15610944" /db_xref="GeneID:886136" /translation="MSELAASLLSRVILPRPGEPLDVRKLYLEESTTNARRAHAPTRT SLQIGAESEVSFATYFNAFPASYWRRWTTCKSVVLRVQVTGAGRVDVYRTKATGARIF VEGHDFTGTEDQPAAVETEVVLQPFEDGGWVWFDITTDTAVTLHSGGWYATSPAPGTA NIAVGIPTFNRPADCVNALRELTADPLVDQVIGAVIVPDQGERKVRDHPDFPAAAARL GSRLSIHDQPNLGGSGGYSRVMYEALKNTDCQQILFMDDDIRLEPDSILRVLAMHRFA KAPMLVGGQMLNLQEPSHLHIMGEVVDRSIFMWTAAPHAEYDHDFAEYPLNDNNSRSK LLHRRIDVDYNGWWTCMIPRQVAEELGQPLPLFIKWDDADYGLRAAEHGYPTVTLPGA AIWHMAWSDKDDAIDWQAYFHLRNRLVVAAMHWDGPKAQVIGLVRSHLKATLKHLACL EYSTVAIQNKAIDDFLAGPEHIFSILESALPQVHRIRKSYPDAVVLPAASELPPPLHK NKAMKPPVNPLVIGYRLARGIMHNLTAANPQHHRRPEFNVPTQDARWFLLCTVDGATV TTADGCGVVYRQRDRAKMFALLWQSLRRQRQLLKRFEEMRRIYRDALPTLSSKQKWET ALLPAANQEPEHG" gene complement(4272276..4273475) /gene="glf" /locus_tag="Rv3809c" /db_xref="GeneID:886142" CDS complement(4272276..4273475) /gene="glf" /locus_tag="Rv3809c" /EC_number="5.4.99.9" /function="INVOLVED IN LIPOPOLYSACCHARIDE BIOSYNTHESIS, IN THE CONVERSION OF UDP-GALACTOPYRANOSE INTO UDP-GALACTOFURANOSE THROUGH A 2-KETO INTERMEDIATE [CATALYTIC ACTIVITY: UDP-D-GALACTOPYRANOSE = UDP-D-GALACTO-1,4-FURANOSE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3809c, (MTV026.14), len: 399 aa. glf (alternate gene name: ceoA), UDP-galactopyranose mutase (EC 5.4.99.9) (see citations below), identical to previously sequenced gene, and equivalent to Q9CDB8|GLF|ML0092 PUTATIVE UDP-GALACTOPYRANOSE MUTASE from Mycobacterium leprae (413 aa), FASTA scores: opt: 2347, E(): 1.3e-140, (86.6% identity in 396 aa overlap). Also highly similar to others e.g. AAK61905|EPSJ UDP-GALACTOPYRANOSE MUTASE (PROTEIN INVOLVED IN EXOPOLYSACCHARIDES BIOSYNTHESIS) from Streptococcus thermophilus (365 aa), FASTA scores: opt: 972, E(): 5.9e-54, (45.85% identity in 375 aa overlap); P37747|GLF_ECOLI|B2036 UDP-GALACTOPYRANOSE MUTASE from Escherichia coli strain K12 (367 aa), FASTA scores: opt: 958, E(): 4.5e-53, (43.55% identity in 379 aa overlap); O86897|CAP33FN from Streptococcus pneumoniae (369 aa) FASTA scores: opt: 954, E(): 8.1e-53, (44.8% identity in 375 aa overlap); etc. COFACTOR: FAD (BY SIMILARITY). N-TERMINAL SHOWS SIMILARITY TO FAD OR NAD CONTAINING PROTEINS.; ceoA" /codon_start=1 /transl_table=11 /product="UDP-galactopyranose mutase Glf" /protein_id="NP_218326.1" /db_xref="GI:15610945" /db_xref="GeneID:886142" /translation="MQPMTARFDLFVVGSGFFGLTIAERVATQLDKRVLVLERRPHIG GNAYSEAEPQTGIEVHKYGAHLFHTSNKRVWDYVRQFTDFTDYRHRVFAMHNGQAYQF PMGLGLVSQFFGKYFTPEQARQLIAEQAAEIDTADAQNLEEKAISLIGRPLYEAFVKG YTAKQWQTDPKELPAANITRLPVRYTFDNRYFSDTYEGLPTDGYTAWLQNMAADHRIE VRLNTDWFDVRGQLRPGSPAAPVVYTGPLDRYFDYAEGRLGWRTLDFEVEVLPIGDFQ GTAVMNYNDLDVPYTRIHEFRHFHPERDYPTDKTVIMREYSRFAEDDDEPYYPINTEA DRALLATYRARAKSETASSKVLFGGRLGTYQYLDMHMAIASALNMYDNVLAPHLRDGV PLLQDGA" gene 4273739..4274593 /gene="pirG" /locus_tag="Rv3810" /db_xref="GeneID:886139" CDS 4273739..4274593 /gene="pirG" /locus_tag="Rv3810" /function="SURFACE-EXPOSED PROTEIN REQUIRED FOR MULTIPLICATION AND INTRACELLULAR GROWTH. SEEMS TO PLAY A ROLE IN VIRULENCE." /experiment="experimental evidence, no additional details recorded" /note="Rv3810, (MTV026.15), len: 284 aa. pirG (alternate gene names: P36 or erp for Exported Repeated Protein), cell surface protein precursor (see citations below), equivalent to P19361|28KD_MYCLE|ML0091 28 KDA ANTIGEN PRECURSOR from Mycobacterium leprae (236 aa), FASTA scores: opt: 555, E(): 9.8e-18, (52.65% identity in 281 aa overlap).; erp; P36" /codon_start=1 /transl_table=11 /product="exported repetitive protein precursor PirG (cell surface protein) (EXP53)" /protein_id="NP_218327.1" /db_xref="GI:15610946" /db_xref="GeneID:886139" /translation="MPNRRRRKLSTAMSAVAALAVASPCAYFLVYESTETTERPEHHE FKQAAVLTDLPGELMSALSQGLSQFGINIPPVPSLTGSGDASTGLTGPGLTSPGLTSP GLTSPGLTDPALTSPGLTPTLPGSLAAPGTTLAPTPGVGANPALTNPALTSPTGATPG LTSPTGLDPALGGANEIPITTPVGLDPGADGTYPILGDPTLGTIPSSPATTSTGGGGL VNDVMQVANELGASQAIDLLKGVLMPSIMQAVQNGGAAAPAASPPVPPIPAAAAVPPT DPITVPVA" gene 4274798..4276417 /locus_tag="Rv3811" /db_xref="GeneID:886137" CDS 4274798..4276417 /locus_tag="Rv3811" /function="UNKNOWN" /note="Rv3811, (MTV026.16), len: 539 aa. Conserved hypothetical protein, showing some similarity to Q9KZK5|SCE34.21c PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (416 aa), FASTA scores: opt: 603, E(): 8.1e-26, (34.4% identity in 404 aa overlap); Q9S2P9|SC5F7.14c HYPOTHETICAL 31.9 KDA PROTEIN from Streptomyces coelicolor (308 aa), FASTA scores: opt: 472, E(): 9.5e-19, (37.5% identity in 208 aa overlap). Middle section (approximatively aa 185-350/390) shows some similarity with Q9GK12 PEPTIDOGLYCAN RECOGNITION PROTEIN PRECURSOR from Camelus dromedarius (Dromedary) (Arabian camel) (193 aa) FASTA scores: opt: 274, E(): 4.6e-08, (32.2% identity in 177 aa overlap); O75594|PGLYRP|PGRP from Homo sapiens (Human) (196 aa), FASTA scores: opt: 272, E(): 6e-08, (30.9% identity in 220 aa overlap); Q9JLN4|PGRP PEPTIDOGLYCAN RECOGNITION PROTEIN from Rattus norvegicus (Rat) (182 aa), FASTA scores: opt: 253, E(): 6.2e-07, (32.15% identity in 171 aa overlap); etc. C-terminal end shows similarity with Q01377|CSP1_CORGL PS1 PROTEIN PRECURSOR (ONE OF THE TWO MAJOR SECRETED PROTEINS) from Corynebacterium glutamicum (Brevibacterium flavum) (657 aa), FASTA scores: opt: 250, E(): 2.7e-06, (39.45% identity in 109 aa overlap). Contains PS00687 Aldehydedehydrogenases glutamic acid active site. Note that previously known as csp.; csp" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_178018.1" /db_xref="GI:57117160" /db_xref="GeneID:886137" /translation="MAATVVIVAWIANRPPASSHEPSPTPNTQLAEQPLIGLGGGVTV RELTQDTPFSLVALTGDLAGTSARVRAKRPDGDWGPWYQTEYETEPRDPAGTDGSVEL GGLNPGPRSTDPVFVGTTTTVQVAVTRPIDAPITQPPAGRPPNDLLDSGLGYRPATKE QPFGQNISAILISPPQAPPGTQWTPPTAVTMAGQPPAIISRAEWGADESLRCETPEYD RGVRAAVVHHTAGSNDYSPLESAGIVKAIYTYHSKTLGWCDIAYNALVDKYGQVFEGS AGGLTKPVEGFHTGGFNRNTWGVAMIGNFDDVAPTPIQIRTVGRLLGWRLGMDDVDPR SMVDLQSAGSSYTTFPGGAIARLPAIFTHRDVGNTDCPGNAAYAVMDEIRDIAAHFND PPEELIKALEGGAIYQRWQALGGMNSALGAPTSPEADAADGARYATFAKGAMYWSPVT DAQPITGAIYEAWASQSYERGPLGLPTSAEIQEPLQITQNFQHGTLNFERLTGNVTEV VDGITTPLATRPPSGPTVPPEHFTLPTHPIT" misc_feature 4275095..4275118 /locus_tag="Rv3811" /note="PS00687 Aldehyde dehydrogenases glutamic acid active site" gene 4276571..4278085 /gene="PE_PGRS62" /locus_tag="Rv3812" /db_xref="GeneID:886143" CDS 4276571..4278085 /gene="PE_PGRS62" /locus_tag="Rv3812" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN VIRULENCE." /note="Rv3812, (MTV026.17, MTCY409.18c), len: 540 aa. Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citations below), similar to many e.g. P96828|Rv0151c|MTCI5.25c (588 aa), FASTA scores: opt: 389, E(): 6.2e-14, (29.2% identity in 473 aa overlap); MTCY7H7B_27; MTCY493_24; MTCY441_4; MTCY39_36; MTCY1A11_4; MTCY359_33; MTCY130_10; MTCY98_9; etc. The transcription of this CDS seems to be activated in macrophages (see Ramakrishnan et al., 2000)." /codon_start=1 /transl_table=11 /product="PE-PGRS family protein" /protein_id="YP_178019.1" /db_xref="GI:57117161" /db_xref="GeneID:886143" /translation="MSFVVTVPEAVAAAAGDLAAIGSTLREATAAAAGPTTGLAAAAA DDVSIAVSQLFGRYGQEFQTVSNQLAAFHTEFVRTLNRGAAAYLNTESANGGQLFGQI EAGQRAVSAAAAAAPGGAYGQLVANTATNLESLYGAWSANPFPFLRQIIANQQVYWQQ IAAALANAVQNFPALVANLPAAIDAAVQQFLAFNAAYYIQQIISSQIGFAQLFATTVG QGVTSVIAGWPNLAAELQLAFQQLLVGDYNAAVANLGKAMTNLLVTGFDTSDVTIGTM GTTISVTAKPKLLGPLGDLFTIMTIPAQEAQYFTNLMPPSILRDMSQNFTNVLTTLSN PNIQAVASFDIATTAGTLSTFFGVPLVLTYATLGAPFASLNAIATSAETIEQALLAGN YLGAVGALIDAPAHALDGFLNSATVLDTPILVPTGLPSPLPPTVGITLHLPFDGILVP PHPVTATISFPGAPVPIPGFPTTVTVFGTPFMGMAPLLINYIPQQLALAIKPAA" gene complement(4278394..4279215) /locus_tag="Rv3813c" /db_xref="GeneID:886147" CDS complement(4278394..4279215) /locus_tag="Rv3813c" /function="UNKNOWN" /note="Rv3813c, (MTCY409.17), len: 273 aa. Conserved hypothetical protein, equivalent to Q9CDB9|ML0089 HYPOTHETICAL PROTEIN from Mycobacterium leprae (281 aa) FASTA scores: opt: 1479, E(): 9.6e-81, (80.45% identity in 271 aa overlap); and similar to Q98LI0|MLL1014 from (280 aa) . Also similar to many hypothetical proteins from several organisms e.g. Q9ZBX2|SCD78.27c from Streptomyces coelicolor (280 aa), FASTA scores: opt: 597, E(): 2.2e-28, (43.25% identity in 266 aa overlap); Q9RXR7|DR0240 from Deinococcus radiodurans (284 aa), FASTA scores: opt: 543, E(): 3.5e-25, (38.65% identity in 264 aa overlap); Q99YH5|SPY1700 from Streptococcus pyogenes (274 aa) FASTA scores: opt: 373, E(): 4.3e-15, (30.75% identity in 270 aa overlap); P70947|YITU from Bacillus subtilis (270 aa) FASTA scores: opt: 353, E(): 6.5e-14, (30.0% identity in 280 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218330.1" /db_xref="GI:15610949" /db_xref="GeneID:886147" /translation="MKPTVPALVACDVDGTLLDDGETVTKRTRDAVHAAVDAGTHFIL ATGRPPRWVRPIVDALGFAPMAVCANGAVIYDPGTDRVMSVRTLPVDALATLAEVATR VIPGAGLAVERIGERAHDTATPQFVSSPGYEHAWLNPDNTEVSIDHLLSAPAIKLLIR KAGAASADMAAELAKHVGFEGDITYSTNNGLVEIVPLGISKATGVDEIARPLGISDAE VVAFGDMPNDVPMLLRAGLGVAMGNAHPDALAVADEVTAPNSEDGVARVLERWWS" gene complement(4279230..4280015) /locus_tag="Rv3814c" /db_xref="GeneID:886150" CDS complement(4279230..4280015) /locus_tag="Rv3814c" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3814c, (MTCY409.16), len: 261 aa. Possible acyltransferase (EC 2.3.1.-), highly similar to Q9CDC0|ML0087 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (257 aa), FASTA scores: opt: 753, E(): 7.7e-42, (46.75% identity in 246 aa overlap). Also highly similar to many acyltransferases and hypothetical proteins e.g. Q9K3R3|2SCG4.01 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (242 aa), FASTA scores: opt: 587, E(): 4.6e-31, (41.95% identity in 243 aa overlap); Q9ZBS1|SC7A1.02 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (264 aa), FASTA scores: opt: 293, E(): 6.6e-12, (29.2% identity in 267 aa overlap); Q9PNZ5|AAS|CJ0938 PUTATIVE 2-ACYLGLYCEROPHOSPHOETHANOLAMINE ACYLTRANSFERASE / ACYL-ACYL CARRIER PROTEIN SYNTHETASE from Campylobacter jejuni (1170 aa), FASTA scores: opt: 274, E(): 3.9e-10, (29.1% identity in 219 aa overlap) (similarity only with middle section); Q9EY25 PUTATIVE ACETYL TRANSFERASE from Xanthomonas oryzae pv. oryzae (249 aa), FASTA scores: opt: 238, E(): 2.4e-08, (29.2% identity in 209 aa overlap); etc. Also highly similar to downstream ORFs O07808|Rv3815c|MTCY409.15 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (251 aa), FASTA scores: opt: 1069, E(): 2.1e-62, (60.4% identity in 245 aa overlap); and O07807|Rv3816c|MTCY409.14 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (259 aa), FASTA scores: opt: 776, E(): 2.5e-43, (50.9% identity in 228 aa overlap). And similar to O53516|Rv2182c|MTV021.15c HYPOTHETICAL 27.0 KDA PROTEIN from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 239, E(): 2e-08, (30.6% identity in 232 aa overlap)." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="NP_218331.1" /db_xref="GI:15610950" /db_xref="GeneID:886150" /translation="MAEPFFRMMEILVPSIVAANGNKITFEGLENIPERGGALIALNH TSYVDWVPASIAAHHRRRRLRFMIKAEMQDVRAVNYVIKHAQLIPVDRSVGADAYAVA VQRLRAGELVGLHPEATISRSLELREFKTGAARMALEAQVPIIPMIVWGAHRIWPKDH PKNLFRNKIPIVAAIGSPVRPEGNAEQLNAVLRQAMNAILYRVQEEYPHPKGEHWVPR RLGGGAPTVEESRQLRIAELAKRRQKRGYDGVTSSRRSQVGPH" gene complement(4280033..4280788) /locus_tag="Rv3815c" /db_xref="GeneID:886149" CDS complement(4280033..4280788) /locus_tag="Rv3815c" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3815c, (MTCY409.15), len: 251. Possible acyltransferase (EC 2.3.1.-), highly similar to Q9CDC0|ML0087 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (257 aa), FASTA scores: opt: 845, E(): 2.7e-47, (53.25% identity in 246 aa overlap). Also highly similar to Q9K3R3|2SCG4.01 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (242 aa), FASTA scores: opt: 656, E(): 3.7e-35, (47.85% identity in 234 aa overlap); and similar to many putative acyltransferases and hypothetical proteins e.g. P74498|SLL1848 HYPOTHETICAL 24.3 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (225 aa) FASTA scores: opt: 275, E(): 1.2e-10, (34.8% identity in 181 aa overlap); Q9ZBS1|SC7A1.02 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (264 aa), FASTA scores: opt: 266, E(): 5.2e-10, (29.7% identity in 229 aa overlap); Q9PNZ5|AAS|CJ0938 PUTATIVE 2-ACYLGLYCEROPHOSPHOETHANOLAMINE ACYLTRANSFERASE/ ACYL-ACYL CARRIER PROTEIN SYNTHETASE from Campylobacter jejuni (1170 aa), FASTA scores: opt: 264, E(): 2.3e-09, (23.55% identity in 221 aa overlap) (similarity only with middle section); etc. Also highly similar to upstream ORF O07809|Rv3814c|MTCY409.16 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (261 aa), FASTA scores: opt: 1069, E(): 1e-61, (60.4% identity in 245 aa overlap) ; and downstream ORF O07807|Rv3816c|MTCY409.14 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (259 aa) FASTA scores: opt: 847, E(): 2e-47, (55.7% identity in 246 aa overlap). And similar to O53516|Rv2182c|MTV021.15c HYPOTHETICAL 27.0 KDA PROTEIN from Mycobacterium tuberculosis (247 aa), FASTA scores: opt: 237, E(): 3.6e-08, (30.9% identity in 233 aa overlap)." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="NP_218332.1" /db_xref="GI:15610951" /db_xref="GeneID:886149" /translation="MAEPTYRVLEILAQLLVLATGTRITYVGEENVPDQGGAVVAINH TSYVDWLPAALAMHRRRRRMRFMIKAEMQRVRLVNFLIRHTRTIPVDRGAGGSAYAVA VQRLREGELVGVYPEATISRSFELKGFKTGAARMAAEADVPIVPVVVWGAQRIWTKDH PRQIGRAKVPVTVQVGRPLRAAAGIEQTNAALRESMTALLWQAQERYPHPAGAYWVPR RLGGGAPTLAEAARMEADEAAARAASRTPHESR" gene complement(4280792..4281571) /locus_tag="Rv3816c" /db_xref="GeneID:886152" CDS complement(4280792..4281571) /locus_tag="Rv3816c" /EC_number="2.3.1.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3816c, (MTCY409.14), len: 259 aa. Possible acyltransferase (EC 2.3.1.-), equivalent to Q9CDC0|ML0087 PUTATIVE ACYLTRANSFERASE from Mycobacterium leprae (257 aa) FASTA scores: opt: 1401, E(): 1.5e-80, (81.9% identity in 254 aa overlap). Also highly similar to many putative acyltransferases and hypothetical proteins e.g. Q9K3R3|2SCG4.01 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (242 aa), FASTA scores: opt: 758, E(): 2.4e-40, (51.7% identity in 234 aa overlap); Q9ZBS1|SC7A1.02 PUTATIVE ACYLTRANSFERASE from Streptomyces coelicolor (264 aa), FASTA scores: opt: 312, E(): 2e-12, (29.55% identity in 237 aa overlap); O67841|AAS|AQ_2058 2-ACYLGLYCEROPHOSPHOETHANOLAMINE ACYLTRANSFERASE from Aquifex aeolicus (211 aa), FASTA scores: opt: 281, E(): 1.5e-10, (32.7% identity in 162 aa overlap); etc. Also highly similar to upstream ORFs O07808|Rv3815c|MTCY409.15 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (251 aa), FASTA scores: opt: 847, E(): 6.7e-46, (55.7% identity in 246 aa overlap); and O07809|Rv3814c|MTCY409.16 PUTATIVE ACYLTRANSFERASE from Mycobacterium tuberculosis (261 aa), FASTA scores: opt: 776, E(): 1.9e-41, (50.9% identity in 228 aa overlap)." /codon_start=1 /transl_table=11 /product="acyltransferase" /protein_id="NP_218333.1" /db_xref="GI:15610952" /db_xref="GeneID:886152" /translation="MEPVYGTVIRLARLSWRIQGLKITVTGVDNLPTSGGAVVAINHT SYLDFTFAGLPAYQQGLGRKVRFMAKQEVFDHKITGPIMRSLRHIPVDRQDGSASYDA AVRMLKAGELVGVYPEATISRSFEIKEFKTGAARMAIEAGVPIVPHIVWGAQRIWTKD RPKKLFRPKVPVTIVVGERIEPTLPTAELNGLLHSRMQHLLERAQELYGPHPAGEFWV PHRLGGGAPSLAEAARLDAQEAAVRAARRAQRAHPAGAPEQ" gene 4281647..4282402 /locus_tag="Rv3817" /db_xref="GeneID:886144" CDS 4281647..4282402 /locus_tag="Rv3817" /EC_number="2.7.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM [CATALYTIC ACTIVITY: ATP + SUBSTRATE = ADP + SUBSTRATE 3'-PHOSPHATE]." /note="Rv3817, (MTCY409.13c), len: 251 aa. Possible phosphotransferase (EC 2.7.-.-), similar to many phosphotransferases e.g. O53023 KANAMYCIN MARKER from Escherichia coli (264 aa), FASTA scores: opt: 232, E(): 7.5e-08, (32.4% identity in 247 aa overlap); BAA78209|NEO NEOMYCINE PHOSPHOTRANSFERASE from Drosophila melanogaster (Fruit fly) (264 aa), FASTA scores: opt: 227, E(): 1.6e-07, (32.0% identity in 247 aa overlap); AAG09774 AMINOGLYCOSIDE 3'-PHOSPHOTRANSFERASE from Vibrio cholerae (264 aa), FASTA scores: opt: 227, E(): 1.6e-07, (32.0% identity in 247 aa overlap); P00552|KKA2_KLEPN|NEO|KAN AMINOGLYCOSIDE 3'-PHOSPHOTRANSFERASE from Klebsiella pneumoniae (264 aa), FASTA scores: opt: 227, E(): 1.6e-07, (32.0% identity in 247 aa overlap); etc." /codon_start=1 /transl_table=11 /product="phosphotransferase" /protein_id="NP_218334.1" /db_xref="GI:15610953" /db_xref="GeneID:886144" /translation="MSFPSSPPALPAIVARFAVGRPVRAVWVNELGGVTFRVDSGMGA GCEFIKVARRGTADFANEARRLRWAAPYLAVPRVLGVGVDGDWAWLHTDALPGLSAVH PRWRASPQVAVPALGAGLRTLHDSLPVHSCPFDWSTASRLAKLAPARRAELGDSPPVD RLVVCHGDACSPNTILDDTGRCCGHVDFGNLGVADRWADLAVATLSLQWNFPDYPGQV RDDEFFAAYGVAPDPARIDYYRRLWQAEDDSSR" gene 4282449..4283999 /locus_tag="Rv3818" /db_xref="GeneID:886153" CDS 4282449..4283999 /locus_tag="Rv3818" /function="UNKNOWN" /note="Rv3818, (MTCY409.12c), len: 516 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218335.1" /db_xref="GI:15610954" /db_xref="GeneID:886153" /translation="MQVTSVGHAGFLIQTQAGSILCDPWVNPAYFASWFPFPDNSGLD WGALGECDYLYVSHLHKDHFDAENLRAHVNKDAVVLLPDFPVPDLRNELQKLGFHRFF ETTDSVKHRLRGPNGDLDVMIIALRAPADGPIGDSALVVADGETTAFNMNDARPVDLD VLASEFGHIDVHMLQYSGAIWYPMVYDMPARAKDAFGAQKRQRQMDRARQYIAQVGAT WVVPSAGPPCFLAPELRHLNDDGSDPANIFPDQMVFLDQMRAHGQDGGLLMIPGSTAD FTGTTLNSLRHPLPAEQVEAIFTTDKAAYIADYADRMAPVLAAQKAGWAAAAGEPLLQ PLRTLFEPIMLQSNEICDGIGYPVELAIGPETIVLDFPKRAVREPIPDERFRYGFAIA PELVRTVLRDNEPDWVNTIFLSTRFRAWRVGGYNEYLYTFFKCLTDERIAYADGWFAE AHDDSSSITLNGWEIQRRCPHLKADLSKFGVVEGNTLTCNLHGWQWRLDDGRCLTARG HQLRSSRP" gene 4283996..4284331 /locus_tag="Rv3819" /db_xref="GeneID:886146" CDS 4283996..4284331 /locus_tag="Rv3819" /function="UNKNOWN" /note="Rv3819, (MTCY409.11c), len: 111 aa. Hypothetical unknown protein. Contains PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218336.1" /db_xref="GI:15610955" /db_xref="GeneID:886146" /translation="MMQFYDDGVVQLDRAALTLRRYHFPSGTAKVIPLDQIRGYQAES LGFLMARFNIWGRPDLRRWLPLDVYRPLKSTLVTLDVPGMRPKPACTPTRPKEFIALL DELLALHRT" misc_feature 4284110..4284157 /locus_tag="Rv3819" /note="PS00012 Phosphopantetheine attachment site" gene complement(4284419..4285825) /gene="papA2" /locus_tag="Rv3820c" /db_xref="GeneID:886140" CDS complement(4284419..4285825) /gene="papA2" /locus_tag="Rv3820c" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN LIPID METABOLISM." /note="Rv3820c, (MTCY409.10), len: 468 aa. Possible papA2, conserved polyketide synthase (PKS) associated protein, highly similar to Q49618|PAPA3|ML1230|B1170_C1_180 PKS-ASSOCIATED PROTEIN A3 from Mycobacterium leprae (471 aa), FASTA scores: opt: 1660, E(): 2.7e-102, (53.95% identity in 456 aa overlap). Also similar to Q9F2R3|SCD65.19c HYPOTHETICAL 52.8 KDA PROTEIN from Streptomyces coelicolor (473 aa), FASTA scores: opt: 575, E(): 1.8e-30, (27.8% identity in 464 aa overlap); and weakly similar to part of other proteins. Also high similarity with other PKS-ASSOCIATED PROTEINS from Mycobacterium tuberculosis; O50438|PAPA3|Rv1182|MTV005.18 (472 aa), FASTA scores: opt: 1694, E(): 1.5e-104, (53.8% identity in 461 aa overlap); and O07799|PAPA1|Rv3824c|MTCY409.06 (511 aa), FASTA scores: opt: 1664, E(): 1.6e-102, (53.9% identity in 462 aa overlap); and similar to C-terminal end of O53902|PAPA4|Rv1528c|MTV045.02 (165 aa), FASTA scores: opt: 186, E(): 4.1e-05, (37.9% identity in 66 aa overlap)." /codon_start=1 /transl_table=11 /product="polyketide synthase associated protein PapA2" /protein_id="YP_178020.1" /db_xref="GI:57117162" /db_xref="GeneID:886140" /translation="MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQ AQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDTYHSWFEFDN AEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFGIIQSDDHFTF YASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPAGRYDDHCVRQYADTAALT LDSARVRRWVEFAANNDGTLPHFPLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVA AGARFSGGVFACAALAERELTNCETFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVA SGLFDSAARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAP LSTVANSDLNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMK SIYIRTADGTLATLKPGT" gene 4285973..4286686 /locus_tag="Rv3821" /db_xref="GeneID:886141" CDS 4285973..4286686 /locus_tag="Rv3821" /function="UNKNOWN" /note="Rv3821, (MTCY409.09c), len: 237 aa. Probable conserved integral membrane protein, equivalent to Q49630|ML1233|B1170_F2_64 HYPOTHETICAL 24.4 KDA PROTEIN /INTEGRAL MEMBRANE PROTEIN (POTENTIAL) from Mycobacterium leprae (230 aa), FASTA scores: opt: 619, E(): 2.4e-32, (46.65% identity in 240 aa overlap). Shows some similarity to P29466|I1BC_HUMAN|CASP1|IL1BC|IL1BCE (404 aa), FASTA scores: opt: 126, E(): 0.88, (29.05% identity in 155 aa overlap). Also highly similar to P71796|Rv1517|MTCY277.39 HYPOTHETICAL 26.9 KDA PROTEIN from Mycobacterium tuberculosis (254 aa), FASTA scores: opt: 284, E(): 5.4e-11, (36.35% identity in 256 aa overlap). Start site chosen on basis of similarity to LEPB1170_F2_64 and MTCY277.39, but may extend further upstream." /codon_start=1 /transl_table=11 /product="integral membrane protein" /protein_id="NP_218338.1" /db_xref="GI:15610957" /db_xref="GeneID:886141" /translation="MWSTVLVLALSVICEPVRIGLVVLMLNRRRPLLHLLTFLCGGYT MAGGVAMVTLVVLGATPLAGHFSVAEVQIGTGLIALLIAFALTTNVIGKHVRRATHAR VGDDGGRVLRESVPPSGAHKLAVRARCFLQGDSLYVAGVSGLGAALPSANYMGAMAAI LASGATPATQALAVVTFNVVAFTVAEVPLVSYLAAPRKTRAFMAALQSWLRSRSRRDA ALLVAAGGCLMLTLGLSNL" gene 4286721..4287935 /locus_tag="Rv3822" /db_xref="GeneID:886155" CDS 4286721..4287935 /locus_tag="Rv3822" /note="Rv3822, (MTCY409.08c), len: 404 aa. Conserved hypothetical protein, similar in part to hypothetical proteins from Mycobacterium leprae: Q9CC62|ML1232 (358 aa) FASTA scores: opt: 601, E(): 1.1e-25, (36.7% identity in 335 aa overlap); and Q49633|B1170_F3_112 (391 aa) FASTA scores: opt: 601, E(): 1.2e-25, (36.25% identity in 347 aa overlap). Also similar to P71862|Rv3539|MTCY03C7.17c PPE FAMILY PROTEIN from Mycobacterium tuberculosis (479 aa), FASTA scores: opt: 547, E(): 1.3e-22, (38.1% identity in 281 aa overlap); O50440|Rv1184c|MTV005.20c (359 aa); O06828|Rv1430|MTCY493.24c (528 aa); O53642|Rv0159c|MTV032.02c (468 aa); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218339.1" /db_xref="GI:15610958" /db_xref="GeneID:886155" /translation="MKCPGVSDCVATVRHDNVFAIAAGLRWSAAVPPLHKGDAVTKLL VGAIAGGMLACAAILGDGIASADTALIVPGTAPSPYGPLRSLYHFNPAMQPQIGANYY NPTATRHVVSYPGSFWPVTGLNSPTVGSSVSAGTNNLDAAIRSTDGPIFVAGLSQGTL VLDREQARLANDPTAPPPGQLTFIKAGDPNNLLWRAFRPGTHVPIIDYTVPAPAESQY DTINIVGQYDIFSDPPNRPGNLLADLNAIAAGGYYGHSATAFSDPARVAPRDITTTTN SLGATTTTYFIRTDQLPLVRALVDMAGLPPQAAGTVDAALRPIIDRAYQPGPAPAVNP RDLVQGIRGIPAIAPAIAIPIGSTTGASAATSTAAATAAATNALRGANVGPGANKALS MVRGLLPKGKKH" gene complement(4288260..4291529) /gene="mmpL8" /locus_tag="Rv3823c" /db_xref="GeneID:886145" CDS complement(4288260..4291529) /gene="mmpL8" /locus_tag="Rv3823c" /function="UNKNOWN. THOUGHT TO BE INVOLVED IN FATTY ACID TRANSPORT." /experiment="experimental evidence, no additional details recorded" /note="Rv3823c, (MTCY409.07), len: 1089 aa. Probable mmpL8, conserved integral membrane transport protein (see Tekaia et al., 1999), member of RND superfamily, equivalent to Q49619|MMLA_MYCLE|MMPL10|TP1|ML1231|B1170_C1_181 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (1008 aa), FASTA scores: opt: 2718, E(): 7.3e-149, (56.25% identity in 1028 aa overlap). Also similar to others e.g. Q9XCF6|TMTPC from Mycobacterium avium (974 aa), FASTA scores: opt: 660, E(): 2.7e-30, (28.2% identity in 1050 aa overlap); Q9XCF5|TMTPB from Mycobacterium avium (963 aa), FASTA scores: opt: 653, E(): 6.7e-30, (27.0% identity in 1014 aa overlap); Q9KH53|TMTPC from Mycobacterium smegmatis (994 aa), FASTA scores: opt: 648, E(): 1.3e-29, (28.45% identity in 1013 aa overlap); etc. Also highly similar to other mmpL proteins from Mycobacterium tuberculosis; O50439|MMLA_MYCTU|MMPL10|RV1183|MT1220|MTV005.19 (1002 aa), FASTA scores: opt: 2777, E(): 2.9e-152, (58.25% identity in 996 aa overlap); Q50585|MMLC_MYCTU|MMPL12|Rv1522c|MT1573|MTCY19G5.06 (1146 aa), FASTA scores: opt: 2433, E(): 2.1e-132, (49.9% identity in 1050 aa overlap); and similar to others e.g. P95235|MML9_MYCTU|MMPL9|Rv2339|MT2402|MTCY98.08 (962 aa), FASTA scores: opt: 651, E(): 8.8e-30, (28.6% identity in 1038 aa overlap); etc. BELONGS TO THE MMPL FAMILY." /codon_start=1 /transl_table=11 /product="integral membrane transport protein" /protein_id="NP_218340.1" /db_xref="GI:15610959" /db_xref="GeneID:886145" /translation="MCDVLMQPVRTPRPSTNLRSKPLRPTGDGGVFPRLGRLIVRRPW VVIAFWVALAGLLAPTVPSLDAISQRHPVAILPSDAPVLVSTRQMTAAFREAGLQSVA VVVLSDAKGLGAADERSYKELVDALRRDTRDVVMLQDFVTTPPLRELMTSKDNQAWIL PVGLPGDLGSTQSKQAYARVADIVEHQVAGSTLTANLTGPAATVADLNLTGQRDRSRI EFAITILLLVILLIIYGNPITMVLPLITIGMSVVVAQRLVAIAGLAGLGIANQSIIFM SGMMVGAGTDYAVFLISRYHDYLRQGADSDQAVKKALTSIGKVIAASAATVAITFLGM VFTQLGILKTVGPMLGISVAVVFFAAVTLLPALMVLTGRRGWIAPRRDLTRRFWRSSG VHIVRRPKTHLLASALVLVILAGCAGLARYNYDDRKTLPASVESSIGYAALDKHFPSN LIIPEYLFIQSSTDLRTPKALADLEQMVQRVSQVPGVAMVRGITRPAGRSLEQARTSW QAGEVGSKLDEGSKQIAVHTGDIDKLAGGANLMASKLGDVRAQVNRAISTVGGLIDAL AYLQDLLGGNRVLGELEGAEKLIGSMRALGDTIDADASFVANNTEWASPVLGALDSSP MCTADPACASARTELQRLVTARDDGTLAKISELARQLQATRAVQTLAATVSGLRGALA TVIRAMGSLGMSSPGGVRSKINLVNKGVNDLADGSRQLAEGVQLLVDQVKKMGFGLGE ASAFLLAMKDTATTPAMAGFYIPPELLSYATGESVKAETMPSEYRDLLGGLNVDQLKK VAAAFISPDGHSIRYLIQTDLNPFSTAAMDQIDAITAAARGAQPNTALADAKVSVVGL PVVLKDTRDYSDHDLRLIIAMTVCIVLLILIVLLRAIVAPLYLIGSVIVSYLAALGIG VIVFQFLLGQEMHWSIPGLTFVILVAVGADYNMLLISRLREEAVLGVRSGVIRTVAST GGVITAAGLIMAASMYGLVFASLGSVVQGAFVLGTGLLLDTFLVRTVTVPAIAVLVGQ ANWWLPSSWRPATWWPLGRRRGRAQRTKRKPLLPKEEEEQSPPDDDDLIGLWLHDGLR L" gene complement(4291639..4293174) /gene="papA1" /locus_tag="Rv3824c" /db_xref="GeneID:886156" CDS complement(4291639..4293174) /gene="papA1" /locus_tag="Rv3824c" /function="UNKNOWN; THOUGHT TO BE INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3824c, (MTCY409.06), len: 511 aa. Possible papA1, conserved polyketide synthase (PKS) associated protein, highly similar to Q49618|PAPA3|ML1230|B1170_C1_180 PKS-ASSOCIATED PROTEIN A3 from Mycobacterium leprae (471 aa), FASTA scores: opt: 1879, E(): 7.1e-111, (55.5% identity in 465 aa overlap). Also similar to Q9F2R3|SCD65.19c HYPOTHETICAL 52.8 KDA PROTEIN from Streptomyces coelicolor (473 aa), FASTA scores: opt: 476, E(): 1.7e-22, (26.7% identity in 464 aa overlap); and similar in part to Q09164|SIMA|CYSYN CYCLOSPORIN SYNTHETASE from Tolypocladium inflatum (15281 aa) FASTA scores: opt: 238, E(): 2.8e-06, (22.35% identity in 371 aa overlap). Also highly similar to other PKS-ASSOCIATED PROTEINS from Mycobacterium tuberculosis; O50438|PAPA3|Rv1182|MTV005.18 (472 aa), FASTA scores: opt: 1862, E(): 8.4e-110, (55.95% identity in 470 aa overlap); and upstream ORF O07803|PAPA2|Rv3820c|MTCY409.10 (468 aa) FASTA scores: opt: 1664, E(): 2.5e-97, (53.9% identity in 462 aa overlap). Contains PS00453 FKBP-type peptidyl-prolyl cis-trans isomerase signature 1." /codon_start=1 /transl_table=11 /product="polyketide synthase associated protein" /protein_id="NP_218341.1" /db_xref="GI:15610960" /db_xref="GeneID:886156" /translation="MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPP SYVQARQIRSFSEQAARGLDHSRLLIASVEVFGHCDLRAMTYVINAHLRRHDTYRSWF ELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWDCFSFGVIQRA DSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPIGLSEAGSYVDFCVRQHEY TSALTVDSPEVRAWIDFAEINNGTFPEFPLPLGDPSVRCGGDLLSMMLMDEQQTQRFE SACMAANARFIGGMLACIAIAIHELTGADTYFGITPKDIRTPADLMTQGWFTGQIPVT VPVAGLSFNEIARIAQTSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVG PLSAVTKLFEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRA IRSVCMRIANGGTAERVGNVVALSPGRRNNIERMTWRSCRAGDFIDICNLKVANVTVD REA" misc_feature complement(4291888..4291935) /gene="papA1" /locus_tag="Rv3824c" /note="PS00453 FKBP-type peptidyl-prolyl cis-trans isomerase signature 1" gene complement(4293225..4299605) /gene="pks2" /locus_tag="Rv3825c" /db_xref="GeneID:886148" CDS complement(4293225..4299605) /gene="pks2" /locus_tag="Rv3825c" /function="UNKNOWN; SUPPOSED INVOLVED IN LIPID METABOLISM." /experiment="experimental evidence, no additional details recorded" /note="Rv3825c, (MTCY409.05), len: 2126 aa. Probable pks2, polyketide synthase (EC undetermined) (see citation below), equivalent to Q9CD78|MAS|ML0139 PUTATIVE MYCOCEROSIC SYNTHASE from Mycobacterium leprae (2116 aa), FASTA scores: opt: 6828, E(): 0, (63.3% identity in 2128 aa overlap); and Q49624|PKS3|MASA|ML1229|B1170_C2_209 PROBABLE MYCOCEROSIC ACID SYNTHASE from Mycobacterium leprae (2118 aa) FASTA scores: opt: 5220, E(): 0, (62.4% identity in 2130 aa overlap); or similar in part to others from Mycobacterium leprae e.g. Q9CB70|ML2354 POLYKETIDE SYNTHASE (1822 aa) FASTA scores: opt: 2787, E(): 2.1e-145, (34.7% identity in 2135 aa overlap). Also highly similar to Q02251|MCAS_MYCBO|MAS MYCOCEROSIC ACID SYNTHASE from Mycobacterium bovis (2110 aa), FASTA scores: opt: 3495, E(): 2.6e-184, (61.65% identity in 2130 aa overlap). Also highly similar to other polyketide synthases from Mycobacterium tuberculosis e.g. O53901|PKS5|Rv1527c|MTV045.01c|MTCY19G5.01 (2108 aa) FASTA scores: opt: 9576, E(): 0, (69.8% identity in 2124 aa overlap); P96291|MAS|Rv2940c|MTCY24G1.09|MTCY19H9.08c (2111 aa), FASTA scores: opt: 3518, E(): 1.4e-185, (64.05% identity in 2126 aa overlap); O50437|PKS4|Rv1181|MTV005.17 (1582 aa), FASTA scores: opt: 3461, E(): 1.6e-182, (64.55% identity in 1609 aa overlap); etc. Contains PS00606 Beta-ketoacyl synthases active site and PS00012 Phosphopantetheine attachment site." /codon_start=1 /transl_table=11 /product="polyketide synthase PKS2" /protein_id="NP_218342.1" /db_xref="GI:15610961" /db_xref="GeneID:886148" /translation="MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPE LLWKALLRGDDLITEVPPDRWDCDEFYDPQPGVPGRTVCKWGGFLDNPADFDCEFFGI GEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGDYTMVAADAKQ LEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGLTAVHMACRSLHEGESDVA LAGGVALMLEPRKAAAGSALGMLSPTGRCRAFDVAADGFVSGEGCAVVVLKRLPDALA DGDRILAVIRGTSANQDGHTVNIATPSQPAQVAAYRAALAAGGVDAATVGMVEAHGPG TPIGDPIEYASVSEVYGVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRN LHFTRLPDEIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQ TEAQPHAASTPPTGTPALFTLSASSADALRQTAQRLTDWIQQHADSLVLSDLAYTLAR RRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDRGPVWLFSGQGSQWAAM GADLLTNESVFAATVAELEPLIAAESGFSVTEAMTAPETVTGIDRVQPTIFAMQVALA ATMAAYGVRPGAVIGHSMGESAAAVVAGVLSAEDGVRVICRRSKLMATIAGSAAMASV ELPALAVQSELTALGIDDVVVAVVTAPQSTVIAGGTESVRKLVDIWERRDVLARAVAV DVASHSPQVDPILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHT VRFSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQPLPLGL RRLLTDLHNAGAAVDFSVLCPQGRLVDAPLPAWSHRFLFYDREGVDNRSPGGSTVAVH PLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIHNVAALPGAAYCEMALSAARAVL GEQSEVRDMRFEAMLLLDDQTPVSTVATVTSPGVVDFAVEALQEGVGHHLRRASAVLQ QVSGECEPPAYDMASLLEAHPCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATA TMLAEVALPGSIRSQQGLYAIHPALLDACFQSVGAHPDSQSVGSGLLVPLGVRRVRAY APVRTARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHNRVLN ERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFREHSAACTTMRWPL HDDQLAAADQLRDQVGSDEFSGVVVLTGSNTGTPHQGSADRGAEYVRRLVGIARELSD LPGAVPRMYVVTRGAQRVLADDCVNLEQGGLRGLLRTIGAEHPHLRATQIDVDEQTGV EQLARQLLATSEEDETAWRDNEWYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPG DMQTIELAAFHRVPPGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGV VTAVGPGVTDHKVGDHVGGMSPNGCWGTFVTCDARLAATLPPGLGDAQAAAVTTAHAT AWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTPQRRELLRNMG IEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAGLKLLAFRGRFVEIGKRDI YGDTKLGLFPFRRNLSFYAVDLGLLSATHPEELRDLLGTVYRLTAAGELPMPQSTHYP LVEAATAIRVMGNAEHTGKLVLHIPQTGKSLVTLPPEQAQVFRPDGSYIITGGLGGLG LFLAEKMAAAGCGRIVLNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAERLV ATAVATGLPVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDW FCLFSSAAALTGSPGQSAYSAANSWLDAFAHWRQAQGLPATAIAWGAWSDIGQLGWWS ASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPVIGAPWLVAFAERSRFF EVFSSSNGSGTSKFRVELNELPRDEWPARLRQLVAEQVSLILRRTVDPDRPLPEYGLD SLGALELRTRIETETGIRLAPKNVSATVRGLADHLYEQLAPDDAPAAALSSQ" misc_feature complement(4293351..4293398) /gene="pks2" /locus_tag="Rv3825c" /note="PS00012 Phosphopantetheine attachment site" misc_feature complement(4298997..4299047) /gene="pks2" /locus_tag="Rv3825c" /note="PS00606 Beta-ketoacyl synthases active site" gene 4299812..4301566 /gene="fadD23" /locus_tag="Rv3826" /db_xref="GeneID:886154" CDS 4299812..4301566 /gene="fadD23" /locus_tag="Rv3826" /EC_number="2.3.1.86" /function="UNKNOWN, BUT INVOLVED IN LIPID DEGRADATION." /note="activates fatty acids by binding to coenzyme A" /codon_start=1 /transl_table=11 /product="acyl-CoA synthetase" /protein_id="NP_218343.1" /db_xref="GI:15610962" /db_xref="GeneID:886154" /translation="MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQV YRRTLNVAAEVRRHAAIGDRAVILAPQGLDYIVAFLGALQAGLIAVPLSAPLGGASDE RVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLDSPIRSNIVDD SLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYFADTGAVPPLDLFIMSWLP FYHDMGLVLGVCAPIIVGCGAVLTSPVAFLQRPARWLQLMAREGQAFSAAPNFAFELT AAKAIDDDLAGLDLGRIKTILCGSERVHPATLKRFVDRFSRFNLREFAIRPAYGLAEA TVYVATSQAGQPPEIRYFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTN TECPPGTIGEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFV SEDKFFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAIAVPSNGVEKLVAIVEL NNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIPITTSGKVRRAECVKLY RHNEFTRLDAKPLQASDL" repeat_region complement(4301543..4303415) /note="IS1537, len: 1873 bp. Insertion sequence IS1537." /mobile_element="insertion sequence:IS1537" gene complement(4301563..4302789) /locus_tag="Rv3827c" /db_xref="GeneID:886151" CDS complement(4301563..4302789) /locus_tag="Rv3827c" /function="REQUIRED FOR THE TRANSPOSITION OF THE INSERTION SEQUENCE IS1537." /note="Rv3827c, (MTCY409.03), len: 408 aa. Possible transposase within IS1537 element, similar to several transposases e.g. O83029|TNPC|DR2324|DR0666|DR0978|DR1381|DR1651|DR1933 TRANSPOSASE from Deinococcus radiodurans(408 aa) FASTA scores: opt: 302, E(): 3.9e-12, (30.75% identity in 358 aa overlap); Q9RXX7|DR0178 PUTATIVE TRANSPOSASE from Deinococcus radiodurans (409 aa), FASTA scores: opt: 297, E(): 8.2e-12, (31.1% identity in 360 aa overlap); P73816|SLR2062 TRANSPOSASE from Synechocystis sp. strain PCC 6803 (400 aa), FASTA scores: opt: 296, E(): 9.3e-12, (30.05% identity in 353 aa overlap); etc. Highly similar to proteins from Mycobacterium tuberculosis e.g. O33333|Rv2791c|MTV002.56c TRANSPOSASE (459 aa) FASTA scores: opt: 2211, E(): 9.4e-136, (87.75% identity in 367 aa overlap); P95117|Rv2978c|MTCY349.09 HYPOTHETICAL 51.4 KDA PROTEIN (459 aa), FASTA scores: opt: 2165, E(): 9e-133, (85.85% identity in 367 aa overlap); Q10809|YS85_MYCTU|Rv2885c|MT2953|MTCY274.16c HYPOTHETICAL 51.3 KDA PROTEIN (460 aa), FASTA scores: opt: 2127, E(): 2.6e-130, (83.95% identity in 368 aa overlap); O0777|Rv0606|MTCY19H5.16c PROBABLE TRANSPOSASE (FRAGMENT) (247 aa), FASTA scores: opt: 1405, E(): 9.3e-84, (85.3% identity in 238 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218344.1" /db_xref="GI:15610963" /db_xref="GeneID:886151" /translation="MMARFEVPEGWCVQAFRFTLDPTEDQARALARHFGARRKAYNWA VATLKADIEAWRVTGIGTVKPSLRVLRKRWNTVKDEVCVNAETGAVWWPECSKEAYAD GIGGAVDAYWNWQNSRSGKREGKTMGFPRFKKKGRDQDRVTFTTGAMRVEPDRRHLTL PVVGTVRTHENTRRIERLIATGRARVLAISVRRNGTRLDASVRVLVQRPQQPNVAQPG SRVGVDVGVRRLATVANEAGAVLEEVPNPRPLDTALKELRYASRARSRCTKGSRRYRE RTTEISRLHRRVNDVRTHHLHVLTTRLAQTHGHIVVEGLDAAGMLRQKGLPGARARRR GLSDSALGTPRRHLSYKTGWYGSALVVADRWFPSLSVEPTVRPGLARLVAVKRGREAA AWLPNNPETGCKSRDH" gene complement(4302786..4303397) /locus_tag="Rv3828c" /db_xref="GeneID:886160" CDS complement(4302786..4303397) /locus_tag="Rv3828c" /function="PREVENTS THE COINTEGRATION OF FOREIGN DNA BEFORE INTEGRATION INTO THE CHROMOSOME." /note="Rv3828c, (MTCY409.02), len 203 aa. Possible resolvase within IS1537 element, similar to others e.g. Q97X40|SSO1915 FIRST ORF IN TRANSPOSON ISC1913 from Sulfolobus solfataricus (213 aa), FASTA scores: opt: 275, E(): 1.6e-11, (30.6% identity in 196 aa overlap); Q9V1M0|PAB2076 RESOLVASE RELATED PROTEIN from Pyrococcus abyssi (212 aa), FASTA scores: opt: 254, E(): 4.2e-10, (29.95% identity in 197 aa overlap); Q9RMU7|ORFA PUTATIVE TRANSPOSASE (BELONGS TO THE MERR FAMILY OF TRANSCRIPTIONAL REGULATORS) from elicobacter pylori (Campylobacter pylori) (217 aa), FASTA scores: opt: 243, E(): 2.3e-09, (31.8% identity in 154 aa overlap); etc. Also highly similar to proteins from Mycobacterium tuberculosis e.g. O33334|Rv2792c|MTV002.57c RESOLVASE (193 aa), FASTA scores: opt: 970, E(): 1.5e-58, (79.25% identity in 193 aa overlap); O07773|Rv0605|MTCY19H5.17c PUTATIVE RESOLVASE (202 aa), FASTA scores: opt: 964, E(): 4e-58, (76.25% identity in 202 aa overlap); P95116|Rv2979c|MTCY349.08 HYPOTHETICAL 21.4 KDA PROTEIN (194 aa), FASTA scores: opt: 895, E(): 1.8e-53, (74.75% identity in 194 aa overlap); Q10831|YS86_MYCTU|Rv2886c|MT2954|MTCY274.17c HYPOTHETICAL 31.9 KDA PROTEIN (295 aa), FASTA scores: opt: 826, E(): 1.1e-48, (66.2% identity in 204 aa overlap) (similarity only at C-terminus); etc. Contains PS00397 Site-specific recombinases active site. Possible helix-turn-helix motif from aa 11-32, Score 1305 (+3.63 SD)." /codon_start=1 /transl_table=11 /product="resolvase" /protein_id="NP_218345.1" /db_xref="GI:15610964" /db_xref="GeneID:886160" /translation="MSVVCCRNRWMNLAVWAERNGVAWVIAYRWFRAGLLPVPAQRVG RLILVNDPAVEESGRGRTLVYARVSSADQRSDLDRRVARVTAWATSQHLSVDKVVAEG GWALNGHRRKFFALLGDPVVTRIVVEHRDRFCWFGSEYVEAALVAQGRELVVVDLAEV DDDLVGDMTEILTSMCARLYGERAAQNGAKRALAAAVGDAEAA" misc_feature complement(4303179..4303205) /locus_tag="Rv3828c" /note="PS00397 Site-specific recombinases active site" gene complement(4303398..4305008) /locus_tag="Rv3829c" /db_xref="GeneID:886157" CDS complement(4303398..4305008) /locus_tag="Rv3829c" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3829c, (MTCY409.01, MTCY01A6.40), len 536 aa. Probable oxidoreductase dehydrogenase (EC 1.-.-.-), similar to others e.g. Q9A3T1|CC3121 PHYTOENE DEHYDROGENASE-RELATED PROTEIN from Caulobacter crescentus (543 aa), FASTA scores: opt: 607, E(): 9.2e-28, (28.25% identity in 552 aa overlap); Q98FP6|MLR3676 PHYTOENE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (521 aa), FASTA scores: opt: 605, E(): 1.2e-27, (28.2% identity in 546 aa overlap); Q97W24|SSO2422 PHYTOENE DEHYDROGENASE RELATED PROTEIN from Sulfolobus solfataricus (518 aa), FASTA scores: opt: 388, E(): 4.4e-15, (27.35% identity in 530 aa overlap); Q98BS8|MLL5443 PROBABLE DEHYDROGENASE from Rhizobium loti (Mesorhizobium loti) (524 aa), FASTA scores: opt: 374, E(): 2.9e-14, (24.35% identity in aa overlap); etc. Also similar to MTCY493.22c|Rv1432|MTCY493.22c HYPOTHETICAL 50.5 KDA PROTEIN (probable dehydrogenase) from Mycobacterium tuberculosis (25.1% identity in 295 aa overlap)." /codon_start=1 /transl_table=11 /product="dehydrogenase" /protein_id="NP_218346.1" /db_xref="GI:15610965" /db_xref="GeneID:886157" /translation="MTGYDAIVIGAGHNGLTAAVLLQRAGLRTACLDAKRYAGGMAST VELFDGYRFEIAGSVQFPTSSAVSSELGLDSLPTVDLEVMSVALRGVGDDPVVQFTDP TKMLTHLHRVHGADAVTGMAGLLAWSQAPTRALGRFEAGTLPKSFDEMYACATNEFER SAIDDMLFGSVTDVLDRHFPDREKHGALRGSMTVLAVNTLYRGPATPGSAAALAFGLG VPEGDFVRWKKLRGGIGALTTHLSQLLERTGGEVRLRSKVTEIVVDNSRSSARVRGVR TAAGDTLTSPIVVSAIAPDVTINELIDPAVLPSEIRDRYLRIDHRGSYLQMHFALAQP PAFAAPYQALNDPSMQASMGIFCTPEQVQQQWEDCRRGIVPADPTVVLQIPSLHDPSL APAGKQAASAFAMWFPIEGGSKYGGYGRAKVEMGQNVIDKITRLAPNFKGSILRYTTF TPKHMGVMFGAPGGDYCHALLHSDQIGPNRPGPKGFIGQPIPIAGLYLGSAGCHGGPG ITFIPGYNAARQALADRRAANCCVLSGR" gene complement(4305056..4305685) /locus_tag="Rv3830c" /db_xref="GeneID:886158" CDS complement(4305056..4305685) /locus_tag="Rv3830c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3830c, (MTCY01A6.39), len: 209 aa. Probable transcriptional regulator tetR family, similar to others e.g. P39885|TCMR_STRGA TETRACENOMYCIN C TRANSCRIPTIONAL REPRESSOR from Streptomyces glaucescens (226 aa) FASTA scores: opt: 255, E(): 6.1e-10, (33.65% identity in 202 aa overlap); Q9RDR0|SC4A7.02 PUTATIVE TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (227 aa) FASTA scores: opt: 230, E(): 2.8e-08, (30.05% identity in 213 aa overlap); Q9EWU3|3SC5B7.06 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 221, E(): 1.2e-07, (32.05% identity in 181 aa overlap); Q9AJ68|BUTR PUTATIVE TRANSCRIPTIONAL REPRESSOR from Streptomyces cinnamonensis (268 aa), FASTA scores: opt: 216, E(): 2.7e-07, (37.8% identity in 119 aa overlap); etc. Contains possible helix-turn-helix motif from aa 33-54, Score 1699 (+4.97 SD). SEEMS TO BELONG TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein TetR-family" /protein_id="NP_218347.1" /db_xref="GI:15610966" /db_xref="GeneID:886158" /translation="MVRPPQTARSERTREALRQAALVRFLAQGVEATSAEQIAEDAGV SLRTFYRHFRSKHDLLFADYDAGLHWFRAALDARPADESIIDSVQAAIFSFPYDVDAV TKIASLRRGELEPSRIVRHMREVEADFADAIQAQLRRRNCDIAGAPDARLHIAVTARC VAAAVFGAMEAWMLGSDRSLGELARVCHVALESLRVGISDTWTTLTVSS" gene 4305757..4306239 /locus_tag="Rv3831" /db_xref="GeneID:886161" CDS 4305757..4306239 /locus_tag="Rv3831" /function="UNKNOWN" /note="Rv3831, (MTCY01A6.38c), len: 160 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218348.1" /db_xref="GI:15610967" /db_xref="GeneID:886161" /translation="MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYV VGIASIALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTIAN VILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHRHERSRATVGA" gene complement(4306236..4306811) /locus_tag="Rv3832c" /db_xref="GeneID:886165" CDS complement(4306236..4306811) /locus_tag="Rv3832c" /function="UNKNOWN" /note="Rv3832c, (MTCY01A6.37), len: 191 aa. Conserved hypothetical protein, similar in part to various proteins e.g. Q9XBC9|CZA382.22c PUTATIVE RRNA METHYLASE from Amycolatopsis orientalis (259 aa), FASTA scores: opt: 196, E(): 1.3e-05, (38.2% identity in 110 aa overlap); CAC48459|SMB20059 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (259 aa), FASTA scores: opt: 188, E(): 4.3e-05, (33.8% identity in 136 aa overlap); Q98FP8|MLL3672 METHYL TRANSFERASE-LIKE PROTEIN from Rhizobium loti (Mesorhizobium loti) (264 aa), FASTA scores: opt: 180, E(): 0.00014, (32.05% identity in 156 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218349.1" /db_xref="GI:15610968" /db_xref="GeneID:886165" /translation="MAMNLLHRRHCSSAGWEKAVANQLLPWALQHVELGPRTLEIGPG YGATLQALLGLTASLTAVEVDNSMVERLNRRYGQRARIIRGDGTQTGLPDDHFTSVVC FTMLHHVASAQLQDQLFAEAYRVLQPGGVFAGSDGVPSLPFRLIHIADTYTPIAPADL PGRLRAVGFTDIHVDVAGARLRWRATKPVAA" gene 4306867..4307658 /locus_tag="Rv3833" /db_xref="GeneID:886162" CDS 4306867..4307658 /locus_tag="Rv3833" /function="INVOLVED IN A TRANSCRIPTIONAL MECHANISM." /note="Rv3833, (MTCY01A6.36c), len: 263 aa. Probable transcriptional regulator belonging to araC family, similar to others e.g. Q9KYN4|SC9H11.05 PUTATIVE ARAC-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (289 aa), FASTA scores: opt: 754, E(): 1.2e-42, (50.45% identity in 232 aa overlap); Q9HXH2|PA3830 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (270 aa), FASTA scores: opt: 501, E(): 6.2e-26, (34.85% identity in 238 aa overlap); Q9HX87|PA3927 PROBABLE TRANSCRIPTIONAL REGULATOR from Pseudomonas aeruginosa (262 aa), FASTA scores: opt: 496, E(): 1.3e-25, (36.45% identity in 266 aa overlap); P76241|YEAM_ECOLI|B1790 HYPOTHETICAL TRANSCRIPTIONAL REGULATOR from Escherichia coli strain K12 (273 aa) FASTA scores: opt: 388, E(): 1.9e-18, (30.5% identity in 223 aa overlap); etc. Contains probable helix-turn-helix motif from aa 164-185, Score 2014 (+6.05 SD). SEEMS TO BELONG TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL REGULATORS." /codon_start=1 /transl_table=11 /product="AraC family transcriptional regulator" /protein_id="NP_218350.1" /db_xref="GI:15610969" /db_xref="GeneID:886162" /translation="MSENSHHRLATTSLTLPPGARIERHRHPSHQIVYPSAGAVSVTT HAGTWITPVNRAIWIPAGCWHQHKFHGHTQFHGVALDPQRYRGGPATPTVLAVNPLMR ELVIACSQADRTDTDEHHRMLAVLQDQLPTTSIREPLWVPSPTDRRLRHACALIADNL TQPLTLQQIGGRIGVSQRTLSRLFSDELGMTFPQWRTQLRLQHALVLLAERHDVTSVA SECGWATPSAFIDTYRQAFGHTPGQAAKPMAATRLTRLRRARDRR" gene complement(4307655..4308914) /gene="serS" /locus_tag="Rv3834c" /db_xref="GeneID:886163" CDS complement(4307655..4308914) /gene="serS" /locus_tag="Rv3834c" /EC_number="6.1.1.11" /function="INVOLVED IN TRANSLATION MECHANISM [CATALYTIC ACTIVITY: ATP + L-SERINE + TRNA(SER) = AMP + PYROPHOSPHATE + L-SERYL-TRNA(SER)]." /note="catalyzes a two-step reaction, first charging a serine molecule by linking its carboxyl group to the alpha-phosphate of ATP, followed by transfer of the aminoacyl-adenylate to its tRNA" /codon_start=1 /transl_table=11 /product="seryl-tRNA synthetase" /protein_id="NP_218351.1" /db_xref="GI:15610970" /db_xref="GeneID:886163" /translation="MIDLKLLRENPDAVRRSQLSRGEDPALVDALLTADAARRAVIST ADSLRAEQKAASKSVGGASPEERPPLLRRAKELAEQVKAAEADEVEAEAAFTAAHLAI SNVIVDGVPAGGEDDYAVLDVVGEPSYLENPKDHLELGESLGLIDMQRGAKVSGSRFY FLTGRGALLQLGLLQLALKLAVDNGFVPTIPPVLVRPEVMVGTGFLGAHAEEVYRVEG DGLYLVGTSEVPLAGYHSGEILDLSRGPLRYAGWSSCFRREAGSHGKDTRGIIRVHQF DKVEGFVYCTPADAEHEHERLLGWQRQMLARIEVPYRVIDVAAGDLGSSAARKFDCEA WIPTQGAYRELTSTSNCTTFQARRLATRYRDASGKPQIAATLNGTLATTRWLVAILEN HQRPDGSVRVPDALVPFVGVEVLEPVA" misc_feature complement(4308075..4308149) /gene="serS" /locus_tag="Rv3834c" /note="PS00179 Aminoacyl-transfer RNA synthetases class-II signature 1" gene 4309047..4310396 /locus_tag="Rv3835" /db_xref="GeneID:886168" CDS 4309047..4310396 /locus_tag="Rv3835" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3835, (MTCY01A6.34c), len: 449 aa. Probable conserved membrane protein, equivalent to Q9CDC2|ML0081 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (450 aa), FASTA scores: opt: 2079, E(): 1.8e-74, (69.35% identity in 457 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218352.1" /db_xref="GI:15610971" /db_xref="GeneID:886168" /translation="MLDAPEQDPVDPGDPASPPHGEAEQPLPGPRWPRALRASATRRA LLLTALGGLLIAGLVTAIPAVGRAPERLAGYIASNPVPSTGAKINASFNRVASGDCLM WPDGTPESAAIVSCADEHRFEVAESIDMRTFPGMEYGQNAAPPSPARIQQISEEQCEA AVRRYLGTKFDPNSKFTISMLWPGDRAWRQAGERRMLCGLQSPGPNNQQLAFKGKVAD IDQSKVWPAGTCLGIDATTNQPIDVPVDCAAPHAMEVSGTVNLAERFPDALPSEPEQD GFIKDACTRMTDAYLAPLKLRTTTLTLIYPTLTLPSWSAGSRVVACSIGATLGNGGWA TLVNSAKGALLINGQPPVPPPDIPEERLNLPPIPLQLPTPRPAPPAQQLPSTPPGTQH LPAQQPVVTPTRPPESHAPASAAPAETQPPPPDAGAPPATQSPEATPPGPAEPAPAG" gene 4310401..4310814 /locus_tag="Rv3836" /db_xref="GeneID:886164" CDS 4310401..4310814 /locus_tag="Rv3836" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3836, (MTCY01A6.33c), len: 137aa. Conserved hypothetical protein, highly similar to Q9RKJ2|SCD25.30 HYPOTHETICAL 13.1 KDA PROTEIN from Streptomyces coelicolor (116 aa), FASTA scores: opt: 395, E(): 3.3e-19, (54.4% identity in 114 aa overlap); and similar to CAC47753|SMC0379 CONSERVED HYPOTHETICAL PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) (144 aa) FASTA scores: opt: 194, E(): 6e-06, (33.05% identity in 109 aa overlap); and Q98E37|MLL4425 HYPOTHETICAL PROTEIN from Rhizobium loti (Mesorhizobium loti) (201 aa), FASTA scores: opt: 184, E(): 3.7e-05, (29.75% identity in 121 aa overlap). Contains PS00142 Neutral zinc metallopeptidases, zinc-binding region signature." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218353.1" /db_xref="GI:15610972" /db_xref="GeneID:886164" /translation="MTVRMDPQRFDELVSDALDLIPPELADAMDNVVVLVANRHPQHE NLLGQYEGVALTERGSDYAGSLPDAITIYREALLDACDSEDEVVDQVAITVIHEVAHH FGIDDERLDQLGWRDEPAPGRGNPDLSAPDAMNGP" misc_feature 4310680..4310709 /locus_tag="Rv3836" /note="PS00142 Neutral zinc metallopeptidases, zinc-binding region signature" gene complement(4311009..4311707) /locus_tag="Rv3837c" /db_xref="GeneID:886169" CDS complement(4311009..4311707) /locus_tag="Rv3837c" /EC_number="5.4.2.-" /function="THOUGHT TO BE INVOLVED IN GLYCOLISIS AND PERHAPS GLYCOGEN METABOLISM [CATALYTIC ACTIVITY: 3-PHOSPHOGLYCERATE = 2-PHOSPHOGLYCERATE]." /note="Rv3837c, (MTCY01A6.32), len: 232 aa. Probable phosphoglycerate mutase (EC 5.4.2.-), equivalent to Q9CDC3|ML0079 PUTATIVE PHOSPHOGLYCERATE MUTASE from Mycobacterium leprae (231 aa), FASTA scores: opt: 1116, E(): 7.3e-66, (71.55% identity in 232 aa overlap). Also similar to others e.g. Q9ZAX0|PGM 2,3-PDG DEPENDENT PHOSPHOGLYCERATE MUTASE from Amycolatopsis methanolica (205 aa), FASTA scores: opt: 474, E(): 6.4e-24, (41.85% identity in 203 aa overlap); Q9F3Q7|SC10F4.03 PUTATIVE ISOMERASE from Streptomyces coelicolor (224 aa) FASTA scores: opt: 349, E(): 1e-15, (33.2% identity in 223 aa overlap); Q9RDL0|SCC123.14c PUTATIVE PHOSPHOGLYCERATE MUTASE from Streptomyces coelicolor (223 aa), FASTA scores: opt: 256, E(): 1.2e-09, (34.0% identity in 203 aa overlap); Q9RVD2|DR1097 PUTATIVE PHOSPHOGLYCERATE MUTASE from Deinococcus radiodurans (232 aa), FASTA scores: opt: 201, E(): 5.1e-06, (31.45% identity in 175 aa overlap); etc. Also similar to P71724|Rv2419c|MTCY428.28|MTCY253.01 HYPOTHETICAL 24.2 KDA PROTEIN from Mycobacterium tuberculosis (223 aa), FASTA scores: opt: 210, E(): 1.3e-06, (32.0% identity in 172 aa overlap). Contains PS00175 Phosphoglycerate mutase family phosphohistidine signature." /codon_start=1 /transl_table=11 /product="phosphoglycerate mutase" /protein_id="NP_218354.1" /db_xref="GI:15610973" /db_xref="GeneID:886169" /translation="MSGRLVLLRHGQSYGNVERRLDTLPPGTALTPLGRDQARAFARS GCRRPALLAHSVAIRAYQTAAVVAAELDMVAHEVAGIHEVQVGELENRNDDEAVAEFN ATYSRWHRGELDVPLPGGETANDVLDRYLPVLADLRMRYLDDGDWDGDIVVVSHSAAI RLAAAVLAGVDGNFVLDNHLENVESVVLAPITDGRWSCVQWGLRKPPFCPDPAEAAAS PVTHAVTSSTDPMG" misc_feature complement(4311660..4311689) /locus_tag="Rv3837c" /note="PS00175 Phosphoglycerate mutase family phosphohistidine signature" gene complement(4311704..4312669) /gene="pheA" /locus_tag="Rv3838c" /db_xref="GeneID:886170" CDS complement(4311704..4312669) /gene="pheA" /locus_tag="Rv3838c" /EC_number="4.2.1.51" /function="INVOLVED IN L-PHENYLALANINE BIOSYNTHESIS [CATALYTIC ACTIVITY: PREPHENATE = PHENYLPYRUVATE + H(2)O + CO(2)]." /note="catalyzes the formation of phenylpyruvate from prephenate in phenylalanine biosynthesis" /codon_start=1 /transl_table=11 /product="prephenate dehydratase" /protein_id="NP_218355.1" /db_xref="GI:15610974" /db_xref="GeneID:886170" /translation="MVRIAYLGPEGTFTEAALVRMVAAGLVPETGPDALQRMPVESAP AALAAVRDGGADYACVPIENSIDGSVLPTLDSLAIGVRLQVFAETTLDVTFSIVVKPG RNAADVRTLAAFPVAAAQVRQWLAAHLPAADLRPAYSNADAARQVADGLVDAAVTSPL AAARWGLAALADGVVDESNARTRFVLVGRPGPPPARTGADRTSAVLRIDNQPGALVAA LAEFGIRGIDLTRIESRPTRTELGTYLFFVDCVGHIDDEAVAEALKAVHRRCADVRYL GSWPTGPAAGAQPPLVDEASRWLARLRAGKPEQTLVRPDDQGAQA" misc_feature complement(4311962..4311985) /gene="pheA" /locus_tag="Rv3838c" /note="PS00858 Prephenate dehydratase signature 2" gene 4312765..4313541 /locus_tag="Rv3839" /db_xref="GeneID:886171" CDS 4312765..4313541 /locus_tag="Rv3839" /function="UNKNOWN" /note="Rv3839, (MTCY01A6.30c), len: 258 aa. Conserved hypothetical protein, similar in part to Q9RD78|SCF43.10cfrom HYPOTHETICAL 25.8 KDA PROTEIN Streptomyces coelicolor (241 aa), FASTA scores: opt: 270, E(): 3.2e-10, (33.45% identity in 272 aa overlap); and O00320|F25451_2 HYPOTHETICAL PROTEIN from Homo sapiens (Human) (339 aa), FASTA scores: opt: 126, E(): 0.77, (28.75% identity in 240 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218356.1" /db_xref="GI:15610975" /db_xref="GeneID:886171" /translation="MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLY DGSFAVAVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETL DLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLA ARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEAR DGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRNGLRARR" gene 4313567..4313980 /locus_tag="Rv3840" /db_xref="GeneID:886167" CDS 4313567..4313980 /locus_tag="Rv3840" /function="SUPPOSED INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3840, (MTCY01A6.29c), len: 137 aa. Possible transcriptional regulator, highly similar in part to PSR PROTEINS (PENICILLIN BINDING PROTEIN REPRESSORS) e.g. Q47828|PSR PSR PROTEIN from Enterococcus hirae (293 aa) FASTA scores: opt: 221, E(): 2.2e-07, (41.65% identity in 108 aa overlap); O86213|PSRFM PSRFM PROTEIN (FRAGMENT) from Enterococcus hirae (171 aa), FASTA scores: opt: 202, E(): 2.4e-06, (40.75% identity in 108 aa overlap); Q47865|PSR PENICILLIN BINDING PROTEIN REPRESSOR from Enterococcus hirae (148 aa), FASTA scores: opt: 201, E(): 2.5e-06, (51.65% identity in 60 aa overlap); etc. Also highly similar in part to other transcriptional regulators e.g. BAB57524|MSRR PEPTIDE METHIONINE SULFOXIDE REDUCTASE REGULATOR from Staphylococcus aureus subsp. aureus Mu50 (327 aa), FASTA scores: opt: 195, E(): 1.2e-05, (36.7% identity in 109 aa overlap); Q99Q02|MSRR|SA1195 PEPTIDE METHIONINE SULFOXIDE REDUCTASE REGULATOR from Staphylococcus aureus subsp. aureus N315, and Staphylococcus aureus (327 aa), FASTA scores: opt: 192, E(): 1.9e-05, (36.7% identity in 109 aa overlap); Q9K6Q8|LYTR|BH3670 ATTENUATOR FOR LYTABC AND LYTR EXPRESSION from Bacillus halodurans (304 aa), FASTA scores: opt: 171, E(): 0.00041, (34.5% identity in 113 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein" /protein_id="NP_218357.1" /db_xref="GI:15610976" /db_xref="GeneID:886167" /translation="MAGCIQRFSHVRCLGPGLASDNPTTLISIPRDSYVPIPGHGRDK INAAFALGGGRLLTQTVELATGLHLDHYAEVGFSEFADLVDAFDPLAGVDLPAGCQTL DGRAALGYVRTRATPRADLEGSDVPVPAAAFETQP" gene 4314178..4314723 /gene="bfrB" /locus_tag="Rv3841" /db_xref="GeneID:886176" CDS 4314178..4314723 /gene="bfrB" /locus_tag="Rv3841" /function="INVOLVED IN IRON STORAGE; FERRITIN IS AN INTRACELLULAR MOLECULE THAT STORES IRON IN A SOLUBLE, NONTOXIC, READILY AVAILABLE FORM. THE FUNCTIONAL MOLECULE, WHICH IS COMPOSED OF 24 CHAINS, IS ROUGHLY SPHERICAL AND CONTAINS A CENTRAL CAVITY IN WHICH THE POLYMERIC FERRIC IRON CORE IS DEPOSITED." /experiment="experimental evidence, no additional details recorded" /note="Rv3841, (MTCY01A6.28c), len: 181 aa. Possible bfrB, bacterioferritin, similar to other ferritin or hypothetical proteins e.g. O26261|MTH158|RSGA FERRITIN LIKE PROTEIN from Methanothermobacter thermautotrophicus (171 aa), FASTA scores: opt: 277, E(): 6.6e-11, (30.1% identity in 166 aa overlap); Q99SZ3|SA1709 HYPOTHETICAL PROTEIN from Staphylococcus aureus subsp. aureus N315 (166 aa), FASTA scores: opt: 275, E(): 8.7e-11, (33.35% identity in 156 aa overlap); Q9X0L2|TM1128 FERRITIN from Thermotoga maritima (164 aa), FASTA scores: opt: 247, E(): 5.3e-09, (25.65% identity in 156 aa overlap); Q9KDT7|BH1124 FERRITIN from Bacillus halodurans (169 aa), FASTA scores: opt: 246, E(): 6.3e-09, (28.95% identity in 152 aa overlap); O29424|AF0834 PUTATIVE FERRITIN from Archaeoglobus fulgidu (169 aa), FASTA scores: opt: 246, E(): 6.3e-09, (28.95% identity in 152 aa overlap); etc. Also shows similarity with Rv1876|MTCY180.42|BFRA PROBABLE BACTERIOFERRITIN from Mycobacterium tuberculosis (159 aa). SEEMS BELONG TO THE BACTERIOFERRITIN FAMILY." /codon_start=1 /transl_table=11 /product="bacterioferritin BfrB" /protein_id="NP_218358.1" /db_xref="GI:15610977" /db_xref="GeneID:886176" /translation="MTEYEGPKTKFHALMQEQIHNEFTAAQQYVAIAVYFDSEDLPQL AKHFYSQAVEERNHAMMLVQHLLDRDLRVEIPGVDTVRNQFDRPREALALALDQERTV TDQVGRLTAVARDEGDFLGEQFMQWFLQEQIEEVALMATLVRVADRAGANLFELENFV AREVDVAPAASGAPHAAGGRL" gene complement(4314738..4315562) /gene="glpQ1" /locus_tag="Rv3842c" /db_xref="GeneID:886177" CDS complement(4314738..4315562) /gene="glpQ1" /locus_tag="Rv3842c" /EC_number="3.1.4.46" /function="GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE HYDROLYZES DEACYLATED PHOSPHOLIPIDS TO G3P AND THE CORRESPONDING ALCOHOLS [CATALYTIC ACTIVITY: A GLYCEROPHOSPHODIESTER + H(2)O = AN ALCOHOL + SN-GLYCEROL 3-PHOSPHATE]." /experiment="experimental evidence, no additional details recorded" /note="Rv3842c, (MTCY01A6.27), len: 274 aa. Probable glpQ1, glycerophosphoryl diester phosphodiesterase (EC 3.1.4.46), equivalent to Q9CDC5|GLPQ|ML0074 PUTATIVE GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE from Mycobacterium leprae (271 aa), FASTA scores: opt: 1635, E(): 1.9e-100, (88.85% identity in 269 aa overlap). Also highly similar to others e.g. CAC44700|SCBAC25E3.13c PUTATIVE PHOSPHODIESTERASE from Streptomyces coelicolor (275 aa), FASTA scores: opt: 413, E(): 5.7e-20, (48.05% identity in 258 aa overlap); P37965|GLPQ_BACSU GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE from Bacillus subtilis (293 aa), FASTA scores: opt: 405, E(): 2e-19, (31.3% identity in 249 aa overlap); Q99VC9|GLPQ|SA0820 GLYCEROPHOSPHORYL DIESTER PHOSPHODIESTERASE from Staphylococcus aureus subsp. aureus N315 (309 aa) FASTA scores: opt: 341, E(): 3.5e-15, (29.3% identity in 273 aa overlap); etc." /codon_start=1 /transl_table=11 /product="glycerophosphoryl diester phosphodiesterase" /protein_id="NP_218359.1" /db_xref="GI:15610978" /db_xref="GeneID:886177" /translation="MTWADEVLAGHPFVVAHRGASAARPEHTLAAYDLALKEGADGVE CDVRLTRDGHLVCVHDRRLDRTSTGAGLVSTMTLAQLRELEYGAWHDSWRPDGSHGDT SLLTLDALVSLVLDWHRPVKIFVETKHPVRYGSLVENKLLALLHRFGIAAPASADRSR AVVMSFSAAAVWRIRRAAPLLPTVLLGKTPRYLTSSAATAVGATAVGPSLPALKEYPQ LVDRSAAQGRAVYCWNVDEYEDIDFCREVGVAWIGTHHPGRTKAWLEDGRANGTTR" gene complement(4315568..4316596) /locus_tag="Rv3843c" /db_xref="GeneID:886178" CDS complement(4315568..4316596) /locus_tag="Rv3843c" /function="UNKNOWN" /note="Rv3843c, (MTCY01A6.26), len: 342 aa. Probable conserved transmembrane protein, equivalent to Q9CDC6|ML0073 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (344 aa), FASTA scores: opt: 1420, E(): 2.6e-68, (63.05% identity in 349 aa overlap)." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218360.1" /db_xref="GI:15610979" /db_xref="GeneID:886178" /translation="MIQVCSQCGTGWNVRERQRVWCPRCRGMLLAPLADMPAEARWRT PARPQVPTASDTRRTPPRLPPGFRWIAVRPGAAPPPRHGPRLRGPTPRYAGIPRWGLT DHVDQAPVPASAKAGPSPAAVRTTLLVSLLVFSIAVVVFVVRYVLLVINRNTLLNSVV ASASVWLGVLVSLAAIAAAGTTIVLLVRWLVARRAAAFMHQGLPERRSARELWAGCLL PMVNLLWAPLYVIELALVEDRYTRLRRPIVVWWIVWIVSNAISMFAFATSWVTDAQGI ANNTTMMVLAYLCAAAAVAAAARVFEGFEQKPVERPAHRWVVVNTDGRSAPASSVAVE LDGQEPAA" gene 4318775..4319266 /locus_tag="Rv3844" /db_xref="GeneID:886180" CDS 4318775..4319266 /locus_tag="Rv3844" /function="REQUIRED FOR THE TRANSPOSITION OF AN INSERTION SEQUENCE." /note="Rv3844, (MTCY01A6.25), len: 163 aa. Possible transposase, identical to P96234|Rv3348|MTV004.04 PUTATIVE TRANSPOSASE from Mycobacterium tuberculosis. Also some similarity with others e.g. N-terminal part of P19834|YI11_STRCL INSERTION ELEMENT IS116 HYPOTHETICAL 44.8 KDA PROTEIN from Streptomyces clavuligerus (399 aa) FASTA scores: opt: 146, E(): 0.017, (29.1% identity in 158 aa overlap)." /codon_start=1 /transl_table=11 /product="transposase" /protein_id="NP_218361.1" /db_xref="GI:15610980" /db_xref="GeneID:886180" /translation="MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPT LAGLRTLTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGAIV GKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVIDANRSWRRLMS LAR" gene 4319281..4319640 /locus_tag="Rv3845" /db_xref="GeneID:886179" CDS 4319281..4319640 /locus_tag="Rv3845" /function="UNKNOWN" /note="Rv3845, (MTCY01A6.24c), len: 119 aa. Hypothetical unknown protein. Contains PS01137 Hypothetical YBL055c/yjjV family signature 1." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218362.1" /db_xref="GI:15610981" /db_xref="GeneID:886179" /translation="MDRVRRVVTDRDSGAGALARHPLAGRRTDPQLAAFYHRLMTTQR HCHTQATIAVARKLAERTRVTITTGRPYQLRDTNGDPVTARGAKELIDAHYHVDTRTH PHNRAHTDTMQNSKPAR" misc_feature 4319548..4319574 /locus_tag="Rv3845" /note="PS01137 Hypothetical YBL055c/yjjV family signature 1" gene 4320704..4321327 /gene="sodA" /locus_tag="Rv3846" /db_xref="GeneID:886174" CDS 4320704..4321327 /gene="sodA" /locus_tag="Rv3846" /EC_number="1.15.1.1" /function="DESTROYS RADICALS WHICH ARE NORMALLY PRODUCED WITHIN THE CELLS AND ARE TOXIC TO BIOLOGICAL SYSTEMS [CATALYTIC ACTIVITY: 2 PEROXIDE RADICAL + 2 H(+) = O(2) + H(2)O(2)]." /experiment="experimental evidence, no additional details recorded" /note="Rv3846, (MTCY01A6.22c), len: 207 aa. sodA (alternate gene names: sodB, sod), superoxyde dismutase (EC 1.15.1.1) (see citations below), equivalent to many e.g. P47201|SODM_MYCAV|SODA|SOD from Mycobacterium avium (206 aa), FASTA scores: opt: 1210, E(): 1.8e-73, (82.5% identity in 206 aa overlap); Q9F9R1|SOD from Mycobacterium paratuberculosis (207 aa), FASTA scores: opt: 1207, E(): 2.9e-73, (81.65% identity in 207 aa overlap); O86165|SODM_MYCLP|SODA|SOD from Mycobacterium lepraemurium (206 aa), FASTA scores: opt: 1204, E(): 4.5e-73, (82.05% identity in 206 aa overlap); P13367|SODM_MYCLE|SODA|ML0072 from Mycobacterium leprae (206 aa), FASTA scores: opt: 1169, E(): 9.6e-71, (80.5% identity in 205 aa overlap); etc. Contains PS00088 Manganese and iron superoxide dismutases signature. BELONGS TO THE IRON/MANGANESE SUPEROXIDE DISMUTASE FAMILY. ALTHOUGH FOUND EXTRACELLULARLY, NO SIGNAL SEQUENCE IS PRESENT. AN ALTERNATIVE SECRETORY PATHWAY MAY BE USED.; sodB; sod" /codon_start=1 /transl_table=11 /product="superoxide dismutase [Fe] SODA" /protein_id="NP_218363.1" /db_xref="GI:15610982" /db_xref="GeneID:886174" /translation="MAEYTLPDLDWDYGALEPHISGQINELHHSKHHATYVKGANDAV AKLEEARAKEDHSAILLNEKNLAFNLAGHVNHTIWWKNLSPNGGDKPTGELAAAIADA FGSFDKFRAQFHAAATTVQGSGWAALGWDTLGNKLLIFQVYDHQTNFPLGIVPLLLLD MWEHAFYLQYKNVKVDFAKAFWNVVNWADVQSRYAAATSQTKGLIFG" misc_feature 4321181..4321204 /gene="sodA" /locus_tag="Rv3846" /note="PS00088 Manganese and iron superoxide dismutases signature" gene 4321538..4322071 /locus_tag="Rv3847" /db_xref="GeneID:886182" CDS 4321538..4322071 /locus_tag="Rv3847" /function="UNKNOWN" /note="Rv3847, (MTCY01A6.21c), len: 177 aa. Conserved hypothetical protein, equivalent to Q9CDC7|ML0071 HYPOTHETICAL PROTEIN from Mycobacterium leprae (177 aa) FASTA scores: opt: 1149, E(): 1.6e-64, (96.6% identity in 177 aa overlap); and Q9F9R0 HYPOTHETICAL 18.5 KDA PROTEIN from Mycobacterium paratuberculosis (177 aa), FASTA scores: opt: 1139, E(): 6.8e-64, (96.6% identity in 177 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218364.1" /db_xref="GI:15610983" /db_xref="GeneID:886182" /translation="MGTGSGGPIGVSPFHSRGALKGFVISGRWPDSTKEWAQLLMVAV RVASLPGLLSTTTVFGAREELPDEPEPGTVGLVLAEGTVFGESAIQPGYFADHQPPAL LMLHPPSETTPSLPECTGAASGCVLLPGLPYLGLEHRAAWVEAEADGTITSMVSRVGV DPISHPDTAILAMLLAA" gene 4322326..4323234 /locus_tag="Rv3848" /db_xref="GeneID:886159" CDS 4322326..4323234 /locus_tag="Rv3848" /function="UNKNOWN" /note="Rv3848, (MTCY01A6.20c), len: 302 aa. Probable conserved transmembrane protein, similar to hypothetical (transmembrane) proteins e.g. Q9HVG2|PA4629 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (192 aa), FASTA scores: opt: 304, E(): 5.3e-11, (35.05% identity in 174 aa overlap); Q9A5S7|CC2370 HYPOTHETICAL PROTEIN from Caulobacter crescentus (207 aa), FASTA scores: opt: 285, E(): 7.4e-10, (29.9% identity in 184 aa overlap); Q9KY43|SCC8A.05c PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (193 aa), FASTA scores: opt: 245, E(): 1.6e-07, (32.8% identity in 195 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218365.1" /db_xref="GI:15610984" /db_xref="GeneID:886159" /translation="MLAATLLSLGAVFLAELGDRSQLITMTYTLRYRWWVVLTGVAIA AFTVHGVAVAIGHFLGSTVPARPAACVSAIAFLIFAVWVWREDTASDSETSPTAAEPR LALFTVVSSFALAELGDKTTLATVTLASDHHWAGVWIGTTLGMILADGLAIGAGLLLH RRLPERLLQVLTGLLFLLFGLWLLFDDALGFRSVAIAVTAAVVLAAATTAVSVRVAQT RRRRPTAAATPEDDSTRPERSSVAPGHPGSILLPLPEVSLRGRRPPSGSPDERCADPG SKGGSRRISVGCWLPGVGRIRPTRSS" gene 4323499..4323897 /locus_tag="Rv3849" /db_xref="GeneID:886184" CDS 4323499..4323897 /locus_tag="Rv3849" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3849, (MTCY01A6.19c), len: 132 aa. Conserved hypothetical protein, equivalent to Q9CDC9|ML0069 HYPOTHETICAL PROTEIN from Mycobacterium leprae (132 aa) FASTA scores: opt: 724, E(): 8.7e-41, (83.95% identity in 131 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218366.1" /db_xref="GI:15610985" /db_xref="GeneID:886184" /translation="MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPY LSQLRSGNRTNPSGATMAALANFFRIKAAYFTDDEYYEKLDKELQWLCTMRDDGVRRI AQRAHGLPSAAQQKVLDRIDELRRAEGIDA" gene 4324015..4324671 /locus_tag="Rv3850" /db_xref="GeneID:886173" CDS 4324015..4324671 /locus_tag="Rv3850" /function="UNKNOWN" /note="Rv3850, (MTCY01A6.18c), len: 218 aa. Conserved hypothetical protein, equivalent to Q9CDD0|ML0068 HYPOTHETICAL PROTEIN from Mycobacterium leprae (238 aa) FASTA scores: opt: 1071, E(): 7.2e-55, (78.35% identity in 217 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218367.1" /db_xref="GI:15610986" /db_xref="GeneID:886173" /translation="MGLFGKRKSRATRRAEARAIKARAKLEAKLSAKNEARRIKAAQR AESKALKAQLKARRDSDRAALKVAEAELKVAREGKLLSPTRIRRLLTVSRLLAPILTP VIYRAAMAARGLIDQRRADQLGVPLAQIGRFSGHGARLSARVGGAERSLRMVQEKKPK DVETKQFVSAVTNRLTDLSAAVAAAEHMPAKRRRTAHSAISSQLDGIEADLMARLGLT" gene 4324683..4324967 /locus_tag="Rv3851" /db_xref="GeneID:886186" CDS 4324683..4324967 /locus_tag="Rv3851" /function="UNKNOWN" /note="Rv3851, (MTCY01A6.17c), len: 94 aa. Possible membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218368.1" /db_xref="GI:15610987" /db_xref="GeneID:886186" /translation="MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPA AFAHTAQLSTATPAPAVAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR" gene 4325074..4325478 /gene="hns" /locus_tag="Rv3852" /db_xref="GeneID:886187" CDS 4325074..4325478 /gene="hns" /locus_tag="Rv3852" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3852, (MTCY01A6.16c), len: 134 aa. Possible hns, histone-like protein, equivalent to Q9CDD1|HNS|ML0067 HISTONE-LIKE PROTEIN from Mycobacterium leprae (121 aa), FASTA scores: opt: 341, E(): 4.3e-09, (51.5% identity in 134 aa overlap). Shows some similarity with other histone-like proteins e.g. O65795|HIS1 HISTONE H1 from Triticum aestivum (Wheat) (288 aa), FASTA scores: opt: 183, E(): 0.091, (34.85% identity in 109 aa overlap); etc." /codon_start=1 /transl_table=11 /product="histone-like protein HNS" /protein_id="NP_218369.1" /db_xref="GI:15610988" /db_xref="GeneID:886187" /translation="MPDPQDRPDSEPSDASTPPAKKLPAKKAAKKAPARKTPAKKAPA KKTPAKGAKSAPPKPAEAPVSLQQRIETNGQLAAAAKDAAAQAKSTVEGANDALARNA SVPAPSHSPVPLIVAVTLSLLALLLIRQLRRR" gene 4325495..4325968 /gene="menG" /locus_tag="Rv3853" /db_xref="GeneID:886181" CDS 4325495..4325968 /gene="menG" /locus_tag="Rv3853" /EC_number="2.1.-.-" /function="INVOLVED IN MENAQUINONE BIOSYNTHESIS (AT THE LAST STEP). CONVERTS DIMETHYLMENAQUINONE (DMK) TO MENAQUINONE (MK)." /experiment="experimental evidence, no additional details recorded" /note="regulator of RNase E; increases half-life and abundance of RNAs; interacts with RNase E possibly inhibiting catalytic activity" /codon_start=1 /transl_table=11 /product="ribonuclease activity regulator protein RraA" /protein_id="NP_218370.1" /db_xref="GI:15610989" /db_xref="GeneID:886181" /translation="MAISFRPTADLVDDIGPDVRSCDLQFRQFGGRSQFAGPISTVRC FQDNALLKSVLSQPSAGGVLVIDGAGSLHTALVGDVIAELARSTGWTGLIVHGAVRDA AALRGIDIGIKALGTNPRKSTKTGAGERDVEITLGGVTFVPGDIAYSDDDGIIVV" gene complement(4326004..4327473) /gene="ethA" /locus_tag="Rv3854c" /db_xref="GeneID:886175" CDS complement(4326004..4327473) /gene="ethA" /locus_tag="Rv3854c" /function="ACTIVATES THE PRO-DRUG ETHIONAMIDE (ETH); INDUCED ETH SENSITIVITY WHEN OVEREXPRESSED IN Mycobacterium tuberculosis." /experiment="experimental evidence, no additional details recorded" /note="Rv3854c, (MTCY01A6.14), len: 489 aa. ethA (alternate gene names: aka, etaA), monooxygenase required for activation of the pro-drug ethionamide (EC 1.-.-.-) (see citations below), highly similar to other monooxygenases e.g. Q9A588|CC2569 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (498 aa), FASTA scores: opt: 1959, E(): 2.9e-114, (57.6% identity in 481 aa overlap); Q9RZT0|DRB0033 ARYLESTERASE/MONOXYGENASE from Deinococcus radiodurans (833 aa), FASTA scores: opt: 1771, E(): 2.2e-102, (53.75% identity in 480 aa overlap); Q9A8K5|CC1348 MONOOXYGENASE (FLAVIN-BINDING FAMILY) from Caulobacter crescentus (499 aa), FASTA scores: opt: 1385, E(): 1.4e-78, (43.2% identity in 486 aa overlap); etc. Also highly similar to others from Mycobacterium tuberculosis e.g. O53300|Rv3083|MTV013.04 MONOXYGENASE (495 aa) FASTA scores: opt: 1692, E(): 1.1e-97, (49.7% identity in 489 aa overlap); O53762|Rv0565c|MTV039.03c PUTATIVE MONOXYGENASE (486 aa), FASTA scores: opt: 1571, E(): 3.7e-90, (49.05% identity in 471 aa overlap); O69708|Rv3741c|MTV025.089c POSSIBLE OXIDOREDUCTASE (probably second part of a two component monooxygenase) (224 aa), FASTA scores: opt: 542, E(): 1.7e-26, (50.0% identity in 162 aa overlap); etc.; aka; etaA" /codon_start=1 /transl_table=11 /product="monooxygenase ETHA" /protein_id="NP_218371.1" /db_xref="GI:15610990" /db_xref="GeneID:886175" /translation="MTEHLDVVIVGAGISGVSAAWHLQDRCPTKSYAILEKRESMGGT WDLFRYPGIRSDSDMYTLGFRFRPWTGRQAIADGKPILEYVKSTAAMYGIDRHIRFHH KVISADWSTAENRWTVHIQSHGTLSALTCEFLFLCSGYYNYDEGYSPRFAGSEDFVGP IIHPQHWPEDLDYDAKNIVVIGSGATAVTLVPALADSGAKHVTMLQRSPTYIVSQPDR DGIAEKLNRWLPETMAYTAVRWKNVLRQAAVYSACQKWPRRMRKMFLSLIQRQLPEGY DVRKHFGPHYNPWDQRLCLVPNGDLFRAIRHGKVEVVTDTIERFTATGIRLNSGRELP ADIIITATGLNLQLFGGATATIDGQQVDITTTMAYKGMMLSGIPNMAYTVGYTNASWT LKADLVSEFVCRLLNYMDDNGFDTVVVERPGSDVEERPFMEFTPGYVLRSLDELPKQG SRTPWRLNQNYLRDIRLIRRGKIDDEGLRFAKRPAPVGV" gene 4327549..4328199 /gene="ethR" /locus_tag="Rv3855" /db_xref="GeneID:886189" CDS 4327549..4328199 /gene="ethR" /locus_tag="Rv3855" /function="REGULATES NEGATIVELY THE PRODUCTION OF ETHA. INDUCED ETH RESISTANCE WHEN OVEREXPRESSED IN Mycobacterium tuberculosis." /note="Rv3855, (MTCY01A6.13c), len: 216 aa. ethR (alternate gene names: aka, etaR), regulatory protein tetR family, involved in ethionamide sensitivity/resistance, negatively controls neighbouring ethA (Rv3854c, MTCY01A6.14; alternate gene names: aka etaA) (see citations below). Equivalent to Q9CDD3|ML0064 PUTATIVE TRANSCRIPTIONAL REGULATOR from Mycobacterium leprae (214 aa), FASTA scores: opt: 1017, E(): 7e-62, (77.0% identity in 213 aa overlap). Also similar to other transcriptional regulator e.g. Q9S1R1|SCJ9A.09 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR from Streptomyces coelicolor (204 aa), FASTA scores: opt: 305, E(): 1.2e-13, (34.5% identity in 200 aa overlap); Q9KYT9|SCE22.24 PUTATIVE TETR-FAMILY TRANSCRIPTIONAL REGULATOR (FRAGMENT) from Streptomyces coelicolor (244 aa), FASTA scores: opt: 179, E(): 4.9e-05, (35.5% identity in 93 aa overlap); Q9RUK2|DR1384 TRANSCRIPTIONAL REGULATOR (TETR FAMILY) from Deinococcus radiodurans (196 aa), FASTA scores: opt: 167, E(): 0.00026, (41.75% identity in 79 aa overlap); etc. Also similar to P95100|Rv3058c|MTCY22D7.23 HYPOTHETICAL 23.8 KDA PROTEIN from Mycobacterium tuberculosis (216 aa) FASTA scores: opt: 261, E(): 1.2e-10, (31.65% identity in 221 aa overlap); and O08377|Rv1534|MTCY07A7A.03 HYPOTHETICAL 24.5 KDA PROTEIN from Mycobacterium tuberculosis (225 aa), FASTA scores: opt: 164, E(): 0.00047, (25.5% identity in 248 aa overlap). Contains helix-turn-helix motif at aa 45-66, Score 1320 (+3.68 SD). BELONGS TO THE TETR/ACRR FAMILY OF TRANSCRIPTIONAL REGULATORS.; aka; etaR" /codon_start=1 /transl_table=11 /product="transcriptional regulatory repressor protein (TETR-family) ETHR" /protein_id="NP_218372.1" /db_xref="GI:15610991" /db_xref="GeneID:886189" /translation="MTTSAASQASLPRGRRTARPSGDDRELAILATAENLLEDRPLAD ISVDDLAKGAGISRPTFYFYFPSKEAVLLTLLDRVVNQADMALQTLAENPADTDRENM WRTGINVFFETFGSHKAVTRAGQAARATSVEVAELWSTFMQKWIAYTAAVIDAERDRG AAPRTLPAHELATALNLMNERTLFASFAGEQPSVPEARVLDTLVHIWVTSIYGENR" gene complement(4328401..4329408) /locus_tag="Rv3856c" /db_xref="GeneID:886193" CDS complement(4328401..4329408) /locus_tag="Rv3856c" /function="UNKNOWN" /note="Rv3856c, (MTCY01A6.12), len: 335 aa. Conserved hypothetical protein, highly similar to various proteins from diverse organisms e.g. Q9EWR3|3SCF60.21 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (372 aa) FASTA scores: opt: 1286, E(): 2.4e-73, (64.0% identity in 336 aa overlap); P72464|ORF1 from Streptomyces lividans (343 aa), FASTA scores: opt: 1275, E(): 1.1e-72, (60.1% identity in 336 aa overlap); Q9K899|BH3107 DNA-DEPENDENT DNA POLYMERASE BETA CHAIN from Bacillus halodurans (571 aa), FASTA scores: opt: 592, E(): 1.2e-29, (39.15% identity in 240 aa overlap); etc. May be a DNA polymerase beta (gene name: yshC) (see citation below)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218373.1" /db_xref="GI:15610992" /db_xref="GeneID:886193" /translation="MDPVTALRQIAYYKDRNRHDPRRVMAYRNAADIIEGLDDAARQR HGQANSWQSLAGIGPKTAKVIAQAWSGREPDLLAELRADAEDLGGGAIRAALRGDLHL HSNWSDGSAPIEEMMATAAALGHQYCALTDHSPRLTIANGLSPDRLRKQLDVIDELRE KFAPLRILTGIEVDILEDGSLDQEPEMLDRLDIVVASVHSKLSMDSAAMTRRMVRAVA NGHTDVLGHCTGRLIAGNRGIRPESKFDAEAVFTACREHGTAVEINSRPERRDPPTRL LHLARDIGCVFSIDTDAHAPGQLDFLGYGAQRALDAEVPADRIVNTWPADTLLAWTGS H" gene complement(4329417..4329614) /locus_tag="Rv3857c" /db_xref="GeneID:886192" CDS complement(4329417..4329614) /locus_tag="Rv3857c" /function="UNKNOWN" /note="Rv3857c, (MTCY01A6.11), len: 65 aa. Possible membrane protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218374.1" /db_xref="GI:15610993" /db_xref="GeneID:886192" /translation="MNCALGFDTKPILLASYVTHGARRATANQFERPAKGAGVLMALL ILGEMAGFAVVVTGVVFGQLV" gene complement(4330039..4331505) /gene="gltD" /locus_tag="Rv3858c" /db_xref="GeneID:886196" CDS complement(4330039..4331505) /gene="gltD" /locus_tag="Rv3858c" /EC_number="1.4.1.13" /function="PROBABLY INVOLVED IN GLUTAMATE BIOSYNTHESIS [CATALYTIC ACTIVITY: 2 L-GLUTAMATE + NAD(+) = L-GLUTAMINE + 2-OXOGLUTARATE + NADH]." /note="glutamate synthase is composed of subunits alpha and beta; beta subunit is a flavin adenine dinucleotide-NADPH dependent oxidoreductase; provides electrons to the alpha subunit, which binds L-glutamine and 2-oxoglutarate and forms L-glutamate" /codon_start=1 /transl_table=11 /product="glutamate synthase subunit beta" /protein_id="NP_218375.1" /db_xref="GI:15610994" /db_xref="GeneID:886196" /translation="MADPGGFLKYTHRKLPKRRPVPLRLRDWREVYEEFDNESLRQQA TRCMDCGIPFCHNGCPLGNLIPEWNDLVRRGRWRDAIERLHATNNFPDFTGRLCPAPC EPACVLGINQDPVTIKQIELEIIDKAFDEGWVQPRPPRKLTGQTVAVVGSGPAGLAAA QQLTRAGHTVTVFEREDRIGGLLRYGIPEFKMEKRHLDRRLDQMRSEGTEFRPGVNVG VDISAEKLRADFDAVVLAGGATAWRELPIPGRELEGVHQAMEFLPWANRVQEGDDVLD EDGQPPITAKGKKVVIIGGGDTGADCLGTVHRQGAIAVHQFEIMPRPPDARAESTPWP TYPLMYRVSAAHEEGGERVFSVNTEAFVGTDGRVSALRAHEVTMLDGKFVKVEGSDFE LEADLVLLAMGFVGPERAGLLTDLGVKFTERGNVARGDDFDTSVPGVFVAGDMGRGQS LIVWAIAEGRAAAAAVDRYLMGSSALPAPVKPTAAPLQ" gene complement(4331498..4336081) /gene="gltB" /locus_tag="Rv3859c" /db_xref="GeneID:886195" CDS complement(4331498..4336081) /gene="gltB" /locus_tag="Rv3859c" /EC_number="1.4.1.13" /function="PROBABLY INVOLVED IN GLUTAMATE BIOSYNTHESIS [CATALYTIC ACTIVITY: 2 L-GLUTAMATE + NADP(+) = L-GLUTAMINE + 2-OXOGLUTARATE + NADPH]." /note="Rv3859c, (MTCY01A6.09), len: 1527 aa. Probable gltB, ferredoxin-dependent glutamate synthase large subunit (EC 1.4.1.13), equivalent to Q9CDD5|GLTB|ML0061 PUTATIVE FERREDOXIN-DEPENDENT GLUTAMATE SYNTHASE from Mycobacterium leprae (1527 aa), FASTA scores: opt: 9277, E(): 0, (90.25% identity in 1527 aa overlap). Also highly similar to many e.g. Q9S2Y9|SC3A3.04c from Streptomyces coelicolor (1514 aa), FASTA scores: opt: 5939, E(): 0, (64.3% identity in 1544 aa overlap); Q9Z465|GLTB from Corynebacterium glutamicum (Brevibacterium flavum) (1510 aa), FASTA scores: opt: 5790, E(): 0, (63.25% identity in 1534 aa overlap); P39812|GLTB_BACSU|GLTA from Bacillus subtilis (1520 aa), FASTA scores: opt: 3445, E(): 2.8e-196, (52.25% identity in 1531 aa overlap); etc. SIMILAR TO OTHER GLUTAMATE SYNTHASES." /codon_start=1 /transl_table=11 /product="ferredoxin-dependent glutamate synthase [NADPH] large subunit" /protein_id="NP_218376.1" /db_xref="GI:15610995" /db_xref="GeneID:886195" /translation="MTPKRVGLYNPAFEHDSCGVAMVVDMHGRRSRDIVDKAITALLN LEHRGAQGAEPRSGDGAGILIQVPDEFLREAVDFELPAPGSYATGIAFLPQSSKDAAA ACAAVQKIAEAEGLQVLGWRSVPTDDSSLGALSRDAMPTFRQVFLAGASGMALERRCY VVRKRAEHELGTKGPGQDGPGRETVYFPSLSGQTLVYKGMLTTPQLKAFYLDLQDERL TSALGIVHSRFSTNTFPSWPLAHPFRRIAHNGEINTVTGNENWMRAREALIKTDIFGS AADVEKLFPICTPGASDTARFDEVLELLHLGGRSLAHAVLMMIPEAWERHESMDPARR AFYQYHASLMEPWDGPASMTFTDGTVVGAVLDRNGLRPSRIWVTDDGLVVMASEAGVL DLHPSTVVRRMRLQPGRMFLVDTAQGRIVSDEEIKADLAAEHPYQEWLDNGLVPLDEL PEGKDVRMPHHRIVMRQLAFGYTYEELNLLVAPMARLGAEPIGSMGTDTPVAVLSQRP RMLYDYFHQLFAQVTNPPLDAIREEVVTSLQGTTGGERDLLNPDQNSCHQIVLPQPIL RNHELAKLVSLDPNDKVNGRPHGLRSKVIRCLYRVSEGGAGLAAALEEVRGAAAAAIA DGARIIILSDRESDEEMAPIPSLLAVAGVHHHLVRERTRTQVGLVVESGDAREVHHMA ALVGFGAAAINPYLVFESIEDMLDRGVIEGIDRTAALNNYIKAAGKGVLKVMSKMGIS TLASYTGAQLFQAVGISEQVLDEYFTGLTCPTGGITLDDIAADVAARHRLAYLDRPDE RAHRELEVGGEYQWRREGEYHLFNPETVFKLQHSTRTGQYKIFKEYTRLVDDQSERMA SLRGLLKFRTGVRPPVPLDEVEPASEIVKRFSTGAMSYGSISAEAHETLAIAMNRLGA RSNCGEGGEDVKRFDRDPNGDWRRSAIKQVASARFGVTSHYLTNCTDLQIKMAQGAKP GEGGQLPGHKVYPWVAEVRHSTPGVGLISPPPHHDIYSIEDLAQLIHDLKNANPSARV HVKLVSENGVGTVAAGVSKAHADVVLISGHDGGTGATPLTSMKHAGAPWELGLAETQQ TLLLNGLRDRIVVQVDGQLKTGRDVMIATLLGAEEFGFATAPLVVAGCIMMRVCHLDT CPVGVATQNPLLRERFTGKPEFVENFFMFIAEEVREYLAQLGFRTVNEAVGQAGALDT TLARAHWKAHKLDLAPVLHEPESAFMNQDLYCSSRQDHGLDKALDQQLIVMSREALDS GKPVRFSTTIGNVNRTVGTMLGHELTKAYGGQGLPDGTIDITFDGSAGNSFGAFVPKG ITLRVYGDANDYVGKGLSGGRIVVRPSDDAPQDYVAEDNIIGGNVILFGATSGEVYLR GVVGERFAVRNSGAHAVVEGVGDHGCEYMTGGRVVILGRTGRNFAAGMSGGVAYVYDP DGELPANLNSEMVELETLDEDDADWLHGTIQVHVDATDSAVGQRILSDWSGQQRHFVK VMPRDYKRVLQAIALAERDGVDVDKAIMAAAHG" gene 4336777..4337949 /locus_tag="Rv3860" /db_xref="GeneID:886188" CDS 4336777..4337949 /locus_tag="Rv3860" /function="UNKNOWN" /note="Rv3860, (MTCY01A6.08c), len: 390 aa. Conserved hypothetical protein, showing similarity with hypothetical proteins from Mycobacterium leprae e.g. Q9CDD8|ML0048 (586 aa), FASTA scores: opt: 484, E(): 5.5e-14, (29.95% identity in 407 aa overlap); O33082|MLCB628.11c (478 aa) FASTA scores: opt: 484, E(): 4.8e-14, (29.95% identity in 407 aa overlap); etc. Also some similarity with O86637|SC3C3.03c HYPOTHETICAL 112.1 KDA PROTEIN from Streptomyces coelicolor(1083 aa), FASTA scores: opt: 483, E(): 9.6e-14, (30.45% identity in 404 aa overlap). And some similarity with other proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O05456|Rv3888c|MTCY15F10.24 HYPOTHETICAL 37.7 KDA PROTEIN (341 aa), FASTA scores: opt: 603, E(): 2.8e-19, (35.2% identity in 284 aa overlap); O06396|Rv0530|MTCY25D10.09 HYPOTHETICAL 43.0 KDA PROTEIN (405 aa), FASTA scores: opt: 538, E(): 2e-16, (31.0% identity in 371 aa overlap); O69740|Rv3876|MTV027.11 (666 aa), FASTA scores: opt: 475, E(): 1.5e-13, (30.2% identity in 391 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218377.1" /db_xref="GI:15610996" /db_xref="GeneID:886188" /translation="MYERDEFLRDRIRPHQPGTPRGYSPRPPSGDRCPAPPPGRHAAA ATPPGPPRLPSAPLRPLPDPAWPRQPEAPPPSTWADPALAPIRSRTRPGERGWRRMVR LVTFGLVGLGRSGMQRQEAQFEATIRTVLHGNHKVAVLGKGGVGKTSVAACVGSILAE LRQQDRIVGIDADTAFGRLSSRIDPRAAGSFWELTTDTNLRSFTDITARLGRNSAGLY VLAGQPASGPRRVLDPAIYREAALRLDHHFAISVIDCGSSMEAAVTQEVLRDVDALIV VSSPWADGASAAANTIEWLSDYGLTGLLRRSIVVLNDSDGHADKRTKSLLAQEFIDHG QPVVEVPFDPHLRPGGVIDMSHEMAPTTRLKILQVAATVTAYFASRPADAHGSPPR" misc_feature 4337197..4337220 /locus_tag="Rv3860" /note="PS00017 ATP/GTP-binding site motif A" gene 4337946..4338272 /locus_tag="Rv3861" /db_xref="GeneID:886183" CDS 4337946..4338272 /locus_tag="Rv3861" /function="UNKNOWN" /note="Rv3861, (MTCY01A6.07c), len: 108 aa. Hypothetical unknown protein. Overlaps in part next ORF Rv3862c|whiB6." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218378.1" /db_xref="GI:15610997" /db_xref="GeneID:886183" /translation="MTWLADPVGNSRIARAQACKTSISAPIVESWRAQRGAQCGQREK SCRCSRAVHIQGISPPLFRRPLEPAVQAAVASCRLGRHPVVAHRVTVALGQGSQLAQR ECPRPA" gene complement(4338171..4338521) /gene="whiB6" /locus_tag="Rv3862c" /db_xref="GeneID:886190" CDS complement(4338171..4338521) /gene="whiB6" /locus_tag="Rv3862c" /function="INVOLVED IN TRANSCRIPTIONAL MECHANISM." /note="Rv3862c, (MTCY01A6.06), len: 116 aa. Possible whiB6 (alternate gene name: whmF), WhiB-like regulatory protein (see citation below), similar to WhiB paralogue of Streptomyces coelicolor, wblE gene product (85 aa). Shows similarity with Q49765|WHIB7|ML0639|B1937_F2_68 PUTATIVE TRANSCRIPTIONAL REGULATOR WHIB7 from Mycobacterium leprae (89 aa) FASTA scores: opt: 112, E(): 0.49, (41.2% identity in 51 aa overlap). Some similarity to Q9AD55|SCP1.95 PUTATIVE REGULATORY PROTEIN from Streptomyces coelicolor (102 aa) FASTA scores: opt: 129, E(): 0.038, (32.95% identity in 85 aa overlap); AAK47632|MT3290.1 CONSERVED HYPOTHETICAL PROTEIN from Mycobacterium tuberculosis strain CDC1551 (96 aa), FASTA scores: opt: 126, E(): 0.058, (33.35% identity in 84 aa overlap); Q9FC80|SC4B10.07 CONSERVED HYPOTHETICAL PROTEIN from Streptomyces coelicolor (88 aa), FASTA scores: opt: 119, E(): 0.16, (44.65% identity in 70 aa overlap); Q9K4K8|SC5F8.16c REGULATORY PROTEIN from Streptomyces coelicolor (83 aa), FASTA scores: opt: 114, E(): 0.34, (37.05% identity in 54 aa overlap); etc.; whmF" /codon_start=1 /transl_table=11 /product="transcriptional regulatory protein WHIB-like WHIB6" /protein_id="NP_218379.1" /db_xref="GI:15610998" /db_xref="GeneID:886190" /translation="MRYAFAAEATTCNAFWRNVDMTVTALYEVPLGVCTQDPDRWTTT PDDEAKTLCRACPRRWLCARDAVESAGAEGLWAGVVIPESGRARAFALGQLRSLAERN GYPVRDHRVSAQSA" gene 4338849..4340027 /locus_tag="Rv3863" /db_xref="GeneID:886197" CDS 4338849..4340027 /locus_tag="Rv3863" /function="UNKNOWN" /note="Rv3863, (MTCY01A6.05c), len: 392 aa. Hypothetical unknown ala-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218380.1" /db_xref="GI:15610999" /db_xref="GeneID:886197" /translation="MAGERKVCPPSRLVPANKGSTQMSKAGSTVGPAPLVACSGGTSD VIEPRRGVAIIGHSCRVGTQIDDSRISQTHLRAVSDDGRWRIVGNIPRGMFVGGRRGS SVTVSDKTLIRFGDPPGGKALTFEVVRPSDSAAQHGRVQPSADLSDDPAHNAAPVAPD PGVVRAGAAAAARRRELDISQRSLAADGIINAGALIAFEKGRSWPRERTRAKLEEVLQ WPAGTIARIRRGEPTEPATNPDASPGLRPADGPASLIAQAVTAAVDGCSLAIAALPAT EDPEFTERAAPILADLRQLEAIAVQATRISRITPELIKALGAVRRHHDELMRLGATAP GATLAQRLYAARRRANLSTLETAQAAGVAEEMIVGAEAEEELPAEATEAIEALIRQIN" gene 4340270..4341478 /locus_tag="Rv3864" /db_xref="GeneID:886185" CDS 4340270..4341478 /locus_tag="Rv3864" /function="UNKNOWN" /note="Rv3864, (MTCY01A6.04c), len: 402 aa. Conserved hypothetical protein, similar to Q49722|ML0405|B1620_C2_213|MLCL383.01 HYPOTHETICAL 40.8 KDA PROTEIN from Mycobacterium leprae (394 aa) FASTA scores: opt: 397, E(): 1.2e-12, (31.0% identity in 410 aa overlap). Also similar to various proteins from several organisms e.g. Q9VYF9|CG12723 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (450 aa), FASTA scores: opt: 291, E(): 2.3e-07, (34.6% identity in 130 aa overlap); Q98UE3 PROCOLLAGEN ALPHA1(III) (FRAGMENT) from Xenopus laevis (African clawed frog) (117 aa) FASTA scores: opt: 257, E(): 3.6e-06, (41.75% identity in 103 aa overlap); P27393|CA24_ASCSU COLLAGEN ALPHA 2(IV) CHAIN PRECURSOR from Ascaris suum (Pig roundworm) (Ascaris lumbricoides) (1763 aa), FASTA scores: opt: 273, E(): 5.7e-06, (32.1% identity in 240 aa overlap); etc. Also similar to O06267|Rv3616c|MTCY07H7B.06 (392 aa) FASTA scores: opt: 389, E(): 3e-12, (31.6% identity in 399 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218381.1" /db_xref="GI:15611000" /db_xref="GeneID:886185" /translation="MASGSGLCKTTSNFIWGQLLLLGEGIPDPGDIFNTGSSLFKQIS DKMGLAIPGTNWIGQAAEAYLNQNIAQQLRAQVMGDLDKLTGNMISNQAKYVSDTRDV LRAMKKMIDGVYKVCKGLEKIPLLGHLWSWELAIPMSGIAMAVVGGALLYLTIMTLMN ATNLRGILGRLIEMLTTLPKFPGLPGLPSLPDIIDGLWPPKLPDIPIPGLPDIPGLPD FKWPPTPGSPLFPDLPSFPGFPGFPEFPAIPGFPALPGLPSIPNLFPGLPGLGDLLPG VGDLGKLPTWTELAALPDFLGGFAGLPSLGFGNLLSFASLPTVGQVTATMGQLQQLVA AGGGPSQLASMGSQQAQLISSQAQQGGQQHATLVSDKKEDEEGVAEAERAPIDAGTAA SQRGQEGTVL" gene 4341566..4341877 /locus_tag="Rv3865" /db_xref="GeneID:886172" CDS 4341566..4341877 /locus_tag="Rv3865" /function="UNKNOWN" /note="Rv3865, (MTCY01A6.03c), len: 103 aa. Conserved hypothetical protein, showing some similarity to O06268|Rv3615c|MTCY07H7B.07 HYPOTHETICAL 10.8 KDA PROTEIN from Mycobacterium tuberculosis (103 aa), FASTA scores: opt: 198, E(): 7.5e-07, (36.25% identity in 102 aa overlap); Q49723|ML0406|B1620_C2_214|MLCL383.02 HYPOTHETICAL 11.1 KDA PROTEIN from Mycobacterium leprae (106 aa), FASTA scores: opt: 154, E(): 0.00071, (31.05% identity in 103 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218382.1" /db_xref="GI:15611001" /db_xref="GeneID:886172" /translation="MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTH GSFTSKFNDTLQEFETTRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF G" gene 4341880..4342731 /locus_tag="Rv3866" /db_xref="GeneID:886200" CDS 4341880..4342731 /locus_tag="Rv3866" /function="UNKNOWN" /note="Rv3866, (MTCY01A6.01c, MTV027.01), len: 283 aa. Conserved hypothetical protein. N-terminal end highly similar to O33091|MLCB628.20c HYPOTHETICAL 13.1 KDA PROTEIN from Mycobacterium leprae (122 aa), FASTA scores: opt: 260, E(): 2.1e-09, (43.6% identity in 117 aa overlap); and C-terminal end highly similar to O33090|MLCB628.19c HYPOTHETICAL 36.7 KDA PROTEIN from Mycobacterium leprae (338 aa), FASTA scores: opt: 540, E(): 1.4e-26, (54.5% identity in 156 aa overlap). Also similar to Q9CD34|ML2530 POSSIBLE DNA-BINDING PROTEIN from Mycobacterium leprae (289 aa), FASTA scores: opt: 146, E(): 0.058, (28.25% identity in 269 aa overlap) and O53694|Rv0289|MTV035.17 HYPOTHETICAL 31.6 KDA PROTEIN from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 133, E(): 0.39, (28.15% identity in 277 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218383.1" /db_xref="GI:15611002" /db_xref="GeneID:886200" /translation="MTGPSAAGRAGTADNVVGVEVTIDGMLVIADRLHLVDFPVTLGI RPNIPQEDLRDIVWEQVQRDLTAQGVLDLHGEPQPTVAEMVETLGRPDRTLEGRWWRR DIGGVMVRFVVCRRGDRHVIAARDGDMLVLQLVAPQVGLAGMVTAVLGPAEPANVEPL TGVATELAECTTASQLTQYGIAPASARVYAEIVGNPTGWVEIVASQRHPGGTTTQTDA AAGVLDSKLGRLVSLPRRVGGDLYGSFLPGTQQNLERALDGLLELLPAGAWLDHTSDH AQASSRG" gene 4342770..4343321 /locus_tag="Rv3867" /db_xref="GeneID:886203" CDS 4342770..4343321 /locus_tag="Rv3867" /function="UNKNOWN" /note="Rv3867, (MTV027.02), len: 183 aa. Conserved hypothetical protein, highly similar to the hypothetical proteins from Mycobacterium leprae: Q9CDD6|ML0056 (169 aa) FASTA scores: opt: 403, E(): 1.8e-18, (48.2% identity in 166 aa overlap); Q49730|ML0407|B1620_C3_264|MLCL383.03 (216 aa), FASTA scores: opt: 517, E(): 1.7e-25, (51.45% identity in 175 aa overlap); and O33090|MLCB628.19c (338 aa), FASTA scores: opt: 403, E(): 3.4e-18, (48.2% identity in 166 aa overlap). Also highly similar to O06269|Rv3614c|MTCY07H7B.08 HYPOTHETICAL 19.8 KDA PROTEIN from Mycobacterium tuberculosis (184 aa), FASTA scores: opt: 559, E(): 3.4e-28, (54.35% identity in 173 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218384.1" /db_xref="GI:15611003" /db_xref="GeneID:886203" /translation="MVDPPGNDDDHGDLDALDFSAAHTNEASPLDALDDYAPVQTDDA EGDLDALHALTERDEEPELELFTVTNPQGSVSVSTLMDGRIQHVELTDKATSMSEAQL ADEIFVIADLARQKARASQYTFMVENIGELTDEDAEGSALLREFVGMTLNLPTPEEAA AAEAEVFATRYDVDYTSRYKADD" gene 4343314..4345035 /locus_tag="Rv3868" /db_xref="GeneID:886199" CDS 4343314..4345035 /locus_tag="Rv3868" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3868, (MTV027.03), len: 573 aa. Member of the CBXX/CFQX family of hypothetical proteins; C-terminal end is highly similar to many e.g. P40118|CBXC_ALCEU|CBXXC|CFXXC CBXX PROTEIN (317 aa) FASTA scores: opt: 572, E(): 3e-24, (42.7% identity in 294 aa overlap); CAC48589 PROBABLE CBBX PROTEIN from Rhizobium meliloti (Sinorhizobium meliloti) plasmid pSymB (311 aa) FASTA scores: opt: 569, E(): 4.3e-24, (40.05% identity in 292 aa overlap); P95648|CBBX_RHOSH CBBX PROTEIN from Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides) (309 aa), FASTA scores: opt: 559, E(): 1.5e-23, (41.4% identity in 290 aa overlap); etc. Equivalent to O33089|Y2G8_MYCLE|ML0055|MLCB628.18c HYPOTHETICAL 62.3 KDA PROTEIN from Mycobacterium leprae (573 aa), FASTA scores: opt: 3330, E(): 3.9e-175, (89.2% identity in 573 aa overlap); and similar to Q9CD28|Y282_MYCLE|ML2537 HYPOTHETICAL 69.1 KDA PROTEIN from Mycobacterium leprae (640 aa), FASTA scores: opt: 943, E(): 2.4e-44, (37.5% identity in 571 aa overlap). Also similar to many proteins from Mycobacterium tuberculosis (strains H37Rv and CDC1551) e.g. O53687|Y282_MYCTU|Rv0282|MT0295|MTV035.10 HYPOTHETICAL 68.1 KDA PROTEIN (631 aa), FASTA scores: opt: 936, E(): 5.8e-44, (39.05% identity in 568 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218385.1" /db_xref="GI:15611004" /db_xref="GeneID:886199" /translation="MTDRLASLFESAVSMLPMSEARSLDLFTEITNYDESACDAWIGR IRCGDTDRVTLFRAWYSRRNFGQLSGSVQISMSTLNARIAIGGLYGDITYPVTSPLAI TMGFAACEAAQGNYADAMEALEAAPVAGSEHLVAWMKAVVYGAAERWTDVIDQVKSAG KWPDKFLAGAAGVAHGVAAANLALFTEAERRLTEANDSPAGEACARAIAWYLAMARRS QGNESAAVALLEWLQTTHPEPKVAAALKDPSYRLKTTTAEQIASRADPWDPGSVVTDN SGRERLLAEAQAELDRQIGLTRVKNQIERYRAATLMARVRAAKGMKVAQPSKHMIFTG PPGTGKTTIARVVANILAGLGVIAEPKLVETSRKDFVAEYEGQSAVKTAKTIDQALGG VLFIDEAYALVQERDGRTDPFGQEALDTLLARMENDRDRLVVIIAGYSSDIDRLLETN EGLRSRFATRIEFDTYSPEELLEIANVIAAADDSALTAEAAENFLQAAKQLEQRMLRG RRALDVAGNGRYARQLVEASEQCRDMRLAQVLDIDTLDEDRLREINGSDMAEAIAAVH AHLNMRE" misc_feature 4344313..4344336 /locus_tag="Rv3868" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 4345039..4346481 /locus_tag="Rv3869" /db_xref="GeneID:886166" CDS 4345039..4346481 /locus_tag="Rv3869" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3869, (MTV027.04), len: 480 aa. Possible conserved membrane protein (has hydrophobic stretch near N-terminus), equivalent to O33088|ML0054|MLCB628.17c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (481 aa), FASTA scores: opt: 2489, E(): 8.3e-136, (75.75% identity in 478 aa overlap); and similar to others e.g. Q9Z5I3|ML1544|MLCB596.27 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (506 aa), FASTA scores: opt: 739, E(): 3.9e-35, (33.65% identity in 490 aa overlap). Also similar to hypothetical proteins from Mycobacterium tuberculosis e.g. O05449|Rv3895c|MTCY15F10.17 (495 aa), FASTA scores: opt: 795, E(): 2.3e-38, (35.8% identity in 486 aa overlap); O53933|Rv1782|MTV049.04 (506 aa), FASTA scores: opt: 763, E(): 1.6e-36, (34.7% identity in 490 aa overlap); O06317|Rv3450c|MTCY13E12.03c (470 aa) FASTA scores: opt: 717, E(): 6.7e-34, (32.55% identity in 479 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218386.1" /db_xref="GI:15611005" /db_xref="GeneID:886166" /translation="MGLRLTTKVQVSGWRFLLRRLEHAIVRRDTRMFDDPLQFYSRSI ALGIVVAVLILAGAALLAYFKPQGKLGGTSLFTDRATNQLYVLLSGQLHPVYNLTSAR LVLGNPANPATVKSSELSKLPMGQTVGIPGAPYATPVSAGSTSIWTLCDTVARADSTS PVVQTAVIAMPLEIDASIDPLQSHEAVLVSYQGETWIVTTKGRHAIDLTDRALTSSMG IPVTARPTPISEGMFNALPDMGPWQLPPIPAAGAPNSLGLPDDLVIGSVFQIHTDKGP QYYVVLPDGIAQVNATTAAALRATQAHGLVAPPAMVPSLVVRIAERVYPSPLPDEPLK IVSRPQDPALCWSWQRSAGDQSPQSTVLSGRHLPISPSAMNMGIKQIHGTATVYLDGG KFVALQSPDPRYTESMYYIDPQGVRYGVPNAETAKSLGLSSPQNAPWEIVRLLVDGPV LSKDAALLEHDTLPADPSPRKVPAGASGAP" gene 4346481..4348724 /locus_tag="Rv3870" /db_xref="GeneID:886204" CDS 4346481..4348724 /locus_tag="Rv3870" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3870, (MTV027.05), len: 747 aa. Possible conserved transmembrane protein, equivalent to O33087|ML0053|MLCB628.16c PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (744 aa), FASTA scores: opt: 4333, E(): 0, (85.4% identity in 746 aa overlap); and similar to N-terminal end of others e.g. Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 1003, E(): 1e-52, (33.65% identity in 725 aa overlap); O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 1078, E(): 3e-57, (35.4% identity in 774 aa overlap); P71068|YUKA YUKA PROTEIN from Bacillus subtilis (1207 aa) FASTA scores: opt: 529, E(): 4.3e-24, (26.1% identity in 636 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 455, E(): 1.5e-19, (27.1% identity in 734 aa overlap); etc. Also similar to N-terminal end of hypothetical proteins from Mycobacterium tuberculosis e.g. O53689|Rv0284|MTV035.12 (1330 aa), FASTA scores: opt: 982, E(): 1.9e-51, (33.8% identity in 719 aa overlap); O06264|Rv3447c|MTCY77.19c (1236 aa), FASTA scores: opt: 761, E(): 4.1e-38, (38.2% identity in 746 aa overlap); O53935|Rv1784|MTV049.06 (932 aa), FASTA scores: opt: 547, E(): 2.8e-25, (36.25% identity in 276 aa overlap). Contains PS00017 ATP/GTP-binding site motif A (P-loop). Note some similarity (with hypothetical proteins from Mycobacterium tuberculosis and P71068|YUKA) continues in downstream ORF MTV027.06." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218387.1" /db_xref="GI:15611006" /db_xref="GeneID:886204" /translation="MTTKKFTPTITRGPRLTPGEISLTPPDDLGIDIPPSGVQKILPY VMGGAMLGMIAIMVAGGTRQLSPYMLMMPLMMIVMMVGGLAGSTGGGGKKVPEINADR KEYLRYLAGLRTRVTSSATSQVAFFSYHAPHPEDLLSIVGTQRQWSRPANADFYAATR IGIGDQPAVDRLLKPAVGGELAAASAAPQPFLEPVSHMWVVKFLRTHGLIHDCPKLLQ LRTFPTIAIGGDLAGAAGLMTAMICHLAVFHPPDLLQIRVLTEEPDDPDWSWLKWLPH VQHQTETDAAGSTRLIFTRQEGLSDLAARGPHAPDSLPGGPYVVVVDLTGGKAGFPPD GRAGVTVITLGNHRGSAYRIRVHEDGTADDRLPNQSFRQVTSVTDRMSPQQASRIARK LAGWSITGTILDKTSRVQKKVATDWHQLVGAQSVEEITPSRWRMYTDTDRDRLKIPFG HELKTGNVMYLDIKEGAEFGAGPHGMLIGTTGSGKSEFLRTLILSLVAMTHPDQVNLL LTDFKGGSTFLGMEKLPHTAAVVTNMAEEAELVSRMGEVLTGELDRRQSILRQAGMKV GAAGALSGVAEYEKYRERGADLPPLPTLFVVVDEFAELLQSHPDFIGLFDRICRVGRS LRVHLLLATQSLQTGGVRIDKLEPNLTYRIALRTTSSHESKAVIGTPEAQYITNKESG VGFLRVGMEDPVKFSTFYISGPYMPPAAGVETNGEAGGPGQQTTRQAARIHRFTAAPV LEEAPTP" misc_feature 4347915..4347938 /locus_tag="Rv3870" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" repeat_region 4348721..4348773 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" repeat_region 4348774..4348826 /note="53 bp Mycobacterial Interspersed Repetitive Unit, Class II" gene 4348827..4350602 /locus_tag="Rv3871" /db_xref="GeneID:886202" CDS 4348827..4350602 /locus_tag="Rv3871" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3871, (MTV027.06), len: 591 aa. Conserved hypothetical protein, equivalent to Q9CDD7|ML0052 HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa) FASTA scores: opt: 3341, E(): 9.8e-192, (80.85% identity in 596 aa overlap); and O33086|MLCB628.15c HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 3329, E(): 5.1e-191, (80.55% identity in 596 aa overlap). And similar to C-terminal end of others e.g. Q9Z5I2|ML1543|MLCB596.28 POSSIBLE SPOIIIE-FAMILY MEMBRANE PROTEIN from Mycobacterium leprae (1345 aa), FASTA scores: opt: 601, E(): 5.6e-28, (32.3% identity in 613 aa overlap); O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 977, E(): 2.1e-50, (35.15% identity in 583 aa overlap); Q9L0T6|SCD35.15c PUTATIVE CELL DIVISION-RELATED PROTEIN from Streptomyces coelicolor (1525 aa), FASTA scores: opt: 414, E(): 9e-17, (27.6% identity in 424 aa overlap);P71068|YUKA YUKA PROTEIN from Bacillus subtilis (1207 aa), FASTA scores: opt: 343, E(): 1.3e-12, (25.8% identity in 395 aa overlap); etc. And similar to to C-terminal end of hypothetical proteins from Mycobacterium tuberculosis e.g. O06264|Rv3447c|MTCY77.19c (1236 aa) FASTA scores: opt: 845, E(): 1.5e-42, (35.3% identity in 586 aa overlap); O53689|Rv0284|MTV035.12 (1330 aa) FASTA scores: opt: 646, E(): 1.2e-30, (33.35% identity in 606 aa overlap); O53935|Rv1784|MTV049.06 (932 aa) FASTA scores: opt: 589, E(): 2.1e-27, (33.1% identity in 619 aa overlap); etc. Contains 2 X PS00017 ATP/GTP-binding site motif A (P-loop). Note some similarity (with hypothetical proteins from Mycobacterium tuberculosis and P71068|YUKA) continues in upstream ORF MTV027.05." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218388.1" /db_xref="GI:15611007" /db_xref="GeneID:886202" /translation="MTAEPEVRTLREVVLDQLGTAESRAYKMWLPPLTNPVPLNELIA RDRRQPLRFALGIMDEPRRHLQDVWGVDVSGAGGNIGIGGAPQTGKSTLLQTMVMSAA ATHSPRNVQFYCIDLGGGGLIYLENLPHVGGVANRSEPDKVNRVVAEMQAVMRQRETT FKEHRVGSIGMYRQLRDDPSQPVASDPYGDVFLIIDGWPGFVGEFPDLEGQVQDLAAQ GLAFGVHVIISTPRWTELKSRVRDYLGTKIEFRLGDVNETQIDRITREIPANRPGRAV SMEKHHLMIGVPRFDGVHSADNLVEAITAGVTQIASQHTEQAPPVRVLPERIHLHELD PNPPGPESDYRTRWEIPIGLRETDLTPAHCHMHTNPHLLIFGAAKSGKTTIAHAIARA ICARNSPQQVRFMLADYRSGLLDAVPDTHLLGAGAINRNSASLDEAVQALAVNLKKRL PPTDLTTAQLRSRSWWSGFDVVLLVDDWHMIVGAAGGMPPMAPLAPLLPAAADIGLHI IVTCQMSQAYKATMDKFVGAAFGSGAPTMFLSGEKQEFPSSEFKVKRRPPGQAFLVSP DGKEVIQAPYIEPPEEVFAAPPSAG" misc_feature 4349076..4349099 /locus_tag="Rv3871" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature 4349952..4349975 /locus_tag="Rv3871" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene 4350745..4351044 /gene="PE35" /locus_tag="Rv3872" /db_xref="GeneID:886191" CDS 4350745..4351044 /gene="PE35" /locus_tag="Rv3872" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3872, (MTV027.07), len: 99 aa. Some similarity to Mycobacterium tuberculosis conserved PE family proteins (see Brennan & Delogu 2002), e.g. O69713|Rv3746c|MTV025.094c (111 aa), FASTA scores: opt: 306, E(): 5.5e-13, (50.5% identity in 99 aa overlap). Equivalent to AAK48354 from Mycobacterium tuberculosis strain CDC1551 (112 aa) but shorter 14 aa." /codon_start=1 /transl_table=11 /product="PE family-related protein" /protein_id="YP_178021.1" /db_xref="GI:57117163" /db_xref="GeneID:886191" /translation="MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGAD EVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE" gene 4351075..4352181 /gene="PPE68" /locus_tag="Rv3873" /db_xref="GeneID:886201" CDS 4351075..4352181 /gene="PPE68" /locus_tag="Rv3873" /function="UNKNOWN" /note="Rv3873, (MTV027.08), len: 368 aa. Member of the Mycobacterium tuberculosis PPE family, highly similar to many e.g. O33085|ML0051|MLCB628.14c from Mycobacterium leprae (302 aa), FASTA scores: opt: 656, E(): 2.8e-24, (46.2% identity in 288 aa overlap); and O53691|Rv0286|MTV035.14 (513 aa), FASTA scores: opt: 566, E(): 7.8e-20, (35.25% identity in 363 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_178022.1" /db_xref="GI:57117164" /db_xref="GeneID:886201" /translation="MLWHAMPPELNTARLMAGAGPAPMLAAAAGWQTLSAALDAQAVE LTARLNSLGEAWTGGGSDKALAAATPMVVWLQTASTQAKTRAMQATAQAAAYTQAMAT TPSLPEIAANHITQAVLTATNFFGINTIPIALTEMDYFIRMWNQAALAMEVYQAETAV NTLFEKLEPMASILDPGASQSTTNPIFGMPSPGSSTPVGQLPPAATQTLGQLGEMSGP MQQLTQPLQQVTSLFSQVGGTGGGNPADEEAAQMGLLGTSPLSNHPLAGGSGPSAGAG LLRAESLPGAGGSLTRTPLMSQLIEKPVAPSVMPAAAAGSSATGGAAPVGAGAMGQGA QSGGSTRPGLVAPAPLAQEREEDDEDDWDEEDDW" gene 4352274..4352576 /gene="esxB" /locus_tag="Rv3874" /db_xref="GeneID:886194" CDS 4352274..4352576 /gene="esxB" /locus_tag="Rv3874" /function="UNKNOWN. EXPORTED PROTEIN COTRANSCRIBED WITH Rv3875|MT3989|MTV027.10." /experiment="experimental evidence, no additional details recorded" /note="Rv3874, (MT3988, MTV027.09), len: 100 aa. esxB, 10 KDA culture filtrate antigen (see citations below, especially first), highly similar to O33084|CF10_MYCLE|ML0050|MLCB628.13c 10 KDA CULTURE FILTRATE ANTIGEN CFP10 HOMOLOG from Mycobacterium leprae (99 aa), FASTA scores: opt: 237, E(): 2.4e-08, (39.4% identity in 99 aa overlap). Also similar to O05440|ES6D_MYCTU|Rv3905c|MT4024|MTCY15F10.06 PUTATIVE ESAT-6 LIKE PROTEIN 13 from Mycobacterium tuberculosis (103 aa) FASTA scores: opt: 126, E(): 0.18, (23.1% identity in 91 aa overlap); and shows some similarity with other proteins from Mycobacterium tuberculosis. Contains probable coiled-coil from aa 49-93. BELONGS TO THE ESAT6 FAMILY. Note that previously known as lhp (alternate gene name: cfp10).; lhp, cfp10" /codon_start=1 /transl_table=11 /product="10 kDa culture filtrate antigen EsxB" /protein_id="NP_218391.1" /db_xref="GI:15611010" /db_xref="GeneID:886194" /translation="MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWR GAAGTAAQAAVVRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF" gene 4352609..4352896 /gene="esxA" /locus_tag="Rv3875" /db_xref="GeneID:886209" CDS 4352609..4352896 /gene="esxA" /locus_tag="Rv3875" /function="NOT KNOWN. ELICITS HIGH LEVEL OF INF-GAMMA FROM MEMORY EFFECTOR CELLS DURING THE FIRST PHASE OF A PROTECTIVE IMMUNE RESPONSE. EXPORTED PROTEIN COTRANSCRIBED WITH Rv3874|MT3988|MTV027.09|LHP|CFP10." /experiment="experimental evidence, no additional details recorded" /note="Rv3875, (MT3989, MTV027.10), len: 95 aa. esxA, early secretory antigenic target (see citations below), identical to Q57165|O84901|ESAT6 EARLY SECRETORY ANTIGENIC TARGET from Mycobacterium bovis (94 aa), FASTA scores: opt: 596, E(): 4.6e-33, (100.0% identity in 94 aa overlap). Also similar to Q50206|ESA6_MYCLE|ESAT6|ESX|L45|ML0049|MLCB628.12c 6 KDA EARLY SECRETORY ANTIGENIC TARGET HOMOLOG (ESAT-6-LIKE PROTEIN) (L-ESAT) from Mycobacterium leprae (95 aa), FASTA scores: opt: 236, E(): 3.3e-09, (36.25% identity in 91 aa overlap); and weak similarity with others proteins ESAT-like from Mycobacterium leprae. Also some similarity with O53266|ES69_MYCTU|Rv3019c|MT3104|MTV012.33c PUTATIVE SECRETED ESAT-6 LIKE PROTEIN 9 from Mycobacterium tuberculosis (96 aa), FASTA scores: opt: 131, E(): 0.03, (26.15% identity in 88 aa overlap); and other ESAT-like protein. Contains probable coiled-coil from 56 to 92 aa. BELONGS TO THE ESAT6 FAMILY. Note that previously known as esat-6.; esat-6" /codon_start=1 /transl_table=11 /product="6 kDa early secretory antigenic target ESXA (ESAT-6)" /protein_id="YP_178023.1" /db_xref="GI:57117165" /db_xref="GeneID:886209" /translation="MTEQQWNFAGIEAAASAIQGNVTSIHSLLDEGKQSLTKLAAAWG GSGSEAYQGVQQKWDATATELNNALQNLARTISEAGQAMASTEGNVTGMFA" gene 4353010..4355010 /locus_tag="Rv3876" /db_xref="GeneID:886206" CDS 4353010..4355010 /locus_tag="Rv3876" /function="UNKNOWN" /note="Rv3876, (MTV027.11), len: 666 aa. Conserved hypothetical pro-, ala-rich protein, similar to several proteins from Mycobacterium leprae e.g. Q9CDD8|ML0048 HYPOTHETICAL PROTEIN (586 aa), FASTA scores: opt: 1682, E(): 2.1e-45, (50.75% identity in 672 aa overlap); O33082|MLCB628.11c HYPOTHETICAL 52.0 KDA PROTEIN (478 aa), FASTA scores: opt: 1588, E(): 1.5e-42, (53.5% identity in 542 aa overlap) (also has a proline rich N-terminus); etc. Also similar to other proteins from Mycobacterium tuberculosis, especially in C-terminus, e.g. O06396|Rv0530|MTCY25D10.09 (405 aa), FASTA scores: opt: 670, E(): 2.5e-14, (34.85% identity in 396 aa overlap) (also has Pro-rich N-terminus); etc. Note that N-terminus is repetitive and highly Proline rich." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218393.1" /db_xref="GI:15611012" /db_xref="GeneID:886206" /translation="MAADYDKLFRPHEGMEAPDDMAAQPFFDPSASFPPAPASANLPK PNGQTPPPTSDDLSERFVSAPPPPPPPPPPPPPTPMPIAAGEPPSPEPAASKPPTPPM PIAGPEPAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQLAPPRPPTPQT PTGAPQQPESPAPHVPSHGPHQPRRTAPAPPWAKMPIGEPPPAPSRPSASPAEPPTRP APQHSRRARRGHRYRTDTERNVGKVATGPSIQARLRAEEASGAQLAPGTEPSPAPLGQ PRSYLAPPTRPAPTEPPPSPSPQRNSGRRAERRVHPDLAAQHAAAQPDSITAATTGGR RRKRAAPDLDATQKSLRPAAKGPKVKKVKPQKPKATKPPKVVSQRGWRHWVHALTRIN LGLSPDEKYELDLHARVRRNPRGSYQIAVVGLKGGAGKTTLTAALGSTLAQVRADRIL ALDADPGAGNLADRVGRQSGATIADVLAEKELSHYNDIRAHTSVNAVNLEVLPAPEYS SAQRALSDADWHFIADPASRFYNLVLADCGAGFFDPLTRGVLSTVSGVVVVASVSIDG AQQASVALDWLRNNGYQDLASRACVVINHIMPGEPNVAVKDLVRHFEQQVQPGRVVVM PWDRHIAAGTEISLDLLDPIYKRKVLELAAALSDDFERAGRR" repeat_region 4353280..4353330 /note="51 bp imperfect direct repeat 1, GAACCGGCCGCATCTAAACCACCCACACCCCCCATGCCCATCGCCGGACCC" repeat_region 4353331..4353381 /note="51 bp imperfect direct repeat 2, GAACCGGCCCCACCCAAACCACCCACACCCCCCATGCCCATCGCCGGACCC" repeat_region 4353382..4353432 /note="51 bp imperfect direct repeat 3, GAACCGGCCCCACCCAAACCACCCACACCTCCGATGCCCATCGCCGGACCT" gene 4355007..4356542 /locus_tag="Rv3877" /db_xref="GeneID:886207" CDS 4355007..4356542 /locus_tag="Rv3877" /function="UNKNOWN" /note="Rv3877, (MTV027.12), len: 511 aa. Probable conserved transmembrane protein, equivalent to Q9CDD9|ML0047 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (512 aa), FASTA scores: opt: 2496, E(): 2.8e-140, (74.0% identity in 512 aa overlap); and highly similar, but longer 32 aa, to O33081|MLCB628.10c HYPOTHETICAL 51.4 KDA PROTEIN from Mycobacterium leprae (480 aa), FASTA scores: opt: 2346, E(): 2e-131, (74.15% identity in 480 aa overlap). Shows also similarity with other membrane proteins from Mycobacterium leprae e.g. Q9CBV2|ML1539 PROBABLE MEMBRANE PROTEIN (503 aa), FASTA scores: opt: 318, E(): 2e-11, (22.7% identity in 520 aa overlap). Also similar to various proteins from Mycobacterium tuberculosis e.g. O53944|Rv1795|MTV049.17 PUTATIVE MEMBRANE PROTEIN (503 aa), FASTA scores: opt: 391, E(): 9.4e-16, (24.45% identity in 523 aa overlap); O86362|Rv0290|MTV035.18 HYPOTHETICAL 47.9 KDA PROTEIN (472 aa), FASTA scores: opt: 332, E(): 2.8e-12, (28.1% identity in 509 aa overlap); O05457|Rv3887c|MTCY15F10.25 HYPOTHETICAL 53.2 KDA PROTEIN (509 aa), FASTA scores: opt: 167, E(): 0.017, (24.0% identity in 517 aa overlap); etc. Equivalent to AAK48359 from Mycobacterium tuberculosis strain CDC1551 (479 aa) but longer 32 aa." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218394.1" /db_xref="GI:15611013" /db_xref="GeneID:886207" /translation="MSAPAVAAGPTAAGATAARPATTRVTILTGRRMTDLVLPAAVPM ETYIDDTVAVLSEVLEDTPADVLGGFDFTAQGVWAFARPGSPPLKLDQSLDDAGVVDG SLLTLVSVSRTERYRPLVEDVIDAIAVLDESPEFDRTALNRFVGAAIPLLTAPVIGMA MRAWWETGRSLWWPLAIGILGIAVLVGSFVANRFYQSGHLAECLLVTTYLLIATAAAL AVPLPRGVNSLGAPQVAGAATAVLFLTLMTRGGPRKRHELASFAVITAIAVIAAAAAF GYGYQDWVPAGGIAFGLFIVTNAAKLTVAVARIALPPIPVPGETVDNEELLDPVATPE ATSEETPTWQAIIASVPASAVRLTERSKLAKQLLIGYVTSGTLILAAGAIAVVVRGHF FVHSLVVAGLITTVCGFRSRLYAERWCAWALLAATVAIPTGLTAKLIIWYPHYAWLLL SVYLTVALVALVVVGSMAHVRRVSPVVKRTLELIDGAMIAAIIPMLLWITGVYDTVRN IRF" gene 4356693..4357535 /locus_tag="Rv3878" /db_xref="GeneID:886198" CDS 4356693..4357535 /locus_tag="Rv3878" /function="UNKNOWN" /note="Rv3878, (MTV027.13), len: 280 aa. Hypothetical unknown ala-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218395.1" /db_xref="GI:15611014" /db_xref="GeneID:886198" /translation="MAEPLAVDPTGLSAAAAKLAGLVFPQPPAPIAVSGTDSVVAAIN ETMPSIESLVSDGLPGVKAALTRTASNMNAAADVYAKTDQSLGTSLSQYAFGSSGEGL AGVASVGGQPSQATQLLSTPVSQVTTQLGETAAELAPRVVATVPQLVQLAPHAVQMSQ NASPIAQTISQTAQQAAQSAQGGSGPMPAQLASAEKPATEQAEPVHEVTNDDQGDQGD VQPAEVVAAARDEGAGASPGQQPGGGVPAQAMDTGAGARPAASPLAAPVDPSTPAPST TTTL" gene complement(4357593..4359782) /locus_tag="Rv3879c" /db_xref="GeneID:886212" CDS complement(4357593..4359782) /locus_tag="Rv3879c" /function="UNKNOWN" /note="Rv3879c, (MTV027.14c), len: 729 aa. Hypothetical unknown ala-, pro-rich protein (N-terminal end is repetitive and highly Proline-rich)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218396.1" /db_xref="GI:15611015" /db_xref="GeneID:886212" /translation="MSITRPTGSYARQMLDPGGWVEADEDTFYDRAQEYSQVLQRVTD VLDTCRQQKGHVFEGGLWSGGAANAANGALGANINQLMTLQDYLATVITWHRHIAGLI EQAKSDIGNNVDGAQREIDILENDPSLDADERHTAINSLVTATHGANVSLVAETAERV LESKNWKPPKNALEDLLQQKSPPPPDVPTLVVPSPGTPGTPGTPITPGTPITPGTPIT PIPGAPVTPITPTPGTPVTPVTPGKPVTPVTPVKPGTPGEPTPITPVTPPVAPATPAT PATPVTPAPAPHPQPAPAPAPSPGPQPVTPATPGPSGPATPGTPGGEPAPHVKPAALA EQPGVPGQHAGGGTQSGPAHADESAASVTPAAASGVPGARAAAAAPSGTAVGAGARSS VGTAAASGAGSHAATGRAPVATSDKAAAPSTRAASARTAPPARPPSTDHIDKPDRSES ADDGTPVSMIPVSAARAARDAATAAASARQRGRGDALRLARRIAAALNASDNNAGDYG FFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHAIPVDEIARCATYPVLA VQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVLEPDDIPESGKMTGRSRLEVVDPSA AAQLADTTDQRLLDLLPPAPVDVNPPGDERHMLWFELMKPMTSTATGREAAHLRAFRA YAAHSQEIALHQAHTATDAAVQRVAVADWLYWQYVTGLLDRALAAAC" gene complement(4360199..4360546) /locus_tag="Rv3880c" /db_xref="GeneID:886205" CDS complement(4360199..4360546) /locus_tag="Rv3880c" /function="UNKNOWN" /note="Rv3880c, (MTV027.15c), len: 115 aa. Conserved hypothetical protein, equivalent to O33080|ML0044|MLCB628.09 HYPOTHETICAL 12.2 KDA PROTEIN from Mycobacterium leprae (113 aa), FASTA scores: opt: 397, E(): 2e-19, (56.35% identity in 110 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218397.1" /db_xref="GI:15611016" /db_xref="GeneID:886205" /translation="MSMDELDPHVARALTLAARFQSALDGTLNQMNNGSFRATDEAET VEVTINGHQWLTGLRIEDGLLKKLGAEAVAQRVNEALHNAQAAASAYNDAAGEQLTAA LSAMSRAMNEGMA" gene complement(4360543..4361925) /locus_tag="Rv3881c" /db_xref="GeneID:886214" CDS complement(4360543..4361925) /locus_tag="Rv3881c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3881c, (MTV027.16c), len: 460 aa. Conserved hypothetical ala-, gly-rich protein. C-terminal end highly similar to O06126 HYPOTHETICAL 9.5 KDA PROTEIN (FRAGMENT) from Mycobacterium tuberculosis strain NTI 64719 (90 aa) FASTA scores: opt: 333, E(): 6.3e-07, (69.75% identity in 86 aa overlap) but sequence difference causes frameshift in NTI 64719. Also similar to part of small Mycobacterium leprae ORF O33078|MLCB628.06 (EMBL:Y14967) (101 aa), FASTA scores: opt: 194, E(): 0.04, (59.3% identity in 54 aa overlap), suggesting this is represented by a pseudogene in Mycobacterium leprae." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218398.1" /db_xref="GI:15611017" /db_xref="GeneID:886214" /translation="MTQSQTVTVDQQEILNRANEVEAPMADPPTDVPITPCELTAAKN AAQQLVLSADNMREYLAAGAKERQRLATSLRNAAKAYGEVDEEAATALDNDGEGTVQA ESAGAVGGDSSAELTDTPRVATAGEPNFMDLKEAARKLETGDQGASLAHFADGWNTFN LTLQGDVKRFRGFDNWEGDAATACEASLDQQRQWILHMAKLSAAMAKQAQYVAQLHVW ARREHPTYEDIVGLERLYAENPSARDQILPVYAEYQQRSEKVLTEYNNKAALEPVNPP KPPPAIKIDPPPPPQEQGLIPGFLMPPSDGSGVTPGTGMPAAPMVPPTGSPGGGLPAD TAAQLTSAGREAAALSGDVAVKAASLGGGGGGGVPSAPLGSAIGGAESVRPAGAGDIA GLGQGRAGGGAALGGGGMGMPMGAAHQGQGGAKSKGSQQEDEALYTEDRAWTEAVIGN RRRQDSKESK" gene complement(4362032..4363420) /locus_tag="Rv3882c" /db_xref="GeneID:886208" CDS complement(4362032..4363420) /locus_tag="Rv3882c" /function="UNKNOWN" /note="Rv3882c, (MTV027.17c, MTCY15F10.30), len: 462 aa. Possible conserved membrane protein, equivalent to O33077|ML0042|MLCB628.05 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (467 aa), FASTA scores: opt: 2346, E(): 1.1e-140, (72.1% identity in 462 aa overlap). Also similar to O05459|Rv3885c|MTCY15F10.27 POSSIBLE MEMBRANE PROTEIN from Mycobacterium tuberculosis (537 aa) FASTA scores: opt: 283, E(): 2.5e-10, (26.8% identity in 414 aa overlap); and C-terminal end shows similarity with AAK48368|MT4000 HYPOTHETICAL 45.6 KDA PROTEIN from Mycobacterium tuberculosis strain CDC1551 (422 aa) FASTA scores: opt: 215, E(): 4.1e-06, (26.85% identity in 320 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218399.1" /db_xref="GI:15611018" /db_xref="GeneID:886208" /translation="MRNPLGLRFSTGHALLASALAPPCIIAFLETRYWWAGIALASLG VIVATVTFYGRRITGWVAAVYAWLRRRRRPPDSSSEPVVGATVKPGDHVAVRWQGEFL VAVIELIPRPFTPTVIVDGQAHTDDMLDTGLVEELLSVHCPDLEADIVSAGYRVGNTA APDVVSLYQQVIGTDPAPANRRTWIVLRADPERTRKSAQRRDEGVAGLARYLVASATR IADRLASHGVDAVCGRSFDDYDHATDIGFVREKWSMIKGRDAYTAAYAAPGGPDVWWS ARADHTITRVRVAPGMAPQSTVLLTTADKPKTPRGFARLFGGQRPALQGQHLVANRHC QLPIGSAGVLVGETVNRCPVYMPFDDVDIALNLGDAQTFTQFVVRAAAAGAMVTVGPQ FEEFARLIGAHIGQEVKVAWPNATTYLGPHPGIDRVILRHNVIGTPRHRQLPIRRVSP PEESRYQMALPK" gene complement(4363417..4364757) /gene="mycP1" /locus_tag="Rv3883c" /db_xref="GeneID:886217" CDS complement(4363417..4364757) /gene="mycP1" /locus_tag="Rv3883c" /EC_number="3.4.21.-" /function="THOUGHT TO HAVE PROTEOLYTIC ACTIVITY. EXPRESSED DURING INFECTION OF MACROPHAGES." /experiment="experimental evidence, no additional details recorded" /note="Rv3883c, (MTCY15F10.29), len: 446 aa. mycP1, membrane-anchored serine protease (mycosin) (EC 3.4.21.-) (see citations below), equivalent to O33076|ML0041|MLCB628.04 PROBABLE SECRETED PROTEASE from Mycobacterium leprae (446 aa), FASTA scores: opt: 2448, E(): 1.5e-124, (79.15% identity in 446 aa overlap); and highly similar, but in part, to several putative proteases from Mycobacterium leprae; Q9CBV3|ML1538 (567 aa) FASTA scores: opt: 902, E(): 3e-41, (37.25% identity in 556 aa overlap); and Q9CD36|ML2528 (475 aa), FASTA scores: opt: 873, E(): 9.4e-40, (42.7% identity in 459 aa overlap). Shows also similarity with several proteases from other organisms e.g. Q9PCD0|XF1851 SERINE PROTEASE from Xylella fastidiosa (1000 aa), FASTA scores: opt: 281, E(): 1.3e-07, (27.95% identity in 422 aa overlap); P42780|BPRX_BACNO EXTRACELLULAR SUBTILISIN-LIKE PROTEASE PRECURSOR (EC 3.4.21.-) from Bacteroides nodosus (Dichelobacter nodosus) (595 aa), FASTA scores: opt: 270, E(): 3.2e-07, (28.9% identity in 384 aa overlap); Q46541|APRV5 ACIDIC PROTEASE V5 from Bacteroides nodosus (Dichelobacter nodosus) (595 aa), FASTA scores: opt: 264, E(): 6.8e-07, (28.65% identity in 384 aa overlap); etc. Also highly similar to various proteins from Mycobacterium tuberculosis e.g. O53695|Rv0291|MTV035.19 PROBABLE MEMBRANE-ANCHORED MYCOSIN MYCP3 (461 aa), FASTA scores: opt: 1168, E(): 1.2e-55, (44.6% identity in 453 aa overlap); O53945|Rv1796|MTV049.18 PROBABLE MEMBRANE-ANCHORED MYCOSIN MYCP5 (585 aa), FASTA scores: opt: 928, E(): 1.2e-42, (37.85% identity in 555 aa overlap) (note gap from aa 155-264); and downstream ORF O05458|Rv3886c|MTCY15F10.26 PROBABLE MEMBRANE-ANCHORED MYCOSIN MYCP2 (550 aa), FASTA scores: opt: 910, E(): 1.1e-41, (40.15% identity in 533 aa overlap) (note partial gap from aa 146-234); etc. Equivalent to AAK48366 from Mycobacterium tuberculosis strain CDC1551 (411 aa) but longer 35 aa. Has signal sequence with possible signal peptidase I cleavage site in residues 19-21 (ASA) and hydrophobic stretch at C-terminus, followed by short positively charged segment, that seems to act as a membrane anchor. ACTIVATED BY Ca2+ (see Dave et al., 2002). Contains three serine protease, subtilase family active site motifs: a aspartic acid active site motif (PS00136); a histidine active site motif (PS00137); and a serine active site motif (PS00138). BELONGS TO PEPTIDASE FAMILY S8 (ALSO KNOWN AS THE SUBTILASE FAMILY), PYROLYSIN SUBFAMILY." /codon_start=1 /transl_table=11 /product="membrane-anchored mycosin MYCP1 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-1)" /protein_id="NP_218400.1" /db_xref="GI:15611019" /db_xref="GeneID:886217" /translation="MHRIFLITVALALLTASPASAITPPPIDPGALPPDVTGPDQPTE QRVLCASPTTLPGSGFHDPPWSNTYLGVADAHKFATGAGVTVAVIDTGVDASPRVPAE PGGDFVDQAGNGLSDCDAHGTLTASIIAGRPAPTDGFVGVAPDARLLSLRQTSEAFEP VGSQANPNDPNATPAAGSIRSLARAVVHAANLGVGVINISEAACYKVSRPIDETSLGA SIDYAVNVKGVVVVVAAGNTGGDCVQNPAPDPSTPGDPRGWNNVQTVVTPAWYAPLVL SVGGIGQTGMPSSFSMHGPWVDVAAPAENIVALGDTGEPVNALQGREGPVPIAGTSFA AAYVSGLAALLRQRFPDLTPAQIIHRITATARHPGGGVDDLVGAGVIDAVAALTWDIP PGPASAPYNVRRLPPPVVEPGPDRRPITAVALVAVGLTLALGLGALARRALSRR" gene complement(4364979..4366838) /locus_tag="Rv3884c" /db_xref="GeneID:886210" CDS complement(4364979..4366838) /locus_tag="Rv3884c" /function="UNKNOWN" /note="Rv3884c, (MTCY15F10.28), len: 619 aa. Probable CBXX/CFQX protein family, similar to hypothetical proteins from Mycobacterium leprae e.g. Q9CD28|Y282_MYCLE|ML2537 (640 aa), FASTA scores: opt: 725, E(): 2.9e-34, (28.95% identity in 587 aa overlap); O33089|Y2G8_MYCLE|ML0055|MLCB628.18c (BELONGS TO THE CBXX/CFQX FAMILY) (573 aa); Q9CBV5|ML1536 (610 aa) FASTA scores: opt: 648, E(): 7.4e-30, (31.5% identity in 549 aa overlap). Also similar to proteins belonging to the CBXX/CFQX FAMILY e.g. Q9RKZ2|SC6D7.05c PUTATIVE CBXX/CFQX FAMILY PROTEIN from Streptomyces coelicolor (618 aa) FASTA scores: opt: 557, E(): 1.3e-24, (28.6% identity in 601 aa overlap); P27643|SP5K_BACSU|SPOVK|SPOVJ STAGE V SPORULATION PROTEIN K from Bacillus subtilis (322 aa) FASTA scores: opt: 485, E(): 1.1e-20, (35.0% identity in 280 aa overlap) (similarity only at C-terminus); Q9KAC6|BH2363 STAGE V SPORULATION PROTEIN K from Bacillus halodurans (315 aa), FASTA scores: opt: 462, E(): 2.2e-19, (36.05% identity in 244 aa overlap) (similarity only at C-terminus); etc. And similar to hypothetical proteins from Mycobacterium tuberculosis belonging to the CBXX/CFQX FAMILY e.g. O53687|Y282_MYCTU|Rv0282|MT0295|MTV035.10 HYPOTHETICAL 68.1 KDA PROTEIN (631 aa), FASTA scores: opt: 743, E(): 2.6e-35, (29.9% identity in 612 aa overlap); O69733|Y2G8_MYCTU|Rv3868|MT3981|MTV027.03 HYPOTHETICAL 62.4 KDA PROTEIN (573 aa), FASTA scores: opt: 678, E(): 1.3e-31, (31.25% identity in 589 aa overlap); O53947|YH98_MYCTU|Rv1798|MT1847|MTV049.20 (610 aa) FASTA scores: opt: 669, E(): 4.6e-31, (30.95% identity in 549 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). SEEMS TO BELONG TO THE CBXX/CFQX FAMILY." /codon_start=1 /transl_table=11 /product="CBXX/CFQX family protein" /protein_id="NP_218401.1" /db_xref="GI:15611020" /db_xref="GeneID:886210" /translation="MSRMVDTMGDLLTARRHFDRAMTIKNGQGCVAALPEFVAATEAD PSMADAWLGRIACGDRDLASLKQLNAHSEWLHRETTRIGRTLAAEVQLGPSIGITVTD ASQVGLALSSALTIAGEYAKADALLANRELLDSWRNYQWHQLARAFLMYVTQRWPDVL STAAEDLPPQAIVMPAVTASICALAAHAAAHLGQGRVALDWLDRVDVIGHSRSSERFG ADVLTAAIGPADIPLLVADLAYVRGMVYRQLHEEDKAQIWLSKATINGVLTDAAKEAL ADPNLRLIVTDERTIASRSDRWDASTAKSRDQLDDDNAAQRRGELLAEGRELLAKQVG LAAVKQAVSALEDQLEVRMMRLEHGLPVEGQTNHMLLVGPPGTGKTTTAEALGKIYAG MGIVRHPEIREVRRSDFCGHYIGESGPKTNELIEKSLGRIIFMDEFYSLIERHQDGTP DMIGMEAVNQLLVQLETHRFDFCFIGAGYEDQVDEFLTVNPGLAGRFNRKLRFESYSP VEIVEIGHRYATPRASQLDDAAREVFLDAVTTIRNYTTPSGQHGIDAMQNGRFARNVI ERAEGFRDTRVVAQKRAGQPVSVQDLQIITATDIDAAIRSVCSDNRDMAAIVW" misc_feature complement(4365699..4365722) /locus_tag="Rv3884c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(4366908..4368521) /locus_tag="Rv3885c" /db_xref="GeneID:886220" CDS complement(4366908..4368521) /locus_tag="Rv3885c" /function="UNKNOWN" /note="Rv3885c, (MTCY15F10.27), len: 537 aa. Possible conserved membrane protein (has hydrophobic stretch near N-terminus), showing some similarity with O05462|Rv3882c|MTV027.17c|MTCY15F10.30 POSSIBLE MEMBRANE PROTEIN from Mycobacterium tuberculosis (462 aa) FASTA scores: opt: 283, E(): 8.3e-10, (26.55% identity in 414 aa overlap); and O33077|ML0042|MLCB628.05 PUTATIVE MEMBRANE PROTEIN from Mycobacterium leprae (467 aa), FASTA scores: opt: 260, E(): 2.1e-08, (28.0% identity in 382 aa overlap). Equivalent to AAK48368 from Mycobacterium tuberculosis strain CDC1551 (422 aa) but longer 115 aa." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218402.1" /db_xref="GI:15611021" /db_xref="GeneID:886220" /translation="MTSKLTGFSPRSARRVAGVWTVFVLASAGWALGGQLGAVMAVVV GVALVFVQWWGQPAWSWAVLGLRGRRPVKWNDPITLANNRSGGGVRVQDGVAVVAVQL LGRAHRATTVTGSVTVESDNVIDVVELAPLLRHPLDLELDSISVVTFGSRTGTVGDYP RVYDAEIGTPPYAGRRETWLIMRLPVIGNTQALRWRTSVGAAAISVAQRVASSLRCQG LRAKLATATDLAELDRRLGSDAVAGSAQRWKAIRGEAGWMTTYAYPAEAISSRVLSQA WTLRADEVIQNVTVYPDATCTATITVRTPTPAPTPPSVILRRLNGEQAAAAAANMCGP RPHLRGQRRCPLPAQLVTEIGPSGVLIGKLSNGDRLMIPVTDAGELSRVFVAADDTIA KRIVIRVVGAGERVCVHTRDQERWASVRMPQLSIVGTPRPAPRTTVGVVEYVRRRKNG DDGKSEGSGVDVAISPTPRPASVITIARPGTSLSESDRHGFEVTIEQIDRATVKVGAA GQNWLVEMEMFRAENRYVSLEPVTMSIGR" gene complement(4368518..4370170) /gene="mycP2" /locus_tag="Rv3886c" /db_xref="GeneID:886215" CDS complement(4368518..4370170) /gene="mycP2" /locus_tag="Rv3886c" /EC_number="3.4.21.-" /function="THOUGHT TO HAVE PROTEOLYTIC ACTIVITY." /experiment="experimental evidence, no additional details recorded" /note="Rv3886c, (MTCY15F10.26), len: 550 aa. Probable mycP2, ala-, pro-rich membrane-anchored serine protease (mycosin) (EC 3.4.21.-) (see citation below), highly similar to Q9CBV3|ML1538 POSSIBLE PROTEASE from Mycobacterium leprae (567 aa), FASTA scores: opt: 1034, E(): 3.9e-32, (43.5% identity in 575 aa overlap); and highly similar, but with gaps, to several putative proteases from Mycobacterium leprae; O33076|ML0041|MLCB628.04 (446 aa), FASTA scores: opt: 860, E(): 1.1e-25, (38.65% identity in 538 aa overlap); Q9CD36|ML2528 (475 aa) (475 aa), FASTA scores: opt: 413, E(): 7.1e-09, (37.7% identity in 562 aa overlap). Also similarity with Q99405|PRTM_BACSP M-PROTEASE (EC 3.4.21.-) from Bacillus sp. strain KSM-K16 (269 aa), FASTA scores: E(): 7.6e-06, (27.1% identity in 277 aa overlap). And highly similar, but also with gaps, to other mycosins from Mycobacterium tuberculosis e.g. O53945|Rv1796|MTV049.18 (585 aa), FASTA scores: opt: 1173, E(): 2.4e-37, (47.9% identity in 578 aa overlap); the upstream ORF O05461|Rv3883c|MTCY15F10.29 (446 aa) FASTA scores: opt: 910, E(): 1.5e-27, (40.15% identity in 533 aa overlap); O06316|Rv3449|MTCY13E12.02 (455 aa) FASTA scores: opt: 477, E(): 2.7e-11, (38.75% identity in 550 aa overlap); etc. Contains Pro rich protein with two serine protease, subtilase family active site motifs: aspartic acid active site motif (PS00136); and histidine active site motif (PS00137). BELONGS TO PEPTIDASE FAMILY S8 (ALSO KNOWN AS THE SUBTILASE FAMILY), PYROLYSIN SUBFAMILY. THOUGHT TO BE CLEAVED INTO SMALLER MOLECULAR WEIGHT PROTEINS, 36 AND 29 KDA (see citation below)." /codon_start=1 /transl_table=11 /product="alanine and proline rich membrane-anchored mycosin" /protein_id="NP_218403.1" /db_xref="GI:15611022" /db_xref="GeneID:886215" /translation="MASPLNRPGLRAAAASAALTLVALSANVPAAQAIPPPSVDPAMV PADARPGPDQPMRRSNSCSTPITVRNPDVAQLAPGFNLVNISKAWQYSTGNGVPVAVI DTGVSPNPRLPVVPGGDYIMGEDGLSDCDAHGTVVSSIIAAAPLGILPMPRAMPATAA FPPPAGPPPVTAAPAPPVEVPPPMPPPPPVTITQTVAPPPPPPEDAGAMAPSNGPPDP QTEDEPAVPPPPPGAPDGVVGVAPHATIISIRQSSRAFEPVNPSSAGPNSDEKVKAGT LDSVARAVVHAANMGAKVINISVTACLPAAAPGDQRVLGAALWYAATVKDAVIVAAAG NDGEAGCGNNPMYDPLDPSDPRDWHQVTVVSSPSWFSDYVLSVGAVDAYGAALDKSMS GPWVGVAAPGTHIMGLSPQGGGPVNAYPPSRPGEKNMPFWGTSFSAAYVSGVAALVRA KFPELTAYQVINRIVQSAHNPPAGVDNKLGYGLVDPVAALTFNIPSGDRMAPGAQSRV ITPAAPPPPPDHRARNIAIGFVGAVATGVLAMAIGARLRRAR" misc_feature complement(4369742..4369774) /gene="mycP2" /locus_tag="Rv3886c" /note="PS00137 Serine proteases, subtilase family, histidine active site" misc_feature complement(4369844..4369876) /gene="mycP2" /locus_tag="Rv3886c" /note="PS00136 Serine proteases, subtilase family, aspartic acid active site" gene complement(4370155..4371684) /locus_tag="Rv3887c" /db_xref="GeneID:886211" CDS complement(4370155..4371684) /locus_tag="Rv3887c" /function="UNKNOWN" /note="Rv3887c, (MTCY15F10.25), len: 509 aa. Probable conserved transmembrane protein (has hydrophilic stretch from 1-130 then very hydrophobic domain), similar to other membrane proteins and with weak similarity to known transporters, e.g. Q9CBV2|ML1539 PROBABLE MEMBRANE PROTEIN from Mycobacterium leprae (503 aa), FASTA scores: opt: 395, E(): 2.3e-16, (28.0% identity in 496 aa overlap); Q9CD35|ML2529 CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (485 aa), FASTA scores: opt: 221, E(): 6.6e-06, (24.6% identity in 423 aa overlap); Q9ADP8|2SC10A7.11 PUTATIVE INTEGRAL MEMBRANE PROTEIN from Streptomyces coelicolor (430 aa), FASTA scores: opt: 171, E(): 0.0062, (26.55% identity in 358 aa overlap); CAC44275|SCBAC17F8.03 PUTATIVE DRUG EFFLUX PROTEIN from Streptomyces coelicolor (416 aa), FASTA scores: opt: 160, E(): 0.028, (27.85% identity in 323 aa overlap); etc. Also similar to others from Mycobacterium tuberculosis e.g. O53944|Rv1795|MTV049.17 PUTATIVE MEMBRANE PROTEIN (503 aa), FASTA scores: opt: 360, E(): 2.9e-14, (26.65% identity in 514 aa overlap); etc. Equivalent to AAK48369 from Mycobacterium tuberculosis strain CDC1551 (469 aa) but longer 40 aa." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218404.1" /db_xref="GI:15611023" /db_xref="GeneID:886211" /translation="MTAPHKVAFPARCAVNICYDKHLCSQVFPAGIPVEGFFEGMVEL FDADLKRKGFDGVALPAGSYELHKINGVRLDINKSLDELGVQDGDTLVLVPRVAGESF EPQYESLSTGLAAMGKWLGRDGGDRMFAPVTSLTAAHTAMAIIAMAVGVVLALTLRTR TITDSPVPAAMAGGIGVLLVIGALVVWWGWRERRDLFSGFGWLAVVLLAVAAACAPPG ALGAAHALIGLVVVVLGAITIGVATRKRWQTAVVTAVVTVCGILAAVAAVRMFRPVSM QVLAICVLVGLLVLIRMTPTVALWVARVRPPHFGSITGRDLFARRAGMPVDTVAPVSE ADADDEDNELTDITARGTAIAASARLVNAVQVGMCVGVSLVLPAAVWGVLTPRQPWAW LALLVAGLTVGLFITQGRGFAAKYQAVALVCGASAAVCAGVLKYALDTPKGVQTGLLW PAIFVAAFAALGLAVALVVPATRFRPIIRLTVEWLEVLAMIALLPAAAALGGLFAWLR H" gene complement(4371681..4372706) /locus_tag="Rv3888c" /db_xref="GeneID:886219" CDS complement(4371681..4372706) /locus_tag="Rv3888c" /function="UNKNOWN" /note="Rv3888c, (MTCY15F10.24), len: 341 aa. Probable conserved membrane protein, showing similarity with hypothetical proteins from Mycobacterium leprae: O33082|MLCB628.11c (478 aa), FASTA scores: opt: 530, E(): 7.7e-26, (32.45% identity in 336 aa overlap); Q9CDD8|ML0048 (586 aa), FASTA scores: opt: 530, E(): 9.1e-26, (32.45% identity in 336 aa overlap); Q9CCI1|ML0798 (592 aa), FASTA scores: opt: 426, E(): 3e-19, (27.5% identity in 342 aa overlap) (similarity only at C-terminus). Also similar to proteins from Mycobacterium tuberculosis e.g. P96217|Rv3860|MTCY01A6.08c (390 aa), FASTA scores: opt: 603, E(): 1.7e-30, (35.2% identity in 284 aa overlap); O06396|Rv0530|MTCY25D10.09 (405 aa), FASTA scores: opt: 573, E(): 1.3e-28, (32.0% identity in 328 aa overlap); C-terminus of O69740|Rv3876|MTV027.1 (666 aa), FASTA scores: opt: 509, E(): 2.1e-24, (31.0% identity in 303 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218405.1" /db_xref="GI:15611024" /db_xref="GeneID:886219" /translation="MTNPWNDPNMLDDGAIGRGDPSVRHHFRDSVSDTMRITDLAAPR KIPPGTGWRKFVYSVSFHKINPGESPRERHYRNLQGRIRRHIRRQYVITVVSGKGGVG VTTMAACIGGVFRECRPENVIAIDAVPSFGTLADRIDESPPGDYAAIINDTDVQGYAD IREHLGQNTVGLDVLAGNRTSDQPRPLVPAMFSAVLSRLRRTHTVIVIDTSPDLEHDV MKAVLQSTDTLVFVSGITADRSRPVLRAVDYLRAQGYHELVSRSTVILNHTDSITDKD ALAYLTERFTKVGAIVEAMPFDPHLAKGGIIDTVHELNKKSRLRLFEITAGLADKYVP DAERAAQ" gene complement(4372800..4373630) /locus_tag="Rv3889c" /db_xref="GeneID:886223" CDS complement(4372800..4373630) /locus_tag="Rv3889c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3889c, (MTCY15F10.23), len: 276 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218406.1" /db_xref="GI:15611025" /db_xref="GeneID:886223" /translation="MLTTTVDGLWVLQAVTGVEQTCPELGLRPLLPRLDTAERALRHP VAAELMAVGALDQAGNADPMVREWLTVLLRRDLGLLVTIGVPGGEPTRAAICRFATWW VVLERHGNLVRLYPAGTASDEAGAGELVVGQVERLCGVAEAAPLRPVTVDADELLHAV RDAGTLRSYLLSQRLDVDQLQMVTMAADPTRSAHATLVALQAGVGPEKSARILVGDST VAIVDTAAGRICVESVTSGQRRYQVLSPGSRSDIGGAVQRLIRRLPAGDEWYSYRRVV" gene complement(4373726..4374013) /gene="esxC" /locus_tag="Rv3890c" /db_xref="GeneID:886222" CDS complement(4373726..4374013) /gene="esxC" /locus_tag="Rv3890c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3890c, (MT4005, MTCY15F10.22), len: 95 aa. esxC, ESAT-6 like protein (see Gey Van Pittius et al., 2001), equivalent to Q9K548|ES6B_MYCPA PUTATIVE ESAT-6 LIKE PROTEIN 11 (ORF3890C) from Mycobacterium paratuberculosis (95 aa), FASTA scores: opt: 490, E(): 3.3e-26, (76.85% identity in 95 aa overlap). BELONGS TO THE ESAT6 FAMILY.; ES6_11" /codon_start=1 /transl_table=11 /product="ESAT-6 like protein ESXC (ESAT-6 like protein 11)" /protein_id="NP_218407.1" /db_xref="GI:15611026" /db_xref="GeneID:886222" /translation="MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFF AGHGAQGFFDAQAQMLSGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF" gene complement(4374049..4374372) /gene="esxD" /locus_tag="Rv3891c" /db_xref="GeneID:886218" CDS complement(4374049..4374372) /gene="esxD" /locus_tag="Rv3891c" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3891c, (MTCY15F10.21), len: 107 aa (first GTG taken). esxD, ESAT-6 like protein, equivalent to Q9K547 HYPOTHETICAL 10.3 KDA PROTEIN (FRAGMENT) from Mycobacterium paratuberculosis (100 aa), FASTA scores: opt: 498, E(): 1.7e-26, (77.25% identity in 101 aa overlap). SEEMS TO BELONG TO THE ESAT6 FAMILY (see Gey Van Pittius et al., 2001)." /codon_start=1 /transl_table=11 /product="ESAT-6 like protein EsxD" /protein_id="NP_218408.1" /db_xref="GI:15611027" /db_xref="GeneID:886218" /translation="MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPA TWSGTGVVASHMTATEITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFG ASHGS" gene complement(4374484..4375683) /gene="PPE69" /locus_tag="Rv3892c" /db_xref="GeneID:886227" CDS complement(4374484..4375683) /gene="PPE69" /locus_tag="Rv3892c" /function="UNKNOWN" /note="Rv3892c, (MTCY15F10.20), len: 399 aa. Member of the Mycobacterium tuberculosis PPE family of conserved proteins, similar to many e.g. O05298|Rv1196|MTCI364.08 from Mycobacterium leprae (391 aa), FASTA scores: opt: 348, E(): 2.2e-08, (26.6% identity in 380 aa overlap)." /codon_start=1 /transl_table=11 /product="PPE family protein" /protein_id="YP_178024.1" /db_xref="GI:57117166" /db_xref="GeneID:886227" /translation="MPDPGWAARTPEANDLLLTAGTGVGTHLANQTAWTTLGASHHAS GVASAINTAATAASWLGVGSAASALNVTMLNATLHGLAGWVDVKPAVVSTAIAAFETA NAAMRPAPECMENRDEWGVDNAINPSVLWTLTPRIVSLDVEYFGVMWPNNAAVGATYG GVLAALAESLAIPPPVATMGASPAAPAQAAAAVGQAAAEAAAGDGMRSAYQGVQAGST GAGQSTSAGENFGNQLSTFMQPMQAVMQAAPQALQAPSGLMQAPMSAMQPLQSMVGMF ANPGALGMGGAAPGASAASAAGGISAAATEVGAGGGGAALGGGGMPATSFTRPVSAFE SGTSGRPVGLRPSGALGADVVRAPTTTVGGTPIGGMPVGHAAGGHRGSHGKSEQAATV RVVDDRR" gene complement(4375762..4375995) /gene="PE36" /locus_tag="Rv3893c" /db_xref="GeneID:886213" CDS complement(4375762..4375995) /gene="PE36" /locus_tag="Rv3893c" /function="UNKNOWN" /note="Rv3893c, (MTCY15F10.19), len: 77 aa. Member of the Mycobacterium tuberculosis PE family of conserved proteins (see citation below), similar to others e.g. O53690|Rv0285|MTV035.13 from Mycobacterium tuberculosis (102 aa), FASTA scores: opt: 136, E(): 0.042, (35.6% identity in 73 aa overlap)." /codon_start=1 /transl_table=11 /product="PE family protein" /protein_id="YP_178025.1" /db_xref="GI:57117167" /db_xref="GeneID:886213" /translation="MVWSVQPEAVLASAAAESAISAETEAAAAGAAPALLSTTPMGGD PDSAMFSAALNACGASYLGVVAEHASQRGLFAG" gene complement(4376262..4380452) /locus_tag="Rv3894c" /db_xref="GeneID:886230" CDS complement(4376262..4380452) /locus_tag="Rv3894c" /function="UNKNOWN" /note="Rv3894c, (MTCY15F10.18), len: 1396 aa. Possible conserved membrane protein (possible transmembrane segments from aa 37-85), similar to Q9CD30|ML2535 HYPOTHETICAL PROTEIN from Mycobacterium leprae (1329 aa), FASTA scores: opt: 652, E(): 2.2e-30, (27.85% identity in 1425 aa overlap); Q9CDD7|ML0052 HYPOTHETICAL PROTEIN from Mycobacterium leprae (597 aa), FASTA scores: opt: 537, E(): 6.6e-24, (27.5% identity in 585 aa overlap) (similarity only with C-terminal end); Q9Z5I2|ML1543|MLCB596.28 POSSIBLE SPOIIIE-FAMILY MEMBRANE PROTEIN from Mycobacterium leprae (1345 aa), FASTA scores: opt: 523, E(): 8.6e-23, (31.65% identity in 1412 aa overlap). Also similar to various proteins e.g. O86653|SC3C3.20c ATP/GTP BINDING PROTEIN from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 973, E(): 2.8e-49, (28.1% identity in 1409 aa); Q9L0T6|SCD35.15c PUTATIVE CELL DIVISION-RELATED PROTEIN from Streptomyces coelicolor(1525 aa), FASTA scores: opt: 524, E(): 8.3e-23, (24.95% identity in 1450 aa overlap); Q9KE81|BH0975 HYPOTHETICAL PROTEIN from Bacillus halodurans (1489 aa), FASTA scores: opt: 444, E(): 4.1e-18, (22.5% identity in 1346 aa overlap); etc. Also similar to AAK46103|MT1833 FTSK/SPOIIIE FAMILY PROTEIN from Mycobacterium tuberculosis strain CDC1551 (1391 aa), FASTA scores: opt: 769, E(): 2.9e-37, (30.6% identity in 1434 aa overlap); and other hypothetical proteins from Mycobacterium tuberculosis e.g. O53689|Rv0284|MTV035.12 (1330 aa), FASTA scores: opt: 634, E(): 2.5e-29, (28.2% identity in 1443 aa overlap); O06264|Rv3447c|MTCY77.19c (1236 aa), FASTA scores: opt: 632, E(): 3.1e-29, (28.75% identity in 1391 aa overlap); O69736|R3871|MTV027.06 (591 aa), FASTA scores: opt: 588, E(): 6.6e-27, (27.75% identity in 605 aa overlap) (similarity only with C-terminal end); etc. Contains two possible (PS00017) ATP/GTP-binding sites (P-loop) in central portion." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218411.1" /db_xref="GI:15611030" /db_xref="GeneID:886230" /translation="MSKKAFPINRVNIDPPKPVRVAPNPPIALPEREPRNIWVMIGVP ALIVALIGTIVMLYVSGVRSLATGFFPLMGIGAFSMLAFSGRFGRARKITWGELEKGR RRYLRDLDTNRDEIQTAVCAQREWQNAVHSDPPGLGAIIGGPRMWERGRGDVDFLEVR VGTGVQHAPDSVLSVTWPDISSDEELEPVTGQALRDFILEQRKIRDIAKVVNLRSAPG FSFVSEDLDRVRSLMRSVLCSLAVFHNPRDVKLMVVTRNREVWAWMVWLPHNLHDELF DACGWRRLIFATPEELEAALGAELHMKGKRGAWTPPTVASPTAMGSALETGQVGVDLG PHLVIVDDNTGSPDAWESVVGQVGKAGLTVLRIASRVGTGVGFAEDQVFEMAQRHGAA TAVKAGRDGADADDDQRPAPLLRARGTFFAHADQLSIHRAYRYARAMARWSPTSRSEV TDSTSGAAELLRSLGISDPRELDVDRLWAERRGRGDDRWCEIPVGAKPNGELQNIILR AKDFGGFGFHSVVIGTSGSGKSELFLSLVYGIALTHSPETFNVIFVDMKFESAAQDIL GIPHVVAALSNLGKDERHLAERMRRVIDGEIKQRYELFKSVGARDANDYEEIRLAGRD LPPVPVLLVIVDEYLELFANHKKWIDLIIHIGQEGRGANVFFMLGGQRLDLSSLQKVK SNIAFRIALRAESGDDSREVIGSDAAYHLPSKENGFALLKVGPRDLEPFRCFYLSAPF VVPKKKEVARTIDMTLTQPRLYDWQYQPLDAADAEALATAAAADAEPDEFLYYDDGFK KKKIVDVLRESLYNVPHRSPRRPWLAPLEDPEPVDRLVAAYRGKPWHVDYGQNPGLMF PVGVMDIPEESQQVVHAVDALRSNIIVVGAKQRGKTTTLMALMCSAATMYTPERVTFF CIGGATMAQIGSLPHVTDIVSPKDAEGIERILSTMDALIDAREEAFRRAKIDMDGFRE RRFGIGGDGVGGTDPTDAFGDVFVVLDDYDDLYAKDTLLGDRIISLSSRGPEYGVHLM CSAGGWIHGQRQSLLQNVTARIQLRLADPGESQMGHLSIESREAARRTLNRPGFGLTE SLHELRIGVPALADPGTGELVGITDVGARIADVAGVTKHASLQRLPQRVELSAIVEHE AVHQGGDDLSIAFAIGERHELGPVPIKLRESPGLMILGRQGCGKTTALVAIGEAVMNR FSPQQAQLTLIDPKTAPHGLRDLHAPGYVRAYAYDQDEIDEVITELAQQILLPRLPPK GLSQEELRALKPWEGPRHFVLIDDVQDLRPAQSYPQKPPVGAALWKLMERARQVGLHV FSTRNSANWATMPMDPWVKSQTSAKVAQLYMDNDPQNRINRSVRAQTLPPGRGLLVGA DGDVEGILVGYPSVPGEQ" misc_feature complement(4377777..4377800) /locus_tag="Rv3894c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" misc_feature complement(4378863..4378886) /locus_tag="Rv3894c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)" gene complement(4380453..4381940) /locus_tag="Rv3895c" /db_xref="GeneID:886231" CDS complement(4380453..4381940) /locus_tag="Rv3895c" /function="UNKNOWN" /note="Rv3895c, (MTCY15F10.17), len: 495 aa. Probable conserved membrane protein, highly similar to two CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae: Q9Z5I3|ML1544|MLCB596.27 (506 aa), FASTA scores: opt: 1070, E(): 1.4e-53, (39.8% identity in 485 aa overlap); and Q9CD29|ML2536 (552 aa), FASTA scores: opt: 483, E(): 4e-20, (36.85% identity in 499 aa overlap). Also highly similar to various proteins from Mycobacterium tuberculosis e.g. O53933|Rv1782|MTV049.04 HYPOTHETICAL PROTEIN (506 aa), FASTA scores: opt: 1106, E(): 1.2e-55, (41.25% identity in 485 aa overlap); O69734|Rv3869|MTV027.04 HYPOTHETICAL PROTEIN (480 aa), FASTA scores: opt: 795, E(): 6.1e-38, (36.0% identity in 486 aa overlap); O33088|ML0054|MLCB628.17c PUTATIVE MEMBRANE PROTEIN) (481 aa), FASTA scores: opt: 740, E(): 8.3e-35, (35.65% identity in 485 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218412.1" /db_xref="GI:15611031" /db_xref="GeneID:886231" /translation="MPLSLSNRDQNSGHLFYNRRLRAATTRFSVRMKHDDRKQTAALA LSMVLVAIAAGWMMLLNVLKPTGIVGDSAIIGDRDSGALYARIDGRLYPALNLTSARL ATGTAGQPTWVKPAEIAKYPTGPLVGIPGAPAAMPVNRGAVSAWAVCDTAGRPRSADK PVVTSIAGPITGGGRATHLRDDAGLLVTFDGSTYVIWGGKRSQIDPTNRAVTLSLGLD PGVTSPIQISRALFDGLPATEPLRVPAVPEAGTPSTWVPGARVGSVLQAQTAGGGSQF YVLLPDGVQKISSFVADLLRSANSYGAAAPRVVTPDVLVHTPQVTSLPVEYYPAGRLN FVDTAADPTTCVSWEKASTDPQARVAVYNGRGLPVPPSMDSRIVRLVRDDRAPASVVA TQVLVLPGAANFVTSTSGVITAESRESLFWVSGNGVRFGIANDEATLRALGLDPGAAV QAPWPLLRTFAAGPALSRDAALLARDTVPTLGQVAIVTTTAKAGA" gene complement(4381943..4382851) /locus_tag="Rv3896c" /db_xref="GeneID:886216" CDS complement(4381943..4382851) /locus_tag="Rv3896c" /function="UNKNOWN" /note="Rv3896c, (MTCY15F10.16), len: 302 aa (first GTG taken, although TBparse suggests TTG at 16079). Putative conserved ala-rich protein. C-terminus highly similar to C-terminal end of other proteins e.g. Q9XAS4|SC10A7.01 HYPOTHETICAL 17.2 KDA PROTEIN from Streptomyces coelicolor (244 aa), FASTA scores: opt: 255, E(): 1.4e-08, (32.0% identity in 222 aa overlap); CAC44611|STBAC16H6.32 PUTATIVE SECRETED PROTEIN from Streptomyces coelicolor (172 aa), FASTA scores: opt: 214, E(): 3.4e-06, (42.55% identity in 94 aa overlap); Q38352|ORF360 from Lactococcus delbrueckii bacteriophage LL-H (360 aa), FASTA scores: opt: 211, E(): 9.5e-06, (40.0% identity in 115 aa overlap); P54334|XKDO_BACSU|XKDO PHAGE-LIKE ELEMENT PBSX PROTEIN from Bacillus subtilis (1332 aa), FASTA scores: opt: 209, E(): 3.6e-05, (38.35% identity in 86 aa overlap); etc. Also similar to P71594|P71594|Rv0024|MTCY10H4.24 HYPOTHETICAL 30.3 KDA PROTEIN from Mycobacterium tuberculosis (281 aa), FASTA scores: opt: 265, E(): 3.9e-09, (29.25% identity in 287 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218413.1" /db_xref="GI:15611032" /db_xref="GeneID:886216" /translation="MSTWHRIGTEGEPLTDPLTTQAIAALSRGHGLFAGGVSGADIDA PQIQQYANAISWVANAVPTAAAYRWRGAARALRRLANTDEALAQIMAAAQIDHAHART ATRALLEAAKTDAMALTDTPLGRREAMARMAARLRAQHRHIARCRSRARLLGLRLRRL RYLRTAAARRPQVTTPGGRAQVLAAIQKALDIQGVHDPAARARWTRGMDLVARRESNY NANAINHWDSNAARGTPSRGVWQFIAPTFAAYHEPGTSTNIHDLVAQACAFINYARGH YGVAADASNLADLIQQADPRRSPRGY" gene complement(4383008..4383640) /locus_tag="Rv3897c" /db_xref="GeneID:886225" CDS complement(4383008..4383640) /locus_tag="Rv3897c" /function="UNKNOWN" /note="Rv3897c, (MTCY15F10.15), len: 210 aa. Conserved hypothetical protein, highly similar in part to Q10691|YK83_MYCTU|Rv2083|MT2145|MTCY49.22 HYPOTHETICAL 30.8 KDA PROTEIN from Mycobacterium tuberculosis (314 aa) FASTA scores: opt: 815, E(): 4.7e-26, (73.05% identity in 167 aa overlap). Similarity to MTCY49.22 suggests that this is a continuation of MTCY15F10.14. There is a frameshift mutation near 3'-end with respect to this sequence as well, similarity to MTCY49.22 continues in an overlapping ORF. Sequence appears to be correct." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218414.1" /db_xref="GI:15611033" /db_xref="GeneID:886225" /translation="MMQQAVSGITGALGGAVGGVMGPLTQLPQQAMQAGQGAMQPLMS ALQQTYGAEGLDVADGARLVDSIEGEPGLGGEPGAGDVGAGGGGGGTTPTGYLGPPPV PTSSPPTTPAGAPAKSVTPDPVSGTPRASGPAGMTGMPMVPPGALGAGAEGANKDKPV EKRVTGCAEWSTGQGPLNSTAECSGEICRRQAGGHQVDATDPCCAERRQG" gene complement(4383653..4383985) /locus_tag="Rv3898c" /db_xref="GeneID:886233" CDS complement(4383653..4383985) /locus_tag="Rv3898c" /function="UNKNOWN" /note="Rv3898c, (MTCY15F10.14), len: 110 aa. Conserved hypothetical protein. Highly similar, but in part, to Q10691|YK83_MYCTU|Rv2083|MT2145|MTCY49.22 HYPOTHETICAL 30.8 KDA PROTEIN from Mycobacterium tuberculosis (314 aa) FASTA scores: opt: 204, E(): 0.00042, (50.6% identity in 81 aa overlap). Similarity suggests it should be in frame with next ORF and that the stop codon could be read through, the sequence appears to be correct. Homology lost upstream at 15138 gatc sequence may suggest discontinuity due to chimerism in cY15F10 or cY49." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218415.1" /db_xref="GI:15611034" /db_xref="GeneID:886233" /translation="MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPV DLPAPADIANGALFAAGNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQFQ GVGAQAEA" gene complement(4384147..4385379) /locus_tag="Rv3899c" /db_xref="GeneID:886228" CDS complement(4384147..4385379) /locus_tag="Rv3899c" /function="UNKNOWN" /note="Rv3899c, (MTCY15F10.13), len: 410 aa. Conserved hypothetical protein, similar in part to proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551. Region between aa 29-80 is strictly identical to P96909 HYPOTHETICAL 15.1 KDA PROTEIN (FRAGMENT) (143 aa) FASTA scores: opt: 562, E(): 4e-16, (69.0% identity in 142 aa overlap); and the N-terminal end is highly similar, but longer 65 aa, to O07266 HYPOTHETICAL 13.7 KDA PROTEIN (FRAGMENT) (143 aa), FASTA scores: opt: 562, E(): 4e-16, (69.0% identity in 142 aa overlap). Highly similar to C-terminal end of Q10690|YK82_MYCTU|Rv2082|MTCY49.21 HYPOTHETICAL 73.6 KDA PROTEIN (721 aa), FASTA scores: opt: 1388, E(): 1.5e-48, (55.25% identity in 409 aa overlap). And similar to P71599|Rv0029|MTCY10H4.29 HYPOTHETICAL 39.6 KDA PROTEIN (365 aa), FASTA scores: opt: 403, E(): 1.7e-09, (33.75% identity in 252 aa overlap). Note that MTCY15F10.12 and MTCY15F10.13 appear frameshifted with respect to MTCY49.21 although the sequence appears to be correct." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218416.1" /db_xref="GI:15611035" /db_xref="GeneID:886228" /translation="MVTGQPAAAGAHSLSEGAMTAMQSGSVPPPQATPPITTPPVVSA PTMAAGIEATHGPVDTPANTSGAPPASTGTTGPVAPTVVTAGPVAAPAAPVVGGSAVP AGPLPAYGSDLRPPVVAAPAVPSVPTAPVSGAPVAPSASSAPSAGGALVSPVERAASK AVAGQAGASSSTMAGASALSATAGATAGAVSARAAEQQRLQRIVDAVARQEPRISWAA GLRDDGTTTLLVTDLAGGWIPPHVRLPANVTLLEPTARRRDADVIDLLGAVVAVAAHE SNTYVAEPGPDAPALTGDRSARSAIPKVDEFGPTLVEAVRRRDSLPRIAQAIALPAVR KTGVLENEAELLHGCITAVKESVLKAYPSHELTAVGDWMLLAAIEALIDEQDYLANYH LAWYAVTTRRGGSRGFAA" gene complement(4385373..4386308) /locus_tag="Rv3900c" /db_xref="GeneID:886235" CDS complement(4385373..4386308) /locus_tag="Rv3900c" /function="UNKNOWN" /note="Rv3900c, (MTCY15F10.12), len: 311 aa. Conserved hypothetical ala-rich protein, highly similar to N-terminal end of Q10690|YK82_MYCTU|Rv2082|MTCY49.21 HYPOTHETICAL 73.6 KDA PROTEIN from Mycobacterium tuberculosis (721 aa), FASTA scores: opt: 592, E(): 2.7e-22, (37.15% identity in 280 aa overlap). Note that MTCY15F10.12 and MTCY15F10.13 appear frameshifted with respect to MTCY49.21 although the sequence appears to be correct." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218417.1" /db_xref="GI:15611036" /db_xref="GeneID:886235" /translation="MVAADLPPGRWSAVLVGPWWPAPSAALRAAAQHWATWAMQKQEL ARNLISQHDLLLRNQGRTAEDLIGRYLRGAKSEVTKAEKYEIKKGAFNTAADAIDYLR SRLTGIAGEGNKEIDDVLASKKPLPEQLAEIQAIQTRCNADAANASRDAVDKVMTAMQ EILEAEDIGDDPRTWARANGFNVDDAPPPRLIRENDLAALTGPGARGGSFGSVEGAGD LASPQSVGAGGFSGSGVQAACSQPAPRAIGASSRHASAGPVPPAPVVTTPAAATPPVI ATGPRWRCPAGRCRRRPSDRAYRLRRLGNRLRPGW" gene complement(4386365..4386814) /locus_tag="Rv3901c" /db_xref="GeneID:886226" CDS complement(4386365..4386814) /locus_tag="Rv3901c" /function="UNKNOWN" /note="Rv3901c, (MTCY15F10.11), len: 149 aa. Possible membrane protein (hydrophobic stretch from 30-52), showing some similarity with O53200|Rv2473|MTV008.29 HYPOTHETICAL 25.1 KDA PROTEIN from Mycobacterium tuberculosis (238 aa), FASTA scores: opt: 147, E(): 0.036, (31.35% identity in 134 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218418.1" /db_xref="GI:15611037" /db_xref="GeneID:886226" /translation="MQAANRRSADTICGVTAPAPLPIPRTRSWPAIVVAAIAAVVAVA ALIVALTNARPAATPATTSVPTYTAAQTAAAQRQLCDTYKLVAHAVPVDTNGSDKALA RITLTNAAAILDNAAADPALDAKHRDAARASDRLPHNDRNGEWWHSS" gene complement(4387365..4387895) /locus_tag="Rv3902c" /db_xref="GeneID:886236" CDS complement(4387365..4387895) /locus_tag="Rv3902c" /function="UNKNOWN" /note="Rv3902c, (MTCY15F10.10), len: 176 aa. Hypothetical unknown protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218419.1" /db_xref="GI:15611038" /db_xref="GeneID:886236" /translation="MTIGVDLSTDLQDWIRLSGMNMIQGSETNDGRTILWNKGGEVRY FIDRLAGWYVITSSDRMSREGYEFAAASMSVIEKYLYGYFGGSVRSERELPAIRAPFQ PEELMPEYSIGTMTFAGRQRDTLIDSSGTVVAITAADRLVELSHYLDVSVNVIKDSFL DSEGKPLFTLWKDYKG" gene complement(4387892..4390432) /locus_tag="Rv3903c" /db_xref="GeneID:886229" CDS complement(4387892..4390432) /locus_tag="Rv3903c" /function="UNKNOWN" /note="Rv3903c, (MTCY15F10.08), len: 846 aa. Hypothetical unknown ala-, pro-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218420.1" /db_xref="GI:15611039" /db_xref="GeneID:886229" /translation="MAPLAVDPAALDSAGGAVVAAGAGLGAVISSLTAALAGCAGMAG DDPAGAVFGRSYDGSAAALVQAMSVARNGLCNLGDGVRMSAHNYSLAEAMSDVAGRAA PLPAPPPSGCVGVGAPPSAVGGGGGAPKGWGWVAPYIGMIWPNGDSTKLRAAAVAWRS AGTQFALTEIQSTAGPMGVIRAQQLPEAGLIESAFADAYASTTAVVGQCHQLAAQLDA YAARIDAVHAAVLDLLARICDPLTGIKEVWEFLTDQDEDEIQRIAHDIAVVVDQFSGE VDALAAEITAVVSHAEAVITAMADHAGKQWDRFLHSNPVGVVIDGTGQQLKGFGEEAF GMAKDSWDLGPLRASIDPFGWYRSWEEMLTGMAPLAGLGGENAPGVVESWKQFGKSLI HWDEWTTNPNEALGKTVFDAATLALPGGPLSKLGSKGRDILAGVRGLKERLEPTTPHL EPPATPPRPGPQPPRIEPPESGHPAPAPAAKPAPVPANGPLPHSPTESKPPPVDRPAE PVAPSSASAGQPRVSAATTPGTHVPHGLPQPGEHVPAQAPPATTLLGGPPVESAPATA HQPQWATTPAAPAAAPHSTPGGVHSTESGPHGRSLSAHGSEPTHDGASHGSGHGSGSE PPGLHAPHREQQLAMHSNEPAGEGWHRLSDEAVDPQYGEPLSRHWDFTDNPADRSRIN PVVAQLMEDPNAPFGRDPQGQPYTQERYQERFNSVGPWGQQYSNFPPNNGAVPGTRIA YTNLEKFLSDYGPQLDRIGGDQGKYLAIMEHGRPASWEQRALHVTSLRDPYHAYTIDW LPEGWFIEVSEVAPGCGQPGGSIQVRIFDHQNEMRKVEELIRRGVLRQ" gene complement(4390437..4390709) /gene="esxE" /locus_tag="Rv3904c" /db_xref="GeneID:886237" CDS complement(4390437..4390709) /gene="esxE" /locus_tag="Rv3904c" /function="UNKNOWN" /note="Rv3904c, (MT4023, MTCY15F10.07), len: 90 aa. esxE, ESAT-6 like protein, hypothetical unknown ala-rich protein. BELONGS TO THE ESAT6 FAMILY (see citation below).; ES6_12" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218421.1" /db_xref="GI:15611040" /db_xref="GeneID:886237" /translation="MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAA AHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANYTGAMATNLGMWS" gene complement(4390720..4391031) /gene="esxF" /locus_tag="Rv3905c" /db_xref="GeneID:886239" CDS complement(4390720..4391031) /gene="esxF" /locus_tag="Rv3905c" /function="UNKNOWN" /note="Rv3905c, (MT4024, MTCY15F10.06), len: 103 aa. esxF, ESAT-6 like protein (see citation below), hypothetical unknown ala-, gly-rich protein, ESAT-6 like protein. BELONGS TO THE ESAT6 FAMILY.; ES6_13" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218422.1" /db_xref="GI:15611041" /db_xref="GeneID:886239" /translation="MGADDTLRVEPAVMQGFAASLDGAAEHLAVQLAELDAQVGQMLG GWRGASGSAYGSAWELWHRGAGEVQLGLSMLAAAIAHAGAGYQHNETASAQVLREVGG G" gene complement(4391097..4391606) /locus_tag="Rv3906c" /db_xref="GeneID:886221" CDS complement(4391097..4391606) /locus_tag="Rv3906c" /function="UNKNOWN" /note="Rv3906c, (MTCY15F10.05), len: 169 aa. Conserved hypothetical protein, strongly related to Q50578|AT9S (SOD related in Escherichia coli) from Mycobacterium tuberculosis strain AOYAMA B (155 aa), but apparently different as flanking sequences differ and shorter 43 aa, FASTA scores: opt: 548, E(): 1.3e-26, (79.4% identity in 102 aa overlap). Selfmarch results suggest that Rv3906c is not related to any other hypothetical protein from Mycobacterium tuberculosis strain H37Rv except itself. Shows also similarity with Q9VFR2|CG9297 HYPOTHETICAL PROTEIN from Drosophila melanogaster (Fruit fly) (930 aa), FASTA scores: opt: 221, E(): 4.9e-06, (36.95% identity in 157 aa overlap); Q9HQ55|CBP|VNG1320G CALCIUM-BINDING PROTEIN HOMOLOGY from Halobacterium sp. strain NRC-1 (385 aa) FASTA scores: opt: 143, E(): 0.13, (35.65% identity in 160 aa overlap); Q24795 CALCIUM-BINDING PROTEIN (FRAGMENT) from Echinococcus granulosus (338 aa), FASTA scores: opt: 140, E(): 0.17, (33.95% identity in 156 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218423.1" /db_xref="GI:15611042" /db_xref="GeneID:886221" /translation="MEYCIAGDDGSAGIWNRPFDVDLDGDGRLDAIGLDLDGDGLRDD ALADFDGDDVADHAVFDVDNDGTPESYFIDDGSGTWAVAVDRGGQLRWYGLDGVEHTG GPLVDFDGFGGLDDRLLDTDGDGLADRVLCAGEQRVTGYVDTDGDGRWDVRLTDTDGD GTADGASSL" gene complement(4391631..4393073) /gene="pcnA" /locus_tag="Rv3907c" /db_xref="GeneID:886240" CDS complement(4391631..4393073) /gene="pcnA" /locus_tag="Rv3907c" /EC_number="2.7.7.19" /function="INVOLVED IN TRANSCRIPTION MECHANISM [CATALYTIC ACTIVITY: N ATP + (NUCLEOTIDE)(M) = N DIPHOSPHATE + (NUCLEOTIDE)(M+N)]." /experiment="experimental evidence, no additional details recorded" /note="Rv3907c, (MTCY15F10.04), len: 480 aa. Probable pcnA, polynucleotide polymerase (EC 2.7.7.19), equivalent to Q9CCY1|PCNA|ML2697 PCNA PROTEIN from Mycobacterium leprae (486 aa), FASTA scores: opt: 2713, E(): 4.3e-160, (84.1% identity in 478 aa overlap); and Q59534|PCNB POLYA POLYMERASE from Mycobacterium leprae (411 aa) FASTA scores: opt: 2077, E(): 7.1e-121, (82.55% identity in 373 aa overlap). Also highly similar to many e.g. Q9X8T2|SCH24.18 PUTATIVE RNA NUCLEOTIDYLTRANSFERASE from Streptomyces coelicolor (483 aa), FASTA scores: opt: 1856, E(): 3.7e-107, (61.55% identity in 455 aa overlap); Q9ZN65 POLYA POLYMERASE from Prevotella ruminicola (Bacteroides ruminicola) (479 aa), FASTA scores: opt: 830, E(): 8.5e-44, (34.85% identity in 445 aa overlap); P42977|PAPS_BACSU POLY(A) POLYMERASE from Bacillus subtilis (397 aa), FASTA scores: opt: 479, E(): 3.5e-22, (29.35% identity in 450 aa overlap); etc. Contains: PS00017 ATP/GTP-binding site motif A (P-loop), PS00018 EF-hand calcium-binding domain, and probably less significant a PS00237 G-protein coupled receptor signature, and PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. BELONGS TO THE TRNA NUCLEOTIDYLTRANSFERASE / POLY(A) POLYMERASE FAMILY." /codon_start=1 /transl_table=11 /product="poly(A) polymerase" /protein_id="YP_178026.1" /db_xref="GI:57117168" /db_xref="GeneID:886240" /translation="MPEAVQEADLLTAAAVALNRHAALLRELGSVFAAAGHELYLVGG SVRDALLGRLSPDLDFTTDARPERVQEIVRPWADAVWDTGIEFGTVGVGKSDHRMEIT TFRADSYDRVSRHPEVRFGDCLEGDLVRRDFTTNAMAVRVTATGPGEFLDPLGGLAAL RAKVLDTPAAPSGSFGDDPLRMLRAARFVSQLGFAVAPRVRAAIEEMAPQLARISAER VAAELDKLLVGEDPAAGIDLMVQSGMGAVVLPEIGGMRMAIDEHHQHKDVYQHSLTVL RQAIALEDDGPDLVLRWAALLHDIGKPATRRHEPDGGVSFHHHEVVGAKMVRKRMRAL KYSKQMIDDISQLVYLHLRFHGYGDGKWTDSAVRRYVTDAGALLPRLHKLVRADCTTR NKRRAARLQASYDRLEERIAELAAQEDLDRVRPDLDGNQIMAVLDIPAGPQVGEAWRY LKELRLERGPLSTEEATTELLSWWKSRGNR" misc_feature complement(4391760..4391798) /gene="pcnA" /locus_tag="Rv3907c" /note="PS00018 EF-hand calcium-binding domain." misc_feature complement(4392789..4392812) /gene="pcnA" /locus_tag="Rv3907c" /note="PS00017 ATP/GTP-binding site motif A (P-loop)." misc_feature complement(4392939..4392971) /gene="pcnA" /locus_tag="Rv3907c" /note="PS00639 Eukaryotic thiol (cysteine) proteases histidine active site." misc_feature complement(4393002..4393052) /gene="pcnA" /locus_tag="Rv3907c" /note="PS00237 G-protein coupled receptors signature." gene 4393449..4394195 /locus_tag="Rv3908" /db_xref="GeneID:886242" CDS 4393449..4394195 /locus_tag="Rv3908" /function="UNKNOWN" /experiment="experimental evidence, no additional details recorded" /note="Rv3908, (MTCY15F10.03c), len: 248 aa. Conserved hypothetical protein, equivalent to Q50195|ML2698|L222-ORF6 HYPOTHETICAL PROTEIN from Mycobacterium leprae (251 aa), FASTA scores: opt: 1270, E(): 3.4e-62, (79.05% identity in 248 aa overlap). Also similar to O66548|APFA|AQ_158 HYDROLASE from Aquifex aeolicus (134 aa), FASTA scores: opt: 300, E(): 1.1e-09, (37.3% identity in 142 aa overlap); and similarity with other various proteins e.g. O93721 DIADENOSINE 5'5'''-P1,P4-TETRAPHOSPHATE PYROPHOSPHOHYDROLASE from Pyrobaculum aerophilum (143 aa), FASTA scores: opt: 205, E(): 0.00017, (34.85% identity in 109 aa overlap); Q9HS29|APA|VNG0431G DIADENOSINE TETRAPHOSPHATE PYROPHOSPHOHYDROLASE from Halobacterium sp. strain NRC-1 (142 aa), FASTA scores: opt: 199, E(): 0.00036, (34.0% identity in 147 aa overlap); Q9YA58|APE2080 HYPOTHETICAL 19.2 KDA PROTEIN from Aeropyrum pernix (175 aa) FASTA scores: opt: 191, E(): 0.0012, (36.9% identity in 141 aa overlap); etc. Also similar to P95110|MUTT1|Rv2985|MTCY349.02 HYPOTHETICAL 34.7 KDA PROTEIN from Mycobacterium tuberculosis (317 aa) FASTA scores: opt: 224, E(): 3e-05, (34.05% identity in 144 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218425.1" /db_xref="GI:15611044" /db_xref="GeneID:886242" /translation="MSDGEQAKSRRRRGRRRGRRAAATAENHMDAQPAGDATPTPATA KRSRSRSPRRGSTRMRTVHETSAGGLVIDGIDGPRDAQVAALIGRVDRRGRLLWSLPK GHIELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKTVHHYLMRFLG GELSDEDLEVAEVAWVPIRELPSRLAYADERRLAEVADELIDKLQSDGPAALPPLPPS SPRRRPQTHSRARHADDSAPGQHNGPGPGP" misc_feature 4393755..4393814 /locus_tag="Rv3908" /note="PS00893 mutT domain signature." gene 4394192..4396600 /locus_tag="Rv3909" /db_xref="GeneID:886245" CDS 4394192..4396600 /locus_tag="Rv3909" /function="UNKNOWN" /note="Rv3909, (MTCY15F10.02c), len: 802 aa. Conserved hypothetical protein, equivalent to Q9CCY0|ML2699 PUTATIVE SECRETED PROTEIN from Mycobacterium leprae (797 aa) FASTA scores: opt: 3777, E(): 8.8e-206, (72.35% identity in 803 aa overlap). Note that the N-terminal end is highly similar to Q50196|L222-ORF7 (286 aa), FASTA scores: opt: 1213, E(): 2.7e-61, (71.75% identity in 255 aa overlap); and the C-terminal end is highly similar to Q50197|L222-ORF8 also from Mycobacterium leprae (512 aa) FASTA scores: opt: 2375, E(): 9.9e-127, (71.8% identity in 518 aa overlap). Shows some similarity with N-terminal end of Q9I2M3|PA1874 HYPOTHETICAL PROTEIN from Pseudomonas aeruginosa (2468 aa), FASTA scores: opt: 171, E(): 0.13, (22.9% identity in 672 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218426.1" /db_xref="GI:15611045" /db_xref="GeneID:886245" /translation="MTALQLGWAALARVTSAIGVVAGLGMALTVPSAAPHALAGEPSP TPFVQVRIDQVTPDVVTTSSEPHVTVSGTVTNTGDRPVRDVMVRLEHAAAVTSSTALR TSLDGGTDQYQPAADFLTVAPELDRGQEAGFTLSAPLRSLTRPSLAVNQPGIYPVLVN VNGTPDYGAPARLDNARFLLPVVGVPPDQATDFGSAVAPETTAPVWITMLWPLADRPR LAPGAPGGTVPVRLVDDDLANSLANGGRLDILLSAAEFATNREVDPDGAVGRALCLAI DPDLLITVNAMTGGYVVSDSPDGAAQLPGTPTHPGTGQAAASSWLDRLRTLVHRTCVT PLPFAQADLDALQRVNDPRLSAIATISPADIVDRILDVSSTRGATVLPDGPLTGRAIN LLSTHGNTVAVAAADFSPEEQQGSSQIGSALLPATAPRRLSPRVVAAPFDPAVGAALA AAGTNPTVPTYLDPSLFVRIAHESITARRQDALGAMLWRSLEPNAAPRTQILVPPASW SLASDDAQVILTALATAIRSGLAVPRPLPAVIADAAARTEPPEPPGAYSAARGRFNDD ITTQIGGQVARLWKLTSALTIDDRTGLTGVQYTAPLREDMLRALSQSLPPDTRNGLAQ QRLAVVGKTIDDLFGAVTIVNPGGSYTLATEHSPLPLALHNGLAVPIRVRLQVDAPPG MTVADVGQIELPPGYLPLRVPIEVNFTQRVAVDVSLRTPDGVALGEPVRLSVHSNAYG KVLFAITLSAAAVLVTLAGRRLWHRFRGQPDRADLDRPDLPTGKHAPQRRAVASRDDE KHRV" gene 4396597..4400151 /locus_tag="Rv3910" /db_xref="GeneID:886247" CDS 4396597..4400151 /locus_tag="Rv3910" /function="UNKNOWN" /note="Rv3910, (MTCY15F10.01c.MTV028.01), len: 1184 aa. Probable conserved transmembrane protein (hydrophobic domain 50-550), equivalent to Q9CCX9|ML2700 POSSIBLE CONSERVED MEMBRANE PROTEIN from Mycobacterium leprae (1206 aa), FASTA scores: opt: 5554, E(): 0, (75.15% identity in 1182 aa overlap); and highly similar, but shorter 380 aa, to Q50199|L222-ORF10 from Mycobacterium leprae (784 aa) FASTA scores: opt: 3297, E(): 5.5e-170, (68.8% identity in 769 aa overlap); and at the N-terminal end with Q50198|L222-ORF also from Mycobacterium leprae (379 aa) FASTA scores: opt: 1955, E(): 5.7e-98, (88.4% identity in 353 aa overlap) (ORFs 9 and 10 are adjacent on L222). Also similar in part (principally at the N-terminal end) to other membrane proteins e.g. Q9X8T0|SCH24.16c PUTATIVE TRANSMEMBRANE PROTEIN from Streptomyces coelicolor (811 aa), FASTA scores: opt: 573, E(): 2.8e-23, (31.05% identity in 573 aa overlap); O05467|MVIN_RHITR INTEGRAL MEMBRANE PROTEIN VIRULENCE FACTOR MVIN HOMOLOG from Rhizobium tropici (533 aa), FASTA scores: opt: 468, E(): 9e-18, (27.1% identity in 524 aa overlap); P56882|MVIN_RHIME INTEGRAL MEMBRANE PROTEIN VIRULENCE FACTOR MVIN HOMOLOG from Rhizobium meliloti (Sinorhizobium meliloti) (535 aa), FASTA scores: opt: 453, E(): 5.8e-17, (26.2% identity in 557 aa overlap); etc." /codon_start=1 /transl_table=11 /product="transmembrane protein" /protein_id="NP_218427.1" /db_xref="GI:15611046" /db_xref="GeneID:886247" /translation="MRPSPGEVPTASQRQPELSDAALVSHSWAMAFATLISRITGFAR IVLLAAILGAALASSFSVANQLPNLVAALVLEATFTAIFVPVLARAEQDDPDGGAAFV RRLVTLATTLLLGATTLSVLAAPLLVRLMLGTNPQVNEPLTTAFAYLLLPQVLVYGLS SVFMAILNTRNVFGPPAWAPVVNNVVAIATLAVYLAVPGELSVDPVRMGNAKLLVLGI GTTAGVFAQTAVLLVAIRREHISLRPLWGIDQRLKRFGAMAAAMVLYVLISQLGLVVG NRIASTAAASGPAIYNYTWLVLMLPFGMIGVTVLTVVMPRLSRNAAADDTPAVLADLS LATRLTMITLIPTVAFMTVGGPAIGSALFAYGNFGDVDAGYLGAAIALSAFTLIPYAL VLLQLRVFYAREQPWTPITIIVVITGVKILGSLLAPHITGDPQLVAAYLGLANGLGFL AGTIVGYYILRRALRPDGGQLIGVGEARTVLVTVAASLLAGLLAHVADRLLGLSELTA HAGSVGSLLRLSVLALIMLPILAAVTLCARVPEARAALDAVRARIRSRRLKTGPQTQN VLDQSSRPGPVTYPERRRLAPPRGKSVVHEPIRRRPPEQVARAGRAKGPEVIDRPSEN ASFGAASGAELPRPVADELQLDAPAGRDPGPVSRPHPSDLQNGDLPADAARGPIAFDA LREPDRESSAPPDDVQLVPGARIANGRYRLLIFHGGVPPLQFWQALDTALDRQVALTF VDPQGVLPDDVLQETLSRTLRLSRIDKPGVARVLDVVHTRAGGLVVAEWIRGGSLQEV ADTSPSPVGAIRAMQSLAAAADAAHRAGVALSIDHPSRVRVSIDGDVVLAYPATMPDA NPQDDIRGIGASLYALLVNRWPLPEAGVRSGLAPAERDTAGQPIEPADIDRDIPFQIS AVAARSVQGDGGIRSASTLLNLMQQATAVADRTEVLGPIDEAPVSAAPRTSAPNSETY TRRRRNLLIGIGAGAAVLMVALLVLASVLSRIFGDVSGGLNKDELGLNAPTASTSAAS SAPPGSVVKPTKVTVFSPDGGADNPGEADLAIDGNPATSWKTDIYTDPVPFPSFKNGV GLMLQLPQATVVGTVAIDVASTGTKVEIRSASTPTPATLEDTAVLTSATALRPGHNTI SVEAAAPTSNLLVWISTLGTTDGKSQADISEITIYAAS" gene 4400186..4400854 /gene="sigM" /locus_tag="Rv3911" /db_xref="GeneID:886246" CDS 4400186..4400854 /gene="sigM" /locus_tag="Rv3911" /function="THE SIGMA FACTOR IS AN INITIATION FACTOR THAT PROMOTES ATTACHMENT OF THE RNA POLYMERASE TO SPECIFIC INITIATION SITES AND THEN IS RELEASED." /experiment="experimental evidence, no additional details recorded" /note="Member of the extracytoplasmic function sigma factors which are active under specific conditions; binds with the catalytic core of RNA polymerase to produce the holoenzyme and directs bacterial core RNA polymerase to specific promoter elements to initiate transcription: in Mycobacterium bovis this protein has been shown to be active at high temperatures and during stationary phase" /codon_start=1 /transl_table=11 /product="RNA polymerase sigma factor SigM" /protein_id="NP_218428.1" /db_xref="GI:15611047" /db_xref="GeneID:886246" /translation="MPPPIGYCPAVGFGGRHERSDAELLAAHVAGDRYAFDQLFRRHH RQLHRLARLTSRTSEDADDALQDAMLSAHRGAGSFRYDAAVSSWLHRIVVNACLDRLR RAKAHPTAPLEDVYPVADRTAQVETAIAVQRALMRLPVEQRAAVVAVDMQGYSIADTR PDAGRGRGHRQEPLRPGAGPPSAAAGLSQHRGEHPALTPLPVRRSIDPRARRYPTSGY CHRA" gene 4400870..4401634 /locus_tag="Rv3912" /db_xref="GeneID:886234" CDS 4400870..4401634 /locus_tag="Rv3912" /function="UNKNOWN" /note="Rv3912, (MTV008.03), len: 254 aa. Hypothetical unknown ala-rich protein." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218429.1" /db_xref="GI:15611048" /db_xref="GeneID:886234" /translation="MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRV RSDPQAQQILRALNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAH AARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPL SRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVL LVIPADTPDKLAVFAVAPHCSAADTGLLASTVVPRA" gene 4401728..4402735 /gene="trxB2" /locus_tag="Rv3913" /db_xref="GeneID:886232" CDS 4401728..4402735 /gene="trxB2" /locus_tag="Rv3913" /function="ENZYME THAT CATALYSE THE REDUCTION OF DISULPHIDES BY PYRIDINE NUCLEOTIDES THROUGH AN ENZYME DISULPHIDE AND A FLAVIN. SEEMS REGULATED BY SIGH (Rv3223c PRODUCT). [CATALYTIC ACTIVITY: NADPH + OXIDIZED THIOREDOXIN = NADP(+) + REDUCED THIOREDOXIN]." /experiment="experimental evidence, no additional details recorded" /note="Rv3913, (MT4032, MTV028.04), len: 335 aa. Probable trxB2, thioredoxin reductase (EC 1.6.4.5) (see citation below), equivalent to O30973|TRXB_MYCSM THIOREDOXIN REDUCTASE from Mycobacterium smegmatis (311 aa), FASTA scores: opt: 1575, E(): 1.8e-87, (78.35% identity in 305 aa overlap); and highly similar, but shorter at C-terminus, to P46843|TRXB_MYCLE|TRXB/A|TRX|ML2703 BIFUNCTIONAL THIOREDOXIN REDUCTASE/THIOREDOXIN from Mycobacterium leprae (458 aa), FASTA scores: opt: 1766, E(): 8.7e-99, (83.25% identity in 328 aa overlap). Also highly similar to many e.g. P52215|TRXB_STRCO|SCH24.12 from Streptomyces coelicolor (321 aa), FASTA scores: opt: 1249, E(): 7.2e-68, (60.4% identity in 313 aa overlap); Q9Z8M4|TRXB_CHLPN from Chlamydia pneumoniae (Chlamydophila pneumoniae) (311 aa), FASTA scores: opt: 978, E(): 1.3e-51, (49.85% identity in 307 aa overlap); P09625|TRXB_ECOLI|B0888 from Escherichia coli strain K12 (320 aa), FASTA scores: opt: 948, E(): 8.6e-50, (49.2% identity in 309 aa overlap); etc. Contains PS00573 Pyridine nucleotide-disulphide oxidoreductases class-II active site. BELONGS TO THE PYRIDINE NUCLEOTIDE-DISULFIDE OXIDOREDUCTASES CLASS-II. COFACTOR: FAD (BY SIMILARITY)." /codon_start=1 /transl_table=11 /product="thioredoxin reductase TRXB2" /protein_id="NP_218430.1" /db_xref="GI:15611049" /db_xref="GeneID:886232" /translation="MTAPPVHDRAHHPVRDVIVIGSGPAGYTAALYAARAQLAPLVFE GTSFGGALMTTTDVENYPGFRNGITGPELMDEMREQALRFGADLRMEDVESVSLHGPL KSVVTADGQTHRARAVILAMGAAARYLQVPGEQELLGRGVSSCATCDGFFFRDQDIAV IGGGDSAMEEATFLTRFARSVTLVHRRDEFRASKIMLDRARNNDKIRFLTNHTVVAVD GDTTVTGLRVRDTNTGAETTLPVTGVFVAIGHEPRSGLVREAIDVDPDGYVLVQGRTT STSLPGVFAAGDLVDRTYRQAVTAAGSGCAAAIDAERWLAEHAATGEADSTDALIGAQ R" misc_feature 4402160..4402222 /gene="trxB2" /locus_tag="Rv3913" /note="PS00573 Pyridine nucleotide-disulphide oxidoreductases class-II active site." gene 4402732..4403082 /gene="trxC" /locus_tag="Rv3914" /db_xref="GeneID:886241" CDS 4402732..4403082 /gene="trxC" /locus_tag="Rv3914" /function="THIOREDOXIN PARTICIPATES IN VARIOUS REDOX REACTIONS THROUGH THE REVERSIBLE OXIDATION OF ITS ACTIVE CENTER DITHIOL, TO A DISULFIDE, & CATALYZES DITHIOL-DISULFIDE EXCHANGE REACTIONS. FORMS TOGETHER WITH THIOREDOXIN REDUCTASE AND NADPH A REDOX ACTIVE SYSTEM WHICH DONATES ELECTRONS TO A WIDE VARIETY OF DIFFERENT METABOLIC PROCESS (BY SIMILARITY). SEEMS REGULATED BY SIGH (Rv3223c PRODUCT)." /experiment="experimental evidence, no additional details recorded" /note="Rv3914, (MT4033, MTV028.05), len: 116 aa. trxC (alternate gene names: mpt46, trx, trxA *), thioredoxin (EC 1.-.-.-) (see citations below), equivalent to O30974|THIO_MYCSM|TRXA THIOREDOXIN from Mycobacterium smegmatis (112 aa), FASTA scores: opt: 576, E(): 2.1e-32, (80.2% identity in 111 aa overlap); and also equivalent to C-terminal end of P46843|TRXB_MYCLE|TRXB/A|TRX|ML2703 BIFUNCTIONAL THIOREDOXIN REDUCTASE/THIOREDOXIN from Mycobacterium leprae (458 aa), FASTA scores: opt: 628, E(): E(): 2e-35, (82.9% identity in 117 aa overlap). Also highly similar to many e.g. P80579|THIO_ALIAC from Alicyclobacillus acidocaldarius (Bacillus acidocaldarius) (105 aa), FASTA scores: opt: 411, E(): 3e-21, (57.15% identity in 105 aa overlap); P00275|THI1_CORNE from Corynebacterium nephridii (105 aa), FASTA scores: opt: 394, E(): 4.3e-20, (56.7% identity in 97 aa overlap); P00274|THIO_ECOLI|TRXA|TSNC|FIPA|B3781 from Escherichia coli and Salmonella typhimurium strain K12 and LT2 respectively (108 aa), FASTA scores: opt: 364, E(): 4.7e-18, (54.45% identity in 101 aa overlap); etc. Also similar to O53162|TRXB|Rv1471|MTV007.18 THIOREDOXIN from Mycobacterium tuberculosis (123 aa), FASTA scores: E(): 2.3e-15, (41.9% identity in 93 aa overlap). Contains PS00194 Thioredoxin family active site. BELONGS TO THE THIOREDOXIN FAMILY. The product of this CDS is supposed secreted. In this cas, this protein could exert its free radical scavenging activity inside macrophages. (*) Warning: note that Rv1470|MTV007.17 correspond also to trxA.; trx; trxA; mpt46" /codon_start=1 /transl_table=11 /product="thioredoxin trxC (TRX) (MPT46)" /protein_id="NP_218431.1" /db_xref="GI:15611050" /db_xref="GeneID:886241" /translation="MTDSEKSATIKVTDASFATDVLSSNKPVLVDFWATWCGPCKMVA PVLEEIATERATDLTVAKLDVDTNPETARNFQVVSIPTLILFKDGQPVKRIVGAKGKA ALLRELSDVVPNLN" misc_feature 4402816..4402872 /gene="trxC" /locus_tag="Rv3914" /note="PS00194 Thioredoxin family active site." gene 4403192..4404412 /locus_tag="Rv3915" /db_xref="GeneID:886250" CDS 4403192..4404412 /locus_tag="Rv3915" /EC_number="3.-.-.-" /function="UNKNOWN; PROBABLY INVOLVED IN CELLULAR METABOLISM." /note="Rv3915, (MTV028.06), len: 406 aa. Probable hydrolase (EC 3.-.-.-), equivalent to Q9CCX8|ML2704 PUTATIVE HYDROLASE from Mycobacterium leprae (406 aa) FASTA scores: opt: 2341, E(): 2.7e-138, (86.95% identity in 406 aa overlap); the N-terminal end is highly similar to Q59535 N-ACETYMURAMYL-L-ALANINE AMIDASE (EC 3.5.1.28) from Mycobacterium leprae (205 aa), FASTA scores: opt: 1046, E(): 5.7e-58, (84.85% identity in 185 aa overlap). Also similar to other hydrolases (especially amidases (EC 3.5.-.-)) e.g. C-terminal end of Q9K6R3|LYTC|BH3665 N-ACETYLMURAMOYL-L-ALANINE AMIDASE (MAJOR AUTOLYSIN) from Bacillus halodurans (588 aa), FASTA scores: opt: 363, E(): 4.3e-15, (33.15% identity in 356 aa overlap); Q9PKC7|TC0539 PUTATIVE N-ACETYLMURAMOYL-L-ALANINE AMIDASE from Chlamydia muridarum (268 aa), FASTA scores: opt: 285, E(): 1.6e-10, (26.05% identity in 242 aa overlap) (RV3915 product appears longer 127 aa); Q9S596|PDCA PENICILLIN-RESISTANT DD-CARBOXYPEPTIDASE (EC 3.4.-.-) from Myxococcus xanthus (302 aa), FASTA scores: opt: 270, E(): 1.5e-09, (39.85% identity in 158 aa overlap); etc. Note that previously known as cwlM.; cwlM" /codon_start=1 /transl_table=11 /product="hydrolase" /protein_id="YP_178027.1" /db_xref="GI:57117169" /db_xref="GeneID:886250" /translation="MPSPRREDGDALRCGDRSAAVTEIRAALTALGMLDHQEEDLTTG RNVALELFDAQLDQAVRAFQQHRGLLVDGIVGEATYRALKEASYRLGARTLYHQFGAP LYGDDVATLQARLQDLGFYTGLVDGHFGLQTHNALMSYQREYGLAADGICGPETLRSL YFLSSRVSGGSPHAIREEELVRSSGPKLSGKRIIIDPGRGGVDHGLIAQGPAGPISEA DLLWDLASRLEGRMAAIGMETHLSRPTNRSPSDAERAATANAVGADLMISLRCETQTS LAANGVASFHFGNSHGSVSTIGRNLADFIQREVVARTGLRDCRVHGRTWDLLRLTRMP TVQVDIGYITNPHDRGMLVSTQTRDAIAEGILAAVKRLYLLGKNDRPTGTFTFAELLA HELSVERAGRLGGS" gene complement(4404433..4405167) /locus_tag="Rv3916c" /db_xref="GeneID:886249" CDS complement(4404433..4405167) /locus_tag="Rv3916c" /function="UNKNOWN" /note="Rv3916c, (MTV028.07c), len: 244 aa. Conserved hypothetical protein, equivalent to Q50200|ML2705|L222-ORF1 HYPOTHETICAL PROTEIN from Mycobacterium leprae (259 aa), FASTA scores: opt: 1266, E(): 2e-74, (76.4% identity in 250 aa overlap). Also highly similar (but with gaps) to Q9R3S2|STH24.10 HYPOTHETICAL 22.6 KDA PROTEIN from Streptomyces coelicolor (205 aa), FASTA scores: opt: 387, E(): 7.5e-18, (40.25% identity in 231 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218433.1" /db_xref="GI:15611052" /db_xref="GeneID:886249" /translation="MSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEF EKEAWLSMVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVS ADAVLLTSMGIERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAV TPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVE AALERLLENARLQEPIAAGSTAGNTS" gene complement(4405457..4406491) /gene="parB" /locus_tag="Rv3917c" /db_xref="GeneID:886244" CDS complement(4405457..4406491) /gene="parB" /locus_tag="Rv3917c" /function="INVOLVED IN CHROMOSOME PARTITION. LOCALIZE TO BOTH POLES OF THE PREDIVISIONAL CELL FOLLOWING COMPLETION OF DNA REPLICATION. BINDS TO THE DNA ORIGIN OF REPLICATION (BY SIMILARITY)." /experiment="experimental evidence, no additional details recorded" /note="Rv3917c, (MTV028.08c, MT4036), len: 344 aa. Probable parB, chromosome partitioning protein, equivalent to Q50201|PARB_MYCLE|ML2706 PROBABLE CHROMOSOME PARTITIONING PROTEIN from Mycobacterium leprae (333 aa), FASTA scores: opt: 1654, E(): 1.6e-88, (78.6% identity in 332 aa overlap). Also highly similar to to others e.g. Q9S6U1|STH24.09 PUTATIVE PARTITIONING OR SPORULATION PROTEIN from Streptomyces coelicolor (328 aa), FASTA scores: opt: 966, E(): 9.7e-49, (58.55% identity in 287 aa overlap) (no similarity on N-terminus); Q9PB63|PARB_XYLFA|XF2281 PROBABLE CHROMOSOME PARTITIONING PROTEIN from Xylella fastidiosa (310 aa), FASTA scores: opt: 598, E(): 1.8e-27, (38.65% identity in 326 aa overlap); P31857|PARB_PSEPU PROBABLE CHROMOSOME PARTITIONING PROTEIN from Pseudomonas putida (290 aa), FASTA scores: opt: 573, E(): 4.6e-26, (40.35% identity in 322 aa overlap); etc. Contains probable helix-turn-helix motif at aa 179 to 200 (Score 1150, +3.1 0 SD). BELONGS TO THE PARB FAMILY. Note that previously known as parA.; parA" /codon_start=1 /transl_table=11 /product="chromosome partitioning protein ParB" /protein_id="NP_218435.2" /db_xref="GI:57117170" /db_xref="GeneID:886244" /translation="MTQPSRRKGGLGRGLAALIPTGPADGESGPPTLGPRMGSATADV VIGGPVPDTSVMGAIYREIPPSAIEANPRQPRQVFDEEALAELVHSIREFGLLQPIVV RSLAGSQTGVRYQIVMGERRWRAAQEAGLATIPAIVRETGDDNLLRDALLENIHRVQL NPLEEAAAYQQLLDEFGVTHDELAARIGRSRPLITNMIRLLKLPIPVQRRVAAGVLSA GHARALLSLEAGPEAQEELASRIVAEGLSVRATEETVTLANHEANRQAHHSDATTPAP PRRKPIQMPGLQDVAERLSTTFDTRVTVSLGKRKGKIVVEFGSVDDLARIVGLMTTDG RDKGLHRDAL" gene complement(4406488..4407531) /gene="parA" /locus_tag="Rv3918c" /db_xref="GeneID:886224" CDS complement(4406488..4407531) /gene="parA" /locus_tag="Rv3918c" /function="INVOLVED IN CHROMOSOME PARTITION. LOCALIZE TO BOTH POLES OF THE PREDIVISIONAL CELL FOLLOWING COMPLETION OF DNA REPLICATION (BY SIMILARITY)." /experiment="experimental evidence, no additional details recorded" /note="Rv3918c, (MTV028.09c), len: 347 aa. Probable parA, chromosome partitioning protein, highly similar to Q9CCX7|PARA|ML2707 PUTATIVE CELL DIVISION PROTEIN from Mycobacterium leprae (351 aa), FASTA scores: opt: 1679, E(): 2.9e-93, (78.1% identity in 347 aa overlap). Also highly similar to others e.g. Q9RFM1|PARA PARA PROTEIN from Streptomyces coelicolor (357 aa), FASTA scores: opt: 1197, E(): 2e-64, (60.45% identity in 306 aa overlap); Q98DZ3|MLL4479|PARA CHROMOSOME PARTITIONING PROTEIN from Rhizobium loti (Mesorhizobium loti) (266 aa), FASTA scores: opt: 835, E(): 7.2e-43, (50.95% identity in 257 aa overlap); O05189|PARA_CAUCR CHROMOSOME PARTITIONING PROTEIN from Caulobacter crescentus (267 aa), FASTA scores: opt: 813, E(): 1.5e-41, (51.35% identity in 261 aa overlap) (has its N-terminus shorter); etc. Equivalent to AAK48403 from Mycobacterium tuberculosis strain CDC1551 (381 aa) but shorter 34 aa. Also similar to other Mycobacterium tuberculosis proteins: MTCI125.30, FASTA scores: E(): 4.3e-32, (35.2% identity in 327 aa overlap); and MTCY07D11.13, FASTA scores: E(): 3e-30, (39.9% identity in 263 aa overlap). BELONGS TO THE PARA FAMILY. Possible alternative start site at aa 107. Note that previously known as parB.; parB" /codon_start=1 /transl_table=11 /product="chromosome partitioning protein ParA" /protein_id="NP_218434.2" /db_xref="GI:57117171" /db_xref="GeneID:886224" /translation="MSAPWGPVAAGPSALVRSGQASTIEPFQREMTPPTPTPEAAHNP TMNVSRETSTEFDTPIGAAAERAMRVLHTTHEPLQRPGRRRVLTIANQKGGVGKTTTA VNIAAALAVQGLKTLVIDLDPQGNASTALGITDRQSGTPSSYEMLIGEVSLHTALRRS PHSERLFCIPATIDLAGAEIELVSMVARENRLRTALAALDNFDFDYVFVDCPPSLGLL TINALVAAPEVMIPIQCEYYALEGVSQLMRNIEMVKAHLNPQLEVTTVILTMYDGRTK LADQVADEVRQYFGSKVLRTVIPRSVKVSEAPGYSMTIIDYDPGSRGAMSYLDASREL AERDRPPSAKGRP" gene complement(4407528..4408202) /gene="gidB" /locus_tag="Rv3919c" /db_xref="GeneID:886243" CDS complement(4407528..4408202) /gene="gidB" /locus_tag="Rv3919c" /function="NOT KNOWN." /note="glucose-inhibited division protein B; SAM-dependent methyltransferase; methylates the N7 position of guanosine in position 527 of 16S rRNA" /codon_start=1 /transl_table=11 /product="16S rRNA methyltransferase GidB" /protein_id="NP_218436.1" /db_xref="GI:15611055" /db_xref="GeneID:886243" /translation="MSPIEPAASAIFGPRLGLARRYAEALAGPGVERGLVGPREVGRL WDRHLLNCAVIGELLERGDRVVDIGSGAGLPGVPLAIARPDLQVVLLEPLLRRTESLR EMVTDLGVAVEIVRGRAEESWVQDQLGGSDAAVSRAVAALDKLTKWSMPLIRPNGRML AIKGERAHDEVREHRRVMIASGAVDVRVVTCGANYLRPPATVVFARRGKQIARGSARM ASGGTA" misc_feature complement(4408152..4408169) /gene="gidB" /locus_tag="Rv3919c" /note="PS00539 Pyrokinins signature." gene complement(4408334..4408897) /locus_tag="Rv3920c" /db_xref="GeneID:886255" CDS complement(4408334..4408897) /locus_tag="Rv3920c" /function="UNKNOWN" /note="Rv3920c, (MTV028.11c), len: 187 aa. Hypothetical protein, similar to JAG protein, equivalent to Q9L7M2 HYPOTHETICAL 20.1 KDA PROTEIN from Mycobacterium paratuberculosis (183 aa), FASTA scores: opt: 1004, E(): 7.3e-52, (85.05% identity in 187 aa overlap); and Q50204|ML2709 HYPOTHETICAL PROTEIN SIMILAR TO JAG PROTEIN SPOIIIJ ASSOCIATED PROTEIN IN BACILLUS SUBTILIS from Mycobacterium leprae (193 aa), FASTA scores: opt: 871, E(): 4.4e-44, (73.05% identity in 193 aa overlap). Also similar to other bacterial proteins e.g. O54595|STH24.06|JAG JAG-LIKE PROTEIN from Streptomyces coelicolor (170 aa), FASTA scores: opt: 593, E(): 6.7e-28, (62.85% identity in 167 aa overlap); Q9RCA6|JAG|BH4063 JAG PROTEIN HOMOLOG from Bacillus halodurans (207 aa), FASTA scores: opt: 282, E(): 1.1e-09, (35.0% identity in 140 aa overlap); Q9X1H1|TM1460 PUTATIVE JAG PROTEIN, PUTATIVE from Thermotoga maritima (221 aa), FASTA scores: opt: 258, E(): 3e-08, (31.9% identity in 138 aa overlap);Q01620|JAG_BACSU JAG PROTEIN (SPOIIIJ ASSOCIATED PROTEIN) from Bacillus subtilis (208 aa), FASTA scores: opt: 196, E(): 0.00012, (28.05% identity in 139 aa overlap); etc." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218437.1" /db_xref="GI:15611056" /db_xref="GeneID:886255" /translation="MADADTTDFDVDAEAPGGGVREDTATDADEADDQEERLVAEGEI AGDYLEELLDVLDFDGDIDLDVEGNRAVVSIDGSDDLNKLVGRGGEVLDALQELTRLA VHQKTGVRSRLMLDIARWRRRRREELAALADEVARRVAETGDREELVPMTPFERKIVH DAVAAVPGVHSESEGVEPERRVVVLRD" gene complement(4408969..4410069) /locus_tag="Rv3921c" /db_xref="GeneID:886238" CDS complement(4408969..4410069) /locus_tag="Rv3921c" /function="UNKNOWN" /note="functions to insert inner membrane proteins into the IM in Escherichia coli; interacts with transmembrane segments; functions in both Sec-dependent and -independent membrane insertion; similar to Oxa1p in mitochondria" /codon_start=1 /transl_table=11 /product="putative inner membrane protein translocase component YidC" /protein_id="NP_218438.1" /db_xref="GI:15611057" /db_xref="GeneID:886238" /translation="MSLLFDFFSLDFIYYPVSWIMWVWYRLFAFVLGPSNFFAWALSV MFLVFTLRALLYKPFVRQIRTTRQMQELQPQIKALQKKYGKDRQRMALEMQKLQREHG FNPILGCLPMLAQIPVFLGLYHVLRSFNRTTGGFGQPHLSVIENRLTGNYVFSPVDVG HFLDANLFGAPIGAYMTQRSGLDAFVDFSRPALIAVGVPVMILAGIATYFNSRASIAR QSAEAAANPQTAMMNKLALYVFPLGVVVGGPFLPLAIILYWFSNNIWTFGQQHYVFGM IEKEEEAKKQEAVRRRAANAPAPGAKPKRSPKTAPATNAAAPTEAGDTDDGAESDAST ERPADTSNPARRNSGPSARTPRPGVRPKKRKR" gene complement(4410053..4410415) /locus_tag="Rv3922c" /db_xref="GeneID:886256" CDS complement(4410053..4410415) /locus_tag="Rv3922c" /function="UNKNOWN" /note="Rv3922c, (MTV028.13c), len: 120 aa. Possible hemolysin, highly similar to Q9L7M0|YIDD_MYCPA HYPOTHETICAL 12.4 KDA PROTEIN from Mycobacterium paratuberculosis (115 aa), FASTA scores: opt: 521, E(): 1.9e-29, (65.2% identity in 112 aa overlap). Also highly similar to Q44066|HLYA_AERHY PUTATIVE ALPHA-HEMOLYSIN from Aeromonas hydrophila (85 aa), FASTA scores: opt: 276, E(): 1.5e-12, (51.45% identity in 70 aa overlap); and to many bacterial hypothetical proteins from bacterium e.g. P22847|YIDD_ECOLI|B3704.1 HYPOTHETICAL PROTEIN from Escherichia coli strain K12 (85 aa), FASTA scores: opt: 276, E(): 1.5e-12, (51.45% identity in 70 aa overlap)." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NP_218439.1" /db_xref="GI:15611058" /db_xref="GeneID:886256" /translation="MSLSRQSCGRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRF VPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDG ANDDWNPASKRGERESFV" gene complement(4410412..4410762) /gene="rnpA" /locus_tag="Rv3923c" /db_xref="GeneID:886248" CDS complement(4410412..4410762) /gene="rnpA" /locus_tag="Rv3923c" /EC_number="3.1.26.5" /function="RNaseP CATALYZES THE REMOVAL OF THE 5'-LEADER SEQUENCE FROM PRE-tRNA TO PRODUCE THE MATURE 5'TERMINUS. IT CAN ALSO CLEAVE OTHER RNA SUBSTRATES SUCH AS 4.5S RNA. THE PROTEIN COMPONENT PLAYS AN AUXILIARY BUT ESSENTIAL ROLE IN VIVO BY BINDING TO THE 5'-LEADER SEQUENCE AND BROADENING THE SUBSTRATE SPECIFICITY OF THE RIBOZYME [CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE OF RNA, REMOVING 5'-EXTRA-NUCLEOTIDE FROM tRNA PRECURSOR]." /note="protein component of RNaseP which catalyzes the removal of the 5'-leader sequence from pre-tRNA to produce the mature 5'terminus; this enzyme also cleaves other RNA substrates" /codon_start=1 /transl_table=11 /product="ribonuclease P" /protein_id="NP_218440.2" /db_xref="GI:161352458" /db_xref="GeneID:886248" /translation="MLRARNRMRRSADFETTVKHGMRTVRSDMVVYWWRGSGGGPRVG LIIAKSVGSAVERHRVARRLRHVAGSIVKELHPSDHVVIRALPSSRHVSSARLEQQLR CGLRRAVELAGSDR" misc_feature complement(4410568..4410612) /gene="rnpA" /locus_tag="Rv3923c" /note="PS00648 Bacterial Ribonuclease P protein component signature." gene complement(4410786..4410929) /gene="rpmH" /locus_tag="Rv3924c" /db_xref="GeneID:886258" CDS complement(4410786..4410929) /gene="rpmH" /locus_tag="Rv3924c" /function="INVOLVED IN TRANSLATION MECHANISM. THIS PROTEIN IS ONE OF THE EARLY ASSEMBLY PROTEINS OF THE 50S RIBOSOMAL SUBUNIT (BY SIMILARITY)." /note="in Escherichia coli transcription of this gene is enhanced by polyamines" /codon_start=1 /transl_table=11 /product="50S ribosomal protein L34" /protein_id="NP_218441.1" /db_xref="GI:15611060" /db_xref="GeneID:886258" /translation="MTKGKRTFQPNNRRRARVHGFRLRMRTRAGRSIVSSRRRKGRRT LSA" misc_feature complement(4410855..4410917) /gene="rpmH" /locus_tag="Rv3924c" /note="PS00784 Ribosomal protein L34 signature" ORIGIN 1 ttgaccgatg accccggttc aggcttcacc acagtgtgga acgcggtcgt ctccgaactt 61 aacggcgacc ctaaggttga cgacggaccc agcagtgatg ctaatctcag cgctccgctg 121 acccctcagc aaagggcttg gctcaatctc gtccagccat tgaccatcgt cgaggggttt 181 gctctgttat ccgtgccgag cagctttgtc caaaacgaaa tcgagcgcca tctgcgggcc 241 ccgattaccg acgctctcag ccgccgactc ggacatcaga tccaactcgg ggtccgcatc 301 gctccgccgg cgaccgacga agccgacgac actaccgtgc cgccttccga aaatcctgct 361 accacatcgc cagacaccac aaccgacaac gacgagattg atgacagcgc tgcggcacgg 421 ggcgataacc agcacagttg gccaagttac ttcaccgagc gcccgcacaa taccgattcc 481 gctaccgctg gcgtaaccag ccttaaccgt cgctacacct ttgatacgtt cgttatcggc 541 gcctccaacc ggttcgcgca cgccgccgcc ttggcgatcg cagaagcacc cgcccgcgct 601 tacaaccccc tgttcatctg gggcgagtcc ggtctcggca agacacacct gctacacgcg 661 gcaggcaact atgcccaacg gttgttcccg ggaatgcggg tcaaatatgt ctccaccgag 721 gaattcacca acgacttcat taactcgctc cgcgatgacc gcaaggtcgc attcaaacgc 781 agctaccgcg acgtagacgt gctgttggtc gacgacatcc aattcattga aggcaaagag 841 ggtattcaag aggagttctt ccacaccttc aacaccttgc acaatgccaa caagcaaatc 901 gtcatctcat ctgaccgccc acccaagcag ctcgccaccc tcgaggaccg gctgagaacc 961 cgctttgagt gggggctgat cactgacgta caaccacccg agctggagac ccgcatcgcc 1021 atcttgcgca agaaagcaca gatggaacgg ctcgcggtcc ccgacgatgt cctcgaactc 1081 atcgccagca gtatcgaacg caatatccgt gaactcgagg gcgcgctgat ccgggtcacc 1141 gcgttcgcct cattgaacaa aacaccaatc gacaaagcgc tggccgagat tgtgcttcgc 1201 gatctgatcg ccgacgccaa caccatgcaa atcagcgcgg cgacgatcat ggctgccacc 1261 gccgaatact tcgacactac cgtcgaagag cttcgcgggc ccggcaagac ccgagcactg 1321 gcccagtcac gacagattgc gatgtacctg tgtcgtgagc tcaccgatct ttcgttgccc 1381 aaaatcggcc aagcgttcgg ccgtgatcac acaaccgtca tgtacgccca acgcaagatc 1441 ctgtccgaga tggccgagcg ccgtgaggtc tttgatcacg tcaaagaact caccactcgc 1501 atccgtcagc gctccaagcg ctagcacggc gtgttcttcc gacaacgttc ttaaaaaaac 1561 ttctctctcc caggtcacac cagtcacaga gattggctgt gagtgtcgct gtgcacaaac 1621 cgcgcacaga ctcatacagt cccggcggtt ccgttcacaa cccacgcctc atccccaccg 1681 acccaacaca caccccacag tcatcgccac cgtcatccac aactccgacc gacgtcgacc 1741 tgcaccaaga ccagactgtc cccaaactgc acaccctcta atactgttac cgagatttct 1801 tcgtcgtttg ttcttggaaa gacagcgctg gggatcgttc gctggatacc acccgcataa 1861 ctggctcgtc gcggtgggtc agaggtcaat gatgaacttt caagttgacg tgagaagctc 1921 tacggttgtt gttcgactgc tgttgcggcc gtcgtggcgg gtcacgcgtc atgggcattc 1981 gtcgttggca gtccccacgc tagcggggcg ctagccacgg gatcgaactc atcgtgaggt 2041 gaaagggcgc aatggacgcg gctacgacaa gagttggcct caccgacttg acgtttcgtt 2101 tgctacgaga gtctttcgcc gatgcggtgt cgtgggtggc taaaaatctg ccagccaggc 2161 ccgcggtgcc ggtgctctcc ggcgtgttgt tgaccggctc ggacaacggt ctgacgattt 2221 ccggattcga ctacgaggtt tccgccgagg cccaggttgg cgctgaaatt gtttctcctg 2281 gaagcgtttt agtttctggc cgattgttgt ccgatattac ccgggcgttg cctaacaagc 2341 ccgtagacgt tcatgtcgaa ggtaaccggg tcgcattgac ctgcggtaac gccaggtttt 2401 cgctaccgac gatgccagtc gaggattatc cgacgctgcc gacgctgccg gaagagaccg 2461 gattgttgcc tgcggaatta ttcgccgagg caatcagtca ggtcgctatc gccgccggcc 2521 gggacgacac gttgcctatg ttgaccggca tccgggtcga aatcctcggt gagacggtgg 2581 ttttggccgc taccgacagg tttcgcctgg ctgttcgaga actgaagtgg tcggcgtcgt 2641 cgccagatat cgaagcggct gtgctggtcc cggccaagac gctggccgag gccgccaaag 2701 cgggcatcgg cggctctgac gttcgtttgt cgttgggtac tgggccgggg gtgggcaagg 2761 atggcctgct cggtatcagt gggaacggca agcgcagcac cacgcgactt cttgatgccg 2821 agttcccgaa gtttcggcag ttgctaccaa ccgaacacac cgcggtggcc accatggacg 2881 tggccgagtt gatcgaagcg atcaagctgg ttgcgttggt agctgatcgg ggcgcgcagg 2941 tgcgcatgga gttcgctgat ggcagcgtgc ggctttctgc gggtgccgat gatgttggac 3001 gagccgagga agatcttgtt gttgactatg ccggtgaacc attgacgatt gcgtttaacc 3061 caacctatct aacggacggt ttgagttcgt tgcgctcgga gcgagtgtct ttcgggttta 3121 cgactgcggg taagcctgcc ttgctacgtc cggtgtccgg ggacgatcgc cctgtggcgg 3181 gtctgaatgg caacggtccg ttcccggcgg tgtcgacgga ctatgtctat ctgttgatgc 3241 cggttcggtt gccgggctga gcacttggcg cccgggtagg tgtacgtccg tcatttgggg 3301 ctgcgtgact tccggtcctg ggcatgtgta gatctggaat tgcatccagg gcggacggtt 3361 tttgttgggc ctaacggtta tggtaagacg aatcttattg aggcactgtg gtattcgacg 3421 acgttaggtt cgcaccgcgt tagcgccgat ttgccgttga tccgggtagg taccgatcgt 3481 gcggtgatct ccacgatcgt ggtgaacgac ggtagagaat gtgccgtcga cctcgagatc 3541 gccacggggc gagtcaacaa agcgcgattg aatcgatcat cggtccgaag tacacgtgat 3601 gtggtcggag tgcttcgagc tgtgttgttt gcccctgagg atctggggtt ggttcgtggg 3661 gatcccgctg accggcggcg ctatctggat gatctggcga tcgtgcgtag gcctgcgatc 3721 gctgcggtac gagccgaata tgagagggtg ttgcgccagc ggacggcgtt attgaagtcc 3781 gtacctggag cacggtatcg gggtgaccgg ggtgtgtttg acactcttga ggtatgggac 3841 agtcgtttgg cggagcacgg ggctgaactg gtggccgccc gcatcgattt ggtcaaccag 3901 ttggcaccgg aagtgaagaa ggcataccag ctgttggcgc cggaatcgcg atcggcgtct 3961 atcggttatc gggccagcat ggatgtaacc ggtcccagcg agcagtcaga tatcgatcgg 4021 caattgttag cagctcggct gttggcggcg ctggcggccc gtcgggatgc cgaactcgag 4081 cgtggggttt gtctagttgg tccgcaccgt gacgacctaa tactgcgact aggcgatcaa 4141 cccgcgaaag gatttgctag ccatggggag gcgtggtcgt tggcggtggc actgcggttg 4201 gcggcctatc aactgttacg cgttgatggt ggtgagccgg tgttgttgct cgacgacgtg 4261 ttcgccgaac tggatgtcat gcgccgtcga gcgttggcga cggcggccga gtccgccgaa 4321 caggtgttgg tgactgccgc ggtgctcgag gatattcccg ccggctggga cgccaggcgg 4381 gtgcacatcg atgtgcgtgc cgatgacacc ggatcgatgt cggtggttct gccatgacgg 4441 gttctgttga ccggcccgac cagaatcgcg gtgagcgatc aatgaagtca ccagggttgg 4501 atttggtcag gcgcaccctg gacgaagctc gtgctgctgc ccgcgcgcgc ggacaagacg 4561 ccggtcgagg gcgggtcgct tccgttgcgt cgggtcgggt ggccggacgg cgacgaagct 4621 ggtcgggtcc ggggcccgac attcgtgatc cacaaccgct gggtaaggcc gctcgtgagc 4681 tggcaaagaa acgcggctgg tcggtgcggg tcgccgaggg tatggtgctc ggccagtggt 4741 ctgcggtggt cggccaccag atcgccgaac atgcacgccc gactgcgcta aacgacgggg 4801 tgttgagcgt gattgcggag tcgacggcgt gggcgacgca gttgaggatc atgcaggccc 4861 agcttctggc caagatcgcc gcagcggttg gcaacgatgt ggtgcgatcg ctaaagatca 4921 ccgggccggc ggcaccatcg tggcgcaagg ggcctcgcca tattgccggt aggggtccgc 4981 gcgacaccta cggataacac gtcgatcggc ccagaacaag gcgctccggt cccggcctga 5041 gagcctcgag gacgaagcgg atccgtatgc cggacgtcgg gacgcaccag gaagaaagat 5101 gtccgacgca cggcgcggtt agatgggtaa aaacgaggcc agaagatcgg ccctggcgcc 5161 cgatcacggt acagtggtgt gcgaccccct gcggcgactc aaccgcatgc acgcaacccc 5221 tgaggagagt attcggatcg tggctgccca gaaaaagaag gcccaagacg aatacggcgc 5281 tgcgtctatc accattctcg aagggctgga ggccgtccgc aaacgtcccg gcatgtacat 5341 tggctcgacc ggtgagcgcg gtttacacca tctcatttgg gaggtggtcg acaacgcggt 5401 cgacgaggcg atggccggtt atgcaaccac agtgaacgta gtgctgcttg aggatggcgg 5461 tgtcgaggtc gccgacgacg gccgcggcat tccggtcgcc acccacgcct ccggcatacc 5521 gaccgtcgac gtggtgatga cacaactaca tgccggcggc aagttcgact cggacgcgta 5581 tgcgatatct ggtggtctgc acggcgtcgg cgtgtcggtg gttaacgcgc tatccacccg 5641 gctcgaagtc gagatcaagc gcgacgggta cgagtggtct caggtttatg agaagtcgga 5701 acccctgggc ctcaagcaag gggcgccgac caagaagacg gggtcaacgg tgcggttctg 5761 ggccgacccc gctgttttcg aaaccacgga atacgacttc gaaaccgtcg cccgccggct 5821 gcaagagatg gcgttcctca acaaggggct gaccatcaac ctgaccgacg agagggtgac 5881 ccaagacgag gtcgtcgacg aagtggtcag cgacgtcgcc gaggcgccga agtcggcaag 5941 tgaacgcgca gccgaatcca ctgcaccgca caaagttaag agccgcacct ttcactatcc 6001 gggtggcctg gtggacttcg tgaaacacat caaccgcacc aagaacgcga ttcatagcag 6061 catcgtggac ttttccggca agggcaccgg gcacgaggtg gagatcgcga tgcaatggaa 6121 cgccgggtat tcggagtcgg tgcacacctt cgccaacacc atcaacaccc acgagggcgg 6181 cacccacgaa gagggcttcc gcagcgcgct gacgtcggtg gtgaacaagt acgccaagga 6241 ccgcaagcta ctgaaggaca aggaccccaa cctcaccggt gacgatatcc gggaaggcct 6301 ggccgctgtg atctcggtga aggtcagcga accgcagttc gagggccaga ccaagaccaa 6361 gttgggcaac accgaggtca aatcgtttgt gcagaaggtc tgtaacgaac agctgaccca 6421 ctggtttgaa gccaacccca ccgacgcgaa agtcgttgtg aacaaggctg tgtcctcggc 6481 gcaagcccgt atcgcggcac gtaaggcacg agagttggtg cggcgtaaga gcgccaccga 6541 catcggtgga ttgcccggca agctggccga ttgccgttcc acggatccgc gcaagtccga 6601 actgtatgtc gtagaaggtg actcggccgg cggttctgca aaaagcggtc gcgattcgat 6661 gttccaggcg atacttccgc tgcgcggcaa gatcatcaat gtggagaaag cgcgcatcga 6721 ccgggtgcta aagaacaccg aagttcaggc gatcatcacg gcgctgggca ccgggatcca 6781 cgacgagttc gatatcggca agctgcgcta ccacaagatc gtgctgatgg ccgacgccga 6841 tgttgacggc caacatattt ccacgctgtt gttgacgttg ttgttccggt tcatgcggcc 6901 gctcatcgag aacgggcatg tgtttttggc acaaccgccg ctgtacaaac tcaagtggca 6961 gcgcagtgac ccggaattcg catactccga ccgcgagcgc gacggtctgc tggaggcggg 7021 gctgaaggcc gggaagaaga tcaacaagga agacggcatt cagcggtaca agggtctagg 7081 tgaaatggac gctaaggagt tgtgggagac caccatggat ccctcggttc gtgtgttgcg 7141 tcaagtgacg ctggacgacg ccgccgccgc cgacgagttg ttctccatcc tgatgggcga 7201 ggacgtcgac gcgcggcgca gctttatcac ccgcaacgcc aaggatgttc ggttcctgga 7261 tgtctaacgc aaccctgcgt tcgattgcaa acgaggaata gatgacagac acgacgttgc 7321 cgcctgacga ctcgctcgac cggatcgaac cggttgacat cgagcaggag atgcagcgca 7381 gctacatcga ctatgcgatg agcgtgatcg tcggccgcgc gctgccggag gtgcgcgacg 7441 ggctcaagcc cgtgcatcgc cgggtgctct atgcaatgtt cgattccggc ttccgcccgg 7501 accgcagcca cgccaagtcg gcccggtcgg ttgccgagac catgggcaac taccacccgc 7561 acggcgacgc gtcgatctac gacagcctgg tgcgcatggc ccagccctgg tcgctgcgct 7621 acccgctggt ggacggccag ggcaacttcg gctcgccagg caatgaccca ccggcggcga 7681 tgaggtacac cgaagcccgg ctgaccccgt tggcgatgga gatgctgagg gaaatcgacg 7741 aggagacagt cgatttcatc cctaactacg acggccgggt gcaagagccg acggtgctac 7801 ccagccggtt ccccaacctg ctggccaacg ggtcaggcgg catcgcggtc ggcatggcaa 7861 ccaatatccc gccgcacaac ctgcgtgagc tggccgacgc ggtgttctgg gcgctggaga 7921 atcacgacgc cgacgaagag gagaccctgg ccgcggtcat ggggcgggtt aaaggcccgg 7981 acttcccgac cgccggactg atcgtcggat cccagggcac cgctgatgcc tacaaaactg 8041 gccgcggctc cattcgaatg cgcggagttg ttgaggtaga agaggattcc cgcggtcgta 8101 cctcgctggt gatcaccgag ttgccgtatc aggtcaacca cgacaacttc atcacttcga 8161 tcgccgaaca ggtccgagac ggcaagctgg ccggcatttc caacattgag gaccagtcta 8221 gcgatcgggt cggtttacgc atcgtcatcg agatcaagcg cgatgcggtg gccaaggtgg 8281 tgatcaataa cctttacaag cacacccagc tgcagaccag ctttggcgcc aacatgctag 8341 cgatcgtcga cggggtgccg cgcacgctgc ggctggacca gctgatccgc tattacgttg 8401 accaccaact cgacgtcatt gtgcggcgca ccacctaccg gctgcgcaag gcaaacgagc 8461 gagcccacat tctgcgcggc ctggttaaag cgctcgacgc gctggacgag gtcattgcac 8521 tgatccgggc gtcggagacc gtcgatatcg cccgggccgg actgatcgag ctgctcgaca 8581 tcgacgagat ccaggcccag gcaatcctgg acatgcagtt gcggcgcctg gccgcactgg 8641 aacgccagcg catcatcgac gacctggcca aaatcgaggc cgagatcgcc gatctggaag 8701 acatcctggc aaaacccgag cggcagcgtg ggatcgtgcg cgacgaactc gccgaaatcg 8761 tggacaggca cggcgacgac cggcgtaccc ggatcatcgc ggccgacgga gacgtcagcg 8821 acgaggattt gatcgcccgc gaggacgtcg ttgtcactat caccgaaacg ggatacgcca 8881 agcgcaccaa gaccgatctg tatcgcagcc agaaacgcgg cggcaagggc gtgcagggtg 8941 cggggttgaa gcaggacgac atcgtcgcgc acttcttcgt gtgctccacc cacgatttga 9001 tcctgttctt caccacccag ggacgggttt atcgggccaa ggcctacgac ttgcccgagg 9061 cctcccggac ggcgcgcggg cagcacgtgg ccaacctgtt agccttccag cccgaggaac 9121 gcatcgccca ggtcatccag attcgcggct acaccgacgc cccgtacctg gtgctggcca 9181 ctcgcaacgg gctggtgaaa aagtccaagc tgaccgactt cgactccaat cgctcgggcg 9241 gaatcgtggc ggtcaacctg cgcgacaacg acgagctggt cggtgcggtg ctgtgttcgg 9301 ccggcgacga cctgctgctg gtctcggcca acgggcagtc catcaggttc tcggcgaccg 9361 acgaggcgct gcggccaatg ggtcgtgcca cctcgggtgt gcagggcatg cggttcaata 9421 tcgacgaccg gctgctgtcg ctgaacgtcg tgcgtgaagg cacctatctg ctggtggcga 9481 cgtcaggggg ctatgcgaaa cgtaccgcga tcgaggaata cccggtacag ggccgcggcg 9541 gtaaaggtgt gctgacggtc atgtacgacc gccggcgcgg caggttggtt ggggcgttga 9601 ttgtcgacga cgacagcgag ctgtatgccg tcacttccgg cggtggcgtg atccgcaccg 9661 cggcacgcca ggttcgcaag gcgggacggc agaccaaggg tgttcggttg atgaatctgg 9721 gcgagggcga cacactgttg gccatcgcgc gcaacgccga agaaagtggc gacgataatg 9781 ccgtggacgc caacggcgca gaccagacgg gcaattaatc aggctcgccc gacgacgatg 9841 cggatcgcgt agcgatctga ggaggaatcg ggcagctagg ctcggcagcc gggtacgagt 9901 gttaggagtc ggggtgactg caccgaacga gccgggggcg ctcagcaagg gcgacggccc 9961 gaatgcggat ggcttggtcg accgtggggg cgcacatcgg gcagcgaccg ggccaggccg 10021 cataccagat gctggagacc cgccgccgtg gcagcgtgct gcgactcggc aatcccaagc 10081 ggggcatcgt cagccgccgc cggtatcaca ccctgagggg cgcccgacca acccgcccgc 10141 cgccgccgat gctcggctga atcgcttcat ctccggtgcg tctgccccgg tgaccggccc 10201 agccgccgcg gtcaggaccc cgcagccgga tcccgacgct tcgctggggt gtggcgacgg 10261 ttcccccgcc gaggcctatg ccagcgagct gcccgaccta tccggcccga ctccgcgggc 10321 cccgcaacgc aaccccgcgc cggcgcgtcc cgcggagggt ggcgcgggat cgagagggga 10381 ttcggccgcc ggttcgagcg gcggtcgttc gattaccgct gagagtagag acgcccgtgt 10441 ccagctgtcg gcgcggcgaa gccgcgggcc ggttcgagcc agcatgcaga tccgacggat 10501 tgatccatgg agcacgttga aggtgtcgct gttgttgtcg gtggcgctgt tcttcgtctg 10561 gatgatcacg gtcgcgttcc tctacctggt gctcggcggt atgggcgtat gggccaagct 10621 caacagcaac gtcggtgacc tgttgaacaa cgcgagcggc agcagcgcgg aacttgtctc 10681 cagcggcacc atcttcggcg gcgcattcct gatcggcttg gtcaacatcg tcctgatgac 10741 cgcgcttgcc accatcggtg cgttcgtcta caacctgatc accgatctga tcggcggcat 10801 cgaagtgacg ctggcagacc gggactaatg ttttgagagt cgggcgccgg ttgcggtaat 10861 ctcgtcgctc ggccgtacgc gagtacgggc ctatagctca ggcggttaga gcgcttcgct 10921 gataacgaag aggtcggagg ttcgagtcct cctaggccca cgaccatgtg cccgtcacga 10981 cgttcggtga ggttcgcatt gccactggcc gcgatcgctg tggcggccat cgtcgtgcgg 11041 ttccgacgcg gagccgatgt ctggcatgtg gccggcgatc cacctcctga tcacataacc 11101 ggtgacgaag aggggcctta gctcagttgg tagagcactg cctttgcaag gcaggggtca 11161 ggggttcgag tcccctaggc tccacaagtg aaaagcgtag ctcggatact tcgaatgacc 11221 acgtttgatc acaatcgcga gtgaagaggg cgttgatggc cactccgacg gcctcgacac 11281 ccgacccgta caggtggcgg tagcggtcca aggtcaaccc ggcggagtcg tgttcgagca 11341 tgttctgaag tgccttgaat tcgccccggc ctggatcgcc aacgacgccg ggtgtgccga 11401 gctcatgcag ttttgaactc ctacaccacc gccggcttcc cggtagcgtc catcacagtc 11461 tgagggaaca gctgcgccgc ggtcaccgcc tgcgaccacc accggcgccg cacatggctg 11521 ccgcgcatgt agccgcccgc cgagtccggg aacgctagaa gctcagcaac ccatcgaacg 11581 cggtcggccg gttgtcggcg tccacgagca cgcaccctag agcgaaagtc atggatccgc 11641 cgttggcggg gtctccggta ttgccggact cgtctatgta agcgaccagc acgcgacgat 11701 gctggcacga ttcttgggcg attgaccaca gttacagata actactgtta accgcagttg 11761 tgtcctttcg caggtggact gagttgtaac ccattgatct gcatcatgat tcgcctgtgc 11821 aaggcggggg tcaggggttc gaatccctag gccccaccgt gtgacgaccg gcctcaggag 11881 cgcggttgca cctcgacgct cggtggtcgg ggcgacggct ccggtcgcga cgagcgccgg 11941 acgatgctga aggcgacggc accgccggcg aggatggccg ccgcgatccc cgcgaagatc 12001 cagaggtggt gtttgctgcg acgttgggtc cgggcgtcct gtagggcctg cggtaggttg 12061 gccaccacgt cctgggcagc ggtcagctct tgagcgagcg tctcttgggc ggcagcgacc 12121 tcgcgggcca atcggccttc ccggtaacgg cggcgaagcc cggccgcggt cgaccgggcg 12181 gactgaagcc caagtccgac cccgagttcg aggaggcctc gggtcacgtc caccggaccc 12241 accgcagagt aggccagacc ccgggtcagc cgctcgcgtg gggtcaaccg ggtttccacc 12301 tgctcactca ttttgccgcc tttctgtgtc cgggccgagg cttgcgctca ataactcggt 12361 caagttcctt cacagactgc catcactggc ccgtcggcgg gctcgttgcg ggtgcgccgc 12421 gtgcgggttt gtgttccggg caccgggtgg gggcccgccc gggcgtaatg gcagactgtg 12481 attccgtgac taacagcccc cttgcgaccg ctaccgccac gctgcacact aaccgcggcg 12541 acatcaagat cgccctgttc ggaaaccatg cgcccaagac cgtcgccaat tttgtgggcc 12601 ttgcgcaggg caccaaggac tattcgaccc aaaacgcatc aggtggcccg tccggcccgt 12661 tctacgacgg cgcggtcttt caccgggtga tccagggctt catgatccag ggtggcgatc 12721 caaccgggac gggtcgcggc ggacccggct acaagttcgc cgacgagttc caccccgagc 12781 tgcaattcga caagccctat ctgctcgcga tggccaacgc cggtccgggc accaacggct 12841 cacagttttt catcaccgtc ggcaagactc cgcacctgaa ccggcgccac accattttcg 12901 gtgaagtgat cgacgcggag tcacagcggg ttgtggaggc gatctccaag acggccaccg 12961 acggcaacga tcggccgacg gacccggtgg tgatcgagtc gatcaccatc tcctgacccg 13021 aagctacgtc ggctcgtcgc tcgaatacac cttgtggacc cgccagggca cgtggcggta 13081 caccgacacg ccgttggggc cgttcaaccg gacgccctca cgccaagtcc gctcaccttt 13141 ggccgcgacc ggcgtaaccg gcagcggtaa gcgcatcgag cacctccact gggtcggtgc 13201 cgagatccca gcgggacaaa atcagcagcc ccccgctgac cgtttcgatc tcgagcaggc 13261 gcaccaggcg gccgtaacgg cgaaactcgt cgattcggat gatcttgata ttggaatgtc 13321 gtaatagctg cgtccggaac caacctcgga tcgccaggcc gtcgggggta attgccagcc 13381 ttggacgtgc gcgccaagtg gcgctcgcaa acaagatcag acccagcgcg gcaactccgg 13441 tcaacacccg cccgggcgta tctgtgacta aggtcacaga cgcaatagcc atcacgactc 13501 ccccggctcc gcaaccagcg attcccgagg tgcgaggcgc ccatgctgtt tgctgcatgt 13561 attccttaga ccctctcacc actgcagaca aagttatcca cagacgctat caacagtggg 13621 gatgaatcac atgcgtgtga ttgagtgacc aaaaggttgc tggcacagta acgacccgac 13681 cagaatatga attcattcta tcggcggcgt ggatcaatgc cagcgcatcg tgagcaacaa 13741 accggtgatc atgaaagcga acgcgatcgc atagttccag ggaccgagtt gcgccatcca 13801 attgagcgct gtgggggctt ggctgccaat ggctgccaac tgaaacacca ttaaccagat 13861 gagtccgatc agcatcagac cgatgaacaa cgagacgaac catacgctcg acggtccgac 13921 cttcaccttc atcggcgtgc ggctcaccgc gctgacggtg aagtcgttct tcttgcggac 13981 cttggacttg ggcatcactt tcctcgggat ctggcgggac tacctcgaca agacgacgaa 14041 tggcccgggg tgcaacgata gaagttgcag ctgcaggcat accttgttat gagactaacc 14101 cacccaacac cctgcccgga aaacggagag accatgattg atcggcgccg atcggcgtgg 14161 cgtttcagtg tccccttagt gtgcttgctg gcggggctgc tgctggccgc cacgcatggg 14221 gtgtcgggcg gcaccgagat ccgccgcagc gatgcgccgc gactggtcga ccttgtccgt 14281 cgggcgcagg catcggtgaa ccgtctcgcc accgaacgcg aagcgctgac caccagaatc 14341 gactcggtgc acggccgatc tgtcgatacc gcgttggcgg ccatgcagcg gcggtccgcc 14401 aagctggccg gtgtggcggc tatgaatccg gtccatgggc cgggcctggt ggttaccctg 14461 caagacgcgc aacgcgacgc caacggccgg tttccgcgcg acgcgtcccc ggacgatctg 14521 gttgtgcatc agcaagacat cgaggctgtc ctcaacgcgt tgtggaatgc cggtgctgag 14581 gcgatccaga tgcaggacca gcgcatcatc gcgatgtcga tagctcgttg tgtcggaaac 14641 acgttgctgc tcaacgggcg tacctatagc ccgccctaca cgatcgccgc gatcggagac 14701 gccgccgcca tgcaggctgc tctggctgcg gctcccctgg tgacgctcta caagcagtac 14761 gtggtccggt tcggcctcgg gtactgcgaa gaagtccatc ctgacttgca gatagtcggc 14821 tatgccgatc ccgtccggat gcacttcgcg cagcctgcag gccccttgga ctactgaacg 14881 actgccggca gggtcaggcg gtagcctgtc acgatgcgga tcctggtcgt tgacaactac 14941 gacagcttcg tgttcaacct ggtgcagtac ctcggccagc tcggcatcga ggccgaggtg 15001 tggcgcaacg acgaccaccg gctatccgat gaggccgccg tcgccggcca attcgacggt 15061 gtcctgctca gtcccggtcc gggtaccccg gagcgcgcgg gcgcgtcggt gagtatcgtg 15121 cacgcgtgtg cggcagcaca cacccctttg ctgggggtct gccttgggca ccaagccatc 15181 ggcgttgcgt tcggcgccac cgtggaccgt gcgcccgagc tattgcacgg caagaccagc 15241 agcgtattcc acaccaatgt cggtgtgcta caagggcttc cggatccctt cacggccact 15301 cgataccatt cgttgacaat tctgcctaag tcgctgccag cggtgctgag ggtcacggcc 15361 cgcactagca gcggtgtgat catggccgtg cagcacaccg ggctgccgat ccacggtgtc 15421 cagttccatc cggagtcgat tctcaccgag ggcgggcacc gcatactggc caactggctc 15481 acctgctgcg gatggacgca agacgacacc ctggtacgtc ggctggaaaa cgaagtgctc 15541 accgccatct caccgcactt cccaacttca accgctagcg cgggcgaagc tactggccga 15601 acctcagcgt gatgatgccg tcccggttga cgccggtccc cgccggcggg ttttgataga 15661 cgacccggtt gtgttgggag ccaccggcgt cgacgtcggc ccctttgtcg agcatcccgg 15721 tccagcccag cgcgcgcaat cgtggttcgg cgtcgaccca gaacatgccg gataggtcgg 15781 gcatgacgaa ttggttgccc ttggacacct gtagttcgat gactgaatcg accggaactg 15841 tggtgcctgc gggtggattg gtgccggtca cctcgccggc gggacggggg ctgtccaccg 15901 aggcctgact gaatttggtg aagccgtaga cgttgaggtt cttctgcgcc acgtcgacgg 15961 tctggcccgc gacatcggga atgtctttgg tcgccggacc agagccaacg atgatgatga 16021 ccacattggt gatggccgac gtctggttgg ctggcgggtt ggtcccgatg accttgccca 16081 ccagttccgg ggtggacggc gaattcgctt gcttgaagcg gccgaatccg gcggcagtca 16141 gtttcttgac cgcttcggcg tatgtcagcg tggagacgtc gggtatttcg cgttgctcgg 16201 gtccggtgga cacgttgact gtgatctcgt cgcctgcact caccgacgtg ttggcggccg 16261 ggtcggtgcc gataacgtgg tccggtggga ttgtcgagtc cggcttctgc aaggtgcgga 16321 ttttgaagcc ccggttttgc agtgtggcga tggcgtcggc ggaggattga ccccgaacgt 16381 cgggaacttg aacgtcgcgg gtgatgccgc cgaacgtgtt gatggcgatg gttaccacga 16441 cggtcagcac agcgagcacg gcgaccaccg caacccaacg gcccaccgaa ccgatgctgc 16501 ggtcacggtc ggtgtcgtct aagtcctggc gtggtagcgg atcggtgcgc ggaccgctaa 16561 ggttgccggc cgcagacgac agcagcgagg tccgctcggc atcggtgagc actttgggcg 16621 cctcgggcgg ctcaccgttg tgcacgcgga ccaggtcggc gcgcatctcc gccgctgtct 16681 gatagcggtt ttccggattt ttggccagcg ccttgagaac gacggcgtcc aggtcggcgg 16741 agaggccttc gtgccgcgcc gaaggtggga tcgggtcttc gcgcacatgt tggtaggcaa 16801 ccgagacggg tgagtcgccg gtgaaaggtg gctccccggt gaggacttca taaagaacac 16861 agcccaagga atagacatcg gatcgggcgt cgacggaatc accccgggcc tgttcgggtg 16921 acaggtactg cgccgtgccg atcactgctg cggtctgggt cacgctgttg ccgctgtcgg 16981 caatggcgcg ggcgatgccg aaatccatca cctttactgc attggtcgcg ctgatcatga 17041 tgttcgccgg cttgacgtca cggtggatga ttccgttctg atgactgaag ttcagcgctt 17101 ggcaggcgtc ggcgatgacc tcgatggcgc gtttgggcgt catcggccct tcggtgtgga 17161 caatgtcgcg cagggtaacg ccgtcgacgt attccatgac gatgtagggc aatggcccgg 17221 cgggcgtttc ggcttcaccg gtgtcgtaga ccgcgacgat tgcagggtgg ttcaatgccg 17281 cggcgttttg cgcctcacgc cggaagcgaa ggtaaaaact gggatcgcgg gctagatcag 17341 cgcgcagcac cttgaccgca acgtcgcggt gcaaccggag gtcgcgggcc aggtggacct 17401 cggacatgcc cccaaatcca aggatttcgc caagttcgta gcggtcggac aggtgggaag 17461 gggtggtcat tgcgctatct cgtatcgggc cagcgacgcg cgcgaatgcg gtgtcggcgg 17521 gacaacccag ctttgcagtc cagaatgacg tgtttccccg cgttccgtcc aattgagtcg 17581 cgggctagca tcagtcccgc cagtgttgct ggccggaggg ttcccggtgg tggtcacggt 17641 cggcgtcggt gcctgctgcg ggctgttgtc cccgggcgct ttgatgacga gcagcacggc 17701 gatgatgatt gccagcgccc ccagcacccc cgcggcccag agcagcgcac gctgaccgga 17761 cgaaaacgtg cgccgcggcg gccggtgacc acccgtggcc gggcgggatc gacgggatgc 17821 cgcagtccgg ccagcagagt tggccgcgac cctggctgtc gtacccgacg gaatggccgc 17881 cggggcggcc cggccagggg ggggtgtctg gctgggccgc gggggccggc ggccggcgcg 17941 caccgctgcc accgcgtcgg cgaacggtcc cccactgcga tagcgcatcg cggggttctt 18001 caccagagtt atctcgatga gttctcgcac attgggcggc aggtcgggag gcagcggcgg 18061 cggcggctcc ttgatgtgct tcattgccac ggtcagggca ccatcgccgg cgaacggccg 18121 tttacccgaa accgcttcat acccaacaac tcccagtgaa tagacgtcgc tggccgggct 18181 ggcgtcgtga ccgagggcct gctccggcgc gatgtattgg gcggtgccca tcaccatgcc 18241 ggtctgggtc acgggcgctg catcgacggc tttggcgatg ccgaagtcgg tgatcttcac 18301 ctgcccggtg ggggtgatca agatgttgcc cggtttgacg tcgcggtgca ccaggccagc 18361 ggcatgcgcg atctgcagag cgcggccggt ctgctcgagc atgtccagtg cgtgccgcaa 18421 cgacagccgg ccggtgcgtt tgagcaccga atttagtggc tcgccgttga ccagctccat 18481 caccaggtag gccgtgcgac cctccccgtt catctggctt tcgccgtagt cgtgcacgct 18541 ggcgatgccc ggatggttca gcatcgcggt ggtgcgcgct tcggcccgga accgttcgat 18601 gaactccgga tcggaggaga actcgctctt gagcaccttc accgcaacac gccggcccaa 18661 ccggttatcc acggcctccc agacttggcc cataccaccg gtggcgatga ggcgctgcag 18721 gcggtatctg cccgacagcg tcacgccaac tcgggggctc atggttcccc ctgcagtgcg 18781 gcttcgatca ccgcccgccc gatcggtgcc gcgagggcac ctccggtggc ggacagccga 18841 tcagccccgt tctccaccag cacggcaaca gccaccttgg gcgcttgtgc gggcgcaaag 18901 gcgatgtacc aagcgtgcgg tggagtgtga cgagggtcgg tgccatgttc ggcggtgccc 18961 gtcttggatg cgatctgcac gccggggatt gcccctttct gctgtgcgac tttctcggcg 19021 ccgaccatca gctctgttag cttagcggcg acctgcggtg acaccgcgcg gcgctgctgg 19081 tatccgacgg tggttgagat attggctagg tccggtccct tgaggctgcc gactagataa 19141 ggcctcatcg taatgccgcc gtttgcgatg gtcgcggcta tttctgcgtt cgctagcggg 19201 gtcagcgcaa cgtccttttg gccgatactg gtcatcccta gtgcggcgct gtccgggata 19261 ggcccgacgg ttgattccgc cacttgcagc ggagttgggc gcggtgggct atcgagaccg 19321 aacgcgcgcg ccatgctgcg cagggcgtcg gcgccggtgc ggatgcccag ctggacgaat 19381 gcggtgttgc atgatttgac gaatgcctca cgcagcgaca cggtgggttc gtccccgcac 19441 ggcgcaccgc cgtagttctc tagctgggcg gtgctgcctg gcaacggaat tgtgggcgcc 19501 gcagtcagct gttcggtctc ggtggccccg gcggccagcg cggccgcagt ggtgatcact 19561 ttgaaagtcg aacccggtgg atacgtctca gagatggcac ggttggtcag tggagaggcg 19621 ggattgtcgc caagccgctg ccaggcttgc gcctgcacct cggggttatg cgacgccagc 19681 aggttggggt cgtaggacgg agaagacacc aacgccaaaa tcttgccggt tgatggctca 19741 agggcgacca ccgctccctt acagggcccg tagcagcctt gctgcatcgc gtcccagccg 19801 gcttgctgaa tgcgcgggtt gatcgtggta tcgacattac cgccgcgtgg gtcgcgaccg 19861 gtgaagaagt cggccagccg gcggccgaac agacggcggt cggacccgtt caatatcggg 19921 tcctcggctc gttctagggc ggtgctggaa tagcgcaggg agtagaagcc ggtaaccggc 19981 gcgtacacct caggattggg atagacccgc aggaaacgaa agcggccgtc ggtggctacc 20041 gagtacgcca gcagttggcc accagcggtg atctggccgc gctgccgtga atactcgtcg 20101 agcaacactc gctggttgcg gggatcggca cgcagcccgt cggcggtgaa gacctgcgtc 20161 atggtcgcgt tgagcagtag caacacgatc aacgccatca cggtcaccga tattcggcgc 20221 agagaggcgt tcatacgcgt tcgatgacct cggtgccggc cgccgtaatc ggcgacttat 20281 ttcgtgggcg ggtgcgcagt gggcggcggg ctccgtgcga gatgcgtgcc aggatggcca 20341 gcaatatgta gttggccagc agtgaagacc cgccgtagga catccacggt gtggtcaacc 20401 cggtcagcgg aatgagtcgg gtcacaccgc cgacgacgat gaacagctga atggctagcg 20461 tcgatgagag gccggcggcc agcagcttgc cgaagctatc gcgggtggcg atggccgtgc 20521 gcaaaccccg gatgatcacg atggtgtaga gcatcaggat ggccgtcaag cccaccaacc 20581 caagctcttc gccgaacgcg gcgatgatga aatcggtgga tgccgcgggc acggtgtcgg 20641 gttgaccatt accgagcccg gtgccgaaga taccgcctgt agcgaagctg aaaagcgact 20701 gcacgatctg atatccggtg ccgtctggat ctgcgaacgg atccagccag gtctgtacgc 20761 ggagccggac gtgctcaaaa atgaagtacg ccaccaaggt tcctgccgcg aacagagtca 20821 ggccgatgac gacccaactg aaccgctggg tggcgaggta aaccaccacc agaaacgatg 20881 tgtacagcag cagcgaagcg ccgaggtctt tctcgaagac catcacaccc accgagatga 20941 cccaggctgc caacagtggc gcgaggtctc gcgggcgcgg cagggtcatt ccgagcaaat 21001 gtttgccggc gctggtgaac aggccgcgtt tggccaccag taccgccgaa aagaagatca 21061 gcagcagaat ctttgaaaat tcggcgggtt gaatcgagaa gccgggcaac cggatccaga 21121 tcttggcgcc gttctgttcg gacagtgctg ccgggagcag cgcgggaact gccaagaaaa 21181 ccagacccgc gagcccgcaa atgtagccgt agcgtgcgag ctgtcggtgg tccttgagga 21241 aggtcaccac gagcgcgaag gcagctacgc ccaccagcgt ccacagcatc tgctggtttg 21301 cgctggggtg ccgatgctcg ccgatctcgt tgtccaccag atcgaggcgg tggatcatta 21361 ccaggccaag tccgttgagc agtgccacca ccgggagcaa cagcgggtca gtgtaggggg 21421 cgaagcgccg gatggccaga tgcgcggatc cgaacagggt caggaaggcc agtccgtagc 21481 tagtcaagtc ccagggcacc ccctggtctt gattggcctg cacgaccagc agtgcggcaa 21541 acgtgattac ggcggcaaag cacagcagca gcagttcagc gttgcgccga gtcggcaacg 21601 ggggcgttac ggccaccggc gcttgcagtc gtgtcgtcat gccgccgccc ggcagtcgat 21661 gcccggctga ggcgggggtg gcggaagtgc ggccatcgtc ggcgagctgg tgacgggcca 21721 aggcgtcggc ggcgacgcgg gcgctgccgg ggaggcactc gtggggatgg caggagtagt 21781 tccggtgggg gccggcgcgg aggtggtggg tgatggagag gctggcgagg aggtgacgtt 21841 tggttcggtt gtctcgctgg tggtgggtgg ggccgggcgc ccgggcgggg acgtggcacg 21901 cggcgccggg caaggcggca gcagggagtt ggccgccagt tcgcgcaact gcccgatggc 21961 gtcatcgaga gtgccggccg ggagaccggc ccgaacctgt gcgcgctccg gcggtcgcag 22021 atcctccagt ttcatcagat ggcagtcgag agggccccca gactgtccgt agctgatctg 22081 cgacagctcg ttacgcgggc tgaggcagcc catcaggtaa ggctggtgca gggacatgcc 22141 cagtagcgac ccttgaatcc cccgcatgat ggacacgctg ccggcgtagt ccgctacgta 22201 gtagttgctg cggatgatcg cgcgaccaat gagcaggccc gcagtcatca gcacggtcac 22261 cagtgcgaca acgaatgcta gccgtcggcc cgaccaccgt ggccgactga atgtatcggc 22321 ctgtggcgga acgcgtttaa cgatctcctt gcgctggctg atggcagagg cccggccggc 22381 ggcggtgttg ggcagggtca gttggtcgtc gtcgcctgag accgccccgg ccagaatcgg 22441 ttgggtctgg ccgtagtcgt agtcgacgac gtcggcgacg acgacagtga cgttgtcggg 22501 gccgccgccg cgcagcgcca gttcaatgag gcggtgagcg ctctcggcaa cctcggggat 22561 ctgcagggcc tcgaggatag tttcatcgct aaccggatcg gacaacccgt ccgagcacag 22621 caggtaacga tcaccggcgc gggcttctcg catggtcagc gtcggttcga cctcatggcc 22681 ggtcaacgcc cgcatgatca acgagcgttg cgggtggctg tgcgcctcct ccggggtgat 22741 ccggccttcg tcgaccagcg tttggacaaa cgtgtcgtcc ttggtgatct gcgtcagctc 22801 accgtcgcgc agcaggtaac cgcgcgagtc accgatatgc accaggccga gccggttgcc 22861 cgcgaacagg attgcggtga gcgtggtacc catgccttcg agatcgggct ccatctcgac 22921 ttgcgctgcg atagccgagt tgccggcgcg caccgcggca tccagcttgg ccagcagatc 22981 gccaccgggc tcgtcgtcat cgagatgggc caatgcggca atcaccaact gggacgccac 23041 ctcgccggcc gcatgcccac ccatgccgtc ggccagggcc aatagccgtg ccccagcgta 23101 gaccgagtct tcgttgttgg cgcgtaccaa gccgcgatcg ctgcgcgccg cgtatcgcag 23161 gaccagggtc acgcgcgcca ctctcccccg caagcgggtg ggggtacccc ccacttgtgg 23221 gggcgcgccc ccaccgcttc tctgcgctct gcatcgtcgc cagcgcgggt cacgggcgca 23281 actcgattgc agttttgccg atgcgaaccg gcgttccgat cggaactcgt accgcagtcg 23341 tcaccttcgc cctgtccagg taagtgccgt tggtcgatcc tagatcttcg acgtaccact 23401 cggagccgcg catagacagc cgagcgtgcc gcgtcgaggc gtagtcgtcg gtcagcacca 23461 gggtcgagtc gtcggcgcgc ccgatcaaca ccggctgttc gctcagcgtg atacgcgcgc 23521 cagtcaacgc accttcggtc accaccaggt agcgtgcagc gtgccggcgc tgacgcgcgc 23581 ctaagagcgt ccctcgcagc gccaggccgc ggcgcatcat gaccgcgccg gtcggcgcat 23641 aaatgtcggt cttcaagatc cgtagcacgg accagatgaa tacccacaac aacatcaaga 23701 atccggcacg cgtcagttgc agtaccaacc cctgcatctg gcgtcctttc cgtcctgcac 23761 cgtctgctcc ggccccgcgc tgccgagcac gtcagcaaag tcacgatact ttgacggtgg 23821 tcggcgcggg tcaaccccgg cagcttcgag cccagtaggt tcagtgcatg cggacgatga 23881 tctcggagtg tcccaagcgg atcacatcac cgtcggccaa ctgccactcc tgtaccggtg 23941 cattgttaac agtggtgccg ttggtggagt tcaggtctgc gagcaatgcg acctgcccgt 24001 cccaccggat ctccaagtga cggcgtgaca caccggtgtc gggcagccgg aactgggcgt 24061 cctgtccgcg accgatgatg ttggagccct cgcggagctg gtaagtgcgt ccgctgccgt 24121 cgtcgagctg cagcgtaacc gacgttccgg cggacccata gccgccctgc ccgtaaccgc 24181 tgtagccacc gggcgctggc tgaccgtagt ccggagcgcc tgattggccg tagtcgtagt 24241 ctcggccggc gggttcggcg tacccgccac cctgaggagc gtatcccggg acccgcgggg 24301 attcggtgta gcgggtgtag tcagcgccgc cgccatagtc ttgccggccg tatgtcgtgg 24361 cgccttgctg gtagccctgg tcgtaaccgc cttggtcggg gtaagccggt cgttgctcgg 24421 gcgggcccgg agggccagag ggcacatagc tgccctcctc gtggcgagcc gggccacgcc 24481 cgtactcccc gtacccgccg tagccgggct ggccgccacc gggtgaaggg ccgtagccgc 24541 cgctttggcg atagccctgg tcgtagccgg gagcgccgta gccggcagcc gggccgggag 24601 aaacaggagg gcgttgctcg tagggcggcg gatagccccc ctgcccttgg tcggggtagc 24661 ctcgaccctg gtcctggtac ccgcgctggt cggggtagcc gcgttgctcg gggtaaccgc 24721 gttgctcggg gtaaccgccc tggtcggggt acccgatttg ctcggggtag tcgccctggt 24781 ccgggtggcg cgggcgtggg tagcccggct ggggcgggta gccgcccgtc tcgggtggat 24841 accccccgcg ggggtcagat ccgccttgcg gatccgggcc accacgcgga tcctcttgcg 24901 gacgcgcata gcggtcgtcg taatactcgt cgggacgccc ctgcccctga ccgccacggt 24961 agctcgaatt gtcactcatt ggtgctactc ctggttctgc gccaaacgcg tggtttgatt 25021 gtggccgggc gcaatcgatg accggcgggt gggtctcaac gtcggggtta acagtgccgc 25081 gggcgcggaa ctggccggta tgcaggttcg acgactgctc gaatcggacg accacatcac 25141 catacgtttg ccacccctgt tcttggatat agtccgccaa gtcccgagca aaaccggttg 25201 acttcagctc aggatcagcg cccaacttct caaagtcgtg cacaccgagg gtaatgatgt 25261 attcgttggg cgccaaaagg cgatttccct gcagcgactg gatgccgtcg gccgcctcgc 25321 ggcgcagcag ggcttcgacc tcttgcggga cgatcgagcc tccaaagatg cgggcaaacg 25381 catcgccaac cgtctgctcg agtttgcgct caacgcgctg aaccagcctt ttctggctac 25441 ccatctttca gcgctcgcct cactgttctg gtgcatcgtc ggcgcaaggc aaacgactcg 25501 cctgtatgtc gtgtcaatca atcatggtat cgggacagtg tgagcgagcg gaaagggccg 25561 gccacgccca ctgagcccgc cggcgcccct ggcagcggat ggggcctgcg gctactacag 25621 tggtatggtc ctccggttgt tgcgggcgag tggcggaatg gcagacgcgc tggcttcagg 25681 tgccagtgtc cttcgggacg tgggggttca agtccccctt cgcccaccgt actgtgagac 25741 gagtcgtgac cgacatcgtc gtcgaaacgg ccgccatggg cgtggttgga acggctagcg 25801 cacgcccacg gccagcccag ggcaaccccg gtatcgacgt gactatcgcc ggcgctgtcc 25861 ggttcagctc ggtcgcggcc gcagccgggt ggcggggcgc ctcggtaccc ttttacacgg 25921 cgcgcatcgc ggtcagcgtc ctcgccgctt gttgcgccat accggttatc acgtcggcgg 25981 ctggcaggac ggcattgacc aggcccgcgg cttgaccggc ggtgacattg gcgatgctgt 26041 agtcacgcgc agcaacggct cgccaatatc tggccatggc ttcttcgcga tggagaatgt 26101 cgagttcggt gtcctcgaat tggtcggtga gggcgttgct tagcacgctc atcgtgtgtc 26161 cttgcggcca gggatagcgc cgtagctgat cgtagatagt ggtgcggcac atgtcgtcgc 26221 cagtggccgc cagcagcggg tcccgcgcct gcggtgtgga taacgcttcg accgtggcgt 26281 agaagcgcgt accgaccaat accccggcgg cgcccaacat caacgcggcg gcaaggcccc 26341 ggccgtcggc gatgcccccg gcggcgatca ccgggatatc agttccccgc gcggtgacca 26401 ggtcgacgat ttcgggtacc aaggtcaggg tggaacgtgg accgtggccg tgcccaccgg 26461 cctcggtgcc ctgagccacc aacacatcgg cgccgacctg cagggctcgc tcggcctggg 26521 tccggttttg gatctggcag accaaccgcg ttccggcgga cttgatggcg tcagcgaaaa 26581 ccgcggggtc cccgaacgac agcatcaccg ccaccggctc atactgcagc gcgaggtcga 26641 gcagctgcgg ttggcgggcc aaagaccagg tgatgaaccc gcagcccacc ggcgctccag 26701 cggcgagatc gaactgccgg gccaaccaat cccggtcccc atagccgccc ccgatgaggc 26761 cgagtccccc tgcgccactt accgcggcag ccagctcacc gccggcgatc aagtccattg 26821 gcgcggacac tatcggatag tcgattccga acatctggct aaaggccgtc gatagcacca 26881 caacaacctc cttggcgagc gtcgtgatga cacgcagatc ctggccgatg gtaggtgatc 26941 aggcgagcca cttcttcgcc gaactcgcga gccgagcctg atcacgctgg gtttggcaac 27001 tgccgggctt gccgaccggg catcaagcgg ccggttgtgg gccaacctgt gcgatcggca 27061 ggtgcaccac gaccccgggc accggggtga cctcgagtcc ttcgttgcgg gccagcagag 27121 ccgcattgtc cgggagctgc ctggaattga tctcgccgcc ggcaatccga cgcagcacgt 27181 cgtgggctgc cgcgagctgt tcgcgctttc ggtactggcc gccgggaagc ttgatgccgg 27241 cccatacgcc gtactccacc cgatgctcga ccgcgtgttg agcacaccgg cgctgctgta 27301 ggagcgggca ccggcgcagg cattggatcc gcgcttgggt ggccgaccgc tcataggcgc 27361 gtgccttagc ggcgccgtcg ctgccgtcgt catcggggta cccgaaccat agttccgggt 27421 cggttgcgca ggggtgtgcc atgtgccggc ctccttgttg aacgaaacat aggcaaaagc 27481 gtatatgtct gtggcgggct ctgcaagaga atcgcgataa aaacgtatat acataagggg 27541 tggccgcggc cgagtcgtat ccgggtagta tccggcttat ggccggagcg tgcggtgagc 27601 cgtgagtcgg ccggcgcggc cattcgcgca cttcgcgagt cgcgtgactg gtccctcgcg 27661 gacctggcgg ccgccactgg cgtaagcacc atgggcctga gctatctgga gcgcggtgcc 27721 cgcaagccac acaaaagcac agttcagaag gtcgaaaatg gcctcggcct gccgcctggc 27781 acctactcgc ggctgttggt cgccgctgat cccgatgcgg agctggcccg actgatcgcc 27841 gcacagccgt ccaacccgac ggctgtccgc cgcgccggtg cggtcgtcgt ggaccgccac 27901 agcgataccg acgtgctgga gggctacgcc gaagcacagc tcgatgccat caaatccgtc 27961 atcgaccgat tgcctgcgac gacctccaac gaatatgaga cgtatattct ctctgtgatc 28021 gcgcaatgcg tgaaggcgga gatgctggcc gccagctcct ggcgggtggc ggtgaacgcc 28081 ggcgccgact cgaccggccg gctcatggag catctgcggg cgctggaagc cacgcgcggc 28141 gcgctactgg agcggatgcc gacaagcttg agcgcccggt tcgatcgggc atgtgcgcag 28201 tcgtcgttac cggaggcggt cgtggccgcg ctaatcggcg tcggcgccga cgaaatgtgg 28261 gatatccgca atcggggcgt catccctgcg ggcgcgctcc cccgcgtccg agccttcgtc 28321 gacgcaatcg aggcaagtca cgacgcggat gaggggcagc agtgaattac agcgaggtcg 28381 agctgttgag tcgcgctcat caactgttcg ccggagacag tcggcgaccg gggttggatg 28441 cgggcaccac accctacggg gatctgctgt ctcgggctgc cgacctgaat gtgggtgcgg 28501 gccagcgccg gtatcaactc gccgtggacc acagccgggc ggccttgctg tctgctgcgc 28561 gaaccgatgc cgcggccggg gccgtcatca ccggcgctca acgggatcgg gcatgggccc 28621 ggcggtcgac cggaaccgtt ctcgacgagg ctcgctcgga taccaccgtt actgcggtta 28681 tgccgatagc ccagcgcgaa gccatacgcc gtcgtgtggc gcggctgcgc gcgcaacgag 28741 cccatgtgct gacggcgcga cgacgggcac gacggcacct ggcggcgctg cgtgcgctgc 28801 ggtaccgggt ggcgcacggc ccgggggtcg cgctggccaa acttcggctg ccgtcgccga 28861 gcggtcgcgc cggcatcgcg gtccacgccg cgctgtcgcg acttggccgt ccctatgtct 28921 ggggcgcaac ggggcccaac cagttcgact gttccggttt ggtccagtgg gcctacgccc 28981 aggcgggtgt tcacctggat cgcaccacct atcaacagat caacgagggg atcccggtgc 29041 cgcgctcaca ggtccggccg ggcgatctgg tcttcccgca ccccgggcac gtgcagctgg 29101 cgatcggcaa caatctggtc gtcgaggcgc cccatgcggg cgcgtcggtt cgggtcagct 29161 cgctgggcaa caacgtgcag attcggcgac cgctgagtgg cagataatcg cccaatcaga 29221 cgggcaggat gagaaggttg aaccatgtcg gagcaagccg ggtcttcggt agctgtcatc 29281 caggagcgcc aggctttgct ggcaaggcaa cacgacgccg tggccgaagc cgaccgtgag 29341 ttggccgacg tgctagccag cgcgcatgcg gccatgcggg aaagcgtccg tcggctggat 29401 gctatcgcgg ccgaactcga ccgcgcggtt ccggatcagg atcagcttgc cgtcgatacg 29461 cccatgggag cgcgtgagtt tcaaacgttc ctggtcgcca agcagcgcga gatcgtagcg 29521 gtcgtcgccg ccgcccacga gctcgatcgc gcaaaaagcg ctgtgctaaa gcgcctgcgg 29581 gcacagtaca cggaaccggc ccgttagctg cggaccggat acgctggacc ggcaggcgtt 29641 gggtgaattg tcggcgacta cacacctagg tactgtcacg cggcatggaa gcgccgggga 29701 cagggcccgc agtgggtcgc agtggcgttt gacgcggcga tgtccacgca cgaagatctc 29761 cttgccacga tcaggtacgt ccgcgaccga accggtgacc caaacgcgtg gcagaccggg 29821 ttgacaccga ccgaggtgac cgcggtggtc acgtccacga cacgttccga acagctcgat 29881 gccattttgc gtaagatccg ccagcggcat tcgaacctgt actatccagc accgcccgat 29941 cgggaacaag gagacgccgc ccgtgccatc gcggatgcgg aagcagctct ggcacatcag 30001 aattcggcta ccgcgcagct cgatctgcag gtcgtctcgg caattctgaa cgcgcatctg 30061 aagactgtcg agggtggcga atcgctgcac gagcttcagc aagagatcga agccgcggta 30121 cgcattcgat ccgatctgga cactccggcc ggcgcgcgtg atttccagcg tttcttgatc 30181 ggcaagctca aggatatccg ggaggtggtt gcgaccgcga gcctggacgc tgcgtcgaaa 30241 tccgctctga tggccgcctg gacatcgctg tatgacgcat ccaagggcga ccgtggcgat 30301 gccgatgacc gcggaccggc gtcggtcggc tcgggcggcg cgcccgcacg cggtgccggt 30361 cagcagccgg agttgccgac acgagccgaa cccgattgcc tcctcgactc gctgctgctc 30421 gaggatccgg gtttgctggc cgatgaccta caggtgccgg gaggcacatc cgcggcaata 30481 ccatcagcgt cgtcgacgcc aagcctgccc aatcttggcg gagcaacgat gccgggtggc 30541 ggagcaacac cggccttggt ccccggtgtg agcgcgccgg gtgggcttcc gctctccggc 30601 ctgctgcgcg gcgtgggtga cgaaccggag ttgacggact tcgacgaacg gggacaagaa 30661 gtcagggatc cggccgatta tgagcattcc aacgaaccgg atgagcgtcg cgccgacgac 30721 cgagaaggcg ccgacgagga cgccgggctg ggcaagtcag aatcgccacc gcaggctccg 30781 acgaccgtga cgctgcccaa cggtgagacg gtgaccgcgg ccagtcccca gctcgccgcg 30841 gcgatcaagg cggcggccag cggcacaccg atcgcagatg cgttccaaca acagggaatt 30901 gccatcccgc taccgggaac cgcggtcgcc aaccccgtcg accccgcccg gatctcagcg 30961 ggagacgtag gtgtgttcac cgccacgccc ttgcccttgg ccctagcaaa gctcttctgg 31021 acggccagat tcaacacatc tcagccgtgc gagggccaaa ctttctaggc tggatacatc 31081 cagcggcgac cgcgaccgcg ccggcgagga ccgaagcacc gacaccaacc aggccggcgg 31141 ccgctcgata ggtactgacc gcccggtcac aacaagagga gacagcggat gacagatcga 31201 attcacgtgc agcctgcaca tttacgtcag gccgctgccc atcaccagca gaccgccgac 31261 tacctgcgga ccgtgccgtc gtcgcacgac gcgatccgcg aaagtctgga ctcgctgggg 31321 cctattttca gtgagctccg cgacaccggg cgtgagctgc tcgagctcag aaagcagtgc 31381 taccagcagc aagccgacaa ccacgccgat attgcccaga acctgcgaac gtcggccgcg 31441 atgtgggagc agcacgagcg agcggcgtcg cgcagcctcg gcaacatcat tgacgggagc 31501 cgatgacagg gcgatgaccg acgccaatcc cgctttcgac acggtccacc ccagcgggca 31561 cattcttgtt cggtcctgcc gcggtggata catgcatagc gtctcgctga gcgaggcggc 31621 gatggagacc gacgcagaaa ccctggcgga agccatcctg ctcaccgccg acgtgtcctg 31681 ccttaaagcg ttgctggaag tacgcaacga gatcgtggcg gcgggccaca ccccgtccgc 31741 gcaggttccc acgaccgacg acctgaacgt cgcgatcgaa aagctgctgg cccatcaact 31801 gcgccgccgt aaccgttgaa gtgctagatg agccaggtct tggtgctgtc gggatcgggt 31861 gcgatgtcgg tgggcggctc gatcggattg gggccgaaca attctcgcgc tcgagtgagc 31921 agagcccgca cctcgtcgag ttgctgctgc agcgcagaat cagccataac cccacgctac 31981 ccaggccccg tctgacacac aattcaccac ccgctcaccg cctgcgcggg ccagatgatg 32041 ccggtacgct tacccggtgg cgatcttcgg tcgatggagt gcgcgccagc gactccggag 32101 agcgacccgg gaatccctca cgattccgac gtttagctcc tcgctggatt gcaccacacg 32161 ggtaattggc gggctctggc ccgctgagct ttcgtctaac accgccgaaa ccgccacgct 32221 tgcagaacat ctgaaagcgg atctgcatcg gatagttggt tctgccaacg acgagctgat 32281 ggtcatctgg cgtgcgggga tggctgattc gacgcgacgc gcagaagaag acagagtgat 32341 cgaccgcgcc cgcgcgtcgg cgatgcgtcg cgtcgagtcg gcgatgcgcg agcttcggca 32401 gataacgggg cgcgttcccg tggaaattcc gcgtatgcgc ggcgccggcg gctcggatct 32461 ggacacgaca cgactcatgc cggccgtcac ggtagttcag cccgctgacc aggcctgtac 32521 ggattggccg gttgccgccg ccgaggatga cgaagcccga ctgcagcgcc tcctggcgtt 32581 cgtggctcgt caggagccac ggctgaactg ggcggtcggc gttcacgcgg acggcacgac 32641 ggtcctggtc accgacgtcg cccatggttg gatacctccg ggcatcgccc ttcccgaagg 32701 cgtgcgattg ttggcaccgg cgcgacgcgc cggcagagcc cccgagttgg tcggtatcac 32761 gacgtgttgc aagacgtaca cccccggtga ctcgctgcgt cgggcggtcg attcaaccgc 32821 gccgacgtcc tcggtgcagc cgcgagcgtt gccagcgatc gccggcctga gtgtggagct 32881 gggcatagcg acccagcggc acgacggctt accgaagatc gtgcacgcca tggccacggc 32941 ggccggcaac ggcgccgccg ccgaggaagt cgacctgttg cgggtgcacg tcgataccgc 33001 gctccaccac gtcttggccc agtatccccg ggtcgatccg gcgttactgc tcaactgtat 33061 gttgttggcc gccaccgagc gcagcgtcac gggagacccg atcgcggcga actatcactt 33121 cgcgtggttc cgggaactcg attcacgccg atagctttct cgaatcccca cggcaagcgt 33181 ccggcgatga attgacgctg gtggggggcg tggacatact gtcatggtgt cggggtcgga 33241 cagtcgcagc gaaccgagcc agctgagcga ccgagacctc gtcgaatcgg ttcttcgtga 33301 cttgagcgag gcggccgaca agtgggaggc gctcgtcacg caggctgaaa ctgttaccta 33361 cagcgtggac ttgggagacg ttcgcgctgt tgccaattcg gacgggcggt tgctcgagct 33421 gacgttgcat ccgggcgtga tgaccggcta cgcgcacggg gagctggccg accgagtgaa 33481 cctggcgatt acggccctgc gcgacgaggt tgaggccgag aaccgggcac ggtacggcgg 33541 ccgcctgcag tgacatcggt atctgcgagg atcaagccca tttgctggca aggcatttcg 33601 gcgcggggcg caaggcccac agccgggccg tggccaccct gaaagccgat atccaagcct 33661 ggcacccggc tggcatccag accccgaagc cgcgatgcga atcagatgtg ttcgcgcgaa 33721 tcggtcacac gagccaccca tcaactcgga agagccgggt ggggccggga gcatccgagg 33781 caccgcttgc ctgacataac agcgtaaccg ccccgccatt gtcgctgtga tggacatgcc 33841 ccagccattt gtcggctagc tatacagcga acgtcaattt ttcgtgaatc agcctgaggc 33901 tattgataat tcacggcggc acgtcctact cttagcggcg ctatgcgacc caatgcgcgt 33961 gcgatgttgc gtttggtgca ttgtggtgcc ggtgctggtg ggccggcgat aacgtcgaaa 34021 ggtgcggtat tgggtgaccg tgttggcgcg ttgtcgcagt gccgatcggc ggcagcgctg 34081 agtcgattcg actttgcacc ccgtgactct gttcccaccg ccaccttcgg tggtggatgc 34141 gctttcaggt ccaccaatag gctagctgtt ttcgagcggt gtatttgcgt ggggggtgaa 34201 tgtggatacg gacaatgaca ggcccacgct ggcgagggtt taccgcagcc tgcgggacat 34261 ttgtccggac agctggaatc ttccgggcgg tcggatgccc actggcttgg gctatgactt 34321 tctgcgccct gtcgaggact cggggatcaa cgacctgaag cactattact tcatggcgga 34381 tttggccgat gggcaaccgc taggccgggc aaacctctat agcgtctgtt tcgacctggc 34441 caccaccgac cgcaagctca ctccggcctg gcgaacgacc atcaaacggt ggtttccggg 34501 gtttatgacc ttccgtttcc tcgagtgcgg gttgctcacc atggtgagca acccgctggc 34561 gttgcggtcc gacaccgact tggagcgggt attgcctgtg ctggccggcc agatggacca 34621 gttggcgcat gacgacgggt cggatttctt gatgatccgg gacgtggacc cggaacacta 34681 ccagcgatac cttgacatcc tgcgcccgtt gggctttcgg cctgcgctgg gcttttcccg 34741 ggtagacacg accatcagct ggtcgagcgt ggaagaggca ctgggctgcc tgtctcacaa 34801 aaggcgcctg ccgttgaaga cgtcgctgga gtttcgtgag cggttcggta tcgaggtcga 34861 ggaactcgac gagtatgccg agcatgcgcc ggtattggcc cggctttggc gcaacgtcaa 34921 gacggaggca aaggattacc agcgcgagga cctgaaccct gagttcttcg cggcgtgttc 34981 tcggcatctg catggacgta gcagactgtg gttgttccgc taccagggca cgccaattgc 35041 cttctttttg aacgtttggg gtgcggatga gaactacata ctgcttgagt ggggcatcga 35101 tcgtgatttt gaacattata ggaaggcgaa tctgtaccgg gcggcgctga tgctcagcct 35161 aaaagatgcg atcagccgag ataaacggcg aatggaaatg ggtattacga actatttcac 35221 aaaacttcgc attccgggtg cccgagtcat accgaccatc tatttcctgc gtcacagcac 35281 ggatccggtg catacggcaa cgttagcgcg aatgatgatg cacaatattc aacggccaac 35341 gctacccgac gatatgtcgg aggaattctg tcgctgggaa gagcgaatac gtctggacca 35401 ggacgggcta cccgaacacg atatctttcg caagatcgat cgtcagcaca aatacacggg 35461 gctcaaactc ggcggagtct acggttttta tccccgattc accggaccgc agcgatccac 35521 ggtcaaggcc gcggagctgg gcgagatcgt gttgctgggc acgaactcgt atctgggcct 35581 ggccacccat ccagaggtgg tggaggcctc ggcggaggcc acgcgacggt acggcaccgg 35641 ctgctcgggt tcgccgttgc tgaacggcac gttggacttg cacgtctcgc ttgagcagga 35701 actagcctgt tttttgggca aacccgccgc cgtgttgtgc tccaccggat atcagagcaa 35761 cctggcggcg atcagcgcgc tatgcgaatc cggggacatg atcatccaag acgcgctgaa 35821 ccaccgcagc ctgttcgacg ccgccaggtt gtccggggcc gacttcacct tgtaccggca 35881 caacgacatg gaccacctgg cgcgggtgct acgccgcacc gaggggcgcc gccggatcat 35941 cgtcgtggac gcggtgttca gcatggaagg caccgtcgcc gacctggcca ccatcgccga 36001 gcttgccgac cggcacggct gccgggtcta tgtggacgag tcccatgcgc tgggcgtgct 36061 cggccccgac gggcgaggag cttcggccgc gttgggtgtc ttggcgcgca tggacgtggt 36121 gatgggcacg ttcagcaaat cctttgcctc cgtcggcggg ttcatcgccg gagatcggcc 36181 cgtcgtggac tacatccggc acaacggttc aggtcatgtg ttttccgcca gcctgccgcc 36241 ggccgccgcg gctgccaccc acgcggctct gcgcgtcagt cggcgtgaac ccgaccggcg 36301 ggctcgggtg ctggccgcgg ccgagtacat ggccaccggc ctggcacggc agggctatca 36361 ggccgagtat cacggaaccg cgatcgtgcc ggtgatcctg ggcaacccga ccgtggcgca 36421 tgcgggctat ctgcggctga tgcgctccgg ggtgtatgtg aacccggtgg cccccccagc 36481 cgtgccggag gagcgttcgg gattccgcac cagctaccta gccgaccacc gacaatctga 36541 cctcgaccgg gccttgcacg tgtttgccgg ccttgccgag gacctgaccc cgcaaggagc 36601 cgcgctatga aagaggccat caacgccacc atccaacgga tcttgcgaac cgaccgcggc 36661 atcaccgcga accaggtact cgtcgacgac ctgggttttg actcgctcaa gctgttccag 36721 ttgatcaccg agctagaaga cgaattcgac atcgccatct ctttccgcga cgcacagaac 36781 atcaaaacag tgggagacgt ctacaccagc gtcgcggtct ggttccccga aaccgccaag 36841 ccggccccac ttgggaaagg aacagcatga ccgacgacgc cgatcttgat ctggtccgaa 36901 gaactttcgc cgcgtttgcc cgcggcgacc tcgccgagct gacgcaatgc tttgcgcccg 36961 acgtggagca gtttgtcccg ggcaagcacg ccctggctgg ggtgttccgc ggcgtggaca 37021 acgtggttgc ctgcctcggc gacaccgcgg ccgccgccga cggcaccatg acggtgacgc 37081 ttgaagacgt gttaagcaac accgatggcc aggtgatcgc cgtgtatcga ttgcgggcca 37141 gcagggccgg gaaggtcctc gaccagcgcg aggcgatcct ggttaccgtc gccggtggtc 37201 ggatcacccg acttagcgag ttttacgccg acccggcggc gaccgaaagc ttctgggcat 37261 gacggcggcc ttgctttcac cagccatcgc ctggcagcag atctcggctt gcacggaccg 37321 cacgctgacg atcacttgcg aggattccga ggtaatcagc tatcaggacc tcatcgcgcg 37381 cgcggcggca tgcatccccc cgctacggcg tcttgacctc aaacgcggtg aacccgtgct 37441 gatcaccgcc cacaccaacc tggaattcct gtcctgcttt ttgggcctca tgctccatgg 37501 cgctgtgccg gtacccatcc cgccgcggga ggcactgaag accaccgagc gtttcatgac 37561 tcggctcggc ccactgctgc gccatcaccg cgtgctgatc tgcacaccgg ccgaacacga 37621 cgagatacgc gctgccgcca gcaccgactg ccagatcagc agatttactg ccctagccga 37681 ggctggcgac gagcagttcg gccgcgccac ggcccagcaa ctcgccgaca ccgccaccgc 37741 cgactggccg ctatgcaccc tcgacgacga cgcctacgtc caatacacct ctggcagcac 37801 cgcagcacca cgcggagtgg tcatcaccta ccgcaacctg ctgtccaaca tgcgcgcaat 37861 ggccgtgggc tcacaattcc agcacggcga tgtcatgggc agctggctgc ccttgcacca 37921 tgacatgggg ctggtgggca gcctattcgc cgcactcttc aacagtgtca gcgcggtatt 37981 caccacgcca caccggtttc tgtatgaccc gttgggattc ctcagactgc tcaccagctc 38041 cggggctacc cacacgttca tgcctaactt cgctctggag tggctgatca acgcctacca 38101 caggcgcggc gccgacatcg aaggcatcga cctacacaaa atgcgccgct tgatcatcgc 38161 ctccgaaccc gtccatgccg agggcatgcg gagattcgcc gccaccttcg ccggcgtcgg 38221 acttgccccc acggccctgg gttcgggcta tggcctggcc gaagcgaccg tcgccgtgtc 38281 gatgtcagcg cccaacacgg gattccgcac cgaaacccac gccgccgcgg aggtcgtcac 38341 cggcggccga gtgctgcctg gctacgaggt gcgcattgac gccgcaccag gtgcccgggc 38401 cggaacgatc aaactgcgcg gcgacagcgt ggccgccaaa gcctatgtgg gcgggaagaa 38461 gctggacgcg ctcgacgagg aaggcttctg cgacacccac gacttgggtt ttcttgtaga 38521 cgacgaaatc gtcatccttg gccggcagga cgaggtgttc attgtccacg gagaaaacag 38581 attcccctac gacatcgagt tcatcattcg cggggaatcc gagcagcacc ggaccaaagt 38641 cgcatgtttc ggggtcaacg aacgcgtcgt ggttgtgttg gaaagcccat tggacagcat 38701 catcgacaag gccgaagccg accgactgag atgtcaagtc gttgccgcga ctgggctgca 38761 gttggatgaa ctgatcacgg ttcggcgcgg cgcgattccc accaccacca gcggcaagct 38821 caaacgacgc gccgtcgcgc aggcttatcg agacggcaca ctgccccgtc ttgccaccca 38881 cgcgtggacg gcggatcccg atagcgctcc caaaacgacc cggtccagcc tggaaggcgc 38941 ccactgatct tccactgacg tctcatcaaa cccccggggc gctcgcgcgc tgggcgcgct 39001 catcgaccgg ggcttgggtt gattggcccc ggctctcttc gcgcgctggg cgcgctcatc 39061 gaccgcggcc gggtggcccg gcgaaagctt gggcgatcgt cagccagcgt tgtgcgtcct 39121 cccctactgc gttgacgtca agagtgctca gcgcgcgccg ctgggtgacc aggaagcaga 39181 agtcctcggc ggacccggtg acccgctggg ccgcatcgga tggcccccaa gaccaagtgt 39241 cgccgctcgg tccccgcagc tcgaccagga acggctcggc cggaggggtt aggttgttga 39301 cgatgaacgc gtagtcgcgg gtgcggacac cgagatgcgc aatagaccgc agtcgctggg 39361 tggcgggccg gatgacgccc agggcgtcgg cgacgtccag tccatgtgcc caggtctcca 39421 tcaaccgcgc tgttgccatc gacgccgcgc tcatcggtgg cccgaaccag gccaatttgc 39481 ggccatcggg aaccgccagc agttcctcgt gcagccgccc ccgagtgacc cgccagtctg 39541 tgagcagttc ggcaggtgaa acggccgcca gttctgtcgc ggcgtcgtcg acgaaaccgg 39601 ccggattggc cgcggcggcg gtcatcagct cggcgaaccc ggcctcgtcg gtgaccgccg 39661 tcagcgccac tcgatcggtc cacagcaggt ggccgatctg gtgtgcgatg gtccaacccg 39721 gcgcaggtgt cggatcggcc cagcgatccg ctggcagatg cgccaccagc gcgtcgaggt 39781 cgtcgctttc ggcacgcagg tctgccacga acggcccagg atccgccatc accacctcct 39841 gaggtaacag ttcgtcggga aaggcatgtt tgtaccctag cgaccgatca caggctggcc 39901 gcggcgcccg acgatggtgt gcaccaccag cccggctagg tagatcgccg acccgaacag 39961 cacaaatacg ggcgcgtgcc cgtgctcggg aatcaaggct gcggccacgg taatcgagag 40021 gatgtatgag acccaaaaca gtgcatcctg cacggcgaac acgtgcccgc gcaatgcgtc 40081 gtcgacgtcc atctgcatcg ccgaatcggc gcacagcttg accacctggc cggccacacc 40141 taaaaggaag ccgcatacca ccatcaccgg gaccagcagc ccggcggccg cgacctggat 40201 agtggcggcc gcagccaacg cgccatttgc cgtggcgtag cgtccccagc gccggatcgc 40261 ggtcggagtc aagacgttgg ccaggaaggc tcccagcccg gtggccgcga agaacagcag 40321 tgcggtaccc aaccccccaa cggcccgggc ggtcacgtgg cggaccagga gcaagatcag 40381 cagtgagttg ataccgacca ccatccgatg cgctgccaaa ccggacaggc cggcagcgac 40441 ggtcggaagt tgcaccacgg tgcgcgctcc atgtagccaa ccggtgacca cggcgtagac 40501 agcagatccg tggatcgcgc gttcggtgtc gtccgggccg agtacccgcg ggccgaaccg 40561 cagcgaccaa agcaacgcga tcgatacggg gatcgccacc aggaagacga tcgcggaggc 40621 cccctcgtcg ccgctgccga gcagccaacg aggcaacagc atgaagttgg cgcccaggaa 40681 cgcggagacc gcccccgacg cgatggccac cgagttcatc gtgaccacct gttcgcgcgg 40741 caccacgtgg ggcagtgccg ccgacagtcc cgaggcgacg aatcgtgcca agccgttggc 40801 gaccagcgct ccgaccaaca gcggcacgtc gccggctccg accgcgagta tcgtgccgac 40861 cccggcgatc agggctagcc ggccggtgtt ggcgccaacc agcacccacc gccgatccca 40921 ccggtccatt agggccccgg cgaagggccc cagcagcgaa tagggcagaa acagcaccgc 40981 gaaggccccc gcgatggcca tcgggtcggc cgcccggtcc gggttgaaca gcaacgctcc 41041 ggccagcccc gcctgaaaca acccgtcgcc gaactgactc gcaacccgca cctgcagcag 41101 acgccagaag tcgggcaagc tgcgcaccga ccgccaaacg tcgacgggtg cgcgtgcgtg 41161 catccgggag tgaatcacta aacccacttc caccctgggc acaggcaagg ttcggtccac 41221 cccgtgccgc cccaaccaca gtacaaatat tcgccgaccc tgcttgttcg ccccgggcga 41281 tgcgacggtg gtgcgatgat ggtgtggtgg cgccgcacga agaccccgag gaccatgtcg 41341 cacccgccgc acaacgggtg cgagcgggca ccttattgtt ggccaacacc gatctccttg 41401 aaccgacatt tcgccgcagt gtgatctaca tcgtggagca caacgacggc ggcaccctcg 41461 gtgtggtcct caatcggccc agcgaaaccg cggtctacaa cgtgttgccg cagtgggcca 41521 aactcgcggc caagccaaag acaatgttca tcggtgggcc ggtgaagcgc gacgcggcgc 41581 tgtgtctggc ggtattgcgg gttggcgctg acccggaagg cgtgccgggc ctaaggcatg 41641 tcgcgggcag gctggtgatg gtcgatctgg atgccgaccc cgaggtgctc gcagcggcgg 41701 tggaaggggt gcgcatctac gccgggtact ccggctggac catcggtcag ctcgaaggtg 41761 aaatcgagcg cgacgactgg attgtgttgt cggcgttgcc atctgacgtt ttggtggggc 41821 cgagagccga cctgtggggg caggtgctgc gacggcagcc gctgccgctg tcgctgctgg 41881 ccacccaccc gatcgatctg agccggaact aggctactcc gccgccgagc ttgccagagc 41941 agcgcgtcgc gtcgccgcgg tcgagccagg cgatccggcc cagcctagtg ggccacaggc 42001 tgttcaatga caggcctggg tgcagaccgc gcagctgcca acgcagttgg cggtggggct 42061 agcggtttca cggcgcagcg cgtactgggc gctctgccac gaccccgcgg ccagcgtgcc 42121 gaccgcgccc gcaatgcaga cgatcaccac catcaaggcg gtgtgcccgg gcgcggccac 42181 cgccaccact cccccggcgg ccagcattac tgcggctgcc aactgcgtgg gcgccatcgc 42241 gcgcagcgcc agcgccgtgg ggtcggcagt gggcgtatgg cacagcgacc agctcccgaa 42301 cagggcggac gccgccgccg cacacatgca cagcacaccc gcgaggaaca ttcgttcacc 42361 atacgaggcc gccgacgaat ccgctcaccg agctccatgc gggcccgtgt ttctgctcgg 42421 cctcatcgcg acctagcgcg gcgggactgg tgtcagggtg cccgcgggcg gatacccagg 42481 cgcctgcccg ggtagtccca ccggtgccga accgggtgcc ggggcaggcg cctgagcggg 42541 cgccgcatgc gcaaccactt ggaatccgtt gacaatcgca tcggtggccg gcccgtcggt 42601 gaccgcctgc gacagcgcgg tggtcaccga cagcgaaacc aggtacttgt cggctccgga 42661 ggtggcgatg acgtggcgcc gggaggtgtt gagggtcatg tcgttttcgc ggtaggtgcc 42721 ctcgatgatt gatgacggaa agccgtcgaa attggccatc gaggcgtttg tggtctgcca 42781 tgcgagcaat ttctggctgt caatgtagcc gtgtgtgatg gcctcagcgg gatcgaagtc 42841 accgatcagc ctatacacca ccagctgcgc attcgacgtg tagacgctgt tgcccaaccg 42901 gtcggcgatc accacgaacg cgtcgggcac gttggggtcg ggcacctgag tccagcgcgg 42961 cggcatgggc agtgtgatgt cgagcgcctt gaatccgtgc ggtcgctgtg cctccagctt 43021 gacgcccttc tcccggaggt ggtcccgaag tgtgccgctg atcgcgggag tcactggcgg 43081 cggcagcggg ggcacagcgg tggacccggg tgctccgacc ggaatcggcg acgcgatcgg 43141 tgcgggtgct ggcgccggtg agaacctgtt gctgctcccg cccggaagcg ccgtgaggtt 43201 ctgcacgggc gggactgttg ccggcgccga gactggggca gggataggcg gcggtggcag 43261 caggggatcc gctgaggcct tcccggcggt gaccagcacc acgccgatga aaccggtggc 43321 catgccgcct gcgaagaccc gccaggtgcg cgcgatctgg atcatttgcg tcggtccctc 43381 cgaatggccg ggcgacggtg cccgtcgtcg aggctgaatg taaccagcgc tccatggcag 43441 tgcacaggct tgaaatgcag ctggaatgaa cctctgatcg tggtgcaacg gaaccgagac 43501 caacccgtgg ccggtagcgc ggccccggag gttcccgggc cacccttata ccctgttggg 43561 cgtgaccgaa tcgccaaccg ctgggcctgg cggcgtgccc cgtgccgacg acgcggactc 43621 cgacgtgcca cggtaccgct ataccgccga gctcgcggct aggctggaac ggacctggca 43681 ggaaaactgg gcccggctag ggacgttcaa cgtgcccaac ccggtcggct cgctggcccc 43741 accggatggt gccgcggtgc ctgacgacaa gctcttcgtg caggacatgt tcccctaccc 43801 ctcgggtgag ggactccacg ttggtcatcc cctcggctac atcgcgaccg acgtctatgc 43861 ccgctatttc cggatggtgg gccgtaatgt gctgcatgcg ctagggttcg acgcgttcgg 43921 gctgcccgcc gagcaatacg cggtacaaac cggcacccat ccgcgtaccc ggaccgaagc 43981 caacgtcgtc aactttcgcc gccagttggg ccggctgggc ttcggccacg acagccgacg 44041 aagcttctcg accaccgatg tcgacttcta caggtggact cagtggatct tcctacagat 44101 atacaacgcg tggttcgaca ccacagccaa caaggcgcgc ccgatatcag agctggtcgc 44161 cgaattcgag tccggtgcaa ggtgtctcga tggcggccgg gattgggcca agttgaccgc 44221 gggggagcga gccgatgtga tcgacgagta ccggctggtc tatcgggcgg attcgctggt 44281 gaactggtgc ccggggctag gtacggtgct tgccaacgaa gaggtgaccg ccgacggccg 44341 cagcgaccgg ggcaattttc cggtgttccg gaagcggttg cggcaatgga tgatgcggat 44401 caccgcctat gccgaccggc tgctcgacga cctggatgtg ctggattggc ctgagcaggt 44461 caagaccatg cagcgcaact ggatcgggcg ttcgacgggt gcggtggcgc tgttctcggc 44521 gagagcggcc agcgatgacg ggttcgaagt cgacatcgag gtgttcacca cgcggcccga 44581 caccttgttc ggcgccacgt atctggtgct ggctcccgag cacgacttgg tcgacgagtt 44641 ggtcgccgcg tcctggccgg ctggggtcaa ccccttgtgg acatacggcg gcggcacacc 44701 tggtgaggcc atcgccgcct accggcgtgc gatcgccgcc aaatcagacc tcgagcgcca 44761 ggagagcagg gaaaagaccg gcgtcttctt gggcagctac gccatcaacc cggccaacgg 44821 tgagccggtg ccgatcttca tcgccgacta cgtgctggcc gggtacggta ccggggcaat 44881 catggcggtg ccgggtcatg accagcggga ctgggacttc gctcgggcat ttggtctacc 44941 gatcgtggaa gtaattgccg gcggcaatat ttcggaatcc gcgtatacag gcgatggcat 45001 cctggtcaac tcggattacc tcaatggaat gagcgtgcca gcagcaaagc gggccatcgt 45061 cgaccggttg gagtccgcgg gccgcggccg ggctcgaatc gaattcaaat tgcgcgactg 45121 gctttttgcg cggcagcggt attggggtga accattcccg atcgtctatg acagcgacgg 45181 gcgtccgcat gcgctcgacg aagctgcact gcccgtcgag ctgcctgatg tcccggacta 45241 ctcgccggtt ttgttcgacc ccgacgatgc ggacagcgag ccttcgcccc cactggccaa 45301 ggcgactgag tgggtacacg tcgacctgga cctcggtgat ggcctgaagc cctacagccg 45361 cgacaccaac gtgatgccgc agtgggcggg cagctcctgg tatgaactgc gctacaccga 45421 tccgcacaac tcagaacggt tctgcgccaa ggaaaacgag gcctattgga tgggaccgcg 45481 gccggctgag cacggcccgg acgaccccgg tggcgtcgac ttgtacgtcg gcggtgctga 45541 acacgcggtt ttgcacctgc tgtattccag gttctggcac aaggtcttgt acgacctggg 45601 tcacgtcagc tctcgcgagc cttaccgcag gctggtcaat cagggctata ttcaagctta 45661 cgcttacacc gatgcgcgcg gatcctatgt ccctgccgag caggtgatcg aacgcggtga 45721 cagatttgtc tatcctggac ctgacggtga ggtcgaagtt ttccaggaat tcggcaaaat 45781 cggtaagagc ctgaagaatt cggtatcgcc ggacgaaatc tgcgacgcat acggggcaga 45841 tacgcttcgg gtttacgaga tgtcgatggg gccgctggag gcttcacgtc catgggccac 45901 aaaggatgtt gtcggcgcgt accgttttct gcagcgggtg tggcgcttgg tcgtcgacga 45961 gcacaccggc gaaactcggg tggctgacgg cgtggaactc gacatcgata cgctacgggc 46021 gttgcaccgc accatcgtcg gcgtgtcaga agactttgcg gcacttcgca ataacaccgc 46081 aacggctaag ttgatcgaat acacgaacca cctcaccaag aagcatcgtg atgcggtgcc 46141 tcgggccgcc gtggagccgc ttgtacaaat gctggctccg ctggccccac atattgccga 46201 ggagctgtgg ctgcgactgg gcaacaccac ctcgttggca cacggcccgt tcccgaaggc 46261 cgatgccgcc tacctcgtcg acgagacggt cgagtatccg gtgcaggtga acggcaaggt 46321 acgtggccgg gtggtggtgg ccgccgacac cgacgaggaa acgctgaaag ccgccgttct 46381 gaccgacgaa aaggtccagg cattcttggc tggtgccacc ccgcgcaagg ttatcgtggt 46441 cgccggccgg ctggtcaatc tcgtcatcta ggtcgtgtcg gcggtgccga cggtgggcga 46501 ggtaatccgc ggggtagttc gttgtatgcg ttacgccgcg agagccggcg gcgaccagat 46561 tggttgatag cgtggtactt tcacgctcgt ttgcgagcag gggagttgct tgcagggcca 46621 ctggccggtt cgcccgaggc gagacgctcc agtggcgcca gggccttcct gagggtttcc 46681 aagtcggagc ggggaagttg gctgagcagc gcggccagag ccgcgcgccg gttggccagt 46741 gactcaccgt gaaccgcccg cccttgcggc gtgatgtcta ccaacaccgc ccgcaagtcg 46801 gacgggtctc gcgagcgttt caccagtcca atcttctcga gccgccggat cgccacggtg 46861 gtggtgggag ttcgcacccg ttcgtgagcg gccaggtcgg tcatccggat gggaccttga 46921 tcgagcaggg tgaccaggat cgacagttgc gccagcgtta ggtcgccggc tgcagccccg 46981 ttgggatccc cgcggcgcag cattgaaatc agcttggaca atgcgcggtg cagcccctcc 47041 gccagttggg tcacttccgg tgcggtgaat tcgctgtccg ccataaaccg gcagtctaac 47101 ctgacatgcg tgtgaccgta gacttgtgtc gggcgacctt tgaccgccaa tgcatttggt 47161 cccgaaatcc gctgcatttt cttgccaatc gagcggacaa cactcatgtc atggctgact 47221 acctacattg tcagttctgc cggatccatg gtcagtgatg tcgaatgcca ctgaccgcca 47281 acggaaaccg gctctcgcgt taacgggaca gtcaatattg gagacgccgg cagccgctgc 47341 tggcttcacc atcggatcgg cgtaattagg gcaccggtga ggagggctgg tagcttctgg 47401 cgaagccagg gatcggcgcc ccaaacgggc cgggacaagc gccctcgggc gggaccaata 47461 ctcggcggcg gaacagttcg gccagcatcg tctgggccat cagctcggaa cggccgatgc 47521 aggcagccct cgcagcttca ggttcgcgcc gatggattgc ggcgttctct tcctcgtaga 47581 acggcaacac gtcgtcgcgg ctgttttggt atgtcatcca gaacactcgc ggaatcaggt 47641 tctgtgaggc ccggatggtg gcgtgcagcc gcggtcctgc gtactcgtcg ttgaccgtgc 47701 gccggtactc ccacacgcat tcggcgaagg cccgcgactc cttggagttg cgcagcgatc 47761 gcatgactgc gtcgagctgg cccaggatcc gaggcgtggg gttggcggct gcgcgggcag 47821 aggcaatgcc gttgagcaag ccgtcgagtt cgtgatgttc caggatggtg gcgacgtcga 47881 accgctcgat gaacgcgccg cggtgatagc gagtcgacac aatgccgtcg tgttcgagtt 47941 gaaccagcgc ctcttggatg ggaacccggc tgacccccag gccgtgcgcg atttcattgc 48001 ggtcgacgcg gtccccgctg cgcagtttgc cggtcaatag caggttgagg atgtgggcga 48061 caacctggtc cttttcctta accccgtact tttttggcat cggtatctag catctctttc 48121 agcccgctgc agccatccgg cgctggcaag tttctcatga ctcggcgtct gcgttgtggt 48181 gtttcccaga tgaagccggg ggtaacgcga tctgacagac gtcaaccgga gttcaccggc 48241 catcgcgcca cctgcaaagc gcggccgcag cgctcaggtc gtagtcggga ccgtcacagc 48301 caacggtcaa cagcgtgaca ccgagaccgg cgagggcttc ggcgctggcg atcagcccgc 48361 cgccgtcgac cgcggcggag cgttcgatag tcgctgggtt tcggccgacg gtcgagcagt 48421 gcgtgctcag cacggccgac ttcgctaggt agctgtcccc ggcggtaaag ctgtgccaga 48481 tatcggcata ctcggcgacc agtcgcaggg tcttacgctc tccgccgccg ccgatcagca 48541 ccgggatgtc ccgtgtcggc ggcgggttca gcttgccaag ccgcgccttg atccggggca 48601 gcgcagccgc caggtcgtcg aggcggctgc ccgctgtgcc gaaccggtag ccgtactcgt 48661 cgtagtcctt ctgtttccag cccgacccga tacccaggat gagccggccg ccggagatgt 48721 ggtcgacggt acgggccatg tcggcaagca gctccggatt gcggtaggag ttgcacgtca 48781 ctagagcgcc gatttcgatg tgcgacgttt gctcggccca ggctcccaag acggtccagc 48841 attcgaagtg tgggccgtca gggtcgccgt agagcggaaa gaagtggtcc caggtaaaag 48901 cgatgtccac accgatgtcc tcgcaccggc ggacggcgtc tcggacggcg cggtaatggg 48961 gggcgtgctg cggctgcagt tgtacgccga tacgaacggg gagatcggga cgcacgagtg 49021 aagtcatggg tccaccgtag gctcagcgtg tgtcgagcac cccgcgcacg atctcgatca 49081 gggcgcgcgg ttggtcactt tgcaccgagt ggcctgactt ctcgacgatg tgaacgccac 49141 ggaaatgcgt tgcacgcctg tggagttcgg cggtgtcctg gtcggtgacg aagcccgacg 49201 agccgccgcg cacgagtgtg atcggcgcgg acagggcgtc gacgtcgtcc cagagccctg 49261 cgaaatctcc gaacgtgcgg atcgcgtcat agcgccacac ccagttgccg ttgtccagcc 49321 ggcgggagtt gtggaacacg ccgcggcgca acgacttgac atcgcggtgc ggggccgcgg 49381 cgatcgttag gtccagcatg gcctgaaagc tggggaattc ccgctcgccg tgcatcagcg 49441 ccaccgtgcc gcgctgctcg gcggtcagct cggcgtgccg ttgcaatgcc gacggggtga 49501 cgtcgacgag aacgagttcg ccgaccaggt cgggtgccat cgcggccagc cgtatcgcag 49561 tcaacccgcc cagcgacatg ccgaccacga attcggcacc cggcgcaagc tcgcgtagca 49621 ccggcgccaa ggtctcggag ttgagctgcg gcgagtaatt gccgtcctcc cgccaagcgg 49681 aatggccgtg ccctggaagg tccaccgcca gcgccggctc acccaggccg acgatcacgg 49741 tgtcccaggt atgggcgttc tgtccgccgc cgtgcagaaa gatcacccgc ggcgcagagc 49801 cgccccagcg cagcgcgctg atggctcccg cttggacccg ctcgacttca ggcagtggac 49861 cattgacacc ggcctgctca gcgttctcag ccagcagggc aaactcgtcc agtccggtca 49921 gttcgtcgtc agatagcacg cagcggacgt tacccgcgtt tgactctgcg gataccaggc 49981 aattgtgcga gtggcccgcg tggtgagcgc agagtcaacg ctaaccgatg atgaactctt 50041 cgagttgcgc gcgcgcgatg tcgtcgggca gctgctcggg cgggctcttc atcaggtagg 50101 ccgacgccgg gatcaccggt ccgccgatgc cgcggtcctt ggcgatcttg gccgcccgca 50161 ccgcgtcgat gatgacgccg gccgagtttg gcgagtccca cacctcgagc ttgtactcca 50221 ggttcaacgg cacatctccg aaggcgcggc cctcaagacg gacataggcc catttgcgat 50281 cgtcgagcca tccgacgtgg tcggacgggc cgatgtgcac gtccttggtc ttgaactcgc 50341 gcttcagatt cgaagtgacg gcctgggtct tggagatctt cttggactcc agccgttcac 50401 gttcgagcat gttgaggaag tccatgttgc cgcccacgtt gagctgcatg gtgcggtcga 50461 gctgcacgcc gcggtcctcg aacagcttgg ccagcacccg gtgggtgatc gtcgcgccga 50521 cctggctctt gatgtcatca ccgacgatgg gtaccctggc gtcggtgaac ttcttggccc 50581 acaccgggtc ggaggcgatg aacaccggca gcgcgttgac gaacgccacc ccggcgtcga 50641 tagcacactg ggcgtagaac ttgtcggctt cctccgagcc caccggcaaa taggagacca 50701 gcacgtcgac cttggcctcc ttgagcgcct ggacgacgtc gacgggctcc gcgtcggaga 50761 gttcgatggt gtcggcgtag tacttgccga tgccatcgag ggtaggcccg cgctgcacga 50821 tcacgttggt cggcgccaca tcggcgatct tgatggtgtt gttctccgag gcgaagatgg 50881 cgtcggacag gtcgaagccg accttcttgg cgtccacgtc gaacgccgcc acgaacttga 50941 cgtcgcgaac gtggtacggg ccgaaccgca cgtgcatgag gcccggtacg gtcgatgtgt 51001 cgtcggcgtt gtagtagtac tcgacgccct ggaccagcga ggacgcgcag ttgccgacgc 51061 cgacaatggc gactcgaacc tccgtcgacg cctccggcgc cggtaacgac tggtgctcac 51121 tcattaaggc gttctcctaa cctcataacc tctggggtgt cttgggtgtt ggttcgtgct 51181 gggtttacgt ctgttcggcg gggttgggtg ctgcccgttc cgcggcgatg agctcgttga 51241 gccacttgac ctcgcgctcg ctggactcga gcccgagttg atgcaattgg cgggtgtagc 51301 ggtcgaagga actgctggcc cgcgccaccg cctcgcgcaa gccttcccgg cgttcctcga 51361 cctggcggcg ccggccttcc aggatgcgca tccgcgcttc ggccggggtg cggttgaaga 51421 acgccaggtg caccccgaaa ccgtcgtcgg tgtagttgtg tgggccggtg tcggccacca 51481 gctcgccgaa tcgacggcga cccttgtcgg tcagttggta aacgcgtcgt gctcgccgca 51541 ccggggtgcc cgctggggcg gcattctcgg cgatcaaccc gtcggcctgc atgcgtcgca 51601 gcgccgggta taacgaaccg tacgaaaatg cccgaaacgc gcccagcagg ccggtcagcc 51661 tcttgcgcaa ctcgtagcca tgcatcggtg actcgatcaa cagacccagg atggcgagct 51721 ccagcatcga gtcacctcct tttgtatggc ttttgaatgg ccgttacgac ggttcgacgc 51781 ctcgcgtcat cgtatcgcct cgatatattt gcgacaacat caccgcgtca agacgggtag 51841 ctgacgtgct tgatggtgcc gtcacctgcg aaaacgaggt atccaccgcc gtagtcgcta 51901 gagacataca acgacaacga caacgcagcc ggcgtggtgg ggtccttgac cggttcgacg 51961 atcaggtaca tgcttttgac gtcggattgt ttgaggccga gggtttccgg ggcgccgcgc 52021 atgatgccca cagcggtctt cgcatcgaat ttgctcaggt caaccacgga cacgtcggca 52081 atgctcttgg cggaactggt cgcatcgccc cagccgccgc ggtaggtata cgccaggact 52141 cggcggtcgt ccgccgggtc gacgcgatcg agcgacgcat actccgggta gatcaccagc 52201 cggtagccca tggtgtcgcc gaaccgcttg cgggtctgct ccagcaggcc ggtgagcccg 52261 ccgagggaat gcagctgcct gggcggggtc agcaccacgg gggcgatccc gtcgggcttt 52321 gctccgggat ccgaggtgaa gtccagcgga gagcgggtgt tgccgtacac gccccagccg 52381 atgccgacgc ccagcagcac cgatgcgaca aacgcagcgg ccagcaagcc caactcggtg 52441 cgtttcgccc gcgatttgag cgcgggcatt tgtgcgggtg cgctctcgac ctgcaggtcg 52501 gccaccagac gctgcaggtc acctagggtc acagccttgg tagctgcgct gacgcgctcc 52561 cggtgttcct ccatcgagag ctcgccgtca cgcagggcgt cgtcgagaat ccggcaggcg 52621 tcctgccggt cgctgtcttt ggcgcgggtt gccgtcgata ctccgcgcgc aaggggtgcg 52681 cccagccact tcgccacagg gacgatagta ggagtctggc tgggaatctg aactcgatcc 52741 cgccgtaccc gcgcaacaac ggcgccggtt gcgtatcggt ggtgtggatg gcgtcgtact 52801 ctggtcagcg tgcgactgca gcgacaggta gtggactaca cgctacggcg acgctccctg 52861 ctggccgagg tgtattcggg acgcaccggt gtgtcggagg tgtgcgacgc caacccctac 52921 ctgctgcgcg ccgcaaagtt tcatgggaag cccagccggg tcatctgccc gatctgccgc 52981 aaggagcagc tcacactggt gtcgtgggtg ttcggcgagc acctcggtgc ggtatcaggg 53041 tccgcgcgca ccgccgaaga actgatcctg ctggcgaccc ggttctccga gttcgcggtc 53101 cacgtggtgg aggtatgtcg aacctgcagt tggaatcatc tggtcaagtc atacgtcctg 53161 ggcgccgcac gtccggcacg cccccctagg gggtctggcg ggacgcggac ggcgcgcaac 53221 ggggcccgca cggccagtga atagcgacgg gcgtcaccat cagtcgtcca gcggcgcccc 53281 gcgcgggccg gcgaatcccg gccagcgtgg tcaggttcca cccgacgaca gactgaccgc 53341 gatcctcccg ccggtgaccg atgaccgatc ggctccgcac gcggactcca tcgaggcggt 53401 caaggccgcg ctcgacggcg cgccgccgat gcccccgccg cgcgacccgc tcgaggaggt 53461 cacggccgcg ttggccgccc cgcccggtaa accgccgcgg ggggatcagc ttggtggcag 53521 acgtcgccca ccggggccgc ccgggccccc cggttcgtcc ggacagcctg ccggccggct 53581 gccccaaccg agggtggact tgccccgggt cggccagatc aactggaaat ggatacggcg 53641 ttcgctgtac ctcaccgcgg cggtggtgat cctgttgccg atggtcacct tcacgatggc 53701 ctacctgatc gtcgacgttc ccaagccagg tgacatccgt accaaccagg tctccacgat 53761 ccttgccagc gacggctcgg aaatcgccaa aattgttccg cccgaaggta atcgggtcga 53821 cgtcaacctc agccaggtgc cgatgcatgt gcgccaggcg gtgattgcgg ccgaagaccg 53881 caatttctat tcgaatccgg gattctcgtt caccggcttc gcgcgggcag tcaagaacaa 53941 cctgttcggc ggcgatctgc agggcggatc gacgattacc cagcagtacg tcaagaacgc 54001 gctggtcggt tccgcacagc acgggtggag cggtctgatg cgcaaggcga aagaattggt 54061 catcgcgacg aagatgtcgg gggagtggtc taaagacgat gtgctgcagg cgtatctgaa 54121 catcatctac ttcggccggg gcgcctacgg catttcggcg gcgtccaagg cttatttcga 54181 caagcccgtc gagcagctga ccgttgccga aggggcgttg ttggcagcgc tgattcggcg 54241 gccttcgacg ctggacccgg cggtcgaccc cgaaggggcc catgcccgct ggaattgggt 54301 actcgacggc atggtggaaa ccaaggctct ctcgccgaat gaccgtgcgg cgcaggtgtt 54361 tcccgagaca gtgccgcccg atctggcccg ggcagagaat cagaccaaag gacccaacgg 54421 gctgatcgag cggcaggtga caagggagtt gctcgagctg ttcaacatcg acgagcagac 54481 cctcaacacc caggggctgg tggtcaccac cacgattgat ccgcaggccc aacgggcggc 54541 ggagaaggcg gttgcgaaat acctggacgg gcaggacccc gacatgcgtg ccgccgtggt 54601 ttccatcgac ccgcacaacg gggcggtgcg tgcgtactac ggtggcgaca atgccaatgg 54661 ctttgacttc gctcaagcgg gattgcagac tggatcgtcg tttaaggtgt ttgctctggt 54721 ggccgccctt gagcagggga tcggcctggg ctaccaggta gacagctctc cgttgacggt 54781 cgacggcatc aagatcacca acgtcgaggg cgagggttgc gggacgtgca acatcgccga 54841 ggcgctcaaa atgtcgctga acacctccta ctaccggctg atgctcaagc tcaacggcgg 54901 cccacaggct gtggccgatg ccgcgcacca agccggcatt gcctccagct tcccgggcgt 54961 tgcgcacacg ctgtccgaag atggcaaggg tggaccgccc aacaacggga tcgtgttggg 55021 ccagtaccaa acccgggtga tcgacatggc atcggcgtat gccacgttgg ccgcgtccgg 55081 tatctaccac ccgccgcatt tcgtacagaa ggtggtcagt gccaacggcc aggtcctctt 55141 cgacgccagc accgcggaca acaccggcga tcagcgcatc cccaaggcgg tagccgacaa 55201 cgtgactgcg gcgatggagc cgatcgcagg ttattcgcgt ggccacaacc tagcgggtgg 55261 gcgggattcg gcggccaaga ccggcactac gcaatttggt gacaccaccg cgaacaaaga 55321 cgcctggatg gtcgggtaca cgccgtcgtt gtctacggct gtgtgggtgg gcaccgtcaa 55381 gggtgacgag ccactggtaa ccgcttcggg tgcagcgatt tacggctcgg gcctgccgtc 55441 ggacatctgg aaggcaacca tggacggcgc cttgaagggc acgtcgaacg agactttccc 55501 caaaccgacc gaggtcggtg gttatgccgg tgtgccgccg ccgccgccgc cgccggaggt 55561 accaccttcg gagaccgtca tccagcccac ggtcgaaatt gcgccgggga ttaccatccc 55621 gatcggtccc ccgaccacca ttaccctggc gccaccgccc ccggccccgc ccgctgcgac 55681 tcccacgccg ccgccgtgac cggcgcgctg tcccaaagca gcaacatctc gccacttcct 55741 ttggccgccg atctgcggag cgccgataac cgcgattgcc ccagccgcac cgacgtattg 55801 ggtgccgctc tggcgaatgt cgtcggtggc ccggtaggcc ggcacgcgct gatcggccgc 55861 acccggctga tgaccccgct gcgggtgatg tttgcaatcg cgttggtgtt cctggcgctc 55921 ggttggtcga cgaaagcggc ctgcttgcag tccaccggaa ccggtccagg tgatcagcgg 55981 gtggccaact gggataacca gcgtgcttac taccagttgt gctactccga tacggtgccg 56041 ctctatggcg ctgagttatt gagccaaggc aagtttccgt acaaatcaag ctggatcgaa 56101 accgacagca acggcacacc gcagctgcgc tacgacggac agatcgcggt gcgctatatg 56161 gagtatccgg tgctgactgg gatctatcag tacctgtcga tggcgatagc caagacctac 56221 accgcgttaa gcaaggtggc tcccctcccg gtggttgccg aagtggtgat gttcttcaac 56281 gtcgccgcgt tcggtttggc gctggcgtgg ctgacaaccg tctgggcgac ctcgggcctg 56341 gccggccgcc ggatatggga tgcggcgctg gtggccgcct caccgctggt gatctttcag 56401 atattcacca atttcgatgc gctggcaacg ggtttggcga cgagtgggct gctggcctgg 56461 gcgcggcgca gaccggtgct tgccggtgtg ctgatcgggt tgggctccgc ggcgaaactg 56521 tatccgctgt tgttcttgta cccgttgttg ctgctgggca tccgggccgg tcgcctgaat 56581 gctctggccc gcaccatggc ggccgcggcg gcgacctggt tgttggtgaa tctgccggtg 56641 atgctgctct ttccgcgcgg ctggtcggag ttcttccggc tcaacacccg gcgcggcgac 56701 gacatggact cgttgtacaa cgtcgtcaag tcgttcaccg gctggcgtgg cttcgacccc 56761 accctgggct tctgggagcc gccgctggtg ctgaacacgg ttgtcacgct cttgttcgtg 56821 ttatgttgtg cggcaattgc ttacatcgcg ctcaccgcac cccaccggcc gcgcgtggcg 56881 cagctgactt tcttgacggt ggccagcttc ctgttggtca acaaggtgtg gagtccccag 56941 ttctcgcttt ggctggtgcc gctggccgtg ctggctttgc cgcaccgccg gatcttgctg 57001 gcgtggatga cgatcgacgc gttggtgtgg gtgccgcgga tgtactacct atacggcaac 57061 ccgagccgct cgctgcccga gcagtggttc accacgacgg tgttgctgcg tgacatcgcc 57121 gtgatggtgc tgtgcggact ggtggtctgg cagatctacc gccccgggcg cgacctcgtg 57181 cgtaccggcg ggccaggggc actgccggct tgtgggggag tcgacgaccc ggtgggaggg 57241 gtctttgcca acgccgccga cgccccgcca ggtcggctac cgtcgtggct gcgtccccgg 57301 ctgggcgacg agcatgcgcg agagaggacg cccgatgcag gtcgcgatcg cactttttcc 57361 gggcaacacc gcgcttgacg cggttggccc ctacgaggtg ctgcagcggg tgccgtcgtt 57421 cgacgtcgtg ttcgtcggcc accgccgcgg ggaggttcgc agcgacaacg ccatgctggg 57481 tctgctgtgt gacgcggcat tcgacgagct aacccggccc gatgtggtga tctttccggg 57541 cggcatcgga actcggaccc tgatccacga ccagaccgtg ctcgactggg tgcgcgaagc 57601 gcaccggcac accctactca ccacctcggt gtgcaccggc gggctggtgt tggcggctgc 57661 cggactgctc aacggcttga ccgcgaccac gcattggcga gtacaggatc tgttcaactc 57721 gctgggcgcc cgatacgtcc cccagcgtgt cgtcgagcat ctgccagagc gggtcatcac 57781 cgccgccggg gtgtcgagcg ggatcgacat gggattgcgg ctggtggagc ttttggtcag 57841 ccgggaagcc gccgaagcga gccagctgat gatcgagtat gacccgcagc caccggtgga 57901 tgccggctcc ctggccaagg cctcgccggc tacccatcgg ctcgcgttgg agttctatca 57961 gcatcgtttg tgatctgttc gcgataggcc tcgccgttcg cgacactgac attgcgcaca 58021 cgacacgccg cggatcgtcg caccgggtta agcctggagt gcggtggtgc ctggtcggca 58081 ttttcgcagt cgagggctct cgtgtagcct gggcgagttg ccgacgcagg cgaccctcct 58141 gccacggatc gaccgtggcc gcacacgacc acaggaggtg atgaggttcc tatgcgtcca 58201 tacgaaatca tggtcatcct cgacccgacc ctcgacgaac gcaccgtagc cccgtccttg 58261 gagacgttcc tcaacgtcgt ccgtaaggac ggcggaaaag tcgaaaaggt ggacatctgg 58321 ggcaagcgtc ggctggcgta cgagatcgcc aagcatgccg aaggcatcta cgtggtgatc 58381 gacgtgaaag ccgccccggc gacggtgtcc gaactcgacc gccagctcag cctcaacgag 58441 tcggtgttgc gcaccaaggt aatgcgcacc gacaagcact aatcggcctg ccaggcactg 58501 gctgttcgct gtcggtgcgg ttacgtaggc tcggcgaaga agaacacgac cagccgccga 58561 acccaggcgg acgcaggagg aaattgtggc tggtgacacc accatcacca tcgtcggaaa 58621 tctgaccgct gaccccgagc tgcggttcac cccgtccggt gcggccgtgg cgaatttcac 58681 cgtggcgtca acgccccgga tctatgaccg tcagaccggc gaatggaaag acggcgaagc 58741 gctgttcctc cggtgcaata tctggcggga ggcggccgag aacgtggccg agagcctcac 58801 ccggggggca cgagtcatcg ttagcgggcg gcttaagcag cggtcgtttg aaacccgtga 58861 gggcgagaag cgcaccgtca tcgaggtcga ggtcgatgag attgggcctt cgcttcggta 58921 cgccaccgcc aaggtcaaca aggccagccg cagcggcggg tttggcagcg gatcccgtcc 58981 ggcgccggcg cagaccagca gcgcctcggg agatgacccg tggggcagcg caccggcgtc 59041 gggttcgttc ggcggcggcg atgacgaacc gccattctga ccccaagaac tgcaaatcaa 59101 gaaacggaaa gatagacact catggccaag tccagcaagc ggcgcccggc tccggaaaag 59161 ccggtcaaga cgcgtaaatg cgtgttctgc gcgaagaagg accaagcgat cgactacaag 59221 gacaccgcgc tgttgcgcac ctacatcagc gagcgcggca agatccgcgc gcgtcgggtc 59281 acgggcaact gcgtgcagca ccagcgagac atcgcgctcg cggtgaagaa cgcccgcgag 59341 gtggcgctgc tgccctttac gtcttcggtg cggtagcgcc gaatgtccaa cggagagtgc 59401 aaaataccat gaagctcatt ctcacggccg atgtcgatca cctcgggtcc atcggcgaca 59461 ctgtcgaggt caaggacggg tatggccgta actttctgct cccgcgcggc ctggcgatcg 59521 tcgcctcgcg cggagcccag aagcaggctg acgagatccg ccgggcccgc gaaaccaaaa 59581 gcgtacgcga cctagagcac gccaacgaga tcaaggcggc gatcgaggcg ctcggcccga 59641 tagcgctgcc ggtgaagact tcagctgatt ctgggaagtt gttcggctcg gtgaccgccg 59701 cagatgtggt tgctgccatc aagaaggccg gtggaccaaa cctcgataag cggatcgttc 59761 ggctgcccaa gacgcacatc aaggccgtgg gcacgcattt tgtgtcggtg cacctgcacc 59821 cggaaatcga tgtcgaggta tcgctggacg tcgtggcgca gagctaaggc gagctgaggc 59881 cacaacagtt tgcgcatgcc ggtggtgacc gcggtcggcc gccgccgggg tttcgccatg 59941 ccctgggtgt ccaccgcacg gtccggtgcg gtgatgctgg cgaactattc ggccggcgtt 60001 tgcgggcggg tgtcttcacc gggccttaac gtcaggaaaa tgtgtctgaa agccaacacg 60061 cccggcgcgg taacctggct cgacacgccg aagagattct tgtccacaca aacggcgtcg 60121 cgttgtatgg ccgttaacag cagtgatgtc gtaacgggcc gtattgatcc acaggttctc 60181 cacaccccgc tcaacacaga cgtcgacgga tatgcacatg cgatgcacag ctccataaac 60241 agtggcccct tggagtactt gccagcaacg tttagcgtct tcccggcgct aggcgatgtg 60301 ggtgacttgg gcggtggtgt cggtgcggcg acttacgctc tggataggtt gtcgaatatg 60361 cgttcgggtg cttgtgtcgg aggaggtgag agcccatggc ggtcgttgat gacctagcgc 60421 ccggcatgga ctcctcaccg cccagtgaag attacggccg tcaaccaccg caggatctcg 60481 ccgccgagca gtccgtgctg ggcgggatgt tgctgagcaa ggacgccatc gccgatgtac 60541 tggaacggct acggcccggc gatttttatc gtccggcgca tcagaacgtc tacgacgcca 60601 ttttggacct gtatgggcgg ggagaaccgg ctgatgcggt gacggtggcc gccgaactgg 60661 atcgccgtgg gctgctgcgc cgcatcggcg gtgctcccta cctgcacacc ctgatctcga 60721 cggtgccgac ggccgccaac gcgggctact acgcgagcat cgttgccgaa aaggcgctgc 60781 tgcgccggct ggtagaggcc ggaacccggg tggtgcagta cggctatgcc ggcgccgaag 60841 gcgcggatgt ggccgaggtg gtcgatcgcg cgcaggccga aatctacgac gtcgcggatc 60901 ggcggctgtc ggaagacttt gtggcgcttg aggacctgct gcaaccgacg atggacgaga 60961 tcgatgccat cgcttccagt ggcggcctgg cgcgcggggt ggctaccggc ttcaccgaac 61021 tcgacgaggt caccaacggc ctgcatccgg ggcagatggt catcgtggcg gcgcgcccgg 61081 gcgtgggaaa gtccaccctt gggctggact tcatgcggtc atgctcgatc aggcatcgga 61141 tggccagcgt catcttctcg ctggagatga gcaagtccga gattgtcatg cgactgctgt 61201 cggcggaggc caaaatcaag ctctccgaca tgcgttcggg ccggatgagc gatgacgact 61261 ggacccggct ggcgcggcgg atgagcgaaa tcagcgaagc gccactgttt atcgacgact 61321 cgcccaacct gaccatgatg gagatccgtg ccaaggcgcg ccgcctgcgg caaaaggcca 61381 acctgaagtt gatcgtggtc gactacctgc aactgatgac ctcgggcaag aagtatgaat 61441 cacggcaggt ggaggtgtcg gagttctcgc ggcatctgaa gctgttggca aaagagcttg 61501 aggttcccgt ggtcgcgatc agccagctca accgtgggcc cgagcagcgt accgataaga 61561 aaccgatgct ggccgacctc agggaatcgg gctgcctgac cgcgtccacc agaatcttgc 61621 gcgccgatac cggcgctgag gtcgccttcg gtgagctcat gcgaagcggt gaacgtccca 61681 tggtgtggtc gctggacgag cggctgcgca tggtggcccg gccgatgatc aacgtgttcc 61741 cgagcgggcg caaggaagtg tttcggcttc ggctggcttc cggacgcgaa gtcgaggcca 61801 ccggcagcca cccctttatg aagttcgaag gctggactcc cttggcgcag ttgaaggttg 61861 gtgaccggat cgcagcaccg cgccgggtac ctgagcccat cgacactcag cggatgcccg 61921 agtctgagct catttcgctg gctcgcatga tcggtgacgg gtcgtgcctg aagaaccagc 61981 cgatccgcta cgagccggtg gatgaggcga acctggccgc ggtgacggtc tcggcggcgc 62041 actcggatag ggctgcgatc cgcgacgact acctcgcagc tcgagtgccg tcgttgcgcc 62101 cggcgcggca acgactaccg cgcgggcggt gcacgccgat tgcggcgtgg ctggctggcc 62161 tagggctatt cacgaaacgc agccacgaaa aatgcgtacc ggaggctgta tttcgcgccc 62221 ccaatgacca ggtggcgttg tttctgcggc atctgtggag cgctggtggc tctgttcggt 62281 gggatcccac gaatggtcaa ggccgggtct actacggctc aaccagtagg cgtctcatcg 62341 acgatgtggc tcaattgctg cttcgggttg ggattttttc ctggatcaca cacgccccaa 62401 agttgggcgg ccacgattcg tggcggctgc acattcatgg cgcgaaggat caggtcaggt 62461 tccttcgtca cgtcggcgtt cacggcgccg aagcggtggc ggcccaagag atgctgcgtc 62521 agctcaaagg accggttcgc aacccgaacc tggacagcgc gccgaaaaaa gtatgggcgc 62581 aagtccgcaa ccgactgtcc gccaaacaga tgatggacat ccagctccac gaaccgacga 62641 tgtggaagca ttccccgagc cggtcaaggc cgcatcgcgc ggaggcgcgg atcgaagatc 62701 gagcgatcca tgagctggcg agaggcgacg cgtactggga caccgtcgtg gagatcacca 62761 gcattggaga tcaacatgtt ttcgatggga ctgtaagcgg cacacacaat ttcgtcgcca 62821 atggcattag tttgcacaat tcgctggaac aagatgccga cgttgtcatc ctgctgcatc 62881 gacccgacgc ctttgaccgc gacgatccac gtgggggaga agcggatttc attctcgcca 62941 aacaccgcaa cggtccgacg aagacggtca ccgtagcgca tcaactgcac ctgtcacgct 63001 tcgccaacat ggctcggtga catgcggatg tgtggggtct cacggagcgt ggccgaatct 63061 cacgaatgat ggggccatca gggcggaccg gtccacgcat ccgcggcggc gttgaagtcc 63121 ccgagcaaca cgcgtcgtgg ttgatgcgtg agatgagtca gatcagggcg acaggacgtc 63181 gaaccagtgg gactaatgca tgatcaccag atacaagcct gagtcggggt ttgtcgcccg 63241 tagcggtggt cccgaccgga agcgtcccca tgactggatc gtttggcact tcacccatgc 63301 cgacaatctc cctgggatca tcaccgctgg ccgtctgctg gccgattcag cagtcacccc 63361 gacgaccgag gtggcatata acccagtcaa ggagttgcgc cgccacaaag tcgtcgcccc 63421 cgacagcagg tacccggcgt cgatggcaag cgatcatgtg ccgttctaca ttgcggcgcg 63481 gtcgcccatg ctctacgtcg tatgcaaggg ccactccggc tactccggcg gtgccggccc 63541 gctggtgcac ctcggggtgg cgcttggcga catcatagac gcggatctga cgtggtgcgc 63601 cagtgacggc aatgctgcag ccagctacac caagttcagc cgccaggtcg acacgctcgg 63661 caccttcgtc gactttgacc tgctctgcca gcggcaatgg cacaacaccg atgacgaccc 63721 caaccgccag agccgccgcg ccgccgagat cctggtatac ggccatgtcc cgttcgagct 63781 ggtcagctac gtgtgttgct ataacaccga gacgatgaca cgggtacgaa ctctgctcga 63841 tcctgtcggt ggggtgcgaa agtatgtcat caagcccggc atgtactact aaggaaggag 63901 gaggccatat gatcacgtac ggctctggcg acctccttcg ggctgacacc gaagcgctcg 63961 tcaacaccgt caactgtgtt ggggtgatgg gcaagggaat tgcgctgcag ttcaaacgcc 64021 gctaccccga gatgttcacc gcctacgaaa aggcgtgcaa acgcggcgaa gttaccatcg 64081 gcaagatgtt cgtcgtcgac accggacagc tcgacggacc gaaacacatc atcaacttcc 64141 ccaccaagaa acactggcgt gcaccgtcga agctggccta tatcgacgcc ggcctcattg 64201 atctcatccg cgtgatccgt gaactcaaca ttgcttctgt ggcagttccc ccgctggggg 64261 tgggcaacgg aggtctggat tgggaagatg tcgagcaacg gctcgtatca gcattccagc 64321 agctgcccga cgttgacgcc gtgatctacc ccccatcagg tggatctcgc gccatcgagg 64381 gcgtcgaagg acttcggatg acctgggggc gcgccgtcat actcgaagcg atgcggcgat 64441 atctccagca gcgccgcgcg atggagccgt gggaagaccc tgcagggatc tcgcatctgg 64501 agattcagaa gctcatgtac ttcgccaacg aggccgatcc cgatcttgcg ctagatttca 64561 cgcccggccg atacgggcca tacagcgaac gtgtccgtca cttactgcaa ggaatggagg 64621 gcgcattcac agtcggcctg ggtgacggca ccgcaagagt tcttgcgaac caaccgatct 64681 cgttgactac taagggaact gacgccataa cggactatct ggccaccgat gcggcagctg 64741 accgggtgag cgccgcagtc gacacggtgt tgcgcgtcat cgaaggcttt gaaggcccat 64801 acggggttga gctgctcgcc agtacgcatt gggtggccac acgtgagggc gccaaggaac 64861 cagccacggc agcggccgcg gtccgaaagt ggacaaaacg caagggtcgg atctacagcg 64921 acgatcgcat cggtgttgcc ctcgaccgca ttcttatgac tgcctgaaag cgaccggctc 64981 gtcgttaagg atgtgcgccg acgcccagcc gtcagggagc gttgggctgc tcggacggaa 65041 ttgccccacc gcaaccaccc ggtggcggcg ggccggggag gggctcaccg ccgctgacac 65101 aatcgaagta aaactgtggg ccggtaaacc acgtttgcat ccactggtgc caaaacgagc 65161 cgtcggggta cttctcgccg tcgcacacgg ccaagtcgcc aaaaccccat cggccacccg 65221 ggcaatagcc tttcgtcatg tccggctgat gcgggtcagg tggatctgcg ctggcaaccg 65281 aggcaggaaa cacaagcgcc gctgcacaac ccagtatcgc agtactcagg cgagcaaact 65341 tcaacttcat ttcaaactcc gtcaaacgtt gaatcgactc ggcggactcc aagcgatggt 65401 cagcgcttgc ggatgagccg cggcaatgag tcgtagtggg cagacattcc cgagaacagc 65461 ctgaaatcct gttcggttga tgccgtgccg gcatcgacgt accaggacga ggcactgact 65521 cgggaaggca cagccgccgt ggcgattgta tatgacgcgt cggactgggc agcgatggcg 65581 cgggactctg cccgggcgcc ggccttggac acggccagcg cccgccacct gtcgtcggca 65641 tttggcgttt gtcgaattgc ggcattattt tgctcgggtg atgtcatcag ctattggttc 65701 ggtcgcgcgg tggatagtcc ccctcctggg ggttgcagcc gttgcttcca tcggtgttat 65761 cgcggacccg gtgcgggtcg ttcgggcccc ggcgttgatc ctggtcgatg cggcaaaccc 65821 gctggccgga aagcccttct acgtcgatcc cgcctcggcg gccatggtcg ccgcgcgcaa 65881 cgccaacccg ccgaacgccg agctgacctc cgtcgccaac accccgcagt cctactggct 65941 cgaccaggca ttcccgccgg cgaccgtcgg cggcacggtt gccaggtaca ccggagcggc 66001 gcaggcggcc ggcgccatgc cggttctgac gctgtatgga atcccccatc gcgactgcgg 66061 tagctacgca tccggtgggt tcgcgacggg cactgattac cgcgggtgga tcgacgctgt 66121 cgcatccggc ctgggctcat cgccggcgac gatcatcgtc gaacccgatg cgctggccat 66181 ggccgactgc ctgtcgcctg accagcgcca ggaacgtttc gacttggtgc gctacgccgt 66241 cgacacgctg acccgcgacc cggccgctgc cgtgtacgtc gatgcggggc attcgcgctg 66301 gctgagcgcc gaggcaatgg ccgccaggct caacgatgtc ggtgtgggcc gcgcgcgcgg 66361 gtttagcctc aacgtctcga acttctacac caccgatgag gaaatcggct atggcgaggc 66421 gatttcgggg ctcacgaacg gttcgcatta cgtgatcgac acgtcgcgca acggcgccgg 66481 acccgcgccc gacgccccgc tcaactggtg taaccccagc ggccgcgccc tgggcgcacc 66541 gcccaccacg gcgaccgcgg gcgcgcacgc cgacgcttac ctgtggatca aacgtcccgg 66601 ggaatcggac ggaacctgcg gtcgcgggga gcctcaggcg ggtcggttcg ttagccagta 66661 cgccatcgat ctggcccaca acgccggcca gtagagacct cacgcgcaga ccggctgagc 66721 gtgcggccgt tgggccgtcg gcgtcgggtt cggccaggtg gggtaacggt tcgggcacgt 66781 ttccactacc tcgtgacacg tcatgcggca ccgcggttcg ggtggtcgac aatgcgggac 66841 atgacccaaa attcggggtg ctgccggccc gcagcgtcgg gctgcgccgc gctggtgacc 66901 gtcgcgagac gggagcccga cgttggcgcg tgagatctca cgccagacgt ttctgcgggg 66961 tgccgccgga gcgttggccg ccggcgcggt cttcggctcg gtccgggcta ccgcggatcc 67021 ggctgcctct ggctgggagg ctctttcttc cgccctcgga gggaaagtgc tacaaccgga 67081 cgacggtccc caattcgcaa cggccaagca ggttttcaac accaactaca acggctatac 67141 gccggcggtg atcgttaccc cgacatcgca gctggacgtg cagaaggcga tggcgttcgc 67201 tgccgcgaac aacctcaagg tggccccacg cggtggcggg cactcctacg tgggggcgtc 67261 cacggccaac ggcgccatgg tgctcgacct acgtcagcta cctggggaca tcaactacga 67321 cgccaccacc gggcgggtca cggtgacgcc cgccaccggt ttgtacgcca tgcaccaggt 67381 gttggccgcg gccggccggg gcatcccgac cggcacctgc ccgacggtcg gtgtcgcggg 67441 acacgcgctg ggcggcgggc tgggcgccaa ttcccggcac gccggcctgc tctgtgacca 67501 attgacgtcg gcgtcggtgg tgctgcccag cggccaggcg gtcaccgcgt ccgccaccga 67561 ccaccccgac ctgttctggg cgttgcgcgg tggcggtggc ggcaacttcg gcgtgacaac 67621 ctcgctgacc ttcgcgacgt tccccagcgg ggacctcgac gtcgtgaacc tcaatttccc 67681 accgcagtcg ttcgcgcagg ttctggtcgg ttggcagaat tggctgcgaa ccgccgaccg 67741 aggcagctgg gcactggccg atgccaccgt cgacccgctg ggcacgcatt gccgcatcct 67801 tgcgacctgc ccggccgggt cgggcggcag cgtggcggcc gccatcgttt cggccgtcgg 67861 aacgcaaccg accggcaccg aaaaccacac gttcaactat ctggacctgg tcagatatct 67921 ggccgtcggg aacctcaacc cgtcgccgct gggatatgtc ggcggatccg atgtcttcac 67981 gacgatcact ccggcgaccg cccagggaat cgcctcggcg gtcgacgcct ttccgcgtgg 68041 agcgggccgc atgttggcga tcatgcacgc cctcgacggc gcgctcgcca ctgtgtcacc 68101 gggggccacg gccttcccgt ggcgtcggca gtcggcgctg gtgcagtggt acgtcgaaac 68161 atccggctcc ccgtcggaag cgactagctg gctcaacacc gcacatcaag cggtgcgagc 68221 gtattcggtt ggcggctatg tgaactatct cgaggtaaac caaccgccgg cacgttactt 68281 tggcccgaat ctgtcccggc tgagcgcagt acgtcagaag tatgacccca gccgggtcat 68341 gttctccggg ctgaacttct agcagccccg catgagtact agcccctagg acgggccatc 68401 ctcgtctacc ctgggaagtg atcatggaac tttccgtgtc tgttatcgcg gggttggtca 68461 tcgcactgct ggcggccatc acccctgctg cgggcgaacg cccggaaagc cgccgccagg 68521 cgctcgcaaa tgccgccgag gccggggagc atccggccac atcaccgttg cgacggtagc 68581 cgattcgtcg cgatacggct gtggagttag gaggcgcgga tggagacagg ttcgccggga 68641 aaacgtccgg tcttgcccaa gcgtgcccgc ctgctggtga cggcaggcat gggcatgctc 68701 gcgttgctgc tgtttggacc ccggctagtc gatatttacg ttgactggtt gtggtttggt 68761 gaggtcggtt tccgcagcgt ctggatcacg gtactgctga cccgcctggc gattgtcgca 68821 gcggtcgcac ttgtggtggc cggcattgtg cttgctgccc tactgctggc gtatcgctcg 68881 cggccgttct ttgtacccga cgagccgcag cgggacccgg tcgcgccact tcgcagcgcg 68941 gtgatgcgcc ggccgcgcct gttcgggtgg ggcatcgccg tcacgctcgg tgtggtgtgc 69001 gggctgatcg cttcgttcga ctgggtgaag gttcagttgt tcgtacacgg gggcaccttt 69061 ggcatcgtgg accccgaatt cggctatgac attgggtttt tcgtcttcga tctgccgttc 69121 taccggtcgg tgctgaactg gctgttcgtg gccgtggttc tggcgtttct agcgagcctg 69181 ttgacgcatt acctgttcgg cggccttcgg ctgacaaccg gcagaggcat gctgacccag 69241 gcagctcgcg ttcaactcgc agtgttcgcc ggcgcggttg tactgctgaa ggcggttgcc 69301 tactggttgg atcgctatga gctgttgtcg agtggacgta aggagccgac cttcaccggc 69361 gccggctaca ccgatatcca cgccgagctg ccggccaagc ttgtgctggt ggcgattgcg 69421 gtattgtgtg cggtgtcatt ctttaccgcg atctttttgc gcgacttgag gattccggcg 69481 atggccgccg cactgctggt gctgtcggcg atcctggtcg gtggactgtg gccgctgctg 69541 atggagcagt tctcggtgcg tcccaacgcc gccgatgtcg aacgcccata tatccaacgc 69601 aacatcgaag cgacccgcga ggcgtatcgg atcggtggcg attgggtcca gtaccgtagc 69661 tatccgggca tcggtaccaa acagccgcgc gacgtgcccg tggatgtcac cacgattgcc 69721 aaggtgcggc tgttggaccc gcatatcctg tcccgaacct tcacccagca acagcagctc 69781 aagaatttct ttagcttcgc cgagatactc gacatcgatc gctatcgcat cgacggtgag 69841 ctgcaggact acatcgtcgg cgtccgggag ctctcgccga aaagcctcac cggcaatcag 69901 accgactgga tcaacaaaca caccgtctac acgcatggca acggcttcgt ggccgccccg 69961 gccaatcggg tgaacgcggc ggcccgcggt gccgagaata tttccgacag caacagcggg 70021 tacccgatat acgccgtcag tgacatcgcg tcgctgggtt ctgggcgcca ggtcatcccg 70081 gtcgagcagc cacgggtcta ctacggcgag gtgatcgccc aggccgatcc ggactacgcg 70141 atcgtgggcg gagccccggg gtccgcgccg cgcgagtatg acaccgacac gtccaagtac 70201 acctataccg gcgccggggg tgtgtcgatc ggaaactggt tcaaccgcac ggtgtttgcc 70261 accaaggtcg cccagcacaa gttcctgttc tcccgggaga tcggctcgga gtcgaaggtg 70321 ttgatccatc gcgacccgaa ggaacgggtg caacgcgtgg cgccgtggtt gaccaccgac 70381 gacaacccct atccggtggt ggtgaacggg cggatcgtct ggatcgtcga cgcctacacc 70441 accttggaca cctatccgta cgcacaacgc agctcgctcg agggcccggt gaccagcccg 70501 accggcattg tgcggcaagg caagcaggtg tcgtacgtgc gtaactccgt caaggcaacc 70561 gtggacgcct acgacggaac cgtaacgctg tttcagttcg atcgagacga cccggtgctg 70621 cggacctgga tgcgtgcctt tcccggaacc gtcaagtccg aagaccagat tcccgacgag 70681 ttgcgtgccc acttccgtta tccggaggac cttttcgagg tccaacgtag cttgctggcc 70741 aagtatcatg tcgacgaacc gcgagagttc ttcaccacca acgccttctg gtcggtgccc 70801 agcgacccga ccaacaacgc taacgccact caaccgccgt tctacgtcct cgtcggcgac 70861 cagcagagcg cccagccgtc cttccggttg gcgtcggcga tggttggcta caaccgcgaa 70921 ttcctctccg cgtacatctc ggcgcactcg gatccggcga actacggcaa gctgaccgtg 70981 ctggagttac ccaccgacac cctgacccaa ggcccgcaac aaattcagaa ctcgatgatc 71041 tccgacactc gggtcgcctc cgagcgcacc ctgctggaac ggtcaaaccg gattcactac 71101 ggcaacctct tgtcgctgcc gatcgccgac ggcggcgtgc tctatgtgga accgctctac 71161 accgagcgga tctcgacaag cccgagcagt tcgactttcc cgcaactttc ccgggtgctg 71221 gtcagcgtgc gtgaaccccg caccgagggc ggggtccggg tcgggtacgc accgaccctg 71281 gccgaatctt tggatcaggt atttgggccc ggcaccggtc gggtcgccac cgctcgcggc 71341 ggtgatgccg ccagcgcgcc accgccggga gccggcgggc cggcaccgcc gcaggccgta 71401 ccgccaccga gaacgaccca accgccggcc gccccgcccc gggggccgga cgtccccccc 71461 gcgacggtgg ccgaactgcg ggaaacgctg gccgatctgc gcgcggtgct cgaccggtta 71521 gagaaggcca tcgatgccgc cgaaacgccc ggtggataag ccggcattct tagccggtga 71581 actccgctat ggctaccatt caagttcggg atttgcccga agatgtcgcc gaaacctatc 71641 gacggcgcgc caccgcagcg gggcagtcgc tgcagacgta tatgcgcacc aagctcatcg 71701 aaggggtgcg gggccgagac aaggccgagg caatcgagat cctggaacag gcgctcgcca 71761 gcactgccag cccaggcatc agccgggaga ccatcgaggc atcccggcgg gagctcaggg 71821 gtggatgaat gtgtagtcga cgcggcggcc gtggttgacg ctctcgccgg caagggcgcc 71881 agcgcgatcg ttctgcgcgg tttgctcaag gagtcgattt ctaacgcgcc gcatttgctg 71941 gacgcagagg tcggacatgc actccgccgc gccgtgctca gcgacgaaat ctccgaagag 72001 caggctcgcg ccgcgttgga tgccttgcct tatctcatcg acaatcgtta cccgcacagc 72061 ccacgactga tcgaatacac atggcagcta aggcacaacg tcacgttcta cgacgccctt 72121 tacgtcgcac tggccaccgc actggatgtc ccgctgctca cgggcgactc gcggcttgcg 72181 gccgcgccgg gccttccgtg cgaaatcaaa ctcgttcggt gacatccctt tgcgggacgc 72241 caatggcgcc gtcgtagccg ggccagcccg tcgtcagcct tggacagcct ccagcgctgc 72301 attgaacgtc ttgctgggcc gcatcaccgc cgtagtcatg tcgctgtccg gcgcgtagta 72361 gccgccgatg tccaccggtt cgccttgtac ctcggtgagc tctcgcacga tgacgtcttc 72421 gtttttggtc aacacatctg ccagcgaggc gaagtgttcg gccagctgct ggtcgtcggt 72481 ctgcgcggcc agctcttgtg cccagtacat ggcgaggtag aactggctgc cccggttgtc 72541 gagttcaccg gttttgcgcg acggactctt gtcgttgtcc agcagcttgc cgatggcggc 72601 atccagggtc ttacccaaga gtttggcccg ctcgttaccg gtcttgatgc cgatatcctc 72661 gaaaccggcg cccagcgcga ggaactcacc cagagaatcc cagcgcaggt gattctcctc 72721 caccaattgt ttgacgtgct tgggtgccga accgcccgcc cccgtctcgt acattccgcc 72781 gccggccatc agcggaacga cggacagcat cttggcgctg gtgcctaact ccaggatcgg 72841 gaacaggtcg gtgaggtagt cgcgcaggat gttgccggtc gcggcgatgg tgtccagtcc 72901 acggaccagg cgctcgcacg tgtagcgcat ggatcgcact tgcgacatga tctggatgtc 72961 cagaccttcg gtgtcgtgat ctttcaggta tgtcttgacc ttcttgatca gctcgttctc 73021 gtgcgggcgg tacgggtcca gccagaacag caccggcatc ccggagatgc gcgcgcgggt 73081 gacagccagc ttgacccagt cacggatcgg tgcgtccttg acgatgcaca tgcgccagat 73141 gtcgccggct tccacgttct cggtcagcag cacctcgccg gtggcgacat cgacgatgtt 73201 ggcgacgccg tcctcgggaa tctcgaacgt cttgtcgtgc gagccgtact cctcggcctg 73261 ctgggccatc agacccacat tggggacggt gcccatcgtc gtcggatcga actggccatt 73321 tgtcttgcag aagttgatga tctcctgata gatgcgcgag aaggtggact ccgggttgac 73381 cgccttggtg tccttgagct ttccgtcggc gccatacatc ttgccgcccg cgcgaatcat 73441 cgcgggcatc gaggcgtcca cgatcacatc gctcggcgag tggaagttgg agatacctct 73501 ggccgaatcg accatcgcga gctcggggcg gtgttcgtgg caacggtgta ggtcctcgat 73561 gatctcgtcg cgttgcgacg ccggcagcga ctcgatcttg ctgtacagat cggacaagcc 73621 attgttgacg ttgacgccca agtcgtcgaa cagctcctgg tgcttggcga aggcgtcctt 73681 gtagaagatc ctgaccgcgt ggccgaagac gatggggtgg ctgaccttca tcatggtcgc 73741 cttgacgtgc aaggagaaca tcacgccggt ctcgaacgca tcctgcatct gctcttcgta 73801 gaagtcgcac agcgctttct tgctcatgaa catgctgtcg atgacgtcgc cgtcatccag 73861 cggcacctcg ggcttgagca cgatcgtctt gccgctcttg gccagcagtt ccatcctcac 73921 gttgcgcgcg cggtccagtg tcatcgactt ctcgccggcg tagaagtcac cgtgccgcat 73981 gtgcgctacg tgggtgcgtg aggccatcga ccactcgccc atgctgtgcg ggtgcttgcg 74041 cgcgtactcc ttcaccgcct tgggcgcccg acggtccgaa ttgccttggc gcagtaccgg 74101 gttcaccgcg ctgcccaggc atctggcgta gcgctctttg atggccttct cctggtcagt 74161 cttcgggtcc gccgggtagt ctgggaccgc gtaacccttg tcttgcagtt ccttgatggc 74221 ggctaccagc tgtggcaccg aggcgctgat gttcggcagc ttgatgatgt tggtgtcggg 74281 tagctgagtc agccggccca gttcggcgag gttatccggt acccgctgct cctcggtcag 74341 gtaatcgggg aattccgcca ggatgcgtgc cgctacagag atgtcgctgg cctcgatctt 74401 gatgcccgcc ggttcggcaa aggcacgcac aatcggcaga aaggcgtagg tcgccagcag 74461 cggcgcctcg tcggtcagcg tgtaaatgat ggtcggctgt tcggcgctca tggtgttctc 74521 ccggcgtcac tgtcggtcag atgctgaatc actccgcgtt gtagcggcgg ttaccagtat 74581 cgcggattgc gccgcacatg attcgggcgg tgttctgcgc gacgacgatc actttctgtt 74641 tgcccgaagg ccgtcgaggg cgacgtcggt cacctttgcg gccaactcag cgttgtagct 74701 ctgcatcgct tggcagccga ctaggagcgt cttgacttca agcacgtcta cgtccggccg 74761 tacggtgccg gcgcgctggg cggcgcgcaa caggtcggtg agcaggtcca agaaatctgc 74821 ctcggcttcc ggggccgcgc tgctgatttc aatcccgacg ccggccagcg cctcgaccag 74881 gccgcgatcg gtggcgcccc actgcaatac catcgaccgc aggaatgcaa acagcgcgtc 74941 gccgggatgc ttggatttga gcagggcatg tcccttgtcg atgatgcggt gcatccggtc 75001 ggcgatcacc gcctgaaaca gcgcctcctt ggtcgggaaa tgccggtata ccgtgcctgc 75061 gccgactccg gcgcgccgag cgatctcgtc aacgggcacc gatagaccgt cggccgcaaa 75121 ggtttggtag gcaacctcca atacgcgtgc ccggttacgg gccgcgtcgg cacgcacccg 75181 ccggtcagta ggagccaagt cgtacctccg aaagccttga caaagcgggg cgcgcgttcc 75241 gtatagttcg gctaagcgga gcgctcgccc cgcttagtca aagcatagcg aggagccctc 75301 atgaccaaat ggactgccgc cgacattcct gaccagaccg gccggaccgc cgtcatcacg 75361 ggggccaaca ccggacttgg attcgagacc gccgcagcgc ttgccgccca tggtgcacac 75421 gtggtgctgg ctgtgcgcaa cctcgacaag ggcaagcagg cggcggcacg catcaccgag 75481 gccacccccg gcgccgaagt agagcttcag gagcttgacc tgacctcgct ggcgtcggtg 75541 cgcgccgccg cggcacagct gaagtctgac caccagcgca tcgacctgct gatcaacaac 75601 gccggggtga tgtatacacc ccgacagacc acagcagacg gcttcgagat gcagttcggc 75661 accaaccact tgggccattt cgcgttgacc ggcctgttga ttgatcgact gctgcccgtc 75721 gccggttcac gagtggtcac catcagcagc gtcggccatc gcatccgtgc cgcaatccat 75781 ttcgacgacc tccagtggga acgccggtac aggcgggtcg ccgcctacgg ccaagccaag 75841 ctcgccaacc tgctgttcac ttatgaactt cagcgtcggt tagcaccggg cggaaccacc 75901 atcgcggtcg cgtcgcaccc gggagtgtcc aacaccgaag tggtccgcaa catgccacgg 75961 ccgctcgtcg cggtggcggc catactggcg ccgctgatgc aagacgccga actgggggcc 76021 ctgccgacat tgcgtgccgc caccgatccc gcggtgcgcg gcggccagta cttcggaccc 76081 gatggcttcg gtgaaatacg gggctacccg aaggtggtgg cctccagcgc ccagtctcac 76141 gacgagcagc tgcagcgccg cctgtgggct gtgtccgaag agctcaccgg ggtcgtctat 76201 cccgtcggat gagccggact caacggcaac ggttggtcaa cactcgacga tgttgactgc 76261 gacgttgatg gcgagcccgc cggccgaggt ttccttgtac ttggtgtgca tgtccgcgcc 76321 ggtggcgcgc atggtgtcga tgacctggtc gagggtgacg cgatggatgc cgtcgccgcg 76381 caatgccatc cgtgcggcgt tgatggcctt gccggcggaa atcgcgttgc gttcgatgca 76441 ggggatctgc accagcccgg cgatggggtc acaggtcagg ccgaggctgt gttccatggc 76501 gatctcggcg gcgttttcca cttgtcgcgg tgtgccgccg aggatttcag ccaatccggc 76561 ggcggccatg gcggccgcgg agccgacctc gccctgacag ccgacctcgg ctccggagat 76621 cgatgctcgc tccttgaaca acgatccgat ggctccagca gtgagcagga atcgcacggt 76681 gacatcgtcg gggtcccccg cgccggccga cgtgtagtgg attgcgtagt gcaggaccgc 76741 cggcacgatg ccggcggcac cgttggtcgg ggcggtgacg acgcgcccac cggaggcgtt 76801 ctcctcgttg actgccagcg cgaccaggtt gacccagtcc tcagcgaatt ccggcttgcg 76861 agtggggtct tcggcgttca agcggtcata ccacaccttc gctcgccggc gcacccggag 76921 gccgccagga agcaaccctt cgcgagcgat gctccgctgt tcgcactcaa ccatgacgtc 76981 gcgcaggtgc agcagcgcgg cgcgtacctc gttctcggtg cggcaacatg tttcgttgcg 77041 cagcgccgct tcgctaattg acacgtcgag gcggtcacag atgtccagca gttcttgggc 77101 cgacacgtag ggaagggcaa ctgagcatgg atgttggccg ctgttgccgc tggtctgttc 77161 cgtgacgatg aaccctccgc ccaccgaaaa ataagtctcg gtggccaaga cgcggccgtg 77221 tgggcccgcg gcagtgaacg tcattccgtt gggatgcgtt ggcagaacga tgtcgggatg 77281 caggtcgata tcacgctcgg tcagcgggac cggaatgaca ccgccgattc gcgtcacgcc 77341 ggacgctgcg atctcggcga gccggcgttc cttgtgttcg gtggtaatcg tttctggctg 77401 gcagccttcc agccccagca atatcgccga catggtgcca tgaccggctc cggtggccgc 77461 gagcgagccg aacagatcca ctcgcatcgc ctcgaggtca tccaggtggc cccggcggcg 77521 cagcgcaact acgaactggt ttgccgcgcg catcggtccc acggtgtggg aactggacgg 77581 cccgatgccg atggtgaaca ggtcgaagac gctgatggtc atgtccggtg cagttccggg 77641 tagagcggat agcgtgcggc cagccgctgg acctgggcgc gcagcggacc cagctggtcg 77701 tcgttggtgg ccgtcagtgc cgccgcgatg aggtctgcca cggcgcggaa gtcgttgtgg 77761 gagaagccgc gtgcggccag cgccggggtg ccgattcgca ggcccgaggt gatcatcggg 77821 ggacgagggt cgaagggtac cgcgttgcgg ttgacggtga tgtccacggc ggccaaccgg 77881 tcttcggctt gctggccgtc gagttcggcg tcgcgcaggt cgactaggac gaggtgcaca 77941 tcggtgccgc cggttagcac cgcgatgcca cgttcggcga cgtcgggctg ggtcaaccgg 78001 ccggcaagga tgcgcgcgcc gtcgaggcaa cgttgttggc gctgcgcgaa ttcaggttgt 78061 gctgccatct tgaatgcggt ggccttggct gcgatgacat gctcgagcgg cccgccctgc 78121 tgcccaggga agaccgcgga attgatcttc ttggcgatgg ccgggtcatt gcacaagatg 78181 atgccgccgc ggggcccgcc gagcgtcttg tgagtggtgg aggtgacgac gtgggcgtgc 78241 ggcaccgggc tggggtgcac gccagcggcg accaggccgg cgaaatgcgc catatccacc 78301 atgagcacgg cgtcgacttc gtcggcgatg gcgcggaagc gggcgaaatc cagctggcgt 78361 gggtacgccg accagccggc gatgatcatt ttgggccggt gtgtgcgcgc tgcctcggcg 78421 acggcatcca tgtcgaccag gtagtcctct ttggacacct cgtaggcggt ggcgtggtag 78481 agcttgccgg aaaagttgat ccgcatcccg tgggtcaggt gaccgccatg agccagcgac 78541 aaccccagga tggtgtcgcc ggggtttagc agcgcatgca tggtggcggc gttggcggtg 78601 gcccccgaat gtggttgcac gttggcgtat tcggcgccaa agagcgcttt gacgcggtcg 78661 atagccaact gctcgacacc gtcgacgaat tcacagccac cgtagtagcg ccggcccggg 78721 tagccttcgg cgtacttgtt ggtcaagacc gaaccttggg cctgcatcac ggccagcggt 78781 gcatagttct ccgaagcgat catctccaag ccggattctt gacggcgcag ctcgccgtcg 78841 atcagggcgg cgatgtccgg gtcgaaggcg gtcagggagt cgttgagggt gttcatcagc 78901 tcagtccggt ctgttcggcg tactcggggg cggtcaaggg tgttcccgga gcaatcggct 78961 gcccggccaa atgggcatcc ggcggccgcg acatcgtttc ggccacggcg aggtcgccaa 79021 cagttcgatc gtgcggttca gacaagggcc aactccggtt tcgacgagcc cggatcgcgc 79081 cgggctggtt gcgccctccc cgctctgtcc tgaaacctga gagtctgcgg cgtcgcatca 79141 tggcgccgct ctacaccttc ggtcaggcac ggtcggtgcg accgtccctg tctccagagt 79201 tgcctcggcg gtgtggtgct tgggcctgag agattctcgg ggaggagatt gctcctacgg 79261 cgcctcgaca tggaggttct cccacatcgc gtcagcggct gttcgattgt gacggaaagc 79321 aacatacaca ccacgcatgt gttttgtcac cctgcggtcg gtggtagtcg gacggcccaa 79381 tcagacagcg cgggtcatat cacgcgttcg tgcacagttg ggtgtttatc cacaggggtg 79441 cgtttgtcgg cggctggcgg ggcgtggcgg cgatagcatt cgaatatgag ttcgatcacg 79501 gtgtcggtgg acccggtgga cccggtggac ccggtggacc cggtggaccc ggtggacgcc 79561 gtggtcgccg cgggatcaga cgggctcact gtggcccgca tcgagtccga gatcggggcc 79621 ttggagttcc tgaacgaact gcgcactgaa ctcaagagtg gacagtttcg acctcaaccg 79681 gtgcgggaac gcaagatccc caaaccgggc gggttgggca aggtacggcg gctggggatt 79741 cccacagtgg ccgaccgggt cgttcaggcg gcgttgaaac tggtgctaga acccatcttt 79801 gagaccgact tcgagccggt ctcctacggg tttcggcccg cgcgacgcgc gcacgacacg 79861 atcgctgaga ttcacttgtt cggcacccag gagtatcgct gggtgctcga cgctgatatc 79921 aaggcgtgct ttgaccgcat cgaccacgcg gacctgatgg accgggtgcg tcaccggatc 79981 aaagacaagc gggtgttgcg gctggtgaac tggcagcgca ttcggcatcg ctggaattgg 80041 accgacgtcc gccgctggct caccgacccc accgggcggt ggcaccccat cagcgcggac 80101 gggatcaccc tgtttaaccc cgccgcggtg cccattcggc gataccgcta tcggggcaac 80161 acgatcccca ctccctggac tcaggctgtc tgaaccaccc catcggcaga ttccgtgaag 80221 agccagatac ggtgaaagtc gcacgtccgg ttcgaagggc ggccacggga aacggacccg 80281 cagcaacgcg ggcaccgcac ccatggtcga cccaactgcc acgcacccgg tgaccggtgc 80341 gaagtccacc atatcgacca gtgggcaacc ggcggctcaa ccgatatcga caaactcacc 80401 ttcacctgca cacccaacca caagctagtc gggaaaggct ggcagacaag gaaacggtcc 80461 gacggccaaa cggaatggat cccgccaccc cacctcgacc gcggtgccca caccaacgac 80521 taccaccacc ccgaacgcct cttcgaccac tagcgggccg cgccctgacc acaaaacgtc 80581 aagaccaggc cccacaagtg cgccacgttg gtagcctctg ggaatgctct tcgcggccct 80641 gcgtgacatg caatggagaa agcgccgcct ggtcatcacg atcatcagca ccgggctgat 80701 cttcgggatg acgcttgttt tgaccggact cgcgaacggc ttccgggtgg aggcccggca 80761 caccgtcgat tccatgggtg tcgatgtatt cgtcgtcaga tccggcgctg ctggaccttt 80821 tctgggttca ataccgtttc ccgatgttga cctggcccga gtggccgctg aacccggtgt 80881 catggccgcg gccccgttgg gcagcgtggg gacgatcatg aaagaaggca cgtcgacgcg 80941 aaacgtcacg gtcttcggcg cgcccgagca cggacctggc atgccacggg tctcagaggg 81001 tcggtcaccg tcgaaaccgg acgaagtcgc ggcatcgagc acgatgggcc gacacctcgg 81061 tgacactgtc gaggtcggcg cgcgcagatt gcgggtcgtt ggcattgtgc cgaattccac 81121 cgcgctggcc aagatcccca atgtcttcct cacgaccgag ggcttacaga aattggcgta 81181 caacgggcag ccgaatatca cgtccatcgg gatcataggt atgccccgac agctgccgga 81241 gggttaccag actttcgatc gggtgggcgc tgtcaatgat ttggtgcgcc cattgaaggt 81301 cgcagtgaat tcgatctcga tcgtggctgt tttgctgtgg attgtggcgg tgctgatcgt 81361 cggctcggtg gtgtaccttt cggctcttga gcggctacgt gacttcgcgg tgttcaaggc 81421 gattggcacg ccaacgcgct cgattatggc cgggctcgca ttacaggcgc tggtcattgc 81481 gttgcttgcg gcggtggtgg gcgtcgtcct ggcgcaggtg ttggcaccac tgtttccgat 81541 gattgtcgcg gtacccgtcg gtgcttacct ggcgctaccg gtggccgcga tcgtcatcgg 81601 tctgttcgct agtgttgccg gattgaagcg cgtggtgacg gtcgatcccg cgcaggcgtt 81661 cggaggtccc tagcggtggg cgatctcagc attcagaacc tcgtcgttga gtactacagc 81721 ggtggatacg cgcttaggcc gatcaacggt ttgaacctcg acgtggcagc cgggtcgttg 81781 gtgatgctgc tcggacccag cggctgcggc aagacgacac tgctttcctg tctgggcggc 81841 attctgcgcc cgaagtctgg ggcgatcaag ttcgacgaag tcgacatcac gacgctacaa 81901 ggcgccgagc tggcgaacta ccggcgtaac aaggtcggca tcgtgttcca ggcgttcaat 81961 ctggtgccca gcctgaccgc tgtcgagaac gtgatggtgc cgttacgctc ggccgggatg 82021 tcacgcaggg cgtcgcgtag gcgtgccgaa gaactgctgg cgcgcgtcaa tctcgcggaa 82081 cgaatgaatc atcgacccgg tgatctgagc ggaggtcagc agcaacgagt cgcggtggca 82141 cgcgcgattg cgctggatcc gccactgatc ctcgctgacg aaccgaccgc acacctggat 82201 ttcatccagg tggaggaggt gctgcggttg atccgcgaac tggccgatgg cgagcgtgtg 82261 gtcgtggtcg caacccacga cagcaggatg ttgccgatgg ccgatcgcgt cgttgagctg 82321 acacccgatt tcgcggagac aaatcggcca cctgaaaccg tacatcttca ggccggcgag 82381 gtgctgttcg agcagagcac gatgggcgac ctgatctacg tggtgtcgga gggcgagttt 82441 gagattgtgc acgaattggc cgacggcggt gaggaattgg tcaaggttgc cgggccgggg 82501 gattacttcg gcgagatagg cgtgctgttt cacctgccgc gctcggcgac cgtgcgtgcc 82561 cgcagcgacg cgacggccgt cggctatacc gtgcaggcgt ttcgtgagcg gctcggcgtg 82621 gggggtctgc gcgatctgat cgagcatcgt gcgcttgcca acgactaacc cggcttggcc 82681 ggaactagcc actgccgggg cagcggtggc ggttcacacc gcgtgcgcgt ttggaggtcc 82741 ctgagcgatg ggcgatctga gcattagcca ggtgtcggcg cgtccgggac ggatcgggat 82801 tcgcgctagg caaatgttcg acggataccg gtttcagcgt ggtcccgtgc tggtcgtggt 82861 cgaggatggt cggatcagcg cggtcgattt tgctggctcc gcctgccccg atatgaacct 82921 ggttgatctg ggtgaatcga ctttgttgcc gggtctggtg gatgcgcatg cgcatttgtg 82981 ctgggacccc gacggtaggc cagaggattt ggccggcgac ccccatgcgg tgctggtggg 83041 acgggcgcga cggcacgccg cggccgcgtt gcgctccggg atcaccacga ttcgcgatct 83101 cggcgaccgt gactatgcgg ccttggcgct gcgggaggag tatcggcaga aaacgacggt 83161 ggggccggaa ctggtggttt ctgggccacc attgactcgc agcggcgggc attgctggtt 83221 cctcggcggc gtggccgata gcgtcgagga gctggttgat gcggtgcagg agcgggccgc 83281 gcggggagcg gattggatca aggtgatggc cacgggcgga ttcgttacca cagcatccga 83341 tccgtggcag ccgcagtacg gcagcggcca actggccgcg gtggtggcgg ccgccgagca 83401 ggtaggtcta ccggtgaccg cacatgcaca tgccaccgca gggatcgccg cggcggtcgc 83461 cgcgggtgtt gacggcatcg agcactgcac gttcttgagc gaaggcagcg ccgccgccag 83521 cccggatgtt gttgaagcga ttgttgccca aggtgtgtgg tgcggtatga cgattccccg 83581 ggtgtatccg gagatgccgg agaaccttgt cgcggttgtg caggatggat ggcgaaacat 83641 ccgccggctc atcgacgccg gtgcgcgtgt cgccctgtcc accgacgctg gagtcgcccc 83701 gggcagacgc catgacgtgc tccccgacga tttggtgtat ctgtctcgac acgggttcac 83761 cagcacagag gtgctgaccg gcgccaccgc agcggccgct gccagctgtg ggctcggcca 83821 ccgcaagggt cgcatcgcgc cgggctacga cgctgatctg ctggctgttg cggcaggtgt 83881 ggaccatgac cccgccggac tctgcgacgt caaagccgtc tggcgcagcg gaacccaggt 83941 accgctacaa gcatccgctg tgggctacaa caccccgtca taaccccgtc ataaaatgca 84001 ggacagcatc ttcaatctgt tgaccgagga acagcttcgg ggtcgcaaca cgctcaagtg 84061 gaactatttc gggcccgatg tagtgccact gtggctggcg gagatggact ttcccaccgc 84121 accggctgtg ctcgacgggg tgcgggcgtg cgtcgacaac gaggagttcg gctacccgcc 84181 gttgggcgag gacagcctgc cgagggcgac ggccgattgg tgccgacaac gctacggttg 84241 gtgcccccga ccggactggg tccgcgtcgt gccggatgtc ctgaagggga tggaagtcgt 84301 cgtcgaattc cttacccggc cggagagtcc ggtcgcgttg ccggttccgg cttacatgcc 84361 gtttttcgac gtcctgcacg tcaccggccg ccaacgagtg gaagtcccaa tggtgcagca 84421 agactcggga cgctacctgc tggacctgga cgctctgcag gccgcgttcg tccgcggtgc 84481 cggatcggtg attatctgca atccgaataa cccactgggt acggcgttca ccgaagccga 84541 gctacgtgcg attgtggata tcgcggcccg ccacggcgcc cgggtgatcg cggatgagat 84601 ctgggcaccg gtggtctacg gatcgcgcca tgtcgccgcc gcttcggtgt cggaggcggc 84661 ggctgaagtc gtggtcacgt tggtgtcggc gtccaaaggc tggaacttgc cgggtctgat 84721 gtgcgctcag gtgatcctgt ctaaccgccg tgacgcccac gactgggacc ggatcaacat 84781 gttgcaccgc atgggcgcat caacggtcgg tatccgcgcg aacatcgccg cctaccatca 84841 tggcgaatct tggttggacg agctgctccc ttatctgcgg gcgaaccgtg atcatctggc 84901 acgggcgctg ccggagttag ctcccggggt agaggtcaac gctccggacg gtacctacct 84961 gtcgtgggtg gatttccgtg cgctggctct gccgtctgaa ccggcggaat acctgctctc 85021 gaaggcgaag gtggcgctgt cgcctggcat tccgttcggc gccgcggtgg gctcgggatt 85081 tgcgcggctg aacttcgcca ccacccgcgc aatactggat cgggcgatcg aggctatcgc 85141 ggccgccctg cgcgacatca tcgattaagc caaccagtag attcacaacg ctgcggcgtg 85201 ttgggtcagg ctgaagaaga tgtaggcgag gcagatcagg aagttcagtg ccacgagaac 85261 caaacccaga cagattagtg aatgcgtggc tcggcgttgt aggcggtgga atttcgcgac 85321 gcgcttctca tggttcagct gggtcacgat cagtgcgaac ttgacgtcgg tccattcttc 85381 gtcggcggcg ggagcgccca acagcatttc ctgaaggcgc ttcggcgggg cttcgacgcc 85441 gacccgcgcg aagtgctggc tcagccgccg ggcttgcctg cggccaagaa tctgaccccg 85501 caccggtggc tgatgcgaga gcttccttcg ttcgtccccc cagtggttgg acggggtcgt 85561 cacagcgggc attctaagtc ccgcgggcca caaaaggcag tgccgcggaa cttcttggcc 85621 caaacgggca cccggctacg tgcgcaccgc gaccgtcgac aactggtcgg cgagccggtc 85681 cggggaatcc accatcgaga acgtccgtgc tccctcgatt acctcgaaac gggcgcgcgg 85741 gatggtcgcg gcgagccgtt gaccgttctc gagtgcgaag aacacgtcat ccgccgacca 85801 cgcgatgagc gccggcttgt cgaattcagg cagccgggcg gcgactgcgg tggtgacttc 85861 ggtgcgcagc gatagcgaga gctgacgcag gtcttcggcg atggccgggt tggatagcgc 85921 cggacgaacc caggcccggg tgagatggtc gatgttgtgg tgcgacaaac cggcatacgc 85981 gcggttacgc gcggccggtg cccgcatcac ctggatcgcg gcccggaaca gggtggccga 86041 tttcgcggcc aggatcaccg gtttgaggat cggcggcgga aagtgttcga acgcatcgca 86101 actagtgagg accagggcac cgagccgttc gggatagtgg accgcgacga gctgggtgac 86161 gaccccgccg gtgtcgttgc cgaccagcac cacgtccttg agctcgagcg cggcaaggac 86221 gtcggcgacg atgccggcaa ccccgccgat ggtctggtcg gcgccggggc gtagcggctt 86281 aggatgcgca cccagcggcc aggtgggggc gatgcagcgc aggccacgac cggcgagtcg 86341 ctcactgacc cgtcgccata gttgaccgcc catcatgtac ccgtgcacga acacgacagg 86401 cctgccagtt tcgggtccgg ttgcttcgta atgaatagtt ccggcactaa tgtcgatcgt 86461 cgacatggat gcccaccctt cgaggtacat ttacaagcag actgccggta acttaccaac 86521 agattgtatg gaaatcaaga gacgcaccca ggaggaacgc tccgcggcga cccgcgaggc 86581 gctgatcacc ggggcccgca agctgtgggg gttacggggt tatgcggagg tggggacgcc 86641 ggaaatcgcg accgaggcgg gggtcacgcg gggggcgatg taccaccaat tcgccgataa 86701 agcagcacta ttccgcgatg tggtggaggt cgtggagcaa gacgtgatgg cccggatggc 86761 caccttggtc gccgcctcgg gggcggcgac gccggccgat gcaatccggg cagcggtcga 86821 tgcctggctc gaggtatctg gtgatccgga ggtgcgtcag ctgatcctgc tggatgcgcc 86881 cgtcgtgctg ggctgggcgg gtttccgcga cgtcgcccag cgatacagcc tgggcatgac 86941 cgaacagttg atcaccgagg cgatccgggc cggccagttg gctcgtcaac cggtgcggcc 87001 gctggcccag gtgctcattg gcgcgctcga cgaggcggcg atgttcatcg ccaccgccga 87061 cgaccccaag cgcgcccgtc gggagaccag acaggtgctg cgccggctca tcgacgggat 87121 gcttaacggc tagcgctggg cgcggcctcg gcaaaatggc ttgcggaccg ggatctgagt 87181 tccagaactg ggcgcaggac tggctggtca ccacttggcg gcgaggcgtg tccattccgc 87241 tgccaggtcg cggtcccggt ggaagccgcg cagggtaatc agctcgatag ctttgcgcgc 87301 atcttggata tcttgaggcg atgcggcgtc cacgagcgca cgtagatcac tgcgatcctg 87361 gggtcgccga tcatcatctc tcgcaagaag tttcatcgcg atcagatgcg ccgttgtggc 87421 caccggagcg actagatcgg gcaagatctc gatctcctcg gcagcctccg caatctccgg 87481 ttcgatgcca cagctcgcga aaaggaggtc caccacaaca ttcgcggcag tgtctgcggt 87541 tgctccgaga cggaccgctg ccaaccgtct ggccgcgtcc tgctctaccg acgccaggag 87601 atggtactgc tgggtaagaa gttgacggac taaagattcc gcggcatcgt cgtttgccac 87661 cgcgacaaca atgtccacgt cacgggtgaa acgtggttcg gatcgcgcag acaccgcgaa 87721 accaccaacc agcgcccacc gctgacgcaa tccggtcagg tccttggcga ccctacggag 87781 tgtcgactcc acagcgttca tgtgaaccgt gtggacgtcg ggcctgcgct gtcaccctcc 87841 tccgccccgg gacgcgtcat cctccacgcg tcgatagctg cttcaatttc aacaacgtcc 87901 gcattgggcc gttcacgacc cagcctcatg cgctgcatct gctcgccaac ctcgtacatg 87961 tccagagcga gcctcagctt ctgcgcagcg acggaaactg ccacactcaa agcctactgg 88021 gcgcacgtgt ggcaacgagt cgatccacac gaaatgccgc cgttgggccg cggactagcc 88081 gaattttccg ggtggtgaca cagcccacat ttggcatggg actttcggcc ctgtccgcgt 88141 ccgtgtcggc cagacaagct ttgggcattg gccacaatcg ggccacaatc gaaagccgag 88201 caggtggaac cgaaacgcag tcgcctcgtc gtatgtgcac ccgagccatc gcacgcgcgg 88261 gaattcccgg atgtcgccgt attctccggc ggccgggcta acgcatccca ggccgaacgg 88321 ttggctcgtg ccgtgggtcg cgtgttggcc gatcggggcg tcaccggggg tgctcgggtg 88381 cggctgacca tggcgaactg cgccgatggg ccgacgctgg tgcagataaa cctgcaggta 88441 ggtgacaccc cattaagggc gcaggccgcc accgcgggca tcgatgatct gcgacccgca 88501 ctgatcagac tggatcgaca gatcgtgcgg gcgtcggcac agtggtgccc ccggccttgg 88561 ccggatcggc cccgccggcg attgaccacg ccggccgagg cgctagtcac ccgccgcaaa 88621 ccggtcgtgc taaggcgcgc aaccccgttg caggcgattg ccgctatgga cgccatggac 88681 tacgacgtgc atttgttcac cgacgccgag acgggggagg acgctgtggt ctatcgggct 88741 ggaccgtcgg ggctgcggct ggcccgccag caccacgtat ttcccccagg atggtcacgt 88801 tgtcgcgccc cagccgggcc gccggtgccg ctgattgtga attcgcgtcc gacaccggtt 88861 ctcacggagg ccgccgcggt ggaccgggcg cgcgaacatg gactgccatt cctgtttttc 88921 accgaccagg ccaccggccg cggccagctg ctctactccc gctacgacgg caacctcggg 88981 ttgatcaccc cgaccggtga cggcgttgcc gacggtctgg catgagcccg ggctcgcggc 89041 gcgccagccc gcaaagcgcc cgggaggtgg tcgagctcga ccgtgacgag gcgatgcggt 89101 tgctggccag cgttgaccat gggcgtgtgg tgttcacccg cgcggcgctg ccggcgatcc 89161 gtccagtcaa tcacctcgtg gtcgacggtc gggtgatcgg gcgcacccgc ctgacggcca 89221 aggtgtccgt tgcggtgcga tcgagcgccg atgccggtgt cgtggtcgcc tacgaagccg 89281 acgaccttga tccgcggcgt cggacggggt ggagtgtggt ggtgacggga ctggcgaccg 89341 aggtcagcga tcccgagcag gttgcccgct accagcggct gctacacccg tgggtgaaca 89401 tggcgatgga caccgtggtc gcgatcgaac ccgagatcgt caccggcatc cgcatcgttg 89461 ctgactcgcg tacgccgtag ccgattggcc gcgggcggcc cgcacgcatc cgcactatct 89521 gataaattct tcaactcgtc aaccgatgta acgctgaagc tctcaggaga cgcggtggag 89581 tccgaaccgc tgtacaagct caaggcggag ttcttcaaaa cccttgcgca tccggcgcgg 89641 atcaggattt tggagctgct ggtcgagcgg gaccgttcgg tcggtgagtt gctgtcctcg 89701 gacgtcggcc tggagtcgtc gaacctgtcc cagcagctgg gtgtgctacg ccgggcgggt 89761 gttgtcgcgg cacgtcgtga cggcaacgcg atgatctatt cgattgccgc acccgatatc 89821 gccgagctgc tggcggtggc acgcaaggtg ctggccaggg tgctcagcga ccgggtggcg 89881 gtgctagagg acctccgcgc cggcggctcg gccacgtaac gccatgggtt gggttgccaa 89941 gattttccgt gttggccggg tggtcgagcc cgcggccccc ttaccggcgg cgatagccga 90001 accacccgcc ggggtacggg gttcgctgca gatccgacat gttgacgcgg gttcgtgcaa 90061 cgggtgtgag gtggagattt cgggcgcctt tggcccggtg tatgacgcgg agcggttcgg 90121 ggcgcggctg gtcgcctcgc cccaacacgc cgatgcgttg ttggtgaccg gcgtggtcac 90181 gcacaacatg gccggcccac tgcgcaagac cctggaggcc acgccgcgcc cgcgggtggt 90241 aatcgcgtgc ggggattgcg cgctgaaccg gggggtgttc gccgacgcct acggcgtggt 90301 cggtgcggtc ggcgaggtgg tacccgtcga cgtcgagatc gccggctgcc cgccgacacc 90361 cgcggccatc atggcggcgc tgcgatcggt gaccgggaaa tgaccgctgc accgacggcc 90421 ggcggggtcg tcacttcggg cgtgggcgtt gccggggtcg gcgtggggtt gctgggcatg 90481 tttggaccgg tgcgtgtagt gcacgtcggt tggctgcttc cgctgtccgg cgtgcacatc 90541 gagctcgacc ggttgggcgg attcttcatg gcgctcacgg gcgcggtagc ggctccggtc 90601 ggttgttacc tgatcggcta cgtgcgccgt gaacacctcg gtcgggtccc gatggcggtg 90661 gtgccgctgt tcgtcgcggc gatgctgttg gtgccggccg cgggctcggt gacgacgttt 90721 ctgctggcgt gggagctgat ggcgatcgcg tcgctgatcc tggtgctctc cgagcacgcc 90781 cgcccgcagg tccgctcggc gggcctgtgg tacgccgtga tgactcagct gggattcatc 90841 gcaatcctgg tcgggctggt ggtgttggcg gcggccgggg gttccgaccg gttcgccggc 90901 ctcggggcag tctgcgacgg ggtccgcgcc gccgtattta tgctcacgct ggtcgggttt 90961 ggttcgaagg cgggcctggt gccactgcac gcctggctgc cgcgggccca cccggaggcg 91021 ccgagcccgg tgtcggcgtt gatgagcgcg gcgatggtca acctgggcat ttacggcatc 91081 gtccgtttcg atctgcagct gctggggccg ggcccacgct ggtgggggct tgcgctgctg 91141 gccgtgggcg gcacgtccgc gctgtatggg gtgctgcagg cttcggtggc cgccgatctc 91201 aaacggctgc tggcctattc gacgaccgag aacatgggcc tgatcacgct ggcgctcggt 91261 gcggcaacac ttttcgcgga taccggagcc tacgggccgg cgtcgatcgc cgccgccgca 91321 gcgatgctgc acatgattgc gcacgcggcg tttaagagcc tcgccttcat ggcggccgga 91381 tctgtgctgg ccgcgaccgg gctgcgcgac ctggacctgc tcggcgggct ggcccgccga 91441 atgccggcga ccaccgtctt tttcggggtg gccgcactgg gcgcatgtgg tctgccgttg 91501 ggcgccgggt ttgtcagtga gtggctgctg gtccagtcgt tgatccacgc tgcccccgga 91561 cacgacccca tcgtggcgct gacgacaccg ctggcggtcg gcgtggtcgc actggccacc 91621 ggtctgagcg tggcggcgat gaccaaggcc ttcgggatcg ggtttctcgc ccgtccccgc 91681 tccacccaag ccgaagcggc gcgtgaggcg ccggccagca tgcgcgccgg catggcgatc 91741 gcggcgggcg cctgcctggt gctggcggtg gcaccgctgc tggtcgcacc catggtgcgg 91801 cgggccgccg cgacgctgcc ggccgctcag gcggtcaagt tcaccggtct gggcgccgtg 91861 gtgcggctgc ccgcgatgtc cgggtcgatc gcgcccggcg tgatcgccgc cgctgtgctc 91921 gccgcggcgt tggcggtagc cgtcctcgcg cggtggcgtt tccgccggcg cccggcgccg 91981 gccaggttgc cgctgtgggc ttgcggcgcg gccgatctca ccgtgcgcat gcaatacacg 92041 gccacgtcgt tcgccgagcc gctgcagcgg gtcttcggcg acgtgctgcg cccggacacc 92101 gacatcgagg tcacccacac cgccgagtcg cgctatatgg ccgagcggat cacctaccgg 92161 accgcggtcg ccgacgcgat cgaacagcgc ctctatactc cggtggtcgg ggcggtggcc 92221 gccatggccg agctgctgcg ccgtgcccac accggcagcg tgcaccgcta cctggcctac 92281 ggcgcgctgg gcgtactgat cgtgctggtg gtcgcgaggt gaacgtgatg tcctacctag 92341 cgggcgccgc gcaaatcggc ggggtcatgg tgggtgcgcc gctggtcatc ggtatgacgc 92401 ggcaggtacg ggcacgctgg gaaggccggg ccggcgccgg cctgctgcaa ccgtggcgtg 92461 atctgctcaa acagcttggc aagcaacaga tcacaccggc ggggacgacg atcgtgttcg 92521 ccgccgcgcc ggtgatcgtc gccgggacaa cgcttttgat cgccgcgatc gcacctctgg 92581 tggccaccgg gtcacccctg gaccccagcg ccgacttgtt tgccgtggtc gggctgctat 92641 tcctgggcac cgtcgcactg accctggccg gcatcgacac cggcacctct ttcggcggca 92701 tgggcgccag ccgcgagatc accatcgccg cactggtcga accaacgatc ctgctggcgg 92761 tgttcgcgct gtccatcccc gccggatcgg ccaatctcgg tgcgctggtg gcgagtacga 92821 tcgaccaccc gggccacgtg gtgtcgctgg ccggcgtact ggccttcgtg gcgttggtga 92881 ttgtcatcgt cgccgagacc gggcggctgc cggtggacaa cccggccacc cacctggaat 92941 tgacgatggt gcacgaggcc atggtcctcg agtacgccgg cccacggctg gcgctggtcg 93001 aatgggcggc cgggatgcgg ctcacggtgc tgctggcact gctggcgaat ctgttcctgc 93061 cgtgggggat cgccggcgcc gcgcccaccg cgctcgacgt gttgaccggc gtggtggcgg 93121 tggcggccaa ggtcgcgatt ctcgcggtgc tgctggcgac gttcgaggtg ttcctcgcca 93181 aactgcgatt gttccgggta cccgaactgc tggccggctc gtttctgctg gccttgctcg 93241 cggtcaccgc cgccaacttc ttcacggtgg gggcgtgagg ggccagcgat gagtaacgcc 93301 aacttctcga tcctggtcga cttcgccgcg ggtgggctgg tgttggcgtc ggtgctgatt 93361 gtctggcgcc gcgacctgcg ggccattgtg cggctgctgg cctggcaggg tgctgcgctg 93421 gccgcgatcc cgctactgcg cggcatccgc gacaacgacc gtgcgctgat cgcggtgggc 93481 atcgccgtgt tggcgctgcg cgcgctggtg ttgccctggc tgctggcccg cgcggtgggc 93541 gccgaagcgg ccgcgcagcg ggaggccacc ccgttggtca acaccgccag ctcgctgctg 93601 attaccgccg gactgaccct caccgcgttc gcgatcaccc agccggtggt caacctggaa 93661 ccgggcgtca ccatcaacgc ggtgccggcc gcgttcgcgg tggtgctgat cgcgctgttc 93721 gtgatgacca cgcggctgca cgcggtctcg caggccgccg gattcctgat gctagacaac 93781 gggatcgcgg cgaccgcatt cctgctcacc gccggggtgc cgctgatcgt cgaacttggt 93841 gcctcgctgg acgtgctgtt cgcggtcatc gtgatcggcg tgttgaccgg ccggctgcgc 93901 cgcattttcg gcgatgccga cctggacaag ctgcgggagt tgcgggattg atgaccggtt 93961 tgctgcttgc cgcgatcctc gcaccgctcg ccgcgtcaat cgcctccttg atcaccgggt 94021 ggcgacgcac gacggcgacg ctcaccgcgc tgtccgccac gacggtgctg gcctgcgctg 94081 tggcgatggg gttttggatg gggtcggggg cgcagttcgg gctgggcggt ctgctgcgcg 94141 ccgatgcgct gacggtggtc atgctcgtcg tcatcgggat cgtcggcaca ctggccaccg 94201 cggcgagcat cggctacatc gacaccgagc tggcacacgg gcatatcgac ggacgtagcg 94261 ctcggctgta tggggtgctg accccggcgt ttctttgcgc gatggttctg gcggtgtgcg 94321 ccaacaacat cggcgtcatt tgggtagcga tcgaggccac cacggtgatc accgcgtttc 94381 tggtggggca tcgccgcacc cgcaccgcgc tggaagcgac ctggaaatac gtggtgatct 94441 gttcggtcgg gatcgccgtc gccttcttgg gtaccgtgct gctgtatttc gccgcgcggg 94501 attccggtgc cgctgctgcc ggcgcgctga acctcgatat cctggccgaa cacgccgccg 94561 gcctagaccc cggggtcgct cgactggccg gcgggttgct gctcatcggt tatggcgcca 94621 aggcgggcct cttcccgttt cacacctggc tggcggacgc gcacagccaa gcccccgcac 94681 cggtgtccgc actgatgagc ggcgtgctgc tggcggtggc gttctcggtg ctgatccgat 94741 tgcggccgat cctcgacgcg gtcagcgggc ccgcctacct gcgcaacggg ctgctcgtgg 94801 tcgggttggc gacgctgctg gtggcggtgc tgatgctgac cgtgaccggc gacgtcaagc 94861 ggatgctggc ctactcgtcg atggagcaca tgggcctgat cgcgatcgcc gcggccgccg 94921 gcacgacatt ggcgatcgcc gcgctgctgc tgcacgtgct cgcccacggg atcggcaaga 94981 ccgtgctgtt tctggcgggc ggtcagctgc aggccgcaca cgactccacc gccatcgccg 95041 atatcaccgg cgtgatgcga cggtcgcggc tgatcggcgt gtcgtttgcc gtcggcctga 95101 tcgtcctgct tggcttgccg ccgttcgcga tgttcgccag cgagctggcg atcgcgcgct 95161 cattggccaa cgagcggctg gcctgggtgc tgggtgcggc gctgctgctg atcgccatcg 95221 gtttcacggc tctggcacgc aattccggac gcatgctgct cggcaccccg gcggcgggcg 95281 cgccggcgat caccgtgccg gccaccgcgg cggcggcgtt gatggtgggc atcgtcgtct 95341 cggcggccct cggcatcacc gcgggcccac tcgccgacct gcttggcatc gccgccagca 95401 acgtgggtct accgtgatga gtgccagctg gctgcgccac cgggtatccg agcgtggact 95461 gatagcgacg gccgaacaac tctgggccga ttcgtttcgc ctggccctgg tcgctgccca 95521 cgacgacggc gacagtctgc gtgtcgtgta ccttttcttg gcgggctatc cagatcgccg 95581 cgtcgagttg gaatacgttg tgccggcgga taatccagag atcagatcgt tggcgtacct 95641 gtcctttccg gctggccggt tcgagcgcga aatggcggac ctgtacggaa ttcgcccggt 95701 cggccatccc aaaccccgcc gactggtacg gcacgcgcat tggcccgact ggcatcccat 95761 gcgcaccgac gccgggcccg cgcccgaatt cactgatacg ggggccttcc cgttcctcgc 95821 cgtcgaagga cccggcgtgt acgagattcc ggtcgggccg gtgcacgccg gcctcatcga 95881 acccggtcac ttccggtttt ctgtcgcggg cgagacgatc gtgcggctga aggcgcggct 95941 gtggtttgtg caccgtggca tcgagaaact cttccacggc cgccccgcca cggccgcggt 96001 cgatctcgcc gaacgcatca gcggcgacac gtcggcagcg cacgcgctcg cgcacagcct 96061 ggcgatcgaa gacgctctcg gcatcgagct gccccacgag gtccaccggc tgcgggccct 96121 gatcgtcgaa ctcgaacggc tctacaacca cgccgccgac ctgggtgcct tggccaacga 96181 cgtcggctac tcgctggcca acgctcacgc ccaacgcatc cgcgaaaatc tgttgcggcg 96241 caatgccgca gtcaccggtc accggctact gcgcggcgcc atccgcgcgg gcggggttgc 96301 gctgcgtgcg ctgcccgata ccgacgagct tgcagcgctc gccgtcgatc tcgccgaggt 96361 cgccaccctg acgctggcca actcggtggt ctacgaccgc ttcgccggca ccgccgtgct 96421 gcaccccgac gacgccagcg ccctgggctg cctgggctat gttgcccgcg ccagcggact 96481 gcgcagcgac gcccgggtcg aacaccccac catagtgctg cccatcaccg agatcggcgc 96541 gcctgacggc gacgtcttgg ctcgctacac cgtgcggcgc gacgaattcg ccgcgtctgc 96601 cgctcttgct caacacattg tcgaatcaca caccggtcca atagaatacg ccgctacact 96661 gcacccggtg ggcgcgccca gcagcggtat cggcatcgtc gaaggctggc gcggcactat 96721 cgtgcaccgc gtcgaaattg acgtcgatgg ccgcatcacc cgggcgaaag tcgtcgatcc 96781 gtcctggttc aactggcccg cactgccggt ggcgatggcc gacaccatcg tccccgactt 96841 cccgttggcc aacaaaagct tcaaccagtc ctacgcgggc aacgacctct aaccgtgagc 96901 gcgcccagtt gtacggccct agcggcgtgt cggtgtacaa acacgcaccc tcgcgggttc 96961 ggttgcgcca aactagaagt accgtggtca agggacgttc ggggagcctg tcgtggcgtc 97021 gagtgcgcac cggtgacctc ggtctggctg tttggggtgg acgcgaggag taccgggcgg 97081 tcaaaccggg cacaccaggg atacaaccga agggagacat gatgactgtg accgttgtcg 97141 atgctggacc cggccgggtg agccgttcgg tggaggtggc cgcgccggcg gccgagttgt 97201 tcgccatcgt tgctgatccc cggcgccacc gcgaactgga cggatcgggc acggttcgcg 97261 gcaacatcaa ggtaccggcg aaattagttg tcgggtcgaa gttttcgacg aagatgaagt 97321 tgttcggcct accgtatcgc atcaccagca gggtgaccgc gctcaaaccg aacgaattgg 97381 tcgagtggag ccacccgtta ggccatcggt ggagatggga attcgaatcg ctgtcaccga 97441 cactgacccg cgtcaccgag acattcgact accacgccgc cggtgcgatc aagaacggcc 97501 tgaagttcta cgagatgacg ggtttcgcga agtccaatgc ggcgggaatc gaggccacgt 97561 tggccaagct gagcgatcag tacgcccgcg gtagggcatg acgccatggg ggcgtgtcgg 97621 tgtaccgaca cgctcgctca cgggttcggt tgcaccaaga aaagatgtac cagatcacct 97681 gcctgaatag gatttttggc ccgacgtagc ttcgggctag cgcgagcgac gactccgccg 97741 tcgagcagga tgtcaccgtg gatcaaccgt ggaacgccaa catccactac gacgctctgc 97801 tggatgccat ggtgccgctc ggtacccagt gcgtgctcga cgtcgggtgc ggcgacgggt 97861 tgctggctgc ccggctggct cggcgcatac cctacgtcac ggcagtggac atcgatgcgc 97921 ccgtcctgcg acgtgcgcag acacggttcg ccaacgcgcc gatccgctgg ctgcatgccg 97981 acatcatgac ggctgagctg cccaacgcgg gcttcgacgc cgtggtctcc aatgccgccc 98041 tgcaccacat cgaggacact cggacggcgc tgagccggct cggcgggctg gtaactcccg 98101 gtgggacgct ggccgtggtc accttcgtga cgccctcgct gcgaaacggc ttatggcact 98161 tgacaagctg ggttgcctgc ggcatggcca atcgcgtcaa gggcaagtgg gaacattccg 98221 ctccgatcaa gtggccgccc ccgcagacgt tgcatgagct acgcagccac gttcgcgccc 98281 tgctgcccgg ggcgtgtatc cgtcggctgc tgtacggccg ggtgctcgtt acgtggcgcg 98341 cacccgtcta atcgggagaa cccaatggcg gcggccgata tgaccaagtg cgcgttagct 98401 tgcgagattg gctgcccgca tccaatgatc ggcggatacg ggtcgcaaac cacctcagac 98461 cggcagctaa ggagcgcaag tggccaagaa ccaaaaccgc atccgcaacc ggtgggagtt 98521 gatcacctgt ggtctcgggg gacacgtcac ctacgcgccg gacgacgcgg cacttgctgc 98581 gcggctgcgc gccagcaccg ggctgggcga agtatggcgc tgcttgcgct gcggcgattt 98641 cgcgctcggt gggccgcagg ggcgtggtgc tcccgaggat gcgccgttga ttatgcgcgg 98701 caaggcgtta cgtcaggcca tcatcattcg cgcgctcggg gtcgaacggc tagtccgggc 98761 gttggtgttg gcgctggccg cgtgggcggt gtgggagttt cgcggtgcgc ggggagctat 98821 ccaggcgacc ctggataggg acttgccggt cctgcgtgcg gccggattca aggtcgatca 98881 aatgacggtg atccacgctc tggagaaagc gttggccgcc aaaccgtcga cgttggccct 98941 gatcacgggc atgctggcgg catacgcagt gctgcaggcc gtcgaggggg tcggtttgtg 99001 gctgctgaag cgctggggcg agtacttcgc ggtggtggcc acctcaattt tcctgccgtt 99061 ggaggttcac gacctggcca agggcatcac gacgactcgg gtcgtgacct tcagcatcaa 99121 tgtcgccgcc gttgtctacc tgctgatttc taagcggttg ttcggtgtgc gcggcgggcg 99181 caaggcttat gacgtcgaac ggcgcggcga gcagctgctc gacctcgagc gcgccgcgat 99241 gctcacctga ccagccaaaa tcccacctgt gcggggcctg cgggttgtgt caaaggtcac 99301 cagcgccttt ttcgcactgt ttactccggc gcggcgtgcc cgtaaagccg cccgggtgaa 99361 cttggatcag gtggcgcaat gtcgccggac cgacgaagga ccgacgctgt gtcaacactg 99421 ccaacctggg tcagccagag ctctaccgac cgcggcgtgg tcgcgccaat cacagcgcgt 99481 gcccgcgacg cactgcaggc cgtgctgcgc gccaggcgcc gcggccagcg ctctgacttg 99541 cgccttatgc gcagaggcgt ggagcgttgt tgaggtcagg cccgcgccga gggccgcgac 99601 tttctcgcta caatcgcgcg cggcgcggga gagccgctag ccgccggtga ccggcgattg 99661 gagattgagt tgcgaccgaa cggatggcgg tgacggtcgg cgtcatttgt gcgatcccgc 99721 aagagctggc gtatctgcgc ggtgtcctgg tcgatgcgaa acgccagcag gtcgcgcaga 99781 tcctcttcga tagcggccaa ctcgacgcgc accgggtcgt gttggccgcc gccggcatgg 99841 gcaaagttaa cacgggcctg accgcaacgc tgcttgccga tcgattcggc tgccgcacca 99901 tcgttttcac gggagtggcc ggcgggctgg atcccgagct atgcatcggt gacatcgtca 99961 tcgccgatcg ggtcgtccaa cacgacttcg gtctgctcac cgatgagcgg ctgcgcccct 100021 atcagcccgg acacatcccc ttcatcgaac cgaccgagcg gctcggatac ccggttgatc 100081 ccgcggtcat cgatcgggtc aaacaccgcc tcgacgggtt cacgctggcg ccgctgtcca 100141 ccgccgcggg aggtggtggc cggcagccac gcatctacta cggcaccatc ctgaccggtg 100201 accaatacct tcactgcgag cgcacccgca accggctgca ccacgaactc ggcggtatgg 100261 ccgtcgaaat ggaaggcggt gcggtggcgc aaatctgcgc gtccttcgat atcccatggc 100321 tggtcattcg cgcgctctcc gatctcgccg gagccgattc gggggtggac ttcaatcggt 100381 ttgtcggcga ggtggcggcc agttcggccc gcgttctgct gcgcttgctg ccggtgttga 100441 cggcctgttg aagacgacta tccgccggtg cgttcaccgc gtcaggcggc ttcggtgagg 100501 tgagtaattt ggtcattaac ttggtcatgc cgccgccgat gttgagcgga ggccacaggt 100561 cggccggaag tgaggagcca cgatgacgac ggccgtgacc ggtgaacacc acgcgagtgt 100621 gcagcggata caactcagaa tcagcgggat gtcgtgctct gcgtgcgccc accgtgtgga 100681 atcgaccctc aacaagctgc cgggggttcg ggcagctgtg aacttcggca cccgggtggc 100741 aaccatcgac accagcgagg cggtcgacgc tgccgcgctg tgccaggcgg tccgccgcgc 100801 gggctatcag gccgatctgt gcacggatga cggtcggagc gcgagtgatc cggacgccga 100861 ccacgctcga cagctgctga tccggctagc gatcgccgcc gtgctgtttg tgcccgtggc 100921 cgatctgtcg gtgatgtttg gggtcgtgcc tgccacgcgc ttcaccggct ggcagtgggt 100981 gctaagcgcg ctggcactgc cggtcgtgac ctgggcggcg tggccgtttc accgcgttgc 101041 gatgcgcaac gcccgccacc acgccgcctc catggagacg ctaatctcgg tcggtatcac 101101 ggccgccacg atctggtcgc tgtacaccgt cttcggcaat cactcgccca tcgagcgcag 101161 cggcatatgg caggcgctgc tgggaagcga tgctatttat ttcgaggtcg cggcgggtgt 101221 cacggtgttc gtgctggtgg ggcggtattt cgaggcgcgc gccaagtcgc aggcgggcag 101281 tgcgctgaga gccttggcgg cgctgagcgc caaggaagta gccgtcctgc taccggatgg 101341 gtcggagatg gtcatcccgg ccgacgaact caaagaacag cagcgcttcg tggtgcgtcc 101401 agggcagata gttgccgccg acggcctcgc cgtcgacggg tccgctgcgg tcgacatgag 101461 cgcgatgacc ggcgaggcca aaccgacccg ggtgcgtccg ggggggcagg tcatcggcgg 101521 caccacagtg cttgacggcc ggctgatcgt ggaggcggcc gcggtgggcg ccgacaccca 101581 gttcgccgga atggtccgcc tcgttgagca agcgcaggcg caaaaggccg acgcacagcg 101641 actagccgac cggatctcct cggtgtttgt tcccgctgtg ttggttatcg cggcactaac 101701 cgcagccgga tggctaatcg ccgggggaca acccgaccgt gccgtctcgg ccgcactcgc 101761 cgtgcttgtc atcgcctgcc cgtgtgccct ggggctggcg actccgaccg cgatgatggt 101821 ggcctctggt cgcggtgccc agctcggaat atttctgaag ggctacaaat cgttggaggc 101881 cacccgcgcg gtggacaccg tcgtcttcga caagaccggc accctgacga cgggccggct 101941 gcaggtcagt gcggtgaccg cggcaccggg ctgggaggcc gaccaggtgc tcgccttggc 102001 cgcgaccgtg gaagccgcgt ccgagcactc ggtggcgctc gcgatcgccg cggcaacgac 102061 tcggcgagac gcggtcaccg actttcgcgc catacccggc cgcggcgtca gcggcaccgt 102121 gtccgggcgg gcggtacggg tgggcaaacc gtcatggatc gggtcctcgt cgtgccaccc 102181 caacatgcgc gcggcccggc gccacgccga atcgctgggt gagacggccg tattcgtcga 102241 ggtcgacggc gaaccatgcg gggtcatcgc ggtcgccgac gccgtcaagg actcggcgcg 102301 agacgccgtg gccgccctgg ccgatcgtgg tctgcgcacc atgctgttga ccggtgacaa 102361 tcccgaatcg gcggcggccg tggctactcg cgtcggcatc gacgaggtga tcgccgacat 102421 cctgccggaa ggcaaggtcg atgtcatcga gcagctacgc gaccgcggac atgtcgtcgc 102481 catggtcggt gacggcatca acgacggacc cgcactggcc cgtgccgatc taggcatggc 102541 catcgggcgc ggcacggacg tcgcgatcgg tgccgccgac atcatcttgg tccgcgacca 102601 cctcgacgtt gtaccccttg cgcttgacct ggcaagggcc acgatgcgca ccgtcaaact 102661 caacatggtc tgggcattcg gatacaacat cgccgcgatt cccgtcgccg ctgccggact 102721 gctcaacccc ctggtggccg gtgcggccat ggcgttctca tcgttcttcg tggtctcaaa 102781 cagcttgcgg ttgcgcaaat ttgggcgata cccgctaggc tgcggaaccg tcggtgggcc 102841 acaaatgacc gcgccgtcgt ccgcgtgatg cgttgtcggg caacacgata tcgggctcag 102901 cggcgaccgc atccggtctc ggccgaggac cagaggcgct tcgccacacc atgattgcca 102961 ggaccgcgcc gatcaccacc ggcagatgag tcaaaatccg cgtggtgctg accgcgccgg 103021 acagcgcatc cacaatcaca tagccggtca gtatggcgac gaacgccgtc agaacaccgg 103081 ccaggccggc ggcggcgctc ggccatagcg ccgcgcccac catgatcaca ccgagcgcaa 103141 tcgaccacga cgtggactcg ttgagcaagt gggtgccggc acccgtcggg tgctgatggg 103201 tcaggccgac gtctaggcca aacccctgca cggtgcccag ggcgatctgc gcgatgccca 103261 cgcacagcaa cgcccaacgt cgccaggtca tcggtgaatg ttgccgccgc ggcgcccggc 103321 ggatcccgag gcgcccaaca ggcgggacaa ccgggcggga ctcggcgagc cgacgcagat 103381 caccagcctg gctggccacc tgggtaaacc atgcgcgaca ggcgctgcac tcgcccaggt 103441 gttcatcgac tctcgccgag ggcaccggtg cgcgctcgcc gtcgagtcgt gccgacagcg 103501 cttcgcgcgc gacctcgcag tccatgccat caatagtcgc gcaatgccga cggattgctc 103561 cagcgggctc ggaccacatc gccgcgggca cacccctgca gccttgcaaa acggttgatg 103621 cgtggtggtt aaagctcccg gccgttgtgg cttgtgcgag cacggtggcc cgggtggtgc 103681 gtgagcgccg tggggctcgc gttcaggggt caatcgggtt tgtcgtcgtc gtcttggttg 103741 tggaggaatc gttcggggtg gtggaaggtg ttggtgcggg gttggccgtg gtcgaggtgg 103801 ggtggtggta gccattcggt gtggccgtgg gtgttgttgt gggtggtcca gcctttttcg 103861 gcgagtcggt tgtcggggcc gcaggccagg gtcagctcgg tgatgtcggt gcgtccggtg 103921 ctggtccagg cggtgacgtg gtgggcttgg ctgtggtagg ccggtgcgtc acagccgggt 103981 ttggtgcagc cgcggtcgtt ggcgaacagc atgatccgct gggccgggga ggctaggcgt 104041 ttggtgtgat acagcgccag gggtgtgccg tggtcgaaga tcgcctgggg gtacctcccg 104101 cttgcggggg agtagtggtg ggcgtggctg gtcatgcgga tcacatcggc catgggtagc 104161 agggtgccgc cgccggtgaa gcccttgccg gcgccggttt gcaggtcggt cagggtggtg 104221 gtgaccacga tcgagacggg aagaccgttg tgttggccca gtttcccgga ggcgatcagc 104281 gcgcgcagcc cggccagcag cccgtcgtgg ttgcgttggg cttggctgcg ggtgtcgcgg 104341 tcgatggcgg ccgcatcggg ggtggtgtcg atgaccgggg tgtggtcgtc ggggttggtc 104401 gcgccggggg cggccagttt ggctagcacg gcttcaaagg tggcccgcgc ttggggggtc 104461 aggtagccac ttagccgtga catgccgtcg tattgctggt tgctcagggt gatgccgcgt 104521 ttgcgggcgc gttcggtgtc ggtgaggtcg ccgtcggggt gtagccagtc catgacccgc 104581 tgggcgtagc gggccagctc gtcgggacga tattgagcgg ctttgccggc caggtcggct 104641 tcggcggcct ggcgggtgga cacatccacc gcggcgggca ggtgggcgaa aaagggcgcg 104701 aatcactttg acgtgcgcct cgccgatcag gccctggcgt tgggcggtgg cggtggcggt 104761 caactgcggg gccaacggtt cgccagtgag cgcccgacgt tgccctaagg cttggcttcg 104821 gcgctgcgtc ggccggcttc gggcttggtg atgcgcagcc ggttggccag cgcgcagcac 104881 agcgtgccgc ccagttcttc ctcgctggct tgggtgtcga gttggttgat caacgtgtgc 104941 tgggccgccg gtagccggcg cgccaagcat tccagacgtt ccagagaccg cagccgttcc 105001 ggggtgctca gcacctcaaa ggacacctcg tccaagcggt ccaggtcggc atccagcgcg 105061 tcgaagacct cgacaagctc ctcccggcta ttcgctaaca tgttcgaatc ataacgtcgg 105121 gcactgacaa agagcgcccc gctgataacc gtgaaactga agtgacacaa gggatttacc 105181 cagatcctac gagttgatac gggaaggtac cgcacctttc ctgggcgcga tgggaacttt 105241 ctgcccgtta tggccgacta acaccgcggg tgaagcaaag cgctgcctag gcaaggaggt 105301 gagtcctggc ggccacgata tggatggcta taccaccgga ggtgcactcg ggcctgttga 105361 gcgccgggtg cggtccggga tcattgcttg ttgccgcgca gcagtggcaa gaacttagtg 105421 atcagtacgc actcgcatgc gccgagttgg gccaattgtt gggcgaggtt caggccagca 105481 gctggcaggg aaccgccgcc acccagtacg tggctgccca tggcccctat ctggcctggc 105541 ttgagcaaac cgcgatcaac agcgccgtca ccgccgcaca gcacgtagcg gctgccgctg 105601 cctactgcag cgccctggcc gcgatgccca ccccagcaga gctggccgcc aaccacgcca 105661 ttcatggcgt tctgatcgcc accaacttct tcgggatcaa caccgttccg atcgcgctca 105721 acgaagccga ttatgtccgc atgtggctgc aagccgccga caccatggcc gcctaccagg 105781 ccgtcgccga tgcggccacg gtggccgtac cgtccaccca accggcgcca ccgatccgcg 105841 cgcccggcgg cgatgccgca gatacccggc tagacgtatt gagttcaatt ggtcagctca 105901 tccgggatat cttggatttc attgccaacc cgtacaagta ttttctggag tttttcgagc 105961 aattcggctt cagcccggcc gtaacggtcg tccttgccct tgttgccctg cagctgtacg 106021 actttctttg gtatccctat tacgcctcgt acggcctgct cctgcttccg ttcttcactc 106081 ccaccttgag cgcgttgacc gccctaagcg cgctgatcca tttgctgaac ctgcccccgg 106141 ctggactgct tcctatcgcc gcagcgctcg gtcccggcga ccaatggggc gcaaacttgg 106201 ctgtggctgt cacgccggcc acggcggccg tgcccggcgg aagcccgccc accagcaacc 106261 ccgcgcccgc cgctcccagc tcgaactcgg ttggcagcgc ttcggctgca cccggcatca 106321 gctatgccgt gcccggcctg gcgccacccg gggttagctc tggccctaaa gccggcacca 106381 aatcacctga caccgccgcc gacacccttg caaccgcggg cgcagcacga ccgggcctcg 106441 cccgagccca ccgaagaaag cgcagcgaaa gcggcgtcgg gatacgcggt taccgcgacg 106501 aatttttgga cgcgaccgcc acggtggacg ccgctacgga tgtgcccgct cccgccaacg 106561 cggctggcag tcaaggtgcc ggcactctcg gctttgccgg taccgcaccg acaaccagcg 106621 gcgccgcggc cggaatggtt caactgtcgt cgcacagcac aagcactaca gtcccgttgc 106681 tgcccactac ctggacaacc gacgccgaac aatgaacaag gagaaaagaa ccgatgacgc 106741 ttaaggtcaa aggcgaggga ctcggtgcgc aggtcacagg ggtcgatccc aagaatctgg 106801 acgatataac caccgacgag atccgggata tcgtttacac gaacaagctc gttgtgctaa 106861 aagacgtcca tccgtctccg cgggagttca tcaaactcgg caggataatt ggacaaatcg 106921 ttccgtatta cgaacccatg taccatcacg aagaccaccc ggagatcttt gtctcctcca 106981 ctgaggaagg tcagggggtc ccaaaaaccg gcgcgttctg gcatatcgac tatatgttta 107041 tgccggaacc tttcgcgttt tccatggtgc tgccgctggc ggtgcctgga cacgaccgcg 107101 ggacctattt catcgatctc gccagggtct ggcagtcgct gcccgccgcc aagcgagacc 107161 cggcccgcgg aaccgtcagc acccacgacc ctcgacgcca catcaagatc cgacccagcg 107221 acgtctaccg gcccatcgga gaggtatggg acgagatcaa ccggaccacg cccccaataa 107281 agtggcctac ggtcatccgg cacccaaaga ccggccaaga gatcctctac atctgcgcga 107341 cgggcaccac caagatcgag gacaaggacg gcaatccggt tgatccggag gtgctgcaag 107401 aactcatggc cgcgaccgga cagctcgatc ctgagtacca gtcgccgttc atacatactc 107461 agcactacca ggttggcgac atcatcttgt gggacaaccg ggttctcatg caccgagcga 107521 agcacggcag cgccgcgggc actctgacga cctaccgcct gaccatgctt gatggcctca 107581 agacgccggg atacgcggca tgagccacac cgacttgacg ccctgcacac gggtgctggc 107641 atccagcggc acggttccga tcgcagagga actgctggcc agagtgctcg agccctactc 107701 ctgcaaagga tgtcgctacc tcatcgacgc acagtacagc gccaccgagg attcggttct 107761 tgcctatggc aacttcacga tcggtgagtc cgcctatatt cgaagcacgg ggcacttcaa 107821 cgcggtcgaa ctgattctgt gtttcaatca gctcgcctac agcgccttcg ctccggccgt 107881 cctcaacgag gaaatccggg tgcttcgcgg ctggtcgatc gacgactact gccaacacca 107941 gctctctagc atgctgatca ggaaggcatc atcgcggttc agaaaaccgc tgaacccgca 108001 aaagttctct gcccgcctcc tgtgtcgaga tctgcaggtc atcgaacgaa cctggcgcta 108061 tctcaaggtc ccgtgcgtca tcgagttctg ggacgagaac ggcggggcgg cgtccggtga 108121 gatcgaacta gcggccctca acattccgta atccaatggg aggaaagaag tttcaagcta 108181 tgcctcagtt gccatctacc gtgctggacc gggtcttcga gcaggcacgg cagcagccgg 108241 aagcaatcgc cttgcgtcgc tgcgacggca ctagcgcact gcggtaccgt gaactcgtcg 108301 ccgaagttgg tggccttgcc gcggatttgc gtgcccagtc ggttagccgg ggttctaggg 108361 tgctggtcat ttccgacaat ggacccgaga cgtacctgtc ggtgctggcg tgtgcaaagc 108421 tcggggcgat cgccgtcatg gccgacggca atcttccgat cgcagccatc gaacgattct 108481 gtcagatcac cgaccccgca gcggctctcg tcgcaccagg gagcaagatg gcatcttccg 108541 ccgttcccga ggcgctgcac tcgataccag tgatcgcggt cgacatagcc gctgttacac 108601 gggaatccga gcattccttg gatgcagcca gcctcgccgg gaacgcggac caggggagcg 108661 aggatccgct ggcgatgatc ttcaccagcg gtaccacggg cgagcccaag gctgtgctac 108721 tggccaaccg caccttcttc gccgtcccgg acatcttgca aaaagagggt ttgaactggg 108781 tcacttgggt cgtcggcgaa accacctact cgccgctgcc ggcgacgcac atcggtggac 108841 tgtggtggat acttacctgc ctgatgcacg gcgggttgtg tgtcaccggc ggcgagaata 108901 cgacatcgtt gctggagatt ctcaccacga acgcggtggc gacgacgtgc ctagtgccaa 108961 cgcttctttc gaagttagtt tctgaactga agtccgccaa cgcgacggtt ccctcgctgc 109021 gcctagttgg atacggtggt tcgcgggcga tcgcggccga tgtgcggttt atcgaagcta 109081 ccggcgtgcg caccgcacag gtctacggat tgagcgagac cggttgcacg gctttgtgtt 109141 tgccgaccga tgacggctcg atcgtcaaga tcgaagcagg tgctgttggc cgtccgtacc 109201 ctggcgtgga cgtctatctt gccgctaccg atggcatcgg ccctaccgcc cccggcgccg 109261 gcccgtccgc ctcgttcggc acgctatgga ttaagtcacc ggccaacatg ctgggctact 109321 ggaacaatcc cgaacgcacc gcagaggtgc tgattgacgg ctgggtgaac accggtgacc 109381 tgctggagcg ccgcgaggac ggcttcttct acatcaaggg aagatcctcg gagatgatca 109441 tctgtggtgg cgtgaacatt gcgcccgacg aggtcgatcg catcgcggag ggcgtgtcgg 109501 gcgtccgcga ggccgcgtgc tacgagattc ctgacgaaga gttcggcgcg ctggtgggcc 109561 tggccgtggt cgcatcggca gagcttgacg agtcggcagc ccgggcgctc aagcacacga 109621 ttgcggctcg ttttcgacgg gagtccgagc cgatggcgcg gccgtcgaca attgtgatcg 109681 tcaccgacat tccacgaacg cagtccggca aggtcatgcg ggcctcgctt gcagcggcgg 109741 caacagcaga caaggccaga gtggtcgttc gtggctgagc cggtgcggga ccgaatcctc 109801 gccgccgtct gcgacgtgtt gtatatcgac gaggcggatc tcattgatgg cgacgaaacg 109861 gatctccgcg acctcgggct ggactctgtt cggtttgttc tgctgatgaa gcagctaggc 109921 gtgaaccgac aatccgaact gccgtcccga ttggccgcga acccgtcgat tgcgggttgg 109981 cttcgcgagc tggaggctgt gtgcaccgag ttcggttaag ccgctcgcag cgcaacctct 110041 acaacggcgt gcgccaggat aacaatcccg cgttatatct gatcggcaag agctatcggt 110101 tccgccggtt ggagctggcg agattcctgg ccgctctgca cgcaacggta ctggacaacc 110161 ccgtgcaact ttgcgtcctg gagaattcgg gggcagacta tccggatctg gtgccgcggc 110221 tacggttcgg cgacatcgtg cgggtggggt cagccgatga gcacctgcag agcacatggt 110281 gttcgggcat cctgggcaag ccactggtgc ggcatacggt gcacaccgac ccgaacgggt 110341 atgtgaccgg tctggacgtt cacacccacc acatcctgct ggacggcggc gcgaccggga 110401 cgatcgaagc tgacctggcg cgttacctga ccaccgaccc ggcgggcgaa acccccagtg 110461 tcggtgcggg tctagccaag ctcagggagg cgcaccgtcg tgagacggcc aaggtggaag 110521 aatcgcgggg gcgcctgtcg gctgtcgtgc agcgtgaact cgccgacgaa gcataccacg 110581 gcgggcacgg gcacagcgtt agcgacgctc ccgggaccgc ggccaagggc gtcctgcacg 110641 aatcggcaac gatctgcggc aacgcgtttg atgccatcct gaccctttcg gaagcgcagc 110701 gggtcccgct taatgtgctg gtggctgcgg cggccgtcgc ggtggacgcg agccttcggc 110761 agaacaccga aaccctcttg gtgcacacgg tggacaaccg gttcggagat tctgatctga 110821 atgtcgcgac ctgtttggtc aattcggttg cccagaccgt ccggtttccc ccatttgcgt 110881 cggtgtccga tgtcgttcga acgcttgacc gcggctatgt caaggcggta agacgccggt 110941 ggcttcgtga ggagcattac cgccgaatgt atttggcgat caaccggaca tctcacgtgg 111001 aggcgttgac gctaaatttc attcgcgagc catgcgcacc tggcctgcgc ccgttcttgt 111061 cggaggtccc gattgccacg gatatcggtc cggtcgaggg catgacggtg gcgtctgttc 111121 tggacgaaga acagcgcaca ctgaacctag ccatctggaa ccgagccgat ctgcccgcgt 111181 gcaagacaca ccccaaggtc gcggaacgga tagcggcagc gttggaatcg atggcggcga 111241 tgtgggatcg gccgatcgcc atgatcgtca acgactggtt cgggatcggc ccggacggga 111301 ctcgctgcca aggcgattgg ccagcccgtc agccgtcgac gcccgcgtgg tttctcgatt 111361 ccgcaagggg cgtccaccaa tttctcggca ggcgccgctt cgtctacccg tgggtcgcgt 111421 ggttggtgca acgcggcgcc gcaccgggtg atgttctggt gttcaccgac gacgacaccg 111481 acaagaccat tgacctgctc atcgcgtgtc accttgcggg ttgcgggtac agcgtctgcg 111541 acaccgctga cgaaatttcc gtgcggacca atgcgattac cgagcacggc gatggcatct 111601 tggtgacagt ggtcgacgtg gccgccaccc agctggcggt tgtcggccat gacgagctgc 111661 ggaaggtcgt tgacgagcgc gtcacacagg tgacacacga cgcactgctg gccaccaaga 111721 ccgcctacat catgccgacc tcgggaacta ccggacaacc caagctggtg cgaatctcac 111781 acggctcgct cgcggttttc tgtgatgcga tcagccgcgc ctacggttgg ggagcccacg 111841 acaccgttct gcagtgcgct ccgttgacat cggacatcag cgtcgaggag attttcggtg 111901 gcgcggcctg tggcgcgcga ctggtgcgat ccgcggctat gaaaaccggc gacctggcgg 111961 cgctggttga cgatctcgtc gcccgcgaga cgacaatcgt cgacctgccg accgccgtct 112021 ggcagctgtt gtgcgccgac ggcgacgcca ttgacgcgat cggccgctcg cgcctgcggc 112081 agatcgtaat cggcggtgaa gccatccgct gtagcgccgt ggacaagtgg cttgaatcgg 112141 ctgcttcaca agggatctcg ctgctctcga gctatggtcc aacagaagcc acggtcgtcg 112201 ccaccttctt gccgatcgtt tgcgaccaga ccaccatgga cggcgcactg ctcaggctcg 112261 gccggccgat cctaccgaac acggtgttcc tcgcgttcgg tgaagtcgtc attgtcgggg 112321 atttagtcgc cgacggctac ctcgggatcg acggcgacgg cttcggcacc gtgacggccg 112381 cagacggttc ccgacgccgt gcctttgcca ctggcgaccg ggtgaccgtc gacgccgaag 112441 gatttccggt cttctccgga cgcaaagacg ccgtcgtcaa gatctccggc aagcgtgtcg 112501 atatcgctga ggtaaccagg cgcatcgccg aagaccccgc ggtgtcagat gtcgccgtcg 112561 agttgcacag cggaagcctc ggagtgtggt tcaagagcca acggacccgc gagggcgaac 112621 aagacgctgc cgcggcgacc cggatcaggc tcgtcctcgt gagtctggga gtgtcgtcgt 112681 ttttcgttgt cggcgtgccg aatatcccga ggaagcccaa cgggaagatc gacagcgaca 112741 acctgccgag gctgcctcag tggtcagctg ctgggctaaa caccgccgag acgggtcagc 112801 gagcggccgg cctctcgcag atctggagcc ggcagctcgg ccgggcaatc gggccggact 112861 cgtcgctgct tggtgagggc atcggctcgt tggatctcat cagaatactg cccgagacgc 112921 gtaggtatct ggggtggcgc ctctcgctgc tggatctgat cggtgccgat accgccgcca 112981 atctggccga ttacgcgcca acgcccgacg cgccgacggg cgaagatcgg tttaggccgc 113041 tggtggccgc gcaacggccc gcggcgattc cgttgtcgtt tgcccagcgg cgactatggt 113101 ttctcgacca gttacagcga cccgctccgg tctacaacat ggcggtggcg ttgcggctgc 113161 gcgggtatct cgataccgag gcgttgggcg cggcggtcgc cgatgtcgtg ggccgccacg 113221 aaagcctacg gacggtgttt ccggcggtcg acggggtccc tcggcagctg gtcatcgaag 113281 cgcggcgggc agatcttggc tgcgacatcg tcgatgccac cgcatggccg gctgaccggc 113341 tgcaacgggc catcgaggag gcggcgcgcc acagcttcga tttggcaacc gagatacctt 113401 tgcggacgtg gcttttccgg atcgccgacg acgaacatgt gctggtggcg gttgcacacc 113461 atatcgccgc cgacggctgg tcggtggctc cgctgacggc cgatctgagt gcggcatatg 113521 ccagccgttg tgcgggtcgg gcaccggact gggcgccatt gccagtgcag tatgtcgatt 113581 acacgctgtg gcagcgggaa atcctcggtg atctcgacga cagcgacagc ccgatcgccg 113641 cgcagctggc ctactgggaa aatgcgttgg ccggtatgcc ggaacggctg cggctgccca 113701 ccgctcggcc ctatccaccg gttgccgatc agcgcggcgc cagtttggtg gtggattggc 113761 cggcgtcggt gcaacagcag gtgcgtcgga tcgcccgcca gcacaacgcg accagcttca 113821 tggtggtagc tgccgggctt gccgtgctgc tgtcgaaact cagcggaagc cccgatgtgg 113881 cggtcggatt tcccatcgcc ggccgcagcg atcctgcgct ggataacttg gtgggctttt 113941 ttgtcaacac cttggtgttg cgggtcaacc tggccggtga tcccagcttc gccgaactgc 114001 tggggcaggt gcgagcgcgc agcctggccg cctacgaaaa tcaagacgta cctttcgagg 114061 tgctcgttga tcgcctcaaa cccactcgag ccctgaccca tcacccgctg atccaggtga 114121 tgttggcctg gcaggacaat ccggttggac agctgaattt gggtgatctg caggccaccc 114181 cgatgccgat cgacacccgc accgcccgca tggacttggt gttttcgtta gcggaacgct 114241 tcagcgaggg tagcgaacct gccgggatcg gcggagcggt ggaataccgc accgatgtgt 114301 ttgaagccca agcaatcgac gtgcttatcg agcggttgcg gaaggtgttg gtggcggtgg 114361 ccgctgctcc ggaacggacg gtgtcgtcga tcgatgcgct ggatgggacc gagcgtgccc 114421 ggttggatga gtggggtaac cgcgctgtgc tgactgcgcc cgcgcccacg ccggtgtcga 114481 tcccgcagat gttggccgcc caggtggcac gtatccccga agcggaggcg gtgtgttgcg 114541 gggacgcgtc gatgacgtat cgggaactcg acgaggcgtc caaccggtta gcgcatcggc 114601 tggcaggttg tggggccggc ccgggcgagt gtgtggcgct gctgttcgag cggtgcgcgc 114661 cggcggtcgt ggcgatggtg gcagtgctca aaaccggggc ggcgtatctg ccgatcgatc 114721 cggcgaatcc tccgccgcgg gtggcgttca tgctcggcga cgcggtgccc gtggccgcgg 114781 tcaccacggc tgggctgcgc tcccggttgg cgggacacga cttgccgatc atcgatgtcg 114841 tcgatgcttt agcggcatat ccgggcacgc ccccacccat gccggccgca gtgaacctcg 114901 cctacatcct gtacacctcg ggcactaccg gcgagcccaa aggcgtgggg atcacccatc 114961 gcaacgtcac caggctgttc gcatcactgc cggcacgctt gtcggcggcg caggtgtggt 115021 cgcagtgtca ttcctatggc ttcgacgcct cggcgtggga gatctggggc gcgttgctag 115081 gtggtgggcg actggtgatc gtgcccgagt cggtggcggc ctcgccgaac gactttcatg 115141 ggctgctcgt ggccgaacac gtcagcgtgc tgactcagac tccggctgcg gtggcaatgt 115201 tgccgacgca gggtttggag tcggtggcgt tggtggtggc cggtgaggca tgtccggcag 115261 cgctggtgga tcggtgggcg cccgggcggg tgatgctaaa tgcttatggc ccaaccgaga 115321 ccacgatctg tgcggcgata agtgcgccgt tgcgaccggg ttcggggatg ccgccgattg 115381 gtgttccggt gtcgggggcg gcgttgtttg tgctggatag ctggttgcgc ccggtaccgg 115441 ccggggtggc cggagagttg tacattgccg gtgcgggcgt cggtgttggg tattggcgtc 115501 gggcggggct gaccgcgtca cggtttgtgg cctgcccatt cggcggttcc ggggcacgca 115561 tgtatcgcac cggggatctg gtgtgttggc gcgccgatgg ccagttggag ttcctggggc 115621 gcaccgacga tcaggtcaag atccgcgggt atcgcatcga gctcggcgag gttgcgaccg 115681 cgctggccga gctggctggg gtaggtcaag cggttgtaat cgcccgtgaa gaccgccctg 115741 gggacaagcg cctagtcggg tatgccaccg aaattgcccc cggggcagtg gacccggccg 115801 ggctgcgggc gcaactagcc cagcgattgc ccggttacct ggtgccagcc gcggtggtag 115861 tgatcgatgc gcttccgttg acggtcaacg gcaaacttga tcatcgtgcg ttgccggcac 115921 cggaatacgg tgataccaac ggatatcgcg ctccggccgg gccggttgag aagaccgtgg 115981 ccggcatctt tgcccgggtt cttgggcttg agcgggtcgg cgtcgacgac tcgttcttcg 116041 agctcggcgg cgattcgctg gcggcaatgc gggttatcgc cgcgatcaac accaccctaa 116101 acgccgatct gccggtgcgc gcgttgctgc acgcgtcgtc gacgagaggt ttaagccagc 116161 tgttggggcg agatgcccga ccgaccagcg atccgcgctt ggtgtctgtg cacggcgaca 116221 accccaccga ggtgcatgcc agcgacctca cgctggaccg gttcatcgac gccgacacgc 116281 tggccaccgc cgtcaacctg ccgggcccga gccccgagct acggacggtc ctgctgacgg 116341 gcgcgacggg tttcctcgga cggtatctgg tccttgaatt gctgcggcgg ctggacgtcg 116401 acggcaggct gatctgtttg gtgcgggcgg agtccgacga ggatgcgcgg cgtcgtctgg 116461 agaagacctt cgatagcggt gacccggaat tgctgcggca cttcaaggag cttgccgccg 116521 accggctgga ggtcgtcgca ggcgacaaga gcgaacccga cctgggcctg gaccaaccga 116581 tgtggcggcg gctggccgaa accgtggatt tgattgtcga ttccgcggcg atggtcaacg 116641 cgtttcccta ccacgaattg ttcgggccca acgtcgcggg caccgccgag ctgatccgaa 116701 tcgcgcttac caccaagctc aaacccttca cctacgtgtc aaccgccgac gtgggtgctg 116761 cgatcgagcc gtcggcgttc accgaggacg ccgacatccg ggtaatcagc cccacccgca 116821 ccgtcgacgg cggctgggct ggcggctacg gcaccagcaa gtgggccggt gaggtgctgc 116881 tgcgcgaggc caacgacctg tgcgcgctgc cggtcgcggt gtttcgctgc gggatgatcc 116941 tggccgacac cagctatgcc ggacagctca acatgtcgga ctgggtcacc cggatggtgt 117001 tgagcttgat ggctaccggc atcgcgcctc gttcgttcta cgaaccggac tccgagggca 117061 atcggcaacg cgcgcacttc gacgggctgc cagtcacctt cgttgccgag gcgatcgcgg 117121 tgctgggcgc gcgggtggcc ggctcatcgt tggcgggatt tgcgacctat cacgtgatga 117181 acccgcacga cgacggtatc gggctcgatg agtatgtgga ctggctgatt gaggccggct 117241 acccgatacg ccgcatcgat gactttgcgg agtggttgca gcggtttgag gccagcctgg 117301 gcgctctgcc ggatcggcaa cgccggcact cggtgctgcc gatgctgctg gcgagcaatt 117361 cccagcgatt gcagccgctt aagccgacca gggggtgctc cgcgccgacc gaccgattcc 117421 gtgccgcggt gcgagcggcg aaagtcggct ccgacaagga caatccagac atcccgcacg 117481 tgtcggcgcc gaccatcatc aactacgtca ccaacctaca actgctcgga ctgctgtagt 117541 tgctcggcga taaagagcgc agccatggtc gggggagatc atgtggtcac tttcgggtcg 117601 gcatcgattc tgcgagcaga atatgtggtt gatggccact aggccggtac cggggaactg 117661 gcggttcccg gccgatgagc atcggccctg acgcgcggcc gtaagctcca ggaatgggga 117721 cgcacggggc taccaagagt gcgacgtcgg ctgtgccaac gccccggtcg aactccatgg 117781 cgatggtacg gctggcaatt ggcctgctgg gtgtgtgcgc ggtggtcgcg gccttcgggc 117841 tggtgtcggg agcgcgccgc tacgctgagg ccggcaatcc ctatccgggc gccttcgtca 117901 gcgtcgccga gccggtcggg ttcttcgccg cgtcgctggc cggtgcgctg tgtctgggcg 117961 cgctgatcca cgtggtcatg acggccaaac ccgagccgga tggcttaatc gacgccgcgg 118021 cgttccggat tcacctgctg gcagaacgtg tttcaggtct ctggttgggg ctagccgcga 118081 ccatggtggt cattcaggcc gcccacgata ctggagtggg gcccgcgaga ctgctggcta 118141 gtggggcact atcggactcc gtcgccgcct ccgagatggc acgcgggtgg attgttgcgg 118201 cgatctgcgc gctggtggtt gcgacggcgc tgcggctgta cactcgctgg ctcgggcacg 118261 ttgtgctgct tgtccccact gtgcttgccg tcgtcgccac cgcggtgacc ggtaacccgg 118321 gacagggacc cgaccatgac tacgcgacca gcgccgcgat cgtgttcgcg gtcgcgttcg 118381 ccaccttgac cgggctcaag atcgctgcgg cgttggcggg aacgacgcca agccgcgctg 118441 tgctggtaac gcaggtcacc tgtggagcgc tcgcgttggc atacggagcg atgctgcttt 118501 atctcttcat cccgggctgg gcggtcgatt cggattttgc ccgccttggt ctgcttgcgg 118561 gggtaatcct gacgtcggtg tggttgtttg actgctggcg gctgttggtc aggccgccac 118621 atgcgggccg tcgccgcggt ggtggctccg gtgccgcact ggccatgatg gccgccatgg 118681 cttcgatagc tgccatggcc gttatgaccg cgccgcgatt tctcacccac gcgttcacgg 118741 cttgggatgt cttcctcggc tatgaactgc cgcaaccgcc gaccatagcc cgggtgctca 118801 ccgtgtggcg cttcgatagc ctgatcggag ccgctggtgt ggttctcgcg atcgggtatg 118861 cggcgggctt cgccgcgctg cggcgccgag gtaactcttg gccggtgggc agattgatcg 118921 cctggctgac tggttgcgcc gcactggtat tcaccagcgg ctccggtgta cgggcctatg 118981 gttcggcgat gttcagcgtc cacatggccg aacacatgac actgaacatg ttcatcccgg 119041 tcctgttggt gctcggtggc ccggtcacgc tggcgctgcg ggtgctgccg gtaacgggtg 119101 atggacggcc gccgggggct cgcgaatggc tgacctggct gctgcactcc cgggtgacaa 119161 ctttcctgtc gcacccgatc accgcattcg tcctctttgt ggcctcgccc tatatcgtct 119221 atttcacacc gctgttcgat accttcgtcc gctatcactg gggccacgag ttcatggcga 119281 tccatttcct ggtggtcggg tacttgttct actgggcgat catcggcatc gacccagggc 119341 cgcgccgact gccctacccg ggccggatcg ggctgttgtt cgcggtgatg ccgttccacg 119401 ccttcttcgg gatcgcgctg atgacgatgt cgtctacggt gggcgctacg ttctatcgtt 119461 ccgtcaatct gccgtggttg tcgagcatca tcgccgacca gcatctcggc ggtggaattg 119521 cttggagcct aacggaattg ccggtcatca tggtcatcgt ggcgctggtt acccaatggg 119581 cgcgccaaga ccgccgagtc gcgtcccgcg aagaccggca tgccgacagc gactacgccg 119641 acgacgagct ggaagcctac aacgcgatgc ttcgcgagtt gtcgcgaatg cggcgctgaa 119701 tgtgcagatg attttggaag cggttggcgt atctgcccgt gctcggctac accaggaccg 119761 cggggcgctg gcacgcgaac gatccggcga ggaggtgggc cagccggaga ttccctccac 119821 aggctgcagc agaagtcctg gatctgaccc cgacctgaac ccttgtcagt gcggtccatc 119881 gacggaaaat tgctgttccg ccatgctggg catgctattg agcgccaaaa ttgcgtagcc 119941 gcaagctgtt tgacacgacg aaaaatgacg agaacgccat ggcggcaccg gcgatcaaag 120001 ggttgagcag tccggcggcg gcaatcggga tggctgcgac gttgtacccg aacgcccaga 120061 tcatgttcat ccggatcgtc cgcatggttg cacgggccag gtccagcgcc tgcggaacag 120121 tattcagatc atcgcgcacc agaatgatgt cggctgcacc gagcgcgacg tcggtgccac 120181 gcccgatcgc caaccccaag tcggcaccca ccaacgcggg accgtcgttg atgccgtcac 120241 cgaccatggc gacggtatgt ccttcctcgc ggagccgttg gatcacgtcg accttgcctt 120301 cgggcagcat atcggcgaca gcggagtcga tgccgacctg cgccgccacc gcgtcggcgg 120361 cggcccgatt gtcgccggtg agcagaatcg tccgcagccc gcggctgcgt agcgcagcga 120421 cggcggcagc cgctgaatcc ttgagggtgt cggcgattgt cagggctgcg cggacgacac 120481 cgtcgaccga cacaaaaacg acagtctcgc ctcgggattc gccgtccagg cgcgcggaca 120541 ccagagccgc gtcgtggcag ggcgtggtcc gggtaatcca ggatggcttg ccgacctcaa 120601 cgtgatggcc gccgacttcc cccgatacac cgcagcccgc gacggcgaca aacccgttga 120661 ctggacccgg atccggcgaa gcggcaacga tggccgccgc catcgcatgc tcggaagccg 120721 attcgacagc ggcggcgagg ccaagcactt cctcgcgatc tcgctcgctg gtgcctgaac 120781 ctgccattgt tacggtgctc accgccagct gcccaaccgt caacgtgccg gtcttgtcga 120841 acaccacggt gtcgatgctc cggatggttt ccagtgcccg gtaccccttg ataaagatcc 120901 ctagctgcgc tccccgtccg gaagcaacca tcatggcggt aggtgtcgcg agcccaagcg 120961 cacacgggca cgcgatcacc aacaccccta gcgtgaccga gaacgcgcga tccgcgcctg 121021 cgccgctgac gagccaggcc gcacctgcaa gtccagcaat gacgaaaacc accggcacga 121081 acacgcccgc gatgtggtcg gcgaggcgct gggcacgcgc cttctgcgtc tgggcttgct 121141 ccacgaggcg gaccatcgcg gcgaactggg tatcggcccc taccgcggtg gcctcgatga 121201 ccaggcggcc gtccatcacg accgtgcccc ccacgaccga ggccgccgga taggcacgga 121261 ccggcttggc ctcaccggtc atggcgctca tatcgatcgc cgcgctgccg tcgacaacga 121321 ctccgtcagc tgcgatggtt tcccccggcc gcgtcacgaa gcgctggcgc ttcttgagtt 121381 cgctcgccgg tatcactagc tccgcgccgt cgggcagcag caccgccaca ttcttggcgc 121441 ctagctccgc cagcgcacgc agcgcgctgc cggccttgga cttggctcgt gcttcaaagt 121501 aacgaccggc aagaacgaag acggtcacac cggccgcgac ctcgaggtag atcgagtcgc 121561 tgttgagaat ggcccgccag attcccgagc cttcccgtgg cggctgatcg ccgaagacgg 121621 acgaaagcga ccaggcggtg gcggccacga tcccgaccga gatcagcgtt tccatggatg 121681 tcgtccggtg gcgcgcgttt cgcagcgcga ccgagtggaa gggccatgcg gcccaggtca 121741 caaccggagc ggccagggcc gtcaatatgt atccccagcc gggaaccctg gcgctgggga 121801 cgatcgcgaa caacgtcgac aggtcagcca gcggcacgaa caacaccgcc gcgactagca 121861 gccgccgcag cagtctgcgg gcgtgggcgc cgtcgggatc ctttgtccgt ttgtctagga 121921 cggttgtctc ggtgtgcggt gccgcgtggt atccggcttt ctcgaccacc ccgcacagct 121981 catcggctgc catgcccacg gcatcgatgg tcgcgacgcg ggttgcgaag ttgacggatg 122041 cgcgtactcc ggggatcttg ttgagcttcg tctcgacgcg gctggcacag gccgcacatg 122101 acatacccaa aacatcgagc cggatccgcc gcaccgactg caggtcggca tctcccacaa 122161 ctggagccgc cacggccctc ctcggatcgg cgtatttgca cccgtcagcc tacaagtcgt 122221 aagcaggcgg taatcggttc cctatggccc gctggatgca ctggcgatgg attcttttgg 122281 tccgatttct gcggttggcg tgctaggttt ccgactgtga cgcccgtcac aacgtttcct 122341 ctcgtggacg cgatcctcgc tggtcgcgac cgcaaccttg acggcgttat cttgatcgcc 122401 gcccaacacc tgctgcaaac aacgcacgcc atgctgcgtt cgctatttcg ggtcggcctc 122461 gatccgcgca acgtcgcggt gatcggcaag tgctattcca ctcacccggg agttgtcgac 122521 gcgatgcggg ccgacggcat ctatgtcgac gattgcagcg acgcctacgc accccacgaa 122581 tcattcgaca cccagtacac ccgccacgta gaacggtttt tcgccgaatc ctgggcgcgg 122641 cttacggccg ggcgtacggc tcgtgtcgtg ctcctcgacg acggcggatc gctgctagcc 122701 gtcgccggcg ccatgctcga tgcgagcgcc gacgtgatcg gaatcgagca gacgtccgcc 122761 ggctacgcca aaatcgtcgg ttgtgcgctg gggtttcccg tcatcaacat cgcccgctcg 122821 tcggcaaagc ttctatacga gtcgccgatc atcgccgcac gcgtgacaca gacggcattc 122881 gagcgcaccg cgggcatcga ctcaagcgca gcgatcctga tcaccggcgc gggcgcaatc 122941 ggcactgccc tggccgatgt gctgcgtccg ctgcatgacc gggtggacgt gtacgacacg 123001 cgctccggct gtatgacgcc catcgatctt ccgaatgcga tcggcggcta tgacgtgatc 123061 atcggtgcca ccggcgccac cagtgtgccc gccagcatgc acgaattgct gcgccccggc 123121 gtattgctga tgtcggcgtc ttcgtccgat cgcgagttcg atgccgtcgc gttgcgtcgg 123181 cgcacgacgc ccaatcctga ctgccatgcc gacctcaggg tagccgacgg cagtgtcgac 123241 gctaccttgt tgaattcggg cttcccggtc aactttgacg gttcgcccat gtgcggcgat 123301 gcgtcgatgg cgctcacgat ggcgttgttg gcggccgcgg tgttgtatgc gtcggtcgcg 123361 gtcgccgacg aaatgtcatc cgatcatccg catctcgggc tgatcgacca gggcgacatc 123421 gtggcatcgt ttctgaacat cgacgtcccg ctccaagctc tcagccggct accgttgctt 123481 tcgatcgatg ggtatcgccg ccttcaggtg cgctccggct ataccttgtt ccgccaaggt 123541 gagcgggccg accacttctt tgtcatcgaa tccggcgagc ttgaggcgct cgtcgacggg 123601 aaggtcatcc ttagactcgg tgccggagac cacttcggcg aggcgtgttt gctcggtggc 123661 atgcggcgca tagcgacggt gcgggcatgt gagccatcgg tcctgtggga gctcgacggc 123721 aaggctttcg gcgacgcgct gcatggggac gctgcaatgc gtgagatcgc ctacggtgtc 123781 gctcgcaccc ggctcatgca cgccggcgcg tccgagtcct tgatggtgta acggtcttgc 123841 actcgtgggc tgtcggcgga tcacgggatc gttatgccgg ttcttgcgag tgacataggt 123901 tgacatacgt ataaccggtc cctgcggtcg aacacggctt gacaattgga cgaatctcgt 123961 tgcgcgccat cagttgtgct cacaggatcg ccgccgttcg gagcgatgag cccgcttggc 124021 gcgcgaagtg cgccggggcg gatcctgccc gagccgcgcg acgacggcct cgatgcccgt 124081 cgcggtcgat gaccttgatt ccttgggcgc tgacccgcac cttgatgcgg cggtcctccg 124141 acggtaagta gtaggccttg agctggatat tgggcggcca acgtcgccga gtgcggcggt 124201 gtgagtgcga cacagcctta ccgaaaccca cagtgcggcc ggtgatttgg cagcgggcgg 124261 acatggcgaa cctcctcccg gaccagcctg ttgaaaatag ttttcgacaa ccgttgcacg 124321 gcacggtagc gtgggtgcag tttaatggca atcattttca ataaggtttg gcgatgcgta 124381 ctccggtgat attggtggca ggtcaggatc acaccgacga ggtgacgggc gccttgttgc 124441 gccggaccgg aacggtggtc gtggagcacc ggtttgacgg ccatgtggtg cgacggatga 124501 ctgccacgct gagccgtggc gaattgatca ccacggagga cgctttggag ttcgcccacg 124561 gctgtgtgtc gtgcacaatc cgcgacgacc tgctggtgct gttacgcaga ctgcaccgcc 124621 gagacaatgt cggccggatc gtcgtgcacc tggcgccgtg gctggagccc cagcccatct 124681 gctgggcgat cgaccacgtg cgggtttgcg tcggacacgg atacccagac ggaccagccg 124741 ccctcgacgt gcgggtcgcg gccgtggtga cctgtgtgga ctgcgtaagg tggctgccgc 124801 agtcactcgg cgaggacgaa ctgcccgacg ggcgcacggt ggcccaagtg acggtcggtc 124861 aggccgagtt cgccgacctt ctggtgctga cccacccgga accggtcgcc gtggcggttc 124921 tgcgccgact ggcccctcga gcgcgaatca ccggcggcgt cgaccgcgtc gagctggcgc 124981 tggcgcatct ggacgacaac tcacggaggg gtcgtaccga taccccgcac acgccattgc 125041 tggcgggcct gcctccgttg gcagccgacg gtgaggttgc gatcgtggaa ttcagtgccc 125101 gccgcccgtt tcacccgcaa cgtctgcatg ccgcggttga cctgctgctc gatggcgtgg 125161 ttcgcactcg aggtcggctg tggctggcca accggccgga tcaggtcatg tggctcgaat 125221 cagccggtgg cggtctgcgg gtcgcatcgg ccggaaagtg gttggcggcg atggcggcct 125281 cggaggtggc ctatgtcgac ctggagcggc ggttgttcgc cgacctgatg tgggtctacc 125341 cgttcggaga ccggcacacc gcgatgacgg tactggtatg cggcgccgat ccgaccgaca 125401 tcgtcaatgc cctgaacgcg gcgctgctca gcgacgacga aatggcatct ccgcaacgct 125461 ggcagtccta cgtcgaccct ttcggcgact ggcatgacga cccgtgccac gaaatgcccg 125521 atgcggctgg ggaattctcg gcacaccgca actcaggaga atctcgatga aaccccggta 125581 tccatcccga ctaccagccc gtggtacaga cgccgacact acggctcagc gcgcgctgga 125641 tgctaccgag ggcgtcgata ggttctatcc acccgcgtca gagtcttccg cgtcgtcagg 125701 gcgttcatca ggttgcacga caccgactgt gcttgccaac cacttcggtg ccagcgctga 125761 gactgcggtg gctcctgccg tggcgctgaa gacgcccgtc caggcgaccg gtcccagcgg 125821 ggtacacccg aaaagtggct gataaccggg gtttgaatga tgccgaccaa caccccggcg 125881 ctgcccagtg cggtggcaat gacgagcgga ctgtgccggc gcgtcagcag tgtctgcgct 125941 agctgggtca tcacgagtgc agtcaaaccc attgtcgccg tgcgtcgttc ggttccggga 126001 gtccagcgcc cgatggccca ggctgccgtt gcgccggcgg cggtgacgac gccgcggtta 126061 acgatctgac gcagcaatgg cgcgtccagc gagggcgtag gcccgatcag caccgcacgt 126121 cgatgttcgc gctgcgctcg ctcggccgcg tcatcggttg ggtattcggc gtcgtcgggt 126181 tcggcaaact gcgaggtgac ggccaccgca agcgcgggaa acatgtcggt gagcagattc 126241 accagcagca gttgacgagt ccccaccggc gcccgcccgg ccccgaacgc cgtcccgatg 126301 acggtgaaca gaacttcgcc cacattgccg ccgaccagaa tcgtcaccgc gtcacgaaca 126361 ccggcccaca tgctgcggcc ctcgaccagc gcgtcgagca gcacgcccag gtcatcgtcg 126421 gtcagcacga tatcggcggc cccacgggcg gcagaggaac cgcgcccgct cactccgatg 126481 cccacgtcgg ccatccggat ggccgcggcg tcgttggcgc cgtcgccgac catcgcggtc 126541 actcgcccgc agcgctgcag cgccgccaca atctgaacct tttgttccgg gctgacccga 126601 gcaaagactt gcatgtcggc ggcgagtttg gcatgcgcct cctcgtccag gacggcaagt 126661 tcggcaccgg tcacgactcg cgcgtccgcc ggtagtccca gctggcgggc gatcgcccgg 126721 gcggtgatcg gatggtcgcc ggtgatcagc accacgttgc gctcggcgtc cagcaaggct 126781 tcgatcaacg gacgcgagga agaccgggcc gtatccgcca atccgacata gccgatcagc 126841 tcgagatcgt gcgcgacggc gtcgacagcg tcggcgtcgg tctcgtcatc atgggtggtc 126901 ccgttgtccc aggtgcgctg cgcgactgcc agaacacgca ggccctgctc ggcgaggtgg 126961 cgtaccacgg attcggcatg ttcgtggtcg acgcccgggt cggcgagtcg gcagcgcggc 127021 aggatcgtct ccggagcgcc cttgagcatc aacatcggta tcccgtcggt gcccactctg 127081 ccgatcgcgg cggcgtagcc gcgactggac tcaaacggta cttcggccag caccacccac 127141 tccgaatcgc cttggctact aagcgaaccg gccagcgcac tagccgccgc gaggatcgcc 127201 tcatcggtgg cgtgcgcgtg cccttccccg ttatggggct gcgtggacgc gcgcgcggcg 127261 gcccgcagca cctcggcgga gggcgcatcg gtggtctgcg gcaacggatc ccgttcggct 127321 gcggtgctgc tcggtagcgc gcataccacc cgcaggcggt tctcggtgag tgtgccggtc 127381 ttgtcgaaac atatggtgtc gacacggccc agcgcctcga tggtgcgagg cgagcgcacc 127441 agcgccccac gtgccgtcag gcgctgggcg gcggcaagct gggagagggt ggccaccaac 127501 ggtagaccct ccgggaccgc ggccaccgcg atggcgacgc cgtcggccac cgcttgccgc 127561 agcgacgccc ggcgcagcaa cgccagagct gtcaccgcgg cgccgccggc caacgtcatg 127621 ggcagcactt tgctggtcag ctcgcgcagc cgggcctgga ctccggccgc cgtttcgaca 127681 tcggcgaccg ccgagatcgc gcgatgtgcg gcggtgccga ctccggtggc taccacgatc 127741 gcgcgggcgt gtccggcgac gatggtgctg ccctcaaaca gcatgctggc ccggtcgggg 127801 tcgttgacgg cgacggggtc cacctgcttg tccaccggta gcgactcgcc ggtaagaaag 127861 gactcgtcga cctcgaggtc ttcggccacc agcaggcgcg catccgccgg gaccacctcc 127921 ggcgcggcca ggtcgatgac atcgccgact cgcagcgact tcgccgacac cgtggccgtc 127981 cgggtggcgt gccgggccgc ctccagtcga cgtcgggtag tcgctaccgc cggcaccacc 128041 acccggcgca ccagctggtc ctgctcggcg aatagctcgg cggccgccgc ctcggctcgc 128101 aatcgttgta ccccaccggt gatcgcgttg accgtcatca cgcccgctac cagtagcgcg 128161 tcgatattgc tgccgacaat cgccgatgct gcggcgccca ccgccaggat cggagtcagc 128221 ggatcggcca gttcatggcg ggtggccacc gccagctgcg ccaaggttcg tgccgggccg 128281 cgcagcggcg ccatcaccgg ttcgtaggac aggtcgtcca gaatgcgccg ccaggccggg 128341 attccgggtt cgacggccaa gggtcgggag ccgccggcta gccgcgagta gacgatctcg 128401 gggtccagcg cgtgccaggc ggtcagcggt tgcggggtgg ggtcgggcat ccgcagcacc 128461 ttggcggccg accacattcc ggacaccaaa gccgttgcgg cagcggcatt gaccggattg 128521 agccagcgac ggaagctggc tgggttggtg gttttgtcct gctcaccggt gaccaacaac 128581 agcccggcca aggtggtgcc accttgggcg aggtgtaccg cggattcact ggctgcccgg 128641 gccaccggaa gcgctgacag gatccgcacc gccgcggcca gatcggtgcc ggtgattagg 128701 tcggcagtcc atggtgttgc cccgcgggga tcgtcgagag ccacaccgac gtcagcgatg 128761 gccaacgcgg ccaacgtatc ggtggatgcg aagtcccggt gcaccgcggt gatcagcaat 128821 accggtccgc gatccgcgcg caactcacgc accaacttca gcaacggcgt gccaggcgga 128881 tgcgtcgaac cgacgctggc cgatagatct tcggtgcccg cgacatggcg caaaaccacc 128941 cgcgctccgg ttcggtgcgc ggtctgcagc agcgggattg cgtatgggtc gacttcccac 129001 cccacgtcga cgctgcccac gcattggcca tcgaccacca ggtcggcatg ctcgaggccc 129061 tgagccggcg ttgccgacgg cccttgagcc ggagcccatc tcaagcgggc accggtagcc 129121 ggcaattcat cggggtcggg ttcgggtgcc tgctcgccat ggagcaaggc gtcggcgacc 129181 tcgtagacgc ggtcgtcgtc ccagccgggt tcgtctccct gtgcatgcaa tacggcgcgg 129241 ttgtcaccgc gcagcgctgc gccgtcgata acgaccaccc gcacccgatc caggcggcgc 129301 aacgcgccgg ggtcgaggac tagttgcccg gtgttggcga gcccgcgacc cagcaccgcc 129361 gcgaacgcct gtcggcccat gtgcgcggca cgtggtactc cggccaggat cgcgccggcc 129421 gcgtcctcgg taccaccgcc ggccaccagt gcactggccg cggcgatcaa cgaaccgttt 129481 gcggcctggt tgacgtattg ctccaccggc ccggcccttg accctttggc ggtgtcgatc 129541 gcggcgtcga tggatccgcc gaccacgacg tgcgaagcct cgcctgccgc cgcagccgcc 129601 caactgtgtc gcggctcctg cgatttggcc ccggccgacg agatgatggg caccaccgga 129661 gcctgcggcc gtctggggga ggccagcgcg ggttcgcggt cacgccatac gcgacggtgc 129721 gccgccgctt cggagatttg caggctgcgt tgcaccagat cgagcagcgg tgtgcccagg 129781 gactgggtta gcccgttggc cgccgccgtc gtagcggcga gcgcaatatc ggtgcccact 129841 cgacccaacc gcgactccat aagcgacacc attcgcggtt gatggtttat cagagcggcc 129901 agggctctgg tggtttgcgg tgcggcgggc agtcgggcga cccagccggt gaccgtggca 129961 cccatcgcta ccagatccat tgccgcagcg gtcaacggca ccaggatcgc caaggggtta 130021 cctgggtcgg cgaatggtgc cgagttcggc gacgacaccg acccagccag gaatatgtcg 130081 gcagccaccg cggaaaccac atcacgtacc tcgtccacgg cgatgtcgct atccgcatca 130141 ggttcaagtt cgaccaccag ccgacccaat gagccctcaa cgtgggcctc ggccacgccc 130201 gggatcctgc ggactggctc ctccaccatg gcggcatgct cgtgccagcg agggaatggc 130261 agtagcggat ccaggtcgaa atgcacgcgc cgtccgctgc gccaacgcac cggcggtgtc 130321 attccgtcag gcgattcgtt gtgggaaccg cgtaccccga ttgcgcgacc agtcgtttgc 130381 accaccgact gcaccaccgg gccggtcagc tccagcaccg ggctggccag cgtctgcacc 130441 gccgcggccg cactccccgg caagcgggcg ccggctcgca ctgtctgtgc tactccgttg 130501 gtcacaccac cgaggacagt ggccacaccc gggatcttca ctgagtcacc cttcaactac 130561 cgataccgcg cctaatcctg atggcgtatc agcgccatgt ctaccgactt gcgcatactt 130621 cgccgggtga ggtcgccggt gaaggcagtc cggacccctt tggtctgcga gcgatgaatg 130681 cagacgccgt gtcggatcta gcttgagtac gggcgggccc gtgacgcgcc ggtggcgggc 130741 acgtgaaacc gacccaaacg atcccaacga cgcggcaacg cctggctaac ggctcacgga 130801 tcgaatcagt ggatgcggtg gggtccgtga atcagccggc aagcggccaa gcgttgcatt 130861 gtgcggccga cattgggggg ccgacgaaat cggctcacaa aatgcggtgg tgggccctgc 130921 cgacgtggta acccgccggg aaggacttct cgatgacctg cagctggtgg ttcactctcg 130981 actccagctc gagcaccacg cagctgtcac cgacggtctt cgactcaagg acgatgtacc 131041 gctgaccgcc accgttgaca tcggtgatcg ggtcaccgga gtgcaatgtt tcgacgggca 131101 ccaaatccgg atttccgttt gactccacgc gaaacacagt aaggcaatta agccgactag 131161 ggaacactct gcgtggtgcg ccacctcgac gcggaggcac cagcgggttg gccgcggcgg 131221 gctcgctctt gggtggtcgc cagtgattgt gaccagctgc cggagcggga atgcgtgttg 131281 gacagccgaa tccccgcaat tggcgcaacg tgccccggaa gccgcgctac atttggctcc 131341 tagccaccag acagcttgcc cgaaaacggc agaggtccct gatgtcgctt ttgatcacat 131401 caccggcgac ggtggctgcg gcggcaacac atctggcggg tatcggatcg gcgctcagca 131461 cagccaacgc ggcagcggcc gctccgacga cggcgctatc ggtcgcgggt gccgatgagg 131521 tctcggtgct gatcgcagcg ctattcgagg cgtacgccca ggagtatcag gcgctgagtg 131581 cccaggcact ggcgttccac gaccagttcg tgcaggcgct caacatgggt gcggtttgct 131641 atgcggccgc agagacagcc aacgcaactc cgctgcaggc tctgcagact gtgcagcaga 131701 acgtcctcac cgtggtcaac gcgcccaccc aggcattgct aggtcgacca atcatcggca 131761 acggtgccaa cgggttaccg aacaccgggc aagacggtgg gcccggcggg ttgctgttcg 131821 gcaacggtgg caacggcgga tccggcgggg tggatcaggc cggtggtaac ggcggtgcag 131881 ccggcctgat cggtaacggc gggtccggcg gcgtcggcgg gccggggata gctggcagtg 131941 cgggcggggc gggcggcgcc ggtgggctgc tgttcggcaa cggcgggccc ggcggggccg 132001 gtgggattgg caccaccggt gacggtgggc ctggcggtgc cggcggtaac gccatcggtc 132061 tgtttggcag cggaggtacc ggcgggatgg gcggcgtcgg cggcatgggc ggtgtcggca 132121 acggcggcaa cgcgggtaac ggcggcaccg ccggactgtt cggtcacggc ggggccggcg 132181 gtgccggggg catcggcagc gccgacggcg ggctcggtgg tggcggcggc aatggccggt 132241 tcatgggcaa cggtggggtc ggcggtgccg gcggctacgg cgctagcgga gacggcggaa 132301 acgccggcaa cggcggcttg ggcggcgtgt tcggcgatgg cggggccggt ggtaccggcg 132361 gtctgggtga cgttaacggc gggcttgccg gtattggcgg taacgccggg ttcgtccgca 132421 acggcggagc cggcggcaat ggccagctcg gcagcggcgc agtctcctcg gcgggtggga 132481 tgggcggcaa cgggggcttg gtgttcggca acggcggccc cggcggtcta ggcgggccgg 132541 gcacgtcggc cggcaacggc ggtatgggcg gcaacgctgt cggactgttc ggccagggcg 132601 gggccggcgg ggccggcggg tccggattcg gggccggtat tccaggtggc aggggcggtg 132661 acggcggtag cggcgggctg atcggcgacg gcggcaccgg tggcggtgca ggcgcgggtg 132721 acgctgctgc atcggccggt ggtaacggtg gtaacgcccg gttgatcggg aacggcggtg 132781 acggtggccc gggcatgttc ggcgggcccg gcggagctgg cggcagcggc ggcacgatat 132841 tcggcttcgc cggaaccccc gggccgagct aggcgtgttg catcccgccc aacggcgcag 132901 gcaacaatgg tgcgatgagt ggcgccagct catcggagtc gcccacctgc tatcgccatc 132961 ccgggcgccg gacctacgtc cgctgcaccc gatgtgatcg gtacatctgt ggcgaatgta 133021 tgcgcgtggg tcccgtcggc caccagtgcg cggagtgtgt gcgcgaaggc gcccgggcgg 133081 tgcggcagcc tcgtacccca ttcggcgggc ggcagcggtc ggcaactccg gtggttacat 133141 acacgctgat ctcgctgaat gcgctggtgt tcgtcatgca agtgaccgtg atgggtctgg 133201 aacggcagct cgctttgtgg ccacccgcgg tcgccagcgg tcagacctac cggttggtga 133261 cctcggcgtt cctgcactac ggggcgatgc acctgctgtt gaacatgtgg gcgctgtatg 133321 tggtgggtcc gccgttggag atgtggctgg gccggttgcg gttcggcgcg ctgtatgcgg 133381 tgagcgcgct gggtggctcg gtgttggtct atctgatcgc accgcttaat acggcgacgg 133441 cgggggcatc gggggcggtg ttcggtcttt tcggtgccac gttcatggtg gccaggcggc 133501 tccaccttga tgttcgttgg gtcgtcgcgc tcatcgtgat caacttggct ttcacgttcc 133561 tcgcgccggc gatcagctgg caggggcacg tcggcgggct ggtaacgggt gcgctggtgg 133621 cagcgaccta cgtctacgcg cccagggaac gtcggaactt gatccaggcc acagtgacga 133681 tcaccgtttt ggttgcgttc gtcgtgctga tcggctggcg cacagtcgat ttgctcgcac 133741 tgttcggtgg gcgcctcaac ctgagctgaa cacatcaaaa ccgatagccg cttgtcttcg 133801 cgtgtcttcg gggaatccga cgcggtcaca tctaaactcg ccacgatcaa gaggaggggc 133861 agcgacgtat cggcagcaag cactgcgccg gacgacgaag tggtcagggc gcgctaacag 133921 cgagagctga gccgggcggg attcactccg tgccggcacg ttctgttccc cggccccgtt 133981 gggtggcccc ggtgcgccgg gtcggtcggc tggccgtatg ggatcggccg gagcggcgca 134041 gcggaattcc agcgttagat ggccttcgtg cgatagcggt cgcgctggta ctcgccagcc 134101 atggcggcat ccccggtatg ggcggcgggt tcatcggcgt cgacgccttc ttcgtcttga 134161 gcggatttct catcacctcg ctgctgctcg acgagctggg gcgcaccggt cgtatcgatc 134221 tgagcgggtt ctggattcgc cgtgcgcggc ggctgctgcc ggcgctggtg ctgatggttc 134281 tcaccgtgag cgccgcacgc gcactatttc ctgaccaagc tctcaccggg ctacggagcg 134341 atgcgatcgc cgcgttccta tggacggcga attggcggtt tgtggcccaa aataccgatt 134401 acttcaccca gggcgctcca ccctcgcccc tacagcacac ctggtcgttg ggggtggagg 134461 agcagtatta cgttgtctgg ccactgttgc tgatcggggc gacgctactg ttggcggccc 134521 gggcgaggcg ccgttgcaga cgggccacgg tgggcggggt tcggttcgcc gcgttcctga 134581 ttgccagtct cggcacgatg gcttccgcca ccgccgcggt cgcatttacc tcggcggcca 134641 cccgcgaccg gatttacttc ggcaccgata cccgtgcgca ggcgttgctg atcggctccg 134701 cggcagcggc tctgctggtg cgggattggc catcgctgaa ccgcgggtgg tgcctgatcc 134761 ggactcgctg gggacggcgg attgcccgtc tgttgccgtt cgtcgggctg gctgggctgg 134821 cggtgacgac tcacgtcgca acgggcagtg tgggcgagtt ccgccatggt ctgctgatcg 134881 tggtggcagg tgcggccgtc atcgtggttg cctcggtagc catggagcag cgcggagcgg 134941 tggcccgcat cctggcctgg cgaccgttgg tgtggctggg caccatatcg tacggcgtct 135001 atctgtggca ctggccaatc tttctggcgc tcaacggcca acgtacgggc tggtcgggcc 135061 cggccctgtt tgccgctagg tgtgcagcca cggtggtgct ggccggtgcg tcgtggtggc 135121 tgatcgagca acctattcgg cgctggcgac cggcacgggt tccgctgttg ccgctggcag 135181 cggcgaccgt tgccagcgct gccgccgtga cgatgctcgt tgttccggtc ggagccggac 135241 cggggctacg cgagatcggc cttccgcccg gcgtttcggc ggtcgccgcg gtctcgccgt 135301 cgccgccgga agcgagtcag cccgcgcccg ggccacgaga tcccaaccgg ccgttcaccg 135361 tttcggtatt cggtgattcg atcgggtgga ctttgatgca ttacctgccg ccgactcccg 135421 gattccggtt catcgaccac accgtcatcg gctgcagcct ggtacgcggc acaccgtatc 135481 ggtacatcgg tcaaaccctg gagcagaggg cggaatgcga cggctggccg gccagatggt 135541 cggcgcaggt caaccgggac caaccggacg ttgcgttgct gatcgtcggc cgctgggaga 135601 cggtagaccg ggtcaatgag gggcggtgga cacatatcgg cgacccgacc ttcgatgcgt 135661 acctcaacgc cgagctacag cgagcgctca gcatcgttgg atccaccggg gttcgagtga 135721 tggtcaccac cgtgccctac agccgcggcg gcgaaaagcc ggacggccgc ttgtatccgg 135781 aggatcaacc cgagcgtgtg aacaaatgga acgccatgtt acataacgcc attagccaac 135841 actcgaacgt cggaatgatc gacctcaaca aaaagctttg tccagacggc gtttacacgg 135901 ccaaggtcga cggcatcaag gtccgcagtg atggtgttca tctcacccag gaaggcgtga 135961 agtggctgat accgtggctt gaggattcgg tgcgggtcgc cagttaatcc gccgtgtgct 136021 ccggatgagc gcgacggtaa ccctggaatt gtgctgtgtg ctggctgtgt cgttgtgatg 136081 agcctgtcta agtggtgcgt aaccgtttga cgagccgcgg cctcgctgca aacattgaag 136141 cccgcacgtc tgggtttgta tttacacaac gagggcgctc cccgatctgg cgcgcgcaac 136201 gaggtgcgca ctatccattc gaggtgaact ggactccttg atgctcaggc cggtgcggtt 136261 tgtcgagaaa ggcgaatagg aacagtccat gaaagtgtgg atcactgggg ctggcggaat 136321 gatggggtca catctcgccg aaatgttgct ggccgccgga cacgatgtgt acgctaccta 136381 ctgcaggccg accatcgatc cgtcggacct gcaattcaac ggagcagaag tcgatatcac 136441 cgactggtgc tcggtctacg attcgatagc gacattccgc cccgacgcgg tatttcatct 136501 cgcggcccaa agctatccgg cggtttcgtg ggcccggccg gttgagacgc tgaccaccaa 136561 catggttggc accgccatcg ttttcgaagc actacgtcgc gtgcgaccgc acgcaaagat 136621 tattgttgcg ggctcgtcgg ccgaatatgg atttgttgac ccatccgagg ttccgattaa 136681 tgagcggcga gaacttcgcc cgctccatcc gtatggtgtt tctaaggcgg ccaccgacat 136741 gctggcgtat caatatcaca agtcttacgg catgcacacc gtcgtcgctc gtatcttcaa 136801 ttgcaccggg ccacgcaaag tcggagatgc actttccgat ttcgtccgcc gttgtacatg 136861 gttggagcac catccggaac aaagtgccat ccgggtggga aatcttaaga cgaaacggac 136921 tatcgtggac gtccgcgatc tcaatcgggc gttgatgctg atgctggata aaggcgaggc 136981 cggggctgac tacaatgtgg gaggttcgat cgcctacgag atgggcgacg ttctcaaaca 137041 agtaatcgcg gcttgtaaac gtgacgatat cgtgccggaa gtcgaccccg cccttcttcg 137101 gcccaccgac gaaaagatca tctacggaga ttgcagcaag ctggcggcca taacaggctg 137161 gcaacaagaa atctgtttga ctcagacgat tgccgacatg ttcgattatt ggcgtagcaa 137221 atccgagtcc gccctgatgg tgtgaccgaa tgtctttgtc ctgccaacct gaggagcaga 137281 taagattgac cgtaacggac tctcagtatc gacaaaaggt gtgcaccgcg agaactgctg 137341 aggagatctt tgtagagaca atcgctgtca agacacgcat cctcaatgac cgggtcttgc 137401 tggaagccgc tcgcgcaatt ggggaccgct tgattgccgg ctatcgtgcg ggagcacgcg 137461 tcttcatgtg tggcaacggt ggtagcgctg cggatgcgca acattttgcc gcggagctaa 137521 cgggtcacct gatctttgat cggccaccgc ttggcgccga ggcactccac gccaattcgt 137581 cgcacctaac agcggtggcc aacgactatg actacgacac cgtttttgcc agggccctcg 137641 aaggatctgc gcgtcccggc gacacgcttt ttgcgataag tacctccggc aattctatga 137701 gtgtactgcg ggccgcgaaa accgcaaggg agttgggtgt gacggttgtt gcaatgacgg 137761 gcgaatccgg cggccagctg gcagaattcg cagatttctt gatcaacgtc ccgtcacgcg 137821 acaccgggcg aatccaggaa tctcacatcg tttttattca tgcgatctcc gaacatgtcg 137881 aacacgcgct tttcgcgcct cgccaatagg aaagccgatc cttacgcggc cattcgaaag 137941 atggtcgcgg aacgtgcggg acaccaatgg tgtctcttcc tcgatagaga cggggtcatc 138001 aatcgacaag tggtcggcga ctacgtacgg aactggcggc agtttgaatg gttgcccggg 138061 gcggcgcggg cgttgaagaa gctacgggca tgggctccgt acatcgttgt cgtgacaaac 138121 cagcagggcg tgggtgccgg attgatgagc gccgtcgacg tgatggtgat acatcggcac 138181 ctccaaatgc agcttgcatc cgatggcgtg ctgatagatg gatttcaggt ttgcccgcac 138241 caccgttcgc agcggtgtgg ctgccgtaag ccgagaccgg gtctggtcct cgactggctc 138301 ggacgacacc ccgacagtga gccattgctg agcatcgtgg ttggggacag cctcagcgat 138361 cttgaattgg cacacaacgt cgccgctgct gccggtgcat gtgccagtgt ccagataggg 138421 ggcgccagtt ctggcggtgt cgctgacgcg tcatttgact cgctctggga gttcgctgtc 138481 gcagtcggac atgcgcgggg ggagcggggc taatggcgat cttgcgcggg cgagcgccgt 138541 tgcggctcgg actcggcggt ggcgggacag acgtggaacc gtactcgagc cagtttggcg 138601 gacgaattct tagcgtaacc atcgacaaat acgcctacgc gttcgcggag cgcggaacag 138661 gagatgagat cgcctttcgc tcgccggacc gcgaccgagc cggccaggcc tcgatcgacg 138721 atctggcgtc tctcgaagaa gactttccgt tgcacgtcgc cgtctaccgg cgggtgattg 138781 cggagttcaa cggtggtaca ccgtttccgc tccagctggc gacgcaggtg gacgctcctc 138841 ccgggtcggg gctgggctcg tcgtctgctt tggtggtggc gatgcttctc acgacatgtg 138901 cgctcatcgg ctcgtcgccg ggcccatacg agctggcgcg actggcctgg gaaatcgaac 138961 gggttgatct cggcatggcc ggtggttggc aagaccacta cgccgcggct ttcggcggct 139021 tcaacttcat ggagtcccgc cccaacggag aagtcgtggt gaatccgctt cggatacggc 139081 gggaggtgat cgccgaactg gaagcttccc ttcttctgta cttcggcggc gtctccaggc 139141 tgtcgtcgga agtcatcgcc gatcaacaac gcaatgtcgt cgagcgagac gcggacgcgc 139201 ttgcggccac tcactcgatc tgcgccgagg cactcgaaat gaaggatctt ctcgtggtgg 139261 gtgacatacc cggcttcgcc gattcactgc ttcgcggctg gcaagcgaaa aagcggacgt 139321 caacccgaat ctcgaacccc gcaatcgagc acgcttacca ggtcgcgcag tccagcggca 139381 tggtcgccgg gaaagtctcg ggtgccggtg ggggtggctt cctcatgatg atcgtggacc 139441 cgcgtcgccg tatcgaagtc gcacgcagcc tcgaacgaga gtgcggagga tcggtggctc 139501 cttgcctgtt taccaaaggc ggagcggtga cctggcatat cccagagtcc acggcacccg 139561 taaggcgtgg agttgctgat gccgtggctt cagcgctcgg aaacgctgga atcttgctgt 139621 gtgctggctg tgtccttgcg acgagccact cgacttggcg cgtaccggtt tgacgatcgg 139681 ggagcccagt gcaagcatga gaccccgcaa gcaccgggcg ctgacgcctc ttcgtgaggt 139741 gactgagacg acacctccgt gtgtcctggc cgtgaggagg tgagggcgag atgagtccga 139801 gcgacagtcc cgatccgaca ttcgtcttgt cccgatctgg ctccggcatt ctttctgcct 139861 tctgagcttt cgcgagttac tgcgcatgtc cgatgtggcg cagttgtggc gttctgaatg 139921 acgcacgctg atcgggcttc ctgcaggaga agaacatgac cacgatgatc atgacttttg 139981 ttgttccaca acgtgttacc cgtgcgacga aagggcgggc acggtcgctg ctgcgggtga 140041 gtcggcgtct gacggacacg tttcgcgcac cgctcgcctg gaccccgcag gagcgggccg 140101 accggtatgt ggcacgtatg ccgatcgcgg tgattgcgga ctgagcgggc gtcggcgcgt 140161 ggcgcggtta cccgttggac cggcgctagc ccaacccgcg cgcgcgtgtt ggtacaccga 140221 cacgctgtct gggccctaca actgcgcacg ctcgcggcca gtgccgctag ccgaccacct 140281 caatgggatc accaacggtg acggcgtcga agtaccatgc cgcgttgtcc gggctgaggt 140341 tgatacagcc gtggctgacg ttggcgtatc cctgcgagtt gaccgaccag ggggccgagt 140401 gcacgtacac gccgctccag gtaacacgaa ccgcgtagtg ggcggtgagc agatacccgt 140461 ccgaggaatt cagcgggatg ccgatggtac gcgagtccat cacgaccgtg cgctccttgg 140521 acattgcgtg aaagctaccg attggtgtcg ggcggctggg cttgcctaac gacgcgggca 140581 tggtgcggag gacttctccg tttctgctga ccgtgaaggt atgtgccgag atgctggcaa 140641 ccccgatcag tgcgtcaccg gtctcgaatc cttcggtcag ttcctgcaca cccaccgaga 140701 cacgggtgtg aggtggccaa taccggtggg gcacccaccg cacgacattg ctagcgaccc 140761 actcgaagtg tccggtcgtg ttgtgcggtg tgctgatgcg gatggaccgc tcgacggcgc 140821 ggcgatcggt cacgggcgtg gtgaatgtca ccaccaccgg gtgcgccacc cccaccacgg 140881 caccattagc cggcgacacc gacgcaacgc ctgggatcgg ttggagtggc gggaccgcgg 140941 cggtcgctat gctgactgat tccgcggtga gcatcagcgt gatcgcgacc acaacggata 141001 gataacgaac cactcgacgc atggcgtcca ccctcccgag atggtgcgat cgacacacga 141061 cattctagtg accatcgacc cattgcgggc cgagcaagca gtttctggat agccccgccg 141121 ccccgcgggt gcggattggc aggccgcgcg gcctcgcgtt agcctcagcg gaatcggtgc 141181 caaggccgag gaggtgcggg tgctcttccg tcagctggag tacttcgtcg cggtcgccca 141241 ggagcggcat ttcgctcggg ccgctgagaa gtgctacgtg tcgcaacctg cgctgtcttc 141301 ggcaatcgcc aagctcgaac gcgaactcaa cgtcaccctg atcaatcgcg gacacagttt 141361 cgaaggcctt actcgcgagg gtgagcggtt ggtggtatgg gccaagcgga tacttgccga 141421 gcacgctgcg ttcaaggccg aggtggatgc ggtgcggtcc gggataaccg ggacgcttcg 141481 gctaggcacg gttcccaccg cgtcaacgac ggcatccctg gtgctgtcgg cgttttgctc 141541 ggcgcacccg ttggcgaagg tgcaagtctg ttcccggctg gctgcgaccg agctgtaccg 141601 acggctgcgc gaattcgagc tcgatgccgt catcgtgcac cccgagaccc aagacagtga 141661 tgatgttgat ctggtgccgc tctatgagga gcagtacgtg ctgttgtcgc cggcggatat 141721 gctgccgccg gggacatcga cgttggtgtg gcgggatgcc gcgcaactac cgttggcatt 141781 gctcactgcg gatatgcggg accgccaggt tatcgacgcc gcgttcgccg accacgcggt 141841 ctcggcgatc ccgcaggtcg aaaccgattc cgttgcttcg ctgttcgcac aggtggcaac 141901 cggcaactgg gcgtccatcg ttccgcacac ctggctatgg gcaatgccaa tgagcgggcc 141961 gacgggtggt gagatccgcg cggtcgaatt ggtcgatccg gtgctgaaag cccagatcgc 142021 cctggctacc aacgccttgg gaccgggatc tccggttgcc cgagcgctca taacatgcgc 142081 gcaggcgctg gcgctgaacg aattctttga cacgcagctg cgggggatca cccgtcgccg 142141 ctgatcgcgg gcgtcgctgc gctggtagtg ttcagcttcg ccaggtggcc gctctccacc 142201 ccgtctgcag ggtcgagttc gcagtcgatg agtgacggtc cgttcgatgc cagtgcatcg 142261 gtcagcgccg actccagttc ggttggggtg cttacgtgat atcctttgcc gccgaacgcc 142321 tctgcgatca gttcgtgacg tgcatgagcg ttcagcacgg tgggcgctgg gtcgtgtcgc 142381 cacaccgggg cggccgacct aaagatcgtt gcctcgtcgc cgcggtagac gccgccgttg 142441 ttgaggatga cgacggtgac cgggagtcgg tatcggcaga tggtctcgaa ctccatgccg 142501 ctgaagccaa atgcgctgtc gccctcgatc gccacgacag gtcgcccggt ctcgacggcg 142561 gccgcgatgg cgtagcccat gccgatgccc atcacgcccc aggttccgct gtcgagccgg 142621 tgccgcggta ggtgcatgtc gatgatgttg cgggcgaggt ccagcgcgtt ggcgccttcg 142681 ttgaccacat agacatccgg gttgcgttgc agcacagacc taatggcacc aagcgcgttg 142741 tagaaccgca tcggatgatg atcgtcggcc aaccgccgac gcatcttggc actgttgcgg 142801 gccttgcggt cggcgagctc gccggtccac gccgccgagg ccacgctcga acgatcggcc 142861 gcagcttcga ggagcgccga cattaccgag ccgatgtcgc cggtcagcgg tgccacgatc 142921 ggccggttgc tgtcaaactc cgacgcctcg atatcgacct ggatgaactt ggcatcggcc 142981 gaccattgcg gcgactctcc gttgcctagt agccaattca gccgagcgcc aaccagcagc 143041 accacgtcgg cgcgggccat cgccagcgaa cgagccgcag ccgccgactg cgggtgtgag 143101 tcgggcagca gccccttggc catcgacatc ggcaggaagg gaatgccggt gtgctccaca 143161 aactcccgaa taacgttgtc ggcctgcgca tatgccgcgc ccttgctgag cacgagcagc 143221 ggtcgctgcg cttgggcgag cacgtccagc gcgcgatcaa tcgcctccgg tgccggcagt 143281 agtcggggag ccgggtccac cggccgccaa atggcgccgg aagcagccga tgcctcaacg 143341 gcctggccca gcacatcgcc ggggatatcg aggtatacac cgccgggccg cccggaggtc 143401 gcggtgcgaa tggcgcgcgc gacgccgcgc ccgatgtcct ggacttggcc gatccgatac 143461 gccgccttca cgaacggtcg agcggcgttg agctggtcga ggtcctgata gtcgccgcgc 143521 tgcaggtcga ccatcggccg gctgctcgat ccggagatct ggatcatcgg gaagcagttc 143581 gtggtggcgt tcgccagcgc gggcaggccg ttgagaaagc cggggccgga cgtcgtcaga 143641 cacacgccgg gccgtgcggt gaggaacccc gcggcggccg ccgcattgcc cgctgatgct 143701 tcgtggcgga aaccgatata gcggatcccc gaggcttggg cggcgcgagc caggtcggtg 143761 atcgggatgc cgacaacgcc gtagatggtg tcgacgtcgt tggctttgag ggcgtccacc 143821 accaggtggc agccgtcggt cagcactgtg cagggagatg ccgatcgtgt ggtcatggtg 143881 ttcactgttg tccggggcgc cggccgtgtc caagaccgag tcactatgca gcgatttacg 143941 cggtctatca accgttagcg gatcggtatt ggacgccggg caggcgagcc cggcactgtg 144001 ctgatcgtgc cgaacccgca caccgaacac atggaaggag cgttcgcgat ggcatccgac 144061 ttcggcccgc gcatcgccga tcttgtcgag gtggcggcga cccggctgcc cgaggctccg 144121 gcgctcgtcg tcaccgcgga tcgcatcgcg atcagccacc gcgacctggc ccgtctggtt 144181 gatgagctgg ccggccagct gacgcggtcc ggcctgctgc ccggtgaccg ggtcgcgctg 144241 cgcatgggca gcaacgccga attcgtcgtc gccttgctgg cggcgtcgcg tgcggatctc 144301 gtcgtcgtgc cgctggatcc ggcgctgccc atcaccgagc aacgcgtccg aagccaggcc 144361 gcgggagccc gggtggtgct gattgacgcg gatgggccgc acgacagggc agaacccacc 144421 acccggtggt ggccgctcac ggtgaacgtc ggcggtgaca gcggcccctc gggtggcacc 144481 ttgtcggtcc acctggacgc cgccaccgag ccgaaccccg caacctcgac gcccgaggga 144541 ctgcgacccg atgacgccat gatcatgttc accggcggga cgaccggcct gccgaagatg 144601 gtcccctgga cgcacgcaaa catcgccagc tcggtccgcg ccatcatcac cgggtaccgg 144661 ctgagcccgc gggacgccac cgttgcggtg atgccgctct accatggcca cgggctgatc 144721 gcgtcgttgc ttgccaccct ggcgtccggc ggcgcggtgt cgctgcccgc acgcgggcga 144781 ttctccgcgc acaccttctg ggacgacatc aaagccgttg gagccacctg gtatacggcg 144841 gttccgacga ttcaccaaat cctgctggag cgatcggcaa ccgaaccgtc ggggcgcaaa 144901 cctgccgcac tgcgtttcat ccgcagctgc agcgcaccgc tcactgccca agccgcgcta 144961 gcactgcaaa ccgagttcgc ggcaccggtc gtgtgtgcct tcggcatgac cgaagccacc 145021 caccaggtaa cgacaacgca gattgagggt atcgaccaaa ccgaaactcc cgtcgtgtca 145081 accggtctgg tcggccggtc gacgggagcg caaatccgga tcgtcgggtc cgacgggctg 145141 ccactgcccg cgggcgcggt cggggagatc tggctgcggg ggaccaccgt ggtacgcggg 145201 tatctgggtg acccgacgat aaccgccgcg aatttcaccg acggttggtt gcgtaccggt 145261 gatctcgggt ccctgtcggc ggccggtgac ctgagcatcc gcggccgcat caaggaactc 145321 atcaaccgag gtggtgaaaa gatctcgccc gagcgcgtcg agggcgtgct ggccagccat 145381 ccaaacgtca tggaggcagc cgtattcggc gtcccgcacc agctctacgg cgaggcggtc 145441 gcggcggtga ttgtgcctcg tgagtccgcc ccgccgactc gcgaggagct tgtccagttc 145501 tgccgggaac ggttggcggc cttcgagatc ccggcctcct tccaggaggc cagcgggctg 145561 ccgcacaccg cgaagggttc gctcgaccgc cgcgctgtcg ccgaacggtt cggccattcg 145621 gtgtagctag ccggccccgg cctttacccg ggcggcggcg gattccggca tcggttcgta 145681 gcgggcaaac gaacgggtga aggatgcggc cccatgcgcc agcgagcgca aatcgattgc 145741 gtagcgggtc agctcgacct gaggcacctc ggccttgatc accgtgcggt cgtgccccgc 145801 ggtctcggtg ccgagcactc ggccacgacg actggacagg tcgcccaaca ccgcgccgac 145861 gaaatcgtcg ggtaccagca ccgaaatctc atcgattggc tcgagcaaga tcaccttcgt 145921 cgcggccgcg gcctcccgca atgcgagcgc gccggccatt tggaaggcga aatcggaaga 145981 gtcgacgctg tgggctttgc cgtcgagcaa cgtgacccgg atatcgacca ccgggtagcc 146041 ggcgtgcact cccttatcca tctgtgcgcg gacacctttc tccacattgg ggataaactg 146101 ccgcggcacc gccccgccaa ccactttgtc gaggaactcg aacccggagc cctccggcag 146161 cggctccacc tcgatgtcgc acaccccgta ctgaccgtga ccaccggact gtttgatgtg 146221 gcggccatgg cctttcgcat tgccggcgaa ggtttcccgc agcggcaccc gcagctcgat 146281 cgtgtctacg ctgacgccgt accggttggc cagtgtatcc aggacgacgc cggcatgggc 146341 ctcgcccata caccacagca cgacctgatg ggtctcttga ttttgctcga tccgcagtgt 146401 cgggtcttcg gcggccaacc ggcccaaccc gaccgacagc ttgtcttcgt cggtcttggc 146461 atgcgccgca atggcgatcg gcagcagcgg ctcgggcatg gtccagggtt tcagcaccag 146521 gggctcggcc ttatccgaga gtgtgtcccc ggtctcggcc cggctcagct tgccgatggc 146581 gcagatgtcg cccgcgacca cggctgctgc cgggcgctgt tgcttgccca gcgggaacga 146641 caagactccg atgcgctcgt cttcgtcgtg gtcggggtgc gtgttactag ttccgccgcc 146701 gaaaaacgat gagaaatggc ccgacacatg gaccgtcgtg tcgggcctga tggttccgga 146761 gaacacccgc accaagctga cccggccgac gtaggggtcc gacgtcgtct tcaccacctc 146821 ggcgagcaac ggcgcgtcat tgtcacaggc cagctccgca tgcgggacac cctgcggggt 146881 aaagacctcc ggcagtgggt gctccatcgg agacggaaat ccgcgggtgg ctacctcaag 146941 caattccagt gtgccgaccc cggtgctgct gcacaccgga atcaccggga agaacgagcc 147001 tcgggcgacg gctttctcca gatcctggat cagcaccgac tcgtcgatcg tctcgccgcc 147061 gaggtagcgc tccatcaagg actcatcctc ggattcctcg atgattcctt cgatcaaggc 147121 gccgcgcgcc tcctcgattc gctcggtgtc cgactcggcc ggggttcgtg tcgttcgctt 147181 gccgtcggcg tactcgtaca gtgcctgcga aagcaatccg atcaggccgt caccggacgg 147241 caggtagagc ggtaagacct tgtcgccgaa ggcgtcttgt gccgcggtca gcgcttcccg 147301 gtagttcgcc cgggcgtggt ccagcttggt gatgaccacc gcgcggggca tgccgacctg 147361 gctgcattcc tgccacaggg acttggtcgg ttcgtcgacg ccctcgttgg ccgcgatcac 147421 gaacagtgcg caatcggcgg cccgcaaccc ggcccgcagc tcacccacga agtcggcgta 147481 cccaggggtg tcgacgaggt tgaccttgat gccgtcgtaa gccagcgagg cgaccgcaag 147541 gcccaccgag cgctgttgcc ggatctccgc ctcgtcgaag tcgcagaccg tggtgccctc 147601 ggtgaccgag cccggcctgg acaacacctt ggccgccacc aggagagcct cgatgagggt 147661 ggtcttgccg ccccccgagg gccccaccag aaccacgttg cgaacgccgc cgggcccgtt 147721 tgcggtggga gcggcggccg cgccctggga agcattcact ctgtcggcca tggctttcct 147781 ccagttctcc ggggtcggtt cccgtggtgt ggcccagcag gacgtagtag gcaacttttc 147841 tcccaactgc cgcccagcac aagggtcggg tcaggtgagt agtaggcaat cggagccgtc 147901 gttgtggtca ggcgtgccag ctggcccagc gctggactgc tattgcgatt accgggccgt 147961 tcaacggaac cgattggtat tgagcatatt tggcgcgcag caggcggtat gcggcgcgca 148021 tcacctcgcc atcgcgatga atcgcggcga ccccgtcggc ccggacccac cacaactggg 148081 tccaatcatc ggcatagctg tcgacgagca cgctggcccg tggattgtgc tcgagattgg 148141 cgagccggcg cagccgctgc gtcgttttcc gcttcgcgtc gacggcggtg tagataacgt 148201 ctgcaccggt cgcctcggcc gggcgcctag cgccgagcgc gaatacgacc ggcaccaggt 148261 ggggtgtgcc gtcgggcgtg ctggtggcca gtcgtgcgac gggggactgg gcaaacctga 148321 gctttgggtc gaattccccc acggcgccag cttatgctca gctgccgccc aacgtcgcgc 148381 agtctggacg gccagacgtc gcggccgtga cagcggacat ctcgggcagc ccggtccatg 148441 gggcgtgcgt gctaatggtg ccggtggtaa tccagtggcg cgcaaggtaa ttggccgggt 148501 cggtctcggc cgccgcagga atcggttggg tcggtttgaa cgtgacagag acgaacagag 148561 accagtgcta tcgcgtcgaa cggacgaccg ttgacgcttt gacacatccc gagtatcgag 148621 tacatactcg aggcgtgcag cgggtcaggg tcacgaggaa cgcccggaag caccgcgtgt 148681 ccaagcaccg catcgtcgcc gctatgcgcc actgcggtgt tccggtcatt caggaagatg 148741 gctcgctgta ctaccagggc cgcgatacgt cgggccgtct taccgaggtc gtcgccgtcg 148801 aagccgacga cggtgacctg atcatcactc acgcaatgcc gaaggagtgg aagcgatgac 148861 gaagaagcca cgtaaccccg ccgactacgt gatcggcgac gatgtcgagg tgtctgacgt 148921 cgatctcaag caagaggagg tctatgtcga tggcgagcgg ctaacggacg agcgcgtcga 148981 gcagatggct tcagagtcgc tgcggctggc gcgcgaacga gaagccaacc tgattcctgg 149041 cggcaagtct ctgtccggcg gctctgcgca ctcgccggct gtgcaggtgg tcgtttcgaa 149101 ggctacccac gccaagctca aggagctggc gcgcagccgg aagatgagcg tatctaagct 149161 gctgcgtccc gtgctcgacg agttcgtaca gcgagaaacg ggtcggattc tcccacggcg 149221 ttagcttgtg ctcagccgcc gctcgacgtc gcgaagtctg gacagtcagc tgtcgcagcc 149281 gtgaccagcg gacatctcgg gcagctagcc cgacagggtg cgcgtgcacc tggcccgggt 149341 ggtaatccat tgacgcgcac ggcaattggc cggctcggtc tcggtctgcg gataccgcac 149401 tgaagggcga caattttggc gaaaaggccg tgtgcggtgc cgggtcgcgc tacgttcaga 149461 ttcacctaac aatgtcgtcc gccaacgagc gtgttcgccg gtggtggggc gggcgggttg 149521 gggaggtgtg tgatgtcgtt tgtcagcgta gccccggaga ttgtggtggc cgcggcaaca 149581 gacctggcgg gtatcggatc ggcgatcagc gcggccaatg ccgccgcggc tgcgccgacc 149641 accgccgtgc tggccgcggg tgccgatgag gtgtcggcgg cgatcgcggc gctgttttcc 149701 ggccacgctc aggcctatca ggcgctcagc gcccaggcgg cggcgtttca tcagcagttc 149761 gtgcagacgc ttgccggtgg cgctggagca tatgcggccg ccgaggccca ggtcgagcag 149821 cagctgctgg ccgcgatcaa cgcgcccacc caggcgctgc tggggcgccc cttgatcggc 149881 aacggtgccg atggggcgcc ggggactggg caggccggcg gggctggggg gatcttgtac 149941 ggcaatggcg gcaatggcgg ctccggggcg gctgggcagg ccgggggtgc cggcgggccg 150001 gccgggctga tcggccatgg cgggtccggc ggggccggcg gctccggcgc ggccggcggg 150061 gccggcgggc acggcggatg gctgtggggc aacggcggcg tcggcggatc cggcggggcg 150121 ggtgtcggcg caggcgtggc tggcggtcac ggcggtgcgg gcggtgccgc cgggctgtgg 150181 ggcgccggcg gcggcggtgg caatggcggg aacggcgccg atgccaacat cgtcagcggt 150241 ggagacggtg gcctcggcgg tgccggtggc ggtggcggat ggctctacgg cgacggcggg 150301 gccggcggac acggcggaca aggcgcaatc ggcctcggcg gcggcgccgg cggcgacggg 150361 ggccagggcg gcgccggccg cggactgtgg ggtactggcg gcgccggcgg acacggcggg 150421 caaggcggtg gtaccggggg cccaccgctg cccggtcagg caggcatggg cgccgcgggt 150481 ggcgccggtg ggctgatcgg caacggcggg gccggcggcg acggcggtgt cggcgcgtcc 150541 ggcggggtcg ccggagtagg cggtgccggc gggaacgcca tgctgatcgg gcacggcggc 150601 gccggcggcg ccggcggaga cagcagtttc gctaatggcg cggccggcgg cgcgggcggt 150661 gccggagggc acctcttcgg caatggcggg tccggcggcc acggcggagc cgtcacggcc 150721 ggcaacaccg gtatcggtgg cgccggcggc gtcggtgggg acgccaggct gatcggccac 150781 ggtggcgccg gcggtgccgg cggggaccgc gccggagcct tggttggccg tgacggcggg 150841 cccggtggga acgggggcgc tggcggccag ctatacggca acggcggcga cggcgccccc 150901 ggcaccggcg gaacactgca ggcggcggtg agcggattgg tgacggcttt gttcggtgca 150961 cccggccaac ccggcgacac cggccaaccc ggctagcccc gatcaacgag ggtttcggtg 151021 ccggtccggg gcatggccat ccgctgagct ggcgatctgg actacgttgg tgtagaaaaa 151081 tcctgccgcc cggaccctta aggctgggac aatttctgat agctaccccg acacaggagg 151141 ttacgggatg agcaattcgc gccgccgctc actcaggtgg tcatggttgc tgagcgtgct 151201 ggctgccgtc gggctgggcc tggccacggc gccggcccag gcggccccgc cggccttgtc 151261 gcaggaccgg ttcgccgact tccccgcgct gcccctcgac ccgtccgcga tggtcgccca 151321 agtggggcca caggtggtca acatcaacac caaactgggc tacaacaacg ccgtgggcgc 151381 cgggaccggc atcgtcatcg atcccaacgg tgtcgtgctg accaacaacc acgtgatcgc 151441 gggcgccacc gacatcaatg cgttcagcgt cggctccggc caaacctacg gcgtcgatgt 151501 ggtcgggtat gaccgcaccc aggatgtcgc ggtgctgcag ctgcgcggtg ccggtggcct 151561 gccgtcggcg gcgatcggtg gcggcgtcgc ggttggtgag cccgtcgtcg cgatgggcaa 151621 cagcggtggg cagggcggaa cgccccgtgc ggtgcctggc agggtggtcg cgctcggcca 151681 aaccgtgcag gcgtcggatt cgctgaccgg tgccgaagag acattgaacg ggttgatcca 151741 gttcgatgcc gcgatccagc ccggtgattc gggcgggccc gtcgtcaacg gcctaggaca 151801 ggtggtcggt atgaacacgg ccgcgtccga taacttccag ctgtcccagg gtgggcaggg 151861 attcgccatt ccgatcgggc aggcgatggc gatcgcgggc cagatccgat cgggtggggg 151921 gtcacccacc gttcatatcg ggcctaccgc cttcctcggc ttgggtgttg tcgacaacaa 151981 cggcaacggc gcacgagtcc aacgcgtggt cgggagcgct ccggcggcaa gtctcggcat 152041 ctccaccggc gacgtgatca ccgcggtcga cggcgctccg atcaactcgg ccaccgcgat 152101 ggcggacgcg cttaacgggc atcatcccgg tgacgtcatc tcggtgacct ggcaaaccaa 152161 gtcgggcggc acgcgtacag ggaacgtgac attggccgag ggacccccgg cctgatttcg 152221 tcgcggatac cacccgccgg ccggccaatt ggattggcgc cagccgtgat tgccgcgtga 152281 gcccccgagt tccgtctccc gtgcgcgtgg catcgtggaa gcaatgaacg aggcagaaca 152341 cagcgtcgag caccctcccg tgcagggcag tcacgtcgaa ggcggtgtgg tcgagcatcc 152401 ggatgccaag gacttcggca gcgccgccgc cctgcccgcc gatccgacct ggtttaagca 152461 cgccgtcttc tacgaggtgc tggtccgggc gttcttcgac gccagcgcgg acggttccgg 152521 cgatctgcgt ggactcatcg atcgcctcga ctacctgcag tggcttggca tcgactgcat 152581 ctggttgccg ccgttctacg actcgccgct gcgcgacggc ggttacgaca ttcgcgactt 152641 ctacaaggtg ctgcccgaat tcggcaccgt cgacgatttc gtcgccctgg tcgacgccgc 152701 tcaccggcga ggtatccgca tcatcaccga cctggtgatg aatcacacct cggagtcgca 152761 cccctggttt caggagtccc gccgcgaccc agacggaccg tacggtgact attacgtgtg 152821 gagcgacacc agcgagcgct acaccgacgc ccggatcatc ttcgtcgaca ccgaagagtc 152881 gaactggtca ttcgatcctg tccgccgaca gttctactgg caccgattct tctcccacca 152941 accggatctg aactacgaca accccgccgt gcaagaggcg atgatcgacg tcatccgctt 153001 ttggctcggc ttgggcatcg acgggtttcg gttggacgcg gtgccctatc tctttgaacg 153061 tgagggcacc aactgcgaga acctgccgga aacacacgct tttctcaagc gagtccgcaa 153121 ggtggtggac gacgaattcc ccggccgggt gctgctagcc gaagccaatc agtggccggg 153181 cgatgtcgtc gaatatttcg gtgatcccaa caccggtggc gacgagtgcc acatggcctt 153241 tcacttcccg ctgatgccgc gcatcttcat ggccgtgcgc cgggagtccc gttttccgat 153301 ctcggagatc atcgcccaga ccccaccaat ccctgacatg gcgcaatggg ggatatttct 153361 gcgcaaccac gacgagctga cgttagaaat ggtcaccgac gaagagcgcg actacatgta 153421 cgccgagtac gccaaggatc cacggatgaa ggcgaatgtc ggaatccgtc gtcggcttgc 153481 gccgctgctc gacaacgacc gcaaccagat cgagctgttc accgcgctgc tgctgtcgct 153541 gcccggctcg ccggtcctct actacggcga cgagatcggg atgggcgacg tgatctggtt 153601 gggtgatcgc gacggcgtgc gcatcccgat gcagtggaca ccggaccgca acgcgggttt 153661 ctccaccgcc aacccgggtc ggctgtacct gccgcccagc caggacccgg tttacgggta 153721 tcaggccgtc aacgtcgagg cgcaacgcga cacctcgacg tcgctgctca acttcactcg 153781 caccatgctg gccgtgcgtc gccgacaccc cgcgtttgcg gtcggcgcat tccaggaatt 153841 gggcgggtcc aacccgtcgg tgctggccta cgtgcgtcag gtggccggcg atgacggcga 153901 caccgtgctc tgtgtcaaca acctgtcgcg attcccgcag cccatcgaat tggacttgca 153961 gcaatggacc aactacacgc cggtcgagct gaccgggcac gtggagtttc cacgcatcgg 154021 ccaggtgccc tatctgctga cgctgccagg acacgggttc tactggttcc agttgaccac 154081 acatgaggtg ggggcacctc ccacttgcgg gggagagcgg cgcctatgac tcgcgccggc 154141 gacgatgcac agcgaagcga tgaggaggag cggcgcctat gactcgcgcc agcgacgatg 154201 cacagcgaag cgatgaggag gagcggcgcc tatgactcgg tcggacacgc tggcaaccaa 154261 gctgccatgg tccgattggc tttcgcggca acgttggtat gccggacgca accgcgagct 154321 ggccacggtc aagccgggcg tagtcgtcgc cctgcgacac aacctcgacc tagtcctggt 154381 cgacgtaacc tacaccgacg gtgcaacgga gcgttaccag gtgctcgtcg gatgggattt 154441 tgagccggcg tccgagtacg gcacgaaagc cgccatcggc gtcgccgacg atcgcacggg 154501 attcgatgct ctctacgacg tcgccgggcc gcaattcctc ctgtcgctaa tcgtctcgtc 154561 cgccgtctgt ggcacatcca ccggcgaagt aacgttcacc agggagccag acgtcgagct 154621 gccctttgcc gcgcagccgc gggtatgtga cgccgaacag agcaacacca gtgtgatctt 154681 cgatcggcgg gctatcctca aggtgttccg ccgggtaagc agcgggatca accccgacat 154741 agagctgaac cgcgtgctta cccgtgccgg taatccacat gtggcccgcc tgctgggcgc 154801 ttaccagttt gggcggccca atcgttcgcc aaccgatgct ctggcgtacg ccctgggcat 154861 ggtgaccgag tatgaggcga acgcggccga aggctgggcg atggccaccg ccagcgtgcg 154921 ggacctcttc gccgagggag acctctatgc ccacgaagtc ggcggcgatt tcgccggtga 154981 atcctaccgg ctcggcgagg cggtcgcctc ggtgcacgcc acgctggctg acagcctcgg 155041 aaccgcgcag gcaacgttcc cggtggaccg gatgctggcg cggctgtcgt cgacggtggc 155101 ggtggtgccc gaactgcggg agtacgcgcc aacgatcgaa cagcaattcc agaagctcgc 155161 ggcggaggca atcacggtcc agcgggtgca cggtgacctg cacttgggac aggtgctgcg 155221 taccccggaa agctggctgt tgatcgactt tgaaggcgag ccgggccagc cgctggacga 155281 acggcgagcg ccggattcgc cgctgcgcga cgtggccggt gtgttgcgat cgttcgagta 155341 cgccgcttac gggccgctgg tggaccaggc caccgacaaa caacttgccg ctcgcgcccg 155401 cgaatgggtc gagcgcaatc gcgccgcctt ctgcgacggc tacgcggtcg cgtcgggaat 155461 cgacccgcga gattcggcgc tgctgttggg cgcctacgaa ctcgacaagg cggtttatga 155521 gaccggctat gagacacggc accggccggg ttggcttccg attccgctgc gttcgatcgc 155581 ccgcctgacc gctagctgat accggccggg gtgtccggct tattgcttgg cgtgcgtgcg 155641 tcctgggcgt ctggaagcat gctcgtgtgc aacgagagat ttatgacggt gaggcgcggc 155701 tgtcatgggt gttggcggcg ctggccggga tactgggggc aaccgcgttc acccactccg 155761 cgggatactt cgttactttc atgaccggca attcgcagcg cgcggtgctg ggattgtttg 155821 gggacgacgc gtggatgtct gtcaccgcgt cgttgctgat tctattcttc gtcgccggcg 155881 tggtgattgc gtcggtgtgc cggcggcatt tctgggcggc gcatccccac ggcccgaccg 155941 tgctgaccac cttcagtttg atatttgccg ccggagtcga cattatgctg ggcggctggc 156001 acgagagcat gctcgatttt gtgccgattc tgttcgtggt cttcgggatt ggcgccttga 156061 acacatcgtt cgtcaaggat ggcgaggtat cggttccgtt gagctatgtg accggcacat 156121 tggtcaagat gggccagggc atcgaacgtc acctggccgg cggaaaagtg gaggactggc 156181 tcggctactt cctgctgcac gccagcttcg tgctaggcgc cgcggccggt ggcgccatta 156241 gtatggtcgt caccggaccc cagatgctcg cggtcgcggc ggtagtgtgc gctgcgacaa 156301 ccggctatac ctacctgcac gctgaccggc gagggttggt caatcaaaag cggccccagc 156361 cgggaaagcg gctctttcga gcgctcaggc gaggcgaatt agattcggga acctccacgc 156421 ccgcaaccaa ttacgggtcg agttagcttg gcttccagtg gcgctggcga aggggtgacc 156481 acgccaactt cacccggaag gtccgaccca gtgcggatgt tccacacatc ggcagcagcg 156541 ctggccgttg cgctgctgcc gatgctggct tgctggctca ggcggccggc gcagcagggg 156601 cggccggggg tgtcgcgccg ttgagcacat gctggatatc ggccttcatg gcgaccagct 156661 gctcgttcca gtagggccac gagtgtgttc cgttgggcgg gaagttaaac accccgttgc 156721 gtccaccgtc ggccgcgtag gtgtcccgga aggtctggtt ggtgcgcagg gtgaggcctt 156781 ccaggaactt cgccggtatg ttgtcgccgc cgaggtcgct gggtgtgccg ttaccgcagt 156841 acacccagat ccgggtgttg ttggcgacca ggcggggaat ctgaaccatt gggtcgttgc 156901 gcttccaggc cgggtcgctg gacggacccc acatgctgtt ggcgttgtaa ccgcccgagt 156961 cgttcatcgc caggccgatc agcgtcggcc accagccctc ggacgggttg aggaagcccg 157021 acaacgacgc ggcgtacggg aactgctgcg ggtagtacgc ggccaggatc agcgcggaac 157081 cgcccgacat cgaaagaccc accgccgcgt tgcctgtcgg ggacacgccc ttgttggcct 157141 gtagccaggc gggcatctct ctggtaagga aggtctccca cttgtaggtg tagttctggc 157201 cgttgctctg cgagggctga taccagtcgg tgtagaaact ggattggccg cccacgggca 157261 tgatcaccga caaccctgac tggtagtact cctcgaaggc cggggtgttg atgtcccagc 157321 cgttgtagtc atcctgggcc cgcagaccgt cgagcaggta gaccgcgtgc ggtccgccgc 157381 cctggaactg gaccttgatg tcgcggccca tcgacgcgga tggcacctgc agatattcca 157441 ctggaagacc gggcctagag aatgcgcccg cggtggccgg cccgccgaag gtaccgacca 157501 gaccgtaaac caggacagcc cccatagccg cgatagccag ccggcgcggc agggttgtcg 157561 ctgcgctccg caaccttcgc acctgttcga agaacgtcat agctactacc aatcccaact 157621 ctcatctgcc gcacgacgcg gtcgaatctg ttctgggcga gtgaaacaca ccgaggacgc 157681 tcagttcgaa tgtcgtggcc gcagcgcgag atcgcggttg gctaacgatt cagcgtcggc 157741 ccggacacct tgggcgattg acacacccgg gtcacggctg gctcccgagc ggcgcaacga 157801 ccgcacgcac aacccctatg cttactgccg accagaggag agacccatgc gcaccttcga 157861 gtcggtcgcc gacctggccg ccgccgcggg cgaaaaggtc gggcagagcg actgggtgac 157921 catcacccag gaagaggtca atctgttcgc cgacgcaacg ggtgatcacc agtggatcca 157981 cgtcgacccg gaacgggccg ctgcgggtcc ctttggcacc accatcgcgc acggattcat 158041 gaccctggcg ttgctcccgc gcctgcaaca ccagatgtac accgtcaagg gcgtcaagct 158101 ggcaatcaac tacggcctga acaaggttcg cttcccggca ccagtacccg tcggctcgcg 158161 ggtgcgtgcg acgagctcgc tggtcggtgt cgaggatctg ggcaacggca ccgtgcaggc 158221 gacggtgtcg acgaccgtcg aggtcgaggg atcggccaag ccggcgtgtg tggccgaaag 158281 catcgtgcgc tacgtcgcct gaggcaactc gcggtcagaa ttcggcgatc gcgtgctcga 158341 ggcgttgggc cagccaggcc tcggcgtgcg cgcgccgggt cggaatgtgc tgtgacggga 158401 aaagcgttgt caccggctgg tattcgcgca gcgtacggcg ggcgacggtc atcttgtgca 158461 gctcagtggc gccgtcggcg atgcccagtg actcggctgc cagcatcatc ttgacgaacg 158521 gcatctcgtc ggagaccccg agcgcgccgt gcaggtgcat ggcccgctgc acgacgtcat 158581 gcagcacctg gggcatcgcc accttgaccg ccgcgatgtc gcggcgcacc ttttgatagt 158641 cgtggtgttt gtcgataagc cacgcggtgc gcagtaccag cagccggaac tgctcgatct 158701 ggatccaact atcggcgatc ttctcctggg tcatctgcag atcggcgagc cgcccgtgtc 158761 tagtctggcg cgacagggca cgctcgcaca tcatgtcgaa tgctctgcgc gccagcgcga 158821 ttgtccgcat cgcgtgatgt attcggccgc cgcccaatcg ggtctgcgcg atcatgaacg 158881 cttggccctc gccgccgagc acatgatcgg ccggcacccg gacgtcgtgg tagcggatgt 158941 agccgtggct ggcgtgccgg gtggactcgg ctcccacacc gacgttgcgc acgatctcga 159001 tgcccggggt gtcggccggg acgatgaaca gcgacatctt ctcgtacgta cgggcttccg 159061 gcttggtgac ggccatgacg ataaagaacg acgcatgctt ggcgttggtg gaaaaccact 159121 tctcgccgtt gatgatccag tccccgtttc ccgcggcatc gcgggtcgcc gcggtcacga 159181 acagcccggg atcggaacca ccctgcggct cggtcatcga atagcaggag gtgatctcgc 159241 cgtcgagcag cggtcgtaga tagcgggctt tctgctcgtc ggtgccgaac agcgccagga 159301 tctcggcgtt gccggagtcc ggcgcctgac agccgaacgc cgacggcgcc caccgggagc 159361 ggccgatgat ctcgttgagc agcgccagct tgacctgacc gaagccctgt ccgccgagtt 159421 cgggacgcaa atgcgcggcc cacaacccct ggtctttcac ctgccgctgc agcggccgca 159481 ggatcgccat cgtgtcggcg ttctttttgt cgtaaggatc gagggcgacc agatcgagcg 159541 gttcgagttc ctcggccatg aatttttcga cccaatccag cttggactgg tattgcgggt 159601 cggtttcgaa gtcccacacc gtcggcaacc gttccccggc gcggcgtcgc accggcatcg 159661 ttgatagagc aagaccatcg taggtgcggt ctagcggctt cagcgcagtt cgggcaggac 159721 gttggtgcgg tagaagtcga tggcggtgat cgggtcgtcc tgggggaaat gcaggaaggg 159781 gacggcgccg gcgtcgagaa ccgcttgcac cgcaccgatg tggacgccgg gatcggtacc 159841 gaccgcccaa ttggccagca ctttctcgat cgggttcgac tcggcggcac gctggatctc 159901 gaccggattg ggctggtcga cggccccggc ggtgaatcgc cacaagtcgg cggcgcgggc 159961 ggccgccttg tcgtcgccga cgacggcgaa cagttcggcc cgcttaccca gggtggtggg 160021 atctcgtccg gccgcttgag cgcccgcggc gaacgcggcg agcagcttgg cgtcgttgat 160081 gtcgcgggct tgggcgatcc aaccatcacc gtatcggccg gccagggtgg cgctctgggg 160141 gccgctcgcg gcgacaaaga tcggcggcgg catcgccggc gtgtcgtaga gcttgagctc 160201 gtcggtccga aaatagtggc ccgtgaacga gatccgctca ccgctccaca gctggcggat 160261 cagtacgatg gcctcgatca gccggtcgtg gcgctcgcgg tagttgccga acgtgtcggt 160321 ggcggcttgt tcgttgagcc gctcgccggt gcccagcccc agaaacaccc gtccggggtt 160381 caggatcgcc agcgaggcaa acgcctgagc gacggtggcc ggatggtagc ggtatatggg 160441 acaggtcacc ccggtgccga acaagatgct gctggtgctg ttgcccacca acgccagggt 160501 cagccaggga aacatcgaat ggccctcgtt gtcttgccat ggctgtaggt ggtcgctggc 160561 ccacacatac cggaagccag cttgctcggc ggcttgggcg tgcgccacca gccgatcggt 160621 gcggaattgt tcgtgggata agacgacacc caccccgcgg cttgccggct ctggggtcgg 160681 cgtcggaccg ctgcgcgtgc tgcaaccgcc gcctagccca ccggcgccga tcgcgccgaa 160741 cccggcggcc agaccgaacg tccgccgtga gatgccggtc atcgggctgc actacccgcg 160801 tcgcgctgca gcacaccttc gagagtgcat cctgactcac cgtcggcgcc accggttagc 160861 ctggcgagat gaccccgcag gcacgcccag cgcgcagggc cgatgtccgc gagctgtccc 160921 gcaccatggc ccgggcgttc tatgacgatc cggtcatgag ctggttactg tcgaacgaca 160981 acgcccgcac cgcaaggctg acccggttgt tcgcgacgat tgtccgccac cagcatctgg 161041 ccggcggtgg tgtggaagtg gcccgcggcg cggcgggcat cggcggggcg gcgctgtggg 161101 atcccccgga tcgatggcgg gagtcgcgcc gccagcaact ggcgatgaca ccggggttcc 161161 tgcgggtgtt cggctttcgg acggccaagg cccgcgcggc gctggacgtg atgatgcgtg 161221 tgcatcccga agaaccccac tggtatctgg ccgccatcgg cagcgacccg acggtccgcg 161281 gccaggggtt cggtcaggtg ctgatgcggt cacggctgga ccgttgcgat gccgaacact 161341 gtccggccta cctcgaatcc accaaacccg agaatgtgcc ctactatcaa cggttcggtt 161401 tccgggtgac ccgtgagatc gctctgcccg acgcggggcc gccgctatgg gcgatgtggc 161461 gggagcctcg gtagcggttc ttggcagctg gatcgttcgt ccggccgggt gatcactgcg 161521 cgaccgtgaa tctggcgacg ccgcaccggc gtgtcgcgtc gccagactca cagtcgcggc 161581 aatctctgac cgccggtgcg ctgagatagc tcccgaggtg caaaagtggt gcgcagatcg 161641 tcaggctgag cttgccggga tcgcgtgggt cggcacccgc agccgtcgtc tgccacccaa 161701 tagtgtgtgc gacccgcccg gtacacgcgg aatcaacggg tatgcggttc tggcataggc 161761 ttgtcaggca atgatcgctc tgcccgcctt ggaaggtgtc gaacatcggc acgtggatgt 161821 ggcggaaggc gtcaggatcc acgttgcgga cgccgggccg gccgatggtc cggcggtaat 161881 gctggtgcac ggcttcccgc agaactggtg ggagtggcgc gacctcatcg gcccgctggc 161941 cgccgacggc aaccgggtgc tgtgtcccga cctgcgcggc gcgggctgga gttcggcgcc 162001 ccgctcgcgg tataccaaga ccgagatggc tgacgatctg gctgcggttt tggacggcct 162061 gggtgtggcc aaggtcaagc tggtggccca cgattggggt gggccggtcg cgttcatcat 162121 gatgttgcgc catcccgaga aggtgaccgg gtttttcggc gtgaacaccg tggcaccctg 162181 ggtgaagcgc gatcttggca tgctccgcaa tatgtggcgg ttctggtatc agatccccat 162241 gtcgctgccg gtgatcggcc cgcgggtgat cagcgatcct aagggccgct acttccggct 162301 gttgaccggg tgggtcgggg gcggatttcg ggttcccgat gacgacgtgc gcctgtactt 162361 ggactgcatg cgcgagccgg ggcacgccga ggccggatcg cggtggtatc gcacctttca 162421 gaccagggaa atgctgcgct ggctgcgcgg cgagtacaac gacgctcggg tcgatgtccc 162481 ggtccgatgg ctgcacggca ccggagatcc ggtgatcacg cccgacctgc tggacggcta 162541 tgccgagcgg gccagcgatt tcgaggtgga gctggtcgac ggcgtgggcc attggatcgt 162601 cgagcagcga cccgagctgg tgctcgaccg ggtgcgtgcg ttcctagctg cggggaccga 162661 gcagcgcgat tgacgcatcc accgccggct cgacgatgtt ccggatcggc tggccgtcct 162721 cgacggtcag cgcggtcagt tcacgcaaac cgcccagcaa gattacggcc agtggcacat 162781 tcagtggcgg taggttagcc cgccggaacc cagggctggc gctgagctcg atcagcaggc 162841 tggttagctg ctccatgccg cggcgctgga cggggtaagc ggcggcaccg agcgacggga 162901 attcacggat ccaactcaac gtcaccgccg gcctggattc gatatgggtg acgtaggcct 162961 cgaccgcctg acgaatctgg tcgtgccagt cggcgtttgg atcgacggcc gcccggatgc 163021 tgttgcccaa cgtctcgttg tccgctagca ggagttccaa aaagcactgt tccttgctgg 163081 tgaaccggtc gtagaacgtg cgcttggatg tgcgggcgtg ccggacgatg tcggagacgg 163141 tggtggcgcg ataaccccgc tcaccgatcg aggcgaccag gccgtcgagc aaccgtagcc 163201 gaaacgagtc ggtctcgacg accaacgcgc cggcggcgac tgctgtcacc cgcgcctcct 163261 ctacctatcc cttgtcaggt ttggtaccaa agagtaccgt actggacaag ccacggtaca 163321 ccaccgtacc acgcccgatc cagggacgtt aggagcaaca ccgccatgag cgaagtcgtc 163381 accgccgcac cggcaccgcc cgtagtccga cttcccccgg cggtccgcgg gccgaagttg 163441 ttccagggat tggccttcgt ggtgtcacgg cgacggctgc tggggcggtt cgtgcgtcgc 163501 tacggcaagg ccttcaccgc caatatcctg atgtacggcc gggtcgtggt ggtcgccgac 163561 ccgcagctag ccaggcaggt cttcaccagc agtcctgagg agctgggcaa catccagccc 163621 aacctgagtc ggatgttcgg ttccggctcg gtgttcgcgc tggacggcga cgaccaccgg 163681 cggcggcgcc ggctactggc gccgcctttc cacggcaaga gcatgaagaa ctacgagacc 163741 atcatcgaag aggagaccct gcgcgagacc gccaattggc cgcaaggaca ggctttcgca 163801 acgctgccgt caatgatgca tatcacgctc aacgccatcc tgcgtgcgat cttcggggcc 163861 ggcggcagtg aactagacga gctgcgccgc ctcattccgc cgtgggtcac gctgggctcg 163921 cgcctggcgg cgctaccgaa acccaaacgc gactatggcc gccttagccc gtggggccgg 163981 ctggccgagt ggcggcgcca gtacgacact gtcatcgaca agctcatcga agccgagcgg 164041 gccgacccga acttcgccga tcggaccgac gtattggcgt tgatgctgcg cagcacttac 164101 gacgacggtt ccatcatgtc gcgcaaggac attggcgacg agctgctcac gctgctggcc 164161 gccgggcacg aaaccacggc ggcgacactg ggctgggcgt tcgagcggct cagccggcac 164221 cccgacgtgc tcgcggctct ggtcgaggag gtcgacaacg gcggtcacga gctgcgtcaa 164281 gcggcgatcc tggaggtaca gcgggccagg accgtcatcg attttgcggc tcgtcgcgtc 164341 aatccacccg tttaccagct cggcgagtgg gtgattcccc gcgggtattc gatcattatc 164401 aatatcgccc agatacatgg cgatcccgac gtcttcccgc agccggatcg cttcgacccg 164461 cagcgctaca tcggaagtaa gccatccccg tttgcgtgga tcccttttgg tggcgggacc 164521 cgccgctgtg tcggggccgc attcgccaac atggagatgg atgtggtgct gcgaacggtg 164581 ctgcgccact tcaccctcga gaccaccacg gccgcgggcg agcgcagcca cggtcgagga 164641 gttgcattca ccccgaagga tggcggtcgg gtggtgatgc gccgacgctg acggccagct 164701 cgggcccgcg ttcaggtccc gagttcgggt gaaaggctgg cccgcagtgc agattcggcg 164761 gtccgtcggg gtagcctcca gccgggccgg acgaagtggc acgtgtaccc gttggggtag 164821 cgctgcaggt agtcctggtg ctcgggttcg gcttcccaga aatccccggc cgggctgacc 164881 tcggtcacca ccttgccggg ccacaggccg gatgcctcga catcggcgat ggtgtccagc 164941 gcgatccgct tttgctgctc atcgaagtag aagatggccg accggtagct ggtcccccgg 165001 tcgttacctt gccggtcttt ggttgtcggg tcgtggatct ggaagaagaa ttccagcagg 165061 gtgcggtaat cggtgaccgt ggggtcgaag atgatttcga cggcttcggc gtgcgtgccg 165121 tggttacggt aggttgcgtt ggggatgttc ccgccgctgt agcccacccg cgtggagacc 165181 acaccgggct ggttgcggat cagatcctgc agcccccaaa agcagccgcc ggcgaggatc 165241 gctttctgat tgctcgtcat ttccggacct cccgatcagg ctacactccg gcgatggagt 165301 gtaacggcgc gaagaccgca ctgtgagcgc ttcggagttc tcccgtgctg aactcgccgc 165361 cgccttcgag aagttcgaga agaccgtggc ccgcgccgcc gcgacgcgcg actgggattg 165421 ctgggtgcag cactacaccc ccgacgtcga atacatcgag cacgcggcgg gcatcatgcg 165481 aggccgccag cgggtacgtg cctggattca agaaacgatg acgaccttcc cgggcagtca 165541 catggtggcc ttcccgtcgc tgtggtcggt gatcgacgag tccaccgggc gaattatctg 165601 cgaattggac aaccccatgc tcgaccccgg cgacggcagc gtgatcagcg cgacgaacat 165661 ttcgatcatc acctatgccg gcaatggcca gtggtgccgt caagaagaca tctacaaccc 165721 gttgcggttc ctgcgggcgg cgatgaagtg gtgtcgcaag gcgcaggagt tgggcaccct 165781 cgacgaggac gcggcgcgtt ggatgcgccg gcatggaggt ccttaaatga acgcacccaa 165841 gctggtcatt ggcgcgaacg gcttcctggg ttcgcacgtg actcgccagc tcgtcgccga 165901 ctgcgcgccg cagaaaggtg aggtacgcgc gatggtgcga cccgctgcca acacccggag 165961 catcgacgat ctaccgctca cccgattcca cggcgacgtc ttcgacaccg ccaccgtggc 166021 cgaggcgatg gccggctgcg acgacgtcta ctactgtgtg gtcgacaccc gcgcctggtt 166081 gcgcgatccc tccccgctgt ttcgcaccaa tgtggcaggc ctgcgcaacg tcctcgatgt 166141 ggccacagac gccagcctgc gcaggttcgt cttcaccagc agttatgcga cggtgggtcg 166201 tcggcgtgga cacgtggcga ccgaagaaga ccgggtggat acccgcaagg tgactcctta 166261 cgtgcggtcc cgggtggcgg ccgaggatct ggtgctgcaa tacgcgcacg acgcaggtct 166321 gcccgccgtc gcgatgtgtg tgtcgacaac ctacggcggc ggcgactggg gccgcacccc 166381 acacggcgcc ttcatcgcgg gcgcggtgtt cggcaggctg cctttcacga tgcgcggcat 166441 ccggctggag gcggtgggtg tcgacgatgc tgcgagggcg ctgatcttgg cggccgaacg 166501 cgggcgcaac ggcgaacggt acctcatctc cgaacgcatg atgccgttgc aagaagtggt 166561 gcggatcgcc gcggatgagg ccggtgtccc gccgccacga tggtcgatct cggtgccggt 166621 gctttacgcc ctgggtgcgt tgggcagttt gcgagcccga ctcacgggca aagataccga 166681 actcagcctg gcgtcggtgc gcatgatgcg ttccgaggcc gatgtcgacc acggcaaggc 166741 cgtccgcgag ttgggttggc agccacgtcc ggtggaggag tcgatccggg aggccgcccg 166801 gttctgggcg gcgatgcgca ccgtcgggaa ggaccccgcg gcctcgtgat ccgaaaaggc 166861 ctagggacgc tgccgggaat gttgatcgcc ggcacgtgtt gcacaggtca tgagcaaccg 166921 gattgtgtta gaacccagcg ccgatcaccc gatcaccatc gagccgacca accgacgggt 166981 gcaggtacgc gtcaatggcg aggtggtcgc ggacacggcc gcggcgctgt gcttgcagga 167041 agccagttac cctgcagtgc aatatattcc gttggccgac gtggtacagg ataggctgat 167101 ccgcaccgag accagcacct attgcccgtt caagggtgaa gccagctatt acagcgtgac 167161 taccgacgcc ggcgacatcg tcgacgacgt gatgtggacg tacgaaaacc cttatccggc 167221 ggtagcggcg atcgcggggc atgtcgcgtg ctatccggac aaagccgaaa tcagcatctt 167281 cccggggtag cgcaggctac cgggtatacc tcggccaacg actgggtgtc gctgtattcg 167341 cgcagcgaga tgatcatccc gtcacgggtc tcgaagatgc agacgaacgg gctgtcatat 167401 cgggtccggt cggcgctcac accgtcgcaa tgcccctcga ccactaccgt ttcaccctcg 167461 ttgacgcagc ggatgagttc gatgttgacc tcgaagacct gcttgcgccg ctcgactgct 167521 cgccgaaacg tcttcttgtc caattccgta cgggtgacga tgctccagta ggtgaagtcg 167581 ttgctgagca gcgcgaagcc ttcgtcgaga tctccgccct cgcagaggct ttgcaggaac 167641 atccaggcca gttcggcttg cgggtcgtcg aacggcgtca tcacatcgcc atcttgtctc 167701 gggagacagc gtgcggtcaa ttgacgtggt cgtcgaagcg gtggtcacct tcgcgggggc 167761 ggccggcttc gcgcacacct tggcgccgtt gcgtcgcggt cagcaggatc catgctttcg 167821 ggtccccggt gacggcacta tctggcggac cagcttgctg cccaccgggc cggtcaccgc 167881 gcggatcagc cgtgctgggc gcgacgccgc ccgttgcgtg gcgtggggca gcggtgccga 167941 ggagtttgtc gacatggcgc ccgccatgct gggcgccgcc gacgacgcca gcgatttcgt 168001 gccgctgcat ccggccgtgg ccgccgcgca ccgccggctg ccgaacttgc gcctgggccg 168061 caccggccag gtgctggaag ccttgatccc ggcggtcatc gagcagcggg tacccggcgc 168121 cgacgcgttt cggtcgtggc ggctgttggt gtccaagtac ggaacgcagg cccccggtcc 168181 ggcgccaccc ggcatgcggg tgccgccgtc ggccgaggtg tggcgtcaca tcccgtcctg 168241 ggagtttcat cgcgccaatg tcgacccggg gcgggctcgc gcggtggtgg gttgcgcgca 168301 gcgggcggcg tcgctggagc ggctggtgtc gctgcccgcg gctcgggcgg cggaggcgct 168361 gacatcgttg cctggagtcg gggtatggac cgcggccgag accacacaac gcgtgttcgg 168421 tgacgccgac gccgtgtcgg tcggcgacta ccacattccg aagatgatcg gctggacgct 168481 tgtgggccgg ccggtcgacg acgccggcat gctcgagctg ctggagccga tgcgcccgca 168541 tcgccaccgg gtggtccgct tgctcgaagc cagcggcttg gcgcgtgagc cgcgccgcgg 168601 gccccggctg ccggtacaga acatccgggc gctgtagggg agtttgacgg ggatcttgct 168661 cggtccggcg ccccgattcc cgccagatcg gctgccggcg ccgctaagcc gttgtcggcc 168721 gatcactgcc tccgcgttcg gcctcggcgg tctgccggtt cagtcgctgc gtctcgtaga 168781 tggtgacgtt ggtgcgagac aacaacagtg ccgcgatacc gacggcgatg atcgctccag 168841 gcaccaccga gaacgagccg gtcatctcag cgaccatgat catgacggcc agcggcgcgc 168901 gggagacact gccgaagcac gccatcattg cgaccacgac gaagatgccc ggctcgtggg 168961 gcaccccggg cagctcggtg agctcgccta gccgccagat cgccgctccg acgaaggcgc 169021 cgatcacgat tcccggcccg aatagcccgc ctgatccgcc ggtgccgatc gacagcgacg 169081 tcgcgaggat cttggcgatc ggcaagacga tgacgatcca caacgggatg ctcagcagcg 169141 tcccccgatc ggcggctagc tgcgcccagc catagccgct gctcaggatc tggggaatcg 169201 gcagacctaa cagcccgacc agcagtccgc cgatcgccgg tttgagcacc gggcccccgg 169261 gcagccggcg cgtaattgcc accgacgcgt gaaagactcg ggcatacaag tagcctacgg 169321 cggctgcgat cagcccgatc accacgaacc acagtagtgg ccacgccttt tcgaagcgat 169381 actcggcgtc gatgtagccg aacagcgggt cgaagcccaa gaaggcgccg agcacggcgt 169441 aggcggttcc cgaggcgatg aaacccggca gcaggttgcg gtagtcgaag tcgtcgcggt 169501 aggggatcga ggcgcccaac gccgctccgc ccagtggcgc agcgaagatg gcgccgatgc 169561 cggcgccgat acccagcgct accgcggtcc ggccgtcttc gttggacagg ttcagccggc 169621 gggtcagcag tgagcagaag ccggccgaga tctgcgcggt cgggccttcg cggccgcctg 169681 aaccgcccga gccgatggtc aaggcgctgg ccaccatctt caccagcacc gcccgacctc 169741 ggatggcgcg cggatcgccg tgcaccgact cgatcgcttc gtcggtgccg tgaccggtgg 169801 cctccggggc gagcttggcc acgatcaatg ccgacagcac cgccccgccc gtcgtcacca 169861 gcggaatcgc ccacggacgc gcgaaaccgg tggacccgcg gtggccgccc tccccaacgg 169921 gagtgggaat ctgatagtcc gcgaggtagc cgagcagaaa ctcgctggtg tatttcagcg 169981 cgaggtagaa gacgacggcg cccaggccgg caatgacacc gatcgtgatg cctagcagga 170041 accatttgcg caggtagccc gcgctcctga tcgatacgcc gaatcgtccg ccggcggcct 170101 cgttcccgat gtcttccgcc tccggcatgg tcgggaggtt agcagcatgc caagcgaaca 170161 ccgaccagtc gcccggcgcc atcccagagt tggccagcgc tatccgacga tcagcagcgc 170221 aaccatggcc caggtctgga cgtacgcgat caccgccgct gtgcggcgag gagatccgaa 170281 acggtgccgc actcttggac cccgacctct gtcatgacgc cgccgctcgt cgtggccgcg 170341 ttcaggccgg tcggccatta ccgactcgca acggacagag ccggtgggcc ctgctcgccc 170401 ccggcgaccg gagccaagct gacaagttcc gtagcatccc gcccaacggt aggtaccaag 170461 ccgcagtggt ggcacacttt agtgatgtca atgtcgctca cggccggtcg cggcccggga 170521 cgtcccccgg cggcgaaagc agatgagact cggaagcgta ttctgcacgc cgcccgtcaa 170581 gtgttcagcg aacgtggtta tgacggcgcg acttttcagg agatcgccgt ccgcgccgac 170641 ctgacccgac cggcgatcaa ccactacttc gccaacaagc gggtgctcta ccaagaggtg 170701 gtggagcaaa cccacgaact cgtcattgtg gccggcatcg aacgggcacg ccgcgagccg 170761 accttgatgg ggcggctggc ggtcgtcgtt gacttcgcga tggaggccga tgcccagtat 170821 cccgcctcga ccgcgttcct ggccaccacc gtgctcgaat cccagcggca tccagaattg 170881 agtcggaccg aaaacgatgc ggtgcgagca acccgagaat tcctggtttg ggctgtcaat 170941 gatgcgatcg aacgcggtga actagccgcc gacgtcgatg tctcttcgtt ggccgagacg 171001 ctgttggtcg tgttgtgtgg cgtgggcttc tatatcggtt ttgtcgggag ctatcagcgg 171061 atggcgacca tcaccgattc gttccagcag ctgttggccg gcacgctctg gcggcctccg 171121 acctgaccga gacctaaccg gcggccccga agcgtagtga tgtgccacac aaatcgtata 171181 ggttacctaa cttacttagg tagcatggca tgccgtgacc gaactcgacg acgtgtcctc 171241 gttaccatcc tcgcgacgga ccgctggcga tacctgggcg atcaccgaaa gcgttggcgc 171301 caccgcgttg ggggtcgcgg cggcacgtgc cgtggaaacg gccgcgacca atccgctgat 171361 ccgtgacgag ttcgccaagg tgttggtgtc gtcggcgggt accgcctggg cacggctggc 171421 cgacgccgat ttggcctggc tcgacggtga tcagctcggc cgacgcgtgc atcgggttgc 171481 ctgcgactac caggcggtgc gcacccactt cttcgacgag tacttcggtg ccgccgtcga 171541 cgcaggtgtc cggcaggtgg tgatcctcgc tgccggactg gacgctcggg cctaccgcct 171601 gaactggccg gcgggcactg tggtttacga gatcgaccag ccttcggtgt tggagtacaa 171661 ggcggggatt cttcaatcgc atggcgcggt tccaacggcg agacggcatg ccgtcgcggt 171721 ggacctgcgc gacgactggc cggccgcgct gatagctgcc ggattcgatg gcacccaacc 171781 gactgcctgg ctagccgagg gcttgctacc ctacctgccc ggcgacgccg cggaccggct 171841 attcgacatg gtcaccgcgc tcagcgcacc gggcagccag gtcgctgtcg aggctttcac 171901 catgaacaca aagggcaaca cgcagcgctg gaatcggatg cgcgagcgac tcggtttaga 171961 catcgatgtc caggcgttga cctaccacga gcccgaccgg tcggatgccg cgcaatggct 172021 ggccacgcat ggctggcagg tgcacagcgt gagcaatcgc gaggagatgg cccgactggg 172081 ccgggcgatc ccgcaagacc tggtcgacga gaccgtccgc accacgttgc tgcgagggcg 172141 tctggtcaca cccgctcaac cggcgtgaca ccggcatcac gagaaccaga gggagcacag 172201 gatgagcgcc atgcgcaccc atgacgacac ctgggatatc aagaccagcg tcggcgccac 172261 cgcagtgatg gtggctgctg cccgggccgt cgaaaccgac cggcccgacc cgctgatccg 172321 cgatccctac gccagactgc tcgtcaccaa cgccggggcc ggcgccattt gggaagccat 172381 gctcgaccca acactggtag ccaaggcggc tgccatcgat gccgaaaccg cggccatcgt 172441 cgcctatctg cgcagctacc aagcggtgcg gaccaacttc ttcgatacct acttcgccag 172501 cgctgtcgcc gccggaatcc ggcaggtagt gattctggcg tccggactgg attcccgcgc 172561 ctatcgcctg gactggcccg ccggaaccat cgtgtatgag atcgatcaac ccaaggtgct 172621 ttcctacaag tccacgacgc tggcggaaaa cggggtaacg ccgtcggctg gtcgccgtga 172681 ggtgcccgcc gacctgcgcc aggactggcc cgccgcgctg cgtgatgccg ggtttgaccc 172741 gacggcacgc acggcgtggt tggccgaggg gctgttgatg tacctaccgg ccgaggccca 172801 ggaccggctg ttcacccagg tcggcgccgt gagcgtggcg ggcagccgga tcgcggccga 172861 gactgcgccg gtgcacggcg aagagcggcg agcagaaatg cgggcacggt tcaagaaagt 172921 ggccgatgtg ctcggtatcg agcagaccat cgacgtgcag gaactggtct accacgacca 172981 ggatcgggcg tccgttgccg actggctcac cgatcacggt tggcgggccc gatcccaacg 173041 tgcgcccgac gagatgcgcc gcgtgggtcg ctgggttgag ggggtgccga tggcggacga 173101 cccgactgcg ttcgccgagt ttgtcaccgc agagcggttg tagcgagcgc atccgactga 173161 ccttatatat ccggatatat ggctggatct tttctattgc tggttcaacc gggtgactag 173221 gatcgcggtt atcaccgatg agtgaccgcg tcaaggcggt cgcgccgccg gacggaagga 173281 cgatgatgac caccgaatcg gttgcccgga agacccagaa atctgagacc gaggctccgc 173341 gcgaaccggc gcccgtttcg gatgaaaagc aaaccgatgt cgctaaaacg gtggctcggc 173401 tgcgaaagac ctttgccagc gggcgtaccc gcagcgtcga gtggcgcaag cagcagttgc 173461 gcgcgctaca gaagttgatg gacgagaacg aggacgcgat cgccgcggca ctcgccgagg 173521 atctggatcg caatccgttc gaggcatacc tcgctgacat cgcgacgacc tccgccgaag 173581 cgaaatacgc ggccaagcgg gtgcgcaggt ggatgcggcg ccgctacctg ctgctcgagg 173641 tgccgcagct gcccggccgc ggctgggtgg agtacgagcc atatggcacc gtgctaatca 173701 tcggtgcctg gaactacccg ttctacctga ccctgggtcc ggcggtcgga gccattgccg 173761 ctggaaacgc cgtcgtgctc aaaccgtcgg aaatcgccgc tgcatcggcg cacttgatga 173821 ccgaattggt gtatcgctat ctcgacaccg aagcgatcgc ggtcgtgcag ggcgatggtg 173881 cggtgagtca ggagctgatc gctcagggtt tcgaccgcgt gatgttcacc ggtggcaccg 173941 agatcggccg caaggtctac gaaggcgccg cgccgcacct gaccccggtc accctcgagc 174001 tcggcggcaa gagcccggtg atcgtcgcgg ccgatgccga tgtagatgtc gcggccaagc 174061 ggatcgcctg gatcaaactg ctcaacgccg ggcagacatg cgttgcaccc gactatgtgc 174121 tggcggatgc caccgtccgc gacgagctgg tcagcaagat caccgcggcc ctcaccaagt 174181 tccgctccgg tgcgccgcag ggcatgcgca tcgtcaacca gcgtcaattc gaccggctga 174241 gtggatacct cgccgcagcg aaaaccgacg ctgcagccga cggcggcggg gtcgtcgtgg 174301 gcggcgactg tgacgcatcg aacctgcgca tccaacccac cgtggtcgtc gatcccgacc 174361 cggacgggcc gttgatgagc aacgagatct tcggaccgat cctgccggtg gtcaccgtca 174421 aatctctgga cgacgcgatt cgcttcgtga actcgcggcc caagccgcta tcggcgtacc 174481 tgttcactaa gtcgcgtgcg gttcgcgagc gggtgatcag ggaggtgccg gcgggcggaa 174541 tgatggttaa ccatttggct tttcaggtgt cgacggccaa actgccgttc ggtggtgtcg 174601 gcgcatcggg catgggtgcc taccacggcc gttggggttt cgaggagttc agccaccgta 174661 agtcggtgtt gaccaaacca acccgacccg acctgtccag ctttatctac ccgccgtaca 174721 ccgagcgcgc catcaaggtg gctcgccggc tgttctgacc tgggcgcggg ttgtcgcccc 174781 gttgacaccc gactcgttat aaccccgaat tgtgattgcg gagaggagcc tgatgcccgg 174841 agtgcaagat cgcgtcatcg tcgttactgg agccggcggt ggcttgggcc gcgaatacgc 174901 ccttacgctc gccggggagg gcgccagcgt cgtggtcaac gacctcggtg gcgcccgcga 174961 cggcacgggc gccggttcgg cgatggccga tgaggtcgtc gccgagattc gcgacaaggg 175021 gggccgggcg gtcgccaact acgacagcgt cgccaccgag gacggcgcag cgaacatcat 175081 caagaccgcg cttgacgaat tcggcgccgt gcacggtgtg gtgagcaacg ccgggatctt 175141 gcgcgacggc accttccaca agatgtcgtt cgagaattgg gacgccgtgc ttaaggtgca 175201 cctttatggc ggataccacg tgctacgcgc ggcctggccg catttccgtg agcagagtta 175261 cggccgggtc gtggtggcga cctccaccag cgggctgttc ggcaacttcg gccagaccaa 175321 ctatggggcg gccaagcttg gtctggtcgg cctgatcaat acgctggcgc tggagggagc 175381 caagtacaac atccacgcca atgctcttgc cccgatcgcg gcgaccagga tgacccagga 175441 catcctgccg cccgaagtac tggaaaagct cacacccgag ttcgtcgcac cggtggtggc 175501 ctacctgtgc accgaggagt gtgccgacaa cgcatcggtg tacgtcgtcg gtggtggcaa 175561 ggtgcagcga gttgcgctgt ttggcaacga cggcgccaac ttcgacaaac cgccgtcggt 175621 acaagatgtt gcggcgcggt gggccgagat caccgatctg tccggtgcga aaattgctgg 175681 attcaagttg tagaagtaaa tgaaggcttg tgtcgtaaaa gaactttccg gcccgtccgg 175741 catggtgtac accgacatcg acgaggtatc cggtgacggc ggaaaggttg ttatcgacgt 175801 acgggccgcc ggcgtctgct ttccggacct gctgctgacc aagggcgagt atcaactgaa 175861 gctaacgccg ccgttcgtgc ccggcatgga aacggcgggt gtggtgcgtt cggcgccgtc 175921 ggatgcgggt tttcatgtgg gcgaacgtgt ttcagcattc ggagtgctcg gcggctacgc 175981 cgaacaaata gccgtaccgg tggccaatgt ggttcgcagc cccgtcgagc tcgatgacgc 176041 cggggcggtg tcgctgttgg tgaactacaa caccatgtac ttcgccctgg ctcggcgtgc 176101 cgcgctgcga ccgggagaca ccgtgctggt gctcggcgcc gccggcggag tgggcacggc 176161 cgccgtccag atcgcgaagg cgatgcaggc tggcaaggtg atagccatgg tgcaccgcga 176221 aggtgcgatc gactatgtcg cttcgctcgg tgccgacgtg gtgcttccgc tgaccgaggg 176281 ctgggctcag caggtgcgtg accacaccta cggtcagggg gtggacatcg tcgtcgatcc 176341 catcggcgga ccgacattcg acgacgcgct cggcgtgctg gcgatcgacg gcaagttatt 176401 gttgatcggc tttgccgcgg gtgctgtacc gaccctcaag gtcaaccggc tgctggtgcg 176461 caatatcagc gtggtgggcg tcgggtgggg cgagtatctc aacgcggttc ccggttcggc 176521 cgccttgttc gcctgggggc taaaccagct ggtctttctg gggctcagac cgcctccgcc 176581 gcaacgctat ccgttgtcgg aagcacaggc cgcgttgcag agtctggacg acggcggtgt 176641 gctcggcaag gttgtgctcg agccctaagc gcatgctcgc gattcggcga tacggtgatg 176701 ctgtgacgga tcggcgggcc aacacgagga attcgcaccc gctgccggcg tgaccaacgc 176761 cacgctggca gcaatcgggt atccgatcgc gttggccagc aagctgttgg cgatatcggc 176821 cgtcgaaagc acaaccgcgt agccgtccgc aaccacagtg gaaatggtgc tggcgatctt 176881 ggtgtgcgcg agcgcttcga tgccagggtc agggagcccg gtgggcgccc ggtcgtcggg 176941 caacgtcagc atcgacgagc cgccggtctg tgacaagttc gccaacaacg gattgggcag 177001 ccacgccggt acctggcgcg gatgtggtgc acgaacgcgt tgacatattg ggggctcttc 177061 gcggatgagg gtgtagggcg ggtcggcgcg tcgttgccgg gtaggggtcg cggtctttcg 177121 atgatgggcg gttccacgct gccgaaaagg aagacctcgg cgtgtctgcc cgaggcacta 177181 ggtcgcaagg gtaaccgagg gtgcacgttg acggggtgag gccaagcggg cgccgagcgt 177241 gaactgaggg cgagatttcg gccgattctc cgccctcagt tcacgctggg cgacggcgcc 177301 aacgggctgc ccctggccgg tcgcaccaag acgccgcata cgtaccaaac ttcccatact 177361 cacccatcgc ggtgaacccc aaacccagtg ccggccacca ttggccttcc cgatggattg 177421 gtgccagcag caaccggcat catcgaaaac cggctcttca tgatcgaggg ccggcagcgg 177481 ctcgagcagc ggcaggccgg ggtgatcacg tagtagtgct gaatgacccg agcatcgggc 177541 gatcagatgc tgaagctttg cagttgctga gtaatgtcgg ccaacgtcac cacaatcgcg 177601 atgaattcaa tcatgccgcc cagggcggcc aacccaatgg tggccgcgag cggcagctcg 177661 atcgcagcgc ggaggttgcc ggccgccagt tgattcacga acagggtgag gtcataggcg 177721 ggcaggatag tgacgaaggc aagacctaga tctgccgtcg gaagaagaat cgagtagccg 177781 gtcgacacaa cggaagcgaa agtgtccgcg atgttgatga gcgtcgccgg ttgtggcggc 177841 ggtggcggcg gtagcagcgt cggcacatac ggcgggaacg cgggcatcgg agtttggggc 177901 agggtgttca gggcggctgg caactcgacc atgaagtcgt tgacgccctg ttgcgttccg 177961 gcaaccaggg catcggcgac aacgctcgcc gggacatccg ggaagagccc gaatggggta 178021 ggcacgttcg ccgggctcgt cgaatagccg aacctcgggt cgccgtaacc cagattgacg 178081 attacttcca agttcggttc gaccagcgcc gccagcggtg ggccaatgac cgggattgcc 178141 cgcaacgggg ccagcagcgg cagatgctcg gtttcaatga tgtagtacgt gttcgacgtc 178201 gtgccctgtg tcggcaactg cgtggccgac gctatctgtg ccggtgtgag gtccgcatac 178261 gtggtgtgca ccgtgagtat cccgaatact gcgttgatat cggacaggac attgagtgga 178321 taccgcggga agtcggcgaa accgtcgtac tcgagggtgt aggtcgtcgt cggataggga 178381 ttgtccgggg tcgccccgta gaacggtagg ccgagggtgg tgacattcag accgggtatg 178441 cgcgcaagta tcccgccatt gggattcatc tcgttgccga tcaagatgaa attgagctgg 178501 ctggggctgg gagcgttggg acccagcgag atgaggtgct gcatttccag ggacgcgatg 178561 acggcgctct gcgaatagcc gaacacggtg acgtggtttc cggcgttgat ttgctcccaa 178621 atcgcgccgt cgagaatctg taggcccaac tgcaccgagg tttggaaggg cagggatttg 178681 acgccggtga tcggatatag ctcttcgggc gtcaccagcg ctttgacgac cggattcgag 178741 acgacggggt cgatgaacaa ggtcgtgatg gcgttgacat aactcggcgt gggtatcggt 178801 gacccggtgc cgcccatgat gatcgccgta ttttggttga acattggcgg tagcaccggg 178861 ggtgaggttg gcttaaagag tccggccgtc gcctcctgca ccagcgcgct cgtgttggtg 178921 gcctcggcat tgacaaatgc gtttgcggcc gccgccaacc tctgggtgaa ttcgttgtga 178981 aacgccgcaa cctgtgcgct gatcgcctgg aactgctggc cgtacgcgcc gaacagcgtg 179041 gcaagggccg tggacacttc gtccgcggca gccgccgcca ggccggttgt cggggccgcg 179101 acggccgccg tagcctggtt gatcgccgag ccgatcccgg ccaaatcggt agccgccgct 179161 gccaataccg acggctgcgc gaatacgtac gacaaacccc atccctcctt gtcgacgggg 179221 cccataaccc acccgtcgag ccgatacgtt gagcgtaaag cgactccgcg gttgtgtctg 179281 gcctttggag tgaacccaaa tggggccatg ctgcctcgtc attggcgagg tcggtaaacg 179341 gtagtcggtg gacgtcgatg ccgtcgggaa tccgttaggt gacgaggccc tcgatgtttc 179401 gaacggtgtc cgaggccgcc gcgaggaggg tgagcaattc cacgccgccc gctatcgatc 179461 gtgcctaaac ctacggtggc cgccagggga tagccgatcg cgttgatcag attgcccgca 179521 gcgagttgcc tgacgaacag ttgggtggtg tacagcggca gggtggtgac cagggcgagg 179581 gcgatgtcca cggtgggcag caggacggcg tagttggttg agatgatcct ggcgagcgtg 179641 ttcaccacct cggccggcgt cggtgcggcg gccaccgcgg ccaccagatc ggcgggttgc 179701 ggcagctgga tctgcgggag cgtgagcggt tgcgcggaca gcgcctgcag gtcggccgtg 179761 aagtcaagga tgccttcttg tgttccggcg gccagggcat cggcgatgac ctgaggcggc 179821 acgttcggcc acagcccgaa cggcgttcgc acatcggcgt agctcgtcga gtagccgtag 179881 ttcgggtcgc cgtagcccag gttgacgatc accttcaggt tcggctggat caggtcggcc 179941 agcggatctc cgatgaccgg caccgcccgc agcggttgca gcagcggccg attctcggtg 180001 cggatgatgt agtagtcggt gacccccgta tagcccggcg acgtcggtaa tttagtagcg 180061 ccctcgacct gcgcgggcgt gaggtccaaa tacttggtgt gtacgaatgt gatgcctgca 180121 accgcgttga ggtcggaaat gaagttgagc gggtatcgcg agaagtcggc gaacccgtcg 180181 tactcgagcg tgtagatggc cgtcggatag atcgtgtccg agggcgttgc gccatagaac 180241 gtcaggtcca gagtcggaag cgtcagatcc gggaaccgcg cgagcatacc gccattgggg 180301 ttcatttcat tgccgacaag cacgaaattg aggtcgctcg ccgaaggtgc ggccccgccc 180361 atcgccgtga acctctgcat ctccagcgac gcgattatgg cgctttgcga ccagccgaaa 180421 acggtgaccg cgtttccggt ggtcgcgagc tctaccatga tcgcgtcgtg caagatggtc 180481 aagccctctt ccactgacgt gttgaggacc aaacttctga caccggtgag tgggtacaac 180541 tcttcgggtg tgaagacggc ttgtagcgcc cgccgaacgg acctacagcg tattggcggc 180601 gtcaacatag acggcggtgg tagtggaatt ccggtgggcc caaagaacaa ggtggtcaag 180661 ttcgccggga atggcggaat catcgcggcc gccgcggggg ttggtgcggc ggcgggcaca 180721 gccagctgat tttgccgggt gctggcgatg gcggcctcgg catctgcgta gctgttcgcc 180781 gcggcggcca acgtctggtg gaacctaact gtgaaacgcc tcgacttgag cgagcacggc 180841 ctggtattcc tggccgtatg cgccgaacgg tttcgcgatg gcggccgaca cctcatcgcc 180901 ggccgccgcg gccagtgcac acgtcgggcc tgccgcggcc gcgccggccg tactcacggc 180961 cgaaccgatt cctgccacct cggcggcggc cgccgctacg atccgcggct cagcgatcag 181021 atacgacatc gtctcactcc cctagcacca ggtgtcggcc aaccgggtca acccggggtt 181081 ttggtcagcc cagagcggtc ccgctgccct ggtggtcgct tacgcgaatc ggattcgcgc 181141 gaaagcgttt cccctcatcc gagcagcacc ccgcgcatcc ggttgactgt ggcctggctg 181201 ataccggcgt cgcgcaggta gccgcccagc gatccgtagg tctcgtcaat ggtctggcgt 181261 gcggcggcca ggtactccgc gcggacaccc aggaccccgt cggacagccg ggccttggtg 181321 aacgtcacca cctcgggtgc cagttcggtg tcgaaacgct gctggatcat ctcggagatc 181381 cgggcccgca gttgtggcac ggagtcgttg ctgcgcaggt agtcggcgac gatgacgtcg 181441 cggtccaggc cgaccgcttc aagcaccagc gcgaccacga agccggtgcg atccttaccc 181501 gcgaagcagt gggtgagcac cgggcgtccg gcggcaagca gtgtgacgac acgatgtagc 181561 gcgcgctgtg ctccattgcg cgttgggaat tggcgatact cgtcggtcat gtagcgggtg 181621 gccgcgtcat ttatcgactg gctggattcg ccggactcgc cgttggaccc gtcattggtt 181681 agcagcctct tgaatgcggt ttcgtgcggc gctgagtcgt cggcgtcatc atcggcgagg 181741 tcggggaacg gcagcaggtg gacgtcgatg ccgtccggaa cccgtcctgg accgcggcgg 181801 gcaacctccc gggacgaccg caggtcggca acgtcggtga tccccagccg gcgcagcgtt 181861 gcccggccgg cgtcgtcgag gcggctcagc tcgctggacc ggaacagccg ccccggccgc 181921 aatgcggttg cggtgtcggc gacgtcacga aagttccacg cgcccggcag ttcacggaca 181981 gccatctcag gtgaccgccg cagcgaaggt ggacttctcc ctcgacagct cggcgcgggc 182041 gatggagcgc aggtgcacct cgtcgggacc gtcgaagatg cgcatggcgc ggtgccagcc 182101 gtacaaccgg gccagcgggg tgtcgtcgct gacgccggcg gccccgtgga cctggattgc 182161 gcggtcgatg acatcgcagg ccacccgcgg ggccaccgcc ttgatcatgg cgaccaggtg 182221 gcgcgcctct ttgttgccat gttggtcgat tgtccacgcc gccttttcgc acagcagcct 182281 tgcctggtcg atttcgttgc gggactgagc aatcgcctgt tgcacgacgc cctgttcggc 182341 tagcggacgg ccgaacgcca cccggttgcg gacgcgattc accatgagtg ccaaggcgcg 182401 ttcggccgcg cccagcgcac gcatgcagtg gtggatacgg cccggcccca gccgggcctg 182461 ggctatggcg aatccgctgc cctcttcgcc gagcaggttg gtggccggga cccggacgtt 182521 gtggtagtcg atctcgcagt ggccgtgccg gtcctgccag ccgaacaccg gtgtggagcg 182581 aacgatcgtc acgccggggg tgtcgatcgg gacgaggacc atcgactgct gttggtgggc 182641 ggctgcgtcc gggttggtgc ggcccatcac gatgaggatc ttgcaccgcg ggtccgccgc 182701 tcccgacgtc caccacttac ggccgttgat gacgtagtcg gcaccgtccc gggagatggt 182761 ggtttcgatg ttgcgggcgt cgctgctggc caccgccggc tcggtcatcg agaaggcgct 182821 gcggatcttg ccgtcgagca gcggccgcag ccattgcgcc cgttgctgct cggtgccgaa 182881 catgtgcagg atctccatgt tgccggtgtc cggtgcggcg cagttgagtg cctcgggcgc 182941 gatttccatg ctccatccgg tcatttcggc cagcggcgcg tactccaggt tggtcaatcc 183001 cgactcggcc gacaggaata ggttccacag gccgcggtct ttggccttgg ttttcagttc 183061 ctcgatgatc ggcggcgcgg tgtggtcggc cggtccggcc gcgcggcgat agtcgtcgta 183121 atcggcctca gcgccgaaga cgtgctcggt catgaagtcg gacaaccgcg tgcggtagtc 183181 gatggccttg gccgacatcg cgaagtccat tccgccacga tatctaccgg cgctagcaga 183241 cgcataagtc cctcgacacg ccgacgagaa gggggttttg cgtctgctcg ccgtcgtttc 183301 gtgccaccgt tcaactgacc cgcaagtggc agcgcgagct cgactattcg ctacgcaaga 183361 gtttgtggag cttccacgac aaccgcattg cgatgcggtt ccagtacgaa tcccgtgacc 183421 gcaacggcca gtggtatcgc agctacggca ccgaactgtg gcgaagccag catcaacgac 183481 gtgccgatcg ccgaatccga gcgtcgctac ctcggtgcgc gctcggcatc cgagtatggc 183541 caggaaatac cgctctggta gcccggtagg gtgtctgagc aaatctatcg gcgttcagta 183601 aggaaagtgg atgtacgcgc catgacagat ccgcagacgc agagcaccag ggtcggggtg 183661 gttgccgagt cggggcccga cgaacgacgg gtcgcgctgg ttcccaaggc ggtcgcgtcg 183721 ctggtgaacc gtggtgtggc ggtcgtggtc gaggccggtg cgggcgagcg cgcgctgctt 183781 cccgatgagc tctacaccgc tgtcggtgcc agcatcgggg atgcttgggc cgccgacgtc 183841 gttgtcaagg tcgcgccgcc gacggcggcg gaggtcggcc ggttgcgcgg tgggcagaca 183901 ctgatcggct ttctagcgcc ccgtaatgct gacaactcga tcggcgcgct gacccaggcc 183961 ggggtgcagg cgttcgcgct cgaggccatc ccgcgcatct cgcgggcgca ggtgatggac 184021 gcgctgtcgt cgcaagccaa cgtgtctggg tataaggctg tgctgctcgc ggcctcggaa 184081 tcgacccggt tctttccgat gctgacgacg gcggccggaa cggtgaagcc ggccacggtg 184141 ctggtgctcg gcgtcggcgt ggccggcctg caggcgctgg cgacggccaa acggctaggc 184201 gcgcgcacca cgggctacga tgtgcgtccc gaggtggccg accaggtccg atcggtgggc 184261 gctcaatggc ttgatttggg catctcagcg tccggtgagg gcggttacgc ccgcgaactg 184321 accgacgacg agcgcgccca gcagcaaaag gcattggaag aagcgatcag tggcttcgac 184381 gtggtgatca ccaccgcgct ggtgccgggc cgcccggcgc caacgttggt gaccgccgct 184441 gcagtggaag cgatgaagcc tggcagcgtg gtggtggatc tcgccggcga gacgggcggc 184501 aactgcgaat tgaccgagcc cggccggaca gtcgtcaagc acgacgtcac cattgccgca 184561 ccgctgaacc tgccggccac gatgcccgag cacgccagcg agctctacag caagaacatc 184621 accgcgctac tcgacttgtt gatcaaagac ggcaggctgg ccccggactt cgacgacgag 184681 gtgattgccc agtcgtgtgt cacccgcggg aaggactcct agatgtacaa cgaattgttg 184741 gagaacctgg cgatcctggt gctgtccgga ttcgtcgggt tcgcggtgat ctcgaaagtg 184801 cccaacacgt tgcacacccc gctgatgtca ggaaccaacg ccatccacgg cattgtcgtt 184861 ctcggcgcgc tggtggtttt cggcgaaatt gagcacccat cgctcgtgtt gcaggtcatc 184921 ctgttcgtcg cggtggtgtt cggcacgctg aacgtcatcg gcggattcat cgtcaccgac 184981 cgaatgctcg gcatgttcaa ggccaagaag cccgccgtgc cagccaagcc cgaccgcgac 185041 gaggcgctcc gatgaacctg cactacctgg tcgagattct ctacatcatc tccttttcac 185101 tcttcatcta cgggttgatg gggctcaccg gccccaagac cgcggtgcgc gggaacctga 185161 tcgccgcggc cggcatgacc atcgccgtgg cggccacgtt ggtcatgatc cgacacacca 185221 gccaatggcc gctgatcatc gccggtctgg tggtgggtgt tgtgctcggt gtgccgccgg 185281 cgcgactgac caagatgacc gccatgccgc agctggtggc attcttcaac ggcgtgggcg 185341 gaggaacggt cgcactcatc gcgctgtcgg agttcatcga taccaccggc ttttccgcat 185401 tccagcacgg cgagtcgccg accgtgcaca tcgtggtggc ctcattgttc gccgcgatca 185461 tcgggtcgat ctcgttctgg gggtctatcg tcgcgttcgg caagttgcag gagatcatct 185521 ccgggcggcc gatcggactc ggcaaggcgc agcagccgat caacctgttg ctgctggccg 185581 tggccgtggc cgccgccgtg gtgatcggac tgcacgcgca tcccgggagc ggtggggtcg 185641 cattgtggtg gatgatcggc ctgttggtcg ccgccggcgt gctgggtctg atggtggtgt 185701 tgccgatcgg tggcgccgac atgccggtgg tcatctcgat gctcaacgcc atgaccggcc 185761 tgtcggccgc ggcggcgggt ctggcgttga acaacaccgc gatgatcgtg gccggcatga 185821 tcgtcggcgc gtccggctcg atcctgacca acctgatggc taaggcgatg aaccgctcca 185881 ttccggcgat cgtcgcgggc ggtttcggcg gcggcggtgt ggcgcccagt ggcggcggcg 185941 acgacaaaca cgtcaaggcc acttcggccg ccgatgccgc gatccagatg gcatacgcca 186001 atcaggtgat cgtggtgccc ggctacgggt tggccgtcgc gcaggcgcag catgcggtga 186061 aggacctggc aaccttgctg gaggacaggg gtgtgccggt caagtacgcg attcacccgg 186121 tcgccggccg gatgcccggg catatgaacg tgctgctggc cgaggccgaa gtcgactacg 186181 acgcgatgaa ggacatggac gacatcaacg acgagttcgc ccgcaccgac gtcaccatcg 186241 tgatcggcgc caacgacgtc accaacccgg cggcccgcaa cgagacgtcc agcccgatct 186301 acggcatgcc gatcctcaac gtggacaagt cgaggtcggt gatcgtgctc aaacggtcga 186361 tgaattccgg gttcgccggc atcgacaacc cgctgttcta cgccgacggc accactatgt 186421 tgttcggtga tgcgaagaaa tcggtgaccg aagtctccga ggaactcaag gcgttgtagc 186481 gcgcgagcgc tggctcagac gggcggatac gccggcggcg ggtatccgtc gccggtttcg 186541 accccgcgta gaccccaggt gaggtaccgg aagaagaact cgatttcgtc gctcacgtcg 186601 tagtcaggac tcggatccat cacttcaccc tctcgactcg cgacttggtt cgcaacggag 186661 tttagtcaca tccgcgccgg tgcgacaggt tgtcgccgcc ttgcctaaac tgaacaacca 186721 gttgattgat acagcttcgg ccggggccca tgggctccac cggcagcgac gatagcgagt 186781 agcgatgcca tccgacacca gccccaacgg gctaagccgc cgtgaggagt tgctggctgt 186841 tgccaccaaa ctattcgcgg cgcgcggtta tcacggcacc cggatggacg acgtcgccga 186901 tgtgatcggg ctcaacaaag caacggtcta tcactactac gccagcaagt cgctgatcct 186961 gttcgacatt taccgtcagg cggccgaggg caccctggcc gccgtgcacg acgatccgtc 187021 ctggacggcc cgtgaagcgc tgtaccagta cacggtccgg ctgctcactg cgatcgcgag 187081 caaccccgag cgggccgccg tgtacttcca ggagcagccc tacatcaccg agtggttcac 187141 cagcgagcag gtcgccgagg tccgcgagaa ggagcagcaa gtctacgagc acgtacacgg 187201 cctgatcgac cgcgggattg ccagcggcga gttctatgag tgcgactcgc atgtggtggc 187261 gctggggtac atcgggatga cgctgggcag ctaccgctgg ctgcggccga gcgggcgccg 187321 aacggccaag gagatcgcgg cggagttcag cacggcactg ctgcgcgggc tgatccgcga 187381 cgaatcgatc cgcaaccagt ctccgcttgg aactcggaag gaaacgtgaa cctcacgcga 187441 tcggtggaat caatctcgct acggacccga gggcgccact gagcaccgac aactccgtca 187501 cactggattg accgaagttg aacatcaggc ccggattcgc cgacggaaga tacggatacg 187561 tattgggtag cgcggactgc ggtaacaatc cgatgcttac tagggcggct tgggggcctt 187621 gcacggtccc ggtcgccagg gccgaggcca cggcgatcgg gttgattggc gcgaacaggc 187681 tggccggggt gggtacgtcg gcgtagccgt agccatagcc caagtcgact agcacccgta 187741 ggtcgggctg aatcagctcg gctattgggg tccctacgaa ggggatggcg cgaatcggct 187801 gcaacagcgg caggtcctgg gtcagaaaca tgtagtaatg ggtgttgccg gtgtagcccg 187861 gagacgtggg caacggcacg gcattggcaa cctcggccgc ggtgaagggg tacgcgttgt 187921 gcacccatct gatgcccatg aaggcgttga ggtccgacaa gatattgagc gggtactgcg 187981 ggttgtgggc gtagccgtcg tattggccgg tgtacatgta ggtctggtag ggggaatccg 188041 gtggagtcgc accgttgaac gacatatcca agaacgggag gtaaaggccc acgtaacgct 188101 cgaggacgcc gccgttgggg ttattgatat taccgatcaa cgtgaaagcc agccggcttg 188161 gatctggggc ttggcccggt ggtaacgcca taagagcgcg tatttcattg gtcgctaccg 188221 cggcgctttg cgagtagccg aaaacgacga cgtcatgccc attttgtagt tccgcgttga 188281 tgccgttgtt cagcagcgtg acaccctggg cgatggattg gtccagtgac aggttcccga 188341 taaacggcca ccactgctcg ggcgtgtact gggcgaccgg gttgttgggc ccgaaaatgg 188401 gccgaatgta tgcgctgtca atgatcgcca agacgcggtc actaaggatc ggttccccgg 188461 tgccgcccat catcaacgcg gttagcgggt tgcctgacag catcccgaca gaaccgaggg 188521 cgccgctgga cccggcggtg cccgacatag cagcggtgtt gctggcttca gcctgggcat 188581 aggcggcccc ggcggcagcc agcgcccggg tgaactcgcc atggaacgcc gcagcctgct 188641 ttaggacctc ttgacattcg cgcgcgtatt cgctgaacag cgctgcagcg gccgacgaca 188701 cctcatcggc ggccgcggcc agcagtccgg tcgttggacc cgcagcggac gcgctggccg 188761 ctcgtatcgc cgaaccgatc ccgtccacgt ccgcggccgt cgttgccaac atctccgggg 188821 ccgcgatgac gtaggacatc tggtctcctg ttcgacgctg gggcccttag agcctagagc 188881 gcgcccgccg ggaagcccgg cgttttcggc caatcgttat cgcggccgcg tcaggtgaag 188941 accggtggcg ggatcaggtg caggatgttg ccgagaccgc cactcatcag ggatagcagt 189001 gtcacctgtg gctggccgaa gtagaaattc aggcccgggt ttatcgacgg gacccacgga 189061 tagctgtccg ggaaccactc cggcccaatc aatccggctt ccaccccaat ctccacgatg 189121 gcgccatagg gcgcctgcag gctccctttg atcaggtaat acgtgacagc gaacgggttg 189181 gggatcgaga acagcccggc cggagtgggg atatccgcgt aattgccgcc cggcccgtag 189241 tcggcgtagc ccaagtcgac gagcacccgc agctgcggct ggaacaggtc ggcgatcggg 189301 ggaccggcgt aggggatgtc acggatcggc tggagcagtg gcagatcctg agtcaggaac 189361 atgtagtact gggtgttgcc ggtatagccc ggggaggtcg gcaacggcac cgcgttatcc 189421 acctgggtgg ccatgagttc cgggtacgtg ttgtgcacgt agaagtagcc catgaaggcg 189481 ttgatgtccg acaggatgcg cagcgggaat tgcggcgcgt gggcgatgcc gtcgtactgg 189541 gccgtgtaaa tgtgtgtcgg gtagggacta ttcgccgggg ttgcgccatt gaacggcacg 189601 tccaggaacg ggatgtagaa gccggggaag cgcgccagca gcccgccgac gggattgttg 189661 ccactaccaa tcatgacgaa ggagatatcg tccggattcg gcgaacccat cgccatcagc 189721 gaattgatgt agttgttgat gatcgtggcg ctctgcgagt agccgaacgc aacgaccttg 189781 ttgtcgaggg ccagttggtt gttgacggcg gtattcagca gcgccacgcc ttcggtgacg 189841 gactggttga acgtcagatt gccgaggtcg ggggtaaccg gccagaactg ctcgggcgtg 189901 aacaggcctt gcgagacagc acccgggaag agggtctgga tgaaagcctt gttgatgtct 189961 gtcacgtact cggggtcggg tagcgggtta ttggtgccgc ccataatcaa cgccgttatc 190021 ggactctccg cagccagctg cgcgatcgcc ggcagcccgc cggccccgct ggatccgttg 190081 gggctcaacg gcgcacggcc caacagcgtc cggatcggtg cgttgatagt gtccagcgcg 190141 tgcgataccc gggccgcatt ggccgcttcg gcgtgtgcgt aggcgttgcc ggcggcctcc 190201 aacgtccggg tgaactcgct gtggaacgcc gcggcctgct tgacgaccgc ctgatactcc 190261 cgcccgtatg cgctgaacag ggccgccgtt gccgccgaaa cctcatcgcc ggccgcggcc 190321 agcaggttac atgtcgggcc tgccgcagcc gcgttggcgg cccgcagcgt ggaagcgatc 190381 tcatccacat gggcagctgc cgtcgccagc atgtcagggg ctgtgaccag gtgcgacatc 190441 tccccgtcct tcccaacgga ccggcgcccg caccggtcac ttgggactga cccgctaccg 190501 cgggtattag gtacttaacg agagtaaggc ggtcctgccg ctacgtccgg cgtttggaca 190561 aacctcgatg actgcctgac ctatggcggc tgctataacc gcgagcatgc taaccagctt 190621 ggtgagtgcg gtcggatcgc atcacgtcac caccgaccct gacgtgctgg ccggccgcag 190681 cgtcgaccac accggccgct atcggggccg ggccagcgcg ctggtgcggc ccggctcggc 190741 tgaagaggtc gccgaagtgc tgcgggtgtg ccgggacgct ggagcctatg tcaccgttca 190801 aggcggccgc acctcactgg tggcgggcac cgttcccgaa cacgacgacg tgctgctgtc 190861 taccgaacgg ctttgcgtcg tcagcgatgt cgataccgtt gagcgccgaa tcgagatcgg 190921 tgccggggtc acactggccg cggtgcagca cgccgcgtca acggctgggc tggtgttcgg 190981 cgtggatttg tcggcccggg ataccgcgac cgtcggtggc atggcctcga cgaacgccgg 191041 cggattgcgc acggtccgtt acggcaacat gggcgagcag gttgtcgggc tagacgtcgc 191101 gctgcccgac ggtacggtgc tgcgccggca cagccgggtg cgtcgcgaca acaccggcta 191161 cgacctgccc gcgctgttcg tcggggccga aggcaccctg ggggttatca ccgcgctgga 191221 tctgcggctg caccccaccc cgtcgcatcg ggtgacagcc gtgtgcgggt tcgccgagct 191281 ggcagcgctg gtcgatgccg gccgaatgtt ccgcgacgtg gagggcatcg cggcgttgga 191341 attgattgac ggtcgggccg ccgcgctaac ccgtgaacat cttggcgttc gcccccccgt 191401 cgaggctgac tggttgctat tggtggaact ggccgccgac cacgatcaga ccgaccggct 191461 cgccgacctg ctcggcggtg cacggatgtg cggggagccc gcggtcggtg tggatgccgc 191521 tgcgcagcaa cggttgtggc gcacccgtga atcgctggcc gaggtgctcg gtgtgtacgg 191581 cccgccgctg aagttcgacg tctcgctgcc attgtcggcg atcagcggct tcgcccgaga 191641 tgcggtcgcg ttggttcacc gacacgtccc ggattctccg gaggcgttgc cgctgttgtt 191701 cggtcacatc ggtgagggca acctgcacct gaacgtgctg cgttgcccgc ctgatcggga 191761 accggcgttg tacgcaaaga tgatgggcct catcgccgaa tgcggcggta acgtcagttc 191821 agaacatggg gtgggcagcc gcaagcgtgc ctacctggga atgtcccggc aggccaacga 191881 cgtcgccgcg atgcggaggg tcaaggcggc gttggacccg accgggtacc ttaacgccgc 191941 ggtcttgttc gactgaccgg tgctgcgcaa gcattcagcg cctttagaga tcaccggtga 192001 aactgatgag ctgacgcacc gcgatgccat cggcgaggtg gtccatcgcc tcgttgatat 192061 cgtccaaccg aatcgttgac gtcaccagcg actccaccgg cagacggccc gattgccaca 192121 acgacacgaa gcggggaatg tcgtggctgg gcaccgccga acccagatag ctgccgatca 192181 gtgaccggcc ttcggtgaca aaatccaacg gcgacaagct gatccggaca tccggtggcg 192241 gcaacccgac ggtgatggtg cgccctccgg gcgcggtaag cccgatcgcg gtgtgcagcg 192301 cggcaggatg accgacggct tcgacaacca cggcggcttt gaccccgccg gccgtggcct 192361 gctgcggtgt gtagatctca tgggcgccca aggcctttgc ggccgacagc ttttcgggta 192421 gctgatcgac ggcgaccaca cgaacgtctg tatacgtcaa agcggtgagc accgctgcca 192481 taccgacgcc cccgaggccg acgacggcga ccgactggcc gggctgcgga tcaccgacgt 192541 tgagtaccgc acccccaccg gtgagcaccg cgcacccgag tagggcagcg acggtgggcg 192601 gcacctcgtg cggcaccgga accacgctgg cccggttgac gacgacatgg gtcgcgaaac 192661 ccgagacgcc gaggtggtgg tacaccgggc ggccgccccg gctgagccgg ataccgccac 192721 cgagcagtgt gccggccttg ttggccgcgc tgcccggttc gcacggcgtc cgaccgtcgg 192781 tcgcgcacgc cgcgcactgg ccgcaacgcg gaaggaacac cagcacgact cgctgaccga 192841 ccgcgacccc gtcgacgccg tcgccgacct gctcgacgat tccagcggct tcatgaccga 192901 gcaagatcgg caccggccgt acccgggtgc cgtcgaccac cgacaggtcg gagtggcaca 192961 cgcccgcagc ctcgattcgg acaaggacct caccgcggtc gggcgggtcc aggtgcagct 193021 cgacgacgct gattggtttc gaccgccaat agggccgcgg cacaccgatc tggtctagca 193081 ccgcgccccg gatggcaggc atgttggaat acaaccatgg ctgcactgcc ggcaccggag 193141 aagctcctgc gcagcgactt tccggtgctg tggccggtgg gaactcgatg ggccgacaac 193201 gacatgttcg gccacctcaa caacgccgtc tactaccagc tgtttgacac cgcgataaac 193261 gcctggatca acacgagcac cggggttgac ccgctcgcga tgcctgtgct gggcattgtc 193321 gcggagtcgg gctgccgtta tttctcggaa ctgcgtttcc cggagagcct aatggtgggc 193381 ctggctgtga cgcggttggg gcgcagcagc gtcacctacc ggctgggtgt gtttaaggag 193441 cctgacgatg cgggggtgat caccgcactc gggcactggg tgcacgtcta tgtcgatcgg 193501 actagccgca ggccggttcc gattcccgag gccattcggt cgctgttgtc gacggcttgc 193561 gtaagcggat aagccgcgcc cagattgcgt tcagggctgt gattttcgcc gctccaacca 193621 cagccatgac ggcaatctcg tgctcaccgc gacccaggta tgcttcccga atgccagttt 193681 tgagcaagac cgtcgaggtc accgccgacg ccgcatcgat catggccatc gttgccgata 193741 tcgagcgcta cccagagtgg aatgaagggg tcaagggcgc atgggtgctc gctcgctacg 193801 atgacgggcg tcccagccag gtgcggctcg acaccgctgt tcaaggcatc gagggcacct 193861 atatccacgc cgtgtactac ccaggcgaaa accagattca aaccgtcatg cagcagggtg 193921 aactgtttgc caagcaggag cagctgttca gtgtggtggc aaccggcgcc gcgagcttgc 193981 tcacggtgga catggacgtc caggtcacca tgccggtgcc cgagccgatg gtgaagatgc 194041 tgctcaacaa cgtcctggag catctcgccg aaaatctcaa gcagcgcgcc gagcagctgg 194101 cggccagcta aggcatgtgc gggctcagcc gaagacttcg gtctcagcca gggcctccgt 194161 cagcctgcgt gccccatcgg tgaactgcca gacggtgtgc tcgattacgg cggctgtgtc 194221 gcggcggcgc agcgcggcga tcagctgccg atgactgttc accgcgtccg cgccccatcg 194281 cgggtcggcc gcgaacacct gcgcccatat agcgcgcggc attaagcagg aaccaggcca 194341 acttgatccg gcggctcgct ttgttgaaga cgcggtggaa cgcgaactcg atcgacgcga 194401 tggttttggc atcaccggac ccgatagcac cggccagcgc attgttgatg cggtccagct 194461 cgtcgatctc aacgtcggtg atgtgagcgg tggccgatgt ggcaagttct tgggcaatgg 194521 tggcctgcag ccagaaaatg tcgtcgatgt cttggcgggt caacggcagc accacgtggc 194581 cgcgatgtgg ctccagcccg accatcccct caccgcgcag tttcagcagc gcctcccgca 194641 ccggcgtgac gctgactccg agctcggctg ccgtctcgtc gagacggatg aacgttccag 194701 agcgcagggc gcccgacatg atggcggccc gcaggtggcc cgcgacctcg tcggacaact 194761 gtgcccggcg caggggaagc tggctccgcg gcttcgccga tagaggtgcg ttcacgtggc 194821 ttgccaggac tttcagggtc gggccgggat tgccggggac ttgccggggg cttggcgggg 194881 gcttgttgtt gggccgctca ggccatagtg tgacccagac aacatcatgc tttatcaaat 194941 atcaacctgg cgcaagggat gcgcaagtga aaggaaggga aggaagggat agttgaccgc 195001 gcaactggcc agtcacctga cgcgggcgct aacactagcc caacagcagc cctaccttgc 195061 tcgccggcag aactgggtca accagctcga acggcacgcg atgatgcagc cagacgcgcc 195121 ggcgctgagg tttgtgggca acaccatgac gtgggctgac ctaaggcgcc gggttgcggc 195181 gctggcgggc gcattgagcg gtcgcggggt cggtttcggc gatcgggtca tgatcctgat 195241 gcttaaccgc accgagttcg tcgagtcggt gctggccgcc aacatgatcg gggccatcgc 195301 cgtaccactg aatttccggc tcaccccaac cgaaatcgcc gtcctggtcg aagactgtgt 195361 cgcacacgtg atgctgaccg aagctgcgct ggctccggtg gccatcggtg tccgcaacat 195421 ccagcccttg ctgagcgtga tcgtggtcgc cggcggatcc agccaggaca gcgtgttcgg 195481 ctatgaggac ctactcaacg aggccgggga tgtccacgaa ccggtggaca tcccgaacga 195541 ctcgccggcc ttgatcatgt acacctcggg caccaccggc cgcccgaagg gcgccgtgct 195601 gactcacgcg aacctcaccg gtcaggcgat gaccgcgctc tacaccagtg gcgccaatat 195661 caacagcgac gtcggtttcg tcggcgtccc gctgttccat atcgccggaa tcggcaacat 195721 gctgaccggg ctgctgctcg gcttgcccac ggtgatctat ccgctgggcg cgttcgaccc 195781 gggacagctg ctcgacgtgc tggaggcaga gaaggtcacc ggcatctttc tggttcccgc 195841 gcagtggcag gcggtctgta ccgaacagca agcacgacca cgtgacttga ggttacgggt 195901 gttgtcgtgg ggagctgcgc cggcgccgga tgcgttgctg cggcagatgt cggcaacctt 195961 tcccgaaacc cagatactgg ccgcattcgg ccagaccgag atgtcaccgg tcacctgcat 196021 gctgctcggc gaagatgcga tcgctaagcg cggatcggtc ggcagggtga tcccgaccgt 196081 cgccgcaagg gtggtcgatc agaacatgaa cgatgtcccc gtcggcgaag tgggcgaaat 196141 tgtctaccgg gcaccaacat tgatgagctg ctactggaac aacccggagg ccaccgcgga 196201 ggcgttcgca ggcggctggt tccattctgg ggatctggtt cgtatggact ccgacggtta 196261 cgtctgggtg gtggaccgca agaaggacat gattatctcc ggcggtgaaa acatttactg 196321 cgccgagctg gaaaacgttc tggccagcca tcccgacatc gccgaagtcg cggtcatcgg 196381 ccgggccgac gagaagtggg gagaggtgcc gatcgcggtc gcggccgtaa cgaacgacga 196441 ccttcggatc gaagacctag gtgagttcct gaccgaccgg cttgcgcgct acaagcaccc 196501 caaggcgctc gagatcgtgg acgctctgcc ccgcaacccc gcggggaagg tgctcaagac 196561 tgaactgcga ttgcgctacg gcgcctgtgt gaatgttgaa agacgttctg catcagctgg 196621 tttcacggag agaagggaaa accgacagaa attgtaacgt ttgcccgcta ttgacgaagg 196681 gttaaatgtg cggatgcctt acactcctgg ctggccatcg ggtagattcc tgtggtctcc 196741 gttactccct gtgagtaacg aggtggcggt cacacaccaa gggtcggggc aaggaggagg 196801 cgtgcgacat gatgcgccgc ggcgccgcga tacccaggtc ggcggcttga gggagccgcg 196861 gtgacgacgt cgacaacgct tggcggttac gtccgcgacc aactgcaaac cccgctgacc 196921 ctcgtcggtg gattctttcg catgtgtgtg ctgactggaa aggcgctgtt tcgctggccg 196981 ttccagtggc gcgagttcat tctgcagtgc tggttcatca tgcgggtcgg atttttaccg 197041 acgatcatgg tctcgatacc gctgacggtg ctgttgatct tcacgctcaa tattctgctg 197101 gcccagttcg gcgcggcaga catctccggt tccggcgcgg cgatcggcgc ggtcacccag 197161 cttggcccgc tgacaacggt gctggtggtc gccggcgccg gatccacggc catctgcgcc 197221 gacctgggtg cccgcaccat ccgcgaggaa atcgacgcga tggaggtgct gggcatcgat 197281 cccatccacc gtctggtggt gccgcgggtg ctcgcctcga tgctggtcgc cacgctgctc 197341 aacggcttgg tgatcaccgt cggcctggtc ggtggctttc tcttcggtgt ctatctgcag 197401 aacgtttcgg gcggcgccta ccttgccacg ctgaccttga tcaccggcct gcccgaggtg 197461 gtcatcgcaa ccatcaaagc cgcaacgttc ggcctgatcg cgggccttgt cggctgctat 197521 cgggggctga ccgtccgtgg cggttccaag ggtcttggca ccgccgtcaa cgagaccgtg 197581 gtgctgtgtg tgattgccct gttcgccgtc aacgtgatct tgacgaccat cggtgtgcga 197641 ttcgggacgg ggcgctgaca tgtcgaccgc tgctgtgctg cgcgcccgct tcccgcgggc 197701 ggtcgccaac cttcgtcaat atggaggtgc ggcggcccgt ggattggacg aggccggcca 197761 gctcacctgg ttcgctttga ccagcatcgg gcagatcgcg cacgcgctgc gctactaccg 197821 caaggagacg ctgcggctga tcgcccagat cggcatgggt accggcgcga tggccgtcgt 197881 cggcggcacg gtcgccatcg ttggctttgt cacgctgtcc ggcagctcgc tggtcgcaat 197941 ccagggcttc gcgtcgctgg gcaacatcgg tgtcgaggcg ttcaccgggt tcttcgccgc 198001 actgatcaac gtgcgcatcg ccggcccagt tgtcacgggt gtcgccctgg cggccacggt 198061 cggtgcgggt gctacggccg agctgggcgc gatgcggatc agcgaggaga tcgatgccct 198121 ggaagtgatg ggcatcaagt cgatctcgtt tctggcctcc acccggatca tggccgggct 198181 ggtggtgatc atcccgctgt acgcgttggc gatgattatg tcgttcctgt ccccgcagat 198241 caccaccacg gtgctctacg ggcagtcgaa cggcacctac gagcattact ttcaaacgtt 198301 cctgcgtccc gacgatgtct tttggtcctt cttggaggcc ctcatcatca ctgcgatcgt 198361 catggtcagc cactgctact acgggtacgc cgccggtgga ggccccgtcg gtgtcggcga 198421 ggccgtcggc cgatcgatgc gtttctcgtt ggtctcggtg caggtcgttg tcctgtttgc 198481 agcgttggcg ctctacggtg tcgacccgaa cttcaatctc acggtgtagc cgcatgacga 198541 cgccggggaa gctgaacaag gcgcgagtgc cgccctacaa gacggcgggt ttgggtctag 198601 tgctggtctt cgcgctcgta gttgccttgg tatacctgca gtttcgcggg gagttcacgc 198661 ccaagacgca gttgacgatg ctgtccgctc gtgcgggttt ggtgatggat cccgggtcga 198721 aggtcaccta taacggggtg gagatcgggc gggtagacac catctcggag gtcacacgtg 198781 acggcgagtc ggcggccaag ttcatcttgg atgtggatcc gcgttacatc cacctgattc 198841 cggcaaatgt gaacgccgac atcaaggcga ccacggtgtt cggcggtaag tatgtgtcgt 198901 tgaccacgcc gaaaaacccg acaaagaggc ggataacgcc aaaagacgtc atcgacgtac 198961 ggtcggtgac caccgagatc aacacgttgt tccagacgct cacctcgatc gccgagaagg 199021 tggatccggt caagctgaac ctgaccctga gcgcggccgc ggaggcgttg accgggctgg 199081 gcgataagtt cggcgagtcg atcgtcaacg ccaacaccgt tctggatgac ctcaattcgc 199141 ggatgccgca gtcgcgccac gacattcagc aattggcggc tctgggcgac gtctacgccg 199201 acgcggcgcc ggacctgttc gactttctcg acagttcggt gaccaccgcc cgcaccatca 199261 atgcccagca agcggaactg gattcggcgc tgttggcggc ggccgggttc ggcaacacca 199321 cagccgatgt cttcgaccgc ggcgggccgt atctgcagcg gggggtcgcc gacctggtcc 199381 ccaccgccac cctgctcgac acttatagcc cggaactgtt ctgcacgatc cgcaacttct 199441 acgatgccga tccgctcgct aaagcggcgt ccggtggcgg taacggctac tcgctgagga 199501 cgaactcaga gatcctatcc gggataggta tctccttgtt gtctcccctg gcgttagcca 199561 ccaatggggc ggcaatcgga atcggactgg tagccggatt gatagcgccg cccctcgcgg 199621 tggccgcaaa tctagcggga gccctacccg gaatcgttgg cggcgcgccc aatccctata 199681 cctatccgga gaatctgccg cgggtgaacg ctcgcggtgg cccggggggc gcccccggtt 199741 gctggcagcc gatcacccgg gatctgtggc cagcgccgta tctggtgatg gacaccggtg 199801 ccagcctcgc cccgtacaac cacatggagg ttggctcgcc ttatgcagtc gagtacgtct 199861 ggggccgtca ggtaggggat aacacgatca acccatgaaa atcactggaa ccgtcgtcaa 199921 actcggcatc gtctcggtgg tgctgctgtt cttcacggtg atgatcatcg tgattttcgg 199981 tcagatgcgc ttcgaccgga ctaatggcta taccgcggag ttcagcaatg tcagcgggct 200041 gcgccaaggc cagtttgtcc gtgcttcggg ggtagagatc ggcaaggtca aagcactaca 200101 cctggtcgac ggtggccgtc gggttcgggt ggagttcaat atcgatcgtt cggtgccgtt 200161 gtatcagtcc acgaccgccc agatccgcta ttccgacctg atcggtaacc ggtacgtgga 200221 gctcaaacgg ggtgagggca agggggccaa cgatctgctg ccgccaggtg gactcatccc 200281 attgtcccgc acgtcaccgg ccttggatct ggacgcgttg atcggtggtt tcaagccggt 200341 gtttcgggcg ttggatcccg cgaaggtgaa caacatcgcc aacgcgctca tcaccgtctt 200401 ccaggggcaa ggtggcacca taaacgacat cctcgaccag accgcgcaac tgaccagcca 200461 gatcgcggag cgcgatcagg cgatcggtga ggttgtcaag aacctgaaca tcgtgctgga 200521 caccacggtc aagcatcgaa aagagttcga cgagacggtc aataacttgg agaatctgat 200581 cactgggctg aggaaccact ccgaccagtt ggccggcggc ctcgcgcaca tcagcaacgg 200641 cgccggcacg gtggccgacc tgcttgccga gaatcgcacg ttggtgcgca aggccgtcag 200701 ctacctggac gctattcagc aaccggtcat cgaccagcgc gtcgagttgg acgacctgct 200761 ccacaagacg ccgaccgcgt tgacggcgct cggacgcgcc aacggaacct acggcgattt 200821 ccagaacttc tacctctgcg acctccagat caagtggaac ggattccaag ccggagggcc 200881 ggtccgcacg gtgaagctct ttagccagcc gacgggtagg tgcacgccgc aatgagaacg 200941 ctggaaccac ccaaccgaat gcgaattggg ctcatgggca tcgtcgttgc gctgctcgtt 201001 gtcgctgtgg gccaaagctt taccagtgtt cccatgctat tcgcaaagcc gagctactac 201061 ggccagttca ccgactccgg cggactgcac aagggcgaca gggtacgcat cgccggcttg 201121 ggagtgggca ccgtggaggg gctcaagatc gacggcgacc acatcgtggt caagttctcc 201181 atcggcacca acaccatcgg caccgagagc cgcctagcca tccgcaccga caccatcctg 201241 ggtaggaaag tgctcgagat cgagccgcgc ggcgcccaag cgttgccgcc cgggggcgtt 201301 ttgccggttg ggcaaagcac caccccgtac cagatttacg acgcgttctt cgacgtcacc 201361 aaggccgcat ccggctggga catcgagacg gtcaagcggt cgctgaatgt gttgtcggag 201421 accgttgatc agacctatcc gcacctgagc gccgccctcg acggggtggc taagttctcc 201481 gacaccatcg gcaagcgcga cgagcagatc acgcacctac tagcccaggc caaccaggtg 201541 gccagcatcc tgggtgatcg cagtgagcag gtcgaccgcc tattggtcaa cgctaagacc 201601 ctgatcgccg cgttcaacga gcgcggccgc gcggtcgacg ccctgctggg gaacatctcc 201661 gctttctcgg cccaggtgca aaaccttatc aacgacaacc cgaacctgaa ccatgtgctc 201721 gagcagctgc gcatcctcac cgacctgttg gtcgaccgca aggaggattt ggctgaaacc 201781 ctgacgatct tgggcagatt cagcgcgtcg ttcggtgaga cgtttgcctc tgggccctac 201841 ttcaaagtgc tgctggccaa cctggtgccg ggtcagatct tgcagccgtt tgtcgatgcg 201901 gcattcaaga agcgtggtat tagcccggag gacttctggc gcagcgccgg gctgccggca 201961 taccggtggc ccgaccccaa tggcacccgg ttccccaacg gtgcgccgcc gccaccaccg 202021 ccggtgttgg agggcacgcc cgagcatccc gggccggcgg tgccgccggg atcgccgtgc 202081 tcctacaccc cgccggcgga cggtctgccg cggccgtggg atccgctgcc ctgcgctaac 202141 ctcactcaag gtccattcgg tggccccgat ttcccggcgc cgctggatgt cgcgacgtcg 202201 ccgccgaacc cagacggtcc accgcccgcc ccgggcctac caatcgcggg acgtccgggt 202261 gaggtgccgc cgaacgttcc cggcacgccg gtgccgattc cacaggaggc tccccccggg 202321 gcacgcacgc tgcccctcgg gccggcgcct ggtccggctc cgcccccggc ggcgccaggc 202381 ccgccggcac caccgggccc cgggccgcag ttgccggccc cgttcatcaa ccccggcggc 202441 accggcggta gtggcgtgac gggaggtagc gagaattgag caccatcttt gatatccgca 202501 acctgcggtt gccgcagctg tcgcgggcct cggttgtcat cggatcgttg gtggtggtgc 202561 tggcgctggc cgccggaatt gttggtgtgc ggctctatca aaaactgacg aacaacacgg 202621 tggtcgccta cttcacccaa gccaatgcgc tgtatgtcgg agacaaggtc cagattatgg 202681 gcctcccggt cggttcgatc gacaagatcg aaccagccgg cgacaaaatg aaggtgactt 202741 tccactacca gaacaagtac aaggtgcctg ccaatgcctc cgcggtgatc ctcaacccca 202801 ccttggtggc gtcgcggaac attcagttgg agccacccta cagaggtggt ccagtgctgg 202861 ccgataatgc ggtgatcccg gtcgagcgca cccaggtacc gacggagtgg gacgagctgc 202921 gggacagcgt ttcgcatatt atcgacgagc tcggcccgac acctgagcag cccaaggggc 202981 cgttcggcga agtcatcgag gcattcgccg acgggctggc cggcaagggt aagcaaatca 203041 acaccacgct gaacagcctg tcgcaggcgt tgaacgcctt gaatgagggc cgcggcgact 203101 tcttcgcggt ggtacgcagc ctggcgctat tcgtcaacgc gctacatcag gacgaccaac 203161 agttcgtcgc gttgaacaag aaccttgcgg agttcaccga caggttgacc cactccgatg 203221 cggacctgtc gaacgccatc cagcaattcg acagcttgct cgccgtcgcg cgcccgttct 203281 tcgccaagaa ccgcgaggtg ctgacgcatg acgtcaataa tctcgcgacc gtgaccacca 203341 cgttgctgca gcccgatccg ttggatgggt tggagaccgt cctgcacatc ttcccgacgc 203401 tggcggcgaa cattaaccag ctttaccatc cgacacacgg tggcgtggtg tcgctttccg 203461 cgttcacgaa tttcgccaac ccgatggagt tcatctgcag ctcgattcag gcgggtagcc 203521 ggctcggtta tcaagagtcg gccgaactct gtgcgcagta tctggcgcca gtcctcgatg 203581 cgatcaagtt caactacttt ccgttcggcc tgaacgtggc cagcaccgcc tcgacactgc 203641 ctaaagagat cgcgtactcc gagccccgct tgcagccgcc caacgggtac aaggacacca 203701 cggtgcccgg catctgggtg ccggatacgc cgttgtcaca ccgcaacacg cagcccggtt 203761 gggtggtggc acccgggatg caaggggttc aggtgggacc gatcacgcag ggtttgctga 203821 cgccggagtc cctggccgaa ctcatgggtg gtcccgatat cgcccctccg tcgtcagggc 203881 tgcaaacccc gcccggaccc ccgaatgcgt acgacgagta ccccgtgctg ccgccgatcg 203941 gtttacaggc cccacaggtg ccgataccac cgccgcctcc tgggcccgac gtaatcccgg 204001 gtccggtgcc accgacgccg gcaccggtgg gggcgccgtt gcccgctgag gcaggagggg 204061 gtcaatgatg agcgtgctgg cgcggatgcg ggtgatgcgc caccgagcct ggcaggggct 204121 ggtgttgctg gtgctcgcac tcttgctgag ttcgtgcggc tggcgcggca tctccaatgt 204181 ggcgatcccc ggcggcccgg gcaccggccc gggctcctac accatctacg tgcagatgcc 204241 ggacacgttg gcgatcaacg gcaacagtcg ggtcatggtg gccgacgtct gggtcggatc 204301 gatccgcgcg atcaagttga agaactgggt ggccacgctg acgctgagcc tgaagaagga 204361 cgtcacgcta ccgaaaaatg ccaccgccaa gatcgggcag accagcctgc tgggttcgca 204421 gcacgtcgag ctggccgcgc cgccagatcc gtcgccggtg ccgctgaagg atggtgacac 204481 catcccgttg aagcgctcct cggcctatcc caccaccgag cagacgctgg ccagcatcgc 204541 caccttgttg cgcggcggcg gcctggtgaa cctcgaaggg attcagcaag agatcaacgc 204601 catcgtgacg gggcgggcgg accagatccg ggcctttctt ggcaagctcg acaccttcac 204661 cgacgagctc aaccagcaac gcgatgacat tacccgcgcc attgattcca ccaatcggtt 204721 gttggcttat gtgggcggtc gttcggaagt cctcaatcgg gtgctcaccg acctaccgcc 204781 attgatcaag cactttgcgg ataagcagga actgttgatc aacgcttccg atgcggtagg 204841 ccggctcagc cagtccgccg accagtatct ttcggctgcc cggggcgatc tgcaccagga 204901 cctgcaggcg ctgcaatgcc cgctcaagga actgcgtcga gccgctccgt atctggtggg 204961 tgcgctcaaa ttgatcctca cccagccctt tgacgtcgac accgtgccgc agctggtgcg 205021 gggcgactac atgaacttgt cgctgacgct ggacctgacc tacagcgcca tcgacaatgc 205081 gttccttacc gggaccggat tctccggtgc gttgcgcgcc ctcgagcagt cttttggccg 205141 cgatcccgag acaatgattc ccgacatccg gtacacaccg aaccccaacg atgcgccggg 205201 cggcccgctg gtagaaaggg gaaatcgcca gtgctgactc gcttcatccg acgccagttg 205261 atcctttttg cgatcgtctc cgtagtcgca atcgtcgtat tgggctggta ctacctgcga 205321 attccgagtc tggtgggtat cgggcagtac accttgaagg ccgacttgcc cgcatcgggt 205381 ggcctgtatc cgacggccaa tgtgacctac cgcggtatca ccattggcaa ggttactgcc 205441 gtcgagccca ccgaccaggg cgcacgagtg acgatgagca tcgccagcaa ctacaaaatc 205501 cccgtcgatg cctcggcgaa cgtgcattcg gtgtcagcgg tgggcgagca gtacatcgac 205561 ctggtgtcca ccggtgctcc gggtaaatac ttctcctccg gacagaccat caccaagggc 205621 accgttccca gtgagatcgg gccggcgctg gacaattcca atcgcgggtt ggccgcattg 205681 cccacggaga agatcggctt gctgctcgac gagaccgcgc aagcggtggg tgggctggga 205741 cccgcgttgc aacggttggt cgattccact caagcgatcg tcggtgactt caaaaccaac 205801 attggcgacg tcaacgacat catcgagaac tccgggccga ttttggacag ccaggtcaac 205861 acgggtgatc agatcgagcg ctgggcgcgc aaattgaaca atctggccgc acagaccgcg 205921 accagggatc agaacgtgcg aagcatcctg tcccaggcgg cccccaccgc cgatgaggtt 205981 aacgcggtat tcagcggtgt tcgcgattcg ctgccacaga ccctggccaa tcttgaggtt 206041 gtgttcgata tgctcaagcg ctaccacgcc ggcgtggagc aattgttggt gttcctccca 206101 cagggtgccg cgatcgcaca gaccgtactc acgccaactc cgggtgctgc ccagctgccg 206161 ctcgcgccgg cgatcaacta tccgccgccg tgcttgacgg gttttcttcc tgcatcggag 206221 tggcggtctc cggccgatac cagtcccagg ccgttgccgt cgggaaccta ttgcaagatt 206281 ccccaggatg cccagctgca agtccggggg gcgcgcaaca ttccctgtgt cgatgtcctg 206341 ggcaaacgag cggcgacgcc gaaggagtgc cgcagtaagg acccgtacgt tccgctgggt 206401 accaacccgt ggtttggtga tccgaaccag attctcacct gcccggcacc tggagcgcgc 206461 tgcgatcagc cggtgaagcc cgggttggtg attccggcgc cctcgatcaa caccggtttg 206521 aatccggcgc ccgccgatca ggtgcaagga acgcccccgc cggtcagtga cccgttgcaa 206581 agaccgggtt cgggtactgt gcagtgcaac gggcagcagc ctaacccgtg cgtctacact 206641 ccaacatcgg gcccgtcggc ggtctatagc ccggccagcg gtgaactggt ggggccggat 206701 ggtgtcaagt acgccgtcgc aaactcgagc acaacaggag acgacggatg gaaggagatg 206761 ctggcgccgg ccagctgaac cctgccgatg cgaataagtc gtcgtctacg gaggtgaagg 206821 cggcggattc ggcggaatct gacgccggag ccgaccagac tggcccgcag gtgaaggcgg 206881 cggattcggc ggaatctgac gccggagagc tcggcgagga cgcgtgccca gaacaggccc 206941 tcgtcgagcg gcgcccgtcg cggttgcggc gaggctggct tgttggcatt gcggcgacgc 207001 tgctcgcgtt ggccggtggc cttggcgcag cgggttattt tgcgttgcgc tcacaccagg 207061 aaagccaatc aatcgcgcgc gaggaccttg cggccattga ggccgctaag gattgcgttg 207121 cggccacgca ggcacccgat gctggggcga tgtcggctag catgcagaag atcatcgagt 207181 gtggcaccgg tgatttcggt gcccaggcgt cgttgtacac cagcatgctc gtcgaggcgt 207241 atcaagcggc cagcgtccac gtgcaagtga ccgatatgcg cgcggcggtc gagcgcaaca 207301 acaatgacgg gtcggtcgat gttctggtgg cgctccgggt caaggtgtcc aacaccgact 207361 cggatgccca tgaagtcggc taccgtcttc gggtccggat ggcactggat gagggccgct 207421 ataagatcgc caaactcgac caggtgacga agtgacggtg gtggtcgaga agacgccgac 207481 caccctgccc caggcgacac cgaacggtgc agcgccctgg catgttcggg cgggcgcctt 207541 cgccatcgac gtgctgcccg ggctcgccgt ggcggcgacc atggcgttga cggctttaac 207601 ggtgccgccg ggcagcgcgt ggcggtggtt atgcgcttgt ctgctcggat tgaccattct 207661 ccttctggcc gttaaccggt tgttgttgcc gacgattacc ggatggagtc ttggccgcgc 207721 tcttaccggc atccgggtgg ttcggcgtga cggctccgcc atcggtccgt ggcggttgct 207781 ggtccgggat ttggcgcact tggtggacac cctctcgctg tttgtgggtt ggctgtggcc 207841 gctgtgggat tcgcggcgac gcaccttcgc cgacctgttg ttgcgcactg aggtgcgacg 207901 tgtcgaaccg gtgcagcggc ccgcggtgat acggcgactg acggcggcgg tggcattggc 207961 ggcggcgggc gcgtgcgcga gcgcaaccgc ggtgggcgct gcggtggtgt acgtcaatga 208021 atggcaaacc gatcacactc gcgcgcagct cgcaacgcgg ggcccgaagc tcgtggtcga 208081 cgtcctgagc tacgaccccg aaacggtgca gcgtgatttc gaacgggcgc gatcgctggc 208141 caccgacagg taccgcccgc agctgagcat ccaacaggat tcggtgcgcg agtcgggacc 208201 tgttcgtaac cagtactggg ttaccgacag cgcggtgctg tcggcgacac cagctcaggc 208261 gaccatgctg ttgttcatgc agggtgaacg cggtacacca cccaatcagc ggtatattca 208321 gtcaactgtg cgggcgatct tccaaaaatc gcgcgggcaa tggcgcctcg acgatctggc 208381 agtcgtgatg aaaccccgac aacccaccgg cgaaaaatga gcccccgtcg taagtttgaa 208441 cccggcgagg gggcgctgct ggccccgcag tcaatcgaac cgtcgcggcg atggggtttg 208501 ccgctggctc tgaccgcatc cgctgtggtt atggccgcgg cgatctcagc ctgtgcgctc 208561 atgcggatct cccatgaatc gcaccagcga gcagcgcaca aggatatcgt gatgctcagt 208621 gatgtccgat ctttcatgac catgttcacg tcaccggatc cgtttcacgc caacgaatat 208681 gcggagcggg tgctgtccca cgccacgggc gacttcgcca agcagtacca cgaaagagca 208741 aacgatatcc tgattcgcat ctccggggtg gaaccgacca caggaacggt tctagacgcg 208801 ggcgtacaga ggtggaacga ggatggtagt gccaacgtgc tggtggtcac ccagatcacc 208861 tcgaaatccg cggacggcaa gcgggtggtc tcgaacgcca atcgttggct ggtaacggct 208921 aagcaggaag gtaacgagtg gaagatcagc agtctgcttc cggtgatctg acccaaaagt 208981 ccgttgccaa cggagagtcc accgacacgg catccgcagc caccgagggc caccggggcg 209041 agatcgacgc cgcgggagag ccggacgaac gcggtgccgc cgtggctgac agccaagctg 209101 acgaggatga ttcggccgcg acggctgcca ggggcggcaa gacacgggca agacgatcgc 209161 gtggcaggcg gttagcgatc acggtcggcg tggccgctgc gttgttcgtg ggctcggcag 209221 cgttcgctgg tgcgacggtg gagccctacc tctccgagcg cgccgtggtg gccaccaagc 209281 tcatggtcgc gcggaccgcc gccaatgcga tcacgacgtt gtggacctac acgccggaga 209341 acatggacac cctggccgat cgggccgcga attacctcag cggtgatttc gcggctcagt 209401 accgcagatt cgtcgaccag atcgccgcag caaacaaaca ggccaagatt accaacgata 209461 ccgaggtcac cggtgctgcc gtggaatcgc tgagcggccg ggatgccgtt gccatcgtct 209521 acaccaacac cacgaccacc agtccggtga ccaagaacat cccagcattg aagtatctgt 209581 cctaccggct gttcatgaag cgttatgacg cgcggtggct ggtgaccagg atgacgacca 209641 tcacctcgct ggatttgacg ccgcaggtgt agcgggaccg agcccgccgg cgctgcgaag 209701 ccttagttga acgccagcca gctgggcagc gcccgctcat gggagtcaca gagcacctga 209761 cgggtgtcgc acgatccttt cggcgaccca gcgccggccc acatgccccc ggtatcacgg 209821 cgcagaacga tcgccgatga ccccccgccg tcgagcagaa tcgcggtgtc actacccagg 209881 ccgcggaaca ggtcttggat gttgtccggg gtgtagttgc cgccctggaa gatgtacatc 209941 tcgtccttct gcttcgcata ggcaagcgcc gttcgcgcgg cgctgggacc gccgtcgtgg 210001 agctggccgg tattgccggg ggataacagc ccgattccgg ccacggcgac gaaccgtgca 210061 ttcttgttga gcaagtcctc gatcaccgga gtggcaagat cgtagtcctg tctgcttttg 210121 ggccgcaaaa catacggtgc accaccgacc ggaaggatca tcgtcgtcag cgagctccac 210181 aactcatttc ctccggaaag gccctgcttt ccggcgtagg cgacggtgcc ggtgaccgct 210241 tggttggcgc gtccttgtcc gcgggtgttg tccacgtagg cgcccagcgg tgagctgcag 210301 ccggtcgacc gccagctgcc ccccttttgt ccgcgaacgt cgaagaagtt ggcgttgacc 210361 gcaatggtgg gtcgccccat acgctgccac gccttaagcg gcgggtagat ctcggaggct 210421 tgccacaagc cttcaccggt gcgagcacct gggttgtgtt cgcagcgcgc ctggtctcca 210481 gtgtgggtgt ctaccagtag atgtggtgaa agccgttggg aggcattctt gatgatcatc 210541 agatggccgc cgttgttcat ctcgtaccag tgaccacctg cgttgagcag cggcatagga 210601 tgaccgccgc cgaagttgta caccaggtat gagcccctgg tggtggctat cgcttgggcg 210661 agcatctcgc gcccgtcggc ggcgcgggcg gccggctgcc cggtggtgca ggcgagggcg 210721 gcgcacaccg ccaacgcggc gtagcaagcc gtcaatcggc gcaggctggc agtcggtgtc 210781 agcacagcaa ccctctcggc ccgaatccac atgcaaccat cccagcatta ggcacactga 210841 tcacactgtc aacttcagta acagctgcgt gacggttcgg ccgcgttcga attacggttg 210901 ttcgcttgag ctttcgcgtg cgctttggcg agcttggtat tgagcttggt gttccacggc 210961 gatcgccatc tcgaccgctc ccggaattcg gtggaagctg ctgcggtcgt acaggtgtgt 211021 gatgaatcca cccagcaaca gcccgatgat cagcccgatc gacgtcatgg tcagggcctg 211081 ggacaggccg gcgtcggcgt ttccgttcag atacagcagt gaccgcacac ctaggaacac 211141 ctggtgcatc ggctcgaatt gagccaacca gcggaagaac gctggtacgg cttccagcgg 211201 gacggtcgcg cccgccgacg gcaatccgag gatgacgaag atcaacatgc tgaccaacag 211261 gcccatcgag cccagcaccg cgatcagcga gctggacgtg acgccgaccg ctatgatcgc 211321 gaatactccg tagagccata cttgccaccc gagcggaatc ggcatgccca ggccgtgggc 211381 gatcgccagg tagacacccg aggtgagcaa cgccagcacc accatcaccg cccacttgac 211441 caacagcgta cggaagcgag agatgttgac ctgctcggcg aagcgataga cggggccgaa 211501 ttcggctggt acatagccaa gcatcgagtc caccagggtg ctcaccacga tgctgccggt 211561 aaagcccgcc aatagcagca agagggcgta gtaaaacgcc gacagcccgt tgccggtgcc 211621 gttgggcagt gggttatagg cggtggattt gacatcgatg ggactggcca gcccggccgc 211681 cgccgccccg gccagtgcca caccgccggt ctgggccgct acctccgcgg taagtcgctc 211741 gcccactttg ccgttgacca ccgtcagtgc ccgggtcagc gtctggccgg cgatgctagc 211801 tgccagcgtg cccgcccgcg gattcgttga gatcgtgatc gcgggccggt ctgtgcgggt 211861 tggcgtcacc gcactcgccc cgaagtcccg tagctgcgac gagaaggtcg gcggtatcag 211921 cgccgagccg tacaccgccg cggtgtcgag cagccgcctg gcctcgtccg gcgaaaccac 211981 tcggatgtcg aacttgttct tgtccaagcc ggaaaccaga ccgtcgacaa tctgctggcc 212041 cgcgggcccg gcgtcctcgt tcaccaacgc gattgggaaa tgccgcaaat tggtcatggg 212101 gtttaggatg ccgcccagat agagcgcggc cagcgccgac atcagggcca acgtggtggc 212161 gatcggtgcc atccagaaac gcaccgtccg aatcgctttg acgttccgct tggggttggg 212221 tgcggcgggg cgcggctgcg cttgagacat gcgggctcct gtctgtcgtg gccactctat 212281 gttgccgaat cgcccagctt cgcgtgcatc tcccagatca gtacttccga cggctcattg 212341 gcggtcaggc cgcgggcgtc agcgtcggtg aaccgcaccg cgtccccgtc ggcaagctcg 212401 ccgccacctt ccagagtgag gcggccgtag gcgacgaaca gatgcaggaa gggtgcgcag 212461 ggcaggctga ccgtagcgcc gggccgcagc cgcgcgccgt gcaacgaggc gctgctgtta 212521 tgcagggtga gcgctgcgtc ttgcccgggt atgcccgacg cgatggttac caggccggcg 212581 cgcaacagtt cgtcgtctat ctcctgctgt tggtagctgg cagtgatgcc ggttgcatcg 212641 ggtattaccc acatctgcac gaaatgcacc ggctcggtag cagaatcgtt catttccgaa 212701 tgcaagattc cggtgccggc cgacatgcgt tgggccagac cgggatagat cactccgcta 212761 ttgccggcgg aatcctggtg tctgagcgct ccccgcagca cccaggtcac gatttccatg 212821 tcacggtgtg gatggggatc aaaacccgaa gccggttcca tttggtcgtc gttgttcacc 212881 aacaggagcc cgtggtgggt gttgtcggga tcgtagtggt cgccgaatga gaacgaatgc 212941 cgggatttca gccaggacgt cgtggtgacc gcccggtcgg ccgcacgcct tatctcgacg 213001 gtggcggtca tgacgtcacg ttcgccatca cagcgaatcg ggcaggccga atttcgggaa 213061 caaggtggtg tcgaggaagg ccaccacgtg tgacacgcgg tcggcggcca tgtccagcac 213121 gtgtagctga aaaggcaggt gcacgtcacc ggcacgcatg tacatggccg cggcgggctg 213181 gccgttggcg atcaacgaaa tcaggcgcat atcgccaggc gaataggcgg ggcactgttg 213241 gtgaatgagg gtgacgatgg cctgtgcgcc ctggtaccag ccggtatacg gcggcatttc 213301 ccagatcgcc tcggcggtga acagctcgac caaccggtcg atgtcataag cctcgaacgc 213361 ggcgatatag cgggccaaca ggtcttgcgc ctcgggtgaa tccggcgcgg acaaccggtc 213421 ggcggcgctg ggccggaccg tctgcagctg agagcgggcc cgctgcagca ggctattgac 213481 ggcgacggtg ctggtaccga tcgcgtcggc cacctcggcc gatttccact gcagcacgtc 213541 gcgcagcagc agtacggctc gctgccgggg tgagaggtgc tgcagagccg ccacaaaggc 213601 caaccgcacc gattcccggt tcccgacgat cgttgaggga tcagcagggt cgtccgtcac 213661 gtccggcagc ggctccagcc aggacacctc tcgacgttcc accaactccc cggacggatc 213721 ggcactcggc cgcccgagcc ccgtcggcaa cggccggcgt cgacggccct ccaacgccgt 213781 caggcaggtg ttggtggcga tccgatgcag ccaggtgcgt agcgaggact tgcccgcgaa 213841 gccctcatag gccttccagg cccgcagcag cgtctcctga acaaggtctt ccgcgtcgtg 213901 cagcgagcca gtcatgcgat agcagtgtgc gagcagttca cgccggtagg gctcggtgtg 213961 ggcggagaag tccccgcgcc gttcgtcggc gggctcgcgg ccagagtttt ctgcgagcac 214021 actcacgtca atgagcctac gcagagtctc cgacactctc accggagcag ccgttacgct 214081 cccggtaatg actaccaccc ggactgaacg gaatttcgcg ggcatcggcg atgtgcgcat 214141 cgtctacgac gtctggacgc cggacaccgc gccgcaagcg gtggtcgtgc tggcccatgg 214201 tctgggcgag catgcccgcc gctacgacca tgtcgcgcag cggctcggcg cggccggcct 214261 ggtcacctat gcgcttgacc accgcgggca tggccgctcg ggtggcaaac gggtgctagt 214321 gagagacatc tccgagtaca ccgctgactt cgacaccctc gttgggatcg ccacccggga 214381 atatcccggg tgcaagcgca tcgtgctcgg gcacagcatg ggcggcggca ttgtgttcgc 214441 ttacggtgtc gaacgtccag acaactacga cctgatggtg ctttcggcgc cggcggtggc 214501 ggcacaggac ctggtgagcc cggtagtggc ggttgccgcc aagcttctgg gcgtcgtggt 214561 gcccggcctg ccggtgcagg aactggattt tactgccatc tctcgcgacc ctgaggtggt 214621 ccaggcttac aacaccgacc cactcgtgca ccacggacgg gttccggccg ggattggccg 214681 cgcgctgctg caggtgggcg agaccatgcc gcggcgagca ccggcattga ccgcgccgct 214741 gctagtgctg cacggcaccg atgaccggct gatccccatc gagggcagcc gtcgcctggt 214801 cgaatgtgtg ggatcggccg acgtgcagct gaaggagtat cccgggctgt accacgaggt 214861 gttcaacgag ccggagcgca accaggtgct cgacgatgtg gtcgcctggc tcaccgagcg 214921 gttgtaggcc gagccgacct gtcgcagccc tccactagtt ttggcgccat gaccaacgac 214981 aagatgctgg cccgcatcgc agccctgctg cgccaggccg aaggcaccga caacccgcac 215041 gaggccgacg cgttcatgag caccgcacaa cggttggcca cggcggcatc catcgacctg 215101 gcggtggccc ggtcgcacgc gggcaaccgt tcacccgcgc aggccccgac acagcgcacc 215161 atcaccatcg gggcggcggg cacccgcgga ttgcggacct atgtgcagct cttcgtgctc 215221 atcgcggcgg ccaacgacgt gcgctgcgac gtggcatcga attcgacgtt cgtgtacgcc 215281 tacgggttcg ccgaggacat cgacaccagc cacgccctat acgccagcct ggtggtccag 215341 atggtccggg catccgacgc ctacctcgcc tcgggagcgc accggcccac gccgacgatc 215401 accgcccgac tcaacttcca gctggcgttc ggcgcccggg tcggccagcg cttggccgat 215461 gcccgagagc agactcggca ggaagccacc aaggaccgtg atcgtccgcc tggtaccgca 215521 attgccctgc gggacaagga catcgagctg catgagtact accgtcgttc ctctaaggcg 215581 cgcggcgcct ggcgagccag ccgggccacc gcgggatact cgtcggcggc acggcgcgcc 215641 ggtgatcgag cgggacggca agcacgactc gggaacaacc ccgagctgcc cggggcacgg 215701 gccgcgctgg gccggtgatc ggcgcggacg ttccgcggga ttcccagcgt gccagggtgt 215761 acgcggccga ggcgttcgtc cggaccttgt tcgaccgcgt caccgcacac ggctcaccga 215821 cggtggagtt cttcggtacc cagttgacgc tgcccccaga aggtcggttc ggttcggtgg 215881 catcggtgca gcgttatgtg gacgacgtgc ttgcgctacc ggcggtaggg cagaactggc 215941 cgacggtgtc gccggtgcgc gtgcgggcgc gccgggcggc caccgcggcg cactatgaaa 216001 accatggcgg cacaggcact attgcggtac ccgaccggca caccgccggt tgggcgatgc 216061 gcgagttggt cgtgctacac gaagtggcgc atcatttgtg ccaggtgcca ccgccacacg 216121 gacccgagtt tgtggcgacg gtgtgcaccc tgacagagct ggtgatggga cccgaagttg 216181 gtcacgtgtt tcgcgtcgtc tacgcgcagg agggcgtgcg ctgaacgagc tagacgccga 216241 cctgcgggca cgtgaggtcg aggcccagat gaccgacgac gagcgattct cactgttggt 216301 cggcctgacc ggggccagcg atctgtggcc ggtgcgcgat gaacgcatcc cacagggcgt 216361 gccgatgtgt gccgggtatg tgccggggat tccccggctc ggggtcccgg ccttgttgat 216421 gagcgatgcc ggtctgggcg tcaccaaccc tggctaccgc cccggtgaca ccgctacggc 216481 gctgcccgcc ggccttgccc tagcggccag ctttaacccg gtgctggccc ggtcctcggg 216541 caaagcgatc ggccgggagg cgcgcagtcg cgggttcaac gtgcaactgg ccggcgcaat 216601 caatctggcg cgcgacccgc gtaacggccg caacttcgag tacctttccg aggacccgtt 216661 gttgagtgcc acgatggccg cggagtcgat catcgggatt cagcagcagg gtgtcattgc 216721 gacgacgaaa cacttctcgc tgaactgcaa cgaaaccaat cggcactggc tggacgcggt 216781 catcgatccc gacgcgcacc gcgagtcgga cttgttggcg ttcgagatcg tcatcgagcg 216841 gtcgcagccc ggcgccgtga tggcggcgta caacaaggtc aacggagatt acgctgccgg 216901 caacgaccac ttgctcaacg acgtgctgaa aggtgcttgg ggataccgcg gttgggtgat 216961 gtcggattgg ggcggaacac ccagctggga gtgcgcgctg gccggcctgg accaagagtg 217021 cggtgcgcag atcgatgcag tgctgtggca gtcggaagca ttcaccgacc gcctgcgtgc 217081 cgcctacgcc gacggcaatc tacccaaggg gcgcctgtcg gacatggtac ggcggatcct 217141 gcggtcgatg tttgccgtcg gaatcgaccg atggaaacca gcgccggcgc cggacatgaa 217201 tgcgcacaac gagattgccg cacagatggc gcggcaagga atcgtgctgc tgcaaaaccg 217261 agggctgctg ccgctcgctc ccgaatcggc cgggcgtatt gccgtcatcg gcggctatgc 217321 acacctcggt gtgccagccg gttacggttc gagcgccgtc accccgccgg ggggctatgc 217381 gggcgtgata ccgatcggtg ggtctggctt ggcagccggg ttgcgtaatc tctacctgct 217441 gccgtcaagc ccgctgagtg agttgcgaaa gcggttgccc aacgcgcagt tcgagttcga 217501 tcctggcatc aacccggcgg aggcggtgct ggctgcgcgg cgagcagaca tcgcgatcgt 217561 gttcgcgatc cgtgccgaag gagagggctt cgacagcgcc gatctgtcgc tgccatgggg 217621 tcaggatgcg ctgatcgccg cagtcgcgtc cgccaacgcg aataccgttg tggtgcttga 217681 gacgggcaac ccggtgacca tgccctggcg cgactcggtg aacgccatca tgcaggcctg 217741 gtatccgggc caggcgggtg gccaggccgt tgcggagatt gtgaccgggc aggtgaatcc 217801 ttcgggccgg ctgccgatca ccttcccggt cgatctcggt cagacgccac gctcgcaacc 217861 gcccgagctc ggtgccccgt gggggacatc gaccacgatc cactacaccg agggcgccga 217921 tgttggttac cgctggtttg ccagcacaaa tcagaccccg atgttcgcgt tcggtcacgg 217981 cttgtcctat accagtttcg agtatcgtga cctggtggtg acgggcggcc acaccgtgca 218041 cgccagtttc agcgttacca acacgggcga ccgcagcggg gcggatgtcc cgcagctgta 218101 tatgatcgca gctcccggcg aatcgcggtt gcggttgctg ggattcgagc gggtcgagct 218161 cgaacccggc cagactcggc gggtaaggat cgaggcggac ccgcgactgc tcgcccgcta 218221 cgacggcgag gccagaagct ggcgcatcga gccgggcggt tacacggtgg cggtgggcgc 218281 ttcggcggta gcgctgaagc tggcagccaa ggtcaagctg gccggccgtg ggttcgggcg 218341 gtgacgggcc ggcccagcga ggcccgtacc cacgaccggc atgataggtc tacttgaccg 218401 gggccaattc gtcgccgcag gtgcagcggt aggcgtcacc ggcgccagca cagtggcatg 218461 ggacttcgat gcgaacccga cagccacagc cctcgtggct gcaggtcagc aaggtcccag 218521 cctcgtagtt cgtcattcgt atcaccctca tccgtgtcgg ggatccccga ggaatcccag 218581 gtggtcagct gtcggtaatc cagaacagct acttaaatat ataccctata cgggtatctg 218641 gtaaaccccc aggccggtgg gcggttgcct gctggcgcgc gacggtcggt ggtcgcgcta 218701 gcgtttgggc atggaccagc aacccaaccc gcccgacgtc gacgcatttt tggacagcac 218761 actggtcggc gacgatccgg cgttagccgc ggcattggcg gccagcgacg cggccgagtt 218821 accccgcatc gcggtgtcgg cacagcaggg caagttcctg tgcctgctgg ccggtgccat 218881 ccaggcgcgc cgcgtcctcg agatcggcac actcggtggc ttcagcacca tttggctggc 218941 gcgtggcgcg ggcccacagg gacgggtggt cacgctggaa taccagccca agcacgctga 219001 ggtcgcccgg gtgaacctgc agcgagcggg cgtcgccgat cgggtggagg tggtcgtcgg 219061 tccggcgctg gacacgttgc cgacgttggc cggtggcccg ttcgacctgg tgttcatcga 219121 cgccgacaaa gagaacaacg tcgcatatat tcagtgggcg atccggttgg cccggcgcgg 219181 cgcagtgatc gtggtggaca acgttattcg tggcggcggg attcttgctg agtccgacga 219241 tgccgacgca gtggcggcac gtcggacgct gcaaatgatg ggtgagcacc ccggcctaga 219301 cgccacggcg atccagaccg tcgggcgcaa gggctgggac ggtttcgccc tcgctttggt 219361 gcggtagccg ctggtccggc gcccaatttt cgttgctggc atcccgaaaa cgggcgtaat 219421 cttggagcag atggatgggt ggcagcgagc ccaaaagttt tgctgcataa cagaaaggtt 219481 gcaaaatgag tacagtccat tcatcaattg atcaacaccc tgatttgttg gctctgcgtg 219541 ccagcttcga ccgcgccgcc gagtcgacga tcgcgcattt cacattcggt ctggccctgc 219601 tggcgggcct gtatgtggct gcatcgccgt ggatcgtcgg cttcagcgcc accagagggc 219661 tgccaacgtg tgaccttatc gtggggatcg cggtcgcgta cttggcgtat gggttcgcgt 219721 cggccctgga tcgcacacac ggcatgacct ggacgctacc cgtgctcggt gtgtgggtca 219781 ttttctcgcc gtgggtgcta ccaggggtcg cggtgacggc tggcatgatg tggtcgcaca 219841 tcatcgcagg tgcggtggta gccgtcctgg gcttctactt cgggatgcgc acgcgggccg 219901 cggctaacca aggatagttc gaagttcgcg agccagaggg caactcggga atgtcctggc 219961 cggggcggtc ccggccaggc agcggctagt tgcggctagc cgcagaccgc gccgaccgcg 220021 gcagagctga ccagcttgac gtacttggac agtacgccag tagtgtagcg cggcggtgga 220081 ggactgaaat cctgttgtcg ggacgcgaat tcggccggat cggccaacac atcgagaacg 220141 cggccggcca cgtcgagccg gatccggtcg ccgttgcgca gaagtgcgat cggtccgccg 220201 tcgaccgcct ccggtgcgat gtggccaacg cacaggccgg tggttccacc ggagaaccgg 220261 ccgtcggtca gcagtagaac atctttaccg agtcctgcgc ctttgatcgc gcctgtgatg 220321 gcgagcattt cgcgcatccc ggggccgccc ttgggtcctt cgtaccggat taccacggcg 220381 tcgcccacgg taatggtgcc atcctcaagg gcgtccagcg cagcgcgctc gccgtcgaaa 220441 actcttgcgg tgccttcgaa tacgtcggaa tcgaatccgg cggtcttgac caccgcacct 220501 tcgggtgcca gcgatccgtg caggatggtg atgccaccgc tcgggtggat cgggtttgcc 220561 aacgcacgta gcaccttgcc atctggatcc ggcggggtga tggcagccag attctcggcc 220621 atggtgtgac cggtaaccgt caggcagtcg ccgtgtagca gaccggcgtc cagcagcgcc 220681 ttcataacca ccggcacacc gccgatgtga tcgacgtcgg acatcacatg gcggccgaac 220741 ggcttgacat cggccaaatg cggcaccccc gacccgatcc ggctgaagtc ctgaagcgat 220801 agtgcgacgt tggcctcgtg ggcgatggcc agcagatgca gcaccgcgtt ggtcgagccg 220861 ccgaacgcca ttaccaccgc gatggcgttc tcgaacgcct ccttggtgag gatgtcgcgg 220921 gcggtgatgc cgcggcgcag cagctcgacg acggcctgac cgctgcgacg cgcgaacccg 220981 tcgcgccggc ggtcggtcgc cggcggtgcc gcgctgcccg gcaacgacat gccgagcgcc 221041 tcggcggcgc tggccatggt gttagcggtg tacatgccgc cgcatgcccc ttcgccgggg 221101 cagattgccc gctcgatggc atcgacgtcg gcgcgactca tcaaaccgcg agagcacgct 221161 ccgaccgcct cgaaggcgtc aatgatggtg acgtctcgtt cgctaccgtc ggagagcttg 221221 gcccggccgg gcaaaataga gcccgcgtag aggaacaccg ccgccagatc cagtcgtgcg 221281 gcggccatca gcattccggg cagcgatttg tcgcatccgg ccagcagcac cgaaccgtcg 221341 agtcgttcgg cctgcatcac gacttcgacg ctgtcggcga tcacctcacg ggaaaccagc 221401 gagaagtgca tcccctcatg acccatggag atgccgtccg aaaccgagat cgtgccgaac 221461 tcaagcggat agccgccggc cgaaaacacc ccctccttga ccgcgttggc cagccggtcc 221521 aatgagagat tgcacggcgt gatttcgttc cacgacgacg cgaccccgat ctgtggcttc 221581 gcgaagtctt cgtcgtccat gcccaccgcc ctcaacatgc cccgggcagc ggccttctcc 221641 aggccgtcgg tgacgtctcg actgcggggc ttgatgtcgg cgaccgtcga gacggaagcg 221701 gcttcgtcgg tggtttgcgg cattgttcaa gtatgcggcc caaggatgcg ctcgccgcgg 221761 cacggttgcc aaattctagg tccgataccc cgctggggta caagatatga tgggtagcat 221821 gcctgggccc tgctttcggg ttggcgagta tctctggaga tggcgagtaa atgacagcag 221881 cacacggcta cacgcagcaa aaggacaact acgccaagcg gttgcgtcgc gtcgaggggc 221941 aagtgcgcgg catcgcgcga atgatcgagg aagacaagta ctgcattgac gttctgaccc 222001 agatcagcgc cgtcaccagt gcgttgcggt cggtggcgct gaacctgctg gacgagcacc 222061 tgagccactg cgtcacccgt gccgtggccg agggcggtcc tggggctgac ggcaagctgg 222121 cagaggcctc ggcagcaatc gcgcgcctgg ttcgttcctg atcgccgcgt gttgaagcgc 222181 aaacctgccc accacccgtt ggtgcggtgc gtacggtagg ggcagcgtaa tcgtgccctg 222241 aacgaccccg aaccatcgaa cttcgcggcc gattccgcgc aggacgcgat gactgcccca 222301 accggaacct ccgccactac gacgcgaccg tggacgccac ggatcgccac gcaactgtcc 222361 gtgctggctt gcgcggcctt tatctatgtc accgccgaaa tcctgccagt gggcgcgctg 222421 tcggcgatag cgcggaactt gcgcgtcagc gtggtcctag ttgggacctt gctgtcctgg 222481 tatgcccttg tcgcggccgt gacaacggtt ccgctggtgc gttggaccgc acactggccg 222541 cgccgccggg ccctggtggt cagcctggtc tgcctgaccg tctcgcaact cgtctcggcg 222601 ctggcgccca acttcgcggt gctggccgcc gggcgggtgc tctgcgcggt cacccatggc 222661 ctgctgtggg cggtcatcgc gccgatcgcc acccggctgg tgccgcccag tcacgccggg 222721 cgcgccacga cgtcgatcta catcggaacc agtctggcgc tggtcgtcgg tagcccactc 222781 acggctgcca tgagcctgat gtggggttgg cggctggcgg cggtgtgcgt gaccggcgcg 222841 gcggccgcgg tcgccctggc cgcccggctg gcgttgccgg agatggtgct gcgcgccgac 222901 cagctcgagc acgttggccg acgggctcgt caccaccgta atcctcgcct ggtcaaggtc 222961 agtgtgctca cgatgatcgc ggtaaccggc catttcgtgt cctacaccta catcgtggtg 223021 atcatccgcg acgtcgtcgg tgtacgtggg ccgaatctgg cctggctgct cgccgcctat 223081 ggggtcgccg gcctggtgtc cgtgcccctg gtggcgcggc cgttggaccg ttggcccaag 223141 ggcgccgtca tcgtcggtat gaccggactg acggcggcgt tcaccttgct gaccgcgctg 223201 gcattcggtg aacgccacac cgcggcgacg gcactgctgg gcaccggtgc gattgtgctg 223261 tggggagcct tggccactgc cgtgtcaccg atgctgcaat cggcggcgat gcgtagcggc 223321 ggcgacgacc ccgacggggc ctcaggtttg tatgtgacgg cgtttcagat cggcatcatg 223381 gccggcgctc tgctgggtgg gctgctctac gagcgcagct tggcgatgat gctgaccgcg 223441 tcggcgggtt tgatgggtgt tgcgttgttc gggatgacgg ttagccagca cttgttcgag 223501 aatccgactc tgagtcccgg cgacggctaa cacagcaggt cagcgggacc agttggtgcc 223561 gctatgccac actgggctga agaacgtcac cggagggaaa gcaattatgt cgcgctggaa 223621 gcagggctgg acgaggggga gtctattcgc cgctctgaac atagccgcag tggttgcggt 223681 gctgatgctg ggtgctggcg ttgccgtggc ggacccggac gcggctcccg gcgatcccgg 223741 aggtcccggg gccccggggg cacagcggga cccgtcgacc cgccggcagt tgacctgttg 223801 gcgccgccac ccgacccgtt ggcgctgccg ccggcacttg acccgttggc gccgccgcca 223861 cctgacccgc tcgcgccgcc cccgcctgac ccgctggcag tgccggtagc agcgggcccc 223921 gttgccgggc aggatccgac atcgtttgtt ggcccgccgc cgttccggcc gccgacgttc 223981 aatccggtcg acggcgcgat ggtcggtgtg gccaagccga tcgtcatcaa cttcgcggtg 224041 ccgatcgccg accgggcgat ggccgaaagc gccatccaca tttcgtccat cccgcccgtg 224101 ccgggcaagt tctactggat gagcccgact caggtacgct ggcgcccgtt tgagttctgg 224161 cccgccaaca ccgcggtaaa catcgatgcg gccggcacca agtcgagctt ccggaccggt 224221 gattcgctgg tggccaccgc cgacgacgcc acgcatcaga tgacaatcac ccgcaacggc 224281 gtcgtgcaaa agaccttccc catgtcgatg ggcatggtgt ccggcggcca ccagaccccg 224341 aatggcacct actacgtgct tgagaagttc gccaccgtgg tcatggactc ctcgacgtac 224401 ggggtcccgg tcaactcggc ccaaggctac aagttgaccg tctccgacgc cgtccggatc 224461 gacaacagcg gcaacttcgt gcacagcgcg ccgtggtcgg tggcagatca gggcaagcgc 224521 aacgtcaccc acggctgcat caacctcagc ccggccaacg cgaagtggtt ctacgacaac 224581 ttcggcagcg gtgacccggt cgtcgtgaag aactctgtcg ggacttacaa caaaaacgac 224641 ggtgcccagg actggcagat ctaacggccg cgcggttgcc cacgagtgac ccgtagccaa 224701 tcgcggctcc ccttactgga gctttactga aagcaggtca gcgacagcat cgtgtagtgc 224761 cgaagcagcc ggcgggcgca gtctttcacc accaggttgc gcctgccgtc gagactgtag 224821 gcggcgtcga ccgcccagaa ggcgaacaag ctggtcgata ggtaagcggc catgtcctcg 224881 cagcgcagca gtgcctcggc cacatcctcg agcaggtgtt tgaccgagcg cgcggcctcg 224941 gcatggctca tgccgagcaa gtcgaagtgc caatcggcgg gcgggtcttg gtcttcgtcc 225001 agcggcagcg tctttagcac ctcggagatt agcgcgtgga tctgcagcgg acgtagacag 225061 tcggccgggc ggggctgctg aatgtcggca agccagccca gcaaccgctc gtatagccga 225121 tctgctgaca tctcgcgaat caggttgcgc gcccacttcc gcggttcctc gcggttgtcg 225181 tggtaaagga tcagatccag cgggccatgc ttgtcgaacc gcatcaggcg ccagatcagg 225241 tcgaagaact cctccgctga caggacctca ccgtagggaa tcgtcagttc gcggttgcgg 225301 atctcgcgca cttcgggggc cttggtggcc gccgaaatcc attcctcgaa ccgcgcgccg 225361 tcgatctgct ttgtctgcag tggtagcgat accggacgtt gcggggtttt cttccacttg 225421 aggagcagtt cggcaatctc gtcagcggcg gcgcacaggg tggcaataaa tcgctgctgg 225481 taagactcgt cgggtacacc cctagtcaac agctcgctca aggagacatc cgaactgtcc 225541 tcgacatcgc tgaccatccg ccctggattg tcggcgacgc ggtggaaaac cagaatgatg 225601 cagctgcgag cacttccgga ggcggtgcgg aagcggtgat gcaccagcga gccagcggcg 225661 aatacaccgc tcagcggtcg ctcggaatcg gtcacctcga tcaccgttgg gtggtcctga 225721 ccgcccagct ggcgctgcgc gtcgaacacc ttgcggatgc tgtcgggggt ggtgaagatc 225781 gatgcatttt cactcgacca cggcaccccg tcctcgttga cgaagcaggt gcgggcgagt 225841 ttatgggtgc cgggtaggaa ggtgaaattc tgtcccgctg gcccctgcgc ggttccgcgc 225901 cgccaggtaa tgaggatctt gtattcgtcg ttgaacgggg tgttgtcgat atgcagcatg 225961 ttgtcctggg ccaggaccga cagcggttcg gcgtccttgc cgcgtgcatc gatcattctg 226021 atcggtccac cgacggcata ggagatcaac gcgatcatca aggggtgcac cagcgcgcca 226081 ttgaccgctg gatcggtcag cattccgggg cttcgccgca ggtctaggaa gcggtgaatg 226141 aaactgcgac tgccctcccg ggccatgagt tcgtcgtagc gttttaccaa ttccaaaaag 226201 tcgtccgact cgacgatgtc ggcaaggacg actgcgccct gctcggccat ctgatcgatg 226261 aggtctcgta gtgccgcggt ggcgatttcg gcagcttctg aaccctgggc cgccagcgcg 226321 tcggtcagct cctgcagcag tttcctgcgg taggactcct tgtcatctag atcacggaat 226381 cgtttgtagg cccaggcggc gggcagctgg tcctgtgaca ccagctcggg accgatcggt 226441 ggtgcgtatt ccagcggcaa tatgtcatcg gcggcgaact tggtcagctg gattccgtcg 226501 gaattatccg gcaaggcctg ggtggtcgca gtctgaccga gtgagctcat gtcccgggaa 226561 atctgaatca cctccgcttt cgcgtattgc gcaagaactc ggttcgttga cccgtcgagg 226621 tcgactgcag aacgtacctc cggaggcggc gttatcgcca gacctattac ctgggggtct 226681 gcccgaaagg gaaaacccgg tgtcctttct ggttatcgaa gtgaccggaa tattcggtgc 226741 cggcggcgca cacgcgagaa tggatgccgc gcacgagttt atgcgcttgt tcgggttctg 226801 cccgaaaggg aagacttgat ttcccgttag ttcaaccacc gggtgatcgg cgcactgaac 226861 gagaaaggat atggcgaatg cgcacgaatt gctggtggcg gttgtccggc tatgtcatgc 226921 ggcatcggcg cgatctgctg ttgggattcg gggcggcgct ggccggcacc gtcatcgccg 226981 ttttggttcc gctggtaacc aagcgtgtca tagacgacgc gatcgcggcc gaccacagac 227041 cgctggcgcc ctgggccgtg gttctggtcg ccgccgccgg ggcgacctac ttgctgatgt 227101 acgtacgccg gtactacggc ggtcgaattg cccacctggt acagcatgac ctgcgcatgg 227161 acgcctttca ggccctgttg cggtgggacg gccgacaaca ggaccggtgg agcagcggcc 227221 agctcatcgt ccgcaccacc aatgacctgc aactggtgca ggcgttgctg ttcgatgtgc 227281 ccaatgtgct caggcatgtg ctgacactgc tactaggtgt cgcggtcatg acctggttgt 227341 cggtgccgct tgcgctgctt gcggtgctgc tggtacccgt gattggcctg atcgcccacc 227401 gcagccgccg gctgctggcc gcagccaccc actgtgccca ggaacacaag gccgcggtca 227461 ccggagtcgt cgatgcggcg gtctgcggaa tccgggtcgt caaggcgttc gggcaggagg 227521 agcgggagac ggtcaagctg gtgacggcat cccgcgcgct ctatgctgcc cagctgcggg 227581 tggccaggct caacgcacac ttcggtccgc tgctgcaaac cctgcccgcg ttgggtcaga 227641 tggcggtctt cgcgctcggc ggatggatgg ccgcgcaggg cagcattacg gtgggcacct 227701 ttgttgcctt ctgggcctgc ctgacattgc tggcgcggcc ggcatgcgat ctggcgggga 227761 tgctgaccat tgcccagcag gcgcgcgccg gcgcggtgcg ggtactcgaa ctcatcgaca 227821 gccggccgac gctggttgac ggcaccaagc cgctgtcgcc ggaggctcgg ttatcactgg 227881 agttccagcg ggtgtccttc ggatatgtgg ctgaccgccc cgtgctccgc gagataagcc 227941 tgtcggtccg ggccggggag accctggcgg tggtcggtgc gccgggcagc ggcaaatcca 228001 cgttggcgtc gctggcgacg cgttgctacg acgtcacaca gggcgcggtg cggatcggtg 228061 gtcaggatgt gcgcgagctg acgctcgact cgctgcggtc agccatcggc ctggtacccg 228121 aagatgccgt cctgttctcc ggaacgatcg gtgcaaacat cgcctatggc cgcccggatg 228181 cgacgcccga acagattgcc acggcggccc gggcggcgca catcgaggag ttcgtcaaca 228241 ctctgccgga cgggtatcag acggccgtcg gtgcgcgcgg actgacgctg tccggcgggc 228301 aacgccaacg catcgccctg gcccgggcgc tactgcacca gccgcggttg ttgatcatgg 228361 acgacccgac ctctgccgtg gatgcggtca tcgaatgcgg aattcaggag gtgctgcggg 228421 aggcgatcgc ggatcgcacc gcggtcattt tcacccgccg ccgatccatg cttaccttgg 228481 ccgaccgggt cgcggtcctc gactccgggc gcctgctcga tgtcggcacc cccgacgagg 228541 tgtgggagcg ctgtccccgc tatcgggaat tgctgtcgcc cgcgccggat ctcgccgatg 228601 acctggttgt cgcggagcgc tcgccggtgt gtcgaccggt ggccgggctc ggcaccaagg 228661 ccgcgcagca caccaacgtc cacaaccccg ggcctcacga tcacccaccc ggccccgacc 228721 cgttacgccg cctgctgcgt gagttccgcg gcccgcttgc gttgagcctg ctgttggtgg 228781 ccgtgcagac ctgcgcgggt ctgctgccgc ccctgctcat ccgccacggt attgacgtcg 228841 ggattcgccg ccatgtgctc tcggcgcttt ggtgggcagc gctcgccggc accgccaccg 228901 tggtcattag gtgggtcgtg cagtggggga gtgccatggt cgccggatac accggtgagc 228961 aggtgctgtt tcgattgcgg tccgtcgtct tcgcccatgc ccagcgcctg ggcctggacg 229021 catttgaaga cgacggagat gcccagatcg tcaccgcggt caccgccgac gtcgaggcca 229081 tcgtggcgtt cctgcgcacg ggtctggtcg ttgccgtgat cagcgtggtg accctggtcg 229141 gcattttggt ggcgctgctg gccatccgcg cccggctggt gttgctgatc ttcaccacca 229201 tgccggtgct tgcccttgcg acctggcaat tccgtcgggc gtcgaattgg acctatcggc 229261 gggcgcggca ccggttgggg acggtaaccg ccacgttgcg tgagtacgcg gcggggttgc 229321 ggatcgccca ggcgttccgc gccgaatacc ggggactgca aagctatttc gctcatagtg 229381 acgactatcg ccgacttggg gtgcgcgggc agcggctgct agccctgtac tacccgttcg 229441 tggcattgct ctgcagcctg gcgaccaccc tggtcctgct cgacggtgca cgcgaggtgc 229501 gagcgggggt gatctcggtc ggagcgctgg tgacctatct gctctacatc gagctgttgt 229561 acacgccgat aggcgaactg gcgcaaatgt tcgacgatta ccagcgtgcg gcggtggcgg 229621 ccgggcggat ccggtcgctg ctgagcacgc ggacaccgtc gtcgccggcg gcacgaccgg 229681 tggggacgtt gcgtggtgaa gtggttttcg acgccgtcca ctattcctac cgaacacgag 229741 aagtgccggc actggccggc atcaacctgc gaattccggc cgggcagacg gtggtgttcg 229801 tcggctccac cggatccggg aaatccaccc tgatcaagtt ggtggcgcgg ttctacgatc 229861 cgacccatgg gacggtccga gtcgacggat gcgacctgcg ggagttcgat gtcgacggct 229921 atcgcaaccg gctcggcatc gtgacgcagg agcagtacgt cttcgccggg acggtccgcg 229981 atgccatcgc atacggacgg cccgatgcca ccgatgccca ggtcgaacgg gctgcgcggg 230041 aggtcggtgc ccatccgatg atcaccgcac tcgacaacgg gtacctgcat caggtcaccg 230101 cgggtgggcg caatctgtcc gccggtcagc tgcagttgct cgcattggcc agggcgcgtc 230161 tggttgaccc cgacattctg ctgctggatg aggccaccgt ggccctggat cctgccaccg 230221 aggccgtggt gcagcgggcc accctcaccc tggcagcccg tcggacgacc ttgatcgtgg 230281 ctcacgggct agccatcgcc gaacacgccg accgcattgt cgtgctcgag cacggcaccg 230341 ttgtcgagga cggcgcccac accgaacttc tcgctgctgg gggccactat tcgcggctgt 230401 gggcggccca tactcgactg tgttcgccgg aaatcactca gcttcaatgt attgacgcat 230461 agacgtcacc aagccaccga atgggtggcg agttgaccgg gcgccggatc ccgacggttg 230521 tggttgatct gccgaatcaa cggcttctgg ccacgaacat gtgtccgcga ctggcgtctg 230581 cgataccaac ccaatcggtt actatagaaa ctgttcccgc cgacaactaa ctcccttgtt 230641 cgcgtggagg ggttctcggg tccggtcagc gaggtccgga gcggggcgga aatttcattg 230701 aacagccgta gaagttcagc caggaccgga acggatccag cggcaagcat gccttcagga 230761 gccatgttgt cgaatcagtg cctagggctg ggggcgcccg gaaggaacac cacagggggg 230821 accgacattc cgcatgtggt caagcgcagc ggagcgaaat tccgcgagga gttcatcctc 230881 cgtccggacc gggtgcaaat ggcaccggtg aatgtcattt cggtcgcggt ggtggcgagc 230941 gacccgttga cccgcgatgg agctttggcc cgactctcgt ctcaccggga gctcgacgtg 231001 cgcgcttggc aggctggatg cgaaacctcg gtcctgctcg tgctggccac cacgatcacc 231061 gcgcctcttc tatgccagat cgaggacgtg cagaaggatg gccccagtca cgccccgaaa 231121 ctggtcgtcg tcgccgacga attctccgct gaacaagttt tccggatgat caagctgggg 231181 ttgaccgggt tgttgtatcg cagccagagc acgttcgact gcatcgtcga gacaatccgg 231241 ttgtccgccg aaggccgcct gcgactcccc gaacgtgtcc agcgttacct ggtcggccgc 231301 atcaagtcca ccccgaccgc cgaacctgac acaccgtgcg ccgccgctct tgccgagcgt 231361 gaggtggcgg tgctgcgtct gctagcggac ggcttgagca cgcaccaagt ggcggtgcag 231421 ctcaactatt gcgagcgcac gatcaagaac atcgttcatg acatagtgac gcggctgaag 231481 ctccgcaacc gcacgcatgc cgtcgcacat gcgctgcgcg cgggcctcat ttgattgatg 231541 gccggcgtcc gacgtacgtg cggccgggcc gatcccaagc gagtggtgta acgtgcacgg 231601 tagccattat gtatagcaac atacatatgc ctcggatgga gcggcgatgc aaggtccacg 231661 cgaacggatg gtggtctcgg ccgcgctgtt gattcgggaa cggggagccc acgccaccgc 231721 catctcggat gtgctgcagc acagcggcgc accgcggggg tcggcctatc actacttccc 231781 gggcggtcgt acccaactgc tatgcgaggc cgtcgattac gccggagagc atgtcgccgc 231841 catgatcaac gaggccgagg ggggcctgga gctgctggac gcgctgattg acaagtatcg 231901 ccagcagctg ctcagcaccg actttcgcgc cggctgcccg atcgccgcgg tctcggtgga 231961 ggcgggcgac gaacaagatc gcgagcggat ggccccggtg atcgcgcgtg cagcggcggt 232021 gtttgaccgc tggtcggact tgactgccca gcggttcatt gccgacggca taccgccgga 232081 tcgggcgcac gagctggcgg tgttggcgac gtcgacgctc gagggcgcaa tcttgctggc 232141 tcgggtgcgg cgcgacctga cgccgctgga tctggttcac cgccagctgc gcaacctgct 232201 gctggccgag ctgcccgaaa ggagccgatg atgaccagct ctgattggct gcccaccgcg 232261 tgcatcctct gcgagtgcaa ctgcggcatc gtcgtgcaag tcgacgatcg ccgactggcc 232321 cgcatccggg gcgacaaggc gcatccgggg tctgcgggct acacctgcaa caaggcgttg 232381 cggctggacc attaccagaa caaccgggct cgcctgagct cgccgatgcg ccgccgagcc 232441 gatggcacct acgaggagat cgactgggac acggcgattg tcgagattgc cgagggattc 232501 aaacagatcc gtgataccca cggcggggac aagatcttct actacggcgg cggcggacag 232561 ggcaatcacc tcggcggcgc ctacagcggc gcctttctga aggcactggg gtcgcgctac 232621 cggtcgaatg cgctggcgca ggagaagacc ggcgaagcct gggtcgactt ccagctgtac 232681 ggcggtcaca cgcgcggcga gttcgagaac gccgaggtgt cggtgttcgt cgggaagaac 232741 ccatggatgt cgcagagctt cccgcgggcc cgggtcgtgc tcaacgagat cgccaaggat 232801 cccggccggt cgatgatcgt gatcgatccc gtcgtcaccg acaccgcgaa gatggccgac 232861 ttccatctac gggtgcaacc gggttgcgac gcctggtgct tggcggcttt ggccgcggtc 232921 ttggtccagg aaaacctctg taacgaagcc tttcttgccg cgcacgtgca cggagtggac 232981 accgtgcgcg ccgccctgca agaggtcccg gtcgccgact acgcgcagcg ttgcggggtg 233041 gacgaggagt tgttgcgtgc cgcggcccgg cgcatcggca ccgccgcgag cgtgtcggtg 233101 ttcgaagacc tgggaatcca gcaggcgccc aacagcaccg tctgctccta tctgaacaag 233161 ctgctgtgga tcctgaccgg caacttcgcg aaaaagggtg gccaacacct gcattcgtcg 233221 ttcgctccgc tgttcagcca ggtctccggc cgcacaccgg tcaccggtgc gcctattatc 233281 gcgggcctga tcccgggcaa cgtggtgccc gaggagatcc tgaccgagca cccggatcgg 233341 tttcgggcga tgatcgtaga gaggggcaat ccggctcact cgctggccga ttcagccgcc 233401 tgccgggcgg cattccaggc gctggaactg atggtggtcg tcgatgtcgc catgaccgag 233461 acggccaggc tcgcccacta cgtgctgccg gcggcgtcgc agttcgagaa gccggaagcc 233521 acattcttca atttcgagtt tccacgcaac ggctttcagt tgcgccggcc gttgtttccg 233581 ccactgcccg gaacactgcc cgaacccgag atttgggcgc ggctggtgcg ggcacttggc 233641 gtagtcgacg aagcggacct gcggccgctg cgagaggccg ctgctcaggg tcgccaggcg 233701 tataccgagg cgttcctcgc ggcggcggcg accaatccca ccgtggcgaa actgaccgcc 233761 tatgtgctct atgaaacgct cgggccgacg ctgccggacg gtctggccgg ggcggccgcg 233821 ttgtggggac ttgcccagaa gacggcgatg gcctaccctg acgccgtccg ccgcgccggc 233881 cacgccgacg gcaacgcgct gttcgacgcg attctcgagc gcccctccgg ggtcacgttt 233941 accgtgcaca actacgaaga cgacttcgct ttgattagcc accccgatca caagatcgcc 234001 ctggagattc cggaaatgct ggcagagatc cggtcgctga cccagacccc gtcgcggttg 234061 accacgcctc aactgccgat cgtgctgtcg gtgggcgagc gccgcgcgta cacggccaac 234121 gacatcttcc gtgacccgtc ctggcgcaaa cgcgacgcca acggggcgct gcgggtcagc 234181 gtcgaagacg cccaggccct gggactggcc gatgggtgcc tggctcgtat cacgaccgcg 234241 gcgggcagtg cggaggcgac ggtggaggtc accgagacga tgctggccgg acacgccgcg 234301 ctgcccaacg gctttgggct ggactacacc ggcgacgacg ggcgcaccgt cgtcgccggt 234361 gtcgccccga acgcacttac ttcgacgaga tggcgcgacc cctacgccgg caccccctgg 234421 cacaagcacg tgcccgccgc catccgccga gcagacgcag aatcgcccat ttggtatccc 234481 aaatgggcga ttctgcctgc tcgcggggtc ttagcctagt tccagatccg gaccctgcgc 234541 tgcgggtcca gaaacagcgc gtcatcctcg gtgacgtcga aggcctgata aaaagcgtcc 234601 acgttgcgaa ccacaccgtt gcaccggaac tccggcgggg agtgcggatc gaccgccaac 234661 cggcggattg cttcggctgc acgcgatttg gttcgccata tttgtgccca gccgaagaac 234721 acccgttgca tgccggtcag cccgtcgata accggagcgg ggttgccgtt cagcgagagc 234781 tggtaagcca gcagggcgat cgacagcccg cccaggtcgc cgatgttctc gcctatggtg 234841 aacgcgcctt gcacatgagg cgggccgggg tggtcgacga gatcgcgcgg cgtgtaagcg 234901 tggtactgct cgatcaacgc tttggtgcgg gcggcgaact cggtgcgatc gtcgtcggtc 234961 caccaatcga ccagattgcc gtcgccgtcg tatttggcgc cctgatcgtc gaaaccgtgc 235021 ccgatctcgt gcccgatcac cgccccgatc ccgccgtagt tggcggcctc gtcggcctgc 235081 ggatcgaaaa atggtggctg taaaatcgct gcggggaaga cgatttcgtt catccccggg 235141 ttgtagtagg cgttgacggt ttgtggtgtc atgaaccact cgtcgcggtc gaccgggccg 235201 aaaagcttgg ctagctcgcg gtcatggttg acggcgtagc cgcgctggac gttaccgtag 235261 aggtcgtcgc ggtcgatcgc cagcttcgag tagtcgcgcc acttgatcgg atagccgact 235321 ttggcggtga acttgttcag cttcgctagc gcgcgttgcc gggtctgcgg cgtcatccaa 235381 tccagctcgc tgatgctgat ccgatacgcc tcctgcaggt tgtccaccag ggtgtcgatg 235441 cgggacttgg catccggcgg gaaatggcgt tgtacataga gctttccgac ggcatcgccc 235501 atcaggttct ccaccagtga caccccacgc ttccaacggt cccgaagctg ctgtgcgccg 235561 gtaagcgtgc ggccgtagaa ttcgaagtcc tcggcgacca gggcgcgggt cagccagggg 235621 gcccgggcgc ggatcaaacg ccaacgcgcc cagcatttcc agtcttcaac gttaacgctc 235681 gcccacagcg aggcaaaggt gacgaggtaa tcaggttggc gcacaaccag ttccgtcatg 235741 gcgtccggag cgctccccaa tgcggtcacc cagctgaccc agtcgaaacc cgccccttcg 235801 gtctgcagct gggcaaacgt gcgcaggttg tagccaaggt cggcgtcgcg gcgcttcacc 235861 acatcccaat gcgcgtcggc gagtttggtc tccagcgcga cgatgcggtc cgcggttttg 235921 gcatggtcac ggctctcgcc cccgtacacc aggccgaaca tccgggcgat gtgccccggg 235981 taggccgcta gcacggcggc gtgttgctcg tcacggtagt aggactcgtc gggtaatccg 236041 atgccggatt gggtgaaatg caccaagtaa cgggtcgagt ctttggaatc ggtatcgaca 236101 tagactccga tgccgccgcc cacgccggca cgttgcagag tgccaagggc ggcggccaat 236161 tcggtggcgt cggccgcgct gtcaatcgtg gccaattcgt cgtgcagcgg ttgcacccct 236221 gcgcgctcga cggcttcctc gtcgaggaag ctggcgtaga ggtcgccgat gcgctgcgca 236281 tcggtgccta ccgcagcacc tgcttggctg gcctggatga tcaggtctcg cacttgtgtc 236341 tcggcgcggt cgaacaggct acggaaggcg ccgtcggtcg ctcggtccgc tggtatctcg 236401 tgttcagcca gccagcggcc gttaacgtgg ccgaacaggt cgtcttgggg tcgggcatca 236461 gcgtcgatgt ggctcaggtc gatacccgag gggatggcaa gtgtcacccc gccatccttc 236521 cacctctttt cgggtgcaac gatcgggcca tgcctgacgg ggagcagagc cagccaccgg 236581 cccaagaaga tgcggaagac gactcgcggc ccgacgccgc ggaggccgcc gcggccgaac 236641 ccaaatcatc agccggtccg atgttctcga cctacggtat cgcctcgaca ctactcggcg 236701 tgctatcggt cgccgcggtc gtgctgggtg cgatgatctg gtccgcacac cgcgatgact 236761 ccggcgagcg tacctacctg acccgggtca tgctgaccgc cgctgaatgg acggccgtgc 236821 tgatcaacat gaacgccgac aacatcgatg ccagcctgca gcgactgcac gacggaacgg 236881 tcggtcaact caacaccgac ttcgacgctg tcgtgcagcc ctaccggcag gtggtggaga 236941 agttgcggac gcacagcagc ggcaggatcg aggcggtagc gatcgatacg gtgcaccgcg 237001 agctggatac ccagtccggt gccgcccgac cggtagtaac cacgaaattg ccaccgtttg 237061 ccactcgcac cgactcggtg ctgctggtcg cgacgtcggt cagtgagaac gccggcgcca 237121 aaccccagac cgtgcactgg aacttgcggc tcgatgtctc cgatgtggac ggcaagctga 237181 tgatctcccg gttggagtcg attcgatgag aaatgcttgg cggctggtgg tgttcgatgt 237241 cctggcacca ctggccacga tcgccgccct ggccgcgatc ggcgtcttgc tcggctggcc 237301 cctgtggtgg gtttcgacgt gctcggtgtt ggtgctgctg gtggtcgaag gtgtggcaat 237361 caacttctgg ctgttgcgtc gtgattcggt aaccgtcggt accgacgacg atgcgcccgg 237421 gctgcgactg gccgttgtct tcctgtgcgc cgccgcgatc tcggcggcgg tggtgactgg 237481 gtacctgcgc tggacgacac cggaccgcga cttcaatcgg gattcccggg aagtggtgca 237541 tcttgccacg gggatggccg agacggtcgc gtcattctcc ccgagcgcac cggccgccgc 237601 tgttgaccgg gccgcggcga tgatggtgcc cgaacatgcg ggcgggttca aggagcaata 237661 cgccaagtcc agcgccgatc tcgcacggcg cggtgttacg gcccaggccg ctacgctggc 237721 ggccggcgtg gaggcgatcg ggccgtcggc agccagtgtt gcggtgattc tgcgggttag 237781 ccaaagcatt cccggccagc cgaccagtca agcggcgcga gcgctgcggg tgaccttgac 237841 caagcggggc agcggctggc tggtgctcga cgtgacgccg atcaacgctc gctaagagtc 237901 ggcggcacgt acggatttgg ctctgacgaa ccggtccgac agccgccgca tccggatcat 237961 cagcgaggcc gacgggctca cgatgccgtc gaggtaggcg gtcaggtcct gcgctgtgac 238021 gccaatgcgc gacgcgaatt cctgtcgttg caggccagag cggtccaaca ggagcccaac 238081 ctgacgggcc acctcggcgc gctcattggc gtctaggtga gtacgggccc ggtccagcac 238141 ctcccaaaag gcgttggcga tgccggtcgc cggtatgccc tcgaggactt cttcgacttg 238201 gcgtgctgtc cgcccgtagg ggtcgcgctt gagcgcggcc gctatgcgtt gccaggtggc 238261 gatgtcgcca ctttccagcg ccgaacgaat ggcgacggta ggccagaact cgacccgccg 238321 gtcgacgtcc ggctcgctcc acgcgacggt gggttgttgc ggcggtgcgg ggtgtggctc 238381 ggctgccaac gtcacctcgc ctcctccaac atcgccacgg ccaccgacag gcaacgccgc 238441 cggacctctt cccactttgc ctgggcatca gctccgggcg actggtcacc gaggtcagac 238501 ggttgcggat ctgccaggcg accaaccaac tgggtggcca tccattgccg cccgggtgct 238561 tgacaagagt agtaccgatc catcccagcc agcaccgcgg cggcggtttc gggtgccatc 238621 gtatcgacca ggtcagcaaa gtcggcgtag tcgtggctgc tgtttcggga catgatcagg 238681 tagcccttga agcgcagcgt ttccgcgccg gttgggatct gcaagcggtc accggtgggc 238741 aatgcgacgt tggtcgtctc caccgggctg cgccgccggt agccggggcc cgcccggtga 238801 gtctgcaccc ccccgcactc ccaggtggtt gtctgaagcg cgtcgagggc gaccgcgagc 238861 cgcttgcgcc acacggtgac cgggtgcacc gggcgctggc cccatgatat cgcccgggcg 238921 atgccgttgc ggtaggccag ctgcacctgg tcgagcgcct taccgtcaca tccacacccg 238981 gtgaaggcga gcggatcggt aacgcaaatg gcgtccggcg caagtcgctt gagcttggcc 239041 gccgacttga gcaccatccg cacatccgcg ctgggcggga tcgccgcggc gaagtcgtca 239101 ggaatgacca cgacgtcacc gaggtcgact ttcggcagcg gtcggtcgaa gtcgaccgac 239161 ggcaagatgt gggccagcca tcggggcaac caccagttcc atcggtcaaa catcgccatc 239221 aatgccggta ccagcaccag ccgcacgacg gtggcgtcca cggcgatcgc gaccgcgcac 239281 gccacgccga tctcggccac tagcggcatg ccggcgaacg cgaacccgca aaacaccgcg 239341 atcatgatca acgcggcgct ggtgatcgtg cgcgcgctgg tgcgcacacc gtacgcgacc 239401 gcgtcgcggg tctggcccgt ctgcaggaac cgctcccgga ttcgcgtaag caggaagatt 239461 tcatagtcca tcgacaaccc gaacgtcatc gccaggacca gcgggggaac ggtgctgtcg 239521 atcgaatgaa gcgccgggaa accgagcccc cgtgcccagc cccactggaa gaccatcacc 239581 aggctgccgt aggcggcggc caccgacagc agcgtcatca gcacgccctt gaacgccagg 239641 aacaccgagc ggattgagat caacaacatc aaaaacgcga tcaccgccac gaagaccagc 239701 accagcggtt gcgtcgcgga cacccggtcg tcgaaatcct tgatcagagc ggtcggcccg 239761 ccgacgtcca cttgtgccgc gccggcaacc cggggtagct gggtccgcat ccaggtgatg 239821 gtgtcgcggg cgcccaaatc ctcgggatcg accgatagca ccgcgctgag caaagcgctg 239881 ccgttgtcgt cggcgaatcg cggtggggcc accgaaacga cgttgggcgc ctgtgcgatc 239941 cgatgacgga ttgcggcgat tgtctggcta tgttcgggtg cggacgcacc gccggcgtca 240001 aacctgacca gcacctgaac cgggcccagc gcgcccggcc ccagcgcttg ggccgcggcc 240061 gctgcgccgg tgcggatctc gtgtgacgag tcgaactggc gcagcaagct gttgcccagc 240121 accatcaagg ttgccggtgc cgccatgaca agcagcacgg tcgatgccgc cagtgctgtg 240181 atccagggtc ggcgcatcac ccacccgacc cagcgggacc agaaccaaga ttgcgtgctt 240241 gccggccgcc gcgaccagtg cactaacgct gaccgcttgg ccgccgcgcg ggcaaatgtt 240301 gctagcacgg caggtgtcag ggtggccgac gtcagcatcg caaccgcgac cgcgagaatc 240361 gccccggtgg ccatcgatct cagcgccggg gtgttgatca ggtagatccc ggtcagcgac 240421 gcgatgaccg tcataccgga caacaccaca gccaaccccg aagtggccat cgcggcgtcg 240481 accgcgtcgg gcggccggcg tccgcaacgc agttcctcgc ggtagcgcat caggatgaac 240541 agggagtagt cgacggcaag cgcgatgccg aacatcgaaa cggtcgatgt cacgaacacc 240601 gacatggtgg tgtgcatcga caacacaaac accaggccca tggtgatgac gaccgtgcaa 240661 acggcgagtg ccagcgggat cgctgcggcg gccaacgagc cgaaaaccgc aaccaggacc 240721 atcagaatga taggcaggtt ccagcgttcg gcgttggcaa tatcgtgttt ggtgtttgcc 240781 gccgcggccg cggacagcgc gccctgcccg atgacataga gccgcacttt gccgttggca 240841 gtttgcccgg actgatcgcc tttgacgcct attcggtcgc gcagcttttt ggcgacgtca 240901 ctggtgcccg cgttgcgggc gtccagccgc agcgacacca catacggccg gtccggttgc 240961 gggggccgtt gggtggggtt gggtgcctcc gtcaccccag gcagttcgct ggctatttgt 241021 cgcagtagcg cgacggcatt gtcgatgtct tggtagctag catccggtcg gggggccgct 241081 accagcgcca gcgccggggc tccccggtcc gggtagtgcg cgtcgagttg gtcgtggacc 241141 agcaatgact gcgacccggc gacttcgaaa ccgccaccgg ttagattccc cgactgcgtc 241201 atcgccaggt aaaccgccgg cactaacgcc agcaaccaac ccgtgaagac caaccaacgg 241261 cacctgcgca ggttgcggct caagcgcatc atgaactgct ggatttcgga ctccccgtac 241321 tctcgcgcag tgcgtgcccg cgagcctacc gaagatcgcg tgcatgcgtt cggcgtggac 241381 cgcacagcac ctggagttgg cggcgccgag ggccgagatg gcaggatgac ggatcgtcgg 241441 gggcgggaac tcccaggccg ccgggccgtc gcaaacccgt cgcaaacccg tcgcaaaccg 241501 taaggagtca tccatgaaga caggcaccgc gacgacgcgg cgcaggctgt tggcagtact 241561 gatcgccctc gcgttgccgg gggccgccgt tgcgctgctg gccgaaccat cagcgaccgg 241621 cgcgtcggac ccgtgcgcgg ccagcgaagt ggcgaggacg gtcggttcgg tcgccaagtc 241681 gatgggcgac tacctggatt cacacccaga gaccaaccag gtgatgaccg cggtcttgca 241741 gcagcaggta gggccggggt cggtcgcatc gctgaaggcc catttcgagg cgaatcccaa 241801 ggtcgcatcg gatctgcacg cgctttcgca accgctgacc gatctttcga ctcggtgctc 241861 gctgccgatc agcggcctgc aggcgatcgg tttgatgcag gcggtgcagg gcgcccgccg 241921 gtagatgccg gaccgccgcc gggtccggcg cagtcgagcg tgaggcagcg gtcgcctacc 241981 ggggcggtgt ctcgccgcct tctggtcgca ggtcaggggt cggcgctgga ccttgcggtg 242041 tggtttcgac cgggtcgtcg cagggtgtgc cctgcggttg gatgacaagt cgcaggtttg 242101 gatcggttgg cgggtcgcga tcgttgtcgg aatcggcggt gctctcggtg cggaacatga 242161 agaagaacac cacccagccg attgcggcga tgagcagcca gctgatcagc cggtagatca 242221 acatcgccga gatggcactc ggcaagggca tgccgctgga taccaggccg ggtaccagca 242281 ccgcctcgac caccaacaca ccaccgggca tcagcggtat ggtgccgacc gcgcgggcgg 242341 cggcgtaggc gaccgccagc ccaccgaccg aggcatggtc gccggcggcg tacgcggcga 242401 aaccgaggca ggctacgtcg gcgatccagt tgaacaacga ccaaccgaac gccacgccca 242461 ggtcgcgcct gcccaggctg accgattcca gctgcatgag cgtctcgcgc cacttcggta 242521 ggccggcatc ggccggccta ccgcgaaccg agttggccca cgacaaaact ctcctgccga 242581 tcccctcgat gagctccggc cgcgacgcca ccgcctgggc cagtagcagc aatgtgacga 242641 agccgcccag ggtgaacagc agtgagaacg ggttgttctt ggcgcccagg aagaatgcgc 242701 cacccaaccc gagcaatgcc aagcccaccg cctgcaacac gcccgacatg accagctgcc 242761 atgacgccac caccgtcgag gcgccccaga tgcgttgctg acggagtaag aacgtagccg 242821 acaacaccgg cccacccggc agcgtggtgc tcagcgagtt ggcggcgtag aaggcggcct 242881 ccgaccgcca ttgcttgacg tgcaccccgg cggatttcag cagggttcgc tgaatctggg 242941 cgaagctgtg catcgaggcg cccgcggctg ccaccgcggc cagcaaccac caccacttgg 243001 cgcgatacaa gctcacccag gccttggcga gctggtccca gcccaacgcc acctctatag 243061 caagcacgat tgcgacgatg gccagtaccg cccatcgcaa ccaccagtac ttgccgcgcg 243121 ggggtacgcc ctcagcgggg ggtgccccca cccgcgtgcg agggagtgcc cccacgcgct 243181 ggcggaggtt gcgggcgggg gcgtcgtgcg acacgtgctt aagggtaacc gtgcaggtgg 243241 cgccgtaatc gcgatacatc gctaaccgtg tcagcctcgt tggggggtcg tgaccggatc 243301 gtgccgcctg gcaaagtaac tatgcgggct cgacgcgacc cgccgcgacc ttacgacgcc 243361 gccgttcccg ttacgcttgc cggatgtcgg cgagcctgga tgacgcttcg gtcgcaccgc 243421 tggttcgcaa gaccgcggcc tgggcgtggc ggttcttggt catcctggcc gcgatggtcg 243481 cgctgctgtg ggtcctcaac aagtttgagg tcatcgtcgt cccggtgttg ctggcgctga 243541 tgttgagtgc gttgctggtg ccgccggtgg attggctgga ctcccggggc ctgccgcacg 243601 ctgtcgcggt gacgctggtc ttgttgagcg gtttcgcggt tctcggcggc atcctgacgt 243661 tcgtcgtcag ccaattcatc gcggggttgc cgcatctggt caccgaggtt gagcgcagca 243721 tcgactccgc gcgcagatgg ctgatcgaag gcccggcgca cttgcgcggc gaacagatcg 243781 acaacgcggg caacgccgcg atcgaggcgc tgcgcaacaa ccaggcgaag ctgaccagtg 243841 gcgcattgtc gactgcagcc accattaccg agctggttac cgcggcggtg ctggtactgt 243901 tcacgctcat tttcttcctc tacgggggcc ggagcatctg gcagtacgtc acgaaggcct 243961 tcccggccag cgtccgtgac agagtgcgtg cggcggggcg cgccggttat gcgtcgctga 244021 tcgggtacgc gcgggccacc ttcctagtgg cattgaccga tgcggccggg gtgggcgcgg 244081 ggctggcggt gatgggtgtg ccgctggcat taccgctggc ctcgctggtg tttttcggtg 244141 ccttcattcc gttgatcggt gccgtggtcg ccgggtttct ggccgtggtg gtggccctgc 244201 tggccaaggg cattggctac gcgctgatca cggtcggttt gctaatcgcg gtgaaccaac 244261 ttgaggccca tttactgcag ccgctggtga tgggtcgggc ggtgtcgatt cacccgctgg 244321 ccgtggtgct ggccattgcc gctggcggtg tgcttgccgg agtcgtcggc gccctgttgg 244381 ccgtcccgac ggtcgctttc ttcaacaatg cggtgcaggt gctgctgggc gggaatccgt 244441 tcgccgacgt ggcagacgtt tcttccgatc acctcaccga ggtttaaagg cgtccttcgc 244501 ggcgaagcag atcctgggcg gacagggcgc cgccgccgcg gcggcgctga cgcgtcttat 244561 cgctcgtgcc gcgggcattc agctgctcag tggctgcctc tgagtcgtcg ccgtccgacc 244621 gtatgattgg cagggccgcg gtgggttcgg ccgggtcacc ggctgcgtct gtggagcggt 244681 tcgccgcaag cggcatagcc cgggtctgac cggcagacgg ggccgatggc ggggtcgggg 244741 ttggtggcgg cgtggatgct ggcgactgca cggaccgacc ggcagccgcg aggcgggttg 244801 tcggcggctc cgtcgacgac ccgatctgca tccgtgtggt gctcgcccca gacggcgccg 244861 ccggcgcggc agttgcttcc agggcaggcg tgagctccgg tgagcttgct ggactcgagc 244921 gggccggtcg aggtgactcc gccagcggat gggtcggatc gtgcggtggg cgcgggtccc 244981 cagcggcgcg cgccgcaacc agcccagctg tgaccggagg acgtgcggga cgcccgttgc 245041 tgacgggccg cttgcgctcg tcgggcaggt ggatctcgcc cagcccgatg cgggtctgca 245101 ggcgtctggc ccagcgcggt gcccaccagc agtcatcgcc gagcagcttc atcaccgatg 245161 gcactaaaaa catccgcacc acggtcgcgt ccagcagcag cgccgccatc agtccaaagg 245221 ccagatactt catcatcacc aggtcggaga acacgaacgc gcccgcgacg acggcaacaa 245281 tcagcgccgc ggcggtaatg atgcgtccgg tggctgcggt gccgatccgg atcgcctcct 245341 gggtcgacat gccgcgctct cgcgcctcga ccatccggga caccaagaac acctcgtagt 245401 cggtggatag gccgaagacc agcgcgatga tcagcccgat caccggcgct gtcagcgggg 245461 tcggcgtgaa attcagccac ttcgaaaagt gtccgtcgac gaatatccac gtcaggatgc 245521 ccatggtgga cccgagcgtc agagcgctca tcagcgtcgc cttgattggc agcaccaccg 245581 agccgaacgc caagaacatc aagacgatcg tggtggtcag caggatgacc accatcagcg 245641 gcatcttcgc gaacaggccg tggattgaat ccagctccag ggcgggagtt ccaccgacca 245701 agaccgtgat tcctttgggc ggggtgatcg cgcgcagctc ggtgagcttc ttcgacgcgt 245761 cagccgggtt gatcaacccg ttctgcagga cgcgcaccga tggatcttta gatgcgccta 245821 ccgcgtaggc acgctcttgc cacatattcg ccggatcgtt gtccggctcg atgaatccgc 245881 cgatcgccat cgccttgctg cggatgtcag cgatctgcgc gtcggtgacc ggttgatggt 245941 tgctggtctg gatcaccagt gtcagcggat tggtgcggta tccggggaag agtttgtcga 246001 actcctcctg cgcctggcgc accgaattgg tcggcggcaa gtacttctcg ctgatcccgc 246061 ccaatgacag cttgcccacc gggataatca gcaaaatcat gatgatgacg atcggtgcgg 246121 cgaacagcac tgggcgcttc atcacccggt taaccagctt gccccagaag ccggcttcga 246181 cctcttcgcg ggtcttggtc cgctgcaggc ggtcggcgag ccagttcagg taggcggccg 246241 aaatcttcca gttcgccagg aagggcaccc ggaacagggt ccgcacgccg agcgcgtcga 246301 cgtgtttgcc caggatcccc agacaggccg gcaacacggt gatagacagg atggccgaca 246361 gcatcaccga tgcgatcgtg gcgtaggtca gcgacttcag gaaaccctgc gggaagagca 246421 gcagaccgat cgccgacgcg acgatcaaca ccgccgagaa cgtcaccgtg cgtccggcgg 246481 tgatcaccgt gcgccgtact gccgtctcgg tgtcgtagcc ttcggcgatc tcttcgcgga 246541 accggctcac gatgaacaac ccgtagtcga tggcgatccc cagaccgatc agcgacacca 246601 cgggctgggc gaaatagtgc acgggaccga agatcgcgag gaaccgcatg atgcccagcg 246661 cgccggcgat gcacagccct ccgaccatca ccggtaggcc ggcggcgatc acgccgccga 246721 acacgaagaa caacaccacc gccaccaacg gcagcgccag cacttccatt cgccgttggt 246781 cggtggcgat ggtgccggtc aacgcctcgg ccaccggttg cagcccggcg agcttcaccg 246841 tgcctccgtc gagccgctgc aggtcgggtg cgatggcctt gtagttgttg aggatggtgt 246901 cgtcgtcatc acccttgagc gggatggaaa cgaaggtgta cttcttgtcg gcggtggcca 246961 tgccggtcgc ctgactcgct ctcaggtagc cggcccatcc caagacctgg tcggggtgat 247021 cctgctggaa ccggttgagc tcgtcgacga ccttctttga ccaggccggg tcgtcaacgg 247081 tcttgccggc tggggcttgg aagatcgcga cgatgtgacc gcttcggtct cggccgtaga 247141 cctggtcgcc cagcaccgat gcttgcaccg attggctgcc gtcgtcgtag aagccgctct 247201 gcgtgacgtg cttgccgagg ctcagcccga aaacgccgcc gccgaggcat agagcgacca 247261 tgaccccgat tacgatgaac cggtagcggt acacagttcg accccaccag gcgaacacgt 247321 aagctcctta ctggatcggc agcgacccgc gtattgcttt ttggttgtca cacacgtcgg 247381 ctgtcacact cgcgaggtca acagcgagga cagcggccgg aacggctgca gccaagcccc 247441 ctgctcaggt agcgaatcga ggccgattcg aggtagtggt tcccggaaaa caccagcgat 247501 gtcctccagg tcgacgaact ccaaggtatc cgacgctagc gcccaactgg cgtgttcacg 247561 aaatccgagc acttgaactg gggttccgct gcgggcgacc gcctccaacg gttggcggaa 247621 tgcctgaccg tcggccgacg ccaccaccag cgcggcgagc ccttcgcggt agcgctcgtc 247681 gatgtgcgcc aacatgtcgc ggtcaacgtc gctgtcctcg tctactttcg gtttggcgaa 247741 gacggcgaat ccgacattgc gcaacgcgtc cacccacggc cggaccacct cggcgctgcc 247801 aggggcgatg ttggtgaaga cggtggcctc cggttcggtc gagatgcctg gacggccggc 247861 cacaatctct gcggttcggg ccagcagcca gcgtcccagg gcgtcgaatc gtggtcgttc 247921 cagtgctgtc ggccggcggc ccaagatgga gcccaaaccc atgtcgaggt tgggagcgtc 247981 ccacaccagc aatacccgtg cccccggcgc accgagactg gtcagcccgt cctgcgataa 248041 gtcttccgcc agtaccgagt gccgggcgag ggattcagat gtttgtgaag tcacgtcttc 248101 ggtcaggctc atcatcatct aattttcagg tctctttcag agcaaccgtg ctttttccat 248161 aacaactcga tgactgcgcc gcccccaagc tgggctttcc tctcgtactt ggtagccggt 248221 cggacgaccg aaatcggcag cagttcggtg tcggggtcga cgcgaaccag ccggggttcg 248281 gcgtcgccgg ccgcggcgat gtgctcggcg tagccgggat ggtcggtcgc ggcgtgcagc 248341 acaccactgg gaacgagccg gtctgcgatc aaggccatgg tggccggctg taacaggcgg 248401 cgcttatggt ggcgtgcctt cggccacgga tcggggaaga agactcgaac accgcacaac 248461 gaatcggggg cgatcaagtg ttgcagcacg tcgacggcat tgccaaggat cagccggatg 248521 ttgatcccgt cggagcccac tttgtcaatc gcgcagagca gctgagccag cccgcggcga 248581 tagacgtcca cagcgatcac gtcgacatgg ggttcggcct tcgccatcgc cagcgtcgac 248641 gtgccgctgc cggagccgat ctccaacacc accggcgcgt cacggccaaa ccaggcacgg 248701 gtatccaccg gtgtcccgcg cggggattga ggtagcgcca ggaggccaag ctccggccaa 248761 agtcgctccc aggtctcgcg ttgggccttg gagatccccg accgccgcga ccggatgctc 248821 gtgctgggga gctggcccga tgccaccggt gtgtcgggac gtagccctac cccgggttgc 248881 gcatgcattt gtccatggtg gaccatcagc gcccggcgta gccgcccctg gtccagattg 248941 atacccaaca gttgccttcg gcgggtagcg gacaactgct gactcgcgcc tcggcggcga 249001 gggtgccacc attctgaacg aaccgatcgg gtgggagatg cgcggacaag ggcaccagat 249061 tttcgtcgac gagctggcgc gattcgccac cagctccgcc gaccagcggg tagtggcgat 249121 cgcgcagcgg gccgccgaac cgctgcgcgt agcggtccgt gggcgtcccg gggtgggttg 249181 ccgcacggtg gcgcgcgccc tgcagggtgc tgggagctcg tcgggcatga cggtgacacc 249241 gcaagcacgc gccgccgact ctgacgtcga cctggtcgtc tacgtcaccg tcgaggtagt 249301 caagcccgag gaccgcgaag ccatcgccgc cacccggcgc ccggtggtgg cggtgttgaa 249361 caaggccgat ctggccggcc cgctctcggg tgcaggtccg atcgtgatgg cgcaggcccg 249421 gtgcgcgcaa ttttctacac tcctcggggt ccccatggag tccatgatcg ggttgctcgc 249481 cgtcgcggcg ctcgacgatc ttgatgacac cttgcgggcc gtgctgcggg cgctagccgc 249541 ccaccccgac ggctttgacg ctctcgaccg agccgttgcg gggtttctgg cggcagccct 249601 gccggtccct accgaggtac ggttgcggtt gctggacacc ctcgacctgt tcggcatcgc 249661 actgggcatg gcagcgttcc ggccgggccg gccctcgcga accccggcgc agctccgcac 249721 cctgttacgc cgggtcagcg gtgtcgacgc cgtcatcgac aaggtcaccg ccgccggttc 249781 tgaggtgcgc taccggcggt tgcttgacgc ggtcgcggag ctggaggcgc tggccgcgca 249841 ggccaaggag atcggcggtc cgatcggtga gttcttgcgc gacgacgaca cagttctcgc 249901 ccggatggcg gccgccgtcg acgtagccct ggccgtcggg ctagacgttg gcccgttgga 249961 cgatccggcc gcccacctgc cgcgggcggt gcggtggcat cgttacagcc tggacaacgg 250021 tgacatgcac cgcacgtgcg gcgcggatat cgctcgggga tcccttcggc tgtggtcgct 250081 ggccggcggc atgcccctgc accgataccg gaagtcatcg tgatccgcgc ggctagtgat 250141 gacccggccg gggtggacga gctggtggca gcgatcgcgc cggggcttgc cgggctgggt 250201 ttgccggtca tcaaccgccg cgaggtggtg ctggtgaccg gtccgtggct ggccggggtt 250261 agcggtgtgc gcgcggcact ggccgaaagg ctgccgcagc gtaggttcgt cgagacggca 250321 gagttgggac ccggcgatgc gccggtggcg gtggtgttcg ttgtttccgc ggcaaccgcg 250381 ctgaccgaat ccgattgcgt gttgctggac accgccgcgg agcacaccga tgcggtggta 250441 gctgtggtgt ccaagatcga cgtgcaccgc ggctggcgtg acgtgcttac cagtaaccgc 250501 gacaggctgg ccgcgcgcgc gtcccgctac gcccgggtgc cctgggttgg cgcggccgcc 250561 gcacctgagc tgggcgagcc atacctggac gacttggtcg ccgccatcca gaaacagctc 250621 gccgatccgg ctgtcgcgcg gcgaaacatg ttgcgtgcgt gggaatcccg gcttctgatg 250681 gtcgcgcggc ggttcgatgg cgatgcacag agcgccggtc ggcgggcacg ggtcgacgcg 250741 ttgcgccagc aacggcgcac ggtcctgcgg caggggcgtc aatcgaagtc tgaacacacc 250801 atcgcgctgc gcgcgcagat ccagcacgct cgggtcaaat tgtcctactt tgcccgcaat 250861 cggtgttcgt tgctgcgcgt cgagctgcag gagcacgtcg ccggtctgtc ccggaaggac 250921 atcgccaggt tcgcggcata cacgcgcggc cgggtccagg aggtggtcgc cgaggtgggc 250981 gaaggtgccg tcgcgcacct tgccgacgtc gcgcagctgt tgggtgtgcc ggtgcagcca 251041 ccggtcctcg agaacctccc ggcggtgctc ccgacggttg tggccccgcc actgacatca 251101 cgacgattgg agatccggct gacaacactc ttgggcgccg ggttcgggct gggtatcgcg 251161 ctgaccctga gcaggctggt ggcgggtctt actcccggcc tggctgcatc ggggatggtg 251221 gcgggtgtgg cgatcggcct ggcggtgacc gcctgggtgg tgaatgcccg cgcgctgctg 251281 cacgaccgtg tcgtggtgga ccgctggacg ggtgaggtga cggcatcgct gcggtccgtg 251341 gtggagcagc tggtcgccac tcgggtggtg gctgtcgaga cgctgctgag caccgcgatt 251401 agtgaacgcg acgacgccga gaacgcccgg gtggccgatc aggtcagcat cattgacggc 251461 gaactgcgcg aacacgccgt cgctgcggcg cgggccgcgg ccctgcgtga ccgggagatg 251521 ccggcggtgc gggccgcact tgaggcggtg cgtgcagaac tcggcgagcc gggtgcgccc 251581 acaacaggcc tgttctgaag cttctgaatc gttgttgtga gcaggcttat acccgcccaa 251641 gtcttccctg acaagttctg ggcgataatc tggataaaaa gtgtctcact aggtgagcgg 251701 ccgtatcagc ctcgccacca agacgggcat acctaaccca tacgtaaccg cgagcacccg 251761 ataactacgc aggagaattc gatgacctca gcgaccatcc ccggtctgga taccgcgccg 251821 acgaatcacc aggggttgct gtcctgggtc gaagaggtcg ccgagctcac ccagccggac 251881 cgggtggtct tcactgacgg ctcggaagaa gagttccagc ggctctgcga tcagctagtc 251941 gaggccggca cgttcatcag gctcaacccc gagaagcaca agaactccta cctggcattg 252001 tcggatccgt ccgatgtcgc gcgggtggag tcgcggacgt acatctgctc ggcgaaagag 252061 atcgacgccg gccccaccaa caactggatg gatcccggcg aaatgcggtc catcatgaaa 252121 gacctgtacc ggggttgcat gcgcgggcgc accatgtatg tggtgccgtt ctgtatggga 252181 ccgctgggcg ccgaggaccc caaacttggt gtggagatca ccgactccga gtacgtcgtc 252241 gtctccatgc gcaccatgac ccggatgggc aaggccgcgc tggagaaaat gggcgacgac 252301 ggtttctttg tcaaggcgct gcactcggtc ggcgcgccgc tggaaccggg ccaaaaggac 252361 gtggcctggc cctgcagcga aaccaagtac atcacccact tcccggagac ccgggagatc 252421 tggagctacg gctcgggcta cggcggcaac gcgttgctgg gcaagaagtg ctactcactg 252481 cgtatcgcgt cggcgatggc ccacgatgag ggctggctgg ccgagcacat gctgatcctc 252541 aagctgattt cgccggagaa caaggcttac tacttcgcgg ccgcattccc gtcggcgtgt 252601 ggcaagacca acctggcgat gctgcagcca accatccccg gctggcgtgc ggagacactc 252661 ggagacgaca tcgcatggat gcgatttggc aaggacggtc gcctgtacgc cgtcaacccg 252721 gaattcggct tcttcggggt ggcgccgggc accaactgga agtcgaaccc taacgccatg 252781 cgcaccattg ccgccggcaa cacggtgttc accaatgtcg cactcaccga cgacggcgac 252841 gtgtggtggg agggcctgga aggcgacccg cagcacctga tcgactggaa gggcaacgac 252901 tggtacttcc gcgagacgga aaccaatgcg gcacacccga actcccggta ctgcacaccg 252961 atgtcgcagt gcccgatcct ggcccccgag tgggatgacc cgcagggcgt cccgatctcg 253021 gggatcctgt tcggcggccg ccgcaagacc acggttccgc tggtcaccga ggcgcgcgac 253081 tggcagcacg gggtgttcat cggtgcgacc ctgggtagcg agcagaccgc cgcggccgag 253141 ggcaaggtcg gcaatgtgcg ccgcgacccg atggccatgc tgccgttttt gggctacaac 253201 gttggggact acttccagca ctggatcaac ctgggcaagc acgccgatga gtccaagctg 253261 cccaaggtgt tcttcgtcaa ctggttccgt cgcggtgacg acggtcgctt cctgtggccg 253321 ggcttcggcg agaacagccg ggtgctgaag tggatcgtcg atcgcatcga gcacaaggcc 253381 ggcggtgcga ccaccccgat cggcaccgtt cccgccgtgg aggacttgga cctggacgga 253441 ctggacgtcg acgccgccga tgtagccgcg gcgctggcag tcgatgccga tgaatggcgt 253501 caggaactgc cgctgatcga agaatggctg cagttcgtcg gcgagaagct gccgaccggt 253561 gtcaaagatg agttcgacgc cctgaaggag cgcctaggtt agggcgagca gacgcataag 253621 cccccgcacg cacggcgtgt cgagggcttt agtgtctgct cgcgctcgtt agcggcgggc 253681 acgcacaagt tcttcgacag cgcgcaaaga caccgaaagc ctctcttccc aaccgcccgt 253741 gatcaccacg aatgatcgtc ccgcggcgcg gagagcctgc tcgcagcggg cgaaaaaggt 253801 accgcgtgcg ccggggacac agcgtccgtc gtcggcgtcc cagggcacat cgggcgtggt 253861 gagcagtgtg agatcgtagg gacgccgagc tagatcacgg agctcttgcg ggcagccgcc 253921 cgccaggaac tcggcccaca cggtcgtcgc gagcggatcc gtgtcgcaga tcaggacgcg 253981 atcggcgtca cgagccaagg cttcctccga cgcgatctgt ccgcgaacga tttcggccca 254041 ctccagtcct atcagtgagc cgccattgag ctcccgcaac attttcgccc gctccgggac 254101 ccacttcgtt cggagctttt ccgcaaccgc ctgtgccagc gtggtcttcc cggtggattc 254161 gggtccgatg atgctcacgc gtttgacgaa ggccggccgc acgcaccgtg ggatgtgttg 254221 ccagtggcca agcgggtccg cgcggatgtc ggttgcagtc acgggaacga cggtgcgacc 254281 gtgatcgacc gccacgaaac gcgctccgag gacctgggca aagtccgcgt tgtagggctc 254341 ggcaccgaag acgaagtcgg ggcgggttgc cagcacgccc tgcaggctcg ccttccagat 254401 gtcccagaag tccgggtgct cccacgggcg ctgcgggttc tcgttggcca gatggaccac 254461 gcgatcgaag gggaacagct cccgcatcca tgcaacgcgc tgggcgcccg gaatcggctc 254521 tgctgccgtt gatccgacga cgatggtcag ctcatccacc catcgccgcg cgaactcgca 254581 aaggtagacg tgtcccgcat ggggcggcat gaacttgccg agcaccattc cgtgtgtcac 254641 gacgtcgcct cagcgattcg gccggcgatg acattctccc actcgtccag ccgcgaaaga 254701 aagtacgcct cagcctcatc gaagctgatt ccgacgtcag gagcgacctt ggctacgacg 254761 tcggcgtaga cccgggaggc ctctgggtcg acgaattccc actgaccgag gggccatttc 254821 gcggtgagat aacccgcgtc cgagtacgct tggtacaacg gcgttcccgg ataggggacg 254881 atccggctca tgaacttgaa gccgaccgta tactttgttg cccgcagcag gcggactgtt 254941 tcgcgtagct cgtcgggctg cacggtgggg tgaaacataa tggtgcccgg gataacgtca 255001 atgccgagct gttgcagggc gttgatcgtg tcggcggcat cttgtccacg agtgaggatc 255061 tgcttgcggt aggcgcgcag ttgctcgtag gacccagtct ccacgccgat gaatacccga 255121 cgcaggcccg ccctgtgcag atgtttgaac aagtccagat cgacaacgga gtccagccgg 255181 atatcgacca tgaagttgac gctgatcccc ctcctgagta ccgcgttggc aaagtcagcg 255241 gcgcgttgct gcgaaccggg gtgtttggag ataaacaggt cgtcggtgat ggataggaag 255301 ttgacgtcgt agtccgacac cagataatcg atctcgtcga cgaccgcgtc aaccgacttc 255361 gcccggtagc tgtccttccc tagcatcgcg gacatggccc cggtgccgca gaacgtgcag 255421 cggtaggggc atccgcgggt ggaaaagacg gaggcggcga agccatcagc aaggacggtc 255481 ggcaactcgt cgcgagccgg gcgaggcaac tcgtcaaggt cgaccagcga ggagggtgtg 255541 cgcaggatct gtccctgctc actacggcgg gctagtcccg ggacgtcgtc aaccgcagcg 255601 tcattcgcca gggccaaggc cagcttggtg aacgctacct ccccgtcgcc aacgacgacg 255661 tagtcgaaac agtcatgctg gcgcaggatg cgctcgtagt tcagtgttgc catcgcattc 255721 ccgatgacga tgcgcacgcc atcccaggcc tgtctcgcgc gctgcgccaa ccacaacacc 255781 tccggaaatg tgtcgatgca ggaaaagccg acaagccggg gcgttcctga taaggcggcg 255841 gcgctttgca tggccagcca cgtctcctgc acggacccgt ggccggcgac caggccgttg 255901 acggaggtga ctgcgatccc ttgggtcttg gcgtatgcct tgatcgacat catcccgagg 255961 tgctccatgg gcatggagca atacagccac ggatctccaa gcttgagtcc gtcaacgtac 256021 gacagcccgt cttgacggac gcctggagga ttgaccagaa gagttgccac gtggagaact 256081 ttacaaacga tttcggctgg tgatgggcgg aattgcgccc tgcggctctg gtcgccgggc 256141 cgcgacgtac cctcggcgca tgcagattcg cccgtatatc ggcgccgata agcccgccgt 256201 catcctgtat ccgtccggga cggtcatcag cttcgacgag ttggaggccc gcgccaaccg 256261 gttggcgcat tggttccgcc aggctggtct gcgcgaggac gacgtcgtgg ccatcctgat 256321 ggagaacaac gagcacgtgc acgcggtcat gtgggcggct cgccgcagcg ggttgtacta 256381 cgtgccgatc aatacccacc tgaccgcctc cgaggccgcc tacatcgtcg acaacagcgg 256441 tgccaaagca attgtcggtt cggcggcgct gcgcgagacc tgccacggcc tggccgaaca 256501 ccttccgggc gggctgccgg acctgctgat gcttgccggg ggcggtctgg tcggctggat 256561 gacctacccg gaatgcgttg ccgatcaacc agacaccccg atcgaggacg aacgcgaggg 256621 tgacctgctg cagtactcgt cgggaacgac tggccgaccg aagggaatca aacgcgaatt 256681 gccacacgtc tcaccggatg cggcacccgg gatgatgccg gcactgctcg atttctggat 256741 ggacgccgac tcggtatatc tgagtcccgc gccgatgtac cacaccgctc cgtcagtgtg 256801 gacgatgagc gcactggccg cgggcgtcac caccgtcgtg atggagaagt tcgacgccga 256861 gggcgccctc gacgccatcc agcgctaccg ggtgacccac gcgcaattcg tcccggccat 256921 gttcgtccgg atgctgaaac tccctgaagc agttcgtaat tcgtatgaca tgtccagcct 256981 taggcgagtg atccacgcgg ccgctccatg tccagtccag atcaaggagc agatgattca 257041 ctggtgggga ccgatcatcg acgagtacta cgcctcctcg gaagccagcg ggtcgacgtt 257101 gatcacagcc gaggattggt tgacgcatcc gggttcggtc ggcaagccca tacagggcgg 257161 ggtgcacatc gtgggcgccg acggcagcga gctgccgccg aaccagccgg gcgaaatcta 257221 tttcgagggc gggtacccct tcgaatacct caacgatccg gcgaaaaccg cggcgtcgcg 257281 caacaagcac ggctgggtaa ccgtcggcga cgtcggctat ctcgacgacg acggctactt 257341 gttcctgacc ggccggcgcc accacatgat catctccggc ggcgtgaaca tctacccgca 257401 ggaggcggag aacctcttgg tcgcccaccc caaggtgctc gacgcggcgg tgttcggcgt 257461 tcccgacgac gagatgggtc aacgtgtcat ggccgcggtg caaaccgtcg actccgccga 257521 tgccaacgat cagttcgccg gcgagctatt agcctggtta cgagaccgct tgtcacactt 257581 caagtgtcca aggtcgatcg cgttcgaacc gcaattgccg cgcaccgaca ccggaaagct 257641 ctacaagagc gggctggtcg aaaaatactc ggtgtgaccg atgctgccgg gggcccgacc 257701 tgtccaccca gacaccggct atatcccgcc ccgggccacc agttgtccgg ctatcacgtt 257761 gcgctggatc tcgttggtgc cttcaccgcc gttccacgtc gtactcggtc gagtagccgt 257821 agccgccgtg gatacgcacg gcgtttaggg cgatttccat cgcgacctcg gaggcgaaca 257881 acttggccat cccggcctcc atatcgcagc gttggccgct gtcgtaccgc tcggcggcat 257941 agcgggtcag ctgacgggcc gctgtgagct tggtcgccat gtcggccagg taattgccga 258001 ccgcctgatg ctgccagatc ggtcggccaa agctttcccg ttgctgagcg taggccagcg 258061 agtcctcgag tgccgccgtc gccacgccca gcgcccgcgc ggccacttgg atgcgacccg 258121 tttcaagtcc cttcatcatc tgcgaaaagc cttgacccat ggctccgccc aggatcgccg 258181 agaccggcac ccggaggttg tcgaacgaca gctcgcagga ttcgacgccc ttgtaaccca 258241 acttcggcaa gtcccgcgac accgtgagtc ccggcccggg ttcgacgagc acgatcgaca 258301 tgccttggtg ccgcggtgtg gcgttcgggt cggtcttgca cagcaccgcg aaaagtccgg 258361 accggcgggc gttgctgatc cacgtcttgc agccgttgat caacaacccg gcagagcctt 258421 cagggccgtc ggccaacgcc gtggtcgaca tgttctgcag atccgagcca ccgccgggct 258481 cggttagcgc catggtggcc cgcagctcgc cactggccat cgggggcaga tatgtccgcc 258541 gctgttcctc ggtgccaaac agggtcagca atttggcgac gacggtgtgc ccgcccatcg 258601 cgccggccag gctcatccag ccgcgtgcca gctcctgggt gacttgcaca tagcacggca 258661 tcgacaccgg cgacccgccg tactgttcgt cgatcgccag gccgtagatg ccgatgtgtt 258721 tcatctgctc gatccacgcc tccgggtagc tattggcatg ctcgacctca cggacggttg 258781 gcttcacgtc tcggtcgatg aatgcccgca cggtggcgac cagcatcgct tcgtcgtcgt 258841 tgagctcgtt gcgcaccttt tgtcgccctc cgtattgacc ccctgtccga tagcctgcca 258901 gcatgtggcg ttgtggctag cgggtatggg ggcatccgcg tcggcgggcc ctatttcgat 258961 gacctgtcaa aaggtcaggt gttcgactgg gcgccggggg tcacactgtc gctggggctg 259021 gcggccgccc atcagtcgat cgtgggtaac cggctacgcc tggctctgga ctccgacctg 259081 tgtgcggcgg tgacgggtat gccggggccg ctggcgcatc cgggcctggt ttgcgatgtg 259141 gcgatcggcc agtcaacttt ggcgactcag cgggtcaaag ccaacctgtt ctaccgcggg 259201 ctcaggtttc accgatttcc ggcagtgggc gacaccctct acacccgtac cgaggtggtg 259261 gggctgcgag ccaactcgcc caaaccgggc cgtgcgccaa ccggattggc ggggctgcgg 259321 atgaccacga tcgaccggac cgatcggttg gtgctcgatt tctaccggtg cgccatgctg 259381 cccgccagcc ccgattggaa acccggcgct gtgccaggtg acgacttgtc caggatcggt 259441 gccgacgcgc cggcgccggc cgccgatcca accgcacact gggacggtgc ggttttccga 259501 aagcgggttc ccgggccgca cttcgatgcc ggtattgccg gtgcggtgtt gcatagcacc 259561 gcagacctgg tcagtggagc gccggagctg gctcggctca ccctcaatat cgctgctacg 259621 caccatgatt ggcgggtcag cggacgacgg ctggtctacg gcgggcatac catcggactg 259681 gcactcgcgc aggcaacccg gctattgcct aacctggcga ccgtcctgga ctgggaatcc 259741 tgcgaccaca ccgctccggt acacgagggc gacaccctct acagcgagct gcatatcgag 259801 tctgcgcagg cccacgcaga cgggggtgtg ctgggactgc ggtcactggt ctacgcggtc 259861 agcgattcgg cgagtgagcc cgatcggcag gtgctcgact ggcgttttag cgccttgcaa 259921 ttctaggttc ggttactaag ggccagcgcg gcacgcaaac tgttgcactg actagtgaag 259981 aacctttgtg agaccccaac attcggggcc acacgatcga aaccgtggaa ggcgccttcg 260041 actacttcta cctggcatgg caccccggct gctgtcagac gttcggcata ggccagatcc 260101 tcgtcgtgga gcaggtcgtg ggtgccgacg ccgatccatg ccggcgccag ccctcctagg 260161 tcgtcacgcc gtcccgggac cgcgacccgt gcgtccgcat cgccaagata tgcccgccag 260221 ccgaaccggt tggcgcgccc gttccatagc cggtagtgcg ggttggcggg ggcgatcgac 260281 ggccggtcgt cgagcatggg gtacaccagc aactgaaatg ccggtgtgat gccgccacgg 260341 tcgcgggcaa gcagagccag cgccgccgcg aggccgccgc cggcactagc gccgccgatt 260401 gccacccgcg cggggtccac cgccggcagg ctggccagcc aggtcaacgc cgagtagcag 260461 tcgcccaggg cggcaggata cggattttcc ggcgccaggc ggtagtccac cgatgcgaca 260521 gtgatgccca gtctgctgct gaaccggagg cagagccgat cgtcctgttg cgcggtgccc 260581 attacgtatc cgccggcgtg gatccacagc agcgcgggcg ctggttcgtt gctgccggcg 260641 ggtcggtata gccggacacc gaccccggat tccagggtga gcacctcgat atcggggggt 260701 gtacgggaca ttcgaagccc cgcgacgacg atcaatgccc gcatgactgg cagggtgcga 260761 ggaccgacca gctgtcgtgg ggtgacgacg gcgatgcgac gcaggtcggg gtggacttcg 260821 ttgccggaca ccggtccagt atgcgtcggc gcaatttcgc ctcggtacag cgatggcttt 260881 ggcaggctgc ggttagtcga acgaggatcg ggatggtggc ctgatgagtg atccagcaag 260941 aggggcggaa gccgaggatg cctacggttt tcccgccggg ctgtggcgct ggctgcagcg 261001 gcatccaccg ccggcgttgc accggctcac ccggtttcgc agcccgttgc gtggtccgtg 261061 gttgacgtcg gtgttcggcc tggtgctatt ggtggcgttg cctttcgtca tcatcaccgg 261121 gctactttct tatatcgcct atgcgccgca gctgggccag gccatccccg gtgacgtcgg 261181 ctggctgcga ctccccgctt tcacctggcc cacccgtccg tcctggctgt accggttgac 261241 ccaggggctg catgtggggc tggggctggt gatcattccc gtggtgctgg ccaagttgtg 261301 gtcggtgata ccgcggctgt ttgtgtggcc gccggcgcgc tcgattgccc aggtgctcga 261361 acggttgtcg gtgctgatgc tggtcggtgg gatcctgttc cagatcgtca ccggcgtgct 261421 caacattcag tatgactaca tcttcgggtt cagcttctac accggccact attttggggc 261481 ttgggtcttc attgcgggtt tcctgttgca tatcgtggtc aagatccccc acatggtcac 261541 cgggttgcga tcgataccga tgcgagaagt gttgggtacc aacgtggctg acacccgggc 261601 gcagccgtgc gatccggacg ggctggtgtc ggtcaatccg ggcgaggcca cgctaagcag 261661 acgcggtgcc ctgggattgg tcggtgccgg ggtgctgctg atcggggtgc tgacggttgg 261721 gcaaaccctg ggcgggttca cccgcaaggc cgccctgctg ctgccccggg gccgtgtcgt 261781 gagcccgggc gacttcccgg tcaacaagac cgccgccgcc gccgggatca ccgcggaggc 261841 cattggcccc gactggcggc tggtgctgtg tggcgggcct gcggaagtag tgctggatcg 261901 cgccacgctg gccggcctgc cgcaacgcac cgcccggctg ccgctggcct gcgtcgaagg 261961 gtggtcggcc gtgcgcacct ggagcggcgt gccgctggcc gagctggcgc tgctggcggg 262021 cgtgccggcg gcgcgctcgg cacgggttac atcgctgcag cgcggcgggg cgttcggcga 262081 ggcgaagctg gcggcaaacc agatcgccga ccccgatgcg ctgctggcgt tgcgggtcga 262141 cggggcggat ctgtcgctgg atcatggcta cccggcccgc atcatcgttc ccgcactgcc 262201 cggtgtgcac aacacgaaat gggtcgctgg catcgaattc cacaagaggt gaaatgttcg 262261 acattgcaac gcgtttcaaa aactcctacg ggtcaggtcc attgcacctg ctggcgatgg 262321 tgtctggctt cgccctgctg ggctacatcg tggccaccgc caggccctcg gcgctgtgga 262381 accaggccac ctggtggcag tcgatcgcgg tctggtttgt cgccgccgtc gtagcccacg 262441 acctgctgtt gtacccgctc tacgcgctgg ccgaccggat cctggccagg ctagtcggca 262501 ggcgcgacgt ctcggcgccc cgccgccgcc cggaactacc ggtacgcaac tacattcgga 262561 tcccggcgct ggcagccggc ttgacgctgc tggttttcct gcccggcatc atcagacagg 262621 gtgcgccgac atacctggat gcgaccggac agacgcagga accatttctg ggcaggtggt 262681 tgctgctcac cgcggtcgcg ttcgggatca gcgcggccgc ttacgccatt cggctggtgg 262741 tggcgcacgt gaggcggcgc cgagcggggt gttcgcgggt cgacgcgatc gacgaggagt 262801 aggctcccac catgaaccag cgacgcgccg ccgggtcaac cggtgtggcc tacatcagat 262861 ggttgctacg tgcccgtccc gctgactata tgctggcctt gagtgtcgcc gggggttcgc 262921 taccggtggt gggtaagcac ctcaagccgc tcggcggcgt tactgccatc ggcgtctggg 262981 gcgcccggca cgcatccgat ttcttgtccg cgacggcgaa ggatttactg acccccggta 263041 tcaacgaggt tcgccgtcga gatcgtgcca gcacgcagga ggtttccgtc gcggccttac 263101 gcggcatcgt ttcgcccgac gaccttgccg tcgaatggcc ggcgccggag cgcacgccgc 263161 cggtctgcgg ggcgctgcgc caccgccgtt acgtccaccg ccgtcgcgtc ctctacggcg 263221 acgacccggc ccagttgctc gacgtatggc gccgcaaaga tatgcccacc aaacccgcgc 263281 cggtgttgat cttcgtccca ggcggtgcct gggtgcacgg cagtcgcgcc atccaggggt 263341 atgcggtgct gtctcggctg gccgcacagg ggtgggtgtg cctatcgatc gactaccggg 263401 tcgcaccgca tcaccgctgg ccacgacaca tcctggatgt caagaccgcc atcgcgtggg 263461 cacgggccaa tgtcgacaaa ttcggcggtg accgcaattt cattgcggtg gctggttgtt 263521 cggccggcgg ccacttgtcc gcgctggccg ggctcaccgc caacgacccg caatatcagg 263581 ccgagctgcc agagggctcc gacacgtcgg tcgacgcggt ggtggggatt tacggccgct 263641 acgactggga ggaccgctcc accccggaac gtgcccggtt cgtcgatttt ctggagcggg 263701 tagtggttca gcgcacgatt gatcgtcacc ccgaagtgtt ccgtgacgcg tcgccgatcc 263761 aacgagtcac cagaaatgca ccgccattcc tggtgattca tggcagccgt gactgtgtca 263821 tcccggttga gcaggcgcgg agctttgtcg agcggttacg agcggtctcc cgctcacagg 263881 ttggctacct ggagctgccc ggtgcgggcc acggcttcga cctgctagac ggcgctcgca 263941 ccggcccgac ggcacacgcg atcgcgctgt ttctcaacca ggttcatcgc agccgggcac 264001 agttcgcgaa agaggtcatc taaacgccgg ccaattgtat ggtcgcccta tgagtagggg 264061 gctgcggtga aacggctcag cggctgggac gcggtactgc tttacagcga gaccccgaat 264121 gtgcacatgc acacactcaa ggtcgccgtg atcgaattgg attcggacag acaggaattc 264181 ggtgtcgacg cgtttcgcga ggtgatcgct ggccggctgc ataagcttga gccattgggc 264241 tatcagctgg ttgatgtccc gttgaagttc catcacccga tgtggcggga gcactgccag 264301 gtcgatctca actaccacat ccggccgtgg cggttgcgcg ccccgggggg tcggcgcgaa 264361 ctcgacgagg cggtcggaga aatcgccagc accccgctga accgcgacca cccgctgtgg 264421 gagatgtact tcgttgaggg gcttgccaac caccggatcg cggtggttgc caaaattcac 264481 catgcgttgg ctgacggtgt tgcctcggca aacatgatgg cacgggggat ggatctgctg 264541 ccgggaccgg aggtcggccg ctatgtgcct gaccccgctc ctaccaagcg gcagttgctg 264601 tccgcggcgt tcatcgacca cttgcgccac ctcggccgga ttcctgcaac catccggtac 264661 accacgcagg gtctaggccg ggtgcgacgt agctcgcgca agctctcacc cgcactgacc 264721 atgccattta ccccgccacc gacgttcatg aatcaccggc tcaccccgga gcgcaggttc 264781 gccaccgcca ccctggcgct gattgacgtg aaggcgacgg ccaagttgct gggggcgacg 264841 atcaacgaca tggtgctggc catgtcgacc ggcgctctgc gtaccctgct attgcgctat 264901 gacggcaagg ccgaaccgct gctggcgtcg gtcccggtga gttacgactt ctcaccggag 264961 cggatctccg gtaaccgctt caccggaatg ctggtggcgc tgcctgccga ctccgacgac 265021 ccgttgcagc gggtgcgcgt ctgtcacgaa aacgcggtct ccgccaagga gagccaccag 265081 cttttgggac cggagttgat cagccgctgg gcggcttact ggccacctgc cggtgcggaa 265141 gccttgttcc ggtggttgtc tgagcgcgac gggcagaaca aggtactcaa cttgaatatc 265201 tcgaatgttc ccggtccgcg cgaacgcggc cgcgtggggg ccgcgctggt caccgagatc 265261 tattcggtgg gcccgttgac cgccggtagc ggattgaata tcacggtgtg gagttatgtc 265321 gatcagctca atatctcggt gttaaccgat ggttccaccg tgcaggaccc gcatgaagta 265381 accgcgggaa tgatcgcgga cttcatcgaa atacgccgcg ccgctggtct ttccgtggag 265441 ttgacagtcg tcgagtccgc gatggcgcag gcatgacacg aaacaccgga cgagtatgag 265501 gccagtatga gcagcgaaag cgacgcagcc aacaccgaac ctgaggttct ggtagaacag 265561 cgggatcgga ttttgatcat cacgatcaac cgcccgaaag ccaagaacgc ggtcaacgcc 265621 gcagtcagcc ggggcttggc cgatgcgatg gatcagcttg acggcgatgc cggcctgtcg 265681 gtggcaatcc tgaccggtgg gggcggttcg ttctgcgcgg gcatggacct caaggcgttc 265741 gcccggggcg agaatgtcgt cgtcgaaggt cgcggccttg gctttaccga acgtccgccg 265801 accaagccgc tcattgctgc ggtggaaggc tacgcgttgg cgggtggcac cgagctggcg 265861 cttgctgccg acctgatcgt ggcggccagg gattcggcgt tcgggattcc tgaagtcaag 265921 cggggtctgg ttgccggcgg cgggggattg ctgcggttgc cggagcgcat cccgtatgcg 265981 atagccatgg agttggcgct gaccggtgac aacctaccgg ccgaacgcgc gcacgagctg 266041 gggctcgtca acgttttggc cgagccgggg accgccctcg atgctgcgat cgcgttggcg 266101 gagaagatca ccgccaatgg gccgctggcg gtggtggcca ccaagcggat tatcaccgag 266161 tcgcgtgggt ggagtcccga cactatgttc gctgagcaga tgaagatcct ggtgccggtg 266221 ttcacctcca acgacgcgaa ggaaggtgcg atcgcgttcg ccgagaggcg ccggccccgt 266281 tggacgggca cctagcccag ctacgcgacg gtgtagccca tcggcagcag gacactcttt 266341 tgctgggtga agtgttcgac accctcgggc ccgttctcgc ggccgattcc ggagttcttg 266401 tagccgccga agggtgagcc gggatcgaag gcgtaccagt tgattccgta tgtcccggtg 266461 cggatctgct gcgagatctt gatgcctttg ggcacgtcgg tggtccacac gctgcccgcc 266521 agcccataca ctgaatcgtt ggcgatcgcg atcgcgtcct cctcggtgtc ataaggaatg 266581 atggccagca ccggcccgaa gatctcctcc tgtgcgatgg tcatcttgtt gtcgacatcg 266641 gcgaatacgg tgggttggat aaagaagccg ttgtccaagc cctcgggacg gccgccgccg 266701 cacaccaacc gagcgccctc ctcgatgccc ttggcgatgt agccttcaac gcgagtccgc 266761 tgcttctccg agatcagcgg cccgatctga gctgccgggt ccgacggcgg gcccaccggg 266821 agagccgtta cgaaattagt taccgcagcc acgatttcgt cgtaccggga gcgcggagcc 266881 agaatgcggg tctggttgac gcagccctgt ccggcgttca tgacgccgga gaacaccatc 266941 atcggaatag ctgcggccag gtcgacgtcc tcgagaatga tggccgccga cttgccgccg 267001 agttctaagg tgcacggctt gagcatctca gcggcacgcc tgccgacctc tcggccgacg 267061 gccgagctgc cggtgaaggt aaacatgtcg atgtccgggt tagacgtcag cgcctgaccg 267121 gtctcaatcc ctcccggcac taccgacaac accccctcgg gcaggcccac ctcggcgaac 267181 acctccgcca aagcgtttgc ggtcagcggt gtttcggcgg cgggcttgag cacgatggtg 267241 cagccggcca gcagcgccgg cgcaatcttg ttgacggcca gaaacagcgg gacgttccag 267301 gccacgatcg cgcccaccac accgaccggc tcacggctga caatgctctg tccataggag 267361 ccggtgcggg tttcggtcca ggtgaccttg tccgctgcac cggcaaagta gttcatcgcc 267421 cccatcgaac ccatccagtg catcgtctcg atgatggtcg gcggctggcc ggtttcggct 267481 gcgagcagct tggtgaacag gtccttgcgc tcagccagca tcttgaccgc cgcagcgatc 267541 accgccgcac gctcgtgcgg cggggtcgag ggccaggggc cgttgtcgaa cgccgcacgt 267601 gctgcggcga ccgcggcgtc gacgtcggcg gcggccgcca tcggcacctt gccgacatat 267661 tccccagtgg ctgggcagcg tacctcgata acatcggagg tcgacggttt ggtccacttg 267721 ccgccgatga aaagcttgtc gtattccgtg gcactgtcag acatatgcgc cgctcctcct 267781 catcgctgcg ctcggcatcg tcgccggcgg tcatggcgtc accctaccca agccgaacgc 267841 gaaacgagaa cgtgttccat tattagggtg tgagcaccaa taccagattg ctcaccagga 267901 actcacgcag caccgggacg gatgtcagcc accacgccca tctggggtgg tagcggggaa 267961 atacggctaa cgcggctccg gtgccggcag cccagcgcag accctcggcg gcggacacgg 268021 caaacaacga cgacccatag ttgttctttg ccggatggcc gtgtttgcgg acatatcggg 268081 cggcggcgcg ggcgccgccg aggtagtggc tgaggcccat ctcgtgcccg ccgaatggcc 268141 ccagccaaac cgtgtaggac agcacgacca acccgcctgg cttggtcacc cgcagcatct 268201 cggtgccaag ctgccagggg cgcggcacgt gttcggcgac attggaggac aagcagatgt 268261 ctaccgagtc gtcggcgaac ggcagtgcca tgcctgacgc ccggacgaac atgccgggcc 268321 ggccggtgaa cgcaggtccg gcggcatgca tttcatcagg gtccggctcg acgccgatgt 268381 agccgacacc ggcgtcggag aacgccgtcg cgaaataccc cggcccgccg ccaacgtcga 268441 gcagcgtacg gccaactggc ggctcgctat gtgtggccag ccacagatcg ccgatcattg 268501 ctgcggtgtc ggccgccagt gtgcgataga accgtgccgg gtcgcgctgc tcgtagcgga 268561 agtctgccag cagtcgcagc gagcgccgca gtgtcgcccg tcgcgcgaac acatcggtga 268621 ccgccacctg gcacacccta cggcccgcta ggctatcgac caatgtctgc tctgcgctcg 268681 gtgttgctgc tgtgctggcg cgacatcggg cacccccagg ggggcgggag cgaagcctat 268741 ctgcaacgca tcggggctca gttggccgca tcgggcattg cagtcacgtt gcgcaccgct 268801 cgctatcccg gtgcgccacg gcatgaactg gtcgacgggg tgcggatcag tcgtgccggc 268861 gggcgctact cggtgtatct atgggcgttg ctggcgatgg ccgcagcccg atgtgggctt 268921 gggccgctgc gccgagtgcg cccggatgtg gtcgtcgata cccaaaacgg ctggccgttt 268981 gtggcccggc tgttgtatgg ccggcggtcg ctggtactgg tacaccattg ccaccgtgag 269041 cagtggccgg tggccgggcg gatgatgggt cggctcggct ggtatgtcga gtcgatgttg 269101 tcgccacggc tacaccggcg caaccagtac gtgacggtgt cgctgccgtc ggcgcgggat 269161 ctgatcgccc tcggtgtgga cagcgagcgg atcgctgtgg tgcgcaacgg cctcgacgag 269221 gcgccgtcgc caacgttgtc cggcccacgt gcgcccacgc cgcgtgtggt ggtgctctcc 269281 cggctggtgc cgcacaagca gatcgaggac gcgttggcag cggtcgcgga gctacagcct 269341 cggataccgg gcctgcacct agacatcgtc ggcggtggct ggtggcggca gcgcctcgtt 269401 gaccatgtgc accggctcga cattgctgac gccgttacct ttcacgggca tgtcgacgat 269461 gtgaccaaac accatgtgct gcaaagctcc tgggtgcact tgttgccctc acgtaaagag 269521 ggatgggggc tcgcggtcat cgaggcggcc cagcacggcg tgcccaccat cgggtacaga 269581 tcctccggtg gtttggcgga ctcgatcgtc gacggggtga ccggcatatt ggtcgacgac 269641 cgggccgaat tggtggcttg gctcgaacaa ctgctgtccg attcggtgct gcgtgaccaa 269701 ctcggcgcca aggcacaggc gcgtagcggt gagttctcct ggcggcaaag cgccgaagcg 269761 ctgcgcagcg tgttggaggc agtgcaggcc agccgttttg tcagcggcgt ggtttgagcc 269821 ggcttcgaca gacttaatcc tgggcgcggc tcgccggcgt gtcttcgcag tggtgtaagt 269881 gtcggcgcac ccaatagccg gccgcgccag cgccgccgac cagcagcatc gaaagccacg 269941 cccaatgcgc gagcattgtc gctttgaggc gggccgacga tgcaccggag gtttggccgc 270001 cgacccgata aagagccaat tcgtcgtcgc ggtgcgccgc tgctagccgg ccgagggtgc 270061 gtgcggccgc gcccatgtcg ccggcgctgt cggattcgac gaccagccac ccgacgccgg 270121 ccgcggccaa ggttgacgga tggggcccgg tgagcagcag ctcctggacc gcccgggcgt 270181 gcgcgtcttc gccgggaacg gtcaccccgg aaatgaccag atcacctgtg gtcagcacat 270241 cggcgcgaac ccaacggggg agcggatcga gtaccggtgc cgaaccggac cacgagaagc 270301 gccgcatggt gcccgcgggc aagaccgcaa ccgtccgggg atcggcattg atcgccgctg 270361 ccaccgccgc ccaaccggac gggtagtgca caggcgcaac cttgccccac accccccacg 270421 ccaagtcagg cagcgttagg accagcgcca gacagcagac caccgccgcc gttgccggtc 270481 gcagccagcg tcgcagcgtt agcaccgtgc ccgcaccgga gagtgtgtat ccgggtaccg 270541 ccagcgcgac ccacttctgt ccgtcgcgca gcacgcccag gccgggtgcg gcatcgacca 270601 ccacccgtag cgcgtgcaga cctgggccgg tcgcaaggac agccgggacc atcacggaca 270661 ccgccgctag tgtcagcagc ggcactgcca cgggccggcg cgccacagtc ggtagtccga 270721 tcgccaccat ggcgagtagt acgacggcgg atgccactgc gaaaagcgtt gtccgcgagc 270781 taggtacggc ctcgccgttc cagatcccac cgagactggc caagctgcca agcgtgccca 270841 gccccggttc ggcgcgtggc gcgaacgcgg taaccccaag ctgattggct gccgtgtggc 270901 tggtcaacga cgagcccagc gccgacgccg tcagccaggg cagcgcaccc accagcgcgg 270961 agcccaacgc cgcgacccca cattgccagc gcgggcggcc cgcgccgggc atcgccacgc 271021 acaccaccgc aactgtcgcg gcgagtagca gcccggacgg ggtcaggccg gccagcgcaa 271081 cccagaacgc cagcccaaaa agcccgaacc aacccgcgcc aaccgttgtt cgcatcgtta 271141 acatcgcggt cgcaacccag ggcagacacc catagccgac cagcaggctc caatggccct 271201 gcaaaagtcg ttcggccaca tagggattcc agatcgccag cgtgatcgcg acaaactggc 271261 cggctgcccc cgctgcgggt agtgccgttg cgaccagtcg ggccgcgccc cagcccgcca 271321 gccaaagccc cagcagcagc agcgctttca ccacgacgcc gccgtcgacg aggtgtgacg 271381 ccaaagcgac cgcgaagtcc tgcggagtcg cccggggcgc cgatgtcagc cctagggcgt 271441 tggccgacac atacgaccgt ggtgtggaca ctgcatcgcg cagcagtagg tatccgggcc 271501 gcagtagcgg cgcggccaac agcagcacca agaccagcgc gtaccccggt cggaaccagc 271561 gcacgtcgcc tgattagcgc cgctcgggcg ggccggggtc gggatgcccc gcgtccggcg 271621 gtgggggcgg ctgcgccgaa ccgagtcggg gcggatccga gccactcggc tcgcgcggga 271681 agtcggggcg ctgcgtgggc agtttctcgg tctcagcctc cgctccaggc accggttctt 271741 cgaagccgcc gcggcggtaa tcgtggtcgt cccgatcccc gctggcagcc atcagcgcac 271801 cttcggtccg aaggctaaac gacgcgaaca gaccaccgcc gaccagcgcg accaagccgg 271861 ccgcggtgaa tgtaatcggc agcacccgcg accacagcgc cagccggtcg cgctcgtcgc 271921 gagccgcgtt gacctgggat tcgaccgtct cttcggtgga ggtgacctgg tagtcggcaa 271981 acgtgacctc tggtttgagt gggtcacgag cgaagtagtg gttggcgcgt tcggtttctt 272041 tgacgatggt gccggacacc gggtccaccc agaatgttcg ctgcgccgcg taatagcggg 272101 tcatggtgat ttgctcgttc ggatcaccgg gtagccccca catcgccgct gatgtggtga 272161 ctttgccgtc ctcgtcgccg gcgtacagcg acgggtattt gaggggagcc accagcttcc 272221 cctcgggggt gtagccgacg ttctgcgtga agcggtatgt ggttaaaccg ttgacgtcct 272281 cctcgccttc gtagttggcg tcaaacgcct tctgtgcgat ggggtcgaaa taggggtatg 272341 tcttcttctc ggtgtgaaac gggaaccggt aagacagccc gtcgtgccgc agcggaatgg 272401 ccgtcggcgg gttctcgtca ttgaggcccc gcggtttctg gacggcgccg ccggtgtggg 272461 tgtcgtcgga gacagccatc gccgtcttgc ggttgagggt gaccgtgtcg acgatcgcca 272521 gcagcagccc gctgtccttc tgcttgtcgg tgcgccggag cgaggatccg acctgaagtg 272581 tgaccacgtc ggcgttggcg ggcgattcga cggtgacttg ctgttgggac accagcggca 272641 cgtcctggtt gaccacgatg tgctcggtgg ctagcgacgc cgagtcgagt gccgttccag 272701 tgccgtcgct gatcaacgtg gcatcgatat cgagtgggat ctcagcgatc ctgctggtgg 272761 tataggtcga cagcagcagg gcggcgatca gtagggcggc tccgagtccg atagcgccgc 272821 acgcggcgaa ccgcaacatg actgcccggt tcacctgcgc cgctctcccc cgcaagcggg 272881 tggtgccccc acctcatcgc ttcgtccccc gcaagcgggc ggtgccccca ctgcatcgtc 272941 gccggcgcgg ttcacgttgc tgtgacctcc ttatggtcca tggactcgtc ggtcgggacc 273001 cgctccgacc tgaccaagcg aggcaaaacc cgtttgaccc taacagcaga gcgtatgggc 273061 ccggcggacg aatcgggtgc accgattcgc ccgcaaacac ctcacaggca cactgtgttg 273121 gtgaccaacg gccaggtggt gggtgggacc cgtggctttc tgcccgccgt cgagggaatg 273181 cgcgcatgcg cggccgtcgg cgtcgtggtc actcacgtcg cgttccagac cgggcactct 273241 agcggtgtgg gcgggcggct gttcggccgc ttcgatctgg cggtggcggt gttcttcgcc 273301 gtgtcgggat tcttgttgtg gcgcggacac gccgcagcgg cgcgagatct gcggtcacac 273361 ccgcgaaccg gtccgtatct gcgatcgcgg gtggcgcgca tcatgccggc ctatgtggtg 273421 gcggtggtcg tcatcctgtc cctgctgccc gacgcggatc atgccagcct gaccgtgtgg 273481 ctggccaacc tgacgctcac ccagatctat gtgccgctga ccctgaccgg cggcctgacc 273541 cagatgtgga gcctgtccgt ggaggtcgcc ttctatgcgg cgctgccggt cttagcgttg 273601 ctgggccgcc gaattccggt cggtgcccga gtgccggcga tcgcggcgct ggcggcgctc 273661 agctgggcgt ggggctggct cccgttggac gccgggtcgg ggatcaaccc gttgacctgg 273721 ccgccggcgt tcttctcgtg gttcgccgcg ggaatgttgc tggcggagtg ggcctacagc 273781 ccggtcgggt tgccgcatcg gtgggcgcgc cgccgcgtgg cgatggcggt taccgcgctg 273841 ctgggttacc tggtggcggc ctcgccgttg gcgggtccgg agggcctggt tccgggcacg 273901 gcggcacaat tcgcggtgaa gaccgcgatg ggctcgctgg tagcgttcgc gctggtggcg 273961 ccgctggtgc tggaccggcc cgacacgtcg caccggctgc tgggcagccc cgcgatggtg 274021 accctgggcc gttggtccta tggcctgttc atctggcatc tggccgcgct ggccatggtg 274081 tttcccgtga tcggagcgtt cccgtttacc gggcgaatgc cgacggtgct ggtgttgacg 274141 ctgatcttcg gtttcgcgat cgccgcggtc agctacgccc tggtcgagtc gccctgccgg 274201 gaagcgttgc gccgctggga gcgccgcaac gaacccatat cggtcggcga acttcaggcg 274261 gacgcgattg caccctgact cggccggctg acacctggcg ggcacctagt cgatcgtgcc 274321 cgctggcacg atccactgac agggctgacc ggtcacggcg gcgatgagat cgaagtccgc 274381 gtcgtaatgt agaacgacca gcccgtgttc ctcgccggcc gcggcaatga gcaggtccgg 274441 gattttgcga ccgcgctgac tacgcgcagc gagtaggcgc tggattccaa gcgcgcggcg 274501 atgatgcgat gccgtcgatt cgatgaggtc gaacgcgctc aatgccacca tgagccgctg 274561 ccactcggtc tcattgcgtg cggagtaccc gacttcaagg tcggttattt gcgtgcgagc 274621 gacggcaccg gcctcagcca acggttccac cgcccgccgc acggcgggcc ggctgagcct 274681 tttgatcacg ctggtgtcga gaagatattt cagcgccatg cttcggcgcg gtcctctggc 274741 ggtgcggcgg ccagcgtgtc gagagcggcg gcgacgcgtt gaactcgctg agacgtggct 274801 tgccgcaggg ccgcgttgac ggtgtctttg atcgtcgtcg tgcccaattc tgtacgagcc 274861 atgtttaaag cctgctcgtc gatgtcgacg agatgtttcg ccatgaatcg gagtatatat 274921 caataaggag ccgatatata tgcacaatgc caagcccatg gcattcgccc ggcgcggctg 274981 tctcactgat agccgccctg ccgctcgaag atgcggcgcg ggttgtcgac gagcatggtg 275041 tgcagctgct cgtcggtgac gccgtgctgc ttcagtgcgg ggatgacgtc gttgtggatg 275101 tggaggtaat gccaattcgg catcgccacc ggcaccagct cctcgggaag cgcgtcgaaa 275161 tagcagcagg cgtcgtgtga tagcaccatc ttgtcggcat ggccgcgctc gcacattcgg 275221 gccacgatgt tcacccggtc ctgaaacggt gagatcacgt cgacgccgaa ccggtccatc 275281 ccgaggtagg agccggcggc gatgagctct tccaggtagc cgacgtcggt gctgtcgccg 275341 cagtgtccga taaccacccg gctcaggtcc accccctcct cggcgaagat gcgttgctgg 275401 tcaaggccgc gccgcagccc ggcgtgggtg tgggtggaga tcggcgcccc ggtgcgtttg 275461 tgtgcttggg cgaccgcgcg caacacccgc tcgacaccag gggtgaggcc gggttcgtcg 275521 gtggcgcact tgaggattcc cgccttgatg ccggtgtcgg cgatgccgtg ctcgatgtcg 275581 cggacgaaca tgtcggtcat gatctccggg ccgtccagct gtgcgcccgg cccgaggtag 275641 tggaagtaga acgggacgtc gttgtaggtg tacaagccgg tggccacgac gatgttcagc 275701 tcggtggccg cggccacccg ggcgatgcgc gggatgtatc ggcccagccc gatcaccgtg 275761 aggtcgacga tggtgtccac gccgcgggcc ttgagttcgc ctagccgggc gatggcgccg 275821 gccacccgct tgtcctcgtc gccccaggct tccgggtagt tctgcgcaat ctcggtggtc 275881 atgatgaaga cgtgctcgtg catcagcgtg acgccgagat cagcggtgtc gatgggtccg 275941 cgagcggtat ttagttctgg cacgtcactg atgctaggcc gcaatcggtg tcttgcgggg 276001 ccgcagtgca gtagcgtcac cctcgtcgtt gaccgaaccg ctcgggagcc aattcttatg 276061 ctgctcaacc ccaaccattt gacacgcaaa tacccagacc gtcgctccgg ggagatcatg 276121 gccgcgacgg tggacttctt cgagtccagg gggaaggccc ggctcaagca cgacgaccac 276181 gagcggatct ggtactcgga cttcctggac ttcgtcgggc gggaacgcat ctttgcttcc 276241 ctactgacgc cggcctccta tggcgccgat gattgccgct gggacaccta ccggatcagc 276301 gagttcgccg agatcatggg cttctacggg ctgagctact ggtacccctt ccaggtgacc 276361 gccctaggcc tgggcccgat ctggatgagc gccaacgagg acgccaagcg caaggccgcc 276421 gcggggctcg aggccggcga agtgttcgcc ttcggcctgt ccgaacagac ccacggcgcc 276481 gacgtctatc agaccgacat gatccttacc cccagcgacg gcggctggac cgccaacggc 276541 gagaagtact acatcggcaa cgccaacgtg gcccggatgg tctccacctt cggcaagatc 276601 gccggcaccc cagaaagcca ggagtacgtc ttcttcgtcg ccgactccca gcacgagcgg 276661 tatgacctga tcaagaatgt ggtgaactcg cagaactatg tggccaatta cgcgctgcgc 276721 gattacccgg tcaccgaggc cgacatcctg catcgtggcg ccgaagcctt ccacgccgcc 276781 ctcaacacgg tcaacgtctg caagtacaac ctgggttggg gtgccatcgg aatgtgcacc 276841 cacgccctct acgagtcggt cacccacgcg gccaaccgtc acctgtacgg cactgtggtg 276901 accgacttca gccacgtgcg gcggctgctc accgacgcct acgtgcggct aattgcgatg 276961 aagctggtcg ccagccgggc cagcgactac atgcgcagcg cgtcggccgc cgaccgtcgc 277021 tacctgctct acagcccgct gaccaaggcg aaggtcacca gcgaaggcga gcgggtcatc 277081 accgccctgt gggacgtcat tgcggccaaa ggggtggaaa aggacacgtt tttcgagacc 277141 gtggctcgcg agattggcct gctgcccagg ttggaaggca ccgtgcacat caacatcggg 277201 ctactcggca aattcatgcc caactacctg ttcgctcccg actccacgct gccggtcatc 277261 ccgcgtcgcg acgacgccgc cgatgacgcg ttcctgtttg cccagggacc caccgggggc 277321 ttgggtaagg tgcgtttcca cgactggcgc gcgtcatttg acacctgcgc gcatctgcct 277381 aatgtcgcac tgctgcgcga gcaagtcgac gtgttcgccg agctgctggc cagcgccacc 277441 ccggacgcgg cacagcagaa ggatatcgac tttgccttcg gcgtgggaca actcttcgcg 277501 aacgtgccct atgcccagct cattttggag gaggcccggc tatctggtgt cgacgaggcc 277561 ttgatcgacg agatcttcgg cgtactggtt cgggacttca acacccatgc cgtcgagctg 277621 cacggcaggt ccgccacgac agccgaacag gctcggttcg ccatgcgaat ggtccgtcgg 277681 ccggtgcacg atcccgcccg ctacgaccag atctggaagg accacgtgct cgcgctcaac 277741 ggcgcatatc aaatggcacc atagtgcgcc gcgtcgagat cgacgctgcc gtgttgccca 277801 ctcgcacttt cgcgcgctgg tgtcaatctc gacgccagcc ttgaccgtga tgcagcgcac 277861 agtagaatga ccagtggtca ccaacgcaag gaggccccat gccgacggtg acgtgggcgc 277921 gtgtcgatcc ggctcgccgt gccgccgtgg tggaagccgc cgaggctgag ttcggtgcgc 277981 acggattctc ccgcggcagc ttgaacgtca tagcccggcg tgccggagtc gccaagggca 278041 gcctgttcca gtacttcgcg gacaagcgcg acctctacgc gtttattgcc gacatcgcca 278101 gccagcgagt ccgctcctac atggaggacc tgatccgcga gctggacccg aaccggccgt 278161 tcttcgaatt cctcaccgac ctgctcgatg gctgggtcgc ctacttcgcc gagcatcctc 278221 gggaacgtgc gttgcatgct gcggcgaccc tggaggtcga caccgatgcc cgcatcagcg 278281 tgcgcagcgt cctgcaccgc cactacctgg acgtgctacg gccgctggtg cgcgacgcgc 278341 acgcgcgggg cgacctgcgc gcagattccg acaccggtgc attgatgtcg ctgctgctgc 278401 tgatctttcc gcacctggcg ctggctccat acatgcgtgg tttggatccg atcctgggcc 278461 tcgacgagcc cacacctgag cagcccgcgc tggccgtgcg caggcttgtc gccgtgctgg 278521 cggcggcctt cgatgcccag caccccgcga ccaactcagc ccagacccga tcggaggaga 278581 tcacatgaca cgcacacgtt cgggctcgct cgccgcgggc ggactcaact gggcgagcct 278641 gccactgaag ctgttcgccg ggggcaacgc aaagttctgg catccggccg acatcgactt 278701 cacgcgcgac cgggcggact gggagaagtt gtcggacgac gaacgtgact acgccacccg 278761 attgtgcacc cagttcattg ccggcgagga ggcggtgacc gaggacatcc agccgttcat 278821 gtccgcgatg cgggccgagg gacggctggc cgacgagatg tatctgacgc agttcgcgtt 278881 cgaggaagcc aaacacaccc aggtgtttcg catgtggctg gatgccgtcg gaatcagcga 278941 agacttgcat cgctatctcg acgacttgcc cgcctaccgc caaatcttct acgcggagtt 279001 gccggagtgc ctcaacgcat tgtcggccga tccctcaccg gccgcccagg tccgggcgtc 279061 ggtcacctac aaccacatcg tggaaggcat gctggcgctc acgggctact acgcctggca 279121 caagatctgt gtggaacgcg caatccttcc cggcatgcag gagctggtcc ggcgcatcgg 279181 tgacgacgag cgacgccaca tggcttgggg caccttcacc tgtcggcgcc acgtcgccgc 279241 cgacgacgcc aattggacgg tgttcgaaac acggatgaac gagctcatcc cgctggcgct 279301 gcgcctcatc gaggagggct ttgcgctgta cggcgaccag cccccattcg acctgtccaa 279361 ggacgatttc ctgcaatact cgaccgacaa gggaatgcgc cggttcggca ccatcagcaa 279421 cgcccgcggc cggccggtcg ccgaaatcga cgtcgactac tcgcccgcgc agctggagga 279481 caccttcgcc gacgaggacc ggcgcaccct ggcagcggcc tcggcctagg cctggcgagc 279541 agacgcaaaa tcgcccaatt tcgtgccgaa ttgggcgatt ttgcgtctgc tcgccagggg 279601 aacgctaggc gatccagacg gtcttgatgt tgcagaactc gcgtatcccg tgtgcggaca 279661 gttcccggcc atagcccgag cgcttgaccc cgccgaacgg caattcggga taggacaccg 279721 tcatgccgtt gataaaaacc tggcccgcca cgatgtcgtc gatgaagcgt cgttgctcgg 279781 tctcgtcgcg ggtccaggcg ttggatccca gcccgaaggt ggtggcgttg gcgatctcga 279841 cggcctcgtc gatgttcgcc gcgcggaaca ccgaggcgac cggaccgaag acctcctcgg 279901 tgtagagagc catgtccttg gagatgtcgg tgatcacggt cggcgggtag aaccagcccg 279961 gccggtcgag acgctttccg ccgcaccgga tcaccgcgcc cgccgcggca gcatcctcga 280021 cttgcttggc aacctcgttg cggccctgct cggtggccag cgggcccacg tcggtgtccg 280081 ggtcggtcgg gtcgccgacc cgtaacgccg ccatccgcgc gacgaacttg tcgacgaaat 280141 cgtcgtaaat gtcggcgtgg acgatgaacc gcttggcggc gatgcaggat tggccgttgt 280201 tctgcacccg gccggtgacg gcggtgctga ccgcggcgtc cagatcggcc gacggcataa 280261 cgatgaacgg gtcgctgccg ccgagctcga gcacggtcgg cttgatctcg ttaccggcga 280321 tagcgcccac cgattggccg gccggctcgc ttccggtcag cgtggccgcc gcgacccggg 280381 gatcacgcag gatggcttcg acggctcccg agctaacaag caacgtctgg aagcagccgt 280441 ccgggaagcc gcctcgggcg atgacgtcgg ccaggtacag cgcgcattgc ggcacgttcg 280501 acgcgtgctt gagcaggccg acgtttccgg ccatcagtgc cggtgcggcg aaccgaaccg 280561 cttgccacag ggggaagttc catggcatca ccgccaggat caccccgagc ggctggtatc 280621 ggccgtaggc cgccgacgcc ccgaccttgg ccgcatcggc gggttcgtcg gccagcaacg 280681 cctcggcgtt ttcggcgtag tagcgaaaac ccttggcgca cttcagtgcc tcggctttgg 280741 ccgcggccag cgtcttgccc atctcgagcg tcatcatcgc ggcggcctgg tcggcctcgg 280801 cttccagcaa gtcggcggtg gcattggccc accgggcgcg ctgggcgaag ctggtctggc 280861 ggtagtcggc gaaccgccgg tgggcccggg ctattgccgc gtcgacttcg tcatcggtcg 280921 ccgcagtgaa tgtcttgact gtttcgccgg tagccgggtt gatggtggcg atgggcacgc 280981 tgacatcctt tgctgggtgg gtttgcacaa atcgtccggt gtccagcctg ccactaacgt 281041 ggccagcgct cccgagcagg aggtgtcggg gcctcctatc ggctggggtg ggctctatca 281101 cgggcaggac cagcgtggcg gaacatgtca ccgatcgcat gttcgtcggg agctaatcgg 281161 cccgttcaat cggccggtgg cgaggcgact ttgcgtagcg acatcggcgg gacgtagcgc 281221 ccgatcagcg tgcggtgcca ccaggctcgg tcccgtcgca gctcggccac ggtggtgaag 281281 cggtactgat acagctgcgc gcgcacatac cgaggcggag attgcgggaa aggattgtgg 281341 cgcaacagct tcagcgtcgc aggatcattg cgcagcaacc ggtttaggaa tggcgtcatc 281401 cacggtagtg cgtagcccgg tgagatggcg gcgaaccaca tgagccagtc cagccgcaga 281461 tggtaggggg cccattgccg cggcagccgg cgcggatcac cgggcttgcc cttgaattcg 281521 tatgctttcc agacggtttg ttcggtaatc ggtgactcgt cggtcccttc gattaccact 281581 tcccggcggg tgcggcagat gctgccgaac gccccgtagg tgttgaccaa atgaaagggg 281641 ttgaacgaca tgttcattcg ttgatgagag gacagcagat tgcgtgccgg ccagtagctc 281701 agcaacagca ccgccgcggt gaatacgacc acgaggccgg cgaaccactg cggcggtgcc 281761 gacagcgccg gctgggccgg catcggcagc agcgccgcgg ccgaagatgt gtcgatcgcg 281821 ctgcacgcca agaggatggt cagccaattg agccaggaga aatttcccga tgccaccagc 281881 catagctggg taaccacgat gatcgcggcg gcgatgctgg ctgcgggctg tggtgtgaac 281941 aacccaaacg gcaccacgag ctgggcgaaa tggttgcccg ccacctcaat ccggtgcaat 282001 ggcttaggca ggtgatggaa gaaccagctc aacgggcccg gcatgggctg tgtttcgtgg 282061 tggtagtaca ggcacgtcag actgcgccag cacgagtcgc cgcgcatctt gatcaatccg 282121 gcaccgaatt cgacccggaa cagcagccag cgcgccaaca acaacgtcag aatcggcggg 282181 gcggtgcgct cgtttccgag gaagatcatc aggaagccgg tctccagcag cagcgactcc 282241 caaccgaatg agtaccacgc ctgcccgacg ttgacgatgg acaggtagag cacccacagc 282301 gtcagccaga tcagcatcgt ggcccacaac ggcacgaagg aggccgcacc ggcgacgacg 282361 gctgccgaca acacggcacc caaccagcag accccggcga acacccgatc ggaatagcga 282421 aagtgaaaga tgctcggtgt tcgccagaag gactgtccag ccagataccg cggcaccggc 282481 agcatgccgt gctctccgat gaggggccgg aactgctgtg cggccgcgac gaatgcgatc 282541 agataaataa tcgccgtgcc gcgctccagc gccagtctgc ccagccaata ttcgggcgct 282601 gaaaaccatc ccatggccgt tactccttgg acacggcgtt cacaccaact attgcatgcg 282661 gtcttgacca cgagactctg atgtggcgac caccgatgcc gccaccacgg aaaccgaaat 282721 cagtgccagc agttgcacac tggcccagtt cccggcgtat ccgtcgaccg accgccacgg 282781 atgccgggac agcgcagcgc cggccaggat cagcccgccc gcagccagtc cgacggtgac 282841 ccggtcacgt agtcgctccc ggcggcgcag tgcataccgc acaccgaggg ccgtgcccat 282901 caccatgacc ccggcgatgc tggcgatcac cgccccggcc gccagcaccc cggccgccgc 282961 ccaggctccg ggcctccagg gtggtgtggg ccggtcggcg agctgccgtc gcccggtccg 283021 ccagaacgcc agcagagcta gcaggggcag cagggccagc cctatcgcca ggctcgcccg 283081 atacagcgag ttcggtgcga atgtcagcgt gatggtgccg gggttcccgg cgggcaccac 283141 ccaggcctgc tgccacccgt tgacggcgat cggtgtcagc cgggccccgg tgctcgtgcg 283201 ggccacccag cccgagttga tgctttcggg taccaccagc acccgggaag tggccgactc 283261 gggaacccga acttcgcggt gggtgggacc ccacgcaccc gtttcagcag aagtaactgt 283321 cgcgcttgac aatccagcac cgggagttga caactgggca ccgtcgacca cgaacgcggc 283381 gccggggctg atcagcaatt cctgctgtcc cgccggcagc gctatcggct cgcgttcaca 283441 cgggagcgcg gcgaccggtt caccgtccag caaggcgccc accgtggttc ggatcgaggt 283501 gtgcacgaac cggcccgcga cggcgacgac cgggccgtga tcgcaatcca cggtgagcgc 283561 acgcgcgcgg ttgcgggcgg cgtcggccgg cgcaatcggg gcgccgccgg cgctaagcac 283621 caccacttcg gccagccccg gcggcttgag ctggtcgaag cccagcgcgt tgcgatcgat 283681 gacatcgtcc cagtccagca ggctgaccga caccgtgtcg gtcacccggg gatgcagcca 283741 tagcgtcgtt agctcgccga cctgcagttg tcggacctgg gggccgtcgc ccaggttgat 283801 ggccaccacc gtcggatggg ccggcaacat cgaccggctg gcggccagcc gcagcccggt 283861 caccacggtg ggccgcggca gggtcagcgt cagcgtcggc ggggttttgt gttgcaccac 283921 ccgctgcggc gcggtccagg cggtggccgg atcgccgtcg gcggccgcgt acgccgagcc 283981 gaggatgtcg acaaggtcgg aatcaccgct ggcccgggtg gtggaaggcg cggcgatcaa 284041 gtcggccagc ttcgggccct gccgtggtcg cacccacacc atcggggtca ccgacaccgg 284101 gcggggtacg gtcagtgtgc ggctgagatt ggccggttcc tcgggtgcca gggccatcga 284161 ggcggcgcag cgcacgccgt cgggtcccgg ggcgcagccc ggtctgccca gcagttcgga 284221 tcccaggtcc cagcccgcga tcgccgaacc cggcggcggc ccgggcacca gcacggtgtg 284281 tcgcagctga accggatggg cgaaaccgga cgcatcgtat tgggtgatgg ccagatcggt 284341 gatgccgaac tgcacaccgg ccgacccgtc gtcggtggcg gccgcggtaa accgcaccca 284401 gggggtttcg ccgtagggca gtgcggcggt gagcggtttg cccgcctcat cgaaccgcag 284461 ggtggtgctg ccgttgacgg tctcgatcag gatgcgtcgg acctgggcgc cgaccgcggt 284521 cgcgctgggt gtcagggtga cgacggcatt ggtcaccgga cggtcgaaat ccacctgcag 284581 ccactgccca acggcggcct gcagcgcgtt ggacacccaa gcggtcgccg ggtcaccgtc 284641 gacggcggcc gcgggtgcgc tcgccggggc gacgtcgggc atggcggtgg catccgccga 284701 ggagctcgac acggtgatcc ggccgccggt ccatccaccg acgaccggct ccgcgcccgg 284761 caccgggtag tcaggcaccc ggttgtaggt gtgccgggcg tcgccgggtg cccggatcgc 284821 cgacgagtgg tggtccaccc ggccgtaatc ggtctcgcgg gccaccgggg tgtcggtgac 284881 ggcgacctgg ggcaccggca agccggcagc tcgagcgtcc gcggtcatca gcaccggacc 284941 cagcgggggc tggccctgca gccggcgtcg ttcgtccaga cgcagcagga cctcgggtcc 285001 gccgtcgacg cgggcgagct ggtcggtcgc ggcgaagtag ggcgcaccgg ggttggcggg 285061 cgcgctcacc cggtagatct caatcgcggg atatcggggt cgcaggccgc tgtcgttgac 285121 gaaacccgcc agcggatcag gacccaccgg cgcgccgaac tccgccagct tcgctagccc 285181 gggcgaccct gcgatgctac ggtgcagcag aatcggtcgt gccgagcgcg acgtctcggg 285241 atccagatcg ttgcgtacca gcacatagga aatgccttgg cgggcaaggg tatcggccag 285301 ccccgccgac ggtcgtccgg cggcgaacag gcgttgcacg gagtccagcg ctcgaatggt 285361 ctgcggcggg gtcagcggaa tggagtcgcg cacgccccac gggccgtcgc cgagcacctg 285421 cagcggctcg tcgtggctgg tgccccacac ctgggtggcg aacggggcgc ccgggaccac 285481 cagcacccgc ccgggagtgg gcgtcgcggc atggtgtgtg cgcagccagt cggcggcctc 285541 ctgccagtac tggggaagcg caccgaacgt gccgggcggg gcgacccggc cggtccacgc 285601 cagcgaggtg ctgaccatca gcgcggtcag ggctaccacc gccaccgcta ctcgcttgtc 285661 ccgctcgggg tgcgcgaacg cgcgcagcca cgccggcctc ggcgcgctgc ctggcagcgg 285721 aactcggctc agcagctgcg ccaagcccag caccaggggc agccggatca caggccccac 285781 cttgtgtacg ttgcgcaggg gggtgccggc ggcgtccagg aacgcctgca ccgggtgggc 285841 gaccggcgaa gccagcccgc cgcggtggcc aacggccagc agcaccaccc cgaccaacag 285901 catcgtcacc agccggccgc gcgccggcat cgccgggcta gtcagtccgg ccagcccggc 285961 cgctgcgacc aggcaggtgc ccaggatggc cgccgatccg gtgaccaacg gcgcgcccgc 286021 ggtcgcgttc ggcgccacga acggcgtcca gctgtcggtg ccgcgcagca cctccaccag 286081 cgaggaccat tgcgtggtca cgccggaaga ttcgatgaag tccagaaacg gcggactgac 286141 cccgtgcagc tgcgtcagcg ccattaccca ccacagtgtc gccagggcca tcgccaacag 286201 ccaccacgcg gtgtagcgcc accacaaccg attcggccgg tgacaggccc accagatcac 286261 cgccggcagg caaccggcca gcgtcgcgat ggcgttgacc gcgcccatca gcgccaccgc 286321 cagcccggct tgggcggcca gcgcgcgcac cgagcggcca gaagtccccc gcagcgccag 286381 gatcgtgggc agcagcaccc acggcgccag catcatcggc aaggtttccg acgagatcga 286441 cccgagtgtg gtcagcaccc gtggtgacag cgcgaacgcc acggcgccga ccacccgcga 286501 ggacgggccg ccgacgccca gcgcctcggc tacccgcagc aggccccaga agccgaccgt 286561 gagcaacacc gcccaccaca gccgctgagt gacccagccg ggcactccca gcaggtgacc 286621 gatcacgaag aaggtgccgt gcggaaacag atacccgtag gcctggttct gcgcctgccc 286681 gaacggcagg tcgctgttcc acaggttggt cgcacgcgcc aggaagcgca gcgggttggc 286741 ggtgaggtcc agcttggtgt cgggggagac ttgtccgggg gattgggcga acgtcagcgc 286801 caacgctacc gcgccgacca ccggcagcca tttgcgagac aacggcgcca cctgcgaccc 286861 ggaggccgcc tcagcctgcg gcgcccgggt cgcggggcta gctacggtta ccgtactcga 286921 cccggttgag cactgatgac gacggatcgc ccccggggag tggcggtttc ttgtcctgct 286981 gcaccatcag ggtcactccg aagatcgcgg ccgcgcccag caacagacca accaccacgc 287041 ttgcggcggc gggcgcgacg atccggttca tcggtggctc ctcgacggct gtgggtgcgg 287101 cttgagaggc tagaggcaac ttagcagaag cgtgggcctg gccccccaac ccggagcgta 287161 tgcgccaccg tgacagcatg tccggatggc ttttccacgc acactggcga tactcgctgc 287221 ggcagcagcg ttggtggtgg cctgcagcca tggcggcaca cccaccggat cgtcgacgac 287281 ctccggcgcg tcgcccgcaa ctccggtagc cgttcccgtg ccccggagct gcgccgagcc 287341 ggcggggatc ccggcgctgc tgtccccccg tgacaagctg gcccagctgc tggtggtcgg 287401 cgtgcgagat gctgcggacg cccaagccgt ggtcaccaac taccacgtcg gcggcatcct 287461 catcggcagc gacaccgacc tgacgatttt tgacggcgcg ctggccgaga tcgttgccgg 287521 cgggggtccg ctgccgctgg cggtgagtgt cgacgaggaa ggcgggcggg tgtcccggtt 287581 gaggtcgctg atcggcggta cggggccgtc ggcccgcgaa ctggcacaaa cccgaaccgt 287641 ccagcaggtg cgcgacttgg ctcgagaccg cggccggcag atgagaaagc tgggtatcac 287701 catcgacttc gccccggtgg tcgacgtcac cgacgccccg gatgacacgg tgatcgggga 287761 ccggtcgttc ggctcggatc cggctacggt caccgcgtat gccggggcgt acgcgcaggg 287821 tctgcgcgat gccggggtgc tgccggtgct caagcatttc cccggtcacg ggcgtggctc 287881 gggtgattcg cacaacgggg gtgtcacgac accaccgctt gatgacctgg tgggcgatga 287941 cctggtgccc taccgaacgc tggtgaccca ggcgccggtc ggtgtgatgg tgggtcatct 288001 gcaggttcct gggttgaccg gctccgagcc ggccagtctg agcaaggccg cggtgaacct 288061 gctgcgcacc ggcacgggat acggcgcacc gccgttcgat ggtccagtgt tcagcgacga 288121 cctctctggt atggccgcga tctcagaccg gtttggcgtc agcgaggcgg tgttgcgcac 288181 cttgcaagcc ggtgccgata tcgcactgtg ggttaccacc aaagaggtgc ccgcggtgct 288241 ggaccgcctg gaacaggcgc tgcgcgccgg tgaattgccg atgtcggcgg tcgaccggtc 288301 ggtggtgcgg gtggcgacca tgaaggggcc caacccgggg tgtggccgtt agcgatgtgc 288361 ggctggcgcc ccactgctta ccgtagggtt agatagacgg gctacagggg cccaaaaggg 288421 gctggcgatg gcaggtggta ccaagcgact accgcgtgct gtccgagagc agcagatgct 288481 cgatgccgcc gtgcagatgt tctcggttaa cggctaccac gagacctcga tggacgcgat 288541 cgctgccgag gcgcagatct ccaagccgat gctgtacctg tactacggct ccaaggaaga 288601 cctgttcggc gcctgcctga accgtgagat gagccggttc atcgacgcgt tgcgttccag 288661 catcaacttc gaccagagcc cgaaagactt gctgcgcaac accatcgtgt cgttcctacg 288721 ctatatcgat gccaaccggg cgtcgtggat cgtgatgtac acccaggcca ccagctccca 288781 agcgttcgcg cacacggtgc gtgaggggcg cgaacagatc gtccaactgg tggccgagtt 288841 ggtgcgggcc ggcacccgcg gcccgcttac ggacgccgaa atcgagatga tggccgtcgc 288901 gctggtgggc gccggcgagg cagtggccac ccggctcggt atcggtgaca ccgacgttga 288961 cgaggcggcc gagatgatga tcaacctgtt ctggctcggc ctcaagggcg cgccggtgga 289021 tcggctcgag accgggcact gacctgcgcg gtatcggcca ctgagatgtg ggtgtatttt 289081 agatgcagat gtaaattcga tgtatgattc gaacgcaagt ccagctccca gatgagcttt 289141 accgggacgc caagcgggtc gcgcacgagc acgaaatgac ccttgccgag gtcgttcgtc 289201 gcgggctgga gcacatggtg cggatctatc cgaggcgcga tgcggcgtcc gacacctggc 289261 agccgcccac gccgcgtcga ctcggtccgt ttcgtgcgtc cgaagaaacg tggcgcgagc 289321 tcgccaacga ggcgtgagta gcccgtgctc tcgatcgata cgaatatcct gctgtacgcg 289381 cagaaccggg attgccccga gcatgacgcc gccgccgcct tcctcgtcga gtgcgctggt 289441 cgagccgacg tcgcagtctg cgaactcgtg cttatggagc tgtatcaatt gctgcggaat 289501 cctacggtgg tgacgcgacc gctcgagggc cccgaggcgg cggaagtctg tcagacgttc 289561 cgtcgcaacc ggcggtgggc gctcctcgag aacgctccgg tcatgaacga ggtgtgggtg 289621 ttggcggcca cgcctagaat tgctcgccgg cgcctattcg atgcccggct ggcactgacc 289681 ttgcgccatc atggtgtcga cgaattcgcc actcgaaaca tcaacggctt caccgacttc 289741 ggcttctcac gcgtgtggga cccgataacg tcggatggct gaccacgccg ggccgatccg 289801 cgtggccccg gctatagacc ccgcacggta gcggtcaggt gggggtatcc cttggccata 289861 ttgcgcagcg tgagatccca gccgccatcg ccttcggcga cgtagagtcc cgcggtggcc 289921 ggcagcagca ccggcttggc gaaccgaacc gaatagcgca ccgcgtccgg aaaacgggct 289981 tcgatattcg ccaataccgc cgcggcagtg aacatcccgt gcgcgatgac ggtggggaag 290041 ccgaacagtt tcgccgcgat cgggttggtg tggatcgggt tgtgatcgcc gccgacggcg 290101 gcatagcggc ggatcttcgc cggggtgatc cgcaggaccg cggcgggcgg gggtagcttg 290161 ggcttttttt gcggcggcgg tttgggttcg ccggacaagc tggtgcgttg ttgatgcagg 290221 aacgtcgtca cctggtgcca ggcgacatcg ttgccgacgc tgacgttggt caccagatcg 290281 accagcaggc ccctgcggtg ttcgcgcaga ttctccgcgc gcacccgcac gcccaccgcg 290341 tcggtgaccg cgatcggccg gtattgcgtg atgtggttct cggtgtgtat cgctcccatt 290401 gcggcgaacg ggaagtcgaa gccggtcacc aacgacatca ccgatggaaa agtcaacgcg 290461 aacggatagg tcaacggcac ctggttgccg tagcgcagac cggtgaccgc cgcgtaggcc 290521 gcgacgttgg cggggtcgat cggcagctcc tcgacggtca ccgtccggtt gggcagctgg 290581 tctgtccggg gcaccacggg tagcgccccg gccgccgcgc gcagcaggtt cttcaggccg 290641 ctgggttgag tcactactgt cccctcacgc gccgatcatg gcctggccgc agacacgaat 290701 gacgttgccg gtcaccgcgt ttgacgccgg gctggcgaag taggcgatgg cctcggcgac 290761 gtcgacgggc tgcccgccct gcagcagcga gttcagccgg cggcctacct cacgggtggc 290821 cagcgggatg gcggccgtca tctgggtttc gatgaatccc ggtgccacgg cgttgatcgt 290881 gatgcctttc gcggccaggc cgggtgccag cgcctgggtg atgccgatca tcccggcctt 290941 ggtggtggcg tagttggtct ggccgcggtt gccggcgatg ccggcgatcg acgacagccc 291001 gatcacccga ccaccctctc cgatgctgcc gttgcccacc agaccctcgg tgagccgcaa 291061 cggggcaagc agattgacag ccaggacggc gtcccaacgc gcatcgtcca tgttggccag 291121 cagcttgtca cgggtgatgc cggcgttgtt gaccaggatg tcggccttgc caccgtggtg 291181 gtcgcgcagg tgctcgctga tcttgtcgac ggcatcgtcg gcggtgacgt cgagccacag 291241 cgcggtgccg cccaccttgc tggcggtttc ggccaggttc tcggcggcgg actccacatc 291301 gatggcgacc acgtgggcgc cgtcgcgagc gaacacctcg gcgatggttg cgccgatgcc 291361 gcgggccgcg ccggtcacaa tggcgacctt gccgtccagc ggcttctccc agtcggccgg 291421 cggtgtggaa tcgtccgccc cgacagagaa gacttggccg tcgacgtagg ccgacttggc 291481 cgacagcagg aatcgcatgg tcgactcgag gccggtagct gcgggcttgg cgtccggcga 291541 caggtagacc aacgccgttg tcgcaccgcg gcgcagttcc ttgcccagcg agcgggtgaa 291601 gccctccagc gcgcgctgcg cgatccgctc gttcgtgctg gcggccgctt cgggtgtgcc 291661 gccaacaacc accacgcgcc cgcaacggcc gagattgcgc agtaccggag taaagaactc 291721 gtgcagcccc ttgagcccgg ccggctctgt gatgccggtg gcgtcgaaga ccagcccgcc 291781 gaacgagtcc gcccagcgcc cgcccaggtt gtttcctacc aggtcgtagt ccttttcgag 291841 tgccgcgcgc agtggttcga cgaccctgcc ggccccgccg atcagcagcg acccggtcag 291901 tggcggttcg cctgctcgat agcggcgaag cgtctcgggt tgcggaacac ccaattgcct 291961 ggccaaaaac gatcctggac cggagttgac aacctgcgag aacagatcgg acgaacgctt 292021 gggagccact tcagctgcct tccgtatcgt gtgggggtcg ggcgcgccaa tacacgtaac 292081 cgtatcgagg actaacttac ttcagagtaa gaacagtggg tagtatggcc ctcaacggcc 292141 gatcccccga actgatcaac ggagaaaaca gtggcccctg ctgctaagaa cacttcacag 292201 accaggcggc gagtcgccgt actgggcggc aaccgcatcc cgttcgccag atcggacggt 292261 gcctacgcgg atgcgtccaa ccaggacatg ttcaccgcgg cgctgagcgg cttggtggac 292321 cgattcggac tcgccggcga gcggctggac atggtggtgg gcggtgcggt gctcaaacac 292381 agccgcgact tcaatctaat gcgcgaatgc gtgctgggct ccgaactctc gccgtacacg 292441 ccggcgttcg acctgcagca ggcctgcggg acgggcctgc aggccgcgat cgcggccgcc 292501 gacggcattg ccgccgggcg gtatgaggtg gccgccgctg gcggggtgga caccacctcg 292561 gacccgccga tcggcctggg cgacgacctg cgccgcaccc tgctcaagct gcgccgatct 292621 aggtccaacg tgcaacgcct caagctggtg ggcacgctgc cggccagcct gggcgtggag 292681 atccccgcca acagcgagcc gcgcaccggg ctgtcgatgg gcgagcacgc cgccgtcacc 292741 gccaagcaga tgggcatcaa acgcgtagac caggacgagc tggccgccgc cagccatcgc 292801 aatatggccg acgcctacga ccggggtttc ttcgacgacc tggtcagtcc gtttttaggg 292861 ctgtaccgag acgacaatct gcggcctaac tccagcgtcg agaaactggc cacgctgcgt 292921 ccggtcttcg gagtgaaggc cggtgacgcg acgatgacgg ccggcaattc gactccgctg 292981 accgacggcg cctcggtggc attgctggcc agcgaacagt gggcggaggc acactcgctg 293041 gctccgctgg cctatctcgt ggatgccgag accgccgcgg tcgactatgt caacggcaac 293101 gacggcctgt tgatggcgcc gacctacgcg gtaccccggc tgctggcccg taacgggttg 293161 agcctgcagg acttcgactt ctacgaaatc cacgaggcgt ttgcctccgt ggtgctcgcg 293221 catctggcgg cgtgggagtc cgaggagtac tgcaagcggc ggctgggcct ggacgccgcg 293281 ctggggtcga tcgatcggtc caagctcaac gtcaacgggt cgtcgttggc cgccgggcac 293341 cccttcgcgg cgaccggtgg gcggattttg gcgcagaccg ccaagcagct cgccgagaag 293401 aaggcggcga aaaaaggcgg cggaccgctg cgcgggctga tttcgatctg cgcggccggc 293461 ggccaaggtg tggccgcgat tttggaggcc tgacgctgac ggctcggtaa gtgcctcgcg 293521 ggaagtcccg agtggccggt gggccgccca aagaaatgtg ttgcgggtgg tttgcgccct 293581 gagcagatgg gtacccgatc actcggatag ccccgtgttg ttgtctgacc cccgaccccg 293641 acggcaatgc ggggcaatcc cctggaaagg gccgccgctg gtgggagggg acccagcggc 293701 ggtctttttg ggcttgcccc atcgttcgtt gactctgcgt ccaccacgca aaagtgcgag 293761 taacccgtcc ggtggacgca gagtcaacag ataaggatca gaacgcggcc tcgtcgagtt 293821 ccatgatgtc gttgtccagc gtctcgatca cctcgcgggt gctggtcaac agcggcaaga 293881 agttcttcgc gaagaacgac gccaccgcga ctttgccttc gtagaaggac cgctcgtcgc 293941 cggtggcacc cgcgtcgagt gccgccaccg ccaccgcggc ctgacgctgc agcaaccagc 294001 cgatgatgag gtcaccgacg ctcatcaaga agcgcaccga acccaagccc accttgtaga 294061 ggctggtgac gtcctgctgc gcggccatca ggtagccggt cagtgcggcc gccatgccct 294121 ggacgtcggt gagcgccttg gccagcagcg cgcgttcggt cttcagccgg ccgttgccag 294181 caccgctgtc gacgaactcc tggatctggc ctgacacgtg cgccaacgcc acgcccttgt 294241 cacggacgat tttgcggaag aagaagtctt gtgcctggat ggcggtggtg ccttcgtaca 294301 gggagtcgat cttggcgtcc cggatgtact gctcgatcgg atagtcctgc aagaagccgg 294361 atccacccag ggtttgcagg ctttcagtga gcttggcgta agcctgttcg gagcccacac 294421 ccttgactac cggcaacatc aggtcgttga ccttgacggc caacttggcg tccacaccgt 294481 gcaccacctc ggcgacagcc gcgtcctgga aagtggcggt gtagaggtag agcgcacgca 294541 ggccctcggc gtaagccttc tgggtcatca gcgagcggcg cacgtcgggg tgatgtgtga 294601 tcgtcacccg gggcgcggtc ttgtcggtca tctgggtcag gtcggcaccc tgcacgcggg 294661 acttggcgta ctgaagcgcg ttgaggtagc cggtggacag cgtcgcgatg gccttcgtgc 294721 cgaccatcat gcgggcctgc tcaatgacct cgaacatctg cgcgatgccg ttgtgtacct 294781 cgccgaccag ccagcccttg gcggggacgc cgtgttggcc gaacgccagt tcacaggtcg 294841 ccgagacctt taggcccatc ttgtgttcga cgttggtgac gaacacgcca ttgcgctcgc 294901 cgggttcgcc ggtttcgacg tcgaacagga acttgggcac gaagtacagc gacaggccct 294961 tggtgccggg accggcgccc tccgggcgag ccagcaccag gtggaagatg ttctcgaaca 295021 ggtcgccgga gtcacccgag gtaatgaacc gcttgacgcc gtcgatgtgc caggacccgt 295081 cggcctgttg gacagctttg gttcgggcag cgcccacatc ggagccggca tccggctcgg 295141 tgagcaccat ggtcgatccc cagccgcgtt cggcggctag gaccgcccac ttcttctgct 295201 cctcggtgcc gaggtggtag aggatctggg cgaagcccgc gccgccggcg tacatccata 295261 ccgccggatt ggcgcccaag atgtgctcat gcagcgccca gaccactgcc ttgggcatcg 295321 gcatgccccc gagtgcctcg tcgatgccga ccttgtccca accggcttcc agcatcgcgt 295381 tgactgactt tttgaacgat tccggcagca tcaccgagtg ggttttcggg tcgaaaacgg 295441 gcgggttgcg gtccccttcg acgaacgact cggccaccgg cccctcggcc agccggctga 295501 cctcggccag catgtcgcgg gcggtgtcga cgtcgacgtc gctgaattcg ccatggccca 295561 aagctttgtc gacgcccagc acttcgaaca ggttaaaaac ctggtcacgg acgttgctcc 295621 ggtagtggct cactgccgat cctcctcgtt gagagtgcca cctcagggtt gggtagggtt 295681 gggtactcga aaccaagtta cccaccagta acaccgtcaa aatatatccg ttgcataggt 295741 caatgcaagt tgatgtgagc tacattgcac caactaacta accaaccggt tgggttagcg 295801 gtgatcctgg ccgtgtcggt cctctcacct gcggcgatag cgatcaaatg aagaatatgc 295861 ggagtctagg gcggcagcgc ctggcagcgt agatcatcgg ctcacgcgga tgcggcctct 295921 tggtacggac atgcgcgcgg atgtccggcg agtagggtcg gatgcgaaaa ctacgtcctc 295981 ggctctaggg gcgaatgaag ttcggtgaac tcaacgaaca acctgacgcc gtcctcactg 296041 cgggaggcct tcggccattt cccgaccggg gtggtggcca tcgctgcgga ggtcgacgga 296101 gtgcggcaag gcttggcagc cagtaccttt gtcccggtct cgctggaacc gccgctggtg 296161 tcgttctgtg tgcagaacac ctcgacgaca tggccgaaac tcaccggcgt gccgatgctg 296221 ggcatcagcg tgctcggcga ggcccatgac gccgcagtgc gcacactggc cgcaaaaact 296281 ggggacaggt tcgccggttt ggagacggta tccaacgacg ccggcgccgt cttcatcaag 296341 ggcaccagcg tgtggctcga gagcgcgatc gagcagctgg tcccggcggg agatcacacc 296401 atcgtggtct tacgggtcaa ccaggtcaag gtggatccca acgtagcgcc cattgtgttc 296461 catcgcagcg tgctccgccg actcggcgtc taaacgtcta tacggacgcc cacttggtct 296521 gtccggacaa catagcggtc agcggcccat tctggttgcg ataaatgatg gtagatcacg 296581 tcattttgct tccagtagtc gtgcccatgt ttgagaggca caactattgg tcgctttcat 296641 tcgttgcgcg cagaccggtc tttgtatgac gatgatggga agttctatct gccgccaaaa 296701 gcagaatggc aggacgcagg atgaagcgat gagccgaccc gccggaaccg gtttccggga 296761 acgggtggga tgcatgccca cttgaggtct cgcggcaggc ggtggagcgt ggcaaaaacg 296821 tcgcatcggg tgagcagcgc cgatggcatg agtaagcgta ttttgcgttt gataatcgcg 296881 cagagcggct tctatagcgc cgcacttcag ctcgggaatg tctcgatcgt tctaccgttt 296941 gtggtagccg agctcgacgc cgaattgtgg atagcggctc ttatttttcc tgcattcacg 297001 gccggtgggg cgatcgggaa tgtggtcgcg ccgccggcgg tggccgccgt tccacgccgt 297061 caccgattgt tcattattgt gtcctgtttg gccgtcctgg ctggcgtcaa tgccttgtgc 297121 gcaaccatcg gcaaaggaag cgtcgctgga atcctattgg tggtcaatgt gacgctgatc 297181 ggggtcgttt cggcgatctc cttcgtcgcc ttcgcggatc tggtggcggc tatgccatca 297241 ggaaccgccc gagcccgcat tcttcttacc gaggtcggag taggggcggc tttgacggcc 297301 gtggtggcgg cgacgctgtc attcgtaccc gaccaacacc cattaagcag gaacattcac 297361 ctactgtgga cggcagccgt ggcaatggct atctcggcgg ccatatgccg ggcattgcct 297421 caccggatcg tccccagggt ccatgcggcg cccggtctgc acaaactcgt gtacgtcggt 297481 tggacggcta tccgaaccaa tggttggtat cgtcggtacc tgcttgtgca ggtactcttt 297541 ggctcggtcg tgctcgggtc ctcgttccac agcattcgcg tcgccgccgt acccggggac 297601 cagcccgacg aggtcgttgc cgtcgtcctt ttcgtctgcg tcggactctt gggtgggatc 297661 gcgttgtgga accgcgtccg ggagagattt ggcctggtcg gtttgtttgt cggcagtgca 297721 ctcgttagca tcgccgcggc agtgctatcc atcgcattcg atttggccgg agcgtggccc 297781 aacgtcgtcg ccatcggtct ggtgattgca ctggtatcca tcgccaatca aagcgtattc 297841 accgcaggcc aactgtggat tgcccgtgac gccgaacccg gcctgcgaac atccctcatc 297901 tccttcggcc agctcgtcat caacgcaggc ttagtcggta tgggtttggc gctggggttg 297961 attgcccagg atcacgatgc ggtgtggccg gtgatgatcg ttctgctgtt gaacctgacg 298021 gctgcctact cagcgacgcg gttcgctcca gccaagtccg tggatgttcg tggcttgcct 298081 caggtttcgc gcacttcccg acctaaaacc gggggttagc ggcgaaacag cttgctgccc 298141 agccatacca ccggatcata cttgcggtcg gcgacccgtt ctttcatcgg gatcagggca 298201 ttgtcggtga tcttgatgtt ttctgggcac acttcggtgc agcacttggt gatattgcag 298261 tagcccaggc cgtgctcttc ctgtgcttgg ctgcgtcggt cccgggtgtc cagcggatgc 298321 atttcgagtt cggcgattcg catcaggaag cgggggccgg cgaacgcatc cttgttttcc 298381 tcgtgatcgc gaactacgtg gcagacgttt tggcacagga agcattcaat gcacttgcgg 298441 aactcctgcg agcgtgcaac gtcgacttgc gccattcggt actcgctggg ctgtagctcc 298501 ttgggtggcg cgaaagacgg gatctcgcgc gctttttggt agttgaacga gacgtcggta 298561 acaagatcgc gaatcaccgg aaacgtccgc attggggtga ccgtgacgat ctcgtcctcg 298621 tcgaatgtcg acatccgcgt catgcacatc agtcgcggtt tgccgttgat ctcggccgag 298681 caggatccgc acttgccagc tttgcaattc cagcgcactg cgagatccgg tgtctgcgtc 298741 tgttgtagac ggaggatgac gtccagcacg acctcgccct cgttgacctc cacggtgaat 298801 tcgcggagtt cgccacagct ttcgtctccg cgccacaccc gcatactcgc gctgtacgtc 298861 atttagcctc tccgtcctgg atgctcggcc agctcttcgt cggtgtagta tttctccaac 298921 tccgagatct cgaagagctc cagcaagtcg ggtcgcatgg gcgtttgcag ctgctgggtg 298981 acgttgatgt ggcagttgga gtcgccggac ccgctgccac cggtgcccat ggtttcggtg 299041 gcccggcata ccagcaagat cctgcgccag ttggggtcca taccgggatg gtcgtctcgg 299101 gtgtggccgc ctcggctttc ggtgcgctgt agcgcagctc tggccacgca ctcgctgacc 299161 agcaacatgt tgcgcaggtc gatggacagg ttccagcccg gattgtattg acggtgacct 299221 tcgacgagta cgttgtggta gcgcgaccac agctcggcca aaagagtcag cgccctggat 299281 atttcgtcgg cgttgcggat gataccgacc agatcgttca tcacgtactg caagtccata 299341 tgcagcgcgt acggattctc cggcgccgag ccgtctttcg gtccttcgaa ggggctcagc 299401 gcctgctggg ccgccgcatc gatagcctcc gctgaaaccg ctggccggct gctcagtgcc 299461 cgtacgtaat ccgctgcgcc caggccggcc cgccggccga ataccagcag atcggacagc 299521 gaattgccgc ccagccggtt ggagccgtgc ataccgccgg cacactcacc ggcagcgaac 299581 aggcctggca ccgtggcggc gccggtgtcc gcgtctactt cgacaccgcc catcacgtag 299641 tgacacgtcg gcccgacttc cattgcctgc gttgtgatat cgacttcagc gagctctttg 299701 aactggtgat acatcgacgg caatcgccgt ttgatctcgg cgggtgtcag ccgggatgcg 299761 atgtcgaggt agacgccgcc gtgcggggta ccgcggccgg ccttgacctc tgagttgatc 299821 gcgcgcgcga cctcgtcgcg gggcagcaag tccggggtgc gtcgggccga gtcgttgtcc 299881 ttaagccact ggtcggcctc ttcctccgtc tcggcgtact ggcccttgaa caccggcgga 299941 atgtagtcga acatgaagcg agagttctcc gagtttttga gcactccgcc gtcgccgcga 300001 acaccctcag tgaccagaat tcccttgaca ctgggcggcc acaccatgcc cgtcgggtgg 300061 aactggacga actccatgtt gatcagcgtc gccccggccc gcagtgccaa cgcgtgcccg 300121 tctccggtgt actcccagga gttggatgtc accttgaacg acttgccgat cccgccagtg 300181 gcaagcacca ccgctggcgc ctcgaacacg atgaaccggc cgctttcccg ccagtagccg 300241 aaggctccgg cgatcgcgcc ttggtccttg agcagttcgg tgatggtgca ttcggcgaac 300301 actttgatcc gcgcttcgta gtcgccgagc tcggcgtggt cctcctgctg cagcgagaca 300361 accttttgct gcagggtgcg gatcaactcc aggccggtgc ggtcgccgac gtgcgccagt 300421 cgcggatagg tgtgtccgcc gaagttgcgc tgactgattc ggccatcgtc ggtgcggtcg 300481 aacagcgcgc cgtaggtctc caactcccag acccggtccg gcgcctcctt ggcgtgcagc 300541 tcggccatac gccagttgtt caggaacttt ccaccgcgca tcgtgtcgcc gaagtgagtc 300601 ttccaattgt ccttcgggtt ggcgttgccc atcgcggccg cgcagccgcc ttcggccatg 300661 accgtgtggg ccttgccgaa tagggatttg cacacgacgg ctactttcaa gccgcgttcc 300721 cgcgcctcga tgaccgcgcg taaccccgcg ccgccggcac cgatcacgac tacgtcgtag 300781 gagtgccgct cgacctcaac cataaaacct cgctcagctt ctgaaacgat ccttcagcca 300841 ataaatctga gatctgtgat gctgccactg gccaccagca tgatgtagaa atcggtgagc 300901 gccagggtcc ccagcgtgat ccacgcgaat tgcatgtgtc gggtattgag cttgctgacc 300961 tgtgtccaga tccagtatcg cactgggtgc ttggagaaat gcttgagccg accgccggtg 301021 gcgtgccggc acgaatggca cgagatggtg tatgcccaca gcagaaccac attgatcgtc 301081 aaaatgacat tgcccaaacc gaagccgaat ccggacggcg agtgaaatgc cgcgatcgcg 301141 tcataggtgt tgatcagcga caccaccacc gcgatataga agaaataccg gtgggtgttc 301201 tggacgatca gcggaagccg ggtttcaccg gtgtaatgag cccgcggctc gggcactgcg 301261 cagcttgtcg gcgactgcca taccgaccgg tagtaggcct tgcggtaata atagcaggtg 301321 agccggaatc caagcaggaa cggtaatacc atcgctccca acggaatcca ccctggaaaa 301381 tgcccgaacc agacgccgag atgactggcg ccgggctggc aggacgcgct gacgcacggc 301441 gagtagaacg gcgtcaggta atgatatttt tccacccagt attggctgcc ccagaacgcc 301501 cgagtggtcg catagcagat gaacgccaaa agaccgaggt tggtcagcag cggtggcaac 301561 caccagaggt cggtgcgaag cgtccgttct gggatttgtg cgcgggtggg tgtgaaaacg 301621 ccgatcgcag gacggttcgc cgtgggtgcg ctcatctaat gtgatcctct tcgcgtgtta 301681 tctcgtcgaa gggtacacag agaacggccc cctttttctg gggggctcgg ttgttcagta 301741 cctgtgacct ccgacaccct catcgtcgac atcgcgccaa aattcgcgat cgtactcggt 301801 gtcggggatg gcgattttct cgctgggttg cggtacggcg gcccgttcca ggtctaattc 301861 agagacgtcg gtgtcgagca attcgatgtc ggtgaggatc cggtcggcat cgatcacgat 301921 gcgacgcgtt gccgggttgt cgccgaatcg cgccttcagt gcggtcacgc accgccgcag 301981 tccgccgacg aggtcgtgca gttcggcgag ttcggcagtc gtggacaatg ggtgctccct 302041 gggctggcgg tgttacagat cacagtacgc tcccgatact agctatcgac ggacggagtc 302101 gttgggtcta ctcggcccaa tggcatgatc cggcggaccc atcggcccgg ccggatcatg 302161 ccgtatcgcg aactacttcg tgatggcgat gcgctgcgcc tgagtttcgg ctggggcctt 302221 gtaggcgccg gcaacccgga cggtcagcac accggcgtca taggaagccg cgatggcctc 302281 gctggtgacg tgcgcgggca gccggaacga gcggcggaat gatccgtagc ggatctcacg 302341 cagggtgcgg ccgtctttgt ctccggcgtc ttgcgtgtgc tcgtcgcggt gttcgccgcg 302401 gatcaccagg cggctcaccg gctggccagg gtcaagctcg acgttgacgt ccttgtcgac 302461 gtcaatgccg ggcagttcca aacggaccac cgcgtcgtcg ccatccttga cgatctcggc 302521 ggccggcgtg aagtctccgg cgaccgggcg gtaccagtcc gtcgtcgcgg cagggccgaa 302581 gaagtcacgt agccagcggt cccagggctc aacgtcccac accggacgcg accacaatgc 302641 gagattgttc atggttatct cctcatgctt cgttgtgagt tagctgtgtc cggcgcgttg 302701 ccggcccgct ataccaagaa cctgagtcga ccacgcttaa gttccacctc ggcgttcacc 302761 ggaagcgaac actgtcacac agccggtcgc caggtgtgat cacagcgtca tatgtgcgtc 302821 acattcggcg atttttcggt aatttgcccc tcataccctc agaccatgcc tacggctggg 302881 agttcgcgcg cgcctgccgc ggctcgcgag atcgtcgtgg tcggccacgg catggtgggc 302941 catcggctgg tcgaagcggt gcgtgcccgt gacgcggacg ggtcgctgcg gatcacggtg 303001 ctggccgagg agggcgatgc ggcctatgac cgggtcggcc tgacgtccta taccgaaagc 303061 tgggaccgcg ccctgttggc cttgccgggt aacgattacg ccggtgacca gcgggttcgg 303121 ttgctactaa acacccgagt cacccagatt gaccgggcaa ccaagtcggt ggtcaccgcg 303181 gcagggcaac ggcatcgcta cgacaccctg gtgctggcca ccggctccta cgcattcgtc 303241 ccgccggtgc ccggccacga cctgcccgcg tgccacgtct accgcacctt tgacgatctc 303301 gacgctatcc gcgccggcgc ccagcgcacc ctggacggcg gtcacaccga tggcggggtg 303361 gttatcggtg gcggcctgct gggcctggaa gccgccaatg cgctgcgcca gttcgggttg 303421 cagacacacg tcgtcgagat gatgccacga ttgatggccc aacagatcga cgaggccggg 303481 ggtgcactac tggccaggat gatcgccgat ctcgggatcg cggtgcacgt cgggaccggt 303541 accgagtcga tcgagtcggt gaagcattcg gatggctcgg tgtgggcgcg ggttcgcctg 303601 agcgacggcg aggtgatcga tgctggggtg gtgatctttg ccgccggcat ccggccgcgc 303661 gacgagttgg ccagggcggc ggggctggcg atcggcgacc ggggcggtgt gctcaccgac 303721 ttgtcctgcc ggacaagcga tcccgatatc tacgcggtcg gcgaagtcgc cgcgatagac 303781 gggcggtgtt acggcctggt cgggcccgga tacaccagcg ccgaggtggt ggccgaccga 303841 ctgctggacg ggtcggccga gttccccgaa gcggacctgt cgaccaaact caagctgttg 303901 ggtgtcgacg tcgccagctt cggcgacgcg atgggggcaa ccgagaactg cctcgaggtt 303961 gtcatcaatg acgcggtgaa gcgcacatat gccaagttgg tgctctccga cgacgccacc 304021 acgctgctcg gtggcgtgct ggtgggcgat gcctcgtcgt acggggtgct gcggccgatg 304081 gtcggcgccg aactgcccgg ggatcccctg gcgctgatcg cgccggccgg atctggggcc 304141 ggcgctggcg ctttaggtgt tggggcgctg ccggattcgg cccagatctg ctcgtgcaac 304201 aacgtcacca agggcgagct gaagtgcgcg attgccgacg gttgtgggga cgttcccgcg 304261 ctgaagtcat gcaccgcggc cggcacgtcg tgtgggtcgt gcgtgccgct gctcaagcag 304321 ctgctagaag ccgagggtgt ggagcagtcc aaggcgctgt gcgagcactt cagccagtcg 304381 cgcgcggagc tttttgaaat catcaccgcc accgaagtcc ggactttctc cgggttgctt 304441 gaccgctttg gacgcggaaa gggttgcgac atctgcaaac ccgtggtcgc ctctatcctg 304501 gcatccaccg gctccgacca cattttggac ggcgagcagg cctcgctaca agattccaac 304561 gaccacttcc tggccaacat ccagaagaac ggcagttact cggtggtgcc gagggtgcct 304621 ggcggtgaca tcaagccaga acacctgatt ttgatcggcc agatcgcaca ggacttcggc 304681 ctctacacca agatcaccgg cggtcagcgg atcgacttgt tcggcgcccg ggtggatcag 304741 ctgcccttga tctggcagcg actggttgat ggcggcatgg aatctgggca cgcctacggc 304801 aaggcggtgc ggaccgtgaa gagctgcgtg ggcagcgact ggtgccgcta cggtcagcag 304861 gattcggtgc agctggccat cgacctggaa ctgcgttatc gcgggctacg ggcaccgcac 304921 aaaataaagc tgggcgtctc gggttgcgcg cgggaatgcg ccgaggcgcg cggcaaggat 304981 gtgggcgtga tcgccaccga gaaaggctgg aacctttacg tcgccggcaa cggcggcatg 305041 acgcccaagc acgctcaact actggccagc gacctcgaca aagagacgct catccgctac 305101 atcgaccgct ttctcattta ctacatccgc acggccgacc ggctgcagcg aaccgcgcca 305161 tgggtggaat cgcttgggct ggaccatgtg cgcgaggtgg tctgcgagga ctcgctgggt 305221 ctggccgagg aattcgaggc cgcgatgcaa cgccatgtcg ccaactacaa gtgcgagtgg 305281 aagggcgtgc tggaggaccc ggacaagctg tcccggttcg tttccttcgt caacgccccc 305341 gatgccgtcg actcgacggt gaccttcacc gagcgtgccg ggcgcaaagt acctgtgtcc 305401 attggtatcc cgcgggtccg atcatgaagt ccgggaggac aaaggaggga ctgtgacgct 305461 tctcaacgac attcaggtat ggaccaccgc ctgcgcatac gaccatctca ttccgggacg 305521 tggtgtcggg gtgttactcg atgacggtag tcaggtggca ctgttccggc tcgacgacgg 305581 ctcggtgcac gcggtcggta acgtcgaccc gttctccggt gctgcggtga tgtcccgcgg 305641 catcgtcggt gatcgcggag gtcgcgccat ggtgcaatcg ccgatcctga agcaggcttt 305701 cgcgctcgac gatggctcgt gcctcgacga tccgcgcgtt tcggtgccgg tgtatccggc 305761 gcgcgtcaca cccgaaggcc gcattcaggt cgcgcgggta gcggtctagc tcaccccgcg 305821 aacctcacag cttgagcaca cgtccggcga tgaccagatg tacctcatcg cagacggctg 305881 ccacgcgtcg gttgattgtg cccagtagat cgcgaaacag cacgcccgaa gaatgggatg 305941 gcaccacccc gaggccgacc tcgttcgtca ccacgatcgc agtgggcaat ccggtcagcg 306001 cggcgcacaa cccgtcgagc cgtgcctcga ggacggcgta gacgtccgcg gtcgcagcag 306061 accacaacgc ctcgccatcc atgatggccg tcagccaggt gcccaagcag tccacgagca 306121 cgggacttcg tgcctcggac aaagccgtcg cgacgtcggc cgtttccacc gttagccagg 306181 tcggtgggcg gcgagcgcga tgcagtgcga cccgggcgtc ccaatcggga tcgctgccag 306241 cggccgggcg gccaggcgcg acgtagacga cgtcggccgc atcgcccaac aacgcttcgg 306301 cgtgcgtgga ctttcccgag cggacgccgc cagtgaccag tatccgcacc gggtcatcgt 306361 aggtggggcg gcctcatggc gcgcccggag cgagaaaggg caaggtcggc gggcaaccat 306421 ggcgggccag gttgagcagc gcatcgacgt cgaggtgtcg ttcgacgaga tcgccgagca 306481 ggtcgaggcg gcgctcgcgt gcggccagga agcatgagcc cgacggggcg aggccgagcg 306541 tctctcgcag gaaggcctcg cgcagggcgt cgccttccaa cgagccgtgc cacatggtgc 306601 cgaacaccgg tccgtcgcgc gcgccgccga ggaactcctc ggcggtgtca ccgcgggtga 306661 tccggccgtg gtgaatctcg taccccgacg cgggcacacc gagtccttcg ccgcgcggta 306721 gccgcagcac cttgtggggg gaaaatgcgg tctccacgtc gagcaaaccc aagccctcga 306781 cctcggtcac ctgccctccc ggaccttcga tgccgtacgg gtcgcgaatc acccggccca 306841 gcatctggaa cccgccacaa atgccgagca gcggcttgcc cgccgcaaca tgcaccagca 306901 gcgcacgatc taggtctcgc gccctcagcc aggctagatc ggcgatcgtt gcccgggtgc 306961 ccggcaacac gatcagatcg gcatcgtcca gcgcgcgggg gtcggaagcg aacacgacat 307021 ccaagtcggg ctcaagaccc aatgcgtcga catcggtgaa gttgctgatt cgtggcaggc 307081 gcacgacggc tacccggcgg gccccggtgc ccgccgcgcg ccggccctgt aggtcgaggg 307141 catcttcgga gtccagccag aggtcggggt gccacggcag ggtgccgtac accctgcgcc 307201 cggtgacccg ttccaggtcg cgcagacctg gcgccagcag gtcggagtcg ccccgaaact 307261 tattgaccac aaaccccgcg accagcgcct ggtcctcggc agccagcaac gcgacggtgc 307321 ccaggaacgc agcgaacacc ccgccgcggt cgatgtcacc gacgacgatg gtcggcagtc 307381 ccgcatgacg ggcaagcccc atgttgacgt agtcacctgc gcgcaggttg atttcggccg 307441 ggctgccggc gccctccgcg acaacgacgt cgtagcgggc ggcgagggcg tcgaaggcgc 307501 ggcatgcggc ctcggcgagc gctcgccgcc ccgcacacca gcttgacgac gccacctcgc 307561 cccagggctt gcccatcaac accacgtggc tgcggtgatc actggccggc ttgagcaaga 307621 ccgggttcat cgccgcctcg ggcgtggtcc tagccgcgag tgcctgcacc cattgcgccc 307681 gaccgatctc cacgcccgtg ccgtcggggc ctcggcagac catcgagttg ttggacatgt 307741 tctgcgcctt aaacggcgcc acccgcacac cgcgtcgggc caacgcgcgg cacagccccg 307801 cggtcacggc gctcttaccg gcgtcgcttg tcgtacccgc gaccagcaga cccgacatcc 307861 gtctcccgaa ggtttctcac tccacccggg tcgctgagtc ggtgtcccag gttccgggca 307921 tcattggcgt gcgtgggctg ccgccgaacg cgtcgttggg taacgtgatc agtcctgcga 307981 cttgtccggg actggccttg tgggttgttc cggcgaaacc cagggttccc gctccttgag 308041 gcgaacccgt cgggtcgtgg ccggtttcgg ggtctaaatc caggtattcg taaccgcggc 308101 cgagctgttt gatctttggc cgccgacgcc gctgcggttg aacctgttcc tcgggcgccg 308161 ccgcggccgc tggggcctcg gcgctgtcgg gttccggcgt cttctttcga acgccggtgc 308221 cgacggcctt cctggcctgc gccgccgagt tcaggtcacc caccaggtac ccgaagcttt 308281 gtatgccggc tccggtcacc ggcggcgggg cggtcaccgg cggcggtggc ggcccgggcg 308341 ggggcgtcgg cgcggtcacg gccgtggggg ccggggctgg ggctggagtt ggggtggggg 308401 tcgggatact cggggcaatc gccgcgaccg gcgggatgac gggcggcgcg gatggcggga 308461 tgccaaccag gcccgccagc ccagacaagc ccgcgaagcc gcctgctgca ctcgcagggg 308521 caagggtcaa cggcgccagc ggggcggcta gcaggggcag cgccgccggg agcaacgcaa 308581 gagtttgctc gagcagcgtt ttaaccagcg cgatggtatc ggtgatgatc gtgccaatgg 308641 cttcgaccgt ggtgaacatc agggtaaaag cgatagttgc gggattgccc gacgcgaacg 308701 ccgccgccag atccgccccg atgaatgcga aggtttgcga caggaaggcg acataagatc 308761 cgatgtccat ggggtagcca agggcgaacg caatgttggc cgggcttagg aaggttagcg 308821 gattacccag cgagggcagc cacggatcaa atccggaaaa catcgcttgc aaaaagggga 308881 ggttggtcag ccagttgatg aacggttgta taacgttgtt gtagaagtcg gtatacccga 308941 tcttctgcaa ccattgcagc cattcctgga cttggttcgg ctcgtcggaa gccgctgtcg 309001 gcgcgttggc tttcacgatc tggggggctg gggtggtctg cggtgcggcg gccaccgccg 309061 cggtcgagac cgcttgatag ctggccatcg tggtggcggc ctggatccac atccgcgcgt 309121 agtcggactc gttgagcgcg atcgggatgg tgttgatgcc gaagaagttc gtcgccatca 309181 gcacgccgtg gagggcgtgg ttggcgccca gctcggccaa cgttggcatc gcggccaagg 309241 cggtgccgta ggcggtggcc gcggtttctt gccgggtggc catggccgcg ctgttagcgc 309301 tggcctgcac cagccacgcc agataagggg tatgggcggc cacgtaaacc gcggcggtcg 309361 ggccgtccca ggtgccggcc tgtacggcgg ccaacagcgc ggccagctcg tcggccgtct 309421 ccgcgtaggc gatgctcaac gagtgccacc cctcggccga caccagcagc ggaccgggcc 309481 caggcccgct gcttagcagc gccgagtgca cctctggggg cgaagccatc cagatcgggg 309541 cggtcatcgg cggctgaccg ccggcggagg tgtcgtcgcg tcgcgagcag ccacgttaag 309601 gcccagcagc gtggtggtgg cccgaccgct agacaaggtt tggagcgtca tgaccggtta 309661 gctttctcgg ggtacaccgc cccgggtggc aggacgcgat gacgcgagtc tcctggctcc 309721 cggatcgttg cttgcctcgc cttccagcct gtggccgtgg cttacgaggg tcgctccccg 309781 gtgacagtgg cgggaccgcg ccggattctc accggcttcc tgcatcgtca tcgcctgacg 309841 ggaagaatat tggcatgcag agcgtggatt tgcacgttga gcggcatttg ccaagcaggg 309901 gtcggtcaca tcgcacggtc gcaacagtca catgtgtcac tgcactaggc gacatccgat 309961 ctgcccagct ctcagcgaca ggcgcctggc cggcggtttt gttcccaagt tggtcgtggc 310021 tgtgcgggat tggaggcggc gttgacctgc agaaaccgag ttgtcgcgct tagctgggca 310081 cagcgaccat cgccgacggc ggagctcggc gtcggtgagt cgcttcggtc ggccggggcg 310141 gcgcgattcg ggttcgacca cgtggtcgtc gaccagctga cgcgccgaac gtgcaaccac 310201 ggcggcagcg cccggcgacg tgtccccgcc accagtacac gttcggcgca gccagtgcac 310261 acacggcacg gagtttagga cttactcatt tggctatccg cgaccgatat cgccgaccag 310321 gtagcgctgc attgtcgggc caatggcgtc gaccagcatc tccaccgaca tggagtgcag 310381 cggctcagaa cgcaccccgt agcgcatgat gcccaaaccg acgagttgag cggcgcacag 310441 cgacgctcgg atggcaatct tgtcggcccc gagcatcttg agcaacgggt tgaagaccgg 310501 tccgatgaac atggactgca cgatctcggc ggtcttggct agcccggtgg ttgcgatggc 310561 gctcgccgca aagggaccgc cgccggccgc atcccaggtg gtgatcagca cgtagagggt 310621 tcggcggcct acctggttga cgcttccggt gacgattttt tcgatgaaat ccggtgtgcc 310681 gaagggcaac cgcagcatct tcgctaccgg gtcgaggagt ccgcgtgatg gctcttggct 310741 acgggccatg gtgtcaggat caccccgctg tgatcaaaga tcaagcgtca ccggtgtcgg 310801 cgtgccatgc cagcggtgca gccgttgctg acgtgctacc gcgctgcgaa atcggttcgc 310861 gaccagctgt gccaagcccg gatgggtgcc gagcggtcgg gttaccacat cggcaccgga 310921 tgcccgcagc cgctcttgaa aaaggccttc tgccaacagg aaggaggcga ccgcgacgcg 310981 gcgcgcacct cggttggctt cggcccggtc tcgggcccgc tgcacagccg tgcgcacatc 311041 cggaccgccg gtgcccgcaa atcccatgtc cacccatgat ccggtcagtt cggacactag 311101 cgtccgagtg gtgtgcaggt cggcacgtgc ccgcctatcc gacgcgccgg ccgctgcgag 311161 gatcactgaa tcgccaggac gccaaccgga ttccaccagc tgctgggtga ctatctgcgc 311221 gatctcacgg catggcccca acgcgggggt gaccgtgaca tgcgggtgcg cactggctgc 311281 gacatgagcg ggcaggtcgg tgcgaacatg atatccgcgg gacaagaacg cgggcaccac 311341 gattgcggga cggcaggaaa gggcggaaag cacttcgctg ggtgagggtc cgagcacatc 311401 aacgaaggcg acctgcacag tgcggtcgac gagcgcgctc acttgcgcgg cgatgtccgc 311461 tatcatcgcg acaccggacg gtctgcgggt tccgtgggcc gtcaagatca ggttcatacg 311521 tcatcgtgcc ggctgtcaac ggcgagacgg tagccacgtt tcaccactgt tgccacgatg 311581 ttcttgtcgc ccagagccgt tcgtagccgc aggacggcgg tgtccacggc gtgggtgtcg 311641 ctgccgtcgc cgggtaggac gcgtagcaag tcgccacgag agacgacgcc gccggggcga 311701 tgtaccaacg cgcgcaaaat cgccattccg gacggcgata gtggcttcac cgaatcatcc 311761 accagcacag aggttccacg gatctcgatc acgtggccgg ctgctttgaa cgtgcacgaa 311821 cccagcagcg gcagctcctc ggcaatgtgg cgggctaagg ctcccaaccg cattcgctcg 311881 ggagccgacg tcgggacgcc ctttcggatc aacggccgcg aagttaccgg gccgacacac 311941 atcgcgtgca cgtcggtacg cagcgcagcc aacagttggt cctcgatatc caattcacgg 312001 ctgcgttcta gcaccgcggc tgcggcaggt gccgacgtga aggtgaccgc gtcgaattgt 312061 cgtcgcgcga tcccggtgac taaatggtcg aacacgccgc ctagtggcgc cggcttccac 312121 cggtaaaccc ggatcggcac cacttgcgcg ccggcgaaac gtaacccgcc cagaaattcc 312181 ggaaacgggt cccagctgtc ggcggcaccg tgcagctgga cggcaatacg cgtacgggac 312241 acccccgatt cgagcagata ttccagcact tcatgcgacg attcagagtc gggggaccac 312301 tcttcacgca ggccggcggc acgcagcgca ccagttgcct ttggtccgcg ggagatgatc 312361 cgggccgacg acaacgattc caggagctcg ttggccagcc cccacccctc ggccgcggcc 312421 aaccagccgc gaaatccgat gccggtgtgg gcgaccagaa tgtcaggcgg gtcggcgatc 312481 aacgcctcgg tgttgttctg cagttcatcg tcgtcgggaa gcgcgatcat cttgatcgct 312541 ggggcactac agacctcggc gccctggcgg cgaagcaatg cgcacagctc ttcggcgcgg 312601 cgagcggatg tcaccgcgat ccggtagccg gtcagtggcg ccgagtgtgc ctgggccata 312661 tgacgtgtct aggcctgtga ggtttcagtc gcgttaccag gcaattgctg ccggattgcc 312721 cattgccgat acccacctct gtggcttcgg gcgtggcgct agacgtaggc caagcccgcg 312781 ggtgcggtgg tcgccggcac gagctcgccg gcgctcttta ggccccgacg cacataaatc 312841 gcccaggtca gcaccgaggc gaccaggtag aacaccccga aggcccaaaa tgccgaggtg 312901 gccgtgccac tggtcaggta ggactctcgc agagccaggt tgacgcccac tccgccgagc 312961 gcgccgaccg ccccggccag gccgatcagc gcgcctgaca tcgaccgcga ccactgcctg 313021 cgctcggctt cactgatctg cagcgaatgg ctgcgcgcct cgaagatcga cggaatcatc 313081 ttgtacacag agccattgcc gatgccggac aaaatgaaca gagccgtgaa gccgatgacg 313141 tagccgacca tcgtcgcagt cggcatcggc ccggccaggt ggtcaccgaa agtgcttgcg 313201 ctgatgagta ttccggtggc cagcagcatg gcgcagaagg cagctagggt gactcggccg 313261 ccaccgatac ggtcggcgag cttgccgcca tatattcggg acagcgatcc caatagcggc 313321 cccaggaagg cgatctgggc cgcatgcagc gaggcctgcg ccgtgctctg accgctggcg 313381 atgaagttga tctgcagcac ctgaccgaat gcgaaagaga acccgatgaa cgagccgaaa 313441 gtgccgatgt acagcagcga gatcacccag gtgtgcggct cggacactac cgcacgcatg 313501 gtgttcagct cgatgcgata ctccgtcagg ttgtccatgt acagtgcggc gccgaggccg 313561 gcgaccgcca gcagcaccag atatatcgcg cacacccagt agggctcgcg gtcaccggcc 313621 gttgcgatca ccagcaggcc gaccaactgc accatcggca ccccgaggtt gccgccaccc 313681 gcgttgagcg caagcgcggc gcccttgagt cgttgcggaa agaaagcgtt gatgttcgtc 313741 atggaggcgg cgaagttgcc gccgccgagg ccggctagcg caccgcacac cagatacggc 313801 cacagtggca aaccagggtt ggccagcaac agaatgctgc caacggtcgg aatcaacagc 313861 accagtgcgg aaaagatggt ccagttgcgc cccccgaact ttgcggtggc aaatgtgtaa 313921 gggaagcgca ggcatgcccc gaccaaggtc gcggtggcgc cgagcaggaa cttgtcgccg 313981 gcggaaaagc cgtacaccga tgtgggcatg aacagcacca tcaccgacca gagggaccag 314041 acggaaaatc cgacgtgctc ggcggccacc gaccagatca gattgcgtcg ggcgatgaat 314101 ttgttgccgg cctcccacgc caccgagtct tcgggatccc agtcggagat ctggtgggaa 314161 cggcccatac tgacccctat cgtgatcgac gttctcgatc acgctagaaa tcctttgttg 314221 cccgggcgct tccggtagtg accccggcgt gaactttcgc tcacacggtt accgccagcg 314281 tgtgagggcg gccgtgcagc ggagcggatt accagacgtc gcccgcgcgc caatcgcaca 314341 tcagctccgc cgaggtgtcc aggctgatgt cgatgggcag gacgaacacc gttccgtcgt 314401 catcgggtgt acggactgga ccggttggtg ccagtaccga tgtcgggccg tgccagggca 314461 gccagccgcg tgaggcgtac agtctgcggg cccgcgccga ggaactgagc gctccgagct 314521 ggtaagcgcc gcgcatcacc tgctcgacgg cgtccaacag cgcgctcacc aggcgttggc 314581 cccgccagtc cgcccgcacc gcaacgcctt cgacgtaccc gcagcgcagc gcgttgccgc 314641 ggtagatcag tcgccgctgg atcaccgcgg catgcgcgat gatcgccccg tgatgccaga 314701 tcagggcgtg catcccaccc agcgtgtgct cccagtcggt ctcggtgaag tcaccggcaa 314761 acgcgccggt gaccatctga cggatgtcct ggcgggtctc gctgtcaaga tcggcggtgt 314821 ggaccaggcg ggccgtgtgt acctgggtgt gcacagtccc tgtctaccag gcttgtgtta 314881 caccctggcc aggcaaccga gaccggggtc gtgcccagtg cagtcgcaca tattggccgg 314941 gccgtatctg cgcgaccttg tcgatgtcct cgtcggtgat gacgccgacg accggatagc 315001 ttccggtgat cgggtgatcc ggccccagga tcaccggtaa tccgttgggc ggcacctgga 315061 ttgcgccgcg ggtaacgcct tcgccgggca gttgccgatc cggccagcgg tgctgtagcg 315121 ggcggccctg tagccgcatt cctacgcggt cactgcggtt ggacgccatc cagatggtat 315181 gcaccaacgc gtccgggtcc accagccagt cgtcgcgcgg cccgggcacc acccgcagct 315241 ccaccagatg ctcctcgata gcggccaccg gtgcctggtc gagttcggga tagtcgtcgg 315301 tgtgttcgcc gaccggcagc acgtctccgg cccgtagcgg cgacgggccg atcgccgaca 315361 tcacgtcgta gctgcgtgac cccagcacgg gctccacaca gacgccgccg cgcaccgcca 315421 gataggtccg cagcccggcc cgtggggtgc ccagtgagat cacctggccg tcccggacgt 315481 ggtgaatgct gttggtgccg accatgattc cgttcacggt cggatcggtg tcggcgcccg 315541 tcaccgcgat gtcgacgtcg ccgccgcgaa cccgcgccga gaagccgccg aaggtcactt 315601 cgaccgtggc ccaatcgtcg gggttggcga ctagccggtt ggccagcgtg tgggagcggc 315661 ggtcggcggc accggatcga ccgacaccga gatgggccag tccggcacgg ccgaggtctt 315721 cgacgagggc cagcggtccg ctgcgcagga tttccagtgt tgtcatggct gcttcctcca 315781 gctcaggcgg cccggaactg aacccacatg cccggtgtga gcagcgccgg ctggggtcgg 315841 tcgacatccc acaggaccgc gtcggtgtgg ccgatgatct gccaatcgct gggcgcttga 315901 gatggatata tcgcgctgaa tccgtcggcg agggcgaccg atccgggcgg catcgaggtg 315961 cgccgttcgg gccggcgcgg cacccgcagg ctcgggtcgc cgtcgatcag gtaggcgaac 316021 cccggggcgg acccactgaa tcccgcccgc catccggtgg cggtgtgggc gttgatgacc 316081 gctgcggtgg tcaggccggt gcagcgggcg acctcggcga ggtctgggcc gtcgtagacg 316141 acgtcgatta ccaggtcgca tcggtgatcg gccgcagcca ccgcctcggg ggtgacccgc 316201 aacctgcgca gccgctgacg ggtgacccct tggtagcggg gcgcgtccag cttcaccaat 316261 acggtgcgcg aggccgcaac gatgtcgacc acaccgggta gcgccgcggc tcgcaatgca 316321 tcggtccatg ccattgcgtc agcggtgctg tcacattgca gcatcagcgc atggtcgccg 316381 tagtcgagca cggtgcaggc caatgccgcg tccataaaga cgtccatcac actcatgcgt 316441 cgacggtagc gctgcaatct tcggctcggc cagggatttt cgagactgcc agaggtgcct 316501 tagcaaatgc tcatgcgccc aagatctggc tgatctgtgg cggcagttgc tcggccacca 316561 cgggataact cagcaccgac gagaaggcga tggctccggc ctgttccttg gaagtgaaga 316621 tgtggcgccg ctgggcggtt gcctgcgacg ccgcaatctc cgggtcggcc aacaacgcct 316681 tctcgtcctc ggggctctcg gtcatccaga tcagcacatc ggcggcatca agcaccgctt 316741 taatgtgatc gcgcggaatg acgccgcgct gatcgacggc gaagggtttg atgctgtcgg 316801 cgatcaccag acccatgtcg ttgaggaagt cagttcgcca gcccgccagg gttgcgacca 316861 cgttgccctg ccagaggcga ccctgcagca acagcgcctt cttgccccgc cagcgcggat 316921 gccgctgcgc caccgcggcg aacttctggt cgacggcctc gatcagcgac ctcatccggt 316981 cggccgcaaa caccgcctgg ccgatcgacc tggcctggtc cttccacggc tcgaagaatg 317041 cgtcgccgcc ggactgggcg acggtcgggg cgatcgccga cagctgctga taggtatcgg 317101 cgtccacccc ggcgttgatc gccacgatca ggtcgggttt taaggcggcg attcggtcga 317161 tctgaatccc gttgtccagg ttcaataccg ccggccgcgc cccgccgagc ttgggcgccg 317221 cccacggcca caccgcaaac ggctggtcac cgaaccagtc ggtcaccgcg atgggcacca 317281 catcgaccgc gagcaagtcg tcctgctcgg tgtagccggc gctgaccacg cgcttgggtg 317341 gctctttgat gacggtctga ccgaacaggt gggtgatagt taccgccgcg ccgccaggag 317401 tgcccggcgg gggtttgggc gatgaacagc ccgcgaacag cccggtggct gctgcagcct 317461 cggcgacctg caagaatccc cggcggctgc atccctgtcg cacagcgtga gggtatcgcg 317521 cgcgttaccg ccggcgtcgg gcgctggtac ttgctggccc gtatccgccg ccgccggggg 317581 tttcgatcac cagcgtgtcg cccggctcga cgtgcgttga gccgcatccg gccaactcga 317641 cggtgctgcc gtcggcgcgt tccactcggt tgcgtcccag ctctccgggg gagccgccgg 317701 ccatgccgta gggccgaacc cgccgatgac cggagagcgt gctgaccgtc atcggctcgg 317761 tgaactcgag gcgtcggacg gcgccgtcgc cgccccgcca gcgaccggcg cccccgctgc 317821 cctgacgtac ggcgaactcg cgcagcaaca ccgggtagcg ccactccagc acctcgggat 317881 cggtgagccg ggagttggtc atgtgcgtct gcaccaccga ggccccgtgg tacccgtcac 317941 cggccccgga gcccgatcct acggtttcgt agtactggtg ccgctcgttg ccgaacgtga 318001 cgttgttcat cgtcccggat ccctcggcct gcacacccaa cgcggcgaac agcgcgccgg 318061 tgatcgcctg cgaggtttcg acgttgccag cgaccaccgc ggcgggatgg gttggtgcga 318121 gcatcgagcc ttcggggacg acgatacgca acgggcgcag gcaaccgtcg ttgagcggga 318181 tgtcgtcggc gaccagggtc cggaacacgt agagcaccgc cgcattcacc accgaggtcg 318241 gtgcgttgaa gttggtgtcc agctgagccg aggttccggt gaagtcgatg gtcgcgctgc 318301 gggcggcgcg gtcgacggtg atgcgcacgg cgatcgtcgc gcccgaatcc atgcggtagc 318361 ggtaggcgcc gttgtcgagc cggtcgatga cccggcggac cgcttcctcg gcgttgtcct 318421 ggacgtggcg catgtaggcc gccaccacgt cgcggccgaa gtggtcgatc atttttccga 318481 cctcgtcgac gcccttttgg ttggcggcga tctgcgcgcg cagatcggcg aggttggtgt 318541 cgggattgcg ggaaccgaac ggcgcctcgg taagcaggcg ccgggtttcg gcctcgcgga 318601 accgtccgtt ctcggcgagc agccagttgt cgaacagcac gccctcttcg tggatctcgc 318661 ggctgtcggc gggcatggag ccgggggtga tgccgccgat ttcggcgtgg tgcccgcgag 318721 aggcgacaaa gaataggacg tcctcgccgc cggtgttgaa caccggggtg atcactgtga 318781 tgtccggcag gtgggtgccg ccgtggtacg ggtcgttgac ggcgtatacg tcaccgggct 318841 tcatgccgct caagcgccgg cggatcactt ccttgacggt ggtgcccatc gagccgaggt 318901 gcaccggaat gtgcggggcg ttggcgacca ggttgccgtc cggatcgaac agcgcgcagg 318961 agaagtccag ccgctcccgg atgttcaccg actgggcggt ggcttccagc cggaagccca 319021 tctgctcggc gatcgacatg aacaggttgt tgaagatctc caacagcacc gggtcggcct 319081 cgaaaccggc ctcgaaaccg gcccgagtgg ccgcatcggg ccgcggcggg gtgaccactc 319141 gttgcgcgag caggtgcccg gtctccgtca tcgtcgcctg ccagccgtcg tcgacgacgg 319201 tggtggcgtt ggcctcggcg atgatcgccg gaccggtcag cacgtcgccc ggccgcatcg 319261 cctccctacg ccgcagcggt gcgtcgcgcc acaatccgtt cgaatagatc cgcacggttt 319321 ccgacgagcc ggtggtgtcg ttggcctgat cgcccagctg ggacaggtcg ggctggtcgg 319381 tgagcccggt cgcctcgacc gagatcgctt cggcgatcag cggacgatcc agcaggaacg 319441 tgtacagcgc gcggtggctg ctttcaaacg ccgtggccat ggtctcgatc tcggccagtt 319501 gcacggggat cgcggtatcg gttccctcat agcgcaggtg cacccggcga accacccgga 319561 tgcgctcacc cgggacgccc tcgtccagca actcggcgcg ggcggctcgt tcgagggatt 319621 ccgcaacgct ggccaaacgc tgtggcgcgg cgggtccgag cgggatctcc accgattgtt 319681 cgcgcattgc ggtggtgtcg gccaggccga tccccagcgc ggaaagcacg ccggccattg 319741 gtgggatcag caccgtgcgg atgccgaggg cgtcggccac cgcacatgcg tgctgaccgc 319801 cggcgccgcc gaacgtcgtc agcgcgtacc gcgtcacgtc gtgtcccttt tgcacggaga 319861 tctttttgac cgcgttggcc atgttcgcca ccgcgatccg cagatatccc tcggcgacct 319921 gctcgggtga ccggtcgtcg ccggtccgcg cggcgatgtc ggcggccagg tcggtgaagc 319981 cacgccgcac ggtcccggcg tccagcggct ggtcgccgga aggaccgaat acggacggga 320041 agtgggtggg ctggatgcgg ccgagcatca cgttggcgtc ggtgacgcac agcggtccgc 320101 cgccgcggta gcaggccggg ccggggtcgg ctccggccga gtccgggccg actcggtagc 320161 ggctcccgtc gaaatgcaga atcgacccgc cgccggcggc caccgtgtgg atgtccagca 320221 tcggcgcgcg cagccggacc ccggcaacct gggtcgtgaa gacgcgttcg tactcgccgg 320281 cgtagtgcga cacgtcggtc gaggtgccgc ccatgtcgaa gccaataaca tgatcgaagc 320341 cggccagcgc cgacatccgc accatgccga cgatgccgcc ggccggacca gacagaatcg 320401 cgtccttgcc gcggaagtgc ccggcctgcg ccagcccccc gttggactgc atgaacatca 320461 gtcgcacacc ccgcatctgg tcggccacct ggttgatgta tcggcgcagc accggggaca 320521 agtaggcgtc gaccacggtg gtatccccgc gcgggaccag tttcatcagc gggctgacct 320581 cagatgacaa cgagatctgg gcgaagccga tgcgctgcgc cagcgtaccg atttctcgct 320641 cgtgtcccgg gtagaggtaa ctgtgcaggc acaccaccgc gaccgcgcgg attccgtccg 320701 catgggcctg ccgcatcttc tcgcccaatg cctccaggtc gggtgcccgc agcacccggc 320761 cgtcggctgt gacccgttca tcgacctcga cgacccgctc ataaagcatc tcgggcaaca 320821 cgatccgccg gtcgaagatg cgcggacgat tctggtaggc gatgcgcagg gcgtcgccga 320881 aaccgcgggt gatcaccagc agtgtgcgct cacccgtgcg ctcgagcaac gcattggtcg 320941 ccaccgtggt gcccatccgc accgcgtcga cgcgcgtgcc cgcctcgccg ttcgctagca 321001 gcgcacggat gccggccacc gcggcgtcgc gatagcgtgc cgggttgtcc gacagcagct 321061 tgtgggtcag cagccgtccg tccggccggc gcgccacaac gtcggtgaac gtgccacccc 321121 ggtcgaccca gaagtgccac cccgcgccaa ccacccggac tcccccttca cgctcgcagc 321181 cggtcccgtc ctcacaacgg cagacgggcc gaagccacct aaaggtatct ccgctgtaac 321241 agcgcgcatc cgggccggta acagggtctc tttagcgtcg agccgtcatt accgctgatg 321301 tcgcccgctt gtcgacagga gacctaaccg atggcactca ccaccgcccc ggcaatcgat 321361 tatgcgctgc cacgccagca ggatgagggc gatcactgga tcgacgactg gcgcccggaa 321421 gacccggtgt tctgggagac gatcggcagg ccgatcgccc gccgtaacct gatcttctcc 321481 atcttcgccg agcacgtcgg cttcagcgtg tggatgctgt ggagcatcgt ggttgtccag 321541 atgaccgccg ccgctcccgg gcaccccgcc gcgtccggct gggcgctgtc cgccagccag 321601 gccctatgtt tggtcgccgt ccccagcggt gtcggggcgt tcctccggct gccgtacacc 321661 ttcgcgatcc cgatctttgg tggccgcaac tggacgaccg tctcggcggc gctgctggtg 321721 atcccgtgcc tgctgctggc ttgggcggtg agccaccctt ccctgccgtt cgcggtgttg 321781 gtggtgatcg cggccaccgc cggtttcggt ggcggcaact ttgcctcatc gatggccaac 321841 atctcgttct tctacccgga gaaggacaag ggttgggcgc tgggcctgaa cgcggccgga 321901 ggcaacatcg gggtggcggt ggtgcagaag atcattccgc ccatcgtggt cgccggcagt 321961 ggggtggcac tgtcgcgtgc cggactgttc ttcgtgccct tggccgtcgc cgccgcggtg 322021 tgcgcattcc tgtttatgaa caacctcacg gaggccaagg ccgatgtgaa gccggtgtgg 322081 cagtcgctgc ggcatgccga cacctggatc atgtcgctgc tgtacatcgg cacctttggg 322141 tcgttcatcg ggtattcggc ggccttcccg acgttgctca agaccgtgtt tggccgtggt 322201 gacatcgcgt tgggttgggc cttcctcggc gcgggcatcg gttccctggt ccgtccgctg 322261 ggcggcaagc tcgccgaccg gatcggcggt gcgcggatca ccgcggccag tttcgtcatg 322321 ctggcggccg gggcggctgc ggcgttgtgg tcggtgcagt cggtcaatct gccggtgttc 322381 ttcgtcagct tcatgttctt gttcgttgcc accggcatcg gcaatggttc gagctaccgg 322441 atgatctcga ggatcttcca ggtcaaaggc gaagtcgccg gcggggatcc ggaaacgatg 322501 gtgaacatgc gccgacaggc cgccggagcg ctgggcatca tctcctcgat cggcgcgttc 322561 ggcgggtttg tggtgccgct ggcctacgcc tggtcgaagg tgcacttcgg caatatcgaa 322621 cccgccctgc acttctacgt ggcgttcttc cttgccctgc tcgtcgtcac ctggtactgc 322681 tacctgcgta gaaccacccc catgggccag gtgggggtgt agttagcccg gcggcggtct 322741 cacgttgtga gccacgcgca aactcagact ctgccgatgt caacgcccag ctcggcaccg 322801 agcttgtcca gcggcatggt gacgtgctct cgctcgtgca cgctgagccg gtcgcgcagg 322861 tcttcaagct cttccattag gctctcgtag cgctcagctg agatgaggat ggctgcgggt 322921 cggccatgat tcatcaacac gacgtcatcg tcggcggatt cacgcacaag cctcgatagg 322981 tgagcgcggg cttcactaat aggcactaga ctgctggtca tcggtagacc ccccttcggt 323041 gtccgacgcg agtaacggtg atgacacgcg ctgcgtcgtc gacgatgtaa acaacgcgat 323101 agttgccgag gcggatgcgg taagtggtgt cgaagccact catcttctcg cagccacgcg 323161 ggcgcggttc gtcggcgagc gcggcgacgg cggtcagatg cgccgctggt cgtggcggtg 323221 cagccgttgg attgctttag ctgccgagtt ctcgatttcg accgcgtacc cacttgccat 323281 acaaaaatgt acagacttca gatgcataat ataagcgcta attttgccga cgcgctctca 323341 ccgcggccac gggctgtagt cggcgatcag ctcctcctgc ggcgggcgct ggtcggccgg 323401 gacgtgctgc aggttgatcc ggatccgata ccagatcgaa ctcggcccgc gcatgccgtc 323461 gaccagcaca tcggcgggcc gcagcaaagc cgcggcgccg ggataccggt cgcgccagat 323521 gtccaacgcc gccatcgcct cggcccgcgt cttggcgcgg gcgatctcga tcagcggctt 323581 ggcggattgc gccttctgcg gcggacccag ttcctcggcc agcatcaaca gccggtcgag 323641 ccggccaacc gcgtcatcca tcccggccca ggggtcaccg atgtcggcca accggctggg 323701 cacggtggcc atggtgaaca ccgccgggtc gcagccgggc acctcctccc agtgcagcgg 323761 cgtggacacc cgggcatccg gggtggcccg caccgagtag gccgacgcga ccgtgcggtc 323821 cttggcgttc tggttgaagt cgacgaacac gccctcgcgt tcttccttcc accaacgact 323881 ggttgccgcg tcgggtaggc gccgttcgac ctcacgcgca acggtctggg cggccaggcg 323941 cacctgggga aacgaccagc aaggcgcgat ccgggcatag acgtgaaagc cccgcgaccc 324001 ggacgtcttc ggccatgcgg tcaacccgta atcctccagc acctcccgga ccaccaacgc 324061 gacctcgacg acccgctgcc acgcgacccc gggcatcggg tccaagtcca cccgcagctc 324121 gtcgggatga tcgaggtcgc cggcgagcac cggatgcgga ttgagatcca cacaccccag 324181 gttgatcacc cacgccagcc cggcggcgtc gtgaatgacc gcctccgcgg cggagcggcc 324241 cgacgcatag tgcagctcgg ccacgtccac ccagtctggc cggtttgccg gtgcgcgctt 324301 ctgaaacacc gcctcggcgg agatgccctt gacgaaacgc ttgagaatca tcggccggcc 324361 ggccaccccg cgcatcgccc cctcggccac ggcgaggtaa tagcggacca gatcgaactt 324421 ggtgtagccc tttcgatcgt tgtgagcggg gaagacgacc ctgcccggat gcgtgacgat 324481 gacctggcgt ccgtgcacgt ccagcgacac cggggcggcc atgcggctca tggtaatttg 324541 cgacccgcct cacatagggt gaggtcatgc ctaacctcac tgatctgccc gggcaggccg 324601 tctccaagct ccagaagtcc atcggacagt acgtcgcgcg cggcactgcc gagttgcatt 324661 acctgcggaa gatcatcgaa tcgggcgcga tcgggctgga gccgccgctg aactacgccg 324721 cgctcgcagc cgatatccgc aagtgggggg aagtcggcat gctgccgtcg cacaatgcca 324781 ggcgcgcccc caaccgggcg gccgtcatcg acgaagaagg cacgctcacg ttttccgaac 324841 tcgacgaggc cgcacacgcg gtggccaatg gcctactggc caagggtgtc cgcgccgggg 324901 acggcgtcgc catcttggcg cgcaaccacc gctggtttgt catcgccaac tacggggcgg 324961 cccgagtggg ggcccgcatc atcttgctca acagcgagtt ctccggcccg cagatcaaag 325021 aggtgtcgga ccgtgagggc gccaaggtga tcatctacga cgacgagtac accaaggccg 325081 tcagcttggc ccagccaccg ttgggcaagc tgcgggcgct tggtgtcaat cccgacgacg 325141 acaagccgtc gggcagctcc gacgaaacgt tggccgagct gattgcgcac agcagcaccg 325201 cgcccgcccc gaaggcgagc cgccgtgcgt cgatcatcat tttgaccagc ggcaccaccg 325261 gcaccccgaa gggggcgaac cgtaacacac cgccgacgct ggctccgatc ggcggcattt 325321 tgtcgcacgt gccgttcaag gccggcgagg tgacgctgtt gccgtcgccg atgttccatg 325381 cgctgggtta catgcacgcc gcgctcgcca tgttcctggg ctcgacgctg gtgctgcggc 325441 ggcggttcaa gcccgcgttg gtgctggaag acatcgaaaa gcacaaggcg acatccatgg 325501 tcgttgtacc agtgatgctg tcgcggatcc tcgaccagct ggagaaaacc gaacccaagc 325561 ccgacttgtc gagcttgaag atcgtgttcg tatccggatc gcaattgggt gccgagctgg 325621 ccacccgcgc gctgggggac ctcggcccgg tcatctacaa catgtacggc tcgaccgagg 325681 tcgcgttcgc caccatcgcc ggccccaagg atctgcagtt caaccccagc acggtggggc 325741 ccgtcgtcaa gggggtgacg gttaagatcc tcgacgagaa cggcaatgag gtgccgcagg 325801 gtgccgttgg ccggatcttt gtgggcaatg ccttcccgtt cgagggttac accggcggcg 325861 gtggcaagca gatcatcgac ggcctgttgt cgtccggcga cgtcggctac ttcgacgagc 325921 gcggcctgct gtatgtgagc ggccgcgacg acgagatgat cgtctctggt ggtgagaacg 325981 tgtttcccgc cgaagtcgag gatctgatca gcgggcatcc cgacgtggtg gaggccgccg 326041 cgatcggcgt cgacgataag gagttcggtg cccggctgcg cgcgttcgtg gtcaagaagc 326101 cgggagctga cctcgacgag gacaccatca agcagtacgt acgcgatcat cttgcccgct 326161 acaaggtgcc gcgggaggtg atcttcctcg acgagctacc gcgcaacccc accggcaagg 326221 tcctcaaacg tgagctacgc aagctgtagc tgctcgcgcg ggtacttacg ggtcgcgggg 326281 taggcccagc aaccgctcgg cgatgatgtt gagctggacc tccgacgtgc ccccgtagat 326341 ggtggtggcc cggctggcta gcaggtactc gccccacttg ccgggcaatc gctctgtgtc 326401 gccgatcacc gcatcggtgc caaaggacga caccgcgaat tcggcataac cctggccggt 326461 gcgcatggac aacagcttgg agatcgccgc cggcgccatc gggtcacccc cggccagcgt 326521 caacagcgtg gagcgcaagt tgagcagctt ggtggcgtgg ccctcggcga tcaattgccc 326581 ggcacggtgt cgcgcgacct ggtcgaactg tccttcgaaa cggtaatcgc gaacgaagtc 326641 gacgaactcg cccagggtgg ggaggaaggt cgaatcgctg ccgccgatcg acacccgctc 326701 ggccgtcagg gtgttgcggc tgacctccca cccccggttc acctccccga gcaccaactc 326761 gtcggggacg aacacgtcgt cgaggtagac ggtgttgaaa aactccttac ccgtgagctc 326821 gcgcagcggc ttcacttgta cgccttcgct tttcatgtcc agcaggaagt aggtgatgcc 326881 gttgtgcttg ggcgccgacg ggtccgtccg cgccagcagc gcaccccatt gggagtactg 326941 cgcgccggtg gtccagatct tctgaccagt gatgcgccag ccaccgtcga cccgggtggc 327001 cttggttgcc aggctagcca ggtccgatcc cgcgcccggc tcggagaaca gctggcacca 327061 gaaaatgtcg cctcggaacg ttggcggcag gaggcgctgc ttctgattgt cggttccgaa 327121 cgcgacgatc gacggcacga tccacgtcgc gatggcaatc tgcggccgct tgacccgccc 327181 ggcggtgaac tcctgggcga tgatgatctg ctcgaccggg ctggcggccc gaccccacgg 327241 cttgggcaga tatggcagca cccacccacc ttcggcgatc gcgacagtgc gcggctctcg 327301 cggcatcgcc ttcagcgcgg cgacttcggc ccggatctgg gcccgcagct tctcggtaga 327361 ggggtccagg tcgatgtcta ccggacgcat accggcagtc gtcgcggtgt ccaccacccg 327421 ctgcggatac tccgagccgc ggccaaagca cgcggccagc atcaacgccc ggcggtagta 327481 gacgttcgtg tcatgctccc aggtgaagcc gatgccgccg tgcacctgaa tgcagtcctg 327541 cgtgcagcgc tgagcggtcg ccggtgccag cgtcgccgcc accgccgccg cgaattcgac 327601 gtcggagcta gattcgcccg cgtcgtctaa ggctcgcgcc gcgtcccaca ccgcggcggt 327661 ggcccgctcg gtgtcagcga tcatctcggc gcacttgtgc ttgatcgcct ggaattgccc 327721 gatcggccgg ccgaattgtt cgcggatctt ggcatatgcc gacgcggtgt cggtcgccca 327781 ccgcgcgacg ccaacggctt cagcggacag cagggtggac atcagcgcgt gagcggtcgt 327841 catcgtgagg ttgctcagca gggcgtcgtc gctgacgtcg accgcgttgg cccgaacatg 327901 cgcgatgggc cgcaacggat ccaggctctt gaccgcttcg atctcgagct gatcgttgcg 327961 cagtacaacc cactcgtcac ggctttcgat ggccaccggt agcaccagaa cggaggcttg 328021 cgccgcggcc ggaaccgcgc ggacttcgcc ccggatcacc agcacgtcgc catggcgggt 328081 ggcggtcagc ccggaatcta gcgcgtaggc ggcgatggcc gcaccggttg ccagttcggc 328141 gaggactttg gcttgcggat catgggctgc gatcagcgcg ctggcgatcg ccgacggcac 328201 gaacggcccg ggcacggcgc cgtagccgaa ctcggcaagc accaccgcta gctcgaggat 328261 gccgaaaccc tggccgccga ccgactcggc cagatgcaca ccctgcaagc cttgttcggc 328321 cgcggcctgc cagtaaggcg gcgggttttc gaccggtgat tctagcgccg cgtgcagcac 328381 ctcggacggc gctacccgcg ccaccaggga acgcaccgaa tcggccagct cataatgctc 328441 aggagtaata gcgatcgaca ttgctcgcct tcccatgctg ttggacgttt cggccaagca 328501 ccttccaagc taacaaccgg tgggtcggtt attaacgttg gctagcggat ggccggcgaa 328561 atgggtgaga acactcagcg ccaccgcttg gctatccact tggcgatggt gtcggcctgc 328621 tcgctgcggg cgcccggggt ggtgaagtaa tggtcggtgt cgatcgagac ctgagtcttg 328681 tcgctgctgg cgagcccgtc gtagatctgc tgggcatccg acgggaagat tccggtgtcg 328741 gcctcggcgt tgagcaccag ggccgggcag gtgatccggg ccaggtgggg tgcggcacgg 328801 gtttgggcca cccgcaggct ccacatgccc agccagccgc gcagcgtgca ggccgcggcg 328861 atgccgtgtg cggagcggtt cgccttcacc ggcgtgcccg cgtagcactg gttgggccga 328921 cgcttggtcg gttcgatgct gggatcgacc atgcgcgggt cggcccaggt acgcatcacg 328981 ctgaacggcc gatcagaaaa gccagctgcg cgaacacgtt tgagttcgga ttcggcccag 329041 tcggtgatgg tgtggttgcg tttgacctgc gcggagcgat accggctgat aaactccggt 329101 gagtacggcg gcccgttgcg ttcgtcgaac aggtcaagtt cgggatcggt tgcaaccgga 329161 tcattttcgt caatgacggc ggcgtccatc caagcggtga gcacatccgg acggccggga 329221 tgagctgcgg cggcaacgta tgcgtcggcg gccggcaatt cggttacccc ggctgcgggt 329281 cgcataccgt ccaggggagt cacgttcgga tcgaccgctt gtgattggta ggcggccatc 329341 aatgagccac cacctgaatt gccaagcaac accactgttt ccacgccctg aacttcgcgg 329401 agccagcgca ccccgacgcc gatgtcgacc agtgcgtgat cgagcagaaa gctgctttcg 329461 aaaccacgga atcgggtgtt ccagcccaga aacccgatgc cgcggatcgc catgtactcg 329521 gcgagatagt gctcggagaa atcgatctgg tagtgcgcgg cgatgagcgc caccttcggt 329581 ttgcgtccca cgctgtggtg gtacagcccc tggcatgggt gcccaccggc agccgcacgc 329641 cccgcggttc gcgacggcag cccgacgaac tctcggatga ccccgggcgt ggcagcacga 329701 ccagtcaatt tgagctgtcc tccttactgt agatggcgcg gtagtaaatg ttggccaagg 329761 tctggatgca cgcctgatcg tcaggttgtc cacggcgtga acgcttaccg ctgagctgca 329821 ggtagcaaaa ctggttgaac attgccacaa tcgcctcggc catcaactgg gggtcatcgc 329881 cgacgcaata gccgtgcgcc tgagcgcgtt tgaccgtctc ggtgatgaac gatattggaa 329941 tctggcatat ttcggaccag tattgcgcga agtcgtcact gaccatcgcc aactgtgaca 330001 cgctgatcgc ttctgcgagg cggttgcggt aggtgtacca atgggcggca gcggcttcat 330061 acgcgcgctc gcggtcggat aggccgtgcc ggatcaccga caatgcccgc tggttggcgt 330121 cgtcgcggaa gcgcagcgcc cactgccgga ccatcgcctc tttggagtcg tagtagttgt 330181 aaaaggatgc cgccgagcgg ccggcttcgg cggtgatgtc ggcgacggtg gtcgccagga 330241 ttccgttgcg caccacgacc gtccgcgcgg cggcgtcgat tgcggcctgg gtccgccgac 330301 cgcgttgcgt cgggaagtcc ggcacctggg cacctccctg gaacaaaact gaacctgatg 330361 ttagattcag attcagagct tggccaggcc gccgtcccgg ggagccaatg ggagccgcac 330421 gatgatcaag ccgcacaaca ccaacaccga attcgagctt ggtgggatca accacgtcgc 330481 gctggtgtgt tcggacatgg cgcgcaccgt ggacttctac agcaacatcc tggggatgcc 330541 gctgatcaag gcgctcgatc tgcccggcgg ccaagggcag cacttcttct ttgacgccgg 330601 caacggcgat tgtgtcgcct tcttctggtt cgccgatgca cctgatcggg tgcccggtct 330661 ttcgtcgccg gttgccatcc ccggcatcgg cgacatcacc agcgcggtga gcaccatgaa 330721 ccatctggcg tttcatgtac ccgccgaaag gttcgacgcc taccggcagc ggctcaagga 330781 caaaggcgtg cgggtcggcc cggtgctcaa ccatgacgac agcgagacgc aggtgtccgc 330841 ggtggtgcat cccggtgtgt acgtacgctc gttctacttc caggaccccg atgggataac 330901 tctggaattc gcttgctgga caaaggaatt cactacgagc gacgcgcagg ccgtgccgaa 330961 gacggcggct gaccggcgac ctccggtggc tgcggatcgt tagccccgga tttggcagct 331021 gttgccgcta cccggggacg ggacaagttt gggtcggtga gttcatcgag cagcgcagct 331081 agctgatcga ccagctggtc gggatcgagt cgcacgtcac cggccagcca ggcgctgatg 331141 gtctgcccga cgccgccgac ggcgaagtgt gcgaccgcct tgacgtggtc atttgccggt 331201 gcgtgcaggg tgtcgacggc atgttggccg gacagcatgg cgaacagggc gctggattcc 331261 gcacgcttgc gggtgatcac tgcgttggcc agctgtgtgc tgaacagcag gcgtccgacg 331321 cgggcgtctg cggtgatggt ccgcacgatg ttggccatgc ccgcgcgagt ctgctcccgc 331381 gccggtaccg ccgtgaccgc ggcctgagtg gtggcgacca gctcggccac cacccagtcg 331441 aacacgcggc cgacgaattc gtccttgtcg gtgaagcttt cgtagaagta gcgcaccgac 331501 aggccggccc gccggcaaat ggtgcggatg gttagctcgg cgatgtcgtg ctggtcggac 331561 cccaacaggt ccaggccggc agagagcgac tggcgacggc gcgtcgccag tcgctcggcg 331621 gcctcgacgc cgcggtaggg tcgatcactg cgcgtcatac ggatcatctt gacactcggg 331681 cacgataccg gccaatatca ggatacaggt gtttccataa ttagcggcag cgccgggagg 331741 ccttcggatg gcgatttcgc tggtggctca ccagcccatc ccccacgtcg agcgtcccat 331801 ggccgaccca ccccgtctcc agctggccag gcgccggcga tcggcggccg gccccggcgg 331861 taacgaggac agcttgatgg gagtggcgct gctagccggc ccggccaacg tgatcatgga 331921 gttggcgatg ccgggtgtcg gctacggcgt gttggagagc cgtgtcgaaa gcggccggct 331981 ggaccgccat ccgatcaagc gggcgcgcac cacctttacc tacgttgcgg tggccgttgc 332041 cggcagcgac gaccagaagg cggcctttcg tcgcgcggtg aataaggttc acgcgcaggt 332101 gtattcgact ccggagagcc cggtgtccta ccacgcgttc gatcccgaac tacagctgtg 332161 ggtggcggca tgcctctata agggcggcgt cgacgtctac cgcaccttcg tcggcgagat 332221 ggacgacgaa gaggccgacc atcattaccg cgcgggcatg gcgatgggca ccacgttgca 332281 ggtgccgccg cagatgtggc caccggatcg ggcggccttc gaccgctact ggcggcaatc 332341 actggacagg gtgcacatcg atgacgtcgt tcgcgactac ctgtatccga tcgtggcgct 332401 ccgaattcgc gggatcgcac tgccgggtcc gctgcggcgg ctgtcggagg gtatcgcgct 332461 gctgatcacc accggtttcc tgccgcagcg gtttcgcgac gagatgcggt tgccgtggga 332521 cgcgaccaag cagcggcgct ttgacgcgct catggccgtg ctgcgcacgg tgaatcgcct 332581 gatgccgcgg tttgtccggg agttcccgtt caacctgatg ctctgggacc tggaccggcg 332641 gatgaggcgc gggcgcccgc tggtgtaatc gccggcttcg cgtggaccgt tgccggtaga 332701 ccgctcgcta gattggcggg cgaatatggc gcacagaggc aaaccgggcg aaatccctat 332761 ccaggctcac cacggcgcag tgatgctcca cggcgatggc cccgagtacc gcgtcaggta 332821 tcaagtcgcc cgatgcgtcg gcctcgtcgc agagttttcg cagcagcacc aggtgtctgg 332881 ggccggggct tgtcggaagg tgatggggct gggcgttgac ggcttcgacg aatgcgaatg 332941 catccgctcg tggtgacgga atctcgaaga tgcgtcgatt cgttgttagc cggaggaacg 333001 acgcccacac taggttcggc actgtgaagg ggtcgtcggc cgcaagcagt cgatcgaacc 333061 aggggcggac ggttcggtga ttcggatggt caccgcggtg tgcagccagc agcacgttga 333121 cgtcgatgag gaacatcgcc tatttgtgcc tgtccaggct cacttccgcg agttcagttc 333181 cagaccctcg tcgagcactt cggacaacac cgtattcgag gttaggtcga tacctggccg 333241 cggaccggtg ccggcgtcaa aaacggggac ggttggccgg gcgccgccgg tacgggcggc 333301 ggcgagctcc cgccgaaggg cgtcttcgat cacagcgccc agcgattaac cacgctcgcg 333361 ggcccggcgt ttggcggtag ccagtagttc atccgagatt gacacggtgg tgcgcatgat 333421 gctcaggata gcgcatctac ggcatcatct gcggtgagca actgatgccc tcaacgccgc 333481 gtgtggtcgc aggtctgcct gctatggcaa gccgttgagt ccgttctcgc cgagcagcag 333541 cccgccggtg ccgccggcac cgggcgtggc cccggctttg ccggcgttgc cgccgttgcc 333601 gccgttgccg atcagcacgg cgttgccgcc gacaccaccg ctgccgccgg taccggcgcc 333661 aaacccgccg gcaacccccg tcaccgccgt tgccgaacac cccggcgtgg ccaccgtcac 333721 cgccggtgcc gccggtaccg gcgcctagag cgttggcacc gctgccgccg gcgccgccgg 333781 cgccggcgga gccgaagagc aagccgccgt tcccgccggc gccgccggcg ccgccttgct 333841 ggatgctggt aagtgctgcc ccgccgtgcc cgccggcgcc gccggcgccg cggaagccga 333901 agagtaaggc gccgttcccg ccggttccgc cggccccgcc ggcaagggag ctggcgccac 333961 cgctgccgcc ggcgccaccg gaggcgccga gggagagtag gccggcgttg ccgccgtgcc 334021 cgccgccgcc ggtggtgatc ccggaccctc ccgagccggc ggcgccgccg gtgccgccgg 334081 ctccgaacag tccgccgttc ccgccgttcc caccggcccc gaagttcgtg ccggccccgc 334141 cggtgccgcc agttccgaac agtccgccgt tcccgccgtt cccgccggct gcgttgaacc 334201 cgccggcccc tccggctccg ccgttggcga acagtccgcc gttgccgccg gcgccgccga 334261 cgccggccgg gacaccgcca gcggcgccgt ggccgccggt gccggccgcg ccgaagagca 334321 aaccggcgtc gccgccgcgc ccgccggccc cgccgatgcc agcgacgcct atggagttcc 334381 caccgttgcc gccggtgccg ccggagccga tcagcaagga gaccccaccg gcgccgccgg 334441 ccccgccgat ccctccagca ccggtggcta tcccgccggt cccgccattg ccaccggtac 334501 cgaacaagat cccgccggcc ccgccggccc cgcccgtagc cgtggcggcg gtgttggtcg 334561 caccgtgccc gccgttaccg ccgttgccga acaaccaccc gccggccccg ccggcagccc 334621 cggtccccgg ggtcccgttg gcgccgttgc cgaacagcca cccgccggcc ccgccgtcag 334681 ccccggttcc aggagtcccg ttggcgccgt tgccgatcag cgggcggccg gtgagcgtct 334741 ggaagggctc gttcaccaca ttgagcacat tttgctgcag ggtgtgcagt ggcgaggtgc 334801 tcgcgggagc attgaatccg tctagaccga gcagcagccc gctgacgccg cccactccgg 334861 ccttgcccgc gccaatccca ccgctaccgc cgttaccgcc attgccgatc aacacgccgg 334921 tgccgccgat cccgccgttg ccgccggtca ccgcgctggc gccaccgtta ccgccgttgc 334981 cgccgttacc gatcagcccg ggggtgccgc cagccccacc gatcccgccg gcgaagccct 335041 ggccaactcc gccgttgccg ccggcgccgc cggagccgaa gaccgtgccg gcgttgcccc 335101 cggggccgcc ttgcccgccg tcggcgaagc cgaatccgcc ggcgccgccg gagccgccgg 335161 agccgaagag cagcccagcg ttgccgccgg cgccgccggc gccgcctatg ccgccggccg 335221 tgagagtacc gccgtcccca ccgattccgc cggcgccgcc cgcggcgccg agggcgagca 335281 tgccggcatt gccgccggcc ccgccgtccc cgccggcgac caggctgtgt ccgccgctgc 335341 cgccttcccc gcctgcgccg aacagcccgc cggccccgcc ggccccgccg actccgccga 335401 agctgctgtc ggcgaacccg ccatgcccgc cggtgccgcc ggcgccgaac agcccgccag 335461 cgccaccggc cccaccggcc ccgccggagc tgccggcccc accggatccg ccgaccccgc 335521 cggtggcgaa cagcccgccg gccccgccgg cgccgcccgc cccgccgagt gcactgccgt 335581 tcgtgaatcc gccggccccg ccgactccgg cggcgccgaa gagcaggccg gcgttgccgc 335641 cagccccgcc ggcgccgccg gccccgcccg tgagggctac tacgccgccg ccggcgccgc 335701 cggcgccgcc ggcgccgaac agcatggcgt tgccgccggc tccgccggac ccgccgatcc 335761 cactgctggc gaccccgcca gcgccgccgg cgccgccgtt gccgatgagc ccgccggcgc 335821 cgccgttgcc gccggcgccg ccgttgccgc cggcgccgcc gttgacgccg gccgcgccgg 335881 atcctccggc gccgccgttg ccgattaacc agccgccgtc cccgccattg gccccggtgc 335941 cgggggcgcc gttggcgccg ttgccgatca acgggcgccc ggtattcgcc aggaagaact 336001 cgttgatcgg atccagcagc ggcgacaccg cggcggcctc ggcggccgca taggcgccgc 336061 caccggaggt caatgcctgc acgaactggg catgaaacgc ctgcgcttgg gcgctgagcg 336121 cctgataggc ctggccgtgg gcgccgaaca gcgcggcgat ggctgtcgac acctcgtcgg 336181 cgcccgcggc catcagtgcc gtggtgttgg ccgccgcggc tgcgtttgcc gcgctgatgc 336241 tcgatccgag actggccaaa tccgttgccg ctgccgcgat aacctctggc gccgcaatca 336301 caaacgacat ctgacacctc ccaatacgca tgaccgctct gtcatgccga cccggggaac 336361 gtcaccagca aaaatcggca gtaagaagca tcccatttcc agcgacaaca cctggggggt 336421 tttggtcaaa ctctggtaag cgacttcgtg taccgggtga acccggtgtg tcttgaagga 336481 cagcccgcag gctgatgctg ggggatctgg gccggccgac catggctggc cggctgttgg 336541 tctgatggcc ggttcgcggt tacaggccgt tgagcccgtt ctcgccgatg atcagcccgc 336601 tggtgccgcc ggcgccgggt gtgccgccgg ctttcccgcc gttgccgccg ttgccgccgt 336661 tgccgatcag cacggcgttg gggaccgagc tcgaattccc accggtgtca gcgccaaacc 336721 cgccggcgcc gccgtcgccg ccgttgccga acaccccggc cgtaccgccg tcaccgccgg 336781 tgccgccgct gctgccgatg ccgctggagc caccggtgcc gccggcaccg ccgaagccga 336841 agagcgagcc gccactgccg ccgttcccgc cgaccccgcc ggtcccgccg acatttaagg 336901 cgctgccgcc gctgccgccg gcgccgccgg aggcgccgag ggcgagtagg ccggcgttgc 336961 cgccgctgcc gccgttgccg ccgaaggtgc cgccgctgct gccgccagca ccgccagtgc 337021 cgccggcgcc gaacagcccg ccgtgccccc cggcgccgcc gtcggcgccg agcgtgcccg 337081 ccccgccggt gccgccggcg ccgaagagca atccgttccc cccggtcccg ccattcgcgc 337141 caaacccgcc ggccccgccg gccccgccgt tggcgaacag cccaccggta ccaccggctc 337201 cggcggtgcc gccggcaccg ataaagtttt gggagagggc ggcctggccg ccggtccctg 337261 cggcaccgag gaacaagccg gcgtcaccgc cgcgcccgcc ggccccgccg gtgtccaggc 337321 caaacccgcc gctgccgccg gtgccgccgg agccgatcag caaggcggct ccgccggtcc 337381 cgccggtccc gccttggccc gtcgttccga tgccgccgga cccgccggtg ccgccaatac 337441 ctgacaggat tccgccggcc ccgccggatc cgccgtctcc gccgtcggcg ccggtcgctc 337501 cgtggccgcc gttgccgccg ttgccgaaca accacccgcc ggccccaccg tcggccccgg 337561 tccccggagt gccgttggcg ccgttgccga tcagcggtcg cccggtgagg gcttgggtgg 337621 gctcgttgat cgcgttgagg atttgttgct gcagggtgtg cagtggcgtg ctggcggggg 337681 cgttgaatcc gtctcgacct agtagctgcc cgcctaagcc gccggcgccg gccgtgccgg 337741 cgggtgcgcc agtgccgcca ctaccaccgt taccgccatt gccgatcagc acgccgcttc 337801 cgccggcgcc gccggcggcg ccggcgccgt tcgcgctggc gccgccgttg ccgccgttgc 337861 cggcgttgcc gacgagcccg ggcgcgccgc cggccccacc ggttccgccg gcgcccgcga 337921 aggacccgcc gccggcgccg ccggcaccgc ccgccccgat gagcagaccc gcctttccgc 337981 cggcgccgcc cgccccgccg gcgtcgaagc ccagcccgca gacgccgccg gcgccgccgg 338041 agccgaacaa cgtgccgccg tcgcctccga tcccaccggc accgccgcca ccgtccgggt 338101 tggatccgcc gctgccgccg gcgccgccgg cggcaccgag gctgagcatg ccggcgtcgc 338161 cgccggcccc accgttcccg ccgacgttga ttatgctcgt cccgccacta ccgccggtgc 338221 cgccggcgcc gaacagcccg ccagagccgc catccccgcc ggcgccgccg ccaaagatgc 338281 cgaatccgcc gggcccaccg gtgccgccgg cgccgaacag cccgccgttt ccgccggatc 338341 cgccggcccc gccggtgccg gcgtcggttg ccccgccggc gccaccgacc ccgccgtcgg 338401 cgaacagccc gccgtttccg ccggccccgc cggcgccgcc ggtggcgccg aaagcggctg 338461 cgaatccgcc gggaccgccg accccggcgg ccccgaacag catgccggcg gccccgccgg 338521 cgccgccggc gccgccggtc cccgtgctgg ccctcccgcc ggcgccgccg gcgccgccgt 338581 tgccgatgag cccgccggcg ccgccgttgc cgccggcccc gccgttgacg ccggccgcgc 338641 cggatcctcc ggcgccgccg ttgccgatta accagccgcc gtccccgcca ttggccccgg 338701 tgccgggggc gccgttggtg ccgttgccga tcagcgggcg cccggtattc gccaggaaga 338761 actcgttgat cggggcgagc agcggcgagg tggcggcggc ctcggcggcg gcgtacgcgc 338821 cgccaccgga ggtcaacgcc tgcacgaact gggcatgaaa cgcctgcgct tgggcgctga 338881 gcgcctgata ggcctggccg tgggcgccga acagcgccgc aaccgccgtc gagacttcat 338941 cggcacccgc ggccagcagt gctgtggtgt tggccgccgc ggccgcgttg gccgcggcga 339001 tgctcgactc gagactggct aaatccgttg ccgctgccgc gataacctct ggcgccgcaa 339061 tcacaaacga catctgacac ctcccaatac gcatgaccgc tctgtcatgc cgacccgggg 339121 aacgtcacca gcaaaaatcg gcgggctaca gaataactcc ggcccgggaa agggatttgg 339181 tatttcccaa aatatctccc acatttatgc ggtcggcgcg tcggccgacg ggagctggca 339241 gcacccgtgg gccggcgccg agcgttcgct ggtgtccggc tgggacttgc attgcggcgc 339301 gccgtggtgt ggaatagtgg taatgaaaat catgttcatc agtcctctgt ggtgtttacg 339361 gctatgacgc tgtggatggc ctcgccgccc gaggtgcatt cggcgttgct cagcagcggg 339421 ccggggccgg gctcggtgtt gtcggcggcc ggggtgtggt cgtcgctgag cgccgaatac 339481 gccgcggtcg ccgacgagct catagggctg ctgggcgccg tgcagaccgg cgcttggcag 339541 gggcccagcg ccgcggctta tgtggccgcc cacgcgccgt acctcgcgtg gttaatgcgg 339601 gccagcgaaa ccagcgcgga agcggccgcc cggcacgaga ccgtggccgc ggcctacacg 339661 accgcggtgg cggccatgcc gacgttggtc gagctggccg ccaaccacac gcttcacggg 339721 gtcttggtgg cgacgaactt cttcggcatc aacaccatcc cgatcgcgct caacgaggcc 339781 gactacgcgc ggatgtggac gcaggccgcc agcacgatgg cgacctatca agcggtcgcc 339841 gaggccgcgg tggcgtcggc accgcagacc accccggcgc cgccgatctt ggcagccgaa 339901 gcggccgacg atgaccacga tcatgaccac gatcacgggg gcgaaccgac cccgctggac 339961 tatctggtcg cggagatatt gcgcatcatc agcggtgggc gcctgatctg ggatcccgcc 340021 gagggcacca tgaacggaat cccgttcgaa gattatacgg acgcagccca accaatctgg 340081 tgggttgttc gtgccatcga attcagtaag gactttgaaa cgtttgttca ggaactgttt 340141 gtcaatccgg tggaggcatt tcagttctac tttgagcttc tattgttcga ctacccgacc 340201 cacattgtgc agattgttga ggcgttgagc cagtccccgc agttgctggc ggtcgcactc 340261 ggttccgtca tctccaactt gggtgcggtg accgggttcg ccgggctatc cggcttggcc 340321 ggcatgcagc cggcggctat cccggcgcta gcacccgtcg cggcggcccc gtcgacattg 340381 ccggcggtcg cgatggcccc gaccatggcc gcgccgggcg cggcggttgc gtcggcagcc 340441 gcgccggcgt ccgcgccggc ggccagcacg gtggccagcg ccacgccggc accgccgccg 340501 gcacccggcg ccgccgggtt cggctatccc tacgccatcg ctccgcccgg catcgggttc 340561 ggctcgggga tgagcgccag cgccagcgct caacgcaagg caccacagcc cgatagtgcg 340621 gcggcggcgg cggccgcggc ggccgtacgt gaccaagcgc gggcgcggcg gcggcgccgt 340681 gtcacgcggc gcggatacgg cgacgagttt atggatatga acatcgacgt cgatccggac 340741 tggggccctc cgcccggcga agacccagtc acatccacgg tggcctcgga tcggggtgcc 340801 ggacatctgg gctttgccgg gacggcccgc agggaggcgg ttgccgacgc ggccgggatg 340861 accacgctgg ctggcgatga tttcggcgac gggccaacga cgccaatggt gccgggttcg 340921 tgggatccgg accgggatgc gcctggctcg gcggagcctg gagatcgggg ctgagctagc 340981 cgcgtagggt cgattgggtg cgtaccgaag gtgatagctg ggacatcaca acgagtgtcg 341041 gttcgaccgc gctgtttgtc gcgacggcgc gagcgctgga agcccagaag tccgacccgc 341101 tggtcgtcga cccatatgcg gaggcgttct gccgtgccgt cggcggttcg tgggccgatg 341161 tgctcgacgg caagcttccc gaccacaagt tgaagagcac cgatttcggc gagcacttcg 341221 tcaacttcca gggtgcccgc accaagtatt tcgacgagta tttccgtcgg gccgccgccg 341281 ccggcgcgcg gcaggtggtc atcctggcgg cggggctgga ctcgcgcgcg taccggctgc 341341 cttggcccga cgggaccacg gtttttgagc tggaccgccc gcaggtcctt gatttcaagc 341401 gcgaggtgct cgccagccac ggtgcccaac cgcgcgccct gcgccgcgag atcgccgtcg 341461 acctgcgtga cgattggcca caagccttgc gggacagtgg tttcgatgcg gctgcaccgt 341521 cggcatggat tgccgaaggg ctgctgatct atctcccggc caccgcccag gagcggctat 341581 tcaccggcat cgatgccctg gccgggcgcc gaagccacgt cgccgtcgag gatggtgccc 341641 caatggggcc agacgaatat gcggctaagg tcgaagagga gcgcgccgcg atcgccgagg 341701 gagccgagga gcacccgttt tttcaactgg tctacaacga gcgatgcgcg ccggccgccg 341761 agtggttcgg cgagcgaggt tggaccgcgg tcgctacgct gttgaacgac tacctcgaag 341821 cggtgggtcg cccggtaccc ggaccggaat ccgaagccgg gccgatgttc gcccgcaaca 341881 ccctggtcag tgccgcccgc gtctgacggc gcaccgttcg cgctgccggc accccgggct 341941 ccataatgaa aatcatgttc agtaagctac actctgcata tcgggctacc aacgaaatgg 342001 agtatcggtc atgatcttgc cagccgtgcc taaaagcttg gccgcagggc cgagtcgatt 342061 ggtcgcggtc gcctcgacag ttagcttatg caatgctaac ttcggggcaa agttcaggcg 342121 gatcggccga tggcgggcgt aggtgaagga gacagcggag gcgtggagcg tgatgacatt 342181 ggcatggtgg ccgcttcccc cgtcgcgtct cgggtaaatg gcaaggtaga cgctgacgtc 342241 gtcggtcgat ttgccacctg ctgccgtgcc ctgggcatcg cggtttacca gcgtaaacgt 342301 ccgccggacc tggctgccgc ccggtctggt ttcgccgcgc tgacccgcgt cgcccatgac 342361 cagtgcgacg cctggaccgg gctggccgct gccggcgacc agtccatcgg ggtgctggaa 342421 gccgcctcgc gcacggcgac cacggctggt gtgttgcagc ggcaggtgga actggccgat 342481 aacgccttgg gcttcctgta cgacaccggg ctgtacctgc gttttcgtgc caccggacct 342541 gacgatttcc acctcgcgta tgccgctgcg ttggcttcga cgggcgggcc ggaggagttt 342601 gccaaggcca atcacgtggt gtccggtatc accgagcgcc gcgccggctg gcgtgccgcc 342661 cgttggctcg ccgtggtcat caactaccgc gccgagcgct ggtcggatgt cgtgaagctg 342721 ctcactccga tggttaatga tcccgacctc gacgaggcct tttcgcacgc ggccaagatc 342781 accctgggca ccgcactggc ccgactgggc atgtttgccc cggcgctgtc ttatctggag 342841 gaacccgacg gtcctgtcgc ggtcgctgct gtcgacggtg cactggccaa agcgctggtg 342901 ctgcgcgcgc atgtggatga ggagtcggcc agcgaagtgc tgcaggactt gtatgcggct 342961 caccccgaaa acgaacaggt cgagcaggcg ctgtcggata ccagcttcgg gatcgtcacc 343021 accacagccg ggcggatcga ggcccgcacc gatccgtggg atccggcgac cgagcccggc 343081 gcggaggatt tcgtcgatcc cgcggcccac gaacgcaagg ccgcgctgct gcacgaggcc 343141 gaactccaac tcgccgagtt catcggcctc gacgaggtca aacgccaggt gtcgcggctg 343201 aagagctcag tggccatgga actggtccgc aagcagcgtg ggctcacggt cgcccaacgc 343261 acgcaccact tggtgtttgc gggaccgccc gggaccggca agaccaccat tgcccgggtg 343321 gtcgccaaga tctattgcgg ccttggcttg ttgaagcggg agaacatccg cgaggtccat 343381 cgcgccgacc tcatcggcca acacatcggc gagaccgagg cgaaaaccaa cgcgatcatc 343441 gacagcgcgc tggacggggt gctgttcctc gacgaggcct acgccctggt ggccaccggc 343501 gccaagaacg acttcgggtt ggtggccatt gacaccttgt tggccaggat ggaaaacgac 343561 cgcgaccggc tggtggtcat catcgccggc tatcgcgccg acctggacaa attcctggac 343621 accaacgagg gacttcggtc gcgtttcacc cgcaacatcg actttccctc ctacacgtcc 343681 catgagctgg tggagatcgc gcacaagatg gccgaacagc gagacagcgt cttcgaacag 343741 tccgcgctgc acgatttgga ggcgttgttc gccaagttgg cggcggagtc gacaccagat 343801 accaacggaa tctcgcgacg tagcctcgac atcgcgggca atggtcggtt tgtgcgcaac 343861 atcgtcgaac gctccgaaga agagcgtgaa ttccggctgg accattccga acatgccgga 343921 tccggtgagt tcagcgacga ggagctgatg accatcacgg ccgacgacgt gggtagatcg 343981 gtagagccgc tattgcgtgg cctcgggctc tcggtgcggg catgacgaac cagcagcacg 344041 accacgactt cgaccacgac cgtcgctcgt tcgcctcccg aaccccggtc aacaacaacc 344101 ccgacaaggt tgtctaccgc cgcggcttcg tcacccgcca tcaggtgacg ggctggcggt 344161 tcgtgatgcg ccgaatcgcc gccggaatcg cattgcacga cacccgcatg ctggtcgacc 344221 cgttgcgcac tcagtcacgc gcggtgctga tgggtgtgct gattgtgatc acggggttga 344281 tcggctcctt cgtattctcg ttgattcggc ccaatgggca ggcgggtagc aacgcggtgc 344341 ttgccgaccg gtccaccgcg gcgctgtatg tgcgggtggg cgagcagctg cacccggtgc 344401 tcaacctgac ctcggcccgg ctgatcgtcg gccggccggt gagcccgacg acggtgaaaa 344461 gtactgagtt ggaccagttt ccgcgcggaa acctgatcgg catcccgggt gcgccggagc 344521 ggatggtgca gaacacctcc accgacgcga actggacggt gtgtgacggc ctcaacgcac 344581 cgtcgcgggg cggtgcggat ggcgtgggtg tgacggtgat tgccggcccg ctggaggaca 344641 ccggcgcacg cgcggccgcg ctcgggcccg ggcaggcggt gctggtcgac agcggcgccg 344701 gcacctggct gttgtgggac ggcaagcgca gcccgattga tctggccgat catgcggtca 344761 ccagcggcct cggcctgggc gccgacgtgc ccgcgccgcg gatcatcgcc tcggggctgt 344821 tcaacgcgat acccgaagca ccgccactga cggcgccgat catcccggat gccggcaacc 344881 cggcgagctt cggtgtgccg gcgccgatcg gcgcggtggt gagttcctac gccctgaaag 344941 actcgggcaa gaccatatcg gacaccgtgc agtactacgc ggtgctgccg gacggtttgc 345001 agcagatttc gccggtattg gcggcaatcc tgcgcaacaa caactcctat ggtctgcagc 345061 agccgcctcg gctgggggcc gacgaggtcg ccaagctgcc ggtgtcgcgg gtgttggaca 345121 ccaggcgcta tcccagcgag ccggtaagtc tcgtcgacgt tacccgtgac cccgtcacct 345181 gcgcgtactg gagcaagccg gtgggtgcgg ccaccagctc gttgactctg ttggcaggct 345241 cggcgctgcc ggtgccagat gcggtgcaca ccgtcgagct ggtcggcgcc ggcaacggtg 345301 gtgtggcaac ccgagtggcg ttagcggccg gtactggcta cttcacccag acggtgggcg 345361 gcggcccaga tgcgccgggc gccgggtcgt tgttctgggt gtcggatacc ggggtgcgtt 345421 acggtatcga caatgagcct cagggagtgg ctggaggcgg caaagcggtt gaggcccttg 345481 gcctgaaccc gcccccggtc cccatcccgt ggtcggtgct gtcgctgttt gtgcccggcc 345541 cgacgctgtc gcgtgccgac gcgctgctgg cacacgacac cttggtgccc gacagcaggc 345601 ccgctcgtcc ggtatcggcc gagggagggt accggtgagc agactgatct ttgaggctcg 345661 tcgccgactg gcgccgccga gcagccacca gggcaccatc atcatcgagg cgcctcccga 345721 gctgcctcgg gtgatcccac cgtcactgct acgacgagcg ctgccttatc tgatcgggat 345781 cctcatcgtg gggatgatcg tggcgctggt cgccaccggg atgcgggtga tttctccgca 345841 gacgttgttc ttcccatttg tgctgctgtt ggcggccacc gcgctctacc gcggcaacga 345901 caagaagatg cgcaccgagg aggtcgacgc cgaacgggcc gactacctac gttacctatc 345961 ggtggtgcgg gacaacattc gggcccaggc cgccgagcag cgggccagcg cgttgtggtc 346021 tcatcctgac ccgacggcgt tggcgtcggt gccggggtca cgtcgccaat gggagcgtga 346081 cccgcacgac cccgactttt tggtgttgcg ggccggccgg cacacggtac cgctggctac 346141 tacgctgcga gtcaacgaca ccgccgacga gatcgacctg gaaccggtgt cgcacagtgc 346201 attacgcagc ctgctcgaca cccagcgcag cattggcgac gtgccgaccg ggatcgacct 346261 gaccaaggtt tcgccgatca ccgtgctggg ggagcgcgca caggtgcgcg cggtgttacg 346321 cgcctggatc gctcaggcgg tgacctggca cgacccgacg gtgctcgggg tggcgctggc 346381 cgcgcgtgat ctggagggtc gcgattggaa ctggctgaag tggttaccgc acgtggacat 346441 tcccggccgc ctcgatgcgc tgggcccggc ccgcaatctg tcgaccgatc ccgacgagct 346501 catcgcgctg ctggggcccg tcctggcaga ccgcccggcg tttaccgggc agccaacaga 346561 tgcgttgcgg cacttgctga tcgtcgtcga tgacccggac tacgacctgg gcgcatcgcc 346621 gctggcggtg ggccgcgcgg gtgtcaccgt cgtgcactgc tcggccagtg cgccgcaccg 346681 ggaacagtat tcggatccgg aaaagccgat cctgcgggtg gctcacggcg ctatcgaacg 346741 ctggcagaca ggcggctggc agccctacat cgacgccgcc gaccaattca gcgctgatga 346801 ggccgcccac ctggcgcgcc gactgtcgcg gtgggactcc aaccccaccc atgccgggct 346861 gcgctcggcg gccactcgcg gcgcgagttt caccacactg ctgggcatcg aggacgcatc 346921 ccgactggat gtgcccgcgc tgtgggcgcc gcgacgacgc gacgaggagt tacgcgtgcc 346981 gatcggtgtc actggcaccg gcgagccgct gatgttcgac ctcaaagacg aagccgaggg 347041 cgggatgggc ccgcacgggc tgatgatcgg catgaccggt tcgggcaagt cgcagacttt 347101 gatgtcgatt ctgttgtcgc tgttgaccac acactccgcg gagcggctca tcgtcatcta 347161 cgccgacttc aagggtgagg ccggcgccga cagtttccga gatttcccgc aggtggttgc 347221 ggtgatctcg aatatggccg agaagaagtc gttggctgat cggttcgccg acacgctgcg 347281 cggcgaggtg gctcgtcgcg agatgctgct gcgtgaggcc ggccgcaagg tccagggcag 347341 cgcgttcaac tcggtgctcg agtatgaaaa cgccatcgcc gcagggcata gcctgccgcc 347401 catcccgaca ctgttcgtgg tcgccgacga gttcaccttg atgctggccg atcacccgga 347461 atacgcggag ctgttcgact atgtggcccg caagggtcgc tcgtttcgca tccacatcct 347521 attcgcgtcc cagacactgg acgtgggcaa gatcaaagac atcgacaaga acaccgccta 347581 tcggattggg ctgaaagtgg ccagccccag cgtttctcgc cagatcatcg gcgtggagga 347641 cgcctaccac atcgagtcgg gcaaagaaca caaaggcgtg ggctttttgg tgcccgcgcc 347701 cggtgccacc ccgataaggt tccgcagcac ctatgtcgac gggatctatg aaccgccgca 347761 gacggctaaa gccgttgtcg tgcaatccgt tccggagccc aagctgttca ccgccgccgc 347821 ggtggaaccg gatccgggca cggtgatcgc cgatactgac gaacaagaac ccgccgaccc 347881 accacgcaaa ctgatcgcga ccatcggcga acaactggcc cgctacggtc cgcgggcgcc 347941 gcagttgtgg ctgccgccac tcgacgaaac gatcccactg agcgcggcgt tggcccgcgc 348001 cggggtgggc ccccggcagt ggcgctggcc gctgggggag atcgacaggc ccttcgagat 348061 gcggcgcgac ccgttggtgt ttgacgctag gtcgtcggcc ggaaatatgg tgatccacgg 348121 cggccccaag tccggcaaat ccactgcgct gcagacattc atcctctcag ctgctagcct 348181 gcactcgccg cacgaggtta gcttctattg cctggactac ggcggtgggc agctgcgggc 348241 gctacaggat ctagcgcacg tcggcagtgt cgcctcagcg ctggaacccg aacgcatccg 348301 ccgcaccttc ggcgagctcg agcaactgct gttgtcccgg cagcagcggg aagtattccg 348361 tgaccggggt gctaatggct cgacccccga cgacgggttc ggtgaggtgt tcctggtcat 348421 cgacaatctc tatggcttcg gccgcgataa caccgatcag ttcaacaccc gtaatccgtt 348481 gctggccagg gtaaccgaac tggtcaacgt gggccttgcc tacgggatcc acgtgatcat 348541 taccacgccg agctggctgg aagtgccgtt ggcgatgcgc gacgggctcg ggctgcgtct 348601 cgagctgcga ctgcacgacg cgcgcgacag caacgtgcgg gtggtcggcg ccctgcgccg 348661 cccggccgac gccgtcccgc acgaccagcc cggccgcgga ctgaccatgg ccgccgagca 348721 cttcctgttc gcggctccag aactggacgc gcaaacaaac ccggtggccg cgatcaacgc 348781 ccgctacccc ggcatggcgg ctcccccggt tcggttgttg cccaccaacc ttgcgccgca 348841 cgccgtcggc gaactgtatc ggggtcccga ccaactggtg attggccagc gcgaagaaga 348901 cctggcgccg gtgatactcg acctcgccgc caacccgctg ctgatggtgt tcggcgatgc 348961 caggtcagga aagacgacgc tgctgcgcca catcatccgc accgtccgcg agcactccac 349021 cgccgaccgg gtcgcgttca ccgtgctgga ccgccggcta cacctggtcg acgaaccact 349081 gttccccgac aacgagtaca ccgccaacat cgatcggatc atcccggcga tgctcgggct 349141 ggccaacctc atcgaggcgc gccggccgcc ggccgggatg tctgcggccg agctgtcccg 349201 ctggaccttt gccgggcaca cccactacct gatcatcgac gacgtcgacc aggtaccgga 349261 ttcgccggcg atgaccggtc cctacatcgg acagcggccg tggaccccgc tgatcggtct 349321 cctggcccag gccggcgact tggggctacg ggtgattgtc accgggcgtg ccactggatc 349381 ggcgcacctg ctgatgacaa gtccgttgct gcgccggttc aacgacctgc aggcgaccac 349441 gctgatgttg gcaggcaatc cggccgacag cggcaagatt cgcggtgagc ggtttgcccg 349501 attgcctgct ggacgagcaa ttctgttgac cgacagtgat agtccaacct acgtgcagtt 349561 gatcaacccg ctggtcgatg cggccgcggt ttctggtgaa acccaacaga aggggagtca 349621 gtcatgacgt tgcgagtggt tccggagggg ctggccgcag ccagcgctgc ggtggaagcg 349681 ctgacggcgc ggttggccgc cgcgcatgcg agcgcagcgc cggtgattac cgcggtagtg 349741 ccgccggcgg cggatccggt gtcgctgcag accgcggccg ggttcagtgc acagggcgtc 349801 gagcacgcgg tcgtcaccgc cgaaggtgtc gaagagctgg gacgcgccgg cgttggtgtg 349861 ggcgaatccg gcgccagcta cctggccggt gatgcggccg ccgccgctac gtacggggtc 349921 gtgggcggct gagcatggcc gcgcccatct ggatggcttc gccgccggag gtacattcgg 349981 cgttgcttag caatggtccg ggcccgggtt cgctagtggc ggctgccacg gcctggagcc 350041 agctgagtgc cgagtatgcc tcgacggcag cagaactcag tgggctactg ggggcggtac 350101 ctggttgggc atggcagggg cccagcgcgg agtggtacgt ggccgcgcat ttgccatatg 350161 tggcgtggct gacgcaggcc agtgcggatg ccgcaggagc agcggcccag cacgaggccg 350221 ccgcggcggc ctacaccact gccttggcag ccatgccgac attagcggag ttggccgcca 350281 accacgtgat tcacaccgtg ttggtggcga cgaatttctt tgggatcaac acgattccca 350341 tcacgctcaa tgaggccgat tacgtgcgca tgtggttgca ggcggccgcc gtcatgggtc 350401 tttatcaggc ggcttcgggt gcggcactgg cttcggcgcc gcgcaccgtc ccggcgccga 350461 cggttatgaa tccaggtggc ggtgcggcga gcactgtcgg ggcggtcaac ccctggcagt 350521 ggctcttagc gttgcttcaa cagctctgga acgcctacac gggtttctac gggtggatgt 350581 tgcagctcat ctggcagttc ctgcaggatc ccattggtaa ctcgatcaag atcatcatcg 350641 ccttcctcac gaatcccatt caggcactga tcacttacgg gccgctgttg ttcgcgctgg 350701 gctaccagat tttcttcaac ctggtcggct ggccgacctg gggcatgatc ttgagctcgc 350761 cgttcttgtt gccggccggg ctcgggctgg gcttggcagc aatagccttt ctacctattg 350821 tgcttgcgcc cgcggtgatt ccgccggcga gtactccgct ggctgctgcc gccgtcgccg 350881 ccgggtcggt gtggccggcg gtcagcatgg ccgtaacggg ggcgggcacc gctggggctg 350941 cgacgcccgc ggcgggcgcg gctccgtctg cgggcgcagc gccggccccg gcagctcccg 351001 cgaccgccag tttcgcctat gcggtgggtg gcagcggtga ttgggggccg agcttggggc 351061 cgacggtagg tggtcgcggt ggtatcaagg cgccggccgc tacggttccg gcggcggccg 351121 cggcggcggc aactcgtggg cagtcgcgcg cgcggcggcg ccggcggtct gaattgcggg 351181 actacggcga cgagttcttg gacatggatt ccgatagcgg tttcggcccc tcgacgggcg 351241 accacggcgc gcaggcctcc gaacgggggg ccgggacgct gggattcgcc gggaccgcaa 351301 ccaaagaacg ccgggtccgg gcggtcgggc tgaccgcact ggccggtgat gagttcggca 351361 acggcccccg gatgccgatg gtgccgggga cctgggagca gggcagcaac gagcccgagg 351421 cgcccgacgg atcggggaga gggggaggcg acggcttacc gcacgacagc aagtaaccga 351481 attccgaatc acgtggaccc gtacgggtcg aaaggagaga tgttatgagc cttttggatg 351541 ctcatatccc acagttggtg gcctcccagt cggcgtttgc cgccaaggcg gggctgatgc 351601 ggcacacgat cggtcaggcc gagcaggcgg cgatgtcggc tcaggcgttt caccaggggg 351661 agtcgtcggc ggcgtttcag gccgcccatg cccggtttgt ggcggcggcc gccaaagtca 351721 acaccttgtt ggatgtcgcg caggcgaatc tgggtgaggc cgccggtacc tatgtggccg 351781 ccgatgctgc ggccgcgtcg acctataccg ggttctgatc gaaccctgct gaccgagagg 351841 acttgtgatg tcgcaaatca tgtacaacta ccccgcgatg ttgggtcacg ccggggatat 351901 ggccggatat gccggcacgc tgcagagctt gggtgccgag atcgccgtgg agcaggccgc 351961 gttgcagagt gcgtggcagg gcgataccgg gatcacgtat caggcgtggc aggcacagtg 352021 gaaccaggcc atggaagatt tggtgcgggc ctatcatgcg atgtccagca cccatgaagc 352081 caacaccatg gcgatgatgg cccgcgacac ggccgaagcc gccaaatggg gcggctagct 352141 cgcgctacat ggatgcaaca cccaacgccg tcgagctgac ggtcgacaac gcttggttca 352201 tcgctgaaac cattggggcg gggacctttc cgtgggtgct ggcgatcacg atgccctata 352261 gtgatgccgc ccagcggggt gcgttcgtcg accgtcagcg cgacgagctg acccggatgg 352321 ggctgttatc gccgcagggt gttatcaacc ctgcggtcgc cgactggatc aaagtggtgt 352381 gcttcccgga ccgctggctt gacctgcgtt atgtggggcc ggcctcggcc gacggcgcct 352441 gcgagctgct acgtggcatc gtcgcgctgc gcaccggcac cggtaagacc tccaacaaga 352501 ccggaaacgg tgttgttgcg ctgcgtaatg cgcagctggt cacgttcacc gcgatggata 352561 tcgacgaccc ccgggcgctg gttccgattc ttggtgtcgg tttggcgcac cggccgccgg 352621 cgcggttcga cgagttcagc ttgccgacgc gggtgggcgc gcgggccgac gaacggctgc 352681 ggtccggcgt gccactcggg gaagtcgttg actatctggg tattccggcg tccgcacggc 352741 cggtggtgga gtccgtcttc tcggggccgc gcagctacgt cgagatcgtc gccgggtgca 352801 accgtgacgg ccggcacacc accaccgagg tcggcctaag catcgtcgac acctcggcgg 352861 gccgggtgtt ggtgagtccg tcgcgggcat tcgacggcga gtgggtctcc accttcagcc 352921 ctgggacacc gtttgcgatc gccgtcgcga tccaaacact gaccgcgtgc ttgccagacg 352981 ggcaatggtt cccgggacag cgggtgtcgc gggacttctc cacccaatcc tcgtaatcag 353041 aaaccagaaa gtgagcacga tgtcccagga acggtcccgc tgatgtccgg caccgtcatg 353101 cagatcgtcc gcgtcgccat tcttgcggac agcaggttga ccgagatggc cctgcccgcg 353161 gagttgccac tgcgcgaaat cctgcccgcg gtacaacgct tggtggttcc ctcggcgcaa 353221 aacggcgatg gtggccaagc cgactccggc gctgccgtgc aactgagttt ggcgcccgtc 353281 ggcgggcagc cgtttagctt ggatgccagc ctggacaccg tcggtgtcgt cgacggtgat 353341 ctgttggtgt tgcagccggt gcccgccggt ccggccgcgc cgggcatcgt cgaagacatc 353401 gccgacgccg cgatgatctt ttcgacgtcg cggttaaagc cctggggcat agcgcatatc 353461 caacgaggag cgctggccgc ggtgattgcc gtggctctgc tggctaccgg tttgacggtg 353521 acctatcggg ttgccaccgg tgtgctggcc gggctgctgg cggtggccgg gatcgcggtg 353581 gctagcgcgc tggccggatt gttgatcacc atccgttcgc cacgttcggg tatcgcgctg 353641 tcgatcgccg cgctggtccc catcggcgcg gccctggcgt tggcggtgcc aggaaagttc 353701 gggccggcgc aggtattgct gggtgcagct ggggtagccg catggtcgct gatcgcgctg 353761 atgattccca gcgccgaacg ggaacgcgtc gtcgccttct tcaccgcagc ggcggtggtc 353821 ggggcgtcgg tggcgctggc ggccggtgcg caattgctgt ggcagctgcc gttgttgagc 353881 atcggctgcg ggctgattgt ggcggcgctg ttggtcacca tccaggcggc tcagctttcc 353941 gcactgtggg cgcggttccc gttgccggtg atcccggcgc cgggggatcc caccccgtcg 354001 gccccgccgt tgcgcctgct ggaggatttg cctcggcggg tgcgggtcag tgacgcccat 354061 caaagcggct tcatcgccgc ggccgtgctg ctcagcgtgt tggggtcggt ggccatcgcg 354121 gtgcgcccag aggcgctcag cgttgtgggc tggtatctgg tggcggcgac tgcggccgcg 354181 gccaccctgc gcgcgcgggt gtgggattcg gccgcatgca aggcgtggct gctggctcag 354241 ccctatctgg tagccggggt cctgttggtg ttctacaccg cgaccggacg ctatgtcgcc 354301 gcgttcggcg cggtgctggt gctagccgtg ctcatgctgg cctgggttgt ggtggcactg 354361 aacccgggca tcgcttcgcc ggagagctac tcgctgccgc tgcgccggct gctgggtttg 354421 gtcgccgccg ggctggatgt ttcgctgatc cccgtcatgg cctacctggt cggattgttc 354481 gcttgggtgc tcaacagatg atccgtgccg catttgcgtg tctggcggcg accgtggtcg 354541 ttgcggggtg gtggacgccg ccggcgtggg cgatcgggcc gccggtggtg gacgccgccg 354601 cgcaaccgcc cagcggagac ccgggaccgg tggcgccgat ggaacaacgc ggtgcgtgca 354661 gcgtctccgg tgttatcccg ggcaccgatc caggcgtacc gacgcccagc caaacgatgc 354721 tgaatctgcc tgcggcttgg cagttttccc ggggtgaggg ccagctggtg gcgatcatcg 354781 acaccggggt gcagccgggc ccgcgactgc ccaacgtcga tgccggtggt gacttcgtgg 354841 agtcgaccga cgggctgacc gattgtgacg ggcatggcac cctggtcgcc ggaatcgtcg 354901 ccggccagcc cggtaatgac ggcttctctg gtgtggcgcc ggcggcgcgg ctgctgtcca 354961 tcagggcgat gtctacgaag ttctcaccgc gcacatcggg gggcgatccg cagctggcgc 355021 aggccacact tgacgtcgcg gtgctggccg gtgccatcgt tcatgcggcc gaccttggtg 355081 ccaaggtgat caacgtctcc acgatcacct gcctacccgc cgatcggatg gtcgaccagg 355141 ccgcgctggg cgcggcgatc cggtatgcgg cggtggacaa ggacgcggtg atcgtggcgg 355201 ccgcgggaaa caccggagcg agcggatcgg tcagcgcgtc gtgtgattcc aacccgttga 355261 ccgatctgag ccgcccagac gatccgcgga actgggcggg cgtcacctcg gtgtccatcc 355321 cgtcgtggtg gcagccctac gtgttgtcgg tggcgtcgct cacatccgcc gggcagccat 355381 cgaaattcag catgcccggg ccgtgggtgg gcatcgccgc acccggggaa aacattgcgt 355441 cggtgagtaa ctcaggcgac ggcgccctgg ctaacggact gcccgacgcc caccagaaac 355501 tggtggctct cagcggcacc agctacgcgg ccggctatgt ctccggggtg gccgcgctgg 355561 tccgcagccg ctatcccggg ctgaacgcca ccgaggtggt gcgccggctg accgccaccg 355621 cgcaccgcgg cgcccgagag tcctccaaca tcgtcggcgc cggcaacctg gacgcggtgg 355681 cggccctgac ctggcaactg cccgccgaac ccgggggcgg tgccgcaccg gccaagccgg 355741 tcgccgatcc gccggtcccg gcgcccaaag acaccacacc gcgcaacgtc gcattcgccg 355801 gagcagccgc gctgagcgtg ctggtcgggc tcacagccgc gactgtcgcg atagcgcgcc 355861 gacgaaggga gcccaccgaa tgaacccgat cccttcttgg cccggcaggg gccgggtcac 355921 gttggtgctg ctggcggtgg tgcctgtagc gctggcctac ccctggcaat cgacacgcga 355981 ttacgtgctg ctgggcgtgg ccgccgccgt cgtgattggg ctattcggct tctggcgcgg 356041 gctgtatttc accacgatcg cgcgccgcgg gttggcaatc ctgcgccgcc gacgccggat 356101 tgccgagccc gcaacgtgca cgcgcacaac ggtgctggtg tgggttgggc cgccggcatc 356161 ggatacgaac gtgctgccgc tgacgctgat cgcccggtat ttggaccgat acggcatccg 356221 cgccgacacg attcgcatca ccagccgcgt caccgcatcc ggcgactgcc ggacctgggt 356281 cgggttgacg gtggtcgccg acgataacct ggcggcgctg caggcccggt cagcgcgcat 356341 ccccttgcaa gagaccgcgc aggtcgcggc gcgccggctc gccgaccatc tgcgcgaaat 356401 cggttgggag gctggtacgg ccgcacccga cgagatccca gcgttggtgg ctgcggattc 356461 tcgcgagacg tggcgcggaa tgcggcacac cgactcggat tacgttgcgg catatcgggt 356521 cagcgccaat gccgagttgc ccgatacgtt gcccgcgatc cggtcgcgtc cggcgcagga 356581 gacctggatc gcgctggaga tcgcatatgc cgccgggtca tcaacccgct acacggtggc 356641 cgctgcctgc gcattgcgga ccgattggcg gcctggcggc accgcaccgg tggccggcct 356701 gctcccgcaa cacggaaacc acgtgccagc cctgacagcc ttggatccgc gatccacccg 356761 ccgactcgac gggcacaccg atgctcctgc cgacctgctg acccggctgc actggcctac 356821 tcctaccgcc ggcgcccacc gggcaccgct gaccaacgcc gtcagtcgaa catgaggccc 356881 tgcaggaaca cggtcatccg ccgcagatag tccaactggc tcacatgcag caggtggctg 356941 ccggggaacc agtgcagcgc acagcgatcc cactgcttcc acagcgttac cgcgtgctcg 357001 ggtggagcca ttcgatcgcc aaggccggtg atgatcatcc gccggtcctt aggtagcagc 357061 ggccgatagt tcagtgggcc gtggtaggcc agcccggcga tcagctcatc acggctgatg 357121 ttggttagcc gcagtcctag cttgacgagc ttattggccg gaaaccattc gtcgaacagc 357181 ttggcgggca tgacgacggg gcagttgggg atgacagcct caagccgact ttcgaccgaa 357241 gccagcagcg cagacgtgta gccccccagg gatatacccg tcagggcgat acggtcgacg 357301 ccgatgtggc gcaggtagtc cacgatggaa cgaaagtcat acactgcctg cgccatcgcc 357361 tcggcgaagc cgctcaatcc gctagtgaaa tagccgaaac cgctaaacgg cgagaacttt 357421 tcggcccgct ggccgtgaaa cggcaacgtg tacagcaaaa cgtcgtagcc ggaccggtaa 357481 taccaaggca gcgaaaagaa cagcccgttg agcaagtatg acgatcccat gaagccgtgg 357541 atgacgcaca gcgtaggacg cgggccgtcg cggtggcgcc agtgctgcgc gtgcacaatg 357601 ttgttggcgg tcaatgcact ccaccgctgg cgcatcgtgg ggttgatcgc ccggaagccg 357661 ctggcaaatg cgatgttgtc cacggtgccg cgcgcaaccc attcggtgag cgggctggcc 357721 ggccgcgagg tgaccttggg caactccgtc ggcgccggaa aggacttcgc cggatcatgc 357781 gctgccgcaa gttcggcgta gaagttcagg ttgctgcgct cgctgccttc gttgacgtga 357841 cgtagtgcgt tggcgacaac ggccggagtc accgtcgcgg acagcaccga cgcgaccgcg 357901 gtgcgcagcg cgacatcggc gatcgccgaa gactcgacga gtatccgctg gcgggccgac 357961 agcaccgagc gcgagggcag gccctcggcg ccggcatccg cgccggggac gtcgggaatg 358021 gggacgggcg gaccgatcgc gtcggcagtg aacgtccctg acatctcgga catcaatgtc 358081 gatggtaatc gccaatgtgg ctgaccgctg aaggtttcga ctgtatcgtc aatttctcac 358141 tcggtcgagc gcttgtccag gagcacgtac atgtgggatc ccgacgtcta cctggctttt 358201 tcgggtcatc gcaaccgccc gttctacgag ttggtgtcac gggtgggtct cgagcgggcg 358261 cgccgcgtgg tcgacctggg gtgcgggccc ggccacctga cacgctacct ggcacgacga 358321 tggcccggcg cggtgatcga ggctctggac agctcaccgg agatggtcgc tgccgcggcc 358381 gaacgcggga tcgacgccac caccggtgac ctgcgggact ggaaaccaaa gcccgacacc 358441 gatgtggtgg tgagcaacgc tgcgttgcat tgggtgcctg agcattccga cctgttggtc 358501 cggtgggtcg acgagctggc gccgggatca tggatcgctg ttcagatccc cggcaacttc 358561 gagacgccgt cgcacgccgc ggtacgggcg ttggcccgcc gcgagccgta tgcaaagcta 358621 atgcgcgaca taccttttcg tgtgggcgcg gtggtccaat ctccggcgta ttacgcggag 358681 ctgctgatgg acaccggctg caaggtcgac gtgtgggaga ccacgtacct acaccagctg 358741 accggcgagc acccggtgtt ggactggatt accggaagcg cgctggtccc agtgcgtgag 358801 cggctcagcg atgagagctg gcagcagttt cggcaggagc tcattccgct gctgaacgac 358861 gcctacccgc cacgggccga cggtagcacc atctttccct tccggcggct gttcatggtc 358921 gccgaagttg gtggcgcgcg ccgctcaggt gggtagcccc agccgcggcg cctccgctcg 358981 gtaccggtcg acccactcat cagagcgctg gttggcctgc cgttccagca tcggtgcggg 359041 cgccagtttg ggatcctggc cgatggcgtc aagcacactg gccacaatcg cggtcaggtt 359101 gcgccataac accgggtagg cgatatcgat cggatcgatg ccttcctcgg caaaccaagc 359161 gcgccagccg ttttcctgat cgcgcagatt cctgatgatg tgggcgatgg caccggcgtg 359221 gtagacggcc tgcgagtcgc gcttggggtc cggatggccc cgccaaacct gggtttgcac 359281 ggcgcgccag aacgacaccg cttgtgacac cacatcgggc cggtggacgt gcacgaaaac 359341 cggttcgttg ccaatgacgt cgcggattgc cgcgcgcaag ccatccccgg agcgatccgg 359401 caattgtgct gcgcgttgct gcagcagcgc agtctgattc cacatcaact tgccgcccca 359461 gacgccgttg ggcgtgcgac cggaggtgcg gacgtgctca cgccaggcaa ccggcgtcgc 359521 ggtgtccggt gtaccggggt ccagcggatc gagcaattgc aggatcgtgt catcgtcgac 359581 cccagcgaac cactcccggg gctggggggc catcccggtg ctaggcaggt attggaagaa 359641 ctcctgtggt tccccggcac agcccgtcgc gcgcagcgat tccaccagca gcgtgctgcc 359701 gctgcgttgg gtggcgagca ccagatacgg tctcacagcg cgggacatcc gatgagccta 359761 gctgcagtgt tcgtcgatgc cgcggtcggc ggcgatcgct gaccggcccg ttggcgtctt 359821 gcggtggatc cgcagatacg tttcggtgta gcgctcggcg atgcgggaac cggcgaagtc 359881 cgacggaatg acgtcggccg tgcgctgtcg ccaatcatgc agtcgcacgg ccagatcggc 359941 cgcgatcgcg gccacgccct gggtgctgtc gtcgccggct aacaggttat tggtctcggt 360001 gggatcggcg cgtagatcgt agagttcccg ctgcgggcgg ggcgccttga ccaacggtgc 360061 gacggccatg ccggccgggc tttcctggat atcccacggt aggtccagca gcggccgggg 360121 cgcgtaattc tcgatgtagc tgtattcctt ggtgcggatt gcccgaatcg gatcgaacga 360181 gtcgtgatag gtcttggcgg tgtatacgtg gtcacgcacc gcagcgtttt cagtgtccgg 360241 cgcgaggagg gccggtgcgt gtgacacacc ctcgacatcg gcgggtacct cgagtctcag 360301 caggtccaat agcgtcggaa ccagatcgac gccgctgaaa agctcgtcat agacgcgagg 360361 cgccatcgcc cggcgagtgg gcgggcggat gatcagcgcg ataccggttc cggcgtcata 360421 cagtgtggac ttcgcccgcg gaaatgccgg accgtgatcg gtgacgaaca ccacccaggt 360481 gctggcgtct aggccggtat cggccagtgt gtcaagtagc cggccaaccg cctcgtcggc 360541 tgtggcgata gaaccgtaga actcggcgac gtcttggcgc acctcggggg tatcgggcag 360601 atagtcgggc agctcgacgg ccgcgctgtc ggccggccgg tagcgctcat gcggataggg 360661 ccggtgggtt tcgaagaagc cggcggtcaa caggaaccgt tgtccgtcta acgcgggcac 360721 gcgattatgc agccagtcct gggctttggc gaccacgtat tcgcagtagg agttcgacac 360781 gtcgaattcg tcgaagccca gccgctttgg gtaggacgtc tcatgctgca taccgaaaag 360841 agctgagtac caacccgatt cggatagcaa ttgcggtagg gtttggaccc cggtgcggta 360901 ttcccagccg tgatgggcca ggccgaccaa cccgttgctt tgcgggtagc ggccggtgaa 360961 cagcgagccc cgcgatggtg tgcacagcgg cgcggtggca tgtgccctgg tgaacaggat 361021 gccctcggcg gcaagccggt ccagccgcgg gctgtagacg tccggatggt ggtagacgcc 361081 gagatagcgc cccaggtcgt gccagtgcac gatcagcagg ttctcgcgct gccctgtggc 361141 acgctcactc gtcacctttg tcacctctcc agcgaaccgc acccggcgcc gaagccggac 361201 aatagagcct atacgtcgcg aggcactaga tacgccaccg atgatggcgg taggctcgct 361261 gattgaatcg cggcgacggc gtaggcgtgt tgtgtcttgg cgtccaggag tcacgagtcg 361321 acgggaggtt cccgtgtcct ttgtgatcgc acaaccggag atgatcgcgg cggcggccgg 361381 tgagttggcc agcatcagat cggcgatcaa cgcggccaat gcggcggccg cggcccagac 361441 caccggagtc atgtcggcgg ccgccgacga ggtgtctacg gcggttgccg cgctgttttc 361501 ctcgcatgcc caggcctatc aggccgccag cgcgcaagcg gccgcctttc acgcccaggt 361561 ggtgcggacc ctgaccgtgg acgcgggagc gtatgccagc gccgaggccg ccaacgccgg 361621 gccgaacatg ctggccgcgg tcaacgcccc cgcccaggcg ctgttggggc gcccactgat 361681 cggcaacggt gccaacgggg cgccgggcac cgggcaggcc ggcggcgacg gtgggctgtt 361741 gttcggcaac ggcggcaacg gcgggtccgg cgcacccgga caggccggcg gggccggcgg 361801 ggcggccggg ttcttcggca acggtggcaa cggcggggac ggcggggccg gagcgaacgg 361861 cggcgccggc ggcaccgccg gctggttctt cggcttcggc ggcaacggcg gggccggcgg 361921 gatcggtgtt gccggcatca acggcggtct cggcggcgcc ggcggcgacg gcggcaacgc 361981 cgggttcttc ggcaacggcg gcaacggcgg catgggcggg gccggggcgg ccggcgtgaa 362041 cgccgtcaat cccggcctgg ccaccccggt caccccggcg gccaacggcg gcaacggcct 362101 caacctcgtc ggcgttcccg gcaccgccgg tggcggcgcc gatggcgcca acggcagtgc 362161 cattggccag gcgggcggcg ctggcggtga cggcggcaac gcctccacga gtgggggcat 362221 cgggatcgcg caaaccgggg gcgccggcgg cgctggcggt gccggcggcg acggcgcacc 362281 cggtggcaac ggcggcaatg gtggcagcgt cgagcacact ggcgctaccg gctcctctgc 362341 gagcggcggc aatggtgcca ccggcgggaa cggcggggtc ggtgcgcccg gcggtgccgg 362401 cggcaacggc ggccacgtca gcggcggatc ggtcaacaca gccggcgccg gtggcaaagg 362461 cggcaacggc ggcaccggcg gcgccggcgg cccgggcggc cacggcggca gcgttctatc 362521 cggcccggtt ggcgacagtg gcaacggtgg tgccggcggg gacggcgggg ccggggttag 362581 cgccaccgat atcgccggca ccggcgggcg cggcggcaac ggtggtcatg gcgggctgtg 362641 gatcggcaac ggcggcgacg gtggtgcggg cggtgtcggc ggtgtcggcg gggccggtgc 362701 ggctggcgcg atcggcggcc acggcggcga tggcggctcc gtaaataccc ctattggcgg 362761 cagcgaggcc ggtgacggcg gtaagggcgg cctgggcggg gacggcggtg ggcgcgggat 362821 attcggccag tttggggccg gcggggccgg tggtgccgga ggcgtcggcg gcgccggcgg 362881 ggctggcggg accggcggcg gcggcggcaa cggtggggcc attttcaatg ccggtacccc 362941 cggcgccgcc ggcacgggcg gtgacggcgg tgttggcggg accggtgcgg ccggcgggaa 363001 aggcggggcc ggcggtagcg gcggcgtcaa cggcgccacc ggcgccgacg gcgccaaggg 363061 cctcgacggt gccaccggcg gcaaaggcaa caacggcaac cccggctgag tccggattca 363121 ccgagtctgt agataccgtg gtccgcattc gcagttttgt gcgccaacta cagcctcgat 363181 gacacgaccg cggcgaatcc cgtttcccgg gtgcggcgac accgcgtcct acgattagta 363241 ggatctctgg tatgacgaaa gagaagatct ccgtgacggt ggacgcggcc gtcctcgcgg 363301 cgatcgacgc ggacgccagg gcggcgggtt tgaatcggtc ggaaatgatt gagcaggcac 363361 tgcgcaacga gcacctgcgt gtcgctctgc gcgattacac ggctaaaacc gtaccggcgt 363421 tggacatcga tgcctacgca cagcgggtgt accaggcgaa ccgggcggcc ggaagttgat 363481 cgctcccggc gacatcgcgc cgcgccgcga cagtgaacac gagctctacg tcgccgtctt 363541 gtccaacgcg ctccatcggg ccgcggacac cggacgggtg atcacctgcc cattcattcc 363601 gggccgggtc cccgaggatc tcttggcgat ggtggtggcg gtcgagcaac ccaacggcac 363661 gctgctgccg gaactcgtgc agtggcttca tgttgccgcg ctcggtgcgc cactcggcaa 363721 cgcgggcgtg gccgccctac gcgaggctgc ctcggtcgtg acagctctgc tctgttagcc 363781 ctgtcaccgg cgaagatacc tgatatcgcc agatatcatc ggaagatgag tgatgtactg 363841 attcgggaca tccccgacga cgtgttagca agccttgacg cgatcgcggc acgcttgggc 363901 ttgtcgcgga ccgaatacat ccgtcggcgt ttagcccagg atgcgcagac ggctcgcgtc 363961 accgtgacag ccgcggatct tcgacgcctc aggggtgcgg ttgccggtct gggcgatccc 364021 gagcttatgc gtcaggcgtg gaggtgactg accagcgctg gctgatcgac aagtcggcgc 364081 tggtgcggct cacggacagc cctgacatgg aaatctggtc gaaccggatc gaacgcggcc 364141 tggtacacat cacgggcgtg acacgcttgg aagtagggtt ctcggccgaa tgcggggaga 364201 tagcgcgacg ggagtttcgt gaaccgccgc tgtctgcgat gcccgtggaa tacctaaccc 364261 cgagaattga agaccgtgcg ctcgaggtgc agaccttgct tgccgaccgc ggacaccacc 364321 gtggcccgtc gatcccggat ctgctcatcg ccgcgacagc cgaactgtcg ggcttgacgg 364381 tactgcacgt cgacaaggac tttgacgcca tcgccgcgct taccggtcag aaaacagaac 364441 ggctcacgca tcgcccgcct tccgcttaag gagcccgacc aacccttgtg attggcgtgg 364501 gggggcgcta acgtaactgt ctgtaacgtt cgatacagaa ctggcgccgg ggtgcggccg 364561 cgactctacg agccgagaca agccggcgca aggatggcgc accagtgggc gttcccgcca 364621 agaaaaaaca gcagcagggg gagaggtcac gagaatcgat tctcgacgcg accgaacgcc 364681 tgatggcgac caagggctac gcggcgacct cgatcagcga catccgcgac gcgtgcgggc 364741 tagcacccag ctctatttac tggcacttcg gctccaaaga gggcgtgctg gccgccatga 364801 tggagcgcgg cgcgcagcgc ttctttgccg cgatacccac ctgggatgag gcccatgggc 364861 ccgtcgagca gcgatccgag cgccagctga ccgagctggt gagcctgcag tcgcagcatc 364921 cggacttcct gcgcctgttc tacctgctgt cgatggaacg aagtcaggat ccggcggttg 364981 ccgcggtggt gcgccgggtc cgcaacaccg cgatcgcccg atttcgtgac agcatcacgc 365041 acctgctgcc atcggacatc ccgccgggca aagccgatct cgtcgtcgcg gagctgaccg 365101 cgttcgcggt tgcgctgtcg gacggcgtct atttcgccgg ccaccttgaa ccggacacga 365161 ccgacgtcga gcgcatgtac cggcggctgc ggcaagcgct cgaggccctg attcccgtcc 365221 tcctggagga gacatgaaca ccggaaccgc cgtcatcacc ggggccagct ccggcctcgg 365281 gttgcagtgc gcccgcgccc tgctacgtcg cgacgcatcg tggcatgtgg tgttggcggt 365341 gcgcgacccg gcgcgcggcc gtgcggccat ggaggaattg ggggagccaa accggtgttc 365401 ggttctcgag gtggacctcg cgtcggtgcg gtccgtgcgc agtttcgtgg aaaccgtgcg 365461 gaccacgccg ctgccgccga ttcgtgccct ggtgtgcaat gccggcctgc aggtggtgtc 365521 gggcatcgcg ttcaccgacg acggtgtcga gatgacgttc ggggtaaacc acttgggtca 365581 ctttgcttta gtgaccggga ttctcgactg gttggcccgt ccggcgcgca tcgttgtcgt 365641 cagcagcggc acgcacgacc cgagcaagca caccggaatg cccgaccctc ggtatacctg 365701 cgccgccgac ctcgcgcacc cgcccaccga tcagaacacg ccggccgaag gccgccgtcg 365761 atacaccacg tccaagctgt gcaacgtgct cttcacctac gagctcgacc gccgcctcga 365821 tcacggagaa cagggcgtga tggtcaacgc gttcgacccc ggcctaatgc cgggctccgg 365881 cttggcccgc gactatccgc cgatcctgcg actggcgtac cgtctcctgt cgccgatgct 365941 gcgcgtcctt cccttcgttc acagcacccg ggtctccggc gaacacctgg cggcgctggc 366001 ggtcgatccg cggttcgcgg gcgtgacggg ccaatatttc gcgggcgcca aggcgatccg 366061 gtcttccgcc gagtcctacg atcgggcaaa ggcgctcgac ctctgggaga ccagtgaacg 366121 gctgctggcc caggtgacat agctgcgcgt tatcccctaa agaaacccgc caggttggtg 366181 ccaaagttac cgatgccgga aaggaacccc ggcgtcgcga gatccagcgc gctggcgttc 366241 aaccagcccg agatggtgtt gcccacgttg gcgacacccg atcccagcgc gccggtattg 366301 aggtagcccg acaggccggc accggagttc acgaatcccg atacgcttcc ggcgccgctg 366361 ttgaagaagc ccgacgacgg gccgccagtg aggttgaagt agccgggggt caccggaatg 366421 cccaacagcg gcaggccgat caggccctga tagtcgccac tcaccaagaa gccgttgctg 366481 tagctgccgg taatgaacgc accggtgtcc acatcgcccg tgttcgccac gcccgtgttg 366541 tagtcaccgg tgttgaggta gcccgtgttg ccactgcccg ggttgaagcc gccggtgttg 366601 aagctgcccg ggttgaagct gccggtgttg aggtccccgg ggttgaacac gcccgtgttg 366661 gtgctgcccg cgttgccgat gccggtgttg aagccgcccg agttcgcgag accgaagttg 366721 ccggtgccgg tgttaaagat gccgacgttg ccggtgcccg agttgaagaa cccgatgttt 366781 ccgctgccgg agttgaacag cccgatgttg ttgctgcccg agttgaggct gccgatcccg 366841 atctggccgt tgccggtgag cccgacgccg atgttgttgt taccggtatt cgcgaagccg 366901 atgttgtagc tgccggtgtt ggcaaagccc aggttgtcgc tgccgaagtt cgcgaagccg 366961 atgttgtagc tgccggcgtt gccaaagccc aggttgtcgt cgcccagatt tgccaacccg 367021 atgttgtagc tgcccaggtt cgccaagccg atatcgaaga tcccggtgtt ggcgatgccg 367081 atgttgttac cgccgatgtt gaccccgccg aagttgaggt cgccgaggtt gccgatgccc 367141 aggttggagt cgccggtatt aacgaagccg atgttgacgc tgcccaggtt cccgatgccc 367201 gcgttgaggc cgccctggtt tgcgacgccg aagttcagcg tcaggttgcc ggtgttgtcg 367261 aggaacaggc cggccaggtt ggcgccgatg tttgcgatgc ccgagccgaa ggcgggcgtc 367321 gcgaggtcca gcgtgctcgt gttgtagacg cccgagatgg tgttgcccag gttcgccaga 367381 cccgactgca gcgcgccgaa attcagtagg cccgaattgc ccaagccgcc ggcgttaaag 367441 aagcccgaat tgccagcgcc gaagttcccg gagcccgaca tgttgccggc gccgaagttc 367501 ccgaagcccg atacatggcc ggcgccggtg tggaagaaac ccgacgacgg gccggtggtc 367561 gagttgccga aacccggggt accaccgatg ctgatgccga tggggatcgg gccgaagccg 367621 ccggtgccaa ccatgctgat ggtttgctga atgggcgaat cgatggcgat gacttgattg 367681 acatcgatcg tgatggggcc gatcatctcg ttgacaagca ccgccgcagg accaagcaag 367741 actcgtatct ggaaaccggg aatggtgaaa ctgtttggcg tggtggcgac gacggtgccg 367801 gtgatgggta tgtcgattgg aacactcaag tcgtagcggt aggggatttc gggaatggtg 367861 atcgttgtgg aaaggccaat caacccctgg tagtcacctc gccagaagaa cccgttgctg 367921 taattgccgg agatgaacgc gccggtgttg acgttgcccg tgttggccac acccgtgttg 367981 tagtcacccg cgttgaagta gcccgtgttg tagtcaccgg agttgaagct gccggtgttg 368041 tgatcgccga ggttgaagct gccggtattg gtgctgccag tgttgaagct gccggtgttg 368101 atgctgccgg tgttgatgct gccggtgttg ccgacgccgg tgttgacgtt gcccgggttg 368161 aacaggcccg tgttggtgct gcccgtgttc ccgaggccgg tgttgaagct gcccgagttt 368221 gcgatgccga agtttgcggt gccggtgttt ccgatgccca cgttgccggt gcccgagttg 368281 aagaatccta cgtttccgtc accggagttg aacaagccga tgttgtggct gcccgagttg 368341 aagctgccga acccgatctg accggtgccg gtgagcccga tgccgatatt gccgctaccg 368401 gtgttggcga agccgatatt ggcactgccg gtgttagcaa tgccgatatg gtagttgccc 368461 gagttggcga agccgacgct gtagttgccc aggtttgcca agccaatgtt gtggttgccc 368521 acgtttgcga aaccgacatt gaagattccg gtgttcccga tcccgaagtt agagcccccg 368581 aggtttgcca agccgacatt gaggttgccg aggccggcca agtcgaggat cgtcgtgccg 368641 gcgccgccct gcagcaggcc ggcgatgttt gcgaggccgg agccgaaggc cggcgtcccg 368701 aggtccagcg ggctcgtgtt gtagataccc gagatggtgt tgcccacatt cgccacaccc 368761 gatcccaacg cgccgacgtt gagcaagccc gagactcctg aggctgccga cgcaaggttc 368821 cacaggcccg atgtgttgcc gccgacgttg ccgaacccgg atgcggtgcc ggcgccggtg 368881 ttgaagaagc ctgacgacgg gctggtggtc gagtttccga tgcccggcgc tgccggaatg 368941 tcgatgatcg ggatggtgat ggggccgagg ccggcggtgg cgctgatgtt gatcgcggtc 369001 gtgggtccgc ccacggcgat cgcgaacgtg ggaacgctga gcacgaagct cgggacaatg 369061 atgggaccga tgtccggctc ggtatggatg tgaaagctaa acgcgaagga ttcgaagccg 369121 atgatgggga tagtgaaatt gtccaccacg aggtcggtga aactgccggt gatcggtatg 369181 tcgattggga tattgacgtc caagtgcgcc ggaatctccg gaatagtcag cgcgtaggag 369241 taaccgatca ggccctggta gtcgccccgc cacaagatgc cgttgctgta gttgccggag 369301 atgaaagcgc cggtgctgac gtcgcccgta ttcgcgatgc cggtgttgta gttcccggtg 369361 ttgaagtggc cggtgttggt gttacccgcg ttgaagccgc cggtgttgaa gctgccggta 369421 ttgaaattgc cggtgttgaa gttaccgggg ttgaagccgc cggtgttgcc gtcgcccgag 369481 ttgaacaagc ccgtgctggt gctgcccgag ttgccgatgc cgaagtttcc ggtgccggtg 369541 ttgccgatgc cgaagttccc ggtgcccgag ttaaagaagc cgatgtttcc gtcgcccgag 369601 ttgaacaagc cgatgtttcc gctgcccgag ttcagagcgc cgatcccgat ttggccggta 369661 ccggtgagcc cgatgccgat attgttgctg ccggtgttgg caaagccgat attgttgctg 369721 ccggtgttgg caacgccgat gttgtagctg cctaggttgg caaagcccag gttgtcgtcg 369781 ccgaagtttc cgaagccgat gttgtagttg cccagattcg ccacgccgac gtcaaagatc 369841 ccggtgttgc cgaagccggc gttattgctg cccaggtttg ccagcccaag gttcagagtc 369901 atcgtgccca taccgtcgcg catgagaccg gaggcaaagg ccggcgtcac gaggtccaac 369961 gcgctcgcgt tgtagaaacc cgagacggtg tgacccacgt tagtcacacc cgaccccagc 370021 gcaccgacat tgagataacc cgaaatccct gaggcgccgg cgaccacgtt caaaaagccc 370081 gacgcgctgc ccgctccgga gttgaagaag cccgacgacg gactagtggt cgagttgccg 370141 aagcccgatg tcgcgggaat gtcgatgatc gggatggtga tggagccgat accggcgctg 370201 gcggtgatac cgatcgaggt ggtgggtccg cccacggtga tcgccgccgt gggcaaggtg 370261 atattgatgg tcgggatgat gatgggggtg aagtcgatat tattttcggc agctacgatg 370321 ctgaagccct ggagggtgac gaccccggcg tcgatgttga tgggtatatg tatcgggatg 370381 tcgacgccaa aggttagggc gatttcggga atcgctagcg ccgcgtgcaa gccaatgagg 370441 ccctgataat ttccactcca caagaacccg ttgctgtagc tgccggagat gaaggcgccg 370501 gtgtcaacat cgccggtgtt tccgagtccc gtgttgtagc tgcctgggtt gaaatccccg 370561 gtgttggaat cgcccggatt gaagctgccg gtgttgaagc tgccggtgtt gccgataccg 370621 gtattgacgt cgccggagtt gaagaagccg gtgttggtgc tgccggtgtt tccaagcccg 370681 aagtttgcgg tgccggtgtt gccgatgcca acgtttccgt tgcccgagtt gaaaaagccg 370741 atgtttccgc tgcccgagtt gaacaagccg atgtttccgc tgccagaatt cagggagccg 370801 aacccgatct ggccgtcgcc cgtgagccca atgccgatat tgttgctgcc ggtgttcgcg 370861 aacccgatat tgttgctgcc ggtgttggcg aagcccaggt tgtcgttgcc caagtttccg 370921 aagccgacgt tgtagctgcc gaggtttccg aagcccaggt tgtcatcgcc gaagtttccg 370981 aagccgatgt tgtaactgcc cagatttgcc aaaccgatgt cgaatattcc ggtgtttgcg 371041 ccgccgatgt tgttgccacc gatattggcg ctgccgaagt tggcgctgcc gaggtttgca 371101 aagccgatgt tgtagtcgcc gaggtttgca atgccgacgt tgagggtgcc gtggtttgcc 371161 aagcccaggt tgaggaccat ggtgcccgtg ctgtcgcgca gcaggccggc gatactggtg 371221 ctgatgttgg ccaggcctga attgaaggcc ggcgtcgcga ggtccgacgt gctggtgttg 371281 tagaaccccg agacggtggt gcccacgttc gccagacctg atcccagggc gccgacgttg 371341 aggagccccg acgcccccga ggtcgcggag gccaggttcc aaaagcccga attggcgccg 371401 ccgaagttgc cgaagcccga ggcgccgccg gcgccggtat tgaagaaacc tgacgacggg 371461 ttggtggtcg agtttccgaa acctggcgcc gccgggatac tgatgagcgg gatcctaatg 371521 gcgccaccgc cagttatggt gatcgcggta ttcggccctc ctatggcgac tgtcgtcgtg 371581 gggccgacaa ccgtgatgtt cggaatggta atggggccaa agtgggctcg ctggcccgct 371641 attgacgaaa ggacgatatc gccggtgggc ggaatcgtga cgcccataag ggtgatgttg 371701 ccggccgagg cggtgatcgg gatatcgatg ggaatattca cgccgaggct tatggggaga 371761 ggcatatcga tcaccaggtt gaggccgacc aggccctggt aatcgccgct taagaacaac 371821 ccgttgctgt agtttccggt gatgaaagcc ccggtgtcaa cgtcgccggt gttggcgatg 371881 ccggtgttgt agttgccaat gttgaggtag ccggtgttgg tattgcccgg attgaagcca 371941 ccggtgttga agctgccggt gttgaagcta ccggtgttga agctgcccgg gttgaaggcg 372001 cccgtgttga cgtcgccgga gttgaatagg ccggtattgg tgttgccggt gtttccgatg 372061 ccagtgttga agctgcccga gtttgcgatg ccgaagttgc cgctgccgga attgaagaat 372121 ccgatgttgt tgctgcccga gttgaacagg ccgatgtttc cgctgcccga gttgaagctg 372181 ccgaacccga tctgtccgtt gcccgtgagc ccgatgccga cattgttgct gcccgtgttc 372241 ccaaagccga cattgttgct gccggtgttc gcaaagccga tgttgccgcc gcccgcgtta 372301 gcgaaaccca gattgtcgtt gccgacgttg ccgaagccga tgttgtagct gccgaagttg 372361 ccgaagccca ggttgtcgtc gccaaggttt ccgaagccga tgttgtagct gcccaggttc 372421 gccaggccga catcgaagat tccggtgttc ccgatgccga cgttgttgtg gccgatggtg 372481 gcgccgccga agttaaagcc gccgagactt gcgaagccca cgttgaggtt gccgtggttg 372541 gccaagccca agttaatagc cgcagtaccc gcgccgtcgc gcagcaggcc ggcaatattg 372601 gttccgatat ttgccaaccc ggagttaacg gcgggcgtcg agaggtccga cgtgcccacg 372661 ttgtagatac ccgagatggt gttgcccaca ttcgccacac ccgatcccag cgcgccgacg 372721 ttgaggaagc ccgacattcc cgacgttgtg gagaccaggt tcataaagcc cgacgcggcg 372781 cccccgaagt tgccgaagcc cgaggcgctg ccagcgccgc tattgaagaa gcccgacgac 372841 agtccgccgg tcgagttgcc gaagcctgga gtcgctggaa tatggataat cgggatgttg 372901 atggcgtcga cgaccacagt ggcgccgata ttgatcgcgg tagtgggtcc acccaccatg 372961 accacaggtg tgggaccggt tatccgtatc actgggacgg tgaaggggtc gatatcgacg 373021 ggtccgaaaa aaataacagt gacggctgtg ttcggtggaa gatcgagccc gctgtatacg 373081 atgtccgtga agctggcggt gatcggtatg tgaatcggga tattcacgtc gacgctcaca 373141 atcgggattt cgggaatgtc gacgccgatg gcgaggtcga tcagaccctg gttgtcgccc 373201 cgccacagaa ggccgttgct gtggttgccg gagatgaaag cgccggtgtt gatattgccg 373261 gtgttcgcca tgccggtgtt gtaagtcgcc ggtgttgaag tagccggtgt tgtagttacc 373321 tgcattgaag ccaccggtgt tgacgctgcc ggtgttgacg ctgccggtgt tatagctgcc 373381 cgggttcaag ctacccgtgt tgaggtctcc ggagttgaac aagcccgtgt tggtgctgcc 373441 cgcgttcccg atgccgaagt ttccggcgcc cgagtttccg atgccccagt ttccggtgcc 373501 cgcgtttcct atcccgaagt ttccgctgcc cgagttgaac aatccgatgt ttccgtcacc 373561 cgagttgaac aagccgatat tgtggctgcc cgagttcagg ctgccgaacc cgatctggcc 373621 gctgccggtg agcccgatcc cgatattgtt gctgccgata ttcgcaaaac cgatattgtt 373681 gtcacccgtg ttcgcgaagc caagattatt gctgccggtg ttcgcaaagc cgatgttgta 373741 gctgcccgcg tgggcgaagc ccaggttgtc gccgcctaga ttgccgaagc caatattgta 373801 gtcgcccagg tttgccaagc ccacattgaa gattccggcg tttgcaccgc cgacattgtt 373861 gccgccgacg tttgccaacc cgaagttaag ggtcatggtg cccacgctgt ggtgcagcag 373921 gccggagtta aaggctggcg tcgccagatt cgacgtgctc gtgttgtaga gacccgagat 373981 ggtgttgcca acgttagcta cacccgatcc cagcgcgccg acgttcccga agcccgaaag 374041 tcccgaggtt gccgaggcca ggttccaaaa gcccgaagcg ccgccgccga agttgccgaa 374101 gcccgaggcg gtgccggcgc cagcgttgaa gaagcccgac gacgggctgg tggtcgagtt 374161 cccgaagccc ggggccgccg gaatcttgat gagcgggatg ctgacgcccc ccaccatgcc 374221 ggtgaggttg ccgtcgatcg tggtggttgg tccgcccacg gtgatcgtca ccgtgggaag 374281 ggtgagcgtg gattgcggga gctcgaccgg gccgtagtaa acaacgaagg gaacaatgga 374341 tgtgaagggc aaacgcatgc ccggaatcgt catcacgctt ccgggcatga ccatcacctg 374401 atgtatcggc atgctgaata gctgcgcgtt tatcggaatg gcgggaatct cgagggcgat 374461 atcggcaccg atcaggcctt ggtagtcgcc ccgccacaag acgccgttgc tgtagttgcc 374521 ggcgatgaag gcgccggtgt tgacgttgcc ggtgttggcc actccagtgt tgtagtcgcc 374581 ggtgttgaag tagccggtgt tgtagttacc tgcgttgaag ctgccggtgt tgtagttgcc 374641 ggtgttgaag ttcccggtgt tgtagctgcc cgggttgaag ccgccggtgt tgacgtcgcc 374701 ggtgttgaac cagccggtgt tggtgctgcc cgtgttgccg aggccgaagt ttccggtgcc 374761 gctgtttccg atgccgaaat tgccggtgcc cgagttgaac aacccgacgt ttccgctgcc 374821 ggagttgaac aggccgatgt tgtggctgcc cgagttgaag ctgccgaacc cgatctgtcc 374881 gttgccggtg agcccgatgc cgacattgtt gctgcccgta ttcccaaagc cgacattgtt 374941 gctgccggtg ttcgcaaagc cgatgttgtg gccgcccagg ttggccaaac ccaggttgtc 375001 gctgcccagg tttgcaaagc cgaggttgta gctgcccaaa ttgccgaagc cgacgttgaa 375061 cacgccgacg tttccgttgc ccacgttgtt ggcgccgacg tttgccaagc cgagattgaa 375121 gcccgccgcg ctcggggggc cggcagcggc tgccgcggcg ctggtcagcc gctccgatag 375181 gcccgccagc ttcttcagct gctgggtgaa cggcatcaac gcggagacgg ccgccgacgc 375241 tccagcgtga tagccaacca tcgcggccac atcctgggcc cacatccgct cataggcggc 375301 ctcggtggcc gcgatcgccg gagcgttgaa tcccagcaga ttcgagctca ccagcgacac 375361 cagcacggcg cggttggccg cgacgatcgc cggatgcacc gtcgctgccc gcgccgcctc 375421 gaacgtggcc acggctaccc gtgcctgagc ggcggcctgc tcagcctggg ccgttgccga 375481 aatcaaccag cccaggtagg gggctaccgc gcgcgccatc gcaaccgccg cggggccgcg 375541 ccacgccgca tccgccaggc ccgaggtcac cgacccaaac cacgacgccg ccaccgccag 375601 ttcgtcggct agtccgtccc aggccgccgc ggccgccaac atcggccccg atccggcccc 375661 gagatacatc cgtaacgaat tgacctcggg cgccgacacc acgaaatcca tccgtcatac 375721 ccgttcgtca gctggccgtc ggaggtacgt tcaggctaat caatcgtcta ctactcgact 375781 agcccgtgaa cgggtgaaaa atgctaggac attcacgtat tggcccgagt ggggctggtc 375841 gagtatcagg ggaagcttta tggggcaaag tcaagtttgt ggttcgtcgt atcggggcga 375901 tccaaccgag cacatgttta gtgcaccaga acgacgggcc gtgtatcggg tgatcgccga 375961 acgccgagac atgcgccggt tcgtgcccgg cggtgtggtg tccgaggatg tgctggcgcg 376021 gctgttgcac gccgcacacg ccgcgcccag cgtcggtctg atgcagccat ggcgctttat 376081 ccgcatcacc gacgagacac tcaagcgacg catccacgcg ctcgtcgacg acgaacgcct 376141 actcaccgcc gaagccctgg gagcacggga agaagaattc ctggcgctga aggtcgaggg 376201 cattctcgac tgcgccgagc tgctggtggt ggcgctgtgc gaccgcagag ggtcctacat 376261 cttcggccgg cgcaccctgc cccagatgga tctggcgtcg gtgtcgtgcg ccatccaaaa 376321 cctgtggctg gcagcgcggt ccgaaggcct gggcatggga tgggtgtcgc tgttcgaccc 376381 acaacgttta gcggccctgc tggcgatgcc cgccgacgcc gaaccggtgg ccatcttgtg 376441 cctggggccg gtgcccgagt ttccggaccg gcccgcgctg gaactggatg gctgggccta 376501 cgcgcggcca ctcgcggaat tcgtctccga aaaccgatgg agttatccgt cggcgctggc 376561 cacagatcac catcacggcg aataggtcac gccgaccgcg aggttgacgt attcggccgg 376621 cacgtcaaag gccagcgatc gccgcggcaa gcggctcaac acatcttcac cgatagtggc 376681 cttgaagtgc agcgcgagct ggccgctgaa cttcgggtcg acatcggcga catcgagatc 376741 gagttgcaga cccgtcggcg agatttcggc ggttgcggca ccggcctgcg gcccgtccca 376801 gcgcgcgtcg acgacccggc cggctagctt cggcaccatc gacagggttc ccagcacccg 376861 ctgttcggtg aacgccagcg cacccacgta gctggcgatg ctgtgcgatg cccgcagccc 376921 cgggataacg ccggtgaacc gcctggtgac ggccacgtat tcggcaaggt agatgagtcc 376981 ctcagcctca acctgacagc gtaggtcagc aggcagcctg ccgagaccaa accacttgcg 377041 aacaatgaca gccatgaggc cagtatggag tcgttttgtc ggtgccgcac cgatgctggt 377101 aggagttaga gcatgactcg cccgcaagcg cttctcgctg tttcgctcgc ttttgtcgca 377161 accgcggtgt atgccgtcat gtgggtgggg cactcccagg attggggttg gctgcatagt 377221 ttcgattggt cgttgttgaa cgcagcgcac gacatcggga taaagaaccc tgcgtgggtg 377281 cgcttctggg atggtgtatc cctgatcttg ggcccagtcg tgctgcggcc gctgggtttg 377341 ctggccgcga tggtcgcact ggcgaagcgc aagatacgga tagcgttgtt gctgttggcc 377401 tgtttaccgc tcaacgcgat catgacgatc gcggccaaat ccgtggccca ccgcccgcga 377461 ccggcgactg cgctggtatc tgcccattcg acttcgtttc cgtcagggca tgcgttggag 377521 gcgaccgcaa gcgtactcgc gctgctaacc gtcctgttgc ccatgctgca cagcaggttt 377581 actcggcaca tcgccatcac ggtgggcgcg ctgtgcgtgt tgacggtcgg tgttgccagg 377641 gtggcgttga acgtgcatca tccgaccgac gttgttgccg gctgggcgct ggggtacctg 377701 tatttcctcg tgtgcctgtg cgtatttcga ccgccgtcga tattcggtgc ccaacgcgcg 377761 tctcatgctt tgtcgccgcc agtggaggtg tcgagacaac ccgaaccgga agtcgacacg 377821 gcccgctaaa gccatggtgc gctgtgcatt tcgctttgtc accgcacagt gacccagccg 377881 gattctaacc ttgacttgac cacacgaggt gattgtctga cgattgagcg atgagccgac 377941 tcctagcttt gctgtgcgct gcggtatgca cgggctgcgt tgctgtggtt ctcgcgccag 378001 tgagcctggc cgtcgtcaac ccgtggttcg cgaactcggt cggcaatgcc actcaggtgg 378061 tttcggtggt gggaaccggc ggttcgacgg ccaagatgga tgtctaccaa cgcaccgccg 378121 ccggctggca gccgctcaag accggtatca ccacccatat cggttcggcg ggcatggcgc 378181 cggaagccaa gagcggatat ccggccactc cgatgggggt ttacagcctg gactccgctt 378241 ttggcaccgc gccgaatccc ggtggcgggt tgccgtatac ccaagtcgga cccaatcact 378301 ggtggagtgg cgacgacaat agccccacct ttaactccat gcaggtctgt cagaagtccc 378361 agtgcccgtt cagcacggcc gacagcgaga acctgcaaat cccgcagtac aagcattcgg 378421 tcgtgatggg cgtcaacaag gccaaggtcc caggcaaagg ctccgcgttc ttctttcaca 378481 ccaccgacgg cgggcccacc gcgggttgtg tggcgatcga cgatgccacg ctggtgcaga 378541 tcatccgttg gctgcggcct ggtgcggtga tcgcgatcgc caagtaaccc cggacctcga 378601 ttgtgaactg tgcgacgggt tttcggcgtg ttgcgtcgtg agattcacgt tcggcgtcaa 378661 tcggccagcg cgcggcccgg cctgatgttg aagttaaggc ccgccaacga catggtcgcc 378721 tcgtaggttc ggtcgtagcc ggtggcgctg atccgccagc cgtcggtggt tcgtcggtac 378781 tggtcgtggt agaacgcggc gccgatgagc atgaaattga actcggcgac gatgacccgg 378841 tcttgcaggt accagatgcc ggttgcggta tcgccggtca cggtgatttc cggatgggtg 378901 acccggtgtt cggtgatgac acccgggccg agtgcctggc gcaggtagtc gaccaggtcg 378961 gcgcggttgg tgaagtgcag ctccgtaccg accgatgacc cgtaatcgcc ggtgacatcc 379021 tcggccaggg tgtcggtgaa gtcgtcccaa tgcttggtgt ccaatgcccg cagataccgg 379081 tatttgagct gtttgatcgc tgcaatgtcg gctggatcac ccggagtcac cacgccattg 379141 cagcacaccg gctcacgggt agctttgggg tatgagccaa tcccggtacg cggggttgtc 379201 ccgcagcgag ctggcagttc tgttacccga gctgttgttg atcggccagc tgatcgaccg 379261 atcgggcatg gcctggtgta tacaggcatt cggccgccag gagatgctgc agatcgccat 379321 cgaggagtgg gcgggcgcca gcccgatcta caccaagcgc atgcaaaagg cgctgaactt 379381 cgagggcgac gacgtgccca ccatcttcaa ggggctacag ctcgacatcg gcgcgccgcc 379441 gcaattcatg gacttccgtt tcaccctgca cgaccgctgg cacggcgagt ttcacctcga 379501 ccactgcggt gcgctgctcg acgtggagcc gatgggcgac gactacgtcg tcggcatgtg 379561 ccacaccatc gaagatccga cgttcgacgc caccgcgatc gcgaccaacc cgcgcgcgca 379621 ggtgcgcccc atccaccggc cgccccgcaa gccggccgac cggcatccgc actgtgcgtg 379681 gaccgtcatc atcgacgagt cctatcccga ggctgagggt attccggcgc tggacgcggt 379741 ccgtgaaacc aaagctgcca cctgggaatt agacaacgtc gatgcgtctg acgacgggct 379801 ggtggactat tcgggtccgc tggtgtccga cctggacttc ggggcgttct cgcattccgc 379861 actggtgcgg atggccgatg aggtctgcct gcaaatgcac ctgctgaatc tgtcgttcgc 379921 cattgccgtg cggaaacggg ccaaagccga tgctcaactg gccatttcgg tgaacacccg 379981 ccagttgatc ggagtggccg ggctgggcgc agaacgcatt caccgtgcga tggctttacc 380041 cggcggaatc gaaggcgcgt taggtgtgct ggagctacac ccgctgctca acccggccgg 380101 ttacgtgctg gccgaaacgt cgccggaccg tctggtggtg cacaactcgc cagcccacgc 380161 cgacggcgcc tggatttcgt tgtgcacacc ggcatccgtg cagccgttgc aggccatcgc 380221 caccgctgta gacccgcatc tgaaggttcg gatcagcggg acggacaccg actggaccgc 380281 ggaactcatc gaggccgatg ccccagcgag cgaactgccg gaggtgttgg tagccaaggt 380341 cagtcgcgga tcggtcttcc agttcgagcc gaggcgctca ctgccgttga ccgtgaaatg 380401 agctcgatgc gatctgtcaa gtcggtggcg gtaccgcttc ggtgacacca ccgcatcgac 380461 cgcataccaa tgaggttgtc accgaaccgt atacggccca cccgccgcta tggttaacgc 380521 tggccaccga cccctattga cgaaagcctt ccgctatgta cgacccgctg gggttgtcga 380581 tcgggaccac aaacctggtc gcggcgggta acggaggtcc gccggttact cgtcgcgccg 380641 tgctgaccct gtacccgcat tgcgcaccga aaatcggtgt gcctagccag aacccgaact 380701 tgatcgagcc gggcgcccta atgagcggct ttgttgagcg cattggagat gcggtggcgc 380761 tggtgtctcc cgacggatcc gtgcacgatc cagacctctt gctggtcgag gcgctggatg 380821 cgatggtgct gaccgccggt gcggacgcga gttcctcgga gatcgccatt gccgttcccg 380881 cgcattggaa gcccggagct gtacacgcac tgcgtaacgg tttgcggacg cacgtcggct 380941 tcgtccgcag cggcatggcg ccgcgcctgg tttccgatgc gatcgcggcg ttgaccgcgg 381001 tgaactcgga attgggcctg ccccacggca gtgtggtggg gttgcttgat ttcggtggct 381061 ccgcgactta cgtcaccttg gtggagacca agtcggattc caggacgtcg gatttccagc 381121 ccgttagtgc cacggcacgg taccaggact tttccggtag tcagatcgac caggctttgc 381181 tgcttcgggt catcgaccaa ttcgggtacg gcgatgacgt cgatccggcc agtaccgccg 381241 cggtcgggca actcggccaa ctcagggagc agtgccgtgc ggcaaaggaa cgactgtcca 381301 ccgacgttgc cacggaattg ttcgctgagc ttgccgggtg cagctcgagc atcgagatga 381361 ctcgggaaca gctcgaagac ctgatccagg atccattgac cggcttcatc tacgcgttcg 381421 acgacatgct ggcgcgccac aacgcgagct gggcggatct cgcggcggtg gtcaccgtcg 381481 gcggtggtgc caatattccc cttgtgactc aacgtctttc gttccacact cgtcgacctg 381541 tgctgaccgc gtcgcaaccc gggtgcgcgg cggcgatggg tgcgttgctg ctcgccaacc 381601 gtgggggaga gcgcgattcg cgaacgcgga cgtccatcgg cctcgccacg gccgcagccg 381661 ccggcaccag tgtcatcgag ctgccggccg gcgacgtcat ggtcatcgac catgaggcct 381721 tgaccgatcg cgagttggcc tggtcgcaga ccgacttccc aagcgaagct ccggcgcgtt 381781 tcgagggcga ctcgtataac gaaggcggcc cctgctggtc gatgcgtctg aacgcggtcg 381841 agccccccaa aggaccagcg tggcggcgaa tccgggtgtc gcagttgctc atcggggtgt 381901 cggcggtagt ggccatgacc gcgatcgggg gcgtggcatt gacgttgaca gccatcgaga 381961 gacgcccaag cccgctacca accccaattg tgcccggcct ggccccgatg ccgcccggat 382021 ccgtcgtgcc tagctcgcgc gcaccgaccc cgccgccacc gccgtcgacc gttgcgccgc 382081 ttcccagtgc ggcaccggcc ccgacgacgg tcgcgccggc accgccgccg cccacacagg 382141 tggtgacgac cacgacagcg ccacccgtca ccacgacgcc gaggccgtcg ccgaccacca 382201 caacgaccac cgcgccaccg tcgacaacga cgacaaccga gccgccggtg acgaccactt 382261 cgacgattcc aacgattccg acgactacga cgacggtgaa gatgaccacg gagtggttgc 382321 acgtcccgtt tttgcccgtt ccgatcccgg tcccgattcc gcaaaatccg ggtgccggcg 382381 aaccgcagaa cccgttcgga agccttggct ctgggtgagc cgcgttcccc ggagctggcc 382441 ccgtcggtgt caggtccgta gtatcggtat gggttgctga ggaggtcgcg tgggcgacta 382501 tggtccgttt ggattcgatc ccgacgaatt cgatcgggtg atccgggagg ggagcgaggg 382561 actgcgcgac gcgttcgagc ggatcggcag gttcctcagc tcatccggcg cgggaacggg 382621 ctggtcggca atcttcgagg acttgtcccg gcgctcgcgt ccggcgccgg agaccgccgg 382681 cgaggccggt gacggtgtgt gggccatcta tacggtggac gccgacggtg gtgcccgcgt 382741 tgaacaggtg tatgcgaccg agcttgacgc cctgcgcgcg aacaaggaca acaccgaccc 382801 gaaacgcaaa gtccgcttcc tgccatacgg catcgcggtc agcgtcctcg acgatccggt 382861 ggacgaggcc cagtaacgtc agccctgctg gacgctgttg gaaccgccgg cattgctgat 382921 cttcggcgag cccgagtgat acgtgacctc gttgttgaag ccggcggctt cgatggtgtc 382981 gacggagtcg acggtgaccg agttcctcat gccggacacg gtgaggctgg tgcagtggcc 383041 ggtgatcacc accgtgttgg acatgccgct gacgctgaca atgctgtcgt tgcaggcgat 383101 tgtccggttc acgttgacgc cggagacgct caggctggcg ccggccggcg gaagagtggt 383161 ggccggttgc gcagtcgggg ttggaacagc ccgggagaca gacggggtgg gcgagagaac 383221 gacgaagttg ccttgggaaa gccgctgtgc gctgaatgcg gcgatgccac ccaccagaac 383281 cagcacgccg acaacgacga ccgcggccag gatccaccac gccctgttgc cggaggacga 383341 tcgcggcgat gggccgccga acgggccgcc atagctatac ggcggcggtg gcgggccggg 383401 tggataggtg tagccgcccg actgcgagcc gccgagttcg gaggcgcgtg ccacgtcggc 383461 tagcggccgc tccagttccc ggattcgcgc ctccgggtca tcctctgggt tcatgcacag 383521 atgctcccac acgacgatca tgccgcatag gtagttgcgc ccggcggcac cacacgattc 383581 ggcttggcct gctatcgtcc catgcttatg cctgagatgg atcgtcgccg aatgatgatg 383641 atggcggggt tcggcgccct ggctgccgcg cttcccgccc cgacagcctg ggccgacccg 383701 tcccggccgg ccgcgccggc tggtccgaca ccggcgcccg ccgcgccggc tgcggcaacc 383761 ggtgggcttt tgttccacga cgagttcgac gggccggccg gttcggtccc ggacccgtcc 383821 aagtggcagg tgtcgaacca ccggacgccc atcaagaacc cggtgggctt tgaccggccc 383881 cagttttttg ggcagtaccg cgacagtcga cagaacgtgt tcctcgacgg caactccaat 383941 ctcgtgctgc gcgctacccg agagggcaac aggtatttcg gtggcctggt ccacggcctg 384001 tggcggggtg gcatcgggac cacctgggag gcccggatca agttcaactg cctggctccg 384061 ggcatgtggc ccgcctggtg gttgtccaat gacgatcctg gtcgcagcgg cgaaatcgac 384121 ctgatcgagt ggtatggcaa cgggacttgg ccgtcgggaa ccaccgtgca cgccaacccg 384181 gacggcaccg cattcgagac ctgcccgatc ggtgtggacg gtggttggca caactggcgc 384241 gtcacgtgga atccgagcgg catgtacttc tggctggatt acgccgacgg cattgagccc 384301 tacttctcgg ttccggcgac cggaatcgaa gacctcaacg agcccatccg cgagtggccg 384361 ttcaacgacc ccggctacaa ggtgtttccg gtgttgaacc ttgcggttgg cggttctggt 384421 ggcggcgatc ccgcgacggg ttcctatcca caggagatgc tcgtcgactg ggtgcgcgtc 384481 ttttaacgcc tcgcgctctt gcccggggtg ctacccggct tgctcggaga aagcatggag 384541 tttttggtca ccatgaccac ccgcgttccc gatagcatgc ccgcggacgc agtcgagcgg 384601 gtccgtgccc gcgaggctgc ccgctcgcgc gagctcgcgg cacagggaaa gctactccgc 384661 ctgtggcgcc cgccgctgcg gccgggcgaa tggcgcaccc tggggctgtt cgccgccgac 384721 gacaacggcg aactggagca gctgctggcc tcgatgccgc cgcggtcgtg gcgcaccgac 384781 gacgtcacgc cgctgggtgc tcacccgaac gacccggttg gccaggggat aaccatcgcg 384841 ccgggtaagg gtccggagtt tctgatcgcg acgaccatta tggtgccacc gggtaccccg 384901 gctcaggtgg tcgacgacac cgtggcgcgc gaggctcgcc gcgcgcccga gctggccggg 384961 cggggacacc tggtgcggtt gtgggcacta cccgacggac cggacggcca gcgcaccctg 385021 gggctgtggc gggctcgcga ccctggcgag ctgatggcca tcctggaatc gctaccgctt 385081 gctggctgga tgaccatcga gaccacgccg ctgagtccgc atcccgatga tccgatccgc 385141 atgccctgac cgtttccggt gtcgccgggc tcttaggcgc cgtcccactc gccgcgggcg 385201 atgagaacat cacgaagtag gtccgcgcga tcggtgatga tgccgtccac gtccatgtcg 385261 agaagggtgt gcatcacatc gggttcgtcg acggtccagg catgcacttg gcgtcccgca 385321 gcatgaaagc cgcggacccg tgccggcgta atgaccggta caccgccaag ccgtgacggt 385381 agttgcacgc agtcgatgtc gcgcatcatc cgccaggcat atgcccggct gcccagcgga 385441 cgcgcggtca gccacgccag cagcgcgccc gttcctgccg aactagcgac ccgcttggtc 385501 agcaggcgca atgcgcgccg gcgacggcgc tcggaaaacg aaccgatcag cacccggttg 385561 tgcgcgttgc accgctcgat gacgttgacg gtcggctcga tcgccgatgc ggctttaatg 385621 tcgatgttga cccgcatgtc tggcagcgcg gtaagcaggt cttccagggt tgggatcgac 385681 tgccccgcac ccagctgcgc cttgcggaca tcacgccaat ccaaccggtc gaccgcgccg 385741 gataacccca ccccgggcgc cagcctacgg tcatgcagga tcacggctac gccgtcccgg 385801 gtggcgcgaa cgtcggtctc gatgtagcgg aatccgagct tggccgcctc ctggaacgcc 385861 cccatgctgt tcatgggcaa tctgaacgac gtaaatcctc tgtgcgccat ggcaatccgc 385921 cccccatggc gaagaaattc cacggtaggt gcgccaccgt cgctcatcag gtcagtatca 385981 catagcctcg gccgccgggg gcgtccacgc cgggggcagc accgctctgt cggcgacggt 386041 tcctgtgcac cagccgcttt cgcatcgcag tggtaatggg cgctcccata cggcgcggtc 386101 ggcgacgacg gtgcatgggc cggccatcgt tttgggcctt ccccgccttg ccgccgggcc 386161 accgacgttc agcatcacca tcagcgtcga cgtgtcacat cggagccgat gacgggaatc 386221 gaacccgcgt attcagcttg ggaagctgat gttctgccat tgaactacat cggcacggtt 386281 gcctcgaaag gctagcatcc agaatcattc catcaccccc aggccgtaca agatcagaaa 386341 tccggcaaag aacatcacgg ccatccatgg ccgctccggc gtttcgtgcg cctcgacaag 386401 cagttcctcc acgaccagcc agagcagcgc ccccgccgcg aacgccaaca cgagggtcag 386461 gacggtattt cccgcccggc ccagcgccac ggcacctgac acaccgccca ccgcgatcac 386521 taggctcagg gcgcttgtgg tcgccgcggc ccggatccta ggcattccgg agccggccag 386581 gcgcagggcc accgccagac ccaggaacag cacctcgacc gtcagggcga tggtgatgat 386641 gatcgcggtg cgactggaca ccgtcgcgcc cgttgcgacc agcaacccgt cgatgaagag 386701 gtcaaccgcg actacggtga ggaacccgac gggcagttcg cccacgtcgt cgccgtcttg 386761 atgttccccg tggccgtcaa atcggcgcag tgcaacgagt accgcgacgc ctgcactgaa 386821 gcccacaacg atcagccaga gcggacctct gctgcgcagg tctggtagca cttccccggc 386881 cacggcggcc atgacaattc ccgcggcgaa atgttggacg ccgctgacca tcgccgccga 386941 cggcgtgcgc accgacggga ccacgccgcc gagaatcccg gcgagaaccg ggaaggtgac 387001 caacgaggcg gccgttgtga cgttgctgat gccaacctcc cggtttcggt cgaagatctc 387061 ggctcgggca cgcttgaaca ttgtgacggc tagtgacaaa tgcagcgact ttcggggaaa 387121 cgggcattga aataaggaag gaacagcatg tcgaaggtgc tggtcaccgg attcggaccc 387181 tacggcgtga cgccggtaaa tccggcacag ctcaccgccg aagagctgga tggtcgcacc 387241 atcgccggcg caacggtcat ctcgcggatc gtgcccaaca cgttcttcga gtcgatcgcg 387301 gcagctcagc aggccatcgc agagatcgag ccagcattgg tgatcatgct gggcgaatac 387361 ccgggacgca gcatgatcac cgtcgagcga ctcgcgcaaa acgtcaacga ctgcgggcgg 387421 tacggcctcg ccgactgcgc cggcagggtt ttggtcggtg agccaaccga ccccgccggc 387481 ccggtcgcct accacgcgac cgtaccggtt cgcgcgatgg tgctggccat gcgaaaggcc 387541 ggcgtgccag ctgacgtctc ggacgcggcg ggcacgttcg tgtgcaatca cctcatgtac 387601 ggcgtgctgc accacctcgc ccagaagggt ctgcccgtcc gcgccggttg gattcatctg 387661 ccgtgcctgc ccagcgtcgc cgcactggat cacaacctcg gtgttccgag catgtcggtc 387721 cagacggcgg tcgccggggt cacggctggc atcgaggcag ccattcggca gtccgcagat 387781 atccgcgaac cgatcccgtc gcgattgcag atctagggcg cagctgacgg cggtcttcta 387841 gagattagat atttattctt ccgttatctt gtcgtaatct gctcagcgtg ggccgacatg 387901 aattagctag ggaccggcga aagtcgtcag cggtcctggc tgcggtcctc gccccggccg 387961 ccgtgttctt cgccacgggc ggagatgtca gtacgcttgc cgcccgcgcc gatgccaacc 388021 cggttctcgg cgacgacgcg ccctgttgtg tgcagatcgt gccggttgca ccgctggctt 388081 tctcctcaca gatatccggc ggtgaaatcg ggacgggcct tgctgccagc cagttcgctt 388141 cggcatcgag atggcgcatc gtatctcggt atttgccggt aggggtggca cccgagcagg 388201 gtctacaggt caagaccgtc ttgacagccc gcagtatcag tgcggctttc cccgaaattc 388261 gcgaaatcgg cggcgttcgg ccggatgcgc tgagatggca tcccaatggt ttggcgctcg 388321 acgtgatggt tcccaacccc ggcaccgccg agggcatagc gctgggcaac gagatcgtcg 388381 ctttcgtact gaagaacgcg acccgatttg ggatgcaaga tgtgatttgg cgtggcgcct 388441 actacacgcc caacggcgcg cggacaaccg gggccggcca ctacgaccac atccacatca 388501 cgaccgtggg cggcgggtat cccaccggcg aggaactcta catccgctga gccagcgtgc 388561 ggcgacagat acgctcgtcg ggtgctgctc tccgatcgtg atcttcgggc cgagatctcc 388621 tccgggcggt tggggatcga cccgttcgac gacaccctgg tccagccgtc cagcatcgac 388681 gtccggctcg attgcttgtt tcgggtgttc aacaacactc gctacaccca catcgacccg 388741 gccaagcagc aggacgagct gaccagcctg gtgcaaccgg tcgacgggga acccttcgtg 388801 ttgcacccgg gcgaattcgt gctcggctcg acgctggagc ttttcactct gcccgacaac 388861 ctcgccggac ggctggaagg caagtcttcg ttgggccggc tgggcctgct gacgcattcc 388921 accgcgggct tcatcgatcc tggcttcagc ggtcacatca ccctggagct atccaacgtc 388981 gccaacctgc cgatcacttt gtggcccggc atgaaaatcg gtcagctgtg catgttgcgc 389041 ctgaccagcc cgtccgagca tccctacggc agttcccggg cggggtcgaa ataccagggt 389101 cagcgcgggc ccacgccgtc gcgctcctac cagaacttca tcaggtctac ttagcatccg 389161 gcgcggctag gcctgtcgcg ggtagctgtc acctgccgtt tgcctggtgc tcagcgccgc 389221 gatgcggttc gctcatcgca gccacctaca cacagtggtg tgcgatgcag cgtcttcggc 389281 actgggtatc tgggtgccac ccacgccgtc ggtatggcgc aactgggaca cgaggtcgtc 389341 ggggtcgata tcgatcccgg taaggtcgcc aagctcgccg ggggtgacat tccgttctac 389401 gaacccggcc tgcgaaagct gttgactgat aacctggctg ccggccgctt gcggttcacc 389461 accgactacg acatggcggc cgatttcgcc gacgtgcatt tcctgggggt cggcacgccg 389521 caaaagatag gcgaatatgg cgccgacctg cggcatgtcc acgccgtcat cgatgcgctg 389581 gtgccgcgtc tggtcagggc gtcgattctg gtcggcaagt cgacagtccc agtgggcacc 389641 gcagccgaac tgggacatcg ggccggtgca ctggcacccc ggggagtcga cgtggaaatt 389701 gcctggaatc cggaattcct gcgcgagggc ttcgcggtgc acgacaccct caaccccgac 389761 cgtatcgtcc ttggggtaca agatgattcg acgcgcgccg aggtagccgt ccgcgagctg 389821 tacgcgccgc tgctggcagc gggcgtgccg tttctggtga ccgatctgca gaccgcggag 389881 ttggtcaagg tatccgccaa tgcctttctg gcgaccaaga tttcgtttat caatgcgatc 389941 tccgaagtgt gcgaggcggc gggtgccgac gttagccagc tggccgatgc gctcggatac 390001 gacccgcgga tcggacgcca atgcctcaac gcgggcttgg gtttcggcgg cggctgcttg 390061 cccaaggaca tccgcgcttt catggcccgc gccggcgaac tgggagccga ccaggcgttg 390121 acgttcctgc gtgaagtgga cagcatcaac atgcgccggc gcaccaagat ggtggaactg 390181 gccaccaccg catgcggtgg ctcgttgctg ggcgccaata ttgcggtgct cggcgcggcg 390241 ttcaaacccg aatccgatga cgtgcgcgat tcgcccgccc tcaatgtggc gggccagctg 390301 cagctcaacg gcgccacggt ccacgtgtac gatccaaagg ccttggacaa cgcccaccga 390361 ctgttcccta ccttgaacta tgcggtttcg gttgcggagg cctgcgagcg cgcggacgcc 390421 gtgttggtgc ttaccgaatg gcgggagttc atcgatctcg aacccgctga tctagccaac 390481 cgggtgcggg cccgggtgat cgtggacggc cgcaactgcc tcgacgtgac ccgctggcgg 390541 cgggcaggct ggcgggtgtt ccggctggga gtgccgcgat tagggcactg accggcgcag 390601 ccagcgcaag tactctcggt caccgagcag ttccagacga cgccacagca cggggttgtc 390661 ggcggactgg gtgaaatggc agccgatagc ggctagctgt cggctgcggt caacctcgat 390721 catgatgtcg aggtgaccgt gaccgcgccc cccgaaggag gcgctgaact cggcgttgag 390781 ccgatcggcg atcggttggg gcagtgccca ggccaatacg gggatactgg gtgtcgaagc 390841 cgccgcgagc gcagcttcgg ttgcgcgacg gtggtcgggg tggcctgtta cgccgttgtc 390901 gtcgaacacg agtagcaggt ctgctccggc gagggcatcc accacgcgtt gcgtcagctc 390961 gttgagcggg atctgcgcta gaccgttatc cgggtatgcg agtagttgca catgatcgac 391021 acccaggacc tgtgccgcag cggcgagttc ctcccggcgc acctcaccga ggtttcggtc 391081 ggtccggccg agtgtggagg cctcgccgtg ggtgaagcac aatcctcgca gccgcgttcc 391141 ctgcgccgtg aaatcaccca ataccgcccc gagcccgaag gactcgtcgt ccggatgggc 391201 gaacacagca agcacttcgt gtgcgcaggg gagacggttg cagctgttca tcgattcacc 391261 gtccggagga tccgtgcgcg cgggtggaca gccgccgcat attatgtagt tccaatgagc 391321 aatggaatta tattcccaag gatgactgga aatggctgga cagtccgatc gtaaggcggc 391381 gttgttggac caggtagcgc gcgtgggcaa ggcgctggcc aatgggcggc gattgcaaat 391441 cctggacttg ctcgcccaag gtgagcgcgc ggtagaagcg atcgcgacgg cgaccgggat 391501 gaacctgacc acggcatcgg cgaatctgca ggcgctgaag agcggcgggc tggtcgaggc 391561 tcgccgcgag gggacccggc agtactaccg gattgctggg gaagacgtgg caaggctgtt 391621 cgcgctggtg caagtggttg ccgacgagca tctggccgac gtggcggtcg cggccgcaga 391681 cgtgctcggt tcgccggagg atgcgatcac ccgtgcggag ctgctgcggc ggcgcgaagc 391741 cggcgaggtc accctggtcg acgtgcgacc gcacgaggaa taccaggccg gccatatccc 391801 gggcgccatc aatatcccga tagccgaact ggccgaccgg ctcgccgaac taactggcga 391861 ccgcgacatt gtcgcctact gtcgtggtgc ctactgcgtc atggcccccg atgccgtccg 391921 catcgcgcgc gacgcggggc gggaggtgaa acgcctcgac gacggaatgc tcgaatggcg 391981 attggccgga ctgccggtcg acgagggtgc accggtcggg catggggatt gatcgcccgt 392041 ggggccgaag ggaagtctac gtttggtgaa gcggcagcca gaactgctcg ttgcccagca 392101 tgaacactgg caggacacct accgagcgca tccggtgctg tacggaaccc gcccgtcaga 392161 gccgggggta tatgccgccg aggtgttcaa tgccgacggc gtgcagcggg tgctggagtt 392221 ggcggccggt catgggcgtg acaccctgta tttcgctggc tagggcttca cggtggtggc 392281 caccgatttc agcgacgttg ccgtcgcgca acttcgccga agtgcccaag cgcgcggggt 392341 ctccgcgcgg gtgcaaccga ttgtgcacga tctgcgccag cctctgcccg tcaaaaccgg 392401 ttccattgac ggcgcctttg cacacatggc gttgtgtatg gcgttgtcca ccagcgaaat 392461 tcatgcagtc gttgccgagg tcggccgggt gttgaggccg ggtggaaagt tcatctacac 392521 cgttcggcat accggcgatg cgcactacgg cgccgggcag gcccacggtg acgacatctt 392581 cgagtgcgca gggttcgcag tgcacttctt ccgccgtgag ctggtagcgc gcctggctac 392641 cggttgggta ctcgaggagg tacacgattt cgaggaaggt gagctgcccc ggcggctatg 392701 gcgggtcact gtcaccaagc ccgcctagcc ggcgctgtgg gatcagccgc aggtgtgcac 392761 cgtgtttggg gacggtggtg atgttgcgca ccaacggagt ctcgcctttg gacgggccgg 392821 cggcggtgat ggtgaagcgg cggaaaatct cttgcaggat gaccgctccc tcggtgaggg 392881 cgaacccgaa gccgaggcat cggcgcacac cgccgccgaa tggcagccag gtgttgggtg 392941 ccacgctgcc gtcaaggaac cggctaggac gaaactctgt gggtttgggg tgcgatacct 393001 cgctggcgtg ggccaacagg atcgacgtgt tgaccaccgt ccccgctggc agtcgccaac 393061 caccgatctc tgccggcgcg gtgaccttgc gagcggtaga agcgatgacg gtgtgtcggc 393121 gcattccttc cttgaggacg gcctccaaga atccgtcgtc accgccgacg gcagcccaga 393181 ctacttggct ttggatttcc ggagcatggg caagttccca caacgtccag gacagggcgg 393241 cggcggttgt ctcatgaccg gccagcagca acgtgatgag ctggtcgcga agctcggcat 393301 cggtcagcgg cttagtaggc gtgtccttgg tttgcaaaag tctggatagc acgtcggttc 393361 gggcggtgag atcggaatcg atacggcggg aggcgatctc gcggtagagg atctcgtcta 393421 tcttggtttg gttatggaag aagcgcttcc agggattcat ccgcttgagc gacgggtacg 393481 gaacgcccgc gagaatcgcg ggatggatgt ttatgatctg ttgcagccga ctagtcaact 393541 cggccttgac ttttgggtca gtgaccccga aaacgacccg caggatgatg tcgagggtga 393601 gcgcattcat gtggtcaaga ctgttgatcg ttgcgtgggg ccgccagcgc gtgatgtgtt 393661 cacgcgcaac ggaggcgatc atgtcgcggt atccgcgcag cgcggcgcgg gtgaacgcgg 393721 gcatgagcag cgatcgcatc cgcgcgtgtt cggcttcgtc ggtcatcaat accgagtgct 393781 cgcccatgac aaaaccaagg atgtggttgc cttcgcccgc gtgcagcgac ctcgggtcgg 393841 ccgcgaagat ctctttgatg tgttcggggc gggtatagac cacgaggttg tcggcatatg 393901 ggggcacccg caaggagaac acgtcgccgt acttgcgatg catcgctggc aggaaccatt 393961 cccgaaacct caggtacagc acgctctgca ggtagcgggg tagccgcggc ccgggtggca 394021 ggcccgtcgt caacgtgctt gccatggcgg ctcccttctg ataatcaaat gtttgatgta 394081 aacgaatgct tatcacgata ggatgcagct gtgcaacagc aacgcacaaa ccgcgacaaa 394141 ctgctcgacg gcgctctggc ttgtttacga gaacgcggct acggcaacac cagctcgcgc 394201 gacatcgctc gtgcggcagg ggtgaacatc gcgtcgatca actaccactt cggtagcaag 394261 gacgcgctgc tcgacgatgc gctcggccgg tgcttttcga cgtggaacca gcgtgtccag 394321 gaggcattcg atcactcccg cgccgccggt ccggccgggc agatcctggc ggtactcgaa 394381 gccaccgtcg attcgttcga gcagatccgc cccgccgtgt atgcgtgtgt ggagtcatac 394441 gctccggcgt tgcgctcaga ggccttgcgg gagcgcctgg ccgccggata tgccgacgtt 394501 cggcagcatt cggtcgatct ggctggcgct gcgcttgccg gtaccgacat agcaccgccg 394561 gagaacctgt cgaccatcgt ctcggtgttg atggcggtca tcgatggcct catgatccag 394621 tggatcgccg atccgtccgc caccccgcga tcgaccgagg taatccgagc gcttgccagc 394681 atcggcgcgg tcgtcacgtc gcagttgcgg tgaaccacac ggtcgccgga tggtctgcac 394741 tgcgcttgat gccgacgtcg atgaagccgg cagcgccaag ccacgcggcg gtgtcgaggg 394801 tggggggcac gcgatagatc gcgggatcga aacgggccgc cagtggttgg tcatcggata 394861 tcgatgtaag gacgagtcga cccccgggcc gcagggctcg agcgatgtcg caaaggctgg 394921 cgcggggatc gggccagaag taaaagttgt gcacgccgag caccttgtca aggctgtggt 394981 cggcaaccgg cagggttact ccatcgccgt gataaagcga gatcaggccg gctgcaatgg 395041 ctttcgcgtt gtgatgggcc gcgattgcga tcatggtcgt cgacacctcg acgccgctca 395101 cttgcgcgcc ggcggcggcg agcagcccaa gggttcggcc ggggccaaag ccgatctcgc 395161 aaacccgctc gcccgggccg ggcgcgagca gctcgacggc gatgcgattg acgtcggcgg 395221 tctcggctcg ccagatccgt cccagtaggc ggccgaacgc gcctgttggc cgggcagcct 395281 gactggatag gtaccgtcgg gccggatgtg tgaggcgcat ggggacgacc tttcggttgc 395341 aagcggttag tccgaagaag ctgtggtggc ccgaacgaca aactcggcga gggtcgcagc 395401 gatcgcatcg tcatcgatca cgccaggttg cacgatcgac caaggctccg gtgacgggtc 395461 gaagtgaaga tgcaccgccc acaacgcgca caactcgacg atggtccggg ccaccatcgg 395521 tgccggcccg ggcaggatca gaaggccggc gcgctcgcgg tgcactaggt atgcctggac 395581 cgcatcgact tgggcgttcc ggccggtgcc gaaccaaacc tcggcgaggt cgggtagctc 395641 gggggcacag cggtcgacca gtttgagcgc gatccggtgc cgggccaggc ggctgtagag 395701 gtcggtgacg ataccggcga gttctgctcg cgcgtctcca gtcgtcgcac ccggcggcaa 395761 agtcgctcgc aacgcgtgcg tgagccgcat gtcggtgacc tcgccagcca gtcgggccga 395821 caccacagct gcgatctcgc ccgcaacggg agcggccacc ggcagttcgg atgccagcgg 395881 aagggcttcc tgagcgtcgc cgtagcgcac cgccgccgcg aacagcgcag ccttgccctg 395941 ggcgtagcca tacagcgtgc ctttggccag ggcgagtgcg tcggccacgt cctgcacctg 396001 ggtgcgctgg taaccgtggg cgatgaacac ccgcgccgac gcggcgacaa tcgcggaaaa 396061 ccggtccgcg ggaatgctgc gggccatggg ccgataatag tttgactgac tcggtcagtc 396121 accccaagac cttgcgcaag actgcggcgg aatctaatat tccaaagata tatggaactc 396181 gatgcgaagg aatcaggctc atgagcaaga cggttctcat ccttggcgcg ggtgtcggcg 396241 gcctgaccac cgccgacacc ctccgtcaac tgctaccacc tgaggatcga atcatattgg 396301 tggacaggag ctttgacggg acgctgggct tgtcgttgct atgggtgttg cggggctggc 396361 ggcggcctga cgacgtccgc gtccgcccca ccgcggcgtc gctgcccggt gtggaaatgg 396421 ttactgcaac cgtcgcccac attgacatcg cggcccaggt agtgcacacc gacaacagcg 396481 tcatcggcta tgacgcgttg gtgatcgcat taggtgcggc gctgaacacc gacgccgttc 396541 ccggactgtc ggacgcgctc gacgccgacg tcgcgggcca gttctacacc ctggacggcg 396601 cggctgagct gcgtgcgaag gtcgaggcgc tcgagcatgg ccggatcgct gtggctatcg 396661 ccggggtgcc gttcaaatgc ccagccgcac cgttcgaagc ggcgtttctg atcgccgccc 396721 aactcggtga ccgctacgcc accggaaccg tacagatcga cacgttcacg cctgacccgc 396781 tgccgatgcc cgttgcaggt cccgaggtcg gcgaggcttt ggtctcgatg ctcaaggatc 396841 acggtgtcgg cttccatcct cgcaaggccc tagctcgcgt cgatgaggcc gcaaggacga 396901 tgcacttcgg tgacggcacg tccgaaccgt tcgatctgct tgccgtggtc cccccgcacg 396961 tgccctccgc cgcggcgcgg tcagcgggtc tcagcgaatc cgggtggata cccgtggacc 397021 cgcgcaccct gtccactagc gccgacaacg tgtgggccat cggcgatgcg accgtgctga 397081 cgctgccgaa tggcaaaccg ctgcccaagg ctgccgtgtt cgccgaagcc caggccgcag 397141 ttgtcgccca cggcgtcgcc cgccatctcg gttacgacgt agctgagcgc cacttcaccg 397201 gcacgggcgc ctgctacgtc gagaccggtg atcaccaggc agccaagggc gacggcgatt 397261 tcttcgctcc gtcggcgccc tcggtgacgc tgtacccgcc gtcgcgggag tttcacgagg 397321 agaaggtcgc acaagaactg gcctggctga cccgctggaa gacgtgacac gccggtgggc 397381 gcggccccct accacggctc ctaccggcgc ccctgaaaca ccagactgtg gataaccgct 397441 gttgcgcaag cctgctagta gcctcgccaa ggtggactac tcgtcggcat acctggagca 397501 gacccacgcc ttcggcgaac tgatccgcaa cgtcgatcaa tccaccccgg tgccgacctg 397561 cccgggctgg agcctgggtc aactattccg ccacgtcggg cgcggggacc gctgggcggc 397621 gcagattgtc cgcgatcgac tcgaccattt cctcgatcca cgcagcgtcg agggcggtaa 397681 gccaccgccg gaccccgacg acgcgatctc ctggctgtac ggcggggcgc ggctgctggt 397741 cgacgctgtg gaacaaacgg gtgtggaaac gccggtgtgg accttcctcg gaccgcgccc 397801 ggcgggctgg tgggttcggc ggcggctaca cgaggtcgca gtgcaccgcg ccgacgtggc 397861 gatcaccgtc gggggcgaat tcacactgga accgaacgtg gcagccgacg ggatcagcga 397921 attcctggag cgcatagcgg tccaggccgg cagcggcggc acgccattac cgctcgaaga 397981 cgacgacacc ttacatctgc acgccaccga tccggggctt cttgaagccg gcgaatggac 398041 ggttcgtcgc gacgagcgcg gcgtcacctg gtcgcatcgg cacggaaagg gtgccgtggc 398101 actgcgtggc ggcgccaccg agctgctgct ggcgatggtg cgccgactct cggttgccga 398161 caccggcatc gagctgttgg gggatgccgg ggtatggcaa aaatggctgg atcgcacgcc 398221 gctgtagccg ccgcacacgg taactttcag accatgacca catcggagat cgctaccgtg 398281 ctggcctggc acgacgccct caatgccgcc gacattgaga ccctcgtggc gttgtctact 398341 gacgacatcg acatcggtga cgcgcacggg gctgtacagg gccacgatgc gctgcgcggg 398401 tgggccagct cgctcaccac aaccgcagaa cttggccgca tgtacgtgca ccacggagtc 398461 gtggtcgtcg aacaaaagat caccagcggc gaagatccgg gcatcgccag gaccggcgcc 398521 gcggcgttcc gtgtggtcca agaccacgtc gcatcggttt tccggcacga agacttggcg 398581 tcggcgctgg cggccaccga actcaccgag gacgatttgg tcgattgagg tcggcgaacg 398641 gcagttagga gccagttatg cgcgggatca tcttggccgg cggttcgggc acccggctgt 398701 acccgatcac catggggatc agcaagcagc tgctgccggt ctacgacaaa ccgatgatct 398761 actacccgct caccacgctg atgatggctg ggatccgaga cattcagttg atcaccaccc 398821 cgcatgacgc gcccggcttt catcgactcc tgggcgacgg cgcgcacttg ggagtgaaca 398881 tcagctacgc cacccaggat cagcctgacg gtctggcgca ggcgttcgtc attggcgcca 398941 accacatcgg cgccgattcg gtggcattgg tgttggggga caacatcttc tacggcccag 399001 gtctggggac cagcctgaag cgcttccaat ccatcagtgg tggagcaatt ttcgcctatt 399061 gggtagccaa cccgtcggcc tatggtgtcg ttgagttcgg cgccgagggc atggcgctgt 399121 ctctggagga gaagccggtg accccgaagt cgaattacgc ggtgccgggc ctgtatttct 399181 atgacaacga tgtgatcgaa atcgccaggg gtttaaagaa atcagcgcgc ggggagtacg 399241 agatcaccga ggtcaaccag gtctacctca atcagggtag gttggcggtc gaggtgctgg 399301 cccgcgggac agcgtggctg gacaccggga cattcgactc gctgctggac gccgccgatt 399361 tcgtccggac cctggagcgt cggcagggcc tgaaggtcag catccccgaa gaagtggcgt 399421 ggcgcatggg ctggatcgac gacgagcagc tggtgcagcg agcccgtgct ctggtcaagt 399481 ccggatatgg taactacctg ctggagttgt tggagcgcaa ctgatttcgg cgggttattg 399541 tcggtgatta tggaaccccc tggtagcccg tcctggatga gcagcccacc ggaccagcca 399601 ttgccgaaca gcccgccgtt ggcgccgttg gcgatcagcg ggccccaaca gcgcctgggt 399661 cggcgcatcg gcggtggtct cggcgctggc acacgagccc gcacccacgt tcaggttctg 399721 tgcaaactgg ccatggaacg ccgccgcctg attgttgagg gagtgatgcc gccgaccgtg 399781 tgcggaaatc agtgccgcga cggccgccga cacctcgtct tcggccgccg ccagcacgcg 399841 ggtcttgtgg cgcttcggcg ggaagttgct gatccgagat gctggcggct ggtttccttg 399901 tggtggcctg ggccgggtgg tggcgcacag tgggcccggt ggggtcgcgg ccggccgggc 399961 aagaacgctg cgccctggcc gggccatgag cggagccggc aagctcgacg gcgcccggca 400021 tgcgcggtgc aagaacccca tggaccgcac cgagtgccgt gctcgccctc ggcggctacc 400081 gagccggtgt ctccctagtc atccacgtta tccacagcgc cttgggttac cgggcgccgg 400141 tcgggtagcg atggtagtat cgaaagtatg ttcgatcagg tgcgggggcg catgccttca 400201 ccggaggcga tcgctcattt tgatgagcgg tttgaatgcc atgctccgcg gaccacgagg 400261 gtgtcggcgg cgttcatcga tcggatctgc tcggcgactc gggccgaaaa ccgggccgct 400321 gcggcgcagt tggtggcgtt gggggagttg ttcgcctatc ggtggtcgcg ttgcgggggc 400381 cgcgaggagt gggtgatgga caccatggcg gcggtggccg ccgaggtggc ggcggcgttg 400441 cggatcagtc agggtctggc ggccagccgg ttgcggtatg cgcgggcgat gcgtgagcgg 400501 ctgcctaaga cggctgaggt gtttagcgcc ggcgacatcg gctatctgat gtttgccacg 400561 attgtgtatc gcaccgactt gatcgttgac cctgatgttt tggcggcggt ggatgcgcag 400621 ttggccgcca atgtggcgcg ttggccctcg atgaccaagg cccgcctggc tgggcaggtc 400681 gataagatcg tggcgcgtgc cgatgccgat gcggtgcggc ggcgcaagga gtatcaggcc 400741 cagcgccagt tctgggtcgg ggaaagccaa gacggtgtgt gccagatcgg tggcagcctg 400801 ttggccgtcg acgcacacgc cctcgatgcg cggttgagcg cgttggcggg caccgtgtgt 400861 gagcacgatc cgcgcagccg tgagcagcgc cgcgcggacg cgttgggggc gttggcgggc 400921 ggggccgatc ggctgggctg tggctgtggg cgcgctgatt gtgcggccgg gaagcggcct 400981 gcggccccgc cggtggtgat tcacctgatc gccgaggcgg ccacgatcaa tggcacgggc 401041 tcggcgccgg catcgcagat gaacgccgac gggctgatca ccgccgaact ggtggccgag 401101 ctggccaaga cggccacgct ggtgccgctg gttcatcccg gcgatgcgcc gcccgagccg 401161 gggtatgcgc cgtcgaaagc gctcgccgat ttcgttcgct gccgggatct gacgtgtcgc 401221 tggcccggct gtgatgagcc cgccaccaat tgcgacctgg atcatacgat cccgtatgcc 401281 gctggtgggc ccacccatgc gtcgaacctg aaatgttact gccgtaccca tcacctggtg 401341 aaaacgtttt ggggatggcg tgatcaacag ctacccgacg gcaccctgat tttgacctcc 401401 ccgtccgggc atacctatgt cagcaccccg ggcagtgcgc tgctgttccc cagcttgtgc 401461 cacttcagcg gcggcatccc ggcaccggaa gccgacccac cctacgacca ttgcgaccag 401521 cgcacagcga tgatgcccaa acgccggcgc acccgcgccc aagaccgggc ctatcgcatc 401581 gccaccgaac gtcgacaaaa ccacgccgcc cgccagcgcg cccaggtgct cacccagacc 401641 gccgcggcca ccgacaccca cggcccacca ccggatccca acgacgaccc accgccgttt 401701 tgatgtggaa cggcctgtca agtggccgat tagtgcttgt tgcctcgggg ttgtttgggg 401761 tttctggctt tgatccgatg acgggaccct gcggcgctcc ctcgacgccg ccgcgccggc 401821 ttaagggcgc ccggccgcgc tgccaccccc agggcatcac gtgcgtcggc tgctattgcc 401881 ggtaactgac caggaagtta cccagccgct cgatggcggc cgccagatcg cgggaccatg 401941 gcagcgtcac caggcgcaga tgatccggtg cgggccagtt gaacccggtg ccctgggtga 402001 ccaggatctt ctccgacagc agcagatcga gcacgagttg ctcgtcgtcg tcgatgtcgt 402061 agacctcggg gtctagccgg ggaaacgcat acagcgcgcc cgccggtttg acgcacgaca 402121 cccccgggat ctcgttgagc ttggtccagg cgatgtcgcg ctgctcgagc agccggccgc 402181 cgggcagcac caggtcctcg atgctctgat ggccgcccag tgcaacctga atggcatgct 402241 gggccgggac atttgggcac aaccgcatat tggccagcag gccgatgccc tcgatgaagc 402301 tgctggcgtg ctccttgggt ccggtgatcg ccagccagcc ggcccggtat ccggcgacgc 402361 ggtaggcctt cgacagccca ttgaaggtca ggcacaacat atccggggcg atcgatgcca 402421 ggctgatgtg cttggcgtcg tcgtagagga ttttgtcgta gatttcgtcc gccaacagca 402481 gcagttgatg cttgcgggcc agatcgacca tctgggtgag gatttcgcag ctgtacaccg 402541 cgccggttgg gttgttgggg ttgatcacga ccagcgcctt ggtgcgctcg gtgatcttgg 402601 attccaggtc ggcgatatcg ggctgccagc cttgggtctc atcgcacagg tagtggaccg 402661 gagtgccgcc agccagcgag gtcgacgccg tccacagcgg gtagtccggt gatggaatca 402721 gcacctgatc gccgttgtcc agcagggctt gcagcgtcat cgtgatcagc tcggagaccc 402781 cgttacccag gtagacgtcg tccacgtcga atcggggaaa tccgggcacc agctcgtagc 402841 gcgtgaccac cgcacgccgg gccgacagga tgccctgcga gtcggagtac ccctgcgcgt 402901 agggcagcgc ctggatgata tcgcgcatga tcacgtcggg tgcttcgaag ccgaacggcg 402961 ccgggttgcc gatgttgagt ttgaggatgc ggtgaccttc ggcttcgagc cgcgcggcgt 403021 gctggtgcac cgggccgcgg atctcgtaca ggacgtcctg cagcttggcc gactgagcga 403081 aggcgcgctg ccgctgatgg ctggcggtgt gccagggcag ctggtgggtt gtcacgtcca 403141 caatggtgcc atcgttgtcc actggaattt gctgtcaggt gccaaatcgt gatcagcgtt 403201 tgcccggtgg acgggccccg cgcgcaatgc ccagcccttt caccggcgcg gccggtgcag 403261 ccggatcacc gtcggtttgc ggctttggcg gtgcggcggg ctccggttgc ggctttgctt 403321 cgggttgcgg ctgggctgct ggctcggcga gcccaggagc cgggggaggt gtctttttcg 403381 ccccgggccg cttggcgccg gcggcaatac ccagcccttt aacgggcgcg gcgggcgcag 403441 ccggggccgc gggcgttggg gccgctttct tggcgccagg ccgcttggcg ccggcggcca 403501 tgccgaggcc tttcacgggt gctgcgggag cggccggcgc tggcgcctgt ggtgcctcgg 403561 cgggtgcctc cacgggcgtc accggtgcgg cagccttagg agcggctttc ggggcgcgct 403621 cctgagcctg tttggcggcc gtacccttgg ccggcagctg cgccttgtcg tggtctagtg 403681 atccgagtag cacctgggcc acgtcgagca cctcgacgcc gctgcggccg gcttcttcct 403741 gccgatcgtt cacaccgtcg gtgaccatca cccggcagaa tgggcacgcg gtggcgattg 403801 cggtggcatc ggtggccagc gcctcatcga cgcgttcatg gttgatccgc ttgccgatgt 403861 gttcttccat ccacatgcgg gcgccgcctg cgccgcaaca aaagctgcgg tcggcatggc 403921 gcggcatctc ggtcaggctg gcccccgcgg caccgatcag ctcccgtggt gcctcgtagg 403981 ccttgttgtg ccgacccagg tagcacgggt cgtggtaggt gatgtcctga gaaaccggag 404041 tgacagggac cagcctcttg tcgcgcacca accgattgag cagctgggtg tggtgcagca 404101 cggtgtagtt ggcgcccagc tgccgatatt ccttgccgat ggtgttgaag cagtgcgggc 404161 aggtgacaac gatcttgcgg tcgacggtct ccacaccctc gaacaaaccg tccagggtct 404221 cgacggcctg ttgtgccagc tgctggaaga ggaactcgtt gccggagcgg cgcgccgagt 404281 cgccgttgca ggtttcccca gcgcccagca ccaagtattt caccctggcg acggcgagca 404341 gctcggcgac ggccttggtg gtcttcttgg ccttgtcgtc gtaggcgccc gcacaaccca 404401 cccagaacag gtactcgtag ccgtcgaagc tgtcgacgtc ctggccgtac acggggacgt 404461 cgaagtcaac ctcgtcgatc cagttggtgc gatctgaggc gttctgaccc cacgggttgc 404521 ccttggtctc caggttcttg aacagcaccg acagctcgga ggggaactcc gactccatca 404581 tcacctggta gcggcgcata tcgacgatgt gatcgacatg ttcgatatcc accgggcact 404641 gctcgacgca ggcaccacag gtcacacatg accacaagac gtcgggatcg ataacgccac 404701 cctgttcctc ggtgccgacc agcgggcgag tcgcctgctc cggtccatgc ccgggcactc 404761 gaccgaaccc cgattccggc acgtgatgat gctcttggtg accggcctcg ccgcccgcgc 404821 tggcatcctt ttggcccagg atgtagggcg ccttggccat ccaatggtcg cgcaggtcca 404881 tgatgaccag cttgggcgac aacggtttgc cggtgttcca ggccgggcat tgcgactgac 404941 agcgtccgca ctcggtgcag gtagcgaagt cgagcatccc cttccaggtg aagtcttcga 405001 tcttgccgcg gccgaatacg gcatcctcgc tgggattctc gaagtcgatt ggtttgccat 405061 cggcttcgag cggcaacagc gggcccagcc catccggcag ccgtttgaac gtgacgttaa 405121 tgggcgccag gaagatgtgc aggtgcttgg aatgcaaaac gaggatcagg aacgcaagca 405181 tgaccccgat gtgcagcaac agcgctgtgg tttcgatgat ttcgttggcg ggctgcccga 405241 gggggcgaag aatcgcgccg aatagctgcg ataggaaggc cccgttgccg tagggcaggg 405301 tgccgttgtt gaccgctgag ccgcggacca acacgtaggt ccagatgacg ttgaagatca 405361 tcaacaggac gagccacgcg ccgccgttgt gcgatccgta gaaccgggag ctccgaccga 405421 tctcgcgggg gttgcgcagg atacggatga tggcgaaggt cgtgataccg agaaagacgg 405481 cggtggcaaa gaagtcctgc aggaagccca acgcgtccca ccggccgatg accgggatgt 405541 ggaatctctc ctcgaacagc aggccgtaag cctcgatata gacggtgagc aggatgaaga 405601 agccccacat ggtgaaaaag tgcgccaggc ccgggatcga ccatttcaac agtcggcgct 405661 gccctagaac ctcggagatc tgggtccaga tgcgggtgcc gaggttgtcg gttcgcccgc 405721 tggccggctg cccggacatg accagcttgt aaagccacca gactcgccgc agagcgaaca 405781 cccccaccac cgcggtcatg ctcatgccca gtatcagcct gatgagcgtt tgcgtggtca 405841 cggaaggtca ccccaattcg tagcactcaa tggaacccct gcataacctg ctcatcctga 405901 catctgtgcg actttcgccg cgagaaaggc tgtcctaacc taccggtcgt caacgcctct 405961 catctgcggt taagctctcc ggggccagca tggcccgcag catcgacaac atctccgacc 406021 gggagccagc gcccagccgc tggcgtatcc gggcgacgtg gtgctcgacc gtcttcgctg 406081 agatgaacag ccgggcgcca atgtcgcgat agggcatgcc cagtagcagt agctcggcga 406141 cttcgcgttc gcgatcggat agcggcgagc ccgccggtgg ctggcgcggt gccggcgggg 406201 taccggaagc tggttccgtg tcgccggccc cgctgggggg ctcgccgaaa tcgttgccca 406261 gcttaagatc ccgtgccaac tgcagcatgg caccggacac ccgtgcgtcg gatgtttgca 406321 atgcggcctg acctgccagt cgggtcgcat ccgacgtcag gccgacgtgt gacagggacc 406381 gcgccgccgc ggtgacctcg tcggcgtcga cgttttcggc caggacccgc agccaggtgc 406441 gaccggcatc cgacagggcc tgcgcgagcg tgctgtgggc gaccattgca ccgagggcct 406501 gtccgtgcgg tgccaccgat tccggcgaat tggcgaggat tccagcgtgc actccagccc 406561 aatgcagtga gttcgaccac agggcggggt tgcccagcga atccagcagc gtgagcgcct 406621 gatccagggt gtgttgtagc tggtcaacct ggcgcattcg ggcggccgcg acccacagtt 406681 caccaagtgg cagcagggcg aacagatcga gcgaatactc ggccagcgct tccatcgccg 406741 cataccaatg ctgttgcagc gcaccgatat cgccggtgcg acgcgagatc gcggtttgca 406801 gtgccgcggc ccacaacgcg tcgcgccggt gcaggtgcgt gccggcgctg gccgccgcga 406861 cgtccgcgct tgccgacggc aattgcccct cttgcatttt gatccagccg gaaagcagca 406921 ggtgccgacg ctggaacagc gggtcggcgc cggctcgcac ggcacgcccg atcacactgc 406981 gggcgcggac cggatcgccg gcgtgtatcg cggccaaggt aaccagcgct gccgggctgt 407041 ccggaatgac ttggctgagc gattgttcgg tggcaatggc ttggcccagt tttgccatcg 407101 cgaccggata cggctgatcc atggtcagca gcagcccctc ggcgaggttg cgcgcgcaac 407161 gcgctgccat cgtcggtgga ccggcatcct tgagtcgcag ggtggcacgc gccgtcgcca 407221 ggtcgccgtt cgcggcgaac acgatcgtgg cggccgagct caccatcgtg tccgggtgtg 407281 ggcccagcca gccgaacaac tcggctgcgt gtcccgtgtt gccgtcgtgg accgcgacgc 407341 tggccgcaac ccgcaccgcg gcagcgcgtt cggtggcatc cggggagctg agcagatcgt 407401 cggctagtgt tgccgcggcc gtacagtcgc cggtgcgggc cagtgcgtcg gccaggcgga 407461 ccgtcaatcc tttggcgccg gcatggaccg cggcgcggta cagccgtgcg caacggaccg 407521 aagcgtcgcg ggtgtccgcg gcgtaccgcg tgaggatgtc cgccagccgc tcgtcgcgca 407581 gcccgtgttc ggccagtcgc agcgctagct ccgccgacac cggcgagata tcgagttgtg 407641 agcgtaacag cgaggtttcg acctcgtggt ggtgtgcatt gccgacgatc tgagcgatcg 407701 catcatggac tgactgcaga aacgccgcgg tgtgtgacga ctcgatcagt ccgctggcgt 407761 gcgcacgatc gaccaatccg cgggcatccg ttaccgaaat cccaagtgca gcagctacat 407821 cgctgacccc tagctcgtgg gttagcgaca tcatgagcag ggtgtccaga gtgggttcgt 407881 cgaggcggcg cagccgctcg atgagggcca ccttggccgc ttgcgcggga gcctgtgccc 407941 tggcggaaac cgcatgaatg aggaacggca gtcccgcggt gcaatctcgc aggtgctcgg 408001 caaccggaag tggaccgagc gagattcgtg gccggtcccg ttcgagcgcc atcgtcaggg 408061 ctcgtagtgc ccggtggtgc tcgcgggctt ccgcggccgc caccaccgtc agccgtgaat 408121 cggccacgcg ctcggtgagc cggagcaatt cggtatcggt gagcaactgg gcgtcgtcga 408181 tgacgagcgc ggtctccggc ggttcgccgt ctggcggcgg gcatgccagc acggtgagtc 408241 ccgagcggcg cagtgtgtcg cgggcggcag ccagaacggt ggtcttgccc gttccgatgc 408301 ccccggtgat caggaccttg accggtaccg tcggggcatt cgcgagttcc aggagggcac 408361 ggcgtgctgc cggcgggacc tcggtgaggg aatcggtcac cgatgcgtcg tatgcttggc 408421 cacggttctt gcaccccctg tgctgcacgg ctggtcggcg gcggctccct caccatagcc 408481 ccagcccgtc ccgcagcccc gcatttcccc taatgcggcc atcccctaac ggcgccccgg 408541 ggccggcggg ttccgcaccg aacacggacg cggcctcaac cgatagcatc gtgctaacac 408601 gggactaacg ggggtggggc aaggaggcgg gtagtggcaa actcgttgct cgactttgtc 408661 atctcgcttg tgcgcgaccc ggaagcggcc gcacgttacg ccgcgaaccc cgagcggtcg 408721 atcgccgaag ctcaccttac cgacgtgacc agagcggatg tgaacagcct gatcccggtg 408781 gtgtcggatt cgttgtcgat gtccgaaccc atcggagccg ctggcggggc acacgctggc 408841 gatcgtggca acgtttgggc gagcggcgcg gccacggctg cgcttgatgc gttcgcccca 408901 cacgccgatg cgggtgttgt ccaacagcac ggtgcggtcg gcagcgttct caaccagccg 408961 accccacccg gaccgggcgt gacacccacc gatccgcgcc ccttccgagc cggtccacat 409021 gagacgtcgg cgctgctcac gagcgctgaa atacccgaca cgaccagcga ggacggggga 409081 ttgccgacag accatccggc tgtctggaac cacccggtcg ttgacccaca taccgtcgag 409141 cccgatcatc acggctacga catccacgga taagttccgg accggcgtag gggtgcccca 409201 tttcccctaa tcccctaacg cggcggccag gccgatcccg ataggtgttt ggccggcttg 409261 cggatcagac cccgatttcg gggtgaggcg gaatccatag cgtcgatggc acagcgccgg 409321 tcacgccggc gaacagcttc ttcgattgaa gggaaatgaa gatgacctcg cttatcgatt 409381 acatcctgag cctgttccgc agcgaagacg ccgcccggtc gttcgttgcc gctccgggac 409441 gggccatgac cagtgccggg ctgatcgata tcgcgccgca ccaaatctca tcggtggcgg 409501 ccaatgtggt gccgggtctg aatctgggtg ccggcgaccc catgagcgga ttgcggcagg 409561 ccgtcgccgc tcggcatggc tttgcgcagg acgtcgccaa tgtcggcttc gccggtgacg 409621 cgggcgcggg ggtggcaagc gtcatcacga ccgatgtcgg tgcgggcctg gctagcggac 409681 tgggtgctgg gttcctgggt cagggtggcc tggctctcgc cgcgtcaagc ggtggtttcg 409741 gcggtcaggt cggcttggct gcccaggtcg gtctgggttt tactgccgtg attgaggccg 409801 aggtcggcgc tcaggttggt gctgggttag gtattgggac gggtctgggt gctcaggccg 409861 gtatgggctt tggcggcggg gttggcctgg gtctgggtgg tcaggccggc ggtgtgatcg 409921 gtgggagcgc ggccggggct atcggtgccg gcgtcggcgg tcgcctaggc ggcaatggcc 409981 agatcggagt tgccggccag ggtgccgttg gcgctggtgt cggcgctggt gtcggcggcc 410041 aggcgggcat cgctagccag atcggtgtct cagccggtgg tgggctcggc ggcgtcggca 410101 atgtcagcgg cctgaccggg gtcagcagca acgcagtgtt ggcttccaac gcaagcggcc 410161 aggcggggtt gatcgccagt gaaggcgctg ccttgaacgg cgctgctatg cctcatctgt 410221 cgggcccgtt agccggtgtc ggtgtgggtg gtcaggccgg cgccgctggc ggcgccgggt 410281 tgggcttcgg agcggtcggg cacccgactc ctcagccggc ggccctgggc gcggctggcg 410341 tggtggccaa gaccgaggcg gctgctggag tggttggcgg ggtcggcggg gcaaccgcgg 410401 ccggggtcgg cggggcacac ggcgacatcc tgggccacga gggagccgca ctgggcagtg 410461 tcgacacggt caacgccggt gtcacgcccg tcgagcatgg cttggtcctg cccagtggcc 410521 ccctgatcca cggcggtacc ggcggctatg gcggcatgaa cccgccagtg accgatgcgc 410581 cggcaccgca agttccggcg cgggcccagc cgatgaccac ggcggccgag cacacgccgg 410641 cggttaccca accgcagcac acgccggtcg agccgccggt ccacgataag ccgccgagcc 410701 attcggtgtt tgacgtcggt cacgagccgc cggtgacgca cacgccgccg gcgcccatcg 410761 aactgccgtc gtacggcctt ttcggactac ccgggttctg attcgcgagc cgatttcacg 410821 aaccggtggg gacgttcatg gtccccgccg gtttgtgcgc ataccgtgat ctgaggcgta 410881 aacgagcgag aaagtggggc gacacggtga cccagcccga tgacccacgt cgggtcggtg 410941 tgatcgtcga actgatcgat cacactatcg ccatcgccaa actgaacgag cgtggtgatc 411001 tagtacagcg gttgacgcgg gctcgccagc ggatcaccga cccgcaggtc cgtgtggtga 411061 tcgccgggct gctcaaacag ggcaagagtc aattgctcaa ttcgttgctc aacctgcccg 411121 cggcgcgagt aggcgatgac gaggccaccg tggtgatcac cgtcgtaagc tacagcgccc 411181 aaccgtcggc ccggcttgtg ctggccgccg ggcccgacgg gacaaccgca gcggttgaca 411241 ttcccgtcga tgacatcagc accgatgtgc gtcgggctcc gcacgccggt ggccgcgagg 411301 tgttgcgggt cgaggtcggc gcgcccagcc cgctgctgcg gggcgggctg gcgtttatcg 411361 atactccggg tgtgggcggc ctcggacagc cccacctgtc ggcgacgctg gggctgctac 411421 ccgaggccga tgccgtcttg gtggtcagcg acaccagcca ggaattcacc gaacccgaga 411481 tgtggttcgt gcggcaggcc caccagatct gtccggtcgg ggcggtcgtg gccaccaaga 411541 ccgacctgta tccgcgctgg cgggagatcg tcaatgccaa tgcagcacat ctgcagcggg 411601 cccgggttcc gatgccgatc atcgcagtct catcactgtt gcgcagccac gcggtcacgc 411661 ttaacgacaa agagctcaac gaagagtcca actttccggc gatcgtcaag tttctcagcg 411721 agcaggtgct ttcccgcgcg acggagcgag tgcgtgctgg ggtactcggc gaaatacgtt 411781 cggcaacaga gcaattggcg gtgtctctag gttccgaact atcggtggtc aacgacccga 411841 acctccgtga ccgacttgct tcggatttgg agcggcgcaa acgggaagcc cagcaggcgg 411901 tgcaacagac agcgctgtgg cagcaggtgc tgggcgacgg gttcaacgac ctgactgctg 411961 acgtggacca cgacctacga acccgcttcc gcaccgtcac cgaagacgcc gagcgccaga 412021 tcgactcctg tgacccgact gcgcattggg ccgagattgg caacgacgtc gagaatgcga 412081 tcgccacagc ggtcggcgac aacttcgtgt gggcatacca gcgttccgaa gcgttggccg 412141 acgacgtcgc tcgctccttt gccgacgcgg ggttggactc ggtcctgtca gcagagctga 412201 gcccccacgt catgggcacc gacttcggcc ggctcaaagc gctgggccgg atggaatcga 412261 aaccgctgcg ccggggccat aaaatgatta tcggcatgcg gggttcctat ggcggcgtgg 412321 tcatgattgg catgctgtcg tcggtggtcg gacttgggtt gttcaacccg ctatcggtgg 412381 gggccgggtt gatcctcggc cggatggcat ataaagagga caaacaaaac cggttgctgc 412441 gggtgcgcag cgaggccaag gccaatgtgc ggcgcttcgt cgacgacatt tcgttcgtcg 412501 tcagcaaaca atcacgggat cggctcaaga tgatccagcg tctgctgcgc gaccactacc 412561 gcgagatcgc cgaagagatc acccggtcgc tcaccgagtc cctgcaggcg accatcgcgg 412621 cggcgcaggt ggcggaaacc gagcgggaca atcgaattcg ggaacttcag cggcaattgg 412681 gtatcctgag ccaggtcaac gacaaccttg ccggcttgga gccaaccttg acgccccggg 412741 cgagcttggg acgagcgtga gcaccagcga ccgggtccgc gcgattctgc acgcaaccat 412801 ccaggcctac cggggtgcgc cggcctatcg tcagcgtggc gacgtttttt gccagctgga 412861 ccgcatcggt gcgcgcctag ccgaaccgct gcgcatcgcg ttggctggca cactcaaggc 412921 cggaaaatcc actctcgtca acgcccttgt cggcgacgac atcgctccga ccgatgccac 412981 cgaggccacc cggattgtga cctggttccg gcacggtccg acaccgcggg tcaccgccaa 413041 ccatcgcggc ggtcgacgcg ccaacgtgcc gatcacccgt cggggcgggc tgagtttcga 413101 cctgcgcagg atcaacccgg ccgagctgat cgacctggaa gtcgagtggc cagccgagga 413161 actcatcgac gccaccattg ttgacacccc gggaacgtcg tcgttggcat gcgatgcctc 413221 cgagcgcacg ttgcggctgc tggtccccgc cgacggggtg cctcgggtgg atgcggtggt 413281 gttcctgttg cgcaccctga acgccgctga cgtcgcgctg ctcaaacaga tcggtgggct 413341 ggtcggcggg tcggtgggag ccctgggcat catcggggtg gcgtctcgcg cggatgagat 413401 cggcgcgggc cgcatcgacg cgatgctctc ggccaacgac gtggccaagc ggttcacccg 413461 cgaactgaac cagatgggca tttgccaggc ggtggtgccg gtatccggac ttcttgcgct 413521 gaccgcgcgc acactgcgcc agaccgagtt catcgcgctg cgcaagctgg ccggtgccga 413581 gcgcaccgag ctcaataggg ccctgctgag cgtggaccgt tttgtgcgcc gggacagtcc 413641 gctaccggtg gacgcgggca tccgtgcgca attgctcgag cggttcggca tgttcggcat 413701 ccggatgtcg attgccgtgc tggcggccgg cgtgaccgat tcgaccgggc tggccgccga 413761 actgctggag cgcagcgggc tggtggcgct gcgcaatgtg atagaccagc agttcgcgca 413821 gcgctccgac atgcttaagg cgcataccgc cttggtctcc ttgcgccgat tcgtgcagac 413881 gcatccggtg ccggcgaccc cgtacgtcat tgccgacatc gacccgttgc tagccgacac 413941 ccacgccttc gaagaactcc gaatgctaag ccttttgcct tcgcgggcaa cgacattgaa 414001 cgacgacgaa atcgcgtcgc tgcgccgcat catcggcggg tcgggcacca gtgccgccgc 414061 tcggctgggc ctggatcccg cgaattctcg cgaggccccg cgcgccgcgc tggccgcagc 414121 gcaacactgg cgtcgccgtg cggcgcatcc actcaacgat ccgttcacta ccagggcctg 414181 tcgcgcggcg gtgcgcagcg ccgaggcgat ggtggcggag ttctctgctc gccgctgacg 414241 cgtcaggccc tcgggtgtca cagtggtggg cgtgactggt ggcgccaacg caacggtgat 414301 cagccaccgg gtggaacatg ttttcgagcc caaggggcag cgacggcagc tcggggcaca 414361 agggtcataa gggcatgcgc tcagaatgtg tcgaccttct cgatgctgac gaacatgcca 414421 tggcccgtgc ggttgttcgt gaagcgggtg ccatcggtgg tggcgtcgat ggtccagccc 414481 tgcgcctcat aggttcggta gtcgatgctg acggtgggga tggcgcccag attgcccaac 414541 acccactgca cggagccgcc ggcggtgatg tggatgccgt tggcgtgctc accatctcgc 414601 aagggggagt tggtgaacgg tgcctcgcag ccaacgctgt ccctattgat ctggcaacgc 414661 gtcattccgg acttggtttc gatgaagacg taaccgttgg agtcaggcgg gagcggaatg 414721 gcgccggccg gcgcggtcgg gctgaccggt gtcgtcggta gcgtcggggc ggtggtcccc 414781 ggcggcgcgg tagttggcct aggcgtcgga aaagtcggct cggtaggtcc cgaccctggc 414841 gaagcgaccg gcctgccgtc gatggtggtg ttgcagccgg caactagcgc ggtagccgcc 414901 agcagggccg ccataccccg tgcaatgagc gatagccgca cgcgctactc cccggaaatc 414961 tgagatatcg ggagtaggtt acgcgcgagg tcccgcaatt tactgcagtg acgcgcttct 415021 gcaacggccc gcataatcgg agaatggcgt tgttgccgtc gacggtcgtg ggagtcttgc 415081 tggccgcggg tgcgggccgg tggtatggca agccgaaagt gctggttgac gggtggctgg 415141 acaccgcggt cggggcgttg cgcgacggtg gttgtaacga cgttattttg gtgctgggtg 415201 ctgtcgaggt gtcggcaccg gccggtgtca ccgcgattac cgcgccggac tggcagcagg 415261 ggctgagcgc gtcagtgcgt gcgggtctgg cccaggccga ccgcgagcac gccgactacg 415321 ccgtcctgca tgtgatcgac acgcccgatg tcaatgccaa ggtggtggct cgagtccttg 415381 gccgtgcctt ggtatcccgc agcggtctgg cagggcgcgg ccgcatacct gcgcacagtg 415441 cccgacgtcg aggctgttga gtgcggcgac ttggctagtg gtcgcgatgt cgacgtggac 415501 ctcagattgg atccgccgaa tggacgaccg cgacactctt ggtgtggtcg atggcgtggt 415561 gcgcgacggc cgtcacacga ttgcggacca aataccagcc accgatgagg gccggtacac 415621 cgatcaccgt tgcggcgatc atccagggac cgtgttgttc gtcgaagtac atcaggatca 415681 gcacgccggc caggaaagcc agcgtcagat agccgctgaa cggtgaaagc ggcatccgga 415741 atttcggccg ctgcagctgc ccggcgttcg ccatccggtg gagccgcagc tggcaagcca 415801 cgatcgtcgc ccaggccgcg atgactcccg tcgcggcgat gtggagcacg atctcgaagg 415861 cttggctcgg tttgatggcg ttgagaatga tgcccaacag gccgataccg gcggtgagca 415921 ggatcccgcc gtacggcacg ccggtcttcg acattggtgc ggtgaacctc gggccgctgc 415981 cgttgatcgc cattgatcgc aggatccgtc cggtggaata cagtccggcg ttgaggctcg 416041 acagcgcggc ggtgagcacg acgaggttca tcacgctgcc cgccgcgtcg ataccgatct 416101 tggaaaagaa ggtcacgaac gggctgacat gttctttgta ggcggtatag ggcagcagca 416161 gggccagcag gacagtcgac ccgacgtaga agcacgcgat gcgcaacacc acagagttga 416221 tcgcgcgcgg catgatcttt gccggttcgg ctgtttcccc ggccgcgatg cccaccagtt 416281 cgattgcggc gtaggcgaat accacccccg aggtgaccag cactatgggc agcagaccgg 416341 ttggcacgat gcctccatgg ctgctccaca gggagacccc ggtctcctgg ccgtcgatct 416401 tgtagcgccc agcgagaaag accgtaccga cgatcagaaa cgtaaccagc gcgatcacct 416461 tgatcaatga ggcccagaac tccagctcgc cgaagagcct gaccgagatc aggttcatcg 416521 acaacacgac cagcagcgcg atcaacgcca gcgtccactg ggggatgggt tgaaacgccc 416581 gccagtaatg gcaatagtgc gcgatcgcgg tggtatcgac gatccccgtc atcgcccagt 416641 tcaggaagta catccacccg gcgacgaagg ccaccttttc cccgtagaac tcgcgggcgt 416701 aggacacgaa tgaccccgag gacggacggt gcagcaccag ttcgccgagc gcgcgcagga 416761 tcaggaacac gaagatgccg cagatcccat agaccaggaa caaaccgggc cccgccgatg 416821 caaggcggcc gccggcgccg agaaaaaggc cggtaccgat ggcaccaccg agagcgatca 416881 tttgcagttg ccggctatgg aggcctttgt gatagcccgt gtcttcgcgc gtgagccgct 416941 cgtcggtgat gtctagcggt ggcattgagc tccctgggat ggtggcttct tgggacgcgc 417001 gtgagatggg gcacacccaa cggactggct gtcaggctat cccacgcggc tgcgaggtgc 417061 cgcttggcaa ccaatcggaa acaatcgatc ggtcaacggt gctttgttgt cgtgccgacc 417121 gtcgcgggtg gccgcgttga cagtcgatat tgcggtcaca ggctgacgcg cctggccagc 417181 cagacgctcg cgaagtgcgg gtccgtcctg gccgcgaggg tgtcgtagcc gcggtcgtag 417241 tgtgagactc cgacacctga tccgccagcg cagagatgtg agatcaacgg acggaaggcg 417301 acggtgcccg gcgcgcgcga gttgacgctg cgcgtcgagc gcggggctct atttcggcgt 417361 cgatgggcag catcggcagc gtcatcagct cgcgcagcaa ttcgtcgtga tccgcggcgc 417421 tgcgcgctgg gtacccggcc tcgatgggta tcatttttgg ttatcgttct ggttatcatg 417481 aatgttgtga cggcccatcc caagtacccg aatgaccctc ttgcgctggt attgattgaa 417541 ctgcgccatc cgcggaccga gccgccggtg ccatctgcta tctccatcct gaaggaggag 417601 ctggcgcgat ggactcccat actcgaacag gaggaggtgc ggcaggtcaa cctagaaacg 417661 ggcgaacata ccgcacactc acagaagaag ctcgttgccc gtgatcgccg caccgcgatc 417721 acgtttcgac ccgacgccat gaccctcgaa gtcaccgact acccgggctg ggaggagttt 417781 cggtccatcg ttcacgcgat ggtcacagcc cgccaggacg tggccccagt cgatggctgc 417841 atccggatcg gtctgcgcta catcaacgag attcgggcat cgctggcgga gccatccggc 417901 tgggcgtact gggtggcgga aagtctcctc gggcctggga cacagcttgc cgatctcaaa 417961 ctcaccacca ccgcgcaacg gcacgtcatt cagtgcgaag gcccggagcc aggcgactcc 418021 ttgacactga ggtacgccgg tgcgcgcggc gcggtcatcc agtcaacccc gtttctccag 418081 cggttgaaag aacctccggc agaaggagat ttcttcctca tcgatatcga cagcgcgtgg 418141 agcgacccct gcaagggcat cccagcgctc gacgcccacc tggtggacga ggtcgccgaa 418201 aggctccaca cacccatcgg cccactgttc gaatcgctga taacttccga actccgtaca 418261 aaggtgctgc aacaacctgg gcaggagtga ccatgaccat ttcgttctct agctcgaatc 418321 tccgagacga cgccacctct ggcaacggcg attaccgcct cgacaagctg cccgaaacca 418381 ccccatcgac ctcggtgttc gaccgcgccg atgtcaccta ccgccaattc acggaactcc 418441 acgggcaagc ccgcgacaca cggcgggagg cgcacgtggt tgagctggag tccaagaccg 418501 gcgagcgggc tcggtgcgca cccatgcatg cgcttgagca gctcgcggac tacggctttg 418561 cctggcggga catcgcacgc gttgtcggag tgagcgtgcc cgcaatcacc aaatggcgca 418621 agggcgctgg agttaccggc gagaaccggc taaaaatcgc ccgtctactc gccctcatcg 418681 acatgctctc ggaccgattc atcggcgagc ccgcctcctg gctggaaatg ccgatccaag 418741 ccggagtggg aatcacccga atggacctcc tggagcgagg tcgatatgac ctcgtattgg 418801 cgctggctag tacccacact ggggacggta cggtcgaata cgtactgaac gagactgata 418861 aggactggcg agagaccgtt gtagacaacg ctttcgaatc ctacacagcc gaggacggcg 418921 tgatctcgat aagacccaag cggtaaccgt gccagagctg gagacgcccg acgacccaga 418981 gtcgatatac cttgcccgcc tcgaggatgt cggagaacac agaccgacgt tcacgggcga 419041 catctaccga ctcggcgatg gtcgcatggt gatgatcctc cagcacccat gcgcgctgcg 419101 gcacggcgtt gacctccatc cgcgactgct ggtcgctccc gtaagacccg actcgcttcg 419161 ttccaactgg gctagagccc cgttcggcac gatgccgctt ccgaagctca tcgacggtca 419221 ggatcactcg gcggacttca tcaatcttga actcatcgat tcaccaacgc ttccgacctg 419281 tgagcggatc gcggtgctca gccagtcagg cgtcaacttg gtcatgcaac ggtgggtgta 419341 ccacagcacc cggctcgccg tgcccacgca cacctactcc gacagcaccg ttggcccgtt 419401 cgatgaggca gacctgatcg aggagtgggt gacggatcgc gtcgacgatg gggccgaccc 419461 gcaggcggcc gaacacgaat gcgcctcctg gctcgatgaa agaatcagcg gccgcactcg 419521 gcgagcgctg ctcagcgacc gtcagcacgc cagttcaata cggcgagaag cgcgttctca 419581 tcgaaagtcg gtcaagctgg cggactgagc actgctctcc gggcttgacc ggggcctctc 419641 ccagctacgc cccgagcgtg tgccctgccg acacgcggga acaagacccg cacgaccagc 419701 gttagcatgc tcagtaagtt gagtgcatca ggctcagctc tgaattgaca gcacaccgcc 419761 gtcgaggcaa gcttgagcgg ggtgcactca tcatagtgca ggaaagaagc tctacatatt 419821 caggaggatt caccatggct cgtgcggtcg ggatcgacct cgggaccacc aactccgtcg 419881 tctcggttct ggaaggtggc gacccggtcg tcgtcgccaa ctccgagggc tccaggacca 419941 ccccgtcaat tgtcgcgttc gcccgcaacg gtgaggtgct ggtcggccag cccgccaaga 420001 accaggcagt gaccaacgtc gatcgcaccg tgcgctcggt caagcgacac atgggcagcg 420061 actggtccat agagattgac ggcaagaaat acaccgcgcc ggagatcagc gcccgcattc 420121 tgatgaagct gaagcgcgac gccgaggcct acctcggtga ggacattacc gacgcggtta 420181 tcacgacgcc cgcctacttc aatgacgccc agcgtcaggc caccaaggac gccggccaga 420241 tcgccggcct caacgtgctg cggatcgtca acgagccgac cgcggccgcg ctggcctacg 420301 gcctcgacaa gggcgagaag gagcagcgaa tcctggtctt cgacttgggt ggtggcactt 420361 tcgacgtttc cctgctggag atcggcgagg gtgtggttga ggtccgtgcc acttcgggtg 420421 acaaccacct cggcggcgac gactgggacc agcgggtcgt cgattggctg gtggacaagt 420481 tcaagggcac cagcggcatc gatctgacca aggacaagat ggcgatgcag cggctgcggg 420541 aagccgccga gaaggcaaag atcgagctga gttcgagtca gtccacctcg atcaacctgc 420601 cctacatcac cgtcgacgcc gacaagaacc cgttgttctt agacgagcag ctgacccgcg 420661 cggagttcca acggatcact caggacctgc tggaccgcac tcgcaagccg ttccagtcgg 420721 tgatcgctga caccggcatt tcggtgtcgg agatcgatca cgttgtgctc gtgggtggtt 420781 cgacccggat gcccgcggtg accgatctgg tcaaggaact caccggcggc aaggaaccca 420841 acaagggcgt caaccccgat gaggttgtcg cggtgggagc cgctctgcag gccggcgtcc 420901 tcaagggcga ggtgaaagac gttctgctgc ttgatgttac cccgctgagc ctgggtatcg 420961 agaccaaggg cggggtgatg accaggctca tcgagcgcaa caccacgatc cccaccaagc 421021 ggtcggagac tttcaccacc gccgacgaca accaaccgtc ggtgcagatc caggtctatc 421081 agggggagcg tgagatcgcc gcgcacaaca agttgctcgg gtccttcgag ctgaccggca 421141 tcccgccggc gccgcggggg attccgcaga tcgaggtcac tttcgacatc gacgccaacg 421201 gcattgtgca cgtcaccgcc aaggacaagg gcaccggcaa ggagaacacg atccgaatcc 421261 aggaaggctc gggcctgtcc aaggaagaca ttgaccgcat gatcaaggac gccgaagcgc 421321 acgccgagga ggatcgcaag cgtcgcgagg aggccgatgt tcgtaatcaa gccgagacat 421381 tggtctacca gacggagaag ttcgtcaaag aacagcgtga ggccgagggt ggttcgaagg 421441 tacctgaaga cacgctgaac aaggttgatg ccgcggtggc ggaagcgaag gcggcacttg 421501 gcggatcgga tatttcggcc atcaagtcgg cgatggagaa gctgggccag gagtcgcagg 421561 ctctggggca agcgatctac gaagcagctc aggctgcgtc acaggccact ggcgctgccc 421621 accccggcgg cgagccgggc ggtgcccacc ccggctcggc tgatgacgtt gtggacgcgg 421681 aggtggtcga cgacggccgg gaggccaagt gacggacgga aatcaaaagc cggatggcaa 421741 ttcgggcgaa caggtaaccg tcactgacaa gcggcggatc gatcccgaga cgggtgaagt 421801 gcggcacgtc cctcccggcg acatgccggg agggacggct gcggccgatg cggcgcacac 421861 cgaagacaag gtcgccgagc tgaccgccga tctgcaacgc gtgcaggccg acttcgccaa 421921 ctaccgtaag cgggcgttgc gcgatcagca ggcggccgct gaccgagcca aggccagcgt 421981 tgtcagccaa ttgctgggtg tactggacga tctcgagcgg gcgcgcaagc acggcgattt 422041 ggagtcgggt ccactgaagt cggtcgccga caagctagac agcgcgttga ccgggctggg 422101 tctggtggcg ttcggtgccg agggcgagga tttcgacccc gtgctgcacg aagcggtgca 422161 acacgagggc gacggcgggc aggggtccaa gccggtaatc ggcaccgtca tgcggcaggg 422221 ctaccaactg ggtgagcagg tgctgcggca cgccttggtc ggcgtcgtcg acacggtggt 422281 cgtcgacgcg gccgaactgg agtcagtcga cgacggcact gcggtcgcag ataccgccga 422341 aaacgatcaa gctgaccagg gcaatagcgc cgacacctcg ggcgaacagg cagaatcaga 422401 accgtcgggc agttaacaac aaaagaggaa ggcgagaggg ggtgacgcga catggcccaa 422461 agggaatggg tcgaaaaaga cttctaccag gagctgggcg tctcctctga tgccagtcct 422521 gaagagatca aacgtgccta tcggaagttg gcgcgcgacc tgcatccgga cgcgaacccg 422581 ggcaacccgg ccgccggcga acggttcaag gcggtttcgg aggcgcataa cgtgctgtcg 422641 gatccggcca agcgcaagga gtacgacgaa acccgccgcc tgttcgccgg cggcgggttc 422701 ggcggccgtc ggttcgacag cggctttggg ggcgggttcg gcggtttcgg ggtcggtgga 422761 gacggcgccg agttcaacct caacgacttg ttcgacgccg ccagccgaac cggcggtacc 422821 accatcggtg acttgttcgg tggcttgttc ggacgcggtg gcagcgcccg tcccagccgc 422881 ccgcgacgcg gcaacgacct ggagaccgag accgagttgg atttcgtgga ggccgccaag 422941 ggcgtggcga tgccgctgcg attaaccagc ccggcgccgt gcaccaactg ccatggcagc 423001 ggggcccggc caggcaccag cccaaaggtg tgtcccactt gcaacgggtc gggcgtgatc 423061 aaccgcaatc agggcgcgtt cggcttctcc gagccgtgca ccgactgccg aggtagcggc 423121 tcgatcatcg agcacccctg cgaggagtgc aaaggcaccg gcgtgaccac ccgcacccga 423181 accatcaacg tgcggatccc gcccggtgtc gaggatgggc agcgcatccg gctagccggt 423241 cagggcgagg ccgggttgcg cggcgctccc tcgggggatc tctacgtgac ggtgcatgtg 423301 cggcccgaca agatcttcgg ccgcgacggc gacgacctca ccgtcaccgt tccggtcagc 423361 ttcaccgaat tggctttggg ctcgacgctg tcggtgccta ccctggacgg cacggtcggg 423421 gtccgggtgc ccaaaggcac cgctgacggc cgcattctgc gtgtgcgcgg acgcggtgtg 423481 cccaagcgca gtgggggtag cggcgaccta cttgtcaccg tgaaggtggc cgtgccgccc 423541 aatttggcag gcgccgctca ggaagctctg gaagcctatg cggcggcgga gcggtccagt 423601 ggtttcaacc cgcgggccgg atgggcaggt aatcgctgat ggcgaagaac ccaaaggacg 423661 gggaatcccg gacgtttttg atctcggtag ccgccgagct agccggcatg catgcacaga 423721 ccctgcgtac ctacgatcgt cttgggttgg tcagcccgcg gcgcacctcc ggtggcgggc 423781 gccgctattc cctgcatgac gtcgagttgc tgcgccaggt gcagcacctc tcgcaggacg 423841 agggggtcaa cttggccggc atcaagcgca ttattgaact gaccagtcag gtcgaggcgc 423901 tgcagtccag gttgcaagag atggctgagg agttggcggt gttgcgtgcc aaccagcgcc 423961 gcgaggtcgc ggtggtgccg aagagcaccg ccctggtcgt ctggaaaccg cgccggtgag 424021 cgagcgcgcg tagcggggga gcgaacggcg cagttggcac cagccggtga gcgagcgcgc 424081 gtagcggggg agcgaacggc gcagttggca ccagccggtg agcgagcgcg cgtagcgggg 424141 gagttagggt ccgctaccgt tgttgaggat gccggagagt cgggctccgt ggttgccgaa 424201 gccggagata agggcttggg tcgcgaggtc cagcatgctc gtgttgtaga aaccggagac 424261 ggtattgcct aggttcgccc agcccgacag caggttgccg aagttttgga agcccgaatt 424321 cctacgccgc cagcattgaa gaagcccgaa gtctcggtga agacgtttcc caggcccgac 424381 acggcggctg cggcgtcgtt gaggaagccc gatgcgccac cggcgccgga gttgaagaag 424441 cccgacgacg gggttgtggt cgagttgaag aatcccgggc tctgctgcca gccgaagccg 424501 aaggggaacg cgcccacggt gccgctgccg gcgaaatcga gggtttgggt gaaagccgtg 424561 tcgatgggct ggtcggggtt gatcgtgctg gcatcgattt cgtaggggcc gagatgttcg 424621 gtggtgatgg gtatggtgac cgagacatgc tttacacacc ccttgaaagg gatgtagatc 424681 acgcagaccg acacccgcaa cttgatgggt atttcgaatt cgtcaatagt gaacgcgtcc 424741 tgggtgatgg cgttgatgtc gccctcgatg ggtatttcaa tgttggaacc tgtcgtagct 424801 ccacgggatt tcggaaacgg cgctctggta ggcgaaaccg cctaggccct ggtagtcgcc 424861 ccgccagaag aagccgttgc tgtagttgcc cgaattgaag gcgccggtgt tgacatcgcc 424921 ggagttggcg atgccggtgt tggagttgcc cgggttgaag tagccggtgt tgtagttacc 424981 ggggttgaag ccgccggtgt tgtagtcgcc ggggttgaag ctgccggtgt tggtgttccc 425041 ggcgttgaat aggccggtgt tgtagtcacc ggagttgcca atgccggtgt tgaccaggcc 425101 ggtgttgaag aagccggagt tggtgctgcc ggtgttgccg atgccggtgt tgtagtcacc 425161 ggagtttccg atgccccagt tgccggtgcc ggagttgccg aagccgatgt tgccggtgcc 425221 ggagttgaat agcccgatgt tgccggtgcc cgagttccag ccgccgaacc cggtcatggt 425281 gtcgccggtc agcccgatgc cgatgtttcc gtttccggtg ttgccgaagc cgatgttgcc 425341 ggtgccggta ttgccgatgc cgatgtttcc gtttccggtg ttgccgaagc cgatgttgcc 425401 gatggccgcc gtcagccccg gaccgacgtt gccgaacccg atgttggagc tgccgatgtt 425461 ggcgctgccc aggttgaagt cgccgatgtt ggcgctgccc aggttgaagt cgccgatgtt 425521 ggcgctgccc aggttgtaga cgccgatgtt ggcgctgccc aggttgtagt tgccgatgtt 425581 tgcgctaccc agattccaga acccgaggtt ggccaagccc acgctgaagg tcgtctcggt 425641 cgggccgttc tgcaaccacc cagccaggtt ggtgccgatg ttgagcaagc ccgagacatt 425701 tgccggtgct cccaagccgg tgttgtagat gcccgagatg ctgttgccca aattcgccca 425761 gcccgactgc agcgacccat agttttggaa gcccgaattt cccatgctgc tagtggcgaa 425821 gttgtagagg cccgagttgt tgccgaagtt cagcaagccc gatgcgctgc cagcacccca 425881 gttgaggaag cccgacgacg ggccggtggt ggcgttgaag aagccgggcg ctggccgcaa 425941 gtcgatgatc ggaatgctga tcgggccggc gccggcgccg cccacgatgt tgatcacggt 426001 ggagccgtcg ggcttgccga tgttgaggtt gatcgccggc gaggggccga agtcaatttc 426061 gatgggtgtg tccagcgggg ccgacgcgtc cccgccatgc agggtgatcg gaccgaccgg 426121 ggccaagacg gtgccactga ggatgcctat gtcgacgctt ccgctggcgt cgattctggg 426181 gaacgtaatg gcggggatgg agacattggt gatgtcgccg gtgatgggga tgttgaccgg 426241 gatgtcgaca ttgaggaacg cggcaggtcg ctcgatggtg atggtgtagt tggccgccag 426301 caggccctgc cggtcggcgc gccagagcaa gccgttgttc atgctgccgg tgatgaacgc 426361 gccggtgccg tagtctccgg cgtttgccat gccggtgttg tagctaccgg tgttgtagga 426421 gccggtgttg tagtggcccg ggttggcgat gccggtgttg aaggtgccga tgttgaacag 426481 gccggtgttg tggttgcccg ggttggcgat gccggtgttg accaggcccg cgttgagcaa 426541 gccggtgttg tagttgccgc tgtttccgat gccggtgttg ccggtgccgg tgttgccgat 426601 gccgacgttg ccggtgcccg agttgccgat gccgacattg ccggtgcccg agttaaacaa 426661 gccgatgttg aaactgcccg agttggtgcc gccgatgccc acctggctgt cgccggatag 426721 gccgacgccg aagccgccgg tgcccatgtt gccgatgccg acgttgccgg tgcccgagtt 426781 gaacaagccg gtgttgccgg tgccggagtt gaacaagccg gtgttggcgg tgccggagtt 426841 gaagaaaccg gtgttgccgg cgccggagtt cagggagctg aagccggaca agccgtcgcc 426901 ggtcagccag atgccgacgt tgttgttgcc ggtgttgaac aagccgatgt tgttgttgcc 426961 ggtgttggca atgccctggt tgaagttgcc cgcgttggcc atgccaaagt tgttgtcgcc 427021 caggttgaac aggcccatgt tggcgatgcc ggcgttcagc gggccgaacc cgatctggtt 427081 gtcgccggac aggccgatgc cgatgttgtt gttgccggtg ttgccgaagc cgatgttgta 427141 gttaccggtg ttgccgacac cgatgttgta gttgccggtg ttgccgatac cgatgttgtt 427201 gacagccgcc gttagccccg gaccggtgtt tgcgatcccg acgttaaagt cgccgatgtt 427261 tgcgccgccg atgttcgcgt tgccaatgtt gccgaagccg acgtttgaat tgccgaggtt 427321 tgcactgccg aggtttgcac tgccgaggtt tgcactgccg aggtttgcac tgcccacgtt 427381 gaattggccg aggtttgcca ggcccgcgtt gaaggtcgac ccgttcgggc cgcggagcac 427441 gccggtcagg tcggcgccga cgttggagag tcccgagagg ttagccggcg tcgcgaagtc 427501 cgccgcactg gtgttgtaga agcccgagac ggtgttgccc aagttcgccc agccggattg 427561 cagcgagccg aagttctgca agccagagtt tcctatgccg gcgaaagcgg tgttccagaa 427621 gcccgaattg ttggcgccga cgtttccgaa tccagacgag ctgccggtgc cggagttgaa 427681 gaagcccgat gaggggccgg tggtcgagtt tccgaaaccg ggggccggcg ggatgtccag 427741 tagcgggatg acgaccgggc cggcgccgct ggtgatcgtg atgggtatcg aggatgaacc 427801 gccggggtcg ccgatgttga tatcgatcac cggtaaggtg ccggcgatcc tcggaacgat 427861 gatgggtccg acgctgaagt gggtgacccc gagggcgcgg agggtgatag ccggaatctg 427921 gaagccgttg acggcaaggg taccgaagtc gagatggatg gggatgttga ccggaatgtt 427981 gaggctaaag ttcagtagcg ggatctgggg aacagtgatt gcgtagtgtg cgccccactg 428041 gccttggtga tcgctctgcc agaaggcgcc gttgctgaag ttgcccccga tgaaggcgcc 428101 ggtgttcacg tcgccggtgt tgtagaagcc ggtgttgtag tcgccggtgt tgtagaagcc 428161 ggtattgaag tcgccgtcgt tgaagctgcc ggtgttggtg ctgccggtgt tgtagctgcc 428221 cgtgttggcg atgcccacgt tggcgatgcc ggtgttggtg ctgcccacgt tgtagccacc 428281 ggtgttgtag gtgcccacgt tggcgacgcc catattgccg gtgcctgggt tccacaggcc 428341 ccagttgccg gtgcccgagt tccccaagcc ggtgttgccg acacccgggt tgccgatccc 428401 ggtgtttccg atgcccgagt tgccaatgcc gatgtttccg gtgcccgagt tgaacaaacc 428461 gatgttgttg gtgcccgagt tgaacaggcc ggtgttggcg gtaccggagt tccaggagcc 428521 gaacccctgt tgattgtggc cggacagccc gataccgatg ttgttgttgc cggtgttggc 428581 aaacccgatg ttgttgctgc cgctgttggc gaagcccagg ttgaaatcgc cggcgttgcc 428641 gaatccgaag ttgtagctgc ccaggttgcc taggccgatg ttgtagttac ccaggttcgc 428701 cgggccgata ttgtatgagc cccggtttcc ggagaagacg ttgaagctgc cgatgttgcc 428761 atggccgacg ttcgcgttgc cgaggtttcc gaggccgacg ttggagtcgc cgatattgcc 428821 gtggccgaca ttcccgctgc ctacgttgcc aaacccgagg ttgaggatct gggtgttgtt 428881 gacgaagccg gcgaccgtag caccgacatt ccctccgccg gagagattgg ccggcatcga 428941 aagcccaagc gtgctcgcat tgacaaggcc ggagacagtg ttaccgaagt tgaacccgcc 429001 ggagatcagc tcgccgaagt tctggacgcc cgagattccc gagcttccag aggtgaggtt 429061 gaagaagccg gaactattgc tgcccacgtt gccaacgccc gacacggtac cggtaccggc 429121 gttgaagaat cccgacgacg gggtggtcgt cgagttcccg aatcccgggg ccgccggtat 429181 atcgatgagt ggaatcttga tcggcaatag accgccggtg ccggcgatat cgatcagcgg 429241 gtccggcccg ctacccaggt tgatgccgat attgggaagg acaatcgaga tgttcgggaa 429301 actgaatgca tcgagtgtgg cggcattgaa cggtatgccg atcaagaaga tatcgccggt 429361 gatctccggg aatctgaagc catgaacggt gaacgtgcca agtgtgccgg tgaccgggat 429421 atcgaggaag atcggcacgt gcagtttcac cggaacggcg gtgtcgggca cggtgatcgt 429481 ttggctgatc cccgccaggc cttggtaatc gccccgccac cagaacccgt tgctgtaatt 429541 gccggtgatg aagccgccgg tgttgacatc gcccgagttg gccagtccgg tgttgtagct 429601 gccggtgttc aaataacccg tgttgtagct acccgggttg aagccggccg tgttgaagct 429661 gccggcgttg aagctgccgg tgttgtaatg gccggtgttg aagatgccgg tgttgacgtt 429721 gccggcgttg agcaagccgg tgttgacgtc gccggtgttg aagatgccgg tgttggtgtc 429781 accggtgttg gcgatgcccc agtttccggt gcccgagttg ccgatgccga tgttgccggt 429841 gcccgagttg aacaaaccga tgttgttggt gccagagttg aacaggccgc tgttgccgct 429901 gccggagttc cagccaccgg caaaattgaa gccctgctgg ttgtcgccgg acaggccgat 429961 accgatgttg ttgttgccgg tgttggcaaa cccgatgttg ttgttgccgg tgttggcgaa 430021 gcccaggttg aagtcgccgg cgttgccgaa tccgaagttg tagctgccca ggttgcctag 430081 gccgatgttg tagttaccca ggttcgccgg gccgatgttg tatgagccct ggtttccgcc 430141 gaagacgttg aagctgccga ggttgccgct gccgaggttg aagctgccga tgttcgccaa 430201 gccggcgttg ctgtcgccta cgttggagaa gccgacgttg aattggccga tgtttcccag 430261 gccgaggttg aacatcgaca tcccggtcgc ctggtcgtgg aagaaccccg cgaggttgct 430321 gccgatgttg aacatgcccg agacgttggc cggtgccccg atgccggtgt tgaatacgcc 430381 cgagacggta tcgcccaggt tcgccagtcc cgattgcagc gagccgtagt tgttgaagcc 430441 cgaggtcgcg gagttcgcga cgttctggaa gccggaaatg ttggcgccga tgttggcgat 430501 gcccgatacg gttccggggc cgccgttgaa gaagcccgag gacggatcgg tggtggcgtt 430561 gaaaaagccc gtggtagccg caatgttgac gaacgtgaca tcgaagggac cgacgcttgc 430621 ggtggccggg atcctgatcg cggtcgaacc gccagggtcg ccgatgttga ccgtgatcgc 430681 gggaccggtc ccggtgatgg gcgggagaac ggccttgctg attgcaccgg ccagcagggg 430741 gatccctgcg atgtcgatgg tgaaaccgaa gttgatttgc tcaagcgtta tgccgctgta 430801 gacggtgttg gtgaagctgg cggtgatggg gatgttgacg ggaacttcca cggtgacgtg 430861 tgcgggtatt tcgggaacat ggacccgata gcccgcgctg aataggccct gctggtcgcc 430921 gcgccagaag gcgccgttgc ccatgtcgcc ggtgatgaaa gccccggtgg cgatatcacc 430981 ctggttggca aagcccgtac tgaaattgcc ggtgttgtag aagcccgtgt tgaagtcgcc 431041 caggttggcg atgccggtgt tggtgtcgcc ggtgttgtac cagccggtgt tgtagctgcc 431101 cgcgttggcg acaccggtgt tgacgatgcc ggtgttgaag aagcccgtgt tagtgctgcc 431161 ggtgttgccg atgccggtgt tgccgctgcc ggagttgccg ataccccagt tgccggtgcc 431221 cgagttgccg atgccgacgt tgttggtgcc ggagttgaac aagccgatgt tcgcggtgcc 431281 tgagttccag ccgccagcaa aattgaagcc ctgctggttg tcgccggaca gcccgatgcc 431341 gatgttgttg ttgccggtgt tggcaaatcc gatgttgttg ttgccggtgt tggcaaagcc 431401 ttggttgaaa tcgccggcgt tgccgaagcc gatgttgtag ttacccaggt tcgcgaaacc 431461 gatgttgtag ttacccaggt tcgccggacc gatattgtat gagccctggt ttccggagaa 431521 gacgttgaag ctgccgacgt ttccgctgcc caggttgaag tcgccgagat ttgcgctgcc 431581 gatgttcaac tggcccaggt tggcaaggcc cgcgttgaag atcgtcccgg tcggaccgcg 431641 gaacacgccg gacaggttgg tgccgatgtt gttcaggccc gagacattgg ccggcgtgga 431701 gaggttcacc gtactggtgt tgaaaaagcc cgatacggag ttgcccaggt tcgcccagcc 431761 tgactgcagc gagccgaggt tctggaaacc cgaattccct atcgcgctgc tcaaaccact 431821 gttccagacg cctgaactgc cgccgccgac gttttggaag ccagatgtgc caccggtgcc 431881 cgagttgaag aagccggacg aggggttggt ggtcgaattt ccgatgcccg gcgccggatc 431941 gatcttgagg aaggtaatcg tgcggctctc cagagcaccg acaatgctga tggggacggt 432001 caccgtcggt ccgccgatgg tgagggtgat cgtcggaacg gtcagcgtgg atgcgctgag 432061 attgaccggg ccgaagaaga acaaaccgct cagatagaag gtttggggga aaacggtcga 432121 ggcctcggtg accgtgatca tgttgccgcc gaaggtcatt acgttgtgta cgtcaatgac 432181 catctgctcg tttatgggga tgaatggagt ggtgaccgag agatcgatgg caatctggcc 432241 ctggttatcg cccgccacca agaagccatt gttgaagtcg cccgtgtcga aagcgccggt 432301 attgacgttg ccgggattga agaagccggt gttggtgtca cccgggttat agctgccggt 432361 attggtgtca cccacgttga agttgccggt gttggtgtta ccgacgttga agccgccggt 432421 gttgtagctg cccgtgttgt agaagcccgt gttgaagtcg ccggcgttga ggatgcccgt 432481 gttgtagctg ccagcattga ggatgccggt attgtcggta cccgggttcc cgatacccca 432541 gttcccggtg cccgagtttg cgatgccgac gtttccggtg cccgcgttga agatgccaac 432601 gttattggtg cccgaattga acaggccgct gttgccggtg cccgagttcc agccgctagc 432661 aatattgaag ccctgctggt tgtcgccgga cagcccgatg ccgatgttgt tgttgccggt 432721 gttggcgaac ccgatgttgt tgttgccggt gttggcaaag ccttggttga agtcgcccgc 432781 gttcccgaag ccgacgttgt agtcgccgac gtttccaaaa ccgatgttgt agatcccgag 432841 gtttccggat ccgatgttgt agtttcccag gcttccggaa ccgacattga atactccgat 432901 gtttccactg ccgatattga agctgccgac gttgccgctg cccaagatgt tttggctgcc 432961 gaggttgccg ctgccaagga tgttgaagtc accgacgttt ccgctgccga gaatgttgta 433021 attgccgatg ttggcgttgc cgagaatgtt cacgacgccc cggtttgcca ggccgagatt 433081 gaagaccggt gggccaccga aaaatcccga catgttgctt ccggtgttga agaagcccga 433141 gatcaaggcc ggcgttgtga tggccaccag gctcatgttg aacaaacccg atacggtgtt 433201 gcccgagttg atcacgcccg ataccagcac gcccgcgttt gccaggccgg agttaccgat 433261 ggcccccgac gaagagttga agaagccaga attgttggca ccggagttca ggaagccgga 433321 cgcgctaccg gcaccgctgt tgaagaatcc cgacgacggc gcactggtcg agttgaagaa 433381 gccggggctc ccgaaaatca ggccttggtg gtcgccgcgc cacaagaagc cgttgttgaa 433441 gttgccagta atgaaggcgc cggtgttgac attgccggag tttgccaagc cggtgttgta 433501 gttgccgctg ttcaggtagc ccgtgttgta ctggcccatg ttgaagccgc cggtattgct 433561 gttgcccggg ttgtagctac cggtgttgta gttgccggcg ttgccgacgc cggtgttggc 433621 tattccggag ttgaagaagc ccgtgttggc gtcgccggag ttgccaaaac cggtgttgta 433681 gctgttgccc gagttgccaa tgccccagtt cccggtaccc gagttgccga tgccgacgtt 433741 tccggtgccc gagttgaaca gaccgatgtt gccggtgccc gagttcaggc cgccgaaccc 433801 caacaaaccg ctacccgtga gcccgatacc tcggttgccg tctccggtat tgccgaagcc 433861 gatgttgttg ctgccggtgt tgccaaaccc gatgttgttg ctgccggtgt tgccgaaacc 433921 gatgttgttc agcgctgcgg tcaacccagg acccacgttg ccaaacccga tgttggagct 433981 gccgatgttg ccgctgccga tgttgccgtt gccgatgttg gccgagccga gattgaagtt 434041 cccgacatta ccgttgccga cgttgccctc gccgacgttc gccaagccca ggttgcggaa 434101 gacccgcgtg gtcacctgag ccgcggccgc gctgaccagc gcaccgccgc ccgccacggt 434161 cggcagcgcc tggccgaacg gtgtcaacgc cgagacggcc gccgaagccc cggcatggta 434221 gccaaacatc gccgccacgt cctgggccca catctgctcg taggcggcct cggtggccgc 434281 gatcgccggg gcgttttggc ccagcaggtt cgagaccacc agcgacacga acagtgcccg 434341 gttggccgag atgatcgccg gatgtaccgt cgccgccagg gctgcctcga aggcggccgc 434401 cgccagccgg gtttgggtgg ccgcctgctg ggcctgcgcc gccgccgcgc tcaaccagcc 434461 cagatagggg gcggccgctc ccgtcatcgc cgtcgacgcc gcgcccagcc acgaggaacc 434521 tgccagcccc gccgtcaccg ccgaaaacga ggccgcggcc gaacccaatt cgtcggccag 434581 tccatcccaa gcggccgccg cgtccagcat cggcgccaac ccggcaccca cgtacaggcg 434641 cgccgaattg atctccgggg gcagcaccgc gaagctcatc tagcgtccct aaccggaacc 434701 gctgaccacc accgcgtggt gggtggagcc aaacgtcccg ttccgcgctt gggtgtcttg 434761 acagtgacga ttattcaaca gacgcctgac gcaggtttgg ctttggagtg tcgagacaga 434821 aaatctcagc tagggctggc cgggcagtag ccgcaccatc aggccgttgc cttcggccaa 434881 cagcgtctcg tcgctgtcaa acagttccgc gcacacaaac gcctttcggc cctcggtatt 434941 ggtgactcgt ccgcgtacga tcaacggcac atcaatcggg gtgattcggc ggtaatcaac 435001 gtgcagaaag gcggtccggc tgatcggccg tcccgccgca tgcgagatca tgccgaacat 435061 gtgatcaaac aacagcggca acacgccgcc gtgcaccgcg gagttgcccc cgacgtgaaa 435121 ccggctaaac gacccccgca tctcaacacc gtcggtgccg taccgggtca ccgtccatgg 435181 cggtagcagc aggctgccca tgccgggcag gccgggggtc cgcccggccg gcgccttgcc 435241 ttcgtcggcc tcaaatgggc tcagcaactc gacgagcgcg gcggcgcgct cggccgcctc 435301 gtcccacacg gcgtcgccgg ggtccgccgc gaccgccagg tcctgcaacc ggcgcatggt 435361 cgccacgaac tggccgaacc ccgcaccggg actggccgga ccgtactccg gaaatccacc 435421 gtggtggtga tactcgggat cgagttcgtc ggggtgcact gacgcatctg tcacgggcga 435481 tcctgcagga cgtcccggcg cacgatggtc tgttcccgcc ccggaccgac tccaatgcac 435541 gaaaccggtg ctccggcaag ctgttccagt cgcagcacat aatcacgcgc tttggcgggc 435601 aggtcgtcga actcgcgcgc cccggagatg tcttcccacc agcccggcag ctcctcgtaa 435661 accggcttgg cgcggcaaag atcccgctgg gtcatcggca tatcgcgggt gcgccggccg 435721 tcgatctcat atccgacgca gaccggcacc gattccaggc tggacagcac gtcgagcttg 435781 gtcaggaagt agtcggtgat gccgttgacc cgggcggcgt agcgggcaat gacggcgtcg 435841 aaccagccgc agcgccggcg ccggccggtg gtcacaccga actcgcggcc agtcttggac 435901 aggtattcgc cgtgttcgtc gaacagctcg gtggggaacg ggccggagcc cacccgagtg 435961 gtgtaggcct tgagaatccc cagcacggtg ccgatgcggg tcgggccgat accagagccc 436021 acggccgcgc cacccgccgt cggattcgac gatgtcacat acgggtaggt gccgtggtcg 436081 acatcgagca gggtgccttg agagccttcc agcagcaccg tttcgccggc ctccagggca 436141 gcattgagta gcagccgggt gtcggcgatg cgatgcttga aaccctcggc ctgctccagc 436201 agcgcgtcga ccacctgcgc ggggtccagg gccttgcggt tgtagatctt gaccagcact 436261 tggttcttga actcgcacgc ggcctcgacc ttgtgggtca attgttccgg gtccagcaca 436321 tcggcgaccc ggatcccaat acgggcgatc ttgtcctggt agcacggccc gataccacgg 436381 ccggtggtgc cgatcttctt gctgcccata tagcgctcgg tgaccttgtc gatagcaatg 436441 tggtaaggca tcagcagatg ggcgtcggcg gagatcaaca gcttggcggt gtccacgccg 436501 cggtcttgca gtccccgcag ctcattgagc aggacaccgg gatcgatcac cacgccgttg 436561 ccgataacgt tggtgacccc gggcgtcagc acacccgacg ggatgagatg caatgcgaaa 436621 ttctcgccgg taggcaagac gacggtgtgc ccggcgttgt tgcccccctg atagcgcacc 436681 acccactgca cgcggccacc caacaggtcg gtggccttac ccttgccctc gtcgccccat 436741 tgggcgccga tgaggacgat cgccggcatg agttgctccc acctggtctc gcaggctatg 436801 cccgcttatt gtggtccagc cggtgaccta ccctacccag caggttgcga ggagctgtca 436861 tgtatacggc cgagaacgca cccggcgtcg cggtgttgct ctccggtgat gccgacgtgc 436921 ccggcccgtt gaccggcttg cctacccatc aagacaacct ggacaccgtc atcggacggt 436981 attcgcggct catcgtcgtc ggcgccgacg cggacctggg ggcggtactg actcggctgt 437041 tgcgcaccga ccggctcgac gtcgaggtgg gttatgtgcc gcgccggcgc agccccgcga 437101 cccgggccta ccgcttgccg gccgggcgcc gggcggcgcg gcgcgcccgg tgtggcgtcg 437161 ctcggcgggt gccgctaatc cgtgacgaga ccgggtcggt aatcgtcggc cgagcacagt 437221 ggctgccggc cgaagagcag gccctgatcc acggcgaggc ggtcgttgac gacaccgtgc 437281 tgttcgatgg cgatgtggcc ggggtgtgca tcgagccgac gctgaccctg ccaggcctgc 437341 gagctgcggt agacggcgcc ggaaagtggc ggcggtggat cggcgggcgc gccgcgcagc 437401 taggcaccac cggtgctgcg gtacttcggg acggtgtcgc ggcgccccgc ccggtgcgcc 437461 gatcgacgtt ttaccgcaac gtcgagggtt ggctgctggt ccggtagttt tcgaccggtg 437521 agcgagacgg gccagcgcga gtcggtgcga cccagcccga tctttctggg cctgctcgga 437581 ttgacggccg tcgggggcgc gctggcctgg ctggccgggg agacggtgca gccgctggcc 437641 tacgccgggg tgttcgtcat ggtgatcgcc ggctggctgg tgtcgctgtg cctgcacgag 437701 ttcggtcacg cgttcaccgc ttggcgtttc ggtgaccacg acgtcgcagt gcgcggctac 437761 ctgacgctgg atccccgccg ctacagccat cccatgctct cgctcggtct gccgatgctg 437821 ttcatcgccc tgggcgggat cggtctgccg ggtgccgcgg tgtatgtgca cacctggttc 437881 atgacgacgg cgcgccgcac cctggtcagt ttggcggggc cgacggtcaa cctggcgctg 437941 gccatgttgc tgctggcggc gacccggttg ttgttcgacc cgatccacgc ggtgttatgg 438001 gccggggtgg cgttcctagc attccttcag ctcaccgcgc tggtgttaaa cctgctaccc 438061 atcccgggtc tggacggcta tgcggccctg gagccgcacc tgagacccga gacgcagcgc 438121 gccctggcgc cggccaagca gttcgctttg gtgtttctgc tggtcctgtt cctggcgccg 438181 acgctgaacg ggtggttttt cggggtggtg tactggctct tcgacctgtc tggcgtgtcg 438241 caccggctgg ccgccgcggg cagcgtgctg gcccgtttct ggagtatctg gttctgaccg 438301 ttcagagccc aagcgccgga cgggccgcgg ggtcacagtc gtcaagcaga tccaggcagc 438361 gtccatactc gtcggtctcg ccgatagcgg ctgcggcgcg cgccagcgcc gccacacacc 438421 gtaggaaacc ccggttgggc tggtgggaat acggcaccgg gccgaagccc ttccagccat 438481 ggcggcgcag ctggtccagg ccgcggtggt acccggtacg cgcgtatgcg taggccgtga 438541 cggtcttgtc gtcggccagc gccccttcgg cgagcaccgc ccaggcgacc gacgccgacg 438601 gatgcgcggc cgcgacgatg ctcggacttt cgttggcaag cagctccgct tcggcgtcgc 438661 tgtcgccagg caacaggatt ggctcaggtc ccaagagatc acccatcgac gtcatgggag 438721 ttattgtgcg cttggtcacg tcacctcgac gatggggcca accgaaggct gggtcgctaa 438781 gctccaaaga gccactcgat accgggagga cagcagcacc catgtccaac gcacccgagc 438841 cagaccgctc agccggtgaa tccgggagcg aaccggccgg cgagcggtcc gccgatcctg 438901 gcgaggaacg caccgaaagc taccccctgg tgcctcacga cgccgaaacc gagaccgtgg 438961 tgatcaccac ctccgacaac gatgccgcgg ttacgcaacc ggaagcgcag cgcgaacgcc 439021 gtttcaccgc gcccggcttc gacgccaagg agacccaggt gatcgtcacg gcccacgagg 439081 cagccaccga ggttttccaa accaaccagg cgccgaccac cccgccgcgg atgccaaccg 439141 gaatgccccc gaaaactgct gtgccacaat caatcccgcc acggacggag gcgacgtcag 439201 tccggcaacg cacctggggc tgggcgctgg cggtggtagt gatcgtgctg gcgttggcgg 439261 caatcgcgat cctgggcacc gtgctgctga cccgcggcaa acattcgaag atgtcgcagg 439321 aagatcaggt gcggcaggcc atccagagct tggacatcgc catccagacc ggcgacctga 439381 ccgcgctgcg ttccctgact tgtggctcca cccgcgatgg ctacgtggat tatgacgagc 439441 gtgattgggc cgaaacctat cgccgggttt cggcggccaa acaatatccg gtcatcgcca 439501 gcatcgacca ggtcgtcgtc aacggcgcgc acgccgaggc caatgtcacc actttcatgg 439561 cgttcgatcc ccaggtccgc tcgacccgca gcctcgacct acagtttcgc gacgatcagt 439621 ggaagatctg ccagtcctcc agcaactgaa gccaggattg gctggtttgc ccgcattttg 439681 gccattggtc agtgctagga ccggtccgca tcaccggcac gtcaccagga ccgactagtc 439741 cgaacaccga aacgagcaac cgtagccgaa atgcggctgg atcccgtctg tggcaatgta 439801 ctggcggcct gttcccgcag agacggcggc atagcgtctc gatcgtcaac gagaggcagg 439861 tgatcgccag gtgagcatcc gccccgccga gaactcaaca ctcgacatcc gccacgtcat 439921 cggtatcggc accccgaaag ccgtcgattt gtggctcgac gtcgtcaccg agctgccgga 439981 tcgcgcccgc gaactcgggt cgttatccaa agccgaactc ggaaagcttg gcccactgct 440041 cgacggcacc aacgccgtcg agctattcga gtcgatcgac gacaagctgg ccgcagaggc 440101 actgcacgcg atggatccgt cgctggccgc caccttcctc gaggccctcg actccgacca 440161 cgccgccaac atcctgcgcg aattcaagga gcccaagcgg gaggcgctgc tgacgttgct 440221 accgctggag cgggcgatgg tgctgcgtgg cttgttgagc tggccggagg actgcgccgc 440281 ggcccacatg gtgcccgaaa cgctgaccgt acgcccgaac atgacggtgt cgcaggccgt 440341 cgccagcgtg cgggaacgcg cctcgggcct gcgcagcgat gcacgaacca ccgcctacgt 440401 ctatgtgaca gacgccgact cccacctgct gggtgtgatc gcctttcgcg ccctggtgct 440461 ggccaatccc gaacagcgag tccgtgagct gatgggtgac gacctcatcg tcgtgtcgcc 440521 gttgactgac aaggagctcg cggcgcagac aatcatgggc cacaacctga tggcggttcc 440581 cgtcgtcgat gccgacaacc ggctactggg catcatcgcc gaagacgaag ccatcgacat 440641 tgccgaggag gaagcaaccg aagacgccga gcgccagggt gggtcggccc cgctcgaggt 440701 gccctacctg cgggcgtcgc cgtggctgct atggcgcaag cgggtcgtct ggctcctggt 440761 actttttgct gccgaggcct acaccggcag cgtcctgcgg gcgttctccg acgaaatgga 440821 ggcggtgata gcgctcgcgt tcttcatccc actgctgatc ggcaccggcg gcaacaccgg 440881 cacccagatc gccaccactc tggtccgcgc gatggccacc ggtcaggtcc ggtttcgcga 440941 tgtgcctgcg gtgttagcca aggagctgtc aaccggtgtg ctggtcggcc tcactatggc 441001 cgccgccgcg gtggtgcgcg cctggacatt gggcgtgggc ccgcaggtga ccctgacggt 441061 cgcgctgacg gtggccgcca tcgtggtgtg gtcgtcgctg gtggctgccg tccttccgcc 441121 gctgctgaag aagttgcgca tcgacccggc catcgtttcg gggccgatga tcgccaccat 441181 cgttgacggc acgggtctgc tcatctactt cctggtcgcg cacctgacgc tgaccgagct 441241 gcacggcttg tgagcggccc cggtttagtg ggttagggac tttccggcgc agtgcaggtc 441301 attgcacgcc tgaacgaccc gctggctcat cgaagcttcg gccttcttga ggtagctgcg 441361 cgggtcgtag accttcttga cacccacctc gccatcgacc ttgagcactc cgtcgtagtt 441421 ggtgaacatg tgaccggcga tcgggcgggt gaacgcgtac tgggtgtcgg tgtcgacgtt 441481 catcttcacc acgccgtagc gcagcgcctc ctcgatctcc gacttaagcg aacccgagcc 441541 gccgtggaac acgaagtcga acggcttggc gtcggccggc agtccgagct tggccgccgc 441601 cacctgttgc ccttgcgcaa ggatgtcggg gcgaagcttg acgttgccgg gcttgtagac 441661 gccatgcacg ttgccgaacg tcgcggccag caggtatttg ccgtgctcac cggcgcccag 441721 cgcctcgatg gttttctcga agtcctccgg gctggtgtac agcttctcgt tgatctcgtt 441781 cgccacgccg tcctcttcgc cgccgacgac gccgatctcg atctccagaa tgatcttggc 441841 ggccgccgcc gccttgagca gctcctgggc gatggccagg ttctcatcga ttggcactgc 441901 cgagccgtcc cacatgtgcg actggaacaa aggattgcca cctttgctca cgcgttgcgc 441961 cgagatcgcc agcaagggcc ggacatagct gtccaacttg tccttggggc agtggtcggt 442021 gtgcagcgcc acgttgaccg ggtacttggc cgcgataacg tgggtgaact ccgccaaggc 442081 gaccgcaccg gtcaccatgt ctttgacccc gaggccggag ccgaattctg cgccaccggt 442141 cgagaactgg atgattccgt cactgccggc gtcggcgaaa cctttgatcg cggcgttgac 442201 ggtttccgag gaggtgcagt tgatagccgg gaaagcgtac gagttttgtt tggcctgacc 442261 gagcatctcc gcgtagacct cgggcgttgc gataggcatg aaacgttcct cctgacgact 442321 ccgatccacc cagtatcgca acaccgcaac cgagcttgtc ggcctgtgcg tgatggccgg 442381 tatgttggga cgtcatgagc accgccgtga cggccatgcc ggacatcctc gacccgatgt 442441 actggttggg cgccaacggc gtattcggtt ccgcggtgct gcccgggatt ttgatcatcg 442501 tcttcatcga gaccggtctg ctgtttccgc tgctgccggg cgagtcgctg ttgttcaccg 442561 gcgggctgtt gtccgctagc ccggcaccac cggtcaccat cggggtgctc gccccgtgcg 442621 ttgcgctggt cgcggtgctc ggcgatcaga ccgcatattt catcgggcga cggatcgggc 442681 cggcgctgtt caagaaggaa gactcccggt tcttcaagaa acactatgtg accgagtccc 442741 acgcgttttt tgagaagtac gggaaatgga cgataattct ggctcgattc gtgccgatcg 442801 cgcggacttt tgtgccagtc attgccgggg tgtcctacat gcggtatccg gtgttcctcg 442861 ggttcgacat cgtcggcgga gtcgcctggg gtgcgggtgt gacgttggcg ggctactttt 442921 tgggcagtgt cccgttcgtg cacatgaact ttcagctcat catcctggcc atcgtgttcg 442981 tctcactgtt gcccgcactg gtctcggcgg cgcgggtcta ccgggcgcgg cgtaacgcac 443041 cccagagcga ccccgacccg ttggtgttac ccgagtgagc tgaccgctgc ggcgctgtgg 443101 gcggcttcca tcagcatcca acccgatagc tgcaccgaca gatctcgctc ggcaatcgcc 443161 gagctatgca ccgctcctcg gacggaccgc gcctgctcac cgccggcggt gggcaactcg 443221 gcttcgcgat cccagaacgc cccgaacacc ggcaacccgt ccacggtttg ccggtaatcc 443281 cacgccgatt gcgcgctagc cagcactatc gcgcgggcgg tgtcgcgggc ggcggcgtcg 443341 tcggccgagt cgcccggcaa cgtggtggcg accaaggcga ggtatcgggc ggtgatcccc 443401 gcgaacaggc caccgtcccc gccgccggcg ccccgtaaca cacccaatgg agccatgtgc 443461 tcgttgacgg ccgcgaccaa gcgatgaacg cgagcgcagt gccgcgctct ggctgccgga 443521 ccggtgcgca ccgccagctc ggtttccagc ccgagcacca ccccttggca gtaggtgtac 443581 tgcgcgcgga ccaacgaccc ggccttgatg ccgtcgaata ccaggtgtgt ctccggatcg 443641 atcagcgtgc gatcgatcca gtcggccatc tgttctgcgc gcttgagcct tttcccgtac 443701 tggtctgggt agcgggccag gaatagcccg gccgggccgt tggctggggc gttgaagaac 443761 tggtcctgct tgcgccacgg gatgccgccg ccgtcctcgg gcacccaggc ttcgacgaac 443821 tggttggtga gcttgggcag tgcgcgccgg cgtcgtaccc cggcgacccg gtcggcacgt 443881 tccagcgcta acgctagcca cgccatgtcg tcgtaatagc tgttgagcca cgagaaattg 443941 ttgcggaccc ggtgcgagcg gacctggcgg ttgatccggg cgcgccgctg cggctgcggg 444001 tcgcgcagct gcgcgtcgac caggcaatcc agcaggtgtg cctgccacca gtagtgccag 444061 ctgccgaaca accggtcgcg ccgggttgac ggccaagcca ccaccgccaa ctgggtgccc 444121 ggcaacgccc aaagccgtct cagatgccgt tgcgtgacgg cggtttcggc gctggctgcc 444181 cggtttgcca gattcataat gcgatcctgc cctagcctgt cttacgccgt ctcaggcctg 444241 ttactccagc gtgacatcaa gggtggcggc gtccacgaag gccagcatgc gcgcccgatg 444301 atgccacccc cgctgaactg agcgacgatc cgcgggcccg cgagccggct gttgtcatag 444361 acggtggcgc cgtcggccag cgtgatggcc tgggcgacaa gctcggccag ccggcggtga 444421 cgctcgcgga tcttggtctc cggcacatcg tggccgcccg cggcgacgcg atgcctgacg 444481 cgctcgaccg ccaggccttc ggggataacc aacacgtgca gtacgacggt gtagccggcc 444541 gtgcgcgcgg tgcggatgag ctcgagcttc gatgggtgcg agaacaccgt ctcggcaatg 444601 aacggccggc ccaagtcgat gagcctcgcg cgggtgtcgg cggcgacctg cgccgcctgg 444661 taggcgtgcg atgttgggtc gtcgggccag cgttgtttgg cgatttcgtc ggcgttgacg 444721 aagacgatgc cgggcagcaa gggcgccagc gtgagggcga cgaacgtcga cttgccggcg 444781 ccgttgggcc cggcgaccag atcgagccgc ttcacgcgtg gcgcgttgtc ttcgagctgg 444841 cgatcacggc gtggccgcca gcacgaccga ggtgccgtct ggccggtgct cgacgatgtc 444901 gcccgcgtcg ttcagggcga ccgtggtgat gccctgcgcg gcgagcacgt cgccgtagtt 444961 ggttcgcgac aggcgctcct cgatagctgc ggagatctcg gcgttgaaca ccacgccctc 445021 ctccagcgtc aggtccgtca tcggcagatg ccccgcgagc gcagcttcca cgcggcgccg 445081 cgacgccgtg tgctggttcg acaccgcccg accgacgcgg gcccagtggt cgagctgctg 445141 cttggccgaa cggctctgac gagcaccctc ggccgccgcg ctgtccacca gatccgcggc 445201 gacgcgcgtg acgcggtcga cggctttggg cacgacatct ctcctcgggt gtagcgatct 445261 gttacagctt atagcaaagt gctacaccga gctgtggtga ggggcgcaca cggctagcgg 445321 gcaccggcca gcgccagcag caactggtgc aggccggcca gcgagtgcgc cggcaggaac 445381 aggtcgcaat agggcagcgc cgccgccatc gaaccggcac gcggctggaa ctccggatgt 445441 gccgcgcgag ggttcagcca gaccagcaac tcggcgcggc gacgcaccct ggtcagtgcg 445501 tgcaccaaca cgtcgggcgg atcgctgtcc cagccgtcgg aggcgatgat caccaccgcg 445561 ccgcgtaacg cgttgccatg cggcggggcc agcagggcgg cgacactacg gccgatgaac 445621 gtaccgccgt agcggtcggt caccctagcg ttggcccgat gtagcgccat ctcggccgag 445681 cgatgagaca gcaccgaggt aagtcgagtc agcgacgtcg aaaacgcgaa aacctccggg 445741 tggccccctg cccggcgcag caccgccgcc cgcatcagac gcagatagat ggcggcgtag 445801 ggctgcatcg agcggctcac atcgcagagc aggagcaccc gcctggggcg tcggcggggc 445861 cggatccgtg ccaacagcac cgactcccag ccagtcgacc gcgacgcgtt catcgtcgcc 445921 cgcaggtcga tgcgcttgcc gtgcgggctg gactcgaatc gcatgctgcg ccgccgcggc 445981 cagcgcgcca tcgtggcctc cagccaggcg ccgagcagac gcagatcgtc gggatcgaac 446041 tggtcgaatg gctcgtcggc ccgggcgaca atgcggctgg gcaggacatc gggcagtgtg 446101 cggctgggtc cgccctgacc ggcgctggcc atcgtcagcg agcgagtatc ccagggcaga 446161 ttctgggctt gggcggcaca agatcgccgc ttggcgcggt gcccgacgcc ggccaccggt 446221 gtgcgcgggc ctgcaatggg cggtggtggg cggttggcac cgtcgggttc ggcgctgcca 446281 aataccccga acagcgaagc gaataccgca tcgaacgtgg ccagttcgtc tacacggctg 446341 accagggtca accgcgcgcc ccaatacagc gccgccggcg tacgcggcac caactgctgc 446401 aacgcctgca ccaaactcgc ttgaccgctg gcggacaccg gtatcccggc gtcgcgaagg 446461 cgcgctgcca gcgctgccgc gaacgccgcg aggtcgacgc ccggcaacag tgcaggggtg 446521 gccatcccat tcatcgccgc cggcgcagca cccagatcag cagcagcacg gtcagcgccg 446581 ccagcagcgc cgaaccgtac tttttgagct ggccgccgtc ggccagctgc agcaagtcga 446641 tgggcgccgc ttcggtagca gggggtgtgc cttgcgggct ttctgaactc tgggccgcga 446701 gctcggcttc tagcgaatcc acgaactggc ccagcagctt ctccgacacc tgctgcagca 446761 tgccactgcc gaattgcgcc agtttgccga caatcttcag atcggtgtcg acggtgacgc 446821 gggtacgctc tccgacctcg tgcagctggg cagcgaccgt ggcggccgcg ttgccggtac 446881 cgcgcgcctc cttgcctttg gcgtcgaaaa cggcgcggtg ctggttgcgg tcctgctcga 446941 caaagtgcac cttgccgctg aactcgctgg tgaccggccc aaccttgacc ttgaccttac 447001 cgaggtactc gtcgccctca tggccgatca actgggctcc aggcatcagc ggaatcatct 447061 gctccaggtc gcatagcctg ctccaggcct gctcgatcgg agcgctgacg gtgaactcgt 447121 tggcgatctt catcctgtgc gtcctctcat gcgtggctgc actcagtaaa agcttggtac 447181 gcatcgcgaa tctgcgtacg gtcgtcgggc gttttggcca gggccccaag gctggccaga 447241 gcgggactcg aatccgctgc ggtgaggtct gcgaccccga gtgccaccaa agccgccacc 447301 cagtcgatag tctcggccac accgggtggc ttgtccagat cgagatcccg tgcagtgcaa 447361 acgaattgag tggcgttctc gatcaacggc gcggtagccc cgggcaccgt gcggcgcacg 447421 atcgcggccg cccggtccgg ccccgggtag tcgatccagt ggtagaggca gcgccgccgc 447481 agtgcgtcgt gcaggtcacg gctgcggttg gacgtgagca ccgcgatcgg cgggcactcc 447541 gcgaggaaag tgcccagctc gggaacggtc accgcggact caccaaggaa ctccagcagt 447601 aacgcctcga attcgtcgtc ggcccggtcg atttcatcga tgagcagcac tggaggggtc 447661 ggtccgcggt gccgcacgca ccgcaggatg ggccggtcca ccagatacgc ctcggtgtac 447721 agatccgctt ccgatatgtc tgagataccc ttgccgcgcg cctcggccag cctgatggac 447781 aatagctggc gttggtagtt ccagtcgtag agcgcctcgt tggccgtcag cccttcatag 447841 cattgcagcc ggatcagcgt ggtatccaac acgactgcaa gggttttcgc ggctgttgtc 447901 ttgccaacac cgggctcacc ctccaacaac agcggcctgc ccagcgtaac cgccagatag 447961 attgccgacg ccgtgccggt atccagcagg tagttctgtt cgtcgaaccg gcggatcacg 448021 tcgtcgggac ttgcgaaggt cacgagggca ccgattccag cagccgtcgg tagtcgtccc 448081 aggtgtccac gtcaagcggc acgcagccgt ccacggcgag ttcgcgcact gggtggcggc 448141 cggagtgcac cagcttccag acacccttgt cgccgtgcag tcgcgcgagt tcgccgaaca 448201 cggtgcggct aaaccagaat ggatgcccga cgccgtcggc gtagcggcac accatgatct 448261 cggtggccgg cccgacgtcg atgatccgcc gcagtgtcgc cggcgccacc tgaggctggt 448321 cgcccagcat cagcacgatc ccggtggccc gcggatgcac ccgtgccaac gcgacgcgca 448381 gcgatgccgc acacccgcgc tcgacatcct cgacgaccac cacgtcggtc ccgtccagcg 448441 ccatcgcggc acgcaccgcc gacgccgcac cgcccagggt gaggatcagc tggtcgaatc 448501 cggcttgccg ggcaacgtcg agggtggccc caagcaccgt ggtatcccga tatggcagta 448561 gctgtttggg cgtgcccaac cggttggagc gcccggcggc gagtaccaca ccggtgatct 448621 gggtcgcggt catgcgccgc cgttctcgtc cgccaacgcc ttccggcctc tggggccgcc 448681 gccgcgcagc gtggcgatca gttccgccgc aatcgacacc gcgatctccg ccggagtttt 448741 ggcgccgatg gccaatccga ccggggtatg cacccgggcc cgctcggcat cggacaggtc 448801 cagcgaatcc aggatggacg cgccgcgtac cgtgctggcc accagcccga catacccaac 448861 gccgttatcc agcgccgtgc ggatgatttc ggcttcgggc ccgccgtggc tggcgatcac 448921 aatcgcagtt ggcaaggcgt cggtgtcggc cggatcggtg tcgcggcgcg cgtcgtagcc 448981 caacaggccg cacagttcga tcaacgcgtc ggcgatcggg gtttcgccgt aaatctggat 449041 cagcggggcc ggcagctgcg gggtcaggaa gatctccagg gatccgccgg ccaggcacgg 449101 gttgaccacc acacacgccc cgggagcttc cgggaagtgc acgtcaccgt cgggcagcac 449161 gcgcagcagc acgctctcgc cggcctgcaa cacgcccatc gccgccttgc ggaccgagtt 449221 ctgcgcgcag tggccgccga caaagccctc gatggtgccg tccgccaaca ggattgcctc 449281 atcgcccggg cgggccgacg tgggctgctg ggcccgcacc acggtcgcgc gcacgaacgg 449341 tgtccgcgcg gccaccagct gtgcggcccg gtcactgatg gacatcgacg ccctcgagct 449401 cccctagatc ggtggtgtgg cccggccctg catggcctcc cagacccgcg acggcgtcaa 449461 cggcatgtcg gcgtgccgaa ccccgaacgg cgccaacgca tccaccaccg cgttcaccac 449521 cgccggcggg gaacccaccg tggccgactc accgatgccc ttggcgccga tcgggtgatg 449581 cggcgacggg gtcacggtgt gcccggtctc taggtgtggc acctcgagcg cggtcgggat 449641 caggtagtcc atcaacgatc cgcccagaca gttgccgtcc tcgtcgaagg caatcatctc 449701 catcagcgcc atgccgatgc cgtcgacgat gccgccgtgt acctgaccct cgatgatcat 449761 cgggttgatc cgggttccgc aatcatcgac ggccaaaaag cgccgcacct tcaccaccgc 449821 ggtgcccggg tcgatgtcga ccacacagaa gtaggcgccg tacgggtagg tcagattcga 449881 cgggttgtag cagacctcgg catccagccc gccctcgatg ccctcgggca gatcgccggc 449941 gccgtgcgcg cgcatcgcga tgtcggcgat ggtcaccgcg gccgacgggt cacccttgac 450001 gtggaacttc cctttctccc actgtaagtc ggcgaccgaa acctcgagca tgcccgaggc 450061 gatgatcttg gccttgtcgc gcaccttgcg ggcgaccagc gccgcggcac cacccgagac 450121 gggtgtggac cggctgccgt aggtgcccaa cccgaacggt gtctggtcgg tgtcgccgtg 450181 caccacctcg atgtcgtcgg gcgcaatccc cagctcctcg gcgacgatct gcgcgaacgt 450241 cgtctcgtgg ccctggccct gggtctgaac cgaaagccgc agcacggctt tgcccgtcgg 450301 gtgcacgcgc agctcgcagc cgtcggccat gcccaggccg aggatgtcca tgtccttgcg 450361 cggcccggcg cccacggcct cggtgaaaaa tgacatcccg atgcccatca gctcgccgcg 450421 cgctcgccgc tgcttttgtt cggcgcgtaa cgcctcgtag ccgatcatgt tcatcgcctt 450481 acgcattgtg gtctcgtagt cgcccgagtc gtacacccaa ccagtcttgc tctgatacgg 450541 aaactggttg ggccgcaata gattccgcaa gcgcagctcg gctggatcca tcttcagctc 450601 gaaggccagg cagtccacca gccgctcgac gaagtagacc gcttcggtga tgcggaacga 450661 acacgcgtag gcgaccccgc cgggcgcctt gttggtatac accgcggtca tgtgacagta 450721 ggcggcctcg atgtcgtagc tgccggtgaa caccccgaag aacccggctg ggtacttcgc 450781 cggcgcggcc tgggcgttaa acgcaccatg gtcggccagc acattggacc ggatcgccag 450841 gatcttgccg tcacggttgg cggcaatctc gccgaccatg atgtagtcgc gggcgaatcc 450901 ggtggacgtc aggttctcgc tgcggtcctc catccatttg accggcttgt ccagcagcag 450961 cgacgcgaca atggcacaga cataaccggg atagatcggc accttgttgc cgaagccgcc 451021 gccgatgtcg ggcgagatca cccgaatctt gtgttcgggc aacccggcca ccagcgcgta 451081 tagcgtgcga tgcgcgtgcg gcgcctggct ggtggtccac agcgtcagct ttccggtgac 451141 cggatctaga tcggccaccg cgccacaggt ttccatcggc gccgggtgca cccgcgggta 451201 gacgatctcc tgctggacaa cgacgtcggc cttggcgaac accgcctcgg tcgccgccgc 451261 gtcgccggtc tcccagtcga agatgtgatt gtcgctcttt ccctccagat cggtgcggat 451321 gaccggcgcc gacgggtcca gcgccgtgcg ggcatccacg acgggatccc gcggttcgta 451381 gtcgacgtcg accaactcgc atgcatcgcg ggccgaatac cggtcctcgg caaccacgaa 451441 cgccacctct tggccctgga agcgcgtctt gtcggtggcc agcacggctt gtacgtcgtt 451501 ggctagtgtc ggcatccaag ccaggccctt ggcggccaga tcggcgccgg tcaccacggc 451561 cttgactttc ggatgtgcct gcgcggcagt cacatcgatg cgcacgatgc gggcatgcgc 451621 atacggcgaa cgcaggatgg ccagatgcaa catgcccggc agcgcgacgt cgtcgacgta 451681 ggttccgcgc ccgcggatga atcgcgggtc ctctttgcgc atcatccggc cgtgcccgca 451741 cggctgctga gcgttgtcgg ctaggtcttc cggcgacgga gggcgtgact cgatcgttgt 451801 catgactgcg cctttacggt ctggtgcgct gccgcccact gaatggagcg cacgatcgtg 451861 gtgtatccgg tgcaccggca gatctgcccc gagatcgctt cccggatggt ctgctcgtcg 451921 ggatccgggt tgcggtccag cagggcgcgc gcggtaatca gcattcccgg ggtgcagaag 451981 ccgcattgca gcccgtggca gcgcatgaac ccttcctgca ccgggtcgag ctggccgtcg 452041 ggcccagcca agccctctac cgtgcggatg ctgtgcccgg aggccatcac ggcgagcatc 452101 gtgcaggatt tcaccggcac gccgtcgacc tccaccacgc atgtcccgca gttgctggta 452161 tcacagcccc agtgagttcc ggtgagccgc agctgatcac ggagaaaatg gaccagcagc 452221 atccggggtt cgacctcggc ggtgacgggc tcgccgttta ccgtcatgtt cacctgcatg 452281 gttggttccc ctctcaggcc tcgggggccg ccggcgcgcc gagcacgcgc ccggcggcgg 452341 tgcgcagcgt gcgaacggtc agttcaccgg cgaggtgccg cttgtactcc gcggtgccgc 452401 ggacgtcggt caccggcgtg caagcttgcg cggcgcgccg gcccgcctca gcgaacacct 452461 cttcggtagc gggttggccg accagtcccg cggacagctc cgccagcgcg accgggtcgg 452521 gattcaccgc ggtcaaaccc acccgagcgg cgaggatcgt ctggccgtcg agcgtgaccg 452581 cggcaccggc cgcggtgatg gcccagtcgc cgacccgccg ttccaccttg gcgtacgcgc 452641 tggaggtgtt gtgccgcagc ggaatccgca cctcaattag gacctcgttg tgggcgagcg 452701 cggtttcgta cggcccgacc aggaagtcgt cgatcgctat ctcacgttca cccgagggcc 452761 ctttcgccag gcacaccgca tccagaacgg tgcacacggt cgacaggtcc tcggccggat 452821 ccgcctggca gagcgaaccg cccagggtgc cgcggttgcg gaccaccggg tcggcgatca 452881 cccgctcggc atcgcggaag atcgggcaca ccgccgccag cgcatcggag tccagaatct 452941 ctcgatggcg ggtcatcgca cccagccgaa ccaggttggg attgttgatt ccgccgacca 453001 cgacgtagcc gagttcgggg gccaggtcgt tgatgtccac gaggtactcg gggttggcga 453061 tgcgcagctt catcatcggc agcaggctgt gcccgccggc gaccacccgc gctccctccc 453121 ccaaccgatc caacaatccg atggcgtggt ccacgctggt ggcacgttcg tattcgaaag 453181 gcccaggtac ttgcatgcgc cccagtgtcg gccgcccgcg aaaagggcgt caatgtcgag 453241 ttaagtaatc cttgaactcg cccgctacct gcgcatcatg gtggatccgt ccggcaatat 453301 cggccagcgg gcgtccccct ccgccccaac gtcgggcgat gatgtcggcc gcgatcgaga 453361 ccgcggtctc ctcgggggtt cgggcaccga gatccagccc gatcgggctg gacaaccggc 453421 tcagctcggc gtcggtcagg cccgccgcgc gtagccgatc catccggtcg tcgtgcgtct 453481 tgcgtgatcc catcgccccc acgtatccga cacccaggcg cagcgccacc tcgagcaccg 453541 ggacgtcgaa cttcggatcg tgggtgagca cgcagatcac cgtgcgctcg tcgataccac 453601 ccgcctccgc ctgggcagcc agatagcggt ggggccatgc gacgacgacg tcatcggccg 453661 tcggaaagcg cgctggcgtg gcgaataccg cgcgggcgtc gcagacggtg acccggtagc 453721 cgaggaacga accctgccgc gccagcgcgg cggcgaagtc gatggcaccg aacaccagca 453781 tccgcgggcg cggcgcgtgg ctggacacga agacctccat gccctcgcca cgccgctgcc 453841 catcgggccc atattcgagg atctcgctgc ggcccaccgc gagcagaccc cgcgcatcgt 453901 cgataaccgc cgcatcggca cgcgccgaac ccagcgaacc cgtcacgggg ctctttgtgt 453961 cgggccggat caccagtcgg cgacccaccc gccgctcgtc cggatgggcg atgacggtcg 454021 cgatggcgac cgggcgttgc gcgccgatgt cgtcggccag ctcgcccagc tcgggaaacg 454081 tggcccgcga tacgggctcg acgaagacgt cgatgatgcc gccacaggtc aggcctaccg 454141 cgaatgcggt atcgtcgctg actccgtagt gttccagccg cggtatcccg gtttgggcca 454201 cctcggcggc cagctcatat accgcaccct ccacgcagcc gcccgacacc gacccactta 454261 ccgaaccgtc cggggctacc accatcgcgg cccctggggg ccgcggcgct gaccgcaagg 454321 ttcgcaccac cgtcgcgacc cccgcggtgt caccggcggc ccagatcgcc atcagctcgg 454381 caagcacttc acgcacgctt cccaaagtag gcttcagtgc atgaccccgg ctcaacttcg 454441 ggcctattcg gcggtggttc gcctgggctc ggtacgggcg gccgccgcgg aactcggtct 454501 ttccgacgcc ggagtctcca tgcacgtcgc ggcgctgcgc aaggaactcg acgacccgct 454561 gtttaccagg accggtgccg ggctggcgtt cacgcccggc gggctgcggc tggccagccg 454621 cgcggtcgaa atcctgggcc tgcaacaaca aaccgcgatc gaggtcaccg aggccgccca 454681 cgggcgtcgg ttgctgcgca tcgccgcctc cagcgccttc gccgaacacg ccgcgccggg 454741 cctgatcgag ctcttctcgt ctcgggccga cgacctttcg gtcgagttga gcgtgcatcc 454801 caccagccgg ttccgcgaac tgatctgctc gcgcgccgtc gacatcgcga tcggcccggc 454861 cagtgagagc tcgatcggtt ccgacggctc gatctttcta cggcccttcc tgaagtatca 454921 gatcatcacc gtcgtcgcgc cgaatagccc actggccgca ggcattccga tgcccgcgct 454981 gttgcgtcac cagcaatgga tgttgggtcc gtccgccggc agcgtagatg gtgagatcgc 455041 aaccatgttg cgcggcttgg cgattccgga gtcccagcaa cggatcttcc agagcgatgc 455101 cgccgcgctg gaggaggtca tgcgcgtcgg gggcgccacg ctggccattg gctttgcggt 455161 cgccaaggat cttgccgccg gacggttggt gcacgtgacc ggtcctgggc tggatcgcgc 455221 cggcgagtgg tgtgtggcga cattggcgcc ttcggcccgc caacccgccg tctccgagct 455281 tgttggcttc atcagcaccc cgaggtgtat tcaggcgatg atcccgggta gcggggtcgg 455341 ggtgacgcgg ttccgcccaa aggtccacgt caccctgtgg agctagctac ttcgacttga 455401 aaggctcggc gcgccggtcc gcccgttgac ggggcccggc tgcgaggatt agccagttcc 455461 cttgtcgcac aggagcgttg aggctatcgc cgtacgccta ctgcgtgcga tcagcgcttg 455521 ctcgttccat accacagggt gcggcccagg tgcaaggttc actgtgcatc gtgcgctgga 455581 gcctttggtg cctgttgccc gttgaaccgt gatccagcgc ggctgagggt gtggtggtgt 455641 cgggccgctg ggaggccggg aatgcggacg gtaacggtgg ctccgcgggg ttgatcggca 455701 gcggcggggc cggcggcgac ggcggtagcg gcggggccac cggcgccggt ggcgaaggtg 455761 gcgatgctgg agcaagcggg tccataaacg gcaacgccgg cgaccccggc aacagcggag 455821 aacgcggcgc agtgggcaag cccggcgcac ccggctgacc cgaaaatcac cgcatcaccg 455881 ggctcgctca caaccgagag cggacgcggg ctcggcgggc tagacgaatc gacgcgccaa 455941 ctttctcgga tcgaagaagc tatacgcttt acccccatga gtgtgtacaa ggtgatcgac 456001 atcatcggga ccagccccac atcctgggaa caggcggcgg cggaggcggt ccagcgggcg 456061 cgggatagcg tcgatgacat ccgcgtcgct cgggtcattg agcaggacat ggccgtggac 456121 agcgccggca agatcaccta ccgcatcaag ctcgaagtgt cgttcaagat gaggccggcg 456181 caaccgcgct agcacgggcc ggcgagcaga cgcaaaatcg cacggtttgc ggttgattcg 456241 tgcgattttg tgtctgctcg ccgaggccta ccaggcgcgg cccaggtccg cgtgctgccg 456301 tatccaggcg tgcatcgcga ttccggcggc cacgccggcg ttaatgcttc gcgtcgaccc 456361 gaactgggcg atcgacaccg tgaccgccgc gccggcacgg gcgtcgtcgg taatgccggg 456421 cccttcctgg ccgaataaca gcaggcattc ccgcggcaac gcggtctgct ccaggcgcgc 456481 cgcacccggg acgttgtcca ccgccaccac ggtcaagccg gcgcccgccg cgaactccag 456541 cagcccggtg gtgctgtcgt ggtggcataa ccgctgatag cggtcggtca ccatggcgcc 456601 gcgccgattc caccgccgac gcccgacgat gtgcacggtg tgcacggcga atgcattggc 456661 ggtgcgcacc accgagccga tattggcatc gtgtccgaag ttctcgatcg ccacgtgcaa 456721 ggggtgacgg cgcgtatcga tgtcggcgat gatcgcctct cgggtccagt accggtaggc 456781 gtcgacgacg ttgcgagcat cgccgtcgcg caacaacacc gggtcgtatc gggggtcgtc 456841 cgggaggtcg cctgcccagg gccccacgcc gccggtcggc gcgccccatt ccgtaggccc 456901 gggcccaagc gcactcatcg cgaggtccac aacgcggcgt gggttcccac tgtcgcgacc 456961 gtcgcgtaca gcaacgcctc gttgatctcg ccctgcgtac acagcgacgc gctaatgtgg 457021 accgcgtcca gtagcgcgtc aggaaggatc agcaccgagg atccatacgt cgcgcacgcc 457081 gggctcagtc cgcggccggg cagtgtcggt gcggcgagca gcatcatggg accgccgcgg 457141 gcagcggagt cgtcccgata ccgatccggg acggcgtcga cctccagccc cagcagcatg 457201 tagccattac cggtgaacgc caccggatct ccgagctggg gcttacccag ctgcatgccg 457261 tcggcgcgcc aagccgagac gctggtggtc ttgaccacca gaccggcgtc gttttgattg 457321 gtcggcagca tccccaccgg aaacgcggcc ggataggccg cggcagtacc cgggatccga 457381 tcgcggggcg agtacgtgta cacgccgcgg acggcgcttc gctctttcag cggacccagg 457441 cagaccgtgc ctgtgagccg gccggcaggc gccgatagcg gcgacacgac gtcgcgcacg 457501 tgggccatcg cgtcgccgca gctgccgagc gcagcggatt ccatcggatg ggccaacgcg 457561 ccgtacagcc cgaagcgaat gtcttcgggc ttggcgtgcg gcgcatgcgg gtccgtcggg 457621 gatgcatcca cgtcgatgag cacatagtcg ccggaccagc gcagattcga caccgacatg 457681 ttccagccca gcaccgccag cgattcgccg gtccgggcgc tctgggcgcc gtaggtgcga 457741 ccggaatgac tcgaacccga gcagcccgtc agaccggaca ggacgacggc cccgcaggtg 457801 gcccaggcaa cgagaatgcg cacagcgatg ccgccgacgc ctaatccagc cccagatcgg 457861 ccaggcccag cacgctgcgg tagcgcagtc cctcggcttc gatagcctct gcggccccgg 457921 tggcgcgatc caccacggta gccacgccga caacctcacc acccacgtct tggacggcgt 457981 gcaccgccgt cagcgcggag ttaccggtgg tactggtgtc ctctaccacc agcacccgct 458041 gcccggtaac ctccgaccct tcgataagtc gctgcatgcc atgggctttc gccgacttgc 458101 ggaccacgaa cgcgtcgatc ggacggcccg gggcatgcat gatggcggtc gccacgggat 458161 cggccccgag tgtcaggccg ccgacaaccg aatagtccca gtcggcagtg agttcgcgca 458221 ttagccggcc gatcagcgcg gacgcccgat ggtgcaaggt ggcgcgacgc aggtcgacgt 458281 aatagtcggc ctcccggcca gacgacagcg tgacgcggcc gtgcaccacc gacagccggc 458341 gcaccaactc agccaactct gcgcggtcag gtccggccac ggcttctcct cacgccgcca 458401 cgcgggaggc cgatcacatg cggcgtcacc gcggtggcct cgggcgtgac atccgcggtc 458461 tcagtgttgg tagttggtgg cctgctggcc gttgcgtccg gccggcggcg ggcgccgaac 458521 gccttcggag cggccttcat cgcggcggat cggctcgggg gcccgacgag ccggatcggg 458581 cagcaccgtg gttgccggat ccggctgggc gcgccgcggc ggtaactccg cggggccgcc 458641 cggggccacg ggtcggccag gtgccgcgcc gcgcggtccc accccggttt gttggggcat 458701 ctcctgcggc agcggcggca acacgcgcag caagtcgttg aattggcgca cggtgcgcag 458761 gccctcgtcc cactgggcac gggtgctggc aatcggcatg ctgaccagcg tccagttctg 458821 ctcgttccac atgatttcgg cgcagtcggg cgcggtgtgc gcgaaggtga ccatccgccg 458881 atcgcaggcg cgccgggccg cgtctagatt ggtggagtac accatccgtg gcccgatcgc 458941 gcccagcagc cagatgtcgc tttctcgtgg ctctttcagg cctttgagcc gcaggtcgac 459001 cacgacattg gtgcccacct tgcgatgcag cgcgatcacg gtggcgactt cctcgagatc 459061 gaagatgtac accgcctcgc cgcggatctg acccagcacc acgttatggg cggcaacatc 459121 gccaactgtg gacatcacgc cgcgcgtcca gcgcttgagt atctcggtgg attcccgttc 459181 gtagtcgaac ccgtgcgatc gcgcccacga cttgcggcgt ctgctgcgcc cgcggcgtcg 459241 atcgatgtcg acgtacagca acaccacggc accgacgaag cacagtgccg agagcgtgaa 459301 ccaaagcggg accatcggtg cttagcctat ccgctggcgg cccggaaccg agaatgcgac 459361 caggtcacaa cccagtcacc ttccacgccg agcagacgag gaatcgcact gcgcggacct 459421 cacgcgtgcg attccgcgtc tgctcgtcag acaaatcagc ccaggatcag cgagtcggcg 459481 tcggggctga cgttgaccgg cacggtatcg ccgtcgtgca cctggccggc caacagcatc 459541 ttggccagct ggtcaccgat ggcctgctgc accagccggc gcaacggccg cgccccgtac 459601 accgggtcga atccgcgctg cgccaaccag cgcttggccg gcagcgagac ctgcagctgc 459661 agccgccgct gcgccagccg cttgcccagc tgcgccagct ggatgtcgac gatgcgcacc 459721 agctcttcgg ggttgagacc ctcaaagatg agcacgtcgt cgagccggtt gatgaactcc 459781 ggcttgaacg tagcgcgcac cgcggccagc acctgctcgg cgctgccacc cgaccccagg 459841 ttggacgtca ggatcaagat ggtgttgcgg aagtcgaccg tgcggccgtg cccgtcggtg 459901 agccggccct cgtcgaggac ctgcagcagc acgtcgaaca cgtccgggtg cgccttctcg 459961 atctcgtcga acagcaccac cgtgtaggga cgccggcgca ccgcctcggt cagctgaccg 460021 cccgcctcgt atcccacata gccgggcggg gcgccgatca accgagccac ggtgtgcttc 460081 tcgccgtact cgctcatgtc gatgcggacc atcgcccgct cgtcgtcgaa caggaagtcg 460141 gccagcgcct tggccagctc ggtcttgccg acaccggtcg ggccgaggaa catgaacgcc 460201 ccggtgggcc ggttggggtc ggacaccccg gcccggctgc gccgcaccgc atcagagact 460261 gcggtaaccg cggccttctg cccgatgacc cgcttgccca gctcgtcttc catgcgcagc 460321 agcttggcgg tctcgccttc cagcagccga ccggccggga tgccggtcca cgccgacacc 460381 acgtcggcga tgtcgtcggg accgacctcc tccttgagca tcacctgctc ccgggcctgc 460441 gcctgcggca acgccgcgtc gagcttcttc tccacctcgg ggatgcgtcc gtagcgcagc 460501 tcggcggcct tggccaggtc gccgtcgcgt tcggcccgct cggattcccc gcgcagggct 460561 tccagctgct ccttgaggtc gcggacgatt tcgatcgcgt tcttctcgtt ctgccagcgg 460621 gtggtgagct cggccaactt ctctttctgg tcggccagct cggagcgcag cttggccaac 460681 cgctccgccg acgcctcgtc ttcttctttg gacagcgcca tctcttcgat ctccagccgg 460741 cgcaccagcc gctcgacctc gtcgatctcg acgggccgcg agtcgatctc catccgcagc 460801 cggctggccg cctcgtcgac caggtcgatg gccttgtcgg gcaggaagcg ggcggtgata 460861 taccggtcgc tcaaagtggc agctgccacc agcgccgagt cggtgatgcg caccccgtgg 460921 tgcacctcgt agcggtcttt gagcccgcgc aggatgccga tggtgtcctc caccgacggc 460981 tcgccgacgt acacctgttg gaaacggcgc tcgagcgcgg cgtccttctc gatgtgcttg 461041 cggtattcgt ccagcgtggt cgccccgacc agccgtaact cgccgcgggc cagcatcggc 461101 ttgatcatgt tgccggcgtc catcgccccc tcgccggtgg cgccggcgcc gacgatggtg 461161 tgcagctcgt cgatgaacgt gatgatttgg ccggccgagt tcttgatgtc gtcgaggacg 461221 gccttgagcc gttcctcgaa ttcgccgcgg tatttggagc cggcgaccat cgagccgaga 461281 tcgagcgcga cgatggtctt gtcgcgcaag ctctccggca cgtcgccggc cacgatgcgc 461341 tgcgccaggc cctccacgat cgcggtcttg ccgacgccgg gctcaccgat cagcaccggg 461401 ttgttcttgg tgcgacggga cagcacctgc accacgcggc ggatctcgtt gtcgcggccg 461461 atgaccgggt cgagtttgcc ttcgcgggcg cgggcggtca ggtcggtgga gtacttctgc 461521 agcgcctgat aggtcgcctc cggttcgggg ctggtgaccc gggcgctgcc gcgcaccttg 461581 acgaacgcct cccgcagcgc ctgcggcgag gcgccgtggc cggtcaacag cttggcgacg 461641 tcggagtcac cggtggccag cccgaccatc acgtgctcgg tggagacgta ctcgtcgtcc 461701 agctcggtgg ccagctgctg cgcggtggtg atcgccgcta acgactcgcg ggacagctgc 461761 ggctgcgtgc tggctccagt cgcctgcggc aaacggtcga gcaggcgctg ggtttcggcg 461821 cggacggtgg cgggctcgac accgacagcc tccagtagcg gtgcggcgat accgtcgttt 461881 tgggtcagca gcgccatcag caggtgagcg ggccggatct cgggattgcc ggcggtcgaa 461941 gccgcctgta acgccgcggt tagcgccgcc tgcgtcttgg tcgtcgggtt aaacgagtcc 462001 acgacacctc cattcggggt ccgttcgaaa tgcttgtcgg gttgttcaac gccgtcaatg 462061 ttgagtctgt tccgctcaat tttacccact tgtgcatccg ccgccgtttc gccgcgagct 462121 tagaatcgag gtccgtgggc ctcgaggacc gggacgcgtt gcgggtgttg caaaacgcct 462181 tcaagctcga cgacccggaa ctggtccgcc gcttctatgc ccattggttt gccctcgacg 462241 cctcggtacg cgacctgttc ccacccgaca tgggcgccca gcgagccgct ttcgggcagg 462301 cgctgcactg ggtgtacggc gagctggtgg cgcagcgcgc cgaggaaccg gtggcctttc 462361 ttgcccagct cggccgcgac caccgcaaat acggtgtgct gccaacccag tacgacacgt 462421 tgcgccgcgc gctgtatacg accctgcgtg actatctggg ccatccaagc cggggcgcct 462481 ggacggacgc cgtcgacgag gccgccggcc agtcgctcaa cctgatcatc ggggtgatga 462541 gcggtgccgc ggacgccgat gacgcgcccg cctggtggga cggcacggtc gtcgagcaca 462601 tccgggtgtc acgcgacctt gctgtcgctc ggctgcagct ggaccgcccg ctgcactatt 462661 accctggcca atacgtcaac gtgcatgttc cgcaatgccc ccgccggtgg cgatatctca 462721 gcccagccat tccggccgac ccgaacgggc ggatcgagtt tcacgtccgg gtggttcccg 462781 gtggcctggt cagcaacgcc atcgtgggtg aaactcggcc cggtgaccgg tggcgattgt 462841 ccggtccgca cggagccttt cgggtggacc gcgacggcgg cgacgtgctc atggtcgccg 462901 gtagcaccgg gctggcgccg ctgcgggcgc tgatcatcga cctcagccgc ttcgcggtga 462961 atccgcgcgt gcacctgttc ttcggagcac gctatgcctg cgaactctac gacctgccca 463021 cgctgtggca gatcgcggcg cacaatccgt ggctgtcggt ctcgccggtg tcggagtaca 463081 acggtgatcc ggcttgggcc gccgactatc ccgacgtgtc ggcgccgcgc ggtctgcacg 463141 tgcgccagac cggccgacta cccgatgtgg tctcccgata cggcggctgg ggcgatcggc 463201 agattctgat ctgcggtgga ccggccatgg tccgcgccac caaggccgcc ctgatcgcca 463261 aaggcgcgcc accggagcgc attcagcacg acccactgtc gcgctagccg ggcggaaatc 463321 caccgtccgg tggcgtcgct tcgacatggc atacggcctt tgctacccgg tcaccgctgg 463381 ctagcatgag tgcgactgag tggagcgggg atgagcaagt tgctgccacg gggcacagtg 463441 acattgctgt tggccgacgt cgagggatcc acctggctgt gggagaccca tccagacgac 463501 atgggtgctg ccgtggcgcg cctcgacaaa gccgtgtctg gtgtgattgc cgcccatgac 463561 ggcgtacgcc cagtcgagca gggtgagggt gatagctttg tcctcgcgtt cgcctgcgcg 463621 tcggatgccg tggccgccgc gttggacttg cagcgagcgc ggctcgcacc gatccggttg 463681 cgcataggcg tgcacaccgg ggaggtcgcg ctccgcgacg aaggcaacta tgccggtccg 463741 accatcaacc ggaccgcgcg cctgcgtgac ttggcgcatg ggggccagac ggtgctctcg 463801 ggcgtgaccg aaagcctggt catcgatcgc ctcccggaca aagcatggct ggttgacctg 463861 gggacgcacg cgctgcggga tctgtcgcgt ccggagcggg taatgcagct gtgtcatccc 463921 gaattgcgta tcgatttccc gccgctgcgg gtggccaatg acgatgtggc ccatggtctt 463981 ccggtgcacc tgacgcgttt tgtggggcgc ggcgcgcaga tcaccgaggt gcaccggttg 464041 gtgaccgata accggttggt gaccctgacc ggcgccggcg gcgtgggcaa gacacggctg 464101 gcggcgcagc tcgcggcgca gatcgccggt gagttcggtc gcgcgtggtt cgtggatctg 464161 gcgccgatca cggaccccga cttggtgccg gtcacggtgg cgggcgcgct gggactgcac 464221 gaccagccgg gccgctccac gacggacacc gtgctgcgct ttcttggcgg gcgtccagcc 464281 ctggtggtgc tggataactg cgagcacctg ctggatgcga cggcggcctt ggtgttagcg 464341 ctggtgaaag cgtgccgggg ggtgaggttg ctggcaactt gtcgtgagcc gctccgggtc 464401 gagggtgagg tgagctaccg ggtgccgtcg ctgtcactga gcgatgaagc cgttgagatg 464461 ttttgctacc gggctcagcg agtccggccg gactttcgcc tcaccgacga caactccgcc 464521 gcagtgaccg agatctgcaa acggctggac ggtttgccgc tggcgatcga gctggcggct 464581 gcgcggctgc ggtcgatgac gcttgacgag atcatcgatg gcttgcgtga ccggttcgcg 464641 ctgttgaccg gcggtgcgcg cacggccgcg caccggcagc agacgctgtg ggcctcggtg 464701 gattggtcgt acacgctatt gaccgagccg gaacgtacct tgtttcgccg gcttgcggtg 464761 tttgtgggtt gcttttttgt cgacgacgca caggcggttg cctgcagcgg cgatgtgcag 464821 cgctaccagg tccttgacga gatcaccctg ctggtcgaca agtcactggt gatggccgac 464881 gacaacagcg gccggacgtg ctatcggtta tgcgagacga tgcgccacta cgcgttggaa 464941 aaactctccg aggctggcga ggtggacgcc gtgtttgcgc ggcaccgtga ctactacacg 465001 gcgctggctg ccagggtcga caatcccgga ccctccgatt attcgcactg cctcgaccaa 465061 gccgaaaccg agatcgacaa cctacgtgcc gcctttgtgt ggaaccggga aaattccgac 465121 accgagggcg ccttggcgct ggcgtcctcc ctgttgcggg tatggatgac gcgggggcgc 465181 atccaggagg ggcgcgcctg gtttgacagc attcttgccg acgagaatgc gcgtcatctc 465241 gaggtggcgg ccgcggtgcg cgcccgggca ttggccgaca aggccctgct cgacatcttc 465301 gtcgacgccg ccgccggtat ggagcaggcc caacaggctt tggtgatcgc gcgcgaggtc 465361 gatgaaccgg cgctgctgtc ccgggcgctc acggcctgcg gcttgatcgc ggtagcggta 465421 gctcgcgccg atgcggccgc gtcttatttc gccgaggcga tcgacctggc acgagcggta 465481 gacgaccggt ggaggctggc ccagatcctt acctttcagg cggtcgatgc ggtcgtggcg 465541 ggtgacccgg tcgcggcacg cccggccgcc caagaggcac gcgagctggc tgccgcgatc 465601 ggtgaccact ccaatgcgct gtggtgccgc tggtgtctcg gctacgccca gctgatgcgg 465661 ggggagctgg ccgcggccgc cgcccaattc ggcgaggtgg tggacgaggc cgaggcgtct 465721 caggaagtgc tgcacaaggc caacagcctg cagggcctgg ccttcgcgct cgcctaccag 465781 ggtgaattga gtgcggctag ggcggcggcc gacgccgctc tcgaggccgc cgagctgggc 465841 gagtacttcg cgggtatggg ctactcggcg ttgaccacgg ccgcgttggc cgccggcgac 465901 gtgcagacgg ctcaacatgc cagcgaggcg gcctggcgga acttgagttt ggcgctgccc 465961 ctctcggcag cggtgcagcg cgcgttcaat gcccaggctg cactggctgg tggtgacctt 466021 agcgcagcgc gtcgttggtg tgacgatgcc gtgcagtcaa tgaccggcca tcatctggcg 466081 atggcgctgg cgactcgcgc caggatcgcg gtcgccgagg gcaagcggga agaagccgaa 466141 cgcgacgcgc ataaggcgct cgcgtgcgcg gccgagagcg gggcacacct ggatctcccc 466201 gacgtgctcg aatgccttgc cggcctggcc agcgacgccg gcacccacca tgcggcggca 466261 cgactcttcg gcgccgccga ggctatccga cagcagatcg gctcggtccg cttcgcgatt 466321 taccgttcgg actatgtgca gtcggtgacg gctctgcgag atgcgatggg ggagaaagac 466381 ttcgacgctg catgggccga aggtgccgcg ttgtcgatca aggagacgat cgcctatgcg 466441 caacgtggcc actcctggcg caaacgaccg gccaccggtt gggaatcgct tactccgacc 466501 gagattgacg tcgtgcgact ggttggcgag ggactggcca acaaggacat cgcgacgcgg 466561 cttttcgtct caccgcgaac agtgcaaacg cacctgacgc acgtctacac caaactcggc 466621 ttcacctcgc gactgcaact cgctcaagcg gccgcccgcc gtacctgagt gctattgatt 466681 ggcgttcggg gacggcggta ccacgatgat ggtcgctccg gggatcgccg ccagggtcgc 466741 cgcaaggttg gcaaccacgc cgggcggcaa accgggcggt aggccaggta tggctgaccc 466801 ggcggccgcc gcggggaccg cgggcgtctg ctgggcggcg ggcctggtgg cagccgggcc 466861 ggctccgccg gctccggccg gggcggctgc agccggggtc ggcgcggagc caccgcgggc 466921 ggcaagaccg gccagaccgg tgccggccaa acccgccgct gccaggccgg cgtaggagcc 466981 gtccgaactc cccgtcatat acatcggagt cgcgccgggg aacatcgccg caacttggcg 467041 caccgctggc ggaagattcc agctgtgcgg cacggagagt ccgccgatgt tagcggagta 467101 gccggtgagc gcggcgaccg gcggctgtgg cgggctggtg acccgtcctc cgacctcagg 467161 gagatccccg tcacccgcgg ccccagcggg tgtctggtcg gattcggctg ggcaatgcgg 467221 cgccttttgg gcctcgtcga ccacgtcgtg gcggtagatc tcgccgagct gcgccgccga 467281 tacccccaaa ctgccagccg cgacggctac agctgcggct acgagaacgt caagatcctc 467341 gatggggttc ggaatggcgt cgaagggcgg cggtaggaag gactgcaggg ttggtagcag 467401 gctcacgtcg gtcgccgcgg cggccggtac taccgcctga gccgctgtcg cggcggcgtc 467461 actcagcgac ccggccccgc tggtggtcgc cggtggcggg tgaacggcgc aactgggacg 467521 cggcccccga ggcgcccgca tagccgtcca tcgccaagat gtcatgggcc cacatttcgc 467581 cgtagtgcgt ttcactagtc gcgatcgccg gggtgttctg tccaaaaaca ttggtctgga 467641 ccagtgacag cattgtgcgg cggttggccg cgattaccgt cgggggcacg gtcgccgcgt 467701 acgccgactc gtaggcgttc gccgcggcca cggcctgagc cgcggcctgc tcggccgagg 467761 cggcggtggc cctcatccac gcgacatagg ggacggccgc ggccgccatc gacagtgccg 467821 acgggcccag ccagtcatcg ccggtgagcc cggaaatcac cgaggagtag gaagccgccg 467881 tcgcggtcag ttcgttggcc agccgttgcc aggctgcggc cgcttgcatc aggggcctgg 467941 agccgggacc ggaatagatt ctggcggagt tgatttccgg tggtagcgca ccgaaatcca 468001 tgactagccg ctcctcacac cggcagcagc ctcagcgctg cgtggctggg tcgtcacgaa 468061 agacacggat tctcctttgc cgaagctgtc cggtccgcgc agggttcgtc gctgccgcga 468121 gccaggcgac tgggcgcata cctattcggg tggcggcaac catgtcggag ccggatggat 468181 ggctaagcgg tcatcaagtt cggatggctt gggttatcag gtcactcagt tgcccccacc 468241 tcctcatagc aaaagtacac aggcagatgt gagcggagtt gcgaaaatag acaaataatt 468301 gagccgagca acgaccgagc gagagggtga gctggtgatc gacggctgga cggaagaaca 468361 gcacgaaccc accgttaggc atgagcgccc agcagctccc caagacgttc ggcgggtgat 468421 gttgctgggt tcggccgaac ccagccggga gctggcgatc gcgttgcagg gcttgggcgc 468481 ggaggtgatc gccgtcgacg gctatgtcgg cgcgcctgcc caccggatag ccgaccagtc 468541 ggtggtggtc accatgaccg atgctgaaga gctgacggcg gtgatccggc ggctgcaacc 468601 ggatttcttg gtgacggtca ccgccgcggt gtctgtggat gctctcgatg ccgtcgagca 468661 agccgacggc gagtgcactg agctggtgcc gaacgcccgt gccgtccggt gcacggccga 468721 ccgggagggc ctgcgccggc tggccgccga tcagctcggc ctgcccacag ccccgttctg 468781 gttcgtcgga tcccttggcg aacttcaagc ggtggccgtc catgctgggt ttccgttgct 468841 ggtgagcccg gtggcagggg tggctggcca gggtagctcg gtggtcgccg ggcccaacga 468901 ggtcgagccc gcctggcagc gcgcggcagg ccatcaagta cagccgcaga ctgggggagt 468961 gagccctcgg gtgtgcgccg agtcggtggt cgagatcgag tttttggtca ccatgatcgt 469021 tgtgtgcagt cagggcccga acgggccgct catcgagttc tgtgcaccta tcggtcatcg 469081 cgacgccgat gccggtgagt tggaatcctg gcaaccgcag aagctgagca cggcggcgct 469141 ggacgcggcc aagtcgatcg ccgcgcgcat cgtcaaggcg ctcgggggac gcggggtttt 469201 cggcgtcgaa ttgatgatca acggcgatga ggtgtatttc gccgatgtca ccgtgtgtcc 469261 tgccgggagt gcctgggtca ccgtgcgcag ccagcggctt tcggtgttcg aactgcaggc 469321 ccgggcgatc ctgggtctgg cggtggacac cctgatgatc tcgccgggtg ccgcgcgggt 469381 gatcaacccg gaccacacgg caggccgggc agcggtcggc gccgcaccac ctgccgatgc 469441 gctgaccggt gcgctcggtg tgccggaaag cgacgtcgtg atattcggcc gcgggcttgg 469501 ggtggcgctg gccaccgcac ccgaggtggc aatcgcccgc gaacgcgccc gcgaagttgc 469561 atctcggcta aatgtgccag actcacgcga gtgagctacg ccggagatat cacgccactt 469621 caggcctggg agatgctcag cgataatccg cgggcggtcc tggtcgacgt gcgctgcgag 469681 gcggaatggc gcttcgtcgg tgtgcccgac ttgtcgagcc ttggtcgtga agtggtctat 469741 gtcgaatggg cgacgtccga cgggacgcac aacgacaact tcctcgccga gttgcgggac 469801 cgcatcccgg cggacgctga tcagcacgag cggcccgtta ttttcttgtg tcgctccggt 469861 aaccgctcca tcggcgcggc cgaggtcgcg accgaggcgg gcatcacgcc ggcctataac 469921 gtgctggacg gcttcgaagg gcatctcgac gctgagggtc atcgaggcgc aacgggctgg 469981 cgggcggtgg gactgccgtg gagacaggga tgaccgacga gtcttcggtc cgcaccccga 470041 aggcgctgcc cgacggcgtc agccaggcca ccgtcggggt gcgcggcggg atgttgcggt 470101 cggggttcga agagaccgcc gaggcgatgt acctgacgtc cggatatgtc tacggctcgg 470161 cggcggttgc cgagaagtcg ttcgctggcg agctggacca ctatgtgtac tcccgctacg 470221 gcaacccaac ggtgtcggtg ttcgaggagc ggctgcggct gatcgagggt gccccggcgg 470281 cgttcgccac cgccagtggc atggccgcgg tattcacctc gctgggcgcg ctgctgggtg 470341 ccggagaccg actggttgcc gcgcgcagcc tgtttggctc gtgtttcgtg gtgtgcagcg 470401 agatcctgcc gcgctggggg gtgcagaccg tcttcgtcga cggtgacgac ctctcgcaat 470461 gggagcgggc gctttcggta cccacgcagg ccgtgttctt cgagacgccg tccaatccca 470521 tgcagtcgct ggtggatatc gctgcggtga ccgagctggc acatgccgcg ggtgcaaaag 470581 tggtgctgga caacgtattt gccacaccgc tactgcagca gggctttccg ctgggggtcg 470641 acgtggtggt gtactcgggc accaagcaca tcgacggtca gggtcgggtg ctgggcgggg 470701 ccatactcgg tgaccgggag tacatcgacg gtccggtgca aaagctgatg cgccacaccg 470761 gtccggcgat gagtgcgttc aacgcctggg tactgttgaa aggccttgag acgctggcta 470821 ttcgggtgca acacagcaat gcctcggcgc agcggatcgc ggagttcctc aacggccatc 470881 cctcggttcg gtgggtgcgt tacccgtacc tgccgtcgca cccacaatat gacctggcca 470941 agcgtcagat gtccggtggc ggaaccgtcg ttaccttcgc actcgactgc ccggaggatg 471001 ttgccaaaca gcgggccttc gaggtgctcg acaagatgcg gctgatcgac atctccaaca 471061 acctcggcga cgccaaatcg cttgtcaccc accccgccac cacgacgcac cgggcgatgg 471121 gcccggaggg ccgggccgcg atcgggctcg gtgacggtgt ggtccgcatc tcggttgggt 471181 tggaagacac cgacgacctg attgccgata tcgatcgggc gttgagctaa cccgctgcct 471241 cttgctcggc gtgctcggcc tgttcggcgg ctgccagcgc tccttgtgcc tgctgttcca 471301 tcaaggtcat cactaacctg gcgtagatca tctggctggt gatggccatc tggccgcggg 471361 cgcggcccat gaaggagatc ccccaggcga acagggctgc gatgcggttc cgatagccga 471421 ccaggtagac caggtgcagc accagccacg ccagccaggc gaagtacccg gcaaactcca 471481 gcttgccgac ctgcgcgacg gcgctgtggc gggagatcgt cgccatgctg cccttgttga 471541 agtaatggaa cggcttgcga ttggctgggt cgtcattgcc cttgaccatg tgtttgatca 471601 ccgtggtggc gtatcgggcc ccctggatcg cgccctgagc caccccgggt acgccgggca 471661 cgaacatcag atcgccgact acgaagacgt tcggatgtcc cttgacggtg agatcgggtt 471721 ccacgatcac ccttccggcc cggtcgattt cggttccgtc ggatccctcg gcgatcatct 471781 tgcccagcgg gctggccgcc acgccggccg cccaaacctt gcacgcgcat tcgatgcggc 471841 gttcgccgcc gtccttttcc ttgatggtga tgcctttgta gtcgaccgcg gtcaccatcg 471901 cgttgagttg aacctcgacg tccatctttt ccagccgccg ttgtgccttg agacccagct 471961 ttggacccat cggcggcaac accgcgggtg cggcgtcgag caggatcacc cggcactcac 472021 tgggcgtgat ggtcctaaac gcgcctgcca gggtgcgctc ggcgagctcg acgatctgcc 472081 cagccacctc gacgccggtc ggcccagcgc cgacgacgac gaacgtcagg cgccgctccc 472141 gttcggcatg gtcggtgctg acctcggcgg cctcgaacgc gcccaggatg cggccgcgca 472201 gctccagcgc gtcgtcgatg gtcttcattc cgggcgcgaa ggtggcgaat tcgtcgttgc 472261 cgaagtagga ctgctgtgcg ccggcggcca cgatgaggct gtcgtacggc gtcaccgtgg 472321 tcatgtccat caatttcgac gtgaccgtct gcgctttcag gtcgatcgcg ttgacctcgc 472381 ccagcaacac ccggacgttc ttttgccggc gcaggatcag ccgggtggtc ggggcaatgt 472441 cgccctcgga caagatcccg gtggccactt gatacagcag cggctggaac aggtgggtcg 472501 ttgtcttgga gatcagcgtg atgtcgacat ccgcccgttt aagcgccttg gccgcattca 472561 ggccgccgaa tccactaccg atgatgacca cgcgatggcg cccgccgacg gccgagggtt 472621 caccagatga gagcgtcatg gtcctccttc agtctggtcg ctgtggcgca gctacacagt 472681 acgactcccg tcatgccaac ggcgtaactt tttgtgggcc ttgtgggcct tgtgggcctt 472741 gtgggccttt gtcgggccgc cttcggatcg gacgctcggg atggctgttg ggcgctgcgc 472801 aatcccgcgc ttcgatcagg cagcgtccgg cagtgccatc aatggcggcc aggtacacct 472861 ctccgacggc tcgacatcgc cggcccggca gttacctgca ccatggccgg gcgatgcggg 472921 agcggctgcc gaaggtcggg caggtgtttg ctgccgggga aatcgactac cacatgtttc 472981 agacgttggt gtatcgcacc gatttgatca ccgacccgca ggtgttggcg cgggtggatg 473041 ccgagctggc gctgcgggtg cggggctggc cgtcgatgac ccggggcagc tggccgccgc 473101 gatagatcgg atcgtggcgg tggccgaccc cgatgcggtg cgccaggtgc gggagcgggc 473161 ccgcgatcgg gaggtgtcga tctggaattc cgcggacggc atgggcgagg tgtacgccca 473221 gttgtatgcc accgacgccc aagccctgga tgcgcggctg aacgccttgg tggccacggt 473281 gtgtgccggt gatccgcgca gcacagatca gcgccgcgcc gacgcgctgg gcgcgttggc 473341 ggccggggcg gatcggctgg cctgccgctg cgacaatccc gactgtgccg ccgaggggcg 473401 cccggtgtcg gcggtggtga ttcatgtggt ggccgagcag gccagcgtca agggccacgg 473461 ccaggcgccg gcagcgttgc tgggcggcga cgggctgatc ccggccgagc tggtggccga 473521 gttggccaag accgccgggc tgcagccgat cccggtcccg gccgggaccg agccgggtta 473581 tcggccctcg gtgaagctgg cggcgtttgt gcgggcccgg gatctgacct gtcgggcgcc 473641 cggttgcgac cgcccggcca cccagtgcga cctggatcac accatcgcgt tcgccgacgg 473701 tggggccacc cacgcggcca acctcaaatg cctgtgccgt cttcatcatt tgctggccac 473761 cttctgtggc tggcgcgccc agcaactgcc cgacggcacg gtgatttgga cgctgccggg 473821 taaccagacc tacgtcacca ccccgggcag cgcgctgctg ttcccggcgc tgtgcacccc 473881 caccggtgac ccgcccgcac ccgagccggc ccgcgccgac cgccgcgggc agcgcaccgc 473941 gatgatgccg cgccgggcca gcacccgcac ccaaaaccgc gcccattgca tcgccgccga 474001 acgccaccgc aaccaccaag cccgccggat tgcccaagcg gccgtcatcg ccaccgagac 474061 ccacggccca ccacccgatc ccgacgacga cccgccgcct ttttgatgaa gtgagtccga 474121 atcatctcga cgtggacggg tgcggcgtcg ggtggtcgcc ggttggcgca gaccctccag 474181 aggggaggat gaggagctcg gcacctgcgt cggcggccct gagataggcc agcaggcggt 474241 ggccgaagtc gctgacgtcg tggataagga tgtggccgct ttcttctgcc ggagtcccgg 474301 tgccgttgcc gtgccaaacg gttgttgtcg cgatcgcgac gccggtttga atgagggccg 474361 ctagcaccgg cacgggctcg accttactcg ccgccctcat cacctcgcgt cgctggatct 474421 cgtcctggtc cggtgaggac ttggcggctt tggccagccg aacgagtgca tggatatgca 474481 cgggctcaag ttgggaaagc gtggccacga tgagtgatgc cggctcgacc ttctggtcat 474541 cctcgagcgc ggcggcggca gcttgcgcga ggagccggcg cttggcctcc atactggtgc 474601 gagtggcggc ctcgatcgcc tggctgagaa gcggctcgag ttcgggattt ttgtcaatgc 474661 ggctcaacac ggtgtccgcg ccgccgacgc tctcgcatat ctcgcgcgtg gttgtctcgg 474721 cgcggtgccg ggtgcgttcc tcgatggcgt cgaacacggt ttgtagcggg ccgccgacca 474781 tcgggatggc ggataggccg gcgctgatca cgacagcgaa gacaggtctg ggctcagtca 474841 tagctcgaac agtagaggcc gtcgcggcaa ggacggccga cggcgtgttt tcggcgttgc 474901 ggggtggtcg ccggacacga ggaggcagac cgaggctcga tggattggat gccgctcggc 474961 gactacgaga ctttccggca ttggtcgggg aagccccgcg catgggggcc gcaagagtcg 475021 gggtggcgcg cgtggttcgg cgggaagata gtcgatgggc tctgcgaggt actcgacgag 475081 cacctcgcgg tgcggcgtcg tggtgttcca gccgcgatcg gctgcgtgcc ctggctgagt 475141 agcgaggcgg tcgccgagac gctgctcgca ttgagcgtct tttgcgtggt gatcgacaag 475201 ggaacctcgt tcccgtcgcg actgcgtaac cctgacaaag ggtttcccaa cgtcgcccta 475261 ttgcggcttc gcgacatggc gccctccgag catggctcac gctgctcctc ggcccgtggt 475321 cgtctatgcc tgagcatgag ctaggtccgg tgcgggcgct cggctggcta cgagaggacc 475381 gcaagccgct gctgaatgcc aaattgctcg tgctcggtca tctggctttg aacgtctacg 475441 accccgataa cggttacggc gaagaggtgt tggactttga gccgcggacg gtgtggtggg 475501 gatcggccaa ttggaccgtg cgggccgggt cacacttgga agttggcttt gcatgcgacg 475561 acccaaccct cgtcgaagaa gctacagcgt ttgtcgctga cgtgatcgcg ttctccgaac 475621 cgatcgacac gacctgtgcc ggtcccgaac cgaacctcgt gcaggtggag ttcgacgacg 475681 ccgcgatggc tgaggcgatg gaggagatgg ccgagcccga tgatgacggg gaggattggt 475741 agcgatgctg cttgatgaac ccaaaggtcg tcacgggcac gctcaaaacg ttcttcgtgt 475801 aatcggtgcc atcatttgct ggccaccttc tggggctggc gcgcccagca actgcccgac 475861 ggcaccgtga tttggacgct gccgggtgac cagacctatg tcaccacccc gggcagcgcg 475921 ctgctgttcc cggcgctgtg cacccccacc ggtgacccac ctcgacccga cccggcccgc 475981 gccgaccgcc gcgggcagcg caccgcgatg atgccgcgcc gggccagcac ccgagcgcaa 476041 aaccgcgccc actacatcgc cgccgaacgc caccgcaacc accaagcccg ccggattgcc 476101 cacgtggtca cccaaaccgc cacaaccgcc cccgagacta acggcccacc acccgatccc 476161 gacgacgacc cgccgccctt ctaaccggta ggcgcctgcc caaaacacgg gtattgggta 476221 aaggcacggg gtcctgatgt tgttgtattt caatgcgatt cagctaaggc ccggagccca 476281 tggctcgtcc ggatggtcgg ttggggtgat gtgtatgccc ctcctgctcc atcccgtttc 476341 cttgtatcct caagtttgtc gtttggcgct gttgcgacag gaaggcgtcg atcatgcacg 476401 cactgaggtt ggtcggcttg gcgatattga cggcgatcgc tccaatcgcg gtcctcatcg 476461 gaagtagccc agcgcatgcc gataccgata ttggtcaacc gtgctcgccg gaaggcgcga 476521 aactctgggg gaaccccggc ccgatatatt gcgagcgcac ggcggacggg caactgcaat 476581 gggtatcaat tcctgcttgg gcattgtgtg tggcgttctg cgaccggcct ggcgggccat 476641 aggggcccac cagcggaccc ccacggtccg ccggcctgct agcccggcca tgagctcgcg 476701 gtggttcggt agttcgcgtt gggcgcactg cagaagtccg aggccgtgcc ggccagcaag 476761 acgaaatagc cctcttcgcc gcggcgggtg tcggcgaacg gcgggtaggt ccaaccgttg 476821 ttgcacatcg aagtcggcgt gcccgccgtg gccttgcgcc aaccggactt gacgcaggcc 476881 ttgagggcgt cgacgtcgga aacggtgctc ttggcgacga ccagcatcgc gcctccccat 476941 tgcccgtcgg ttcgccggta ggcgcgcgcc ccgaagccgg cccccgagcc ggctgtgttc 477001 tgttggcagc ccaacacccg ctgcagcggc aagaacaccg acgccacatc gcggtccgcg 477061 acagcggctg cggccgtgat gaattgagcc gcgtccgagc cgtcaacctc ctggacggtg 477121 ggatctccga agctaaacag ctgctgtgag atccgctgtc gcagcgcggc gtcaccgtcg 477181 ccgatgaccg gtccgctgcc gctggatgtc atcgggggca gcgcgccggt gggttccgcg 477241 cccgcggtgg gcgcggcggc cagcacggcc agggacaaac cgcacgcggc gacaccgaca 477301 acgcgggcaa tgactcccat ggctacctac ctccccggcg gcatgggtgg ggcgtcgttc 477361 ggtgctacct cggcaccgat cttgcgaaat agtatgtcgg cctggttgcg gtagttgccc 477421 tggtcatcga aggcctcggg ggcgtaggtg actgcgaccg ccaccgcgac ccgttgcgac 477481 ggcagatagg cctccaccgc ggcgtaaccg gcgaacatgg gattttgcag cagccaatgg 477541 ccggatatga cgatcccgag accatagctg tagccgtcgt tctgctcgaa gcaggtgggg 477601 cagcccggct gggcgcgggt cttgccgcgc agctcggtcg acaccatctt cttgtacgaa 477661 tccgccgaga gcagcctgcc cgacccgatc cccaccgcgg tggcctccat gtcgtagatg 477721 gtggtggttt ggatggcgcc gtgggtgatg gtccacgacg gattccagaa ggtcgattcc 477781 tcgtaaaacg gcacgccggc aggaattttc aaggccgctc ggcgctcgga ggtgaatgca 477841 tgcaaggcgg gctcggggat ggcgggggta tcggagttgg cggtggccgt gaggcccagg 477901 ggggaaagga ccttgcgctg cagcagggtt ggcatgtctt ggccggcggc cttctccaac 477961 gccagcccca gcaagaggta attggtgtgc gcgtagttcc agttggtgcc cgggtcgtaa 478021 agcagtggcc gtgaagagat ttgatcgagt aactcttgtg ttgtccactg ccggaacgga 478081 ttagcgtaaa gctcggcatc aaacgcctcg ttgccgagga cgtagtcggg gtagccggat 478141 gtcatctgcg ctagttgacc cagcgtgacc cggtcggcgt gcggaaagtc gggaagccac 478201 ctggacagct tgtcgtccag gcgcagcttt ttttcgtcga ccagtttgag caacagcgtc 478261 gcgacatagg agattgcgac cgcgccgttg cgaaagtgca tggcggtggt ggccggcacg 478321 ccggtcatcg agtcgccgac ggcccgcgtc acgacctcct tgccggccac ggtgacccgg 478381 accagcaccg ccttcagatg cgcttgcgtc atgaagtcac gcacaatccg gatgaccgcg 478441 tcggccttgg ccccgttgtt ggtcggcgac gaagccggcc cggtgcgggg tggggcgcag 478501 ccggccagca gcccgagagc caggaccgaa cacccgaggc gccgcaagac gggcatgcga 478561 cggtcctacc ggaaggcggc caagcccgtg aaggcctgac cgagcaccag ctgatgcatc 478621 tcgggcgtgc cctcgtaggt gagcaccgac tccaggttga ccatgtgccg gatgaccggg 478681 tactccagcg atatcccgtt gccgcccagt attgttcgag cggtccggca gattttgagc 478741 gcttcccggg tgttgttgag cttgccgaag ctgacctgat cggggcgcag gcccaccctg 478801 tctttgaggc gccccagatg caacgacagc agctgaccct tgtgcagttc cacggccatg 478861 tcgacgagct tggcctgggt cagctggaag ccggcgatcg gacgtccgaa ctgggtgcgc 478921 tgtctcgcgt agtcgagcgc gcactgccag gccgacctgg ccgcgcccat cgctccccag 478981 acgatcccgt agcgcgcctc cgacaggcat gccagcggcg ccctgaggcc ggtcgcgccg 479041 ggcagcatgg cgtcggcggg cagccggaca ttgtcgagca ccagctcgct ggtgatcgac 479101 gcccgcagcg acagcttgtg accgatggtg ttggcggtga aacccggggt gtcggtgggc 479161 acgatgaatc cgcggattcc gtcgtcggtg gcggcccaca cgatcgccac gtcggcgacc 479221 gagccgttgg tgatccacat cttgcccccg gtgatcaccc agtccggacc atcgcgtcgc 479281 gcccgggttt tcatcgcggc cgggtcggag ccgacgtcgg gctcggtgag cccgaagcag 479341 ccgagcaggt caccggtggc catgccgggc agccactgcc gcttttgctc gtcggagcca 479401 aagctcgcga tggcgaacat cgccagcgaa ccctgtaccg acaccagcga ccggatgccg 479461 gagtcggcgg cctccagctc ccggcaggcc aggccatagt gcaccgccga cgcgccgcca 479521 cagccgtggc cgtgcagctg cattcccagc agtccgagtt cgccgaactg tttggccaaa 479581 tcgcgcgcga ccggtaggtc gccgtcctcg aaccacgccg cgacgtgcgg ggtgacgtgt 479641 tcggcgcaga accgcctgac ggtgtcgcgg acggcgatct cgtcgctgga tagcgacgcg 479701 tccagtccca gcgggtcgtc gcggtcaagg gcgggtggtg tcggggtgct catcactcaa 479761 tactgccccg gcccggtagc ctcgcggcat gcgaccacgg cgcgcgctgg cggggctggc 479821 cgccgacgtc gtcgccgtgc tggtgttctg cgcggtggga cgtcgcagcc acgccgaagg 479881 actgagcgtc accggcctgg cggctacggc atggccattt ctcaccggca ctggtatcgg 479941 ttgggtgctg gctcgcggct ggcggcggcc gaccgccctc gcccccacgg gggtgatcgt 480001 gtggctgtgc accatcgtgg tcggcatggt gttacgcaag gtcagttcgg cgggtgtggc 480061 cgcgagtttc gtcgtggtcg cgtccgcggt caccgcggtg ctgctgctgg gttggagagc 480121 cgccgttgcg ctgatggcac cgcaccgcgc ggacggctga gaaggccaaa tgtcgtcggg 480181 gtgttcgccg accccgggat ttccgacgtc cgcctccgtg ccctcgaagt ctcagtaccg 480241 agccagattt cacggtcgag accccaacca acaggtcagc gcggtgccac cgcgatcgtg 480301 atgttggcgc aggtatgggc cgcgacctgt cgagctacga cccgggccgt gccgctactg 480361 cagcagcgct gcgcgttccg cactgagcgc agtggtgcga cgaggcccga gcacctgggg 480421 ttgtcgggct agccgatcca cccgacgtgg ccaccagaac cagcggccga gcagagttgc 480481 cagcgcaggc gtcatgtaag accggacaac gagggtgtcg aacagcaatc cgatcatgat 480541 cgtggtgccg atttggccga cgacgcgtag atcgctggcc accatcgagc ccatggtgaa 480601 ggcaaacacc agtccggcga tggtgaccac tcggccggtg ccggccatgg ctcggatcat 480661 gccggttttg aggccggccc cgatttcttc ttggaatcgg gctatcaaga gcaggttgta 480721 gtcggatccg acggccaaca tgacaatgat ggccatgggc agcacgagcc agtgcagtgg 480781 catatgcagg atgtgctgcc agatgaggac cgacaatccg aaggctgagc ccagcgaaag 480841 ggcgaccgtg ccgacgatga cggcggatgc gaccacgctt ctggtgatgc cgagcatgat 480901 gatgaagatc aggcaaagcg acgccacgac ggcgatcatg acgtcataca gggtgccctc 480961 gtggatgtct ttgtaggtgg atgaggtgcc cgccagatag atgctggcag cctgtagcgg 481021 ggttcctttc acggcttcgt cggcggcctg catgatgggg tcgatgtgtg agatgccttc 481081 agcgctcgcg ggatcacccc gatgggtgat gacgaatcga gcgcaggtgc catcggggga 481141 taggaagagt ttcagacccc gctggaagtc ggggttttgg aaggcctccg gtggaaggta 481201 gaacgagtcg tcgttgttgg cggcgtcgaa tgttcgaccc atgacggtgg cgttgcgggt 481261 catgtcttcc atttgagtga ccagtccgga gaacgcgctg gtcagtgttt gggcaaggtc 481321 tttgacggtt tgcatggtgg cgatcgtggg gtccagttgg gcgagtagtt gccgctgtgt 481381 ggtgtccatg cgttcggtgt cgtcggtgag gttggcaagg tcctcggtga gcttgtcgac 481441 gttatccatg ctgttcaaca aggagcgcat cgaccagcag atgggaatgt cgaagcagtg 481501 gcgctcccag tacgtgaaac ttctgagggg gcgccagaag tcgtcgaaat cggcgattcg 481561 atcgcgtagt tcgttggcgt tgtctcgcat ctgcctggta tgagcgttca tatcatgggt 481621 ggcatcggtt agctgtcgcg tcagctcctg ggttcgctgg gtgatgtcga tcatgcgttg 481681 cagttgatcg gtaagggtgg atagatcagc cacgcggtcc ttgaggttct gcaggttttc 481741 gatggtcatc gtgctttgca tgccgagctg aaacgggatc gacgagtggt cgatcggagc 481801 ccccaacggt ctggtaatgc tttgcacccg cgcgatcccc ggcgtatgga agacggtttt 481861 ggcgatcctg tccaggatga gcatgtcggt cgggttacgc aggtcgtgat cggcctcgac 481921 catcaggacc tccggttcca tgcgggcttg cggaaagtga cggtctgatg cgaggtaacc 481981 gatgttggat ggcgccgcgc tggggatgta gtagcgctcg ttgtagttgg tctggtattt 482041 cggcaaggcg agcagtccga tcagcgctat cagcagggtg gcggccaaga cggggccggg 482101 ccatcgcacg acgaccgtgc cgatccggcg ccaacgccgt ttcgttgtcg ctcgtttggg 482161 gtcgaatagc ccgaatcggc tggcaacggc gatgatcgcg ggcgccagcg tcagagacgc 482221 caacatcacc gtgaccaaac cgatcgcgca tggcgatgcg agggtattga agtagggtag 482281 ccgggtaaag ccgaggcagt acatggcgcc ggcgacggtg aggccagatg ccaggaccac 482341 gtgtgccgtc ccaccaaaca tggtgtagta agcggcttct cggttctggc cagtcgcacg 482401 tgcctcttga tagcgtccga cgagaaagat gatgtagtcc gtcgaagccg cgatcgtgag 482461 cgccaccaac acgttgacag tgaatgtaga caggcccatg aggtcgttga cggcaaaagt 482521 ggagatgatg ccgcggaccg ccagcagctc gagcccgacc gtcagcagca tgatcagagc 482581 agcagaaagc gagcggtagg cgatgaacaa catgatcgcg atcaccgcaa tgctgatgcc 482641 ggtaatcgtg tgaaggctgc ggtcgccgta tacaactcga tcggcgccga gtggacccgg 482701 gcctgtcacg taagccttga tccccggcgg cggtggcacg ctgtccacga tgcgttgcac 482761 ggcggcgaca gactcgttgg cctgcgagcc gccctgatca ccagtgaggt tcagctggac 482821 atatgctgcc ttgccgtcag cgctctgcga tcccgccgcg gtcagcggat cgccccagaa 482881 gttctcaatg tgttggacgt gggtggtgtc ttgtgacagc ttggtcacta gcacgtcata 482941 gaagcggtgc gcctcatcac ccagcttctc ttggccctcc agcagcacca ttgcagtggt 483001 gtcagaatcg aattgctgaa agtccttgcc gatgcgcttc atggcgatca gtgacggagc 483061 atcgtggggg cctaacgcca ccgaatgtgt cctagcgacc gactgtagct gcggcgcaac 483121 gacgttcacc acaatagtca gcgccaccca gaacagaatg atgggtagcg acaacgcatg 483181 gatcgtccgg gcggcggccg acaggtgccc ggctagacgt tggctcctca cgcggatttc 483241 accaggcaac tggtgtgcgc gtggtaagca ttcacaatgc gctcctcgcg gatcacctcg 483301 ttgacagtga tgcgacagcc caggcttgca ccgtcaccgc gggcaaccac gttggcgact 483361 acggcggtca aggtggtcac gatggtaaat gaccacggga ccgcggcatt gacgacctca 483421 tgcggctggg catcggcatc caggtaattg atgctggcga ccgtccctgg cgggccgaag 483481 acctcgtaga gaacatgctt cgggtaaaac gcgatgatcg ggtcgaggtt gccggtgtcg 483541 ggcgcatgtt gatgtgagcc aaacaccgag tgcagccgcg agaccgtcac ggccgcgaca 483601 gccacaacga tgactatcac catcgggatc cagaagcgtt tggcaacgcc gaacatttac 483661 cttcctgatt ccatcgcttc aacaagccgc cgcgtgagga cgaaccctac cggggagacg 483721 ccactcgttg gggcagtttt gtacactccg tttacatcgt ttacggcgag gtcaaaaaat 483781 ttcggttaat cgtacaggct gccgctcggt catctatagt catcgatcca gagccgcttc 483841 gaccagcctg tggtcgaagc ggatcagttg aaccggagga gtggaaacat gagcggcccg 483901 acgggaaatt cgatgcccag acagctcggc ggcctggtgg ccaggatcgt taccgggtaa 483961 gggatcgcca ctcccaatgt ctgttatatc cacgttgcgc gaccgtgcga ccacgactcc 484021 aagcgacgaa gcctttgtgt tcatggatta cgacacaaaa accggcgacc aaattgaccg 484081 aatgacgtgg agtcaattat attctcgcgt caccgccgtg tctgcgtatc taataagtta 484141 tggccggcat gctgaccgac gaaggaccgc agcgatatca gctccgcaag gtctggacta 484201 tgttgcagga tttctaggag cactgtgcgc cggatggacg ccggttccgt taccagaacc 484261 gctgggcagc ctacgcgata agcggactgg actggctgta ctcgactgtg ccgccgacgt 484321 cgtgctgacg acgtcgcaag ccgaaacgcg ggtcagggcc acgatagcta cacatggggc 484381 gtctgtaact acgccggtca tagcgttgga tacattggac gagccatccg gagataactg 484441 tgatctcgat tctcaactat cagactggag ttcgtatttg cagtatactt cgggttcaac 484501 ggccaacccc cgtggtgtgg ttttatccat gcgtaacgtt acggaaaatg tcgaccaaat 484561 tatccgtaac tattttcgcc atgagggcgg cgcgccgagg ttgcccagct cggtcgtttc 484621 gtggttgccg ctttaccatg acatgggttt aatggttggc ctctttattc cgttgtttgt 484681 cggatgtccg gttatcctga cgagcccaga ggcatttatc cgtaagcctg ccagatggat 484741 gcaactgctt gctaaacacc aggcgccatt ttcggccgcg ccgaacttcg cattcgattt 484801 ggccgtcgct aaaacttccg aagaggacat ggcggggctg gatttaggcc acgtaaatac 484861 aataatcaac ggcgcggagc aggtacagcc aaatacaata accaaattcc tccgccggtt 484921 ccgtccctac aatttgatgc ccgcagcggt caagccatca tacgggatgg ctgaagcggt 484981 ggtttacctg gcgacgacga aggcgggatc acctccaacg tcaaccgagt tcgatgctga 485041 tagcttggct cgaggccacg cggagctaag tactttcgaa actgagcgtg caacgcgttt 485101 aatacgctac cacagcgacg acaaggaacc gttgcttcgg attgtcgatc cggactcgaa 485161 tatcgagctc ggaccgggac gtatcggcga gatttggatt cacggtaaga atgtgtctac 485221 cggatatcac aatgcagacg acgcgctcaa tcgagataag ttccaggcca gcatccggga 485281 ggcctctgcg ggaacgccaa ggtcgccgtg gcttcgcacg ggagacttgg gattcatagt 485341 aggagatgag ttctacatcg tcggccgtat gaaagatctc attatccaag acggtgtaaa 485401 ccattatccc gatgatatcg aaactacggt caaggagttt accggtggcc gggtcgcggc 485461 attttcagta tccgacgacg gggtggagca tttggtcatt gcggccgagg taaggactga 485521 gcatgggccc gataaagtga ctattatgga tttctcgacg atcaaaaggc tggtcgtatc 485581 ggcgttgtcg aaattacatg gcctgcatgt aacagatttt cttctggtac cgcccggggc 485641 gctaccgaag accaccagcg gaaagattag ccgggcggca tgcgcaaagc agtacggagc 485701 aaataagttg caacgagtag caacgttccc atgacagacg gttcggtcac tgcggataag 485761 cttcaaaaat ggtttcgaga gtacttgtcc acgcatatcg agtgtcatcc aaatgaggtc 485821 agcctagacg ttccgattag agatttaggt ttgaaatcga ttgatgtctt agcgattccc 485881 ggcgacctcg gtgacagatt tgggttttgt attcccgatt tggccgtttg ggataatcct 485941 agcgctaatg atttgattga tagtctgttg aaccagcgta gtgctgactc gttaagagag 486001 agtcatggac acgccgacag gaacacgcag ggtcggggca gcataaacga gccggttgcg 486061 gtcatcggag tgggctgtcg atttccggga gatattgacg gcccggaacg gctatgggac 486121 tttctgaccg agaagaagtg tgcgataaca gcgtatccag atcgtgggtt cacgaatgct 486181 ggaactttcg cggagtccgg aggcttttta aaggatgtcg cgggtttcga taatagattt 486241 tttgatatcc cgccggacga ggctctgcga atggatccgc aacaacggtt gttactggag 486301 gtctcttggg aagcgttaga gcatgcagga attattcctg agtcattaag actttcacgt 486361 acgggcgtat tcgttggggt gtcgtcaact gactacgtcc ggcttgtgtc agctagcgct 486421 cagcaaaagt ctactatttg ggataacacc ggcggttctt cgagtattat tgccaataga 486481 atctcatact ttctcgatat tcagggtccg tccattgtca ttgacacggc atgctcgtca 486541 tccctggtcg ccgtgcatct agcctgtcga agtctcagta cctgggactg cgatatcgca 486601 cttgtcggtg ggacgaatgt tcttatttca ccagaaccat ggggtgggtt tagggaagcg 486661 ggcatcttgt cgcagacagg ctgctgtcac gcgttcgata aatccgccga cgggatggta 486721 cgcggtgagg gatgcggagt tatcgtgctg cagcgcctca gtgatgcacg ccttgagggc 486781 cggcggatat tagcgattct gacgggttca gcggtcaatc aggacggtaa gtccaacggt 486841 attatggcgc caaatcctag tgcgcaaatt ggtgttcttg aaaatgcatg caagagcgct 486901 cgcgtcgatc cgctggaaat cggctacgtc gaggcccacg ggaccggaac gtcgttaggg 486961 gataggatcg aggcgcacgc cttaggcatg gtctttggtc gcaagagacc gggatctggg 487021 cccctgatga tcgggagcat caagccgaat atcggccatc tggaaggtgc ggctggcatc 487081 gccggattga tcaaggcggt gttgatggtt gagcgtggct cgctgcttcc gagcgggggg 487141 tttacggagc caaatccagc tatcccattc acggaattgg gcctgagagt tgtagacgaa 487201 cttcaggagt ggccggtggt ggcgggtcgg ccgcgccggg ctggggtgtc atcgttcggc 487261 tttggcggca ccaatgcgca tgtgattgtc gaggaagctg gttcggttgg ggcggacacg 487321 gtttcgggcc gcgcggatgt tggcggttcc ggtggtgggg tggtggcgtg ggtgatttcg 487381 gggaagacgg cttcggcgtt ggctgctcag gcgggtcggt tggggcggta tgtgcgggct 487441 cggccggcgc ttgatgttgt tgatgtgggg tattcgttgg tgagcacgcg gtcggtgttt 487501 gatcatcggg cggtggtggt cggccagact cgcgatgagt tgctggctgg gttggctggg 487561 gtggttgctg gtcggccgga ggctggggtg gtctgcggtg ttggcaagcc ggcgggcaag 487621 acggcttttg tgtttgccgg tcagggctcg cagtggctgg gtatgggtag cgagctttat 487681 gctgcctacc cggttttcgc cgaggccctc gatgctgtgg tggacgagtt ggaccggcac 487741 ctgcggtatc cgctgcgcga tgtgatctgg gggcacgacc aagatctgtt gaataccacc 487801 gaattcgccc agccggcgct gtttgcggtg gaggtggcgc tgtatcggct gctcatgtcg 487861 tggggggtgc ggccgggttt ggtgctgggt cattcggtgg gcgagttggc cgcggcgcac 487921 gtcgccgggg cgctgtgttt gccggatgcg gcgatgctgg tggccgcgcg tggacggttg 487981 atgcaggcgt tgcccgccgg cggcgccatg tttgcggtgc aggcccgtga agacgaggta 488041 gcgccgatgc tggggcacga tgtgagcatc gcggcggtca atggtccggc ttcggtggtg 488101 atctctggtg cccacgatgc ggtgagcgcg atcgctgatc ggctgcgcgg ccagggccgt 488161 cgggtccacc ggttggcggt ctcgcatgcc tttcactcgg cgttgatgga gccgatgatc 488221 gctgagttca cagccgttgc ggccgaactg tctgtgggct tgcccacgat cccggtcatt 488281 tccaatgtga ccgggcagtt ggtggccgac gacttcgcct cagctgatta ctgggcccgg 488341 catatccggg cggtggtgcg gtttggcgac agtgttcgta gtgcccactg cgccggtgcc 488401 agtcgtttca tcgaagtcgg gcccggtggc ggcttgacgt cgttgatcga ggcatcgctg 488461 gccgacgcgc agatcgtgtc ggtgcccacg ctgcgcaaag atcggcccga accggtcagt 488521 gtgatgacgg cggcggccca gggcttcgtc tcggggatgg gcctggattg ggcctcggtg 488581 ttttccgggt accggcccaa gcgggtggag ttgccgacgt atgccttcca gcatcaaaag 488641 ttctggctcg caccagcccc atcggtcagc gaccccaccg ccgccggcca gatcggggct 488701 agcgatggtg gtgctgaact cttggcgtcc tccgggtttg ccgcccggct ggccggtcgg 488761 tcggccgacg agcaactcgc cgcagcgatc gaggtggtat gtgagcatgc cgcagcggtg 488821 ctggggcgcg acggcgctgc cggactcgac gctggccagg cgtttgccga ttcgggattt 488881 aattccttga gtgccgtgga gctacgtaac cgcttaacag ccgtcaccgc agtaacgctg 488941 ccggccaccg cgatcttcga tcaccccacc ccgaccgaac tagcccagta tctgatcacc 489001 caaatagacg gtcacggcag ctccgccgcc gcagcggcaa acccggcgga gcgaatcgat 489061 gcgctcaccg atctttttct acaagcttgc gatgcgggtc gggatgccga tggttggaag 489121 atggtcgccc tggcgtcgaa tacgcgcgag cgcatgagct caccggttcg gaacaacgta 489181 tcgaagaacg tcgcactgct ggcagatggt atctccgatg tggttgtaat ttgtatccca 489241 actctaactg tgctatcgga tcagcgtgaa tatcgagata ttgcgaatgc gatgacaggc 489301 cgccattcgg tttattcgct tacgcttccc gggttcgatt cgtctgatgc actgccgcaa 489361 aacgcggata tgattgttga aaccgtatct aacgcaatta ttgatgtggt aggcggcagc 489421 tgccgttttg tgctgtcggg ctattcatcg ggtggggtgt tggcctatgc cctctgctcc 489481 catctgtcgg tcaagcacca gcggaatccc ctcggagtcg cactcatcga tacatatctg 489541 cctagtcaga tcgccaatcc ttcaatgaat gaagggttca gccccaacga tactgggaag 489601 ggcctttccc gtgaagtaat tcgagtggcc agaatgttga atcggttaac tgccacccga 489661 ctcaccgcgg cagccaccta tgctgcaatc tttcaggcct gggaaccagg tagatcaatg 489721 gctccggttc ttaacatcgt ggcgaaggac cgaatagcta ccgtcgaaaa tttacgcgaa 489781 gaacgaatca accggtggcg aactgctgct gcagaggcgg cctattctgt agccgaagta 489841 cccggggatc atttcggaat gatgagcacc tcgagtgagg caatagctac cgaaatacat 489901 gattggattt ctgggctcgt tcgagggcct catcggtagc tttgcgaatc ggcccgtgcc 489961 acagctcgcc gtgaccaggt gccaggatgt tggtctctag caaagccagc gcagccaggc 490021 tgcggatact gttctgctgg ctgtggctga acaccgcggg cagtagctgt ggcccgcggt 490081 gacgcaacat cggatgacca gtgatcagcg catcgccgct ggccagcaca ccgtcgacga 490141 catacgagca gtgaccgctg gtgtgtcccg gggtgaaaat cgccatcggt tgacccggca 490201 gcccggcggc cgcttcggcg gtcagcggct gggcggtcgg aatgccgtcg ccggtcaggc 490261 cgccgcggcg aagcaagtga ataccccaga ccgccacacg gggccgccag ctgcgcagcg 490321 caacatcgaa aaccgaggca ttctcccggt attcccgctt ggcgtgacct acctcctcgg 490381 cgtggcagta caccggcgtg ctgtgctcac gagcaaacca gattgccgag cccaggtggt 490441 cgatgtgcgc gtgggtgagc acgatggcgc gcacgtcacc cggtgtgtag cccagtttgt 490501 tcagcgaggc cagcacctcc gcacggtcgc cgggatagcc ggcgtcgatc agcagcacgc 490561 cggtgtcgtc ggtgactagc acccagttga ccgcgtggcc gcgagcgagg tgaaccttgt 490621 cggtgatctg aacaagctcc gccatgcccg cgagtctagg agcgagcgcg agcgcggcaa 490681 gccgggtgcc gcgggtcgcg accatgggat atggagcgat cgcgagcgcg gcgaagccgg 490741 gcgtggcggg tcgcgtttat ggcataggag tagaaagaac tggtggctga actgaagcta 490801 ggttacaaag catcggccga acaattcgca ccgcgcgagc tcgtcgaact agccgtcgcc 490861 gccgaagccc acggcatgga cagcgcgacc gtcagcgacc attttcagcc ttggcgccac 490921 cagggcggcc atgccccgtt ctcgctgtcc tggatgaccg ctgtcggcga acgtaccaac 490981 cggctgctgc tgggcacttc ggtgctgacc cccaccttcc gctacaaccc cgccgtcatc 491041 gctcaggctt tcgccaccat gggatgcctg tacccgaacc gtgttttcct tggcgtgggc 491101 accggtgagg cgctgaacga aatcgccacc ggatacgagg gcgcctggcc ggagttcaag 491161 gagcggttcg cccggctgcg tgaatcggtg gggctaatgc ggcagctgtg gagcggtgac 491221 cgcgtcgact ttgacggcga ctattaccgg ctcaagggtg cctcgatcta cgacgtgccc 491281 gacgggggcg tgcccgtcta catcgccgcc ggcggcccgg cggtggccaa gtacgccggc 491341 cgcgccggtg acggcttcat ctgtacgtcc ggcaagggcg aggagctcta caccgagaag 491401 ctgatgccgg cggtacgaga aggcgccgct gccgctgacc gatccgtcga cggcatcgac 491461 aagatgatcg aaatcaagat ctcctacgac cccgacccgg agctggcatt gaacaacacc 491521 cggttttggg cgccgctgtc gttgacagct gagcagaagc acagcatcga cgacccgatc 491581 gagatggaga aggccgccga tgcgctgcca atcgaacaga tcgccaagcg ctggatcgtg 491641 gcgtcggacc ccgacgaagc cgtcgaaaag gtaggtcaat acgtgacatg gggcctgaac 491701 cacctggtat ttcacgcacc aggacatgac cagcgccggt ttctggagct cttccagtcg 491761 gacctggcac ccaggttgcg gcgacttggc tgactcctcg gcgatctacc tcgccgcacc 491821 agaatcgcag acgggtaagt cgacgattgc actggggctt ttgcaccgac tgaccgcgat 491881 ggtcgccaaa gtcggtgtgt tccggccgat tacgcggctc tctgcggagc gggactacat 491941 cctggaacta ctgctcgcgc acaccagtgc gggcctgccc tatgagcggt gtgttggcgt 492001 gacctaccag cagctgcatg ctgaccgcga cgacgcgatc gccgaaattg tcgattcgta 492061 tcacgcaatg gccgacgagt gtgacgcggt ggtggtcgtc ggcagtgact acaccgacgt 492121 caccagcccc accgagctct cggtcaacgg ccggatcgcg gtgaacctcg gcgcgccagt 492181 gttgttgacg gttcgggcga aggaccgcac ccccgatcag gtcgccagcg tcgtcgaggt 492241 ctgcttggcc gagctggaca cccagcgcgc tcataccgcg gcggtagtgg cgaaccggtg 492301 cgagctgtcc gcgataccgg ccgtgaccga cgcgctgcgc aggttcaccc cgcctagcta 492361 tgtagtgccc gaggaaccac tgctgtcggc gccgaccgtt gccgagttaa cgcaggctgt 492421 gaacggggcg gtggtaagcg gtgatgttgc gctgcgcgaa cgtgaggtga tgggcgtgct 492481 ggccgcgggt atgaccgccg accatgtgtt ggagcggctg accgatggca tggcggtgat 492541 tactcccggc gaccgctcgg acgtggtgtt ggccgtcgct agcgcccatg cggccgaagg 492601 gtttccgtca ttgtcatgca tcgtcctcaa tggcgggttc cagttgcatc cggcgatcgc 492661 cgccctggtt tccggcctgc gattgcggtt acctgtcatc gccaccgcgt tgggcaccta 492721 cgacaccgcc agcgctgccg cgtcggcccg cgggctggta acggcgacgt cgcaacgcaa 492781 gatcgacacc gcgttggagc tgatggaccg ccacgtggac gtcgccggtc tattggcgca 492841 gctgaccatt cccatcccta cggtcactac accacagatg ttcacttatc ggctgctgca 492901 gcaggcccgt tcggacctca tgcgcatcgt ccttcccgaa ggggacgacg atcgcatcct 492961 caaatcggcg ggccgcctgc ttcagcgcgg catcgtcgac ctgaccatcc tgggcgatga 493021 agccaaagtc cgtctgcggg cagcggaact cggtgtggac ctggacggcg ccacggtaat 493081 cgagccatgc gcaagcgaac tgcacgatca attcgccgac cagtatgcgc agttgcgtaa 493141 ggcgaaggga atcaccgtgg agcatgcccg cgaaatcatg aacgatgcca catatttcgg 493201 caccatgctg gtgcacaact gtcatgccga cggcatggta tcgggtgctg ctcacaccac 493261 ggcgcacacc gttcgtccgg cgctggagat catcaagacc gttccgggca tatccaccgt 493321 gtccagcatt ttcctgatgt gtctgccgga tcgggtactg gcgtacggcg actgcgcgat 493381 catcccgaac ccgacggtgg agcagctcgc tgatatcgcc atctgctcgg cacgcaccgc 493441 cgcacagttc ggcatcgagc cccgggtggc catgctgtcc tactccaccg gtgactcggg 493501 gaaaggtgcc gacgtcgaca aggtcagagc ggcaacggag ttggtgcgcg ctcgggagcc 493561 gcagctgccg gtcgagggtc ccattcaata cgacgccgca gtggaaccgt cggtcgcggc 493621 caccaagttg cgcgattcgc cggtggccgg ccgcgcgacg gtgctgatct tccccgatct 493681 caataccggc aacaacacct acaaagcggt gcagcgttct gcgggtgcga tcgcgatcgg 493741 cccggtgctg cagggcttac gcaagccggt gaacgaccta tctcggggtg cactggtcga 493801 cgacatagtc aacaccgtgg ccatcacggc gattcaggcg cagggcgtcc atgagtagca 493861 ccgtgctggt gatcaattcc ggctcgtcgt cgctgaagtt ccagctcgtc gagccggtcg 493921 ccggcatgtc acgtgccgcc gggattgtcg agcggatcgg cgagcggtca tccccggttg 493981 ccgatcacgc ccaggcgctg catcgcgcat tcaagatgtt ggccgaggac ggaattgacc 494041 tgcagacctg cgggctggtg gcggtcggac accgggtggt ccacggcggc acggagtttc 494101 accagccgac gctgctggat gacacggtga tcggcaagct tgaggagctg tcggcgctgg 494161 ccccgttgca caacccgccg gcggtactgg gcatcaaggt ggcacgcaga ttgctggcca 494221 atgtcgcgca cgtcgcggtg ttcgatacgg cctttttcca tgacttgccc ccggcggccg 494281 cgacctatgc catcgaccgc gacgtcgccg acagatggca tatccgccgc tacggatttc 494341 atggcacttc acaccaatac gtcagcgagc gggccgccgc cttcctgggc cgcccgctcg 494401 acggtttgaa tcagattgtg ctgcatctgg gtaacggtgc ctccgcctcg gcgattgccc 494461 gcggccggcc ggtggaaacg tcgatgggcc tgacaccgct tgagggcttg gtgatgggca 494521 cccgcagtgg cgacctggac ccgggcgtca tcagctactt gtggcgcacc gcgaggatgg 494581 gtgtcgagga catcgaatcg atgctcaacc atcggtccgg gatgttgggg ttggcggggg 494641 agcgggattt tcgccgtcta cgactagtga tcgaaaccgg ggacaggtca gcacaattgg 494701 cgtatgaggt gttcatccac cggttgcgca agtaccttgg tgcctatctg gcggtgttgg 494761 gccacaccga tgtggtgagc tttaccgccg ggatcggcga aaacgatgcg gcggtgcggc 494821 gggacgcgtt ggctggcctt caggggctag gtatcgcact cgaccaagac cgcaacctgg 494881 gcccggggca cggcgcccgg cggatttcgt cagacgattc accgatcgcc gtgctggtgg 494941 ttcccacgaa tgaagaactg gccatcgccc gcgattgcct gagggtgctg ggcggacgcc 495001 gagcgtgaat catacgacag cccgccggcg tgtcgcgtcg tgcgattcac actcgggcgg 495061 cttagaacgt gctggtgggc cggaccttgt tggccatgtc caccagcgtg tagcgatgcc 495121 gttgagtggg agctacccgg gccaggctgc gcagtgacgc ctcgacaccc agccgcagcc 495181 cgtgactggt gaacgggaaa ccgaggatgt ggttggtgct ggccttgttg tccttcagcc 495241 agtccagcgc gccacccagc accagggcgc ggatctgcag cacgcgtggt tcggtcgggg 495301 gcagcgcctc cactcttcgg gcggcgtcgc ggatctgttc ctcggtgact tcactcgttg 495361 accggccgga caacagagtc accgcgctgg tcagccgtgc cgtggtgaaa tgccgagaag 495421 tgggcggtac ctcgtcgagc gtgcgcacgg cgccgacccg atcaccttcg gccgaccggg 495481 ctctggccag tccgaaagcc gccgagatca cgccgtcgtt ggtgctccac accgtctgat 495541 agaacttgtg ttcgtcggtg ttgccggcta gttcggcggt ggcggccagg gcgagcttgg 495601 gcgccagctc gccgggaaag gtatccagca cctcggtgaa atgtttggtg gccgagtcat 495661 agtcgccggt gagcagctcg gcgacggccc ggtaccagac caatcgccat cgccagccaa 495721 cgcgttcggc cagatcgtcg agttttcggg tggccttggc cacatcgccg agatccagca 495781 gcgcgcggac ttccattagc ggcagctcca ctgactcgga gaagtcgacg ccgtcggcgt 495841 ccagcgcacc gtggcgggcc gcgcgcagcg agtctagggt ctgcaccggc tgggagagca 495901 ccgtggcctg caggaccgaa gctgcgacgt cggtcggatc gaccagcggc accgacagcg 495961 cggtcacgat ctcgttggcg gtcagcttct ccgcgtgcac ctgcccgtcc agatacacgt 496021 cggtgtgcgc caccagcagg tccactccaa atgtcgaccg actgggactg aagatcgttg 496081 atagccctgg ccgcggcacc ccggtgtcct gggcgaccac ctcccgcaac acgcccgtca 496141 attgcgcgga catctcttcg gcggtggtga accgttgccg cggatcgggg tcgatggccc 496201 tgcgcagcaa ccggccgtaa gagtcgtagg ttttcagcac cgggtcgtct tcgggtagcc 496261 catccacata acggccattg cgggtgggca ggtccagcgt gagcgccgcg agcgtgcgtc 496321 ccacggtgta gatgtcggtg gccaccgtcg gaccggtccg cacgatctcg ggcgcctgga 496381 agcctggggt cccgtagagg tagccgaacg agttgatccg cgataccgcg cccaggtcga 496441 tcagcttgag ctgttcctcg gtcagcatga tgttttccgg cttcaggtcg ttgtagacca 496501 agccgatgga atgcaggtag ctcagcgccg gcaggatctc cagcaggtag gcgatggcct 496561 ccgcgacggg cagtttctga cccttgctgc gtttgagcga ttgcccgccg acgtattcca 496621 tcacgatgta gccgaccgga tccccgtgcc tgtcggtgtg ctcgacaaag ttgaagatct 496681 gcacgatcga cgggtgcacc acctcggcca ggaactggcg ttcggccatc gccattgcct 496741 gcgcttcggc atcaccggaa tgcaccaggc ccttgagcac caccggacgg ccgttgacat 496801 tgcggtcgag agcgaggtag atccagccca gtccgccgtg cgcgatgcag cctttgacct 496861 cgtactggcc ggcgacgatg tccccgggat ttagctgcgg caggaacgaa tacgggctgc 496921 cgcaataggg acaccagccc tctgaagctc ccttggtctc cgagtcggac cggccgacgg 496981 gacgtccaca gttccagcag aaccgcttgg actccggcac caccgggttg gtcatcaggg 497041 cctcaagcgg atcgatatcg ggcgcccgcg ggatttccac caggccgccg cccagccgtc 497101 tgaccggcgg gcgcacccgg ctggtggtgg ccatccggtc ttgcggctcg gtgtccgggc 497161 cgagcgtcgg atgggggaag ttgtcctcat cgccgaaatc ggggcggaac accgcctggg 497221 tgctcagggg tcgaaccgtc gcggacgtcg cggtctgggc gtccgccggt tgggtgccgg 497281 ggcccgaacg ttcggtctct gacgctttgg ccatcagtcc acatacctcg gcgtgggcgg 497341 ggcgggggct gggccgagca ccgtcagcca cttgcgatac aacgtgttcc aggtgccgtc 497401 attgcggatg cgttcgagcg tgccgttgac gaaccggacc aatccggtgt tgtccaggtt 497461 gatcccgacg ccgtagggct ggtcggccat gtcgggcccg acgatatgca ggtaggggtc 497521 ttcctctacc agcccggcca ggatggtgtc gtcggtgctg acagcgtcga tctcgcgctg 497581 ctgcaaggcc accaagcagt ccgcccagtt caccaccgac acaatgacgg gaggcggtgc 497641 gatctcccgg atacggcgca acgatgtggt gcccctggcc acacagaccc gcttgcccga 497701 caggtcggac acctttgtga tcggcgagtc acgcggggcg aggatgcgtt ggttggcgtc 497761 gaggtagacg gtggagaagt tgaccagctt gcgccgctcg caggtgatcg acatcgtctt 497821 gacgacgatg tcgacctgcg acttctgcag cgcggtgacc cgctccgcgg ccgacaggat 497881 ccggtactcg acatgtgacg ggacaccgaa gatgtcgcgt gccacttcgc cggcgatgtc 497941 aacgtcgaag ccggtgatct cgccggtgat cgggtcgcgg aagctgaaca ggttgctgcc 498001 gatgtcgagt ccgacgatca gcctgccgcg cgcgcggatg tcggccaccg cggcgtcggc 498061 ctcggccttg gtggcaaagg ggcgcaggct ggcggtggga tcgcagtcct ggctcgaact 498121 gtccggcggc agcgggggtt gcggtggcat gatctccatg ccgaccggtg tgggcagcgg 498181 cagcgtcggc gtcgcctcca cccccagcgt ttccgagtgg ccgcaactgg ccagcaccat 498241 cgccaaggcc agcggcgcga gtggggctgc cgcccgcgcc aggagggccc ggcgcgtcat 498301 caccgatact ctttcagccg gggccacagg ccgagcgcga cggcaatggc ggcacctaag 498361 ctgagcacca cgccgcccac ctgcgcgcct gccagcccgc gatgcgcatt gaggatgtcg 498421 tggcgcagtt gggtgcggct ttgtcccatg gctttggtca gtgcttcgtc gagcttgtcg 498481 aatgcggggg tagcatcgtc ctcgccttta cccagtgcca cctgagtggc agcccgatag 498541 ttgccgacgg agatgtcgga attgatccgg tcgttggcct gccgccagcg caccaacagc 498601 tggtcggcgc cctgcagatc gggtttgtcg acggcgtggc ggcgggccat gtagtcgttg 498661 agctggcgtt gcatggcgtc gatgcgctga tagaaggcct gcttgcggac ctcttcgtcg 498721 ccgcgccgga tcagcgacag tgtctcgtcg gcccgtgcct gttgggcggt gatcgccagg 498781 ttggtgatgg tcttgagtga ctcagccgcg gtatctttcg cgctacggct ggccgttgta 498841 gagatggtca gcgcagttcc cacccacacc accatgacga gaataccgag cgcgcccacg 498901 acaagaccgg ggttaatccg tcgcctggtg cgccgggcca gccagcgatg tgcgaacgca 498961 ccgaagacca cggtggtggc gaccaccagg atcaccgggg ccgggatctg ggtcgacgcg 499021 gtggtttccc gatctacccg tgctgatgtc gcctggtaga gccgttgcgc gtcgggcagg 499081 atcgtcgatt gcatcagccc cgacgcctct gacagatatg acgacccgac cgggttgccc 499141 gcccggttgt tggcgcgggc gatctcgacc aggccggtgt agacggccaa ttcggcgttg 499201 atccggccca gcaattgcac caacgattcg tcggtgagcc cgctcgaggc ccgggttacc 499261 gctaccgagg catcggtaat ggcctgctcg tagcgcagcc gaacgccgcc cggctcggct 499321 tgggctatga acgcggtggc ggccgcggca tcagccaccg acagcgtggt gtacagccgt 499381 ccagccgcga acgacagcgg ctcggtgtgg tcgagcaccg cggtcaacac ctgctgccgg 499441 tgttcgatgg tggtggaggt agcgaaggcg ctggccacgc cgagagccgc caacacgatg 499501 ccgatcgtca tgattcggcc gggtgtcgtc gagatgaacc accgccgggg atgtgccggt 499561 tcggcgggcg agcgtgatcc cagcggctcg gtcgacgggt gcgccagctc aaccgtcacg 499621 tctgttagga cctcatcttt cggctaacgc aacgaaactc tataagcgaa ttctaagaga 499681 aggttccgac agatggtgtt aggcatacgc aattgcccag ttgcccgcct gcatattctg 499741 aacaggtgcg gggcgacggt gacggatggg tggtgtccga cagcggcgtt gcgtactggg 499801 gccgctacgg tgcggccggt ctgttgcttc gggctccgcg gccggacggc acccccgcgg 499861 tgctgctgca gcaccgcgcg ctgtggagcc atcagggcgg cacctggggc ttgccgggcg 499921 gtgctcgaga cagccacgag acgccggaac agaccgcggt ccgcgaatcg agcgaggagg 499981 cgggcctgtc cgccgagcga ctcgaggtgc gggccacggt ggtcaccgcc gaggtgtgcg 500041 gggtcgacga cacgcactgg acctacacca ccgttgtcgc cgatgccggg gagttgctgg 500101 acaccgtgcc caaccgggaa agcgccgaac tgcgctgggt ggccgagaac gaggtggccg 500161 acttgccgtt acatcccgga ttcgccgcca gttggcaacg actgcggacc gctccggcga 500221 ccgtgccact ggcccggtgc gacgaacggc ggcagcggct gccgcgcacc attcagatcg 500281 aggccggggt tttcctctgg tgtacgccgg gcgacgcgga tcaggcgccc tcgccgctgg 500341 gtaggcggat cagttcgctg ctgtaagcgc cgaccggagc tgctcggccg ccgcacgtgg 500401 gtcgtcagcc gaggtgatcg cccgcaccac cacgatccgg cgagcgccgg catcgagcac 500461 ggccggcagc cgttgcgcgt tgatgccgcc gatagcgaac cacggcttgt cgtcgccgcc 500521 gagttcggcg gcgacccgta ccagccccag acccggcgcc gcacggccag gcttggtcgg 500581 tgtcggccaa catggtccga cacagaaata gtcggcgtcg ccggcggcgg ccgcagcaac 500641 ctggtcgggg tcgtgggtgg accggccgat gagggtatcc ggtgccagga tctgtcgtgc 500701 gacgttcacg ggcaggtcgc gttgacccag atgcagcacg tcggcgccgg ccgcgcgggc 500761 aatatcggcg cggtcgttga ccgcgaatag ggcgccgtac cggtgcgctg cgtcggccag 500821 gatctcgcag gcggccagtt cgtcacgcgc ctgtagcggg ccgaaccgca gctcaccggg 500881 tgagcccttg tcgcgcaact ggatgatgtc cactccgccg gccagggcgg cctcggcgaa 500941 ctgagccaag tcgccgcgtt cccgacgggc gtcggtgcac agatacagcc ttgccgatgc 501001 cagacgggat tcgtgcacat cgtgacgcta gcgcgctagc gtggaaccct gtagacacgg 501061 gagtcccggg agcggggtct gagagtgggc gcgcctgccc ttaccgtcac acctgatccg 501121 gatcatgccg gcgaagggag gtcaaggatg gcgtccgacc tacacaccgg gtcgctggct 501181 gtcatcggcg gcggtgtcat cgggctgtcg gtggcccgcc gtgccgccca agccggctgg 501241 ccggtgcggg tgcaccgcag cgacgagcgg ggggcgtcct gggttgccgg cggcatgctg 501301 gccccacaca gcgaaggctg gcccggcgag gaacggttgt tgcggctagg cctgcagtcc 501361 ctgcggcttt ggcgtgaggg cagctttctc gacgggctgg gcccgcaact ggtcaccgcg 501421 cacgagtcgc tggtggtggc cgtcgaccgg gccgacgtcg ccgacctgcg cactgtcgcg 501481 gactggttgt ccgcacaggg gcacccggtg atctgggagt cggctgcccg tgacgtcgaa 501541 cccctactgg cgcaaggcat ccggcacggg tttcgggcgc ccaccgaact ggccgtcgac 501601 aaccgcgccc tgctcgacgc gctgtgccgt gactgcgagc gactcggagt tcgctggagc 501661 tcacaggtga gcagcctgtc cgacgtcgat gcgcacacgg tggtgatcgc caacggcatt 501721 gacgccccgg ccttgtggcc cggcctgccg atacgcccgg tgaagggtga ggtgctgcgg 501781 ctgcgatggc gaccaggttg tatgcccttg ccgcagagag tgattcgtgc ccgtgtgcgt 501841 ggacgacagg tctatctggt gccacgttcg gacggggtgg tcgtcggcgc cacccaatac 501901 gagcacgggc gcgacaccgc gccggtggta tcgggagttc gtgacctgct agacgatgcg 501961 tgtaccgtgc tgccggcgct gggtgagtac gagctggccg agtgtgaggc cggactgcgc 502021 ccgatgacac ccgacaactt gccgctggtc caacgcctgg attcgcggac cctggtcgcg 502081 gccggtcacg gccgatccgg attcctattg gcgccgtgga ctgccgaaca gattgtgtcc 502141 gaactcgttt cggttggggc cgcctcatga tcgtcgttgt caacgagcaa caggtcgagg 502201 tcgacgagca gaccaccatc gccgcgctgc tggattcgct gggcttcggg gaccggggta 502261 tcgctgtggc gttgaacttt tcggtgctac cacgatcgga ctgggccacc aagatctgtg 502321 agctgcgtaa gccggtgcga ctagaggtgg tgacggcggt gcagggtggc tgagtccaag 502381 ttggttatcg gtgaccgcag cttcgcctcg cggctcatca tgggtactgg gggtgcgacc 502441 aatctggcgg tgctagagca ggctctgatc gcctcaggta ccgagctgac caccgtcgcg 502501 atacgccggg tcgacgccga cgggggaacc ggcctgctcg acctgctcaa ccggctcggc 502561 atcacaccgc tacccaacac cgcggggtcc cgcagcgccg cggaagcggt cctgacagcg 502621 cagttggccc gtgaggcgct gaacaccaac tgggtcaagc tcgaggtgat tgccgacgaa 502681 cgcaccctgt ggcctgatgc ggtcgaatta gtccgggctg cagaacaatt ggtggacgac 502741 ggatttgtgg tcctaccgta cacaaccgac gacccggtgc tggcccgccg gctagaagat 502801 accggttgcg cagcggtgat gccgctgggt tcgccgatcg gcaccggcct tggtatcgcc 502861 aacccgcaca atatcgagat gatcgtcgcc ggtgcccgcg ttcccgtggt gctggacgcg 502921 ggcatcggta ccgccagcga tgccgcgttg gcgatggagt tgggttgcga tgccgtgttg 502981 ttggccagtg cggtgacccg ggccgccgac ccgccggcga tggccgcggc gatggccgcc 503041 gcggtgaccg ccggatatct ggcgcgttgc gcggggcgga tcccgaaacg cttctgggct 503101 caggcttcca gcccggcacg ataaccaaaa cggtgaagcc acggggtgcg ggcggcccgc 503161 taccggtccg attgccccgg atgtggcagc ttgcgcatac agtgcagcct tatacacgcc 503221 gacctgttgg ctgccgccga ctacaacgtt gtgggattgg cggcggcggt gctatcggtg 503281 tgggcctact tggcgtagac ctatggccga ctggtgggac gacgagtccg gagttggcag 503341 caccatcgcc agtgttccgt agcggcattg tcgctggtag tgctttggtt tgtgctgtgt 503401 aacctccggt ttaggccatt caacgctctg ttcgtttgat tggtcggtgg gatgcgaaag 503461 ctgcgcggcg acaggcgcgg tctaatctgg gcgcgatggt gaacaaatcc aggatgatgc 503521 cggcggtgct ggccgtggct gtggtcgtcg cattcctgac gacgggctgt atccggtggt 503581 ctacgcagtc gcggcccgtt gttaacggcc ccgctgccgc agagttcgcc gttgcgttgc 503641 gcaaccgggt gagcaccgac gcgatgatgg cgcacctatc gaaactgcag gacatcgcca 503701 acgccaacga cggcactcgc gcggtgggca cccctggcta tcaggccagc gtcgactatg 503761 tggtaaacac actgcgcaac agcggttttg atgtgcaaac cccggagttc tccgctcgcg 503821 tgttcaaggc cgaaaaaggg gtggtgaccc tcggcggcaa caccgtggag gcgagggcgc 503881 tcgagtacag cctcggcaca ccgccggacg gggtgacggg cccgctggtg gctgcccccg 503941 ccgacgacag tccgggctgc agtccgtcgg actacgacag gctgccggtg tccggtgcgg 504001 tggtgctggt agatcgcggc gtctgtcctt ttgcccagaa ggaagacgca gccgcgcagc 504061 gcggtgcggt ggcgctgatc attgctgaca acatcgacga gcaggcgatg ggcggcaccc 504121 tgggggctaa taccgacgtc aagatcccgg tggtgagtgt caccaagtcg gtcggattcc 504181 agctacgcgg acagtctggg ccaaccaccg tcaagctcac ggcgagcacc caaagtttca 504241 aggcccgcaa cgtcatcgcg cagacgaaga cggggtcgtc ggccaacgtg gtgatggcag 504301 gtgcgcattt ggacagcgtt ccggaaggac ccggcatcaa cgacaacggc tcgggagtgg 504361 ctgcggttct ggaaacggca gtgcagctgg ggaactcacc gcatgtgtcc aacgcggtac 504421 ggttcgcctt ctggggcgcc gaggaattcg gcctgattgg gtcacgaaac tacgtcgagt 504481 cgctggacat cgacgcgctc aaaggcatcg cgctgtatct gaacttcgac atgttggcgt 504541 cgccgaaccc gggttacttc acctacgacg gtgaccagtc gctgccgcta gacgcccgcg 504601 gtcagccggt ggtgcccgaa ggctcggccg gtatcgagcg cacgttcgtc gcctatctga 504661 agatggccgg caagaccgcg caggacacct cgttcgacgg tcggtccgac tacgacggct 504721 tcacgctggc gggtatccct tcgggtggcc tgttctccgg cgctgaggtc aagaagtccg 504781 ccgagcaagc cgagctctgg ggcggcaccg ccgacgagcc tttcgatccc aactatcacc 504841 agaagacaga caccctggac catatcgacc gcaccgcgct cggtatcaac ggcgctggcg 504901 tcgcgtacgc ggtgggtttg tatgcgcagg acctcggcgg ccccaacggg gttccggtca 504961 tggcggaccg cacccgccac ctgattgcca aaccgtgatc cgggcctgat ctcgccactg 505021 accccgcacc gaccgatcta gaatgggatt tccttggtgc atgccgggcg ggacggggtt 505081 aggagatgca tggtcgcggg cggtatcgac ctctggtccg ctgtgttcgc cctcgccggg 505141 tggccgcgtc ggtgcggacc ccgatcgcct gtctagcggc ggtggtcgtg atagccggct 505201 gcacgaccgt cgtcgacggg cgggcgctgt ccatcctcaa cgacccgttc cgggtggggg 505261 gtctgcccgc gaccaacggt ccgagcggcg cccgccccga cgcaccggct gcgtcgggca 505321 cggtgatcaa caccaacaac ggagcgatcg acaagttgtc gttgttgtcg gtcaacgaca 505381 tcgaggacta ctggatggcg gtctacagcg aatcgctgaa gggcaccttc cggccggtcg 505441 gcaagctggt gtcctacgat tccaacgacc caagtagtcc gatcgtctgc cacattgaca 505501 cctatcagct cgtcaacgcc tttttcagct ctcggtgcaa cttgattgcc tgggatcgag 505561 gggtcttcat ggcggtcgcg caagaatact tcggcgacat gtccgtcaat ggtgtgctgg 505621 cacacgaatt cgggcatgct ctgcaagtga tggcgaattt ggttaccagg aaagatccca 505681 ccatcgtccg cgagcagcaa gcggattgct tcgccggggt ctatctgtgg tgggtggccg 505741 aaggtaagtc gacacgcttt acgctgagca ccgcggacgg gctcgaccac gtgctcgccg 505801 gcatcatcac cacccgagac ccggtgatgg aagccgatgc ggaaaacgac gacgaacatg 505861 ggtcggcctt ggatcgggtc agcgcgttcc agctgggctt catcaacggc acgccggcgt 505921 gcgcggcgat cgacgaggac gaagtcgagc ggcgccgcgg tgacctgccg acggcgttgc 505981 gggtcgatgc cagcggcaac ccagagaccg gcgaggtcgg aatcaacgaa gagaccctct 506041 cgacgttgat ggagttgatg ggcaagatct tctcgccgaa gaatccgccc acgctgtcct 506101 accagccggc cggttgccca gacgccaagc ccagcccacc ggccgcctac tgtccggcca 506161 ccaacaccat cgtggtcgac ctgcccgccc tggcgaggat gggcaaggtg gcctcggcag 506221 cggaacacag cctgccgcag ggcgatgaca cgtcgttgtc gattgtgatg tcgcggtacg 506281 cgttggcggt gcagcacgaa cgcgggctgc cgatgcagag cccgtggacc gccttacgga 506341 cggcgtgcct gaccggcgtt gcgcaccgca agatggccgt gcccatcgac ctgccctccg 506401 gccagcaact cgtacttacc gcgggtgatc tcgacgaagc ggtttccggg ttgctgacca 506461 accgcatggt cgccagtgac gccgacggtg tcagcgttcc ggccggtttc actcggatag 506521 ccgcgttccg tgccggcgtg ggcggcgaca tggacgcatg ctatgcccgg tatccgggat 506581 aggactggcc ctgatgttga tcgttgtgca cccacatcac caaaaacccg gtgaccagca 506641 accaccccag ggcaacggac gggatcgccc aggcgcgacg tacgagtagt gcggtgtcac 506701 aacgcgtgac cagggcctga gtgttgtcgt tgccgtcggc gtacagcgcc tgggtgaggt 506761 tggagcgcca gccgctgcca caggtgacct tgatgccata cgcatcgtat tgatccaggt 506821 agaccggaaa ccacagcgcc atcagaccaa tgaccgccag cagcaggcca gtaattccga 506881 tgaacatctg gcgacgattc acggcttctc catgtcttgc gatgtgcatt cgggattcgg 506941 gcgccgcagc gctcgcgtca tgcaagcgca aatgcgggct ttgccaacaa aggccgggtg 507001 gccacgccca ggcaagttgt gagggaggcc ccccggggcc gcaaccatgt taacgcgcgt 507061 ccgcctaagc attcagcgcg ccgtgcccta ccggcactac gcccgggcgt gcgtgcggaa 507121 cctgacagag ctcacgctat ttggcccgcc gacagacgta gcgccgcatc gaccgccagc 507181 ctggcgacat cgagggtctt cgagcccaag tcatgtcggg cgccggtgat ctcgacgacc 507241 tcggtcggtg ccgagaccat cgccgcggcg gaacgcacct gggccagcgt gccgaacggg 507301 tccgccgttc cgtgggtgaa caccgtcggc actgcgatcc ccggcaagtg ctcggtacgg 507361 acgcgttccg gctttcccgg cggatggacc ggataggaga acagcgtcag cacgtcgacc 507421 ggtgcctgcc cggccgccac caccatggac gtctgccgac cgccgtagga atgtcccccg 507481 gcgatcagcg gaccctcggc aaggccgcgg cacagctgga tcgcttcgac gatgccggca 507541 cggtcgcctg accccgagcc ggatggcgga ccggtgggtc ggcgtcggcg gtagggcagg 507601 ttgtagcgca cggccagcca tcctcggcgg gtccattcgg cgcaaacctg ttgcaacagt 507661 gtggattcgc ggctaccgcc cgcgccgtgg gtaaggacga ctaccccgtg tggtgggccg 507721 gccggttggt gtgcaacgcc ggcgatctga tcaaggttca tgacagccga aacagcggcg 507781 aaacgggccc gtggccgcgg cccagtggat aggccgcgcg caggcattcg gtaacccatc 507841 gcttcccgaa gtccaccgcg tcgggcacgg tgaagccgtg cgccaacgcg gcggcgatcg 507901 cggtcgccag cgtgtcacca ccgccatggt catcgccggt gggtagtcgc tgcgcgtcga 507961 actggtagca gctgacgccg tcatagagca ggtcgcagct gccgtccgac gagcgcaggt 508021 gtccgccttt gaccagcacc cactgcggcc ccagcgcatg cagggctttg gccgccgcac 508081 gctgcgactc ggcgtcgact acctcgatat ctaccagcag gcgcgcctcg tcaaggttgg 508141 gggtcagcag cgtcgccaac gggaacagct gaccgcgaag cgaatccagg gcagacggtg 508201 ccaacagcgg gtctccgtgc atggatgcgc ataccgggtc gacgacgagc ggaacggaca 508261 gctcgagccg acgccaggtc gcggccacgg tcgcaacgat gcgcgacgag gccagcatcc 508321 cggtcttggc ggcttgaacg ccgatgtcgg tgacgaccgc ctcgatctgg ccggccacca 508381 catcgttggg aacttcatga atatccttga ctcccaacgt gttctgtacg gtaaccgccg 508441 tgactgcgac gcacgcgtgc actcctagca gtgccatcgt gcgcatatcg gcttggatgc 508501 cggcaccgcc cccggagtcc gatccggcga tgctcaacac ccgcggcggc gtcattcccg 508561 gcggtgccag cgggaggtag ttcactgggt tatcgggaga tacacccgat tgccgtgctc 508621 tgcgaattca cgtgactttt cggccattcc ggcggcgagc acggcttcga tgtccgcttc 508681 ggtctcaagc ccgtgttcgg cggcgtactc acggacgtcc tgggtgatgc gcatggagca 508741 gaacttcggt ccgcacatcg agcagaagtg cgcggtcttg gccggctccg ccggcagggt 508801 ttcgtcgtgg aattcccgtg cggtgtcggg atccagcgac agtgcgaact ggtcgttcca 508861 gcggaactcg aaacgcgccg tgctcaaagc gtcgtcgcgc tcctgggcgc gcggatggcc 508921 cttggccaaa tcggccgcat gcgcggcgat cttgtaggcg atcaccccgt ccttgacgtc 508981 cttgcggtcc ggcaacccga ggtgctcctt gggggtgacg tagcacagca tcgcggtacc 509041 ggcttgggcg atgatggccg caccgatcgc cgaggtgatg tggtcgtagg ccggcgcgat 509101 gtcggtggcc agcggaccca gcgtgtagaa cggggcctcc tcacacagtt cctcttccag 509161 ccgcacattc tcgacgatct tgtgcattgg gatatggccc ggcccctcga tcatcacctg 509221 tgcgccatgg gctttggcga tcttggtgag ctcgcccagg gtgcgcagct cggcgaactg 509281 cgcggcgtcg ttggcatcag cgatcgaccc tggtcgcagc ccgtcaccga gtgagaaggt 509341 gacgtcgtag cgggcgaaaa tatcgcagag ctcctcaaag ttggtgtaca agaacgactc 509401 ccgatgatgt gccaaacacc acgcggccat gatcgaaccc ccgcgggaca cgatgccggt 509461 gacccgcttg gcggtcagcg gcacataccg cagcagcacc ccggcgtgca ccgtcatgta 509521 gtccacgcct tgctcacact gctcgatcac ggtgtcgcgg tagatctccc aggtcagctc 509581 ggtcggatcg cccttgactt tctccagcgc ctgatagatc ggcacggtgc cgaccggcac 509641 gggagaattg cgcaggatcc actcgcgggt ttcgtggatg ttcttgccgg tggacaggtc 509701 catgatggtg tcggcccccc agcgggtggc ccacaccatc ttgtcgacct cctcggcgat 509761 cgagctcgtc accgccgagt tgccgatgtt ggcgttgact ttcaccgcga acgccttgcc 509821 gatgatcatc ggctcgctct cggggtggtg gtggttggcc gggatcaccg cgcggccgcg 509881 ggcgacctcg tcgcgcacta gctcggcgga catgtcttcg cgggcggcga tgaacgccat 509941 ctcggcggtg atctccccgg cgcgggcccg ctgcagctgg gtgccccgat cgcgaaccac 510001 tccgggccta tgcggcagcc ccgcggtcag gtcgatcacc gtgtccgtgt cggtgtaggg 510061 cccggaggtg tcgtagaggt cgaagtggtc tccggtggac aagtgcaccc gtcgaaacgg 510121 gacttggaga gtagctccgc tgccgggagc ctcgatttca cggtaggcct tggcgctgcc 510181 cgcgatggga cccgtggtca ccgacggttc aacggtgatg gtcatttgca actccctacg 510241 ccggcattac ccggtcaggt tcgtacggtc gacggccccg agccgtcctc tcagcgcact 510301 cggcgtgcgc tcccgcgtgg gtacccccac gctagcgcag cgcggcgccg gtgtgcacgg 510361 acggcccgat gccgcgttag gcctcttcca tcgcctcgcc gagttcctcg aggacccggt 510421 tgtggtggtt atttgccaag atatgtccgg tggcgataac cagggcgacc ggccagtcga 510481 tgagttcgag ggcagcgagt gccgccagac ctccgtagta ggccaggtgc tctggccgcg 510541 gaatcttcac ttggccgcag atcgggaggt tcatcacaat agtctccgct tcgcggatct 510601 tagccacggc ctcacgctga gatgtcgctc gacgcgtatt cttttcggcc atcatttgcc 510661 tttcagtaac gaagggtttg ccgttgtgca gggtggtgcg gtcaccgtcg ggggtatgtc 510721 gacgaggacc gatgcgctgg cggacccttt ggcttcgtcg tttgccttgt tgctgacggt 510781 gcctttactg gagctttacg ccgtgctgtg gcgcgtcggc gtcgtcgagg tccggggggc 510841 gcaccggggg acgcgtcgcg ggaaagcgca tcggtctcgg gtggttgcgg gttcggctgg 510901 cccgatttgt cccgacccgt cagcacacgg ttcagcacgg cgacggcgac ggtcgcagca 510961 gtcgcggctg ctgtcgcctg cgcccagccg agtgggtcga ggggggtgca gccgagcaac 511021 tggctgacga cggggatgct gatcaaggtt gccagcgcgg ccagtgagcc cagtgcggtg 511081 agcacaacca gccaggcatg cgagtccacc aaggtttgac ccaactgagc ggccaccagc 511141 gccaccagcg ccaccgtgga tgcgcggcgc ggcaagccgg tgaaccccgc catcacccag 511201 gccacggtgg ccgcggccgc cgtggtcgcc ccgcggatac cgacggcacg ccatagctcg 511261 cgttgatcgg gaccgcgggt tgccggcgtt accgggtcgc ttggcttgct gaccgcgagc 511321 gccgccgcgg gcagtgcgtc ggtcagcatg ttcaccagca gcagctgacg ggtgttcaac 511381 ggcgaggtcc cggtgatagc gctgccgatg atggcaaagg ccacctcgcc cgcgttgccg 511441 ccgagcagca cagacactgc cgcttgcacc cgctgccaaa gctggcgtcc ttccaggatt 511501 gcgggcagca atgactctat ccggccgtcg accaacacca ggtcggcggc gactcgggcc 511561 gggtcgctgc cgtgggcgac gacaccgatg ccgacggtgg cggcgcggat cgcggccgcg 511621 tcattggagc cgtcgccgac cattgcgcac acccggccgc tgtgttccag cgtctgcacg 511681 atctgtacct tgttctccgg tgtcatccgg gcgaagatca cccgctcggc taccgctcgc 511741 tcctggtcct tgcgtgacag ggcatcccac tcggcaccgc taatgacctg ctcagggctc 511801 acttgcatgc cgagctcctc ggcgatggcg gcggcggtaa tcgggtgatc accggtgatc 511861 agccggatat ccagatcgtg ctcgtgcagg tccgcaagta gggccgccgc ctgggcgcgg 511921 ggggtgtcgg acaacccaag aaaccccacc agactcaact cgtcgcggca caatctcgcg 511981 atctcgtcgg ggtcgtccac gaccgactgt gcctgttgcg cggtcagctg gcggtgggcc 512041 accgcgatca cccgcaatcc gttggcggcc agttcagcga ccgcgtcgtc catgctcgag 512101 ccgatgcctt cgcacgccgc cagcaccact tcgggcgcac ctttgacggt cagctcggtg 512161 ccggacaccg aggcggaaaa cgacctaccg gagcgaaatg gcaggtgggc ggcgggttct 512221 gcggcgccgg gttcggcacc gtcggtgcca ctggcggcag ccgctgccgc agcttgcacg 512281 atcgcgacgt cggtggcgtg cacctgcggg ccgttcgacg ccggcgcagc gtgcgccgcg 512341 cagcgcagca cttcctcgcg cgagtgcccc gccaccggcc gcacctgcgc cacccgcaaa 512401 cggttctcgc tgagcgttcc ggtcttgtcg aagcagacca tgtcgacacg gccgagcgcc 512461 tccaccgagc gcgggatgcg gaccagcgca ccgaagtgac ttagccgtcg cgcggatgcc 512521 tgctgggcca gtgtggccac cagcggcatc ccttccggca ctgcggccac tgtgactgcg 512581 ataccgctgg ccaccgcttg gcgtaggccc cgccggcgca acagcccaag cccggtgacc 512641 agtgcgccgc cggtcatgct gaccggccag gcctggttgg tgagccgact cagctgatgc 512701 tgcaggccga cgctggacag atcaccggac acgagctcgg ccgcgcggcg ctcctgagtg 512761 tcaggaccca ccgcggtcac caccgcgacc gcggtgccgg acacgacggt cgtcccggca 512821 tagagcatgc agcgacgttc gatcaggtcg acacccggcg tgggttcgac ttgtttggtc 512881 accgacagcg actcaccggt gagcgcggac tcgtcgacct ccacgtcgac ctcctcaatc 512941 acccgggcgt cggcgggaac cacctcgtgg gtccgcacct cgatgatgtc gccgggacgc 513001 agctcctcgg cgcggacttc gatgtacctc ggctggtcgt ccgcgccggc cagcaccttc 513061 ctggcgggtg gaatctgctg agccaacaag cgattcagcc gactttcggc acgcagccgc 513121 tggctggccg cgagaataga gtttccggtg agcaccgaac cgaccatcac cgcgtccacc 513181 ggcgaaccca acaccgcact ggccattgca ccaagcgcca gcataggcgt caacgggtcc 513241 gacaactccg cgcgcatggc cttggtcaac tgccaaagcg cgttcaaggg tgcctgggtt 513301 atttgtgcgc cgcgcttcgc cgtgtgcagg ccaccggcca gcgcacgggc cggatacggc 513361 gaaggcggtg ccttcgccgg tgcctgctcg tccggcgacg gcaaagcttt gcggacttgc 513421 tcgaccgaca ttgcgtgcca ttcatgagcg ggtgccggtc gcggtgcttg cgcgtcgacg 513481 accttgcgtg ccagcaggta tcccgagagc agtccggccg ccgcgccggt ggtcaccggg 513541 ccgggcccca gtccgcggac ccctggcagc atcaacaggg ctcccaaagc cgatgcacca 513601 ccggaaattt cgttacctcg ctggcgtgcg gccctggccg ccggaatcgc gtgcagcacc 513661 ctccaggcgg ctccaagatc gggcagcagg acatctgcgt accagggcgg tgcaccggct 513721 ccaggtggtg gcagcacacc aagcgccaca tcggcagccg aaagcgcttg cttaccaacc 513781 gatgacaaca ccgcgacggt gcggcccgcc tggcgcagct cggccaccgc acgggctagg 513841 gcttcgtcga gggacccgct ggcaccgtcg tcgaggggcc ggatgtcgtc gaacaccggt 513901 cgcaactcgc ccagggcgtc gacatcgacc gaaaccaggt ccgccccggt gcgatgtgcc 513961 tcggcaacca ccgcggaggc cagccggtcg tgcatggggc ggaaaagtgc ctcgactgcc 514021 gaatccgaac cgctggccga gacaccgggt actcggtgcc agcctgggcg caggccactc 514081 tccgtcaaaa cgagttgcgc ccgattccac gctgtggaca gctcgtctgc gccgcaaccg 514141 cggatgcgcg ccacgcgcag gtcatcggtg cacagcacgc gggggtcgat gacgatcgca 514201 tcgacccgat ccaatcggcg caaactctcc ggccgtaacg gcaacaccgc gtgctgatca 514261 gccaaacctt ggccgagcgc cgcggcgaac gcctccggcg tggtccggct ggctttgggg 514321 gtggccacca gcgtcgcggt cgcggccatg tccgcgtcgc gggtcccggc gcccacgagc 514381 accgcgctca gcgcttggat cagcgcgaaa cgcgcgacgc tgcgttggac aggttgcgtc 514441 gaccgtgcgg gacgcggcca aagggattgg ggttggtcgg ccggttcgtc ggcgtgcagc 514501 gcgagctgtg gttcatgccg gcgccaggct ctggctccgg cacggcattc cgcggctttc 514561 agcgcctgga tcgtcagatc caccgacaac gccgccggcg acagcgtgac cgtgtgtgcg 514621 gcggccatgg ccagctcaag gacggtggcg gtcgcctccg tgcctattcg atcctcgagt 514681 aggcggcgca gcaacggctg gtggtccacg gccgccaccg ctgcctcgat gacgagcgga 514741 aatcggggcc agcgcagcgc ccggccgccg agcgctaagc ccagcccggc cgcggtggcg 514801 gctaccgtta cagctctgac cgccagcagc acgccgtcgc ccggcaggct ccccggtgat 514861 tgcgccagct gatcggcggc ttgatctggg tggcggtgtc tttcggcttt ttcggcgtca 514921 tcgacaatgc ggcaaagttc gcgcagtgat gtgtcgggat cgtcgatagc gacgacgaca 514981 cgggacaacg ggtagttcag gctggccgac ccgaccccgg ggtgggcttg gattgcgttg 515041 agcacgacgc gcccaagttc gtcgtcgcct ccgctgcgca agccgcgcac ttcgatccag 515101 gcgcgacgct cgccacgcca acagttccgg ccgagtgtct cgcgggacag ctcgccggaa 515161 agtgccttgg ctcctgcgcg cagcgggatg atggcgacct tcataccggt tccgaccccg 515221 gtcttggcca gggttgccga gaccgctgtc gcggcagtga tcgatgcgcc ggtaagggtg 515281 gcggtcgccc ggaagcccgt ggcgacagca cgcaccggca tagcccgtgc gatggatgca 515341 gcaatgctca agagttgctc aacgccgcca gactagttgg tgctgcgcag ctcagcggtg 515401 cccgctcggc gaccgctcgt cttcctggct gttgtcttgc tcgctttggc tggcgcttcc 515461 tttgcggcag ccggcttgtc gggcaccggc gcaagtttcg ccttcaccgg cggtgcggcc 515521 acctcggggg tgcggttgag cttccgcaat agcaacgctc cgccgcccac ggccaacagg 515581 atcggccaat cgacgagtcc agcgacaccg atcgcaccga tggccaacgc ggctgccgcg 515641 gtcgacttgc tgccgctgct aagccccttc tgaatgcctt tggcggctcc ggtcacaccg 515701 ccgacgatgc cgctcacggc ggcgccaccc accgcacccg cggccgctgt ggtcgccgtt 515761 gctgcaccac tcaccgttcg ccccacggtt cgaactgtcc cgccgaccac actcatgatg 515821 actccctggc ccaaactgca ttcgtttaca aatggtttag ctacagttct acactcgtta 515881 acccgcaccc tgcattcgca ccgctgacga gatttctgtt cagcgctctc gaaatgcaag 515941 cctgccacgc cgccctgact gagacaacgc gcaactgccg cgtgcggcgc gactgccgac 516001 taccgccgta cgccgcctac ccggcgtgca ggtcgacgag caccggagcg tggtcgctgg 516061 gcgctttgcc tttacgctcc tcgcgtacga tctgggcgtc catcacccgg gcggccaacg 516121 ccggcgagcc gaggatgaag tcgatgcgca tgccctgttt cttcgggaac cgcagctgcg 516181 tgtaatccca gtaggtgtaa accccgggtc ccggggtgaa aggccgtact acatcggtga 516241 attgcgcgtc gacaatggcg ttgaacgcct tgcgctcggg ttcggaaacg tgcgtgcagc 516301 cggcgaagaa ttcggtgctc cagacatcat catcggtcgg agcgatgttc cagtcgccca 516361 tcagtgcgat tggtgcggcg ggatcgtcac gtagccagcc ttcggccgta tcacgcagcg 516421 cggcaagcca atccaacttg taggtgtagt gcggatcgtc cagggcgcgc ccgttgggca 516481 cgtagaggct ccacacccgg atgccgccgc aggtggcgcc cagggcacgg gcctccgtcg 516541 tggcggccac ttccggcttg ccgctccagc tgggctggcc gtcgaaccca acccgcacgt 516601 cgtcgaggcc gacgcgggat gcgatcgcca cgccgttcca ctgatcgaag ccgacgtgtg 516661 cgacgtcata gccgagttcg aacagcggca aggccgggaa ttggccgtcc gggcacttgg 516721 tctcctgcat ggccaacacg tcgacatcgg cgcgcccaag ccaatcgagg acacgatcca 516781 accgggtgcg aatcgaattc acattccagg tggccagccg cagcagcggc gatcgcaagc 516841 gcggcgaagc cgggcgttgg gggtggccgc cgtcaattgt gccgtcgggc atggctagaa 516901 ggtatcccag ccgaccgact gggcaggaag atagcggcag tgatggtgca gccggaagcc 516961 cagggactcg gcgaggacgg atgtcgcggt gtcgtgcacg cgcacgtagc cgcgggtcgc 517021 gccgcggccc gctccccagc ccaacagcgc ttcccacaat tggcggccag cggagccggt 517081 cgcggattgc tcgtcggcgg cacgcattgc cgacagaccc acccaccggg tgccgtcggg 517141 tgcgtcggtt accgctgcac gtgcgaccgc cacacccagg tagctgccga atgccaactc 517201 gccgtcgatg acgggggtcg ccatgtcgag gggtaggcgt tggtggtaga gccgcagcca 517261 ggtgtcgtcg gggtggtcca gcaacgtgac cgaccggtcg ggttcaccgg tggacacgtc 517321 acgcaccaac acttgctctc ggcgctcacc tgccaggtcg gccggtagtg gcagcaagcg 517381 gtccgggacg gccagccatg gctgcagatc acggctcgca taccatgcgc tgatttctgt 517441 gatggtgttc gtgtgtgccg agatatccag cggtactgct gaattagcgg ccagtacggc 517501 cccgtgtccg gctcgcagga gccagccgtc cagccaggtt cgttcaacgc cgggccaggc 517561 cgccgcggcg gcgtgttcaa gtgcgcggat cgcggcggtg cgcaccggcg catcggtcag 517621 gacccgcagg gccaccacat cgacgggcga gaactcgacg atggtcccgg tcttggtctg 517681 cactcgcacc gtcggatcga cggctagcag ccgacccacc gcatcggtca gcggtggcat 517741 cgatccggcg ggccggcggt agcgcaccgt tacccgtgtc ccaagccccg gccacgagac 517801 cattagtgac cgaacgggtc ggggtcctcg ccgggcagcc acgacagtcc gggaacgccc 517861 cagccatgtg acttgacggc ccgtttggcg ttgcgggcgt accggccgat gaggcggtcc 517921 aggtacagga atccatcaag gtgcccggtt tcgtgctgca gcatccgcgc gaacaggccg 517981 gtgccctcga tactgaccgg actgccatcg gcgtcgagtc cggtgactcg tgcccacttc 518041 gcgcgtccgg taggaaatga ctcgccggga accgacagac agccttcgtc gtcggtgtcc 518101 gggtcgggca tggtctcagg tatttcggag gtctcaagca ccggattgat gaccacaccg 518161 cgtcggcggg cggtcattgc gcggtccgcg gcgcaatcgt agacgaagag ccgcaggctg 518221 cagccgatct ggttggcagc caggccgact ccgttggcgg cgtccatggt gtcgtacatg 518281 gtggcgatca actgggcgag atccgccggg agtgaaccgt cggcggcgac cgtcaccggt 518341 gtggtcgcag tgtgtaagac gggatcgccc acgatgcgga tgggtacgac tgccatggtg 518401 ggctagctta agcgcgccga cgatacgcgc cgcgaggcgg cgggctgagg aggcgggcaa 518461 tcggcttagg cgcgccgcgg ggcggcgggc atcatcgccg ggtgtgaacc acacgacggc 518521 tggccggcat gtcgcgtcgc aggattcaca ctcggagcat gagccggcgc gccgcgatcg 518581 gcagtcgggt gcaagcaagt cggccgactc gcgggcagga ttaccgcccg acggttcctg 518641 gcgtggttca atattcgccg aagaagcgcc tacgtaggcc aagtcattcg tacacattga 518701 gaattcgccg gaagggccca ggggaaagcg atatggacag cgccatggcg cgggcaattc 518761 gatcggggga cgacgccgag gtcgccgatg ggctgacccg gcgcgagcac gacatcctgg 518821 cgttcgaacg tcagtggtgg aagtttgccg gtgtcaagga agaagccatc aaagagttgt 518881 tctccatgtc ggcgacgcgc tactaccaag tgctcaatgc gctggtggat cggcccgagg 518941 cgctggccgc cgacccgatg ctggtaaagc ggttgcggcg gctgcgcgcc agtcggcaga 519001 aggcgcgggc cgcgcgacgc cttggcttcg aggtgacctg acactctccc cgcttttgcc 519061 ggttgtgtcc cggtgctggt tacagtgggc tcgatgaatg agcgtgtacc cgactcttcc 519121 gggcttcccc tgcgggccat ggtgatggtg ctgttgtttc tcggcgtcgt cttcctgctg 519181 ctcgtctggc aggcactggg ttcgtctccg aactccgagg acgactcgtc agcgatttcc 519241 accatgacca ccaccactgc ggcgccgacg tcgaccagcg ttaagcccgc ggcgccccgg 519301 gccgaggtgc gcgtctacaa catctcaggc acagaaggcg ccgccgcgcg gacggccgat 519361 cggctcaagg cggccggttt cacggtcacc gacgttggga atctatcgtt acccgacgtc 519421 gcggcgacca cggtgtacta caccgaagtc gaaggcgaac gggccaccgc cgacgcggta 519481 ggccggacgc taggagcagc ggtggagctg cgactgccag agctgtccga ccagccgccc 519541 ggggtcatcg tcgtggtgac cggctgacgc tgattcgaac gccaggttag gctctcgcta 519601 tgccaaagcc cgccgatcac cgcaatcacg cagctgtcag cacgtcggtc ctgtccgcgt 519661 tgtttctggg cgccggtgcc gcgctgctga gcgcatgctc gtcgccgcag cacgcgtcta 519721 cagttccggg taccacgccg tcgatttgga ccggatcgcc cgcgccgtcg ggactttcgg 519781 gtcacgacga ggagtcgccc ggtgcgcaga gcctgaccag taccctgacg gcgcccgacg 519841 gcacgaaggt agcgaccgcg aagttcgagt tcgccaacgg ctatgccacc gtcacgatcg 519901 cgacgaccgg cgtcggtaag ctcacgcccg gcttccacgg cctacacatc caccaggtgg 519961 gtaagtgtga gcccaactcg gttgccccca ccggcggtgc gcccggcaac tttctgtccg 520021 ccggcggcca ctaccacgtg ccagggcata ccggcacccc cgccagcggc gacctggcct 520081 cgctgcaggt acgcggtgac ggttcggcga tgctggtgac caccaccgac gccttcacca 520141 tggacgacct gctgagcggc gcgaaaaccg cgatcatcat tcacgccggc gccgacaact 520201 ttgccaacat tccgccagaa cgctacgtcc aggtcaatgg gactccgggt cccgacgaga 520261 cgacgttgac caccggcgac gccggcaagc gggtggcgtg cggtgtcatt ggttccggct 520321 agcttgcctg cccgcaggtc ggccgcccga attgatttcg caggctcacc gcggcccacc 520381 ctcggtgtgg agtgggagtt cgcgctcgtt gactcgcaga cccgcgatct gagcaatgaa 520441 gccaccgcgg ttatcgccga aatcggcgaa aacccgcggg tccacaagga attgctgcgc 520501 aacaccgtag agattgtcag cggtatctgc gaatgtaccg ccgaggcaat gcaggatctg 520561 cgcgataccc tgggccccgc ccgtcagatc gtgcgcgacc gcgggatgga gctgttctgc 520621 gcgggtaccc accccttcgc gcggtggtcg gcccagaagc tcaccgacgc gccgcggtac 520681 gcggagctga tcaaacgcac ccagtggtgg ggccggcaga tgctgatctg gggtgtacac 520741 gtgcatgtcg ggattcgctc ggcgcacaaa gtgatgccga tcatgacgtc gctgctcaac 520801 tactacccgc atctgttggc gctctcggcc tcatcaccct ggtggggtgg cgaagacacc 520861 gggtatgcca gcaaccgggc gatgatgttc cagcagttgc ccaccgccgg gctgccgttt 520921 cactttcaga ggtgggcgga gttcgaaggt ttcgtgtacg accagaagaa gaccggcatc 520981 atcgaccata tggacgaaat ccgttgggat ataagaccct caccccatct gggcaccctg 521041 gaggtgcgga tctgcgatgg cgtgtccaac ctacgagagc tcggcgcgct ggtcgcgctg 521101 acgcattgcc tgatcgtcga tctggaccgc cgcttggacg ccggcgaaac gctaccgacc 521161 atgcctccct ggcacgtcca ggagaacaag tggcgtgccg cccgctacgg cctggacgcg 521221 gtgatcatct tggacgccga cagcaacgaa cggctggtta ccgatgacct cgcggatgtg 521281 ctgacccggc tggagccggt cgccaagtcg ctgaactgtg ccgacgagct tgccgcggtc 521341 tccgatatct accgcgatgg cgcctcctac cagcggcagc tgcgagtggc gcagcagcat 521401 gacggcgatt tgcgcgcggt agttgacgcg ctggttgccg agctggtgat ttagccgatg 521461 cgggctggct gagtgtgacg tccgccagcc gcgaggagat tgaggtttag gtgatggccg 521521 atttcgcgcc ggttgagttg gcgatgttcc cgctcgagtc ggcgccgctg cccgacgaag 521581 atctgccgtt gcacatcttt gagccccgct acgcggcgct ggtccgtgac tgcatggaca 521641 ccgcggatcc tcgcttcggt gttgtactga tctcgcgtgg ccgcgaggtc ggcggcggcg 521701 atacgcgatg tgatgtcggg acgctggcca ggatcaccga atgcgcggac gcgggttcgg 521761 gtcgctatat gctgcgctgc cgggtgggcg aacggatccg ggtgtgcgac tggctgcccg 521821 acgatccgta cccgcgtgcg aaggtacggt tctggcccga ccagccgggg cacccagtga 521881 cggctgccca gctgctggaa gtcgaagacc gggttgtggc gctattcgag cggatcgctg 521941 ccgcccgggg agttcggctg ccggcccgtg aggtggtatt gggctacccg gtggttgacc 522001 cagccgatac cgggcagcgt ctgtacgcgc tggcatgtcg agtgccgatg ggcccggccg 522061 atcggtacgc cgtgctggcg acgccgtcgg cggccgatcg attggtccgc ttgggtgacg 522121 cgctggactc ggtggccgcg atggtggagt tcgagttgtc gacgtaactg ccctacgcgg 522181 tgcgtctgac ccactgggcc tgaaccacat tcactgcgcc gagcaccata tacggacccg 522241 tcaccgccgg caagcgcatc cgggtgcgga accggctcga caatggtcaa cgccttcgca 522301 ccattgccga ccagtacccg caattgctcg acttcatcag tggtcgctag gaccgaaggt 522361 cacccttggt gccgaactta cgcagcgacg ccacctgcag cggatccagc gacgcgcgca 522421 cggtttctcg cgcggtcgcc aggtcggcgg cggtgacgtt ggcggcatcg atggaacgcc 522481 gcatcgcggt aagcgcggct tcgcgcagca gcgccacaca gtcggcggca ctataaccgt 522541 cgagtccggc tgccacctcg tccaggtcga cgtcggagct cagcgggatc gacttgccag 522601 cggtgcgcag gatttcgcgg cgagcggcag cgtcgggcgg ttcaacgaac accagccgtt 522661 ctagccgccc cgggcgcagc agcgccgggt ctatcagatc gggccggttg gtcgcgccta 522721 gcatgacgac atcccgcagc gggtcaatac cgtcgagctc agtcagcagc gcggccacca 522781 cccggtcgga gacgcccgag tcgaagctct gaccgcgccg tggcgccaga gcgtccagct 522841 cgtcgaggaa caccagtgac ggcgcggagt cgcgggcccg ccggaatagc tcgcggactg 522901 ccttctccga ggagcccacc cacttgtcca tcagctccga ccctttgacg gcatgcacgc 522961 tcaactgtcc ggtgctggcc agggcacgaa ccacaaaggt cttgccgcag ccgggcgggc 523021 cgtacagcaa caccccgcgc ggcggttcga cacctagccg agcgaaggtg tcggggtgct 523081 gcagcggcca cagcaccgcc tcggtcagtg cttgtttggc cgcggccatg tcaccgacat 523141 cgtcgagcgt cacgtcaccc acggtgactt cgtcgctggc cgagcgggac agcggccgga 523201 tgacggtcaa cgcaccgagg aggtcgtctt ggtgcagcat cggtggtcgg ccgtcggcac 523261 tggctcgaga cgctgcccgc agcgccgcct cgcgaaccag cgcagccagg tcggccacga 523321 cgaaacccgg tgtgcgggag gcgatttcgt cgaggttgag gtctccggta ggaaccggat 523381 tcagcagcgc ctccagcagc gatttgcggg tggccgcgtc gggcagcggc aggccaagct 523441 cccggtcgca caactcgggg gaacgcagcc gggcatcgag ttgatcgggc cgtgctgagg 523501 tggcgatcaa taccacaccg gcggtggcca ccgcggtacg cagctcggac aggatcagcg 523561 aggctaccgg ctcggcggcg gctggcagca gggcgtcggc atcggtgatc agcaacacac 523621 cgccctcatg gcgaaccgcc tgcactgccg aggccacggc tttgacccgg tctccggcgg 523681 ccagagctcc aatctccgga ccatccagtg tcaccaacct tcggccgtcg cacaccgcgc 523741 gcaccagcgt cgccttgccc accccggccg gacccgacac cagcacaccc aaattggtgc 523801 cggcgcccaa ggtctgtagt aggtgcggct catcgagggc aagcttgagc cattcggtga 523861 gcttggcagc ctgcggctgg gcgcccttga gctcttcgat ctggatctcc ggactcgaga 523921 tgctcacttg cccggccgtg gacgtaccca ttgcggccgg gaccccagcg ccccaggtga 523981 ccagcgagtt gggctgcacg ctgaccggcc cgtcggggtc gacgccggta acggtcagca 524041 gctccgaggt ccaactgatc ccgaccgcag ctgccaatgc gcggctggca gccgacgtgg 524101 atgtgccggg gcctagatcg cggggcagca gcgagaccgc gtcaccgacg gtcatcacct 524161 tgccgagtag ggcctgccgc agcgtgaccg gcggcaccga ctgggtggcc agcgttgaac 524221 cgctcagcgt caccgatcgc gctccgtaga cggtgaccgg gctgacgatc acctcggtgc 524281 cttcgcgaag gcccgcattg gacagtgtga cgtcatcgag cagcaccgtc ccgaccgcgg 524341 tgtctgccgc ggccaggccg gcgaccgcgg cggttgtccg agagccggtc agcgacaccg 524401 cgtcccactc gcggatgcca agggcagcaa tggcattggg gtgcaaccga acgacgccgc 524461 ggcgtgagtc gacggccgag gtgttcagcc gggcggtaag ggtgagttgg cgggccgggt 524521 ccgggtgggt cacagccgtc gacccggctt gcgcaggccc agccgcgcca tcgacggccg 524581 gtagggatgc gcccggcggc tcgcgcgccg caccgcgcgc cgttgcttgg gcttgtcgtc 524641 ccacacctca gggtgttggg caagccagcg ctggctgcgc accgcgaaag gaatatggca 524701 catgtaggcg atgatgatca cccagatcaa caagtagggg gccaggactg cggccgccgc 524761 gcagatagcc agcaccgcca gcagggcggc cgcgtagttg ggtggtaccg acacggcgtg 524821 catctttttc atcgggatcc cgctgaccaa gagtatcgac gttcccgtca cccaaaagct 524881 gaggaaccag cccgaggtcc accatccttc gccgaactgc attttgaggg ctagcaggcc 524941 gatcatggaa accgcgcccg ccggcgcggg cattccgacg aagaattcat gcgcgtaggc 525001 gggctgggtt ccgtcgtcct gcagtgcgtt gtaccgcgcc agccgtaata ccacgcacac 525061 cgcgtagagc agcacgacca cccaaccgac cggccacttc gacaacatcg acacgtaaag 525121 caccagcgcg ggtgtcactc cgaagttcac cgcgtcggcc agtgagtcga tctctgcgcc 525181 catccgcgac tgggcatcca ggatgcgggc cacccggccg tcgagcccgt cgaggatggc 525241 cgctgcggcg atcagtgcca tcgcggcctt cggctggtgc tcgagcgcaa acttgattgc 525301 ggtcagtccc gcgcaaatgg acagcaccgt catcgcgctg ggcagtatct gcaggtttac 525361 ccctcgcctg ccgcggggct ttccgatcat cgacattcgg ccagcacggt ctcgccggcg 525421 accgcgcgct ggccgacgtt gacgatcggc tctgcgcccg ctggcaggta ggtatccagc 525481 cgggagccga accggatcag gccgtaggtg tcaccgatgg ccagcttgtc tccgacgtgt 525541 gcgtcgcaca caatgcggcg cgccaccagc ccggcgatct gcaccgcgac cacctcggcg 525601 ccgttgggca tgcggatccg cacactggtg cgctcgttgt cgtcgctcgc ctccggtagg 525661 tcggccgacc cgaaccggcc cggccggtgt tgcacggcga tcacttcccc gctcaccggg 525721 gcacgttgca cgtgggcgtc caatatcgac aggaagatgc tgactcgcgg taacggcgtg 525781 tcacccatgc tgagttcggc cggtggggcc gctgagtcga tcgcgcagat cacgccgtcg 525841 gcgggcgcga caatggcagc cggcctggtg ggcggtaccc gctgcgggtg ccggaagaag 525901 cccgcgcagg cagcggccgc cagcagaccc gtgccgcgca accaccggta gcggtgtccg 525961 acggccgcaa tcgcaaggcc ggcggcaatg aacggccgcc cggccggatg aaccggtgga 526021 acggcggacc gcaccagggc gagcagatgt tgcgggccgt cggggcgggg gcgtctggcc 526081 acggggtcat cttacggagc ttcgtgccgc aggttgggtg cacggcacta ggatcggtcc 526141 ggttaggtca agtcccagac ttgcagctgc gttccggcag ccacctccac gacgtcctcc 526201 gggatgtcca gaagtccgtt ggccgatgcc aaccaacgca aatggtgcga cgccggtggg 526261 ccgtagctga tgaccgtgcc tgcctggtga tcgagtattg cgcgtcggaa ctgacgtttg 526321 ccgcgcggcg atgtcaggct cgcggtgagt accgcgcttc ggtgcggccg gtacggatcc 526381 ggcaggccca tggccatgcg cagcggggga cggatgaaca cctcgaagga caccagcgcg 526441 ctgaccgggt tgccgggaag ggtgacgatc ggcgtacctg ccacccgccc gacgccctgg 526501 ggcattccgg gttgcatcgc caccttgacg aattcgacac cgtggtcgcc tccccggtag 526561 tcagcgctgc cgaacgcgtc tttgaccacc tcgtaggctc cggcactgac accgccgctg 526621 gtgatgatca ggtcggcgtc caccgcgtac cggtcaagga tcgcgccgaa ctgcgcgacg 526681 tcgtcgccgg cggttgcggt ggcgaccaca gcggcgcccg catcgcggac ggcagcggcc 526741 agcatgatcg agttggactc gtagatctga cccggttgta ggggcgtgcc tggcgacgcc 526801 agctccgacc ctgtggagat caccagcacc cgctgacggg ggagcaccgg cagctcggcc 526861 aaacccagcg cggcggccag gccgagcacc gccggggtca cgatctggcc gttgtgcagc 526921 accgtggtac cggcggcgac gtcttcgccc gaccgtcgga tgtgcttgcc tggggtggcc 526981 tgttggcgga tcgccaccga atcgacgccg ccgtcggtgg cttcgaccgg cacgatcgcc 527041 gtcgcaccgg tgggcactgg cgcaccggtc atgatccggt gcgcagtcac aggctgcagc 527101 gtcagcatgt cggcgcgccc ggcgggaatg tcctcggcga ccggcaacat caccggattt 527161 tgcggtgtgg cacctgaggt gtcttcggcg cgcaccgcat agccatccat tgcggagttg 527221 tcgaaaaccg gcagcgacag cggtgcgacc acgtcgccgc ccaggaccag accttgagcc 527281 tgggtcagcg gaaccgtaat cgggcgacag gcgcgcatca tctccgctac gacacgttga 527341 tgctcctgga ctgaccgcac ccggccatta tcggtcgttc agactccgaa gctgacgccg 527401 gtgagttctt cggagacggt ccagaggcgg cgctgcagat ctttgtcgtg ggactgcgcg 527461 ctggattgga ccaccttcgg gtgaccgcgc tgctcgccga acccgtccgg gccgtagtat 527521 tgcccgccct gcgtggtcgg atcggtggcg gcacgcagtg ttggcagggc gcccatctct 527581 gggctttgga aaagcaacgg cccgagcacg gtagcgacgg gccggataag tcgcggcagg 527641 ttgcgagtca gctcggtgtt ggagccgcca gggtgagcgg cgacggcgat ggtggatttg 527701 cccgcttcgc ccagccggcg ttgcagctcg taggtgaaca gcagattagc cagtttggct 527761 tgtccgtagg cggcgacgcg gttgtaacgg cgttcccact gcaagtcgtc gaagtggatg 527821 gcagcgtgaa tccggtggcc ctggctgctg acggtcacca cccgcgaacc gggtaccggc 527881 agcatgtggt cgagtaccag tccggttagt gcgaaatgac cgagatggtt ggtaccgaac 527941 tgcagctcga aaccgtcctt ggtgacctgc ttcggcgtcc acatcacgcc ggcgttattg 528001 attagcacgt cgatgcgcgg ataggccgtg cgtaacgcgt cggcggctgc gcgcaccgag 528061 tccagcgagc acagatcgag ttgctgcagc gtgacgtggg cgcctgggcg ggcggccatg 528121 atgcgggccc gggcggcgtt gcccttctcg agattgcgga cggccaacac tacgtgtgca 528181 ccgcggtcgg caaacacggc ggcggtgtgg tagccgatgc cggtgttggc gccggtgacc 528241 acaacgacgc gcccgctttg atcggggacg tctgcggccg accatttacg ggtcttgttg 528301 tcgttggcgg tcatgggccg aacatactca cccggatcgg agggccgagg acacggtcga 528361 acgaggggca tgacccggtg cggggcttct tgcactcggc ataggcgagt gctaagaata 528421 acgttggcac tcgcgaccgg tgagtgctag gtcgggacgg tgaggccagg cccgtcgtcg 528481 cagcgagtgg cagcgaggac aacttgagcc gtccgtcgcg ggcactgcgc ccggccagcg 528541 taagtagcgg ggttgccgtc acccggtgac ccccgtttca tccccgatcc ggaggaatca 528601 cttcgcaatg gccaagacaa ttgcgtacga cgaagaggcc cgtcgcggcc tcgagcgggg 528661 cttgaacgcc ctcgccgatg cggtaaaggt gacattgggc cccaagggcc gcaacgtcgt 528721 cctggaaaag aagtggggtg cccccacgat caccaacgat ggtgtgtcca tcgccaagga 528781 gatcgagctg gaggatccgt acgagaagat cggcgccgag ctggtcaaag aggtagccaa 528841 gaagaccgat gacgtcgccg gtgacggcac cacgacggcc accgtgctgg cccaggcgtt 528901 ggttcgcgag ggcctgcgca acgtcgcggc cggcgccaac ccgctcggtc tcaaacgcgg 528961 catcgaaaag gccgtggaga aggtcaccga gaccctgctc aagggcgcca aggaggtcga 529021 gaccaaggag cagattgcgg ccaccgcagc gatttcggcg ggtgaccagt ccatcggtga 529081 cctgatcgcc gaggcgatgg acaaggtggg caacgagggc gtcatcaccg tcgaggagtc 529141 caacaccttt gggctgcagc tcgagctcac cgagggtatg cggttcgaca agggctacat 529201 ctcggggtac ttcgtgaccg acccggagcg tcaggaggcg gtcctggagg acccctacat 529261 cctgctggtc agctccaagg tgtccactgt caaggatctg ctgccgctgc tcgagaaggt 529321 catcggagcc ggtaagccgc tgctgatcat cgccgaggac gtcgagggcg aggcgctgtc 529381 caccctggtc gtcaacaaga tccgcggcac cttcaagtcg gtggcggtca aggctcccgg 529441 cttcggcgac cgccgcaagg cgatgctgca ggatatggcc attctcaccg gtggtcaggt 529501 gatcagcgaa gaggtcggcc tgacgctgga gaacgccgac ctgtcgctgc taggcaaggc 529561 ccgcaaggtc gtggtcacca aggacgagac caccatcgtc gagggcgccg gtgacaccga 529621 cgccatcgcc ggacgagtgg cccagatccg ccaggagatc gagaacagcg actccgacta 529681 cgaccgtgag aagctgcagg agcggctggc caagctggcc ggtggtgtcg cggtgatcaa 529741 ggccggtgcc gccaccgagg tcgaactcaa ggagcgcaag caccgcatcg aggatgcggt 529801 tcgcaatgcc aaggccgccg tcgaggaggg catcgtcgcc ggtgggggtg tgacgctgtt 529861 gcaagcggcc ccgaccctgg acgagctgaa gctcgaaggc gacgaggcga ccggcgccaa 529921 catcgtgaag gtggcgctgg aggccccgct gaagcagatc gccttcaact ccgggctgga 529981 gccgggcgtg gtggccgaga aggtgcgcaa cctgccggct ggccacggac tgaacgctca 530041 gaccggtgtc tacgaggatc tgctcgctgc cggcgttgct gacccggtca aggtgacccg 530101 ttcggcgctg cagaatgcgg cgtccatcgc ggggctgttc ctgaccaccg aggccgtcgt 530161 tgccgacaag ccggaaaagg agaaggcttc cgttcccggt ggcggcgaca tgggtggcat 530221 ggatttctga ccccggcgag aagtcgcagc gaggagcccg gtccctttgt ggggccgggc 530281 tcctctggtt gggagctacg gtaccgagaa caccacgcag tcgtgtaggc aacctttggc 530341 cgctgtgggc gagtcggggg ccgcgtctcg gtgcagcagc gcgcggatgg gtacgacacc 530401 gcagcgggcg gtgtcgtcat cggggcctgc gtccgacgcc tgggcacggc cgtcgacgat 530461 cagcgagtag ccgctaggat cggatggcgg ccacaacagg gtgacttcgc tgcggtgggc 530521 caggttttgc cgcgtacgac ccccgatcag gccgacgtcg accactgccc ggggtccatc 530581 ggggccgtcg gggagttcgc gcagcaccgg ctcgactgcc accgtgtgca cgcgatggcc 530641 atcatcgacg gtgatcaggt aagcgaacgg gtagtcgggc aaggcggcgg ccagccgttt 530701 gaggtctacc tttttggcac ccacggattc gaggataggc gcccgatgtg ttactccgaa 530761 ccgaccggct gcccgatccg cgggctggcg taggcggatt cgcggtcggg gctcgggtag 530821 aagttcgact tggggatgcc ggagccgggg gtactcggct cacgcacggc ggtattccgc 530881 aagcccgagt cgttgctgcc cgagttgacg aagctcgggt agctggtgcc agggcttcta 530941 aggcccgggt ttgcgcccga gccagccgcg gcactgccgc taccggggtt cgggttgcct 531001 gagtccaggc cgccaacagg agcactggcc ggggcggcga cgggcgtgtt ggtcaggccc 531061 gagttgagga cgttcgccag gccgtgttgg agaccgcccg ttgatccgag ggcggaggcg 531121 aggatgcccg aactcaaagc cgccgtgctc atgccgccgg tggcgtagcc ggcggagctg 531181 accaaggccg cctccgagcc agccgcgctt cctaaggcgg cgttttgcat ccccgcgttc 531241 cagaagctgg tgttgaggct gcctgcgctg ccgaggcccg cgttgattgt ccccgaggtc 531301 ccgatgccgc tgttcaggga gcccgaattc ccgatgccga tgtttccgct gccggagttg 531361 aataagccga cgttgccggt gcccgagttc ccgaagccga tgttgccgct acccgagttg 531421 aagccgccga aacccatctg gtgatcaccg gtgatcccga acccgatatt cccgctaccg 531481 gtgttgccga agccgatatt cccgtcgccg aggttgccga ggcccaggtt gccgctgccg 531541 gtgttgccgc tgccgatgtt gccggtgccg gtgttgccgc tgccgatgtt gttgttgccg 531601 atgttgttgt tgccgatgtt gccgctgccg gtgttgccga agcccagatt gatctggccg 531661 ttcttgccga tgtcgatgcc gaggttccgc aagacctgct gccagggcgc cagttgtgcg 531721 acggccgcag acgcatcgaa gtggtaacca gccatcgccg ccacgtccaa tgcccacatt 531781 tgctcgtatg ccgcctcgac gtccatgagc gccggagcat tctgcccaaa ccagttcgta 531841 gctgccagca gctgcatcag gccacgattg gccgctacca ctgccggctg cacggtggcc 531901 gccagcgccg cctcgaacgc ggtcgctatt gccatggcct gtgcggccgc ttgttccgcc 531961 tgcgctgccg ccgtgctgag ccaggctagg tactgggttg cgacggccat catcgccgcc 532021 gcggacggac ccagccaggc gccactagtc agttcggatg tgacggagcc aagcgacgct 532081 attgacgcga gcaatttttc ggccagctcg ccccaggcgg tggccgcagc aattagcggt 532141 cccgacccgg gaccggcaaa catcagtgcc gaattgatct ctggcggcaa ccacgcaaaa 532201 tgcgggcttg tcactgatcc aacttaactg tcagcgaccg ttgccgtggc ggtatcggca 532261 cttcaatacc actcatcttt ggggtcatct ttggagcgcc cctaggaacc gccagcttac 532321 ctagtcccgg gtaggggccg actggcggcc gggatgcagc tgagggtctg ccacctgccc 532381 cgtaatgtcg ctggtatggc aagcaccgac gccgcggccc aagagttgct ccgcgacgcg 532441 ttcacccggt tgatcgaaca tgtcgacgaa ctcaccgacg gcctcaccga ccaactcgcc 532501 tgctaccgcc cgacccccag cgccaacagc attgcgtggc tgctctggca cagcgcccgg 532561 gtgcaggata tacaggtcgc ccatgtggcc ggcgtggaag aggtgtggac ccgcgacggt 532621 tgggtggacc gctttgggtt agatctgccg cggcacgaca ccggatatgg acaccgtccc 532681 gaggatgtgg cgaaggtacg ggcacccgcc gacctgctgt cggggtacta ccacgcggtg 532741 cataaactga ccctggaata catcgctggc atgaccgcag atgagttgtc ccgtgtggtg 532801 gataccagtt ggaatccgcc ggttaccgtc agcgcacggt tggtgagcat cgtcgacgac 532861 tgcgctcagc acctcgggca ggccgcctac ctgcggggga tagcccgata acggcgacat 532921 ccgccggatc gctgaggcga tggtcagcta cgccgaagat cgcctgcacc gatggttacc 532981 tgacgctagc cggcagcgcc gccctagtgg tacccggcgt gttcgtcgcg atgctgggca 533041 ccattgtcgc gccgagactg cggtgagggg ccggggtgtg cgtcctcggc tcacccgagc 533101 ggcagctcgg ccaagatggt accggtgggc tgtggtgatc cggtgccggg ttcgacggtg 533161 aatgccagtg cggtcgaggc tccgagatcg gtcagcgtcg ccgtcgtcga gggcgtcacc 533221 gccgcggtgc ccatcgtccc cgccgacctc ggccctttgg cccctcccag cagccacatc 533281 tgatacacgg ttccccggga tggtggcgcc acattgttca tcaccagcag acctgtgttg 533341 cggtcgcggg agaacaccac cgtggccgtc ccggcgccca gtgggcgaga gaccgtccgt 533401 acgtccggcg ccgtcagaac ttgctcggcc acggtggggg gtggcgatgg ccgggtcagc 533461 acccccaggc cgaacgcccc cagccccaca gcgatcgccg ctgcggacgc aaaggctgcc 533521 gtacgccagc gtgattggcg cctaacctcg ggcttggtcg catccaggat ggccgtccgc 533581 agatgtgctg gcggctcggc ggtggtggcc gccgagacga cggccatcgt ctcgcggacg 533641 gctcgaactt cgtcgttgaa agccgcggct accggcgagg gcgcggcggc cacccgtcgg 533701 tcgatgtcgg ctcgttcatc gtcggacaca gcgttcaggg catacggggt agccagctcg 533761 agcagctcaa aatcggtatg ttcagtcatg agcgccgctc tcccaacgca tcgcttcgct 533821 cggccggcgc agtcatgaca cgtccaggca gttgcgcagg ctgcgcaggg cgtcgcgcat 533881 gcgggatttg atggtcgaca gattggccgc taaccgccgc gaaacttcga catacgtcag 533941 cccgccgtag taggccagtt cgatgcactg ccgctgcgtg tcggtcaacg ccttgaggca 534001 ctcggtcacc cggcgccgct catcaccggc gatcgccagg tcggcgacga cgtcactcgc 534061 gggatcgacg ttggccgcac catagcgcac ttcccgctgg ttgccggctt gctcgcaacg 534121 gactcggtcg acagcgcgcc ggtgggccat ggtcaaaagc caggccaacg cggaaccttt 534181 ggcggagtca aactccgacg cgttccgcca cacctcaaga tagatctcct gggtggtttc 534241 ttcgctgtag ccggtatcac gcagcacccg catcaccagt ccatacaccc gcgacttggt 534301 gtggtcgtag aattcggcga atgcggcctg gtcgtgacca gcgacccggc gcaacagggc 534361 gtccaggtcg ctgctcagcc gtggcggtcc ggtcatcgat gggtagccta tcgccagccg 534421 gcgccgtgat ggtcaagccg gtcatcaccg acgcgccgat cgcggtggcc ggggcacgaa 534481 ataggctgtt cgcctttgat attcggcgaa accggggcga cccttcaggt atctctcagt 534541 cagccgggct ccgctgacgt ccaccagcag gtaggtcatc agcagcggcg aacccaccgt 534601 ggccagcggc gcccagtcgt tgatcgtgat caaccacaac ccccaccaga cacaggcatc 534661 gccgaagtag ttggggtgac gcgtccaggc ccacaggccg cggtccatga tgaccccgcg 534721 attggccggg tcggatttga atacccacag ttgccaatct cccaccgctt cgaaggtgat 534781 accgaccagc cacacggcta agcccacgcc cccaacagcc agtaacggct tcggcgtcgg 534841 cccggtgact gcggaaagct gcagcgggaa tgagacgaac aacgtcagga ggccctgtaa 534901 tccgaagacc ttgcgcaatg cctgcacagg cgtggcaccg cgcagcaggt cggcgtagcg 534961 gggatcctcc ccctgaccgg ctgtcttgcg gtacatgtgc cagctcagcc gcagacccca 535021 ggtcgacacc aacgctagta gcagccatcg gcgaaccggg tcgccgtggc cgagcgtcgc 535081 ggcggcgacg gcgacggcga cgaaacccaa gccccatacc acgtcgacga cgttgtaccg 535141 gccgatgcgg cggccgatcg caaacgccac cgaatgcacc acggccacag ccaaagccga 535201 cacgctggtt accacgacga tgttcacggg gggccctcgc ggatcaacgt ccactggtag 535261 acgtccagat agcccgaccg gaagcccgcc tccgagtacg ccaggtacag ctcccacatc 535321 cgtgcaaaca cctcgtcgaa acctaaatgc gccagcccat ctcgccgctg cataaatcgt 535381 tcccgccaga gccgcagcgt ctcggcgtaa tgcggtcgca gcgaggccgc gtcgacgatg 535441 cgcagcccgg tgtgttgccc ggtgatgtcg atgatggcct gcgtggacgg tagcagtccg 535501 ccagggaaga tgtacttctg gatccaggtc tgggtgtggc gggtggccag cattcggtgg 535561 tgcggcatgg tgatcgcttg aatcgctacc gggccacccg ggcgcaccaa ctgttctagc 535621 gcggcgaagt accgtggcca cgaacggtat cccaccgcct cgatcatctc gactgagact 535681 actgagtcat actgcccgtc gacgtcgcgg tagtcgcaca agtcgatctc tacccggtgg 535741 ccaaagccgg ccgcggcgac ccgctgccga gccagccgtt gctgctccac cgatagggtc 535801 accgagcgga tgtgggcccc ccgtgcggcc gcgcgaatgc acagctcgcc ccatccggtg 535861 ccgatctcga gaacgtggct gccctgctgg accccggcca cgtcgagcag ccggtcgatc 535921 ttgcggcgtt gggctgcggc caactcggtc caggcgggag ttggctgggc cagcaggtcg 535981 gtgaacattg cgcacgaata cgtcatggtc tcgtcgagaa acgcggcgaa caggtcgttc 536041 gacaggtcat agtgcacggc tatattgcgc cgggcctgat ctcggctgtg gtctggccaa 536101 ctaggtcgaa aggtcggcgt gatcggccgc agccagtgca gcgagcgcgg taccagctcg 536161 tccaccgacc ctgccagcac ggtcaacacc cgcgtgagct ccttcgacga ccattcgccg 536221 gccatgtagg actcgccgaa gccgatcaag ccgtggcgcc cgatccggcg tgcaagtgcg 536281 tccggccgat ggatgaacag gctgggtgcg cgcggatcgg cggcacctgt tgccgttccg 536341 tcggagtaga ccaatcgcag cggcaagtga gtggccgtgc gccgaagcag ccggttggcg 536401 attgccgccg atgccgcggc taggggaccg cgcggcacct tggcaaccgc tggccagcga 536461 tccgaatcga ttgctgccga cggtgtctgg ctggtttcga cggtcatcgc ggcaccaccg 536521 gaactcgacg tagccacagt ctgatcccct gtatcctgat gcgcgcggcc accaccatcg 536581 gcgccagcgg tgaaatgatt tgcatcatcg cgatctgtct tgtcgttgcc ggtcgccgct 536641 gcccacgcag ggtggctgtg aattccgggc acacctgccg gcggtcacgg tgcagcgtca 536701 ccgtgacgtc gagttcgcgg tcgggccgtg gtgcccgtat caggtagtag ccggctagct 536761 gatgaaacgg cgaaacgtag aagttcttgg ccgtcaccac gggcaggtcg gccggcggta 536821 gcaggtaagc atggcgtccg ccgtaggtgt tgtgcacctc ggcaatgaca tggcgcagtt 536881 ggccgtcgcg gtcgtggcac cagaagatgc tcaacgggtt gaagacatag ccgagaacgc 536941 gtgcttgcag cagcgcggtg atacggccgt cggggacggc aaggccgcga gcggcaaaga 537001 aggcgtccag ccggtcacgc agcgagctat gcggcggaca cgagaacggg tcagcgaagt 537061 ggtcgtcggc gtggaaccgt gcgaacggtc gcagccacca gggcagctgg gggaggttgt 537121 cgacatccac gtaccagctg tagctgcggt atgcgaacga gtggtgcacc gggacttgtc 537181 tgcagtggct gatcgtggtg cggtagatcg ccggcgtcag ggtttgagtc agcacgcgac 537241 catcgcctcc tgtgggatcg ctgccggcca gtcggcgcca aggcgccggg ccgcccgcag 537301 acccgaggcg gcgccgtcct cgtggaatcc ccagccgtgg taggcgccgg cgaataccac 537361 ccgattgtca cccagcgtcg gcaataagcg ttgggctgca accgattccg gtgtgtacag 537421 cggatggctg taggtcatct cggcgatcac cgagctggga tcaacccggt cgtggccgcc 537481 gagggtgacc agataccggc ggccaccgtc gaggcgcatt agcctgctga tgtcgtagct 537541 gaccacgacc tggtgctgcc cgggtgtcac caggtagttc caggatgcgc gggcgcgatg 537601 gtggcggggc aggaccgact cgtcggtgtg cagctgggcg ctgttggtgg agtatgcgat 537661 cgcgcccagg accgcgcgct cggccggtgt cggctcgtcg agcaacagca gcgcctggtc 537721 gggatggacc gcgacgacgg ccgcatcgaa acgccgcgac ggcccatcac ccgcgcccac 537781 caataccccg tccggcagcc ggcgcagcga gtgcactggc gtgcgggtcg acacctcgtc 537841 cagctgagct gcgatcgcct gcacgtagtt ggcggaacct ccggtgacgg tacgccaggt 537901 tggcgacccg aacaccgaca gcatgccgtg atggtcgagg aagacgaaca gataccgggc 537961 cggatagcgc aaggcgtcgg ccccgccgca ggaccacacg gcggcgacca agggtgtgat 538021 gaagtaatcg acgaaatact gcgagaagtg gtgccggctc aggaaggctt ccagcgtctc 538081 cggtttgtct tccgcgttgt cggtctcctc acgcagcagg cgagccgcgg cgcggtggaa 538141 gcggagaatc tcggcaagca tgcacagata ccgtggccgc agcgattgcc ggcaagcgaa 538201 cagcccgcgc gctcccagtg cgccggcata ttcgagtccg atgtcgtcgg cgcgcaccga 538261 catcgacatt tccgactcct gggtggccac acccagttcg gcgaacaatc ggcacaacgt 538321 tggataggtt cggtcgttgt gcaccaggaa cgccgagtcg acgccgacga cgtcggtgcc 538381 ccgggggccg ccaccgttgt ccagatagtg ggtgtgggca tgaccgccca gccggccgtc 538441 cgcctcgtac agggtgactc ggtcccgtcc agacaggatg taggcggcgg tgaggccggc 538501 gaccccactt ccgacaacag ccaccgatcg tcggagtgat tgctgcacat cctgtattcg 538561 gagcggccgg ctagacggac gggcggttca gccgaggcgg tcgctgctca tcgccaaggg 538621 ccggcccgcg ggctgggttt cgctgggtac ggtcggggtc cgggcgggcc gggaacgcac 538681 ccgcagcggc caccagaacc agcggcccag tagtgcggcg atggatggcg tcatgaacga 538741 tcgcacgatc aacgtgtcga acagcaaacc cagaccgatg gtggtgccca cctgtccgat 538801 aacccgcaga tcgctgacgg ccatggacgc catggtgacg gcgaatacca gccctgcgtt 538861 ggtcacgacc ttgccggtgc cgcccatcga ccggatgatg ccggtcttca gccccgctcc 538921 tatttcctgt ttgaaccggg agaccaagag cagattgtag tcagatccca ccgccaacag 538981 aacgatgacc gacatcgcaa gcacgagcca atgcagatgg attgcgagaa tatgctgcca 539041 gagcagcacc gatagtccga aagaggcacc cagtgaaagt gcgactgtgc ccacaatgac 539101 ggcggcggca ataaaggccc gtgtgatgat cagcatgatg ataaaaatga gacagaggga 539161 cgaaattgcc gcgataagaa ggtcccattg ggcgccctcg gagatgtcgt ggaagacggc 539221 cgccgtgccg gccaggtaga tcttggcgtc ttctagtgga gttcccttga gcgattcctc 539281 ggccgcggta cgaatcgcgt cgatactttt gatgccctcg ggtgattgcg gatcccccct 539341 gtgcaggatg ataaaccggg ccgcgtgtcc gtccgaagac aggaacgact tcatggcgcg 539401 ctggaagtct ttgttcttga aaacctcggg tggaaggtag aacgagtcgt cgttcttggc 539461 ggcgtcaaaa gccttaccca tggctgtggc attgtcgctc atttcgagca tctggtcgaa 539521 gattccggtc atggtgctgt gcatggtaag aatcatggtc cgcatgtttt ccatggcctc 539581 aatctgcggc gggatctgcg cgaccatttg tggcatgagg cgatccatct cgcgcaagtc 539641 gcccaagagg acgcctattt gctcgctgag cttgtcgatt ccgtccagtg catcgaatat 539701 cgatctgaac gaccaacaga tcggaattcc gtagcagtgc ttttcccagt agaaatagct 539761 tcgaattggt ctccagaaat catcaaaatc cgcgacgtgg tcgcgtaatt cttcggtgat 539821 ctccttcatc tcttcggtgt cgccgaccat gcggtgggta gtactggcca tctccgccat 539881 caagctatgc atccgcgtca acaccgcaat cgtcgtggcc atctcgtcgg cctgcttcag 539941 catgtcgttc gcccggtcgc gctggtactt tatggtctgc agctgaccgg cattttgcat 540001 gctgatctgg aacgggatcg acgtgtggtc catcgtcgtt ccttcgggcc gggtaattgc 540061 ttgcacacgg gaaatgcccg ggacccggaa gatgccttta gccagcttgt ccaggaccag 540121 aaaatctgcc ggattccgca tatcgtgatc ggattcaatc attaggatct cgggcttcat 540181 cctggcctga gagaaatgac gatccgcggc cgcatatcct tggttggcgg gtatgaagtc 540241 cggtaggtag tcacggtcgt tgtagctggt tttgtatcca ggcagggcga gcagaccgac 540301 tagggcgatc gcgcaggtgg cgacgagaac gggcagcggc cagcgaacca ccacggtacc 540361 cacccgccgc cagccacgga ctttgaggag ccgcttaggg tcgaacaggc cgaaccggct 540421 gccgacgtgt aggacggccg gacccagcgt caacgcgacc gccactgcga ctagcatccc 540481 caccgcgcag gggatgccca gggtttgaaa gtagggcatg cgggcaaagc tcaggcaaaa 540541 ggtagctccg gcgatggtca atccagagcc cagaatcacg tgggcggtcc cgcggtacat 540601 ggtgtagtag gcggcctctt tgtcctcgcc ggcttggcgg gcttcctggt agcgcccgat 540661 gatgaatatc ccgtagtccg taccggccgc gattgccagc gaagtcagca agctcaccgc 540721 aaaggtggta agtccgatag ccccgctatg ccccagaacc gctacgactc cgcgcgcagc 540781 cgtcaattcg acccccaccg tgatcagcag gagaaccacg gtgattatcg accggtagac 540841 gagcaacaac ataataaaga tcacggcgac cgtaaccatg gtgatcctgg ccatggatct 540901 atcgccactg tggtgcatat ccgcggcgag tgcggatggt ccggtcacat aggcctttat 540961 gcccggcggc gcgggcgtgc tttcgacgat gctgcgtact gcctcgacgg attcgttggc 541021 cagcggcgtg ccttggttgc cggcaagtga cagttgaaca taggcggcct tgccgtcgtt 541081 actttgcacg cccgcggcgg tgagtgggtc cccccataaa tcttggacac tttgcacgtg 541141 cttcttatcg gccctcaatt gagcaaccag gccgtcgtaa tacttatggg cagcgtcgcc 541201 aaggggttgg ttaccctcta ttatgaccat cgcgaaactg tcggaatcgc cttccttgaa 541261 caccatgccg atacgtccca tcgcctcaaa cgacggtgca tccttgggac tcagcgacac 541321 cgatcgctct tggccgacag cttccagtga cgggacaaat acggtgacaa cgacgcaaac 541381 tgccagccag ccaaggatga tcggtaccgc aaaggcgtgg atcatcctgg cgatgaatgg 541441 cttttcgggg cgagcgttgg tattggagtc gttcgcgaat ttagtactca cgcggacttc 541501 accaagcagt aagtataggc gttgacttcg ttggaaaccc tctcggccct gaccttgccg 541561 tctaccgtga ttcggcagcc aatgctgtcg ctattacctt gtgccacgat atttcccatc 541621 accgccgcgt cgtttgtcgt gatatgcaat gaccacggta gcaccgctcc atcgacccgt 541681 tgcggctcgg aattgacgtc gaaataacta atgtcggcga ctgttccggg gggtccgaag 541741 atctcgtaag tcaggtgttt agggttgaat ggtttgctgt tttccaggtt ggtgtcggag 541801 tacgacgggc ggttttcgga gccgaagaag ccgcggatcc ggtgcacggt gaagcccccg 541861 acgatgacca ccaccaggat gaccagtgga atccaagtcc gcattagcac cttgaaaatc 541921 tcagatcccc ttcaccggtt ggcagtggta cggcggacga tacccaactt tcaaaatccg 541981 ttcgagctgg tcgctacttg aacgcaacta agcctagcct aagtaaaaca tggttttagg 542041 cccgagctct cgactcctta cctcgttcgc tggagtgtaa cgcatatcac gtgcgtaacg 542101 gcacgctacg ttatcggcag ccctcttaca aatcacacgg tgtgcgttat cctctggcgg 542161 tggcgcaact cggcttccag cgcgcccgca ccgaggaaaa caagcgccaa cgtgcggcgg 542221 cgctggtgga agccgcgcgg tcgctggcgc tggagacggg cgtggcatcg gtgacgttaa 542281 cggctgtcgc aggtcgtgcc gggattcact actctgcggt gcgccgctac ttcacctcgc 542341 acaaagaagt gctgctgcac ctcgccgccg agggttgggc gcggtggtcg ggcacggtat 542401 gcgagcagct gggcgagccg gggccgatgt cggcaccgcg ggtggccgag gcactggcca 542461 acggtctggc cgccgatccg ctgttttgtg atctgcttgc caatctgcat ctgcatctcg 542521 agcaggaggt ggatgtcgac cgggtcatcg aggtcaagcg gaccagcatc gcagccgtga 542581 tagcgctcgt cgacgcgatc gaaagcgcat tgccggcact cgggcgttct ggggcattcg 542641 acatcctgct ggccgcttac tcgctggcgg ccaccctgtg gcagatcgcc aatccgccgg 542701 agcggctcac cgacgcctat gccgaggagc cagagttgct cccaccggag tggaacctcg 542761 actttgctgc cgcgcttact cgcctgctca ccgctacgct tctcggcctg ctcgccggat 542821 ccccatgcga atgccggtcg ccaacgcgct gaagcgggtg cgggacgaag ggggcgccgg 542881 acttgggccc gcttggcggc ggtaggtgac caaactcacg cttcttgggc gtgcgccgca 542941 gccgaaccac gactattgct agttgcaaac gatagtcata gtcaattgtt gccagacgca 543001 cagctggtgt tggcgggagt cgccgataga ggagtgttcg acatgacgtt gcacgtcggt 543061 gccgacggcc tagagaccgc aactacggcg cgcgccgtgg cggtcgctag gtccggaatg 543121 gattgtgtgg ccggtgatgc gtcaggggcg acttcgtgcc tacgcggtga gctatgacga 543181 gcgcactgat atggatggcc tctccgccgg aggtgcattc ggccttgttg agtagtgggc 543241 cggggccggg gccggtactg gccgccgcca cagggtggtc gtcactgggc cgtgaatacg 543301 ccgcggttgc tgaggaactc ggggcattgc tggctgcggt gcaagccggg gtgtggcagg 543361 ggcccagcgc cgaatcattt gctgccgcgt gcctgccgta tctgtcttgg ttgacgcagg 543421 ccagcgccga ctgcgccgcg gcggctgccc ggctggaggc ggtgaccgcc gcctacgccg 543481 cggctttggt ggccatgccc accctggccg agttggcggc taaccacgcg acccacgggg 543541 ccatggtggc gaccaatttc ttcgggatca acaccatacc gatcgcggtc aacgaggccg 543601 actacgtgcg gatgtggctt caggcggcca ccacgatggc cacctatcaa gcggtcgcgg 543661 actcggcggt gcgctcgatc ccggacagcg tgcctccgcc gcgaattctg aaatccaatg 543721 cccaatccca acactcgagc tcgaataatt ccgggggcgc ggacccggtg gacgacttca 543781 ttgcagagat cttgaagatc atcaccggcg gtcgcgtgat ctgggacccc gaagccggca 543841 ctgtcaacgg cctcccctat gacgcttata ccaaccccgg cacactcatg tggtggattg 543901 ccagaagtct ggaacttctt caagactttc aagagttcgc caagctgctg ttcaccaatc 543961 cggtgaaggc ttttcagttc cttgtcgacc tcatcctgtt cgactggcct acacacatgc 544021 tgcagctggc tacctggctg gccgagaacc cgcagttgct ggtggctgcg ctcaccccag 544081 ccatctccgg actgggagcg gtatcggggt tggccgggtt gaccggccta gtccctcagc 544141 cccccgtcgt gcccgcgccg gcacccgatg cggtcgtgcc caccgtgttg ccactcgccg 544201 ggacggccac gccgactacc gcgccggcca gcgccccggc cgccggagcg gcgcccgggc 544261 ccccggccgg taccgccact gccacatcgg cgtcggtgcc aacgagcgcc ggcggctttc 544321 ccccttacct cgtgggcagc ggtccaggca tcgacttcga cgcggggacg cccgccggtt 544381 ccaggagagc gcagcccgcc gcggataacg tcacggccgt ggcggcagcg caggtgtcgg 544441 cccgtcatca ggcacgtcgg cgccgacgag cggcggcgaa ggaacgtggc aacgccgacg 544501 agttcgtcga tatggactcc ggcccggcga ttccgccgtc gggcgagcgg gacgcttggg 544561 cgtccaattc gggcgtgggc gggctggggt ttgccggcac cgcaagcaac gagacggtgg 544621 cagcgccggc cggattgacc acgctggccg acgatgagtt ccagtgtggc ccacggatgc 544681 cgatgctgcc gggcgcttgg gacttgggaa cttgggaccg cggggactga ttaccctaca 544741 acgcagcgac gtcgcgcatg atgtcggtgg gttcgcgcac cggcgcccca caggtcaggc 544801 agaacgcgcc cggggaacgg gtgagccgac cgacttgaag caggactttg gcctcgacgt 544861 gccacaagca ggcaatgcac agaatttcga cggtgttccc gaatgggtcc aggtcggggt 544921 cgttacattc gtctaccgca tgcagatgca ccacgtaact cgcccggttg gtgcacccgg 544981 ctccggactg gcaggtgatt ccaccccagt ccagggccgc cagcgtgtgt gggatctcgt 545041 tgccgggcgc ttgactcatg cgccgcgctc cagtgtccag gccatgcggc ccacgatgtt 545101 tacctctgcc ccgcaacggc atggtatccc ggcgcgtggc cggtggtggc tgggctacca 545161 agagcgaagt cgggcatggc cttagtccta gtggtacgcg ataggtcgtc gaattccgtg 545221 ggtgatggat atgactattt cgtagctggt cgccagaatc aatccgccga acggcggctg 545281 atgggcccaa cgggctgtcc cccgaatggt ggacaacatt tccgggttcg ttgcaaacga 545341 ccgcgctttg acgccggtta gctttaggcc ggacttaggc ccagttccac accgacatgt 545401 cgccggctgg gtatccattg cacacctcgg tccctttagc gacgacgccc ttgttgttga 545461 agaagatttt catgtgattg acccaggcaa acgtcagcgg atcgccattg taaaagtgtt 545521 cggagtagtc tcggcgctcc gccggtgaca gcgagaagaa ccagtgcgcc ttgttgatcg 545581 tcgcttgctg aaggtttgca tggttgttga agtcgatcat gtaccgctgg tagtacaccg 545641 gactggtatc ccgcaccgcc gccagatatt gttcggcgtc gcaggtggtt gcgatcatcc 545701 ggcgaggtat tggaaagtct tccgtggagt cggctgccgc gctttgtgga aatgtcgcag 545761 cggcgatgcc gagaaccaga aatgccgcgc cggcacgcag gatggaactc agccgagaca 545821 tagtggttac cgtagcactt ttggggcgcc tcgaggcggg cagacgacaa ggttcatagt 545881 ctgtctcact acatgctccc atcaggagtg atgacgtgcg tggggtcggg tcgcagttcc 545941 ggtggggctt ggctgtagtc gccgaacggg ccgtcgcggc gctcgaccgc ggctcgcaca 546001 ccctgggttt gggcggtccg gatgaactcg agcgcgtcgg gggtgttgcg catcagcccg 546061 tcgagaatgc cgcccagcag ctgggtggag gccaggccca tgttctcgta ggcctggttg 546121 acgatcagtt tctgggcttg caactgtgac aacgggattc gtgccagctc ggtggcgatc 546181 tcggcgacgc gagcctcgag ccgctcgaac ggcaccgcct cgttgatcag ctcggcttcg 546241 gcggcctgca caccggtcag cggccggccc gtcagcgagt gccatttgac cttggcaagg 546301 ctgagtcgat acagccacat cccggtcaaa taggctcccc acatgcggct atacggagtc 546361 ccgatcacgg cgtcctcgct ggcgatcaca atgtcggcac acagcgcgta gtcgctggcc 546421 ccgccgacgc accaaccatg cacttgcgcg atcaccggtt tggacgcccg ccagatggcc 546481 atgaatttct gcgtcggtcc ggtctcccgc gcggtgacca tggcgaaatc cttgcccgga 546541 tcccatcggc cgtcggtcat catggcatcg ccccaatgct ggaagccgcc gccgaagtcg 546601 taaccgccgg agaaggcgcg gccggcaccg cgcagcacga tgaccttgat gtcctggtcg 546661 cgctcggcca acccgatagc ggcctcgatc tcgtcgggca tgggcgggac gatggtgttg 546721 agctgttccg ggcggttgag cgtgatggtg gccaccggcc cggccgtcgt gtacagcagc 546781 gtctggaaat cgggtgtcgg catagcagca gcgaagtcac ttcggcccta agggtcaagt 546841 gtctcagcgg ggatcgtgat aacgccgctg gttcgaagct tcggccaacc cgggcgcagg 546901 gtttcgctag ctggcatttg catgcctcgg gcatcggtgt ccggttgcgc tctttgctcc 546961 gacgttagcc gcagggccct gcggctaggc gcggccggtg ccgttggccg cggcggcaat 547021 cgatgttgca gcagttacaa cgccaaatgg agtctgagcg catcgtcgag ttcgatcagc 547081 tcggcagggg agacgttgcg cagcgacgga tccaacctgc tgggcctgcg ccttcgaatc 547141 gacggccagg ccaccgctcg ctgccggcaa caacacctgg aatggggacc ttttcggtgt 547201 tgctggtaac cgggacaacc ggcaccacgc ctcggtcgag acgtatcgcg gcagcgttgg 547261 ccctgtcgtt gctgacaatt accgctggcc gccgcatatt tgccgcgctg ccgcgggccg 547321 gatccaggtc gacctgccag atctcaccgc gcagcatcta cgccgttcgc tgcaaaccgc 547381 cgactgcgac ggcaggccca ctctcttggc atgcgtccaa tgctgcgacg tcctcggtag 547441 acaagctcac gcttggcttc atgccgcagt cctacccatg tagtaacaga tagtaatacg 547501 tagtaatagg tagtaatgca gtatcaatcg gctacaactc gatagccacg ttatttgggc 547561 taagtccacc gttcgtgaat gccggttagc cggccagcat ccgccatagg aacgcgaaac 547621 tcagcgccga tttgaatgcg atctgtgcgt tgtcggctgc gccggcgtgc ccaccctcga 547681 tgttttcgta ataccagacg gggtggcccg cagcctgcag ggccgccgtc attttgcggg 547741 cgtggccggg gtgcacccga tcgtcgcggg tagaggtcgt catgagtact ggcgggtatt 547801 tccggttcgc cgaaatgttt tggtatggcg aatattcaga gatgaacttc cagtcatccg 547861 ggttatccgg atcgccgtat tcggccatcc aggaagcgcc ggccagcagc aggtggtacc 547921 gcttcatgtc cagcagcggc acgtcgcaga ccagcgcgcc gaacttctcc gggtacccgg 547981 tcaacatgat gcccatcagc agcccaccgt tgctgccgcc ccgcgcgccg agctgctcag 548041 cggtggtgat gccgcgggtc accaaatcgg ttgccacggc ggcgaagtct tgggcgacct 548101 tgtcccggcc ctcgcgcatc gcctgcgtgt gccagccagg cccgtactcg ccgccgccgc 548161 ggatgttggc caacgcatag gtgcccccgc gggccagcca cagccggccc aggacgccgt 548221 catacgtcgg cgttctggat gtctcgaatc caccgtagcc gttcaacaat gtggggccgg 548281 gattgtccgc gtcggtgcgt cgcacgacga aatacgggat cgatgtgcca tcgtctgatg 548341 tcgcgaaata ctgtgttaca gccatgtttt ccgcgtcgaa gaaagctggc gcagatttga 548401 tctctgctag tcggccgtca tcggtgccgc gcatcagccg cgacggcgta tcgaatccac 548461 tggagtcgag gaagaactcg tcgccgtggc tgtcggcgga gacgatgacg gtgttggtgg 548521 cggcggggat acctgagagt ggctcacgtc gccagctgcc gggagttgcg atctcgacgc 548581 ggctcgccac gtcggccagg gtgacgatca acagccggtc tcgggtccag gcgtattggt 548641 acagcgcggt gtgctcgtcg ggttcgaaca ccacctgtaa ttccgctgag ccggcaagga 548701 attcgtcgta ttcggcggcc agcagtgagc cggcagtgta cctggtggtg gccacggtcc 548761 agtcggtgcg cagctcgatc aacagccagt cgcggtgaat tgacacgctc gcgtcggtgg 548821 gggcttcgat tcggatcagc tccgaaccac gcaattcgta gacctcttcg ttccagaagt 548881 cgagggcccg tcccagcagg gtgcgctcga atccgggcgt gcgatccgct gacgcgttga 548941 cgcggacgtc ggtgcccgcg ccctcgaaga ttgtctccgc atcggccagc ggtttgcccc 549001 ggcgccatcg cttgatcact cgcggatagc cggaagtggt gagcgagtcg ccgccgaagt 549061 cggtgcccag caagacagtg tccgggtcct cccaggtaat ctgggatttg gccggtggca 549121 gctggaaccc atcctcgacg aattcgcgtg tcagcatgtc gaattcacgc acaatggatg 549181 catccgagcc gcccggggac aggccgatca gcgcgcgcgt gtagtcgggt tcgatgacac 549241 cggcgccgcc ccacacccac ttctggtcgt cggcgcggcc cagttcatca acatcgatca 549301 gcacatccca gcccggcgag tcggtgcggt agctgtccag cgtggtgcgc cgccacaacc 549361 cgcgggggtt ggcggcatcg cgccagaagt tgtagagata gttgccgcgc ctgttcacat 549421 aggggattcg ggcatcggtg tcgagcacct cgagcgcctc gacgcgcatc cgctcgaact 549481 ctgcgtcgca gaacgccgcc gttgtcggct tgttgcgcgc gcgtacccaa tccagcgctt 549541 ccgcaccggt gacgtcctcg agccataggt aggggtcagc gccgtctggg gcaggctcaa 549601 atgtcatgga agccattgtg gccccggcgg tagtgtgagc tgtattacat gattttgacg 549661 aggagccgaa tacgatgact gtcttttccc gtcccggttc cgccggggcg ctgatgtcct 549721 atgaatcccg gtaccaaaac ttcatcgggg gccagtgggt cgcgccggtc catgggcgct 549781 acttcgagaa cccgacgccg gtgaccggcc agccgttctg cgaggtgccg cgctccgacg 549841 cggccgacat cgacaaggcg ctcgacgccg cgcacgcggc ggcgccgggg tggggcaaga 549901 ccgcaccggc cgaacgggcg gcgatcctca acatgattgc cgaccgcatc gacaagaacg 549961 ccgccgcgct ggcggtggcc gaggtctggg acaacgggaa accggtccgg gaagcgctgg 550021 ccgccgatat cccgttggcg gtcgatcact tccggtactt cgccgcggcg attcgcgccc 550081 aggagggcgc gctgagccag atcgacgagg acaccgtggc ctaccacttc cacgagccgc 550141 tcggcgtggt gggccagatc attccgtgga acttccccat cctgatggcg gcctggaagc 550201 tggcgccggc gttggcggcc ggcaacacgg cggtgctcaa acccgccgag cagacacccg 550261 cttcggtgct ctacctgatg tcgctgatcg gtgatctgtt gccgcccggg gtggtcaacg 550321 tggtcaacgg attcggcgcc gaggccggca agccgttggc ctccagcgac cgcatcgcca 550381 aggtcgcgtt caccggggaa accaccacgg ggcggctgat catgcaatac gcctcgcaca 550441 acctgatccc ggtcaccctg gaactcggcg gcaagagccc caacatcttc ttcgccgacg 550501 tgctggccgc ccacgacgac ttctgcgaca aggcgctgga aggcttcacc atgttcgccc 550561 tcaaccaggg cgaggtgtgc acctgcccgt cgcgcagtct gatccaggcc gacatctacg 550621 acgagttcct ggagctggcg gcgatccgga ccaaggcggt ccggcagggc gacccgctgg 550681 acaccgaaac catgctgggt tcccaggcct ccaacgacca gctggaaaag gtgttgtcct 550741 acatcgaaat cggcaagcaa gagggtgcgg tgattatcgc cggaggcgag cgcgccgaac 550801 taggcggcga cctgtccggc ggttattaca tgcagccgac gatcttcacc ggcaccaaca 550861 acatgcggat tttcaaggag gagatcttcg ggccggtggt cgcggtgacg tcgttcaccg 550921 attacgacga cgcgatcggc atcgccaacg acaccctcta cggcttgggt gccggtgtgt 550981 ggagccgcga cggcaacact gcctatcggg ccgggcggga catccaggcc ggccgggtgt 551041 gggtcaactg ctaccacctc taccccgcgc acgcggcgtt cggcggctac aagcagtccg 551101 gcatcggccg ggagggccac cagatgatgc tgcagcacta ccagcacacc aagaacctgc 551161 tggtgtccta ctcggataag gcgctggggt tcttctgatg aacgctcccg cgggggtgct 551221 catcaccgcc gaggccgccg cgctgctggc tgggttacag gaccggcacg gtccggtgat 551281 gttccaccaa tccggcggct gctgcgacgg gtccgcgccg atgtgctacc cgcgggcgga 551341 cttcctggtc ggtgaccgcg acatcttgct gggtgtgttg gacgtcgggg aagacggcgt 551401 gccggtgtgg atttcgggcc cgcagtacca ggcctggaag cacacccagc tgatcatcga 551461 cgtggtgccg ggccgcggtg gcgggttcag tctggaagcg cccgagggcg tgcgctttct 551521 cagcagaggt cgggtgttca gcgacgccga aaaggcgatg cgggaggctg cgccggtgat 551581 caccggcgca gcctacgagt gcggcgaacg accgttagtg cggggtcttg tcgtcgatct 551641 cgacgatcca gatgccacgc cgggagtgtg ccgcgccagt cggcggtagc cgcagtaagg 551701 tcgtagaccg tgatccccct tccgcggtca tggcagctga ccagcgcgat gctggttggt 551761 aatgcgatcg gactgctagc gggggtggcg tgcagcgtgc tggtgcatgc ccggatccgt 551821 ccggacatcg tcatcgcaat ggtagtcggg attcccagcg cgatcgggct gctggtcatc 551881 ctgttctccg gacgtcgatg ggtgacgatg ctgggcgcgt tcatcctggc gttggcgccg 551941 ggttggtttg gtgtgctggt tgcgatccag gtggcgtcca gtggctgaca acgattaccg 552001 gtcggcaccc ggaaccgagc cgtttgtgcc cgatttcgac accggcgcac actcgcagcg 552061 gttcctctcg ttggccggcc agcaagacag ggcggggaaa tcctggccag gctcgacgcc 552121 gaagccgcag gaggaccccg tgggtgtcgc gccttcggcc agcgtcgagg tgctggggtc 552181 cgagccggcc gccacgctag cgcactcggt tacagtaccc ggtcgatata cctacctgaa 552241 gtggtggaag ttcgttctag tggtcctcgg cgtatggatc ggtgctggcg aggtcggcct 552301 gagcttgttc tactggtggt atcacacact cgacaagacg gccgccgtgt tcgtcgtcct 552361 ggtctacgtc gtcgcgtgca ccgtcggtgg cttgatcctg gcgctggtgc cgggcaggcc 552421 actgatcacg gcgttgtccc tcggagtgat gtcggggccg tttgcctcgg tcgccgccgc 552481 ggcgccgctc tacggctact actactgcga gcggatgagt cattgcctgg tcggcgtcat 552541 tccgtactag tcggttgtcg gacttgacct actgggtcag gccgacgagc actcgaccat 552601 tagggtaggg gccgtgaccc actatgacgt cgtcgttctc ggagccggtc ccggcgggta 552661 tgtcgcggcg attcgcgccg cacagctcgg cctgagcact gcaatcgtcg aacccaagta 552721 ctggggcgga gtatgcctca atgtcggctg tatcccatcc aaggcgctgt tgcgcaacgc 552781 cgaactggtc cacatcttca ccaaggacgc caaagcattt ggcatcagcg gcgaggtgac 552841 cttcgactac ggcatcgcct atgaccgcag ccgaaaggta gccgagggca gggtggccgg 552901 tgtgcacttc ctgatgaaga agaacaagat caccgagatc cacgggtacg gcacatttgc 552961 cgacgccaac acgttgttgg ttgatctcaa cgacggcggt acagaatcgg tcacgttcga 553021 caacgccatc atcgcgaccg gcagtagcac ccggctggtt cccggcacct cactgtcggc 553081 caacgtagtc acctacgagg aacagatcct gtcccgagag ctgccgaaat cgatcattat 553141 tgccggagct ggtgccattg gcatggagtt cggctacgtg ctgaagaact acggcgttga 553201 cgtgaccatc gtggaattcc ttccgcgggc gctgcccaac gaggacgccg atgtgtccaa 553261 ggagatcgag aagcagttca aaaagctggg tgtcacgatc ctgaccgcca cgaaggtcga 553321 gtccatcgcc gatggcgggt cgcaggtcac cgtgaccgtc accaaggacg gcgtggcgca 553381 agagcttaag gcggaaaagg tgttgcaggc catcggattt gcgcccaacg tcgaagggta 553441 cgggctggac aaggcaggcg tcgcgctgac cgaccgcaag gctatcggtg tcgacgacta 553501 catgcgtacc aacgtgggcc acatctacgc tatcggcgat gtcaatggat tactgcagct 553561 ggcgcacgtc gccgaggcac aaggcgtggt agccgccgaa accattgccg gtgcagagac 553621 tttgacgctg ggcgaccatc ggatgttgcc gcgcgcgacg ttctgtcagc caaacgttgc 553681 cagcttcggg ctcaccgagc agcaagcccg caacgaaggt tacgacgtgg tggtggccaa 553741 gttcccgttc acggccaacg ccaaggcgca cggcgtgggt gaccccagtg ggttcgtcaa 553801 gctggtggcc gacgccaagc acggcgagct actgggtggg cacctggtcg gccacgacgt 553861 ggccgagctg ctgccggagc tcacgctggc gcagaggtgg gacctgaccg ccagcgagct 553921 ggctcgcaac gtccacaccc acccaacgat gtctgaggcg ctgcaggagt gcttccacgg 553981 cctggttggc cacatgatca atttctgagc ggctcatgac gaggcgcgcg agcactgaca 554041 ccccccagat catcatgggt gccatcggtg gtgtggttac cggctacatc ctctggctgg 554101 cggcgatctc cgtcggcgat ggtctgacga cggtgagtca atggagtcgc gtggtgttat 554161 tgctgtcggt cctggtggcg gtgtgcggcg cggcgggcgg cttgcggctg cgcagccgcg 554221 gcaagctcgc gtggtcggcg tttgctttca gtttgccgat tcctcccgtg gtgctgaccg 554281 tggcggtgct ggccgacatc tacctttgac ggctactgtg ggttgtccgg cgggatggcc 554341 agggcggtga tcgttgcggc gatcgcgtcg tattgggttg cgagtaaaca gaattcgatc 554401 aacaggcgcg gatcgaggtg agttgccagc cgctcccagg tgcccgcggt gatcgtgcga 554461 tccttgatca attcatcggt agcctgtagc agcgcctgtt ggcgggcgct gagcactttt 554521 cgcggtccgt ctccatctgg aacgtcgggc caggcgaata tcgtggcctg ggtgttggcg 554581 tctaggcccc gacggcgcgc cattcggcga tgatgctgaa gttcgtattc gcaagatcgt 554641 aggtgtgcga cccgaaggat caccaactcg gtatcgacgc cgggcagccg cccgtgcagt 554701 agtcggccgg tgtagatggc aaaggtccag aacaagtact ggcggtagcc cagcgtggtg 554761 aacaggtgca tctgcggtgc cccaaccgca cgtgcggcca gcttggccac cagccagttg 554821 accggcccca gctggcggaa cttccccggg gagatacgcg cgacttggcc gttctgaccg 554881 gtcatagttg tttcaccaga tacggggaca ccgtgctgcg gtgttcgtcg agatccagtg 554941 cccgccccaa ggcggggaag gcgcgttgcg gacagttgtc gcgttcgcag acgcggcaac 555001 cggcgccgat aggtgtggcc gcagtattcg ggtcacccga caagtcgagt ccttccgagt 555061 agacgagccg gtgcgcgtgg cgaagttcgc agcccagccc gatcgcgaag gtcttaccgg 555121 gctgaccata ccgggcggcc cggagctcaa cggtgcgggc cacccacagg tagttgcggc 555181 cgtcgggcat ctgggcgatt tgcaccaaga tcttccccgg gttggcaaac gtttcgtaga 555241 cgttccacag cgggcaggtg ccgccgctgg aggagaagtg aaagccggtg gccgactgac 555301 gttttgacat gtttcccgct cggtccaccc ggacgaaggt gaacgggacc ccgcgcatcg 555361 aaggccgttg tagtgtcgac agccggtggg cgatggtctc gtagctcacc gagtagaacg 555421 ccgacagccg ctcgacgtcg tagcggaaat tctcggcgac gtcgtggaac tggcggtagg 555481 gcagcacggt ggccgcggcg aagtaattag ccaggcccag ccgggccaac gtccgcgact 555541 cggcgctggt gaacttgccg tcggtgacca tggcgtcgat gaggtcgccg aactcgagat 555601 aggccaactc ggcggccatc ttgaacacct gctggcccgg ggagaggtga ctgctgatct 555661 ccagcgtgtt ggtcgcgggg tcgtagcggt gcagcacggt gtcaccgagg tcgatgcgct 555721 tgttgatgcg tactccgtgc acctcggtga gccggcgggt caattcgcgg gccaggtcgc 555781 cgtggtgcat ccgcatctgg gccgtgaggt cttcggccgc ggtgtccagc gcatgtagat 555841 agttctggcg ttggtagaag tagtcgcgca cctcttcgtg cggcatggtg atcgaccctc 555901 ggccactgcc gtcggagaac cgctcctcgg tcgcggcggc cagctgcgcg gtggtgatcc 555961 ggtagcgccg atgcaggttg accaccgcgc aggccagccc gggatgagcg ctgaccattt 556021 cggccacttc atgcgggtcg atggcgatgt ctagatcgcg gtccagggtc acctccctga 556081 gttcggcaac cagccgggtg tcgtcctggg aggcaaagaa cgtcgcgtcc accccgaaca 556141 cttcggtgat gcgcagcagc acggccacgg tcagcggccg gacgtcgtgt tcgatctggt 556201 tcagatagct cggcgagatc tccagcatct gggccagcgc ggcctggctg aacccgcgct 556261 cgttacgcag ttggcggacc cgcgagccga cgtaggtctt ggacacccaa ccgagcgtac 556321 cgggtgttgt gaagacgcca ttcgcagagt tagcaagcgt gctgcgattg gtgtttccgc 556381 cacggcgttg gcatgattcg caccgggact caagggtgag cctgaggtac acgcgaggag 556441 gaaatgggga gaacgccgtg agcctcgaca aaaaattgat gcccgtgccc gacggtcacc 556501 ccgacgtgtt cgaccgagaa tggccgctgc gcgtcggcga catcgaccgc gcgggccggc 556561 tgcggctgga cgcggcttgt cggcacatcc aggacatcgg tcaggaccaa ctgcgcgaga 556621 tgggcttcga ggagacccac ccgctgtgga tcgtccgcag gaccatggtg gaccttatcc 556681 ggccgatcga gttcggcgac atgctgcggt gtcggcgctg gtgctcgggc acctccaacc 556741 ggtggtgtga gatgcgagtt cgtgtcgatg gccgcaaggg cggcctgatc gaatccgagg 556801 cgttctggat ccacgtcaac cgggaaaccg agatgccggc ccgcattgcc gacgacttcc 556861 tcgcgggtct gcaccggacc acgtctgttg atcggctgcg ctggaagggc tatctgaagc 556921 cgggcagccg ggatgatgcg tcggagatcc acgagttccc ggtccgggtc accgatatcg 556981 acttgttcga ccacatgaac aacgctgtct attggagtgt gatcgaggac tacctggcgt 557041 cgcatgcaga gctgctgcgg ggccctttgc gggtgaccat cgagcatgag gcgccggttg 557101 cgctcggcga caagctggag atcatctccc acgttcaccc ggctggttcg accgagatat 557161 tcggcccggg gttggtcgac cgcgctgtta caacgctcac atatgtggtt ggcgacgagc 557221 ccaaggcagt cgcctcgctg ttcaatctgt gaccggatcc gcaggacgtc gatccgtggg 557281 tttacctgcg gatttgtcgt tactggcggg tagcttctga aacggttcag tttttgggcg 557341 acttcgcaaa atttgcaaaa agtccgcagg ccgttgccga aattcgcaag tgaaatgggt 557401 ggaccagcgt tgacacgctg tgccatggtc gagttagcac accagtgaag ctgcgccgtt 557461 gacaccgcct ggacgacggt agggcgtcag cgttttcggc aatgaaagac cgttaaggag 557521 ttgtctatgt ctgtcgtcgg caccccgaag agcgcggagc agatccagca ggaatgggac 557581 acgaacccgc gctggaagga cgtcacccgc acctactccg ccgaggacgt cgtcgccctc 557641 cagggcagcg tggtcgagga gcacacgctg gcccgccgcg gtgcggaggt gctgtgggag 557701 cagctgcacg acctcgagtg ggtcaacgcg ctgggcgcgc tgaccggcaa catggccgtc 557761 cagcaggtgc gcgccggcct gaaggccatc tacctgtcgg gctggcaggt cgccggcgat 557821 gccaacctgt ccgggcacac ctaccccgac cagagcctgt atcccgccaa ctcggtgccg 557881 caggtggtcc gccggatcaa caacgcactg cagcgcgccg accagatcgc caagatcgag 557941 ggcgatactt cggtggagaa ctggctggcg ccgattgtcg ccgacggcga ggccggcttt 558001 ggcggcgcgc tcaacgtcta cgagctgcag aaagccctga tcgccgcggg cgttgcgggt 558061 tcgcactggg aggaccagtt ggcctctgag aagaagtgcg gccacctggg cggcaaggtg 558121 ttgatcccga cccagcagca catccgcact ttgacgtctg ctcggctcgc ggccgatgtg 558181 gctgatgttc ccacggtggt gatcgcccgt accgacgccg aggcggccac gctgatcacc 558241 tccgacgtcg acgagcgcga ccagccgttc atcaccggcg agcgcacccg ggaaggcttc 558301 taccgcacca agaacggcat cgagccttgc atcgctcggg cgaaggccta cgccccgttc 558361 gccgacttga tctggatgga gaccggtacc ccggacctcg aggccgcccg gcagttctcc 558421 gaggcggtca aggcggagta cccggaccag atgctggcct acaactgctc gccatcgttc 558481 aactggaaaa agcacctcga cgacgccacc atcgccaagt tccagaagga gctggcagcc 558541 atgggcttca agttccagtt catcacgctg gccggcttcc atgcgctgaa ctactcgatg 558601 ttcgatctgg cctacggcta cgcccagaac cagatgagcg cgtatgtcga actgcaggaa 558661 cgcgagttcg ccgccgaaga acggggctac accgcgacca agcaccagcg cgaggtcggc 558721 gccggctact tcgaccggat tgccaccacc gtggacccga attcgtcgac caccgcgttg 558781 accggttcca ccgaagaggg ccagttccac tagtctgccg agcagacgca aaagcaccct 558841 tttgcggcgc aaaagtggcg cttttgcgtc tgctcgcgca tttgaggagg aacagtgagc 558901 gatgcgatcc agcgggtagg ggttgtcggg gccgggcaga tggggtccgg catcgccgag 558961 gtctcggctc gcgccggcgt cgaagtgacg gtgttcgagc cggccgaggc gttgatcacc 559021 gcgggacgca accgcatcgt gaagtcgctg gagcgggccg tcagcgccgg caaggtaacc 559081 gagcgcgagc gtgaccgcgc cctcggcctg ttgaccttca ccaccgacct caacgaccta 559141 tccgataggc aactggtgat cgaggccgtt gtcgaggacg aggccgtcaa gtccgagatc 559201 ttcgccgagc tcgaccgggt cgtcaccgat ccggacgcgg tgctggcgtc gaatacctcc 559261 agcatcccga tcatgaaggt cgccgcggcc accaagcagc cgcaacgggt tcttggcctg 559321 catttcttca atccggtccc ggtgctgccg ctggtcgagt tggtgcgcac gctggtcacc 559381 gacgaagccg ccgccgcgcg cacggaggag tttgccagta ctgtgctggg caaacaggtc 559441 gtgcgttgct ccgaccgctc cggattcgtg gtcaatgcgc tcctggtgcc gtatttgctg 559501 tcggcgattc ggatggtcga ggccgggttt gccaccgtcg aagatgtcga caaggccgtt 559561 gttgcggggt tatcgcaccc gatgggtccg ctgcggcttt ccgatcttgt cggcctagac 559621 accctcaagc tgatcgcgga caagatgttc gaagaattca aagaaccgca ctacgggccc 559681 cctccgctgt tgctgcgtat ggttgaggcg ggccagttgg gaaagaaatc gggtcgaggt 559741 ttctacacgt actgaagtgt atgaacggcc cccaggcttg acgcaaggcg agatcacaga 559801 ccgagacggt gtggttacga tcgtgtgaca gccgttgcgt acatcgggta gtatttccgc 559861 gatcaacaga tgagaggttc ggccggcatg actgagttaa ggccctttta cgaagagtcg 559921 caatcgattt acgacgtttc cgacgagttt ttctcactgt ttctagaccc cacgatggct 559981 tacacctgcg cgtacttcga gcgtgaggac atgactctcg aagaagcgca aaacgcgaag 560041 ttcgatttgg cgctggacaa gttgcatctt gagcccggga tgacgctgct cgatattggc 560101 tgcggctggg gtggtgggct gcaacgagcg atcgagaact acgatgtgaa cgtcatcggt 560161 atcacgctca gtcgcaatca gttcgagtac agcaaagcga aattggcgaa aattcccacc 560221 gaacgcagcg tccaggtgcg gctgcagggc tgggatgagt tcacggacaa ggtcgaccgt 560281 attgtcagca tcggtgcctt cgaagcattc aaaatggagc gttatgcggc attctttgag 560341 cgttcctacg acatacttcc agatgacggc cggatgctgc tgcacacaat tctgacctat 560401 acgcagaagc agatgcatga gatgggcgtc aaggtgacga tgagcgatgt gcggtttatg 560461 aaattcatcg gcgaagaaat ttttccgggc ggacagttac cggcgcagga agacatcttc 560521 aaatttgcgc aggcggcgga cttttcggtg gagaaggtgc aattgctgca gcagcattac 560581 gctcggacgc taaacatctg ggcggcgaat ctggaggcta acaaggaccg cgccattgct 560641 cttcagtccg aggagattta caacaaatac atgcactatc tgaccggatg tgagcacttc 560701 ttccgcaagg gcatcagcaa cgtgggacag ttcacactga ccaagtagcc catcgccgcc 560761 cgagcacccc aggggttgcg gagctcacgc cgggtgtggc ttgacgcccg ggcaccggcc 560821 ggtgggtagc cagcgcgctt tgtccggtta cttttccagt gtgaactggt cgacgtcggt 560881 gtaaccctgg cggaacagct tcgcgcagcc ggtcaggtac ttcatgtagc ggtcgtagac 560941 agtctgcgac tggatcgcga tggcctgatc tttgttggcc tcgagcgctg tggcccacat 561001 gtccagcgtc ctggcgtagt gcagctgcaa tgactggacc gcggtgaccc ggaagccgac 561061 cttctcggcg tactcgtgca ccgtcgggat ggacggcagc cagccaccgg ggaagatctc 561121 ggccaggatg aatttggtga agtgaaccag ttcgtgggtc aacgtcaggc ccttttccct 561181 gccttctttg aaggtggggc gcacgatggt gtgcagcaac atcttgccgt cggccggcaa 561241 cgtgcggtgg gtcacctcga agaaatggtg gtagcgctgg tggccgaagt gctcgaacgc 561301 gccgatcgag acgatgcggt cgacgggctc gtcaaatttc tcccatccct ccagcaacac 561361 tcgtctggag cggggggtgt ccatttggtc gaacattttc tggacatgac cggcctggtt 561421 ctccgacaac gtcaggccca cgacattgac gtcgtatttc tcgatggcgc gccgcatggt 561481 cgcgccccag ccgcagccga tgtccagcaa cgtcatcccg ggttcgaggt tcagcttgcc 561541 cagggccagg tcgatcttgg cgatctgggc ctcctgcagc gtcatgtcgt cgcgttcgaa 561601 gtaggcacag ctgtaggtct gggtggggtc caagaacagc cggaaaaagt cgtcggagag 561661 gtcgtaatga gcttgcacgt ttccaaaatg cggcgtgagc tgcacggaca taccgattga 561721 gcctttctgt gttccgaggc ccgcatccgc ttgcctcgac gcacccctga tctatccccg 561781 atgcatccct tgcatgctag ctgctgaaag gcggcccagt cgcaatcggc gccatgacca 561841 gctgtcgcag ccgtcagcga aaatcaccag gcgcgccgcc aggcaccgat cgccaggccc 561901 acaaccagca gcgcaccggc ctgacgcacg tgcagccagg ccaacgcggc ataccacagc 561961 ggccacaccg gaaacgccgg tggcggctgc tggggccgcc gtcgcaggaa atagggccac 562021 actttcgcca gccggggcag cgccccggcg accagcaacg cggccagggc atggcagcca 562081 gcatgacgtt tacggcgata agtaggtaga agccgaccat catggccagt gtcacggtgc 562141 gcgcgcacgt ttcgccgagt agcaccggca gcgttcggat acccagcggt tcgtcgtaac 562201 cgatcttgtc gatgtgctta cccatcagca ccgtggtgca caacagcccg taggggagcg 562261 acgccagcac gacctcccaa ccgcccgcgc ccaccgcggc gtagtaggtt ccagcgcacg 562321 ctcgggtgag ccgcagctgg tggttcgtcg aggcgtggta tatgcggcac ggttggcccc 562381 ggtagcggcc gggtgctggg catagcgggc gcgcgcgtag gtggcgctgt cagtaccgac 562441 atcggtgtcg tagagatcgt tcataaggtt gttggcgatg tgcggcgcat gtgattccca 562501 ccacaggacg agccagcgcc aatccaagcc aggctcgccg atcgccaaca gccccgcgac 562561 caggccggag accagggtca tcggcagcac tgcggcccgg gtgacgacga gccaccgggt 562621 gaccgtgtcg gtcggcccgt cagctggcgg gttggtggtg cgaagtgcgt aggcccacga 562681 tctgagccgg gagcccgcgc ccgcgtcggg catccctaaa gcctagacct gcccccaggc 562741 aggcacgatc ggcgaaggat gcggctgctc gcgaaacttc tccaacgatc cgccggcctc 562801 gacgatgccg cacagtgcgc tccagctcag catcgtcagg tagtcgatca gttcgtcact 562861 gctcatgcgc gggtctgaca tccaggagtg ggtggccagc tgcacgccgc ccacgatcag 562921 atatgcccac ggctcgactc cgccggtgtc catcccggct tcttgcatgc ggcggcgcag 562981 catcaccgcg agcatgcggg caatgattcg ctccgagtcg gcaatcactt tgcttttgct 563041 ggccgagcta ttcgccatca cgaaccgata cggctccggt tgggccgcca cggtctcgac 563101 atagacccgg atgatttcgc gggtcagttc gaaaccatcc atatcggccg acagcgcagc 563161 gatcatgttg gggatcaagg tggtctgcgt gaaccgcatc atcacggcgg tcgtcaggtc 563221 gtttttgtcg acgaagtagc ggtagagcac ggtcttggag accccgatct cggccgctat 563281 ctcgtccatg ctgaggaagc ggccatgccg gcgaatcgcc tcaatcgtgc cgtccaccag 563341 ctcattgcgg cgctccacct tgtgctggtg ccagcgtcgc ttgcgaccat ccgtcttcac 563401 ggtcacggcc gggatacgct ctgccactgt tgccaattcc cattcactag acgctcccga 563461 tactacggcc aattgggggt cctgctggca cattggacgc gcgcgcgggg tgcgcaggac 563521 agtgtcgtca cattaactgg tgccggtgat agcggatgat ggtgtggtgg cacataaagc 563581 cgaggtgtcg ggctcgccgc cgccacggct gaatttgagc acccagccga cggtggcgcg 563641 gcgtgtccgc gcctccttcg cggaatcctt cgccgcagcc gatccggagg cggatgccgc 563701 ccggcggatg gcgctgcgtc ggatgaaagt ggtggcagtg gggtttttgg taggcgccac 563761 cggcgtgttc ctcgcttgtc gctgggcaca ggccgatggc gctgaccacg cgtggctggg 563821 ttatctgggc gctgcggcgg aagccggtat ggtcggcgcc ttggcggact ggttcgcggt 563881 gaccgcgctg ttcaagcatc cgctaggcat tccgatcccg catacggcga tcatcaagcg 563941 caagaaggat cagctgggcg agggcctggg caccttcgtg cgggagaatt tcctgtcgcc 564001 gccggtcgtg gagaccaagc tgcgtgatgc gcagataccg agtcggcttg gcaagtggtt 564061 gtcagaggcc acgcatgccc agcgggtggc ggccgagacc gcaacggtgc tgcgggtgct 564121 ggtggagctg ctgcgtgacg aggacatcca gcaggtgatc gaccggatga ttgtgcgtcg 564181 tatcgccgaa ccgcagtggg gtccgccggc gggccgggtg ctggcgacgt tgctggccga 564241 gaatcggcag gaagccttta tccaattgtt ggccgatcgg gcgttccagt ggtcgctcaa 564301 cgccggggtg gtgatccagc gggtggtgga gcgtgactcg ccgagttggt cgccccgatt 564361 catcgaccac ctggttggcg accgtatcca ccgtgagttg atggaattta ccgacaaggt 564421 gcgccgcaac cccgatcacg agttgcgccg ttcggctacc cgcttcttgt tcgatttcgc 564481 tgacgacctg caacacgatc cggccactgt cgcgcgcgcc gacgcgatca aagaggagct 564541 aatggcgcgc gatgagatcg ccactgcggc cgcggcggcg tggaagacac tgaagcggtt 564601 ggtgctcgag ggtgttgacg acccgtccag tgcgttgcgc acccgcatca ccgatgcggt 564661 catccggatc ggcgaatcgc ttcgtgacga tgccgacctg cgtgacaagg tagacagttg 564721 gacggtgcgg gcggcccaac atctggtctc ggagtacggg gtggagatca ccgcgatcat 564781 caccgagacg atcgagcgct gggacgccga ggaagccagc cggcgaatcg aactgcacgt 564841 cggccgagac ctgcagttca ttcggatcaa cggaacagtg gtcggggcga tggcagggtt 564901 ggcgatctat gcgatcgcgc aactgttgtt ctgacgggtg ctaacaaacg cttgcaatag 564961 caagcacttg gacgtactct ggtggccgtt gcaccgatca ccccgagcta ggagtagcca 565021 atgtcgtcgg aggagaagct ggccgccaag gtgtccacca aggcctccga tgtggcttcc 565081 gacatcggca gcttcatcag gtcgcaacgt gagacggcgc acgtctcgat gcggcagctc 565141 gccgagcggt ccggcgtcag caatccgtac ctgagccagg ttgagcgcgg attgcgtaag 565201 ccgtccgccg acgtgttgag ccagatcgca aaggcgctgc gggtctcggc cgaagtcctt 565261 tatgtgcgcg ccgggattct cgagcccagc gagaccagtc aggtgcgtga cgccatcatc 565321 accgatacgg cgatcaccga gcgtcagaag cagattctgc tcgatatcta cgcgtcattt 565381 acccaccaga acgaagccac ccgggaggag tgtccgagcg atccgacacc gaccgatgac 565441 tagccgttgg ccggctgttt tgcgcaccgg ctggcgggta atcaaacctg aaggacagtc 565501 atctgggtga ggtcgaccgc aggctgatcc agccgatcgg ccgcgctggc caacagcgac 565561 tccgtcgatg acgtgcagca aaggagacat gtagtgaccg gatcagctgg gcctgacatc 565621 tacgaactcg accgacaacc gacccgacga tcagaaggtt tccccggcaa gtcgcgtgcc 565681 atgtcaatcc gcgggtcttg actagtcctc cctggaggag ccgacgcttg ccccaacgtc 565741 cagaccaaag atgtaagaac gccgatatca gaaaatagtt aatgaaagga atacccatgg 565801 ctgaaaactc gaacattgat gacatcaagg ctccgttgct tgccgcgctt ggagcggccg 565861 acctggcctt ggccactgtc aacgagttga tcacgaacct gcgtgagcgt gcggaggaga 565921 ctcgtacgga cacccgcagc cgggtcgagg agagccgtgc tcgcctgacc aagctgcagg 565981 aagatctgcc cgagcagctc accgagctgc gtgagaagtt caccgccgag gagctgcgta 566041 aggccgccga gggctacctc gaggccgcga ctagccggta caacgagctg gtcgagcgcg 566101 gtgaggccgc tctagagcgg ctgcgcagcc agcagagctt cgaggaagtg tcggcgcgcg 566161 ccgaaggcta cgtggaccag gcggtggagt tgacccagga ggcgttgggt acggtcgcat 566221 cgcagacccg cgcggtcggt gagcgtgccg ccaagctggt cggcatcgag ctgcctaaga 566281 aggctgctcc ggccaagaag gccgctccgg ccaagaaggc cgctccggcc aagaaggcgg 566341 cggccaagaa ggcgcccgcg aagaaggcgg cggccaagaa ggtcacccag aagtagtcgg 566401 gctccgaatc accatcgact ccgagtcgcc cacggggcga ctcggagtcg acgtgttgga 566461 tgcaaaccgc atagtctgaa tgcgtgagcc acctcgtggg taccgtcatg ctggtattgc 566521 tggtcgccgt cttggtgaca gcggtgtacg cgtttgtgca tgctgcgttg cagcggcccg 566581 atgcctatac cgccgccgac aagctgacca agccggtgtg gttggtgatc ctgggcgcgg 566641 ccgtggcgtt ggcctccatc ctgtatcccg ttttgggtgt gctcgggatg gcgatgtccg 566701 cctgtgcgtc cggcgtgtat ctggtcgacg tgcggcccaa gcttctcgag attcagggca 566761 agtcgcgcta acggaatgaa agccctggtg gccgtgtcgg cggtggccgt cgtcgcactg 566821 ctcggtgtat cttccgccca agctgatccc gaggcggatc ccggcgcagg tgaggccaac 566881 tatggtggcc ccccaagttc cccacgtctt gtcgatcaca ccgaatgggc gcagtgggga 566941 agtctgccca gcctccgggt ctacccgtcc caagttgggc gtacagcctc ccgccgcctc 567001 gggatggccg ctgccgacgc ggcctgggcc gaggttctcg cgctgtcacc ggaggccgac 567061 actgccggca tgcgcgcgca gttcatctgc cactggcagt acgccgaaat cagacaaccc 567121 ggcaaaccca gctggaacct cgagccgtgg cggccggtcg tcgacgactc ggagatgttg 567181 gcttccggct gcaatccggg cagccctgaa gagtcgtttt agtgctcggc caaccgactc 567241 gggcgcagtt ggccgcgctg gtagaccaca ccctgctcaa gcctgagacc acccgtgccg 567301 atgtggccgc gctggtcgcc gaagccgccg aactcggcgt ctacgcggtc tgcgtgtcgc 567361 cgtcgatggt gccagttgcg gtccaagccg gtggtgtgcg ggttgcggcg gtgacgggct 567421 tcccgtcggg caagcacgtg tcctcggtca aggcgcatga ggcggctgcg gccctggcat 567481 ccggcgccag tgagatcgac atggtcatcg acatcggggc tgcgctgtgc ggtgacatcg 567541 acgcagtgcg ctccgacatc gaggcggtgc gtgccgctgc ggccggggct gtgctcaagg 567601 tgatcgtgga gtcggcggtg ctgttgggac agtcaaacgc gcacacgttg gtggatgcgt 567661 gtcgtgccgc cgaggatgcc ggtgccgact tcgtcaaaac ctcgactggg tgtcatccgg 567721 ccggcggggc cacggtgcgt gccgtcgagc tgatggccga gacggtcggc cctcggctag 567781 gggtcaaagc cagcggtggg atccgcaccg ccgccgacgc ggtcgcgatg ctcaacgccg 567841 gtgccaccag gttgggcctg tccggcaccc gggcggtgct cgatgggctc agctgacagc 567901 tgagcgcgcg ggtggcggcg tcaaatgtgc gagaagcagg gattctggat gccggtgggg 567961 atagccgcgt cgcgagttga gaaccggctc accacgccgg tcgaggtgac ttgcacgctg 568021 tccgcgtgaa tccccaacgg gtagttcttg gtcaggctgg aggtgaactc gttcagcgtc 568081 gactgaacgg tttctttcgg cagcgagaac ccgagcgtgt tgaaattgat gatctgcagc 568141 tccaatcctt tgccagccac tatcggcttg gctgtgatgt tgttcagcag gcccttcagt 568201 tcgacggtgc cgtctgcggg gtgagtgacc acgctgctgg tgacgaaagc gcccaggatc 568261 ggaatcgcgt tttgcaccga ttccttgatg ccttccgacg accaggtaat ggtggcgtcc 568321 agggcgccga tcgtgcccct agagttgggg gtgttcttga gccggacgtt ctggatcgtg 568381 agctttatct gcatgccctt ggcatcgcgg atctgattgc ccgcggtttc caccgagata 568441 ttggtgaagt gccgcgtagc gacctgccac agcagcagcg gcgccacacc gaaggatgcg 568501 gtggcttggt ctttgaccac gcatgcgacc gcttgggcga ccttgctatt ggcaacatgg 568561 cgagcgtata gctcgcctcc gatcagcccg gcgaggacga gcgaaaacac gatgatcagg 568621 acaagaaaga cggttagcgg gtcgcggcgg gcacgtcgtt tggtcttcac cgccgctggc 568681 tcttcctctt gcgcagccag caggcccgtt gggtcccacg cctggtgcgc agcttggcgg 568741 ccggatcgcc gtgtgtgggc atgcgacgca gccagatgct cagtttgcgg ctgctgctcc 568801 ggttgggtgg gcggactcac cggttcttgg atatgacccg cgggctcgcc ggggcgcagt 568861 cgaccggtgg atgcttcgga cgatgccggg ggtcgggcga gcggaccttg atccccaggg 568921 cgcgcccagg gcgacgggtc gttcggtgga ccttgcgggt tggtcaccca cgcgattgtg 568981 ccttatcgat ctgaacgaag tctgtctggt tgcgtagcac cgcaatgcgg tcgcgagccg 569041 cggccacatt gtcgacatcg atgtcggcga ccagcagttg cggctgggtg ccagctgaca 569101 ccaccacctc gcctagcggc gaggccacca ggctgccgcc taccccggtc ggtgcagccg 569161 agctcgcccc cacgccggtg cgggcatcac ctgggtctgc ttggccggcc gcggcgacgt 569221 aactcatgga gtctagcgcc cgggcgcggg ccagcaacgt ccactgttcg agtttgcccg 569281 gaccggaacc ccaggatgca cagaccgcga tcagttgggc cccgcgccgc gccagctcgg 569341 tataaagggc gggaaagcga atgtcgtagc aaacggtcaa acccacccgc acgccgtcga 569401 ccacgactac caccggttcg cgcccgggtg cgacggtacg tgactcggtg aagccgaacg 569461 cgtcatagag gtggatcttg tggtagtgcg cgtccggctg attgggcgtg cccgggccgg 569521 ctgcgatcag cgtgtttgtt acccgcccgt cgccggtcgg ggtgaacatg ccggcgatca 569581 cggtgatgcc cgcctcggtc gcgatccgtc ggactccgtt tgcccagggt ccgtcgacgg 569641 gctcggcgac ctgccgcagc gggacaccga gccggcacat ggtcgcctca ggaaacacca 569701 ccagctgtgc gcccgcggtg gcggcttcgc cggcgtactt gccgaccagt tgcagattgg 569761 cggcggggtc ggtaccgctg cggatttgcg ccaacgcgat tcgcatgcgc gccagcctag 569821 gcccggcgac gagcgcgccg caccggcgcg cgcaggagcc gggcaatcca gcttgcgccc 569881 ggcgacgagc gcgccgcacc ggcgcgcgca ggagccgggc aatccagctt gcgcccggcg 569941 acgagcgcgc cgtaccggcg cgcgcaggag ccgggcaagc tggcacctca gacgttgttc 570001 gtgatccaca gcgtggtgaa gcgctgttcg atggtcacta gctggcttaa ttgggtgccg 570061 ataagcctct ccagcttccc gccaatgaac gggatacgca cctggatggt gacctgcagc 570121 gtcattcggg agccacccga ctccggtatg ggcgagagca cggcggtgcc ccacaagttc 570181 accggagcgt ccacgatcga tcccgcaatg gacgcggtcg cgatgccttc cttgaccggg 570241 ccccaggtct cctcgcgccg taccgaaaga tcgccccggt gcaactgtgt gaccaggccg 570301 ggcagattgt gactgcgcac catctgcagg gtgacgactt cgatggtgcc gtcgtctccg 570361 gagtcgccac ctacgcgtat cgactcaagg gtcgcgacgt cgaccggcgt ttcggccagt 570421 ctggctttcc agtagtccgc ctcgtagaaa gcccgatgaa cctcctcgac gctgccctcg 570481 tagtcggccg acatgtcgaa tgaacgcggc atagcaggtc aggctaccct tacgggccat 570541 gaaacggagc ggtgtcggtt cgctctttgc cggtgcgcat attgccgagg cggtcccgtt 570601 ggcgccgctg accactttgc gtgtgggccc gatcgcccga cgtgtcatca cttgcaccag 570661 cgccgaacag gtggtggctg cgctgcggca cctggattcg gcggccaaga ccggagctga 570721 ccgcccgctg gtgtttgctg gtggctccaa tttggtgatc gccgagaacc tgaccgacct 570781 gaccgtggtg cggttggcca atagcggcat caccatcgac ggtaacttgg tgcgggccga 570841 ggccggtgcg gtcttcgatg acgtggtggt tagggccatc gaacagggtc tgggcggact 570901 ggaatgcctg tctggcatcc caggatcggc cggggcgaca cccgtgcaga acgtgggggc 570961 gtatggcgcg gaggtgtctg acaccatcac tcgggttcgg cttttggatc ggtgcacggg 571021 tgaggtgcgt tgggtatccg cgcgcgacct gcgcttcggc tatcgcacga gcgtgctcaa 571081 acacgctgat gggcttgcgg tgcccaccgt ggtcttggag gtggagtttg cgctggatcc 571141 gtcgggccgc agcgcaccgc tgcgctacgg cgagctgatc gccgcgctga atgcgaccag 571201 cggcgagcgc gccgacccgc aagcggtccg cgaagcggtg ctggccctgc gggcacgcaa 571261 gggcatggtg ctggacccga ccgaccatga cacctggagc gtgggatcgt tcttcacaaa 571321 cccggtggtc acccaggatg tttacgaacg gctggccggt gacgcggcca ccagaaagga 571381 cggtccggtc ccgcactatc ccgcgcccga cggcgtcaag ctggccgccg gctggctggt 571441 ggaacgggcc ggcttcggca agggctatcc ggatgccggc gccgccccat gccggctttc 571501 caccaaacat gcgctggcgc tgacaaatcg tggcggggcc accgccgaag atgtggtgac 571561 gctggcgcgc gccgtgcgcg atggggtcca tgatgtgttt ggtatcacac taaaacccga 571621 acccgtgctg atcggctgca tgttgtagct gcgttttcgc ggcggggcgg cgtggcgcgc 571681 attgcttagg gctggttgcc aggcgttctg tggtcattcg tgtgctgttt cgcccggtat 571741 ctttgatacc cgtgaataac tccagcaccc cccagagtca ggggccgatc agtcggcgtc 571801 tggcgttgac ggcccttggg tttggggtgt tggcaccgaa cgttctggtc gcgtgcgccg 571861 gcaaagtgac caagctggcc gagaagaggc cgccaccggc gcctcgtctg actttccggc 571921 ctgccgactc tgccgccgac gtggtgccga tcgcgccgat cagcgtcgag gtcggtgacg 571981 gctggtttca gcgggtcgcg ctgaccaatt cggcaggcaa ggtcgtcgcc ggggcataca 572041 gccgggatcg caccatctac acgatcaccg agccgctggg ctacgacacg acctacacct 572101 ggagcggttc ggccgtcggc catgacggca aggcggttcc ggtggcgggc aagttcacca 572161 ccgtggcacc cgtcaagacg atcaacgcgg gattccagct cgccgacggc cagaccgtcg 572221 ggatcgcggc gccggtgatt attcagttcg attcaccgat cagcgacaag gccgccgtcg 572281 agcgggcact aaccgtgacc accgacccgc ctgtcgaggg cggctgggcc tggctgcccg 572341 acgaggcgca gggcgctcgc gtgcactggc gtcctcggga gtactacccg gcgggtacca 572401 ccgtcgacgt cgacgccaag ctgtatgggc tgccgttcgg cgacggcgcg tacggcgcgc 572461 aggatatgtc gttgcacttc cagatcggtc gtcgtcaggt ggtcaaggcc gaagtctcgt 572521 cgcaccgcat ccaagtcgtc accgatgccg gcgtcatcat ggacttcccg tgcagctacg 572581 gcgaggccga cttggcgcgc aacgtcaccc gcaacggcat ccacgtcgtc accgagaaat 572641 actcggactt ctacatgtcc aacccggccg ccggttacag ccatatccac gaacgttggg 572701 cggtgcggat ttccaacaac ggcgagttca tccatgccaa ccctatgagc gccggtgccc 572761 agggcaacag caatgtcacc aacggctgta tcaacctgtc gacggagaac gccgaacagt 572821 actaccgcag cgcggtctac ggtgacccgg ttgaggtgac cggcagttcg atccagctgt 572881 cctacgccga cggtgacatc tgggactggg cggtggactg ggacacctgg gtgtcgatgt 572941 cggcgctacc gccaccggcg gccaaaccgg cggcgacgca aatcccggtc accgccccgg 573001 tcacgccgtc ggatgccccc accccgtccg gcacacccac gactactaac ggaccgggtg 573061 ggtagcgcga cggctagctg atgcctggtc gcggggccgg atgacgatct ggtcaaggtt 573121 gacgtgtgag ggccgggtgg ccacgaatcc gatcacctcg gcgacgtcgg cggctactag 573181 cggtgtcatg ccggcataaa ccgcgtccgc gcgttgctgg tcgccgtcga agcggaccag 573241 cgaaaattcg gtctcgaccg cacctggagc gatctcggtg agccggaccg gcttccccag 573301 cagttcgccg cgcagcgtgc gatgcagcgc gccctgcgcg tgcttggcag cggtgtagcc 573361 ggcgccgccg tcgtacacct cgatcgcggc gatcgaggtg acggtgacga tcaggccgtc 573421 gccggagtcg atcagcttgg gcagcagcgc gcgggttacc cgcagcgtgc ccagtacgtt 573481 ggtgtcccac atccatcgcc agtgctccaa atcggcatcg gcgacgaact gaagcccctt 573541 ggcgccaccg gcgttgttga ccagcacgtc cacccggctc agcgcgcggg ccaacgcttc 573601 gacggcggcg tcgtcagtga catcggccac aattgcggtt ccgccgatct ggttggccag 573661 cgcggtgatc cggtccgccc gacgcgccac cgcgaccacg tgaaacccct gggccgcaag 573721 ggttctcgcg gttgcctcgc cgataccgga actggcgccg gtgaccacgg cgactcgctt 573781 gcgggtgccg attgtcgtca tcgggacaac tctaataaac gtgctaaatt ctcggtgtgt 573841 accacagcgc cttgttccgc acgacgaccg cgtgtctttt cgcgggcgcg tgttgttgcc 573901 gccccctttg ccgcgcctga ccgatacacg tcagcaggtg tggccaacag gacccggcca 573961 ttggaactcg gagaagaacg cccgtgtact cgactaaccg cacctcacag tcactcagcc 574021 gcaagcccgg ccgcaagcac cagctgcgat cgcaccgtta cgtcatgccg ccgtcgctgc 574081 acctgtccga ttccgcggct gcgtccgtct tccgggccgt gcgtttgcgt ggtccggtcg 574141 gtcgggacgt aattgctgga tctacgtcgc tgagcatcgc gacggtgaac cgccaggtca 574201 tcgcactgct ggaagcgggc ctcctgcgtg agcgggcgga cctggcggtt tccggggcta 574261 tcgggcgccc acgcgtgcct gtcgaagtaa accacgagcc ttttgtcacc ctgggcatcc 574321 acatcggtgc ccggaccacc agcatcgtgg ccaccgacct gttcggccgc acgctcgaca 574381 cggtggagac cccgaccccg cgtaacgctg ccggggccgc gctgacctca ctggccgaca 574441 gcgctgaccg atacttgcag cgctggcgcc ggcgccgtgc gctgtgggtc ggggtgacgc 574501 ttggtggtgc agtcgacagt gccaccggtc atgtcgacca tccgcggctc ggttggcgtc 574561 aggctccggt cggacccgtg ctggcggatg ccctaggcct gcccgtgtcg gtggcgtccc 574621 acgtcgacgc catggccggg gccgagctga tgctcggcat gcggcggttc gcaccgagct 574681 cgtcgacgag cctctacgtc tacgcccgcg aaaccgtagg ctatgcgctg atgatcggtg 574741 ggcgggtgca ctgcccggcc agtggtcccg gcaccatcgc gcccctgccc gtccactctg 574801 aaatgctcgg cggtaccggg cagctggagt ccactgtcag cgacgaggcg gttttggctg 574861 ctgcccgccg gctgcggatc atccccggca tcgcttcgag gacccggacc ggtgggtccg 574921 ctaccgccat caccgacttg ctgcgagtgg cacgagccgg taatcagcaa gccaaggagc 574981 tgctggcgga gcgggcccgc gtgctcggtg gggcggtcgc gctgctgcgt gacttactca 575041 atcccgacga agtggtggtg ggtggccagg cgtttaccga atatcccgag gcgatggagc 575101 aggtggaggc ggcgtttacg gcagggtcgg tgctggcgcc gcgtgacatc cgcgtgaccg 575161 ttttcggcaa ccgggtgcag gaggccgggg caggcatcgt gtccctaagc gggctctatg 575221 ccgatccatt gggtgccttg cggcgatcgg gcgcgctgga tgcccggctg caggacaccg 575281 ccccggaggc gctcgcgtga tcggctgacg agccgcgtcc gcgcgtgtca cttcggttcc 575341 tgcaaggatg gcaggtgtgc ggcacgatga cggttcaggg ttgatcgccc agcgccgtcc 575401 ggtccgcggc gagggtgcca cccgctcgcg cggcccatcc gggccatcca atcggaatgt 575461 ttcggcagca gacgacccgc gccgggttgc gctgctggcg gtgcacacct caccgctggc 575521 acagccgggc accggtgacg ccggcggcat gaacgtctac atgctgcaaa gtgcgctgca 575581 cctggcccgt cggggcatcg aggtggagat cttcacccgg gccaccgcat cggcagatcc 575641 accggtggtg cgggtggcac ccggggtgct ggtgcgcaac gtggtggcgg ggcccttcga 575701 gggtttggac aagtacgacc tgcccaccca gctttgtgcg ttcgccgccg gggtgctgcg 575761 cgccgaggcg gtccacgaac cgggttacta cgacatcgtg cactcgcact actggctgtc 575821 gggtcaggtc ggctggctgg cgcgcgaccg ctgggcggtg ccgttggtgc acaccgcaca 575881 cacgctggcc gccgtgaaga acgcggcact ggccgacggc gacggacccg agccgccgct 575941 gcgtacggtc ggggagcagc aggtcgtcga cgaggcggat cggttgatcg tcaacaccga 576001 cgatgaagcc aggcaagtga tttcgcttca tggtgccgat ccggcacgaa tcgacgtggt 576061 ccatcccggt gtcgatctgg acgtgttccg cccgggtgat cggcgcgcgg cccgggccgc 576121 gctaggacta ccagttgacg agcgcgtggt ggccttcgtc ggacgcatcc agccgctgaa 576181 ggcacccgac attgtgctgc gtgcggccgc caagttgccc ggggtgcgca tcatcgtggc 576241 cggcggaccg tcgggcagcg gtctggcttc accggacgga ctggtccggc tcgccgacga 576301 actgggcatc tctgcacggg tgacgtttct gccgccgcag tcccacacgg atctggccac 576361 cttgtttcgg gcggcggacc tggttgcggt gccgagctac tccgagtcgt tcggcctggt 576421 tgctgtggag gcccaagcgt gcggcacacc ggtggtggcc gcggcggtgg gcgggctgcc 576481 cgtcgcggtg cgcgacggga tcaccggcac cctggtgtcc gggcacgagg tcggtcagtg 576541 ggccgacgcc atcgatcacc tgctgcggtt gtgtgccggg ccacggggac gggtgatgag 576601 ccgggcggcg gcacggcacg ccgccacgtt ctcgtgggag aacaccaccg acgcgctgtt 576661 ggccagttat cggcgtgcga tcggcgagta caacgccgag cgccagcgcc ggggcggcga 576721 ggtgatatcg gacctggtag cggtgggcaa gccccgccac tggacgccgc gtcgcggggt 576781 gggcgcgtga cttcctcctt gccgaccgtg caacgtgtga tccagaatgc gctcgaggtc 576841 agccagctga agtactccca acacccccgc ccgggcgggg cgccgcccgc gctgatcgtc 576901 gagctgccgg gcgaacgcaa gctcaagatc aacaccatcc tgagcgtcgg cgagcattcg 576961 gtgcgtgtcg aggcgttcgt gtgtcgcaag cctgacgaga accgcgaaga cgtataccgg 577021 ttcctgctgc ggcgcaaccg ccgcctgtat ggggtcgcgt acacgctgga caatgtcggc 577081 gacatctacc tggtgggcca gatggcgctg tccgcagtgg acgccgacga ggttgaccgg 577141 gtgttggggc aggtgttaga ggtggtggat tcggacttca atgcgttgtt ggagttggga 577201 tttcggtcgt cgattcaacg agagtggcag tggcggttat ctcgcggtga gtcgctgcag 577261 aacctgcagg ccttcgctca cttacgcccg acgacgatgc agagcgcgca gcgcgatgag 577321 aaggagttgg gcggttaggt cgagcccgac gacgatgcag agcgcgcagc gcgatgagaa 577381 ggagttgggc ggttaggtcg agcccgacga cgatgcagag cgcgcagcgc gatgagaagg 577441 agttgggcgg ttaggtcgag cccgacgacg atgcagagcg cgcagcgcga tgagaaatag 577501 cactcgtgga ggtcaagacg cccgccggtg atgggctggt ggcgctcacc ccgttccgga 577561 ctcagaaatt cgcgatcaca atttgcgcgt tcaagtcatt ggcatgcatg tgatggttta 577621 gcgttccgct gtgcctcttc aggtgtttgt cggcttcgtt gccatgatga cgctcaaggt 577681 cgcgatcggc ccgcaaaacg catttgtcct gcgccaagga attaggcgag aatacgtgct 577741 ggtcattgtg gcgctgtgcg ggatcgctga tggggcactg attgccgcgg gcgttggcgg 577801 cttcgctgcg ctgattcacg ctcatcccaa tatgactttg gttgcccgat ttggcggcgc 577861 agcgttcttg attggctacg cgctattggc cgcgcggaac gcgtggcgcc cgagcgggct 577921 ggtgccgtcg gaatcggggc cggctgcgct gatcggcgtg gtgcaaatgt gcctggtggt 577981 gacctttctc aacccacacg tctatctgga cactgtggtg ttgatcggtg ccctcgccaa 578041 tgaggaatca gatctgcggt ggtttttcgg agccggtgcc tgggccgcca gcgtcgtatg 578101 gttcgccgtg ttgggattta gcgcgggccg gctacagcca ttcttcgcaa ctccagctgc 578161 ttggcgcatt cttgatgcgc tggttgccgt gacgatgatt ggggtcgccg tcgttgtgct 578221 cgtcacgtca ccaagtgtgc cgacggccaa tgtcgcactg atcatttgac cacctcgtag 578281 gccgcccatg tatcggcctt ggtgaaccgg ccgttacggt gccgaccacc tcggcggtat 578341 gaacgcgctg cgcagcggac cgaggagaat tcgggcattt tggtccacga tgaggagtgc 578401 gggagtgcgt gagagacttg ccggtatggc aaacactggc agcctggtgt tgctgcgcca 578461 cggcgagagc gactggaatg ccctcaacct gttcaccggc tgggtcgatg tcggcctgac 578521 ggacaagggc caggcagagg cggttcgaag cggcgagctg atcgcggaac acgacctatt 578581 gcccgacgtg ctctacacct cgttgctgcg gcgcgcgatc accaccgcgc atctggcgtt 578641 ggacagcgcc gatcggctct ggattcccgt gcggcgtagc tggcggctca acgaacgcca 578701 ctacggcgcg ctgcagggtt tggacaaggc cgagaccaag gcccgctatg gcgaagagca 578761 gttcatggcc tggcggcgca gctatgacac gccgccgccg ccgatcgagc ggggcagtca 578821 gttcagccag gacgccgacc ctcgttacgc cgacatcggc ggtggcccgc tcaccgaatg 578881 tctggctgac gtggtcgccc ggtttttgcc atatttcacc gacgtcatcg ttggcgactt 578941 gcgggtcggc aagacggtgc tgatcgttgc ccacggcaac tcgttgcgcg cgctggtcaa 579001 gcacctggac cagatgtctg acgacgaaat cgtcggactg aacatcccga ccggaattcc 579061 gctgcgctac gacctggatt ccgcgatgag gccgctggtg cgcggtggta cgtatctgga 579121 cccggaggcg gcagccgccg gcgccgccgc ggtggccggc cagggccgcg ggtaattgtt 579181 tgagatccca cctgccggcg gtttcggcgg ctgatggtgt gctttggtgc gctgtttgcc 579241 aaacagcatg tgaacggtaa ccgaacagct gtggcgtagt gtgtgacttg tccgattttg 579301 gccttgccgc gctagggcga cgttcaccgg atttgtagga ttttccttgt gactgtgttc 579361 tcggcgctgt tgctggccgg ggttttgtcc gcgctggcac tggccgtcgg tggtgctgtt 579421 ggaatgcggc tgacgtcgcg ggtcgtcgaa cagcgccaac gggtggccac ggagtggtcg 579481 ggaatcacgg tttcgcagat gttgcaatgc attgtcacgc tgatgccgct gggcgccgcg 579541 gtggtggaca cccatcgcga cgttgtctac ctcaacgaac gggccaaaga gctaggtctg 579601 gtgcgcgacc gccagctcga tgatcaggcc tggcgggccg cccggcaggc gctgggtggt 579661 gaagacgtcg agttcgacct gtcgccgcgc aagcggtcgg ccacgggtcg atccgggcta 579721 tcagtgcatg ggcatgcccg gttgctgagc gaggaagacc gccggttcgc cgtggtgttc 579781 gtgcacgacc agtcggatta tgcgcggatg gaggcggcta ggcgtgactt cgtggccaac 579841 gtcagtcacg agctcaagac gcccgtcggt gccatggctc tactcgccga ggcgctgctg 579901 gcgtcggccg acgactccga aaccgttcgg cggttcgccg agaaggtgct cattgaggcc 579961 aaccggctcg gtgacatggt cgccgagttg atcgagctat cccggctaca gggcgccgag 580021 cggctaccca atatgaccga cgtcgacgtc gatacgattg tgtcggaagc gatttcacgc 580081 cataaggtgg cggccgacaa cgccgacatc gaagtccgca ccgacgcgcc cagcaatctg 580141 cgggtgctgg gcgaccaaac tctgctggtt accgcactgg caaacctggt ttccaatgcg 580201 attgcctatt cgccgcgcgg gtcgctggtg tcgatcagcc gtcgccgtcg cggtgccaac 580261 atcgagatcg ccgtcaccga ccggggcatc ggcatcgcgc cggaagacca ggagcgggtc 580321 ttcgaacggt tcttccgggg ggacaaggcg cgctcgcgtg ccaccggagg cagcggactc 580381 gggttggcca tcgtcaaaca cgtcgcggct aatcacgacg gcaccatccg cgtgtggagc 580441 aaaccgggaa ccgggtcaac gttcaccttg gctcttccgg cgttgatcga ggcctatcac 580501 gacgacgagc gacccgagca ggcgcgagag cccgaactgc ggtcaaacag gtcacaacga 580561 gaggaagagc tgagccgatg acctgcgccg acgacgatgc agagcgtagc gatgaggtgg 580621 gggcaccacc cgcttgcggg ggagagtggc gctgatgacc tgcgccgacg acgatgcaga 580681 gcgtagcgat gaggtggggg caccacccgc ttgcggggga gagtggcgct gatgacctgc 580741 gccgacgacg atgcagagcg tagcgatgag gtgggggcac cacccgcttg cgggggagag 580801 tggcgctgat gaccagtgtg ttgattgtgg aggacgagga gtcgctggcc gatccgctgg 580861 cgtttctgct gcgcaaggag ggctttgagg ccacggtggt gaccgatggt ccggcagctc 580921 tcgccgagtt cgaccgggcc ggcgccgaca tcgtcctgct cgatctgatg ctgcctggga 580981 tgtcgggtac cgatgtatgc aagcagttgc gcgctcggtc cagcgttccg gtgatcatgg 581041 tgaccgcccg ggatagcgag atcgacaagg tggtcggcct ggagctgggc gctgacgact 581101 acgtgaccaa gccctattcg gcacgcgagt tgatcgcacg catccgcgcg gtgctgcgcc 581161 gtggcggcga cgacgactcg gagatgagcg atggcgtgct ggagtccggg ccggttcgca 581221 tggatgtgga gcgccatgtc gtctcggtga acggtgacac catcacgctg ccgctcaagg 581281 agttcgacct gctggaatac ctgatgcgca acagcgggcg ggtgttgact cgcggacaac 581341 tgatcgaccg ggtctggggt gcggactacg tgggcgacac caagacgctc gacgtccatg 581401 tcaagcggct gcgctccaag atcgaagccg acccggctaa cccggttcac ttggtgacgg 581461 tgcgcgggct gggctacaaa ctcgagggct agcggacgcc gacaaccttg gcgactgtct 581521 ggtcggctac ggccagtgcc atcgccatga tggacagctg cgggttcact tccgggcagc 581581 tgggcaggat cgaggcgtcg gcaacccaca cgccctcgac gccgcgcagc cggcccgtcg 581641 cgtcgaccgg acaaagctgc tcgtcggcgc cggcggccgc ggtgcccgtc ggatggaagg 581701 cggccaggtg caggcttctg gggttggctc ggcgcagcac atcctgcagc tcgggcaggg 581761 accgcatcgg tggggcgccg gggataccgg tcagcacctc caccgcgccg gcggcaaaga 581821 gcagccggcc aatggcctgc agcgcgaccc gtagcttggc gatctcacct ggagctatgt 581881 catagcgcac caccgtctcg ccgcgcaccg accgcaccgt gccgacgccc cgatcggcca 581941 ccatcgcccc gaatgttgcg atctgcggcg cccggtcgag ccagcggagc agctcggccc 582001 cgtagccggg gaagaccatc gaccccatgc ccggcggtgt ggaggtggcc tcgatcagca 582061 cgccgtcgga ttcgtgaaac tcgtgaaccg ccgcgctctg cagcaccccg cgccacgcga 582121 agacgtcgtc gtcgaagagc ccggccagca tagttgccgg gtgcagcgca aggttgtggc 582181 ccagtcgcgg gtgcccacca agaccgctgc gccgcaacag tcctggcgtc tccgtcgcac 582241 cggcggcgac gacgaccgcg tcggccagca cgtcgagtgt ggtgccgtcg ggccggcggg 582301 ctcgcacgcc ataggcccgc ccggcgcggt gcaggatccg ttcgacccgc gcccaggaga 582361 tgatccgcgc gccggccgcg caggcttgcg gcagggcgtt gaggtgcacg ccgaacttgg 582421 cgttgctggg gcagccgatc gcgcactggc aacagccacg gcaccccggc gcattgcgcg 582481 ggatgggcgc cgcccgccag cccagcgact tggcggcctg cagcaacagg cgcccgttgc 582541 ggcccatgat ctccagcggc accggcgcaa cccgcagtgt ttgctccgca tcgtcaagac 582601 gacgtcccag ctggtcgggg tcggccaggc cgagaccgaa ctcgtcacgc cagcgccgct 582661 gcacggcaag tgaaggccga aagcaggtgc cggagttgac gacggtggtg ccgcccaccg 582721 cccggcccat cggcagcacc accgccggtc gcccgagcgc gacggtggcc ccggcgccac 582781 ggtacaaccc ggcataacgg tcgaccgggt gggtgctacg gaactcctcg accgtccagc 582841 gccgtccctc ttcgagcacg accacgtcaa ggccggcccg ggccagcgtg cgcgcgacca 582901 tcgcgccgcc cgccccggag ccgacgacca ccgcatcggc cctggtgacg gatgggctgt 582961 ccgccgacaa gatgacggtc aactccgcgt cggggcgcgc cgcgtcatgt tcctgggcgc 583021 gggcgagcaa ttcgtgcgcg taggtgtcgg cgccgttggc caacagcacg atcgccttca 583081 acccctccac ggccgcagcg acttccgggc tcagtgcggc gatccggtgc agcacccgtg 583141 cccgctcgtc cgggtgcagt cgcggtagcg accggccggt ggtgaggtag ctggccgccg 583201 ccagtgaagc cagcccggcg cgcaccgcga atcgtgaggt cgccggcagt cgtgtgacgt 583261 agcggtcaac gcgctgcacg aattgagccg gcaacgggcc gccgagctcc ggcggcagca 583321 gcgcggcgcc gaacgaggcc aacggatagg acttagcccg atcggcgagc cggctcatat 583381 ccggcgcccg agccggcggc cgagctttat gaagaacgga tacgttgcga agatggcagc 583441 ggccatcgcg tgcagcggcc actccgcgcg ggccacatcc acgctgaaaa ccccgctgtt 583501 ccacatgaaa tcgcggccgt tctgtgcgcg aaatggccgc cacagcatcc cgagcccggg 583561 tacgttgtgg tacagcccga acgaagcgcc gaagaagacg cccagggcgg cggcctcggc 583621 ggcatcgcgg cggtccacgg gcaggcgtcg ctcgatgagc actccgcaga caaacagcag 583681 cggcgggtcg agcaggaaac tcatgctggc gtcccttcct tgatagccgg tgccgcggtt 583741 ccccgcaggc cgacttcggc gtgtccggtg cccagcaccg accagtgccg gccgccgagc 583801 tcgatgtgga tgtcggcctg ctcggtgttg gtgcacaccg ccttggcccc gtcgggatcg 583861 gtgtatccca ggcttacgca ccgctccggc ggctggtcta cccggattag cgcctcccgg 583921 ccgccgatgc gtccttccag ttgccagtgc cgcacgccga gcgttgtccg cattcgcagc 583981 gacggtaaag gacttgcggg ccaatccttt ccgtcgatgc ggaagcgaac gaacgctagc 584041 ggcgcgagcc tgcgtaggcc cggcttgtgt gataccgcgg tcaccacctc taggacgtcg 584101 ccgtcgccga gatcggcatg gatccatccc caccgcttgg cattgccatg tccgtagatg 584161 tgggccacac tgccgcgcca gctgtcgacg cggtgggtgg tttcgccgac ggccaaggag 584221 ccagcgaaga cggcggtggg tgcgatcacc acttgggcgc cgggcagcaa ctcgcgctcc 584281 caggccacgc gaggaaacgt ccacagtggc gccgcggtgt ccttccagga cagctcccat 584341 gcgagtgatc gggtacgtcc ggtcagctcc gctggcgcca ttcgtacacc ggcgatgtcg 584401 aaccaggcgg ggccggccgc gggttgggcg ggctgggggc cgaagcgctc ggtgcccggc 584461 ggggcatccg gtggaaacca ggtcacccag ccgtgcgcgt agggcccgcc ggtcgtcggg 584521 gccaccgtct cacagtgcac ccataggccg gtacgcgtca gtggatccga cagagtcgca 584581 taccagactt ccaggcgccc ggctgcaccg cgccaccgcg gcaaggccgc cgaccgcgtt 584641 tcatcgtcca ctgcggcacc tcctgctggc tgagttgtcg attcgcccac tatattggtt 584701 gagccaatga accagtcaag tgtctttcag ccgccggatc ggcagcgggt ggatgagcgg 584761 atcgcgacga cgatcgccga cgccatcctc gacggcgtct tcccgccggg ctcgaccctg 584821 ccgcccgagc gagacctggc agagcggctc ggtgtcaacc gcacctcgct acgccagggt 584881 ctggcgcgac tgcaacagat gggcctgatc gaggtgcggc acggcagcgg cagtgtggtc 584941 cgtgaccccg aggggctcac ccatcccgcg gtggtcgagg cgctggtgcg caaactgggc 585001 cccgacttcc tcgtcgagtt gctggagatc cgcgcggcgt taggcccgtt gattggccgc 585061 ctggcggccg cccggagcac gcccgaggat gccgaggcgt tgtgtgcggc gctggaagtg 585121 gtgcaacagg cggacacggc cgcggcgcgg caggcagccg atcttgccta cttccgggtg 585181 ctcatccaca gcactcgcaa ccgcgcattg gggttgctct accgctgggt ggagcacgcc 585241 ttcggcggcc gcgagcatgc gctcaccggg gcctacgacg acgcggaccc agtgttgacc 585301 gacctgcggg cgatcaacgg ggcggtgctg gccggtgacc cggcggccgc tgccgcgacc 585361 gtcgaggcgt atctgaacgc cagtgcgctg cgcatggtca agtcctaccg cgaccgcgct 585421 tagctactgg gccgcacgcg tcgccggatg tacggcgatg agccctaatt gactgcggcg 585481 cttgcacatt gctgcgagtt ccccataggc cttctccccg agtaattcgg tgagttcgtc 585541 ggcaaggctc tgccacacct gcttggttcc gacatgggcg gccggatcgc cggtgcaata 585601 ccagtgcagg tcagcaccac ccgagcccca accgcggcgg tcatattcgg tgagggtggt 585661 cttgaggatt tcggtgccgt cggggcgggt cacccattcc tggctgcgcc ggatcggcaa 585721 ctgccagcag acatcgggtt tcatcgtcaa cggcggcacg cccagcttga gggctttgct 585781 gtgcagcgcg cagccggcgc caccggcgaa cccgggccgg ttcaagaaga tacacgcgcc 585841 cttgtgtttg cgggtgcggt gctggggttg gccgtcgtgc tcgtcgagtt ccaggtagcc 585901 cttgcggcgc aggccctttg cccggaactg ccagtcgtcg tcggtcagct tgtgcaccgc 585961 gtcggccaac cgggtgcggt cgtcgtcgtc ggacaggaac gcaccgtgcg aacaacagcc 586021 gtcgtttggc cggcccgcga cggtgccctg gcaggcgggt gtgccgaata cacacgccca 586081 gcgcgacagc aaccaggtaa ggtcggccgc gatcaggtgc tcgggattgt ccgggtcgta 586141 gaactccacc cactcacggg cgaagtccaa ctcgacttct tgccccgggt gcaccggtct 586201 ccgtcgcgaa tttgccacgg attcaacgtt agaccacgaa gcccgccgcg ggattccgcc 586261 atagcccagc acggccggca catgccaccg ggcgccttgc gcgggtcgcc acacgcccgt 586321 atcttcgccc ggctagtttg ttttcgtgcg attgggcgtg ctggacgtgg gtagcaacac 586381 ggtccatctg ctggtggtcg atgcccaccg cggcggccac ccgaccccga tgagctcgac 586441 gaaggccacg ctgcggctgg ccgaggccac cgacagctcg ggcaagatca ccaagcgcgg 586501 agccgacaag ctgatttcca ccatcgacga attcgccaag attgccatca gctcgggctg 586561 tgccgagctg atggccttcg ccacgtcggc ggtccgcgac gccgagaatt ccgaggacgt 586621 cctgtcccgg gtgcgcaaag agaccggtgt cgagttgcag gcgctgcgtg gggaggacga 586681 gtcacggctg accttcctgg ccgtgcgacg atggtacggg tggagcgctg ggcgcatcct 586741 caacctcgac atcggcggcg gctcgctgga agtgtccagt ggcgtggacg aggagcccga 586801 gattgcgtta tcgctgcccc tgggcgccgg acggttgacc cgagagtggc tgcccgacga 586861 tccgccgggc cggcgccggg tggcgatgct gcgagactgg ctggatgccg agctggccga 586921 gcccagtgtg accgtcctgg aagccggcag ccccgacctg gcggtcgcaa cgtcgaagac 586981 gtttcgctcg ttggcgcgac taaccggtgc ggccccatcc atggccgggc cgcgggtgaa 587041 gaggacccta acggcaaatg gtctgcggca actcatcgcg tttatctcta ggatgacggc 587101 ggttgaccgt gcagaactgg aaggggtaag cgccgaccga gcgccgcaga ttgtggccgg 587161 cgccctggtg gcagaggcga gcatgcgagc actgtcgata gaagcggtgg aaatctgccc 587221 gtgggcgctg cgggaaggtc tcatcttgcg caaactcgac agcgaagccg acggaaccgc 587281 cctcatcgag tcttcgtctg tgcacacttc ggtgcgtgcc gtcggaggtc agccagctga 587341 tcggaacgcg gccaaccgat cgagaggcag caaaccatga cgggaccaca ccccgaaaca 587401 gagagctccg gtaaccggca gatctcggtg gccgagttgc tggccaggca aggggtcacc 587461 ggcgccccgg cccgacggcg ccggcggcga cgcggcgata gtgacgccat cacggtcgcc 587521 gagctgaccg gtgagattcc gatcattcgt gacgaccatc accacgccgg cccggacgcg 587581 cacgcgagcc agtctccggc ggctaacggg cgagtccagg ttggcgaagc tgccccacag 587641 tcgccggcgg aaccagtcgc cgagcaggtt gccgaagagc caacgagaac cgtgtactgg 587701 tcgcaacccg agccgcgctg gcccaagtcc cccccgcagg accggcgcga gtccgggccc 587761 gagcttagcg agtacccgcg gccactgcgc cacacgcata gcgacagagc acccgcgggg 587821 ccgccgtccg gtgccgaaca catgagtccg gatccggtcg agcactaccc cgatctctgg 587881 gtggatgtcc tggacaccga ggtgggcgaa gcggaagccg agaccgaggt gcgcgaagcg 587941 caacctgggc gcggcgagcg ccacgccgca gcggcggcgg ccggcaccga cgtcgagggt 588001 gatggtgcgg ccgaggcgcg ggttgcccgt cgtgccctgg acgtggtccc gacgctgtgg 588061 cgcggcgcgt tggtcgtgct gcagtcgatc ctggccgttg ccttcggtgc cgggttgttc 588121 atcgccttcg accagttgtg gcgctggaac agcatagtgg cgctagtgct atcggtgatg 588181 gtcatccttg gcctagtggt ctcggtgcgg gcagtccgca agaccgaaga catcgccagt 588241 acgttgatcg cggttgcggt gggggcgctg attaccctgg gaccgctggc cttgttgcaa 588301 tcgggctagc cgccaccaca cacagtgcgc ccagcaatca aagtcggctt gtcgacggcc 588361 tcggtgtacc cgttgcgggc cgaggccgcg ttcgagtacg ccgacaggct tggctacgac 588421 ggggtcgagc tgatggtctg gggtgaatcg gtcagtcagg acatcgatgc cgtccggaag 588481 ctgtcgcgcc gctaccgcgt gccggtgttg tcggtgcacg ctccgtgcct actcatctcg 588541 cagcgggtgt ggggcgccaa tccgatcctc aagttggacc gcagtgtgcg ggccgccgaa 588601 caactgggcg cgcaaacggt cgtcgtgcat ccgcctttcc gctggcaacg acgctacgcc 588661 gaagggttca gcgatcaggt tgccgcccta gaagcggcca gcaccgtgat ggtggccgtt 588721 gaaaacatgt ttcccttccg agcggaccgg tttttcgggg ccggccagtc ccgggaacgg 588781 atgcgtaagc ggggtggtgg cccaggtccg gcgatctcgg cgttcgcgcc gtcctacgac 588841 ccgctggacg gcaaccacgc gcattacacg ctggacctct cgcacaccgc gactgcgggc 588901 accgactcgc tggatatggc gcggcggatg ggcccagggc tggtgcacct gcacctgtgt 588961 gacggcagcg gcctgcccgc cgacgagcac ctggtgcccg gccgcggtac ccagccgacc 589021 gccgaggtgt gccagatgct ggccggcagc ggcttcgtcg gccacgtcgt gttggaggtg 589081 tccacctcaa gcgcgcgttc ggccaatgaa cgcgaatcca tgctggccga gtcgttgcag 589141 ttcgcccgca ctcacctgct gcgttgatat gccgggaaca ctatgaacgc gttgttcacc 589201 acggcgatgg cgctgcgccc gcttgactcc gatcccggca atccggcgtg ccgggttttt 589261 gaaggcgagc tgaacgagca ctggaccatc gggcccaagg tgcacggcgg tgcgatggtg 589321 gcgctgtgtg ccaatgccgc ccgcaccgct tacggcgcgg ccggacagca gcccatgcgg 589381 caaccggtcg cagtgtcggc gagctttctg tgggcgccgg atccggggac gatgcggttg 589441 gtgacgtcga tccgcaagcg tggtcgccgg attagcgtgg ccgatgtcga gctcacccag 589501 ggtggccgca cagcggtgca cgccgtggtc accctgggtg agccggagca ttttctcccc 589561 ggcgttgatg ggagcggcgg ggccagtgga accgcgccgc tgctgtcggc gaatccggtg 589621 gtggagctga tggcaccgga accgcccgag ggagtcgtgc cgatcggtcc cggccatcag 589681 ctggccgggc tggtgcactt aggcgaaggc tgcgatgtcc ggccggtgtt gtcgacgttg 589741 cggtccgcga ccgatgggcg gccaccggtg attcagctgt gggcgcgtcc acgcggcgtt 589801 gctccggacg cgctgttcgc tctgttgtgc ggggacttgt cggccccggt gaccttcgcg 589861 gtggaccgca ccggctgggc gcctacagtt gcgctcaccg cctatcttcg ggccctgccc 589921 gccgacggct ggctgcgagt gctctgcacc tgcgtcgaaa tcgggcagga ctggtttgac 589981 gaggaccaca tcgtcgtcga ccggttgggc cgcatcgtgg tgcagacgcg ccaactggcg 590041 atggtgcctg cccagtagca cggatcggcc gagctgtctg cgatgctttt cggcatggca 590101 aggatcgcga ttatcggcgg cggcagcatc ggtgaggcat tgctgtcggg tctgctgcgg 590161 gcgggccggc aggtcaaaga cctggtagtg gccgagcgga tgcccgatcg cgccaactac 590221 ctggcgcaga cctattcggt gttggtgacg tcggcggccg acgcggtgga gaacgcgacg 590281 ttcgtcgtcg tcgcggtcaa accagccgac gtcgagccgg tgatcgcgga tctggcgaac 590341 gcgactgcgg cggccgaaaa cgacagtgct gagcaggtgt tcgtcaccgt ggtagcgggc 590401 atcacgatcg cgtatttcga atccaagcta ccggctggga cgccagtggt gcgtgcgatg 590461 ccgaacgcgg cggcattggt gggagcgggg gttacagcgc tggccaaagg ccgctttgtc 590521 accccgcaac agcttgagga ggtctcggcc ttgttcgacg cggtcggcgg cgtgctgacc 590581 gttccggaat cgcagttgga cgcggtgacc gcggtgtccg gctcgggtcc ggcctatttc 590641 tttctgctgg tcgaggccct ggtggatgcc ggagtcgggg tgggcttgag ccgtcaggtg 590701 gccaccgatc tcgccgcgca gacaatggct ggctcagcgg cgatgctgct ggagcggatg 590761 gagcaagacc agggtggcgc caatggcgag ctgatggggc tgcgcgtgga ccttaccgca 590821 tcacggctgc gcgccgcggt tacctcgccg ggcggtacga ccgccgctgc gctgcgggaa 590881 ctcgaacgcg gcgggtttcg gatggctgtc gacgcggcgg ttcaagccgc caaaagccgc 590941 tctgagcagc tcagaattac accggaatga ttcacgaatt ttgaactgat tatccctcac 591001 cagtaccagt aaccccacta gtcccgctat tctcctcttt gtaagcgcgt gtgggtgcca 591061 gcggagggga agccgctggg actgcgcgtg cctgacacga ttgggttgcg atgacgtcta 591121 cgaacgggcc atcggcgcgg gataccggtt ttgttgaggg ccagcaggcc aagacacaac 591181 ttctcaccgt ggccgaagtg gcggccctga tgcgggtgtc caagatgacg gtgtaccggc 591241 tggtgcacaa tggcgaactg cccgcggttc gggtcgggcg gtcattccgg gtgcatgcca 591301 aggccgtcca cgacatgttg gagacttcgt acttcgacgc gggctagttg ccggccgcac 591361 gcggccggag tccgcctgac cgatctggca atgctcgggc gctgccggtt tggtgttccg 591421 tgcgaccgcc cgggtagagt gtccgggtca gatagccgta tagatggcgg ggtcatgggt 591481 tcagtaatca agaagcggcg caagcgcatg tccaagaaaa agcatcgcaa gctgctgcgt 591541 cgcacccggg tgcagcgcag gaaactgggc aaataggttg cgagcagacc ccgccagctc 591601 gaccgtcacg cgcttgtaac gccgccgttt cgcctggccg ttaggctgtc ggagtgagtt 591661 cgtcgaacgg gcgcggtggc gccggaggag tcggcggcag cagtgagcac ccgcagtacc 591721 ccaaagttgt gctggtgacc ggtgcttgcc gtttcctagg cggctacctg accgcacggc 591781 ttgcccagaa cccgctgatc aaccgggtca tcgcggtgga cgcgatcgcg ccgagcaagg 591841 acatgctgcg ccggatgggc cgagccgaat ttgttcgcgc tgatatccga aacccattca 591901 tcgccaaggt gattcgcaat ggcgaggtgg acacggtggt gcacgccgcg gcggcctcgt 591961 atgcgccgcg gtccggcggc agtgcggcat tgaaggaact taacgtgatg ggcgcgatgc 592021 aactgttcgc cgcctgccaa aaggcgccct cggtccgccg ggtcgtgctg aagtcgacct 592081 ctgaggttta cggatcgagc ccacacgatc cggtgatgtt caccgaggac agcagcagtc 592141 gacgtccttt cagccaaggt ttccctaagg acagtctcga tatcgagggc tacgtgcgcg 592201 cgctgggccg acgccgcccc gatattgcag tgactatcct gcggctggcc aacatgatcg 592261 gcccggcgat ggacaccacg ctttcacgat atctggccgg gccgctggtc ccgacgatct 592321 tcggccgtga tgcgcgactg cagttgctgc acgagcagga tgcgctgggt gcgttggagc 592381 gcgcggcgat ggccggcaag gccggaacgt tcaacatcgg agccgacggc atcctcatgc 592441 tgtcgcaggc gatccggcgg gccgggcgaa ttccggtgcc ggtgccaggg tttggggtat 592501 gggctctgga ttcgctgagg cgagcgaatc actacaccga gctgaatcgt gagcaattcg 592561 cttacctgag ttatggccgg gttatggaca ccaccagaat gcgcgtcgaa ctgggttacc 592621 agccgaagtg gacgaccgtc gaggcgttcg atgactattt tcgcggccgc ggcctgactc 592681 ccattattga cccacatcgg gtacgctcct gggagggtcg cgccgtaggt ttagcgcagc 592741 gctggggtag ccgaaatcca attccatgga gcggactcag ataggtttgg atgggtaacg 592801 tggcgggcga aaccagagcg aatgtcattc cactgcacac aaatcggagc cgggtagcgg 592861 cgcgcaggcg tgccggtcaa cgggcagagt cccggcagca tccgtcgttg ctgtccgatc 592921 caaatgaccg ggcgtcggcc gagcagatcg ccgccgttgt ccgggaaatc gacgaacacc 592981 ggcgcgctgc gggtgccacg acctcgtcca ccgaggccac gcccaacgac cttgcgcaac 593041 tcgtcgccgc ggttgctgga tttctccgac agcgcctgac cggtgactac agcgtcgacg 593101 aattcgggtt cgacccgcac ttcaacagcg ccatcgtacg acccttgctg cgattcttct 593161 tcaagtcatg gtttcgggtc gaagtcagtg gtgtcgagaa catcccgcgc gatggtgcgg 593221 cgctggtggt ggccaatcac gcaggtgtgt tgccgtttga cgggttgatg ttgtcggtgg 593281 ccgtccacga cgagcacccg gcgcatcggg atctgcggct gcttgccgcc gacatggtgt 593341 tcgacctccc cgtgatcggc gaagccgccc gcaaggcggg tcataccatg gcgtgtacga 593401 cggatgcgca ccggttgctt gcctccggcg aactcaccgc ggtgttcccc gagggataca 593461 aggggctggg taagcgtttc gaggaccgtt accggttaca gcggtttggt cgcggcggct 593521 tcgtatcggc cgcgctacgg accaaggcgc cgattgtgcc gtgttcgatc atcggctccg 593581 aagagatcta ccccatgctg accgatgtca agctgctggc tcggctgttc ggcctgccgt 593641 acttcccgat tacgccgttg ttcccgttgg ctggaccggt cgggctagtg ccgttgccct 593701 cgaaatggcg catcgcgttc ggtgagccga tctgcaccgc cgactacgcc tccaccgacg 593761 ccgacgaccc gatggtgacg ttcgagttga ccgatcaggt gcgcgagacg atccagcaga 593821 cgctataccg actgcttgcc ggccgtcgca acatcttttt cggctgaccc ttatttgacc 593881 agagtgaact ggcagacgtc cgtgtacttg tcgcggaaca ggtctgagca gccacgtagg 593941 tagtgcatgt agatgtcgta cgtctcctgg cccttgaggg cgatcgcctc atctttgtgc 594001 gcctgtagcg catccgccca ggcgttcagg gtcggcacgt agttggcccc gatccggtgg 594061 tagcgctcga ccttccatcc ggcgttggag gagtaatagt ccacctgcga gatcctgggc 594121 agccgcccgc ccgggaagat ctcggtcagg atgaacttga tgaagcgcag caggctcatc 594181 ggagacgtca agcccagctc ctgggcttcc tctttgtccg ggatagtgat ggtgtgcagc 594241 agcatccggc cgtcgtcggg cgtcaaattg tagaacttct tgaagaaggt gtcgtagcgc 594301 tcgaacccgg cgtccccggc accgtcggcg aaatgctcaa acgcaccgag tgacacgatg 594361 cggtcgaccg gctcgtcgaa ctcctcccag ccctggattc gcacctcttt tcggcggggg 594421 ctgtcgacct catcgaacat cgccttgtcg tgggcgtact ggttttcgct cagggtcaag 594481 ccgatgacgt tgacgtcgta ctcggcgacc gcgtgtcgca tggtggaacc ccagccgcag 594541 ccgatgtcga gcagcgtcat gccgggctca aggttcagct tgtccagtgc cagcttgcgc 594601 ttcgcgtact gcgcctcttc cagcgtcata tcgggacgtt cgaagtaggc gcagctgtac 594661 gtcatcgatg ggtcaagcca gagcttgaag aactcgttcg atttgtcgta gtgggatcga 594721 actgcttcga ccggcggctt gagctgcgtg ccgcttgtcg tgtcgccctg tgacgtcatt 594781 gaacggaccc tactttcccc actagatcga tgcaatcgcc gccaccgttg catcggcatc 594841 ggcttcgtgg tgggccgctt ctcccaacat ggtgacgaca ctggtgacca caggctttcc 594901 ttcggcgtcg gtaacttcgc ttcggatctc ggcgagcacc gtgccgtggg attcgatgac 594961 ggagtcaaga taggtgtcga agtacagctt gtcgttggcc aggatcggcc ggtggaagcg 595021 gaacttctgg tcgcgatgaa agacccgggc gatgttgatc gggatattga acttggtgaa 595081 gatctccagc tgcacgcgcc ggccggcgat cgccaggaag gtcagcgggg ctaccagcgc 595141 ggggtaaccg gccgctgcgg catccggctc gctgtagtgg gtcgggtggt cgtctttgac 595201 cgcgaccgcg aactcgcgga tcttctcgcg ccccaccaga aagtggtccg gcgcccgata 595261 atgcttgccg atcagtgtct gggcttcttc gggaactgtc atgccgctgc cgccctccgc 595321 tcgaatagtt gctaagccct attgcccggc tcctcctcgc cccgctgcgc gggtcgcatc 595381 gtcgccaggc tgggccctat tgcccggctc ctcctcgccc cgctgcgcgg gccgcatcgt 595441 cgccaggcta acggcgcagc ttatcagcgt gattggcgtc tagaggctag agccgccaac 595501 gcgccgccgg ccgcacccag cgccagggcc gacggaaccc cgatccgagc ggccttgcgg 595561 gcgattcgga aatcacggat ctcccacccc cgttcccggg ccaggctgcg caggcgggcg 595621 tcggggttga tggcgaccgc ggtgcccacc agcgacagca tcgggacgtc gttgtagctg 595681 tcggagtagg cggtgcagcg tttgagattg agtccctccc ggatggccag cgaccgcacc 595741 gcgtgtgcct tgccggtgcc gtgcaggatc tcgccgacca gtctgccggt gaatatcccg 595801 tcgaccgact cggcgacggt gcccagggcg ccggttaggc cgagccggcg ggcgatggtg 595861 gccgcgagtt cgtatggggt agcggtgatc agccatacct gctggccggc gtccaggtgc 595921 atctgggtga gttcgcgggt gccgtcccag atcttgtcgg cgatgatctc gtcgtaaatc 595981 tcctctccca aggccaccaa ctccgcgacg gatcggccct cgatgaacgc gagcgccttg 596041 cgccggccag cggcgacgtc gttgctgttc tccttgccaa gtagctggaa cttggcctga 596101 gcgtaaagaa atccgaggac gtcgcggtag gtgaagtagt ggcgagcggc tagcccgcgg 596161 ccgaagtgca ccgccgacga gccctgaacc aaggtgttgt ccacgtcgaa gaaggcggct 596221 gcggtcaggt cgatcggcgg ctgccgatcg ctgccggcgg cggcgacggg ggccggcatg 596281 tccaccggcg agtggctggc gctggcatcg ggtggcggcg ggtcggccgg cgaagccagg 596341 tcgacgtgac cggcctggtc tgggctaccc aggtgggagg aaaccatcat tactcctaat 596401 cgcggtgcct gcccggtggc cgatgctgcg gccgttatca accctatccg gcaaatgcgc 596461 ggcggagctc ttggctggcg cggattgatc tgcaagccca gcgcggtatc gaaattcgcg 596521 aggccgcagc gactttcgtc gtgaacacga cccgcagcgg ttcggggcca acatgtcagc 596581 cccataccgg tacgcgcaaa gctgggtacg tgaaatcctg aattcttcag cctgtcaacg 596641 gtagcgtcta cgctagctaa cgcaacgaga catccgatta ctacgcacgt taggacattt 596701 caggaggtat cgggaggcct aagggtcact aggtccgcgc gatgggcgga acacgagggt 596761 gaggatgatt tcggttagcg gcgccgtgaa acgcatgtgg ttgctgctgg ccatcgtcgt 596821 ggtggccgtt gtcggggggc ttggtatcta tcggctgcac agcatcttcg gtgttcacga 596881 gcaacccact gtcatggtca agcctgattt cgacgtcccg ctgttcaacc ccaagcgggt 596941 gacctacgaa gtctttggcc ccgccaagac cgcaaagatc gcctacctgg accctgatgc 597001 ccgggtgcat cgactcgata gcgtgtccct gccgtggtcc gtcacggtcg agacgacgct 597061 gcccgcggtc agcgtcaacc tcatggcgca gagtaacgcc gacgtgatca gctgccggat 597121 catcgtcaac ggcgccgtta aggacgaaag gtctgagacc tcgccgcgag cgctaacctc 597181 ctgccaggtg tcatccggat gagcgaaaga cacgccgcac tgacgtcact gccgcccatt 597241 ctgccgcggc tgatccgccg gtttgcggtg gtgatcgtcc tgctctggct gggcttcacc 597301 gcctttgtca atctcgccgt accgcaactg gaagtggtcg gaaaagcaca ctcggtatcg 597361 atgagcccca gcgacgccgc atcgattcag gcgatcaagc gcgttggtca ggtgttcggt 597421 gagtttgatt ccgataacgc ggtaacgatc gtgctggaag gcgaccagcc actcggtggg 597481 gacgcgcacc ggttctatag cgatctgatg cggaagcttt ccgccgatac ccgccatgtc 597541 gcgcacatcc aggacttctg gggggatccg ctgacagcgg cgggatccca aagtgcggat 597601 gatcgggccg cctacgtcgt ggtgtacctc gtcggtaaca acgaaaccga agcgtatgac 597661 tcggtccacg cggtgcggca catggtggac accacaccgc caccgcacgg ggtgaaggcc 597721 tatgtcaccg gtccggcagc actcaatgcc gaccaggccg aggccggaga caaaagtatc 597781 gctaaggtca ccgcgatcac gagcatggtg atcgcagcaa tgttgctagt gatctatcgc 597841 tccgtaatta ccgcggttct cgtcttgatc atggtcggca tcgacctcgg cgcaatccgc 597901 ggattcatcg ccttgctcgc cgaccacaac attttcagcc tttcaacatt tgcgaccaac 597961 ctgctcgttc tcatggcgat tgcggcgagc acggactacg cgatattcat gctcggccgt 598021 taccacgaat cgcgctacgc cggcgaggat cgggaaacgg ccttctacac gatgtttcac 598081 gggaccgccc acgtgatctt gggttcgggt ttgaccattg ccggcgccat gtattgcctc 598141 agctttgccc ggcttccgta ttttgaaacg ctcggcgcgc ccattgctat cggcatgctg 598201 gtcgcggtct tggcggcgct cacgctcggc ccggccgtac tgaccgtggg cagcttcttc 598261 aagctgttcg atcccaagcg gcggatgaac actcggcggt ggcgccgggt gggaacggca 598321 attgtgcgtt ggccggggcc ggtgctcgcg gcgacatgct tggtcgcctc cattggcttg 598381 ctggccttgc ccagttaccg gacaacgtat gatctgcgca agttcatgcc cgccagcatg 598441 ccgtccaatg tgggggatgc ggcggctggt cgacgctttt cacgggctcg gctgaaccct 598501 gaggtgctgt tgatcgagac tgaccacgat atgcgtaatc cggtggacat gctggtgttg 598561 gacaaggtag ccaaaaatat ctaccacagt cccggtattg aacaagtgaa agcgataacc 598621 cggcccttgg gaacaaccat caagcacact tcgataccgt tcatcatcag catgcagggc 598681 gtgaatagta gcgagcaaat ggaattcatg aaggaccgaa ttgatgacat actggtgcag 598741 gtggccgcga tgaatacctc catcgagacg atgcatcgca tgtatgcact catgggcgag 598801 gtcattgaca acaccgtcga catggatcat ctcacgcatg atatgtcgga cataacggct 598861 acgctaagag atcatctcgc ggatttcgag gatttcttcc ggcctattcg cagctacttc 598921 tactgggaaa aacattgttt cgacgttccg ctctgctggt cgataagatc gatattcgat 598981 atgtttgaca gtgtggacca gctgagcgaa aagctcgagt acctggtcaa ggatatggat 599041 attctgatta cactgttgcc gcagatgcgc gcgcagatgc cgccgatgat atctgcgatg 599101 acgacgatgc gggacatgat gcttatctgg catggcacgc ttggcgcgtt ctataagcaa 599161 caggagagga ataacaagga ccccggcgcg atgggccggg tttttgacgc cgcccagatc 599221 gatgattcgt tctatctgcc gcagtcggct tttgagaatc cggatttcaa gcgggggctg 599281 aagatgtttt tgtctccgga cggcaaggca gcccgctttg tcattgctct ggagggagat 599341 cccgcaacgc ccgagggcat ctctcgggtc gagccgatca agcgggaggc tagagaggcc 599401 ataaagggaa ctccattgca gggcgctgcg atctatctgg gtggcaccgc ggcgacgttc 599461 aaggatattc gagagggcgc cagatacgat ctgctgatcg ccggagtggc ggcgataagc 599521 ttgattttga tcatcatgat gatcatcacc cgaagtgtgg tagccgcagt ggttatcgtg 599581 ggtaccgtcg tgctttccat gggcgcctct ttcgggcttt ccgtattggt ctggcaggac 599641 attctgggta tcgagttgta ctggatggtg ttggcgatgt cggtgatcct gctcctggcg 599701 gtgggatccg actacaatct gctgctgatt tcccggttga aagaggaaat tggggccgga 599761 ttgaacaccg gaattatccg tgccatggct ggtaccgggg gagtggtgac ggctgccggc 599821 atggtgttcg ccgttaccat gtcgttgttt gtgttcagcg atttgcgaat tattggtcag 599881 atcggtacca ccatcggcct gggcttgctg ttcgacaccc tcgtcgtgcg ctcgttcatg 599941 acaccgtcca ttgctgcgct gctgggacgc tggttctggt ggccgctacg ggtgcgcccg 600001 cgcccggcca gtcagatgct tcggccgttc gcgccgcgcc gattggttcg cgccttgttg 600061 ctgccgtccg gccagcaccc gtcagcgact ggcgcccatg agtaggcccc aggtggagct 600121 tttgactcgc gccgggtgcg cgatctgcgt gcgggtagcg gagcagctgg ccgaactgtc 600181 cagcgaactg ggcttcgaca tgatgacgat cgacgtcgat gtcgcggcgt cgacgggcaa 600241 tccagggctg cgagctgagt ttggcgatcg gttgccggtg gtcctgctgg acggccgcga 600301 gcacagctac tgggaggtcg acgagcaccg gctgcgtgcg gatatagccc gcagcacatt 600361 tggtagccca cctgataaac gtctaccgta gacaccagtt ttactggggt agtcgaggga 600421 gctggccagg tggtgctgcc gtgagcgtgc tgctcttcgg ggtgtcgcat cgtagcgcgc 600481 cggtcgtcgt ccttgaacaa ctcagtatcg acgaatccga tcaagtcaag atcatcgacc 600541 gagtgctggc ttcgccgctg gtgaccgagg cgatggtgct gtcgacttgc aaccgcgtcg 600601 aggtctacgc cgtagtggac gcgttccatg gcggcctgtc ggtgatcggg caggtgcttg 600661 ccgaacactc cggtatgtcg atgggggagc tgaccaagta cgcatatgtc cgctacagcg 600721 aggcagcagt tgagcacctg ttcgcggttg ccagcggcct ggactcggcg gtgatcggcg 600781 agcagcaggt gcttggtcag gtgcgccgcg cctatgccgt cgccgaatcc aaccgcacgg 600841 tcggccgcgt gctgcacgaa ttggcccagc gggcgctgtc ggtgggcaag cgagtgcact 600901 ccgaaaccgc cattgacgct gccggtgcct ccgtggtgtc ggtcgccctg ggaatggccg 600961 agcgcaaatt gggctcgttg gcgggcacga ccgcggtggt gatcggcgcc ggggcgatgg 601021 gcgcgctgtc ggcggtacat ctgacccgtg ccggcgtcgg gcacattcag gtgctcaacc 601081 ggtcgttgtc ccgggcgcag cggttggccc gaaggatccg cgaatctggc gtgccggccg 601141 aggcgctagc gctcgaccgc ctggctaatg tcctggccga tgccgacgtg gtggtcagct 601201 gtactggggc ggtgcgtccg gtggtgtcgc tggccgatgt gcatcatgcg ctggccgccg 601261 cccgccgtga cgaggccacc cgtccgttgg tgatatgcga cttgggcatg ccgcgtgacg 601321 tcgatcctgc ggtggccaga ttaccgtgtg tgtgggtcgt ggacgtggat agcgtgcaac 601381 atgaaccctc ggcacatgcc gcggctgccg acgttgaggc cgcccgccac atcgtcgccg 601441 ccgaagttgc cagctatctg gtggggcagc ggatggccga ggtcacccca accgtgacgg 601501 cgttgcgcca gcgagccgcc gaagtggtcg aagcggaatt gctgcgcctg gacaaccggc 601561 tgcccggcct gcagagtgtc cagcgcgagg aggtggcccg caccgtacgg cgagtcgtgg 601621 acaagctgtt gcacgcgcct accgtgcgga tcaagcagct cgccagtgcg cccggcggtg 601681 acagctacgc cgaggcgctg cgcgaactct tcgagcttga ccagaccgcc gtcgatgccg 601741 tcgccactgc aggtgaatta ccggtggtgc caagcggatt cgacgctgaa agtcgccgcg 601801 gtggaggcga catgcaaagc agcccgaagc gatcgccgag taactgattg gcgcacgtga 601861 tccggatagg tacccggggc agcttgctgg ccaccactca ggccgccact gtcagagacg 601921 ccctcatcgc tggtggccac tccgcggagt tggtgaccat cagcaccgag ggtgaccgat 601981 ccatggcgcc gatcgccagt ctcggggttg gcgtcttcac cacggcgttg cgcgaggcga 602041 tggaggcagg cctcgtcgat gcggcggtgc attcgtacaa ggatttgccg actgccgccg 602101 atccaaggtt cacggttgcg gcgataccgc cgcgcaatga cccccgcgac gcggtggtag 602161 cccgtgacgg gctgacgctg ggggaattgc cggtcggatc gttggtgggc acatcctcgc 602221 cgcggcgggc cgcacagctt agagcattgg gtctcggttt ggaaatccgc cccctacgag 602281 gcaacctaga taccaggttg aacaaggtaa gtagcggcga tcttgacgcc atcgtggtgg 602341 cccgggctgg tctggcgcgg ctgggccgcc tcgatgacgt gaccgagacg ttagagccgg 602401 tgcagatgtt gcccgcgccg gctcagggcg cgctcgcggt cgaatgccgc gccggcgaca 602461 gccggttggt ggcagtgctg gcggagttgg atgacgccga cacgcgtgcg gcggtcaccg 602521 ccgagcgagc cctgcttgcc gacctggagg caggttgctc cgcaccggtg ggagcgatcg 602581 cagaagtggt cgagtccatc gatgaggacg gccgtgtctt cgaggagctg tcgctgcgcg 602641 ggtgcgtggc ggcgctggac ggatccgacg tgatccgcgc gtccggcatc ggcagttgcg 602701 gtcgggcacg ggagctgggg ctctcggtcg ccgcggagct gttcgagctg ggcgcccggg 602761 agctgatgtg gggagtgcgg cattagcccg catgaagaag tgactgggag tgacaatcat 602821 gacgcgaggg cgtaagccga gaccgggccg catcgttttc gtgggctccg gtccgggcga 602881 ccccggcttg cttacgacac gggctgccgc ggtgctggcc aacgccgcgc tggtgttcac 602941 cgatcccgac gtaccggagc cggtggtggc gctgatcggc acggatctgc cccccgtgtc 603001 cggcccggcg cccgccgagc cggttgccgg gaacggcgat gcggccggcg gaggaagtgc 603061 gcaggaacac ggccgggccg cgtccgcggt agtctccggt ggtcctgaca tccgcccggc 603121 gctgggcgat cccgccgatg tggccaagac gctgaccgcc gaggcccgtt cgggtgtcga 603181 cgtggtgcgg ctggtggcgg gcgatccgct cacggtggat gcggtaatca gcgaggtgaa 603241 cgccgtcgca cgcacccacc tgcacatcga aatcgtgccc ggcctggccg ccagcagcgc 603301 ggtcccgacc tatgccgggt tgccgctggg ttcgtcgcac accgtcgccg acgtgcgtat 603361 cgaccccgaa aacaccgact gggacgcgct ggctgccgca cccgggccgc tgatcctgca 603421 ggccaccgca tcgcatctag ccgaatcggc ccgcagcctg atcgatcacc agctggccga 603481 gtccactccg tgcgtggtga ccgcacacgg caccacctgt cagcagcgtt cggtcgagac 603541 cacacttcag ggattgaccg acccggccgt cctgggcgct accgaccccg cgtgctccgc 603601 aaacgggagg gactcccagg ccggaccgct gatagtgacc atcggcaaga cggtgaccag 603661 tcgggcaaag ctgaactggt gggagagccg cgccctctac ggctggacgg tgttggtgcc 603721 gcgcaccaag gaccaggccg gcgagatgag cgagcggctc acgtcgtacg gcgcgctgcc 603781 ggtggaggtg ccgaccatcg ccgtcgagcc gccgcgcagc cccgcgcaga tggagcgcgc 603841 cgtcaagggc ctggtcgatg gccgattcca gtggatcgtg ttcacctcca ccaacgcggt 603901 gcgtgcggtg tgggagaagt tcggcgagtt cggtctggat gcccgcgcgt tctccggggt 603961 gaagatcgcc tgtgtcggcg agtcgacggc cgaccgggtg cgcgccttcg gaatcagtcc 604021 cgagctggtg ccctccgggg agcagtcctc gcttggcttg ctagacgact tcccgcccta 604081 cgacagcgtt ttcgacccgg tgaaccgggt tttgctgccg cgcgccgaca tcgccaccga 604141 aacgctggcc gagggactgc gagagcgtgg ctgggagatc gaggacgtca ccgcctaccg 604201 gaccgtgcgg gccgcgccgc cgccggccac tacccgggaa atgatcaaga cgggcgggtt 604261 tgacgcggta tgtttcacct ccagctcgac ggtgcgaaac ctggtcggca tcgccggcaa 604321 gccgcacgcg cggacgatca tcgcctgcat agggccaaag accgccgaga ccgcagccga 604381 gttcggcttg cgggtcgatg tccagccgga caccgccgcc atcggcccgc tggtcgatgc 604441 gctggccgag catgccgccc ggttgcgcgc tgagggtgcg ctgcccccgc cgcgcaagaa 604501 gagccgcagg cgctagtggc ccaccctcgt caggtgagcg tgcgtgtctg tacaccgaca 604561 cgccgaccga gctggcattt tgcgtacgct cgcggctacg aatgagcatg agttcctatc 604621 cgcggcagcg accgcgccgg ctccgctcca ccgtcgcgat gcgccgtctg gttgcgcaaa 604681 cctcgttgga gccaaggcat ttggtgctgc cgatgttcgt tgccgacggc attgacgagc 604741 cgcggccgat tacctccatg ccgggcgtgg tacagcacac ccgggattcg ctacgtaggg 604801 ccgcggcagc cgcggtggcc gccggcgtgg gtgggctgat gcttttcggc gtgccgcgcg 604861 accaggacaa ggacggtgtc ggttcggcgg gcatcgaccc cgacgggatc ctcaacgtcg 604921 cccttcgcga tctggccaag gacctgggtg aggccacggt gttgatggcc gacacctgtc 604981 tggacgagtt caccgaccac gggcactgcg gtgtgctcga tgaccggggc cgggtcgata 605041 acgacgccac cgtggcccgc tatgtggaac tggctgtggc gcaagcggaa tcgggcgccc 605101 acgtggtcgg acccagtggg atgatggatg gccaggtagc cgcgatccgg gacggtttgg 605161 acgccgccgg ctacatcgat gtggtgatct tggcctacgc cgcgaagttt gcttcggcgt 605221 tctacggccc gttccgcgag gcggtgagct ctagcctgtc cggggatcgg cgcacctacc 605281 agcaggagcc gggcaacgcc gccgaggcgc tgcgtgagat cgagctcgat ctcgacgaag 605341 gcgccgacat tgtgatggtc aaacccgcga tgggctacct cgatgtggtg gcggccgcgg 605401 cggacgtctc gccggtcccg gtggccgcct atcaggtctc gggagagtac gcgatgattc 605461 gtgcggcggc ggccaataat tggatcgatg agcgtgccgc ggtgctagag tcgctgaccg 605521 gtatccggcg tgccggcgcc gacatcgtgc tcacctactg ggcggtagac gcggcgggct 605581 ggcttacgtg acggaggcct gacatgacac caaccgggga taccaagccc aagttgttgt 605641 tctacgaacc cggcgcgagc tggtactggg tgctgactgg tccgcttgcg gcggtgtcgg 605701 tgctcctcct cgagatatcc agcggcgccg gggttgggtt gataacgccg gcgatctttc 605761 tggtgatggt gtcggcgttc gtggcattgc aggtgaaggc ggcgcggatt cacacgtcgg 605821 tcgagctgac gcatgatgcc ttgcgccaag gcaccgagac catcaggctg gccgaaatcg 605881 tcaaaatcta tccggaggca gacggccgcg agacgtccgg ggaagagccg gcaaagtggc 605941 agtcggcgcg gaccctgggc gagctcgtcg gcgtaccgcg cggccgggtg ggaatcgggc 606001 tgaagctgac cggaggccgc accgcccagg cctgggcgcg tcgtcatcaa cagctgcggg 606061 cggcgctgac tccgctggtt caggagcggc tcgggcccgt ggattctgat gtcgccgacg 606121 tcaacggtga cgacgccggg ccagcgcggt gatcgcccgc taccgggccg gggccgaact 606181 gttcctggct tgtgccgcgc ttgccggatc tgcggcgagc tggtcgcgga cccgctccac 606241 cgtggccgtc gcgcccgtca tcgacggcca gccggtcacc ctgtcggtgg tctatcaccc 606301 gcaaccgttg gtgctgaccc tgctgctggc gacgatcgcc ggcgtgttgt cggtggtggg 606361 gacggccagg ttgcggcgcg cgcgagctgg cttgaacgca catccggacg gcttgaacca 606421 gcgtccgccc ggcggttggt gtcattgagc cgtttgcgtg gatcacttcc gctgctgctt 606481 gatcgggccc tggtctgtgt cggcagcggc tggtagtatc gaaagtatgt tcgatcaggt 606541 gcgggggcgc atgccttcac cggaggcgat cgctcatttt gatgagcggt ttgaatgcca 606601 tgctccgcgg accacgaggg tgtcggcggc gttcatcgat cggatctgct cggcgactcg 606661 ggccgaaaac cgggccgctg cggcgcagtt ggtggcgttg ggggagttgt tcgcctatcg 606721 gtggtcgcgt tgcgggggcc gcgaggagtg ggtgatggac accatggcgg cggtggccgc 606781 cgaggtggcg gcggcgttgc ggatcagtca gggtctggcg gccagccggt tgcggtatgc 606841 gcgggcgatg cgtgagcggc tgcctaagac ggctgaggtg tttagcgccg gcgacatcgg 606901 ctatctgatg tttgccacga ttgtgtatcg caccgacttg atcgttgacc ctgatgtttt 606961 ggcggcggtg gatgcgcagt tggccgccaa tgtggcgcgt tggccctcga tgaccaaggc 607021 ccgcctggct gggcaggtcg ataagatcgt ggcgcgtgcc gatgccgatg cggtgcggcg 607081 gcgcaaggag tatcaggccc agcgccagtt ctgggtcggg gaaagccaag acggtgtgtg 607141 ccagatcggt ggcagcctgt tggccgtcga cgcacacgcc ctcgatgcgc ggttgagcgc 607201 gttggcgggc accgtgtgtg agcacgatcc gcgcagccgt gagcagcgcc gcgcggacgc 607261 gttgggggcg ttggcgggcg gggccgatcg gctgggctgt ggctgtgggc gcgctgattg 607321 tgcggccggg aagcggcctg cggccccgcc ggtggtgatt cacctgatcg ccgaggcggc 607381 cacgatcaat ggcacgggct cggcgccggc atcgcagatg aacgccgacg ggctgatcac 607441 cgccgaactg gtggccgagc tggccaagac ggccacgctg gtgccgctgg ttcatcccgg 607501 cgatgcgccg cccgagccgg ggtatgcgcc gtcgaaagcg ctcgccgatt tcgttcgctg 607561 ccgggatctg acgtgtcgct ggcccggctg tgatgagccc gccaccaatt gcgacctgga 607621 tcatacgatc ccgtatgccg ctggtgggcc cacccatgcg tcgaacctga aatgttactg 607681 ccgtacccat cacctggtga aaacgttttg gggatggcgt gatcaacagc tacccgacgg 607741 caccctgatt ttgacctccc cgtccgggca tacctatgtc agcaccccgg gcagtgcgct 607801 gctgttcccc agcttgtgcc acttcagcgg cggcatcccg gcaccggaag ccgacccacc 607861 ctacgaccat tgcgaccagc gcacagcgat gatgcccaaa cgccggcgca cccgcgccca 607921 agaccgggcc tatcgcatcg ccaccgaacg tcgacaaaac cacgccgccc gccagcgcgc 607981 ccaggtgctc acccagaccg ccgcggccac cgacacccac ggcccaccac cggatcacaa 608041 cgacgaccca ccgccgtttt aggctgacct gctgattagc ggtagcacca gctgacggcg 608101 gcggtcgatg gcgtcagcca ggtcgtggag cgctttatgc accgagcgcg ccatcgggaa 608161 catggattca tgctcgccct ggtcacagcg gccacctagc tgttcgacta ctgcggggct 608221 cgcgactaat gcccactgga cgccggcggc tcggcagtcc tcatcgagga tgcacaagag 608281 cgagatgccg gccccactga agtgactcaa ctcgctcagg tcgagcacca tcggatttgt 608341 tccgaggctg aaacgccgga cgtgctcgct gatctgctcg acattggcgg cgtcgatctc 608401 gcctcggatg gtcaccactg tcgccaggtg atgcaggtag gcccgaatct gagcgccacc 608461 gtagtcaacg gcggcatttc cgggccgcgt cgtgacgctg caagccgatt ttgacgtcgg 608521 gatcgtggta gtcatcaata gcctcgttct ccgtcgcgtt gcgggccgac cgatcgccgg 608581 ctaaagctgc ctttaaccaa acccgcaaaa tctaagggga gcgaaagccg cctctaactc 608641 tttgctaaga agcgattttc ggggtgctcc cggcgaccca cgccgtcgcg gccatggcgc 608701 tgttaggctg cgatggctgc cggttgctag tcgggggctg atgatatggc cggtggtatg 608761 gatcagccgc ccggtcagcc tagaaggcgg accagacagc agagttcaga cggaaagaac 608821 ggcgtgcgcg ctgcagagat caccggagaa attagggccc tgacaggatt gcgcatcgtc 608881 gcggcggtgt gggtagtgct gtttcacttc cgaccgatgt tgggtgatgc gtcaccgggc 608941 ttccgcgacg ccctcgcgcc ggtgctcgac tgcggcgcgc agggtgtaga cctcttcttc 609001 atcctcagtg ggttcgtgct gacctggaac tacctcgacc gcatgggccg gtcgtggtcg 609061 gtccgtgcca acctgcactt cttgtggctg cggctggcca gggtgtggcc ggtgtacctg 609121 gtcaccttgc acctggccgc cgtgtgggtc atctttacgc tgcacgtcgg tcacgtgccg 609181 tctccggagg caggccagct gaccgcgatc agctatgtgc gccagatcct gctggtgcag 609241 ctgtggtttc agccgtattt cgatggatcc agttgggatg gaccggcctg gtcgatcagt 609301 gcggaatggt tggcctactt gctgttcggt ctgctcattc tggtcatctt ccggatgaag 609361 cacgccacca gggcgcgggg cctgatgtgg ctggccttcg cggcgtcgtt gccgcccgtg 609421 gtgctgctgt tggccagcgg ccagttctat acgccatgga gctggctgcc ccgaatcgtg 609481 acgcaattcg ccgcgggagc gctggcgtgt gccgccgtcc gcaggttgcg gccgaccgat 609541 cgcgctcgcc gcatcgccgg gtacctttcc gtgctggtcg gcgtcgcgat tgtcggcatc 609601 ctctacctgt tgcacgcgca tccgctcgcc ggggtcgagg acagcggcgg ggtggtcgac 609661 gtgctgttcg ttccgctggt gatcagcctg gcgattggcg tcggcagcct gccggcgttg 609721 ctgtcgacgc ggttgatggt ttttggcggg cagatctcgt tttgcctcta catggtgcac 609781 gagctggtgc ataccgcctg gggatgggcc gtgcaacaat acgagcttgc gctgcaggat 609841 cagccgtgga aatggaacgt cgtcggtctg ctcgcgatcg ccctgggggc tgcgatcttg 609901 ctgtatcact tcgtcgaaga accgggccgc cgatggatgc gccggatggt cgacgtcaaa 609961 gccgcgagtg cgagaagcga gcccggggag ccggtaggca gcacgcgtta tcaaatcgac 610021 gatgcgctgg aaggggtttc ggcccgcgcg gtgtgacggt tgagtggggc tgcagcgggt 610081 cgacgcgagt tcacatcggt ttcctcgtac gattcccttt atttggacgc ggcgcacgac 610141 ccgttcaact ttgagccgag tccagtggag ccatcagtgg agtcagtgtg agtcgcccgg 610201 gtacatacgt cattggtctc actctcctgg tcggcctggt cgtcggcaat ccagggtgcc 610261 cgcggtccta ccgcccactg accctggatt accggcttaa cccggtcgcg gtgattggcg 610321 actcctatac caccggcacc gatgagggcg gtctgggctc gaaatcatgg accgctcgca 610381 cctggcagat gctcgctgca cgtggcgtgc ggatcgcagc cgacgtggcc gccgagggcc 610441 gggccggcta cggggtgccc ggcgaccacg gcaacgtgtt tgaggatctg accgccaggg 610501 ccgtccagcc cgacgatgca ctggtggtgt tctttggctc ccgcaacgac caaggcatgg 610561 atcctgagga tcccgagatg ctggccgaaa aggtccgcga cactttcgat ctagcgcgcc 610621 accgcgcacc atccgcgagc ttgctggtga tcgcaccgcc gtggcctacc gccgacgtac 610681 ctggcccaat gctgcggatt cgcgacgtgc tgggcgctca ggcgcgggcc gcaggagcag 610741 tgtttgtcga cccgatcgcc gaccactggt ttgtcgacag gcccgagctg atcggcgcgg 610801 atggcgtgca tcccaacgat gcgggacatg agtatctggc ggacaagatc gcgccgctga 610861 tcagcatgga gttggttgga tgagttggga gtcacgagcc acgcaaaggg tttagcgtga 610921 cgacggtcga cgtgctagtc ctctgcgtgc cgttcgtaat cccaacgctc aaggcgcgcc 610981 tgcaactgca ggagaccaag tccggcgagt ggcgccgcgg cggtgaggaa ggccagcagc 611041 atcggactca tctcagaacc tccaaaacca tttcattcgt accacgttcg tcgtcgaggg 611101 gtggttcttt cgcgaaacat gtccgtccga attcagctgt cctcagccac cgccacgctg 611161 cgccacgtca gctaggacgc catccaagcc agttcgccgg gcaactgttc gcgccagtac 611221 gacgcgtcgt gtcctccggg cgagaagctg ccggcaggcg gttggtgcag ttggttgacg 611281 aattggcgag tggcgaagta gaagcggtcg ctggtgccgc aatccacccg tagcgggatt 611341 gagttcagcg cgggcaggcc caacacgctg tgttgcacat agtcgtcgta gctgtcgaac 611401 gccccgggtg tgctgccggt gaacgacgtg aacaatgccg ggctgatggc acagatcccc 611461 gcggttctgg ccggacccaa ccgggcaccc aggagcagcg cgccgtatcc ccccatcgac 611521 caccccagga atcccacccg ggaggtgtcc atacccatcg aggtcagcat cggcagcagc 611581 tcgtcgagca ccatcgcacc cgagtccccg ccggaagagc gacggtgcca gtaggtgttg 611641 ccgccgtcga cgccgaccac cgcgaacgct ggcttgccct ccttgaccag gcgggccaac 611701 ccctgctcga cgccgagatc cagcatcatg ccggcgttgc cgtccttgcc atgcagtgcg 611761 atcactggcc gcagctgccc gctctggccg ggcggcatgg agatcaccca gttggtcttg 611821 atgcctccgc gagccgccga gatgaacgag ccggagatcc tggtcggcaa gctgctgccc 611881 gccgtcgggg gctcgaacgg cgccggggcc gcctgcggct caagtgggtc caccagggcg 611941 ccgaaggccc acacgccggc ggctcccgcg ccggcgccgg caccccaacg gagcagggca 612001 cggcgggtca ggtctgccat gggcgtcatg atgccgcgcc gatcggtgtt gcccgcacag 612061 ccacgccgta gcaccggcca atcgtgacac cggtaacggc tggcgagtcg ccgtagtggg 612121 ggcccggctg cgcagcagtg acggcatgaa gaactttcgc aaaactggaa acggctggta 612181 ccggaagtcg gtattctttg cgcggcagct gcgtgtcaat gatgaccgag cggtagcccg 612241 gtcgtccctg gtgtatggga gggtgttcga tcacctgcct caacatctcc gaagtgccga 612301 acgagaccaa ccgtaagaag aaccgtcagg ccggactcga ccgcagtatc cgggtgattc 612361 atggcagctt cgacgacatt cccgagccgg acagcggcta tgacgtcgtc tggtcacaag 612421 atgcgatcct gcacgcgccc gaccgccgaa aggtgctcga ggaggcattc cgggtgttgc 612481 ggcccggcgg cgaactgatc ttcaccgatc cgatgcaggc cgacgatgtt cccgacggtg 612541 tgctgcagcc ggtctacgac cggctcaacc tgcgtgacct tggctcgatg cgcttctatg 612601 cgtgaagccg cacaggcact cggtttcgag gtgctcgacc aaagagacct ggttcgcaat 612661 ctgcggacgc actacagccg agtgttcgag gaactcgaag cccggcgtct cgaactcgag 612721 gggaagtcct cccaggagta cctcgacaag atgcgggtag gcctgaagaa ctgggtcgag 612781 gccgccgaca acggtcactc tcgcgtgggg catccaacat ttccgagaac ccgcctgact 612841 ccgatatgcc agctgcccac ggccgcgatc gactcgacgg ctggtcgtcg ccggtatcgt 612901 tgaccccacg gactgcgtga cagccggggg cacggagttg cccggcggcg ccagtactgc 612961 ccccgacgga ccggaaggca ggtgccatag ctaccacttc aggactgcgc ccaggactgt 613021 cgcagcgtca gctcaacatg atcgctatcg gcggcgtcat cggtgctggc ttgttcgtcg 613081 ggtctggtgt ggttatccgt gcgaccggtc cggcggcatt cctgacctat gcgctgtgcg 613141 gcgcactgat cgttctggtg atgcgcatgc tgggcgagat ggccgccgcc aatccgtcga 613201 ctggagcgtt cgccgactac gcggcaaaag ccctgggcgg ctgggcggga ttctcggttg 613261 gctggctgta ctggtacttc tgggtaatcg tcgtggggtt cgaggcggtt gccggcggga 613321 aggttctaac ctactggatc gatgcgccgc tgtggttggc gtcgctgtgt ctgatgatga 613381 tgatgaccgc gacgaacttg gtctcggtgt catccttcgg tgagttcgag ttctggttcg 613441 ccggagtcaa ggttgccacc atcgtcggct tcctggtcct tggcaccgct ttcgccttcg 613501 ggctgctgcc gggccatggc atggatttca gcaacctcag cgcgcacggt ggcttctttc 613561 ccgacggggt aggtgccgtc ttcgctgcca tcgtggtcgc gatcttctcc atgactggca 613621 cggaagtagt caccatcgcc gcggctgaag cgccggaccc tcaacgagcg gtccaacgcg 613681 cgatgagcac ggtggtggca cgcatcgtga tcttcttcgt cggctcggtc ttcctgctca 613741 cggtgatcct gccgtggaac tcgttggagc ttggcgcctc cccgtacgtt gccgcgctgc 613801 ggcacatggg tattgggggt gctgatcaga tcatgaatgc cgtcgtgctt accgcggtgc 613861 tgtcctgctt gaactcgggc ctgtataccg cgtcgcggat gctgttcgtg ctcgccgccc 613921 ggcaggaggc gccggcccag ctggtcaaag tcaaccggcg tggagtcccc accttcgcga 613981 tcatgggatc gtccgtggtg ggattcctgt gcgtgatcat ggcatgggtc tcacccgcaa 614041 cggtattcgt tttcctgctc aactcgtcgg gcgctgtgat tttgttcgtc tacctgctta 614101 tcgcgctgtc gcagatcgtg ttgcgtcgcc agacatctgg ccaaaatctg ggggtacgga 614161 tgtggctttt cccggggctg tcgatcgtca cggtgaccgg aattgtcgcc gtgctggcgc 614221 ggatggcgtt cgactacgcc gcgcgcagcc agctctggct cagcctgctg tcctgggcag 614281 tggtcgttgg gtgttatttg gtcaccacat tggtgcgacg tccccttaat cggccttggt 614341 gagcagtacg gcctcgtcga acggcagtct ggcaaagacc ggccgccatc ggctgctgac 614401 atacggcgcc gcctcggcct tggtgagccg ccgcgggttg gcgacaccaa aggttttgcc 614461 gtagcggcgc atccggccac cgccggccgc ggtgatgttc ttcagccagt cgcggttcgg 614521 accgtaggtg agcaaaatcg ccacgcccgc ccggccgtcg acgtccgcgc tgaacacgtt 614581 caacggggta cggtacggct tgcccgagcg gcggcccacg tgctcaagaa tcgcgaacgc 614641 cgggagccag ccggcccata gccgctgaat ggggttggtg acatatcgat tgaaccgagc 614701 cagccactgc ggtagttgca tgcccaccat ccaactcgtg gaccggccgc ggcatcaagc 614761 aaacctctgg tggctgcggc aaactcttac accctgtagt tgagcgacct gggcaggctg 614821 gaacactagt cgtcatgggc agcacggaac aggccacctc gcgggtaagg ggagccgcgc 614881 gcacatcggc gcagctgttc gaggccgcat gcagcgtcat acccggcgga gtgaactccc 614941 cggtgcgggc gttcacggcg gtgggcggca ccccgcgctt cattaccgaa gcccacggct 615001 gctggttgat tgacgccgac ggcaaccgct acgtagacct ggtctgctca tggggcccga 615061 tgatcctcgg tcacgcgcat ccggccgtcg tcgaggcagt ggccaaggcc gcagcccgcg 615121 gcctgtcctt cggggccccg actcccgccg aaacccaact agccggcgag atcatcggcc 615181 gggtagctcc cgtcgagcgg atacggctgg tgaactccgg caccgaggcc actatgagcg 615241 ccgtgcggct ggcccgcgga ttcaccggcc gggccaagat cgtcaagttc tccggctgct 615301 accacggaca cgtcgacgca ttgctcgccg acgcgggttc gggagtggcc accctgggct 615361 tatgtgacga cccccagcgc ccggcttcgc cgcgctcgca atcgtcacgg ggcctgccgt 615421 cctcccccgg ggtcactggc gccgcggcag ccgacacgat cgtgttgccc tacaacgaca 615481 tcgatgccgt acagcagacc ttcgcccggt tcggcgagca gatcgccgcc gtaatcaccg 615541 aggccagccc cggcaacatg ggagtcgtcc cgcccgggcc cggcttcaac gcggcgctgc 615601 gcgcgatcac cgccgagcac ggcgccctgc tcatcctcga cgaggtgatg accgggttcc 615661 gggtcagccg aagtggttgg tacggaatcg atccggtgcc cgctgacctg ttcgccttcg 615721 gcaaggtgat gagcggcggg atgcccgccg ccgcgttcgg cgggcgcgcc gaggtgatgc 615781 agcggctggc gccgctgggg ccggtgtatc aggccggcac gttgtcgggt aacccggtgg 615841 cggttgccgc cgggctggca acgctgcggg ccgccgacga cgcggtctac accgcattgg 615901 acgccaacgc tgaccgcctg gccggcctgc tctccgaggc actgacggat gccgttgtgc 615961 cacaccagat ttcgcgggca ggcaatatgc tcagtgtgtt cttcggcgaa acaccggtga 616021 ccgacttcgc gtccgcgcgg gccagccaga cctggcgtta tccagcgttc tttcatgcca 616081 tgctggacgc cggtgtctac ccgccgtgca gtgccttcga ggcatggttc gtctcggccg 616141 ctttggacga cgcggcgttc ggccggatcg ccaacgcgct gcccgccgcg gcccgagcgg 616201 cggcccagga aaggcccgcc tgatgcccga ggaaacccaa gtccacgtgg tgcgccacgg 616261 tgaggtgcac aaccctaccg gcatcctgta cgggcggctg cccggattcc acctgtccgc 616321 aaccggcgcg gcgcaggccg ccgccgtcgc cgacgcgctg gccgaccgcg acatcgtcgc 616381 ggtaatcgca tcgcccttgc agcgtgccca ggagaccgcc gcgcccatcg ccgcccggca 616441 tgaccttgcg gtggagacag acccggatct gatcgaatcg gccaacttct tcgagggccg 616501 ccgcgtcggc cccggtgacg gggcatggcg cgacccgcgg gtgtggtggc agctgcgtaa 616561 cccgttcacc ccgtcgtggg gtgagcctta cgtggatatc gctgcccgaa tgacgaccgc 616621 ggtggacaag gcacgtgtcc gcggcgccgg ccatgaggtg gtgtgcgtca gccatcagct 616681 gccggtgtgg acgctgcggc tgtatctgac cggtaagcgc ctctggcacg atccgcgccg 616741 tcgggactgc gcactggcct cggtgacgtc gttgatctac gacggcgacc gcctggttga 616801 cgtggtgtat tcgcagccgg cggcgctttg accgcgccgg cgacgatgca gagcagagcg 616861 accagaagga gcggcgcttt gaccatgcgc cggctggtga tcgccgcagc ggtatcggca 616921 ttgctgctca ccggctgttc cgggcgcgac gccgtcgccc aaggcggcac gttcgaattc 616981 gtctcgcccg gcggaaagac cgacatcttc tacgatccgc ctgccagccg cggccgcccg 617041 ggcccactgt ctgggccgga gctggcggat ccggcgcgca gtgtgtcgct ggacgacttc 617101 cctgggcagg tcgtcgtcgt caacgtgtgg gggcaatggt gtgggccgtg ccgggccgag 617161 gtcagccaac tacagcgggt gtatgacgcc acccgaggtg cgggtgtgtc gttcctcggg 617221 atcgacgtgc gcgacaacaa ccgccaggcg ccccaggact tcatcaacga ccggcatgtg 617281 acgtacccgt cgatctatga cccggcgatg cgcaccttga tcgcattcgg tggcaaatac 617341 cccaccagcg tcattccgtc cacgctggtg ctggaccgtc agcaccgggt cgcggcggtg 617401 tttctgcgcg aattgctggc tgcggacctg cagccggtgg tcgagcgggt ggccgaggag 617461 gagccgtcgg gtcgggctcc ggtgggggcg caatgaccgg gttcaccgag attgccgcgg 617521 tggggccact gctggtggcg gtgggggtat gtctgctggc tggtctggtg tcgttcgcct 617581 caccatgtgt ggtgccgctg gtgcccggct acctgtcgta tctggcggcc gtcgttgggg 617641 tggacgagca gctgccggcc ggcgtcgtca aacccccggt ggctgcccgc tggcgggtcg 617701 ccggatcggc ggcgctgttc gtggcggggt tcacgacggt gttcgtgctg ggcaccgtcg 617761 ccgtcttggg catgaccacc acgctgatca cgaatcagct gctgctgcag cgggtcggag 617821 gcgtgctgat cgtcgtcatg ggcctggtgt tcgtggggtt catcggagcc ctgcagcgcc 617881 aggcgaggtt cacgccgcgc cagttgacga gcgtagcggg ggcgccggtg cttggcgcgg 617941 tgttcgcgct cggctggaca ccgtgcctgg ggccgacgct gaccggggtg atcaccgttg 618001 cctcggccac cgagggtgcc agcgtggcgc gtgggatcgt gctggtgatt gcctattgcc 618061 tggggctggg gattccgttc gtgcttttgg cgttcggttc ggcgtgggcg gtggcgggcc 618121 tgggctggct gcgccggcac accagggcca tccagatctt cggcggggcg ctgctgatcg 618181 cggtcggtgc cgcgctggtc accggggtgt ggaacgacgt cgtgtcgtgg ctgcgcgacg 618241 ccttcgtttc cgacgtgagg ttgccgattt gagtgggcag ggtgccgcgc aaaaggcgcg 618301 caacatgtgg cggtcgttga cgtcgatggg caccgcgctg gtgctgctgt ttttgctcgc 618361 gctggctgcc atacccgggg ccctgctgcc gcagcgtggc ctcaacgccg ccaaggtgga 618421 cgactacctg gccgcgcacc cactcatcgg tccgtggctg gacgagctgc aggccttcga 618481 cgtgttctcc agcttctggt tcaccgccat ctacgtgctg ctgttcgtgt ccctcgtcgg 618541 ctgtctggcc ccgcggacga tcgagcacgc ccgcagcctg cgggctacac cggtcgccgc 618601 cccgcgcaac ctggcccggc tgcccaagca cgcccacgcc cggctggccg gcgagcccgc 618661 cgccctggcc gccaccatca cgggccggct gcgcggctgg cgcagcatca cccggcaaca 618721 aggcgacagc gtggaagtct ccgccgagaa gggctacctg cgcgagttcg gcaacctggt 618781 gttccacttc gcgctgctgg gtctgctggt ggcggtggcc gtcggcaagc tgttcggcta 618841 cgagggcaac gtgatcgtga tagccgacgg cggacccggt ttttgttcgg cgtcgccggc 618901 cgcgttcgac tcgtttcgcg ccggcaacac cgtcgacggc acgtcgttgc acccgatctg 618961 tgtgcgggtc aacaacttcc aagcgcacta cctgccgtcc gggcaggcca cctcgttcgc 619021 cgccgacatc gactatcagg ccgacccggc cactgctgac ctgatcgcca acagctggcg 619081 gccctaccgg ctgcaggtca atcacccgct gcgggtcggc ggcgaccggg tgtacctgca 619141 gggccacggc tatgcgccca ccttcaccgt gacgttcccg gacgggcaga cccgcacgtc 619201 gaccgtgcag tggcgacccg acaacccgca gaccctgctg tcggcgggcg tcgtgcgcat 619261 cgacccgccg gccggcagct accccaaccc cgacgagcgt cgcaaacacc agatcgccat 619321 ccagggcctg ctggctccca ccgagcagct cgacggcacc ctgctgtcgt cgcgtttccc 619381 cgcgctcaat gccccggcgg tggccatcga catctaccgc ggcgacaccg gcctggacag 619441 cgggcggccc cagtcgttgt tcaccctgga ccaccggctg atcgagcagg gccggctggt 619501 caaggaaaag cgggtcaacc tgcgcgccgg tcagcaagtc cgcatcgacc aaggcccggc 619561 ggccggcacg gtggtccggt tcgacggcgc ggtgccgttc gtcaacctgc aggtctccca 619621 cgaccccggc cagtcctggg tgctggtctt cgcaatcacg atgatggcgg gactgctggt 619681 gtcgctgctg gtgcgcaggc gccgggtgtg ggcgcggatc acgccgacga ccgcgggtac 619741 ggtaaacgtc gagctgggcg gcctgacgcg caccgacaac tccgggtggg gcgccgagtt 619801 cgagcggctg accgggcggt tgctggcggg ttttgaggcg cggtccccgg acatggccga 619861 agcggccgca gggaccggaa gggacgtcga ttgaacacgc tgcacgtcaa cgtcggcctg 619921 gcccgctact ccgactgggc gttcacctcg gccgtggtgg cgctggtggt cgcgctgctg 619981 ctgctggcgt tcgagttcgc ccaggttcgc ggtcgcggac tcgcgccgct ggccgtgccg 620041 gccggatcgg tggccaccga tagcgctacc cctgggatcg tggcggacca acggcaccgg 620101 ccgttcgacg aacgcgtcgg gcggggcggg ctggccgtcg cctatctggg catcgggcta 620161 ctgctggcgt gcgtcgtgct gcgcggcctg gccacccagc gggtgccgtg gggcaacatg 620221 tacgagttca tcaacctgac ctgcttgtcc gggctcatcg ccggcgcggt cgtgctgcgc 620281 cgtgcgcgat accggccgct gtgggtcttc ctgctggtcc cggtgctgat cctgctcacc 620341 gtgtccggac gctggctcta cgccaatgcc gccccggtga tgccggcact gcagtcctac 620401 tggctgccca ttcatgtgtc ggtggtcagc ctcggttctg gggtattcct ggtcgccggt 620461 gtcgccagca tcctgttcct tgtgcgcaca tcgcggctgg gtgagccaac cggtgaaggc 620521 gcgctggcgg gtatggtgcg gcggctcccc gatgcccaaa ccctggacgg aatcgcctac 620581 cggaccacga tcttcgcctt ccccgttttc ggcttcgggg tgatattcgg tgccatctgg 620641 gccgaggaag cctggggccg ctactggggc tgggacccca aggagacggt gtccttcgtc 620701 gcgtgggtgg tgtacgcggc gtacctgcac gcgcggtcaa cggcgggttg gcgggaccgc 620761 aaggccgcct ggatcaatgt cgccggcttc gtggccatgg tcttcaatct gttcttcgtt 620821 aacctggtga ccgtcggcct gcactcgtat gcgggcgtgg gctgaccgtt cgtctgcaac 620881 cgacccgagg accgcagcaa gggggagtgc tggtgaccga gcatccgagg acgggcgtgg 620941 gagcccccga tagcggcaac ggcggcacgg atcatccgac cgtgcagttg ccgcccgtgc 621001 catccgtggg ggcaccaccg gctgcggccg gtggtgaaac accgactagg tcagttgcgg 621061 gattccgcac ccagcggctc gacccgacgg cctacggcgc ctactacagc ggccccgatg 621121 agggcccggc cagcccggct gaaaggccgc cgtatcgtct cgagccggtg ccccatacgc 621181 cgtatccgga actggccacc accacgctgc tgaggccggt caagccgcca ccgtcggaag 621241 gctggcgtcg gttgctctat ctgctgtcgg gtcggctgat caacgccggg gaaggccctc 621301 gggccgcgca cctcaacgac ctggtcgctc aggtcaaccg cccgctgcgc ggctgctacc 621361 ggatcgcggt gttgtcgttg aaagggggtg tcggcaagac cacgatcacc gcgaccctgg 621421 gggccacctt tgccgacctg cgcggtgacc gggttgtcgc ggtcgacgcc aatcccgacc 621481 gcggcacact gagccaaaag gtcccgctcg agacgccggc cacggtgcgg cacctgctgc 621541 gcgacgccga cggcatcgag cgctacagcg acgttcgcgg ctacacatcg aagggaccca 621601 gcgggctgga agtgctggca tcggacagtg atccggcctc ctcggacgca ttcagcgccg 621661 acgactacac ccgcaccctg gacattctgg agcggttcta cggcctggtg ctcaccgact 621721 gcggtaccgg gttgctgcac tcggcgatgt cggcggttct gcctaggtcc gacgtactgg 621781 tcgtggtcag ctcggggtcc atcgacggcg cccgcagcgc cgcggcgacg ctggactggc 621841 tgcaggccca cggccacgac gaccaggtgc gcaactcgat cgccgtcgtc aacgcggtgc 621901 ggccgcgcgc gggcaaggtc gacgtgggca aggtcgtcga gcacttctcc aggcgttgcc 621961 gtgcggtgcg cgtggtgccg ttcgacccac acctcgaaga aggcgccgaa atcgcgctgg 622021 atcggttgcg gcgggagacc cgcgaagcgc tcaccgaact ggcagcggtg gtggccgctg 622081 gattccccgg cgacccgcgg cgctgcaaac cgagcttcac ctaggaacgg ttattgtccc 622141 cgtgccccaa ccgccgcagg aactctggat cgtcgtcggg cccgatgacg cgagtcttgg 622201 gccggttcat ctgtgcccgt gcagcgcgcc agccaaggta gatcagcgtc gccaaaatca 622261 gcacgaggag caggtagagc actcgacacc tccttggacc gaatataccc gcgccgtagg 622321 ctcaggctgt gtcagaagcg cctaacgaca agaccactcg gggtgttgtc gacatactgg 622381 tctatgcgac ggcgcggctg ctgctggtgg tggcggtcag cgcagcgatt ttcggggtcg 622441 cgcgactgat cgggttgacc gaattccccg ttgtcgtggc cacgctgttc gggctgatca 622501 tcgcgatgcc gttgggcatt tgggtgttca gcccgctgcg gcggcgcgcc acggccgcgc 622561 tcgcggtggc cggtgagcgt cggcgcgccg agcgggaacg gctgcgggcc cggctgcgtg 622621 gcgagtcgct acccgaagaa cagtgagcgc ggggcgcctg gtagtcggca ttgtgcacaa 622681 gtgggttggg cattcagcac agtgtttgcg ctgatcgtgg cgattcgcct cggccgcgat 622741 tggcggctcc taacgttggc tgcaccgggt gtgggttgcg ggaaggtgtg cgatgtctaa 622801 tttgctggta accccggagc tggtggcggc tgcggcggcg gatttggcgg gtattgggtc 622861 ggctatcggt gcggccaatg cggcggccgg ggccccgacg atggcgctgt tggccgccgg 622921 tgccgatgag gtgtcggcgg cggtggcggc cgtgttttcc tcctacgccc agcaatatca 622981 ggcgctgagc gctgcggcgg cggcgtttca cgaccagttc gtgcgggcgt tggccgcggg 623041 tgcgggtgcg tatgcgggcg ccgaggccgc caacgtggag cagcagttgc tgaacgcgat 623101 caatgcgccc accctcgcgt tgttggggcg gccgctgatc ggcaacggcg ccgacggggc 623161 ggccgggacc ggtcaggccg gcggggcagg cgggctgttg tacggcaacg gcggtaacgg 623221 cgggtcgggt gcggccgggc aggccggcgg ggccggcggc gccgccgggc tgatcggcca 623281 cggcgggacc ggcggggccg tcaccggggt cagcaccacc ggcgggccgg gcggtcacgg 623341 cggtgacgcc ggcctgtacg ggtttggcgg ggccggtggc gcgggtgggt tcggccagag 623401 cggggcggcc ggcggggccg gtggggccgg tgggtggttg tacggcgacg gcggcgacgg 623461 cggcgcaggc gacaacggcg gtaacgagtc cggcaccggc gtcagtgccg ttgggggtgt 623521 gggtggggcc ggtggtgctg gtgggttgtt gttcggtaac ggcggcgacg gcggcgtcgg 623581 cggcgacggc ggcgacggca gcagcaccca ggattccggt ggtgatgggg gtgcgggtgg 623641 ggccggtggt gctggtgggt ggttgcttgg taatgggggg gccggcgggg ccggcggggc 623701 cgcctcaatc aaggttgcca ctggtgggct gggtggtgat ggtggcgatg ccgggctgtt 623761 cgggtttggt ggggacggcg gctggggcgg acgcggagtg gatgctcgat tcggtgcggc 623821 tgggggtgcc gctggggccg gcggtgcggg cgggtggttg tacggcgatg gcggcgccgg 623881 cggcgtcggc ggtgtcggcg gtgctgtctt cagcctttcc tccggtgacg gcggggccgg 623941 cggggccggt ggcggtggtg ggtggttgtt cggtaacggc ggcgacggcg gcgccggtgg 624001 cggcggcggt ggccgcttcg gcagcggcag cggtgccggt ggtgatgggg ctgtcggtgg 624061 ggccggtggt gcgggcgcgt ggttcggcaa cggtggcgcc ggcggcgtcg gcggcggcgg 624121 tggccgcggc accaccgcca tcggtggcga cgggggtgcc ggtggggccg gtggtgcggg 624181 tgggtggttg tacggcgacg gcggcgccgg cggtgccggc ggcggtggtg gccgcggcgg 624241 caccggcaac gatggtggcg acggcgggga cggcggccgc ggcggtgatg cccagctgct 624301 tggcaacggc ggtgacggcg gggccggcgg ggccggcggg cccgccgggt tggcgcttcc 624361 cccggggccg gcgcggccgg cgggggcggc ggtgccggcg gttcgctgtt cggcagcccc 624421 ggcacgaccg gcccgcacgg ctgatccctg gctagcgccg atcttcgcgc gctcaaccct 624481 tcggcattcg caccacctgg gcggcatagc tcagaccggc gccgtagccg atcaacaggg 624541 ccagatcgcc gggcttggcc gcgccggtcg tcagtaattc ggccatcgcg agcggaatgg 624601 aggccgccga ggtgtttccg gtgtgctcga tatcgttggc gaccaccgcg tcgggccgca 624661 actgcaggtt cttgaccagc agctcgttga tgcggctatt ggcctgatga gggacgaaca 624721 cgtctatctg gtcgggtcgc accccggcgg cgtccatcgc gcgccgaccg acgtcgccca 624781 ttttgaacgc tgcccaacgg aagaccgcgg gaccttcgag ccgcacaaac gggcgtgggc 624841 cgctgggatt ctgggcgaaa gtgatccagt cgatgtcctg ccgtatggca tcggcctgtt 624901 cgccgtcgct acccgccacg gttggtccaa tgccttgaaa cggtgtctcg cccaccacca 624961 ctgcggccgc gccgtcggcg aagatgaagc agttgccgcg gtcgtacatg tctatcgtgg 625021 gggacagttt ttccgtgccg accaccagca tcgtggccgc acctccgccc cggatcatgt 625081 cggccgctgc gccaagcgca tatccgaatc cggcgcaccc cgccgaaaga tcgaacccga 625141 gtatgccctt ggcgcccagc gacgccgcga ccattggggc ggccggcggg gtttgcagga 625201 aatgggtgtt ggtggtgacg atcacgccat cgatgtcggc cgccgacagg ccggcgttcg 625261 acagtgcccg tcgacaggcc tcagtcgcca tggaagccgc cgactcgtcg tcggcggcga 625321 atcggcgggt cttgatgccg gttcgggtgt agatccactc gtcggacgag tcgatgtgct 625381 ggcatatctc gtcgttggtg accacgcgtt cgggccggta cgccccgaca ctgagcagcc 625441 cgacgctcct ggcgccgctg gtcgtggcga tctccgtcat acccgtccta tctgttctcg 625501 tcgagtgtgc acctacggcg acgacacgcc gacggagccc gccctgagtg cacgttcgaa 625561 gttagctcaa ctgaccaaac gccaatgccc ccgccaccgc caacgcccac accagcatgg 625621 ccagcccagt gtcacgcagt accgggatca gctcgcgccc gccgcgcccg gatcgcaccg 625681 gtccggcagc gcgcagcgcc aaaggcgcgg ccaccaagcc caccacacac cacggcgtgg 625741 ccagcattag cacgaacgtc agcaccccgg cgaccgccag caggccctgg taaagcatcc 625801 gggtccgggc gtctcccagc cgcaccgcca gcgtgatctt gtcggcccgc gcgtcggtgg 625861 ggatgtcgcg caggttgttg gccaccagca ccgagcacga caacgcaccc gttgctaccg 625921 cctgtgccag ccccacccag tccacccgca atgcctgcgt gtactgggta ccgagcacgg 625981 cgaccggccc gaagaacaca aacaccgcca gttcgccgaa gcccgcatag ccgtagggtt 626041 ttgacccgcc ggtgtagagc caggccccgg cgatgcagat cgcacccacc gcaatcagcc 626101 acggcgcgct gagcagcgcc aaaaccagcc cggccagcgc accgagcgcc aggctcgtca 626161 tggcagcggt cagcaccgag cgcggggtcg ccagccgcga gcccaccaac cgcaccggac 626221 ccaccctgtc gtcatcggtg ccgcggatgc cgtcggagta gtcattggcg taattgaccc 626281 caatgaccag cgccaccgca acagccagtg ccaacagcgc tttccaccac acggccgcgt 626341 gcagccaggc cgcggcgccg gtgccggcaa ccactggcgc gatcgcgttc ggcagcgttc 626401 ggggccgcgc gccggagacc cactgtgcga aactggccac cagggcatcc tgccctatgc 626461 acaacaatgg gcgcatgctc ggagtgatcg gcggcagcgg cttctacacc ttctttgggt 626521 cggacacccg cacagtcaat tcggacaccc cctacggtca acccagcgcc ccgatcacga 626581 tcggcaccat cggggtgcac gacgtcgcgt tcttgccccg ccacggcgcc catcaccagt 626641 actcggcgca cgccgtgccg tatcgggcca acatgtgggc gctgcgcgcg cttggtgtgc 626701 ggcgggtctt cgggccgtgt gcggtcggca gcctggaccc tgaactcgag cccggcgcgg 626761 tcgtggtgcc cgatcagctg gtcgaccgca ccagcggccg cgccgacacc tatttcgact 626821 tcggcggtgt ccatgccgcc ttcgccgatc cgtactgccc cacgctgcgg gccgcggtga 626881 ccggcctgcc cggtgttgtc gacggcggca ccatggtggt gatccagggt ccgcggtttt 626941 ccacccgcgc ggaaagccag tggttcgccg ctgccgggtg caatctggtc aacatgaccg 627001 gctatcccga ggcggtgctg gctcgcgaac tcgaattatg ctacgcagca atcgctttgg 627061 tgacagatgt ggatgccggc gtcgctgctg gcgatggcgt gaaagccgcc gacgtgttcg 627121 ccgcattcgg ggagaacatc gaactgctca aaaggctggt gcgggccgcc atcgatcggg 627181 tcgccgacga gcgcacgtgc acgcactgtc aacaccacgc cggtgttccg ttgccgttcg 627241 agctgccatg agggtgctgc tgaccggcgc ggccggcttc atcgggtcgc gcgtggatgc 627301 ggcgttacgg gctgcgggtc acgacgtggt gggcgtcgac gcgctgctgc ccgccgcgca 627361 cgggccaaac ccggtgctgc caccgggctg ccagcgggtc gacgtgcgcg acgccagcgc 627421 gctggccccg ttgttggccg gtgtcgatct ggtgtgtcac caggccgcca tggtgggtgc 627481 cggcgtcaac gccgccgacg cacccgccta tggcggccac aacgatttcg ccaccacggt 627541 gctgctggcg cagatgttcg ccgccggggt ccgccgtttg gtgctggcgt cgtcgatggt 627601 ggtttacggg caggggcgct atgactgtcc ccagcatgga ccggtcgacc cgctgccgcg 627661 gcggcgagcc gacctggaca atggggtctt cgagcaccgt tgcccggggt gcggcgagcc 627721 agtcatctgg caattggtcg acgaggatgc cccgttgcgc ccgcgcagcc tgtacgcggc 627781 cagcaagacc gcgcaggagc actacgcgct ggcgtggtcg gaagcgagtg gcggttcggt 627841 ggtggcgttg cgctaccaca acgtctacgg ccccggcatg ccgcgcgaca ccccctactc 627901 cggagtggcc gcgatcttcc gctcggcggt tgaaaaaggc aagccaccaa aggttttcga 627961 agacggcggc cagatgcggg acttcgtgca cgtggacgac gtggccgcgg cgaacctcgc 628021 cgcggtgcat ctgggtgaag cggaccgcga cgggtttacc gcggtcaacg tctgttccgg 628081 gcgccccatc tcgatccttc aggtggcaac cgcgatatgc gacgcccgcg gtggctcgat 628141 gtccccggcc atcaccgggc actaccgcag cggcgacgtg cgccacattg tcgccgatcc 628201 cgcgcgggcc gcccgcgtgc tcgggttccg cgcggccgtc gatccaggcg aaggactgcg 628261 tgagttcgcg ttcgcgccgc ttcgctgacc gctcgagcta cgacgagtgg tccggcggcc 628321 ggtagatctt cggccgcact gggtgcgtcg acccagctga cctgaaaatc cggggggatc 628381 cagcaggccg ggacagcgcc ggggtgtgcg ggggttgcgg cagctggcgc agcctgccga 628441 tgacgatggc cgccgcgagg atgctgagcg ccaggccgca cagcaccaca tcgaaggtgc 628501 tggtgatctg ctgggcaaga tcgtttccgt agacggggtg agccgacccg cccgaagatg 628561 cctggaatcg cggagccagg acgacgccga cgatgacgcg cgaaacggtg aatgtgcata 628621 gcaatccgca cagcgacgag atcagcctcg cgggcagggc ctcggtcgcc atgttgaagc 628681 gaagccagat cgctaggacc gcggtggcca gatcgaacgc agccagcgcc atgctgtcgt 628741 agcgcgagaa cggataattc aacagcaccg cgacgacgag gtcggtcaac cgcatcgcta 628801 ccgagcccag gcaccacacg gcgatgagca gcaagccgtt tcgggacgct gcccgccaca 628861 cggttttgtc gatcggtggc gcgttcgggg acctgaacag ggtcagcggc gcacacattg 628921 ccgcggccgc ggcccacacc agataaccct cgtatcccac gccggccgtc gacgtgtttt 628981 gggcaatgcc gtggaacgcg tcgatctcgc ggccgatggg aaggctccac acgatggacc 629041 cggcgatcag ggtggagcca cccagcgcca cggttgagag cgcttcggcc gctgtaggcc 629101 gaagcagcca ccgggacgcg accagcacgg cggccagggc taccacgccg tacaccaccg 629161 ccgtgtcgat gaccgcgagg ttctgtttgc caaaaccgga cgcgccggcc gcgggctcca 629221 acgcgtacct gacccgccag ctcaggttga aaccggtgct gagggccgca cccaacatag 629281 acgcgtagcc gaggaactgg gtggcccgca gccacctgct gtggctgccc tcatcggtgg 629341 tagcgccggt tagcgccggt tgcgcgctca acagcgcgcc ggtgatcccc agccatcccc 629401 ccggcccgac accaccgggc acgtggacgg tgccgccgag tcgaatcgtc tggatcgcgt 629461 cgaacaccac gaaggccagc accagcagca gataggggac gttgaggccc aggcgaagct 629521 gtgagcgcct cccggcgaag gtcacggcaa gcgatgccaa agagagcgat gtcaccgcca 629581 gcagcaaccc gaacacggtc ttgctgctgt ccgggattcg gaaaccgaaa tacaggttcc 629641 atgggaaaaa cagcgcaccg atgagcaggg caccagcggc caagtcgcgg acgacctcgc 629701 gtcgtcgggt gtcgtcgctg ctcaggccca cgatgccccc cgggaatcaa gaacggttgg 629761 cgccgagtcg gtcctgtggt ggcgtgggtg cacccggccg ggccgactgc gttgctcgct 629821 tgcgaacata gtctccgttc cgacgacgcg gcagtggcgc agaacacgcg gttgggcgga 629881 tctcgtttgc ccggtgaccg tcccgctgtt tgcgaacccg gttacgctgc ggtcataggc 629941 gaacgctgtc gccgaattac cgatactgcc gacggtatcg cagtgtaacg atgccgggac 630001 attgctggtt gtggggtagc cagccgaagg agagccgcga tggacgtcgc tttgggggtt 630061 gcggtcacgg atcgggtcgc gcgtctggcg ctggtcgact cggctgcgcc cggcaccgtg 630121 atcgaccagt tcgtgctcga tgtggccgag cacccggtcg aggtgttaac cgagaccgtg 630181 gtgggcacgg atcggtcatt ggccggcgaa aaccaccggc tggtcgctac ccggctgtgt 630241 tggccggatc aggccaaagc tgacgagctg cagcacgcac tgcaggactc cggggtccac 630301 gacgttgccg tgatatccga ggcgcaggcc gccacggcgc tggtcggggc ggcacatgcc 630361 ggctctgccg tgctgttggt gggtgatgag acggcaacct tatcggtggt tggtgacccg 630421 gacgcgccgc cgacgatggt ggccgtcgcg ccggtggcgg gcgccgacgc cacatcgacc 630481 gtcgataccc tgatggcccg gctcggcgac caggccctcg ccccggggga tgtcttcctg 630541 gtgggtaggt ccgccgagca caccacggtt cttgccgacc agctgcgcgc ggcgtcgacg 630601 atgcgcgtgc agactcccga cgaccccacg ttcgcgctgg cccgtggcgc ggcgatggcg 630661 gccggcgccg ctacgatggc gcacccggcc ctggtcgcgg atgcgaccac ttcgctcccc 630721 cgggccgagg cggggcaatc gggttctgaa ggcgagcagc tggcgtactc gcaggccagc 630781 gattacgagc tgcttccggt cgacgaatat gaggaacacg acgaatacgg ggcagccgcg 630841 gatcgctcgg cgccgttgag ccgacggtcg ctgctgatcg gcaacgctgt cgtggccttt 630901 gcggtgatcg gtttcgcctc gctggcggtg gcggtggcgg tcaccatccg accgaccgcg 630961 gcctcaaaac cggtagaggg acaccaaaac gcccagccag ggaagttcat gccgttgttg 631021 ccgacgcaac agcaggcgcc ggtcccgccg cctccgcccg atgatcccac cgctggattc 631081 cagggcggca ccattccggc tgtacagaac gtggtgccgc ggccgggtac ctcacccggg 631141 gtgggtggga cgccggcttc gcctgcgccg gaagcgccgg ccgtgcccgg tgttgtgcct 631201 gccccggtgc caatcccggt cccgatcatc attcccccgt tcccgggttg gcagcctgga 631261 atgccgacca tccccaccgc accgccgacg acgccggtga ccacgtcggc gacgacgccg 631321 ccgaccacgc cgccgaccac gccggtgacc acgccgccaa cgacgccgcc gaccacgccg 631381 gtgaccacgc cgccaacgac gccgccgacc acgccggtga ccacgccacc aacgaccgtc 631441 gccccgacga ccgtcgcccc gacgacggtc gctccgacca ccgtcgcccc gaccacggtc 631501 gctccagcca ccgccacgcc gacgaccgtc gctccgcagc cgacgcagca gcccacgcaa 631561 caaccaaccc aacagatgcc aacccagcag cagaccgtgg ccccgcagac ggtggcgccg 631621 gctccgcagc cgccgtccgg tggccgcaac ggcagcggcg ggggcgactt attcggcggg 631681 ttctgatcac ggtcgcggct tcactacggt cggaggacat ggccggtgat gcggtgacgg 631741 tggtgctgcc ctgtctcaac gaggaggagt cactcccggc ggtgctggcc gcgatcccgg 631801 ccggctatcg ggcgctagtg gtggacaaca acagcaccga tgacaccgcg acggtggccg 631861 cccgccacgg tgcccaggtg gttgtcgagc cgcggcccgg atacggctcg gcggtgcatg 631921 ccggtgtgct cgccgcgacc acccccatcg tagcggtcat cgacgccgac ggctcgatgg 631981 atgccggcga cttgcccaag ctggtcgccg aactcgacaa gggcgccgac ctggtgaccg 632041 gtcggcggcg gccggtggcg ggcctgcact ggccatgggt cgcccgggtg ggcaccgtgg 632101 tgatgagctg gcggctgcgc acccgccacc gcctgccggt gcacgacatc gcgcccatgc 632161 gggtcgcccg gcgagaggcc ctgctggatc tgggcgttgt cgatcgacgc tcgggttacc 632221 cgctggagct gctggtccgg gccgctgcgg cgggctggcg tgtcgtcgaa ctcgacgtca 632281 gttacggtcc ccggaccggc ggcaaatcca aggtcagcgg ttcgctgcgg ggcagcatca 632341 tcgcgatcct ggacttctgg aaggtgatct cgtgagctgc ctgccggtca gcgtgctggt 632401 ggtcgctaaa gcgccggagc cgggccgggt caagacccgg ctggccgcgg cgattggcga 632461 taaggtcgcc gccgacatcg ccgcggccgc actgctggac accctggatg cggtggccgc 632521 tgcgccggtc accgcccggg cggtggcgct taccggcgac ctggactccg cggccgattc 632581 cgcggagatc cgccgacggc ttaagtcctt cacggtattt cggcagcgcg gtgacgcctt 632641 cgccgaccgg ctcgccaacg cacacgtcga cgcggccgac ggctatccgg tgctgcagat 632701 cgggatggac acgccccagg tgaccgccga gctgttggcc gattgtgcac gcctgctgct 632761 tcaaatcccc gcggtgctcg gcctggcgtt cgacggcggt tggtgggtgc tggggatacg 632821 cacgcctact gcggccgagt gcctgcgcgc cgtcccgatg tcacagccag acaccggcga 632881 gctcaccttg aaggcgttgc gcgacaacgg cattgatgtg acgctagtgc agcgtctggg 632941 cgacttcgac atcgtggacg acatcgcgct ggtacgcgat tgctgcgctc cggggagtcg 633001 gttcgcgcag gctacccgcg cggctggact ctgaggccgc gccggcgcat ttgcttacca 633061 gttggtgaag atgatgctgt tcagcagtag ggccccggcg gcgttgaccg ccagccagag 633121 tcggtgcgag cggggcggca gcagtgcggg cgccgcggtc agccaaatgg tgaagggcag 633181 ccagattcgt tcggtctcgg ctttgctcag catgctcagg tcggccaagg cgatggcggc 633241 cagcaccgcc agcagcagca gatggcagcc ggatcgacga ctgatcgcgg cccggtcgaa 633301 tacccggctg agacctgcga cgctgcctaa cccgatagcg cagaccacgc acgccaagtt 633361 tgcccaggac caatagccga acggccgatc tttggcgatc ccctgccaat agcgttgctg 633421 gacaagggta taaccgtcga accaggagaa tccggcaacc gcgaagctca ccgcgaccac 633481 cagcgccgcc agcacggccg gccccagtgc ccgcaggacg ggccgccaat ctgcggcggc 633541 caacaccgcc atccccggca gcacgatcag cacgagcccg tagttgagaa agacacccca 633601 gccgagcagt agccccgctc cggccgccac cagcgccggg aagcgagtgg caccatgcac 633661 cgccaccgcc aacagggcga taccccacgc cgccacaccg gcgaaatacc cgtcggccga 633721 aaccgcgatc cagatcgccg tcggcgccac cgcgacgaat ggtgccgtcc gccgcgccat 633781 ctgctcactg gccagcaccc gcacggcgat cagcaccgcc gccgccgcgc tggatcccac 633841 cagcaggcac accagccccg cccaaccgcc accgcgcagc ccgatccgat ccagccagac 633901 aaacgtcagc agcgcacccg gcgggtgccc ggagacgtga gtcacccagg aattgggctg 633961 gaagtcgaga atccggctgg tgaacgtccg caacgtcgcc gggatgtcgg caatgccggg 634021 cacctgccac aggtactcgt cacgggtggt caatcggccg gcaaagccgc gctgccagcc 634081 gtcgatcatc gccagtgaga acgcccaggc ggcggcggtg gcccaggtgc tcagcgtcag 634141 cacccgccag gggagccggt gcgccactac cggcccccac gccacaacgg ccactgcggt 634201 aagaaccgcc ggggccgtgc cccagccaac atgggcgtcc cagtagccga agatcggcgc 634261 ggcgccggcg cgcgtggcaa accgctccaa gccgatatcg gatcgcggtt tgattcccag 634321 gttcaaccgc ggcagtacga acgcggcgcc gaccaggaca aacccgatcg cgacggccaa 634381 tccctcgcgg cgaccgatcc tcacgaccga tcagcctatt gatcggcttc accggcgaac 634441 cggcgcacca acgctgcccg gtccaccttg ccgatgccgc gtcgcggtag cacgttcacg 634501 acatgtagct ctcggggcgc ggcggtgacg tccagggtgc gcgcgacatg cgcccgcagc 634561 gcttctagcg ttggtggtgg gcatccgtcg ccgaccacaa tcgcggcgac cactcgctga 634621 ccgagtcggt cgtcggcaag tccaaaaacc gcgcagtcac gcaccgcagg gtgggtgccc 634681 agtgcggcct ccactggctg cggcagcacg gtgaatccgc ccgtgctgat cgcttcgtcg 634741 gctcggccca gcacggtcag cacacccgaa tcacccgatt caagggcgcc aaggtcgtcg 634801 gtgtgaaacc agcctggctc ggcgaacgga tcgggcgaga ccgggttgcg atagcccttg 634861 gccagggtcg caccgccgat agctatgcgg ccgccggcca gcaccctcag ccggaccccg 634921 tcgagcggaa cgccgtcgta gacacagccg cccgaggtct cgctcatgcc gtaggtgcgc 634981 accaccgtga tgccggcggc ggccgcggcg tccaggatgg gccggggggc cggcccgccg 635041 ccgatcagca ccgcgtccaa ttcggccagc gcggccgtgg ccgccgggtc ggtaagtgcc 635101 ttggccaact gtgcggcgac cagcgacgtg tatcgccggc cagaacccaa tctctttatc 635161 gcgttgggta attcggtgac atcgaatccc gcggagacgt tcagttcgac aggaactgat 635221 ccggcgatca cgctgcgcac cagcaccgcc agcccggcga tgtgatacgg cggcacagcc 635281 aacagccagc tgcccggtcc gccgagccgg tcgtgggcgg ccgaggcgct ggcggtcaag 635341 gccgccgcgg tcaacatggc gcccttgggc ggtcccgtgg ttcctgacgt cgtcactacc 635401 agggcgacgt cgtcgtcaat ctgctcgccc actcgcaaag cgcccagcaa ggactcatgc 635461 tgggtgggca ccgcgaccaa tgccgggtcg ctgccaccca gcactcgttg cagggcaggc 635521 agcagcagcg cggtagcaga accggccggg acgtgcagcg cacgcaggat ggctatgcgt 635581 gctcctcgcg gtcgcggacg tcatcgagtg gccatccctg ggcggccaac cgcgcccgga 635641 cgcgctcaac atcctccggt gacggcaact cgtcggtgaa atgggtgatc accacgccga 635701 tgtcgatctg gtcgaagtca ccgagtcgca tcagttcgtt agccaccgcc ttgacctcat 635761 cgtggctcag ccggcggcaa agcagggcga gcaccgcaaa ggagtcggtc ggcggaatgc 635821 cctcgggata tcccgcgcgc aaccacgcga cgatcgaggt gagaaaccgg ttcacgctgt 635881 tatatcttcc cgtcggggcc gtcgccaaac cctatgtcgc ggccatctgc gactctactt 635941 gggtgtggcg cccaggaagg cccagccggt gtgatgcgcg atgaagtcac gggcgatata 636001 cagcactcca atgatgacta ccgcaagcac cagcgcgaag atcgcccaac tgacggcaac 636061 cagcagcggg cgccggcgcg cggtggcgtc ggcaccgtcg ccggcggcct gcagccgcac 636121 gcccaccgcg aacaggcccg gtagcagcgc gccggctagc agactgaaga tcaggatctt 636181 cagggtggcc gtgtagttga accaggcact cacggggcgt tcctcgtggt gacgccgaac 636241 tgtggtgctc ggtggttcgg tgggggagtc ggagccggcg gtgcaggcac cggcggcctc 636301 tgatccgcaa gcggctgcgc acccgcttcc aggccggccg tcaggttgcc ttcccagtcg 636361 gcgttgacgt tggtgtggtc gatcggcgcc ctgcgcgacc gcagccagat ggcggtggcg 636421 gtcagccaca acagtgcgaa accgaggatc gcaccggggt agccaccgat gaaatgcacc 636481 agcccgtagg tgaaggcccc gaccagcccg gccaacggga gcgtcaccag ccacgcgacc 636541 accatgcggc cggctacccc ccagcgcacc tcggcgccgg gcttgccgac gccgctgccc 636601 agcacggacc cggtcgcgac ctgcgttgtg gacagcgcat agccgaagtg cgcggacaac 636661 agaatgacgg cggccgatga cgactcggcg gccataccct gcggtggttt gatctcgacc 636721 agccctttgc ctagggtgcg gatgatgcgc cagccaccca ggtaggtacc ggcggccatg 636781 gccacggcgc aactcacgat cacccacagc ggcggcaccg atgccgtcgt gctgaccgcg 636841 ccgtaggaca tcaacgccag gaagatcacg cccatcgtct tctgcgcgtc gttggtgccg 636901 tgcgccagcg agaccagcga cgccgagccg atctggccgc gccggaaacc gcgttccgta 636961 cgcttttcgg caaccccgcg cgtcgtccgg tagaccagcc aggtgccgac tgctccgacc 637021 agcgtggcca gcagcgcggc taccacggcc ggcacgatca ccttggacac cactccgctc 637081 cagatcaccc cacgcaggcc gacggcggca attgtggcgc cgacgatgcc gccgatcagc 637141 gcatgtgagg aactcgacgg aatgcccagc aaccaggtca acaggttcca gacgatcccg 637201 ccgaccaggc cggcgaacac caactccagc gtcaccagat tcgcgtcgat cagacccttg 637261 gcgattgtgg ccgccacggc ggtggacaaa aacgcaccga tcaggttcag cacggcagga 637321 agtgctaccg ctacccgcgg tgccagggcg ccgctggcaa tcgaggtcgc catggcgttt 637381 ccggtgtcgt ggaacccgtt ggtgaagtcg aacgccaatg ccgtcacgac gacaatgagc 637441 aaaaggaaca actgaaggtt cacagggcct gattctgctg gtcgggatat tgcgttgtcg 637501 atcaaacgag tacgcgaaat gcgggtgtat ctcgactcgt cgtcagatgt taccaatcac 637561 gtaacccagc gttttgcgga gttcacgccc gggtgtctgt acgcagcggg tgaccctcgg 637621 gaacctcgac gaatatcagt gtgatcccgt ctgggtcggt cacatgcatc tcgtgcaggc 637681 cccacggttc gcggcggggc tcgcgagcga tcgacacgcc tcggctgacc agctcggtct 637741 gggtagcctc gaggtcgcgc acctgcagcc acagcgcgcc gggaaaaggt ccccgcgaat 637801 ggtccggctc gccgtaaccg gccagttcga gcagtgactg accggcgaaa aacactgtgc 637861 cggccccgta ttcacgggca atcgccagcc cgatctggtc acggtagaag ctcagcgacc 637921 gctgatagtc cgccggccga agtagcatcc ggctggccag gatttccatg gccctgtgtc 637981 tatcacgtag cggcacgccg gcggccgagg gtcggcaggc cgggacccgg ttcaagggtt 638041 gagctgttcg ttgcggcgct gcatgagtgc attgacccag cggggaccga tgctgtccag 638101 cgcgttgacg gcgacggcaa cccgaggcgc gatccgcacc ggtcgggtgc gggcggcggt 638161 gaccatccac tcggcggctt ccgcggcggt cagcgccggc agcccgtcgt aggccttcgt 638221 cggcgcaatc atcggagttg ccaccagcgg gtagtacagc gtcgtcgaat gcacgccctg 638281 actaccccac tcggtttcga tgatccggct caccgccgac agtgcggcct tcgatgcgtt 638341 gtacaccgag aacagcggcg aagcctccga caacacgccc caggtggcga cattgatgat 638401 atggccgtcg ccacgctcga gcatcccggg tgcaagcccg cggataagcc gcagcggggc 638461 atagtagttg agcaccatgg tgcgctcgac gtcgtgccag cgttccagcg actcggccag 638521 cggccgccgg atcgaccggc cggcattgtt gatcaggatg tcgatcccgc cgatgcgctt 638581 ttcgacgtct tcgaccagcg cgtcgatcgc ttccatgtcc gagaggtcgc aggggagcga 638641 catcgccgtg ccgccgtcgc cggtgatccg gtccgccacc gcatccagca gatccttacg 638701 gcgcgcgacg gcaaccacga cggcgcggtg cagtccgaac tgtttggtcg cggccgcacc 638761 gatgcctgac gacgcgccgg tgagcaggat gcgcttgccg gtgaggtcga cgggttgcat 638821 cgcgggccgg ttgatcagca gttgcggcga aattggtggc cgcatgccgg ccaatgtgat 638881 ttgttcagtc aaccagcgca gcggtctttt gctcacagct ggggagtcta gttttgccga 638941 gcctgtagtt actgtggtgt cccactcgtc gggcttctgc tcggcaacta cagcctcggc 639001 gaacggccgc gttagaaata gcgcggaaac gggctccagt cggggggacg cttctgtagg 639061 aaggcgtcgc ggccttcgac ggcctcgtcg gtcatgtagg ccaggcgggt ggcctcaccg 639121 gcaaacagtt gctgacccac cagcccgtcg tcgagcaggt tgaacgcgaa cttcagcatc 639181 cgttgcgcct gaggcgattt cgcgttgatc tcggccgccc actgcagccc cactgtctcc 639241 agctcggcgt gttcggccac cgcgttgacc gcgcccatct ggtgcatctg ctcggcggtg 639301 taggtgcggc ctaggaagaa gatctcgcgg gcaaacttct ggccgacctg acgggccaga 639361 tatgcgctgc cgtaaccgcc atcgaagctg ccgacgtcgg cgtcggtctg cttgaagcgg 639421 gcgtactcgc ggctggccag ggtgagatcg cagaccacgt gcaggctgtg tcccccgccg 639481 gccgcccagc cattgaccag acaaatgacc accttgggca tgaaccggat cagccgctgc 639541 acctccagga tgtgcaaccg gccggcgcgg gcgacatcaa ccgtgtccgc ggtgtctccg 639601 ctggcgtact ggtaaccgct gcgcccacga attcgttggt cgccgccgga gcagaacgcc 639661 cagccgccgt ccttcgggga cggcccgttg ccggtcagca gcaccactcc gacgtcgggc 639721 gacattcgtg catggtcgag cacccggtac agctcgtcga cggtgtgcgg gcgaaatgcg 639781 ttgcgcactt cagggcggtt gaacgccacc cgcaccgtgg catcgtcgac gtggcggtgg 639841 taggtgatgt cggtcagatc gtcgaacccg tccacgagcc gccacgcctt ggcatcaaaa 639901 gggttgtcac tcaaggctgt tgaactccgt ccttgttcgc cggctggagc caccacggcg 639961 atctgatccg ttcacccatg cctgccacag taatcatggc cgctgggcgt cagccggacg 640021 gtatggtgcc cggggccttg gtcacatgtg gtcgtgagtt ggcgcccggg cggctttctg 640081 tggagggtca ccgcgtactc gatcatggcg ctgctcgcag ttcatcaccg aaccggcaca 640141 gtgtcgctcg gcaatgcggt ctactcgtgg ggcatgttaa gcgctcaaca gggcggcgcg 640201 cccacctttc tccaagcccc gctggactca gccgatggcg tgagccgagg gccaggcgcg 640261 tgccaatctt tcgtcggtgg tcaacaacac cagacctgcc gtttcggcca gctcgacgta 640321 gagggcatcg gtcaggcgga gggtgtcgcg gcgcgaccac gctccagcaa gcagcgacga 640381 aagaccgtgt cgagtcaccg gcacctgtcg caactcctcc agtgccgcat cgacataggc 640441 aacggtgagt gcgccggcgc gctgcatgcg ccccagcgcc gacaacacct ctgcatcgaa 640501 gtgcgccggc gcgtgcatcg cggtccgagc cagccgcgcg cgcaccgcag agcaccgatc 640561 gctagtgcga gccagtagat ccaccatggc actcgcgtcg acgaccacct gctccggcgg 640621 cgaagtgggc gatgctctca cgcttcgaac tcatcgcgag cggcatcgat cgcacccagc 640681 acgtcatcat gccgagcgcc ggtgcttctg ggttccaacc cctcaagcca cgcatcggtt 640741 gcggagttct ccaactcggc actgatcgcg gcctgagtca gcgccgagac gttcaagccc 640801 cgcgccctgg cgcgctccgc caattcgtcg ggcacataca cgttcaaccg agccatacac 640861 accaatgtac acacaacgat cgttttcgtg cgccggctca acaaggcctt cggcgggttc 640921 tttcgcccgc cgcagaccgc gaaacccgct gtgaaggtgg gttatcccga gcatcgccgg 640981 catatctgca cggcatcggc ggcgtcaagt gcgccggcat cgccaggccg aaggccgggg 641041 tgaagactaa tccagatcag atgcgaggga ccagacttca tgcaacggcc aagccctagc 641101 cgaccgcgcg cccagcgcct tcccagaacc gtgcgcgcac ggccttcttg tccggctttc 641161 ctagaccggt caacggcaaa gagtcgacga ccaccacccg cttgggtgcc tgcaccgatc 641221 ccttgcgttg tttgaccgct gcctggatct cggcggtcat ggcctcgatc gcgggctcat 641281 cgcgggccgc gttggagcgc aacaccacca ccgcggtgac ggcctcgccc cacttctcat 641341 ccggcgcgcc aaccacgcac acctgagcaa ccgccggatg ctcggccacc acgtcctcga 641401 cctcccgggg gaacacgttg aagccgccgg tgacgatcat gtccttgacg cggtcgacga 641461 tgtagtagaa gccatcggag tcctcgcggg ccaggtcgcc ggtgtgcagc cagccgtctt 641521 taaaagtccg cgacgtctcg tctggcagat tccagtaacc gcccgccaac agcggtccgc 641581 tgacacagat ttcgccgact tcgccctgct tcaccggctt gccatgctcg tctaacagcg 641641 cgacgcgggc gaacagcgtc ggccgcccac atgaggtcag ccgcttctcg tcgtgatcgc 641701 ccttggccag ataggtgatc accatgggcg cctcggattg cccgtagtac tgggcgaaga 641761 ttgggccgaa ccgccggatc gcctcggcta gtcgcaccgg gttgatcgcc gaggcgccgt 641821 agtagacggt ttccagcgac gacaggtccc gggtgtgcga atccgggtgg tccagcagcg 641881 cgtacagcat cgatggcacc aacatggtcg ctgtaatgcg ttgctcctca atgattctga 641941 gtacctcggc cgggtcgaac ttcgccagca ctatcatctc gccgcccttg atcaccgtcg 642001 gcgtgaaaaa cgccgcgccg gcgtgcgaca gcggggtgca cattaagaac cgcgggttgg 642061 ccggccactc ccattcggcg agctggatcg aggtcatggt ggcgatcgac tgcgcggtgc 642121 ctatcacgcc cttaggcttg ccggtggtgc cgccggtgta agtcaggccg ataacttggt 642181 cgggtggcag gtcggcggcg accagcggct gcggctggta tttggcggcc tcggcggata 642241 ggtcgactgc cacatgcttg agcgcatcgg gcaccggccc aatggtgagg atttgctgca 642301 gcgagtccac ctgctccagc agagccagtg cgcgctcgac gaacatcggg ttggggtcga 642361 tgatcagtga gctgatgccg gcgtcgttca gcacgtaggc gtgatcggcc agcgagccca 642421 acgggtgcag cgcggtgcgc cgataaccgc gggcctgccc ggcgccgatg atcatcaaaa 642481 cttcaggacg gttgagcgac agcagaccga ccgccacccc ggtgccggca cctagcgcct 642541 cgaatgcctg gatgtactgg ctgatacggt ccgccagctg gccaccggtc agcctggtgt 642601 cgccgaggaa cagcaccggc ttgttctggt ggcgcttgag cgctcccact agcagatggc 642661 cgttgtgggt cgggctgcgc aacagctcgc ccgaacaatc ctggtcacgc atggcgccgc 642721 tctccctcgc tagctggggt acccccaccg catcgcttcg tcccccgcaa gcgggtggta 642781 cccccactgc atcgtcgccg gcggtgctca tctggcaaga ctagaacgtg ttgcaatttg 642841 gatctgccgt gccctcgtaa tctcgaagga tcactacgct tggagcccat ggccgatgca 642901 gacctcgtca tgaccggaac cgtgctcacc gtcgacgatg cgcggccaac ggccgaggcg 642961 atcgcggtcg ccgacggccg ggtcattgcc gtcggtgacc ggtccgaggt tgccggcctg 643021 gttggcgcca acacccgggt catcgatctg ggtgccgggt gcgtcatgcc aggatttgtt 643081 gaggcacacg gccatccgct actggaggcg gtcgtgctgt cggaccggtt cgtcgatatc 643141 cgtccggtga cgatgcggga cgcggacgac gtcgttgccg cgatccgcgg cgaggttgca 643201 cggcgcggcc cggccggcgc ctatctggtc ggctgggatc cgctgctgca gtccggtctt 643261 ggcgagccga cgctgacctg gctcgacagc ctcgcgccga acgggccgct ggtgatcatc 643321 cacaactccg gacacaaggc ttacttcaac tcgcacgccg cctggctcaa tgggctcacc 643381 cgagacaccg cggatcccaa gggcgcgaag tatggccgcg acggcaatgg cgaactcgac 643441 ggcaccgccg aggaaatcgg cgcgattctt ccgcttttgg ccggtgtagc cgaccccagc 643501 aacttcggtg ccatgctgcg cgccgagtgt gctcggctca accgtgccgg cctgaccaca 643561 tgctcggaga tggcttttga cccagggtat cggccgatgg tcgaggcggt gcgcgccgaa 643621 ctgacggtcc ggctgtgcac ctacgagatc tccaatgcgc ggatgtgcac cgatgcgacg 643681 cctgggcaag gtgacgacat gctgcgccag gtgggcatca agatctgggt ggacggctcg 643741 ccgtgggtcg gcaatatcga tctgaccttt ccctacctgg acacccccgc cacccgtgcc 643801 atcggtgtac cgcccggttc ccgcgggtgc gccaattaca cccgtgaaca gttggccgaa 643861 atcgtcgggg cctactttcc gcggggctgg cagatcgcct gtcacgtgca cggcgacggc 643921 ggtgtggaca ccatcctcga cgtctacgaa gaggcactgc gccgcaatcc tcgagacgat 643981 caccggctgc ggctcgaaca cgtcggggcc atccggcccg accaactgcg gcgcgccgcc 644041 gaactcggtg tcacctgcag catcttcgtc gaccagatcc attactgggg cgatgtgatc 644101 gtcgatgacc tgttcggggc acagcgcggg tcccggtgga tgccggctgg atccgcggtg 644161 gccgccggca tgcgtatctc gctgcacaac gacccgcccg tcacaccgga ggagccactg 644221 cgcaacatca gcgtggccgc aacccgggtg gcgcccagtg gccgggtgct ggcaccggag 644281 gagcgcctga cggtcgagca ggcgattcgc gcgcagacca tcgatgccgc ctggcaactg 644341 ttcgctgagg acgcgatcgg ctcgcttcag gtcggcaagt acgcggatat ggtggtgctg 644401 tcggcggatc cccggacggt gccgccagag cagatcgccg acctggcggt gcgggcgacg 644461 tttctggccg gtcgccaggt ttatcggcgg tgatacccgt gctgcccccc ctagaagccc 644521 tgctggaccg cctgtatgtg gtggccctgc cgatgcgagt gcgtttccgc ggcatcacca 644581 cccgtgaagt ggccttgatc gagggtccgg ccggttgggg cgaattcggt gcgttcgtgg 644641 agtaccagtc cgcgcaggcg tgcgcgtggt tggcgtcggc gatcgagacc gcctactgtg 644701 cgccgccgcc ggtgcgacgt gaccgcgttc cgattaacgc cactgtgccg gccgttgccg 644761 ccgcccaggt gggcgaggtg ctggcccggt ttcctggggc ccggacggcc aaggtgaagg 644821 tcgccgagcc tgggcagagc ttggccgacg acatcgagcg tgtcaacgcg gttcgggagc 644881 tggttcccat ggtgcgggtg gacgccaacg gtggctgggg tgtcgccgag gcggtggccg 644941 cggcggccgc cctgaccgcc gacggcccgc tggaatacct tgaacaaccc tgtgccaccg 645001 tcgccgaact cgccgagttg cgccggcggg tggatgtgcc gatcgccgcc gacgaaagca 645061 tccgcaaggc cgaggatccg ttggccgttg tccgcgctca ggccgccgat atcgcggtgc 645121 tgaaggtcgc cccgctgggc ggtatttcgg cgctgcttga tatcgcggcg cggatcgccg 645181 ttccggtggt ggtctccagc gcgctcgatt ccgccgtcgg aatcgccgcc ggcctgaccg 645241 ccgccgcggc cctgccggag ctcgaccacg cgtgcgggct gggcaccggc gggctgtttg 645301 aagaggacgt ggccgagccc gcagcacccg tcgacggctt tctggcagtt gcgcggacaa 645361 cgcccgaccc ggcgcggttg caagccctgg gtgcaccgcc gcagcggcga cagtggtgga 645421 tcgaccgggt caaggcctgc tactcgttgc ttgtaccgtc tttcgggtga tcaacctggc 645481 ctacgacgac aacgggaccg gtgacccggt ggtctttatc gccggccgcg gcggcgccgg 645541 acgcacctgg cacccacatc aagtcccggc ctttctggcg gctggatatc ggtgcatcac 645601 gttcgacaat cggggcatcg gcgccaccga aaacgccgaa ggcttcacca cgcaaaccat 645661 ggtcgccgac accgcggcgc tgatcgaaac cctagacatc gccccggcgc gcgttgtcgg 645721 ggtgtcgatg ggggcattca tcgcgcagga actcatggtg gtcgcacccg agctggtcag 645781 ctcggcggtg ctgatggcca ctcgcggccg cctggaccgc gcccgccagt tctttaacaa 645841 agccgaggcc gaactctatg actcgggtgt ccagctgcca cccacatacg acgcgagggc 645901 tcgcttactg gagaacttct cccgaaagac gctcaacgat gacgtggccg ttggcgactg 645961 gatcgcgatg ttttccatgt ggccgattaa gtccaccccc ggactgcgct gtcagctaga 646021 ttgcgctccg cagaccaacc ggctgcccgc ctaccgcaac atcgccgcgc cggtgctggt 646081 gattggtttc gccgacgacg tggtgacgcc gccctacctg ggtcgggagg tcgccgacgc 646141 cctgccgaac ggccgttacc tgcagatacc tgacgccggt catctcgggt tcttcgagcg 646201 gccggaagcc gtcaacaccg cgatgctgaa gttcttcgcc agtgtcaagg cctgagcgcg 646261 gcccggccat acggtccggc tgtgacactc tgtactggtg aacccctcga cgacacaggc 646321 gcgcgtcgtc gtcgacgaac tgatccgcgg cggcgttcgc gacgtggtgc tgtgtccggg 646381 ctcgcgcaat gcgccgctgg ccttcgcgct gcaggacgcc gaccggtccg gccggatccg 646441 gttgcacgtt cgcatcgatg aacgcaccgc cggctacctg gccatcgggc tggcaatcgg 646501 ggcgggcgcg ccggtgtgtg tcgcgatgac atccggcacc gccgtggcca acctcggtcc 646561 ggcggtggtg gaggcaaact acgctcgggt gccgctgatc gtgctgtcag ccaatcggcc 646621 ctacgagctg ctgggcaccg gcgccaacca gaccatggaa cagctgggct atttcggcac 646681 ccaggtgcgc gccagcatca gcctggggct ggccgaggac gcacccgagc ggacctcggc 646741 gctcaacgcg acctggcgat cggctacgtg ccgagtgttg gcggccgcca cgggtgctcg 646801 caccgccaac gcgggccccg tgcacttcga catcccgctg cgcgaaccgc tggtgcccga 646861 tcccgagccc ctcggcgcgg tcaccccgcc gggccggcct gctggcaagc cgtggaccta 646921 cacgccgccg gtcaccttcg accagccact ggacatcgac ctgtcggtcg acaccgtggt 646981 catctccggg catggcgctg gcgtgcaccc caacctcgcg gcgttgccga ccgtcgcaga 647041 accgacggcg ccgcggtccg gggacaaccc gttgcacccg ctggcgctgc cgctgctgcg 647101 ccctcaacag gtgatcatgc tgggccggcc gacactgcat cgtccggtat cggtgctgct 647161 ggccgacgca gaagtgccgg tattcgcatt gacaaccggt ccacgctggc cggatgtctc 647221 gggtaactcg caggccaccg gcacgcgggc ggtcaccacc ggcgcgccgc ggcccgcgtg 647281 gctggaccgg tgtgcggcga tgaaccggca cgcgatcgcg gcggttcggg aacagctcgc 647341 ggcgcacccg ttgaccaccg ggctgcatgt cgcggcggcg gtgtcgcatg cgctgcggcc 647401 cggtgaccag ctggtgctcg gggcatccaa tccggtgcgg gatgtggcgt tggccggttt 647461 ggacacccgc ggcatccggg tacggtccaa ccgtggggtc gccggcatcg acggcaccgt 647521 gtccaccgcg atcggggcgg ccctagctta tgagggggct cacgagcgca ccggcagccc 647581 ggactccccg ccccgcacca tcgcactgat cggcgacctg acgttcgtgc acgacagctc 647641 cgggctgttg atcgggccga ccgaaccgat accgcggtca ttgaccatcg tggtgtctaa 647701 tgacaacggc ggcggcatct tcgaattgct cgagcagggt gatcccaggt tctccgacgt 647761 gtcatcgcga atcttcggca ccccacacga cgtcgatgtg ggcgcattgt gccgcgccta 647821 ccacgtggaa tctcgccaga tcgaggtcga cgaactcgga ccgaccctcg atcaacccgg 647881 tgccggcatg cgcgtgctcg aggtcaaggc cgaccggtcg tcgttgcgac aattgcacgc 647941 cgccatcaag gcggctctgt gatatcaccg aaacccctgc tgcacatcct gattcatggg 648001 ctcagtgatg aactgcccga tactcgaggc aggatcgtgc tgcgctggtt acgaatcgcc 648061 gtcctgatag tgaccggttt ggtcacgctg cagtcggtgc ttctggtggc tggtgcgtgg 648121 cgcaatgaca ttgcgatcca acgtaatatg ggggtcgcgc aggctgaggt gctcagcgcc 648181 gggccgcggc gttcgacgat cgagtttgtc acaccggatc ggatcaccta tcggccgcaa 648241 ctcggtgtgc tgtatccgtc cgaattatcc acgggcatgc gaatttacgt tgagtacaac 648301 aagagggatc ccaacctggt cagagtgcag caccgtaacg ccggactggc gatcatcccg 648361 gccgggtcca tcgcggtggt ggcctggctg atcgccgccg ccgcgctggt cgtgctagcg 648421 gtgctggaca agcggttgga acgtcgtgaa aattcggcgt ctgcaacggg ctgagcagca 648481 gagttcgcac gccgtatgcc gctacgcaac catttcgaca gccggcgctg acagtgtgtg 648541 tggcgtgcgc gttgcgatcg tcgccgagtc gttcctcccg caggtgaacg gcgtcagcaa 648601 ctcggtggtc aaggtactcg aacatctgcg tcgaaccggt catgaagccc tggtgatcgc 648661 gcccgacacg ccgccaggtg aagaccgcgc cgagcgactt cacgacggtg tccgggtgca 648721 ccgggtgccg tcgcggatgt tcccaaaggt gaccacgttg ccgctcggcg tgcccacctt 648781 ccgaatgctg agagcgctgc gcggattcga tccggatgtc gtgcatctgg cgtcgccggc 648841 gctgcttggc tacggtggac tccatgccgc tcggcggcta ggggtgccca cggtcgcggt 648901 ctaccaaacc gatgttccgg gtttcgcgtc cagctacggc attccgatga cagcacgggc 648961 ggcgtgggca tggttccgcc acttgcatcg cctggctgac cgcactctgg cgccgtccac 649021 agcgacaatg gaatccctta ttgcccaggg cattccgcga gtacaccggt gggcacgcgg 649081 ggtggacgtg caacgtttcg cgccgtcggc gcgaaacgag gtgttgaggc gacggtggtc 649141 accggacggc aaacccatcg tcggctttgt gggtcggctt gctccggaga agcatgtcga 649201 ccggctcacg ggtctggcgg cctccggcgc cgtgcggctg gtgatcgtcg gcgacggcat 649261 cgaccgggca agattgcaat cagcaatgcc cacagcggtt ttcaccggag cacggtatgg 649321 caaagagctc gccgaggcgt atgccagcat ggacgtcttc gtacattccg gtgagcacga 649381 gacgttctgc caagtcgtgc aggaagcgct ggcgtcgggg ctaccggtga tcgctccgga 649441 cgccggcgga ccgcgtgatc tgataacccc gcaccgcacc gggctgctgt tgccggtcgg 649501 cgagttcgag caccggcttc ctgacgccgt cgcccacctg gtgcacgaac gccagcgcta 649561 cgcgctggcc gcccggcgca gtgtgctggg ccgcagttgg ccggtggtct gcgatgagct 649621 gctcggccac tacgaggcgg tgcgaggtcg gcgcacgacc caggccgcgt aacggtagcg 649681 tcgaggctat gagtcgcgcc gccttggaca aggatccccg cgacgtggcg tcgatgttcg 649741 atggcgtcgc ccgcaagtat gacctgacca ataccgtgtt gtccctgggc caggaccggt 649801 attggcggcg agccactcgg tcggcgctgc ggatcgggcc cggccaaaag gtcctggacc 649861 tggccgcggg caccgccgtg tccaccgtag agctcaccaa atcgggcgcg tggtgtgtgg 649921 ctgccgattt ttcggtcggc atgcttgcgg cgggcgctgc gcgcaaggtt cccaaggtcg 649981 ccggtgacgc cacccggctg ccgtttggtg acgacgtgtt cgatgcggtc accatcagtt 650041 tcgggctgcg taacgtcgca aaccagcaag cggcgctgcg ggaaatggct cgtgtcaccc 650101 ggccgggcgg gcggctacta gtgtgcgaat tctccacgcc caccaatgcg ttgttcgcca 650161 ccgcctacaa ggaatacttg atgcgggcgc tgccccgggt ggcgcgggcg gtgtctagca 650221 accccgaggc ctacgagtac ctcgcggagt cgatcagggc ctggcccgac caggcggtgc 650281 tggcgcacca gatttcgcgg gccgggtggt cgggggtgcg gtggcgcaac ctgaccggcg 650341 gcatcgtagc tctgcatgcc ggatacaaac ccggcaaaca aaccccgcag tgaccggtag 650401 gaagacttag cgggtgccag cccgttgcag gacgcccaca tgctcagggc agtagtgatc 650461 gatcgcggcg cccaggaact gaaacgcttg gccctgggtg gttccgcgcg gcaggttgcg 650521 ttgcaggaaa gtggccgact tgtacgcatc gccgtcaacg cctctgctca gccgttcgca 650581 gctgatcttg gcaagccaag cgttgtagtc ctgcgggccg tagatcccga agcgatggat 650641 cgtgttgttg aagggggcgt cgtagtcgtc ggcctgcgcc ggcgctgcca aactaacggc 650701 agccaccgtc atgccgacga caacagccag ctttgttccc ttcattagcc ggactatacg 650761 cgtcgtttgg gtgcgccgtc agcccaggtg ggccgagagc agccagccac cgatcgactg 650821 cagcccgttg ggctcttcgc ggatgtccag gagtgcgggc atgccggcga agccggccgg 650881 gaacctcgcg tacagccgcg cgggcttgat ctcatcgatg atccagtact tggacaccgc 650941 cgcgcgcagc tcgtcctcgg tgaccgcatt gatcggcccc tcgggtatcg ccgcccggtc 651001 gaataccaac acgaagtagg aggcgcccgg tgccgccgca cgcacgatcg attgcagata 651061 gccctcccgg gactcgaccg gcatggagtg gaacagcgtg ctgtcgacga tggtgtcgaa 651121 cctgccgtca tagccggtaa acgaactggc gtcggccacc tcgaagctgg cattggccag 651181 gccgcgcttc gctgcttcat gccgagccag ttctacggcg gcgggggaga ggtccagtcc 651241 gaccgtggtg tgtccccgtt cggccagtgc cagcgaaatc gcggcctccc cgcagcccac 651301 gtcgaggacg tcgccgcgga acttgccctg cacgatcagg gcggccagct cgggctgggg 651361 ttcgccgatg ctccatggcg gtcggactcc ctccccgaag gcgacggatt caccgcggta 651421 ggcggattcg aactcaagat ccagcgattc agtcatgtgt tcatatatat caacggccct 651481 gatatatgtc aacacagttg acattcgcgc acccttggtt gccggccgtc agctgaacgg 651541 cggtcgtcga tcgacgagcc gggacaattg accgccaccg cgccacaccc gcgccaccca 651601 gtcgcggtcg tcgtcggtga ccagattgga catcacccgc acggcgatgt tcatcaatgc 651661 ggtggagcgc atcgtgatgg gcccagtcgt gggtaggaac cgtgggaagg tcagtaacaa 651721 cgctagccgg cgcgcaaccg agaagccgcg accgtagcgg tcggccagca gcgacggcca 651781 cagccgtgcc aggtcacgcg aatccagcag ttcggcggcc agccgcccgg tttccagccc 651841 gtagtcgatg ccctcgccat tgagcgggtt gacgcaggcc gcggcgtcgc cgatgagcat 651901 ccagttggac ccggccactc cagaaaccgc gccgcccatc ggcaacagcg ccgacgacac 651961 cgcgcgcggc tggccggtga agccccactc gtcacggcgc aggtcggtgt agtaggagat 652021 cagcgggcgc agggccagat cggctggccg tcttgaggtc gacaacgctc ccacgccgat 652081 gttcacttcg ccgttgccca gcggaaagat ccagccgtag ccgggtagca cggcgccgtc 652141 gggggagcgc agttccagat gcgacgtcag ccacgggtca tcgctgtacg ccgtgctcag 652201 gtacccccgg accgcgacgc catagaccgt ctcccgatgc catcgccggc ccagcttgcg 652261 tcccagcggg gatcgggccc cgtcggcaac gatcagctgg cggcagccca cctcagtgcc 652321 gtcggccagg gtcagcgata ccacccgcct cgatgaatca tggtgaacag caacggcttt 652381 agcgccaagt agcatgcgcg caccggtgtc ctcggcgacc tttcggatcc ggtcgtccag 652441 ctcgagacgg gccaccgcgc tgccgtacga cgggaagctc ggaccgggcc agtccacttc 652501 cacctcgcct ccgaagccgc tcatccgcaa cccacgatgc cggatgtggt ccgccagcca 652561 cttacctagt cccagctggt gcagttcggc gaccgcgcgt ggtgtcagcc cgtcgccgca 652621 aggcttgtcg cgggggaagg tggcggtgtc gatgacgagg acgtcgcggc ccgcgcgggc 652681 agcccaggcg gccgcagctg acccggccgg tccggcgccc acgaccacca cgtcggcact 652741 gtcatccacg ctcaccagta tgttggtcga gtgaggactc cggcgacggt ggtggcaggc 652801 gttgacctgg gcgacgctgt ctttgccgcg gccgtgcgtg ctggtgtcgc gcgagtcgag 652861 caactcatgg acaccgagct gcgccaggcc gacgaggtga tgagcgattc gctgctgcac 652921 ttgttcaatg ccggcggcaa gcggttccgt ccactgttca ccgtgctgtc ggcgcagatc 652981 gggccgcagc cggatgccgc agcggtgaca gtcgccgggg cggtgatcga gatgatccac 653041 ctggcgaccc tctaccacga tgacgtgatg gacgaggccc aggtccgccg cggcgcgccc 653101 agcgccaacg cgcaatgggg taacaacgtc gcgatcctgg ctggcgacta cctactggcc 653161 accgcatcgc ggctggtggc gaggttggga ccggaggcgg tgcggatcat cgccgacacc 653221 ttcgcccagt tggtgaccgg gcagatgcgt gagacgcgcg gcacgtcgga gaacgtggac 653281 tccatcgagc agtacctgaa ggtggtccag gagaagaccg gcagtctaat cggggcggcc 653341 ggccggctgg gtgggatgtt ctccggtgcc accgacgaac aggtcgaacg gctgagccgc 653401 ctcggcggcg tggtgggcac cgcgtttcag atcgccgacg acattatcga catcgacagc 653461 gagtctgacg agtcgggcaa gctgcccggt accgatgtgc gcgaaggagt acacaccctg 653521 ccgatgctct acgcgttacg ggaatcaggg cccgattgcg ctcggttgcg cgcactgctg 653581 aacggaccgg tcgacgacga cgccgaggtg cgcgaggcgc tgacattgtt gcgggcgtcg 653641 ccgggcatgg cccgggccaa agacgtcctg gcgcagtacg cggctcaggc acgtcacgag 653701 ctggccttac tgcccgacgt cccgggacgg cgtgccctgg cggcgctggt cgactacacc 653761 gtgagccggc acggctaggt tgcccggcca ggctcgattg cggaaccagc ggatacccct 653821 caggcgttga accagcagta atctcccaag ttgaggtgtt ctaggaggac acgcactgat 653881 gacttggcat ccgcatgcca accggctgaa gacgttcctg ctgttggtcg gtatgtccgc 653941 gttgatcgtg gccgtcggcg cgttgtttgg caggacggcg ctgatgctgg cggcgctgtt 654001 cgccgtcgga atgaacgtct acgtctactt caatagcgac aagctggcgc tgcgggcgat 654061 gcatgcgcaa ccggtttccg aactgcaggc gccggcgatg taccggatcg ttcgagagct 654121 ggcgaccagc gctcaccagc cgatgccccg gctgtacatc agcgacaccg ccgcacccaa 654181 cgcgttcgcc accggccgca acccgcgcaa tgccgcggtg tgttgcacga ctggcatcct 654241 gcgtatcctc aatgagcgtg agctgcgtgc cgtgctgggc cacgagctgt ctcacgtcta 654301 caaccgcgac atcctgatct cttgtgtggc aggtgcgctg gcagcggtga ttaccgcgct 654361 ggccaacatg gccatgtggg ccggcatgtt cggcggcaac cgagacaacg ccaatccctt 654421 tgcactgctt ctggttgcgc tgctgggccc gatcgcggca accgtgatac ggatggccgt 654481 gtcgcgatcg cgggagtacc aggccgacga gtcgggtgcc gtcctgaccg gggacccgct 654541 ggcgttggcg tcggcattgc gcaagatctc cggcggcgtc caggcggcgc cgctgccgcc 654601 ggagccgcag ctggccagcc aggcgcacct gatgatcgcc aacccgttcc gggcgggtga 654661 gcggatcgga tcgctgtttt cgactcaccc accgatcgag gaccgcattc gccggctgga 654721 ggcgatggcg cgcggctgat aactgtgggt atcgagatgc catcggtgat gagtcaggcg 654781 ccgctatcga ggaggcggtc gatcagttcg tggctggcat gccggcgtgc ggcgaggacg 654841 cgcccgtccc actgaccgaa cgcaacagcc gaactatcgt tgtgcggaca tcaccggcat 654901 gcgtgccggc agcggtggca agcctaaaac cccgagccgt gcacctcgtg tccggggacc 654961 tcggcgatca agcctctata cgcctgctcc accgtcgaac catggttgat gacggcgtcg 655021 acctcgcggg cgatcggcat gttcagcccg aactcgttgg cgaactccat caccacaccg 655081 gcagctttga cgccctcggc gacctggctc atcgatgcga tgatttcgtc gatcggcttg 655141 cctgcgccga gttgttcgcc cacatgccgg ttgcggctgc gttggctggt gcaggtgacg 655201 atcaggtcgc cgagaccggc cagtccgggg aacgtttcgc ttttcccacc cattgccaca 655261 cccagcttcg tcatctcgcg cagcgcgcgg gcgatcacca gggcgcgggt gttttcgccg 655321 atacccagcg aatagcccat cccgaccgcg atggcgaaga cgttcttgag ggcgcccgcc 655381 gtctcgacac cgacgacgtc gtcagttgtg tacacgcgga agcgccgggt gcgaaacatt 655441 gctgatagcc gggtcgccag gtgctggtcg ggcatggcca gcaccgccgc ggccgcgtag 655501 ccctcggcca cctcgcgggc gatgttcggg ccggccagga tgcctgccgg atgaccgggc 655561 agtacctcct cgatgatctg cgacatccgc atattggtgc cctgttcgag ccccttgacc 655621 agggacacca ctggcaccca gggtcgcagc tctttgctca gctcgacaag cactccgcgg 655681 aaaccgtgcg agggcacccc catgacgacg acgtcggcgc agttggcggc ctcggtgaag 655741 tctgtggtgg cgcgcagggt gtcgctgagc accacgtcgt tgccgaggta tcggctattg 655801 cggtggttgt cgttgatgtc ctgcgcggtg accgccgagc gcacccactg caaggttggt 655861 ccgcggcgcg cacagatgga ggcgacggtg gtgccccagg aaccgccgcc gaggacaacg 655921 actttgggtt cgcgcttgtt ggctgccatg gcgttcagcg tattgcggca accggacatt 655981 tgatatccgt cgacgaaccg caggagcaat catgccgcgc cgaacaccat tgcctcctcg 656041 atgcggtcga atcggtagtc gatggcgtcg gccaagtagt tctgtcgtac attccacggc 656101 cgcttggtgc cggacttggg cagcgcgtac ggcgcccgct tcacatagcc ggcctgaatg 656161 tcccaggacg gtttctcgtc catcggctcg tcgcccaggt gcggggcggc gcgcgtgtgt 656221 ccatgggcgg ccatgtgtgc cagtagtttt gccgtcgccc gggccgtcat gtcggcgcgc 656281 agcgtccagg acgcgttcgt gtaacccaca caccagaaca ggttgggcac gtcttcgagc 656341 atgtgcgcct tgtagacaaa gcgatcccga gggtcgatct cgacgccgtc gaggctgatc 656401 gcggccccgc caagcgcttg caactgcagg ccggtggcgg tgacgataat gtccgcatcg 656461 aggtgcccac cggatttgag tgcaataccg gtggcgtcga agtggtcgat atggtcggtg 656521 accacctcgg cgcggccgct ggtgatggcg ttgtacaggt cggcgtccgg gatcaggcac 656581 agtcgctgat cccacgggtt gtaccgcggc gtgaagtggg tttcgatgtc gtagccctcg 656641 ggcagatttt tgatcgcggt acggcgcagc agccatttca cgaacaccgg tgtcttgcgg 656701 gacaagaacc agaacaccgc ttccaataac gcgttgtaca ttcggacaat caagtgagaa 656761 gttttgggag gcaacgcttt acgaacaacg gcggcgaacg tgctgtattt ggatgccgag 656821 atcaggtagg tcggggatcg ctgcagcatg gttacctttt cggcccggtc ggtcagcgag 656881 gggatcagtg tgaccgcggt ggccccgctg ccgatcacca cgatcttctt gccggtgtag 656941 tccagatcct ctggccagtg ctggggatgc actaccgcgc cgccaaactt ctcgatgcct 657001 ccgaagtcgg gggtgtagcc ctcgtcatag ttgtagtagc cgctgccgaa gaacacgaac 657061 cggctgcggt agtgcttgtg cacgccgttc tgctcgaagg tgacggtcca ggtatcggtg 657121 gatgagtccc agtccgctgc gcgaacgtag ctgttgaact cgatgtggcg atcgatgccg 657181 tacttgtggg ccatgtcggt gaggtactcg cggatgtggg cgccgtcggc gatgccttct 657241 tcgcgggtcc acggctcgta gggaaacgac agcgtgaaga tgctgctgtc ggagcgcacg 657301 ccggggtagc ggaacagatc ccaggtgccg ccgatccgcg cacgcctttc caggatggtg 657361 taggtcagct gcgggttgcg ttcgatgatc cggtaggccg cgcccagtcc ggagatgccg 657421 gcgccgacga tgacgacgtc gacacagccg gcgtttggag tcacgctcat cgtgaacctc 657481 gcttgaaatc ctggatcagc gaccagggta gccaggacat ccagccagcc cctccagatc 657541 gccgcgacta gcggtagttc acaaactgca atgccacatc caggtcggcc ttcttcagca 657601 tggcgatgac ggcctgcagg tcgtcgcgct tcttgctggt gacccggacc tcgtcgccct 657661 ggatctgggt tttgacgttc ttggggcctg cgtcgcggat gagcttggtg atcttcttgg 657721 cgttctcgct gctaatgccc tgtttgaggg cgccggtaac tttgtacgtc ttacccgagg 657781 cctgcggttc tccggcctcg aaggccttca gcgagatgtc gcggcggatc agcttctcct 657841 tgaagacgtc gacggcggcc ttgacacgct cctcggtgga cgaggtgagc tcgacggcct 657901 cgtcgccctt ccacgcgatc ttggtgtcgg tgccgcggaa gtcgaagcgc gtggccagct 657961 ccttggcggc ctggttgagt gcgttgtcga cctcctgccg gtcgaccttg ctgacgatgt 658021 cgaacgatga gtccgccatt cggttcgtcc ctccttcgcg agatagccgt gtgtgctctg 658081 tctacccggt cgttgtaccc tgctaggcgg caggttgccc gagcggccaa tgggagcgga 658141 ctgtaaatcc gtcgcgaaag ctacgcaggt tcgaatcctg cacctgccac cacggtcaag 658201 ctggtatccg ggcatgggcg ccgggcatgg ccacgcccgc gcgttggtgc cccaacgtcg 658261 cctacggtcg gtagacagcg gcgcgacacc cgcactccaa caatttcggg aggtcaagtg 658321 gtggagttga gcccggatcg gatcatggcg atcggcggcg ggtacggccc gtctaaggta 658381 ctgcttaccg cggtcgggct tgggctgttc accgaacttg gcgatgaggc catgaccgcc 658441 gaggccattg ccgaccgcct cgggttgcta aagcgaccgg cgattgactt cctcgacgcc 658501 ttggtctcgc tggacttgct ggcgcgagac ggcgacggac ccgggtccca ctaccgcaat 658561 acaccggaga cagcgcactt tctggacgag gcccgtccca cctacgcggg cggcctgctg 658621 aagatctgga acgaacgcaa ctaccgcttc tgggcggatt tgaccgaggc gctcaagacc 658681 gggaaggcac aaagcgaggt caagcaaacc gggcggccct tcttcgaggc gctctatgca 658741 gatcctcggc ggctcgaggc gttcatggcg gctatggacg cggcgtcgcg acgcaacatc 658801 gagctcctcg cgaaacgctt tccgttcgag cgctaccggc gtctctgtga cgtgggctgc 658861 gcggacggtc tgttgtcacg aatcgtcgcg gcggctcacc cgcacttgca gtgcgtcagc 658921 ttcgacttgc ccgcggtgac cgagatcgct cgacgcaagc tgacagccga gggtttgggt 658981 gagcgggtgc aggcgtgcgc cggtgacttt ttggccgacc ctctgccggc ggccgatgtc 659041 atcacgatgg gccagattct gcacgactgg aacctcgacc gtaaacagca gttggtcgct 659101 aaggcctacg aggccctgtc caaggagggg gctttcattg tgatcgagac attgatcgac 659161 gacgcgcgac gcgaaaacac aaccggcctg atgatgtcac tgaacatgct tatcgagttc 659221 ggtgacgcgt tcgactactc cgccgccgac ttccgggggt ggtgtggcga ggcgggattc 659281 cgttcgttcg aggtgatccc gcttgccggc ggctccagcg cggcggtggc ctataaatag 659341 tgggcaatga catggtgggt ggccgaccaa cgtgaactga ggacggcaaa tcggcctcag 659401 ttcacgctcg gcgctttgag caacaaattg aacacataga atcgtgtcga tgagcggcac 659461 atcgtcgatg ggattgccgc cgggacctcg actttccggc tcggtgcagg ccgtgttgat 659521 gttgcgccat gggctgcgtt ttttgacggc ctgtcaacgc cgttacggca gtgttttcac 659581 gctgcatgtc gcggggttcg gccacatggt gtatctgtcc gatccggccg ccatcaagac 659641 agtgtttgcc ggcaacccga gtgtctttca cgccggcgaa gccaactcga tgttggccgg 659701 actgctcggc gacagctcac tgctgttgat cgacgacgac gtgcaccgcg accggcgtcg 659761 cctgatgtcg ccgccgttcc atcgcgacgc ggtcgcgcgc caggccgggc cgatagccga 659821 gattgccgcc gccaacatcg ccgggtggcc gatggctaag gcgttcgcgg tggcgcccaa 659881 gatgtctgag atcacccttg aggtgatcct gcggaccgtc ataggcgcca gcgatccggt 659941 ccggctcgcc gcgctgcgca aggtcatgcc gcggctgctc aacgtgggcc cgtgggcgac 660001 gctcgcactg gccaacccga gcctgctgaa caatcggctc tggagcaggc tgcgacggcg 660061 gatcgaagaa gccgacgccc tgctgtacgc cgagatcgcc gaccgccgag ccgatcccga 660121 tctggccgca cgcaccgaca cgctggccat gctggttcgg gccgccgacg aagacggacg 660181 gacgatgacc gagcgcgagc tgcgcgacca gctgataacg ttgctggtcg caggtcacga 660241 caccaccgcg acgggactgt cgtgggcact ggagcggttg acccgccacc cggtcaccct 660301 ggccaaggcc gtgcaagcgg ccgacgccag cgcggccggc gatccagccg gcgacgagta 660361 cctggacgcg gtggccaaag agacactgcg gatccgcccg gtggtgtacg acgtgggccg 660421 ggtcctcacc gaggcggtgg aggtggccgg ttaccggctg ccggccgggg tcatggtggt 660481 cccagcgatc gggctggtgc acgcgagcgc gcaactgtat ccggatccgg aacggttcga 660541 ccctgatcgg atggttggcg ccactttgag cccgaccacc tggttgccgt tcggcggcgg 660601 caaccgccgc tgcctcggcg ccacctttgc catggtcgag atgcgggtcg tccttcggga 660661 gatcctgcgc cgcgtcgagt tgagcaccac cacgacctcc ggcgaacggc cgaagctaaa 660721 gcacgtcatc atggtgccgc accgcggcgc gcgcatccgc gtccgggcaa ccagggacgt 660781 ttcggccacg tcgcaagcga cagcccaggg tgccggatgc ccagccgctc gcggtggcgg 660841 gccgtccaga gccgtcggca gccagtgacc agctggggta tccgcatggg gtcgcccagc 660901 gggtcccgag gggacttttg gccaccggcg ctggtggcct actgccctcc cgccgttgcg 660961 ccgggtgcgt gcacgattga agtccccaag gaagggacgc tcatgaaggc aaaggtcggg 661021 gactggctgg tgatcaaagg cgcgacgata gatcaaccgg accaccgagg gttgattatt 661081 gaggtgcgct catccgatgg ttcgccgccg tatgtggtgc gctggctcga gaccgaccat 661141 gtggcgacgg tgattccggg tccggatgcg gtcgtggtca ctgcggagga gcagaatgcg 661201 gccgacgagc gggcgcagca tcggttcggc gcggttcagt cggcgatcct ccatgccagg 661261 ggaacgtagg cgattcgctc aagcgacgaa gtcggtgggt gtcagctggc cggcgaaagt 661321 ccggcgccgg gatggaacgc tggtgccgtt cgacatcgcg cggatcgaag cagcggtgac 661381 gcgggcagcg cgcgaggtgg cttgcgacga ccccgatatg ccgggcaccg tagcgaaagc 661441 cgtcgccgac gcactcgggc gcggtatcgc tcccgttgag gacattcagg actgcgtgga 661501 ggcccggctg ggggaagccg gtctggatga cgtggcccgt gtttacatca tctaccggca 661561 gcggcgcgcc gagctgcgga cggctaaggc cttgctcggc gtgcgggacg agttaaagct 661621 gagcttggcg gccgtgacgg tactgcgcga gcgctatctg ctgcacgacg agcagggccg 661681 gccggccgag tcgaccggcg agctgatgga ccgatcggcg cgctgtgtcg cggcggccga 661741 ggaccagtat gagccgggct cgtcgaggcg gtgggccgag cggttcgcca cgctattacg 661801 caacctggaa ttcctgccga attcgcccac gttgatgaac tctggcaccg acctgggact 661861 gctcgccggc tgttttgttc tgccgattga ggattcgctg caatcgatct ttgcgacgct 661921 gggacaggcc gccgagctgc agcgggctgg aggcggcacc ggatatgcgt tcagccacct 661981 gcgacccgcc ggggatcggg tggcctccac gggcggcacg gccagcggac cggtgtcgtt 662041 tctacggctg tatgacagtg ccgcgggtgt ggtctccatg ggcggtcgcc ggcgtggcgc 662101 ctgtatggct gtgcttgatg tgtcgcaccc ggatatctgt gatttcgtca ccgccaaggc 662161 cgaatccccc agcgagctcc cgcatttcaa cctatcggtt ggtgtgaccg acgcgttcct 662221 gcgggccgtc gaacgcaacg gcctacaccg gctggtcaat ccgcgaaccg gcaagatcgt 662281 cgcgcggatg cccgccgccg agctgttcga cgccatctgc aaagccgcgc acgccggtgg 662341 cgatcccggg ctggtgtttc tcgacacgat caatagggca aacccggtgc cggggagagg 662401 ccgcatcgag gcgaccaacc cgtgcgggga ggtcccactg ctgccttacg agtcatgtaa 662461 tctcggctcg atcaacctcg cccggatgct cgccgacggt cgcgtcgact gggaccggct 662521 cgaggaggtc gccggtgtgg cggtgcggtt ccttgatgac gtcatcgatg tcagccgcta 662581 ccccttcccc gaactgggtg aggcggcccg cgccacccgc aagatcgggc tgggagtcat 662641 gggtttggcg gaactgcttg ccgcactggg tattccgtac gacagtgaag aagccgtgcg 662701 gttagccacc cggctcatgc gtcgcataca gcaggcggcg cacacggcat cgcggaggct 662761 ggccgaagag cggggcgcat tcccggcgtt caccgatagc cggttcgcgc ggtcgggccc 662821 gaggcgcaac gcacaggtca cctccgtcgc tccgacgggc accatctcac tgatcgccgg 662881 aaccaccgcg ggcatcgagc cgatgttcgc tatcgcgttc acccgcgcca tcgtcggccg 662941 gcatctgctg gaggtcaatc cgtgcttcga ccgactggcc cgcgatcggg gcttttatcg 663001 tgacgagctg atcgccgaga tcgctcagcg tggcggagtc cgtggctatc cgcggctgcc 663061 tgctgaggtg cgggccgcgt tcccgaccgc ggcggagatc gcgccgcagt ggcatctgcg 663121 catgcaggcc gcggtgcagc gccacgtcga ggccgccgtg tccaagacgg tcaacttgcc 663181 cgccacggcg acggtcgatg acgtccgcgc catctatgtg gccgcctgga aggcaaaggt 663241 caagggcatc acggtgtatc gctacggcag ccgggaagga caggtactgt cctacgccgc 663301 gccgaaaccg ctactggcgc aggctgacac ggagttcagc ggcggctgtg cgggccgctc 663361 ctgcgagttc tgacggcggc tcccatggcg cgagcagacg cagaatcgca caaaatcagc 663421 gattttgatg cgattctgcg tctgctcgcg cagggatcgc agggatcacc ccggccggct 663481 agcggtttag ccgcttgggc ctgggccgca caagtggtcg atgaaccaat cgcacgccag 663541 cttggcaacc tgttccagcg tgcctggttc ttcgaatagg tgtgtggcgc cggggaccac 663601 ggtgagttgg catttcccgg gtattaccgc ttgcgctcgt tggttcagct cgaggaccac 663661 ctggtcgcgt ccacccacga tcagcagcgt cggtgccacc acgctcccca gcgaatcacc 663721 cgcgagatcg ggccggccgc cgcgggacac caccgcccgc acgttcacgc gcggatcggc 663781 ggccgcgacc agcgccgcac ccgctcccgt gctggcgccg aagtagccga ccggcagcga 663841 tgcggtgtcg ggctgggtgg ccaaccaacc ggtcacgtcg atgagtcggg aagcgagcag 663901 ctcaatgtcg aagacgttgg cgcggttgcg ttcttcttcg ggcgtgagca agtcgaataa 663961 cagcgtcgca aacccggccc cggtcaagac ctctgcaacg taccgattgc ggatactgtg 664021 ccggctgctg ccactgccat gtgcgaaaac cacaattccc ctgggttttt cggggacagt 664081 caggtgccct gccaccggta ctggaccggc aacgacctgg acctcctcat cgcgaagcgg 664141 tgggtcagcg gcggcatcga tcgcacctgc ctcggcgaag tcgcggtgag cacgatccag 664201 aaacgccacc acctcgtcgt cggaggtctg ggtgaagttg cggtaaccct gcccgacggc 664261 gaagaacaac gccggcgtcg ccaaacacac cacctcatcg gcgtacccgg cgaatctcgc 664321 cacgatgtcg tctgggccga tcgggaccgc cagcaccacc ttgtccgcac cgtgcgcccg 664381 ggcgacctgg cacgccgcct tggccgtcgc tccggtggcg atgccgtcat cgacgatcac 664441 cgcgatccgc ccggtcaacg ggatgcggtc acgcccgcgg cggaagcgtt ccgcgcggcg 664501 ttgtagctcg atcagctgct tgcgttcgac cgcgtccatg gcggcagcat cgaggtgtgt 664561 cccgcggacg acgtcgtcgt tgagcacccg cacgccgtcc tcaccgatgg cgccgaaagc 664621 caattcgggt tggaacggca cgccaagctt gcgcacgacc aggacgtcga gtggcgcttg 664681 cagtgacttg gcgacctcaa aggccaccgg taccccgccg cgcggcaagc caaggacgac 664741 gacggccttg ccggatagct gcgccaggcg ttgcgccaac tggcgtccag cgtcgccacg 664801 atcgtcaaag agcttcatct gccgagtgtg tcgccatctc atggctccaa atatggaatt 664861 aggtccctgg gccgactgac gacagtccct cagcgaccgg attgcgcatc ccgccttgta 664921 cgctactccg caaatcccgg gcttgcgtcc gcggaagcga actcggcggc gctacggtgg 664981 tggctcactt cggccgtgcg cactcggatc gacgggccga tggcggccgg gcccgcgcgc 665041 ttcataggtc atcggattga ggtgatcgac tcggcgatga gtgttcgaaa gatgactcag 665101 tggtgtgcct tccgtcggtg agctgcacga catatgtgcg gtcgtcggcg tcgtactcaa 665161 ccgtgccgcc cgagacaacc atcccaacca acgcattgcc gtcctggaag tagtacgcgt 665221 tcttttctcg gcgcatggaa tccaggtggc aaccgggcac tatgaggacg ctcctgcggg 665281 gatcgtcggt gaagaccaga atccgcgcgc cccgtttctg ggctagggga tgcttcgtag 665341 gcttccgttg ccgcatgtgc cgcttgatgg cgtgctcacc cattttggtt ttgccctctc 665401 acttgacgct gcgttgccta gcatgccaac cggctagctt cgcggaacgt gctccccggg 665461 gtgcgggcat tcaccgggca cgtgaatcag tactgcgccg tcatcgacga tcccggcttg 665521 accgcggcga tgggcggtga tgtatagcca tccgggttcg atggtttgct tgcagcgtgt 665581 acattgcggg tcggcggcca tgtgctcctc gcttccctag cctcacggtt tgcgccgtcg 665641 gtcgacaggc gaactgctct tcgccgatgt acgtcactgc ttcggcggct aaaccccttg 665701 tccagcacga caagtccaac cggcctgcgt cggcggagtt tggcctcgtg ctcggctggc 665761 ggtgctcatg gtgtccctcc ggaactcggg gtaacggcaa gctttcgatg cgtcggcagt 665821 ccgaaatcta gagacgacga acttgttgtt ctagggtcgt ttggccttcg ccccgacgac 665881 gttggacccg gggtgggctt cggccgtgtc ggcgtgccgc agccgggcga gttcgcccac 665941 gatcctgtcg ctgaccgcca ccggatacga gtagccggtg tcctcgagcg agcgcagctc 666001 cggcgggagc gcgtcgatct gctggcgggc ccagtcccgc gcgccgtcca atgtgggtgc 666061 atgctgccgg atgcgtcggc cgttggtcat gatgggcacc agcaacgggt ccccgggaag 666121 gttttcaccg tgctcgccga gcgtgtcgcc gcaaaagact ccgtgctcga gcttacggaa 666181 cacctgcttg cgtcccgggt agatcacctt gccgctggag aacttggtgc gcccgctgcc 666241 gtcgtatgcc accagcttgt aggccatgtc cagcgcgggc gcgtcttgag ccacgacgag 666301 ctgggtgccc acgccgaagc cgtcgatcgg acagcgggca gccaaaagcg cggcgatgcg 666361 gttttcgtcg aggcccgacg acgcgaagat ctcgacctgc tcgagaccgg cggtgtcgag 666421 ccgtgcacgg gtcgccttgg acagctcatc gaggtcgccg gaatccagcc ggaccgcgcg 666481 cacatcgaag cgattgccca gccgcttggc caactcgatg acgtgatcga cgccgcgtag 666541 cgtgtcgtag gtgtccacga gcagcatggt ggctgggtag agccgggcga acgcctcgaa 666601 cgcggccacc tcactgtcga aggcttgaac aaagctgtgc gccatggtgc cgaacgtcgg 666661 gatcccatat tggcgggccg cgagcagatt cgacgtgccc gcagcgcccg cgagataact 666721 ggtgcgcgcg accttgcagg ccgcgtcggt gccgtgagcg cgccgcgcgc cgaaatccac 666781 caccggtcgt ccgcgcgcgg cggcgaccac ccgcgcggcc ttgctcgcga gcacgctttg 666841 cagatgaatc tggttcagca caaacgtctc gacaagctgg gcctcgatga ttggcgcgat 666901 cagctggacc gcgggttcgt tcggaaaaat cacggttcct tccggcgcgg cccagacatc 666961 tccggtgaaa cgcactccgg ccagccacct caggaactcg tcggaaaact ggcccaggcc 667021 acgcaggtaa cgcagatcct gctcgtcgaa tcgaaacgct tcgaggaact cgaccacatc 667081 ggccagcccg gcggccatga tgtaggacct gccaggcgga agcttgcgga agaatatctc 667141 gaaaaccgct gtgcccgaca ttctttcggc ccagtaggcc tgggccatcg tcacctcgta 667201 caggtcggtg aacagcgcgc cgacgtgttg gcggatcgcc atggttgccg gttactcctt 667261 gctcgttagg ttggcagcgg gaacgacctc cagcaggttg tcgggtcgag tcacgactcg 667321 aatcccgaac cggcggctga tgcgctcaat ggtgttgcgg agccattcgg tgtcggtctg 667381 ggaggcacgc tgtaggcgca tccgcgacac tcgcagtgga agcatctgca gcgagatcag 667441 gttcccgctg gcgggatcgg tgacggtcag atacagcagt cgcagttcac tgcggaacga 667501 ctcgtgcccg ccgatgcctt cgtagtcgtc aacgacgtca ccgcatccgt acaggatcgg 667561 tttaccgcga tatatctcga ttggccgcgg atggtgcgag gaatgtccgt ggaccatgtc 667621 gatgccggcg tcgatcagtc ggtgcgcgaa cgcgacgtcg ccgggtgcgg tcgcatagcc 667681 ccaattggat ccccaatgca tcgagactat ggcgatatcg ccggggcgtt tgtccgccag 667741 cacctgtgcc gccacatcgt cggcgacgtc gcgttgcgcc ggatcccgga tcaaccacac 667801 tccgggccgg tcgcggcggg cggcccagga ttcggggacg ccgctggatt ccgccgctac 667861 cgagccgacg atcacccggc gttcatggcc aaccgtgact agcgccgagc ggcgagcggc 667921 gagcaaatcg gctcccgccc cgacactctg gatccccgca ccggcgagag ccgcgaccgt 667981 atcggtcagc ccctggtagc cgaaatcgag aatgtggttg ttggccagcg cgcacacgtg 668041 cggccgcaat gccgtcagcg ccggcacgtt atccgggtgc atccggtagc agaccggttt 668101 gcggtcggcg aattcaccgt cggcggtgat cgtcgtctcc agattgatca aacagacgtc 668161 ggtcgcggtg ttctcaagga ccgccaacgc ctcgccccag ggccagcgcc aatccacggg 668221 gagcggaatg cgcccgttca cccgctcggc caggcgaaca tagccggtcg catcccgcat 668281 ataccgttcg cgcaattgcg gtttgccggg atgaggcagg atctgatcga cgccacggcc 668341 gagcatgacg tcaccgccca gcagcaccgt caccacatca ggattgccag ccactccgga 668401 ccaccgccgc cttcaggtaa tcgccgtaac acgcacccta tggcgtacat tgcacgtcat 668461 acgatcggcc ggcggcggcc tcgtgggtgg ggccgaaggt cctcaagacc gcgcccaaag 668521 gtcacattgc cggcgacaaa ccgtgcctac ctggcggaga ggtgcccgtc ggcggtggtc 668581 accaggtgta gtcgggcagc tcgaagtcgt cacgcacgct gccggcgaac agcgtcgcca 668641 gcgggccgaa gttcatcgtg cgcatcgcaa cgttgcgaaa ccacaggccg aatcgggttc 668701 gggtggcgaa aaaccagatg aacttcgccg cactggcttg cttgccctcg atgaagggac 668761 gcaggcgctt ctcgtaggcg tcgaaggcgc gacggtggtc gcccccggcg cgggcgagct 668821 ccccggccag cacgtaggcc tcggtgatcg ccaggccggt gccctcgccg ccgagcagcg 668881 agatgcaccc ggccgcgtcg ccgatcagca gcacccgacc gcgtgaccag cggtccatcc 668941 ggatttggct gaccacgtcg aagtacaggt cctcgacgtc gtcgagggcg gccagaatgt 669001 cccggctttc ccagcccacg tcgccgaatt ggtcgcgcag ctcatctttg ggtgccacgc 669061 cggggttgtc gtgttcggcg cggaagacga acaagaacat ggtgcggtcg ccgcgcagcg 669121 cgaaccgcgc cagctgtcgg tcgacggtgt tgtagaggac atagctgcgc tcgtcgcggg 669181 gccggtagcc gtcgaccacg caggccgcga ccttgcagcc caggtagtgc tcgaaatccc 669241 gctccggccc gaagaccagc cggcgcacgt tggagtgcag tccgtcggca ccgatgacca 669301 ggtcgaaatc gcgcggggcg gtcctttcga aggtgagccg gacgccgtcg cggtgctcgt 669361 cgatggtggc gatgctgtcg tcgaagatcg tttccacttg gtcttcgatc gtcgtgtaga 669421 tcgcggcggc gagatcgccg cgcggcaagc tggtgaagtc gtcgccgacc atgcggcgaa 669481 agacgtcgac gcccaggtcg gctttgacct tgccggtggg accgacggag cggacgtgtt 669541 ccatgtggta acccgccgct gcgatctggt ccgtgatgcc cattcgtttg gccacctggt 669601 agccgacgcc ccagaagtcg atcatgtagc cgccggtgcg gaacttcggc gcccgctcga 669661 tcactgtcgg ggtgtggccg gtgcgctgca gccagtgggc gagcgccgct cctgccacgc 669721 cggcaccgct aatcgctact ttcacactgc aattgtgctc ttcggcaata gtttagaaca 669781 agaccggtcg ctcgttgccc cttgatcaat acgttagtga gcgctaacgt attggcgtgt 669841 gcccgacatg ctggaagtcg cggcagagcc aacccggcgc cggctgctac agctcctggc 669901 accgggtgaa cgcaccgtta cccagcttgc gtcgcagttc acggtcaccc gttcggcgat 669961 atcgcagcac ctcggcatgc tcgccgaagc gggattggtt accgcccgca aacagggccg 670021 ggaacggtac taccggctcg atgagcgcgg ggtgctgcgg cttcgtgcgc tcatggagtc 670081 cttctggagc gacgagctgg accgtcttgt cgccgatgcc gcccactacc cgccgtcaca 670141 aggagactgt gccatgccgt tcgagaaagc ggtcgtcgtg cccttggatc cgaccagcac 670201 cttcgcgctc atcacccagc ccgacaggct tcggcgctgg atggccgtcg ccgcgcgtat 670261 cgagctgcgc accggtggcg cttatcgctg gacggtgact ccggggcata gcgcggccgg 670321 caccgtcatc gacgtcgacc ccggcaagcg ggtggtcttc acctggggtt gggaggacca 670381 cggcgacccc ccgccgggcg ggtcgacggt gaccatcacg ctgaccccgg tcgacggcgg 670441 caccgaggtc cggctggtcc acgacgggct gaccgcgcag caggccgccc ggcacgccaa 670501 agggtggaac cacttcctgg accggctggt cgtcgccggc caacgcggtg acgccggtcc 670561 cgacgaatgg gccgcagcgc ccgatccgct cgacgaatta tcttgtgccg aagcaacatt 670621 ggccgttctt cagcacgtac tgcgcgggat aggcgcctct gacctgacca ggcagacacc 670681 gtgtacggaa tatgacgttt cgcaactggc ggatcatttg ctgcgctcgc tggcgatcat 670741 cggcgctgcg gcgggcgcgc agctggcgcc ccgcgatgtg gacgcgccac tggaaaccca 670801 ggtggccgac gcggcgcagg ccgtgatgga agcctggcgg cggcgtggct tggcgggcac 670861 ggtggagctg aactcgaacc aggtgcctgc gacggtgccg gtcggcatcc tgtgcctaga 670921 atttctggtc cacgcttggg atttcgcgat tgccaccggt tctcaggtga tcgcgtccga 670981 gccggtgtcg gagtacgtac tggcggtggc cggcaaggtc atcaccccgg caacccgtaa 671041 ctccgcgggc ttcgccgcgc cggcggcggt cggttccttt gccccagtcc tcgatcgcct 671101 catcgccttc accggccgcc agccgaccgc aggccacgtg tccgccacct aacgaaagga 671161 tgatcatgcc caagagaagc gaatacaggc aaggcacgcc gaactgggtc gaccttcaga 671221 ccaccgatca gtccgccgcc aaaaagttct acacatcgtt gttcggctgg ggttacgacg 671281 acaacccggt ccccggaggc ggtggggtct attccatggc cacgctgaac ggcgaagccg 671341 tggccgccat cgcaccgatg cccccgggtg caccggaggg gatgccgccg atctggaaca 671401 cctatatcgc ggtggacgac gtcgatgcgg tggtggacaa ggtggtgccc gggggcgggc 671461 aggtgatgat gccggccttc gacatcggcg atgccggccg gatgtcgttc atcaccgatc 671521 cgaccggcgc tgccgtgggc ctatggcagg ccaatcggca catcggagcg acgttggtca 671581 acgagacggg cacgctcatc tggaacgaac tgctcacgga caagccggat ttggcgctag 671641 cgttctacga ggctgtggtt ggcctcaccc actcgagcat ggagatagct gcgggccaga 671701 actatcgggt gctcaaggcc ggcgacgcgg aagtcggcgg ctgtatggaa ccgccgatgc 671761 ccggcgtgcc gaatcattgg cacgtctact ttgcggtgga tgacgccgac gccacggcgg 671821 ccaaagccgc cgcagcgggc ggccaggtca ttgcggaacc ggctgacatt ccgtcggtgg 671881 gccggttcgc cgtgttgtcc gatccgcagg gcgcgatctt cagtgtgttg aagcccgcac 671941 cgcagcaata gggagcatcc cgggcaggcc cgccggccgg cagattcgga gaatgctaga 672001 agctgccgcc ggcgccgccg cccccgcctg cgcccccggc cccgccgcgg ccgtcggcgc 672061 cggggctgcc gaactggccg ggctggccgg attggccgat gatggccagg ggcccgaggt 672121 gtgcggtgcc gccggtgcca ccggtgccac ccttaccgcc agccccaggg atcgggaata 672181 aaccgccggg gtcggcccct ttgccgccgt ccccacctcg cccgcccgcc ccagcggtcc 672241 tgaagccgtc gccaccgtgc ccgccgtccc cgccattccc accggaactg gcatcaaggc 672301 cgtcgccgcc gaagccgccc cttccgccgt caccgccggc gctgacggtg ctggtgccgc 672361 cggcgccgcc catgccgccg gtgccgccgg ggccaaaggc ggagccaagg ccgccactgc 672421 cgccgacgcc accgtttccg gcgcggccgg ccgcccctgt cgcaccggtc gcgcccaggg 672481 tggaaccggt cccgccggca ccgccggcac caccggtgcc gccggtgccg ccggtgccgc 672541 catttccgcc agtcccgcca gtgccagcga ggctgctgaa gagagtgccg tgggcacctc 672601 tgccgccgtc gccgccggtg ccgccggtgc cgccggcgcc accggcccca ccatctccgc 672661 cggcgccttg gctgccgttg ttgcccgttg gcgacagcgc tttgccgccg gccccgccgt 672721 tgccgccgcc gccgccggcg ccgccggtcc cgccaacccc gccggtgcca ccgttaccgc 672781 cgtgaccgtc cgcgccagcg tcgaatgtgc cggtcgcacc ggtggcgccg gtggtgcccc 672841 gcaggcccgt cccgcccgtg ccgccggccc cgccccggcc gccgtcagcg ccgtcgccgg 672901 cgacgctccc accttgcccg cctacgccgc cgtcgccgcc gcggccgccg ctgccggtaa 672961 tggctccggg attgccgtca ctaccggtgc cgccgtctcc gccattgccg cccgctccgc 673021 cgttgccaat ctgcccggcg tttccgccgg cgccaccggt tccgccgtca ccgcccatgc 673081 ccctgctggc attgccgccg ttgccgccgt ggccgccggc cccaccgctg ccgcgcaggc 673141 tgccgttgcc gccgttgccg ccgttgccgc cggccgcgcc gttgccgctg agggcatggt 673201 cgccgttgcc gccgttgccg ccgttgccgc cgttgacatg aatgctgctg cttgagccgg 673261 tcgcaccgaa agtggagccg gcgccgccac tcccgccggc cccgctgggg ccggcgttgc 673321 cgccgttgcc gccgttgccg ccgatgccgt tgttggtgaa cacgctgccg ttagcgccgt 673381 tgccgccgtc accggggtcc ccgccggtgc cgccgctgcc gccgttgccg ccggcgcctt 673441 ggctgccggt tgtgcccgcc ggcccggccc cgcccggccc gccggtcccg cctcggccgc 673501 cctttccgcc ggccccgccg gcgccgccat cctggccgcg ggcacccgcg gtggcgccgt 673561 cggcgccgtc aatgccgcgg ccgccgttac cgccaactcc gccggtccca ccgtcgccgc 673621 cggcaccgcc ggggccttgg ctgccggcga cgccgttggg tgcggccccg ccgtccccgc 673681 cgtccccacc ttttccgccg gtaccgccaa ctccgccggt gccgccgggg tgcccgtccg 673741 cgcccgcgct ggaaccgttg acaccgtcgc tgccggaccc tccagtcccg ccgacgccgc 673801 cggtgccgcc ggccccgccg gtgccaccgt tgcccgccca ggcgccgccg gatccaccgg 673861 ccccaccgtt tccgccggtg ccgccatcca ggccggggtt gccgagcctg cccagaccgg 673921 gcaggccttt gctgccgttg ccgccggcgc cgccggcgcc gccgttgccg accaaaccgc 673981 catcaccgcc cctgccgccg gacgcgccgg tctggccaaa gccggtggca tcggcgcctc 674041 tgccgccgtt gccgccgttg ccgccgctgg tgggggtgtt gccgggtgcg ccgttggcac 674101 cgggggtgga gccgcttccg ccctggccgc cggcaccgcc gacaccggga tcaccgccgt 674161 ggccaccggc gccacctaca ccaccgttga caccgagcgc gccggcggcg ccgtgaccgc 674221 cgttgccagg agtcccgccg ttcccgccgg ctccgccgtc accgccagcg ccctggctgc 674281 cgttctggcc cgaggcggcc aacgcgagac cgccggcccc gccctcgccg ccggctccgc 674341 caggcccacc gttaccgcca ttcccgccgg gtgagcctgc ggccccggga gcggacgcat 674401 tgaagccgat gctgccagca cctccggatc cgccatcgcc gccggccccg ccagcacctc 674461 cggtgccgcc gtcaccggcc tgagttccgc cgttgccgcc ggccccgccg gtgccgccgg 674521 ccccgccggg gcgaccgggc gcttcggatc caaatccgag accgccggcc ccgccgcggc 674581 caccggcccc accggcaccg ccattaccca cctgaccgcc gtcgccaccc ctgccaccgt 674641 tcgcgccggt ctgtccgctg ctgatagcgt cggcgccttt gccgccgtcg ccgccgttac 674701 caccgctggt ggaggtggtg ccgggcgcgc cgttcgcgcc atgcgcgctg ccgccgacgc 674761 tggcgccacc ggcgccaccg gccccaccgg cgcccgggtt gccgccattg ccaccggtcc 674821 cgccggcacc aaggttgtga ccccacgtcc cggtagcgcc gttgccgccg tcaccgggag 674881 ctccgccgtc accgccgcta ccgccagccc cgccggcgcc gtggctgccg ccgaggccga 674941 gcagaccgtg gccgccgccg ggcccgccga ccccgccggt cccgccagcc ccaccattcc 675001 cgccgtttcc gccggcttga ccgtcagcgc ccaagttggt ggcgtgggcg ccgctggcgc 675061 ccgcaccgcc ggcgccgccg ggcccgccct cgccgccggc cccgccgttg ccgccgttgc 675121 ccatcagcac cccgccggcc ccgccggccc cgccgttgcc gccgatcccg ccggccccgc 675181 cagcggtgcc ggatccaccc ggtgtgctgg ccgacgtacc cgtgacaccg gcgatgccgt 675241 tgcctccggc cccaccggcc ccgccgacac cgaacaaccc ggcggtaccg ccggccccgc 675301 cgttgccgcc gaccgccccg gccccgccaa aacccccggc gcctccgttg ccatacagcc 675361 acccgcccgc gccgccgtga ccaccggccc cgccggtggt acccacgccg ccggctccac 675421 cgttgccgcc gttaccgatt aggcccgccg ccccgccggc cccgcctcgt tgtcctggcg 675481 ccccagaccc gccgttgccg ccgttgccgt acaagatgcc gcctggcccg ccggcctgcc 675541 cggtcccggg ggagccgtcg gcgccgttgc cgatcagcgg gcgtccgaac agcgcctggg 675601 tgggcgcatt gaccgcggct agcaaactct gttcaacgtt gaccgcctcg gcggccacgt 675661 acgagctcgc ggccgcggac agggtctgca cgaaccggtc atgaaacgtc gccacttggg 675721 cgctgacggt ctgatattcc tgggcgtgcg tgccgaacaa cgccgcaacg gccaccgaca 675781 cctcgtcggc tgacgcgggc agcactttcg ccaccgcggc cgctgcggtg ttggccgcag 675841 tgatcgtcga accaattttc gccaaatccg ttgccgccgt ggtcagcatc tccggcgtcg 675901 cgattacgaa cgacatctcg ctccccaggt caggtcagcc cggtgttgcc cggcgtggca 675961 aggaattgtg tggctatccc ggcgatctac catgtggagc gaatcttcgg gatcccaact 676021 ccaacgatcc cttgttgacg ctatcgtcaa aagggcaaaa ccccaaactt tacgcgaacg 676081 aactatccac agtgcaccct cgatttccgt cgacacgtgc aaacggccag acctcgacgg 676141 tgctagcccc gcggcgatat tgcaggtctt cgagccggtc gcgccccggg gcgcgaactc 676201 cgttgccctc ccgcgaccct gcgggagagg ataaggaatg gtcggctatg tggatgtccg 676261 ggcatacgcc gagctcaacg agttcgtgga gctgcaggcg cgcggtctga cggtgcgccg 676321 gccgttccgc agccatcaga cggtcaaaga tgtgctggag gcgatgggca ttccgcatac 676381 cgaggtggat ctcatcctgg tgaacggcga tcccgcggac ttttcctacc ggccggtcgc 676441 cggcgaccgc attgccgcct accctatgtt cgaggccctc gacatcgggt cgaccgccag 676501 gttgcgccca gcgccgttgc gtaacccgcg cttcgtcgtc gacgtcaacc tcggccagct 676561 ggcgcggctg cttcggctgt tgggcttcga cacacggtgg tcgagtgccg ccgatgatcc 676621 gacgctggcc gatatcagcc tgggcgagca gcgaattctg ctgacccgcg accgcggcct 676681 gttgaagcgc cgggcaatca cccatggtct gttcgtccac tcccagcacc cggaggagca 676741 ggcgctcgag gtgctgcggc ggctagacct caacgggcgg ctggcaccgc tatcccggtg 676801 tctgcgatgc aatggtgagc tggccgcggt ttccaaagac gaggtgattg gccagctgga 676861 gccgttgacc cgccggtact acgagtcatt cagccgctgc ttcggttgcg ggcggatcta 676921 ctggccggga tcacaccacg cacggttggt tcgcctcgtc gaacgactgc gggaccagct 676981 aactacttcg acctgacccg cacggtggtg cgcgcgtcga tcgtcgccag ctgacacgcc 677041 gaaggtgcaa ccacggcggc atcgagcggc gtgtccccgc caccaatgca cgttcggcgc 677101 ggccggcgca cgctcggcgc ggagctacga attgtcggcc ggagtcaacc gaatggctac 677161 cagcttgagc cggtccaccg cctcggcgaa ctcctcgagc gtgggtatgc gccggtcgcg 677221 gaagctcaac cccagcatgc gttggccacg cttgacgcca taggcctgtg cggcgcgcag 677281 gaaaagttca gaaacgaccg cacggtcccg gatgagctcg ccacgcatag cggtcgtctt 677341 gccgtcgtag acgacctggg cggcggcacc gtcggagaag ttgtgcttcc atccggcctc 677401 ggtcagcgcg tagaggtcgt tgtcgatgac gtgcgcgctc aagggaatcg agaagtgccg 677461 cccagtcttt cgcccggtga agctcaccac catcagctgt gtgcgtagcg ggccggcaag 677521 cggggtgtgc agcagggagc gcaggatcgg gttgacgagg cgaaggaggg ccgccggtgg 677581 gtgtgcgatg tctaccgcat acgactgatc tgtcatgcct tcaccgtaga tccgatcggg 677641 gttcgcggct acgccgacaa gttggtgacg caacaagata tatggcgcca ccggtagtac 677701 catacgtatg tggacaagac gacggtctac ctgccggatg aactcaaggc ggccgtgaag 677761 cgcgccgctc ggcagcgcgg agtctccgaa gcgcaggtaa tccgggagtc catccgggcg 677821 gcggtcggcg gcgccaagcc gccgccgcgc gggggtctat atgcgggttc ggagcccatc 677881 gcgcggcgag tcgacgagct gctggctggc ttcggtgagc ggtgatcatc gacacgagtg 677941 cgctgcttgc ctatttcgac gccgccgagc cagaccacgc cgcagtgtct gagtgcatcg 678001 atagctccgc agacgcgctc gtcgtatccc cttatgtggt agcggaactc gactatctcg 678061 tcgccacccg ggtaggtgtc gatgccgagc tcgccgtcct gcgtgaactc gccggcgggg 678121 cctgggagct cgccaactgc ggtgccgccg aaatcgagca ggccgcccgc atcgtcacga 678181 aataccagga tcagcggatc gggatcgcgg atgcggccaa cgtcgtgctg gccgaccgat 678241 accgcacgcg cacgatcctc accctggacc gtcggcactt ctcggcgctg cggccgatcg 678301 gcggtgggcg cttcaccgtc attccgtaaa ccgcaaccga ttcggtgctg caccgcggcg 678361 tgttcgtctt ccgcgtgcga tccgtccctt agggcgtgat ggtcgtctgc tcgtcgatga 678421 cgttggcggc gtccatcaac gtcatcgtct cgtcgtcgag cgcgtcggcg ttgagttgaa 678481 gcacgaacac cgcaccttgg ctgggaatca ccaccgtctt ctgcgcgacg gtccgcaact 678541 tgccgttctt gctgtatgaa ccaccgagct gccatgctga aaagccgccg agcgtggctg 678601 cacttccgtc gccgctgcct tggaagccgg gcaggttttt caactcgccg ggtgcgaatt 678661 ggaggacctt cgcggggtcg atgtcaccgg tgagtttgga gaggatcgca acgatggtgg 678721 ggggatcgtt gggatcggcg ggctgggtgt agacgatgcc gccatagggt gcgcgggagc 678781 tttccggaag cagccgccaa tcgtcgggca ccggcaggtc gatggtcggg gagccggggt 678841 cgccgtggtg cactggggtc tcctggatgt ggttgtcccg gatatagtcg gcgatggtgt 678901 agttgggccc cgctgcctga gccgaggtgg ttgccgacgt agtcgttgtc gacgtcgtcg 678961 gggacgtcgt ggttggcgac gtggttggcg cgctgtcggt cttgatgttg aaactgcagc 679021 cagccagtgc caggctcagc gccaccgtcg cgacggccgc cgtgaagtgc ttcattgcgc 679081 gctcccgaag attggaccgg cacttccggc cggtgaggtc ggattgagac tagtccaact 679141 ggtgtgcgcg cgaccctatc actgcaatcc catctcgatt gaccgcaaaa caccgcggga 679201 acaggcgtct atgcagtaag agacagctat gcgggcacgc aggttgcgca gagccctggc 679261 cgcgctcttg gcggtggcgg gtctgtttgt tccgttcatt gttggcgtgc ccacggccta 679321 cgacggtgag ccggtgttcg tcgccattcc ggtcgagcat gtcaatacgc tcatcggcac 679381 cggcacggga gccgcgatag tgggggagat caacaacttt cccggcgcct cggtgccgtt 679441 cggcatggtg cagtactcgc cggacaccgt cgacaactac gccggctacg actacgacaa 679501 cccgcattcc accggattca gcatgacgca cgcgtcggtg ggctgcccgg cgttcggcga 679561 catctcgatg ttgcccacga ccaccccgct cggctcgcag ccgtggagcg cctgggagga 679621 gatcgcccac gacgacaccg aggtcggcgt gcccggctac tacaccgtac ggttccccgg 679681 taccggggtg atcgccgagc tcaccgccac cacccgcacg ggcgtcggcc ggtttcgcta 679741 cccccgcaat gggtggccgg cgctgtttca cgtgcgctcc ggcgcatcgt tggcgggcaa 679801 ctacgccgcg acactgcaga tcgaggacaa caccacaatc accggctcgg cgaccagcgg 679861 cgggttctgc ggcaagaaga acctgtacac ggtgtacttc gccatgaagt tcagccagcc 679921 gttcagctcg tatggcacct gggacggcta cgcggtctat cccggttcac acagcatgaa 679981 ttcgagttac agcggggggt atgtcgggtt tccggccggc tcggtgctcg aggtgcggac 680041 cgccctgtcc tatgtgagcg tggacggggc gcgagccaac ctggacgccg aaggcggagc 680101 aagcttcgac gacatccgtg cggcgacatc gagcgaatgg aacgccgcgc tatcgcgaat 680161 cgcggtggcc ggcagggggc ctggcgacgt ggacaccttc tacacttgtc tttaccggtc 680221 actgttgcac cccaacacct ttaacgacgt ggacggacgt tacatcggat tcgacggtgt 680281 catccacagc gttgccagtg ggcacaccca ctacgccaat ttctccgact gggacaccta 680341 ccgcagcctc gccccactgc agggactgtt gttcccgcaa cgggccagcg acatgatcca 680401 gtcgttggtg accgacgcgg agcagagtgg tgcgtatccg cgttgggcgc tggcgaattc 680461 cgcaaccggc atgatgagcg gagacagtgt ggtaccgctc atcgtaaacc tctacgcctt 680521 cggcgccagg gatttcgacc tcaaatccgc gctgcactac atggtgaatg cagcgaccca 680581 gggcggtgtc ggacttgacg gtttcctgga gcggccggga atcgccgcct atctgaggct 680641 cggctatgga ccacaaacgg cggaattccg cgccaacggt cgtatcgccg gcgcctcggt 680701 cacgctggag tggtcggtcg atgactttgc catctcccga ttcgctgatt cgttgggcga 680761 taccgcaact gccgccgtct tccagaaccg gtcgcagtat tggcagaacc tgttcaatcc 680821 caccaccggc tatatctcgc cccggagcgc ggccggtttc ttccccgacg gtcccgggtt 680881 cgtggcatac ccctcgggct ttgggcagga cggatacgac gagggcaacg ccgaacaata 680941 cctgtggtgg gtgccgcata acgtggccgg tttggtgacc gcgcttggtg gccgcacggc 681001 cgtcgtcaag cggctcgacc gctttaccaa aaagctcaac gtcggcccca acgaacccta 681061 tctgtgggcc ggtaacgagc ccggtttcgg ggtgccctgg ctgtacaact acatcggcca 681121 accgtggaaa acccagcgga cggtcgaccg ggtccgcggg ctgttcggcc cgacacctgg 681181 cggtgcgccg ggcaacgacg acctcggcgc cctgtccagc tggtatgtct gggctgccct 681241 tggcctgtat ccgagcaccc cgggaaccac catcctgacc gtgaacacac cgcttttcga 681301 tcgcgccgtg atcgcgctcc ccaccggaaa gtccattcag atcaccgcgc cgggcgcatc 681361 cgggcggaac cgcctgaagt acatcgacgg cctgaccatc gaccgccaac cgagcaacca 681421 gacgtttctt ccggagtcga tcgtgcgcac cggaggcgac ctgaccttct cgctcgccgg 681481 cacacccaac aaggtctggg gaaccgcggc gtctgccgcg ccgccgtcat tcggtgcggg 681541 cagctcggcg gtgacggtaa atatcgcccg gcccatcatc gggatcgtgc cgggagcgac 681601 cgggaccgtg accgtcgacg cgcaacggat gatcgacggc gtcgacgact acactgtcac 681661 cccaacgtcc tacgttgttg ggattgcggc ggaaccgtta tccgggcaat tcgacgatga 681721 cggagccgtg agcgcgtcgg tcgcgatcac cgtagctcga tcggtgccgt cggggtatta 681781 cccgatctat gtcaccacca gcgccgggga tagtgcccgg acattgatcg tgctggtcgt 681841 ggtcgccgag gcggtggaat gatcattgcg caagcgcaga ggagttagat catttcgtgt 681901 ctggtcagcc agtgcatcac ctgccagccg gcgaataccg gtagccaaca ggtcaatagt 681961 cgatacagca gcaccgacgg cacacccaat gctgcaggta caccgaaggc ggcgagccca 682021 ccgatcagcg ccgcctccac cgcgccaacc ccgcccgggg tgggggcggc cgaggcgagg 682081 gtgccgccga ccatcgtcac cacggtcacc gtgacgaacg tcgttccgcc gccaaaggct 682141 tcgatactgg cccacagtgc caacgcagct ccgagcgtcg ttccggcaca accgagtacg 682201 atcaacgcca gtcgcttcgg ctcccgggcc aacgcaatga ggtcattcgt tacctccctg 682261 agcttcggcc gcaccgccgt cgctagccag cgtcgcagct tcggcacgaa gaggaatgtc 682321 ccgacaatgc ctagggccac accggcaatg aggtagagca ccgtggcatt cgggacgaaa 682381 tgagataggt cggtcgaggt gccggccagg gcgctgaaca ggatcagcag cacgaggtgg 682441 acgatcacct gtaccgactg ctgcagtgcc accgccgcgg tggcccgcac tgcggtcagc 682501 cctcccttct gcaagaaccg ggtactcaac gctagcccgc cgacgccggc cggggtagtc 682561 gttgcagcaa aagtgttggc tacctgcatg attgacagct tccagaagcc caccagccca 682621 tcagcgcagg cccacaacgc cgctgccgca ccgacatacg tcagcgccga caccgctagg 682681 cccagtagcg cccaccacca gttcgcggtt cgcagctggg aaaagaacgt gggcacggta 682741 ctgatgaaag ggtaagcgac atagaccaga gcaccgatta acaccagttg aatgagctgg 682801 ccgcggctga accgggtgat cgtttcggct ttgatctgat ccgcgcccgt ttgccgcatc 682861 acctcggcgc gtgtgctggc gatgaccgca tttgggtcgg ttatcgactc tcggattcgt 682921 tttggcacag cggatttggt aagtcttcgc gatgccgcca ggatggcttg cttgccgaac 682981 gtgtcaatgg ctgcggtcac ggcggcctcg gcgtcataca gcgccgacgt cgtcaccaag 683041 agttgggcca ggtcggattg gagttgggcg tcggtggcgc cgtactcggc ctcaccgaac 683101 ccgccgaaca gcaccgcgcc gttgtcgacg gtgatctcgg cactacacag gtccccgtgg 683161 gagatctgct ggtcgtgcag ggtccgtagc gcctcccaga catgggcagt cggcgtggtt 683221 ttggtgcatt cgctgatgcc gattccgcga gcgggccggt gtgcatacaa cgtccatccc 683281 cggtcgagcg gggacaccgc gatcaccgtc gtgttggcca tgcctagatc gccgaaggca 683341 atggccatca gcgcgcgatg ctcgaccgca cggcgcatgg aggcttgcag gggtgcggtc 683401 tcggtgccgc gcaacgtcag cttcagccag agttggcgca gcgcgccgcc gccactttgg 683461 tgcgggccgt acaactcgat caatgcctcg ctgcacgccc cggcgttggg ctgctcgcaa 683521 gcggccgaca gtaccagtgg cccgggcccg gccggccgca caaccgcgag cccggacacc 683581 gcgaatccgc gttttgccaa cgcgcgaatg gcaccatcca gtggcacttc aagcgctggt 683641 gtgccgacga ccaggaccac caacgcgccg accaaccacc ccaccgccag ccccaacaat 683701 gagcgggccg gcacaatcgc gctgacaacc agatggatcg gcacgaatgc caacagcagc 683761 gcccaccacc agtgccgcca gcgcgcgggc agccagggac ccgacacggt gagcaccgcc 683821 gcgagcatcg cgatccatcg cgggtcatcg agaaactggg ccagcaatgt ggcgagccgg 683881 tcggaaaggt caaagtgcca tcggggtgcc gcgatgcggc tactgctgat cgacaacggg 683941 agaacggcca taagtccggc ggccgcatac gcgcccagca gcttccactg ccgggaaacg 684001 atcaggccaa tcaggatcac gaacggcaac gccaaaatcg ccaggccgta ccccaggtac 684061 accagatcgg attgcgacgg ggacagcacc ccgacgatct ccgagatgga tttctccagc 684121 gccacccact gcgggcgggt gatcagcgaa ctcgtgatca ccgccacgag gtagatcgcc 684181 gccagcaccg cccggatgat gtcgttggtg cgccgggtca gtggttgcag caagttaccg 684241 gaaacgccga tgtcgcgtcc gtcaactcgc atgttctaac gatcttccga atcagggccc 684301 gcggtgtctg gtgccgtttc gcggctccgc ggacaactta gcccgataac tgcgtggggt 684361 gtcggtctga ccacttgacg tcttaccaat cttcattcac actgggcgca tggcgctgca 684421 gccggtgact cgccgatcgg tgcccgaaga ggtcttcgag cagatcgcta ccgatgtgct 684481 caccggcgag atgccgcccg gcgaggcgtt gcccagcgag cgtcggttgg ctgagttgct 684541 cggagtgtcg cgacccgcgg tccgcgaggc gctcaaacgg ctgtcggccg caggtctggt 684601 cgaggtgcgt cagggcgacg tcaccaccgt gcgtgacttc cggcggcacg ccggcctgga 684661 tctgttgccc cgattgttgt ttcgcaacgg tgagctggat atctccgtcg tccgcagcat 684721 cctcgaggcc cggctgcgca attttccgaa ggtcgcggaa ctagcggccg aacggaacga 684781 gcccgagttg gcggaattgc tgcaggattc gctgcgtgcg ctggacactg aggaagatcc 684841 gatcgtgtgg caacgccaca cgctcgactt ttgggatcat gtggtcgaca gcgccggttc 684901 gatcgtagat cgattgatgt acaacgcatt tcgtgctgct tacgagccga cgctagctgc 684961 tctgaccacc acgatgaccg ctgcggctaa gcgtccgtcg gactaccgga aactcgcgga 685021 tgcgatctgc tcaggtgatc ccaccggagc gaagaaagcc gcccaagacc tactcgaact 685081 tgcgaacaca tcgttgatgg ccgtactcgt tagccaggcg agtcggcaat gaccacccac 685141 gccgtgatca tcacctatct ccgcgaccag acgcagcccg ccgtcgatgc gatcggcggg 685201 ttctaccgga catgcgtact gactggcaag gcgctggttc ggcggccctt ccattggcgt 685261 gaggcgatcg agcagggctg gttcattacc agcgtctcgt tgctgccaac cctggcggtg 685321 tcgattccgt tgaccgtgtt gatcatcttc acgctcaata tcctgctggc cgagttcggc 685381 gccgccgaca tctccggcgc cggcgcggcg ctaggcgcgg tcacccagct gggcccgctg 685441 accaccgtgt tggtgattgc cggcgctgga gccacagcga tctgcgccga cctgggtgcc 685501 cgcaccatcc gggaagagat cgatgcgatg gaggtgctgg gcatcgaccc catccaccgg 685561 ctggtggtgc ctcgggtcgt tgccgcgacc atcgtcgccg cactgcttaa cggcgcggtg 685621 ataaccattg gcctggttgg tggtttcgtc ttcagtgtct tcatccaaca cgtctcggcc 685681 ggcgcctacg tgggcacgct caccttggtc accggtctac ccgaggtgat catctcggtg 685741 gtcaagtcgg cgacgttcgg cctgatcgct ggcctagtcg gctgttaccg cgggctgacc 685801 acgaaaggcg gccccaaggg agttggaacc gccgtcaacg aaaccctggt gctgtgcgtg 685861 atcgcgctgt tcgcgaccaa tgtggtgttg accacgatcg gcgtgcggtt cgggacggga 685921 cactagcatg gtggagtctt caacggcatc agcggcagcc gtattgcggg cccgctaccc 685981 acgcacagcc gccagccttg accgctacgg cggcggcacg gcccgaagac ttgagcggac 686041 agggactttc gcgagattca cccggatcag cgtcgtgcag atcggctggg cactgcgtcg 686101 ctatcgccgg gagacgctgc gcctggtcgc cgagatcggg atgggcaccg gcgcgatggc 686161 cgtcgtcggc ggcacggtcg cgatcatcgg ttttgtgacg ctgtccggcg gctcgctgat 686221 cgccatccag ggcttcgcgt cgctgggcaa catcggtgtc gaggcgttta ccggattctt 686281 tgccgcactg gccaacacac gcgtcgctgc gcccattgtc tccggtgtcg cgctggccgc 686341 gacggtgggc gccggcgcca ccgcacagtt aggtgccatg cggatcagtg aggagatcga 686401 cgcgctggaa gtgatgggca tcaagtcgat ttcgtttctg gtctccactc ggattctagg 686461 agggctggtg gtgatcatgc cgctgtacgc gctcgctctc gacatggctt tcacctctgg 686521 tcaggtggtc acaaccgtgt tctacggcca gtccaacggc acctatgagc actacttccg 686581 caccttcctg cgcccagagg atgtgggttg gtcggtcgtg gaggtggtga tcatcgcggt 686641 ggtggtgatg atcacccatt gctactacgg gtacaccgcc agcggtggcc cggttggggt 686701 cggccaggcg gttggtcgat cgatgcgttt ctcgctggtc tcggtggtgg tcgttgtcct 686761 gctggccgag ttggcgctct acggcgtcga cccgaacttc aatctcacgg tgtagccgcg 686821 gtgccaacgc tggtgacgag gaagaaccga cgtgcgtggc tgtatgtgga gggtgttgtc 686881 ctgctgttgg tgggcgcgtt ggtgctcgta ttggtgtaca agcagtttcg tggggaattc 686941 acgccgaaga ccgagctgac tatggtcgcc ttccgggctg ggctggttat ggaagctgga 687001 tccaaagtca cctacaacgg ggtggagatc ggccgggtgg gcagcatttc ggagattgag 687061 cgtgacggcc ggccggcggc gaagctggtt ttggacgtga atcctcgcta catcagcctg 687121 attccggtca atgtggtggc cgatatcgag gcggccaccc tgttcggcaa caagtatgtt 687181 gcgctgtccg cgccgaaaat tcctcaacag cagcggattt cctcacatga cgtgattgat 687241 gtggggtcgg tgaccaccga attcaacacg ttgttcgaga cgatcacctc gatcgccgag 687301 aaggtggatc cgatcgagct gaacgcgacg ctgtccgcgg tagcacaggc gctggatggg 687361 ctgggcggca agttcggtga gtcgatcgtt aatggcaatc agattctggc gcaattaaat 687421 ccgcggctgc cgcagctcgg ctatgatgtt cggcggttgg cggatctcgg tgaggtctat 687481 gtcgatgctt cgccggatct gtggtccttt ctgcagaacg cactgaccac tgcgcgcaca 687541 ttgaccagcc aacagcgcga tctggatgcc gcgttgttgg cggctacggg tgcgggcaac 687601 accggtgaag acgtttttgc tcgaggcggg ccgtatcttg cgcgcgcagc cgccgatctg 687661 gtgcccaccg ctacgctgct ggacacctac agtcccgaac tgttctgcat gatccgcaac 687721 tttcacgacg ctgcgcccaa agtcgcggac gcggtgggcg gcaacggcta ttcgctagcg 687781 gccgccggaa cgattttggg agcacccaat ccctatgtct atccggacaa tctgccgcgg 687841 gtgaatgccc acggtggacc cgggggccga ccgggctgct ggcagacgat cacccgggag 687901 ctgtggccgg caccctatct ggtgatggac accggtgcca gcctcgcacc gtacaaccac 687961 gtcgagctcg gccaaccgat gttcactgaa tacgtatggg gacgccaata cggagagaac 688021 acgatcaacc catgaaaacc acaggcacaa ctatcaaact cggcatcgtc tggttggtgc 688081 tgtcggtgtt caccgtgatg atcatcgtgg tgttcgggca ggtgcggttc catcacacca 688141 ccgggtactc cgcggtgttc acccatgtca gcgggctgcg ggccgggcaa tttgtccgcg 688201 ctgcgggcgt agaggtcggc aaggtcgcca aggtaacgct gatcgacggg gacaagcaag 688261 tattggtgga cttcaccgtg gatcgctcgc tgtcactgga tcaggcgacg accgcctcga 688321 tccgctacct caacctgatc ggcgaccggt accttgagct cggccgcggt cacagcggtc 688381 agcggctggc gccgggtgcc acgatcccgc tcgagcacac ccatccggcc ttggatctcg 688441 acgctctgct cggcgggttt cgcccactct tccaaacgtt ggacccagac aaggtcaaca 688501 gcatcgcctc ctcgatcatc accgtgttcc aagggcaagg cgccaccatc aacgacatcc 688561 tcgaccagac cgcctcgctg acggcaacgc tggccgaccg ggaccatgcg ataggtgagg 688621 tcgtcaacaa cttgaacacc gtgctggcca ccaccgtcaa gcatcaaacg gaattcgacc 688681 gcacggtcga caagctagag gtgctgatca ctggactgaa gaacagggcg gacccgctgg 688741 ccgcggcggc ggcacacatc agcagcgccg cgggaaccct agccgacctg ctggggcgga 688801 tcgtccattg ctgcacagca gcttcgggca cctcgagggc atccagcagc cgctcataga 688861 cgagctggca gaactcgacc acgtgttggg caagctgccg gacgcctacc ggatcatcgg 688921 ccgcgccggc ggcatatacg gtgacttctt caacttctat ctgtgtgaca tctcactgaa 688981 agtcaacgga ttacagcctg gaggtccggt acgcaccgtc aagttgttcg gccagccgac 689041 cggcaggtgc acaccgcaat gagaacgctg accgagttca accgcggccg tgtcgggatg 689101 atgggtgcgg tggtcacggt gctcgtcgtt ggtgttgcgc aaagcttcac cagcgtgccg 689161 atgctgttcg ccacacctac ctactatgcg caattcgccg acacgggtgg catcaacacg 689221 ggcgataagg tggaaatcgc tggggtgaac gtcgggctgg tgcgctcgct ggcaatccgc 689281 ggcaaccgcg tgttgatcgg attctcgttg cccggcaaga caatcgggat gcaaagccgg 689341 gcagcaattc gcaccgacac cattcttggc cgtaagaacc tggagatcga accccgcggt 689401 tcggagccgt tgaaacccaa cggtttcctg ccgttggcgc agaccactac gccataccaa 689461 atctatgacg cgttcgtcga tgtcacgaag gcggcgacgg gctgggacat cgatgccgtc 689521 aaacgctcgc taaacgtgtt gtcggagaca ttcgatcaga ccgccccgca tctaagtgcc 689581 gccctcgagg gtgtcaaggc attctccgac accgtcggcc ggcgcggcga gcagatcgag 689641 caactgctgg cgaacgccaa caggatcgcg cgcgtgctcg gcgaccgcag cgagcaggtc 689701 aacgggctgc tggtgaatgc caagacgctg ctggccgcgt tcaagcaacg cagccaggca 689761 ctgcgcattc tgctaaccaa cgtgtcggag gcatcagccc aggtatctgg cctgatcaca 689821 gacaacccca acctcaacca tgtgctggcc cagttgcgca cggtcagcga ggagctggtg 689881 aagcgcaaga acgaattggc cgatgtagcc gtcttgctcg gcagatacac cgcggccctg 689941 acagaggccg tcggttccgg accgttcttc aaggcgatgg tggtcaatct gctgccctac 690001 cagattcttc agccctgggt tgacgcggcg ttcaaaaagc ggggcatcga cccggagaac 690061 ttctggcgca gtgcgggtct gccggaattc cgctggcccg accccaacgg cacccggttc 690121 cccaacggcg cgccgccggc ggcgccaccg gtgcgggagg gtacacccaa gcatccggga 690181 ccggccgtcc cgccgggaac gccgtgctcc tacacaccgg cggcgggcgc gttgccacgg 690241 cccgacaccc cactaccctg cgcgggcgcc accgttggcc cgttcggtgg acccgacttc 690301 ccggcaccgc tcgatgtcca gccgtcgccg cctaatcccg atgggccgcc gccgacgccg 690361 ggcatcctaa gtgctgggcg gccgggcgag ccggctccgg ctgttccggg cataccgatg 690421 ccgctgccgc cgaacgcgcc gccgggtgca cgcacccaac cgcttgagcc gtttcctgac 690481 gggacgggag gtagcaacca atgagcacca tcttcgacat ccgcagcctg cgactgccga 690541 aactgtctgc aaaggtagtg gtcgtcggcg ggttggtggt ggtcttggcg gtcgtggccg 690601 ctgcggccgg cgcgcggctc taccggaaac tgactaccac taccgtggtc gcgtatttct 690661 ctgaggcgct cgcgctgtac ccaggagaca aagtccagat catgggtgtg cgggtcggtt 690721 ctatcgacaa gatcgagccg gccggcgaca agatgcgagt cacgttgcac tacagcaaca 690781 aataccaggt gccggccacg gctaccgcgt cgatcctcaa ccccagcctg gtggcctcgc 690841 gcaccatcca gctgtcaccg ccgtacaccg gcggcccggt cttgcaagac ggcgcggtga 690901 tcccaatcga gcgcacccag gtgcccgtcg agtgggatca gttgcgcgat tccatcaatg 690961 ggatcctccg ccagctcggc ccgacggagc ggcagccgaa ggggccgttc ggcgacctca 691021 tcgaatcggc cgcggacaac ctggccggca agggcaggca gctcaacgaa acgctgaaca 691081 gtttgtcgca ggcgttgacc gcgctgaacg agggccgggg agacttcgtt gcgatcacgc 691141 gaagcctggc gctatttgtc agcgcgctct accagaatga tcaacagttc gttgcgctca 691201 acgaaaacct tgccgagttc accgactggt tcaccaaatc cgaccatgac ttggccgaca 691261 cggtggaacg gatcgacgac gttctcggca ccgtccgaaa gttcgtgagc gacaacagat 691321 ccgtgctggc tgccgatgtc aacaacctcg ccgacgcgac cactacacta gtgcaacccg 691381 agccgcggga cggtctggaa accgcgttgc acgtgttgcc gacctacgcc agcaacttca 691441 acaaccttta ctatccactg cacagctctc tggtgggcca gttcgtgttc cccaacttcg 691501 cgaacccaat tcagctcatt tgcagcgcta ttcaggccgg cagccgactc ggctatcagg 691561 aatccgccga gctgtgcgcg cagtacttgg caccggttct ggacgctctc aagttcaatt 691621 acttgccgtt cggctcaaac ccgttcagtt cggcggccac tttgcccaag gaggtggctt 691681 actccgagga gcggctccgc ccgccgcccg ggtacaagga caccactgtc ccagggatct 691741 tctcgcggga cacaccgttt tcacacggca accatgaacc gggctgggtc gttgcgcccg 691801 ggatgcaggg tatgcaggtt cagccgttta ccgcgaacat gctcaccccg gaatcgctgg 691861 cagagctgct gggtggtccg gatattgccc ccccgccgcc gggaaccaac ttgcccggac 691921 cgccgaatgc gtatgacgag tccaatccgt tgccgccgcc gtggtacccg cagcccgcgt 691981 ccctcccggc tgcgggcgcc acaggacagc caggcccggg ccagtgaggt gcggcgtgag 692041 cgcgggtagc gcgaacggca agccgaaccg ttggaccctg aggtgcggcg tgagcgcggg 692101 tcaccgtgga tcggtgttct tgctggcggt cttgctggcc ccggtggttt tgacttcgtg 692161 tacctggcgt ggcatcgcca atgtgccgct gccggtcggc cggggtatgg gtccggatcg 692221 catgacgatc tacgtgcaga tgcctgacac gctggcgctg aacactaaca gccgggtcag 692281 ggttgccgac gtctgggtcg gtacggtgcg tgacatcagc ctgaggaact ggatcgcgac 692341 cctgacgctg gagctcgagc cgaccgtgcg gctaccggca aatgcgaccg cgaagatcgg 692401 ccagaccagc ctgttaggca cacaacatgt cgagctggcc gcaccgccaa tcccgtcacc 692461 gcagccgctg aaaagcggcg acaccatcgg cctgaagaac tcctcggcct accctaccgt 692521 cgaacggacc ttggccagcg tcgcgttgat cctcaccggc ggcggcatcg tcaacctcga 692581 cgtgattcaa accgagatcc tcaacatcct tgacggccat gccggtcaga ttcgcgaatt 692641 cctcgagcgg ctagccactt tcaccgccga gctgaacaac caacgcggcg atctgactcg 692701 cgcaatcgac tcaaccaacc aactcctgac catcatcgcc aaccgcaacg acacgctgga 692761 tcgggtgctc actgacgtcc caccgctgat cgagcatttc gccgacaccg gtcagctgtt 692821 cgctgacgcc accgaatcct tggggcggtt cagcgaagtc gccaaccggg cgctggcggc 692881 tacccggcct aaccttcacc agacgctgca gtcgttgcag cggccgttaa ggcaattgga 692941 acgggcttcg ccgtatgtgg tcggcgcgtt gaagctaggc ctcaccgctc cgttcaacat 693001 cgacgaggtg ccaaacgtta tccgcggcga ctacgtcaac gtgtccgcga cgttcgacgt 693061 gacgctttct gcactcgaca acgcactgct gagcggaacg ggcatctcgg gaatgttgcg 693121 tgcgctcgag caggcgtggg gacgggatcc ggacaccatg atcccggatg tccgctacac 693181 gccgaacccg aatgacgcgc cgggcggacc gctggtggaa agggctgagt gaggagatgc 693241 tgactcgcgc tatcaagacc cagctggtgt tgttgacggt gttggcggtc atcgcggtgg 693301 tggtccttgg ttggtatttc ctgcggatac ccagcctggt cggcatcggt cgatacacgc 693361 tttatgccga attgcctcgg tccgggggtc tataccgaac agccaacgtc acatatcggg 693421 gcatcaccat agggaaggtc accggcgtcg aaccaaccga gcggggcgcg cgagcaacca 693481 tgagcatcga caatggctac cagatcccca ccgacgcctc ggccaatgtg cactcagtgt 693541 cggcggtcgg cgagcagttc gttgacctgg tgtcgacccg caccagcggt ccgtatctgc 693601 ggcatgggca gacgatcacc acgactacgg tccccagcca gattggcccg gcgctggacg 693661 ccgccaaccg tggattggca gtgctgccca aagaccgggt cgcgtcggtg ctgcacgagg 693721 cgtcggaggc cgtgggcggg ctgggatcct cactgaatcg cctcatcgaa gccacccagg 693781 caatcgccca cgatgtcagg ggcagcctcg aggacatcga cgacatcatc gagcgttcgg 693841 cgcctatcat cgatagccag gtcaattccg gcaacgagat cgcccgctgg gccgccaacc 693901 tcaacacgct ggccgctcag accgcgcaga ccgatccggc ggtgcgaagc attctggcca 693961 acgcggcacc gactgccgat caggtcaacg ccacgttcag cgacgtgcgg gagtcgttgc 694021 cgcagacgct ggccaatctc gaggtcgtaa tcgatatgct caagcgctac cacaacggcg 694081 tcgagcaggc gttggtgttc ttgccgcagt ccggcgcgat cgcccagtcg gttactacag 694141 agttccccgg ccaggccgga ctgggtgtcg gcggcctggc gctcaaccaa ccaccgccgt 694201 gcctgaccgg cttcctgccg gcgtcggagt ggcggtcacc tgctgacacc agcaccgcac 694261 cgctacccaa gggcacctac tgcaggattc cgatggacgc gagcaatgtg gttcgtggag 694321 cacgcaacaa cccgtgtgta gacgtgcccg gcaagcgggc ggcgaccccg cgggaatgcc 694381 gcagcaatga agcttatgtg cccgggggca ccaatccctg gtatggggac cccaaccaga 694441 tgctcagctg tcccgcgccg gccgcgcgtt gtgaccagcc ggtgaagcca ggccaggtga 694501 tcccggcgcc gtcagttaac aatggcatca acccgctgcc cgccgatcag ctgccaggca 694561 cacctccacc ggtcaacgat cctttgcagc gacctgggtc aggcaccgtc cagtgcaatg 694621 ggcaacaacc caacccgtgc gtctacaccc cgagcacatt tcctacaacc atttacgacg 694681 tgcagagcgg caaagtcgta gcacccgacg gtgtggtgta ttccgttgag gcttcgactc 694741 atgccggagc cgacggatgg aaggtgatgc tggcaccaac cggctgagcc ggcgcgatca 694801 ggtaccggcg gattcgcgct ggtcaagaaa ggcaaccgtc agatcgttat gacctcgacg 694861 tcgggcatgg cggcgtagtc gttgtcttgg gtcaggatcg caatgccgtg cgccacggct 694921 gtggccgcaa tccagctgtc gttgatcggc acgcgcagtt tggcggcgcg cagcttggac 694981 accagtaatg cccatgcttc ggagaccgcc tcgtcgatgc ctagtggttc gaaccgttgc 695041 gcaagctggt aggtggagag ccgacgtgcg gcggcctcgg ggccggaggc ttgcaacacc 695101 ccgagccgca gctcgccgag tgtgactacc gagacgcccc attcgtatcc cgcaaaccgg 695161 tccgggtcga atcgtgtcgc ctcgatgcca atgaaaacgg atgtgtcggc gagggcgcgc 695221 cgtacgttca ccaccgcaca tcgtccgtgg tttgcgtcag cgtctctcgc agctcctcgc 695281 ccagattggt ggtatcgggg cccaagcgca ccagttcgcc gatcacctcg gcagctggca 695341 accattggcg gcgccgcttg agcggaacga tgcgcgctac ggggcgattg tccttgagca 695401 cctcgatttc ctcgccggcg gcaactcgcc gcagtacctc ggcggtgtgg ttgcgaagat 695461 cgcgagcggg tatcgtagca gacatgctac gagtgtagcg gagctgctgt cgcgccgcct 695521 cgtctcgatg tctgcggtca cgatctccgc aggttacggc cgctgctgtg cccgcagtcg 695581 cccgcgatgg tgggcccgtc ggggtagatt gcgagcgcgc ccggacggag gccgccgatg 695641 ccgaagtgcc gtgatttgtt cgaagagtta gccggcccag agcgtgctaa cgggtaacgc 695701 cgcgagcgtt gggccgaacg gggtggcttc cggcgcggcg gtgagcagca cgccgaactg 695761 aaatcgatca tcgagccgct cggccagata acgtaggcca cgcaggtcct cggctcgcgg 695821 cgtgctggtc gccttgacct cgatgccgca gacccgacca tcgggatgtt cgagcaccag 695881 atcgacctcg gcgccgccgc ggtcgcgaaa atgccacaga ctcggccgtt cggtcgacca 695941 ggtgagctgt ttgcgaatct cgttcgccac gaaagtctcc agtagcgggc cgagtggacg 696001 gccggggcga tccagcgtcg caccggtaac gccgagcagg tgacacgcca ggccactgtc 696061 cgagaccacc agtttcggtc ggcgaatcac cttgcggctc aggttggtcg accaggccgg 696121 cacccggtgg ataaggaacg ccgcttccag cagggccaga tagccagcgg tggtgcgagc 696181 cgggatcgac aggtcgttcg ccagtgcgct cacgttgagc tcggcgccgg tacgcgcggc 696241 gcagagccga agcacacgcg gcatttcggc aagccgctcg atcggcgaaa tctcgcggat 696301 caccgactgc gtcgccgtcg tgagatagtt gtcgaaccac gcgcgacgcc tcgacggcga 696361 tcgggcgacg atgtccggga agcctccggt ggcgatcctg tcgaccagat cggcgcggcg 696421 catatcggag ccgtggatca gctcgcgtgg tgcggtgaac agcgcatcga cgaaaccgtc 696481 cgcgattccg gcccgctcac cttgcgagaa cggccagagt tcgatgattt cgacccgccc 696541 gacgagcgcg tcggccatgt caggagccga gagcagcctc gctgaacccg tgagcaggaa 696601 cctgcccggc ctgcgatccc ggtcgacctc tgccttgatc gcccgaaaca gccccggctc 696661 gagctgggct tcgtcgatga cgagcgtgtc caccggccgg gatacgaatg cgcggggatc 696721 gtcgcgggcg gcgtcgcggt tggcgacgtc gtcaagcgag acgacttcgc tggatcccgg 696781 atagtcaagt cgcgcgacca gtgttgtttt gccgacctga cgcgcgccgt tgacaacgac 696841 gaccggggtg tcggcgagcg cggccagcac cgagggcgcg atcgcgcgtt cgacgactcc 696901 catgcggaca gaatacgctg ccgatttgtc tacctattgg ctgccgattc gtccccatta 696961 gcggtgcgga ttagtccaca tcatcgctgc ggatccgtcc gacggcggcc ctgagccacg 697021 tggccgacga agaacgctcg gaggcgctgc tgcgggagcg ctcgctgcac tacgtggcca 697081 gtagccgggc gggggacatg ttggtggtga cctggagcgg acagcggtcg gagttgttga 697141 gtcagctgaa gattcacgcg gcgacaacga cgtggacgcc gatcttttcg taagtgtcct 697201 tggctcgagc atcgcgtgtg gcgagttcag cccggtgctc ggcggcggca agggccacga 697261 gagcgtcata gaccgcgcca ccagtgatct cgaattgggc cagcacgcgt gggagatgtt 697321 cagtggtgcg ggaactcaac aacagcggtg ccgcaaagcg ttcggtaaga agccgcgcgg 697381 cgtccatcgg tgccagtcgt aggtcacgcg gcaggcgggt cagcacggag taggtttcgg 697441 ccagggcgtg cccgcacagc gcggcctccc gatgtgccca ccaggcgaca accgccgcat 697501 gcgcggtatg ggtccgtacc agcaacggaa tcgcgacgct ggtgtccact gccagcggcg 697561 gtttcacttc cggccgctat cgataaggcc gaacacgacc tcatcgtcga tcgtggtctc 697621 accggtggcc accagtacgc cattctcctc ttcgagacgc gctgttcgtc cggtgggaat 697681 caggtggaga ccagcgccat agcgggatat ttccacggtg gaccccggtt gcagccccaa 697741 ggcttcgcgc agcggtttgg gtacgacgat gcggccagcc gcatccacaa cagccttcat 697801 gggaatacga taccaatggc ttcccactca ggtgcggaag tcgactcacc gccgttacca 697861 cgaccccgac gaccacaccg tcagctgcgg cgcggcgtgg cgactattgg tcgctagtgt 697921 cgtgctcccg attcgggtcc tttgtatctg atgacgtgat ccgcggaagc tcctcgtgga 697981 agggcggccg cggtgtgggg agggtgatgc ggagttcggc cccgccgtcg gggtggttgg 698041 tggcgttggc atgtccgccg tgggtggtgg tgagggcggc gacgatggcc aggccgaggc 698101 cgctgccgcg accgccgcgg gcggtgtcgg cgcgggtgaa tcggtcgaag gcgacgggaa 698161 gaaagtggtc ggcgaatccg gggccgtggt cgcggacgcc gatgtcgact gcaccgtcgc 698221 gggcgtgcgc ggtgacagcg atttcaccgt ccccgtgggt gatggcgttg tcgagcacgg 698281 cggtgaggat tcggcgcagg tggtccggat cgatcgagac gaacaggtcc ggttccgcgc 698341 gtgtggtgat gtccgctcca gtagcggcga agcgggccac gctctcgtgc agcaggggag 698401 tgatcggcac cgctttggcg gaggggtggg attcggggcg gtcggcgcgg gccagggtga 698461 gcagttggtc ggccagtccg ctgagccggc gggtttcttc gagcgcggag cgcagggcgg 698521 cgctcagctg gtcggcgggt ctgggccggc gcagcgcagt tcgagttcgg tggtcagcag 698581 tgccaacggg gtgcgtaatt cgtggctggc gtcggcgacg aactgttgtt cgtgggcgag 698641 ggcccgttgc agtcgggtga gcatggtgtt gagagtcgtt gctagccaag cgatctcgtc 698701 gtcggtggga ggtaccggca gcggcgcgtc ggtgtcgggg tgcggcgtgg tggtcagtgt 698761 ttgcgccgcc gcgcggatcc ggtcgacggg ccgcagcgcg gcgcggctga gcaggtaggc 698821 ggccaccgcg gcgatgacga gcacgatcgg caggatggtc accaattccc ggaccagatc 698881 ggcggtgatg tcgtcggtga gcccgcgcag cgcgccatcg gggtcggctt cgtgagcggc 698941 gtcgcggaac tggacgacgg tgacggcgcc ggctgctgcc agaacgagcg ccatggcggc 699001 gctgaagacg agggtgagtc gccatcggat gggccactca gcgggggagc gcatgccgtc 699061 ctccgtcctt gcgcagccgg tatccggcac cgcgaatggt ttccagcgag gtgacgccga 699121 agggccggtc gatcttgtcg cgcaggtagc ggatgtagac gtcgacgatg ttggagcggg 699181 cctcgtaggc ggcgtcccag cagcgttcca gcagctgggc gcgggtgtgg acgatgccgg 699241 gacggcggat cagggcttcc agcagggtga attccttgtg actgagccgg atttcggtgt 699301 cggcacgcca gactcggtgt tcgctcgggt ccaggcgtag atcgccggcc tccagcgtcg 699361 gtgggcgtgg gatgggcccg cgccgtgaca gcgcgcgcaa ccgggcgaac agttcgtcga 699421 ggttgaacgg tttggtgagg taatcgtcgg cgccgccgtc taggcccgcg atgcggtcgg 699481 tgaccgcgcc gcgggcggta agcatcagca ccggtgtcca cacccgctgc cgtcgcagcc 699541 gcgcgcatac ctcgaacccg tcgataccgg gcagcatcac atccagcacc accgcgtcgt 699601 agtcaccgcc gtcgacggcc gccaccgcat ggcggccgtc ggcaacggtg tcgaccgtgt 699661 ggccctcctc ggtcagcgcc cgcgccagca gcgccgtcat cttgggctcg tcctcgacca 699721 ccaggatgcg cacacccgac accctgccgc atgcccggcc cgggccgcga ccagctctca 699781 tcgtcgtttc atctgccacc cctaccgtcg gagccgcaca ccgtcacagc gaggtagaca 699841 gatcaggaga aagcgatgaa tcgcatcgtg cagttcggag tttccgccgt ggccgcggcg 699901 gcgatcggca tcggagccgg gtcggggatc gcggcggcgt tcgacggcga ggacgaggtg 699961 accggccccg acgccgaccg cgcgcgcgcc gccgcggtgc aggcggtccc gggcggcacc 700021 gccggagaag tcgagaccga gaccggcgaa ggcgccgccg cctacggcgt gctggtcacc 700081 cggcccgacg gcacccgtgt cgaggtccac ctggaccggg atttccgggt tctggacacc 700141 gaaccggccg acggggacgg cggttagcat cggcgcatgc ccgcaccggg ccaccgatag 700201 cctccgggtg cgcaccgatg agatctagcg aggagaccat gatcaggcga cgaggcgccc 700261 gtatggccgc gctgctggcg gcggccgcgc tggcactgac cgcatgcgcg ggcagcgacg 700321 acaagggcga acccgacgac ggcggggacc ggggcgcatc cttggccacc accagcgatg 700381 cggactggaa gccggtggcc gacattctcg gccgaaccgg caagctgaac gatggcagcg 700441 tctacaaaat cgggtttgcg cgctcggatc tgagcgtgca gaccaagggg gtgaccgtcg 700501 cccccgcgct gtcactcggg tcgtgggtcg cgttcgcccg cacccccgac gggcagacca 700561 tgctgatggg agatctggtg gtcaccgaag acgagctggc ctcggtgacc gacgccgtgc 700621 aggccggcgg cctgcagcag accgcgctgc acaagcacct gctcgagcag tcgccgccga 700681 tctggtggac ccacatcgcc ggccacggcg acgccgccga cctggcccgt gcggtccggt 700741 cggcgctgga tgccaccgac acaccaccgc ccgcctcggc aacttccggc cagaccagct 700801 tggacctgga caccgcggcc atcgatgagg cgctgggccg ctccggcacc atcgcgggcg 700861 gggtgtacaa attcttcatc gcccgccgcg atccggtcac catgtccggc atgctcatcc 700921 ccccgtccat gggtctggct accgccctca acttccagcc caccggcaac ggccgcgcgg 700981 cgatcaacgg cgatttcgtc atgaccgccg ccgaggtcca agacgtcgtc caagcactgc 701041 gcggcggcgg aatcgacatc gtcgccatac acaaccacgg gttcgacgaa caaccacgcc 701101 tgttctacat gcacttctgg gccgagaacg acgccgtcgc actcgcccgc acgctacgcg 701161 ccgcggtgga cgccaccgcg gcccggtgac cccgcgcccc ggcgcatacc gacccgccgc 701221 gaaccaccgg tggcggacgt ggtcatgcag gcgtcgtgcg atgacgtcct cgttcaatgg 701281 gccatgttcg gccgggatcc tcgccacggc acggtcgcat ggaacgcttc ggccacggtg 701341 gccaccctat gccgcgtcga gccggggctg ccaactgttg cgcggtgagt ggtcggtagt 701401 tgtcggtggc gtgctgtagg aacagaggta tgaatctcgc ggcgtgggcc gagcgcaatg 701461 gcgtcgcgcg ggtgaccgcg tatcgctggt tccacgctgg gctcttgccg gtcccggccc 701521 ggaaggttgg tcgactcatt ctggtcgacg agctggctag cgaggctggc gcgcagccaa 701581 agactgcggt gtacgcgcgg gtgtcgtcgg ctgatcagaa gtctgatttg gatcggcagg 701641 tggcgcgggt gacttcgtgg gccacagccg aacagatccc ggtcgacaag gttgtcaccg 701701 aggtcgggtc ggtgctcaac gggcaccgac gtaagttccc tgcggtgctg cgcgatctgt 701761 cggtcacgcg gattgtggtt gagcatcggg atcggttctg ccggttcggt tcggagtatg 701821 tccacgctgc gctggccgct cagggtcggg agttggtcgt ggtggactcg gccgaggttg 701881 acgatgacct ggtatgggat atgaccgaga ttctgacctc gatgtgcgca aggttgtatg 701941 gcaaacgtgc tgctcagaac cgggccaagc gggccgtcgc ggctgccgct gtcgatgatc 702001 atgaggcggc ctgagatgcc gcgtttggag atccccaacg gctggtgtgt gcaagcgttc 702061 cggttcacac tcgatccgac cgccgagcag gcacacgcgt tggcgcggca tttcggcgcc 702121 cgccgcaagg cctacaactg gaccgtcgcg cagctgaaag ccgatatcca agcgtggcgc 702181 gcgaccggcg cccagacggc gaagccgtcg cttcgggtac tgcggaaacg ctggaacacg 702241 gtgaaagacg aggtgtgtgt caacgccgag actggcaccg tgtggtggcc ggaatgctcg 702301 aaagaggcct acgccgacgg gatcgcgggc gcggtcgacg cgtactggaa ctggcagcag 702361 aggcgtgctg gcaagcgcga cggcaagaga atgggcttcc ctcgattcaa gaagaagggc 702421 cgcgacgccg atcgcgtgtc gttcaccacg ggtgcgatgc gcgttgagcc cgaccgtaga 702481 cacctcactt tgccggtgat cggctgcgtg cgtacgcatg agaacacccg ccgcatcgag 702541 cgcctcatcg ccaaagaccg ggcgcgggtg ctggcgatca cggtgcgccg caacggcacc 702601 cggctggatg cgagtgtgcg ggtactggtg cagcgccccc agcaacccaa cgtggaactg 702661 cctgagtcgc gaatcggtgt cgacgtgggt gttcgtcgtc tggccacggt cgccaccgcg 702721 gacggcgcat gctgcccggt cctggtgcca gacggctaac gctgggcatt atccccgagg 702781 gcggcgccca tatcgacgtg ccccgaaaga ccgtgggcgc ctggcaaaca gccgacacca 702841 tgggcatctt ccaggccctt cccgacgtct ggggcgggtg gcggaccgaa tgctgggaag 702901 accgcttcga agagcagctg attcgatgca acggggcgct gcggcttccc gagctggatt 702961 tggccgcggg catggacagc gcccgggagt ggctccgtga caggatattt cagcgcttct 703021 cggacagccc ggcaggccaa attctgaaac tctccgagct gctggccgat gtcggacccg 703081 gtctggtcgt cagcgacgat gccgtgacga atggcggggc tcgcccaaac aacgaagagt 703141 gggcgcgttt cgttgcggcg tgcgatctgg tgcgtggggc tcacgccgaa tcggcctgac 703201 ttcggggata gtggtaccat cactttggta gaagggtact aacatggcgt tgaacatcaa 703261 agatccgtcg gttcaccagg cggtcaagca gatcgcgaaa atcaccggcg aatctcaggc 703321 tcgggcggtg gcgaccgcgg tgaacgagcg tctggccaga ctgcgcagcg acgatctcgc 703381 cgcccggctc ttggctatcg gccacaagac cgcgagcagg atgagcccgg aagcaaagcg 703441 cctcgaccac gatgctctgc tgtatgacga gcgagggctg ccggcgtgat cgtcgacacg 703501 tcggcgatca tcgcgattct gcgcgacgag gacgacgccg cggcctacgc cgacgcgctc 703561 gccaacgccg atgtccgcag actgtctgcg gccagctacc tggaatgcgg gatagtcctt 703621 gactcccagc gtgatccggt catcagcaga gcactggatg aacttatcga agaagccgag 703681 ttcgtcgtcg agccggtaac cgagcgccag gcccgcctgg cccgagcggc ctacgcggat 703741 ttcggcagag gcagcggcca ccccgcgggc ttgaatttcg gcgactgcct gtcctatgca 703801 ctggcgatcg atcgacgtga gccgctgctg tggaagggca acgactttgg gcacaccggc 703861 gtccaaaggg cactggatcg gcggtgatcg acgtcagcct ggcgcggcgg tgcgaggctc 703921 acgggtacga ctattttcgt tccgacgatc cggtggcagc ggcgggcttt gtggtgtccg 703981 ctgtgtggag ttgtgggcgt ggacctggga acgccacggg ttccgggcgt ttgccgaaac 704041 cgctgcgcca cagttgattt ggcgggagta cagacccggc tggacccgat acggcgacgg 704101 atctgtggcg caggtcaaat cgatcttcga cgctccgcgc ggttacctca atgcggcgtg 704161 tcgtcggcgt gttgtacatt gggcatcggg actcctgaga aggatcctgt aggccgcagc 704221 cccacccacg ggtggggctg acgtgcgtcc aagggggcca gatctggcag accttcatct 704281 tgtttgcgac gatgtcccat aatcgttggt ggtcttcacc gaccgggcgt ctttgacgtc 704341 tgaccgacgc ctccgaaagt ggaggtagga cacaaggtcg gcagcttgca gcaggcgacg 704401 gtgtttcgag ggcgcgaaat gcagtgcgtc gacgcccgct attcctcacc gccgcggttt 704461 cctcggtggc aatctcactt cgtcgagccg cgggcacggc tttcgagata gaggtcgata 704521 tgcccacaag tctcgcaggc aacggcgttg acctgggtgc cgcgattgaa gtggccggca 704581 ccttcgcgct tgaaacgcag cggcgcgttc cagacgacgg ccccttcgac gagctggtcg 704641 cccccgcatc tcacgcactt ctcgtcggtc acgacgcctc cccttctctg cggctggcca 704701 ggctacgccc agcgcttgat gcccaggaaa tccacggcgc cgccgctagt ttcacctgaa 704761 cgacgccgcg cgatcacgaa gctttcggat cgcccgtgcg gtaaacgctt gcggctccag 704821 atgccacagg tgcgcgcctt caggtgtgcg caacgccgcg aaaggaaccc gctcaccaca 704881 cacgagcttc tccagctcga gtgcccagcc tacggccagg gcggcacgct gccggtgcca 704941 ggcgtcagcg cgccacgtca gccggggcaa gccggcgagt ttgctctgga tgctttggcg 705001 gagtttgccg gcgcagagga tccgggacgg aaccgagccc agaccttcgg tgtcggacgc 705061 cggagcgacg cgggtgaacg cgatctcccg gcggtcgtag ttccaccagg cgcgcgcgac 705121 ggtgacctcg ccgaccgggg tcgcacggtt ggacagttcg cgcttttctt gacgcgcccc 705181 atgctgatcc agccacagat ggatttcggc aaccgtgcgg gcgggcagtg gctcggcgtt 705241 cgggcgcgga tcgcatccgg cgaccagcat cgcggggacg tgcggcggcc acagctgggc 705301 gagtggacgc agcggattga gccgaacgcc gtcgaagtcg ctgcgtttgg cgaggtcgcc 705361 ccagagtgcg ggacccaacc gcacgacgtc gagtccgcgg tgtcgctcgg ttcgccggaa 705421 tgtcgccgcc gcgtccggcg cggtcatgat cgtgatgacg tccaacccgt cggcgcggtg 705481 aacggcgggc cggccggtgc ccggggagac ggcaacccac cacggccctt cgtacagcaa 705541 atcccgctgg ccggggcccg gtttggcaac ggcatcctcg agctcgaccg tgcgcgccca 705601 gttctgcagc tcgggcagcg cacccggccg gaacgccgaa ccccacggtt cgtcgggatc 705661 gaatgacagg ccaccgacga gcggagcgat gtcgcgaacc agttccaggc cggaatattc 705721 gcggccgtcc gacgccgaac tctcggcgat gagcatcggt gtgccgtcat cgagcgccac 705781 ggtctcgagt tcgccgtcta tctcggcgac ccgccacttc gggtagcgca gtatcgcgcg 705841 ccgcgcgtcg tcggcggaga gttctccgcg cgcatatcgg gccagtaagc cccgtagctc 705901 gtcgtccatc ggccatcacc cggtcgggtt gcagcatccg ccacagaaca aagcggacga 705961 ctacgccacc tcgcggacat gcggaatctc ccgccgccgt cgtggtcgga tatcgtcgcc 706021 ggccaacgtg acgaccgcta ccgtgcagcc gttcgcggcg gtaaagtcga cttcgtagcc 706081 acccgcggcg tagcgcccga cgacggcacc gacgtctccg gcgatgaggg atttgtcggg 706141 aacatcccgt gttagcacca caacatcgtg ttctgcgtac atcggtccgc tcctagcgtg 706201 gataggcggt aaccaatcga ggcacgccgt cgggttcgtc gctgatccac actgtacgca 706261 atgcaaccat ccggccgcac cgtgattcca cgacaccatc gacgattgcc gtgacgccgt 706321 agggtgttgg ggccgatccg gcaaccgcgc ctgacggtgc ggccagggcg gctgccgggg 706381 atgatcgccg gcgtggcggc gaaacgaatg aaccgcgaac agttcttccg cgcggcgtcg 706441 gggctcgatg aggatcgcct acggaaggcg ctgtggaacc tctactggcg cggcaccgca 706501 aacatgcggg agcgcatcga ggccgagctg gccagcgccg ggcgcgctcg cccggcgcgc 706561 aaaataaagc cgccggccga tccggacatc gtgggttggg aggtcgacga gttcgtgtca 706621 ctggcgcggt cgggtgccta cctgggcggg gaccggcggg tgtcgccgcg ggaacgatcg 706681 cgctggcgtt tcaccttcaa gcggctcgcc gcggaagccc aggacgccct gcgagccgag 706741 gacgccgagc ccgcggcatc cgcactggag caactgatcg acctggcgcg cgaggccgac 706801 gggtacgact acttccgctc cgacgatccg gtggcagcgg cgggtttcgt cgtgtccgat 706861 gtggcggcgg cgggccaccc acacttccgt gagttcgccg ccgagatcgg tgcggcgatc 706921 ccgccgtgag taccgcccgc ccggctacta caagcccaaa gcggtgcgca gccggtcggc 706981 gtccatcccg ccacgggcgc ccgcgccggc gggaaacgtg tccaggagct tgatcaggtc 707041 ggcgcggcgg gtggggtcgt cggcggcctg ccgcggcgtg tgcccgtcca gcgcggggat 707101 gggttgatcg agccagctgg tctcgtagtc gcggatgaat tcctcgagcg cggcggccag 707161 ctcggggctg tcggggtcgg gcgcgcccgc gccggtaact ggcatctgct cggccagcgc 707221 ggcggcctcg cgggtgttgc gcagcggacg gcggtcgtcg tcgagcaccg tcatcgccgg 707281 gtcgaggcgg gtcagcgtgg ccagcacgcg atccatccgc ggttcgctgt tggtttccac 707341 ccgcagcgtg tcaccgtcga ggaccagcgt ggcccggacc cgcagcatgc cgtcgttggt 707401 gacgtgttcg atccaccgcg gcggctcctc gccgtcaacc cggtcgtaga ccccgtcgag 707461 cgcgccctgg atcccggccg gatcgtcgac tcgcacgctg gcctcgcaga ttgccagcga 707521 gtcgccctcg gtgttgacca gtgtcggcgg cgcgaaccgg cggctcagct gggccaccag 707581 tgtcaccggg tcgggctcgt catcgagcag ctcgatcagc acggcacgct cgtgcagcgc 707641 gaccggctcg atcccgccga agaacaccat ggtgtccccg gcgggcaccg ggcgcgcgca 707701 gatcagctgc ccggctcgca gctggcggct ggccgcccgc tcatgcacct catgggtgtc 707761 gccggtgcgt acgtcgcgca cgatcacgcc ctcgccaggt tgcacgtgct cgacctcgaa 707821 caccgaccgc tccacgagca gccattgctc ggcaagcagc cgctcgtcgt cgggtagcag 707881 cgaaccgcgc acttcgagga actccgcgaa cgcgccgccc tcgaacaaca ccgcgtccag 707941 caccagcgga tcggccagcg ccgcggccag cgcgtcctca tcgtcagagt cggcataccg 708001 gaagcgctca tagctgactt cggccagcag gccggtccag tcgcccgaca gtgcgtgctg 708061 ggatgccttg gcatacagcc agtccacccg ctcggccagc ggcagcgcct cacggccgag 708121 atggcatttc ttgtacttgc ggcccgaccc gcaccagcac gcctcgttgc ggcccaggtc 708181 gcggcgcggc tgggctcggt gccgctccag gagccgcacc agcgggtggt cgggttcggt 708241 gccggcgcgg cgcagcagtg ccaacccgcg ctcggcgtcg ccgcgatcgg aggcgatgcg 708301 ggccaggtcg agcaacggca gcggccactc ggtgtccatc gactcggccg ccagcagctc 708361 acgttcggcc gcctcgacat caccgatccg gtccagcgcg accgcgcgca gccagcgcac 708421 cgccacccgc gccgcgcgcg gcaccttggg ctccagcatc tcggtgagca ggcccagcgc 708481 ggccgccccg ccggagtcgg tgcccaccgt ctctgccacc agcagctcgg ccagcagcgg 708541 gtcggccagc gccgccccaa tgtcgccgag cagatcgacc aacgagtcgg agccggtttc 708601 cgtcgcggtc tcggcggctg tggcgagcac atcccgcggc aactcgtccg ggtcggttgc 708661 ttcgagcagc agcgacatcg tctcgtgcag tttgatcagc gtgtacagcg cgaccgcgtc 708721 gttggggtcg aggtcgtggc gaaaggccag cagttcgcat cggttctcga aacgccaagc 708781 gtcgaaattg aatccgccag gtgctagcca gtcgtcttcg tgcgtgaggc cgtgctggtc 708841 gaggatctcg cgcagcggtg ccactggctc ggtaaatgcc gccgggtcgt cgacgcacgc 708901 cgtccagacc gccgcgggga agaacgcggg ctcgtcgggg tcgaccagct cggccagccg 708961 ggcgccgacg gaggtgtccg caccggctgt gccgatccgc tcgagcacca gccctgcggc 709021 ggtcagccgc acaccgacca gatcccccgc ggcggccccc aacgtcgcca gcgtgcccgg 709081 ctccagcagc agtgccccgc cggggtcgat ggcctcgtcc gggatgcccc gtcgttcgag 709141 cagctcctcg tcgtatccgg ccagcacgat ccgcgccgcc gaaccgtcgg ccagccggcc 709201 atactcctcg tgctcgcaga gcgtggtgat cgggtccagg tccggggtca cgccgagcat 709261 gtcgtggacc gcctcgtccg cgccgagccg atgggtgaat acccgcccgg ctagcagcgt 709321 cggcagccac acccaccgat cgtcgaccaa ctgccttgcc ggccattccg tttcaaggcg 709381 aagcgcgcgc aggacggcgt ccgggtcggc cacgccgctg tccagcaggc gtcgtgcgat 709441 gtcgtcctcg ctcaatgggc catgttcggc caggattctc gccacggctt gggtcgcatc 709501 gaacgcttcg gccacggtgg ccaccttatg ccgcggccag ccgaggcttg acgtcgggca 709561 ccagccgatg gggctggcct cgcctagggt tcggcgttgt gacggcgccg acgcggtgga 709621 ccctggccga cggacgtgag ctgctgttct tttcgctgcc cgggccccgc accagcggca 709681 ccgccgcaga acgggtggct cgccacgctc aagcgcaaac gttcgccggc gatatccgcc 709741 agcgcgccat acagctggtc gtgtccgaac aagaagtggc aagcaaaatc accgccgcta 709801 ccgccggaat cgccaccacc accttcccgg aaacacccag catcgacgac accatcatcg 709861 gcaacgacaa ccgcgacact ggggtccggt tggtcgacgt caaacaagat ggcggcacta 709921 gtcccccgcc cccatttgcg ccgtgggaca cccctgatgg aacaccgccg ccgggcactg 709981 gcctaagccc tacgctgcag cagatgatcc tcggcggtga tccagctaat ctgaccggcc 710041 agggtcttgc ggacaacgtg caacggttcg tacagtcgct gcccgcaaac gaccccaaca 710101 cagcgtggtt gcgcggtcag gttgcggatc tgcaggcgca cgtcgccgat attgagtacg 710161 cccgcaccca ttgcagcacc aacgactgga tcgaccggac cgcccagttc gcctcgggcg 710221 ccatagtctt cagcatcggc gtgttgaccg cagagaccgg ggcgggggtc gtggctgccg 710281 cggccggtgg tgtcggcgcg gccacggcgg gcgtgagtct tctacaatgc ctggtgggga 710341 gcaagtgatg gacgtattgg ctgctgggat cgcggctggc gcgctcacgc tggcggcgtg 710401 gggcgcctgg cgcccgcact accgggcggc gtcctacctc gtggccggtg ccgtagagct 710461 ggcactgatc gggctgctgg tggtgaccgg gcaaacattg atggccatct cggtggcctt 710521 ccttgtggcg ctgggcggtc cgttggtggt ggtcaaccac cgcagagctg aacgcagccg 710581 aggttagatg aacgaagagg gcctgtaggt cgcactcatc gcgcggctag cctgtgaggc 710641 cagccctcgg gccgccaccc aacacggctc gtgcgctgtc tcggccggct cgtctgccgc 710701 acggccagca tgatcagtcc cgttggaata ccggtgagcg tcggcgcgcg catcacgatg 710761 cagcgatgtt aggatgaggc ggtgcgcact accatcgacc tgccgcaaga cctgcacaag 710821 caggcactgg cgattgcccg ggatacgcac cgcacgttga gtgaaacggt cgccgacctc 710881 atgcgacgag gcctggccgc caaccgccct accgcgttgt cctcagaccc cagaacggga 710941 ttgcctttgg tgagcgtcgg gaccgtcgtg acctccgagg acgtgcgttc attagaggac 711001 gagcagtgac ggtgctgctc gacgccaacg tgctgatcgc attggtggtc gccgagcatg 711061 tgcatcatga tgccgcagcg gactggctca tggcgtccga caccggattt gcgacctgcc 711121 cgatgacaca aggaagcctg gttcgattcc tggtgcgctc gggacagtcc gcggcggcgg 711181 ctcgggatgt cgtcagtgcg gtccagtgca cgagccgcca cgaattctgg cccgatgcac 711241 tctctttcgc cggtgtcgag gtcgctggtg tggttgggca ccggcaggtg accgatgcct 711301 accttgccca gctcgcgcga agccacgacg ggcagttggc gacgctcgac agcggcttag 711361 cacacctgca cggcgacgtc gcggtactca ttccaacgac cacctgatgt gcatcgtctc 711421 ccggcggcgc ggcgagccgc cccaaaacca acgattgggc cacgatgcgt aggcatagct 711481 gaggtggcgt cgcggccctc accggcgaca ccacagagga tctcgggccg atccgatgag 711541 cgccacgcca ccgcccggag gactcgacgc gtcggtgttc atcgcgaacg aacgcggtcg 711601 gcaactcgac gaggcgctcc cagtagggtt ctgcgttgtg acggcgccga cgcggtggac 711661 cctggccgat ggccgtgacc tgctgttctt ttcgctgccc ggacacgtcc cggcgccggt 711721 gtcggatcgt cggccgctgc ccgaacgtga cccggctccc tcgcggctgc ggttcgaccg 711781 ggccaccggc cagtgggtga tcgtcgccgc acagcgccag gatcgcacct acaagccgcc 711841 ggccgcgcgc tgcccgctgt gtccggggcc gaccggtctg agtagcgagg tgcccgcccc 711901 cgactacgac gttgtcgtct tcgagaaccg gtttcccagc ctggccgggg ccggcatcgc 711961 cccaatcggc gcgcccgacg gtgacgggtt cgtatccgct ccggggcacg gacgctgcga 712021 ggtgatctgc ttttcggccg atcacaccgg ttcgttcgcg ggcctggacc cggcgcatgc 712081 ccggctggtc gtgcacgcgt ggcggcaccg caccgccgaa ttgacggcgc tgcccggggt 712141 agcgcaggtg ttctgcttcg agaaccgtgg tgaggagatc ggggtgaccc tgcccacccg 712201 cacggccaga tttacgccta tccgtatctg acgccgcgca ccgcggcgat gctgcgccag 712261 gctcgtcggc accgaaagcg tcacggtgac aacctgtttg ccagcctgct ggcacgcgag 712321 gtcgccgacg gcagccgcat cgtggtacgc ggcgagctgt tcaccgcatt cgtaccgttc 712381 gccgcacgct ggccggtgga ggtgcacatt tacccaaacc ggttggtgcg caacctcacc 712441 gagctcaatg acggggagtt ggatgagttc gcccggatct atctggacgt gctgcagagg 712501 tttgatcgga tgtattcttc accgctgccg tacatgtcgg cgctgcacca gttcagcgag 712561 gtccagcgcg atggctactt tcacgtcgag ctcatgtcga tccggcgcag cgccaccaaa 712621 ctgaaatatc tggcggccgc cgagtcggcg atggacgcgt tcatcgccga cgttatcccg 712681 gagagcgtgg ccacccggct gcgcgagctg ggcccatgac ggtcagctac ggcgcacccg 712741 ggcgggtcaa cctgatcggc gaacacaccg attacaacct gggtttcgcg ctgccgattg 712801 cgttgccgcg gcgcaccgtt gtcacgttca cccccgagca caccggcgcg atcaccgcgc 712861 gcagcgaccg cgccgacggc tcggcgcgga tcccgctcga caccacgccg gggcaggtga 712921 ccggctgggc agcctatgcg gccggggcga tctgggcgct gcggggcgcc ggccacccgg 712981 tgcccggcgg ggcgatgtcg atcaccagcg acgtcgagat cgggtcgggg ctttcgtcgt 713041 cggcggcgct gatcggcgcg gtgctgggcg cggtcggcgc cgccaccggc acccgcatcg 713101 accgtctcga gcgggcccgg ctcgcacagc gagccgagaa cgactacgtc ggtgccccaa 713161 cgggtttgct cgaccacctg gccgcgctgt tcggagcgcc gaagaccgcg ctgctgatcg 713221 actttcgcga catcaccgtg cgcccggtgg ccttcgaccc ggacgcctgc gatgtggtgc 713281 tgctgttgat ggattctcga gcccgacact gtcacgccgg cggggagtat gcgctgcgcc 713341 gggcgtcgtg tgaacgggcg gccgccgatc tgggggtgtc ctcgttgcgc gctgtgcagg 713401 atcgcgggct ggcggcgctg ggcgcgatcg ccgatccgat cgacgcgcgc cgcgcccggc 713461 acgtgctgac cgagaatcag cgggtgctgg atttcgcggc cgcactggct gattcggatt 713521 tcaccgccgc cgggcagctg ctgaccgcgt cgcatgagtc catgcgcgag gacttcgcca 713581 tcaccaccga gcggatcgat ctgatcgccg agagcgccgt acgggccggt gcgctgggcg 713641 cccggatgac cgggggcggc ttcgggggcg ccgtgatcgc actggtgcct gccgataggg 713701 cgcgcgacgt ggccgacacg gtgcgacggg cggcggtcac cgccggctac gacgagccgg 713761 cggtgagccg gacctatgcc gcgcccggcg cggccgagtg ccgttgagcg ggttggcgaa 713821 gcgtcatgtc cacagtgagc agatcggtgc gggtccgcca ctgcccttga cctcgaagcc 713881 gaacaccggc tacgagaccg ctgccgcggt ggatctggtg tctggggctt agcccgcttc 713941 gctgataccc agaaccagtg cgagcgcgtg gtcggtctgg cgcatcctgc caggtgccag 714001 ggctcccaat cgttcgagca ggcgggtgac gaggatcgcc gactggtcct gcgcgcgagt 714061 attggttgtc gccgcggtat cccggggcca tgaagacgca gccttcgatc ttggcttcgc 714121 tcgactcccg atgccgcccc aactcgatcg ctcttgggtt tgtccgaccg cgggcgtagc 714181 ctttgcctcg aggtgcagcc gatggcaggc gatcgaggcg ctgaccccgg tccggcgaat 714241 gtgactccgg gtgcggatga ccatgcacag catgcgtcgc cgacggtgct atgtccccag 714301 ggtcacgtga acgcatggga ctacaggttc tgtgagcggt gcggctcgcc gatcggcgtg 714361 gtgccctggc cgtcggagga atcaggcaca cgccagacgg cgcccgcgcg atccttcgtc 714421 cccctcgtcg tcctcgcggc gacgctgctc gtggtcgccg tcgtcgtgac ggccgtcggc 714481 tacgcggtga cgcgaccggc tcgcaacgac cgtgaggagc ccagttccgc gcggggcgcc 714541 gccacgacgg gtgtgccgtt cgcacaggcc gaggccgcga gttgcccgga cgatccggtg 714601 cttgaagcgg agtcgatcga cctgacgtcc gacgggcttg cggtgagtgc cgcgttcatg 714661 tcggcatgcg ccggcggcga tgtcgagtcg aactcggcgc tcgaggtcac cgtcgccgac 714721 ggacggcgcg acgtggcggc cggaagcttc gacttctcgg cagatccgct gaggatcgag 714781 cccggcgtgc ccgcccgtcg aaccctggtc tttccgcccg gaatgtattg gcgaacgccc 714841 gacatgttgt ccggcgcacc ggcattggcg gccacacgga agggcaggtc cgatcgttcg 714901 gccgcacgag gcggatcggc acggacgacc atggtcgcgg ccgcgtccgc ggcaccggct 714961 tacggcagca tcaacgccgt tgccggggcg gtgctggtgg agctacgtga ctcggacttc 715021 ccctacgtgc gagtcggtat cgccaatcgc tgggtgccgc aggtgagttc gaagcgcgtc 715081 ggcctggtcg ccgcggggaa aacgtggacg agcgccgata ttcttcgcga tcacctggcc 715141 ctgcggcagc ggttcggggg cgcccgcctg gtgtggtcgg ggcactggac caccttcagc 715201 ggacccgatt tctgggtgac ggtggttggg ccggcgcagc ccaccgcagc tgaggccaat 715261 cgctgatgcg actcgaacgg gttcggcgcc gatgactgtt tcgcgaagtt catcagcacc 715321 ctcgttggcg cgaagggcac gacggtgtac cggaagtgac gacgctgcca tgagtttctg 715381 cgtgtattgc ggtgccgagc ttgccgaccc gaccaggtgc ggggcgtgcg gcgcatacaa 715441 gattggttca acctggcatc ggaccacgac gccgacggtc ggcgccgcga cgacggcaac 715501 gggatggcga cccgatccca ccggtcgcca cgagggacgc tacttcgtcg ccgggcagcc 715561 gaccgacctc gttcgcgagg gcgacgccga agccgttgac ccacttggtc agcagcagct 715621 ggatcagtca ggtgccgttg gtgtttcgcc gtcagcggtg tcggggtggg tgcgttctgg 715681 gcaccgtcga ctgtggtggg cgcttgcggg cgtggtggcg tttctcgggc tggtgggagc 715741 cggtgtcgtc gggacgctgt tcctgaatcg agaccgggag tccatcgacg acaagtacct 715801 cgccgccttg aggcggtccg gactcaccgg tgagttcaac tccgacgcga acgccatcgc 715861 ccgcggcaag caggtgtgcc gccagttgca agacggtggc gaacagcagg ggatgccggt 715921 cgatcaggtc gccgtgcaat actactgccc gcagttcagc gatggcttcc atatcctgga 715981 aaccataact gtcactggaa gtttcaccct caaggatgaa tcgccaaacg tgtacgcacc 716041 ggcgatcacc gtgtcgggct ccgggtgctc agggtcagcc ggctacgccg acatcgaccg 716101 gggaacgcag gtgacggtga aaaacggtca gggggacatc ctggccacgg ccttcctgca 716161 ggcgggtcag ggcggccgat tcttgtgcac cttccctttc tcgtttgaaa tcaccgaggg 716221 cgaagaccgc tacgtcgtgt cggtcagtcg tcgaggcgaa atgagttact cgttcgccga 716281 tctgaaggcc aatgggctat cgctcgtctt gggctgagtc accgcggtat tcggcacggc 716341 gcaccgctgc gcaaccagct agcgctgacc gtgtgatcta gaatctagct actagtatag 716401 aatcgagaca tggcgctgag tatcaagcac ccggaagccg accggctcgc gcgagcgctt 716461 gcggcgcgca ccggcgagac gttgaccgag gcagtggtta ccgcgttgcg cgagcggctc 716521 gctcgtgaga ctgggcgtgc ccgtgttgtc ccgttgcgcg acgagcttgc cgcgattcgg 716581 caccggtgcg cagcgttgcc ggtggtcgac aaccggtccg ctgaggcgat tctcggctat 716641 gacgagcgcg gattgccggc ctgatggtga tcgacacgtc cgcgctcgtt gcgatgctca 716701 gcgacgagcc agacgcagag cggttcgagg ccgccgtcga agccgaccac atccggctga 716761 tgtcgacggc gtcttacctg gaaacggcac tcgtgataga agcccgcttc ggtgaaccgg 716821 gcggacgtga gctggatctg tggcttcatc gcgccgcggt cgaccttgtt gccgtgcatg 716881 ccgaccaagc ggatgccgcg cgcgccgcct accgcacgta cggcaaggga aggcatcgtg 716941 cggggctcaa ctacggcgac tgcttctcat acggcctcgc caagatcagc ggccagccac 717001 tcctgttcaa gggcgaagat ttccaacaca ccgacatcgc cacggtcgcg ctgccctaat 717061 tcttagtcag ccaggtgttc gccgcaccgg ctttcggcag cgtcaacggt gttgttaagt 717121 gcggcagaag gttcacaagg catgtcgacc gctcagcgtg ctccgacttc gcgatccgga 717181 tcctcgacgc cgccgtccgc gccgtcgcca cgggcgtgtg cacgccactg gcggtacccg 717241 tgtcgcgccg cgaacgcacc gatgatggcg gtgacgcacc acaccgcgat cgcgcaggac 717301 gccagcagcg gtgagcgatc gccgatcgcc gcacccaatg cggtgtaggc gaatgcccgc 717361 ggcgcggaac cgatgaatgc accgacggcc atctgccaca acggaactcc gaacgtcccg 717421 aacgcatagg aggcgaacgc atccgatatg ccggggacaa agcgttggcc gacgacggcc 717481 cacaggccgc atcgttcgat cagcgcgtcg gtgcgatcgg cacgttcccc gcccagcagg 717541 gctcgcgcgc tggcccggcc ggctcgacgg ccgaccaggc tcgcgacaac ggcggtgccc 717601 accgtggcac ccagcgtcac gaagaccccc actagcggac cgaacagcag cccgctgctt 717661 gcggccagga tcgggcccgg gacgaacaac gcgccgagca cggccgacac tacgacatag 717721 gtcagcggcg ccgccggccc ggtcgccgag accgcgcccc gcaccgcggc cacatcgatg 717781 acgtccgtgg cggctaccag gtagaacatt cctacaagga agccggcgaa cacgacaagc 717841 cgcacgatgt ggcgtcgccg ggatgtcggt gcggaatcgt tgtgagtgct catgctgacc 717901 gtgattgttc cgcaccgacg ctggccgcgc ccgtcgtccc cggcgttggc tggggaacct 717961 cggctgcgcg ggcgccgtcc ggcgagcaac ccgtttgtcc tacgattgag ctacgatcgt 718021 aggcatgtct gaggtggcct cgcgtgagct gcgtaacgat acggccggcg tgctgcgccg 718081 cgtgcgggca ggggaggacg tcaccatcac cgtcagcggc cgtccggtcg cggtgcttac 718141 cccggttcgt ccgcggcgcc ggcgttggct gagcaaaacg gagttcctgt cgcggttgcg 718201 cggcgctcaa gccgatcccg ggctccgtaa cgacctcgcg gtccttgccg gcgacacgac 718261 cgaggatctc gggccgatcc ggtgagcacg acgccggccg ccggagtgct cgacacgtcg 718321 gtgttcatcg cgaccgaaag cggccggcaa ctcgacgagg cgctgatccc cgaccgggtc 718381 gccaccaccg tcgtcaccct cgccgaactg cgcgtcggcg tgctggccgc ggcgacgacc 718441 gacatccggg ctcaacgcct ggcgaccctg gaatccgttg ccgatatgga aacgttgccc 718501 gtcgacgacg atgccgcccg aatgtgggcc cgattgcgga tccatcttgc cgagtccggt 718561 cgccgggtgc ggatcaacga cctgtggatc gcggccgtcg cggcatcgcg agcgctgccg 718621 gtcatcaccc aggacgacga cttcgccgcc ctcgacggtg cggccagtgt ggagatcatt 718681 cgggtctgac tcggtggcca cgcgtctctc gcgctgttgt ccgcacccgc agggcgtccc 718741 ggtgggtcaa cgcggcggcc tcagtcgacg aacagcgcca tcgacgcggt aaacccgtgc 718801 aacgcgttgt ggcccgcgac cgggccgatc tccccggcgg cgaagaaacc ggccagcgga 718861 atcccgccca gcaggtcctc gatcgtcgac gcgtcgtggt cggtgacccc gaacattcgt 718921 cgtccgcgcc cgttgcaggt gaacagcagc ccaccgaccg ggggcccggg cagctccgcc 718981 gccgcccgct cgacggccag gcgcaggtcc ttgtcggccg ccgccgcgtc ccggacctgg 719041 aattgcacgg tcgcgccgac ctcgacaacc tcgccgatcc cgatcgcccc cgtcgttggg 719101 tcggcgccga gcagcccgcg gatcaaaaag tcgccctgac ccggcaccgc caggtgctcg 719161 tcgacgacga ttccgatctg caggccgcgg ctgaccagtt cctgctcgtc gggcgccatc 719221 cccaagacga tctcccgcag gcggtgcagc ggcggtcggc cgcccagctc ggtgatcacg 719281 gcaccgtccg cgccggtgac aatgtacggt tccccgatcg gccggcagcc ctgcgacacc 719341 acggaaacgc tgtgcgcgcc gggcaggcgc acgccgacca gcccggaggt gagcacgtcg 719401 cggtcacgaa acagccgggt gtcgccccgc cgacgcccac cgctcaccac cccgccgacg 719461 acggtcgttc ccggcaggtc ggtgttgagg tgctcgatga gcagattcga cgggaacgag 719521 tacgggtccg gcagcagcag gtgcaagtcg tgcgcggtcc ggtcgaagcg gtaaccggtg 719581 atcagagcgc ccgagccggt gcgaacgaag tccaggtgga atgtctccgc gggtgggccg 719641 gacgccagcc acaccgccac cgcgggctcg ttctccagct cgtggcgacc ggcgacgatg 719701 ccttgggcca cgcaaccgat cagcgcggcc ggctcgaccg acgcctgcac cgcagccagc 719761 aggtccacgg cctggtcggt gtgtgaccgc gatccgagga gcacggccag cgccggcgtc 719821 ccacccgcga gctcctcgcg cgcgtgcgcg gcagcctccg ccgcggcccg gcgcacgtcc 719881 ggcgcggtgg aaaccccgac tccgatccgc acacatccat gatgcgccgt cgccgtgctg 719941 ttcgtgtatg cgatgtcaaa gtccgggcgc ggttacccga cgagccgagc acatccccga 720001 cgagtcagcc acaccccgtc gactgtaacc gcatccgcaa cccgctggcc cgcaccgccc 720061 ggcgtgcgat cgcggcccgc accgaagctt cggacccgac cacccggacc ttgcgtttgg 720121 cccgggtcac cgcggtgtac agcaactccc gggtcagcaa ccgcgaatcc tcttgcggca 720181 tcagcaccgt cacctcgtcg acctggctgc cctgactctt gtggatggtc atcgcgtgca 720241 tggtctcgac gtcgccgagg cggccggtgg caacgtcaag tggcccggat gcaccagaaa 720301 tgacggcccg cagaccggtg gggccggcca gcacgacacc ggtgtcgccg ttgtagacgc 720361 gaaggccgta gtcgttggcc gtcaccagca gcggacgccc ggcgtaccac ggcgtccagg 720421 gcggctggcc ggtctcctcg gcgagccagg cttgaacccg gcggttccag tgcagcacgc 720481 cggtgggccc gtcccgatgc gcacacagca gccggtgctc gtccagggtg gccaacgcga 720541 cgtcggaggc acccaacagc gccgcctcgc gcagccgcaa cgcgtgtggc accagcaccg 720601 cgcgcaaccg cggcgccgga tcctcgtcgt cgacgaactc gatccgctcc tcacccgagc 720661 gcagcaggcc cagtacggca tcgccatcgc cggcccggat cgcttcggcc aaggtaccga 720721 tcaccttgcc gaaccgatgc gacgttcgca gctgcgccac cagcgcgtcg tcgcgtaccg 720781 agaagccatc gaccaaatcc gccagcaccg ctccggcttc caccgacgcc aactggtcgg 720841 catcgccgac gaggatcaac cgggcgcccg ggcgcaccgc ctcggccagc cgggccatca 720901 gcgtcagcga caccatcgag gtctcgtcga ccacgatcac gttgtgaggc aaccggttct 720961 ggcgatcctg gcgaaaccgc gctcccggtt tggcacccag cagacgatgc agcgtgaccg 721021 cgtgcaggtc gccgagccgt gcccggtcgg tggcgtcgag cttggccatc tcgcgccgta 721081 ccgcctcggc cagccgggcc gccgccttgc cggtgggtgc ggccagcgcg atccgcggcc 721141 gcggctcacc ggccagctcc gcctgctcgg caaccaacgc cagcagccgc gcgaccgtcg 721201 tcgtcttccc ggtgccaggc ccgccagtca acaccgtaac accttgcgag agcgcgattt 721261 ccgccgcgcg ccgctgctcg tcaaagccgg tcgggaacag tcgccgcaag tcgggtaccc 721321 cggccggtcg cctggatgtc agcaacgcga gcaggtccgc gcacacctgc tcttcttcgc 721381 gccagtagcg gtccagatag agcagccgat cgtcatacag gtgcagcacg ggtggatcgg 721441 cgagcaacgg actggcccgc accgccgcca accagtccgc cggatccggc cacggcaggt 721501 cgtcgtgtcc agcaacccgc gcgatcgaca acagatccac acacaccgaa ccggcccgta 721561 gcgcgcggac cgccaccgct accgccaacg ccacccgctc gtcgctctcc ccggccagtg 721621 cacagagacg ttgcgccaca tgcacatccg acacgtccag cacaccggcc tggttgaagg 721681 cccgcaccat cccggaggcc tcgacggcaa aatcgacgtc ggtgagcttc acgactgcag 721741 ccttccccgg tcgagcagat ccgagagcgc caccaccaac gccgtgggcg ggttccaggt 721801 gaacacaccg gccggatgcc cggccgtcac cggcgtcgcc gcaccgcaca tgccccgcac 721861 aaacaggtac agcaccccgc cgagatggcg cgccggagcg taatcccgct gccgccaccg 721921 cagaaagcgg tgcagcacaa caacatacag cagcgcctgc agtgggtagt ccgaatgcag 721981 catggcctcg gtcaaccgct cgaagccgta atcggcggcg gtgtcaccaa ggtgattggt 722041 cttgtaatcg accaccagat atcgctgccc gggtagccgc agcaccacgt cgatcgaccc 722101 cgccaggtag ccacgcagcg gttgatcacc caacccggcc gaaccaagcc gatcggcgta 722161 gggcgacaac gggtcgtcgc cgggcaggtg cgacgccagc agctcaccca cgtcggccag 722221 cgacacgtcc ggggaccggc cgcgcagatc gcccccggcc agcggcatct cgaagtccaa 722281 ctcccgcaga cgatcacgca caccgatctg ccgcaatgtc agtgcggcgg cggcgggtcc 722341 cagcggcgtg tcgtgcatcg gcagcaacgc tcgggccagt tcgggagcca gctgcgcgtg 722401 gtcgacgtcc acggtccacc acggcgcgtg ccggcgcacc tgggcttcca gttcggcagc 722461 cagatcggga gcggctgggt ccgcggtctc gagcaccgcg tgcaccagcg agccgaacga 722521 cgcccccgac ggcagcgcgg ccagcggtga tgtcagatcg gcgccggaac cgggcgcggc 722581 gacgacggcg atctccacct cgtccgcacg gccgccggcc gccggctcgc tggtgacggt 722641 gacggcttcc gagccccgca ccagatccga gtacgaggtc cgccgccacg tggtgtcgat 722701 ccggcggtga aagtgccgaa cctcgaaacc gggtacgggc accggctttt cgagggaact 722761 gcgagcaccg atgaccgatt cctcgaccga cggcccgccc gcggcctccc actgcgcgaa 722821 caccgcccag gcctgctcgt cggtgacgcg tggtgtacac cggtccggta cctgcgactg 722881 gccgggccgg cgcccgcgca gcaaccgcga caacccgccg ttgacctcgt cgaacgtcgg 722941 tgcccaccac gcgacgacct gcgattgcgc gcgggtaagc gcgacatagg tgagccggag 723001 gttgtcgtgg gccgcctcga cgcggttcag cccctcaacg gtgcgccgct gagcaccgcc 723061 gtccttgccg ccgatgtaca ggcagcgggt gccgtcgtcg tgatacagca ggatgtcgtc 723121 gctgcggacg ttgcggttga aggcgaacgg cagatacacg atgggaaact gcagtccctt 723181 ggccacgaag acggtcatga tctgcaccgc cgcggcgtcg ctgtccaacc ggcgattgtg 723241 ttccggcggg ccggcacccg ccttggcctg gcggcgcagc caatcgcgca gcccgggcag 723301 gccgagccgc tcgcgatgag cggcctcgtg cagcagctgc gcaatgtgcg ccaggtctgt 723361 caggtcccgt tcgccgccgc gctggctcag cacgcgccgg cccatcccgg ccagctgagc 723421 ggcctgaaac accgcggcca caccgcgatg gcgtgcgtgg tcggcccact cgcgcaacgt 723481 gccggccacc cgatcggtca gcgcatcgcc ctcggcggca agcgattccg cggtctcacc 723541 gaagaacatc gtgcacgcgg cggcgcggac cagcccgctg cgctgcggcg cgtcgaacgc 723601 ctccagcagg cacagccagt ccttggcggc ctgcgaggcg aacacgtcgg tgtcaccggt 723661 gtagatcgcc gggatgcccg cctccgccaa cgcattccgg cacgcccgcg cgtctttgtg 723721 atgctcgacg atcaccgcga tgtctgcggc caccacgggc cgcccggcga aggtggcccc 723781 gctggccagt agcgccgcga cgtcggcggc caggtcgtcg gggatgtgcc ggcgcagcgc 723841 ctcgatcggg acgtgggcgg tcccgtcata cccgagcgtg tgccgtttga ccacgcgcaa 723901 ccgaaacggc gccgggcgcg gcgccgaggc caggcggtgc ccggcgtggt gggcgtcggt 723961 gccgcggacg acgatgtcgg cgtgacccag ggtcgcatcg cgcagcaccg tctgcaggct 724021 ctcgaccagc gcccggtcgc tgcgccagtt gacgcccaac gtgtagcggg catcggcggt 724081 gccggccgcc ttgaggtagg tgtggatgtc gccgccgcga aagccgtaga tcgcctgctt 724141 gggatcgccg atcaggatca gcgccgaatg ccggctaaac gcgcgctcga gcacccgcca 724201 ctgcatgggg tcggtatctt gaaactcgtc caccagcacg atccgccagc gttcccgcat 724261 ccgatcgcga gctggcgagt cggccgcctc gagggctgtc gccaaacgga tcagcagatc 724321 gttgaatcct tgcgcacgca gccggccctt gcggcgctcg agttcctcga gcacctcggc 724381 ggcaaagcgc agccgcaccg ctgccttgct gccgggctcg ggatcaggcg ggcgcagttg 724441 ggcgcacggg tcgtcgacga cggcaagggc cagggccagc gcctcggcgt aggtcagctc 724501 cggatcggtc tcctgacgac cgaagttcgc cagatagcga tcgtccacga tctcagtgac 724561 caggtcggta aggctctcct tgagctccac gtcggcggcg ttgtcaccgg ccacaccgag 724621 ggatttcaac accgagccgc agaactcgtg ggtggtggcg atggttgccg cgtcgaagtt 724681 ggccagcgcg tcacgcagcc gcgaccgctt ctgggcgcgc tcggcgtcgc tgccgcgcag 724741 caggtgctcg acgagctcgc cgctcggcgg cgcgtcgcct tgtagcgcgc ccacggcctc 724801 gacgatctgc ccgcgcactc gctcgcgtaa ctcccggctg gccgcacggt tgaacgtgat 724861 caacaacatc tcgtcgagcg tcgcggcggt ttcggccaga tagcgggtga ccagaccggc 724921 cagcgcgaac gtcttaccgg tgccggcgct ggcttccagc acggtggtgg tgccctccct 724981 cggcaacggg cccagcagct cgaagcggtc catcagaccg acccttcggc ggccaacagc 725041 ggcagccata gccgggcggc cagcgcccct agccgggtct cttccccggc gacctcttcg 725101 cccgcgcggg gcttgccgag caacacctcg aagggtgcgc gcgggcccca ggctcgcacg 725161 tgggcgggcg cgtcgtcgtc gcccggccgg aacctgttgg tctgccagca ttcgcgggcg 725221 ggcgggtagg ggtcttggcc gtctcggcgt gcctgggccc acgcgcagga cgtcttcagc 725281 ggcagcggca gtggttcgcg ccggccggcg tcgtacagca acaccagctc ccgcaatacc 725341 gccaccgggt ccggcggcgg cacgaaaagc cttctggcga tgtggttcct ggtcttgctg 725401 cggccgatgc acagcgccga ccactcgcgg ccaggctctt gggcggccag cgtaaccagg 725461 ccgatccacg ccggcaacac atgcttgggc gccagctttg agtaggtcac cgacaccgtg 725521 cgcccgccga acacgggtgt caccgtgccg ctcagtcgcc gcccgtcgcc gaggtcgacg 725581 tcgacgtcgt gcgcctggcc gtggccgtcg cggtgcgcca gcgcggcggc cgccagatcg 725641 cgcgcgcggt tccggatttc cttcgcccgt cgcacgccga ggcgcccggg cggcaacgtg 725701 ccgcgacgcc attcggagtg agcggcgtcg tcggggtgca ggccgcggag catgtcgcgc 725761 aacatccgct cgcccaccgt ccactcggcc aaggcgtcga cctggaccgg tatcgagtcc 725821 tcgacggtgt cgacgtccca gggcagcgtg tagtccagcg cccggaagaa ccccttgacc 725881 ggatccttga agaagtcgag caggtccgcc agcgtcacgt cggccgcggg tggtgcgggc 725941 agccgaccgg agatgaaagc cgttggtgga cagcgcttcc cggcggcggc ctgggcggcg 726001 gcgagcgcgg cggggtcgaa cgtgaacggc ttggcgccca gcagtgcgcc gggggtgacg 726061 ttcttccggt cgaacggctg cagtgggtgt gtgaccagga tccgctcacg caccggcgct 726121 gacgtcgtct ggtcgagcgc gtcgagcaac tcggccagcg gcaccgcggg tgggcgcggt 726181 tgcccggtgc gctcgtcggc gccggtgtaa gtgatcacca gggtctgggt ggccgcacct 726241 atcgcgtcca gcagcaattg ccggtcctcc gaacggatgt cacgttcacc cgtcatcggt 726301 tctcgggcca gcacgtcgtc cccgtcggga tggctcagcc gcggaaacac gccgtcgtcc 726361 agacccacca ggcacaccac ccggtgcggc accgagcgca tcgggaccat cgtgcagacg 726421 gtcagcgtgc cggtgcgaaa gttggcccgg gtcgggcgcc cggccagctg cgcgtccaaa 726481 agcgctcgca cgtcgggcag ccgcaacagc ggcgccgcgc gcgaaccggc gcgcgccagc 726541 acgtcggcga actcccgctg cacctgcgcg cgttgccagc cgtcgttaca ggcggtcagc 726601 agatcgatcc ccgtggccag cgcatccagc catgcgacca acggccgtgc accgctgagt 726661 ccgccgacga catgatgcaa ccgttcgacg aactcggcca gcctcccggc cagctcgacc 726721 cgattgctgc cgacgtcatc aaggggcagc gcggtatcca gccacgcttg ggaatcctcg 726781 gacatggcca ccccggtgag gatgcggtcg agtccgaacc gccacgtgtt gtgcacgacg 726841 gtgtcgaggc catagcgtcg ccggtgcgtc gggtcgaagc cccagcggat gttcgattcg 726901 cgcacccacg tggtgatggt gtccaggtcg tcgtcggcga acccgaattt ggcgcgcacc 726961 ggagcggcct gcgcgaggtt gagcagttgg ctggcggtgg cccgggtttc ggcgatggtg 727021 agcagttcgg cggccaccga gagcagcgga ttggtctggg tcagggcgcg gtcggccaga 727081 cgcacccgca gccggtgtgc ggggtggcag tcgccggcca cctcaccgag gccgaagccg 727141 gcgacgatca acggtgcgta ggtgtcgatg tcggggcaca tcaccacgat gtcgcgcggt 727201 tgcagcgtcg ggtcgtcctc gaggaggccg agcagcacct cgcgcagcac atcgatttgc 727261 cgcgccgggc cgtgacaggc atggacctgc accgatcggt cggcatccga caagctacgc 727321 ccggcgggtc gcggcgcgtt gccggcgatg tcggcttgca gccatcccag caacgtgtcg 727381 ggtttggttg tggcaccaag gaattcgtcg gtggcccggg cggcgggcag cgcgcgctgc 727441 agttcgcgca cgtcgcggcc cagcgtttcc agcagcgggt gctgggcggc ccgccggctg 727501 gtgtcctgcc gccgcggcag caggccatca gcgccctgga agccggccag cgcccgccac 727561 aactcgtcgc tggggtgcgg cagccacagg tgcaggtcgt ggtggacggc cagcgcatcc 727621 agcagctgca cgtcggtgca ggccaggcgg gtgtggccga acagcgaaag ccgagccggc 727681 aggtcggcgg ggccgtcgcg cagccgggcg atggtcttgt cgtggcggac atgcggggga 727741 tcggccccga ccgtggtcac cagggcgcgc cacagtggcg gttgccaggc caagtcgccg 727801 ggcagctcgc cgaggtcgcc gtccagccaa gcggccagca acccgggacg ctggcgtgca 727861 taggacgcga acagcccggc tagccggcgc gccaccgaat agcgccggcc gcggcgcagc 727921 tccgcctcgg catcggtcgt cgcgaagtgc cccaagtggg atgccagcgt gcggcaccac 727981 ggttcgtcga ggctggcgtc gatcaccgcc agcagcggcc acgccagggc ttccggcgac 728041 cacgggtcgt cgtcgagggt gccggtgatc tcggcgatca gggactgcgg attgcggaac 728101 gcgatgccgg cgcacacccc gtcggcgcgg cccggcccgc agcccaacac gagcgaaagc 728161 cgttggctca gccagcgttc cacgccgcgg gcagcgacca gcaccagttc ctgcgcgaaa 728221 gggtcgggct ggggatcggc cagcagcgcg ccgagcccgt cggcaagcag atcggtgcgc 728281 tcggcacggt gcaggtgaag cgccatcggg cgtcacccta gtcgagcggc cggccgccga 728341 catgcatgct ggcgtgcata aacagacgcg agatcaccga acgacaaggg ctaccagtcg 728401 gtacgggcct tcttgtcgac ctggaactgg gtcagatatc gaaccgttcc gggatttcat 728461 caacgcgctg gggcgtgccg cgatgtcggc atgacgagcc gcctcggacg tgacacactt 728521 cgagatggag gaggcggtgt aggtgtgagg cggttgccca aagcaaccgc gtcacccacc 728581 acttacagcc cgaactcggc tgctatcccg tcgatcccgg cccgaatcgc agtgagcgcg 728641 tcggcgcggg agcgcaactt ggtcgcggca tgggcgtgtt ggttgagacc ggcgaactct 728701 cgtgcggctt cctcggcgcg gctgaccacc acctccggca gggcgatctc gtcgataaac 728761 ccggcggcca gcgcggtttc cccgaagaac gtcttggcca gcccggttgc ctgctggtat 728821 gccgaccggg tcagtcgcag cttcatgatc tctaacgccg cgtacggaat ggtcatgccg 728881 atcgcgacct cattggcctg gatgttgtat gcgtgggccg ccacccgatg atcgccgcag 728941 gacaacagaa acgcgcccat ggcgatggcg tgaccggtgc acgccatcac caccggtttg 729001 gggtaggaca agaggcgata cgccagctcg aagccgcccc tgagcatgtc gatcgcgggc 729061 tgcacttcac cggaggtgag gatcttcagg tcgaagcctc cgctgaatac ccggccatta 729121 ccggtgatca ccagcgcccc aacatcatca cggtccgcgt tgtcgatcgc tgcattgagg 729181 gcttgttgca tcgccgggcc cagtgcgttg accttgccgt cgtccatact gatgacggcg 729241 atggaatcct tgcgggtata gctgaccggg tcgctcatgc tctcgattga atcagatcag 729301 cattggggga tcttgtgcgc ccgcagttag cctgccggta tccgcgtggg ctgtggccct 729361 tgcccctccg agcgctggct gacctcggtg ggcacctcga cctgccgagc gcgccacctg 729421 tcctgggttt cggccgcggc gcggttgatc cggtcgatct cgctgcggaa cacgtcggga 729481 cgcgtctcct tgctcgcgta gtgaaaatac gtcaacagac tacgtaacag ctccagcttc 729541 tgcttttccc gcttcgccag gtcgtgggtg gtcatcagct cgatgcgcgc aacgggcggc 729601 agggcatccc agtccagcac cactttggtc tccaacggac gccgtccgcg ccgccagaac 729661 cgccatcggc gcggctgctc gggtcggtcg tagtaggtca ccgtgcccgg gaagcgagac 729721 tcgatgcccc gcccgatctc ggcgcgatcg agcgctgaat cccacaccat ccgccactcc 729781 tgaccgggag ccagcatcgg caactcttgg ggcagccgaa gttccacgac atcggcgtag 729841 ccgttggcgg cattctcgta ttgggccacg gttggtgggt tggggaacga gaaccggacg 729901 tcgtaggcgg ctgtgcgacc gaagttgcgg actaccagct cgatcacgtg ccagtccgcg 729961 acgtggggct ccataaacat ggccacgtag ggccgagtct gctccgcagc cagtcgacga 730021 ttgcgttgga tttgccgctt ggtcaccacc agggcgacca caccgagccc aagcgccgcc 730081 cacgcggccc acgccaacca ggtgccggag tcgacgccgg tgacctcatg ccagctgctc 730141 aggacccacc ccatggaatc caccatccgc ttataccaca gtgacatcgg accgagaagt 730201 tagctgacag gatcccagag gcgcctgggc actggtcgct ggctgccgaa tcgttggcgg 730261 aagcgccgct ggacacgtcg ctggacccgg gccggaacgg gagaggcttg cccagtcctt 730321 cagccgccca tcaacattcg ccattgatcg agacttgcgg ggcgataaac gtaattggaa 730381 cgcttgacct ccgacagcga cgcacttggc tcggccgaat accagtgccc gggaaagacg 730441 gttgggtcac ccggaagctc ggcgagctgt cgcaggctgc ggtacatctc gtcggaatca 730501 ccgccgggaa agtctgtgcg tccacagcct tccaggaata gcgtgtcacc ggcgaccagc 730561 cggccgtcga gtagaaagca ctgactgcct ggggtatgcc cgggtgtgtg cagcagctcg 730621 atgtcgatgt cgccgacgct gaccttgtcc ccatgctcat gggtgatcag gtcgccgaca 730681 ggaatcccag tgactcgcga aacccacagc gcttcatggg tgttcacgtg cacgggtaca 730741 gatgcccgct ccagcagctc agccagtccc ggcagctgaa aacccatcat cgagccgccc 730801 acatggtctg gatgatggtg ggtcaccagc acacccgata gctgcatatc gtcggattcg 730861 agcgcgtcga gcagatcccc ggcagcgtag gccgggtcga ccaccacgca gtccccggtt 730921 gtgcgatctc cgatcaggta ggcaaagttg cgcatttgcg tcgcgaacat gtcgccgacg 730981 gcgaaatcgc gaccggagag cagttgacgg aagtacagcc ggtccttgga cacgcaacca 731041 gcctatgtct tgtccatcgc cgcccagacc gcgtcttggc gtttgcagcc cgggacacgt 731101 taatgcggag tcttggggtc tgactgtggg tgcggtgggt atctttggtc catgctgaag 731161 agggtcgaga tagaggttga tgacgacctt atccaaaagg tcatccggcg gtaccgtgtg 731221 aagggtgcgc gcgaggctgt caaccttgcg ctgcgaacgt tgctcggcga ggcggatacc 731281 gcggagcatg ggcacgatga cgagtacgac gagttcagcg atcccaatgc ctgggttccg 731341 cggcggagcc gcgacacagg gtgatcccgt ccaatcttgg acgacttggt ccgtagctgc 731401 atgggtggca ccggtggttt ggtggcgttg cgcgccaggc tgtaccctct tttaggcccg 731461 cggcacgacc cgactggtcg ctacgggtga gcggccccct tagctcagtc ggcagagcgt 731521 ttccatggta aggaaaaggt caacggttcg attccgttag ggggctcggc ggacgccggg 731581 caggctggcg gtgcgtacca gaggcgatgt agctcagtcg gttagagcga acgactcata 731641 atcgttaggt cgccggttcg agtccggcca tcgctacaac acaacagcaa gactcgttag 731701 agagaacgga tatggcttcc agtaccgacg tgcggccgaa gatcactttg gcatgcgagg 731761 tgtgcaagca ccgtaactac atcaccaaaa agaaccgccg caacgacccg gaccggctgg 731821 agctgaagaa gttctgcccg aattgcggca aacaccaggc gcaccgcgag acgcggtaac 731881 cgccgacccg cgagcagttg ctgagactga ctaggtaggt tctacagccg tggcgttgag 731941 cgcagacatc gttgggatgc attaccggta tcccgaccac tacgaggtgg agcgggagaa 732001 gattcgcgag tacgccgtcg ccgttcaaaa cgacgacgcg tggtatttcg aggaggacgg 732061 cgccgccgaa ctcgggtata agggcttgct ggctccgttg acgtttatct gtgtgttcgg 732121 ctacaaggcc caggcggcgt tcttcaagca tgcgaacatc gcgaccgcgg aggcgcagat 732181 cgtccaggta gaccaagtgc tgaaattcga gaaaccgatc gtggcgggcg acaagctgta 732241 ctgcgacgtc tatgtggatt cggtgcgtga ggcgcacggc acccagatca tcgtgaccaa 732301 gaacatcgtc accaacgagg aaggtgacct cgtgcaggag acctatacga ccctggcggg 732361 ccgtgccggc gaggatggag agggattttc tgatggcgct gcgtgagttc agctcggtga 732421 aggtcggaga ccagcttccg gagaagacct acccgctgac ccgccaggat ctggtgaact 732481 acgccggagt ttcgggtgac ttgaacccga ttcactggga cgacgagatc gccaaggtcg 732541 tcgggctgga caccgcgatc gctcacggca tgttgacgat ggggatcggc ggtggctacg 732601 tcacatcctg ggttggcgac ccgggcgcgg tcaccgagta caacgtgcgg ttcactgcgg 732661 tggttccggt gcccaatgac ggcaagggcg ccgagctggt gttcaacggt cgggtgaaat 732721 cggttgatcc tgagagcaag tcggtgacca tcgcactcac cgctactacc ggcggcaaga 732781 agattttcgg gcgggccatc gcctcggcga agttagcgta gtttatggcg ctcaagaccg 732841 atatccgcgg gatgatttgg cggtacccgg actacttcat cgtgggccgt gagcaatgcc 732901 gcgagtttgc ccgagctgtc aagtgcgacc acccggcctt tttcagcgag gaagcggccg 732961 ccgacctcgg ttacgacgcg ctggttgctc cgctgacctt cgtgacgatc ctcgccaaat 733021 atgtgcaact ggacttcttc cgccacgtcg acgtgggcat ggagacgatg cagatcgttc 733081 aggtcgacca gcggttcgtg ttccacaaac ccgtgctcgc cggggacaag ttgtgggctc 733141 ggatggacat ccattcggtg gacgagcggt tcggcgcaga catcgtcgtt accagaaacc 733201 tctgcaccaa cgacgacggt gagctggtca tggaggccta caccacgctg atgggccagc 733261 agggtgatgg ttccgccaga ctcaaatggg acaaggaatc cgggcaggtc atcaggaccg 733321 cgtaattagc aactggccgc tgcggccatg tacactcgga cctcggggtt ttcccaacat 733381 cggcgcgctt tccgtgagtt caacgagcgg agtgtcgtct ccactttcgg ttcgcgatca 733441 ccgaacggag ggcgcgcgtg tcatgtgagc cccggcgtag tgggttggcc agggcctggt 733501 ctggtcttgc ctgccaaccg cgaaggggcg tagctcaact ggcagagcag cggtctccaa 733561 aaccgcaggt tgcaggttca agtcctgtcg cccctgctga aggcgaacgt tcgacgacga 733621 tgcaggcacg gcctgaagag gagacggacc ataggtatgt gccatggtgg acactggaag 733681 gtgccccacc agagcggaac ggctcgcggg gtagctagta aacgaaggag catgcggtga 733741 gcgacgaagg cgacgttgcc gacgaggccg tagccgacgg cgccgagaat gcggacagcc 733801 gcgggagcgg tggccggacg gccctggtga caaagccggt ggtgcggccg caacgtccca 733861 ccggcaagcg gtcgcggtcg cgtgcggcag gagccgacgc agacgtcgac gtcgaagagc 733921 cgtcgaccgc ggcttcggaa gctaccgggg tcgccaagga cgattcgacc accaaggccg 733981 tgtcgaaggc tgccagggca aaaaaggcca gtaaaccgaa ggcccggtcg gttaacccga 734041 tcgcattcgt ctacaactac ctcaagcagg tcgttgccga gatgcggaag gtaatctggc 734101 cgaaccgcaa acaaatgctt acctacacgt cggtggtgct ggcgtttctg gccttcatgg 734161 tggcgctggt cgccggtgct gacttgggcc tgaccaagct ggtgatgttg gtgttcggct 734221 gaggctcgag agtgacagag aggactgaaa accgtgacta ccttcgacgg tgacacgtcc 734281 gcgggtgagg cggtcgatct aacagaggcc aacgccttcc aggatgcagc ggccccggct 734341 gaagaggtcg atccggccgc cgcgctcaaa gcggagctgc gcagcaagcc cggcgactgg 734401 tacgtcgttc actcctacgc agggtacgag aacaaggtca aggccaacct ggaaacccgg 734461 gtgcagaacc ttgatgtcgg cgactacatc ttccaggtgg aggtgcccac cgaagaggtc 734521 accgagatca aaaacggcca acgcaagcag gtcaaccgta aggtgctgcc cggctacatt 734581 ctggtgcgga tggacttgac cgacgactcc tgggccgcgg tgcgtaacac gccgggggtc 734641 acggggttcg ttggggcaac atctcgcccg tcagcgctcg ccctcgacga cgtggtgaag 734701 tttctgcttc cgcgggggtc gacgaggaag gctgccaagg gtgcggccag cacggctgcc 734761 gccgccgagg cgggcgggct agagcgtccg gtcgtcgagg tcgactacga ggtgggcgaa 734821 tcggtaaccg tcatggacgg gccgtttgcc acattgccgg ccacgatcag cgaggtcaac 734881 gccgaacagc agaaactcaa ggtgctggtc tccatcttcg gccgcgaaac accggtggag 734941 ctgacctttg gccaagtctc caagatctag cccagcaggg caggccacac aggctgaaac 735001 aaggaaggac atcgacacgt catggccccg aagaagaagg tcgccgggtt gatcaagctg 735061 cagatcgtgg cgggccaggc caaccctgcc ccgccagtgg gccccgcgct cggtcagcac 735121 ggcgtcaaca tcatggagtt ctgcaaggcg tacaacgccg cgacggagaa ccagcgcggc 735181 aacgtcatcc cggtggagat caccgtttat gaagaccgta gcttcacttt cacgctgaag 735241 acgccgcccg ccgccaagct gctgcttaag gccgctggtg tggcgaaggg ttcggcggag 735301 ccgcacaaga ccaaggtcgc caaagtcacc tgggatcaag tccgcgaaat cgccgagacc 735361 aagaagacgg acctcaacgc caacgacgtc gacgctgcgg ccaagatcat cgccggtacc 735421 gctcggtcga tgggcatcac cgtcgaatag ggccctaccc gtgggagggc cagcttcggc 735481 ccgctgagta accacgaccc atagattgga tatcaaatga gcaagaccag caaggcatat 735541 cgcgccgccg ccgcgaaggt ggaccgcacc aacctctaca ccccgctgca ggcggccaag 735601 cttgccaaag agacctcgtc gaccaagcag gacgcgaccg tcgaggtggc gatccggctt 735661 ggcgtcgacc cgcgtaaggc agaccagatg gttcgcggca cggtcaacct gccacacggc 735721 actggtaaga ctgcccgcgt cgcggtattc gcggttggtg aaaaggccga tgctgccgtt 735781 gccgcggggg cggatgttgt cgggagtgac gatctgatcg agaggattca gggcggctgg 735841 ctggaattcg atgccgcgat cgcgacaccg gatcagatgg ccaaagtcgg tcgcatcgct 735901 cgggtgctgg gtccgcgcgg cctgatgccc aacccgaaaa ccggcaccgt caccgccgac 735961 gtcgccaagg ccgtcgcgga catcaagggc ggcaagatca acttccgggt tgacaagcag 736021 gccaacctgc acttcgtcat cgggaaagcg tcgttcgacg agaagttgtt ggcggagaac 736081 tacggcgcgg cgatcgacga ggtgctgcgg ctcaagccgt cctcgtcgaa gggccgctac 736141 ctgaagaaga tcaccgtgtc gacgacgacg ggcccgggca ttccggtcga cccatccatc 736201 acccgcaact tcgcggggga gtagtttccc cggcgagcag acgcataagc ccccgcacgc 736261 acggcgtgtc gggggcttat gcgtctgctc gccgggctta ggccgcggca cccggcttga 736321 ggtaggtcac caggctgcag tcgagcatct cgtcggtgaa gtagtgctcg cagccacgca 736381 aatacttcat gtagcggttg tagacctctt cggaggtgac ctcgatggcc ttgtccttat 736441 tggactgcag cgtgtccccc cagatccgca gcgtcttgat gtaatgcggg cgcaacgaga 736501 gcggctccgg gacggtgaaa ccggccttct cgccgtgttc gaccatcatc tcggtggacg 736561 gcaggcggcc gccgggaaat atctcggtga cgatgaactt gatgaaacgc gccgtctcga 736621 agctcagctt cttaccgcgg gccgccatct cgtaggggtg gtagctgacg ctgctctgga 736681 cggtcatccg gccgtcggcg ggcatgatgt tgaaacaccg cttgaagaag tcgtcgtagt 736741 tctcgtgccc gaagtgctcg aaggcttcga tcgacacaat ccggtcgacg ggttcggcga 736801 aatcctccca gccttgcagc agcacttgac gtgagcggtt ggtgtcgatc gaagccagca 736861 cttgctcgca gcgggcgtgc tggttcttgg acaacgtcag gccgatgacg ttaacgtcga 736921 accgctcgac ggcgcgcctc atggtggtgc cccaaccgca cccaatgtcc agcagcgtca 736981 tgcccggctt gaggtccagc ttgtccaggt tgaggtcgac cttggcgtat tgggcttctt 737041 cgagcgtgag ctccggtggc tcgaagtagg cacagctgta agttcgggtc gggtcctgga 737101 acagggcgaa gaaatcatcg gagacgtcgt agtgcgcttg gatgtcttcg aagcgtgtcc 737161 gtgtcttggt tgggctaatc ggtttctcgg ccattctcgt catgttctcc tggatggtgt 737221 cagttaccgg tggctgtgca cccatagccc gtcggtggca cgaaagtcta cttggccagc 737281 gtgaactggt tgcagtcgat gtagcccatt cggaacgcct tggcgcagcc ggtcaggtat 737341 ttcatgtacc gctcgtatac ctctgcggac tggatctcga tggcctcgtc cttgtgcgct 737401 tgaagcgcct cggcccacag gtcaagagtc ctggcgaagt gcggttgcag ggactggata 737461 tcggtaatgg tgaaaccggc cttcgtcaca tgctcctcga tcgtctcgat cgtcggcaac 737521 cggccgcccg ggaagatgtc ggtcacgatg aaccggatga atttggccat ctccatggtc 737581 aacggtatgc cgcgctcgat gacctgcttt acgtgcaagc cggtgatcga gtgcagcagc 737641 atcacgccgt ccgcgggcat cgcgttgtag gcgaacttga agaagtcatc gtagcgctcg 737701 aaaccgaagt gctctatcgc ttcgatggtt acgatgcggt ccaccggctc gctgaagttg 737761 gcccagtcgc tcagcagtac ccggtgcgag cggttggtgt cgaccttgtc gagcacttgc 737821 tggcagtagg cgtgctgatt tttcgacagg gtcaagccga cgacgttgac gtcgtagcgc 737881 tcgacggcac gcttcatgac cgaaccccag ccgcagccca cgtcgagcag tgtcattccc 737941 ggttctagcc ccagcttgcc cagggttagg tccagcttgg cgacctgtgc ttcgtgcaag 738001 gtcatgtcgt cgcgctcgaa gtaggcgcag ctgtaggtcc gagtcggatc ctggaacagc 738061 gcgaagaacg catctgaaag gtcgtaatgg gcttggacgt cgtcgacatt ggaccgagac 738121 tttgtggtgc ccgttgagtt atcagacatg tgtcctccca ctgtgagggg caccttcagc 738181 aggtggccat ccccggcacc ctacacggtg catggcacat cgcccgcatt cgcgctcgca 738241 tgcgccggtc tttctcgatc gggatttgcc agatatcacc ctggccggcg caatcactac 738301 ttcgccagcg tgaactggtt gacgtcgatg tagccgaccc ggaacagctt ggcgcagccg 738361 gtcaggtatt tcatgtaccg ctcgtagacc tcttcggact ggatcgcgat ggcctcgctt 738421 ttgtgttcct gcagcgcctc ggcccacagg tcgagggtcc tggcgtaatg cggctgcagc 738481 gactggcggc gagtcagcgt gaaacccgtc ttcgccgact gttcctcaac catttcaatc 738541 gtcggaggtt ggccccccgg gaagatttcg gtcgcgatga acttgagaaa gcgggccagc 738601 cacaacgtga gcggcaagcc gtggtcgacc atctgctgcc tggtcaggcc ggtgatcgtg 738661 tgcagcagca acacgccatc gggcggcagg attttgtggg cccgggcgaa gaagtcggcg 738721 tgacgatcgt ggccgaagtg ctcgaacgcg ccgatcgaca cgatgcggtc gacgggctcg 738781 ttgaactgct cccatcccgc cagcaacact cgcctgtcgc gcggggtgtc catctcgtcg 738841 aacgacttct gcacatgggc ggcctggttc ttcgacaatg tcaggccgac gacgttgacg 738901 tcatactgcg cgatcgcgcg ccgcatggtg gcgccccagc cgcaaccgat atcgagcagc 738961 gtcatgccgg gctgcagacc tagcttgccc agcgccaggt cgatcttggc gatctgggcc 739021 tcttccagcg tcatgtcctc gcgttcgaaa tgcgcgcagc tgtaggtctg ggtcggatcc 739081 aggaacagcc ggaagaagtc gtcggacagg tcgtagtgtg cctgcacgtc ctcgaagtgc 739141 ggcgttaggt cgttgaccat gaggtgtaat gcctttccgg accctaggtg gcctttcggt 739201 gcttgcacgg aacgcaccga tgcttccccc tccccgcatg ctcgaggcat gctatccgat 739261 acagggccgc cgcactaaac cgcgatcgaa tttgcccagg tcagggaacg gatatgagcg 739321 gacgagctac ttggtcatgg tgaactgggc gacgttgatt aggcctctgc ggaagcgctc 739381 cgcgcatccg gtcagatagt gcatgaagtt gttgtagacc tcttcggact gtacggcgat 739441 ggcgcgttcg cgggcagcct gtaggttggc ggcccatgca tcgagagtcc gtgcgtagtg 739501 ctgctgcagc agctggacat gctcgatggt gaagcccgcg gcctgcgcat tgtcgacaat 739561 gtcgggctcc gatggcagct cgccgcccgg gaagatcgac tcccgcagga atttgaggaa 739621 tcgaaggtcg ctcatcgtca gcgcaatgcc ctgttcgtgc agccacctgc ggtcgtaggt 739681 gaacaggctg tgcagtagca tccgcccgtc atcgggcagg atgtcgtagg agcgttcgaa 739741 gaacgtcaga taccgctcct ttttgaacgc gtcgaatgcc tcaaagctga cgatccggtc 739801 gacgttctct tcaaactctt cccagccctg cagccgggcc tcggcgcgcc gttgcgttcc 739861 gattgcggcc aggcggtctt tgctgcgttc atagtgattc cggctgagcg tgaggccgat 739921 gacattgacg tcgtacttct ccacggcccg aacgagcgcc ccgccccacc cgcaacccac 739981 gtcgagtagc gtcatccccg gttcgaggtt cagcttgtcc aacgccagat ccaccttggc 740041 cagttgcgcc tcttccagcg tcatatcgtc acgctcgaaa taggcgcagg tgtagaccca 740101 ggtgggatcg aggaacaacg cgaagaagtc atccgaaatg tcgtaagccg actgtgactc 740161 ttcgtaatat ggtctcagct tggccatagg cgacaacctc ccgcgccaac cgtacaacgc 740221 ctcgccgacc ggctcagccg gcctcagaga agttgcgcgt caactcgccg atcacccgat 740281 cccacagctg tctgggcagg tcatggccca tgccgtcgat gagcaccagg cgcgcgccgt 740341 tgattgctcg cgcgaccgcg cggccgccga acggccgcat cagcttgtcc gcgcgcccgt 740401 ggatgacgac ggtcggtgcg acgatgcgcc ggtcgtagcg cagcaggctg ccgctgccca 740461 gtatcgcgct gaactgctgg gcgattcccc agggatggaa gttgcggtcg tagctttcgg 740521 cggcctcggc tcgtacctgg tcttcgggaa tcgggtaggc cgggctgccg atgatcttgc 740581 tgacccggac ggcgttgtcg acaatgacgt cgcgtggcga atccggcggc ggacccgtga 740641 gcagcgccag cagcgcgcgt ggcgccggcg gtggcagaaa ccggtgattg ttgctggaga 740701 agatgaccgc cagggttttc gtccgctgcg cgaatcgcgc ggcgaaaatc tgggcgatca 740761 tgccgcccat cgacgccccg acgacgtgcg cgtgcttgac gtcgaggtga tcgagcaacg 740821 ccgcggcgtc ggcggccatg tcttccaacg tgtaggcagc ctggctgggc agaccgagcc 740881 aggaccggac caaccgcgtg gccagtggct gtcccgggcg gtggcgctcg gtcttggtgg 740941 acaggccgac atcgcggttg tcgtagcgga tgacgcgcag gcccttcgcg acgagccgcg 741001 cgcagaagtc ggtccgccac agcagcatct gggcgcccag gcccatgatc agcaacaccg 741061 gcgggtggtc gaggtcaccc atgtcctcgt agtacagctt cacatcaccg gagaccgcgg 741121 tgccgctacg gatgtccacc gagacctcgc ctaaacctcg atgtcggatt gatgttcgcg 741181 gctgacctcg accatgaagt tggcgaaata tccggtcagc tgcgggtccg acatcatctg 741241 ccacctcggc gccagcagct tcatgtagcg ctccacgtac aggaactgct tgccgatcag 741301 caccagctcg cggggcagct tgacgtcgta ggcgtcggcc agcgccgaga gctggcggcc 741361 gatgtcggca tatgacatgt cgcccagcga ttgcatggtc agcggggtgg cgaagcgctc 741421 caggtctttg gcggcctggg tctcgggctt catggtgccg acggcgccca tgagcacgac 741481 gatcttgccg gcggctgcgt ggtccttctt caccagcagc gcatacacca gctcgcggag 741541 tagccagcgg gtgcgtggat cgatgcggcc catgatcccg aagtcgaaga acacgatgcg 741601 gcccgcctcg tcgacgtaga ggttgcccgc gtgcaggtcg ccgtggaaca gcccgtgccg 741661 caggccgccc tcgaacaccg aaaacagcag tgccttgacc agctcgacac cgtcgaaccc 741721 ggccttgcgg atcgcggcgg cgttgtcgat gcggatgccg tgcacccgtt ccatcgtcaa 741781 cacccgctcg gtggtgaagt cccagtgcac ctgcggcacc cggatgtttt tgcccagcgg 741841 cgaggcgtgt aggtgggaga cccaggcctc catggactgc gcctcgaggc gaaagtccag 741901 ctcctcggcc aggttgtcgg cgaagtcggc gaccacgtct tgtgccgaga gccgccggcc 741961 cagcttggcc agttcgacgg tctgcgcgaa gcgcttgagg atctgcaggt cggcggcaac 742021 gcggcggcgg atgcccggcc gctggatctt gaccaccacc tcctcgccgc tgcgcagggt 742081 cgcgtagtgc acctgggcga tggacgccga cgcgaacggc tcttcctcga aggaggcgaa 742141 cagccgggcc ggctcgtcgc cgagttcctc gacgaagagc ttgtgcacct cgtcggtttt 742201 tgcgggcggc acccggtcga gcaggccgcg gaattcccgc gacagcgact caccgaatgc 742261 tcccgggctg gacgcgatga tctggccgaa cttcacgtat gtcggtccca gatcggcgaa 742321 ggtctgcggg agctccttga tcaccttctg ttgccagggc ccttttcggg ggagcctgcc 742381 gatgaaccgg acggcggtgc gggtgacctg ccaaccggtg gccgccaccc gggcagcttc 742441 gaccggcagc ggtacccggt caagcttggc cacctcgcgg tgtgtggtgg aacccatctg 742501 agcagtgtgc caaaccgggg cagacagctc ccaattgacg tgagcccgct cacttgctgg 742561 gtaagcgtcg ccgaatgtgt aatgagggcg gaaatccggc ccgatttccg ccctcattac 742621 acattcggcg acgcgtggac tacctcaagc cgtactggga tacccacccg caggaccgcg 742681 ccgacctgcg ccggttcctc gccgatggcc gtatcgaagt gatgggcgga acctacaacg 742741 aacccaacac caacctcacc agcccggaga ccaccatccg aaacctggtg cacggcatcg 742801 gttttcagcg tgacgtgctg ggcgccgagc cggccaccgc gtggcagctc gacgtgttcg 742861 gccatgaccc gcaatttcct gggctggccg ccgatgccgg gctgacgtcg agttcctggg 742921 cccgcgggcc acaccaccag tggggtccgg cccaaggcgg ggtagaccgc atgcagtttt 742981 gcagcgagtt cgagtggatc gcgccgtcgg gtcgcggcct gttgacccat tacatgccgg 743041 cgcattattc ggcgggctgg tcgatggact cgtccacctc gctggccgac gctgaggccg 743101 ccacctacgc gctgttcgac cagctcaaaa aggtcgcgct gacccgcaac gtgctcctgc 743161 cggtgggcac cgactacacc ccgccgaaca agtgggtcac cgccatccac cgcgactggg 743221 gtgcgcgcta cacctggccg cgcttcgtgt gcgcgctgcc caaggagttc ttcgccgcgg 743281 tgcgcgccga actggccaag cgtggttggg tgccgtcgcc gcagacccgc gacatgaacc 743341 cgatctacac cggcaaggac gtctcctaca tcgacaccaa acaagccaac cgggccgccg 743401 agaacgccgt cctggaagcc gagcggttcg cggtgttcgc cgcgctgctg accggcgccg 743461 agtatccgca ggcggcgttg gccaaggcgt gggtgcaact ggcctacggt gcgcaccacg 743521 acgccatcac cggctcggag tccgaccagg tctacctcga cctgctgacc gggtggcgtg 743581 acgcgtggga gctgggccgc gcggcccggg acaactcgct gcggttgctg tccggcgcgg 743641 tcgccgcgtc gcacgatcgc gtcgtcgtgt ggaacccgct gacccagcgg cgcaccgaca 743701 tcgtcactgc cagggtcgac ccgccgctgc aggccggcgt gcgggtgttc gatcccgacg 743761 gggctgaggt ggccgcgctc gtcgagcacg acggacggtc ggtcacctgg ctggcgtgcg 743821 acgtgccctc gctgggctgg cgggtttacc ggttggtgcc cgccgacgag gcgccaggct 743881 gggaattggt acccggcacc gacatcgcca acgagcacta tcggctggcc gtcgaccccg 743941 agcgtggcgg ggcgttgtcg tcgctggtgc aggacggccg ccagctgatc gccgccggcc 744001 gggtagccaa cgagctggcc ctctacgagg aatacccgtc gcacccgact cagggggagg 744061 gtccgtggca tctactgccc acggggccgg tggtgtgctc ctcggcatgc ccggcgcagg 744121 tgcaggcata ccgcggcccg ctcggtcagc ggttggtcgt gcgggggcgg atcggcaccc 744181 tgctgcgcta cacgcagaca ctcaccttgt gggacggcgt cgaccgggtg gactgccgca 744241 ccagcatcga cgagttcacc ggggaagacc gcttgctgcg gctgcgctgg ccgtgtccgg 744301 tacccggcgc catgccgatc agcgaagtgg gggacgccgt cgtcgggcgg ggtttcgcgt 744361 tgctgcacga ggggcccgaa tcggtggaca ccgcccagca tccgtggacc ctggacaacc 744421 cggcctacgg ctggttcggg ttgtcctcgg cggtgcgggt acgcgccggc gatggggtgc 744481 gcgcggtgtc ggtggccgag gtggtgtcgc cgacggagac ggtgtccggc ccgatggcgc 744541 gcgacctgat ggtcgcgctg gtccgcgcgg gcgtcaccgc gacctgcagc ggcgccgaca 744601 agccgcgcta cggccacctc gatgtcgatt ccaatctgcc ggacgccagg atcgcgctcg 744661 gtgggccgga ccgcaacacg ttcaccaagg ccgtgctggc cgaggccgcc ccggcctaca 744721 ccgccgaact gcagcggcag ctggcgaaga ccggcacggc cagggtgtgg gtgccggccg 744781 cgaacccgtt ggcgcgggcc tggctgcccg gcgcggactt gcgggcaccg tgcgcgctgc 744841 cggtgctggt gatcgacggc cgagacgaga agcacctgcg cgccgcggtg gcgtcgctgg 744901 ccgacgacct ggccgacgcc gagatcgtcg tgcaccagcg ggccgcgccg caaatggagc 744961 cgttcgagga tcgcacggtc gcgctgctca accgtggggt gcccagcttc gccgtcgact 745021 ccgagggcac cctgcacacc gcgctgatgc ggtcgtgcac cggctggccc tccggggtct 745081 ggatcgacca gccgcgacgc accgccccgg atggctcgaa tttccaactc cagcactgga 745141 cccaccactt cgactacgcg cttgtctgcg gcggcggcga ttggcggcgc gccggcatcc 745201 cggcgcgcag cgcgcagttc tcccacccgc tgcttgcggt ggcgccgcga cggccacagg 745261 gcgagctgcc ggcggtcggc tcgctgctgc acgtcgagcc ggccgactcg gtgcagctgg 745321 gcgcgctcaa ggcggccggc aacccgctgg cagccggcag cgcgcggccg gtccaacccg 745381 ccgcggtggc gctgcgattg gtgcaaacga caggagccga caccccggtc accatcggct 745441 gcgagctggg caaggtaggc gccctccggc cggccgacct gctggaaacg ccgctcgcaa 745501 tggcaagggc gcgcaagtcg tccatcgacc tgcacggcta tcaggtcgcc accgtgctgg 745561 cccggctcga cgtggccgct gatatggcta acgtgctggc ggccgacgac gtggcgttgg 745621 cgccgcacgc cgagaccgct cagccgcagt acgcgcgcta ttggctgcac aaccgcggcc 745681 cggcgccgct gggcgggctg cccgcggtcg cccacctgca cccgcggcgg gtgcgcggcc 745741 agcccggtga cgacgtggtg ctgcgcctga ccgcggccag cgactgcacc gattcggtgc 745801 tgggcggcgt ggtcgacgtc gtgtgtccgc tcggctggcc ggccacaccg gctcggttgc 745861 cgttcacgct gggcgccggg gcgcacctgc aggccgacat cgcgttgagc attcccgccg 745921 gcgcgccgcc gggaccgtat ccggtccgcg cgcagctgcg cgtcgtcgac acggcggtac 745981 cggccgcctg gcgccaggtg gtcgaggacg tgtgcgtggt caccgtcggc gccgactccg 746041 atctggagga gctggtctac ctcgtcgatg ggccggccga catcgagctg gccgccggcg 746101 accgggcccg gctggcggtg acgatcggca gccgcgctca cgccgagctg gccctggatg 746161 cgcactcgat cagcccctgg ggcacctggg agtggatcgg cccgcccgcg ctcggcgccg 746221 tgctacccgc ccggggcatg gccaagctgg ctttcgatgt gaccccgccg gcctggctgg 746281 agcccgggca gtggtgggcc ctggttcggg tcggttgcgc gggtcagttg gtctattcgc 746341 cggcggtgaa ggtgagcgtg acatgagcgg gcgaagccga ttgcccggct cctcctcacg 746401 ccgcgacgcg gcgcgcatcg tcgccgagcg ggtggtcgcg accgtcgccg gtgtcgcggt 746461 agcggtcgac gaggtcgacg cggccgaagc gcggctgcgc gacggaccgc gcgcggccgc 746521 gctgccggcg agcggcacca gcgagggacg ccaactgcgg cgctggctca cccaactgat 746581 cgtgaccgag cgggtggtag ccgccgaggc cgccgcacgt ggtctgaccg cggcgggcgc 746641 ccccgccgag gcggacctgc tgcccgacgc gacggctcgg ctggagatcg gcagcgtcgc 746701 cgccgcggtg ctggcggatc ctttggcgcg ggcgttgttc gccgccgtca ccgcgcgggt 746761 cgcggtcacc gacgacgccg tggccgacta ccatgcccgc aacccgctgc ggttcgccgc 746821 gccatgtccc ggccagcacg gctggcgtgc cccggcggcg gccgccccac cgctggatca 746881 ggtgcgccgc gcgatcaccg agcatctgtt gggggccgcg cgccgccgcg ccttccgggt 746941 gtggctggac gcgcgccgga acgccctggt ggtgctggcc cccggctatg agcaccccgg 747001 cgacccgcgc caacccgaca acacccgccg gcactgatgc tcaccctttg cctcgacatc 747061 ggcggcacca agatcgccgc gggcctggcc gacccggccg gcacgttggt gcacaccgcc 747121 caacgtccca ccccggcgta tggcggagcc gaacaggtct gggccgcggt cgccgagatg 747181 atcgccgacg cgctcggcgt ggcggggggc gcggtcggtg gtgtggggat cgcctcggcc 747241 ggtcctatcg acctacacag cggccgcgtc agcccgatca acatcggatc ctggggcggc 747301 tttccgctgc gggatcgggt cgccgccgcg gtcccggggg ttccggtgcg gctggggggt 747361 gacggggtgt gcatggcgct cggcgagcac tggctgggag ccggacgggg tgcgcgcttt 747421 ctgttgggtt tggtggtgtc caccggggtg ggcggcgggt tggtgctcga cggcgccccc 747481 tgtctcggcc gcaccggcaa cgccggtcac gtcggccacg tggtggtgga tccggatggc 747541 tcgccgtgcc cgtgcggggg gcgtggctgt gtggagacca tcgcgtccgg cccgtcgctg 747601 gcgcgctggg cgcgggccaa cggctggtcc gcgccgcccg gggccggcgc caaagagctg 747661 gccgaggcgg ctggggccgg agacccggtg gcgctgcggg ccttccgccg cggcgccgcg 747721 gcgctggccg cgatgatcgc ctcggtgggc gccgtgtgcg acttggatct cgccgtcatc 747781 ggcggcggcg tggccaagtc gggtcgcctg ctgttcgagc cgttacgtgc ggcgctagcc 747841 gaccacgccc ggctggactt tctggccggc ctgcgggtgg tgcctgccga gctgggcggc 747901 gccgccggcc tggtgggtgc ggccaggctc gcggccatcg cataatgccg attgtgaatc 747961 tggcgacgcg acacgccggt gcggcgtcgc gggattcaca ctcggcgata cgtgtcgccg 748021 ttttggctga ccggaccggg ccaggctatt gtggttgccg atccaccgaa gaccgtcggt 748081 caccgagcaa tcggttgaag gtccgggagc atcccggcga cccacgcagg aggacgaggc 748141 agcaccgccg gcgcgcgccg gcctagttcc acgccccgac cgcttcctgc gtcggggcgt 748201 tcgtcgttcc cgggtggtcg cagacggcac gtcgtacccc gactgccacc agacttgcac 748261 cgtcaggagg tatgcatggc cagggctgac aaggccaccg ccgtcgcaga catcgcagcg 748321 cagttcaagg agtcgaccgc gacgttgatc accgaatacc gcggcttgac ggtggccaac 748381 ctggccgagc tacgcaggtc tctgacgggg tcggcgacct acgcggtggc caaaaacaca 748441 ctcatcaagc gggcggcctc cgaggccggc atcgagggcc tcgacgaact gtttgtgggc 748501 cccaccgcga tcgcgttcgt caccggtgag ccggtcgacg ccgccaaggc catcaagacc 748561 ttcgccaagg agcacaaggc gctggtcatc aagggcggct acatggacgg ccacccattg 748621 accgtggccg aagtcgagcg catcgccgac ctggagtccc gcgaggtgtt actggccaag 748681 ctggccggtg cgatgaaggg caacctggcc aaggcggccg ggttgttcaa cgcgccggcc 748741 tcgcagctgg cccggctcgc ggccgccctg caggaaaaga aggcctgccc aggcccagac 748801 tcagccgagt agtcacccag taccccacac caggaaggac cgcccatcat ggcaaagctc 748861 tccaccgacg aactgctgga cgcgttcaag gaaatgaccc tgttggagct ctccgacttc 748921 gtcaagaagt tcgaggagac cttcgaggtc accgccgccg ctccagtcgc cgtcgccgcc 748981 gccggtgccg ccccggccgg tgccgccgtc gaggctgccg aggagcagtc cgagttcgac 749041 gtgatccttg aggccgccgg cgacaagaag atcggcgtca tcaaggtggt ccgggagatc 749101 gtttccggcc tgggcctcaa ggaggccaag gacctggtcg acggcgcgcc caagccgctg 749161 ctggagaagg tcgccaagga ggccgccgac gaggccaagg ccaagctgga ggccgccggc 749221 gccaccgtca ccgtcaagta gctctgccca gcgtgttctt ttgcgtctgc tcggcccgta 749281 gcgaacactg cgcccgctcg ggtgaatctc ccagcgcgac aagcaggttc accgtcatcg 749341 cggcgagcac cggttcgacg gccgcgcctc gatcgccgta gaagccggcc agctcgagca 749401 tcacgaagcc gtggatctgt gaccaaaact gcgccgcggt ggcaactatt gccgtgtcgt 749461 cgtcggctcc aagcgcggtc gcgaaccggc cggccagcag gcaccggtgc accgctcgca 749521 ccacatgcgc gaaactgggg tgctggtgtt cgatctcggc aaccttgagg gtcaacacgt 749581 cgcgcgctgg cacgttgatg ccgtgtgcgc tggtgctgcc gaacattagc cggtacatgt 749641 gcgggcgctc gatggcgtag cgccggtagg cggtgccgat ggccagcagg tcggcgaccg 749701 gatcggcggt ctgcgggacc gtcagcgcga catcgaactg gcgtagccct tcttcggcta 749761 tggcggcgat cagtccgcgc atcccgccga aatgggtgta caccgccatc gtcgaggtgc 749821 ctgctgcggc ggccaccttg cgggtctgca gcgcgtcggg cccgtgatcg tcgagcagtc 749881 gcacgccggc gtgcagcagc tcgtcgcgaa caccggtctg cgaggtcatc cttgccatgt 749941 tctcaccaag ggcgtaccgt tccaatatca gtgaaataac aatgttatag gagatcggca 750001 tgaccaccgc acaagccgcc gaatcccaaa acccatatct cgagggcttc ctggcgccgg 750061 tgagcaccga ggtaactgcc accgacctgc cggtcaccgg ccgcattccg gaacacctcg 750121 acgggcgtta tctgcgtaac ggccccaacc cggtcgcgga ggtcgacccg gccacctacc 750181 actggttcac cggcgacgcc atggtgcacg gagtcgcgct gcgcgacggg aaggcccgct 750241 ggtatcgcaa tcgctgggtc cgcacacccg cggtgtgcgc cgccctgggc gagcccattt 750301 cggcccggcc tcacccgcgc accgggatta tcgagggcgg tcccaacacc aacgtgctga 750361 cccacgccgg acgcaccctg gccttggttg aggccggcgt ggtcaactac gaactcaccg 750421 atgagctgga caccgtggga ccctgtgact tcgacggcac cctgcacggc ggttacaccg 750481 cccatccgca gcgtgatccg cacacgggtg aactgcacgc ggtgtcctac tcgttcgccc 750541 gcggacacag agtgcagtac tcggtgatcg gcaccgacgg acacgctcgt cggacggttg 750601 atatcgaggt ggcgggatcg ccgatgatgc acagcttctc cctgaccgac aactacgtgg 750661 tgatctacga cctgccggtg accttcgacc caatgcaggt ggtgccggcg tccgtgccac 750721 gctggctgca acggcccgcc aggttggtga tccagtcggt cctgggccgt gtccgcatcc 750781 ccgacccgat agcggcgttg ggcaaccgga tgcagggtca ctccgatcgc ctcccgtacg 750841 cctggaaccc cagctacccg gcgcgcgtcg gtgtcatgcc gcgcgagggt ggcaacgagg 750901 acgtgcggtg gttcgacatc gaaccctgct acgtatacca cccacttaac gcctactcgg 750961 agtgccggaa cggcgctgag gtgctggtgt tggacgtggt gcgctactca cggatgtttg 751021 atcgcgaccg gcggggtccc ggcggtgaca gccggccctc gctggatcgc tggaccatca 751081 acctggcgac cggtgcggtg accgccgaat gccgcgacga tcgggcgcag gagtttcccc 751141 gcatcaacga gactctggtg ggtgggccgc atcgcttcgc ctacaccgtc ggcatcgagg 751201 gtgggtttct cgtcggcgcc ggcgctgcgt tgtcgactcc gctgtataaa caggactgcg 751261 tgaccgggtc cagcacggtc gcctcgctcg atcccgacct gctgatcggc gagatggtgt 751321 tcgtgccgaa cccgtcggcg cgtgcagaag atgacgggat tctcatgggc tacggctggc 751381 accgcggccg cgacgaaggc cagctgctct tgctggatgc ccagactctc gagtcgatcg 751441 ccaccgtgca cctgccacag cgtgtgccga tgggcttcca cggcaactgg gcgccgacca 751501 cctgacggcg cctcgggtgc gatacagtga ctcataccac acaacgggcc ggtggcagcc 751561 acgagcgtcg acagaagggt ttcccatggg cgtcagcatc gaggtcaacg gactaacgaa 751621 gtccttcggg tcctcgagga tctgggaaga tgtcacgcta acgatccccg ccggggaggt 751681 cagcgtgctg ctgggcccat cgggtaccgg caaatcggtg tttctgaaat ctctgatcgg 751741 cctcctgcgg ccggagcgcg gctcgatcat catcgacggc accgacatca tcgaatgctc 751801 ggccaaggag ctttacgaga tccgcacatt gttcggcgtg ctgtttcagg acggtgccct 751861 gttcgggtcg atgaacctct acgacaacac cgcgttcccc ctgcgtgagc acaccaagaa 751921 aaaggaaagc gagatccgtg acatcgtcat ggagaagctg gccctagtcg gcctgggtgg 751981 ggacgagaag aagttccccg gcgagatctc cggcgggatg cgtaagcgtg ccggcctagc 752041 gcgtgccctg gtccttgacc cgcagatcat tctctgcgac gagcccgact cgggtctgga 752101 cccggttcgt accgcctacc tgagccagct gatcatggac atcaacgccc agatcgacgc 752161 caccatcctg atcgtgacgc acaacatcaa catcgcccgc accgtgccgg acaacatggg 752221 catgttgttc cgcaagcatt tggtgatgtt cgggccgcgg gaggtgctac tcaccagcga 752281 cgagccggtg gtgcggcagt tcctcaacgg ccggcgcatc ggcccgatcg gcatgtccga 752341 ggagaaggac gaggccacca tggccgaaga gcaggccctg ctcgatgccg gccaccacgc 752401 gggcggtgtc gaggaaatcg agggcgtgcc gccgcagatc agcgcgacac cgggcatgcc 752461 ggagcgcaaa gcggtcgccc ggcgtcaggc tcgggttcgc gagatgttgc acacgctgcc 752521 caaaaaggcc caggcggcga tcctcgacga tctcgagggc acgcacaagt acgcggtgca 752581 cgaaatcggc cagtaaggcg cgcggggatg cgaccgccgg accgccgcaa tcggatgatt 752641 tcgcgtaact tgccgcatat cacccggaga ccgaatcggg tcggccgctg gaggcggcgc 752701 ctgttcggga gctgatcacg caacgtttgt atctgctgcc gaccttccgt tggcggctcg 752761 cgtaggtggc acagtccgcg aagtgcttgg gccgctgatc aaggcgctcc cggagcacaa 752821 tccagacatg tcaggccgtc accgacgcac aggcgacggc cctcgagcag cgtgggaaga 752881 gccgggctcg tcgagtggat cacacatttc gaggcgctct cgtgtatcga gcggcacatc 752941 agccatgcgt gtctccttgt cctgccttct ccagaggaaa ccgctagtcg tcggcgctga 753001 cgaccctccg cactctgatg tcgggaaggt gacgctctgc gagttcgtag tcggcatcgt 753061 cgtggaggac tactaggccc ctggccgccg cagtgtcgca gatcagcaga tcgacaaccg 753121 acagggcacc caccgctccc gcccgggcga ggcggtgctg tgccgaatcg atccaccgcc 753181 acacggattt cggcactggc acatcggggt agacgtcacc aaacatccgg ctcatctggt 753241 cgaactcgtc cgcattccgc gctgatcggc agaactcggc tcgttgcggt tcgcacgacc 753301 cgacggcccg ctgagcagcg cggagttcca ggcctcggtg ggttccggtt gtcgttgcag 753361 ccgccaaacc gctgaggaat ccaccaggaa atagatcaaa tcccgagggc cttctcgtcg 753421 tcccgcgcgg ccacccagcc cttgtagtcc cagcctttcg cctactcgcg cgagcgggcc 753481 agggcctcga tgcgccgaaa ccgttcgacg taatcgcgca tcgcgaggtt cacggcttcc 753541 ttctttgtgt gcacggcggc gatgcgcatc acatcggcca gcgcttcgtc gtcgaggtcg 753601 atctgggtca ccgacacgac ggcctcctat gttgaagaca tatcacataa acatacgtaa 753661 ccaacatcgc gaggagaccg tctcgcgcct gctcagggca acgatatggc gccagtcaga 753721 ccaagcagca atacgatccc gggcaatagg ttggtcactt ggtgcgtgac gatgctggcc 753781 agtagaccgc cggaatagaa ccgtgccagc gcgatcggga tggccaccac caccagcagt 753841 ggagctcggg cgaactcgag atgggccaat gcgaagacca cggtggtaac caccagcgcc 753901 gcccaccgac cccagcgccg atccacagca ccccagagca gcccgcggta gatgatctct 753961 tcgcacagtg gcgcgacgaa caccacgacc agaaagacga ccagcgccca cggccaggac 754021 gcccgaacgc caccgaaaat ccttactaca gcggaattcg cttctggccc aacgatagcg 754081 gtgtagacca gcgacgccgg aatcgtgacc agcattccgc cgaaaccgaa catcaacccg 754141 agccgcagtc cgcgccacga ccagcgcagc cgcaagtcgg tgcggaggcc gttgccgcgg 754201 agcctggtga tgaggatggc cagcccggcg gcgaccaccg tgggggcggc tagcgcaagg 754261 gccagcaccc cggcagacac cgggccgtga ccggtaagga caaccgctaa cgaagtcgag 754321 gcgaccagga ataccagctc gacgaccaag aaggccccaa gtccccagcg gtgactgggg 754381 gctacggtat cggcacggcc cgcttccacg gctccgacgg tatcgaagtg tcaccgccac 754441 cggcgctgac gtcgagccgg cggacggccg gctgctacgc gcgcggtacc tcgtcgggcg 754501 gatcgggttc ggtcgatcga cgcgaaatat gttggcgact ggcaacttcc ggtgcttgcc 754561 acggtctcaa ctctcaccgc cgtgttgatc cggaccgacg gatgtgtcgc cgaaccgacc 754621 atatcgtggg cttgctgacg cgctcatcag tcggttcggg tggccacgtg caaccagccc 754681 ccgctcaaca ccccgtgctc gcccggagtg tttgacaggc ttcgtgcagg cgggccgggg 754741 acagccgggt gatgcggcgt cggaatgcgg tgcgtggcaa cgtatgaatg ttgtcgaagt 754801 tgacgacgca gtcgctcgga acacggtttt cgacggccgt gagctccaat tccgacacca 754861 ggcctcggcg ggtgcgggtt agggccacca caacgaccgc gccgatgcgg tctgccaccg 754921 gatctctggt aaggacaagt actggtctgt caccaccagg tgtggcggca aaccacaatt 754981 caccgcgccg catcggccca gtcggcccag tcctccgccg gcccccagtc ggcgatctca 755041 gccagtgcgt tctcgtcgtc cgtcaacggt cgctcggtgt aggcctggac atcctggtcc 755101 gcggccagcg cggccaagtg acgtcgcagc gcatcgcgca gcagctcgga gcggccgatg 755161 tgtaggcgac gcgcccacgc gtcggccagg tcgacgtcgt ggtcgtcggc gcggaagctg 755221 agcatcgtca tacatcgagt ttagagcgta tgacattgtc ggccggcgag cagacgcata 755281 agcccccgca cgctcggcgt gtcgggggct tatgcgactg ctcgcccggg gccgtcagcg 755341 gtcgccgagc aggctgacca tcccggccgc gtcgggaatg acgtgcacga catccgacag 755401 atcggcgaac gccgggtcag ccgagacaag agcggtcgcg ccggcgcttg cggcaaccgc 755461 cgcgagcacc gcgtcgcagg cttcaagccc tggcgttgtc tcgaacagcg tcaggccgcg 755521 cttcgaggtg gcctcgattg atggtgagta gcggcgagag cagttcggca tagtcacacg 755581 gcccagcgcg gcggcgtcgc tgcggtcgcg ccggcgggcg cgtacgtgga cgaactcctg 755641 gatcacctcg gcggtggtgg tcgcagcgat gcgttcgtcg gcgattgccg cgacgagatc 755701 gcggcaggga tcgcggagtg gatgctcggc gcctttggca tagacgagga cggtggtgtc 755761 gagcactatc atccgcggcg cgcccggagg gcctcgagtt cctgcttcag ctcccgcggc 755821 tcgggaacgg acatgtcggc ggcgtcgagc aggcgcctgc ccgcggactt gcggcgaccg 755881 gcggggctga cgaggcctcg atcaatggcc tcacgcacga cggttgcgac cgggacgcct 755941 cgctcgcgcg ccaccgcggt gatgcggcgg tggcactcgt cgtcgagcag gatctggagc 756001 cgatgcgcca gacgcatgct catacattta gcatgctgaa atttgggcgg cggctgccat 756061 tgcggtcgcg ttgacccgcg gacggcccag acgctgcggt tgtagcgtcg ataggcacgc 756121 gtattaggga ggaacaatgc cgcagccaag aacgcatctg ccgattccca gtgctgctcg 756181 caccgggctg atcacgtatg acgcgaagga tcccgacagc acctatccgc cgatcgagca 756241 gctgcgccca ccggcgggtg ccccgaatgt gttgctgatc ctgcttgacg atgtcgggtt 756301 cggtgcgtcg agcgcgttcg gaggcccatg caggacgtcg acggcggaac tgcttgccgg 756361 taacgggttg cggtacaacc ggtttcacac caccgcgctg tgctcgccga cgcgtcaggc 756421 gttgttaact ggacgcaacc atcactccgc cggcatgggc ggtatcaccg aaatcgccac 756481 cggtgcaccg ggatacagct cagtactacc gaacaccatg tcgccgatcg cgcggacgct 756541 aaagctcaac ggctacaaca ccgcccagtt cggcaagtgc cacgaagtcc cggtctggca 756601 gaccagcccg gtcgggccgt tcgacgcgtg gcccagcggc ggcggtggtt tcgaatactt 756661 ctacgggttt atcggtggcg aggctaacca gtggtatccg agtctgtacg agggcaccac 756721 gccggtcgag gtgaaccgca cgcccgagga gggttaccat ttcatggcgg acatgaccga 756781 caaggccctc ggctggatcg gacagcagaa ggcactggcc cccgaccggc cgttcttcgt 756841 gtacttcgcc ccgggcgcca cccacgcgcc ccaccacgtt ccgcgggagt gggccgacaa 756901 gtaccggggc cgcttcgatg tgggctggga cgcactgcga gaggaaacct tcgcccggca 756961 aaaggaactc ggggtgatcc cggcggactg ccagctgacc gcgcggcacg ccgaaatccc 757021 ggcgtgggac gacatgccgg aggacctcaa acccgtgcta tgccggcaga tggaggtcta 757081 cgcgggcttt ctggaataca ccgaccacca cgtcggccgg ctcgtcgacg gcctgcagcg 757141 cctcggtgtg ctcgacgaca cgctggtgtt ctacatcatc gacgacaacg gcgcctcggc 757201 cgagggcacg atcaacggca cctacaacga gatgttgaac ttcaacggcc tggccgacat 757261 cgagacgccg cggttcatga ccgaccggct cgacaagttc ggcgggccgg agtcctacaa 757321 ccactattcg gtgggttggg cgcatgcgat ggataccccc tatcagtgga ccaaacaagt 757381 ggcctcgcac tggggtggca cgcgtaacgg cacgattgtg cactggccca acggaattgc 757441 cgccaagggg gagatgcgct ggcagtttca ccacgtcatc gacgtggcgc cgaccatcct 757501 ggaggcggcg gggttgccgg aaccgttatt cgtcaacggc gtgcagcaac accccatcga 757561 aggggtcagc atggcctatt cgttcgacga cgcgcaggcg ccggatcggc acgagacgca 757621 gtatttcgag atgttcggaa accggggcat ctaccacaag ggttggaccg cggtgaccaa 757681 gcacaagacg ccgtggattt tggttggcga gcagaccgtc gcgttcgacg acgacgtgtg 757741 ggagctctac gacaccacca aggattggag ccaggccaaa gacttggcca aggagatgcc 757801 ggaaaagctg catgagctgc agcggctgtg gctgatcgag gcgacgcgct acaacgtgct 757861 tccgctggac gacgacaccg ccagccgcat caaccccgat ctggcgggca ggccggtgct 757921 catcaggggc aacacccagg tgctgttttc gaacatgggc cggttgtcgg agaactgtgt 757981 gctcaacctc aagaacaaat cgcacacggt gaccgctgag gtcgaggtgc ccgagaccgg 758041 tgctgagggc gtgatcgtcg cgcagggcgc cagcatcggc ggctggagcc tgtatgccaa 758101 cgacggcaag ctcaagtact gctacaacct gggtggtatc aagcacttct acgccgagtc 758161 cgccgacccg ctgccggccg gcgcccatca ggtgcgcatg gaattcgctt atgccggtgg 758221 cggtttgggc aagggcggcg aggtaactct ttatgtcgac ggccaacagg tcggcgaagg 758281 acatgtcgaa gccacccttg ccatcgtctt ctcggccgac gacggctgcg atgtcggcat 758341 ggattcgggc tcgcccgtct cacccgacta tgccccgggg agtaacgcgt tcaacgggcg 758401 gatcaagggc gtgcagctcg cgatcgccga ggccgccgct gctgcgggcc atctggtcga 758461 cccggagcac gcgatccgca tcgcgctggc gcgccaatag ggccgcacag tcaaacgggg 758521 aggggacggc gatggaaaag tcacggtgcc acgctgtcgc acatggaggt gggtgtgcgg 758581 gatctgcgaa atcgcacaag tcaggtggtc gatgcggtca aggccggggt gccggtgact 758641 ctcacggtac acggggagcc ggtcgccgat atcgtgccgc atcggcgccg catccgctgg 758701 ctgtcggggc gcatctgcgc gatgagctcg ccaagcgctc ggccgacccg cgcctcaccg 758761 atgaactcaa cgacttggcc ggtcataccc tcgacgacct gtgaccgagg gcgaggtcgg 758821 ggtaggcctg ctagatacgt cggtcttcat tgcgcgcgag agcggcggtg caatcgcgga 758881 cctgcctgaa cgcgtggcgc tttcggttat gacgatcggt gagctgcaac tcggtctgct 758941 caatgctggc gattcggcga cccgatcacg acgcgccgac accctcgcgc tagcgcgcac 759001 ggccgatcag atccctgtca gtgaagcggt gatgatttcg ttggctcgac tcgtcgcgga 759061 ctgccgagcc gcgggcgtgc ggcggtcggt gaagctgacc gacgctctca ttgcggcaac 759121 cgcggagatc aaggtgtgac accgaggact gatgaaggtg ccgctgcacc ctgcctgatg 759181 cctgacgtca cgatgcccgt gaagcgtggt gatgcccggg gagctttggg tgtgggtcca 759241 gctttgttcg tggtgagcgt gagcagctcg ctggtgaggg ccaggagctg tcgttgcacg 759301 gcggattgat cgattcgacc gcatccatct ggagctactg ccccagaccg gactcgcagc 759361 cttggcaagc cgctacgcgg gcattctcac ctgaggcaac gaagggcgct atgcgcgcat 759421 tgtgggtgag tcaacgcgag gacttgacgg cagacgctaa acgggtcaat ctgttgggca 759481 gcatgcgccg catgtggcca aaggaagtcg agatcgccag ctagcgccga tatccgggga 759541 tggttattgc cgggtatttg aggaatgcgc cgtcctgcgc tattgttgga cgttgcgctg 759601 gctacttcct gcccacctca cccgccactt gacaccgtgg tcttagtctg agcccagttt 759661 gcggctcagc ggtttagttg cgtgcgtgag atccggacag atcgttcgcc ggccgaaacc 759721 gacaaaatta tcgcggcgaa cgggcccgtg ggcaccgctc ctctaagggc tctcgttggt 759781 cgcatgaagt gctggaagga tgcatcttgg cagattcccg ccagagcaaa acagccgcta 759841 gtcctagtcc gagtcgcccg caaagttcct cgaataactc cgtacccgga gcgccaaacc 759901 gggtctcctt cgctaagctg cgcgaaccac ttgaggttcc gggactcctt gacgtccaga 759961 ccgattcgtt cgagtggctg atcggttcgc cgcgctggcg cgaatccgcc gccgagcggg 760021 gtgatgtcaa cccagtgggt ggcctggaag aggtgctcta cgagctgtct ccgatcgagg 760081 acttctccgg gtcgatgtcg ttgtcgttct ctgaccctcg tttcgacgat gtcaaggcac 760141 ccgtcgacga gtgcaaagac aaggacatga cgtacgcggc tccactgttc gtcaccgccg 760201 agttcatcaa caacaacacc ggtgagatca agagtcagac ggtgttcatg ggtgacttcc 760261 cgatgatgac cgagaagggc acgttcatca tcaacgggac cgagcgtgtg gtggtcagcc 760321 agctggtgcg gtcgcccggg gtgtacttcg acgagaccat tgacaagtcc accgacaaga 760381 cgctgcacag cgtcaaggtg atcccgagcc gcggcgcgtg gctcgagttt gacgtcgaca 760441 agcgcgacac cgtcggcgtg cgcatcgacc gcaaacgccg gcaaccggtc accgtgctgc 760501 tcaaggcgct gggctggacc agcgagcaga ttgtcgagcg gttcgggttc tccgagatca 760561 tgcgatcgac gctggagaag gacaacaccg tcggcaccga cgaggcgctg ttggacatct 760621 accgcaagct gcgtccgggc gagcccccga ccaaagagtc agcgcagacg ctgttggaaa 760681 acttgttctt caaggagaag cgctacgacc tggcccgcgt cggtcgctat aaggtcaaca 760741 agaagctcgg gctgcatgtc ggcgagccca tcacgtcgtc gacgctgacc gaagaagacg 760801 tcgtggccac catcgaatat ctggtccgct tgcacgaggg tcagaccacg atgaccgttc 760861 cgggcggcgt cgaggtgccg gtggaaaccg acgacatcga ccacttcggc aaccgccgcc 760921 tgcgtacggt cggcgagctg atccaaaacc agatccgggt cggcatgtcg cggatggagc 760981 gggtggtccg ggagcggatg accacccagg acgtggaggc gatcacaccg cagacgttga 761041 tcaacatccg gccggtggtc gccgcgatca aggagttctt cggcaccagc cagctgagcc 761101 aattcatgga ccagaacaac ccgctgtcgg ggttgaccca caagcgccga ctgtcggcgc 761161 tggggcccgg cggtctgtca cgtgagcgtg ccgggctgga ggtccgcgac gtgcacccgt 761221 cgcactacgg ccggatgtgc ccgatcgaaa cccctgaggg gcccaacatc ggtctgatcg 761281 gctcgctgtc ggtgtacgcg cgggtcaacc cgttcgggtt catcgaaacg ccgtaccgca 761341 aggtggtcga cggcgtggtt agcgacgaga tcgtgtacct gaccgccgac gaggaggacc 761401 gccacgtggt ggcacaggcc aattcgccga tcgatgcgga cggtcgcttc gtcgagccgc 761461 gcgtgctggt ccgccgcaag gcgggcgagg tggagtacgt gccctcgtct gaggtggact 761521 acatggacgt ctcgccccgc cagatggtgt cggtggccac cgcgatgatt cccttcctgg 761581 agcacgacga cgccaaccgt gccctcatgg gggcaaacat gcagcgccag gcggtgccgc 761641 tggtccgtag cgaggccccg ctggtgggca ccgggatgga gctgcgcgcg gcgatcgacg 761701 ccggcgacgt cgtcgtcgcc gaagaaagcg gcgtcatcga ggaggtgtcg gccgactaca 761761 tcactgtgat gcacgacaac ggcacccggc gtacctaccg gatgcgcaag tttgcccggt 761821 ccaaccacgg cacttgcgcc aaccagtgcc ccatcgtgga cgcgggcgac cgagtcgagg 761881 ccggtcaggt gatcgccgac ggtccctgta ctgacgacgg cgagatggcg ctgggcaaga 761941 acctgctggt ggccatcatg ccgtgggagg gccacaacta cgaggacgcg atcatcctgt 762001 ccaaccgcct ggtcgaagag gacgtgctca cctcgatcca catcgaggag catgagatcg 762061 atgctcgcga caccaagctg ggtgcggagg agatcacccg cgacatcccg aacatctccg 762121 acgaggtgct cgccgacctg gatgagcggg gcatcgtgcg catcggtgcc gaggttcgcg 762181 acggggacat cctggtcggc aaggtcaccc cgaagggtga gaccgagctg acgccggagg 762241 agcggctgct gcgtgccatc ttcggtgaga aggcccgcga ggtgcgcgac acttcgctga 762301 aggtgccgca cggcgaatcc ggcaaggtga tcggcattcg ggtgttttcc cgcgaggacg 762361 aggacgagtt gccggccggt gtcaacgagc tggtgcgtgt gtatgtggct cagaaacgca 762421 agatctccga cggtgacaag ctggccggcc ggcacggcaa caagggcgtg atcggcaaga 762481 tcctgccggt tgaggacatg ccgttccttg ccgacggcac cccggtggac attattttga 762541 acacccacgg cgtgccgcga cggatgaaca tcggccagat tttggagacc cacctgggtt 762601 ggtgtgccca cagcggctgg aaggtcgacg ccgccaaggg ggttccggac tgggccgcca 762661 ggctgcccga cgaactgctc gaggcgcagc cgaacgccat tgtgtcgacg ccggtgttcg 762721 acggcgccca ggaggccgag ctgcagggcc tgttgtcgtg cacgctgccc aaccgcgacg 762781 gtgacgtgct ggtcgacgcc gacggcaagg ccatgctctt cgacgggcgc agcggcgagc 762841 cgttcccgta cccggtcacg gttggctaca tgtacatcat gaagctgcac cacctggtgg 762901 acgacaagat ccacgcccgc tccaccgggc cgtactcgat gatcacccag cagccgctgg 762961 gcggtaaggc gcagttcggt ggccagcggt tcggggagat ggagtgctgg gccatgcagg 763021 cctacggtgc tgcctacacc ctgcaggagc tgttgaccat caagtccgat gacaccgtcg 763081 gccgcgtcaa ggtgtacgag gcgatcgtca agggtgagaa catcccggag ccgggcatcc 763141 ccgagtcgtt caaggtgctg ctcaaagaac tgcagtcgct gtgcctcaac gtcgaggtgc 763201 tatcgagtga cggtgcggcg atcgaactgc gcgaaggtga ggacgaggac ctggagcggg 763261 ccgcggccaa cctgggaatc aatctgtccc gcaacgaatc cgcaagtgtc gaggatcttg 763321 cgtaaagctg tcgcaaaatt actaaacccg ttaggggaaa gggagttacg tgctcgacgt 763381 caacttcttc gatgaactcc gcatcggtct tgctaccgcg gaggacatca ggcaatggtc 763441 ctatggcgag gtcaaaaagc cggagacgat caactaccgc acgcttaagc cggagaagga 763501 cggcctgttc tgcgagaaga tcttcgggcc gactcgcgac tgggaatgct actgcggcaa 763561 gtacaagcgg gtgcgcttca agggcatcat ctgcgagcgc tgcggcgtcg aggtgacccg 763621 cgccaaggtg cgtcgtgagc ggatgggcca catcgagctt gccgcgcccg tcacccacat 763681 ctggtacttc aagggtgtgc cctcgcggct ggggtatctg ctggacctgg ccccgaagga 763741 cctggagaag atcatctact tcgctgccta cgtgatcacc tcggtcgacg aggagatgcg 763801 ccacaatgag ctctccacgc tcgaggccga aatggcggtg gagcgcaagg ccgtcgaaga 763861 ccagcgcgac ggcgaactag aggcccgggc gcaaaagctg gaggccgacc tggccgagct 763921 ggaggccgag ggcgccaagg ccgatgcgcg gcgcaaggtt cgcgacggcg gcgagcgcga 763981 gatgcgccag atccgtgacc gcgcgcagcg tgagctggac cggttggagg acatctggag 764041 cactttcacc aagctggcgc ccaagcagct gatcgtcgac gaaaacctct accgcgaact 764101 cgtcgaccgc tacggcgagt acttcaccgg tgccatgggc gcggagtcga tccagaagct 764161 gatcgagaac ttcgacatcg acgccgaagc cgagtcgctg cgggatgtca tccgaaacgg 764221 caaggggcag aagaagcttc gcgccctcaa gcggctgaag gtggttgcgg cgttccaaca 764281 gtcgggcaac tcgccgatgg gcatggtgct cgacgccgtc ccggtgatcc cgccggagct 764341 gcgcccgatg gtgcagctcg acggcggccg gttcgccacg tccgacttga acgacctgta 764401 ccgcagggtg atcaaccgca acaaccggct gaaaaggctg atcgatctgg gtgcgccgga 764461 aatcatcgtc aacaacgaga agcggatgct gcaggaatcc gtggacgcgc tgttcgacaa 764521 tggccgccgc ggccggcccg tcaccgggcc gggcaaccgt ccgctcaagt cgctttccga 764581 tctgctcaag ggcaagcagg gccggttccg gcagaacctg ctcggcaagc gtgtcgacta 764641 ctcgggccgg tcggtcatcg tggtcggccc gcagctcaag ctgcaccagt gcggtctgcc 764701 caagctgatg gcgctggagc tgttcaagcc gttcgtgatg aagcggctgg tggacctcaa 764761 ccatgcgcag aacatcaaga gcgccaagcg catggtggag cgccagcgcc cccaagtgtg 764821 ggatgtgctc gaagaggtca tcgccgagca cccggtgttg ctgaaccgcg cacccaccct 764881 gcaccggttg ggtatccagg ccttcgagcc aatgctggtg gaaggcaagg ccattcagct 764941 gcacccgttg gtgtgtgagg cgttcaatgc cgacttcgac ggtgaccaga tggccgtgca 765001 cctgcctttg agcgccgaag cgcaggccga ggctcgcatt ttgatgttgt cctccaacaa 765061 catcctgtcg ccggcatctg ggcgtccgtt ggccatgccg cggctggaca tggtgaccgg 765121 gctgtactac ctgaccaccg aggtccccgg ggacaccggc gaataccagc cggccagcgg 765181 ggatcacccg gagactggtg tctactcttc gccggccgaa gcgatcatgg cggccgaccg 765241 cggtgtcttg agcgtgcggg ccaagatcaa ggtgcggctg acccagctgc ggccgccggt 765301 cgagatcgag gccgagctat tcggccacag cggctggcag ccgggcgatg cgtggatggc 765361 cgagaccacg ctgggccggg tgatgttcaa cgagctgctg ccgctgggtt atccgttcgt 765421 caacaagcag atgcacaaga aggtgcaggc cgccatcatc aacgacctgg ccgagcgtta 765481 cccgatgatc gtggtcgccc agaccgtcga caagctcaag gacgccggct tctactgggc 765541 cacccgcagc ggcgtgacgg tgtcgatggc cgacgtgctg gtgccgccgc gcaagaagga 765601 gatcctcgac cactacgagg agcgcgcgga caaggtcgaa aagcagttcc agcgtggcgc 765661 tttgaaccac gacgagcgca acgaggcgct ggtggagatt tggaaggaag ccaccgacga 765721 ggtcggtcag gcgttgcggg agcactaccc cgacgacaac ccgatcatca ccatcgtcga 765781 ctccggcgcc accggcaact tcacccagac tcgaacgctg gccggtatga agggcctggt 765841 gaccaacccg aagggtgagt tcatcccgcg tccggtcaag tcctccttcc gtgagggcct 765901 gaccgtgctg gagtacttca tcaacaccca cggcgctcga aagggcttgg cggacaccgc 765961 gttgcgcacc gccgactccg gctacctgac ccgacgtctg gtggacgtgt cccaggacgt 766021 gatcgtgcgc gagcacgact gccagaccga gcgcggcatc gtcgtcgagc tggccgagcg 766081 tgcacccgac ggcacgctga tccgcgaccc gtacatcgaa acctcggcct acgcgcggac 766141 cctgggcacc gacgcggtcg acgaggccgg caacgtcatc gtcgagcgtg gtcaagacct 766201 gggcgatccg gagattgacg ctctgttggc tgctggtatt acccaggtca aggtgcgttc 766261 ggtgctgacg tgtgccacca gcaccggcgt gtgcgcgacc tgctacgggc gttccatggc 766321 caccggcaag ctggtcgaca tcggtgaagc cgtcggcatc gtggccgccc agtccatcgg 766381 cgaacccggc acccagctga ccatgcgcac cttccaccag ggtggcgtcg gtgaggacat 766441 caccggtggt ctgccccggg tgcaggagct gttcgaggcc cgggtaccgc gtggcaaggc 766501 gccgatcgcc gacgtcaccg gccgggttcg gctcgaggac ggcgagcggt tctacaagat 766561 caccatcgtt cctgacgacg gcggtgagga agtggtctac gacaagatct ccaagcggca 766621 gcggctgcgg gtgttcaagc acgaagacgg ttccgaacgg gtgctctccg atggcgacca 766681 cgtcgaggtg ggccagcagc tgatggaagg ctcggccgac ccgcatgagg tgctgcgggt 766741 gcagggcccc cgcgaggtgc agatacacct ggttcgcgag gtccaggagg tctaccgcgc 766801 ccaaggtgtg tcgatccacg acaagcacat cgaggtgatc gttcgccaga tgctgcgccg 766861 ggtgaccatc atcgactcgg gctcgacgga gtttttgcct ggctcgctga tcgaccgcgc 766921 ggagttcgag gcagagaacc gccgagtggt ggccgagggc ggtgagcccg cggccggccg 766981 tccggtgctg atgggcatca cgaaggcgtc gctggccacc gactcgtggc tgtcggcggc 767041 gtcgttccag gagaccactc gcgtgctgac cgatgcggcg atcaactgcc gcagcgataa 767101 gctcaacggt ctgaaggaaa acgtgatcat cggcaagctg atcccggccg gtaccggtat 767161 caaccgctac cgcaacatcg cggtgcagcc caccgaggag gcccgcgctg cggcgtacac 767221 catcccgtcg tatgaggatc agtactacag cccggacttc ggtgcggcca ccggtgctgc 767281 cgtcccgctg gacgactacg gctacagcga ctaccgctag gtgggcgagc agacgcagaa 767341 tcgcacgcga aatgcctgcg cgatgcgatt ctgcgtctgc tcgccgtggt ggatgagccg 767401 gtcttgcatc gccgatgcgg gaaacccatg catgcgttgg ggcacgacgc cggcctggcc 767461 gccagattgg cgctgcccgc cgccccgttc aacatgagtg gcaatcccgc catctgcctg 767521 cctgcggggg acacgtcgtg aggaaccccg gtcggggttt agtttatcgg ccgtgaattc 767581 gccgaacggt tgctcgtcca agccggccac gcattccagc aggccactgc gttccatcgc 767641 cgacgcccag gcatggcctg ggttggtgat tgcggccgcc gagtcaaaca accgtgaact 767701 cgcgcgtcgt cgcgctgaac gctgtaagca tgccgttgcg atcgcgtgcg gtgccgtggt 767761 ggacgatgcg gtattgcccg ggcgtggtat cgccgggaac atcccagcga atgctgacat 767821 gcgatccggc ccgcccttgg cgctgccagc gaaagctcgt ggcccagtcg ccgtcgtcag 767881 caatccgcac ccagctggca ccttcccggc ggaccacttc gaggtaggtg ccgccgcggc 767941 gcagatcgtt attgggcagc gcgctgacga aaacggcttc caccgcctga cccggtcggt 768001 acgtcgccga gggctcggcg atgaccgctc cgaacgaccc ggcatcggcg ggcgcgccgc 768061 gcacccagct cagctcccgg gtgggccgcg gccggcgacc gagcgtcacc ggacggccgt 768121 cgcgcatggc ctcggcgagt tcggccacgg tctgcatgag ggcgcacagt tcccatcgac 768181 cgaacaacgt gctgccgccc tcgtagcgct gttcgagata ctcttcgggc gttgtcacgt 768241 aatggatgta ggcgttggtg tagcccacgc agagcacgtc ggccaggtcg gcgccaacaa 768301 tcgaagccac catgcggcgc agcctaagcc ccgcgacgat ggtcggttcg cccggaatac 768361 cgatcagata gaggcgaccg attcgcacga gctgaacggg aacaatttcc tggacaaagg 768421 ggtgtatccg gttcggcagg cgtgcgggca tcacaatgcc tttgggggcc tgtgccgctg 768481 ccgtcggcct tgccagccgg tacatggcgc gggatagtct gtcccagaac gggtttcgcc 768541 cttggcgaaa gccatggaag cccgggccct cgtcggtgcc tgccatggcc ccggcgccaa 768601 acatcggacg cccggtgcgg cgctcttcac cgtctggtgt gtactcgccg cgcacgagca 768661 cagaaccgag atcgacatag gtgaaccggg catcaatgcc agcgccgatg ggcgtcgctc 768721 cgctcaactg cgtgaaagca tcctcgaact ggcacaaccc ggtacgacgg gtgttgtcga 768781 attcccggtc tggtggggcc tcgggagaaa ggggcccgtc gacattcggg ctcatgtcgc 768841 ccggattcgt ctgtgcgaag gcggcgatga agtcgggctg gccggcgaga taatccgcgc 768901 cgcccacggt gcgttcccag tgataggccg cgaaaccctt gttgtctccg gagatgaggt 768961 ggttgcgatt cgtcatgctc gtaccgtggg tagcgaagaa atggatcacg cccacggtgg 769021 cctcgccccg gtcgatacgc acgagcgtgg tatgcgggtc gacgcgtttc gggaagaacg 769081 ccttgtcggc cggcgggttg cggtcgaacg ctgatgggga tcgattgatg cttgcgccgt 769141 acagctcgcc gtgcgagagc gaaacctcgg cgggcgccac atcggcatgc gcatgttcca 769201 ccgattcgac aattccgtcg acgatcgccg caaaggttgc cggccgaaag ccgctcgtgg 769261 tcaggttgta cagcaggtat ccgcagtacc cgccaggccc ggcgtgggtg tgggtcgccg 769321 tgatcagtgt gttctgctcc gagtaggtat cgccatacaa atcggccaac cggcgcagca 769381 cttcctcatt cacgttttgc atgggcagcg gcagttcggc gacaatcagc agcaaccgcg 769441 cgtccccgtc ctgggaatcg tcccggaaca caaacgcccg tgacctaagt cgctggtgaa 769501 tgccggcggt gcgctggtcg gacttgccgt agccgagcat gccgcagtcc gccgcctcac 769561 cagtgatgtc ggcgatgccg cgccctacac taagcattgc ctaatcctcc gcaccagcag 769621 caaatttcac gagcgctgac tacgcgctgc tccggggaaa cgtatcccac aaggagaaac 769681 actttatgcg ccggggccca cgaatacgga cggcagcatc ccgtgccgcg gctggccgga 769741 cgtgatccga gggtgtgggt ctcaccagat cggtctcact agacttgggt tgtgctcatt 769801 ggttcgcatg tcagcccaac cgatccgctg gccgcagcgg aggccgaagg cgctgacgta 769861 gtgcagattt tccttggcaa tccgcagagc tggaaggctc ccaagccgcg ggacgacgcc 769921 gccgcgctga aagccgcgac cctgcccatc tacgtgcatg cgccctacct gatcaacctt 769981 gcgtcggcga acaatcgcgt gcggatcccg tcgcgcaaga tcctgcaaga gacctgtgct 770041 gcggcggccg acattggcgc agcggcggtg atcgtgcacg gtgggcacgt cgccgacgac 770101 aacgacatcg acaagggctt ccagcgctgg cgcaaggcgc tggaccggct ggaaaccgag 770161 gttcccgtct acctggaaaa caccgccggc ggcgatcacg cgatggcgcg ccgcttcgac 770221 accatcgccc ggctctggga cgtcatcggc gacaccggaa tcgggttttg cctggacacc 770281 tgccacacct gggcggccgg cgaggcgctg accgatgccg tcgatcggat caaagcaatt 770341 accggccgca tcgatctggt gcactgcaac gactccaggg acgaagcggg atcgggccgt 770401 gaccgccacg ccaacctcgg cagcggccag attgatcctg acctgctggt ggctgccgtc 770461 aaggcggccg gcgcgccggt gatctgcgaa accgccgacc aaggtcgcaa ggacgacatc 770521 gcgtttctgc gggaaagaac cggcagctga cttcaagccc cgcggcacct accgttgact 770581 tatgctccgc agggtcgcca tactgctcgc cgctgtgctt gcgttcgcgg gctgctcggg 770641 gggaacgagg ttggcggcgg gcttcggcaa tggcaatagc gtgcacaccc tcgatgtcga 770701 tggagccggc cgcagctacc ggctttataa gcccgtcggg ttgccgtcct cggcgccgct 770761 ggtcgtcatg ttgcacggcg ggttcggcag cgccaagcaa gccgaaaggt cttatggctg 770821 ggacgaattg gccgactccg agaagttcct cgtcgcctac cccgatggct atcacagggc 770881 ttggaatgcc aatggcggag gctgctgcgg ccggcccgca cgtgaaggcg tcgacgacat 770941 cggcttcgtc cgcgcggtcg tcgccgacat cgccaacaat gtcagcatcg accccgcccg 771001 ggtctacgtc acgggcatga gcaacggtgc catcatgtcc tacacgctgg cctgcaacac 771061 cagcatcttc gcggcgatcg gcgtcgtttc gggcacgcaa ctagacccct gtcagtcccc 771121 gcgtccggtg tcggtcatcc acatccatgg cacggccgat ccgctggtcc gctaccacgg 771181 cgggcccggc gccgggttcg cgcgcatcga cggtccgccg gtgcccgatc tcaatgcgtt 771241 ctggcgcgag gtcaaccggt gcggcgcgct ggataccacg accgaaggtc cggtcaccac 771301 atcgggcgcc acatgcgccg acaatcgccg tgtcgtgctg ctcaccgtcg atgacgccgg 771361 ccaccgatgg ccgtcatttg ccacccagac actgtggcga ttctttgcag cgcacttcag 771421 atgaggacaa aaccatccgt tacattctct tgtgcagttg tagaaaaaac gtaacatggt 771481 ggcatgtcag atacgcatgt cgtcaccaac caggttccgc ccttggagaa ctacaatccc 771541 gcgtcatccc cggtgctcat cgaggctctg atccaggagg gtggccagtg gggcctggat 771601 gaagtaaacg aggtcggggc aatttctgcc agctgccaag cccaacgctg gggagagctt 771661 gcagaccgca accggcccat cctgcatacc cacgacgctt acgggtaccg ggtcgatgag 771721 gtggagtacg acccggccta ccacgagctg atgcgtaccg cgatcaccca tggcatgcac 771781 gccgcaccgt gggctgacga ccgcccgggt gcgcacgtgg tgcgagcggc caagacatcg 771841 gtgtggaccg tcgagccggg ccatatctgc cccatctcga tgacctacgc cgtcgttccg 771901 gcgctgcggt ataactccga gctggctgcg gtctacgagc cgctgctgac cagtcgtgag 771961 tacgacccgg agctgaagcc ggcgaccacg aaggccggca tcaccgccgg catgtcgatg 772021 accgagaagc agggtggctc cgacgtgcgc gctggcacca cccaggcgac cccgaatgcg 772081 gacggcagct acagcttgac cggccacaag tggttcactt cggcgccgat gtgcgacatc 772141 ttcctggtgc tcgcgcaggc accggacggg ctgtcgtgct tcctgctgcc gcgggtgctg 772201 cccgacggca cccgcaaccg aatgttcttg cagcggctca aggacaagct cggcaaccac 772261 gcaaacgcct cgagcgaggt cgaatacgac ggtgccgtcg cgtggctggt gggcgaggag 772321 ggccgcggcg tgccgaccat catcgagatg gtcaacctca cccggctgga ctgcgctctg 772381 ggcagtgcca ccagcatgcg caccggccta acccgcgccg tccaccatgc ccagcatcgg 772441 aaggcgttcg gcgcctacct gatcgaccag ccgttgatgc gcaacgtgct ggccgacctg 772501 gcggtggagg ccgaggccgc caccatcgtg gcaatgcgga tggccggtgc caccgacaac 772561 gcggtgcgcg ggaacgagac cgaagcgctg ctgcgtcgca tcggcctggc ggccgccaag 772621 tactgggtgt gcaagcgctc caccgctcac gccgccgaag cgctggagtg cctgggcggc 772681 aacggttatg tcgaggattc cgggatgccc cggctctacc gggaggcgcc gttgatgggc 772741 atctgggagg gctcgggcaa tgtcagcgcg ctagatacct tgcgcgccat ggcaacccgg 772801 cccgcatgcg tcgaggtgct gtttgacgag ctggcccgca gcgcaggcca ggaccccagg 772861 ctggacggcc acgtcgaaag gctgcgtccg cagctgggcg atcttgacac gatcggttat 772921 cgagcccgca agattgccga agacatctgc ctggcgttgc agggatcgtt gttggtgcgc 772981 cacggacatc ccgccgtcgc cgaggcgttt ctggccactc ggctcggcgg ccagtggggc 773041 ggagcgtacg gcaccatgcc ggccggtctg gatctcgcgc ccatcctcga gcgtgcgctg 773101 gtaaaaggct gagcggccgc tgatgacaca cgcgatcagg ccggtcgatt tcgacaacct 773161 gaagacgatg acctatgagg tcaccggtcg gattgcgcgg atcaccttca accggccgga 773221 gaagggcaac gcgatcatcg cagacacccc gctggagttg tctgctctgg tggagcgtgc 773281 cgatctggat ccaggcgtgc atgtcattct ggtgtccggt cgcggcgagg gattctgtgc 773341 cggcttcgac ctgtccgcct acgccgaggg gtcgtcgtcg accgggggcg gcggcgcata 773401 ccaaggcacg gtgctagatg gcaagaccca ggccgtcaac cacctaccga accagccgtg 773461 ggacccgatg atcgactacc agatgatgag ccggttcgtg cgcggattcg ccagtctgat 773521 gcatgccgac aagccgacgg tggtcaagat ccacggctac tgcgtggccg gcggcaccga 773581 catcgcgctg cacgccgatc aggtgatcgc cgccgccgac gccaagatcg gctacccgcc 773641 cacccgggtg tggggggtgc cggcggcggg cctgtgggcg caccggctcg gcgaccagcg 773701 ggccaaacgg ctgctgttca ccggcgattg catcaccggc gcgcaggccg ccgagtgggg 773761 cctggcggtc gaggcgccgg agccggctga cctcgacgag cggaccgagc gactggtggc 773821 ccggatcgcc gcactgccgg tcaatcaatt gatcatggtc aagctcgcgc tcaattccgc 773881 tctgctgcaa cagggtgtgg ccaccagcag gatggtcagc accgtgttcg acggcgccgc 773941 tcggcacaca cccgaggggc acgcgtttgt cgccgacgcg gtcgagcacg gcttccggga 774001 tgcggtgcgg cgccgtgacg agccgtttgg cgactacggc cgtcaagcat cgcgggtgta 774061 accatgccgg ccatgaccgc ccgttcggtg gtactcagcg tgctgctcgg tgctcatccc 774121 gcgtgggcca ccgcaagcga attgatccag ctgacagcgg atttcggtat caaggagacg 774181 acgttgcggg tcgcgctgac ccgcatggtc ggtgccgggg atctggtccg gtccgcggac 774241 ggctaccggc tctcggatcg gttgctggcc cgccagcgcc gacaagatga ggccatgcgc 774301 ccacggaccc gcgcttggca cggaaactgg cacatgctga ttgtcaccag catcggcacc 774361 gatgctcgta cccgggccgc actgcgaacc tgcatgcacc acaagcgttt cggtgaattg 774421 cgggaagggg tgtggatgcg gccggacaat ctcgacctcg acttggagtc cgacgttgcg 774481 gcccgggtta ggatgctgac ggcccgcgac gaggcccccg ccgacttggc cgggcagctg 774541 tgggatctgt cggggtggac cgaggccggc caccggttgc tcggcgacat ggcagcggcc 774601 accgacatgc ccgggcgatt tgtggtggct gcggcgatgg tgcgccacct gctcaccgat 774661 ccgatgttgc ccgctgaact gttgcccgcc gactggccgg gcgccgggtt acgggcggcg 774721 taccacgact tcgccactgc aatggcgaaa cgacgcgatg caactcaact cctggaggtg 774781 acatgagtga tctggtgcgt gtggagcgca aaggtcgggt gaccacggtg attctgaacc 774841 ggccggcctc ccgcaacgcg gtcaacggcc cgaccgccgc ggcgttgtgc gcggcgttcg 774901 agcaattcga ccgggacgac gccgcgtcgg tggccgtact ctggggtgcg ggtggaacct 774961 tttgtgcggg agccgatttg aaggcctttg gcacaccgga ggccaactct gtgcaccgga 775021 cgggtcccgg cccgatgggg ccgtcacgaa tgatgctgtc caaacctgtg atcgccgccg 775081 tcagcggcta cgccgtcgcc ggggggctgg aattggcact gtggtgcgac ctgcgggtgg 775141 ccgaggaaga cgccgtgttc ggtgtgtttt gccgtcgctg gggggtaccg ctcatcgacg 775201 gcggcaccgt gcgactgcca cggctgatcg ggcacagccg cgcgatggac atgatcctca 775261 ctggccgtgg ggtgccggcc gacgaagcgc tggccatggg gttggccaat cgggtggtgc 775321 ccaagggtca agcccgacag gcggctgagg agttggcggc gcaattggcc gcgctgccgc 775381 agcagtgtct gcgatcggat cggctgtcgg cgctgcacca gtggggcctg cccgagtccg 775441 cggcgctcga cctcgagttc gccagcatcg cgcgggtggc cggcgaggcg ctagaggggg 775501 cgagacggtt cgccgcgggt gccggtcggc atggggcccc ggcacctcgg gccgaacagg 775561 gcgacacgct ttaggcgggt acggctcaga ccaaggcgaa ggtccgtgcc gatgccggcg 775621 agggccacgg ctgcggaatg ggtcgttgcc ggacaacctg gggccaccag aaccactttc 775681 cgaggagggc cgcgatcgac ggtgtcatga acgaccggac gatcagggtg tcgaagagca 775741 ggcccatacc gatggtggtg ccaacctggg ccatcacggt cagctcgctg acggcaaacg 775801 acatcatggt gaaggcaaac accagcccgg cggcggtcac caccgacccg ctgccaccca 775861 tcgcacggat gatgccggtg ttgattccgg cgtggatctc ctccttgagc cgggcaacca 775921 gcagcaggtt gtagtccgcg ccgacggcca gcaggatgat gaccgccatc gccaacacca 775981 accagtgcag ctcgataccc aggatgtgtt gccagatcag caccgacagc ccgaacgagg 776041 cgcccagcga caacaccacg gtgccgacga tgacggcggc cgcgacgacg ctgcgggtgg 776101 tgatcagcat gatgatgaag atcaggcaga gtgcggagat tccggcgatc atcaagtcat 776161 aggtgttgcc gtcggacaag tccttgaaca tcgccgcggt accgcccagg tagatcgcgg 776221 atccctccaa cggtgtgccc ttgatggctt ccttggcggc ggtcttgatc ttggcgatgc 776281 gcgcgatgcc cgcctggctc atcgggtcgc cttcgtggct gatgatgaac cgcaccgcgt 776341 gcccgtccgg cgagaggaac tgttccaggc cgcgttggaa gtcgggattg tcgaaaacct 776401 cgggaggcag atagaacgag tcgtcgttgc gcgaagcatc aaaggcttcg cccatcgccg 776461 ccgaatcctc ctgcatcgcg gccatctgat cctgcagccc ttcctgggtg gaatgcatgc 776521 tcagcatctg cgccttcatg ctcttcatgg tctggatcat ctcgggcatc atcgcggtca 776581 gctggggcat gagcgtgtcc aggcgctgca tgagcggcag caggttgttg atgtcttcgg 776641 tcatgacgtc gattccgtcg agggtgtcga acaccgaccg cagcgaccag cagaccggga 776701 tgtcgtagca gtgcttttcc cagtagaagt agctgcggat ggggcggaag aaatcgtcga 776761 aatccgcaat atggttgcgc aactcctcga catcgaccac catccccgtc atctgaatga 776821 ccatttcgtg ggtgacatcg gccatctgct gggtgaggct gtgcatccgc tccatctggt 776881 cgatgttgga ctgaatgtcg ttgacctgct ccagcatcct ggccgtcagg tcctggttgt 776941 atttctcggt cagtttctgg ctggtgccct gcatgctgat caggaacggg attgaggtgt 777001 gctcgatcgg tttgccgtcc ggccgggtga tggcctgcac ccgggatatc ccctccacgg 777061 cgaaaatggc cttggcgatc ttgttgatca ccaaaaagtc ggccgaatta cgcatgtcgt 777121 ggtcgctttc gaccatcagc acctcggggt tcatccgggc ctgggagaaa tggcgctccg 777181 cggccgcata gccttcgttg gccggtaggt cggcgggcag gtagttgcgg tcgttgtagt 777241 tggtccggta gcccggcagg gtcagcagac cgacgagcgc cagggccacc gcaccgacca 777301 ggatggggcc gggccagcgg acgatggcgg ccccgacctt gcgccagccc cgcacccgcg 777361 ccatccgctt gggctcgagc agcttgccga accggctcgt cacggcgatt atcgccgggc 777421 ccagggtgag tgcggcggcg acgacgatga ccatcccgat cgccaacggc acaccgaggg 777481 tctgaaagta cggcagtcgg gtgaagctca gacagaacgt ggcacccgcg atggtcagac 777541 ccgagcccag cacgacatgg gcggtgccgc cgaacatggt gtagtacgcc gactcccggt 777601 cctggccgag cccgcgtgct tcctggtagc ggccgatcag gaagatggcg tagtcggtgg 777661 cggccgcgat cgccagcacc acgagcaggt tggtcgcgaa ggtcgagagc ccaatgatcc 777721 ggtggaaacc gaggaaagcc acgcccccgc gggtggcgag cagcccgagc accaccatcg 777781 tcagcatgat cgccgacgtg atgatcgacc ggtagaccag cagcaacatc acgatgatca 777841 cggtgaacgt gaccgcctcg atcacctgca gactacggtc gccggcctgc tgctgatcgg 777901 cgaccagcgc ggccgaaccg gtgacgtaca ccttgacacc gggtggcggc gcaaggcgct 777961 cgacgatggt cttgaccgct tccacggact cgttggccag tgactcgccc tgattgcccg 778021 cgagtttcac ctgaacgtag gcggccttgc cgtcgctgct ctgggcgccg gtggcggtca 778081 gtggatcccc ccaaaagtcc tgcaaggact ggacgtgggt ggtgtcggct tgcagtctgc 778141 cgatcatctg gtcgtaaaac gcatgggcgg cgtcaccgag cggccgctgg ccctccagca 778201 cgatcatcgc cgcgctgtcg gagtctccct cctcgaacac cttgccgatg tgtttcatcg 778261 agatcatcga cggtgccgcg tcggggctca tcgacaccgc ctgtatctgt ccgaccgttt 778321 ccagttgcgg cacagtgacg ttgaggacgg cgatggtgac caaccaccca aggatgatcg 778381 gcaccgcgaa ggtacggatc attctgggga tgaacggtcg cgccgcgtgc ctgtcgggcg 778441 ggacggagcc cgtcggcgca gctgtccttt gcacgatcat gcggatttca caaagcagta 778501 ggtcagggca tccacgccgg ttgcggtccg ctcgtccttc acttcgccat cgacggtgat 778561 tcggcaggtg atggaagtgc cgtcgccttg cgcgaggatg ttgggggccg cggacggcgc 778621 cgtggtcttc aaggtgagcg accacggcag ggctgcgccg tcgatccgct gtggcttggc 778681 gtcgaggtcc aggtagttga tgttgacgta actaccggag ccggaaactt cgtactccac 778741 caccttgggg tcgaacggct ccgggtcatc ggcgaagacc ttcggcgtca ccaagatgcc 778801 ttcggaacca aagaaagtgc ggatccgctg caccgtgaag ccggcgatgg cgaccacaac 778861 caggatgagc agcggtatcc aggcacgctt gagagttcca atcatcgccc tccgcctctg 778921 ccgcatgaag ttcacgccgg tctggtgacg cataccgaac gtcacagatt tcagagtaca 778981 gtgaaacttg tgagcgtcaa cgacggggtc gatcagatgg gcgccgagcc cgacatcatg 779041 gaattcgtcg aacagatggg cggctatttc gagtccagga gtttgactcg gttggcgggt 779101 cgattgttgg gctggctgct ggtgtgtgat cccgagcggc agtcctcgga ggaactggcg 779161 acggcgctgg cggccagcag cggggggatc agcaccaatg cccggatgct gatccaattt 779221 gggttcattg agcggctcgc ggtcgccggg gatcggcgca cctatttccg gttgcggccc 779281 aacgctttcg cggctggcga gcgtgaacgc atccgggcaa tggccgaact gcaggacctg 779341 gctgacgtgg ggctgagggc gctgggcgac gccccgccgc agcgaagccg acggctgcgg 779401 gagatgcggg atctgttggc atatatggag aacgtcgtct ccgacgccct ggggcgatac 779461 agccagcgaa ccggagagga cgactgatga gcaacctcgc aatctgaccg aggtggcgag 779521 caagacggcg attggcctgt ggtcactcct tgttgatgcg gttgcccgcg ccgaggttat 779581 cgattgtggg gtcaccgttt ttgtaggtga ccgtgttgtc cagcccaaca acaacgaggc 779641 gctcgtcgat cctgtcgaag gcgatcttgt tgttcgcacc accgacggtc accgtttcgc 779701 aggtgccgtt gacggtcagc gtgttgtccg agccggccac gttcagtgac ttgccgtcag 779761 cgcagtcaag ggtggcggta gtcccgatgg atccgtaggt cagcatgtca ccgatctgga 779821 tcgaagcggt tgtggattct ccggtcgtca cggtcggcgc cgcggtcggg ccgctcgtcg 779881 ctgtcgtggt ggtggcggtc gcgggcgtcg tggtagctgc cggcgggttg gcagtggaac 779941 tgcagccggc cagcggcaac gctgcggcag ccagcgccag agcaaaggtc gccaaccggg 780001 agtgggtagc gcgatcggcg cgcaacggtt tctcgaccac ctcagccgac ccgctgcagt 780061 cggtttacca ttcctagttc ccggccacgg tcccagatga acggatcacc attgcggaag 780121 aacaccgttt cgtcccagcc gtagacggtg atgtcgttga tgatcgtgtc ggcgacaacg 780181 gtgttggacg agcccatcac ggtcaccgcc cagcaggttc ccagcgcggt cacgatgttc 780241 tgagtgccgt tgaccaacaa ggtggattcg ttgcagtcca gcgtccgctc gatgccctgc 780301 ccggtgacat gggtgtcgcc gttcttggcg tgtgcggccg gcggtggggc ggccaaggcg 780361 acagcgatgg tgatgacacc ggcagccagc gacgcggcga cggtgttcca cttcacggcg 780421 ggcccccctt cgactgggcg ggtgatgctt gactgagcct tggtcgggcc ttgattgagc 780481 gtacgtgcat tcgcccgggc gacgacagac ctgagtgcat ttgccgggca ggcaccccgc 780541 gtctgatgtc agctactcca caacccggtc gctagagtca ttagttggcc ctaacgtccc 780601 ccgaagaccg gtgcggaccc aaagccgatc accccaaccg aagggcgaac cgccatggca 780661 gctcagccgc aagcaccgtc agcgggcggc cgcccgcgcg cggggaaagc ggtgaagtcc 780721 gtggctcgcc cggccaaact gagccgtgag agcatcgtcg agggcgccct gacctttttg 780781 gatcgggagg ggtgggactc gctgaccatc aatgcgctgg cgacccagct cgggaccaag 780841 gggccgtcgc tgtacaacca cgtggacagc ctcgaggatc tacgccgggc ggtgcggatt 780901 cgggtgatcg acgacatcat cacgatgctg aatagggtcg gtgcgggtcg cgcacgcgat 780961 gacgcggtgt tggtcatggc cggtgcctac cgcagctacg cccaccacca cccgggtcgg 781021 tactcggcgt tcacccggat gccgctgggc ggtgacgatc ccgaatacac cgctgcgact 781081 aggggcgcag ccgcgcccgt catcgccgtg ctgtcctcgt acggcctcga cggtgagcag 781141 gctttctacg cggcgctcga gttttggtcg gcactgcatg ggtttgtgtt gctggaaatg 781201 accggcgtca tggacgacat cgataccgat gcggtgttca ccgacatggt gctgcggctg 781261 gcggcgggca tggaaaggcg caccacacac ggtggtaccg cgtcaacgta gcgccctgct 781321 tcggccgcaa cgcccgcttt gacctgccag actggcggcg ggtattgtgg ttgctcgtgc 781381 ctggcggctt acgcttgatg taggggcgtg gatgccgggc caattcgcat gtccgcgatg 781441 cctcggatga gacgaatcga gtttgaggca agctatgcga cacacccggc cgcgggtaac 781501 cgtggcgggg catggccgac aaacagaacg tgaaagcgcc caagatagaa agccggtaga 781561 tgccaaccat ccagcagctg gtccgcaagg gtcgtcggga caagatcagt aaggtcaaga 781621 ccgcggctct gaagggcagc ccgcagcgtc gtggtgtatg cacccgcgtg tacaccacca 781681 ctccgaagaa gccgaactcg gcgcttcgga aggttgcccg cgtgaagttg acgagtcagg 781741 tcgaggtcac ggcgtacatt cccggcgagg gccacaacct gcaggagcac tcgatggtgc 781801 tggtgcgcgg cggccgggtg aaggacctgc ctggtgtgcg ctacaagatc atccgcggtt 781861 cgctggatac gcagggtgtc aagaaccgca aacaggcacg cagccgttac ggcgctaaga 781921 aagagaaggg ctgatgccac gcaaggggcc cgcgcccaag cgtccgttgg tcaacgaccc 781981 ggtctacgga tcgcagttgg tcacccagtt ggtgaacaag gttctgttga aggggaaaaa 782041 atcgctggcc gagcgcattg tttatggtgc gcttgagcaa gctcgcgaca agaccggcac 782101 cgatccggtg atcaccctca agcgggctct cgacaatgtc aaacccgccc tggaggtgcg 782161 cagccgtcgc gtcggcggcg cgacctatca ggtgcctgtc gaggtgcgcc ccgaccggtc 782221 gaccacgctg gcgctgcgct ggctcgtcgg ctactcgcgg caacgccgtg agaagacgat 782281 gatcgagcgc ctggcaaatg agatcctgga tgccagcaat ggccttgggg cctccgtcaa 782341 gcggcgtgag gacacccaca agatggccga ggcgaaccga gcctttgcgc attatcgctg 782401 gtgagaagcg ccggttagcc agccagggcg caaaccgaca gtgatagaca gctaactagc 782461 aaccgaaaga gtgggaagac ttctgtggca cagaaggacg tgctgaccga cctgagtagg 782521 gtccgcaact tcggcatcat ggcgcacatc gatgccggca agaccacaac caccgagcgc 782581 atcctgtact acaccggtat caactacaag attggtgagg tgcacgacgg cgcagccacc 782641 atggactgga tggaacagga acaggagcgc ggcatcacca tcacctctgc ggccacgacc 782701 acgttctgga aagacaacca gctcaatatc atcgacacgc cagggcatgt ggatttcacc 782761 gtcgaggtgg agcgcaatct gcgcgtgctc gacggcgcgg tcgcggtttt cgacggcaaa 782821 gagggtgtcg aaccgcagtc cgaacaggtg tggcggcagg ccgacaaata cgatgtcccc 782881 cgaatctgct tcgtcaacaa gatggacaag atcggtgcgg acttctactt ctcggttcgc 782941 acgatggggg agcggcttgg ggccaacgcc gtgcccattc agcttcccgt cggtgcggag 783001 gccgacttcg aaggcgtcgt cgacctggtg gagatgaacg ccaaggtgtg gcgcggcgag 783061 acgaaactcg gcgaaaccta cgacaccgtg gaaataccgg ccgacctggc cgagcaggct 783121 gaggagtacc ggaccaagct gctcgaggtg gtcgccgagt ccgacgagca cctgttggag 783181 aagtacctgg gcggtgagga gctcaccgtc gacgagatca agggcgcgat ccgcaagctg 783241 acaatcgcca gcgagatcta cccggtgctg tgcggcagcg cgttcaagaa caagggcgtg 783301 cagccgatgc tggatgccgt cgtcgactac ctgccgtcgc cgctggacgt tccgccggcg 783361 atcgggcacg cgcccgccaa ggaggacgag gaggtggtgc gcaaggcgac caccgacgag 783421 ccctttgcgg ccctggcgtt caagatcgct actcacccgt tcttcggcaa gctcacctac 783481 atccgggtgt actcgggcac cgtcgagtcg ggtagccagg tcatcaatgc caccaagggc 783541 aagaaagaac ggctgggcaa gctgttccag atgcactcca acaaggagaa cccggtcgat 783601 agggctagtg ccggtcacat ctacgcggtg atcggtctca aggacaccac caccggtgac 783661 accttgagcg acccgaacca gcagatcgtg ctggagtcga tgaccttccc cgacccggtg 783721 atcgaggtgg ccatcgagcc gaagaccaag agcgaccaag agaagctgag tctgtcgatc 783781 cagaagctcg ccgaagagga tccgaccttc aaggtgcacc tggattccga gaccggccag 783841 accgtcatcg gcggcatggg cgagctgcat ctggacatcc tggtggaccg catgcgccgg 783901 gaattcaagg tcgaggccaa cgtcggcaag cctcaggttg cctacaagga gaccatcaag 783961 cggctcgtgc agaacgtcga gtacacccac aagaagcaga cgggtggctc gggccagttc 784021 gccaaggtca tcatcaacct cgagccgttc accggtgaag agggcgcgac ctacgagttc 784081 gagagcaaag tcaccggcgg gcgtatcccg cgggagtaca tcccgtcggt ggatgccggc 784141 gcacaggacg ccatgcagta cggcgtgctg gccggctatc cgctggtgaa cctgaaggtc 784201 acgctgctcg acggcgccta ccacgaggtt gactcctcgg aaatggcgtt caagatcgcg 784261 ggctcgcagg tgctcaaaaa ggctgccgca cttgcgcagc cggtgatcct ggaaccgatc 784321 atggcggtcg aggtgaccac acccgaggac tacatgggtg acgtgatcgg cgacctgaac 784381 tcccgccgtg gccagatcca ggccatggag gagcgggctg gtgcgcgcgt tgttagggcg 784441 cacgtgccgc tgtcggagat gttcggctac gtcggtgacc ttcggtccaa gactcaaggc 784501 cgggcaaact actccatggt gttcgactcg tactccgaag tgccggcgaa cgtgtcgaag 784561 gaaatcatcg cgaaggcgac gggcgagtga gcgcaagctc acgagtgagg agccgagcaa 784621 tgggtacagc gaaggcgacg ggcgactagg cgatgcgaag acgaccgcta gtgagcgaag 784681 ctcacgagca atgagcagcg cgaaggcgac tggcgagtag atacaaccat acgagtaggc 784741 tggcccggtt acgaccgcgg cataactgaa aacatcaaca ctgcttttat aagcactaac 784801 aagtccagga ggacacaaaa gtggcgaagg cgaagttcca gcggaccaag ccccacgtca 784861 acatcgggac catcggtcac gttgaccacg gcaagaccac cctgaccgcg gctatcacca 784921 aggtcctgca cgacaaattc cccgatctga acgagacgaa ggcattcgac cagatcgaca 784981 acgcccccga ggagcgtcag cgcggtatca ccatcaacat cgcgcacgtg gagtaccaga 785041 ccgacaagcg gcactacgca cacgtcgacg cccctggcca cgccgactac atcaagaaca 785101 tgatcaccgg cgccgcgcag atggacggtg cgatcctggt ggtcgccgcc accgacggcc 785161 cgatgcccca gacccgcgag cacgttctgc tggcgcgtca agtgggtgtg ccctacatcc 785221 tggtagcgct gaacaaggcc gacgcagtgg acgacgagga gctgctcgaa ctcgtcgaga 785281 tggaggtccg cgagctgctg gctgcccagg aattcgacga ggacgccccg gttgtgcggg 785341 tctcggcgct caaggcgctc gagggtgacg cgaagtgggt tgcctctgtc gaggaactga 785401 tgaacgcggt cgacgagtcg attccggacc cggtccgcga gaccgacaag ccgttcctga 785461 tgccggtcga ggacgtcttc accattaccg gccgcggaac cgtggtcacc ggacgtgtgg 785521 agcgcggcgt gatcaacgtg aacgaggaag ttgagatcgt cggcattcgc ccatcgacca 785581 ccaagaccac cgtcaccggt gtggagatgt tccgcaagct gctcgaccag ggccaggcgg 785641 gcgacaacgt tggtttgctg ctgcggggcg tcaagcgcga ggacgtcgag cgtggccagg 785701 ttgtcaccaa gcccggcacc accacgccgc acaccgagtt cgaaggccag gtctacatcc 785761 tgtccaagga cgagggcggc cggcacacgc cgttcttcaa caactaccgt ccgcagttct 785821 acttccgcac caccgacgtg accggtgtgg tgacactgcc ggagggcacc gagatggtga 785881 tgcccggtga caacaccaac atctcggtga agttgatcca gcccgtcgcc atggacgaag 785941 gtctgcgttt cgcgatccgc gagggtggcc gcaccgtggg cgccggccgg gtcaccaaga 786001 tcatcaagta ggtctaccgg ccaccagacg caaaagaaca tgatgggcgc accagcgccc 786061 atcatgttct tttgcgtctg ctcgcgaaaa tgcccagcgt gcggcgctac gctgacatgg 786121 accctccgac gaggcaagga gcaggcacgt gttagcgcgc tacatcaaga tgcagttatt 786181 ggtgctgttg tgcggtggtc tggtcgggcc gatcttcttg gtcgtctact tcacgctcgg 786241 actgggcagc ctgatgtcgt ggatgttcta tgtcggtctg atcattaccg ttgctgacgt 786301 gctggtcgcg ctcgcattga ccaactacgg ggcaaagacc gctgccaaga ccgcggcact 786361 tgaacggagt ggagtgctgg cgctcgccca aatcaccggg ctcagcgaga cagggacccg 786421 gatcaacgat caaccgctgg taaaggtgca cctgcacatc tcgggacccg gcatcactcc 786481 gttcgacacg gaagaccggg tcatcgccag tgtgacccgg ctgggcaatc tcacggctcg 786541 aaaactggtg gtattggtga atcccgccac gcagcaatac ctgatcgact gggaacgaag 786601 cgctttggtc aacggcctgg tgcccgccca attcaccgtc gccgaagaca acaagaccta 786661 cgacttgagt gggcaaaccg gcccgctgat ggagatcttg cagattctga aggcaaacaa 786721 cgttccgctg aaccggatgg ttgacatccg ctcgaatccg gcactgcgtc agcaagtcca 786781 agcggtggtg cggcgggcag ccgagcggca ggcgccggcg gccgagccag cgtcgcaagg 786841 atcgatcgcc gagcggcttg cggagctgga atcgctgcgc gccagcggtg cggtcaacgc 786901 ggcggaatac gagagcaagc gcgcccagat catctccgaa atctgaggcg agctggggca 786961 ccatccgcgg cgagcagacg cgaaagcccg cgacacgccg aggcatcggg ggattttgtc 787021 tggtgggcgg gaatctgggg cacgttagaa cacgttacag tttcgctgct agcctgacag 787081 tcggcgagag gggcgtatgt gtctgcgcgg ggaggatcac tgcacggccg ggtggcattt 787141 gtcaccggcg ccgcccgcgc ccaaggacgg tcgcacgcgg tgcggctggc gcgcgagggg 787201 gccgatatcg tcgcgctgga catctgcgcg ccagtatccg gcagcgtgac ttacccgccg 787261 gccacgtccg aagatctcgg cgagaccgtc cgcgcggtgg aagccgaagg ccgcaaggtg 787321 ctcgcccgcg aggtggatat tcgcgacgac gccgagttgc ggcggctggt ggccgatggt 787381 gtcgagcagt tcggccggct cgacatcgtg gtggccaacg ccggggtgct gggttggggc 787441 aggctctggg aactcaccga tgagcagtgg gagaccgtta tcggggtcaa cttgacgggt 787501 acgtggcgca ccttgcgggc caccgtgccc gcgatgatcg atgccggcaa tgggggttcg 787561 attgtggttg tcagctcgtc ggcggggttg aaggcgacac cgggcaacgg ccactacgcg 787621 gccagcaagc atgcactcgt agcgctgacc aacacgttgg cgatagagct cggtgaattc 787681 ggcatacggg tcaactccat tcatccttac tcggtcgaca ccccgatgat cgaaccggag 787741 gcaatgattc agacgttcgc caagcatccc ggatatgtgc atagctttcc accaatgccg 787801 ttgcagccca aaggttttat gacaccagac gagatatccg acgtcgttgt ctggttggcc 787861 ggcgacggct cgggcgcact gtcgggcaat cagatcccgg tcgataaggg tgccttgaag 787921 tattgacgcg cgatcgtgta tgaacgcaca cgtgaccagt cgtgaaggcg tcaatgagtt 787981 tgacgatgga attgtgatcg tcggcggcgg attggcagct gcgcgcaccg ccgagcagtt 788041 gcgtcgtgcg ggctattcgg gtcgcctcac gatcgtcagc gacgaggtgc atctgccgta 788101 cgaccgtccg ccgctatcca aggaggtgct gcgcagcgag gtcgacgatg tggccctcaa 788161 accccgcgag ttctacgacg aaaaggacat cgcacttcgg ctggggtcgg ctgccgtcag 788221 cttggacacg ggagaacaga cggtaacgct ggccgacggt acggtgctcg gctacgacga 788281 gctcgtcatc gcgactggtt tggtgccccg gcgtattcca tcgcttcccg accttgatgg 788341 cattcgggtg ctccggtcgt tcgacgagag catggcactg cgcaagcatg catccgccgc 788401 acggcacgcc gtggtggtgg gggccggttt catcggctgc gaggtggctg ccagtctgcg 788461 cggtctcggt gtggatgtgg tgctggttga gccgcagccg gcgccgttgg cctcggtgct 788521 gggcgagcag atcggccagt tggtgacgcg gctccatcgc gatgagggcg ttgatgttcg 788581 cacgggtgtg acagtggccg aggtacgtgg caaggggcat gtcgacgcgg tggtcctgac 788641 cgacggtacc gaactgccgg ctgatctggt ggttgtgggc attgggtcga ccccggcgac 788701 cgaatggcta gagggtagcg gcgtcgaggt cgacaacggc gtgatctgtg acaaagccgg 788761 gcggactagc gcgccgaatg tgtgggcgct cggtgacgtc gcctcctggc gagatccgat 788821 gggacaccaa gcacgcgtgg aacattggag caacgtcgcc gaccaggccc gagtcgtggt 788881 gcccgcgatg ctcgggaccg atgtgcccac gggcgtggtc gtcccgtatt tctggagtga 788941 ccagtatgac gtcaaaatcc agtgcctggg ggagccgcac gccaccgacg ttgtgcatct 789001 ggtcgaggac gacgggcgca agttccttgc ctattacgag cgcgatggcg tgctggttgg 789061 cgtggtcggt ggcgggatgg ccggcaaggt catgaaggtg cgcggcaaga tcgccgcggg 789121 cgcgcccatc gccgaagtgt tagaccaaac tcaggcctag agctgaccta ggtggcagcg 789181 ggcgccctgg tcgtcggcgc attcggcgga catatcgtct ggctgtcggg acggctcggc 789241 cagcgcgccg gccgcacgca cccgtgcgac cgccgcgtcg atatcgccac cgtccatatc 789301 ggcacggcgc tcggatgcgg gctgcggccc gctgcaccgg gccatcagat gcacgcccgg 789361 cgcctgccag ccatccgcga cgcggcccgg tttgacggtc caccccagca cgtggcgtag 789421 aagtcgcgga tgacgtcgga atcggctacc tcgtagctga cattcgccgt ttcactgagc 789481 cggcatctgt tgagctctgg gcgtttccgg cacgggctcg gcttcaagac cgcgaatgcc 789541 gcgcttcggt tgtcggtagc atcgaggatg gctccgtgct cggattgacc tgtttgtccc 789601 gcggcgccgc tggctgcggt gatcgcctcg cgcgtggcga ccaggcctgc gacggcctag 789661 cagcaaccag ggtgggacaa ccggagccgg cgaatatgcc ggtcgggagc tcggttgttc 789721 gtgggagctc ggtgaccggc atggggacat gacggcggct gatcgtggtc agccagcttg 789781 tgtcgcgcca cgcagccatg aaatcgcgat tggcggcgga tcgcttccat gcgcggcatc 789841 cgttgcagcc cgaaatgtct ccgacgtcag gtcctcggcc gtgccacggg cgccgcagag 789901 ccgcccatac cgtcgccgcg ctggcgtgaa ttccgacggg ttggtcagga aaatatccgg 789961 cgagctgcta ccgtggcggc gccaacactg tgccgaccca actcggggat cgcagatcgc 790021 tcgtcattgc caggtcaccg gtgggccgtg tggatggcat tcacccaaga cccgggcgtg 790081 accacccggc caactgcgca tacgtaccag gtacttgatt tgggcgccgg gccgctggtg 790141 ggcaggctcc aaggtgaggt gcacgaacgg gcagtgggca tcggcttgcg cggctaatgc 790201 gtcgattccg gcgcggatcg ctgcgcgttc gtcggcgggc aggtactgcc aggtgatcga 790261 atgccacaac acggtgagtg catcgtcggt cagagtcatg ccggcgactg cggcgtgcgc 790321 cgcctgccga tggaggtccg cgggaatgtt gcgggcgacg gcgatggcgc cccgcaaccg 790381 ctccaaccga tcggtctggt ccggccagat gtagctcaac gcgttcagct ccccgtcggg 790441 gctggtgacg tcgatgggcg cgatgtcgta tccgtgtcgt tcgacgatcc gcaccgtggc 790501 cgtcggcggc aattcgccca gccaggcatt gtcgattcgc accggtgagt cggccaggcc 790561 ccattcgccg ccgagataac ggtagcggta ccgatctggt cgcaggttca gccctgcact 790621 ggaccctatc tcgaaaagcc ttattggcaa gtcgaattgg aggcaggcga tgagaagtcc 790681 accgatcaac gccgccgagc gccctacctc gttggtctgc ggtggccgat cgagagccgc 790741 acgcagcgac tccggctggt cggtcgcggt gcggacgata tcgggccagg ctgcctccgc 790801 ctgccaggtg ccgccggtgc tggggtacca gcggcgcaac accggtgcgc ggccgtcgag 790861 caccatccgg tgcaatccgc cgagcagccg aagcggcacc gcctggccct ccggagcacc 790921 cttctggtcg gccaagatgg acgcgaagac gccgccgctt tcgacgtcag ctgccacgag 790981 ctcaagtagc tcgcggtaca tcggggagcc ggaggaggtg cacacccgcc cctgtgaccg 791041 cagggtgtgg accaggtgtt cggtgcccgt cactggttga gtcggtccag accggcgccg 791101 acgacgtcaa acgcggcccc gagcgcttcg gtcaaggaga cggattcgtc gcgcagccaa 791161 tgttcatagg cgctcagagc gacccccagc attgtccagg cgacggtttg gggcataaag 791221 tctgtcgtct ttccacccga tctgcgggca acgaatttgg cgatcacctc gcgccagcca 791281 gcatacatgg tcatcgaata ggcctgcagt tcaggagttt gcaagatgac ccgcatgcgc 791341 ttgcggtgtc ggatggtttc ggattcgtca aaggtgttga aggccaacag cgctgcgcgc 791401 aacgcgtccc tcagctgaat ccgtgaatcg atattgtcga gtagaccttg tagctgtgca 791461 aggtgggtgc tgaagtcacc ccaggggatg gcgttcttgg aggcgtagta gcgaaacaac 791521 gttctgcggg cgatgccggc cgcccgggcg atgtcgtcca cgctgacatc ggtgaaaccg 791581 tgggcagcga acagttcgat ggcaacatcg ctgatgtggt gcggtgtggt tgagcgccgt 791641 cggcccaccc gcgactcgtg cggcatcaca ttcgcccttc catttcggca ctcgatgcca 791701 tattgtgtcc agatcgacgg atcgctgtcg agacctgctg gcgaaaggca atccagatgg 791761 actacgaaac cgataccgac accgagcttg tcaccgagac cctggttgaa gaggtgtcca 791821 tcgacggaat gtgtggggtt tactgaccgt gccggcgccc gcgcaggctc gccgggctga 791881 ttccagcgaa ttcgatcccg atcgcggctg gcgactacac ccacaggtgg cggtccggcc 791941 ggagcctttt ggcgcgctgc tctatcactt cggcacccgt aagttgtcat ttctgaaaaa 792001 tcgcaccatc ctcgcggtgg tgcagacgct ggcggattat cccgatatcc ggtcggcctg 792061 ccgcggcgcc ggcgtcgacg actgtgacca ggatccgtac ctgcacgccc tgagtgtgct 792121 cgccggttcg aacatgctgg ttcctcggca gacaacatga cgagccccgt accccgactc 792181 atcgagcagt tcgagcgggg gctcgacgcg ccgatctgcc ttacctggga gctgacctac 792241 gcctgcaacc tagcttgcgt gcactgcctg tcgtcctcgg gcaaacgcga tcccggcgag 792301 ttgtccaccc gccaatgcaa ggacatcatc gacgaactgg aacgcatgca ggtgttctac 792361 gtgaacatcg gcggcggcga accaaccgtg cgcccggact tttgggagct ggtagattac 792421 gccaccgcac accacgtcgg ggtgaaattc tccaccaacg gggtccggat cacccccgag 792481 gtggccacgc ggctggcagc caccgactac gtcgacgttc agatctcact cgacggcgcc 792541 acggccgagg tcaacgacgc catccgcggc accgggtcgt tcgacatggc ggtgcgcgcg 792601 ctgcagaacc tggcagcggc gggatttgcc ggcgtcaaga tctcggttgt gatcacccgg 792661 cgcaacgtcg cccagctcga cgaattcgcc acgctggcaa gccgttacgg agcgacgttg 792721 cggataacca ggttgcgacc gtccgggcgc gggactgacg tatgggccga cctgcacccc 792781 accgccgacc agcaggtgca gctttacgac tggctggttt ccaaaggaga gcgggtgctc 792841 accggcgatt ccttcttcca cctggcgccg ctcggccagt cgggggctct ggccggcttg 792901 aacatgtgcg gagccgggcg ggtagtgtgc ctgatcgacc cggtgggtga cgtgtatgcg 792961 tgcccattcg ccattcatga ccacttctta gccggaaacg tgttgtccga cggcggattt 793021 caaaatgtct ggaagaactc gtcgctgttt cgcgagctcc gggagcccca gtccgcaggc 793081 gcctgtggca gctgcggaca ctacgacagc tgccggggcg gctgcatggc ggcgaaattc 793141 ttcaccggcc tgccgctgga cgggccggat cccgaatgcg tgcaaggcca tagcgagccg 793201 gcgctggcgc gcgagcgcca cctaccgcgg ccccgcgccg accactcccg cggtcggcgc 793261 gtcagcaaac cggtgcccct gacgctgtcg atgcggccac ccaagcgccc gtgcaatgaa 793321 agtccggtgt agccgtggcc gaagcgtggt ttgaaacggt agccatcgcg cagcaacgcg 793381 cgaagcggag gctgccgaaa tcggtttact cgtccctgat tgcggccagt gaaaagggaa 793441 tcacggtcgc cgacaatgtc gcagcattca gcgagctcgg gttcgcgccg cacgtcatcg 793501 gggcgacaga taaacgtgac ttgtcgacga ccgttatggg gcaagaagtt tcgttgccag 793561 tgattatttc gccgaccggt gttcaggcgg tcgatcccgg cggtgaagtc gccgtcgcgc 793621 gggccgcggc cgcccggggt actgtgatgg gattgtcctc gtttgccagc aagccgatcg 793681 aggaggtcat tgccgccaac cccaagacct tcttccaggt ctactggcag ggcgggcgcg 793741 acgcgctcgc tgaacgcgtc gaacgggcgc ggcaggccgg cgcggtcggc ctggtcgtca 793801 ccaccgactg gacgttctcg cacgggcgcg actggggcag ccccaagatc cccgaagaga 793861 tgaacttgaa gaccatcctg cggctatccc cggaggcgat cacccggccg aggtggttgt 793921 ggaagttcgc caagacgcta cggccaccgg acctacgggt gcccaaccag ggccggcgcg 793981 gcgagcccgg cccaccgttc ttcgcagcct acggcgaatg gatggcaaca cctccgccga 794041 cctgggaaga tatcggctgg ctgcgcgaac tgtggggcgg accgttcatg ctcaagggcg 794101 tcatgcgggt cgacgatgcc aaaagagctg tggatgccgg ggtttcggcg atctcggtat 794161 ccaaccatgg tggcaacaat ttggatggga cgccagcatc gatccgggcc ctgcccgcgg 794221 tctcggcggc ggtcggcgat caggtcgaag tgttgctcga cggcggcatc cggcggggca 794281 gcgatgtcgt caaggcggtg gcgctgggcg cgcgcgcggt aatgattggt cgcgcttacc 794341 tgtggggctt ggccgccaac ggccaagccg gggtcgagaa tgtactcgac atcctgcgcg 794401 gtggtatcga ctcggctctg atgggtctcg ggcatgcctc tgtccatgac ctcagcccag 794461 ccgacatcct cgttcccacc gggttcatcc gcgacctggg tgtgccctcc cgacgggacg 794521 tttagccgga tgttgagctg ggcccaaatt ggggttggcc ctcccattac cacagagatg 794581 ctcgcgacgg aatgacgttt ttagaaattc tgatacgggc gtggcagccg tggcgggcga 794641 gcaggatgtg tggccggtaa gtcatcacga cgaaaaaaat ttcggtagaa gacataacaa 794701 ttggtgcacg ccaggtgaat tcgtcctacc atcggcgagt gccggtagtc ggggaactcg 794761 ggagtgcgac gtcgagccag ctaccaagca cgtcgccgtc gatagtgatc ccgctggggt 794821 ccaccgagca gcacggtccc cacctgccgt tagataccga tacccggatc gcgaccgccg 794881 tggcccggac cgtcaccgcg aggctgcacg ccgaggacct gcccattgct caggaggaat 794941 ggctgatggc gcccgccatt gcctacggcg ccagcggcga acaccagcgt ttcgctggaa 795001 cgatctctat cggcactgaa gccctgacga tgttgctcgt ggagtatggc aggtcggccg 795061 cctgctgggc ccggcgcctg gtcttcgtca acgggcacgg cggcaatgtc ggcgctttga 795121 cccgagcggt aggcctgctg cgcgctgaag gtcgcgacgc cggatggtgc ccgtgcacct 795181 gcccgggcgg tgacccccac gccggccaca ccgaaacatc cgtgctgctg catctttcgc 795241 cggccgacgt gcgcaccgaa cggtggcgcg cgggtaatcg cgcaccgctg cccgtgttgt 795301 tgccgtcgat gcgccgaggc ggggtcgcgg ccgtgagcga gacaggagtg ctcggggatc 795361 cgaccacggc gaccgcggcc gaggggcggc ggatcttcgc ggcgatggtc gacgactgtg 795421 tgcgccgagt cgcccggtgg atgccacagc ccgacgggat gttgacatga ccgcgccggc 795481 gacgatgcag agcgaagcga tgaggagaag cggcgcagat gaccgcgacc cgactgcctg 795541 acgggttcgc cgtccaggtt gaccgtcgcg tgcgagtgct tggcgacggc tcggccctgc 795601 tcggtggctc accgacccgg ttgctgcggc tggctcccgc cgcacgaggc ctgctctgtg 795661 acggccgcct taaggtccgc gacgaggtca gcgcggagct ggcccgcatc ctgctggacg 795721 ccacggtggc gcatccacgg ccgccgagtg ggccgtcaca tcgtgacgtc accgtcgtta 795781 taccagtacg gaacaacgca tctggtctgc ggcgtctggt gacctcgtta cgcggattac 795841 gcgtcatcgt ggtcgacgac ggttcggcgt gcccggtcga gtcggacgac tttgtcggcg 795901 cacattgcga catcgaagta ctccaccacc cccacagcaa ggggccggcc gcggctcgca 795961 acaccgggct agcggcctgc accaccgact tcgtggcgtt cctggattcc gacgtgacgc 796021 cgcggcgggg atggttggaa tccttactcg gccacttctg cgatcccacc gtcgcactcg 796081 tcgcacctcg catcgtcagc ttggtggaag gcgagaaccc ggtagctcgc tatgaggccc 796141 tgcactcgtc gttggacctt ggtcagcgcg aagcgccggt gttaccgcat agcacagtct 796201 cttacgtgcc gagcgccgcc atcgtttgcc ggagttcagc catccgcgac gtcggcggct 796261 tcgacgagac catgcactcc ggggaagatg tcgacttgtg ctggcggctc atcgaggctg 796321 gtgctcggct gcgctacgag ccaattgcgc tggtcgccca tgaccatcgg acccaattgc 796381 gggactggat cgcgcgcaag gcgttttacg gcggttcggc ggctccgcta gctgtgcggc 796441 acccggacaa gaccgcgccg ctggtgattt cgggcggggc gctgatggcg tggatcctca 796501 tgtcgatcgg cacaggcctt ggtcgactgg cgtcgttggt gatcgcggtg ctgactggtc 796561 gccggatcgc cagggccatg cgctgcgccg agacgtcgtt cttggatgtg cttgccgtcg 796621 ccacccgcgg gttgtgggcg gccgcgctgc agctggcgtc ggccatctgc cggcactatt 796681 ggccactggc attgctcgcg gccatcctgt cgcgccgctg taggcgggtg gtgttgattg 796741 cggcggtagt ggacggtgtg gtggattggc ttcgccgcag ggagggcgcc gacgatgatg 796801 ctgaaccgat tgggccgctg acctacctag tgctgaagcg cgtggacgac ttggcttatg 796861 gcgctggcct gtggtacggg gtggtgcgcg aacgtaacat cggcgcgctc aagccgcaga 796921 ttcgtaccta gtgtgactgc ggcggtccgg catagcgatg tgctggtcgt cggtgctgga 796981 agtgctggat cggttgttgc cgagcgtctt tccatggact cgagctgtgt ggtgaccgtg 797041 cttgaggctg gccccgggct ggccgatccg gggttgctgg ctcagacggc caatgggttg 797101 caactgccga tcggagctgg cagccctctg gttgagcgtt atcggacgcg gctcaccgat 797161 cgaccggttc gccacttgcc gatcgtgcgg ggtgcgacgg tcggcggttc cggcgcaatc 797221 aacggcggct atttctgccg cggactgccc agcgatttcg accgtgcctc gataccaggc 797281 tgggcatggt ctgacgttct ggagcacttc cgggctatcg agacagatct ggatttcgag 797341 acgcctgtgc atggccgtag tggccccatc ccagttcgcc gcacacacga aatgactggc 797401 atcactgaaa gtttcatggc tgccgcagag gacgcagggt tcgcttggat cgctgacctc 797461 aacgatgttg ggccggaaat gccttcgggt gtaggcgcgg tcccgctcaa catcgttaac 797521 ggcgtacgca ccagctcggc ggtcggctat ctgatgcccg cgctgggacg gccgaatctg 797581 acactgctgg cccggacgcg ggcggtgcgg ttgcgctttt ccgccaccac cgcggtgggt 797641 gtcgacgcga tcggcccagg aggcccggta agcctgagcg ctgaccgaat cgtattgtgc 797701 gccggagcga ttcagtcagc tcatctgttg atgctctcgg gcgtcggcga ggaggaggtg 797761 ttgcgatccg ccggtgtgaa ggtgcttatg gcgttgccgg ttggcatggg ctgcagtgac 797821 cacccggaat gggtgatgcc gaccaactgg gcggtggctg tcgatcggcc ggtgttagag 797881 gtgctgctga gcactcatga cggcatcgaa ataaggccgt acacaggcgg cttcgttgcg 797941 atgaccggcg acggtacagc cgggcatcgc gattggccgc atatcggggt ggcgctcatg 798001 cagccgcggg cacgcggacg catcacgttg gtctcgagtg atccccagat accagtccgc 798061 atcgagcacc gatacgacag tgaacctgcc gatgtcgcgg ccctgcgcca gggtagcgca 798121 ttggcccacg aattatgcgg tgcggcaacg cgcatcggtc cagccgtatg ggcgacatcg 798181 cagcatctgt gtggtagtgc cccaatgggc accgacgatg acccacgagc cgtcgtcgac 798241 ccgaggtgtc gggtccgcgg catcgaaaac ctatgggtga tagacggatc tgtccttccg 798301 tcgatcacca gtcgcggtcc acacgcaacg atcgtaatgc tgggccaccg cgcggccgaa 798361 tttgttcagt gactttcgtc gagtggggcg accacagcgg tcgctgccga atgtgcattt 798421 cggtcaggca ttgagcaggg gaccgaatag cgtagctccg catcggactg cagtcgtcag 798481 gtcgacgatg atggcgctga catcggaggt gggccgcggc ccaggcttcg cggtttggcg 798541 gcctgcgaag aagtggctct tctgacactt ccgtgggtgg acttctggtt tgagtaggcg 798601 cacgtcgttg tcgcttaggg tttctggctt gtcaaaggac aggaccagcg cagatcactg 798661 tagtcttagc tgatgctgcc gcccggattg ccgacgtcgt ggcccagcgg tgccccaacg 798721 cggtccgccg cgtcgatcct ttccacgtgg tggcctgggc caccgaggct ctagaggctg 798781 aacggcgccg ggcctgaaac gacgcgcgag cgcccgcccg gaccccgagt cattgggtcg 798841 caggggtaac cgaagggtgc acgttgaccg cgtgaggcta accggcaccg agcgtgaact 798901 gagggcggag aatcagagcc ccccgatttt ccgcccgcag aacacgttgg gcgacggcgc 798961 caacgggctg ccactggccg tgtgcaccac gacggctcac acgtgccaca cttcccatac 799021 tcacccatcg cggtggaccc caaacccagt gccggccacc aagggcgtcc ccgctggatt 799081 ggtgcaagca accttcatca tcgaaaacct tgaccccggc aacaacgaca cgccgacccc 799141 ccctacaccc aaactgcgat tagcccgaaa acctgggcac cataggcgat ctgaatacga 799201 tgcggattcg gtgctgcgga gaaaggatac atcgcgccga tgcgtccagg cggatgacgt 799261 ccgatgcgtg cagctggtcc aggatccgcg gcgcggacgt gtcgaactcg gtggttaccg 799321 cgccgagctt actgttggcc gacgggcggc ggtgaattgc caacgcccgc aatatggtgc 799381 ggatggatgg cccgttcggt tgggttgcgg ggtaggcggc gccgcgcgag gcgatcagcg 799441 ctgaggtcgg gaattcacct ccggtcgcgg gagtacagcg gtcggctggg gtgccgccgg 799501 tgtctgtcgg gtagaggcgg caggacacgc tcgccgtcaa aacggcttcg gcaaacgggt 799561 cttcgccgtc gacaggcagg gttggtgatc ccggcctcgg cggcgacggt ctggtcattg 799621 attgtgcgat gggtgatcgt cgtgtcgatc tgctcgcggc gaaggactcg gagatccggc 799681 gctcgatggg ggcagtaccg gtcggcgcgg gaagctcgca ggtggcgacg agttgggcga 799741 gtgatcgttg catccgctgt cgggcggcga ttctgtcggc cgactgtgcc aacttggcta 799801 gggccaattc gcggggcggt ctggcagtcg gcgggtccgc tgtcagctag ctgcagcagc 799861 tcctcgatct cgtcgaggct gaatccatgg gactacgcgc gtctgacgga cgagaccacc 799921 gataccgcat cggcgcgata ccgcgatagc cctgagaacg accgggccgg cgcggctagc 799981 aggtcctccg ctcgtaatag cgcagcgtct ggccgttgat cccagcccgc gcggcaacct 800041 ggctactccg cattccgcca ttccgaaccc tgtactcgac tgtcgagtca agggtgtgtg 800101 ttgtcagtgc cgggtcaggt gccgatagca accggccgcc cgccgctgca cccgagccag 800161 cggcgatttg gccgaagcgg tgatctgggg catactgttc aggttgccct gcgccaggtt 800221 tgcgtctggc cgggcatacg accagcgccc acacgggcgc gtccggtccc acccttgaag 800281 cgcgacgatt tcggccttga aattgatcgc gacaaggctg tgtgcgggcg acacgcccga 800341 gcgcggggcc ggtggaccta cgacaggtaa acagcggcgc agtattcggc gcaacgctag 800401 atcggtccag aaggaccggg tcgatcggcg cgccggggag caccggaccc ggatacgggc 800461 tcgagtggga gtgaggtagg agaagcgtgg cgggacagaa gatccgcatc aggctgaagg 800521 cctacgacca tgaggccatt gacgcttcgg cgcgcaagat cgtcgaaacc gtcgtccgca 800581 ccggtgccag cgtcgtaggg ccggtgccgc taccgactga gaagaacgtg tattgcgtca 800641 tccgctcacc gcataagtac aaggactcgc gggagcactt cgagatgcgc acacacaagc 800701 ggttgatcga catcatcgat cccacgccga agaccgttga cgcgctcatg cgcatcgacc 800761 ttccggccag cgtcgacgtc aacatccagt aggagattgg acagagcaat ggcacgaaag 800821 ggcattctcg gtaccaagct gggtatgacg caggtattcg acgaaagcaa cagagtagta 800881 ccggtgaccg tggtcaaggc cgggcccaac gtggtaaccc gcatccgcac gcccgaacgc 800941 gacggttata gcgccgtgca gctggcctat ggcgagatca gcccacgcaa ggtcaacaag 801001 ccgctgacag gtcagtacac cgccgccggc gtcaacccac gccgatacct ggcggagctg 801061 cggctggacg actcggatgc cgcgaccgag taccaggttg ggcaagagtt gaccgcggag 801121 atcttcgccg atggcagcta cgtcgatgtg acgggtacct ccaagggcaa aggtttcgcc 801181 ggcaccatga agcggcacgg cttccgcggt cagggcgcca gtcacggtgc ccaggcggtg 801241 caccgccgtc cgggctccat cggcggatgt gccacgccgg cgcgggtgtt caagggcacc 801301 cggatggccg ggcggatggg caatgaccgg gtgaccgttc ttaacctttt ggtgcataag 801361 gtcgatgccg agaacggcgt gctgctgatc aagggtgcgg ttcctggccg caccggtgga 801421 ctggtcatgg tccgcagtgc gatcaaacga ggtgagaagt gatggctgcg caagagcaga 801481 agacactcaa aatcgacgtc aagacgccgg cgggcaaggt cgacggcgct atcgagctgc 801541 cggccgagct gttcgacgtc ccggccaaca tcgcgctgat gcaccaggtg gtcaccgccc 801601 agcgggcggc ggcacgccag ggtacccact cgacgaagac gcgcggcgag gtcagtggcg 801661 gtggccgcaa gccctaccgg cagaagggga ccggtcgtgc ccggcagggc tcgacgcggg 801721 cgccgcagtt caccggcggt ggcgtggtac acggtcccaa gccgcgcgac tacagccagc 801781 gcacacccaa gaagatgatc gccgcggcgc tgcgcggggc gctgtccgac cgggcccgca 801841 acgggcgtat ccacgcgatc accgagctag tggaaggtca aaacccgtcg accaagagcg 801901 ccagggcatt tctggccagc ctgacagaac gtaaacaggt gctggtggtc atcgggcgca 801961 gcgacgaggc cggcgcgaaa agcgtgcgca atctgccggg cgtgcacatc ctggcgccgg 802021 accagctcaa cacctatgac gtgctgcgtg ccgacgacgt ggtgttcagc gttgaggcgc 802081 tgaatgccta tatcgcggcc aacaccacga cgtccgagga ggtttcggcc tgatggcgac 802141 gctcgctgac ccccgcgaca tcatcctggc cccggtgatc tcggagaaat cctatgggtt 802201 gctggatgac aacgtgtaca cgtttttggt gcgcccggat tccaacaaga cgcagatcaa 802261 gatcgccgtc gagaagattt ttgccgtcaa ggtcgcatcg gtgaacaccg cgaaccggca 802321 gggcaagcgt aaacgcaccc ggaccggata cggcaagcgc aagagcacca agcgcgccat 802381 cgtcaccctg gcgccgggca gcaggccgat cgacctgttc ggggcaccgg cctagcccgg 802441 cgacgatgca gagcgaagcg atgaggagga gcagggcaat gcggcctagc ccggcgacga 802501 gagcgtgaga gaaagacctg attagacatg gcaattcgca agtacaagcc cacgacgcct 802561 ggtcgtcgcg gcgccagcgt atctgatttc gccgagatca cccggtcaac cccggagaag 802621 tcgctggtgc gcccgctgca cggtcgcggt ggacgcaacg cgcatggccg gattaccacc 802681 cggcacaaag gcggcggtca taagcgcgct taccggatga tcgactttcg ccgcaatgac 802741 aaagatggtg tcaacgccaa ggtcgcgcac atcgagtacg acccgaaccg taccgcacgg 802801 attgcgttgc tccactatct cgatggggag aagcgctaca tcattgcacc caacggactt 802861 tcgcaagggg atgtggtgga atccggcgct aacgccgaca tcaagccggg caacaacctg 802921 ccattgcgca acatcccggc cggtaccttg atccacgccg tggagctccg cccgggaggt 802981 ggcgctaagc ttgcgcgctc ggccgggtcg agcatccagc tgctcggcaa ggaggccagc 803041 tacgcgtcgc tgcgtatgcc cagcggtgag atccgccggg tcgacgtccg ctgccgcgcg 803101 accgtcggcg aagtgggcaa tgccgagcag gcaaacatca actggggcaa ggccggtcgg 803161 atgcggtgga agggcaagcg cccgtcggtc cggggcgtgg tgatgaaccc ggtcgaccac 803221 ccgcacggcg gtggtgaggg taagacctcc ggcggccgtc acccggttag cccgtggggc 803281 aagcctgagg ggcgtacccg caatgcgaac aagtcgagca acaagttcat cgtccgacgc 803341 cggcgcaccg gcaagaagca ctcgcgttag ccgcgcaatc agatctaggg agtttcagga 803401 gtagccaacc atgccacgca gcctgaagaa gggcccgttc gtcgacgagc atctgctcaa 803461 gaaggtcgat gtccagaacg agaagaacac caagcaggtc atcaagacct ggtcgcgtcg 803521 gtcgaccatc attccggact tcatcggcca tacctttgcg gtgcacgacg gccgcaagca 803581 cgtccccgtg ttcgtcaccg aatcgatggt gggccacaaa cttggtgagt tcgcgccgac 803641 acgcaccttc aagggccaca ttaaagacga ccgaaagagc aagcggcgat gactgcggct 803701 actaaggcta ccgagtatcc ctcggcggtc gccaaggccc gatttgtgcg ggtgtcgcca 803761 agaaaggcgc gccgggtgat cgatctggtg cgtggcaggt cggtgtcaga cgcgctcgac 803821 atcctgcgct gggcgccgca ggccgccagc ggtccggtgg ccaaagtgat cgccagtgcg 803881 gcggccaacg cgcaaaacaa cggcgggctg gacccggcaa ccttggtggt ggccaccgtg 803941 tacgccgacc agggaccgac cgccaagcgc atccgtccgc gcgcccaggg ccgcgcgttc 804001 cgcatccgcc ggcgcactag ccacatcacg gtggtggtgg aaagccggcc ggccaaagat 804061 caacggtcgg cgaaatcgtc gcgggcccgc cgcaccgagg ccagcaaggc cgccagcaag 804121 gtcggggcta cggcgccggc caagaaagcg gccgccaaag cgcccgccaa gaaggcaccc 804181 gccagttccg gcgttaagaa gacacccgca aagaaagcgc ccgccaagaa ggcgcccgcc 804241 aaggcttctg agacttctgc agcgaaggga ggctcagact agtgggccaa aagatcaatc 804301 cgcatggctt ccggctgggc atcaccaccg actggaagtc gcgctggtat gccgacaagc 804361 agtatgccga gtacgtcaag gaggacgtgg cgatccgccg gctgctgtcc agtggcctag 804421 agcgtgctgg gatcgccgat gtagagatcg agcggacccg cgaccgggtc cgggtggaca 804481 ttcacaccgc gcgtccgggc atcgtcattg gtcggcgtgg gaccgaggcc gaccggattc 804541 gtgccgacct ggaaaagctg accggcaagc aggtccagct caacatcctg gaggtcaaaa 804601 acccggagtc gcaagcgcaa ttagtggccc agggggtagc cgagcagttg agcaaccggg 804661 tggcgttccg ccgcgcaatg cgcaaggcga tccagtcggc gatgcgtcag cccaacgtca 804721 agggaatccg ggtgcagtgc tcgggccgcc tcggcggcgc ggaaatgagc cgctcggagt 804781 tctaccgcga gggccgcgtc ccgctgcaca ccttgcgggc agatatcgac tacggcctat 804841 acgaggccaa gaccaccttc ggccggatcg gtgtgaaggt gtggatctac aagggtgaca 804901 tcgtgggcgg caaacgtgaa ttggctgccg ccgcgccagc gggcgccgac cgtccgcgcc 804961 gtgagcggcc gtcgggcacg cgcccccgtc gcagcggtgc ttcgggcacc acggcgaccg 805021 gtaccgacgc gggtcgggcc gcgggtggcg aagaggccgc gcctgacgcc gcagcgcccg 805081 ttgaagcgca gagcacggag agctgaatca tgttgattcc ccgtaaggtt aaacatcgca 805141 agcagcacca tcctcgccag cgcggcatcg ccagcggcgg caccacggtg aacttcggcg 805201 actacggcat tcaggccctt gagcacgcct atgtcaccaa ccggcagatc gaatcggcgc 805261 gtatcgccat caaccggcac atcaagcgtg gcggcaaggt ttggatcaac atcttccctg 805321 accgcccgct gaccaagaag cccgccgaaa cccgcatggg ttcgggcaag ggctcgccgg 805381 agtggtgggt agccaacgtt aagccgggcc gggtgctgtt cgagctcagt taccccaatg 805441 aaggtgtcgc ccgggccgcg ctcacccgag cgatccacaa gctgccgatc aaggcacgca 805501 ttattactcg agaggagcag ttctgatggc agtgggtgtc tcgccgggcg aactgcgtga 805561 gctcaccgac gaggagctgg ccgagcggtt gcgcgagtcc aaggaagagt tgttcaactt 805621 gcgtttccag atggcgaccg gccagctcaa caataaccgc cggctccgta cggtgcgtca 805681 ggaaatcgcg cgcatctaca ccgtgctgcg cgaacgagaa ctgggtctgg cgactgggcc 805741 cgatggtaag gaatcgtgat ggcagaggct aagaccggcg cgaaggcggc gcctagggtg 805801 gctaaggccg ccaaggcggc ccccaagaag gccgcaccca acgacgctga ggccataggt 805861 gcggccaacg cggcaaacgt taaggggccc aagcacactc cgcgtactcc gaagccacgc 805921 ggccgccgca agacacgaat cggctatgtg gtgagcgaca aaatgcagaa gaccattgtg 805981 gtggagctgg aagaccgcat gcggcacccg ctatacggca agatcatccg gaccactaag 806041 aaggtcaagg cacacgacga agacagcgtt gccggcattg gcgaccgtgt ctcgctgatg 806101 gagacgcgtc cgctgtcggc gaccaagcgc tggcggctcg tcgagatcct cgagaaggct 806161 aagtaagcct gacgagcagt cgcaaaagcc cccgacacgc gcggcgtgcg ggggcttttg 806221 cgactgctcg cccaaccagc gcggcgtcag tgcggaaatc ctcagctgat tcctaccctg 806281 tgcgtgtagt gtacacaacc gttcattaac tccacgggga agtgaggctg gcttatggca 806341 cccgaggcca ccgaggcgtt caacggcacc atcgagctgg atattcgtga ttcggagccg 806401 gattggggcc catacgcagc gccggtggca ccggagcact caccaaacat cctgtatctg 806461 gtctgggacg acgtcggcat cgcgacctgg gactgctttg gcggcctggt cgagatgccc 806521 gcgatgacgc gcgtcgccga gcgtggcgtg cgactgtcgc aatttcacac caccgcactg 806581 tgctcgccga cccgggcgtc gctgctgacc ggtcgcaacg ccaccaccgt aggcatggct 806641 accatcgaag agttcaccga cgggttcccc aactgcaacg ggcggatccc ggctgacacc 806701 gcgttgctcc cagaggtgct ggccgaacat ggctacaaca cctactgtgt gggcaagtgg 806761 cacctgacgc cactcgaaga atccaatatg gcgtcgacga agcggcactg gccgacctcg 806821 cgtgggttcg agcggttcta cggattccta ggcggggaga ccgaccagtg gtatcccgac 806881 ctggtatacg acaaccaccc agtgagtcct cccggcacac ccgagggtgg ctaccacctg 806941 tcaaaagaca tcgccgacaa gacgatcgag ttcattcgtg atgccaaggt gatcgcgccc 807001 gacaagccgt ggttcagcta cgtgtgccca ggcgccgggc atgcgccgca ccacgtcttc 807061 aaggaatggg cggacagata cgccggccga ttcgacatgg ggtatgagcg ctatcgcgag 807121 atcgtgctgg aaaggcaaaa ggcgctaggg atcgtgccac ccgacaccga actgtcgccc 807181 ataaaccctt atctggatgt gccggggcca aacggcgaga cctggccgct gcaggacacg 807241 gtgcggccgt gggactcgct gagcgatgaa gaaaagaagc tgttttgccg gatggccgag 807301 gtgttcgccg gctttctgag ctacaccgac gcccagatcg gacggatcct ggactacctc 807361 gaggaatccg gccagctgga caacaccatc atcgtggtga tctccgacaa cggcgccagc 807421 ggcgagggcg gacccaacgg atcggtcaac gaaggcaagt tcttcaacgg ctacatcgac 807481 accgtcgctg aaagcatgaa gctcttcgac cacctcggtg gcccgcagac ctacaaccac 807541 taccccatcg ggtgggcaat ggccttcaac accccctaca agctgttcaa gcgctacgcc 807601 tcgcatgaag gcggcattgc cgacccggca atcatctcct ggcccaacgg cattgccgca 807661 cacggtgaaa tccgcgacaa ctacgtcaat gtcagcgaca tcacgcccac cgtctacgac 807721 ctgttgggca tgacaccgcc ggggaccgtc aaggggattc cgcagaaacc gatggacggc 807781 gtgagcttca tagcggccct tgccgacccg gccgccgaca ccggcaagac cacccagttc 807841 tacaccatgc tgggcacccg cgggatctgg catgaaggtt ggttcgccaa caccattcac 807901 gcggccacgc ccgccggctg gtcgaatttc aacgctgacc gctgggaact gttccacatc 807961 gcagcagacc gcagccagtg ccacgacctg gccgccgagc atcccgacaa acttgaggag 808021 ctcaaggcgc tgtggttctc cgaagccgcc aagtacaacg ggctgccgct ggccgatctg 808081 aacctcctgg aaacgatgac tcggtcgcgg ccttacctgg tcagcgaacg agccagctac 808141 gtctactatc ccgactgcgc tgacgtcggc atcggcgcgg ccgtagagat tcgcgggcgc 808201 tcgttcgccg tgctggccga tgtgaccatc gataccaccg gcgccgaggg cgtgctgttc 808261 aagcacggcg gcgcccatgg cgggcacgtg ctgttcgtcc gggacggacg cttgcactac 808321 gtctacaact tcctcggtga gcgccagcag ctggtcagct cgtcgggtcc ggtcccgtcg 808381 ggaagacatc tactcggggt tcgttatttg cggaccggaa ccgtgcccaa cagtcacacg 808441 ccggtgggcg atcttgagct gttcttcgac gagaacctgg tcggcgccct gaccaatgtg 808501 ctgacccacc ctggaacgtt cgggttggcc ggcgccgcta tcagcgttgg ccgcaacggc 808561 ggttcggctg tgtccagcca ctacgaagcg ccgttcgcgt tcaccggcgg taccatcacc 808621 caggtcaccg tcgacgtgtc aggccgaccg ttcgaagatg tggaatccga tcttgcgctt 808681 gctttttcgc gtgactgagc ggtctgctgt gacgcgggac ggcgtggtcg gcatacgctg 808741 aagtcgtgct gaccgagttg gttgacctgc ccggcggatc gttccgcatg ggctcgacgc 808801 gcttctaccc cgaagaagcg ccgattcata ccgtgaccgt gcgcgccttt gcggtagagc 808861 gacacccggt gaccaacgcg caatttgccg aattcgtctc cgcgacaggc tatgtgacgg 808921 ttgcagaaca accccttgac cccgggctct acccaggagt ggacgcagca gacctgtgtc 808981 ccggtgcgat ggtgttttgt ccgacggccg ggccggtcga cctgcgtgac tggcggcaat 809041 ggtgggactg ggtacctggc gcctgctggc gccatccgtt tggccgggac agcgatatcg 809101 ccgaccgagc cggccacccg gtcgtacagg tggcctatcc ggacgccgtg gcctacgcac 809161 gatgggctgg tcgacgccta ccgaccgagg ccgagtggga gtacgcggcc cgtggcggaa 809221 ccacggcaac ctatgcgtgg ggcgaccagg agaagccggg gggcatgctc atggcgaaca 809281 cctggcaggg ccggtttcct taccgcaacg acggtgcatt gggctgggtg ggaacctccc 809341 cggtgggcag gtttccggcc aacgggtttg gcttgctcga catgatcgga aacgtttggg 809401 agtggaccac caccgagttc tatccacacc atcgcatcga tccaccctcg acggcctgct 809461 gcgcaccggt caagctcgct acagccgccg acccgacgat cagccagacc ctcaagggcg 809521 gctcgcacct gtgcgcgccg gagtactgcc accgctaccg cccggcggcg cgctcgccgc 809581 agtcgcagga caccgcgacc acccatatcg ggttccggtg cgtggccgac ccggtgtccg 809641 ggtagtgcca acttcgcatg aggaactgca cacccagcag ggcgtcagtc ggcgcgacga 809701 gtcactcccg ggggctacgc atgaattcga ctaccggagc gggcctggct gggcgtgggc 809761 gcgcgcagtt gtacggcccc aacggcgtgt cgctgtacaa acacacgccc tcgctggtcc 809821 ggttgcccca aaaagccaag ccccccaaac cagttgctcg ccagcaatga cgccggttgc 809881 taccatctga ctccgtgtcg cttcccgggg caggactggg gcagtgggtt atccggtgat 809941 gaccgatggc cggtagcgac ccaccaacag gtgggccggc gtcgcaggcg ggttcagacg 810001 cgggagcctc gccagaacac aaacacatgt cgcggcgaaa gcacctcgtg ctcgatgtct 810061 gcatcatcct gggtgttctc attgcctacg tcttttcgct gctcggctac gactggttgg 810121 cccacacacc gggtccgctt ccgcagccgg acgtgggcac gactgacgac accgtggttt 810181 tgatccgctt cgaggagctg cacactgtgg caaatcgcct cgatgtgaaa gtgctggtgc 810241 tgcccgacga ttcgatgatc gaccatcgcc tccaagtgtt gactaccgac acctcggtgc 810301 ggttgtatcc ggagaacgaa ctcggagatc tgcagtaccc ggtaggaaag ctgcccgcgc 810361 aagtagcgac cacgatcgag gcgcacggca acccgggcgc ctggccattc gatacataca 810421 ccaccgatac ggtccaggcc gatgtgctcg tcggcgctgg cgacaaccgt caatacgtac 810481 ccgcccgggt cgaagtgacc ggatcgctgg aaggctggga catcagcgcc gtccgcgtcg 810541 gggaaagcag ccaaacctct gatcgcccgg acaatgtcat catcaccctg aagagggcca 810601 agggtccgct ggttttcgac ctgggcatct gcctggtgct gatcacattg ccgacgttgg 810661 ccttgttcgt ggccatccag atgattaccg gccgcagaaa attccaacca ccgttcggca 810721 cttggtacgc cgcgatgttg ttcgctgtcg tgccgctgcg cactattctc ccgggctcgc 810781 cgccggcggg tgcgtggatt gaccgggccg ttgtgatctg ggtgctcata gcgctggcgg 810841 cggcgatggt ggtgtacatc gtcgcctggt accgagaatc ggactaaggc gggcgtcaga 810901 tggcttctgt cgacgcgtcc ggagggtttc cgctggattt cataaacagg cgctagcgcg 810961 gtgtccaacg atacgattgg ggcccatgcg gcccgacgag atcggctcgc tgcgggccgg 811021 cctggcggct gttgcgcggt gaactcaaaa cgcgttgacg ccggatcagc tatccgatga 811081 ttcaggcgga gatctcgacg atcgtgggcg ctaccgccaa tccggtatcc gggtagatca 811141 tgatcgacat gggttgatct gccctggtgg ggcggactca cattagcgaa attttgcgct 811201 gagtaggtcg tcccctaaac ttcaggggtt gccgtgagca gacctcggcc ggcgcgcata 811261 agctttgctt ggtcggcccc gcgtgcccgt cggcgacaaa gaccgcgcac gtcagggatg 811321 gtcctggctg gctcctccta ccgtgcacac gtcaaccagg tcaggagatc tagtgattca 811381 gcaggaatcg cggctgaagg tcgccgacaa caccggcgcc aaggagatct tgtgcatccg 811441 ggtgctgggc ggttcgtcgc gacgctacgc cggcatcggt gacgtcatcg tcgccaccgt 811501 gaaggacgcc attccgggcg gcaacgttaa gcggggggat gtcgtcaagg ccgtcgtggt 811561 gcgcacagtc aaggaacgcc gacgtcccga cggcagctac atcaagttcg acgagaacgc 811621 cgcggtgatc atcaagcccg acaacgaccc gcgcggcacc cgcatttttg gaccggtcgg 811681 tcgcgagctg cgggagaagc ggtttatgaa gatcatttcg ctggccccgg aggtgttgta 811741 gatgaaggtc cacaaaggcg acaccgtgct ggtgatttcg ggcaaagata aaggggccaa 811801 gggcaaagtc ttgcaggcgt atccggaccg caaccgggta ttggtcgagg gtgtcaaccg 811861 gatcaagaag cacaccgcga tctcgaccac ccagcggggc gcgcgttcgg gtgggatcgt 811921 cacccaggaa gcgccgatcc atgtctccaa cgtgatggtg gttgactccg acggcaagcc 811981 cacccgaatc ggctatcggg tcgacgagga gaccggcaag cgcgtccgta tctccaagcg 812041 caacggcaag gacatttgat gaccactgca cagaaggttc agccgcgcct caaggagcgc 812101 taccgcagtg agattcggga tgcgctgcgc aagcagttcg gctacggcaa tgtcatgcag 812161 atcccgacgg tgacgaaagt cgtcgtcaac atgggtgtcg gcgaggccgc ccgggacgcc 812221 aagttgatca acggggcggt caacgatttg gcgctgatca ccgggcagaa gccggaagtc 812281 cgccgggcgc gcaagtccat cgcgcagttc aaattgcgtg agggcatgcc ggtgggcgtc 812341 cgagtcacgc tgcgcggtga ccggatgtgg gagttccttg accggctcac gtcgatcgca 812401 ctgccacgca tccgtgactt ccgtgggctt tcgcccaaac agttcgacgg tgtgggcaac 812461 tacaccttcg ggctggccga gcaggcggta ttccacgagg tcgacgtgga caagattgac 812521 cgggtccgtg gcatggacat caacgtcgtc acttccgcgg cgaccgacga cgaaggccga 812581 gcgctgttgc gggccctcgg ctttcccttc aaggagaact gagcagatgg cgaagaaggc 812641 actggtcaac aaggccgcag gcaaaccgag gtttgccgtg cgcgcctaca cccgttgcag 812701 caagtgcggc cgcccgcgtg cggtctaccg caagttcggg ctgtgcagga tttgcctgcg 812761 cgagatggcg cacgcgggtg agttgcccgg cgtgcagaag agcagctggt aacgggacac 812821 ggggactaga acatatgacc gcgctgacga cgatgcagtg ggggtacccc cagacgcgca 812881 gcggcgaggg ggccgcaagc gatgaggagg agtagcgctc gatgaccgcg ctgacgacga 812941 tgcagagcgc aagcgatgag gaggagtagc gctcgatgac gatgacggac ccgatcgcag 813001 actttttgac ccgtctgcgt aacgccaact cggcgtatca cgacgaggtc agcttgccgc 813061 actccaagct caaggccaac atcgcgcaga ttctcaagaa cgaggggtac atcagcgact 813121 tccgaaccga ggacgctcgg gtcggtaaat cgctggttat ccagctcaag tacggcccta 813181 gccgggagcg cagcatcgcc gggttgcggc gggtgtccaa gcccggcctg cgggtgtacg 813241 cgaaatccac caatctgccg cgggtgctcg gcggcctggg cgtggcgatc atctcgacct 813301 cctcgggcct gctgactgac cggcaggcag ctagacaggg cgtgggcggc gaagtcctcg 813361 catatgtctg gtgagagtgt ggtgagagga agcaaccatg tcgcgtattg gtaagcagcc 813421 gattccggtg cccgccgggg tcgacgtcac gatcgaggga cagagcatct cggttaaggg 813481 gcccaagggc accctaggac tgacggtcgc cgagccaatc aaagtggcac gcaatgacga 813541 cggcgctatc gtggtcaccc gtcccgacga tgagcggcgt aatcgctcct tacacgggct 813601 gtcccgtacc ctggtgtcca acctggtcac tggcgtgacg caggggtaca ccaccaagat 813661 ggagatcttc ggggttggct atcgggtgca gctcaagggc tccaatctgg agtttgcgct 813721 ggggtacagc cacccggtgg tgatcgaggc tcccgaagga atcacgttcg ccgtccaggc 813781 accgacgaag ttcaccgttt ccgggatcga caaacaaaaa gtcggccaga tcgccgccaa 813841 tatccgccgt cttcgccgtc ccgatccgta caagggcaag ggcgtgcgct acgagggcga 813901 gcagatccgc cgcaaggtcg gaaagacagg taagtagcca tggcgcaatc agtttccgcg 813961 actcgacgaa tctcccgcct gcgccggcac acgcggctgc ggaagaagct ctcgggcacc 814021 gcggagcgcc cgcggctggt ggtgcatcgg tccgcgcggc acatccacgt gcaactggtg 814081 aacgacctca acggcaccac cgtggccgcc gcttcgtcga tcgaggccga tgtgcgcggc 814141 gtgccgggtg acaaaaaggc ccgcagtgtg cgggtcggcc agttgatcgc cgagcgggcc 814201 aaagccgccg gcatcgacac cgtggtattc gaccgcggcg ggtataccta cggcggacga 814261 atcgccgcgc tggccgacgc cgcacgcgag aacggattga gtttctgatg aacgggagga 814321 ccgcataatg gcggagcagc cggccggaca ggcaggcact accgacaacc gtgacgcacg 814381 gggtgatcgg gagggccggc gccgcgacag cggccgcggc agtcgtgaac gggatggcga 814441 gaagagcaac tatctagagc gggtcgtcgc catcaaccgc gtctccaagg tggtcaaggg 814501 tggtcggcgc ttcagcttca ccgctttggt catcgtgggc gacggtaacg ggatggtcgg 814561 tgtcggctac ggcaaggcca aggaagtacc ggccgcgatc gccaagggcg tcgaagaggc 814621 gcgcaaaagc ttcttccggg taccgctgat cggcggcacc atcacgcacc cggtgcaggg 814681 cgaggcggcc gccggtgtgg tgttgctacg gccggccagc ccgggtaccg gtgtgatcgc 814741 cggtggtgcg gcccgcgcgg tgctggaatg tgcgggggtg cacgacatct tggccaagtc 814801 gctgggcagt gacaacgcga tcaatgtggt gcacgccacc gtggccgcgc tcaagctgct 814861 gcagcgtccg gaggaggtgg cggcgcgccg cggtttgccg atagaggacg tcgccccggc 814921 cgggatgctg aaggcgcgtc ggaaaagtga agcgctggcc gccagcgttt tgccggatag 814981 aacgatatag ccatgtcaca gctgaagatc acccaggtgc gcagcaccat cggagcacgc 815041 tggaagcagc gcgagagcct gcgcactctg ggcttacgaa ggattcgtca ttcggtgatc 815101 cgcgaagaca acgcagcgac tcgcggactg atcgcggtgg tgcgtcacct cgtggaggtt 815161 gagcccgcgc agaccggagg gaagacatag tgacgctcaa gctgcatgac ctgcgccccg 815221 cgcgggggtc caagatcgcc cgcacccgag tcggtcgagg tgacggctcc aagggcaaga 815281 cggccggccg tggcaccaag ggcaccaggg cccgcaagca ggtgccggtg accttcgagg 815341 gcgggcagat gccgatccac atgcggctgc ccaagctcaa gggcttccgt aaccggtttc 815401 gcaccgaata cgaaattgtc aacgtcggcg acatcaaccg gctgtttccg cagggtggtg 815461 ccgtcggcgt ggacgacctg gtggccaagg gggccgtccg caagaacgct ctggtcaagg 815521 tgttgggtga cggcaagctg accgccaagg tcgacgtgtc cgcgcacaag ttcagcggca 815581 gcgcgcgcgc gaagatcacc gcagcgggcg gttcagccac cgagctctag tttcgggcga 815641 gcagacgcaa aatgcccccg aaatgcccat tttcgggggc ttttgcgtct gctcgcgggc 815701 ccttggcggc cggtgggtac gctgggtgaa tatggttgcc tttctgcctt ccattcccgt 815761 tgtcgaggac ctacgcgccc tggtcggccg ggttgatacc gcccgccacc acggtgtacc 815821 caacggctgc gtgctcgaat tcaacctgcg atcggtgccg ccggagacga cgggcttcga 815881 ccctcttacg gtgctcaccg ggggtgggcg gccgatggcg ctgcgcgatg cggtcgccgc 815941 gatccaccgt gccgccgagg acccccgggt agccgggctg atagcccgcg tgcagcttcc 816001 gccctcgccg gcgggggcgg ttcaggagct gcgggaggcc atcgcggcct tcagtgcggt 816061 caagccgtcg ctggcctggg ccgaaactta tccgggcacc ctgtcctact atctggcttc 816121 ggcgttcggt gaggtctgga tgcaaccctc ggggagtgtg gggctggtcg gcttcgccac 816181 caacgccaca ttcctgcgcg acgccctgca caaggcgggc atcgaggccc agttcgtcgc 816241 ccggggcgaa tacaagtcgg cggcaaacct tttcaccgag gatggcttca cagacgccca 816301 ccgcgaagcg gtcacgcgga tgctggacag tctgcaggac caggtgtggc aggcggtcgc 816361 caagtcgcgc aatatcggcg tcgatgcgct tgatgagctg gctgaccggg ctccgctatt 816421 gcgggacgac gccgtgactt gcggtctgat cgaccggatc ggatttcgcg accaagccta 816481 cgcccgtatg gcggaattgg ttggtgtgga aaaaggttca ccggaatcca gtggctcgca 816541 aacaagccca gacgaaaagc cgccgcggat gtacctggcg cgctacgcca gttcggcccg 816601 gccacggctg acgccccccg tcccatcgat tcctggtcgc cggtccaagc cgacgatcgc 816661 ggtggtgacc ctggaaggcc cgatcgtcaa cggtcgtggt gggccccagt ttctgccgct 816721 cggtccgtcg agcgccggcg gtgacaccat cgcggcagcg ctgcgggagg tggccgccga 816781 cgattcggtg tcggcgatag tgctgcgggt cgacagtccg gggggctcgg tcaccgcatc 816841 ggagactatc tggcgtgagg tggccagggc ccgcgaccgt ggcaaaccgg tggtggcgtc 816901 gatgggtgcg gtcgccgcct ccggtggcta ttacgtgtcg atgggtgccg acgccatcgt 816961 ggccaacccg ggcaccatca ccgggtcgat cggtgtgatc accggaaagc tggtggttcg 817021 ggatctcaag gaccggttgg gtgtcgggtc ggatgcggtg cgcaccaacg ctaatgccga 817081 tgcctggtcg atcgacgcac ccttcacccc ggaccagcag gcccatcgcg aggcggaggc 817141 ggacttgttc tacagcgact tcgtggaacg cgtcgccgag ggccgcaaga tgactaccga 817201 cgccgtggac gtcgttgcgc gaggccgggt ctggaccggt gccgacgctc tcgatcgcgg 817261 cctggtcgac gaactcggcg gccttcgaac cgcggtgcgt cgcgcgaagg tgctagccgg 817321 actagatgag gacaccgagg ttcgcatagt cagttatccg gggtcgtcac tctgggacat 817381 ggtgcgaccg cgtccgtcgt cacgaccggc agcggcatcg ctgccggatg ctatgggtgc 817441 gctgcttgcc cgttcgatcg tcggcatcgt cgagcaggtg gaacagactc tcagtggtgc 817501 cagcgtgttg tggctggggg agtcgcgcct ctagccgttc aaacgaccgc tgatgaagat 817561 gatttcgccg agcggatcgt cgtcgtgtgg ggcgggaacg ggcaaaccat tgcgcctgaa 817621 taggtcggtc cgcactgtgc cctcaacgtc ccagcccttg gcgcgcaggt agtcgacgac 817681 gtggctgcgt tcgccggaat acaccagcga cgccatgtcg atgtccacgc cgtgcttgcg 817741 aaacgaatcc gccatttctc gtacccggcc tgcgtcgaaa tccacaatgc ccgggacaag 817801 ttcggtagcg atcgtgctgc ccgcaacact gagttcggtg ctgttgtcga acaaccggtc 817861 ctgggatccg gcggcaggta gatcagcatg ccttcggcca accatgctgt cggtgccgtc 817921 gagtccaggc cggcagcttg cagtgccgcc ggccagtccg cgcgcaagtc gatgtacacc 817981 gtgcgccgaa tggcggtggg cttggcgccg atgccggcca aggtggttgt cttgaagtcg 818041 atcacctgtg gttggtcgat ctcgtagacc acggtgccgg ccggccacgg caaccgatag 818101 gcgcgcgcgt ccaacccggc tgccaggatc accacttgtc gcactccgcc gtccgtggca 818161 gtgcggaagt agtcgtcgaa gtacttggtg cgcaccgcta tcccgtcgat catcgcctgt 818221 gcccgccccg gcgaaaggtt cccggtcgtc gcgatatcga gctcgccgtc gatcaacttg 818281 gtgaagaaat ccagcccgac cgcgcgcacc agcggttcgg cgaacgggtc gttgatcaaa 818341 cctcgtggat ccttggtcgc caacgcgcgt ccggcagcaa ccatggtcgc ggtagccccg 818401 acgctggagg ctagatccca gttgtcgtcg tgagcgcgcg gcatctgcgc cctatgtccg 818461 ggtcgcagcg acgtagttca ttgtgccggg acggccgctg cagcgctgag gtcggccagt 818521 gtacgcgacc gccaactcag ccggtaagcc ctggcggcgg tggagcagtc gtcgaagcct 818581 ggtgagcatc actgcgagtc atcgtgtagg cggccgattt cgacagttca ttgacggggc 818641 aagcggtatg gcgccacgaa ggtgctggct tgcggtgtgc tgggtactgt ctgtgtttcc 818701 gttgcaacct ggcgctgaca tagaagaaat caggcaacgg cacttcttcg tcctcgaacg 818761 gttgaaatcc gttggcggtc agcaggtcct gggatttgat ctcggtcagc agccaaccat 818821 tgtcggatag gtacgaggcg ggctcgttgc ggtcgccgaa gtacaccagc tcattcatgt 818881 ctagatcgaa accgtatgcg cgccaacgat tggcgaggat cgtcatgcgc tccctcatcc 818941 gttcctcatg atgcggcttg aagttgcgta tgctctcggt tgcaaacctg ctgtccggca 819001 cactgagcgc ggtgacattg tccaacaagc ggtcctgcgc ttccggcggg aggtagcgga 819061 gcaacccttc agcgctccac gcggtgggct gggtcgggtc gaatcccgcc gcgcccaacg 819121 cggtgggcca atccgcacgc aaatcggcgg tgaccacgcg ccggtcggcg gtgggcgtgg 819181 cgcccagttc ggcgagtgtg cgagttttga actccatgac ttgcggttgg tcgatctcat 819241 acaccacggt ctgggcgggc caggccagcc ggtatgcccg ggaatccaat cctgaggcca 819301 ggatcacgac ctgcctgatg cccgcgcgtg tcgcatccat gaagaactcg tcgaagaact 819361 tggtgcggac ggcatggtgt tcggccatac ggaccatgga cgcattcggg cgttccggat 819421 cgtcgatgtc tgaggccgtc aattccccgc tcgcgagccg ggtcagaacg tccaccccca 819481 ccgcccggac cagcggctca gcgaactgat cgttgatcag tgggttggcg gcgcgggtcg 819541 ccatcgcgcg agccgccgca accatcgtgg cggtcgcccc gacgctggat gccagatccc 819601 aggtgtcccc ttcgcacctg atggaaccgg tgtatgtcat gcacggcctc tcttcaaaaa 819661 gcggggataa ttccttagta aagttaacaa caggcgacaa attccgcgac ttggaaaggc 819721 tggcgcgatc ggcggcgtcg gggtgccgcc atagggggcg cacgtggggg tcctggctgt 819781 tgagcgtgaa taccgcgatg ggttttcggc gtgtcgcgtg gtgcgattca ctctcggtgc 819841 ggctagagcg gattcgcgcg cagatagccg tagacgcccg tgaagttacg gcacacgtcc 819901 tcaggaattg gcaccggtcc accgagagcg cgggcacccc aaacgatttg tgcggtgcgc 819961 tcaacaaggg cggtgacgcg cagcacctgg tcggggcggg gccccacggc caccaggccg 820021 tggttggcga tcagggcggc ggcgcggccc tcaagcgcgc gcaccgcgtt gcggccgacc 820081 tcgggtgtac cggacgcggc gtactcggtg cagcgaacgt ccccgccgca gtagatcgcg 820141 aactcgtcga tgcaggcggg aatcggctca tgggcgacgg cgaacatggt cgcccacacc 820201 gggtggctgt ggatcacgct gccaatgtcg tcgaatgcgc gatagcacgc caggtgtagg 820261 tttagttcgg tcgacggcga ccggccgtcc ttggcgtgca gcaccgcacc gccggcgtcg 820321 actagcacca gatcgtggag cagcatctcg gcgtagtcga ccgaggacgg cgtgatgacc 820381 acgttgccgt ccgagcgcct ggctgagata tttccggcgg tcccctcgac caggccccga 820441 cgcaacatgt ccttggcggc cgccagcacc gcggattccg gggcgtcaac gaagttcatg 820501 agcccaatac ctccgggttg acgacatggg cgggcctgtt gccggacagc agtgcgccca 820561 ggtcgtcggc gaccatccgc gcctgccggg cctcggtgtt ccaggtggcc ccgccgatgt 820621 ggggggtgag gacgacattg ggcatgctca ccaaagggtg atcggtcggc agccattcac 820681 cggtgaagtg gtccaggccg gcggcggcca gcttgccgcc acgcagggcg tcgacgagcg 820741 catcggtgtc gcgcagctgg gaccgggcgg tgttgagaaa caccgcaccg tcgcgcatgg 820801 ccgcgaactg ctgggcaccg atcatcccga tcgtgtcgtc ggtgaccgcc gcgtgcatgg 820861 agacgatgtc agcctcggcc agcagctcgt caaggctgtg gccggcgtcg tcgcggtaag 820921 gatcgtgcgc gatgacccgc aggcccagcc cggacagcct ccagcgcacc gcgcgaccga 820981 cggcacccag gcccaccagc ccggcagtca gcccggcgat ttcggcaccg cggaaccgct 821041 gataggggat ggtgccgtcg cgaaagatgt tgccggaccg cacatctgcg tccgcgggaa 821101 tcaggtgccg ggcgacggcc agcaacaggg ccaccgtcat ctcggcgaca gcgtcggcgt 821161 tgcgagccgg ggtgtgcagc accggtatgc cggccgcggt ggcgccgggg atgtcgacgt 821221 tgctgggatc cccgcgggtg gcggcgacca cccgcaaccc ccgctcgaac accgggccac 821281 cgaccgagtc actttccacc acaagaacat cggcggcgac ggcggtgatc cggtcagcta 821341 gctgctcggc gctgtagatt cgcagcggtc gctgatcgat ccacgggtcg tataccacgt 821401 cggctagccg ccggagctgg gcgaaccccg gtccacgcaa tggagccgtc accagagcac 821461 gcggtcgagg cgtcacgttt gccaatgctg gcgtacggtg gcgcccgtgt cacgcgacga 821521 cgtcacaatc ggcatcgata tcggcaccac cgccgtcaaa gcggtggccg ccgacgacaa 821581 cggtcgggtg acggcgcggg tacggattgg ccaccagctg gcggtgccgg cccccgaccg 821641 gctggagcac gacgccgacg aagcgtggcg gcggggacca ttggcagcac tggaccggct 821701 ggtcggaccc gacacccggg cactggccgt tgccgcgatg gtgccatcgc tgaccgctgt 821761 cgatcccgct ggccggccga tcacacccgg gctgctgtac ggcgacgcca ggggtcgggt 821821 accgaacgcc tcggtggcac gggcgcagtc ggtgccgtcg gtgggtgaga ccgccgagtt 821881 tctgcgctgg acggccggcc aagcgctgga tgcgtccggg tactggccgg cgccggcggt 821941 ggccaattac gccttgtcgg gcgaagcggt catcgactat gccacggccg tcacgactct 822001 cccgttgttc gacgggacgg gatggaacgc gaccgcttgc gccgactgcg gtgtgaccgt 822061 tgaccggatg ccgcgggtgg agacgttcgg agtgggagtg gggcaggtgc gcggcaccgg 822121 cgcggtgctg gcggtcggtg ccgtcgatgc cctgtgcgaa cagatcgtgg ccggcgccga 822181 ccgcgacggc gacgtgttgg tgctatgcgg cgccaccttg atcgtgtgga ccaccatctc 822241 cgcggctcgt caagtgccgg gtttgtggac catcccgcat acggcaccgg gcaagagcca 822301 gatcggaggg gccagcaacg ctggtgggtt gttcctcaac tgggtggatc gtgttattgg 822361 accgggcgat ccagcgctag ccgatccgcg gcgggtgccg gtgtggctgc cctatatacg 822421 cggcgagcgc accccgttcc atgagcccga tcgccgggcc gtgctcgacg gtgtggatct 822481 ctcccaggac gccgcatcgg tgcggcgggc cgcctacgag gcgtcgggct tcgtcgtgcg 822541 ccagctcatc gagctaagcg gggcgccggt ggcgcgcatc gtggcggcag gcggcggcac 822601 ccggatacag ccttggatgc aggctatcgc cgacgcgacc ggccggccgg tggaggtgtc 822661 cagggtggcc gaaggggcgg cactgggagc ggctttcctc ggccgcttgg cggccggatt 822721 ggaatcgtcg atcgccgacg ctgcccggtg ggcctcaacc gaccgcattg tcgaacccag 822781 tgccgactgg gcggggccga ccaaggaacg ctatcgccgg ttcctggcgc tcagcggctc 822841 gaagttggcc tgacggtgga ccaagatgca tggcgcaaga actggtgtgt cgttctacgc 822901 ttatgcaatg acagatcacg accagaccgc ggcccgtcga gagatcgccg atgccctgct 822961 cgccgcgctg gaacgtcggc atgaggtcgc agacgccatc gtggaggccg ccaacaaggc 823021 cgccgccgtc gaggcgatcg tgaacttgct gggcacctcg cacttggccg ccgaagcggt 823081 gatgagcatg tctttcgatc agctcaccca ggatgcgcgc acaaagatca tcgccgagct 823141 cgacgacctg aacaaacagc tgagcttcac cgtcaaggag cgtccagcca gctctggtga 823201 gggcctggag ctgcggccgt tctccccaga tgaggaccgc gacatcttcg ctcgacgaac 823261 cgaagaaatg ggcgccgccg gcgatggatc cgggggaccc gccggcagcg tcgacgacga 823321 gatccgagcc gcacagaagc gcgtcgacga cgaggaggcg gcttggttcg tggctgttga 823381 ttccggcgtc aaggtcggga tggtgttcgg cgagcttgtc cacggcgagg tggacgtccg 823441 gatctggatt caccccgatc atcgaaaaaa gggttacgga accgcggcat tgcgcaagtc 823501 gcgctcggag atggcctggg cgttcccggc cgtgccgatg gtcgcccgcg cgcccgcggc 823561 ccaacccgcc cagccgggaa gtgccggccg gtagcatccg gttcggtctg gcaggcggtc 823621 gccaggccga tcggcggcga atccgcggcg ccaacgctgc cgccggatcc caactggctt 823681 aatcagcgtg tgtcttggtg tttctgcttc agttcggcgg agacatagat cacctcgccg 823741 aacggggcgt cgtcgccgtc gatcggcggc agtccgtgtt cagccagcaa gtcggtggtg 823801 ctcgcgctgg cggttcgcca gccgtggtcg gccagatagg tccgcgcgtc cgtgcggtcg 823861 ccgaaataca ccaggcccga catgtcgagg tcgaggccat ggcgcctgaa ccgctccgcc 823921 aggcgccgca tgcgtccccg cagttcttct tcgttgagcc gattgatgtc gcgcaggact 823981 tcggtggcga actggctgcc cggtacgctc tgggcggtga tctggtcaag cagccggtcc 824041 tgcgcctcgg cggacagata gcccagcagc ccctcggcga tccaggcggt ccgctgcgcg 824101 ttgtcaaagc cggctttttg cagggcggtg ggccagtcgt cgcgcaaatc gaccgccacg 824161 gtgcgccggt cggtcgtggg tgccgcaccc aggccggcca gcgtcgtggt cttgaagtcg 824221 atcacctgcg gctgatcgac ttcgaagacg atggtgccgg ctggccagcg cagccggtag 824281 gcgcgggaat ccaggccgga agccaagatg acggcctgcc gaatcccggc tcgggtggca 824341 tccagaaaga agttgtcgaa gtagtgagtg cgaatggcca tcgcgtcggc gaaccgccgc 824401 aggccgttgg cctcgtcttc ggctagctcg tcgggatcca gttcgccact ggccatgcgt 824461 acgaagaagt cgacgccgac cgcgcggacc agcggttccg cgaactggtc gttgaccagc 824521 gcgccgggag cccggccggc taccgctcgg gccgccgcca ccatggtggc cgtcaaaccc 824581 acactggacg ccaagtccca cgaatcgccc tcaaagcggg cactgcccgt ttgcgtcatc 824641 tgtaacccct tcgatagctc gcaccgtggc ggcccggaac gggccagtcc ataccagctg 824701 ttagtctctt acacgatttg gcgcgcgacg ccgtacgtcc tggcctgcgg gtgttgggcg 824761 cgtgatgcaa gatgaccccg ggctgcgcag gaggatagag tgctttcggc tttcatctcg 824821 tcgctgcgaa cagtcgactt gagacgaaag atcctcttca cgctgggcat cgtcattctc 824881 taccgtgtcg gtgccgcgct gccgtccccc ggtgtcaatt ttccgaacgt gcagcagtgc 824941 atcaaagaag ccagcgcggg cgaagccgga cagatctatt ccctgatcaa cctgttctcc 825001 ggcggtgcgt tattaaagct cacggtgttc gcggtggggg tgatgcccta catcaccgcc 825061 agcatcatcg tgcagctgct caccgtggtc atcccgaggt tcgaggaact ccggaaggaa 825121 ggccaggcgg gtcagtcgaa gatgacccag tacacccgtt acctagcgat cgcgttggct 825181 atccttcaag ccaccagcat cgtggcgttg gctgccaacg gcgggttgct acaaggttgc 825241 tcgctggaca tcatcgccga ccagagcatt ttcacactgg tcgtcatcgt gctcgtgatg 825301 acgggcggcg ccgcgttggt gatgtggatg ggcgagttga tcaccgaacg cggcatcggc 825361 aacggcatgt cgctgctgat cttcgttggc atcgctgccc gcatcccggc cgaaggtcaa 825421 agcatcctgg aaagccgcgg tggagtcgtc ttcaccgcgg tctgcgcggc cgcgttgatc 825481 atcatcgtcg gtgtggtgtt cgtcgaacag ggtcagcgcc ggattccagt gcaatacgcc 825541 aagcgcatgg tgggccggcg gatgtatggc gggacttcga cttatctgcc gctcaaggtc 825601 aaccaggccg gcgttatccc ggttatcttc gcgtcgtcgc tgatctacat tccgcacctg 825661 atcacccagc tgattcgcag cggcagcggt gtcgtgggaa acagctggtg ggacaaattc 825721 gtcggcacgt acctgtccga cccgagcaac ctggtctaca tcggcatcta cttcggcctc 825781 atcatcttct tcacctactt ctacgtgtcg atcaccttca accccgacga acgtgccgac 825841 gagatgaaga agttcggcgg cttcattccg ggaattcggc cgggccgtcc gaccgcagac 825901 tatctgcgct atgtgctgag ccggattacc ttgccgggct cgatttacct cggcgtgatc 825961 gccgtgctgc ccaacctgtt cctccagatc ggcgccggtg gaaccgtgca gaacctgccc 826021 tttgggggta ccgcggtgct gatcatgatc ggtgtcggtt tggatacggt caagcagatc 826081 gagagtcagc tcatgcagcg caactacgaa gggttcctca agtgagagtt ttgttgctgg 826141 gaccgcccgg ggcgggcaag gggacgcagg cggtgaagct ggccgagaag ctcgggatcc 826201 cgcagatctc caccggcgaa ctcttccggc gcaacatcga agagggcacc aagctcggcg 826261 tggaagccaa acgctacttg gatgccggtg acttggtgcc gtccgacttg accaatgaac 826321 tcgtcgacga ccggctgaac aatccggacg cggccaacgg attcatcttg gatggctatc 826381 cacgctcggt cgagcaggcc aaggcgcttc acgagatgct cgaacgccgg gggaccgaca 826441 tcgacgcggt gctggagttt cgtgtgtccg aggaggtgtt gttggagcga ctcaaggggc 826501 gtggccgcgc cgacgacacc gacgacgtca tcctcaaccg gatgaaggtc taccgcgacg 826561 agaccgcgcc gctgctggag tactaccgcg accaattgaa gaccgtcgac gccgtcggca 826621 ccatggacga ggtgttcgcc cgtgcgttgc gggctctggg aaagtagtca tgcgcccact 826681 ggcacggctg cggggtcgca gggtcgtgcc gcagcgcagt gccggcgaac tcgacgcgat 826741 ggccgcggcg ggcgccgtcg ttgccgccgc gctgcgggcg atccgtgcgg cagcggctcc 826801 cggcacatcc agcctgagtc tcgacgagat cgccgagtcg gtgatccgcg aatccggcgc 826861 caccccgtcg tttctgggct atcacggcta cccggcctcg atctgcgcgt cgatcaacga 826921 ccgggtggtt catggcatcc cgtcgaccgc cgaggtgctc gcgcccggtg atctggtatc 826981 catcgactgc ggtgcggtgc tggacggttg gcatggcgat gcggcgatca ctttcggggt 827041 tggcgccctg agcgacgccg acgaagcgct gtcggaggcg acaagggaat cgcttcaggc 827101 cggcatcgcc gcgatggtgg tcggcaatcg gttgaccgac gtcgcgcatg ccatcgaaac 827161 gggtacccgt gccgccgagc tccgttatgg acgctcgttc gggatcgtcg ccggttacgg 827221 gggccacggc atcggccgcc agatgcatat ggatccgttc ttgccgaacg agggtgcgcc 827281 ggggcgcggt ccgctgctgg ctgccggctc ggtgctggcc atcgaaccga tgctgaccct 827341 cggtaccacc aaaacggtgg tgctcgacga caaatggacg gtcacgaccg ccgatgggtc 827401 acgtgcggca cactgggaac acaccgtggc ggtaaccgac gacgggcccc gaattctgac 827461 gctcggttag cgcggctgcc ggcgcgggca gtggtgaacc aaactcttac tcgactcgtg 827521 tcagtaagcg ggaggtgatc gcgtggctcg tgtgtcgggc gccgcggccg ctgaagccgc 827581 gttgatgagg gcgctctacg acgagcatgc cgccgtgttg tggcgttacg cgctgcgctt 827641 gaccggggat gcggcccaag ccgaagacgt cgtccaagag acgctgttgc gggcgtggca 827701 gcatccggag gtgatcggcg acaccgcgcg gccggcaagg gcgtggttgt tcaccgtcgc 827761 gcgcaacatg atcatcgacg agcggcgcag cgcccggttc cgcaatgtgg tcggttcgac 827821 cgaccaatcg ggcacacccg agcagtcgac gccggacgag gtgaacgccg cactggatcg 827881 gctgctgatc gccgatgcgc tggcccaact gtccgccgag catagggccg tgatccagcg 827941 gtcctactac cgcggatggt cgaccgcaca gattgccacc gacctcggaa ttgccgaagg 828001 aacggtgaag tcgcgattgc actacgccgt gcgcgcgttg cggctcactc tgcaggaact 828061 gggagttact cgatgacggc agagcccatt cgcatggctg ccggctccgg atacgtgagg 828121 gtgacaggag agagatgaca tgacgatgcc gctacgagga cttggcccgc ccgatgacac 828181 cggtgtgcgc gaggtgtcga cgggtgatga tcaccactac gcgatgtggg atgcagctta 828241 cgtgttggga gcattgtctg cggccgaccg ccgcgaattc gaagcgcacc tggccggttg 828301 ccccgaatgc cggggggccg tcaccgaact ctgcggggtg cccgccctgc tgtcccagct 828361 cgatcgtgac gaagtggccg cgattagcga atccgccccg actgtggtgg cttcggggct 828421 gtcgccggag ttgttgccgt cgttgctggc ggcggtgcac aggcgtcggc gccgtacccg 828481 gctgatcacc tgggtggcct cgtccgccgc tgccgcggtg ctggcgatcg gtgtgctagt 828541 cggtgtgcag ggccactccg cggcaccgca gcgggcggcc gtgtcggcgc tgccgatggc 828601 ccaggtcggc acgcagctgt tggcgtccac ggtgtcgatc agcggcgagc cttgggggac 828661 gttcatcaac ctgcggtgcg tctgcctggc gccgccgtat gcttcccacg acacgctggc 828721 catggttgtg gtgggtcgtg acggcagcca gacacggctg gcgacttggt tggccgaacc 828781 cggtcacacc gcgacacccg ccggcagcat ttcgacaccg gttgaccaga tcgccgccgt 828841 gcaagtggtt gccgccgata ccggccaggt tctgctgcag cgttcgctct aagactgagc 828901 tttaggcacc tggcgccctg ctattggcac gccctacaag caccaggtgg tcgggcgtcg 828961 accacctgct cggagtgggc tgcatgatgc cgcgcatctt cagtcgtcga tcaccgtggt 829021 gctggccaaa cacgagttct ccgctgccac ggtggccgac gggtacagcc gcagcggggc 829081 cgggttcggg gtcgcggcgg cggcctccgg tggcggcact ttcctcggtc agaaatgcgc 829141 cgcagcaacg gcaagctgaa ttccgtaagg ttggcccgcg tcgacgcatg tgcgataaga 829201 aggggcgtgg cctcagataa tcgcgacccc atcgccgcag cacgggccaa ctgggagcgt 829261 tccgggtggg gtgatgtgtc gctaggcatg gtggcggtga cgtcggtgat gcgtgcgcat 829321 cagattctgc tggcccgcgt cgagacggcg ctgcgcccct atgacctgag tttctcccgc 829381 ttcgagctgc tgcggctgct ggcgttcagc cgtatcggag cgctaccgat caccaaagcg 829441 tcggaccgat tgcaggttca cgtgaccagc gtcacccacg cgatccgccg gctggaggcc 829501 gatggattgg tgcggcgggt tccgcacccc accgacgggc ggaccacact ggtgcagatc 829561 accgagctgg gtcgctccac ggtcgaggac gccaccgtca ccctcaacga gcaggtgttc 829621 gccaacgttg ggatgggcgc cgaggaatcg caggcgctgg tgtcggccgt cgaaacgttg 829681 cggcgcaacg ccggcgactt ttgagggcgg gcagacgcgt aagcgcccaa tgtcgtgccg 829741 aaatgggcgc ttatgcgtct gctcgcgccc ggcttggcgc gcagccggcg acattccatg 829801 accagtttgt gcgggccttg acgcgggcgc gggctcgtat gcgaccgccg aggccggccg 829861 gcttgctgct gggcaatggc ggggctcggc ggtatccggc ggcgggcagc taaccggact 829921 gccccgaaac ccactgcgtg gtcaacgatt tcaggacaag ctgttagcag gacgtgcccg 829981 cgctgcgcta tccaaaaacg tcatgggcac gcatgatggt gaaatgcggc ggacaccaat 830041 tcaaccgcga aaggcaggac agtggaccca ctgatggctc accagcgcgc tcaggacgcg 830101 ttcgccgcgc tcctggccaa cgtccgcgct gaccagctcg gcggccccac gccctgctcg 830161 gagtggacga tcaacgatct gatcgagcac gtcgtcggcg gcaacgagca ggtcgggcga 830221 tgggcggcca gccccatcga gccacccgcc cggcccgatg gcctcgttgc cgcccaccaa 830281 gccgcggccg cggtcgccca cgagatcttc gcggcgccgg gcgggatgtc cgccacattc 830341 aagctgccgt tgggcgaggt tcccgggcag gtgttcatcg ggttacgcac caccgatgtg 830401 ctgacccacg cgtgggatct tgccgccgcc accggccaat ccaccgatct tgatcccgag 830461 ttggccgtcg agcggctcgc cgccgcgcgt gccttggtgg ggccgcagtt ccgcgggccg 830521 ggaaagccct tcgcggacga gaagccttgc ccgcgtgagc gcccgcccgc cgatcagctg 830581 gcggcatttt tgggccgcac ggtgcggtga acccgcgaat tcggctgccg cgcaacgtgt 830641 ggatcaccgc gctgcggtcc agggcgccgt ggtcggcggc gaatctggcg tagatttcgg 830701 cggggtggcc acctagcggc gccgccgcac cggtcgaggc caccgcttcc atggccaggc 830761 ccacatcttg atcggcgtgg tggccacgcc cggtgtgaag tgctgttggc cgtgatgtcg 830821 gattacagtc tcggcgtgcc cgacgagaca ggccttggtg ctgacgcggc gcgcgcgtga 830881 agtggcgctg acacagcaca ttggggtatc cgcggagacc gatcgggccg tcgtccccaa 830941 gctgcgccag gcctatgaca gcctggtgtg cggtcgccgc cggcttggcg ccattggagc 831001 cgagatcgag aacgcggtgg cccatcagcg cgcgctgggc cttgacaccc cggccggtgc 831061 ccgtaacttc tcccggtttc tcgccaccaa agcacacgac atcacgcgag tgctggcagc 831121 aaccgccgcg gaatcccagg ccggcgcggc gcggttgcga tccctggctt cgtcctatca 831181 ggctgtggga tttggcccca aaccccagga gccgcctccg gatccagtgc catttccgcc 831241 ctaccagccg aaggtgtggg cggcgtgccg ggcgcgtggc caagacccgg acaaggtcgt 831301 caggacgttc catcacgcgc cgatgagcgc gagattccgc tcgctaccgg ccggagactc 831361 cgtgttgtac tgcggcaatg acaagtacgg gctgctgcac attcaggcca agcatggacg 831421 ccaatggcac gatattgcgg atgcacgatg gccgagtgca ggcaattggc gctatctcgc 831481 cgattacgca atcggtgcca cactggccta cccggagcga gtggagtaca accaagacaa 831541 cgacacgttc gccgtatacc ggagaatgtc gttgccagac ggcagatacg ttttcacaac 831601 ccgcgtcatt atttcggcac gcgacgggaa gatcattacg gccttcccgc agacgacgtg 831661 atgcgtcggt tgggaactaa gggaaggtga tggcgtgacc gggccaccgc gaagctatac 831721 agggcgccgg gatctcatcg cggagaagct ggagccgtac tttcagatca gcgccatgct 831781 gccgaagaac accagaccca cctcggaaac cgccgaagag ttctgggaca actcgctgtg 831841 gtgcagctgg ggcgaccgag aaacgggata cacccgcacc gtcacggttt cgatctgcca 831901 ggtggcggac ggcgaacgtg aggccgaagg ggttcgggac atgatgcggc tggagtgtcc 831961 ggctgggctg gatctacgga cacccaaccc ggaggcatac gagattaccg gtcagcggcc 832021 cggagaattc gtgttcgtgc tcggctatct ggggcatgtg cgggccatcg tgggcaactg 832081 ttacatcgag atcatgccga tgggcaccag ggtcgagctg agcaagttgg ccgatgtggc 832141 attggatatc ggccgcagtg tcggatgctc ggcctacgag aacgacttca cgctgccgga 832201 cattccaacg cagtggcgca accagccgct gggctggtac acgcaaggcc ttgcccccta 832261 cctgccgggg ctgtcggacc cgaaagacgc cgccgagggc tgatgggtgt gccggcgacc 832321 tctgagggcg agcagacgca taagcgccca atttcgggct cttctgaccc ttccgtgggt 832381 ggaaccttgg tctgagtagg cgcacgtcgt tgtagcttaa ggttgctggt ttgtcaaagg 832441 tccgaaacca aggggagcga gcaacgacgt gcgcaatgcg aggttgtggc gtgaactgct 832501 gggtgttgat aagcggacgg tggcctacgc caggtgtttt cggtcaaagg cgaagaaggc 832561 aagcaggcac tggatcggtg gatctcctgg gcgcggcgct gccgcatccc cgtcttcgtg 832621 gagctggccg gcggcatcgt gcgacaccgc caagccatcg acgccgccct tgaccacggc 832681 ctatggcaag gactgatcga atccaccaac accaagatcc gactcctaac ccggatcgcg 832741 ttcggattcc gctcccccga agcactcatc gccttggcca tgctcgccct cggcggccgc 832801 cgccccgccc taccgggcag aaccaaacac ccacggatca gtcagtagag ccggaaaacc 832861 tgggatttcg ctgcccgttg gacggtgcaa tgcgcttctg tccatgagtc gctggaagac 832921 ctgggcatct cgcccgggtt gtcctggctt attgggccat gacctcttgg gaggtgtcac 832981 atatcgtttg tgatcgcggc gccggaggcc atcgcggcag cggccacgga tttggcaagc 833041 atcggttcga cgatcggggc ggccaacgcc gcggccgcgg ccaacacgac ggcggtgctg 833101 gccgcgggcg ccgatcaggt gtcggtggcc atcgcggcgg cttttggggc gcacggccag 833161 gcctatcagg cgctcagcgc gcaggcggcg acgtttcata tccagtttgt gcaggccttg 833221 accgcgggcg cgggctcgta tgcggccgcc gaggccgcca gcgccgcgtc cataaccagt 833281 ccgctgctcg acgcgatcaa cgcgcccttc ctggcggcgt tggggcgccc gctgatcggt 833341 aacggcgccg acggggcgcc ggggaccggg gccgccggcg gggccggcgg attgttgttc 833401 ggcaacggcg gcgcgggcgg gtccggcgcg cccggcgggg ccggcggatt gttgttcggc 833461 aacggcggcg ccggcggccc cggcgcgtcc ggcggcgcgc tgggctgatc ggcaacggcg 833521 gtaacggcgg taagggcggg cttggggtcc cgccgggtgt cggtggtacc ggcggcgccg 833581 gggggctgct gctcggcctg gatgggttga cgtaggcggc ggcccgcagc ccgccgggct 833641 ccacgtcatc tggcgctgct ggcagaccaa cgctccctac gagcccacgc gccaccgagc 833701 cctccagggc cctgctggcc caacatcaac gaacggatac ctgggacagg acgactggaa 833761 ggcgggcagt tgacccatgc cgaataccgg tggcagcctg ctgcacatcg catccacttc 833821 cgggcgacca acacgtcgag cagccgcgac atccgcggca tgcaatgctg gcggcgcgac 833881 aggtgctagg aggagtggtt gcccgcaccg tagtagttca gccaggccgc aatgcgttga 833941 ccgatacgcg ggtcggtttc ctccggcagc agcaaaaccc gagcctggat cacacccacg 834001 tcgagcaggc cgcttcggat gagggcggcc acgaaagctt tgtccttctc gcgtccggcg 834061 gcaagtttcg ccaccgcgag gtcgtgcggt tccagaaagc gtggtttcgc cgggcgcgag 834121 gattcgacgg tccaactgac cagccggtcc cgccacccgt taggcaggat cgcggtgtcg 834181 atatgtacgc cctcggcata aacgccattg ctgcggtgaa aatcggacat ctcgccgatt 834241 gccacgtcga catgatccgc tttgtcccgc gccgggtcgt tgacaaacgc gatgtcggcc 834301 tcctgggagg cggtggcctg cggcggtagt tcgttttcat caaatgaccc caggatcgac 834361 tgcgacccga gtaccagcac gtccacatcg cccacaacag cacaggcgcg gcggaggaga 834421 tgtgcaagtt gctgacgcgt cattccgtca tggcccgctc gtgctcgcga tcccagtgat 834481 ccttgaacga ccgcagcacc gcgaccctcg tcgcctccgg cagtatgccc gcgaacgggg 834541 agttctgccg catctcccga gcgtcctccg aagggctggt caatacgtgc atgaccgcgt 834601 cgaggccgtc gttaaggaca cgctgccact tcgtgaaata ccaccccgcc atgccgtccc 834661 gacgatgcat acccgaccag cgacgcaagt tctctcgtgc ggcggagacg accgtatccg 834721 gttcggtcaa cagcgggctc agcagggcgc gatgcagcca cagcgacctt tcctcctcgc 834781 gggtcaaccg gcggctcgtg acgcgctcga cttcgctact gggcacccgc cgatggctgc 834841 cgacgtgcac gcacaccatc tcgccgcggt cacacatgtt gacgacatgc tgccgcgata 834901 ccccgagtat ctgcgcggcc tcactcgtct tcagcagagt ctccatgtcc caatgctggc 834961 tagtaaaccc aaaaaacaca acatcgttgc ggagcgtgat cgcaccggct gacgctagag 835021 cgagggcccc agttccgcgg ccgacaggtg gcagtgagct agctgccgcg cagcgtctcg 835081 atcactgcgc tgaagtccaa gtcggcgtga tcggcggcga acttggcgta gatctgggcg 835141 gcgtggctgc ccagtggggc cgccgcaccg gtcgaggcca ccgcttccat cgccaggccc 835201 aacatgccag gtgctgccga ctacggcggt gatccacacg gtcacggcgg aagcattggg 835261 ccgcatcggt attgatgcgc cgcggattcc tggatcgttg gacgtcgccg cgcatgcggc 835321 gatcgggctg ctgccgttgg tggccggctg cgaccgccga catcggcggc ctgtccgcgg 835381 tgctcgggcc ggacgggctg cccaagtgtc tttgtgtatg acggctatcc gggtggagcc 835441 ggtttcgtcg aacgcggttt gcaccggccc cgcggcgcag gtgggtgacc agtcacgctc 835501 accgcagcgc gattacgcgc accaggcctt gcaacccgat gtgccgcggc gccgcgcgcg 835561 gcggcacaga ccccgccggt gttcggcaaa aacggggtcg tcgtcttcga cgatgcggtg 835621 tacttgtcat cagaatcagt gtctatggtc atcgggggtg tcgtgggcgc tggcccgctg 835681 actcgggtgg gaggtggcac atgtcgttcg tgctggcgat gccggaggtg ttggggtcgg 835741 cggcaacgga tctggccgct ctgggctcgg tgctgggcgc ggccgatgcg gccgcggcgg 835801 ctacgacgac gggcatcgtg gccgcggccc aggatgaggt gtcggcggcg atcgcggcgt 835861 tgttttccgc ccacggccgg gcctatcagg tggccagtgc gcaggcggcg gcggttcacg 835921 cccagttcgt ggaggcgttg agcgcgggtg cgggggccta cgccagcgcg gaggccgccg 835981 gcgcggcggt gctggccaac ccggcgcaga gcgtgcagca ggacctgctg gccgccgtca 836041 atgcgcaaag tgtcgcgctc acggggcgcc cgttgatcgg caacggcgcc aacggggccc 836101 cgggcacggg ggccaatggt gcgccgggcg ggtggttgct cggtaatggt ggggccggcg 836161 ggtccgccgc cgctggctcg ggcctgcccg gcggggccgg cggggccgcc gggttgttcg 836221 gcaccggcgg ggctggtggg gccggcggga gttccacggt aggtgatggc gaggccgggg 836281 gtgccggtgg atcaggtggc tggttgttgg gcaccggtgg ggtcggcggg gtcggcgggc 836341 tcggggccgg cgccggtggg gccggcgggg ttggtggggc cggcgggctg ttgggtgctg 836401 gcgggcacgg cggcgccggc gggctaggcg ccgtcaccgg tggggtcggg ggaactggcg 836461 gagccggtgg gctgctggcc gggctgctgg ccgggccggg cggggccggc gggaccggcg 836521 gacgtggctt tctcaacaac ggtggggtcg gtggggctgg cggcaacgcc gggctgctgt 836581 tcggtgccgg cggcaccggt ggatccggcg gagccggcct aggtggtgac ggtggggccg 836641 gtggggccgg cggcaacacc ggtgtgctgt tcggcaacgc cggatccggg gggaccggcg 836701 ggttcggcga taccgacggg ggagccggcg gtgccggcgg tgacgccggc tggttgggct 836761 ccggtggggt cggcggggcc ggcgggttcg gcgaaaccgg tgacgggggt gtcggcgggg 836821 ccggcggcaa ggccgggttg ctgatcggta acggcggggc cggcggcgcc ggtgggcaag 836881 gcgccgtgac cggcggtacc ggcggggccg gcggcgacgg ggtgctgatc ggcaacggcg 836941 gcaacgccgg catcggcgga accggaccga ccgcgggtga taccggcgcg ggtgggatca 837001 gtgggctgct gctgggcgcc gacggcttca acaccccggc cagcgcctct ccgctgcaca 837061 ccctgaaaca acaggcgctg gccgcgatca acgcgccgac ccagacactg accgggcgac 837121 cgctgatcgg caacggcacc cccggggcgg tcggcagcgg ggccaccggg gcccccggtg 837181 ggtggctgct cggcgacggc ggggccggcg ggtccggcgc ggcgggctcg ggcgcgcccg 837241 gcggggcggg cggggctgcc gggctgtggg gtaccggcgg ggccggcggg gccggaggca 837301 gctcggcggg tggcggcggg gccggtgggg ccggcggggc cggcggctgg ctgctcggcg 837361 acggcggggc cggcgggatc ggcggagcca gcaccgtact cggcggcacc ggcgggggag 837421 gcggggtcgg tgggctgtgg ggcgccggtg gggccggcgg ggccggtgga accggccttg 837481 ttggtggcga cggcggggcc ggtggggccg gcgggaccgg cggactgctg gccgggctga 837541 tcggtgccgg cggaggtcac ggcgggaccg gcgggctcag cactaatggc gacggcgggg 837601 ttggcggggc cggcgggaat gccggaatgc tcgccgggcc gggcggcgcc ggcggagccg 837661 gcggtgacgg cgaaaacctg gacaccggtg gggacggcgg ggccggcggt agcgcagggc 837721 tgctgttcgg cagcggcggc gccggcggcg ccggcggatt tggtttcctc ggtggggacg 837781 gcggggccgg tggcaacgcc gggctgctgt tgtccagcgg cggggccggc gggttcggcg 837841 ggttcggcac cgccggtggg gtcggtgggg ccggcggcaa tgccggctgg ctgggcttcg 837901 gcggggccgg tggcgtcggc ggcagcgccg ggctgatcgg caccggcggc aacggcggca 837961 acggcggcac cggcgccaac gccggcagcc ccggaaccgg cggcgccggc gggttgctgc 838021 tgggccaaaa cgggctcaac gggttgccgt agccgggcgg cacggcatgg cttccgggcg 838081 tcaaccactc gccggtgatg cagatcggct gcggagcggg ccgccaaaat gggggccgcc 838141 gcgccaggta tctcggcgaa gatccccggc gctcgagcgc tttgtcagag gcccgtcgcg 838201 ggtcgtcgtg acgacggcta tccgggcggt gcgggtttcg cggcgcgccc tgtgcccggc 838261 accgccgccc gtttgtcggc aacgccgccg cgacccgtga gccgtccagc agctggcgcc 838321 tgcgaaacgt gtggaagcgc tgcatgcggt gccggatcgc gatatcgttg atttctgcaa 838381 ttaattccta cccgtacggg tgtgtcgctg gtagtcgggc accaggccgt gaggggttgg 838441 gaggcatgcg atgtcatggg tgatggtttc gccggagctg gtggtggcgg cggcagcgga 838501 tttggcgggg atcgggtcgg cgattagctc ggctaatgcg gcggcggccg tcaacacgac 838561 gggattgttg accgcgggtg ccgatgaggt gtcgacagcg attgcggcgt tgttcggtgc 838621 ccaaggccag gcctaccagg cggcgagcgc acaggcggcg gcgttttacg cccagttcgt 838681 gcaggccctg agcgccggcg gaggcgcgta tgcggccgcc gaggccgccg ccgtgtcgcc 838741 gctgctggcc ccgatcaacg cgcaattcgt ggcggccacc gggcgcccgc tgatcggcaa 838801 cggcgccaac ggcgcccccg ggaccggagc caacggcggg cccggcgggt ggttgatcgg 838861 caacggcggc gccggcgggt ctggcgcccc cggcgctggg gccggcggta acggcggggc 838921 cggcgggctg ttcggcagcg gcggggccgg cggggcctcc accgacgtcg ccggcggggc 838981 cggtggggcc ggcggggccg gcggaaacgc cggcatgctg ttcggcgccg ccggggtcgg 839041 cggcgtcggc ggattctcga acggcggtgc caccggcggg gcaggcgggg ccggcggggc 839101 gggcgggctg tttggcgccg gaagggaacg cggcagcggc gggtcgggca acctcactgg 839161 cggggccggc ggggccggcg gcaacgccgg gacactcgcc actggtgatg gcggggccgg 839221 cgggaccggc ggcgctagtc gcagcggcgg attcggcggg gccggcggag ccggcggcga 839281 cgccggcatg ttcttcggct ccggcggctc cggcggcgcc ggcggcatta gtaaaagcgt 839341 cggggacagc gccgccggcg gggccggcgg ggcccccggg ctgatcggca acggcggcaa 839401 cggcggcaac ggcggcgcga gcaccggcgg cggggacggt gggcccggcg gggccggcgg 839461 caccggcgtg ttgatcggca acggcggcaa cggcggcagc ggcgggaccg gcgcgaccct 839521 gggcaaggcc ggcatcggcg gtaccggggg ggtgctgttg ggcctggacg gctttacggc 839581 ccccgccagc acctcgcccc tgcacaccct gcagcaggac gtgatcaata tggtgaacga 839641 ccccttccag acgctcaccg ggcgtccgct gatcggcaac ggcgccaacg gcactccggg 839701 gaccggggct gacggcggag ccggcggctg gttgttcggc aacggcggaa acggcgggca 839761 gggaacgatc ggcggcgtca acggcggggc cggcggggcc ggcggggccg gcgggatctt 839821 gttcggcacc ggcggcaccg ggggcagcgg cgggcccggc gccaccggcc tcggcgggat 839881 tggcggggcc ggcggagccg ccttgctctt cggctccggc ggggccggcg gaagcggtgg 839941 tgccggcgcg gtcggtggca atggcggggc cggcggcaac gccggtgcgc tcttgggcgc 840001 cgccggggcc ggcggggccg gtggtgccgg cgcggtcggt ggcaatggcg gggccggcgg 840061 taacggcggg ctgttcgcca acgggggagc cggcgggccc ggtgggtttg gcagccccgc 840121 tggggctggc gggatcggcg gggcaggtgg gaacggcggg ctgttcggcg ccggcgggac 840181 cggcggggcc ggcgggggaa gcaccctcgc cggcggcgcc ggcggggcgg gcggcaacgg 840241 cgggctgttc ggcgccggcg gcaccggcgg cgccggcagc catagcaccg ccgccggagt 840301 ttccggaggg gccggcgggg ccggcggcga cgccggcttg ctctccctcg gcgcctccgg 840361 cggggccggc ggcagcggcg gttccagcct gaccgccgcc ggcgtggtcg gcggcatcgg 840421 cggcgccgga ggcttgctct tcggctccgg cggcgccggc gggagcggcg ggttcagcaa 840481 ctctggcaac ggcggcgccg gcggggccgg cggcgacgcg ggtttgctcg tcggctccgg 840541 cggggccggc ggggccggcg cctccgccac cggcgccgcc accggcgggg acggcggggc 840601 cggcggcaag tccggagcgt tcggtctcgg aggtgacggc ggcgccggcg gcgccaccgg 840661 tttgtccggt gctttccaca tcggcggcaa gggcggcgtc ggcggcagcg ccgtgctgat 840721 cggcaacggc ggcaacggcg gcaacggcgg taacagcggt aacgccggga aatccggggg 840781 tgcacccggc cccagcggcg ccggcggcgc cggcgggctg ctgctcggtg agaacgggct 840841 gaacggcttg atgtagccgg cgggcctgcg accgcgcgcg gcgttgacag catcgcttcg 840901 gccgctcgac cgcagatgat gctgttgatg cgttaccgtg tgcatcatgc gcaccacggt 840961 gtcaatctcc gatgaaatac tcgctgccgc caaacgccgg gcccgcgagc gtggtcaatc 841021 gctgggcgct gtgatcgagg acgcccttcg gcgggagttc gccgccgccc acgtcggcgg 841081 cgcccgcccg accgtcccgg ttttcgacgg cggcaccggt ccgcggcgag gcatcgacct 841141 gacctcgaat agagcgttgt ccgaagtgct cgacgagggc ctggaactga actcccggaa 841201 gtaaccccca ataggcgcag aacggcaatg ttccttctcg acgccaacgt gctgctggct 841261 gcacaccgcg gtgaccaccc gaatcaccga accgtccgcc cctggttcga tcgactgctc 841321 gcggctgacg accccttcac agtgccgaac ctggtatggg cgtcgttcct ccggctggca 841381 acgaatcgac gcatcttcga gattccgtca ccgcgagcag aggcattcgc attcgtcgaa 841441 gccgtcaccg cccagcccca tcaccttccg acgaaccccg gtcccagaca cctcatgctg 841501 ctgcgaaaac tctgcgacga ggccgacgca tcgggcgact tgatacctga cgcggtactc 841561 gcggccatag cagtggggca tcactgcgcc gtggtgagcc tggacaggga tttcgcccgg 841621 tttgcctcgg tgcgccacat tcgcccgccg ctctagcgag cggtcctcaa gtacagtcgg 841681 cgaccggaca aaccgctgcg ccagacgatt caccgtcctc gcgtcaattc gagcagctac 841741 ggccgaaagc caagggcctt cttggtcggg gtgaaaaagt tcagacgcag cgacaccagc 841801 tgccacagct ggttgagcaa ctccagttcc tcggtgctgt cgtagcgcca gtggaacgcg 841861 tgtttgcgca ccacacggtt gttctttcga ctccacgtgc gcctggtcgt tcgtctggta 841921 caccaggcta ccgggctatc ggattcggcc ccaaacctca ggagccgcct ccggatccgg 841981 tgccgtttcc gccctaccag ccgaaggtgt gggactaaac tatctagggc aagtgcgggc 842041 catagtgggc gactgcgtca tccacatcat gccgatgggc accggggtcg agctgagcaa 842101 gttggccgat ctggcattgg atatcggccg cagtgtcgga tgctcggcct acgagaacga 842161 cttcacgctg ccggacattc caacgcagtg gcgcaaccag ccgctgggct ggtacacgca 842221 aggccttgcc ccctacctgc cggggctgtc ggacccgaaa gacgccgccg agggctgatg 842281 ggtgtgccgg cgacctctga gggcgagcag acgcataagc gcccaatttc gtgtcgaaat 842341 gggcgcttat gcgtctgctc gcgcgcgcaa cgtgtggatc accgcgctga agtccaggtc 842401 ggcgtggtcg gcggcgaatt tggcgtagat gtcggcggcg tggctgccca gcggggccgt 842461 cgcaccggtg gcggccaccg catccatcgc caggcccagg tccttgttca tcaacgcggt 842521 cgaaaacccg ggcttgaagt cgttgttggc cggtgaggtg ggcaccgggc ccggcaccgg 842581 gcaattggtg tgcaccgccc agcaattgcc ggtcgcgccg gtgatgacgt cgaacaacga 842641 ttgtgcggac agcccgagct tctcggccag cacgaacgcc tcggcgatcg cgatctgctg 842701 caccgccagc accatgttgt tgcacacctt ggcggcctgt ccggcaccgg cggcgccgca 842761 gtgaatgatc ttgcccgcca tgggctctag taccgggcgt gcccgccgta gcgtggactc 842821 gtcgccgccg accatgaatg ccagcgtcgc ggcggcggcg cccttcaccc cgccggagac 842881 cggcgcatcc agttggagca tgccgtgcga ttcggccagc gcgtgcacct cacgggcatc 842941 ggtgaccgag atcgtggagc tgtcgatgaa cagcgttgcc ggacgcgcgg cggccagcac 843001 gtcggtgtag cagcgccgga ccacctcgcc ggtgggcagc atggtgatga ccacgtcggc 843061 ctcggccacc gcttcgggcg cgctacgaaa caccgcgaca ccgtgcgcgg cggcgccgga 843121 cgccgccgtg ggtgccgggt cgaatccacg cacgacgtgg cccgcaccaa ccagattcgc 843181 cgacatcggc gcacccatgt tgcccaaacc taggaaggcg atggtcgtca tctgagcctc 843241 tctaaacggt ggcgcggaac cgcgcggcct cggcccgacc gatgaccagc cgcatgatct 843301 cgttggtccc ttccaggatg cgatgcaccc gcaggtcgcg gacgatcttc tccagaccat 843361 actcgcgcag atagccatag ccgccgtgca gctgcagggc ctggtcggcg acctcaaagc 843421 aggtgtcggt gacgtagcgc ttggccatcg cacacagctc gaccttgtcg gcgtcgtcgt 843481 catcgagcgc acttgcggcc cgccacaaca acattcgcga cgtctgcagc ccggtagcca 843541 tgtcggccag ggtaaaccgc acggtgggct cgtcgagcag cgatccgccg aaggcctgtc 843601 ggtcgcgaac gtaggcgccc gctttgtcaa aggcggcctg cgcgccaccc agcgagcatg 843661 ctgcgatatt gagccggccg ccgttgaggc cgctcatcgc gataccgaag ccggcgcctt 843721 cgccgtcggc gccgcccagc atggcctcgg cgggtacccg caccccgtcc agcaccacct 843781 gcgcggtggg ttgggcatgc caacccatct tcgcttcggg cgcgccgaaa ctcagccccg 843841 gtgtgccctt ttcgacgatg aacgccgaca cgccgcgcgg accctcggcg cccgtgcgcg 843901 ccatcaccac atacacgtcc gatgctgcgg ccccggaaat gaattgtttg acgccatcga 843961 gcacgtagtc gccgcctttt cctgagccgt gcctgacggc gcgggtgctc agtgcgccgg 844021 catcggatcc ggcgcccggt tcggtcaggc agtagctggc gatgacgccc atggtggcca 844081 gtcgcggaat ccagtccttg cgttgctcgt cggtgccgaa gctgtcaatc atccacgcgc 844141 acatgttgtg gatggacaaa aacgcggcgg tcaccgggtc ggcgatcgcc aactgctcga 844201 agatgcgcac gccgtcgagc cggcgcagcc cactgccgcc gacgtcgtcg cggcaataga 844261 tcgcggccat gccgagttcg gccgcttccc gcaacacgtc caccggaaag tgtttggcgg 844321 catcccattc cagggcgtgc ggagccaggc gtttgccggc gaaggcggcc gccgtctcga 844381 cgatcacccg ttcgtcgtcg ttaaggacaa acatgacacg ctaactcatt gtggggatga 844441 cgaattcggc accgtccttg atgcctgacg gccatcgcga cgtgacggtc ttgaccttgg 844501 tgtagaactg gattgccgcc gggccgtgct ggttgaggtc gccgaagccg gagcgcttcc 844561 agccgccgaa agtgtggtag gccaccggca ccgggatcgg cacgttgacg ccgaccatgc 844621 ccacctgcac ccgggagacg aagtcgcggg ccgcgtcgcc gtcgcgggtg aagatcgcca 844681 ccccgttgcc gtattcgtgc tccgacggca gccgcaacgc ctcttcgtag tcgcgggcgc 844741 gaaccatgca caacaccggc ccgaagattt cgtcggtgta gatcgacatg tgggcagcga 844801 catggtcgaa cagggtcggc ccgatgaaga agccgccctc caggttcgca tcgccttcag 844861 gcagcccaaa ggtcaggtcg tcgctggcgc ggtcgcggcc gtcaacgacc agctcggcac 844921 cggcggccac accctggccg atgtagtcgc gcacccgcgc cagcgccgcc ccggtgacca 844981 gcgggccgta gtccgccttg gggtccaggc tgtgtcccac ccgcaagtta ttgatccgct 845041 cgatcagcct ggcgcgcaac cgctccgcgg tctgatcgcc caccggcacg gcgacgctga 845101 tcgccatgca gcgttcgccg gcgctgccgt atccggcgcc gatcagtgcg tccacggcct 845161 gatccaggtc cgcgtcgggc atcacgatca tgtggttctt ggcaccgccg aaacactgcg 845221 cccgcttgcc ggtggcggcg gcaccagcgt agatgtactg agcgatatcc gagctgccga 845281 cgaagccgac ggccttgatg tcggggtggt gcaggatggc gtcgacggcc tccttgtcgc 845341 cgtgcaccac ctggaacacg cccgccggca ggcccgcctc gatgaacagc tcggccagcc 845401 tcaccggaac cgacgggtcg cgctcacttg gcttgagcac gaaggcgttg ccgcacgcta 845461 gggccgggcc ggccttccac agcggaatca tcgccgggaa gttgaacggg gtgatccccg 845521 cgaccacacc caggggctgc cgcagcgaat agacgtcgat gccggggccg gcaccctcgg 845581 tgtactcgcc cttgagcagg tggggaatgc ccaggcagaa ctcgattacc tcgatgccgc 845641 gctggacgtc gccgcgggcg tcggccagcg ttttgccgtg ctcacgcgac aacagctcgg 845701 ccaactcgtc gatggtgtcg ttgaccagtt cgataaaccg catcaacacc cgggcacggc 845761 gctggggatt ccatgcggcc cagccctttt gggcctcgac cgcggaggcc acggccgcgt 845821 cgatgtctga cttgccggcc atcggtacct tcgcctggat ctggccggtg ttggggtcga 845881 agacgtcggc cgagcgcgtg gactggccgg cggtgcgttg tccgtcgatg aaatgtgaaa 845941 tctgtgtggt catggttgtc ctgtgcaagc cggtggcggc ggggaatccc gatacttgga 846001 tatcctagta actgtggcgg atggctcgca aggcgaccga gccgacagcg tcctagcggg 846061 agacgcttgg atgctcgttg cattttggcc gatacccgca tctgttccgg cgctgcgctc 846121 catcatggct agtacgcgac aacacccggg ggtaagcgat gtcatttgtg atcgtggcgc 846181 gggacgcgtt ggcggcggcc gcggcggatc tagcgcagat cggttcggca gtgaatgcgg 846241 gcaatctggc cgcagccaat ccgacgaccg ctgtggcggc ggcggccgcc gacgaggtat 846301 cggcggcact cgcggcgctg ttcggcgcgc atgcccggga gtatcaggcg gcggcggcgc 846361 aggcagcggc gtatcacgag cagtttgtgc accgattgag cgcggcagcg acatcgtatg 846421 cggttaccga ggtgaccatc gcgacgtcgc tccggggggc gctgggctcg gcgcccgcgt 846481 ccgtttccga cgggttccaa gcgttcgtct atggtccgat tcacgcgacc ggccagcaat 846541 ggatcaacag cccggtcggc gaggcgctcg ccccgattgt caatgcgccg acaaacgtgc 846601 tgctcggccg cgatctgatc ggcaacggcg tcaccgggac ggcggcagct cccaacggtg 846661 gccccggcgg tttgctattc ggtgacggtg gggccggcta taccggcggt aacggtggga 846721 gtgccgggtt aatcggcaac gggggtaccg gtggcgccgg ctttgccggc ggagtgggcg 846781 gcatgggcgg caccggcggc tggttgatgg gcaacggcgg catgggtggc gcgggcggtg 846841 tcggcggtaa cggcggcgcc gggggccagg cgctgttgtt cggcaacggc ggcctgggcg 846901 gagccggcgg ggctggcggg gtcgatgggg ctatcggtcg tggcgggtgg ttcatcggta 846961 ccggcggcat ggccacgatc ggtggtggcg gcaacgggca gtcgatcgtc atcgacttcg 847021 tgcggcacgg ccagacgccg ggcaacgccg caatgttgat cgacacggcg gtgcccggac 847081 ccggactcac cgcgctgggc cagcaacagg cgcaggccat cgccaacgcg ctcgcggcca 847141 agggccccta tgccgggatc ttcgactcgc agttgatcag aacgcagcag accgccgcgc 847201 cgttggcgaa cttgctgggg atggccccgc aggtattgcc cgggctcaac gagatccatg 847261 ccggcatctt cgaggacctg ccgcagatca gccccgcggg cctgctgtat ctcgtcggcc 847321 cgatcgcctg gacgctcgga tttcccatcg tgccgatgct ggccccgggc tccaccgacg 847381 tcaacgggat cgtcttcaac cgagccttta ccggtgcggt tcaaacgatc tacgacgctt 847441 ccttggccaa tccggtcgtg gccgcagacg gcaacatcac gtcggtcgct tactccagcg 847501 cattcaccat cggggtcggg acgatgatga acgtcgacaa tccccatccg ctactgctgc 847561 tcacccaccc ggtgcccaac accggcgccg tcgtggtaca gggcaatccc gagggcggct 847621 ggacgctggt cagctgggac gggatacccg tcgggccggc gtcgctgccg accgcgttat 847681 tcgtcgacgt gcgcgagctg atcacggcgc cgcaatatgc ggcctacgac atttgggagt 847741 ccctgttcac cggcgatccg gcggcggtca tcaacgcggt gcgagacggt gccgatgagg 847801 tcggcgcggc tgtggtccag ttcccacatg cggtggctga cgacgtgatc gacgctacgg 847861 gccaccccta tctaagcggc ctgccgatcg gtctgcccag cctgatccca tgaccgcgag 847921 cgaccaatag gtccccacat ggcccggagg ccgctgccag cattgacccg acgatgccgg 847981 cccgcaggct tccctgatcg tgcggaacct gctcggccgt gcatgggaca tccagatcgg 848041 attgcctccg ggtacggcgt acgccggacc cggtcgccgg gacgataccg ggctagtgtt 848101 agctagcggt ggaaaaagcc cgacacgaaa tcgatcgaat taaagccacc agaatcctgc 848161 tttccagagt tcccgaaacc cgatgtggcg ctgttgttgt cgggattggc ttcaagatta 848221 ccgaagcccg actgtagaaa acccgtattg ccaaagcccg acatgaatcc actgaacagg 848281 ccggttccct tgttcccgaa acccgatgtt acactaaccg aattgttgta acccgttacc 848341 gattggcccg agttggcgaa gcccgagatt tgtaaattac caacgttttg ggcgccggag 848401 tttcccctac cagaattatt gaaacccgaa tttccactgc cggcgtttcc gaatcccgag 848461 ttttcgccca gcccatcggt agtattgccg aaaccggtgt tcaggttgcc cgcgttaaag 848521 ccgcccgtgt tgatattgcc agaatttgcg aagccggtgt tcgtcaggcc agagttcaag 848581 aaaccagaat tagcgtctcc tccgttgaag ctgcctgagt tgaatgcacc cgagttgaag 848641 ctaccggtgt taatgatgcc gccgttgaag ttgccggtgt tgaaatcgcc cgcgttccct 848701 atgccggtat tggcctgacc tgagttgcca aagccagtgt tgacgcttaa cgcgttcccg 848761 aagccggtgt tgataaagcc ggagtttccg aagccggtgt tgatgttgcc tgagttggct 848821 acgcccgtgt tggtgacgcc cgagttgccc acgccgaagt tgccgctgcc cgagttgaag 848881 aagccgatgt tcccggtgcc cgagttacca aatcctatat taccgctacc ggaattcagt 848941 ccgccaaagc cgatctgatt gctgccggtt aacccaatgc cgatattatt gttgccggtg 849001 ttcccgaagc cgaagttgta gctgccgctg ttcccgaagc caacgttgcc gtcgcctacg 849061 tttcccagac cgatatttgc gttgcccgtg ttaccaccgc cgaagtttcc attgccaccg 849121 ttgccaatcc cgacattccc attgccgggg gtgggcacgg cgggggaact catgtttgga 849181 cctgcatttc cgataccaat gttggcattg ccaaagttcc cgaagccgaa gttgttgtcg 849241 ccagagttgc cacccccgac gttcgcatta ccgatgttgc cgctacccac gttaaagctg 849301 gacggtccga tgccaacgtt tccgtttccg ctacccaggt tgaagttgcc gatgttgccg 849361 ctacccacgt tgaagctgga aatctggccg tggaaggcgc ttccgtttcc gccgccaaag 849421 ttggcgttac cgaggtttcc attacccagg ttgtaatcgc cggtattacc attgccgaca 849481 ttgaggttgc cgatattgcc gacacccaaa ttgatgtttg gcaacgccgg ctgccacgac 849541 gccagctgtg ccgctgccgc cgaggatgcg gcatggtagc cggccatcac cgccacgtct 849601 tgtgcccaca tctgctcgta ggtggattcc atggccgcta tggccggcgc gttttgcccg 849661 aacacattcg acaccgccaa caaccacgtc cgaacacgat tggcctgcac caccgccgga 849721 tgcaccacgc cggccagcgc ctcctcaaat gcacccgccg ccgcccgagc ctgccgcgct 849781 gccagctcgg cctgggctcc agccgcggtc aaccagctcg catacggccc cgcggcattc 849841 gccatcgcgg ccgacgccgg tccctgccaa gcgccgccgg ccaactccga cgtcaccgac 849901 ccgaacgacg acgccgccgc gtgcaactcc tcggccaacc cgtcccaggc ccccgccgcc 849961 gccaatagtg gccgtgaccc cgcacccaga tacatccgta gcgaattggt ctccggaggc 850021 aaccacgcga aaccgaccat cacggccccc tcacaccatt gacaaaccag gacgcctcga 850081 gcctaactac acaacgcgaa gggattggga cttctatcgg aattgcgccg cgtgcactgg 850141 ccgccggcct ttccccgcca gcctcggtgt ttcatgccgc ttgccgtggt ctgccacctg 850201 cgagttcgca tttgtgcaga gtcccgtcgg gagttgtcaa aactaaaacg ggcgatcttg 850261 atcgcatcgg aagcgcgaga ttgcgccctg agctgcgcct tcgtggagcc cccggtcagg 850321 attgaacgga cgaccgctcg cttataaggc tgttccggta ccgatcctta agccatcgag 850381 gccctcggcc tcatagcggg ccaaccaggt atgcagcgtc tgccgcgaca ccccaacttt 850441 ctcggcaacc tgcgagatcg acaacccgtc gctgatcacc gccaacacgg cttgataccg 850501 ctgttctgcc acactcaact ccttcatcga aggagtgtca aggatcagcc gaaccaactg 850561 tcaagcatca gccgaaacat cgtcaggcat cacccgaacc caaaacgtca agcatcagcc 850621 gaggtactac acgaacgctt gagccccctg tcaggattga actgacgacc gctcgcttac 850681 aaggcgagtg ctctaccact gagctaagga ggccgatgaa atcgctgtga gtctagccgc 850741 tcactcgctg tcgacgacgc gttgcgaacg caccgaccgc gacgacgagc ggcgcgcggg 850801 acggcgcccg ggcagtggaa tgcgctcggc gatgctgctc agcgggttga ccaccatggt 850861 aagtgcgatc acagcgtctt gcagcgtcgc gatggccggc tcgagcgcct ccatcccggg 850921 tgtcagccgc gccaacgtgt cggcgacgtc ggcgagctgt tcgagcggtc cgtccttggc 850981 cgttatcttg tcgatcagtc cgccttcggc cagcagccgg tcggccagcc cgtcttcgga 851041 gagcacccgc tcgataagtc cgtcctcggc gagcagctgg tcggccagtc cgcccggttg 851101 cagcgcgcgc tgcatggcgc cgccttcagc ggtcaggcgg tcgagtaagc cgccgggctg 851161 ggtcagcagg tcgaccaccc cgccgggccg cagcatccgg tccatcggcc cgttgggcgc 851221 gatggcgcgt cccagcggca tatcgtcgtc caatagcctg gccagccggt tggcgcgggc 851281 aatcgtgtca tcgattccca gcatgttggc cattgaggtc gacccgcttg cgccgccggc 851341 atcacccaac gcttgtttgg ccatgtcaac cgccgcgccg gccatgttca aaccggtgtc 851401 ggcggcggcg agccccgctc gtgcgggcca ggtcgcaata cccacgaggg tttggccgag 851461 gttcattctg cgagtgtatt cacggcgcgc cgtggattga gcggcaacgg tccaagctga 851521 tttggcgatt cctggcagac tgttagcaga ctactggcaa cgagctttca ggaattacac 851581 aatgactgtg aaggtaacgt tcaaccaatg cggaaagggg ttgatctcgt gacggcggga 851641 accccaggcg aaaacaccac accggaggct cgtgtcctcg tggtcgatga tgaggccaac 851701 atcgttgaac tgctgtcggt gagcctcaag ttccagggct ttgaagtcta caccgcgacc 851761 aacggggcac aggcgctgga tcgggcccgg gaaacccggc cggacgcggt gatcctcgat 851821 gtgatgatgc ccgggatgga cggctttggg gtgctgcgcc ggctgcgcgc cgacggcatc 851881 gatgccccgg cgttgttcct gacggcccgt gactcgctac aggacaagat cgcgggtctg 851941 accctgggtg gtgacgacta tgtgacaaag cccttcagtt tggaggaggt cgtggccagg 852001 ctgcgggtca tcctgcgacg cgcgggcaag ggcaacaagg aaccacgtaa tgttcgactg 852061 acgttcgccg atatcgagct cgacgaggag acccacgaag tgtggaaggc gggccaaccg 852121 gtgtcgctgt cgcccaccga attcaccctg ctgcgctatt tcgtgatcaa cgcgggcacc 852181 gtgctgagca agcctaagat tctcgaccac gtttggcgct acgacttcgg tggtgatgtc 852241 aacgtcgtcg agtcctacgt gtcgtatctg cgccgcaaga tcgacactgg ggagaagcgg 852301 ctgctgcaca cgctgcgcgg ggtgggctac gtactgcggg agcctcgatg agtcttggta 852361 gttaatcgga tcggcagccc gaggagaacg cggcaatggc cagacacctt cgaggaaggc 852421 tgcccctacg ggtacgcctg gtcgcagcca cgctgatcct ggtggccact ggacttgtgg 852481 cctcggggat cgcggtcacc tcgatgttgc agcaccggct gaccagccgg atcgatcggg 852541 tgttgctcga ggaagcccaa atctgggcgc agatcacgct gcccttggcg ccggacccct 852601 accctggtca taaccccgat cggccgccgt cgaggttcta cgttcgggtg atcagccccg 852661 acggccagag ctatacggca ctcaacgaca acactgccat accggcggtg cccgccaaca 852721 atgatgtcgg ccggcacccg acgacgctgc catcgatcgg cggatccaag actttatggc 852781 gcgcggtctc ggtgcgcgcg tcggatggct acttgaccac cgtcgccatt gatctggccg 852841 acgtccggag caccgtgcgg tcactggtgc tgttgcaggt cggcataggc agtgcggtgc 852901 tggttgtccc cggggtggcg ggctacgctg tggttcgccg cagcctgcgg ccgctggcag 852961 aattcgagca gacggccgcg gcgatcggcg cggggcagct ggatcgccgg gtcccgcagt 853021 ggcatccgcg aactgaggtc ggccggcttt cgttggcgct caacggaatg ctggcacaaa 853081 ttcagcgggc ggtggcgtcc gcggaatctt ccgccgaaaa ggcccgggat tcagaggacc 853141 ggatgcgaca gttcatcacc gacgccagcc atgaactgcg taccccgttg accactatcc 853201 gcggcttcgc ggagctgtac cgacaaggag ccgcccgcga cgtgggcatg ctgctgtcgc 853261 ggattgagag cgaagcgagc cggatggggc tgctggtgga cgatttgctg ctgcttgccc 853321 ggctagatgc gcaccggccg ttggaactgt gccgggtgga cctgctggcg ctggccagtg 853381 atgccgcgca cgacgcgcgg gcgatggacc ccaaacgcag gatcaccctg gaggtccttg 853441 acggccccgg caccccggag gtcctcggcg acgaatcgcg gcttcggcag gtgctgcgca 853501 atctcgttgc aaatgccata cagcacaccc cggaaagcgc cgacgtcacc gtgcgagtcg 853561 gcaccgaggg cgacgacgcc atcctcgagg tcgccgatga cggtccgggc atgagtcagg 853621 aggatgcgct gcgggtgttc gagcggttct atcgcgccga ctcgtcgcgg gcgcgcgcca 853681 gcggcgggac cggactgggg ttgtcgatcg tcgactcttt ggtggcggcc catggcggag 853741 cggtcaccgt gacgaccgcg ctcggggagg gttgctgctt tcgtgtctcg ctgccgcgcg 853801 tcagtgacgt ggaccagctg agcctcacgc cagttgtgcc agggccgccc tgatcttggc 853861 ctgcgcttcg tccagcgatc ccggtgaggg gttgcggtcg acgttggcaa agccgaaatc 853921 actgaggctg cgggtgggaa acacgtggat gtgtaggtgg ggcacttcca gcccggcaat 853981 gatcatcccg gcgcgttggg ttgaaaacgc ccggcacacg gccttgccga tcagctggct 854041 caccgacatg acgcggccaa ataacgcggg atccacgttt tgccagtggt cgatttcggc 854101 gcgtggcacc accaaggtgt ggccttgcgt catcggctca atcgtcaaga acgccacgac 854161 gtcgtcgtcc tcgtagacga aacggccggg cagttcacgg ttgatgatct tggtgaagat 854221 cgacacccgt tgagcatatg acgtcgcaac ggcccccccc caggtttcat tcctggttac 854281 cgaaggtcat catgtcgagg ttccagtacc cacgcatgtt ggtgatcaaa ccggccttat 854341 tcacccggta ggtgaacacg ccgcggacct cactggtaaa gccgccgtca aactcgctgt 854401 gcaacaccag aatgtgggcg atctcgtccg gtgagctgga cgggaacgtc tcctcgcagg 854461 tgaccgtcaa ccgattggcc gcaatgtgtg tgtcgaagaa ggcgccgacg gcctccttac 854521 ctttgatgcc gctgccatcg ggattggtga cggacttgcc gatcggatcc tcgatgacga 854581 cgtcgtcggc catcagcgcc agccagccct cccggtcgtg ggcttggacg caccgccacg 854641 acgactgcga cgcgatcagg gccggggatt gggtcgtttg ggtcatggct atctccggct 854701 agcggtcgtc gtccgtgtac cggatcacgc cgcgaatgtt cttgccgttc agcatgtcct 854761 ggtatccgtc gttgatctgc tccagcttgt acgcagtggt caccatgtcg tcgaggttga 854821 gtttgccggc cttatacatc gacaacagct tcggaatgtc gtagtgcggg ttgccgccgc 854881 cgaagatggt gccctggatg ttcttttgca gcagggtcaa catcgcgagg ttcagcgtca 854941 cctgggtgtc gaccaggctg ccgatggccg tcagcacgca ggtgccgccc ttggccgtga 855001 tggtcagata gctgtcgacg tcggcgccat cgagcttgcc gacggtgatg atcaccttct 855061 gcgccatcag gccgtaggtg acctcggcaa tgcccatcag cgcggcgttg atgtccgggt 855121 agacgtgggt ggcaccgaat ttcagagcct gatcacgttt ccattccacc ggctccaccg 855181 cgaagacgta gcgggcgccc gcgctgaccg cgccctgcaa cgccgccatg ccgaccccac 855241 ccaagccgac gatggccacg tcgtcgcccg gccggacgtc ggccgtgcgg accgccgaac 855301 catagccggt ggtgacgccg caaccaacca ggcaggcgac ttcgaagggc accgacgggt 855361 cgatcttcac caccgagctg cggtgcacca ccatgtacgg tgaaaacgtt ccgagcaggg 855421 tcatcgggta gacgttctgg ccgcgagcct gaatccggaa ggagccgtcc gtcacagatt 855481 ccccggcgag cagccccgcc cccaggtcgc acagattccg cattccagcc tggcaggacg 855541 gacacttgcc gcaggacggg atgaatgcca acaccacgtg atcgcccggg gcgaagtcgt 855601 cgactcccgg gccgacctcg gtgacgatgc ccgcgccctc gtgtccgccc agaacgggaa 855661 agcccgccat cgggatgtcg cccgtcacca ggtgatggtc ggagcggcac atcccagccg 855721 cttccatctg gatcttgact tcgtccttgc gcgggtcgcc gatttcgatc tcttcgacgg 855781 accatggctg gttgaactcc cagatcagtg cgcccttggt cttcaccgca aacctgcttt 855841 catcgttgaa cttcggctac gagtggtccc tagcctcggc cggaacgccg actggctgag 855901 tgtaggtcaa cggcgctagg gcgtttacca cagtggcacc ggcgtcttgc cgagcgggta 855961 atagcccggc actttgttgc ccgacacggc gcgttcgatg cgcttctgca tgcctggcga 856021 tagcttgccg gccttgatca attccaggta gagcgcggaa acatgaccga agtcgaagaa 856081 atcgcgttgc cagttccatt tcccgccgcc ggcataccgg aaccagctgc cgccaatgcc 856141 gtacacctcc tgctcggcgc cattggcgtc ggtggcaacc tgtttccaga acccaaccac 856201 ctcgccctgt ttctcgtcga tgacgacccg ttgatagggg tagcgccagc cctgcaggcc 856261 gtccatttcc tggcccagcg caatgtcgcg gatctcgtcg atgccgacgc acatcacgtc 856321 ctcgttggga ccgacgttcc agccgtaggt ggcgtcgtcg gtgtagaagt cggccagcaa 856381 cgtccagtcg ccgcgccgct ccgccgtacg gttggcctgt aaccagcggt gaaccacatc 856441 ttcgagttcg tcgcgaggat agccggccac ggttactctc ccgtttctcg gatggacagt 856501 gcctgggtgg gacaggccca cacggcatgc ttgatcacgc cgcgggcttc ctcgggcggc 856561 tcggggtcga ggatttcgac ctggccgcgc ttgggcaccc ggaaatactc gggtgcctcc 856621 agctcgcaca tcgcgtgtcc ttggcacaga tcccggtcgg cttcgactcg atagcccatc 856681 gttaaactcc cgttcgccgg cggtagcgca cgcaagcggg ctgggccaac tgcaccacca 856741 tcttcgaatg gtcgttacga tagctttctg gcggttgcgc catctcaaac tcatactcgc 856801 gcaacaacac cgagaagatc gctttgatct gcatgatggc gaacgccgcc cccacgcaac 856861 gatgccggcc ggcgccgaac ggaatccacg tccagcggtt gagcagatct tcctggcgcg 856921 gctgctcgta tcgtgctggc acgaagtcgt ggggatcggg gaagtcttcg gggatccggt 856981 tggagatcgc cggggaggcc gccaccagat cgccctcatg aatccggtgg ccttgcacct 857041 cgaactcgcc cttggccact cgcatgagga tgatcagcgg agggtgcagg cgcagcgtct 857101 ctttcagcac gttttccagc tgcggaatct ggcgcagcgc atggaaactc accgatcggc 857161 cgtcgccgta cagctcgtcg agttcgtcga tcacggccgc gtaggcgtcg cgatggcgca 857221 tcaactcgat cagcgtccac gaagccgtac ccgagctggt gtgatggccg gcgaacatca 857281 tcgagatgaa catgccggtg atctcgtcgg ccgagaaccg gggagtgccg gtctcagcct 857341 tgacggcgat gagcacgtcg agcatgtcac ggtcgctctt gtcggtgggt gggttggcga 857401 tccggccgtt catgatgtcc gcaaccagtg ccaccagacc attgcgggct tcgtcgcggc 857461 gacggaagct ctcgatcggc agatacgggt cgacgtaggc tagtgggtcg gtgccgcgct 857521 ccaactcgtg atagagcttg gcgaatcgcc cgtcgagctg gtcgcggaac ttcttgccga 857581 tcaggcaggc cgaggaggtg tagatggtca gctcggcgaa gaagtccagc agatcgatct 857641 cgccggcctc accccagtcg gcgatcatcc gtcggacttg atcttcgatg gtggcagcgt 857701 ggcccttcat ctgctcgccg cgtagcgcgg cattgtgcag catctcttta cgccgttccg 857761 ggctggcgtc gaacaccacg ccctcgccga agatcggcgt catgaacggg tatgccttgg 857821 cctggtccag gtcgtcgtcg cccgcccgga agaagaattc gttggcgtgc gagccggaca 857881 gcagcacgac ctgcttcccg gccagctgga aggtaccgac gtctccgcat tcgtcgcgga 857941 cccgttgcat cagcccgatc ggatcggtgc ggaactcctc gaggtggccg tgttcgtcgt 858001 ggccacccga aacccggggt agtgcaacag cgctcattag cccggcatcc cctcttcgcc 858061 cagtactagc ttctgacggt gcgcgggcgc atccctcagc ggggcctccg gttgaatctc 858121 catgttcacc acgacacacc cgcgcggcgt ttctgcgacg aacgcgatag cgcgtgccag 858181 gtcgctgggt cgcaagaagt agttgtgccg ggcctgcccc cactttgccc agtccgccag 858241 cattgggccg acttgttcgg ccgacagctg ccagcccata ccggtcagcg tgggtcccgg 858301 atgcacgatc gatgcgcgaa caccggtgcc ttccaactcc atctgcaggt tggtgaccat 858361 agcggccaga ccggccttgg cggcgccgta ggcacccata tgcgggcgtt ggcgcaggcc 858421 cacatcggat ccgacgaaga tgaggtcacc tcgccggcgt gccaccatgg ccggtagcac 858481 ggccgtggcc agccggttgg caccgaccag gtgtatctga acctgctctg caaaggcctc 858541 ggtgctgacc tcgtgcagct gtcccgggag catgtcgcct gcactggaca ccagcagttc 858601 gacctcgccg agtgcctcga ccgtttgcgc cacaaacgat ttcaccgact cgggatcggt 858661 cacgtcgagg gggaaggcta ccgcctcgcc accgtcggcg cggattttgt cgaccagctc 858721 ggccaacttg tccatgcggc gggcccccaa ggcgaccgga aacccgcggc cggcgagttc 858781 ggttgcggtg gccgcgccga tgcccgacga tgcgccggcg acgacggtgg tccgccgggc 858841 ggggtgaggt tcgaagcgtg gcattacctg gcctgcacgc tgatcggcag atgggcaaat 858901 ccgcgcacgt tgctggaatg gacgcgcacg acgttgtcgt cgtcgacttc gtagttgcgg 858961 atccgacgca gcagcgcgcc cagggccacc cgggcttcca tccgggccag gtgagccccc 859021 agacagaagt gggcaccgct gccgaaactg actagtttgc agccgatttc gcggccgatg 859081 cgatagtcgt ccgggtcgtc gaacacccgg tcgtcacggt tggccgatcc cggtagcagc 859141 agcaacacct caccctcggg gatcgtggtg tcgtacaacg tgagatcgtg cgcgacggtg 859201 cgggccagaa tctggctgga cgtgtcgtag cgcagggttt cctccaccca catcggaatc 859261 cgggagtggt cggcgaatac gcgggccagc tggccagggt ggtgggcggc ccagtagacg 859321 gcattggcca gtagcttggt ggtggtctcg ttgccggcga tcaccatgag aaacaggaac 859381 gccatgattt cctggtcgga aagccggtcg ccgtcgagct cggctgccag cagtgccgac 859441 gtcagattgt tcgcgggccg ccgccggaat tccgcgatca ggtcagcgta atatctcatc 859501 agctcgatcg acgccgccat cgccggcggg ggcacatcgg ccacgccgtc ctcgcggtgc 859561 agcaccgcat cggccagcgc gcggatgcgg gcccggtcgg tgtcgggcac gcctatcagc 859621 tctgaaatca catccatcgg cagcttgcca gcgaattctg ctacgaaatc gaaactttcg 859681 gtttgcaggg ccgaatccag gtgaatgcgg gcaagttcga gcacctgcgg ctcgagttca 859741 cggatccgcc gtggggtgaa gcccttggac accaaggtac gcatccgcag atgtgcgggg 859801 tcgtccatgg ccagcatcga cattacccgg tacgcctcag aagtgcgtga ggacggatcc 859861 agggataccc cataggcatt cgacaacgcc gtgctgtccc ggaagccttg cagcacgtcg 859921 tggtgccgcg acaccgccca gaaattgcgt tcctcgttac ggtacagcgg ggcctcgtcc 859981 cgcagccgac gataatacgg gtacgggtct tcgtgaaagt cgtagtcgta ggggtccagg 860041 accagttcgg ggtcaccgac gcggacggtc attcgctgcc accagtgctc ggctcgttag 860101 ctccggccaa gatcaggccc accacgtacc cgagtcgatc ggcgatctcg tggtaggtga 860161 aggtgccgct gccggcctgt acgagcgctc cgaagaacgc catctcgagt gcgaacacgg 860221 taccgggatc ggcgccaggt ccgatcgccg atgtgatgcg gcggtggatc tcggcgccga 860281 ttcggtcgcg caccgcacgc accgcggggt cggcgccgcc gtcgagcagc gccgccgtgc 860341 acgccgcgcc gatttcgggt tcgtcggcaa ccaccagcgc caggtgtcgc aacgagctcg 860401 tcacccggat aggcatcggg acgttgacgt cggtgacgca ggggacctgg cggaccaggt 860461 cgaggtagac ctcggcgatc agatggttct tcgacgagaa gtatgtgtag gccgtcgccg 860521 gggctacctt ggcgcgggcc gccaccaggc gcaccgtcag gtcggcgtat gacttctccc 860581 gcagggtcgc catggcggca gctagcacct tgcggaaggt tgcctgctgg cggcggttgc 860641 gtgacaccgc ttcggcgtgg ggttcggttt ggcgctgggc cggggtggta accagtacat 860701 cgctggacac atgtccaagc tatcggatgg tcgcggcagg aggcaagcca gtctgctaaa 860761 catgcagcta acatgggact gtccgcgacg cgacgtggcc cctggtgcat cggtcaggac 860821 ggtgtagcgg ccttgcggat acggtctcga tgaggcaata tcggacaagt gtccaatcga 860881 tgatgagagt cggagaagtt gggagcggta gatggccctg tggggcgacg gaattagtgc 860941 gctgctcatc gacggcaaac tatcggacgg ccgtgcgggc accttcccga cggtcaatcc 861001 ggccaccgag gaagtgctgg gagtcgccgc cgacgccgat gccgaggaca tgggccgcgc 861061 catcgaggcc gcgcggcggg cgttcgactc gaccgactgg tcccgcaata ccgaacttcg 861121 ggtgcggtgt gttcggcaac tgcgcgacgc aatgcaacag cacgtcgaag aactacgcga 861181 actgacgatc tccgaggtgg gcgcgccgcg gatgctcacc gccagcgccc agctggaagg 861241 cccggtcggg gatctatcgt ttgcggcgga cacggccgag tcctacccgt ggaagcagga 861301 cctcggcgag gcatcgccgt tgggcatcgc cacccggcgc accctcgcac gggaggccgt 861361 cggtgtcgtc ggcgccatca ccccgtggaa cttcccgcac cagatcaatc tcgccaagct 861421 aggtccggcg ctagccgcgg gtaacaccgt cgttttaaag ccggcgcctg acacaccgtg 861481 gtgcgcagca gcgctcgggg aaatcatcgt cgagcacacc gacttcccac cgggcgttgt 861541 caacatcgtc acctccagca gtcacgcttt gggggcgctg ttggccaaag accctcgggt 861601 ggacatgatt tcgttcaccg gttctactgc gaccggccgt gccgtaatgg ccgatgccgc 861661 ggccaccatc aaaaaggttt ttctggaact gggtggcaag tcggcgttcg tcgtgctcga 861721 cgacgctgac ctagccgctg ccagcgcggt atcggcgttc tcggcttgca tgcacgccgg 861781 gcaggggtgc gcaatcacga cccggctggt ggtgccacgg gcccgttatg aagaggcggt 861841 tgccatcgcg gcagccacca tgtcgtcgat caggcccggc gatcccaacg accccggaac 861901 cgtttgcggg ccgttgattt cggcccgaca acgggatcgt gtgcagggct acctcgacct 861961 ggcggtcgcc gaaggcggaa ggttcgcatg cggtggcgcg cggccggcgg atagagaggt 862021 cggtttctac atcgagccca cggtcatcgc agggttgacc aatgacgcca gagtcgcccg 862081 agaggagatc ttcggaccgg tgctcacggt gattgcccac gacggtgacg atgatgcggt 862141 gcgcatcgcc aacgactcgc catacggctt gtcgggcacc gtgtatggcg ccgacccgca 862201 gcgcgccgcg aggattgcct cgcggctgcg ggtaggcacc gtcaacgtca atgggggtgt 862261 ctggtactgc gccgacgcgc cgttcggcgg ctacaagcaa tccggtatcg gacgcgagat 862321 gggtctcctc ggcttcgagg agtacttaga agccaaactc attgctaccg ctgcaaatta 862381 gctagcgggt tgacagcgca gaaaggaagc catgttcgac agcaaggtgg ctatcgtcac 862441 cggggctgcc cagggtatcg ggcaggccta cgctcaggcg ttggcccgcg aaggtgcctc 862501 ggtggtcgtc gctgacatca acgccgacgg tgccgcggcg gtagccaagc agattgtcgc 862561 cgacggcggt actgcgattc atgtgcccgt tgacgtgtcc gacgaggatt ccgctaaagc 862621 catggtcgac cgcgccgtcg gtgctttcgg cggcatcgac tatctggtga acaatgcggc 862681 gatctacggt ggcatgaagc tcgatctgtt gttgaccgtg ccgttggact actacaagaa 862741 attcatgagc gtcaaccacg acggcgtgct ggtgtgtacc cgcgcggtgt acaagcacat 862801 ggccaaacgg ggcggcggcg cgattgtcaa ccagtcctcg accgcggcct ggctgtattc 862861 caacttctac ggcctggcca aggtcggtgt caacgggctg acgcagcagc tggcccgcga 862921 gctgggcgga atgaagataa ggatcaatgc gatcgcaccc ggaccgatcg acaccgaagc 862981 tacccgcacc gtcacccccg cagagctggt caagaacatg gtgcagacca tcccgctgtc 863041 gcggatgggt acaccggagg atctggtggg catgtgcctg ttcctgctgt cggattcggc 863101 atcgtggatc accgggcaga tcttcaatgt cgatggcgga cagatcatcc ggtcatgacc 863161 ggcgccggcg ccgatgcaga gcggggcgat gaggtggggg cacgccccca caagtgggag 863221 gtacccccat ccgctggcgg gggagagcgg cgctcatgac cgctcacccg gagacaccac 863281 gcctgggata tatcggcttg ggtaatcaag gcgcgccgat ggctaagcgt ctgctcgatt 863341 ggcctggcgg actgaccgtt ttcgatgtgc gggtcgaggc catggcaccg ttcgtcgagg 863401 gcggcgccac cgcagcggca agcgtctccg acgtcgccga agccgacatc atcagcatca 863461 ccgtgttcga cgacgcgcag gtgagttcgg tgatcaccgc cgacaacgga ctggcgacgc 863521 acgccaagcc cggcactatt gtcgcgattc actccaccat cgccgacacg acagcagtcg 863581 atctggccga aaagctcaag ccgcagggga tccacatcgt ggatgcaccg gtcagcggcg 863641 gcgcggcggc ggccgccaag ggtgagttgg ccgtgatggt cggcgctgac gacgaggcgt 863701 tccagcggat taaagagcca ttttcgaggt gggcttcgct gttgattcat gccggggaac 863761 cgggcgctgg cacccggatg aaactggcgc gcaacatgtt gactttcgtc tcttatgccg 863821 ccgccgccga ggcgcagcgg ctggccgaag cctgtggctt agacctcgtg gcgctcggga 863881 aggtggtgcg gcacagcgac tcattcaccg gcggcgcggg agcgatcatg ttccgcaaca 863941 ccactgcgcc gatggagccg gctgacccgc tgcggccgtt gttggagcac acccgcggcc 864001 tgggtgagaa agacctgagt ctggcgttgg ccctgggcga ggtggtatcg gtcgacctgc 864061 cgctggccca gctggcgctg caacggctgg ccgccggcct cggggtaccg cacccggaca 864121 ccgagccagc aaaggagaca tgatggacga gctgcgccgc accggcctgg acaaaatgaa 864181 cgaggtttac gcctgggaca tgcccgacat gccaggtgag ttttttgccc tgaccgtcga 864241 tcacctattc ggcaggatct ggacccgtcc cggcctgtcc atgcgggacc ggcggatggc 864301 cgtgatcgcg gtgctgaccg ctcaaggcca gtcggatctg ctcgaggtcc aagtcaacgc 864361 cgtcctgcat aacgacgaac tcaccataga cgagctgcgt gaactcgctg tgttcattac 864421 ccactatgtc ggcttcccgc tgggctcgcg gctgaacagt gcgatcgagc gggtagcggc 864481 caagcgtaag caggcggccg agaacggctc gctgcccgac acgaaagcca acgtcgccga 864541 agttcttgct aaggaatctg gtaaatcgag ctagtctgac gtgtcgtgcg cgtcctggta 864601 atcggttcgg gtgcccgcga acatgcgcta ttgctggcgc tcggcaaaga cccgcaggtt 864661 tcggggctaa tcgttgctcc cggcaatgca ggcaccgctc ggatcgccga gcagcacgac 864721 gtcgacatca cctccgccga ggcggtggtc gccctggctc gcgaagtcgg cgctgacatg 864781 gtggtgattg gccccgaggt accgttagtg ctcggggtgg ccgacgccgt gcgcgcggcc 864841 ggcatcgtgt gtttcgggcc cggtaaggac gcggctcgca tcgaaggctc caaagcattc 864901 gccaaggacg tcatggcggc ggccggtgtg cgcaccgcga acagcgaaat cgtagacagc 864961 ccagcgcact tggacgcggc cctggaccgg ttcgggccgc ctgccggtga cccggcctgg 865021 gtggtcaaag acgaccggct agccgccggc aagggtgtgg tggtgacagc ggaccgcgat 865081 gtcgcgcgcg cacacggagc tgccctgctc gaggccgggc acccggtgtt gctggagtcc 865141 tacctggacg gcccggaggt atcgctgttc tgtgtcgtcg accgcaccgt cgtggtgccg 865201 ctgctgccgg cacaggactt caagcgagtc ggtgaggacg acaccggact taacaccggc 865261 ggtatgggcg cctacgcgcc gctgccgtgg ttgcccgaca acatctatcg ggaggtggtc 865321 agccggatcg tcgaacccgt tgcggccgaa ctagtccggc gtggaagctc gttttgcgga 865381 ttgctgtatg ttggtctcgc gattaccgcc cgcgggccgg cggtggtcga gttcaactgc 865441 cgattcggcg atccggagac ccaagccgtg ctggccttgc tggagtctcc gctcggccaa 865501 ctgcttcatg ccgccgctac cgggaagctg gccgatttcg gcgagttgcg gtggcgtgac 865561 ggtgtggccg taacagtggt actggcggcc gaaaactatc ccgggcgccc ccgggtcggc 865621 gacgtcgttg tcggctccga agccgagggg gtgctgcacg ccggaaccac gcggcgcgac 865681 gatggcgcga tcgtttcgtc cggtggccgg gtgctgtcgg tggtgggcac cggtgccgac 865741 ttgtccgcag cacgcgcaca cgcgtatgaa atcctcagtt caattcggtt gccaggaggt 865801 catttccgca gcgatatcgg tttacgggcg gccgagggga agatcagcgt ctagcaggct 865861 gcggcttggc catcacggcg gggatcgctg gccgcgaggt acccatcgtc gagccgccag 865921 attgcctgac aactcccgaa ctggctgtag tccgctactg cgaccaagtc atgcccacgc 865981 tgccgcagtt catcgagagt tgaatccggg aagccgtttt cgaaactgac ccgcataccg 866041 ttcacccagc ggaaccgagg gccgtcacag gccgcctggg ggttctggcc gtagtcggcg 866101 atgcgcacca gcacctgcac gtgaccctgg ggttgcatca tgccgcccat caccccgaag 866161 ctcatcaccg gcgcaccgtc gcgggtcaca aaacctggga tgatcgtgtg ataggggcgc 866221 ttccgtggcc caacccggtt cggatgtctc ggcaccacag tgaaatccga gccgcgattg 866281 tgcagcgaaa tgccggtgcc gggcaccacc acaccggagc cgaacccaag gtagttcgac 866341 tgaatcatgg acaccatcat tcccgcagca tcggcggcgg ccagatagac ggtgccgcct 866401 cgcgggatgc cggtggccgc cggcattgcc ctctttggat cgatcagcgt ggcgcgctgc 866461 cgcagatact ccttgtcgag caggcgcttc gggtgcaccg gcatgtagtc gatgtcggcg 866521 acacacgctt gcgcgtcggc gaaggcaagc ttcagtgctt cgatctgcac gtgcacactt 866581 tcagcggaat ccactgacca cgatgacata tcgaaatgct cgaggattcc gagggcgatc 866641 aaggccacga tgccctggcc gttgggcggt atctggtgga tggtgtaccc gcggtaggtt 866701 cccgtgatcg tgtcgaccca gtccacgcga tgggcggcga ggtcgtcggc acgcatcacc 866761 ccgccgtttg ccgccgagtg cgcctcgagt ttggcggcca gctctccccg gtagaactcc 866821 tcaccgttgg tcgccgcgat cttctctagc gtcgccgcgt ggtcaggaaa ggtaaacagc 866881 tcaccgggtt tcggcgctcg tccgccgggc atgaacgcat cggcgaatcc gggctgggat 866941 gcgaacaacg gcacctgtgc cgcccattgt gccgcgacgg tcggtgagac cagaaagccg 867001 ttgcggccgt acgagatggc gggctcgaag agtgtttcga atggtagcct gccgaacctg 867061 gcgtgcagtt ccacccaggc cgacaccgca ccgggcaccg tcacggagtt ccagccgagc 867121 acgggaacgg cgttgccgcc gaagtactct ggcgtccacg ccgagggtga gcggccggac 867181 gcgttcaggc cgtgcagttt ttgcccgtcc cagacgatgc tgaaggcgtc cgagccgatg 867241 ccattggaca ccggttccac cacggtgagg gtgatggctg tggcgacggc ggcgtcgacc 867301 gcgttgccgc cgtcggccag catccgaaga cccgcttgcg cggccagcgg ttgtgacgtg 867361 cacacgacgt ttgtcgccag gatgggcatg cgcggccaag cgtaggggaa ggtccaacca 867421 aacggcgtgc tcacgccgct taacctgtga gcagcggcgc gaaccaggtc agctcggcgg 867481 gtagctgtgc gctccagaac ccaccgttgt gcccgccagg ggagaagccg cccgccggcg 867541 ggtggggcag ctgcgccacg aactgcttgg ttgcggcata aaacggatcg ctgttgccgc 867601 aatcgacccg gatcgggatg gaccccaatg cggggagtcc gaaaaccgag ttcgccgacc 867661 agtcgtcggg tccgtcgaag gagccgggtg cgacggaacc ggcggatagc cacagtgccg 867721 ggctgaccgc gcagatcgct gcggtgcgtg ccggtccaag gcggctgccg agcagcaaag 867781 cgccgtagcc gcccatcgac cagcccagaa acgctacccg ggaggtgtcc agccgctggg 867841 tgtccaatag cggaatgagc tcgttgagca ccattgcccc cgcgtcctcg ccagaagccc 867901 gctggtgcca gtagctgctg cctccgtcca cggagaccac cgcgaacggt ggcaacccgg 867961 cgttgacggc ctgggccagg ccctgctcga cgccgccgtc catcacggcc gatgcgctac 868021 cgcccaagcc gtgcagtgcg atcacgggcc gcaacgcctg ggtctggccg ggtgggcggg 868081 cgatggccca gttggtcatc ttcccggcgc gcgctgccga cacgaacgag ccggtggaca 868141 tcgtcggcgc cgcctgagcc gggggggccg gatcgagtgc tggtgtcggc gccaatggaa 868201 cgtttgtgcc aatcgccgcc gccggtgcgg catgtgaagt tcggggctgc aacagcatgt 868261 cgatcgcata tgctgaggta gcgccaagga ccgtgccggc gccgagaccg agcacggcgc 868321 ggcggctcaa ctctggcatg cgggccatca tgccatggac gtttggccga attggcaatg 868381 cagtaccact ttgactggca gcatggatgg gcgtgacagc agcggtcact ccaaaaggag 868441 aacgtcggcg gtatgcgttg gtcagcgccg ccgcggagct gctcggcgag ggcgggttcg 868501 aggcggtacg ccaccgggcg gtggcgcggc gggccggttt gccgttggcg tctaccacct 868561 actacttctc gtcgctcgac gatttgatcg ctcgcgcggt cgaacacatc ggaatgatcg 868621 aggtggctca gctgcgagcc cgggtcagtg cgctgtcccg gcgacgtcgg gggcccgaga 868681 ccaccgccgt tgtgctggtt gacctgctgg tgggggaaat gtccagtccg gggcttgccg 868741 agcagctgat ctcacgatac gagcgccata tcgcctgtac ccgcctgcct gacctgcgcg 868801 aaagcatgcg ccgcagcctg cgtcagcgcg ctgaggccgt ggccgaggcc atcgagcgct 868861 ccggccgctc cgcacagatc gaactggtgt gtacgttgat ctgtgcggtc gacggatcgg 868921 tggtctcggc gctggtcgaa gggcgggacc cgcgtgccgc tgcgctggcg acggtggtcg 868981 acctcatcga cgtgctcgcg cccgtcgacc agcgtccggt gccgttctga agtcggtggg 869041 cagcgacggc gtgacaatgt acccggtggt gaagtcccca tagatcgtga catcggcggg 869101 ccggcgttgg gcgtacaacg ccacgtaggc gcatacgacg gcgtcgatcg gatcctcggc 869161 ggcccgcagg tcgctttttc gctgcgcgac cgtcacctgc cggcgcaacg agacccaatc 869221 cggctgaccg gctacctgca tccgaacccc ggcctgggcg agcccctcga cgccgtccat 869281 cagtcgcaat agctccgatt tgagcaggtc aacgctgcgt cccggcttgg ccttgtactt 869341 cagcgcgcgg ggtagccgaa acagcgccac cgtagccggg tgcggataga cctcgatggc 869401 ccgccgcgtg gcggacgaaa gaggatccat atccagcgcc agttggcggg ccagccgggc 869461 ggcgcgtgga acgtcggcaa actcgggctt ttcggtgttg gccggatacg cgccggcctc 869521 gaattgtcgg aagtctcgat tcagtgcggc ctccgccggc cgctggccgg tgcggttggc 869581 caccaccagc ggcgcgtcga aggcgaccag gcaatcgccc acaacgtagg gccgcagcgc 869641 cgccagcacg gaggcatcgt cgcgagcggc accgaccccc accagacacc cgtccgcgtc 869701 gacagccgcg acaccggtcg gattgcggcc ggcccaggcg aggtccacgc cgacgaagta 869761 catctgccca gggtatggcg gggccgcggc gtatgtgctg tggtgtcaca tccgtcactt 869821 gcgcctctgt cagagggatg cgcgttgtgc ccgtctcata gcgacatcgc ccgggcggca 869881 ccgggaccgg gcgttgccga gttgtcgcga tgagtcgggc acatcgggtg ctccctggcg 869941 ccgggactcg tgtgacaact gcgactacta ggcccgcgac cgtaagctgt gtctttgtga 870001 gggccaagtg agcattccca acgtgctggc cacccgatac gccagcgccg agatggtcgc 870061 gatctggtcg ccggaggcca aggtggtctc ggagcggcgg ttatggctgg ccgtattgcg 870121 ggcacaggca gagctggggg tagcggttgc cgattcggtg ctcgccgact acgaacgtgt 870181 ggtcgacgat gtggacttgg cctcgatctc agcccgggag cgggtgctgc gccacgatgt 870241 caaggcccgc atcgaggaat tcaacgcatt ggccggtcat gagcacgtgc acaaggggat 870301 gaccagccgc gacctgaccg agaacgtgga gcaactgcag attcggcggt cgctggaagt 870361 gattttcgcc catggggtgg cggcggtggc gcggctggcc gagcgggcgg tgagctaccg 870421 tgacctgatc atggccgggc gcagccacaa cgtggccgct caggccacca ccttgggcaa 870481 gcggttcgcc tcggcggccc aagagatgat gatcgcgttg aggcggttga gggagttgat 870541 cgaccgctac cccctgcgtg gcatcaaggg cccgatgggc accggtcagg acatgctcga 870601 tctgctgggc ggtgaccgtg cggcgctggc cgatctcgag cggcgcgtcg ccgacttctt 870661 gggctttgca actgttttca acagcgtggg gcaggtgtat ccgcgttcat tggaccacga 870721 cgtggtttcg gctctggtgc agctcggcgc ggggccgtca tcactggcac acacgattcg 870781 attgatggcc ggccacgagc tcgccaccga gggtttcgcg ccgggtcagg tcggttcgtc 870841 ggcgatgccg cacaagatga acacccgcag ctgcgaacgg gtcaacgggc tgcaggttgt 870901 gctacgcggc tatgcatcca tggtggccga gttagccggt gcacagtgga acgagggtga 870961 tgtgttttgc tccgtggtgc gccgggttgc gttgccggac agcttctttg ccgtcgacgg 871021 gcagatcgag acgtttttga cggtgctgga cgagttcggc gcctacccgg cggtgatcgg 871081 ccgcgagttg gatcgttatc tgccgttcct ggccaccact aaggtgctaa tggcggccgt 871141 gcgcgcgggg atgggtcgcg agtccgcgca ccggttgatc tccgagcacg cggtggcgac 871201 ggcgctggcc atgcgagaac acggcgcgga gcccgacctg ctggaccggt tggccgccga 871261 tccgcggctg acgctgggac gagacgcttt ggaggccgcg ctggccgaca agaaggcatt 871321 tgccggtgcc gcgggtgacc aggtcgatga tgtggtcgcg atggtggacg cgctggtgag 871381 ccgttacccg gacgcggcta aatacacgcc gggtgcaatt ctttagtgtc atgactaccg 871441 ccgccgggct ttcgggcatc gatctgaccg atctggacaa cttcgccgac ggcttccccc 871501 atcacctctt cgccatccac cgtcgtgaag cgccggtgta ttggcatcgg ccgaccgagc 871561 acaccccgga cggggagggc ttctggtcgg tggctaccta cgccgaaacc cttgaggtgt 871621 tacgtgatcc ggtgacctat tcgtcggtca ccgggggcca acgtcggttt gggggcacgg 871681 tgctgcagga tctgccggtc gccggccagg tgctcaacat gatggatgat ccccggcaca 871741 cccgtatccg gcggttggtc agctcgggct tgacaccacg gatgatccgg cgggtcgaag 871801 acgatctgcg ccgccgggcg cgtggattgc tcgatggcgt agaacccgga gcgcctttcg 871861 acttcgtggt cgagatcgct gccgaattgc ccatgcagat gatctgcatt ctgctgggtg 871921 tgccggagac ggatcgacat tggttgttcg aggcggttga gccgggattc gatttccgcg 871981 gctcccgcag ggcgacgatg ccgaggctga acgtcgagga tgccggatcg cggttataca 872041 cctacgcatt ggagctgatc gccggtaaac gcgccgaacc tgccgacgac atgctgtccg 872101 tcgtcgccaa cgctaccatc gacgatccgg acgcgccggc gctgtccgac gccgaactgt 872161 acctgttctt ccatctactg ttcagcgccg gcgcggaaac cacccgtaac tccattgccg 872221 gcgggctgct ggcgctggcc gagaaccctg accaactgca aacgctgcga agcgattttg 872281 agttgttgcc gactgcgatc gaagagatcg tgaggtggac gtcgccgtca ccatcgaagc 872341 ggcgcacggc gtcccgtgcg gtcagcctgg gcggccagcc gatcgaggcg ggtcagaagg 872401 ttgtggtgtg ggagggctcg gccaaccgtg atcccagcgt gttcgaccgc gcggacgagt 872461 tcgatatcac ccgaaaaccc aatccgcacc tgggtttcgg tcagggggtg cactattgcc 872521 tgggcgccaa tctggctcgg ctggaactgc gggtgctgtt cgaggaactc ttgtcccgct 872581 ttggctcagt gcgggtggtg gaacccgcgg aatggacacg tagcaaccgg cataccggca 872641 tccggcacct agtcgttgaa ttgcgcggag gctagtcccc gcgcagcggg attccggcgg 872701 cccgcaactc gagcgcggcc agcgcacgca tggtggcggg atcctctcgt cgccaggcgc 872761 cgaccggatc ggtgctgacg gcggccagtt tgcccggcgg ccggttcgcc aatgcgcgca 872821 gcgccagcag ctggcgaccg gctggggtcg ccgccagggt ggtaacggtc cacttgcgcc 872881 ggcagaaccg cagccgcagg aacagccagg gcatggccac ggcaagaatc ggcgtcgcgg 872941 cgaccgccag cgcgagcact accgcaagcc agccggccgt ggtgtccagg ttgtggccgg 873001 cgccggcgat gtcaagggcg gcctggcttg cggcggtgat ggggttgctg agcgcgtcgc 873061 ccaccaccgg gatacgctgg gcgtcctggc ccgcggccgc caggttgccg gcaatcccgt 873121 gcgagccgat ttcgatttgg cggccggcct cgccgattat cgagatggcg tcgtgcacgg 873181 cgaggccgac gagcatccat agcgtcgtcc acaccgcgac agtgatatcg ctgatcagtt 873241 gggccagcag tcggccgggc gtggtggcat acggcaagaa gcgcgatctc ataccagaga 873301 taccagcaca gggcgccgtc gtgcggcgga taggctggcg cgatgcgccc cgcattgtcc 873361 gactaccagc atgtggccag cggtaaggtc cgcgagatct accgtgtcga tgacgagcac 873421 ctgctgctgg ttgccagcga ccggatctcg gcgtacgact acgtcctgga cagcaccatc 873481 ccggacaagg gccgcgtcct gaccgccatg agcgcattct tcttcgggct cgtcgatgcc 873541 cctaaccatc tggccgggcc gccggacgac ccgcgtatcc ccgacgaggt gctgggccgc 873601 gcgctggtgg tgcgtcggct ggagatgctg ccggtggaat gtgtggcccg tggctacctg 873661 accggttcgg ggttactgga ttaccaggca accgggaagg tatgcggtat cgcgctgccg 873721 ccgggcctgg tcgaggccag tcggttcgcc acaccgctgt tcaccccggc gactaaagcc 873781 gcgttggggg accacgacga gaacatctcg tttgaccggg tggtggagat ggtaggcgcg 873841 ttgcgtgcca accagctgcg tgatcgtact ctgcagacgt atgtgcaggc cgccgatcac 873901 gctctcaccc gcggaatcat tatcgccgac accaagtttg aatttggcat cgaccgccac 873961 ggcaacctgc tgctggccga cgaaatcttc acaccggact cgtcgcggta ctggcctgcc 874021 gacgactacc gggccggcgt ggtccagacc agcttcgaca aacagtttgt ccgcagctgg 874081 ctcaccggct ccgagtccgg ctgggataga ggcagcgatc ggccgccgcc tccgctcccc 874141 gagcatatcg tcgaggccac gcgtgcccgt tatattaatg catacgaacg gatttccgaa 874201 ctaaaattcg acgactggat cggccctggc gcatgatgca ccgaaccgca ctaccctcac 874261 cgcccgtggc caagcgggtg cagacccgcc gggagcacca cggcgacgtc tttgtcgacc 874321 catatgaatg gttgcgcgac aaggacagcc ctgaagtaat cgcctacctc gaagctgaaa 874381 acgactacac cgaacggacc accgcgcacc ttgagccatt gcggcaaaag atcttccacg 874441 aaatcaaagc gcgtaccaag gaaaccgact tatcggtgcc gacgcgacgt ggcaactggt 874501 ggtactacgc gcggaccttt gagggaaagc agtatggcgt acactgtcgt tgcccggtaa 874561 ccgatcccga cgactggaac ccaccagagt tcgacgagcg caccgaaata cccggtgaac 874621 agcttctgct cgacgagaac gtggaagctg acggccacga cttcttcgca ctgggcgcgg 874681 ccagcgtcag cctggacgat aacctcttag cgtattccgt tgatgtcgta ggtgacgaac 874741 gatatacctt gcggttcaag gatttacgca ccggagaaca gtacccggac gagatcgccg 874801 ggatcggagc gggagtcacc tgggcagctg acaaccactg tctactacac caccgtggac 874861 gcggcctggc gtccggacac agtgtggcga taccgactag ggtccggcga atcgtcggag 874921 cgggtttacc acgaagccga tgatcggttc tggctcgcgg tggggcgtac tcgcagcaac 874981 gcctatctgc tgattgcggc ggggtcgtcc atcacttcgg aggtccgtta cgcgcacgcg 875041 gcagatccga cagcgcagtt cagcgtggtg ctgccgcgcc gcgacggcgt cgagtactcg 875101 gtggagcatg cggtcatagc tggccaggac cggtttctga tcctgcacaa cgacggcgcg 875161 gtgaacttca cactggtaga ggccccggtc gaggatcctg cgcggcaacg caccctcatc 875221 gcccaccgcg acgacgtccg actcgacgcg gtggatgcct tggccggcca tctggtagtc 875281 agctatcggc gcgaggcgct gccgcgggtt caactgtggc cgatcgggcc tgacggaaac 875341 tatggtgagc ccgaagagat ctcgttcgac tccgagctga tgtcggccgg actggggccc 875401 aaccccaact gggattcgcc caaactgcgg gtcggtgccg gatctttcgt caccccggtg 875461 cggatctacg acatcgacct ggtcactggc gagcgtacct tgctgaaaga acagcccgta 875521 ctgggcggct accgccgcga agactatgtg gagcggcgtg actgggcgta cggagacgac 875581 ggcacccgga tcccggtctc gatagtgcac cgagccgata tcgaattccc ggcacctgcg 875641 ttgatctatg gctacggcgc ctacgagatc tgtgaggatc cgcggttttc catcgctcgg 875701 ttgtcgctgc tggatcgcgg gatggtgttc gtcgtcgccc acgttcgcgg cggcggtgag 875761 atgggcaggc tgtggtatga aaacggcaag ctactggaca agaagaacac gttcaccgac 875821 ttcatcgcgg tggcaagaca tctggtggac acgggactta cttcccagca gcagctggtg 875881 gcattggggg gtagcgcggg cggtctgctg atgggcgcgg tggccaacat ggcaccggat 875941 ctcttcgccg gaatccttgc gcaggtgccg ttcgtggacc cgctgaccac catcttggat 876001 ccatcgttgc cgctgaccgt caccgagtgg gacgaatggg gaaatccgtt gaacgacagc 876061 gatgtctatg cctatgtgaa atcgtattcg ccgtacgaga acgtcacggc ccaaaagtac 876121 ccggccatcc tggcaatgac gtcgctgaac gacaccaggg tctattacgt ggagccggcc 876181 aagtgggtgg ccgcgttgcg gcacgccaag accgacggca attccgtgct gttgaagacc 876241 cagatgcacg ccggtcatgg tgggatcagt ggccgctacg agcgctggaa ggagaccgcg 876301 tttcaatacg ggtggttgct agctactgcc gacagcgacc gttacggcgg cggccaggga 876361 aacgacctcg atggcgctgc gccagcatag ccggtgggat cggccattcg ggatgcgtag 876421 acattggctc cgaacatggc cagcatcagc gccagcgagc ataccgccgc tgccatgcgg 876481 gtgtcgggca gcaacaggcc cacggcgacc aggagcctcc agcgcaccgg tgatggtgac 876541 cagcaggccg ggcgcaagca gcccgggtga aacgatggcg atgaggtggc cgcgcagggg 876601 cggcgtgaag tgagctcggc tgctcgacgg ctccgattcc gaactggtcg acgccgagac 876661 cgccgctgcc gccgagctgg cgcgcggggt ggcggcgctg cgcgatccca acgcccgggc 876721 gaatccggcg ggtgccgagc tggcgacctg gtcgctggtg cacggctttt cgacgctgtg 876781 gctcgacgat gcggtcaacg ctgacgtgaa gcagacgtca tgcggatagc aacggtgctc 876841 ttcgatgact agcctgctgt ttcggcagga atgccgcggg gatcagcgtc gagaccacta 876901 gcgcggtcgc tatcacgaat accaccgcgt aggcgtgcga aaggtcatgc agcagttggg 876961 ccgcgaagtt ggtttggcgc ggtagcgagg aagggtcaac cgccgccccc cgcccggcgc 877021 cactctctgg ggtcagtgcg actttctttg cagtagcgat gatttcgctg tgattgaact 877081 ggtaggtgag cagcaccgac atcagtgcgg tccctatcga accgcccacc tgctggttga 877141 cgctgatcag cgtcgaaccg cgagcgatct gatgtggggc cagggtctgc actgccgccc 877201 cggacagtgg catcatggag cagcccatgc ccatgcccat gattgccagc ccggtcggca 877261 gaatgggtaa gtagtccgct tgccgcgcga caccaaaggc gaaggtgccc aaccccgcag 877321 cgatcagcat gatcccaacc agcacgatct tggccggtcc ccgtcggtcc atcatcgctc 877381 cggcgatcgg catcgccagc atggcaccga ggccctgtgg gatgatatgc acccccgatt 877441 gcatcggtga ttggtgcaac acttgctgga ggtagctcgg gagcagcaag aaggagccaa 877501 acagcccgag ggagagcacc gtcatcgtca tgttggcctg cgcgaccgct cggttctgga 877561 acaagcgcat gtctatgagc ggatgttctg tgcggtacca cgaatgtgcg acgaatgccg 877621 cgatcaacgc caggccggtg atcgccggta tcaacacgtg ccgatcggcc atcgttccac 877681 gggcggggct agatgacacc ccgaacagga aggtcgccag gcccggcgac agcaacaaga 877741 ggcccatgta gtcgaagttt tccgacgctg ccgggcgatc tcttgggaac acgatcgccg 877801 ccaagacgag cgcggacagc ccgaccggca ggttgaccaa gaaaatccaa cgccagccgt 877861 aggccccgat gagccaacca cccaggatcg gcccaccgac cgggccgagc agcatcggaa 877921 tgcccaccac cgccatcacg cgccccagcc gcttcgggcc cgcctcacgg gccaagatgg 877981 caaaggacac cggcgtcagc atgcccccac cgaaaccctg gacaacacga aatatgatga 878041 gcagcaagat gtttggtgct actgcgcaca gcagtgagcc gagggtgaac gccaataccg 878101 aacccatgaa aagccgcctg gtgccgaacc ggtcggccgc ccaaccggct gtcgggatca 878161 cagtggccaa cgcgagcatg tagccggtca tggtccaggc cacgacggcc tgggtggacc 878221 cgaaatcggc aacgaaggtg cgttgcgcga cgctgaccac ggtgacgtcc acatgtgcca 878281 tcaccgaggc caggacacac actccggcgg tccgaagcaa ccccacatcg agcctatcgg 878341 gatagctgcg ttggccagag cggggccgcc ccgcgggggt gatgggcacc ggggcatcgc 878401 cttccgcggg acacgcttca accatggcgt tgccgagcat atcgataccg gtcacgggta 878461 ccgcgcgagg atgtcgggcg gtgcttggtt ccggcgtcgg gtcatggccc tggcgccgag 878521 ccgacgtgcg ctcgttctgc gctggtcagg gtccagatat acgcctgctg tccgcgtgtc 878581 cttcaccgtc cggaaacctg gaatcggcag actgcaagcg tgtctggaaa actgctcgtg 878641 tcggtctcgg ggataggtga gagcaccctg gccgatgtcg acgcgttctg cgcggaaatg 878701 gacgcccgct cggtgccggt atcgttgctg gtggctccgc gtatgcgcga tgactaccgg 878761 ctcgaccgcg acccacgcac cgtcgactgg ctgaccggtc gccgggccgc cggcgacgct 878821 ctggtactgc atggctacga cgaagcggcc accaagaggc ggcgcggcga attcgcaatg 878881 ctgcgcgcac acgaggccaa cctgcggctg atggccgccg accgggtgct cgaacacctt 878941 gggctgcgaa cccgactgtt tgcggcaccg ggctggctgg tatcaccagg tgtccgtaca 879001 gcgttgccgg ccaatggatt tcggctgctt gcggatctcc atggaatcac ggatctggtt 879061 cggctcacca ccgtgcgtgc ccgcgtgctg ggcatcggcg agggtttcct ggcggagccc 879121 tggtggtgcc ggatggtggt gatgtcggcc gagcggatcg cccggcgtgg gggcgtcgtc 879181 cggattgcgg tggccgcccg tcatttgcgc aagtccggtc cgctgcaggc gatgctcgat 879241 gccgtcgacc tggcgatgct gcaggggtgc acaccgatgg tgtaccggtg gcgagccgat 879301 gcggcggtac tcgacgcggc ctgaccgagc gcctgatcgg tggcgttaac ctgtaccgac 879361 atgagcgatg ctgtagccgg ttcagatgcc gaggggctca ccgctgatgc cattgtcgtg 879421 ggagccggat tagcgggcct ggtagccgct tgtgagttgg ccgaccgcgg cctacgggtg 879481 ctgatcctcg accaggagaa tcgggccaac gtgggcgggc aggccttctg gtcgttcggc 879541 ggtttgttct tggtcaacag tcccgagcag cgccgcttgg gcatccgtga tagccatgag 879601 cttgctctgc aggattggct ggggacggcg gcgttcgacc ggcccgagga ctactggccc 879661 gaacaatggg cgcatgctta cgtcgatttc gcggcggggg agaagcgcag ctggctgcgg 879721 gcccgcgggc tgaagatctt tccgctggtg ggctgggccg agcgtggtgg ttacgacgcg 879781 caggggcacg gcaactcggt gccccgtttc cacatcacct ggggtactgg gccggctctg 879841 gtcgacatat tcgtgcgtca gctgcgtgat cgccccacgg tgcgctttgc gcaccgccac 879901 caggtcgaca aactgatcgt cgagggtaac gcggtgacag gcgttcgggg taccgtgctg 879961 gagccctcgg atgagccgcg cggcgcgcct tcgtcgcgaa agtctgtggg gaaattcgag 880021 tttcgcgcgt cagcggtgat cgtcgccagt ggtggtatcg gtggcaatca tgagctggtg 880081 cgcaaaaact ggccgagacg gatgggccgc attcccaagc aactgttgag cggggtgccc 880141 gcgcacgttg atggcaggat gatcggcatc gctcaaaagg ccggggctgc ggtgatcaat 880201 ccggaccgga tgtggcatta caccgaaggc attaccaact acgacccgat ctggccgcgg 880261 cacggtatcc ggattattcc ggggccgtcg tcgctatggc tggatgccgc gggcaagcgg 880321 ttgccggtac cgttgtttcc cgggttcgac accctcggca cattggagta catcaccaag 880381 tctggacatg actacacctg gttcgtgttg aatgccaaga taatcgagaa ggaattcgcg 880441 ctgtccggtc aggagcagaa ccctgacttg accggtcggc gcctgggcca gctgttgcgc 880501 tctcgggctc acgccggccc gcccggaccg gtgcaggcat tcatcgatcg tggtgtggac 880561 tgcgtccacg cgaactcgtt gcgcgagttg gtggccgcga tgaacgagtt gcccgatgtg 880621 gtgccgctgg actacgagac ggtggcagcc gcggtcactg cgcgcgatcg tgaggtggtc 880681 aataagtaca gcaaggatgg acagatcacc gcgattcgtg ccgctcgccg ctaccgaggc 880741 gaccgatttg gccgggtggt ggcgccacat cggttgaccg atccgaaggc cgggccgctg 880801 atcgcggtca agctgcacat cctgactcga aagacgttgg gtggcatcga aactgactta 880861 gatgctcggg tgctcaaggc cgacggtacg ccactggccg ggttgtatgc agccggcgag 880921 gtcgccgggt tcggcggggg cggtgtccat ggctaccggg ccttggaggg caccttcctg 880981 ggtggatgca tattttccgg ccgcgctgcc ggccgcgggg ccgccgagga tatccgctag 881041 ttgtggccgc ttgacatagg agctattgct cgcgctagaa ggtgaccgcg ctttcctcgg 881101 gcaacacctg aaagtcggtg gtggtcatct cggtgagccg gccgtagtag atacccctgg 881161 cgtccggagc gacgatggct tggtggatgg gtaccgctcg tgccggagct acggcccgca 881221 ggtagtcgac cgcctcggag atcttcatcc atggggccgc ggcgggagtg gccagtacgt 881281 ccacctgctc gccgggaacg aacaacgcgt caccgggatg catcagtctt gcccgatgtt 881341 tactgtcgcc caccagatac gaaatgttct ctatcacagg gatttccggg tggatcaccg 881401 cgtggcaacc gccgaccgca cggacggtca gctccgctaa cggcagctcg tcgccaacgt 881461 gcaccgcccg ccatggctcg cccagctgcg ccgccgtctg cggatcggcg tacagctcgg 881521 cagccgggtt gtcctcgagc agggtcggca gccgcgtgac gtctatgtga tcggggtgct 881581 ggtgggtgat caagatcgcg gacaaaccgg tgattccctc gaagccgtgc gagaaagtac 881641 cgggatcgaa gagcaggcgg gtttgaccga actcagcgag gaggcaggaa tggccgaaat 881701 gcgtgagttg catgtttacg attgtgccct tatgggggcg tttccgatgc ggttgatcct 881761 ggcgacgatg ctggtcgccg gtcgcttgtt ggcgacgctc atggccgcgc ctagcgccca 881821 ggctgagccg gaaacctgcc cgccgatatg cgaccagatt cctgctaccg cgtggatcag 881881 cacccacgcc gtgccgttga actcgcaata ccgttggccg gcaatggccg gcgcggcagt 881941 ggcggtgacc agggcgacac cacgtttcgg gttcgagcag gtgtgcgcca cgccggcgtt 882001 cccgcacgac agccgcgatt gggcggtcgc gggccgggtc acggtggtcc accccgacgg 882061 ccagtggcag ttgcaggctc aggtgctgca ctggcgcggg gacaccgccc gcggtggcca 882121 gatcgcggcg tcggtgtttg gcaccgccgt cgccgcgtta cgcgcctgcc agctgggcgc 882181 accgctgcag tcgccgtcgg tcaccgacga cgaaccgacc cggatggccg cggtgatcag 882241 cgggccggtc atcatgtaca cctacctggt cgcgcacgta tcaagcagca cgatcagcga 882301 actcaccttg tggtcgtccg ggccgccaca agttccgtgg cctacggttg cggactccgc 882361 ggttctggac gccctgaccg cgccgttatg cgaagcctac atcggctcgt gcccgtgacc 882421 aggcggggca cctgccgccg gtagagttgg cgcgggaatc attgcccggc tcctggcggc 882481 cgctgtcgcc gggcgcggcg ggcagatctg aggaggagcg ccggtggcca gggtggtcgt 882541 gcatgtgatg cccaaggcgg agattcttga cccgcagggc caggcgattg tcggtgcgct 882601 ggggcggctt gggcatctcg gaatatcaga tgtgcgtcag ggcaagaggt ttgagctgga 882661 ggtcgacgat acggttgatg acaccacgct tgccgagatc gcagaatcac tgttggccaa 882721 caccgtgatc gaggactgga cgatcagccg ggacccgcag tgacggcgcg catcggtgtc 882781 gtcacgtttc ccggcacgct cgacgacgtc gacgccgcgc gcgcggcgcg gcaggtgggc 882841 gccgaggtgg tcagcctgtg gcatgccgac gccgacctta agggtgtcga cgccgtagtg 882901 gtgcccggcg gattttccta cggtgactac ctccgggccg gagcgatcgc cagattcgct 882961 ccggtgatgg acgaagtggt agctgccgcg gaccgcggca tgccggtgtt ggggatttgc 883021 aacggctttc aggtgctgtg tgaggccggg ctactacctg gtgccctgac ccgcaacgtg 883081 ggattgcact tcatctgccg ggatgtgtgg ctgcgggtag cgtcgacgtc gacggcgtgg 883141 acatcgcgtt tcgagcctga cgccgacctg ttggttccgc tgaagtccgg cgagggccgt 883201 tacgtggcgc cggagaaggt gcttgacgaa ctagaaggcg aaggccgggt ggtgttccgc 883261 taccatgaca acgtcaacgg ctcgctgcgc gacatcgccg gcatctgctc agccaacggc 883321 cgtgtcgtcg gcctgatgcc gcaccccgaa catgcgattg aagcgttgac cgggccgtcc 883381 gacgacggac tgggtctgtt ctattcagcg ctggatgccg ttctgacggg ctgaggtcac 883441 ccgctcacgc tcacccggcg tctcgcagca acggcggcgt cgcggttgga ggtaatccgg 883501 ctgccgtcag ctgaccgaag agctccgtcg cggccgagac ggcgttgtcg acgaaggtgg 883561 cgaaatcgtc gaaccggatg cggtccctga tcaaggaccg ctctgcggcc acgccgatgc 883621 ggtgcggatc agaggacccg tgcacgatcg cggtgacctc gtggttctgc aggttccacg 883681 cgttgacgat ctccgccaac cgggtgtggt cggtggcggg gaagaagtat gcgggactga 883741 ccctgatcgt gaacacgtcg cggtaggcgg gagagatttc taggtggacg tgcagccgca 883801 ggtgggcgtt ggcgacgaag aagaactcgg cgtcgtggtg gccacggaag tatcgccggc 883861 cgcgggcgcg caggtagcgc tcgatcaggt tggtgctcag cggctcgcct atcgactcag 883921 tcatgaactc atgatgcggc cggcgccttg gtgaatcctt tgagctggga acccggttgc 883981 gaagaacaag atgagaattc cctgagcgac gcggggcagc ccggccactg tgaatggcac 884041 gacgcgacac gcggcggagg cgtcgtgaga ttcacagtcg gtgggttgcg tcggccaatt 884101 caaccggggg gccggtccac agttcctcgt cagcggctac caaggcgtgt acttcggtgg 884161 actgcaacgc cttcaggacc gactgagcga ttcgttcgta ccattgcgcg accgcgctgg 884221 gatagtcggc gtagctgccg ttttcccgca ggatggttcc ttccaccgtc gggatcgggg 884281 tagctccgtc gaattcttgc cggtagggct tgccgaggcg ggcggcggtg ggtgcgtcga 884341 tggtggcgtc cagcttgacc catcgccgac caagatatgc ctcacccagc gagtgccacg 884401 ggaagggccg gccagttcgg cctccccata gggcacgtac ctgcggggac agaaactcct 884461 tatcgggggc gtcgatcgtc tggaacgcga tacgggccgg gacaccggcg gctcggcaca 884521 gggcgacgaa ggaacttgcc ttgcccatgc agaaggcgac cccgtggccg atcacgtcgc 884581 tggcgcggtg atgtccctgc gcgaggtagc gaaaggacgc gaggacgtcg tatggcacgt 884641 cgcgcacgta gtagtagatc cgcctgaccc gctcggtatc cgacaccgcg tcccggatga 884701 gggttgctgc cgtcgtacga acgagcggat ggcccgcgtc gaggtactcc gtgggcgtca 884761 gaaagtggtc catgccggtt ccattgttgg ctagcgtcat ggaatcgtga cctcagtttt 884821 gacccgcgga atgatgtcac tgccgatgat gtgcagataa tcggacttca cgacgtgtgg 884881 aatcgtgaag agaaaatggc cgacaccgcg gtcctggtat tcacgaatgc gctcgacaca 884941 cctgtcgggt gtcccgacga tgagccccgg ctcggggatg gacgcgaatt cttcgcggat 885001 ccggacttct tcctcgccgg actgggtggg tgccagcagc agcgtgaccg acagtcgcag 885061 cgtgtcgggg tcacgcccgg ccgcctccga cgcctgggtg agaaatccgc ggcgttgggt 885121 gacttgctgc ggcgaccacc agcgcacgtt caggccctgg gcatgcttag cggcgatgcg 885181 ctggacccgg tcgccttccc cgccgatcca caacggagga tgtggccgtt gcaccggcgg 885241 cggatcgcag gtggcgccgt ccaaggtgta aaaccggccg gcgtaggtgg ggtttggctc 885301 ggtccacacg gccttgatga cctgcagcga ctcggcaagc gcggagactc ggtcgccaac 885361 cggcgggaac gggatgccgt aggcttgcga ctcgcgccga aaccagccgg cgcccaatcc 885421 cagatcgaga cgtccctggg aaatgacgtc cagcgtcgca gccatcttgg ccagcacgga 885481 aggatgacgg taggaattgc acagcacgct ggtgcccaac cgcagcttcg tggtgtcgcg 885541 ggacaatgcc gcaagtgcgg tccagcactc gagcaggggc agcgacctcg aaggggcgca 885601 ctggcccgcc ccgccggttt cggtgccggt cgcggagccg gtgtcggcgg cgatgccggc 885661 gaccttcgca tactcgccgg ggcttatcgt caggaagtgg tcgcataacc acactgaatc 885721 gaatccgtat tcttccgccg tctgcgagac gacaaccatt tcgcggtaac tgccgaccgc 885781 caggccatta accgtcgcag ccaacatgag tccgaagtgc gggtcgtctt tggcgttcat 885841 gcgaaatctc gtttctcgat aattccggca cctgatccgg gcaacgttcg gggtaacgtg 885901 acggagaact ggtaccgctc ggggcgatgg tggaacacga ccacttcaag gggcttgccg 885961 tcattggtgt agctggtgcg gtcgacgacc agtaccggcg aacccaccgc cagacccaac 886021 gcgtcggcta cgtcggggga ggccccggcg gcatggattt cgtgggtagc ctgtgcaatg 886081 cgtacaccca gtcgccgctc ccacatcgca tatgtggttt cggtgtccgc gctgcccgat 886141 agcaacggct cgacggctgg gcccacgccg ggcggaagat aggccgtgac cagggccaag 886201 ggttgatcgc cagtgcggat gcgccggcga atacagagga cctcaaccaa acccagcgtc 886261 tcggaaatcc gttgcggcgc cggtccggtc tggtgtgaca gcacgtcgac ctgcggggta 886321 acaccacagc tcaacaacac ctctgtgatg gtgcgcacgc cgcaactgag ctcctgttcc 886381 accggatcgg cgacgaaggt acccaagcct tgccggcgca ctagccatcc ctgacgttgc 886441 agcatgccga ccgccgcgcg cacggtcacg cggctcaaac cggaacggtc gatcaattct 886501 cgttcgctgg gcaagcgccc gccgcgcggc agccgctgct ggatgatctg ggcctttagc 886561 gcctcggcaa gctgggtact cgccggcacg ctgccacgcg atatccgcag atcggcagcg 886621 tccaggtcca gcttgacaga tgtcataaga cgtattaaaa cgtcttatac tcaccacgtc 886681 aagcgtgcgt gcgcggtagc agcggaagaa ggtcagccat gacgtcaccc gtcgcggtca 886741 tcgcccggtt catgccacgg cctgacgcta ggtcggccct gcgcgctctc ttggacgcaa 886801 tgattacccc gacacgggcc gaggacggat gccgtagcta cgacctctac gagagcgccg 886861 acggcggcga gctggtgctt ttcgaacggt accgcagccg catcgcgctc gacgagcacc 886921 gcggttcgcc gcactatctg aactaccggg cacaggtcgg tgaattgctg acccggcccg 886981 tcgcggtgac tgtgctcgcg ccgctcgacg aggcttctgc ttagagcggg tagcacccag 887041 gcagcttgat ccacgcccgg caccggccga gcgctcggga accgccgcag accaccgcag 887101 tccccccgtg ggttcagcgg cgcggcggcg ggttggctat accagcaggt aaaacgaatc 887161 tcggtaggat tcaagaagtc tcagccacag ttcgctgatg gtcgggaagc acggaacggc 887221 gtgccacaac cgatcgattg gcacctggcc ggcgacggcg acggtggccg aatgcaacag 887281 ctcggcggcg cccgggccaa ccatggtcac gcccagcaga tggccccgat cgacgtcgac 887341 caccatgcgc gccctgccgg tgtatccgtc ggcaaagagc ttggctccca taacgacatc 887401 gccgatttcg acatcgatcg ctttgatccg gtgaccagcc tgtgcggcct gatcagctgt 887461 caggccgacc gctgcggctt cggggtcggt aaagaatgcc tgcggcaccg cgtgatggtc 887521 ggcggtggtc gcgtgcatgc cccacgacgt ggtgtctagc ggtcgtccgg cggcacgggc 887581 gccgatcgcg gtgccggcga tccgcgcctg gtatttgcct tggtgggtca gcaacgcgcg 887641 atggttgacg tcgccggcgg catagagcca gccgtcgtca acagcccgca ctcggcaggt 887701 gtcatcgacg tccagccagc tgcccggcgt cagtcctatt gtctccaagc cgatgtcgtc 887761 ggttcgcggt gctcggccgg tggcgaagag tacctcgtcg acccgcagct cggtaccgtc 887821 gtccagctcg aggaccactg ggccagttgg gttggggcgg cccagcgcgc gtaccgatac 887881 tcccacgcgc acgtcaacgc cggcgtcggc cagtccgcga ccgatgagtt cccccacaaa 887941 cggttccatt cggggcagca ggccagatcc ccgagccagc agggtcaccg aggcgcccag 888001 tccctgccag gcggtcgcca tctccacacc gacgccgccg gcgccgacga tcgcaagccg 888061 gtcggggacc gtactgttgt cggtggcttg gcgattggtc catggccggg cttcggtgat 888121 gccaggaagg tcggggagtg ctggccggct tccggtgcag atgacaacgg catgccgggc 888181 ggtcagcgcc acgctttcgc cgctcgactt ggtgacgacg acgcggcgcg gaccgtccaa 888241 tcgcccgtca ccgcgtatca gcgtcgcgcc gattccactc acccagtcgg cctggccggt 888301 gtcgtcccag tgggccacat agcggttgcg gcggccaaag acgccggctg tgttgatcga 888361 gccgtcgact gcttcgcgcg cgccgtcgac ccgtcgggcg tcagagatcg cgatgaccgg 888421 acgcagcaag gctttgctgg gcacacaggc ccaataggag cattcacccc cgacgagttc 888481 gcgctccacc accgcgacac gcaggccccc cgcgcgggca cgatcggcga cgttctgtcc 888541 aacgggtccc gcgccgagca cgacgacgtc atacgtttca ccctcacggc agccgggtgt 888601 tgccattggc gcctggtcct gttgggccgc ggtcataatc aaagatcctt tcgtcggact 888661 ctgccagcga cgctacgcgc gcctagcgcc ggtgagccgt gccggcctat cgcccaccag 888721 acgcaaaagc tctcgacacg ccgtgcgaaa agggaccttt atgtctcagt gtcggtgttg 888781 tgtgtgccgc gaggtgggtg tgtcggtgtg acagacgccg tgtcgcggtg gtttgttccg 888841 gatcacctgg tgtctggctc actttgcgtc tgccgtcctc ttggggttgg cgttgagcag 888901 tattgccggc actaggtgag aaggaccggc cggcgtgact tgataggagc gtggctttcg 888961 ccccgactga gatgtgtccg ccgaccggcc caacctcaac accccctcaa gtgaaggagg 889021 tgaaccgccc cggcatgtcc ggagactcca gttcttggaa aggatggggt catgtcaggt 889081 ggttcatcga ggaggtaccc gccggagctg cgtgagcggg cggtgcggat ggtcgcagag 889141 atccgcggtc agcacgattc ggagtgggca gcgatcagtg aggtcgcccg tctacttggt 889201 gttggctgcg cggagacggt gcgtaagtgg gtgcgccagg cgcaggtcga tgccggcgca 889261 cggcccggga ccacgaccga agaatccgct gagctgaagc gcttgcggcg ggacaacgcc 889321 gaattgcgaa gggcgaacgc gattttaaag accgcgtcgg ctttcttcgc ggccgagctc 889381 gaccggccag cacgctaatt acccggttca tcgccgatca tcagggccac cgcgagggcc 889441 ccgatggttt gcggtggggt gtcgagtcga tctgcacaca gctgaccgag ctgggtgtgc 889501 cgatcgcccc atcgacctac tacgaccaca tcaaccggga gcccagccgc cgcgagctgc 889561 gcgatggcga actcaaggag cacatcagcc gcgtccacgc cgccaactac ggtgtttacg 889621 gtgcccgcaa agtgtggcta accctgaacc gtgagggcat cgaggtggcc agatgcaccg 889681 tcgaacggct gatgaccaaa ctcggcctgt ccgggaccac ccgcggcaaa gcccgcagga 889741 ccacgatcgc tgatccggcc acagcccgtc ccgccgatct cgtccagcgc cgcttcggac 889801 caccagcacc taaccggctg tgggtagcag acctcaccta tgtgtcgacc tgggcagggt 889861 tcgcctacgt ggcctttgtc accgacgcct acgctcgcag gatcctgggc tggcgggtcg 889921 cttccacgat ggccacctcc atggtcctcg acgcgatcga gcaagccatc tggacccgcc 889981 aacaagaagg cgtactcgac ctgaaagacg ttatccacca tacggatagg ggatctcagt 890041 acacatcgat ccggttcagc gagcggctcg ccgaggcagg catccaaccg tcggtcggag 890101 cggtcggaag ctcctatgac aatgcactag ccgagacgat caacggccta tacaagaccg 890161 agctgatcaa acccggcaag ccctggcggt ccatcgagga tgtcgagttg gccaccgcgc 890221 gctgggtcga ctggttcaac catcgccgcc tctaccagta ctgcggcgac gtcccgccgg 890281 tcgaactcga ggctgcctac tacgctcaac gccagagacc agccgccggc tgaggtctca 890341 gatcagagag tctccggact caccggggcg gttcagaggc aaccaccatg gttgttgttg 890401 gaaccgatgc gcacaagtac agccacacct ttgtggccac cgacgaagtg ggtcgccaac 890461 tcggtgagaa gaccgtcaag gccaccacgg ccgggcacgc cacagccatc atgtgggccc 890521 gtgaacagtt cggcctcgag ctgatctggg gcatcgagga ctgccgcaac atgtcggcgc 890581 gtctggagcg tgacctactg gcggccggcc agcaggtggt gcgggtaccc accaagctga 890641 tggcccagac ccgcaagtcg gcgcgcagtc ggggcaagtc ggatccgatc gatgcgctgg 890701 cggtggcgcg ggcggtgatg cgtgaaaccg acctacccct ggccacccac gacgagacgt 890761 cgcgggagtt gaagttgttg actgaccgtc gagatgtcct tgtggcccaa cgcacgtcgg 890821 cgatcaaccg gttgcgctgg ctcgtccatg aactcgatcc cgagcgggca ccggcagcac 890881 gctcgctcga tgccgccaag caccagcagg ccctgcggac ctggctggac acccagccag 890941 gattggtcgc cgaactcgcg cgcgccgagc tgaccgacat catccggctc accggcgaga 891001 tcaacaccct agcccagcgc atcagcgccc gagtccacca ggtcgccccc gcactgctgg 891061 aaatccctgg ctgcgcggag ctgactgcag ccaaaatcgt cggcgaagcc gccggagtga 891121 cccggttcaa aagcgaagcc gccttcgcct gccatgccgc agtggctccc atcccggtgt 891181 ggtcgggcaa caccgccggc cagatgcggc tcagccgctc gggcaaccgc cagctcaacg 891241 ccgccctaca ccgcatcgca ctgacccaaa tccggatgac cgacagccgg ggccaggcct 891301 actaccaaag gctgcaagac gccgggaaaa ccaaacgcgc agcactacgc tgcctcaaac 891361 gccgcctagc ccgcaccgtc ttccaggccc tgcgcaccgt ccaccagccc agctccgaac 891421 acacccaacc cgcggccgct tgccatagga gctattgctc gcgctcgtgc cttagtggct 891481 gagcgcgacc gacgcctcgg cggtgtagca aaggaacgtc agcgtctcct gcaggtagag 891541 gcgcacggtg tccgtgtcgt ggctggcgta cccgattgca acgtcggtgc ccagctgtag 891601 gtcgaagtcg ccgcctcgag tggtcagcac gaacgcgccg tcgatggccg gggcccaaat 891661 gatgtccccg tccaccagcc ggttcagatg ctcacggatg ggatagccgt gatcggaagt 891721 ctcgctaacc ttggtgtaga cgtcagcaga gagcaacacc gaatacggtc cgtccacacc 891781 ggccaaccgc agttcggaca atgcctggga gatgacatca gggatttcac ggggatcctc 891841 gggcaacgtc agcgccgggt tcgaactcgc gctgcggatc ccttcgattg atgcggcgct 891901 gtagccttcg aatattgtgc ggtcctcgac gaaggccagc ttcttggccg cctcctttac 891961 cggttcccaa tcggagtcct tagagccacg ttccacgtcg tcgatctcgt tgcgcgacag 892021 ggtaaacgga acccgtagcc ggacaagggg tttgctggcc cgcaggtggg cgatcacgcc 892081 gttggttggt gccttaacat cgatcagccg gccggtgctg accgccgcgg tgacgggccc 892141 cccgggatca ctgacatcga ccacccggcg cccggcgatg tgtcgcttga acgtccgcgc 892201 cgcctccaat tcgatttccg cccaagcggc ttcggtgacc ggtgccaaat cgcggtagag 892261 attgttcatc gggggcttcc tttcaagctg ccgatcgata gcgacccggc tgccagagtt 892321 ggcgtcgccg cctgcggtag gggcggtgga tggtcgagaa agtcgatggt gggtgagaag 892381 aacagtccgc cggtcaccgc ggtggaaaag tcaagcactc gatcggtgtt gcctgccgga 892441 tcgccgagaa acatgttgcg cagcatctgc tcggtcaccg ttggcgtgcg cgaatatccg 892501 atgaagtaag tgccgtactc gcccttgccg acttcgccga acggcatgtt gtgtcgcacg 892561 atcttgcgct cggtgccgtc gtcgtcggtg atgacgttga gcgctacgtg tgaattggct 892621 ggcttcgcgt tgtcgtcgag ttcgatgtcg tcgagcttgg tccggccgat cacacgctcc 892681 tgctcggtga ccgagaggga ttcccacgag gccatatcgt gcacatactt ctgcacgtgc 892741 acataacacg agccggcgaa atttcgatcc tcgtcaccga tcgtggtggc cttgatggcg 892801 attgggccac ttgggttttc ggtgccatcg acaaagccca gcagatcacg gttgtcgaaa 892861 aaccggaagc cgtgcacttc gtcgacaacg gtcaccgcat cgcccatcga cttgagaatg 892921 cggccagcca actcgaagca cacgtccatg gtctcggccc ggatgtggaa caacagatcg 892981 ccgggagttg ccggggcggt atgccgtggt ccggtcagct cgacgaacgg atgcagctcg 893041 gtgggtcgag gtccggcgaa caagcggtcc caggcgtcgg acccgatcga gacgaccacg 893101 gacaagtgtt tggtcgggtc acggaagccg atcgcacgca ccaggccgga gatcttcgac 893161 agtgcgtcgt gcaccgtcgc ctcgccgtcg gcgccgatgg tggcgaccag gaagatcgcg 893221 gccggagtca acggcgccag aatcggctgc ggagagacag caggcacagc cacgacccta 893281 acgtccctgc aataccggtg atgctagaca tggctacatg gcggccacgg cacacggcct 893341 gtgcgaattc atcgacgcgt ccccgtcgcc gtttcacgtc tgcgcgacgg tggcgggacg 893401 gctgctcggc gccggatacc gcgagctgcg cgaagcggat cgctggccgg acaaaccggg 893461 ccggtacttc accgtccggg ctggctcgct ggtggcgtgg aacgccgagc agagcgggca 893521 cacgcaggtc ccattccgga tcgtcggcgc gcacaccgac agccccaatc tgcgggtcaa 893581 gcagcatccg gacaggctcg tcgccggctg gcacgtggtg gcgctgcaac cgtatggggg 893641 agtttggctg cactcctggc tggatcgcga tctgggcatc agcgggcggc tatcggtgcg 893701 tgacggtacc ggggtcagcc accggctggt cctgatcgac gacccgatcc tgcgggtgcc 893761 gcagctggcg attcacctgg ccgaggaccg caagtcgctc acgctcgatc cgcaacgaca 893821 catcaacgct gtatggggcg tgggagagcg ggtggagtcc tttgtggggt acgtcgctca 893881 gcgcgccggg gtggcggcgg ccgacgtgct ggccgcggac ctgatgaccc atgacttgac 893941 cccgtcggcg ctgatcggcg cttcggtcaa cggcactgcc agcctgctca gcgcgccgcg 894001 gctggacaac caggccagtt gctatgccgg gatggaggca ctgctggccg tggacgtgga 894061 ctcggcgtcg agcggattcg tgcccgtgct ggcgattttc gaccacgagg aggtgggatc 894121 ggcctcgggc cacggcgcac agtccgatct gctatccagc gtgctcgaac gcatcgtgct 894181 cgcggcgggc ggcacccggg aggacttcct gcgccgactg accacctcga tgctcgcctc 894241 ggccgacatg gcgcatgcga cgcaccccaa ctacccggac cgtcacgagc cgagccaccc 894301 gatcgaagtc aacgcgggtc cggtgctcaa ggtgcaccca aatctgcgct acgccaccga 894361 cggacgcacc gcggcggcgt tcgcactggc ctgccagcgc gcgggagtgc ctatgcagcg 894421 ttacgaacat cgcgccgatc tgccgtgcgg gtcgacgatc gggccgttgg ccgcggcgcg 894481 caccggaatc cccacggtcg acgtcggcgc cgcccagctg gcgatgcact ccgcgcgaga 894541 gttgatgggc gctcacgacg tagccgccta ttcggcggca ctgcaagcgt ttctttccgc 894601 cgagctatcc gaggcatagg gtcgggcggt atggcactca aggtagagat ggtcactttc 894661 gactgcagcg accctgcgaa gcttgccggc tggtgggccg agcagttcga tggcacgacg 894721 cgtgaactgc tgcccggcga attcgtcgtg gtcgcccgga ccgatggacc gcggttggga 894781 ttccagaagg tgcccgatcc cgcccctggg aaaaaccgcg tgcacctcga cttcacgacc 894841 aaggacctgg atgccgaggt gttgcgcctg gtcgccgccg gagccagtga ggtcgggcgg 894901 catcaggtcg gcgagagctt tcgctgggtg gtgctggctg accccgaagg caacgctttt 894961 tgcgtggcgg gtcaataacg aggcggttcc aaggggccga aaagcggccg gcagcggtcg 895021 aacccgtcca cccgaacctc aacagtgcga tggcgctgcc aatcgtcgcg ggtcagccgg 895081 aataacagcg cctctgccat agccccttcg cgtgccacgc gatctaggcc gttgtcgcgg 895141 tatccgttac ggcgggatac cgcgatcgag gccgggttat ccacgaacga cctcgacgtc 895201 gcgacctgcg cctccagctc ggcaaacgcg aaatacagta cagccgcccg catctcggtg 895261 ccgtagccgt gaccttggta acgcaacccg agccatgatc cagaatccac ctgacgggtg 895321 attgggaaat ccttggagct cagggcctgt acgcctacgg ccctaccgtc gacgaggacg 895381 gccagcggca gcgaccagtc atcccgcttg aacccggcca gttgctgcca taggtgcgac 895441 agcgtgttga acggcaggtc ctcgcgcgat gctcgcgtcc acggaaccga aaacggcatt 895501 cggtcggggt cgtggactcc ctccaggatg gtgtcgatca gctggtcgca caactcctcg 895561 gtgggcagtt gcaactggag ccgcggcgtg gtgatgcgca ggtcgaacaa cggccagtga 895621 cgagacatgg ttccattttg cgcaccacca tcctgagcgc ccgccccgat gtcagcccga 895681 cggctgatgc caccggggtt cttgccgcgg gcatacctat ccgtcggctt gtccgtgtca 895741 acgcggccgc agcgcgatgg ggcctagcta gactgcctcc gtgatgtctc cgctcgcccg 895801 gaccccgcgc aaaacgtcgg tgctggacac cgtcgaacac gccgcgacca cacccgacca 895861 accacaaccg tatggtgagc tgggcctcaa agacgacgag taccggcgga ttcgccagat 895921 cctgggccgc cggcccaccg acaccgagct ggccatgtac tcggtgatgt ggagcgaaca 895981 ctgttcgtac aagtcctcca aggtgcacct gcgctacttc ggtgagacca cctccgacga 896041 gatgcgcgcg gccatgctgg ccggcatcgg cgagaacgcc ggcgtcgtcg acatcggcga 896101 cggctgggcg gtcaccttca aggtggagtc acacaaccac ccgtcctacg tcgagcccta 896161 ccagggcgcg gccaccgggg tgggcggcat cgtccgcgac atcatggcca tgggcgcccg 896221 accggtcgcc gtgatggacc agcttcggtt cggcgccgcc gacgcccccg atacccgccg 896281 cgtgctcgac ggcgtggtcc gcggcatcgg cggatacggc aactccctgg gcctgcccaa 896341 cattggcgga gagaccgtct tcgacccgtg ctacgccggc aaccccttag tgaacgcgtt 896401 gtgtgtcggc gtattacggc aggaggacct gcatttggcg ttcgcctccg gcgccggcaa 896461 caagatcatc ctgtttggcg cgcgcaccgg gctcgacggt atcggcgggg tgtcggtgct 896521 ggcgtcggac accttcgatg ccgagggatc ccgcaagaag ctgccctcgg tgcaggtcgg 896581 cgacccgttc atggagaagg tgctcatcga atgctgtctc gagctctacg cgggcggcct 896641 ggtgatcggc atccaagacc tgggcggagc cggattatct tgtgccacat cggagttagc 896701 atccgccggt gatggcggaa tgacgatcca gctggacagc gtcccgctgc gggccaagga 896761 gatgacgccc gccgaggtgc tctgcagcga atcgcaggag cggatgtgcg cggtggtctc 896821 cccgaagaac gtcgacgcat tcctggcggt gtgccgcaag tgggaggtgc tggcgacggt 896881 gatcggcgag gtcaccgacg gcgaccggct gcagatcacc tggcacggcg agacggtggt 896941 cgacgtgccg ccgcgcaccg tagctcacga aggtccggta tatcagcgcc cggtcgcccg 897001 ccccgatacg caggacgcgc tgaacgccga ccgctcggcc aagctgtcac ggccggtcac 897061 cggcgacgag ctgcgcgcga ctttgcttgc gttacttggc agcccgcacc tgtgcagccg 897121 cgcgttcatc accgagcagt acgaccgcta tgtgcgcggc aacacggtgc tcgccgagca 897181 cgccgacggc ggcatgctgc gcatcgacga gtcgaccggc cggggcatcg cggtatcgac 897241 cgacgcgtcg ggacgctaca cgctgctgga tccctacgct ggcgcgcaac tcgcgttggc 897301 cgaggcgtac cgcaacgtcg ccgtcaccgg cgccaccccg gtcgcggtga ccaactgcct 897361 gaacttcggt tcccccgagg accccggcgt gatgtggcag ttcacgcagg cggtccgcgg 897421 tctggccgat ggctgtgcgg acctcgggat tccggtgacc ggtggcaacg tgagtttcta 897481 caaccaaacc ggttcggcgg caatcctgcc cacgccggtg gtcggggtgc tcggcgtcat 897541 cgacgatgtg cgtcggcgca tccctaccgg cctgggcgcc gagcccgggg aaacgttgat 897601 gctgttgggc gacacccgcg acgagttcga cggttccgtg tgggcgcagg tgaccgcaga 897661 ccacctgggt ggattgccgc cggtagtcga tctggcgcgg gagaagctgc tggccgcggt 897721 gctgagctcg gcgtcgcggg acgggctagt gtccgcggcg cacgatctgt ccgagggtgg 897781 gctggcccaa gccatcgtgg aatcggcgtt agcgggtgaa accggttgcc gcatagtgct 897841 tcccgaaggg gctgacccgt ttgtgctgct gttctccgag tcggcgggtc gggtgctggt 897901 cgcggtgcca cgcaccgagg agagccggtt tcgcgggatg tgtgaggcgc ggggacttcc 897961 cgcggtccgc atcggcgtcg tcgatcaagg ttcggacgcg gttgaggtgc agggcttgtt 898021 cgcggtgtcg ttggccgaac tgcgtgcgac atccgaggcg gtgttgccgc gatacttcgg 898081 atgagtcggc ttcgcgccct gtctttggcc gccggcctgg tcggctggag tctggtcagc 898141 ccgcggctgc cggcgccgtg gcggattccg ttgcaggcgg ggctggggag cgtgttggtg 898201 ctggttactc gtgcgacgat gggcctttgg ccgccgcggc tgtgggccgg gctgcggctg 898261 ggctgggccg cgggggcggc ggcggcgacc gcgatcgcgg caacgacgcc ggtgccgatg 898321 gtgcggttgt cgatgtcggc tcgtgagttg ccggcgtcgg tgccggtctg gctggtatgg 898381 cacatacctg gcggcacggt gtgggccgag gaggccgcgt ttcgcggggc gctggccact 898441 atcggtgccc gggccttcgg tcggtcgggt ggacggatac tgcaggccgg cgcctttggt 898501 ttgtctcaca tcgccgacgc gcgcgcgacg ggcgagccgc tggtgctcac ggtgttggcc 898561 accggtatcg ccggctggat gttcggttgg ctggccgacc ggtccggcag tctggcagca 898621 ccgctgctga cgcacttggc catcaacgag gccggtgcgg tcgccgcggt gctggtccag 898681 cggcgttctg gtatctcgac tcgactgtga tcgcggggtc gggcccctgg tgatcgtgga 898741 acggctcaca acagcgcgga cctggtcggc ggcgccgcta tactgattgg tcactgtcta 898801 accaatcaat ggagagggtt ggcacctcag gtgcatagac ttagggccgc ggagcatccg 898861 cggccggatt acgttctctt acatatcagc gacactcatc tcatcggggg ggatcgtcgg 898921 ctctacgggg cggtggacgc cgacgaccgg ctgggcgaac tgctcgaaca gttgaaccaa 898981 tccggccttc gtcccgatgc gatcgtcttc accggcgatt tggccgataa gggcgaaccg 899041 gcggcatacc gcaagctccg aggcctggtc gagccgttcg cggcgcagtt gggcgccgag 899101 ctcgtctggg tgatgggtaa ccacgacgac cgggccgaac tacgcaaatt cttgctggac 899161 gaagcgccat cgatggcgcc gctagaccgg gtgtgcatga tcgacggtct gcgcatcatc 899221 gtgttggata cctcggtacc cggacatcat cacggcgaaa tccgcgcgtc ccaattgggt 899281 tggcttgctg aagagttggc cacgccagcg ccggacggca ccattttggc gttgcatcat 899341 ccgccgattc cgagtgtttt ggatatggcc gtcacggtgg agctgcgcga ccaggctgcg 899401 cttgggcgag tgctgcgggg cactgacgtt cgcgccattt tggccgggca cctgcactac 899461 tcgacgaatg ccaccttcgt cgggatccca gtgtcggttg cctcggcgac ttgctacacc 899521 caggacctga ccgtcgctgc tggaggaacg cgtggcagag acggcgccca aggttgcaac 899581 ctggtgcacg tctatccgga caccgtcgtg cattcggtga ttccgctggg cggcggagaa 899641 acggtcggca cctttgtctc acccgggcag gcgcgacgca aaatcgccga gagcggcatt 899701 ttcatcgaac cgtcgcgtcg cgattcgcta ttcaagcacc ctccgatggt gctgacgtcc 899761 tcggcaccgc gaagtcccgt cgactgacgt ccgcggcgat cttctcccag ggagccggta 899821 tcgggaaata gcgctccagg aaactgacga ctcgttctgc gcgctgcgct gcggggactt 899881 caggaaagct accgtcgttg aggcagaaga aatcgtatcc gcggtgcttc cgcaacttag 899941 gaagtagccg aagacccgca tagctggtgg tgtcgacata gaggacttta gccttttcct 900001 gcgggacggc gcgtccggtc atcagcgcgt aatagtggta gaacgagttg gtcaccgaga 900061 tgtcggtgtc ggagcggaac gggctggccg cggtgcgggc gaattcctcc gggaattccc 900121 gctccatctc gatcagcaca ctcttgcgca acggtaccgc ggtgtgctcg agatgacggg 900181 taatcacctg cccgaaccgg tcgaagagca gctgccggtt tacccgggcc gcgttttcaa 900241 agccactacg cgctgggttg ttggcgccga gcccgatccg ggtcttggct tcgatgaacc 900301 tggtgactcc accgggagag aagaacatac tggccttgag cggccggccg aagaacatgt 900361 cgtcgttgga gtacaagaag tgctcgctga gccccgggat gtggtgcagc tggctctcca 900421 ccgcatgcga gttataggtc ggcaacgcgg aacggtcgga aaagtggtcc tcggcgcgaa 900481 cgatggtgat tttaggatgt tcggccaacc atggcggcgg ggttgaatcc gtcgcgatga 900541 agatgcgacg tatccacgga gcaaacatgt tcaccgaccg cagcgcgtat ttcaactcgt 900601 cgatttggcg gatccgcgct tcggcgtcgt cgccctcgcc caccacgtac tgcgacattt 900661 gagccatgcg gcgcgcccgg aactcggggt cactaccgtc cacccaggag aacaccatgt 900721 ctatgtcgaa cacgacgtcg ctggcgtgcg gggcaaacat cccgtcaagg gtcggccatt 900781 tgtacccgta gagtttgaca tttgtcggcg ttatttcgtt tcggggcagc actttgcggc 900841 taagcgagtt ttcgacaggg cagcggatca ccgtctcctc gtatacccag aattgcagtt 900901 ccacaccgaa cgccgggccg tagcgaaatc cgcccggcgc gatccggcgt cgatacaacc 900961 gcacgacacg cgggtcaacc agctgcgaca gcccgtcggt ggcgaccaaa acaggagaaa 901021 ggccaggctc atcaatagtt ttggcgtaca tcggttcggt tgcacatgcg gccgcaagag 901081 cgcgctcgag gccggcacgt agttcgatgt tgatggcaag caccggccgg ttcttgtggt 901141 ttcggatcag tagataggga atatcagccc tgtttaacac ctttcgcaga aagaccagat 901201 cttcgatctg ggcctcctgg ggggtcaggc cggattccag gcgggcgatc ttgccgcgcc 901261 gggtaacgat gatgggattc acggtgcgct gagcgggccg accgccgtcg cgcgaagaga 901321 ttttgggcat cgggtcaccg ccttgggaac tcagggagaa atgattaggt caccgaaaga 901381 atctcacaga tcgcgggtcg gcgcaggttg accgcgctgg cgcggggtcc atacagaatt 901441 gtgcggtcaa ggcgataact cttgcaagac accagatcta gcgatctaag aacatcggcc 901501 ggaaacctgg ttgttgcggc cgcgccatgt caagttcagt tcggaactgg gctcgcatac 901561 aacccgatcc cagtctcagc agcggcgctt ggccgccatc tggatggatc caccgattct 901621 tgagacccta aggtatgagc gctcgtgatc gagtcgatcc ggcgaagact cggcaggtcg 901681 tgttggccct cgcggactgg ttgcgcgacg aaacgttgcc agcacccgac accgacgtgt 901741 tggcggcggc ggttcggctt acggcgcgca cgctcgctgc gctggcccct ggcgccagcg 901801 tcgaagtccg gatcccaccg tttgctgcgg tgcagtgcat ttctgggccc cggcacactc 901861 gcggcacacc ccccaacgtc gtgcagaccg acccacggac ctggctcctg gtggctaccg 901921 ggctgtcggg ggtggcgcag gcccggggca gtggcgcgct gcagctctcc ggctcgcggg 901981 ccggtgagat cgaggcctgg ttgccactgg tggatctcgg ctgattccgg cgtgctgagc 902041 tgcggctatg gtgtgtgagg gtggcgccgg ggtgcccgac acgtaagccg aattcggcgg 902101 tgcagacgtc gtggccgtag actcggatta cgtcaccgac cgcgccgcag ggagccgcca 902161 aaccgtgacc ggccagcaac ccgagcaaga cctgaactcg ccccgggaag agtgcggtgt 902221 cttcggggtc tgggccccgg gtgaagacgt cgccaaactc acctactacg gcctgtacgc 902281 gttgcagcat cgcggccagg aagccgccgg gatcgccgtc gccgacggct cccaggtgct 902341 ggtcttcaaa gacctcggcc tggtcagcca ggtgttcgac gagcagacgt tggcggccat 902401 gcagggccat gtcgccatcg ggcactgtcg ttactccacc accggggaca cgacgtggga 902461 gaacgcccag cccgtgttcc gcaacaccgc cgctggcacc ggtgttgcgt tgggccacaa 902521 cggaaatctg gtcaatgccg ctgcccttgc cgcccgcgcc cgcgacgcgg ggttgatcgc 902581 cacccgctgc ccagccccgg cgacgacgga ctccgacatt ctgggggcgc tgctggccca 902641 cggtgctgcc gattccaccc tcgaacaggc ggcgctggac ctgctgccca cagtgcgggg 902701 agcgttctgt ctgacgttca tggacgaaaa cacgctttat gcgtgccgcg acccgtacgg 902761 ggtgcgcccg ctatcgctcg ggcgtttgga ccgtggctgg gtggtggcct ccgaaacggc 902821 cgcactcgac atcgtcggcg cctcgttcgt ccgtgatatc gaaccgggcg aattgctggc 902881 tatcgacgcc gacggggtgc ggtccacccg ctttgccaac cccacgccca agggctgcgt 902941 attcgaatac gtctacctgg cgcggccgga cagtacgatc gccggccggt cggtacacgc 903001 cgcgcgggtg gagatcggtc gccgactggc tcgggaatgc ccggtcgagg ccgacttggt 903061 gattggtgtg ccggaatccg gcacacccgc cgcggtcggc tacgcgcagg agtccggcgt 903121 tccatatggg cagggtctga tgaagaacgc ctatgtcggg cgcaccttca tccagccgtc 903181 acagaccatc cgtcagctcg gcatccggct gaagctcaac ccgctcaaag aggtgatccg 903241 cggcaagcgg ctcatcgtcg tcgacgactc gatcgtgcgg ggcaacaccc agcgtgcgct 903301 ggtacgcatg ctgcgcgagg ccggtgcggt cgaattgcat gtgcgcatcg cctcgccacc 903361 ggtgaagtgg ccgtgcttct acggtatcga cttcccctcg ccggccgagt tgatcgccaa 903421 cgccgtggaa aacgaggacg agatgctgga ggcggtacgg catgccatcg gggccgacac 903481 gctgggatac atctcgctgc ggggcatggt cgcggcgtcc gagcagccca cgtcgcggct 903541 gtgcaccgct tgcttcgacg gcaagtatcc aatagagctg ccccgcgaga ccgcgctagg 903601 caaaaatgtc atcgagcaca tgctcgccaa tgcggcccgc ggagccgcgc tgggcgaact 903661 cgccgccgac gacgaagtcc ccgttgggcg ctgacaaaac gcacgcgcgg tagcctttat 903721 cgcgatgacg gatctcgcaa aaggccccgg aaaagacccg ggtagtcggg gtatcaccta 903781 cgcgtcggcc ggggtcgaca tcgaagccgg tgaccgcgcc atcgacctgt tcaagccgct 903841 cgcttcgaag gccaccagac ccgaagtgcg cggcgggctg gggggattcg ccggactgtt 903901 cactctccgc ggtgactacc gcgaaccggt gctggcggcc tccagcgacg gcgtcggcac 903961 caaactcgcg atcgctcagg cgatggataa gcacgacacg gtgggcctgg acctggtggc 904021 gatggtggtc gatgacttgg tggtttgcgg cgccgagccg ctgttcctgt tggattacat 904081 cgccgtcggt cggatcgtgc cggagcgact cagcgcgatc gtcgccggta tcgccgatgg 904141 gtgcatgcgt gccggctgtg cgctgcttgg cggcgagacc gcagaacatc cgggcctgat 904201 cgagcccgat cactacgata tctctgccac cggcgtcggc gtcgtcgagg cggacaatgt 904261 gctgggtccc gaccgggtca aacccggcga cgtcatcatc gcgatgggct cgtcgggtct 904321 gcattccaat gggtactcgc tggtccgcaa ggtgttgctg gagatcgacc ggatgaatct 904381 ggccggtcat gtggaggagt tcggtcgcac cttgggcgaa gagttattgg agccgactcg 904441 catctacgcc aaagactgtt tggccttggc cgccgaaacc cgtgtccgga cgttttgcca 904501 cgtcaccggc ggcgggctcg ccggcaacct gcaacgggtc atcccgcatg gcctcatcgc 904561 cgaggtcgac cgcggcacct ggacacccgc gccggtattc accatgattg cccagcgcgg 904621 ccgggtcagg cgcacagaga tggagaagac gttcaacatg ggtgtcggca tgatcgccgt 904681 cgttgccccc gaagacacga cgcgcgccct ggccgtcctg accgcgcggc acctggactg 904741 ctgggtattg ggaaccgtct gcaaaggcgg aaaacaaggc ccgcgggcaa aactggttgg 904801 gcagcacccg agattctaag aaccagacct aaccgggtct aatgaggtca acgccacgcc 904861 gatgggaacc gaatcggcac cgtgcggggg gcagctccgt ggtgctagcg ccgccagtcg 904921 tcctcatcgt tccacgagtc gtcgtccgac gggccgtcgc cgtccagtcg gtcggtaccg 904981 gtacctgaca gctcacgctg aagccgctgg aagtcggtct gcggggagct gtatttcaat 905041 tctcgagcaa ccttggtctg ctttgcctta gcccggccgc ggcccatggg ggaaccccct 905101 cgcgaaataa cggagcggcc taacgagtag gcggctccga tctctggtgt cgtttattgt 905161 cctgccgaca gtttaccgtg ccgcccggtc gggcgcgggg cggcctgccc gccgttacgg 905221 agggcacggg taatcaccga ataccgccgc gcagccgctc gacggcccgc cgtccggcgc 905281 ccacatcgtc ggcgggcggc agcgagtcga cgtcgatcac cgcggcaacc tcggcttccg 905341 gaccggttac cagggcggtg tcgcccggca gaccccgttt gagcagggcc agtgccaccg 905401 gccctagctc tacgtgctcg accaccgttc ccagtcgtcc caccgtgcga ccgccggcca 905461 gcaccgcatc gcccgtcgac ggccgctgca ctgactcgtc cagatgcaac aacaccagca 905521 tccggggtgg tctacccagg ttgtgcaccc gtgcgacggt ctcttgccct cggtaacagc 905581 ccttgttcag gtggacggct ccggcgccgg ggccaccgat ccaacccact tcgtgaggga 905641 tggtgcgttc atcggtgtca acgcccagcc gcgggcgcct agccggcacc cggtgagcca 905701 ctcgatgggc ttcataggcc cagatgccgg ccgggcgcac acccgcctga gtcaggcgac 905761 gctgccagtc ggcacgatcg ccgcgcttca ccaccacgtc cagttcgatt tggcccgcta 905821 ggccgtcggg catccggcgg acaatcccgc cgccggcaag cggcacggcc agccactcag 905881 cgggcaagac atctagaccc agcgcgtcga gcactcgttc ctcagccagc cgcggcccca 905941 atagcgacaa caccgccata tcagcggcac gaggagtgac catcgaccaa aaaaccatct 906001 tgcgcaaata ggccagcagc ggttcacccc gccacggctc ggtatcgaga taggtcgtgc 906061 cacccagctc ggtctgtatc cagtgatcct caactcggcc ttgtccgtcc aggctgagat 906121 tttgggtgct ggcgccctca ggcaggtcgc tgacgtgttg tgtggagatg ctgtgcagcc 906181 aggtttgccg atcgccaccg tcgagggtga gcacggcgcg gtgcgagcga tccaccagca 906241 cggcatcggc ttgccccgcg cgttgctcgc ccagcgggtc gccgtaatgc cagatcgcac 906301 ccgcgtcggg tccggggtct ggggcaggga ctgcggccac acaacaactc tacgaaaagc 906361 cgcgctcggc ctcgttgacc agcgtgcagc taggctgcag ggacatgttg aggcagacgg 906421 gcgtggtggt cacgcttgac ggtgagatcc tgcagccggg tatgccgctg ctgcacgccg 906481 atgatcttgc cgctgtgcgg ggggatggcg ttttcgagac actgctggtg cgcgacggcc 906541 gagcctgtct ggttgaagcg cacctgcagc ggctgaccca atcagccagg ttgatggacc 906601 ttcccgaacc ggatctcccc aggtggcgcc gcgcggtcga ggtggcaacg cagcggtggg 906661 tggctagcac cgctgacgag ggcgcgctgc gcttgatcta cagtcgcggt cgggagggcg 906721 gctcggcgcc gacggcctat gtcatggtca gtccggtccc ggcgcgagtt atcggggccc 906781 gccgcgatgg tgtgtcggcg atcacgctgg accgcggttt gccggctgac ggtggcgacg 906841 ccatgccgtg gctgatagcc agcgccaaaa cactgtccta tgcggtgaac atggccgtcc 906901 tgcgtcatgc cgcccggcag ggcgccggcg acgtcatctt cgtcagcacg gacggctacg 906961 tcctggaagg ccctcgctcg acggtggtga tcgccaccga cggtgaccaa gggggcggga 907021 acccctgctt gctgacgccg cctccgtggt atccaatcct gcggggaacc acgcaacaag 907081 cgctcttcga agtggcccgc gcgaaaggct acgactgcga ctaccgtgcc ctacgcgtcg 907141 ccgatctctt cgattcccaa ggtatttggt tggtatcgag catgactctg gccgcccgcg 907201 tacacaccct ggacgggcgg cgattacccc gcaccccgat cgctgaggtg tttgccgaat 907261 tggtggacgc cgctattgtc agcgaccggt gatacggcaa cctctgttgt ggtcagcgcc 907321 ggccataccg ctcgccgtta tccgacgaac cgggacaacc gcgccgacag atgtggtacc 907381 agcccgccgt cggcatcgac gcgttcctcg acgtaggcca ggtcgccacc ttcgacgatg 907441 ccgtagagtc gtttggcgcc gccgaccaga acgccagacc gactgcgggc cagcgcatcg 907501 gtcaccaact cccacgagga ctgggtgcgc ggccgcccgt agaacagttc gacataaccg 907561 gccgaatgcg ccaatagcaa ctcgatcgcc tgagactcgc tcggatcgta cgggtcggcg 907621 acgaaccgcc agaatcccgc ttctcgtaag cctggttcct ggtagtcgcc cgtggcggtg 907681 agccgccagg accgggattc ccaattcaga tagtcgccgc cgtcgtgtga cacaacgatc 907741 tgctggccga accggtagtc gccgtcgggt ccgcggccct cgccttcgcc gcgccacacg 907801 ccgaccagtg gcagcagcgc cagcagtgca ttgttcaggt cggcaccttc gcgcaggttt 907861 gcggtatctg cgggaaccgg caaatcgtcg aaggcaggga tattgcgcgc ggcggtcgcc 907921 ttggcccgct cgacggcagc ggcgaccgca cggtcgccgg agcccgcagc atggacgccg 907981 ccggccccgg tcgcatccga gcccgcgccg gaactcacga ctcgtcggta acgagccggt 908041 acagcgtgta cagcgcgaac caggagataa ccacggtcgc caagaccagc atgatctcga 908101 agaacagcac cacggggacg agtgtatgcg gccgcggccg cttccgtggc cctggctcgt 908161 ctgggcctga ttggggccgg tcaggtgatc ttgacgtcta cctcgtggat gcccgcgccc 908221 gagggctgca ccaccgcgtc gccgttgccg gccgccgaca gcgcgcgcag cgtccaggat 908281 ccgggcgcgg cgaagaaccg gaaatcgccg gtggccgacg cgacgacctc cgcggtgaac 908341 tcgtcggagg agtccagcag ccgcacgaac gcgccgccca cggcctggcc gtcaccgtcc 908401 actacgcggc cggtgatcac cgtttctttt tccaggtcga cgctggccgg caatgtcagt 908461 ccttgcttgg gtccagagca catatcagct tcccaactcg atcggggcgc ccaccaggga 908521 gccgtattct gtccaactgc cgtcgtagtt cttgacgttt tggtgtccga gtaattcccg 908581 caacacgaac caggtgtgcg aggaccgttc cccgattcgg cagtaggcaa tcgtttcctt 908641 gctgttgtct aggccggcgt cggcgtaaag cttggccaac tcctcatcgg acttgaaggt 908701 gccgtcctcg ttggcggccc tgctccacgg cacgttgatg gcaccaggaa tgtgtccggg 908761 ccgctggctt tgttcctgcg gcaggtgcgc gggggccagg atcttgccgg agaactcgtc 908821 gggagagcgc acgtcgatga ggttcttgac gttgatggcc gccaggacct cgtcgcggaa 908881 tgcccgaatc gtgttatccg gcggggaggc ggtgtaggag gtcaccggcc ggctgaccgg 908941 gtcgctggac agcgggcgtc cgtcgagctc ccacttcttg cggccgccgt cgagcaactt 909001 gaccttctca tggccgtaga gcttgaaata ccagtacgcg taggcggcga accaattgtt 909061 gttgccgccg tacaggatca ccgtgtcctc gttggcgatg ccacgctcgg acagcagctt 909121 ggagaattgc tgggcgtcga cgaagtcacg tttgaccgga tcctgcaggt cggtgcgcca 909181 gtccaacttg atcgcgccgg caatatggtc acggtcatat gcactggtgt cctcgtccac 909241 ttcgacgaaa acgaccttcg gcgcgtgcag attgctctca gcccagtcgg cggagaccag 909301 gacatcgcag cgtgccatgg cgggaatcct ttcgcatagt tcggtgacca gcgtggtcaa 909361 ctggttaggc gggacgggga gtgttactgc ttgactgctc cttgggacgt ctgttgcaca 909421 gaaacggcgg gcgacacgct acggtggggc tcctaggctg ctctaagtgc tgcgcggacg 909481 tgcgcggcta ctcagcagct acagcaacag caacaacccg ctaggcggca cagatcaact 909541 gcgcgacgct tggtgagcat gggctcgatg cgggctgaca cgtcggacag cttacccaat 909601 cgcatagtgc tcaagccaac agtggtttca gggcagagcg caggtcggcg gccttgggga 909661 ccccggaggt ccggtagcgc tgtcgcccgt cgacatcgaa gatcaacgtg gtgggcagcg 909721 aaagcaccga aaatcgccgc gctgcctgcg ggttggagtc caggtcgacc tcgatgtgag 909781 caacatctcc cagatcggcg cagacgtcgc cgacccctcg gcgtacccgg tcgcagggcg 909841 cacaccctgg ggccctgaaa tgcacgacgg tcggcccggc cccggacagg cccagttccg 909901 cggtgcgcgc cggagccgcc ggtgtcgttt ccggaccaac ctcccgcagg atcactgacc 909961 gccgggtcag caaccaccgg gcaatggtcg ccagcgcacc tgtagcaacg gaagcgacga 910021 tcatggtcgt catgactgtt tgaactcgtc gagcgagatc gttactcccc gggtaatgcc 910081 ttcgatgatg acgtccgatc cgcgcgcccc cacggtgttt ggcaccaccc cgaacggcag 910141 cttctggttg ggcagcttgc tggcgaaggc gtgcagcacc gcatcccgct tgtcatccgg 910201 aaccggttgg tccgcggtgt cgggtccggt cacgacggcg gtgggggtga taaccaaggt 910261 cgcgcggtcg tccgaggcaa cggacaggtc caccaagacg ctgacccggt gagcgaagtt 910321 ggccgatatg ggcgtgccgc tgaacaccag cccgcggctg ccagatatcc cggactcggt 910381 agtgccgccg gtggcgtcgt tgctctcctg acggggcgcg gcgaccataa ggtcgctaat 910441 gcccaggtag cggcccaggt gcatggagtc gatgatgatg cgactctcca gctcgccgac 910501 cgggagcttg gcatcgggcc tgatcagcca ggacgcgtag gacaagtcga tcgaatgcat 910561 agtggcctcc agagtggccg tgccacttcc ggcgtgctcc acggcaaatg ccttgatttc 910621 tagctccgcg tagtgttccc gcatcgcctg cgggatgaat gggaaccgca ggatggcgac 910681 gaacgggtct gacctcaggt ttgccgcttt gcgcacagtg gtcgacagcc ggtactcggc 910741 atagatgctg gccccgaaat cggcgccgac ggcgcccacg atgagaacgg ccacgacgat 910801 cgccgccccg gtcaccccga ccagcacctt gcgcatcggc atattgtcgc ccagcgctcg 910861 agcccgtccc ggagcgcctc gtcaggcggc acgttatcgt tagatgagct gccgctaccg 910921 tcacatggcg cgatgaactg ggagacgcct ttcccacgac gctggagggg cttgttggag 910981 ttattactgc tgacctcgga gctgtatccg gatccggtcc tgccggcgct gtcgctgctg 911041 ccccacaccg tgcggacggc gccggccgag gcgtcttcgt tgctggaggc gggaaacgca 911101 gacgctgtgc tcgtcgacgc gcgcaacgac ctgtcgtccg ggcgaggcct gtgccgcctg 911161 ttgagctcga ccggccggtc gatcccggta ctggcggtgg tgagcgaagg cgggctggtg 911221 gcggtcagcg ctgactgggg gctggacgag atcctgctgc tcagcaccgg gcccgctgag 911281 atcgacgcca gactgcggct ggtggttggc cggcgcggag atctggctga ccaggagagt 911341 ctgggcaagg tgagcctggg cgagctggtg atcgacgaag gcacctacac cgcccggctg 911401 cgtggccgcc cgctggatct cacctacaaa gagttcgagc tgctgaaata cctggcgcag 911461 catgccggcc gggtgttcac tcgggcgcag ctgctgcacg aagtatgggg gtatgacttc 911521 ttcgggggca cccggactgt tgatgtgcac gtgcggcggt tgcgggccaa actcggcccc 911581 gagcatgaag cgctgatcgg cacggtgcgc aacgtcggat acaaagctgt tcggccggcg 911641 cgcggccgac cgccggccgc ggaccccgac gacgaagacg ccgatcccgg ccgggatggt 911701 atgcaagaac cactggtcga cccgttgcgc agtcagtgac ggcgcttgac tggcgctccg 911761 ctctgaccgc cgacgagcag cgcagcgtgc gtgcactggt cacggcgaca acagcagtcg 911821 atggggtagc acccgtgggt gaacaggtgc tgcgggaact gggccagcaa cgcaccgagc 911881 atctgctggt ggccggttcg cgaccgggcg gcccgatcat cggctacctc aacctcagcc 911941 caccccgggg cgcgggtggt gcgatggcgg agttggtggt gcatccgcag tctcgacggc 912001 gcggtatcgg caccgccatg gcccgcgcgg cattggccaa gaccgccggc cgcaaccagt 912061 tctgggcgca cggcacgctg gatcccgctc gggcgaccgc gtccgcgctg ggtctggtcg 912121 gcgtccgcga actgatccag atgcgacgcc cgctgcgtga tatccccgaa ccgacgatcc 912181 ccgacggggt ggtgatccgc acctacgcgg gcacgtccga cgacgctgag ctactccggg 912241 tcaacaacgc cgcgttcgcc ggacacccgg aacagggtgg gtggaccgcg gtccagcttg 912301 ccgagcggcg tggcgaggcg tggttcgatc cagacggcct gatcttggcc ttcggtgatt 912361 cgccacgtga acggcctggc cggttgctgg gtttccattg gaccaaagtg catcccgatc 912421 acccgggatt gggcgaggtg tacgtgctgg gcgtcgatcc ggcggcgcag cgccgcggtc 912481 tcggccagat gttgacgtcg atcggtatcg tctcgctggc ccgtcggctg ggcggtcgga 912541 agaccctcga ccctgcggtc gaacccgccg tgctgctcta cgtggagtcg gacaatgtgg 912601 cggccgtgcg aacctaccag agcctgggct tcaccaccta cagcgtcgat accgcctacg 912661 cgctggctgg cacggataac tgaccgaaga tgttcccccc caagaagtcg taagcaggag 912721 cttaagtggc caagcggttg gacctcacgg acgtcaacat ctactacggg tcatttcatg 912781 cggtcgctga tgtgtcgctg gcgattctgc cccgcagcgt cacggcgttc atcggtccct 912841 cgggctgcgg caagacgacg gtgctgcgca ccttgaaccg gatgcatgag gtcatccccg 912901 gagctcgagt cgagggtgcc gtactgctcg atgatcaaga tatctacgcc cccggtatcg 912961 acccggtcgg tgtccgccgg gcaatcggga tggtgtttca gcggccgaat ccattccccg 913021 ccatgtcgat tcgcaacaat gtggttgccg gcctgaagct gcagggtgtg cgcaatcgca 913081 aggtgctcga cgatacggcc gaatcctcgc tgcgcggcgc aaacctgtgg gacgaggtca 913141 aggatcgact ggataaaccc ggcggcggat tgtctggggg gcagcagcag cggttgtgca 913201 tcgcacgggc aatcgccgtg caacccgacg tgttgctgat ggacgagccc tgctcctcgc 913261 tggacccaat ctccaccatg gccatcgaag acctgatcag cgagctcaag cagcagtaca 913321 ccatcgtcat cgtcacccat aacatgcagc aggctgcccg ggtgagtgat cagacggcat 913381 tcttcaacct ggaagcggtg ggaaagccgg ggcggctggt agagatcgcc agcaccgaga 913441 aaatcttctc caacccgaac cagaaggcca ccgaggacta catctccggg cgcttcggct 913501 aggcccgatg ccctcgatgg ccaggctggc gtcaccgcgg gtggatgttt gctcggccta 913561 gggaaaggcg ccggtcgcct ggaagatcac gcgtcgtgcc acttccacgg cgtggtcggc 913621 aaagcgctcg tagaatcggc tcagcaacgt cacgtcgacg gcggccgcca ctccgtgctt 913681 ccattcgcgg tccatcagca cggtgaacaa atgccggtgc aggtcgtcca tcgcgtcgtc 913741 ttcttcgcgg atctgggcgg ccttttccgg gtcgtgcgac aacacgacct cttgggcact 913801 gttgcccaat tcgactgcaa ctcttcccat ttcggcaaaa taaccgttga cctcttcggg 913861 cagcgcgtgc tgtggatgcc gacggcgggc gatcttggcg acatgcagcg ccaacgcccc 913921 catccggtcg atgtcagcca ccatctggat ggcgctcaca atggctctga ggtcaccggc 913981 gaccggtgcc tgcaacgcca gaagaacgaa tgcactctcc tcggcccggg cgcttagcgt 914041 cgcgatcttt tcgtggtcgg agatcacttg ctcggccagc acgagatcgg cctgcagcaa 914101 ggcttgggtg gcccgctcca tggcgatgcc tgctagcccg cacatttccc cgagacgctc 914161 ggataattcc gagagttgct catggtaggc ggtccgcatg tgctaaagcc tacgttcccg 914221 accttggaaa atgccgtaag cgtcgtgtca atgcggctac tcgcaggtgg tgtcggcggc 914281 gttggtgacc gtcaggtcct cgggcagctt ggtcggtggg ctggaggagt tgcggcttat 914341 ctgcacgctg acggtggagc cactcggcag gggagcgcgc accgcgctga agtcttggcc 914401 cagcaccacc tggaccagtt ggccgatccc ggtcacccgc tcgatctttg actggccgaa 914461 cacggcggcc acggtggcgg cagcctgttc gttgccgggc gaaaaaaaca ctgtggtggc 914521 cagcagcgaa ctcgggtagt cgtccggagc catcacgttg aagccgttcc gcttgagctg 914581 atcggtggcg gtggtggcca aaccggcctg gccggtcgag ttagagacct gcactgtgac 914641 ctcttttggc gaggtcgtcg taacctgctg gtgctgaatc tcgttggtca gacccgcctg 914701 cggcgccttc ttggtggtgg tcggcggggt cgacggcgtg ttgcccagac gctgggcgtt 914761 gtgatcgttt tccaggggca gcggatcgtc gtcgatgatg gcggtgaaaa gcgccttcat 914821 gtcggaggta cgcgggggct cgtcgccgtt ctggtcggtt ataccggtcg gaacggtcac 914881 gaacgtgacg tgcccggccg ccatatgctg caacgatcga ccgagttcga ccaggtcttt 914941 ggtcttgacg ttgtccacgt agctgttacc gatgaacatg ttgacgacgt tgttgagcct 915001 gctgaggttg aacaaggtgt ccgtcgagat catcgaacgc agcagcgacg acaaaaacaa 915061 ctgctggcgt ttgatgcgcc cgtagtcgcc attgctctcg gtggtgacct ggcgagcgcg 915121 cacatagttc agcgcggtcg gcccgtcaat gacctggcgt ccggcgtgct ccagcaccgt 915181 gcccagttcg tagtcccgca acggggtggt gctgcatacc tcgacgccgc cgagggcctc 915241 gaccatccgc gcgaaaccga cgaagtcaat cgcgatgaac cggttgatgc tcaagcccga 915301 cagtttctga atgaccttca ctagacactt aggcccgccg aaggagaatg ccgagttcag 915361 cttggtctcc gtgtacacca gtctgggacc catcgttccc gtcttctcgt cgtagatggg 915421 tccgtactta ccggtctcgg ggttccacgc ctcgcattgg attggagtga tcgccaggtc 915481 gcgggggaac gacaccgcga cgacccgctc gcggctggcc ggaatgttga ccagcatgac 915541 ggtgtccgaa cgtgcgccgc cggcgtcctc ggcgtcgccg gcgccgatat tggcgttcgc 915601 cccggcacga gagtccatac cgacgagcaa gaagttctcg tcgccatgct gcccgctggg 915661 gttgacgatg tcgcccgaat gcgggtcgag cgcgcttacc atgttcagcc ggctgttctt 915721 cgacgcgctc cactgccatg ccccgccggt cagcgccaac gccagagcgg caaacagagc 915781 cgccagcgag cgcgcggcca gcaccatcgg gcgccggccg gagttcggcg ctggcttggc 915841 gggcgcgggc gacgttcggc ggatccgcaa tggccgcact cgagccgatc cggttagctg 915901 cttgccgggt agctcgggtt cacggcgggc gtggtcggcg cgcggatagt tggctgcccg 915961 gaggtcggga agctccgaga ggaactcgag cgagtgggcc gggatggcga tagcctcggt 916021 gtcctgctgg tcgtcggcgt cgtcgtggac cttcgggccg cggccggatg gctcgggttc 916081 gggggcgaca tggcggtgcg tggggaggtc aggaaaagcg gggccgagcc tggcgatcag 916141 atcggccaca ctaacggcgc cggtggcatg acagccgaca ttctgggtgt cccgcggacc 916201 ctgggctgcc acccatgtgg cgggcggtac cgtgatccat cggtcaacac catcggggaa 916261 tgctgactcg gagagccgtg cccacggcgc ggcgctctcg ccgtcactca tgtcctaccg 916321 gcctccgaga gtctaggtgg cggacgcccg cggtgttggc tgcgtgtcct acgcgcacct 916381 tcgcgcagca ccgccacgag tcggcgccgc acaatgcagc aaggcccaca tcgtactgat 916441 ttatcggtcc agacgcgatt tcgacagggt ctcgattcag ccacccgacc ccatggcgtc 916501 cgccccttcc ggcactcggc agtcgtcggg gtcggttagc cagccgtcgg gaagggccac 916561 ccgggcgggg gaaccctgcc ggccccgggc gccagtcgcg gagtccggga acggtaccgt 916621 gccgtccaac cggtccagca ggcagtcgag ctcgtcgaac gtcttgacca ttgctaacgc 916681 ccgccggagc gcggagcccg ccgggaagcc atggagatac caggcgatgt gcttgcggat 916741 atcgcgcatg cccttgtcct cgccgaagtg tgcggccagc aaggtgccgt gacggcggat 916801 gatgtcggcg acttcgccga gcgtgggtgg ggtgggggcc gggctgccgg tgaaagccgc 916861 ggacaactcg gcaaatagcc agggacggcc caggcagcca cggccgatga ccacgccgtc 916921 acagccggtg gtggacatca tggccagtgc gtcgccggca tcgtagatgt cgccgttgcc 916981 gagcaccgga atcgtccgga catgctgctt gagccgggcg atctgttccc agtcggcggt 917041 gccggaatag cgttgtgccg cggtacgggc gtgcagcgcg accgcagcgg ctccttcggc 917101 ctcagcgatg cggccggcat ccagatgtgt gtggtgggcg tcatcgatgc caatgcgaaa 917161 cttgaccgtc accggtatat cggtgccttc ggtggcgcgc acagccgcgg ccacgatctg 917221 accgaatagc cgccgtttga acggtagcgc cgccccgccg ccgcgcttgg tgactttggg 917281 cactgggcag ccgaaattca tgtcgatgtg atcggctaac ccttcgccag cgatcatccg 917341 agcggccgca tacgtggtgt ccggatcgac ggtgtacagc tgcagcgagc gtggtgattc 917401 gtccgcggag aacgttgtca tgtgcatggt gaccgggtgc cgctcgatga gcgcacgtgc 917461 ggtcaccatc tcgcagacat acagtccgct gaccgtgccg accttcgact gttccagctg 917521 acgacacagc gcccggaatg cgacgttcgt cacaccggcc atcggagcca gcacaaccgg 917581 gctggcgagc tcgatcgggc cgatgcgcaa cgccgggctg ggttggattg cccgcctcct 917641 gctcatcgcg ctgcgcgctc tgcatcgtcg ccgggctggg ttggattgcc cgcctcctgc 917701 tcatcgcgct gcgcgctctg catcgtcgcc gggctaacga cggctcatcg ccagtttgcc 917761 agcggtttta tgcagctcgt gtgcgctgac cttcttgccc gtacgggctt cccggtcgag 917821 ttggcgttgc ttggacacct cgaacttgtc gcaggccagc tcgaggtcct tgatcaccag 917881 ggccagctcg tcgcgcagct tagccccctc gccggtgaag tcctcgcgct cgaagatacg 917941 ccatttcttc agtaccggca tgacgacttc gtcgaggtgg atgcgcgggt cgtagacacc 918001 cccgacggcg atgaccacgg ctttgcgccg gaactcgggt acttggaagc cgggcatctg 918061 gaagtggctc aaaatcaggt gcagcgactt catggcctgg ttgggcacga ggtcgaacgc 918121 ggcctcgctg acgtcgcggt agaagatcat gtgcagattc tcgtctgccg agatcttggc 918181 catgagctgg tcggcgacgg ggtcgttaca tgccttgccg gtattgcggt gcgaaatccg 918241 ggttgccagt tcctggaaac tgacatagag gacggagtcg gtgaggctct ccgcgaaata 918301 gtggccctgg tggttttggc ctgggctgaa gccccggttg actacctcga ggcgaagttt 918361 ctccaactcg acagggtcga ccgatcgggt caccaccagg tagtcgcgca gcgcgatgcc 918421 gtgccgattc tcctcggcgg tccaacggtt gacccactgc ccccacgcgc cgtccatgcc 918481 catgttcatc gcgatctcgc ggtgatacga cggcaggttg tcctcggtga ccaggttctg 918541 caccatcgcc acctgggcga catcagaaag cttgctctgg tcggggtccc aatcctgccc 918601 gccgagcgcg tagtagttct tcccgtccga ccacgggatg tagtcgtgcg ggttccaggg 918661 cttgtgcatg ctcaggtgcc ggttcaggta cttctcgacg accggttcaa gttcgtgcag 918721 cagctgcagg tcggtcagct tggctgacat ggcgcctcca gttatctgtg tctaatggtt 918781 gcagtcaata tatctgtgtc tctcggtagc atcaagtttg ggcttcgcgc ggcatgttga 918841 gctgccagca gcgggcagga tgctggcatc ggcgggcccc ggtggccgcg tggggtgaac 918901 cccagtcgtc ctcagttgtg cggcccggct gggatggagt gttcggattc tccccgctcg 918961 cggtgcggtg cgtaggtggc ggcggtgctg agcaacatgt tgacgcagta gtcgatgaat 919021 tgcttgcggg tggctcccag ccgtccgttc agatatgcgg tgaacagacc ggtaagagcg 919081 ccgatcaagc tggtggcgac cagtttctgc agaactggat caacgatgcg ggacaacttg 919141 cgttgcagca actcgatgaa gttgggcatc cactccgcgc ccgaccgggt cagggccggt 919201 tctaccgccg gcgccagcaa cagcacgcgc ccgcgcaccg gatcgtcgac catcagctcg 919261 acgaattgct ctacggcctc gcgcggggtt tgcgcggacg tgagggttgc catcgctcgt 919321 gtgcagacgt cgtcgtagac cgcgcgaacg aaatgttcac ggtcggcgaa gctttcgtaa 919381 aagtagcgtt ctgtcaggcc ggcgtggcgg cacactgcgc ggacggtgag tgcgggtccg 919441 cctgcgccgc cgagcaactg cacgccggcg gcgacgaggt tgtctcgacg tagggcgtgc 919501 cgactttcca aggggacacc ggaccagcgg ccccggtttt gaccggtctg cacagctctc 919561 ctaaactcca tagtgacaac gtgcgtagtc agaattcgtg tggccaatga agattcagca 919621 ggcaaaacca ccagtgaccc aagatacgtc tgctacctgt ccgctgacca gcaccgtgca 919681 ggattcctcg ccggttgcgg gccagcttgg caggcctata gggttccgcg gactggccgg 919741 cggttgcccc gtgtcaccgc tgggttacga atcgccgccg ctgccgctgg ggccggattc 919801 gctgacgtgg cgatacttcg gtgactggcg cgggatgctg cagggaccgt gggcgggatc 919861 catgcagaat atgcatccgc agctgggcgc ggcggtcgaa gatcattcga cgttcttccg 919921 ggaacgctgg ccacggctgc tgcggtcgtt gtacccgatc ggcggagttg tcttcgacgg 919981 cgatcgagcc ccagtcaccg gtgtgcaggt gcgtgactac cacatcacca tcaagggtgt 920041 cgacggtgcg ggccgtcgct accacgcgtt gaatcccgac gtcttctact gggcgcacgc 920101 caccttcttt gtcggcacgt tgcatgtggc cgagcggttc tgcggtggcc tgaccgaggc 920161 gcagcggcgc cagctatttg acgagcacgt ccagtggtac cgcatgtacg gcatgagcat 920221 gcggccggtg ccggcgacct gggaggagtt tcaggactac tgggaccaca tgtgccgcaa 920281 cgtgctggag aacaacttcg cggcgcgtgc cgtgctcgac ctgaccgaac tacccaaacc 920341 gccattcgcc caacgagttc cggattggct gtgggccgcg ccgcgcaagt tgctggcccg 920401 gttcttcgtc tggctgaccg tcggactcta cgatccgccc gtgcgcgagc tgatgggcta 920461 ccggtggttg cgccgcgacg aatggttgca ccgccgcttt ggcgacatcg tccggctcgt 920521 ctttgccttg gtgccattcc ggtttcgcaa gcacccgcgg gctcgcgccg gctgggaccg 920581 tgccaccggc cgcatccccg ccgatgcgcc gctagtacag acgcccgcgc gcaacctgcc 920641 gccgcccgac gagcgtgaca acccgacgca ctactgccct aaggtctgac cccggacctg 920701 cggcgcaacc ggggcgtggt tgtgctcacc gttaattggc ttacccgaca tccttggtag 920761 ccgatgcctt agcgaccgac tgcagtccgc cggcagcacg gtggtggcgg ggaatcccgg 920821 gaccggcgtg ctcggcgttg aaaacggcgt cgatgacgag ctggcgcacg tgctcgttct 920881 ccagacggta aaagatcgtg gttccatcgc ggcgggtgcg caccagccgc gccattcgta 920941 gctttgccag gtgctgggag accgacggcg cgggcttgcc cacctgctcg gcgagttcat 921001 tgaccgacat ttcgcggtct gccagcgacc acagcacctg cacgcgggtc gcgtcggcga 921061 gcattcggaa cacctcgacc accaagcaga cctgatcgtc aggcaacggg tcaggtccac 921121 tatctgcgta catacgcaaa caatagaacg cgggcgtggt gggctgtcaa ggtcgcgggt 921181 cggcgcccgc tcagcccgtc ggagcggcga tcgcgctgcg ctcaccgccg ttgggttcct 921241 gccggaaccg gtagacatcc accgcgccag ccctgatatc gggccggtgc tcttggcgca 921301 tcggcaggcg ccggtcctgc cattccttgg cgaactcgtc gtagaaggtg gcgggctcga 921361 agtacctgcg gtcatcgacg taatgcggtt cataggcgtc acgcgacgtc aggaagacaa 921421 cctcatcggg tgagcagtag tacagcgatc catagcacat cggacacgga tgggccagca 921481 cgttgagagt ggtaccgacc aggtgctcag tgcccagctt ggtgcacgcg gcacggatgg 921541 caaggctctc ggcgtgggcg gtcggatcat tggtttgggc ccatcgtcga aaacctgcca 921601 tgcctgccgg catgtgcaag acatcggctg ggacgaaaaa tggcaatgcg acggctgttc 921661 gatcacgcac caacgtgacg acaacgccgc gatcaacctc gcacgctacg aggaaccacc 921721 tagcgtcgtc ggcccagttg gggccgccgt caagcgtgga gccgaccgta agaccgggcc 921781 tggcccggcg ggtggccgtg aagcgcggaa ggcaaccggc cacccggctg gcgaacaacc 921841 ccgagacggg gtgctagtcg cgtgaccact aaagatcact cacttgcaac ggtagttcgc 921901 agtggagacc acggtagtag ctagactatc tacatttatc gcatatccgt tttgcttgag 921961 ggggcaacga tggtacgcgc cgatcgtgat cgctgggatc tcgcgacgag tgtcggggcg 922021 acggctacca tggtcgccgc ccagcgcgcg ctggctgccg acccgcgata tgcgctgatc 922081 gatgatccat atgcggcgcc gttggtgcgt gccgttggta tggacgtcta cacgcggctg 922141 gtggattggc agatccccgt cgagggggat tccgagttcg atccgcagcg aatggccacg 922201 gggatggcct gccgcaccag gttcttcgat cagttcttcc ttgatgccac ccacagtggc 922261 atcggccagt tcgtcatcct ggcgtccggg ctggacgccc gggcttaccg ccttgcctgg 922321 ccggtgggca gcatcgtcta cgaagtggac atgccggagg tgatcgagtt caagaccgcc 922381 acgctgagcg atctgggcgc cgagccggcc accgaacgcc ggactgtcgc ggtcgacttg 922441 cgcgacgact gggccaccgc acttcagacg gcgggttttg atccgaaggt gccagcggcc 922501 tggagtgctg aagggttgct ggtatacctg ccggtcgaag ctcaggatgc gctgttcgac 922561 aacatcaccg cgttgagtgc tcccggtagt cggctggcgt tcgaattcgt gccggatacc 922621 gcgatttttg ccgatgagcg atggcgcaac tatcacaatc ggatgagcga gctcggattc 922681 gacatcgacc tcaacgagct ggtgtaccac ggtcagcgtg gtcacgttct cgactattta 922741 acccgcgatg gctggcagac ctcggcgctt acggtcacgc agttgtacga ggcaaacggc 922801 tttgcctatc ccgacgacga gctcgcgacg gcgtttgccg acctcaccta cagcagcgcg 922861 acgctcatgc gctaaagcaa gcgatctgac cgcttactgg cgaagcagct catctttcag 922921 gcgactggtg atcatctcct gaaacacgac ctgggccgga ccgtacaggt cctggaatgt 922981 cgacactaag gcgtccctgt tgtactcggg aatggagccg ccactgggag tccaaaagct 923041 atcgatgtcc agcaggaaga atggtccggt ttgggcgggt gttattcggc gcagatggta 923101 attgggatca agcgcttggc ccatacccgg gccgtagcgc acgatgagcg atttgcctgg 923161 ttgtagctca cggtagactg cggcaccctg ccactcggtc aggaccaggc cgccgggagt 923221 gaaacgctgc ggcccgagca gctgctcgtc gatccagttg ctccacgtga tccggccgtc 923281 gacacccgcg gggacgcgga tctccagaac aaagcgaaga ccgatacgct ccaacccaac 923341 gattgacgag acctgcgcgc gagcatccac gacccgcatc acaacgtcgg taaaggcctc 923401 aaagctgcgg taggcggtgg tctccacgac tatcgcctgg ttcttcagtg aagcggcggt 923461 ggtgttatcg cgattgacat aacgaacgaa acgatccgcg accggggtgg gggctccacc 923521 gggcgccgtc atcccccagc tgacgtcctg cgcctggcgt tcgatcggta gatcattgat 923581 aagcaggtgt ttgagctccc ggttcgctga ttcggtgagc gaatccgttg tcgggtgacg 923641 gatttccacc gtcaccaggg caacgggtgc gttgggctgg acctcatcct gatttgtctc 923701 ggggagcata gacagcaagc atagccaggt tgctttgctc agatcgccgg accgtgcatc 923761 gggagggaat cggcgatgcg cacggcttcg tgcccctgtt tgtgccccca ccaggactcg 923821 aacctgggac ctgcggatta aaagtccgta gctctaccaa ctgagctata ggggcgcgaa 923881 gactcaggat actgcgttgg cgtcggccgc tcgtttgagg aataggctgg gggtgaccta 923941 agctggcgtg gctcccaacg gtcaccacgt tgcgagtgcc ccggagagat tcggttctgc 924001 ccccttcgtc tagacggcct aggacgccgc cctttcaagg cggtaacgcg ggttcgaatc 924061 ccgtaggggg tacctgcgac gcggtatcgc ggagcacaca acacagcaag gccctgtggc 924121 gcagttggtt agcgcgccgc cctgtcacgg cggaggtcgc gggttcgagt cccgtcaggg 924181 tcgccaggac ggtgaggcac atgctgcctt ccggccaggt agctcagtcg gtatgagcgt 924241 ccgcctgaaa agcggaaggt cggcggttcg atcccgcccc tggccaccat ggtctacctg 924301 gataggcact gtggcggcac tgctacgtag ccgacctccc tgggtctggg tgattggtcc 924361 cgggctgcga tggtcgtgag cacacgcccg gatcaccgat gccgtcccgc cccggtaggc 924421 catcgcggcg atgatcgaga ttgccggccg ggttgatcgc tgcggattcc acccgggtcg 924481 aacggcgggt ccatctgctc ctcgatcgct cgtgaaagac ctgattgttc agccatttcc 924541 agcatcacag gcgccaaacc cattggccga catcaaattc cgctcgtcaa ccaccgccgg 924601 ctcggtggtg aacgcatgca gtgaatgggt caaaagtgtg gtcttggact gtagagaaat 924661 gcgacgtgag cgctggtgtt gtcccaggcc agaaggccca gaagacttgt cgcggttcgc 924721 acgccgatcg agtcaccgga ccatccatgg gcgatgcgcc ggaaaaccag acgcgcgcaa 924781 gcctcgaagg ccttggcgtg gcgaagggcc gccggctagg gcaaccctcg tattcccgga 924841 tgttggcggc ccgacgggat tacactgctt cctgctgatt cctccctgcg atcggtcgat 924901 cgcaggatcg gttggcatcg aggtcatgtc gctgtgggag gagatgtcgc gtgtcttatg 924961 tgagcgtgtt gcccgctacg ctggccacag cggcaacaga ggtggcccgc atcggctcgg 925021 cgctcagttt ggctagcgcg gtcgcggcgg cccagaccag cgcggtgcag gccgcggccg 925081 cggatgaggt gtcggcggcg atcgctgcgc tgttttccgc ccacgggcgg gattttcagg 925141 cgctcagcgc gcgggcggca gcgtttcatc acgagtttgt gcaggccctg gccgcgggtg 925201 cggggtccta tgcggtcgcc gagattgccg ccgcatcgcc gttgcagagc ctgatcgacg 925261 tgttcaacgc gcccatccag gccgccaccg ggcgcccgct gatcggcaac ggcgccaacg 925321 gccagccggg caccggggcc ccggggggcc cggcgggtgg ttgatcggca acggcggggc 925381 cggcgggtcc ggggcgcccg gcgccatcgg tggggccggc gggcccgcgg ggttgatcgg 925441 tgtcggaggt gccggcgggg ccggtggaga ctccgcggtc gcgggtgtca tcggaggggc 925501 cggtggggca ggcggggctg ccctgctgtt cggtgccggt ggggccggcg gggccggggg 925561 ttccggcggt tccggcgcag ctggtggggc cggtggcgcc ggtggggccg gcgggctgtt 925621 cgccagcggc ggcagcggcg ggttcggcgg gttcgcatcg acgggcaccg gtggggccgg 925681 cggcaccggt ggggctggtg ggttgttcgc cagcggcggg gtcggcggta ctggcggggg 925741 agccgggtcc ggcggtaccg gtggggttgg tgggacgggt ggggccggag ggctgttcgc 925801 tagcggcggc gctggcgggg ccggcgggtc cggcggtacc ggtggggctg gtgggacggg 925861 tggggccggc gggctgttcg gagccggtgg cgctggcggg ctcggcgggc aaggcaacca 925921 caccggcggg cacggtgggg ccggtggcag cgccggcctg ctcgcccttg gcgacggcgg 925981 cgctggcggg gccggcgggg ccgctaccac cggaaccggc ggggccggcg gggcgggtgg 926041 caaggccggc ctgctgttcg gctccggtgg ggccggtggg tccggtgggg ctgccggcac 926101 cttcggtgac accggtaact ccggcggggc cggtggggcg ggtggcaagg ccggcctgct 926161 gttcggctcc ggtggggccg gtgggtccgg cggcgctggg ggcttcgcca acggctctac 926221 cggcggtgcc ggcggggccg gcggcggggc cgggctgatc ggcaacggcg gcaacggtgg 926281 cagcggcggc acgtcggttg ccaccggggg ggccgggaac ggcggtgccg gcggcgccgg 926341 cggcggggcc gggctgatcg gcaacggcgg caacggcggc agtggcggaa tgggcgatgc 926401 cccgggcggc accggcgtcg gcggcatcgg tgggctgttg ttgggtttgg acggcgccaa 926461 cgccccggcc agcaccaacc cgctgcacac cgcgcagcag caggcgttgg ccgcagtcaa 926521 cgcgcccatc caggccgtga ccgggcgccc gctgatcggc aacggcgcca acggcgcccc 926581 gggcagcggg gcccccggcg ggcacggcgg gtggttgttc ggcggcggag ggaccggcgg 926641 gtccggcgtc agcggcgggg cgggcggaga tggcggggcc ggcgggatct tgttcggcgc 926701 cggcggggcc ggcggcgcgg gcggggccgt cacgggaacc ggcgccaccg gcgggtccgg 926761 tggggccggc ggtggagcct tgctgtttgg ggccggtggg gccggtggag ccggcgggtc 926821 cagcgggatt ggcgggttcg ccgcgggcgg ggccggtggg cccggagggg ccggtgggct 926881 gttcaacggc ggcggggccg gcggggccgg cgggtccggc gtcagcggcg gggctggcgg 926941 ggagggcggg gccggcgggg ccggtggcct gttcgccggt ggcggggccg gcggggccgg 927001 cggatcgggc aacaacgtcg ggggggccgg cggggccggt ggggtcggtg ggctgttcgg 927061 ggccggcggg gccggcggat ccggcggcgg cggtagcgtt gctggcgaca gtggggccgg 927121 cggcaacgcg ggcttgctcg cccccggtct cgccggcggt gccggcggtg gcggcgggca 927181 gggttttgac accggcgggg ccggcgggcc cggcggcgac gccggcctgc tggtcggctc 927241 cggcggggtc ggaggtgccg gcggattcgg cctcactacg ggtgggcctg gggcggccgg 927301 cggcgacgcc ggcctgctgt tcggctccgg cggcgctggc ggggccggcg gctccggccg 927361 aaccgacctc ggcggcgctg gcggagccgg cggcaaggcc gggctgatcg gcaacggcgg 927421 taacggcggg gccggcgggg ccggcgggaa cggcggcggg gacggcgggc ccggtggagc 927481 cgccttcggg ctcggtaacg gcggcaacgg cggcaacggg gggaccggca cgtccgcggg 927541 cagccccggt gccggcggcg ccggtggttc gctgatcggc gcggaggggc tgcccgggct 927601 gctgccctag ccggcccggt tggaccacgt gatcgacgac cgtcacaagt cgacacgccg 927661 aacgtgcaac cacggcggca tcacctggcg tgtcgccgcc accagcgcac gctcggcacg 927721 gagtttagca actactcatc cagaagccgg ccactacggc ctggccacct ggtttacccg 927781 catggacgcg atgaccgcac cgacctgagt cggcattgct ggttgcgctc atccggttat 927841 ggcaagccgt tctgtcccgg cgcgccaaac accccggcct tgccaccggt accgccggct 927901 ccgccgttga cgccgttgcc gccgttgcca ccgtagccga gggtagacgg ggcgagcatg 927961 ccgttgacaa ctatcgtcgt gtcgccgccg ttgccgccgg tgttagcccc gaagccggtg 928021 ccggcgttgc cgccgttccc acccacgccg actagtccga gggcgtcgcc gccgttgccg 928081 ccagcgccac cgttaccggt gggggcggcg ccgccggcac cgccggcacc gcccacggcg 928141 atcccaaccg ctacggcctc gccgccgtcg ccgccgtcgc cgcccatgcc gcccagcacc 928201 cccagggcgc caccaccgtc gccgccggcg ccgccgatgc cgatgcccag gatcacgggt 928261 gagctcaacc cgccaccgcc accggccccc ccgttgccgc cggtcccggt ggcggtgccg 928321 ccagctccgc cggcgccgcc gtgcagcacg gagaacccta ggaagtttgc gatgccagcg 928381 ccggcgccgc cgaagccgcc ggctccgccg gtggcgccgt ccccggtggc ggcaccacca 928441 gccccggcgg cgccgccaaa gcctaggccg aggacagcaa tgccctcgaa gacgccgtca 928501 ccgccggctc cgccggtggc gccgctagtg ccggcgccgc cctgcgcgcc cgcaccgccg 928561 atggcgatgg cgatcccgaa ggggctgctg gcggtgcccg tgccacccgg accgccgggt 928621 ccgccgactc ccgtggaagc gtcgccgccg gcggctccag cgccgcccag ggcaaagatc 928681 aggccgcggg cgctgccgcc agcaccgccg aacccgccgg ttccgctggt ggcgtccccg 928741 ccggccgcgc cggccccgcc gacggccgcg agtgcgccgg tagcgctgcc gccgttgccg 928801 ccgttggcgc cgttaacccc gactccggtg ccggcgttgc cgccgttgcc acctgcgccg 928861 acgaatccga agccgtcacc gccggcaccg ccgctgccgc cggtaccaac cgaagccccg 928921 ccgccgtgcc caccggcgcc gcccacgccg cccagcagcc cggtcccgct gcctccggcg 928981 ccgccgttgc cgccggtgtc ggtggcggct ccgccaaccc ccccgacgcc gccgatgccg 929041 gcgccgatca atccgagggc atcgccgccg gtcccgccat ggccgccgct accagccgaa 929101 gcggcgccgc cgggaccgcc ggcgccgccg gcgccgccca gcagcccgac gcccaatccg 929161 ccggcgccgc cgatgccgcc ggtctcggtg gcggccccgc cagccccgcc ggcgccgccg 929221 acgcccacgc ccagagccgc gaagccgccg gcaccaacgc caccggtccc gccggtgccg 929281 ccggcaccgg tcgcagcccc accaagcccg ccggccccgc cgtaggccgc gccgaacccg 929341 atgaagtcgg gggcaacagc gaagccgcca gtgccgccgg ccccgccggt cccagtggta 929401 gctgcgccac cattgccgcc agcaccgccc cagctcaagt cgagcgcgaa aacggtgccc 929461 gaggaaccgc cggcaccgcc ggcgccgccg gcaccgccgt tagtacctgc gccgccgtgc 929521 ccgccggcac cgccgatgcc gatgtcgatc ccgaaggggc tggcggcgcc accagagcca 929581 ccggcaccgc cggcaccgcc gcttcccatg gccgagtcgc cgccctgacc gccggacccg 929641 cccaggccaa ggaacagccc caatgcgttg ctgccggcgc cgccggcacc gccggttcca 929701 gtggtagcgg ccccgccggc gccaccggcg ccaccgatgg ctaccagcgc gccgccggct 929761 ccaccggcgc cgccgacccc gccgttcccg actccgctgg cggccccgcc agctccgccg 929821 ttgccgccaa tgccgaacat cagcgcgttg ccacccgccc caccggaccc gccgccggac 929881 ccgccggccc cgccagctcc gccgctaccc cacagccacc cgccggtgcc gcctttgcca 929941 cccgaggcgc cggtgcctcc ggccccaccg gtcccgccat ggcccagcag cccggcggca 930001 cccccggcac ccccggtctg ccccggagca cccgaaccgc cgttgccgcc gttgcccaac 930061 aaccagcctc caggcccacc ggcctccccg gtccccgggg cgccgttggt gccgttgccg 930121 ataaaaggtc ggcccgacaa cgcggcggcg ggtgcattga tggcgcctag caagccctgc 930181 tcgagggtct gcaacggcga cgcgttggcc gcttcggcgg ccgcataggc gcccatagcc 930241 ccggccagtg cctgcacaaa ccgggcatga aacgccgccg cctgcgcgct catcgcctga 930301 tactgctgac cgtggctgga aaacaacgcc gcgatggccg ccgacacctc atcgccagca 930361 gcggccaaca gcccgcttgt cgggacggcg gccgccgcat tggcagcggt cagggacgcg 930421 ccgatgcctg caagatcctc cgtggccatc gccaccaagt ccggcgctgc aatcacgaaa 930481 gacatccgac acctcccagc tggccggtgt gatctgactg tcgcccatcg ttacgatacg 930541 cgcatatagc gcctaccggg agacgaagtt gacactcgtc aacatccgat ggccgccgga 930601 gatccggcac ggctcggcgg tcgtttgggc gggcgttggc cccgcacgtt cgacagattc 930661 gacaagttcg tgcgcctcgc gcaacgagac aaccggcgac gccgcctaag gtcaagggcg 930721 gcgtgcgtta gcacttccgt cactcttgtc aattagccgc agcaaacgcc agtcgcccgt 930781 acgatggcgg caacggcgtc ggcggagcgg tttcccgctt ggccaacgcc gaagtcccag 930841 catgaccgat cgcggacgcc agtccgcaga agccggctta tcgacaatga ggccaaagag 930901 ctcaacccgt cagcggacat gtggcgcgcg ctggccagtg tggcgatcag tcgtgtgttg 930961 ctccactgct gccaagtcgg ccgtcatcgt ctgctgtgcg gccatcgcga ccacggcatg 931021 ctcgtttcaa gccacatcga cccagccgag caccgcaccc ccgacatcgc gggtcgattc 931081 gttgatcgtc agcatcgaag acgtacggcg catcgccaac tatgaggagc tcgccgcaca 931141 ttttcagacc gacttgcgtg aaccgccgga ggcggacacg aacgttccgg gcccctgtcg 931201 tgtggtgggc agcagtgatc gcaccttcgg aaccgactgg tcagagttcc gtagcgcggg 931261 ttaccacggc gttaccgacg acctcagacc gggcgggccg gtcatggtcg agacggttag 931321 ccaggcgata gcgctgtacc cggacccgag tacggcgcgc ggtgtgttcc atcggctcga 931381 gtcgtcgctg gcagaatgtg ctggcttgca tgacccctac ttcgatttca tcctcgacag 931441 gccggacgcc tccaccgtga ggatcggcgc tgcgggttgg agtcatgtgt atcgcctgaa 931501 atcgtcggta ttcatatccg ttggcgtgtt gggtattgaa ccggcagagc cgatcgccaa 931561 cgtcatcttg cagacgatca gcgatcgcat ccagtagtta gccgaggact ggaaagcagc 931621 agcggcggcg acgagcgcag cgtgttgagg gctgttgacg ccacgacgcc caccgttgcg 931681 aagaagaagg cgagaagcgt cgcttcggca ccgactgctg tcaccgcaac cgagctgtaa 931741 ctacggggat ctattggatg cgaggcgtaa tcaagcagcg tggcgatggg tctggtgtcc 931801 accgcaaagg agaagacatg ccatatgggg gaaagcttga cccacgagag cgctgtcccg 931861 aacaactcgg tgcctgtcag aaggatcagc gcactggccg ctatccaggc cgtcaggcgt 931921 gccggggata tcacgacgcc gaatgtcttt cgttggttat cccagactgt cgagcgacgt 931981 tgtttttgca ctgaacgtcg aatcttctga gactgccgcc gctttcgccg gcgccaagtc 932041 tcgggcttac ttaaccaggc gagccgccac cgtacgacag tcgcagtcgc taagacttgc 932101 tgctgcatcc aactcgtggc ggccttcatt gatcccgact accaccctgc ctaaccaatt 932161 ctgtatgacg cgccgtttga gaacgtacat ttgtgattgc ggttcgcatt taggagcccg 932221 gcgtgagctg gtcgagtaac gcctcgacca gcgggcgccg cgaagctgtg gtggtgggct 932281 agcccggtcg acccacggcg aagtgctggg ccagcaggtc gtggtcggcc tgtgtggcgc 932341 gcgtcgccag cacggcggcc tccggtgcgc tgacactggc gcgcatgtcg tggccgagca 932401 gcgcggcagc ggcggtgcgt aggtcgaagt cgtgacggcg tagcgcccac tggtctggtt 932461 tggcgtaaag ccggtcgagg tcgccggcgt accagtgcac caccaaggcc agatctgggc 932521 cgtctttgta gtcgtggtcc gcggaccgat cgagccatgc gtgcagtttg aggaccgcat 932581 agttcggcgg ttggggaagg tggactgtca ggccgccagg gagaggcaga acatcggcac 932641 gcaggtaggc gtcggtgcat ccgtggacgt tcatgagctg gttgcctggg ggatggcggg 932701 ttgtgccggt gggcgactcc acctcgccga acgggagggc atcgacggcg cggtcggcga 932761 tcaggaatcg gtgcccggtg ctgcccaggg cgcggaaggt ggcccgaatt gcctcgaagt 932821 ggtcccaatt gttcagggtc cctgcgatat cggtgtcgtt ggtggcccgc ggcggcaccc 932881 cgcggcagaa gcgccagtgc agtagatcgc ggcactgtgc cccgacgagc atcagctgtt 932941 cagccggcac gacgtcggca agtgctgtga cgatcggtgt cacccaggcc aggaggaccg 933001 ggtcataatc gggcgagtcg ctcatcctgc cttctcatga ggtgggcgac ttcgacctgg 933061 cgcggctcgc gcgaggcaag gaggtcggca tagatcaagg ccgtgggagc caaccccggt 933121 tgctcgtcag gtaggttgcg ccagaatagc tttcggatca cgatgctgcc gtgtgggtcg 933181 cggtgccagc ggttgtgtat aagcaggtcg gcgggtagcc cgggcgctgg ggtgtcgacg 933241 tagagcatca gtgattcggg attgcggatt tcgtcgggca gggcctgttc cccgctgacc 933301 gccactgcga gtccgtcggg tgcggaccac gtgtggatat caccactggc gaccaggagt 933361 ttgttggccc ggcccagacc ccccggatag gcagccgccc acaggtccag cagctcatcg 933421 gtgcgcacca gcctgcggcg ggagccgagg tgttcgaaga agccggtagt gcgcaacgta 933481 tccatcgtct ccttggccat accgaccgag acgccggcgc tcgcggcgat cgcacgcagc 933541 ggcgcgtcga ccagttgcgg tgcgtcaagc agtacgcaga caacctgcgc gcgcttgggg 933601 gtaaacgggt tacgcggtcc atcgctgtgc agtccgtcac cgagggtgcc cggttgtgcg 933661 gacacagctg accgtcggcc gcgcacgtcg atgagcaggc caccctggtg ccgcaaataa 933721 gcgttcccag ctccgtcgat gtaccagagt ccgcgagccc gcagcgtttc agcgctcgac 933781 ggatgcagac gcgggcccac cacaagcagc ggcgaaccag cgccggcggt atcccaggcc 933841 tgcagtgctg ccgttgccga caggtgagga aggtagaggg cagtgatcgt gagggggtga 933901 gcgtcgatct caaggtctag tgattcggga tgcgcggagt tcaatgctga taggccaccg 933961 agcacccgca ctccgtattc ggtgaggtga cgctcgacgg cctcagcgag gtcagccccg 934021 atctgatcca tgcgttcagt atatccgtac gttcagtttt attgaacata atgatttatt 934081 gaacatatca ggtcggagct ggtcgacttg gaaggtgtag cggtatccga gtcgcactca 934141 ctgcctcctg ccatgactca ccccaagggt gcaggttgtg cggcagtctg atgagttgcc 934201 gcagcatcgt tgccgcggcc tcctcgttgc ctgtctgaaa cctcgtctgc agtcgagggg 934261 tggtcagcac gcgccgggcc agacggactg gtctactgcg ccaaagcttg tcgctgcgct 934321 tggaggtcag gccgagcagg cgcgaggaac gacgaaccca acaagccatg gtggttggcg 934381 ccgtcgagag gtcggcggtc gccacaacgg gaagatcgcc ttgagcgtcg ctcgaccgcc 934441 gcctcgagtt gggtcataac gaagtagctg atgccgatca tgtcgacgtt tccgtcgcat 934501 cagcgtgcag cggcgaccca ctcgacgagg tctcggtgcc gccgcggcca gggcaccagc 934561 agtgacgagt ccaggcgccg tcgggccaag cagtcgcggt gccagccgtg gtgggtcggg 934621 cgatggttgg gtgtgctcat ttcgggaacg ccagggcgat cagcgtcggc aaactcgcgt 934681 cgatgtgccc gcggcgcaac aatccgcgac aatgatcggg tgcgtctgat cgggcggctc 934741 cgtctgctca tggtggggct ggtcgtcatc tgcggggctt gcgcatgtga ccgcgtgtcg 934801 gccggccgtt ggtccgagtc gccgagtgcg acctcgtggc ccgtccggcc ggtaaacacc 934861 acaacgccat ccggtcctgt gccgccagtc agcgaggcgg cgcgggcagc cgggttggtc 934921 gatgttcgcg gtgttgttcc cgatgccgcc atcgacctgc gctacgcgac ggcgaacaat 934981 ttcaccggca cacagctgta cccgcccggg gcaagatgcc tggtgcacga gtccatggcc 935041 gagggtctcg cggccgccgc ggcggtgctg cgcccacacg ggcaggtgct ggtcttctgg 935101 gactgctatc ggccccacga cgttcaggtc aggatgttcg atgtggtccc caacccggcc 935161 tgggtggcgc ggccgggcaa gtacgcgcat agccatgagg cggggcgttc ggtcgatgtg 935221 acgtttgcca gcgctcagcg gcagtgccca tcagtgcggc gatccggcga attgtgcctg 935281 gccgacatgg gcaccgactt cgacgacttt tcttcgcggg cgacagcgtt tgcaacgcag 935341 ggcgtcagtg ctgaggccca ggccaaccgt gcccacctgc gagccgccat gcaggccggg 935401 gggttgacgg tgtactccgg tgagtggtgg catttcgacg gccccggcgc cggcgtcgat 935461 cgcccgattc tcgaagtgcc agttgactga cgtctcatat agtgaaataa atgtccacta 935521 tttgggcgca gtggcggtag gctttgagcc gaacacctcg accatgggac cgcacggtga 935581 acgacaaacg tcgggcgatt tatacgcacg gatatcacga gtcggtgctg cgcagtcacc 935641 ggcgacgcac tgcggaaaac tccgccggct acctgctgcc ctacttggtg ccggggttgt 935701 cggtgctcga cgtcggttgc ggccccggga cgatcaccgt cgacctcgcc gctcgggtcg 935761 tgccgggatc cgtgaccggc gtcgagccaa ccgatgacgc cttaagcctg gcccgcgccg 935821 aggcccagct gcaccgcctg tcaaacattt cgttcaccac ttccgacgtg cataagctcg 935881 acttccctga cgacgcgttc gatgtcgtcc acgcacacca ggtgctgcag cacgtcgccg 935941 atccggtacg ggcactacag gagatgaggc gggtgtgtac accaggcggc atcgtcgcag 936001 ctcgcgatgc cgactattcg gggttcatct ggttcccgaa gcttccggcg ctggaccggt 936061 ggttggacct ttatgaacgg gcggctcgag ccaacggcgg cgaaccggat gccggccggc 936121 ggctgctgtc ctgggcccgt gcggcaggat tcgacgacgt cacgccgacg gccagtgtct 936181 ggtgtttcgc gacggcctcg gcccgcgaat ggtggggcct agtgtgggcc gaccggattc 936241 tgcaatccga tctggctcac cagctggtgg attcgggtct ggccactgcc gcgcaactcg 936301 aggagatctc cacggcgtgg cgagagtggg ccgcggcccc ggacggttgg ctggcgatac 936361 cccacggtga aatcctttgc cgggcataaa ctcaggcaca cgcgcgaggc tcgcgcggtt 936421 ggttgccgac gacgggcagg acgtggcccg gcgagatcaa atatcgtgca gccgaaggaa 936481 ttcacgcatc acccggtcga atcgcgccgg ctcttcgatg aacggcatgt gggaactgga 936541 ctcgaagaat tccaatcgcg agcccgcaat ccggccctgc atttctcgca tgtgctcagg 936601 cgaacattcg tcgaaacggc ccaccaccag caaggtcggc accgcgatgt cggccaaccg 936661 gtcgacgacg tcccagtctc gaacattccc aacgatgcga aagtcgctgg gcccaaacat 936721 cgtctcgaag atctcggttc ccatgttggc gaatgcttcc gtgagttccc ggggccaggg 936781 gcgggtgcgg cacagataag tctcgttcca ggttctgatc gcggcctggt attcggcgga 936841 atgggtggtg ccggccgcct cgtgacggtc aattgccgag cgagttgcca cgtccaagca 936901 cgacttcaag ctgaccagac tggccgaaaa ttcgggtatc gaagccgtgc tgttcgcgat 936961 ggtcagactg acggcgtcag gcgccttgtc gagcacgtac tgctgtgcca gcatcccacc 937021 ccacgaatgg ctgaagatgt gaaagcgggt aagggcaagg gcttccgcca cggttgccat 937081 ctcggccact gagcggttca tcgtccaaag gtctacgtct gacggacatg cggaatttcc 937141 gcaaccgagc tggtcccaga agatgacctc ccgctcatca gacaaccgtc gcagtggggc 937201 caagtagttg tgcggcaagc ccggcccacc gtgcactaca agcagcggac gaccaggacc 937261 gccaccaatc cgctggaacc agacgcgtcc acccgggacc gcgattgtcc cctccacttg 937321 acctccgatt tcggttgacc aacagacgca gaatcgcaca ttcgcccctt cgggggagtg 937381 cgagtttgcg tcgcctcgcc gggcatgtcg gtcagcgatg gcgcggtcga gaccagacgg 937441 cccgaggcgg tttgggtgga tcgacagtat cggtcgcgca gttaccggcg gactcggctt 937501 ctgctggccg gccggtcggg tgtgcccgtg cataccgctc tcggcttcac cgtggctgtg 937561 gccgtgtgca caccgggtga gacgcccggt tcgtggttgc ggccagcatc gtgcaccaca 937621 gcgctgcgcc ggccaaccgc ggtcgctacc acggaatctg gtcgatgacc cctgtagttg 937681 cttcggtggt tgtgccaatc atggcttcct acggcccgat tcatggtgct catctcttgg 937741 ccgcggtggt cgtggggtcg gccggtgccg cgctgtgcct gccgttggcg cgggccctgc 937801 gccgaccgac ccccagtgca atgacgacgg attgacggtg cggagcccgg ggatgtgctg 937861 agggcaccaa tgtggtgaaa gttgcacgca agcagcacaa tcggagccca gaatgggcac 937921 tgggcgcaga acccgagccg cagaagtaat gtgctggagg ggttactgca gcaaccacac 937981 ccccgggtgt cctccgatcg ggggaagggg ctttcgtcat cgtttcaggc cgatcggagg 938041 acgccggcac aggtcaacga tcctaacttg agttagtgac cacagcggcg gccatcgccc 938101 gcgaggaccg gttgcgttac accggtccgg agcgctgctc gggggacgga caagttcgag 938161 cggccgggga tcgctattcg acggtgatct ggctgctggg cggcaacttg ctggtgcgct 938221 cggccggatt cggctatccg ttcctagcct accacgtggc tggacgagga catggtgcgg 938281 gagcggtcgg cgcggtcgtg gcggcctacg gcctgggttg ggcggtgggg cagctgctgt 938341 gtgggtggtt ggtggaccgt gtcggggcgc gggtgacgct ggtatccacc atgctggtgg 938401 ccgccgccgt gctggtgctg atggccgggc tacacaccgt gccgggattg ctggttgggg 938461 ccatgatcgc cggcctggtt tgcgatgccc cgcgtccggt gttgggtgcg gtgatcgcgg 938521 agttggttgc cgacccacag cggcgggcac aactcgacgg ctggcgatac ggttgggtgc 938581 tcaatatcgg tgctgcgatc accggcgggg tcggcggtgt ggtcgcgggc tggttggaca 938641 ccccggtgtt gtactggatc aatggcatcg ggtgtgcgat cttcgcgggg ttggcaggcc 938701 gctgtatacc tgccgatgtg tgccgtagga ccgagtccgg ccttcgagct tgcaccgcca 938761 tgtcgaaagt tggctatcgg caggcactct cggacaagcg cctggtcctg ttggccgtct 938821 cgggtctggc aacgctcacg acgctgatgg gtttcttcgc ggcggtaccg atgctgatga 938881 gcgcgagtgg actgggtgtc ggggcgtacg gctgggtgca gttgatcaac gccctagcgg 938941 ttgtcgcggt gaccccgctg ttgacgccgt ggctgagcaa gcagctcgca cttggtccac 939001 ggccagacat tctggccggc gcgggagtgt gggtgactct ttgtatggcg gctgccgggc 939061 tcgcccgcac cacggtcggt ttcagtgtgg ccgcggctgc ctgctcgccg ggcgagattg 939121 cctggttcgt ggttgccgcc ggcatcgtgc accggatcgc ccctcccgcg cacggtgggc 939181 gctaccacgg gatctggtcg atggccgtcg cggcgtcgtc ggtggccgcg cctatcctgg 939241 ctgctttcaa cctggctaat ggtgggcgcc tagtgctggc ggccaccacg gtgacggttg 939301 gtttcttcgg ggccgctttg tgcttgccgc tggctcgtgt tctggcagct gccagttgcg 939361 gtccgttgag cagcaaggag ccgtcgcgtg actcgtacca gtgaagggtt ggctgcgttc 939421 gtggtcgatc agctggagga gctgtatcgc cggatgtggg tgttgcgact gctcgatatg 939481 gcgttggagc agttgcgcat cgaaggcctg atcaacgggc cgctgcaggg tggcttcggc 939541 caggaagcag taagtgtcgg tgccgcggcg gcgctgggcg aaggcgatgt catcatcacc 939601 acccatcgtc cgcatgccca acacgttggt actgacgctc cgctgggccc ggtgatcgcc 939661 gacatgctgg gtgcgaccgc aggcgatcta gaaggcgctg acgaggatgc gcacattgcc 939721 gatcctcggg ccgggctacc ggctgcaata cgcgtggtca agcaatcgcc gctgttggct 939781 atcggacacg cctacgccct gtggctgcgc gacaccggac gggtcacact ctgcgtgacc 939841 caagactgtg atgttgatgc cgatgccttc aacgaggccg cggacctagc ggccgtgtgg 939901 caacttccgg tggtgattct cgtcgaaaac attcgtggtg ccctaagtgt gcacctggac 939961 aggtacacgc acgagcctcg ggtttatcgc cgggctgtgg cctacggaat gccgggggta 940021 tcggtggacg gcaacgacgt cgaagcggtc cgtgactgtg tggccaacgc ggtggttcgg 940081 gctcgcgctg gtggcggccc cacgctggtc caagccatca cctaccgcac caccgatttc 940141 tctggatctg accgcggcgg ctatcgcgac ctggccggat ccgagcagtt tctggatccg 940201 ctgatcttcg cgagaaggcg gctgattgct gctggcacga cccgcggtcg gctcgacgag 940261 caggagcggg cggcatgcca acaggtggcc gatgccgtgg cgttcgccaa ggccagggcg 940321 cggcccaacg gcggtgggcc aatcagccga ccaacatccg gctggcacca acaaccaaag 940381 acccggttct gaggcctaga tgtacgttgg ccgcggacaa cgcggtcggt acatgccgtc 940441 gcgccgcggc cccagctagt cgagcagcct ctgccgcatc gcctcggcga ccgcggcagc 940501 tcggtcgctg acgccgagct tctcgtacaa ccgttgcacg tgggtcttta ccgtcgacgg 940561 cgccacatat agctcggctg cgatcgcggg gatgctttga ccgcacgcaa tgcgattgag 940621 cacctcgcgc tcgcgcgcgc tgagcaccgg ggccacgggt gccgcgcgct ggcgaatctc 940681 cccggcgagg cccccgacca gcgagggcgc caccacgtcg cggcccttcg cgcaatcgag 940741 caccgccttg acgatctcgg tgcgagtcga atccttgagc aggaatccgg cggcgccctg 940801 ttggagtgcc tggtagacga tcgccggctc gtcgtgcgcg gaaataagca gcacccgggt 940861 tggcaactcg tagctgcgca ccgccgccgc aacctgcgcg ccgtccatgc cgggcatgcg 940921 gtagtccagc aatgcgacgt cgggcaaatg ggccttgatc aactccaggg ccgcggcgcc 940981 gtcgtcggcc tcgccgacca cgttcaccga gccactcaac gaaagcgctc gcacaacgcc 941041 ctcgcgaaat aacgggtggt cgtcgccgac caccacgcgc actttctccg gctgcggatt 941101 gctcatggcg cgccgaccat ggcgatgagt ttagctgctc gtcggcaacc agccgctggc 941161 agtcgctgga cattgatttg cactccgacg tgcccagcta cggcaacctc ggacgtttgg 941221 gcggtcgcca tgagtacggt gtcctagtgg caatgaccag ctcggcggaa ctggaccggg 941281 ttcgttgggc gcaccagttg cgctcctacc gaattgcttc ggtattgcgg atcggtgtcg 941341 tggggctcat ggtcgccgcg atggtcgttg gaaccagccg gtccgaatgg ccacagcaaa 941401 tcgtgttgat cggcgtctac gcggtcgctg cattgtgggc tctgctgtta gcgtattcgg 941461 cgtcccggcg attcttcgct ttgcgacgct ttcgcagtat gggccggttg gagccatttg 941521 ctttcaccgc cgtcgacgtt ttgatattga cgggctttca gctgctgtcc accgacggga 941581 tctatccgct gctgatcatg atcctgctgc cggtcctggt gggccttgac gtgtcgacgc 941641 gacgggcggc ggtggtgctg gcctgtacgc tagtcggatt cgcagtcgcg gtgctgggag 941701 accccgtgat gctgcgcgcg attggatggc ccgagacaat atttcggttc gcgctctatg 941761 cgttcctgtg cgccacggcc ttgatggtgg ttcgcatcga ggagcggcat acccgttcgg 941821 ttgccggcct gagtgcgttg cgggcggaac tgcttgccca gacgatgacg gcctcggagg 941881 tgctgcagcg gcggattgcg gaagccattc acgatggacc gctgcaagac gtgctggccg 941941 cgcgtcagga gctcatcgag ttggatgccg taacccccgg cgacgagcgc gtcggacgcg 942001 cgttggccgg actgcagagc gcgtcggagc ggctgcggca ggccaccttc gagctgcatc 942061 cggcagtgct tgagcaagtt gggttggggc cggcggtaaa acagttggcg gcctctaccg 942121 ctcagcgttc gggtatcaag atctccaccg atattgatta cccaatacgt agtgggatcg 942181 accccatcgt tttcggtgtg gttcgcgaac tgctgtccaa cgtcgtgcgg cattccggag 942241 ctaccaccgc ctcggtcagg ctcggaatca ccgacgaaaa atgcgttttg gatgtggccg 942301 acgatggcgt gggggtcacc ggtgacacta tggcgcgccg cctgggtgag ggacacatcg 942361 gtctggcttc gcatcgggct cgggtggatg ccgccggcgg agttttggtt ttcctggcca 942421 cccccagggg gacccatgtc tgcgtggaac taccactgaa acggtgaatg gccgttgttg 942481 ccggtcaacc gatgtgccgg tggcagcgac gtgacccccg cgcaggtcga aagccttgct 942541 ggatcgatgg ttccgccggt gcccgccatg ggcccggccg gtcacgccgg ccagtccgca 942601 accggctgtc cagggccatc tcacgggcaa cgtcctggga ggcgctggca gcggcccggt 942661 tcagcccaca agccgcctgt cacagaatgt agtccaggcg ggtcgccatt ccggcgacct 942721 ggtgatagtt gttgtggcag tgcatcaccc acacgccagg attgtcggcg accaggacgg 942781 cgcgcatctt ctgcttgggc agcactatca cggtgtcctt gcgggcgccg gggctgccgt 942841 cggccttgat catctgaaag gtatggccgt gtaggtggat tgggtgatac atcatggtgg 942901 tgttatcgaa catcagggtt ggccgttggc ctagccgcac gtgcagtgga ttggtcgtgc 942961 tgtagggttc cccgttgatt gtccagtcgt acttggccat ggtgccgccc aaggtgaccg 943021 ggaggtcgtg ggtgggttcg ggccggccca ggttggcagt cgttgcggcg gtgaacattt 943081 ccacggtacc cactcgccag ttgagttcat ccggccgaaa ctgcgggtcg ggtgggctgc 943141 cggcgccggt agacagcagc gcacgcgcca gcgcgttctt gccttccgcg agtgcgacca 943201 ggggaaagac gccgccagcg gcggtcacca tgacgtcgta gcgttcggcc atgccgatca 943261 gcagagcgtc gacttcggtg ggaatcactg ggtaaccgtc ggtgtgggtg accgtcatcg 943321 aatgcccggc cagcgcgatg cggaacgcgg tgtcggcggc gctgttgatg atgcggatcc 943381 ggattcgctg gccaggcttg gccttaaaag acgtggccgc cacggggatt cgcccgttga 943441 tcagatagta cgggtaggcg atgtcccctc cgtcgccgcc gagcaggttg ctgtcaacgc 943501 cttcgccttc gggcatacct gttgtgtttt gcatggtggg tttgttcggg tcggtcagct 943561 cgccgtagag ctgttgcggg gacttcccga tgccgtccgt ccaatcgtcg aggatgatga 943621 tccattcggc gtcgtagtgg cctggctcag tcggatcgtc gacgacgaca ggcagatata 943681 ggccgtggtc gccttgaaga ccgacgtgcg gatgggccca gtaggtgccc ggatccggca 943741 cggagaaccg gtacgtaaag tcaccgccgg ggccgatgtt cgcagtcgcg ggctcggtgc 943801 catccatatc gttgcgcagc gcgatgccgt gccaatgcac cgacgtcgga tcacccagac 943861 ggttggtcac cgagacgaca atctcatccc cgacggtggc ccggatcagt ggtccgggga 943921 tggtgttgcc gtaggtcagc gtgctgacga tcggcccacc caggtcgatc ctcgccggct 943981 ggggggtcag cgtggcggta accgttcgcc cactgtgcgg ccgggccgcc tcggccgcgt 944041 cgattgcagc ggtcatcccg gcggcgccgg atgccgtggg cttcgaggcg caagcggcta 944101 gcgcaaagcc gctggcgatg ccggcgccga ggaagccgcg ccggctgaac cgcctcttgt 944161 cgaaggcgtt accgctcgtg gccagctcgg gcatcgatcg ctcctcgtct ggatttggtc 944221 tcgctcttcg taccctgccc agacatcggg cagtacgcaa cggttgatga tcaccacgcc 944281 atcatccgcc cttacaccct acccctatag ggtatatagt gggccacgtg gaaagcgggc 944341 acgtggtgtg gatgcgatcg gcgattgtcg cggtcgcgct gggggtgacg gtagccgccg 944401 tcgccgctgc atgctggctc ccccagctcc accgtcatgt ggctcaccca aaccacccgt 944461 tgacgacgtc cgtaggtagc gaattcgtca tcaacaccga ccacgggcac ctggtggaca 944521 actcgatgcc accgtgcccg gaacggctcg cgacggcggt gctgccgcgc tccgccactc 944581 cggtgttact accagacgtc gtggcggctg cgcccggcat gacagccgcg cttaccgacc 944641 ccgtcgcgcc ggccgcgcgc ggtccgccgg cggcgcaggg atccgttcgc accggtcaag 944701 acctgttgac ccggttctgc ctggctcgtc gctgaggggt cagcgccagg cggtggtggc 944761 cattcgccat cgccggtgac cgctgacccc catccagtgc cgcgtgtgac ttccggcccc 944821 gatgcagaag cgacgatcac tatgaacaac aacctgccgc tggcaaatcc ggtaaaccca 944881 acaagcatca cctccaaccc gcagatactc ctggccaacc gggcgcaccg caccttggtg 944941 aggtcgcggc agacccgcga ccggtaccgc ctcctcccgg agggatatca agtcactcct 945001 ggccggaatc gccacccggg caccatggtt ggcaataccc cggtgctttg gatacctgag 945061 ctgtcgggga cctcagaccc tgaccgtgga ttttgggcca agctagaagg attcaatccc 945121 gggggtatga aagaccgccc cgcgctgtac atggtcgaat gcgcgcgcgc ccggggcgat 945181 atcgcgcccg gtgccgcgat agtcgaatca accggtggca ctctgggatt gggcctagcc 945241 ctcgctggta aggtgtaccg gcacccggtc accctggtca ccgacccggg gctggaaccc 945301 atcatcgcgc gcatgctgac cgcctacggc gccggcgtcg atatggtgac gcagccgcac 945361 ccggtcggcg gatggcaaca ggcgcgcaag gaccgggttg cgcagctgat ggccgaatac 945421 cccggcgcgt ggaatccgaa ccagtacggc aaccccgaca acgtcggcgc ctaccggtcg 945481 ttggcgctgg agctggtcgc tcagcttggc cggatcgatg tcctggtgtg ctcggtgggg 945541 acgggtggac attcagcagg tgtcgcccga gtgctacggg agttcaaccc ggacatgcgg 945601 ttgatcggcg tggacaccat cgggtccacg atctttgggc agcccgcgtc gaacaggctg 945661 atgcgcgggc tgggctcgag tatttatccg cgcaatgtcg attaccgtgc attcgacgaa 945721 gtgcactggg ttgctccccc cgaagccgtc tgggcgtgcc gctccctggc cgcaacccac 945781 tacgccagcg gcggctggag cgtcggggcg gtcgccctgg tagccggctg ggcagcacgc 945841 aacttgccgg cggacaccac gattgccgcg gtctttcccg acggcccaca acgctacttc 945901 gacaccatct acaacgacgc gtactgcaac gaacacgaac tgctaggcgg acaacctccc 945961 accgagcccg acgagattgc ctcgccgcta gacgccgtcg tcacccgatg gacacgcagc 946021 accacggtga tcgatccaac ccaggtggtg tcgtaatggg agcgcgcgct atattccgcg 946081 ggttcaaccg cccgagccgg gtgttgatga tcaaccagtt cggcatcaac atcggcttct 946141 acatgctgat gccgtacctg gccgactacc tagccgggcc actggggcta gccgcgtggg 946201 cggtgggtct ggtgatgggc gtgcgcaatt tctcccagca gggcatgttc ttcgtgggtg 946261 gcacgctggc cgatcggttc ggctacaagc cactgatcat cgccggatgt ctgatccgca 946321 ccggcgggtt tgccttgctg gtggtcgccc agtcgctgcc cagtgtgctg atcgccgcgg 946381 ctgccacggg ctttgccggc gcgctgttca atcccgcggt gcgcggctat ctcgcggccg 946441 aagccgggga acgcaagatc gaagcgttcg cgatgttcaa cgtcttctac cagtcgggga 946501 tcctgctcgg cccgctggtt ggattagtat tgctggcgct ggatttccgg atcacggtgc 946561 tggccgccgc cggtgtgttc ggcctactca ccgtcgcgca gctggtcgca ctgccccaac 946621 accgggccga ctcggagcgc gaaaaaacat cgatcctgca ggactggcgg gtcgtcgttc 946681 gcaaccgtcc gtttctgacg ttagccgccg ccatgaccgg atgctatgcg ctgtcgttcc 946741 agatctatct ggctctgccc atgcaggcgt cgatcctcat gccacgcaac caatatctct 946801 tgattgcggc gatgttcgcg gtatcgggtc tggtcgccgt cggcgggcag ctgcgcatca 946861 cccgctggtt cgccgtcaga tggggggccg agcgcagcct ggtagtcggc gcgacgattt 946921 tggcggcctc gttcatcccg gttgcagtca tcccaaacgg ccagcggttc ggcgtcgccg 946981 ttgcggtcat ggcattggtg ctgtcggcga gtctgctggc ggttgcctcg gcagcgttgt 947041 ttcctttcga aatgcgtgcc gtggtcgcac tgtcgggcga ccggctggtg gcgacccact 947101 acgggttcta cagcaccatc gtgggcgtcg gagtcctcgt cggaaatctg gcgatcggat 947161 cgctcatgag cgccgcgcgc cgcttaaata ccgatgaaat tgtttggggc ggattgattc 947221 tggtgggcat cgttgcggtg gccgggctcc gtcggttgga cacattcacc tcgggttccc 947281 agaacatgac cggtcggtgg gctgcacccc ggtgacccgc gatccacaca gcccggactg 947341 cgggcgcgag ggcagctacc gcgacaccat cacccgcccg ttgaccgacc taccggtggc 947401 cggctatccg ttggtgccgc gggtcgcgtc gccccgctac cggtgcacaa cgccgcagtg 947461 cgggcgtgcg gtattcaatc aggatctcgc taacgtcgac cagtacctcg ttgtcaatca 947521 actggcgcac caactcatcg acggttcttc cctcataccc gatgctgaca agagatggga 947581 tgcgcgacga catgccgaca tgacgcacca tctgacatcg agccttaagg aaaatcaaag 947641 ctaatgccgc cacccctcgg cggcctgttc gtcgaaggtg cggtcaatgc gctcgaacct 947701 gcggcggatc gaagcgcgcg aggccgcatg cggaaggacg tagaggcggt tggccagaat 947761 cgcatcggct gttagctggg cgatatcgtc gacgcccagg ttgtcgtcct gcagggggag 947821 tggaccgggc gatcccgtcg ttgaggactg cgcgcaagcc gcgcctcgga ttcgttcaga 947881 gttggcaacc agattggttt cgacgaccat cgggcagagc accgacaccc caatgccgtc 947941 ggcggtgacc tcgcgggcca gcgtctccgc cagaccgaca accccgtact tggcaacgcc 948001 gtatgcgccg agtccggcat tgggcaccag cccggcaaag gacgcggtga acaccacatg 948061 cccgcccgtg ccctgctcaa gcaacctcgg caggaacgct tcgaccgtat ggatcgagcc 948121 ccacaggtcg acgtcgatca cccaacgcca gtcgtcgtgc gtcatctcca cgatcggacc 948181 gccgacaacg atgccggcgt tgctgaatac gacatcgacg tggccgagca ggcggaaagc 948241 ctcgtccgcg aggtgagtga cctcttctcg atgccggacg tcgcacatca cgctgtgcac 948301 atcgaacccc tcggcacgca ggtggttcac cgcctgccga agtcccggct tgtcaacgtc 948361 ccctagcacg actctggctc cgcggcgggc gaactcggtg ccggtagcca acccgatgcc 948421 actggcaccg ccagtgatga ccgcaccgcg cccgggaaac ccgtccacag cacgcaaccc 948481 tatttcaggc agtcacccgc gtcgactgcg ccgggcgagc gtgattctgg cgacgccaca 948541 gcggcatgtt gcgtcgcggt gttcacaatc ggttacagct gcgctagtcg cggcgcagat 948601 tcatggttga tccgcaggtg cagtgtcgtg caaggttgtc tcgacgatcc aggtgccact 948661 gtggaggcaa tcgatgacga cggatggccg cacaccggcg atccttgcag cccgaattcg 948721 gcggcctccg gcaaatatgg tgaaagacca gcttcggtga gtaccggcga cattcattcg 948781 ttggtgatcg cttcggacta tcgggtccct gatcccggta gagtgtggcc gctgctgcag 948841 cgcaacaaat cggctctggc cgacatcggc gcacaccacg ttctgatcta cgcgtcaacg 948901 cacgactctg gccgtgtgct ggtaatgatc ggagtacgca gtcgtgagcc gatcgtggaa 948961 ttgctccgct cacgggtctt cttcgactgg ttcgacgcca tgggcgtcga cgatatcccg 949021 gcggtcttcg ccggcgagat cgtcgaccga tttgtcgcgg cgcctactac gactcagtcc 949081 actccacggg ttcctggcgt tgtggtggcc gcgttcgcgt cggtgaacaa cgtgtccaac 949141 ctgaccgccg aggtccgttc tgcgatagcc aggtttaccg ccgcggggat tcgaaagacc 949201 tgggttttcc aggctttcga cgatgcgcac gaggttttga tcctgcagga gtttgccgat 949261 gaggcgggcg cgcggcagtg gatcgagcat cccgacgccg ccgccgaatg gatgagcggg 949321 gcgggagtgg gagcctaccc accgctgttc gtcggccggt tcttcgacat gatgcggatc 949381 gaggcgctgc agtgagcgca tcgctgggca ctcggcccgg cccgggtcag cgacctcact 949441 gcggcgccat ggatcccacg agttggccaa gcaggcgggg gatctcgagc cgcggcaaca 949501 ccacctcgac gagcaccatc cggtcccgcc gtgctgcggc gacggtgagg gcgtcgtcga 949561 gttggccata ggtttgggca cggaacgcga ggtgattggt cacacccagc gcgctgggaa 949621 gctcggtcca attccagctc acgatgtcgt tgtacggggc cgtctcgccg tggatggccc 949681 gttcgaccgt gtaaccatcg ttgttgacca ccacgatgac cggggacagc ccttcgcggg 949741 agaacgtgcc gagttcctgc acggtcaatt gtgcggcccc gtcgccgatc aacagcaccg 949801 tacggcggtc cggatgcgca accgcggccc cgactgccgc gggcagcgtg taaccgattg 949861 agccccacaa gggttggccg ataaaggtca ctccttgcgg caaccggtgg tccgccatgc 949921 cgtagaacga cgtcccctgg tcggcgagca ccacgtttcc gggtgtgagc gctgagcaaa 949981 cccggtccca caccatctgc tgggtgagcg gctcatcgcg cgccggcatc gccggcggcg 950041 gttcggcggg cggcggtacc accggcggcg aactgattcc gcgcccggtc aggatggtgg 950101 ccagcgcctg cagcgcggca ctcatttcca gtggtgcgaa cacctggtcg gccacgctgc 950161 tctggtattg cccgatgtcg atggtccggg ccgggtcgat ccgctggctg aagaagccgc 950221 tgaccatgtc ggtgaacacc actccggcgg tcaccagcac cggcgcccct tcgatcgcgg 950281 cgcgcacccg ttcggcgctg gccgcgccgg cgtagattcc caggaagttc ggcgagctct 950341 cgtcgagcag gctcttcccc cacatcaacg tggcgtgcgg caccacgtcg gcggccaaca 950401 gcgcctcgag ttctttgacg gcctgcaggc gatgaaccaa cagatcggcg agcaccgtca 950461 actggtggtc ggcaatgagt tcgatggcgg ccttggtgaa cagcgacagc gcgcgcgggc 950521 tggtgccgcc ggggtagcgg ggcaacggcg cagcgggcgg ttcagtgggg aagcgtgcta 950581 cgtcgctgga cagcaatata tatcctggac gcttctgctc ccgtacctcg gacagcaccc 950641 gatctatttc tctaccggcc gttgccggca tgagattggc ttgggcacag gtgatttcac 950701 ggctgatccg gagaaagtgc tcgaagtcgc cgtcgccgag ggaatgatgc aatgcccggc 950761 gagtgccctg ggcgtctttg gtcgggccgc caacaatgtg caccactggc acatgctcgg 950821 cgtaactgcc cgcgatcgca ttggtcaccg agagctcgcc gaccccgaat gtcgttacca 950881 ccgctgacat cccacgcagc cgcccgtacc cgtcggcggc atacccggca ttcagttcgt 950941 tggcgctgcc cacccaccgg atggtcgggt gggccacgat gtggtcgagg aattgcaggt 951001 tgtagtcgcc gggaacgccg aagatctcag agacgccgag ttcggcgagc cggtcgagta 951061 ggtagtcgcc gacggtgtag acgggatcgc tgcaggcatc gctcttctgg ggtgtcacga 951121 agacgaccgt acgccggatt gcggctattc ccgactggac gccgattcgc tatcgtgcgg 951181 ccatggccat caaggagtcg cgcgacatag ttatcgaagc aagtcccgag gagatcctgg 951241 atgtcattgc cgacttcgaa gcgatgaccg aatggtcgcc agcccatcag agcgtcgaaa 951301 tactcgagac cggagacgac gggcggccca gcaaggtgaa gatgaaagtc aagaccgccg 951361 gcatcaccga cgagcaggtg gtggcctata gctggaccga cagatcagtg cggtggacgc 951421 tggtcagctc cacccagcag cgctcgcagg atggaaagta cgagttgaca cccaagggcg 951481 acaacaccct ggtccagttt gagatcaccg tcgacccgca ggtgccactg cccggcttcg 951541 tgctgaaacg tgcgatcaaa gggacgatcg acacggccac cgaggcgttg cgcagccagg 951601 tgttgaaagt gaagaagggt caatagtcgc ggtgacgacc ggggggcccc tggccggggt 951661 gaaggtcatc gaactcggtg gtatcggacc ggggccgcac gccgggatgg tgctcgccga 951721 cctgggtgct gacgtggtgc gggtgcgccg cccgggtggc ctgacgatgc cgtccgaaga 951781 ccgcgacctg ctgcaccgtg ggaagcggat cgtcgacctg gacgtcaaaa cgcaaccgca 951841 ggcgatgctg gagctggccg ccaaggccga tgtgctgctg gactgtttcc ggcccggcac 951901 ttgcgagcgc ctcggcatcg gacccgacga ctgtgcgtcg gtcaatccgc gactgatctt 951961 cgcccgcatt accggttggg gacaggatgg cccgttggcc tcgacggcgg gtcacgacat 952021 caactacctg tcgcagaccg gtgcgctggc ggcgtttggc tacgccgacc ggcctccgat 952081 gccgccgcta aacctggttg ccgacttcgg cggcggctcg atgctggtgc tgctgggcat 952141 tgtggtggcc ctctacgaac gggaacgttc gggtgtgggt caggtcgtcg atgctgcgat 952201 ggtcgacggg gttagcgtgt tggcgcagat gatgtggacc atgaagggga ttggcagcct 952261 gcgcgaccag cgcgaatctt tcctgctcga cggcggcgcc ccgttctacc gctgctacga 952321 aacgtccgac ggcaagtaca tggccgttgg ggcaatcgag ccgcagttct tcgcggcgtt 952381 gctgagcggg ctcggcttgt cggccgctga cgtgccgact cagctcgatg tggccggcta 952441 cccgcagatg tatgacatct tcgccgagcg atttgccagc cgaacccgcg acgagtggac 952501 gcgggttttc gccggcactg acgcatgtgt tacgccggtg ctggcgtgga gcgaagccgc 952561 caacaacgat catttgaagg cacgatcgac ggtgatcacc gcccatggtg tccagcaggc 952621 cgcgcccgct ccccgatttt cccggacacc ggccgggccg gtcaggccgc cgccggccgc 952681 agccacaccg atcgacgaaa tcaactggta accacggtgg ctgccgaaca ccgcccacca 952741 acggcgcggc gttgctagcg tgaacgtcag tggccgtaaa agcatcgcgg gaatttgtca 952801 tcgacgcgcc ttccagaagt ggtgatggag gcgctggcag atgtcggcgt cctggcttcg 952861 tggtcaccgc tgcacaaaca ggtggaagtg atcgactact acccggatgg ccggccgcac 952921 catgtgaggg caaccgtcaa gattctgggg ctcgtcgaca aagaggtcct cgaatatcac 952981 tggggcccgg actgggtgtg ctgggatgcc gatcagacct tccagcaaca tggacagcac 953041 atcgagtaca ccgtgaaacc tgagggtgtc gatagggccc gggtgcgctt cgacatcacc 953101 gtcgagccgg cgggaccgat ccccggcttc atcgtcaagc gggcaagtga gcatgtgttg 953161 gatgccgcgg cgaaagggct gcagaagttg atcgcgggtg ccggcgatca aggaaacgcg 953221 aaatcgtgac gatgtgacgg gtccgcgtag cggatcgtga ttgctaattt ggtagcagtg 953281 gctatccgag catcgcgcga agtcgtcatc gaagcgcctc cggaagtgat cgtggaggcg 953341 ctcgccgaca tggacgctgt gccgtcttgg tcttcagtgc acaaacgggt cgaagtcgtc 953401 gacacttact ccgacggtcg accacatcac gtgaaggtca ccatcaaggt ggcgggcatc 953461 gtcgacacgg agttactgga gtatcactgg ggacccgact gggtggtgtg ggatgccgcc 953521 aagaccgcgc agcaacacgg ccagcacggc gagtacaacc tgcgccgtga ggataacgac 953581 aagacccgag tgcgattcac cctcacggtc gaaccctcgg cgcccctgcc ggcgttttgg 953641 gtcaacattg cccgcaagaa gatcctccat gcggcgacgg aaggactgcg aaagcaggtg 953701 gtggggcgcc gacggttcac gtcgggctag gtagcgggtc gctcggcgag cacgctcagt 953761 cgcctgattg cctcgtcgag ggtgtcgtct cgtttgcaga aggtgaagcg caccaggtgg 953821 ttccacacat cggcttgttg tgaggcctgt cctgcggcgg ggtcgcagaa cgccgacatc 953881 gggatggcgg ccaccccgac tttctccggt agcgccgcac agaattcggt gctgtcgtca 953941 taacccaacg ggcgcgggtc ggcgcatagg aagtacgtgc cgtagctgtc gtgcactgcg 954001 aagccgatct ccgtcaggcc cgctgccagc cggtcgcgcc gggcccgcaa cgagttccga 954061 agggccgcca cccaggcgtc ttcggtgtct agcgcgaggg cgaccgcagg ctgaaacggt 954121 gcgccgccca catagctcag gtactgtttt gcggcgcgca ccccggcgat gagttcggct 954181 gggccgcaag cccatccgat tttccagccg gtgcagttga acatcttggc cgcactggaa 954241 atggtgatcg tgcgctcggc catgccgtcg aaacccgcca gcggcaggtg tctggcgtgg 954301 tcaaacacta ggtgctcgta cacctcgtcg gtgatcacca caaggttcgc cgccaccgcg 954361 atctcggcga tggctgcgag ttccgtcgcg ctcagcaccg caccggtcgg attgtgcggc 954421 gagttaatga tcagcgcccg agttcgcggg gtcaccgcgc gtcgcagcgc gtcggcgtct 954481 agggcgaagc cgcggccatc gggcaccagc ggtacggtca cgcggtgggc gccggccatc 954541 gccaccaccg gcgagtagga gtcgtagaac ggctcgatca gcaacacctc cgagcccggt 954601 tcgaccagtc cgagcaccgc tgcggcgatg gcctcggtgg ctccgaccgt gaccagcacc 954661 tcggtctcgg ggtcgtagtc gacgccgaaa tggcgccgcc gctgggcggc gatggcccgc 954721 cgtagcggag cgcttccagg gccgggcggg tactggttga cgccgccggc gatggcgtct 954781 tgggcggcct gcagcatctt cggcgggccg tcctcgtcgg gaaagccctg tcccaggttg 954841 accgcgccga tacgggtggc cagcgcggac atttcggcga acaccgtggt cgcatacggc 954901 cgcagccgcg acaccgtcat ggcggtcgag cctatccggg cgacgatgcg cgccgcagcg 954961 ataccttgcc caaccaacag gttggccggg ggccctgtta gggtgccggt acgggaccta 955021 gtcttgaaga aggatccaaa cccccttttg tggaatttgt ggaacaggaa atcgacatgt 955081 ccgaagaagc cttcatctac gaggccatcc gcaccccgcg cggcaaacaa aagaacggat 955141 cgttgcacga agtcaagcca ttgagcctgg tcgtcggcct gatcgacgag ctgcgcaagc 955201 gccatcccga cctcgacgag aacctgatca gcgacgtcat cttgggctgc gtctcaccgg 955261 tgggcgacca gggcggcgac atcgcccgcg ccgcagtgct ggcatcgggc atgccggtca 955321 cctccggcgg tgtgcagctc aaccggttct gcgcgtccgg cctggaggcc gtcaacaccg 955381 ccgcgcagaa ggtgcgttcg ggctgggatg acctggtgct ggccggcggc gtggagtcga 955441 tgagccgggt gccgatgggc tccgacggcg gcgctatggg cctggacccg gcgaccaact 955501 acgacgtcat gttcgtcccg cagagcatcg gcgccgacct gatcgccacc atcgagggct 955561 tctcccgcga agacgtcgac gcctacgcgc tacgcagcca gcaaaaggcc gccgaggcgt 955621 ggtcgggcgg ctacttcgcc aagtcggtgg tgccggtgcg cgaccagaac ggcctgctga 955681 tcctcgatca tgacgaacac atgcggccgg acaccaccaa ggagggtctg gccaagctga 955741 agccggcctt cgaaggcctg gccgcgctgg gcggtttcga cgacgtggcg ctgcagaagt 955801 accactgggt ggaaaagatc aaccacgtac acaccggcgg caacagctcg gggatcgtcg 955861 acggtgccgc gctggtgatg atcggttccg cggccgccgg caagttgcag ggcctgactc 955921 cgcgggcgcg catcgtcgcc accgccacca gcggcgccga cccggtgatc atgctcaccg 955981 gccccacccc ggccacccgc aaggtgctcg accgcgccgg gctgaccgtc gacgacatcg 956041 acctgttcga gctcaacgag gcgttcgcgt cggtggtgct gaagttccag aaggacctca 956101 acattcccga cgagaagctc aacgtcaacg gtggcgccat cgcgatgggc cacccgctgg 956161 gtgccaccgg cgcgatgatc ctgggcacca tggtcgacga actggagcgc cgcaacgccc 956221 gacgtgcact catcacgctg tgcatcgggg gcggcatggg tgtcgcgacg atcatcgaga 956281 gggtttaaca gcatgccaga caacacaatc cagtgggaca aggatgccga cggcatcgtc 956341 acgctgacca tggacgatcc ctccgggtca accaacgtga tgaacgaggc ctacatcgag 956401 tcgatgggca aggccgtcga tcgccttgtc gccgaaaagg attcgatcac cggagtggta 956461 gtcgccagcg cgaagaaaac cttcttcgcc ggcggcgacg tcaagacgat gatccaggcc 956521 aggcccgagg acgccggcga tgtattcaac accgtcgaga ccatcaagcg gcagctgcgc 956581 accttggaga cattgggtaa gccggtcgtc gcggccatca acggggcggc gttgggcggc 956641 ggcctggaga tcgcgctggc gtgtcatcac cggatcgccg ccgacgtcaa gggcagccag 956701 ctcggtctgc cggaggtgac gctgggtctg ctgccgggtg gcggtggggt gacccgcacg 956761 gtacggatgt tcggcatcca gaacgcgttc gtgagcgtgc tggcgcaagg tacccggttc 956821 aagccggcca aggccaagga gatcggtctg gtcgacgagc tggtggcaac ggtcgaggag 956881 ctggtgcccg ccgccaaggc ttggataaag gaggagctca aggccaaccc cgacggtgcc 956941 ggggtgcagc cgtgggacaa gaagggctac aagatgcccg gcggcacccc gtcgtcgccg 957001 ggtctggcgg cgattttgcc gtcgttcccg tcgaacctgc gcaagcagct caagggtgcc 957061 ccgatgccgg cgccgcgggc catcctggcc gccgcggtcg agggggcaca ggtcgatttc 957121 gacaccgcca gccgcatcga gagccgctac ttcgcgtcgt tggtcaccgg ccaggtcgcc 957181 aagaacatga tgcaggcgtt cttcttcgac ctgcaggcca tcaatgccgg cgggtctcgg 957241 cccgaaggca tcggcaagac cccgatcaag aggatcggtg tgctgggtgc gggcatgatg 957301 ggcgccggca tcgcctacgt ctctgccaag gccggctatg aggtggtact caaagatgtc 957361 agccttgagg ccgccgctaa aggcaagggc tactccgaaa agctggaggc caaggcgctg 957421 gagcggggcc gcaccacaca ggagcgcagc gacgccctgc tggcgcgcat caccccgacc 957481 gccgacgccg ccgatttcaa gggcgttgat ttcgtgatcg aggcggtttt tgaaaaccag 957541 gagctcaagc acaaggtgtt cggcgagatc gaagacatcg tcgagcccaa cgcgatcctg 957601 ggatccaaca cctcgacgct gccgatcacc ggtctggcga ccggcgtcaa gcggcaggaa 957661 gactttatcg ggatccactt cttctcgccg gtcgacaaga tgccgctggt ggagatcatc 957721 aagggcgaga agacttctga cgaggccctg gcccgggtgt tcgactacac cttggccatc 957781 ggcaagaccc cgatcgtggt caacgacagc cgcggctttt tcacctcgcg ggtcatcggc 957841 acgttcgtca acgaggcgct ggcgatgctc ggtgagggtg tcgagccggc ttctatcgag 957901 caggcggggt cgcaggccgg gtatccggcg ccgccgctgc agctgtccga cgagctcaac 957961 ttggagctga tgcacaagat cgccgtcgcc acccgtaagg gtgttgagga cgccggcggc 958021 acgtaccagc cgcatccggc ggaggccgtg gtggagaaga tgatcgagct cggccggtcc 958081 ggccggctga agggcgcggg cttctacgag tacgccgacg gcaagcgatc cgggttgtgg 958141 cccggcttgc gcgagacgtt caagtcgggc tcgtcgcagc cgccgctgca ggacatgatc 958201 gaccgcatgc tgttcgccga ggcgctggaa acccagaagt gcctcgacga gggggtgctg 958261 acgtcgacgg ccgacgccaa catcggctcg atcatgggca tcggcttccc gccgtggaca 958321 ggtggcagtg cccagttcat cgtcggctac tccggcccgg ccggtaccgg taaggcggct 958381 ttcgtggccc gggcccgcga gctggcggcc gcctacggcg accgcttcct gccgccggag 958441 tcgctgctaa gctgagcgcg agcagacgta aaagcccccg cacgctcggc gtgtcggggg 958501 cttttacgtc tgctcgcgca acctaaattg ccgggcccag caggtcgtcg gcgtcgcgga 958561 tgatgtaacc gtagccctgc tcagctaaaa accgctgccg gtgtgcggcg tactcggcat 958621 ccaggctgtc gcgggccacc accgagtaga agatggcacc gcccccgtcg gccttgggtc 958681 gcaatatccg gccgagccgt tgcgcctctt cctggcgtga gccgaatgtt cccgaaacct 958741 gtaccgccac ggcggcttcc ggcaagtcga tggagaagtt agccaccttg gacaccacga 958801 gcgtagcgac ctcgccgcgg cggaaggcgt cgaacagtgc ctcgcgttcg ctggtccttg 958861 tcgacccctg aatcaccgga gcgccgagct cggcgcccag ctcgtcgagc tgatccaagt 958921 acgctccgat gaccagggtc tgctcatccg ggtgcttcgc cagaatcgac ttgaccacag 958981 caattttggt gtgcaccgtc gagcagatcc ggtagcgttc ttcgggttcg gcggtggcgt 959041 acatcatccg ctcgctgtcg gtcatcgtga cccggacttc cacgcactca gctggcgcga 959101 tccagccctg cgcctcaatg tccttccacg gcgcgtcata gcgctttggt ccgataaggg 959161 aaaacacgtc gccctcgcgt ccgtcttcac ggatcaacgt ggcggtcagc cccagccgcc 959221 gtttggactg caggtcagcg gtcatccgga agaccggtgc cggcaacagg tgcacctcgt 959281 catagatgat gagcccccag tcgcggctgt cgaacagttc cagatggcgg tactcgccct 959341 tagtgcggcg ggtgatcatc tggtatgtcg agatggtgac aggtcggatt tccttgcgtt 959401 ctcccgagaa ttcgccgatc tcattctcgg tgagcgaggt gcgcgcgacc agctctcgtt 959461 tccattgccg ggccgcgacg atattggtga ccaggatcaa cgtcgtcgcg ccggctttgg 959521 ccattgcggc cgcaccgacc agcgtcttgc cggccccaca tggcagcacc accaccccgg 959581 agccgcccgc ccagaacgag tccgcggcca gccgctggta atcgcgcagc tgccagccct 959641 cctggtgcag gctgatcggg tgcgcttcac catcgacgta gccggcgaga tcctctgcgg 959701 gccaaccgat cttgagcagc agctgcttga cccggccgcg ttcgctgggg tggacgacga 959761 cggtgtcgtc atcgatgcgg gcgccaagca tcggcgcgat cttcttgttg cgcagcactt 959821 cctcaagcac cgcgcggtcc aggctcacca gcgtcaggcc atgggccggg ttcttgacca 959881 actgcagtcg tccgtagcgg gccatggtgt cgacgatgtc gacgagcaag ggttgcggca 959941 ccgcgtagcg ggagtaactg accagcgcgt cgacgacttg ctcggcatca tggccggcgg 960001 cgcgagcatt ccacagtgcc agcggtgtga tgcggtaggt gtggacatgt tcgggtgcac 960061 gttccagctc ggcgaacggc gcgatggcgg cgcgtgcagc gccggccagt tcatggtcga 960121 cttccaacag caccgtctta tcggactgca ctatcaatgg tccgtcagtc aatggcgccg 960181 ctcctcctca tcgctgcgct ctgcatcgtc gccggcggta gtcaatggcg ccgctcctcc 960241 tcatcgctgc gctctgcatc gtcgccggcg gtagtcaatg gcgccgctcc tcctcatcgc 960301 tgcgctctgc atcgtcgccg gcgcgggggt catgggctcc attatcggtc gtgggccgac 960361 accaccaacg tgatgcggtg gatggcgaag tcacgcagtc gcccggatga cgagtcgaac 960421 gccaccagct ggccgccccg tagcgtgatc ggtgcgacca cccgctgagt ggcaacgccg 960481 gcggcatcga ggtagctgat caccaaggtg gcctggtcct tggccgcgcg ctgcaacagc 960541 gacatggtga ccgccgggtc gacgcggaca ttagcgaacg gcgctgcggt cacctcacgc 960601 agcacggcaa ccacggcttt caacgcctcg ctattgggtc tcggcggcgg tcggtatggc 960661 cggcgccgtt gcggtgtggg cacccgggcg ccgcgggttc gcacgtcgac aacggctccg 960721 gtggaatctt cggcggccgg ggcaaagccc gcgccgcgca acgtgacgag gacttcggat 960781 atcggagcgg gggacaccgc caccgttggg gccagggccc gcagtgccag cccgtcggct 960841 tcgggcgccg ccacgacctg ggccagtagc gttgggtcct cgcaccgcac gaacgatgcg 960901 gccatgccga tccgaagctg gccgtgccgg cgcgcgacat cgtcgatgag atatgtaagc 960961 ccttgtggta caggagtttt agaacgattt gcgaagaatt cctgcaacca gtcgcgggac 961021 ttgccgacat cgagggcatg ccggatcgac tgctcgctga cgcggtacac catcgccgtg 961081 ccggccgatt ccacggtggc gacggtggtc aggtcgtcgg ccagttcgcg ctgcagcggc 961141 cctggcacca cgacggtcag gtcggcctgc accaggaagt gatcgatggg cttgggcagc 961201 gcccgagcca tcacgccgac cgcggcggca ggggcagtag ctggctctaa ggcctcgtcc 961261 aacagtgcgc gagcaggcgt gctgatcgcc ccgcgcccca ccagacccag cgcatggccc 961321 tctgtcagca gatccgcgat aggcgcaggt tgcaatcgcc tggcccaacg tgggcggcgc 961381 cagatcagtg tcgccgacgc ccgggacgca tcgacgccgg cgccggcggg cagctcggcg 961441 agcatgccta gcaatagccg gcgatccagt ggggccgccg tggagaacag cgaatccgac 961501 agggcgccat agggtttggc gtcgggtccg cgggtaccga ttaacgccgg ccggcccgga 961561 aggtcaagcc aggcgctggc cagcaagtgc caacgctcgg cgggtgacat cgtggcgaat 961621 cgatcggcgg ccaccgttgg cgcccaaaaa ggtccgtcac tgtggggcgg ttcgggatcg 961681 ggcatgccgc tggcgatcag tccagccgcg gccgcaatct cgaggattag gcccagccgc 961741 ggctcgtcga ttcccgttgc cttggccagc cgcttgaatt cacgaacccc cagtccgccg 961801 ctgcgtagtt cggcaaccgg tgtggcgccg aggttttcga gcagtacgtc gacttcacgc 961861 agtaggtcga tgacggctcc ggccgccgca gcgtcggcgt cgtcgggtgt ggtggtggaa 961921 actaccgggt ccggcgcggt caactccatc ggaccgggtt gttcgccgcg cagcacctgc 961981 ccgacgtggc ggggcaagat caccgtttcg gcatcgattc gtcgcagcaa gcccatcgcc 962041 agcaaccgcg gcacgggtcg atcagatggc gcgccgggtg cggcgtcgcg agtgcgcccc 962101 acgggtgacc cttggagcaa tttgtccaga acgtcacgct gcgcggggtc gaggccggcg 962161 atcaggtcgg cgagctgatc cccggaacgc gaacttccct cgagggtgac ctggccggga 962221 tgccacggca acgccgtacc tgcgtctgtc gccacccgga ctgcggtctc gccccaggcc 962281 agggcacgtt gtttaaggtc agccagcgcg ccaagcacgt cggcttgggc ggcgcggtcg 962341 ccgatcactg ccagcagccg gacgatcggc accggtgcgg tatctgcctg cagcaccagc 962401 agtgcgtcga acaccgccag ccgcaggaag tcgagctcgt cggtggccgc cttgaccgac 962461 tggcgggcct gggcacgggc ggccagcgcg gcgatgctgc cgggtggtgg ctgggcaagg 962521 tcgggccgca gctccaacag ctgggtcagc cgttcatcgg gcaaggcggc cagccaggac 962581 cccagcggga tatccggggt gtgttcggtc attgctgatc agcgtaggcc ggaccagcct 962641 tgtggcgtgg gcgggtgcaa gacctgtcag aatggtttcg tggctgacat tgctgaaggt 962701 aaggcacgca agaccaggta cgtggaccat ggttggccga ccaccgatcc agacgaccat 962761 gcggtgagcg aactcgtgac cgaccgcacg ggtgcgctat cacccttcgg tgaattgacg 962821 ttcccggtac cgtccgacga cctgccctac atccacccgg tgaccgtcat caatcggtaa 962881 gccgccagga tggccagggc ttctggggca tccgactacc gctcgggcga gctgtcgcac 962941 caggatgagc ggggggcagc gcacatggtc gatatcaccg agaaggcaac cacgaagcga 963001 acagccgttg ccgcgggcat cttacgtacc tcggcgcagg tggtggcgct gatctcgact 963061 ggcgggctgc ccaaagggga tgcgctggcc accgcgcggg tggcgggcat tatggcggcc 963121 aagcgcacca gcgacctgat cccgctgtgc catcaactcg cgcttaccgg agtcgacgtc 963181 gatttcaccg tcggccagtt ggatatcgag atcacagcga cggtacgcag taccgaccga 963241 acgggcgtcg agatggaagc gctgaccgct gtcagcgtgg ccgccctcac gctctacgac 963301 atgatcaagg cggtcgatcc gggcgcgctt atcgatgaca tccgggtgct ccacaaagaa 963361 ggcggtcgtc gcgggacctg gacgaggcga tgagcacccg gtccgctcga attgtcgttg 963421 tgtcgagccg cgcggcggcc ggtgtgtata ccgatgattg cgggccgatt atcgctggat 963481 ggcttgaaca gcatgggttt tcgtccgtcc agccgcaggt ggttgccgac gggaacccag 963541 tcggcgaggc gctacacgac gcggtcaacg ccggagtcga cgtgatcatc acttccggcg 963601 gcaccggtat ctcgcccacc gataccacgc ccgaacacac ggtcgccgtg ctggactacg 963661 tcattcccgg gctggccgac gcgatccgcc gctccggcct gcccaaggtg ccgacatcgg 963721 tgctgtcgcg cggggtgtgc ggcgtggctg ggcggaccct gatcatcaat ctgccgggat 963781 cgcctggagg tgtacgtgac ggcctcgggg tgctcgccga tgtgctggac catgctctcg 963841 agcagatcgc cggtggagat cacccgcgat gacgcaggtc ctgcgcgccg cgctgacaga 963901 tcaaccgatc tttctggccg agcacgagga gctggtgagc catcggtcgg ctggcgccat 963961 tgtcgggttc gtcggaatga tccgcgaccg tgacggtgga cggggggtgt tgcggctgga 964021 gtactccgcg cacccgtcgg ccgcacaggt ccttgcggat ttggtggcgg aggtagctga 964081 agagtccagt ggcgtgcgtg cggtggcggc cagccaccgg atcggcgtct tgcaggtcgg 964141 ggaggccgcc ctggtggcgg cggttgccgc cgatcaccgg cgggcggcgt ttggcacctg 964201 tgcgcacctg gtggagacca tcaaggcgcg gcttcccgtg tggaagcacc agttcttcga 964261 ggacggtacc gacgaatggg tgggttcggt ttaaagtccg gcctcagccc gtcagccgat 964321 gacgtacggc tgtgcgagcg agtccagcgc atcgttgccg cagacgtcct gggcccgaat 964381 cgcctgccac agcttcttcg tataggcgat gttcgagact tggggcgttt cggcgggcgc 964441 ctcggtgacg tcgccgggcg gcggtgcgtc agctggttgg gggtcgggct cggggagttc 964501 caaatcggtg gcaaggccaa ccgggccgcc tggagctgtg gcgggctgat cgcccggcgc 964561 ggtttgctcg ttcaccgcag cgggtggtgc caggtcggcg ggcgcgggtg gcgccagttc 964621 ggcgggcgcg ggtggcgcca ggtcggcggg cgcgggtggc gccaggtcgg cggacgcggg 964681 tgccagatcg gcgggtggcg ccagttcggc gggagctgcc gggaggggtt cacccagcgg 964741 cgcgggcagg tcgtttacgg caagttccac gggtggtgcc aggtcggcgg gcgcgggtgg 964801 cgccaggtcg gcgggcgcgg gtggcgccag gtcggcgggc gcgggtggtg ccaggtcggc 964861 gggtggtgcc gggtcggcgg gagctgccgg gaggggttca cccagcggtg cgggcaggtc 964921 gtttacggca agttccacgg gtggcgcgac gtcggcgggc gcgggtggtg ccaggtcggc 964981 gggtggtgcc gggtcggcgg gagctgccgg gaggggttca cccagcggtg cgggcaggtc 965041 gttagcggca agttccacgg gtggcgccgg gtcggcgggc ggcggggcca gcggtgctgg 965101 ttcgccgttg accgcggccg cgtccaacgg agcgtccatc gctgccgaag cgggaagcac 965161 ttcgcggggt gttgcgttcg ataacccgcg gccgcacacc ggccaggcgc cgcgaccctg 965221 ggtggccagc acccgctcac cgacggcaat ctgctgctcc cggctggcca gctgagccga 965281 cggggcgaac tcgccgccac catgtgcggc ccaggtgctt tgagtgaact gcaagccacc 965341 gaggtaaccg ttgccggtgt tgatcgacca gttgccgccc gactcgcagc gggccacctg 965401 atcccattcc ccgtcggtgg ccgcggtcgc ctgagcggcc atggcgatgc cgccgccacc 965461 gagtactgcg ccggtaaagg cgatcttggc gacgctgacg ttggatgtgg tgggcttacg 965521 gtggcgtcca ctcatacgtt aggtaattcc tctcggtaca cgcctacgag gtcagctgtc 965581 gggttcgggt tggattcgcc gtggagagga tcacccggcc gcggtcgtac atcggcgaac 965641 gacgttggct tcaccccaag gagccgtatg cggctccggt ccgatctcgg cggacctggt 965701 gggtcccccg cctccatccg cggtcggaat ccctcgccca ctggatggag ttcggcgtgc 965761 tatcggcgag ggagggcacg tcattttggg ttaggttgac gagcctcccg agacggtagc 965821 ggtttcaggc gattccgtca cgtttaagaa aagtcggcgt ttccgtcaca atcgccggca 965881 agaacgccaa gaaatatagg catttgcgca ggtagtaagc cctcgcaatc ggagcgtgtc 965941 cgccccgtta tcgttccgtt atgtgggtaa tgtcacatgg ccttagccgc cggcgaaagg 966001 gggtagtacg tcaatcgtgt cgccggcgga taacgcgacg gcgtcatctc ggacgacaat 966061 cccgtcgcgc aggtaggagc atcgactcaa caccgtcgcg aggcgaacat cgcggaccga 966121 caggccgtct atcagctcgg cgactgtggc gccagatcgc agggtgactt tctccgaccc 966181 ggcaccagcg gccgctcggg cggccgcgaa gtagcggaca gtcacctgaa ttccggcgga 966241 ttcgtcggac acctgcgtca ccggttagcc accgatcgcg ctcatcgggc ggtcgggctg 966301 aatgaaatcg ggggcgttga tgccgtggcc ggcgggtttg ctccacatcg cggcacgcca 966361 tgccgcctcg atcgcgtcgt cgtcagcacc gccgcgcagt aggcggcgca ggtcggtctc 966421 ctcggtggag aacagacagc tgcggatctg gccatcggcg gtcagccggg tgcggtcgca 966481 cgtcgaacag aaggcgtgcg acaccgaggc gatgacaccg aaccgtccgc gtggcgtgtt 966541 cggtccggcg tcgaccagcc agagttcggc cggggccgaa ccgcgcggtg ccgggtcggg 966601 ccgtagccgg aagtggggcc gcagcgccgc cagcacgtcg tcggcgctca gtgcgatgtt 966661 ccgccgccag ctatgccccg cgtccagcgg catctgctcg atgactcgca attgataacc 966721 gcgctctagg cagaacctca gcaggtcgac gacatcctcg cggccggtcg tggggtcgag 966781 gacggcgttc accttgacgg gtgtcaaccc ggctgccttg gcggcggcca agccggccag 966841 cacatgcgca agccggtccc gacgggtgat agcagcgaag tgggcgcggt cgatgctatc 966901 cagcgagacg ttgacccggt ccaggcccgc ttcggccagg gcgcccgccc gccgcgccag 966961 tcccaccccg ttggtggtca gcgagatctc cgggcgcggc cgcagcctag ctgtcgctgc 967021 gaccacctcg tcgaggtggt gggccaatag cggctcgccg ccggtgaacc gcacgctggt 967081 gacgccgagc cgagttaccg cgatgtgtat cagcctggcc agttcgtcgg gccgcagcag 967141 ttgctcgccg ggcagccacc tcagccctcg ctcaggcatg cagtagctgc accgcaggtt 967201 gcagcggtcg gttagcgaca cccgcagatc gttggcgacc cggccgaacg tgtccaccaa 967261 agggccagtg gtgggtacga cccgcgggtc ggcaatgccg ttggtgcggc tgcgcagcgc 967321 cggcatgccc agcgcggtca gtgtcatgtg ggcacctgtg agttgacccc gacgatgtcc 967381 ttgcccagcg gcaccagcga caccgggatc agtttcaagt tggccagtgc tagcggaatg 967441 ccgatgatcg tgactgccat tgccgcggca ctcaccaaat gcccgagggc cagccagatc 967501 ccgaacagca gcacccagat gacgttgccg atcaaggccc cggtcccggc ggttggcttt 967561 tcgacgatcg tccggccgaa cggccacaac gcgtacgacg cgatgcgcag cgccgcgaag 967621 ccaaacggaa tggtgatgat gagcaggaag cagacaagcg acgccagcag gtacccgagg 967681 gccagccaga ggccaccgaa caccaaccag ataacgttca ggattagtcg catatcgcct 967741 ccagcggtag cgcaagccta ccgcgtgagg ggtaagcagg ggtgctcggc ggccgacgat 967801 ccgagtagga tcttcagatc gtcatcgcgt cccgcgcagg cgggacgcgt cttctgttgc 967861 caatccgagc gatccgtcag acaagcaggt gagaccagtg ccgaccggca aggtgaagtg 967921 gtacgacccc gacaaggggt tcggcttcct gtcacaggag ggtggcgagg atgtctacgt 967981 ccgctcctcg gcgttgccca cgggtgtcga ggcactcaaa gccgggcagc gggtggaatt 968041 tggcatcgcc tccgggcggc gcggaccgca ggcattgagt ctcagattga tcgaaccgcc 968101 gcccagcctc tcccggccgc gccgtgagcc ggcggccgag cacaagcaca gccccgatga 968161 gctgcacggc atggtcgagg acatgatcac gttgctggaa agcaccgtgc agccggagct 968221 gcgtaagggg cgctacccgg atcgcaagac tgctcgccgg gtcgccgagg ttgtccgggc 968281 ggtggcgcgg gagttcgagt cctaacgggg tcgggtggtg cgctggccca attgcgccga 968341 gctggcaacg ccgcgccgtt ccagggtcac ggtcggatcg attccgccgc actcggtttg 968401 acgagacgac gaccggctag cagttagccg ggttggccgg cgctgcccgg gctgccgccc 968461 atgccgggat tgccgtcggt ggttttcccg tcgccgcccg tgccgccctc accgcctgtg 968521 ccgccggcgc cacccgcgcc gcccgctcct ccggcaccaa cgccaccgtc gctgcccgcg 968581 cggccaatca gcccgccgga cccgcccttg ccgccgaggg cgccgggggc gccgccgttg 968641 ccgccgttgc cgccggtgtt gccgttaccg ccaaacgctg cgcccggacc actgttggtg 968701 ccgccaccac cggcgcctcc ggcacctcca gcaccaccgg aaccaccagt accaccggca 968761 ccggccgtgc cgccgacacc accggcgccg ccgttgccga tgagcagccc gccagcaccc 968821 ccgttaccgc cctggccgcc gataccgcct tcgccaccgg tgccgagggc gctcgcgccg 968881 tcaccgccca caccgccatc gcccccgaac gccttgcgtg aactgccggc gctgtcggtg 968941 ttggcggcac cgccgtcgcc cccggcgccg ccgccgccgc cgggggcacc gctaccaccg 969001 gtaccaccga ctccgcccgc gccgccggcg ccgccgtcgc cgatcagcag cccgccgcgg 969061 ccaccggcgc ccccggttcc gcccgctccg ccggtaccgc cggaagcgaa gctgtcgaag 969121 ccgttgccgc cgctgccgcc gttaccggct tccgcgttgc tgggagaatt ggtgcccacc 969181 tgggcatccc cgccttcgcc gccggcgccc cccacaccgc cgggggcctg ggcgttcccg 969241 ccggcaccgc cgttacctcc tatggctgca ttgccgatgg aagcggcgct gccgccggcg 969301 ccgccagcgc cgccataggc gccctcaccc cctttgcctc cctcaccgcc cacggcgtcg 969361 ccgttgacgg ttgagacggc gttcccgcca gacccgccag cgccgccgga cgtcatcgcg 969421 tcgccggcat cgccggggcc accgttaccg cctatggcgt tgccgcccag cgtctggtcg 969481 gtgccggtac tgccggcatc agcggtgccg gtgggtgtgg ggttgacgcc gtctgccccg 969541 gctgccccca taccgccctg cccgcccgcg ccgccagaac cgaacaggta ggcgttgccg 969601 ccggcacctc ctgctgcgcc cataccgcca ttgacgccgg ccaccccggt ccccccgagc 969661 ccgccggccc cgccattgcc gtataaccac cccccgttgc cgccgacgcc gccgaccgcg 969721 ccggcccctc cggccccccc ggtcccgccg atgccgatca acccggcgct gccgccggct 969781 ccgccggctt ggccgacccc gccggacccg ccattgccgc cgttgccata caacaagcca 969841 cccggcccgc cggcctgccc ggtacccggc gcaccattgg tgccgttgcc gatcaggggg 969901 cgcccgagca gcgtctgcgt gggcgcgttg atcacattca acgcttgctg cagcggggag 969961 gcatttgcgg cctcggcgct accatatgcg cccgcagccg tgctcaaggc ctggataaac 970021 cgttcatgaa acgcggccgc ctgcgtgccg agtgtctgat aggcctgggc gtgcccggaa 970081 aacagcgacg ccaccgctgc cgacacctcg tcggcgcccg cggccagcac tccggtggtc 970141 ggggccagcg ccgcggcatt ggccgcgctc aacgtcgagc cgatctgcgc caaattgttt 970201 gctgccgctg ccaccatctc cggcgtcgcc aatacatacg acatcgctgt cctcccgcag 970261 ggtcttcgtt gaccgatcgg ctgttactaa cgttagcgcg aacgcgggtc ggcgtctcca 970321 gtttctattt cttgacatgg aaaaacggcg gccccgaccc tgcctcagcg tcgcagccgt 970381 cgttggcggc gagcaccggt gaccgtgact ttggtagcgg cccgtccgca gtgggtgcca 970441 cgtagtattc ggacagatag gtagtggtag gcaaccttcg tgattcgtca gcgaggaggc 970501 ggcgatggca cagcaaactc aggtcaccga ggagcaagcg cgggcccttg ccgaggaatc 970561 tcgcgaaagt ggttgggata aaccgtcctt cgccaaagaa ctctttctgg gccgctttcc 970621 cttagggctc atacacccat ttcccaagcc gtcggacgcc gaggaggccc gaaccgaggc 970681 gtttctggtc aaactgcggg aattcctcga caccgtggac ggcagcgtca tcgagcgtgc 970741 tgcccagatc cccgacgagt acgtgaaagg cctggccgag ctgggctgtt tcggcttgaa 970801 gattccgtcc gagtacggcg ggttgaacat gtcgcaagtc gcctacaacc gcgtgctgat 970861 gatggtcacg acggttcatt ccagtcttgg cgcgttgttg tcggcgcatc agtcgatcgg 970921 ggtacctgaa ccgctcaagc ttgccgggac tgcggaacag aagcggcggt tcctaccgcg 970981 gtgtgcggcc ggcgcgatat cggccttttt actaaccgaa cccgatgtgg gctccgatcc 971041 ggcgcgcatg gcatcgacgg cgacgccgat cgatgacggc caggcttacg agcttgaggg 971101 tgtgaagttg tggaccacca acggtgtggt agcggacctg ctagtggtta tggcgcgggt 971161 accgcgcagt gaagggcacc gagggggaat cagcgccttt gtcgtcgagg ctgattcgcc 971221 cgggatcacc gtggagcggc gcaacaagtt catgggactg cgtggcatcg aaaacggcgt 971281 gacccggctt catcgcgtca gggtgcccaa agacaacttg atcggcaggg aaggcgacgg 971341 tctgaagatc gcgctgacca cactcaacgc cggacggctg tccctaccgg cgatcgcaac 971401 cggagttgcg aaacaggcgc tgaagatagc gcgggaatgg tccgtcgagc gagtgcaatg 971461 gggcaagccg gttggccaac atgaagcggt agccagcaag atctcgttca ttgccgccac 971521 caattacgcg ctcgatgcgg tggtcgagct gtccagtcag atggccgacg aaggccgcaa 971581 cgacatccgg atcgaggctg cgctggctaa attgtggtcc agtgagatgg cctgcctggt 971641 tggcgatgag ttgctacaga tccgcggtgg ccgcggatac gagaccgccg aatccctcgc 971701 cgcgcgcggt gagcgggcgg taccagtgga gcagatggtg cgggacctgc ggatcaaccg 971761 gatcttcgaa gggtccagtg agatcatgcg gctgctcatc gcgcgtgaag cggtcgacgc 971821 gcacctcact gccgcgggtg atctggcgaa ccctaaggcc gatctgcggc agaaggccgc 971881 ggcggcggcc ggcgccagcg ggttctacgc gaagtggttg ccgaagctgg ttttcggcga 971941 aggccaacta cccacgacgt accgcgagtt cggcgccctg gcgacacatc tgcgttttgt 972001 cgaacgctcg tcacgcaaat tggcccgcaa caccttctac gggatggcgc gctggcaggc 972061 cagcctggag aaaaagcaag ggttcctcgg ccgcatcgtg gatatcggcg ccgagctatt 972121 cgccatctcc gcggcgtgtg tgcgcgccga ggcgcagcga acggccgatc cggtcgaggg 972181 tgagcaggca tacgaactgg ccgaggcgtt ctgccagcag gccacgttgc gggtggaggc 972241 gctgttcgac gcgttgtggt ccaacaccga cagcatcgac gttcggctgg caaacgatgt 972301 gctggagggc cgctacacct ggctggagca agggatactc gatcagtccg aaggcaccgg 972361 accgtggatc gcgtcctggg aaccgggtcc atccaccgag gccaatctgg ctcggcggtt 972421 cttgacggtg tcgccatcga gcgaagcgaa actttagggc gcccgcgtgg ccggtcacgt 972481 ccgcggggga ccgcccgagt ctcgtcgggt accacgctgg cgcgtatcgc gtctgggtgc 972541 aggttctatt ccatgtcgtc gacaaacagc gccatcgatg cggtgaatcc gtgcagggca 972601 ttgcggcccg cgatcgggcc gatctctccg gcggcgaaga agccggcaag cggaattccg 972661 cctaggagtt cctcgatcgt cgacgcgtcg tggtcggcga ccccgaacat ccgccgcccc 972721 cgcccgttgc aggtgaacaa cagcgctcca gccgcgcgtc cgggcagccg cgccgcggcc 972781 cgctccacgg tcaggcgtag gtccttgtcg gccccggccg cgtcacggac ctggaactgc 972841 atggtggcgc cgacctggac aacctcgtcg atctcgatcg acccggtcga cgggtcggcg 972901 ccgagcagcc cgcggatcac gaaatcgccc tgacccggag ccgccaggtg ctcgtcgacg 972961 acgatcccga tctgtaggcc gtggctgacg agtgcccttt cgtcgggcga cagcccctcg 973021 acgatctcac gcagtcgctg caacggcgga cggccgccga gctcggtgat cagtatgccg 973081 tccgcgccgg tgacgatgta tgggtagccg atcggccggc aaccctgcga cacgaccggg 973141 acaccgcgca tcccgggcag gcgcacgccg acgacgccgg aggtgagcac gtcgtgatcg 973201 cggaacagcc gggtgtcgcc ccgccggcgc ccgccgctca ccacgccgcc cacgacggcg 973261 gtgcccggca ggtcggtgtt ggggtgctcg atgagcaggt tcgacgggaa tgtgtacggg 973321 tccggcagca gcagatgcag atcccgggcg gtgcggtcga accgataacc ggtgatcagg 973381 gcacccgagc cggtacggac aaagtccagc tggaatgtct cggcggccaa gccggacgcc 973441 agccacacca ccaccgcggg ctcgtcctcg atctcgtggc ggccggcgac gatggcctgg 973501 gcgatgcaac cgacaagcgc gggcggatcg atcatctgca gcaccgcgct caggacgtcg 973561 gcagcccggt cggtgtgtgc acgcgatcca agcaacaccg ccagcgacgg cgcctcaccc 973621 gccagctcgt cgcgcgcctg gcccgcagcc tccaccgcgg cctgccgcgc gtcgggcgtg 973681 gtgcaaaccc cgactccgat ccgcacagtt ccatgatgcg ccgatgtgcc ccgggtgtcg 973741 gcggctcttc ggaccgttgg cgccgaccgc gcttaagcgc ggtcggccgt cgagccgcgg 973801 cctcgtcaaa agataaggcg caccgaccat tccgcgtgcg gaacgtcgcg tagttcaccc 973861 gagtggtcga ccaccaacgt cagcaactgc acgacaatcc cggtcagccg cccgcgctgc 973921 gggtcgacag tggggatggt gaccgccaac cgggtgtccg gccgaaacaa ggtgctggtg 973981 gtgttggcgg ggtcctggta tacctgcagc aaacgccacg gcgcccggga aatgacttcg 974041 ggtaccgaga gctgcacggg atagcgttcg cttaccggca attcgccctg cgcctgcggg 974101 gtctgacagt cgtcgaggtc gaccacgttg cagtacaaat agggccccac gcgggtcagg 974161 tgcccgtgcg agtaagcgct gatctcgggt tgctgcggac cgtgtccgcg tactagcagc 974221 catgcaccgg ccccggccgc caccgagagc agaatcacca ggatcaccgg cagcgttgcg 974281 acaccgcgct tcactgcggc gccaccgccg caccacgacg ggtggtttct tgctcggcca 974341 tcacgggccg attaccgccc aggccaggga tcagcgaatc gccgcggaag ctgacgatgg 974401 tctgagccag acccaggatc agcagcgcgc tcaccgcagt gaagcccacc cacagctcgg 974461 tgtacaccaa cacgcccacc gcgccgccca gcacccaggc cagctgaaga gtcgactcgg 974521 aacgcccaaa ccccgatgcc cgcgactcct cgggcaggtc gtgctgcaac gaggcgtcca 974581 gcgaggcttt agcaatggca ctggaccctg ccgtgatcag ggtggcaatc gctgtcgctg 974641 ccaggctgcc ggccaccgcg gccgcgatgg ctaacacggt aactagcacg gtgcagcgca 974701 ccaccagcac agctggcctg cctagctgca ggcgtgcgct ggtgaaattg ccggcgaagt 974761 tgccgaccgc ggccgccgcg ccgatcaggc ccagcatgcc caattgcacc cacccgttgg 974821 cttcgtgcgc cttggcgaca aacgccggat acaagaacag aaagccgacc atcaccttga 974881 tggtgcagtt accccacagg gaggtaatga tgttgcggcc caacggttgt cggagtgttc 974941 cgccgaggtt cttgacttcc tccggccagc gtcgccgtag tctgccccta tcccggtggt 975001 agctcaatgt ggccgggacc tcaccgctgg tcacctcgac ccagcgcgga atgcgcatcg 975061 acagcgaagc gccagcgatg gtgatcgcga cgacgacgaa caacgcgccc ggcagctgga 975121 acaggtgggt gcagacgaat tcgactccgg ccgcaatcgc gccaccagcg atggtgccgc 975181 cgagcaggcc gaacacggtc agccgtgagt tgacccggac caagtcgatg gttggcggca 975241 tcaccctcgg tgtcactgcg ctgcgcagca cgctgaacga cttcgagaac accatcatgg 975301 ccagcgcaca gggatagagc acccatgacg ggaagctgcc ggtggcgccg tcgtagttca 975361 tgatcagcac caccgccaac gcggtccgaa gtccgaatga cagcgccaag gcgacgcgac 975421 ggccatgctg cagccggtcg agtgccggac cgatgagtgg agcgatcacg gcgaacggcg 975481 cgatggtgat caacaggtac aaggcgaccc tggacttgct ctccccgctg gcggccgcaa 975541 agaatagtgt gtttgccagt gctaccgcca ttgccgagtc gaccgcgaag ttcgccatta 975601 ccggccaggt caatgccgtc agtccagact tgtcggcgcc gtctgcggta gcggcccggt 975661 gcaccagcaa gtacatccga gaacccattt cgcggctgcg catggccgcg gcgcgggtga 975721 cggtgatccg ctcgcccgcc cttgtagtcc gtggcggtac gcggctgcgt tcaggttcgg 975781 gctgctcgcc cagcggcggg agatagcggt tggcactggg catcggcgga ggtcgacgcg 975841 atcggcgata gttggcgtcg tcgggagggt agttggccat gccagggtgc ccgttgaccg 975901 atccgtttcg ggtccggcgg cccggggttg gggccatccg gcccggatga tcacctcgcc 975961 gtccggacac aaatcaattc tgtcctatcc ggactcctgg cgtagccaac cgggtgtggc 976021 ttgccggccg tgtcttccgg cagtattgga agcgcgttac agagagggga cagcgtgacc 976081 gggcccaccg aggagtctgc cgtggcgact gtggccgact ggcccgaggg gttagcggcg 976141 gtgctcaggg gtgcggccga ccaagccagg gccgccgttg tggagttcag cggcccggag 976201 gcggtgggag actacctggg cgtcagctac gaggatggca acgccgccac ccaccggttc 976261 atcgcgcatc tgcctggcta ccagggatgg caatgggccg tcgtggtggc gagctattcc 976321 ggtgcggacc atgccacgat cagcgaggtg gtgctggtcc cggggcctac cgcactgctg 976381 gcgccggatt gggtgccgtg ggagcaacgg gtgcggccgg gagacttgag ccccggagat 976441 ctgctggcgc cggcgaagga tgatccgcgg ctggttccgg gttacaccgc cagtggtgat 976501 gcgcaggttg acgagaccgc cgcagagatc gggttgggtc ggcgctgggt gatgagcgcc 976561 tggggtcgcg cccagtcggc ccaacggtgg cacgacggcg actatggtcc cggctctgct 976621 atggcgcggt cgacgaaacg cgtctgccgc gactgcggtt tcttcctgcc gctggccggg 976681 tcgctgggcg caatgttcgg ggtatgtggt aacgaactgt ccgctgacgg gcatgttgtc 976741 gataggcaat acggctgtgg cgcccattcc gacaccactg cgccggccgg tggcagcaca 976801 cccatttatg agccgtacga cgacggtgtg ctcgacatca tcgagaagcc ggctgaatca 976861 taggttttct ctcacccgct gttccctact tttttttggg ggggggcacc agtcgaagaa 976921 acccgactga ttatcacccg tattgaacac tcccgagctg ttgtcgcccg agttcgccac 976981 acctgaagtc tggagggtgc ccgtattggc caagcccgag gtgaatctgc ccacgttgaa 977041 gaagcccgag ttaacgcggt gcctgcattc tggaagccgg agctagtgtc gctcgcgttt 977101 ccgaagccgg agctgccgtt gcccaagttc tggaagccgg agccactatt gccggagtta 977161 aagaagcccg agtgaccggt gcccgtgttg ccgaagcccg agttcgcgac gccttgagtg 977221 accgggctgc cgatgccggt gttcaggtcc cccgagttga agccgcccgt gttggtgtcg 977281 cccgaattac cccatcccgt gttgtcgtcg ccggcgtttc cgaagccgag gtttccagaa 977341 ccttcgtttc cactgccgat gttgaggaag ccggcatttc cgctgccgaa gttggtgttg 977401 ccgtcgtttc cgttgccgaa gttgaagaag ccggcgtttc cgctgcccag gtttgcattg 977461 ccgatgttgc cgttgccgag gttggcgttg ccgatattcc cgatgccgaa gttgtagtcg 977521 ccggtgttgc cgctgcccac gttgttgttg ccgatgttgc cgatgcccaa gaagttcccc 977581 acgccgatgt tctcgacgcc cagggccggg atcgcgagcg ctgctgggac ccccaccgca 977641 ccggccggcg cggcggtcac cacgggcggc aagacactca gcagctgctg ccacggggtc 977701 aactgcgaag ccaccgtcga ggctccaccg tgataaccca ccatcgccgc cacatcctgt 977761 gcccacaact gctcatagga cgcctcggtc gctgcgatcg ctggcaaatt ctgcccaaac 977821 agattcgaga gcaccaacga cagcaactgg ttgcgattgg ccgccaccaa tgctggatgt 977881 gcggtcgctg cccgcgcggc ctcatatact gcggccgcag ccttggcccc ggcagccgcc 977941 ccctcagcgc gtgccgtcgc cgcgttcagc caactcagat acggtgcggc cgcggccgcc 978001 atcgcggccg ccgccggacc ctgccacgcc gaacccggcc cagccgttag gcctgagatc 978061 agcaacgaaa acgaggccgc tgccatcccc agctcggcgg ccagcccatc ccaggccacc 978121 gccgccgcca gcatcggggc cggccccgca ccggcgtaga tccgcgccga attaacctcc 978181 ggcggcagca ccatgaaatt catcacgcca tcccttctca gctggccacc cccggcctag 978241 ccaccacgac ggcgggaccc ggctgccgcg atccgcgccg gcgggcctcg gtcgactaca 978301 gtggcgcgat cgctcgacaa cttgagcacc ttggcaaacg acggtatgtc caatcgcggc 978361 acattgtcgg ggttttcatc gaaatcctgt cgccaacccc gacagccggg ttccgggaag 978421 ccgggtgtcg cagtggttta ggtgtcgacg ttgaacaccc gggcaggcaa ccggccgtgg 978481 ctatttcggg tcgagatagg tttcgagtcc ggcttgtgcg ccgcgtgcgc cacggcgggc 978541 agcggcgagc tgccagacga agatggtcgt gccgagcaac cctgttgcca gcccggccac 978601 tgtcaccgga cgccaacttg cgaggccagg cacgacgaat gcggccaccg cggcgaccag 978661 ccaggcgagc gcgccgaccg cgatcaccgg ccacacctcg agcagcacgg gtggtagcgg 978721 tggcggctcg cgaatctgac tattttcgac gctcatcccg agtcaacata gcgcggcgat 978781 gatgcgtcgg cgaacggccc ggggtgggtg gcttccgcac cagcgggagg taccaccacc 978841 tgctggtggg tcgtcggccg gcaatgggtg gaaccgaaat cgtcgttcgc cgtttcagat 978901 gccctagtct gaacttccgt tgtaacctca gctgtgcttg acagcgatgc gcggctggcc 978961 agcgacttgt cattggcggt catgcggctc tcccgccaac tgcggtttcg gaacccgtca 979021 tcgccggtct cgctgtccca gctctcagcg ttgacgacgc tggccaatga gggcgcgatg 979081 accccgggtg cgttggcgat tcgtgaacgg gtccggccac cgtcgatgac cagggtgatc 979141 gcctcattgg ccgacatggg ttttgtagac cgcgccccac accccatcga cggtcggcag 979201 gtgctggtct cggtgtcgga atcgggcgcc gaattggtca aggcggcacg gcgggcccgg 979261 caggagtggc tggctgagcg gctcgcgacg ctgaaccgca gcgagcgtga catcctgcgc 979321 agcgccgccg atctgatgct ggctctggtc gacgaaagcc cgtgaccgaa ggccgttgtg 979381 cccagcaccc cgacggcctc gatgttcagg acgtctgcga tcccgacgac ccacggctcg 979441 acgatttccg tgacctgaac agcatcgacc gtcgtcccga tctgccgacc ggcaaggcgt 979501 tggtgatcgc cgagggtgtg ctggtggtgc agcgcatgct ggcctcacgg ttcacgccgc 979561 tggcgctgtt cggcaccgac cgccggctgg ccgagctcaa ggatgatctg gccggtgtcg 979621 gcgcgccgta ctatcgagcg tcggctgatg tcatggcacg ggtgatcggc ttccatctca 979681 atcgtggggt gttggcagcc gcgggccggg tgccggagcc gagcgttgct caggtggtcg 979741 ccggggcgcg caccgtcgca gtgttggaag gcgttaacga ccatgagaac ctgggctcga 979801 tcttccgcaa cgcggcaggg ctgagcgtgg acgcggtagt gttcggcacc ggctgcgctg 979861 atccgctcta ccgtcgtgcg gtccgggtat ccatgggaca cgcgttattg gtgccatatg 979921 cacgcgcggc cgactggccc accgaactta tgacgttgaa agagagcggc tttcgactgt 979981 tggcgatgac cccacacggc aacgcgtgca aactaccgga ggccatcgcc gcggtgtcgc 980041 acgaacggat tgcgctactg gtgggcgcgg agggcccggg cctaacggcg gccgcactgc 980101 ggattagcga tgtgcgggtg cgcattccga tgtcccgagg gaccgactcc ctcaacgtcg 980161 cgacggcggc cgcattggct ttctacgagc ggactaggtc gggccatcac attgggcccg 980221 gcacgtgaac gatcagcgcg accaagccgt gccctgggca acgggtttgg cggtcgccgg 980281 cttcgtcgcc gcagtcatcg cggttgcggt cgtggtgctg agcctcggcc tgatccgcgt 980341 gcatccgctg ttggccgtcg gtctcaacat tgtggcggtc agcgggttgg cccctacgct 980401 gtggggctgg cgccgcaccc cagtgctgcg ctggttcgtg cttggcgcgg cagtgggcgt 980461 ggcgggcgcg tggttggcgc tgctcgcctt gacgttgggg gacggctagc gacgcccgcc 980521 tgagcgcacc ccgagcagca catcttccca ggcaggtatg gcgggtttgc ctcgtcggtt 980581 gctgaccggc tgtgcggacg gcaccgtgag cgtcggctgt gcgggctcgg gctcatcgaa 980641 gtcgagatgg gcgaccggcg ccaacggccg tagcgggcga ttgaaggtgg ggttgatcag 980701 ctcatgggcc gtgtcgtcga tcgcggtggc ggttccgccg tgggcgccgg gggtgaagcg 980761 gaaatgcgcc aggttgtcgg agcggccagc cttccaggca agctgcaccg tccagcgact 980821 gtcctcgttg cgccacgcgt cccaggtgag gctgtcgggg ttaaggccgc gtgccaccag 980881 ggccgcggcg acggtctcct gcatggtcag caccgccggg ccgtcggcca ggaccgggtg 980941 cgccgcggtt gccagctcgg ccgcgcgcga gcgttccaac agtaccgggt gggcaaaccg 981001 gcggatacgg gcgatgtcgg agcccgatgc cgcagcgacc tgttcgacag acgcgccggc 981061 ccgaattcgg gcctgaatct ccttggggct cagcacgttg gtgacctcga tgtccagctg 981121 ggcttgctcc ggctggacgg agtcgtcccg tagcgccgcc cgcagtcggt cgtcgaccgg 981181 cagcttgaac tgttcggacg ggatggcacc ctggcagatg atgtttttgc cgtcggcatc 981241 gagcccaacg actttgagtt cccgcatggc ttctcctcgc aggctccggg caggacaacg 981301 ccggacctgt tacgtgcgca ctctagtgcg gtaaacgccg ttagcctcgt tgacacgcgg 981361 aggtgtcttg ccggcatggc gctggtgacc ggaatgcccg gtcacagccg cactaaggca 981421 gcgctaaagc cgctcgacca cccagtcgac gcactcggtg agggcgctga cgtcgtccgg 981481 ctcgaccgcg gggaacatcg cgacccgcag ctggtttcgg cccagtttgc gatacggctc 981541 ggtgtcgacg atgccgttag cccgcaggat cttcgcgacg gtcccggcgt cgacgtcgtc 981601 gacgaagtcg atcgtgccca ccacctgcga ccgcaacccg gggtcggtga caaatggcgt 981661 ggtgtagggc cgctcttgcg cccacgagta caaccgctgc gacgagtccg cggtgcgttt 981721 gaccgcccag tccaagccac cgttacccac cagccagtcg atctgttcgg ccagcagcgc 981781 cagcgtggcg atggccggtg tgttgtatgt ctggttcttc aagctgttct cgaccgcgat 981841 cggcagggac aggaaatcag gaacccagcg accggtcgcg gcgatggcct cgatccggct 981901 cagggcggcc gggctcatga tggccagcca caggccgccg tcgctggcga agttcttctg 981961 cggtgcgaag tagtaggcgt cggtctcggc gatgtcgacc ggtaggccgc cagcaccgga 982021 ggtggcgtcg atgacgacca aggcgtcatc ggagccctcc ggacggcgca ccgcaaccgc 982081 gaccccggtc gaggtctcgt tgtgggccca ggcgatcaca tcgactgacg ggtcggtttg 982141 cggctccgga gcactgccgg gatccgacgt gatgatgatc ggctcgccga cgaacgggtt 982201 cttggaaacg gcggaagcga acttcgcgct gaactcgccg taagtcaagt gcagtgagcg 982261 tttgtcaatc agcccgaagg cggccgcatc ccagaacgcc gtggcaccac cattgcccag 982321 tatcacctca tagccgtccg gcaacgagaa cagctcggcc aggcctgacc gaaccctgcc 982381 caccagattc ttgaccggcg cctgtcggtg cgacgtgccg aacaatgccg ctgcggtggt 982441 ggtcagcgtt tgcagttgct caagccggac cttcgacggg cccgacccaa agcggccgtc 982501 gcggggtttg atggcggtgg gaatttccag gtggggggtg agctggtcgg ccatgccatc 982561 agggtagtga ggggtaccga accgcggcga ctcgagcgga acgaaagcct gccggcacag 982621 gcgcgtagtg tgaacaagct cacatgcaag ccctggctgg tggctgggtc atagtgtcgc 982681 caagggtctg gataattccc ggtaccagcg gtaccgtgtt cgatacccgt gcggacgcac 982741 acctcggtgg ggaggcttcg aatggacagg acgcgcatag ttcggcggtg gcgccgcaac 982801 atggacgtgg ccgacgacgc cgagtacgtg gaaatgctgg ccacactgtc cgaggggtct 982861 gtgcggcgga atttcaaccc gtacaccgat atcgactggg agtcgccgga gttcgccgtc 982921 acggacaacg atccccggtg gatcctcccg gcgaccgatc cgttgggccg ccacccctgg 982981 taccaggcgc agtcgcggga acgccagatc gagatcggga tgtggcgcca ggccaacgtg 983041 gccaaggtcg ggctgcactt cgaatccatc ctgattcgcg gcctgatgaa ctacacgttc 983101 tggatgccca acggctcacc ggaataccgg tattgcctgc acgaatcggt cgaagagtgc 983161 aaccacacca tgatgttcca ggagatggtc aaccgtgtcg gcgcggacgt tccggggctg 983221 ccacggcggc tgcggtgggt ttcaccgctg gttccgctgg tggccggacc attgccggtg 983281 gccttcttca tcggcgtgct cgctggggag gagcccatcg accacacgca aaagaacgtg 983341 ttgcgcgaag gcaagtcgct gcatccgatc atggaacgag tgatgtccat tcacgtggcc 983401 gaggaagcgc ggcacatctc gttcgcccac gagtacttgc gtaagcggct gccgcgcctg 983461 acccggatgc agcggttctg gatctcgctc tacttccccc tgacgatgcg gtcgttgtgc 983521 aacgcgatcg tggtgccgcc caaggcattc tgggaggaat tcgacatccc gcgcgaggtc 983581 aagaaggagt tgttcttcgg ctcgccggag tcgcgaaagt ggttgtgcga catgtttgcc 983641 gacgcccgca tgctggccca cgataccgga ttgatgaacc cgatcgctcg gctagtgtgg 983701 cgactctgca agatcgacgg caagccgtcg cgctaccgca gcgagccgca gcgtcagcac 983761 ttggctgccg cgccggccgc atagcttgct acgagtgcac gcatgccgca cgtaattact 983821 cagtcgtgct gcaacgacgc gtcctgcgtc ttcgcatgtc cggtgaactg catccacccg 983881 acgccggacg agccgggctt cgcgacctcg gaaatgctct atatcgatcc ggtggcctgc 983941 gtggactgtg gtgcctgcgt aaccgcctgc ccggtcagcg cgatcgcgcc gaacacccgg 984001 ttggacttcg agcagctgcc gttcgtcgaa atcaatgcgt cgtattaccc gaagcggccc 984061 gccggcgtga agctagcgcc gacgtcgaag ctggctccgg tgactccggc cgccgaggtg 984121 cgtgtgcgcc ggcagccgct gacggtagcc gtcgtcgggt ccgggcccgc ggcgatgtat 984181 gccgccgatg agctgctggt ccagcaggga gtgcaggtca acgtctttga gaagctgccg 984241 acaccctacg ggctggtgcg ctccggggtg gcgccggatc accagaacac caagcgggtc 984301 acgcgactat ttgaccggat cgccggtcat cgccgcttcc ggttctatct caacgtcgag 984361 atcggcaagc atctaggcca tgccgagcta ttggcccacc atcacgccgt gctgtacgcg 984421 gtcggagcgc ccgacgaccg ccggctgacg attgacggga tgggactgcc gggcaccggt 984481 accgccacgg agctggtcgc gtggctcaac ggacatcccg acttcaacga tctgccagtc 984541 gatctcagtc acgaacgcgt ggtgatcatc ggcaacggga atgtcgcgct cgacgtggcg 984601 cgcgtgcttg cggccgatcc gcacgagctg gccgccaccg acatcgccga ccacgcgttg 984661 tccgcgttac gcaactcggc ggtccgtgag gtggtggtcg ccgcccgccg cggtcctgcc 984721 cattcggcgt tcaccctgcc cgagctgatc gggctcacgg ccggagccga cgtcgtgctt 984781 gacccgggag atcatcagcg agtactcgat gatctggcaa tcgttgccga tccgttgacc 984841 aggaacaagc tggagatctt gagcacgctg ggggacgggt cggcgcctgc gcgacgagtc 984901 gggcgcccgc ggatccggct ggcctatcgg ctcacgccgc ggcgcgtcct cggccagcgg 984961 cgggccggcg gagttcagtt ctcggtcacc ggaaccgacg agctgcgcca actggatgct 985021 ggcctggtgc tgacgtcgat tggctaccgc ggcaagccga ttcccgacct gccgttcgac 985081 gagcaggccg cgctcgtgcc caacgatggt ggacgggtca tcgacccggg caccggcgag 985141 ccggtgcccg gcgcatacgt cgcgggttgg atcaagcgcg ggcccaccgg gttcatcggc 985201 acgaacaagt cctgctctat gcagaccgtt caggcgttgg tggccgactt caacgacggc 985261 cggctgaccg atccggtggc tacaccgacg gcgctggatc agctggtgca ggcccgccag 985321 ccccaagcca tcggctgtgc gggatggcgg gccatcgacg cggccgagat tgcgcgcggc 985381 agcgccgacg gccgggtccg caacaagttc accgacgtcg ccgagatgct cgcggcagca 985441 accagcgcgc ctaaggaacc gcttcggcgg cgcgtgctgg cccggctgcg tgacctgggg 985501 cagccgatcg tgctaaccgt ccccttgtga tgacatggcg gcttggatct catccatgtt 985561 gacctcgcgc accggctggc ccagcgacca gtggtggccg aacgggtcgg cgaccacccc 985621 gtagcggtct ccccagagct ggtcctccaa ggcggtcacc accgtggcgc ccgcgttcag 985681 ggcacgctgg aacttggcgt cgacatcggt gacggtcaaa tgaatggtga ccggtgttcc 985741 gcccagcgag gtgggcgtca tcgacttgcc gccgcacatc tgcgggacgt cgtcgttgag 985801 catcaccgta aagccgttga tgcgtagtgc ggcgtggatc agtttgccat cgggaccggg 985861 gacgcgcccc agttcgacgg cgtcaaaggc cttgacgtag aagtcgatcg ccgaggcagc 985921 gtcgtcgacg acaaggtgtg gtgacagagc gggttcgacg ttgatcgcca tggtgtctcc 985981 ttgttgttgg tgtgctcggc caatccgggg cccggacagg ctcacggata ttgactcccg 986041 gcgcgatgga aaatcatcgc ggtgccgtca ttcaatcgcc ggacacgtgg ccaccgccca 986101 gcggtgtggc cagcaagccg aatctcaacc gcaggtgtgt tcaatgaata cttttccgtc 986161 acaacgtgat tgctgctttg tgtcgacaag cgcacttttc ggtctcgaca cgaatgctct 986221 tccgttacag cgcaagttga aactttctgc acgcaaccca tgccgaccat gtccgcgcca 986281 cccgctcaag cgccggtatg tggcgccttg gcggctaggc caaccgcccc cggcaacgcc 986341 agctgcacac gcccagcgaa gcgcgattgt cggtacgggt cgcgctgcga aacctgcctc 986401 ccattcgcac tagcaaaaga ctgtcgacaa gcgagcagtc gacttcaggc cgcgaccgaa 986461 ccggacgaga cgacaacaac atctgtcatc tcaatgcgct caccaggatc gctacaatat 986521 cagccagcta catgagccga tgtatatcca ggaaggctct gccgccgaca tgttggatcg 986581 ctcgcgcgga cagctgtacc ggctctacct ggctagtagg tgaattcaat ggcgcgttcg 986641 ctcattactc acccatgtgc acaataggtt cgcgtgcggc tcgccggcaa cgttggcaac 986701 atcccgattc ccattgattg cacgttgcgc ggcctaaccc aatattcccg gacgaacaac 986761 gccgaggtcg tgcagagcgt cgagacacac caccgtcccg ctaactttga tgccctcacc 986821 tgaggaaaac cacaggagcg tcaggtactc acccactgcg ggaattgcga tgacgttcaa 986881 accgatcgag gccgcgcagc tacgccagcg cgcagtgaac aggccgtaac tggaccgcgc 986941 ttgcgcaacg ttcgaaaagg gatccggtgg agcggcccga cgacaccaaa taggccatat 987001 cccccaaaga ctggtattga caaccgttct gatgccgcgt cagacttccc accacgccac 987061 ggaccgtcca acgccagaac tcaataccgt ctcgtcccag gcgaaaccgt gagcctagcc 987121 gatgatctcc tggcattggt cggactggac ttgatctgct cgctgacaag catacgtatc 987181 agtgctacga accgttcacg cggtgaacct gctgggcgca caaggagaat cgatggatta 987241 cgccaaacgc atcggccagg ttggggcgtt agccgttgtc ctgggggtgg gggcggcggt 987301 gactacccac gcgatcggct ctgccgcgcc gacggatccg agctcctcga gcaccgattc 987361 gccggtcgac gcgtgctcgc cgttgggtgg gtccgccagt tcgttggctg cgataccggg 987421 cgccagtgtg ccacaggtcg gcgtgcgaca ggtagacccc ggaagcatcc ccgatgactt 987481 gctcaatgcc ctgatcgact ttctggccgc ggtacgcaac gggttggtgc ccatcatcga 987541 aaaccgcact ccggtagcga atccgcaaca agtcagcgtc cctgaggggg gcaccgtcgg 987601 cccggtccgg tttgacgcct gcgaccccga tggcaaccgg atgaccttcg cggtgcgcga 987661 gcgcggtgca cccggtggac cccagcatgg catcgtgacc gtcgaccaac gaacggccag 987721 cttcatctac acagccgatc cgggtttcgt tggcaccgat accttcagtg tgaacgtcag 987781 cgatgacacc agcctgcacg tgcacggtct ggcgggatac ctgggtccgt tccatgggca 987841 cgacgacgtc gccaccgtga ccgtgttcgt cggcaacacc ccgaccgaca ccatcagcgg 987901 cgacttcagc atgctcacct acaacatcgc ggggctgccc ttcccgctat ccagcgcaat 987961 tctgccccgg ttcttctaca ccaaagagat tgggaagcgg ctcaacgcct actacgtcgc 988021 gaacgtccag gaggatttcg cctaccacca attcctcatc aagaaatcca agatgcccag 988081 ccagaccccg ccggagccgc ctaccttgct gtggcctatc ggtgtgccct tctccgacgg 988141 gctcaatacc ctctcggagt tcaaggtgca gcggctggac cggcagacat ggtatgagtg 988201 cacatccgac aactgcctca ccttgaaggg cttcacctac agccagatgc ggcttcccgg 988261 cggtgacacg gtcgacgtct acaacttaca taccaacacc ggtggagggc cgaccaccaa 988321 cgccaacctc gcgcaggtcg ccaactacat ccagcagaac tcggcgggcc gcgcggtcat 988381 cgtcaccggc gacttcaacg cgcggtactc cgacgaccaa agcgctctgt tgcaatttgc 988441 gcaggtcaac gggctcaccg atgcctgggt gcaggtagaa cacggcccca ccacaccgcc 988501 gttcgcgccc acttgcatgg tcggcaacga gtgcgagctg ctcgacaaga tcttctatcg 988561 aagcggccag ggagtgacgt tgcaggccgt cagctacggc aacgaggcgc cgaaattctt 988621 caattccaag ggtgagccac tgtcggatca cagcccggcg gtggtcggct tccactacgt 988681 cgcggacaac gtggccgtac ggtgacagcg gttgatcgcc aactggtttg ccgtcggcct 988741 caggcggtgg tgagtacccg ctcccagccg tcgaccgatt ccgggctgcg cgggcccggt 988801 cccacgtaaa tggccgacgg gcggaccagc ttgccgagtc gcttctgctc gagaatgtgg 988861 gcacaccagc cggcagtgcg cccacaggtg aacattgctg gcatcatgtt ggccggtacc 988921 cgggcaaagt ccaggaccac tgcggcccag aattcgacat tggtctcgat cgcccgatcc 988981 ggacggcgct ctcgcagttc tgacagcgca gcctgctcca ccgcgaccgc gacctcgtag 989041 cggggggcgc ccagccgctc ggcggccgcc cgcagcaccc gcgcccgcgg gtcctcggcg 989101 cggtagaccc ggtgcccgaa ccccatcagt ttctcgccgc ggtccaggat tcccttgacc 989161 acgctgcggg catcgccggc gcgttcgacc tcgtcgagca tcggcaggac gcgcgccggc 989221 gcgccaccat gcagcggtcc gctcatcgcc ccgattgcgc ccgacagcgc tgctgccaca 989281 tccgccccag ttgaggcgat cacacgcgcg gtgaatgtcg aagcgttcat gccgtgctcg 989341 gcggccgaca cccagtaggc gtcaatggcc tcgatgtgtc tggggtctgg ctcgccctgc 989401 cagcgcgtca tgaaacgtgc tgtgaccgtc gagcattcat cgatgattcg ctgcgggacc 989461 gccggctggt agatgccccg tgcggattgc gcgacatagg acagcgccat caccgatgcc 989521 cgggccagct gttggcgggc ggtggcgtcg tcgatgtcga gcagcggcgc atatccccag 989581 atgggcgcca gcatcgccag gccggcctgg acgtcgacgc gcacatcgcc ggagtgaatc 989641 ggcagcggga acggttcagc cggcggcagc ccgctgccga agttgccgtc caccagcagc 989701 gcccacacat cgccgaaggt gacccgctga cttaccaggt cttcgatgtc gacgccacgg 989761 tagcgcaggg ccccgccgtc tttgtccggc tcggcgatct cggtcgtaaa ggccaccacg 989821 ccgtcgaggc cggggacgaa attctccggg accactgtca tacgagaatt ctcacacctg 989881 gccccggcaa cgacgctacc ggctggtgcc aatcacggtg ccggcgatga gcgtgccgcg 989941 agaatcgtca cgagggtgag ccgcggcgtg ccgcctcgtc taccagttgt actcgggagg 990001 gcaagccaag tttggcgtag acgtgggtga ggtgggtttg cacagtgcgc ggcgagacga 990061 aaagccgttt tgcaatgtcc ttgttggata acccctcgct gaccaaccgc acgacgtcgc 990121 gttcggtcgg ggtcaacgag ccccacccgc gggccggtcg cttgcgttca ccgcgaccgc 990181 gttgtgcata tgcgatcgcc tcgtcggtgg acaaggcggc cccctcggcc caggcgcggt 990241 cgaaatcctc atcacccatc gcctcacgaa gcgccgtcac cgaggcctgg tagccggcat 990301 cccaaatctt gaagcggacc tgacgtgtct gttgccgaag ggcggctgcg gcaccgagaa 990361 ggcggacacc ttcggagtga ctgccgacct cgccggccag gccggcgagg agttccatgg 990421 catctggcat gccctggtag atgtgcagct cggcgccgca cgccagcgca gcatgagcat 990481 catcgcgcgc cagttctggt tcgccccgtg cggtggctac gcgcgcgcgt attgtcaacg 990541 ccaccattcg gtgccaccca ttggtcgcat cgacggcgtc gttggcgaac tgtcgtgcgg 990601 cgatcgcatc acctcctgcc agggctaact gcgccatcag gacctggtgc atggtcacct 990661 ggtcgggctg ggccctaaga atcggccgcg ccgcgtcgct ggcctcgagc gctgccgtga 990721 catcaccggc ggccagcgcg gcgtacgtca tcgccgcata accaatgcct tggtacacac 990781 cgcctaactc cgtcgcggct gcaatgcacg ccccggctat ggcgtgggcc gcgctggcgc 990841 cgcaatacgc cagcacctgg gcttgggtat ataggccgag aacctttgtc ggcacatcgt 990901 tggatgcctc ggcctcggca gtgatttccc tggatagctc gagggcttcg gtcagattgc 990961 cagcccacat ctgcgccaaa ctaagccaca agctgcagtg acgtgagacg aaccggtcgc 991021 cgatggtgtc ggccaggtcg cggcattctt ctgccgcggc tcgcaaagca ttcgggtcac 991081 ctgatatgca ggtccccacc ccccgccagt agaggatttg acacagcgtc catttgtcgt 991141 caatagcgcg tgccaggtcg gtcgcttcgg cgaaataggg cgcagcggcc tccgcgttgt 991201 agccactgct acagccgcag gcggtgagcg cccgcaccaa cgcggcgggg tcgcccacct 991261 cacgtgccat cgccagcgct tgttgtgcgg gagcgatgat gtcggtggcg cctaccggac 991321 tggtggccag ccaggtactg agcattgcct tgtcagcgag cgctcgcgcc cgtactgctg 991381 ttgacacagc gagccggtgg aacctttggt cttccaggat cgagttgaac caggacaacc 991441 cctcgcgcag gtgcgcccgc ccgaaccaga ttggttgcag cgaagatgcg agctgtaacg 991501 cttcggtgat atggccattt tcccggctcc aggcgaacgc ggcgcgcagg ttgtcgatct 991561 cggtctcagc ccgggcgaca agccgttggt gatcgttgtc cgcaggagtg ttgagtgagg 991621 cggccagcgc cgtgtagtag tcacggtgac gtgcgtgcac atcggcctcg ccggagtcgc 991681 ccagtttttc cagcgcgtac cgacgcaccg tttccagcag ccggtaccgc gtgcggccct 991741 ggcagtcgtc ggccaccacc agcgacttgt ctaccagcag ggtcagctga tcaagcaccg 991801 aaaacggatc caggtcgcta ccggcggcga ccgcccgcac cgcggcgagg tcgaacccgc 991861 cgacaaatgg cgccagtcgc cgaaacaaga tttgctcggt ctcggtcagc agtgcatgcg 991921 accaatcgat cgaggcgcga agtgtctgct ggcgctgcac cgcgccccgc acaccgccgg 991981 ccaacagccg gaaacagtcg tccagaccgt cggcaatctc gagcggtgac atcgaccgca 992041 cccgtgcggc agcgaactcg atcgccagcg gtatgccgtc tagccgccgg cagatctcgc 992101 cgacggccgc ggcgttgtga ttggcgatgg tgaacccggg ctgaactcgg ctggctcggt 992161 cagcaaacaa ttcgactgct tcgtcggtta tcgacatcga cggtacgcgc caggtgatct 992221 cgccggccat cccgatcggc tcccggctag tcgctaagat cgtcagctcc ggacaggccc 992281 ccaatagctc aacgaccaac gctgcgcacg catcgagaag atgttcacag ttgtccaaca 992341 ccatgagcat gcggcgattg ccgatgaatc ggcgaagact atccatggtt gaacggcccg 992401 gctgatcggg cagacccacg gcgcgcgcag ccgtggctgc gacgatcccg gattcagtga 992461 tcggggccag atcgacaaag cacaaaccgt cgcgaagttc ggatgcactc gcgatctgga 992521 ttgccagacg ggtcttgccg acaccgccgg ttccgcatag cgtcacgagc cggttctgcg 992581 ccaacagtgc ccgcacctca gcttatttgc gcacggcggc ccacaaatgt ggtgaactgc 992641 gccgggagaa tcgatgtcgg gctggatttg gccgtgcgca gtgggggaaa cttttcgcga 992701 atgtcggggt ggcacaactg catgacccat tcgggacgag gtagaccgcg cagcgggtgg 992761 cggccgagat cgacaagcca tgcatcggct gggagccggc cagtcactaa atcacctgtc 992821 gcagctgaca ggacaacctg acccccgtgt gccaaatcgc ggagacgcgc cgtccggttg 992881 atagtggggc cgacatagag ttcgtcgcgc aactgtacct cgcctgtatg aagacctata 992941 cgtagtcgga tcggcgcgag cgaggtccgc tgcagatcca gcgcgcatgc agcggcatcg 993001 ctagcgcgag tgaaagccgc aacgaagcta tcaccctcgt accgtttgac cggctgcacc 993061 ccaccgtgat tcgtgatagc ttccgacaca gtgtgatcca agtgcgcgat ggcggtcgcc 993121 atgtcctctg ggcacatttg ccataggtgg gtcgattcct cgacgtcggc taagagcaat 993181 gtcaccgtgc ccgtcggcgg caatctgctc acgtctaatc cctggttggc tataaggacg 993241 cgtctgcgtg ggggaacgaa ctcacatcgg ccaacatctg gtggagccgc atagcagcgg 993301 agcgaatggt accggagatc cagcgatcct agcgcagata tacgaaccct ggcgacgcac 993361 tttgcgcatg ttggcggatg atcttcgccc cgcaggatcg catggtcgat gtcgatgttg 993421 ggaggaaggc tgttatgaac tgcgttgaag agcacgatac gtgtctgacc actgctatca 993481 cgtcatcgca acaccttcgc ggcgccgcga agccaataag cacactacag ttcggggaag 993541 acacctggcc catcctcgaa acaggcctct cgcagcgatg ttcattaccg cccaaagaga 993601 ttgtcttcgg cgctgcacgg tgggcgctcg cggcggcccg cgggatgcta ccgcggccca 993661 cgaccgacag cccaccgcag cgtcagcgct acccgaagcg ctaccgattc ctggagcact 993721 cctgcctaga acgcgagatg cgtcgactat agaacagcgt cgcgtgtttg tctcggtagc 993781 tgctctgtat agtatgcgtt gcttaaccgc atgtgggagg gtgattttgg gctgttctgg 993841 ggggtcggag cgatgaccgg gcgatgtccg acggttgccg tggtcggagc gggtatgtcc 993901 ggaatgtgcg tcgcaattac gttgctgagc gcagggatta ctgatgtctg catctatgaa 993961 aaggccgacg atgttggcgg aacgtggcgc gataacacct atccaggtct gacatgtgat 994021 gtgccgtccc ggctctatca gtacagcttt gccaagaatc cgaactggac ccagatgttt 994081 tcacgcggag gcgaaatcca agattacttg cgtgggatcg ccgagcgcta cgggctgagg 994141 caccggattc ggtttggcgc cacggttgtc agcgcccgat tcgacgacgg ccggtgggtg 994201 ttgcgcaccg attccggaac ggagtcgaca gtagacttct tgatttcggc caccggcgtt 994261 ttacatcatc cccgaatacc gccgatcgct ggtttggacg acttcagggg gacggtgttt 994321 cactcggctc gctgggatca cacggttccg ctgctgggac gccgaatcgc ggtgatcggt 994381 accgggtcca cgggcgtaca actcgtctgc ggcctggctg gggtcgcggg taaagtcacc 994441 atgttccagc gcaccgcaca atgggtgctg ccgtggccta accctcgata ctcgaagctg 994501 gcgcgtgttt tccaccgcgc ttttccgtgt ctgggttcgc tggcctataa ggcatatagc 994561 ctttccttcg aaacgttcgc ggttgcgctc agcaatccag gtttgcaccg aaagctggta 994621 ggggccgtgt gtcgcgccag cttacgtcgg gtgcgtgacc cccgactgcg tcgggcactg 994681 acgcctgatt acgagccgat gtgcaaacgg ctagtgatgt ccggcggatt ctatcgggcg 994741 attcagcgtg acgacgtcga attagtcacc gccggtatcg atcacgtcga acatcggggc 994801 atcgtcaccg atgatggtgt gttgcacgag gtggacgtca tcgtgcttgc cacggggttt 994861 gactctcatg catttttccg gccgatgcag ctgaccggtc gcgacggcat caggatcgac 994921 gatgtgtggc aagacggtcc gcatgctcat caaaccgtcg caatacctgg atttccgaac 994981 ttctttatga tgttggggcc acacagccca gtgggaaact tcccgctgac agcggtcgcc 995041 gaatctcagg ctgaacacat agtgcagtgg ataaagcgat ggcgccatgg tgaattcgac 995101 accatggaac cgaagtcagc tgctaccgaa gcatataaca cggtgttgcg ggccgcgatg 995161 ccgaacaccg tctggaccac cggctgcgac agctggtacc tgaacaaaga cggtattcct 995221 gaggtttggc catttgcacc ggccaaacac cgcgccatgc tcgctaacct acatcccgaa 995281 gaatacgacc tgcgacgcta tgctgcggtg cgcgcaacta gtcggcctca aagcgcttga 995341 agcctatcga ggtgctggac ggtgacgttc gcgcgggatc ggccactaat cccgttctga 995401 cggcgctgac aaaggttata gcggtgacca ttggcgcagc ttcggtatcg gcttcgggca 995461 ccgctcggcc gacgcggcgc agatactcgg ccaatggagt agcggtcgcg cgccagcctc 995521 gctcatcgaa ccattccgtg gcccgcgccc accgctcgtt gtagaccatt tgaaagaacc 995581 tgcgcgggtc gccttgggcg ttcgcggcgc gttctcgttc gagcttcgct gcgaattcgc 995641 aagggtccag tggagttgct tcctcgacag caacgtggct accggggcta gccaaggtgt 995701 cgatgccgat aaacaggcgc tgctgggcct cggccgagag atagaccagc aggccctcgg 995761 cgatccaagc cgacggccgg ttggcatcaa atccgttgtt acacaaggct atctgccact 995821 catcgcgcag atcgacagca accgaccgac gttgggctcg cggccgtatg tgatagtcgg 995881 cgagcaccgc gttcttgaag tcgaggacct gaggtcgatc caactcgaag attgttgtcc 995941 cgattggcca ttgcaatcgg aatgcacggg aatccaatcc tgcagccaag atgaccacct 996001 gcttcatgcc ggcggccgtt gcccgggaga aatactcgtc gaaatacctg gtgcgggcac 996061 cttggaagtt gacgaaatgc tcaccgaagt ccccggttgt cagatagtga tcgggcagct 996121 tgccgtccaa tacgtcggcc cattcaccac ctgcggcacg gcagaaaacc tcggcatagg 996181 gatcgatggc cagcggatcg gccttctgcg tctccaatgc tcttgcggcg gctaccaata 996241 gtcctgtcga accaacactc gtggtgacat cccagctatc gtcctcggtc cgcattcatc 996301 gaactctagt tgctccagtc cgcccaccgc tgtcggtatc ccagcgcagt cggccgtgca 996361 cacatatctg cgcggtggac ttggtacttc tacgcgcatt cgccgatgtt ttgcgatccg 996421 cggcgggtct atggtgccat ttatgtgcca ggatcggtct tcaataacaa cgtcgcgaag 996481 cgaggggtcg tgacgtgaga gggctcgctt atgccggcgg tggatgccca gtagggcgac 996541 ggtccaggaa ttctcagaca gttatccgtt ctgccacaat ggattccggc cgatcatgat 996601 gccaaagatc gtctccgtcc aacattccac tcgccgccac ttgacgagct ttgtcggtcg 996661 caaggctgag ctgaacgacg tgcggcggct cctgtccgac aaacgactgg tgacgcttac 996721 cggtccggat gggatgggga aatcccgtct cgcgctgcag atcggcgccc agattgcaca 996781 cgaattcact tatggccgtt gggattgcga cttggctacg gtcactgacc gagactgcgt 996841 gtccatctcg atgctgaatg ccttgggctt gcctgtccag ccgggtttgt ctgcgatcga 996901 cacgctcgtc ggtgtcatca atgatgctcg ggtgctgctg gtgttggacc attgtgagca 996961 tttgctggac gcgtgtgccg caataattga ttcgctgtta cgttcctgtc cgagattgac 997021 gatcctgacg acaagtaccg aagcgatcgg gttggcgggc gagctgacct ggcgggtgcc 997081 cccgttgtcg ctgaccaacg atgccatcga gctgtttgtc gaccgggcac gccgagtgcg 997141 gtcggatttt gcgattaatg ccgataccgc ggtgacggtc ggggaaatct gccgacgctt 997201 ggacggtgtg ccactggcga tcgagctggc cgcggcgcga acggacacct tgtcgccggt 997261 ggagatcctt gctggtctaa atgaccgatt ccggctggtg gccggtgctg cgggcaacgc 997321 ggtgcgcccc gaacagacgc tgtgtgccac ggtgcaatgg tcgcatgctc tgttgagtgg 997381 acctgagcgt gcgttgttgc accggttggc agtcttcgcc ggcgggttcg accttgacgg 997441 cgcccaggcg gtcggtgcca atgacgagga cttcgagggc taccagacac tcggccggtt 997501 tgccgagttg gtggacaagg catttgtcgt cgtcgaaaac aacaggggcc gagcgggata 997561 ccggttgctg tattcggtgc gtcagtacgc gttggagaag ctcagtgagt cgggagaggc 997621 cgacgccgtg cttgcgcgtt accgcaagca cctcaaacaa cccaaccagg tagtgcgtgc 997681 tgggtcaggc ggggttcggt actgatgcgt gaacgtagct taaccgtcgg tgggaattga 997741 ccgcgccacc catagcagtc gagaggaaca cccgcagcaa agtgcgccaa caacaggagg 997801 ctgacgtcgt tgccctgggt cgaaagccag ggctgctatg tgtgccggaa aggttccgtg 997861 caatggatct tccgatggca gccgccgatg ccttattcct atgggccgag acgccgacgc 997921 ggccgctgca tgtcggcgcg ttggccgtgc tgagtcagcc cgacaacggg accgggcgtt 997981 acctgcgcaa ggtgttctcc gccgcggtgg cccgtcagca ggtggcgccg tggtggcgcc 998041 gacgcccgca ccggtcgctc acctcgctcg ggcagtggtc ttggcgcacc gagaccgagg 998101 tggacctgga ttaccacgtg cggcttagcg cattgccgcc acgggccggt accgccgagc 998161 tgtgggcgtt ggtttctgaa ctacacgccg gcatgctgga ccgctcccgc ccgctatggc 998221 aggtggacct gatcgagggt ctacctggcg ggcggtgcgc ggtctacgtc aaggtccacc 998281 atgcgctggc ggacggagtc tcggtgatgc ggcttttaca acggatcgtc accgcggacc 998341 cgcatcagcg tcagatgccc accttgtggg aggtgccagc gcaggcgtcg gtggccaaac 998401 acacggcacc gcgcggttcg tcgagaccac tgacgttggc caagggggtg ctgggtcaag 998461 ccaggggcgt cccgggcatg gtgcgcgtag tggccgatac cacgtggcgg gcagcgcaat 998521 gtcgcagcgg gccgctgaca ctggccgcac cacacacccc gctgaacgag ccgatcgccg 998581 gggcccggtc cgtggcaggt tgttcctttc cgatcgagcg gctgcgacag gtcgccgaac 998641 acgccgatgc caccatcaac gatgtcgtgc tggccatgtg cggcggggcg ttacgtgcgt 998701 acctgatcag ccggggagcg ttaccgggtg cgccgctgat agcgatggtg ccggtttcgc 998761 tgcgcgatac cgcagttatc gacgtgttcg gccagggtcc aggcaacaag atcggtacgt 998821 tgatgtgttc gctggcgacg cacctggcca gtccggtcga acggctgtcg gcgatacggg 998881 caagtatgcg cgacggcaaa gccgcgatcg ccggccgaag ccgaaaccag gcgctggcta 998941 tgagcgcatt gggcgccgcc ccgctcgccc ttgcgatggc cctggggcgc gtgcccgcgc 999001 cgctgcgccc accaaatgtg acgatctcca acgtgccggg cccgcagggc gcgctgtact 999061 ggaacggcgc tcgcctggac gcgctctacc tgctctcggc acctgtcgat ggcgcggcgt 999121 tgaacatcac ctgtagcggc accaatgagc agatcacttt cggtttgacg ggctgccgtc 999181 gtgccgtccc cgcgctgagc atcctgaccg accagctcgc ccacgaactc gagctactcg 999241 ttggcgtcag tgaagccggc ccagggacca gacttcgaag gatcgcaggg cgccgttaaa 999301 cggacgccgc gagtcatcac ccggccgagc gcgcagcggc ttaccttacg cgcggccgcc 999361 catggtgcca gagaccccac cccgggcagg cgggtcatcc cgatagcgac taccttcagc 999421 tataagcact tagtggggca gccatatcag ccaaagcgcg aaggggttct cgtggccgac 999481 accgacgaca ccgcaaccct ccgttacccg ggaggcgaga tcgacctgca gatcgtgcac 999541 gccaccgaag gcgccgacgg cattgcgctc gggccgctgc tggcaaaaac cgggcacacc 999601 acgttcgacg tcggcttcgc caacacggcc gccgctaaaa gctccatcac ctacatcgac 999661 ggagatgccg gcattctgcg ttatcgcggc tacccgatcg accaactggc ggagaagtca 999721 accttcatcg aggtctgcta cctgttgatt tacggcgagc tgcccgatac cgaccagctt 999781 gcccagttca ccggccggat ccagcgccac accatgctgc acgaggatct caagcggttc 999841 ttcgacggct ttccgcgcaa tgcccacccg atgccggtgt tgtccagcgt ggtcaatgcg 999901 ctgtcggcgt actaccagga tgctctggac cccatggaca acggtcaagt cgagctgtcg 999961 accattcggc tgctggccaa gctgcccacc atcgccgcgt acgcctacaa gaaatcggtc 1000021 ggccagccct tcctctaccc agataactca ctgacgctgg tggagaactt cctacggttg 1000081 acgttcggat ttcccgccga gccctaccag gccgaccccg aggtggtgcg ggcgctggac 1000141 atgttgttca tcttgcacgc cgaccacgag cagaactgct cgacgtcgac ggttcggctg 1000201 gttggctcgt cgcgagccaa cctgttcacc tcgatctcgg gtggcatcaa cgcactatgg 1000261 ggtccgcttc atggcggcgc caatcaggct gtcctggaga tgctcgaggg cattcgcgac 1000321 agcggcgacg acgtcagcga gtttgtacgc aaggtcaaga accgcgaggc cggggtcaaa 1000381 ttgatgggtt tcggtcatcg tgtctacaag aactacgatc cgcgggcccg catcgtcaag 1000441 gaacaggccg acaagatcct ggccaagctc ggcggcgatg actccttgct gggcatcgcc 1000501 aaggagctcg aagaggcggc gctgaccgac gactacttca tcgaacgcaa gctttacccc 1000561 aacgtcgact tctacaccgg cctgatctac cgggccctcg gcttcccgac caggatgttc 1000621 accgtgttgt ttgccctggg caggcttccc ggctggatcg cgcactggcg tgagatgcac 1000681 gacgagggcg acagcaagat cggccggccc cgccagatct acaccggcta cacggagcgc 1000741 gactacgtca ccatagacgc gcggtaggcc ggcgagcaga cgcaaaagcc ccctaaaccg 1000801 gcaggtatta ggggcttttg cgtctgctcg ccaggcaagc cagcactgcc atcgcggcgt 1000861 tgtgaccgcc gatgcccgac accgccccgc cgcgacgggc acccgagccg cacagcatga 1000921 tccgctcgtg gtcggtggct acgccccact gccgtgccgg tgtgtccagc ggatcgtcgt 1000981 tgtcagcgaa cggccaggac aacgcaccgt ggaagatgtt gccgccggtc atcccaagcg 1001041 tccgctgcag gtccagggtg gtcgtcgtct cgatgcatgg cttgctctgc gcatcggtcc 1001101 aaagcacgtc ctgaatcggt tcggccagaa cggaattcag cgacgctagg acggctgccg 1001161 tcagccgttc ggctaagcct tcggtgtcgc cgaacaccga gtgcggtgtg tgcaagccga 1001221 acaccgtcag cgtctgagcg ccggcatcgc gcaaccgggc ggacaggatg ctcgggtcgg 1001281 tcagcgaatg gcagtaggct tcgcagggta ggggatccgg caaccgcccg ctggctgctt 1001341 gcgagtacgc ggcatccaat tggctccatg tctcgttgac gtggaacgtc ccggcaaatg 1001401 cttgctgcgg tgtgacactg tcgtcgcgca accgggggag tcggcgcacc accatgttga 1001461 ccttgacctg tgcgcccggg gccagtgccg caaccggttc accgagcagg ctggccagca 1001521 ccgccggtgt gaccccgacc agaacgaacc ggccccggac caaatgctcg gcaccgtcgc 1001581 taccgtcgct gtggtagcgc accgtaccgt ctggatcaag ggcgaaaacg tctgcaccgg 1001641 tgactatttc ggcgccgtgg cgggcagctg ccgtggccag ggccgaggtc accgacccca 1001701 tgccgccgat tgggacgtgc cagactccgg tgcccccacc gaccaggtga tacaggaagc 1001761 agatgttctg catcagcgac ggttcgtgca tgcgggcgaa ggtgccgatc agcgcgtcgg 1001821 tggcgatcac cccgcgtagc aggtcattgg ccaccgcgcc ggcgatggca tgcccgatcg 1001881 gctcgtcgac catggcttgc caggcagcgg ccgcctcgtg gccgccgtat tccacaatgt 1001941 cgcggcgggc ctgctcgcgg gtgcgcagcg gctcgatcag ggtgggccac agccgtgcgg 1002001 tcaccagccg gcagcgccgg tagaacgcgg cgaagccgtg cgcatccggc gcggcgccga 1002061 tcgccgcgag gtgcgctgcg cgtggttcgc cggtgggccc gatgagcagg ccagagcgcc 1002121 cggccgtggc tggggcaggg gtgtatgagg aaaatggccg ccgcgccaac cgcaccggag 1002181 cgccgaggtc ggcgacgatg cgcgacggca gcaagctgac caggtacgag tagcgtgaca 1002241 gcgcgacctc gacaccgtcg aaggcctgta tcgacaccgc ggccccccca gtctgtgcca 1002301 gccgctcgag cagtcgcact cgaagcccgg cccgggccag gtaggcggcc gcgaccaagc 1002361 cgttgtgacc gccgccaacc acgacaacgt cgaagtccct gtcgtgatcg ctcatagtga 1002421 cggcggctat cgagacggat ctagccggtg tacccctcga cttggtcggc gggacgcacg 1002481 actgcttcgc gcgggtcacc accggtttgg cgcaatgccc gtcgctgtcg gagcaggtcc 1002541 cagcactggt cgagttcgat ctcgatgcgg cgcagttgct gctgctcctc ggactcgctg 1002601 atgccaccgt gccgcagctg cgctcgcaac gccttctcct cggccaccag gtcacggatg 1002661 tgtgccaggg tctcgctgtc tgtcggtttg cgtcccttgc ccatggctcc agtgtgcccg 1002721 atttgacgcg gtgtcccggc accgactcgg taggctgcat atcgcctgca gcacggacga 1002781 gacgcgttcg acgacctgag ggagtggcgt agtggcttct aaggcgggtt tgggccaaac 1002841 acccgcgacc accgacgcgc gacgaactca gaaattctac cggggctcgc cgggccgtcc 1002901 gtggctgatt ggcgcggtgg ttattccgtt gctgatagcg gcaatcggtt acggtgcatt 1002961 cgagcggccc cagtccgtta ccggaccgac cggtgtgttg ccgacactga caccgaccag 1003021 cacccggggc gcttctgcgt tgtccttgtc tttgctgtca attagccgca gcggcaacac 1003081 cgttactctg atcggtgact tccccgatga ggccgccaag gcggccttga tgacggcgct 1003141 caacggcttg cttgctccgg gcgtgaacgt catcgaccag attcacgtcg atcccgttgt 1003201 gcgatcactt gatttctcaa gtgcggaacc agttttcacc gccagcgtgc cgattcctga 1003261 ttttggcctc aaagtcgaaa gggacaccgt caccttgacc ggaactgccc cttcatccga 1003321 gcacaaggac gcagtgaagc gcgcggcgac cagcacctgg cctgacatga aaatcgttaa 1003381 caatattgag gttacggggc aggcaccgcc aggacccccg gcctccggcc catgtgccga 1003441 cctgcaatca gccatcaatg ccgtgacggg tggacccatc gcgtttggca acgacggggc 1003501 tagtctgatc ccagccgact atgaaatcct gaaccgggta gccgacaagc tcaaggcatg 1003561 tccggacgct cgggtgacga tcaacggcta caccgacaac accggcagcg aaggtatcaa 1003621 tatcccgttg agcgctcagc gagccaagat agtcgccgac tacctggttg cccgcggagt 1003681 tgccggcgat cacattgcca ccgtgggtct cggttcggtg aatccgatcg ccagcaacgc 1003741 cacacccgag gggcgcgcca agaatcgtcg cgtcgagatc gtggtcaact aaggagaacc 1003801 cagcatggat tttgtgatcc agtggtcgtg ctacctgctg gcgttcctgg ggggctcggc 1003861 tgttgcctgg gtagtcgtca ctctgtcgat caagcgcgcc agccgtgatg agggtgctgc 1003921 ggaggcgccc agtgcagccg agacaggcgc acagtgatgg aacacgtgca ctggtggctg 1003981 gcgggcctgg cgttcacgct cgggatggtg ctgacgtcga cgctgatggt ccggcccgtc 1004041 gaacatcaag tgctggtaaa gaaatcggtc cgcgggtcaa gcgctaagtc caagccgcca 1004101 acggcgagaa aacccgccgt caagtcgggc accaagagag aggagtcgcc gacggcgaag 1004161 accaaggtgg caacggagtc tgctgcggag cagatcccgg ttgccgggga gcccgcggcg 1004221 gagccgatcc cggtcgccgg cgagccggcg gcgcgtattc cggtggttcc gtacgcgccg 1004281 tacggcccgg gctcggcgcg cgctggtgcc gatggcagcg gaccgcaggg gtggctggtg 1004341 aagggccgct cggacaccag gctctactac actcccgaag atccgacgta cgaccctact 1004401 gtcgcccagg tttggttcca ggacgaggag tcggcagcgc gggcgttttt cacgccgtgg 1004461 cgcaagagca cacggcggac atgaggtcag ggccgcaggg ctaactgggc ccgggaaggc 1004521 gcaacacgag gcgcgcgcca cccagcgggc tgttctccag cgacgcggtg ccgccgtgca 1004581 actgggcctg ttgggccacc aacgccagcc cgagacccga ccccgaatga gatgccgtgg 1004641 acccgcggga gaaccgctcg aacaccactt ggcgctcacc ttcgggcact ccgctgccgt 1004701 tgtcgtcgat ggcgatctcc acgccggccc gcgagctgac cgcggagagt tgaaccaggg 1004761 tggcgccgcc gtgcttgacc gcgttggcga tggcgttgtc gacggccagg cgcaacccgg 1004821 ccggcaaacc cacgatgatg caggtcggcg acggcaccag cgatacatcg agatcggggt 1004881 agatccgggc cgcgtcgtgg gcggcgcggt cgagcaggtc ggtgatatcg accggcacgt 1004941 gatcgtccga ggtcgacagt tcgccctggg ccaaccgctc cagcgcgctc agggtggcct 1005001 caatgcgcga ctgggtgcgg atgacgtcgt tgagcacttc tttgcgctgg tcgtcgggca 1005061 gatccagggt ggacagcacc tccaggttgg tgcgcatcgc ggtcagcgga gtgcgcagct 1005121 cgtgggagga caccgccgcg aagtcacgcg ccgacgcaag cgcctccttg gttcggttct 1005181 gctcgttcca gatgcgctgc agcatgccgc gcatcgcctc ggcgatctcg atggcttcgc 1005241 tggcgccgtg tacttccacg cgtggcgcct cgtcgcccgc gtcgatggac cgggtctgct 1005301 cggcgagctg cttgaacggg cgtaccgcga acgcggccaa cagccaggcg aacaccgccg 1005361 ccgcgccgat ggcgaaggta cagatcagca gcacccggcg gtgcaggttg ttggtctcgg 1005421 ctacggtggc gtcatacgtc gcgcccaccg ccaccgacgt cggctcgggc ccggggatct 1005481 ccaccgtgcg cacgcggtag cgcaccccgc ggacgtaggt gtcggcgtag tcgtcttgca 1005541 gtttgggcag cgtgatgtcg gaattcgact tgatcacgtt gccacggcgg accgtgatga 1005601 gggcgtcctg gtcgttcggt gagcgcggga tctcgtcgag gccacgcggc acgaacggga 1005661 tcgcgaaacc cgcggcctcg tcgagccggc ggtccagccg ctccttgcgg tcgttggtga 1005721 tcccgaccca gacgacggtg ccgacaatga gtaccgggat cgcggcgccg atcgccgtcg 1005781 cgaccaccac ccgggttcgc agcgagggcg tacgggcgaa gatccgcgac agaatattca 1005841 tgcatgcccc gtcactgcat acgcagcacg aatccgactc cgcggacggt atgcagcagc 1005901 ctagggccac cgccggcctc cagtttgcgc cgcaggtacc cgatgaagac gtccaccacg 1005961 ttggtgtcgg cggcgaagtc gtagccccac accaattcca ggagttgcgc tcgggagagc 1006021 accgcggtct tgtgctcggc cagcaccgcg agcaggtcga attcgcgctt ggtcaggtcg 1006081 acgtcgacgc cgttgacccg ggcccgccgg ccggggatgt ccacctccag cgggcccacc 1006141 gtgatggttt ccgaggacga cgttgcagtg gagccgcggc ggcgcagcag cgccttcacc 1006201 cgtgccacca gctcggccag cacgaacggt ttcaccaggt aatcgtcggc gccggcctcc 1006261 aatccggcca ctcggtcatc gacagagctg cgtgcggata gcacacagac cgggacgtcg 1006321 ttgtccatcg cgcgtagtgc cgtcacgacg ctgactccat cgagcactgg catgttgatg 1006381 tcgagcacga tcgcgtccgg ccggttctcg gtggcgctgc gcaaggcctc ggcgccgtcc 1006441 accgcggtcg ctacctcgaa tccggacagc cgtaagccgc gttccagcga ggcgagcaca 1006501 tcggagtcgt cgtcgacgac caacacccga ggtgaggtca caccagtgtc catgccgccc 1006561 attttgcctg attaccgtcc agcagggtgg gagggtgagc cgccgggtcg cgtgctgggc 1006621 gagcagacac agagtcgcat caaaaccgcc gattttgtgc gactctgtgt ctgctcgcgg 1006681 ggtgcgcgcg ggttagtcgc ggggcaaccc gatccggcgg tagcgttgca accgagtcgc 1006741 gaggcgttcc ggggccggta tcttccgtaa cgcgtgcact tcggcggcga tggcgttcga 1006801 cagtcgtagg gcgaactcga tcggctcgtc tgcggcgtcg gggtactccg gcacgatggt 1006861 gtcgacaatc cccgacttca gtaggtcggc cgaccggatg ccttgggcgg cagcgagttc 1006921 ggcggcatga gcagtgtctc ggaacacgat cgcgctggct ccttcgggag gcaagggcgc 1006981 cagccagccg tggagtgcgg ccagcacccg gtcggcgggc aacatcgcca gcgccggccc 1007041 gccgctgccc tggcccagca ggatcgacac ggtcggggta tccagcgtga cgagctcggc 1007101 caggcaatgc gcgatctggc cggccagccc gccctgttcg gctgcggccg acaacgcggg 1007161 tccggccgcg tcaatgacca gcaccagcgg caggcacagc tcggcggcga gcgccatccc 1007221 gcgtcgggct tcgcgtaacg cagcgggccc gacagtgctt cccccgccgc ctactgccct 1007281 ttgctggccg aggaccaccg tgggttggcc gccaaagcgg gccagcgcca gcagcgtggt 1007341 cgccgcttcg ccttgatcgg ttcctgacaa caacacccgg tcggtggcgc cgtgtcgcag 1007401 tagctgcctg acgcccggcc ggtccggccg gcgcgatgcc accaccgagt cccacgtggg 1007461 cacatcgggt acgggcgcgg gcgtctgcgg tgccggaagc ggttcgggag cgtcgatgag 1007521 caccgtcaac gcacgatcca gcatcggtcg tagccggtcc agtgcaacga cgccgtcgat 1007581 gatcccatgc cgccgtagat tctcggcggt ttggacgccg gatgggaagg ggtcgccata 1007641 gagcaactca tagacccgtg gtcccagaaa gccgatcagg gcgcccggct cggcgacggt 1007701 gagatgcccc agcgagcccc acgacgcgaa aactccaccc gtggtcggat ggcgcaaata 1007761 gaccaggtag ggcaggcgcg cctggttgtg cagctggatg gccgcagcga tcttcaccat 1007821 ctgcagaaac gcgaccgtgc cttcttgcat gcgggtgcct cccgagcttg gtgacgccag 1007881 tagcggcagc cgctcggcgg tcgcccgctc gacggcggcg gtgatccgtt cggccgctgc 1007941 caccccaatc gagccgccca ggaagtcgaa ctcacaggcc accacggcca cccgccgccc 1008001 gaatacgcgt ccctcaccgg tctgcaccga ttcgtccgcg ccggtggccg cccgagcggc 1008061 ggccagctcc cgcgcatagg agtcggctac cggcaccgcc agcggctcgc tatcccagct 1008121 gacgaaagat ccccggtcta gcaccgcgtg ccgcagttgg tcggtcgtga tacgactcac 1008181 gcgatgaggc tatataggct gacccaatga tcggtatcac ccaggcagaa gccgtgctga 1008241 ccattgagct gcaacgcccg gagcgccgca acgccttaaa ttcccagctg gtcgaggagc 1008301 ttacgcaggc catccggaaa gccggggatg gatcggctcg ggcgatcgtg ctgaccggcc 1008361 aaggcaccgc gttctgcgct ggcgcggacc tgagcggaga cgcattcgcc gccgattatc 1008421 ccgaccggct catcgagctg cacaaggcga tggacgcctc cccgatgcca gtggtcggcg 1008481 cgatcaacgg tcccgccatc ggcgccggct tgcagcttgc catgcaatgc gacctgcggg 1008541 ttgtcgcgcc cgatgccttc ttccagtttc cgacgtcgaa atacggtctg gccctggata 1008601 actggagcat ccgccggctg tcgtcgttgg ttgggcacgg acgtgcccgc gcgatgctgc 1008661 tcagcgcgga aaagctgacc gccgagatcg cactgcacac cggaatggcg aatcgcattg 1008721 gcactttggc cgacgcccag gcctgggccg ccgagatcgc caggctggca ccactggcta 1008781 tccagcacgc caagcgggtg ctcaacgacg acggcgctat cgaggaagcg tggccggccc 1008841 ataaggaact cttcgacaaa gcctggggca gccaggatgt catcgaagcg caggttgccc 1008901 ggatggaaaa gcggccgccg aagttccaag gggcttaacc gtcatggtgc gccgagcgct 1008961 acgactggcg gccggcaccg cctcgctggc cgccggcacg tggctgttgc gtgcgctgca 1009021 cggcacgccg gccgcgctcg gtgccgacgc ggcgtcgatc agggctgtgt cggagcaatc 1009081 gccgaactat cgtgacggcg ccttcgtcaa cctggatccc gcgtcgatgt tcaccctgga 1009141 tcgcgaggag cttcggctca tcgtgtggga gttagtggcc agacacagtg cgagccggcc 1009201 ggcggcgccg atcccgttgg cctcgccgaa tatctaccgg ggtgacgcca gccggctcgc 1009261 cgtcagctgg ttcggtcact cgacggcgct gctggaaatc gacggctacc gggtgcttac 1009321 cgatccggtg tggagcgatc ggtgctcacc gtccgacgtc gtcggccccc agcgcctgca 1009381 tccgccgccg gtgcaactgg cagctctccc ggccgtcgac gccgtggtca tcagccacga 1009441 ccactacgac catctcgata tcgacaccgt ggttgcgctg gtcggcatgc aacgggcccc 1009501 gttccttgtg ccgctcgggg tcggcgccca ccttcggtcg tggggtgttc cgcaggatcg 1009561 cattgttgag ctcgactgga accagagcgc tcaggtcgat gagctcaccg tggtctgcgt 1009621 gccggcacgg cacttctcgg gacggttcct gagccgcaac accacactgt gggcctcgtg 1009681 ggcgtttgtt gggccgaacc atcgcgccta cttcggcggt gataccggat acaccaagag 1009741 cttcacccag atcggcgcgg accacggacc gttcgacctg accctgctgc ccatcggggc 1009801 ctacaacacg gcgtggccgg acatccacat gaaccccgag gaggcggtcc gggcgcacct 1009861 ggacgtcacc gattcgggct cgggaatgct ggtgccggtg cactggggca ccttccggct 1009921 ggccccccat ccgtggggcg agccggtcga gcggctgctc gcggcggctg aacccgagca 1009981 cgtcacggta gccgtgccgc tacccggtca gcgggtcgac ccgaccgggc ccatgagatt 1010041 gcacccatgg tggcggctgt aattccccgc agcgcccggc taatggtgct agggggcgag 1010101 ccgaggcgat caaaccaccg agtgttccgg ccgcgttggc tactatctgc ggccatgacc 1010161 aaacgagcgg caacggccgc catggtgatg ttgctgacgt taacggttgc ggatccacgc 1010221 accaggcact tggcccgccg tccgggttgc ccgatgcctc tcccaatgag aggtcagcga 1010281 tacagatccc cgctggccgc atcgacgatg ccgtggcaaa ggtcgacggc ctggtcggcg 1010341 agctgatgca gaataccggc atacccggaa tggcagtggc gatagtccat ggcggaaaga 1010401 cgttgtatgc caaagggttc ggtgtcagag acgtgggcaa aggtggtggt ccggacaaca 1010461 aggtggacgc cgacaccgtc tttcagttgg cgtcggtgtc caaatcggtc ggcgccacgg 1010521 tggtggcgca tgcggtaacc gacaacgtcg tgacctggga tacgcccgtc gtatcgaagc 1010581 tgccgtggtt tgcccttcgc gatccctacg tcaccggcca ggtaaccatt gctgacctct 1010641 actcgcatcg ctccggcctg cccgaccatg cgggcgatct gttggaggat ttgggttatg 1010701 accgtcgaca ggtactgcag cggctgaaat acctgccgct ggcaccgttt cgaatcagct 1010761 atgcctacac caactttggt gtgaccgcgg cggccgaagc ggtcgcggcc gcggccggcc 1010821 agtcctggga ggacctgtcc gacgaggtgc tctaccgccc gttggggatg gggtctacga 1010881 gttcccggtt caccgacttt ctggccaggc ccaaccatgc ggtcaaccac gtcaaggtcg 1010941 cagaccgatg ggaggcgcgc taccagcgcg atcccgacgc ccaatcacct gcgggcgggg 1011001 tgagttcgtc tcttaacgac atgacgcact ggctggccat ggtgctggcc gacggcgtgt 1011061 acaacggccg tcggatcacg tcgccggagg ccctgctccc cgtctacacg ccgcaggtga 1011121 tctctcgaca cccggtgtca ccgagagcgc gggccagctt ctatggctac ggattcaacg 1011181 tgggggtaac ctcttcggga cgcaccgagt acagccattc cggcgccttc gggctgggtg 1011241 ccgcggcgaa tttcgtggtg ctgccctccg aagacctggc catcatcgcg ctgaccaacg 1011301 ccgggcccat cggcgtgccg gagacgctga ccgccgaatt catggacttg gtgcagtacg 1011361 gccaggtacg cgaggactgg gcggccctgt acaagaaggc atttgccccg ctgaacgagc 1011421 tcgcgggctc gctggtcggc aagcaatccc cggccaaccc agcgccgagc agaccgctga 1011481 acgactacgt cggcgtgtac gccaacgact actgggggcc cgccaccgtg acctaccacg 1011541 acggccaact gcgcctgtcg ctggggccga agaaccagac gttcgatttg acgcactggg 1011601 acggcgacac tttcacgttc acgttgtcga ccgaaaacgc attgcccgga tcgatttcca 1011661 aggccacctt cgccggcgac acgttaaacc tggaatacta cgacgccgac aagctgggaa 1011721 cgtttacccg atgacccgtt cggcttcggc gacagccggt ttgaccgatg ccgaagtggc 1011781 gcaacgggtc gccgaaggca agagcaacga tatcccggaa cgggtcaccc gcaccgtcgg 1011841 gcagatcgtc cgggccaacg tattcacgcg gatcaacgcg attctgggcg ttttgctgct 1011901 catcgtcttg gcgacgggct cgttgatcaa cgggatgttc ggcctgctca tcatcgccaa 1011961 cagcgtcatc ggcatggtcc aggagatccg tgccaagcag acgctggaca aactcgcgat 1012021 catcggacag gcgaaaccgt tggtgcgcag gcaatccgga acgcgcacgc ggtcgaccaa 1012081 cgaggtggtg ctggacgaca tcatcgaact tgggcccggg gaccaggttg tcgtcgacgg 1012141 cgaggtcgtc gaggaggaaa acttggagat cgacgaatca ttgctgaccg gcgaggccga 1012201 cccgattgcc aaagacgctg gcgataccgt gatgtcgggc agtttcgtcg tctccggtgc 1012261 cggcgcctac cgcgccacca aggtcggcag cgaagcatat gcagccaaac tggccgccga 1012321 ggccagcaag ttcaccctgg tgaaatccga attgcgcaac ggcatcaaca ggattctgca 1012381 gttcatcact tacttgttgg tgccggccgg cctgctgacc atctacaccc agttgttcac 1012441 cacacacgtg ggatggcggg aatccgtgtt gcggatggtg ggcgcgctgg tgccgatggt 1012501 tcccgaaggc ctggtgctga tgacctcgat cgccttcgcc gtcggggtgg tcaggctcgg 1012561 ccagcgtcaa tgcctggtgc aagagttgcc cgccatcgag gggttggcgc gggtggacgt 1012621 ggtctgcgcc gacaagaccg gcacactgac cgaaagtggc atgcgggtct gcgaggtcga 1012681 agagctcgac ggggctggtc gacaggaaag tgtcgccgat gtgctggccg ccctggccgc 1012741 cgccgacgcc cgtcccaacg cgagcatgca ggcaatcgcc gaggcctttc actcgccgcc 1012801 gggctgggtc gtggccgcga acgcgccttt caagtcggcc accaagtgga gcggcgtctc 1012861 ctttcgcgat cacggtaact gggtgatcgg cgcgcccgac gtgctgctcg atccggcttc 1012921 ggtggcggcc agacaggccg agcggatcgg agcgcaggga ttgcgggtgc tgctgctggc 1012981 tgctggcagt gtggccgtcg accatgccca agcgccgggt caggtcaccc cggtagcgct 1013041 ggttgtgctg gagcagaagg tgcggcccga cgcccgtgaa acgctggatt attttgctgt 1013101 tcagaatgtt tcggtcaagg tgatctccgg tgacaacgcg gtgtcggttg gtgcggtcgc 1013161 cgaccggctc gggctgcatg gcgaggcgat ggatgcgcgt gcgctgccga cgggccgcga 1013221 agaactggcc gacacactgg actcttacac cagttttggc cgtgtgcggc cggaccagaa 1013281 gcgtgcgatc gtgcatgctc tgcaatcaca cgggcatacc gtggcgatga ccggcgacgg 1013341 cgtcaacgac gtgcttgccc tcaaggacgc tgatatcggt gtggcgatgg gctcgggcag 1013401 cccggcctcg cgtgcggtgg cacagatcgt gttgctgaac aaccggtttg ccacgctgcc 1013461 ccatgtggtc ggcgaggggc gtcgggtcat cggcaatatc gaacgggtcg ccaatctatt 1013521 cctgactaag acggtgtatt ccgtgttgct ggcgctgctg gtgggtattg agtgcttaat 1013581 tgccataccg ctgcggcgtg atccgctgtt gttcccgttc cagccgatcc acgtcaccat 1013641 cgcggcctgg ttcactatcg ggatcccagc gttcatcctg tccttggcgc ccaacaacga 1013701 gcgggcctat ccgggcttcg ttcggcgagt tatgacgtct gcggtgccgt tcggactagt 1013761 catcggtgtc gcgactttcg tcacctatct ggccgcttac cagggtcgct acgcctcgtg 1013821 gcaggagcag gaacaggcgt cgaccgctgc gctgatcacg ttgttgatga ccgcgttatg 1013881 ggtgctggcg gtgatcgcac gcccctatca gtggtggcga ctggcgctgg tgcttgcctc 1013941 cggactggcc tatgtggtga tcttcagcct tccgctggcg cgggagaagt tcctgctgga 1014001 tgcctcgaac ctggcgacga cgtcaatcgc gctggcggtt ggcgtggtgg gtgcggcgac 1014061 cattgaggcg atgtggtgga tccgaagcag gatgctcggt gtgaaaccga gagtgtggcg 1014121 ataaccgcga atcgccgcgc attagcgccc gcagttcggg caatccgagg gcgttgcggc 1014181 gtagtgcatc caggcggcca ttgatggctt cggtagggct ggtcttgccg cggcgccggt 1014241 cggggtgggc gtaggcggcg atcaggcgct gggagaagcc ccagcacatt ttgccgtgtg 1014301 ttgagcggtg ggtagcgcgt gcgcggggtg tgtcgtactc ggtagagcgg atctccgcgc 1014361 ggccccggtg accgcccgtg agctgttgga tggttggtgg tgcatatcgt cggtctgtcg 1014421 atcgagacca cagcaccgac cgactccgcg attactccca tcatggtccg ggaaatcaac 1014481 atcggtgaga tccccctagg cctcaggctg ggcagcgaca ccacactgct cgacgccgct 1014541 ctcgcgggtg ggtaacaccg gcagccagct ttcgggcttt tcccgaccgg ctctaagggc 1014601 tggttgcagt caaccgcacc gcgacaagta gggttcacca gaggatactg gggccaagct 1014661 cgtggcaaga aacggtacgc atgggaatcc tggacaaggt aaagaacctg ctgtcgcaga 1014721 acgccgacaa ggtcgagacg gtgatcaaca aagcgggcga attcgtcgac gagcagacgc 1014781 aaggcaatta ttctgacgcc atccacaagc tgcatgacgc ggccagcaac gtcgtcggca 1014841 tgagcgacca gcagagctag cacgcatggc gaaactgtcc ggatccatcg acgtaccgct 1014901 gccaccggag gaagcctgga tgcacgcctc cgatctgact cgttaccgag agtggctgac 1014961 catccacaag gtatggcgca gcaagttgcc cgaagtgctc gagaagggca cggtcgtcga 1015021 gtcgtatgtc gaggtcaagg gcatgcccaa ccggatcaag tggacgatcg tgcggtacaa 1015081 acccccggag ggcatgacgc tcaacggcga cggtgtgggt ggtgtcaaag tcaagctgat 1015141 cgctaaggta gcgccgaaag agcacggctc cgtcgtcagc ttcgatgtgc acctcggcgg 1015201 cccggccctg ctcgggccga tcggcatgat cgtcgccgct gcattgcgag ccgacatccg 1015261 cgaatcgctg cagaacttcg tcacggtgtt tgccggctga ccggcgaacg tgatcggtgt 1015321 cgatgagttt cagactccgg ggcggtcggt acctgtgaac cctgatccag ggcccgacac 1015381 agactaggag gtcatccgtg cctactcgta gtagcgcgcc gctgggcgca ccctgctgga 1015441 tcgacttgac gacttcggac gtcgaccgtg cccaagattt ctacggcacg gtgttcggct 1015501 gggcgttcga gtccgcggga cccgactacg gcggatacat caatgccgcc aagggcggtc 1015561 acccggtcgc cggcctgatg gccaatcggc ccgagtttca gtctcccgac ggctgggcca 1015621 cctactttca taccgtcgac atcggtgcga ccgtggccaa gttggctgcc gcgggcggtt 1015681 cgtcgtgcct ggacccgatg gaagtacccg gcaagggctt catgagcctg gcggtcgatc 1015741 cgtcgggtgc ggccttcggc ctgtggcagc cgctgcagca ccacggcttc gaggtgatcg 1015801 gtgaagccgg ctcgcccgtc tggcatcagc tgacgacgcg cgactaccgt tccgtcatag 1015861 acttctaccg ccaggtcttc gggtggcgca ccgaacagat ttccgacact gacgaattct 1015921 gctacaccac agcatggttc gacgatcagc aattgctcgg tgtgatggac ggcagctcct 1015981 gtctccccga aggcgttccg tcgaattgga ccatattctt tggtgccgag gacgttgacg 1016041 agacgttgcg ggtgatctgc gacaacggcg gaagtgtggt gcgggccgcc gagaacaccc 1016101 cgtatggccg attggccgcg gcagccgacc cgatgggcgt tgtcttcaat ttgtcgtctc 1016161 tgcaggcgta atggcgaatc gggctgccgc gtggcgcgcg gcgacccgcc catgcgcagt 1016221 attagtgtca caaccatgac gcgccgcctg cgccctggtt ggctcgtggc actttccgcc 1016281 gcggtcatcg cggccagcac ctggatgcct tggctgacga cgaccgtcgg cggtggaggc 1016341 tgggtcaacg ccattggggg cacacacggc agcctggagc tcccgcacgg gttcggcccg 1016401 ggtcagctca tcgtcttgct ttcctcgacg ctgctggtgg ttggcgcgat ggcgggacgc 1016461 ggcctgtcgg tgaagctttc ctcgattgcc gcgctggtcg tctcgctgct catcgtggca 1016521 ctcacggtgt ggtactacaa gctcaacgtc aacccacccg tgtcagccga atacgggctg 1016581 tacttcggtg ccgccggcgg ggtgtgcgcg gtgggttgct cgttgtgggc tgcggtgtcg 1016641 gccgcttcgc ctgggcgtcg tcgccatcgt gaagtggtgc ggtagaacat ttcagcccgg 1016701 cggaactcgt gttttccccg tgcggggctg gctcccgatt gggtagcccc gtacacgaaa 1016761 ggcgcaaaca caacctcgcg gccatccggg tcgcgataga tgacggctcc ggtagcttct 1016821 caaagggggc gttgttccac cggctgggtc gccacctctt gcaggccagt aagtgcggct 1016881 tgctgccccc gcttgtccga gcaataccgg cacagctgtt cgtgatggtc tgtttcgacg 1016941 atttccagcg tcaggctcga gctgggaagg ccgaatatcg cgccgttgct tccgtagctt 1017001 tcggcgaagg tctggtcgag cattcccacc agatcacggt agaaccgcac tgtctcttcc 1017061 aagttcgacg ggcggggccg attgacgcca atccctcggg ccagtgcgtt gttcggcggc 1017121 gcttttcgtt gtccaccgct actccgtttc gccgaggctg cacttctgca ggccgctact 1017181 cgcagttttg ctgcggattt tccggctcgg tgcaattcat agcccaacgg cagccgccgg 1017241 cgactctgcg tgatcccagc gacgcaactc ggcgcccggc acccacgccg aatgcgtgcc 1017301 gctggaaata cgttccggca gtgcaagctt gcatatcggg ccatcgccgg ggcgcgccgc 1017361 gtcgaaaacc aggcaatacg atgcgtcgtc gttcatgtcg gtggtgaggg tgaccagata 1017421 gccgtcgtcc tcggcgctgc tgcccacccg tggagccatc gcggtctcac ttccgtagac 1017481 gccgtcaccg aacgagtaac actcgtggtt gccggtgagc agatcgtgct taaccagtcc 1017541 gtcgaacagg aaccaactcg gtttgccggt agcggcatag gtgtaacggt agctgctggc 1017601 cgcgtaatcg gcgttgatgg ttccgaactc ggtgatggac tcggacagtt gctcctcgtg 1017661 gactgccccg gtcaccatat tgagccgcca ccgatgtagc cgggactgca gccgatccag 1017721 agccaggaac cgaaacagct tctcccactt cgttcctccg gtgtcaagtg gctgcggatc 1017781 gccttcgtag aagccgtcga gcacgatctc gtcgccctgc tcgtaggcgt tggtgaagtg 1017841 caacacgaac gttggatcgg cttcgaacca gcgaatgtcg ttgcctcggc gagcaacaac 1017901 cgcaaaccga gatggaatct ccggatagaa gcgtggtagg tgcacgtcgc gctcgagcag 1017961 cctgggatcc cagaacagtg gaaaatcgtt gaggattacg taattttcgg tgaacgccat 1018021 gtcatgcggt agccgcggcc cgggcagcgg aacatcgaca tagtgcacaa gctcattgtt 1018081 ctggtcgaca acgccgtagc gcatatacgg ctcttgcttg ctgtagttga agaacaacag 1018141 ttcgccggtc ttgttgtcta ccttcggatg tgccgacacg ccccagtcga acggaaacct 1018201 tccgtgccag ctctccttgc cgagcgtatt ggccgagtac gggtcgatcc gatacagatc 1018261 gccgcactgg tagaagctag tcagcgcgat acctcggtgg acgatgacgt cggtgctcga 1018321 cgcgtccttc atgaggccac gagcgcccca gccgtgttcc cgcttggcca gttgcaccgg 1018381 ttctgccaga cccggccaca gcggcccgcc ggcctcgttc tcggccaaga atccatcggt 1018441 gcgaataaat cggttgcggt agaaggcttt tccatcacgg aagccgacga catggatcat 1018501 gccgtcgcca tcgaaggggt ggtaggtcgc gaatgccggg tgtagcgggt tctcggtgtt 1018561 gcgcaggtag atgccgtcca ggtcggcggg gacttcgcct gtcacggtgg tcaggtcgtc 1018621 ggcatcccat tcggtggtct gtggtcgcca cggaccggtg cgataggggt ggtcgtcgtc 1018681 ttcgggaagg gtcgacaagt acttgccgac aatcgtgatg tccatttcac gatcctcgtg 1018741 tggtgctgac aacgaaactg accgtggtgg ccgtgctgcc accgaaattc agcgtgccga 1018801 acgcttcggc gttctcgacc tgatagtcac cggcaatgcc gctcacctgt ttggccgcgt 1018861 cgagcagcat ccgcacaccg gaagccccga ccggatgtcc gccaccgatc agtcctccgc 1018921 tggggttgat gggtagccgc ccgccgatct cgatctctcc gttctcgatg gccttccaag 1018981 attccccggg gccggtcaac ccgatgtgat cgatggccag gtattcgctg ggggtgaagc 1019041 agtcgtgcac ctcgatcccg tccagatcgt cgagggtcac ccgggcgcgg cgcagggcgt 1019101 ccagcactgt ggcccgcacg tgcggcagta ggtagggggc cgagtcgccc tgggcgacgc 1019161 ggtccagttt ctgccgcaga cccaacccga cggtgcgatg tccccagccg tcgatgcggc 1019221 cgatcgggcg cgcgtcgcga tggtcgcgca gataggcatc gctgaccagg accaatcccg 1019281 cgccgccgtc ggtcatctgg ctgcaatcaa accgtcgcag ccggccttcg gtaagagggt 1019341 tggtcgcgtc gtcgtcggtg atcgggtcgg ggatcgtcca gccgcgggtc tgcgcgttgg 1019401 ggttgcggcg cgcgttggcg aagttgagtt gagcgatggc ccgcaggtga gtgtcatcca 1019461 aaccgtatcg ccggtcgtat tcgtcggcga cctgagcgaa catcgacggc cataagtagc 1019521 gggcctcggc tccttcgtgc ccggtccagg ccgcggcact cagatgctcg gccgcggtgt 1019581 cgccgggcac ggtcttctcc agctctaggc ccacgacgag cgcgacacgg tacgcgcctg 1019641 atcgcaggtc ggccatcgcc gcgagcgtcg ccacgctgcc ggatgcgcac gcggcctcgt 1019701 gccgggtggc cggcgtgtcc cagagatcgt cgcagacagt ggccggcatc gcgccgaggt 1019761 ggccttgacg ggcgaacatc tcgccgaagg cgttcgcgac gtggacgact cccgcagcgg 1019821 ctaggtcggc ggcgtccacc ttggccgcgg tgagcgtgcc gtcgacgacc tccctagtca 1019881 ggtcggcgaa gtcgcggttc tctttgctga ggttgcgagc aaaatcgctc tgatagccgc 1019941 cgagaatcca gacaccgtcg tccatagccg tacgctacta caagcggtgt gaacggcccg 1020001 tcggatagcc acgctcacca ggcattttcc gcgcggcgac gaacggttgc cggactttta 1020061 ccgcgggggg tttccgggcg gcggctgctc tctaatcaca actaccgggg gtttgcggcc 1020121 gtcctcttgg ccgtcagtgc tggtgccgct acgggtgccg ccaccgcccg tcgtgccgcg 1020181 tgcggccagg ctcgccaaag ccatcccgct gagcaggcct gccggcatcc cgtttagggc 1020241 cgtcgggtcg gcgccggcgc tggagctgaa ggtgggtgtt gcctgaacgg cgagctggat 1020301 ctccggggcg gccgtggtcc agctgtgcgg caccgacaac gctccgacta atgctgcgtg 1020361 gccgacgccc gcggacaccg gcgccgcgcc cccgaagggg ccccagtgcg gctccggctc 1020421 gtcggtcgcc gaactcagtg gatggccctg cgtcggtccc agcccgccgg cgttcccgta 1020481 taggccgatg tgccagggtc tggccgtgtt cgtgatcgcg agcgcaatgc tgccggtcgc 1020541 gatggatgca atgtagagcg cgatcacgtc caattcccct atcggggtgg ggatcactat 1020601 cggctgagcg gatccgactt gcgggttgag ggtcgacgcg atccccaaca gtcccgatgt 1020661 cagcggatca gcgttggcgg ccaatgcgga cagaatgtcg ctcaggatcc ccgggggcag 1020721 ctgggccagt gtcgcctgtg catccgcaac ggcgcccgca ccggcggctt gggtcgccgc 1020781 ggctgcggcc gcgggcccgg ccgggccggt gccttgcacg ggtggagtga acggcggcaa 1020841 cgccgacgcg gccgcagatg ccccctcata gctgtacatc acggcagcgt cttgggccca 1020901 catttcggca tactcggcct gggtagccgc gatcgccgca ctgttttgcc ccagaatgtt 1020961 cgccgcgacc agcgacatca accggctgcg gttggccgcg acgagggatg gtggcaccgt 1021021 catcgcgaac gccgtcccaa acgcttccgc cgctgccctc gcctgtgtgg ccgtctcctt 1021081 cgccagcgcc gccgtggcgg ccagccaccc cacatacggc gttgccgcgg ccgccatcgc 1021141 ggccgccgcc ggccccatcc acggctcaac gatcagcgtc gacaccaccg atccatacga 1021201 gaccgcggcg gaagtcaact ccgcggccac accgtcccag gcggccgcgg cggctagcat 1021261 cgactccggc cccggaccgg aatacattcg gcttgaattc acttccggag gtaaaagccc 1021321 gaaatccatt gccagcaacc tccttaaccg gtcgcgacca cattgacggc ctcggtggtc 1021381 gcatacgcat cggcggtggc cgccgggagg gccacgaaca tgccatggac cagcgcggcc 1021441 ggcttactca ccactcggta gtgcttggtg tgcgcggtga accgggccgc cgtcaggacc 1021501 gacacgtcat tggcagcagg gggtaacacc cccgtcgtcg gggcacagac ggctgtgttc 1021561 cgagcactca cggcggtacc gatcgtcggc aagtcccccg tcgcggctgc caagaccacc 1021621 ggctggatgg tcacaaaaga catcggatac cacctgacgc ggatcgcttc atctgatcgg 1021681 tcgacatctt ctacataacc acggaaatgt ctgctttata acggaattag actactttgt 1021741 gttgtctggc gttgctctgc accgacggca tgggtaaacg tctgagatgc gggtgtcggc 1021801 ggtagctgaa aaaccgtgct gacaaccatg attcgccatt cccgaacgac ctgcgaactt 1021861 tgtcgcctag cgtaacgccg tggcgagatt tggctcgatt gttcgcagtg gcgttacgct 1021921 cgccacgcgt gagcctggat caggcaaacg cggctccacc tggccatttg ctgtccgaga 1021981 cggtagttac tcagcatggt gcacaggtct gtgcttgtct ggttgatggt gatttggcgt 1022041 tgcggtggcc gtgatgagga cgcggtgaga aacggagctt gaagatatgt cagcgaaaga 1022101 acgcggtgac cagaacgccg tcgtcgacgc cctgcggagt attcagcccg cagtcttcat 1022161 tccggcttca gtggtcatcg tcgccatgat cgtcgtttcc gtggtgtact cgagcgtcgc 1022221 cgagaatgcg ttcgttcggc tgaactccgc gatcaccggc ggcgtcgggt ggtggtacat 1022281 cctggttgcc accgggtttg tggtattcgc gctgtactgc ggcatttccc ggattggcac 1022341 tatccggctg ggccgcgacg atgagctccc cgagttcagc ttctgggcat ggctggcaat 1022401 gctgtttagt gccggtatgg gtatcggcct ggtcttctac ggggtggccg agccgctcag 1022461 ccactacctg cggccaccgc ggtcacgcgg cgtgcccgcg cttactgatg cggcggctaa 1022521 ccaggcgatg gcgctgacag tgttccactg gggcctgcac gcctgggcaa tttatgtcgt 1022581 ggttggcctc ggtatggcgt acatgaccta tcggcggggt cgccccttgt cggtgcgctg 1022641 gctgctggag ccggtcgtgg gtcggggccg tgtagagggc gccttggggc acgcggtgga 1022701 cgtcatcgcc attgtcggaa cactctttgg tgtcgccacg tcactgggct tcggtatcac 1022761 tcagatcgcc tccggcctgg aatatctcgg ctggatccgg gtggacaact ggtggatggt 1022821 cggcatgatc gccgccatca ccgccactgc gacggcgtcg gtggtcagtg gggtcagcaa 1022881 gggtttgaag tggctgtcga acatcaatat ggcgctggcc gccgcattgg ccctgttcgt 1022941 gttgttgctc gggccgacac ttttcttgct gcagtcgtgg gtgcaaaatt tgggaggcta 1023001 cgtccagtcg cttccgcaat tcatgctgcg caccgcgccg ttctcgcacg acggctggct 1023061 cggcgactgg actatcttct actggggttg gtggatcagc tgggctccgt ttgtcgggat 1023121 gttcatcgcg cggatttcgc ggggacggac gatccgggag ttcatcgggg cggtgctgct 1023181 cgttcccacc gtgatcgcct cgctatggtt tacgatcttc ggtgactcgg cgttgttgcg 1023241 gcaacgcaac aacggcgaca tgctcgtcaa cggggcggta gacaccaaca catcgctttt 1023301 ccgattgctg gacggtttgc ctatcggggc tattaccagc gttcttgctg tgctggtgat 1023361 cgtgttcttc ttcgttacgt cgtcggactc cggttcgttg gtcatcgaca tcttgtcagc 1023421 gggtggtgag ctggacccgc ccaagctgac cagggtctac tgggcggtgt tggagggggt 1023481 agccgcggcc gttttgctcc tgatcggagg tgctgggtca ctgaccgcgt tgcggacggc 1023541 cgctattgcc acggccctgc cgttctcaat cgtcatggtg gtggcgtgct atgcgatgac 1023601 caaagcgttc cacttcgacc tggccgccac acctaggctg ctgcacgtca ccgtgcctga 1023661 cgtggttgcg gcaggaaacc ggcgacgcca cgatatctcg gcgacgctgt cggggctcat 1023721 tgccgtccgt gatgtcgata gcggcacata tatagtccac cccgacaccg gcgctctcac 1023781 cgtcactgca ccaccagatc cgttggacga tcatgttttt gagtctgatc ggcacgtaac 1023841 gcgaagaaac acaacatcat cgagatgatg tgttatcgac ctgccgggtc gccgctgcct 1023901 ggaccggagc cggctacttc cggtaaacgc gcaccgctgg atgaatcgcc gcggcatgag 1023961 aagctcgacg gtggtgccgg gatcgtcgcg cacgatgtca tgctccaggg tgctggtcag 1024021 ccgatggcct ttggtgtgcc actgaccggg tcgatctccg cggccggcga ccacgccacg 1024081 gtcgcgtcca tagcacaggt cgcgcggcgc gcgacggcgt gacccgacat caagtcctta 1024141 tcggaggagc ttggcccctc gcgttggtcc gcggcaggct cggtcggcaa atcctcaaat 1024201 cggccccaag ttgcaccgag cgggagcggc ggtgacggcc aacgtgtggt gtcgtgcggg 1024261 cggcattcgg atggcgccac ggccggtcat cccggtggct acgcagcagc gcctgcggcg 1024321 gcaggcggat cgccagagcc tgggtagtag cggcttgcca gcgttgaatt gtacgcctat 1024381 caggcacaca attgatgtca tggctaccaa gcctgagcgg aagaccgagc gtcttgcagc 1024441 gcgcctgacc cctgagcagg acgcgctgat tcgtcgtgct gccgaggccg aggggactga 1024501 cctcaccaat ttcacggtta cagcggcgtt ggcgcacgcg cgcgacgtgc tggccgaccg 1024561 ccggctcttc gtactcaccg atgccgcgtg gactgagttc ctcgccgcgc tggaccggcc 1024621 cgtctcacac aagcctcggt tggagaagct gttcgccgcg cggtccattt tcgacaccga 1024681 ggggtgagcg gctacagcgc gccgcgacgt atcagcgacg ccgatgacgt cacgagcttc 1024741 agcagcggcg agcccagtct ggacgattac ttgcgcaagc gggcgttggc caaccatgtg 1024801 cagggagggt cgcgctgttt cgtgacgtgc cgtgacggtc gggtagtcgg cttctatgcg 1024861 ctagcgtcag ggtcggtcgc acacgctgat gctccgggac gggtgcgccg caatatgcct 1024921 gaccccgtgc cggtgatcct gctgtcgcgg ttggcggttg atcgcaaaga acagggcagg 1024981 ggcctgggca gtcatctgct gcgtgatgcg atcggtcgct gtgtccaggc tgcggactcg 1025041 atcgggctgc gggcgattct tgttcatgcg ttgcacgatg aggcccgcgc gttctacgtc 1025101 cactttgact tcgagatctc gccgaccgat ccgctgcacc taatgctgtt gatgaaagac 1025161 gctcgcgcgc taattggcga ctgatgctac gcgattgact atcgagagcc aggctacgtc 1025221 atctgatacc aaccaatcac cgaccacagc accgaccaga acaagccacg accactcggc 1025281 tgacacctga aaaccatggc tgaactgcgc aaacacagag tgcccccggc aggattcgaa 1025341 cctgcgacac cggctttagg agagccgtgc tctatcccct gagctacgag ggcggggacg 1025401 cctttgaata cctgactaaa acctagccgt tcgccgcgcc ggccgggact gtccgatatt 1025461 cggtgtaagt ggcgtttctc gggatttttc tttcggtcag cgttcttcgg cggctggcat 1025521 gcgatcggcg aacgtgatcg ccagggcgtt gagcgctggc ttccagcgta cggcccactt 1025581 ggtttgcccg gtgcccttgg gatccaggga gcgggtgacc aggtagagcg tcttgagtgc 1025641 tgactgttcg ttcgggaagt gtccacgtgc ccgcaccgcc cgccggtagc gcgcattgag 1025701 actttcaatt gcgttggtag aacacgggac tcgccgtatt tcgacatcat agtccaggaa 1025761 cggaatgaac tcttcccacg cgctgtccca cagccgtgtg atcgccgggt aaggcttacc 1025821 ccatttctcg gcgaactcct cgtagcgcaa cctggcctca gcggcactgg ctgcggtgta 1025881 gatcggcttg aggtcgacgc tgatcttgtc ccagtacttg cgggaggcat accggaaagt 1025941 gttgcggatc agatggatga tgcaggtctg caccgtggcc aacgggaacg ccgcggacac 1026001 gctgtcgggc aaccctttga ggccgtcgca gaccaggaag aagatgtctt tgaccccacg 1026061 attgcgcagg tcggtgagca ctgccagcca aaatttggct gactcaccgt cgccttcgcc 1026121 ggcccacatc cccaggatgt ccttgtggcc gtcgaggtcg acgccgatcg cggcgtagac 1026181 cggccggttg cggacctgcc cgtcgcggat cttgaccatg atcgcgtcga tgaacaccgc 1026241 ggcgtagacc ttctccagcg gcctggacca ccacgcctgc atctcctcga tgacccggtc 1026301 ggtgatccgc gagatggtgt ccttggacac cgacaccccg taaacgtcgg cgaagtgagc 1026361 cgcgatctcg ccggtggtca ggcctttggc gtacagcgac aacaccaccc ggtccacatc 1026421 ggtgacccgg cgcttacgtt tgcccacgat caccggctcg aaggtgccgt tgcggtcacg 1026481 gggcaccgca atctcgacct gtccgcacgc atcggttatc accttcttgt tacgagatcc 1026541 gttgcgtgag tttccacttc cacgcccggc tgcggcgtgc ctgtcgtagc cgaggtgttc 1026601 ggtcatctcc tcttgcaggg cggcttcgag caccgtcttg gtcagcgcct tgagcaaccc 1026661 gtcagggccg gtcaatgcga ccccctcagc gcgtgcctgg cgtaccagat cacccaccag 1026721 cgcccgctcg gcaccggaga gctcacgggc cgcaacggcc gcctcatcca cgtcctggcc 1026781 ggcgtgagcc ggctctatca cctgagcagc atccatgccc ttgagtgtgt ttggtcatag 1026841 cagtgattcc ttctgcccca cgccgggggc ggtcagaacc acttacaccg aatcagcgat 1026901 agacccctcc ggcggcgggg gggttggcgg tgtttgtggc gtccggtcgt cggggtgcgg 1026961 cgggtgtgag tgtagcgggc gcaacgaggg ccacctgacg ctcgggcgtg tgtggtgggc 1027021 gcttgtcggc caacgctctg gggttcagag ctgttgcgtg ttgagtgtgt tttagtgtgc 1027081 gttagtgtgt tctaattggc ggcgtgaatc tggcggattg ggcggagtcg gtgggggtga 1027141 atcgacatac cgcttatcgc tggtttcggg aggggacgtt gccggtgccc gcggagcggg 1027201 ttggccggtt gatcctggtc aagacggccg cctcggcgtc ggccgcagcg gcgggagtgg 1027261 tgctgtatgc gcgggtgtca agccatgata ggcgttcgga tctggatcgg caggtcgcgc 1027321 gtctaaccgc gtgggccacc gagcgtgact tgggggtggg gcaagtggtg tgcgaggtcg 1027381 gttccggcct gaacggcaag cgacccaagc tgcggcgcat cttgtcggac cccgatgcga 1027441 gagtgatcgt tgtggagcat cgggatcggc tggcgcgttt cggggtggag cacctcgagg 1027501 cggcgctgtc tgctcagggc cggcggattg tggtcgccga tcctggtgag acgaccgatg 1027561 atctggtgtg tgacatgatc gaggtcttga ccggtatgtg cgcgcggctg tacgggcgtc 1027621 gcggtgcgcg caaccgggcg atgcgtgcgg tcacggaggc caagcgtgag ccgggggcgg 1027681 ggtgatgatc gtcaggatgc gtagctgcgc tcaggccgcg aaggtggccg aggccaccgg 1027741 tggtgtgcag ctggcgggca agccgaaacc cgatgggaca ccgacgttct cccggtatgt 1027801 ggagatcggc gtggattttg aggcgcaccg gccggtggtg gagtcggttt cggtgctgtt 1027861 cgagctttat gacggcgacg ccaacagtta tgccgcgacc ggggggccgg gtgcccaact 1027921 gccgtcgggc tggatggtca cggcggcgaa attcgaggtc gagtggcccg ccgacccgca 1027981 gcgggcgggt ttggtgcgtt cacatttcgg cgcccgccgc aaagctttca actggggcct 1028041 ggcccaggtg aaggccgacc tcgacgccaa agccgctgat ccggcacatg agtcggtgga 1028101 ctgggacttg aagtcgctgc gatgggcgtg gaaccgagcc aaagatgacg tggcgccgtg 1028161 gtgggccgag aattccaagg agtgctactc gtcggggttg gccgatctgg cccagggcct 1028221 ggctaattgg aaagctggca agaacgggac ccgcaaaggc cggcgggtgg gcttcccgcg 1028281 attcaaatcc gggcggcgtg atcctggcag ggtgcggttc accaccggca ccatgcgcat 1028341 agaggatgac cggcgcacga tcacggtccc ggtgatcggg ccgctgcggg ccaaggagaa 1028401 cacccgccgg gtgcaacgcc acctcgtgag cgggcgcgcg cagatcctga acatgacctt 1028461 gtcgcagcgg tggggccggt tattcgtggc ggtctgctac gcgctgcgca ccccgaccac 1028521 cagatcaccg ctcacccagc cgactgtgcg cgccggaatg gacctgggag tccggaccct 1028581 ggccacggtc gccaccctcg acaccgccac cggcgagcag accatcatcg aatacccaaa 1028641 cccggccccg ctcaaggcga cactcgtcgc ccgtcgcagg gccggccgag aactttcccg 1028701 ccgcatcccc ggctcccatg ggcatcgggc agtgaaagcc aagctggccc gcctggatcg 1028761 ccggtgcgtg cacctacggc gggaagcagc ccaccagctc accaccgagt tggcgggcac 1028821 ctatggccag gtcgtgatcg aagacctcga cgtggccgcg atgaaacgca gcatgcgccg 1028881 gcgggcgttt cgccgatcgg tctccgatgc cgcaatgggt ttggtcgcgc cgcagctggc 1028941 ttacaaaacg gccaagtgca gcggcgtgct gacggtggcg gaccgctggt ttgcctccag 1029001 ccaaatccac cacggctgca ccagccccga cggcacaccg tgccggctgc aaggcaaggg 1029061 ccgcatcgac aaacacctgc tctgccctgt aacgggcgag gtagtcgacc gcgacagaaa 1029121 cgctgctttg aatctccgtg actggccgga taacgccagt cgtggtccag tcgggaccac 1029181 ggccccatcg gcacccgggc caaccaccac ggttggtaca ggccatggcg cggacaccgg 1029241 atcatccggc gccggcggag catccgtaag accccgccca cgcagggccg gacgcggcga 1029301 ggccaaaacc caaaccccgc aaggggacgc cgcatgagag tgcaactaaa acacactcaa 1029361 cggcaacggt gtcgtcggga tgccagcgcc gcccacgcat cttcacttga tcgagatcga 1029421 tcaggtgatc ggccgctcat tggcggccgc ggcatcatgc agatggttga cgagctgcgt 1029481 gcggccgctt ccggtccaaa atcgccagac agctaccagg aacgggccgc agttaccagg 1029541 ccctgtacca gggtagcggt gaccggtgac atgccgccga cgccggggag ggtactgcgt 1029601 gggcccagac cccttacccg aatcgatagt tccagctggg tcccgccgtc gcggacccgg 1029661 ttgaccggat tgtctggatg caggccgcgg agctcctccg ggatggcggc cagatcggtg 1029721 actacccgat agccgggcag ctggatgtgc cgcgcgagat gggcggcaag cgcgcggttg 1029781 cggcccccgg ccaacagctg ggtgctgcgc ccgatgcggt cgtagccgtg cagcgacacg 1029841 gcgacgtcaa cgtggtcaag gaattcggcg aggcgcgccg attccgcagg gtcgaaccgg 1029901 gccgacggca ggtggtgcgg gtagttgtcc ggatgacgca gcaggtacac cgaagcgccc 1029961 gcagcctcgg cggagcgttc ggcgatcagg tcggtcacct gctccaggcc gcccccgtgg 1030021 atggcgagga agccgaagcg ggaccgcagc tggctcgtct cgatgacgcc gggctggctt 1030081 agcaactccg aaagtgattg tggcgcaggc ccagatctcg atgacggtaa cactggcagg 1030141 ggccaccgcg cggggtccca gcggtgcaga tagtcgatcc agcgttgcgg cagcccgtgg 1030201 tgtcgagcgc cgtcgatgac gcgcggtaga tagcccggcc gcggccggcc cggcatcacc 1030261 cggtggtcaa tgtagaccca ggccggcaac gctgtgtcgt cggtgtgcac ggtcaaccgt 1030321 tcgcgccggt agcgcaccgg cacgccttcg gcgctgtcca acctgaccag gtcgcgctcg 1030381 gagagctgcc atagcacgcc atgcaccttg tttccggcga agggttcgac ggtggccacg 1030441 ccgcgctggt tgatcagcca gttgtgatcg ctgagcactg ccggccgcgg agcaccggcg 1030501 tcgggacagc gcgacgccat ctggtgggcg cacaggttgg acccgtaggc gaagtaggga 1030561 tgccggcggt ccggcattca gccggtcacc gtgagataga tcagcatcac gttgagcaga 1030621 ctaaccatca ccgcgaccac ccagccaacc caagtcgtgg cgcgatggtt ggtgtcgccg 1030681 cccatcaccg cggggctgcc ggtgagtttg accagtggaa gtaccgcaaa cggaataccg 1030741 aacgacagca ccacctgtga gagcaccaat gtgcgggtgg ggtcgaagcc cagcgtaagt 1030801 atcgccaacg cggggcccag cgtgattagg cggcgcacca gcatgggaac gctccagtgc 1030861 agcagcccct gcatgatcat cgcgccggcg taagcaccca ccgacgacga cgccaagccg 1030921 gacgccagca acccgaccgc gaagagcacc gcgatcgtcg cccccaaggt gtcgtggacg 1030981 gcgtggtagg cgccttcgat cgaggcggtg tccccacggc cccgcatgtt cagcgcggca 1031041 accagcagca tcgcggcgtt taccccgccg gctatcagca tcgccaggcc gacatcccag 1031101 cgggtgacgc gcagcagccg gcgccgctga gggcccggat cgggatgccc gtgccggtcg 1031161 cgcgcgagac ctgaatgcag gtagacggcg tgcggcatga cggtcgcccc catgatcgcc 1031221 gcggccaaaa gaacgctctc ggttccctga aagcgcggtg ccaaaccgcc gaggaccgca 1031281 ttggggggtg gtgtcacgac gaagaaactg gcggtgaagc cgatggcaat caccagcagc 1031341 aaggcggtga tgacgcgctc gaacaaacgt tgaccgcgcc gatcctggat cgtcagcagc 1031401 agcagcgaga ccaccccggt gatgatcccg ccgatcggca gcggcaggtt gaacatgatc 1031461 cgcaatgcga tagctccgcc gatcacttcg gccacatcgg ttgccatcgc gacgatctcg 1031521 gcctgtgccc agtaggccag ccgggccggg cgtcccattc gcttgccgat cgcttccggc 1031581 agtgagcgtc cggtcaccag cccgagcttt gccgacaggt actgcaccag ggcggccatc 1031641 acgttggcgg cgacgatcac ccataacaac aggtagccga actgggcgcc ggagctgacg 1031701 ttggctgcca cgttcccggg gtcgacgtag gcgatggccg cgacaaaggc tggcccgagc 1031761 agataccagc tcgtcttcag ggaagtccgg gtgtcctggg ccaactcacc gactttcgat 1031821 ccacgcgaac aaagatgcga gagtaaccga aattcgcccg ccaccaacca ccgggctact 1031881 cgggacctcc gctggctatc ggtagtcggg gttggcgaag tccggccggc agccggcgtc 1031941 ccacttggtg cgttgattgc cgtaggccgg gatgccgccg gcgacccgca acatctgcgc 1032001 gatgtgcatc agattgaacg tcatgaatgt ggtgttgcgg ttggtgaagt cgttctctgg 1032061 accgccggat ccggggtcga gatacgacgg tcccggcccc gcttcaccga tccagccggc 1032121 atccgcttgc ggcgggatgg tgtatcccag gtgttgcagg ctatagagca cattcatcgc 1032181 gcaatgcttg acgccgtcct cgtttccggt aatgaggcaa ccaccggcgc ggccgtagta 1032241 ggcgtactgt ccatcctcgt tgagcaggct cgagcatgcg tacaggcgct cgataacccg 1032301 tttcatcacc gagctgttgt cgcccagcca gatcggcccg cacagcacca ggatgtgcgc 1032361 atcgaggaca cgccgataca gggcgggcca ttcgtcggtc gcccaaccgt gttcggtcat 1032421 gtccggccat acgccggtcg ctatgtcatg gtcaactgcg cgcagagtgt cgacctggac 1032481 gccatgctca cgcatgatcc ccgagctgcg ctcaatgagc ccgtcggtat ggctgagctc 1032541 tggcgagcgc ttcagtgtcg cgttgatgaa cagcgcacgc agcccgtcga atcggggtgg 1032601 ggccgcggcg ttctggtcag aggttgtggt catacgtcat acccacctgc ctgtcatcgt 1032661 cgtgccgggt tgccgctggg cggcggtgct ggtgccaaga aatgaccgat caggcagcag 1032721 cgtaccgccc ttcaccggtg atcaggggta ggtcgagggt tgtccggata cccggttcgg 1032781 cggccaccac tgcagggatc gcgttgacga tgcgcatcgc ggtggcgacc agtccggcgt 1032841 ggttgtggtc cccgtggcgg ctgctcaggc agatgtccat ggcgtagcag ggctcgccgg 1032901 agatttcgat gcggtacgag ccgcccggct gggcgggctg cggccactcg ggacataggt 1032961 ccgcgcgcaa ccgggtcacg tgttccagga ctaccgctgg cacgccgtcg accaggccga 1033021 gcacctcgaa gcgcagggcg gcggcgctgc ccttaggaat atggcccgat gcaatgttga 1033081 aggcctccgg cgccggctcc cggacataca tttcctcgac cccgtcaagt gaaatgccaa 1033141 ggcccgcagc aagttgtcgg accactgatc cccaggccag gctgagcaca cctggctgca 1033201 gcagcatcgg gatctggtcc atcggcttac cgaagcccat cacgtcgaac atgactacgg 1033261 cgctgtcata ggtggcgtag tcgacgatct ccatgcagcg tatctgctcg atgctttcac 1033321 aggtgccggc caacgccatc ggcaacaggt cgttggcgaa acccggatcg atgccgttca 1033381 cgtacagact tgaatttcct gcgcgcgcag cgtcttgcaa aggcttgatg atctcgtcgg 1033441 ggatcacctg ccacggatat tgcaagaaca ccgggccgct gccgacgata ttgatccctg 1033501 ccgccaagat tcggcggtag tcttccagcg cctcgggcag ccgattgtcg gccatcgcgt 1033561 tgtagacggc gcaccgcggc ccggtggcga gcacggcgtt cagatcggtg ctggcccgca 1033621 cacccgtcga atccgccagc ccggcaagct ctgccgcatc cttgccggct ttggcgtccg 1033681 atgacaccca gacaccggtg agctcgaact ccgggtcggc gatgagcgca cgcaacgagt 1033741 gcacgccaac gttgccggtg cccaattgaa cgacgggtat ggccatggcg ggctccttag 1033801 cggtaggggt cagactgcga ctgctcgcgc atcatcggtt cacaggtccg gaatgggaag 1033861 gtcgagattg gggaaggtga gtccgccgtc gacctccaac gtcttgccgg tcaggaagct 1033921 gcccgccgga gaggccaaat acactgccgc agctgcaatg tcgacggggt caccgagccg 1033981 gcgcagtggt gtcgcctgct ccatcggcgc acgcagctcg tcgttggcgg ctaccacctc 1034041 cagcgccgag gtcaggatgg aacccggcgc gatcgcattg acccggacgc gtgggcacag 1034101 gtccagcgcc gccagccggg tgtagtgggc cagtgcggcc ttggcggtgc cgtaggcggc 1034161 gaaaccccgc gccgccagcc ggcccatggt ggagctgatg ttgatcacgc tgccgccgcc 1034221 ggagtgttcc agcatcaacg gcaccgccgc gacggtcagc gcgtgggcgg tgcccacgtt 1034281 gaaggcgaag gcgtccgcga ggtccttggt cgaggtgctt agcagcgtgt tgggcatggt 1034341 gccgccaacg ttgttgacga cgatgtcgag cttcccgaaa gctccgacgg cctgaccagc 1034401 cagctgcgcg gtcacctcgg gatgggccag atcggcggca acggtgtggg cgcggcggcc 1034461 ggcagcgcgg atctgttcgg cgacagcgtc aagctcggat gatgttcgtg aagcgatgag 1034521 gacatccgcg ccggcctggg cgaaagccaa tgcgatggct gctcccaggc cgcggccgcc 1034581 gccggtgatg acggcaacct tgtcgtcaag acggaacata tccaggatca tggcgccctc 1034641 ttttccggct gtcggccgaa acggtaacaa gcttgctgca gcttcctgtg actgctcccg 1034701 aaacctgggg gtgtgcctgc tgtgtatgca cggcatacgg acatccttcc cctgagaccc 1034761 gcggtcgaac cagccacgtg tccatcatca ggggtcaacc ccggccaagg gcgacggcac 1034821 gccaagttcg ccgaccgtta acctagtgct gttagcttca tttgctgcga gcaaaacagc 1034881 tggtcggccg ttaggaactg aattgaaact caaccgattt ggtgccgccg taggtgtcct 1034941 ggctgcgggt gcgctggtgt tgtccgcgtg tggtaacgac gacaatgtga ccgggggagg 1035001 tgcaaccact ggccaggcgt cggcgaaggt cgattgcggg gggaagaaga cactcaaagc 1035061 cagtgggtcg acggcgcagg ccaacgcgat gacccgcttt gtcaacgtgt tcgagcaggc 1035121 ctgccccggc caaaccctga actacacggc caatggttcg ggcgctggaa tcagcgaatt 1035181 taatggcaac caaaccgatt tcggtggctc agatgtaccc ctgagcaagg acgaggccgc 1035241 agcggcgcag cggcgttgcg gctcgccggc gtggaatctg ccggtggtgt tcggcccgat 1035301 cgcggttacc tacaacctca acagcgtttc ctcgctaaat ttggacggcc ccacgttggc 1035361 gaagatcttc aacggctcca ttacgcagtg gaacaatccc gcgatccagg cgctgaaccg 1035421 cgacttcacg ctgccaggtg agcggattca cgtggtgttc cgcagcgatg agtcggggac 1035481 cacggacaac ttccagaggt acctgcaggc cgcgtccaac ggtgcgtggg gtaagggcgc 1035541 tggaaagtcg ttccaaggcg gcgtcggtga gggcgcgcgg ggtaacgatg gcacgtcagc 1035601 ggccgcgaag aacaccccgg ggtcgatcac ctacaacgag tggtcgttcg cccaggcgca 1035661 gcacctgacc atggccaaca tcgtcacttc ggctggtggg gacccggtgg cgattactat 1035721 cgactcggtc ggccagacga tcgccggggc caccatctcc ggggtgggca acgacctggt 1035781 gctcgacacg gactcgttct accggccgaa gcgtcccggc tcctatccga tcgtgttagc 1035841 gacatacgaa atcgtttgct cgaagtatcc cgactcgcag gttggcacgg ctgtgaaggc 1035901 gttcctgcag agcactatcg gcgccggtca aagcggcctg ggggacaacg gatacatccc 1035961 aattccggac gagttcaaat cgaggctgtc gactgcggtc aacgcgatcg cctgatctga 1036021 ggttgacgtg gtcaccgagc cgctcacaaa gccggcgcta gtggcggtcg acatgcgccc 1036081 cgcgcggcgc ggcgagcggc tgttcaagct ggccgcgtcg gccgccggtt cgacgatcgt 1036141 catcgcaatc ctgctgatcg cgatattcct gttggtccgc gccgtgccgt cgttgcgggc 1036201 gaatcacgcc aatttcttca ccagtaccca attcgacacg tcggacgatg agcagctggc 1036261 gtttggtgtc cgggacttgt tcatggtcac ggcgttgagt tcgataacgg ctctggtgtt 1036321 ggcggtgccg gtggctgtcg ggatcgcggt gttcctcacc cactacgcgc cgaggagact 1036381 gtcgcgtcca ttcggcgcga tggtggatct actggccgca gtgccgtcga tcatcttcgg 1036441 gttgtggggg atctttgtgc tggcgcccaa gctcgagccg atcgcgaggt ttctcaatcg 1036501 caacttgggc tggttgttcc tgtttaagca gggcaacgtg tcgttggccg gcggcggcac 1036561 gattttcacc gcgggcatcg tgctgtcggt gatgatcctg cctatcgtca catcgatatc 1036621 acgcgaagtg ttccggcaga ctccgctgat ccaaatcgaa gcagcgctgg cgctaggcgc 1036681 gacgaaatgg gaggtagtgc ggatgaccgt gctgccatac gggcgaagcg gggtggtcgc 1036741 ggcctccatg ctgggtttgg ggcgggctct gggcgaaacc gtggccgtgc tggtcatcct 1036801 gcgctcggcc gcgcggccgg ggacctggtc gctgttcgac ggcggttata cgttcgcttc 1036861 caagatcgcc tccgctgctt cagaattcag cgaaccgctg ccgaccggag cctatatttc 1036921 ggcgggattt gcgttattcg tgctgacgtt cctggtcaat gcggccgctc gcgcaatcgc 1036981 cggcgggaag gtcaacgggt gagtccctca atgagcatcg aggcgctcga ccagccggta 1037041 aagccggtgg tgtttcgtcc gcttacgctg cgacggcgga tcaaaaacag cgtcgcgaca 1037101 acgtttttct tcacctcgtt cgtggtcgcg ttgataccgt tggtctggct gctttgggtg 1037161 gtgattgccc ggggttggtt tgccgtcacc cgatcgggct ggtggaccca ctcgctgcgc 1037221 ggcgtgctgc cagagcaatt cgccggtggg gtgtatcacg ccctgtacgg cacgctggtg 1037281 caggccgggg tggccgccgt gctggccgtg ccgctgggct tgatgaccgc ggtttaccta 1037341 gtggaatacg ggactggtcg aatgtcgcgg gtgactacct tcaccgtcga cgtgcttgcc 1037401 ggcgtgccct ctatcgtggc ggcgttattc gtcttcagcc tgtggatcgc caccctagga 1037461 tttcagcaga gcgcctttgc cgtggcgttg gcgttggtcc tgctgatgtt gccggtggtg 1037521 gttcgggcag gcgaggagat gctcaggttg gtgcccgatg aactgcgaga agccagctac 1037581 gcgttaggcg ttccgaaatg gaagacgatc gtgcggatcg tcgccccgat cgcgatgccg 1037641 ggcatcgtgt caggcatctt gttgtccatc gcgcgcgtcg tcggtgaaac cgcaccggtt 1037701 ctggtgctgg tcgggtacag ccactccatc aacctcgacg tcttccacgg caacatggcc 1037761 tcgctgccgt tgctgatcta caccgaactc accaatcccg agcacgccgg cttcctgcgc 1037821 gtctggggcg cggcgctgac cctgatcatc gtggtcgcca cgatcaacct ggccgcggcg 1037881 atgatccggt tcgtcgcaac ccgacggcgg cgactcccgt tatgacgtga gtttcaccac 1037941 tcggtcgttg ccgcggtcgg cgacgtagac ggtccggtcg ctgtccactg ccaccgcgag 1038001 gggggtgttg aggccggtga acggtagcac tgtcgaggtg gtcgacccgg ccaggagttt 1038061 gaccacctgg tttgtgttgt gctcggtgac gtagacggtt ccggcttcgt ccaccgcgat 1038121 gccccacggt gcggtgatat ccgtgaatgg cagcacgacc tggttattcg actcggcctc 1038181 tagcttgaca accctgttgt tgtcggtgtc ggtgacatag acgttgccgg agttgtcgac 1038241 ggccaccccg tcggggtcgt tgaggccggt gaacggcagc acggtctggg tcttggatcc 1038301 ggccgccaac ttcaccaccc tgttgttgcc ccggtcggcg acgtataccg caccctgggt 1038361 atccaccgcg agaccttcgg ggtagttgag gccgtcgaac ggtagcacgg tctggttgtt 1038421 ggacccggcc gctaacgtca ccacccggtt gttgaaatcg gtgacgtata cggtgccagc 1038481 gccgtccacc gccaacccct gcggctggta cagcccgttg aacggtaaca ccgtcgtgcc 1038541 ggttgacccg gtggccaact tgaccactcg gccgtacatg ccctcactgg tgacgtacac 1038601 gttgccggcg ctgtccactg ccaccccact cggcgagagg cggaagtcga tgccggtgaa 1038661 cggcaacacg gtctgtccgg atgcctgcgt cggcgaccac gaaggtcgta agaccaggta 1038721 gccggcggcg gcgacgatgg ccaccagtac gatcgcggca gcgccgacga cggcccacac 1038781 cttccgtttg ttgccggccg gcggcacagc gtgtcccagg gaggcctgga gcgcattcgg 1038841 gacggcaggg gagtgtccgg tttggctggg ccagttcccg ccgcggctgt ccgctgccag 1038901 gggtccggcc acggtcgcgg agtccccggg cgaccatcgg gcagcacccg gggtcggtgg 1038961 gccggtgccc gccccggcaa tgccggactc ggactggctc aagcccgtat cggccggagt 1039021 ggccagcaag gttgcgttgt caccgcgccg cagaatcgtc gtggcctggt gttgctcgga 1039081 tgtggtgagt gcgtcatggg cggcgatggc cagatcacca gcgctcataa agcgctccgc 1039141 ggggtttttg gccatgcctt tggcgatcac ctgatccagg gccggcggca cgcgcccggg 1039201 ccgtagctgg ctgggctgcg gggcagggtc cattagatgc gcggcgatca accgctcaac 1039261 gctgtcggcc cgatacggtg gggcaccggt caaacactca cccaacacgc acgccaacgc 1039321 atagatatct gcgcgatagg tgacctcatc gccggtgaac cgctccgggg ccatgtagtt 1039381 gtaggttccc acggcggtcc cggtctgggt cagccccggg tcggaggcgg cacgggcaat 1039441 accgaaatcg accagatagg cgaagtcgct cgcggtgacc agaatgtttt ccggttttac 1039501 gtcgcggtgc gttacgccgt tggcatgcgc ggcatccaaa gcggcggcga tctggcgcac 1039561 gatggccaca gctcgggccg gggtcagcgg accatactgt ttcaataggg cgcgtaaaga 1039621 ggtgccgtcg atcatgcgca tttcgacaaa gaactgtccg ttgatctcgc cgtagtcatg 1039681 gatcggcacg atgtgtggct cggtcagccg tcccgcggtg tcggcctcgc gttgcatccg 1039741 tgctcgaaac accgcattgt cggagtactg cggcgagatc aacttcagcg ccaccacccg 1039801 gtgcttgcgg gtgtcctcgg cctcataaac ctcgcccatc ccgcctcggc ccagcagccg 1039861 caatagctga tacggcccaa attgcgaccc tacctgcgga acggcatcgc tcaccgtcga 1039921 attcccttca ctaggtcaag aaatagcatt caccgcggcc gccaattttg cttggaacga 1039981 tttgggcaac ggaatggagc cgtattggtc caggccttct tggcctggac caatcgcggc 1040041 ttgcataaac gcccttaccg cagtaccggt cgtcgcatcc gggtatttcg agcagacgat 1040101 ctcataggtc gccagcacga tcgggtaaga gccaggctgg gtgggcctgt agaacgacga 1040161 cgtgtccaat accaggtcgt tgccttgtcc catgatcttg gccccggcga ttgtcttgcc 1040221 gaccgactcg gtggtgatcg ccactggatc cggacccgcc gacgtgatga tctgggccat 1040281 gttcaactgc ttacccaccg caaacgacca ctcgttgtag gtgatcgacc cgtcggtcgt 1040341 ctgcagtagg gccgacgtgc cgttgttccc gctggcgccg acgccgacgc ccccgttgaa 1040401 cgtttcgctg gcgcctttgc cccacgcccc gttggatgcg ccgtcgaggt atttctggaa 1040461 gttgtccgac gtaccggact tgtcgctgcg gaagataacg ctaatcggtg ttggcggcag 1040521 gtcggtgccg gagttgaggg cttggatctg tggatcattc cacacggtga tggtgccgtt 1040581 gaaaatcttg gcggtagtgg gtccgtcaag attcagcgtg ctcacgccct tgatattgta 1040641 ggtgatcgcg atcgggccga acaccgtcgg caggtcccat gccggggaac cgcaccgctc 1040701 cgccgaccgg tcaggttgac cggtcgacgg attcaacggg acatccgagc cggcgaaatc 1040761 ggtttcgttg ttgagaaact gggtcacccc ggcaccggac ccgttggcgt tgtagtccaa 1040821 cgtgtagccc gggcacgatc gcacgtaggc atagacgaac tgctccatgg cattttcttg 1040881 tgcggtcgag ccgctggagt ggagctcctt cttgccgccg cagtgcaccg acccagacgt 1040941 gccgcctgcg cctgacgacg agctgttggt gccaccgccg catgctgtca acaccagtgt 1041001 gccggcggcc aacaggctta ccgctgcgcc ggatcgggcg aacttcacgc aactcctctc 1041061 gagggggtcg tggtggcgga tccactcgcc accggtggtc gccgagccac cgacccgggg 1041121 tcggtattcg agccgtcacc gttgtgcatc gaaagaggtc tgatcattga aatcctagcg 1041181 ttcaggaggg gccgctgata ctgagggtcg acggcgcgct ttgtccaagg agcatcccaa 1041241 ggagcatgta gtaccctgcg ccgatggcgt gtgaacggct cggcggccag agcggtgctg 1041301 ctgatgtcga cgccgctgcg ccggcgatgg cggcggtgaa cctcaccctg ggtttcgctg 1041361 gcaaaaccgt gctcgaccag gtgagtatgg gctttcccgc tcgtgcggtg acgtcgttga 1041421 tgggaccgac cggttcaggt aagacgactt ttttgcgcac cctaaaccgg atgaatgaca 1041481 aggtctccgg ttaccgctac agcggtgatg tgctgttggg cggacgcagc atcttcaact 1041541 accgcgacgt gctggagttt cgccgccggg ttggcatgct gttccagcgc ccgaatccgt 1041601 tcccgatgtc aatcatggac aacgtgctcg ccggcgtgcg tgcccacaaa ctggtgccgc 1041661 gcaaggaatt ccgtggcgtc gcgcaggctc ggcttaccga ggtcggcctc tgggacgcgg 1041721 tcaaggatcg gctcagcgat tcaccgtttc gactctctgg tggtcagcag cagttgttgt 1041781 gcctagcccg tacgcttgcg gtgaatccgg aggtgttgct gctcgacgag cccacctccg 1041841 cgctggaccc gactaccacc gagaagatcg aagagttcat ccgatcgctc gctgatcgcc 1041901 tcacggtgat catcgtgacc cataaccttg cccaggccgc ccgcatcagc gaccgggcgg 1041961 ccctgttctt cgacggcagg ctggtggagg aagggcccac cgaacagctg ttctcctcgc 1042021 cgaagcatgc ggaaaccgcc cgatacgtcg ccggactgtc gggggacgtc aaggacgcca 1042081 agcgcggaaa ttgaagagca cagaaaggta tggcgtgaaa attcgtttgc atacgctgtt 1042141 ggccgtgttg accgctgcgc cgctgctgct agcagcggcg ggctgtggct cgaaaccacc 1042201 gagcggttcg cctgaaacgg gcgccggcgc cggtactgtc gcgactaccc ccgcgtcgtc 1042261 gccggtgacg ttggcggaga ccggtagcac gctgctctac ccgctgttca acctgtgggg 1042321 tccggccttt cacgagaggt atccgaacgt cacgatcacc gctcagggca ccggttctgg 1042381 tgccgggatc gcgcaggccg ccgccgggac ggtcaacatt ggggcctccg acgcctatct 1042441 gtcggaaggt gatatggccg cgcacaaggg gctgatgaac atcgcgctag ccatctccgc 1042501 tcagcaggtc aactacaacc tgcccggagt gagcgagcac ctcaagctga acggaaaagt 1042561 cctggcggcc atgtaccagg gcaccatcaa aacctgggac gacccgcaga tcgctgcgct 1042621 caaccccggc gtgaacctgc ccggcaccgc ggtagttccg ctgcaccgct ccgacgggtc 1042681 cggtgacacc ttcttgttca cccagtacct gtccaagcaa gatcccgagg gctggggcaa 1042741 gtcgcccggc ttcggcacca ccgtcgactt cccggcggtg ccgggtgcgc tgggtgagaa 1042801 cggcaacggc ggcatggtga ccggttgcgc cgagacaccg ggctgcgtgg cctatatcgg 1042861 catcagcttc ctcgaccagg ccagtcaacg gggactcggc gaggcccaac taggcaatag 1042921 ctctggcaat ttcttgttgc ccgacgcgca aagcattcag gccgcggcgg ctggcttcgc 1042981 atcgaaaacc ccggcgaacc aggcgatttc gatgatcgac gggcccgccc cggacggcta 1043041 cccgatcatc aactacgagt acgccatcgt caacaaccgg caaaaggacg ccgccaccgc 1043101 gcagaccttg caggcatttc tgcactgggc gatcaccgac ggcaacaagg cctcgttcct 1043161 cgaccaggtt catttccagc cgctgccgcc cgcggtggtg aagttgtctg acgcgttgat 1043221 cgcgacgatt tccagctagc ctcgttgacc accacgcgac agcaacctcc gtcgggccat 1043281 cgggctgctt tgcggagcat gctggcccgt gccggtgaag tcggccgcgc tggcccggcc 1043341 atccggtggt tgggtgggat aggtgcggtg atcccgctgc ttgcgctggt cttggtgctg 1043401 gtggtgctgg tcatcgaggc gatgggtgcg atcaggctca acgggttgca tttcttcacc 1043461 gccaccgaat ggaatccagg caacacctac ggcgaaaccg ttgtcaccga cggcgtcgcc 1043521 catccggtcg gcgcctacta cggggcgttg ccgctgatcg tcgggacgct ggcgacctcg 1043581 gcaatcgccc tgatcatcgc ggtgccggtc tctgtaggag cggcgctggt gatcgtggaa 1043641 cggctgccga aacggttggc cgaggctgtg ggaatagtcc tggaattgct cgccggaatc 1043701 cccagcgtgg tcgtcggttt gtggggggca atgacgttcg ggccgttcat cgctcatcac 1043761 atcgctccgg tgatcgctca caacgctccc gatgtgccgg tgctgaacta cttgcgcggc 1043821 gacccgggca acggggaggg catgttggtg tccggtctgg tgttggcggt gatggtcgtt 1043881 cccattatcg ccaccaccac tcatgacctg ttccggcagg tgccggtgtt gccccgggag 1043941 ggcgcgatcg cgctggggat gtcgaattgg gagtgtgtcc gcagggtcac cctgccgtgg 1044001 gtgtccagcg gcatcgtcgg tgcggtggtg ctagggcttg gccgtgcgct gggggagacg 1044061 atggcggtag ccatggtgtc cggcgcggtg ctgggggcca tgcccgccaa catctacgcg 1044121 accatgacca ccatcgccgc caccatcgtg tcgcagctgg attcggcgat gaccgattcc 1044181 accaacttcg cggtgaagac gctcgccgag gtgggtttgg tgctgatggt gatcacgttg 1044241 ctgactaatg tggccgcgcg cgggatggtt cgtcgggtgt cacgcaccgc gcttccggtg 1044301 ggacgcggca tctgacatgg gcgaatcggc tgagtccggg tcccggcagc taccggcgat 1044361 gtccccgccg cggcgatcgg tagcctatcg gcgcaagatc gtcgatgccc tgtggtgggc 1044421 ggcgtgcgtg tgttgtctgg cggtggtgat caccccgacg ttgtggatgt tgatcggagt 1044481 cgtcagccgc gctgtaccgg ttttccactg gagtgtgctg gtgcaggact cccagggcaa 1044541 tggcggcggc ttgcgcaacg ccatcatcgg taccgcagtg ttggccatcg gggtgatcct 1044601 ggtgggtggc acggtgagtg tgttgaccgg gatttatctg tccgaattcg ccaccggcaa 1044661 aacacggtcc attctgcgcg gcgcctacga ggtgttgtcc ggtattccgt cgatcgtgct 1044721 cggctacgtc ggctatttgg ccctggtggt gtacttcgat tgggggtttt cgctggcggc 1044781 cggggtgttg gtgctgtcgg tgatgagcat tccctacatc gccaaggcca ccgagtccgc 1044841 gctggcccag gtgccgacgt cgtatcggga agcggctgag gcactcgggt taccagccgg 1044901 ctgggcgctg cgcaagatcg tgctgaagac ggcgatgccc ggaatcgtca ccgggatgtt 1044961 ggtcgcgctg gccctggcga tcggcgagac ggcgccgctg ctgtacacgg cggggtggtc 1045021 gaattcgccg ccgaccggac aactcaccga ctcgccggtc ggctacctga cctacccaat 1045081 ttggacgttc tacaaccagc catccaagtc ggctcaggat ctgtcctatg acgcggctct 1045141 cttgctgatc gtgttcctgc tgctattgat cttcattggc cggttgatca actggctgtc 1045201 acggaggcgt tgggacgttt gagttggcct tcgagcgcgc cttcacgctg gcctccagct 1045261 tggcgagcag gtcggagacg tcttcgggct cgtccagcaa cctcggttgg tcctcggcgg 1045321 taaatgcctg cccaccttcg agtttggtgt cgatcagctc ctgtaactgc tcctggtagg 1045381 tgtcgtggta gcggtccgga ttgaagtcgt cggccatcga gtccaccacc tggccggcca 1045441 tcttgagttc cgcgggtttg atctccacct tctggtccag caccgggaag tcggggtcgc 1045501 ggatctcatc gggccacagc aacgtgtgca ccatcatcac ctctcgcttg ccgaaatcct 1045561 tgacgcgcaa cgccgccagc ctggtcttgt tgcgcagcgt gaaatgcacg atcgccatcc 1045621 ggtcggtctc ggcgagtgtc ttagccagca gcacatacga tttcgacgac ttcgaatcag 1045681 gctccaaaaa gtagctgcgg tcgaacatca tcgggtccac gtcggcggcg gggacgaact 1045741 ccaacacctc gatctcccgg ctgcgttctt caggcaagct ggcgatgtcg tcgtcggtga 1045801 tcgccaccat ttggccgtcg ccggactcgt aggcccgggc aagatcgcgg tagtcgacca 1045861 cctcgccaca cgcctcgcag acgcgcttgt accggatgcg tccgttgtcc ttggcgtgca 1045921 cctggtggaa cctgatgtcg tggtctgcgg tagcgctgta caccttgacc ggcacgttca 1045981 ccagcccgaa ggcgatcgaa cccgtccaaa tggctcgcat gtaagtgagt atgccttgat 1046041 tgtccgcgag cggaacgtca cggcgaaatt ccacgcgata tttgaccgtg acgttacgct 1046101 cgcgacttgt gtgaccgaca ggctacgttg aaagcatggg ttcggcgtcg gagcaacggg 1046161 tgacgctgac caacgccgac aaggtgctct atcccgccac cgggaccaca aagtccgata 1046221 tcttcgacta ctacgccggt gttgccgaag tcatgctcgg ccacatcgcg ggacggccgg 1046281 cgacgcgcaa gcgctggcct aacggcgtcg accaacccgc gttcttcgaa aagcagttgg 1046341 cgttgtcggc gccgccttgg ctgtcacgtg caacggtggc gcaccggtcc gggacgacga 1046401 cctatccgat catcgatagc gcaaccgggc tggcctggat cgcccaacag gcggcgctgg 1046461 aggtgcacgt gccgcagtgg cggtttgtcg ccgagcccgg atcaggtgag ttaaatccgg 1046521 gcccggcaac gcgtttggtg ttcgacctgg acccgggcga aggcgtgatg atggcccagc 1046581 tggccgaggt ggcgcgcgcg gttcgtgatc ttctcgccga tatcgggttg gtcaccttcc 1046641 cggtcaccag cggcagcaag ggattgcatc tgtacacacc gctggatgag ccggtgagca 1046701 gcaggggagc cacggtgttg gccaagcgcg tcgcgcagcg attggagcag gcgatgcccg 1046761 cgttggtcac ctcgaccatg accaaaagcc tgcgggccgg gaaggtgttt gtggactgga 1046821 gccagaacag cggctcgaag accaccatcg cgccgtactc actacgtggc cggacgcatc 1046881 cgaccgtcgc ggcgccacgc acctgggcgg agctcgacga ccccgcactg cgtcagctct 1046941 cctacgacga ggtgctgacc cggattgccc gcgacggcga tctgctcgag cggctggatg 1047001 ccgacgctcc ggtagcggac cggttgaccc gataccgccg catgcgcgac gcatcgaaaa 1047061 ctcccgagcc gattcccacg gcgaaacccg ttaccggaga cggcaatacg ttcgtcatcc 1047121 aggagcatca cgcgcgtcgg ccgcactacg atttccggct ggaatgcgac ggcgtgctgg 1047181 tctcgtgggc ggtaccgaaa aacctgcccg acaacacatc ggttaaccat ctagcgatac 1047241 acaccgagga ccacccgctg gaatacgcca cgttcgaggg cgcgattccc agcggggagt 1047301 acggcgccgg caaggtgatc atctgggact ccggcactta cgacaccgag aagttccacg 1047361 atgacccgca cacgggggag gtcatcgtga atctgcacgg cggccggatc tctgggcgtt 1047421 atgcgctgat tcggaccaac ggcgatcggt ggctggcgca ccgcctaaag aatcagaaag 1047481 accagaaggt gttcgagttc gacaatctgg ccccaatgct tgccacgcac ggcacggtgg 1047541 ccggtctaaa ggccagccag tgggcgttcg aaggcaagtg ggacggctac cggttgctgg 1047601 ttgaggctga ccacggcgcc gtgcggctgc ggtcccgcag cgggcgcgat gtcaccgccg 1047661 agtatccgca attgcgggca ttggcggagg atctcgccga tcaccacgtg gtgctggacg 1047721 gcgaggccgt cgtacttgac tcctctggtg tgcccagctt cagccagatg cagaatcggg 1047781 gccgcgacac ccgtgtcgag ttctgggcgt tcgacctgct ctacctcgac ggccgcgcgc 1047841 tgctaggcac ccgctaccaa gaccggcgta agctgctcga aaccctagct aacgcaacca 1047901 gtctcaccgt tcccgagctg ctgcccggtg acggcgccca agcgtttgcg tgctcgcgca 1047961 agcacggctg ggagggcgtg atcgccaaga ggcgtgactc gcgctatcag ccgggccggc 1048021 gctgcgcgtc gtgggtcaag gacaagcact ggaacaccca ggaagtcgtc attggtggct 1048081 ggcgcgccgg ggaaggcggg cgcagcagtg gcgtcgggtc gctgctcatg ggcatccccg 1048141 gtccaggtgg gctgcagttc gccgggcggg tcggtaccgg cctcagcgaa cgcgaactgg 1048201 ccaacctcaa ggagatgctg gcgccgctgc ataccgacga gtcccccttc gacgtaccac 1048261 tgcccgcgcg tgacgccaag ggcatcacat atgtcaagcc ggcgctggtt gcagaggtgc 1048321 gctacagcga gtggactccg gagggccggc tgcgtcaatc aagctggcgt gggctgcggc 1048381 cggacaagaa acccagtgag gtggtgcgcg aatgaagtgg gtgacgtatc gaagtgacca 1048441 cggcgaacga acgggagtgc tttccggtga cgccatctac gcgatgccgc cggacgtgtc 1048501 gttgctggat ctggtcgggc gcggcgccga cggtctgcgc acggcgggcg aacgggcagt 1048561 gcgctcaccg gccgcggtgg tagcgctcga cgaggttacg ctggcggcgc cgattccgcg 1048621 cccgccgtcg atccgggact cgttgtgctt tctggaccac atgcgtaact gccaggaagc 1048681 gatggggggc ggccgggtgc tcatggatac ttggtaccgc atcccggcgt tctacttcgc 1048741 gtgcccgtca acggttttgg gaccgtacga cgacgcaccc accgcacccg gaagtgcgtg 1048801 gcaggacttc gaattggaga tcgcggcggt tatcggaacc agcggcaaag acttgaccgt 1048861 cgagcaggcc gaacggtcga tcatcggcta taccattttc aacgactggt ccgcacggga 1048921 cctgcagatg ctggagggcc agctgcgcat cggacaggcc aagggcaaag acagcggtat 1048981 caccctgggc ccctatctgg tcacaccgga tgagctggag ccctattgcc ggggcgggaa 1049041 gctaagcttg cgggtgatcg ccttggtcaa cggcaccgtg atcggatcgg ggtcgaccgc 1049101 acagatggac tggagcttcg gcgaagtcat cgcctatgcc tcgcgggggg tgacgctgac 1049161 cccgggtgac gtgttcggct cgggcacggt gcccacctgc acgctcgtcg agcacctcag 1049221 gccaccggaa tcattcccgg gctggctgca cgacggcgac gtggtcaccc tccaggtcga 1049281 agggctgggc gagacgaggc agaccgtccg gacgagcggc actccttttc cgttggctct 1049341 tcggccgaat ccggacgccg aacccgaccg gcgcggggtc aacccggcac cgacgcgggt 1049401 gccgtttacc cgcgggctgc acgaagtcgc cgaccgggta tgggcgtgga cgctgcccga 1049461 cgggggatac ggcttcagca acgccgggct ggtcgccggg gacggcgcgt cgctgctcgt 1049521 ggataccctg ttcgacctgg cactgacacg cgagatgttg gccgcgatga agccggtcac 1049581 cgagcgggcg cccatcaccg acgccctgat cacgcactcc aacggcgacc acacgcacgg 1049641 cactcaactg ttggaccgct cagtgcgcat catcgccgcc aagggcacct ccgaggagat 1049701 cgagcatggc ccggcaccgg agatgctagc ccggatccaa accgccgacc tgggccccgt 1049761 tgcgacgcgg tatctgcgtg atcgcttcgg tcactttgac ttcagcggca tcaagctgcg 1049821 caacgccgac ctgacgttcg accgcgacct ggccatcgag ctcggcggcc ggcgagtcga 1049881 cctgctcaac ctcggtcccg cgcacaccac cgccgactcg gtcgtgcacg tggccgacgc 1049941 cggtgtgctg ttcgccgggg atctgctgtt catcggttgc accccgattg tgtgggcggg 1050001 cccgatcgcc aactgggtgg cggcctgcga cgcgatgatc gcgctggacg cgcccacggt 1050061 ggtgcctggg catggtccgg tcaccggccc ggacgggatc cgtgccgtcc gtggctatct 1050121 ggcgcacatc gccgaacagg ccgaggcggc ctaccgcaag gggctatcgt tgcccgaggc 1050181 cgtcgagacc atcgacctgg gcgagtacgc gagctggctg gactccgaac gggtagtggt 1050241 caacgtctac cagcgttacc gcgaattgga tcccgacacc ccgcgccagg acttgctggc 1050301 gttgctggtg atgcaggccg aatgggcggc gcgccactgt acgtagccac tcgggcgcgt 1050361 ttgtcacggg aatctgcgga ccggcgggcg catggtttgc ctgtccacga gcgacaaagc 1050421 cagcgcgcca aggattcccg atggcagcca tcactttgtc gcgctgaggc gggcacgaag 1050481 aacatcccgt ccagacagcg gccaatgtgg cgggtgtgaa aggcgccgcc gagcatggca 1050541 ccgggtccaa cggctctcac gaagctgatc ggggatcgat ccgttgtgat gcttaaactt 1050601 tcgcgatgac gttctcggcg aacatctcca gattgcggat cttggtctgc agcggctcgg 1050661 tgtcggggcc catggtgtac gggacacgga aaccgacgat gacgtccgtc acccctttgt 1050721 cctcgagccg cttgacgccg tccacggtga aaccgtccag ggagatcacg tggatttcga 1050781 acgggctggt tttccccgct tcctcgcgaa gccgcttgac cctggcgatc agccggtcga 1050841 gttcgtccgg atcgccgccg ccatgcatcc atccatcggc gcgcgccgcc cgtcgcagtg 1050901 ctgcatcggc gtggccaccg accaggatcg ggatcggctg ggtgggcgcc ggggtcatct 1050961 tggtcttggg tatgtcgtag aactcgccgt ggaactcgaa gtaatcgccg gtggtaaggc 1051021 cacgcacgat ctcgatgcat tcgtcaatcc gcttgccgcg cttagcgaac gggacgccca 1051081 tcagctcgta atcctccggc cacgggctag tgccgacacc cagcccgacc cggttgccga 1051141 tcagggcggc tagggaaccg gcctgctttg ccaccagagc cggcgggcgg atgggcagct 1051201 tgaggacgaa gaagttgaac cgcagcctcg tcgtgactgc gcccaatgct gctgtcagga 1051261 caaaggtttc gatgaaaggc ttgccgtcca tgaattcgcg gttgccgtcg ggtgtgtacg 1051321 ggtacttcga gtcggattcg aaggggtagg cgatgctgtc gggaatcgtc atgctgctgt 1051381 atcccgccgc ttcggctgcc ttggccagcg ggatgtagaa cgtgaagtcg gtcattgcct 1051441 ccgcgtagct gaaccgcacg tgattgcctt cctcgaagtg gccgtcccca acgagattag 1051501 aacgtgttct aatttgacgt gcaagcgggg cgcaacggct tggtcagagt tggttctccg 1051561 gcccaataat tgcccagacc gtcttgcccg acgaagtggg actgctgccc caggcgcggg 1051621 acaacgcggc aacgatcgcc aggccggaaa cgtcgatgcc cttcggtggg gacgccagcc 1051681 gaaccgccgg agcgctgctg ccgtcggaaa ccgcgatggt tgccgttggg ccatcgcttt 1051741 cgatccgcat caccgggtcg cttccggtgt gtttcagcac gttctccacg aatacgttga 1051801 cgacgaccaa cgcgactgga ataagcccgg gacgtgacca ttgggtgagc cattcgcgga 1051861 ccaactggcg tgactcgcga aggctgttca ggttggcggg cagttgtgcg tccgaacgct 1051921 tgaaattgcg gcgcgcgagc cgaccgatgg ccttgctcgc cgctttttcg gtcgggtaca 1051981 ccggcatgaa gcgggcgacc ccggtgcggg tgaccgccgc gcggccggcc cgatggccgc 1052041 agaccagcaa gaccggtaca tccgctcgga agtcggcctg ccagcgggcg ctgataaaga 1052101 ccgaccatgc cgattcctcg gcgacttgca gctcggtgac attgacgata acggcggacg 1052161 gctgctcgag cgtcgccctc gtgaggctgt cccggagcag tgcagaactg ctggagtcaa 1052221 gcgcaccgtc ggcggtcaag atgaccaccg aatcctgtgt acgtaccgca atggccagcg 1052281 ctgtcggtga cttggctgcc gtgctcaccg cgaccacttc cttgcgtccc ttgccccggc 1052341 gtcaggtgca catcgcaact tgggtcggag tgccaccata gccatggttc cgaaacggcg 1052401 ggacgccatg aaccggcatt ccggtcccat cctgtcgtcc ggtttcatag ccagctcctc 1052461 gaactcctgt cccgccaata gcttgaggat gccgtccgcc ttggcggcag aaaccctatc 1052521 ttttgatgat cgcgccgtcc ggcgcagcac ccatcaccca gggggtggtt acccacaaaa 1052581 acacgcgatc aacctccagt ccgggctatg cccagcctat gcaaacgcca gcaggtaggg 1052641 cccgggaatc cggccaacaa agatcaacga acgccgcgcc ggcgccggga tgcgttcaag 1052701 tggtggccga ggctgggccg cttcgggcat agggcggtgg gcccactccg gcgaccgagt 1052761 gggtacccca cggtgtttgt tcagtgatgc gtgcgggtgc gctacgtccg ccgatggtta 1052821 acgtcgccgc ccgggcatgg gtgagtgaag tctcgggcaa ggaatcgaat acggtgccct 1052881 gccagtggta gttgccgtcg atcggatcga ggtgaccggt aagccggacg cggacccgaa 1052941 agcgggcacc agcgagcgtt agcgtcgccg caccgtcgta ggtctgatcg tcctcggtcg 1053001 ccgcggatga caagtcgaac gcttcgaggc ccccagtctg ccgatggggc tgagcgggtt 1053061 tgagttgggc gcgctcgttg aatacctgct ggctgctgcg gcgcacctcg atgcggcggc 1053121 tggccgtgcg ctccatgagc ttcatgcatt cgacgacgca gcgtgcctgc gcggcggtat 1053181 cgggcccggt gatgaagaag tagttgggga aaccgtgaac ggcgacgccg aggtagggct 1053241 ccatgccatc gtcccaggct tggcggatgg tcacaccgcc ggcaccgacc agggtctgat 1053301 cgccgacctg atcggcgatc gcgaacccgg tgccgtagat gatggcgtcg acggggtgtt 1053361 ccacgccatc gctggtgcgg atgcccgagg aggtcagcgc gtcgatcgcc gccgtcgccc 1053421 aggcgaccgc tggatgctca gccccggtgc ggcgacgtag ccagcgtttg gcgcgtgtcg 1053481 tccacagtgg tactccggtg acgacgcggc gcggtgcctg ggtgaagacc gtgaccgacg 1053541 ccgccgattc agacaaccgg ctgatgtagt gggcggcggc ggcatcggtg ccgaccaccg 1053601 cgatgcgttt gccggccggg tcgaaatcgc ggtcccatgc cgccgaagtg ggcctgatgg 1053661 gcccgatcgc ccgtcgcgcc tggaaacgca ccaactttct gtgaccgcga cgctcggcct 1053721 cgctgacgcc ggccaccgca ttgtcatcgt cggcaggggt gctggtggcc gggacgccgc 1053781 agccgcgcgc gctcgggccc gatgcgctgg acgtcagcac cgacgacctg gccgggctgt 1053841 tggccggcaa caccggccgg atcaagaccg tcatcaccga ccagaaggta attgccggca 1053901 tcggcaacgc ctatagtgac gaaatcctgc acgtcgcgaa gatctcgccg ttcgccacgg 1053961 ccggcaagtt atccggcgca cagctcacct gcctgcatga ggcgatggcg tcggtgctgt 1054021 cggacgcggt gcgccggtcc gtcggccagg gcgcggccat gctcaaaggg gagaaacgtt 1054081 ctgggcttcg agtacatgcg cgcaccgggt taccctgccc agtgtgcggt gacaccgtgc 1054141 gggaggtgtc cttcgcggac aagtcttttc agtactgtcc aacgtgtcag accggtggca 1054201 aggcgctggc cgaccggcgt atgtcgcggc tgctcaagta gtcgatatgc tcaccggagt 1054261 gactcgccag aagatcctga tcaccggcgc cagttccggc ctgggcgccg ggatggcccg 1054321 atccttcgcc gcccagggcc gcgacctggc gctctgcgcc cgccgcacgg atcggctgac 1054381 cgaactgaaa gccgaactgt cgcaacggta tcccgacatc aagatcgctg tcgcggagct 1054441 ggacgtcaac gaccacgagc gggtgcccaa ggtattcgcc gaactcagcg atgagattgg 1054501 cggcattgac cgtgtgatcg tcaacgccgg aatcggcaag ggtgcccggc tgggctcggg 1054561 caagctgtgg gcgaacaagg caaccatcga aaccaacctg gtcgccgcac tcgtgcagat 1054621 cgaaacggca ctggacatgt tcaaccagcg cggttcgggg catttggtgc tcatctcctc 1054681 agtgctcggc gtcaaagggg tgccgggcgt caaagccgcg tatgcggcaa gcaaagccgg 1054741 tgtgcgctcg ctaggcgaat cgctgcgcgc cgagtacgcc caacgcccca tcagggtcac 1054801 ggtgctggag ccgggttata tcgagtcgga gatgacggcc aaatcggcga gcacaatgtt 1054861 gatggtggac aacgcaactg gcgtcaaggc gctggtggcc gccatcgagc gcgagcccgg 1054921 acgcgccgcg gtcccctggt ggccatgggc gccactggtg cggctgatgt gggtgctgcc 1054981 gccgcggctg accagacgct tcgcctagcg ggcgctcggc cacctagccc gcgcggccac 1055041 gttcggtgcg gtagcggcgc accagcccgt cggtcgagct gtccgactgc ggtggcggtg 1055101 aaccggcgcc ggtgattacc ggaagcagcg ccttggcctg cgtcttgccc agctccaccc 1055161 cccactggtc gaacgagtcg ataccccaca ccacaccctc ggtgaacacc tgatgctcgt 1055221 agagcgcgat caactgcccc agcaccgacg gcgtgagccg actggccaga attgaggtgg 1055281 acggccggtt gccgggcatc accttatgcg ctaccacgtg ggcgggggtg ccgtcggcgg 1055341 cgatctcctc ggcggtcttg ccgaacgcca gcacctgggt ttgggcgaag aagttgctca 1055401 tcagcagatc atgcatgctg ccggtgccct cggcggtcgg caggtcgtcg aggggttgag 1055461 caaagccgat gaaatcggct ggcaccagcc gggtgccctg gtgcagcaac tggtagaagg 1055521 cgtgctggcc gttggttccc ggttcacccc aaaagatttc accggtgtcg gcgctgaccg 1055581 ggctgccgtc ggcgcgcgtg gacttgccgt tggattccat ggtcaactgc tgaaggtagg 1055641 ccggaaaacg cgacaagtca ttggaatacg gcagcacggt gcgtgattgc gcaccgaaga 1055701 aattggagta ccacagtccg atcaggccaa gcagcaccgg cgcgttggat tccagcggag 1055761 cggtcgcgaa atggcggtcg atgatgtgga atccggccaa gaaatcggcg aaggcgtcgc 1055821 ggccgatcac cgtcatcaac gacagcccga tcgccgaatc caccgaataa cgcccgccga 1055881 cccaatccca aaaaccgaac atgttgtcgg tgttgatgcc gaagtcgtcg accaggcgct 1055941 tgttggtgga caccgcgaca aaatgccgcg acaccgcggc gtcgcccagc gcatcggtca 1056001 gccagcgacg cgccgcggtc gcattggtca atgtctccag cgtcgagaac gtcttcgacg 1056061 cgacgatgaa aagcgttgtg gcggggtcta gatcggcgag cgtggcgatc aggtcggcgg 1056121 gatcgacgtt ggacacgaag cgcgcggaaa tgcccgcgtc ggcatagtgg cgcaacgctt 1056181 ggtacaccat caccggaccc aaatccgaac caccgatgcc gatgttgacg acggtgctga 1056241 tccgctttcc agttgctccg gtccactcgc cgctgcgcag gcggtcggtg aaggcgccca 1056301 tcgcgtcgag cacggcatgt acgtcggtga cgacgtcttg gccgtcgacg acgagttcgg 1056361 cgtctcgggg cagccgcagc gcggtgtgca acaccgctcg atcctcagag gtgttgatat 1056421 gcacaccggc gaacatctgg tcgcgacgct cttcgaggtg ggccgtccgg gccagatcga 1056481 tcagcagcgc cagcgtctcg cgggtgacgc ggtgtttgct gtagtcgatg tagagatcgc 1056541 cgacgctgac ggtgagctcc cggccgcgac ccggatcgtc ggcgaagaac tggcgaagat 1056601 gggtgtttcc gatctgatcg tgatgtctgc gcagggcgtc ccatgccggg gtagcggtga 1056661 tgtcggggat tggcgcggag gtcatggttc gaccctaatg ccgtggagtg gcgtcgatca 1056721 gagccgctgt cttcgccgag cctttagtta tcgtgctcgg cggcactcgc cgtttgtcgc 1056781 ggtatctaca ggctcggcga tgcgggcctg cgctctcgcg gcctcggccc ccgccgaggc 1056841 cgctgaccgt cgcccagcac ccgctgcaga tcaggcagca tggcctgcaa tggcgcacgc 1056901 cagtacgccc aggtgtgggt ttcgccatcc gggaagttga accgccgttg cgccgcttgg 1056961 tacttgcttt gcaaagcctg tcgcctattg ccgctcaact ccgccgaggg cgtgccgttg 1057021 cccgaatacg cccagatacg gggtgctgtt ggccaccagc ttcgcggcat tgaccgttgg 1057081 ctcgctgtgg gcccacgccg gatcggtcgg cgggcccccg gatcaatgac gcggatccgg 1057141 caaccacgcc atttccactc gggatgatcc cctcactcgc cgccagccag tcggccagct 1057201 cttgggccat atcccgccgt tgccgaccgc cggccggtgc cagttggaat agaactcgcc 1057261 atgccaccgg tgggcatggc gagcgacaga ccggtctggt caaccgaata ccggcaggtg 1057321 tatatcccgg ccgttgtagt cgtctcgggc gagtatgccg tcggacaagt accacgcatg 1057381 agggccgcca ccttgaaact ccaccttgat taggcgatgc atcgaccgcg acgggaccat 1057441 cagtcgatgg gtagaccccg ccgcgactac gttgacgagg ttgtcgagct ggcgccttgc 1057501 ccgagcagcg cgatcaaagc tgccgcccat atgaccatca accggcgcaa ttgtgaaact 1057561 ccagctgcct ttgctgcatc catttcggcg gaaattcagc gcagcgatgc agaaattccc 1057621 ggcaaacagc ggcggaagtg acccattagt gaccgaggcg gcccctgccc aatcgcaaaa 1057681 gcaggatggc cagatcctta ccgtcgggtc ccagctcgct gtagcgttcg atgaccttca 1057741 tctcccggct gtgtaccagc cgagtgccac cggacgccat ccgggccttg ccgatggcct 1057801 tggaaacctc agcgcgtcgc ttgactaacg cgaggatttc ggcgtctagc cggtcgatct 1057861 cttcgcgcag cgtgtcgatc tcggggacag gttgggactc gagcatttcc aggttcatgg 1057921 ctgctaactc cgcgttctcg tgatgtgggg gttctggtct catccggtac tgggcctcac 1057981 acaagagacg agccccgaat ccggaagcgg accacggggc tctgcgaaag cagctagacc 1058041 acgggcaccg ctggccggta cccgtagaaa aatcggcgct gcgcgttgag cacgaaccga 1058101 gtgtgccatc aacggacgcg cccgcgcaaa aacttggcgg gaaaagtgca cccaaaattg 1058161 ggtggtggcg ccgaaggacc tgccgcgtgg cgatgagcct ggccaggcta tgccgcggtc 1058221 cgccgactcg tcgccgcgcg gcggtaagtt tggaccgaca tgagtgtgca cgcgaccgac 1058281 gccaagcctc ccggtccatc cccagcggac caactgctcg acggcctcaa cccgcaacag 1058341 cgccaggcgg tcgtgcatga gggttcgccg ctgctgatcg tcgcgggcgc gggttcgggt 1058401 aagaccgcgg tgttgacccg ccgcattgcc tatctgatgg cggcccgcgg cgtcggggtg 1058461 ggccagattc tggccatcac cttcaccaac aaagccgccg ccgagatgcg cgaacgggtg 1058521 gtgggcctgg ttggggagaa ggcccggtac atgtgggtgt cgacgtttca ctccacctgc 1058581 gtgcgtatcc tgcgcaacca ggcggcgctg atcgagggcc tcaactccaa cttttcgatc 1058641 tatgacgccg acgattcgcg gcggttgctg cagatggtgg gccgcgacct gggcctagac 1058701 atcaagcggt actcgccgcg actgctggct aacgccatct ccaacctgaa gaacgagttg 1058761 atcgacccgc atcaggcgct ggccggctta acggaggact ccgatgacct agcgcgcgcc 1058821 gtggcgtcgg tttatgacga ataccagcgg cggctgcggg cggccaacgc gctggacttc 1058881 gacgacctga tcggcgagac cgtcgcggtg ctgcaggcct tcccgcagat cgcccagtac 1058941 taccgtcgga ggttccggca tgtcctggtt gacgaatacc aggacaccaa ccacgcccag 1059001 tacgtattgg tgcgcgagct ggtcggccgc gacagcaatg acggtattcc ccccggcgag 1059061 ttgtgcgtcg tcggggatgc cgatcagtcg atctatgcgt tccgcggcgc caccatccgc 1059121 aacatcgaag acttcgaacg tgactacccc gacaccagaa ccattctgct ggaacagaat 1059181 taccgctcga cgcagaacat cctgtcggcg gccaactcgg tgattgcccg taacgcgggg 1059241 cgccgggaga agcggttgtg gaccgacgcc ggcgccgggg agttgatcgt tggctatgtc 1059301 gccgacaacg agcacgacga ggcccggttc gtggccgagg agatcgatgc gctcgccgag 1059361 ggtagcgaga tcacctacaa cgatgtcgcc gtcttctacc gcaccaacaa ctcgtcgcgg 1059421 tcactggaag aggtgctgat ccgcgccggt attccgtaca aggtcgttgg gggagtgcgc 1059481 ttttacgagc gcaaggagat tcgcgacatc gttgcctacc tgcgcgtgct ggacaacccg 1059541 ggcgacgcgg tcagcctacg gcgcatcctt aacaccccgc gccgcggtat cggggatcgt 1059601 gccgaggcgt gtgtggcggt gtacgccgag aacaccggcg tcggcttcgg tgacgcgctc 1059661 gtcgccgcgg cccaaggcaa agtaccgatg ctgaataccc gggcggagaa ggcgatcgcg 1059721 ggtttcgtcg agatgttcga cgagctgcgg ggccgcctcg atgacgacct gggggagctg 1059781 gtcgaggcgg tgctggaacg caccggatac cgccgcgagc tggaagcgtc caccgatcca 1059841 caggaattgg cccgcctgga caacctcaac gaattagtca gcgtcgcaca cgaattcagt 1059901 accgaccggg agaatgccgc cgcacttggc ccagacgacg aagacgtccc cgacaccggt 1059961 gtgctggcgg attttctgga acgggtgtcg ctggtcgccg acgccgatga gatcccggag 1060021 catggcgcgg gtgtggttac cttgatgacc ttgcacaccg ccaagggttt ggagttcccg 1060081 gtggtgtttg tgaccggctg ggaggacggg atgttcccgc acatgcgggc gttggacaac 1060141 ccgaccgagt tgtccgagga gcggcggctg gcctatgtcg gcatcacccg cgcccggcag 1060201 cggttgtacg tgagccgggc gatcgtgcgt tcgtcttggg gccagccgat gctcaacccg 1060261 gagtcgcggt ttctgcggga aatcccgcag gagctcatcg actggcggcg caccgccccg 1060321 aagccgtcgt tcagtgcccc ggtgagtggc gccggtcggt tcggtagcgc gcgtccatca 1060381 ccgacccgct cgggggcgag caggcgcccg ctgctggtgc ttcaggtcgg cgaccgcgtg 1060441 acccatgaca aatacggcct gggccgtgtc gaggaggtct ccggtgtcgg cgaatcggcg 1060501 atgtcgctga tcgacttcgg tagctcgggg cgggtgaagc tgatgcacaa ccacgcccct 1060561 gtcaccaagc tctgagattt cgcgccgagc gtgaagtcac ggcggctatt tcgcggattt 1060621 ctcgccctga gaacacgttc ggcgtcgttg ccgggtcaac cggtgtaatt gccgacgcta 1060681 agtccccgct tggcgagcca cggcactggg tccacgcgct cggtgccgcc caggagcacc 1060741 tcgaagtgca ggtgcgggcc ggtggaaaag ccacggctgc ccatggtggc gatctggtcg 1060801 cctgccatca cgcgctcacc gacgctgacc aacgtggtat tgacgtggcc gtatagcgtg 1060861 accgtgccgt cggcgtgcag cagcttgacc cacattccgt agccggcggt ggggccggcg 1060921 tcgatgacga cgccgtcgga caccgcataa atcggggttc cgatcgcgtt agccaggtcg 1060981 ataccggcgt gcagtacacc ccatcgataa ccgaaactcg acgtgaagat gcccttcgtc 1061041 ggcatgacat acagcgggcg ctgtagtcgc gcctcgcgct cggcgcgctc ctcggcgaag 1061101 gcaacccccc tggcgaactc cgcgttgtgc accgcagcac tcgccgccgg ctgggcggcg 1061161 atgacctgga cgccccgcgg tgggttgctt cccgaccctt cgttgagcgc cgatgcatga 1061221 gcggtcagca cggtctcggt gcgtggggtt tccgactgtt ggatcgccgt atgcgctgct 1061281 gcggccgccg cgcccgcggc catcgccgag atcagcaggc gcccccgggc cgcaccgatc 1061341 ggttgcttgc ggtgctgccc gacgcgccgg gacaccgggg tgacctccgg ggtcagcacg 1061401 accgtggggg ctaccagcca ttcgggggcc agatcgtcgg cgtcgtccag gtcgtcgagt 1061461 tctggagccg ctagcaactg cgcttcgtag tcgaagacgc agtcgtcccc taggtccagg 1061521 tcatcgagct ctgcgaaatc cagctcatcg tagagtgcta agccgtcgag gaatccgtcc 1061581 agcgggatga tttcggtgac ttcgttacgg tgatgatgcg gccaacgatc gcgaggtgtg 1061641 cgaatcgctg ccatggcagc agaacgggcg atacggtgct gggacaaatc tgaaatgtcc 1061701 tcggatcgtg accataacgt tatctggacc ctgagacgtt atccgcaacc ggatggtagt 1061761 ggcaacttca gcgcggaatt cggctgtgat tgtgagttgg atcacgtttc ggctggacaa 1061821 acatatcggt gagctgtgcc acaccgggtg gatgcggccg cggagttaat cggcggtctc 1061881 gatacagttc tccgtgcgag tcgccgattt cggcaccgcc tacctattgg tcgagcagta 1061941 agccgagcga agacggtgag cccatggatc ttttcgagta tcaagccaag gagttattcg 1062001 ccaagcacaa cgtgcccagc acgccgggtc gggtgaccga cacagccgag ggtgccaagg 1062061 ctatcgccac ggagatcggg cgtccggtga tggtcaaagc gcaggtcaag atcggcggcc 1062121 ggggcaaggc cggtggcgtc aaatacgccg cgaccccaca agacgcgtac gagcacgcca 1062181 agaacatcct cggcctggac atcaaaggac acatcgtcaa gaaactgctg gtcgctgagg 1062241 ctagcgatat cgccgaggag tactacctat ccttcctgct cgaccgggcc aaccgcacct 1062301 acctggcgat gtgctcggtg gagggcggca tggagatcga agaggtagcg gccaccaaac 1062361 ccgagcggct cgccaaagtc ccggtgaatg ccgtcaaggg cgttgaccta gatttcgcgc 1062421 ggtccatcgc cgaacagggt catcttccgg ccgaggtgct cgacaccgca gcggtcacca 1062481 tcgccaagct gtgggagctc ttcgtcgccg aggacgcgac gctggttgag gtcaacccgt 1062541 tggtgcggac gcctgaccac aagatcctcg cgctggatgc caagatcacc ctcgacggca 1062601 acgccgattt ccgtcagcct ggccatgccg agttcgagga tcgagctgcc accgatccac 1062661 tggagttgaa ggccaaggag cacgacctca actacgtcaa gctggacggt caggtgggga 1062721 tcatcggcaa tggcgcgggc ttggtgatgt cgactctcga cgtcgtcgcg tatgccggtg 1062781 agaagcacgg cggagtcaag ccggccaact tcctggatat cggcggcggc gcttcggccg 1062841 aggtgatggc cgcgggtctg gacgtggtgc tgggcgacca gcaggtcaag agcgtgttcg 1062901 tcaacgtctt cggtggcatc acctcgtgcg atgcggtggc gaccgggatc gtcaaggcgc 1062961 tgggcatgct gggtgacgaa gccaacaagc cgctggtggt tcggctcgac ggcaacaacg 1063021 tcgaggaagg ccgtcgcatc ctgaccgagg ccaaccaccc cctggtgaca ctggtggcga 1063081 cgatggacga agccgccgac aaggccgctg agctggcgag cgcctgagcg aaaggaccca 1063141 tgactcacat gtccatattt ctgagcaggg acaacaaggt cattgtgcag ggcatcaccg 1063201 gcagtgaggc caccgtccat accgcgcgaa tgctgcgggc gggcacgcaa atcgtcggcg 1063261 gtgtgaacgc acgcaaagcg ggcaccaccg tcacgcatga ggataagggc ggccggctga 1063321 tcaagctgcc ggtgttcggc agtgtcgcgg aggcgatgga aaagaccggc gccgatgtgt 1063381 cgatcatctt cgtgccgccg acgttcgcca aggacgccat catcgaggcc atcgacgccg 1063441 aaattccgct gttggttgtg atcaccgagg gaattccggt gcaggacacc gcctatgcct 1063501 gggcctacaa cctcgaggct ggccacaaga cccgcatcat tggccccaac tgtcctggca 1063561 ttatcagtcc cggtcagtcg ctggccggta tcacgccggc caacatcacc ggacccggtc 1063621 caattggtct ggtgtccaag tcggggacgt tgacctacca gatgatgttc gaactgcgcg 1063681 accttggatt ctccacggcg atcggcatcg gtggtgatcc ggtgattggc actacccaca 1063741 tcgacgccat cgaggccttc gagagggatc cggacaccaa gctcatcgtg atgatcggcg 1063801 agatcggtgg tgacgccgag gagcgggccg cagacttcat caagaccaac gtgtccaagc 1063861 cggtcgtcgg ctatgtcgcc ggatttaccg cacccgaagg caagacgatg ggccacgccg 1063921 gcgccatcgt ctccggctcg tctggcacag cggcggccaa gcaagaggcc ctggaggccg 1063981 ccggtgtgaa ggtcggcaag accccatcgg cgaccgcggc gctggcccgg gagatcttgc 1064041 tcagtctcta gggcgagcag acgcataagc ccccgcacgc tcggcgtgtc gggggcttat 1064101 gcgtctgctc gccctatacg caacaggcca acttggcggc cagccgctcc acgtacgcgg 1064161 ctgcgtcgtc tgcagacctg tccggcatac cgaacagcac ctccgtaacg ccaagctcgg 1064221 cccagcgcgc cagcttgtcg ggcaccggtt tgacgtccag ggccacgatc tgtggaagcc 1064281 cgtcgcggcc ggcggccgcc cagatgtctt gcagtaactt caccggctcg tcgatgtcga 1064341 cgtcgcgtgg agtggtgatc cagccgtcgg cgctgcgcgc gatccacttg aagttcttct 1064401 ccgtccccgc agcgcctacc agcaccggga tgtgcggctg caccggcttg ggccaggccc 1064461 agctaggtcc gaacttgacg aactcgccgt catagcaggc ctcctcttgg gtccacaacg 1064521 cccgcatcgc ctcgaggtat tcgcgcagca tggtgcggcg gcgtccgggt ggcacaccat 1064581 gatcgacgag ctcgtcggtg ttccagccga acccgacccc gacgctgacc cggccgtgcg 1064641 acaaatgatc cagcgtcgca atgcttttcg ccagcgtgat cggatcatgc tcgaccggca 1064701 gcgccaccgc ggtggcaagc cggatccgcg acgtcaccgc cgatgctgct cccaggctca 1064761 cccacgggtc caacgtgcgc atatagcggt cgtccggcag cgaagcgtca cccgtcgtcg 1064821 gatgggccgc ctggcgcttg accgggatgt gggtgtgttc gggcacgtaa aacgtgcgaa 1064881 acccgtggct ttcagcaagt ctggcggccg cggccggggt gatgccgcgg tcgctggtga 1064941 acagcacaag tccgtagtgc atgcaccgaa ttagaacgtg ttccacctgc gccgggcaag 1065001 cggccgtcca gtcgttaatg tcgcgagcgc cggtcgctcc ggcagcggca cccgaacgtg 1065061 cgctagcgtg gttgatcgaa tcgcgtcgcc gggagcacag cgtcgcactg caccagtgga 1065121 ggagccatga cctactcgcc gggtaacccc ggatacccgc aagcgcagcc cgcaggctcc 1065181 tacggaggcg tcacaccctc gttcgcccac gccgatgagg gtgcgagcaa gctaccgatg 1065241 tacctgaaca tcgcggtggc agtgctcggc ctggctgcgt acttcgccag cttcggccca 1065301 atgttcaccc tcagtaccga actcggcgga ggtgatggcg cagtgtccgg tgacactggg 1065361 ctgccggtcg gggtggctct gctggctgcg ctgcttgccg gggtggctct ggtgcctaag 1065421 gccaagagcc atgtgacggt agttgcggtg ctcggggtac tcggcgtatt tctgatggtc 1065481 tcggcgacgt ttaacaagcc cagcgcctat tcgaccggtt gggcattgtg ggttgtgttg 1065541 gctttcatcg tgttccaggc ggttgcggca gtcctggcgc tcttggtgga gaccggcgct 1065601 atcaccgcgc cggcgccgcg gcccaagttc gacccgtatg gacagtacgg gcggtacggg 1065661 cagtacgggc agtacggggt gcagccgggt gggtactacg gtcagcaggg tgctcagcag 1065721 gccgcgggac tgcagtcgcc cggcccgcag cagtctccgc agcctcccgg atatgggtcg 1065781 cagtacggcg gctattcgtc cagtccgagc caatcgggca gtggatacac tgctcagccc 1065841 ccggcccagc cgccggcgca gtccgggtcg caacaatcgc accagggccc atccacgcca 1065901 cctaccggct ttccgagctt cagcccgccg ccaccggtca gtgccgggac ggggtcgcag 1065961 gctggttcgg ctccagtcaa ctattcaaac cccagcgggg gcgagcagtc gtcgtccccc 1066021 gggggggcgc cggtctaacc gggcgttccc gcgtccggtc gcgcgtgtgc gcgaagagtg 1066081 aacagggtgt cagcaagcgc ggacgatcgg gcggccggcg ctcgtccagc tcgcgacctc 1066141 gtcagggttg cgttcggccc aggtgtggtg gcgttgggca tcatcgccgc ggtgacgctg 1066201 ctccaattgc tgatcgccaa tagcgacatg accggtgcgt ggggcgccat cgccagcatg 1066261 tggctgggcg tgcacctggt gccgatctcg atcggtggcc gcgcactggg cgtcatgccg 1066321 ctgttgccgg tcctgttgat ggtgtgggcc accgcgcgca gcacggcgcg ggccacatcc 1066381 ccacagtcgt cagggctcgt tgttcgctgg gtcgtcgcgt cggccctggg cggaccgctg 1066441 ctgatggcgg cgattgccct ggcggtcatt cacgacgcgt catcagtggt caccgagctg 1066501 cagacgccca gcgccctgcg cgcgttcact agtgtgctgg ttgtgcattc cgttggggcc 1066561 gcgaccgggg tgtggtcccg ggtaggtcga cgggcgctag ccgccacggc actgcccgat 1066621 tggctgcatg attcgatgcg tgccgccgcc gctggggtgc tggcgttgct cgggctttcc 1066681 ggcgtggtga cggcggggtc gctggttgtg cattgggcga cgatgcaaga gctctacggg 1066741 atcaccgatt cgatattcgg ccagttcagc ctcactgtac tttcggtgct ttacgcaccc 1066801 aacgtcatcg tcggcacctc ggccatcgcg gttgggtcca gtgctcacat tggcttcgcg 1066861 acgttcagtt cgtttgcagt tttgggcggc gatatcccgg cactgccgat cctggccgcg 1066921 gccccgacgc cgccgctcgg cccggcatgg gttgccttac tcattgtggg tgcttcgtcg 1066981 ggtgtggcgg tcggtcagca gtgcgcccgc cgcgccctgc cgtttgttgc ggctatggcc 1067041 aagctgctgg tcgctgccgt tgccggggca ttggtaatgg cggttctggg ttacggcggt 1067101 ggcggccggc tgggcaattt cggcgatgtc ggcgtggacg agggcgcctt ggtgttgggc 1067161 gtgctcttct ggtttacgtt cgtaggatgg gtcacggtgg tgattgccgg cgggatcagc 1067221 cgccgcccca agcggctccg gccggccccg ccggtcgagc tggacgccga tgaatcttcg 1067281 ccaccggtag acatgttcga cggggcagcg agcgagcagc cgcccgcttc ggtcgcggaa 1067341 gacgtcccgc ctagccacga cgacatcgcc aacggcctca aggcccctac tgccgacgac 1067401 gaggcgctgc ccttgtccga cgaaccgccg ccgcgggccg actaatctgc ggttggtgag 1067461 gccgcaactg tctgaggcct ttactcacgg tactgagtct gcactgggat gcaggctggt 1067521 ggtgctcaca cgctttgagg agccagacta ggctcgccgt gtgcaggaac cgcttcgtgt 1067581 acccccgagt gcacctgcgc ggctggtagt actcgcgtct ggcaccggtt cgttgctgag 1067641 atctctactc gatgccgctg tcggcgacta cccggcacgg gtagtcgccg ttggtgtgga 1067701 tcgcgaatgc cgggccgccg aaatcgccgc ggaagcatcg gtgccggtgt tcaccgttcg 1067761 gctcgccgac caccccagtc gcgatgcctg ggacgtcgcc atcaccgccg ccaccgcagc 1067821 ccatgagccc gacctcgtcg tttctgcggg ctttatgaga atccttggac cgcagttcct 1067881 ttcacgattc tacgggcgca ccctcaacac ccacccggcg ctgctgccgg ccttccccgg 1067941 cacgcacggt gtcgctgacg cgctggccta cggggtgaag gtcaccggcg ctacggtgca 1068001 cctggtagac gctggcacgg acaccgggcc aatactggcg cagcaacctg tgccggtgct 1068061 cgacggtgac gacgaagaga ctttgcatga acgaatcaag gtcaccgaac gacggctgtt 1068121 ggtagcggcg gtggccgcac tggccaccca tggcgtgacg gtggtcggac gaacagcgac 1068181 gatgggacga aaggtaacca taggatgagc accgacgacg gaagacggcc gatccgccgt 1068241 gcgctgatca gcgtgtacga caagaccggg ctggtagacc tggcacaggg cctgagcgcg 1068301 gccggcgtcg agatcatctc gactgggtca acggccaaga ccattgccga caccgggatt 1068361 ccggtgaccc ccgtggagca gctgaccggc tttcccgagg tgctcgatgg ccgggtcaag 1068421 acactgcacc cacgagtgca tgccgggctg ctggctgacc tgcgcaagtc cgagcacgcc 1068481 gcggccctcg agcaactcgg gatcgaggct ttcgaactcg ttgtagtcaa cttgtatccg 1068541 ttcagccaga ccgtcgaatc cggcgccagt gtcgacgact gcgtcgagca gattgatatc 1068601 ggcgggccgg cgatggtgcg ggccgccgcc aaaaaccatc ccagcgcggc ggtggtcacc 1068661 gatccgcttg ggtaccatgg cgtgcttgcc gcactgcgcg ccggcggatt caccctcgcc 1068721 gagcgcaaaa ggctggcgtc gttagcgttt cagcatatag ccgagtacga catcgccgtc 1068781 gcgagctgga tgcaacagac cctagcgccc gaacatcctg ttgccgcctt tccgcagtgg 1068841 ttcggccgaa gctggcgccg cgtggcgatg ctgcgctacg gcgagaaccc gcaccaacag 1068901 gccgctctct acggcgaccc caccgcctgg ccggggctgg cccaggccga gcaactgcac 1068961 ggaaaagaca tgtcctacaa caacttcacc gatgcggacg cagcctggcg ggccgccttc 1069021 gaccacgaac aaacgtgcgt ggcgatcatc aagcacgcca acccgtgcgg catcgcaatc 1069081 tcgtccgttt cggtcgccga cgcgcatcgc aaggctcacg aatgcgatcc gctgagcgcc 1069141 tacggcgggg tcatcgccgc caataccgag gtcagtgtcg aaatggccga gtatgtgagc 1069201 accatcttca ccgaagtcat cgtcgcgcct ggctacgccc ccggggccct cgatgtgctg 1069261 gcccgcaaga agaacatccg ggtgctggta gccgccgagc cactggccgg tggcagcgag 1069321 ttgcgtccga tcagcggtgg actgctgata cagcagagcg accagcttga cgcgcacggt 1069381 gacaacccgg cgaactggac cttggcgacc gggtcacctg cggaccccgc gacgctgacc 1069441 gacctggtct tcgcgtggcg agcctgccgt gcggtcaagt cgaacgcgat agtgatagct 1069501 gccgacggcg ccaccgtcgg cgtcgggatg ggtcaggtca accgtgtcga cgccgcccgg 1069561 ttggccgtcg aacgcggcgg cgagcgggtt cgcggcgcgg tggcagcctc ggatgcgttc 1069621 ttcccctttc ccgacggcct ggaaacgttg gccgccgcgg gggtcaccgc ggtcgtccac 1069681 cccggtggct cggtgcgcga cgaggaagtg accgaagcgg cggccaaggc cggtgtcacc 1069741 ctatatctca ccggggcgcg gcacttcgcg cactgaggcc gctggccgcg acagtgaaat 1069801 ccacgacgtg acacgccgga aacgcgtcgt gacattcact ctcgtggcca gaagaaagac 1069861 ggcgtcgtag cgtggaacgg tgatgtcacc cagtaacctg ccccgcaccg tgggcgagct 1069921 gcgtgccgcc ggtcatcggg aacggggggt caagcaggaa atccgggaaa atctgctgac 1069981 cgcgctggcc gacggcgaca acgtctggcc gggcatcctg ggtttcgacg acaccgtgat 1070041 tccccaggtg gagcgggcct tgatcgccgg tcacgacttt gtcctgctcg gcgaacgcgg 1070101 ccagggcaag acccggctgc tgcgcgcact cgcgggtctg ctggacgagt ggacgccggt 1070161 gatcgccggc gccgaactgg gcgagcaccc ctacacgccg atcacgccgg agtcgatccg 1070221 gcgggccgcg cagctcggcg acgacctacc ggtggcgtgg aagcaccgca gcgagcgcta 1070281 caccgagaag ctggccaccc ccgacaccag cgtcgccgac ctggtcggcg acgtcgaccc 1070341 gatcaaggtt gccgagggcc gcagcctcgg ggatcccgaa accatcgcct acgggctcat 1070401 cccgcgggcg caccgcggca tcgtcgcggt caacgagctg cccgacctcg ccgaacgcat 1070461 ccaggtgtcg atgctcaacg tcatggagga gcgcgacatc caggtccgcg gctacacgct 1070521 gcggctgccg ctggatgtgt tggtggtcgc cagcgccaac cccgaggact acaccaaccg 1070581 tggccgcatc atcacgccca tcaaggaccg gttcggcgcc gagatccgca cccactaccc 1070641 actggagctg gaggcggaga tgggcgtcat cgtccaggag gcgcacctga gtgcacaggt 1070701 gtccgactac ctgatgcagg tgctcgcgcg gtttgcccgt tacctgcgag aatcccgctc 1070761 gatcgatcag cgctccgggg tgtcggcgcg gtttgccatc gcagcggccg aaaccgtggc 1070821 ggctgccgcc cggcaccgcg gggcggtgct gggggagaca gacccggtgg cccgggtggt 1070881 cgatttgggc acggtgatcg acgtgctgcg cggcaagctg gaattcgagt ccggcgagga 1070941 gggccgcgaa caggcggtgc tcgagcatct gttgcgtcgc gccaccgccg ataccgcgtc 1071001 ccgggtgctg ggcggtatcg acgttggctc gttggtgacc gcggtcgagg gcggttcggc 1071061 ggtgacgacg ggcgagcggg tctcggccaa ggatgtgctg gcggcggtgc cgggcctgcc 1071121 ggtggtggac aggatcgcgc gcaagctggg cgccgaatcc gagggggagc gtgccgcggc 1071181 actggaactg gcgttggagg cgctatacct ggccaagcgc gttgacaagg tctgcgggga 1071241 gggccagacc gtctatggct aagtctgatg gtgacgaccc gctgcgcccg gcttcgccgc 1071301 gcttgcgatc gtcacgacgg cactcgctac gctactcggc gtacaccggc gggcccgacc 1071361 cgctggcccc gccggtggat ctgcgggatg cgctggaaca gattggccaa gacgtcatgg 1071421 cgggcgcctc gccgcgccgg gcgctgtccg agctgctgcg gcggggcacc aggaacctga 1071481 ccggcgccga ccggctggcg gccgaggtga accgccgccg acgggagttg ttgcgccgca 1071541 acaacttaga tggcaccttg caggagatca agaagctgct cgacgaggcc gtgctggccg 1071601 aacgcaagga gctggcccgc gcgctagacg acgacgcccg cttcgccgag ctgcagctgg 1071661 acgcgcttcc ggcctcgccg gccaaggcag tacaggagct ggccgaatac cgctggcgca 1071721 gcgggcaggc ccgcgaaaag tatgagcaga tcaaggattt gctcggccgt gagctgctcg 1071781 accaacgctt tgccggcatg aagcaggcgc ttgccggtgc caccgacgac gatcgccggc 1071841 gggtcaccga gatgctcgac gacctcaacg acctgttgga taagcacgcc cgcggtgaag 1071901 atacgcagcg ggacttcgac gagttcatga ccaagcacgg cgagttcttc ccggagaacc 1071961 cgcgcaacgt cgaggagctg ctggactcgc tggccaagcg agccgccgcc gcgcagcggt 1072021 tccgcaacag cctgagccag gaacagcggg acgagctgga cgcgttggcg cagcaggcat 1072081 ttggctctcc ggcgttgatg cgggcgctgg accgtttgga tgcgcatctg caggccgccc 1072141 gtcccggcga agactggacc ggctcgcagc agttctccgg tgataatccg ttcggcatgg 1072201 gggaaggcac ccaggcgctg gccgacattg ccgagctgga gcagctggcc gagcagctgt 1072261 cgcagagcta tccgggcgcc agcatggacg atgtcgacct ggacgcgctg gcccgtcagc 1072321 tcggcgacca ggccgccgtc gacgcccgga cgctggctga attggaacgc gcgctggtca 1072381 atcagggctt cctggaccgc ggttccgacg gccagtggcg gctctcgccg aaggccatgc 1072441 gccgcctcgg cgaaacggcg ttacgcgatg tggcgcaaca actttccggg cgccacggcg 1072501 agcgtgatca ccggcgtgcc ggcgccgcgg gcgagctgac cggtgcgacg cggccctggc 1072561 agttcggcga caccgagccg tggcacgtcg cccgcacgct gaccaatgcc gtgctgcgcc 1072621 aagccgcggc cgtgcatgac cgcatccgga tcaccgtcga ggatgtcgag gtcgccgaga 1072681 ccgaaacgcg cacccaggcc gctgttgcgt tgttggtgga cacctcgttt tcgatggtga 1072741 tggagaatcg ctggttgccg atgaagcgca cggcgctggc gctgcaccac ctggtgtgca 1072801 cccggttccg ctcggatgcc ttgcagatca tcgcgtttgg gcgctacgcc cgcacggtga 1072861 cggcggccga gctgacgggg ttggcgggtg tctacgagca gggcaccaac ctgcaccatg 1072921 cgctcgcgct ggccggccgg cacctgcgcc ggcacgcagg cgcccagccc gtggtgctgg 1072981 tggtgaccga cggcgagccg accgcccacc tggaggactt cgacggcgac ggtacgtcgg 1073041 tgttctttga ttacccgccc catccgcgca ccatcgccca caccgtgcgc gggtttgacg 1073101 acatggcgcg gctgggtgcg caggtgacga tcttccggtt gggcagtgac cccggtctgg 1073161 ctcggttcat tgaccaggtt gcgcgacggg tgcagggccg cgtggtggtg cccgatctcg 1073221 acgggctggg cgcggcggtg gtgggcgact acctgcgctt ccggcggcgc tagtttgttg 1073281 caatcatggt gctagcatcg tgctagcaat atgctaacat agtgcgatga agacgctgta 1073341 tctgcgcaat gtgccggacg acgtggtcga gcgactcgag cgcctcgccg aactcgccaa 1073401 gacgtcggtg tccgcggttg ctgtgcgtga gctcaccgag gcttctcgcc gcgccgacaa 1073461 tccggcgctt cttggggact tgcccgatat cggcatcgac acgaccgaac tgatcggtgg 1073521 tatcgacgcc gagcgcgccg gtcgatgatc gtcgttgacg cctcggccgc gctggccgcg 1073581 ctgctcaacg atggacaagc tcgacaattg atcgctgccg agcgcctgca tgtcccgcat 1073641 ctggtcgatt cggaaatcgc gagcgggctc cgcaggctag cgcagcggga tcggctgggc 1073701 gcggccgacg gacggcgggc cctccaaacg tggcgccgcc tcgcggtgac gcgttatccg 1073761 gtggtgggcc ttttcgagcg tatctgggaa atccgcgcga acctgtcggc atacgacgcc 1073821 agctatgtgg ccttggcgga agccctgaac tgtgcgctcg tcacagcgga tctgcggctc 1073881 agcgacaccg gccaagccca gtgtccgatt accgttgtgc ccaggtagcc gtggcacgga 1073941 tgttcgagga tccgtatatc acaacgcgat aggtcctgtt gacacaaggg aagcgcgggg 1074001 cgccgtcggc ggttcgtctc gtcgaaatgc gacaacaacg ccgtgcgcgg cacatcccag 1074061 tttgtgagac actgtgcgcg tgccctcgca gtggatgatc tcatcccggg taacggtagc 1074121 ctggaacatc gtcggctacc tcgtgtatgc ggccctggct tttgtcggcg ggtttgcggt 1074181 ttggttctcc ttattcttcg cgatggccac cgatggttgt cacgactcag cttgcgacgc 1074241 aagctatcac gtgttcccgg ccatggtcac catgtggatc ggagttggcg cggtcttgct 1074301 gctcaccttg gtggtcatgg ttcgcaactc gtcgcgaggc aacgtcgtga tcggatggcc 1074361 ttttgttggg ttgttggcgc ttggccttgt ctacgtggct gccgatgcgg tcttgcactg 1074421 atcgacgtgg ggttctgcgt cagtaggcgt cgcgggttcg gccgccgggg gatccgtaca 1074481 ggtacgggta gtgcacgtcg gggtcgttgg ccggtcgcat gttcagcggt ggcggcgcgg 1074541 tgcgccaggc ggccggggga tggcaaccgg tgttgtagga gtagccgagt gggccgcggt 1074601 tgtcgccatt gacgaggttt atcgtgacgc cgtcatcgcg gatttgagag taattcgggg 1074661 cgcccagcgg ctgtggcggg tcgccgggta cggagttgtt gggccgaaaa cccgccgcct 1074721 tgaacaccgg ggctagctcg gtgacaattt gcagccattg ctgcggagtg ggcgctggac 1074781 ggccaaagaa taactcgctt gcctcttgtc gaccgatggt gcgggtgaac gggtcgttgc 1074841 agccgttggt gagatggctc actgtgacgc ctgttgagaa ccgggtctgc ggtgaatatt 1074901 tcgcgatcat cgccctgatg gtggcgtcga ggttggcgag ctgctgctgg actgtctcaa 1074961 gatcgggtcg tccgttgacg attttctgcc ggcggtccag ctcgccccgc ccgggattgg 1075021 cgtaagggtc gaacgtattg ggtttgatac acccggccag cagtgcggcg atgccgagga 1075081 gcgccgcggt caggctgcgg gatgttcgct tcatgggtga tagttcgggt tgggtattgg 1075141 caatccgaac gggccgggcc cactgggcat cgtcggtggc ggcagcacag ggggtttgat 1075201 gaggtcgtcg ggcaatcccg ccagcacggc ggccaagttg taaccactca tccgaagctg 1075261 atcattgtct ccgttacgtg cgtactcaga gtgtccttat gctcgctcgt gaagttggcc 1075321 gtcgcccaag agcgggccgg gcgccaaacc ggtgttgacc gacagttgtg tcatgccggg 1075381 tacgtcctgg ggtgcggagc cgaatgcgcc gaattcgggg atggtattgg cgacatggtc 1075441 gttgacgccg atcatgtaga aggcatgccc tggctcaacg cccagctgcg acgcgtgcgt 1075501 gagctcggtg cccggtgaac cgtacaaaac gacgtcgctg accggtgccc cctgctgcag 1075561 cgccagactg gttaccaggg atccgtagga atgcccgaac gcggtgatgt gctgatcgct 1075621 gacattcgtg gtagcggcca agcctttgtc gaagcggttc aacggccccg cggcatcgcg 1075681 agccgaccag tcgtgcatca cgtctttgag gccgtccggc gcgtcatagc ccagccacgc 1075741 aatggatgcc accgcatcgt aatttggcca tccggctcgt tccctcagtt cggctgcctt 1075801 tgcgcgctga attccagctt ccttgaccat gtccccaacg ctcgaactca cccgcgtgtt 1075861 caggccgccc atcgtgacgc cgacgcgttc ggcgttgtcg acgtcgccaa ctcccacagc 1075921 cgccaacacc tttcgcgggt cactggcggt gtccaacaga atgaggctgg tgccgggatg 1075981 ggctgccaga gtatcccgca acgctcgcag atccgccagc ttgtcggtgt cggtgtgcca 1076041 gactccatct ctgctcaacc agccgttctg cagccgggtg agttctcgtt gcagcactga 1076101 aagattcagt tcgttgcgaa cggcgatggg aatgccgtcg cgattacgca gggtattggg 1076161 gaaccattgc ttgacccgat cctgctggcc cggggtcagc gaatgccacc accgcttgac 1076221 ctcctcaggg tcgctgtccg gcggcggcat ctgcggcatt gtgggtggcg catggctgag 1076281 ttgggcattg acctgctcgc gcgacaaggc cccggcggcg catcgaatcg ccgcggccag 1076341 atcctcatcg gcggtctcgg cgtcggccag cagacgtttg atgccctccg gattgcggtg 1076401 ttgaggatgg cctgctgatc ggccggggaa tacgacgaca agtcgggtgg tggcaacgcc 1076461 gtaccggtcg cgtaagcgat cgtcaggtga tgctcacgcg cggcatcgcg gatcacttgt 1076521 agccgcatct tgatcgcggc gacctcctcg gccgcctttt ccgcggcccg cgcgaccgct 1076581 tcacacgcgc cggcatggtg atcgagcagc accgttgtgt gatgggttgc tacctgtgcc 1076641 gcctcggcgg ccgcaccgcc aaagccgagc agccccatgg tgtcacgcag cgccgccgat 1076701 gcggtgcgtg tgccgtgcgc gcggtcgatc gcggcctgaa caccgtcctg atcgctgtgg 1076761 gatcccagcg ctcaatgtca gctaacgtca acgccgtcgc atcgggcgct tccaccgcct 1076821 gttggataaa ccgcggccag tgccgcggcg ttgtgttcct ccatctcggc gaacccgacc 1076881 gcggccagat gcatgccgta agaatggtcc ccgatgcggg ccgcgtgggc ggtgctggcc 1076941 tccgcccagc tatccagcaa tcccgacagc gcgcccgccg acgaaccgac ccatcccggc 1077001 cgcgcccctt cggccgcacc caggcagcag tggtgtgacg tcagcaaaga ctcgccgtgg 1077061 tcagcctgct gataaccgac ctgggacagg acttcaggaa tcgcccgcaa cgttctgcca 1077121 taccaactcg cttccacacg aaccaaactt tcggcggagt atggcacacg agcacattgc 1077181 gggcgattca cccgcatcga gctgaccggg cggcgcacct tgctatttgc ggctatttgc 1077241 gtggcttgcg gggtttgcgc ttgatgccca catcacccca cagcgagaag ccgcggatcc 1077301 tcaccgtcgg cacgccacgg gtgccctccc cgaccacctt gcggtcgaag ccgcccatca 1077361 ctcggtgacc gtggatctcc acgttgactt cgggtggcag cagaattgtc tgcgccccca 1077421 tgatcgagta cgcacggatg tccacctcgg tcgaggtgaa gtcggcgtag cgcagatcca 1077481 gcaccccgct gccccacaag gtgaacgtgg tcagcttctt cggcacgttc cagcggccgc 1077541 gtcgttcgaa tccgcccagt agcgccagca gcagcgtgga cggcgccgga ttgcattcgc 1077601 cacccctgcg cgggcctatc gccgcccccg gcagatcggc ccgcagccga tccagctcct 1077661 ggtaggtggt tgccgcatag gcccgcgcca gccggtcttc ataatcggtc agctgcaggc 1077721 ggccctgctc ggccgcgtag gccagcaact gcgcaatctg tatccggtcg gtgtccgacg 1077781 cacgcgcgga ctcgtcgcgc gagttcctcg cgtcacgctg cgccgagttg ctcatcgtcc 1077841 acgagcctac gacgtcaaga atttgcttca agaggtgttg gcgaaactgc aaatgttgcc 1077901 aggttcgact ccttgggtag cccaccccca gtggggtggg ataccatgaa cgggtgaggg 1077961 attaggggca agccatgagc aaggaattga ccgcaaagaa gcgcgcggcg ctgaaccggc 1078021 tgaagacggt tcggggccat cttgacggaa tcgttcggat gctggagtcc gacgcctact 1078081 gcgtggacgt gatgaagcag atttcagcgg ttcagtcctc gctggagcgg gccaaccggg 1078141 tgatgctgca caaccacttg gagacgtgct tttccacggc ggtgctggat ggtcatgggc 1078201 aagcggccat cgaagagctc attgatgccg tcaaattcac gccggcgctg accggtccac 1078261 acgcgcggct cggcggtgcc gcggtcggcg agtcggccac cgaggagccg atgccggatg 1078321 ccagcaacat gtgacgagcg ccggactccg gtgtttctcg ggacaacgac atacgaaagg 1078381 agcatccgcg atggtgtggc atggattcct agcgaaggcg gtacccaccg tggtcaccgg 1078441 cgcggtgggg gtcgcggcgt atgaggcgct gcgcaagatg gtggtgaagg ctccgctgcg 1078501 ggcggcaacc gtgtccgttg ccgcctgggg catacgctta gcacgtgaag ccgagcgcaa 1078561 ggccggggag agcgccgagc aagctcgact gatgttcgcc gacgtgctag ccgaagccag 1078621 cgagcgcgcc ggggaagaag ttccaccact ggcggtggcg ggttcggacg acggtcatga 1078681 ccactgacgt tctttctgac accgacgtct cgctgaaggt ggtctccaac gcgtcggggc 1078741 ggatgcgcgt gtgcgtcacc gggttcaatg tcgatgcggt tcgggccgtc gcgattgagg 1078801 agacggtctc ccaagtgacc ggggtgcacg ccgtgcacgc ctatccgcga acagcgtcgg 1078861 tggtgatctg gtactcgcca gagctcggtg acaccgccgc cgtgctgtcg gcgatcacca 1078921 aagcgcagca cgtcccggca gaattggtgc ccgcccgtgc cccgcactca gcgggtgtgc 1078981 gcggcgtggg cgtggtgcgg aaaatcaccg gcgggatccg ccgcatgcta agtcgcccgc 1079041 cgggcgtcga caagcccctg aaggcgtcgc gttgcggcgg ccgcccgcgc gggccggtcc 1079101 gcgggagcgc ctcgtggccg ggcgagcaga accggcgcga gcggcggacg tggttgccgc 1079161 gggtgtggtt ggccttgccg ttggggctac tggcgctggg ttcgtcaatg ttcttcggtg 1079221 cttacccgtg ggcggggtgg ctggccttcg ccgcgacgct gccggtgcaa ttcgtggccg 1079281 ggtggccgat tctgcggggg gcggtgcaac aggcgcgggc gttgacctcg aacatggaca 1079341 cgctgatcgc gctgggtacg ctgaccgcgt ttgtctactc cacgtatcag ttgtttgccg 1079401 gtggacctct gttcttcgac acctcggcgc tgatcatcgc gttcgtggtg ttgggccgcc 1079461 atctcgaggc cagagcaacc ggaaaagcgt ccgaggcgat cagcaagctg ctggagctgg 1079521 gcgccaagga agccacgctg cttgtcgacg gccaagagct cctggtgccg gtcgatcagg 1079581 tccaagtcgg agacctggtg cgggtgcggc ccggagagaa gatcccggtc gacggtgagg 1079641 tcaccgatgg gcgcgccgcc gtcgacgagt cgatgctcac cggcgaatcc gtcccggtcg 1079701 agaagacggc gggtgaccgc gttgccggcg caacggtcaa cctcgacggg ctgttgaccg 1079761 tgcgcgccac cgccgtcggg gcagacaccg cgctggcgca gattgtgcga ctggtcgagc 1079821 aggcacaggg cgacaaggcg ccggtgcagc ggctggccga ccgggtttcg gcggtgtttg 1079881 tcccggccgt catcggcgtt gccgtcgcga cctttgcggg atggaccctg atcgccgcca 1079941 acccggtggc tggtatgacc gccgcggtcg cggtgctgat catcgcgtgc ccgtgtgcgt 1080001 tgggcctggc tacccccacg gccatcatgg tcggcaccgg ccggggcgcc gaactgggga 1080061 tcctggtcaa gggaggcgag gtgctggaag cgtcgaagaa gatcgacacc gtggtgttcg 1080121 acaagaccgg caccctcacc cgcgcccgga tgcgggtgac cgatgtgatt gccggccagc 1080181 ggcgccagcc tgatcaggtg ctgcggctcg ccgccgcggt cgaatcgggc tccgaacacc 1080241 ccatcggtgc ggcgatcgtt gccgctgcac acgagcgcgg gttggcgata ccggccgcca 1080301 atgcgttcac cgccgtcgcc gggcacgggg tgcgggcgca ggtcaacggc gggccggtgg 1080361 tggtcggacg gcgcaagctc gtcgacgaac aacatttggt tctgcccgac cacctcgctg 1080421 cggcggccgt ggagcaggaa gagcgcggcc gcaccgcggt gttcgtcggc caagacggcc 1080481 aggttgtggg tgtgctcgcg gtagcggaca cggtcaaaga cgacgccgcg gacgtggtcg 1080541 gtcggctgca cgccatgggg ctacaggtag ccatgatcac cggcgacaac gcccgcacgg 1080601 ctgccgcgat cgccaagcag gtcggcatcg agaaggtgct ggccgaggtg ttgccgcagg 1080661 acaaggtagc tgaggttcgg cggctgcagg accagggccg ggtggtcgcg atggtgggtg 1080721 acggcgtcaa cgacgcgccc gccttggtac aagccgatct gggcattgcg atcggcaccg 1080781 gtaccgacgt ggccatcgag gcctccgaca tcacgctaat gtccggccgg ctcgatggtg 1080841 tcgtgcgcgc gatcgaactc tccaggcaga ccctgcgcac catctaccag aatctcggct 1080901 gggccttcgg ctacaacacc gccgcgatcc cactggccgc gctgggcgcg ctgaacccgg 1080961 tcgtggcggg cgcggcgatg gggttctcct cggtcagcgt ggtgaccaac tcactgcggt 1081021 tacgccgctt cggccgcgac ggccgaaccg catgatccat gacctgatgc ttcgttgggt 1081081 ggttaccggc ctgttcgtgc tgaccgccgc cgaatgtggt ctggcaatca tcgccaaacg 1081141 ccgaccgtgg acgttgatcg tcaaccacgg gttgcatttc gcaatggccg ttgcgatggc 1081201 ggtgatggcc tggccgtggg gcgcgcgggt tccgacgacg ggacctgcgg tatttttctt 1081261 gctggcggcc gtgtggtttg gggcgacggc cgtcgttgcg gtccgcggga ccgctacgcg 1081321 tggactgtac ggatatcacg gcttgatgat gctggccaca gcctggatgt atgccgccat 1081381 gaatcctcgt ttgctccctg tccgctcgtg caccgaatac gccaccgagc cggatgggtc 1081441 aatgccggct atggacatga ctgcgatgaa catgccgccg aatagcgggt cacccatctg 1081501 gttcagcgcg gtgaactgga tcggtacggt cggcttcgcg gttgcggcgg ttttctgggc 1081561 atgcaggttt gtcatggagc ggcggcagga ggcgacccag tccaggttgc cgggcagcat 1081621 aggccaagcg atgatggcgg ccggtatggc gatgttgttc ttcgccatgc tgtttccggt 1081681 ttgaggcagt tcgccgcctg tgtgtccgaa ccgcaaggta attcggaata ggctgttccc 1081741 aacctcctgc gtcgtaggcg ggggcccggc gggcctagtc agcggcccgc atcgtcgccg 1081801 gctggaccca gcggggcgga cgtttctgca ggaaggccag catcccttcg cgcgcttcgt 1081861 cggagacgaa cagcctggcc gactcctcgg tcaggcgttc ggcgtcgcgg tcgaaccctt 1081921 cgagcacggc ggccgtggtc agcgccttcg acgcggccag gccttgtggc gagccgcggc 1081981 ccacgtcggc gaccagcgcg gccaccgcgg cgtccacgtc gtcggccgcc atggtgatca 1082041 gtccgatgtc ggcggcttcg cgggcgccga acttctcgcc ggtcaggtaa tagcgggccg 1082101 cggcgcgcgg cgaaagcttg ggcagcagcg tcagcgagat gatcgccggt gccaccccga 1082161 tccgtgcctc ggtcagcgcg aacgtgcttt ccggtccggc gaccaccatg tcgcacgcac 1082221 cgaccaggcc gaacccgccg gcccgcacat gcccgttgat ggcgccgacc accggcagcg 1082281 gcgactcgac gatggcgcgc aacagcgccg tcatttcccg cgcccgcgcc accgccatcc 1082341 ggtacggatc accaccacca ccgccggcct cgctgaggtc cgcgccggcg cagaacgttc 1082401 cgccggtatg ccccagcacg accagccgca ccgccggatc tgcttcggcc gcactcagcc 1082461 cttgatgtag ttggctgacc agcgtgctcg acagcgcgtt gcggttgtgc ggagagttca 1082521 gtgtcagcct ggcgaagggg ccgccgcagg cggccgggcc agcgtagtcg acggggctgt 1082581 ccatcagtag gaccggggca gacccagcga tgtctgcgca acgaagttca gcaccatctc 1082641 gcggctgatc ggggcgatcc gcgccaagcg ggccgaggtc atcatcgctg ccacgccata 1082701 ttccttggtg aggccgttgc cgcccatcga ctgtacggcc tgatcgaccg cgcggctgga 1082761 tgcctcggcc gcagcgtatt tggccatgtt ggccgcctcg gccgcaccga agtcgtcacc 1082821 atggtcgtag agtgtggcgg ctttctgggt catcagcttg gcgagttcga cctcaatgtg 1082881 gcactgcgcc aacggatgtg ccaggccctg gtgcgcgccg atcggggtgg accacacctt 1082941 gcgggttttg acgtagtcga cggccctgcc gagtgcgaac cggcccatgc ccaccgcgct 1083001 agccgcaccc atgatgcgct cggggttcag gcccgcgaaa agctgtgcga tcgccgcgtc 1083061 ttcggctcca accagcgcat cggcgggtag ccggacgtcg tcgaggaaaa cctggaactg 1083121 gcgttcgggg ctgaccagct ccatctcgat cggggtgtag ctgaacccgg gagcgtcggt 1083181 gggcaccacg aacaacgcgg ggcgtagctt gccggttttg gcttcctcgc tgcggcccac 1083241 gaccagcacc gcctgcgcct ggtcgatgcc agaaataaag actttctggc ccttgatgat 1083301 ccagtcgctg ccgtcgcgac gcgcggtggt ggtgatcttg tgtgagttgg agccggcgtc 1083361 gggctcggtg atggcgaacg ccatggtcaa cgagccgtcg gcgatgcccg gcaaccagcg 1083421 cttcttctga tcgtcggtgc cgaacttggc gatgatggtt ccgttgatgg ccggtgacac 1083481 caccatcagc agcagcgccg agccggcggc ggccatctcc tccatcacca gcgacagttc 1083541 gtacatgcct gcgccgccgc cgccgtactc ttcgggcaga ttcaccccca aaaaaccgag 1083601 tttgcctgcc tcggcccata actcgctggt gtgttcgtgt ttgcgcgcct tgtccaggta 1083661 gtactcgtgg ccatagttgg ccacccaaga ggccaccgcc ttgcgcagcg cctgacgttc 1083721 ctcgctttcg ataaagctgg tgtctgtcac ggtgaatctc cttctgctgg gccattttga 1083781 ggtgcttcta ctcgtgcgag aatggcgcct acttcgacct gttgacccgt gttgacgctg 1083841 acgtgggtga gcacgccgtc ggcaggcgcg gcgatggtgt gttccatctt catggcctcc 1083901 agccagatca acggctgacc ggccgtgacc gtgtcgccaa cctcggcgcc gatccggatg 1083961 acgttgccgg gcatgggggc caccagcgag ccttgctcga cggccgagct cggctcgggg 1084021 aagcgtgaca gtgccaccag gtgaacgggt ccgcgcgccg agtcgacgta gacgtcgggg 1084081 ccgtggcggg caaccgtgaa gccgtgtgcg accccgtcct gggcgagcac cacctggtcc 1084141 acgtcagccg agaccagctg taccaccgga tcgccgggaa gcgccagacc cgttctggtg 1084201 aaccggtatt cgacgcggtg ttcggtgtcc gcgtcgtcac gataggtctt gacctgatag 1084261 cccgaggcca ggttgcgcca gccgctggga atcgagctga acacgcccgc gctcgcccga 1084321 ttgtgctcgg cgtcggccag cgcggcggcg atcgccgaca accggagggt cgcggtgtcg 1084381 gccagcggtg tcgacaactc ggccatgccg tgcgtgtcga aaaacccggt gtcggtggcg 1084441 ccgtcgagga acgccggatg acgcagcacg ttgaccaaga gctcacggtt ggtgcgcaga 1084501 ccgtgcagcc gggcgcgtac cagcgcatcg gccaacacaa gcgcggcctg ccggcgggtg 1084561 gcaccgtagg agacgacctt ggccagcatt gggtcgtagt ggatcgacac tgtggaaccg 1084621 tcgacgatcc cggaatccag ccggatgccg gtccgctgtc ccaacgagtc gaactgcgcc 1084681 cgaacccccg gaacctcaat cgtgtgcatc acgcctgcct gtggctgcca gccatgcgcg 1084741 ggatcctcgg cgtagaggcg ggcctcgatc gaatatccct gggcgggggg aggttcggtg 1084801 tcgagtcgcc cgcagtcggc aatcatgagc tgcagttcga ccagatccag cccggtggtc 1084861 tcttcggtga ccgggtgctc gacctgtagc cgggtgttca tctccaggaa gtagaactca 1084921 ccttcccggc caggtgagtc atcggcgagg aactccaccg tgcctgcccc ggtgtagccg 1084981 atcgcgctgg ccgccagccg ggccgcgtcg aacagcttgg cccgcatccc cggtacgcgt 1085041 tccaccagcg gcgacggtgc ctcttcgatg atcttctggt ggcggcgctg aatcgagcat 1085101 tcccgttccc cgaccgccca cacggtgcca tgggtgtcgg ccatgacttg cacttcgacg 1085161 tggtgcccgg tgggcaggta gcgctcgcag aatacggtcg ggtcgccgaa cgcggattgg 1085221 gcttcacgtc gcgcggcttc gacttcggcc ggcagggccg ataattcgtg aaccactcgc 1085281 atgccgcgac cgccaccgcc cgccgacgcc ttcaccagca ccggcagctg cgcggtggtg 1085341 acggcgtcgg ggtcgagttc ctcgagcacc ggcaccccgg cggcggccat cagcttcttg 1085401 gactcgattt tggagcccat cgcgcgcacc gcgtccaccg gtggcccgac ccaggttagg 1085461 ccggcctcct gcacggcggc cgcgaattcg gcgttctccg agaggaatcc gtagccggga 1085521 tgcaccgcgt cggctccggc tgcctgcgcg gccgcgatga tcgcctcggc gttcagatag 1085581 tcggtggtct gcggcagccg gacccgggcg tcggcctcgg cgacatgcgg tgccgcggca 1085641 tccgggtctg tgtagacggc gacggtgccg agccccagcc ggcggcaggt ggcgaacacc 1085701 cgccgggcga tctcgccgcg gttagcaacc aatactcgag tgattcccat cagcatcaca 1085761 tccggaagac gccgaagttc gacgtcccct tgatcgggcc attggcgatg gcggacaaac 1085821 acattcccag cacggtgcgg gtgtcgcgcg ggtcgatcac cccgtcgtcg taaagcatcc 1085881 cggacagcac caacggtagc gactcggctt cgatctggcc ctcgacggcg gcccgcatcg 1085941 ccgcgtcggc ggcttcgtcg acttgctgcc cgcgggcttc ggctgccgcc cgggccacga 1086001 tggacagcac gcccgacagc tgggcgccgc ccatcaccgc ggacttggcg ctgggccagg 1086061 cgaataggaa gcgcgggtcg taggcgcgcc cgcacatgcc gtagtgcccg gcgccgtagg 1086121 acgcgccgat cagcagcgag atgtgcggga cggtcgagtt ggacacggcg ttgatcatca 1086181 tcgagccatg cttgatcatc ccgccttcct cgtagtcctt gcccaccatg tagccggtgg 1086241 tgttgtgtaa gaacaacagc ggcgtgtcgg cccggttggc cagctggatg aactgggtgg 1086301 ccttctgtga ttcctcgctg aacagcacgc cgcgggcgtt ggccaggatg cccagcggat 1086361 agccgtgcaa ccgagcccag ccggtcacca gagacgaccc gtacagcggc ttgaattcgt 1086421 cgaactcgga gccatcgacg atgcgggcga tcacctcgcg cgggtcgaat gggatgcgca 1086481 gatccggggg cacgatgccg attagctcct cggcgtcgaa cagcggctcg gtcaccggag 1086541 cgggtgcggg tccctgtttg atccagttca gtcgcgccac gatgcggcgt ccgatgcgga 1086601 tcgcgtcgag ctcgtcgagc gcaaaatagt cggccaaacc cgatatgcgg gcgtgcattt 1086661 cggcgccgcc cagcgactcg tcgtcggact cttcgccggt ggccatcttc actagcggcg 1086721 ggccggccaa aaacaccttg gagcgttcct tgatcatcac cacgtgatcg gacatgccgg 1086781 ggacgtaggc accgcccgcg gtggagttgc cgaaaaccag cgcaatggtc gggatcccgg 1086841 ccgccgacag ccgggtcagg tcgcggaaca tctgtccgcc ggggatgaaa atctctttct 1086901 gggtgggcag atcggccccg ccggattcca ccagcgaaat gacgggaagc cggttttcga 1086961 aggcgatctg gttggcccgc agtatctttc gaagcgtcca cggattgctg gtgccgccct 1087021 tgaccgtcgg gtcgttggcg acgatcatgc attccacgcc gcagaccgcg ccgatgccgg 1087081 tgaccaggct ggcgccgatc tggaagttgc tgccgtaggc ggccagcggg ctcagctcca 1087141 ggaacgggga gtccgggtcg acgagcagct cgatgcgttc ccgtggtgtc aggttgccgc 1087201 gggcgtggtg ccggtcgacg tatttggggc caccgccggc gagcgccttg gccagttcgg 1087261 cgttgatctc gtcgagcttg ccgctcatcg tcgcggccgc ctcgtcgtag gcggaagcgt 1087321 tcgggtccag tgtggattgc agcacggtca cgattgatac cccagggttt tggcggccaa 1087381 agcggtcagt atttcggtgg tgccgcctcc gataccgagg attcgcatgt cccggtattg 1087441 gcgttcgact tcggattcgg ccatgtaacc catgccgccg aacagctgta cggcctggtt 1087501 ggcaacccac tccccggcct gcacggcggt gttcttggcg aaacacacct gcgcgatcag 1087561 gtcggtctcg ccggcgagct ggcgttccac cacatggtgc gcatagaccc gggcgacgtc 1087621 gatgcggcgg gccatctcgg ccagcgtgtt ctgcaccgac tggcgtgaaa tcagcggccg 1087681 accgaacgtc tcgcggtccc ggcaccactg cgcggtgagg tccaggcacc gctgggcgct 1087741 cgaatacgcc tgggcggcaa ggccgatgcg ctcggaaaca aatgcccggg cgatctgggt 1087801 gaagccgctg ttctcggcgc ccacgaggtt agtcgccggc acggccacgt cggtgtagca 1087861 cagctcggcg gtatccgagg aacgccagcc catcttgtcc agcttgcggg tcacctcaaa 1087921 gccgggggtg tccttttcca ccaccagcag cgaaaccccg gcggcaccgg gtccaccggt 1087981 tcgcaccgcg gtgaccacgt agtcggcccg cacgccggag gtgatgtagg tcttggcgcc 1088041 gttgatcacg taatggtcgc cgtcccgtac cgcgctggtc cgtagatgcc cgacgtcgga 1088101 gccgccgccg ggttcggtga tggccagcgc gccgatcttc tccccggcca aggtgggccg 1088161 cacgtacgtg gcgatcagcc gttcgtcgcc ggatgcgacc atgtgcggta cggcgatacc 1088221 gcaggtgaac agggacgcat acaccccgcc cggggcgccg gcctggtgca tctcctcgca 1088281 gatgatgacg gggtcggcgc cgtcaccgcc gccaccaccg accgcctcgg gaaagccggc 1088341 gcccagcagc ccggcggccc cggcgagccg gtgcaggccg cggggcaact cgccgatcct 1088401 ttcccactcg tcgacgtgcg gcaggatctc gcgctcggca aaggcgcgca ccgtttttcg 1088461 cagctgttgg cgctccggtg tggtccagat gttcacaaca gggtctccgg gatctcgacg 1088521 tggcggctgc gcagccactc acccagtccc ttggcctgcg ggtcgaagcg ggcctggtag 1088581 gcgacgccct ggccgaggat tgcctcgatg acgaagttca gtgcccgcag attcggcagc 1088641 acgtgacggg tgacgaccag gcctgccgtt tctggcagca gctccttgag tagctcgacg 1088701 gtcagcgtgt gcgccagcca gcgccactgc tcgtcggtgc gtacccacac gccgacgttg 1088761 gccgatccgc ccttgtcgcc gctgcgggcg ccagcgatca ggcccagcgg tacgcgccgg 1088821 gtcgggccag ccggcagcgg gtcgggcagc gccgggggat gtgccggcgc cagctccaac 1088881 gtctcagtgg cgcagggaat ctcggtgcgg gtgccgtcgg cgtgcacggc gatgtgcgcc 1088941 accttgccgg cgtcgacgta gccgggggtg aacacgccat acacctggcc gtcaccgggc 1089001 ggggcggtgg cggtgaaccc cgggtagctg gccagcgcca attcgaccgc ggccgaggag 1089061 aattgccgac ccacattggc agggtcggga tcgcgggcga cgcaggtgag cagcgcgctg 1089121 gcggtttctt cggtgtcggc gtcggggtgg tcggtgcggg ccagcgtcca ttgcagctca 1089181 gcgggtttga cggtcagcgc ggcctcgagc tggcgtcgca ccaagtcggc cttggcatcg 1089241 atgtccaggc cggtcagcac gaatgtcatg gcgttgcgga agccgccgat gctgttcagc 1089301 gacaccttgt aggtcggcgg cggcggttcg ccgatcacgc cgctaatgcg cactcgatcc 1089361 ggcccgtcgg gcgacagttc gacgctgtcc atccgggccg tcacatccgg gttggcatac 1089421 cgagcgcccg tgatctcgta gagcagctgc gcggtgatgg tgtcgacgct gaccaggccg 1089481 ccggtgccgt ggtgcttggt gatcaccgac gagccgtcgg cagcgatctc ggccagcggg 1089541 aagccggcgt gagtgaggtc gcctatctcg gtgaagaacg cgtagttgcc gccggtggcc 1089601 tggactccgc attcgatcac gtgcccggcc accacggcgc cggccagtcg gtggtagtcg 1089661 gtgcggcccc agccgaagtg cgcggccgcc gccccgacga ccaccgaggc gtcggtgacc 1089721 cggccggtga ccacgacgtc ggcgccgcgc tcgaagcagt cgacgatgcc ccatgcgccc 1089781 aggtaggcgt tggccgtcag tggcgtcccc agccccagtt cggccgcccg tggttgcagg 1089841 tcgtcgcctt ccacgtgggc gacctgcgcc ggaatgccca ggcgcgcggc cagcgcccgc 1089901 accgcgttgg ccagcccggc ggggttcagg ccaccggcgt tggtgacgat gcgcaccccg 1089961 cggtcatggg ccaggcccag gcagtcctcg agctgggcca ggaaggtctt cgcgtagccg 1090021 cgatcggggt ttttcatgcg gtcgcgaccg agaatcaaca tggtcagctc ggccaggtag 1090081 tcgccggtga gatagtccag ctcgccgccg gtcagcatct cgcgcatggc ggagaggcgg 1090141 tcgccgtaga agcccgagca gtttccgata cgcacggcac cacagtcagg gccatgcgat 1090201 tcctcccttg ggatcggcga cgctaccaac caaccggtag gttagcactg ccctgtttcg 1090261 cgacggagat cgcttcctga gtcgaagcgg cccggtctgc gccgtccatt ggagtagagt 1090321 ccgtttcgct acgggacgcc gggtgctttg ccggccccag gaggtcagcg ccatgtcctt 1090381 cgtggtcaca gcaccgccgg tgctcgcgtc ggcggcgtcg gatctgggcg gtatcgcgtc 1090441 catgatcagc gaggccaacg cgatggcagc ggtccgaacg acggcgttgg cgcccgccgc 1090501 cgccgacgag gtttcggcgg cgatcgcggc gctgttttcc agctacgcgc gggactatca 1090561 aacgctgagc gtccaggtga cggccttcca cgtgcagttc gcgcagacat tgaccaatgc 1090621 ggggcagctg tatgcggtcg tcgacgtcgg caatggcgtg ctgttgaaga ccgagcagca 1090681 ggtgctgggt gtgatcaatg cgcccaccca gacgttggtg ggtcgtccgc tgatcggcga 1090741 tggcacccac ggggcgccgg ggaccgggca gaacggtggg gcgggcggaa tcttgtgggg 1090801 caacggcggt aacggcgggt ccggggctcc cggacagccg ggcggccggg gcggtgatgc 1090861 cggcctgttc ggccacggcg gtcatggcgg tgtcgggggg ccgggcatcg ccggtgccgc 1090921 tggcaccgcg ggcctgcccg ggggcaacgg cgccaacggc ggaagcggcg gcatcggcgg 1090981 cgccggcggc gccggcggca acggcgggct gctattcggc aacggtggtg ccggcggcca 1091041 gggtggctcc ggcggacttg ggggctccgg cgggacgggc ggcgcgggca tggctgccgg 1091101 tcccgccggc ggcaccggcg gcatcggggg catcggcggc atcggcggcg cgggcggggt 1091161 cggcggccac ggctcggcgt tgttcggcca cgggggaatc aacggcgatg gcggtaccgg 1091221 cggcatgggt ggccagggcg gtgctggcgg caacggctgg gccgctgagg gcatcacggt 1091281 cggcattggt gagcaaggcg gccagggcgg cgacggggga gccggcggcg ccggcgggat 1091341 cggtggttcg gcgggtggga tcggcggcag ccagggtgcg ggtgggcacg gcggcgacgg 1091401 cggccagggc ggcgccggcg gtagtggcgg cgttggcggc ggcggcgcag gcgccggcgg 1091461 cgacggcggc gcgggcggca tcggcggcac tggcggtaac ggcagcatcg gcggggccgc 1091521 cggcaatggc ggtaacggcg gccgcggcgg cgccggtggc atggccaccg cgggaagtga 1091581 tggcggcaat ggcggcggcg gcggcaacgg cggcgtcggt gttggcagcg ccggaggggc 1091641 cggcggcacc ggcggtgacg gcggggcggc cggggcgggc ggcgcgccgg gccacggcta 1091701 cttccaacag cccgcgcccc aagggctgcc catcggaacc ggcgggaccg gcggcgaagg 1091761 cggtgccggc ggcgccggtg gagacggcgg gcagggcgac atcggcttcg atggcggccg 1091821 gggtggcgac ggcggcccgg gcggtggcgg cggcgccggc ggtgacggca gcggcacctt 1091881 caatgcccaa gccaacaacg gcggcgacgg tggtgccggc ggtgttgggg gagccggcgg 1091941 caccggcggc acgggtgggg tcggggccga cgggggtcgc gggggggact cgggccgcgg 1092001 cggcgacggc ggcaacgccg gccacggcgg cgccgcccaa ttctccggtc gcggcgccta 1092061 cggcggtgaa ggtggcagcg gcggcgccgg cggcaacgcc ggtggcgccg gcaccggtgg 1092121 caccgcgggc tccggcggtg ccggaggttt cggcggcaac ggtgccgatg gcggcaatgg 1092181 cggcaacggt ggcaacggcg gcttcggcgg aattaacggc acgttcggca ccaacggtgc 1092241 cggcggcacc ggcgggctcg gcaccctgct cggcggccac aacggcaaca tcggcctcaa 1092301 cggggccacc ggcggcatcg gcagcaccac gttgaccaac gcgaccgtac cgctgcagct 1092361 ggtgaatacc accgagccgg tggtattcat ctccttaaac ggcggccaaa tggtgcccgt 1092421 gctgctcgac accggatcca ccggtctggt catggacagc caattcctga cgcagaactt 1092481 cggccccgtc atcgggacgg gcaccgccgg ttacgccggc gggctgacct acaactacaa 1092541 cacctactca acgacggtgg atttcggcaa tggccttctc accctgccga ccagcgttaa 1092601 cgtcgtcacc tcgtcatcac cgggaaccct gggcaacttc ttgtcgagat ccggtgcggt 1092661 gggcgtcttg ggaatcgggc ccaacaacgg gttcccgggc accagctcca tcgttaccgc 1092721 gatgcccggc ctgctcaaca acggtgtgct catcgacgaa tcggcgggca tcctgcagtt 1092781 cggtcccaac acattaaccg gcggtatcac gatttctgga gcaccgattt ccaccgtggc 1092841 tgttcagatc gacaacgggc cgctgcaaca agctccggtg atgttcgact ccggcggcat 1092901 caacggaacc atcccgtcag ccctcgccag cctgccgtcc gggggattcg tgccggcggg 1092961 aacgaccatt tcggtctaca ccagcgacgg ccagacgctg ttgtactcct acaccaccac 1093021 cgcgacaaac accccatttg tcacctccgg cggcgtgatg aacaccgggc acgtcccctt 1093081 cgcgcagcaa ccgatatacg tctcctacag ccccaccgcc atcgggacga ccacctttaa 1093141 ctgacggccc ctccctggct cgtgataggg aaggggcgtc tgcagcgggc gttctcgatt 1093201 gtcgccgcgc tcatctgcgc gcggaagctc ataccaaaga ggaaggccca ccatggctgt 1093261 gcccacgcgc agaaagtcgc gcgcgaacac ccgaagccgg cgctcgcagt ggagggcccg 1093321 gccggacggg tgcgggccga acacaccggg cgggctggtg tcagctgatt accgacaccg 1093381 tgtcgccggc gaagttggtg acataaacct cgccggtgac ggggttgacc gccaccccgg 1093441 tcggagcggt gccgacggtg atgggggagc cggtgacggt gttggtggtc gggtcgatca 1093501 ccgacaccgt gttgctgtcg aagttggtca cgaagaccag gccggtgacg gggctgaccg 1093561 ccaccccgct tggaccgttg ccgatggtga tgggggagcc ggtgacggtg ttggtggcgg 1093621 ggttgatcac cgacaccgtg ccgctgccga aattggtgac gtagacgttg ccgcccgggt 1093681 tgaccgccac cccgtgcgga tcgttgaagc tggcgtgggt gatggtggtg acggcgccgg 1093741 cggccccacc ggcaccgccg accccacccg caccgccgat accgccgacc gggccgcggc 1093801 cggcaccgcc ggccgtgccg gcgcgggcga ggctgaccgc gccgccggtg ccgccggccc 1093861 caccgttgcc gatcaacccg gccgccccgc cggcgccgcc ggcctgtccg ggtgcccccg 1093921 acccgccgtt gccgccgttg ccccacagcc acccgccgtt accgccggct tgcccggtcc 1093981 cgtcgatccc gttcgcgccg tcgccgatca atgggcgccc ggtcagcgac tgaacgggtg 1094041 cgttgatcgc atcgagcacg ttctgcagcg gtgttgcgct ggccgcttcg gcgaccgcgt 1094101 aggtgctgcc agcttggctt aaggccagca cgaaccgttg ctgataggcc gcgacctgcg 1094161 cgctgatcgc ttgatagtgc tggccgtggc tgccgaacag cgcggcgatc gccgttgaca 1094221 cctcgtcttg ggcggcggcc aacacctggg tggtcgccgc cgccgcggtg ttggcggtgt 1094281 tgatcgccga gccgatccgc gctgcatcgg ccgcggctgt ggacactaac tgtggggcca 1094341 cgttgacaaa cgacatcgaa atcctcctga ccgccacgat gttgagatgc gggcggccca 1094401 ccgcctgtta ccgccgcggt gggtaaccgt ttattcggac gatccctgcc gttccacgcc 1094461 tgggcgcagg cgcaaaccgc accaacattg gtggaacgtg gtgcacactg cacctggggt 1094521 tctgccctca tcgtgtgtca gcaggcgaaa cccgcgcgga cgagaactcc tgcgttaagc 1094581 agcacaaatc gctgctcacg ctcaccggtc agcgcactga accggcccca tgtcgacgac 1094641 cggtgaggcg accgctcaac tcgtcggcgt caactcggcc attgccaccc tggtcgccga 1094701 ttcctgtccc acagccccac caccatcggg gcgacaaccg tgaactgacg gtcacgcccg 1094761 ggcccaaccc cggcccggaa ttgggccggg ccgtcttcaa ccggtatcct ccacgtcatt 1094821 gtcgacgcga ttgtcgccgc gcccacctgc gtgcggaagc ccataccaaa agaggaaggc 1094881 ccaccatggc tgtgcccaag cgcagaaagt cgcgctcgaa tacccgaagc cggcgctcgc 1094941 agtggaaggc cgccaagacc gagctggtcg gtgtgaccgt cgccggtcac gcccacaagg 1095001 tgcctcggcg cttgctcaag gccgcccggc tcggcctcat cgatttcgat aagcgctgac 1095061 gcgccggcgg ccgacgatca tatggccgcc gaacacaccg agcgcgccgg ctctccggtg 1095121 atcaccgaca ccgtgtcgtc gagagagtta gtgacgtaga ccacgccggt gacggggttg 1095181 accgccaccc ctgtcgggtc gagtccgacg gggatggggg agccggtgac ggtgttggtg 1095241 gccgggtcga tcaccgacac cgtgttgctg aactggttgg tgacgtagat gttgccgcct 1095301 gggttgaccg ccaccccata cgcaccggta ccgacgggga tggagccggt gacggtgttg 1095361 gtgttcgggt cgatcaccga caccgtgttg ctgtcgaagt tggtcacgaa gaccaggccg 1095421 gtgacggggc tgaccgccac cccgcttgga ccgttgccgt cggtgatgga gccggtgacg 1095481 gtgttggtga ccgggtcgat caccgacacc gtgttgctgc cctggttggt gacgtagatg 1095541 ttgccgcccg ggttgaccgc caccccgtgc ggatcgttga agctggcgtg ggtgatggtg 1095601 gtgacggcgc cggcggcccc accggcaccg ccgaccccac ccgcaccgcc gataccgccg 1095661 gccgggccgc cgccggcacc gccggcggtg ccggcgcggg cgaggctgac cgcgccgccg 1095721 gtgccgccgg tcccgccgtc cccgccgtgt ccacccacac cgattaaccc gccgtgacca 1095781 ccaaccccgc cggtgccacc gtcaccgccg gccacaccga aggttgtgcc ggctccgccg 1095841 gccccgccga caccaccggc cccgccgttg ccgaacagcc atccaccggc gccgccggct 1095901 ccgccgttcg cgccggcctc aaagggtagg ccctggccgc cagctccgcc ggccccaccg 1095961 ttgccgatca acccggccgc accgccggcc ccgccggcct gcccgggtgc ccccgacccg 1096021 ccgttgccgc cgttgcccca cagccacccg ccgttaccgc cggcttgccc ggtcccgtcg 1096081 atcccgttcg cgccgtcgcc gatcaatggg cgcccggtca gcgactgaac gggtgcgttg 1096141 atcgcatcga gcacgttctg cagcggtgtt gcgctggccg cttcggcgac cgcgtaggtg 1096201 ctgctagctt ggcttaaggc cagcacgaac cgttcctggt aggccgcgac ctgcgcgctg 1096261 atcgcttgat agtgctggcc gtggctgccg aacagcgccg cgatcgccgt tgacacctcg 1096321 tcgtgggcgg cggccaacac ctgggtggtc gccgccgccg cggtgttggc ggtgttgatc 1096381 gccgagccga tccgcgccgc atcggccgcg gctgtggaca ctaactgtgg ggccacgttg 1096441 acaaacgaca tcgaaatcct cctgaccgcg acgatgttga gatgcgggcg gcccaccgcc 1096501 tgttacccct gcggtgggta accgtttatt cggacgatcc ctgccgttcc acgcctgggc 1096561 gcaggcacaa accgcaccaa cattggtgga acgtggtgca cactgcacct ggggttctgc 1096621 cctcatcgtg tgtcagcagg cgaaacccgc gcggacgaga actcttccgc caagcagcac 1096681 aaatcgccct actcttgacc accaaacaaa acccgtccat ggggccaatg tggctgatgt 1096741 ggctaaacct cgtcgaacaa acccgcatac cacggcgcgc ctctcaggcc agtctcaggc 1096801 gctgcgacga cactggtgtc cgtgcgaatt cttgtcgttg acgacgatcg tgcggtgcgc 1096861 gagtcgctgc gccggtcgct ttccttcaat ggctattcgg tcgaactggc ccacgacggg 1096921 gttgaggcgc tcgacatgat tgccagcgat cgccccgacg cgttggtcct ggatgtcatg 1096981 atgccgcggc tggacggcct cgaggtgtgc cgtcagctcc gcggcaccgg cgacgacctg 1097041 ccgattctgg tgctgaccgc gcgcgactcg gtgtccgagc gggtggccgg gctggacgcc 1097101 ggtgccgacg actacctacc aaagccgttc gccctcgaag agctgctggc acggatgcgg 1097161 gcgctgctgc gccgcaccaa gcccgaggat gccgccgagt cgatggccat gaggttctcc 1097221 gacctgacgc tggacccggt aacccgcgaa gtcaaccgtg gacagcgccg gatcagcctg 1097281 acccgcaccg aatttgcatt gctggagatg ctgatcgcca atccgcggcg agtgctgacg 1097341 cgcagccgta tcctggaaga ggtatgggga ttcgactttc ccacctcggg caacgcgctg 1097401 gaagtctacg tcgggtatct acgccgcaag accgaggccg acggcgagcc gcggctgatc 1097461 cacactgtgc gcggagtggg ttacgtgcta cgtgaaacac caccctgatg tggtggttcc 1097521 gccgccgaga ccgggcgccg ctgcgcgcca ccagctcatt atccctgcgg tggcgggtca 1097581 tgctgctggc gatgtccatg gtcgcgatgg tggttgtgct gatgtcgttc gccgtctatg 1097641 cggtgatctc ggccgcgctc tacagcgaca tcgacaacca actgcagagc cgggcgcaac 1097701 tgctcatcgc cagtggctcg ctggcagctg atccgggtaa ggcaatcgag ggtaccgcct 1097761 attcggatgt caacgcgatg ctggtcaacc ccggccagtc catctacacc gctcaacagc 1097821 cgggccagac gctgccggtc ggtgctgccg agaaggcggt gatccgtggc gagttgttca 1097881 tgtcgcggcg caccaccgcc gaccaacggg tgcttgccat ccgtctgacc aacggtagtt 1097941 cgctgctgat ctccaaaagt ctcaagccca ccgaagcagt catgaacaag ctgcgttggg 1098001 tgctattgat cgtgggtggg atcggggtgg cggtcgccgc ggtggccggg gggatggtca 1098061 cccgggccgg gctgaggccg gtgggccgcc tcaccgaagc ggccgagcgg gtggcgcgaa 1098121 ccgacgacct gcggcccatc cccgtcttcg gcagcgacga attggccagg ctgacagagg 1098181 cattcaattt aatgctgcgg gcgctggccg agtcacggga acggcaggca aggctggtta 1098241 ccgacgccgg acatgaattg cgtaccccgc taacgtcgct gcgcaccaat gtcgaactct 1098301 tgatggcctc gatggccccg ggggctccgc ggctacccaa gcaggagatg gtcgacctgc 1098361 gtgccgatgt gctggctcaa atcgaggaat tgtccacact ggtaggcgat ttggtggacc 1098421 tgtcccgagg cgacgccgga gaagtggtgc acgagccggt cgacatggct gacgtcgtcg 1098481 accgcagcct ggagcgggtc aggcggcggc gcaacgatat ccttttcgac gtcgaggtga 1098541 ttgggtggca ggtttatggc gataccgctg gattgtcgcg gatggcgctt aacctgatgg 1098601 acaacgccgc gaagtggagc ccgccgggcg gccacgtggg tgtcaggctg agccagctcg 1098661 acgcgtcgca cgctgagctg gtggtttccg accgcggccc gggcattccc gtgcaggagc 1098721 gccgtctggt gtttgaacgg ttttaccggt cggcatcggc acgggcgttg ccgggttcgg 1098781 gcctcgggtt ggcgatcgtc aaacaggtgg tgctcaacca cggcggattg ctgcgcatcg 1098841 aagacaccga cccaggcggc cagccccctg gaacgtcgat ttacgtgctg ctccccggcc 1098901 gtcggatgcc gattccgcag cttcccggtg cgacggctgg cgctcggagc acggacatcg 1098961 agaactctcg gggttcggcg aacgttatct cagtggaatc tcagtccacg cgcgcaacct 1099021 agttgtgcag ttactgttga aagccacacc catgccagtc cacgcatggc caagttggcc 1099081 cgagtagtgg gcctagtaca ggaagagcaa cctagcgaca tgacgaatca cccacggtat 1099141 tcgccaccgc cgcagcagcc gggaacccca ggttatgctc aggggcagca gcaaacgtac 1099201 agccagcagt tcgactggcg ttacccaccg tccccgcccc cgcagccaac ccagtaccgt 1099261 caaccctacg aggcgttggg tggtacccgg ccgggtctga tacctggcgt gattccgacc 1099321 atgacgcccc ctcctgggat ggttcgccaa cgccctcgtg caggcatgtt ggccatcggc 1099381 gcggtgacga tagcggtggt gtccgccggc atcggcggcg cggccgcatc cctggtcggg 1099441 ttcaaccggg cacccgccgg ccccagcggc ggcccagtgg ctgccagcgc ggcgccaagc 1099501 atccccgcag caaacatgcc gccggggtcg gtcgaacagg tggcggccaa ggtggtgccc 1099561 agtgtcgtca tgttggaaac cgatctgggc cgccagtcgg aggagggctc cggcatcatt 1099621 ctgtctgccg aggggctgat cttgaccaac aaccacgtga tcgcggcggc cgccaagcct 1099681 cccctgggca gtccgccgcc gaaaacgacg gtaaccttct ctgacgggcg gaccgcaccc 1099741 ttcacggtgg tgggggctga ccccaccagt gatatcgccg tcgtccgtgt tcagggcgtc 1099801 tccgggctca ccccgatctc cctgggttcc tcctcggacc tgagggtcgg tcagccggtg 1099861 ctggcgatcg ggtcgccgct cggtttggag ggcaccgtga ccacggggat cgtcagcgct 1099921 ctcaaccgtc cagtgtcgac gaccggcgag gccggcaacc agaacaccgt gctggacgcc 1099981 attcagaccg acgccgcgat caaccccggt aactccgggg gcgcgctggt gaacatgaac 1100041 gctcaactcg tcggagtcaa ctcggccatt gccacgctgg gcgcggactc agccgatgcg 1100101 cagagcggct cgatcggtct cggttttgcg attccagtcg accaggccaa gcgcatcgcc 1100161 gacgagttga tcagcaccgg caaggcgtca catgcctccc tgggtgtgca ggtgaccaat 1100221 gacaaagaca ccctgggcgc caagatcgtc gaagtagtgg ccggtggtgc tgccgcgaac 1100281 gctggagtgc cgaagggcgt cgttgtcacc aaggtcgacg accgcccgat caacagcgcg 1100341 gacgcgttgg ttgccgccgt gcggtccaaa gcgccgggcg ccacggtggc gctaaccttt 1100401 caggatccct cgggcggtag ccgcacagtg caagtcaccc tcggcaaggc ggagcagtga 1100461 tgaaggtcgc cgcgcagtgt tcaaagctcg gatatacggt ggcacccatg gaacagcgtg 1100521 cggagttggt ggttggccgg gcacttgtcg tcgtcgttga cgatcgcacg gcgcacggcg 1100581 atgaagacca cagcgggccg cttgtcaccg agctgctcac cgaggccggg tttgttgtcg 1100641 acggcgtggt ggcggtgtcg gccgacgagg tcgagatccg aaatgcgctg aacacagcgg 1100701 tgatcggcgg ggtggacctg gtggtgtcgg tcggcgggac cggggtgacg cctcgcgatg 1100761 tcaccccgga agccacccgc gacattctgg accgcgagat cctcggtatc gccgaggcca 1100821 tccgcgcgtc cgggctgtcc gcgggaatcg tcgacgccgg gttgtcgcgc ggcctggcgg 1100881 gtgtctccgg cagcacgctg gtggtcaacc tcgcgggttc gcgttatgcg gtgcgcgatg 1100941 gaatggcgac gctgaatccg ctagcggcac agatcatcgg gcagttgtcg agcttggaga 1101001 tctgaatccg gatcgagtgt cgggctattg cgattctgtg ctcgcgcgag gcccgtcggt 1101061 tggcgatggt gtcccacggc cgccgtgcct ccccggcgag tccccgttcg tttgcgcgag 1101121 cagatcgcgg atttcggtga gcagcacgac ttgggtgtcg cccggctgct cgacctcccc 1101181 cttcttgcgt agtgtgttgt agggcagcac gactaggaag tacaccgcga acgcgatcag 1101241 gaaaaagttg atcgctgccg acaacaagac gttcaagtca atggtctgac caccgccgat 1101301 accgatccgc aagatgccga cgtcggactg tgcgttgacg ccgatccggt tgatcagcgg 1101361 cgtaatgatg ctgtcggtga acttggtgac caacgccgtg aacgctgtgc cgattaccac 1101421 cgcgacagcc aggtcgacga tattaccccg cgcgagaaac tccttgaatc ctttgagcat 1101481 gcgatgtcct ttctgcagtc ggcggccggc agtccgcgag tggaacacct agaaaaacta 1101541 gaccaggtgg tgtcaatggc cacgacgctg ggatcgccgt tgccatgggg agctgacgct 1101601 gccgggatcc ggtgctgttg tttgttgacg ggatgccctt gacttcgctg accgtggtgt 1101661 gcgcgtaacc ggccggtcgg gaacgcggcg acggatggcg cggtggccag gacagtgatc 1101721 gagatgacat cacgccaaca acgccttcag ctgtgagcga tccgggctag actaccgccg 1101781 aaatatccaa caaaggacct acatgaaccg gcaacctatc gttcagctga gtaacttgag 1101841 ctggacattc cgagaaggcg aaacccgacg acaagtccta gaccacatca ccttcgattt 1101901 cgagcccggt gagtttgtcg cgctgctggg gcaaagtgga agtggtaaaa gcactttgct 1101961 gaacctcatc agtggcatag aaaagcccac cacaggtgac gtcacaatta atgggttcgc 1102021 tatcactcag aaaaccgagc gagaccggac gttgttccgg cgcgatcaga ttggcatcgt 1102081 ctttcaattt ttcaacctga ttcccactct taccgtgttg gaaaatatta cgctgcctca 1102141 ggaactggcc ggagtttctc agaggaaagc ggccgtggtc gctcgtgacc ttctcgaaaa 1102201 agtgggcatg gccgaccgtg aacgcacctt tcccgataaa ctctccggcg gagaacaaca 1102261 acgggtcgct atttccagag cgttggcgca taatcccatg ctggtgttag ccgatgagcc 1102321 gaccggcaac ctggactccg ataccgggga taaagtcttg gatgttctgc ttgatctcac 1102381 ccgccaagca ggtaaaacct taatcatggc tacgcatagc ccgtcgatga cgcagcatgc 1102441 cgaccgggta gtcaacttac agggcggcag gttgatacct gccgtgaacc gagaaaatca 1102501 aaccgaccag ccggccagca cgatcctatt gcccacgtca tatgaatgac caagctcccg 1102561 ttgcttatgc accactatgg cgcacggcgt ggcgtcggct gcgtcagcgg ccgtttcaat 1102621 atattctgct ggtcctggga attgcgctag gcgttgccat gatcgtggct atcgatgtat 1102681 ccagtaattc ggcgcaacgt gccttcgatc tctctgccgc ggccatcacc ggaaaatcta 1102741 ctcaccggct ggtcagtggc cccgccgggg tggaccaaca gctttatgtc gatctgcgcc 1102801 gacacgggta cgatttttcc gctccggtaa tcgaaggcta tgtgttggcc cgcggactgg 1102861 gaaaccgagc tatgcagttc atgggcaccg acccatttgc ggagtcagct tttcgctcgc 1102921 ctttatggtc caaccaaaat atcgccgagt tgggtggctt tttgactcga cccaacggtg 1102981 tcgtgttaag ccgacaagtg gcacagaagt atggcttggc tgtgggcgat cgcattgctc 1103041 tgcaagtgaa aggtgcgcct accacagtaa ccctggtggg attgctgaca cctgcagatg 1103101 aagttagcaa tcaaaaattg tccgacctta tcattgctga tatttccacg gcccaagagt 1103161 tgttccatat gcccggaaga ctgagccaca tcgatttgat catcaaagat gaggccactg 1103221 caacacgcat ccaacaaaga ctgccggccg gtgtgcgtat ggaaacgtcg gatacccaac 1103281 gggacaccgt caaacagatg acggacgctt ttacggtcaa tttaaccgct ctcagtttga 1103341 ttgccttgtt ggtgggtatc tttttaatct acaataccgt gacatttaat gtcgtgcaac 1103401 ggcgaccgtt tttcgccata ttgcgctgtt tgggtgtaac ccgagagcag ttattttggc 1103461 tgataatgac ggaatccctc gttgccgggc tgattggtac gggcttgggc ctcttgattg 1103521 gaatttggct cggcgaaggc ttgatcggcc tggtgactca aaccatcaat gatttctatt 1103581 ttgtcatcaa tgttcgcaat gtgtccgtct ccgccgaaag cttgttgaag gggctgatca 1103641 tcggcatctt tgccgccatg ttagccacac tgccaccggc tatagaagcg atgcgcaccg 1103701 tccctgccag cacattgcgg cgctcctccc tggaaagcaa gataaccaag ctcatgccgt 1103761 ggttgtgggt ggcgtggttt ggtttgggta gctttggtgt attgatgctg tggttgccgg 1103821 gcaacaacct ggttgtggcc tttgtcggtc tctttagtgt gctgattgcc ctggcgctta 1103881 ttgccccgcc gctgacccgg tttgtaatgt tgcgcttagc tcctggctta ggacggctgc 1103941 tcggtccaat aggtcgaatg gcgccacgca atattgtgcg ctcgttgagt cgcacctcta 1104001 tcgccatcgc cgccctgatg atggccgtgt ccttgatggt aggcgtctcc atatcggtgg 1104061 ggtcgtttcg acagacgctg gccaattggc tagaggtgac tttgaagtcg gatgtctatg 1104121 tgtctccgcc gaccttaaca tccggtcgcc ccagcggtaa tctgcctgtg gatgccgtcc 1104181 ggaatataag caaatggcca ggagtgcgtg acgcagttat ggctcggtat agttccgttt 1104241 ttgccccgga ctgggggcgt gaggtggaac taatggcggt gtcgggtgat atttccgacg 1104301 gcaagcgacc atataggtgg atcgacggca ataaagacac gctctggcca cgtttcttgg 1104361 cggggaaagg ggtgatgcta tcggagccaa tggtatcgcg acaacacttg cagatgccgc 1104421 caaggccgat cacgctaatg acggattcgg ggccacaaac gttccccgtt ctggcggttt 1104481 tctctgacta cacctcagat caaggtgtga ttttgatgga tcgcgccagt tatcgggccc 1104541 attggcagga tgatgacgtg acgaccatgt ttcttttttt ggcatcgggt gcgaatagcg 1104601 gtgccttgat agatcaacta caagccgcgt tcgcgggtcg ggaagacatt gttattcaat 1104661 cgactcatag tgtccgcgaa gcatcaatgt tcatatttga tcgtagtttt accattacca 1104721 tcgcgttgca actggtggcc acggtggtgg cttttattgg cgtactgagc gcgctgatga 1104781 gtttggaatt ggaccgggct catgagttgg gtgtttttcg cgccattggc atgactaccc 1104841 gccaattatg gaagctgatg ttcattgaga ccggcctaat gggcgggatg gccggcttga 1104901 tggccttgcc aactggttgt attctagcgt ggattcttgt ccgcattatc aatgtccgct 1104961 cattcggctg gaccttgcag atgcactttg agtcggcgca ttttcttcga gccctgttgg 1105021 tagcggtggt ggccgccctg gcggcgggta tgtaccccgc ttggcgtttg ggacggatga 1105081 cgattcgcac ggcgattcgt gaggaatgac ggtacatgag aaaagcagga ttgaccggtg 1105141 ttgtactggt tctgacgctg acgctggtgg ctttctggtg gtggcaacgt ccgcgaacga 1105201 atgctgtggc tgctgactct ttagttggcg ttttggtcga tgagaataac gccggatatt 1105261 ccttggccac agtgccggga gccgttcggt ttccccggga tttgggtcct cattacgatt 1105321 accagacgga atggtggtat tacaccggta atctggaaac tgctgacggt cggcttttcg 1105381 gctaccagct tacttttttc cgcagggctc tcgcaccacc cggcgagggg gtcgccatag 1105441 cggatgcttc ttcatggcgc acgacccagg tctatatggc ccacttcgcg ataagtgata 1105501 tttcgaacag gggcttttat ccggctgaga aattcagtcg gcaggcgttg ggtttggctg 1105561 gtgctagctc ggagccgtat gcggtgtggc tagacgattg gtatgcgcgt gaatccaaca 1105621 acaattcggt gcaattgttt gctcgaactc agaacacggt gttggatttg acattgacgc 1105681 aaacgctgcc gcctatcttg caaggaaatg ctgggttaag tgtgaaaggc gcgcaaccgg 1105741 gaaacgcgtc caactactac tcgttagttc gtcaagaatc gcggggcact gtcagtgtta 1105801 atggcgacac attcatggtt agtggtttga gctggaaaga tcatgagtac atgaccagtg 1105861 cgctggcccc tgaagatgtg ggttgggatt ggttcgggct ccaattttac aatggcaccg 1105921 ctttgatgct ttttcagatt cgacaggcgg atgggagtgt gacccgattt tccagcggta 1105981 cctttgttgc cggggatggt ggcgtgatcc ctctcgagtc gtccgatttc cgcatcaaga 1106041 cgactgatcg ttggaccagt gaccagagtg gcgccaccta tccgattgca tgggaaatcg 1106101 aaattgaacg gataggtttg acgctgcgcg gggccgcatt aatggctaat caagaactgc 1106161 ggttatcgag gacttactgg gaaggggcgg ttgcccttga gggtcgttat caaggaatgc 1106221 cgatcagtgg tcggggatac gttgaaatga ccggctatgt acaacggctg tcttgaagtc 1106281 gggtaattgc cggtgattct tggtttagag gctctcgaat ggtcgtcggg cagttgtgat 1106341 atcgctgcaa accctagagt acttattcgt cgttgtgtca acaggtagtt gctggggtgt 1106401 gtcgctagtc gcacgcagat atcgcgtggt cgatcaatgt cgcaagggct cggcgaggtt 1106461 ggcggtcagg caaatagggg agctcctctc gcgcctgtgc ggcataggcg gctaccacat 1106521 tcttggcctt tcctatgccc ggtgagcaac gcagcagtgt gagggcttcg gcgacgtggt 1106581 cgtcgtggat cggtccggcc agcaactcac gcagccggct tgtgtcgggt gtctgctcac 1106641 gcagcgcgta gagcatcggc agcgtgtgga cagcttggcc aaggtcggcg cccgatagcg 1106701 tagcggagtc accggagatg gcgatgatgt cgcgcgagat ctcaaacgca gcaccgatca 1106761 tgcgccccaa gcgcgctacg cggcggatct gctcttcggc ggcgccggag agtgccgctc 1106821 cgagctgtcc ggatgctgcg atgagagagc cggtcttctc gtgcacgact cggaggtaat 1106881 gctcgatcgt gtcgatatgc gaggcggggc cccgggtcgc gcgcatctgc ccggtgatca 1106941 gctcggcgaa cgcctcggcg acgaccgcga aggcctcggg gtccagccgc gaggctagct 1107001 gtgaggccgt cgcgaatcgg tagtcaccgg cgaggattgc gaagttgttg gtccagcgtg 1107061 tgttgtcgct aggtgtcttg cggctcatgt cggactcatc cacgactctg tcgtgacaaa 1107121 gcgtccccag gtgcatcaac tcgatggctg cccccgcgac cgtgacctcc catccgtcgg 1107181 ggtcggagcc cagttgcgcc gcaagcaccg tgaaaagcgg tctaaacggg gtgccgccgg 1107241 cgtcgacaag gtgcgccacc gtgtcgcgca taacctcgtc ggcctgggag agttcgctat 1107301 tgatcagctc tgtaatccgg gcaatcccgt cgtggacgtt ggcggtgaat tgcgggtcac 1107361 ccaggctgac tgccgggatc atgctcgtgg ccgtaggcat gcgcacaaca ttgacacgtg 1107421 tacaagataa ggtatggcgt gttcagtgca gggtcagcgt caccgtctga cccagcgccg 1107481 caccggctac cgtattggcc agccgggctg gcagcgccac caaaacgacc cggtcactat 1107541 cagccgcctg ggccttctgc tgggccgaga ccaggaccac gatggcgtcg gtggccaaga 1107601 gccgtagagc tgccggcgaa tcggttaccg gcgcggccag cacgtcgacc acatccccga 1107661 cccgaacaag gtcgaccaaa gcgctgtcag ccagatgcag cggcacgatg cgggcgtccg 1107721 ggccggcagt cgactcggcc aaccggctgc ccagtaaacg cacgtcggtg agcacctcgc 1107781 cacggcgtgt cgggctggcc agcgtcgaac ccaccactgc gtccaggtca gcttgcgacc 1107841 cgtcgggaag cgtggtggcc gaacgttttt ccagcctgac atcaccggga gtcaatgcgg 1107901 taccggggcg cagatcgtgc gcggccacca ccacctcgga gcgatcatcc tctggattgg 1107961 accgcagcgc cgcaacgccg gccagcatga ccagcccggc cgcggcgaag cgccgggccc 1108021 gcacggtccg ggtccagtcc gggcgcaaaa acgccgatat ccggctgacc aggctcggat 1108081 tcagggagga ttccgccaca ccgcaaacgg taggcgcagc gccgtgctag gcagcgccgg 1108141 tcagaaatcc ccttgtggat aacctctcaa ctcagacggc cgcggcggcg gttgtggagc 1108201 tggttgactt ctcggttgac cccgaagcct tgctttcact cgagccagaa ctccccgaac 1108261 tccccgaact cttcgtcgac tcgctggtcg aggatccgtt ggtctggctc ttggacttct 1108321 tgcccgactc gcggctgtcg gtgcggtaga agccggtgcc tttgaacacc acgccgaccg 1108381 cattgaacag cttgcgcagc cggccagaac accgctcgca cgtggtcagc gcatcgtcgg 1108441 tgaaggcctg cacaacatcg aagcggttgg cgcactgggt gcactcgtag ctgtaggttg 1108501 gcacaagaac ctccggaaat gtcactcggc gttagcactc taccgtctca agtgctagaa 1108561 ccgctaggtg agttccgtca ttccccgcac ggcagcgcga tcagcccgcg ctccggtgtg 1108621 agcgcatgag tcatgggtac gtcgtgcggc tccgacggca acacgtcgac cagttcgaca 1108681 gtgcgcacca ccgcgactag acgagcgtgc gggtcgcggc accgcagcga gcgatcgtag 1108741 aagccgcgac ctcggcccag tcgcacgccc tggcggtcga cagccagcgc cggcaccagc 1108801 accaagctgg cctgcgccag cgcggcttcc ggcagccaag gttcgggtgg ttcgagcagt 1108861 ccccagcgtg cgcgcgcgag tccgccggca cggtactcgc cccaccgcaa cggcaacggg 1108921 aggtcaccgc cggcggtgcg cgccaccggc aacagcactc gccccgcgcg gcgcagcaac 1108981 acatccaaca tctcgattga ccccggctcg ccgcctaccg gcacatacgc gcagacggtg 1109041 ctgtcgctgg tgaccatgcg ctccaggtgt ccacgcaaca tccgggcctc ggcggcgcgc 1109101 acgtcgtcgg caacgcggcg tcgggccgcc aggagctggt cgcgcaacgc cgacttgctc 1109161 gccatcgcca tgtcctcaac gatgacacag ccccggccgt cccgcgcgag cgccgggaca 1109221 gcgccaacga agaggcgggc aatcagcacg ctgcgggtta tcgtgtgaac gatgtcacgc 1109281 ccagaagtac taacgccgtt cacggcaatc gtcccggcag ccggcctggg tacgcgcttt 1109341 ctgccggcca ccaagacggt gcccaaggag ctgctgcccg tcgtcgacac tcccggtatc 1109401 gagctggtgg ccgccgaggc ggccgcggcc ggtgccgaac ggctggtgat cgtcacctcc 1109461 gagggtaagg acggggtggt cgcgcatttc gtggaagacc tggtgctgga gggcacgctc 1109521 gaggcccgag gcaagatcgc catgctggcc aaggtgcgtc gcgccccggc actgatcaag 1109581 gtcgaatccg tggtgcaggc cgagccgctg ggactgggac acgccatcgg ctgtgtggag 1109641 ccgacgctgt cgcccgacga agacgctgtc gcggtgctgc tgcctgacga cctggtgctg 1109701 ccgaccggcg tcctggagac gatgtcgaag gtgcgagcca gcaggggcgg caccgtgctg 1109761 tgtgctatcg aggtggcgcg cgaggagatc agtgcctacg gggttttcga tgtcgagccg 1109821 gtccccgatg gtgactacac cgacgatccc aacgtgctga aggtcagggg catggtcgaa 1109881 aagcccaagg ccgaaacggc gccgtcgagg tatgcggcgg ccggccgcta cgttctagac 1109941 cgtgccatct tcgatgcgtt acgccgcatc gaccagggtg caggcggtga agtgcagctc 1110001 accgatgcga tcgcgctgct gattgccgag ggccatcccg tccatgtcgt cgtccaccaa 1110061 gggtcccgac acgacctggg aaatccgggc gggtacctca aggctgcggt tgactttgca 1110121 ttggatcgtg acgactacgg cccggacttg cggcgatggt tggtggcgcg actgggtctg 1110181 acagagcagt agcctggcga cgatacggca cggacggttc cggggtgggg gatgcccggc 1110241 cccatggctc gacggaaagg cgggcgctgt gcgttctgtg gaggagcagc aggctcggat 1110301 atcggccgct gcggtagccc cgaggccgat acgcgttgcg atcgccgagg cgcagggatt 1110361 gatgtgcgcc gaagaagtgg tcaccgaacg tccaatgccc ggttttgatc aggccgccat 1110421 cgacggctac gcggtgcgca gtgtcgatgt ggccggtgtc ggtgataccg gtggtgtcca 1110481 agtctttgcc gaccacggcg atcttgacgg tcgcgacgtg ctgaccctac cggtgatggg 1110541 aaccatcgaa gccggagcgc gcaccctgag caggttgcag cctcgccaag cggtccgggt 1110601 gcagaccggc gcgccgcttc ccaccctggc cgatgcggtc ctgccgttgc ggtggaccga 1110661 tggcggaatg tctcgggtgc gggtgctgcg cggggcgccg tcgggcgcct acgtgcggcg 1110721 tgcgggcgac gacgtgcagc ccggtgatgt ggcggtgcgc gcggggacga tcatcggcgc 1110781 agcccaggtg gggttgctgg cggcggtcgg ccgtgaacgg gtgctggtgc accctcgtcc 1110841 gcggctgtcg gtgatggccg tcgggggcga gttggtcgac atctcgcgga ccccgggcaa 1110901 cgggcaggtt tatgacgtca actcctatgc cttggctgcg gcgggccggg atgcctgtgc 1110961 ggaggtgaac cgggttggca tcgtcagcaa cgaccctacg gaacttggcg aaatcgtcga 1111021 gggccagctc aatcgggctg aggtcgtggt gatcgccggc ggggtgggcg gtgcggcggc 1111081 agaagcggtc aggtcggtgc tttccgagct cggtgagatg gaggtcgtgc gggtcgccat 1111141 gcatccggga tccgtgcagg gcttcggaca gctcggccgt gatggtgtac cgacctttct 1111201 gctgccggcc aacccggtca gcgccctggt ggtcttcgag gtgatggttc ggccgctgat 1111261 ccggctgtcg ctgggtaaac ggcatccgat gcgacggatc gtgtcggcgc gcacgctgtc 1111321 gccgatcacg tcggtggccg ggcgcaaggg ctacctgcgt ggccagttga tgcgtgatca 1111381 ggacagcggc gagtacctgg tgcaggcgct gggcggcgct ccgggggcgt catcgcacct 1111441 gctcgcgacg cttgccgaag cgaactgtct ggttgtggtt cccaccgggg ccgagcagat 1111501 tcgcacgggt gagatcgtgg atgtcgcctt cctggctcag cacggctgag ccgaaccacg 1111561 gcgactctgg tgaacttatg gcgctcgaat ccccggcatc cgggatggcc gatggccgtc 1111621 gggccgctgc gggtctcggc aggcgtgatt cggctgcggc cggtgcggat gcgtgacggc 1111681 gtgcattgga gccggatccg gttggccgac cgtgcacatc ttgagccgtg ggagcccagc 1111741 gcggacggcg agtggaccgt ccggcacacg gttgctgcct ggccggcggt gtgttcgggt 1111801 ctgcgttcgg aggctcgcaa cggccgcatg ctgccgtacg tgatcgagct ggatgggcag 1111861 ttctgcggcc agttgaccat cggcaatgtc acccacgggg ccttgcggtc ggcctggatc 1111921 ggctattggg taccaagcgc ggccactggc ggaggggtgg ccaccggagc gttggcgttg 1111981 ggtctcgacc actgcttcgg tccggtcatg ctgcatcgag tcgaggccac cgtgcgcccg 1112041 gagaatgcgg ccagtcgcgc cgtgctggca aaggttggct tccgcgagga ggggctgttg 1112101 cgccgttacc ttgaggttga ccgggcatgg cgagaccatc tgttgatggc gatcaccgtc 1112161 gaagaggttt acgggtcggt ggcctcgacg ctggtccgtg ccgggcatgc cagctggccc 1112221 taacgcggaa tcgcaaccaa actgtgactg gcgcgacacg tgtggcgtgt ggtgcttgtg 1112281 agagatgaat tacaggtgtg taattgccct gggcgctttg acccggccgc gctggccaac 1112341 gatggggcct cgcggggatc ggaaccgaag agagcaggtc atcatgccaa gcatcccgca 1112401 gtcgttgttg tggatatcgc tcgtggtgct ctggctgttc gtgctggttc ccatgctgat 1112461 cagcaaacgt gatgccgttc ggcgcaccag cgatgtggct ttggcgactc gggtactcaa 1112521 cggtggcgct ggtgcgcgcc tgctcaagcg aggtggtccc gccgcgggac atcgctgggg 1112581 gtacctcccg cccgaagggc agggggacga cccggactgg aagccggagg aagactggcg 1112641 cgacgacccg gtcgaggacg ggttcgccga cgtcgagcat gacatcgacg aggaccagga 1112701 ggccgacgat gcgcgccgtc ggggtgcggt tgtcatgaag gttgccgctc cgcagaccgc 1112761 aggtgccgac gagccggact acttagacgt cgatgtggtc gaagaagact cggaggcgct 1112821 tccggtgggg gctggcgctg cggtcggcga gtccgccgac gaggccgatg ccgaagctgc 1112881 tgacggagtt gcgggccacg ccgacccgga ggccgacccg gtcgaatacg aatacgaata 1112941 cgaatacgtc gaggacacct gcggtttgga gctcgaggag gacgaccagg aagcgccacc 1113001 gaccgtcgca tccggcacgt cacggcggcg ccgattcgac accaagaccg ccgccgcggt 1113061 cagcgcccgc aagtacacct tccgcaaacg tgcgttgatc gtgatggcgg tgatcctggt 1113121 tggctctgcc gccgcggcct tcgagctgac cccggtcgcg tggtggatct gtggtagcgc 1113181 caccggtgtg acggtgctct acctggcata tttgcgtcgg caaacccgca tcgaggagaa 1113241 ggtgcgtcgg cggcggatgc agcggatcgc gcgggcgcgg ctcggtgtag agaacacccg 1113301 tgaccgcgag tacgatgtgg tgccgtcgcg gctgcgccgt ccgggcgcgg tggtcctgga 1113361 gatcgacgac gaggacccga tcttcacgca cctggagagc gcggccccga tacggaacta 1113421 cggctggccc agggacctgc cccgggcggt gggtcagtag ggcgcgcagt tcggccatcg 1113481 gcgccgctgc tggtagcctg ctaccgatca ggggctatgg cgcagttggt agcgcgactc 1113541 gttcgcatcg agtaggtcag gggttcgaat ccccttagct ccaccatcta atcagtagcc 1113601 atcggcagcc tcgttggctg tgccgccgcg gacgtggttg agacggcgag cacagccctc 1113661 ggggcaatcc tggcaggtcg caatgcggtg gtgccgccac ggtgtccacg tcgaggcgcc 1113721 ggccttgtgg taccggtaaa gtgctgtggc gaccgcgatc tggcgcgaag cctgatgaag 1113781 atcgaatatt cggctgaata ttcgctaaga catgtgtggc ggcgtccgat cctgtcacaa 1113841 cctgccccta gggtcggtgc atgagcacga aatactacct gcagaaggtc cctgtcgaag 1113901 ccgtccagcc gggcttttcg ctggccattc cacacgatgg cgactatcgc cttttccagg 1113961 tcgactgcac gcaaatgtgc cagcgaagtg gccagccggt gatgatcaga ctcatgtcgg 1114021 agtccgtcga tggtggccag ccgtgggtct tggaatatga agcgggcacg gcggtaatcc 1114081 ggcttctcgg tgtttgccag gccgcttcgt agggtggcgt gtgctcgcta accgggcttg 1114141 gcggcggcta caaacggcaa cgcgcgttgt gtctactgct cgacgtccac tagcccggcc 1114201 gaccgagaca ggttgacgaa ggcattccgg tcaaacatcg tgagtccgat gttgccggcg 1114261 gcggcgttcg gcgcgtagcg catcggcggg cattggccgg catagctggt gtggatcgtg 1114321 atccgcccgg ctggccgcag cactcgcacc ttctcgcggg cgatccggaa cggttccggc 1114381 atcagctaca gcgcgccgaa acaacaaaca gcatcgaatg tttcgtcgcc gaatggcacc 1114441 atgcgggcgt ggccgcggat atgacacgtc cgtggcccac ggttgtccag ggcggtgctg 1114501 gtcagcgtcg gcgcagagat gtcgaacccg accgcaagac ccccgtccgg tggatgtccg 1114561 gacagcggct cagtgaaatt acctggccca caaccgatat cgagcactct gtgggcgcgg 1114621 ccgaggtgca gagacaccgc ggcgcggtgc cgctcggttc gggtggtgat gcggctggca 1114681 aggtggaagg aggccggacg ccacaaccgt tcgtacaacc gtaagctggg cttggcgccg 1114741 ggccggattg gacgggatag ccgaattgac cggcgcacga gtcgaagatc ttgcggggat 1114801 ggacgtcttt cagggatgtc cggccgaggg tctggtgtca ttggcggcga gcgttcagcc 1114861 gttgcgggcc gctgccggcc aggtgctgct gcggcagggc gagccggcgg tttcgtttct 1114921 gcttatctcg tcgggtagcg cagaagtcag ccatgttggc gacgatggtg ttgcgatcat 1114981 cgctcgggcg ctgccgggca tgatcgtcgg cgaaatcgcg ctgctgcgcg atagcccgcg 1115041 cagcgcgacg gtcaccacca tcgagccgct gaccggctgg acgggtggcc gcggcgcttt 1115101 cgccacaatg gtgcacatcc ccggggtcgg tgagcgattg ctgcgcaccg ccaggcagcg 1115161 tctcgccgcc ttcgtctccc cgattccggt acggcttgcc gacgggactc aactgatgct 1115221 acgccccgtg ctgcccggtg accgcgagcg gaccgtgcac ggacacatcc agttctccgg 1115281 cgagacgctg tatcgacggt tcatgtcggc tcgtgttccc agtccggcgt tgatgcacta 1115341 cctgtcggaa gtcgactacg tcgaccactt cgtctgggtg gtgaccgacg gaagcgaccc 1115401 cgtagccgac gcgcgttttg tgcgggatga aaccgatccg acggtcgccg agatcgcgtt 1115461 cacggttgcc gacgcgtatc agggcagggg gattggaagc tttctcatcg gtgcgttgtc 1115521 cgtggccgcc cgggtcgacg gcgtcgaaag gtttgccgcg cgcatgcttt ccgacaatgt 1115581 gccgatgcga acgatcatgg accgctacgg ggcggtgtgg cagcgcgagg acgtcggagt 1115641 catcaccacc atgatcgatg tgccgggtcc gggtgagctg agcttggggc gcgagatggt 1115701 cgaccagatc aaccgggtag cccggcaagt gatcgaggcc gtcggctgat caccgacccc 1115761 gggtcggtgc gtccgccgct ggcaccgcag ttcgccgctg atctgctagt caaaacggtg 1115821 tcgacgttgc gcagctcagg ggctgcgttg ggtagattga ccacgatgcg caaggcggta 1115881 ctggcagtcg gatcggtgtg ctggcttgtc ggctgctcat caggggccag ctccaccacc 1115941 gcctcgaccg gcgacatcgc caaggtggcc gaagtgaagt cgggctttgg acctgaatac 1116001 accgtcaccg atgtcactcc cagggccatc gatcccgggt tcttttccgc ccgcaaactg 1116061 cccgacgggc tgagtttcga tccggcgaac tgtgcgcaag tggcggccgg gccccagctg 1116121 ccgaccgggt tgcagggcaa catggccgcc gtctccgccg agggcaacgg caaccggttc 1116181 gtcgtcatcg cggtggagac gtcccagccg ctgccggccc ccagccccgg gaaagactgc 1116241 agcaaggtga ctttttccgg gacgcagctg cggggcggca tcgaggtggt cgatgtaccg 1116301 cacatcgacg ggacacagac gctgggcgtg catcgcgtgt tgcaggcggt cgtcggcggg 1116361 tcagcgcgca ccggcgagct ctatgactat tccgctcggt tcggggacta ccaggtgatt 1116421 gtcatcgcca atccactggt aatccctgga cggccggttg cgcgggtcga tacgcaacgc 1116481 gcccgcgatc tgctcgtaca ggcggtggcc gcggtccggg gttgaccgag ttagcggacg 1116541 tcgcgcggcc ggaactggat gctcacgcgc ggacccgtcg gcgccgatgt cttgggcacc 1116601 gcatgctcga aggtgcgttg acacgatccg cccatcacca atagatcgcc atgcgccaac 1116661 ggcagtcgca acgatggacc gcggccacgc ggccgcagcg cgaagacgcg ggtggcgccg 1116721 aggctgacga tcgccaccat agtgtcctca gtgctgccgc gaccaatggt gtcgccatgc 1116781 caggcgacgc tgtcagagcc gtcgcggtag tagcacagcc cggcggtggt gaagggctca 1116841 cccagttcgc cgccgtagat gtcgttgagc cgccggcgca tccgcgccag ctgcggatgc 1116901 ggcggatctt cgatggtcag gtcgtgaaaa ctcaccagcc gcggcacatc gaccacccgg 1116961 tcgtacatct gacggcgctc ggctcgccac ggcaccgtcg acaacaacgc gtccagcagt 1117021 tcttcgccgc cggtcagcca gcccgaacgg atgtcgataa aggctccgtc gccgagctgt 1117081 cttcgctcgt tgtgctcgaa gagcgcgcct tgaaccgcga tcgccacgcc gccaagctta 1117141 tcgcacattc gttcgatggc gccgccccgg ctacggtttg acctgtgggt gtcgaattgg 1117201 ggtcaaattc cgaggtcggc gcgctaagag tggtcatcct gcaccgcccg ggggccgaac 1117261 tgcgccggct cacaccgcgc aacaccgacc agctgctgtt cgacggcctg ccctgggtat 1117321 cccgcgcgca ggacgagcac gacgaattcg ccgagctgct ggcttcccgc ggtgcggaag 1117381 tgctgttgct gtcggacctg ttgactgagg cactacatca cagcggggcc gcccgcatgc 1117441 aggggatcgc cgctgccgtc gacgcaccgc ggctgggact gccgctggcg caagagcttt 1117501 cggcctacct gcgtagtctc gacccaggca ggttggcgca tgtgctgacg gccggcatga 1117561 ccttcaacga gctcccgtcg gacacgcgga ccgacgtgtc gttggtgttg cgtatgcacc 1117621 atggcggaga cttcgtcatt gagccgttgc cgaacctggt gttcacccgc gactcgtcga 1117681 tatggatcgg gccgcgggtg gtgatcccgt cgctggcatt acgggcacgg gtgcgcgaag 1117741 cgtcgctgac cgacctcatc tatgctcatc acccgcggtt caccggtgtg cggcgtgcct 1117801 atgaatcgcg caccgctccg gtcgagggtg gcgacgtgtt gttgctcgcc ccgggtgtgg 1117861 tcgctgtcgg agtgggcgag cggactacac cagcaggcgc ggaagcattg gcgcgcagcc 1117921 tttttgacga tgatcttgcg cataccgtgc tcgccgtgcc gatcgctcag cagcgcgcgc 1117981 aaatgcatct ggacacggtg tgcacgatgg tcgacaccga tacgatggtg atgtacgcca 1118041 acgttgtcga cacgctcgag gcgttcacga tccagcgcac acccgacggc gtgaccatcg 1118101 gcgatgcggc cccgttcgcg gaggcggctg ccaaggcgat gggaatcgac aagctgcggg 1118161 taattcatac cggaatggac cccgtcgtcg ctgaacgcga acagtgggac gacggcaaca 1118221 acacgttggc gttggcgccc ggtgtcgttg tcgcctacga gcgcaacgta cagaccaacg 1118281 cccgcctgca ggacgcgggc atcgaagtgc ttaccatcgc cggctccgaa ttgggtaccg 1118341 gccgtggcgg gccccgctgc atgtcctgtc cggccgcccg cgatccgctt taggagtggc 1118401 gatttcggcg cctggcggcg ccgcagatca ccgccagctg ggcagccaga tctccaggtt 1118461 ccaggtctgt tgtgagattg gcagaccggt gagcaccgga tacagccacg caaagttcgt 1118521 caccacgagg gccacgtagc agcagacgac gatcagcccc agtgtgcgtc gttcggagcc 1118581 ctgaccgggg tgatagagga tatcgccgag aaccagcgaa atgcccatca ccagaaatgg 1118641 cgccatggtc gctgcgtaga agaagtacat ctgccggtcg atgtcggcga accacggcag 1118701 ccaaccggcg cagtagccga ccaggaccac cgcataacgc cagtcccggc gcacaaacat 1118761 acgccacccc gcgtatgcca ggactggcac cgccagccac cacatcgcgg gcgtgccgac 1118821 cagcatctcg gccttgacgc acgactgtgc gccgcagcct gcaacgtctt gctggtcgat 1118881 ggcgtacagc accggccgca acgacatggg ccaggtccac ggtttggatt cccaagggtg 1118941 gtagttgcct gcggaattcg tcaggcccgc gtggaagtgg aacgctttgg cggtgtagtg 1119001 ccagagcgag cgcacggcgt cgggcagcgg aacaaccgag ttgcgaccga ccgcttgacc 1119061 gaccgcatgc cgatcgatcg cggtctcgga cgcgaaccac ggagcgtagg tggccagata 1119121 gaccgcgaac gggatcaacc ccagcgcata cccgctggga agcacgtcac gccgcactgt 1119181 ccccagccac ggtctttgca cttggtactg acgtcgcgcc gccacgtcga acgccagcgc 1119241 catcgcgccg aagaacagca cgaagtacac gccggaccac ttggtggcgc aagccaatcc 1119301 cagcagcacc ccggcgccga accgccacca gcgcacaccc acccgcggtc cccacacggt 1119361 ggcggcgctg cggccggcca gcagagcgat gtgcatccgt tcgcgaacct gatcgcggtc 1119421 gacgatgagc gcgccgaacg ccgcgacgac gaagaacgtc aggaagccgt ccagcagcgc 1119481 ggtccgcgcg gtgacgaagc tgaccccgtc gcagatcagc agcaccccgg cgatggcgcc 1119541 gaccaatgtc gaccggctga tccgccgcac gatccgcacc accagcgcca ccaggaccac 1119601 acccagcagg gcgccggtga accgccagcc gaatccgttg taaccgaaga tggcctcccc 1119661 gatcgcgatc agctgcttac cgaccggcgg gtgaaccacc aggccgtacc cggggttgtc 1119721 ttccacccca tggttgttca gcacctgcca ggcctggggt gcgtaatgct tctcgtcgaa 1119781 gatgggggtg ccggcatcgg tcagcgagcc caggttcagg aaccgggtca ccgtggccag 1119841 cagcgtgatc aggccggtca cgatccagcc gcgtaaccgg tccaggggcc cgaaatccgc 1119901 gaccggcacc agcgggccgg ggctgacgac gggtaccaca ggctcctcgg ggcggtcctt 1119961 ggccaggaca caggattctg ggggccgggc ggtcatcggt gtcgatcgta ggctgtccgt 1120021 catgtcctct ggtcgcctgt tgctcggcgc caccccgctg ggccagccgt cggatgcgtc 1120081 accacgcctg gcggccgcgt tggccaccgc cgatgtggtg gcggccgagg acacccggcg 1120141 ggtgcggaaa ttggccaagg ctcttgacat ccggattggt ggacgggtgg tcagcctgtt 1120201 cgaccgggtg gaggcgttgc gcgtgacggc ccttctcgac gcgatcaata acggtgcgac 1120261 ggtgctggtg gtcagtgacg ccgggacccc ggtgatcagc gatcccggct atcggctggt 1120321 cgcggcgtgc atcgacgcgg gggtttcggt gacgtgttta cccgggccgt ccgcggtgac 1120381 caccgcgctg gtgatgtccg gtctgccggc ggagaagttc tgcttcgagg gtttcgcccc 1120441 gcgcaagggt gcggcgcgcc gggcctggct ggccgaactg gccgaggagc ggcgcacctg 1120501 tgttttcttc gaatccccgc gccggttggc tgcgtgcctt aacgatgccg tcgagcagct 1120561 cggtggtgcc cgtccggcgg cgatctgccg ggagctgacc aaggtgcatg aggaagtggt 1120621 gcgcggatcg cttgacgagt tggcgatctg ggcggccggt ggtgtgctcg gcgagatcac 1120681 cgtggtggtg gcgggcgccg ccccccacgc cgaactgtcg tcgctgatag cccaagtgga 1120741 ggagttcgtc gcggcgggta ttcgtgtcaa ggacgcctgc agcgaggtag cggcggcaca 1120801 tccgggggtg cgcacccgcc agctttacga cgcggtgctg caatcacggc gggaaaccgg 1120861 cgggccagcg cagccgtagt cggtcaggtt aggggataca caccccgatg ggaccgaatc 1120921 cgggtgtgca cagacgcgac gggagcgccg gcagcggagg cggtcccggc agtgggggag 1120981 gtgccggcaa tgcgggcacg gccggtaacg gcggcagacc cgctggcagt gccggcaggc 1121041 cggccgccag cgccggcagc gccgcagcca gcgccgcagg atccacgcct ggcagcgccg 1121101 ggaggcccgg cggtagacca cctgccgcca gcgccggcag cgccgcagcc agcgtcgccg 1121161 gatccacccc cgccagacca gccggcagac cagcggccgc cagcatcggc agaccacctg 1121221 ccaccagcgc cgtcagctcc gccggcgaca tccccgccag acccggcaaa ctcgtcggca 1121281 gaccggccgc ggccgccatc gccatcaggt cggtcggcga cacacctggc agactcggaa 1121341 aacccacgcc tggcaggccg gcggccgccg ccatcgcgag cagactcgcc ggcgtcacgc 1121401 ccggcagggc gggcagcccc acagccggca gagcggccgc cgccgattgc gccccgggca 1121461 gcagcaggga ggccaccgtg ctggctgttc cgcgggccgt cggcaggatg ccggaagact 1121521 ccagcgcgtt aaccgcaagc acgagatagg tgacggccac cgcgctcgcg gtcaccaccc 1121581 ccgccgccgt gcctcccaca cccaggacac cgttcacgac cgcggcagcg gtgttcaccc 1121641 cggtgattgg gtcgggtacg ccggggatgc cgatgcccgg gatgccgatg cccgggatgc 1121701 cgatgcccgg gatgccgacg ccaggtacgc tcggcgcggc caggttcggc agggccggtg 1121761 ggggaggcag ggccgggccg gccgcaccgg gaatgttggg taggccggga acggctggcg 1121821 ggcccaccct gggcacggcg gcagccgccg gtacacccgg ccgcggaacc aatcccggcg 1121881 cgatcgtgtc ggcgaccggc tcgaacggcg ccggcaccgc gtgctccgct gcggccggcc 1121941 ccgatggtgt cgcggtgtcg ggcatcagga ccgcagccag ccgatcgcac tgctggccgg 1122001 cgctgcaggt gccgcccgcg acccgctccg gggtggacaa cgtcaacggt gccaccgcca 1122061 gcgttccccc tatcgcggca gcagccgcgg tgcccacgat cgcgagtctc attacaaacc 1122121 cctctcgaac tcgacacgag atagacacgc gtcgatggcc cgagcttagg cgcacccggc 1122181 acaccatgtg ggcgttatgc caatttccgc cgcccgctgg gctaccgcac tttgctggct 1122241 aaccgagccg gggtggtgcg cgtggcggcc ggcagcccga cgatgggagc ggctttgtgc 1122301 aggcactccg cccattcggc gtccggatcg gagtcggcgg tgatcccgcc gccaacgccc 1122361 agcacggcgt tgcctgcggt atcgaattcg acggtgcgga ttgcgacgtt gagctcgcat 1122421 ccggcgaccg gtgacgccaa accgactgtg ccgcaatata tcccgcggcg atatcgctcc 1122481 cattgtgaaa tcaattggcg agcccgcagt ttaggtgtgc cggtgaccga ggccggcggg 1122541 aaggcggcgt cgagcagcgc tgacatcggt tcctcgagcg gaacccgcgc cgacaccgtg 1122601 gacaccaggt gccacactcc cggcgctggt cgcaccacca acagctcggg caccgtcacg 1122661 gtaccggtaa ccgctacccg gccgaggtcg ttgcggacca gatccacgat catgatgttc 1122721 tcggccacct ctttggccga tgcccgcagc gccgacggcg gggcgtccag cggcagcgtg 1122781 cccttgatcg ggctcgatgt caccacggac ccgcggcggc gcaggaatag ctccggggat 1122841 agcgatgcga cggctcccca cggtccggcg acaaaggcgg accgggacgg agcggtacga 1122901 ccgaacccgt cgatgaagaa gtccagcggg gatccggtga ccgtcccggc gaattgggtg 1122961 cacacgcacg cttgatagac ctcgcccgcg ccgatagctt ccagacacgc cagtaccccg 1123021 tcgcggtgcg ctgcccggtc ggccggttcc cagtcgatcc ggcatgccgg tgccggtctg 1123081 gcgaccgatg cccgagtggt cgccaacgcg ctggccagcc agtccgctat cggcgcaccg 1123141 gacaggctct cataccacca ctggccgtcg cggtcgcggc gcagcacgca atcggtccag 1123201 ccgccggcgg cctcggggat ccggtggggt cgcccgtcgg cgccggcgtc cgggtaggac 1123261 aggtagccga cccagccgcc gcccaccgcc ccggtggcat cgggcccgcc ggtgcccggc 1123321 gggcccgaga acacgtcgtc gccgctgacc ggttgtatag acacactcgg tgcgatcacc 1123381 gccagcgcac cgaaccattc gccggtcagc gccgccggtg gtggcaagtc gagtcgactg 1123441 gtggcgcggc cgaccgcccg cagcaccgca ggcgctccgc caagatcgcc gagtcggtcg 1123501 attcgcaccg ttctagcttg acagaactgt ggattttcgc agcgcaagtg gctgcgtggg 1123561 gatttcgtcc gcgtgctaag ctcccacgct aagttcaatc cgtgaccggc tccggtctcc 1123621 gtcccggggg gtgttgctgt gcgagcagcc aatgccaatg ccgtttctcg ctgaccgcga 1123681 gacgttgacg ctcggtgtga tcttgaagta gcgatggttt taagaagtag gaaaagcacg 1123741 ctcggcgttg tcgtgtgctt agcgctggtg ctcggtgggc cgctcaacgg ttgcagcagc 1123801 agcgcgagcc accgcggtcc actgaacgca atgggaagtc cggccatacc gtcgacggcg 1123861 caggagatac ccaacccgtt gcgcggtcag tacgaagacc tcatggaacc gctgtttccg 1123921 caggggaacc ccgcgcagca acgctatccg ccttggcccg cgtcctacga cgcgagtttg 1123981 cgagtctcct ggcggcagct gcagcctacg gatccgcgca ctctgccccc ggatgctccg 1124041 gacgaccgca agtacgactt cagcgtgatc gacaacgcgt tgaccaggct cgccgaccgc 1124101 ggcatgcggc tgacgctgcg ggtgtacgcc tacagctcgt gctgcaaggc ttcctatccg 1124161 gacggcacta acatcgcgat tcccgactgg gagcgcgcta tcgccagcac caacaccagt 1124221 tatccagggc cggcgaccga tccctcgacc ggggtggtgc aggtggtgcc gaatttcaac 1124281 gattcgacct atcttaacga ttttgcgcag ttgctcgccg cgcttggtcg ccgctacgac 1124341 ggtgacgagc gcctcagcgt gttcgagttc tccgggtacg gggacttcag cgaaaatcac 1124401 gtcgcatacc tgcgcgacac gctcggtgcg ccgggtccgg gcccggatga aagcgtggcg 1124461 accctgggct attacagcca gttccgtgat cagaacatca ccaccgcgtc catcaaacag 1124521 ctaatcgcgg cgaacgtcag cgccttcccg catacccaac tggtgaccag tcccgctaat 1124581 ccggaaatcg tgcgagaact gttcgccgac gaggtcacca acaagcttgc cgcgccggtg 1124641 ggtgtccgct cggattgcct gggcgtcgac gcgccgttgc cggcctgggc cgagtccagc 1124701 acttcgcact atgtgcagac caaagacccg gtggtcgccg cgctgcggca gcggctggca 1124761 acggcgccgg tgatcaccga gtggtgcgag ttgccgaccg gcagttcgcc gcgggcttac 1124821 tacgagaagg gcctgcgcga cgtcatcagg tatcacgtgt cgatgacgtc gagcgttaac 1124881 ttccccgacc agacggcgac ctcgccgatg gaccccgcgt tgtacctggt gtgggcgcaa 1124941 gctaacgccg ccgcaggcta tcggtactcg gtcgaagcgc agccggggtc gcaagcgcta 1125001 gcgggcaagg tcgcgacgat ctcggtcacc tggaccaact acggcgctgc tgccgccacc 1125061 gaaaagtggg tgcccggcta ccggctggtg gattccaccg gacaggtggt tcggacgctg 1125121 ccggcagcgg tggacctgaa gacgctggtc tccgaccagc gcggcgatcg cagcagcgac 1125181 cagccgacac cggcgtcggt cgccgagacg gttcgcgttg atctgtccgg cttgcccgcg 1125241 ggccactaca cgctgcgggc cgcgatcgac tggcaacagc acaaaccgaa cggctcccat 1125301 gtggtgaact atccgcccat gctgttgtcc cgcgacggcc gcgacgattc cgggttttat 1125361 cccgtcgcca cgctcgacat cccacgcgac gcgcagaccg cggtcaacgc ttcgtaggtg 1125421 gctttcccgt cgctgcggtc cgctcacttg ccttcgggtg gttgcggcgg ctggtagcgg 1125481 ggaaataccc cggtgggcgg cggcagcgct gtgccggggg tcagccgaac acctacggcg 1125541 gcgaacgacc gctggtttgg ggcctggccg agcaggtcca aaattttgcc ggccgactcc 1125601 ggcatcaccg gctggatcag cagtgccgcg atgcggacta cctcgcaggt gacgtagagc 1125661 gtggtgcgga accgggcctg atcggcttcg gactcgctct tgcgcagtac ccacggctgc 1125721 tgcaccgaaa agtacttgtt cgcgtcgccg agcatcagcc agatcgcctc cagcgccagg 1125781 tgcatcgcct gtgcgtcgaa gtgaccgcgc actcgctcca acaagccatc ggcggtcgca 1125841 agcagcgcgg cgtcggcgtc ggcgaactca cccgggttgg gcaccctgcc gtcaaggttt 1125901 ttggccacca tcgacaacga gcgttgggcc aagttgccga gctcgttggc cagatcggtg 1125961 ttgatccgag tgacgatggc ctcgtcgctg taactgccgt cctggccgaa cgggacctcc 1126021 cgcaacagga agtagcggac ctggtccacc ccgagcgctt ccgccagggc aaccgggtcg 1126081 acgatgttgc ccaccgattt actcatcttc tcgccgcggt tgtgcaagaa cccgtgcgcg 1126141 aagatccttc gcggcaactc gattccggct gacatcaaaa acgccggcca atagacggca 1126201 tgaaacctga tgatgtcctt gccgatcatg tgcaaatcgg cgggccagta gcggcggaac 1126261 aactccgagt cggtatccgg gaagcccgcc ccggtcaggt aattggtcag cgcgtcgacc 1126321 cagacgtaca tgacgtggtc ggggtgctcg ggcacctgca caccccagtc aaacgaggtg 1126381 cgcgagatcg acaggtcgtc caggccgccg gagacgaagc tgatcacttc gttgcgccgc 1126441 gtctccggcg cgatgaagtc ggggttggcg tgatagtggg ccagcagctt gtcggtatag 1126501 gccgacagcc ggaagaagta ggtctgctcc tcggtccagg tcaccggcgt gccggtctct 1126561 accgtcaggc gcgtgccgtc gacaagttgg gtctccgatt cgacgaagaa ccgctcgtcg 1126621 cgcaccgagt accacccgga atagttgtcc agatagatgt cgccggccgc cgacatccgt 1126681 cgccagagtt ccttggacgc ctcgtggtgg tcggcatcgg tagtgcggat gaatcggtcg 1126741 aaggagatgt tcagcgcctc ctgcatgcgc tgaaacacgt cggaattgcg ccgggcaagc 1126801 gccgcggtgg gcacgcccgc tgccgcggcg gcttgtgcga ccttcaggcc atgctcgtcg 1126861 gtcccggtca ggaagcgcac gtcatagcga tccagccgtt tgaaccgggc gatcgcgtcg 1126921 gtggcgatgt attcgtaggc gtgacctacg tggggtgcag cgttgggata tgcgatcgcg 1126981 gtggtgacgt aatagggctt catttcgaca ccaccctatt gtgtgcgggt gagctccgac 1127041 cgcccagcca gacgagatcc accgcccgct ccggaacccc tggcgccgtt ggtcgacgcc 1127101 cacacccatc tcgacgcgtg cggtgcacga gacgccgata cggtgcggtc gctcgtcgag 1127161 cgagccgccg cggccggcgt gaccgcggtg gtcaccgtcg ccgacgacct ggagtccgcg 1127221 cgctgggtca cccgcgcggc cgaatgggat cggcgagtct atgccgcggt ggcgttgcac 1127281 ccgacccgcg ccgatgcgct caccgacgct gcccgtgccg agctcgagcg attggttgcc 1127341 caccccaggg tggtggccgt cggtgagacc ggaatcgaca tgtactggcc gggtcgcctg 1127401 gacgggtgtg cggagccgca cgtccagcgg gaggcctttg cctggcatat cgatctggcc 1127461 aagcggaccg gtaaaccgct gatgatccac aatcgtcagg ccgaccgcga cgtgctggac 1127521 gtgctgcggg ccgagggcgc gccggacacc gtgatcttgc actgcttctc gtcggacgcg 1127581 gcgatggccc gcacgtgtgt ggacgccggg tggctgctca gcctgtccgg gacggtgagc 1127641 ttccgtaccg cccgtgaact acgggaagcc gtcccgctga tgccggtgga gcagcttttg 1127701 gtggaaaccg atgcaccgta tttgaccccg catccccacc ggggcttggc gaacgaaccg 1127761 tactgcctgc cctataccgt gcgggcgctg gctgaactgg tcaatcggcg ccccgaagag 1127821 gtggcgctca tcaccacaag caacgctcgc cgagcttatg ggctagggtg gatgcgccaa 1127881 tgagcgcgcc gagcggccca taacacccgc gcgccggagt tgctcaacat tggccggttc 1127941 gttaccgtct tgtgatcgaa cgggtggggc ctctaggttt cggagggccc attttgcttt 1128001 ttgttcgctg tgtaggtggt tgagtgttgc cgaggtcggg gatatagcgc gttgactcta 1128061 cttaccaaac ttcatcagac ccaatcaccg atgttgcgcc tggtagtcgg tgcgctgctg 1128121 ctggtgttgg cgttcgccgg tggctatgcg gtcgccgcat gcaaaacggt gacgttgacc 1128181 gtcgacggaa ccgcgatgcg ggtgaccacg atgaaatcgc gggtgatcga catcgtcgaa 1128241 gagaacgggt tctcagtcga cgaccgcgac gacctgtatc ccgcggccgg cgtgcaggtc 1128301 catgacgccg acaccatcgt gctgcggcgt agccgtccgc tgcagatctc gctggatggt 1128361 cacgacgcta agcaggtgtg gacgaccgcg tcgacggtgg acgaggcgct ggcccaactc 1128421 gcgatgaccg acacggcgcc ggccgcggct tctcgcgcca gccgcgtccc gctgtccggg 1128481 atggcgctac cggtcgtcag cgccaagacg gtgcagctca acgacggcgg gttggtgcgc 1128541 acggtgcact tgccggcccc caatgtcgcg gggctgctga gtgcggccgg cgtgccgctg 1128601 ttgcaaagcg accacgtggt gcccgccgcg acggccccga tcgtcgaagg catgcagatc 1128661 caggtgaccc gcaatcggat caagaaggtc accgagcggc tgccgctgcc gccgaacgcg 1128721 cgtcgtgtcg aggacccgga gatgaacatg agccgggagg tcgtcgaaga cccgggggtt 1128781 ccggggaccc aggatgtgac gttcgcggta gctgaggtca acggcgtcga gaccggccgt 1128841 ttgcccgtcg ccaacgtcgt ggtgaccccg gcccacgaag ccgtggtgcg ggtgggcacc 1128901 aagcccggta ccgaggtgcc cccggtgatc gacggaagca tctgggacgc gatcgccggc 1128961 tgtgaggccg gtggcaactg ggcgatcaac accggcaacg ggtattacgg tggtgtgcag 1129021 tttgaccagg gcacctggga ggccaacggc gggctgcggt atgcaccccg cgctgacctc 1129081 gccacccgcg aagagcagat cgccgttgcc gaggtgaccc gactgcgtca aggttggggc 1129141 gcctggccgg tatgtgctgc acgagcgggt gcgcgctgac catccggctg ctcgggcgca 1129201 ctgagatcag gcggctggcc aaagagctcg actttcggcc gcgcaaatct ctcggacaga 1129261 acttcgtgca cgacgccaac acggtgcgac gggtggttgc cgcctccggg gtcagccgtt 1129321 ccgacctggt tttggaggtc gggccgggcc tgggatcgct gaccctggca ctgctcgacc 1129381 gcggcgcgac cgtcaccgcg gtcgagatcg atccactact ggcttctcgg ctgcaacaga 1129441 ccgtggcgga gcactcgcac agcgaggttc accgactaac ggtggtcaat cgcgacgtcc 1129501 tggccctgcg ccgggaggat ctagccgcgg cgccgaccgc ggtggttgcc aatctgccgt 1129561 acaacgtagc ggtaccggcg ttgttgcatc tgcttgtcga gttcccgtcg atccgtgtcg 1129621 tgacggtgat ggtgcaggcc gaggtcgccg aacggctcgc cgccgagccg ggcagcaaag 1129681 agtacggcgt gcccagcgtt aagctgcgct tcttcgggcg ggttcgccgc tgcggcatgg 1129741 tgtcgccgac cgttttctgg cccattccgc gtgtctattc cgggctggta cgcatcgatc 1129801 gatatgagac ctcgccctgg cccaccgacg acgcttttcg acggcgggta ttcgaactcg 1129861 tggacatcgc attcgcgcag cggcgcaaga cttctcgcaa cgcgtttgtg cagtgggcgg 1129921 gctcgggaag cgagtcggcg aatcgattgt tggcggccag catcgacccc gcccgtcgcg 1129981 gtgagacgct gtccatcgac gacttcgtgc ggctgctgcg acggtccggc ggctccgacg 1130041 aggccaccag caccggccgg gacgccaggg cgccggacat ttcggggcac gcgtcggcga 1130101 gctgacgggg cgccgccgcg tgtggtcggc gcgtcacagc gatagtctgc tgcggtgtcc 1130161 gcatctgacg gcaacaccgc tgaattgtgg gtgcccaccg ggtcggtcac cgttcgggtg 1130221 cccggaaagg tcaacctcta tctggcggtc ggcgatcgcc gcgaggacgg ctatcacgag 1130281 ctgaccacgg tatttcatgc cgtctcgctg gtcgacgagg taaccgttcg taacgctgat 1130341 gtgctctcgc tcgagttggt cggcgagggg gccgaccagc tgccgaccga cgaacgcaat 1130401 ctcgcctggc aggcggccga gctgatggcc gaacacgtgg gccgggcgcc ggacgtctcg 1130461 atcatgatcg acaaatccat tccggtcgcc ggcggcatgg ccggtggcag cgcggacgct 1130521 gcggcggtcc tggttgcgat gaactcgttg tgggaactca atgtgccccg ccgcgacctg 1130581 cgcatgctcg ccgcgcggct aggcagcgat gtgccgtttg ccctgcatgg tggtaccgcg 1130641 ctggggacgg gtcgcggcga ggagttggcc accgtgttat cccgcaacac cttccactgg 1130701 gtcctggcgt tcgccgacag cgggttgctc acctccgcgg tgtacaacga gctcgaccgg 1130761 ctcagggagg tgggggatcc gccccggctt ggtgagcccg ggccggttct ggctgcctta 1130821 gctgcgggtg atccggatca gctggcgccg ttgctgggta atgaaatgca agcggccgcg 1130881 gtgagcctgg acccggcgct ggctcgtgcg ttacgcgccg gtgtggaggc cggcgcgctc 1130941 gcaggcatcg tgtccggttc gggtcccacg tgtgccttcc tgtgcacctc ggcgagctcg 1131001 gcgatcgatg tcggcgcgca gctgtcgggg gcgggagttt gtcgcaccgt tcgagtcgcc 1131061 accgggccgg tacccggcgc ccgcgtggtg tctgcgccga ccgaagtgtg accgaattct 1131121 tgggagcatg cctcgggcgg ccaggggtat ccgcgcgtgc cgaggccggt gggtcgatcg 1131181 gctggcgcac cagcatgcca gcggtagggc cgcaggcatc cgccctcgcg aggtcggtgg 1131241 cgcgcatcaa agccaggcgc aaaagccata ccatgatgcg acagagccgc tcggcgagag 1131301 cctccgctac cggccagctc acggcgatag ctgcatcaac ggccatcgag acaacccgtc 1131361 ggcacgggaa tcctcgcagt tcaccgcggg gagtacggca aaggctgtga ccaagctgtg 1131421 acatcgccct caaacctcgg cagagtttgg cagctactta agagttgctt aagataatcc 1131481 gcggtgttgg gtcgtgggct catcaccgaa ccgagaccca accgctcccc aactgtgtgc 1131541 gcgcgcctgt cgcgatgtgg catccggtag gcggaccatg aaaacccgga ccttggggac 1131601 agcaccggaa ccgaggaggt tgccttgagc aggttcaccg agaagatgtt ccacaatgcc 1131661 cgcaccgcga cgacgggcat ggtcacaggt gaaccgcaca tgcccgtccg ccacacctgg 1131721 ggcgaggtcc atgagcgtgc tcgttgcatc gcgggcggcc tggccgccgc gggtgtcggt 1131781 cttggtgacg ttgttggggt gctggccggc ttcccggtgg agatcgcccc cacggcgcag 1131841 gccctgtgga tgcgcggggc cagcctgacc atgctgcacc agcccacacc gcgcaccgac 1131901 ttggccgtgt gggccgagga caccatgacc gtcatcggca tgatcgaggc caaggccgtg 1131961 atcgtctccg agcccttcct cgtggccatt cccatccttg agcagaaagg catgcaggtc 1132021 cttaccgtcg ctgacctttt ggcgtcggat ccgatcggcc ccatcgaggt cggcgaggac 1132081 gacctggcgt tgatgcagct gacgtccgga tctaccggct cccctaaagc cgtccagatc 1132141 acccaccgca acatctactc caacgccgag gcaatgttcg tcggcgccca gtatgacgtc 1132201 gacaaggacg tcatggtcag ctggttgccc tgcttccatg acatgggcat ggtgggcttc 1132261 ttgactatcc cgatgttctt cggtgcggag ctggtcaagg tcacgccaat ggacttcctg 1132321 cgcgacacgc tgctgtgggc gaagctcatc gacaagtacc agggcaccat gaccgcggcg 1132381 cccaacttcg cctacgcgct gctcgccaag cggttgcggc gccaggccaa gcccggcgac 1132441 ttcgatctgt cgaccctacg cttcgcgctg tccggcgccg agcccgtcga acccgccgac 1132501 gtcgaggacc tgctcgacgc gggcaagccg ttcggcctga ggccctcagc gatcctgccg 1132561 gcctacggca tggccgagac cacgctggcg gtgtccttct cggagtgcaa cgccggcctc 1132621 gtcgtggacg aggttgacgc cgacctgctg gcggctctgc gccgggccgt tcccgccacc 1132681 aaaggcaata cccgcaggct ggccacgcta ggtccgctgc tgcaggacct agaggcccgc 1132741 atcatcgacg aacagggcga tgtcatgccc gcccgcggcg tgggtgtcat cgagctgcgc 1132801 ggcgagtcgc taactcccgg ctacctgact atgggtggct tcatcccggc ccaagacgag 1132861 catggctggt acgacacggg cgacctcggc tacctcaccg aggagggcca cgtggtggta 1132921 tgtggccgcg tcaaggatgt catcatcatg gccgggcgca atatttaccc gaccgacatc 1132981 gagcgggcgg ccggccgcgt cgacggcgtt cgtccgggtt gcgcggtggc cgtgcgtctc 1133041 gatgccggac attcgcgcga atcctttgcc gtcgcggtcg agtcgaacgc cttcgaggat 1133101 cccgccgagg ttcgtcgcat cgagcatcaa gtggcccacg aggtggttgc cgaggtcgac 1133161 gtgcggcctc gcaacgtcgt ggttcttgga cccgggacca ttccgaagac gccgtcgggc 1133221 aagctgcgtc gggccaactc cgtcaccctg gtcacctaag gccgccgagc agacgcaaaa 1133281 tcccctcgac acgccggttg cgaggggatt ttgcgtctgc tcacgcgggt cgttaccagg 1133341 cgtggacgcg gttttgtgcg ggctccatgc cctgttcgat aagcagctcg gtggcatcgg 1133401 cggcctgctc gcagatcgtg gggacctcgg cgcgctcggc cggggtaaag ttctccaaca 1133461 caaacgccgc cgggtccttg cggccgggcg ggcggccgat cccgatacgc acccgctgaa 1133521 agtctttggt acccagcgcg gccaccaccg agcgcaaccc gttgtggccg ccttcgccgc 1133581 cgccgatctt gagccggatg cggccgaact cgaggtcaag gtcgtcgtgg atgacgatga 1133641 tgttggccgg cgccaccgag tagaacttcg ccagcggccc tatctggcgg ccggactcgt 1133701 tcatgtagca gcgcggcttg gccaaaacca gggagcgccc ggctgatcta ccagtggcga 1133761 cttcggcgcc ggaacgcttg tgtgccttga acttcgcgcc tagtcgcgcg gcgagcagat 1133821 cggcgaccac gaacccgagg ttgtgccggg tacgggcgta attggctcca gggttgccga 1133881 ggccgaccac gagcaacggc tcggccatgt cgcaagccgt ctactcggac tcgccagcgg 1133941 cctcggcttc gccggcttct accgcggctt cctcggcttc ctcggctcct gcgacttcgc 1134001 cctccagctc ctcggcggtt ggcgccttca ccacgttgac caccaacaga tcagggtcag 1134061 aaatcaggct gacaccggcc ggcagcgcga tctgcccggc ggtgagctgg gtgcctggtt 1134121 cggcaccttc gatggacacg gtcaactgct cgggaatcga cagcgcctcg gcctcgatct 1134181 cgatgctgtt ggtctcttgg gtgaccaggg tgtcgggtcc ggcctggccc tcgacgacca 1134241 cgctgacttc gacgacgacc ttctcgccac ggcgcacgac cagtaggtcg gcatgctgga 1134301 tggtgcggcg gatcggatgg atatgaagtg ccttggtcag tgccagctgt tccttaccgg 1134361 cgatgtcgag ggtcaacacc gcgttggtgc cggaatgccg cagtacggcc gcatagtcgt 1134421 gtccgggcag ctccaggtgc tgtggctcgg cgccgtggcc atacagcaca gcgggtatct 1134481 tgccggcgcg ccgggcccgc cgggacgcgc ccttgccggt ctcggtacgc accgtgacgc 1134541 gcagctggtt gcttgcggat ttggccatat gtcgctcctg ggtggctcgg ttacctcgtt 1134601 tgggggcacg gccagggtcg cgacagcttg tcggcctccg tcgataacgg tgttctgccg 1134661 gcctgctgta gaccgccgac caccctcgcc gtgacgcccg gctaggctaa cccatggcta 1134721 ctgcattggg gaaattcgat ccttgtgagc tgctcggata gctgtgcccc aaccgtgcgg 1134781 acaattactt tgccgcgacg acgaatccgg cgatgatcgc ctcgatgtcg gaagcgtgct 1134841 tgacggcctc gttggccaga ctcgtgatgg tgagctgcac caggtagcgc tgcttggccg 1134901 gcggtgcgcc ggttgggaag acgatccggt tccaggtgtg cagtcgcctg ccgtgcaggt 1134961 cataactgcc ctgaatcatc gaggacggaa acccgttgaa gtctgccgtc gaggagtcca 1135021 attcggtgaa gttcgtcgac agccgggcat cggcagtgcc atgcttgagc gcttcggcga 1135081 tatcgaagtc ccggtgcagc ttgaacacca tgagcatggc cgttggatag ctttcgccct 1135141 tggcgatcat ctccgtgttc ggggtgatgt tcggattttt catcggtgcc cagcccggtg 1135201 gtgtcggaat cgacacggtc aggtcggtca ggctgctcgg tgccaccggc tctccggtga 1135261 cgccgacgct ttccagatac ttccacagcg ggaccggcac ttccgtcgtg gtcgagacgg 1135321 cgctggtggt tgggctcgtg gacaaaatcg actggaagtc aggcgatttc ggtccgcaag 1135381 cgaccgctga cattgccagc gtggctaccg cgaccgcgac cgccaagggt ctcacagaat 1135441 cttgcggaca gcgtcgaccg gccaagcccg ccggatgccc tcaaggatga cggctgccat 1135501 ctatgcgtcc ccgtcgaaaa gtcctgttac tgagccgttt tcgaagaccg cccggattgt 1135561 gctggccagc agcggcgcga tggacaaaac ggtgagctgg gggaagcgct tgtcttcgcc 1135621 gatcgggagc gtgttcgtga cgatcacttc gcgggcgccg caggaggcca gccgctgcgc 1135681 agcggggtcg gagagcacgc cgtgggttgc cgcgatgatc acgtcaccgg cgccgtcgtt 1135741 gtgcagcaat gccaccgcgc cggcgatggt gccgccggtg tcgatcatgt cgtcaatcag 1135801 gacacaggtg cgcccggcca cgtcgccgac gacgcggttg gacaccactt ggttgggtac 1135861 ccgcggatca cgggtcttgt ggatgaaggc gaggggaaca ccacctaatg cgtcggccca 1135921 cttctcggcg atgcgtaccc ggccggagtc aggggagacg accaccatgt tgccgtccgg 1135981 gtagttgtct ctgatgtaac cggtcagcag gttctgaccg cgcatatgat cgaccggccc 1136041 gtcgaagaaa ccctggatct ggtcggtgtg caggtcgacc gtcacgatcc ggtcggcgcc 1136101 cgcggtcttg agcaggtcgg cgatcagtcg cgcggagatc ggttcgcggc cacggtgttt 1136161 cttgtcttgc cgggcatacg gatagaacgg catgacggcg gtgatccgtt tggcgctgcc 1136221 ccgtttgagc gcgtcgatca tgatcagctg ttccatcagc cacctgttca ccggtgccgg 1136281 gcaggattgc aggacgaagg cgtcgcaacc gcgtaccgat tcgtggaagc gcacgaagat 1136341 ctcgccgttg gcgaactccc gcgcgtcctg agaggtgacg tggacgtcga gctctttggc 1136401 tacctgctcg gccagctccg gatgggcgcg gccggcaaag agcatcaggt ttttgcgatt 1136461 atcggtccag tcgtggctca acgcgctgcc ctcgccgttt gggatcgaat tggattaccc 1136521 atggtacgta gcgcaccgcc cggatttgtc gccgggtagc cgggatgcga cttcacggtg 1136581 tctgatcagc gtcgggtggt tgtgtgggct gttggcaggc catttctgag gctctttttg 1136641 aggcctgagc cgctgggctg ccggggcgtt tgcgctgcac ccagttctcg atgttgcgtt 1136701 gcggacccgc cgacactgcc agcgcccccg gcgggacatc ctcccgcacc actgtgccgg 1136761 ccccggtata cgcgccgtcg ccgatggtta ctggggccac gaacatggtg tcggacccgg 1136821 tccgtacgtg cgaaccgacg gtggtgcgcc gtttggacgt accgtcgtag ttgacgaaca 1136881 cgctggaggc gccgatgttg ctgtactcgc cgatgtcggc gtcgccgacg taggtcaggt 1136941 gcggcacctt ggtgccggtg ccgatggtgg agttcttgac ctcgacgaac gcgcccagct 1137001 tgccgtcggc gcccaacgcg gttccgggcc gcaggtaggt gaagggcccg accgcggcgc 1137061 catccccaat cgacgacgac gaaccgtggg tgcgcaccac cgaggcaccg tcgccgacgg 1137121 cgacgtcggt cagggtggtg tcgggaccga cgacacagcg accgccgatc tgggtgcggc 1137181 ccagcaactg ggtacccggg tgaatgacgg tgtcgcggcc gatggtgacg tcgacgtcga 1137241 tccaggtggt agccgggtcg acgacggtga cgccggccag ctggtgagcg gccaccaccc 1137301 gccggttgag ttcggaggcc agctcggcca gctggacgcg attgttgacg ccggccacca 1137361 acgcgctgtc gtcgacgtgg ctggcatgta cggtctggcc gtcggagcgc aagatggcga 1137421 tgacgtcggt gaggtagagc tcctgttggg cgttgttgga gctcagccgg ctcagtgcgg 1137481 accgcagcgc ggcgatgtcg aaggcgtaga cgccggcgtt gacttcgcgg atttcccgct 1137541 gcgatggtgt cgcgtcggtt tgctccacga tcgccatgac ttcgtgatcc tgggtgcgca 1137601 ggatgcggcc gtagccgaag ggatcatcca gcgtcgtggt cagcaccgtc accgcagccg 1137661 acaccgcgcg gtgggtggcg atcaagtcgg ccagcgtgtc ggcgtccagc agcggggtat 1137721 ctcccgaggt gaccacgacg ttgccggcgt agtcatcggg cagcgcggac agcccgcaga 1137781 gtaccgcatg cccggtccct agcggtcgat cctgcagggc gacgtcgatc gttcggccta 1137841 gggtgtcggc gagttcaccg actagcggcg cgatgcgctg gtgatcgtgt cccagcacca 1137901 cgattagacg ctgcggcgcc agcttggcga tcgcatgcag tacatgcgac agcatgctgc 1137961 gaccggcgag tgtgtgcagc accttggggg tgtccgaacg catccgggtc ccgggcccgg 1138021 ccgctaggac caggaccgcg gtgtcaccag gaaacgtcat caaccctcct tgaagctccg 1138081 tcgccaggac tcgaacctga actatctgaa ccaaaatcag aggtgctgcc gattacacca 1138141 cgacggattg cacatcgatg tgactttaga cggtgtcaac gccgtcagca cagtcaacgc 1138201 tgtcgccgtc tacccaccgg ccccacgcaa accgataccc ttgttgatgt ggccggaccg 1138261 gataaagggc cggataaggc gccggaaaac ccgacgcggg tgacgcgcgc caggatgacg 1138321 gggaccgagc gccgtcacca gctcatcggc atcgcgcgat cgctgtttgc cgaacgcggt 1138381 tacgacggga cgtcgatcga agagatcgcg cagcgcgcca acgtatccaa gccggtcgtc 1138441 tacgaacatt tcggtggcaa ggagggcctg tacgcggtgg tggtcgatcg ggagatgtcg 1138501 gcgctgctgg acggaatcac ctcgtcgctg accaacaacc gatcccgggt gcgggtggag 1138561 cgggtcgcgc tggcgttgct gacctacgtc gaggaacgca ccgacggctt ccgcatcatg 1138621 attcgcgact cgccggcctc gatcagctcg ggcacctatt ccagcctgct caacgacgcc 1138681 gtcagccagg tcagctcgat tctggctgga gacttcgccc ggcgcggcct ggacccggac 1138741 ctggcaccgc tgtatgcgca agcattggtg ggttcggtgt cgatgacggc gcaatggtgg 1138801 ctcgatgcgc gcgaaccgaa gaaggaagtg gtggccgcgc acctggtcaa cctggtctgg 1138861 aatggcctga cccacctgga ggccgatccg cggctacagg acgagtagcg ggcggggaag 1138921 ccgggcccaa tgttgactaa cctcggcgcc ctagaatggc cgcatcatga ccgcaccggg 1138981 gcctgcctgc tcagataccc cgatcgcggg gctcgtcgaa ttggcgctga gcgcgccgac 1139041 attccaacag ctcatgcagc gcgccggggg tcgacccgac gaattgacgc tcatcgcgcc 1139101 ggccagcgcg cggctgttgg tcgccagtgc gctggctcgg caggggccat tgctggtggt 1139161 caccgccacc gggcgggaag ccgacgacct ggccgccgaa ctgcgtggtg tgttcgggga 1139221 tgcggtggcg ttgttgccgt cctgggagac actgccgcac gaacggctct cacccggtgt 1139281 tgacaccgtc ggcactcgcc tgatggcgct gcgccggctg gcccaccccg acgatgccca 1139341 gctgggccca ccgctggggg tagtggtgac ctcggtgcgc tcgctgctgc agcccatgac 1139401 gccgcagctg ggcatgatgg agcccctcac gctgaccgtt ggcgacgaat cccccttcga 1139461 cggcgtggtg gcgcggctgg tcgagctggc atatacccgg gtggatatgg tcggccggcg 1139521 cggcgagttc gctgtgcgcg gcgggattct ggacatcttt gccccgacgg ccgaacatcc 1139581 ggtgcgggtc gagttctggg gcgacgagat caccgagatg cggatgttct cggtagccga 1139641 ccagcgctcg attccggaga tcgacattca cacactggtt gccttcgcct gccgtgaact 1139701 gctgctgagc gaggacgtgc gggcgcgggc cgcccaactg gccgcacggc atcccgcggc 1139761 cgagagcacc gtcaccggca gtgcttccga catgctggcg aagctcgccg agggcatcgc 1139821 ggtcgacggc atggaggcgg tgttgccggt gctctggtcc gacgggcacg cgttgctgac 1139881 cgatcagctg cccgacggca cgccggtgtt ggtgtgcgac ccggaaaagg tgcgcacccg 1139941 cgccgcggat ctgatcagga ctggccgtga attcctggaa gcctcgtggt cggtcgcggc 1140001 gctgggaact gcagaaaatc aagcccccgt cgacgtcgaa caactgggtg ggtcggggtt 1140061 cgtcgaactg gaccaggtgc gggccgcggc ggcccgaacg ggtcatccgt ggtggacgtt 1140121 gagccaattg tccgacgagt cggcgatcga gttggacgtt cgggccgcgc cgtcggcgcg 1140181 cgggcaccag cgtgacatcg acgaaatctt cgcgatgcta cgtgcccaca tcgcgaccgg 1140241 cgggtacgcc gcgctggtcg cgccgggcac cggaaccgca caccgcgtgg tggaacggct 1140301 gtccgagtcc gacacccccg cggggatgct cgatcccggc caggcgccca agccgggagt 1140361 cgtcggggtg ctccagggcc cgctgcgtga cggcgtcatc attcccggcg ccaacctggt 1140421 cgtcatcacc gagaccgatt tgaccggcag ccgggtcagc gccgccgagg gcaagcggct 1140481 ggcggccaag cggcgcaaca tcgtcgaccc gctggcgctg acggccggtg acctggtggt 1140541 gcacgatcag cacggcatcg gccggttcgt ggagatggtc gagcgcacgg tcgggggcgc 1140601 ccgccgggag tatctggtgc tggagtatgc ctcggccaag aggggtggcg gggcgaaaaa 1140661 tactgacaag ctctatgtcc cgatggattc gctggaccag ctgtcgcggt atgtcggcgg 1140721 gcaggcgccg gcgctgagcc ggctgggcgg cagcgactgg gccaacacca agaccaaggc 1140781 gcgccgcgcg gtgcgcgaga tcgcgggcga gctggtctcg ctgtacgcca aacggcaggc 1140841 cagccccggg catgcgttct cgccggacac gccgtggcag gccgagctgg aggacgcgtt 1140901 cggcttcacc gagaccgtgg accagctcac cgccatcgaa gaggtcaagg cggacatgga 1140961 aaagccgatc ccgatggacc gggtgatctg cggcgatgtc ggctacggca agaccgagat 1141021 cgcggtgcgg gcggcgttca aggcggtcca agacggtaaa caggtcgcgg tgctggtgcc 1141081 caccacgctg ctggccgacc agcatctgca gacgttcggc gagcgaatgt ccggattccc 1141141 ggtgaccatc aagggtctgt cgcggttcac cgacgccgcc gagtcccgcg ccgtgatcga 1141201 cggcctggcc gacgggtcgg tggacatcgt gatcggcacc catcggctgc tgcagaccgg 1141261 ggtgcgctgg aaggatctgg gcctggtggt ggtcgacgag gagcagcggt tcggcgtcga 1141321 gcacaaggag cacatcaagt cactgcgcac ccatgtcgac gtgctgacca tgagcgccac 1141381 cccgatcccg cgcacgttgg agatgagcct ggccgggatt cgcgagatgt cgaccatcct 1141441 gacgccgccc gaggagcgct acccggtgct gacctacgtc ggaccgcacg acgacaagca 1141501 gatcgccgcg gcgctgcgcc gggagctgct gcgcgacggg caggcgttct acgtgcacaa 1141561 ccgggtcagc tcgatcgacg cggccgccgc ccgggtgcgt gagctggtgc ccgaggcgcg 1141621 ggtggtggtc gcgcacgggc agatgcccga ggacctgttg gagaccaccg tgcaacggtt 1141681 ctggaaccgc gagcatgaca tcctggtttg caccaccatc gtggagaccg gcctggacat 1141741 ctccaacgcc aacactttga tcgtcgagcg cgccgatacc ttcgggctgt cccagctgca 1141801 ccagctgcgt ggccgggtgg gccgcagccg ggagcgcggc tacgcctatt tcctctatcc 1141861 accgcaggtg ccgctgaccg agaccgctta cgaccggttg gcgacgatcg cgcagaacaa 1141921 tgagctgggc gcgggcatgg ccgtggcgtt gaaggaccta gagatccgcg gtgccggcaa 1141981 cgtgctcggc atcgagcagt ccggacacgt cgccggcgtc ggattcgacc tgtacgtgcg 1142041 gttggtcggc gaggccctgg agacgtaccg ggacgcgtac cgggcggccg ccgacggcca 1142101 aaccgtgagg accgccgaag aacccaagga tgtgcgaatc gacctgcccg ttgacgcgca 1142161 cctgccaccg gactacatcg ccagtgatcg gctgcggctg gagggctacc ggcggctggc 1142221 ggccgcctcc tctgatcgcg aagtggcggc cgttgtggac gagctaaccg atcggtatgg 1142281 ggccctgccg gagccggccc ggcggctggc ggcggtggca cggctgcggc tgctgtgccg 1142341 tggctccggc atcaccgacg tgacggcggc gtcggcagcg accgtgcggc tgtccccgtt 1142401 gacgctgccg gactccgccc aggtgcggct gaagcgaatg tatcccggag cgcactaccg 1142461 tgccacgacg gccaccgtgc aggttcccat tccgcgagcc ggtggcctcg gcgcgccgcg 1142521 aatccgcgac gtcgagctgg ttcagatggt ggccgatttg ataaccgcgc tcgctgggaa 1142581 accgcgccag catattggta taacgaaccc tagcccgcca ggcgaagacg gccgtggtcg 1142641 caacacgacg attaaggagc gacaaccgtg atgattgtcg tcctggtcga cccccggcgt 1142701 ccgacactgg tgcctgttga agcgatcgag ttcctgcgcg gcgaggtgca atacaccgag 1142761 gaaatgccgg tcgcggtgcc ctggtcgcta ccagcggctc gttcggcgca cgccggaaac 1142821 gacgcgccgg tgttgctgtc gtctgacccc aaccatcctg ctgtcattac tcgactggcc 1142881 gccggtgccc ggctgatctc ggcaccggat tctcagcgtg gcgaacgact cgtcgacgcc 1142941 gtcgcgatga tggacaagct gcgcaccgcc ggaccgtggg aaagtgagca gactcacgac 1143001 tcgctgcgca gatacctgct ggaggagacc tacgagctgt tggacgcggt ccgcagcggc 1143061 agtgttgacc agctgcgcga agagcttggt gatctcttgc tgcaggtcct ctttcacgcc 1143121 cggatcgctg aggatgcgtc gcaatcgccg ttcaccatcg acgacgtcgc cgacacactg 1143181 atgcgaaagc tcggcaatcg ggcgccagga gtacttgcgg gcgaatcgat ttcgctcgaa 1143241 gatcaactgg cgcaatggga ggcagccaag gcctcggaaa aggcgcgaaa gtcggtagcc 1143301 gacgatgtcc atacgggcca gccggcatta gcgctggcgc agaaggttat tcagcgtgcc 1143361 caaaaggctg ggctgcccgc tcacctgatc cccgatgaga tcacttctgt ttcggtttca 1143421 gctgacgtag atgcggaaaa cacgctgcgc actgccgttt tggactttat tgacaggctg 1143481 cgctgtgccg agcgggcaat tgccgtcgca cgccggggca gcaacgttgc cgagcagctc 1143541 gatgtgacgc cgctgggtgt gatcaccgag caggagtggc tcgcgcattg gccaactgct 1143601 gtcaacgatt cccgcggcgg gtccaagaaa cgtaaaggca tgcgataacc gccccgagtg 1143661 cgacggggta gtcaacaaac ccatgggacg atgatcgtga cggaagccgg tataggtgcc 1143721 ctacgaggga gagttgtgtc gccgagacgc tggttgcggg cggtcgccgt gataggggcg 1143781 accgcgatgc tgttggcgtc gagctgcact tggcagctga gccttttcat caccgacggc 1143841 gtgccgcctc cgcccggcga tccggtgccg ccggtggata cgcacgccgg cggccggccc 1143901 gcggatcagt tgcgcgaatg ggcggagaaa cgtgctgcgg cattgggaat tccggtcatc 1143961 gcgctggagg cctacgccta cgccgctcgc gtcgccgagg tcgagaatcc caagtgtcat 1144021 cttgcgtgga ccacgctggc gggcatcggg cgggtggaga gtcaccacgg aacctaccgg 1144081 ggcgccacga ttgcgcccaa tggggatgta agccccccga ttcggggcgt ccgcctcgac 1144141 ggcaccggcg gcaccctgcg catcgtggac agggacgggg gcggcctgga cggtgacgcc 1144201 gcggtggagc gtgcgatggg gccaatgcag ttcatttcgg aaacctggcg gttgtacggg 1144261 gtcgctgcca gaaacgacgg catcgccaac gtcgacaaca tcgatgatgc tgccctctcg 1144321 gcagcgggct atttatgctg gcgtggaaag gatctcgcga caccgcgagg gtggataacc 1144381 gcgctgaggg cctacaacaa ctccgttatc tatgcgcggg cggtccggga ctgggcgacc 1144441 gcgtatgcgg cgggtcatcc gctgtagcag gatgaaccgc taacccaggc tttacgctaa 1144501 cagcggtcgg ggccagccaa cccaagaccg tccgtgcagc agctacgacg caaggagaac 1144561 ccagtgccga ttatcgagca ggttagggcc cgagagatcc tcgattcccg cggcaacccg 1144621 acggtggagg tcgaggtggc gcttatcgac gggacattcg cccgggccgc ggtgccgtcg 1144681 ggcgcctcga ccggggagca cgaggccgtc gagttgcgcg acggcggcga tcgctacggc 1144741 ggcaaaggcg tgcaaaaagc cgtgcaggct gttcttgatg agatcggccc ggccgtcatc 1144801 ggactcaacg ccgacgacca gcgattggtc gaccaggcgc tggtggacct agacggcacc 1144861 cccgacaagt cccggctggg cggcaacgcg atcttgggtg tctcgctcgc tgttgccaag 1144921 gcggcggcgg attcggcgga gctgccgttg ttccgttatg tcggggggcc aaacgcgcac 1144981 attctgccgg taccgatgat gaacatcctc aacggcggcg cacacgccga taccgctgtc 1145041 gacattcaag agttcatggt ggcgccaatt ggcgcgccca gcttcgtcga ggcgttgcgc 1145101 tggggcgctg aggtgtacca cgcgctcaag tcggtcctga aaaaggaggg gctgtccacc 1145161 ggcctgggcg acgaaggcgg cttcgccccg gatgtggccg gcaccaccgc ggcgttggac 1145221 ctgatcagcc gggccatcga gtcggcgggc ttgcgacccg gcgccgacgt ggcgctggcc 1145281 ctggacgcgg cggccaccga gttcttcacc gacggcaccg gctacgtctt cgagggcacc 1145341 acccgtaccg cagaccagat gaccgagttc tacgcgggcc tgctcggcgc ctacccgctg 1145401 gtgtcgatcg aagacccact gtccgaagac gattgggacg gctgggccgc gctgacggcc 1145461 tcgatcggtg accgggtgca aatcgtcggc gacgacatct ttgtcaccaa tcccgagcgg 1145521 ctcgaggagg gcatcgaacg gggcgtggca aatgcgttgc tggtcaaggt gaaccagatc 1145581 gggacgttga ccgagacact cgacgcggtc acgctggctc accacggcgg ataccgcacg 1145641 atgatcagtc accgcagtgg cgagacggag gacaccatga tcgccgacct cgcggtggcc 1145701 atcggcagcg ggcagatcaa gacgggcgcg cctgctcgca gtgagcgcgt cgcaaaatac 1145761 aaccagctgc tgcggatcga agaggcgctt ggcgacgcgg cccgctacgc gggcgacctg 1145821 gcatttcctc ggttcgcgtg cgagacgaaa taggtacatg cccgaagcga aacggcccga 1145881 atcgaagcgc cggtcgccgg catcgcgccc ggggaaggcc ggcgactcgg ttcggggcgg 1145941 tcgcgccacc aagccttccg caaaaccctc cacgcccgca ccgcacgcca gccgcaagac 1146001 cactcgcacg ccgcatgagc acattgtcga acccatcaaa cgggcgatca ccgaatcggt 1146061 cgagaagcgc tccgaacagc ggctggggtt caccgcgcgg cgcgcagcga tcctcgccgc 1146121 ggttgtatgc gtgctgacgc tgaccattgc gaggccggta cgcacctact tcgcgcagcg 1146181 cgccgagatg gaacaactgg ctgcgaccga ggccatgttg cgccgccaga tcgctgacct 1146241 ggaggaacag caggttaagc tcgccgatcc ggcgtatatt gcggctcagg cccgcgaacg 1146301 gctcggcttt gtgatgcctg gagacatccc gtttcaggtc cagcttccgt cgacgccgtt 1146361 ggcgccgccg caaccggggt cagacgcggc tactgcgacc aacaacgaac cctggtacac 1146421 cgcgctgtgg cacacgatcg ccgacgaccc gcacctgccg cctgccgcgc caccggcacc 1146481 ggagcccgga cgtccgggcc cgctgccgcc ggcctcgcca aaccccgagc agcccggtgg 1146541 ttgatcgtgc cgatctggag gtggtcacgc ggcaactcgg ccgtgcaccc cggggtgtgc 1146601 tcgcgatcgc ctatcgttgc cccaacggtg aacccggcgt cgtgaaaact gcgccgagac 1146661 tgcccgacgg cacgccgttt ccgaccctgt actacctgac gcatccggtg ctcacggcgg 1146721 cggccagcag gttggagacc acgggactca tgcgcgagat gaaccggcgg ctgggccagg 1146781 atgcggagtt ggccgccgcc tatcgacggg cacacgagtc gtatctgtcc gagcgtgacg 1146841 ctctcgagcc gctcgggaca acggtctccg cggggggcat gcccgaccgg gtcaagtgcc 1146901 tgcatgtgct gatcgcgcat tcgctggcca agggcccggg gttgaaccca ttcggtgacg 1146961 aggcgctggc gttactggcc gccgagccac ggacggccgc gaccctggtg gctgggcagt 1147021 ggcgctaacc cgggtcgccg cgatcgactg cggtaccaac tcgattcgct tgctgatcgc 1147081 cgacgtggga gccgggttgg cgcgcggaga gctgcacgat gtgcatcgtg agacccggat 1147141 agtgcgcctg ggccagggag tcgacgccac cggtcggttc gcgccggagg cgattgcgcg 1147201 gacccggacc gccctgaccg actacgccga actgctgacg tttcaccatg ccgagcgggt 1147261 gcggatggtc gccacgtcgg ccgcccgcga tgtggtcaat cgcgacgttt tctttgcgat 1147321 gacggccgac gtgttgggcg ccgcgctgcc cggctcggcc gcggaggtga ttaccggcgc 1147381 cgaggaggcc gagctctcct tccgtggagc ggtgggcgaa ttaggcagcg ccggtgcgcc 1147441 tttcgtcgtc gtggacctcg gtggcggttc caccgagatc gtgctgggcg agcacgaagt 1147501 ggttgccagc tactcggcgg acatcggatg cgtccggctg accgaacgct gtttgcactc 1147561 cgacccgccg acgttgcagg aggtgtccac ggcccgccgg ctggttcgcg agcggctcga 1147621 gcccgcactg cgcaccgtgc cgctggagct ggcccggacc tgggtcgggc tggctggaac 1147681 gatgaccaca ctgtccgcgc tggcgcagtc catgacggcg tatgacgctg cggccattca 1147741 tctttcgcgg gtgcccggtg ctgatctgct cgaggtttgc cagcggctga tcggcatgac 1147801 tcgcaagcag cgggccgcgc tggcgccgat gcacccgggc cgggccgacg tgatcggcgg 1147861 tggcgcgatc gtggtcgaag agttggcgcg cgagctgcgc gagcgggccg gcatcgacca 1147921 gctgaccgtc agcgaacacg acatcttgga cggcatcgcg ttgtcactgg ccggataagt 1147981 cacatctgcc acacgcgtat ctgcgcgggg ggacactctt ctgcccgcct cgtagcgaca 1148041 accttggccg atgtcagacc cgcatgggaa tgttcggcca tgaccagaca actgcatgga 1148101 attgagcttc gatacgtgct caccctgcac ctggccgtcc atggaccggc ggccattacc 1148161 gaaatgatct aaggcctggg ctggcacggc tttggagtcc ggggcagggc atccaaggtg 1148221 gtgtcggagg cactgcgctg ggaaatcgga cggggccgag tataccggct cgggcgcgga 1148281 cgctacgggc cggggtacat cccgcgctcc accgaatacc ggattcacca acgcgtgttg 1148341 gcgttgcggg catccgccaa cgtgtcgctg cgaggcgggc aaagtgtaca tccgctccca 1148401 gcggaaacgc ctgtggcaga tgtgatttag gcttcgaagc ggtagcccat ccctgattcg 1148461 gtcagcagat gtttggggtg cgacgggtca tcctccaatt tgcgccgcag ctgcgccaga 1148521 tacacccgca ggtaatgggt ttcagtcgca tatgccggtc cccacacttc tttgagaagc 1148581 tccccgcggc cgaccaactt gccgcggttg cgggccagca tttccagcat gccccactcg 1148641 gtcggcgtga gatgcacttc ggcaccgtct ttgatgacct tcttgccggc cagatcgacg 1148701 gtgaatgaat cggtttcgat caccggctgc tccaactcgg cggccgcggt gttacgccgt 1148761 accgctgcgc gcagccgagc cagaaactcg tccattccaa acggtttcgt cacgtaatcg 1148821 tcggcgcccg catcgagggc ctggaccttg tccgacgaat cggtacgcgc cgacaacacg 1148881 atcaccggtg ccgtcaacca gccacgcagc ccgccgagca cgtcgatacc cgacatgtcc 1148941 ggcaggccga ggtcgaggat caccacatcg ggcggatgct cagcggcggc gcgcagcgca 1149001 cccgcacccg tcgaggcggt gatgacctgg tagccacgca cggtcaggtt gatacgcagc 1149061 gcgcgcagga tctggggttc gtcgtcaatc accaagacga gggtcatggg cggtcctcgg 1149121 gagccgccag atcgatcacc actgtgagcc cgccgcccgg ggtatcggta gccgaaatcg 1149181 tgccgcccat agcctcgacg aagccgcgtg ccaccgacat ccccagaccg acaccggtgg 1149241 tgttgtcgtg atcccccggc cgctggaacg gggcaaagag ttgctcctcg gtcccgcgcg 1149301 ggacccctgg gccctcgtcg atgacattaa tcaggacccg ctcacgcacc cgtcccgcgt 1149361 tgacccggac cacgcagtcg ggcgcatatc gcagcgcgtt gtcgatcagg ttggctagca 1149421 cccgctccag caacccggcg tcggccatcg ccacggcgtc tcccacgtcg accttgaccc 1149481 ggtcgatgcc ggatcggtaa aaaccggtgg cgcccttgcc gatgctgacc aaggcccgtt 1149541 gcaccgcttc ctccaggtat gcccggcgca gctgggggcg aatcacgccg gcagccaacc 1149601 gcgacgaatc gagcaggttt gcgaccaggg cggtgagttg gtcgatggac tcctcgatgg 1149661 tggccaacag ctcggcggta tcctcggggg agaaagcgac gtcttcggtg cgcaagctgg 1149721 acaccgcaac cttggccgcc gccagcgggg tgcgcaggtc gtggctgacc gccgacagca 1149781 gcgaccggcg cagctcatcg gccctagcga tggcctcggc ctggccggcc tcttccgcca 1149841 gctcgcgctg cttcaccaga cccgcggcct gtgtcgcgac cgcggtcagc actcggcggt 1149901 cgcgggcggc caacttgcgg cctgccatca gcatccaaaa ctcgtcgtcg ccgacttcga 1149961 ttgcggtgtc ggcggagtcg acgtcccgac acgggtttgt cccgacgcac gcgacggttt 1150021 cgcctgtcga tgcgccctgc cggacacgca gcatggtcac ggcccgttgg gaatacgttt 1150081 cgcggacccg ctgcagcagc gtggcaaggt ctgcgccgcg caacaccgaa ccggcaaaca 1150141 gggccagcaa ctcagcctcc tgggatgcgc gccgagcctc acgggttcgg ctagccgcgc 1150201 cgtccaccaa caccgccacc gcaacggcca tcgccaacaa cacgaattcg gttactgcgg 1150261 cgtccggttc ggcgatggtc caggtgtagc ggggctcggt cagaaagtag ttcagcagca 1150321 tgcccgacag caaggccgac aatgcggcgg gggcgacgcc gcccagcaac gccacgatca 1150381 gcacgccgat gaagaacaac gcgctctcgc cgccgatgcc catgaatcgg tcgagccagg 1150441 ccaccgtgat ggcgcagatc accgagggca ccaccagcgc ggccagccac gacgcgatat 1150501 gccgctcgcg cggggagacc cgcgaccacc cggaggcccg gctggccgcg ggatgggtga 1150561 ccatgtgaac gtcgatgccg ccgggctcct ggacggtgcg ggcgccgatc ccctcgtcaa 1150621 acaggcgtgc ccatcgcgat cgccgcgatg tgccgacgac gagctgcgtg gcgttcatct 1150681 cgcgggcgaa gtccagcagc gcggtgggca cgtcgtcgcc gaccacggtg tgcatggtcg 1150741 caccgaggct tgtcgccagc tcgcggaccc tgcccagctg cggcgcggac acccccgcca 1150801 ggccgtcgcc acggataacg tgaaccacca tcagctcggc gctggacttc gacgcgatcc 1150861 gcgatgcccg tcgcaccaac gtctccgact ccgggccgcc ggtcacggcg acgacgacgc 1150921 gttcccgcgc ctcccacgtg gcggtgatct ttttgtctgc gcggtacttc tccagggccg 1150981 catcaacttg gtcggccagc cacagcaacg cgatctcgcg cagcgcggtc agattgcccg 1151041 tgcggaagta gttcgacagc gcggcatcga cccgttcggc tgcatagacg ttgccgtgag 1151101 caagcctgcg ccgcaacgct tccggtgtga tgtcgaccag ctcgacctga tcggccgcgc 1151161 ggacgatctc gtcggggatc ttctccttct gctcgatgcc ggtgatttgc tccacgacat 1151221 cgtttaggcc ctccaagtgc tggatgttga ccgtcgagat caccgtgatg ccggcgtcga 1151281 ggatttcctg aacgtcctgc cagcgcttgg ggttcttgct gccaggtgtg ttggtgtggg 1151341 cgagttcgtc caccagcacc acctgaggat gacgtcgcag tactgcctcc acatcgagtt 1151401 cgggaaacct ggcaccccga tattcgacgt agcgcggcgg gatcatctcg atgccctcga 1151461 gcagtttcgc ggtcttgttg cgtccgtgtg tctcgacgac cgcggcgacc acgtcggtgc 1151521 cgcgctccag cctgcggtgc gcctcgccga gcatggcgta ggttttgccc acgccggggg 1151581 ccgcgcccag atagatccgc agctgcccgc gcttggtggt cacatgctca atcatccacc 1151641 ggtagggcgt aaagatcgcg caaagatcgg cgaagagcaa cgtcacggtc gtgttcctgg 1151701 ggggcccggc aactaccatc ctgctgggct atctgatgcg ctgcgatgcc ggtgcacaag 1151761 aatcgagagg actcacatgg ccgacttggt gttggtgctg accgtgatgg cctttgccgg 1151821 gctttgcctg ctctacgtcc gtggctgtga acggatcatt cgccgcgacg aaatcgggga 1151881 aacaacagtc gaactcacgc gagcgccggc cgaatggcga tgactacggt cgacaacatc 1151941 gtcgggttgg tgatcgcggt ggcgctaatg gcgttcctat tcgcggcgct gctgtttccg 1152001 gagaagttct gatgtccggg acgagttggt tgcagttcgc ggcgttgatc gcggtgctgt 1152061 tgctcaccgc gccagcgctg ggcggctacc tggccaagat ctacggcgac gaggccaaaa 1152121 agcccggcga tcgggtgttt gggccgatcg agcgcgtgat ctaccaggta tgccgagtcg 1152181 atcccggcag cgagcaacgg tggagcacct atgccctgtc cgtgcttgcg ttcagtgtta 1152241 tgtccttcct gctgctgtat gggatcgcgc ggtttcaggg cgtgctgccg ttcaatccga 1152301 cggacaagcc ggcggtgacc gaccatgtcg ccttcaacgc cgcggtcagc ttcatgacca 1152361 ataccaactg gcagtcctac agcggcgaag ccacgatgag ccacttcacc cagatgaccg 1152421 ggctggccgt gcagaacttc gtctccgcgt ccgccggcat gtgcgtgctg gcggccctga 1152481 tcagaggtct ggcccgcaaa cgggcgagca cgctcggcaa cttctgggta gacctcgccc 1152541 gcaccgtgtt gcgcatcatg tttccgctgt cgttcgtggt ggcgatcctg ttggtcagcc 1152601 agggcgtgat ccagaacctg catggtttca tcgtcgccaa cacgctggag ggcgcccccc 1152661 agctcattcc aggcgggccg gtggccagcc aggtcgcgat caagcagctc ggcaccaacg 1152721 gcggcgggtt cttcaacgtg aactccgcgc atccgttcga aaactacacg ccgataggca 1152781 atttcgtcga aaactgggcg atcctgatca tcccgttcgc gctgtgcttc gccttcggca 1152841 agatggtgca cgaccgtcgt caaggctggg cggtgctggc catcatgggc atcatttgga 1152901 tcggaatgtc agtcgcggca atgtcattcg aggccaaggg caacccgcgg ctggatgcgc 1152961 tgggggtgac acagcagacg acggtcgacc agtccggcgg caacctggag ggcaaggagg 1153021 tgcgctttgg cgtcggtgcg tctgggttat gggcggcgtc gacgaccggc acctccaacg 1153081 gctcggtcaa ctcgatgcac gacagctaca caccactggg cggcatggtc ccgctggcgc 1153141 acatgatgct cggcgaagtc agcccgggcg gcaccggcgt cggattgaac ggcctactgg 1153201 tcatggcgat cctggcggtt ttcatcgccg gcctcatggt aggccggaca ccggagtatc 1153261 tcggcaagaa gatccaggcc accgagatga agctggtgac gctctacatc ctggcgatgc 1153321 ccatcgccct gctgagtttc gccgccgcgt cggtgctgat ctcctccgcg ctggcgtcgc 1153381 ggaacaaccc tgggccgcat ggtctttcgg agattctata cgcctacacg tcgggcgcga 1153441 acaacaacgg gtcggccttt gccggtctga ccgcgtctac ctggtcatat gacaccacga 1153501 tcggagtggc gatgttgatc ggtaggttct tcctgatcat tccggtgctg gcgatcgccg 1153561 gctccctggc acgtaaaggc acgacgccgg ttaccgccgc caccttcccg acgcacaagc 1153621 cgctctttgt tggcctggtc attggggtcg tactgatcgt cggcggcctg acgttcttcc 1153681 ccgccctggc gctggggccg atcgtcgagc agttatcgac ccagtgatga tcgcacgcat 1153741 ggagacctcc gcaaccgccg cggcagcgac gtcggcaccc cggctccggc tggccaagcg 1153801 ctcgctgttc gatccgatga ttgtgcgctc ggcgctgccc cagagcctgc gcaagctggc 1153861 tccgcgggta caggcccgta acccggtcat gttggtcgtg ctggtcggtg ccgtgatcac 1153921 cacactggcg ttcctgcgcg acctcgcatc ctcgacagcc caagagaacg tcttcaacgg 1153981 tctggtcgcc gcgttcctct ggttcaccgt cctgtttgcc aactttgccg aggccatggc 1154041 cgaaggacgc ggcaaggctc aggcggcggc gctgcgcaaa gtccggtccg aaacgatggc 1154101 caaccggcgc acggctgcgg gcaacatcga atcggtccct tcgtcgcggc tggacctcga 1154161 cgacgtggtg gaggtttcgg ctggcgaaac gatcccgtcg gacggcgaga tcatcgaagg 1154221 cattgcctcc gtcgacgagt ctgcgatcac cggcgaatcg gcaccggtga tccgcgagtc 1154281 gggcggcgac cgttccgcgg tgacgggtgg caccgtggtg ctgtcggatc ggatcgtcgt 1154341 gcggatcacc gccaagcagg gacaaacatt catcgaccgg atgatcgcgc tggtggaggg 1154401 cgccgcacgg cagcagacac cgaacgagat cgcgctgaac atcctgctgg ctgggctgac 1154461 gatcatcttt ttgctcgcgg tggtgacgct gcagccgttc gccatctatt ccggcggggg 1154521 acagcgggtg gtcgtgctgg tggcgttgct ggtgtgtctc attccgacca cgatcggtgc 1154581 gctgctgtcc gcgatcggca tcgcggggat ggaccggctg gtgcaacaca acgtgctcgc 1154641 cacatctggg cgggcggtgg aggcggccgg cgacgtgaac acgctgctgc tggacaagac 1154701 cggcaccatc accctcggta accggcaggc caccgagttc gtgccgatca acggtgtgag 1154761 tgccgaggcg gtcgccgacg ccgcccagct gtcgagcttg gccgacgaaa ctccggaggg 1154821 ccgctcgatc gtcgtgctgg cgaaggacga gttcgggctg cgcgcccgcg acgagggcgt 1154881 gatgtcacac gccaggttcg tgccgttcac cgccgaaacc cggatgtccg gggtcgatct 1154941 cgccgaggtt agcggcatcc gtcggatccg caagggtgcc gcggctgcgg tgatgaagtg 1155001 ggttcgcgat cacggtggcc accccaccga ggaggtgggt gccattgtcg acggcatcag 1155061 ctccggcggg gggacacccc tagtcgttgc ggaatggacc gataacagca gcgcgcgggc 1155121 catcggcgtc gtccatctga aggacatcgt caaggtgggc atacgggaac gcttcgacga 1155181 aatgcgccga atgagcatcc gcaccgtgat gatcaccggt gacaacccgg cgaccgccaa 1155241 ggcgattgca caggaggccg gcgtcgacga tttcttggcc gaggccacgc ccgaggacaa 1155301 gcttgcgctc atcaagcgcg aacagcaggg cggtcggctg gtcgccatga cgggtgacgg 1155361 gaccaatgac gcacccgcgc tcgcgcaagc cgatgtcggg gtggcgatga ataccggcac 1155421 ccaggcggcc cgggaagccg gcaacatggt cgatctcgac tccgacccca ccaagctcat 1155481 cgaggtcgtg gagatcggca agcagctgct gatcacgcgg ggcgcgctga cgacgttttc 1155541 gatcgccaac gacgtcgcga agtacttcgc catcatccct gccatgttcg tcggcctgta 1155601 tccggtgctc gacaagctga acgtcatggc gctgcactca ccaaggtcgg cgattctgtc 1155661 ggcggtcatc ttcaatgcgc tggtgatcgt cgccttgatc ccattggcgt tgcggggcgt 1155721 gcggtttagg gcggaaagcg cgtcggcgat gctgcggcgc aacctgctga tctatgggct 1155781 gggcggtctc gtcgtcccgt ttatcggcat taaactggtc gatctcgtca tcgtcgccct 1155841 cggggtgtcc tgatgcgtcg tcaattactg cccgcgctca ccatgctgtt ggtgttcacc 1155901 gtcatcaccg gcatcgtcta cccgcttgcc gtgaccggcg tcgggcaact gttcttcggt 1155961 gaccaggcga acggcgcgct gctcgagcgg gacgggcagg tcatcggctc cgcccacatc 1156021 ggccagcagt tcaccgccgc gaagtacttc cacccgcgcc cctcgtcggc aggcgacggt 1156081 tacgacgctg cggcgagctc gggctccaac ctgggaccga cgaacgagaa gctgctggcg 1156141 gccgtcgctg aacgggtcac cgcctaccgc aaggaaaaca atctgccggc cgatacgctg 1156201 gttccggtcg acgcggttac cggctcgggt tccgggctgg acccggccat atcggtggtc 1156261 aatgccaagc tgcaggcacc gcgggtggcg caggcgcgca atatctcgat aaggcaggtc 1156321 gagcgtctga tcgaggacca caccgacgcg cgtggtctcg gcttcctggg cgagcgcgcg 1156381 gtgaacgtgc tcaggctgaa cctcgcattg gatcgcctct gactctcagg cggtagtggc 1156441 gatctgctgc tcgatcatcg ggagccgcac ccgaaacacc gtctggccgt tgcccgactc 1156501 ggccgtgacc gagccgcgat gcgccttgac gatcgagctg acgatggcca ggcccaagcc 1156561 gtggccggac ccattggacc gagacttgct ggcccgcacg aaccggtcga agaggtgggg 1156621 caggatctcc gggtcgatgt cggggccgtc gtcggtcacc gacaattcaa cacacggcgc 1156681 gttgggacca gtgcggtggc aggtgatccc gatggtcact gtgacgccgg gctgggtatg 1156741 cacccaggca ttggtgagta gattgctgac gagttgatgc aagcgggcat gatccccgtt 1156801 gacccagacc ggctcgtcgg gcagattctt cacccaacgg tgggtgggcg ccgcaaccgc 1156861 cgcgtcattc accgcgttga tgaccaggtc ggtcaggtcg aggtcctcgg tttctagatc 1156921 ttcgccctcg ctgagacggg agagcagcag cagctcgtcg accagcagcg tcatccgccg 1156981 cgcctcggat tcgatgcggg ccagcgcgta ttcggtggtg ggcggtaggt ccgagctatc 1157041 ctgacgtgtc agttcggcat agccctggat cgccgccagg ggagtacgca gctcgtggct 1157101 ggcgtcggtg atgaactgcc gcatccgcag atcggaatcg acgcgatgcg ccagcgcacc 1157161 atcgacgttg tccaacaagc gattcagcgt gtgcccgacg attccgacct cgttatccgg 1157221 gtcggtatcc cccggacgga ctcgcacgct gatctggtgg tcgtcatcgg taagtggcat 1157281 ggtggcgacc tcggcggcgg tcgcggcgac ccggcgcagc gggcgtagcg catatcccac 1157341 cacccacacc gtcagtgctg cggtaaccac cagtgcggcc ccaacaagcg cgacggtggt 1157401 gactttcttg cgggcgatga tctggttggc caggcttagc gatacgccga cgaacagtcg 1157461 atcggcgcca gcggcgctgc tgtcaacctg gtaggcgccc aggctgccca ggctttcgac 1157521 acgcggcggg ccgccgtccc acacttgcgc ttcgatcgcg cggatgacgt cgggcggagc 1157581 gggtcgtgct ccgtcttcgg agaaaacggc cgatccgatc accacgccgt cgtgcagcac 1157641 ggcaatgagg tttccgggcg tctggccggt gaactccagc accgcttgtg acatcgggag 1157701 gttgccggtg ggcgtggatg tttgcgcact gtcgcggtat ctggtgtaag agtggttcaa 1157761 cgcgtgcagg gattcgacta gctcggcgtc gttcatcgcg gtgacatagc cgcttaggct 1157821 cagcacggag acgacaccga cggccaccag cacaacggta acgaccgcca acacgccgag 1157881 cagcaattgc tggcgtaacg agcggggtcg ccagcagggg gcttttctgg accgagtgtt 1157941 tcggtccggg atcatgccag gctcattccg gcggacgcag catgtatcca atgccgcgga 1158001 ccgtatggat cattggctcc cggtcggagt cgatcttctt cctcagatag gagatataca 1158061 ggtcgacaat gctggtgcgg cctgcgaagt cgtagttcca aacccgatcc aggatctcgg 1158121 tacggctcag tgctcgtcgg ggattgcgca tcaggaatcg aagcagttcg aactcggtcg 1158181 aggagagcga gatcggcgta ccgtcgcggg ttacctcccg gctggccccg tcgagcgtaa 1158241 ggtctccgac ccggagtgcc tcatcggcgg gcctttccag atggctggag cggcgcagca 1158301 acccgcgcaa ccgggcgacc agctcctcga ggctgaacgg ctttgtcatg tagtcgtcgg 1158361 cgcccgaggt cagaccggtg acccggtcca tcacggaatc gcgcgcggtg aggaacagcg 1158421 tgggtgtgta gacgtcggat tctcggaccc gtcgcaggat ttccaacccg tccacatcgg 1158481 gaagcatgat gtcgaggacc agcacatcgg ggccgacctt gtcgaacttg gctatggcct 1158541 cttgcccgtc gtgggcgact tcgacatccc agccttcgta gtgcagcgcc atcttgacca 1158601 gattggtcag cgctggttcg tcatcgacca acaacacccg gatcggtgat ccatccgcgc 1158661 gatgaatccg tggcagctgc cccaggatgg cttgccgcgg acgttgactg cgcgtgtacc 1158721 ccgacatcgt cgtcatgctc ccgtatcctc tcaagtcctg tgcaagcgca catgcagttg 1158781 tcacgggatt cataaatttt tcaaatgtcg cttatgtagt tacttcggcc tgaaaaggtg 1158841 accgggcggg atgtcgggct tcggcggtga gaaagcggat ctcggtttcc gggtatacgg 1158901 agcccccggt ggaccggtta tgcggggagg gcgctgatcg tgaccaggtt gtgggcgaac 1158961 acgccgtgtc cgacccaggt ccgggtgcct tcgagaccgc cgatccggcc gcggtcccag 1159021 ccgtagccgc gtttgaggtg gctgatccgg ccttcgcatc cggtccgcca tttgatggtg 1159081 cggcggaacg cttttcggtg ttcttcggcg cgtcgatcct gcgaaggttt gcctttgcgc 1159141 gggatcagca cattcttgac gcccacctcg gtgagctgct ggtcgacggc ggcttcgcca 1159201 tagccgcggt cggcggtgac ggtgcgcggc gtgcgtccgg cgcgcttttt cacccacgcc 1159261 accgctggcg ccagctgcgg cgcatcgggt gggttgccct gctgcacagt gtgatccagc 1159321 acaatcccgt catcgttgtc gacgacctgg gccttgtgct caaactcgac cggcttaccg 1159381 agccgaccct tggtgatcgg ggcgggcatc accgtcgtgc aggctgaccc gtcgactcgc 1159441 cccgtccgaa gtgatgcccg cgacccgctg gcgggtctgc gccacaatct gacgcgtcgc 1159501 gttgagcagc tcggttaggt cgttgaccgc gcgcaccagc ccaccacagc ggcgacccgc 1159561 gaccgcatca cgctcaccgc gggcggccag cgcggcggcc ttggccttgg cccggagcac 1159621 cgcctgcttg gcgttgtcca gcagctgctg ggcctcctga gcagcggctt gggccagctc 1159681 ggccagctcg ccggtgaacc tcagtaccgc ggcccgcgct tcgtcacgcc ccagctccgc 1159741 acgcgagcgc agtttcgctg cgaccgcgtg cgcgcgccga ccggccgcgc gggagcggtc 1159801 gccaacccgg gtgcgcaccg cgccgccagc ggcctgaatc cgtttgccgg ttgcggcgat 1159861 ccggcgcatt gccttggcca acagacccaa gtcggtcgga taagacacgt tcgcccgcgc 1159921 caccgtggta tcggcccgga tccgattggt gcccagcagc ttggcctcgg ccgccttggc 1159981 caacaatgcc tcgttgagcc cgtcgatcgc cgccgatccg caacgcgtgg tgagcttcat 1160041 caatgtggtc ggatgcggca ccgacccgtc cagcgcaatg cggcaaaacc gccgtcaggt 1160101 gatcgaatca gccacctccc ggcacagcga ctcatagccc agccggtagc ggaacttcac 1160161 aaacatcaac tgcagataga cctccatcgg cgtcgacggc cggcccctgc gcgggtcgaa 1160221 gaacggcacg aacggggcga agaacgccgg atcgtccaac aatgcgtcca cccgggccag 1160281 ttcctcgggc agtcggcgca cctcgtcggg cagcagcgac tcccacaacc agcactgatc 1160341 gcctaaagta cgaaacacga tggcctcaat cccttccgca acaagggcat tgaggccatc 1160401 ttcccagttc agcaccatcc gaccggggat caacgcgccg actttagcag gtcgaagtag 1160461 ttagtcgttc agataacaac gtggccacac accaaccggt gtgcggccac gttgtaattg 1160521 acggcgcggg ccttaagcca gctttaggcc cagctggagc cgacggcgct gtcggtttgt 1160581 gccatgttgt tgccggcagc ctgcaccttc tgcccgtggg cgttggcctg ctcgtagatc 1160641 acctggaagt tacggcccag ctgggtaatg aacccctggc aggccgccga accggcgccg 1160701 ccccaaaagt cactcgcggt caacacatca gaaatgatgg cctgatgctc ggcctccagc 1160761 gacccggcct gagcgcggat catggcgccg tgagcgtcga cgtccccgaa ttgatagttg 1160821 atggtcatgt gtcctcctga gtcgtcgggc cgggtcagct gctgaggatc tgctgggagg 1160881 cctgctcttg ctgttcgtag ttgttggcgt cgcgaaccag cccgtcacgc accccgtgca 1160941 gcatgttcac gatgttgcga aacgcctgat tcatctgggt catggtgtct agcgaggtcg 1161001 cctcggccat gccactccag cccgcgcccg agatgttttg cgcggacgcc cacatccggc 1161061 gagcctcgtc ctccaccgtc tgggcgtgca cctcaaaacg gcccgccatg tcccgcatcg 1161121 cgtgcggatc cgtcataaaa cgcgaggcca tgctgctgtc tccttgtctc gaagtcgtca 1161181 cgttgttgaa gttctagcgg ctgtgatcgg cgcggtggtg gccgcgtggc ggacaggtta 1161241 tgactcaacg gttaattgct ggcctcaaac gagtgagatg tccccctttg tccgcatcac 1161301 acgacgacct gtttgggcat gacagtgggc ttgaatccgt accgcggccc ggcataggca 1161361 ccggtgccct tggcggccga ggccattccc ggcatcatcc cggtaactgg gccggcttct 1161421 tcggcggcga cggtccagcc gctgccttcg agcgctgtgg cgccggcggt tgtcgccggt 1161481 gcggccgtag accaggccgc cggcactgac agccggccga ccagggtggc ctcgcctaaa 1161541 cttgcgccga gccccgctgg cgtcaccgag tccgccaacc ccgcggcggc cgcactggcg 1161601 gcaccctcgg cagcctcgat ggcgccttcg gcgatcgcta ccggcgcccc actgttcagg 1161661 gcatttgcta ggaatatcgc ggtggggatg gcggcgttga cataccaagc ggcggtgttg 1161721 actgcgctgt tgatgatgtt tgccacgaac ggggtcgcga gcagggcgtc gatgtcggca 1161781 atgattccgc tcagccccgt cgagtcgaga accgatgtga ctggggaggc gagcccactc 1161841 accgcgttgg gcaggctact gatcaggtcc gctacgctca cctggttgac ggcggcggtg 1161901 gcggcagccg agccgaccgc ggcggactgg gcggccagcc cgcccgggtt ggtggtctgc 1161961 gacggcgggc ttaacggttg cagcatcccg gcggctcccg aagcggccgc gtagccgtac 1162021 atagccagag cgtcctgagc ccacatctcg gcatagaggg cttcggtcgc catgattgcc 1162081 ggtgtgttga tccccaggac gttcgtcgcg accagggccg ccagcagcgc ccggttggcc 1162141 gcgaccacct ccggcggcac tgtcatcgca taggccgcct cgtaggcggc cgccgacgcc 1162201 atggcctgcg agccggcatg cgcagcggct tcggcggtgt aggtcaacca agccagatag 1162261 ggctgggctg cggcgaccat cgccatcgag gccggaccca tccacgactc ggtggtcagc 1162321 cgggtgatca ccgactcata cgacgcggcc gtcgtaccca actcggcggc caggccgttc 1162381 catgcggccc cggcggccat catcggtcct gcacccgcgc cggcgtacat gcgtgcggag 1162441 ttgatctcag ggggtaaagc tccgaaatcc atggggtatt ccgtttccgt ggagttattt 1162501 ggctgaattt cgttgttggt tgagcgtggc cgcccgtacg tctgccgcct agacggttgc 1162561 tggcttgggc atgacgatgg gtttgacgcc gtagcgcggt gcaccgaagc cggcgctgtt 1162621 gcgtgcggcc gaggccaccc ctggcatccc ggggatgaac gtccccgcgg cggcctgcgg 1162681 cgcggcggcg gtccagcccg cgcccggcag tgtgctggtg gtggatacca gggtcgcctg 1162741 tccggcccag gcgggcggca ccgacaacat gccgatcgcg gatgcgctgc ccaggccggc 1162801 cgcaattccg gcctcgccga gggcggcttc ggccgcgccc aatgcaccca attcgcccaa 1162861 ggcggcttcc ccaccgagag ccgaggcggc ctcggccgcc tcctctgcag gcaggaggcc 1162921 gccgccggca aggccgatca gcgtagacgt ggcggaggcc cagttcccgg ccccgatgtt 1162981 gaggatgttg ccaatgccac ccgagagttc gggcgggaac aaccccgtcg tcgcttggat 1163041 gatggccgac gcttcccctg tgatccccga gagtggcgag gcggcagccg atgagttgag 1163101 cgactcggtg acgccgtagg taccggcgct gattcccaga gtgttgacaa acatgtcatg 1163161 catagcctga gcttcggcgc tgacctgctg gtagaaggtg ccgtacgcag tgaagagcgc 1163221 cgcctgcagc gccgaaacct catcgagggc cgccggagcg atggctgtgg tgggcgccgc 1163281 ggcggcagcg ttttgggctg ccatcgcagc accgatggtc ccgagttgcg cggccgcagc 1163341 cgtcaactct tcaggcactg tcttgaggaa tgacatccat tgctccttgt gtgtgaaacc 1163401 tgccggccgc tagcaccccg ggccgaccct gtgtgtttgc gtacggctgc ctgtggattg 1163461 gcgtaacgct aaccggccaa gcctccacag tcgcgaccga aaggcatggg acgcccgacg 1163521 tttacggttt tttaacgttt acgtcagcat ccttaacaag gtcttggcgg ctgacatggc 1163581 ggtgtgatct ggtgcccggg ctagcacact tcggcacaca aatgagacgc gcggcgcgcg 1163641 gattctaggc gaatgacggc tctttcgcac ctggcgtgtc gcggtagggt tggtgcactg 1163701 gatcgggtcc aagcgctaca ttcgccgtca agcctccaca gcccgattgg cagaggcagc 1163761 ggacaatccg cgctcacggg tgctggcgtt tgctagtgcc ggtaatcttc gaaagagtcg 1163821 cttctaactg ccaatatgcc gggtcgaagc cactgtccag cactgtcggc atccagatgg 1163881 gggcgttggc gcgctgatcg atacgccgtc tgcgcggctc ccggcacaat gagttcgtgc 1163941 ccgattcctg gccggtcgtt tgcgttgacg actggtctgt tgccggcctg gagacccagg 1164001 ggcaacaccc gcacgattgg ctcaaacatt cttcgcagaa gcggacgtgg ctcttcaagc 1164061 cggcgcgacc ggagcgcgat cgtttactcg gcgaagacgt ggcagaaaag ctcgccagcg 1164121 agttggcgcg gctacgcgat gtctccacaa caagagggga agctcacccg tcgtgcaaat 1164181 gctgagcggg ggtctggtcg gtcagcgtga acccgaggct ggccgcgtgg tcgaacgatg 1164241 ggcacagcgc ctccacatac tttgtctcca gcggcggcac atggaccgcc cagttgcggt 1164301 cgtgacgatc accgtgggcg atcaatgcgt cgaacgcgag gtaggtcgaa agcgcggaac 1164361 gtgggtagga gcgcttggtc ggcaggtgct gcgaaccgag caagcgcctg ctggatcgcc 1164421 tcgacgttgt gcccacgttg cccgggatcg tcccggtcgc agttgagcac aacctcgggc 1164481 atcaatgctt gcggcaaccg cacgtccttg accagcgcac cgcgcacgcc gtcacggaca 1164541 gccagctgga ccggtgccgc aggtatcccg actagggcgt gtctcccaat ttcggagttc 1164601 ccactcgggc gtggatgacg gcgcaggcca gcaggacgcc gccgaggtag gtcagggcgt 1164661 atttgtcgta gcgggttgcg atgccgcgcc actgcttgag tcgatggaag ccgcgttcga 1164721 cggtgttgcg tagcccgtag agcgcggcgt cgaatgctgg tggccgcccg ccggcagacc 1164781 ccttggcctt gcgccggtcg atctgatctt ggcgttcggg gatggtgtgc ttgatcttct 1164841 tagaccgtaa tgcggcacgg gtacttgggt gtgagtaggc cttgtcggcg agtaagcgga 1164901 aatccgtgct gcccagggcg tattcggtgc tggcatggcg atagtcgtcg agcaggggca 1164961 gcagttgcgg gttgtcgccg gcctggcctg cggtcaaccg gatccgcacc ggggcttcgc 1165021 gctgatcggt cagggcatgg atcttggtgg tcagcccgcc gcgcgagcgg ccgatcgcat 1165081 gatcgtcggg ttcatcggcg gatttcttgt aatccgacag tgccccctgt ggcgagcgtg 1165141 tccgagcagg cgcccgccga atgctggtgt gcccgcacgt tcgtggaatc caccgacagc 1165201 agcttctcga tatcctcggc cacctcagcg tccaccccga acaccgcggc aacgtgggcg 1165261 aacacctcgt cgcaggtacc atccagcgac caacggtgat ggcgcttcca caccgtttgc 1165321 cacggcccga actcagcggg caggtcccgc cacggacttc ccgtacggaa ccgccacgcg 1165381 atcccttcca ggataagccg gtgatcgcta aaccgtctgc cgggcttgcc ctcatgcgac 1165441 ggcatcaacg gctcgaccac ggcccagaac tcgtccgaaa tcacacccac tcgcgtcacc 1165501 ggccaatcct cgctggccag tacctaaaaa tttgggagac acgccctagg cgcgggctgc 1165561 agcggtagta ctttggcctg ttcggcgcat ctcctatggc tgcggcccgc tggctcaaac 1165621 cttgccttgc cacgccaagc cattcctagc cttgcctagc cacaccatgc cctgcctaga 1165681 cacagcgagc ctacgccgcg tcgagttcgg cgaaaatcaa actgacccac taccaccgga 1165741 ttgaagggtt tggtgcgtgt tgatacgtcc cgggttgtgc ctatgggagg gtgtccatct 1165801 ccacgatgcc gccgaagtcg agttcgtcga gtgctcggat cacttcgctc gacgggatgc 1165861 cgcgatagaa gggtgccgcg ttcggtccgg tgcccgtcga cggtgcttcc gcagagtcct 1165921 cgaccacgag gccgatcaca cggccgtctt gcgcaacgat cggaccgccg ctgttgcccg 1165981 gccgcgcgat tgccgagtag aggaaaatct tctgccggcc ggggatagtc gtcgcggccg 1166041 ggttgaccac ctcgccacgc tgcaccgtga tcgccatctc cgcagtcatc ggcacccgcg 1166101 ggtaaccgaa cacgtagacc tcatccgccc agtcgggatc acggaacgcc atgccgccaa 1166161 gccgcgggat gtacttgcct tcgggcatct cgaatttgat tactgcgacg tcgagcgtgg 1166221 ggtgcgggtg agcggtgccc gagaagttca ccaactcggc ttcggcgtgg ttgcttgacg 1166281 gatagacgga cagacctgcg ctcgtgcccg cgagcccggt cacgacatgt ttgttggtga 1166341 tgacgtgatt gtggtcgacg acgaggccgg ttccccaact atccaccgga ttgccagcgt 1166401 cgtcgtgacc ggcgagttga acggtcaccg cgttgtagct cgggatgatg agctcggcac 1166461 cgaacacctc ggacaaccag aggttgccgc cacgctgtcc cttcgatatc gccccctgcg 1166521 agatgtactt ctgccccatg actggcaatc gcgggtccca accgagcggc agcagaagtc 1166581 ccgcgcgttc catcgagctg aggatgcggt ggagggtcac cgcgtcgccc gcggcgggca 1166641 ggccgagggt gctcaggtat cgggagaaat ctgcgaccga ccacggttcg aagggcaccg 1166701 tcgtcggcag accgatatcc gagtcaaccg gtggtggttc gggtttgccg atcgccgccg 1166761 caaccaccgg gttgtggacc agcccgaaga attgatgggc gcacatcgcc acgttcacac 1166821 gccacgcagg agtcccgggc ttcaggtcgg ccgccgtgag ctgtcgcggt caggtgcttt 1166881 ccgcgccatc cgccgtcacc tctgccatgg tccatctacg gtatctgcga caagggcagc 1166941 gtcgatgcct cgacatgcag agtcggtgtt cgcttcacgc gaactaggcg cgcctagcct 1167001 ggacgagtcc ccgggccgac attcgcccga ggccttggcc tccatcacct aattgtgtgc 1167061 aaaaccgtat ctaattgata cgattgcgca catggctatc tgggatcgcc tcgtcgaggt 1167121 tgccgccgag caacatggct acgtcacgac tcgcgatgcg cgagacatcg gcgtcgaccc 1167181 tgtgcagctc cgcctcctag cggggcgcgg acgtcttgag cgtgtcggcc gaggtgtgta 1167241 ccgggtgccc gtgctgccgc gtggtgagca cgacgatctc gcagccgcag tgtcgtggac 1167301 tttggggcgt ggcgttatct cgcatgagtc ggccttggcg cttcatgccc tcgctgacgt 1167361 gaacccgtcg cgcatccatc tcaccgtccc gcgcaacaac catccgcgtg cggccggggg 1167421 cgagctgtac cgagttcacc gccgcgacct ccaggcagcc cacgtcactt cggtcgacgg 1167481 aatacccgtc acgacggttg cgcgcaccat caaagactgc gtgaagacgg gcacggatcc 1167541 ttatcagctt cgggccgcga tcgagcgagc cgaagccgag ggcacgcttc gtcgtgggtc 1167601 agcagctgag ctacgcgctg cgctcgatga gaccactgcc ggattacgcg ctcggccgaa 1167661 gcgagcatcg gcgtgaccaa gccctattcg tcgccgccaa cgaacctgcg ctcactacga 1167721 gatcggctca cccaagtagc ggaacggcaa ggtgtcgtgt tcggtcgact gcagcggcat 1167781 gtcgcgatga ttgttgtcgc acagttcgcg gccacgctca ccgacgacac cggcgctccg 1167841 ctgctgttgg tcaaaggcgg atcgtcgctg gaactgcgcc ggggaattcc cgattcgcgg 1167901 acctccaaag acttcgacac ggtcgcacgt cgcgatatcg aattaatcca tgaacagctc 1167961 gctgacgcgg gcgagacggg gtgggaagga ttcactgcaa tcttcaccgc ccccgaagaa 1168021 atcgatgttc ctggtatgcc ggtcaagccg cgccgattca ccgccaagct gagctaccga 1168081 ggccgggctt tcgcaactgt tccgatcgag gtctcctccg tcgaagccgg caatgccgac 1168141 caattcgaca ccctcacctc agacgcgctc ggcctcgtgg gcgtacccgc agcagtcgcc 1168201 gtaccctgca tgaccattcc ctggcaaatc gcgcagaagc tgcacgcagt aactgccgtg 1168261 ctcgaagaac cgaaggtcaa cgaccgcgct cacgacctgg tggacttgca gcttcttgaa 1168321 ggactgttgc tcgatgccga cctcatgccg acgcgcagcg cgtgcatcgc gatattcgaa 1168381 gcgcgcgccc agcatccttg gccaccgaga gtcgccacgc tgccgcactg gccgctgatc 1168441 tatgcaggtg cgctggaggg gcttgaccac cttgaactcg ccaggacggt cgacgcggcg 1168501 gcccaggcag tgcagcgatt cgttgcgcgg attgatcggg cgacgaaaag atgagtgctg 1168561 gcgcggcctg cggcgcacgg gagaacacag ggaccacccc ggttccatag tcaacgtcag 1168621 cggtgcgggt gtcgatcaga cgacgaatgg aatcgccctc gcattcctcg cgatcgagtg 1168681 cctatgagcc gcgctcctgc ggcctaggcg agcgcttccg gggctctcag acatcggcct 1168741 cgtggcggtg tgcgcggcgg catgtggctc tgtgatctct tgcgcgagcg ccgattgcga 1168801 atttcgtccg gcgaaaagtg accgctccgt gaccttaatg caagaggtgt gtggtgtgga 1168861 gaggggcggg aggaagggag tgaggcgacg gtgtcgagat gcagcgagga ttggtggact 1168921 tccggtagtt gtttaacaag gccccggaga ccagggggcg agggagagcg cgggccgact 1168981 tgggtgggtg agcctggctt gggctggtgc gtgagcggag gatcgctggt ggccccgtag 1169041 ttggcgttgg cctgcggacg tgccgcgcct gcgagggatt cgtcaatctt cctgttgatg 1169101 tcgcccgtgc cacgtcggtg agatgtcgaa gggatgtgac ctggtgcgtt cgcgaacagc 1169161 tgctgaccac ggccaccgac ggcgctcaac tgtcgtcgat tccatcccac ccgtgcttgg 1169221 actttcaaac tgtccggcgc cgatggggaa acctggtgtt tggccggaac gtggcgccga 1169281 gcctcgataa tatcagcagt tacgtccagg ggtgtggtgt acgggcaggt aaggccggtg 1169341 ggcgtgtcgt agcccagtag tgggcggtca tcgcgtgatc cttcgaaacg accagcaaaa 1169401 gtcaatcgaa ggaaatgacg caatgacctc ttctcatctt atcgacgccg agcagcttct 1169461 ggctgaccaa ctcgcacagg cgagcccgga tctgctgcgc gggctgctct cgacgttcat 1169521 cgccgccttg atgggggctg aagccgacgc cctgtgcggg gcgggctacc gcgaacgcag 1169581 cgatgagcgg tccaatcagc gcaacggcta ccgccaccgt gatttcgaca cccgtgccgc 1169641 aaccatcgac gtcgcgatcc ccaagctgcg ccagggcagc tatttcccgg actggctgct 1169701 gcagcgccgc aagcgagctg aacgcgcact gaccagcgtg gtggcgacct gctacctgct 1169761 gggagtatcc actcgccgga tggagcgcct ggtcgaaaca cttggtgtga caaagctttc 1169821 caagtcgcaa gtgtcgatca tggccaaaga gctcgacgaa gccgtagagg cgtttcggac 1169881 ccgcccgctc gatgccggcc cgtatacctt cctcgccgcc gacgccctgg tgctcaaggt 1169941 gcgcgaggca ggccgcgtcg tcggggtgca caccttgatc gccaccggcg tcaacgccga 1170001 gggctaccga gagatcctgg gcatccaggt cacctccgcc gaggacgggg ccggctggct 1170061 ggcgttcttc cgcgacctgg tcgcccgcgg cctgtccggg gtcgcgctgg tcaccagcga 1170121 cgcccacgcc ggcctggtgg ccgcgatcgg cgccaccctg cccgcagcgg cctggcagcg 1170181 ctgcagaacc cactacgcag ccaatctgat ggcagccacc ccgaagccct cctggccgtg 1170241 ggtgcgcacc ctgctgcact ccatctacga ccagcccgac gccgaatcag ttgttgccca 1170301 atatgatcgg gtactcgacg ctctgaccga caaactcccc gcggtggccg agcacctcga 1170361 caccgcccgc accgacctgc tggcgttcac cgccttcccc aagcagatct ggcgccaaat 1170421 ctggtccaac aacccccagg aacgcctcaa ccgagaggta cgacgccgaa ccgacgtcgt 1170481 gggcatcttc cccgaccgcg cctcgatcat ccgcctcgtc ggagccgtcc tcgccgaaca 1170541 acacgacgaa tggatcgaag gacggcgcta cctgggcctc gaggtcctca cccgagcccg 1170601 agcagcactg accagcaccg aagaacccgc caagcagcaa accaccaaca ccccagcact 1170661 gaccacctag actgccaccc gaaggatcac gcgaggaacc ttcactcgta caccacgtcc 1170721 ctggccttgg ccgaaggtag aacgccagca cgacttgctg ttgtcaactc ttgcgagtta 1170781 cgtgagtgcg gccggagcac acgctcgtat cgtcgtcaca gtcgaagggc gcgatcttga 1170841 gttcgacgta tcgaccttcg cccttgtggg cccgcagcag ctgcccgaag tcgagccgtc 1170901 gcagtagtga ccgggggccc agttagcgat ggctttgtca ctgtggaggg tctccctccc 1170961 gtagtgatgc accactcgca cgagagccaa ttcggccgcc cgtcgcgccg cagcagagcg 1171021 cggtggctct tcgtcgttca tttggtcatc gcctcgcgta gatgttccgc cgcgtcttcg 1171081 ccgcggacgc cggccgtgcg taggtcggcg tataccctcg gccataacat cgacctaaac 1171141 ccctgcaggt tctgttcggt cagccgggcg cacgctgggg tcgggaagaa gcgcaatatc 1171201 aaccgaccgc cggcgatttc ctgcaatccc gcagccattg ctgcacggcg aagatcgctc 1171261 catgatcgcc cgggcacgta gatttccatc ggagctattt ccgtctgcat tggcgcgagc 1171321 aggctcgccg acaacgcgct tgtggctgcc cactcaatgc ctgcggcatc ccacaactgg 1171381 ccggccttga ccacgccggc agtcggatcg cgccacagca caccagtcga aatagaaatc 1171441 ggcgaccgaa gcttgtctgc tgcctcagcg tatgcatcca acagcgcatc gcgatcaacg 1171501 atcaggcgcg ccgatttcgg gccgcgggca gtggcactgg ccagatggcc gtttttttcg 1171561 agaaacttca acgcctgagc gctgcttccc atcgagagac cggtggcctc tacaaccgag 1171621 gcgacagttg gaccggcgat gttcgccagc agcgcttcac atacggcaag tgtggcgcgg 1171681 cgccagccta tgcgcgcgtc gagtggggca ggtggcgcgc ctttcgtctc gatcaccaga 1171741 gtggttccag ttgatgtgtt tcggtagtgg atatcggcag cgcccgactc gtccacccac 1171801 ccaacaccgg cgtcatgtgc cgccttccgg gcaccaggag acatcgtggg tgcagccaaa 1171861 atgtcgggcc gggatgtggc gtggagtgcc tcggctacct gacggggcca accagtcgtg 1171921 agccagcgaa ccaggaactc tgcgccgtcg agcgacacaa tcacgtcgcg atgagggccg 1171981 tttacgcgtc gtgcgcgcac ttcgctgcga aacgcgcctt ccagcgcact cacggtgcgt 1172041 tcgtcccaag acatggaggc atcatacttc actaagggac gatactctac tgtttcagtg 1172101 aagtaccatc tacggatgaa gttcgattgc cacgtgcgat ccgacgcttg cacttcgctg 1172161 gcgggccgcg aacccgatca gctcctccag gtcgtcggca cgggtcagca aggcggcgct 1172221 gtccgggtgg gcgcgcatgc caaacaccag tcgctcaccg tcggggtcaa gcaacaccag 1172281 gtcgcggagc atggtggcgg gcggaaccca cacttcgggc tagctctagg gggcagggct 1172341 ttgacgggtc ttgacaaata cgtgtagcta cacgagtctg gagtaatggg caaaggggcg 1172401 gcgttcgacg aatgcgcttg ctacaccacc cggcgggcgg cccgacagct cggccaggcc 1172461 tatgatcgcg cgctgcggcc gagcgggttg acgaacaccc aattcagcac gctggccgtg 1172521 atctcgctgt cggaaggcag cgccgggatc gacctcacga tgagcgagct tgccgcccgc 1172581 atcggcgttg aacgcacgac gctaacccgc aacctcgagg tgatgaggcg cgacggactg 1172641 gtgcgggtca tggcgggtgc cgacgcgcgg tgcaagcgca tcgagctgac cgcgaagggc 1172701 cgcgcggcac tgcaaaaggc ggtgccccta tggcgcgggg tgcaggcgga ggtgaccgca 1172761 agcgtcggtg actggccacg ggtgcgacgc gacatcgcga atctgggtca ggcggcggag 1172821 gcgtgtcggt gatctttttt gcgcatatat gtgtagttac acccaactga ggagcaaatg 1172881 atggctaggc agagatttcg tgaccaggtg gtgttgatca ccggtgcctc cagcggcatc 1172941 ggggaggcga ccgcgaaggc attcgcccgt gagggcgccg tggtcgcctt ggcggcgcgc 1173001 cgcgagggtg cgttgcgccg ggttgcccgg gagatcgagg ccgcgggtgg gcgggcgatg 1173061 gtcgccccgc tcgacgtctc gtcgtcggag agcgtgcgcg ccatggttgc cgacgtggtc 1173121 ggcgagtttg gtcgcattga cgtcgtgttc aacaacgccg gcgtctcgct ggtaggcccg 1173181 gtcgacgcag agaccttcct tgacgacact cgcgagatgc tggagatcga ctacctcggc 1173241 acggtgcgcg tggtgcggga ggtcttgccg atcatgaagc agcaacgatc gggacggatc 1173301 atgaacatgt cgtcggtggt gggtcgcaag gcctttgcgc gattcgccgg ctactcctcc 1173361 gccatgcacg cgatcgccgg tttctccgat gcgttgcgcc aagagctgcg gggtagcgga 1173421 atcgccgtct cggtgatcca cccggcgctg acccagacac cgctgttggc caacgtcgac 1173481 cccgccgaca tgccgccgcc gtttcgcagc ctcacgccca ttcccgttca ctgggtcgcg 1173541 gcagcggtgc ttgacggtgt ggcgcggcgg cgcgcccgcg tagtcgttcc atttcagccg 1173601 cggctgctca tggtgggtga cgcgttctcg ccgcggtacg gcgaccgggt ggtccgcttg 1173661 ctcgagagca agatattcgg tcgcctgatc ggttcctatc ggggttcggt ataccgccat 1173721 cagccgaccg aatcagcgaa ggcacaggcg gcccagcccg agcgcgggta ctcgtcggcc 1173781 cggtgaggtt ggttggagcc aggctccacg tcgctgaggc gagcggcgtg cgcagcgcgt 1173841 agcggctcgt cggcacggtg tcgatggtct ccttggcgct gaatcgcgac gtgctggcga 1173901 tcacccgggc aagccgatca cgcagttcgt cgtcgggcgc cagctcaacg tcgagttgcg 1173961 ctctcccgct ttgagatcca gcgcgacacc tcgtcgcgcc ggtacacgac gcgccgtccc 1174021 aaggtgaagc tcgccggtcc gatgtccgag tgccgccagt gccgtagagt gccgacggga 1174081 acgccgatca tctccgaaac ttgttttgcg tccagcagat ccatgtttct cctcccgaca 1174141 tgggctggtt tccaatgtct ccaacagtgc tggcagcgtc cgtgttcggt cgcccatttc 1174201 gcttgcgcga ctgcgccata accggccagg tgaggcgcga cgggttcgag agtggcgccg 1174261 cggtattgtg cgactgccct ggccgagcgg agcagctcgt cgtcgtcgat gccccggcgg 1174321 tcgagttgga cgatgtcgac gtagtcccgc cagcgggtgc tggtgatgcc gcgttcgagg 1174381 atggtcactc ccttctcggc gatgatggtc tcgggcgcgt agcccaggag tgtgatcggc 1174441 tcgccgagga tccggtcgat ggtcacccgt gtgggccacg gcgcgatcgg ttcgccggtg 1174501 gacacatccc aggccgcgat gccctgccac ggtccgaccg acatagcgac tcgcacgcgc 1174561 aggcccgggt agtcggcccg ctcgcgaatt tcctgcacgc tgctcgtgtc gaggttgaac 1174621 gccaccccgt cgtcgatgtc gatcacggcg atgtcgcgaa ccacctgggt gagatgctcg 1174681 gcggtgacgt cggcgcgcat ggcgttggag tcggtgtcct tcgtcgggtg ccgaacgccg 1174741 taggcggcca gcaggatccg gcctttgagg acgaagtctg cggcatgcga ggtgcgggtg 1174801 agccgatcca ggaacgattc gagggtgtgt cgagtcaggt actcctgcgt cggtgcgccg 1174861 gtcccgcact tcgaggcagt agaacgagcg aggattggat ccggcgggac accgtgtcgc 1174921 cggagctcac gccagcatcg ccaatgtttg taagaccggg gacttctccc gcggtaggcg 1174981 ggtggcgatc tcgatcagcc gggcgggttt gccgcctcgg cgcagccact ctcgcagcgc 1175041 gtcacgcgcc agttcgtaac cgacttcgta gcggagccgg aatgtatcgg cgatcgagcg 1175101 ctcgggtgag ttagattccg attgtctgat ccgatcccgg gatcgtgatc tcgtcgcgtc 1175161 cgatctaaaa tgtggcccgg tcgaagtggt gccacgcaat cgcgcctgtg ctggccggtg 1175221 tcctcgacca gcgggggatg gcgatgtcca gcgcggcggg gatcgcgtcg gtcaggtcgt 1175281 ggtgcgtgag tgcggaggcc aggcagatcg tagcgtcggg gcggcgcgtg gcggcctcga 1175341 tccgatccca gtcggcggtc gacgcgtcta cgggtaggta gatgccgcgg gcgatgcggt 1175401 cccagcggcc tgcctgcgcg ccgcggtaaa gcgcgctgcg cgaggccggc ccgccccgca 1175461 gtgctcggtg tcagggcttc cacggcgcct atcccactcg tctttggtac ggaacgtagg 1175521 cagataactc tatgtgtaga cgtttcgtat cgatgcctga ggaaatcggg aacaggcccc 1175581 gcgggcgcat ggctattggg agtacggcgg ggcactgaca ttgcgaggcc accgtcggtt 1175641 ggcgccggta gcatggggat ttgtcgatgc ttggtgaagg agcaaccgtg ggcggtgaga 1175701 cgcctaagaa ggtggtcgtc tcatggactg ctgtgaagaa cgcggggtcg cgcgccacaa 1175761 gggcctcagc caagttggaa cgccgggttg tccccgttgg tcacaagcgg tcagctgccg 1175821 ttgcagcgca tatcgagaag cagcggtcac cgcagtccag atgccgctga cgcccggcta 1175881 cggtgagacc ccgcttccgc acgacgaact ggccgcgttg ctccccgagg ttgtcgaggt 1175941 gttggacaag ccgatcacgc gcgctgatgt ttatgacctc gaacagggcc ttcaggacca 1176001 ggttttcgat ctattgatgc cgacggctgt tgaaggctcg ttgtcgcttg atgagcttct 1176061 cagtgaccat ttcgtccgcg atctccacgc gcgtatgttt ggtccggtat aggactgggc 1176121 cgggcggtgg tgacgacgtg aactcaacat cggtgttgca ccggagcagg tcgccgtcga 1176181 ggtacgcaac gcgctcgaca ccatcgcgta ccgctgggtg cacaccgatg attggaccgg 1176241 tcggcaactg ggtattgttg ttcatgcaga ccttgtgcga atccatccgt tcaccgatgg 1176301 aaatgggcgc accacaaggc ttctcgctga tttggtgtac gcgacggttc agaatcccac 1176361 cgagctgcag tatgactggg agctcgataa actgcgctta cgtcgaacta cttcgcggct 1176421 acgaccgaga ccgggacatt gcggcgctcg ccgccttcat cggtgtgcgg cccatcgaga 1176481 cataggcagg ctgtcttgtt gaagccggcg accgggcgac ccaagcggag gaggtaccgc 1176541 ggatcactgc ggtaccgtcg acgcggtggc aaccaggcat caacgggcgg ggattgacga 1176601 ccgctggcat aagcgggtca aagggccgga cgggaacagg cgaaccgtgc ggtctgctgt 1176661 ctgcggcagg gtttcgcgct ggcgcgtcag gtgggttgac ggcggcggag aggagcacag 1176721 caagagcttc cagcgcaaac ctgacgcgca ggtacctgac ccatgccgaa ctgttgatgc 1176781 tcgccagggc cacgggccgg ttcgaaacgc tcaccttggt gctcggctac tgcggcttac 1176841 ggcggtttac ggttcggtga ggctgttgcc ctgcggcgca agcatgtggg ggatcgcgtg 1176901 ctgaccgtcc gatcgtcccc tacggcggtg accggcaagg gcatcgttga gtcgacgacc 1176961 aagacgaagc gggatcgtca cgtaccagtg cctgagcctg tttggcgcag gctccatgcc 1177021 gagttgccca ccgacccgaa cgccttggtg ttccccggcc gtaagggcgg attcctgcct 1177081 ctcggtgaat accgctgggc attcgacaac gccggcgacc aggtcgggat cgaaggctgg 1177141 taccgcacgg tctggggcac accacggcct cgctggcgat cagcgcaggc gctaacgtca 1177201 aggtcgtgca acggctcctt ggacacgcag cagcggcgat gacgctcgac cggcacggcc 1177261 atctgctcaa cgacgatcta gcggtgtggc cgatgcgctg tgcaaagtca tcgagaacac 1177321 tgcggtatca ctgcggtatg cggagacgga acagagtcgg gctccgggca tgagatagcg 1177381 cgtctgaact gcaacgcccc catagcccaa ttggcagagg cagcggactt aaaatccgtc 1177441 aagtgtcggt tcgagtccga ctgggggcac ggggaaatcg ttgttggcaa gtcatggcgt 1177501 tgggcactgc tgctgctcgc cgctcaagcc agcaacccaa cctggcgata cgttggtttg 1177561 agcggggcga ctcccgtcgg gccacctacg ccccgcctgt tgctatggcc ggacaaggag 1177621 catcgcgatg agcgtggatt acccccaaat ggctgctacc cggggaagaa tagaaccggc 1177681 cccgcggcga gttcgcggct atctcggaca tgtgctcgtc ttcgacacca gtgcggcgcg 1177741 ctatgtctgg gaggttccct actacccgca gtactacatc ccgctggcgg atgtccgcat 1177801 ggagttcctg cgcgacgaga accacccgca gcgagtgcag ctgggtccgt cgcggctgca 1177861 ctccttggta agcgccggtc agacccaccg atcggcggcg cgggtattcg atgtcgacgg 1177921 cgacagcccg gtggcgggca ccgtgcgttt caactgggat ccgctgcggt ggttcgagga 1177981 ggacgagccg atctacggcc atccgcgcaa tccctatcag cgggccgatg cgctgcgctc 1178041 gcaccgacac gtccgtgtcg agctggacgg cattgtgctc gctgacaccc gatcgcccgt 1178101 tctgctattc gaaactggga tacccacaag gtattacatc gatccggccg acatcgcttt 1178161 cgagcatctg gagcccacct cgacgcagac gttgtgtccg tacaagggga cgacgtcggg 1178221 ctattggtct gtgcgcgtcg gcgacgccgt gcaccgcgac ctggcctgga cgtatcacta 1178281 tccactgccc gccgttgccc cgatcgccgg cctggtggcg ttttacaacg agaaggtcga 1178341 cctcaccgtc gacggcgtcg ccctgccgcg gccgcacact cagttcagct agtgcttggt 1178401 ttgttcgccg gttggcggcc gccagcatgg tcaacctcat ctagggcgtg ggtgtcgggg 1178461 cgcagcaggc tgccggcgat ctcgcggaca ccgtcttggc tgtgcccaat ctagattccg 1178521 atcggcctga gtcttcttct gccggcgcag cgcatcggcg cgggccacga ttgcatcgac 1178581 gtggacggcc agccggcgct gggtcatcga cggccagcga gccgccctga gagcgagctc 1178641 ggcggccacg gcgccaacac ctcaccgtcg acggtcagat cgctgcgaca cacgatcgtt 1178701 tgaaacatcc ggtagtcgat gtcgccggcg gtgaagactt ggccgaccat ggctaggcac 1178761 tcgcgcatac cgcagctggc tggccgttgg cccctggctg atccgcaagg ccgcaccgac 1178821 ctcagcgatc accgccgcct gtgaccacta accagtctca tcgaaaatat attcgataca 1178881 gccacttgcc gtcgacattg accatgaggc gttcacgtcg cagggccgac gaaatatgct 1178941 gagacctgcc tactcgtgtg caatgtgata ttagcctcat tttgatttga attatgagaa 1179001 tttcttattt cccagttatg gggagcgtgt gctggttgtt agcgaagtac gctaaaactg 1179061 cagttactgc tcatagcact ggtttgccac ataccccgta tcgggatacg tcatgatcgg 1179121 tatcctgagc ggaacataag tcggtcacgt gacctaggta acagcgtcta attcgtgaaa 1179181 tttttgatca gaatttggtc gctagactta ttccagccca gtatgaatca gcgcttttgg 1179241 tgccgaaatg cggcgaatcc cgggcagtcg gcgtcgcaca gcacggttgc tgtgctgtcg 1179301 caagcctgga ggcccgcaga cacagcaagc gaggagcggc gcgtatgagc cgcgccggcg 1179361 acgatgcgga acgaagtgat gaggaggagc ggcgcatgag cgttatgaac ggccgggagg 1179421 tcgctcgaga gagcagagat gcccaggtct tcgagttcgg caccgcaccg ggctccgccg 1179481 tggtcaagat tccggtgcag ggcggtccga tcggtggcat cgccatcagc cgcgacggca 1179541 gtctgctggt agtgaccaac aacggcaccg acaccgtctc ggtcgtcggc accgacacct 1179601 gccgggtcac ccagaccgtc accagtgtca acgaaccgtt cgcgatcgcc atgggcaatg 1179661 cggaagccaa ccgcgcgtac gtcagcacgg tgtcgtcggc gtacgacgcg atcgcggtca 1179721 tcgacgtggc cacgaacacc gttctcggca cccatccgct ggcgctcagt gtgagcgacc 1179781 tgacactcag cccggacgac aagtacctgt acgtcagccg aaatggcact cgcggtgctg 1179841 acgttgcggt gctggacacg acgacgggcg cactgatcga cgtcgtagac gtttcccagg 1179901 cgccgggcac caccacgcaa tgcgtgcgga tgagcccgga cggaagtgtc ctgtacgtcg 1179961 gcgccaatgg gccatccggc ggcctgctcg tcgtgatcac gacccgcgcg cagtccgacg 1180021 ggggacgcat cgggagtcgc tcgcgttcgc ggcagaagag ctccaaaccc cggggtaacc 1180081 aggcggcggc gggcttgcgc gtggtggcga ccatcgacat cgggtcatcg gtccgcgacg 1180141 tcgcgctcag ccccgacggt gccatcgcct acgtcgccag ctgcggctcc gacttcgggg 1180201 cagtggtcga cgtcatcgac actcgcaccc accagatcac cagctcgcgc gcgatcagcg 1180261 agatcggcgg gttggtcacc cgggtgagcg ttagcggcga cgcggatcgc gcctacttgg 1180321 tcagcgagga tcgggtgacc gtgctgtgca cccgtacgca cgatgtcatc ggcacgatca 1180381 ggaccggcca gccgtcgtgc gtggtcgaga gcccggacgg aaagtacctg tacatcgccg 1180441 actactccgg caccatcacc aggacagcgg ttgcctcgac catcgtgtcc gggaccgagc 1180501 agctggcgct acagcgccgc gggtctatgc agtggttctc gcctgagctg cagcagtacg 1180561 cgccggcgct cgcctagctc gaacgcgctt ctcgggggaa cccgtttctc atgacttctc 1180621 gcggcgatag cattcgcccg aggaggacat gaggcgcgcc gagacccgta aggcggtaca 1180681 tcgatgtacg gcacgatgca ggactttccg ttgacgatca ccgcgatcat gcgccacggc 1180741 tgcggtgtcc acgggcgacg cacggtcacc accgcgacgg gtgagggcta tcggcacagt 1180801 agctatcgcg atgtggggca acgagctggc cagctggcaa atgcgttgcg ccgcctcggt 1180861 gttaccgggg accagcgggt tgccacgttc atgtggaaca acaccgaaca cttggtgacc 1180921 tacttcgcgg tcccgtcgat gggcgcggtg ctgcataccc tcaacatccg gctcttcccc 1180981 gagcagatcg cctatgtcac caacgaggcc gaagaccgcg tcattctggt cgacttgtca 1181041 ttggccagac tgctcgcgcc ggtgctgccc aaactcgaca ccgtgcatac cgtgatcgcg 1181101 gtaggagagg gcgacacgac gccgctgcgg gaagctggca agaccgtgct gcgcttcgcc 1181161 gaattaattg acgccgaatc ccccgacttc gggtggccgc agatcgatga gaactccgcg 1181221 gccgcaatgt gttacaccag cggtactacc ggcaatccca aaggcgttgt atacagccat 1181281 cgttcgagct ttctgcacac gatggcggcc tgcaccacaa acggtatcgg ggtcgggtcc 1181341 agtgacaagg tgctgccgat cgtgccgatg tttcatgcca acgggtgggg gctaccgtat 1181401 gcggccttga tggcgggtgc ggacttggtg ctacccgatc ggcatctcga cgcccgctcg 1181461 ctgatccaca tggtggagac gctgaagccg acgttggccg gcgcggtgcc aaccatctgg 1181521 aacgacgtca tgcattacct agagaaggac cccgatcacg acatgtcatc gctgcgtctg 1181581 gtcgcctgcg gcggatcggc ggttccggaa tcgctgatgc gcaccttcga ggacaagcac 1181641 gatgtccaga ttcggcagct gtggggcatg acggaaacat cgccgctggc caccatggcc 1181701 tggccgccac ctggcacccc ggacgaccag cattgggcat tccgcatcac tcagggccaa 1181761 ccggtgtgcg gggtggagac ccggatcgtc gacgacgatg gccaggtgct gcccaacgac 1181821 ggcaacgccg ttggcgaggt ggaggttcgc gggccctgga ttgctggctc gtattacggg 1181881 ggacgtgacg agtccaagtt cgattccggc tggttgcgca ccggtgacgt cggccgcatc 1181941 gacgagcaag gcttcatcac cctgaccgac cgcgccaaag acgtcatcaa gtccggcggt 1182001 gaatggatct cctcggttga gttggagaac tgccttatcg cgcacccgga cgtgctcgag 1182061 gccgcggtcg tcggcgttcc cgacgagcgc tggcaggaac ggccgctggc ggttgtcgta 1182121 gttcgggaag gggccaccgt tagtgctggt gatctgcgag cattcctggc ggacaaggtc 1182181 gttcgctggt ggttgccgga gcggtgggcg tttgtcgacg agattccccg caccagcgtg 1182241 ggcaagtacg acaagaaggc catccgttct cgctacgccg aaggtgccta ccagatcacc 1182301 gaggtgcaca cttgacccgc gcgagcagac gcaaaatcgc ccattttcgt gtcgaaatgg 1182361 gggcttttgc gtctgctcgc gggtagaaag gtgaccatga gcctgcgggt cattcaatgg 1182421 gcgacgggat cggtcggtgt ggcggcgatc aaaggcgtgc tgcagcatcc cgaactcgaa 1182481 ctcgtaggct gctgggtgca ttcggcggcc aagagcggca aagacgtcgg cgaaatcatc 1182541 ggttcaccac cattgggcgt gatcgcgact aacagcatcg acgacgtttt ggcgctggac 1182601 gccgacgcgg tgatctacgc gccattgctg cccagcgtcg acgaagtcgc cgcgctgttg 1182661 cgttcgggca agaacgtggt cactccgctt gggtggttct atccgagtga aaaggaggcc 1182721 gccccactgg aagtcgccgc gcaggccggc aatgcgacgc tgcacggcgc cggaattggg 1182781 cccggggctg tcaccgagct gttcccgttg ctcctgtcgg tgatgtccac cggtgtgact 1182841 tttgttcgct ccgaagagtt ttcggatctg cgcagctatg gagcgccgga cgtgctgcgc 1182901 tatgtgatgg gtttcggcgg cacaccggac agcgcgttga ccggaccgat gcagaaaatt 1182961 ctggacgggg gcttcctgca gtcggtacgg ctgtgtgtcg accggttggg ctttgccgcc 1183021 gacccccaga tccgcacttc gcaggaggtg gcggttgcga ccgccccgat cgactcgccg 1183081 atcggagtaa ttgagcccgg acaggtggcc ggacgccgct tccattggga ggcgctggtc 1183141 gaggacacag tggtcgtcca gatcgccgtg aactggttga tgggatcgga aaatctggat 1183201 cccccttggt cattcgggcc ggccggagaa cgctacgaga tcgaagtgcg cggcagcccg 1183261 gacacctgcg tcaccatcaa gggttggcaa ccgcagaccg tggcggccgg cttgaagagc 1183321 aaccccggga tcgtggcaac cgcggcgcac tgcgtcaacg cgatcccggc aacctgcgcc 1183381 gccccggcgg ggatccagag ctttttcgac ctgccgctca tcaccggccg ggccgctccc 1183441 gggctggcac gctagagttg ctggcggcgt ccccggccgg gatgtcgaga atcggacggg 1183501 taatccaatg gcaaagtctg tcgtcgtcga gcaatcgcga gcgattccgg tgcaatccga 1183561 ggatgcgttc ggtggcacgc tggcggcagc gctgccggtg atttgttcgc actggtacgg 1183621 cctgatccca ccaatcaagg aggtccggga tcaaacgggt gcttgggatt ctgtcggaca 1183681 ggcccgtgtc atcacgatgg tcggcggcgg gcgcgtgcgc gaggagctga ccagtgtcga 1183741 cccgccgcgg tcgttcggct acacgctcac cgacatcaag ggcccgttgg cgccgctggt 1183801 cgcgttggtg gagggcaagt ggagcttcgc tcccgcggat accggaacca cggtgacctg 1183861 gcaatggacc atccatccta gatcggcgct ggccgcgccg gtgttgccgg tgttcgccag 1183921 gatgtggcgg ggctacgcgc gcggggtgct cgagaagctt tccgctttgt tggtgggctg 1183981 agcggcgctg ccggcttcgt ctaccgtcgg ggtcatgtgc cgactctttg gcttgcactc 1184041 cggaaccgat gctgtcaccg cgacgttttg gttgctgaac gcctcggata gcctggccga 1184101 gcaaagccga cgaaaccccg acggcaccgg ccttggtgta ttcgacgaac accaccagcc 1184161 gcggctacac aagcaaccaa tagcggcctg gcaagacgcc gacttcgcca ccgaagccca 1184221 cgagctgacc ggcacgacgt tcgtcgccca tgttcgctac gcgacgaccg ggtcgctcga 1184281 catccgcaat acccacccat tcctgcaaga cgggcggatc ttcgcacaca atggggtggt 1184341 cgaaggactg gatgtcctcg acgaacggct gcgcgaggtc ggcgccgatg acctggtgtt 1184401 gggccagacc gactccgagc gcgtattcgc tttgatcacc gcttcgatcc gcgcccggga 1184461 cggcaacgaa tcagccggtc tgattgacgc gctgaggtgg ctcgcggcga atgtgccgat 1184521 ctatgccgtc aacgtgttgc tcagcaccgc gaccgatgta tgggcactgc ggtatccgga 1184581 gtcccacgag ctgtatatct tggaccgccg cggcgacggt gcgcccgagt tccacttgcg 1184641 aagcaagcga atccgcgcac actcgacgca cttgcgcgaa cggtcgtcgg tggtgttcgc 1184701 gactgaaccg atggatgaca acccgcgttg gcgcctgctg gacgcggggg agctggtcca 1184761 cgtggacgcc gccctgcggg tcaacaggag tctggtgcta cctgatccac ccagacatcc 1184821 gattcgccgg gaagatctca gcgagccggt actgcatgcg caacacacgt cggcgtgaac 1184881 tcgtgacaac tagacgcgcg ctggtattgg ccggcggagg actggccgga atcgcctggg 1184941 aaacaggtgt tttgcgcggc atcgcggacg aatcgccggc ggcggcccgg ctgctactgg 1185001 attcggatgt gttggtcggg acatcggccg gtgcaacggt cgccgcgcag atcagcagtg 1185061 gctgcccgct cgacacgctg tacgaacggc agctcgccga gacgtcggcc gagatcgatc 1185121 ccggtgtcga catcgatgcc atcactgatc ttttcctgac tgccgtgacc gagccgcaca 1185181 tttcgacgcg ccggcggcta caacggatcg gtgccgtggc gttggcggtc gacaccgttc 1185241 cggagtccgt ccgccgtcag gtgatcgccc agcgcttgcc gtcgcacgac tggccggacc 1185301 gggtgttgcg ggtcaccgcg atcgacatcg ccaccggcga attggttgtt ttccatcgcg 1185361 agtcgaatgt ggcgctggtc gacgcggtgg cggccagttg ctcggtgccg ggggcgtggc 1185421 ctccggtgac aattgccggc cgccgctaca tggatggcgg ggtggccagc tcggtcaacc 1185481 ttggtgtcgc cgacgattgt gatgccgccg tggttttggt gcccgccggc gccgacgcgc 1185541 cgtcgccctt tggcggcggg gcggccgcgg agatcgcggc agccaccggc atggtgtttg 1185601 ccgtgttcgc cgacgacgac tcgttggcgg ctttcgggcc caacccgctg gatccgctct 1185661 gccgtgtgaa ctcggcgatg gccggacgtc agcagggccg ccgcgaagcg caagccgttg 1185721 ccaggctgct cggcgtttga tcagccctcg atggtcgcag cggcagattc gtcgtcgtcg 1185781 atctcgaatg cttccaaggc ttgggtggcc agcgcgcggc cgacggcgat cacctccacc 1185841 gcgcggtgaa attccaggct tcggcacgtt gaacgcggta cctcgatcag caggtcggcc 1185901 ggatagcccg ccagcgtatg gcgcgccagt gcggattggg cgatatcgat cgtccgattc 1185961 atcacctcga aactgcccat tttgggtagc ccgggtgtgt cagcggcttc ctcgcggtca 1186021 gctggtgggc cggctggacg ctgctcgatc tccggagctt gcgaccagga atccgattcc 1186081 gcggccgccg cgccgaagcg actcagcacc gcccgcgccg taggccggtc gagcagcgac 1186141 cgggcggcgc tgacgtcaaa cagcgcggaa gtgctgcgca ccatgcggtt caaccactcg 1186201 gcggtgacgt tgggctccgc atcgcgagcg gggccggcct cactgccgtt aaggctgacc 1186261 gcgatggtca ggtcggcgtt gaccccggcg atcggcgcca tcggcagtgg atccaggatt 1186321 ccgccgtcgg ccagcaggcg tccgtcgact tcgtgtgggg cgatcacccc gggtatggcg 1186381 atggacgccc ggatcgccgc gtcgaggggg ccgcgctgaa accacaccga cttgccggcc 1186441 agtaggtcgg tggccaccgc ggtatagggg atcggcagct gctcgatggc gaccgggccg 1186501 acgatgtcgc gcaccgcgtc gagaatcttt tctgcccgca ggatgccggc cgcgctaata 1186561 gacggatcca gcagccgcaa gatggtgcgc tgcgtcaggg acttggccca gtgggcgaac 1186621 tcgtcgagtc ggccggccgc atgcacccca ccgaccaccg cgcccatcga cgagccggcg 1186681 atcccaacga tgtcatagcc gcgctcccgc agcgcctgga tcactccgat gtgggcgtaa 1186741 ccccgggcgc cgccgctgcc gagcgccagt gcgacgcgcg gcgaagacga ccctcgcacc 1186801 cggagggcag ctggtgcggg catgctttca ttctgctcgg cgaggtgccc ttatcgggat 1186861 ccggccacta gtttcttgca cccctgatct caattgccga gcgttatccg cattccgcgt 1186921 tggcggcggc gcgcgccgcg acgatcacgg ccgcctgccg tgccggggtc agcgccgccc 1186981 agcggatgtg ccagctgccg gcaactccgg atggcgatgc ttggaccacc gccagatacg 1187041 gctcgattaa cgactcgccg gagccgggct gggcatccat ccacagcctt gcggcgtgac 1187101 aggcctggta atactcctcc tcggtcgatt ccgcgggagc gtcaaccctg gtggtcacgc 1187161 cggccgggga gacgccgacg acgcccgccg gcaacgtacc ggcaacgctt gacgaacgtc 1187221 cggctttgct gctgccgccg cgagagcagc cggcaacggc cgacaaccac gccagcgcca 1187281 aaaccattgc gcacagcagg ggggcataac ggctcgggcg caccgtccca atctatgcaa 1187341 gactgaccgc gtgatggagc gctacggatt ttgtgggtgt tgtcggccct gacctgccgt 1187401 ccgccctgtc cgttcgactc tttggagttc tcccgtggtt atgcctcttg tcacgccaac 1187461 caccgcggtt ccatcaccgg gacccacacg gctgcgtgta gccgatctcc tgcgcgccac 1187521 cgaccaagcc gcagacgacg tgcttggcgg gcgctgcgac cacctgctac ccgacggtgg 1187581 tgtcccgcag acgcagcgct ggtacacccg catccacggt gacgaggagc tggatatctg 1187641 gctgattagc tgggttcccg gtcaaccgac cgagctgcac gaccatggcg ggtccctggg 1187701 agcgttgacc gtgctgagcg ggtcgctcaa cgaatatcgt tgggacggcc gtcggttgcg 1187761 acggcgccgc ctcgatgccg gtgatcaggc agggttcccg ttgggttggg tgcacgacgt 1187821 ggtgtgggcg ccccggccga ttggggggcc tgatgcggcc gggatggctg tggcgccaac 1187881 cctgagcgtg cacgcctact cgccgccgct gacggcgatg tcgtactacg agatcaccga 1187941 acgcaacacg ctgcgccgcc agcgcaccga attgaccgac cagcccgaag ggtcgggatg 1188001 agccgaatcg accgggtgct ggaggccgct cgccgccggt atcggcgcct tgcggccgac 1188061 caggtgcccg aggcggcgcg gcgcggcgcg gtgctcgtcg acatccggcc ccaagcccag 1188121 cgggcccggg agggcgaggt gccaggggcg ctagtgatcg agcgcaacgt cttggaatgg 1188181 cgctgcgatc ccaccagcga cgcccggctg ccccaggccg tcgacgacga cgtcgagtgg 1188241 gtgatcctgt gctcggaggg ctacacctcg agcctggcgg cagcgtcgct gctggacttg 1188301 gggttgcacc gggccaccga tgtcgtcggt ggctatcgtg cgctggcggc cggcggcgtg 1188361 ctggccgagc ttggtggtgc cgtgggcggg tagtttggct cgccgctgct ggctgggtcg 1188421 ttactgcccc ggcgtgccgg cgttgccgaa gatgagtcct cgagttccgc cggcgccgcc 1188481 ggcgccgtcg agtccggcga tcaggccggc gccacccttg ccgccgttgc cgccgttgcc 1188541 gccgtcaccg accaactggg cgtcgccgcc cttgccgccg ttgccgccgt tgccgccgtt 1188601 gccgtcgaca ccgccggcgg cggcgcccag accgccttgg ccgccgcccc cgccgttgcc 1188661 gccggtgccg ccgccgccga gcaggccggc gccgccgccg ttgccgccgt gaccgcccgc 1188721 gtgaccgctg ccgccgttac cgccggcggc ttgaagcccg gtcggcgggt tggtgccgcc 1188781 gctgccgccg ctgccggcgg tgctgcccgt tccgccggcg ccgccggcgc cgccgccacc 1188841 gaacagcctg gcggccgatc cgccgttgcc gccgttgccg gcgttgccgg tgtccccgcc 1188901 gttgccgccg ataccgggat tgatggccag accgttgggg gtgtcgccgc cctttccgcc 1188961 ggcgccgccg gctccggcgc tgccgccgct accgccggcg ccgccgttgc ccgacagcca 1189021 gccggccgac ccgccggtgc cgccggcccc ggcgttgccg ccgacaccgc caccgccacc 1189081 gttaccacct agtgcggcgt tgagcccggt gccgccgtcg cccccggagt tgccggcgcc 1189141 gccggccccg ccgttgccat acagcagccc accgccacca ccggcgccgc cgccgccgcc 1189201 gtcgccgccg acaccacccg taccaccctt accggcggtg gccacgacat gttcgccgcc 1189261 ggcgccggcg gccgcgccgt tgccgccggc gccgccgtgc ccggcgttac ctccgtgacc 1189321 gaacagcacg gcgcctcgtc cgccgttgcc gccggcgccg gcggtgccgc cggtgccgcc 1189381 ggtgccgccg tctccaccga attggccgcc gttgccggca ccgccggcgg tgccgccgcc 1189441 gccgccgttg ccggcgtcac cgccgttgcc ggacagccag ccggccgacc cgccgtcgcc 1189501 accgcggcca ccggcgccgc ccgcaccacc ggcgccgccc ggttggctgg gtgggccggg 1189561 ggcgccggga ctggcttgtc cgccggcccc gccggcgccg ccgtcaccgc cggcgccgcc 1189621 gtggccgtgg atccagccgc cggcaccgcc cgccccgccg gcgccggcgt caccgccctt 1189681 ggtgccgctg gccccggcgc cggcaccgtt gccgccttgt ccgccgtcac cgccgacgcc 1189741 gccgacaccg ccgttgccga acaatccggc cgttcccccg gccccgccgg caccacccgc 1189801 gacgcccggc gcgccgatgg ctccggcggg gccggcgccg ccggcgccgc cattgccgcc 1189861 gctgccgtag agccagccgc cgttgccgcc cgcgccgccg ttggcgccgg ctccgccggc 1189921 ccctccgttg ccgccgttgc cgatcaaccc ggccgacccg ccggtacccc cggtgagccc 1189981 ggcggtggtt tgggaaaacc cgttgccgcc gttgccgtac aacaacccac cggcgccacc 1190041 gttgggattg gccgcggtcc catcggcgcc gttgccgatc agcggacgcc ccaacagcgc 1190101 ctgggtgggc gcgttgatca aacccagcac ctgctgctcg acattggtcg cctcggcgct 1190161 ggcatacgcg ctcgccgccc cggtcaatgc ctgcacgaac tgctcgtgaa acagcgctgc 1190221 acgcgcgccc agctgttgat actggccggc gtgggcggaa aacagtgccg cgaccgccgc 1190281 ggacacctca tcggcaccgg ccgcggccag caccgacgtc ggggccaggg cggccgcgtt 1190341 ggccgcgctg attgccgaac cgataccggc cacatcggcc gccgcggcca tcagctgcga 1190401 cggagacacc aacacaaacg acacggtttc ctctccctga tttgctgata tgtagttgcg 1190461 atgttaacta gcgcacaccg caactggggc ggttttccgc cattgtctgg tcgcacgtat 1190521 acatttttgt gaattctttg agcggaattg ctcgtgcgat ccggctacgt tttcgaggtg 1190581 agatctgggt gggcggcgat gccccgtgct tcgatgatca atttggggat ctgaaatgtc 1190641 aaatgtgttg acattcattg ggtgatcttt cgcgccaccc ggcgacgtca aatacttgga 1190701 cataagccac tcgtcgttgt gtgatacgtc gtcacaccgg atctggccgt gcgggtttat 1190761 tgcccgggcg tgccggggtt gccggagatc tgcccgcgac taccgccggc gcctccagtg 1190821 ccgttgattc cgggcatcag gccggtgccg cctttgccac cgttgccgcc gttaccgccg 1190881 ttaccgatca actgggcgtc gccgcccttg ccgccgtcgc caccgttgcc gccgttgcct 1190941 ttggcgccgc tgccggcgcc cagaccgccg ttgccgccgt cgccgccggt gccgccgctg 1191001 cctccgctgc cgagcaggcc ggcggtgccg ccgctaccgc cggcaccgcc cgcgtggccg 1191061 ttgccaccgt tgccgccggt gccgccgccg aagccgccgc caccgccggt gccgccggtg 1191121 ctgcccatcc caccggcgcc gccggcgccg ccgtcgccga acagcttggc ggccgatccg 1191181 ccgtggccgc cgttgccggc gttgccggtg tctgcgccct ggccgccgtt accgggatca 1191241 ataccgctgt tgccgttgcc gccttggccg ccggcgccgg cggtgccgcc gcctccgccg 1191301 gtgccgccgt tgcccgacag ccagccggct gacccgccgt tgccgccgtt gccggcgttg 1191361 ccgccgccgc cgccggtgcc ggcgtcgccg ccgttgccgg acagccagcc ggccgacccg 1191421 ccgtcgccac cgcggccgcc ggcgccgccc gctccgccgg caccgccggc accgccgttg 1191481 ccgaacaatc cggccgttcc cccggccccg ccggcaccac ccgcgacgcc cggcgcgccg 1191541 atggctccgg cggggccggc gccgccggcg ccgccattgc cgccgctgcc gtagagccag 1191601 ccgccgttgc cgcccgcgcc gccgttggcg ccggctccgc cggcccctcc gttgccgccg 1191661 ttgccgatca acccggccga cccgccggta cccccggtga gcccggcggt ggtttgggaa 1191721 aacccgttgc cgccgttgcc gtacaacaac ccaccggcgc caccgttggg attggccgcg 1191781 gtcccatcgg cgccgttgcc gatcagcgga cgccccaaca gcgcctgggt gggcgcgttg 1191841 atcaaaccca gcacctgctg ttcgacgttg gtcgcctcgg cgctggcata cgcgcccgca 1191901 cttgacgtca gggccagcgt gaactggtca tgaaacgctg ccatctgccg ggcgatcgcc 1191961 tgatagccct cgccatgtcc gctaaacagt gccgcaatgt gggccgacac ttcgtcggcg 1192021 gcggccggca acaacctcgt cgtcgcggcc gcggccgccc tcgttgaggc attgatcgac 1192081 gatccgatgc tggccaaatc cccagccgcc gagctgagca tgtctggcac cgcaatcatg 1192141 taggacattt cgcgcatctc cctcatcgcc gggcgacgga tatcgggacc ggagtcaacg 1192201 tgatggcgcg agtctaagca cgcccggaac ggaaatgcag agtgttcgac aaatctttcc 1192261 ccaagacatt tttattggtc gcacgatggg cgtcgtcgtc gagcggtatg gcagcaccga 1192321 tttgtcttcc aggggaatgt tcgtaccgtt tcatgacgtc gactgtgtcc aatagcttta 1192381 catttcccgt ttttatttgc tgatgatgtc taacacctag acaaacaccg tcttgtcgtc 1192441 catcgatatg ggctcgggct agccgccacg ccgacggcgc acgccaaacc ggccgacccg 1192501 ctgcccgccc tacgagccga agggcttggc gttggcgtgc agcaatggct gcagccgctc 1192561 cgtcttctgc tgtgtccagc cgggcggcga gagcaccgcg gcccagccgt cggccacggt 1192621 ggcgacgtag cggtgaccat gcccgtccgg cacatgcgta gccaccgcca tatcggccga 1192681 aacctggacg aacgtcacca ccgggatcca gcgtgtctgg ggaaggacgt cgtagccccg 1192741 ttgctcccgc agccagtccg gctcccgaaa cagcaggcgg ggagtccacc aggcgatcgg 1192801 atcagaggca tgctgcagat acaccacccg cggtctgccc cacggcgcat cagggcgttg 1192861 caggtcgcgt gcgcgggcca cgaaacgcac gttgcggccg tcgtcgtaga tgggcagcca 1192921 ctgcggtgat ccggcatcgc ggttcgcagt caaggagttc caaacggtgt tgttgaacgt 1192981 cggtccgctg aacaacgcgc cgtcggtgcg ggcgaggatg ttgttgaggt tcatgaacgg 1193041 cgcttcaccg ccgaacgatc ccaggctctc gccgaacacg accagcttcg ggcgctgcga 1193101 ctcgggcagt tgacggatca gcttgtcgac cgcctcgaac agcgcctcgc cggcgtgccg 1193161 ggcattctcc ttgtccacca ggaaagacag ccagctcggc aagaacgaat actgcatgct 1193221 cacgatcgcg gtatcgccgt tgtacatgta ctccagcgcg gaggcttccg cctcgttgat 1193281 ccaaccggtt ccggtgctcg tggccactgc cacaacggcg cggcgcaagc caccggtgcg 1193341 cgctagctcg cgcgccgcca gctccgcggt ggccatgatg ccgtccgccg agttcaaccc 1193401 cgcataggtt cggatcggct cgacggccgg ggtgccgttg aacgcggtga ggtcggcgat 1193461 ggtgggaccg ctgtggacga aaattcggcc ctgatggccc agcgactccc acgacaccag 1193521 cgatcccggg ccacccgatc gcagcggggt tttcggcggt gccgaatccg gattcatctc 1193581 attgttgacc gcagcgaacg tgctgttcat ggaattcatc gcgaacttga gcaccacacc 1193641 gttgagcagt gtgatggtca gcaccacgag cagcaccacc acaatggccg ccgaaactcg 1193701 gaatggcgca atgcgatcga cctgtcccac cagaaaacgg aacagccatc ggatgaactg 1193761 gccgatttcg accagcgtga acagcacgac cagcgacaat gcggcggcca gcgggtagtc 1193821 gtaccaccgc aggtgctcga cacccattag gtcgcgcaca tcgtcttgcc agacatgaaa 1193881 ctgcactgcc atacccacca tgccgaccgc gccgactgcg atcagcggcg gccacgccca 1193941 gcgtggtggc ggcgggctgg aattgtgcga gcgcatgtag cggaccagcc agacggcgaa 1194001 gactcccaag ccgtatccga aggcgccgca gattccgctg accagtccct gaaacagcgg 1194061 accacgcggc agcagcgacg gcgtcatcga gaaccacacg aaaacgaggc ccatcgcggt 1194121 gccggtgaat gtgtagtggc gaatccacca agtgctgcgg atcggttgcg gttcaggggt 1194181 ttgtggagtt gctgcggtgt cgaccgcctg ctcagcgccg gtagctggtt cgtcgctggc 1194241 gttggtggtc gtcgctgcag ccggttccgt catcggtggg tgaactgggg agcgcgtttc 1194301 tcgatgaacg ctgccatacc ttcggattgg tcttcggtcg cgaaagccga atggaaaagc 1194361 cggcgttcgt agagcagccc ctcggacaaa ctggattcga aagcccggtt gacggcctcc 1194421 ttggccatcc gggccgccga ggccgacatc tgcgaaatgg tcgtggcagt ggccctggct 1194481 tcggtcagca agtcgtcggc cggcaccacc cgtgaaacca gaccgctgcg ctcggcctcg 1194541 gcggcgtcca tggtgcgccc ggtcaggatg aggtccatcg ccttagcctt gccgatagcc 1194601 cgggtcagcc gctgggagcc gcccatgcct ggcagcacgc ccagctttat ctcgggctgt 1194661 ccgaacttcg cggtgtcggc ggcgatcagc acgtcgcaca tcatcgccag ctcgcagcca 1194721 ccgccgagcg cgtatcccgc caccgcggcg atcgtcgggg tgcgcacggc ggccagcttg 1194781 ccccaggtgg cgaagaagtc ggcggtgaac gcgtcggcga acgtcaggtc ggccatttct 1194841 ttgatgtcgg ctccggcggc aaacgctttg gccgaaccgg tgatgatgat cgccccaatg 1194901 tccgggtcat cgtccagttc ggttgcagcg ctggtgacct cgttcatcac ctggctgttg 1194961 agcgcgttca gtgcctgggg acggttcagc gtgataatgc caactcgctg atcgcgctcg 1195021 accaggatgg tttcgtacgt catgcgctac ctctctagaa actcaagtca tcgtcgaccg 1195081 gttcgaaata ggcttcgatg tcggccgccg tgatcgcgtc cagggttgcc ggcgaccagt 1195141 tcgggttgcg atccttgtcg atcaactgcg cgcggatgcc ctccaccagg tcatgcgagc 1195201 gcagcgacgc cgatgacacc cgatagtcct ggatcaacac gtcttctagc gtgtcgagtt 1195261 tggcggcgcg acgcactgcc tgcaacgtca ccgacagcgc gatgggggag cggctggcaa 1195321 tcaggtcgga agcatttacg gctggttcgc cgccctgttt ccgcagcgcc gcaacgatgt 1195381 cggcgacgct gtcgccggca tagcattcgt cgatccaatc acgttgggcg gcaagcgtgc 1195441 tcggtggagg ttcgacggcg tgggcggcca atgcgctctc cacgccgccg gtgacgatct 1195501 tctgcgtgaa cgcatcgagg tcgccgtgtg gcacgaagtg gtcggcgaat cccagcgcga 1195561 tggcgtcggc gccggaaaac ggcgctccag tcagggcggc gtgcagaccc agcgcgccgg 1195621 gtgcacgcga cagcaaatac accccgccga cgtcggggat gaacccgatg cccacttcgg 1195681 gcatcgcgac cttggaggta tcggtaacca cccgggtgtt cgcgtgtgcg ctgacgccga 1195741 cgccgccgcc cattacgatg ccgtccatca acgccacgta gggcttggcg aaccggccga 1195801 tcagggcgtt gagcagatac tcgtggcgcc agaaccgccg cgcctcgacc ccgtccttgc 1195861 gggcactgtg gtagacggcc accacgtccc cgccggcgca aagtccgcgt tcgccggctc 1195921 cggagagcac caccgcgtgc accgcgtcct catgctccca gctcatgagc actgtggcca 1195981 gcaggtcgac catggtttgg ttcagtgagt tgatcgcctt ggggcggttg agcgtcacga 1196041 atccgacacc gccctcgacg tttgtcagga cctcatgcga ttcgccggtc acgggcctcg 1196101 cctcccctga agagtttgac cagcaatcta gatcgtggct cgcccagcgg tgcccgcggg 1196161 ggctaaggtt tatcgtgtac ccggatgaca acgctggccg ggaacccggg cctactactg 1196221 atcgttgagc ggatgttcgc acagctcgta gccatagcca tcaagagagg atccgacggt 1196281 gcgggagaca agcaacccgg tatttcgttc gttgcctaag cagcggggcg gatacgcgca 1196341 attcggaact ggcaccgccc agcagggatt cccagccgat ccctacctgg cgccctatcg 1196401 ggaagcaaag gccacccgcc cgctgaccat cgacgatgtc gtgaccaaga cgggcctgac 1196461 gctggctatg ttggcgggca ccgccgtcgt ctcctacttc ctggttgcgt cgaacgtcgc 1196521 actggccatg ccgctgacct tggtgggggc tttgggtggt ttggcgctgg tgctggtggc 1196581 caccttcggc cgcaagcagg acaacccggc gatcgtgctc agctacgcgg cgctcgaggg 1196641 cctgttcctg ggtgccatct cgttcgtctt ggctaacttc acggtggcgt ccgcgaatgc 1196701 tggggtgctg atcggggagg ccatcttagg gacgatgggt gtgttcttcg gcatgctcgt 1196761 cgtctacaag acaggggcca tccgggtcac ccccaagttc acccgaatgg tggtcgctgc 1196821 gctgttcggc gtgctggtct tgatgctcgg caacctcgtg ctggcgatgt tcaatgtcgg 1196881 cggcggtgaa ggcttgggct tacgcagccc cggaccgctg gggatcatct tctcgctggt 1196941 gtgcatcggc atcgcggcgt tcagcttcct gatcgacttc gatgcggctg atcagatgat 1197001 tcgcgcggga gcaccggaga aggcggcatg gggcgtcgcg ttaggcctga ccgtaacgct 1197061 ggtctggttg tacatcgaga tcctgcgcct gctcagttat ctacagaatg agtagcgctc 1197121 gttggccgtt gattctgcgt ccaccaggct gaccactcgc acttttgcgt ggtagacgca 1197181 ggatcaacgg ctgtgtcggt gggtgctgac accatgcccg catgcgggag atgggggcgc 1197241 agccgttcat cggcagcgag gcgttggcgg cgggactcat cagctggcat gagctgggca 1197301 agtactacac cgcgatcatg cccaacgtct atctggacaa gcggctgaag ccctccctgc 1197361 ggcaacgcgt tatcgcggcc tggctgtggt cgggccgcaa aggggtgatc gccggcgctt 1197421 cggcatcagc gctgcacggc gcgaaatggg tcgatgacca cgcattggtg gagttgatct 1197481 ggcgcaacgc cagggcgccg aacggggtgc ggactaagga tgagctactg ctcgacggcg 1197541 aagtccagcg cttgtgcggg cttactgtga ctaccgttga acgtacggcc ttcgacttgg 1197601 gcaggcgtcc acccttaggt caggcgataa ccagactgga tgcgcttgcc aatgccaccg 1197661 atttcaagat caacgatgtt agggagctcg cgaggaagca cccccatact cgcgggctgc 1197721 gtcaactaga caaggcgctg gatctcgtcg acccaggtgc gcagtcgccg aaggagacgt 1197781 ggctgcggct cttgctgata aacgccggct ttccacggcc gtccactcag atccccttgc 1197841 tcggcgtcta cgggcatcca aagtatttcc tcgacatggg atgggaggac atcatgctcg 1197901 cggtcgagta cgacggcgag caacaccgtc tcagccgaga ccagttcgtc aaagacgtcg 1197961 aacgcctgga atacatccgg cgcgccggct ggactcacat cagggtgctg gcagaccaca 1198021 agggacccga cgtcgtccgc cgggttcggc aggcttggga cacgttgaca tcacgacgtt 1198081 gactctgcgc ccaccacgtg tcctactcgc acttttgcgt ggtggacgca gagtcaacgc 1198141 actcgagcgc ctcgctcacg cgaggcgctc gatcaccatc gccatgccct ggccgccacc 1198201 gacacacatg gtttccagac cgaacgtctt gtcgtaggtc tgcaggttgt tcaacagcgt 1198261 ggtggtgatg cgcgcgcccg tcataccgaa cgggtgacct agggcgatcg cgccacctga 1198321 gatgttgagc ttgtcctcgt cgatgcccag ctcgcgcgcc gagcccagga cctgcaccgc 1198381 gaaggcctcg ttgatctcga ccaggtcgat gtcggtgatc gccatcccgg ctctttccag 1198441 cgccttcttg gacgcctcga tcggccctaa gcccatgatc tccggggaca gcccgctgac 1198501 cccggtggac acaatgcgcg ccagcggtgt caagcctaat tccttggcct tggtgtcgct 1198561 ggtgatcacc accgcggcgg ccccgtcgtt gagcggacag gcattccccg cggtcacggt 1198621 gccattcggc cggaaagccg gcttgagctc gctgaccttt tcgtaggtgg tacccggtcg 1198681 cgggccgtcg tcggtgctga ccgtggtgcc gtccggaagg gtgaccggcg tgatttctcg 1198741 ttcgaagaac ccgttcttga tcgcctcttc ggcccggttc tggctgcgca cgccccagcg 1198801 gtcctgttct tcgcggctga tgccggtcat gatggcgacg ttttccgcgg tctggcccat 1198861 cgcaatatag atgtccggca gcttctgatc ggtgcgggga tcgtgccatt cgtcggcgcc 1198921 ggcggctgcc gcggccgaac gttcctgagc cccgtcgaac agcgggttct tggtgtccgg 1198981 ccaggagtcg gagtttccct tggcgaaccg ggagacggtt tccacgcccg cggagatgaa 1199041 cgcgtcgccc tcaccggcct tgatcgcgtg gaaggccatc cgggtggtct gcagcgacga 1199101 cgaacagtac cggttgaccg tggtgcccgg caggaagtca tagccgagcg cgacggcgac 1199161 gacacgggcg atgttgaaac cggactcacc gcctggcagg ccacagccca tcatgaggtc 1199221 gtcgatctga tgggggttca gtgccggaac cttgtcgagc gcggcgcgca ccatctggac 1199281 ggccaggtcg tcgggccgca tgccgaccag cgatcctttc atggcccggc caatcggcga 1199341 gcgggcagtc gagacgatga cagcttctgg catgacggct cccggcatgg acaagacgtg 1199401 gtgaagttta ggtcaaatgt agtcgctacc caccggtcgg cacggcccgg gccggccggg 1199461 gccgccgcag ccgcgacatc atgctgtgtc gcgtgtggcc cggctcgagg gtggccgttc 1199521 caggccggga cggcgtttca tgaattggga tatcgagctt ttcggtcagc gcatcgcgca 1199581 gcgcaaggaa caacagatcg gccgccaggg cgtacgcggg cgccgacggg tggtagcggt 1199641 cggcggagaa catcagctcg ggcattgccc ggaatttggg agccagtaga tgtcctagcg 1199701 gcaccggcac cccaccggcc gccttgacgg ctgccgtttg ggcgcgggcc agccgcacac 1199761 cacgggtgtg cgctagcgcg cgcagcggct gcgggatggc ggtaatgacg ccgaggtcgg 1199821 ggcaagtgcc gaccaccact accgctccgc gggtgcgcaa cctgcgtacg cagtcggcca 1199881 gccgttgcgc agaggggcca atgccgttga gtgccgttat gtcgttggcg ccaatcatga 1199941 ttaccgccgc atccggcggc ggaccgacca cgaacatcgc atcgacttga ccgcagacgc 1200001 ctttcgaggt ggcgccgacg atggctttgg tgctcagccg gatccgcttg ccggtctgct 1200061 cggcgagtcc gcgggcgatc aacacgcccg gtacttcctc agcgctagcg cagccgtatc 1200121 ccgtcgccgt cgagtcacca aagatcatca ggtgcacgtc gaagggcact tcgcgtcgcc 1200181 accgttgcac gggcccaccg ccgcgggtgt atacgccgtc ggcgcggggc ggtgcgtcga 1200241 aggatttggg aattaccgtg cgcgcgtggg tcgcctgacc gaccagcagg ttgcgtgcgc 1200301 ccagataggc cgtgcccgtc gaggcgagtg cacccgcggt ggccaaagcg atcgtggaac 1200361 gccgtggcac gcgcatgctc acgggatcag tttaggacgg ttgtgccgat ttcgtggata 1200421 gctgacgaac aaacccgtca cggtgtggac caaatgtggt atcgaatcag actctttggc 1200481 tgtggcacct aaaaaagact gtcaagctaa gttcgcgggg ttggctgagc cagaggctca 1200541 gccgcttcgt cacatgctgt atcggactac aacggcgtag gaagtgttgg gcatgactgc 1200601 acccagtaag gtatccggct cacccagagt tgtcatttcg ccgcgcgacg tgttgaaggc 1200661 acgtagactc gaggcacgca agtttgcgat cagcgacggc gccccggtgg aggtcgtcga 1200721 gtctggtcca agtcttgttg cgcgattagc tgcgctggcg tcacgagtgg cggtccggcc 1200781 ggtgctagcg gtcggtagct atcttccgca tgcgccctgg ccgtggggtg tcatcgacca 1200841 ggctgcccgg gttctgctcc cagcgtcaac gaccgtaagg gccgcggtga gcctgcctaa 1200901 tgcgtccgcc caactggttc gggcgtcggg tgtgttgccg gcggacggca ctcgacgcgc 1200961 cgtcctgtac ctgcacggcg gcgcgtttct gacgtgtgga gcaaactcgc atggacgact 1201021 cgtcgagttg ctctctaagt tcgctgactc gcctgttctg gtggtcgact atcggttgat 1201081 tcccaagcac tcgatcggga tggcgctcga cgactgtcac gacggctacc ggtggctgag 1201141 gctgttgggc tatgagccgg agcagatcgt gctagcgggc gattccgcgg gcgggtatct 1201201 tgcgctcgct ctcgcgcagc ggctacagga agtgggggag gagccggcgg ctctagtcgc 1201261 gatctcgcca ctgctgcagc tagcaaagga acacaagcag gcgcatccca acatcaaaac 1201321 cgatgcgatg ttcccggcaa gggcgttcga tgcgcttgac gcattggttg ctagcgcagc 1201381 agcgaggaac caggtagacg gcgaacccga agagctctat gagcccttgg agcacatcac 1201441 accggggctg ccgcggacac tgattcacgt gtcgggctcc gaggtattgc tgcacgacgc 1201501 tcagttggcg gcggccaaac tggcggcggc cggggtgccg gccgaggtcc gggtatggcc 1201561 gggccaggtc cacgactttc aggttgcggc gtcgatgctg cccgaggcga tccgctcgtt 1201621 gcgtcagatc ggggagtaca tccgcgaggc caccgggtag cgggatgccg acggagcgcg 1201681 tgtgcctggc cggcaggcgc ctgagacgat gaacgcatgc ggatcgcgca acatatcagt 1201741 gaactcattg gtggtacccc actggttcgg ctgaactccg tggtacccga cggcgccgga 1201801 accgtggccg caaaggtcga gtatctcaac cctggcggca gctccaagga tcggatcgcg 1201861 gtgaagatga tcgaagccgc cgaggccagc ggtcagctga agccgggtgg caccatcgtc 1201921 gaacccacgt ccggcaatac cggcgttggt ctggcgttgg tcgctcagcg ccgcggctac 1201981 aagtgcgtgt tcgtctgccc ggacaaggtc agtgaggata aacgcaatgt gttgatcgcc 1202041 tacggcgccg aggtcgtggt gtgcccgacg gcggtcccgc cgcacgatcc ggccagctac 1202101 tacagtgtgt cggaccggtt ggtccgtgat atcgacggtg cctggaagcc cgaccagtac 1202161 gccaacccgg agggaccggc aagccattat gtgaccaccg gcccggaaat ctgggccgat 1202221 accgagggca aggtcaccca tttcgtggct ggcatcggca ccggcggtac catcaccggc 1202281 gctggccggt acctcaaaga ggtgtccggg ggccgagtac gcatcgtcgg cgccgacccg 1202341 gagggatcgg tctattcggg cggtgccggc cgaccgtatc tggtcgaggg ggtcggcgag 1202401 gatttctggc cggcggccta tgacccgagc gtgcccgacg agatcatcgc ggtgtccgac 1202461 tccgactcgt tcgacatgac caggcggctg gcccgcgaag aggcgatgtt ggtcggcggg 1202521 tcgtgcggga tggcggtggt tgccgcgctc aaggtcgccg aggaagccgg gcccgacgcg 1202581 ttgatcgtcg tcctgttgcc cgacggcggc cggggctaca tgtcgaaaat cttcaacgac 1202641 gcgtggatgt cgtcctatgg gttcctgcgc agccgccttg acgggtcgac cgagcaatcc 1202701 accgtcggtg atgtgttgcg ccgcaagtcc ggcgcgctgc ccgccctggt gcacacccat 1202761 ccgtcggaga ccgtgcgcga cgccatcggg attcttcgcg agtacggggt gtcgcagatg 1202821 ccggtggtcg gcgccgagcc gccggtgatg gccggcgagg tcgccggtag cgtctcggaa 1202881 cgcgagctgc tctcggccgt gttcgagggc cgcgccaagt tggccgacgc cgtgtcggca 1202941 cacatgagcc cgccgctgcg gatgataggc gccggtgaat tggtcagtgc ggccggcaag 1203001 gcgttgcgtg attgggatgc gttgatggtg gtggaggaag gcaagccggt tggggtcatt 1203061 acccggtacg acttgttggg cttcttgtcg gagggggcgg gacggcggta gtcgcgcagg 1203121 caggcgcgcc gcaatttagt tcggctacaa acaattacgg caggcggcca gtgccgcaca 1203181 ggtcgtgggc actgacccat tgggccccgt ggctcatctc accgccgggc gttccggtga 1203241 atccggtcct caggtactgt agtcccgcct agttcaccct agttcagctg aacctcagtg 1203301 gaaggtgtgc ccatgaccga acagccgccc cccggcgggt cgtacccacc gcccccgcca 1203361 ccgcctgggc cgtccggtgg gcatgagcca cctcccgctg caccacccgg cggcagtggt 1203421 tacgctccgc cccctccgcc ctcgagcggc agtggctacc cgcctccgcc gccaccgcct 1203481 ggcggggggg cctacccgcc gcctccgccg tcggccggcg gttacgcgcc gccgccgccc 1203541 ggaccggcga ttcgtacgat gccgaccgag tcctacacgc cgtggattac ccgggtgctg 1203601 gcggcattca tcgactgggc cccatacgta gtgctggttg gcatcggttg ggtgatcatg 1203661 ctggtcactc agacgtcgtc gtgcgtcacc agcattagtg agtacgacgt cggccagttc 1203721 tgcgtttccc agccgtcgat gatcggccag ttggtgcagt ggttgttgtc ggtgggcgga 1203781 ttggcttacc tggtctggaa ctacggctat cgccagggca ccatcgggtc gagcatcggc 1203841 aagtcggtgc tgaagttcaa ggtggtcagc gagaccaccg ggcaaccaat cggcttcggg 1203901 atgtcggtgg tacgccagct tgcccacttt atcgacgcga tcatctgctt cgtcgggttc 1203961 ctgtttccgc tgtgggacgc taaacggcaa acgttggcgg acaagatcat gacgacggtg 1204021 tgcgtgccga tctgatccgg gactgcactg cccacccgac cgtccgatga gcgaagaccg 1204081 cacgggacac cagggaatca gcggaccggc cacccgcgcc atccacgctg gctaccgccc 1204141 ggatccggcg accggggcgg tgaacgtgcc gatctacgcc agcagcacct tcgcccaaga 1204201 cggcgtcggc ggtctgcgtg gcggtttcga atacgcacgc accggcaacc ccacccgggc 1204261 cgcattggag gcctcgctgg cggcagtcga ggagggtgct ttcgcgcggg cattcagttc 1204321 cgggatggcc gcgaccgact gcgccctgcg ggcgatgtta cggcccggag accacgtcgt 1204381 cattcccgat gacgcctacg gcggcacatt ccggttgata gacaaggtgt tcacccggtg 1204441 ggatgtccag tacacgccgg tgcggcttgc cgatctggat gcggtgggtg ccgcgattac 1204501 tccgcgcacc cggctgattt gggtggagac gcccaccaat ccgctactgt cgatcgccga 1204561 tatcacggcc attgccgagc tgggcacaga cagatcggca aaagtattgg tggacaatac 1204621 ctttgcctca cccgcgttgc agcagccgtt gcggctgggc gccgatgtgg tgttgcactc 1204681 gactaccaag tacatcggcg gccattccga cgtggtggga ggtgcgctgg tcaccaacga 1204741 cgaagagctg gacgaggagt tcgctttctt gcagaacggc gccggcgcgg tgcccggacc 1204801 attcgacgcc tacctgacca tgcgcggcct gaagaccttg gtgctgcgga tgcagcggca 1204861 cagtgaaaat gcctgtgcgg tagcggaatt cctcgctgat catccgtcgg tgagttctgt 1204921 gttgtatccg ggtttgccca gtcatcccgg gcatgagatt gccgcgcgac agatgcgcgg 1204981 cttcggcggc atggtttcgg tgcggatgcg ggccggtcgg cgtgcggcgc aggacctgtg 1205041 tgccaagacc cgcgtcttca tcctggccga gtcgctgggt ggggtggagt cgctgatcga 1205101 acatcccagc gccatgaccc atgcgtcgac ggccggttcg caattggagg tgcccgacga 1205161 tctggtgcgg ctttcggtcg gtatcgaaga cattgccgac ctgctcggcg atctcgaaca 1205221 ggccctgggt taactaccgc gagcagacgc gaaagcaccc caaaaccgcc ggtttggggg 1205281 cttctgcgtc tgctcgcggg tacctaggag tggtacggct cggcgctgac tagggtcacc 1205341 gacacggtgc tgccgttggg caccgtgtag ctgcgggtct cgccgacctt ggcgtcgatc 1205401 agggccccac cgagcggtga attcggcgag tagacctcga gcttgccgtc gctgacgccc 1205461 tcctggcggg tggcgatgag gaacgtttcg ctgtccgact tgtcgccgtt gtagtacacc 1205521 ttgaccacag aaccgggtaa tgcgacgccg gattgcttgg gtgcctcgcc aacctttgcg 1205581 ttgctgagca agtcctgcag ctggcgaatg cgggcctcct gctggccctg ctcctcgcgg 1205641 gcggcgtggt atccgccgtt ctcgcgcagg tcgccttctt cgcggcggtc gttgatttcg 1205701 gcggcgatga ccgggcgatt cgcaatcagc tggtcgagct ctgctttgag tcggtcatgt 1205761 gactcttggg tcaaccaggt gacttgagta tccgtcatct cgtcgcgctc ctcgtgttgt 1205821 cgttcccgcg tagtcgggca agtttcggat ccctgccagc agcactgtcg ggaatatttg 1205881 gggtctcacc ccgggttgcc gccgctccgt tctgcgtacg gccgttaatg cagcaataca 1205941 cggccccggc aggaccgtgc atcgatccat gctaccacca cggtcagggg aggcgcaggt 1206001 agctgggcac ttcggtgcca caaccgtata cgtccgccat caccggcggc tgggaggatt 1206061 tcacggtcgt cgtcacctgc acggtggttg cctcggacgg tgggactagc agctcacgtc 1206121 tgccggtctc gctgccgttt gttgcccgaa ctcgcacgat gcaggccacc ggtcgggacg 1206181 ggtccgaacg tgtcacgctg atggtgaccg atgccgtctc gtcgtcgacc agtcgatagc 1206241 ccaccagcga accggtgacg gcgctggtgc tgatccgttg gtagccgatg acggcaatga 1206301 cgatgccggc cgcggcgacc agcaccccca gggcgatcgc gacacggcgc cgcgctcggc 1206361 gggacagtcg cgggcgtccg tagcgggcgt ctggtcgcgg aatgggggtg tgggtcatgc 1206421 ctgggttcac gccggcggga tgcaacgctt cgacaaaccg gaattatagg gtcacttata 1206481 ggcttaaggg ggcagccagg cggacggaca agggggcacg tgagcgaact gcggttgatg 1206541 gcggtgcacg cccaccccga tgacgagtcc agcaagggcg cggccaccct ggcgcgctac 1206601 gccgacgagg gtcatcgcgt gctggtggtg acgttgaccg gtggtgagcg cggcgagatc 1206661 ctcaacccgg cgatggacct gccggacgtg catgggcgca tcgccgagat ccggcgtgac 1206721 gagatgacca aggcggccga gatcctcggt gtcgagcaca cctggctggg cttcgtcgac 1206781 tccgggctac ctaagggtga tttaccgcca ccgctgcctg atgactgctt cgcgcgggta 1206841 ccgctggagg tgtccaccga ggcgctggtg cgggtggttc gcgagtttcg gccgcacgtg 1206901 atgaccacct acgacgagaa cggcggctac ccacatcccg accacattcg ctgccatcag 1206961 gtttcggtgg ctgcctacga ggcggccggt gacttttgcc ggtttcccga cgcgggtgag 1207021 ccgtggacgg tgtccaagct gtactacgtc cacggcttcc tgcgggagcg gatgcagatg 1207081 ttgcaggatg agttcgcccg gcacggccaa cgcggcccat tcgaacaatg gctggcgtac 1207141 tgggaccccg accatgactt tctcaccagc cgagtgacca cccgggtcga gtgctcgaaa 1207201 tacttcagcc aacgcgacga tgcgttgcgc gcgcatgcca cccagatcga cccgaacgcc 1207261 gaattcttcg ccgccccgct tgcctggcag gagcggctgt ggccgaccga ggaattcgag 1207321 ttggctcgct cgcgtatccc cgcgcgccca ccggagaccg aattgttcgc cgggatcgag 1207381 ccgtgaacca gattctgctc agcgtgattg ctgagggcgg gcccggtaac accggacccg 1207441 atttcgggaa ggctagcccg gtggggttgc tggtgatcgt gctattggtg atcgccacgt 1207501 tgtttctggt gcgttcgatg aaccagcaac tgaagaaagt tcccaagtcg ttcgaccggg 1207561 atcaccccga gctcgaccag gcagccgacg agggcaccga ccgcgacgga ccggcccgac 1207621 caccgggacc cccgcatgag tccggctaat ccgtccggga cgaataccct cgcgctggcc 1207681 accagcccgt acctgcgcca gcacgctgat aacccggtgc actggcagca gtggacgccg 1207741 caggcactgg cggaggcggc cgcgcgcgcg gtgccgatcc tgctgtccgt cggctacgcc 1207801 gcctgccact ggtgtcacgt catggcccac gagtcattcg acgacgacga ggtggccgcg 1207861 gccatgaacg cgggcttcgt ctgtatcaag gtcgaccggg aggagcggcc cgacatcgac 1207921 gcggtctaca tgaacgccac cgtcgcgctc accgggcagg gcggctggcc gatgacatgc 1207981 tttctcaccc ccaacggccg gccgttcttc tgcggcacct actacccgaa agcggctttc 1208041 ctgcaacttc tttcggccat atccgaaacc tggcgggaac gccgcgctga ggtggagcag 1208101 gcatctgacc atatcgctgc cgagttgcgc tcgatggctt cggggctgcc cgggggtggc 1208161 ccggaggtgg cgccggagct gtgtgacgac gcggtggcag gagtgctgcg tgagcaggac 1208221 acggcgcacg gcggatttgg cggtgcgccg aaattcccgc cgtcggcact gctggaagcg 1208281 ctaatgcggc actacgagcg cacccgatca ccggcggcgc tggaggcggt cgcacgcact 1208341 ggaaacgcca tggcccgtgg cggcatctat gaccaactcg gcggcggttt cgcccgatac 1208401 agcgtcgacg gtgcctgggt ggtaccgcat ttcgagaaga tgctgtacga caacgcgctg 1208461 ctgctgcgcg cctacgcgca ctgggcccgc cgtaccgggg atccgttggc ccgccgggtc 1208521 gccgcccaga ccgcgcgatt tctgctcgac gagttgggca gcaaagcacc ggccgacatg 1208581 ttcacctcgt cgctggatgc cgacgccgac ggccgcgagg gttcgaccta cgtttggacg 1208641 ccggtgcaac tgaccgaggt gctcggcggc gacgacggcc gttgggcggc agaggttttc 1208701 ggggtgaccg aggccggcac cttcgagcac gggacgtctg tgctgcagtt gcccgccgac 1208761 cccgacgacg cggcgcgtct ggaccgggtc cgcgccgcgt tgctggtggc ccgcctggcc 1208821 cgggcccagc ccgcccgcga cgacaaggtc gtcacgtcct ggaacgggtt ggcgatcacc 1208881 gcgctggccg aagccagcgt ggccctggac gaccccgcgt tggcgcacgc cgcgcggcgc 1208941 tgcgcgacca ggctgctgga cctgcacgtc gtcgacggcc gcctgcgccg ggccagcctg 1209001 ggcggggtgg tcggcgacag cgccgccatc ctggaggacc acgcgatgct ggccaccggg 1209061 ctgctggcgc tctaccagct gacctccgag ggcgcgtggc tgacggcggc taccggattg 1209121 ctggacaccg cggtggcgca tttcggcgac ccgcagcgcc ccggtcgctg gttcgacacc 1209181 gccgacgacg ccgagcggct gatgctgcgg ccctccgatc cgctggacgg ggcgacaccg 1209241 tcgggcgctt cgtcgatcgc cgaggcgctg ctgacggcgg gccatgtggt cgacggtgct 1209301 cgcgccgagc ggtattggca gctggcggcc gacacgctgc gggcgcatgc ggtgctgctg 1209361 gctcgggcgc cgcggtcggc cgggcattgg ctggcggtcg ccgaggcggt ggtgcgcgga 1209421 ccgctgcaga tcgccgtcgc gtgcgacctg ccgcggtcgt ccctgctggc cgacgcgcgc 1209481 cggctggccc cgggcggggc gatcgtcgtg ggcggcgcgg cgggttcgtc ggcgctgctg 1209541 gtcggccggg atcgggtggc cggcgccgac gccgcctacg tatgccgggg ccgggtctgc 1209601 gatctgccgg tgaccagcgc ggccgaactc gccaccgctt tgggcgtacc cggctagcgg 1209661 actcgggtgg cacccgtcca ccgtgaaatc cgcgacgcgg tgtcggcgtg tcgcgtcgca 1209721 attttcacgc tcgcgaccgc cctgggcgtg ccgggtcaga acaccacgaa ccacatcgcg 1209781 atgtagtggc agatcgccgc caccgcggtg caggcgtgga agaactcgtg gtagccgaac 1209841 gtcgtcggcc acgggtcggg ccagcgtacc gcgtagagaa tgccgccgat gctgtacaac 1209901 gcgccgccaa caaacagcaa caccaacgcg gtcaccccgg cgttgtgcag gatcgtcgcg 1209961 gtgtaccaga ccgccaccca acccagcaac aggtacagcg gaaccccgac cgagcgcggc 1210021 gccgccggcc aacacatctt cagcaagatt ccggcgatcg caccgcccca aacaatcgac 1210081 aacaccacgc gcccgtcgtg ggccggcaag gccagcagcg cgaacggcgt gtagctgccg 1210141 gcgatgaaca cgaagatcat cgagtggtcg gcccgcttca tccagttgcg ggccgtcgcg 1210201 gatttccaat tgacccggtg ataagtggcg ctgacggtga acatggtgat cgtggccgcg 1210261 gtgtaggcca gcgtcgtcag gcccgccttg gcggaaccca ccgcccacga caccgcgacc 1210321 agcgacgcac cggccaacac cgcggtgccg gcggaataca cgtggatcca gccgcggaag 1210381 cgcggtttgg tcaggacacg ggcgacacct tcgacgaggt ggtgggcagc gtgggccggc 1210441 gtccttgctt ccgcggtggt ggcggtgtcg gcctggccgc tcatttcgcc tgttgcctcg 1210501 tcttgtgctt gccggtgggt gtcgtcgaac acagtagtcg ggccaggtag cggacatctg 1210561 actcgacgtc tgggtcacag tagtctgggt atctgtggag atcatcccgc cgcggctcaa 1210621 agagccgttg taccggctct acgagctgcg cctgcggcag ggcttggccg cctcgaaatc 1210681 cgacctgccc cggcacatag ccgtgctgtg cgacggcaac cggcgatggg cgcgcagcgc 1210741 gggctacgac gacgtcagct acggctaccg gatgggtgcg gccaagatcg ccgaaatgct 1210801 gcggtggtgc cacgaagccg gcatcgaact ggccaccgtc tatctgctgt ccaccgaaaa 1210861 cctgcagcgc gatcccgacg agcttgcagc actcatcgag atcatcaccg atgtcgtgga 1210921 agagatctgc gcaccggcca accactggag tgtgcggacg gtcggggatc tggggttgat 1210981 cggcgaggaa ccggcccggc ggctgcgcgg tgcggtggaa tccaccccgg aggtggcctc 1211041 gtttcatgtc aacgttgctg ttggctacgg cgggcgccgc gagatcgtcg acgctgtgcg 1211101 cgcgttgttg agcaaggaac tcgccaacgg ggccaccgcg gaggaactcg tcgacgcggt 1211161 gaccgtcgag ggtatctcgg aaaacctgta cacctcaggc caacccgacc ccgatttggt 1211221 gatacgcacc tccggcgagc aacgcttgtc cgggttcttg ctgtggcaaa gcgcctactc 1211281 ggagatgtgg ttcaccgagg cgcactggcc ggcgtttcgc cacgtcgatt ttctacgcgc 1211341 gctgcgtgac tacagtgcga ggcatcgcag ctacggcagg tgaatccggc gcaggacgcc 1211401 tatgttgcgc tgttcggctg cctgcgcaga gtgcacatta gccggctcgt catgctgtgc 1211461 aatctgccca ggtgaaaccc ggtgtttggg atcctggata gcgataccat cgactgatcc 1211521 atgcgggaca tccgatgctg gactgatcgg agtaaggcga tgtcgtttgt agtcgtggcg 1211581 ccggaggtgt tggcggcggc cgcttcggat ctagcgggca tcgggtcgac actggcgcag 1211641 gccaacgccg cggcgttggc gccgaccacc gcggtgttgg ccgcgggtgc tgatgaggtt 1211701 tccgcggcaa tcgcgtcgct gtttggggcg catggtcagg cgtatcaggc ggtgagcgcc 1211761 caaatgtcgg cgtttcacgc ccagttcatg caggcgttga cgggtgccgg cggggcttat 1211821 gcggctgcgg aggcggtcaa cgtctcggcg gcgcagagcg tggaacaaga cctgttggcc 1211881 gcgatcaacg ctcgcttcga gcggattttt gggcgcccgc tgatcggtga tggcgccaac 1211941 ggcgggccgg gacaagacgg cgggcccggc gggttgctgt acggcaacgg tggcaacggc 1212001 ggcaccagca cgaccgtggg gatggccggc ggcaacggtg gtgccgccgg gctgatcggc 1212061 aacggtgggt tcgggggcgg cggcgggccc ggcgcggccg gcggcaacgg cggcgccggc 1212121 gggtggctat tcggcaacgg cggcgccggc ggtgccggcg gcctcggcgt agcgcccggc 1212181 gtgcccggcg gcgccggcgg tgccggcggc gccggcggtg tcggcggacc cgccgggttg 1212241 tggggccacg ggggtgccgg cggggcgggt ggtgccggcg tggctggcgc cggcggcttc 1212301 gaggggacga tcggtgccgg cggtgccggc ggtgtcggcg gtgccggcgg tgtcggcggt 1212361 gccggcggtg ccggcgggtg gctgtacggc gacgccggtg ccggtgggga tggtggtgtc 1212421 ggcggtgccg gcggcaccgg cgggttaggc aaccgtggcg gcgccggtgg cgccgggggc 1212481 gccggtggtg tcggcggcgc cgggggtgcc gccgggctgt ggggcggcgg tggtgccggc 1212541 ggggtgggtg ggaccggcgg cggcgccggc ctcggtgctc agagcgtcac cttcagtagt 1212601 agcttaagtg gcctttccgg tggcgacggc ggcgccggcg gggccggtgg cgccggtggc 1212661 gccggtggca ccggtgggtg gctgtatggc ggcggtggtg ccgccggatc cggcggggac 1212721 ggtggtaccg gcggtcaggg cggcgccggc ggcgccggtg tatttagcct attcggatcc 1212781 ggtggcggcc ccggcggcaa cggcggcgtc ggcggcgtcg gcggtgtcgg cggtgctggc 1212841 gggcgtgccg gcttgttcgg cgtcgggggc ctcggcggcg cgggtggcga cgccggtgac 1212901 tccggcgaag gcggcttcgg cgggccgggg ctcgccggcg ggctgttcgg caaccccggc 1212961 aacggcggcg tcggcgggat cggcggcgac gccgcagccg gcggcgccgg tggggccgga 1213021 ggcaacggtg gggccggagg caacggtggg tggttgttcg gcaatggtgg tgccggcggc 1213081 tccggtggcg acggcggcgc cgccggccgt ggcggtgccg gcaacttggg ctcggccggg 1213141 ggtatcaacg cccccgccgg taaccccggc agcggctcgg tcggcatcgg cggtgccggt 1213201 ggtgccggcg gcaccgccgg gctgttcggc gacggtgggg ctggtggggc cggtggtgcc 1213261 ggcgccgccg gcggcttcgg cggcatcagc gccgccaccc cctcggcggg cagtgagggc 1213321 gccatgggtg gggccggtgg tgttggcggc aacgccaggc tgttgggcac tggtggcgcc 1213381 ggtggagtcg gcggcggcgg cggggccggc ggcgacggag gccgcggcgg agtcgcaacc 1213441 cccggcggtc agggcggtga cgctggggac ggtggcgccg gcggggccgg cggcaatggc 1213501 ggcggcgcca gcggcgccgg cgggtggctg ttggggaccg gtggtgccgg tggtgccggt 1213561 ggtaacggcg gcaatggcgg aaaagccggt tttagccctg ggccgaccaa cttcggtctc 1213621 aacggcgccg gtggtggtgg tggtgtcggc ggcaacggcg ccaccggacc ctggctgttc 1213681 ggcgacggcg gccccacccc aggcagcacc ggtgccggtg cggccggtgg tcacggcggc 1213741 gacgcccagc tgatcggcaa cggcggccac ggcggggccg gcggcaccgg ggtgccgaac 1213801 gggtcaggtg gtgccggcgg cctcagcggg ctgctgttcg gcgagccggg ggcgaacggg 1213861 taggttcggc gccgctgccg tgatcgcggc gaggcgtcgg tgtccgcgtc cgtgcgggcg 1213921 aatccagtcc ggtctgagtg cgtctactac agcttgcgca gccgtagccg cttgatggca 1213981 tcggactggt taccgtctgc ctgctgtcca cagaaaacct gtgtgcgatc ccgacgagct 1214041 tgccgtgcgt gggctacggc gaccgtcgcg aattcgtcga cgcggtggcc gtagaagcca 1214101 tctgcgaaaa cctgaatacc tcggggcaac ccgatcccga cctggtgatc cgcacctcgg 1214161 gggaacaacg cttgtccggc caccgagggc ccactggcgg agtttcgcga cgtcgacttc 1214221 tgcgcgcgct gcgtgactac agtacgccac acgcgtcgat cccctacgtt ccgccgccct 1214281 atcgaagcga cgggatccac gcttcccggc tggcggttga atcggttttc gatgcattgg 1214341 ctgggcgcgt cgaactctaa agactttatg gaaattagtt gtacagtgat aaaaccgtta 1214401 tagggtccgt tgtcaaacaa tgataatcac gtgataggaa cgtgattcat cggtctgaag 1214461 tgcttatgat gatttatata taaaaccgtt atatgtgggt aaaggattgc ggatgtcata 1214521 catgattgcc acaccagcgg cgttgacggc ggcggcaacg gatatcgacg ggattggctc 1214581 ggcggttagc gttgcgaacg ccgcggcggt cgccgcgaca accggagtgc tggccgccgg 1214641 tggcgatgaa gtgttggcgg ccatcgctag gctgttcaac gcaaacgccg aggaatatca 1214701 cgccctcagc gcgcaggtgg cggcgtttca aaccctgttt gtgcgcacct tgactggggg 1214761 gtgcggagtc tttcgccggc gccgaggccg ccaatgcgtc acagctgcag agcatcgcgc 1214821 ggcaggtgcg gggcgccgtc aacgccgtcg ccggtcaggt gacgggcaat ggcggctccg 1214881 gcaacagcgg cacttcggct gcggcggcca acccgaattc cgacaacaca gcgagcatcg 1214941 ccgatagggg cacaagcgcc atcatgacca cggcaagcgc gaccgcgtct tccacgggcg 1215001 tcgatggcgg aatagcggcg acgtatgcgg tcgcctcgca atgggatggt ggctacgtgg 1215061 ccaattacac gatcacccaa ttcgggcgcg acttcgatga ccgattggcg gttgcaattc 1215121 actttgcctg aaaatgcctc tatttcgaac gcgtgctgcg ctcaacttgc ccagtcgggc 1215181 acgcagtaca ctcttgacgc ccgagagcta taacggcacc ccccgtggac tcgatcaccg 1215241 tcggctacca agcagcgcaa accggcggct actcgccacc gacaaatctg ctgatcaacg 1215301 gtcaagccgt caccatcgac cagaccccca tcacctcgtc gccaacgact ccgccaccca 1215361 ccacaccacc cgagatcccg accggtggaa cggtgatctc cacctagttc gggacgacta 1215421 cggtcaccgg aggctacgtg gtgcagaaca acgcgtggaa caacccccgc cgggcagacc 1215481 gtcaacgtca gccaaaccgg gttcaccatc accgagatga acggtgctgc cccaaccaac 1215541 ggcgccccgc tgagttaccc ctcgatctgc gagggcgtgc actggggcca cctcgtcggt 1215601 gggcaccaac ctgcctactg aggtgggcca gattttgtcg gcgccgacca gcatcgacta 1215661 caactacccg acgaccgggg tatgggacgc ctcctacgac atctgcctgg attccacacc 1215721 caagacgacc ggggtcaacc agcaggagat catgatctgg ttcaaccacc agggctccat 1215781 tcagccggtc ggctccccgg tgggcaacac caccatcgag ggcaagaact tcgtggtgtg 1215841 ggatggcagc aacggcatga acaacgcgat ggcctatgtc gcgaccgagc cgatcgaggt 1215901 ctggagcttc gacgtgatga gtttcgtcga ccacaccgcc accatggagc cgatcaccga 1215961 ctcgtggtac ctcacgagca tccgggccgg cttggagccc tggagcgacg gtgtgggtct 1216021 gggggtcgat tcgttctcgg cgaaagtcaa ctaaagacca cgttgacacc caaccggcgg 1216081 cccggcatgg gccgtcgcgg cgtagaagct ttgaccgcgg cgcgaaacgt tcgctgctgc 1216141 ggcccatgca gatcgcacac gcttgcttga acatcgggtg gagccggtgg taacgccagg 1216201 ctttgggtgt cggcgcggct cggcggtcag ctgcgcggac gcggtcggcc atcgtgacga 1216261 cgagatgctg gcggcatgta cggcaaccgc tggctcgtct tagagccatt tgctgaggcg 1216321 catgctttgc gtcatgcaaa gtgcatatgc cgccagcggg atggtgtgca ttctgtccat 1216381 gggaaaccgg gttgatggtg ggcgcgtcag cgatacgatc tgtgcaccct gacgacatgg 1216441 ccgatgcatg attgatcgga ggtaaacgat gtcgtttgtg attgctgcgc cggaggcgtt 1216501 ggtcgcggtc gcttcggatc tggcgggcat tgggtcggcg ctggcggagg ccaacgccgc 1216561 ggcgttggcc ccgacgacgg cgttgttggc cgcgggtgcc gatgaggtgt cggcggcgat 1216621 cgcggcgctg tttggcgcgc acgggcaggc gtatcagacg gttagcgccc aggcgtcggc 1216681 gtttcatgcc cagtttgtgc aggcgttgac tggcggcggc ggggcgtatg cggctgccga 1216741 ggccgccaac gtctcggcgg cgcagagcac cgaccagcgg ctgctcgatc tgatcaatgg 1216801 gcccacccag gcgttgttgg ggcgtccact gatcggtgat ggcgccaacg gcgggccggg 1216861 gcaagacggc gggcccgggg ggttgctgta cggcaacggc ggcaacggcg gcactagtac 1216921 caccgccggg gtggccggcg gcaacggtgg cgccgccggg ctgatcggca acggcggggc 1216981 cgggggcggc ggcggggccg gcgcggccgg cggcaatggc ggtgcgggcg ggtggctgta 1217041 tggcaacggc ggcgccggcg gggccggtgg gacatcggtg atacccggtg tcgccggcgg 1217101 caatggcggg gctggcgggt ccgcgggact gtggggtacc ggcggggccg gtggcgacgg 1217161 cggcaacggc cggtcggggc cagtcaacgt cgccggcagc gcgggcggca acggtggcgc 1217221 tggtggcgcc gccgggttat tcggtgacgc cggggccggt ggcaacggcg gcaagggcgg 1217281 tgctggcggc gccgccttta gcattaactt caccgcaggc gatggcggtg cgggaggtgc 1217341 cggtgggtcc ggcggccacg cattgctgtg gggcgccggc ggagccgggg gtaacggcgg 1217401 atccggcggc acggggggtg ccggcggcag caccgctggc gctggcggca acggcggggc 1217461 cgggggtggc ggcggaaccg gtgggttgct cttcggcaac ggcggtgccg gcgggcacgg 1217521 cgccgccgcc ggaaacggct tagccgcggg taatggcgtc agcagcagcg gcggcggcgg 1217581 tgccggtggg accggcgggg ccggtgggga cggtggcgcc ggcggggccg gaggcaacgc 1217641 caggctgtgg ggcgtcggtg gcgccggcgg ggccggcggg gacggtggcg ccggcggggc 1217701 cggcggcaaa ggcggctctg gcctcagcgg taacgccaac ggcggggccg gcggcgacag 1217761 cggccgtggc ggcacgggcg gcgccggcgg cgagggcggc gccgccgggc tgctggtggg 1217821 caccggcggg cacggcggtg acggcggggc cggcggcgcc gccgtcaagg gcggtgacgg 1217881 cggggccgcc gccggcacgg gcatcgccgg cgctggcggc cgtggcggcg cgggcggcag 1217941 cggtggcagc ggtggtgacg gcgggggcgg ggccgccggc cccgccgggt ggctgttcgg 1218001 cgatggcggg gctggcggga acggcggggc cgcggccgcc ggcggcgccg gcggccaagc 1218061 cggcggtggc ggcgggaacg gcggcaatgg cggcaacggc ggcaatggcg gcaatggcgg 1218121 caacggcgcc accggggggt ggctgtacgg caacggcggg gccggcggcc agggcgccac 1218181 cgccggagcc ggcggagccg gcgctaacgg cgtcagcagc accaatggcg gcggcaccgg 1218241 cggcaacggg gggatcggcg ggaccggtgg gtccggcggg gccggtggca acgccgggct 1218301 gttgggcgtg ggcggcgccg gcgggcacgg cgcctccggc ggcgccggcg ataggggcgg 1218361 cgctggcggt accgggttca taagcagtga cggcggtgct ggcggtgatg gcggtgatgg 1218421 cggcaacggc ggggccggcg gcaccggtgg gctgttgttc ggtgccggcg gcaatggtgg 1218481 ccccggcggg tctggcggtg ccgccgatat tggcggcaac ggcggcgccg gtaacggcgg 1218541 gggcaccgac gggaacggcg gtaatggcgg gtccggcggc ggcgccggca gcggcggtga 1218601 cggcggcggg gctggcggca acggtgcgtg gctgttcggc aatggcggcg ccggcggggg 1218661 cggcggaaaa ggcggcaacg gtgccggcgg cgggcttggc ggcggttcat tcggcctccc 1218721 cggcctgaac ggcagcggcg gcgacggtgg cgacggcggt aacggtgccc ccggcggggt 1218781 gctgtatggc aatggcggcg ccggcggcca ggggtcaagc ggtggcatcg gcggccccgg 1218841 cgccaccggc ggtgccggcg gcaaaggcgg tgatggtggc gatgcgcagc tgatcggcga 1218901 cggcggcaat gggggcaacg gaggcgcggg cggcaccggg ggcaccccgg ggcccggcgg 1218961 acccggcggg tccggcgggc ttggaggcct gctgttcggc caaaccggca cggctggcgt 1219021 gtcgccgtag ccggtaggct ggccgcctcc gcggcattgg cgtcgtcgca aacttcgcgc 1219081 acgccctggt gtcgatcgtt gccgctgaat tggcgccgat gaccgcaacc ggtatcgccg 1219141 ctacgccggc ccgaggcggg tacaccacgg ttttcgaggg atggcaatat ccgggagtgc 1219201 gccggctggc ggcctaactc gcctgcaccc ggcgattgga ccgccaatta cagcttgcgc 1219261 agccgcagcc ggttaatgga atgatcggcg tccttgcgca gcaccagggt ggcccgggga 1219321 cgggtcggca gaatgttctc cacgaggttg ggccggttga tggtccgcca gatctcgcgc 1219381 gcggcgacga cggcctgcga gtcagaaaaa gccgcgtagt ggtggaagtg tgattccggg 1219441 tcggcgaacg ccgtggtgcg catggccaaa aaccgtgata cgtaccactg ctcgatgtcc 1219501 tcgatccggg cgtctacata caacgaaaaa tcgaacagat ccgacaccat gagcgtgggg 1219561 ccggtctgca agacgttgag cccctccagg atcaggatgt cgggatggcg gaccacttgt 1219621 tctgcccccg ggatgatgtc gtagtgcaaa tgcgaataca ccggcgcaca tgcgtagtcg 1219681 gagccggact tcaccgaggt gacaaaccgc atcagtgccc ggcggttata gctttccgga 1219741 aaacctttgc gatgcatgag gtttcgccgc tgcagctcgg cgttggggta gagaaagccg 1219801 tcggtggtca ccagatctac ccgggggtgg tgatcccagc gagccagcag cgcctgcagc 1219861 acgcgggcgg tggtggactt gccgaccgcc acactgccgg ccacaccgat gatgaacggc 1219921 accggccggt ccgggttttg ttggggctcg ccgagaaatt ccgcggtggc cgcgaacagc 1219981 cgttggcggg cggcgacttg caggtgaatc agccgggcca gcggtaggta gacctcttcg 1220041 acctccaaca ggtcgatctg ctcaccgaga ccgcgcaggc caaccagttc ttcttcggtg 1220101 agggctagcg gagtcgacat acggagcgcg cgccactgcc ttcggtcgaa ctcgacatat 1220161 gggctcggct cgctaagccg cgacatggtg tcagtcttgc agggacgggt gcggggcctg 1220221 atggctgggc tggcgaagtg cggtgctggc agactccgtg tcggtgccga gggccggggg 1220281 taccccctgg gcttagctgg gcactggggc cagggcgcgg tgtttcgatg gaattcagct 1220341 gtggccctgt gaatttcgca cgctgacgcc ggttgatgct gtgagtcggg cacaaaccgc 1220401 ccaccgctac tcgtgaccta cgtggcagct ggggcactag tggctgccgt ttgcggtgca 1220461 gacgtgcaac ggtggatggc gtgtgctgca ttaagggtaa tcagcccggg agcggctcgc 1220521 tggatacact ggcgcccgtg actgctgcac ctgacgctcg cactaccgcg gtaatgtctg 1220581 ccccgctcgc tgaggttgac cccgatatcg ccgagttgct ggccaaggag cttggtcggc 1220641 aacgagacac cctggagatg atcgcctcgg agaacttcgt accgcgcgct gtgctgcagg 1220701 cccagggcag tgtgctgacc aacaagtacg ccgagggact gcccgggcgg cgctactacg 1220761 gcggttgtga gcacgtcgac gtggtggaaa acctcgcccg cgaccgagcc aaggcgttgt 1220821 tcggtgccga attcgccaat gtgcaaccgc attcgggcgc tcaggccaac gccgcggtgc 1220881 tgcatgcgct gatgtcaccc ggcgagcggc tgttgggtct ggacctggcc aacggtggtc 1220941 acctgaccca tggcatgcgg ctgaacttct ccggcaagct ctacgagaat ggcttctacg 1221001 gcgtcgaccc ggcgacacat ctgatcgaca tggatgcggt gcgggccacc gcactcgaat 1221061 tccgcccgaa ggtgatcatc gccggctggt cggcctaccc gcgggtgctc gacttcgcgg 1221121 cgttccggtc gatcgccgac gaggtcgggg ccaagttgct cgtggacatg gcgcatttcg 1221181 cgggtctggt cgccgcgggg ttgcacccgt cgccggtgcc gcacgcggat gtggtgtcca 1221241 ccaccgtgca caagacgctc ggcggcggcc gctccggcct gatcgtcggt aagcagcagt 1221301 acgccaaggc gatcaactcg gcggtgtttc ccgggcagca gggcggtccg ctcatgcacg 1221361 tcattgccgg caaggcggtc gcgttgaaga tcgccgccac acccgaattt gccgaccggc 1221421 agcggcgcac gctgtccggg gcccggatca ttgccgatcg actgatggct cccgatgtcg 1221481 ccaaggccgg tgtgtcggtg gtcagcggcg gcaccgacgt ccacctggtg ctggtcgatc 1221541 tgcgtgattc cccactggat ggccaggccg ccgaggacct gctgcacgag gtcggcatca 1221601 cggtcaaccg caacgccgtc cccaatgatc cccgaccgcc gatggtgacc tcgggcctgc 1221661 ggataggcac gcccgcgctg gcgacccgcg gcttcggcga caccgagttc accgaggtcg 1221721 ccgacattat tgcgaccgcg ctggcgaccg gcagttccgt tgatgtgtcg gcgcttaagg 1221781 atcgggcgac ccggctggcc agggcgtttc cgctctacga cgggctcgag gagtggagtc 1221841 tggtcggccg ctgacgcggg cctgtcgttg gcgcgcataa gcgcgagagc gccgatcacc 1221901 gcgcgacacg gcggcgcccg atttcacgaa atctgtgtat gcgagttaca gttaccgcat 1221961 ggcacagaaa cctgtcgctg atgcgctgac ccttgagctc gagccggtgg tcgaagcgaa 1222021 catgacccgc cacctcgaca ccgaggacat ctggttcgcc cacgactacg tcccgttcga 1222081 tcagggggag aacttcgcat tcctcggcgg acgcgattgg gatccatccc agtcgacgct 1222141 gcccagaacg atcaccgacg catgcgagat cctgctgatc ctcaaggaca acctggccgg 1222201 tcatcaccgt gagctcgtcg agcacttcat actcgaggat tggtggggcc gctggctcgg 1222261 ccggtggacc gcagaggagc acctgcacgc catcgcactg cgcgaatacc tggtggtgac 1222321 ccgggaagtc gacccggtcg ccaacgagga cgttcgagtc caacacgtga tgaagggcta 1222381 ccgagccgag aagtacacgc aggtcgagac cctggtgtac atggcgttct acgagcgctg 1222441 cggcgcggtg ttctgtcgta atctggccgc gcagatcgaa gagcccatcc tggccggact 1222501 catcgaccgc atcgcccgag acgaagtgcg acacgaggag ttcttcgcca acctcgttac 1222561 gcactgcctg gactacacgc gtgacgagac gatcgcggcg atcgccgccc gtgccgccga 1222621 cctcgacgtc ctcggggccg acatcgaggc ctaccgagac aagctgcaga acgtggccga 1222681 cgctggcatt ttcggcaagc cgcagctacg gcagctgatc tcggaccgca tcacggcatg 1222741 gggcctggct ggggagccct ccctcaagca attcgtcacg ggctagacac ccgtcggcgc 1222801 gcctgccctg cgggggtacg gccggcggag tagcgtcgca ctcgatggct agcgacatgc 1222861 tctgctgcca gggcggcacc ttccgtcacg acggctgtca tgacaagggc aggaccggcc 1222921 ccggtcctgg tgtcgctgcc cccgccgaca tgctcgggtg ggtccgctcg agcgccgtta 1222981 gctcgaggag cgctccgtga ccgatacccg cacgtacgtg ctcgacacct ctgtgctgct 1223041 gtccgatccg tgggcgtgca gccggttcgc cgaacacgat gtggtggttc cgttggtggt 1223101 gatcagcgag ctagaagcca agcgccacca ccacgagctg ggatggttcg cccgccaggc 1223161 gttgcgtctg ttcgacgatc tgcgcctaga acacgggcgg ttggatcagc cgattccggt 1223221 tggcacccaa ggcggtacgc tgcacgtcga actcaatcac accgacccgg cggtgctgcc 1223281 cgcaggcttt cgcaccgaca gcaacgactc gaggatcttg agttgcgccg ccaacctcgc 1223341 cgccgagggc aagcgggtca cgttggtcag caaggacatt ccgctgcgcg ttaaggccgc 1223401 cgcggtgggg ctggccgccg acgagtacca cgcgcaggac gtcgttgtgt ccggatggtc 1223461 ggggatgcac gagctcgaga ccgcttccgc ggatatcgat gcgttgttcg ccgatggcga 1223521 gatcgacctg gtcgaagccc gggacctacc gtgtcacacc gggattcggt tgctgggcgg 1223581 cggttcccac gcgctgggcc gggtcaatgc gcataaacgt gttcagctgg tgcgaggtga 1223641 ccgtgaggcg ttcggtctgc gtggccgctc cgccgagcag cgggtggcgc tggatttgct 1223701 gctcgatgag tcggtgggca tcgtgtcgct gggcggcaaa gccggcacgg gcaagtccgc 1223761 tttggcgttg tgtgcgggtc tggaagccgt gctggagcga cgcacccacc gcaaggtggt 1223821 ggtcttccgc ccgctgtacg cggtcggcgg ccaggagctg ggctacctgc ccggtagcga 1223881 gagcgagaag atgggcccgt gggcgcaggc ggtcttcgac accctcgagg ggctggccag 1223941 cccggcggtg ctcgaggaag tgctgtcccg tggcatgctc gaggtgctgc cgctgaccca 1224001 catccggggc cgctcgttgc atgactcgtt cgtcatcgtc gacgaggcac agtcgctgga 1224061 gcgcaatgtg ttgctgaccg tgctgtcccg gttggggacc ggttcccggg tggtgttgac 1224121 ccacgacatc gcccagcgcg acaacctgcg ggtcggccgc cacgacgggg tcgccgcggt 1224181 gatcgagaag ctcaaaggtc atccgttgtt cgcccacatc accttgctgc gcagtgagcg 1224241 ctcgccgatc gccgcgctgg tcaccgagat gctcgaggag atcaccgggc cgcgctgagt 1224301 gcgcctcccg cgagcagaca cagaatcgca ctgcgccggc ccggcgcgtg cgattctgtg 1224361 tctgcttgcc ggtagacttc ctgggtgccg aagcgacccg acaaccagac ctggcgctac 1224421 tggcgcacgg ttaccggtgt cgtggtcgcc ggtgcggtgc tggtggtggg cgggcttagc 1224481 ggccgggtca cacgggcgga gaacctgagc tgttcggtca tcaagtgtgt cgcgttgacc 1224541 ttcgacgacg gtccggggcc ctataccgac cggctgctgc acatcctgac cgacaacgac 1224601 gccaaagcca ccttcttcct gatcggcaac aaagtggccg ccaaccccgc cggcgcccgg 1224661 cgcatcgcgg acgcgggcat ggagatcggt agccatacct gggaacaccc caatatgacc 1224721 acgattccgc ccgaggatat ccccggccaa ttctccaggg ccaacgatgt gatcgccgcg 1224781 gcgaccggcc gcacgccgac gttgtatcgc ccggccggcg gactgtccaa cgatgcggta 1224841 cgccaggccg cggccaaggt tgggcaagcc gaaatccttt gggacgttat acctttcgac 1224901 tggatcaacg actccaacac ggcagcaacc cggcacatgc tgatgacgca gatcaagccg 1224961 ggttcggtgg tgttgttcca cgacacctac tccagcaccg tcgacgtggt gtaccagttc 1225021 atcccggtgc tcaaagccaa cggctatcgc ctggtgaccg tcagcgagct gctcgggccg 1225081 agggcgccag gaagcagtta cggcagccgg gaaaacggtc cacccgtcaa cgaactgcgt 1225141 gacattccgg ccagcgagat cccgccgttg cccaacacct catcgcccaa gccgatgccc 1225201 aacttcccga tcaccgatat tgcgggtcag aattcgggcg ggccaaataa cggtgcgtaa 1225261 cctcaggact tgttgacctt cagcgcctca atgaccctct cgacggtggc gcgcgaggtt 1225321 gcatcaccga tgggggtggc gcccaggaag acggtgaccg gcttggtgtc gaccgcgatg 1225381 atcgtgaccg aatcaccttt gacgttgcgt gaactgtcgg cgattgtgat atcggcgtct 1225441 acccgggcgg ccctgacccc gtcgacggtg atcgacgacg tcttggtcgg gcccagggtg 1225501 ggcgacgagc ctgcgtagcc ggggccgtcg gccacgcatt gcatcaactt cgatgcttgc 1225561 gcggcgacgt ccatggtggt gacgaagttg gttatcgcaa cctcggcttg catcatccac 1225621 tggtcggcac cggccacctc gtggccgacg cccaccgcgt cgatgaggtt cgggttctgg 1225681 tcgtcggaga acgccgacca cccgggtgcc gcgctggtcg ggaacgacag cttacccgca 1225741 ctgatcgaat cgccgatggg ctgcacaccg ccggacacat ttggggtaca accggttgcg 1225801 gtttgctggg aaaacggttg cgacgtggga gcactcgtcg ccggagaggt tgccgtggtc 1225861 gacttgttgt cgccgcggag gccgatcacc aggatcacca ccagtaggat gacacccagc 1225921 accgcgaggc cggcgaggat cagccacggt gtcttcgatc ctggcccggg cggaggtggt 1225981 cctggcggat agggccccgc cggccagccg ggcggatact gctggggtgg gtaggccggc 1226041 ggataggagc cgccctgcgg ttggcctccc caatacgggt cctgcccata cgtattcggg 1226101 ccgtaggggt agttgccgta ggggccagcg ggaggaaccg tcatagccga tcgctgtcga 1226161 gctgctcggc cttggccatt gccagcacgt ccagacggcg gtccagatcc tcgatcgaca 1226221 gcctgtcgcc gatcaggcca cggtcgatca cggtttggcg aatcgttttg cgttccttga 1226281 gtgcttgctt ggcgacggcg gccgcctcct cgtagccgat ggccgaattc aacggtgtca 1226341 cgatcgacgg tgaggactcg gccagccgcc gcaggtgctc gacgttggcg gtcagccctg 1226401 ctatgcagcg ctgggcgaac agccgtgaca cattggtcag cagcttgaag gactcgagga 1226461 tgttgcgggc catcatcggg atgtagacgt tgagttcgaa tgcgccgttg gccccacccc 1226521 aggcgatggc ggcgtcgttt ccgatcacct gcgcggcgac ctgcgtaacc gcctccggca 1226581 gaaccggatt cacctttccc ggcatgatcg agctgcccgg ctgcagatct ggcagttgga 1226641 tctcggccag gccggtcaat gggcccgatc ccatccagcg gatgtcgttg gcgatcttgg 1226701 tcagcgatac cgcgatcgtg cgcagcgccc cggacgcctc caccagcccg tcgcgggcag 1226761 cctgagcttc gaaagaatta gccgccgtac gcaattccga cagaccggtc tgcgcgacca 1226821 gcaccgcgac cactctgacg ccgaagtcgt cgggagcgtt gaggccggta cccaccgcgg 1226881 tgccgccgat cgccagctcg cccagcctgg gcagacacgc gcgcacccgc tcgatgccgg 1226941 cctcgatctg gcgggcatat ccgctgaact cctggccgag tgtcaccgga acggcgtcca 1227001 tcagatgcgt tcggcccgac ttcaccaccg tgtgccaatc aagagccttg gcggccaatg 1227061 cgtcgtgcag ctgctgcagc gctgggatga gatgagcgac cgcggcctcg gtggccgcga 1227121 tgtgggtggc cgtcgggaag gtgtcgttgg acgactgcga catgttcacg tcgtcgttgg 1227181 gatgcaacgt gaccccgccc ttggccgcga tggacgcaat cacctcgttg gtgttcatgt 1227241 tggagctggt gcccgagccg gtctggaaga cgtcgatggg aaactggtcg tcgtgttgac 1227301 cgtcggcgat ctcggcggcc gcggcgatga tggcgtcggc tttctccggc gccagcaacc 1227361 cgaggtcgga gttcacctgc gcgcaggcgc ctttcagcag gcctagcgcg cggatctggg 1227421 tgcgctccaa cccgcggccg gatatcggga agttctccac cgcgcgctgg gtttgcgcgc 1227481 gccacaacgc ttttgccggc acccggactt cgcccatggt gtcgtgctcg atgcggtaat 1227541 tggcgctgtc ggcgtcaacg gccattgatc gggttccttg tgtgtcgtgg gtgtgttagg 1227601 gcaatgggta cacggcgctg ctgtcgccgg tgaagtcgat cgcggagtat tcgttgagct 1227661 ttgaaagccg gtggtaggcc tcgatcatcc ggacggtgcc ggacttcgag cgcatcacga 1227721 tcgaatgggt ggtgcagccg ccggggtagt aacgcactcc cttgagcagg tcgccgtcgg 1227781 tgaccccagt ggcgcagaag aagacgtttt ccccggacac cagatcttcg gtggtcaaga 1227841 cctggttcag gtcgtaaccg gcttctaggg ccttgcggcg ttccgcgtcg tcgcgcgggg 1227901 cgagctgcgc ctggatcgcc ccgcccatgc agcggatcgc cgcggcggcg atgattccct 1227961 ccggggtgcc gccgatccca gctagcaggt cggtgccgga gtgcggtcgg cacgccgaga 1228021 tcgcgccggc gacgtcgcca tcggtgatca gccggatccg ggccccggtg gcgcggacgt 1228081 cgtggatgag ttgcgcgtgc cgcggcctgt ccaggatgca caccgtcatg tctcgcaccg 1228141 acaggtcctt gaccttggcg accgctcgga tgttttccga gatcggcgcg gtgatatcca 1228201 gcacgtgtgc ggcatcgggg ccgacggcga ttttgttcat gtagaacacc gccgacgggt 1228261 cgaacatggt gccgcgatcg gctaccgcca gcaccgagat ggcgttggtc atgcccttgc 1228321 tcatcagcgt ggtgccgtca atggggtcga cggcaaagtc gcattccggt ccgtcgccgt 1228381 tgcccacttc ttcgccgttg tagagcattg gtgcgtggtc cttttcgcct tcgccgatga 1228441 ccaccacccc gcgcatggaa accgagttga ccagttcgcg catcgcgtcg accgccgcgc 1228501 cgtcgccgcc ctccttgtcg ccgcggccta cccagcggcc cgcggccatg gctccggcct 1228561 cggtcacccg gaccagctcc atggccaggt tgcggtccgg ggcttcccgg cgcgatggcc 1228621 tggtgtgcga cgggtcgtgg ctggccaccg cggccgtcga cgaaccggat ccctcagctg 1228681 tcatggttgg tgattgtccc agaagccgaa ccgtgcgctg gagctgggat actggccatg 1228741 tgaccgccga gccgcagccg acccctaggc cggctaaacc gcggttgctg caggacggcc 1228801 gcgacatgtt ctggtcgctc gcgccgctgg tcgtggggtg catcctgttg gcgggcctgg 1228861 ttgggatgtg ctcgtttcaa ctgggcggga ccaagcgggg accgatcccg tcctacgatg 1228921 cggcccaggc gctgcgggca gacgccaaga cgctgggatt cccgatacgg ttgccgcaat 1228981 tgccaggcgg ctggacgccc aactccgggg gtcgcggcgg catcgagaac gggcgagcgg 1229041 acccggcaac cggtcaacgc cgcaacgcgg cgacctcaat cgtgggattc atcagcccga 1229101 ccgggagata tctgagcttg acccagagca acgccgacga ggacaagctg gtcggctcca 1229161 tccacccgtc gatgtacccg acggggacgg tcgacgtggg cggcacccgt tgggtcgttt 1229221 acgagggttc ggacgaaaac ggtgccgtcg agccggtatg gacgacacgg ctcaccggac 1229281 cgggcggggc cacccagctg gcaatcaccg gtgccggcag catcgatcag ttccgcacgc 1229341 tggcgtcggc gacgcaatcg cagcccccgt tgcccgcacg atagcgggtc tcactcagcg 1229401 gttgacggag gcggggcgtt tcttgacgtg gccgggcctc gacgcggcag ccacctgcgg 1229461 cggacgggtg gtgcttcgaa ctgttccagt tcgacgcctt tgtacaccgc gaggtagacg 1229521 tcgatggtgg tgacgatgag gatcatcagc accgggccga tgatgatacc ccaggggccg 1229581 aacatggtga taccggcgaa caccgacagc aacatcagcg ccgagttcag ccgcgcgtcg 1229641 cgcggcacca ggatcggccg caggacgttg tcgatgttgg taaccaccag cagatgccac 1229701 agcagcacga agattccccc ggcgatattg ccgtagaaga tcatcccgat gccgaacgga 1229761 atcgtcacga tgccgccgcc cagcgggatg atcgacaacg cggtgagcac gatggcgaag 1229821 atgaagaagc cgtggtgaaa tccggcgatg tagatcgatg cggcgccggc gactccctgg 1229881 cacgccgcga tgacgaactg gccgttcacc gtgccgcgga ccatcgagcc catcttctgc 1229941 aggtacagat ccgtgacgtc ttcgccgagc gggttgagct ggccgatcag tgtccttagc 1230001 ttctcgcggt tcaccaagag cgcgacgaac acgtacacaa agatgatggc cgacgtgatg 1230061 acaccggcga ggcttccggc ggcgtcgcgc aggaagtgca gcagccattc gccgacgttc 1230121 tgtgctaccg aaatcatcgc tttgcgcagt gcgtccgcgg taaccgtgat gtgcaggaac 1230181 ggcacccggt caaacaagcc gttgacgaat tgcaggatct tgtcgccgag ggtgctcaga 1230241 tcggtcgtcc gcacccagtc ggcgacggag tcgaccatgc gagcgatctg cacgatcgcc 1230301 agccccacca aggctcccac cggcacgacg acggcggcca gcgccgacaa caacgtgcag 1230361 gcggccgaca ggccggtatt gaagcgcttg gtgaaccact tgaaaagtgg cgtgaacaaa 1230421 taggcgccga cggctgccac cacgatcaga acgaaatagt tacgcaggaa gtacgcaccg 1230481 aacagcaaag cgatcaacgt gaggatcgcc agggcgcgct tctgagtgag cgtgaattcg 1230541 gtgttcaaag cgggtccgcc cttcgcttct tggtgctgac tctgcgtcca gcaggcgggt 1230601 tactcgcact attgcgtggt ggatgcagag tcaacggatg tcggtgcagt gctgtagacc 1230661 tatgccacca cccaatcgag gtcgaacgcg ttgccgatgg cctcggctag agccggctcc 1230721 tgcgacgcga gcaggtagcc gatttgacga ccaaggtcac acaccgggat cgtttgggtg 1230781 ttgtcgcagc tgacgactga cggttgattc agcccgttca ccgcgtctac cgggacttcg 1230841 gtggctagcc cacgcacggt tgtcgtgatc ggggcgacgg tgacgttcgt gaggtgcgga 1230901 cgtacgacct cgcgggtaag gatcaggacg ggtctagcct tgtcaagctg tgcgatgtgg 1230961 ataggtcgca tcagtcgatg tcgagggcgg tacgagcgca gtggccggcc agcgtatcca 1231021 gatcacccgt ggctgacgtg ttggtggcga ggatctccgc gtcgcgttcg gcgagacgac 1231081 ggcggcgttc ccgttccagc gcccgcagca cgacagccgc acggctacgg gcatgctgtc 1231141 cccggacttc gtcgtcgatg aacgcgacaa tctcatcggg caagcgaacc gcaatctgtg 1231201 tactcacttc acagatggta ccagtttggt atgcacccgc cccaaaaccg ttcgcgccgc 1231261 cggcgaggac gaccccccag ggtaggtaca ttccagaagt atggtcgtcg acagctgcgt 1231321 ggccgaatcc cgctatggtc cggtccgggg cgccgatgat ggccgcgtca aagtgtggaa 1231381 aggcatccgg tatgccgcgc caccactagg tgacctgagg ttccggacgc ccgaacctcc 1231441 cgaacggtgg accgaggtcg ccgacgccac aaccttcggt ccggcctgcc cgcagccggc 1231501 catccccaac atgccgctcg atttaggggc gtcgcagagc gaggactgtt ggagcctgaa 1231561 catttgggcg ccggcggaca ccgagcccgg tgacggaaaa cccgtgatgg tgtggctgca 1231621 cgggggcgcc tacatcctgg gatcgggcag ccagccgctc tataacggcc gcaggttggc 1231681 cgccagcggc gacgtggtcg tggtgacggt caactaccgg ctcggagcgc ttggcttcct 1231741 ggacttgtcg tcgttcaaca cgtcacggcg acggttcgac tcgaatatcg gcctgcgtga 1231801 cgtgctggcc gtgctgcgct gggtagcaga caacatcgcg gtgtttggcg gcgatcccga 1231861 gaaggtcacg ctgttcggtg aatccgcgcg ggaatcgtca cgaccctgct cgccaccccg 1231921 gcggccgcgg gtctgttcgc ggcggcgatc gcccagagct caccggcgac atcggtctac 1231981 gaccaggtga gggctcggcg cgtcgcggtt tgcgtcctcg acaagctggg aatcgacccg 1232041 tccgatgtgc acaggttcat gaagtgccga ccgcggcaat cctttccgcg tccagcgaag 1232101 tgttcaacga agtgccggtt cgtaaccccg gcacgctggc gttcgtcccg atcgtcgacg 1232161 gcgatctgct gcccgactac ccggtcaagc tggcgcagga gggccgctca cacccggttc 1232221 ccttgatcat cggcaccaac aagcacgagt cggcgctctt tcggttgatg cgctcgccgc 1232281 tgatgccgat caccccgcgc gatcacgtcg atgttcaccc agattgccgc cgaacagccc 1232341 gatctgcaag tgccaaccga ggagcagatc ggctccgcgt actcgcgatg gcggcgcaaa 1232401 gcacgctcat tgagtatggc taccgacgtc ggcttccgga tgccgtcggt gtggctcgct 1232461 gaagggcaca gcggggtggc gccggtgtat ctgtatcggt ttgactactc gactccgctg 1232521 ctgaagctgc tgctggtccg ggccgcccat gccaccgaat tgccttacgt ctggggcaat 1232581 ctcggaggat cccaggaccc tgcattgaag ttgggcgacg ccaaagccgc catagcggtg 1232641 tcccggaggg tacggacgcg gtggatcaat ttcgcgacgc ggggcaaacc cacgggtccc 1232701 gatggcgagc cagactggcc atgttacgag gaggcccatc gtgcctgcct gattatcggc 1232761 aggcgagacg ccgtcgtgca cgacgtcgac gcacacatcc gagcgacctg gggcagcaag 1232821 tggtgagttt cagataattc tggctacggc ttgactgtgg cggccgtttt ttccgcccgg 1232881 gcctcgttct tcatctgctc aaacagactc acgtagtacg gcaggcattc ggtcagcgcc 1232941 tgctgggtgg tgaacagcgg ctcatagccc aggtcgcggc gtgccttagc gatcgaaaag 1233001 tagttgtcca ggtacagtcg ttcgacggcc agcggctcga gcagcggcgc ggggaatccg 1233061 aaccggaagt gcagccgctg ccaccccgtc attacccagc ggaccgcggg gccggaaatc 1233121 cgcatcttcg gccagcgctg cccgcacgcc tcgagcaccg gccgagcgaa ctcgaacata 1233181 ttgatcggct ctgcgtcgtt gatgaagtaa gcctgcccgg gcgctgtgcc gtccggcacc 1233241 agatgggcag cggccaagat gaaaccgtga atcaggttgt gcacgtaaga gttatccagc 1233301 cgggccgact tgcgcccgac cagcaccttg acgtggccct tgagcacact ttcgaacagc 1233361 ttgcggaaca tcgtctgatc gccgtttccc cagatgccgc tgggccggat cgcgcacgtc 1233421 agcatgccgt cgacaccgtt ctgggccaac acgaatcgct cggcaaccac cttggtctcg 1233481 gtgtagaggt cgttgaaccg gtcggtatag ggcagcgtct cgtcaccgcc ggcgatgttc 1233541 tggccgccca tcaccacact gttggatgac gtgtagacga accgctgcac cccggcccgc 1233601 tggccggcgt gcagcaggtt ctcggtgccg ccgacgttga ccgcaaagct acgttggcgg 1233661 tactcgtcgg tgaccgacgc gccgcccatc agctcgatga tcgctgcggt gtggaagatc 1233721 gtgtcgatgc cgtccacggc cgcggcgcag acgtccgcgt cggtgatgtc cccttgcagc 1233781 acctccagtt gcggatgcgc aggcaacagc gacggcgcgc ggtcgaagga acgcacccag 1233841 tgcccgcggt ccagcaaggt ggtcaccagg ttggcgccca cgaagcccgc gccgccggtg 1233901 accagaacgc ggccgagctc ggttgtcagc gatgcatcac ccatgcggcg aagcataacc 1233961 ttgccttagc cgttttgggc ctcgtcgccg gccagcacat cggacacccg ctggcgtgca 1234021 ccagctaagt gctcctcgca ccttttggcg agttgctccc ctctttccca cagtcgcagc 1234081 gacgcatcga ggtccaatcc gccctgctcc agaagccgca cgacttccat cagctcgtcc 1234141 cggcaggctt catagccaag ctgactgaca ggcacagttg cgtgggttct gcccgtgtca 1234201 tcgccgttgg ggtcacagac cattggtttg tccttcactg accgccgcta gggctccgtc 1234261 ggcaacccgc acgcgcagct tggtgccttc cggtgcgtcg tggaccgacc gcagcacctg 1234321 tggttcggat ccgccctcgg gtcccgtctg agcaacggtc tgcactatgg catagccgcg 1234381 ggcgagcgtg gcggccggac ccagcgtggc caggcgtgcg gccagatgac cgatgcgttc 1234441 ggtctcggcg gcgaccatca gggtgaggtt gcgacgaagc gtcgagcggg ctcggtggac 1234501 ctcctcggcg cgcacgctga ccatcgtcat cggatcggcc agcaccgggc ggctacgcaa 1234561 ctgcgcgact gcccgttgct cgcgggaaac ccagttgcgc aacgcctggg cgctgcgccg 1234621 gcgcagatcg tcgatcagcc gctgctcggc tgcggtgtcg ggaaccactt tcttggcggc 1234681 gtcggtgggg gtggcggcgc gcaggtcgac gaccagatcg cacagcggat tgtcgggttc 1234741 gtgaccgacg gcgctgacca cgggcgtacg gcaggccgcg atcgcgcggc acaacgtctc 1234801 gtcagaaaac ggcagcaggt cctcgacgga gccgccgccc cgggccagca cgatcacgtc 1234861 gacgtccggg tctcgatcga gctcgcgcag cgcctcgacg atctggccga cggcgttggg 1234921 gccctgcacg gcgacgttgc ggacggcgaa acgtgccgct ggccagcgcg ccgaggccac 1234981 cgtcgtaacg tcacgttcgg cggcactcgc acggccggtg atcagaccga tcatgttggg 1235041 caggtacggg atcggccgct tgaggcgggg gtcgaagagc ccctcggcgt ccagcagccg 1235101 gcgcagccgg tcgatgcgtg ccagcagctc gccgatgccg acagcgcgaa tctcgctgag 1235161 ccgcaaggag aatgtgccac gtccggtgta gaacgagggc ttgccgcaga ccactacctg 1235221 aacgccttcg gccagcttca ccggcgcgga cagcaccagg tcgcgggaac acgtcacggt 1235281 cagcgacatg tcggccgcag gatcgcgcaa taccatgaac accgtcttgg cgtctgggcg 1235341 cattgtgatc tgggccaatt gcccctccac ccagaccgcg cccagcttgt cgatccagcc 1235401 cgcgacccgg attgccaccg cgcgaaccgg gaacggattc tccgctgaat tctgggtcac 1235461 ttcgcagtcg cgcgggtgat cctgttggcg agcagcgtct ggaacggggc acgggccttg 1235521 gtggcctgct cgtaggccag cagggcctcg agctcgggga catcgagcgt gtgcagcctg 1235581 gcccgcagct gggccagcgt cagcgccgga tagtcgagtt cggctgccac cgcaggcgtc 1235641 ggaactgtcg gcttggccgc cgacttgggg tgcttggcgg ttttgggatt ggtcgagcga 1235701 tccgcactcc tggatgctgt cgtcgtttcc ggcgtatcgg ataccgagta caacgcgaac 1235761 cgcccgtcgg accggcgatc gtcgttcttg gcttcgctgg catccgacaa gccgagcaat 1235821 ggaatcgaag tcccttcgag cgcgtcgggc aagtcctcgt cgaatgttgc ccactccggc 1235881 ttctcgtcct tgggcggaaa cagcgtctcc agggtgttgt cgcccttgat caccagttcg 1235941 gccaggccct gttggaatcg catcaccacg tgcgccgcct ggctggccag ggtcattggg 1236001 tacatcagga tggttcgtgg cagcttcatc gtctcctcaa cggcgactgt cgccgcgccg 1236061 accaatagcc gaaccccata cggtgcagta gccatggatc caagactgcc tcaagcagcg 1236121 gctaactcca agccggtggc cgtgagctgg cgggttcgtg tcggcccaaa gtaccctgaa 1236181 tgccatggtt ccgacggtcg acatggggat tcccggggct tcggtatcgt cgcgatcggt 1236241 ggccgaccgt cccaaccgta agcgggtgct gctggccgag ccgcgtggct actgcgctgg 1236301 cgtggatcgg gccgtcgaaa cggtcgaacg cgcgcttcaa aaacacggcc cgcctgtcta 1236361 cgtgcgtcac gagatcgtgc ataaccgcca cgtggttgac accctggcta aggccggtgc 1236421 ggttttcgtc gaagagaccg agcaggttcc cgagggagcg attgtggtgt tctccgcgca 1236481 cggggtcgcg cctacggtgc acgtcagcgc cagcgagcgc aacctgcagg tcattgacgc 1236541 cacctgcccg ctggtcacca aggtgcacaa cgaggccagg cggttcgccc gggacgacta 1236601 cgacatcttg ctgatcggtc atgagggcca cgaggaagtc gtcggtactg ctggggaagc 1236661 tcccgatcat gtgcagctgg tcgacggggt ggacgccgtc gaccaggtga ccgtccgtga 1236721 cgaggacaaa gtggtttggc tgtcgcagac caccctgtcc gtcgatgaga ccatggagat 1236781 tgtcgggcgg ttgcgtcggc gtttccccaa gctgcaggat ccgcccagcg acgacatctg 1236841 ctatgcgacc cagaatcggc aggtcgcggt caaggcgatg gcgcccgagt gcgagctggt 1236901 catcgtggtc ggctcgcgca attcgtcgaa ttcggttcgg ctggtcgagg tggcgctggg 1236961 tgccggggcg cgggccgccc acctggtgga ctgggccgac gatatcgact cggcctggct 1237021 ggacggcgtt accacggtcg gcgttacgtc gggggcatcg gtccccgagg tgctggtgcg 1237081 cggtgtgctg gagcggctgg ccgaatgcgg ctacgacatc gtgcaaccgg tgacaacggc 1237141 caacgagacg ttggtgttcg cattgccccg ggagctccgc tcacctcgct gagcacatcc 1237201 gctcacggtt agacgtcgta ttcccaggat tcagccggtg gtctgcgcgg tgcccgcgaa 1237261 cgatcccgcc gatcgaaccg ctgctcctcg cggtagttgt cccgccgcgc gtcgcgagta 1237321 gctgacccgc ggtagcggac ctgcgagatc ggatggtgtg tcgggttggt tggctcgctg 1237381 ggacgggcgc gtcggcgttg gggctcgtag gtgggctcgt agcgcgcata gggctggtat 1237441 cgagcacccc gacgttcgta ccggttgacg ggctcggcag gcccggaggg ttcggacggc 1237501 tggtagctgc ggtaagaatc gaagcggctg ctgcgcggtg ccggccgttc gtgggcattg 1237561 cggcgcgggt gcggatcatt ctgcggccta gggcggcggc gcgatcgacg ctcggctatg 1237621 ggttcgcggt tgtcctccga cgggggtcgg gcgtgtcggg aacgagtacg ggcaggccgc 1237681 tgggccgacc gccggccgcc gtcgtcgtcc gaatcgccgg tcatcagcga gctgagcttc 1237741 ctggcgatgc tgtcgaacag agccgtcccg agataccacc tgaccagtcc gatcagcagc 1237801 acgccggcag ccgtgcccag catcagcggg aaacgttcga tgagcgagta gccgcagttg 1237861 atcaagaggt ctttgaactt gccgatcgtg cccccgtgga acagccagta ggccccgggc 1237921 acggcgcaga aaagtatcag tggcggctgg acgagcgcgg tgaacaggtc cgactgccgg 1237981 acggccagga ccgcccccac gcagccggcg atatagcagc cggtaaagac gagggttagc 1238041 gccttgtggc ccgatccggc gtcgattgca tacccgatcg ccgtcgcggt gacggcgatc 1238101 aggatggcag cccaccacgg cacacctggg atgtgggggt gaatcgagcg gtgacttgcc 1238161 tgtaccgccg acctcgcccg ctgcgctgac acacgtcgac cgtaccggca atggcgccga 1238221 aggcggcacc gcctcgcctt aaacttggct ctctgtgagc ttgagcctgg ggatcgtggg 1238281 cctgcccaac gtcggcaagt cgacactttt caacgcgctg acccgaaaca acgtggtcgc 1238341 ggccaactac ccgttcgcga cgatcgaacc gaacgaaggt gtcgtctccc tgcccgatcc 1238401 ccgcctggac aagcttgctg agcttttcgg atcgcagcga gtcgtacccg cgccggtcac 1238461 cttcgtggat atcgccggcc tggtcaaggg ggcgtccgag ggagccgggc tgggtaacaa 1238521 gttcctggct catatccgcg aatgcgacgc catttgtcag gtggtgcggg tgttcgtcga 1238581 cgacgacgtg actcatgtca ccggacgggt cgatccccag tccgacattg aggtcgtcga 1238641 gaccgagctg atcctggcag atctgcaaac cctggagcgg gccacgggcc ggctggagaa 1238701 ggaagcgcgc accaacaagg cgcgcaagcc ggtctacgac gcggcactgc gtgcccagca 1238761 ggtgctcgac gccggcaaga cgctgttcgc cgcgggggtg gatgccgccg cgttgcgcga 1238821 gctgaacctg ctgaccacca agcccttcct gtatgtgttc aacgccgacg aggcggtgct 1238881 caccgacccg gcgcgagtcg gtgagctgcg cgcgttggtg gcgcccgccg atgcggtgtt 1238941 cctggacgcc gccatcgagt cggagttgac cgaactggac gacgagtcgg ccgcggagct 1239001 gctggagtcc atcgggcaga gcgagcgcgg gctggacgcg ctggcccggg cgggttttca 1239061 caccctgaag ttgcagacct ttttgaccgc gggccccaag gaagcgcggg cgtggaccat 1239121 ccatcaaggc gacaccgcgc cgaaggcggc cggggtgatc cacagcgact tcgagaaggg 1239181 tttcatcaag gccgagatcg tgtcctacga cgacctggtg gccgcgggtt cgatggcggc 1239241 ggccaaggcg gccggcaagg tccggatcga aggcaaggac tacgtgatgg ccgacggtga 1239301 cgtagtggag ttccgattca acgtgtaggc gggaaagccg ggacgcagcc agagcccaga 1239361 tcccatggca tcattgcttg catcgagtga tgcatgtatt gatgggagtt ggtgaatgag 1239421 gacgacggtg accgttgacg acgccttgtt agccaaagcg gccgaattga ctggggtgaa 1239481 agagaagtcg acgctcctgc gcgaggggtt gcagacactg gtccgggtgg agagcgcccg 1239541 gcggttggcg gctctcggcg gcaccgaccc gcaagctacc gcggcgccga gacgccggac 1239601 gtcgccccgg tgatcctggt cgacacttcg gtatggattg agcacctgcg cgccgccgac 1239661 gcgcgactcg tcgagctgct gggcgatgac gaggccggtt gccatccgct cgtcatcgag 1239721 gagctggcgc ttggctcgat caagcagcga gacgttgttc tcgatctgtt ggccaacctc 1239781 taccagtttc cggtggtgac ccacgacgaa gtgttgcggc ttgtcggtcg gcggcggttg 1239841 tggggtcggg gactcggtgc cgtcgatgcc aaccttcttg gttcggtggc tctggttggc 1239901 ggcgcgcgac tatggacgcg ggacaagcgg ttgaaggcgg cgtgcgcgga aagcggtgtt 1239961 gcgctggctg aggaagtgtc ctgagttgta taccgtcagc gttgctggga gtaatcgacc 1240021 cggtgccgcg tggcgcatgt tcggccatgt tcattgcccg atttggcgcg atagcgtgat 1240081 ttatgttgat ttgttacatt cgcactgaac ccttccgtat ctatttttat attgttgcgt 1240141 gacatatccg ctgtacgcgt gggacgggcc attatttgga taatgcgtga taagcaccac 1240201 aagaattgat ttcctatgga tattgtcggt agcgttcgcg tccatgattg ctcttgcaac 1240261 gctgttgacg cttatcaatc aagtcgtcgg cactccgtat attcccggtg gcgattctcc 1240321 cgccgggacc gactgctcgg agctggcttc gtgggtatcg aatgcggcga cggccaggcc 1240381 ggttttcgga gataggttca acaccggcaa cgaggaagcc gccttggcgg ctcggggctt 1240441 tcaacaggga accgccccca atgccttggt gatcggttgg aatggccacc acacggcggt 1240501 gacgctgccc gatggcacgc ccgtatccag tggtgaaggc ggtggcgtgc gggtcggtgg 1240561 cggtggcgcc taccagccca aattcaccca ccacatgtat ctgccgatgg atgtggacgc 1240621 gggagaagac cagccgccgg cgccagatga gccggtcacc gcggtcgacg acgtggaacc 1240681 ggaaatgcct gcaccgtgcc cgacccagcg cccgccggtg accccgagac ataacctgtg 1240741 caacaaactc cggactatgc caggggcgct ctcggccgcg ctggccgcgg cggcgccggt 1240801 ctggccggcc cctataagcg gctgccgcgg gttcagcacg tccctcttag caaaaagaaa 1240861 tcacccagta atcgtcggga aatagagtgt acccaaacca atccttccgt ggcggaaata 1240921 ttcttggcgc ttctccaacg ccttcgccaa atcgttgtcc acggaacgat ttcacttatg 1240981 caagcacggc gctgccatac ggatgtgtag tcgaatggcc gacgaaccgc gcttagaagc 1241041 cggcgcgcac cccttcgaag agggccggga caaggccccc gaacttcgtg ccactcagat 1241101 ggaccatgtc cggttcaccg aaggtcggcg tgaacgtaac cgtgaccggc tcgagcggag 1241161 ccagcagttc cgccaaccgg gtcgctgaca gcgaccaact cgtatccgta ctccggtgac 1241221 acgtcaatcg actgcgatat cgacgtctgg ccgaacaaga aaccgttgac ggcgttgccc 1241281 ggagcctgca gaaccgcacc ggtggccccc aacaggttcc cgctatgcag cgcaccgaca 1241341 aatgccgtgt tgctgtcctg caaccccctg accgtgccca gggcacccat acgtcatcgt 1241401 cgagcacaca gcgtagccgc cgggcgctcc ggctctgggt gaaatgacgc tggggcctca 1241461 aggccagcac cggttaccca cttctcggcc ccgggagcgc accatgcgca cggcgatgtc 1241521 gccccgtcag gcatgtgccc aaaccgtgga caacgcacgt tgtcaccgtt tatcgtgagc 1241581 gcaaagtggg agtatggagt gtacgtgccc ggcccgggta ccctgagcgg caatgatctt 1241641 catcgtcgtc aagttcgaga ccaaacccga gtggaccgag cgctggccgg atttggtcgc 1241701 atcgttcacc gcggccacgc gtgccgaaga gggcaaccta tggttcgagt ggtcccgcag 1241761 cctcgacgac ccggccgagt acgtcctggt cgaatccttc cgtgacggcg aggccggcgg 1241821 cgtacacgtc aacagcgatc acttcaggca ggccatgcgg gaactgccga aggcactggc 1241881 gtccaccccc aagatcatca gccaaaccat cgatgcgacg ggttggtcgg cgatggggga 1241941 gatgacggtc gggtaaccgg cgaggcccga tcagccgccc acgtcgaccg cgatttcgtg 1242001 acccagccga taacccggcg ccaggggcag cgagtcaccg ctccagaact tgccggggtc 1242061 gaaccagtgc gcgtccttgt cggtgaccag caatcccatt tcctcatagg tgatggccac 1242121 cgtctccgcg caatatgccg tcgccaggcc catcgtgcgc tgctgttgct tacgtcgctg 1242181 ggtttgttcg cgcaccttgc ggtccagcac cggtatgccg cgcagccaat cgttgagggt 1242241 cggaagccgg ccgcgcagcc accggccggt caaccgggcg gtggttggga aaggcgtgcc 1242301 gttcatccgc gcgatgaccc gcagcagttt gtcctcctgg tcgcgattgg cgtgcggtgt 1242361 cagttgacgc agccagcacc gctgccgata acggccggcc cactgctgca cgacttggcg 1242421 ggcgtcgttg agctgcacgc cgcggtggtt ggtgccggtc catacgtcga gcagcttgtc 1242481 gcccagttcg gcatgccaga tcagcggcgg caagtcgtcg atggccaccg tcatgccgac 1242541 gtggttcacc ggggcgttcg tcaaggtctg gatcgcccgg tcgggtcggg aacggccgcg 1242601 aaacagccag aggtcgccgg tgcgggtttc gttcagcgct cgatccagcg ctagcgtgct 1242661 cgggtccacc ccatgcacca taggcggata tagcctgtcg gggtgcgcaa cgtgtggaag 1242721 tgggtcgggc tggccggtgt cgccggcgtc gtcgcgggtg gcgccctggt ggcgcgcgat 1242781 caacggaaac gacgtgccta cacgcccgac gaggtgcggg cccgattgca ccagaggctg 1242841 gacgaatccg acgtcgacgg ttatcagtcc aggtccggcc cgggtgccgc gtcgagcgag 1242901 aacaggcgat agctgccgaa acggatatcg gcacagtcgc tgacggcgtc gtgcaccggt 1242961 tcgcctacca ggatctggcc gccgaccgct tgccccgcaa cccgagcggt cattgcgacg 1243021 ttgcggccga acagatcgtc accgtgccgc accgagcgcc ccatgtggtg ccgatccgca 1243081 cccgaattcc ctggttccgc ttacgctttg cgctgttgcg cagcgcgtcc tggatgtcga 1243141 tgccgcaccg caccgcctgt tcggcgcggg cgaacgcgat catgaacccg tcaccctgac 1243201 tcgtgaccat gtgcccggac cagcgccgca ccagctcatg aaccagcttg tcatgcgcgc 1243261 caatcaactt gacccatgtg cgatccccga ttcgttcgtc gagcgcggtg gactcctcga 1243321 tgtcggagaa caggatcacc acccggccgt ccggggttac ccgagccagg tcgggacgct 1243381 ctacctcggc ccagtcggcg gggtcctcga tcgagctgcg cacggccgct ccgaaccctt 1243441 ctttgcgcac caggttcgcg gtctgccaga ccgtctttac cgcttcacga ccacccgaca 1243501 gcattgcggg cgtcgagccg ctcccgcggt tgctcggttt cctggcgtat ccgcctcagc 1243561 cggatgcgca tcgggacgag tccgccggcc tcgatcgtgg cgatcccggc caggatgtag 1243621 accgcgatct gcagcgtcgg gttgtcgggc caatgggcta gggttgagtt cggccgccgc 1243681 gggaaagcaa gtctggaggt gcgggtttgg ttgacggcgg aggtggcgcg tcagatctgt 1243741 tggtgatctt cggaattacc ggtgacctgg cccgcaagat gaccttccgc gcgttgtatc 1243801 ggctcgagcg ccaccagttg ctggactgcc ccatcctggg tgtggccagt gacgacatgt 1243861 ccgtcgggca gttggtcaag tgggctcgcg agtccatcgg tcgtaccgaa aagatcgacg 1243921 atgcggtgtt cgaccggttg gcgggccggt tgtcctacct gcacggtgac gtcaccgaca 1243981 gccagctcta cgattcgctg gccgaactga ttggctcggc ctgtcggccg ctgtattacc 1244041 tggaaatgcc gccggcgctg ttcgcgccga ttgtcgaaaa tctcgcgaac gtgcggctgt 1244101 tggagcgcgc acgcgttgcc gtggaaaagc cgttcggcca cgacctggcc tccgcgctcg 1244161 aactcaacgc ccggctgcga gcggtgttgg gcgaagacca aatcctgcgt gtggaccact 1244221 ttctgggcaa gcagcccgtc gtcgagctgg agtacctgag gttcgccaat caggcgttag 1244281 ccgagctctg ggatcgcaac agcatctccg agatccacat caccatggcc gaggacttcg 1244341 gggtggagga ccgcggcaag ttttacgacg ccgtcggtgc cctgcgtgac gtcgtgcaaa 1244401 accatctgct gcaggtgctg gcgctggtga cgatggaacc gccggtcggt tccagcgccg 1244461 atgacctcaa cgacaagaag gccgaggtct tccgggcgat ggcgccgctg gatcccgatc 1244521 ggtgcgtgcg tgggcagtac ctcggctaca ccgaagttgc gggcgtagca agcgattcgg 1244581 cgaccgaaac gtatgtcgcg ctgcgaaccg agatcgacaa ctggcgctgg gccggggtgc 1244641 cgatcttcgt gcgggccgga aaagagctgc ccgcgaaggt caccgaagta cggctatttt 1244701 tacgccgagt tccggcattg gcctttctgc ccaaccgccg accggccgag cccaaccaga 1244761 ttgtgctgcg tatcgacccc gatccgggta tgcgactgca gatttcggcc cacaccgacg 1244821 actcgtggcg agatatccac ctggactcct cgttcgcggt ggacctcggt gaaccgatac 1244881 gaccctatga gcggctgctg tatgccggat tggtcggcga tcaccagttg ttcgcccgcg 1244941 aggacagcat cgagcagacg tggcggatcg tgcagccgct gctcgacaac ccgggtgaaa 1245001 tccatcggta cgatcgcggt tcctggggtc cggaagccgc gcagtcgttg ctgcgcggtc 1245061 accgcggttg gcagtcgccg tggctgcccc gcggcacgga cgcatgagtt caaggagacg 1245121 aaaaggcgat gcaactagga atgatcggtc tgggccggat gggtgcgaat atcgtccgcc 1245181 gcttggccaa aggtggacac gactgcgtgg tctacgacca cgaccccgac gcggtcaagg 1245241 cgatggccgg ggaggaccgg accaccgggg tggcctcgtt gcgtgagttg tctcagcggc 1245301 tctccgcccc gcgagttgtc tgggtgatgg tgcccgcggg gaacatcacc accgcggtga 1245361 tcgaagagct ggccaacacg ctcgaggccg gcgacattgt gatcgacggt ggcaacacct 1245421 attatcgcga cgatctgcgg cacgaaaagc tgttgttcaa gaagggaatt cacctactcg 1245481 actgtggcac cagcggcggt gtgtggggtc gggaacgtgg ctactgcctg atgatcggcg 1245541 gggatggcga cgcgttcgcg cgcgcggagc cgatcttcgc caccgtcgcg ccgggggtgg 1245601 cggccgcccc gcgcaccccg ggccgagacg gtgaggtcgc gccatcggaa caaggctatt 1245661 tgcattgtgg gccttgcggt tcgggtcact tcgtgaagat ggtccacaac ggcatcgaat 1245721 acgggatgat ggcctccttg gcggagggat tgaacatcct gcgcaatgcc gacgtcggca 1245781 cccgcgtgca acacggtgac gccgaaaccg cgccgctgcc gaatcccgag tgctaccagt 1245841 acgacttcga catcccggag gtcgccgagg tatggcggcg gggcagcgtg atcggctcct 1245901 ggctgctgga tttgaccgcg atcgcgctgc gcgaatcacc tgacctagcg gaattctccg 1245961 gacgggtctc cgactctggc gagggccggt ggaccgccat cgcggcgatc gacgagggcg 1246021 tgcccgcgcc ggtgctgacc accgcgctgc agtcccgctt cgcctcgcgt gacctcgacg 1246081 acttcgccaa caaggcgctg tcggcgatgc gcaagcagtt cggcggacac gccgagaaac 1246141 cggctaacta agtcgcctga cgaagtccac cacgacgtcg gtgaacgcgt cgttgtcgtc 1246201 gccggcggcg gtgcgccccg cgttggacaa ttcgacgaac tccgcgttgg gcaccttggc 1246261 caggaagtcc cgggcaccgt cggaactgac cacgtcggac agctttccgc gaatcaacag 1246321 gaccgggatc gtcaggccca tggcagcccg ttcgaagttc tcggtgcgca gctgcgggtc 1246381 gtgccccggc gcggtcatca tggccggatc ccagtgccag tgccagcgtc cgtctcgcag 1246441 gcgcagattc ctcttcaggc cctcgggact gcgcggcttg tcgcggtgcg gcagatactc 1246501 ggcgactgcg tcggcggctt cctcgagcga accgaagccg tcgatgttgc ccagcatgaa 1246561 gtcccggata cgggcgttgc cctccttctc gtaacgcggc accacgtcga ccaataccag 1246621 tccgttcacc gtctgcggac cggcgcgctc ggcgaccagg atgccagtca gtccgcccat 1246681 gctggcctcg accaccacca cacggcggcc gatcgcctcg acgacgtgta gcacatcggt 1246741 ggtcggggtc tccacggcat agtcggcgcc gggagcgcgg tcgctgtcac cgggtccgcg 1246801 ggtgtccagc gcaacgacgt ggtgcccctc gtcggccagg atctggccgg tgtttttcca 1246861 ggaaaaccgg ttttggccgc caccgtgcaa catcaggatc gtcggccgat cggccgctgc 1246921 ggcgccccga ttccactcgt cggcgaccag ggtaatccca cgagcaccgg aaaacgcgac 1246981 cgcttgggga ctgctgctca cggcgctcac gggtcctgac gttaccttgc tgggcacgcg 1247041 ccaaatcgtc atcgccgacc tggaggatgc ggtgatcaag gtgccctagc tactggcctc 1247101 ttgggttccg ccggttacgt tggaccatgc gggctggacg cggcgaacgg gagtcaacat 1247161 ggcggacgac aatggctgaa ccacactgga ttgacgtgaa gggtcccaac ggcgacctga 1247221 aagccttgac ctgggggccg gccggcgcgc cagttgcgtt gtgcttgcac ggctttccgg 1247281 ataccgccta cgggtggcgc aaggtcgcac cccggctggc cgagtccggc tggcacgtcg 1247341 tggcgccgtt catgcgtggt tatgcgccgt cttcgattcc ggccgacggc agctatcacg 1247401 tcggtgcgtt gatgcacgac gccctgcggg tgcgctcggc tgccggtggc accgagcgcg 1247461 atgtgatcat cggccacgac tggggcgcga tcgccgctac cggcctggcc gccatgcccg 1247521 acagcccgtt tgccaaggcg gtgatcatgt cggtgccgcc gtcggcggca tttcgcccgc 1247581 tgggccgggt gcccgagcgt ggccggttgc tgcgtgagtt gccgcatcag ctgctgcgca 1247641 gctggtacat cctgtacttc cagttgccct ggctgccgga gcgatccgcc tcctgggtgg 1247701 tgccgctgct gtggcggcgt tggtcgccgg gctatcacgc cgaggaagac ctgcggcatg 1247761 tcgacgccgc gatcgggacg ccggagggcc ggcgggcggc cttgggaccg tatcgcgcca 1247821 ccatgcgcaa cacccgggcc ccggcggact atgccgactt gaatcggctg tggaccgagg 1247881 cgccgaagct gccggttctg tacctgcatg gccacgacga tggctgtgcc acatcggcat 1247941 tcactcattg gacggcaagg gtgttgcccg ccggcagtga ggtggccgta gtggaacacg 1248001 ccgggcactt cttgcagctc gagcagccgg acaagattgc agagttgatc gtggcgttca 1248061 ttggctcacc cggctgaagt cgtggccggg caccggatgg cggccgtcga cgcgcagttc 1248121 tactggatgt cggccaaagt ccccaacgac cagttcctgc tgtatgcgtt cgatggtgaa 1248181 cccaccgatc tggaacgtgc cgtcgcgcag gtctaccgtc gagcccgtgg gtgtccgggc 1248241 ttagggatgc gagttcagga ccgtggtgct ctggcctacc cgcagtgggt gcccacaccc 1248301 gtgcaacgtg accaactggt ctgccacgac ctggccgatc gcagctggca aggttgtctg 1248361 gcggccgttg tcggcctcgc cagcaagcag ctggatatgc gccggatgcc ctggcggctg 1248421 cacgtgttca ccccggtgca cgacgttccg ggcgtcagcg gcctcggcac cgtcgccgtc 1248481 atgcagttcg cgcatgcgct gggcgacggc gcgcgggctt cggcgatggc cgcgtggctg 1248541 ttcggccggc cggccgcggt tcccgaaata gccaggtcgc gtgcgggttt cctgccgtgg 1248601 cgggccgccc atgcggcccg cgctcatctc cgactggttc gtgataccaa tgccgggctg 1248661 gtagcgccag gtgtcggatc ccggccgccg ctgtccacga atgcccgccc cgaaggtgtc 1248721 cgcgcggtgc gcaccctgct gcggcggcgc tcgcaactag ccggtcccac ggtgaccgtc 1248781 acggtgctcg ccgcggtgtc caccgggctg ttgggtctgc ttggcgggga tgtggacacg 1248841 ctaggcgccg aagtacccat ggccaaaccg ggtgtgccac ggtcatataa ccacttcggc 1248901 aacgttgtcg ttgggctgta cccgcggctg gagccggatg agcgggtgcg gcggatcgca 1248961 accgatttgg ccaacgctcg ccgtcgcttt gaacatccgg cgatgctctc cgctgaccgg 1249021 gcctttgcgg cggtaccggc ggcgctgctg cgttggggcg tatcgcagtt cgacgctgag 1249081 gtgcggccgg tgcgggtggc cggcaatacc gtggtgtcca gtgtttatcg cggggctgcc 1249141 gatctgagct tcggggacgc tccggtggtg ctgacggccg ggtatccggc gctgtcgccg 1249201 gcgatgggtc taacccatgg cgtgcacggc atcggtgata ccgtcgcgat cagtgtgcac 1249261 gcggccgagt ctgcggtgtc tgacatcgac gcctacatgc ggctgctgga cgcggctctg 1249321 cagtgaaaac tactgggcat caccggattt agccgcttcg tctcgtgtca gcccgacggc 1249381 ctggatcagc tcctcgtgta gttcgaacca cacggtgtgg taggagtcga tgagtgggcg 1249441 cgtcagccag gcgatgtcgc ccgctttgac cttgtccagc gccgcacgca atttcaccgg 1249501 gtacctgctc aaccgcggca gctgcatggc caccgtaccg atgatcgggc ccacccgccg 1249561 gtgtacgcca tcgaggcggg acagcaccgc ggcgtcgtat tcggcgtcgt cgtgtgtgtt 1249621 aggcttttcg cccttgagct gccagtcggt gaccagcctc ttgaaatcgg cgttgacgga 1249681 acggaaatcg cggtaagcgg cagccagcac ggtcgaatcg gcccggttgc gctcctcggc 1249741 aagcaagtcg tcgagcctca tccggccgct gggactgatc cgcaacggcg tggcgtcgac 1249801 caggaggccg gccgcggtca gcctgtcgac ggtcgcggcg acgtcggcaa ggtcttcacc 1249861 caaggtctgc gccaggtcgg tggtgatcac ccggcccttg agccgcacgg cctgcagtac 1249921 cgtcaactcg ctcatgaact gatccgttgc gcgatgtcgg ccagctcgcg caactccggc 1249981 gtgtcacttt ccgaccaggc agacaatgcc agaacgcctt ggcgcacttc gccttcatag 1250041 ccgtcgacgg tgatctcctt gccggccagt gccgccgcga ccccgggacc gcaacccacc 1250101 acggccactc gaccgagctc gcggctaacc accgccgcat gactggcggc accccccacc 1250161 tcggtgacaa tgccttgcgc ggcaagcatg cccatgacgt cctccggtct ggtgtgatct 1250221 cgcaccaaga tgaccggctc gccccggtcc gcagcgtcca gcgcctcgtc cacctcggtg 1250281 taggcggtcc cggataccac gcccgggcaa gcgggcaggc ccttggccaa aagcggtgca 1250341 gccaaccgtg tttccgtctg cagcgacggc cgtagcaaag tctcgatgtg cgtcggagtc 1250401 acccggcgca gtgtctcggt gtcgtcgatg agtccctcgt gatgcagttg cagcgccagt 1250461 cgcacggcgg cctgcgccga gcgttccgcc ccgcgggtct gcagcagcca cagctggctg 1250521 tcctccacgg tgaattcgat ctcctggacg tcgcctgcca tgcgctccaa actgcgggcg 1250581 gccgccatca gttggtcgta gacggccggc tgctggtcgc gcagggcggt gatcggtgcg 1250641 acggcgacca atccggacac cacgtcgtcg ccttggccgc cgggtagcca ttcgccgaac 1250701 ggttcgttgg ctccggtgat cgggttgcgt gaggacagca ccccggcgcc cgagttcgcg 1250761 gtgaggttgc cgaataccat cgcctgcacc accaccgccg taccgccttg gtcgtcgagg 1250821 ccgtgatggt cgcgataggc aacggcgcga ggtgagttcc aggaggcgaa taccgcctcg 1250881 atgctcgcgc gcaactgggc atacgggtcg tcggtaatgg gaccggcgct gccgacgatg 1250941 cgccgataca tgctggtgaa tcgccgtctg gtgtcgtggg cgaagtcggc ggcacccggc 1251001 ctggcaagta ctcgttcgac cgcgtcggtc atgcccacgt ccagaatcgt gtccatcatg 1251061 ccgggcatcg actgggtggc tcccgagcgc acgctgacca gcagcggatt cgggccacgg 1251121 ccgaacgtgc acgaggtttc tgtttccagc cagctcatcc gatccagcac gtcatcccag 1251181 atcgcggcga tcgtggatcc gggcgcggcg agatagcgca cgcccacctc ggtggtaatg 1251241 cagaatgcag gcggcaccgg cagatggtgc cggcgcatca tgtcgatgcc gtggcctttg 1251301 ttgcccagga tctcgcgtgg gtagttcgcg ccgccgtcca gcgccacaac ggcgttttcg 1251361 agagttccgt cggggcaacc attggctcgg gtgatacgag tcatgggcac cccttgatgc 1251421 tacttatggg caacgccaga ccgcccactg tgggcccaca gggggcgcct tggtcagcgg 1251481 tcggactact cagcttgtgt ctggtgttgg gccttaccca tgctgcgaga caacgccggc 1251541 tgccggtgat ggtggctggc ggcgtggaca gcgcaccggc ccaacggctt ggttcgaccg 1251601 gctcccccgc ctaacgctac gggtcgcctt cgtcgtctgc caggagcttt tccgggtgat 1251661 ggaacgtatt gactcgaggt tggccgtggt cgagatgtgg cggcggtagc cactcggtgt 1251721 cgccgtgggc gttcttgcgg gtcgtccagc cacgttcggc taacggatga tggccaccgc 1251781 agccgagtgt caggtcattg acgtcggtgt tgcggcactg ggcgtacggc gtgacatgat 1251841 ggacttcaca gtaatagccg ggcacgtcgc aaccaggtgc gctgcagcca ctgtccttgg 1251901 cgtacaacat aattcgctgc gccggggagg ccaggcgctt ggtgtggtag agcgccaggg 1251961 ccttgcctcg atcgaatatc gcgaggtagt ggtttgcgtg gcgggccagc cggatcacat 1252021 ccgatatggg caagatcgta cccccgccgg tgaggcccgc gccggccgcg gcctccaagt 1252081 ccttcagcgt ggtggtcacg atgatgctgg ccggtaatcc gttgtgctgg cccagattgc 1252141 cacttgtcaa caaactacgt aattcggcgt tgagcgcgtc gtggttccgc tgtgggcagc 1252201 tgcgggtgtc tcgccgcgcc tgctccttcg agggcgcgcc gttcacacac ggtgccttct 1252261 gctcggggtt gcacataccc ggggcggcca gcttggccca caccgcctcg atagtggcgc 1252321 gcagctcggg ggtcacatat ccgctgagcc gcgacatccc atcgacatct tgctttccta 1252381 acgtcaagcc gcggcggcgg gcgcggtcct cgtcggtgta gtcgccatcg gggttgaggc 1252441 agtccatgat ccgcgcggcc aatttggcca gctggtcggg acggtactgg gtggcctgct 1252501 tagccaagtc ccgttcggcc ttctccaggg tcttgaggtc tacccaggat ggtaggcggt 1252561 gcacgaaagc acggattact tcaacatggc cgtcaccaat taacccgtgg cgctgtgcct 1252621 ttgcggtggc ggtgagtagc ggtggcagcg gctcgccggt cagcgcacgg cgctggccaa 1252681 ggtcggcggc ctcggccact cgccgcttgg cctcgctgcg ggtgatgcgc aaccggtcgg 1252741 ccagcgtcaa tcccagcttg ccgcccagct cctcctcggt ggattgttcg ccgatctgat 1252801 tgatcaacgt gtgttcgacg ctgggcagct ggcgtcgcgc ggtctcgcag tgctccagca 1252861 gcgccaggcg ctccggggtg gtcaatgcgt caaaggtcag ccccagcacg cgggacagcg 1252921 cggtagccaa tgacgcgaag gcctccgtga tctcctcccg agtggaacac atgactgaat 1252981 gctatgtgca ggcaccgaca acaatgcttg cccagagcct gctgaaacca cagtaatata 1253041 aggggtttcg ttgtctgctg tggcgtcggg cggtcaaacc gattgctcgg tcgacgaata 1253101 aggcaagctg ctgcccgcgt tctcgtcgac cgcgacgcga ccaccgagat aggggaacgc 1253161 acgttgggcg cacgacgttc ggttgcagat cttgcagccc gccccgatcg ggacctccgt 1253221 gctcgggtcg tccaggacga caccggtgga gtagacgagt ttatgggcgt gcgcgaggtc 1253281 gcagcccagc ccgaccgcga agttcttgtg cgggcccaga tacccgagcc cgtcggcagc 1253341 ggtggtcttg gccacccaga agtacgacct gccgtcgggc atttgcgcca cctggcggac 1253401 gatcctctct ggctgggcga acgcgtcgtg gaccacccac agcgggcagc tgccgccgac 1253461 ccggctgaag tgaaacgccg tcgcggactg tcgctttgag atgtttccgg ccttgtcggt 1253521 gcggacgaag atgaacggta tccctcgctg ccgcgggcgc tgcagtgtgg agagccggtg 1253581 gcagacggtt tcgaagccca ctccgaaccg gcggcccagc aggtcgatgt catagcgtaa 1253641 ctgctctgcg gcacggtgga attcgcggta ggggagcagg aaggcgccgg cgaagtagtt 1253701 ggccagtccg atgcgcgcga cgccgcgggc ttcggtgctg agctggtcat cggtggccac 1253761 gatcgacgag atcaggtctg actggcccac cagcgccagt tgggtggcga tctggaaggc 1253821 gcgctgtccg ggcatcagcc agtgggcgac ccgaaggacc ttggtgtcgg ggtggtagcg 1253881 gcgcttggcg gtgtcgggca gattgtcatc gatcaccacc gagatgccga accggtcccg 1253941 catcagctcg gccagctgga tgtccaatcc gccggtccgc atcccgcttt cggtaaacat 1254001 ccgctccgcc gccatgtcca ggtcgtggat gtagttgttg cggtcgtaga agaagtcgcg 1254061 gacctcctcg aacggcatcg gccgcgcggg cggtagctcg gtttcggcgg tcgcacgaga 1254121 tcggtagccc tctagttcct cggtggcggc gcgcaaccgg cggtgcacgg caaccaggct 1254181 gtggccgacc tcgggcatcc gggcgacgaa ttcttcgatc tgggcgccgc tgaccgcgtg 1254241 ctcgacgccg atgtcggtga agacgtcgga caggtcggcc accaaccgtg cgtcggaatc 1254301 cgaggagaaa tactgcgccg acaggtcaaa ccgctcggta agcagaagca gcacgggcac 1254361 ggtgatgggc cgctggtcat tctccaactg gttgacatag cttgtggata agtccagggc 1254421 cttggccagc gccacctggg tgagcccgcg ctcttgacgt aaccgccgca ggcgggcacc 1254481 ggaaaacgtc ctcgaatacg tcctagccac cggtaagaca ttactccgcg tcatgttcgc 1254541 aaaatttgca aaatgtgccg gatcaggaca caaaagtacg ctttttcagg gtcttttgtt 1254601 ggtgtcctgt gctgcgtatg gtgcggatta tgttgatgca tgcggtccgg gcgtggcgca 1254661 gcgccgacga tttcccgtgc accgagcaca tggcctacaa gatcgcccag gtggctgccg 1254721 atccggttga cgtcgacccg gaggtagcgg acatggtgtg caaccgcatc atcgacaacg 1254781 ctgcggtgag cgccgcatca atggtgcgca gaccggtcac cgtggcccgc caccaggcac 1254841 tggcgcatcc ggtgcgacac ggggcgaagg tatttggcgt cgagggcagc tactcggcgg 1254901 actgggcggc ctgggccaac ggcgtcgccg cgcgtgaact tgactttcac gacacgtttc 1254961 tggccgccga ctattcgcac ccggcggaca acataccccc actggtggcg gtcgcccagc 1255021 agctcggcgt gtgcggcgcg gagctgatcc gcggtctggt aaccgcctat gagatccaca 1255081 tcgacctaac ccgcggaatc tgcttgcacg agcacaagat cgaccatgtc gcccacctgg 1255141 gcccggcggt ggccgccggc atcgggacca tgctgcggct cgaccaagag accatctacc 1255201 acgcgatcgg ccaggccctg catctgacca ccagcacccg tcaatcccgc aagggcgcca 1255261 tctccagctg gaaggcgttc gcgccggcgc atgccggcaa ggtcggcatc gaggcggtcg 1255321 atcgggcgat gcgcggcgag ggctcaccgg ctccgatctg ggagggcgag gacggggtga 1255381 tcgcctggct gctggccgga cccgagcaca cctaccgggt gccgttgccc gcacctggtg 1255441 aacccaagcg cgccattctg gacagctaca ccaagcaaca ctccgcggag taccagagcc 1255501 aggcgccgat cgacctggcc tgccggctac gtgagcgtat cggcgatctc gaccagatcg 1255561 cgtcgatcgt gctgcacacc agccaccaca cccatgtagt gatcggaacg ggatccggcg 1255621 atccgcagaa gttcgacccg gacgcgtcac gcgaaaccct cgaccactcg ctgccctaca 1255681 tcttcgccgt ggcactgcag gacggctgct ggcaccacga gcgctcctac gcgcccgagc 1255741 gggcgcgccg ttccgacacg gtggcactgt ggcacaagat ttccaccgtc gaggatcccg 1255801 agtggacccg ccgctatcac tgcgccgatc cggccaaaaa ggcgttcggg gcgcgcgcgg 1255861 aggtgacgct gcacagcggt gaagtgatcg tggacgaact ggcggtggcc gacgcccatc 1255921 cgctgggcac ccggccgttc gagcgcaagc agtacgtaga gaagttcacc gagctcgccg 1255981 atggtgtagt ggaacccgtt gaacagcaac ggttcctggc cgtagtagag agtctcgccg 1256041 atctcgagag cggtgccgtg ggtgggctga acgtgttggt cgatccgcgg gtgctggaca 1256101 aagcgccggt gattccacca ggaatctttc gatgaccggg ccgctcgcgg cggccaggtc 1256161 cgtcgctgcc acgaaatcga tgaccgcgcc caccgttgat gagcggcccg acatcaaaaa 1256221 gggcctcgcc ggcgtggtgg tggacaccac cgccatctcc aaggtggtgc cgcagaccaa 1256281 ttcgttgacc taccggggat atccggtcca ggatctggca gcccgctgca gtttcgagca 1256341 ggtcgccttc ctgctgtggc gtggtgagtt gcccaccgat gccgagctgg cgttgttcag 1256401 ccagcgcgaa cgagccagcc gtcgggtgga ccgctcgatg ctgtcattgc tggccaagct 1256461 gccggacaac tgccacccga tggacgtggt gcgcaccgcg atcagctatc tcggtgccga 1256521 ggacccggac gaggacgacg ccgcggccaa ccgggccaag gcgatgcgca tgatggcggt 1256581 gttgccgacg atcgtggcga tcgacatgcg gcgccgacgc gggttgcccc cgatcgcacc 1256641 gcacagcggg ctcggttatg cgcagaactt cctgcacatg tgcttcgggg aggtacccga 1256701 aaccgccgtc gtgtcggcgt tcgagcagtc gatgatcctc tacgccgagc acggattcaa 1256761 cgcgtcgacg ttcgccgccc gggtggtgac ctcgacccaa tccgacatct acagcgcggt 1256821 gaccggcgcg atcggcgccc tcaaggggcg gctacacggc ggcgccaacg aagccgtcat 1256881 gcacgacatg atcgagatcg gcgatccggc caacgcgcgg gagtggttgc gcgccaagct 1256941 cgcccgcaag gaaaagatca tgggcttcgg gcatcgggtg taccggcacg gcgactcccg 1257001 ggtgccgacc atgaaacggg cgctggagcg cgtggggacc gttcgcgacg gccagcgatg 1257061 gctggacatc taccaggtgt tagcggccga gatggcgtcg gccaccggga tcttgcccaa 1257121 cctcgatttt ccgaccgggc ccgcgtacta cctgatggga ttcgacatcg ccagcttcac 1257181 cccgatcttc gtgatgagta ggatcaccgg ctggaccgca cacatcatgg aacaggccac 1257241 ggccaacgcg ctgatccggc cgctgagcgc atattgcggg cacgagcagc gggtgttacc 1257301 gggcaccttc tagtcttatg ggccatggga tttctccagc cccgacttcc cgacatcgac 1257361 ctggccgaat ggagccaggg ctcccgcagc cagaagatcc ggccgatggc ccagcattgg 1257421 gccgaggtgg gttttggcac tccggtgctg ctgcacctgt tttacgtcgc caagatcctg 1257481 ttgtacgtcc ttgtcggctg gctgatcgtg ttgaccacca aggggattga tggattcacc 1257541 gatgcggcag cgtggtacgc cgagccgatc gtgttcgaga aggtcgtgct ctacaccatg 1257601 ctgttcgagg tgatagggct gggctgcggc tttgggccgc tgaacaaccg attcttcccg 1257661 ccgatgggct cgatcctgta ctggatgagg ttcggcacca tccggctgcc gccgtggccg 1257721 gatcgagtgc cgtggacccg cggcaccaag cgcaagccgg tggacgttgc cctctacgca 1257781 ctgctggtga tgatgttgct gtcggcgctg ttcaccgatg gcgccggccc cataccggag 1257841 ctgggcacca cggtcgggct gctgcccgcc tggcagatcg tgctgatcct gctgcttctc 1257901 ggtgtgctgg gcctgcgcga caaggtgatc ttcctggccg cccgcggcga ggtctacgcg 1257961 acgctgacgg tgacgttttt gttcggccgc ttgaacggta tagacatgat cgtggccgcc 1258021 aaactggtgt tcctggtgat ctggatcggt gcggcgacat cgaaactcaa ccggcacttc 1258081 ccttttgtga tctccacgat gatgtccaac aacccgctgt ttcggccgcg gttcatcaag 1258141 cggatgtttt tcaagaagtt ccccggcgac ctgcggcccg ggctgttgtc gcggattgtc 1258201 gcccacgtca gcactgttat cgagatgtgt gtgcccgtgg tgttgttcgt tgcgcacggc 1258261 ggctggccga cggtggtggc cgcgacgatc atggtctgct ttcacctggg gattctgacg 1258321 gccatcccga tgggggtgcc gctggagtgg aacgtgttca tgatcttcgg cgtcctgtcg 1258381 ctgttcgtcg gccacgcctg cctcgggtta gcggacgtga aaaacccggt gccgctggcg 1258441 atcctgatcg ccgttgtcgc gggaatcgtc attgcgggca acgtgtttcc ccgcaagatc 1258501 tcgtttctag ccgccatgcg ctattacgcc ggcaactggg ataccacgct gtggtgcatc 1258561 aagccctccg cggaggacaa gatcaaccgg ggcatcgtcg cgatcgccag catgccggcc 1258621 gctcagctgg agcgcttcta cggcaaggac cgagcccaga tcccgatgta tctgggatac 1258681 gcgtttcgtg cgatgaactc ccatggcagg gcgctattta cgctggcgca tcgggcgatg 1258741 gccggccatg acgaagacga ctacgtcatc accgacggcg aacgggtctg cagcactgcc 1258801 gtcggctgga acttcggcga cggccacctg cacaacgagc aactgatcgc ggcgatgcaa 1258861 cagcggtgcg gcttccaacc cggtgaggtg cgggtggtgc tgctcgacgc gcagcccatc 1258921 catcggcaaa cccaggagta ccggttggta gacgcggcga ccggggagtt cgagcgcggc 1258981 tatgtccggg tggccgacat ggtgaaccgg cagccctggg acgacgacgt gccggtccac 1259041 gtgctgccgg gctagctgct cgtcagctag cccgcgcgca cctcccgggc ggcggcgacc 1259101 atgttgtgca gcgacgcggt cacctcgtcg acattgcggg tcttcagtcc gcagtcgggg 1259161 ttgacccaca gccgctcggc cggcaccgcg cgcaacgcgg cccgcaacga gtcggccatc 1259221 tcctcagcgg agggcacccg tggcgagtga atgtcataga cgcccgggcc cacaccgttg 1259281 gcgaagccga tcgcgttcag gtcgtcgagc acctccatgt gtgaccgggc cgcctcgatg 1259341 gacgtgacgt ccgcgtccag atcggcgatc gcgccgatca cctcgccgaa ctccgagtag 1259401 cacagatgcg tgtggatctg ggtggcgtcc gagacgccgg aggtggccaa ccggaaagcc 1259461 cctaccgccc aacgcaagta ctcggcctgg tcggcgcgac gcagcggcag cagttcacgc 1259521 agcgcaggct cgtcgacctg gatgaccgcg atgccggcgg actgcaaatc cacggtctcg 1259581 tcgcgaatcg ccagcgccac ctggttggcg gtatcggcca acggctggtc gtcacgcacg 1259641 aacgaccacg ccagaatcgt caccggcccg gtcaacatgc ccttcaccgg tttgtcggtc 1259701 agcgactgcg cgtaggtgat ccactcgacc gtcatcgccc gcggccggga cacgtcgccg 1259761 tacaggatcg gcggacgcac acagcggctg ccgtaggact gcacccagcc gttctgggta 1259821 gcgaagaaac ccgccaattg ctcggcgaag tactgcacca tgtcgttgcg ctccggttcg 1259881 ccgtgcacca gcacgtcgag cccgagccgc tcctgtagcg cgatcacctc ggtgatctct 1259941 tgccgcatcc ggcgcacgta ctcggcctcg tcgatctcac cggcccgcag cgccgcacgc 1260001 gcaacgcgga tcgccgaggt ctgcgggtag gagccgatcg tcgtggtcgg cagcggcggc 1260061 aggtgcagtc gcgcgtcttg gctggcgcgg cgctgggcgg cattgccgcg gtgggctccg 1260121 gacgcgacga tcgcctcgat gcgcgcccgg atttgcccat tgtgtaaccg cgggtcgcgc 1260181 ttgcgggacg cgatggcggc gcgggacgac gcgatctcgt cggcgaccgc gtcgtgtccg 1260241 tcgcgcaggg cacgcgcgag aacgacgact tcgcgcacct tttcggcacc gaacgccagc 1260301 cagctccgca acgcgtcatc caggtcggtt tccggttcca gcgagtacgg cacgtgcagt 1260361 gtcgagcacg acgtcgagac ggccacggta gccgccgaac ccagcagggt cgccaacgtg 1260421 cccaacgccg cctccaggtc ggtgcgccag acgttgcgcc cgtcgacgac cccggccacc 1260481 agcgtcttgc cggccagctc gggtaccccg gccaccgagg tgtcggcacc ggccaccagg 1260541 tcgacgccga tggcttcgac cggggtgcga gccagcgccg gtagggccgc gcccgggtcc 1260601 ccgaagtagg tggcgacata gatcgcaggc cggttgctca ccgagcacag cgcggtgtac 1260661 accgcttcag ccagggcggg cgcgtcgggg gagaggtcgg tcaccagcgc cggctcgtcg 1260721 aactgcaccc actgggcgcc gccgtcggca agcagcgaca gcagctccga atagaccgga 1260781 accaactctt cgaggcgttc gatcggcgcc cccgcgccgt cgacggcctt gctcagcagc 1260841 aggaaggtga tcggcccgat gatcaccgga cgtgcgggaa tgccttgccc taacgcctct 1260901 ttgagttcgg cgagcacctt gccggggtgc agcgtgaacg tggtcgacgg cccgatctcg 1260961 ggtaccaggt agtggtagtt ggtgtcgaac cacttcgtca tctccagcgg cgcgatctgg 1261021 tcggtgcccc gcgccgcggc gaaatagcgg tccagcccgt cggaaaccgg gctcactcgg 1261081 ggcggcagcg cgccgagcag caccgcggta tcgagcattt ggtcgtagta ggagaaggtg 1261141 ttcaccggca ccgagtccag accggccgcg gccagggccg accaggtgtc gcggcgtaac 1261201 gtggcggcga cggcctccag ctcggatcgg ctggtacgtc cggcccagta gccttcggtg 1261261 gcgcgcttga gttcgcggcg cgggccgatg cgcggggagc cggtgatggt tgcggtaaag 1261321 ggttgacgac gtacaggctg ggtcacgtgc tgtccttcga tcgacgggtg gttcaccgcc 1261381 cgcggacgcg cagccgatcc gattgaggtg cacaccgatg cacccggcaa caggcacggc 1261441 caaacgccca ttccacgagg cgatgagccg ccgggcgcgg cgcgtccggc acggctggca 1261501 ggtcttcgga ctcgcaggct cgcacccggt gggtgctcct actggccgtc gcttcccagt 1261561 cgttgagacc agtgcttgtc tacttccaag acggcggtcg ttcctgcata ccgctgcggg 1261621 acagtcccgg attctcacca ggttccctct cgcgaagcat cgttgccccg ctcgatgccg 1261681 acgccctttc ggacgccagc agaccagctg cgtggtcaag gctactccgg tgacatcggc 1261741 cggcatggcc cggccggcgg caaaatcgct cggcgccgga tgtcctcatc gggcccgccg 1261801 cgatcgtcat gtgggtgaga ttcgggatag gcccggacca tgatgggtca acaggccgca 1261861 atacgccgca ctcacctgca ccagagacgt cgactggtcg gcccccgagc aggccgctga 1261921 catggccgcc taccagaagt tcgggcagga gcacgccgcc gcgatccgtg gcggcgccgt 1261981 gctgcacccg acggccaccg ccacgacggt ccgggtaacc ggcgcccgcg gcggcgacgt 1262041 cgtcaccggc gacggtccgt acgaggcggc cgacctggac gagcaagggc cattcccgat 1262101 ggagacggtc tacctgtggg aggacggccc gaacggtacg acgaggatga cgctgtaaaa 1262161 ccgtggtgag ccttcccgct tcgcgggaat cgccgcaccc gccatgacgg tggcggtcag 1262221 gcgggccaac gcgaaggatc tcgcgcggcg caggctgctg gaatccgggg gctaaccgtc 1262281 gaagaacccg gactggtcat taccggcgtt gaacccgcct gagctgttgt cgccggagtt 1262341 ggccaccccg gaggtggtgg tgaaggcggc gttggtggcc gagtttccga tgccggtgtt 1262401 gttgaagccg gtgttgaaca ggcccgtgtt aaagccggtt cccgagttgc tgatgcccac 1262461 gtgctggccg ccgccggaat tgagcagacc cgagtgaccg atgaagaagg cgccggtgtt 1262521 ggtgttctgg aagccggagt tcgcgtcgcc ggagttattg aagcccgagt tgccggtgcc 1262581 gatgttgccg aagccggagt tcatcaccgg ttggtccacc gggctgccga acccggtgtt 1262641 caggtctccg gagtggaagc cgccagtgtt gatatcgccc gagttggccc agccggtatt 1262701 gaagtcgccc gagttcaagt ctccggtatt caaggtgccc gagttgaagc tgcccgtgtt 1262761 gtaggcaccc gagttgccga cacccatgtt ctcaaatccc gagttgccga acccgaagtt 1262821 gttgttgcct gcgttcccga aaccgaagtt gccgctgccc gcgttcccga agccggtgtt 1262881 ggtgaagccc gcgttcccga aaccggtgtt ggtgtcaccg gagttgaaga agccgaagtt 1262941 gctgtcgccg gagttgaaga agccgatgtt gttgttgccg gagttgaaca agccgatgtt 1263001 gttgttcccg gagttgccga agccgaggtt gccgatgccg gagttcagtg cgccgatgcc 1263061 gatcatgttg tcgccggtga gcccgatacc gatgttgttg ttgccgagat tcgcaatgcc 1263121 caggttgttg ttgccgagat tcgcaatgcc caggttgttg ttgccgagat tcgcaaagcc 1263181 cacgttggga gagccgtgat ttgcgctgcc cacgttgaag gaaccggcgt tggcggtgcc 1263241 gaagttgaag ctgccgacgt tcccgctgcc ccggttgcca tcgccgatat tccccaggcc 1263301 gaagttaccg ttgccgtcgt tgccgctgcc caggttgagg ttgccgaggt tcccgctacc 1263361 gaagttggtg ttggcggtgt tgccgctgcc aaagttgaag aaaccggtat tgccgctgcc 1263421 caggttggcc tggcccgtgt ttccgctgcc taggtttgcg ttgccggtat tgccgttgcc 1263481 taggttgtag tcgccgatgt tgccgatgct gaagatgttg ccgatgccgg tgttgccgat 1263541 acccaatgcc gggatggcca gggcagcggg gccggaggcc agtgcgggcg ccgtcgggtt 1263601 gggcagggcg cgcaccgcct gtgcccacgg ggccagctgg gcggccaccg ccgaggcccc 1263661 gctgtggtag ctcaccatgg ccgcgacatc ggcggcccac aattgttcgt aggctgcctc 1263721 ggtcgccgcg atcgccgggg cgttttggcc gaacaggttt gatagcacca gcgacaccag 1263781 ctggtggcgg ttggcggcga ccagcagtgg atccacggtg gccgcccgcg cggcttcata 1263841 caccgccgcg gccgccttgg cctgtgcggc cgcgcttagc gcgcgtgttg ctgccgtgct 1263901 tagccagctg gcataggggg ctgccgcggc ggccatcgcc gtcgccgccg gaccttgcca 1263961 ggcggtgtcg gccagcgctg cggtggccga cgaaaacgag ttcgccgctt ggcctaactc 1264021 ggcggccagc ccgtcccagg ccgccgcggc ggccagcgtc gggcctgagc ccgcaccggc 1264081 aaacatcaac gcggaattga cctcgggagg caacaccaga aaactcatca cgccatccct 1264141 tccgcagctg gacgtgcccg ggccatcccc tcccgtgacc acaaacctcc gctggctgaa 1264201 tacgcacagc ccgatcctcc cggcgcgaag cagcgccgcg gtcccgcctg cttgacccca 1264261 gattccatgg cgcgcctccc accaccaaca ctgggccgat cgctcgacac ctcatgcagc 1264321 ttggcaatca aaacactatg agattcgcag ggcggcctca gcgttttcgc caaagcgctt 1264381 accccctgtt caaccccaac agcgcgatcg cgcttggcca cccattcggc ggctcggggg 1264441 cacggttgat gactacagtg ctacaccaca tgccggacaa gggaattcgc tacggcttac 1264501 agacgatgtg cgagggccgc ggccaagcca atgccaccat tgtggagttg ctgtgacagc 1264561 gaccgatagc cagccggcgg cgttgtcgag taccgcgaca atgtcatggt cattacgatc 1264621 aatcggccgg aagcccgcaa tgcggtcaat ggtgccgtca gcatcgtggt tggagacgcg 1264681 ctggaagaag cgcacgacaa ccccgatgtg cgggccgtgg tgatcaccgg cgccggcgac 1264741 aagtcgcttt gcgccggtgc cgacctcaag gcgatcgcac gccgggagaa cccgtaccac 1264801 ccgcatcacg gcgagtgggg catcgccggt tacaggcacc atttcatcga caagccgacc 1264861 agcgccgcgg tcagtggcac ggccttggac gacggtgccg agccagcgct ggccagcgac 1264921 ctggtggtgg ccgacgagca cacctaattc gggtttgccg gaggtcaaac gcgggctgat 1264981 cgccgccgcc gggggtgtac cggtgagccg ctgaccgcat ccgacgactg ggagtggggc 1265041 ctgatcaacc gggtcgtcaa ggagggttcg gtcgtcgagg ccgccctcac ctggccgtgc 1265101 gggtgaccgt caacgcgtcg ctgtcggtgc aggccagcaa gcggatcgcc tgtggtgtcg 1265161 atgacggggt cgtcgtcgac gaagggactc cgcacccagc gcgagatggg ttccctgatg 1265221 agatcgcagg acctcgggcg ttcgccgaga aacaggaacc ggtgtggcgg gcccgctgca 1265281 tcgtctcggc gccttggatg ggcttggcgg gcgtaccgtc agccagcact gtcgcattgc 1265341 caacgtttgt gggacttatc ccgatgccgg ggcgcagtgt cgcgctgagg tgggcacaac 1265401 gagcatcctt cccgggagaa ccaatgtggc ggatgtgaca acgcgccgac aacaccagat 1265461 cctgggctgt ctcagtacgc caggatgttc accccgtacc ggaatgccgt gggcagaagt 1265521 gcgcacagcg gcacgatggc acggcgtgcc gcgcgtggcg tactggccag caccaacccg 1265581 cgggtgacta gccggtaatc acgagtgatc cggtgccacg cggcctcata cgacgccggt 1265641 gtgtcgtcga cgatggcgct caccgccgcg gcggcctgct tgacggcaag gctgatgcct 1265701 tcgccggtta gggcatcttc gtacccggcc gcgtcaccga ccaaaagcac ccgccccgcg 1265761 acgcgccggg agaccacctg gcgcaaggga ccgcagccac gtgcgtgtcc gcggctcgcg 1265821 tcttgcagat ggtgtgcaag gctgggaaac caggcaagtt cgggtcgttg gcgggacaag 1265881 atcgcgacgc cgaccagatc cggttccacc ggagtcacat aagcctcacc ccaacgggac 1265941 caatgcactt cgacgaagtc cgaccacacc ggcagccggt aatgccagcg caccccgtat 1266001 cgccgtggtg tcccggcggt ggctttgatc ccgacggcgc gccggacggc cgaatgcagt 1266061 ccatcggctg ccaccaacca tttcgcgcga acgccggcgg cggtcacacc atgtgcgtct 1266121 tgctgaatag tggctacccg cgaccggatc cattcagtgt cttgctcttt ggctcgtgcc 1266181 gccagtgccg catgcagcgt ggtgcgtcgc acgccccgcc ccggcccggt gcgaaaccgc 1266241 gcctgcaccc gacgatgttc accaacgtag gcaatcccat gaaagggcag accgaccggg 1266301 tccacgccta gcgaggtcaa ttcggccagg ccaccgggca tcagcccctc gccgcacgcc 1266361 ttgtcgatgg gattctcgcg aggctcggcc acgatcaccg aaagtccacg cgcgcgtgcg 1266421 tgcaatgccg tggcgagtcc gccggggccg ccgccgacga ccaacaggtc ggtgtcgtag 1266481 ctggtcatat gtagcccaga acggagttct ccacccgcag acgaacggtc agcaaggtcg 1266541 cattggccag ggtgaaaacc agtgcggtca accacgccgt gtgcaccagt ggcaacgcga 1266601 acccttcggc caccaccgca acataattcg gatgccgcat ccaccggtag gggccccgcc 1266661 gcaccaacgt ggcgtgcggc aacacgatta cccgggtgtt ccaccgcttg cccagcgatt 1266721 tgacgcacca ccagcgcagg ccctggcttg ccaccactac ggccagcatc ggccagccga 1266781 gccacggtat gaaaggccgg tgcaaggccc acggttcgac gacgcagccc agcagtaggg 1266841 cggtgtgcag gataaccatc accacatagt gtgggcggcc aaactctttg ccgccctgcg 1266901 cgaaagacca ccgcgcgtta cgctgggcca ccaccagctc cgccagccgt tcgaagacga 1266961 ccgccaggat cagcaggtag tacacggccc taccacctaa gaagcaccga ctcggaggaa 1267021 aaacccgggc ctatgcacac tatcctggcg ggaccggcga tgcagcccag cccgaacggt 1267081 ggaatagctc cccctgcgat cgactccata ttgtcagcca cgttactggc gccggatggg 1267141 ttcagattct ggcgagtggg accgccattg ccgggccgtt ccacggcccg tatcgtcgcc 1267201 gcgctgtgct ggattgcgcg gcttctcctc gggccgttcc acggcccgta tcgtcgccgc 1267261 gctaggttgg acgctgtgcg gatcgtggtg agcagtgcca ccagaaatgc gggttcgtac 1267321 acctgtgtca gcaccggcag cgctggatgc cgcgagatta caccgcccct cgctgggccc 1267381 acgcctgggc cggtgaaccc cggcccgccc gctggcaccc tgcgaaccag cctgcacatc 1267441 ctgaccactc caaccgcgaa agtccggcct gcatgagcca atccaccact ccataccgca 1267501 gcagcgtgct tgccgagttt cgtcgtgcga tcaccaatgt cgctgtgccc catcatgaac 1267561 cgccgggaat cgtgcgccgc cgccgtgtgg tcgtcggcgt cacgttggtt atcggcgctg 1267621 tgatgctggg cttttcgctg aggcggacgc ccggcgagtc gagcttttac tggctgacgc 1267681 tcgcgctggc agccgtgtgg atcgccggcg cactgatgtc tggaccgctg catctgggtg 1267741 gcatctgttg gcgcggtcgc aatcagcgtc cggtcatcac cgggaccact gtcgggctgc 1267801 tgctagcagg catcttcggg gtgggtgcaa tgatcgtcag ggcaattcct ggcgcagctg 1267861 aaccgatagc ccgcgtcctg caattcgccc atcagggaac tctgctgccg atcctgctga 1267921 tcaccttgat taacggcatc gccgaggaga tgttctttcg cggtgcgctc tacaccgcgc 1267981 tgggacgacg ctatccggtg accatctcaa ccgtcctgta cgtcggcgcc accatggcca 1268041 gcgcgaatct gatgctcggc ttcgcagcga tcttcgtcgg tacggtgtgt gcgttggagc 1268101 gccgggccag cggtggagtg ctggcaccga tcttgaccca cttcgtgtgg ggcctgatca 1268161 tggtgttcgc gctgcccccg ctgttcgcgg tctgacgcgc gttcaggaac cggtgaagtt 1268221 gggggtgcgg cgttgcagga acgccgctgc gccctcggcg aagtcgtgtg ttcgcagcag 1268281 gacttcctgt ccatccaatt cgcgcgcgaa cgtgggttcc aattcggtga gggcggctgc 1268341 attgatggcg tttttggcct gggcgaacgc cagcgccggg ccggccagca accgtgaaat 1268401 caccttgtcc acctcggcct cgaagtcgct gtccggatat accgcgctga tcaggcccca 1268461 ggccagtgcc tcgcgggccg gcagttgctc ggccagcagc gccagccgca tcgcccggat 1268521 ccggccggtg gccgcggcga ctaacgccga tgcgccgccg tcgggcatca acgctacctt 1268581 ggtgttggcg agcatgaaaa atgcactatc agaagccaat atgaagtcac acgccagcgc 1268641 tagcgagaca gcgacgccga ccgctggtcc ttgaacgaca gctacaaccg ggtgcggtag 1268701 cgcggccacg gcgcgtactg cgcggttggc ctcttcgacg atggcggtcg gcggccctcc 1268761 gccccacaca tcgtccacag acatagacac tccggagctg aaaccgcggc ccaccccgcc 1268821 taggcgcacc accttgacca cgggatcggc cgccgcgcgc tccagcgtgt cggcgatccc 1268881 cgtcaggatt ggcacggtca gcgagttgag actgctaggg cggttgatgc gcaccgacaa 1268941 cactctgtcg gtcagggtga cgttgaggcc tgtgaccggc gttaatgcgg caatcccgga 1269001 atctggcatg tgcagcatcc taaatgaggg ccagctacac agagtggtta atgatgctcc 1269061 gcaaacatgc ccaaccagca gttggagtaa tcggtgagta cacgggcatc gacgcggccc 1269121 agtcgcggga ccgctagcgg gccgagagcg ctcaacggcc ggtgaacatg ggggtccggc 1269181 gctgctggaa tgccgttgcg ccctcggcga agtcgtcagt acgcaggagg agggcctggc 1269241 catccaattc gcgcaggaga gtgggtgcca actcggtgag cgtggccgca ttgatcgcgt 1269301 tcttcgtctt ggcgatagcc agcgctgggc cggccaacag ccgtgagatc aacttgtcca 1269361 cctcggcatc gaagtcggcg gccggataga cggcgctgac caggccccag gacaaggcct 1269421 cggcggccgg cacccggtcc ggcagcagcg ccatatgcat ggcgcggatg cggccgatcg 1269481 cggcctgaac caacgccgac gcgccgccgt cgggcatcaa ccccacgttg gtgtgagcga 1269541 gcatgaaaaa cgcattgtcg gaggccaata cgaggtcaca agcgagcgcc agggagacgc 1269601 cacagccgac ggttggtccc tgcacgacgg caacgaccgg ttgtggtagt gccacaatgg 1269661 cacgcaccgt gcggttggcc tccgcgacgg tgtcggtagg cgggccactg gcccacacat 1269721 cgtcaacgct gattgcccct ccggagctga agccgcgacc ggcgcccccg aggcgcacca 1269781 ccttcacccg tgggtcggtg gccgcgccct cgatcgcgtc ggccatccct gccagcaccg 1269841 gcttggtcag cgagttgaga ctctccgggc gatcgatggt caccgacagc accccgtcgg 1269901 ccagggtgac ggcgagaccc gggacaattg tccgagtgtc gatccggtag ttcgacatgt 1269961 ggttaacact aatcgacgac gccgtcaccg agctgcggcg acatgatctt cgtcgatacg 1270021 ccgtcgaggg cgtcaatggg agacgaaagg ccggtacatt catggcgggt ccgctgagcg 1270081 ggttgcgagt tgtcgagctg gcgggcatcg ggccgggccc gcacgcagcg atgatcctgg 1270141 gggacctcgg tgccgacgtg gtgcgcatcg atcgcccgtc aagtgtcgac ggtatttcga 1270201 gagacgccat gttgcgtaac cggcgtatcg tgaccgccga cctgaagtcc gatcagggac 1270261 tcgagcttgc gctcaaactc atcgccaagg ccgacgtgtt gatcgagggt taccgtcccg 1270321 gcgtcaccga acggctggga ttgggtccgg aagaatgtgc gaaggtcaac gaccggctga 1270381 tctacgcgcg gatgaccggc tggggccaaa ccggcccgcg tagtcagcag gccggtcacg 1270441 acatcaacta catctcgctg aacggcattt tgcacgccat tggccggggc gacgagcgac 1270501 cggtgccgcc gctgaacctg gttggtgact tcggcggcgg ctcgatgttc ctgctggtcg 1270561 gcatcctggc cgcgctatgg gagcggcaga gctccggcaa gggccaggtc gtcgatgcgg 1270621 cgatggtcga cgggtccagc gtgctgattc agatgatgtg ggcgatgcga gcgacgggca 1270681 tgtggaccga cacaagaggg gccaacatgc tcgacggcgg ggcaccctac tacgacacct 1270741 acgaatgcgc cgacggccgc tacgtcgctg tcggcgccat tgagccgcag ttctatgcgg 1270801 ccatgctggc cggattgggt ctagacgccg ccgagctgcc cccgcaaaac gaccgcgccc 1270861 gttggcccga actgcgggcg ctgctgaccg aagcgttcgc gagccacgac cgtgaccatt 1270921 ggggcgcggt gttcgccaat tccgatgcct gtgtgacgcc ggtgctggcg ttcggtgagg 1270981 tgcacaacga gccgcacatc atcgagcgaa acacctttta tgaagccaac ggcggatggc 1271041 aacccatgcc ggctccgcgg ttctcccgca ccgcttcgag ccagccacgc ccgccggccg 1271101 ccacgatcga catcgaggca gtgctcaccg actgggacgg ataggaagga ttcgtatgaa 1271161 gaccaaagac gccgtagccg ttgtcaccgg tggcgcctca ggcctgggtc tggccaccac 1271221 caagcggcta ttggacgctg gggcacaggt ggtcgtcgtg gacctccgcg gcgacgacgt 1271281 ggttggcggg ctcggcgatc gcgcgcgttt tgcgcaagcc gacgtcaccg acgaagccgc 1271341 cgtcagcaac gcgctagagc tggcggattc gctcggcccg gtgcgggtcg tcgtcaactg 1271401 cgccggcacc ggcaacgcga ttcgcgtact gagtcgcgac ggcgtgttcc cgctggccgc 1271461 gttccgcaag atcgtggaca tcaacctagt cggcaccttc aacgtgctgc gactgggcgc 1271521 cgagcggatc gccaagaccg aaccgattgg ggaagagcgc ggcgtcatca ttaacaccgc 1271581 ctcggtggcg gcattcgacg gtcagatcgg ccaggccgcc tactcggcgt ccaagggcgg 1271641 cgtagttggc atgaccctgc cgatcgcccg cgatctggcc agcaagctga tccgggtggt 1271701 caccattgcg ccgggtctgt tcgacacccc gctgctggct tcattgccgg cggaggccaa 1271761 ggcctcactg ggccaacagg tgccgcatcc ctcgcggctg ggcaaccccg acgagtacgg 1271821 ggcgctagtt ctgcacatca tcgaaaaccc gatgcttaac ggcgaggtca tccgtctgga 1271881 cggcgccatc cgcatggcgc cgcgctaagc cgcaccaaaa gaaagacccc cgcgttgcgg 1271941 gggaccggaa tcgggaacaa gaacttaccg acgaaaccat cggctgacgg ctggttcggc 1272001 catgaggagc cgtgcaagca tgcccatggt gtcgctcagc tcgcggtggg cagcgggtgc 1272061 aagtcttcga gctgctcgga ggtgtcgccc tctaccagca tgtcgccgtg gtagagagcc 1272121 tcgaagtcag ccttgatgac gtcggcactc gagtcgtcga tccacatgac agcgagccta 1272181 aaagccgcca ttaaggaatt agtgagtcac gattcggaaa acagtggcaa ttcctaccgg 1272241 tcggtagggt gctgcgccgg catggtggcc ggcatcgcgg gcatgcggca ggtgaaccac 1272301 tcgagcgccc gcatccgtat ctatggcagg cgttgtttga cagttgtaac ttatcgcaga 1272361 taagtcatcg cggatttggt gcgggtccgc gcgaccagca ccggctgcgg aggaaacgca 1272421 acatgctgca gaggatcgct cggctcgcca tcgctgcgcc gcgccgaatc atcgggtttg 1272481 cggtcttcgt cttcatcgcc gcagcggtct tcggtgttcc ggtggctgac agcctgtcgc 1272541 ccgggggttt ccaagatccg cgatcggagt cggcacgggc aatcgaggtg ttgaccgaca 1272601 agttcggcca gagcggtcag aaaatgctga tcgtggttac ggcagccgcg ggcgccgaca 1272661 gcccacctgc ccgcgaggtc gggactgaca tcgtcgaggt gctgcggcgg tcgccgttgg 1272721 tttacaacgt gacctcgccg tggactgtgc caccgactgc cgccgccgac ctgctcagca 1272781 ccgacggaaa atcggggttg atcgtcgtca acgtcaaagg cggcgaaaac gacgcgcaga 1272841 accacgccca aaccctgtca gacgaagtcg cccatgaccg cgacggcgtc accgtccgtg 1272901 ccggcggctc ggcgatggag tacgcccaga tcaatcggca gaacaaagac gacctgctgg 1272961 tgatggagtt gatcgcgatt ccgctgagct tcctggtgct gatctgggtg ttcggtgggc 1273021 tgttggccgc cgggctgccg atggcccagg ccgtactggc cgttgtggga tcgatggccg 1273081 tattgcgact cgttacgttt gccaccgagg tgtcgacctt cgcgctcaac ctgagtacag 1273141 cgttgggcct cgcgttggct atcgactaca cgctgctcat cgtcagtcgc tatcgcgacg 1273201 agctcgccga gggcagtgat cgagacgaag cactgatccg gaccatggcg cttcggggcg 1273261 cacggtgttg ttttcggcgg tcaccgtggc gctgtcgatg tcggcgactg cgctgttccc 1273321 gatgtacttt ctgaagtcgt tcgcctacgc cggcgtggct accgtggcat tcgtcgcgac 1273381 cgcgtcgatc gtgatcaccc cggccgcgat tgtgttgcta ggtcctcggc tagatgcgtt 1273441 ggacgtgcgc cgactggtgc gtcggctgct gggccggccc gatccggtgc acaaaccggt 1273501 caagcaactg ttctggtacc ggtcgagcaa gttcgtgatg cgccgttggc tgccggtcgg 1273561 tacggctgtt gtcgcgctgc tggtgctgct cgggctgccg ttcttgtcgg tgaagtgggg 1273621 tttcccggac gaccgggtgt tgccgcggtc ggcgtcggcc cgtcaagtcg gcgatatctt 1273681 gcgcgatgac tttggccacg atcctgcgac gcagataccc atcgtcgtcc cggacgctcg 1273741 tggtctcggc ccggtcgaac ttgacagcta cgcagccgag ttgtcccggg tgcccgacgt 1273801 atccgcggta gccgccccga cgggcacgtt cgtagacggc agctgggtgg gaacgccgcg 1273861 cggggccacc gggttggctg agggcagcgc gttcctgacg gtgagcagca cggcgccgct 1273921 gttttcgcga gcctccgata tccagctcaa gcggttgcac caggtggcag ggccggccgg 1273981 tcgatccgtc gtgatggccg gtgtcgcgca ggtcaaccgc gacagtgtcg acgcggtgac 1274041 cgatcggctt ccgatggtgc tagggctaat tgccgcgatc acctacgtac tgttgttcct 1274101 gctcaccggc agcgtggtgc tgccggcgaa agcgttggtt tgtaatgtgt tatcgctgac 1274161 cgcggcgttt ggcgcgttgg tgtggatctt ccaggaaggc catttcggtg ccctgggaac 1274221 gactccgagc gggacgttgg tggcgaatat gccggtccta ctgttttgca tcgcattcgg 1274281 tttgtccatg gactacgagg tgtttctggt ctccaggatt cgggagtact ggttggaatc 1274341 cggagccgcg cgacccgcgc gaagaagcgt cgcagaggtg cacgccgcca acgacgagag 1274401 cgtcgcgctc ggcgtggccc gcaccggtcg ggtgatcacc gcggcagcgt tggtgatgtc 1274461 catgtcgttc gccgcgttga tcgctgcgca cgtgtcgttc atgcggatgt tcggcctcgg 1274521 cctgacttta gccgtggctg cagacgccac actggtgcgg atggtcgtgg tcccagcatt 1274581 catgcatgtg acgggccgct ggaattggtg ggcaccgaga cccctggcgt ggctgcatga 1274641 gcggttcggt gtcagcgagg cagcagagcc ggtttcgagg agacgttccc acgccggtgg 1274701 gttgggcaag attgccggac gaagcgacgg tcagacgatc cctgcctcgc tgacgcgcaa 1274761 tggttgacgt ctcgatgaat ggtcttcgcc ggcaacgtgc ccggcggggc cccaacgcca 1274821 cattacggca gctggcggac tgggtgcagg cacgtcgccc atcggagaaa cgacgaggac 1274881 catcggagga atcctggcca tgacgtcagg cgcggccgct tcggcgtcca gggtcgacca 1274941 cccgcttttc gcccggatct ggcccgtggt cgccgcacac gaagccgaag caatacgagc 1275001 cctccgccgg gagaatctgg ccggtttgtc ggggcgggtg ttggaagtcg gggccggcgt 1275061 cgggacgaac tttgcctact acccggtggc cgtcgaacag gtcatcgcca tggagcccga 1275121 gccgcggctt gctgccaagg cccgcatcgc ggccgctgac gcacccgttc cgatagtcgt 1275181 gacggacaag acggtcgagg agttccgcga caccgagacg tttgacgcgg tggtttgctc 1275241 gctggtgctg tgctcggtga gcgacccggg cgcggtgctg gcgcacctgc gttcgctact 1275301 acggcgaggc ggggagctgc gctatctcga gcatgtggcc agcgccggcg ctcggggccg 1275361 ggtgcagcgg ttcgtcgacg cgacattttg gcccaggctg gcgggcaact gtcacacgca 1275421 tcgccatacc gaacgcgcga tcctcgacgc cggattcgtg gtggacagct cccggcggga 1275481 gtgggcattt cccgcctggg tgccgctacc ggtgtcagag ttggctctgg gccgcgcgca 1275541 ccggacctag ctatagctag tactgcagcc gtagataggg attgctgatg ctggcgtgtc 1275601 tgcgctggtc agggcggtga ccgcggcatt gttttcagtt tgtgacaact tctcaatatg 1275661 ccgcggtcgc cgcggctcat agcgtagacc ctgatcggtg gcaggcggag ttctcggcgg 1275721 tgctggatcg gatcgcgccg cgtttcgccc ggcaccagcc gttgcgccat gccggtgaac 1275781 tcatggccgg gatggtttcg ggcttggacc gcaagaattg ctggaccatc gccgagcacc 1275841 gcggtgatac caccccgatg ggttgcagca tctgttggca cgggccagct gggacgccga 1275901 cgatgtccgt gacgatctgc gtgactatcg ccattgatcg atggcgaagg accaggtcac 1275961 cagtatatcg atgatttgaa tagtccagcg ccgacattga tgatatctgt tgacgaatac 1276021 gcttgattta cgatgttcgg ccgcgggcag cgcgctccac cagaccgagc acagcgagga 1276081 cgcgacggcc gtcagcggcg tgctgtgcct caacagcgcc gaccaatagc gaagaaatca 1276141 agtccgtgct cacccgtgac cagggtgtca tgttcgtcga cgggtagaag cttgtcgccg 1276201 cggcgatcgg ctgctctggt gccggctgtg ccgacgggtc ggtccgcatc tgcttcagtg 1276261 attctgtgat gcgaccggca acgtcttcgt tgttgggtgt caatgtggtt cgtcgtcgtc 1276321 ttgttcgcac aggattttcg cggggtggtg gtatcgattt attcgcggtt ggccgtggtc 1276381 gaggtgtggt ggtggtagcc attcggtgtc gccgtgggcg tttttgcggg tcttccagcc 1276441 tttttcgaca aggcgattgt cggggccgca ggccagcgtg aggtcgttga tgtcggtacg 1276501 gtgggtggtt gtccacggcg ttacgtggtg gacctcactg tggtaggccg gggcgtcgca 1276561 acccggcctg gagcagccac gatccttcgc gtacaacatg attcgctgcg ccggggaagc 1276621 taaccgcttg gtgtgataca acgccaacgg cttagcgccg tcaaacaatg ccagatagtg 1276681 gttggcgtgg ctcgccatcc ggataaggtc cgacatcggc acccgcgaac caccaccggt 1276741 tacccccttg ccggtggcgg cttccagctc ctttagcgtg gtgctcacca cgatcgttac 1276801 cggcagcccc ttgtgttggc ccagctcacc ggaggccaac aggccccgca gcgcggccaa 1276861 aaacgcatca tgattgcgtt gcgcctggct gcgggtgtcg cggcgcaccg cgtccgcatc 1276921 cggtgtgtca tccacgagcg gggtctggtc atcggggttg cacgcccccg gtgcggccag 1276981 tttggccaac accgcctcga tggtggcccg caactccggg gtcagcagac cgctgatacg 1277041 tgacatcccg tcaaattcct gcttacccat cgtgatgccg cgcttgcggg cacgctcctg 1277101 gtcggaaaag ttgccgtcgg ggtgcagcca gtccatcagc tgcgtggcca ggccatgcag 1277161 gtgatcggga cgccgactgg tggccagttc ggccagctgg gcctcggcgg cctcgcggat 1277221 acccagatcc accgcggcgg acaactcctt gaagaaggcc tggatctcct taatgtgttc 1277281 tcggccgatc ttgccctcac gttgagcggc cgcggtcgcg gtcaactgcg ctggcagcgg 1277341 ttcaccggtc agggcgcggc gctcaccgag gtcttcggct tcggcgatgc ggcggctggc 1277401 ctcaccggga gtgatgtgta gccggttggc caacgccgtg cgcagcgtcc cgccgagctc 1277461 ttcctcgcag gcttgcccag cgagttggtt gatcaaggcg tgctcggcgg cgccctggcg 1277521 gcgccgttcg acctcgagtc gctgcaaaca ggccagcaat tccggggtgg tcaacgcatc 1277581 gcacttgaga tcgagcaccc gcgacaacga ggcgtggtag gcatccaacg ccgcggagat 1277641 ctcctcgcgc gtgtccgacc tcatgcctcg gattctacga agcaccactg acaagaaccg 1277701 ggccgtcata ggctcggaat gatcagtgag gcagaacgtt tcgctcacag cgaaaacagc 1277761 cgcgccatag cgactgccgc caccaaatgc cgcgtgcacg cagacacgcc agcgtcagca 1277821 atccctatcc acggctgcag tactagggcg tgtctcccaa atttttaggt actggccagc 1277881 gaggattggc cggtgacgcg agtgggtgtg atttcggacg agttctgggc cgtggtcgag 1277941 ccgttgatgc cgtcgcatga gggcaagccc ggcagacggt ttagcgatca ccggcttatc 1278001 ctggaaggga tcgcgtggcg gttccgtacg ggaagtccgt ggcgggacct gcccgctgag 1278061 ttcgggccgt ggcaaacggt gtggaagcgc catcaccgtt ggtcgctgga tggtacctgc 1278121 gacgaggtgt tcgcccacgt tgccgcggtg ttcggggtgg acgctgaggt ggccgaggat 1278181 atcgagaagc tgctgtcggt ggattccacg aacgtgcggg cacaccagca ttcggcgggc 1278241 gcctgctcgg acacgctcgc cacagggggc actgtcggat tacaagaaat ccgccgatga 1278301 acccgacgat catgcgatcg gccgctcgcg cggcgggctg accaccaaga tccatgccct 1278361 gaccgatcag cgcgaagccc cggtgcggat ccggttgacc gcaggccagg ccggcgacaa 1278421 cccgcaactg ctgcccctgc tcgacgacta tcgccatgcc agcaccgaat acgccctggg 1278481 cagcacggat ttccgcttac tcgccgacaa ggcctactca cacccaagta cccgtgccgc 1278541 attacggtct aagaagatca agcacaccat ccccgaacgc caagatcaga tcgaccggcg 1278601 caaggccaag gggtctgccg gcgggcggcc accagcattc gacgccgcgc tctacgggct 1278661 acgcaacacc gtcgaacgcg gcttccatcg actcaagcag tggcgcggca tcgcaacccg 1278721 ctacgacaaa tacgccctga cctacctcgg cggcgtcctg ctggcctgcg ccgtcatcca 1278781 cgcccgagtg ggaactccga aattgggaga cacgccctag ccgagaccgg cgagcgtgca 1278841 tccagggcga gattccgccc ggcaaaccgt cgccctgagt tcacgttcgg cgcccatagg 1278901 cgactatttc agcagggcgg gcaggcgctc caacagcccc ggcaacgctt ggctggccga 1278961 ctcgcggatg ctgatcgtcg cgctgccgga caacggcgtg ggctcgggat tgacttcgat 1279021 cacggcagtg ccgcgcgcca gcgccaggtc gggtaaaccg gccgccgggt agacgatcgc 1279081 cgaggtcccc accacgacca tcacgtcggc gctccctgtc gcctcgaccg cgctccgcca 1279141 cggctcctct ggcagcggct caccgaacca tacgatgtcg ggccggatca gaccgccgca 1279201 gtcgcagacc ggcggctcca cttcgatcgc aggctcgggc atctccggaa gggcgtcggt 1279261 gtagggcaca ccacaacgtg cacaacgaaa ttcgaaaagg ctgccgtgca ggtgatgcac 1279321 cgcaccgctg ccggcgcgct cgtgcagatc gtcgacattc tgggtgatga cgctgacctc 1279381 agcatggtcc tgccaggcgg cgatcgcgcg atgcccgtcg ttgggttcga cgttggccac 1279441 cagataatgg cgccataggt accatcccca gacccgctcg gggttgcgca gccagccttg 1279501 cgtgctggac agctcgtaag ggtcgaatcg ggcccacaat ccgttcttgt catcgcggaa 1279561 cgtcggtaca ccgctttccg cggagatccc cgcgccgctg agcaccgcca ctcgcatccc 1279621 acaaacatag ctgtgcttgg tagatactgg gtacgtggag ctgcgggatt ggttacgggt 1279681 cgacgtgaag gcgggaaagc cgttgttcga ccagctcaga acccaggtga tcgacggagt 1279741 ccgcgccggc gcattgccgc ccggcacccg gctcccgacg gtgcgtgact tggccgggca 1279801 gctgggcgtg gcggccaata ccgtggcccg cgcctaccgc gagttggaat cggcggcgat 1279861 cgtcgaaacg cggggacgct tcggcacttt catttcccgc ttcgatccga ccgacgccgc 1279921 gatggctgcc gcggccaagg aatatgtcgg cgtggcgcga gcgctggggc tgacgaagtc 1279981 cgatgcgatg cgctatctca cccacgtgcc ggacgactga attccagcaa agtcaggcac 1280041 ggccgcagcg gatcgaatac gggcaggcgg taaacggtcg acagcgccat attgacccac 1280101 aggccacggc ccggtggcac ccgcagatcg cggaccgcga cgacaccagg caccttattt 1280161 accaggtccg cggcctgcgc gacggacatg ctgaacggca tgcgcggcac cttgtagcgc 1280221 agcgaggttc gtaacccgag ccggctccac ccagcgaacc agcgcggcgg caggtcgaag 1280281 agcatttggc cgccaggaaa cgtttgagcg cattgggcga ttaaccccag tgcctgttcg 1280341 ggttgtaggt acatcagtaa tccttcggcg gtgatgaaca ccccgccggc gggatcgacg 1280401 gaatccatcc agctgtagtc cagcgcagac tgggcacaca ccgacacgcg cggcgagctc 1280461 ggcagcagcc gtgtccgtaa atcgacgatc ggtggcaggt caactgtcag ccaacggaac 1280521 tggccgcccg ggatggccac gtccaaacgc caaaagctgg tttgcaagcc ctccgccaac 1280581 gccaccacgg tggccgctgg gtgctgatcg agataatgct gtgccgccat gtcgaaggcc 1280641 cgtgctcgta gggcgaagcc ctggccggta gggccgaact tcgcgaagtc gaagtcgatc 1280701 gactcgacca gggctaccgc catcggatcg tcgataatgg catcgcggcg gcgggcctct 1280761 gcggcccggg cgttcagcgt cagcaaggcg gtctcggaga ctccggtgag tgcgacccgc 1280821 tgtttggcgg gcttatgggc actcaccgca acaccttagc cagcgtgcgc aggttgcggg 1280881 tcgtggtcga cgacttgtag cgcttcttgc ccatcgtctg gccgatggtg ctgtccaggg 1280941 tgctgccctt gggtacctgc cagtagagga cgccaagagg gtcgggtcca cgactgatgt 1281001 tctcgtcagg gccggctgtg tcggcgagtg cggatagctc gtcgagtatc gcggcgtcgg 1281061 caacgaaggt gacgtacgac tggtatccct cgagctcgca ttcaaatggg tatgccgtca 1281121 cgatggtgcg caccgtatcg acgtcgtaga tcaacgccca cgcgtcgtag ccgaatcgtt 1281181 cgcgtagcgt ggcttcggtc ttctcgcgca cttccgcggc accgcacgtc gactccagca 1281241 acacgttgcc gctggccagg atggtgcgca cattgcagaa tcccgcatcg gtcaacgccg 1281301 tcgccacctc ggccatcttg aggttgacgc cgccgacgtt gacaccgcgc agaaacgccg 1281361 cgaacttggc catacccgat tgcaccaggc cgccggagaa tgacgcaacg gcgacgtagg 1281421 ctcttggcat ggcccgccaa gtcttcgacg acaagctgtt ggccgtaatc agtggaaact 1281481 ccattggggt gctggccacc attaagcacg acgggcgccc ccagttgtcc aacgtgcaat 1281541 atcacttcga cccgcgcaaa ctgctgatac aggtatcgat cgccgagccg cgagccaaga 1281601 ctcgcaacct gcgtcgcgac ccacgggctt cgatcctggt cgacgccgac gacggatggt 1281661 catacgccgt tgctgagggc actgcgcaac tgacacctcc tgcggcggcg cccgatgacg 1281721 acaccgtgga ggcgctgatt gccttgtatc gcaacatcgc tggcgagcat tcggactggg 1281781 acgactaccg gcaggcgatg gtcaccgatc ggcgtgtgtt gctgacgctg ccgatctcgc 1281841 acgtatacgg cctgccgccc ggtatgcgct aacccccggg gctgcggacc tacggactgg 1281901 gtcggattgc ctcgctgctc ggcgggccgc atcctgcggc ccgcatcgtc gcgaggctgg 1281961 gtcggattgc ctcgctcctc gccgtgccgc atcctgcggc ccgcatcgtc gcgaggctag 1282021 gctgcgggta tgggtgaatc gaagtccccg caagagtcca gctcagaggg tgagaccaag 1282081 cgcaagttcc gggaagccct cgaccgcaag atggcacagt cgtcgagcgg atccgatcat 1282141 aaggatggcg gcggcaagca gtcgcgggcg cacggtccgg tggcgagccg tcgggaattc 1282201 cgccgcaaga gcggctagcc acggggcgcg gctgctcagc ggcgacccga acgttgccga 1282261 agatgctcat caagaggtcc gtcccgacag ctctacactg aggacgtgcc aaatctgcag 1282321 cttgtccaag agccggcagc cgacgcgctg ctgaacgcca acccattcgc gttgctggtg 1282381 ggcatgttgc tcgaccagca ggtgccgatg gagaccgcct tcgccgggcc gaagaagatc 1282441 gccgatcgga tgggtagctt tgacgccggc gacatcgccg actacgaccc ggataagttc 1282501 gtcgcactgt gctcggaaag gcctgctata caccgatttc cgggctcgat ggccaaacgc 1282561 atccaggcgc tcgcgcagat catcgtggac cgctacgacg gggatgcggc cgcattgtgg 1282621 accgccggcg aacctgacgg gaacgagttg ctgcggcggc ttaaggggtt acccggcttc 1282681 ggtgagcaga aggcgcggat ctttctcgcg ttgcttggca agcagtacgg agtgacgccg 1282741 aagggttggc aggtggcagc cggggagttc ggtcagcccg gcacctatct atccgtcgcc 1282801 gatatcgtcg acgccgggtc gcttgggcag gtgcgatcgc acaagaggca aaggaaagcg 1282861 gcggccaagg cagagggaaa ggcgccaacg tgaagacaca cctgacgtgt ccgtgcggcg 1282921 aagccatcac cggcaaggac gaggacgagc tggtcgagct gactcaggcc caccttgcca 1282981 gcgttcatcc cggcctggag tacgaccgcg acgccatatt gttcatggcg tactgatgga 1283041 ccattcccgc tggtgctagg gcaccaccgt tgagccgatc gtcggcatga actggcactg 1283101 ccggtccttg gtggtcacct gcccgaagat cgttgacatg atgctgcctg aaccggtgtc 1283161 ggcgattacg gtcaacgtgg tcggtccgtc cggattgatg tccgaacgcg gccgcagggt 1283221 ggcgctgccg gactttcccg tggtcaggtt cacccacgtg acgttcagcg gcaacctctg 1283281 cacgtcggcg ggccccggcg tgccgacggc cgtgaacacg taggcggtct ggccgggtcc 1283341 gggaccgggc agcgggatct tggcgggccc cgccaccgac agcgcggtcg cgatggaatt 1283401 gctgccgtcc gccacacaat tggggccgat cgaggggtac atgaagtcct gtgtgggcgg 1283461 agcgtcggcg ccgaagcccg aggcaggcgc cggagcggcc gctggcgccg gtgccgccga 1283521 ggccggcgcc ggagcggcgg cgcgcggcgc aggtggcgcc ggtggcggcc ccggtaccgc 1283581 aaccggttgg gctgcgtcag gtgctggtgc cggcgcggaa gccggcggcg cggcgaccgg 1283641 cggcgtaacc gtaggtgcca cagcgggtgc ggggcccgcg gcgtgggatg gatcgatgcc 1283701 ggtcggaaga tgtgcctgca cacctggctc ggcgcccagt gggggcacat gcggcaccgg 1283761 gatggcctcg ggcagggcga caccatgagg cgccggcaca cccagggcag cagagtcggg 1283821 attggtcggc tcagccacaa actggttcac cgaggaagcg acgttcttgg attcggtggg 1283881 cactgccgga ttgccggcga acgcagacgc tgcggccatg agcagttgcg tcgcctgtgc 1283941 cgggttcatc gccgcttgct ggattatcgg actcagctgg gccaacgccg gcaagcccgg 1284001 tagctgttga gtggggttgg gctgcggcgt cgccgggtcg gctgccgcgt tcggacacag 1284061 cgcgaacgcg gcagccgagg tgatgacgac ggcggccaaa cctttgcaca cactccaagt 1284121 gcttgccacg gtggtgttct cccggtgttc ggtgttggtc agccttctca cagatgcgtc 1284181 agggcagcgc ggcgagcaac gacggcggcc cgggcggtaa cgcgggcgcg ccgggagccg 1284241 gcggcgtcgg cgcgatgggc gctgccggaa tgacccccga tgcgagcgcc ggtaggtcgg 1284301 ctggcagcga aagctgttgc ggcacttgga gcggaaggta gggcagctgc ggtaggtcga 1284361 ccttcgccga cggcacgccg ggaacggagg ccggcaccgc ggccgccgcc ggggccggtg 1284421 cggttatccc cgggatcggg gcgttcactc cgggaatggt cggagctgcc gccggggcgg 1284481 tgacggggag cgccggtgcc gccggggtta tccccgggat cggggcgttc actccgggaa 1284541 tggacggagt tagcgcgggg gccgcggctg ccgccggtgc ggccggcgtc agtcctggaa 1284601 aggtggcggt tatcccgggg gcggcgggtg cgggttccgc aactttgggg gcgctcagcg 1284661 gcggagtggc gcccagagcc gtcgcgaggt tttgcaggat ttgcggtgcg ttggcggccg 1284721 agctgatcag ctgctgcgga atgttgggag caggcgccgg cgccggtgcc ggatctgcgt 1284781 gagcgatacc gcccgtaagt agtgcggcgg acgaaccgac caagacggcg gcggcgcgga 1284841 caaacgtcca gatggttggc atgtctctcc ctggttagcg gtgacgggtc tcgccgaacg 1284901 tatcgcggtg cagatgtgac tcaagtgaca cgtgtggcat ttatgtgatt gttacggata 1284961 cgagtggttg tggtgaccgg gcacccgagt gatgtgccgc accctgatcg acggcccggt 1285021 gcgctcggcg atcgctaaag tcaggcagat agacaccacc tcatccaccc cggcggccgc 1285081 caggcgcgtg acctcaccac cggcccggga gacacgcgcc gccgtgctgc tactggtcct 1285141 cagcgtcggt gcgcgactcg cctggaccta tctggcgccc aacggcgcaa acttcgtcga 1285201 cctgcacgtt tacgtgagcg gtgcagcgtc cctcgaccat cccggcaccc tgtatggcta 1285261 cgtctacgct gatcagaccc cggacttccc gctgccgttc acctatccgc cgtttgcggc 1285321 tgtggtcttc tacccgttgc atttggtgcc gttcggtctg atcgcgctgc tgtggcaagt 1285381 agtgacgatg gccgcgctct acggcgcggt tcggatcagc cagcgcctga tggggggcac 1285441 cgctgagacc ggtcatttcg ccgcgatgtt atggacggcg atcgccatct ggatcgagcc 1285501 gttgcgcagc acctttgact atgggcagat caacgtgctg ctgatgctgg cggcgctttg 1285561 ggcggtctac accccgcggt ggtggctatc gggactgctg gtcggggtgg cctcgggtgt 1285621 caagttgacg ccggcgatta ccgctgtcta cctcgtcggc gttcggcggt tgcatgcggc 1285681 cgcattttcg gtggtcgtgt tccttgccac cgtcggcgtg tcgctactgg tcgtcggcga 1285741 tgaagcccgc tactacttca ccgacctgtt gggcgacgca ggccgggttg ggcccatcgc 1285801 cacctccttc aatcaatcct ggcgcggcgc gatttcccgg attctcggtc acgacgccgg 1285861 ttttggtccg ctggttctgg ctgcgatcgc cagtacggcg gtattggcca tcctggcctg 1285921 gcgtgcgctc gacaggtccg atcggctggg caaactattg gtggtcgagt tgttcggcct 1285981 gctgctctcg ccgatctcct ggactcacca ctgggtgtgg ctagtgccgc tgatgatctg 1286041 gctgattgac gggccagcgc gtgagcgccc gggcgcccgg attttgggct ggggctggtt 1286101 ggtgttgacc atcgtcggcg tgccgtggtt gctgagcttt gctcaaccga gcatctggca 1286161 aatcggccgg ccgtggtatt tggcctgggc cggtctggtc tacgtggtgg cgacgctggc 1286221 gaccttgggc tggatcgccg cctccgagcg ttacgtgcgc attcggccgc ggcgcatggc 1286281 caattaggcc ccaaacattg cgtcgatatc gtgcgccatc gcaatgtcgt tttccgtgat 1286341 accacctacc gcatgcgtaa ccagcgcgaa agttactgtt cgccaacgga tatcgatgtc 1286401 cggatgatga tttacctcct cggctcgctc ggccacccgg cgtacggcgt cgataccggc 1286461 cataaacgtc ggaaacttga ttgacctacg caggacacca ccggcgcgct gccagccgtt 1286521 gaggtcgtgc agtgcggcgt cgacctgctc atccgttaac acagccatac ctcgacggta 1286581 taccgtcaca ggtcatgctg aatcagatcg tggttgccgg agccatcgtc cgcggttgca 1286641 cggtcttggt ggcgcaacgc gttcggccac cggagttggc gggtcgttgg gaacttcccg 1286701 gcggtaaggt cgccgccggc gaaaccgagc gcgccgcgct ggcccgagag ctcgccgaag 1286761 aactgggact cgaggtcgcc gacctcgcgg tgggcgaccg tgtgggcgac gatattgcgt 1286821 tgaacggcac gacgacgctg cgggcctatc gcgtgcatct gcttggcggc gaaccgcgtg 1286881 cgcgtgacca ccgggcgctg tgctgggtga cggcggccga actgcacgat gtcgactggg 1286941 taccagccga ccgcggctgg attgcggacc tggcgcgaac cctcaacggg tccgccgcag 1287001 atgtccaccg tcgctgttag gaaaccgacg gtgtggttga cggtggccgc cgtcaacttg 1287061 gttagaacaa cgtgacaaaa cgttaacttg ggtttgcatg cccgtagcga ttacgatggt 1287121 tttctggacg cgtggcgaca acttccgggc aggacgctga cgcccatcca tcgagatacc 1287181 cgatgttgac gagaggggtc cccgacccgg cggaccgggg cttgacgggc gcaatgcggc 1287241 gcggccggcc agcccgtaac gtccagcgag tgcggtcgcg cgccgacggc ccggccccac 1287301 accgctcatg acgaggaggg tcatcccgtg accgttacac ctcacgtcgg tggaccgctc 1287361 gaagagctgc tggagcgcag cgggcgcttc ttcaccccag gtgagttctc ggccgacctg 1287421 cgcaccgtaa cccggcgcgg cggccgcgaa ggtgacgtgt tctaccgcga tcggtggagt 1287481 cacgacaaag tggtccgatc cacgcacgga gtcaactgca ccggatcctg ctcatggaag 1287541 atctacgtca aagacgggat catcacctgg gaaacccagc agaccgacta cccgtcggtg 1287601 ggcccggacc ggcccgaata cgagccacga ggttgtcccc gtggcgcgtc gttctcctgg 1287661 tacagctatt cgccgacgcg ggtgcgctat ccgtatgccc ggggcgtgct ggttgagatg 1287721 taccgggaag ccaagacccg cctgggcgac ccggtgctgg cgtgggccga cattcaggcg 1287781 gatcccgagc gcagacgccg ctatcaacag gcccgcggca agggtgggct ggtccgggtg 1287841 agctgggccg aggccagcga gatggtggcc gccgcccacg tgcacaccat caagacatac 1287901 ggcccggacc gggtcgccgg cttctcgccg attccggcga tgtcaatggt cagccatgcc 1287961 gcggggtccc ggttcgtgga gctgatcggc ggcgtgatga cgtcgttcta cgactggtac 1288021 gccgacttgc cggtggcctc gccgcaggtg ttcggcgacc agaccgacgt gcccgaatcc 1288081 ggcgactggt gggatgcgtc gtatttggtc atgtggggct ccaacgtccc gatcacccgg 1288141 acgcccgacg cacattggat ggcggaggcc cgttaccgcg gcgctaaagt cgttgtcgtc 1288201 agcccggact acgccgacaa caccaagttc gccgacgagt gggtgcggtg cgccgccggt 1288261 accgataccg cgctggcgat ggcgatgggc cacgtgatcc tgtcggaatg ttacgtccgt 1288321 aaccaggttc cgttctttgt cgactatgtg cgccgctaca ccgacctgcc gtttttgatc 1288381 aagttggaaa agcggggcga cctgctggtt cccggaaagt tcttgaccgc ggccgacatt 1288441 ggtgaagaaa gtgagaacgc ggcgttcaaa cccgccctgc tggatgagct tacgaatacc 1288501 gttgtcgtgc cgcagggctc actgggattc cgtttcggtg aggacggtgt tgggaagtgg 1288561 aacctggacc tgggttcggt ggtgccggcg ctaagtgtgg agatggacaa ggctgtcaac 1288621 ggcgatcgca gtgctgaact ggttacgctg cccagctttg acaccatcga cgggcacggt 1288681 gagacggtgt cgcgtggggt gccggtgcgc cgggcgggca agcatctggt gtgcacggtg 1288741 ttcgatctga tgttggccca ctacggggtg gcgcgtgcgg ggctgcccgg cgaatggccg 1288801 accggctacc acgaccgaac ccagcagaac accccggcct ggcaggagtc gatcaccggt 1288861 gtgccggccg cgcaggcaat ccggtttgcc aaggaattcg cccgcaacgc gaccgaatcc 1288921 ggaggacggt cgatgatcat catgggcggc ggaatctgtc actggttcca cagcgatgtc 1288981 atgtaccgct cggtgttggc gctgctcatg ttgaccggat cgatgggacg caacggcggc 1289041 gggtgggcgc actacgtcgg ccaggagaag gtgcgtccgt tgaccgggtg gcagacgatg 1289101 gcgatggcca ccgactggtc gcggccgccg cgtcaggtgc ccggcgcgtc gtactggtat 1289161 gcgcacaccg accaatggcg ctacgacggc tacggcgcgg acaagcttgc cagcccggtg 1289221 ggtcgcggca ggttcgccgg caagcacacc atggacctgc tgacctcggc cacggcgatg 1289281 ggctggagcc cgttctatcc acaattcgat cggtccagtc tcgatgtcgc cgacgaggcc 1289341 cgcgccgcgg gccgcgacgt gggtgattac gtcgccgaac aacttgccca gcacaagctg 1289401 aagctctcga ttaccgatcc ggataacccg gtcaactggc cgcgggtgct caccgtctgg 1289461 cgggcgaacc tgatcggctc gtcgggcaag ggcggcgagt atttcttgcg gcatctgctg 1289521 ggcaccgact ccaacgtaca gtccgaccct cccaccgacg gtgtgcatcc ccgggatgtg 1289581 gtgtgggaca gcgacattcc agagggcaag ctcgacctga taatgtcgat cgacttccgg 1289641 atgacgtcga cgacgctggt gtcggatgtc gtgttgcccg ccgcgacctg gtacgagaaa 1289701 tccgacctgt ccagtaccga tatgcacccg tacgtgcact cgttcagtcc ggcgatcgat 1289761 ccgccgtggg aaacccgttc ggactttgac gcattcgccg ccatcgcgcg tgctttcagt 1289821 gcgctggcga aacgtcatct gggcactcgc accgatgtgg tgctgaccgc gctgcagcac 1289881 gacaccccgg atgagatggc atatcccgat ggcaccgaac gtgattggct ggcgaccgga 1289941 gaagtcccgg tgccaggcag gacgatgagc aagctcactg tggtggagcg ggactacacc 1290001 gcgatctacg acaagtggct gaccctggga ccgctcatcg accagttcgg gatgaccacc 1290061 aagggatata ccgtccatcc cttccgggag gtcagcgagc tggcagccaa cttcggggtg 1290121 atgaattccg gtgtggcggt gggtcgtccg gcgatcacca cggctaagcg gatggctgac 1290181 gtgatcctgg cgctgtccgg cacatgcaac gggcgactcg cggtcgaggg attcctcgag 1290241 ctggagaagc gtaccgggca gcggctggct catctggccg agggcagcga ggaacgccgc 1290301 atcacctacg ccgataccca ggcgcgtccc gtgccggtga tcaccagccc ggaatggtcg 1290361 ggcagcgaga gcggtggccg ccgctacgcg ccgttcacga tcaacatcga gcatcttaag 1290421 ccgtttcaca cgctcaccgg gcgtatgcac ttctacctgg cgcatgactg ggtcgaagaa 1290481 ctcggcgagc agttgcccgt ctatcggccg ccgctggaca tggcgcggct gttcaaccag 1290541 cccgagctcg gaccgaccga cgatggactc gggctcaccg tgcgctatct gacgccgcac 1290601 tccaagtggt cgtttcactc gacctaccag gacaacctat acatgttgtc gttgtcccgt 1290661 ggcggtccga cgatgtggat gagcccgggt gacgcggcga aaatcaatgt gcgcgacaat 1290721 gattgggtag aggcggtcaa tgccaacggc atctacgtgt gccgggcaat cgtcagccac 1290781 cggatgcccg agggtgtggt gttcgtctac cacgtgcagg agcgcaccgt ggacacgccg 1290841 cgcaccgaga ccaacggcaa acgcggcggc aaccataacg cgctgacccg cgtacgaatc 1290901 aaacccagcc acctggccgg tggctacggc cagcacgcgt tcgcgttcaa ctacctgggt 1290961 ccgaccggta accagcgtga cgaggtgacc gtggtgcgcc gccgcagcca ggaagtgcgg 1291021 tactgaccaa tgaagggccc gagcgacgct tgcggagcga gacgatgaag gtcatggcgc 1291081 agatggcgat ggtgatgaac ctcgacaaat gcattggttg ccatacctgc tcggtgacct 1291141 gcaagcaggc ctggaccaat cgctcgggaa ccgagtacgt gtggttcaac aatgtcgaaa 1291201 cccgtccggg tgtgggctac ccgcgcacct acgaggatca ggagcggtgg cgcggggggt 1291261 gggtgcgcga caagaagggc cggctgcggc tgcgcgacgg cggccggatc cataagctgt 1291321 tgcgcatctt tgccaacccc aagctgccca ctatcggcga ctactacgag ccgtggacct 1291381 atgactacga aaacctgaca tcggcgccgg cgggtgacac ctttccgacc gcggcgccgc 1291441 gaagcctgat cagcggcaat ccgatgaagg tgtcgtgggg atccaactgg gacgacaacc 1291501 tggccgggtc gccagagatc gtgccgaacg acccggtgct aaagaaggtc aaccaagtca 1291561 accaagaggt caagctgaag cttgaagaga ccttcatgtt ttacctgccg cggatctgcg 1291621 agcactgcct gaacccgtcg tgtgtggcgt cgtgtccgtc gggggcgatg tacaagcgca 1291681 ccgaggacgg catcgtgctc gtcgaccagg accgctgccg cggctggcgg atgtgtgtgt 1291741 ccgggtgccc atacaagaag gtgtatttca accacaagac cggcaaggcc gaaaagtgca 1291801 ccctgtgcta tccgcgcatc gaggtggggt tgccgacggt gtgctcggaa acgtgtgtgg 1291861 ggcggctgcg ctatctgggt ctggtgctct atgacgtcga tcaggtgctg caggccgcgt 1291921 cggtggaaag cgacaccgac ctctacgagg cgcagcgccg gatcctgctg gacccgcacg 1291981 atccgcgggt gatcgccggg gcgcgcgcgg aaggcatcgc cgacgagtgg atcgaggccg 1292041 cccagcggtc cccggtgtac gcgttgatca acacctaccg ggtggcgctg ccgctacatc 1292101 cagagtaccg gaccatgccg atggtctggt acatcccgcc gctgtcgccg gtggtcgacg 1292161 cggtcagccg cgacgggcac gacggggagg acctgggcaa tttgttcggc gcgctggacg 1292221 cactgcggat tccgattgcc tatctggccg agctgttcac cgcgggcgac accgaggtgg 1292281 tcgcgggcgt gttgcggcgg ctggcggcga tgcgctgcta catgcgcgac atcaacctgg 1292341 gccgggagac ccagccccac atcccggaat cggtcgggat gaccgaggag cagatctacc 1292401 agatgtaccg actgttggct gtggcgaaat atgaagagcg ctatgtcatt ccgacgtcgt 1292461 acgcggggga gctgccggcc gcggcgatga ccgacgatat ggggtgctcg ttgtcggtcg 1292521 acggcggacc gggaatgtac gagtccggtc cgttcgggca gggcagccct actccggtgc 1292581 caatcgccgt ggagagcttc cacgctctgc agcatgccgg tagcgcggcc accggcggcg 1292641 ctggccgatc ccgggtcaac ctgctcaact gggaccccaa cggcgcagcg gcggggctct 1292701 tcccggagcc tcagcccagc aaggatgtgg tccagcgatg aagttgctgt ctcgtgtccg 1292761 agagcggtcg agcgccacca caatgaggga ccgactggtg tggcagtcgg cctcgctact 1292821 gctggcctat ccggatgacg ggctggccga gcggctgcac atggtcgatg cgctgcgcgc 1292881 ccaccaaacg ggcccggcgg cggcgctgct agggcgaacg gtagcggagt tgcgtgccct 1292941 ggcgccgatg gccgcggcgg cgcagtacgt cgagaccttc gatatgcgac gccgatccac 1293001 gatgtatctg acgtactgga ccgccgggga cacccgcaac cgcggccggg agatgctggc 1293061 gttcgccacc gcctatcgag acgccggcgt caagccgccg cgtaccgagg cgcccgacta 1293121 cctgcccgtc gtgctcgagt tcgccgccac cgtcgacccc gaggccggac gtcggctgct 1293181 gaccgagcac cgtgtgccga tcgacgtgtt gcgcggcgcg ctggccgacg ccaagtcacc 1293241 ctatgagtac accgtggcgg cgatctgcga gacactgccc gctgccacca accaggaagt 1293301 gcgtcgggca caacgcctag ctcagtcggg gccgcccgcg gaagccgttg gtttgcaacc 1293361 gtttaccttg accgtcccgc ccaagcgcgc cgagggggcc tgaccttggc cgtcttggac 1293421 ttggttgaga tcttctggga tgccgcgcct tacgtcgttg tggcgatcgc ggtggtcggc 1293481 acctggtggc ggtatcgcta cgacaagttc ggctggacca cacgctcgtc gcagctctac 1293541 gagtcgcggt tgctgtcgat cggcagcccg atgttccatt tcggcagctt gctggtgatc 1293601 atgggccacg tgatgggcct gttcattccg gattcctgga ccagagcgtt cggcatgagc 1293661 gatcacctgt accatctgca ggcgctgctg cttggcgcgc ccgccggttt cgccactctg 1293721 ctcggtatcg ggttgctgat ctatcggcgg cgcatccaga caccggtgtg gctggctacc 1293781 actcggaatg acaagctgat gtacctggtg ctggtgtgcg cgatcgtggc tggcctggca 1293841 tgcacgctga tgggcgccac ccatgagggc gatatgcacg attaccggcg ctcggtgtcg 1293901 gtctggttcc gctcgatctg gatgctagcg ccgcgtggcg atctgatggc ccaggcgacg 1293961 ctgtactacc aggtgcatgt gctgatcgcg ctcgcgctgt ttgcgctctg gccgtttacc 1294021 cgattggtgc acgcgttcag cgcgccgatc gcctacctgt tccggcccta catcgtgtac 1294081 cgcagccgcg aggtggcggc caagcacgaa ttgatcggtt ccgcgccgcg tcgtcgtggg 1294141 tggtagttct ctgccacaat caccgtcgtg ccattccgca acgttgccat cgtcgcgcac 1294201 gtcgaccacg gcaagaccac cctggttgac gccatgttgc ggcagtccgg ggcgctgcgt 1294261 gaacgcggtg agctgcagga acgggtgatg gacacgggcg atctggagcg ggagaagggc 1294321 atcaccatcc tggccaagaa caccgccgtg caccgccatc acccggatgg aaccgtcacc 1294381 gtaatcaatg tcatagacac cccggggcac gcggacttcg gtggcgaggt ggagcgcggg 1294441 ctgtccatgg tggacggggt gctgctgctg gtcgacgcct ccgagggtcc attgccgcag 1294501 acgcggtttg ttctgcgtaa agcgctggcc gcccatttgc cggtgattct ggtggtcaac 1294561 aagacagacc ggcccgacgc ccgcatcgcc gaggtcgtgg acgccagcca cgacctgttg 1294621 ctagatgtcg cgtccgacct tgacgacgaa gcggccgcag cggccgaaca cgcgctgggc 1294681 ctgccgacgc tgtacgcatc cgggcgcgcc ggggtggcga gcaccacggc gccgcccgac 1294741 ggccaggttc ccgacggcac caacctggat ccgttgttcg aggtgctcga aaagcatgtg 1294801 ccgccgccga aaggagagcc ggacgcaccg ctgcaggcgc tggtcaccaa cctggacgcg 1294861 tcgacctttc tgggtcggtt ggcgctgatc cgcatctaca acggccgcat ccgcaaaggc 1294921 cagcaggttg cgtggatccg tcaggtggat ggtcagcaga ccgtcaccac tgccaagatc 1294981 accgaattgt tggccaccga aggcgtggaa cgcaaaccaa ccgacgctgc cgtcgccggc 1295041 gatatcgtcg ccgtcgccgg cctgcccgag atcatgatcg gcgacacgct ggccgcttcc 1295101 gcgaatcccg ttgccctgcc caggattacc gtggacgagc cggcgatctc ggtcaccatc 1295161 ggcaccaaca cctcgccgct ggcgggcaag gtgggtggtc acaagctcac cgcgcgcatg 1295221 gtccgaagca ggctggatgc cgagctggtg ggcaacgtgt cgattcgtgt cgtcgacatc 1295281 ggcgccccgg acgcctggga ggtacagggt cgcggcgagc tggcgctggc ggtgctggtc 1295341 gagcagatgc gccgagaggg tttcgaattg accgtgggta agccacaggt ggtgaccaag 1295401 accatcgatg gcacgctgca cgagccattc gagtcgatga ccgtcgactg ccccgaggag 1295461 tacatcggcg cggtcacgca attgatggcc gcgcgcaagg gccgcatggt ggagatggcc 1295521 aaccacacca ccggctgggt ccgcatggac ttcgtggttc ccagtcgcgg cctgattggg 1295581 tggcgcaccg acttcctcac cgagacccgt ggctccggtg tcgggcatgc ggtgttcgac 1295641 ggataccggc catgggcggg ggagatccgg gcccgccaca ccggttctct ggtatcggac 1295701 cgggccggcg ccatcacacc gttcgcgttg ctgcaactcg ccgatcgggg gcagttcttc 1295761 gtcgagcccg gccaacagac ctacgagggc atggtcgtcg ggatcaaccc ccgtccggag 1295821 gacctcgaca tcaatgtcac ccgggagaag aagctgacca acatgcgctc atcgaccgcg 1295881 gatgtcatcg agacgctggc caagccgctg cagctggatc tcgagcgcgc catggagtta 1295941 tgtgcgcccg acgaatgcgt cgaggtgacc ccggagatcg tgcggatccg caaagtcgag 1296001 ctggccgccg ccgcccgggc tcgcagccgg gcgcgcacca aggcgcgtgg ctagcaactt 1296061 ggcgcgctgg ccgcgcgagc gtaacgccac tgcgaaatcc agcccggctt ttcgcagccg 1296121 ggttacgctc gtgggggtac tggatagcct gatgggcgtg cccagcccag tccgccgcgt 1296181 ctgtgtgacg gtcggcgcgt tggtcgcgct ggcgtgtatg gtgttggccg ggtgcacggt 1296241 cagcccgccg ccggcacccc agagcactga tacgccgcgc agcacaccgc ccccgccgcg 1296301 ccgccctacc cagatcatca tgggcatcga ctggatcggc cccgggttca acccgcattt 1296361 gctgtccgac ctgtcgccgg tgaacgccgc aatcagtgcg ttggtgttgc ccagcgcgtt 1296421 ccggccgatt ccggatccca acacgccgac cggttcgcgc tgggagatgg acccgaccct 1296481 gttggtttcc gccgacgtga ccaacaacca cccgttcacg gtgacctaca agatccggcc 1296541 cgaggcgcag tggacggaca acgccccgat cgccgccgac gacttctggt atctgtggca 1296601 gcagatggtc acacagccgg gcgtcgtcga ccccgccgga taccacctga tcaccagtgt 1296661 ccagtcgctc gagggcggta agcaggccgt cgttacgttc gcacagccct accccgcttg 1296721 gcgtgagttg ttcaccgaca tcctgccggc gcacatcgtc aaggacatac cagggggctt 1296781 cgcgtccggt ttggctcgag cgctgccggt gacaggtgga cagtttcggg tggaaaacat 1296841 cgacccacag cgcgatgaga tcctgatcgc ccgcaatgac cgttactggg gcccaccttc 1296901 caaacccggc atcattctct tccgccgggc cggggcgccg gccgcgctgg ccgattcggt 1296961 acgtaacgga gacacccagg tcgcccaggt gcatggtggc tcggcggcct tcgcccagtt 1297021 gtcggccatc cccgacgtgc ggaccgcccg gatcgtgaca ccgcgggtca tgcagttcac 1297081 gctgcgggca aacgttccca agctggccga cacccaggtt cgcaaggcga ttttggggtt 1297141 gctggacgtg gacctacttg ccgccgtggg cgccggcacc gacaacaccg tcaccttgga 1297201 ccaggcgcag attcgttcgc cgagtgaccc gggttatgtt ccgaccgcgc ctcccgcaat 1297261 gagcagcgcc gccgcgctgg gtctgctgga ggcatcggga ttccaggtcg acaccaacac 1297321 gtcggtgtcg ccggcgccgt cggtccccga ttcgacgacc acgtcggtga gcaccgggcc 1297381 gccggaagtc atccgcggcc ggatcagcaa ggacggcgaa cagttaacgc tggtcatcgg 1297441 ggtggccgcg aacgatccga cctcggtggc ggtcgccaac actgctgccg accagctgcg 1297501 cgacgtcggc atcgccgcga ctgtgctggc gttagacccg gtcacgctct atcacgacgc 1297561 gctgaacgac aatcgggtag acgccattgt gggctggcgc caagccggcg gaaacctggc 1297621 gacgctgctg gcctctcgtt acggctgtcc cgcattgcag gcgacgacgg tcccggctgc 1297681 gaatgcgccg acgacggccc cgtccgctcc cattggccct acgccgtccg ccgcgcccga 1297741 caccgcgaca ccgccaccaa cggcgccgcg ccgcccatcc gacccgggcg cgctggtaaa 1297801 agcgccgtcg aatctcaccg gcatctgcga ccgcagcatc cagtcgaaca tcgatgccgc 1297861 actcaatggc accaagaaca tcaacgacgt gatcaccgcg gtcgaaccgc gactgtggaa 1297921 tatgtcgacc gtgttgccga tcctgcagga caccacgatc gtcgcggccg gcccgagcgt 1297981 gcagaacgtc agcctgtctg gtgcggtgcc agtgggcatc gtcggcgacg ccggccaatg 1298041 ggtgaagacc gggcaatagc cctggtcacg ccggcggaat cgtcggctag ctctcgcggc 1298101 gttcgccggt ggtgaggatc atggcgtcga taatgcgtgt gagctgctca cggtccggcg 1298161 gggatccggt aaacaagaca tgctgatgga tcaaggccgg tccgattcgt gcggtcatcg 1298221 gagtcagagt tgccgggtcg atttcgccgg aacgcacgcc cgcctgcagg atggactcga 1298281 caattcgcag ccgcggggcc cacaccgagt tgatgaagat ggcgcgcagc tcgggctcgt 1298341 gtaggagctg gctgacgatt tccatgctgg ggagggccgt cttgccggcc aggatttcgc 1298401 agttggcggt gaacaccgcc agcagattct cccttgccga ccggtcagcg cgcggctcgg 1298461 gtaccggcgg caaagcgtat tgcaccgcgg ccagcaccag ctcacgtttg ccggcccacc 1298521 gccgatacaa cgcggctttg ccggtttggg cgcgtgccgc gatgccttcc atggtcagcc 1298581 cgccgtatcc ggcggattcg agttcggcca gcgtcgcatc gtagagcgca cgctcaagca 1298641 cctcgccgcg ccgccggtac gggttggcct ttgcgggtgc gctcaccgtc atgctgcgat 1298701 actagccaac tgcggctttt ccgccggcgc ggttcgatcg atgcatcagg tgaggccctt 1298761 ttgctagccg gcggcgggtg accgcagtat cactccggaa cgggttcttg ccgcgacggc 1298821 gcccacagcg cccccggcca ggcttgccaa tcccagctgg gcccacgagg ttcccgacgg 1298881 accgcccggc gcggaggtcg ggacggcttt ggccaacggc gtaatcgatt tgtccgcaac 1298941 ccagctgggg ggcaccgaca accccccgat cgtatccgca tttcccaatg ttgccgacac 1299001 cgcgggcctg accgcgttgg gtagcaactc cagcgggccc ttgagcgtgg gtgtcaggct 1299061 ccgcatggcg aggttgccca tcatctggcc catgttgctg ccctcgacaa aggccaacac 1299121 gtattccacg ggtatcgacg agattcgcgc ggctgtgagc gggtaggtgc ccttcgtcaa 1299181 cagcgtccac agcttcgaag gcagattctt ggtcgccggc gcggcgaatg accccaccgt 1299241 ctcggacacc gaaccgatca gttcataaag tctggccagc gcgcccgggt tggcgatcgg 1299301 cgccgggggc gagaacggtg tcagccgtgc ggcggccgcc gccattgtgg cgtagaggtt 1299361 catcgcctcg ccatcttggg accagtactg ggcatacagt gcatcgaggg caaagatcgc 1299421 gggggtgtga atcccgaaaa tgttggtcgt ggcgagagcg aggcgcgtca gtcggttggt 1299481 ctcgatcacc ggcagcggca cgtgtgctgc gtgcgccgct tcataggcgc ctgccacgac 1299541 gctgatgtgg tcggcgacga gttcagccag ggaagcggtc gtgacaatcc aggcccgaaa 1299601 tggggcgacc gcagctgcca tgatcgtcga cgatggcccc cgccacgatg tgatcagccc 1299661 gttgatctca ctctcgaacc gactggccgc gtagctcagc tcgttggaca gattcttcca 1299721 ggcgttcgcg gctactagaa acggacgagc gctaccttgg atgttgaggg agttgaactc 1299781 cggcggaaaa attgtgaaat ccattgtcgc tcaaccgctg tctaggtgga ggtgcccgcg 1299841 cggttggcta attcggtgag ccaatacgaa gtcttgctgg tctgaagtgt ttggacaaat 1299901 gactcgtgga tcacatgggc ctggcgcgcg atcgccttgt acagctcgcc gtgcatggaa 1299961 aacagcatcg acgtcacgat ggacacaaga tcgtgggcgg gggattccac attggtgatc 1300021 agcggcgtga ccccgtcatc atgggcgctc atcgtcaccc cgatctcgtg gaggttggcg 1300081 gccgtttccc caatcgaatc gggccgtgtg gtgacaaaag acacgcgtgc atctccttcc 1300141 actgacgtgg tctgatggtg ggggtcagcg acgacttggg gttccgcacg gcattgtaga 1300201 cggaatcgtt cactaaggta ttttcaccat aacggcttcg gtcacaaaac ggtagcgatt 1300261 ctgttgagga attttttcga cgctcgcccg gtagggtgcc tccatgtctg agacgccgcg 1300321 gctgctgttt gttcatgcac accccgacga tgagagcctg agcaacggcg caaccatcgc 1300381 gcactacacc tcccgtggcg cacaggtcca tgtcgtcacg tgcaccctgg gtgaggaggg 1300441 cgaggtcatt ggcgatcgct gggctcaact caccgccgat catgcggacc aactcggtgg 1300501 ctaccgcatc ggcgagctca ccgcggcgtt gcgagcgctc ggggtcagcg caccgatcta 1300561 ccttggcggc gcgggtcgct ggcgcgactc cggcatggcc ggcacagacc agcggagtca 1300621 gcggagattc gtcgatgctg acccccggca gaccgtcggg gcattggtcg cgatcattcg 1300681 cgagctgcgg ccgcatgtcg tggtgaccta tgaccccaat ggcggttacg gtcatcctga 1300741 ccacgtgcac acccacaccg tcactaccgc cgcggtggcc gcagcgggtg ttgggtccgg 1300801 taccgcagat caccccggcg acccgtggac ggtgccgaag ttctactgga cggtcttggg 1300861 tctgagcgcg ctcatttcgg gcgcgcgagc cctggtcccc gacgatctgc gacccgaatg 1300921 ggtgttgccg cgggccgacg agattgcatt cgggtactcc gacgacggta tcgacgccgt 1300981 cgtcgaggcc gatgagcagg cgcgagccgc caaggttgcg gcactggctg cccatgccac 1301041 ccaagttgtc gtcggcccga ccggccgggc cgccgccttg tcgaacaacc tggcactgcc 1301101 catcctggcc gatgagcatt acgtgctcgc cggcggctcc gcgggcgccc gcgatgaacg 1301161 tggctgggaa actgatctgc tcgccggtct gggcttcacc gcgtccggca cgtaggctgc 1301221 caaccaggca gccacggaag gaaccccatg gaccccgacc tggaccctaa cctgcagcat 1301281 tggcaggacc gactcgacag cctgcagtgg gtcatcgggt cgatactctc tcagatcgac 1301341 agcgtgccaa cctgaccacc ggcgcgacag atcgagcaat ccgtttggtt gtcctggccc 1301401 tgttgactgt cgacggggtc gtgtctgcgc ttgccggggc tctgctgatg ccctggtata 1301461 tcggctcggc tccgtttccg atcagtgcct tgatcagtgg attggtcaat gctgcgctgg 1301521 tgtgggccgc agcgcgatgg accacatcgt cgcgggtggc cgcgctgccc ctgtgggcgt 1301581 ggctactgac ggtagcggcg atgagcttcg gcggccctgg cgacgatgtc attctgggtg 1301641 gccagggcct gctggtctac ggcgcgctgg tgttcgtcgt ggcaggggcc gtgccaccgg 1301701 cgtgggtgct gtggcggcgc agggtccaag ctgacggatc tggctagtcc gaagttaggg 1301761 caaagacggg aatcccggcg ggctgattgg cggcaacggc ggcaggaagc cgcgtatcca 1301821 gttgatctcg gtgttgatga attggttgat cgatgccgcg gtggccgttt cgatattggt 1301881 tagtgcctgg ctgaaagtga ctgtcccgtc cacgaagtcg atcgcattga acaggactgc 1301941 ctgcacgatg ggctcgccga ggtaatagaa gaagttgatc tgcggtgcca gtatgccgat 1302001 gtagggcagc catcccaccg cccatgcggt gaggttgaag ccgtactgca cccacggttc 1302061 gacggcgttg tagagattct tgattgcgtt gccgatcgac tcggcggcca gagccggcag 1302121 cgccgcggcg gcggccgctc cggcgcgcgg cagcagggcg ccgacggcgg acaaaccgct 1302181 ggccccaccg gtgggttgga gctgcagtgc accgctggca gcggcatttg cagcggcgct 1302241 accgccaagt gcactggagc cgccagtgcc gaggaacatg ctttggaccc ggctcagcgc 1302301 gttcgagccc ccacccgtcg aggcgtcggc gctaatcagt ggatgcccca gcaacgctct 1302361 tgcgggcgca ttcacggcgt tcaccatggt ctgcgcggcg ctggcctcgg cggttgcata 1302421 cgcgtctgca cttgctctca gcgcctgcac gaactggtcg tggaaggctg tcatcatctg 1302481 ccggctgagc tgctgatacc cctgagcatg cgcggaaagc agcgcggcga cctgagtcga 1302541 gacctcgtcc gcggctgcgg ccaggactcc ggtggtggga accgccgcaa ccacattggc 1302601 ggcgttaaga gtcgaaccga taccggccat gtccgcagcg gccgccgcca gtgcctctgg 1302661 cgccgcgaac acaaacgaca tctcgtacct tctcctggtt caccacgcgg cggctgtcgc 1302721 cgggggcttg ttcagacgct ggcctctcac ggatggtatc gcgatcggct gtgacctgcg 1302781 ccttactcca ccaaaccgtt ggtgccggac ggtcgacggc gtgccgagct cggcctggcg 1302841 ctactgttgc gcttatggcg ccaaggttgg ccagcatctc acctggtggg gcgtgcggat 1302901 gatatcagat tgcagggaag gtataccaac gtgccgcagc ctgtaggtcg gaagtccacc 1302961 gctctgccga gtcccgttgt accgccccag gcaaatgcct cagcgttgcg gcgggtactg 1303021 cgacgggccc gagatggtgt cacgctgaac gtggatgagg cggccatagc gatgaccgca 1303081 cgcggtgacg agctggccga cctgtgcgcg agcgccgcgc gggtgcgcga tgcgggtctc 1303141 gtgtcggccg gccggcacgg gcccagcggc aggttggcga tcagctattc gcgcaaggtg 1303201 tttatcccgg tcacccggtt atgccgggac aattgccact attgcacgtt cgtcaccgtg 1303261 ccgggcaagc tacgcgccca aggttccagc acgtatatgg aacccgacga gatcctcgac 1303321 gttgcccgcc gaggtgccga attcggttgc aaggaagcgc tattcactct cggtgaccgt 1303381 ccggaggcgc gttggcgcca ggcacgcgaa tggctcggcg aacggggcta tgactccacg 1303441 ttgtcctacg tgcgcgcgat ggcaatccgt gtgctggagc aaaccgggct gttgccgcac 1303501 ctgaacccgg gtgtgatgag ctggtcggag atgtcgcggc tcaaaccggt ggcgccgtcg 1303561 atgggcatga tgctggagac gacctcgcga cggctgttcg aaaccaaggg gctcgcccac 1303621 tacggcagcc ctgacaaaga cccggcggtg cggctgcgtg tcctgaccga cgccggccgg 1303681 ttgtccattc cgtttaccac cggtctgttg gtcggcatcg gcgagacgct atccgagcgc 1303741 gccgatacgt tacatgcgat tcgcaagtcg cacaaggagt tcgggcatat ccaagaagtg 1303801 atcgtgcaga acttccgcgc caaggaacac accgcgatgg ccgccttccc cgatgccgga 1303861 atcgaggatt acctggcgac ggttgcggtg gcgcggctgg tgctgggccc gggcatgcgc 1303921 atccaggcgc cgccgaacct ggtgtctggc gacgaatgcc gggcgctggt tggcgccggg 1303981 gtcgacgact ggggcggtgt ctcaccgttg acgcccgacc atgtcaaccc cgaacggccc 1304041 tggcccgctt tggacgagct ggcggcggtc accgccgaag ccggctacga catggtgcag 1304101 cggctgaccg cgcaacccaa atacgtacag gcgggcgcgg cgtggatcga cccgcgggtg 1304161 cggggacatg tggtggcgct ggcggatccg gcgaccggcc tggcccgcga cgtcaacccg 1304221 gtgggcatgc cgtggcagga gcccgacgac gtggcgtcct ggggccgggt cgatctgggc 1304281 gcagcgatcg acactcaggg ccgcaatacc gcagtgcgca gcgacctggc cagcgccttc 1304341 ggtgactggg aatcgatccg cgagcaggtg cacgagctgg cggtccgcgc tccggaacgc 1304401 attgacaccg atgtgcttgc cgccctgcga tcggcggagc gtgcgcccgc cggctgcacc 1304461 gacggcgagt atctggcgct tgccaccgcc gacggtcctg cgctggaagc cgttgccgca 1304521 ctggctgatt cgttgcgccg cgatgtcgtc ggcgacgagg tgacctttgt ggtcaaccgt 1304581 aacatcaact tcaccaacat ctgctacacc ggttgccggt tctgcgcgtt cgcccagcga 1304641 aagggtgacg ccgacgccta ctcgctgtcg gtcggagagg tcgccgaccg ggcatgggag 1304701 gcccacgtcg ccggggccac cgaagtatgc atgcagggcg gtatcgatcc cgagctaccg 1304761 gtcaccggct acgccgatct ggttcgtgcc gtcaaggcgc gggtgccctc catgcatgtg 1304821 cacgcgtttt ccccgatgga gatcgccaac ggcgtcacca agagcgggct gagcattcgc 1304881 gagtggctga tcggcctgcg cgaggccggg ctggatacca tcccgggtac cgccgcggaa 1304941 atcctggacg acgaggttcg ctgggtgctg accaagggca agctgccgac gtcattgtgg 1305001 atcgaaatcg tgacgaccgc ccacgaggtg ggtctgcggt catcatcgac gatgatgtac 1305061 gggcatgtgg acagtccacg gcactgggtc gcccatctta acgtgctgcg cgatattcag 1305121 gaccgtaccg gcggcttcac cgagttcgtc ccgttgccgt tcgtgcacca gaattcaccg 1305181 ttgtacctgg ccggtgcggc gcgccccggg cccagccatc gcgacaaccg cgcggtacat 1305241 gctttggcgc ggatcatgtt gcacggccgc atctcgcaca ttcagaccag ctgggtgaaa 1305301 cttggagtgc ggcgcaccca ggtgatgctc gaaggtggcg ccaacgacct gggcggcacg 1305361 ctgatggagg agaccatctc gcggatggcc ggttccgaac acggatcggc caagaccgtc 1305421 gctgagctgg tcgcgatcgc cgaaggcatc ggccgcccgg cgcgccagcg cactaccaca 1305481 tacgccctgc ttgcggccta gccccggcga cgatgccggg tcgcgggatg cggcccgttg 1305541 aggagcgggg caatctggcc tagccccggc gacgatgccg ggtcgcggga tgcggcccgt 1305601 tgaggagcgg ggcaatctgg cctagccccg gcgacgatgc cgggtcgcgg gatggggccc 1305661 gcatgggctt aatagttgtt gcaggagccg gcaaccgact cgacaaggcc gatgtactgt 1305721 gccgcccccg gcacagcttg caattgcgcg gccatggcag cgcgctgagg tggcggtgcg 1305781 gcgaggaaat tgcgcaaata ggactgcgcc accggtgagg cgttgaactg tgcggcagcc 1305841 cccggatccg tcgcgttgag cgcagctact acctgcccgt aattgcaggt ggtgttaatg 1305901 accgcgtcca cgggatctgc ggaggcgacc ccggccccga cggtcaacga cattgccacg 1305961 gcgcctacac cggcgctcaa tgcggtcaac gacagcctca tttatggaca ccttccccaa 1306021 actattgcac cgtcgttaag acggcgacga catctgccca gcggttgccg tctgcggtcg 1306081 agggtaccag gcgccgtggg cttgcttctc tcaaactggt tatcgggcga cactgcgcgg 1306141 ccataccaat ctgcaggtca gcagcgatga aacaacgttg tttacagccc gagaaatgag 1306201 tttatagcct ggccgcaagt tcggtgcctt gcttgatggc gcgcttggcg tccaactcag 1306261 cggcaaccgc cgcgccaccg atgatgtgcg ggttaatgcc gtgccggcgc agttcactct 1306321 ccagatctcg caccggttcc tggccggcgc agaccactac gttgtccacc gccagcagct 1306381 ggggccgcct gcgcttcggg ccgaagctga tgtgtaggcc gtcgtcgttg atctgttcgt 1306441 agttcacccc agacagctga tgaacgccct tggccttcaa cgacgcccgg tggacccatc 1306501 cggtggtctt gccgagccgc ttgccctgcg ggcctttggt gcgctgcagt aggtacacct 1306561 cacgggcggg cggcgccggc agtggagtcg tcaacgctcc gcgggcttct cgcggatcag 1306621 cgacccccca ttcggccttc cactctttga ggttgagggt gggtgaggag tcggtgacca 1306681 gcagttcggt gacgtcgaag ccaatgccgc cggcgccgac gacagccacg gttcgcccga 1306741 ccggtctgac accggtgatg gcttcggcgt aggttaacac catggggtgg tcgatgccgg 1306801 ggatggccgg aatgcgcggt gccacgccgg tggccaagac gacctcgtcg tagccggtca 1306861 actcctgggc ggccacccga gtgcccagtc gcacctcgac accgtgtttg gccagaatcg 1306921 tcgagaaata ccggatggtt tcgctgaatt cctctttgcc gggaatgcgg cgggccatgt 1306981 caaactgtcc accgataaag tcgttggcct cgaacagcgt gacccggtga ccccgttgcg 1307041 cggcgttggc cgccgtggcc agcccggctg gtccagcccc gacgacggcc accgagcggg 1307101 cgcgccgggt cggggacagc accaactgcg tctcgcgccc ggcgcgtgga ttgagcagac 1307161 acgacaccgt tttcctggca aatgcgtggt ccaggcaggc ttgattgcag gagatgcagg 1307221 tgttgatttc gtcgacccga ttggactgcg ccttgagcac ccagtccggg tcgctcagca 1307281 tcggccgggc cattgatatc agccgcacct gggtttcggc cagaatccgt tccgcggcct 1307341 gcggcatgtt gatccggttg gacgccacca ccgggatagt gacgtgttcg gcgacggcgc 1307401 tgctgatgtc gacaaacgcg ccgcccggca ctgaggtgac gatagtgggc acccgggcct 1307461 cgtgccagcc gaagccggag ttgatgatgg ttgcgcctgc cccttccact tcggttgcca 1307521 gcgcgacgat ttcatcccaa ctctggcctt ctgcaacgta gtcggccatt gacagccggt 1307581 aacagatgat gaagtcgcat ccgacggcgg cgcggctgcg tcggatgatc tcgaccggga 1307641 accggcgacg gttggccggt gtgccgcccc acgagtcggt gcgcttgttg gtgcgcggcg 1307701 ccaggaactg attgagcaga tacccttcgc tgcccatgat ttcgacgccg tcgtagccgg 1307761 catcgcgggc caactgcgcg cagcgggcga aatccgcgat ggtcgcttcg accccgcgag 1307821 ccgatagtgc tcgcggacga aacggggtga tcggcgcctt gatcggcgag gcgctgaccg 1307881 caagtgggtg gtaggcgtag cgtccggcgt gcaggatttg cagcaggatc tttgcacccg 1307941 aatcgtggac cgccctggtg attcggcggt gccgtcgggc ttgcgccgaa gtgacgagtt 1308001 cggaggcgaa cggcagcagc catccggtgc ggttgggcgc gtagccaccg gtgatgatca 1308061 gcccgacgcc gccgcgtgca cgttcggcga agtagtcggc gagccgatcg atatggcggg 1308121 cccggtcttc cagtccggtg tgcatcgaac ccataaccac ccggttgcgc agcgtggtaa 1308181 acccaaggtc caacggggac agcagatttg ggtatggatt tgtcatcgct tctcctggag 1308241 cgcttcagct acttcgtcga gccaatcgat ggcactttct tcggctcgga ttccgccgcg 1308301 cagcacgagg tattgatgca gtgcggcgcc atcgagcgcc gacggatctg cgaaggtgcg 1308361 cttctcgata ccgcgatagg tgtccagtga cttgacacgc tcggcgcgca gcgcggtgac 1308421 ttgggtatac agcgcggcaa cgtctccgta gccggcgcca cgcagcttga cggcgatatc 1308481 gcgcgtgctg ctgtcggtca gcgcactgcc gcggccgggc ctggtcgggc tgagcggctc 1308541 ggcgatccag cgagccagct cggcccggcc gctgtcggag atcgcgtata ccttcttgtc 1308601 gggccggcca tgctggagca cggtcgtcgc gcgcacccag ttgttgttct ccatcacccg 1308661 taacgtccga tagatctgct gatgggttgc ggtccagaaa tagccgatgg agcgatcgaa 1308721 tcggcgggcc aactcgtagc ccgagctggc ctgttcacac agcgacacca agatcgcgtg 1308781 gggtagcgcc atccgggcag catagacggc aagccggatt gctatgcaac taggtgcata 1308841 ttgaccgtgt acgccgacgc atgtgccaag tggtcgacgt gtatgtgcaa cgtctagtat 1308901 cagtaaccga acgcattgcc tcagcagggc ccggaggaag ccttggcgag gtggacagca 1308961 gcccacacat agcggtatct ggaagacatg ttgaggagac gtccgtgacg tacacgatcg 1309021 ccgaaccctg tgtcgacatc aaggacaagg catgcattga ggagtgcccg gtcgattgca 1309081 tctacgaggg cgcccggatg ctgtatatcc accccgacga atgcgtcgac tgtggggctt 1309141 gcgagccggt ctgccccgtt gaagctatct tctacgaaga cgatgtgccc gaacagtgga 1309201 gccattacac ccagatcaac gccgatttct tcgccgagct gggatcgccg ggcggtgcgg 1309261 ccaaggttgg catgaccgag aacgacccgc aagcggtcaa ggatctggcg ccgcagagcg 1309321 aggacgcctg agccggctgg gggcagcacc cgctcgcggc ggagtgtcgg cgtctctgcc 1309381 cgtcttcccc tgggacacct tggccgacgc gaaagcgctg gccggggccc atccggatgg 1309441 catcgtcgac ctctccgtcg gcactccggt cgacccggtc gcaccgctga tccaggaggc 1309501 gctggcggcg gccagtgccg cccctggcta tccggcgacc gccggcaccg cacggttacg 1309561 tgagtctgtg gtggcagcgc tggctcgccg ctacggcatc accaggctga ccgaggcggc 1309621 cgtgttgccg gttatcggca ccaaggaact catcgcctgg ttgccgacgt tgttgggcct 1309681 gggcggtgcg gatctggtcg tcgtgcccga attggcatat ccgacttatg acgtcggcgc 1309741 ccgcctggcc ggaacgcggg tgctgcgtgc ggatgcgctg acccagctgg gtccgcaatc 1309801 cccggcactg ctctacctga actcgccgag caacccgacc ggacgggtgc tgggtgtcga 1309861 ccatttgcgc aaggtggtcg agtgggcccg gggcagaggc gttctcgtgg tttccgacga 1309921 gtgctacctg ggattgggct gggacgccga accggtttcg gtgctacatc cctcggtgtg 1309981 cgacggcgac cacaccgggt tgctggctgt gcactcacta tcgaagagct catcgctcgc 1310041 cggctaccga gcgggtttcg tcgtcggtga cctcgagatc gttgccgagc tactagcggt 1310101 gcgcaaacac gccgggatga tggtgccggc gccggtacag gcggctatgg tggccgcgct 1310161 ggacgacgac gcgcacgaaa ggcaacagcg ggagcgctac gcacaacggc gtgccgcgct 1310221 gttgccggcg ctgggctccg cgggttttgc ggtcgactat tcggacgccg gattgtatct 1310281 atgggccact cgcggcgagc cgtgccgcga cagtgccgcg tggctggcgc agcggggcat 1310341 cctggtggca ccgggtgatt tctacggccc gggtggggct cagcacgtgc gggtggcgct 1310401 gacggccacc gacgagcggg ttgcggcggc ggtcggacgg ctcacctgtt agcgcgaaca 1310461 gacgcaactt gcggccgggt caccgccagg tcgtgcgcag ctgggttgtc accgagagcg 1310521 ggttatcgcc gcggaacaga tcgaggatgg cttgcccttg tggggagtct gctggcagtt 1310581 gtcggggtgg gccgatgtgc tttcgccatg cctgtgccag atgttgccgc cgatccttgt 1310641 ttcgtgcgaa ccagcggggc acggcgtgcc aggcaaccgt gccgggcagc gatagcccga 1310701 cgacggcacg aaccgcgaac agcctccggg caacgggccg ggcgggcggc gtgaggatct 1310761 tgcgtccgat gaggtagcgc ggttcggcaa gcggcgccag tagctcgtcg agggccgcgg 1310821 tgaaccgcag ggactgctcg gtgggcacgc cgtcgagttg acaccggatc cagccttcgg 1310881 ggtcggaggc caaccgtagt gccgcggatc ctcgctgtgc gccgcccgcg gcgtacagcg 1310941 catccgcgac gacggcggcc agttgctcga gcgcgttggg cgcgtggtcc aggcggcggc 1311001 tttcggccgc cgcagcggtt gccaccaggc caacacccgc cgcgacgatg gcgccggccg 1311061 tgccggcacc cgccagcatg ccgagattgg cggaggcaac tgcggtggcg gtgctggcgc 1311121 cgaccacgga aacggcggcc acggcacccc ttgccaggcg gaccggactg aactgtcccg 1311181 gcaccggtgg ggtcaatgcc gaggcgggga tgcggggtgc ggcgaccccg agcggctggc 1311241 gggagcgcac gcggatggtt gcgacgtcga ctccttcgta gggctcgccg attcgccacc 1311301 aggatctcgc ctgggcgcgt tcggcgacgc gctgcagcgc tcgcgccgtg atggcgtggg 1311361 tatcggtgac cggaggaccg tacggcgaca gcgacggatc gcagtgcgtc acacccgatt 1311421 cgatgagccc ctgcggggtt gccgcgtagt acccgtcatg tttgcgcacc aggcgcaggt 1311481 agtcggcatc accgcgcggg tgttctgtgg cgatacagca gaccgaccag ttgtccgcca 1311541 ccttgtgacc gtccgagggg tcgttgcgga tggcgcggcc gcgcatctga gtgatcgctg 1311601 cctgggtggt tgcgctcgtc aggtcgatat tgacgttgac cgccgcgcag tcccaccctt 1311661 cacctagtag cgaacgggtg ccgaccagga cgcgggcgcg gccggccagg aagtattcgg 1311721 tagccagcgc gacccacgta cgtggcgtga agccgccggt gccgcgcatg acccgcagac 1311781 tagggtgggc gtcaagcggc tcggcggtga cgagcgcgcc gcgctcggcg cagaaggcga 1311841 tcaggtcatc ttcgatcgcg gccgggcagg cgaaggtttg acctgttacc agaagggcgt 1311901 gcagtggggt gcggcggcgg tgatccgacg cggcgagcat ggcggcaacc agctgggccg 1311961 aacccgactg ctcgctgacg ggtgcgccct tcagcgatgt gggaagggcg ccggtcatcg 1312021 attcgaaatc gcagagcacc agcgcccgca accgcgcccc caagacggcg tcctcggtgt 1312081 cgaggatgtg cgcggtcgcg gcgatcttgg attcggacag cgcgcacagt ctgtctactg 1312141 gcgaggtcgc gacgcgtacg ccgcgactgg tcagccggta gcccaggccg ggtagcaccc 1312201 gcttgatcgc ggtcagcgcg tgcgcgtcgc gcggatccgc gctttgttgc aggtgcccga 1312261 cgctgaagtc ggtcaatacg ttaacccagt cctgggcatc gggcgcaatt cggtgctgct 1312321 cgcgcaggcg cacgccgtcg ggtagtggaa tcaggccgtc gtaggcgaag cgcaggccgc 1312381 tgcacgcgag gtcgggttcg gcacgctcga acgtcgacca ggcgatctga ttgccctcgc 1312441 gcgtcgctcg atccacgatc cgggtgtgca gccacgcggc caacgacatg ctgcccacct 1312501 tttggtcgat gagggccagc atgaggtcgg cgaagcgcgc ccggtgggtg ccgatccagg 1312561 cctgctcttc gggcgtcggt tgggtcagat agaccaactc ttggtaggga gccaggtcgc 1312621 cttccctaac cagagcgggt gtcgggatca cgaagtcggc ggtgccgaac agctcatcat 1312681 gcagggtgtg ctgccacgcg gtgagctctg tggccggggt cgccgttaga ccgatcagcg 1312741 cggtctgcgc tccgaggacc gacgccaacg cactgaccag ggcgccccac gtagctagca 1312801 gatggtggca ctcatcgagc accagcgtcc acgggcctag cgtcgccgcc cgctcgatca 1312861 ccgccctccc gttggggtgc aggagatcca gcaacgcttg ctggtcgcgg ttgcgcagga 1312921 cttcccgccg gactgtcgaa tcggtttcgg cgtcgatgac ggcaagcgac tgatacgtca 1312981 ggacgttcat cgccgaggca aggccacgct cggttccaca cttcgatgcc gaccggtccg 1313041 acgacggaaa actgttatcc cacgcggcgg cccactgcgc ctgcaccgcc gtgttgggaa 1313101 ccaacaccaa actccggcgc cccagccggc gcgctgcttc caggccgatc atcgtcttgc 1313161 ccgcacccgg cggcagcacc agataggcac ggttgtcgcc ggcagcgacg tcggcgtcga 1313221 acgcgtccaa cgcttgctgt tggtataccc gccagttgcc ggcaaaggcc cgcgattcca 1313281 ggtcgcggtg aggatccaca aggattcacc ctagccaagc acccacgttg ggcgcgaaag 1313341 acgcaaaagg ccccgaatcc aacggatttc ggggcctttt gcgtctgctc gcgcccgtgc 1313401 ggctcgtgcg gatcacacgc gcggtgcatg ctgctgtggc tgtcgagcag tgttgctacc 1313461 ttaactttcc caggcctacg acgtctggta gcggcatggc aacggcctgt gagttggctg 1313521 gataatgtgt tcttcgtcgt gctgtggcct gcagattaac aagtcccaca acagttttcc 1313581 cgttgtatcg gaccttgcag catgcgatgc tttcgtcttg agccactacc atgaagttag 1313641 tacgctaaac aatcctgagc ccgaatgtgt tggtaaatgg ggtttgggag cattcaccca 1313701 cggctggtac agggggactg cgtagtgcgc accgcaaccg ccacatcggt cgccgttatc 1313761 ggcatggctt gccggctccc gggcggcatc gattccccac aacgcctctg ggaagcgctg 1313821 ttacgcggcg acgatttggt gggtgagatt cccgctgacc ggtgggacgc gaacgtgtac 1313881 tacgaccccg aacctggtgt ccctggtcga tcggtatcgc gttggggcgc ctttctggac 1313941 gacgtcggcg ggtttgactg cgatttcttc ggcctgaccg agcgggaggc gaccgcgatc 1314001 gacccacagc accgcttgct gctggaagtg tcgtgggagg ctatcgagca cgcgggtgtg 1314061 gacccggcga cgctcgctga atcacaaaca ggtgtcttcg taggactgac acacggcgac 1314121 tacgagctgc tgtccgcgga ttgcggcgcc gcggaaggac cgtacggatt caccggcacc 1314181 agtaacagtt tcgcgtccgg gcgagtggcc tacacactcg gactgcatgg ccccgcggtc 1314241 acggtggaca ccgcgtgctc gtccgggttg acggctgtgc atcaagcctg ccgcagcctg 1314301 gatgacggtg aaagcgatct cgctcttgcc ggtggtgtgg ttgtcacgct agaaccgcgg 1314361 aagtccgtct cgggttccct gcaaggcatg ttgtcgccta ccgggcgttg ccatgccttc 1314421 gacgaagcag ctgatggctt cgtgtccggt gaggggtgcg tggtcctgct gctgaagcgg 1314481 ctaccggatg cggtgcgcga cggtgatcgt gtgctggcga tcgttcgtgg caccgcagcc 1314541 aaccaggatg gccgcaccgt gaatatcgcg gcgccgtcgg cgcaggctca gatcgcggtg 1314601 tatcagcaag cgttggctgc agcgggcgtc gaagcgtcga cggtggggat ggtcgaagcc 1314661 cacggcaccg gcacccccgt tggagatccg gtcgaatacg cgagcctggc cgcggtgtac 1314721 ggaaccgagg gtccgtgcgc gctgacgtcg gtgaaaacaa acttcggtca cctgcagtcg 1314781 gcatcggggc ccctggggtt gatgaagaca atcctggcgt tgcggcatgg ggttgtgccg 1314841 cagaacctgc acttctgccg gctgcctgat cagctggctg agattgacac tgaactcttt 1314901 gtgccgcaag cgaatacatc ctggccggac aacaccggac agccacgtcg cgctgcggtt 1314961 tcctcgtatg gaatgtcggg taccaacgtg catgccatct tggagcaagc gccggtatca 1315021 gaaccagcgg cttcgggacc tgagctcact cccgaagccg gtgggctggc gttgtttccg 1315081 gtgtcggcta cctcggctga gcaactacac gtcacggccg cccggctggc ggattgggtc 1315141 gaccagaacg gcaacgcggg cagtcgagtt agcatgcggg acctgggcta aacgctgtcc 1315201 tgccgccgtg cacaccgacc cgtccggacg gttgtgacgg cgagcagttt tgacgagctg 1315261 agcgcggcgc tgcgggacgt cgctggcgat cagattccct atcagcccgc agtggggcac 1315321 gacgaccgcg ggccggtgtg ggtgttctcc gggcaaggct ctcagtggcc cgggatgggc 1315381 actgaactgc tggtagccga accggtgttc gccgccaccg tcgcggcgat ggagccggtg 1315441 atcgctaggg agtcagggtt ttcggtgacc gaagcgatgt cggcgccaca gacggtcagc 1315501 ggtattgacc gggtgcagcc caccatcttc gcggtgcagg tcgccctggc cgcggccctg 1315561 aagtcgtatg gggtacgtcc tggtgccatc atcgggcact cgctcggcga ggctgcggca 1315621 gccgtggtcg ccggagcact gtcgctgcac gacggattgc gagtcatctg ccggcgctcg 1315681 cggctgatgt cgcgcatcgc cggtagtggc gcgatggcat cggtggaact gcccggccaa 1315741 caagtgttgt cagaacttgc gattcgtggg atctccgacg tcgtgctctc ggtggttgcc 1315801 tctccgacct caaccgtcgt cggcggcgcc acgcagtcga tacgtgacct ggtggcggcc 1315861 tgggagcagc aggatgtgct ggcgcgcgag gtagctgtgg acgtcgcttc acatacaccg 1315921 caggtcgatc ccatcctgga cgagttgctc gaggtcctgg ccgaggtcga tccgacggcg 1315981 ccggaaattc cgtattactc cgcaacgttg tgggatccgc gcgagcgacc gtcgttcacc 1316041 ggcgagtact gggtggaaaa cctgcggtac acggtgcgat tcgcggcggc ggtacaggcc 1316101 gcgctcaagg acgggtaccg agtgttcggc gagctggctc cgcatccgct gctcacctac 1316161 gcggtcgagc agaacgccgc cagtctcgac atgccgatcg caacgcttgc cgcgatgcgg 1316221 cgcggggaac agctgccgtt cgggttgcgc ggcttcgtcg ccgacgtgca caacgccggc 1316281 gccaaggtgg acttctctgt ccagtaccct gatgggcgct tggtggatgc gccattgccg 1316341 agctggacgc accgcaccct gatgctcagc cgtgaggatt cacaccgctc gcacaccggc 1316401 gcggtccagg cggttcatcc gctgcttggg gcccatgtgc acctgttgga ggaaccggag 1316461 cgtcacgtct ggcaggccgg ggttggcacc ggggcgcatc cgtggctcgg tgaccatcgg 1316521 atacacaacg tggctgcgtt tcccggtgcg gcctactgtg agatggcatt ggccgcggcg 1316581 cgcaccactc ttggcgagct gtcggaggtg cgcgacatca agttcgagca gacgctgttg 1316641 ctggacgagc agacggtggt ctcatcggcc gcgacgatcg ccgcgcctgg gatcctacag 1316701 ttcgcagtcg agagtcatca ggaaggcgag cccgcacggc gggccagcgc gatgctgcac 1316761 gcattggagg agatgccgca gccgcccggg tacgacacga acgctctgac cgccgcccat 1316821 gagtccagca tgagcggtga ggaactgcga aaaatgttta acagcttagg tattcagtat 1316881 ggtccggctt tttcaggcct agttgcggtg cacacggcgc gcggggacgt caccacagtg 1316941 ctcgccgagg tcgcgctgcc tggagccatc cgatctcagc agtcggcata tgccagccac 1317001 ccggccctgc ttgatgcgtg tttccagtcg gtgcttgttc atcccgaggt ccagaaggcg 1317061 actgtcggtg gtctgatgct gcccgtgggc gtgcgtaggc tgcgcaacta tcactcgacg 1317121 cgcagcgcgc actactgcct cgcccgggtc acgtcatcgt cgcgagccgg cgaatgcgaa 1317181 gccgatctcg acgtgttcga ccaggccgga acggtacttt tgaccgtcga gggattacgg 1317241 ctggccgcag ggatttccga acatgaacgc gcgaaccggg tgttcgacga gcgattgttg 1317301 accatcgagt gggagcgggg tgagctgcct gaggtgccgc agatcgatgc gggatcctgg 1317361 ctgctgctca gtgcgtccga agctgatccg ctgaccgcgc aactcgccga cgcgttgaat 1317421 gccgttggtg cccagagcac tagcgtggct tcggcgtcgg atgtcgcaca attgcgttcg 1317481 ctgctcggag gcaggctcac cggtgttgtc gtggtgactg gcccgccaac gggtggtttg 1317541 acacagtgcg gccgcgacta tgtgtcacag ctggtgggta ttgcccgcga gctcgcggag 1317601 ctgcccggtg agccgccgcg gctgttcgtg gtgaccagga gcgcggcgag cgtgctgccg 1317661 agcgatcttg ccaacttgga acaggcggga ttgcgtggac tgatgcgggt gatcgattcc 1317721 gagcatccgc acctgggtgc caccgcaatc gacgtcgaca acgacgagac cgtcgctgcc 1317781 ctggtggcca gccaactaca gagcgggtcg caggaggacg aaaccgcttg gcgcaatggc 1317841 atttggtaca ccgcccggct gcgtcccggt ccgttacgcc cggccgaacg gcgaaccgcc 1317901 gtcgtcgaat acagacgcga cggtatgcgc ctgcagatcc gcactcccgg cgacctcgag 1317961 tcgttggagt tcgtcacatt cgaccgggtc gcgccgggac cgggcgagat cgaggtcgcg 1318021 gtgaccgcat cgagtgtcaa cttcgccgac gttctggtcg ctttcgggcg gtatcccacc 1318081 ttcgagggct accgacagca gttgggcatc gacttcgccg gtgtggtgac cgcggtcggg 1318141 ccggatgtca ccgagcatcg gatcggtgat cacgtcggcg gcatgtccgc caatggctgc 1318201 tggagcacat tcgtcagatg cgatgcccgg ctggcggtga cgctcccgcc cgagctgccg 1318261 gtggccgccg ccgccgcggt accgaccgcc tccgcgacgg cttggtacgc cctgcacgat 1318321 ctggctcgca tctgctcgga cgacaaggtg ctgattcact cggggaccgg tggtgtcggg 1318381 caggcggcga tcgcgatcgc acgggccgcc ggatgcgaga tcttcgccac cgcgggcagt 1318441 gcccagcggc gacaactgct gcacgacatg ggtgtcgagc atgtctacga ctcacggagc 1318501 accgagttcg ccgagcagat ccgaggcgac accgatgggt atggtgtcga cgtcgtactc 1318561 aactcgctgc ccggcgccgc acaacgtgct gggatcgaat tgctggcctt tggcgggcga 1318621 ttcgtggaga tcggcaaacg tgacatctac ggcgacactc ggctcgggtt gttcccgttc 1318681 cgccgcaacc tgtcgctgta tgccgtcgac ttggcgctgc tgacacacag ccacccgcac 1318741 accgtccggc gcctgctgaa aaccgtctac caacacacgg tcgagggcac gctgccggtg 1318801 ccgcagacca cgcactatcc cattcacgac gctgccgttg ccattcgttt ggtcggcgga 1318861 gccgggcaca ccggaaaagt ggtgctcgat gtgccgcgta ccggtgaagg cgtggccgtg 1318921 gtgccccccg aacaggtccg cacgtcccgg cccgacggcg cctatctcgt caccggtggt 1318981 ttgggcggcc tcggcctgtt ccttgccggc gagctggcgg cggcgggctg cggacgcatc 1319041 gtgctcaact cccgttcgac gcccagcccg cacgccacca gggtcatcga gcggctccgc 1319101 gccgccggtg ctgatatcca ggtggaatgc ggtgacatcg ctgatgccgc aacggcccac 1319161 cgagtggtgg cggtggccac cgcctcgggc ttgccggtgc gcggcgtgct gcacgcggcg 1319221 gcggtggtcg aggacgctac gttggccaat gtcaccgacg aacttatcga ccgctgttgg 1319281 gcgccgaagg tacacggcgc gtggaacatt catcgggcca ccgccgcgca gccactggag 1319341 tggttctgct tgttctcctc ggccgcggcc ttggtgggct cgccgggtca aggcgcatat 1319401 gcggcggcca acagctggtt ggacgctttt gcccactggc ggcgggcgca gggccttccg 1319461 gctacctcaa tcgcctgggg agcatgggcc gagattggcc gcgctaccgc gctggccgaa 1319521 ggcaccggcg cagcgatcgc gcccgccgag ggtgctcgag ccttccagac gctgcttcgc 1319581 tacggccggg cgtactccgg ctatgccccg atcatgggta ccccatggtt gacggccttt 1319641 gcgcaacgta gccgatttgc cgaagcgttc cacgccacgg gccaaaatca accggccacc 1319701 gggaaattcc tcgccgaact gggcagcttg ccccgcgaag agtggccccg cacagtcagg 1319761 cggttggtat cggaccagat cagcctgctg ctgcggcgaa ccattgatcc ggaccggccg 1319821 ctgtccgact atggtttgga ttccttgggc aacttggagt tgcggacccg catcgaaacc 1319881 gaaacgggta tacgcgtcag tcccacaaag atcaccacgg ttcgcggctt ggccgagcac 1319941 gtgtgcgacg agctggcagc cgcccaatct gcgccggtct gatgacggcc cgggtgaagt 1320001 cgttgcggaa gtttgagatc gagccgagga gggcatgttg cgggttggac cgttgacaat 1320061 aggcacgctg gacgactggg cgccgagcac gggttcgact gtgtcatggc gaccttcggc 1320121 tgtcgcgcac acgaaagcgt cgcaggcgcc gatcagcgat gttccggtca gttatatgca 1320181 ggcgcaacat attcggggct attgcgagca aaaggcaaag ggactcgact actcgcggtt 1320241 gatggtcgtc agctgccagc agcccggcca gtgcgatatc cgggcggcca actacgtgat 1320301 caacgcccat ctccgacggc acgataccta tcgcagctgg ttccaataca acggcaacgg 1320361 acaaataatc cggcgtacga tccaggatcc cgccgacatc gagttcgtac cagttcatca 1320421 tggtgagctc acgctgccgc aaattcgcga gatcgtgcag aacacgccgg atcccctgca 1320481 atggggttgt tttcggtttg ggatcgtgca aggctgcgac catttcacat tctttgcaag 1320541 tgtggatcat gtgcatgtgg acgcgatgat cgtcggtgtc acgctcatgg agttccacct 1320601 gatgtacgca gcgctggtgg gcggccatgc ccctctcgag ctaccgccgg caggcagcta 1320661 cgacgacttc tgccgccgac aacacacgtt cagctccacc ctcacggtgg agtcgcccca 1320721 ggttcgcgcc tggacgaagt tcgccgaagg tactaacggt agctttcctg attttccact 1320781 cccacttggt gacccatcga aacccagtga cgcggatatt gtcaccgtga tgatgctcga 1320841 tgaagagcag acggctcaat tcgagtccgt ctgcacggct gccggcgctc ggttcatcgg 1320901 tggcgtacta gcctgctgcg gcctggctga acacgagttg accggtacga caacctatta 1320961 cggactaacg ccgcgcgaca cgcgccgcac tccagcggat gccatgaccc aaggttggtt 1321021 caccggccta attccgatca ccgtccccat cgccggctcg gcgttcggcg atgccgcccg 1321081 agccgcgcag acctcgttcg actcgggcgt gaagctcgcc gaagtaccct acgaccgcgt 1321141 cgtcgaattg tcgtccacgc taaccatgcc acgaccgaac tttcccgtcg tcaacttcct 1321201 cgacgcaggc gcggctccgc tttcggtact gctcaccgcg gagttaaccg gtacgaacat 1321261 aggagtgtac agcgacggtc gctactctta tcaactgtcc atctacgtca tccgcgtcga 1321321 gcaggggacg gcagtggcgg tcatgttccc cgacaacccg atcgcccggg aatcggttgc 1321381 ccgctacctg gcaacgctga agtctgtgtt ccaacgagtc gccgagagcg ggcagcagca 1321441 gaatgttgcc tgattcattc ccggtggtga acccatcttc gcgcggctag gtgaactcgt 1321501 cgcccggcgg ccttgggttg tggtcggctg ttgggtcgcg ctcgccctgg tactgccgat 1321561 ggcggtgcct tcactggcgg agatggctca gcgacatccc gtcgcggtcc tgcctgccga 1321621 cgcgccctcc agcgtcgctg ttcgccagat ggccgaggcg ttccacgaat ccggctccga 1321681 gaatatcttg gtagtgctgc tcaccgacga gaaaggcttg ggagcggcgg acgaaaacgt 1321741 ctaccacaca ttggtggatc gtctgcgaaa cgacgctaaa gacgtcgtga tgctgcagga 1321801 cttcctgact actccgccat tgcgtgaggt gctcggtagt aaagatggca aggcatggat 1321861 tctgccgatc ggtctcgcgg gcgacctggg tacacccaag tcctaccacg cttacaccga 1321921 cgtcgaacgc atcgtgaaac gaactgtggc cggaaccacg ttgacggcaa acgtgacagg 1321981 acccgcagcc acggtggcag acctgaccga cgctggggct cgggatcggg cttcaatcga 1322041 gctggcgatc gccgtgatgt tgctagtcat cttgatggtc atctatcgca acccggttac 1322101 catgctgttg cccctggtga cgattggcgc atccttgatg accgcgcagg cgttggttgc 1322161 cggcgtgtcg ctcgtcggcg gtctagccgt atccaatcaa gcgatcgtgt tgctcagcgc 1322221 aatgatcgct ggtgcgggaa cggattacgc cgttttccta atcagccgct atcacgagta 1322281 tgtgcggctc ggtgagcatc ccgagcgtgc cgtccagcgg gcgatgatgt ccgtcgggaa 1322341 ggtgatcgcc gcgtccgcgg caacggtcgg aatcaccttc ctcggcatga gattcgccaa 1322401 actcggtgtg ttctcaacgg ttggcccggc tctggcgatc gggatcgcgg tgtcgttctt 1322461 ggccgcggtc accctgctgc ccgccatcct ggtgctggcc tcaccgcgcg ggtgggtcgc 1322521 accgcgcggt gaacgcatgg cgacattctg gcggcgggcc ggaacgcgaa tagtgcggcg 1322581 gcccaaagct tatctaggcg ccagcttgat tggtctggtt gcattggcca gctgcgcgag 1322641 cctggctcac ttcaactacg acgaccgcaa acaattgccg ccttcggatc cgagttcggt 1322701 tgggtacgcg gcaatggagc accatttctc ggtgaatcag actattcctg agtacttgat 1322761 catccactct gcacacgacc tgcgaacccc gcgcggcctt gccgacctgg agcagctggc 1322821 gcaacgtgtg agccagatcc caggcgttgc catggttcgc ggtgtgaccc ggccaaacgg 1322881 ggaaaccctt gaacaggccc gggcgacata ccaagccggc caagttggca accggctggg 1322941 cggcgcgtcg cgaatgatcg atgagcgcac cggcgacctg aatcggctgg catcgggtgc 1323001 caacctgttg gccgacaatc tcggtgacgt tcgcggtcaa gtcagccggg ccgttgcggg 1323061 tgtccgcagc cttgtcgacg ccctcgctta catccagaac cagttcggtg gcaacaaaac 1323121 attcaacgaa atcgacaacg ctgcaaggct tgtcagcaat atccacgcgc tcggtgacgc 1323181 tctgcaggta aactttgacg gtatcgccaa cagtttcgat tggcttgact ctgttgtcgc 1323241 cgctttggat accagcccgg tctgtgacag caaccctatg tgtggcaacg cgcgcgttca 1323301 gtttcacaag ctgcaaaccg cacgtgacaa tggcactctc gacaaggttg tcggcctggc 1323361 gcgtcagctg cagtccacgc ggtcaccgca gaccgtgtcg gcggtggtga acgatctggg 1323421 gcgatcgctg aattcggtag tccgctcgct gaaatcactg gggttggaca atccggacgc 1323481 cgcccgggcg cgcctgatca gcatgcaaaa tggagctaac gacctcgcca gcgccggtcg 1323541 tcaggtcgca gacggcgtcc agatgctggt cgaccagacc aagaacatgg gcatcgggct 1323601 gaaccaggcg tcagcctttc tgatggcgat gggcaacgat gcgtcgcaac cgtcgatggc 1323661 gggtttcaat gtcccgccgc aagtgctgaa gtccgaggag ttcaaaaaag tcgcccaggc 1323721 gttcatctcg ccagacgggc ataccgtgcg gtacttcatt cagaccgacc tcaacccgtt 1323781 cagcactgcg gccatggatc aggtcaacac gatcattgac acagccaaag gtgcacagcc 1323841 aaatacctcc ctggctgacg cgtcgatatc aatgtcgggt tacccggtca tgctgaggga 1323901 catccgcgat tactacgagc gcgatatgcg gctcatcgtc gctgtgaccg tcgtcgtggt 1323961 gatcctgatc ctcatggcac tgctgcgtgc gatagtggcg ccgctgtacc tggtcggttc 1324021 ggtggtcatc tcgtacatgt cggcgatcgg gcttggtgtg gtggtgttcc aggtgttcct 1324081 ggggcaggaa ttgcactgga gtgtgcccgg cctagcgttt gtggtgctgg tcgccgtggg 1324141 tgcggactac aacatgctgc tggcgtcgcg gttgcgggac gagtcggcat tgggagtgcg 1324201 ttccagcgtg attcgcacgg tgcgttgcac gggcggagtg atcacggcag cgggtctgat 1324261 atttgccgct tcgatgtccg gcctgctgtt ctccagcatc ggaaccgtcg tccaaggcgg 1324321 cttcatcatt ggggtcggga tcctgataga cacgttcgtg gtgcggacca tcaccgtgcc 1324381 tgccatggcc acgctgctcg gacgcgcaag ttggtggccc ggacaccctt ggcagcggtg 1324441 cgcacccgaa gaaggccaga tgtcagcccg gatgtcagcg cgcacgaaga cggtatttca 1324501 agccgtggca gacggatcaa agcggtagtg tttagccgcc gaaggcgggg gagcccagta 1324561 agccgcgggc accttccacg atcgagcccg gagcggtcag cggatccagg cctcgcaccg 1324621 gatccacgga gaccggccgg gtgaaccaat tgtcgttgcg tgcataggcc gcgtcgatct 1324681 gtggttgcag cacactgtcg atttgatcga cttcggcgtc ggacatgccg aggtaacgca 1324741 gcggcaaggt caacgggagg tggttcacgg ggaccagata cgtcgtcgtg gtagcgcctc 1324801 gtgagttgac ggtggtcctg atgttctgcg ggggtacgtc accgggtccg gtgaacccga 1324861 ttggggtgtg cgcgatggca gcgccgatgg ccgcattggc gaccgctaac agattgtccg 1324921 gccggtccgg gaagtcgctg aagccgtcgt atgcggtgac gacatggttg gtgtcgtact 1324981 ggctatccac ctgctggggc atcgtatatt cgatgaaggg aatcggaatg tggctaccgg 1325041 ggggaaaaat tcgggccagg aagctcgctc cgaacgcatg acgtccggtg gggtcgccga 1325101 acgtcgtgaa ctgcagcttg tccggtgcag gtgccgtcgg gtcgttggcg agccgcgcct 1325161 gctcctggtc gagcacgagg gaaccctggg ataggccgac ggccgcggct ggatcggttc 1325221 cgtgatgaat tgcgttatca aggctgtttg tcccatcttt gaccgccacg cccaccgtca 1325281 tgttgtcttg gtggctccct ggcggcaaaa gcatggtggg ccaccagctg aaggccgctc 1325341 cggcgggata gtcgatgaga tcgtgctttg cgtttgggaa atattgagag ccagcctggt 1325401 tcgtgtactc gtaccaggga atgcccggca ttcgcgcgcc cccgagggcg tagacgactt 1325461 tggcggttga agcgtcgccc accggagagg gggacggagg tggcccagga gcccacgggt 1325521 acgcgggttc gcttgccgca atagcggttc cgaatccacc ggcccaaccc acgagccaga 1325581 ccgcgaatgc tcccgcaatc actcgcttca tctgcctctg catcgagaat cgcgtgcgtg 1325641 aaagcatagg aaagcagcta tcgttcggcg gttttcgggc ggttatgtcg ccatatctta 1325701 gtcagccacg tcccggccga cattaaagtt ggcagccaac aagctgtgaa tcgccctggg 1325761 tcagccccga ctagctcagc cgtccaaccg ggtgaattgc tgcagccggt attgctctac 1325821 acaggcggcc cttctgatct tgccgctggt tgtggtgggg atcgacccgg gcgggaccaa 1325881 gacgaggtcc gccacgttga gaccgtgcga gcgtgatatc gcggctgtga cgttgttctt 1325941 gatgacatcg agttcgtcca tcgcttcgcc ggcggaatcg ccgaggagct tgagctcgat 1326001 gacagtgact aacttctctg tgtgatcgac cggaactgaa atcgcagcga cccgaccacc 1326061 agtgatctcc tggacggtcg actcgatgtc ctcggggtag tgattgcgcc cgtatacgat 1326121 cagcatgtcc ttcatacggc ccacgatgaa catctcgtcc tcggagagga atccgaggtc 1326181 tcccgttcgc aaccaggatc catcaggagt acctgccgag gggtggacca gcattgcgcc 1326241 aaaggtgtgc cgtgtctcgt ccggtttgtt ccagtagcct tcggcgacgt tgtcgccctt 1326301 cacccagatc tcgccgatcg ttcccgcggg gcactcaatg caggtgtcgg gatccacaat 1326361 tcgcactgtt ggtgatgtcg gcatgccata gctcagcagc ggtgtgccgg tcttgggttc 1326421 acatcgattc gcactgcccg tggacagctt gtcaggttcg aagtagacga cttctggctt 1326481 gtcacccgaa ttgcggctgg ccacataaag agtcgcttcc gccagaccgt acgaaggccg 1326541 tatcatgtct tcgcggaaat tgtacggtgc aaaccggttg cagaatctac tgagcgtgtt 1326601 ggggtggact cgttcagcac cactggtgat gcccaggacg ttgccgaggt cgaggccttc 1326661 tatgtcggca tctgttgtct tgcggacggc caattcgaag gcgaaattcg gtgcggccga 1326721 ccacgaagga cttccgttgg ccagcgaatg tagccaacgc gctggccgtt gcaggaacgc 1326781 cagcgggcta gtgagttcac tgcggtagcc gcccaggatc ggtgcgatga tgccaaggac 1326841 caagcccatg tcgtggtaga acggcagcca cgacacgatg gtagtgtcag gtggcgccac 1326901 accgttgcgg tcgccgaagt agttcgacat cagctgttgg aaattcgcct gaaggttccg 1326961 atgcgagatc atgaccccag ccggagcgcg ggtggagcca gaggtgtact gcaagtacgc 1327021 ggcgcttggc agatccttca cccgaaagct cggtgaattc ccggtcaagt ccaatgaatc 1327081 gatttcgatg atcggcccta cgttgttcgt gttcggccgg tggatgtgct cggcaaccgc 1327141 ttctgcgacc gcagatgttg tcaggatgac cgaaggtgac gcgtcggcaa gcaccgcgct 1327201 gacacgttcg tcgtgagagc cgatctgcgg gactgacaac ggaaccgcta tcgctccggc 1327261 ctgcatcgaa cctaggaaag ccgcgatgta ggccaggccc tgcggagcca gaatcacggc 1327321 tcggtctccg gtcgtgcaat gccgcctgac ttcgtgagca acgatgcggg tccgtcgaaa 1327381 cacctctgac cacgtgagcg tttcggtgat gccggcccaa tcctgttcgt agtcgatgta 1327441 cgtgaacgcg gcgtcgtcgg gctgcaggcc ggcacgctcg cgcagcaagg acaagacaga 1327501 agagtcggac attggtgcta cattaccgtt tcgcgcgatc tccgataacc caagcgggca 1327561 gggggatggt tggcgatagc gatgctgatc ataacgttct gcaatgctgt gcatgtgctg 1327621 aaacaggttg acgcagagtc gaagtcggtg tacgcagggg cgccgtgagg ggcgtcacgg 1327681 tcgagttgct aagccgtgcg ttccatggcc cgcagcccca gcgaaaagag cagccgcaca 1327741 tccggatcgc ccagcgaggt cgacaacagc tgctcgatcc gccttatccg gtagcgaacg 1327801 gtgttgggat gcacttgcag tgaccgtgcg gcggcgccga tgtcgccgaa ggcatccagg 1327861 taggcacgca gggtctgagc cagcaccggg tcctgggcgc ccaggtcacg tatccgagga 1327921 tcgacgagcc gctggtcggt gccgaccagg gtgacgattt cgtcgagcag aacggtggtg 1327981 cgtgcctcgg ccagcgatgt cacctgcccc aagatcgggt ggcgctcggc actctcgagt 1328041 acccgatcca cctcgacgcg tgccgggttg acttcggcaa gtcccgcgac cggccccgcg 1328101 atggctgccc gtagtgctac tcccagctcg gcgcgcagtg cgctgattgt gccgcggacc 1328161 cacgaggtga cagctcggcc ggtcgtggtt tggggcagca gcacatagat ccgtgagccg 1328221 ttggcggcaa cctgagcgtc gtggcgaaaa gcgctggcgc tcaatgccat gacgtcaaca 1328281 agccgaacat ggcggactgc ggtatcgcgg ttttccgcgg tgtcgaaacc gatcagcgtt 1328341 gcgttgccct cggcggcgac gccgagttca cgggcgatgg tcgatacgtc gacgggtgct 1328401 gtggttgcgt tcagctcggc caggcccagt agttgctgta cccgcagcgc gtgcgtattg 1328461 ggctgggtcg ccagtcgcga catgatccgg gcggccagca ccgcagcacc ccgcaacatc 1328521 tcctcggcat cgtcggccaa cggctgcgag ccttgctgga cccagatcgt gccggcgaac 1328581 accggtggcc gcagcgcacc gacccccggc tgatgaatcc cgatggctag ccgaggacgc 1328641 aaccccagct cggggcgctc ggccacccgc accacctcac ggccgggccg cagggcatcg 1328701 aagatgcccc attgacctat ccactgcaga tgctcgggcg ggccggcgcg gcccaggatg 1328761 gacagccgac gcagctcgtc ggcctcgtcg ttggaggccg agtaggcgag cacgtgcgac 1328821 tgggcgtcct cgatgctgat catgccgtgg atgcggtcgg ccagggactg tgccaacccg 1328881 aacaggtcgg ttccggaatc gtcggtgggg tcggcccggt caccatgatg ctccaagaca 1328941 tgattcacca agtggtacag ccgttcccag cgggcccgcg gctccacggc taccaccgcc 1329001 gagccggcgc ggacggcccc ggccaccacc gagtccgacg ggtgcttgac gaagatcgcc 1329061 accggcgccc gttggcgtgc ctgatcgtcg acccagcgca ccgcctcgtc gtcggtgacc 1329121 ccgatcagga agaacacatc ggccgagccc gccgcggccg ccaggcccag ccgcacgtcg 1329181 tcggaatcga tcagcgccgt cgacgccacc ggcaggtcca ggccgcgcgg ggcgtccacc 1329241 aggctgacca cggtcgcatc cagcgccagg agcaactggc cgagccccac gccggcgatc 1329301 cgcatgttgt ccgatcctac tagcaagtcc gccagatctt gtctgatcgg ccaaacattt 1329361 gcgatgcctg ggcggggatg ctggcaggca tggacgcgat cacccaggtg ccggttccgg 1329421 ccaacgagcc ggtgcacgac tatgcgccga aatccccgga acggacccgg ctgcgcaccg 1329481 aactggcctc cctggccgat caccccatcg acctgccgca cgtcatcggc ggccgacacc 1329541 ggatgggcga cggcgagcga atcgacgtcg tgcagccgca ccggcacgcc gccaggctgg 1329601 gcaccctgac caacgccacc cacgccgacg ccgcggccgc cgtcgaagcc gccatgtctg 1329661 ccaaaagtga ctgggcggca ctgccgttcg atgaacgtgc cgcggtgttc ctgcgcgccg 1329721 ccgatctgtt ggccgggccg tggcgggaaa agatcgccgc cgcaaccatg ctcggccaat 1329781 ccaagtcggt gtaccaggcc gagatcgacg cggtctgcga gctgatcgac ttctggcggt 1329841 tcaacgtcgc tttcgcccga cagattttgg agcagcagcc gatcagtggc ccgggggaat 1329901 ggaaccggat cgactaccgc ccgctggacg gtttcgtcta cgcgatcacg ccgttcaact 1329961 tcacctcgat cgccggcaat ctgccgaccg ccccggctct gatgggcaac accgtgatct 1330021 ggaagccgtc gatcacccag acgctggcgg cctatctgac catgcaactg ctcgaggccg 1330081 ccgggttgcc gcccggggtg atcaacctgg tcactggcga cggattcgcg gtttccgatg 1330141 tggcactggc cgatccacgg ctggccggca tccacttcac cgggtcgacg gctaccttcg 1330201 gccacctatg gcagtgggtg ggtaccaata tcggccgcta ccatagctat ccgcgactgg 1330261 tcggcgagac cgggggcaag gacttcgtgg tggcgcacgc ctcggcccgc ccggatgtgc 1330321 tgcgcacggc cctgattcgc ggagcattcg attaccaggg ccagaagtgc tcggcggtgt 1330381 cgcgagcgtt tatcgcgcat tcggtgtggc agcggatggg cgatgagttg ctggccaaag 1330441 ccgccgagct gcgctacggt gacatcaccg acctgtccaa ctacggtggt gcgctgatcg 1330501 accagcgcgc cttcgtcaag aacgtcgacg ccatcgaacg ggccaaaggc gcggccgcgg 1330561 tcaccgtcgc cgtcggcggc gaatacgacg acagcgaagg ctatttcgtg cgccccacgg 1330621 tgttgctctc cgacgacccg accgacgagt cgtttgtcat cgagtacttc ggtccgctgc 1330681 tgtcggtgca tgtctacccc gacgagcgct acgagcagat cctcgacgtc atcgacaccg 1330741 gatcccgcta cgcgctgacc ggcgcggtca tcgccgacga ccggcaggcc gtgctgaccg 1330801 cgctggatcg gctgcggttc gcggcgggga acttctatgt caacgacaag ccgacggggg 1330861 cggtggtggg gcgtcagccg ttcggcggtg cacgcggatc gggcaccaac gacaaggccg 1330921 gttcgccgtt gaacctgctg cggtggacgt cggcgcgcag catcaaggag acgttcgtcg 1330981 cggccaccga ccacatctac ccgcacatgg cggtcgactg atggccggct ggttcgcgca 1331041 cacgctgcgc ccggcaatgc ttgccgccgg ccgctcggat cggctgggcc gcatcgtcga 1331101 gcgctcgccg ctcacccgcg gggtggtgcg ccggttcgtg cccggcgaca cgctcgacga 1331161 cgtggtggat atcgttaccg cgctgcggga ttcgggccgc tacctcagca tcgactacct 1331221 gggcgagaac gtcaccgatg ccgacgacgc tgccgccgcc gtgcgggcgt acctggggct 1331281 cttggacgtg ctgggccgcc gcggcgatat cgcatgcgac ggggtgcgac cgctcgaggt 1331341 gtcgctcaag ctgtcggcgc tcgggcaggc cctcgatcgc gacggccaga agatcgcgct 1331401 ggacaacgcc cgcgccatct gtgagcgggc cgagcgggtg ggcgcctggg tcacggtgga 1331461 cgccgaagac cacaccacca ccgattccac attgtcgata tcgggcgatt tgcgcgtcga 1331521 ctttccttgg ctgggcacgg ttgtgcaggc ctatctgcgg cgcacgctgg ccgattgcgc 1331581 ggagttggcg gccgtgggcg cccgagtccg gttgtgcaag ggcgcctatg acgaacccgc 1331641 atcggtggcc taccgagacg ccgcgcaggt caccgactcc tatctgcggt gccttagggt 1331701 attgacggcg gggcgaggct atccgatggt ggccacccac gacccggtga tcatcgcggc 1331761 ggtaccgggg atcacgcgcg aatcagggcg tagtcaaggt gatttcgaat accagatgct 1331821 ctacggcgtc cgcgacgacg aacaacgacg actgaccggc gccggtaacc acgtgcgggt 1331881 gtatgtgccc ttcggcaccc ggtggtacgg gtatttcctg cggcggctgg ccgaacgccc 1331941 ggccaacctg gcgttcttcc tgcgggcgct gaccgaccgc cgacgcgcgc gggggtgcgc 1332001 cgagcgctga aatcgccggt tgctgtcaca ttcggcgggg ctgtctcgtc cttgatgtta 1332061 tgaattccag catgggtcgg cgggaggaca catgtcgcaa cacgacccgg taagtgcggc 1332121 ctggcgggcg catcgggcct acctggtgga cctcgcgttt cgtatggtag gtgacatcgg 1332181 cgtggccgaa gacatggtgc aagaggcatt ttcccgcttg ctgcgggctc cggtcggcga 1332241 catcgacgac gagcgtggct ggctgatcgt ggtcaccagc cggctgtgcc tggatcacat 1332301 caagtcggcg tcgacacgcc gggagcgccc gcaggacatc gccgcatggc acgacggtga 1332361 cgccagcgtg tcatcggttg acccggctga ccgggtgact ctcgacgacg aggtccggct 1332421 ggctttgctg atcatgctcg agcgcctcgg ccccgcggag cgggtggtgt tcgtgctgca 1332481 cgagatcttt gggctgccct accagcaaat cgccacgacg attggcagcc aggcctccac 1332541 atgccggcag ctggctcatc gggcccgtcg caagatcaac gaatcgcgca ttgcggccag 1332601 cgtggagcca gcccagcatc gcgtcgtcac cagagctttc atcgaagcct gctccaacgg 1332661 agacctggac accctgctcg aggtgctgga tccgggtgtc gccggcgaga tcgacgcccg 1332721 caaaggcgtt gtcgtcgtgg gcgcggatcg ggttggcccg accatcctgc gccactggag 1332781 tcaccccgcc accgtcctgg tagcccagcc ggtgtgcggt caaccggcgg tgctggcctt 1332841 tgtcaaccga gcgcttgccg gcgtgttggc cctgtcgatc gaggccggca agatcacaaa 1332901 aatccatgtc ttagtgcagc cttcaacatt ggacccgtta cgggccgaac tcggcggcgg 1332961 ttagttaggt atcggaggta tgaccatgaa atcacttgcc gcgcttgacc ggccgagctg 1333021 gttgtcatcg tcggcgtggc cctggcagcc ctacctgctg agccaccatc agggcggcat 1333081 cgcggttacc gatatcggcg acgggccggc ggtgctgttc gttcacgtcg gcagctggag 1333141 ctttgtctgg cgtgacgtgt tgttgcgtct agccaacgat tttcggtgtg ttgccatcga 1333201 cgcaccgggt tgtgggctca gcgaccggct ctcaaccccg ccaacacttg cccaggcggc 1333261 cgatgcaatc acctcggtca ttgatgcgct gcagttacgt gacctcaccc tggtagccca 1333321 cgacctgggc ggcccggccg gcttcctggc cgccgcccgt cgcggcgacc gcgtcgcggc 1333381 actggccgcg gtcaactgct tcgcatggcg gcccacgggt ccgctgttcc ggggcatgct 1333441 cgcggcgatg ggcagcgccc ccgtgcgtga actggacgcg gccatcaatg cgcttgcccg 1333501 cgcgacgtcg acgcggttcg gggccggtcg gcactggagc cgcgcagacc gcgcggcttt 1333561 tcgggcggga atcgatgcgc cggcccgcag ggcgtggcat gcctacttcc gcgatgcgcg 1333621 ccgtgcccat gccctctata ccgacgtcga cgccgcgttg cgggggggtc tggccgatcg 1333681 gccactgctg accatcttcg gtcagttcaa cgatccgctg cggtttcagc cgcgctggaa 1333741 agagttgttt ccgacggcac gccaactgca ggtccgccgg ggcaaccact ttcccatgtg 1333801 tgacgaccca gacctggtgg ccggggcact cacgtctttc gtgcaacggt caacgtgagc 1333861 cgccgactgc cgtcacacct ggtacacctt gcggtttgcc gccgcgccgc cacatgccaa 1333921 gctactcgcc atggccgtcg ctattgcccg tccgaaattg gaaggaaaca tcgccgtcgg 1333981 cgaggaccgc cggatcggct tcgccgagtt cggcgccccg cagggtcgtg cggtcttctg 1334041 gctgcatggc accccagggg cccggcggca gatcccgacc gaagcccggg tctacgccga 1334101 gcaccacaat attcgtctga ttggcgtcga tcggcccggc atcggcgcct cgacgccgca 1334161 tcagtacgaa accatcttgg cgttcgccga cgatctgcgg accatcgccg acacgctcgg 1334221 catcgacaag atggccgtgg tgggcctgtc gggcgggggc ccatacaccc tggcgtgcgc 1334281 cgccgggctg cccgaccggg tggtcgccgc cggtgtcctc ggcggcgtcg cgccgacgcg 1334341 cggcccggac gcgattagcg gcggtttgat gcgccttggt tcggcggtgg cgccgctgct 1334401 gcaggtgggc ggcaccccgc tgcggctggg tgcgagcttg ctgatccggg cggcccggcc 1334461 cgtcgcgtcc cctgccctcg acctgtatgg cctgctctca ccgcgggccg accggcattt 1334521 gctggctcgg cccgagttca aggcgatgtt cctcgacgat ctgctcaacg gtagtcgcaa 1334581 gcagctcgct gcgccgttcg ccgatgtcat cgcctttgcc cgcgactggg gattccggct 1334641 ggacgaggtg aaagtccccg tccgctggtg gcacggagac cacgaccaca tcgtcccgtt 1334701 ctcccacggg gaacacgtcg tatcccggct tcccgacgcg aagttgttgc acttgcccgg 1334761 cgaaagtcat ctcgctgggc ttggccgtgg tgaagagatt ttgagcaccc tgatgcagat 1334821 ttgggaccgc gacctgcgga aatgatcggg cgtgtgaccg agctcgcatg ggcgggccgc 1334881 actgctttgc atcgccattt gtgcctattg acggccttaa tatgacatgc tgttgcctgt 1334941 gttagagccc gctgaccgcc cctgtgatgc ccccggatgg tttctctacc tcaccgacat 1335001 accgcgcgcg ggtgtcgagt acgggcaatt gctcgccgtg ctgccgctgc agcggatgct 1335061 gccggccggc gacggacatc cggtactggt gctacctggc ctgctggccg gcgacggttc 1335121 cacctggatc ctgcgacgga tcttgcgtcg cctcgggtac gcggcctacg gctgggggct 1335181 cggccgcaac atcgggccga cggccaaagc ggtatccggg atgcgggacc tcctcgacaa 1335241 gctccactcc cggtaccaca ccccggtgag cctgattggg tggagcctgg gtggcatctt 1335301 cgcgcgcggc ctcgcccgcg accatccgtc ggcggtgcgc caggtgatca cactgggcag 1335361 cccgtttggc atgagggaca cctgtgagac gcgctccgcg tggagcttca accggtatgc 1335421 gcatctgcac accgagcggc acgagttgcc gctggaaatg gaaagtgaac ctttgccggt 1335481 gccgaccacc gcgatctact cgcgctgcga cggcatggtc gcctggcaga cgtgcatgaa 1335541 ttcgccatcg gagcgcgcgg aaaacatcgc ggtgcgcagc agccacatcg gctacggcca 1335601 caatccgccg gtggtgtggg ccatcgccga ccggctggca cagccccagg gtgcatgggc 1335661 gccgtttcgg ccgccgaagg tgttgagccc gctgtttccg cgaccggata caccggcaga 1335721 ggcggtcagc accccccaga cgcgaccggc ctgacggggc aggcgatcac ggcgccgggg 1335781 tagcctcgct cacgtgctgc tggcctccct gaatcctgct gtcgtctccg ccgccgatat 1335841 cgcggacgcg gtccgcatcg acggcgacgt gctgagccgt agcgacctgg tcggcgcggc 1335901 aacgtcggtg gccgagcggg tcgccggtgc gcaccgggtc gccgtgctgg ccacgccgac 1335961 cgcgtcgacg gtgctggcga tcaccggctg cctgatcgcc ggcgtgccgg ttgtgccggt 1336021 acccgccgat gtgggcgtca ccgaacgccg gcacatgctc accgactccg gcgtccaggc 1336081 atggctgggc ccgttgcccg acgacccagc ggggctgcca cacatcccgg tgcgcacgca 1336141 cgcgcggtcc tggcaccgtt atccggagcc ctcacccggg gccatcgcca tggtggtcta 1336201 cacgtccggc accaccgggc cgcccaaagg cgtgcagctg agccggcggg cgatcgccgc 1336261 cgacctcgat gcattggcag aggcctggca gtggacggcc gaggacgtgc tggtccacgg 1336321 tctgccgctg tatcacgttc acggcctggt gctgggcttg ctcgggtcgc tgcggttcgg 1336381 aaatcgcttc gtgcacaccg gtaaaccaac gccggccggc tacgcccagg cctgttatga 1336441 agcgcacggc acgttgtttt ttggggtgcc gacggtgtgg tcacgagtgg cggccgacca 1336501 agctgccgcc ggggcgctca aaccggcgcg gctgctggtg tccgggagtg cggcactacc 1336561 cgtgccggtg ttcgacaagc tggtgcagct caccgggcac cggcccgtcg aacgctacgg 1336621 tgcttcggag tcgctgatca ccctatcgac gcgggctgac ggtgagcgtc gcccgggctg 1336681 ggtcggcctg ccgctggccg gtgtgcagac ccgactggtg gacgacgatg gcggtgaggt 1336741 cccgcacgac ggggaaaccg ttggaaagct tcaggttcgc ggtccgaccc tgttcgacgg 1336801 ctacctgaat caacccgatg ccaccgccgc ggcgttcgac gccgacagct ggtaccgcac 1336861 cggcgacgtc gcggtggtcg acggcagtgg gatgcaccgc atcgtgggac gcgagtcggt 1336921 cgacttgatc aagtcgggtg gataccgggt cggcgccggt gaaattgaaa cggtgctgct 1336981 cgggcatccg gacgtggcgg aggcggcagt cgtcggggtg cccgacgatg atctaggcca 1337041 gcggatcgtt gcctacgtag tcggctcagc gaatgtcgat gcggacgggc ttatcaactt 1337101 tgttgcccaa caactttcgg tgcacaagcg cccgcgcgag gtgcgtatcg tagatgcgct 1337161 gccgcgcaac gcgttgggga aagtgctcaa gaagcagttg ctgtcagaag gctgagctac 1337221 ggcgaattat cgtgtaccgc tggacagtta cgctggcaca ctgttactcc gacggcccgg 1337281 tgagcttagc gcatgggcct tgttgccgcg ccactgtagg gcttccaggg cgacggccac 1337341 atggacggag gtgtggtcga gcggtcgcgg tagcagccgc tgagcggact cgagtctgcg 1337401 cagaaatgta ttgcggtgag tgtggagacg ttttgcggcc cgggaggcgt tgcactgctc 1337461 gttgatgaag gtcagcaggg ccgtttgtag atctgggctg gcagactcga ggtctccaag 1337521 cgtactcgtg atgaattcgc ttgcagcatc tggattttgg ctgatcaatg cgaccatctt 1337581 aacgtcggca aagaaggcga cccgctgggt cgaccgtagc cgtgacaagg tgcgctgggt 1337641 gatgagcgct tcgaggtggc tgcgccggaa cccctccacc ccgttggcgg tggtcccgat 1337701 ggcgatgcgc gccccgggtg cgttgtccac cgccgcctgc actgtgtcga tgtcgagtcc 1337761 gtcggcgtcg gtcacccacg cccagcggct cgccgccccg gcgaccaccg tcagcggtcg 1337821 tgtcgatccc acggcgtggc agaacagatc agccgcccgg tcgaggtagc tgtggtcacc 1337881 gtcgagctcg tcgctccaga tgatggcagc ggtatgggca cgactcagcg ggtagcccaa 1337941 tttcgcttcg gcccgttcgg ggctgatagg ggcgccatcg agaatcagcc cgacgacctc 1338001 gaggcgttcg gcatgggtgc tgcgggtcag ttcgtcgtgt tccgactgca cttgcgcggc 1338061 gataccggtc agcgtggcct cgatgaagtc gttgacggag cgggccgaca cgtctagcag 1338121 ctcgcgcagc tcttgggggt cggaagtgag ttcgaacgca atccccatcc agaaccgcca 1338181 cccgatgtgc tcaccggttc gatagatgtt gaacgctact gtgtccagcc cccggcgcac 1338241 caggtctcgg gccatccgca gtggctcggt gccgagattg gcgggcaccc gagcaccagg 1338301 gtcacgcagg ttggccgcag cccagtacac caggttggcg cgattggccg tctggacaac 1338361 cttcgcaagc accggatcgt tggcgatcgc cggattggcc gcaatcgtgg cacggtccag 1338421 ttcctcgatc cactccgggc tgggattgag ggcgatgcgt gctccctcgc ggatcagctc 1338481 acgaattcgc ggcgaaggtt gttgccatgc cacgcgccga tcttagggcc agcgggtgca 1338541 atttgcacac tatgttggca ctattgtgcc ggattcacac tgcacggccg gtgtgtgcgc 1338601 gaaatcacgg tgtgggtctg ctggatgagt cgaccgtgtt gaacaacttg cgacacaccg 1338661 caatttgcga aatccgccac cgaccgggca tagtaaccca gctagtcgtc gttgtcgcgt 1338721 cgaaccacat ggtgaactgt gcggcgggtg cattttgcac atcaagtggg cgctgattgg 1338781 gaagatttac ccttcggcgg cggcggtagg tgcagattgc actttggctc atgctgattg 1338841 aaattttttg acctgttgcg gtccttgcgg gctcgccatc attggcggca gttcgtcacc 1338901 gacgaatcgg ggccaaggac gtaggcgacc agttcgcttg actgctaacc gctcctgatc 1338961 gtacccgtgc gagtgctcgg gccgtttgag gatggagtgc acgtgtcttt cgtgatggca 1339021 tacccagaga tgttggcggc ggcggctgac accctgcaga gcatcggtgc taccactgtg 1339081 gctagcaatg ccgctgcggc ggccccgacg actggggtgg tgccccccgc tgccgatgag 1339141 gtgtcggcgc tgactgcggc gcacttcgcc gcacatgcgg cgatgtatca gtccgtgagc 1339201 gctcgggctg ctgcgattca tgaccagttc gtggccaccc ttgccagcag cgccagctcg 1339261 tatgcggcca ctgaagtcgc caatgcggcg gcggccagct aagccaggaa cagtcggcac 1339321 gagaaaccac gagaaatagg gacacgtaat ggtggatttc ggggcgttac caccggagat 1339381 caactccgcg aggatgtacg ccggcccggg ttcggcctcg ctggtggccg cggctcagat 1339441 gtgggacagc gtggcgagtg acctgttttc ggccgcgtcg gcgtttcagt cggtggtctg 1339501 gggtctgacg gtggggtcgt ggataggttc gtcggcgggt ctgatggtgg cggcggcctc 1339561 gccgtatgtg gcgtggatga gcgtcaccgc ggggcaggcc gagctgaccg ccgcccaggt 1339621 ccgggttgct gcggcggcct acgagacggc gtatgggctg acggtgcccc cgccggtgat 1339681 cgccgagaac cgtgctgaac tgatgattct gatagcgacc aacctcttgg ggcaaaacac 1339741 cccggcgatc gcggtcaacg aggccgaata cggcgagatg tgggcccaag acgccgccgc 1339801 gatgtttggc tacgccgcgg cgacggcgac ggcgacggcg acgttgctgc cgttcgagga 1339861 ggcgccggag atgaccagcg cgggtgggct cctcgagcag gccgccgcgg tcgaggaggc 1339921 ctccgacacc gccgcggcga accagttgat gaacaatgtg ccccaggcgc tgcaacagct 1339981 ggcccagccc acgcagggca ccacgccttc ttccaagctg ggtggcctgt ggaagacggt 1340041 ctcgccgcat cggtcgccga tcagcaacat ggtgtcgatg gccaacaacc acatgtcgat 1340101 gaccaactcg ggtgtgtcga tgaccaacac cttgagctcg atgttgaagg gctttgctcc 1340161 ggcggcggcc gcccaggccg tgcaaaccgc ggcgcaaaac ggggtccggg cgatgagctc 1340221 gctgggcagc tcgctgggtt cttcgggtct gggcggtggg gtggccgcca acttgggtcg 1340281 ggcggcctcg gtcggttcgt tgtcggtgcc gcaggcctgg gccgcggcca accaggcagt 1340341 caccccggcg gcgcgggcgc tgccgctgac cagcctgacc agcgccgcgg aaagagggcc 1340401 cgggcagatg ctgggcgggc tgccggtggg gcagatgggc gccagggccg gtggtgggct 1340461 cagtggtgtg ctgcgtgttc cgccgcgacc ctatgtgatg ccgcattctc cggcggccgg 1340521 ctaggagagg gggcgcagac tgtcgttatt tgaccagtga tcggcggtct cggtgtttcc 1340581 gcggccggct atgacaacag tcaatgtgca tgacaagtta caggtattag gtccaggttc 1340641 aacaaggaga caggcaacat ggcctcacgt tttatgacgg atccgcacgc gatgcgggac 1340701 atggcgggcc gttttgaggt gcacgcccag acggtggagg acgaggctcg ccggatgtgg 1340761 gcgtccgcgc aaaacatttc cggtgcgggc tggagtggca tggccgaggc gacctcgcta 1340821 gacaccatgg cccagatgaa tcaggcgttt cgcaacatcg tgaacatgct gcacggggtg 1340881 cgtgacgggc tggttcgcga cgccaacaac tacgagcagc aagagcaggc ctcccagcag 1340941 atcctcagca gctaacgtca gccgctgcag cacaatactt ttacaagcga aggagaacag 1341001 gttcgatgac catcaactat caattcgggg atgtcgacgc tcacggcgcc atgatccgcg 1341061 ctcaggccgg gttgctggag gccgagcatc aggccatcat tcgtgatgtg ttgaccgcga 1341121 gtgacttttg gggcggcgcc ggttcggcgg cctgccaggg gttcattacc cagttgggcc 1341181 gtaacttcca ggtgatctac gagcaggcca acgcccacgg gcagaaggtg caggctgccg 1341241 gcaacaacat ggcgcaaacc gacagcgccg tcggctccag ctgggcctga caccaggcca 1341301 aggccaggga cgtggtgtac gagtgaaggt tcctcgcgtg atccttcggg tggcagtcta 1341361 ggtggtcagt gctggggtgt tggtggtttg ctgcttggcg ggttcttcgg tgctggtcag 1341421 tgctgctcgg gctcgggtga ggacctcgag gcccaggtag cgccgtcctt cgatccattc 1341481 gtcgtgttgt tcggcgagga cggctccgac gaggcggatg atcgaggcgc ggtcggggaa 1341541 gatgcccacg acgtcggttc ggcgtcgtac ctctcggttg aggcgttcct gggggttgtt 1341601 ggaccagatt tggcgccaga tctgcttggg gaaggcggtg aacgccagca ggtcggtgcg 1341661 ggcggtgtcg aggtgctcgg ccaccgcggg gagtttgtcg gtcagagcgt cgagtacccg 1341721 atcatattgg gcaacaactg attcggcgtc gggctggtcg tagatggagt gcagcagggt 1341781 gcgcacccac ggccaggagg gcttcggggt ggctgccatc agattggctg cgtagtgggt 1341841 tctgcagcgc tgccaggccg ctgcgggcag ggtggcgccg atcgcggcca ccaggccggc 1341901 gtgggcgtcg ctggtgacca gcgcgacccc ggacaggccg cgggcgacca ggtcgcggaa 1341961 gaacgccagc cagccggccc cgtcctcggc ggaggtgacc tggatgccca ggatctctcg 1342021 gtagccctcg gcgttgacgc cggtggcgat caaggtgtgc actccgacga cgcggcctgc 1342081 ctcgcgcacc ttgagcacca gggcgtcggc ggcgaggaag gtatacgggc cggcatcgag 1342141 cgggcgggtc cgaaacgcct ctacggcttc gtcgagctct ttggccatga tcgacacttg 1342201 cgacttggaa agctttgtca caccaagtgt ttcgaccagg cgctccatcc ggcgagtgga 1342261 tactcccagc aggtagcagg tcgccaccac gctggtcagt gcgcgttcag ctcgcttgcg 1342321 gcgctgcagc agccagtccg ggaaatagct gccctggcgc agcttgggga tcgcgacgtc 1342381 gatggttgcg gcacgggtgt cgaaatcacg gtggcggtag ccgttgcgct gattggaccg 1342441 ctcatcgctg cgttcgcggt agcccgcccc gcacagggcg tcggcttcag cccccatcaa 1342501 ggcggcgatg aacgtcgaga gcagcccgcg cagcagatcc gggctcgcct gtgcgagttg 1342561 gtcagccaga agctgctcgg tgtcgataag atgagaagag gtcattgcgt catttccttc 1342621 gattgacttt tgctggtcgt ttcgaaggat cacgcgatga ccgcccacta ctgggctacg 1342681 acacgcccac cggccttacc tgcccgtaca ccacacccct ggacgtaact tgacaccaat 1342741 ccacagcacc gagcagtgac agaaggtgcc ccaaggtgtg gtgaaactcg ctggacggtc 1342801 cccaggatgt tggcagcaca ttcaccggac atgaccggag caagaccgga catcctccca 1342861 taccgtcgtc gccgtgtaca tccgtagccc gtcctggcag gtgctgggtt gaccaaaatc 1342921 agcccaacac ctgccacgac gatgaagcgg gttgcgctgg catgtcttgt cggctcggcg 1342981 atcgaattct acgacttcct tatctacggc accgctgcgg cgctggtgtt tcccaccgtg 1343041 ttcttcccac acctggatcc cacggtggcc gccgtggcct cgatggggac atttgctgtg 1343101 gcgttcctat cccggccgtt cggcgcggcc gtctttggat actttggaga ccgcctcggc 1343161 cgcaagaaga ccctggtcgc cacactgttg atcatgggcc tggcaaccgt gactgtcggg 1343221 ctggttccaa cgacagtggc catcggcgcc gcggccccac tgatcctgac gaccatgcgg 1343281 ctgctgcaag ggttcgcggt cggcggcgag tgggccggtt cggcgctgct gagcgccgag 1343341 tacgcgcccg ccagcaaacg tggctggtac gggatgttca ccgttgtggg tggcggcatc 1343401 gcgctggtac tgaccagcct gacctttctg ggcgtgaact acaccattgg cgaaagcagc 1343461 cccacattca tgcagtgggg gtggcgcata ccgtttctgg tcagtgcggc gctgatcgcc 1343521 gtcgccctat acgtgcggtt caacatcgac gagaccccgg tgttcgcccg ggaaagggca 1343581 gacgaaaaaa cccgtttggg cccagccgaa acgccgattg cccaagtact gcggcggcag 1343641 cggcgagaga tagtcttggc cgccggcagc gccgtttgct gcttcggctt cgtctacctg 1343701 gccagcactt acttggccag ctacgctcaa acccgactgg ggtattcgcg cggcagcatc 1343761 ctgttcgaca gtgtgctggg tggactgctg tgcatcgtgt tcaccgcgct ttcttccgct 1343821 ctttgcgacc aactcgggcg ccgccgcgtc ctattggccg ggtgggcggt ggctctaccc 1343881 tggtcgctgt tggtcatgcc gctgatcgac tccggcagcc ccagtttgtt cgcggtggct 1343941 gtcgtcggca tgtatgccat cggcggattc ggtttcggac ccacggcatc gttcatccca 1344001 gaactgtttg ctactagcta ccgatacacg ggcagcgcgc tcgcggcgaa tctcgctggg 1344061 gttgccggcg gcgcgctacc gccggtgatt gccggcgcgc tggtggcaac ctatggcagc 1344121 tgggcgatcg gtgtcatgct ggccatcctc gcgttgatca gcctggtatg cacctatcgg 1344181 ttgcccgaaa ccgccggatc ggccctcgtc agccgctagt tggcgtgcag gtcctcgttg 1344241 agggcaatgc cctgaccgtc gcgggccagc acttcgaccg ccccgctgac ggaattgcgg 1344301 cgaaacagca ggttgctgct cccggagagc tcacgcgcct tgaccgaatt gctgtcgggc 1344361 atggtgaccc tcgtgccggc ggtcacgtac agcccggcct ccaccacgca gtcgtcgccc 1344421 agtgagatgc ccagaccgga gttggcgccg agcagacaac gcttgccgat cgaaatgacg 1344481 tgtgttccac cgccagacag cgtgcccatg atcgacgctc cgccgccgac atcggagccg 1344541 tcgcccacca ccacacccgc cgagatgcgg ccttccacca tcgaggcgcc cagggtgccg 1344601 gcgttgtagt tgacgaagcc ctcatgcatc acggtggtgc ccggcgccag gtgagcgccc 1344661 aaccgcacgc ggtcggcatc ggcgatacgt acgccggtgg gcacgacgta gtcgaccatc 1344721 cggggaaact tgtcgacgcc atacacagtc accggtccgc ggcggcgcag ccgcgcccgc 1344781 accgcctcga aaccgtctat ggcgcagggt ccgtgattgg tccacaccac attggtcagc 1344841 accccaaaca agccgccggc gttcaaccca tggggcgcca ccaggcggtg cgacaagagg 1344901 tgaagccgca ggtaagcatc gtatgggtca gcggcgacat cgtcgagcga gccgatgacc 1344961 gtacggaccg cgatggtctc ggtgcggcgg tcgtcatcgc ggccgatcag cgcggccagc 1345021 tcgacaggaa cgtcggacac cgccagtcgt gacgtcgcgc tggtgcccga ttcggtcagt 1345081 tccggcgcgg gaaaccaggt gtcgaggacc gatccgtcag cggcgagggt agccaggccg 1345141 atgcctgctg ctccagtcac ggtcgacacg ctacttgtgc cgccgaacag acacaaaacc 1345201 accctatttc gaccagaatc gggtgctttt gcgtctgctc ggccaactaa gctagcgccg 1345261 tgctggattt gcgcggggac ccgatcgaat tgaccgcggc gctgattgac atccccagcg 1345321 agtcgaggaa ggaggcacgc atcgccgacg aggtggaagc ggcgttgcgc gctcaggcat 1345381 cggggttcga gatcatccgc aacggcaacg cggtgctggc gcgtacaaag ctgaaccggt 1345441 cctcgcgggt gctgttggcc ggacacctgg acaccgtgcc agtggccggc aacctgccta 1345501 gccgccgcga gaacgaccag ctgcacggct gcggcgcagc cgacatgaaa tccggcgacg 1345561 cggtcttcct tcatctggcc gctacactgg ccgaaccgac gcacgatcta acactggtgt 1345621 tctacgactg cgaggaaatc gattcggcgg caaacggttt aggccgcatc cagcgcgagc 1345681 tgccggactg gctatccgcg gatgtagcca tcttgggtga gcccaccgcc ggctgcatcg 1345741 aggctggttg ccagggcacg ttgcgtgtcg tcctcagcgt gaccggaact cgcgcgcatt 1345801 cagcgcgttc gtggttgggt gacaacgcaa tccacaagtt gggtgctgtg ctggaccggt 1345861 tggccgtcta ccgggcacgc agcgtcgaca tcgacggttg cacctatcgg gagggcctct 1345921 cggcggtgcg cgtagcaggc ggcgtcgccg gcaacgtgat ccctgacgcg gcctcggtca 1345981 cgatcaacta ccgctttgcc cccgaccggt cggtggccgc ggcattgcaa catgtccatg 1346041 acgtgttcga cgggctcgac gtgcagatcg agcagacgga cgccgcggcc ggtgcgctgc 1346101 ctggcctgtc cgagcccgcg gccaaggcgc tggtcgaggc cgccggcggg caggtccggg 1346161 ccaagtatgg ctggactgat gtgtcgcgct ttgccgcttt gggcataccg gcggtcaatt 1346221 acggcccggg tgatcccaac ctggcgcact gccgcgacga acgggtgccc gtcggcaaca 1346281 tcaccgcggc cgtggacttg ctgcgccgat acctgggtgg ctagcgctgc tgtggcccca 1346341 agcgtgctgc cgccttggtc gcgtcggctg ccgcggctgc catcccgatc ccggccagct 1346401 cctcagccac cgcggtcagc tcggcagcat ctccgtcggc caggccacgg gcgtgcttga 1346461 cgaggatatt tcctacggtg cagtcgattt cggcggcgag gcgagtcacc gggtccaccg 1346521 cacggatgtc gcccaaccga accgcgttat gccaggcgca tagggccacc gccgcctgcc 1346581 cggcccgctc agccgtccgg gcggcctccc gggccgccgc gatggcccct gtcatgtcct 1346641 gggccgccgc cctggtccag gccctggcca gcccgagctc gggtgcgaac aacgcggact 1346701 tcgttccgtg ccgagcttca gcgcgctgca gtgtttttgc agactcggcg atatggcctt 1346761 gctgcgcgat ggccgttgcc aacaacatca gcgacagcgg accccacgag tagccggttc 1346821 gttccagtgt ggcggcggcc ggctccagca tcgatgccgc ggcgccgaat tcgcctttgg 1346881 tgatcagtac gtacgccaac aacacttcac cgatggaccg gccaggttgc tgcagctagg 1346941 cgaagtcggt gaaccgcttg gccagctcct gagccggcgc gacgtcgcct gccagcagca 1347001 gcgacgtgat ctgagccagg cccacggtga accgcagcag ccccggatgt tcggcggccg 1347061 acgcccgttc ggccagccgg tcaacgtcgc cgaaccggcc cattcgtgcc gatgataacg 1347121 cggcagcgct ggcggcccag gccacggcca tgtcgtcggc agccggtccg gacagcacct 1347181 cggtggccag cgtgatggcc cgcggcaagt ttccggagtt catcgcaaac gtggccgcca 1347241 gcgcatccag ggtgctgcgg gccgtgggct cggtcactcg gctgcgggtc gtctgcagaa 1347301 acgccgtggc gcgctcgggc tcgttgagca tccagaaccg attcgccgcc cggggtatcg 1347361 cccaggccat cagctcggtc tcggtcaatt cggcgggatt caccgccgcc agcaccgcgt 1347421 cagcttcgcg accgcgaccc tgccaaccga gtgcgtaagc caagggcagg cgtgccgcca 1347481 gggcgtccga cctatccagc gctgcccgcg ccaaccgttc ggcaagccgg acgtcgccga 1347541 gccgcagggc ctgcccggct gcggtcgccg catccgtgac cgcggccggg gtagcactgg 1347601 cggggacgtc gatggccagt gaggacagcc gtaactgatc gctgacatgg tcggatgggt 1347661 gcttggccag ctgcgcgacc agcgacacgc gcaatgcatg cgcgtgctcg gccgtcaata 1347721 cggcgcgtgc gcggtcggcg tacagcggat ggccgacaaa aatctcgctg gtatcgctgt 1347781 cgggacccac ccgcaccgcg ccggcggctt cggcttggcc gagcgtgtcc aactgctcgc 1347841 caccgaccag ggccaccagg tcggtgcgcg ccaacggttc ggcgatggcg aggtagtcga 1347901 caacggcgcg ggccggttcc ggcagggcgc acaggtactc gtcgatcacg ccggacagcg 1347961 gccgacgatc ctcgtctcga cagcgccacc ggccgtccac gtgttcgaga ccaccgccgt 1348021 cgatgaggtg gcgcagatac aacgggttgc caaggctgcg ccgaaagagc tcgtcggcgt 1348081 cggcgacgtc cagtgtcgcg tccagcgccg actccacgaa cgccgcggtt tgggccctgt 1348141 cgagcggctc gatggcgacc cgggtgagca ggtcatcgga ccagagcgca gctatagcgt 1348201 ccggtggctc ggcctccgag gcgacggtga ccaccagccg cgccgccccg gcccgcgcca 1348261 gctggtacac caaggtggcc gacagcggat ccaggttgtg cgcgtcgtcg accaccagca 1348321 gcagatcgcc agcatcaccg gtcagggaac tacgcgccgc ccgcagcagc gccgcgggcc 1348381 gcccaatgtc ggctccggag gcgggcaggc tgatcaaatg gcggaaagcg ccgaacggga 1348441 tggcccgccc tggagcggtt cccaccaccc agcgagcccg gccgctcctg ccgtcctcgg 1348501 acatgacctg ctcggcagcc agttgcgcca gcagcgtctt gccgacgccg tgtggcccga 1348561 ccagcaccac cccgcaccga tccggactgt cgacggccgc ctccacgtgt ttccagacgc 1348621 gcatcgccgg attttatggc ggttgcgccc aacgacattc gagcggggga taggccaaaa 1348681 atgtacgcgg ttcacatcgg tggtctacgt tctggtgtat gtcggcgaaa atcgacatta 1348741 ccggtgattg gactgtggcc gtgtattgcg cggcctcgcc aacgcacgcg gagttgctag 1348801 agctggccgc cgaagtcggc gcggcaatcg ccggacgtgg ctggacgctg gtgtggggag 1348861 gtggccatgt ttcggcgatg ggggctgtcg cctcggcggc gcgagcctgc ggcggctgga 1348921 ccgtcggcgt gattcccaag atgctggtgt accgcgaact ggctgatcac gacgccgacg 1348981 agctaatcgt caccgacacc atgtgggagc gcaagcagat tatggaagat cgctcagatg 1349041 cgttcatcgt gttgccgggc ggtgtcggca ccctagacga gctgtttgac gcatggaccg 1349101 acgggtatct cggtacccat gacaaaccca ttgtgatggt agatccctgg gggcatttcg 1349161 atggactgcg ggcatggctg aacggattgc tcgacaccgg ttacgtctca cccacggcga 1349221 tggaacggct ggtggtagtc gataacgtca aggacgctct gcgggcctgc gcaccttcct 1349281 gaggttggtc gacaaccaat tcgacatttc gcaaacgaat cgagggctta cgtgtccgat 1349341 tactacggcg gcgcacacac aacggtcagg ctgatcgacc tggcaactcg gatgccgcga 1349401 gtgttggcgg acacgccggt gattgtgcgt ggggcaatga ccgggctgct ggcccggccg 1349461 aattccaagg cgtcgatcgg cacggtgttc caggaccggg ccgctcgcta cggtgaccga 1349521 gtcttcctga aattcggcga tcagcagctg acctaccgcg acgctaacgc caccgccaac 1349581 cggtacgccg cggtgttggc cgcccgcggc gtcggccccg gcgacgtcgt tggcatcatg 1349641 ttgcgtaact cacccagcac agtcttggcg atgctggcca cggtcaagtg cggcgctatc 1349701 gccggcatgc tcaactacca ccagcgcggc gaggtgttgg cgcacagcct gggtctgctg 1349761 gacgcgaagg tactgatcgc agagtccgac ttggtcagcg ccgtcgccga atgcggcgcc 1349821 tcgcgcggcc gggtagcggg cgacgtgctg accgtcgagg acgtggagcg attcgccaca 1349881 acggcgcccg ccaccaaccc ggcgtcggcg tcggcggtgc aagccaaaga caccgcgttc 1349941 tacatcttca cctcgggcac caccggattt cccaaggcca gtgtcatgac gcatcatcgg 1350001 tggctgcggg cgctggccgt cttcggaggg atggggctgc ggctgaaggg ttccgacacg 1350061 ctctacagct gcctgccgct gtaccacaac aacgcgttaa cggtcgcggt gtcgtcggtg 1350121 atcaattctg gggcgaccct ggcgctgggt aagtcgtttt cggcgtcgcg gttctgggat 1350181 gaggtgattg ccaaccgggc gacggcgttc gtctacatcg gcgaaatctg ccgttatctg 1350241 ctcaaccagc cggccaagcc gaccgaccgt gcccaccagg tgcgggtgat ctgcggtaac 1350301 gggctgcggc cggagatctg ggatgagttc accacccgct tcggggtcgc gcgggtgtgc 1350361 gagttctacg ccgccagcga aggcaactcg gcctttatca acatcttcaa cgtgcccagg 1350421 accgccgggg tatcgccgat gccgcttgcc tttgtggaat acgacctgga caccggcgat 1350481 ccgctgcggg atgcgagcgg gcgagtgcgt cgggtacccg acggtgaacc cggcctgttg 1350541 cttagccggg tcaaccggct gcagccgttc gacggctaca ccgacccggt tgccagcgaa 1350601 aagaagttgg tgcgcaacgc ttttcgagat ggcgactgtt ggttcaacac cggtgacgtg 1350661 atgagcccgc agggcatggg ccatgccgcc ttcgtcgatc ggctgggcga caccttccgc 1350721 tggaagggcg agaatgtcgc caccactcag gtcgaagcgg cactggcctc cgaccagacc 1350781 gtcgaggagt gcacggtcta cggcgtccag attccgcgca ccggcgggcg cgccggaatg 1350841 gccgcgatca cactgcgcgc tggcgccgaa ttcgacggcc aggcgctggc ccgaacggtt 1350901 tacggtcact tgcccggcta tgcacttccg ctctttgttc gggtagtggg gtcgctggcg 1350961 cacaccacga cgttcaagag tcgcaaggtg gagttgcgca accaggccta tggcgccgac 1351021 atcgaggatc cgctgtacgt actggccggc ccggacgaag gatatgtgcc gtactacgcc 1351081 gaataccctg aggaggtttc gctcggaagg cgaccgcagg gctagcggat tccgggcgca 1351141 gtctcgatac ccgcactgga cgctcgacgg taaccaggca ctatggatgc gtgcgttcaa 1351201 caccgccggc ctcagccggt cgttcaacac cgccggcgtt agccggccat tcaacaccgc 1351261 cggcgttagc cggccattca acgctgtgcg gccgtccagt cgcaggtgat cgtgcgctga 1351321 tcatggcgat cgtcaaccgc accccggatt cgttttacga caagggtgcg actttcagcg 1351381 acgcggctgc cagagacgcg gtccaccggg ccgtcgccga cggtgccgac gtcatcgacg 1351441 tcggcggtgt caaagccggc ccgggtgaac gcgtcgacgt cgacaccgag atcacgcggc 1351501 tggtgccgtt catcgaatgg ctccgcggtg cttacccgga ccagctgatc agtgtcgaca 1351561 cctggcgcgc gcaggtggcg aaggcggcct gcgcggcggg ggcggacctg atcaacgaca 1351621 cctggggtgg cgtcgacccg gccatgcccg aggtggccgc cgagttcggc gcgggcctgg 1351681 tgtgtgcgca caccggcggc gcgctgccac gcacgcgacc cttccgggtg agctacggta 1351741 cgactacccg cggtgtggtg gatgctgtga ttagccaggt cacagccgcc gccgagcggg 1351801 ccgtcgcggc cggggtggcc cgcgagaagg tgttgatcga cccggcacac gacttcggca 1351861 agaacacctt ccatgggctg ctgctattgc gacacgtggc cgatcttgtt atgaccgggt 1351921 ggcccgtgct gatggctttg agcaacaagg acgttgtcgg ggagactctg ggcgtggatt 1351981 tgaccgaacg gcttgaggga acgctggcag ccaccgcgtt ggctgcggcc gccggggcgc 1352041 gcatgtttcg ggtgcatgag gtcgccgcca cccggcgggt gctggaaatg gtggcatcga 1352101 ttcagggggt ccggccgccg acgcgcacgg tgagaggact cgcatgacag catcggagct 1352161 ggtcgccggc gatctcgccg gtggcagggc ccctggcgcg ctgcccttgg acactacttg 1352221 gcaccgtccc ggctggacga tcggggagtt ggaagcggca aaggccggac ggacgatttc 1352281 ggtggtgctg ccggccctca acgaggaagc gaccatcgaa tcggtgatcg acagcatctc 1352341 tccgctggtc gatggcctgg tcgatgaatt gatcgtgctg gactccggtt ccaccgacga 1352401 caccgagatc cgggccatcg cctccggcgc ccgggttgtc agccgtgaac aggcgttgcc 1352461 cgaggtgccg gtacggcccg gcaaaggtga ggcattgtgg cgttcactgg cggccaccag 1352521 cggcgacatc gtggtgttca tcgactcaga cctgatcaac ccgcacccct tgtttgtgcc 1352581 atggctggtc ggtccgctgc tcaccggcga aggcattcag ctggtcaaga gcttttaccg 1352641 acggccgctg caggtcagcg acgtgacgag tggggtgtgc gccaccggcg gcgggagggt 1352701 caccgagctg gtggcgcggc cactgttagc cgcgctgcgg cccgagctgg gttgtgtact 1352761 gcagccgctg agcggtgagt atgcggccag ccgggagctg ctgacatcgc tgccatttgc 1352821 ccccggctac ggcgtggaga tcggcctctt gatagacacg ttcgaccggt tgggcctgga 1352881 cgcaatcgcc caggtcaact tgggcgttcg ggcgcaccgt aaccggcccc tagacgagct 1352941 cggcgcgatg agccgccagg tcatcgcgac cctgctgtcg cgctgtggaa ttcccgattc 1353001 cggtgtcggg ctgacccagt tcttgcccgg cggcccggac gatagtgact acacgcggca 1353061 cacctggccg gtatcactag tcgaccggcc gccgatgaag gtgatgcggc cgcgctgacc 1353121 gacaccgcgt cggcgcctta gggcaagatc gatgacgtgg cgttggtgtt ggtgtacctg 1353181 gtggtgctgg tcctggtggc gatcgtgctg ttcgctgcgg cgagcttgct attcggccgt 1353241 ggcgagcagt tgccgcccct gccgcgggcg acgacggcga cgacgctgcc ggcgttcggg 1353301 gtcacccgcg ccgacgtcga cgcggtcaag ttcacgcagg tgctgcgcgg gtacaagacc 1353361 agcgaggtgg actgggtgct ggaacggctc ggccgtgagc tcgaggcgct acgctctcag 1353421 ctcggggcga tccacgcctc gtcggaagac gccgaggccg agtctgacgc gtcaaaccct 1353481 tcgcgcggcg agaccgtcgt gcactaccgt tctgaccccg cgtgagcggc gacgggctgg 1353541 ttcgctgccc ctgggcggag gttcgtccag ggcccgatgc ccagctgtac cgcgactatc 1353601 acgacaacga atgggggcgt ccgctgtacg gccgggtggc tttgttcgag cgaatgagcc 1353661 tggaggcctt ccagagtggc ctgtcatggt tgataatcct gcgcaagcgg gagaatttcc 1353721 ggcgcgcatt ctctgggttc gacatcgaca agatcgctcg ctacaccgat accgatgtgc 1353781 gacggctact cgccgatgac ggaatcgtgc gcaaccgcgc caagattgag gcgacgatcg 1353841 ccaacgcgcg cgcagctgcc gatctggggt cgtccgaaga cctatccgag ctgctgtggt 1353901 cgttcgcgcc accgcctcgg ccccggcccg tcgacggttc cgaaattccc tcggtcagca 1353961 cggaatcgaa ggctatgtcg cgtgagttga agcggcgcgg gttccgtttc gtcgggccca 1354021 ccaccgccta tgcgttgatg caggcgaccg ggatggtcga cgaccatatc caagcatgct 1354081 gggtgcccac tgagcgacct tttgaccagc cgggctgccc gatggcggcc cggtgaagtc 1354141 attgcgccgg ggcttgtgca cctgatgaac ccgaataggg aacaataggg gggtgatttg 1354201 gcagttcaat gtcgggtatg gctggaaatc caatggcggg gcatgctcgg cgccgaccag 1354261 gctcgcgcag gcgggccagc ccgaatctgg agggagcact caatggcggc gatgaagccc 1354321 cggaccggcg acggtccttt ggaagcaact aaggaggggc gcggcattgt gatgcgagta 1354381 ccacttgagg gtggcggtcg cctggtcgtc gagctgacac ccgacgaagc cgccgcactg 1354441 ggtgacgaac tcaaaggcgt tactagctaa gaccagccca acggcgaatg gtcggcgtta 1354501 cgcgcacacc ttccggtaga tgtccagtgt ctgctcggcg atgtatgccc aggagaactc 1354561 ttggatacag cgctggcgtc cggcatgccc gtagcgctcc gccgttgccg ggtcggcgac 1354621 caaggcattg accgcctcag ccaatctggc ctggtaaccg gtcgcgtcgt cggcgtcgta 1354681 atgcaccagt gagccggtga tcccgtcggc gaccacctcg gggatcccgc cgacgtcgga 1354741 ggccaccacg gcggttgcgc acgccatcgc ttccaggttt acgataccca gcggctcgta 1354801 caccgacggg cacacgaaaa ctgttgctgc cgaaagtatt tctcgtagtt gtccgatggt 1354861 aagccggtct tggatccaaa acacgccagt gcgattgcgg gccagttcgg ccaccgcgac 1354921 gcgcacttcg tcggctactt ccggcgtgtc cgcagcaccc gcgcagagca ctagctgtac 1354981 gtccgatctg aatcggtgcg cggctgttac caggtggacg actccctttt gccgggtgat 1355041 tcgcccgacg aacaccgcca tgggccggtt cggatcgacc ccgagctcgg ccagcaccga 1355101 cccggtacgc gcgggcccgg ccggatacca cgtctcggtg tcgatcccgt tccggatgac 1355161 gtgcaccagg ttcggatcca ggctgggata gacccgcaac atgtcgttgc gcattgcaga 1355221 actgaccgca atgaccgcgt tggcggccag caccgcggtc tgctcgaccc atgtcgatac 1355281 ctggtagccg ccgccgagtt gctccttctt ccatggccgc aacggttcga gcgaatgtgc 1355341 ggtcaaaaca tgcgggatgt cgtagagtat cgcggccaga tgccccgcca gagcggtgta 1355401 ccaggtgtgt gaatgcacga cggtggccgc gctggcggca ttggccatca ccaggtccgc 1355461 ggacaaggtg gacagcgccg cgttggcgct gcctagcctc gggtcgggcc gataggcaaa 1355521 tgcgcccggg cggggtgcgc ccatgcagtg cacgtcgacc gcgcacagcc ggcgtaggta 1355581 ggcaaccagt tcggtgacat gtaccccggc tccaccgtaa acctccggtg ggtattcccg 1355641 agtcaacatc gccacccgca taccccgcac cgtagtgcgg tgacggggcg gcccgcgtgg 1355701 cgggccgagg aggaggcgga ggcggcacag cacccgtcga acggggccaa acaccttgac 1355761 ggacagcccg tcagagcagt agccaggggc ggattcccct tggcagtggt ttgcgggggc 1355821 cgataggttt gagccatgag agaagtgccg cacgtgctgg gcatagtctt agccggcggt 1355881 gagggcaagc ggctttatcc gctgaccgcg gaccgggcca agcccgcggt tcctttcggc 1355941 ggcgcctatc gattgatcga tttcgtactc tcaaacctcg tcaacgcccg gtatctgagg 1356001 atctgtgttc tcacccaata caagtcgcat tcactggacc gccatatctc gcagaactgg 1356061 cggttgtctg gtctggcggg tgagtacatc accccggtgc cggcacagca gcgcctcggc 1356121 ccgcgctggt ataccggctc cgccgatgcg atctatcaat cgctgaactt gatctacgac 1356181 gaagatccag actacatagt ggttttcggc gccgaccacg tctaccgtat ggatcccgaa 1356241 cagatggtcc ggttccacat cgacagcggt gccggcgcga cggtggccgg catacgggtt 1356301 ccacgtgaaa atgcgaccgc gttcggttgt atcgacgccg atgactccgg ccgtattcgc 1356361 agcttcgttg agaagccgct ggagccgccc ggaacccccg acgaccccga caccacgttc 1356421 gtctcaatgg gcaactacat tttcacgacc aaggtgctta tcgacgcgat tcgcgccgac 1356481 gccgacgacg accactcgga ccacgacatg ggtggtgaca tcgttccgcg gttggtggcc 1356541 gacggtatgg cggcggtcta tgacttctcc gataacgaag tgcctggtgc caccgatcgc 1356601 gaccgagcat attggcgcga cgtcgggacg cttgacgcgt tttacgacgc acatatggac 1356661 ctggtgtcgg tgcacccggt gttcaacctg tacaacaagc ggtggccgat ccgcggggag 1356721 tcggagaacc tggcgccggc gaagttcgtc aatggcggct ccgcacagga gtcggtggtt 1356781 ggtgccggca gcatcatctc ggcggcctcg gtgcgtaatt cggtgctgtc gtcgaacgtc 1356841 gtggtcgacg acggcgcgat cgttgagggc agtgtgatca tgcccggcac ccgcgttggg 1356901 cgcggggcgg tggtgcgcca cgcgatcctg gacaagaacg tcgtcgtcgg gcccggtgag 1356961 atggtcggcg tggatctgga gaaggaccgg gaacgcttcg cgatcagcgc cggcggcgtg 1357021 gtcgccgtgg gcaagggtgt ttggatctag gtccggttag cggcgcgagc agacacagaa 1357081 tcgcccattt cggcacgaaa ttgggcgatt ctgcgtctgc tcggcgcggt ggggcgcgcc 1357141 ggctagggcc ctggcggccc gggttggccg aacagctgcc cgccagcgcc gccgcgagcg 1357201 ccggccgcgg cggccccgcg ccacctccca cgccgccgtt gccgatcaac cccccgggcc 1357261 cgccgtcttg gcccggtccg ccattggcgc cgtcaccgat cgaacagtgc ctgggtggga 1357321 gcgttgatca cattcagcac gtcttgctgc acgctctgcg ccacagcagc gttgacggct 1357381 tcggcagccg cataggcccc gccagcgccg gtcagggctt gtacgaactg ctgatgaaac 1357441 gccgtcgcct gcaagctaag cgcctgatag gcctgagcgt gtctggcgaa cagtgacgcc 1357501 acgaccgccg atacttcatc ggcacaggcg gccagcatcg cggtggttgg ggctgccgcg 1357561 gcggcattgg ccgcgctcaa tgccgagccg atgcccgcca aatccgttgc cgccgatgcc 1357621 agcacgtccg gggcgccacc agatacgaca tggccacacc ttatcgtggg ctcgttacgg 1357681 catgcggtgt tttcgacgga ctcgtcaccg acgccgcgcg tgtgacgcgc gccgtcagcc 1357741 agcgctcggc aacccgggct acccagggac ctccggtatc agcaggtgcg cgtcgtagcg 1357801 tgggccccag tgcagcgtga cacgaccacg cggcgggcgt gggtaggcgg ccgggaattg 1357861 gccggtgagc gggttgcggg gggacaacca gcgtccgcca accaccagtc gtaactgttc 1357921 gccggcgcgg aacaatgtcg ccgacgggcc aagcgcgaca tcgacggcga cgacctcgcc 1357981 ggcggtgacc ggccggggcc gagcacacgc cgggaccggc tcccatggct gcgagagctc 1358041 ggggtcgagc tcgcgcagcg agacccgctg ccagccggtg gtcacccggt cacggcccca 1358101 gccgtaggac ccctcaaacg caacgaactg gccatcgcgc cacttctcca ctccgacgaa 1358161 caggttcgcg tcgtcgcagc catccaattg aacccacagg cgggcggcca tcgggccggt 1358221 caactcgatg tcttcgggga tcgtccaatt gaatgctgct gcccgagagc gagtttggaa 1358281 cctgatgctg cccgccgtcg gcggcggctc ggttgccagc agccccggcc cggcgagata 1358341 cattggccgc caacgcgtgc cggcaagcgg ccactgggtc tcttcacgca ccgcggtgat 1358401 ggtgtcgcga tcctcacgca cctcgaggcg aacgctgcgc gaaccggagg agccggccag 1358461 cgcgtctcgc aagaacttca gctgctcgga cagcgcggtc gctgagtaga aggtctccca 1358521 tttgcccccg cgatgggtat acagccgggc gtgaccgcag ccgctgcggg taaaagcgcg 1358581 gatcgacccg cggctgtgca agttgttgtc cgagaagcta ccgcagacca gcatcggaac 1358641 cttgatcgcc gacaggtcgg gtactcgcga gcgccagaaa tcgtcgcgca gcgggtgagc 1358701 ctcttgcatc tgctccatgt cgtaggtctg acgtgtgcga cgtcgcaccc cgcgcgacca 1358761 cagccgggtg aaccctgact cccggatgcc gccgggaaag gccaagtcgc ggtaggcgtc 1358821 ggtgaaaccc tcccacgggc agatcgcccg cagcgccggc ggttgcagcg cggccacggc 1358881 gtactggcta atggccagat aagacacccc cagcatgacg acgcgcccat cactccatgc 1358941 ctggtcggcg agccatccca ccaggtcgta ggtgtcctcg gcttcctggt gtgacagcag 1359001 gtctccggta ccgtcggagc ggccgcagcc gcgcgaatcc gcattgacca cgacgaagcc 1359061 ctgcgcggtc caccacgccg ggtccggcgc ctcccagccg gtcagcgccg agaaggtcag 1359121 cggcttcggc tggcgcagca tccggtattg tggtgagaac gtccaccggt tgccccgccg 1359181 ccgcggcagg gcgtccttgc cgtagggatg gatgctcgcg atcaccggcc tagccccacc 1359241 ttcggcgcta cgaaagacgt tgatccgcag cagcgttccg tcgcgggtag gcacctcgac 1359301 gtcgcgttct atgacgacgt cggccggcgg atcggtgacg gtgatcggcg gcttggcgac 1359361 gccgcgaacc cgctccagcg cataccggag agcaccggga cgtcgccacg gccggtccaa 1359421 ggcaggtgac gggtttctgg ccacgcccgt taccctaaag ctattcgacc gctaccacac 1359481 gtagggcacc aaccggtagc gcaccagttg ccggtattcg cggtacccgc tgagttcttg 1359541 cgtcagtagt ttttcctcgt cgaggatgcg gaacaccaac accagtgtgc cggggacgag 1359601 gatgaacatc gcccagtaag agcccagtgc cagcggtatg cctgtcatca tgaccacgtt 1359661 cccggcgtac atcgggtgtc ggacaatttt gtagagaccg tcggaggcca atatctggcc 1359721 cgcctccacc ctgaccgtcg aggcggcata cctgttctgg atgaccacca gcatggcgat 1359781 gccaaggccc gtcatcacta ggacgtcgcc gatcacgcac accgcggctg gcactgacga 1359841 ccaaccataa cgatggtcgc acgcgctcag caccatcatc gcgaagaacc ccagaaaagc 1359901 gccgatgacg atgaacttct gaatcgttcg gccctccgcg agcggaccgc tgcgcatgcg 1359961 acgttgaagg gccgcgggat cgttgcgagc cagatagatt gtggggccaa tcgtggtgct 1360021 cacaaatgcg gcgaggaaca cccacgcctg ccaatagtcg aacgtgccgg ctggcccgaa 1360081 taggagcgcg ccgaaaacga cgagtcctaa cacgccccat atgaatatct tcagcccaat 1360141 gtgcatggct cctcctagca gcgaacgtca cgccgtcgga aggccatggc gcccagggtg 1360201 atcagggctg catctatggc cagcagccac agcaacggca ccgcggtgaa atcgccgccg 1360261 ccgacccgcg ggatgtgggc gaacggctcc aggttgagca gcatctgcgg gaaccccgcc 1360321 aacgagccga gcaggtacag cgcgatgaac ccgaccagca cgccccacgc caccggcgtg 1360381 aaccgcggcg ccaacccgaa caatcccacg gtcaccgccg ataacaacca cacggccggc 1360441 agttgcacgg ccgcggtgcc gaccacggtg ggcagcttgc cgccgacgtc accgacggtc 1360501 atgccgtagg cgagtccggc cgccacgccg gagatcaggg tcgccaccgc cgatccggcc 1360561 agcgccatcg ccagatggct tgccagccaa tgggtccggg aaaccgcccc ggcgagcagg 1360621 gtctcggccc gcagcccggt ttcctcttgg tgcagtcgta gggtcagcga gacggcgaat 1360681 gcggcggcga ccatgccgat catggtgaag gccagcgcaa ggaaggcctg ttccagtgcg 1360741 ccggtgccgc ccatccgggt gacgatgtca cgcaccgcgg tgttatcgcc cagctgatcc 1360801 ccgatgccgt gcaccacact gcccatcacc agcccgtaca ggcacaggcc gacggtccac 1360861 aacagcaggg agccgcgatt gagccgccat gccagcccga agggctcgct cagcatgggc 1360921 ccggcggtgc cggcgccggg gcgttcggcg atcagtccgg caccgacatc acggccggcg 1360981 cgtaatcgat aggccagcac ggtaagcacg gccgcggtcg ccagcgacag cagcagcacc 1361041 caccaacgct ctcccgcgta gggtctgacc tgcagcgacc accccagcgg cgagcaccag 1361101 gacagcgtgc ccgagccggc atcaccgatg gcacgcagcg cgaacgcggt gcccaggacg 1361161 gcgaacgcga ccgcgcgggt gaatcgggcg ctcggcgaca gctgcgcggc caccgcggcc 1361221 accgccgtga agaccatccc ggaggccgcc agcgccacgc caaacgctac cgacccggcc 1361281 ggagccacat cggtggcaag cagacccaat gcaccgatcg cgccggtcgc gatcgacgca 1361341 ccgaacgaca gcagcagcgc gccggtgagg ttggcgtagc gcccgaccac ggtcgaatcg 1361401 atcaattcgg cacggccgct ttcctcgtcc gcgcgggtgt gccgaatcac cgtgaggatg 1361461 accgccaccg cgatgagggt gtgaaacatc ccggctttcc agattccgac cgcacccagg 1361521 ctgtcgttgt agaccggccc gtagagcgcg cgctgtgccg ggctggccat aatggcggcc 1361581 gccgcggcgg cgcgggcgga ccggtcgggg taaaccgttt cgacgctggc gatgtacacg 1361641 gtggccagcg gcaccgacag cagcagcacc cacagcggca acgacacccg gtcgcggcgc 1361701 aggtacaggc gcagcaaccc cagtgtgccg gtgaagcccg aaccgcggtg tggtgcacgg 1361761 tgtcctgcgg gtctcgcgcg atcgatgacc gtactgctca cggcgttgcc acctgttgct 1361821 cggctgcgac ctcggggccc aggctgtagt ggcgcaggaa cagctcctcc agggtgggcg 1361881 gctgactgac caggctgcgc acaccggcgt ggccgagcac ttggatgagt tctctcaggc 1361941 tttcgctgtc gacctgggcg cgcactgtgg tgccctcgat gctgatgtcc tcgactccct 1362001 tgatttggct gaggtctcct ggatcaccga tcatttcggc cttgatcgag gtgcggctga 1362061 ggtgccgcaa ggcgtctagt gaaccgcttt cgacggtctt gccggctcgg atgatggtca 1362121 ccttttcgca cagcgcttcg gtctcggcca gaatatggct ggacaacagc accgtcacac 1362181 cgcgttggcg tgcttcgccg atgcactgct gaaacacgtt ttccatcaac gggtccaggc 1362241 cgctgctcgg ctcatccaag agcagcagag tggcgtgcga cgacaatgcc gagatcaggg 1362301 agaccttttg gcggttgccc ttggagtagg tgcgcgcctt cttggttggg tccaggccga 1362361 agcgctcgat cagttccgcg cgacgagcgt tgtcgatgcc gcctcgcatg cgggccagca 1362421 ggtcgatggt ctcaccaccg gtcagcgacg gccacaatgt gacatcgcct ggaacatagg 1362481 cgatgtggcg gtgcaggtcg acggcgtcgg tccaggggtc accgcccagc aaccgcacgc 1362541 ttccgccgtc ggccttcacc aggcctagca ggatgcgcag ggtcgtggac ttgcccgcgc 1362601 cgttggggcc gaggaagccg tgcacttcgc cctcgcgcac cgtgaggtcg agcccgtcga 1362661 gcgcccgcac cgacccgaag tgcttggtca gtccgcgaat ctcgatgggc acctggtggt 1362721 tgtcagccga catgtgcttc tccttgttga gcttcggcca ggaaggcctc gtacatggcg 1362781 cggtcggcca gcaggccttc ggtgtagacc tccagggaag gcagcaccat gtcgtgcgcg 1362841 tagtcgcgta acgctgcacg gagatcggtt gggttttcgt gcatttgcag ataaagcagg 1362901 aagcctccgc ctccggtgat cgccagaaac cgagcacggg cgcgcgggtc gcggctgggc 1362961 ttgaccgtac cggcgcgtac tccttcgtcc aggtactcct cggcgttgtc gatcatcttc 1363021 tgccacagca tcttcgccag ctcgccgccg gattgcatgc tgcgcaccag gtatgccatc 1363081 agcggtgcgt aggattcgat ctcggccatc tgcgcgagcc aggtggtcgg gtcgttggac 1363141 ttcagtgccg cagccttgct gctgcggatc tcttcggcga cgaagtcgtc gcaggccttg 1363201 cgcagacctt ccttggaacc gaaatggtgg atgaccaatg ccgcgctcac ccccgccgct 1363261 tcggcgatgg ctcgcagccc gacaccgaat ccgtgccgac cgaactgttc gatggccgcc 1363321 tctctgatcc tggcgtgcgc ggtcagatcg gctgaacgca tgttcaggat attaaacgta 1363381 cgttcatccc cggtcaaggg agggcgccgt tgggaatccg tgaaggccgc gaactttgcc 1363441 gagcagacgc aaaatcgccc tggaacgcac ggttcagggc gattttgcgt ctgctcgccg 1363501 aattagtccc gcacggctgc cagcacgccg tcgcccagcg gcaccagtgc cggagtgagc 1363561 cgttcatcct cggcgataag ccgggccgcc tcgcgaaccg cgatcacctc ggcgtcgcgc 1363621 gccccgggat caccggcccg accgcccagc gccgcccggt gcacgacgat gaccccgccg 1363681 gatcgcagca gccgcacccc ctcggcgacg taatctggct ggtcgatcgg gtcggcgtcg 1363741 atgaatacca ggtcgtagga tgcgtcggcg agccgggtca gcacctcttg ggcgcggccg 1363801 ctgatcagcc tggtacgcga cggcccgatg cccgcctcgg caaaggcctg cctggcaagg 1363861 cgtagatgct cgggctcgat atcgatggtg gtcaagacgc cgtcgtcgcg catgcccgac 1363921 aacagccaca ggccgctgac gccggccccg gtacccactt cggccaccgc cttgcctccg 1363981 ctgagcttgg ccagcaagca cagcaacgca cccaccgccg gtgttaccgc cccggccccg 1364041 atgtcggttg cgcgctcgcg ggcgccggcc aggatcacgt cttcagatat tgacccctcg 1364101 gcgtgcgccc agagtgattc gcctcggctg ggggccggct ggccaggcat gtcgtcgtgt 1364161 ccgggggtgc cgtccatgcc cgcagcgtat gtccaattgg cgacgccgtc gggcaggcgc 1364221 gcctggttcg aacgccggcc gagcaccgag ctggacgctt gcggctgtac ccgacacgcc 1364281 cggcgtgccg gacgcgacga aggtcacttt gactcgatat tccctggaca gcgcaggtaa 1364341 cggtatggtt tctaagccaa agctcagatt gctcatatat ggcccatacg ccggtacgcg 1364401 acggtaattc ccatggaact cctcggcgga ccccgggttg ggaatacgga atcgcaactt 1364461 tgcgttgccg acggtgacga cttgccaact tattgcagtg caaattcgga ggatctcaat 1364521 atcacgacca tcacgacctt gagtccgacc agcatgtctc atccccaaca ggtccgcgat 1364581 gaccagtggg tggagccgtc tgaccaattg cagggcaccg ccgtattcga cgccaccggg 1364641 gacaaggcca ccatgccgtc ctgggatgag ctggtccgtc agcacgccga tcgggtgtac 1364701 cggctggctt atcggctctc cggcaaccag cacgatgccg aagacctgac ccaggagacc 1364761 tttatcaggg tgttccggtc ggtccagaat taccagccgg gcaccttcga aggctggcta 1364821 caccgcatca ccaccaactt gttcctggac atggtccgcc gccgggctcg catccggatg 1364881 gaggcgttac ccgaggacta cgaccgggtg cccgccgatg agcccaaccc cgagcagatc 1364941 taccacgacg cacggctggg acctgacctg caggctgcct tggcctcgct gccgccggag 1365001 tttcgtgccg cggtggtgct gtgtgacatc gagggtctgt cgtacgagga gatcggcgcc 1365061 acactgggcg tgaagctcgg gacggtacgt agccggatac accgcggacg ccaggcactg 1365121 cgggactacc tggcagcgca ccccgaacat ggcgagtgcg cagttcacgt caacccagtt 1365181 cgctgaacta ctcaacggcc gccgagcgcg tcggttcggc taccgcatgg ttgccaatcg 1365241 gtcccgaatc ctggggtttt accggctggc gatggttttc cggcaccgcg ccgcgctaca 1365301 ttcgagatac cggtggctcg ctaggtggcg gaaggaggtg gtgatggccg accccggaag 1365361 cgtgggacat gtgttccggc gcgcgttttc ctggctcccg gcgcagttcg cctcccagag 1365421 tgacgcgccg gtcggcgcgc cgcggcagtt ccgttccacc gagcacctgt caatcgaggc 1365481 catcgcggct ttcgtcgacg gcgagctgcg gatgaacgcg cacttgcggg ccgcgcatca 1365541 cctttcgctg tgtgcccaat gcgcggccga agtggacgac caaagtcgtg cccgcgccgc 1365601 tctgcgcgat tcccacccga tccgcatccc cagcacgttg ctcggattac tgtccgagat 1365661 cccgcgttgt ccacctgaag gtccatctaa aggttcgtct ggaggttcat cccagggccc 1365721 gcccgacggg gctgcggcag gcttcggcga ccgcttcgct gacggcgatg gcgggaatcg 1365781 gggccggcaa tcgcgggtgc gtcgctagcc ggtgagccac ttgtcgcagc gcatggcggg 1365841 gttgctgcga gttcatggcg agtggtcgcg atccgtggat actagggtgg acacggacaa 1365901 cgcgatgcct gcacgtttta gcgcccagat tcagaatgag gatgaggtga cctccgacca 1365961 aggcaacaac ggcggcccga acggcggagg ccgcctggcg ccgcgcccgg tttttcggcc 1366021 accggtcgac ccggcgtcgc gtcaagcgtt cgggcgtccg tccggggtcc aagggtcctt 1366081 tgtggccgag cgtgtgcgcc cgcagaagta ccaggaccag tctgacttca caccgaacga 1366141 tcagcttgct gacccggtgc ttcaggaggc gttcggtcgt ccgttcgcgg gcgccgaatc 1366201 gctgcagcgc catcccatcg atgccggagc gctggcagct gagaaagacg gtgccggccc 1366261 cgacgagccc gacgatccgt ggcgcgaccc cgcggccgcg gccgcgctgg ggacgccagc 1366321 gctagccgcg ccggcaccgc acggtgcgct ggccggcagc ggcaagctgg gtgtgcgcga 1366381 cgtgctgttt ggcggcaagg tgtcctactt ggcgctgggc atcttggtcg ctatcgcact 1366441 ggtgatcggc ggcatcggcg gtgtcatcgg ccgcaagacc gcggaagtag tcgatgcgtt 1366501 caccacgtcg aaggtgaccc tgtcgaccac tggcaatgcc caggaaccgg ccggccggtt 1366561 caccaaggtg gcggccgccg tggccgattc ggtggtgacc attgagtcgg tcagcgacca 1366621 ggagggcatg caaggttccg gcgtcatcgt cgatggccgc ggctacatcg tcaccaacaa 1366681 tcacgtgatc tctgaggcgg ccaacaatcc cagccagttc aagacgaccg tggtgttcaa 1366741 cgacggcaag gaggtgcccg ccaatctggt gggtcgtgac cccaagaccg acttggccgt 1366801 cctcaaggtc gacaacgtcg acaatctgac cgtggcccgg ctcggtgatt ccagcaaggt 1366861 acgggtcggt gacgaagtcc tcgcggtcgg cgcgcccctg gggctgcgca gtacggtgac 1366921 ccagggcatt gtcagcgcgc tacaccgccc cgttccgttg tcgggcgagg gctctgacac 1366981 cgacaccgtc attgacgcaa ttcagaccga cgcctcgatc aaccacggta actccggcgg 1367041 tccgctaatc gacatggatg cccaggtgat tggcatcaac accgccggta agtcactgtc 1367101 ggatagcgcc agcgggctgg gctttgcgat cccggtcaac gagatgaaat tggtggcaaa 1367161 ttctctgatc aaagacggaa agatcgtgca tccgacgttg ggcatcagca cccggtcagt 1367221 aagcaacgcg atcgcgtcgg gcgcgcaggt ggccaatgta aaggcgggaa gtcccgcgca 1367281 gaagggcggg atcttggaga acgatgtgat cgtcaaggtc ggtaaccgcg cggtcgccga 1367341 ctccgacgag ttcgtcgtcg ccgtgcgcca gttggctatc ggccaggacg ctccgataga 1367401 ggtggtccgc gagggtcggc atgtgacgct gacggtgaaa ccggaccccg atagcaccta 1367461 gagtgttcgc caacatcggt tggtgggaaa tgctcgtcct cgtcatggtc gggctggtgg 1367521 tgcttggccc ggagcggctc ccgggtgcca tccgctgggc ggcaagcgct ctgcggcagg 1367581 cgcgcgacta tctcagcggt gtgaccagcc agctacgtga ggacattgga cccgaattcg 1367641 atgatctgcg gggacatctc ggtgagctgc agaagctacg gggaatgact ccgcgggctg 1367701 cgttgaccaa gcacctactg gatggcgatg attccctgtt caccggagac ttcgaccgac 1367761 cgacgccgaa gaaaccggat gcggcgggct cggcggggcc ggacgctact gagcagatcg 1367821 gtgcggggcc catcccgttt gacagcgatg ccacctagat cggtgacggc cggcggtcgg 1367881 gcccggcgag ctaacacccg agcaacggcg gcaggccggc caccgagtcg atcacgtggt 1367941 gcggccgggt cgcgctggcg ccggccagcc agcgatccag cgtttgctgg cggaacttgc 1368001 cggtgcgcac cagcacaccc gtcatgccca ccgcctgggc ggccagcacg tcgttgtgca 1368061 gatcgtcgcc gatcatgacc atctgctgtg gatcgacacc gacgcggtcg gcggccgcca 1368121 ggaatccctc ggccgcaggc ttgccgatgg cggtggcggt cttgccgcag gcctgttcca 1368181 ttccggtcag gtacatcccg gtgtcgatgc gcagcccgtc ggtggtgttc caggtcatat 1368241 tgcggtgcat cgccaccacc ggaacgccgt cgagcatcca cccatagacc cggctgagcg 1368301 tgcggtgatc gaactggggg ccggcactgc cgagcacgac gacgtcgggg gcttcggggc 1368361 aatcctcggg accgatctcg gtcgacaaga cgacgtcgat gccgggcaag tcctcggtga 1368421 tgtcgccgtt gttcaccagg aagcaccgcg cgccgggata ggcgccgtgc aggtactcgg 1368481 ccgtcagcac cccggccgtg atcacgtcgt cggcggcgac ggggatcccc gcggcaccca 1368541 gcgcctcggc gatctgccgg cgggtgcgcg tcgtggtgtt ggtcagatac gcgcaggcga 1368601 ttccccgatg ggtcagttgc cgcacggtct cggcggcccc gggaatcgcg cgccacgaca 1368661 gcaccagcac gccgtcgatg tcgaacagca ccgccgcggc catcagatgc gccacgtcca 1368721 cacgatatcc gtcagttaga ccgtcgacat cgacaccagc gcggaaaaac cccagtgagc 1368781 atcgcgctga cgtcgatctc gacggtgagg ttcatcctgg ctcaggatcc ctcaagatcc 1368841 gtggcgcaac cacacactgt cggccaccca gggcgacgcg gcgccggcca ccgaccacgc 1368901 cagctccgcg ggcacatcga gcacctgata acccttgcgg cccgccacgg tggccgccac 1368961 gagcgtcgcc acccccgccc tccgctggaa cagtgtctgg cgcaccgtcc agccgatgat 1369021 gccggtgcag gcgatgcaat cgcggcgacg ctgtaggctg ccggcgcgcg caaccaacca 1369081 gccgtcggcg acgcggtgcc cgagtgatcg gacccgatcg acggccagcc cagcgcaacc 1369141 cgcggtcaac accgcccaca gtgtccacgc ccaccccggc acgccgagaa tcggcgccgc 1369201 tgcgatcagc gcaactccgg ccagcgtcgg gaccaacagc gcccgggtcc acctgcgccg 1369261 ggcggcggcc gggccgtgcc ggcgcagcgg ccccgctgcc gcgtcggtgt tgtcgatcag 1369321 gtcggtcagc acggccgtcg cggtctcgaa cggacatggt ggcagcagca tcgacgactg 1369381 gccctcgcca tgcacgccgg tcatcactgc gtccagccga gcaccgcgca ataaccgcac 1369441 cagcagtggt tcacgcaagg tggcgccacg cagccggcgc atgtcgtagg tgtgctcgcg 1369501 cacccgcagc agcccgtgcc gcaggtgtag caccccttct tgaccgctgc cgccgcggcg 1369561 cagcagcaga ttgccgtagg tcaaccagga gaacagcacc gccaacagtg ccgatacacc 1369621 caccaccagc agcacagtga ccgccaccac cagtaccacc ccggcgcgtt gcgcggcgtc 1369681 caccgcggac ctggcgaaac cggattccgg gagtcgcacg gccagtcccg tttggtagcc 1369741 aagcccgatc accgccccga tcatcaccag gcccgaaaag ctcagcggcg cataccgcaa 1369801 ccacgacgac tgccaccggg ccagcacccg accggtcggc tcgacgggtg ccagcgactc 1369861 ggccagcagc agcgcgcgca gcctgggcac ccgtgccgag tcgaccgcgt ccagttcgaa 1369921 ggcggcctca ccgcgggcct cctggccggt gcccacccgc agcaccgtca accccaacag 1369981 ccggtgcaac agccgcgcct cggtctgcac cgagcgaatc cggttgcgcg gcacggagac 1370041 cgcgcgccgg ctgagtatgc cggtacgcag cgacacgttt tcgtcgtcga tgcggtaggt 1370101 ggtgaaaaac caacgcagca cgccgaatac gaccgtcacg ccgagcgccg ccagcggcca 1370161 gaccgggttg ccggttgccg accccagcac cacggacccg atgagtaccg ggagctggcg 1370221 cagcatctcg tgcaccggat gcaccagcag catccgcggg ctgaggcggt gccaatcgtg 1370281 tggccggtcg gtcatgtcgc gtcctcgccg cgcagcgcgg cgatgtcggt cagctgcgcc 1370341 accacccgat cggcgacgtc ggtgtccaac gcctcgatgt gcaccgcgcc cgccgaggac 1370401 gccgtggtta cggtgacgtt ggccagcccg aacagccggt ccatcgggcc gcggtaggtg 1370461 tcgacggtct gcacccggga aatcggtgtg atgcggcgct cctgcacgag ccaaccggtg 1370521 cgggtgaata cggcctgcgg gctgatctcc caacggtgta cccggtaacg ccagagcggg 1370581 accaccccga tgtgcaccac catcgccacc gcggtgagag cggccgcggc caggtgcggc 1370641 cagggcggct ggggatgcac cgcccaccac accagctgcg cgatcaccgg gagtatccag 1370701 cccagcgacg cggacagcgc ccacatcacc ggcgcctggc tgctcggtcg atgggccggc 1370761 tcggcgagcg cgaggtgatt tctctgcggt ccggttgcgc ttggcacatt tcgagcatgg 1370821 tccaacggaa accgaacaca gtgatcgggg gtcgtggtta tcgtttgagc tagcgctcaa 1370881 caagatgcgt gccaactcac cctgccccgg ggaggcgcga tgagtcgaca gtggcactgg 1370941 ctggcagcga cgctgctcct gatcaccacc gccgcgtgca gtcgtccggg caccgaggaa 1371001 ccggattgcc cgacgaaaat aaccttgccg cccggtgcta cgcccaccac gaccctcgac 1371061 ccgagatgca tagtgcgcgc gaccaccacc ggcacagccg acggcgatgc ggcgtcgcgc 1371121 tggaccggaa ccgtgcggat cgccgggttc tatgcctcga tctgcaacgc ggtatgggac 1371181 gggaacgtca gccttgcggg aaaggacgag ctgaccggca aggctacgct tatcctcgtc 1371241 gaaaccagtt gcccgggcaa ggttgtcgcc ggcgaactcg tgctgaaggg gaacgtcggt 1371301 tcggacagcc tcgcgatcac ctgggcgcac cccgaactcc cgcagcgggc gttcgacctc 1371361 ggcgccggac agggcacgat ccgccgatcg ggcgaccgtg ccgagggaac gttcaactcg 1371421 gatatgggtg ggggcaccga gttcttcttg acgtggtcgc tgacgatgcg taactgacga 1371481 tcacaacgtg cccaccaaaa acagagtaga caacagtcga caattccctt gtactccggc 1371541 gctatgaagt cgatctccgt cggtgagctg cgccagaatc ccgctcccat gatcgccgac 1371601 ctcgaacggg gtgagccata cgcgctgacc cgccacaacc accggatcgg aacgatcatt 1371661 cctgccgtct cgtcggcaac actcattccc cggaaagcct agtacgccga gcagacgcaa 1371721 cggcacccaa tttcgaccag aatcgggttc ttttgcgtct gctcacgcgg tcaacgctag 1371781 cgtcgtgtcg ggtccaaccc cagcgacatg cccgccaatc cgcgtcgtcg agtcgacaag 1371841 ccgtcggcga tgctatgcag ttccttgccg atcgccgagt ccggcgagct caacacgagc 1371901 ggtacgcccg aatcgccggc ggccaccagt gcggggtcca gcgggatctg acccagcagc 1371961 ggcacgtcgg cgccgaccgc acgcgacaac cgctcggcga ccagccggcc accgccctcg 1372021 ccgaacacct gcatcgtggt gccgtccggc agcgtgagcc ccgacatgtt ctccacgacg 1372081 ccgacgatgc gttggcgggt ttgcagcgcg atgctgccgg cccgttcggc cacctccgcg 1372141 gcggccagct gcggggtggt gaccaccagg agttcggcgt tggggatcag ttgagccacc 1372201 gagatggcga cgtcgccggt tccgggcggc aagtccagca gcagcacgtc cagatccccc 1372261 cagtacacgt cggccagaaa ctgctgcaac gcccggtgca gcatcggccc gcgccacacc 1372321 accggggtgt tgccctgggt gaactgggct atcgagatga ccttcacctg gtgggcgatc 1372381 ggcggcagga tcatcgactc aacctgggta ggccggtcgg tggtgcccat catccggggg 1372441 atagagtggc cgtggatatc agcgtccagc accccgatcg acaggccgcg gacggccatc 1372501 gcggcggcca ggttgaccgt gacggtggac tttccgactc cgcccttacc ggaagccacg 1372561 gcatacaccc gggtcaagga atcgggttgc gcgaacggga tgacgggttc gcgggtatcg 1372621 ccacgcaact gcttacgcag ctcggtgcgc tgctcgtcgc tcatcacgtc caagctgacc 1372681 cgcaccgccg aagtgcctgg cacgtcggcg accgcccggg tgacacgctc ggtgatttcg 1372741 gacttcttcg ggcagccggc gatggtcagg tagatctcga cgtgcacgct cccatccggg 1372801 ccggtgtcga tgcttttgac catccccagt tcggtgatgg ggcgccgcaa ttcggggtcg 1372861 attaccttgc ccagcgcggt gcgtatcgcc gcgttcaggt cgccatcacg agttccggac 1372921 atcaccgccg agtgtaggcg gcttggcata cggccgagtg gtcagccggc aggagccggc 1372981 gccggcggcg ccaggcccgc gtcgccaggc gggccggcca atggatccgg aggtggggga 1373041 gcggcaggta ggaatggagg tgggggagcg gtaggcggga acggcggcgc gcccactggc 1373101 gggccatgtg agccaatgca gatcagcgtg cagccgggca tcggcgccga tgggtcaggt 1373161 gccatccacg ggaacatcgg cggtggattg agcgccgcct ggcgcggggt caagtcgatc 1373221 agcggcaggt gcgccatggg gccatcggcg gtcaggccgt tgacattgat cggcaagccg 1373281 ggcccgagac cctccggatt ctcgaggtgc gcgtcgccga gtggtggtgg cggaccggtg 1373341 atcgggggca agtcaaccgg gaacacaccg gtggcgtagc cggcggccca gcccagtacg 1373401 ttctgggcgt aaggcatcga gttgttgtag cgcaggagcg cggccatgac ctgcgccggg 1373461 tcgcgcaggt tgagcccacc gctacacagg tagcgggctg cggccaacgt ggagtcgaac 1373521 aggttctgcg ggtcagccac accgtcgtca tcgccgtcgg tggcgtaccg agcccaagtg 1373581 ccgggcaaga actgcattgg ccccatcgcg cgggcgtacg tgacgcgatt gccgacgctg 1373641 ctttggatga tgatctcgtt gcctggcagg gtgccgtcca gcgttgggcc gtagatcggc 1373701 tggatcgcgg tgccgcgcgc gtcggtggcg ccgccgtttg cgtgcatcga ctcgatgcgc 1373761 ccaatcccgg ccagcaagtt ccaactgacg ccacagccag gggcggcagc ggccatcttc 1373821 agctcggcgt tgcggtaggc ggacagtgcc atggccggaa tgccaagcgc accaggcgaa 1373881 ttcacgatca tcggtggtgg tggagccgat atggtagcta ccgccacgcg gaagctggtc 1373941 ggcgggcgct tcatggcgat gacgaccgga ccggacaggt ctatgccgga cgcggcgacc 1374001 gcggccaccg gggtgataac ggcgtgcacc ggcgcggttc tcccggggaa taccggagcc 1374061 gcgctgccga ccgcactggc gaataccaac ggggcaatcg ctgccacgcc gaatgccggc 1374121 gcccgcgtta ggcgacaagc tccccgccgc actgcagcga cggccgggcg tgcaccccag 1374181 cgtcccccaa tgtgcactcg accgtcctca gtgtgtgagc cgtcggaaac ctatgtcttc 1374241 ttagcttctt tcttcgtttc gtgaactaga tcaccataca taactcttgt cacgggagtg 1374301 gcgcaatggc cgactcggta atcaccccga tttcttggcg tgctgctccg cctcgtcggc 1374361 cacccgcggc tgcgccacat ccggatccgt cggctgcagc tccgccaaca gagcgcgcag 1374421 gctgtccagt tcgtggcgca ggtagtcgcg cgtggggacc tcgccgatgg ccagccgcag 1374481 cgctgccagc tcgcgggcgt tgtactcggt gtcggccttg gtctgtgcgg cccgccgacg 1374541 atcctcttcg aacaccgcgc ggtcacgctt ttcctgacgg ttctgggcga gcagaatcag 1374601 cggtgcggcg tacgaggcct gcgtggagaa ggccagattg agcaggatga aggggtacgg 1374661 atcccagcgc aagccgaccg caaacaggtt cagcacgatc catgtcagta cgagcagcgt 1374721 ctgcaccagc aggtaacggc cggttccgaa aaaccgtgcg atggattcgg ttgtcctgcc 1374781 gacggcctcg ggatccagcc gcggggcgag cgtgcgcgat gtgcgtgggg tgtacagacg 1374841 gcgcggcgcg aagggtttgc tcaccgtggt cctccgggtc tgtccggtgc tccggagggg 1374901 tcgagctccg gcatatctac acgccagtca tgcggcaata gatggtcgag caggtcgtcc 1374961 acggtcaccg ctcccagcag gtggttctcg tcgtcaacca ccggtccgca caccaggttg 1375021 taggcggcga agtagcgagt caccgcggcc agcggggtct ccggagtgag cgtgagcagg 1375081 tcagtgtcca caactccgcc gaccagctcg gccggcgggt cacgaagcag ccgctgcaaa 1375141 tgcacacaac ccaggtagtg cccagtgggc gtggccgtgg gcgggcgcgc gacgaacacc 1375201 attgacgcca gggcgggggt gagatcggga tcgcggaccc gcgccaacgc ctccgcaatc 1375261 gaggtgtccg gggtcaacac caccggatcg gaagtcatca atccgcccgc cgtgtcgggg 1375321 gagtgcgtca gcagccttcg cacctgcccg gagtcgccgg gatccattcg tgtcagcagc 1375381 aactcggctt cggtcggatt caggaccgcg agcagatcgg cggcgtcgtc gggatccatc 1375441 tcctccagca cgtcggccgc gcgttcggtg cccagttgcg acaacacctc ggcctgatcc 1375501 agttcgggca gctcctgcag gacgtcggcc aagcgcttgt cgtggagcgc cttgaacacc 1375561 tcgtggcggc gcttcggcgg cagcccgcgg atggcgtcgg ccacgtcgac cgctttccat 1375621 ccctcgaact ggtcgagcag ctgtgccacg tcttgacccg gcatcgccaa ggccgacggc 1375681 gtcaaccccg ccacgttgtg ccagtccacg acgtgcactg ggcagcgccg tcggagccga 1375741 cgttgggtgc ggacggcgac cctagtcacc atccagtcgc gacttcgggt ttgctcgaca 1375801 cccaggtcgg tgaccacgac gtcgacgccg gccagctcgg gtagtgcggg atcgttgacc 1375861 ttcaccaggg tgtcgagcac ttgacccagc gccagagcct cgcctggccg ctgctcgaag 1375921 cggtgcagtg acacgttgcc ggtgctcagt gtcaccgcgt gcggctcgat cgcggcgacc 1375981 cgcagaatcg gtatgaatat cttgcggcgg gtcgccaaat cgaccaccag cccgagcact 1376041 cgcggttgtt ggcggacaat gctgatgctg atcacgacat cgcgaacgcg cccgaaggat 1376101 tcgccgagcg gtcccagcac cgacatccgc gagagccgcg ccaggtacac cctgttgacc 1376161 gatcccatga ttgagagcct aggcagctgc cttccggatc aaccgagggt gggccaatgt 1376221 cgcctaatgc taagggatag cgaagatccc cgcgatcatg tagaccagca gggtcgcgat 1376281 gccaatcaca atgccggcca ccgccaggcc gtagccttct tcgcgtgtct gcttgatctg 1376341 gttgatggcg atcgcgccga acacgatgcc cacgatcgag ccgatgcagc aaagcacacc 1376401 gacgagcgcc gagatcagtg agacgagcgc catggtgttc atgccgggct gcgatgggcc 1376461 gtagccgtct aggtagcccg gctccgggta gtagccgccc ggagatccac cgtatggcgg 1376521 aggcatgggc gggtatggta tgtcgccgta gcctgctgaa gaagtgccgg ggggtggata 1376581 tccgggcggc gcatagcccc cgggtggcat cggcggtgga tagccggtcg gatacccggg 1376641 ctggtaagca ggcgggtaac cggacggcgg atacgccggg ggcgggtggt tggccatcgg 1376701 cgaagatgcc ggcggcgccc aaggagcgtc agcaatgggc tgttcggggg gccgctcacc 1376761 gaccggaggc ggtccacccg cggcgtcgtg cgcactctcg ccagaggagc cgctgggagc 1376821 cgtcatggtg atcaacctat cccggcaacg atgctcgccg ttcggtgggc ctcggtcgct 1376881 cgcgggttga gtggatagtg tgccgggagt agctggacct gactggacat gaaacgatgg 1376941 cgctgaaaaa ggggggcgga ggagaatgag aaccgatgac tagcccattc cagcccagac 1377001 aggttcccgg ttcaacaccc gccgccgcag gtgcgggtcg acgtggtgtg cccgcattgc 1377061 ccaccccgcc gaaaggttgg ccagtcgggt cgtatcccac ctatgccgag gcgcaacgtg 1377121 cggtcgacta tctatccgaa cagcagttcc cggtccagca ggtgaccatc gttggcgtgg 1377181 acctcatgca ggttgaacgg gtcacaggcc ggctgacctg gcccaaagtg cttggtggcg 1377241 gcgtgctgag tggcgcctgg ctgggcctgt tcatcgggtt ggtgctcggg ttcttcagtc 1377301 ccaatccatg gtccgcgctg gttaccggcc tggtggccgg ggtgttcttc gggctgatca 1377361 cctctgcagt gccgtacgca atggctcgcg gcacaaggga tttcagctcg accatgcaac 1377421 tggttgccgg tcgctacgac gtactttgtg atccgcaaaa tgcggaaaag gcacgggatc 1377481 tgctggcgcg tctggcgatc tgaagcccgg acgagaggca aatgtggtca tgagtcgcgg 1377541 gcggataccg aggctgggcg ctgccgtact ggtggcgttg acgaccgcgg cggcggcgtg 1377601 cggggccgat agccaggggc tggtggtcag cttctacaca ccggccaccg acggcgcgac 1377661 gttcaccgca attgcccaac gctgcaacca acagttcggc ggccggttca ccattgcgca 1377721 ggtcagcttg cccaggtccc ccaatgagca acggttacag ctggcccgac ggttgaccgg 1377781 taacgaccgc accctggacg tcatggcgct ggatgtggtg tggacggcgg agttcgccga 1377841 agcggggtgg gcgctgccgc tgtcggacga cccagcgggg ctggccgaga acgacgccgt 1377901 cgccgatacc ctgccaggcc cgcttgcgac ggccggctgg aaccacaagc tgtacgcggc 1377961 acccgtcacc actaatactc aattgctttg gtaccgacca gatttggtaa atagcccgcc 1378021 aacggattgg aatgccatga tcgctgaggc ggcccggctg cacgcggcgg gcgagcctag 1378081 ctggatcgcg gtacaggcca atcagggcga gggcttagtg gtgtggttca acacgctgct 1378141 ggtgagcgct ggtggatcgg tgctctccga ggacggccgg cacgtcacct tgaccgatac 1378201 tcccgcacac cgagcggcta cggtcagcgc gctacagatc ctcaaatcgg tggctaccac 1378261 gcccggcgcc gacccctcga tcacccgcac cgaagagggc agcgcgcggt tggccttcga 1378321 acagggcaag gccgcgctcg aggtcaattg gccgttcgtg tttgcgtcca tgctcgagaa 1378381 cgcggtgaag ggtggtgtgc ccttcttacc gcttaaccgg attccgcagt tggccggcag 1378441 catcaacgac atcgggacgt tcacgcccag cgacgagcag ttccgcatcg cgtatgacgc 1378501 cagccagcag gtgttcggtt tcgcgcccta tccggctgta gcgcccggcc agccagccaa 1378561 ggtgacgatc ggcgggttga acctggcggt ggccaagacg acccgccatc gagcggaggc 1378621 attcgaagcg gtgcgttgtc tgcgtgacca gcacaatcag aggtacgtct cgctcgaggg 1378681 gggtctgccc gcggtgcggg cgtcgctgta ctccgatccg caattccagg cgaagtatcc 1378741 gatgcacgcc attattcggc agcaactcac cgatgccgcg gtgcggccgg cgacgccggt 1378801 gtaccaggcg ttgtccatcc ggctcgcggc ggtgctgagc ccgatcaccg agatcgaccc 1378861 ggagtccacg gccgacgaac ttgccgcgca ggcgcagaaa gccatcgacg gcatgggcct 1378921 gctcccgtga cctccgttga acagcggacc gccaccgcgg tcttttcccg taccgggagc 1378981 cgcatggccg aacggcgact ggcgttcatg ctggtcgcac ccgccgcgat gttgatggtg 1379041 gcggtgacgg cctatcccat cggttacgcg ctgtggctta gcctgcagcg caacaacctg 1379101 gccaccccga acgacaccgc gttcatcggg ctgggcaact atcacacgat cctgatcgac 1379161 cggtattggt ggacggcgct ggcggtgacg ctggcgatca cggcggtttc ggtgacgatc 1379221 gaattcgtct tggggttagc gctcgccctg gtaatgcacc gcacgctgat cggcaagggg 1379281 ttggtgcgca ccgcggtgct cattccgtac ggcatcgtca cggtggtcgc ctcgtatagc 1379341 tggtactacg cctggacgcc gggcaccggg tatctggcca acctgctgcc gtatgacagt 1379401 gcgccactga cgcaacagat cccgtcgttg ggcatcgtgg tgatcgccga ggtctggaag 1379461 acgacgccgt ttatgtcgct gctgcttttg gccgggttgg cgctggtccc cgaggatctg 1379521 ctaagagcag cgcaggttga cggcgccagc gcctggcggc ggttgacgaa ggtcatcttg 1379581 ccgatgatca agccggcgat cgtggttgct ctgctcttca ggaccctgga cgctttccgg 1379641 attttcgaca acatctatgt gctgaccggc ggcagcaaca acaccggatc ggtgtcgatc 1379701 ttgggctacg acaacctgtt caaggggttc aacgtgggcc ttggttcggc gatcagcgtg 1379761 ctgatctttg gctgcgtggc cgtcattgcg ttcattttca tcaagttgtt cggcgccgcg 1379821 gcgcccgggg gtgagccaag tgggcgttga acgggtgggc gcgcggcgcg ccacgtattg 1379881 ggccgtcctg gacactttgg tcgtggggta cgcgttgctc ccggtgctgt ggattttcag 1379941 cctgtcactc aagccgacgt caacggtcaa ggacggcaag ctgattccgt cgacggtgac 1380001 tttcgacaac tatcgtggca tcttccgggg cgacttgttc agctcagcgc tgatcaactc 1380061 catcggaatc ggcctgatca ccaccgtgat cgcggtggtg ctcggcgcga tggcggccta 1380121 cgcggttgcc cggctggaat ttccgggcaa gcggctgcta atcggggctg ccttgctgat 1380181 cacgatgttc ccgtcgatct ctttggtcac accattgttc aacatcgaac gtgccatcgg 1380241 cctgttcgac acctggccgg ggttgatctt gccgtacatc accttcgcgt tgccgctcgc 1380301 gatctacacc ctgtcggcgt tcttccggga gatcccttgg gatctggaaa aggcggccaa 1380361 gatggacggt gcaacgcccg gtcaggcttt ccggaaggtg atcgtaccgc tggcggcgcc 1380421 gggcttggtg accgctgcaa tcctggtgtt cattttcgcc tggaacgatc tgctgctcgc 1380481 gttgtcgctg accgctacca aggcggcgat taccgcgccg gtggccatcg ccaacttcac 1380541 cggcagttcg caattcgagg agccgaccgg ctcgatcgcg gccggcgcga tcgtgattac 1380601 gatcccgatc atcgtctttg ttttaatctt ccaacgacgg attgtcgccg ggttgacctc 1380661 tggcgctgtg aagggatagc gcgatggccg agattgtgtt ggaccacgtc aacaagagtt 1380721 accccgacgg tcacacagcg gtgcgcgacc tcaacctcac catcgccgac ggcgaatttc 1380781 tgatcctggt agggccttcc ggttgtggca agaccacgac gctgaatatg attgctgggc 1380841 ttgaagatat ctcgtcggga gaactgcgca tcgccggtga gcgggtaaac gagaaggcgc 1380901 caaaggaccg tgacatcgcg atggtgttcc agtcgtacgc gctttacccg catatgacgg 1380961 tgcgccagaa catcgcgttc ccgctgaccc tggcgaagat gagaaaggcc gacatcgcgc 1381021 agaaggtctc cgagactgca aaaatccttg acctgaccaa ccttctggat cgcaagccct 1381081 cacaattgtc gggtggtcag cgacagcggg tcgcgatggg cagggcaatc gtgcgccatc 1381141 ccaaagcatt cctgatggac gagccgctgt cgaacttgga cgcgaagttg cgggtccaga 1381201 tgcgcggcga gattgcccag ctgcagcgga ggctgggtac caccaccgtc tacgtcaccc 1381261 acgaccagac cgaggcaatg acgctgggcg atcgcgtggt agtgatgtac gggggcatcg 1381321 cacagcagat cggcacccct gaggagcttt acgaacggcc cgccaatctg tttgtcgcgg 1381381 gctttatcgg ctcgccggcc atgaatttct tccctgccag gctgaccgcg atcggactga 1381441 ccctgccgtt cggtgaggtg acgctggccc ccgaagtcca gggggtgatc gcagcgcacc 1381501 cgaaaccgga aaacgtcatc gtaggcgtgc ggccggagca tatccaggac gcagcattga 1381561 tcgacgcgta tcaacgcatc agggcgctga ccttccaggt gaaggtcaac ttggtcgagt 1381621 ctttaggcgc cgacaaatat ctgtatttca ctaccgagag cccggctgtg cactcggttc 1381681 agttggacga gttggcggag gtagaggggg agtcggcgtt acacgaaaat cagttcgtgg 1381741 caagggttcc cgccgagtcc aaggtagcca tcgggcagtc ggtcgagttg gctttcgata 1381801 ccgccagact tgccgtcttc gacgccgact ccggtgcgaa cctgaccatt ccgcaccgcg 1381861 cctaatggcg gcgagcggac acataagccc ccgccacgcc gaaggatttg gagctttttg 1381921 cgtctgttcg ccgacgcgaa gctagagcca gtttctgttg cggaagacgt ggtagaggaa 1381981 cagacagata aggaccatcc cgccgatcac tgtcgggtaa ccccacctgg agtccagctc 1382041 gggcatgaag tgaaagttca tgccatagat gcccgcgatc atggtgggga ccgcgatgat 1382101 acctgcccac gcggatatct tgcgcatgtc catgttttgc tgcatgccga cccgggcgag 1382161 cgcggcctgc accagcgagt tgagcatgtc gtcgtagctg gcgatctggt cggcggcctc 1382221 ggtctggtgg tcggcgacgt cgcgcaggta gcgccgcact tctttcgaaa tgaggtcttt 1382281 gctctcggtc tgcatgcgct ggaatgcggt cgatagcgga ttcacgcacc ggcgcaactc 1382341 gaccacttcc cgcttgagca gatagatcgg ttcgatgtcg agcttgcggc ccggcgcgaa 1382401 cgctacttcc tcgatgctgt cgatatcggt ctccatgaga ttggtcacct cgaggtagtg 1382461 gtcgaccacg tagtcggcga tcgcgtgcat caccgcatac ggtcccaacc gcaaatgttc 1382521 ggggtcggca tccatccgct tacgcacctc ggataacccg ccgtgttcgc cgtggcggac 1382581 ggtgaccacg aaatccttgc cgacgaagat catgatctcg ccggttttga cgatctcgcg 1382641 ggccagtacc accgattcgt gcgggacgta gttgacggtc ttgaggacga ggaacagcgt 1382701 ctcgtcgtag cgctccaact tgggtcgctg gtgcgcgtgc acggcgtcct caacggctaa 1382761 cgggtgcaac ccgaaaacgt ctgctacgtc ctgcatctgg ttttcatcgg gctcgtgcag 1382821 cccgatccag acgaacgcct cctgcccggt cagttcgatc tcgcgcacct cgcgcagcgc 1382881 ggcggcgtag gtgtacttgc cgggcagtcg ctggccgcag acgtagacac cgcagtcgac 1382941 caaggcttgg gccggtggct gggcaacggg gtgtgcgttc ggcggctggg gtcgcgcgac 1383001 cggtcgcagc acttcgggca atgcgtcaaa ccctgggaac acgtcaacct ccgatcgcgg 1383061 tggatctgat cgggcggtgc tccaggttac gcgtcccggt atggaacttg gtaaacgtca 1383121 gtcgtagctg tgggggttgg accccagatg tccgtccggt gccggtgcgc tagtttcaac 1383181 ccgaagccaa gtccgtaagg agcagaaccg acgtgagcgc tagtcctctc aaggtcgccg 1383241 ttaccggcgc cgccggccaa atcggctaca gcctgttgtt ccgcctggcc agcggctctt 1383301 tgctgggccc tgaccgtccg atcgagctgc ggctgctcga gatcgagccg gcactgcagg 1383361 cgctcgaggg tgtggtgatg gaactcgacg actgcgcttt cccgctgttg tccggggtgg 1383421 agatcggttc cgatccccag aagatcttcg atggtgtgag cctggccctg ctggtcggag 1383481 cccgcccccg gggcgcgggc atggagcgaa gtgacctgct ggaggccaac ggcgcgatct 1383541 tcaccgctca gggcaaagcc ctcaacgctg tcgccgcgga tgacgttcgc gtcggggtga 1383601 ccggcaaccc cgccaacacc aacgcgctga tcgcgatgac caatgcgccc gacattcccc 1383661 gcgagcggtt ctcggcgctc acccggctgg accacaatcg ggcgatctcg cagctggccg 1383721 ccaagaccgg cgcggcggtc accgacatca agaagatgac gatctggggc aatcactcgg 1383781 ccacccagta ccccgacctg ttccacgcgg aggtcgccgg aaagaacgcg gccgaagtgg 1383841 tcaacgacca ggcctggatc gaggatgaat tcatcccgac ggtcgccaag cgcggtgcgg 1383901 cgatcatcga tgcgcgcggc gcgtcgtcgg ccgcctcggc cgcgtcggca accatcgacg 1383961 ctgcccggga ctggttgctg gggacgccgg cggacgattg ggtctcgatg gccgtcgtct 1384021 ccgacgggtc ctacggggtg ccggagggct tgatctcctc gtttccggtc accaccaagg 1384081 gcggcaactg gacgatcgtg agcggcttgg agatcgacga gttctcccgc ggccggatcg 1384141 acaagtcaac cgccgagttg gctgacgagc gcagcgcggt caccgagctc ggcctgatct 1384201 gagcgcaggt cagccgcgca ctgagcggag cccgagtcat cttgacgtgt gtttgtccag 1384261 gcatcatgat gacctgtatg cgcaccacct tgacgctcga tgacgacgtc gtccggctgg 1384321 tcgaagacgc agtgcatcgc gaacgccgcc cgatgaagca ggtcatcaac gatgcgctgc 1384381 gcagagcgct ggcgccgccg gtgaaacggc aggagcagta tcggttggag ccgcatgagt 1384441 cggctgtgcg ttccgggttg gatctggccg gcttcaacaa gttggccgac gaactggagg 1384501 atgaggcgct gctggatgcc acgcgtcggg cccggtgatc atccctgaca tcaatctgct 1384561 gctctacgcg gtcatcaccg gattcccgca gcaccggcgc gcgcatgcgt ggtggcaaga 1384621 caccgtcaac ggccacaccc gtatcgggct gacgtatccg gcgttgttcg ggttcctacg 1384681 gatcgccacc agtgcccgcg tgctcgccgc gccactgcca accgcggatg cgatcgccta 1384741 tgtgcgcgag tggctttcgc agccgaacgt ggacctactc acggcgggtc cgcgccacct 1384801 ggacatcgcg ttgggcctgc tcgacaagct cggcacagcc agccacctaa ccaccgatgt 1384861 gcaactggcc gcctacggca tcgaatacga cgccgagatc cattccagtg acaccgactt 1384921 tgcccgattc gccgatctga agtggaccga cccgttgcgc gaataatgac tgccgctctg 1384981 ccctcgggtc agccgttcag gccgtgctga ccgttggcgc cggtagcgcc ttgagtaccg 1385041 ggatcgccgg gggcgccggg gttgaacccg gtcccgccgc cgccgcccgc gccgccgttg 1385101 ccgcccgcgc cgccgaggcc cccggccgcg ccggagccgg ggctgcccga ctgtccgaac 1385161 agtccgcccg caccgccggt cccgccgttt ccgccgacgc caccggcccc gccggccccg 1385221 ccgtcgccgc cgttgccgcc gtcaccgccg tcgccgtcct ggttggccat gccgtcggcg 1385281 ccgatcccgc cgttgccgcc gttgccgccg ctgccgcctt gagcgccgat gcccccgtcg 1385341 cccccgacgc cgccgtcgcc gccggcgccg cccgtgccga gcagtagccc gccgcgaccc 1385401 ccgctgcccc caaagccgcc ggcgccacca acgtcagccg aggcaccgac gccgccgtcg 1385461 ccgccggcac caccattgcc cccggtggag ttgcccccag gaggattatc ttgattggca 1385521 tttcctccgg cgccgccggc accaccggga gcgccgatac cgccgttccc gccggcgcca 1385581 ccgttgcccc ctatgctgtt gccagcattt gcaacattgg cgctgccacc cgctccgccc 1385641 agccccccgc cgccgccggc tccgccgttt ccgccggcgc cgccattgcc gccgacagcg 1385701 tcaccaaagc cgctttgagc ggcgccaccg ttaccgccgg cacctccgga ggcgaagttg 1385761 gcgccgtcgc cgccgtcgcc gccggcaccc ccggacacgt cggtctgccc aaggttggtt 1385821 ccatccccgc ctatgccgcc tgcaccaccg cccacgccgg ggttgactgc gttgctgccc 1385881 gagccggcgt cggtcccgtt gccatcgggt ccggtagtgc cgtcggcgcc atcggtcgcg 1385941 tgcgtgacct gatgggacac cgggttttgc ccgttggcgc cggccgctcc tgccgctccg 1386001 gctccacccg cccccccgtt gccccatagc ccggcgttgc cgccgtggcc tccgttgccg 1386061 ccattgcccc cgatctgggt ggccgcccca ccgttgccgc cgagcccacc gttgccgtat 1386121 agccacccgc cgttgccgcc ggcaccgccg ttcgcgccgg gcccgccggc tcccccggcg 1386181 ccaccgttgc cgatcaaccc ggccgcgccg ccgcgaccgc cgggctggcc tggggtgcta 1386241 ctcgagccgc cgttgccgcc gttgccgtac aacaagccac cgtcgccccc gttttgtccc 1386301 ggcccgccgt tggcgccatc gccgatcagc gggcgcccca gcaacagctg ggtgggccca 1386361 ttgaccacat ccagcaccgc ttgcatcggg gaggcattgg cggcctcggc cgccgcatac 1386421 gagccggccg ccgaactcag tgcccgcaca aactgctgat gaaatgccgc cgcctgtgcg 1386481 ctcagcgctt gataggcctg ggcgttcccg gaaaacagcg acgcgatggc cgctgatacc 1386541 tcgtcggcac ccgcggccag cagccccgtg gtcggggcct cggccgccct gttagctgcg 1386601 gccagcgccg agccgatgcc ctccaaatca gcggccgcgg ccaccaacac gtcctgcgct 1386661 gcaatcagat actccatcgc ggggcctctc tcgcggcgag attgaccaac gggtcggcac 1386721 gaagcgtgtc ccgttgcttg acggtgcatt gcgtgtttgc ctggatcccc gcgccgacgg 1386781 tgtggatcgg gcccagtacc ctcaagcccg tgccaactgc atctgtcgcg gtgactatcg 1386841 gctcagacac ttcggtgtga gaatcaccag gatcctcgcg ctgctgcttg ccgtcctgct 1386901 tgcagtgtct ggcgtggctg gctgctcggc cgacaccggc gatcgccacc cggagttggt 1386961 ggtcggatcc acgccggact ccgaggcgat gctgctggcc gccatctacg tcgcggcgct 1387021 gcggtcgtac ggttttgcgg cgcacgccga aaccgccgcc gacccggtgg cgaaactgga 1387081 ctcgggcgcg ttcaccgtcg tacccgcttt caccggtcag atgttgcaga ccttgcaacc 1387141 cgatgcgtcg gtgcgctcgg atgcccaggt ataccgcgcc atcgtctcgg cccttcccga 1387201 gggcatagcc gcaggcgact acaccaccgc cgcagaagac aaacccgcgt tggtggtgac 1387261 tcaatccacc gccaaggcct ggggcggcgg cgatctcagc gagctgccca gccactgccg 1387321 cgggttgttg gtcgggcgcg ttgccggcgc ccacacaccc gcggccgtgg gaccgtgccg 1387381 gctgcccgcc ccgcgtgagt ttcggaatga cgcaacaatg ttcgccgcgc tgcgggccgg 1387441 acagctggtc gcggcctgga ccaccaccgc cgaccccgac atccccgcgg acctgatcat 1387501 gctgaccgac ggcaagcccg cgctgatccg ggccgagaac atcgttccgc tgtatcgtcg 1387561 caacgcgctg accgagcggc aactgctggc cgtcaacgag gtcgccggcg tgctggacac 1387621 cacggccctg atcgggatgc gccgccaggt ggccgcgggg gccgacccgg cggcggtggc 1387681 cgccggctgg ctcgccgaac acccgctggg acgttgagcc gccacgagcg tccgggtcga 1387741 cgcgatgaca caccgcgtcg gccgaacaac cttcgggcgc gctttcctca ccagccgtca 1387801 gcgcgggcgg ggtatcaacc ggccggtgat gatcggaaag atccgctgat atccggaacc 1387861 ggtcagccgg accaccaggt ccagtacctt ggcgtcgaca cccaccaaca cccgggcctt 1387921 gttcttggcc acccccgtca ggatgatctg cgcggcccgc tgtgggctga gatgggccac 1387981 ccgcttatcg aacgtctcgg ccagctcggc ctggtcaagt ccctcggcgg cggtggcgtt 1388041 acgggcgatc gcggtcttga caccgccggg gtgcaccgtc gtcaccttca ccgggtgacc 1388101 cgccaacgcc atttcctggc gcagcgcctc ggtaaagccg cggacggcga acttggccga 1388161 gttgtaggcc gcctgacccg gcgccgaaaa caacccgaac acgctggaga tgttgatgac 1388221 gtggccgtcc ccggaggcga tcaaatgcgg caggaacgcc ttggtgccgt tgaccacacc 1388281 ccaaaaatcg acgtccatca cccgttcgat gtccttgaac tggctgacct cgatatcgcc 1388341 ggtaaaggcg atgccggcgt tgttgtagat ctggttcaca gtgccgaagt gctcgttgac 1388401 cgcatcggcg taggctagga aggcttcgcg ttcggttacg tcgagtcggt ccgtcttgac 1388461 cggcgtgctg atcgccttta gccggtgctc ggtgtctgcc aggccgtcgg tgtcgacgtc 1388521 gctgatggcc accttggcgc ccgagcgggc cagctcgatt gccagcgcct gcccgatgcc 1388581 cgatcccgcg ccggtgacaa cggcgacctt tccggcgaac ccctccatga cgtaccctcc 1388641 cttgtctcgg ctgccatcag gttagccggt acccggggta cggcttaacg tggccggcac 1388701 gggttcattc ggtagctggc actgcgacga gcgatgtgga tgatctcgac tcggtggtgg 1388761 ccgtcgtcga tggcgtagac gacgcggtaa tcaccgcggc gggctgagtg gaggccttca 1388821 aggtcattgc gcagcggctt gcccaaccta tgcgggttgt taagcagcgg tccgaaaaca 1388881 aactcgacac atgcggcggc gatcttttcg ggtaagcgtt gcaggtcgcg tgccgctgtc 1388941 gcggtgatcg ccacgtggta gggatggtcg tcgctcaccg cgcggtgtaa cggttgcgga 1389001 tctcgtcgtt gctcacgaag cgccctgcgg caacatcggc gaggccttca cgaatggcct 1389061 cgctggcgcc aggggtgcgt agcacctcca gcgtttcctc gatggacgcc aggtcatcgg 1389121 ccgagatcaa taccgccgcc ggatgaccgt gccgggttat cgtgatgcgc tcgtgtgtca 1389181 gctcaacttc ggcgacgtac tcagagaggc gattgcggac ttcgcccagt gggacaacag 1389241 ccataaccgc gattgtagct aaaagtatgg ctaaaccctg tacgccgagc atcggcttac 1389301 cgagccgaac gcctcgtcgc tgtttgatgt ctcctcgagc gttcggctga gcgaactcag 1389361 ccgaacgcct cgtcgaggat ctcctgctgt tcgacggcgt gcaccttcga cgagcctgac 1389421 gacggggctg acatcgcccg gcgcgagatt cgcttgatcc cggccaactt gtcaggcagc 1389481 agctcgggta gttcgagccc gaatcgcggc cacgcaccct ggttggccgg ttcctcttgg 1389541 acccagaaga actccttgac gttctcgtag cggtccagcg tttcacgcag tcgacgcctg 1389601 ggcagcgggg cgagctgttc aagccgcacg atcgcgaggt cattgcggtt gtccttggcc 1389661 ttgcgggcgg ccagctcgta atacagcttg ccactggtca gcaggatccg gctgaccttg 1389721 ttgcggtctc cgatgccgtc ctcataggtg ggttcctcca gcactgagcg gaacttgatc 1389781 tcggtgaagt ccttgatttc gctgacggcg gccttgtgac gcaacatcga cttgggcgtg 1389841 aacacgatca gcgggcgttg gatgccgtcc agggcatgcc ggcgtagcag gtggaagtag 1389901 ttcgacggag tcgacggcat cgcgatggtc atcgaacctt ccgcccacaa ctgcaagaag 1389961 cgttcgatcc gggcagaagt gtggtcgggt ccctgcccct cgtgcccgtg cggtaacagc 1390021 agcacgacgt tggacaattg gccccacttg gcctcaccgg agctgatgaa ctcgtcgatg 1390081 atcgactgcg cgccgttgac gaagtcgccg aactgcgcct cccagagcac cacggcgtcc 1390141 ggattgccca cagtgtagcc gtactcgaag ccgacggcgg cgtactccga cagtggcgag 1390201 tcgtagacca ggaactttcc gccggtcggg ctgccgtcgg agttggtcgc cagcagctgc 1390261 agtggtgtga actcctcgcc agtgtggcgg tcgatgagaa ccgaatgccg ctgggagaag 1390321 gtgccgcggc ggctgtcctg ccccgacaag cgcaccagct tgccttcggc caccagcgag 1390381 cccagcgcca gcagctcgcc aaaggcccag tcgatcttgc cttcataggc catctcccgg 1390441 cgcttctcca gcaccggttg gactcgcggg tgcgcggtga agccgttcgg caaggcgagg 1390501 aacgcatcgc cgatccgggc cagcagcgac ttgtccaccg cagtggccag ccccgcggga 1390561 atcatctggt cggactcgac cgactcgctc ggctgcacac cgtgcttctc cagctcgcgc 1390621 acttcgttga acacccgttc cagctggccc tggtagtcgc gcagcgcgtc ctcggcctcc 1390681 ttcatcgaga tgtcgccacg tccgatcagg gcttcggtgt agcttttgcg ggccccgcgc 1390741 ttggtgtcga cgacgtcgta cacgtagggg ttggtcatcg acgggtcgtc accctcgttg 1390801 tgcccgcggc ggcggtagca cagcatgtcg atgacgacgt ccttcttgaa ccgttgtcgg 1390861 aagtccaccg ccaaccgcgc cacccagaca cacgcctccg ggtcgtcgcc gttgacgtga 1390921 aagatcggtg ccccgatcat ctttgcgacg tcggtgcagt actcgctgga cctggaatac 1390981 tcgggcgcgg tggtgaagcc gatctggttg ttgacgatga tgtggatggt gccgccgacg 1391041 cggtagcccg gcagattcgc caggttcagc gtctcggcga ccacaccctg accggcgaac 1391101 gcggcatcgc catgcaacat cagcggcacc accgagaacg cccgttggcc gtcgctgtcg 1391161 atgcttccgt ggtcgagcag atcctgcttg gcccgcacca atccctccag caccgggtcg 1391221 acggcctcca gatgcgacgg gttggcggtc agcgacacct gaatgtcgtt gtcgccgaac 1391281 atctgcaggt acagcccggt ggcgcccagg tggtacttga cgtcaccgga gccgtgcgcc 1391341 tgcgacggat tcaggttgcc ctcgaactcg gtgaagatct gcgagtacgg cttgccgacg 1391401 atgttggcca gcacgttgag ccggccccgg tgcggcatcc cgatgaccac ctcgtcgagg 1391461 ccgtgctcag cgcactggtc gatcgccgcg tccatcatcg ggatcacgct ttcggcgcct 1391521 tccagcgaga accgcttctg gccgacgtac ttggtctgta ggaacgtttc aaaggcctcg 1391581 gcggcgttga gcttgctgag gatgtatttc tgttgggcca cagtgggttt gacgtgcttg 1391641 gtctcgaccc gttgttcgag ccactccttt tgttcggggt cgaggatatg ggcgtactcc 1391701 acgccgatgt ggcggcagta ggcatcgcgc agcaagccca gcacgtcgcg cagtttcttg 1391761 tactgcgcac cggcaaagcc gtcgaccttg aacacccgat cgagatccca cagcgtcagg 1391821 ccgtgggtca gcacttcgag gtcggggtga ctgcggaacc gagctttgtc caaccgcagc 1391881 gggtcggtat cggccatcag atggccgcgg ttgcggtagg ccgcgatcaa gttcatgacg 1391941 cgagcgttct tgtcgacgat cgagtcgggg ttgtcggtgc tccagcgcac cggcagatat 1392001 gggatgctca gttcgcggaa gacctcgtcc cagaagccat ccgagagcag caactcgtgg 1392061 atggtgcgca ggaagtcgcc cgattccgcg ccctggatga tgcggtggtc gtaggtggag 1392121 gtcaaagtga tcaatttgcc gatgcccagc tcggcgatgc gttcctcgct ggcgccttga 1392181 aactcggcgg ggtattccat ggcgcccacg ccgatgatgg cgccctggcc gggcatcagc 1392241 cgcggcaccg aatgcacggt gccgatggtt ccgggattgg tcagcgaaat cgtcacgccg 1392301 gcaaagtctt cagtggtcag cttgccgtcg cgggcccggc gtacgatgtc ttcgtaggcc 1392361 gtgacgaact gcgcgaatcg catggtctcg caccgcttga tgccggccac caccagggaa 1392421 cgcttcccgt ccttgccttg caggtcgatc gccaggccga gattggtgtg cgccggcgtg 1392481 accgcggtgg gcttgccgtc gacttcggtg tagtgccggt tcatgttcgg gaatttcttc 1392541 accgcctgca ccagggcgta gcccagcaaa tgcgtgaacg agatcttgcc gccgcgggtc 1392601 cgcttcaact ggttgttgat gacgatccgg ttgtcgatca gtagcttggc cgggaccgcc 1392661 cggacgctgg tcgccgtcgg cacctccaac gacgcggaca tgttcttgac gacggccgcg 1392721 gcggcgccgc gcagcaccgc tacctcgtca ccttcggctg gcgggggaac ggcagttttg 1392781 gcggccagtg cggcgaccac gccgttgccc gcggccgcgg tgtcggccgg cttggggggt 1392841 gcctgcgggg cggccgcagc ggcccgctcg gcaacgagtg gcgaggtaac ccgggttggt 1392901 tcggcagctg gttgggaggt gggttcgggg ctgtagtcaa ccaggaactc gtgccagctg 1392961 ggatcgaccg aggaggggtc gtcgcggaac ttgcggtaca tctcttcgac cagccattcg 1393021 ttttgcccga atggtgaact tatgttggcc acggccgctg ttcgcctcga ttcttctgct 1393081 agttgaagtc ctgcaagcgc attgcgcggc gcctgctggc agtcggtgaa cggtctgccc 1393141 cataaaggct aacgctttgc cagcgattcg ccagagagac cgggcaacgc gcgctagctg 1393201 gcatcccgaa cggtcggtag cacgtgcagg gtgaccggcc agcgcgccgg cggggtgccg 1393261 aatgccgatc gcgcattacg gacgagcttc ttgccgacca gccgattgcc gatggcgccg 1393321 atgatcgcgc cgatacccat cggcaccagc ttgccaaaca tgagcgcgcc gcgtttcagc 1393381 gcgaatcgtt tgacgacgta tttgagcatt cgcgagttca acgacgatat cgccggcagc 1393441 ggcagcgagg ccatggtctc cgacacccag ccgccgctgg ttcggcccgg accgagcaga 1393501 tcggccaccg cagtagtgtt gtcgccgacc agcaccgcca agaccagggc acggcgccgt 1393561 tctcggtggt cgaggggaat ggcgtgtacc gaggccagcg ccagcacgaa cagcgcggtg 1393621 gcctcaagga acacgacaac ctctccggcc gcggcgaacc atgcggccag ggtgccgatc 1393681 cccggtaagg tcgcggccgc acctaccgcc gctccactgg ccgtcaccac cgacaagaag 1393741 cgtttctcga gcttggctac gatcttggcg gggctggccc ccgggtgggc gcgacgcagg 1393801 cgggccacat acgcctgtgc tgccgggccc tgtatccgcg aactccgttc gatgacctgc 1393861 gccaatgccc gcgtggacac tttgggccgc ccgccggtcc cggccagctg cgggtccggc 1393921 tcagctgcat ttgcggatcg attgtcgaac cttttccaag acctgattcg tcgagcgctc 1393981 atcttctctc ctgcgaatgg cgtcccctca ggctaatgcc ggttcaacga tccgagcatg 1394041 tgtttcggta gcggcgcggt tcaccgctcg aagcggaata atgcggcgtg gacattggtg 1394101 acgatacggg ttgccctggt gcatgccgtg acgcccgtga cccaatgcca ccgctagcaa 1394161 gccaaacgag gtgcgtgtat gactacggcg atacgccggg cggccgggag cagctacttc 1394221 cgaaacccct ggcctgcgct gtgggcgatg atggttggct tcttcatgat catgctcgac 1394281 tccaccgtcg tagccatcgc gaatccgacc atcatggccc agctacgcat cggttacgcc 1394341 accgtggttt gggtgaccag cgcctatctg ctggcctacg cggtgccaat gctggtggcc 1394401 ggccggcttg gcgaccggtt cggcccgaag aatctctacc tgattggcct gggggtattc 1394461 accgttgcgt cgctggggtg cggtctgtcg agcggtgccg gcatgctgat tgccgctcga 1394521 gtggtgcaag gcgtcggcgc cggattgctt accccgcaga cgctgtcgac gataacgcgg 1394581 atcttcccgg ctcatcgccg cggtgtcgcg ctgggcgcat ggggcaccgt cgccagtgtc 1394641 gccagcctgg tgggaccgtt ggccggcggc gcgctggtcg acagcatggg gtgggagtgg 1394701 attttcttcg tcaacgttcc cgtcggcgtc atcggcctga tcctggcggc ctatctgatt 1394761 ccggcactac cccaccaccc gcatcggttc gattggttcg gcgtcggatt gtctggtgcg 1394821 ggaatgtttc tgattgtctt cggactacag cagggccagt ccgccaattg gcagccttgg 1394881 atttgggcgg tgatcgtcgg cggtatcggg tttatgtcgc tgttcgttta ctggcaggcg 1394941 cggaacgccc gcgagccgct gatcccactg gaggtcttca acgaccggaa cttcagcttg 1395001 tccaacctca ggatagcgat catcgccttc gcggggacgg ggatgatgct gccggtgacg 1395061 ttttatgcgc aggcggtgtg tgggttgtcg ccgacccaca cggccgtgct gttcgcgccg 1395121 acggcgatcg tcggtggcgt gctggccccg ttcgtcggca tgatcattga caggtcccat 1395181 ccgttgtgcg tactgggttt cggcttctcg gtgctggcga tcgcaatgac atggctctta 1395241 tgcgagatgg ctccgggcac gcccatctgg cggctggtgt tgccgttcat cgcgttaggc 1395301 gttgctgggg cgttcgtgtg gtcgccgctg accgtcaccg cgacccgcaa tctacggccg 1395361 cacctggccg gtgcgagctc aggtgtgttc aacgccgtcc ggcagctggg ggctgtgctg 1395421 gggagcgcga gcatggccgc gttcatgacg tcgcgcatcg ccgccgagat gcccggtggt 1395481 gtggacgccc ttaccggtcc cgccgggcag gacgctaccg tgttgcagct gcccgagttc 1395541 gtgcgcgaac ccttcgcggc cgcgatgtcg caatcgatgc tgttgcccgc cttcgtcgcc 1395601 ctattcggga tcgttgccgc gttgttcctg gttgacttca ccggtgctgc ggttgccaaa 1395661 gagccgttgc ccgaatccga tggcgacgct gacgacgacg actatgtcga gtacatcctt 1395721 cgtcgggaac cggaagagga ttgcgacacc cagccgctgc gggcgtcgcg cccggcagcg 1395781 gccgcagcgt cacgcagcgg tgctgggggt ccgctggcgg tcagctggtc gacgtcagcc 1395841 caaggaatgc ccccaggtcc accaggccgt cgggcgtggc aggcagatac tgagtcaaca 1395901 gctccgagcg cactataacc gcggcatact gtgcccgact gaccgcgacg ttgagccgat 1395961 tccggttgag caggaacgag attccgcgtg gaacatcgtc ggcggacgag gccgtcatcg 1396021 agatgaagac caccggtgcc tgcccgccct ggaatttgtc gacggtgcct acccgtactc 1396081 cgtcagcccc gccaagtccg gcagacgcca accgccgacg gaccagcgcc acctgggcgt 1396141 tgtacggcgc gagcacaagc acatcggaag cggccagtgg ccgggtgccg tgctcgtcgg 1396201 tccacggcga gccgagcagc tgccgcagct cggcgaggat cgcctcggcc tcttcggggc 1396261 tttcgatcga attgcccttg tggtgcacgc cacgcgtatg cacccccggg ggatacccgt 1396321 cgaggcggcg cacggcggtg cgctcggtgt gggaacacag cctgccctcg taggacaacg 1396381 ccgacacggc cgcgcacacc gccgggtgca tccggtacga gcggtctaag aagtagccgc 1396441 gttcgtcggg cagcgtgtgt tgcccatcta ccagccacga caatgcggag gtgtcgacgg 1396501 gttcgggatg tgtgccctga cttacctgag gcagttgctg tggatcgcca agcagcaaca 1396561 ggtttgtggc cgcgggcgcc acggcgatgg tattggccag gcagaactgg ccagcctcgt 1396621 cgatcaccag cagatccagg ctggctttcg gcacccgatt gccgttggcg aagtcccacg 1396681 ccgtgccgcc gatcacgcat ccggcggtgt cgcggatgaa ttctgtgtac tggctcccgt 1396741 cgatcgactg ccagcgccca gcggtgtggt cgtgcggctt tttggcgacc tgccccgggt 1396801 ccaggccagc gctgatcaca ccttccaaca ggttctccac cgtggcgtgc gactgggcga 1396861 caacgccaat acgccaggca tgctcggtga ccaactccgc gatcacccgg gccgcggtgt 1396921 atgtcttgcc ggtccccgga gggccgtgca ccgccaggta tgacgagtcc aagtccagcg 1396981 ccgccgcggc gatatcggtg actgggtcac tgctgcgggg caatgcggcg ccgctgcgcg 1397041 tgcgaggagg gcgacgcagc agcacgtcca ttagcgcggt gctgggcagt tgcggcgatc 1397101 cggaagccac ggcagcggcc gtcgattcga tcgattcccg cagggccgtc gtcggcaccg 1397161 gcggcccggg agcgagcgcg aacgggagct gctgaaatgt attgccgtca ctgccggttc 1397221 gttcgacgat gaccacctcg gtgggcacag tggggtcgtc ggtctcaacc actgcggcgg 1397281 ggcccgcggc tcggcgatca ggattgtcgg tcatgcccgg cggcgccggg ggttcgtaga 1397341 gggcaaacac attcccgttg aggtccccac gtgccagttc accggtaagc cggacccgcc 1397401 gctgcggctt gcgcgcgcga ggcggcatat gccagtcgac ggtgaccgaa gcctcgctgg 1397461 caaggaagac gtccgtgctg tccgaccatt cgtcgacggg gtagttgagc cggtcgaagt 1397521 gcgcccacca gaacggcttg tcctcgcggc gatgatagcc gcgggcagcg gccagcaagg 1397581 cgaccgctgt ctgttccggc gtgcgctcgc cggcggcggc atcgccggtg aacttggaca 1397641 gtaccgacgc cagcgagtca ccgtcgtcga tagggtcggc gtccggaact ggttgagcgc 1397701 caatgggtgt gacgccggct tcccaggcgc gcatgagcag ccagtcacgc agcgcgcggg 1397761 tggaccggca gtcgtagtgg ttgtagcctt cgatctcttt gagcacggtt gccgcctcat 1397821 cgatgcggcc ggccgcgcgc agttcgcagt accgggcata ggagttgatc gagtcggcgg 1397881 cggtggtgac gtcgccggag cgtggctgcg tcccgaggta cagcggctcc agcgccttca 1397941 agctgaacga gtcggtgccc acccgaatgc tcttgcgtac caacgggtat aagtccacca 1398001 ggactccgtt gcgcagcaag tcgtcgacgt cgtcctcgcc gatgccgtag cgtccgacca 1398061 gccgcagcag cgcggtcttc tcgtagggcg cgtagtggta gatgtgcatg ttggggtggc 1398121 gccggcgccg tctggcgact atcgccagga aatcggtcag cgcctggcgt tcggctgtcc 1398181 ggtcatgcgc ccacaatggt cggaatactc ccgcccgtcc ggcttccagc accccgaaca 1398241 ggtattccag gccccactgt ttgccgtcgg cggtccacag cgggtcaccc tcgaagtcga 1398301 agaacaggtc gccggggttt ggctccggca gcagtgtcag cggccgcggg tcgacgatct 1398361 cgaactgtgg tgctcccgta tcgcgttggc ggatttgcag tttggcctgt gcggtcagct 1398421 tgcccagcgc gttcgtggtc aggccgggaa ccggcgcggt gtgatctgcc agttcggcga 1398481 tcgtggtgat gccggcctca aggagcttgt cgcgctggcg gactcgcatc cctccgacca 1398541 gtagcagatc gtcgctggcg cgcagccgct cggtgcactg cggacagcgg aagcacgcct 1398601 gcacgcgttc gtcgtcccag cgcaccgcgg tgcccgcggt gtagtggccg tccagcaatc 1398661 gctgtaaaag cgcacgctgg gaccggtaga ccgggatgag ctcgccgacg cggtagcgca 1398721 cgatcgtgcc gtcgccgagt tcgagctcgg cgtcggcagc caccggaacg cccgagtgaa 1398781 ccagcgcatc ggcataggcc gccagctgta gcagcgcggt cacggttggc gagcgggcga 1398841 gcttggtgtc ggcgacccgg taccggtgac cgtcgcggat caggaagtcg gcgaacccga 1398901 cgaagcggcc gtcgaacatg gcggcctgat acaccaccgg ggcgtggttg gcgatggcac 1398961 gtcgcgtcgc gtcggcggct gccgccagcc cggcgggcgt gtaggccggc cggccaatga 1399021 tagccaccgc gtcgccgaac tcgtggcgca gttggtcgag tcggcgtcct tcatgcgcgc 1399081 taccgagaac ggcggctcgc gccatcagtt cgtcgtcaac tgcgacggcc ggtccccggc 1399141 ctagtttcgc gtcgaattca cggagcagtg cgtactggca ccgggcggcg gctgcgagat 1399201 ccgaagcact gtagacgatg ctgtcaccgg tgacgaacac agcagcaact cctcggtgag 1399261 acaacggaca ggcaaactgg gctgcacccg tcggcttaac cgccggtggt gttgccgatc 1399321 agctcgacgc cgccgccgtt ccagcggaac ttgacaacgt tgttcaaccc gatgccgctg 1399381 gcatacgtca atgccaccgt gtctcccgtg cactgcgagg tgtcgatgcc ggtgaaccca 1399441 taggtatcgg gcaccccctg cggtatgtac ttgccgaggt ggaacatcac cgcgcgggtg 1399501 gtcggattgc cggcgttcgt gttggccttg atgaccaccg ccgacagctg ggcacactcg 1399561 ttgtagttgc cggccagcgg ttctgggttc cagggctgct cactgcgcgg atcgcgagga 1399621 agttcggaga cgactttggc gattgtgggc gaggcgaggt tcaccgcaca cgggtcgacc 1399681 ggcgcggcgc tgtggttgct gggtggggca gctgtcgcgg acggcgggct cggttcgctg 1399741 ctcggcgggg ccgggtgagc agttgacagg gatggggtgg cctccggcgt cttagcgacc 1399801 gtggagtcgc ccgaaccgca accggtcaac gtcgcggcga ccaatgcagc gaccacgcca 1399861 acacgcggcg tggtggggca gggtggtgac cacacaccgg gcaccgtacc gccatcgggc 1399921 ccgcgggtgc ggtaggcgtg gccgggtcac cactaaactt gacggcctga tggccttccc 1399981 ggaatattcg cctgcggcgt ccgctgcgac gtttgctgac ctgcagattc atccccgcgt 1400041 cttgcgggcg atcggcgacg tcggttacga gtcaccgacg gctatccagg cggctacgat 1400101 cccggcgttg atggcaggct ccgacgtggt ggggctggcg cagaccggca ccggcaagac 1400161 ggcggcattt gcgattccga tgctgtccaa gatcgacatc accagcaagg tgccccaggc 1400221 gctggtgctg gtgcccaccc gggagctggc tctgcaggtg gccgaggcgt tcggccgcta 1400281 cggtgcctat ctgtcgcaac tcaacgtgct gccgatctac ggcggatcgt cgtatgccgt 1400341 gcaactggcc ggattgagac gcggcgcgca ggtggtggtt ggcacccccg gtcgtatgat 1400401 agaccatctc gaacgggcga ccttggacct gtcgcgggtg gactttctag tgctcgatga 1400461 ggccgatgag atgctgacca tgggtttcgc cgacgacgtt gagcgcattc tgtccgagac 1400521 ccccgaatac aagcaggtcg ccctgttttc cgcgaccatg ccgccggcga tccgcaaact 1400581 cagcgccaag tatctgcacg atccgttcga agtcacttgt aaggcgaaaa ccgctgtggc 1400641 cgagaatatt tcgcagagct acattcaggt agcacggaag atggacgcgc tcaccagagt 1400701 gctcgaagtc gagccgttcg aggcgatgat cgtctttgtc cgcaccaagc aggcgaccga 1400761 ggagattgcc gaaaagctgc gtgcccgagg gttttccgcg gctgccatca gcggtgacgt 1400821 cccgcaggcg cagcgggagc ggaccatcac ggcgctgcgg gacggcgaca tcgatatcct 1400881 ggtcgccacc gatgtggcgg cgcgcggact cgacgtggag cggatatcac acgtgcttaa 1400941 ctacgacatc ccgcacgaca ccgagtccta cgtacaccgg atcgggcgca ccggcagggc 1401001 cgggcgttcg ggagccgcgc tgatattcgt ctcgccacgg gagcttcacc tgctcaaggc 1401061 gatcgaaaag gctacgcggc aaacgcttac cgaggcgcaa ttgcccaccg tcgaggatgt 1401121 caacacccag cgggtggcca agttcgccga ttccatcacc aatgcgctgg gcggtccggg 1401181 aatcgagctg ttccgccgac tggtcgagga gtatgaacgc gagcatgatg tcccgatggc 1401241 tgacatcgcc gcggcactgg ccgtgcagtg ccgcggcggt gaggcattcc tgatggcacc 1401301 cgacccgccg ctttcgcggc gcaaccgcga ccagcgtcgg gaccgtccgc aaaggcccaa 1401361 gcgtagaccg gacttgacca cctaccgcgt cgccgtcggc aagcggcaca agatcggtcc 1401421 aggcgccatc gtcggcgcca tcgccaatga gggtgggctg caccgcagcg acttcggtca 1401481 gatccgtatc gggccagact tctcgctagt agaattgccg gcgaagctgc cccgcgcgac 1401541 gctcaaaaag cttgcacaga cccgtatctc gggtgtgctg atcgaccttc ggccataccg 1401601 gccgcccgac gcggcgcgcc ggcataatgg cggcaaacca cggcggaaac acgtcggatg 1401661 accctgccca aggaaagagc cgcccagggc ggactcgagc ggatcgccca cgtggaccgg 1401721 gtggcgtcgt tgaccgggat ccgtgctgtt gccgcattgc tggtcgtcgg cactcatgcg 1401781 gcctacacca ccggcaagta cacccacggc tattggggcc tgatgtcgtc ccgcatggag 1401841 atcggcgttc cgatcttttt cgtgctgtcg gggttcctgc tattccggcc atgggttaag 1401901 tccgccgcta ccggcggccc cccgccgtcg ttgagccgct atgcgtggca ccgggtccgg 1401961 cggatcatgc ccgcctacac cgtcaccgtt ctgttggcct acctcgtcta tcacttccgc 1402021 acggcggggc ccaaccccgg gcacacctgg gtcgggctgt tccgcaacct caccttgacg 1402081 cagatctata ccgacggcta tctgggtgcg ttcctgcatc agggtctgac ccaaatgtgg 1402141 agcctcgcgg tggaggttgc cttctacctg gcgttgccgg cgttggcata cctactgttg 1402201 gtgctcgtct gccggcggcg atggcagccc aggttgctgt tggccaccat ggcggggctg 1402261 acgatgatca gcccggcatg gttgatcctg gtgcacaaca cgcactggat gcccgacggc 1402321 gctcggctgt ggctacccac ctatctggct tggttcgtcg gcggcatgat gctggccgtg 1402381 ctggcggcga tgggcgtgcg ctgttatgca ttcgtggcca taccgttggc ggtcatctgc 1402441 tacttcatcg tctccactcc gatcgcgggc gcgcccacga cgtcgcccac agcgctggcc 1402501 gaggcgctgg tcaagaccgc cttctatgcc gtgatcgccg tgctggcggt ggcaccgctg 1402561 gccttgggtg accaggggtg gtatgcccag ttgctggcca gccggccgat ggtgtttctt 1402621 ggtgagatct cctacgagat cttcctgatc catctggtga ccatggagat cgccatggtg 1402681 gacgtgctcg ggtatcgggt ttacaccagt tcgatggtga acctttgtct cgtgacgctg 1402741 gtgctgacga tcccattggc gtggttgttg caccgtttca ctcgggtcca gggtgaccgg 1402801 ccttcctagc ggcggcagaa gcaggtgtca cgatcgggac gacgaactcc gcgatcatcg 1402861 ctcgttcgtc ggcttcgtca cggccgggga acatcagcag cgatgtgagc atccggacca 1402921 cccagcgggc gcggcgttcg acggtggtcg gatcgtcggg acctagtgag ttgaggaatg 1402981 ccgcggccag ggccgcgatc acctcggacc gtccggccat ctcgccgccg atcggtgggc 1403041 gggtggtggt aaaccacgcg gccaacgcgg ggttgtcgcg gaccatccgc aacgtcgtgg 1403101 tgatgctcac cagcagccgt tcggcaggtt cgacgacatc ggcgatcttc accatgatct 1403161 cgcggccgag ccggcgggtc tcgcggtgca cgtacgcggt tcgcagcgcc tcgcggctgt 1403221 cgaagtaccg atacagtgtt gcgcgcgaac agcctgcggc cttggcgatc tcgttcatgc 1403281 cgatcgacgc cgggtcacgc tgcgtaaaga gtcgctcggc ggcgtcgagt atccgatctg 1403341 cggctaactc ggtccgacgc gcggacagcc agtcggtacc cgccatcagg atgtcactcg 1403401 gaacggcacc gacagcggac gccggacata actgccgccg gaccacacga tgcgtgactc 1403461 ggccacctcg aagtccgggc accgggccag cagttcggtc agcgccaccc ggcattgcat 1403521 ccgggccgcg gccgcaccca ggcagtggtg ggcgccgtgg ctgaaggtca agatgttgcg 1403581 cgggcaccga gtgacatcga gttcggctgc gtccgggccg tattggcgtt cgtcacggtt 1403641 ggccgagccg tacagcagca gcacccggcg accggccggg atggtggtgt caccgatcgt 1403701 gacgtcgcgc gtggttgtgc gcgccagccc ctgcaccggc gaggtgagcc gcagcagctc 1403761 ctcgaccgcg tcggggatgc cctctgggtc atccagcagc agccggcgct ggtcgggccg 1403821 ccggtgcagc aacggcatcg aaccgcctag catgccggtg acggtgtcgt tgccgccggt 1403881 gaccatggtg aacgtgaacg ccagtatgga cagtgtgccg gcggtgtcgc cgtcggcgcc 1403941 gaccccggcg gctaccaggt gggagatggc gtcgtcggcg ggctcggtgc ggcgtcgctc 1404001 gatcagcccg gtgaagtagg ccatcatcga gccgaccgcg tccagtgcgc cggtggtggc 1404061 gccgtcaacc gcgttcgccg ccacgatggc ctgggtccac ccgtcgaatt gcgtccaatc 1404121 ctcttcggga acaccgagat agtgcgccac caccatcgac gggagcggtt tgaatagttc 1404181 ggtgacaatg tcgccgccac cgttggcgcg cagcttttcg agccgctcaa cgacgaactt 1404241 gcgcaccgtg ggctcgacgg tttcgacctg tcgtggcgtg aagccgcgcg acaccagctt 1404301 gcgaaactcg gtgtggaccg gcggatcctg catcaccatg ggcggggtgt cgtgcagtcc 1404361 aatcatttcc agctcgccgt agttaacggt caagccttgc gccgacgaga acgtctgatg 1404421 gtcccgcgct gccgaccaga cgtcggcgtg ccgggacagc acgtagtagt cgtactcggg 1404481 acgctgcggc gggacgacgt ggtgcaccgg gtcgtggtcg cgcaacgcgc ggtacatcgg 1404541 ccacggattc ggccaggttt cggcggtggc gagctggaat tcgtgagaca ttactgatgt 1404601 catgtcttat gtctaagaca ttccatcggt aatatcaatc ggcgattgtg aatctggtga 1404661 cgcgacacgc cgaggacgcg tcgtgcggtt cacactcggc gggacgtcgc gacggatcag 1404721 atcgccgagc cgggattgag gatgccctgg gggtccagcg cttgcttgat gcgctggttg 1404781 agggccagga cgtcgggccc gagatagccg gccaaccacg gccgtttcaa ccggcccacg 1404841 ccgtgttcgc cggtgatcgt gccgcccagg ccgacggcca ggtccatgat ttcgccgtac 1404901 gcgaggtggg cgcgctctag catcgcggca tctgcggggt cgtacaccag caacgggtgg 1404961 gtattgccgt ccccggcgtg ggcgatcacc gagatcatca gattccgctc ctcggcgatg 1405021 cgcgcaatcc cggtgaccag ttcgcccagt gcgggcagcg gtaccccgac gtcctcgagc 1405081 agcaacgccc ccttgctctc gaccgccgga atggcgaacc gccgggccgc aatgaacgcc 1405141 tcgccctcat ccgggtcgtc ggtcgaaaac acgtctatcg caccgttttc ggcgaacacg 1405201 gcggccatca cggcggcgtc ttcggtggcc gcgcggccac gttcatcaga accagccacc 1405261 agcatggccg ccgcatcgcg gtccaggtcc atccgcaagg tgtcctcgac ggcgttgatc 1405321 gccaccgaat ccatgaactc cagcatcgcg gggcgaagtc ggccggtaac cccgagcacc 1405381 gcatcgaccg ccgcctgcac cgagccgaag ctggccacca cgatgctcga tgcattctgt 1405441 gcgggcagca gtcgcaacgt cacctccgtg atgacgccca gcgtgccttc gctgccgacg 1405501 aacagtttgg tcagggaaag cccggcgacg tccttgagcc gtgggccgcc cagccggacc 1405561 gcggtgccgt tggccagcac aacctgcatg cccagtacgt agtcgcctgt gacgccgtac 1405621 ttcacgcagc acagcccgcc ggcgttggtg gcgatgttgc cgccgatgct gcagatctcg 1405681 aacgacgacg gatccggggg ataccacagg ccgtgttcgg cggcggcctc cttcacctcg 1405741 gcgttgtaca ggccgggctg gcacactgcg gtgcgggtga ccgggtcgac ggtgatgtcg 1405801 cgcatctttt cggtggacag cacgatcccg ccatccaggg cggtcgcccc gcccgaaagg 1405861 ccgctaccgg ctcctcgggt caccacgggc acctggttcg cactggccca acgcagcacc 1405921 gtctgcacct cttcggtgcg ccgtggccgg atgattgcca gcggtttgcc ggccgaaggg 1405981 tcaaaggccc ggtcttgccg gtagccgtcg gtgacggcgg ggtcggtgac caccatcccc 1406041 tcgggcagct cggccatcag gccagccagc acatcggtat tcactgagcc gatcctacgg 1406101 gccgatcgat gtccgcttgg ggcgccagat ccagttcgcg cagcgcgggc agccggatcg 1406161 cgaccagccc ggtgcacacg atgggcagtg ccaacgcgag aaacgtggca tgcagtccag 1406221 cggcgtcggt cagtggaccg gccagcaaca gacccaacgg gccggcggcg taggccagcg 1406281 acgtcatcac cccgactacc cggccgcgca gatgctgtgc tgcccgcgtc tgtatcacgt 1406341 agttatagat cggctggatg ggtccgtaca ccaggccgac caccgcgcac aacaccatga 1406401 tgaccggcag tggcggcagg aacgcgatga ccatcgatgc caaacccagg gtaagaaccg 1406461 cggtcgacat ggtcacgcga cggggaacgc ggatagccaa cacggcatac cccagcgctc 1406521 ccaccaggcc gccgccggcg atcgccatca acgcccaacc cagctgcacc ggttgctggt 1406581 ggtcggtgaa gtatttcggg aacagcacgc tctccatcgg cagatacagc gcggtgacgg 1406641 tcaggtcaat catcccgagg gtgcgcaata cccgcaggtt ccagacgaag cgcagcccct 1406701 cggcgatccc ggataccaac ccttggggcc gcgaggtgtg gtgcggcttg ccggcaccct 1406761 cgagttgcag ggcggcaatc gcgaggatgg acaacccgaa tgccgtcgcg gtaatccaca 1406821 ttgtggtgat gccgccaacc gtcgcgatca tcaagccacc gatggccggg ccgacaataa 1406881 aggccaggtt gaggatcgcc tcgtaggcgc cgttgatgcg gtccaacgac cagcctgccc 1406941 gagcggcggc ctcgggcagc atcgagtcac gagccgtcat gcctgccggg ccgaaggcgg 1407001 ccgccagggc ggccaatacg gccagcacca gcacgttgac cgcgtcgccg ccgtaccccc 1407061 acgccaccag ggggacgccg gccaccgccg cacccgacag cgcatcggcc accatcgaca 1407121 cccggcgacg cccgaagtag tcgaccgcgg tgccggcgac cagcgtggcg aacaacagcg 1407181 gcagcatggt cgcactggcc acgatcgagg cctgcccagc gctgccctcg cgctgcaaca 1407241 ccagccacgg aaacgcgact atcgagacgc catcacccgc ggccgccatc agcgttgcga 1407301 acaggatcag gaatgccggg ccgcggttgc tgtttctcat gaatatcgcg gctgaatcta 1407361 gcgccaaacc ggtatggggg ccaccgaatt tctgcgctgc cgcagcccgg atgcaggatg 1407421 ttcgtgtgct catgcatccg aagaccggcc gggcgttcag gtccccggta gagcccggtt 1407481 ccggctggcc aggtgatccg gcgacaccgc agaccccggt ggctgccgat gccgcgcagg 1407541 tgtcagcgct ggccgggggc gctggctcga tctgcgaact caacgcgctg atcagcgtgt 1407601 gccgggcgtg tccccggctg gtcagttggc gtgaggaggt cgccgtcgtc aagcgccgtg 1407661 ccttcgccga ccagccctac tgggggcgcc cggtgccggg gtgggggtcg aagcggccgc 1407721 ggttgctgat cctcgggctg gcgcccgccg cgcacggggc caaccggacc ggacgaatgt 1407781 tcaccggcga tcggtcggga gatcagcttt atgcagcact gcatagggcc ggcctggtga 1407841 actcaccggt cagcgtcgac gccgcggacg ggctgcgggc caaccggatt cggatcaccg 1407901 caccggtgcg gtgtgcgccc ccgggcaact cgccgacacc ggccgagcgg ctgacatgct 1407961 caccctggct aaatgcggaa tggcggctgg tgtccgatca catccgtgcg atcgtcgccc 1408021 tcggcgggtt cgcctggcag gtcgcgttgc gcctggcggg cgcgtcgggg acacccaagc 1408081 cgcggttcgg ccacggcgtc gttaccgagc tgggagccgg tgtgcggcta ctgggctgct 1408141 accacccgag ccagcagaat atgttcaccg gtaggttgac tcctacgatg ctcgacgaca 1408201 ttttccgtga ggccaagaag ctggccggga ttgagtgacg tgaagacggt tgtggtttcc 1408261 ggcgccagtg tggccggtac ggcggcggcg tactggcttg ggcggcacgg ctattcggta 1408321 acgatggtgg agcgccatcc cgggctgcga ccaggggggc aggctattga tgtccgaggt 1408381 ccggcgctgg atgtgttgga acgtatgggg ttactggcag ccgcccagga acacaagacg 1408441 aggattcggg gcgcctcctt cgtcgatcgt gacggcaatg agctgttccg ggacaccgaa 1408501 tcgacgccca ccggcggtcc agtcaacagt cccgatatcg agctgctacg tgacgatctt 1408561 gtcgaattgc tctacggggc aactcaaccc agcgttgaat acctgttcga cgacagcatt 1408621 tccacattgc aggacgacgg cgactcggtg cgggtgacct ttgagcgcgc ggcggcccgc 1408681 gagttcgacc tcgttatcgg tgccgacgga ctgcattcca acgtgcgcag gttggttttc 1408741 ggtccggagg agcagtttgt caagcgatta ggaactcacg cggcgatttt taccgtgccc 1408801 aacttcctgg agttggacta ctggcagacc tggcattacg gtgactccac catggctggc 1408861 gtttacagtg cgcgcaacaa caccgaagcc cgcgctgcac tagccttcat ggacaccgaa 1408921 ctgcggatcg actaccgcga caccgaagct cagttcgccg aactgcaacg tcggatggcc 1408981 gaggacggct gggtgcgcgc gcaactgctg cactacatgc gcagcgcacc ggatttctat 1409041 ttcgacgaaa tgtcgcagat cctgatggat cgctggtcgc ggggcagggt agcgctcgtt 1409101 ggcgacgctg gttattgctg ctcgcccttg tcggggcagg ggaccagcgt cgccctgctg 1409161 ggtgcctaca tcctggccgg cgaactcaag gcggccggtg acgactacca actcggattc 1409221 gccaattacc acgccgaatt tcacggcttt gtcgagcgca accaatggtt ggtcagcgac 1409281 aacatccccg gtggtgcgcc gataccgcag gaggagttcg aacgaatcgt gcattccatc 1409341 acgatcaagg actactgagc gccttcaccc gggcgcagcc aggatggcgc tcgtcggccg 1409401 cttcaccgaa cctgaagatc tgcagacgaa gtacgagtag gggccggcaa atttaccggc 1409461 tcgacgcgca gaagcgccga gatttagcgg cgggtcaata cgacgaccgg gattggccgt 1409521 gacgtccggc tctggtagtt ggtgtatcgg ttggcgttgt tctcgttgac gatctgccag 1409581 agccgcgcgt agtccgggtc gtggggctgc accggtttcg ctgtcacacc gaatcgcttg 1409641 ggcccgacgt tgatttcgac gtccgggttg gccttgaggt tgtggtacca acccggcgag 1409701 cggggatcgc cacctttgga cgccacgatc aggtacgcgt cgccgtcgcg agcataggtg 1409761 agtgacgtgg ttcgcggctg gctcgtcttg gcgccggtgg tatgcagcag caaactcggt 1409821 ggcgcgccgg ggattcggtg tccgatccga ccgttagtgc ctcggtagat cgcgtcgtgc 1409881 agcctgagca gctgcacgcc tacgtggcgc tcaagccatc gggaaatgtc catggggtca 1409941 gtcttgcgca gcggcatcct gttgcgccag cgcctcccgc aggatccgtc cggtggcttc 1410001 ccggtccggg tcgcggcgca gcatcattcc cttggcgacc gacagcttgt cgccgttgcg 1410061 cggcggtaat acgtgcaagt gaacgtggaa caccgtctga aaagcggcac ggccgtcgtt 1410121 gatggcgatg tgtgtcgcgt cagccaactt cgtggcgcgg gccgcccgcg cgatgcgttg 1410181 gccgatggcg accatgtcag ccaacgcctc cggcggggtg tcggtgaggt caacggtgtg 1410241 tcgcttgggc agcaccagcg tgtggccgcg ggtgaacggg cggatgtcga ggatcgcgag 1410301 atagccgccg tcctcgtaga tccggatggc cggagcctcc ccggcgatga tcgcacagaa 1410361 cacgcagggc atgtcgctac ggtactggac ctctcggaga ccgcccaagt gaacgggata 1410421 cgctgccgcc gtggacccta ctgacctggc cttcgccggt gccgcggcac aggcgcggat 1410481 gctggctgac ggtgcactca ccgcgccgat gctgctcgag gtctacctgc aacgaattga 1410541 gcgtctggac agccacctgc gcgcctaccg ggtggtgcag ttcgaccggg cgcgtgcgga 1410601 ggccgaggcc gcccagcaac gcctcgacgc cggtgagcgg ctgccgctcc tgggcgtgcc 1410661 gatcgccatc aaagatgatg tcgacatcgc cggggaggtg acgacatacg gcagcgccgg 1410721 gcacggtccg gccgcgacgt ccgacgcaga ggtggttcgc cggctgcgcg cggcaggcgc 1410781 tgtcatcatc ggcaaaacca acgtgcctga gttgatgatc atgcccttca ccgagtcgct 1410841 ggccttcggg gccacccgga atccgtggtg cctcaatcga acccctggcg gcagcagcgg 1410901 cggcagcgct gcggcggtag cggccgggct ggcgccagtg gcactgggat ccgatggtgg 1410961 cggatcgatt cgtatcccgt gtacctggtg cggtctgttt gggctgaaac cacagcgcga 1411021 tcggatttcc ttggagccgc acgacggggc ctggcagggg ctgagcgtca atggcccgat 1411081 cgcgcggtcg gtaatggacg cggcgttgct actggacgcg accacaacgg tgcctggtcc 1411141 cgaaggcgag tttgtggccg cggccgcacg ccaacccggc cggctgcgaa ttgccttgag 1411201 caccagggtt ccaaccccgc tgcccgttag gtgcggcaag caagaactgg cagccgtcca 1411261 ccaggcaggt gcgttgctac gtgatctggg ccacgacgtc gtcgtccgcg atcccgacta 1411321 tccggcttcg acctatgcca actacctgcc ccgctttttc cgcggtatca gcgacgacgc 1411381 ggacgcgcag gcgcacccgg accgcctcga agcacgtacc cgagccatag cgcgtctagg 1411441 gtcgttcttc tccgaccggc ggatggcggc cctgcgggcc gccgaggtgg tgctgagcag 1411501 ccggatccag tcgatcttcg acgatgtcga cgtagttgtg acgccaggcg ccgcgaccgg 1411561 cccgtcccgc atcggcgcct accaacgccg gggtgcagtt tcgacgttgc tgctggtggt 1411621 gcagcgggtt ccgtactttc aagtctggaa tctgaccggc cagcccgcgg ccgtggtgcc 1411681 gtgggacttc gacggcgacg gcctgcccat gtcggttcaa ctcgtcggcc ggccgtatga 1411741 cgaggcgacg ctgctggcac tggccgcaca gatcgaatct gccagaccct gggcccatcg 1411801 gcggccgtcg gtgtcatgac attgcagtcg cccgctcgtt tttcacgttt ttgcccggcc 1411861 gcaggacatg tgcggcggcg ttaacgttga ctggtgacag accacgtgcg cgaggcggac 1411921 gacgcgaaca tcgacgatct gttgggcgac ctgggcggta ccgcgcgcgc cgagcgtgcg 1411981 aagcttgtcg agtggttgct cgagcagggc atcacccccg acgagattcg ggcgaccaac 1412041 ccgccgttgc tgctggccac ccgccacctc gtcggcgacg acggcaccta cgtatccgca 1412101 agggagatta gcgagaacta tggcgttgac ctcgagctgc tgcagcgggt gcagcgcgct 1412161 gtcggtctgg ccagagtgga tgatcctgac gcggtggtgc acatgcgtgc cgacggtgag 1412221 gcggccgcac gcgcacagcg gttcgttgag ctggggctga atcccgacca agtcgtgctg 1412281 gtcgtgcgtg tgctcgccga gggcttgtca cacgccgccg aggccatgcg ctacaccgcg 1412341 ctggaggcca ttatgcggcc gggggctacc gagttggaca tcgcgaaggg gtcgcaggcg 1412401 ctggtgagcc agatcgtgcc gctgctgggg ccgatgatcc aggacatgct gttcatgcag 1412461 ctgcggcaca tgatggagac ggaggccgtc aacgccggag agcgtgcggc cggcaagccg 1412521 ctaccgggag cgcgacaggt caccgttgcc ttcgccgacc tggtcggttt cacccagcta 1412581 ggcgaagtgg tgtcggccga agagctaggg cacctcgccg ggcggctggc cggcctcgcg 1412641 cgtgacctga ccgctccgcc ggtgtggttc attaagacga tcggcgacgc ggtcatgttg 1412701 gtctgtcctg atccggcgcc attgctggac accgtgctga agctggtcga ggtcgtcgac 1412761 accgacaaca actttccccg gctgcgagcc ggcgtcgcct ccgggatggc ggttagccgg 1412821 gccggcgact ggttcggcag cccggtcaac gtggcaagcc gggtgaccgg ggtggcgcgc 1412881 ccgggtgccg tgctggtcgc ggattcggtg cgggaggccc ttggtgatgc ccccgaagcc 1412941 gacggatttc agtggtcctt cgccggcccc cgtcgcctca ggggaatccg gggtgacgtc 1413001 aggctttttc gagtccggcg aggggccact cgcaccggct ccggcggcgc ggcccaagac 1413061 gacgatttgg ccggctcgtc accgtaggca ggcacaccgg tacacatggg cagacccggc 1413121 gtgactctcg gggggcgtct gacaccgcct tctgcgggtc ttgcgcggcc ggccttcacc 1413181 ccgtcttccg gcactttcga ttggtcacta accgggcctg cttcgatacc aaaaatacaa 1413241 cgtcgaatgg ctgatcacaa tggttctcgc caggccggac gctgttttcg cgccggccag 1413301 gaaccggtgt cacgtttcgc tgccggtgaa cgcgatgtca ttaaagatga aagtatgtaa 1413361 tcatgtaatt atgaggcacc atcacatgca cgggcggcgc tacggtcgcc ccggcggctg 1413421 gcagcaagct cagcaaccag atgccagtgg ggcggcggaa tggttcgctg gccgcctgcc 1413481 cgaggactgg ttcgacggcg accccaccgt catcgtcgac cgtgaagaaa ttacggtgat 1413541 tggcaagctg cctggactcg agagccccga ggaagaaagt gcggcccgag cctcgggccg 1413601 cgtgtcgcga ttccgcgacg aaacccgacc ggagcgaatg actatcgccg atgaagccca 1413661 gaatcgctac ggacgcaagg tgtcctgggg cgtcgaggtc ggtggtgagc gaatcttgtt 1413721 cacgcacatc gcagtaccgg tgatgacgcg gttaaagcag ccggaacggc aggtgctgga 1413781 caccttggtc gacgctggcg tggctcgttc ccgctcggat gccctcgcgt ggtcggtcaa 1413841 gctggtcggc gagcacaccg aggagtggct ggccaagctg cgcaccgcca tgtcggcggt 1413901 ggacgatctg cgcgcgcaag gcccggatct tccggcctaa acggccaccg ccgaatgcgt 1413961 cattccttgt tgactttgtc aacgatcttg gcggcgatct ggcctgcttg attggtgatc 1414021 cggtacccgc atgcgttgac gtcgacgacc acattgttgg ccacgctcat cgcgcgttgg 1414081 cattcccagc cctcagcgcc ttcttgggtg tctatcaccg tgatcgtcgg cgggctgcct 1414141 ttgacgtcgg caaacgtcca ccggtaggtc ttggccttat tcgtgacggt gaccgtcttg 1414201 cctgcgcagt tcttccattt gtcggccgaa gtctgcacga acgcgcgggc tttgtcggcg 1414261 gtcggaaagg cgacgacggc ttggttcacc caatgttcgt agttgtcgcc cggctcggat 1414321 gaaatcaagc cgttgatggc ggtgtagccg gtgccggcat acaccggatc ctggctggta 1414381 tacagcgcgc cctggcagtc cggcagggac accgtcaccg gcgaagagtc catcgatgtg 1414441 atcggtttgc ccggctgcat ggacgacgag cccatcacgg cgttgacttc tgaggagttc 1414501 agcagtaggg cgctaaggcg ctcctccgca accggctgag gcggctgtac cggcttgggc 1414561 cggatggcga tccagatgcc gatggcgccc aacacgagga cgagcacgac ggcggcggcg 1414621 ccggccacta agggccacgg gttggttttg cgcggggtct gggcccaggg gctggggccg 1414681 ccggacggcg gtgcgcccca gccgccgccc tggtagtact gcggggtggg agtgggtccg 1414741 ctggccggca tcgggccgct attgggcgcc caggacggct ggccggtggg gccaggccgc 1414801 tgtccggcag gccccggctg ggccgggggc gtgtaggacg gctttggtgc cggctggaca 1414861 cccggcgggg tgacgggggg tgcgggtggc tgccgaggag ccatggcggt ggccggcatg 1414921 gtcggaggcg ggacgggctt aggcggcgcg ggcagggtgg attcttggct gcggcgcagg 1414981 atgtcggcgg cgtggtcttg gtcggggtcg ctgagcgctt cgtgggcggc cagggccagg 1415041 tcgccggcgc tggcgtagcg gtcttcgggc tttttggcca tgccgcgggc gaccaccgcg 1415101 tcaaaggctt tggggatgcc cgggcggatg gcgctgggct gggggatggg tcccatcagg 1415161 tgggagctga ccagtgtgcc ggcgctgtcg gcgcgatacg gcggggcccc ggtcaagcat 1415221 tcgtgcagca cgcaggccag cgcgtagatg tcggcgcggt aggttacctc gtcgttggag 1415281 aaccgttcgg gggccatgta tttccaggtg cccaccgcgg tgcctaactg ggtcagtttc 1415341 tcgtcggtgg tcgcactggc gatcccgaag tcgaccagat aggcaaagtc gtcgcgggtg 1415401 atcagaatgt tttgcggttt gacgtcgcgg tgcatcaccc cgtcggcgtg tgcggcatcg 1415461 agcgccgagg cgatctgggt gatgatggcc accgcgcgcg gtggggtcag cgggccgaag 1415521 cgtttgagca cgctgtcaag gtcggtgccc tccaccaggc gcatctccaa aaacatttgg 1415581 ccgtcgactt cgccgtagtc gtggatgggc accacgtgag gttcctgcaa ccggccggcg 1415641 atgcgggctt cgcgtttcat ccgctcgcga aacaccgggt ccttgctgaa ttccgcggtc 1415701 atcagcttga cggcgacggt ccactccttg acggtgtgct cggcctcgta gacctcgccc 1415761 atcccgcccc ggcccaacag ccgtttgagg tggtagggcc caaacatcga gcccacccgc 1415821 gagtcctgtg cgtcgctcat cgctgatcct cccaaccaac ccgctgccgc cgacactatc 1415881 aacaacggtc aggtatcacg tcggctgcga tcgccgggcc cagcaacctt gccaggcaac 1415941 aatgacgcta ggccttcgcc ggctcgaccg cacgaaaatc tgccacatct tcgcgggatg 1416001 tcggcgactg cggtggctgt gccattcgct ggtacgcgcc gctgttcggc taccgaaaag 1416061 tgttgtggta attggttacc gcagcccagc gccggcggcc agcgcgcgac gttgccacga 1416121 aaagctttgt gtagcagtca tatccgtgga catcggtgtt aagggcttgt gtccacggat 1416181 ctacgtgccg ccatgcgtcc ccgcgctgat ctggaacgtg aattcatggt cacagatgcg 1416241 aatgtggtcg ccgtcgttca gcgtgaccgc ggagcggatt cgctcgtgct gcacatgcac 1416301 gccgttggac gatcggaggt cgttgatgac gtagttggtg cccgtgtcga cgatgacggc 1416361 gtggtggcgg ctgacgttgg cgctgtctag gacgatgtcg ttgtcatgca gacgcccgat 1416421 ccgggtcgcc gcggcttgca gtgggtagcc gcgacccgag gcgatgtcgt gcaggtaggc 1416481 caccgcctgc tggcccgacg ccatggtgcg ctgatcgagc accgtgacgg tgccggcagc 1416541 ggtggttttg gcggacttct tggcatccag cggttgctga cgcagaatcc gctcgttgag 1416601 agcgcgcaac gtcggaccgg ggtcgatgcc gaggtcgtcg gccagtgttg tcttcacccg 1416661 gcgataggcg cccagcgcat cggattgccg gtcggagagg tagtaggcgg tgatcagctg 1416721 tgtccacagc ggctcccggt aggggtgttc gaatgtcaga gcctcgagct cggcgatcac 1416781 tgcgctggcc cgcccacacg cgatttcggc ctccgccttg gcggtatggg caagaacctt 1416841 gtcttctacc agcgccgtgg caaagggttc gacgaactgg aagtcgcgca ggtcatcgag 1416901 caccggccca cgccattctc tcaatgcggc cgacaggtgg cggctggctt gttcgaaccg 1416961 gccggcggcg gccgcgtgca cgcccgcggt tttttcggca acaaaccgcc ccagatcgca 1417021 agtgttgtcg gggatgctga gccgataacc cggcggcgct gcggccaaca ccacccgtgg 1417081 gtcgatcccg gcgccaccga ggagcttacg cagattagac acgtaggagt ggatactcgc 1417141 gcgtgcgccc gagggtggcc actcctccca gagggcggtg attagggcgt cgactcctac 1417201 gggcctgttg cggttgatga ccaacatggc tagcacagcc cgttgcttgg gggtgcccga 1417261 tggcaccggg gtgccgtcga tagtcatctg caatggtcca agcaggccga agtcgagccg 1417321 cttctccact gtcgcgctac cagccattgc gggtcctccg tggcttgcgg tgccaaggtg 1417381 ccaatagggt gtcgctaccg gtcattgtga taccacgttt cgccgatgcg gtaagaaccc 1417441 aggatctcgg cacgccgtgc gatgtaccgg gtcggtggcc cttgacagcg gcatcggctg 1417501 tttccatgcg ggtgaaatgc tggccctgta aagatgatcg tgaatgtccc acgggaatcc 1417561 tgttggtgct catccaaaca tgcgatcggc gggcagccga cccggtgttc ttgcaacgag 1417621 tggctgcccg ctgtggtgat cgacattcga gcgcggttca ggtggtgacg gccatgaagt 1417681 cgtggctggt ggcccacgcc tcgacgaagg tttccatcgg gatctgctcg tcgcggcccg 1417741 tgggggtacc gctgtcgttg aggtgaacaa tgccgttttc ggtatcgaca ccggtcacca 1417801 ccacggcgtg gtcagaccgc gggttgccgg cactgtcggt ttcctcgacg ggctggcccc 1417861 agatcatctc ggcgttgatg ctgacgatca cggcgtgccc gctgcccaga tactgctcga 1417921 gggcggccat gccggtggcg actccggtgg ctgtggcgtg gtcctcgtcg gtgataacgg 1417981 cgtcgacgcc gtaatgcgcc agcagcgtcg gtatgtcggc cacgctggta cccattcccg 1418041 agttcgggtg ctcggcgtcg gccggctttg tgtagatgga cccggggtgc acgacgctgg 1418101 gtgtcgactg ggccactttg atgatggcgc gctcggaagg ctccctgccg gtcacttgac 1418161 cgatcacgtc cgcggccgac atcaggacgc agtcgtcgta tgtctgctgg cgccagtact 1418221 tggcggcggc tgccgggtcg ccatacatgg tgcccgccgc tgcgtcggcg gggctggcca 1418281 atcccagtgc aacggcaccg gcggccagcg cgaaggtggc ggtcttgaag gcggtggcga 1418341 ttttgctggt cgtcatcgtc ggtccttttc tcgttccgct atgcggagtg gatgttgaga 1418401 aaaggttccg atggtgacct ttttgttatc tctaggaatt cttggagtga tctgcagtgg 1418461 tcagccgagg ttcaccggtc gcgggcaggc cgatctgcgc gggcgcagtc gacagcgttg 1418521 ctaccgggat gcacggcggt accgacgatc ggtcggctgc ctaagcgggc gtgcgggatt 1418581 agttgcaggc ccaggtgtcg atgtagccgc cgccgagctt ggtcagggcg tccttcatgg 1418641 cggcggccaa ggtgggtcca actcctccct ggtatgccct atcgttggcg gcgacggcgc 1418701 cgcaggcggt gaaactggtg agcaccttgc agtcggagta gccacacgac ttgacggcgg 1418761 tggcttcggc agccgcccgg gttgggtagt cccacgatcg gccccacgag ccgttgccgg 1418821 agtaggcaat tgcgccatag acatcggcgg catttgctgg tgcgggagcc agggtgacgg 1418881 tcgtcgcggc ggcagtggcg acgccggcga cggccaccgc gaaccgtcgc cgaagagtaa 1418941 tcatcgtcgt cattggtgag tcctttccga atgccggcgg tgcggcggtt tcaacaagca 1419001 attaggacga tggctagacc ggtttggtgg cggtgacctg cttaccccag tcggacatcg 1419061 tcaacgtcac cgacgtgtct ttggtgggag cgatctggat ctggaccaag tgcgaggatc 1419121 catccgaagc gatccagacg gtggtgggca ccgttttgac gtcttcggag gtcagacgtg 1419181 agccggccag cgtcgcgatg tcgtcagcag acgagttccc ggtgatcttg gtggtcgcga 1419241 caccgtccgc ctgctggctg ccggcaaccg acgcgtcctt gaggttagcc aacaggttgg 1419301 ccaggccctt gttggggtcg aggagcaccg acacgttgta gatcgaggcg ccgttgccga 1419361 aatcggtgta ggtgccgggc tggcctaggt cggagtacag gtgaccgtca acatagacga 1419421 acttcgcgtc ttcgctcttg ttgccgacga gcaatgtcgc gctaccggtg gcaaccgtct 1419481 gcggtgtgtt ggagatatcg ccttcgagct tggtcacccg caggtttggc acgtcgcctg 1419541 tcaccgcaag tctgacgtgc attccggtga ccttgcgcat cgcatcggtg gcctgcttga 1419601 gtagcatggc cgcatcgccg ttggatgccg tggccgcggt gtcagacgct ttgccggcgt 1419661 ccccttcggt tgagcagccg ccgatcgcca ggacgacggc gagtatggcg gtggcggcgg 1419721 caacaacgga acaaggtgga tgcttcatcg aaatctcctc atgttggccc acagcttcgt 1419781 actgcatagc aatcccgttg cggcagagtc aacagccgac accgagtccg agtgagcgcc 1419841 gcacggcacc gcgagtcgaa tcggccgaat tgaatggcgt ttcaaacgct ttcgttgtcc 1419901 ggcggcaaag cgaatgcggg gatcccggtt gacgggatcc ccgcatcggg tgggcagcgg 1419961 ctaggtgagc tggctggcgt attgcgggca gtaggccttg gttgcgtcga cgacgaagta 1420021 ggctgcctgc ttagtggtca ggttggtttg gctgaggacc tcctcggcga tctcggtgcc 1420081 ggtttcgccg ctggccagct tcttgcagac cagctgggct tgctgggtgg ccacctgcgg 1420141 tgaggagaag gtgacgccaa tggactccat ctgagcaatg aaggcttcgt ctttggtgtt 1420201 ggcgccggcg gtgccggcgg tggcgacggc aagtccgatg gcggcggcgc cgactgcagt 1420261 ggtgaacgct gcgataatgc gaggcgataa cggcgataac atggtcaaga tccttcgcgg 1420321 tcgggatttc cctggatgac ctcagcttgc ggggggcgcc ttggcggatt ctcaacaact 1420381 tcttggtaac ctcgtgggcc cgcgtcgggc taggcccgcg tcatctggta atagaccccg 1420441 cgccgggcca acagctcggc gtggttgccg cgttcgacga tctggccggt ctggaccacc 1420501 aggatgtggt cggcatcgcg aatcgtcgaa agtcggtggg cgataatgaa actcgtacga 1420561 tcccggcgaa gctcgcgcat cgctcgctgg atgagcagct cggtgcgggt atcgaccgag 1420621 ctggtcgcct cgtccaggat caacagctgc gggcgggcaa gaaaggcgcg ggcgatggta 1420681 atgagttgct tctcgccgac gctgatgctg ccgccgtcgc cgctgacccg tgtctggtag 1420741 ccagcaggca gtgtgttcac aaaccggtcg acatgggccg ccctggcggc ttctactatc 1420801 tcgtctgtgg tggcctccgg ccgtccgtag gcgatgttct ccgcgatggt cccgtcgtag 1420861 agccaggtgt cttgcaacac catgccgatt cgcgatcgca gcgactgccg gcttaccgag 1420921 gcgatatcca ccccgtcgat caggattcgt ccggaaccga tctcgtagaa ccgcattagc 1420981 aggttcacca gcgtggtctt gccggctccc gtcggtccga cgatcgccac cgtgctaccc 1421041 ggttcggcca ccagcgacag gtcgcggatc accggcgtgc ccgggaggta agcaaagttc 1421101 acgtgctcaa actcgacccg tccggttagg ttcggcagct ccggctcagg ctccggcgac 1421161 tcctcgggct cgtcgagcac gtcgaacacc cgctccgcgc tggccacccc ggactgcagg 1421221 gcgttgtaca tcccggccag ctggctcagc ggcatgttga actggcggat gtactggatg 1421281 aacgcctgga tgctgccgag cgtgatctgc ccggtggcta cctgcaggcc accggccacc 1421341 gcgaccgcga cgtagccgag gttgccgatg aacgccgtcg ccggctgcac gagaccagag 1421401 aggaactggg cgccgaaacc ggcctggtag acgtcgtcat tcaactcgtg gaaccgttct 1421461 cgtgcggccg cttggtggcc gaacgtcttg actaccgtga acccgctgta ggtctcttcg 1421521 agatgggcgt tgaggcgccc ggtgctggtc cagtgagcta cgaatagggg ctgtgaccgc 1421581 cgggtgatcg cgcgtgtcac cagcagcgac agcggcaccg tcagcagtgt gatcagcgcc 1421641 agcaggcccg agatcgacac catcatggcc agcaccgcca ccatggtcag aatcgacgtc 1421701 accagctggc tgatcgtcat tgacagcgac gactggaggt tgtcgatgtc attggtgacc 1421761 cggctcagca gctcaccgcg ctgttgtccg tcgaagtagg acagcggcag ccggtgcacc 1421821 ttgtcttcga catcggtccg caacctgacc atcgttttct gcacggtgag gttgagcagc 1421881 cgggcttgtg cccaaatcat cagcgctgca gccagataca gcgccaacgc cagcgccagt 1421941 gttcgctcca ccgcggcgaa gtccacacct tggcccggca ccacgttcat cccggacagc 1422001 aggtcggcga aggtgttgtc accacgggcc cgagccgaag cgacggcctg tgccttggtg 1422061 attccccccg gtagccctcg cccgatcacg ccgttgaaca gcaaatcggt ggcatggccg 1422121 aggatccgtg gaacgatgac gccgatcgtc gtgccggcga ttcccagtgt gatcaccgcg 1422181 atgctcagcc ggcgttgtgg cgccagccgt ttcaccagtc gggctgccga tccccagaag 1422241 tcgcgggacc gcatgttcgg gggcgggctt gcggcacggg ggcgtgcgcc cggtggcgcg 1422301 gtcaccctac acccccgacc gtggcgctca gcgattgtga ggcggcgaat tcggcatagg 1422361 tggggcaatc ggccagcagc gtttcgtggg tgcccgtgcc gacgatctta ccgttatcga 1422421 caacgatgac ctggtcggcc tgagcggcat tcgaaatccg ttgtgtaaca acaatgatgg 1422481 ttgcatcacc agatacctgt cgcagcgatg cgtggacttt ggcgtcggtg tgcacgtcaa 1422541 gtgcggagaa cgcgtcgtcg aacacataga tggccggacg tcggatgacc gctcgggcta 1422601 tcgccagccg ttggcgctgc ccgccggaga agttgacacc accttgggcg acacgcgtct 1422661 gcagcccgtc tgtttgtaca aagccgtcgg ccgcggcgac ccgcagcgcc tcccacatct 1422721 cctgctcggt gactacctgg tctgggcccc cgccgtagcg caggttgtcc gcgacggttc 1422781 cggagaagag gtagctgcgc tggggcacca gcccgatcgc tgaccagagc cgctcggtgt 1422841 ggtactcgcg gacgtcgata ccgtcaacca agaccgcgcc agcggtgacg tcgtagagcc 1422901 ggcagatcaa cgacaccagt gtcgacttgc ccgaaccggt actgccgacg atcgcggtgg 1422961 tggtaccggg ccgcgcagtc aacgaaatgt cctgcagcac cgggcagtcg gcgccaggat 1423021 aggtaaaggt tgcgccagcc aagcgcacta cgcccgtgac cccgtccgtc gggaacttgg 1423081 gattgtcggg gttaccgagt gcggcgggcg tggaaagcac ctcggtgatg cgttcggcgc 1423141 agaccgacgc tcgtggcagc acggccagcg tcatggtcgc catcaacacc gccatcagga 1423201 tctgggcgaa gtaggacagg aaggcgatca gggagccgac ctgcatctgg ccgctgtcga 1423261 tgcgtagccc accgaaccag atcagtgcga cgctggatgc gttgatggtc agcgtggtca 1423321 ccggcagcat cagtgcttgc cagttgccgg cgctcagtgc ggcattcgac agcgccgtat 1423381 tggcctgcgc gaacttgtcg cgttcatagc cttcgcgggt gaaggcgcgg accactcgca 1423441 ccccggacag ctgatcgcgc atcacccggt tgatgccgtc gatcaggctc tgcatgcggc 1423501 ggaagagcgg cagcatgtgg gagatgatcc agtagtttgc tacggccaga atcggaacgc 1423561 tgaccagcag cagccatgtc agcgcggcct cctggtggat ggccatgatg attccgccga 1423621 cgcacatgat cggtgcggtg accagcacgg tggcggtcat ctggaccagg aacaggatct 1423681 gccggacgtc gttggtgctg cgggtcaaca acgtcggagc gccgaatcgg gcggtctcgc 1423741 gttccgagaa ggtgatgatg tgttcgaaca ttgccgagcg caggtcacgg ccgaaacccg 1423801 ccccggtccg ggagcccaga tagaccgccc cgatcgcgca cagcacctgc aatccggtca 1423861 ccccaagcat caccgcaccc agccgtacga tggtggcggt gtcgcccttg gcgacgccgt 1423921 cgtcgacgat tgcggcgttg accgtcggga ggtatagcga agccagggtg ctgaccagct 1423981 gcagcatcat cagcatcgcg accagccggc ggtacggtcg gatgtgctgg cgcagcaggg 1424041 ccaggagcat tgggtaactg tcgcacactg cgcatgctgc ctacccgcgc caggcatgag 1424101 tcttaggccg aaatgcctgg ttaactggcg tgtcgtggtt gacccgcggg cctgcggcta 1424161 cagtgcatgc tgtgatcggc agtgggagag gtagcggtgc ggcgtaaggt gcggaggttg 1424221 actctggcgg tgtcggcgtt ggtggctttg ttcccggcgg tcgcggggtg ctccgattcc 1424281 ggcgacaaca aaccgggagc gacgatcccg tcgacaccgg caaacgctga gggccggcac 1424341 ggacccttct tcccgcaatg tggcggcgtc agcgatcaga cggtgaccga gctgacaagg 1424401 gtgaccgggc tggtcaacac cgccaagaat tcggtgggct gccaatggct ggcgggcggc 1424461 ggtatcttgg gcccgcactt ctccttctcc tggtaccgcg gcagcccgat cgggcgggaa 1424521 cgcaagaccg aggagttgtc gcgcgcgagt gtcgaggaca tcaacatcga cggccacagc 1424581 ggtttcatcg ccatcggtaa cgagcccagt ttgggtgact cactgtgtga agtcggaatc 1424641 cagttctccg acgacttcat cgaatggtcg gtgagtttca gccagaagcc gttcccgctg 1424701 ccgtgcgaca tcgccaaaga actgacccgc caatcgattg cgaattcgaa atgagacgtg 1424761 tcctggtcgg tgcggccgcc ttgatcaccg cactgcttgt cttgaccggc tgcacgaagt 1424821 cgatttcggg taccgccgtc aaggcgggtg gggccggtgt cccgcgcaac aataactccc 1424881 aggagcgcta ccccaacctg ctcaaggaat gtgaggtcct gaccaccgac atcctggcca 1424941 agaccgtcgg tgccgatccg ctcgacatcc agagcacgtt cgtcggcgcg atctgccggt 1425001 ggcaggcggc caacccggcc ggtctgatcg atatcacccg gttctggttc gagcagggca 1425061 gtctgagcaa tgagcgcaag gtcgccgagg gcctgaagta ccaggtcgag acccgcgcga 1425121 tccagggcgt ggactcgatt gtgatgcgga cgggcgatcc caacggcgcc tgcggcgtcg 1425181 ccagcgacgc ggcgggagtg gtcggctggt gggtcaatcc ccaggctcct ggtatcgacg 1425241 cctgcgggca ggcgatcaag ctgatggagc tgacgctggc aaccaacgcc tagcgctggg 1425301 cgaggcggga gcgtgggcgt gagcgcgcgc agttgtacgg cactaacggc gtgtcggggt 1425361 acagacacgc gcgctcgcgg gttcggctgc cttcaaaagg aagtacgcgg ctgacggttt 1425421 gcggagcaag agcacctcta ccgtggcacg tgaaagccga ccagcgcggc acaccccggt 1425481 tcgacgtctg cccagtgtcc ggcgacgcgt agcacggcga tccccgacgt cgggaacttc 1425541 tccgagatgc gttccgcgac agcggcgtcg gtgccgctga tgctggccag gacgatggca 1425601 agggccgacg tcgttggctc gtgcccgacc acaagcagtg tggtgacgtt gtcgccaacc 1425661 cggttgatct cctcgatcac tgttccgggt gccgcgccgt agagccgctc ggcgtagcga 1425721 gcgggtgcgt cgatgccggt gtgcgccaag gtctgccggg cgcgcgtagc cgtggagcac 1425781 agcacggcat cgacggccgg caggttggcg cgcagccagc caccggccag cccggcctcc 1425841 cggatacccc gcggcgctag cggccggtca tggtcggcga tcccgtccgg gtacgcagac 1425901 ttcgcgtgtc gcatcagcac caggttgcgg tattgctcat tcactgggct gacgttagtt 1425961 cagtgacgtg cccgggatcg ctacggttgg tcgtcgtcct ggtccccgcc gcgctccgct 1426021 ggcatgggac agacttcgtt gcgatcgcct agctcgagcc gaggcgtcag ccatagggcg 1426081 ctgataggta gggcgagcat tctgtgccca aaggataggg ctggcatcgc ccgggcaagc 1426141 acgggcggca tgctgccccg ccggtgagtc cgcgcccggg acctgccggg cgaggtcccg 1426201 cgccttgtcg gtgtgcagac ctacactcgc tttgcgttga cagccacgca ctcaggaggg 1426261 atgggatgcg attcctgcac actgccgact ggcagctcgg catgacgcgt cactttctcg 1426321 ccggtgacgc ccagccgcga tattctgctg cccgccgtga cgcagtcgct ggactaaaag 1426381 cgctggccgc cgatgtgggc gccgaattcg tcgtagtcgc cggtgacgtc ttcgaacaca 1426441 atcagctcgc gccacagata gtcggtcaat ccttggaagc catgcgcgtg atcggccttc 1426501 cggtctatct gctgccgggt aaccatgacc cgctggacgc ttcgtcggtg tacaccagca 1426561 cgctgtttcg agccgaacgg ccggacaacg ttgtggtgct cgaccgagct ggcgtccacg 1426621 aggtccggcc gggagtccag atcgtcgcgg cgccgtggcg gtccaaggcg cccaccaccg 1426681 acccggttgc cgaggtgctg gccggcctgc ccacagacgc cgctattcgg ctgctcgtcg 1426741 cccatggggg tgtcgacgcg ctggaccccg accacgacaa accgtcgctg atcaggctcg 1426801 ccgcactcga cgacgcgctg actcgacagg cgattcatta tgtggcccta ggtgacaaac 1426861 attcgcttac ccaggtcggc agcagcgggc gggtctggta ctccggtgca ccggaagtca 1426921 ccaacttcga cgacgtcgaa ccggaccccg gtcacgtcct agtggtcgac atcgacgaaa 1426981 gcgacccgcg acatcccgtc accgtcgacg cccgtcgcat cggccgctgg cggttcgtta 1427041 cgttgcacca ccaggtcgac accagccggg acatcgccga cctggacctg aacctggatc 1427101 tgatgacgga caaggaccgc accgtggtgc ggctggccct gaccggttcg ctgacggtca 1427161 ctgaccgcgc cgcattggat acctgtctgg acaagtacgc gcggttgttc gcctggctgg 1427221 gtctgtggga acgtcacacc gacctagcgg tgatacccgt cgacgccgag ttcaccgacc 1427281 tcggcatcgg ggggttcgcc gccgcggccg tcgacgagct agtcgcgacc gcgcgcgggg 1427341 gtgacgacga gtccgccgtc gatgcccagg cggcgctggc actgttgctg cggctcgctg 1427401 accggggagc ggcgtgaagc tgcaccggct ggccctgacc aattaccgcg gcatcgcaca 1427461 ccgtgacgtc gaattccccg atcatggagt ggtggtggtg tgcggcgcca acgagatcgg 1427521 caagtcctcc atggtcgagg cgctggacct gctgctcgag tacaaggacc gctcgacgaa 1427581 gaaggaagtc aagcaggtca agccgaccaa cgctgatgtc ggctccgagg tcattgccga 1427641 aatcagcagc ggcccttatc gtttcgtcta ccgcaagcgt ttccacaagc ggtgcgagac 1427701 ggagttgacc gtgctggcac cgcgccgcga gcagctgacc ggcgacgaag cgcacgagcg 1427761 ggtccggacg atgttggccg aaacggtcga caccgaactg tggcatgccc agcgggtgct 1427821 gcaggccgcc tcgacggccg cggtggatct gtctggctgc gacgcgctct cgcgtgcgct 1427881 cgatctcgcc gccggtgatg acgccgcgct gtcgggcacc gagtcgctgc tcatcgagcg 1427941 gatcgaggcc gagtatgcgc gctacttcac cccgaccggg cgccccaccg gagaatggtc 1428001 cgcggcggtc tctaggctgg cggccgccga ggccgcggtg gccgactgcg cggcggcggt 1428061 agccgaggtc gacgacgggg ttcgtcgcca caccgagctc accgagcagg tggctgagct 1428121 gtcgcagcaa ctacttgctc accagctgcg gctcgaagct gcgcgagtcg ccgccgagaa 1428181 gatcgccgca atcaccgacg acgcccgcga agccaagctg atcgctactg ccgcggccgc 1428241 gaccagcggc gcttccaccg ccgcacacgc cggacggctg ggcctgctca ccgaaatcga 1428301 cacgcgcact gcggccgtcg ttgctgcgga ggcaaaagcg cggcaggccg cagacgagca 1428361 ggcgacggcg cgcgcggagg ccgaggcctg cgatgccgcg ctcacggagg caacccaggt 1428421 attgacggcc gtccgccttc gcgccgagtc ggcccggcgc accctcgacc agctcgccga 1428481 ctgcgaggag gccgaccggt tggccgcccg gctggccagg atcgacgaca tcgagggtga 1428541 tcgcgaccgg gtctgcgcgg agctgtccgc ggtcacgctg accgaggagc tactgagtcg 1428601 gatcgaacgt gctgcggcag ccgtcgatcg cggcggtgca cagctggcgt cgatctccgc 1428661 ggcggtggag ttcaccgccg ccgtcgacat cgagctcggc gtcggcgatc aacgggtgtc 1428721 gctgtccgcg ggccaaagct ggtcggtcac tgccaccggc cccaccgagg tcaaggttcc 1428781 cggcgtcctg accgcacgga tcgtcccggg cgcgaccgca ctcgactttc aagccaaata 1428841 tgctgcagca caacaggaat tggctgatgc gctggcggct ggagaggtcg ctgacctagc 1428901 cgccgcacgc tccgccgatc tgtgccgacg cgaactgctg agccgccgcg atcagctgac 1428961 cgccactctg gccggcctgt gtggcgatga acaggtcgac caactgcgtt cccgcctgga 1429021 acagttgtgt gccggtcaac cggccgagct cgatctggtt tcgacggata ccgctacggc 1429081 ccgcgctgaa ttggatgcgg tcgaggcggc tcgaatcgcc gcggagaagg actgcgagac 1429141 ccgccgtcag atcgctgctg gcgccgctcg ccggctcgcg gagacatcca cgcgggcaac 1429201 ggttctacag aacgcagcgg ccgccgaaag cgccgagctc ggtgcggcca tgactcggtt 1429261 ggcctgtgag cgggcgtccg tgggcgacga tgagctcgcc gccaaggccg aggccgacct 1429321 gcgggtactg cagacggccg agcagcgagt gatcgacctg gccgacgagc tcgcagctac 1429381 ggcgccggac gcggtagccg ccgagctggc cgaggccgcc gacgccgtcg agttgctgcg 1429441 cgaacgtcac gacgaggcca ttcgcgcgtt gcacgaggtc ggcgtcgaac tctcggtgtt 1429501 cggcacccag ggccgcaagg gcaagcttga tgccgccgaa accgagcgtg agcacgccgc 1429561 cagccaccac gcgcgggtcg ggcgccgggc ccgggccgcc aggctgctcc gctcggtgat 1429621 ggcacgccac cgcgacacca cccggctgcg ctacgtcgag ccataccggg cggagctaca 1429681 tcggctcggc cgcccagtgt tcgggccctc tttcgaggtc gaggtcgata ccgatttgcg 1429741 catccgcagc cgcaccctgg acgacagaac cgtgccctac gagtgcttgt cgggcggggc 1429801 caaagaacag cttggcatcc tggcgcgatt ggccggcgcg gcgctggtcg ccaaggagga 1429861 cgccgttccg gtgctgatcg acgacgcgct ggggttcacc gatccggagc gactagccaa 1429921 gatgggggag gtctttgaca ccatcggcgc cgacggacag gtgatcgtgc tgacgtgcag 1429981 tcccacccga tacggcggtg tcaaaggagc gcaccgcatc gatctggacg ccatacagtg 1430041 agcccgaaac ggggacatgc gatggacact cagagcgact acgtcgtggt cggtaccggc 1430101 tcagccgggg cggttgtggc cagccggctt agcaccgatc cggccacgac ggtggtggcc 1430161 ctggaggcgg ggccgcgtga caagaacaga ttcatcggcg tcccagcggc gttttccaag 1430221 ctgttccgca gcgagatcga ctgggattac ctaaccgaac cgcagccgga gctcgacggc 1430281 cgcgaaatct attggcctcg tggcaaggtg ctcggtggct cgtcgtccat gaacgcaatg 1430341 atgtgggtgc gtggattcgc atcagactac gatgagtggg ccgcgcgagc cggtccgcgg 1430401 tggtcgtacg ccgacgtgct cggctacttt cgccgcatcg agaacgtcac cgctgcctgg 1430461 cactttgtca gcggtgacga cagcggagta accggtccgt tgcatatttc ccggcaacgc 1430521 agcccaagat cggtgaccgc agcgtggctg gcagccgcac gtgagtgcgg atttgccgct 1430581 gcgcggccga attcccctcg accggaaggc ttttgcgaga ccgtcgtcac ccagcgccgc 1430641 ggtgctcgat tcagtactgc cgacgcctat ctgaagcccg cgatgcgccg taaaaacctc 1430701 cgtgtgctta ccggcgccac tgctacccgg gtggtcatcg acggcgaccg ggccgtcggc 1430761 gtggaatacc aaagcgacgg tcaaacccgc atcgtctacg cccgccgcga ggtggtgctc 1430821 tgcgctggtg ccgtcaacag ccctcagctg ctgatgctct ccggcatcgg cgaccgcgac 1430881 cacctcgccg aacacgacat cgacaccgtt taccacgcgc ccgaggtcgg gtgcaacctg 1430941 ctcgatcatc tcgtcacggt gctgggtttc gacgtcgaaa aggacagctt gtttgccgcc 1431001 gagaagcccg gccagttgat cagctactta ctgcgacgcc gcggcatgct cacctccaac 1431061 gtcggcgagg cgtacggatt tgtccgcagc cgacccgaac tgaagctgcc cgatttggag 1431121 ttgatttttg ccccggcgcc gttttacgac gaagcgctgg ttccaccggc tggtcacggt 1431181 gtggtattcg gcccgattct ggtcgcgccg caaagccgtg gccagatcac gctgcggtcc 1431241 gccgatccgc atgccaagcc tgtcatcgaa ccgcgttacc tgtccgatct cggtggcgta 1431301 gaccgggccg ccatgatggc gggcctgcgg atatgcgcgc ggatcgcgca ggcccgcccg 1431361 ctcagagatc tccttgggtc catcgcgcga ccgcgcaaca gcaccgagct ggacgaggcc 1431421 actctcgagt tggcgctggc cacttgttcg cacaccctgt accacccgat gggcacctgc 1431481 cgcatgggca gcgacgaggc cagcgtggtg gatccgcagc tgcgggtccg cggtgtcgac 1431541 ggactccgcg tcgccgacgc gtcggtgatg cccagcacgg ttcgtgggca tacgcatgcg 1431601 ccgtcggtgc tgatcgggga gaaggccgcc gacttaatcc gcagctgagc tggtcgccgc 1431661 cggctcagcg tcgcatgaac ccgatggcgg tgtagtccag gtctgccaga cccgtcgcgc 1431721 cgaagttggc cagcgtgctg cggaccgcaa cggtgccggg cgactgggta agcggcaggc 1431781 tgaatccttc ggcccagatc agctcgtcga cctggttggc caaggccctc gccttgccgg 1431841 gatcgagttc tgccagcgtt cgctcgatcg cggcgtcgat ttgcgggcta ccgatcttgc 1431901 cgaagttgct ttccccgtcc gaagcgtaga tctgggtgag cgatgacagc ggaaacgcgt 1431961 cgcccaccca gccgaactgt gcgatgtcga aagcccccac gttgacgtag tcgctgaaga 1432021 aaccgctgcc ggacttggcc tgaagttcga gtttgacgcc gatctgcgcc agggtgtgtt 1432081 gggcgatctg ggcgaactgc cgggtgcttt gtgcgtcgta gaacagatcg cggatgacga 1432141 gctggcgacc gtccttctcc cggaacgcgc cgcttcgcct ccagcccagg gcgtccagct 1432201 cccgtttcgc ttgttccggg ttgtaggcga caacgccgct gttgtcctgg tagccgtctt 1432261 ggccggcgac gaagacgtgg ttgttcagtg gcaccgggtc gctggtgagg ccgtattggg 1432321 cgaccctggc gatggtgtat cggtcgatgc ccttggcgat cgccaggcgc agcgccttgt 1432381 cggcgaggat cgacccaggc gcaccgttga gggtgaagtg ataccagctg ggcccggggg 1432441 cgcgccggat cgagatgccc ttggtgcgcg ccgcgatggt cagctggtcc agtgtgccga 1432501 cgccggtggc gtcgattgtg ttgttctgca gcgccggcag ccgggcggca tcatcgagca 1432561 ccaggtatgt gatgctgtcc aggcgtggcc gtgcccccca ccatctcggg ttacgggtca 1432621 acacgattcg ctgcgcggtg cggtccaggg cagacacgac gaacggaccc gccgacggac 1432681 cgggcccatc gagttgaccc ttattgaatg cctcgggtgt ggcggtcata ctggccggca 1432741 gcagcatgcc gttgcccgcg aacataccgc gccactccgc gtacggcttg gcgaacgtca 1432801 ccacggcctg ccggtcgtcg acccctctgg ttaccgacgc cacacgctcg gcgccgctgc 1432861 tagaagcgat ctcgaatgcc ttgtcggcgc cgctgatcgc atgaatctgg ctggcgatgt 1432921 cccgccaggt gatcggggtc ccgtcggacc acaccgcctc gggattgatg gtgtaggtga 1432981 ccacctgcgg ggcggtcctg gtcagctcga tgctggtgaa gtagttggtg tcgaccgtcg 1433041 tcgagccgtc cggtccgatg atgaacgcgc gcggcaaggt ggctttcatc atcgccgcga 1433101 cctcggcgtt gttgccgtcg atgtgcaaga tgttgaagtt gggcggaaag tcggtgagcg 1433161 acaggcgaag attgccgccg tcttgcaacg tggcgggatc ctgctgattg atgtcgctgg 1433221 tggtgccaac cgcggccctg cggtccgcag tgggcgcgag ttcgagttgg gtaccggagg 1433281 ccgagcatcc ggtgagcacc atagccacga cgagcggtgt taataacgcg aaagcccaat 1433341 atcgagtctg cgtccagggt ctggatttcc cctgaaacga cgccctgagc gcagacgcga 1433401 tgcccggggc gcagcctcgt cgctggccac ggtcagccac gacgggccgg atccggttgc 1433461 ggtaccgcgc ccagcagtcg cctggtgtac tcgtgtttcg gattgccgaa gacctcctca 1433521 ctgtcgccct gctcaacaac ggtaccggca agcatgaccg ccacctggtg ggcgaggtgt 1433581 ttgaccaccg aaagatcgtg ggaaacaaat aaatatgaca acccgaactg ctcttggagg 1433641 tcgagcagca ggttgatgat cccggcctga atggagacat cgagtgccga caccggttcg 1433701 tcgagtgcca ggatcttggg ttggagcgcc agtgcccgcg cgatgccgat gcgctgcttc 1433761 tgaccgccgg agaactcggc gggataacga ctggcgtcgc cgtggcgcag tccgacgata 1433821 tcgagcagct cggcgacccg cgcgtgagtc tcgttcttgc cgaacccatt ggcctgcaat 1433881 ggttcggcaa tcagatcgaa gaccggcagc cgcgggtcta aggacgccac cgggtcttgg 1433941 aagaccacct ggatgtcgcg gcgcagcgat cggcgttccg ctgtccccag cgtggcgacg 1434001 tcagtgccga ggacttcgat cgatcccgat tgcggcgcag ccagctccag gatctcgtgc 1434061 agggtggtcg acttgcccga accggattcg ccgacgatac ccaacgtgcg gccctgccgg 1434121 agttcgagac tgatgccgtc gaccgcgcgg acctcgccga tcgcccggcg cagcaccacg 1434181 cccttggcca gccggtaggt tttgactaga tgacgtaccc gcacgaccac cgaggcgtcg 1434241 ccgagtgcag ccgggcgggc ctcggttttg acccggtaga tgtcggcggc gctgcgcccg 1434301 gtgaccagct cggtgcggat gcaggccgcc cggtgatcgg tagcgacgtc aagcaattcg 1434361 ggttccgcgg taaggcattc gtcgatgact agcgggcagc gcggcgcgaa cgggcaaccc 1434421 ggtgccaagc ccgccagcga cgggggcgca cccggtatcg gcaccagccg ggtgccctgc 1434481 gcggcatcca gccgggggac cgagcctaaa agccccacgg tgtagggcat ccggcgatcg 1434541 cggtacagat cattcacccc ggccgactcg acgacccgtc cggcgtacat caccagcgcc 1434601 cggtcggcga actcggccac gacgccgagg tcgtgggtga tgatcagcac cccggcgccg 1434661 gtgacgtcgc gcgccgcctt gaggacgtcg aggatctgcg cctgcaccgt gacgtcgagc 1434721 gccgtggtcg gttcgtcaca gatcaacagg tcgggatcgt tggcgatcgc gatggcgatc 1434781 accacgcgtt ggcgttcgcc acctgaaagc tcatgcggaa acgcacggga acgccgctgc 1434841 ggctgcgaaa taccgaccag gtcaagcagt tccaccgcac gccgacgagc ggccttcttg 1434901 ccaacacggg gctggtgcac ctcgatggcc tcggcgattt ggtcgccgac ggtgtagaca 1434961 ggggtgagcg cagacatcgg atcctggaac accgtgccga tcgccttgcc tcgaaaccgg 1435021 gacatcgcgt tgtcggcaag ccccaacagt tcggtaccct gtagccgaac cgaaccacgc 1435081 acctgcgcgt actcgggcag caggcccacc accgccatcg ccgctgcgga cttacctgaa 1435141 cccgattcgc ccaccatcgc gaccacctcg ccgggctcga cgcggtagct gatcccgcgc 1435201 accgcggtca ccggatcgcc atcggtcctg aaggtgacgg ccaaatcggt cacctcgagc 1435261 agggggctca tcgcacacca cggcgcaggg atctgctggc tgggtccagc gcgtcgcgca 1435321 ggccatcgcc ggtcaggttg gcgcacacca gaatcaacac caggatactg gcgggaaaca 1435381 agaacaccca cgggaacgcg gtcgcggatg cggtgccgtc ggcgatcagg gtgcccagcg 1435441 acacatccgg cggttgaata ccgaaaccaa ggaagctcaa cccggtttcg gccaggatgg 1435501 cggcggcaac attgagggcg gcgtcgatga tcaagatgga tgcgacgttg ggcaccacat 1435561 ggccgacgat gatccggcgg ctggagacac ccatatatcg tgcggccctg atgaattcgc 1435621 gttctcgcaa gctcatcgtc atcccgcgca ccatgcgaga gctgatcatc cagccgaagc 1435681 cggccaacaa caagacaaga aacatgatgt ttgccgagtt cttggttcgc ggggtaacga 1435741 tggcgatcag gatgaagctg ggcactacta gcagcagatc gaccacccac atcagtgtcc 1435801 ggtcccgcca gccgccgaaa tatcccgaga tcgctccaac cgtggcagcg ataccagtcg 1435861 agatcaccgc aacgcaaaca ccaatcagca tcgacttctg catgccacgc agcgtctgcg 1435921 ccagcagatc ttggcccagc gcgttagtgc ccagccagtg cttggtgccc ggcggctgca 1435981 gcaatgcgtt gaaatcaagg tcgtcgtagg agtagggcaa tagtgggggc agcgcataag 1436041 cgctgacgaa cagcaggagc agcgccgcca gcgacgccac cgcggcccga ttgcgtagga 1436101 acctgcgcac cactagggtg cgccgcgagg cgaattccgt catgacaccc gtaccctcgg 1436161 gtccaaagcc gcgtagatca cgtccgagag caaaccggcc agcaacacga ccgcgccgga 1436221 gaacacggta attgccgcga cgatgttggt gtcctgagtc gagataccgc ggaccatcca 1436281 ttcacccatg ccgtgccagc cgaagatctt ctcgacgaaa accgctccgg tgaccaaccc 1436341 ggccaccccg taggcgaaca gcgtggccat cggtattagc gccgttcgca ggccatgctt 1436401 gagtagggcc cgtcgtcggg tcagcccctt ggcgcgggcg gtgcgaatga aatcctggcc 1436461 gaggacatcc agcatcgcgt tgcgctggta gcggctgaac ccggcggcgg ccgccagcgc 1436521 caacgtcagc gatggcagga tcaaatgctg caaccggtcg cctagccgat cccacacccc 1436581 gccggcaacg ccgggtgacg tctccccggt gtagtcgaaa agctggatgc ccactgccca 1436641 gttgacccgc agggcgccca ggatcaacag gttggccacc acaaacgtcg gtgtgctcaa 1436701 caccagcagc gccagcgtgg tcatgacgcg gtcgctgagc cggtactgcc ggatggcacc 1436761 ccacgccccg atcaccacac cggccaccgt gccgaatacc gatccaacga ccagcagccg 1436821 caggctgact ccgatccggc gccccagttc ggtaccgaca ggctggccgg tgatggtggt 1436881 tccgaagtcg ccacggacgg catgcgatac ccagttggcg tagcgggcca gtatgggtct 1436941 gtccaagccg agatcgtgtg ccttggcatc gataaccgct tgcggtgggc gcggactgcg 1437001 ttgcatcagg ctttccagcg gcgagaacgc cagcgaggtc aggcagtacg tcaaaaacga 1437061 cgccagcgcc agcagcacca ggtagttgag caaccggcgg gccagatagc gcgtcatgcc 1437121 caaccaccgc gtcgcattgg gacagggtag cgagcccggc gatggcgtgc cgccagcgcg 1437181 ccggttgatg gggtcacccg tgatccggat ggttccgctc gggccgattc tgatgcgtga 1437241 aaactgggta accggttgtt aaaattcacc gcggcgtcga tctgagtagc aaagtccaca 1437301 ccgcgatacc cgaggaggcc cgcgtgacgg ttaccgacga ctacctggcc aacaacgtgg 1437361 actacgcgag cggtttcaag ggcccgctac cgatgccgcc gagcaaacac atcgcaatcg 1437421 tggcgtgcat ggacgcccgg ctggacgtct accgcatgct gggcatcaag gagggcgagg 1437481 cacacgtcat ccgcaacgcc ggatgcgtgg tcaccgacga tgtgatccgt tcactggcca 1437541 tcagccagcg gctgctggga acccgcgaaa tcatcctgct gcaccacacc gactgtggga 1437601 tgctgacttt caccgacgac gacttcaagc gcgccatcca ggacgagacc ggcatcagac 1437661 ccacgtggtc gcccgagtcg taccccgacg ccgtcgagga cgtccgtcag tcgctgcgcc 1437721 gcatcgaggt caacccgttc gtcaccaagc acacgtcgct gcgcggcttc gtcttcgatg 1437781 tcgccaccgg caaactcaac gaggtcacgc cctagcagcc cgagccgtca gcctagggcg 1437841 cactggcgca ccggcagccc gccgagatgg ggctgcgttg acagcgatag ggaagcctgg 1437901 ttgcatagat ggcaataacc ataaatatgg tcaatcctac cggatttatc aggtatgagg 1437961 acgtggaaca ggaagccatg accagcgatg tgacggtggg ccccgcaccc ggccagtacc 1438021 aactgagcca tctgcgcttg ctggaggccg aagccatcca cgtcatccgg gaggtggccg 1438081 ccgagttcga gcggccagtg ctgttgttct cggggggcaa ggactccatc gtcatgctgc 1438141 acctggcgct gaaggcgttt cggcccgggc gactgccgtt cccggtcatg cacgtcgaca 1438201 ccggtcacaa cttcgacgaa gttatcgcta cccgagacga gttggtcgcc gcggccgggg 1438261 tgcggctggt ggtggcgtcg gtgcaggacg atatcgatgc cggtcgggtc gtcgagacca 1438321 tcccgtcgcg aaatccgata cagaccgtga cgctgctgcg ggccatccgg gagaaccaat 1438381 tcgacgcggc attcggggga gcccggcgcg acgaggagaa ggcccgcgcc aaggagcggg 1438441 tgttcagctt ccgcgacgag ttcggccagt gggacccgaa ggctcagcgg ccggaactgt 1438501 ggaacctcta caacggacgg caccacaagg gcgagcacat ccgggtcttc ccgctgtcca 1438561 actggaccga attcgacatc tggtcctaca tcggcgccga gcaggtcagg ctgccgtcca 1438621 tctatttcgc ccaccggcgc aaggtgtttc agcgcgacgg catgttgctg gccgtgcacc 1438681 ggcacatgca accgcgagcc gacgagccgg tgttcgaggc cacggtgcga ttccgcaccg 1438741 tcggggatgt tacctgcacc gggtgcgtcg agtcgtcggc atcgacggtc gcggaagtca 1438801 tcgccgaaac tgcggtggcc cgcttgacgg agcgcggggc gaccagggct gacgaccgga 1438861 tctcggaggc tggaatggaa gaccgcaagc ggcagggata cttctgatga cgacgctatt 1438921 gcggctggcg acagcgggtt ccgtcgacga tggcaagtcc acgctgattg ggcggctact 1438981 ctacgactcc aaggctgtga tggaagacca gtgggcgtcg gtggagcaaa cgtccaagga 1439041 ccggggccac gactacaccg acctggctct ggtcaccgac ggcctgcggg ccgagcggga 1439101 acagggcatc accatcgacg ttgcctaccg ctacttcgcc actcccaagc ggaaattcat 1439161 cattgccgac accccgggac acatccaata cacccgcaac atggtgaccg gtgcgtccac 1439221 cgcccaactg gtgatcgtac tggtggatgc ccggcacggc ttgctggagc aatcccgccg 1439281 gcacgccttc ctggcgtcgc tgctgggcat ccgccacctg gtgctcgcgg tcaacaagat 1439341 ggacttgctt ggctgggacc aagagaaatt cgacgcgatt cgagacgaat tccacgcctt 1439401 cgcggcccgc ctcgacgtgc aggacgtcac ctccatccca atctccgcgc tgcacggcga 1439461 caacgtggtg accaaatccg accagacgcc ctggtacgag ggaccgtcgc tgctgtcgca 1439521 tctcgaagac gtctacatcg ccggtgaccg caacatggtc gacgtgcgat tcccggtcca 1439581 gtacgtcatc cggccgcaca ccctcgagca tcaagaccac cgcagctacg cgggcaccgt 1439641 ggccagtggg gtaatgcgtt caggcgacga agttgtcgtg ctgccgatcg gtaagaccac 1439701 ccggatcacc gcgatcgacg gcccgaacgg cccggtggca gaagcgtttc cgccgatggc 1439761 ggtttcggtg cggctcgccg acgacatcga tatctcgcgt ggtgacatga tcgctcgcac 1439821 ccacaaccag cccaggatca cacaagaatt cgacgcgacc gtgtgctgga tggccgacaa 1439881 cgcggtgcta gagcccggcc gcgactacgt tgtcaagcac accacccgaa ccgtccgcgc 1439941 gaggatagcc gggctggatt accggctcga tgtcaacacc ctgcatcgcg acaagaccgc 1440001 aacggcgttg aaactcaacg aactgggccg tgtttcgctg cgcacccagg tgccgttgct 1440061 gcttgacgag tacacccgca acgctagcac cggctcgttc atcctcattg accccgacac 1440121 caacggaacg gtggcggcgg gcatggtgtt acgcgacgtc tcggcccgca cgcctagccc 1440181 gaacacggtg cggcacagat cgctcgtcac tgcgcaagat cggccgccca ggggcaagac 1440241 ggtgtggttt accggactgt ccggctccgg caagtcgtcg gtggccatgc tggttgagcg 1440301 gaagctactc gaaaagggca tctccgctta cgttctggac ggcgacaacc tacggcatgg 1440361 cctcaacgcc gacctgggct tttccatggc cgaccgcgcg gagaacctgc gccggctgtc 1440421 gcatgtggcc acactgctcg ccgattgtgg ccacctggtg ctggtgcccg cgatcagccc 1440481 ccttgctgag caccgtgccc tggctcgtaa agtgcacgct gatgcgggaa tcgacttttt 1440541 cgaggtgttc tgtgacaccc cgctgcagga ctgtgagagg cgtgatccca aagggttgta 1440601 cgccaaagcg cgtgcgggtg agatcacgca cttcaccggg atcgacagcc catatcagcg 1440661 gcccaagaac ccagacctac ggcttacgcc ggatcgcagc atagacgagc aggcgcagga 1440721 ggttatcgac ctgttggagt catcgtctta ggccggcctg gttgctctgc tgtccctggc 1440781 aagcgggtgg cacaatcctg aagcatgcgg atgtcagcta aggcggagta cgcggtgcgg 1440841 gcgatggtcc agctcgccac ggccgccagt ggcaccgtgg tcaagaccga cgatctggct 1440901 gcggcccaag gcataccacc gcagtttctc gtcgatatcc tgaccaacct gcgcaccgac 1440961 cgcctggtgc gaagccaccg cggtcgcgag ggtggttatg aattggcgcg tccgggcacc 1441021 gagatcagca tcgccgacgt attgcgctgc atcgacggac cgctggctag tgtccgcgat 1441081 atcggacttg gcgacctgcc ctactcgggc cccactaccg cgctgaccga cgtttggcgc 1441141 gcgctgcgcg ccagtatgcg gtcggtgctg gaggagacca cgctggctga cgttgccggt 1441201 ggcgcgctgc ccgagcacgt cgcccagctc gccgacgact atcgcgcgca ggagagcacg 1441261 cggcacggcg cctcgcgcca tggtgactag ccgccagagc catcggcagg gcctgcctga 1441321 gccaggtgca accgaaggag tcaacgaatg gtcagcacac atgcggttgt cgcgggggag 1441381 acgctgtcgg cgttggcgtt gcgcttctat ggcgacgcgg aactgtatcg gctgatcgcc 1441441 gccgccagcg ggatcgccga tcccgacgtc gtcaatgtgg ggcagcggct gattatgcct 1441501 gacttcacgc gatacaccgt tgttgccggg gacacgctgt cggcgttggc gttgcgcttc 1441561 tatggcgacg cggaattgaa ttggctgatc gccgccgcca gcgggatcgc cgatcccgac 1441621 gtcgtcaatg tggggcagcg gctgattatg cctgacttca cgcgatacac cgttgttgcc 1441681 ggggacacgc tgtcggcatt ggctgcgcgc ttctatggcg acgcctccct atatccgctt 1441741 atcgccgccg tcaatggcat cgccgatcct ggcgtcatcg acgtcgggca ggtactggtc 1441801 atattcatcg ggcgtagcga cgggttcggc ctaaggatcg tggaccgcaa cgagaacgat 1441861 ccccgcctgt ggtactaccg gttccagacc tccgcgatcg gctggaaccc cggagtcaac 1441921 gtcctgcttc ccgatgacta ccgcaccagc ggacgcacct atcccgtcct ctacctgttc 1441981 cacggcggcg gcaccgacca ggatttccgc acgttcgact ttctgggcat ccgcgacctg 1442041 accgccggaa agccgatcat catcgtgatg cccgacggcg ggcacgcggg ctggtattcc 1442101 aacccggtca gctcgttcgt cggcccacgg aactgggaga cattccacat cgcccagctg 1442161 ctcccctgga tcgaggcgaa cttccgaacc tacgccgaat acgacggccg cgcggtcgcc 1442221 gggttttcga tgggtggctt cggcgcgctg aagtacgcag caaagtacta cggccacttc 1442281 gcgtcggcga gcagccactc cggaccggca agtctgcgcc gcgacttcgg cctggtagtg 1442341 cattgggcaa acctgtcctc ggcggtgctg gatctaggcg gcggcacggt ttacggcgcg 1442401 ccgctctggg accaagctag ggtcagcgcc gacaacccgg tcgagcgtat cgacagctac 1442461 cgcaacaagc ggatcttcct ggtcgccggc accagtccgg acccggccaa ctggttcgac 1442521 agcgtgaacg agacccaggt gctagccggg cagagggagt tccgcgaacg cctcagcaac 1442581 gccggcatcc cgcatgaatc gcacgaggtg cctggcggtc acgtcttccg gcccgacatg 1442641 ttccgtctcg acctcgacgg catcgtcgcc cggctgcgcc ccgcgagcat cggggcggcc 1442701 gcagaacgcg ccgattagcc gcaccacgta taccccgcgg gcaggtggcc gctggccgat 1442761 agcctcatgt gtgtgagcgt gggcgagtca gttgcgcagt cgctgcaaca gtgggatcgc 1442821 aagctgtggg acgtggcgat gctccacgcg tgcaacgccg tcgacgagac cggcaggaag 1442881 cgctatccca cgctgggcgt cggcactcga ttccggacgg cgctacggga ttcactcgac 1442941 atttacggag tgatggccac gcctggcgtc gacctggaaa agactcgctt ccctgtcggg 1443001 gtgagatcgg acttgctgcc ggataagcgc cccgacatcg ccgacgtcct gtatggaatt 1443061 caccggtggt tgcacggtca tgctgacgaa tcctcggttg aattcgaagt aagcccgtac 1443121 gtgaacgcca gtgccgcact ccgcattgcc aatgacggca aaattcagct gccaaagtcc 1443181 gcaatactgg gtttgctggc cgttgccgtg tttgcgccgg agaacaaggg cgaggtcatt 1443241 cccccggact atcagctcag ctggtatgac cacgtgttct tcatcagtgt ttggtggggg 1443301 tggcaagacc atttccgcga aatcgtcaac gtcgaccggg catcgctggt cgccctcgac 1443361 ttcggcgacc tgtggaatgg ctggacgcca gttgggtaat cctggtcgct tgtcgccccg 1443421 ccgggctggg ttagattgcc cggctcctca acccgccgtt tcggcgtgca tcgtcgccgg 1443481 gctagccgtc tcggtcagcg gaccggatcg tcgacgccgc cgcctgcgcg gcggctacct 1443541 ggccgaacgt ggacggcggc ggcgctagag tcccggggcg ctcgacgacc tcggtcgccc 1443601 gcgccgcggc accgagaacc atggcccggt cggattcgtc cgcgaactcg cgctgtgctg 1443661 cccgcacgac cagggcaatt tgggtttgca ccgctacacg gcgcgacggg tcgacgcagt 1443721 tctgggcgac cgcgctgagc agctgcagca gcgcagtgag caccagcggc tcacgcgagc 1443781 cgtagcggcg gatctgggca catccgacgt gcaggtaggt ggcgaagctg gggtacggca 1443841 gccagaagag gagctccccg gcgcggtcgc ggcgcacgtc gtccggcagc gcccgcgatg 1443901 ccagcaccga ctccacggcc gaaagatggt gcacgacttg gatcgccgtg tacgggtcgt 1443961 tgagtgcggg cgatagtgcc cgcagcgcga tatccaccat ctgccgcaat ccgaagcgga 1444021 tgtcctgctg cagggtgcgc tcgaatccga tgtgcacatg acgtaagcag cgttgcggga 1444081 agtcagaccc tggcgcgccc ggcgcggtgc ccctgcgcca gcaccagccg agcaggcccc 1444141 cggcggtgac gtaatcgccg acgaaggtaa ccagcagcgc cgtataccgg ctggctgccg 1444201 ccaattcggc gatgtcgtcg acgtcgacgg tttgtaggta acccgagtgc ggggccaaca 1444261 gcggcaccgc atcagccggg gggctgggcg gtgtctctac ttgtcgatcc gccgtatccg 1444321 attccggata caactggtca accagcccca gcgtgcgcag ccgcaccttg tccatgatcg 1444381 tgtctatctg gatcgagtgc atgaggtggt gcaggaagta gatcagcgcg gcgatgctga 1444441 cgaatgccag cgcgagtgac ccggtgaccg cgactttggg aatgaacgcc ccgccgtcgc 1444501 ggtgctcccc gacggtgtgt agcccaccgg tgctgtaggc gaaggtgcag gcaaagatcg 1444561 ccagcaccac ctggttgggc acatcgcgca ggaaggttcg tagcaaccgc accgagaact 1444621 ggctggaggc gatctgtagg gacagcaccg tcagcgagaa gacgatgccg atggtggtga 1444681 tcatcgtggc cgacaccacg atcagcacgc ctcgggcgtc gcctggggtg ccctgaaaca 1444741 tcagcttgtc gatcagcgtg ccggatttca cgggaatcat cgacaggacc gctcccgacc 1444801 ccagaccgat cgcaacgccg aatgtcggca gcacccagac tgcgccctgt aagtaatcca 1444861 gtatggcttt gcgacggttg agcatgctgg ttgcggtcac cgaataagca tgcacccatc 1444921 cgcgagcact aggcggaact acgtaacact tcgatgcggc agtagaagca tttttccgct 1444981 ctcgcttcgc cgagcgtgca ctcatggcga gtttccggcc gttaacccca agtgatcgct 1445041 gcaacacttg gccagaggtg ttggcgctgc atgggttatc agaaggggtt tcggggtcgg 1445101 ggggatcggg tggccgatgg ggtgcagggg aagttctgga aggcgctcga atcggggtta 1445161 tcgccgacgg tgtgtcctgc tttcctacca aggccgactg caggcggatc cgtggcgtgc 1445221 cggtgttcga cggctatacg cggatggtcg cccggctgat gggatcgctc gccgtgttgc 1445281 ggtcggtgag cattccaaag ggctaccggg acttcggctt tggcagtcta cgtgcggtgg 1445341 cgccgaaaaa ctgcccggac gtgagtggct gaggcggccc aatttcggac taggatttct 1445401 ggccgctgga agtcactgat gacaccgtac gtcacccttg atcgacaagt gcggatgtgg 1445461 ggacccgtcc ggggtcccca catcgtggtg gtcgctgttt agctcgaggt cacgtactgc 1445521 gggcagtagg ccgacgcggc gtcaacggcg aacgtcttgg cgcccttggc gctcagaccg 1445581 gtcgccttgg ccaccgcctt gatgaccgct ttggccgagt gaccctcgtc gagggcgtcg 1445641 cagacggcgt gcgcgtcctt gatggcgcgc gctgcgctcg gcggagtgat cccgtccgcc 1445701 tgcagctgcg cgaggaacgc ttcgtcggtc gagcttgcgc tggcggtccc ggcgaagccg 1445761 agtgcggcca ggcccaaagt agcggcagtc aaggtggtgc caaccatgga ggcggcgaaa 1445821 cggcgagtga acattgatga tctccttgtg ctgatgtcat cggaggttgc gctggtttgc 1445881 gtgccctcag aatcagcacc gggccttgac agattctcaa taaatccttg gcaatatcga 1445941 taccggttcg acggtgtccc gacagtgcaa ggagaacggt ccgccatggc tgtgccggag 1446001 cgcgtcaggc gaatgagaca acacggaacg tgcactcggc gcaccgggtc gccagcaacg 1446061 cggcacgcgg ggcgccctgg ttcttacccc gacgaatttg agagcgagac cacgaagcca 1446121 actatgcggc cgccctcgcg ggtggcgccg atcacattgt tgtagccatg cgtgaggcta 1446181 gatcaaccct tgtgcccccg gcaggattcg aacctgcggc cttctgctcc ggaggcagac 1446241 gctctatccc ctgagctacg ggggcgcacg acgacacgtt gcgccatggg gccccgccag 1446301 agtagcgcat cgcggctacc cactgaccac cgcaacggat tcgaagccca accacctcag 1446361 cccataggat ggacgttcgt gacccccgct gacctggctg agctgctcaa agcgaccgcg 1446421 gccgcggtgc tggccgagcg cggcctcgat gcctccgcgt tgccgcagat ggtcacggtg 1446481 gaacgcccgc gcattcccga gcacggcgac tatgccagta acctggcgat gcagctcgcc 1446541 aagaaagtcg gcaccaaccc gcgtgagctg gccggatggc ttgccgaggc actgacaaag 1446601 gtcgacggta tcgcctcggc ggaggtggcc gggccgggct ttatcaacat gcggctggaa 1446661 accgccgccc aggctaaagt cgttaccagc gttatcgacg ccggccacag ctacggtcac 1446721 tcgctgctgc tggccgggcg caaggtcaac ctggaattcg tctccgccaa ccccaccgga 1446781 ccgatccaca tcggcggtac ccgttgggcc gcggtcggtg acgcgctggg ccgtttgctc 1446841 accacccagg gcgccgacgt ggtccgcgaa tactatttca acgaccacgg cgcccagatc 1446901 gaccgattcg ccaactccct gatcgccgcg gccaagggcg aacccacgcc ccaagacggc 1446961 tacgcgggca gctacatcac caacatcgcc gagcaggtgc tgcagaaggc gcctgacgcg 1447021 ctgagtctgc cagacgcaga gttgcgcgag accttccgcg caatcggcgt cgacttgatg 1447081 ttcgaccaca tcaaacagtc tctgcacgag ttcggtaccg acttcgacgt ctacacccac 1447141 gaagactcga tgcacaccgg cggccgggtc gagaacgcca tcgcccgact ccgcgaaacc 1447201 ggcaacatct acgagaagga cggcgcaacc tggttgcgca ccagcgcatt tggtgacgac 1447261 aaggaccgcg tcgtgatcaa gagcgacggc aaaccggcat atatcgccgg tgatctcgcc 1447321 tactacttgg acaaacgcca acgcggtttt gacttgtgca tctacatgct cggcgccgac 1447381 catcacggct acatcgcccg gctaaaggcc gcggccgccg ccttcggtga cgacccggcc 1447441 accgtcgagg tgctcattgg gcagatggtg aacctggtcc gcgacggcca accggtccgg 1447501 atgagcaaac gtgcaggcac cgtgctcacc ctcgacgacc tggtcgaggc gatcggcgtg 1447561 gacgccgcac gttacagcct gatccgctcc tcggtggaca ccgcgatcga catcgacctg 1447621 gcgctatggt cctcggcgtc gaacgaaaac ccggtctatt acgtgcaata cgcgcatgcc 1447681 cggctctcag cgctggctcg caacgccgcc gaactcgccc tgatcccgga tacaaaccac 1447741 ctcgaactgc ttaaccacga caaggagggc acgctgctgc gcaccctcgg cgaattcccg 1447801 agggtgctcg agaccgcggc ctccctgcgg gaaccgcacc gggtctgccg ctacctggaa 1447861 gacctggccg gcgactatca ccggttctac gactcgtgcc gagtgttgcc gcaaggcgac 1447921 gagcagccca ccgacctgca caccgcgcgc ctagcgttgt gccaggccac ccgtcaggtc 1447981 atcgccaacg ggctggcgat catcggcgtc accgcaccgg agcgaatgtg aacgagctgc 1448041 tgcacttagc gccgaatgtg tggccgcgca atactactcg cgatgaagtc ggtgtggtct 1448101 gcatcgcagg aattccactg acgcagctcg cccaggagta cgggaccccg ctgttcgtca 1448161 tcgacgagga cgactttcgc tcgcgctgcc gagaaaccgc cgcggccttt ggaagtgggg 1448221 cgaacgtgca ctatgccgcc aaggcgttcc tgtgcagcga agtagcccgg tggatcagcg 1448281 aagaagggct ctgtctggac gtttgcaccg gtggggagtt ggcggtcgcg ctgcacgcta 1448341 gctttccgcc cgagcgaatt accttgcacg gcaacaacaa atcggtctca gagttgaccg 1448401 ctgcggtcaa agccggagtc ggccatattg tcgtcgattc gatgaccgag atcgagcgcc 1448461 tcgacgccat cgcgggcgag gccggaatcg tccaggatgt cctggtgcgt ctcaccgtcg 1448521 gtgtcgaggc gcacacccac gagttcatct ccaccgcgca cgaggaccag aaattcgggt 1448581 tatcggtggc cagcggcgcg gccatggcag cggtgcggcg cgttttcgcc actgatcacc 1448641 tgcgcctggt tgggctacac agccacatcg gttcgcagat cttcgacgtg gacggcttcg 1448701 aactcgccgc gcaccgtgtc atcggcctgc tacgcgacgt cgtcggcgag ttcggtcccg 1448761 aaaagacggc acagatcgcg accgtcgatc tcggtggcgg cttgggcatc tcgtatttgc 1448821 cgtccgacga cccaccgccg atagccgagc tcgcggccaa gctgggtacc atcgtgagcg 1448881 acgagtcaac ggccgtgggg ctgccgacgc ccaagctcgt tgtggagccc ggacgcgcca 1448941 tcgccggacc gggcaccatc acgttgtatg aggtcggcac cgttaaggac gtcgatgtca 1449001 gcgccacagc gcatcgacgt tacgtcagtg tcgacggcgg catgagcgac aacatccgca 1449061 ccgcgctcta cggcgcgcag tatgacgtcc ggctggtgtc tcgagtcagc gacgccccgc 1449121 cggtaccggc ccgtctggtc ggaaagcact gcgaaagtgg cgatatcatc gtgcgggaca 1449181 cctgggtgcc cgacgatatt cggcccggcg atctggttgc ggttgccgcc accggcgctt 1449241 actgctattc gctgtcgagt cgttacaaca tggtcggccg tcccgctgtg gtagcggtgc 1449301 acgcgggcaa cgctcgcctg gtcctgcgtc gggagacggt cgacgatttg ctgagtttgg 1449361 aagtgaggtg acccgtgccc ggtgacgaaa agccggtcgg cgtagcggta ctcggtttgg 1449421 gcaacgtcgg cagcgaggtt gtccgcatca tcgagaacag cgccgaggat ctcgcggctc 1449481 gtgtcggtgc cccattggtc ctgcggggca tcggcgtgcg ccgcgtgacg accgatcgcg 1449541 gcgtgccgat cgaattgttg accgacgaca ttgaagagct cgtggcccgc gaggatgtcg 1449601 atatcgtggt ggaagtgatg gggccggtgg aaccgtcgcg caaggcgatc ctgggcgccc 1449661 ttgagcgcgg caagtccgtc gttacggcga acaaggcttt actcgccacc tccaccggcg 1449721 aattggcaca ggccgccgaa agcgcccatg ttgatctgta tttcgaggcg gccgtggcgg 1449781 gcgccattcc ggtcatccgt ccgctcaccc agtcgctggc cggcgacacg gtgctgcgag 1449841 tggccgggat cgtcaacggc accaccaact acatcctctc ggcgatggac agcaccggcg 1449901 ctgactatgc cagcgccctg gccgacgcaa gtgcgctggg ctatgcggag gctgatccca 1449961 ccgcagacgt cgaaggctac gacgccgcgg ccaaggcagc gatcctggca tccattgcct 1450021 tccacacccg ggtgaccgca gacgacgtgt atcgcgaagg catcaccaag gtcactccgg 1450081 ccgacttcgg atccgcgcac gcgctgggtt gcaccatcaa actgctgtcg atctgtgagc 1450141 gcataaccac cgacgaaggt tcgcagcggg tatcggcccg cgtctatccg gccctggtac 1450201 ctctgtcgca tccgcttgcc gcggtcaacg gcgcgttcaa tgccgtggtg gtcgaggccg 1450261 aggccgcggg ccggctgatg ttctacggcc agggcgcggg cggcgcgccg accgcctctg 1450321 cggtgaccgg tgacctagtg atggccgccc gcaaccgggt actcggcagc cgcggccccc 1450381 gtgagtctaa atacgctcaa cttccggtgg caccaatggg tttcattgaa acgcgctatt 1450441 acgtcagcat gaacgtcgcc gacaagccgg gcgtcttgtc cgcggtggcg gcggaattcg 1450501 ccaaacgcga ggtgagcatc gccgaggtgc gccaggaggg cgttgtggac gaaggtggtc 1450561 gacgggtggg agcccgaatc gtggtggtca cgcacctcgc cactgacgcc gcactctcgg 1450621 aaaccgttga tgcactggac gacttggatg tcgtgcaggg tgtgtccagc gtgatacgac 1450681 tggaaggaac cggcttatga ccgtcccgcc gacggccact caccagccgt ggccgggagt 1450741 gattgccgcg taccgtgacc ggctgccggt gggtgacgac tggactccgg tgaccctgct 1450801 cgagggtggt actcccctca tcgcggcaac taatctctcc aagcagacgg gctgcacgat 1450861 ccacctcaaa gtggagggcc tcaaccccac cggctccttc aaggatcgtg gcatgacgat 1450921 ggcggtcacc gatgcccttg cccatggtca gcgggcggtc ttgtgcgcat cgaccggaaa 1450981 tacctcggcg tcggcggcgg cctatgccgc ccgggccggc atcacctgcg cggtgctgat 1451041 accgcagggc aagatcgcga tgggcaagct cgcacaggcg gtcatgcacg gcgccaagat 1451101 catccagatc gacggtaact tcgacgactg cctggaactg gcgcgcaaga tggccgcgga 1451161 cttcccgacg atttcgttgg tcaactcggt aaacccggtg cgcatcgagg gccagaaaac 1451221 ggcagcgttc gagatcgtcg acgtgctagg taccgcgccg gacgtgcatg ctctgccggt 1451281 tggcaacgcc ggcaacatca ccgcgtactg gaagggctac accgagtatc accagctggg 1451341 cctgatcgac aagttgcccc gcatgctggg cactcaggcc gcgggcgcgg cgcccctggt 1451401 gctcggcgaa ccggtgagcc acccggagac catcgcaacc gcgatccgca tcggctcgcc 1451461 ggcgtcgtgg acttcggccg tcgaggcaca gcagcagtcc aagggccgct tcttggccgc 1451521 ctccgacgag gagatactgg ccgcatatca cctggtggct cgtgtcgaag gcgtattcgt 1451581 ggagcccgcg tccgcagcca gcattgcggg tctcctcaaa gcgatcgacg acggctgggt 1451641 ggcgcgtggt tcgacggtgg tgtgcacggt aaccggcaac ggtcttaagg atcccgacac 1451701 cgcgctcaaa gacatgccga gcgtgtctcc ggttcccgtg gacccggtag ccgtcgtcga 1451761 gaagctaggg ctggcctagt ggcgatcgca agcgcggcgg agccgggtgc ggcgggtcgg 1451821 cacggtttgg attgggtggc gatcgcaagc gcggcggagc cgggtgcggc gggtcggcac 1451881 ggtttggatt gggtggcgat cgcaagcgcg gcggagccgg gtgcggcggg tcggcacggt 1451941 ttggattggg tggcgatcgc aagcgcggcg gagccgggtg cggcgggtcg gcacgcatgg 1452001 tgactcaagc attgttgcct tctgggctgg tggccagtgc ggtggtggcg gcgtccagtg 1452061 caaacctggg cccgggcttc gacagtgtcg gtttggcgct gagtctctac gacgagatca 1452121 tcgtcgagac aacagattcc ggcttgacgg tgactgtaga cggcgagggc ggcgaccagg 1452181 tgccgctggg ccccgagcac ctcgtggtcc gcgccgtgca gcacgggtta caggcagcgg 1452241 gggtcagcgc cgccggcctg gcggtgcgct gccgcaacgc catcccgcac tcccgcggcc 1452301 tcggctcctc cgcggcagca gttgtgggcg gtcttgcggc cgttaacggt cttgtcgtac 1452361 aaacggattc gtcaccatcg agcgatgctg agctgattca gttggcttcg gagttcgagg 1452421 gtcatcccga caacgcggcg gccgcggttt tgggtggtgc cgtggtttcg tggactgacc 1452481 acagtggtga ccggcccaac tattcggccg tatcactgcg gcttcatccc gatatccgcc 1452541 tgttcactgc gattcccgag cagcgttcgt cgaccgcgga aacgcgggtg ctattgcccg 1452601 cgcaggttag tcacgacgac gcacggttca atgtcagtcg cgcggcgctg ctggtggttg 1452661 cgctcaccga acggcccgat ctgctgatgg cggccaccga agatctgctt catcagccgc 1452721 aacgtgccgc ggcaatgaca gcctccgcgg aatatcttcg gctgttgcgg cgtcataacg 1452781 tggcagcagc actgtccggg gcaggtcctt cgttgatcgc cctgagtaca gattcagagt 1452841 tgccgaccga cgccgtggag ttcggagccg caaagggatt tgccgttacc gagctgactg 1452901 ttggcgaggc ggttcgctgg agcccgacag taagagttcc cggttaatcc gcaaggttgc 1452961 gggggtttgc ttgcttccgg ccaggaagcg ggctatcctc ggagccgtcc agcaatcgca 1453021 gcatctgcat acgtactgcc ttgccgctag gacagccacc aattcttctt gtggacgagg 1453081 ttcgccgtat tcgccgctga tggcgatcac cgttgcaaag tcgatgattg gcgcactcgg 1453141 cgatttggct gactgcaaca aaaccccgta tgacgtgatc agcgggggaa ggaaaggaaa 1453201 tccgtgaccg atacggacct cattacggct ggcgaaagta ccgacggcaa gccgtcggat 1453261 gccgctgcca cagatccccc agacctcaac gccgacgagc cggccggctc gctggccacc 1453321 atggtgctgc ccgaactgcg tgcgctggct aatcgagccg gcgtgaaggg aacatcgggt 1453381 atgcggaaga acgaactgat cgctgcgatt gaggagatca ggcgacaggc caacggcgcc 1453441 ccagccgttg accggtcggc tcaagagcac gacaagggcg accggccgcc cagttccgag 1453501 gcaccggcca cccaggggga acagaccccg accgaacaga tcgattccca aagccaacag 1453561 gtccgcccgg agcggcgcag cgccacccgt gaagcgggac cctccggttc cggtgagcgt 1453621 gcgggcacag ccgcagacga caccgacaac cgccaaggcg gtcaacagga cgccaagacc 1453681 gaggagcgtg gcaccgacgc gggtggcgac caagggggtg accagcaggc ttcgggcggt 1453741 cagcaggcgc gcggcgacga ggacggagaa gcgcgtcagg gccggcgcgg acgccggttc 1453801 cgcgatcggc ggcgccgcgg tgaacgatcc ggcgacggcg ccgaggctga actgcgtgag 1453861 gacgacgtcg tccagccggt agccggcata ctcgacgtcc tggacaacta cgcgtttgtg 1453921 cgcacctccg gctacctacc cggtccgcac gacgtgtatg tgtcgatgaa catggtgcgc 1453981 aagaacggca tgcgccgtgg tgatgcggtg accggtgcgg tgcgggtgcc caaggaaggg 1454041 gagcaaccca accagcggca gaagttcaac ccgctggtcc gcctggacag catcaacggc 1454101 ggatcggtcg aagacgccaa gaagcggccc gagttcggca aactgacgcc gttgtacccc 1454161 aaccagcggc ttcgtctgga aaccagtacc gagcggctga ccacccgggt catcgacctc 1454221 atcatgccga tcggcaaggg tcaacgcgcg ttgattgtgt cgccgcccaa agcgggcaag 1454281 acaacgatcc tgcaggacat cgccaacgcg atcaccagga acaacccgga atgccacctc 1454341 atggtcgtgc tcgtcgacga gcggcctgag gaggtcaccg atatgcagcg ctcggtcaaa 1454401 ggcgaggtca tcgcttcaac tttcgaccgg ccgccgtcgg accacacgtc ggtcgccgag 1454461 ctggcgatcg aacgcgccaa gcggctggtg gagcaaggca aggacgtcgt ggtgctgctc 1454521 gattcaatca cccggctagg ccgcgcttac aacaacgcgt cgccggcgtc gggccggatc 1454581 ctgtccggtg gtgtcgattc cacggcgttg tacccgccca agcgcttcct gggggccgcg 1454641 cgcaacatcg aagagggcgg gtcgctgacc atcatcgcca ctgcgatggt cgagaccggg 1454701 tccactggtg acacggtcat tttcgaggag ttcaagggca ccggcaacgc cgagctcaag 1454761 ctggaccgca agatcgccga gcggcgggtt ttccctgcgg tcgacgtgaa cccttctgga 1454821 acccgcaagg acgagctact gctgtcgccc gacgagttcg ctattgtgca caagctgcgc 1454881 cgcgtgctat cgggcctgga ttcccaccag gccatcgacc tgctgatgtc gcagctgcgt 1454941 aagacgaaga acaactacga attccttgtt caggtgtcca agaccacgcc agggtccatg 1455001 gacagcgact gatccggcga gacggctcgc cgggaatgtc cgcacgcatc tcggtgtttg 1455061 gggtgatagc ggttgacctg gcataatcga tgctcaacga gttggaaccg gaccaggttc 1455121 tcggcacgcc acgacgggcg gccaccgatc acagagggca gcatgaaatc tgacattcat 1455181 ccggcatatg aggagaccac cgtggtctgc ggatgcggca ataccttcca gacgcgtagc 1455241 accaagccgg gaggtcgtat tgtggttgag gtttgttcgc agtgtcatcc gttctacacc 1455301 ggcaagcaga agatcctcga cagcggcggc cgggtggctc gcttcgagaa gcggtacggc 1455361 aagcgcaagg tcggagctga caaggcggtt tcaaccggca aatagctggc ttaccgacgc 1455421 ccgaactgtg caccagcggt acaggacggg cgtcggttcg cgttagggtc cgcgctcgcg 1455481 ggaagaaggt tgacatgacg cagccagtgc agacgattga cgtgttgctc gccgaacacg 1455541 ccgagctcga gcttgcgctg gcagatcccg cgctgcacag caatccggcc gaggcgcgca 1455601 gagtcgggcg ccggtttgcc cgattggccc cgatcgtcgc aacccaccgc aagctgacgt 1455661 ccgcgcgcga cgacctcgag accgcgcgcg agctggtggc ttccgacgag tcgttcgccg 1455721 ccgaggttgc cgcattggag gctcgggtgg gcgaactgga tgcccaactc actgacatgt 1455781 tggcaccgcg tgacccgcac gatgccgatg acattgtgct ggaagtcaaa tccggcgagg 1455841 ggggcgaaga atccgcgttg ttcgccgccg atttggccag gatgtatatc cgctacgccg 1455901 agcggcacgg ctgggcggtg acggtgttgg acgagaccac ctcggatctg ggtgggtaca 1455961 aggacgcgac gttggcgatt gccagcaaag ccgacacccc cgacggggtg tggtcgcgca 1456021 tgaagttcga gggcggggtg caccgcgtac aacgggtccc agtgacggaa tcccaaggcc 1456081 gcgtgcatac ttcggcggcg ggtgtgctgg tctatccgga gcccgaggaa gtcggccaag 1456141 tgcagatcga cgagtcggat ctgcgtatcg acgttttccg gtcgtccggc aagggcgggc 1456201 agggagtgaa taccaccgac tccgcggtgc gtatcaccca tctgcccact ggaatcgtcg 1456261 tcacctgtca gaacgaacgg tcgcagctgc agaacaagac gcgtgcgttg caggtgctgg 1456321 ccgctcggtt gcaggcaatg gccgaggagc aggcgctggc cgacgcgtcg gccgaccggg 1456381 ctagccaaat ccgcactgtg gaccgtagtg aacgcattcg cacctacaac ttcccggaga 1456441 accggatcac cgaccaccgg atcggttaca agtcacacaa tctcgatcag gtgctggatg 1456501 gcgatcttga cgcgttgttc gacgctctgt ccgccgcgga caagcaatcc cggttgcgac 1456561 aatcatgacc tccgcgccgg cgacgatgcg gtgggggaac ctcccgcttg cgggggagag 1456621 cggcacaatg accctgcgtc aggcgatcga cttggctgct gcgctattgg ccgaagcggg 1456681 ggtcgactcg gcgcgttgcg acgctgagca gttggccgct cacctagcgg gcacagaccg 1456741 cggtaggcta cccctgttcg agccgcccgg cgacgagttc ttcgggcgct atcgcgacat 1456801 cgtcaccgct cgtgcgcggc gggtgccgtt gcagcatctc atcgggactg tgtcgtttgg 1456861 gcccgtggtg ctgcatgtcg gcccgggtgt gtttgtaccg cgtccggaga ccgaagccat 1456921 tttggcctgg gccaccgcgc agtcgctgcc ggcgcggccg ctgattgtcg acgcatgcac 1456981 gggatctggc gcgttggcgg tcgcattggc ccagcaccgg gccaaccttg gactaaaggc 1457041 ccgcatcatc ggcattgacg actccgactg cgcccttgac tatgcccgcc gcaatgcggc 1457101 gggtaccccg gtagagttgg tgcgtgccga cgtcaccacg ccccgcctgc tccccgaact 1457161 cgacggacaa gtcgacctga tggtttccaa cccgccctac atccctgatg ctgctgtttt 1457221 ggaacctgaa gtagcgcaac atgacccgca tcacgcgttg ttcggcggtc ccgacgggat 1457281 gacggtgata tccgcggtcg tcgggcttgc tgggcgctgg ctgcgtcccg gtggcctgtt 1457341 cgccgtcgaa cacgacgaca ccacgtcgtc gtcaactgtc gatttggtca gcagcacaaa 1457401 acttttcgtg gacgtacaag cccggaaaga tctggccgga cggccgaggt ttgtgacggc 1457461 gatgaggtgg gggcacctcc cgcttgcagg ggagaacggc gccattgacc cgcgccagcg 1457521 acgatgcaga gcgaagcgat gaggagaagc ggcgccattg actgagacgt tcgactgcgc 1457581 cgaccccgag cagcgttcgc gtggaatcgt ctctgcggta ggggcaatca aggcgggcca 1457641 actggtggtg atgcctacgg acacggtgta tgggatcggc gccgacgcct tcgacagctc 1457701 cgcggtggcc gcgttgctgt cggcaaaggg gcggggtcgc gatatgccgg taggtgtgct 1457761 ggtcggctct tggcacacga tcgaggggct ggtctactct atgcccgacg gtgcccgcga 1457821 actgattcgc gcattctggc ccggcgcgct cagcctggtg gtcgtgcaag cgccgtcgct 1457881 gcaatgggat cttggcgatg cccatggcac cgtgatgctg cgaatgccgc tgcacccggt 1457941 cgccatcgag ttgttgcgtg aggtgggtcc gatggcggta tccagcgcca acatctcggg 1458001 ccacccaccc ccggtcgacg ccgaacaggc acgctctcaa ctcggcgacc acgtcgcggt 1458061 ctatctcgac gcgggtccat ccgaacagca ggccggctcc acgatcgtcg atctgaccgg 1458121 agccacccca cgcgtcctgc ggccggggcc ggtcagcacc gagcggatcg ccgaggtact 1458181 tggtgtggac gcggccagct tgttcggcta gccgccgaac gtgcacgcac tgcgaagatt 1458241 cggccaattg ttcgcagctg ttgcacgttc ggcgagtgtt cagctctcag gttggtgcag 1458301 tacggtctcg aggtgtccag cgatgtggcc ggcgttgccg gtggcttgct cgccctgtcc 1458361 tatcgcggcg ccggtgtccc gctgcgtgag cttgcgctgg tcgggctgac cgcggcgatc 1458421 atcacctatt ttgcgaccgg tccggtgcgg atgctggcca gtcgcctggg agccgtcgcc 1458481 tacccgcggg agcgagatgt gcacgtcacg cctacccctc ggatgggtgg gttggcgatg 1458541 ttcctgggca ttgtcggcgc cgtctttctt gcctcccagc ttccggcact cacccggggg 1458601 ttcgtctatt ccaccggcat gcccgcggtg ctggtggccg gtgcggtgat catgggcatc 1458661 ggcctgatcg atgatcgttg gggtctggat gcactgacga agttcgccgg ccagatcacg 1458721 gcggcgagcg ttctggtcac catgggtgtc gcctggagtg tcctgtacat cccggtgggt 1458781 ggtgtgggca ccatcgtctt ggaccaggct tcctcgatcc tgcttaccct ggcgctgacc 1458841 gtttcgatcg tcaacgcgat gaactttgtc gacggtctcg acgggctggc cgccggcctg 1458901 ggcctgataa cggcgctggc aatctgcatg ttctcggtgg gtttgcttcg tgaccacggt 1458961 ggtgacgttt tgtactaccc gccggcggtg atttcggtgg tcctggccgg ggcctgcctg 1459021 ggctttctgc cacacaactt ccaccgggcc aagatcttca tgggcgattc cgggtcgatg 1459081 ctgatcggcc tgatgctggc cgccgcttcc accaccgcgg ccgggccgat ctcgcagaac 1459141 gcctacggcg ctcgtgatgt atttgctttg ctgtcgccgt tcctgctggt ggtggcggtc 1459201 atgtttgtgc caatgctcga cctgctgcta gcgatcgtcc gtcgcacccg cgccggccgc 1459261 agcgcgttta gcccggacaa aatgcacctg catcaccggc tgctgcagat cggtcattcc 1459321 catcggcgcg tggtcctgat catctacctg tgggtgggca tcgttgcctt cggcgccgcg 1459381 agctcgatct tctttaaccc gcgcgacacc gcggcggtga tgctgggcgc gatcgtggtc 1459441 gccggcgtcg cgacactgat ccccctgttg cgccgcggcg acgactacta cgacccggac 1459501 ctggactagc ccggagccga gaactacgac aaggagtagt agtggtgtct accttgtggt 1459561 acggtgcggc tagaaccccg aaggagacct cgcgggttgc cggcccccgg cccatcggat 1459621 gcgtatccgg tcgcgccgat tcacgaccga catagggagc taccccttgg gtgattccgg 1459681 tgcgacgact gcgatacgct cggcgggcca ccgatcagtc gatcgggtgg tttccgctcc 1459741 atcagcccgg aattgaggtg ccgcagtgac gacaccagcg caggacgcgc cgttggtgtt 1459801 tccctctgtt gctttccgtc cggttcgcct ttttttcatc aacgttggac tggccgcagt 1459861 ggcgatgttg gtcgccggcg tgttcggtca cctgacggtc gggatgttct tgggtctcgg 1459921 gttgctgctg ggtttgctca atgccctgct ggtgcggcgt tcggccgagt cgatcaccgc 1459981 caaagagcac ccgttaaaac ggtcgatggc cctcaactcg gcatcgcgac tggcgattat 1460041 caccatcctc gggctgatca tcgcctacat tttccggccc gctggattgg gcgtcgtgtt 1460101 cgggctggca ttcttccagg tgctgctggt ggcaacgacg gccctgccgg tcctgaagaa 1460161 gctgcgcact gcgaccgagg aaccggtcgc aacttattct tccaatggcc agaccggggg 1460221 atcggaagga aggagcgcca gcgatgactg agaccatcct ggccgcccaa atcgaggtcg 1460281 gcgagcacca cacggccacc tggctcggta tgacggtcaa caccgacacc gtgttgtcga 1460341 cggcgatcgc cgggttgatc gtgatcgcgt tggcctttta cctgcgcgcc aaagtgactt 1460401 cgacggatgt gccaggcggg gtgcagttgt tttttgaggc gatcaccatt cagatgcgca 1460461 atcaggtcga aagcgccatc gggatgcgga tcgcaccctt cgtgctgccg ctggcggtga 1460521 ccatcttcgt gttcatcctg atctccaact ggctggcagt cctcccggtg cagtacaccg 1460581 ataaacacgg gcacaccacc gagttgctca aatcggcagc agcggacatc aattacgtgc 1460641 tggcgctggc gcttttcgtg ttcgtctgct accacacggc cggtatttgg cggcgcggta 1460701 ttgtcggaca cccgatcaag ttgctgaaag ggcacgtgac gctcctcgcg ccgatcaacc 1460761 ttgtcgaaga agtcgccaag ccaatctcgt tgtcgctccg acttttcggc aacattttcg 1460821 ccggcggcat tctggtcgca ctgatcgcgc tctttccccc ctacatcatg tgggcgccca 1460881 atgcgatctg gaaagcattt gacctgttcg tcggcgcaat ccaggccttc atttttgcgc 1460941 tgctgacaat tttgtacttc agccaagcga tggagctcga agaggaacac cactagtacc 1461001 ggatgctggt aacggctacc agagccatca aggaggataa ggaaatggac cccactatcg 1461061 ctgccggcgc cctcatcggc ggtggactga tcatggccgg tggcgccatc ggcgccggta 1461121 tcggtgacgg tgtcgccggt aacgcgctta tctccggtgt cgcccggcaa cccgaggcgc 1461181 aagggcggct gttcacaccg ttcttcatca ccgtcggttt ggttgaggcg gcatacttca 1461241 tcaacctggc gtttatggcg ctgttcgtct tcgctacacc cgtcaagtaa ttcgacggca 1461301 aatggttgca ataggtagca atgggtgaag tgagcgcgat tgtcctggcc gccagtcagg 1461361 cggcagagga aggcggcgag tccagcaact tcctcattcc caacggcacg tttttcgttg 1461421 tgctggccat cttcctggtg gtgctcgctg tcattggcac tttcgtggtg ccgccgatct 1461481 tgaaggtctt gcgggaacgt gacgctatgg tcgccaaaac gctggccgac aacaagaagt 1461541 cggacgagca gttcgccgcc gcacaggccg attacgacga agccatgacg gaagcccgag 1461601 tccaggcgtc gtccttgcgc gacaatgccc gggcagatgg ccgtaaagtc atcgaggacg 1461661 cacgcgtccg ggccgaacaa caggtggcat cgacgttgca gaccgcccat gagcaattga 1461721 agcgggagag ggacgccgtg gaactcgatc tgcgtgccca cgtgggcacc atgtcggcga 1461781 ctctggccag tcgaattctc ggtgttgacc tcaccgcttc agccgcgacg aggtaaccac 1461841 gaatgtcgac gtttatcgga cagctgttcg ggttcgcggt catcgtttat ctggtgtggc 1461901 gatttatcgt gccgctcgta gggcgtttga tgtccgcacg gcaggacacg gtgcgccaac 1461961 agctggcgga tgcggcggcg gccgccgacc ggctggcgga ggcgagtcaa gctcacacga 1462021 aggcgctgga agacgccaag tcggaagcgc accgtgttgt ggaagaggcc aggacagatg 1462081 ccgaacgcat cgcagaacaa ctagaggccc aggccgacgt cgaggcggag cgcatcaaaa 1462141 tgcagggtgc ccgtcaggtc gacctcatcc gggcacagct gacccgtcag cttcgcctcg 1462201 agctcggtca cgaatcggtg cgccaggcaa gggaattggt acgcaatcac gtggccgatc 1462261 aggcacaaca atcggccacc gtcgaccgct tcctggatca gctcgatgcg atggcgccgg 1462321 ctacggccga tgtcgattac ccactgctgg ccaagatgcg ctcagccagc cggagggcat 1462381 taaccagcct ggtggattgg ttcggcacca tggcccagga cctcgaccat caaggtctga 1462441 ccaccctcgc cggcgagctg gtgtcggtag caagactgct ggaccgcgag gccgtcgtca 1462501 cccgctatct caccgtgcca gccgaagatg cgacgcccag gatccggctg atcgaacggc 1462561 tggtgtccgg caaggtcggc gcgccaacgc tcgaggtgtt gcgcacagcc gtatcgaagc 1462621 gctggtcggc caattccgat ttgatcgatg cgatcgaaca cgtgtcgcgg caggcgctgt 1462681 tagaactcgc cgaacgtgcg ggtcaggtcg acgaggtgga agaccagtta ttccggtttt 1462741 cccgcattct cgacgtgcag ccccggcttg ccatcctgtt gggtgactgt gccgttccgg 1462801 ccgaaggccg agtccggttg ctgcgcaagg tgcttgagcg tgccgacagt accgtcaacc 1462861 cggtcgtggt cgcgctgttg tctcacaccg tcgagctgct gcggggtcag gcagttgagg 1462921 aagcggtgct gttcctggcc gaagttgcgg tggctcgccg cggcgaaatc gtcgcgcagg 1462981 tcggcgcggc ggccgagctc agcgatgctc agcgcactcg cctcaccgaa gtgctgagcc 1463041 gtatctacgg tcaccccgtg accgtgcagc tgcatatcga cgccgcgctg ctgggcggat 1463101 tgtccatcgc ggtcggtgac gaagtgatcg acggtacgct ctcgtctcgt ctagctgcgg 1463161 ccgaggcacg actgcccgac tgaacccgaa ctagtcagca caaaccgaag taggaagacg 1463221 aaaagctatg gctgagttga caatccccgc tgatgacatc cagagcgcaa tcgaagagta 1463281 cgtaagctct ttcaccgccg acaccagtag agaggaagtc ggtaccgtcg tcgatgccgg 1463341 ggacggcatc gcacacgtcg agggtttgcc atcggtgatg acccaagagc tgctcgaatt 1463401 cccgggcgga atcctcggcg tcgccctcaa cctcgacgag cacagcgtcg gcgcggtgat 1463461 cctcggtgac ttcgagaaca tcgaagaagg tcagcaggtc aagcgcaccg gcgaagtctt 1463521 atcggttccg gttggcgacg ggtttttggg gcgggtggtt aacccgctcg gccagccgat 1463581 cgacgggcgc ggagacgtcg actccgatac tcggcgcgcg ctggagctcc aggcgccctc 1463641 ggtggtgcac cggcaaggcg tgaaggagcc gttgcagacc gggatcaagg cgattgacgc 1463701 gatgaccccg atcggccgcg gccagcgcca gctgatcatc ggcgaccgca agaccggcaa 1463761 aaccgccgtc tgcgtcgaca ccatcctcaa ccagcggcag aactgggagt ccggtgatcc 1463821 caagaagcag gtgcgctgtg tatacgtggc catcgggcag aagggaacta ccatcgccgc 1463881 ggtacgccgc acactggaag agggcggtgc gatggactac accaccatcg tcgcggccgc 1463941 ggcgtcggag tccgccggtt tcaaatggct tgcgccgtac accggttcgg cgatcgccca 1464001 gcactggatg tacgagggca agcatgtgct gatcatcttc gacgacctga ctaagcaggc 1464061 cgaggcatac cgggcgatct cgctgctgct gcgccgtccg cccggccgtg aggcctaccc 1464121 cggcgatgtg ttctatctgc attcgcggct tttggagcgc tgcgccaaac tgtccgacga 1464181 tctcggtggc ggctcgctaa cgggtctgcc gatcatcgag accaaggcca acgacatctc 1464241 ggcctacatc ccgaccaacg tcatctcgat caccgacggg caatgtttcc tggaaaccga 1464301 cctgttcaac cagggcgtcc ggccggccat caacgtcggt gtgtcggtgt cccgagtcgg 1464361 cggcgcggcg cagatcaagg ctatgaaaga ggtcgccgga agcctccgct tggacctttc 1464421 gcaataccgc gagctagaag ctttcgccgc tttcgcttct gatttggacg ccgcatcgaa 1464481 ggcgcagttg gagcgcggcg cccggctggt cgagctgctc aagcagccgc aatcccagcc 1464541 catgcccgtt gaggagcaag tggtttcgat cttcctgggc accggcggtc acctggactc 1464601 ggtgcccgtc gaggacgtcc ggcggttcga aaccgaatta ctggaccaca tgcgggcctc 1464661 cgaagaagag attttgactg agatccggga cagccaaaag ctcaccgagg aggccgccga 1464721 caagctcacc gaggtcatca agaacttcaa gaagggcttc gcggccaccg gtggcggctc 1464781 tgtggtgccc gacgaacatg tcgaggccct cgacgaggat aagctcgcca aggaagccgt 1464841 gaaggtcaaa aagccggcgc cgaagaagaa gaaatagcta accatggctg ccacacttcg 1464901 cgaactacgc gggcggatcc gctcggcagg gtcgatcaaa aagatcacca aggcccagga 1464961 gctgattgcg acatcgcgca tcgccagggc gcaggctcgg ctcgagtccg ctcggcccta 1465021 cgcttttgag atcacccgga tgcttaccac cctggccgct gaagccgcac tggaccatcc 1465081 gttgctcgtc gagcgcccgg agccgaaacg agccggcgtg ctggtggtgt cgtccgatcg 1465141 tggtttgtgc ggcgcataca acgccaatat tttccgtcgc tccgaggagc tgttctccct 1465201 gctgagggag gccggaaagc agccggtgct gtatgtggtg ggccgtaagg cgcagaacta 1465261 ctacagtttt cggaactgga acatcaccga gtcgtggatg ggtttctccg agcaacccac 1465321 gtacgagaac gccgccgaga tcgcttcgac cttagtggat gcgttcctgc tcggcaccga 1465381 caacggcgag gatcaacggt ccgacagcgg cgagggcgtc gacgaactgc acatcgttta 1465441 caccgagttc aagtcgatgc tgtcgcaatc ggcggaggct caccggatcg cccccatggt 1465501 ggtggagtac gtcgaggaag acatcggacc gcgcacgctg tactcgttcg agcccgacgc 1465561 gacgatgctg ttcgagtcat tgttgccgcg ctacctgact acccgggtgt acgcggcgct 1465621 gctggagtcc gcggcgtcgg agcttgcctc gcggcaacgt gcgatgaagt cggccaccga 1465681 caacgccgat gacctcatca aggccctgac gctgatggca aaccgcgagc ggcaggccca 1465741 gatcacccag gagattagtg aaatcgtcgg tggcgcaaat gcgctcgccg aagcccgcta 1465801 ggcccaagct aggttagccc cacgaggaag cgaagaagat atgactacca ctgccgaaaa 1465861 gaccgaccgg ccgggaaagc cgggaagctc cgacaccagc ggccgcgtgg tacgggtcac 1465921 tgggcccgtc gtcgacgtcg agtttcctcg cggttccatc cccgagctgt tcaatgcact 1465981 gcacgctgag atcaccttcg agtcgctggc gaaaaccctc accttggagg tggcgcagca 1466041 cctcggcgac aacctggtgc gcaccatctc gctgcagccg accgacggct tggtgcgcgg 1466101 cgtcgaggtg atcgacaccg ggaggtcgat ctcggtgccg gtcggtgagg gtgtgaaggg 1466161 ccacgtcttc aatgcgctgg gagattgcct ggacgagccg ggatatggcg aaaaattcga 1466221 acactggtcg attcaccgca agccgccggc gttcgaggag ctggagcctc ggaccgagat 1466281 gctcgagacc ggtctgaagg tggtcgacct gctgactccg tatgttcgtg gcggcaagat 1466341 cgcactgttc ggcggtgccg gggtgggcaa gacggtgctg attcaggaga tgatcaaccg 1466401 catcgcccgt aacttcggtg gtacgtcggt gttcgccgga gtgggcgagc gcacccgcga 1466461 gggcaacgat ctgtgggtcg agcttgccga agccaacgtg ctcaaggaca ccgcgctggt 1466521 attcggacag atggacgagc cgccgggcac ccgtatgcgt gttgcgctgt ctgcgctgac 1466581 gatggcggag tggttccgtg acgagcaggg tcaagacgta ttgctgttca tcgacaacat 1466641 cttccggttc acccaggctg ggtcggaagt gtcgacgctt ctcggccgga tgccgtcggc 1466701 cgtgggatac cagcccacgc tggccgacga gatgggcgag ctgcaggagc gcatcacctc 1466761 gacgcgggga cgctcgatca cgtcgatgca agccgtctac gtgcccgccg acgactacac 1466821 cgacccagcg ccggcgacca cgttcgccca cctggacgcc acgaccgagc tatcccgtgc 1466881 ggtgttctcc aagggcatct tccccgccgt ggacccgctg gcgtccagct cgaccatcct 1466941 ggaccccagc gttgtcgggg atgagcacta ccgcgtggcc caggaagtca tccggatcct 1467001 gcagcgttac aaggaccttc aggacattat cgcgatcctc ggtatcgacg agttgtcgga 1467061 ggaggacaag cagctggtga accgcgcccg gcgtatcgag cggttcctat cgcagaacat 1467121 gatggcagcc gaacagttca ccggccagcc gggttcgacc gtcccggtga aggagaccat 1467181 tgaagcgttc gaccgcttgt gcaagggcga tttcgatcac gtacccgaac aggccttctt 1467241 cttgatcggt ggccttgatg acctggccaa gaaagccgag agtctcggcg ccaagctgtg 1467301 acgggagttg tggcatggcc gaattgaacg ttgagatcgt cgccgtcgac cggaacatct 1467361 ggtcgggtac ggcgaagttt ctgttcaccc gcaccaccgt cggtgagatc ggcatcctgc 1467421 cccgccacat tccgttggtg gcccaattgg tcgatgacgc catggtgcgg gtcgagcggg 1467481 agggagaaaa ggacctgagg atcgcggtcg acggcgggtt cctgtcggtg accgaggagg 1467541 gcgtcagcat tctcgccgaa tctgccgagt tcgagtcgga gatcgacgag gccgccgcca 1467601 agcaggattc cgaatccgac gatccccgca tcgctgccag gggccgcgcc agattgcgcg 1467661 ccgtcggcgc gatcgactaa cccgccgatg agcgcgccca tgatcggcat ggtcgtgctc 1467721 gtcgttgtcc tggggttggc cgttctcgca ctgagttatc gtctgtggaa gctgcgccag 1467781 gggggaacgg ctgggatcat gcgggacatc cctgcggttg gaggtcacgg ctggcgccac 1467841 ggcgtaatcc gctatcgcgg cggcgaagcc gcgttctacc ggctttctag tctgcgcttg 1467901 tggccggatc gccggctcag tagacggggt gtggagatca tttcccggcg cgcgccccgt 1467961 ggcgacgaat tcgacatcat gaccgacgag attgtcgttg tggaactgtg cgacagcacc 1468021 caggaccgaa gggtaggtta cgagatcgcg ctcgacaggg gcgcgttgac cgcatttctg 1468081 tcgtggttgg agtcccggcc gtcgccgcgc gcgcgccgcc gtagtatgtg acgcactggt 1468141 cagcagacgc aaaagccccc atttcgggct ctactgactg atctgtgggt ggttgtgtcg 1468201 gcctggcagg gtggggcggt ggccggcgag ggtgagcatg gctagggcga tgagggcttg 1468261 tggtgagcgg aatccgaacg cgatccgggt cagtaggcgg atcttggtgt tggtggattc 1468321 gatcaggcct tgggataggc cgtggtcgag ggcggcgtcg atggccaccc ggtggcgttt 1468381 gatgcgggcg gcaagctcga cgaataccgg gatgcgacag cgctgggccc aggagatcca 1468441 ccggtccagg gcctgtttac cttcctcgcc cttgaccgaa aacacatgcc gcaggctctc 1468501 tttgagcagg taggcgcgat acagacgggg atcggtcttg gcgatccagg ccagtttggc 1468561 gctttggcgt tcggtgaggt cctcggggtt cttccacagc gcgtagcggg cgcccttgag 1468621 ccgccgtgcc cgctcgcggc ccggacgtgg tgcggcgttc ttaccgggcc ggccccggcc 1468681 ccacttgggt tcggtgcgcg cgatcgcccg tgcgtcgttc caggctcggc gccgctcgac 1468741 gtcgagcgcc tcggtggccc aggccaccac atgaaacgga tcggcgcatt gaatcgcatc 1468801 cgggcagcgc tcggtgacca cgtcagcgat ccagtccgcg gcatcggccg aaacgtgagt 1468861 aatctgggcg gcccgctcag cgcccagggc atcgaagaac aagcccaggg tggccttgtc 1468921 gtggcccggg gcggcccaca ccaaccggcc gctgtcgtga tcgacgacca ccgtcaggta 1468981 ccggtggtgg cgcttgtagg agatctcatc gataccgatg cggcgcaagt tcgcgaaccg 1469041 gtcaatgcgc ttttcggtgt cggcccagac ccgggccacg atcgccccga cggtgcgcca 1469101 ggcgatccgc atcaactcgc acaccgcggt cttcgaacac gccaccgcca gccaggccac 1469161 cgtgtcatcg aaagcatacg tgtgcccggc atgatgacgc gcccacggca ccgccaccac 1469221 cgtcggccca tgggtggggc agttcacccg cggcgcctcg gcctccaaga acacctcgac 1469281 ggtgccccaa tccagactgc gccattggcg caggcccgca ccgcggtcat accaggacgc 1469341 cttgcgaccg cagcgaccac agcggcgcaa cactgcactt cgtggccgca cccgggcgat 1469401 cacccgcgca ccgtctccgg cgtcatcctc ctcgaattcg atgtcctcaa tcacggtgcg 1469461 cttgtcgaca cccagcagcg cacgaaatag cctcacattg cgcacgtcgt tgtcggctcc 1469521 ttgtgtttct gatccttgac aagccagaaa ccttaagcca caacgacgtg cgcctactca 1469581 ggacacaaac tcacccacgg aagtgtcaga agagcccaaa aaccgtgggt attgggggct 1469641 ttcgcgtctg ctcgcacgcg gaaggtgccg ctagctcgcc gtcctatcac caccgggccg 1469701 ccacagcacg tcaccgtcgg gattggctac ccgcgacagg atgaacagca gatccgacag 1469761 ccggttcagg tatttcgccg gcagtacgct gacgccttcc gggtgagcgt cgaccgcggc 1469821 ccacgcggat cgctcggccc ggcgaacgac ggtgcgagcg acgtgcaaca gcgccgacag 1469881 cggtgaacca ccaggtagta caaaggattt tagtgcaggc aggcccgcgt tgtatgcgtc 1469941 gcaccaccct tcgagccgat cgatatagga ctgtgcgatt cgcagcggag ggtgcttcgg 1470001 gttttccact atcggagtcg acagatccgc accggcatcg aacaagtcgt tctggatctg 1470061 ccgcagcaca tccgtgattt gagtgtccgg gtggcccagc gccagggcgg ccccgatcgc 1470121 ggcgttggcc tcgtcgcaat ccgcgtatgc caccagtcgg gcgtcggttt tggcgacacg 1470181 ggacatatcg ctcaatcccg tcgttccgtc atcgccggtt cgggtataga tgcgggtcag 1470241 gtggactgcc atgagcaaac ggtactcgct gactggcttg gctcactgac aaggcaaaac 1470301 ccctttacta cactgaccgg gtggccgagc gtttcgtcgt gactgggggc aaccggttat 1470361 caggcgaagt ggccgtcggc ggcgccaaga acagcgtgct caagctcatg gctgcgacgt 1470421 tgttggccga gggcaccagc acgatcacca actgtcccga catcctcgat gtgccgctga 1470481 tggcggaggt actgcgtggt ctgggcgcca ccgtcgaact cgacggtgac gtggcccgga 1470541 tcaccgcacc tgacgagccg aagtacgatg ccgacttcgc tgcggtgcgg caattccgcg 1470601 cctcggtctg tgtgctggga ccgctggtcg ggcggtgcaa acgggccagg gtcgcgctgc 1470661 cgggcggtga cgcgatcggg tcgcgtccgt tggatatgca ccaggcgggc ctacggcaat 1470721 tgggtgccca ctgcaacatc gagcacggct gcgtggtagc ccgagcggaa acgttgcgcg 1470781 gtgcggagat tcagttggag ttcccctcgg tgggagccac cgagaacatc ttgatggccg 1470841 ccgtggtggc cgagggagtc accactattc acaatgcggc tcgagaaccc gacgtcgtcg 1470901 acttgtgcac gatgttgaac cagatgggcg cacaggtcga aggtgcgggt tcgccgacaa 1470961 tgaccatcac cggtgtcccg cggctgcatc caaccgagca ccgggtgatc ggagaccgta 1471021 tcgttgccgc cacatggggc atcgctgccg caatgacccg tggtgatata tcagtggcgg 1471081 gcgtagaccc ggcgcatctg cagctggtgc tgcacaaatt gcacgacgcg ggcgcaaccg 1471141 tcacccagac tgacgccagc ttccgggtga cccagtacga gcgtccgaag gctgtcaacg 1471201 ttgcgacctt gccgttcccc gggtttccca cggatctgca gccgatggct atcgctttgg 1471261 cgtcgatcgc cgacggcaca tcgatgatca cggagaacgt gttcgaggcg cggttccgct 1471321 tcgttgaaga gatgatccgg ctcggtgcag acgctcggac cgacgggcac cacgccgtgg 1471381 tgcggggcct cccgcagctg tcgagcgctc cggtgtggtg ttcggacatc cgtgccgggg 1471441 ccggcttggt gctggcgggg ctcgttgccg acggcgacac cgaggtccac gatgtattcc 1471501 acatcgatcg cggatatccg ttgttcgtgg agaacctggt gagtctcggt gccgagatcg 1471561 aacgggtatg ctgttaggcg acggtcacct atggatatct atggatgacc gaacctggtc 1471621 ttgactccat tgccggattt gtattagact ggcagggtcg ccccgaagcg ggcggaaaca 1471681 agcaagcgtg ttgtttgaga actcaatagt gtgtttggtg gtttcacatt tttgttgtta 1471741 tttttggcca tgctcttgat gccccgttgt cgggggcgtg gccgtttgtt ttgtcaggat 1471801 atttctaaat acctttggct cccttttcca aagggagtgt ttgggttttg tttggagagt 1471861 ttgatcctgg ctcaggacga acgctggcgg cgtgcttaac acatgcaagt cgaacggaaa 1471921 ggtctcttcg gagatactcg agtggcgaac gggtgagtaa cacgtgggtg atctgccctg 1471981 cacttcggga taagcctggg aaactgggtc taataccgga taggaccacg ggatgcatgt 1472041 cttgtggtgg aaagcgcttt agcggtgtgg gatgagcccg cggcctatca gcttgttggt 1472101 ggggtgacgg cctaccaagg cgacgacggg tagccggcct gagagggtgt ccggccacac 1472161 tgggactgag atacggccca gactcctacg ggaggcagca gtggggaata ttgcacaatg 1472221 ggcgcaagcc tgatgcagcg acgccgcgtg ggggatgacg gccttcgggt tgtaaacctc 1472281 tttcaccatc gacgaaggtc cgggttctct cggattgacg gtaggtggag aagaagcacc 1472341 ggccaactac gtgccagcag ccgcggtaat acgtagggtg cgagcgttgt ccggaattac 1472401 tgggcgtaaa gagctcgtag gtggtttgtc gcgttgttcg tgaaatctca cggcttaact 1472461 gtgagcgtgc gggcgatacg ggcagactag agtactgcag gggagactgg aattcctggt 1472521 gtagcggtgg aatgcgcaga tatcaggagg aacaccggtg gcgaaggcgg gtctctgggc 1472581 agtaactgac gctgaggagc gaaagcgtgg ggagcgaaca ggattagata ccctggtagt 1472641 ccacgccgta aacggtgggt actaggtgtg ggtttccttc cttgggatcc gtgccgtagc 1472701 taacgcatta agtaccccgc ctggggagta cggccgcaag gctaaaactc aaaggaattg 1472761 acgggggccc gcacaagcgg cggagcatgt ggattaattc gatgcaacgc gaagaacctt 1472821 acctgggttt gacatgcaca ggacgcgtct agagataggc gttcccttgt ggcctgtgtg 1472881 caggtggtgc atggctgtcg tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg 1472941 agcgcaaccc ttgtctcatg ttgccagcac gtaatggtgg ggactcgtga gagactgccg 1473001 gggtcaactc ggaggaaggt ggggatgacg tcaagtcatc atgcccctta tgtccagggc 1473061 ttcacacatg ctacaatggc cggtacaaag ggctgcgatg ccgcgaggtt aagcgaatcc 1473121 ttaaaagccg gtctcagttc ggatcggggt ctgcaactcg accccgtgaa gtcggagtcg 1473181 ctagtaatcg cagatcagca acgctgcggt gaatacgttc ccgggccttg tacacaccgc 1473241 ccgtcacgtc atgaaagtcg gtaacacccg aagccagtgg cctaaccctc gggagggagc 1473301 tgtcgaaggt gggatcggcg attgggacga agtcgtaaca aggtagccgt accggaaggt 1473361 gcggctggat cacctccttt ctaaggagca ccacgaaaac gccccaactg gtggggcgta 1473421 ggccgtgagg ggttcttgtc tgtagtgggc gagagccggg tgcatgacaa caaagttggc 1473481 caccaacaca ctgttgggtc ctgaggcaac actcggactt gttccaggtg ttgtcccacc 1473541 gccttggtgg tggggtgtgg tgtttgagaa ctggatagtg gttgcgagca tcaatggata 1473601 cgctgccggc tagcggtggc gtgttctttg tgcaatattc tttggttttt gttgtgtttg 1473661 taagtgtcta agggcgcatg gtggatgcct tggcatcgag agccgatgaa ggacgtggga 1473721 ggctgcgata tgcctcgggg agctgtcaac cgagcgtgga tccgaggatt tccgaatggg 1473781 gaaacccagc acgagtgatg tcgtgctacc cgcatctgaa tatatagggt gcgggaggga 1473841 acgcggggaa gtgaaacatc tcagtacccg taggaggaga aaacaattgt gattccgcaa 1473901 gtagtggcga gcgaacgcgg aacaggctaa accgcacgca tgggtaaccg ggtaggggtt 1473961 gtgtgtgcgg ggttgtggga ggatatgtct cagcgctacc cggctgagag gcagtcagaa 1474021 agtgtcgtgg ttagcggaag tggcctggga tggtctgccg tagacggtga gagcccggta 1474081 cgcgaaaacc cggcacctgc ctagtatcaa ttcccgagta gcagcgggcc cgtggaatcc 1474141 gctgtgaatc cgccgggacc acccggtaag cctaaatact cctcgatgac cgatagcgga 1474201 ttagtaccgt gagggaatgg tgaaaagtac cccgggaggg gagtgaaaga gtacctgaaa 1474261 ccgtgtgcct acaatccgtc agagcctcct tttcctctcc ggaggagggt ggtgatggcg 1474321 tgccttttga agaatgagcc tgcgagtcag ggacatgtcg caaggttaac ccgtgtgggg 1474381 tagccgcagc gaaagcgagt ctgaataggg cgacccacac gcgcatacgc gcgtgtgaat 1474441 agtggcgtgt tctggacccg aagcggagtg atctacccat ggccagggtg aagcgcgggt 1474501 aagaccgcgt ggaggcccga acccacttag gttgaagact gaggggatga gctgtgggta 1474561 ggggtgaaag gccaatcaaa ctccgtgata gctggttctc cccgaaatgc atttaggtgc 1474621 agcgttgcgt ggttcaccgc ggaggtagag ctactggatg gccgatgggc cctactaggt 1474681 tactgacgtc agccaaactc cgaatgccgt ggtgtaaagc gtggcagtga gacggcgggg 1474741 gataagctcc gtacgtcgaa agggaaacag cccagatcgc cggctaaggc ccccaagcgt 1474801 gtgctaagtg ggaaaggatg tgcagtcgca aagacaacca ggaggttggc ttagaagcag 1474861 ccacccttga aagagtgcgt aatagctcac tggtcaagtg attgtgcgcc gataatgtag 1474921 cggggctcaa gcacaccgcc gaagccgcgg cacatccacc ttgtggtggg tgtgggtagg 1474981 ggagcgtccc tcattcagcg aagccaccgg gtgaccggtg gtggagggtg ggggagtgag 1475041 aatgcaggca tgagtagcga caaggcaagt gagaaccttg cccgccgaaa gaccaagggt 1475101 tcctgggcca ggccagtccg cccagggtga gtcgggacct aaggcgaggc cgacaggcgt 1475161 agtcgatgga caacgggttg atattcccgt acccgtgtgt gggcgcccgt gacgaatcag 1475221 cggtactaac cacccaaaac cggatcgatc actccccttc gggggtgtgg agttctgggg 1475281 ctgcgtggga acttcgctgg tagtagtcaa gcgaaggggt gacgcaggaa ggtagccgta 1475341 ccagtcagtg gtaacactgg ggcaagccgg tagggagagc gataggcaaa tccgtcgctc 1475401 actaatcctg agaggtgacg catagccggt tgaggcgaat tcggtgatcc tctgctgcca 1475461 agaaaagcct ctagcgagca cacacacggc ccgtacccca aaccgacaca ggtggtcagg 1475521 tagagcatac caaggcgtac gagataacta tggttaagga actcggcaaa atgcccccgt 1475581 aacttcggga gaagggggac cggaatatcg tgaacaccct tgcggtggga gcgggatccg 1475641 gtcgcagaaa ccagtgagga gcgactgttt actaaaaaca caggtccgtg cgaagtcgca 1475701 agacgatgta tacggactga cgcctgcccg gtgctggaag gttaagagga cccgttaacc 1475761 cgcaagggtg aagcggagaa tttaagcccc agtaaacggc ggtggtaact ataaccatcc 1475821 taaggtagcg aaattccttg tcgggtaagt tccgacctgc acgaatggcg taacgacttc 1475881 tcaactgtct caaccataga ctcggcgaaa ttgcactacg agtaaagatg ctcgttacgc 1475941 gcggcaggac gaaaagaccc cgggaccttc actacaactt ggtattgatg ttcggtacgg 1476001 tttgtgtagg ataggtggga gactgtgaaa cctcgacgcc agttggggcg gagtcgttgt 1476061 tgaaatacca ctctgatcgt attgggcatc taacctcgaa ccctgaatcg ggtttaggga 1476121 cagtgcctgg cgggtagttt aactggggcg gttgcctcct aaaatgtaac ggaggcgccc 1476181 aaaggttccc tcaacctgga cggcaatcag gtggcgagtg taaatgcaca agggagcttg 1476241 actgcgagac ttacaagtca agcagggacg aaagtcggga ttagtgatcc ggcacccccg 1476301 agtggaaggg gtgtcgctca acggataaaa ggtaccccgg ggataacagg ctgatcttcc 1476361 ccaagagtcc atatcgacgg gatggtttgg cacctcgatg tcggctcgtc gcatcctggg 1476421 gctggagcag gtcccaaggg ttgggctgtt cgcccattaa agcggcacgc gagctgggtt 1476481 tagaacgtcg tgagacagtt cggtctctat ccgccgcgcg cgtcagaaac ttgaggaaac 1476541 ctgtccctag tacgagagga ccgggacgga cgaacctctg gtgcaccagt tgtcccgcca 1476601 ggggcaccgc tggatagcca cgttcggtca ggataaccgc tgaaagcatc taagcgggaa 1476661 accttctcca agatcaggtt tctcacccac ttggtgggat aaggcccccc gcagaacacg 1476721 ggttcaatag gtcagacctg gaagctcagt aatgggtgta gggaactggt gctaaccggc 1476781 cgaaaactta caacaccctc ccttttggaa aagggaggca aaaacaaact cgcaaccaca 1476841 tccgttcacg gcgctagccg tgcgtccaca ccccccacca gaacaaattt gcatagagtt 1476901 acggcggcca cagcggcagg gaaacgcccg gtcccattcc gaacccggaa gctaagcctg 1476961 ccagcgccga tgatactgcc cctccgggtg gaaaagtagg acaccgccga acatacaaaa 1477021 acacccccgg taacggtggt gtttttgtat gtttatatcg actcagccgc tcgcgagcgg 1477081 gcgaattatg gcttcgattt tcgcaatgac gataccctcg cgggcggggg cgctcagtcg 1477141 aagagcgtca agtctgcggg cgcccggctt ttctccaact cgagcagagc tcgtttccgg 1477201 ttgattccac cgccgtaccc ggtgagcttt ccgctggcgc cgatcacgcg gtggcacggg 1477261 acgatgatgg cgatgggatt gtggccgttg gccaatccca cggcgcgtgc ggcgccgggg 1477321 gcgccgatct ggtcggcgat ttccccgtag gaccgggttt ccccgtacgg gattgtcagc 1477381 aatgctttcc atactcgttg ctgaaagtcg gttccccgga ggtcaagttc cacatcgaat 1477441 tcggtgagct cgccggcgaa ataagcgttg agttggtcga cagcgccaga aaatgcgccg 1477501 gggtcgggtg tccagtgtgt gcggcttggc tcatacgtct gctcgagcat ccgcaggttc 1477561 gtcaacaccg agccatgccc ggccagggtt aatggcccga tggggctatc gatggtgcgg 1477621 tagtgaatca tgcgatcttc tcctgcggtg gccattggtt taccggatgt tccagggtgg 1477681 tccacaggtg ctgggtggca taggagcgcc aggggcgcca gcgagcgctg tgcaccgtca 1477741 gggctcgtcg ttgtgcaggc aggcccagct ttttggcggc cagccgcagg ccgagatcac 1477801 tggccggaaa ggcgtccggg tcaccgaggc cgcgcatggc gatgacctcc gcggtccagg 1477861 ggcccactcc gggcagcgct agcaactgcc cgcgggcgcg ttgccagtca catccggcgt 1477921 ccaggaccag acttttgtcg gcaaggctgg cgacgagcgc gtttatggtc ctttgacgcg 1477981 ccttggggac ggccagatgg ccgggatcga tctcagcgag ctgctcgatc gacgggaagg 1478041 tgtgggtcaa agcgccgtgg cgatcgtgga ccggccgtcc gtaggcggcg accagtcggc 1478101 ccgcgtgagt gcttgcggcc ttcgtcgata cctgttgggc gaggaccgcc cgcacggcga 1478161 attctgcctc gtcgactgtg cggggaatgc gttgcccggg tgccttgccc accactgcgc 1478221 gcagatccgg atcggcgccc agcgcctcga cgatcgcttc gggatcggcg tcgaggtcca 1478281 gcagccgtcg gcaacgtgca gtggccgtca tcaggtcgcg gaaatcatcg agcacaagca 1478341 ggcagcgcac atgatcgggt gccggcgtca ggctgacgat gccgttgccc catgggagcc 1478401 gtagcgtgcg tcggtacgca ccatcgcgga cctcttcgca acccggcacc gcggtggcgg 1478461 ccagatggcc gaaaacaccc tcgaaggcga atggtgcacg gacgggtagc cgcagcgaca 1478521 ccgtgcccgc tgatgcggtg gcagactcga atcgggcggc cgcgcgcgca cgcaatgccg 1478581 tcggtgtgcc gtcgcacgcc aggcgaacgg tgtcgttgaa ctgacggatg ctggaaaacc 1478641 cggcggcgaa tgcgacatcg ccgaacggca ggttcgtggt ctcgatcagc acccgggcgg 1478701 tctgcatgcg ttgggcgcgg gccaacgcga gcggaccggc gccgaccacg gcctgcaaca 1478761 gccgctccag ctggcgaatg gtgtaaccga gctgggccgc gaggccgctg acaccgtcgc 1478821 ggtccaccgt tccgtcggca atcagccgca tcgcccgcgc cacgacgtca ctacgcacat 1478881 tccattccgg agacccaggc gaggcgtcgg ggcggcaccg tttgcaggcc cggaatccct 1478941 ccccctgagc ggccgccgca gtcggcagga accggacatt gcgcgcgaac ggtggccgga 1479001 cggggcaact cggccggcag tagacaccgg tggtcaaaac cgcgacgacg aaccagccgt 1479061 cgaaccgggc gtctttggac tggatcgccc ggtagcagcg ttcgaagtcg tcgtgcaccc 1479121 ttcaacaatt acacccgccc accgacatga ctggcggaaa aacgacattg tgatggggtc 1479181 gtcgtgggtt cgggcaggtt acctacgcgg cttggtcagc ccgaccggct tggccagccg 1479241 gaccggttgg tcgtgcccac gaagtttcac atgcctaccc aaagaccaac gggcgcgctc 1479301 ctcttcgctt gcggcgtcca cggcctgtgc cgaagccagc aacttgccgg gacgcgattt 1479361 ggccagttcg cacaatcggg ccgcctcgtt gaccggctcc ccgatcacgg tgtactcgaa 1479421 ccgttctcgg gcacccacgt tgccggcaat gacctgcccc gccgccacgc cgatcccggc 1479481 ctggcactcg ggcatttcgt tgaccagccg atcggctatc gcccgcgcgg cggccagtgc 1479541 cttgtcttcg ggacagggaa gccggttcgg ggcgccgaag atggttagcg acgcgtcccc 1479601 ctcgaacttg ttgaccaatc cgtggtggcg gtcgacctcg tcgacgacaa tcgcgaagaa 1479661 cttgttgagc agcttgacga cgtcggccgg cggccggctg gtcaccaatt gcgtcgagcc 1479721 gacgatgtcg atgaacacga cggcgacgtg gcgttcttcg ccgcccagtt tcgaacgttc 1479781 acgctcggcg gcggcggcga cttcgcgtcc gacgtggcgg ccgaacagat cgcgcactcg 1479841 ttcgcgctcc cgcagtccgg cgaccatcgc gttgaaacca cgctgcagct cgccgagttc 1479901 ggtgccgtcg aagaccacca ggttggtccg tagctcgccc cgctcgacgc gccgcagcgc 1479961 cgcacgcacc acccgcaccg gggtcgccgt cagccaggcc aggatccaca tcaggatgaa 1480021 cccgaacacc aatgtgacca tcgagatgat cagcacgccc gtcgcgaact gcatccgagt 1480081 gagattgagc agcaccattt cgaacatcgc catcagggcg atgccgacga cgggtactcc 1480141 cgaaccgagc agccacacca ccatggtccg gcccaggatt cccggcgcca accggcgtgg 1480201 cggcggcccg gcctcgagcg cctgggcggc gaacgggcgc aatgcgaact cggtatgcag 1480261 ataggttgcg gttgcgacca atacgccgca aaagctgacc gcgaacagga atcgcgggat 1480321 gaacgcgttg ttgatcaggc cgtagagtgt cgtcaagagc gccgtgccaa caccccagaa 1480381 catgaggtgg cccacggcga ctcgccaggg ggccaggaag gtgcggcgct cctcctcacg 1480441 agtcggtttc cgtccttcga tcgcccagcg cagggcttgc acggtctgcc tggtcagtgc 1480501 gtagctaccc aaagcgaggg ctagcaggac atagcccggt accaccccga acgtgagcca 1480561 ccgtggcgtg tcgcgaacga tgctcggttc ggggatggcg atcgtcacca atagcagggc 1480621 aaccccgatg ccgagcaggt tcgcggtcac gaccagcgcg gtcagcatga cctggatccg 1480681 tacccgtcgg cgccgttggc tttccgaaac ccgcccaagc agccaggagc cgtacgcggg 1480741 agtttctggc agccggccgc tctgccgggt caccgtctcc agcacccgac ccaagcgttg 1480801 cgccgtgctc ttcttggccg acattgtggc gtcagactag tttgtcgaag agtcgggtgc 1480861 gaccggttgg cgcgctcgtg ttgtttgccc ggcttaggtg ggcacggcca gccgagtcgg 1480921 ctgctcatgt ccgcgcagcg tcaccgtctc gcccaaagac caatgggcac gttcggtttc 1480981 gctggcagcg tgcagtgtgt ccgaggatgc tagcaatcgc gcggggtgtg atttggccag 1481041 ttcgcacaat cgggccgcct ggttgaccgg cttgccgacc actgtgtatt cgaatctttg 1481101 cttggcgccg acattgccgg cgacgatctg gcctgccgcc accccgatgc cggcttggac 1481161 ctcgggcatc tcgttggcca gccgatcggc tatggcccgg gcggcggcca gcgcggcgtc 1481221 ttcgggacgg tcgaggcggt tcggggctcc gaagatggcc agggcggcgt cgcctgcgaa 1481281 cttgttgatc agtccgtggt gacggtcgac ctcgttgacg acgatcgcga aaaaccggtt 1481341 gaggagcttg accacgtggg cggcaggttg gttgtccacc agctgggtgg agccgacgat 1481401 gtcgacgaag acgacggcgg cgtggcggtc ttcgccgcct agctgtggtc gttcacgctc 1481461 ggcggcggcg gcgacttcgc gtccgacgtg gcggccgaaa aggtcgcgca cgcgttcgcg 1481521 ctcgcgcagg ccgttgacca tcgcgttgaa accacgctgc agctcaccga gttcggtgcc 1481581 gtcgaacacc accagatccc ctcgcagatc cccctgctcg acacgcttga gcgcagcgcg 1481641 caccactcgc accggcgccg ccgtcagcca agcaagaatc cacatcacga gaaacccgaa 1481701 gatcaacgtg gttatcgaca ggatcaacac cgccgacgcg agctgcgttt cggtcagatt 1481761 gtgcaccaat aggacgtaga gcgctgtcgt ggcgatgccg gtcacaggca cgcctgaacc 1481821 tagcgaccac accgtcatcg ttcggcccat gatgcccggt gcaaaccgtc gtggcggtcg 1481881 tcccgcttcg agtgctttag cggctacggg tcgcagcgcg aactcggtga acaagtagca 1481941 attggtggct accaaaacgc cgcagatagt caccgagaac aaaattatgg tgacgaatac 1482001 gcggttggcc agcccgtaga gcgtggccaa caacgccccg ccgatatccc acagaatgag 1482061 gtggacggct gccactcgaa acgggagcag caaggtgttg cgcccgtcgg cctggctcgg 1482121 cgcgcgttcc tcgatcgccc accgtatgga cgctctgacg attcgcgtgg ttatccagta 1482181 ggtgccgatg gccagtgcga gcgtcgcata ggccggtgcg accccgaagg tgacccacca 1482241 tggggcgtcg gtgtagatgc taggcaccgg aaaagcgaag gtcaccacta gcagcgcgac 1482301 cacgatcccg gtcaggttcg ccgtcatgat atagacggtc acgatgcgct tgatgcgtac 1482361 ccagcgacgc gacgggcttt ctgacacccg cccaagcaac caggagccat acgccggggt 1482421 ctcgggcagc tggccgcact gacgggtcat cgtctcgagt gcctggccca ggcgttgcgc 1482481 catggtcttt ttcgccggca tggtggcgtc agcctaatct gtcggatgcg ccccacggta 1482541 aatcgtgtgg gtctggtgat cgcccagtgc accgccgact atgtcggccg actgagcagg 1482601 catctgcagc tgttgaactg gcgacgtcag ccggatgggc tggtcgtgcc cgcgaagtgt 1482661 cacggtctcg cctaaagacc aacgggcaca ttcgttttca ctggcaccgc gcaacgtttg 1482721 cgacgacgcc aacaatcggc tcgggtatga ttttgccagt tcgcacagtc gtgcagcctc 1482781 gttgaccggt tcgccgatca cggtgtattc gaaccgttcg tgggcgccga cattgccggc 1482841 gacaacctga cctgccgcta ccccgatgcc ggcttggcac tccggcattt cgctggctag 1482901 ccggtcggcg atggctcgtg cggtggccag cgcggcatct tcgggatggc tcaggcggtt 1482961 gggggccccg aagactgcca gcgaggcgtc tccctgaaac ttgttgacaa gtccacggtg 1483021 atggttcact tcatcgacga ttaccgtgaa gaaccggttg agtagcatca cgacctctgc 1483081 cgcaggccgg ctggtgacca attgagttga accgacgatg tcgacgaaga cgacggcgac 1483141 atggcgctct tcgccgccca gttttggtcg ctcgcgttcg gctgctgcgg cgacctcgcg 1483201 accgacgtgg cggccgaaga gatcgcgtac gcgttcgcgc tcgcgcaggc cctcgaccat 1483261 tctgttgaaa ccacgctgta gctcaccgag ttcggtcccg tcgaatacga ccagatcgcc 1483321 gcttagatcg ccctgctcta cgcggttgag cgcctcgcgg accacgcgca caggcgtggc 1483381 cgtcagccaa gcgagaatcc acatcaggat gaatccgaag atcaacagtg gtgcccacag 1483441 gatcagcact gtgatcatga attgatcatt ggagagttcc caaaacgtat cgtcgaagat 1483501 ggcggtgagg gcgacaccga cattgggtac gcctgaacag agcagccaca ccagcatggt 1483561 tcggcccacg atgcctcgca ccagcgatcg tggtgttgct cccacttcga gcgcctgggc 1483621 ggccatcggg cgaagcgcaa actcggttaa cagatagcag ctggtggctg cgacaacgcc 1483681 gatgacgccc atcgaaaaca ggaaccgcgg gataaacaac cggttggcca ggccgtagat 1483741 tatcgtccac aacgctgcgg cggcgcccca caggaaaaga actgccaacg ccactcgcag 1483801 tgggactagg aaagcgctgc gcgcctcatc atggctgggg gtgcgttcct cgattgccca 1483861 ccgcaacgct cgagccgttt gcctggtgag ccagtaggtg cccagtatga aggcgagcac 1483921 gcagtatccc ggaacgatcc cgaacgacac ccaatgcggg gcgtccaaaa tcacgcttgg 1483981 tttcggaaag gcgaccgtca gtagcatggc accgacaatg agcccgatca cgttcgtgac 1484041 caaaatggcg acggtcagca tgccctggat acgtacccgc cgcatccgtg ggctctccga 1484101 cacgcgccca agcagccatg agccgtatgc gggcgtctct ggccgtcgtc cagtgcgcgg 1484161 gctgagagtc tcgacggccc ctggcaagtg tcgagtggtg gccttctcgg atggcatggt 1484221 gacgtcagcg tagtgtgtcg gtcacgctct aaggaacaac gtcgttgcgc gctctaaggt 1484281 gagtcgggtg cgtctagtca tcgcccagtg cactgtcgac tacatcggcc ggctcaccgc 1484341 gcatctgccg tccgcgcgcc ggctgttgct gttcaaggcc gacggatcgg tcagcgtaca 1484401 tgctgacgac cgcgcctaca agccgttgaa ctggatgagt ccgccgtgct ggttgaccga 1484461 agagtccggc ggccaggcgc cagtgtgggt ggtcgagaac aaggccggcg agcagctgcg 1484521 catcactatc gaaggaatcg agcacgacag tagccacgag ctgggcgtgg accccgggct 1484581 ggtcaaggac ggcgtcgagg cccacttgca ggcgttgctc gccgagcaca tccaattgct 1484641 gggcgaaggg tacacgctgg tccgccgcga gtacatgacc gcgatcggac ccgtcgacct 1484701 gctgtgcagc gacgaacgag gtggctcggt cgcggtggaa atcaagcggc gtggcgagat 1484761 cgacggcgtg gagcagctga cccgctacct cgagttgctc aaccgcgaca gtgtgctcgc 1484821 gccggtcaag ggggtgtttg ccgctcaaca gatcaagccg caggctcgga ttctggccac 1484881 cgaccgcggg atccgttgtt tgacattgga ttacgacaca atgcgcggga tggatagcgg 1484941 cgagtaccgg ctgttctgag ttgcgcgatt aaactgatgc gatggctcgg cgccgcaaac 1485001 cgctgcaccg gcagcggccg gaaccgccgt cgtgggccct gcgccgagtg gaagcggggc 1485061 ccgatggcca cgagtatgaa gtacgaccgg tcgctgcggc ccgcgccgtc aagacctatc 1485121 gctgtccggg gtgtgatcac gaaatccgtt ccggtactgc acatgtggta gtgtggccga 1485181 ctgacttgcc gcaagccggc gtcgatgacc ggcgtcactg gcacaccccg tgctgggcga 1485241 accgagcaac ccgcggtccg actcgaaaat ggacctaggc ttttggcggc tggtgcgccc 1485301 tgctggtgcg ccttaggggg ccggctccac caactcgatc agaaccccgc cggcgtcttt 1485361 cgggtggatg aagttgatcc gtgagttcgc ggtgccacgc ctggccgtct cgtagaccag 1485421 ccggacgccc tgggagcgca gccgccgaca catggcgtca agatcgctga cccggcacgc 1485481 cagctgttgg atgcctggcc cgcgcttgtc caggaacttc gctatcaccg aggattcgtc 1485541 gagcggggcc atcaactgga tttgcgccgc ggagcccggc accgccagca gtgcctcgcg 1485601 gatgccctga tcgtcgttga tttcctcgtg gaccaggatc atgccaaggt ggtcgtgata 1485661 ccactcgatg gcaacgtcca ggtcggcgac cgcaataccg acgtgatcga gtccagttac 1485721 caacgaggta gccagcatgt gacgggcgtg gacttgatcg gtcgtcatca cacaacggta 1485781 acctgaaggg aaagaatctg cttctccggg tcggtcagat cggctttcgg gtgcgctgag 1485841 gaggtagtca taacgacatc ggtgattgtt gctggcgcgc gtacacccat cggcaagttg 1485901 atgggctccc tgaaggattt cagcgccagc gagctgggtg ccatcgccat taagggcgcc 1485961 ctggagaagg ccaacgtgcc ggcgtccttg gtcgagtacg tgatcatggg ccaggtgttg 1486021 accgcgggtg ccgggcaaat gcccgcacgg caggcggcag tggcggccgg catcggttgg 1486081 gatgtccctg cgctgacgat caacaagatg tgcctgtccg gcatcgacgc aatcgcgctg 1486141 gctgatcaac tcattcgggc cagagagttc gacgtggtgg tggccggcgg tcaggagtcg 1486201 atgacgaagg cgccccacct gttgatgaat agccggtcgg gttacaagta cggcgacgtt 1486261 acggttttgg accacatggc ctacgacggt ctgcacgacg tgttcaccga tcagccgatg 1486321 ggcgcgctca ccgagcaacg caacgacgtc gacatgttca cccgctccga acaggacgag 1486381 tacgcggctg cgtcccacca aaaggcggcc gcggcatgga aggacggcgt attcgccgac 1486441 gaggtgatcc cggtgaacat cccgcagcgc acgggcgatc cactgcagtt caccgaggac 1486501 gaggggatcc gcgccaacac caccgccgcc gcgctggccg gtctgaagcc ggcgttccgt 1486561 ggcgacggca ccatcaccgc cgggtcggcg tcacagatct ccgacggtgc ggccgcggtg 1486621 gtggtcatga accaggaaaa ggcccaggaa ctggggctga cctggctagc cgagatcggc 1486681 gcccacggtg tggtggccgg gccggattcc acactgcaat cgcagccggc caacgcgatc 1486741 aacaaggcgc tggatcgcga gggcatctcg gtggaccagc tcgacgtggt ggagatcaac 1486801 gaggcgttcg ctgcggtggc attggcctcg atacgcgaac tcgggctgaa cccccagatc 1486861 gtcaacgtca acggtggtgc gattgccgtc gggcatcccc tcggcatgtc agggacgcga 1486921 atcacgctac atgcggcgct gcagttggca cgccggggat cgggcgtcgg ggttgccgca 1486981 ttgtgcgggg ctggcgggca gggcgacgca ctgatattgc gggccggata gcggttgagg 1487041 ggtcggtggc ggccagtgtg atcttggtca taccaaccga tcgcggtatg tcggctcctg 1487101 ccgcagggtc ggcgccaccg ggtggatcga tgaccgcagc ggcatgacag acttgacggc 1487161 gtgacgcgtc cgcgaccccc gctcgggccg gccatggccg gtgctgttga cctctccggc 1487221 atcaaacaac gtgcccagca aaacgctgcg gcgagcacgg atgccgaccg ggcactgtcg 1487281 acgccgtccg gtgtgaccga gatcaccgag gcgaacttcg aggacgaggt gatcgtccgg 1487341 tccgacgaag tgccggtggt ggtgttgctg tggtcacccc gcagcgaggt atgcgtcgac 1487401 ttgcttgaca cgctgtccgg cttggccgct gccgctaagg gcaagtggtc gctggcgtcg 1487461 gttaacgttg acgtcgcacc cagggtggca cagatattcg gcgtccaagc ggttccgacc 1487521 gtggtggcct tggctgcggg acagccgatc tcgagcttcc agggcctcca gcccgcggac 1487581 caactgagtc gctgggtgga ttccctgttg tctgcgacag ccggaaagct caagggcgca 1487641 gcgagttccg aggagtccac cgaagtcgat ccagcggtgg cacaggcgcg ccagcagctc 1487701 gaggatggcg actttgttgc cgcgcgcaag tcatatcagg cgattttgga tgccaaccca 1487761 ggaagcgtcg aagccaaggc ggccatccgc cagatcgaat tcctcatccg cgcaaccgca 1487821 caacggcccg acgccgtctc ggtcgccgac agcttgtcgg atgacatcga cgccgcgttt 1487881 gcggcagccg acgtgcaagt cctcaaccag gatgtgagtg cggccttcga gcgcctgatc 1487941 gcgttggtgc gtcggacatc tggagaagag cgcacccggg tgcgcacccg gctgatcgag 1488001 ctgttcgagc tgttcgaccc cgccgatccc gaggtcgtgg ccggtcggcg caacctcgcc 1488061 aacgcgctgt actgaggccg gctggcgagc agacgcagaa tcgcctaaac ccgcacgggt 1488121 ttaggcgatt ctgcgtctgc tcgcgctggg cggctacgac aacccgggtg atccgttcag 1488181 gccgagcagc ccggcggtgc cgccggcgcc acccttaccc ggcgcactgc cgactccgcc 1488241 gtttccgccg ttgcctccat tgccgatcag ggtggcgttg ccgccggagc cgccgacacc 1488301 gccgtttgcc accacgcttg cgccgccggc gccgccgtcg ccgccgttgc cgaccatccc 1488361 ggccttgcca cccttgccgc cgttcccccc gtcgccggcg atgccgagtc cgccggcgcc 1488421 gccggcgccg ccatcgccgt tgagcaggcc ggcgttgccg ccggccccac cgtcgccggc 1488481 ggtctcgccg aacccgccgg ctccgccggc cccgcccgca cccgagagcc cggcggcgtt 1488541 gccgccggcc ccgccggccc ccccgaccga cccgaattcg atgccagtcc cgccggcgcc 1488601 accagcgccg ccgtcaccga tcaacccgcc ggtgccgccg gtgccgccgc taccggccgc 1488661 gccccggacg ctgtcgccgc cggtaccgcc ggcgccgccg tcgccgatca gcttggcggc 1488721 cccgccgctg ccaccggacc cgccgatgcc gccggccatt tggtccgcac tggaggcgcc 1488781 gaaccctccg gtgcccccgg cgccgccggg accgaacagt gcgccggcac cgccgatgcc 1488841 gccgacgcct cctttgccgc cggtgccgcc ggggtcgggc gcgccgagac cggttccgcc 1488901 ggtgccgcca atgccgccag cgccgaagag gatgccggcg ttgccgcccg ccccgccggc 1488961 cccgccctca ccgcccacga gttggttacc ggtgccagtt ccgccggtgc cgccggtccc 1489021 gccgttgccg gtgaagatgc cgccgtcgcc tccggtgccg ccgttccctc cggccgagcc 1489081 ggagactccg aacccgccgg cgccgccggc accgccattg gagaacagcc cgccgccccc 1489141 gccggtgccg ccggtgccac cggtgccccc gttgacgctc aacccgccgg tgccgccggc 1489201 accgccggcg gccaacacct cgaacagccc gctgcgaccg ccggccccgc cggcgccacc 1489261 gttgaagcct ggcccgccgg ccccgccgat gccaccggct ccgaacagcc cggccgcccc 1489321 gccgtcgccg ccgagcccgc ctgtgccggt gttcccgact ccgccgggcc caccggcgcc 1489381 gccggagccg aacagcagcc cgccggcccc gccggccccg ccagccgcgc cgttgcccgg 1489441 gccgtcaccc ccggccccac ccgccccgcc gttgccgaat agccccgcgg ctccacctgg 1489501 gccgccggcc tggccgggcg ccccggaccc gccggccccg ccgttgccgt acagcagccc 1489561 gccggccccg ccggcttgcc cggtccccgg tgcgccgttg gcgccgttgc cgatcagcgg 1489621 acgccccagc aacagctggg tgggcgtgtt cacaatgttg agcaggccct ccaacggcgc 1489681 cgcggcggcg gcctcagcgc tcgcgtacgc gccagcgccc gcagtaaagg tttgaacgaa 1489741 ctgggcatga aacgccgaca tctgagcgct caccgcctga taggcctggc catgcgcggc 1489801 gaacagcgac gcgatggccg ccgacacctc atcagcaccg gctgccagaa gttccgtcgt 1489861 cgggcccaat gccgcggcgt tggcggcccc cagcgtcgac ccgatgttcg ccaaatccga 1489921 agcggccctc accagcgtct cgggagcggc gattacaaac gacatgcttt cctccgatca 1489981 gctgtgcgtc gagtatccag ctcgagttag cacagggtag cgctatcgct tagcctttct 1490041 gatcaatctc ggagtgcagt gtgcagagtg catcgaatcg gctcatcagg catgtgcaat 1490101 ctgctcatgg caggcgctag gcgggcgtca gccacagcgc cgaagtgggc ggcagcacca 1490161 gcaccgcgga cgccgggcgg ccatgccagg ggtcgtcggt ggcgtccacg ccgccgaggt 1490221 tgccgatccc tgagccgtgg tagatcgtcg cgtcggtatt gagcacctcg cgccagcggc 1490281 ccgcgcgcgg cagcccgagt cgatagtcac ggtgttcggc acctgcgaaa ttgaacacgc 1490341 aggccagcac cgagccgtcg ctgccgtagc gcataaagct caacacattg ttggcggagt 1490401 cgttggcgtc gatccaagaa tagccttcgg gggtggtgtc taagctccac agcgccgggt 1490461 ggcatcggta gatgtcgttg atgtcgcgca ccagccgctg aatcccgttg gagaagccgt 1490521 tttcgtcgag ttggaaccag tccaggccgc gctgctcgga ccattcggcg cgttggccga 1490581 attcctgacc catgaacagc aattgcttgc cggggtgtgc ccattggtag gcaagcaggc 1490641 tacgcaggcc ggcggccttg acgtgattgt tgcccggcat ccgcccccac agcgtgcctt 1490701 tgccgtgcac cacctcgtca tgactgagcg gcaacacgta attttcgctg aacgcataca 1490761 gcatcgagaa cgtcatctcg tggtggtggt agctgcggta caccggatct cggctgacgt 1490821 agtcgagcgt gtcgtgcatc cagcccatgt tccacttcat cgaaaagccc aggccgccaa 1490881 tgttggtcgg gcgggtcacc ccagaccacg gcgtggactc ctcggcgatg gtgacgattc 1490941 ccggcgcgac cttgtgcgcc gtggcgttca tctcctgcag gaactgcact gcttccaggt 1491001 tctcccggcc gccgtggacg ttgggggtcc agccgccctc gggtcgcgag tagtctagat 1491061 agagcattga ggccaccgcg tccacccgca ggccgtcgat gtggaactcc tgtagccagt 1491121 acaacgcatt ggctaccaga aagttgcgca cttccgggcg gccgaagtcg aacacgtatg 1491181 tgccccaatc cagttgctcg ccgcgtttgg gatcggaatg ttcgtagagc ggagtgccgt 1491241 cgaaccgtcc cagggcccac gcgtccttcg ggaagtgcgc tgggacccaa tccacgatga 1491301 cgccgatgcc ggcctggtgc agggcgtcga ccagcgcccg gaagtcgtcg ggtgtgccga 1491361 atcgtgatgt cggcgcatag taggacgtga cctgataccc ccatgatccg gcgaatggat 1491421 gctcggcgac gggcaacagc tccacatggg taaacccttg atccacaatg taatccgtca 1491481 actcacgagc aagctggcgg tagctgagtc caggccgcca cgaaccgaga tggacttcgt 1491541 aggtgctcat cgcctcgttc accgggttgc gcagcgcacg cccagccatc cagtcgtcgt 1491601 caccccaggt gtagtcactc gacgtcaccc gcgatgcggt ctgcggcggc acctcggtgc 1491661 cgaacgcgaa cgggtcggcc cgatcggtaa ccacgccgtc ggcgccgtgc acgcggaact 1491721 tgtacagacc gtcgcaaggg aagtcgggcc agaacaattc ccatacccct gatgggccga 1491781 gcacccgcat gggggcttcg tggccattcc aaccgttgaa ctcgccgatc aagctgacgc 1491841 ccttggcgtt gggcgcccac acggcgaacg acacgccact caccacaccg tcggccgtgg 1491901 taaacgagcg ggggtgggca cccaggactt cccaaagccg ttcgtggcgg ccctcggcga 1491961 acaggtgcag gtcgacctcg cccagggtgg gcaggaatcg gtacgcatcg gccacggtgt 1492021 gtggctcgca accttcatag gtcacctgca ggcggtagtc gatgaggtcg acgaacggca 1492081 atgcgacggc aaacaggcca gaatcgaggt gctgcaacga gaaccggtcc ttaccaacga 1492141 gcgcgacgac ctcgacggca tgcggacgga acgctcggat gacggtatgg tcgtcgtatt 1492201 cgtgggcgcc caggatgccg tgcgggttgt gatgtgtacc cgccaccaag cgcgccattt 1492261 cggccggctc gggtgcaagg tgctccccgg tgagtttctc ggatcgactc atgagcccgt 1492321 cacctcctgc gcagcagcgt gtttcggctc tcgtagggca cggctggcat gttgatgatg 1492381 tgggcgactg cccgtgctgg gtcgatgcgg atgtaattgg cttgccccca ttggtattct 1492441 tcgccggtta tctcgtcgcg cacccaaaac cggtcgtagt cctccatgcc caacgccgcc 1492501 atgtccaacc acagcgtagc ttcttcagga ccaaatgcgt tgagtgtcac caccaccaac 1492561 acgcagtcgc cggtggccgg gtcgaacttg ctgtaggcca gcaacgcgtc gttgtcaacg 1492621 tggtgaaaat gaatggtacg caactgttga aacgccgggt gcagccggcg aattatattg 1492681 agccgtgtga tgaacggctg caaagatcta ccctggtcca gcgcgctggc aaagtcgcgg 1492741 ggacgcaatt cgtacttctc cgagtccagg tactcctcgc tgccctcgcg caccgcacgg 1492801 tgctcgaaaa gctcataacc gcagtacatc ccccaggctg ggctcatggt ggcggccagc 1492861 accgcgcgga tggcgaacat gcctggaccg ttgtgctgca gcaccgcgtg caggatgtcc 1492921 ggggtgttga cgaacaggtt gggccgacgg tagtcggcga gttcggctat ctggttgccg 1492981 aattcggtga gctcccactt ggtcgtgcgc caggtgaaat agctgtagga ctgcgtgaag 1493041 ccgagcttgg ccagcccgta ctggcgggcg ggcggggtga aagcctcgga caggaacagc 1493101 acgtcggggt cgacggtctt cacctgcgcg atcagccagg cccagaagtt gggtggtttg 1493161 gtgtggggat tgtcgacgcg aaagaacttg acgccgtggt taacccaatg ttgcaccacg 1493221 cgcagcactt cgtcgtacag gccctcggga tcgttgtcga agttgagcgg atagatgtcc 1493281 tggtacttct tcggtggatt ctccgcgtag gcgatggtgc cgtccggcag ctcggtgaac 1493341 cactgccggt gttcgcgggc ccacggatga tccggtgcgc attgcagcgc caggtccagc 1493401 gcgacctcca tgcccagatc gcgtgccgcg gagacgaagt cgtcgaagtc gtcgatggtg 1493461 cccaggctgg gatgaacggt atcgtgaccg ccctcatcgc taccgatcgc ccacggcgat 1493521 cccacgtctg tcggtgcggc ggtgggcgag ttgttgcgac ccttgcgatg caccttgcca 1493581 attggatgga tcggcggcag gtacaccacg tcgaacccca tgccggcgat gcgcggaagt 1493641 tctgccgcag cggtggcgaa ggtgccgtgt accgggttgc cgtcgtcgtc ccacccgccg 1493701 gttgagcgcg gaaacatctc ataccaagcg ccgaaccggg ccaacggccg atccacccag 1493761 acgccgaatt gctcgccccg ggtgaccagg tcccgcagcg gatagtcggc cagcagctct 1493821 tcgatttccg gtgtcagggc caacgcggtg cgggtcaccg ggtcaccggg ggtccgcagc 1493881 gctgccgcgg ccgccaggag gggatcgcgt aacccgcgcg gcacaccggt cgccgcgcgc 1493941 tccaacagca ccgcgcctac caacaggtcg ttggacagct cggtctctcc ctggccggca 1494001 tctagcttgg ctatcagccc atggcgccag gtgtggatcg ggtcacccca accatccacc 1494061 cggaaggtcc acaatccgac ccggtcgggg gtgaactggc cgtggaaaac gaagggctcc 1494121 tggccgctcg tcatcgggat cagcagcggc ttgacgcgtt gttggggctc gctcggcgtc 1494181 ggaagcaccc tggcccgggg tctgtcggtg aggtgtgggt aacgcactcc gaggtagcgc 1494241 acgaccagcg tcgctgcgac ggcctcgtgg ccttcacgcc agaccgccgc gctgaccggg 1494301 accacctcgc cgaccaccgc cttggcggga tatacgccgc acgaaacgac gggcgcgacg 1494361 tcatcgattt cgacacgacc gggcacccac cactccgttt ccgttccgat tgcccggcca 1494421 ctcaccggga catcttgtat gtgtcgttcc ttgtgtgtcc ttcttgcgcc cgatacccac 1494481 cctagtatcc gatcacaccc gcgaaggcac agcggtcggc gggcgcactg cacgcggtgg 1494541 catcctcagt aaggtaagga cgcgtgaaag cccttcgccg gtttaccgtc cgagcccacc 1494601 tacccgaacg tcttgccgcc ctggaccagc tgtctaccaa tctgcggtgg tcctgggaca 1494661 aaccgacaca ggatctgttc gcggcgatcg accctgcact gtgggagcaa tgcggtcatg 1494721 atccggtggc gctgctgggc gcggtgaacc cagcgcgtct cgacgaactt gcgctggacg 1494781 cagaattttt gggcgccctc gatgagctgg cggccgactt gaacgactac ctgagccgtc 1494841 cgctgtggta tcaggagcag caggacgccg gggtagccgc acaagccctg ccgaccggga 1494901 tcgcgtactt ctcgctggag ttcggggtag ccgaggtgtt gcctaattac tcgggcggtc 1494961 ttgggattct cgccggcgac catctgaaat ccgcgtccga tctgggcgtg ccgctgatcg 1495021 cggtggggtt gtactaccgc tccggctact tccggcaatc gcttaccgcg gacggctggc 1495081 agcacgagac ctacccatcg ctggacccgc aagggctgcc gttgcgtctg ctcaccgacg 1495141 ccaacgggga tccagtgctg gtcgaggtcg ccctgggaga caacgccgtg ttgcgcgccc 1495201 ggatctgggt agcgcaggtg ggtagggttc cgttgctctt gttggattct gatatcccgg 1495261 agaacgagca cgacctgcgc aacgtcaccg accgcctcta cggtggcgac caggaacatc 1495321 gcatcaaaca agagatcctg gccggcatcg gcggggtgcg ggcgattcgt gcgtacaccg 1495381 ccgtcgaaaa gctcaccccg cctgaggtct tccacatgaa cgagggccac gccggattcc 1495441 tcggcatcga acgcatccgt gaactggtca ccgatgcggg tttggatttc gacaccgcat 1495501 tgactgtggt gcggtccagc acggtgttca ccactcatac tcccgtcccc gccgggatcg 1495561 accggttccc gctcgagatg gtgcagcgct acgtcaatga ccagcgcggc gatggccggt 1495621 ctcggctgtt gcctgggttg ccggccgacc gcatcgtcgc gttgggcgcc gaggacgatc 1495681 cggccaaatt caacatggca cacatgggcc tgcggctggc gcagcgggcc aacggcgtct 1495741 cgttgctgca tggccgggtc agtcgtgcca tgttcaacga gctgtgggcg ggattcgacc 1495801 ccgatgaggt gccgatcggc tccgtcacca acggtgtgca cgcgcccacc tgggcggcgc 1495861 cgcagtggtt gcagctgggc cgcgagctgg ccgggtcgga ctctttgcgc gagcccgtcg 1495921 tttggcagcg actgcatcag gtcgatcctg ctcatctgtg gtggatccgc tcacaactgc 1495981 ggtcgatgct ggtggaggac gtccgggcgc ggttgcggca atcatggctg gaacgtggtg 1496041 caacggatgc cgaactgggt tggatcgcga cggcattcga tccgaatgtg ctcaccgtcg 1496101 gcttcgcccg gcgggtcccg acctacaagc ggctgacgtt gatgttgcgc gatcccgatc 1496161 ggctcgagca actgctgctc gacgaacagc ggccgatcca gctgatagtg gctgggaagt 1496221 cgcacccggc cgacgacggg ggcaaagcgc tgatccagca ggtggtgcgg ttcgccgacc 1496281 ggccgcaggt ccgccaccgc atcgccttcc tgccgaacta cgacatgtcg atggcccggc 1496341 tgttgtactg gggctgcgac gtctggttga acaacccgct gcggccgcta gaggcgtgtg 1496401 gtacctcggg catgaaaagc gcgcttaacg gcgggctgaa tttgtcgatc cgtgacggct 1496461 ggtgggacga gtggtacgac ggcgaaaacg gttgggagat accgtctgcc gacggtgtgg 1496521 cggacgagaa ccgtcgcgac gacctggagg ccggcgcgct ctacgacctg ctggcacaag 1496581 ccgtggcacc gaagttctac gagcgcgatg aacgcggggt gccgcagcgg tgggtagaga 1496641 tggtccggca taccctacaa acgctcgggc ccaaggtgct ggcttctcga atggtgcgcg 1496701 actacgtcga gcattactac gcgccggcgg cgcagtcttt tcgccggacc gcgggcgccc 1496761 agttcgacgc ggcccgcgag ctggccgact accgccggcg cgcggaagaa gcgtggccca 1496821 agatcgagat tgccgacgtc gacagcaccg gtctgccgga tactccactg ctcgggtccc 1496881 agctgaccct gacggcaacc gtgcggctgg ccgggctgag gccaaacgac gtgacggtgc 1496941 agggggtgct gggcagggtc gacgccggcg atgtgctaat ggatccggtc accgtcgaga 1497001 tggcgcatac cggcaccggc gacggcggct acgagatctt ctcgacgacg acgccgctgc 1497061 cgctggcggg gccagtcgga tacaccgtgc gggtgctgcc tcgccacccg atgctggccg 1497121 ccagcaacga gctcggcctg gtcaccctgg cctgacccgc cgagaagacg caaaagctcc 1497181 taaatctggc cgatttagtg ggcttttgcg tctgctcgcg caaggcgccg cagggccgcg 1497241 cgcacttgcg tggcgttggt ggtctgccaa aagggcggca gcgaggctcg caggaattcg 1497301 ccatagcggg cggtagccat ccgtgaatcg agcaccgcaa ccacgccccg atcggtgacg 1497361 cgccgtaaca gccggccgga tccctgtgcc agcagcagcg ccgcgtggct ggcggcgacc 1497421 gtcatgaagc cgttgccgcc acgggcggcc accgcacgct ggcgggcact cagcagggga 1497481 tcgtccggcc gggggaacgg gatgcggtcg atcaacacca acgacagcga cggtcccggc 1497541 acgtcgaccc cctgccacag cgacagcgtg ccgaacaggg aggtcgccgc atcggcggtg 1497601 aacttctcca ccagcgtgga cgtactgtcg tcgccctgac acaacaccgg cgtggacagc 1497661 cgttcgcgca tggcctcggt ggctgcccgg gcggcccgca tggacgagaa cagccccagg 1497721 gtgcgcccac ctgcagcggt gatgagttcg gcgatctcgg tcagttgttc ggccgagccg 1497781 ctgccgtctc ggcccggcgg cgggagatgg gcggccacgt agaggattcc cgactttgcg 1497841 tgctggaaag gcgagcccac gtccaggcca cgccagggcg tgtctgcagt caggccccat 1497901 gccgtggcca tcgcgtcaaa cgacccgccg attgtcagcg ttgccgaggt caatacggtc 1497961 gttgcacggg cgaacacctg ggtggccaac agctcggcca ccgatagcgg agccacccgc 1498021 agcaccgcgc gagccgattc gtggttgtcc tcgtgctcca gccaaaccac gtcgctgcgg 1498081 tcagggatag cgggggcgaa cgacgccagg attcgtgacg cggtatcgga tatttcggtc 1498141 agtaccgcgc ccgcttcggc gcgcacggac gccgtcgtgg tgtcgctgcc ggtatcgatc 1498201 gctgagcgcg ccgcactggc cgcatcgcgc agcgcgctca gataggtcgc catctcgtca 1498261 tcgaggcaat caatgcggcc cggtctggcg tcgtgaatcg ccgaactgaa ggtagccgaa 1498321 gccgcctgaa gccgctgggt cactttcggg tcgaccagcc gggtgatccg tcgtgcggcc 1498381 ataccgagcg tggcagacgt cagctcagcg gcggctaccg aggtcacccg gtcggccaat 1498441 tcgtgagcct cgtcgacaac cagcagccga tgttctggca gtaccgccga ttcggcgacg 1498501 gcatcgatgg ccagcagcgc gtggttggtg acgacgacat cggccaggcc ggccgctcca 1498561 cgagcccgtt cggagaagca ctccgagcca aacgggcagc gggccacgcc gaggcattcc 1498621 cgcgccgaaa cgctgacctg cgaccaggat cggtctccca caccgggctt aaggtcgtcg 1498681 cgatcaccag acacggtcgt cgaagcccag gcggttagcc gttgcacatc gcgtcccagc 1498741 gcggtgaccg ccaccgggtc gaagagctcc tcctgcggcc gctcgtcgtc atggtcactg 1498801 gctgtgactg agttgtggat cttgttcagg cacaggtagt tccgtcgacc tttgagcagg 1498861 gcgaacttcg gtcggcgggg gagcgcattg gtgagcgaat ctaccagctg gggcaggtca 1498921 cgatcgacga gttgacgttg caaagcgatc gtcgccgtcg acaccacgac cggcgcgtcg 1498981 tcgcaaagag cgcggatgat cgcgggaacc agatacgcca gcgacttgcc ggttccggtg 1499041 ccggcctgga ccaccaagtg ctcaccggtt tcaaacgcat gcgctaccgc ggcggccatc 1499101 tcttgctggc cgcgacgccg ggtgccgcca agtgccgcca cggcgatggc aagcagctca 1499161 ggcacagaca tggataccga ctcggacacg ggacgtggtc acatccttgc gctcaggccg 1499221 ggatcgtgcg tgtcggaatc gccggctcgc cgggtgccaa tttcagcccg tcgcctggca 1499281 ggctgcgcaa cccggatgcc accagctgcc gtgccgcggc caggctggta tcggctaccg 1499341 gttgcccggc gcggaccagt ggcagcgtca aaacccggtg cggctcgaca atgaccggcg 1499401 gacggcccgc cggatgcacg agctcctcgg tgatggtgcc cgtcgcacgg gagcgccgca 1499461 gtgcctcttt gcggccgccg ggggattctt tgtagctgct gcgcttttgc accggtacac 1499521 cgtctacctc gaccagtttg tagaccatgt tggcggtcgg cgcgcccgac ccggtgacca 1499581 gcgacgtgcc cacgccgtag ctgtcgacgg gttcaccgcg caacgcggcg atgctgaact 1499641 cgtcaaggtc gccggacacc acgatgcgcg tccgggtggc tcctagccgg tcgagctgct 1499701 cccgcgcttg gcgggccagt accccaagct caccggaatc gatgcggatc gcgccgagct 1499761 cagcgccggc ggcggcaacg gcattggcca caccggtcgt gacgtcatag gtatccacca 1499821 gcagcgtggt accgggtccc agcgcttcga cctgggcgcg gaatgcggct cgctcggcta 1499881 gttcggtggg gccgccatgc tgggcgtgca acatggtgaa tgcgtgtgcc gcggtgccgt 1499941 gcgcgggcac tccgtagcgt cgctgcgccg ccaagttgga tgacgcggcg aaaccggcga 1500001 tatacgccgc ccgggccgct gccaccgcgg cgcgttcgtg ggtgcgccgc gagcccatct 1500061 cgatcagtgg gcgccccccg gcggcgctga ccatgcgcgc cgctgccgag gcgatcgctg 1500121 tgtcgtggtt gaagattgac agcaccagcg tttcgagcag gacgcattcg gcgaagctgc 1500181 cgcgtaccga gagcaccggt gacccgggaa aatacagctc cccctcggca tagccgtcga 1500241 tatcgccgcg gaaccggaat tcgcgaagat accgcaccgt ggccgggtcg aggaattggg 1500301 ccagcaactc gcacgcgtca gcgtcgaacc tgaactgcgg caacgcttcc agcaaccggc 1500361 cggttccggc gacaactccg tagcgacggc cggtggggag tcggcgagcg aacacctcga 1500421 atgtggtggg gcgattggcg ctgccgtcgc gcagggcagc cgccagcatg gtcaactcgt 1500481 acttgtcggt caacagcccg gctgggtctt gattgtcggg ctctccctct cgccgcctgg 1500541 cggctggggg tggccccaca gcggtccgac gcggtccgca gcgtcgcccg gttgggaccc 1500601 agtcgttcac accgccacgg tatcggctcg cggccacggt gcgctgggta tcctggggcc 1500661 atggctgttg tgtcagcgcc cgccaagcca ggtaccacct ggcagcgcga gtctgctccg 1500721 gtcgacgtga cggacagggc atgggtcacc atcgtgtggg acgacccggt caacttgatg 1500781 agctacgtga cttacgtgtt tcagaagttg ttcggctaca gcgagccgca tgccaccaag 1500841 ctgatgttgc aggtgcacaa cgaaggtaag gcggtggtgt ccgcgggcag ccgagagtcc 1500901 atggaagtcg acgtgtccaa gctgcatgcc gccggtttgt gggcgacgat gcagcaggac 1500961 cggtgagatt cgaggatatt cgggatccat cgtgcgcagg tggaagcgcg tcgagacccg 1501021 cgatggtccc cgctttcgat cgtcgttggc tccgcatgag gccgccctgc tcaagaacct 1501081 ggcaggcgcg atgatcgggc tgctcgacga tcgcgactct tcttcgccgt cagacgaact 1501141 cgaggagatc accggcatca agaccgggca tgcgcagcgt ccgggtgacc cgaccttgcg 1501201 tcggctgttg ccggatttct accgtcccga tgacctggat gacgatgatc cgacggccgt 1501261 cgacggctcc gagagcttca acgctgccct gcgcagcctg cacgaacctg agattatcga 1501321 cgccaaacgt gttgccgcgc agcagttatt agacacggtt ccggacaatg gcggccggtt 1501381 ggagctgacg gaatccgacg ccaatgcttg gatcgccgcc gtcaacgacc ttcggctggc 1501441 gctcggagtg atgcttgaga tcggcccgcg tgggccggag cgcctgccgg ggaaccaccc 1501501 gttggccgcg cacttcaatg tctaccagtg gctgacagtc ctgcaggaat acctcgtgct 1501561 ggtgctgatg gggtctcgat gatctgcgcg gcggcccgat gaactccatc accgacgtcg 1501621 ggggcatccg ggttggccac taccagagac tggaccccga cgcgtccctc ggcgccgggt 1501681 gggcttgtgg cgtcacggtg gtgttgccgc cgcccgggac ggtcggtgcg gtcgattgcc 1501741 gcggcggcgc ccctggaacc cgcgagactg atctgctgga cccggccaac agcgtgcgct 1501801 tcgtcgacgc cctgttgctc gccggcggca gcgcctacgg tctggccgcc gccgatggcg 1501861 tcatgcgctg gctagaggaa caccggcgcg gcgtcgcgat ggacagcggc gtggtgccca 1501921 tcgtgccggg cgcggtgatt ttcgaccttc cggtcggcgg ctggaattgt cggccgacgg 1501981 ccgatttcgg ctattcggcc tgtgcggcag ccggagtcga cgtcgcggtc gggacggtgg 1502041 gcgtgggggt tggggcgcgc gccggagcgc tcaagggcgg tgtcgggact gcatcggcta 1502101 ccctgcagtc cggtgtgacc gtcggtgtcc ttgctgtggt aaatgccgct ggcaacgtcg 1502161 tcgatccagc caccggcttg ccgtggatgg ccgacctagt cggcgagttc gcgttgaggg 1502221 ccccgccggc cgagcagatt gctgcgctgg cgcagttatc gtccccgctg ggagccttca 1502281 acaccccgtt caatacgacg atcggtgtga ttgcgtgtga cgccgcgctg agccctgcgg 1502341 cttgccggcg catcgcgatt gccgcccacg acgggttggc ccgcaccatc cggccggcac 1502401 acaccccctt ggatggcgac acggttttcg cgctggccac cggcgcggta gcggtgccgc 1502461 cggaggccgg cgtgccggcc gcattgtctc cggagactca gctggtcacc gcggtcggtg 1502521 cggcggcggc tgattgcctg gctcgtgcgg tgctggccgg cgtgctcaat gctcagccgg 1502581 tagccggaat accgacctac cgtgacatgt ttcccggagc attcgggtcc tgaaacttcg 1502641 gtgttgctta ggaaaggaac cgtctacgtg ctggtgattc gcgcagacct ggtgaatgcg 1502701 atggtggccc atgcgcgtcg cgaccacccc gacgaagcct gcggagtgct ggccggaccc 1502761 gagggctctg accgtcccga gcggcatatc ccgatgacca atgccgagcg ctcgccgacc 1502821 ttctaccggt tggattccgg tgagcaactg aaggtgtggc gggctatgga agatgccgac 1502881 gaggtcccgg tcgtcatcta tcactcgcac actgcgaccg aagcgtaccc gagccgtacg 1502941 gacgtgaagc ttgccaccga acccgacgcg cactacgtgc tggtgtccac ccgcgacccg 1503001 caccggcacg agctacgcag ctaccgcatc gtcgatggcg ctgtcaccga ggaacctgtc 1503061 aatgtcgtcg agcagtactg aaccgttccg agaaaggcca gcatgaacgt caccgtatcc 1503121 attccgacca tcctgcggcc ccacaccggc ggccagaaga gtgtctcggc cagcggcgat 1503181 accttgggtg ccgtcatcag cgacctggag gccaactatt cgggcatttc cgagcgcctg 1503241 atggacccgt cttccccagg taagttgcac cgcttcgtga acatctacgt caacgacgag 1503301 gacgtgcggt tctccggcgg cttggccacc gcgatcgctg acggtgactc ggtcaccatc 1503361 ctccccgccg tggccggtgg gtgagcggag cacatgacac gatacgactc gctgttgcag 1503421 gccttgggca acacgccgct ggttggcctg cagcgattgt cgccacgctg ggatgacggg 1503481 cgagacggac cgcacgtgcg gctgtgggcc aagctcgagg accgcaatcc gaccgggtcg 1503541 atcaaggacc gcccggctgt gcggatgatc gagcaggccg aggccgacgg gttgttgcgg 1503601 ccgggcgcca ccatcctgga gcccaccagc ggaaacaccg gcatttcgct ggcgatggcg 1503661 gcccggttga aggggtaccg attgatctgc gtgatgccgg agaacacatc ggttgaacgg 1503721 cggcagctgc tcgagctcta cggcgcgcag attatcttct cggcggccga aggcgggtcc 1503781 aacactgcgg tggccaccgc caaagagctg gccgcgacca acccgtcatg ggtgatgctg 1503841 taccagtacg gcaatcccgc caacaccgac tcgcactact gcggcaccgg ccccgagctg 1503901 ctggccgacc tgcccgaaat cacgcacttc gtcgccggcc taggcaccac gggcacgctg 1503961 atgggcactg gccgtttcct gcgcgagcac gttgccaacg tcaagatcgt ggcggccgaa 1504021 ccccgctacg gtgagggggt atacgccctg cgcaacatgg acgaaggctt tgtgcccgag 1504081 ctgtatgacc cggaaatact gaccgcgcga tattctgtcg gcgcggtgga cgcagtgcgc 1504141 cgcacccgcg agttggtgca caccgaaggc atctttgcgg gcatctcaac cggcgcggtg 1504201 ctacacgccg cactcggagt cggggccggc gccctggcgg ccggcgagcg ggccgacatt 1504261 gcgttggtgg tcgccgacgc cgggtggaag tatctgtcca ccggcgccta cgccggtagc 1504321 ctggatgacg ccgagaccgc tctggaaggg caactatggg catgaccccg cgccggaagc 1504381 gacggggagg agcggtgcag ataacacggc ccacaggccg tccgcgaaca ccgacaacgc 1504441 agacgacgaa gcgcccgcgc tgggtggtcg gcgggacgac gatcctcacc ttcgtcgcgc 1504501 tgctctatct cgtcgaactg atcgaccagc tgtccgggag tcggctggac gtcaacggca 1504561 tcaggccgct gaaaacagac ggcctgtggg gcgtcatctt tgcgccactt ttgcacgcga 1504621 actggcacca cctaatggcc aataccatcc cgctgctggt gctggggttt cttatgacgc 1504681 tggccgggct gtcccggttt gtctgggcca ccgcgatcat ttggattctg ggcggcttgg 1504741 gcacttggct gatcggcaat gtgggcagca gctgtggccc gaccgaccat atcggcgcct 1504801 ctggcctgat ctttggctgg ctggccttcc tattggtgtt cgggcttttt gtgcgcaagg 1504861 gatgggatat cgtcattggg ctggtggtct tgtttgtcta tggcggcatc ctgctcggcg 1504921 cgatgccggt gctgggccag tgtggtggcg tgtcatggca gggtcattta agtggtgcgg 1504981 ttgctggcgt cgtggcggcg tatctgttgt ccgctccgga gcgtaaggcc cgtgcactga 1505041 aaagggccgg cgcgcgttcc gggcatccga agttatgaat tcgccgttgg cgcccgtcgg 1505101 agtctttgat tccggcgtcg ggggactgac ggtcgcgcgg gccatcatcg accaactgcc 1505161 cgacgaggac atcgtctacg tcggcgacac cggtaacggc ccgtacggtc cgctgaccat 1505221 cccggagatc cgggcgcacg cgctggccat cggcgacgat ctggtcggcc gaggcgtcaa 1505281 ggcgttggtg atcgcctgca actcggcgtc gtcggcgtgc ctgcgggatg ctcgcgagcg 1505341 ctaccaggtg cccgtcgtcg aagtgatact gccggcggtg cggcgtgcgg tggccgccac 1505401 ccgcaacggc cgcatcgggg taatcggcac gcgggcgacc atcacttcac acgcctatca 1505461 ggacgcgttc gctgcggccc gcgacaccga aatcaccgcg gtggcttgcc ctcgcttcgt 1505521 ggacttcgtc gagcgcggcg tcaccagcgg tcgtcaggtg ctcggtctgg cgcagggcta 1505581 cctggaaccg ctgcagcgcg ccgaggtcga cacgctagtg ctgggctgta cgcactatcc 1505641 actgctgtcc ggactgattc aactggcgat gggcgagaac gtcacgctgg tctccagcgc 1505701 cgaggagacc gctaaggaag tggtccgggt gctcaccgag atcgacttat tgcgtccgca 1505761 tgacgcgccg ccggcaactc ggatatttga agctacgggc gaccccgaag cgtttaccaa 1505821 attggccgca cgattcctgg gtccggtgct cggtggtgtg caacccgttc acccatcgcg 1505881 cattcattag gccatggaag agattctcgt caccgaatgc gtcgatgtat tccgcatcgt 1505941 tgtatcgggc atggcacagt agtgtccgtg cggataaccg tgctcggatg ctccggtagc 1506001 gtcgtggggc cggattcgcc tgcgtcgggg tatttgctcc gagcgccgca cacaccgccg 1506061 ttggttatcg acttcggcgg gggtgtgctc ggcgcgctgc aacggcacgc ggatcccgcg 1506121 tcggtgcatg tgctgctgtc gcatctgcat gcggaccatt gtctggactt gccgggactt 1506181 tttgtgtggc ggcgttacca cccgtcgcgt ccctctggca aggcattgtt gtacggcccc 1506241 agcgacacct ggtcgcgatt gggggcggcg tcgtccccgt acggtgggga gattgacgac 1506301 tgttcggata tcttcgatgt tcaccactgg gccgacagtg agccagtgac gttgggcgcc 1506361 cttacgatag tgccgcggct ggttgcccac ccgactgagt cgtttggcct gcggatcacc 1506421 gatccgagcg gtgcgtcact ggcttatagc ggcgacaccg gcatttgtga ccagctcgtc 1506481 gagctggctc gcggcgtcga cgttttcctc tgcgaggcct cctggacaca ctcgcccaaa 1506541 catccacccg atctacacct gtcgggcacc gaagccggta tggttgccgc gcaagccggc 1506601 gttcgtgagc tgctgctgac gcatatcccg ccgtggactt cgcgtgagga cgtcatcagc 1506661 gaggccaagg ccgagttcga cggcccggtg cacgcggtgg tatgcgacga gacgttcgaa 1506721 gtccggcgag ccggctaggt ctagggttgg cgtcgtgtcc aagcgagaag acggccggct 1506781 cgaccacgag cttcgcccgg tgatcatcac ccgcggtttc accgaaaacc cggcgggatc 1506841 ggtgctcatc gaattcggtc acaccaaggt cctgtgcacc gccagcgtca ccgaaggggt 1506901 gccccggtgg cgtaaagcaa ccggtctggg gtggctcacc gcggagtacg ccatgctgcc 1506961 gtcggccacc cacagccgct ctgatcgcga gtcggtgaga ggcaggctta gcgggcgtac 1507021 tcaggaaatc agtcggctca tcggccggtc gctgcgcgca tgcatcgacc tggcggcgct 1507081 gggggagaac acgatcgcta tcgattgtga tgtgttgcag gccgatggtg gcactcgaac 1507141 cgcggccatc accggcgcct acgtggcatt ggccgacgca gtgacctact tgtcggcggc 1507201 gggtaagttg tccgacccca ggccattgtc gtgtgccatc gccgcggtca gcgtcggtgt 1507261 tgtcgacggc aggatccggg tggatctgcc ctacgaggaa gattcgcgcg ccgaggtcga 1507321 catgaacgtc gtcgctaccg acaccggaac cctggtagag attcagggca ccggcgaagg 1507381 cgcgacgttc gcacgttcga cactggataa gctgctggac atggcactgg gcgcctgcga 1507441 cacgttgttt gccgcacaac gcgacgcgtt ggcgctgccg tatccgggtg tgctgccgca 1507501 gggaccgcca ccgccgaagg cgtttggcac ctgaccgcgc cgcgacgatg cagagcggag 1507561 cgatgaggag gagtggcgct tgtgaccaag cttctggtcg ccagccgcaa ccgcaaaaag 1507621 ctggccgaac tgcgccgggt gttggacggc gccggactat cgggtttgac gctgttgtcg 1507681 ctgggcgatg tgtcgccgct gcctgaaaca ccagaaaccg gtgtgacatt cgaggacaac 1507741 gcgctggcca aggcgcgcga cgcgttctcc gcgaccggac ttgccagcgt tgccgacgac 1507801 tccggtttgg aggtggccgc actgggcggc atgcctggcg tgctgtcggc ccggtggtcc 1507861 ggcaggtatg gcgacgatgc cgcgaacacc gcgctgttgc tggcgcagtt gtgcgatgtg 1507921 cccgatgagc ggcgcggagc agcgttcgtg tcggcctgcg cgttggtctc ggggtccggc 1507981 gaagttgtcg tgcgcggtga atggcccggc acgatcgccc gtgagccgcg cggtgacggc 1508041 gggttcggct acgacccggt cttcgtcccg tacggtgacg accgcacagc ggcccagctg 1508101 agcccggcgg aaaaggacgc ggtatcccat cgcggtcgcg cgttggctct gctgctgccg 1508161 gcgctgcgct ccctggcgac aggctaaagc ccgaagcggg ccttgatctc tttggtctgg 1508221 aagtgctcga cgacgatgcc gagcagcgga attgtgccgg cgagcagaac accggctgtt 1508281 ttgccgagcg gccagcggac cttgaccgcc aggttcaacg tcagaagcag atacgtgaag 1508341 tacacccagc cgtgcaccac accgatccac gtcggcggat tgtcaacctt gacgacgtag 1508401 cggaccacga tctcgtagca cagtgcgatg agccagaggc ccgtcgtcca cgccatgatc 1508461 cggtagccga gcaaagcggt gcgaatcctc tcgacggcga tggcaggctc ggcgtgctgc 1508521 gccgcgggcg tttcgggtgc ggtcatgcgg tggtcctgtt ctgcttcctg gcatcgtcct 1508581 tggctagctc ggctaggtag gcgttgtatt cccgtagtac gggatcgtcg ggtggctgct 1508641 gcgccggctt cggccgctcg ggcagcaatc cggcaggtat ctcggcggcg gcgccgccgg 1508701 tgggcggttg cgggggcgtc tcttcatacc gaacgaagtt gcggtacgcg tagacgcaga 1508761 accaagcaaa caatggccac tgcaacgcgt aacccagatt ttgaaaggtg cccgaggtcg 1508821 attgaaacct ggtccactgc caccaaccca gggccaggca accacaggtc gcgatgatca 1508881 ccaacgcgat cagcgcgggt ctgcgacggc gggtagtgga caccccacga cgttaccgcg 1508941 cactgctcta ttgggcgccc gggcgcgatg tggcgatatc cactaagtac aaggctagcc 1509001 ttgcctaata ccccaggtgt agcctccttc gccatgacct catcgccgtc caccgtcagc 1509061 actacgctgc tgagcatcct gcgcgacgac ctcaacattg acctgactcg agtcacgcct 1509121 gatgccaggt tggtcgacga tgtgggactg gattcggtgg ccttcgcggt cggtatggtg 1509181 gccatcgagg agcggctcgg agtcgcactg tccgaagagg agctcttgac gtgcgacacg 1509241 gtcggagaac tggaggcagc gatcgcggcc aaataccgcg atgagtgagc tcgcggccgt 1509301 gctcacgcgg tccatgcagg cctctgccgg cgacttgatg gtcctcgacc gcgagacctc 1509361 gctgtggtgt cggcacccgt ggcccgaggt acacgggctg gccgagagcg tagcggcctg 1509421 gctgctagac catgaccgac ccgccgcggt gggtctggtc ggcgaaccga cggtcgagtt 1509481 ggtcgccgcg atccagggtg cctggcttgc cggcgctgcc gtgtcgatcc tgcccgggcc 1509541 ggtacgtggc gccaatgacc agcgatgggc ggacgcgacg ttgacccgtt tcctcgggat 1509601 tggggtgcgc accgtattga gccagggttc ctaccttgcc cgcctgcgat cggtcgatac 1509661 ggccggcgta acgatcggag atctcagcac ggcggcgcac accaatcgtt cggccacacc 1509721 ggtggcgagt gaagggcccg cggtccttca aggtaccgcg ggatcgacgg gcgcgccccg 1509781 taccgccatc ctttcgccgg gcgcggtgct cagcaacttg cgtgggctca atcagcgcgt 1509841 gggcaccgat gctgcgaccg acgtcggttg ctcatggtta ccgctgtacc acgacatggg 1509901 gctcgctttc gtgctctctg ctgcgctggc cggtgcgccg ctctggttgg ccccgacgac 1509961 ggcgttcacg gcgtcgccgt tccgttggtt gagttggctc tcggacagtg gtgccaccat 1510021 gaccgcggca ccgaacttcg cctacaacct catcggcaaa tacgccaggc gggtatccga 1510081 ggtcgacctg ggtgccctgc gagtgacgct caacggtgga gagccggttg actgcgatgg 1510141 gctgacgcgg ttcgcggagg cgatggcacc gttcggattc gatgccggcg ccgtgttgcc 1510201 ctcctacggg ctcgccgagt cgacgtgcgc ggtgaccgtg ccggtccccg gaattgggtt 1510261 gcttgccgac cgtgtcatcg acggcagcgg tgcgcataag cacgcggtcc tgggtaaccc 1510321 catccccggt atggaggtac ggatctcgtg cggtgatcag gcggcaggca atgcgagccg 1510381 tgaaattggc gaaatcgaga ttcgcggtgc gtcgatgatg gcgggttacc tgggtcagca 1510441 gccgatcgac cctgacgatt ggtttgccac cggcgacctc ggctatcttg gcgctggcgg 1510501 cctggtggtg tgtggtcgcg cgaaggaagt catctccatc gcgggacgca acatctttcc 1510561 gacggaggtc gagctggtgg cagcgcaagt tcgcggagtg cgcgaaggcg ccgtggtcgc 1510621 cttgggcacc ggtgatcgct cgacccgccc cggtctggtg gtcgcggccg agttccgcgg 1510681 cccagacgag gcgaacgccc gcgccgaact gatccaacgc gttgcgtccg agtgcggtat 1510741 cgtcccgtcc gacgtcgtct tcgtgtcgcc tggatcactg ccccggacgt cgtctggaaa 1510801 actgcgccgc ttggcagtcc ggcgctccct ggagatggcg gactgatgac ggccggctcc 1510861 gacctcgacg acttccgcgg tttgctcgcc aaagcgttcg acgagcgggt ggtggcatgg 1510921 accgcagaag cggaagcgca ggaacgtttt ccgcgccagt tgatcgaaca cctgggtgtc 1510981 tgcggcgtat tcgatgcgaa gtgggcgacc gacgcccgtc ccgacgtcgg taaactcgtc 1511041 gaactcgctt tcgcgttggg ccagctggcc tctgccggca tcggtgtggg tgtcagcttg 1511101 catgactcgg cgatcgcgat tttgcgccgg tttggtaagt cggactactt gcgggatatc 1511161 tgcgatcagg cgatccgtgg cgccgcggtg ctgtgcatcg gagcctcgga ggagtccggc 1511221 ggatccgacc tgcagatcgt cgaaaccgag atacggtccc gtgacggtgg tttcgaggtc 1511281 cgcggcgtca agaaattcgt gtcgctgtct ccgatcgccg accacatcat ggtggtggcc 1511341 cgcagcgtcg accacgatcc gaccagtagg cacggcaatg tcgcggtcgt ggccgtgccg 1511401 gccgcacaag tcagcgtgca gaccccctac cgcaaggtcg gtgcgggacc gctggatacc 1511461 gccgcggtct gcatcgacac ctgggtaccg gccgatgcac tggttgcgcg ggccggcacg 1511521 gggctggcag ccatcagttg gggactggct catgagcgga tgtcgatcgc cgggcagatc 1511581 gcagcgtcgt gtcaacgggc gatcggaatc accctggccc gcatgatgag tcgacgtcag 1511641 ttcggtcaga cgctgttcga acaccaggcg ctgcggctgc gtatggcgga cctgcaggcg 1511701 cgtgtcgatc tgctgcggta cgcgctgcac ggcatcgctg aacaggggag actggaactg 1511761 cgcacggcgg cagcggtcaa agtcaccgcc gcccggctcg gtgaggaagt catctccgaa 1511821 tgcatgcaca tcttcggtgg ggcgggttat cttgtcgacg aaacgacgct tggcaaatgg 1511881 tggcgggaca tgaagctcgc ccgggtcggc ggcggcaccg acgaggtgct gtgggaattg 1511941 gtggctgccg gcatgacgcc cgatcacgac ggttacgcag ccgtggtcgg agcttccaaa 1512001 gcgtagagcg ccatgcgccg gtttgtcgtg tcatgctcac cgaggaactt gcatccggcc 1512061 cactcacaca accgacgggt cgcggtgttg cggtgatcgg ggtcgaacat gatccgccgg 1512121 caacgcggct cgttggcaaa gacgctggcc acgatccgcg gtagcagcag cgggccgaag 1512181 ccccgattga ccttcgacaa gtccgcgatg gccgcgtgca gccccaaatc gtaggggtct 1512241 gcgtcgtagt agtgagaaat caaatccttt gctgcccagt ataattcgag ataaccacca 1512301 tctgttccgt gccagctgcc gatcaatggc aacgaatagg ttccctcaag ttgggcgttc 1512361 aggtgttgac gccaacgtga cgccggccag tcgtactccc aggccgccgc cagatgagga 1512421 cggttcatcc actccgccaa catctccgcg tcggtcagct gtgcgacccg caacccgtat 1512481 ggcggctcca acgatggaac gggcgggcgg gcgaggcgtc gtacctggtc aggtaggtcg 1512541 aatcgctcgc gggctagccg aaccagcgcg tcgtcggcct ggccagcgga tgtgggtttg 1512601 gtcattgcgg gccgagctta ccggagggct cgctgcttag gttaggcatg ccatacatgc 1512661 gtgagccggg atcacgtcgc ccgctgcccg gctgtccggg ggtcgaggcg gtacgatcgc 1512721 tacgcccgcg ggcgtgatga aattggcaaa catgccggtt ttaggtgccg gtgctcgaaa 1512781 gagtttgagg gttcgagtcc ctccgcccgc actccatggt ccccgagttt gaccttcggt 1512841 aaggcaaccc ttagtttgga cgagatcgtc cgactggggc cgactgggtt gtatgcgcgg 1512901 gctgagtatc agcgcggtcg cggcgcagct cggggtatcg gcggagcgcg acgccgttgc 1512961 acgccggttg gccggtaacc cagcgttcgt ggtcgcccga tctgagaagt cgtggcggat 1513021 taggccgccg cgagagagga ccgctgatgg cacgcgggtt gcagggtgtg atgttgcgca 1513081 gtttcggcgc gcgcgaccac accgcaacgg tgatcgaaac catttcgatt gcaccgcatt 1513141 tcgtgcgggt ccggatggtt tcgccgacgc tcttccagga tgcggaggct gagcccgccg 1513201 catggctgcg gttctggttc cccgacccga acgggtccaa caccgagttc cagcgcgcct 1513261 atacgatctc cgaagctgac cccgccgcgg gccgcttcgc ggtcgacgtt gtattgcatg 1513321 acccggcggg tccggcctcg tcgtgggcgc gcaccgtcaa acctggcgca accatagcgg 1513381 tcatgtcgct gatgggctca tcgcggttcg acgtgcccga ggagcagccc gccgggtatc 1513441 tgctaatcgg cgactcggcg tcgattccgg ggatgaacgg gatcatcgaa acggtcccga 1513501 acgacgtccc gatcgagatg taccttgaac aacacgacga caacgacacg ttgatcccgc 1513561 tcgcaaagca tccccggctg cgggtgcgct gggttatgcg ccgcgacgag aaatcgctgg 1513621 ccgaggcgat cgagaaccgc gactggtcgg actggtatgc gtgggcgacg ccagaggctg 1513681 ccgcgctgaa atgcgtccgg gtgcggctgc gcgacgagtt cgggttccct aagtccgaga 1513741 tccacgctca ggcttactgg aacgccgggc gtgccatggg cacccaccga gcaaccgaac 1513801 cggcggccac cgaacctgag gtgggcgcag ccccgcagcc agaatcggcg gtgcctgccc 1513861 cggcgcgtgg cagctggcgc gctcaggctg ccagccggct gctggcgccg ctaaagctgc 1513921 cgctggtgct ctcgggtgtg cttgcggctc tggtcacgct ggcgcagttg gcgccgttcg 1513981 tgctgttggt cgagctgtca aggctgctgg tctccggcgc cggcgcgcac cggttgttca 1514041 cggtcgggtt cgccgcggtg gggttgctgg ggaccggggc cttgctggca gccgccctca 1514101 cgctgtggct gcacgtgatc gatgcccgct tcgccagggc gttgcgcttg cggctgctga 1514161 gcaagctgtc ccggttgccg ctgggctggt tcaccagccg cgggtccgga tcgatcaaaa 1514221 aattggtcac cgacgacacg ctggcgttgc actacttggt cacccatgcc gttccggacg 1514281 cggtcgccgc ggttgtcgcc ccggtggggg tgctggtcta tctgttcgtc gtggactggc 1514341 gagtggcgct ggtcttgttc gggccggttc tggtctacct gaccatcacg tcatcgctca 1514401 cgatccaatc cgggccccgc attgttcaag cgcagcggtg ggcagagaag atgaacggcg 1514461 aagcgggtag ttacctcgag ggtcagccgg tgattcgcgt cttcggcgcc gcgtcatcga 1514521 gcttccgtcg ccggttggac gagtacatcg gattcctggt cgcctggcag cggccgctgg 1514581 ccggcaagaa aaccctgatg gatctggcca ctcgcccagc aacgttcctg tggctcatcg 1514641 ccgctaccgg caccttgttg gtagccacgc atcgaatgga tccggtgaat ttgttgccgt 1514701 tcatgttctt gggtaccacg ttcggtgccc gcctgctcgg gatcgcctac gggctcggcg 1514761 gcctacgcac gggacttctg gcggcccggc acctgcaagt cacactcgac gaaaccgaac 1514821 tcgccgtgcg ggaacatccg cgcgaaccgc tcgacggcga ggcgccagca actgtggtgt 1514881 tcgaccacgt caccttcggg taccgccctg gagtgccggt gatccaggat gtatcgctta 1514941 cgctgcggcc gggcacggtc accgcgctcg tcggcccgtc cggctccggc aagtcgacac 1515001 tggccaccct gctggctcga ttccacgatg tcgagcgagg tgcgatacgc gttggtggac 1515061 aggatattcg atcactggcc gcggacgagc tgtacacgcg agtcggcttt gtgctacagg 1515121 aagcccagct tgtgcatggc accgccgccg aaaacatcgc gctggcggta ccggatgccc 1515181 ccgccgaaca ggtccaggtc gcggcccgcg aagcgcaaat ccacgaccgg gtgcttcggc 1515241 tgccggacgg ctacgatacc gtgctcggag ccaacagtgg tctttcgggc ggggagcgac 1515301 agcggctcac cattgcccgt gccatcctcg gcgacactcc ggtcctcatc ctcgacgagg 1515361 ccaccgcgtt tgccgatccg gaatcggaat accttgtgca acaggcgctt aaccggctga 1515421 cccgggaccg caccgtgctg gtaatcgccc atcgactgca taccatcacc cgggccgacc 1515481 agatcgtcgt gctcgatcat ggtcggatcg tcgaacgcgg cacccacgag gagttgcttg 1515541 ccgcgggcgg acgctactgc cggctgtggg acaccggcca gggcagccgg gtggcggtcg 1515601 ccgcagcgca ggacggcacc cgatgatccg cacctggata gcccttgttc cgaacgacca 1515661 ccgcgccagg ctaatcggct ttgcgctgct cgcgttttgt tccgttgtcg cgcgagcggt 1515721 gggcaccgtg ttgctggtgc cgctgatggc ggcgttgttc ggggaggcgc cgcagcgcgc 1515781 gtggctgtgg ctgggctggc tgtccgccgc gaccgtggcc gggtgggtgc tagacgccgt 1515841 gaccgcacgc atcggtatcg agctgggttt cgccgtcctt aaccacaccc aacatgatgt 1515901 ggcggaccgg cttccggttg tccggttgga ttggtttacc gccgaaaaca ccgcgacggc 1515961 acggcaggcg atcgcggcca ccgggccgga acttgttggc ctggtggtta atctggtgac 1516021 accgttgacc agcgcgatcc tgctgccggc agtgatcgcg ctggccctgt tgccgatctc 1516081 ctggcagctc ggcgtggctg cactggccgg cgtgccgttg ctgctggggg cgctgtgggc 1516141 ctccgcagcc tttgcgcggc gtgccgatac cgcagcagac aaagccaata ccgcgctcac 1516201 cgaacggatt atcgagttcg ctcggactca acaggcattg cgggccgccc ggcgcgtcga 1516261 gccggctcga agtctggtcg gcaacgctct ggccagccag cacaccgcga cgatgcggtt 1516321 gctgggcatg cagataccgg gccagctgtt gttcagcatc gccagccaac tggctttgat 1516381 cgtgctcgcc ggcaccaccg cggcgctgac catcacggga acgctcacgg ttcccgaggc 1516441 catcgccctg atcgtggtga tggtccgtta cctcgagccg ttcaccgctg tcagcgagtt 1516501 ggcgccggcc ctcgagagca cccgcgcgac cctggggcgc atcggatcgg tgcttaccgc 1516561 accggtcatg gtggccgggt ctggcacgtg gcgtgacggc gccgtggtcc cgcgtatcga 1516621 gttcgacgac gtcgccttcg gctacgacgg cggcagcggg ccggtcctcg acggggtcag 1516681 cttctgcttg cagccgggaa ccacgacggc gatcgtcgga ccgtctggct gcggaaagag 1516741 cacgatcctg gcgctgatcg cgggcctgca ccagcccact cgcggtcgtg tcctcatcga 1516801 cggcaccgat gtcgcgacgc tggatgcccg ggcgcagcag gcggtctgca gtgtcgtgtt 1516861 ccaacatcct tacctgttcc acgggacgat ccgcgacaac gtgttcgctg cagacccggg 1516921 cgctagtgac gatcagtttg cgcaagccgt ccggctggcg cgggtggacg agctcatcgc 1516981 caggctgcca gacggcgcaa acacaatcgt tggcgaagcc ggctcggcgc tgtccggcgg 1517041 cgagcggcaa cgcgtaagca tcgcacgggc tctgctgaaa gccgctccgg tgctactggt 1517101 cgacgaggcg accagcgcac tggacgccga gaatgaggcc gcggtggtcg acgcgcttgc 1517161 ggccgatccg cgatcacgca cccgggtgat cgtcgcccat cggttggcaa gcatccgtca 1517221 tgccgaccgc gtcctgtttg ttgacgatgg ccgagtggtc gaggacggtt cgatctccga 1517281 gttgctcacc gcgggtgggc gtttcagtca gttctggcgc caacagcacg aggccgccga 1517341 gtggcagatc ctcgccgagt aacgcgagaa accaccgcgc cacgcagata gccacttcct 1517401 ccgtgaatct gcatcgcgag gtcggccacc ttgccagcta gttcggtgta gaagagcttc 1517461 gccgccgacg gtgcaaaata tgatattcgc atggcgtcat tgctgaacgc tcggactgcc 1517521 gtaattaccg gcggtgcaca agggctgggg ttagctatcg gccagcgatt cgttgccgag 1517581 ggtgcacggg ttgtgcttgg tgatgtgaat ctcgaagcga ccgaggtcgc agccaagcgg 1517641 ctgggcggcg atgacgttgc tctggcggtg cggtgcgatg tgactcaagc cgacgacgtc 1517701 gacatcctca tccggaccgc tgtcgagcgt ttcggcggtc tggatgtcat ggtcaacaac 1517761 gccgggatca cccgcgacgc aacgatgcgc acgatgaccg aagagcagtt cgatcaggtc 1517821 atcgcggtgc atctgaaggg aacatggaac ggtacccggc tggcggcggc aatcatgcgg 1517881 gaacgcaagc ggggcgccat tgtgaacatg tcttcggtgt caggcaaggt cggtatggtc 1517941 ggccaaacca actactcagc ggccaaggcc ggcatcgtag gaatgaccaa ggcggccgcc 1518001 aaagaacttg cacacctcgg cattcgggta aacgcaatag ctccggggtt gatccgttca 1518061 gcgatgacag aagctatgcc gcaacgcatt tgggaccaga agcttgccga agttccgatg 1518121 ggtcgcgccg gcgagcccag cgaagtcgct agcgtggccg tgttcttggc ttcggatcta 1518181 tcctcgtaca tgaccggcac cgtgttggac gtgactggcg gccggttcat atgacaccga 1518241 gatcattgcc acggtacggc aattcgtcaa gaaggaaatc tttcccaatg caccggccct 1518301 cgaacgtggc aacagctacc cgcaagaaat cgtcgatcgg ctgggtgtta ttggcttgct 1518361 cggtcgccgg ctgcaagggt atcgacacca ccgagttcat tctcgggcgt gccggcgcat 1518421 tcgagctggc ggtgcgcgct gcccagcacc gtcataggta cttgacgatg gtcaacgtcg 1518481 gacgagcgcc accacgtcgc tgccgaacgg tatgcatggc ggctaccgat actccgcgga 1518541 atatcagatt gaacggctga tgcctgatgc gcccgttgct gctcagcgga gcgggaacca 1518601 gcgcgatcca gaagcctctg aggactcgaa ggctggcctc cggagtccat cgatgatgtg 1518661 cagttgcatc gcgattgccg ccaggggcgt tgtcgcttga gcacatctgg gcataggctg 1518721 ccatcttgga gggcaggcaa cctgcatgat agggaggaga atatggcccg cacgcttgcg 1518781 ttgcgcgcat cggcgggact cgtcgcgggt atggcaatgg ccgcgatcac gctcgcacct 1518841 ggggcccgcg ccgaaaccgg tgagcaattc cccggggatg gggtgtttct cgtgggaact 1518901 gacattgcgc caggcaccta ccgcacggag gggccgtcga atccccttat tttggtgttc 1518961 ggcagggtgt ccgagctctc aacctgctca tggtcgacac acagcgcacc cgaggtgagc 1519021 aatgagaaca ttgtcgacac caacacctct atgggcccga tgtcagtggt gatcccgccg 1519081 accgtggcag ccttccagac gcataactgc aagctttgga tgcggatctc ataggggccg 1519141 gcgtacccgg taccggccgc gggcctacca cgtgccggaa ctggaagcgc agtaagccct 1519201 caacgcgcca ccgctttggc ccgcgcgccc ggcgtaggcg catcggcggt ggccgtgggg 1519261 cggcgcactg cgacctcacc agcggctttc gagctttgtt cgatcaaccg gccagcatgg 1519321 tcgaggatgc attcgagacc atattcgaaa ttggtttcat cgggggcccc gatccgatgc 1519381 cccctcccag ttgcgtgagc aagcagcgga gtcgtcgcgg gatcgatggc cacggggtgt 1519441 tcaatggcgg atggtccgct gcccgccgac tggctcttgc gggagagccg atctagcacc 1519501 accgatccgc gcacgtggac cgaaaccgcc gagtagatgt cgaaagcgtc ttcgagcgac 1519561 aggcccgccg tcaccagatt ggcgatggcc ttctccatct cttgggcgcc caaccgcgcc 1519621 gttttcgggg acagcgccgc tcgaatcagt atcagatcgc acagtacggg gttgtccgcg 1519681 aacgtcttcc gcatcgagcg ggcatgattg cgcaacgttt cgcgccagtc gccggcttcg 1519741 atgtacgggg tagcgaacac gtacttgctc aaagcgcggt cggtcatcgc gttgagcaga 1519801 tcgtccttct tgcggaagta ccagtagatg ctggtgaccc cgacgccaag gtgtttgccg 1519861 agcaatggca tgctcaagtt gtctatcgat acctgctggg cgagttcgaa tgcgccgctg 1519921 atgatgtcct cggggttgat ggatccgcgc tgccgtcgtt gacgcttgcc tggggttgtc 1519981 tgcattgccg ttacggcacc tccatcaaga taacgccggg tcagttgcag gtatgcaggt 1520041 cggcggtagt cgtcgtgcgg acaacatgtg ccgcatggcc tccccgggga caggccggga 1520101 gaacaagaag ccttgcgcac ggtaacagcg ctgatccaat agaattctgg cggcagcctc 1520161 ggtctcgacg ccttcggcta ctacatcgag ttggaagcct tcggcgagtg tcatgatgcc 1520221 gcgcacaatg accagatcgc tagtgttggt tccgagttgc cgcacgaatg ttttgtcgat 1520281 cttgagcgtg tcgatcggta gcgtctgcaa cagtgatatg gcgctatagc cggtgccgaa 1520341 atcgtcgata gcgatgtgaa cgccgacttc tttgagtcga gccagggtgg ctctggcggt 1520401 atgtaggtct tgcaccacaa cgttttcggt gatttccaaa cacacggacg aggcgtccag 1520461 accgtgctgg ccgatcgtgt ctgcgacgaa gtcaacaaac ccgcccgtca ccagctgtcc 1520521 agctgagacg ttgatacgca gcagcgcgtc gtggcccaaa ccggctgact gccactcgga 1520581 gaattcattg caggccctcc gcagcaccca tctatccaat tcgcctgcaa ggttgatgga 1520641 ttcggccaca gggatgaagc agcccggtgc cagcagccca cgggtggggt gctgccaccg 1520701 gaccaatgcc tcggtcccga caatgtcgcc ggtccgtagg tcgacctcgg gtaggtagac 1520761 caggcgaagg gcgtcggatt cgataccacg tcgaaggtgt agttcaatat cgttgcgcag 1520821 ttcgccgctg accgacatgt ccgcggtgaa aatcgcgacg ctatctccgc cggcgtgttt 1520881 ggctgccaga gcggcttggt cggctcggcg caggaggtcc gacggtgtgt gctgtccggg 1520941 agtccctgag gcgacaccga tactgacggt gcgggtgagc acctcaccgc cgatagcgac 1521001 gtggtccttg agctggtcgc gaagacgttc ggcgagcggt tgagcggcat cggcactcat 1521061 tggagatgcg ggtatgagga cgaattcgtc gccgccgagt cgggcgatca ggctctcgcc 1521121 aacgagtgcg tcaccgatcc gttgggcgaa cacatggatg aactggtcac cggcggcgtg 1521181 gcccaggtag tcgttgatgg ccttgaggcg gtccaagtcg agaaatagcg ccgcgaccgg 1521241 gccaggttgt ccgggggcca gtctttggtc caggtgctgc agcaacgcgc gacggttatg 1521301 cagtccggtc agatcgtcat ggtcggccag atagcgaagc cgcgcctcgg cggcgacgcg 1521361 agcctgcacc tgggcgaaga gtgtagcgat ggtcatgagg gcgttaagct cggcctcgtg 1521421 ccatttccga tcaccgaact tgatgaaccc cagcagtcca gtggtgatct cgccagatac 1521481 cagcggcacg gcggcagccg acgttaccgg aaccccgcgg gcttcttcga tgaggcgttg 1521541 atagtcctcg gtggccggct cgggccggaa cacgagaggc tctttggcgt gttcgcatag 1521601 cgcaaacacc gggtcggcat cagcgaagta gatcagcctg agcggatcgg ggtccggtat 1521661 gttgaggcga ggtggccatt cggccaccag cctcgtcgcg cgcctgtcgc gatcgttatg 1521721 acgcaaaaag ctgacatcta cgcccagctg ttccactaga taggccaaaa cgcgctgact 1521781 gacttcggct gacgtggcag cgtcgactgt catgagctgg ttggctacgg tggtgacgag 1521841 ctcctcaagc tgcggcgtcg cggtgtcgtt gcacatctcg gatgctatct gtgcggctct 1521901 ggtatggcgt gccgtacgcg tcggcggcta cacaccgacg gcggtggcgc gtggaacaac 1521961 ctgaagatca acacctcgtg cccttctttg cccggcttga ccagttcccg aaagtcgagt 1522021 tgcaggcggt gcagctgtgc ggcgaaatgg ggtgacgctt ggtcgaggtc gtggcggcca 1522081 cgtgcataca ggaagatcgg tgacatcggt tgtacggcca gtccatgttg ttgggccaca 1522141 atccacaccg cctgcatggc tgatccgcca cgcgcaaaat cggtgagcgt ggcgccatca 1522201 acgtagacga ttgcgagcgc tgaactcgcc gacacgcgct cattggtgtt gtcttcgagg 1522261 gctgttccgc aatcccattg cgctagccgt gccacgacgt cggagcgtcg caggatatcg 1522321 agaacccgca attcgccgga atccagttcg aggcttcgga catcgatgcc cgcatcgagc 1522381 gaagggtcgc ccggccaccg gagctcggac atcatttcct catgtagcct cggggtgaga 1522441 tagcggattc ggtctgcagc cgctaaaatt gttgcagccc ggtcgatctc gtttcgtgac 1522501 agcaacagct gtaaccgcgc accctcagcc gcggcggtgt tcgttaacaa ctcaacggtc 1522561 gcggggtgga cgtgaccggg cataccgtgg tggcgattgg tcgttctgag cagcatcggc 1522621 cggtaaaggg ccgcaaggct tggatcatca ccacggccaa aatgcattgt cgcttgcagc 1522681 ggcgagtcgg gctgggattc gtcgaactct actgatccca ggacccggtg tgcagcggca 1522741 gcgacacgcg cgttaaacat ggccgcgccg acggccactg cgctaccacg aaacgcgata 1522801 tccattgcgc tggtgtgctc aggtgctagt cggatggtca gcgaatgctg tttggccaca 1522861 acatgccatg gctgaacgtt gccccctgaa ggcgcgcgaa tcgccgcctg agccacgatt 1522921 tcgctggttg gctgcggctc ggctggcgct gttggcggca cggactcgag caaccatccg 1522981 ttcccgcgag acggcatggg cggttgatcg aggcgatcta gcgctgcgga cacatccacc 1523041 cgtacccggc cagactcaag tggttctccc agaccgattc tgcgtaccgc ttcagctacc 1523101 gtcgctgcgc ccacccagat atcgcctgcc aactgcggcc atccccacaa cgtctggtca 1523161 acttcgatca tcgaagccgc acaacgcgcc gagagctctt ggcaatcaag gatgttgaga 1523221 acgtggggga ctttgtcttt tgtggtcagt ccacacagct tgtcggcgtc gatgtcgccc 1523281 aatagcccat gaaagatcgg tcgcccaggt tcgacgtcgt agcgttcgac atcgaccagg 1523341 ccgcggtcac tggtcgccat cagtacgggg acaccacggg cgcacgcggc ttgtcgcagt 1523401 atcactttga tatccagcga gtcgcattct tcgataacga cgtcaaggcc gtcgaggaac 1523461 tcgtcgacgg attccggcga gagcccggat gtaacgaggt ccacggccag gtagggatcc 1523521 agctccgcga tcctgcgcgc cgcaatcatc gccttgttga ggccaatgtc gaagacgccg 1523581 accggcacgc gattcaggtt cgacagctca attttgtcga aatcggccaa ccgcagtgtg 1523641 ccacaggcac cttcggcggc aagggtgtat gcgatcgcat ggccggcgct gagtccgacg 1523701 acgccgaccc gtagcgcgtg cagtgcgcgt tgttcctcag cggtgatgag gtgcctgttg 1523761 cggtccaagc gcacggcacg gaacccccgg agacccagaa tggcaacaac catgcgccgc 1523821 cagggataat aggcccatcg cttcgcttct tctagcagat ctggatcagg ctgtggcagc 1523881 aggcgccgca cgcccgctag ctgttctgcg aatcggtcga cgaactcgat gctcggatct 1523941 gagcgtagtc gatcgagcac caggacatcg tcgtggtcat cgtcacgaag gacgagaatg 1524001 ccggtgctgc cgccctcgtg tgggatggtc actgttcggc tccagcggtc gctgcggtgg 1524061 ttgcgctcaa cgcttctaca tcgcgcagaa gcttgcgcga ctcgacaagc attcttgaca 1524121 gttgttttgg ctcggcatgg ttagccaagg ttctgcggtc ccaccagatc atcttggtcc 1524181 ggtagcgctc gtccgggtat gctgccgccg ggattctcgc tgctattact ccccccgaag 1524241 aacgccaccg gtccagcgcg tgggccgccg cggtccccat cacaaactga acccccaaca 1524301 gggacatgct tagcggtagg gcgcgcgcca aggcggcagc aatcgcatca ctgcgctgcg 1524361 cgtcactatt aacccacccg gacttcactt ccacgacccc gaatggcgcc cggtcattga 1524421 tcatcttgcg caccgcggat aatccgggat tgccagccca ttcgactacc gcatgcgagt 1524481 catcggctga ccgcagcggt ccgattaccc gagcgccccc gactacatct cctccaatat 1524541 caatggcggc aaagaacaac tgtgtatcgg aaccgtcact gatggcgtcg agatctaagg 1524601 tacactcgac tccgtgctta ctataggcgc gaagcgcacc ctgaaggtat gtattccaca 1524661 acgtgggatc gagcgcgggt tgcgatacca caagccggca ttgcgcatca gaaacccaca 1524721 cactgagatt ttccgaaaaa tgcaacttct gcggtgcgat aggacgaagt tgagcggtgg 1524781 tcatgattct ccaatctgtt aggtatccgg caattaacac gagatttgct gcccctgtat 1524841 cgagcagcgc agacgttggg gctgcgcccg gagaattgct gccgttgcgc agaacggcgc 1524901 cgcacggcag ggttcaacgc ccggccgcgc tggtatttat cgagtcgctg cgagagccgt 1524961 gaacaaattg tcacgaaatc gtgcgcacgc gcgttcacaa ataccacgcg cacgcgctcg 1525021 aaaactacat gaccagatag ccagattttt ccggaccggc aaagcgttgt tcagtgttgg 1525081 tcacggctct tgatcgtatt taccccgggt ggcgtagacc ctatcgatgg tggaccccgt 1525141 tcatcggggt aatcgaatgg atcatgcaaa atattacttt gacgagtatt cattccgatt 1525201 caaccggtcc cacccacctc tcatgcgtgc ggggcatact gttctacggc ttggtgaaac 1525261 acgccgttgc catcgatcta cacccacgcg acctactctc tgaaaaagtc gaccggcagt 1525321 gccttggcaa agtgccagcc ttgtgcggct ttacagccga aggcgcgcaa ccgggcggct 1525381 tggctggggg tttcgactag ctttgcagtg acggtgatac cgagcttgtc gccaaggtcg 1525441 atcattgccc gggtgatctg ttcgttggcc agccgagctt gaatgtcgcc atcgaggcac 1525501 tcgatgaact ttcccccgag tttgaccacg tcgacgggga ggcggggaag gtaggcgagg 1525561 ctggagaatc caatgccgaa gtcgtcgatg gcgatgccga cgccgagagc ggacaattct 1525621 tgtagcctgg tcaccgcctt ctcgtctctg ctaaggcgcg cgtcctcggc cagttcgagc 1525681 tgcagggcat gggcgggcag gccggtttcg ccgagcacac cttcgaccag caccaggaag 1525741 ccgggatcgc agatggtgct ggcggagacg ttgacgctga caaacggttg cgggtcggtg 1525801 ctgtggtcac gccaactgcg gacgtggcgg caggcctgct cgagcacgaa ggccgtgagc 1525861 ggcaccatca gtccgttgtt ctcggcacgg tcgatgaacc ggcccgggag tagcgtgccc 1525921 aacgtcgggt gttcccagcg cagcagggcc tcggcgccga tgatgcggtt gtcggcaagc 1525981 cggatgattg gctggtagac gaggaagaat tcaccgcgat ccagtgccac gcgcatcgaa 1526041 gtggacagat aatggcgagt gttgacctgg tcgcggtcgg agtccgccca ttggtcagga 1526101 ttggctacca tcgctcgcgc ttgcatcgcg cccctaaaca tctcttcgta gtcgatcaac 1526161 ttggtcggcc tgagcgcgca agcgaacgct gtagcgcgct gacaacaacg atccatccaa 1526221 gggctgcatc aggattcaca gcccggtggg cacctcgccg accgcggtgg caacgcgaag 1526281 cacaccaccg aagtcgtctt gacccgaacc gtcgcagtag attctggagt cctgggaggc 1526341 aaagatcgtc agcgataatg cgtaaaagtc cgtcacgtac tacgtagaag gtccgtgagt 1526401 gcagccgttc cgggcatgca cgaaccggcg cttacacgtc gaaggcggct gcgcggcaat 1526461 cagtctcggt gggtaaccca ttgtcggcgg gcgatcggtt acctctcgaa tcgacggccg 1526521 cccgcatctg agttagccag gccagcggtt tcctacgggc gctgggtgca aagatacgac 1526581 ttccgggtgc aatagttacg cgctatcgct gatgttcttg tccgcaccgg ccttcagagt 1526641 tgagccaacg cgtagtcgcc actcggcact acggtgggcg cgtcatcgac gcttcgctga 1526701 cggcccgagg tggcagatgt tgcgctcgct gcagatcgcc gatcaaatcg ctcgtacggg 1526761 tcacatgcca gtgaggcgtc ttgatctgat ctggatcagc gcacgaaacg ccgcgagacg 1526821 ggagcttgat ctgggcgtgg ctgcgctggt ggaggctgtg acgttgctca ctgctgacgt 1526881 cgagggctcg acacggctgt cgcagacgcg actcaacgag ctagcggccg attacccaac 1526941 cttggatcag aacatatcgg aagctgtcgc ggcccatggc ggggtgacgc gaccggtaga 1527001 ccaggaggtg ggtagcggtc tcgtcgtcgc gttcctgcgt gctggcgacg cgatcgcgtg 1527061 cgctttggaa ctgcagctct caacgttggc gcctatgcgg ccgcgtgtcg gtgtgcacac 1527121 cggcgatgtc cggctgcgcg gcgacggcac catcaccggc tccgcgatca acgagagtgc 1527181 gtgtctgcgc gacctcgcac acgaaggcca gactttgctt tcagccgcca ctggcgatct 1527241 ggtcatcgac cagcttccgg caaatacctg gctgaccgac gtcggcaagt accccctgcg 1527301 gggtttgcat cgccaagaac gggttatcca gttgtgtcat cgagacctac gcaatgagtt 1527361 tccgccgctg cggatgtcgg tcggtaacag atccagcctt ccggcccagt tcaccacttt 1527421 tgtaggccgt gacgcacaga tcaacgaggt gcaagaggtc ctgacgaact accggctggt 1527481 gacgctgcgc ggcgagggcg gtgtaggtaa gacgcgtctg gcgatccaga tcgcggccgc 1527541 gtcggaattt cgcgatggtc tgtgtttcgt cgacttggca ccgattgccg atcccggcat 1527601 ggtgtccacc accgcggccc atgctctagg tctgatcgat cggccgggca gctcaacatt 1527661 cgacactctt agtcatgcca tcggcaactg ccacatgcta atggtgttgg acaactgtga 1527721 gcacgtgttg gatgcgtgcg ccgagctggt cgttgagctg ctgggtgcct gcccggagtt 1527781 aagcattttg gcgaccagcc gcgagtcgat cggcgtgacc ggcgaggtca catgggtggt 1527841 gccgtcgttg tctccggcga acgaagcaat ccagttgttc actgaacgtg cgcgcctagt 1527901 ccaacccaat tttgagatcg ttgctgacaa cttcgacgcc gtgagcgaga tctgccggcg 1527961 gctagacggt atgcccctgg caatcgagtt ggccgcggca cgattgcggt cgttgtcgcc 1528021 aaacgagatc gccaacagtt tggatgaccg attccgcctg ctgaccggtg gtgctcgcag 1528081 tacggtgcag cgccagcaga cattacgggc atctatggat tggtcgtacg cactgctgac 1528141 tgacaccgaa cggatcctgt tccgccgcct tgcggtgttt gtgggcggtt tcgacctcac 1528201 cgcggcgagc gaagtcgccg ccgccggcgg cgacgacttc gtcgagcggt attcagtgct 1528261 tgatcaactg acgctgcttg tcgacaagtc gctggtggta gccgaagaaa gccgaggcag 1528321 tacgcgctat cggctgttgg aaaccgtacg ccagtatgcg ctagaaaaac tgaacgaatc 1528381 cgaagaaatc gacggggtgc gcgctaggca ccggacccac tacgcaacca tggcggcagg 1528441 gctgaacgtt cccgcctcca ccgactatga acaacgcctc ctgcaggctg aagccgaaat 1528501 cgataatttg cgtgccgcat tcacctggag ccgtggaaac ggcgatattg cagccgcatt 1528561 gcagctcgca tccgcattgc aaccgctgtg gtcgcagggg cgcatgcgcg aagggctggc 1528621 ctggctcgaa tccatcctcg agcgggaagg cgacaatcat cttgtgccgg cgggggtttg 1528681 ggcgcgggcg cttgcggaga aggtaatact caaggcttgg ccggccacga gcccgatggg 1528741 cgcccccgac atcgtcgcgc aggctcacca tgccttggcg ctggcacgcg acgcaggcga 1528801 ctgcgcagtg ttggctcgag cgctcgtcgc atgtggctgc ggcagtggtt gcgacacgga 1528861 agccgctcaa ccctacttcg ccgaggcgat cgagctggcg cgcgccatta acgatgagtg 1528921 gacattgagc caaatcgatt attggcaggt ggtcgggatc ttcatatcgg gtcagccaat 1528981 tcctttgcga gctgcggccg aacaagctcg agagctcgcc gacagcatcg gaaaccggtt 1529041 cgtctcacgt caatgccgcc tgtttgcctg cctggcgcag atatgggaag gcgacgcgaa 1529101 cggagcattg gcactatctc gcgacgttac cgccgaggcc gaggtggcaa acgatgtcgt 1529161 tactaaggta ctcggtttgt atgtcgaagc catggcactg tcttacatcg gcgacagcgc 1529221 cgcccggacc atcgctggtg cggctctcga agctgccacc gagttaggcg ggatttacca 1529281 agatctgggt tacggagcga taactcgcgc ggcgttggcc gcgggcgacg tagcggccat 1529341 tgaggctagc gaagcgagct gggatcttcg caatcaacac aacgtggtaa cggcacacca 1529401 cgagctgatg gcgcaggcag ccctggttcg cggcgatgtg accacggcaa gacgtttcgc 1529461 cgacgaagct gtgcttgcga gcaccggatg gcatctgatg atggcgctga tagcacgggc 1529521 gcgagtggcg attgcgcagg acgagctggg aaaggcacgc gatgacgccc acgccgcggt 1529581 ggcgtgcggc gtcggtgtgc agacgtacct cgcgatgccg gatgccctag aacttctcgc 1529641 aggtctggcc ggtgaggccg gtaaccacgg tcaagcagtg cgccttttcg gcgcggccgc 1529701 ggcccagcgg cagcgtacgg gggaggttcg ccacaagatt tgggacgccg gctatgaggc 1529761 cgccacggcg gcgcttcgtg atgcgatggg cgacgaagat ttcactgccg cctgggctga 1529821 gggtgccgcg gcccccttgg acgaggcgat cgcctacgca caacgcggtc gcggcgaacg 1529881 caaacgccca agcaacggct gggacgcgct gaccccggcc gagcacaaaa tcgtaaagct 1529941 cgtcaccgaa ggactggtca ccaaggacat cgccgcgagg cttttcgtct caccgcgtac 1530001 cgtgcaaaca cacctcaccc acatctacac caagctcgac gtcacctccc gtgtccaact 1530061 tgtacaggag gccgcgcaac actcgaccta ggattgcgcg gccagcgcag gcccggagtt 1530121 cgaatcggat gcaatacgca accaatctgg gctcttctgc gcgttgtcgc tgatgttcat 1530181 ggctcttcgc gcccccatgc ttgagcgcat gaacggtttg catacagatg acgcgccggt 1530241 caattggctc gagcggcgag gtggccggct tacgtcgagg cggagggtga cgttgctcca 1530301 tgctggagtg gaacacccga tgcggctgtg gggcgtccaa tccgaggcga taactgccgc 1530361 gatggtgctt agccggaagg tatcggccat cattgccgga cactgcggtg tgcgcctagt 1530421 tgatcagggc gtgggcgatg gcttcgtcgc cgcgttcgcc catgccagcg atgccgtcgc 1530481 atgtgctctg gagttgcacc aggctccgtt gtccccgatc gtcctgcgca tcgggattca 1530541 caccggtgag gcgcagttgg tcgacgagcg catctacgcc ggcgccacaa tgaacctggc 1530601 tgcagagcta cgggatttag cccatggtgg gcagaccgtg atgtcgggtg ctaccgagga 1530661 tgcggtactc ggccggcttc ccatgcgcgc ttggctaatt ggcttgaggc ccatggaagg 1530721 gtccccggaa gggcataact tcccccagtc acaacgcata gcacaattgt gccatccgaa 1530781 ccttcgcaac acctttccgc cgctgcgcat gcgcatcgcc gatgcgagcg gaattcctta 1530841 tgtggggcgg attctggtta acgttcaggt agttccccac tgggaaggag ggtgtgccgc 1530901 agcggggatg gtccttgctg ggtgaagcgc cattgagggc cagacgatag gttggccagc 1530961 gacgtcctca actcagactc tcggcgcgac ctgaccggcg gttacgatca tctgctcgga 1531021 cattcgcaag agagcgtgct cgcccactcc ctgcggcaag gtgtaggcca gctcgcgcaa 1531081 cttaaccgcg cactcgtaga gcgtctggcg gaaatgcgtt tcggtcatcg gggtgccttc 1531141 gctcgtcggc gagatcggca gcggtgtctt gtgctcatac agatcaatct cattcatcag 1531201 agccatcatt cgccggctag cagctgcaaa tcatcagcac atccacgtat gtcgtcggct 1531261 gcccagcgcg caatgtgtgg cacggcgagt tgatgttcaa cctcggcgtg tcctgcatac 1531321 tggatttctt actgtaaagt cacccaaatg ggtggtgccc gccggctcaa gctcgacggg 1531381 agcatcccca accagctcgc ccgggcggcc gacgcggccg tcgcacttga gcgcaatggt 1531441 ttcgatgggg gctggacagc tgaagccagc catgatccct ttctcccgct gctactggct 1531501 gccgagcaca cgtcgcgact tgagcttggc accaacatcg cggtagcgtt cgcgcgcaat 1531561 ccgatgattg tcgccaacgt gggctgggac ctacagacgt actcgaaggg aagattgatc 1531621 ctcggtctgg gaacccagat ccggccgcac atcgagaaac gattcagcat gccctggggt 1531681 catccggcac gtcggatgcg tgaattcgtc gccgcgctgc gtgcgatctg gttggcttgg 1531741 caggacggga ccaagctttg cttcgagggt gagttctaca cccacaagat catgaccccg 1531801 atgttcacac ccgagccgca gccctatccc gttccgagag tcttcatcgc cgctgtcggt 1531861 gaagcgatga ccgaaatgtg cggcgaagtc gccgacggcc acctcggtca ccctatggtc 1531921 tcgaaacggt acctcaccga ggtgtcggtg ccggcgctgc tacgtggcct ggcgcgatcg 1531981 ggtcgcgatc gcagtgcctt cgaggtgtcg tgcgaggtga tggtggccac tggcgcggac 1532041 gacgccgaac tggcggccgc ctgcactgcc acgcgcaagc aaatcgcctt ctacggatcc 1532101 acgccggctt accgcaaagt cctcgagcag catggctggg gcgatctgca cccggagctg 1532161 caccgcctct ccaagctggg tgagtgggag gccatgggtg ggctaatcga cgacgagatg 1532221 ctcggtgctt tcgcggtggt cggtccggtg gacacgatcg ccggtgccct tcgcaatcgt 1532281 tgtgagggcg tcgtcgaccg cgtcttgccg attttcatgg ccgcatctca ggagtgtatt 1532341 aacgccgcac tgcaggactt tcgccgttga gcgcgccatc ggtggatgag gccaccaaga 1532401 tcgctgcccg catagagggc ccgcattgcg tgcggatcgg cgttacccgg cggcgggcac 1532461 acggggcatt acgtacgccc gcggcggcat ccgcaacgca ttgctaaccc cgccgaaccc 1532521 gccgccgcta ttggtcagtt gccccagcgg tagcccgccc agcatgtgtc cgggggcggt 1532581 ttgggcggcg ctggtcaggc tggtcagcgg cagcgcccgc gccgccgggg tgaccgcctg 1532641 gttggccgcg gcccaggcct gcggcaccga caacgaaccg accgaggccg cccgacccaa 1532701 gttggcggcc accccagcgc ccagacccga agaacccagc gacgaaccca gctggctgcc 1532761 cagcgagctc atcgcctgga ccccgttttg cgccgcggtt tccacggcct gagccgccgc 1532821 cggagcaaag cccttcaaca tcgagtgcaa ggtgctggcc atcgacacac ccgagttggt 1532881 catcgacacg tggttgttga gcatcgacac gatgttgctg agcggcgaca gatgcggcga 1532941 gatggctttc cagagttcac tcagttggtc gaacggccag atgcttttcg tgggctgggc 1533001 cagttgttgc agcgcttggg gcacattgtt catcaactgg ttcgccgcgg cggtgtcgat 1533061 ggcctcctcg accgcgacgg cctgctcaag gagcccgccg gggttggtga tcagtggggc 1533121 gtcctcgaac ggcagcaacg cctcggtcgc cgtcgccgcc gtggcggcgt agccaaacat 1533181 cgcggcggcg tcttgggccc acatctcccc gtattcggcc tcgttgaccg cgatcgccgg 1533241 ggtgttttgc cccaagaggt tggtcgctat cagaatcatc agttcagcac ggttctcggc 1533301 gatcaccggc gggggcaccg tcagcccata cgccgtctcg taggccgccg cagcaacccg 1533361 gacctgggcg gcggtcagct cggcctgccc cgcggtgacg ctcatccacg ccacatacgg 1533421 cgaggccgcc gccaccatca gacccgccga cgaacctatc cacgatcccg tcgtcagacc 1533481 ccagaccacc gactgaaacg ccgacgcggc cgaaaacagg tcactcgcca cgctgtccca 1533541 catcttcgcg gcggccacca gcgaggccga acccgggccg gcgtacatcc tcgcggagtt 1533601 gatctccggt ggtaacgccc cgaagtccac cacttcgata atccttccgc tcggccataa 1533661 ctagcaccaa tgatggacag caaacaacgt cggcaacagg tcaaattctc tcaagtagtc 1533721 acaaccccag atgcaaagtg caccagccgc cctgccgcga agctaaatcc agctgaacaa 1533781 tctgaacatc aggtaaatac agtggcaact aatcttaaat aacccggccg attagacagc 1533841 ggccgagatc tgtttgaggt cgggccgtga ttgtccggag aacggccggt atctcgcacg 1533901 agcagccacc cgcgcccccg tcagacttgg cgaccgccta cggcaaccta aaccggggtg 1533961 aacttggtga tcagccaatt gccgtcgacc ttggctaggg tcaccatcac gctgctggcc 1534021 gccatcgacg gattggggct gtccttactg gtagtgctct ggtcgacaaa aaccagaacg 1534081 acggccgaat ccggatgtag ctccgacacg gccgcgcgca ccaccttggc ggtggttttc 1534141 agtgacttct gtttggccgc cggagccacg atctgctgcg tgaactggtc gtagtaggac 1534201 aggaaatcgc cggcgaggtg cgacctggcg gtagcgaagt cttggtcgag cgtgtcgggt 1534261 gaatacgaca acagcgcgat tgtcccgtca gacgccgcgg cgacggcagc acgggcggcg 1534321 ccggagtccg tctgctgatc gggtcggtat tgctcaaggt atagccatcc cgtcgcgccc 1534381 ccagagatca acatgagcag gatgagaatc accggaacgg gtttcaaggt aacctgcatt 1534441 cgccacaggt cacggtgccg ctgacccttc tgcgcggtag attccgttgc agagtcggtg 1534501 tcaaatgcct cggtcgccga atcaccggct tcgcctgcgg ctgagtcgat ctcagcgact 1534561 tcggtggcgt cagtggtttc ggtgttgacg tcgcgtacgt catcggtcac ggtacgaact 1534621 caactttcga catcttgtac tgtcccccct cttcggtcac ggtcactttg agccgccacg 1534681 cacgtggttc gtctttcgcc ccagcggaat tggtgacccg tgaagtcgcc gcgacgagca 1534741 ccacggcgga atgctcgttc atggattcga cggctgtcgc gttcaccgtg ccttcggtga 1534801 ccactttgga ctgttcgaca accttggtga aatcggctgc ccgctgctgg aagtcatccc 1534861 tgaattcgcc ggtggagctg tcgatcacac gcgcgacgtc ttctttggcc ttgttgaagt 1534921 ccagcgaggt catgttgatg acaccttgct tggctccggc ggcgaacgcc gcggcgcgct 1534981 gctggcgttc ggtggcctca tggtgttgcc acacaatgta tccgctgagc ccggtgaagc 1535041 cgcagatgat gacgactgcg gccgccatgg caatcgtgga cagtcttggt aaccgcaccc 1535101 gcaaccgccg tcgccaggat gccgaccgtg cggcctcctg gtctgcggcc tcatagtcgt 1535161 catagtcgtc atagtcttcg gcgtcttccc agtctgcata ctcctcgggg acgttctcgt 1535221 cctcggctgg ggccatcgcc agcgcctcac gcttcaaccg ggcggcacgg gcacgggccc 1535281 gcgccgcggc ggccagcgct tcggcttcgg cggcttcggc ttcggcggcc aacgccatcg 1535341 cgtcggcttg cgatgtcccc gcgtccgacg gtggttcggt tgtctcagcc atcgtggata 1535401 cagcccgtca gtatcattct cgactgaact ccggtatcgc tcacagtatt gcccattgct 1535461 aacattcgag gccccagcga actcctacga ggaccgatca ggcagacgat cacccacctg 1535521 agttttcccc ggcagcatga ctggtgttga ctctctcgaa aacatactct ttacttcgac 1535581 cctccaagcg tttctccaaa agattcccgg ttgtgccatg tcgtgtcgtg aacgcgtgca 1535641 ggcgatggtc gacggaacgc ccgtgcaggc tcgttgagcc gcctactcct gggcgaagat 1535701 gtcctcggtg tcggcgccga caacgggcag ctggaccagc gacaagacat ggtgcgccgg 1535761 gctgccgggt ggggccacca gcacgcactc ggtcccctgt ttgcgtgccc ggtcacaagc 1535821 ggcagcgagg gcgccgacgc cggccgaacc aaggtgggtg acggcactga ggtcgatcgt 1535881 cacgggtgct atgccagaac ggctttcgac ggcgatctgg cggtccaatg tggctgcggt 1535941 ggtcgaatcg acgtcgcccc ggacaacgat gcggccggat tcaactaggg agacgaattc 1536001 gctgtcgatg gtttgttgga aagctgcccg gcgaaccatc gtgtcggtga caaaccgcgc 1536061 cggccgcgac aggcgatgcg taagtgtggc agtcgttccg ccggcgccat gcatgatgcg 1536121 cgcctccgac actagggcct cggccatcgc caggccgcgc ccacggccac gggcgccgtc 1536181 gcggtggtcc ttccattggc cccggtcgat taccgatgcc cgcacgttgc cgtcgccggc 1536241 cagcgcggcc gcgacaacga tgcccttgga gacgtccgtg gcgtatccgt gttcgaccgc 1536301 gttctcgacg aattcggaga tcgcgtgcac gatatcggcg atgtcggagt ggtcggcgcc 1536361 gatctctgcc agccactcac gaagctgggc tcgaacggtt cgtgccgcgt tgatcgtcgc 1536421 atccagcgtt atgtgcagcg gcggcgttgg cgcccggcgt tgcatcgcaa gcagggtcac 1536481 atcgtcgttg tagccggtgg accgcagcag caattcaagt gtgtccgaac agagtcggtc 1536541 gatgggccgt gccggggcgt cgagcacaaa gccgccactg ccgctggcga tgctggccgc 1536601 taggtcggca aattcggcgg tgctggcctc gagcggccga ccgggccgct cgatcaggcc 1536661 gtcagtgtaa aagaggatcg cgtcgccgat gttgagcact tcactgcgca ctggaaatcc 1536721 ggttccgctg ccgagcggac ccgcgccggt tggttcgaca taccgcgcac tcgcgtccgc 1536781 ggtcaccagc agcggtggcg ggtgtccggc tgtgcagtac tggaattcgc ccgaggtgaa 1536841 gtcgagcgag ccgacacaca tggtggccga tttcgatcca ggtacctgtt tatggaagcg 1536901 gtccactgcc tcaagcgcct cgacgaccgt gtaccccgcc gagatctgca tgcgtaacgc 1536961 cgtacgtaat tgcgacatga ccgctgcggc ctccacgccg tggcccacga cgtcgccaac 1537021 gacgagcacc aaccgatccc cgagggccag cgcgtcgaac cagtcgccgc cggccgcggt 1537081 atcctcggcg gcgaccaggt actcggcggc tatgtcggcg ccgggaacca cgggcaccga 1537141 cgcggccagc aacgcctgct gcataacggt ggctgaatcg cgcacattgc gatagcgctc 1537201 ggacagttcc tccacgcgcg cctcggccgc ctgccgggct cgcactcggc tggtgacgtc 1537261 gtccacaatg agctgcacgc cctcgatcga tccgtccgcc cggcggcgcg gtgtgacgac 1537321 aaagtcgaag tatcgttcct caactccgga accgtcgtaa tcagtttgta gtcgccactc 1537381 cgatcctgat tgcggctcac cggtttgata gacccggtcc aacatttcgt agatctgctg 1537441 accctccagt tcgggataga cctcccgagc gggctgtccc acggtgtcaa gcaatggact 1537501 gaagccgcga taggccgcgt tcactgcgac aaagcgatgg tcaggcccct cgaggccaac 1537561 cagaatcgca gggatgtgct cgaaaatgcg tcgtacatcc tcggccgcac cgaccgtttt 1537621 gtcccagtcc atttcggccg ccatttggcc gtccctccta cggaccgatg tagcaaacgg 1537681 gtcaacgtgc gcagaccaat tcgccaggca acgcaaccag gttatcaacg tgccctacca 1537741 gcttgccgga aaagcaaaag tgcgtttggg gcaggccgcc acttatgtcg ctgacagcgc 1537801 ggactccgtg gtcgggtgta cgggcagcac gtcgccgtat ccgcaggcgt ggatgatgcg 1537861 cgcgacagca cggtcgcggc ttaccaggcg cacgtccacg ccccggcgtc gacaccgttc 1537921 ggcctcgtga gcaaggacgg cgactgcgca gcagcccatg aaatcgaggc cgttgaggtt 1537981 gaccacgagt ggttccggcg cggtggtggc cgcggccgcc ttcgtgacca gatcttgcca 1538041 agtgtgctca ttggcggcgt cgatctcgcc acgcgcatgg ataatcacag ccgagtcgtg 1538101 gtgctggatg gtcgccttga gcgcgttgct caccggagta gtgaatgacc ctgcctgagt 1538161 cgggttcatg gtgcactcct catcggcggc acccgagccc ccaattggat tgccggtcct 1538221 gcgcgccgcg gaaaaccgtc gtctttgtat agcaaggccg gcccgctccg tctatagcgc 1538281 cgaagccggc ggccaatgag cttctcggcg tcctcggagc cgaccccatt ctccgcgcgg 1538341 gtggccccgc agatgcgatc cagcaacccc gccgactcgc tccggacagg tggtggtagc 1538401 cctcgtaggc tcagcaatcg tggacttgca ttcacgacca ccgtggtcga acaacgcggt 1538461 gcgtcgtctt ggcgtggcac tgcgcgacgg agttgacccg ccggtcgact gcccgtcgta 1538521 cgccgaggtg atgctgtggc atgcggactt ggccgccgaa gtccaggacc ggatcgaggg 1538581 ccggagttgg tctgcgtcgg agttattggt tacctcacgt gcgaagagcc aagacaccct 1538641 gctagcaaag ctgcggcgtc ggccttacct gcaactgaac accatccaag acatcgcagg 1538701 tgtccgcatc gatgccgacc tcctgctggg cgagcagacg agacttgctc gcgagatcgc 1538761 cgaccacttc ggtgctgacc agcccgctat tcatgatctg cgtgaccacc cgcacgccgg 1538821 ctaccgggcc gttcatgtct ggcttcggtt acctgccggt cgtgtcgaga tacagattcg 1538881 caccattttg cagagcctgt gggccaactt ctacgagctt ctcgctgacg cgtacggtcg 1538941 gggcatccgc tatgacgagc ggccggagca gctagcggcc ggcgttgtcc cggcacagct 1539001 tcaagagctg gtaggggtta tgcaagacgc ttcagcggat ctggcgatgc atgaagccga 1539061 gtggcaacac tgtgcagaga tcgaataccc cggccagcgg gcgatggcgc ttggcgaggc 1539121 gagcaagaac aaggcgacgg tgctcgcaac gaccaagttt aggctggaaa gggccatcaa 1539181 tgaggccgag tcggcagggg gaggtgggtg aggtggctgg ctatgtcgtc gaatacaacc 1539241 ggcgcaccca cgtgcgtcgc atcaccgagt tcgccacccc gcaagaagcg atggagcacc 1539301 ggttgaagct ggaagccgag cgcaccgaca gcaatatcga gatcgttgcg ctcgtcagta 1539361 agtcgttggg aaccctgaag caaacgcatt cgcggtactt cactggtgaa gagctgaacg 1539421 tcggaaacgg cgcgcggtag gcccttgggt ttccgcgagt gtgccgggtc cggtcgacat 1539481 ggggaggttc ggtcaacatg tctacccggc actagagccc gagcgcccga taggtgcggc 1539541 ggacgaattt tggttgcgcg gtccgcagtt tcgccaggga tgggttaccc gcgacggccg 1539601 cggcagcgtc cacggacaga tcgggacaat gctggatcag atacagcagt atcaggtcgg 1539661 cgctcgggtc ggcctgccac catgtcccgt acgcgccggg ccagctgaag gtcccgagcc 1539721 cgcccggccc gaacagcggc ctggacttcg ccggatcggt caccaccgat aggttcagcc 1539781 cgaagccgcg gcccacccag aacggcgccc ccagaaagct gtgccgtttc tgctcgtcgg 1539841 tcagccggtc ggtgcgcatc aggcgcaccg attcaggtga caacacccgg accccgtcga 1539901 ccgtcccgtc gcccaacagc atccgcacga accgcaggta gtcatcggcg gtcgaccaca 1539961 acccgccgcc ggcgttacag aacgacggcg gcgtgacgtg tggcggcccc atcacgtcgt 1540021 gccgcaaccg gtcttgttcg tcgagccggt acatggtcgc ggcccgtcgc tgcgcgtcgg 1540081 ccgacacgta gaagccggtg tcggtcattc ctgccggacc cagcactcgc tcgtcgatga 1540141 tctggtacag cggtgcgtcc tcgatgcggg agacaatgac acccaagacg tcgatggcgt 1540201 ggctgtaggt cacccggtcg ccaggttggt gcacgagcgg aagggttgcc agcgctgcca 1540261 gccaaacgtc gggaccctgg ccgaacggca gtcgctgata ggcccgcgaa attggccccg 1540321 acaccgagaa accgtaagcc aggccgctgg tgtgagtgag caggtcctcg atcaaaatgg 1540381 ctcgtcgcgc gggatgtgtg cgatccagcg ggccggcggc atcgtccagc acggccacct 1540441 tgcagagctc cggtgcccaa cgcgtgatcg ggtcacgcag tgccagtttg ccctcgtcga 1540501 ccaggctcat cgccgccgcc accgtgaccg gcttggtcat cgacgcgatg cgaaacagcg 1540561 tgtcgcgttg catgggcacg cccgcgtcga tatcgcgata gccgatctcg ttgacttgca 1540621 acaatttttc gcgctgccag accatggtta ccgcgccgga aagcaggccc gcgtcgcata 1540681 cctcgcggat ggacgcctga ttgccgtcga gattcacccg gttcaggata ctgtccgagc 1540741 cagcgcggct cggcggatta ctgattgtgc gaacgttttc ccgcgcaccg gtcgcgtgtt 1540801 actgtcgcgc tctccggcga atgtgatctg gggaacatgc tgtgagcgcg gcggcatgct 1540861 agtgacgatg gtgtcgctgc tggtgaacca gggtgtgggt aggcagtcac cgagacccgc 1540921 aaccatggac ggggctggat tcgaggctcc gtgcatgccg tacgactagg ggtagcgccc 1540981 agctgctcaa taccatcggt tggataacaa aggctgaaca tgaatggctt gatctcacaa 1541041 gcgtgcggct cccaccgacc ccggcgcccc tcgagcctgg gggctgtcgc gatcctgatc 1541101 gcggcgacac ttttcgcgac tgtcgttgcg gggtgcggga aaaaaccgac cacggcgagc 1541161 tccccgagtc ccgggtcgcc gtcgccggaa gcccagcaga tcctgcaaga cagttccaag 1541221 gcgacgaagg gcctgcattc cgtccacgtg gtggtgacgg taaacaatct ctcgaccctc 1541281 ccgtttgaga gcgtcgatgc cgacgtgacc aaccaaccgc agggcaatgg ccaggcggtg 1541341 ggcaacgcca aggtcagaat gaagcccaac accccggtgg tggccaccga gttcctggtc 1541401 acgaacaaga ccatgtacac gaagcggggc ggcgactatg tctcggtggg tccggcggag 1541461 aagatctatg acccgggcat catcctggac aaggaccggg ggctgggcgc ggtcgtcggg 1541521 caagtgcaaa acccgacaat ccagggacgt gacgccatcg acggcctggc caccgtcaag 1541581 gtgtccggga ccatcgacgc cgcggtgatc gatccgatcg tgcctcagct aggtaagggt 1541641 gggggcaggc tcccgataac cttgtggatc gtcgacacca acgcctcaac gccggcaccc 1541701 gccgcgaacc tggtgcggat ggtcattgac aaggaccaag gcaacgtcga catcacgctg 1541761 tccaattggg gtgcgccggt caccatcccg aacccggcgg gataacaggc gcgaaccggc 1541821 ccggtccagc cccatcgctg gtcgatggcc tggccggtcc ggtactcgtc cgcgggcgga 1541881 ggccgccttc gaagaaatcc tttgagaatt cgccaaggcc gtcgacccag catggggtca 1541941 gctcgccagc ctgaaccgcc ccggtgagtc cggagactct ctgatctgag acctcagccg 1542001 gcggctggtc tctggcgttg agcgtagtag gcagcctcga gttcgaccgg cgggacgtcg 1542061 ccgcagtact ggtagaggcg gcgatggttg aaccagtcga cccagcgcgc ggtggccaac 1542121 tcgacatcct cgatggaccg ccagggcttg ccgggtttga tcagctcggt cttgtatagg 1542181 ccgttgatcg tctcggctag tgcattgtca taggagcttc cgaccgctcc gaccgacggt 1542241 tggatgcctg cctcggcgag ccgctcgctg aaccggatcg atgtgtactg agatccccta 1542301 tccgtatggt ggataacgtc tttcaggtcg agtacgcctt cttgttggcg ggtccagatg 1542361 gcttgctcga tcgcgtcgag gaccatggag gtggccatcg tggaagcgac ccgccagccc 1542421 aggatcctgc gagcgtaggc gtcggtgaca aaggccacgt aggcgaaccc tgcccaggtc 1542481 gacacatagg tgaggtctgc tacccacagc cggttaggtg ctggtggtcc gaagcggcgc 1542541 tggacgagat cggcgggacg ggctgtggcc ggatcagcga tcgtggtcct gcgggctttg 1542601 ccgcgggtgg tcccggacag gccgagtttg gtcatcagcc gttcgacggt gcatctggcc 1542661 acctcgatgc cctcacggtt cagggttagc cacactttgc gggcaccgta aacaccgtag 1542721 ttggcggcgt ggacgcggct gatgtgctcc ttgagttcgc catcgcgcag ctcgcggcgg 1542781 ctgggctccc ggttgatgtg gtcgtagtag gtcgatgggg cgatcggcac acccagctcg 1542841 gtcagctgtg tgcagatcga ctcgacaccc caccgcaaac catcggggcc ctcgcggtgg 1542901 ccctgatgat cggcgatgaa ccgggtaatt agcgtgctgg ccggtcgagc tcggccgcga 1542961 agaaagccga cgcggtcttt aaaatcgcgt tcgcccttcg caattcggcg ttgtcccgcc 1543021 gcaagcgctt cagctcagcg gattcttcgg tcgtggtccc gggccgtgcg ccggcatcga 1543081 cctgcgcctg gcgcacccac ttacgcaccg tctccgcgca gccaacacca agtagacggg 1543141 cgacctcact gatcgctgcc cactccgaat cgtgctgacc gcggatctct gcgaccatcc 1543201 gcaccgcccg ctcacgcagc tccggcgggt acctcctcga tgaaccacct gacatgaccc 1543261 catcctttcc aagaactgga gtctccggac atgccggggc ggttcagccg cgccggctgg 1543321 caaccgttcc cgctcgagaa agacctggag gaataccagt gacaaacgac ctcccagacg 1543381 tccgagagcg tgacggcggt ccacgtcccg ctcctcctgc tggcgggcca cgcttgtcag 1543441 acgtgtgggt ttacaacggg cgggcgtacg acctgagtga gtggatttcc aagcatcccg 1543501 gcggcgcctt cttcattggg cggaccaaga accgcgacat caccgcaatc gtcaagtcct 1543561 accatcgtga tccggcgatt gtcgagcgaa tcctgcagcg gaggtacgcg ttgggccgcg 1543621 acgcaacccc tagggacatc caccccaagc acaatgcacc ggcatttctg ttcaaagacg 1543681 acttcaacag ctggcgggac accccgaagt atcgattcga cgaccccaac gatctgctgc 1543741 accgggtcaa agcgcggcta gccgagccag cgctggccgc ccggatcaag cgcatggaca 1543801 cactcttcaa cgccatcgtt gcagtactgg ccgtgggtta tttcgcggtt cagggtgtgc 1543861 ggttggtgga accgagctgg atgccgctgt gggccttcgt gattgcgatg gttctgctgc 1543921 gcagttcgtt ggccgggttc ggtcattacg cactgcaccg cgcgcaacga ggcctcaacc 1543981 gggttttcaa caatgccttc gatctcaact atgtggcctt gtccttagtc accgccgacg 1544041 gacacaccct gctgcaccac ccgtataccc agagcgaggt ggacatcaag aagaacgtgt 1544101 tcacgatgat gatgcggcta ccgtggttgt atcgcgttcc cgtacatacg attcacaaat 1544161 ttggccacat gctcagcggc atggcgatcc ggatcgtcga cgtcttcagg atcacgcgca 1544221 aggtaggtgt cgaggaatcc tacggaagct ggcgcgccgc gcttccacac ttccttggat 1544281 cggccggggt gcgcttgctt ctggtgagtg aattggtggt cttcgcgatc gccggcgact 1544341 tctggccctg ggcactgcaa ttcgtagcga cgctgtgggt tagtaccttc ttggtggtgg 1544401 cgagccatga gttcgaggac gacacccagg gcggtgccgt caacggcgag gactggggca 1544461 tagatcaact cgagcacgct aatgacctaa cggtgatcgg gaaccgctac gtcgactgct 1544521 tcctgtcagc cggcctgagc tcccaccgag tccatcacgt gctgccgttt cagcgcagcg 1544581 gcttcgcgaa catcgtcacc gaggacgttt tgcgtgagga agcagcgaag ttcggtgtcg 1544641 agtggcttcc cgcaaagggt ttcatcaccg atcggctgcc gaggctgtgt cggaagtatc 1544701 tgttgacgcc gtcgcgccaa gccaaggagc gtcattgggg tttcgtccgc gagcactgct 1544761 cgccggcggc attgaaagcc agtgccagct acgtggttgc gggtttcgtc ggaatcgggt 1544821 cggtatgaac gtctcagctg agagcggtgc gccgcgccgg gccggccaga ggcatgaggt 1544881 tggccttgcc cagttgccgc cggctccgcc caccacggtg gcggtgattg aagggcttgc 1544941 gacgggcacg ccgcgtcggg tagtcaacca gtccgacgcc gccgatcggg tcgccgagct 1545001 tttcctcgat cccggtcagc gggaacggat tccgcgggtg tatcaaaaat cgcggatcac 1545061 cacgcgccgg atggcggtcg acccgctcga cgccaaattt gatgtcttca ggcgggaacc 1545121 tgcgacgatc cgtgatcgga tgcatctgtt ctacgaacac gcggttccgc tggcggtgga 1545181 cgtgagcaag cgtgccctgg ccggcctgcc ataccgtgcc gccgagatcg ggctgctggt 1545241 gttggccacc agcaccggat tcatcgcgcc gggcgtggac gttgcgatcg tcaaagagct 1545301 cgggctctcc ccgtcgatat cacgtgtcgt ggtcaatttc atgggatgtg ccgccgcgat 1545361 gaatgccctg ggcaccgcca ccaactatgt tcgtgcccac ccggccatga aggcgctggt 1545421 ggtgtgtatc gaattgtgct cggtgaacgc tgtttttgcc gacgacatca acgacgtcgt 1545481 cattcacagc ttgtttggcg acgggtgcgc ggcgttggtg atcggcgcca gccaggttca 1545541 ggagaagctc gagccaggca aggtggtagt ccgcagtagt ttcagtcagc tgctcgacaa 1545601 caccgaagac ggtatcgtgc ttggcgtcaa tcacaacggc atcacctgcg agctgtcgga 1545661 gaatctcccc ggctacatct tcagcggggt cgcaccggtg gtgacagaga tgttatggga 1545721 caatggatta cagatatccg atatcgatct ctgggcgatc catccgggtg gccccaagat 1545781 catcgagcag tcggtgcgct cgctggggat ctccgcggag ctggcggcgc agagctggga 1545841 cgtgctcgcc cgcttcggca acatgctcag cgtatcgctt atctttgtgc tagagacgat 1545901 ggtgcagcag gcggagtcgg ccaaagccat ctcgacgggg gtggcgttcg cgttcgggcc 1545961 gggcgtcact gtcgaaggca tgctgttcga catcatccga cggtgaccgc catgaattca 1546021 gaacacccga tgaccgaccg ggttgtgtat cgatcgttga tggccgacaa cctgcgatgg 1546081 gatgccctgc aattgcgcga cggcgacatc attatctcgg cgccgtccaa gagcggcctg 1546141 acctggacac agcgcctggt gtccctgctg gtgttcgacg ggcccgactt gcccggaccc 1546201 ttgtcgacgg tgtccccgtg gctcgaccag accattcggc ccatcgagga agtggtcgct 1546261 actctcgatg cccagcagca ccgccggttc atcaagaccc acacgccgtt ggacggcctg 1546321 gtgctcgacg accgcgtcag ctacatctgc gtaggacgcg acccgcgcga tgccgcggtg 1546381 tcaatgctgt accaatcggc caacatgaac gaagaccgga tgcggattct gcacgaggcc 1546441 gtagtgccgt ttcacgagcg aatcgccccc ccgtttgcgg aactcggtca tgcgcgcagc 1546501 ccgaccgagg agttccggga ttggatggag gggccgaatc agcctccccc tggcataggt 1546561 ttcacacatc tgaaggggat cggcactctg gccaacatcc tgcaccagct aggcacggta 1546621 tgggtccgcc gtcacctacc caacgtggcc ttgtttcatt acgccgatta ccaggcggac 1546681 ttggcgggcg agctgctccg gccggcaagg gtcctcggta tcgccgcgac ccgcgatcga 1546741 gcccgggacc tggcgcagta cgccacgctg gatgcgatgc gctcccgcgc gtcagaaatc 1546801 gctcctaaca ccaccgacgg catctggcac agtgacgagc gtttcttccg ccggggcggg 1546861 agtggcgact ggcagcagtt cttcaccgaa gccgagcacc tgcgctacta ccaccgcatc 1546921 aaccagctgg cgccacctga tctgctggcc tgggcacacg agggccgccg gggatacgac 1546981 ccggccaact gaggttcagt gccgcattct ctcctgtcag ttgctgcact ttagacgctc 1547041 aatgcgctgc gacaacatta aatgtcagca gtcacaccca gtgtggggga aatttgcata 1547101 tgcgatttag ttgtgtgtag cttgttttgc tgtctgtacg actgcaccga ggggtgagcg 1547161 cgtgtcgcac gaaagtctgt tcgaagaaag cgaagcgccc tacgcggcgc tgtgcgtagt 1547221 tgccaacttc acgacagacg gcgagtgagc aggcgctcat caccagggct acgagcccag 1547281 cacaggggac gcggtgaagc gcatgtccca cgaatccgtg ttccaacaga gtgaagcgct 1547341 ctacacggca tatttttcgc ccaacggcga atgagcgagc gccgatcggt gcgttaggcc 1547401 gggcgggcga ccgcccccgt cgccccttta agtgcgcatg tgcgtagtcc agtcgagggt 1547461 cgggagctgg cccagtgccc caagatgcga tgcggctggc cacattctca tccgcaacgc 1547521 tagttaccac aagtcacacc atacccattt tggcagaaac tattgcacat acagataatt 1547581 gtcggtagct tgtcttgcgg tgcagagaac ggaggaggga atcgcgtgcc ccacgaaatc 1547641 ttgtttgacg cggacgaaaa ggcattctcg gcgttttgca ttatctcgtt tacgaccgac 1547701 agcgagtgaa gctgcggtca tcgggggcgc cactcccaga gaggagagga ggtgaatcgc 1547761 atgtcacagg aaaccttgtt ccaagaaagc caagcgctct acgccgcgta tttctttgcg 1547821 gccgacggtg aatgaccggt cgccgattgg cgcgattccc cgcattcagg gctggcgtag 1547881 cgcaagacga tgacgtgggg tcgaccctga gtcagggctc gacgacaggt gtgttgtcgg 1547941 gcccgaattg gtcgtactgg ccaagccgtg tattagggtc tgcggacccg acgacgatcg 1548001 ctcaccggca cggcacccac cgcatcacta gcccggacga gacctggctg gccctgcagc 1548061 cctttctcgc gccagcaggc attaccgggg tcgccgacgt gacatggctg gattgtcttg 1548121 gcattccaac ggttcaggcg gtgcgcccgg catcgctgac gttgtcggtc agccagggca 1548181 aagccgccag ctatcgggct gcccaggtct cggcggtgat ggagtccttg gagggatggc 1548241 acgccgagaa cgtcactgcc gacttgtggt ctgcgaccgc ccgggatctc gaggcagacc 1548301 tgacttacga ccccgcccaa cttcgccacc ggccgggcag cctctaccac gccggcgtca 1548361 agctcgattg gatggtcgcg acgacgttgc tgaccggtcg ccggacctgg gtaccgtgga 1548421 cggcggtgct ggtgaacgtg gcaacccgcg attgctggga accgccgatg ttcgagatgg 1548481 acaccaccgg actggcctcc ggcaactgct acgacgaggc caccttgcac gccttgtacg 1548541 aggtgatgga gcggcatagc gtggctgcag cggtcgccgg agagaccatg ttcgaggtgc 1548601 caactgacga tgtcgccggc tctgacagcg cccacctggt tgagatgatc cgtgacgccg 1548661 gggacgatgt ggaccttgcc cgcatcgatg tctgggacgg ttactactgt tttgccgccg 1548721 agctcacctc cgcgacgctg gaggtgacct tcggcgggtt cgggttacac cacgacccta 1548781 acgtggcgtt atcgcgggcg atcaccgaag ccgcccagtc gcgcatcacg gcaatcagcg 1548841 gagcccgcga ggacctcccg tcggcgatct accaccggtt cggccgggtg catacatacg 1548901 cgaaggcgcg aaagacgtcg ttgcggctga accgcgcgcg gccgacaccg tggcgggtgc 1548961 ccgatgtcga ctcgctgccc gagttggtgg cgtcggcggc gacggcggtg gccaaccgat 1549021 ccggcaccga gccgctggcg gtcgtgtgcg acttcgccga tgcctgtgtc cccgtggtga 1549081 aggtgctcgc cccgggcctc gtgctgtcga gcgcatcgcc gatgcgcaca cccctacagg 1549141 aggctgaatg acggcctgcg gcaggattgt cgtcaccgct gggcccacga ttagcgccgc 1549201 ggacatccgc tcggtggtgc cggatgccga ggtggcgccg ccgattgcgt ttggccaggc 1549261 gctctcctat gacttgcggt cgggtgacac gctgctgatt gtcgacggat tgttctttca 1549321 gcagccgtcg gttcgacata aggagctttt gacgttgatg gccgacggtg tccgagtcgt 1549381 cggatcgtcg agcatgggcg ccctgcgggc cgctgagctg catccattcg gcatggaggg 1549441 ctatggctgg gtcttcgaaa gctaccgaga tggggtactc gaggccgacg atgaggtcgg 1549501 cgtggtgcac ggcgacgccg acgacggcta cccggtcttc gtcgacgcgc tggtgaacat 1549561 gcgccacacc ctggcgcggg ccgtcgcaac tggtgtggtg tgctccgagc tggccgagcg 1549621 gatcatcgag accgcgcggg ccacaccgtt caccatgcgc acctgggcgc ggctgctgag 1549681 tgaggtcggc gccccggacc agcgcggcct cgccgcacag ttgcggtcac tgcgggtcga 1549741 tgtcaaacac gccgatgcgc tgctggcgtt gcggcagctc ggccagcgcc cccgggtgga 1549801 gccgcttcgt ccgggtccgc cgcccaccgt gtggtcgcgg cggtggcggc agcgatgggc 1549861 accgcccacc tccgtcgccg catcggccga ccacggcgag tcttttgtcg acgtcaccga 1549921 cttggaggtc ttgtcgtttt tgagcgtgag ctcggttgac tactgggcct accggccagc 1549981 actgcaacag gtcgctgcct ggtactggac gttgaaacac cccgaacaat ccggaagcgt 1550041 cggtgagcgt gccgcacgag ccgtcgccga ggtggcatcg gagggctacg ggcgcgccct 1550101 ggaattcatt gcctatcgct acgcacttgc caccggcatc atcgacgaga ccggctttcc 1550161 cgaggcggtc gcagcgcatt ggctcaccac cgaagagcgc cacggcctgg gcaatgaccc 1550221 catctcgatc tcggcgcgag tgatcacccg cacgttgttc gtcgtccggt tattgccggc 1550281 gatcgaccat ttccttgacc tgctgcggaa ggactcccga ctgccccgat ggcgtgccat 1550341 ggcggcccac gcactctgca agcgcgacga tctggcccgg caaaagccgc acctgaacct 1550401 gggccggccc gatccgacgc aattgaagcg cctctttggg gcccgatggg ggacccaggt 1550461 gaaccgcatc gagttggccc ggcgtggact gatgaccgag gacgccttct atgctgccgc 1550521 caccccgttc gccgtcgcgg ccgtcgacga ccaactgccg cgcatcgagg tcggcacctt 1550581 aggacccgcg ccgctgagcg cggacgttcc agaacgccat ttcgacttcg gttccgtcta 1550641 actcgcggcg cacggtggcg ggctccagcg actcgatatc ccagccagcg ccaccgagga 1550701 cgtcgcgcag cgtttgctcg gataccgtcg accgcggcca ttcctcatcg ggcggcatgg 1550761 cgttggagaa gcagctgagt agcagggtgg cgcccggtcg ggtggcccgg tgcaccgagg 1550821 cggcgtagct gcgcttgccg tcgtcgtcta ggcagtggaa catcccgcag tcgatcacgg 1550881 tatcgaacgc gccggtgtag ccggtcagct tggtggcgtc acccactgcg aacttgacat 1550941 cgactccggc gtcgctggct cgccgtttgg cggtggtcag cgcggtggga gagatgtcca 1551001 acccggtcac ctggtagccg ttcctggcga ggtagatcgc gttgtcaccg agcccgcacc 1551061 cgatgtcgag cacgtcgccg tgcacccagc cgccggtgtg ccagccgatg acattgtcct 1551121 tgggcgcttt ggtgtcccac ggcggtgtcg tgatcggcgg gaggccctcg ccggggcttt 1551181 cgccacggta gagcgcgtcg aaatctatac ctggcatgct ggccagctta ggcggcgtgt 1551241 aggtgggtga gggcgacacc gattctggct tccacctggc taacgtctat ctccaacggc 1551301 ccgggcagtg gtggcgcggt gcagtgatag tacatcccgg tgggcgtggt gaattcagct 1551361 gtgtgacggc cggtttcgtc cgtgtcggtg ctgacgcgcc agccgggagc ttccttgacg 1551421 tagttacagc gttcacacga tccgaggccg ttggtcgcgg tggtcgggcc gcctcgatga 1551481 tgcggctggg cgtggtcacg gtggcggatc ggggcatcgc agtagggcat gcgacagcgc 1551541 tgatcgcgca acccgatgaa cgcggccagc cccttcggga accggcgtgc ccgcgattcc 1551601 atcgccacca aggcccccga gcgcggatga cggtagagcc ggcgcagcgt ggcccgtgac 1551661 cgcgtatcgg caaccgcgtc gcgcaccagg ttgcgggcca cggccgccgg gatggggcca 1551721 tacccgtcga ccaccgccgg ggcgcggtcg ccagctaaca gtgtctcgtc ggagagcacc 1551781 aggttgaccg ctaccggttg ggccgcctcg gcgggttgtc cggtgacccg ctcgaccaac 1551841 gtgtcggcca ttacctggcc ccgtgtccga tcgtcgaatg tcgtgtcggc ggcccgcttg 1551901 agcgccgcat agaccgacac gcctcgggcc accggaagca acgccgtcac ccaggtcatg 1551961 gtgtcggggg ccgggcggat cgtcaccgtg cgttcggtct cggccctggc ggcccgctcc 1552021 accaccgcct gggcatcgag ccggtaggca atcgcccggg ccgcggcggc gatccgcgca 1552081 tcacccatcc cgtccaatgc ggacatgtcg gcgcacagct cggcgtcgag tgcgcggcga 1552141 tcctcgacgt ccaggcaggc cgactcccgc acgatcagcg tggcccgcca ctccgatagc 1552201 cgcccgacct cgagcgcggc gagtgtgtgc ggcatctcat acaccaacgc cttcgcgaac 1552261 cccaggtggc gcccgccgcg cgccggcgaa tcccgtcgcg ccagcgctac ttcactggcc 1552321 accccacgcc cgcgccgccg tgccggcacc cccgcatccg cctcattgca gcgacgcaac 1552381 ttgtccagcg ccgccgcagc acgtgcctga ccggcggccg cggccgattt gacccgctcc 1552441 agctcggcga tccgcgcggt caggctcgcc tcatcgtcgc gcgaatccac gcccgcgagg 1552501 ctcactaaat cgaacatgtg ttcgagtata gcaggcctgg gccaccgcgg ccaccgcacc 1552561 gcgggcccgc agcgtgcgag tgctacgctg ccgagcggtc gacatccttt aacgatccgt 1552621 ccagagaggc ggagaaggag gtcaaggttt cccatgggtg ctgcgggtga tgccgcaatc 1552681 ggccgggagt cccgcgagtt gatgtccgcg gccgacgtcg gccgcacgat ttcgcgcatc 1552741 gcgcatcaga ttatcgagaa gaccgcgtta gatgacccag tcggacccga cgcgccgcgg 1552801 gtggtgctgc tgggaatccc gacccgtggc gtgacgctgg cgaatcgcct ggccggcaat 1552861 atcaccgaat acagcggcat ccacgtcggc catggcgcgc tggacatcac cctgtaccgc 1552921 gacgatctga tgatcaagcc gccgcggccc ttggcgtcga cgtcgatccc ggccggtggg 1552981 atcgatgacg cgctggtgat cctggtcgat gacgtgctct actccgggcg ctcggtgcgt 1553041 tccgccctgg acgcgctgcg cgacgtgggc cggccgcggg cggtgcaatt ggcggtgctg 1553101 gtcgacaggg gtcaccggga actgccgctg cgcgccgact atgtgggcaa gaacgttccg 1553161 acctcgcgca gcgagagcgt gcacgtgcgg ctgcgcgagc acgacggccg tgacggcgtg 1553221 gtgatctcgc gatgacccca aggcacctgc tgaccgccgc cgacctcagc cgcgacgacg 1553281 ccaccgccat cctcgacgac gccgaccggt ttgcgcaggc gctggtcggt cgcgacatca 1553341 agaagctgcc gacgctgcgg ggccggaccg tcgtcacgat gttctatgag aactccaccc 1553401 gcacccgggt gtcgttcgag gtagcgggta agtggatgag cgccgacgtg atcaacgtca 1553461 gcgctgccgg atcttcggta ggcaagggtg agtcgctgcg ggataccgcg ctgaccctgc 1553521 gcgcggccgg ggctgacgcg ctgatcatcc gccatcccgc gtccggcgcc gcccatctgc 1553581 tggcgcagtg gaccggcgcc cacaacgatg ggccggcggt gatcaacgcc ggtgacggca 1553641 ctcatgaaca ccccacgcag gcgctgcttg atgcgctgac catccgtcag cgcctcggcg 1553701 gcatcgaagg ccggcgcatc gtgatcgtcg gcgacatcct gcacagccgg gtcgcccgct 1553761 ccaacgtcat gctgctggac accctgggcg ccgaggtggt gctggtggcg ccacccacat 1553821 tgctaccggt cggggtgacc ggctggccgg ccaccgtctc ccacgacttc gatgccgagc 1553881 tgcccgccgc cgacgcggta ttgatgctgc gggtacaggc cgagcggatg aacggcggtt 1553941 ttttcccgtc cgtacgggag tactcggtcc gctacgggct aaccgagcgg cgccaggcga 1554001 tgcttcccgg ccacgccgtg gtgttgcacc cgggaccgat ggtgcgtggc atggagatca 1554061 catcttcggt cgcggactcg tcgcaatcgg ctgtgctgca acaggtttcc aatggagtcc 1554121 aggtgcggat ggcggtgctg ttccatgtgc tggtgggagc gcaggatgcc ggtaaagagg 1554181 gtgcggcgtg agcgtgctga ttcgtggtgt gcggccctac ggcgaggggg agcgggtcga 1554241 cgtactcgtc gatgacggcc agatcgccca gataggaccg gatctggcga tccccgatac 1554301 ggccgatgtc attgacgcca ccggacacgt gctgctgccc gggttcgtcg atctgcacac 1554361 ccatctgcgc gagccgggcc gcgagtatgc cgaggacatc gaaaccggtt cggccgcggc 1554421 cgctttgggc ggctacaccg cggtgttcgc gatggccaac accaaccccg tggccgacag 1554481 cccggtggtc accgaccacg tctggcaccg cggccagcag gtcggcctgg tcgacgtgca 1554541 ccccgtcggc gcggtcaccg tcgggctggc cggagccgag ctgaccgaga tgggcatgat 1554601 gaacgccggc gccgcccagg tgcggatgtt ctccgacgac ggggtctgcg tgcatgaccc 1554661 gctgatcatg cgccgcgccc tggaatatgc caccggtttg ggcgtgctga tcgcccagca 1554721 cgccgaggag ccccggctga cggtcggcgc cgtcgcgcac gagggaccca tggcggcgcg 1554781 gctgggcctg gcgggatggc cgcgggccgc cgaggaatcg atcgtcgccc gcgacgcctt 1554841 gctggcccgt gacgccggcg cccgggtgca catctgtcac gcgtcggccg cgggcaccgt 1554901 cgaaatcctg aaatgggcta aggaccaggg tatttcgatc accgccgagg tcacccccca 1554961 ccacctgttg ctcgacgatg ccagattggc cagctatgac ggcgtgaacc gggtcaaccc 1555021 gccgctgcgc gaagcttccg acgcggtcgc cctgcgacag gcgctggccg acgggatcat 1555081 cgactgtgtg gccacagatc acgccccgca tgccgagcac gagaaatgcg tcgaattcgc 1555141 cgcggcccgg cccggcatgc tcgggttgca gacggcattg tcggtggtgg tgcagacaat 1555201 ggtggcgccc ggcttgttga gttggcgcga tatcgcgcgg gtgatgagtg agaacccggc 1555261 gtgcatcgca cgcttgcccg atcagggccg gccactggag gtgggggagc cggccaacct 1555321 gacggtggtg gaccccgacg ccacctggac ggtcaccggc gccgacctgg ccagccggtc 1555381 ggccaacacg ccgtttgagt cgatgagcct gcccgccacc gtgaccgcga ccctgctgcg 1555441 cgggaaggtg accgcgcgcg acgggaagat ccgggcatga actccggcac gctggcgggg 1555501 tcgctgatct tcgcggcggt gctcgtcatg ctgatcgcgg tgctcgctcg gctgatgatg 1555561 cgcggctggc ggcgccgttc ggagcggcag gcggagctgc tcggcgactt gcccgacgtg 1555621 cccgagcacg tgagctcggc cacggtcacc acccgcggcc tgtacgtggg cgccacgctg 1555681 tcgccggcct ggaacgagcg ggtcaccgtc ggtgatctcg ggtatcgcag caaggcggtg 1555741 ctcacccggt atccgtcggg catcatggtg gaacgcgcac gggctcagcc gatttggatt 1555801 cctacggagt cgatcgccgc cattcgcatg gaacgcggcg tcgccggcaa ggtggtggcc 1555861 ggcatcggga tactcgcgat ccgttggcga ctgccgtccg gcaccgagat cgatgtcggg 1555921 tttcgggcag acaaccgcga cgaataccag gagtggctgg aggaacccgt ttgagcaaag 1555981 ccgtattggt cctcgaagac ggccgggtgt tcaccggcag gccgttcggc gcgaccggac 1556041 aagcgctcgg ggaggccgtg ttttccaccg gcatgtccgg ttatcaggag acgctgaccg 1556101 atcccagcta tcaccgtcag atcgtggtgg ccaccgcgcc gcagatcggc aacaccggct 1556161 ggaacggcga ggactccgaa agccgagggg agcggatctg ggtcgccggt tacgcggtgc 1556221 gcgacccgtc gccgcgcgcg tccaactggc gcgccaccgg cacgttggaa gacgaactca 1556281 tccgccagcg catcgtcggg atcgccggca tcgacacccg ggccgtggtg cgccatctgc 1556341 gcagccgcgg gtcgatgaag gcgggggtgt tctccgacgg ggcgctggcc gagcctgccg 1556401 acttgatcgc gcgggtgcga gcacaacagt cgatgctggg cgccgatctg gccggcgagg 1556461 tcagcaccgc ggagccgtat gtcgtcgaac ccgacgggcc accgggtgtt tcgaggttca 1556521 ccgtggccgc cctagatctt ggtatcaaga ccaacactcc gcgtaacttc gcccggcgcg 1556581 ggattcgctg ccatgtgctg ccggcatcga ccaccttcga gcagatcgcc gaactcaacc 1556641 cgcatggcgt gttcttgtcc aacggccccg gcgacccggc caccgccgat cacgtcgtcg 1556701 cgcttacccg cgaggtgctg ggcgccggaa tcccgttgtt cggcatctgt ttcggcaacc 1556761 agatcctggg ccgcgcgctg ggcctgtcga cctacaagat ggtgtttggg caccgcggca 1556821 tcaacatccc ggtcgtcgac cacgccaccg gtcgggtggc ggtgaccgcg caaaaccatg 1556881 gcttcgccct tcagggggag gcgggccaat ccttcgccac cccgttcggt cccgcggtgg 1556941 tcagccacac ctgcgccaac gacggtgtgg tcgaaggcgt caagctcgtt gacgggcggg 1557001 cgttttcggt gcaataccac ccggaagccg ccgccggccc gcacgatgcc gagtacctgt 1557061 tcgaccagtt cgtggagctg atggcagggg agggccgcta gtgccccgtc gcaccgatct 1557121 gcaccacgtg ctggtcatcg gctccgggcc gatcgtcatc ggccaggcgt gcgagttcga 1557181 ctactccggg actcaggcgt gccgggtgct gcgcgccgag ggcttgcagg tcagcctggt 1557241 gaactctaat ccggccacca tcatgaccga cccggagttc gccgaccaca cctacgtaga 1557301 gcccatcacc ccggcgttcg tggagcgggt tatcgcccaa caggccgagc ggggcaacaa 1557361 gatcgacgcc ctgctggcga ccctgggtgg gcagaccgcg ctgaacaccg cggtcgcgct 1557421 gtacgagagc ggggtgctgg aaaagtacgg cgtggaactc atcggcgccg atttcgacgc 1557481 catccagcgc ggcgaggacc ggcagcggtt caaggacatc gtcgccaagg ccggtggcga 1557541 atccgcccgg agccgagtgt gtttcaccat ggccgaagtg cgtgagacgg tcgccgagct 1557601 cggcctgccg gtggtggtgc ggccgagctt caccatgggc gggctgggtt cggggatagc 1557661 gtactccacc gacgaggtcg accggatggc cggcgccggg ctggcggcct cgcccagcgc 1557721 caacgtgctc atcgaggaat cgatttacgg ctggaaggaa ttcgaactcg agctgatgcg 1557781 cgacggccac gacaacgtgg tggtggtgtg ctcgatcgaa aacgtcgacc cgatgggtgt 1557841 gcacaccggc gactcggtca ccgtcgcgcc ggcgatgacg ttgaccgacc gggaatacca 1557901 gcggatgcgc gacctgggca tcgcgatcct gcgcgaggtg ggtgtggaca ccggcggctg 1557961 caacatccag ttcgcggtca acccgcgcga cggtcggctg atcgtcatcg agatgaaccc 1558021 gcgggtgtcg cgttccagtg cgttggcgtc caaggcgacc ggctttccga tcgccaagat 1558081 cgccgccaaa ctggccatcg gttacaccct cgacgagatc gtcaacgaca tcacagggga 1558141 aacgccggcc tgtttcgaac ccaccctgga ctacgtggtg gtcaaggcgc cgcggttcgc 1558201 gttcgagaag ttccccggtg ccgatcccac cctgaccacc accatgaaat ctgtcggtga 1558261 ggcaatgtcg ttgggccgca acttcgtcga ggcgctcggc aaggtgatgc gctcgctgga 1558321 gacgacccgc gccgggttct ggacggcacc ggatcccgac ggcggcatcg aggaagccct 1558381 gacccggctg cggaccccgg ccgaaggccg gctctacgac atcgagctgg cgttgcggct 1558441 gggtgcgacg gtggaacggg tggccgaggc cagcggtgtc gacccgtggt tcatcgcgca 1558501 gatcaacgag ctggtcaatc tgcgcaacga actcgtcgcg gcacccgtgc tgaacgccga 1558561 gctgctgcgg cgcgccaagc acagcggact atcggatcac cagatcgcgt cgctgagacc 1558621 ggaattggcc ggcgaggccg gcgtgcggtc actgcgcgtg cgcctgggca tccacccggt 1558681 atacaagacg gtggacacct gcgcggcgga gttcgaagcc caaaccccct accactacag 1558741 cagctacgag ctcgaccccg ccgccgaaac agaggtggcc ccgcagaccg aaaggcccaa 1558801 ggtgctgatc ctcggttcgg ggcccaatcg gatcggccag ggtatcgagt tcgactacag 1558861 ctgcgtacac gcggcaacca cgttgagcca ggctggcttt gagaccgtga tggtcaactg 1558921 caacccggag acggtgtcca ccgactacga caccgcggac aggttgtact tcgagccgtt 1558981 gacgttcgag gacgtcttgg aggtctacca cgccgaaatg gaatccggta gcggtggccc 1559041 gggagtggcc ggcgtcatcg tgcagctcgg cggccagacc ccgctcgggc tggcgcaccg 1559101 gctcgccgac gccggggtcc cgatcgtggg caccccaccg gaggccatcg acctggccga 1559161 ggatcgcggc gcgttcggcg acctgctgag cgccgccgga ctgccggcgc caaagtacgg 1559221 caccgcaacc actttcgccc aggcccgccg gatcgccgag gagatcggct atccggtgct 1559281 ggtgcggccg tcgtatgtgc tcggtggtcg cggcatggag atcgtgtatg acgaagaaac 1559341 gttgcagggc tacatcaccc gcgccactca gctatccccc gaacacccgg tgctcgtcga 1559401 ccgcttcctc gaggacgcgg tcgagatcga cgtcgacgcg ctgtgtgatg gcgccgaggt 1559461 ctatatcggc gggatcatgg agcacatcga ggaggccggc atccactccg gtgactcggc 1559521 ctgtgcgctg ccaccggtca cgttgggccg cagcgacatc gcgaaggtgc gtaaggccac 1559581 tgaagccatt gcgcacggca tcggcgtggt ggggctgctc aacgtgcagt acgcgctcaa 1559641 ggatgacgtg ctctacgtcc tggaagccaa cccgagagcg agccgtaccg ttccgtttgt 1559701 atccaaggcc acagcggtgc cactcgccaa ggcatgcgcc cggatcatgt tgggcgccac 1559761 cattgcccag ctgcgcgccg aaggcttgct ggcggtcacc ggggatggcg cccacgcggc 1559821 gcgaaacgcc cccatcgcgg tcaaggaggc cgtgttgccg tttcaccggt tccggcgcgc 1559881 cgacggggcc gccatcgact cgctactcgg cccggagatg aaatcgaccg gcgaggtgat 1559941 gggcatcgac cgcgacttcg gcagcgcgtt cgccaagagc cagaccgccg cctacgggtc 1560001 gctgccggcc cagggcacag tgttcgtgtc ggtggccaac cgggacaagc ggtcgctggt 1560061 gtttccggtc aaacgattgg ccgacctggg ttttcgcgtc cttgccaccg aaggcaccgc 1560121 agagatgttg cgccgcaacg gtattccctg cgacgacgtc cgcaaacatt tcgagccggc 1560181 gcagcccggc cgccccacaa tgtcggcggt ggacgcgatc cgagccggcg aggtcaacat 1560241 ggtgatcaac actccctatg gcaactccgg tccgcgcatc gacggctatg agatccgttc 1560301 ggcggcggtg gccggcaaca tcccgtgcat caccacggtg cagggcgcat ccgccgccgt 1560361 gcaggggata gaggccggga tccgcggcga catcggggtg cgctccctgc aggagctgca 1560421 ccgggtgatc gggggcgtcg agcggtgacc gggttcggtc tccggttggc cgaggcaaag 1560481 gcacgccgcg gcccgttgtg tctgggcatc gatccgcatc ccgagctgct gcggggctgg 1560541 gatctggcga ccacggccga cgggctggcc gcgttctgcg acatctgcgt acgggccttc 1560601 gctgatttcg cggtggtcaa accgcaggtg gcgttttttg agtcatacgg ggctgccgga 1560661 ttcgcggtgc tggagcgcac catcgcggaa ctgcgggccg cagacgtgct ggtgttggcc 1560721 gacgccaagc gcggcgacat tggggcgacc atgtcggcgt atgcgacggc ctgggtgggc 1560781 gactcgccgc tggccgccga cgccgtgacg gcctcgccct atttgggctt cggttcgctg 1560841 cggccgctgc tagaggtcgc ggccgcccac ggccgagggg tgttcgtgct ggcggccacc 1560901 tccaatcccg agggtgcggc ggtgcagaat gccgccgccg acggccgcag cgtggcccag 1560961 ttggtcgtgg accaggtggg ggcggccaac gaggcggcag gacccgggcc cggatccatc 1561021 ggcgtggtcg tcggcgcaac ggcgccacag gcccccgatc tcagcgcctt caccgggccg 1561081 gtgctggtgc ccggcgtggg ggtgcagggc gggcgcccgg aggcgctggg cggtctgggc 1561141 ggggccgcat cgagccagct gttgcccgcg gtggcgcgcg aggtcttgcg ggccggcccc 1561201 ggcgtgcccg aattgcgcgc cgcgggcgaa cggatgcgcg atgccgtcgc ctatctcgct 1561261 gccgtgtagc gggtgccctg ccaccgcgcc gctaaatccc accagcatgg ggtggtgagc 1561321 ccagcgctcg tgtgaccaaa ctcaccgccc tgggccgtcg tcacgctgtg ttaacctctc 1561381 gttcaaatga tattcatatt caatagtggc gctaagtgtc cggttgaatc cccgttgaac 1561441 ccccaacaga tggagtctgt gtcgtgacgt tgcgagtcgt tcccgaaagc ctggcaggcg 1561501 ccagcgctgc catcgaagca gtgaccgctc gcctggccgc cgcgcacgcc gcggcggccc 1561561 cgtttatcgc ggcggtcatc ccgcctgggt ccgactcggt ttcggtgtgc aacgccgttg 1561621 agttcagcgt tcacggtagt cagcatgtgg caatggccgc tcagggggtt gaggagctcg 1561681 gccgctcggg ggtcggggtg gccgaatcgg gtgccagtta tgccgctagg gatgcgctgg 1561741 cggcggcgtc gtatctcagc ggtgggctat gaccgagccg tggatagcct tccctcccga 1561801 ggtgcactcg gcgatgctga actacggtgc gggcgttggg ccgatgttga tctccgccac 1561861 gcagaatggg gagctcagcg cccaatacgc agaagcggca tcagaggtcg aggaattgtt 1561921 gggggtggtg gcctccgagg gatggcaggg gcaagccgcc gaggcgtttg tcgccgcgta 1561981 catgccgttt ctggcgtggc tgatccaagc cagcgccgac tgcgtggaaa tggccgccca 1562041 gcaacacgtc gtcatcgagg cctacactgc cgcggtagag ctgatgccta ctcaggtcga 1562101 actggccgcc aaccaaatca agctcgcggt gttggtagcg accaatttct ttggcatcaa 1562161 caccattccc attgcgatca atgaggccga gtacgtggag atgtgggttc gggccgccac 1562221 cacgatggcg acctattcaa cagtctccag atcggcgctc tccgcgatgc cgcacaccag 1562281 ccccccgccg ctgatcctga aatccgatga actgctcccc gacaccgggg aggactccga 1562341 tgaagacggc cacaaccatg gcggtcacag tcatggcggt cacgccagga tgatcgataa 1562401 cttctttgcc gaaatcctgc gtggcgtcag cgcgggccgc attgtttggg accccgtcaa 1562461 cggcaccctc aacggactcg actacgacga ttacgtctac cccggtcacg cgatctggtg 1562521 gctggctcga ggcctcgagt tttttcagga tggtgaacaa tttggcgaac tgttgttcac 1562581 caatccgact ggggcttttc agttcctcct ctacgtcgtt gtggtggatt tgccgacgca 1562641 catagcccag atcgctacct ggctgggcca gtacccgcag ttgctgtcgg ctgccctcac 1562701 tggcgtcatc gcccacctgg gagcaataac tggtttggcg ggcctatccg gcctgagcgc 1562761 cattccgtct gctgcgatac ccgccgttgt accggagctg acacccgtcg cggccgcgcc 1562821 gcctatgttg gcggtcgccg gggtgggccc tgcagtcgcc gcgccgggca tgctccccgc 1562881 ctcagcaccc gcaccggcgg cagcggccgg cgccaccgca gccggcccga cgccgccggc 1562941 gactggtttc ggaggcttcc cgccctacct ggtcggcggt ggcggcccag gaatagggtt 1563001 cggctcggga cagtcggccc acgccaaggc cgcggcgtcc gattccgctg cagccgagtc 1563061 ggcggcccag gcctcggcgc gtgcgcaggc gcgtgctgca cggcggggcc gctcggcggc 1563121 gaaggcacgt ggccatcgtg acgaattcgt cacgatggac atgggtttcg acgcggcagc 1563181 tccggcccca gagcaccagc cgggtgcccg ggcgtccgac tgtggtgcgg gacctatcgg 1563241 atttgctggc acggtgcgca aagaggcggt cgtgaaagcg gcggggttga ccacgctggc 1563301 cggtgacgac ttcggcggcg gcccaacgat gccgatgatg cccggcacct ggacccatga 1563361 tcagggcgtg ttcgacgagc atcgctgata gctgactggg cagtggctgg caaacagctg 1563421 agagagcact cgagagctat cgtcagggca atgtccgatg atgctgagca cccgcgtttg 1563481 gggcactagc agccacgatg atccttgttg ggttgcaccg cggagatgtc ggcgaaaatt 1563541 ggcagggttg cgttgacgca accatggcgc gacacgcgcg ataggtcgcc caaccgcgag 1563601 tgatccccgg cactgcgagt tgcgacgcca cctgccgcca ccagtcgtcg gccgtcgtcg 1563661 accggttgag caggtccgga aagccgaaat ccattgttag gcaacactat tcatgtccca 1563721 tgccagccat gccggcacgg acacggggct ccgtcgagag gccttcgagg tcgcccggcg 1563781 gaccgctggc cggtggcacg tgctactccc acgctgcacg tttgtcccca aaaccagggg 1563841 gtcgggttag atttcgtcag gaagcctgag tacggtcgtc tgcgctggcc ggcgtacccg 1563901 gccgggacaa acaacgatcg attgatatcg atgagagacg gaggaatcgt ggcccttccc 1563961 cagttgaccg acgagcagcg cgcggccgcg ttggagaagg ctgctgccgc acgtcgagcg 1564021 cgagcagagc tcaaggatcg gctcaagcgt ggcggcacca acctcaccca ggtcctcaag 1564081 gacgcggaga gcgatgaagt cttgggcaaa atgaaggtgt ctgcgctgct tgaggccttg 1564141 ccaaaggtgg gcaaggtcaa ggcgcaggag atcatgaccg agctggaaat tgcgcccacc 1564201 cgccgccttc gtggcctcgg tgaccgtcag cgcaaggccc tgctggaaaa gttcggctcc 1564261 gcctaacccc gccggccgac gatgcgggcc ggaaggcctg tggtgggcgt acccccgcat 1564321 acgggggaga ggcggcctga cagggccagc tcacaattca ggccgaacgc cccgtggggg 1564381 gaacccgccc aggagcgcca gtgagcgtcg gcgagggacc ggacaccaag cccaccgcgc 1564441 gtggccaacc ggcggcagtg ggacgtgtgg tggtgctgtc cggtccttcc gcggtcggca 1564501 aatccacggt ggttcggtgt ctgcgcgagc ggatcccgaa tctgcatttc agtgtctcgg 1564561 ccacgacgcg ggcgccacgc ccgggcgagg tcgacggtgt cgactaccac ttcatcgacc 1564621 ccacccgctt tcagcagctc atcgaccagg gtgagttgct ggaatgggca gaaatccacg 1564681 gcggcctgca ccggtcgggc actttggccc agccggtgcg ggcggccgcg gcgactggtg 1564741 tgccggtgct tatcgaggtt gacctggccg gggccagggc gatcaagaag acgatgcccg 1564801 aggctgtcac cgtgtttctg gcgccaccta gctggcagga tcttcaggcc agactgattg 1564861 gccgcggcac cgaaacagct gacgttatcc aacgccgcct ggacaccgcg cggatcgaat 1564921 tggcagcgca gggcgacttt gacaaggtcg tggtgaacag gcgattagag tctgcgtgtg 1564981 cggaattggt atccttgctg gtgggaacgg caccgggctc cccgtgaccc acgtcgtgac 1565041 tagtcagtat ttagctttcc aagccgctct acgccgccag gagaaatttc acgtgagtat 1565101 ctcgcagtcc gacgcgtcgt tggccgccgt ccccgccgtg gatcagttcg atccgtcgtc 1565161 aggtgcatca ggtggctacg acaccccgct gggcatcacc aatccgccca tcgacgagtt 1565221 gctggaccgc gtctcgagca aatacgccct cgtgatctat gcggcaaagc gtgcccggca 1565281 gatcaacgac tactacaacc agcttggcga gggcatcctc gaatatgtcg gtccgctggt 1565341 tgagccgggg ttgcaagaga agccgttgtc catcgcgttg cgcgagatcc acgccgatct 1565401 gctcgagcac accgagggcg agtagcaggg caggcctgag gtggtggacc ataaacggat 1565461 ccccaagcag gtaatagtcg gtgtctccgg gggcatcgcc gcctacaagg cgtgcacggt 1565521 tgttcgtcaa ctcaccgagg ccagtcatcg cgtccgagtc attcccaccg aatccgccct 1565581 gcgcttcgtc ggtgccgcga ccttcgaggc gctctccggt gagccggtgt gcaccgacgt 1565641 tttcgccgac gttccggcgg tcccgcatgt tcacctcggc cagcaggccg atctggtcgt 1565701 agtggcgccg gccaccgccg acctgctggc ccgcgcggcg gccggtcgag ccgacgatct 1565761 gctgaccgcg acgctgctga cggcgcggtg tccggtgctg ttcgcgccgg cgatgcacac 1565821 cgagatgtgg ttgcatccgg ccaccgtcga caacgtggcc acgctgcgcc gccgcggcgc 1565881 ggtggtgctc gagcccgcga caggacggct taccggcgcc gacagcgggg ccggccgact 1565941 gcccgaggcg gaggagatca ccaccctcgc ccagctgctg ctggagcggc acgacgccct 1566001 gccctacgat ctcgcggggc gaaagctgct ggttaccgcc ggtggcacac gcgagccgat 1566061 cgatccggtg cgctttatcg gcaaccgcag ctccggcaag cagggctatg cggtggcgcg 1566121 ggtggccgcc cagcgcggcg ccgacgttac tttgatcgct gggcataccg cagggctcgt 1566181 cgatcccgcc ggcgtcgagg tggtgcacgt cagctcggcc cagcaactcg ccgacgcggt 1566241 gtccaagcac gctccgaccg ccgacgtatt ggtgatggcg gcggccgtcg ccgacttccg 1566301 gcccgcgcag gttgccaccg ccaaaatcaa gaaaggcgtc gaaggcccac cgaccatcga 1566361 gctgctgcgc aacgacgacg tgctggccgg ggtggtgcgg gcccgagccc atggacaact 1566421 gcccaacatg cgggccattg tgggcttcgc agccgagacc ggcgacgcca atggcgacgt 1566481 gctctttcat gcccgagcta aactgcgacg caaaggctgc gatctgttag tcgtcaatgc 1566541 cgtcggcgaa ggcagggcct ttgaggtaga cagcaacgac ggctggctac tggcgtccga 1566601 tggtaccgag tcggcattgc agcacggctc caagacactg atggcgagcc gtatcgttga 1566661 tgcaatcgtc acgttcctgg caggctgtag cagctaacgg gtccggcggc cggttctgta 1566721 cgggtcctgg acaggtgctg gacgatccct tgctcgattg gacgagctga gattgatgcc 1566781 tgaggatata attcggctaa ctatttatcg gaaggatgac gatagtgagc gaaaagggtc 1566841 ggctgtttac cagtgagtcg gtgacagagg gacatcccga caagatctgt gacgccatca 1566901 gcgactcggt tctggacgcg cttctagcgg cggacccgcg ctcacgtgtc gcggtcgaga 1566961 cgctggtgac caccgggcag gtgcacgtgg tgggtgaggt gaccacctcg gctaaggagg 1567021 cgtttgccga catcaccaac acggtccgcg cacggatcct cgagatcggc tacgactcgt 1567081 cggacaaggg tttcgacggg gcgacctgcg gggtgaacat cggcatcggc gcacagtcac 1567141 ccgacatcgc ccagggggtc gacaccgccc acgaggcccg ggtcgagggc gcggccgatc 1567201 cgctggactc ccagggcgcc ggtgaccagg gcctgatgtt cggctacgcg atcaatgcca 1567261 ccccggaact gatgccactg cccatcgcgc tggcccaccg actgtcgcgg cggctgaccg 1567321 aggtccgcaa gaacggggtg ctgccctacc tgcgtccgga tggcaagacg caggtcacta 1567381 tcgcctacga ggacaacgtt ccggtgcggc tggataccgt ggtcatctcc acccagcacg 1567441 cggccgatat cgacctggag aagacgcttg atcccgacat ccgggaaaag gtgctcaaca 1567501 ccgtgctcga cgacctggcc cacgaaaccc tggacgcgtc gacggtgcgg gtgctggtga 1567561 acccgaccgg caagttcgtg ctcggcgggc cgatgggcga tgccgggctc accggccgca 1567621 agatcatcgt cgacacctac ggcggctggg cccgccacgg cggcggcgcc ttctccggca 1567681 aggatccgtc caaggtggac cggtcggcgg cgtacgcgat gcgctgggtg gccaagaatg 1567741 tcgtcgccgc cgggttggct gaacgggtcg aggtgcaggt ggcctacgcc atcggtaaag 1567801 cggcacccgt cggcctgttc gtcgagacgt tcggtaccga gacggaagac ccggtcaaga 1567861 tcgagaaggc catcggcgag gtattcgacc tgcgccccgg tgccatcatc cgcgacctga 1567921 acctgttgcg cccgatctat gcgccgaccg ccgcctacgg gcacttcggc cgcaccgacg 1567981 tcgaattacc gtgggagcag ctcgacaagg tcgacgacct caagcgcgcc atctagcgtc 1568041 gagggcgcga gcagacgcag aatcgcacgc ggaaaggctt ccgcgtgcga ttctgcgtct 1568101 gctcggcgct agctgctgat gcggtagtcg ccgaggtcga accgccggct gcgccagtag 1568161 gcttcgaccg tggtggtcgg gcgcaacggg acgtcaccgt tcttgtcgaa gtaatagctg 1568221 ttggccagcc gacaactgtc ctgccagaag acctggcggt gccggcggcg catcacctcc 1568281 gcgaaatagc gagcgttggc ttcttcggtc acctcgatgc gggtggcgcc ggtgcggcgg 1568341 gctcgcttca ggcaccggat gatgtggtgt gcctgcgtct cgatgagcgc gaagtacgac 1568401 gacccgacgt agccgtacgg tccgaacacg gtgaagaagt tcgggtagcc gggaacgctg 1568461 acgccctcat aggcctgcag ccgatgctcg tcccagaacc ggctcaagga cgcaccgcca 1568521 gttccggtga cggcataggt cgggatgctg tcggtgtcta gcaccttgaa gccggtcgcc 1568581 agcaccagca catcgatctc gtggctggcg ccgtcggtgg tggccaccgc agtgggtgtg 1568641 atcttgtcga tcggctcggt gaccagccgc acgttgtccc ggttgaacgt cgacagatag 1568701 gtgttgtgga agccgggccg cttgcacccc accgcgtatc ggggggtgag ttgctcgcgc 1568761 accaccggat cgtggacctg ttggcgcagg tagcgccgtc ccgctgactc catgtgcttg 1568821 gccaacggaa acaccgcgaa gtagtgcgcc gcgatgggga acgttgcttc cacgaaggcc 1568881 tggctgagca gccggtggac ggctttgccg ccgggaatcc gcatcgccca gcggacggct 1568941 gtgggcagtg gaacgtcgaa tttggggaaa caccaaatag gggtgcgctg aaaaacggtg 1569001 aggtgggaga caattggcgc catctcggga atgacctgca ccgccgaggc cccggtgccg 1569061 atgatcccga cgcgcttgcc ggtcaggtcc tgggtgtgat cccagcgtgc ggtgtgcatg 1569121 gtgacgcctt caaacgagtc caccccgtcg atgtcgggta gtttgggcac cgtcagaatg 1569181 ccgcatgcgc tgatcaggaa cctggctgtg atttcgccgc ccgggtccgt ttgcacccgc 1569241 cacaggctgt gctcgtcatc gaactcggcg gcaagcacct tggtgttcaa ccggatccgc 1569301 gaccggatgc cgtatttgtc gacgcagtgt tcggcgtagg ccttcagctc gtgtccgggt 1569361 gcataggtgc gcgaccagtg ccggctctgc tcgaaagaga actgatagga gaaggacgga 1569421 atatccacgg cgataccggg ataggtgttc cagtgccagg tcccgccgac accgtcgccg 1569481 gcttcgacca cgaggtagtc gctgaatccc gcccggtcga gcttgattgc ggcgccgatc 1569541 ccggagaacc cggcgccgac gatcagtgcg tggtagtcgg gcatcatcgc ctcctcccga 1569601 tgacgtgtac tccgtgcttg ggtcgcaggg tcagcgtcgc ctcgagttcg acgtgatagc 1569661 caggggcgag gtcaaaggtg aagtgttgac tcatgattgc cgccatcaaa accatctcca 1569721 tcagggcgaa gctctgtccg atgcagatgc gtcggccgcc accgaacggc aggtatgcgc 1569781 agcgaggacg gtccgtgggg caccgcaaaa accggccagg atcgaatcta tccgggtcgg 1569841 gccaccagcg cgggtcgtgg tgaatgtggt gaatcgggat gacgacggtg gtgccgcggc 1569901 gaattcggtg tccgtcgatg atgtcatcat cgacggcctc gcgcgcgatt atccacaccg 1569961 acgagaagta gcgttgcgat tcctgcaggc acgcggtggt ccaggccagc ttgcccaggt 1570021 cgtcggcggt cgggcggcgc atgcccagca cgtcgtccag ctcggtgagc atgtggtcgc 1570081 gggcctgcgg gttcagcgcc atcagatacc agaaccagga catggcgttg gcggtggttt 1570141 cgtggccggc gagcatgaac gtcagagctt catcgcgtac tcgctggcgg ggccagattc 1570201 cgccgtcggc gctcagcaac acgttgagca ggtccgcgga gttagtcggc tcggccagtc 1570261 gccgatcgat caccgagttg atggcgcgat ccagggtcag cgtgatctct tgcatttccc 1570321 gcaacggcgg cggcagatga acacccgagt agatacacca gatcagcgtg tcgtaaaccg 1570381 tccgcggcat cagcccccac agccccagcc gctccagctt ttccgcccgc cgcaggccgc 1570441 gagtcgcaag atcgtgcatg gactgcacca acggcccgaa gtcctggctg aacagggcgt 1570501 tggcgactac ccgcaatgtc gtctcgacca tgctttggtg catgtcgaac tgcgcgccgg 1570561 gcacccgcgc ggcggtgacg tcggcgattg ggtcgatcat cagaccgacg agtccgcgca 1570621 ggtggcgccg ggcgaaggtc gagtttaacg cgccgcgatg tcttgcccat gagtcgccct 1570681 cgtcggtgag caagttaaga ccggcggtgg cccggatcgg tccgtattcg tcggatttga 1570741 catatttcag gcgggcctcg tgcagcacat ggtcgacgta gtcggggtga ctgatcgaga 1570801 caaaacgtct gccagcacaa cgaaatcggg tgatgtcgct gccgcgtagc cggcccagga 1570861 agccgtcgcc ggcgtcgaat ccgatggtga tggcttcccg ggtcatcgtc caggtgctca 1570921 tccgcttggc cggtcccttc aggggccgct gggtggtggc ggtggccatg acttcactgt 1570981 atggatgacg ctgactggcc cgaaatgaga ctatgggaca aagtgttgtg agtttaggac 1571041 agcctcgtgg gacatctacc gcctccggcc gaggtgaggc atccggtgta tgcgacccgg 1571101 gtgctgtgtg aggtggccaa cgagcgcggg gtgccgaccg ctgatgtgct ggcgggcacg 1571161 gcgatcgagc cggccgacct cgacgatccg gacgcggtgg tcggtgcgct tgacgagatc 1571221 accgcggtgc gccggttgct ggcccgattg cccgacgacg ccggtatcgg gatcgacgta 1571281 ggcagccggt tcgcgctcac ccacttcggg ttgttcgggt ttgccgtgat gtcatgtggc 1571341 acccttcgcg aactgcttac catcgcgatg cgctatttcg cgttgaccac catgcacgtc 1571401 gacatcacgt tgtttgaaac cgccgacgat tgcctggtcg aactggatgc cagccacttg 1571461 ccggccgatg tccgtggatt cttcatcgag cgcgatattg ccggaatcat cgcgacgaca 1571521 acgagtttcg cgcttccgtt agccgcgaag tatgcggatc aagtatcggc cgaactggcg 1571581 gttgacgcgg aattgttgcg cccgttgctc gagcttgtgc cggtgcacga cgtcgcattc 1571641 gggcgcgcgc acaaccgggt gcacttcccg cgtgccatgt tcgacgagcc gttgccgcag 1571701 gccgaccgcc atacgttgga aatgtgtatt gcacaatgcg acgtgctgat gcaacgcaac 1571761 gagcgacgcc gtggcatcac ggccttggtg cgcagcaagc tgtttcgcga ttccgggctt 1571821 ttcccaacgt ttaccgacgt tgctggcgaa cttgacatgc atccgcggac gctgcggcgt 1571881 cgacttgccg aggaaggcac ttcgtttcgg gccttgctgg gcgaggcgcg ctccaccgtg 1571941 gccgtcgacc tgctacgcaa cgtcgggctg acggtgcagc aggtgtccac ccggctgggc 1572001 tacaccgaag tctcgacgtt ctcgcatgcg ttcaaacgct ggtatggcgt tgcgcccagc 1572061 gaatattcgc gccgcgggta gaccagccct tttcagggtt tcgcggcccg cgtcggtttg 1572121 gtcgggttag gcggggccgg gctggccggg cggaccgggt tggccgggct ggccgaacag 1572181 ggttcccccg gtcccgccga cgccgccgcc cccgccgttg ccgggggtgc catcgttgcc 1572241 ggccccaccg tttccgccgg cgccgccgcc cccgccgttg ccgattagga cggcggcccc 1572301 accgtttccg ccggtcccgc cgttgccgcc ggtaccgtcc tcgccggcgg tgccgccctt 1572361 tccgccggtc ccgccggtcc cggcgtcgcc gatcaggccg gcggcaccgc ctcgcccgcc 1572421 ggtcccgccg gcgccgccct tgccgaacac gccgaagccg tcgccgccct tgccgccggt 1572481 gccgccggtg ccgccggtgc cgtagagttg tccgccgttg ccaccggccc cgccggcacc 1572541 accaattccg ccattgccgt cgggcccctg ggcgctgtgc ccgccggccc cgccggtgcc 1572601 gccgtggcca atcagaccgg cggacccgcc ggctccgcca gcaccgccaa gaccgttcgg 1572661 actggtgacg gtcccgcctg cccccccggt gccgccgttg ccaatgagtt gcccaccagc 1572721 gccgccggct ccaccggcgc cgccgccctg gccacggata tcaccgccgt tgccgccggc 1572781 accgccgtcg ccgatcagcc aggcattgcc accggccccg ccggcgccgc taattgcgcc 1572841 ttgcccgccg ttgccgccgg caccgccgtt gccgatgagc ccgccggtac cgccggcacc 1572901 gcccgcgccc ccgaagttat tctccccgac tttgcctcca accccagttt gcccgccgcg 1572961 cccgccggcc ccgccgctgc ccgacaggcc gcggccgtcg ccgccggtgc cgccggcccc 1573021 gccgttgcct cctgagctga cgccggttcc gccctgcccg ccgtgtccgc cggcgccgcc 1573081 gtcgccgtgc agccagccac caccgccacc ggcgccgccg ataccccccg tgccgccgtc 1573141 gccggacttg ccgccaccga cccccccttg cccgccggtg ccgccggacc ctccgctgcc 1573201 ccacaatccg gcggcgccgc cggcaccgcc ggcaccgccg acaccaccgg ggtcgccggc 1573261 caccccgacc ccgccattgc cgccggcccc gccgttgccc cacagccatc cgccggcccc 1573321 gccggcacca ccggacgcgc cggtgccacc caggccgccg gccccgccgt ggccgatcag 1573381 cccggccgcc ccgcccgcac cgccggcctg accggtggcg ccggacccgc cgttgccgcc 1573441 gttgccgtac aggatcccgc cggccccgcc ggcctgcccg gtccccggcg ccccgtcggc 1573501 gccgtggccg atcagcgggc gccccagcag cgccatggtc ggcgcgttga tcgcacccag 1573561 cagctgctgc tcgacgttgg cggcctcggc gctggcatac gcgcctgccg tgctcgtgag 1573621 ggtctgcacg atctgctggt gaaacgccgc cgcctgagtt ctcagcgcct gataggtctg 1573681 cgcgtggccg ctaaacagcg ccgccacggc ggcggacacc tcatcggcac cggcggccag 1573741 cacacgcgtc gtggcggccg cggccgcagc attggccgtg ctgatcgccg agccgatgct 1573801 tgccagatcc gtcgccgccg cgcccagcat ttccggctgt gcaaacaaaa acgacatgac 1573861 cgtccccctg aatcctgtgg gtatgagcag acttgtcgtg atcgtgcagc ataagcgcag 1573921 gtgatatagg ccatcattgg taatgttata gaaacgttat aggtgatctt gaccttgtca 1573981 aattgttcga caaggagtgc ggtcttattg caactttgtt tattaatgtc gcgcggcccg 1574041 cggcctggga cctccgtcgg acagcggcga cacgatgcaa ctatgggggc cgcagcgagg 1574101 tgtcgtcggt gtcatgcccg cggtcggtgc cccggcaccg caaatggtgg tttcagctgc 1574161 tcgaacatgg ggaaatgcca cacgttgagg gttgccaatt gcaggtcctg gacgtcggcg 1574221 gtagcagcta tcagatagtc gccaagtcca atccggttgt ggctgcgacg atatcggcgc 1574281 atcatgtcgc cggcgcggcg tgcgattacc tcggttgctg gctgtacccg aaacgatgca 1574341 agcaggcgcc acacctcgcg ccgttcggcg gtccgcattc cgccgatgag ttcggcggtg 1574401 gacaccacgc tgatcgccag cggtccgtcc ttgcgggcgc tgacaagcca atcgcgagca 1574461 gcaacgacac cccgcaaatg cgcgatcagc acatcggagt cgacaaggat catgaggtgg 1574521 cgcgccacac ctgggcaagg tgctgttcac gaccaccgga gcgacgcacc ggcggatcca 1574581 ggtggcgaag cgtgccgaac gaatcgttta tagcctgcag gtccgatgca aggtcgtccc 1574641 cagcggtggt gagggctcgg ttcagcagga ggcggatcag ctcggcgcgc gaaacacctt 1574701 cttgcgcggc caacttgtcg aggcttgccg tctgctcctc gtcgaggtag atgttggtcc 1574761 gcttcataca ccatatcata catcacaatg tgcggcccgg gcggcaccgc ggcgggcggc 1574821 gattcagccg accgggcatg ccgccgacgt tatgcgtgca acgccctctt cagcgccgcc 1574881 aagccgcggc cggtggcttc ggccgcggcg ggcaccacca gggcgaagtt gacgtagccg 1574941 tgcaccatgg tgggctcgtt gcttagctct acggaaaccc ctgcggccgt gagcaattcg 1575001 gcgtagcaag caccgtcgtc gcgcagcgga tcatgctcgg cggtgccgat gaaggcggga 1575061 ggcaggccgg acaggtcagc gtttcccggg gccagtgtcg tgggcagcat cgtgtgatca 1575121 ctgatgtcca gccccggcac ataccaggcc aggaacgcgt cgatgacgtc acggtccagg 1575181 attggcgcat cggcattttc ggtgaaagac ggcagcgaca ggtcggccat ggtcgtcggg 1575241 taccacagca gctggaacac cagcggcggt ccgccgacat cccgggccaa ctgcgccatg 1575301 accgccgaga tgttgccgcc cgcagagtca ccggccacgg cgatccggct cgggtcaccg 1575361 cccagttcgg cggcgttttc gccgacccag cgcaatgccg cccagctgtc gtcgatcccg 1575421 gccgggtagg gatgttccgg ggcaagccgg tagtcgacgg acaccacgat ggcctgcgcg 1575481 ccgacggcgt gggcgcgggc gacggggtcg tgggtgtcca gaccgccgag cgaccagccg 1575541 ccaccgtggt agtagacaac cacgggcagg ttgtcgcgaa cgaccggcgg ccagtagacg 1575601 cggaccggaa tgtcggtgag cccgtcgtag ccaacggtcc gttcctcgat ccgtagctcc 1575661 ggcagcaact ccgggggtgt cttcagctgg cggagccgcg cgcgggcgac ttcgacaccg 1575721 tcggccgcgg tgaaggtcac cggaaaggta tcgagcagca tcttcagcac gggatcgata 1575781 tcaggccggg cgacggtcgg ctctgtcatg ggcctaccgt acgaccgcca ggcctatccg 1575841 tgtagcacaa cccgtagcgc caccagccca cggttggtgg cctcggtggc ggcgggcacc 1575901 acaccggcat agccaacgta gccgtgcacc agcgtctggg cgttgtgcac ctcgacggga 1575961 acaccggcgg cggccagcag ctcgccgtac cgaatcccgt cgtcgcgcaa agggtcgtag 1576021 ccggcgacag cgatgtaggc cggcggcagg tcggccaggt tctccgctcg gccgggcgcc 1576081 attggcgctg gcgggttgtg caagtcgatt tcgcctgcgt accaacggga gaacgcggca 1576141 attgccttga cgtcgaggat cggtgcgtcg gcattctcgg ccaacgacgg cagcgattgg 1576201 tcccacagag tggagggata ccacaacagc tgaaacacaa tgggcgggcc gcccatatcg 1576261 cgggctcgct gcgcgatcac cgcggcgatg gtgccgccgg cggaatctcc ggcgacggcg 1576321 atgcggccga ggtcagcacc gacctggcgg ccatgctcgg cgacccaccg cgttgcggcc 1576381 caagcatctt cgatggcagc ggggtagggg tgctcaggcg ccagccggta gtcgacggac 1576441 acgacaatcg cgtcagcgcc gacggcgtgc tggcggcagg tgccatcgtg cgtgtcgagg 1576501 tcgcccatga cgaatccgcc gccatggaaa tacagcacaa cgggcgcctc ggcttgatcg 1576561 ggacacgttg gcggccaata gatccgggtc ccgatcggcc ccgccggtcc atcgatcgca 1576621 aggtcaacga cccgcagctc ggggtgcacc ggctggcgcg gtagatcgcg caaccgctgg 1576681 cgcacggcct cgatcccatc gtcgatcgat agccgaaacg gaaccgcatc cagtaccttc 1576741 agcaggatgg ggtcgatcgc gggtttctcg tcggcggtgt tgtccaaact gggcataccg 1576801 gtaccgtacg cacctcgctt gctggccggc ggctgggtgg tcgccggctg ggcgggcctc 1576861 gcctacggcg tgtacttgac cgtgatcgca ttgcgcttgc caccgggcag cgagttgacc 1576921 gggcacgcga tgttgcagcc cgcgttcaag gcatcgatgg cggtgctgct ggccgcggcc 1576981 gcggttgccc atcccatcgg ccgcgagcgg cggtggttgg taccggcgct gctgttgtcg 1577041 gccaccggcg actggttgtt ggcgatcccc tggtggacgt gggcgttcgt gttcggcttg 1577101 ggggcattcc tgttggcgca cttgtgcttc attggtgccc tgctgccact ggcgcggcag 1577161 gcggctccat cgcgtggccg ggtcgctgcc gtggtggcga tgtgcgttgc gtccgcgggg 1577221 ctgctggtgt ggttctggcc gcacctgggg aaggacaacc tgaccatccc ggtcacggta 1577281 tacatcgtcg cgctgtcggc gatggtgtgc accgcgttgc tggcacggct gccgacgatt 1577341 tggaccgcgg tcggggcggt gtgtttcgcc gcgtcggact cgatgatcgg cattggccgg 1577401 ttcatcctcg gcaacgaggc gttggcggtg ccgatctggt ggtcctacgc cgcagccgag 1577461 atcttgatta cggccgggtt cttcttcggc cgcgaggttc ctgataacgc cgcagcacct 1577521 acggatagct agcggaccgg ttgtctagca gcggatctcg cggtcaagcc cgcacgcccg 1577581 tcgaagtaga gccgatcgcg cgggtgctgc cgatgttgtc ggtgccgcac ctggaccgcg 1577641 acttcgacta cttggtgccc gccgaacact ccgacgatgc ccagccgggg gtgcgggtac 1577701 gggtgcggtt tcacggtcgg ctggtcgacg ggtttgtcct agagcgccgc agcgacagcg 1577761 atcaccacgg caagctgggc tggctggatc gtgtggtgtc gcccgaaccg gtgctcacca 1577821 cggagatccg ccggttggtc gatgcggtgg cggcgcgcta cgccgggacc cgccaggacg 1577881 tattgcggct cgcagtgccc gcccggcacg cacgggtgga gcgggaaatc accacggccc 1577941 cgggtcggcc ggtggtagcg ccggtcgacc cgtcgggttg ggcggcctac ggtcgcggtc 1578001 ggcaattcct ggccgcgctg gccgactcgc gcgctgcgcg ggccgtttgg caggcgctac 1578061 cgggcgagct gtgggcggac cgattcgccg aggctgccgc gcagaccgta cgtgccgggc 1578121 gcacggtact ggcgatcgtg cccgatcagc gggatctgga caccctgtgg caggccgcga 1578181 cggccctcgt cgatgagcac agtgtggtag cactgtcggc cggcctgggc ccggaggcac 1578241 gctatcggcg ctggctggcc gcgttgcggg gcagcgcgcg gctggtgatt ggcacccgca 1578301 gcgcggtgtt cgcgccgttg agcgagctgg gcctggtcat ggtctgggcc gacgccgacg 1578361 actccctggc tgagccgcgg gcaccctatc cgcacgcccg tgaggtggcg atgctgcggg 1578421 cgcatcaggc gcggtgcgca gcgctgatcg gcggctacgc ccgcacggcc gaggcccacg 1578481 cgctggtgcg tagcggctgg gcgcacgacg tggttgcacc ccggccggag gtgcgtgcac 1578541 gctctcctcg cgtggttgcc ctcgacgaca gcggatacga cgacgcgcga gacccggccg 1578601 cccgcaccgc acggctaccg tccatcgcgc tgcgcgccgc gcgctcagcg ctgcagtccg 1578661 gggcgccggt gctggtgcag gtgccgcggc gcgggtacat cccctcgctg gcctgcgggc 1578721 gctgccgggc gatcgctcgt tgccggtcgt gcacgggtcc gctatcgctg caaggcgccg 1578781 gctcgcccgg tgcggtatgt cgctggtgtg gacgggtgga cccgacactg cgatgcgtgc 1578841 gctgtgggtc ggacgtggtg cgtgccgtgg tggtgggggc ccggcgcact gccgaagagc 1578901 tcggccgggc attcccgggt acggcggtga ttacgtcggc cggcgacacc ctggtgcccc 1578961 agctcgacgc cggcccagcc ctggtggtcg ccactccagg agccgaaccc cgggcgcccg 1579021 gcgggtatgg ggcggcgctg ctgctggata gctgggcgct gctgggccgt caagacttgc 1579081 gcgcggccga ggacgcgctg tggcgctgga tgacggcggc cgccctggtt cggccgcgcg 1579141 gggcgggcgg tgtggtgacc gtggtcgccg aatcgtccat tccgacagtg caatcgctga 1579201 tccggtggga tccggtcggt cacgcggagg ccgaactggc agcccgaacc gaagtcggcc 1579261 tgccgccaag tgtgcacatc gctgctcttg acggccctgc cggcaccgtg acggcattgc 1579321 tggaggcggc tcggctgccc gacccggatc gcctccaagc cgatctgctg ggcccggtgg 1579381 acctgccacc cggcgtccgt cgcccggcgg gcatccccgc cgatgcgccg gtcatcagga 1579441 tgttgctgcg ggtgtgccgc gagcagggcc tggagttggc ggcgagtctg cggcgcggca 1579501 tcggtgtgct cagtgcgcgg caaacccggc aaacccgtag cctggttcgg gtacagattg 1579561 acccgctgca tatcgggtaa acggagtaac cgctagctca acacttccgg gcggtgaaga 1579621 taaggtattc ccactgcatc acgccgtcgc agaggtattc gcgacaaagt tcggtgattt 1579681 cggcgtcgag tgtggcgacg cactcggggc tgtcggcgat ggagcggtag gcgttgatcg 1579741 ccgggccgta gaaattcttg aaatagtcgc gacattcgtc cgggcaaccg aaccggtcca 1579801 ctgtcagcga tcctcgccgg gtacggatgt cggacacatg gtcgcgaaac aggccactca 1579861 cgtaatcctc gcttccccac cacacctcgt gcggcgctcc cgccggcagc gtcggccggt 1579921 acggtctgat ggtggacagc aatttgccgt agaaaccctc gggggtccag ttcagggtgc 1579981 tgatcttgcc gccgcgccgg cagacccggg ccagttcgtc ggcggtgcgc tgatgacgcg 1580041 gggcgaacat caccccgatg gtcgagagca ccgcatcgaa ttcgccggcg ctaaacggga 1580101 gggcttctgc gttggcttcc cgccagccga gctccagtcc ggctgccgca gcacgcgcct 1580161 gggcgcggcg cagcagctcg ggcgtcaggt cgctggcagt gacgtgggca cctgccatgg 1580221 ctgccgggat cgatacgttg cccgagcccg cggccacgtc aagcacgcga tcgccgcggc 1580281 gaataccgct ggtggagact aggattgggc caagcggggc caacagctcc tcggcgatgg 1580341 cggcgtagtc gcccaatgcc cacatttgcc gatgcgtggt cgccggcgcc tggcgctcgc 1580401 tggtgggtgt gtagacagtc atcggaactc ctgcgagacg tcgggtgagg ctggtaccga 1580461 attgtgtcag cagacaacag tatacgttct aaataatcaa tgtcgacgat ggtcagatgc 1580521 tagactttcc tgacttaccc gcacggtgta cgacgaagtt gacgccgggg acggccccgg 1580581 gaaaggggta atgatgccaa cggaatatcc ggcgacagcc gaggaatccg tggacgtgat 1580641 caccgatgca ttgctgacgg cgtcccggtt gctggtagcc atctcggccc attcaatcgc 1580701 tcaggtcgat gaaaacatca ccatcccgca gttccggacc ctggtgattt tgtctaatca 1580761 cggtccgatt aacctggcta cgctggcgac gttgctgggt gtgcaaccgt cggccaccgg 1580821 ccgcatggtc gaccggttgg tcggcgccga actgatcgac cggttaccgc accccacctc 1580881 tcgacgggag ctgctggcgg cgctgaccaa gcgtggacga gatgtcgtcc gtcaggtcac 1580941 cgagcaccgg cgcaccgaga tcgcccgcat cgtggaacag atggcaccgg cggaacgcca 1581001 tgggctggtg cgtgccctga cggcgttcac cgaggcgggc ggtgagcccg acgcacgcta 1581061 cgaaatcgag tagctagcgg ccgagcccgt gtcgggccgt ccgttacgtg ctgggacgac 1581121 ccgacacagg ccggattgcc cgcctcagcg cttttcggcg gtgagcagca ggtactccca 1581181 ttccatgaca ccgtccgaca ggtattgcgc tgcgagttcg acaagctggc ggtcgagctc 1581241 ggcggccagc accgcgttgt caccgatgtg cgcgtaggcc tcgatcgtcg ggccatagtt 1581301 gttcttgaag tagtcgtgga cggcctgggc ggtgtcgaac cgcttcactt ccaacaagcc 1581361 acgggccgtc ttgaggccag tgactccatc gcccagcaga ccagtgacat aggcctcacg 1581421 tccccacaac gccgacggcg gcagatccgc cgacacgctg ggccggtatg gcctaatggt 1581481 tgccagcatc cggccgaaga atccctcgca cgtccagctg atcacaccga tcgtcccgcc 1581541 aggccggcag acgcggacca gctcgtcggc cgcggcctga tgatccggtg cgaacatcac 1581601 gccgatcgct gagatcaccg tgtcgaactc gtcgtcggca aacggcaggg cttgcgcgtt 1581661 ggcttcctgg tattgcaggg tcagcccctg ttgggcggcc ctggcctggg accgctgcag 1581721 cagctcgggc gtcaggtcgg tggaaatgac cgtggcaccc gtcttggctg cgggcagcga 1581781 aatattgcca gagccagcgg cgacgtcgag cacccgaaca cccggcccga tgcccgcggc 1581841 ggcaaccagg atcgggccga gtggcgccat cacctcttct gccatcaggg cgtagtcacc 1581901 cagggcccac atcgcccggt gtgtggccgc aagcgtttgg tcctcgcgag caggtgtgtc 1581961 gatagtcatc aggtctcctg agaagtaagt gatgtggctg cgaacttcga catcgttgtc 1582021 gcgggcacgg cgggagcctg ggcagtagcg tgccttgcgt acccaccgga tacagtatgc 1582081 atcagaaata gtgtattcct ctaactatcg cgcgtgtcgg aattgtggcc cacgccacgt 1582141 cggcggcgct tcttagactg ggcgcgtgcg ccttgtcttt gccggcaccc ccgaacccgc 1582201 gctggcctcg ctgcgcaggc tcatcgaatc gcccagtcac gacgtgatcg ccgtgttgac 1582261 ccgtccggat gccgcctccg gccggcgggg caagccgcag ccgtcaccgg tggcccgtga 1582321 ggcggcagag cgcggcattc cggtgctgcg gccatcgcga ccgaactcgg cagagttcgt 1582381 cgccgaactg tcggatctgg cgccagagtg ctgcgccgtg gttgcctacg gagccctgct 1582441 cggcggtccc ttgctggccg tgccgccgca tggctgggtc aacctgcact tctcgctgct 1582501 gccggcctgg cgtggcgcgg cgccggtgca ggccgccatc gccgcgggag acacgatcac 1582561 cggagccacg acgttccaga ttgagccaag cctggactcg ggaccgatat acggtgtcgt 1582621 caccgaggtg atccagccga ccgacaccgc gggcgatcta cttaagcgac tggcggtatc 1582681 gggggcagcg ctgctatcga ccacgctgga tggcatcgcc gatcagcggc tgacgccgcg 1582741 gccgcaaccg gcagacgggg tcagcgtggc gccgaaaatc accgtagcga atgcccgggt 1582801 gcgatgggac ttgccggcgg cggtcgtgga gcggcggatc cgcgccgtca ctcccaaccc 1582861 cggcgcctgg acgctcatcg gtgacttacg ggtcaaactt ggaccggtgc acctcgacgc 1582921 cgctcaccgg ccatcgaagc ccttgccgcc cggtggaatc cacgtggaac gcacgagcgt 1582981 gtggatcggc accggctcgg aaccggtgcg gctgggccag attcagccgc ccggcaagaa 1583041 actcatgaac gcggccgact gggcgcgggg cgcacggctg gacctggccg cacgggcaac 1583101 atgaccccta gatcgcgtgg gccgcgccgc cggccgctgg acccggcgcg tcgtgcggcc 1583161 ttcgagacgc tgcgggcggt tagtgcgcgc gacgcctacg cgaacctggt gttgcccgcg 1583221 ctgctggccc aacgcggtat cggcggtcgc gacgccgcgt tcgccaccga gctgacatac 1583281 ggcacctgcc gagcccgcgg cctgctcgac gcggtcatcg gtgcggccgc cgagcgttcg 1583341 ccgcaggcga tcgatccggt gctgctagac ctgttgcggc tcggcaccta ccaattgctg 1583401 cgcacgcggg tcgacgcaca cgccgcagtg tcgaccaccg tcgagcaggc cggaatcgaa 1583461 ttcgattcgg cgcgagcagg tttcgtcaac ggtgtactac gaacgatcgc cggccgagac 1583521 gagcggtcct gggttggcga actcgctcct gatgcgcaga acgatccgat cgggcatgcc 1583581 gcgttcgtgc atgcgcatcc ccgatggatc gcccaggcct ttgctgacgc gttgggcgcg 1583641 gcggtcgggg agctcgaggc agttttggcc agcgacgacg aacggccagc ggtgcacctg 1583701 gcggcacgcc ccggggtgct gaccgccggc gaactggccc gcgcggtgcg cggaaccgtc 1583761 ggtcggtatt cgccgtttgc ggtgtatctg ccgcgcggtg acccggggcg actggcgccg 1583821 gtgcgcgacg gccaagcgct ggtccaggac gagggcagcc agttagtcgc ccgagcattg 1583881 accctggcgc cagtcgacgg cgataccgga cggtggctgg acctgtgtgc cggaccgggc 1583941 ggcaagaccg cgctgttggc cgggctgggt ttgcagtgcg cagcccgggt gaccgcggtg 1584001 gaaccctcgc cacaccgcgc ggacctggta gcacagaaca cccgcgggct gccggttgag 1584061 ctcttgcgtg tcgacgggcg gcacaccgac ctcgacccgg gtttcgaccg ggtgctggtg 1584121 gatgcgccct gcaccgggct gggcgcgtta cgccgtcggc cggaggcccg ttggcgtcgt 1584181 cagccggcgg acgtagcggc actggccaag ctacaacgcg agttgttgag cgccgccatc 1584241 gcgctgactc ggcccggcgg tgtcgtgctc tatgccacat gctcgccgca cctggccgag 1584301 actgtgggtg ctgtcgccga cgcgctacgc cgacatccgg ttcacgcgct cgatacccgc 1584361 ccactgttcg agccggtgat cgcggggctg ggggaggggc cccacgttca gctgtggccg 1584421 caccggcacg gtaccgacgc catgttcgcc gcggcgttgc gccgcctgac gtgaggttcg 1584481 ccgcagcggc tcagtaatgt gtcgctcatg gccggtagca cggggggacc gctgatagcg 1584541 ccgtcgatcc tagccgctga tttcgccaga ctcgcggacg aagcggccgc ggtcaacggc 1584601 gccgactggt tgcatgtaga cgtgatggac ggtcacttcg tgccaaacct gaccatcggc 1584661 ctgccggtgg tggagagcct gctggcggtc accgacatcc cgatggattg ccatctaatg 1584721 atcgacaacc cggaccggtg ggctccgccg tatgccgagg cgggcgccta caacgtcacc 1584781 ttccacgcgg aggccaccga caacccggtc ggcgtggccc gcgatatccg ggccgcgggg 1584841 gccaaagccg ggatcagcgt gaagccgggg accccgctgg agccatacct ggacatcctg 1584901 ccccatttcg acaccctgct cgtcatgtcg gtagagcctg gcttcggtgg ccagcggttc 1584961 attcccgagg tgctgagcaa ggtgcgtgcg gtgcgcaaga tggtcgacgc gggcgagctg 1585021 acgatcctgg tcgagatcga cggcggcatc aacgacgaca cgattgagca ggctgccgag 1585081 gccggcgtcg actgctttgt cgccggatcg gcggtgtacg gcgccgatga cccggccgcg 1585141 gcggttgcgg cactacggcg acaggccggt gccgcctcac tccacctgag cctatgaacg 1585201 tggagcaggt caagagcatc gacgaggcta tgggtctcgc catcgagcac tcctaccagg 1585261 tcaaaggcac gacttatcca aaacccccag tgggggccgt cattgtggat cccaacggtc 1585321 ggatcgtcgg cgccggcggc accgagccgg ccggtggcga tcatgccgag gtggtggcgc 1585381 tgcgccgggc cggcggattg gctgccggcg ccatcgtggt ggtcaccatg gaaccctgta 1585441 accactacgg caagactccg ccatgcgtga acgctctgat cgaagccagg gtggggacgg 1585501 tggtctacgc cgtcgccgac ccgaacggga tcgctggggg tggcgcgggc cggctgtcag 1585561 cagcgggcct acaggtgcgg tccggggtgt tggctgaaca ggtggcggcc ggaccgctgc 1585621 gggagtggct ccacaagcaa cgcaccggtc tgccgcatgt cacctggaag tacgccacca 1585681 gcatcgacgg ccgcagcgcc gccgccgacg gctccagcca gtggatctcc agcgaggccg 1585741 cacgcctgga tctgcatcgc cgccgcgcca tcgccgacgc gatcttggtc ggcaccggca 1585801 ccgtcctcgc cgacgacccg gccctgaccg cgcggctggc cgacggctcg ctggcgccgc 1585861 agcagccgct gcgcgtggtg gtgggcaagc gcgacatacc gccggaagca cgggtcctca 1585921 acgacgaggc acgcaccatg atgatccgca cccacgaacc tatggaggtg ctcagggcgt 1585981 tgtcggatcg caccgacgtg ctgctggaag gaggtcccac cctcgccggc gccttcctac 1586041 gagcgggtgc gatcaaccgg atcctggcct acgtcgcacc gatcctgttg ggcggtccgg 1586101 ttaccgcggt cgatgacgtc ggggtgtcca acatcaccaa cgcgttgcgt tggcagttcg 1586161 acagcgtcga aaaggtcgga ccggatctgt tgctgagctt ggtggctcgt tagagcggct 1586221 ccacttgggg cgccagggtc ggttgctcct ggacttccgg ttcatcggca tgttccttgc 1586281 ggccgctgat caacagacct agcaccgcgc cgaacacgca cacgatcgcg gtgatggtga 1586341 atatctcgcc gtacatcagc gcgaacgcct gctggtaccg ggctccaatt gcggccgcgc 1586401 gctcgagcag gctggcgttg ggcgggatgg ccgccgacaa ccccgccagg atctggttga 1586461 accggtacaa cccccaggcg ctcagcgcgg ccacgccgat caacatgccg gtcatccggg 1586521 cgaccaccac cgccgccgaa gcgatgccgt gctgggccga cgggacaacc cgtagggtgg 1586581 ccgacgatag cggcccgatc accagcccca accctaaacc agccaccacc aggtcggtgt 1586641 gcatcgccgg cacggtgaac aatccgagga tgttgtgccg atcggccaac aggtccaccg 1586701 gccagtggga aataagccag taaccgtacg ccgcaataag cagtccggca aaggccaccg 1586761 cacggtcacc ggccctggtg gcgatccacc cgcccgtcac tgccccgatc ggtagggcga 1586821 taaggaacca cagcagcatt ccggccgcct gagcctggtc catctgcagc acgccctggc 1586881 cgaacagctc gacatcaacc agcgtcacca tcagcgccgc gccggcggcg acggaggcac 1586941 ccagcgcgga caggaacggc cggaagtgca caccggccgg gtcgatcagc cgggtgcgag 1587001 cgaaacgttc ccaaccgaag aacgccaccg cggcaacgag agcgccgacc agcaacggag 1587061 ccccgtagtc cggcagtacg tgtttgccgt cgggattggg gttgtacagc ccgatgacgg 1587121 cgaggcccaa cgcgagtgcc agcagcagac caccgaccag gtcgactcgc tcgggctccg 1587181 tgctgcggtc gtgtgagggc aggctgaagt ggatcattac catggcgatc gcggtcaacg 1587241 ggacgttgat ccagaacacg tcacgccagt cgtgcaatag ccaaacgatg aagattccgt 1587301 acaacgggcc cagaacgctg ccgagctcct gcgcggcgcc gataccgccg agcacgccgg 1587361 cgcggttgcg ctgcgaccac aaatcggcgc ccagcgccag cgtgatcggc aatagcgcgc 1587421 cgctggcaac accctggatc gtgcggcccg cgatcagcat gtggaaatcg ccaaaatgcc 1587481 cggccagcgc ggtcactacc gagccgatga tgaacccggc caggctgacc tgcagcatca 1587541 gcttgcgccc gaatcggtcg gaagcccggc ccagcaacgg catggcggcg atgtagccca 1587601 ggaggtacat cgtgacgatc caggtgatcc ggtggagttg gttgatcggt ataccaacgc 1587661 tgttcatgat gtcgcgcatg atggtgacca cgacataggt gtccagggcg cccagcagta 1587721 ctgccaggct gcccgcgcta atcgcgactc gacgtcctgc tcgcatgctg atcagctcac 1587781 cgggggcttc gtgacctgga ccttctcgcc ccatttcgac aaggtcatct ggacggaatt 1587841 gcccgagccg cggtccaact gggcctgtgc cagttgatga tcgccggtct cctgaatcca 1587901 gacggtcgcc ggcaccggct gcgtcgcgtt gaacggcggc gctatctggt tcaccgcctg 1587961 tgccgatacc ttcccgctga tgcggatggt gttctggccg ttgatggtat cccgcccttc 1588021 ggcttttgcg tcggcgaaat tcgccagcac gttggccagg ccggtatccg gattcagcac 1588081 ctgggcgggg tcgtagatgt cggcggcggg accgaaatcg ctccactggt tgggcgtcag 1588141 ggtggcgtac aggatcccgt cgaacaccac gaagtcggca tcgatatcag acccacccag 1588201 cgtgagcttg acgtttcccg tcgcggcggt ggggttggtg gtgagatcgc cgctcagcgt 1588261 cttcagagac agtcccggga tcttgccgtt gaccgtcagc accatgtgcg cgctcttgag 1588321 agccttggtc tgcgcggtgg cctcctcgac cagcggcttc gcgtccggaa gtggtccgcc 1588381 gcttggcttc gagcccgacg agcagccggc aacgacagtg gcggcgatgc taacggcggc 1588441 gaggacggcg atgcgacggc agtggcgtct gggggtccgc ataccctgca tcgtagaggg 1588501 tgtctgtgag ttggccggtc ggcgagtggg gtgcgggtcc gcgggattgc tgcctaacct 1588561 ggtgcgatgt tcaccggaat tgttgaggaa cgcggagaag tgaccgggcg tgaggccctg 1588621 gtcgatgcgg cgcggctgac catccgcggt ccgatggtta ccgccgacgc cggccacggc 1588681 gactcgatcg ctgtcaacgg cgtgtgtctg acggtcgtcg atgtattgcc cgacggccaa 1588741 ttcaccgccg acgtgatggc cgagacactg aaccggtcca acctgggtga gctacggccc 1588801 ggcagccggg tgaacctgga acgcgccgcg gcgctgggca gccggctcgg cgggcacatc 1588861 gtgcagggac atgtggacgc caccggtgaa atcgtggcgc gttgtccctc cgagcactgg 1588921 gaagtggtgc gcatcgagat gccggcttcg gtggctcgct atgtcgtcga aaagggctcg 1588981 atcaccgtcg acgggatttc tctgacggtc tccgggctcg gcgccgaaca gcgggactgg 1589041 tttgaggtct cgctgatccc gacgacccgg gagctgacca cgctggggtc cgctgcggtg 1589101 ggaacccggg tgaacctcga agtcgacgta gtcgcaaagt atgttgagcg gttaatgcgg 1589161 agcgccggct gacatcgctc gccgagggag ggagccccat gtcttgcatt ccggacgaga 1589221 tcgatacgcc cgacgtgctg atcgaccgcg acatccttga ccgcaacatc gggcgaatga 1589281 gttccgccgt cgccgcgaaa gggatcgccc tgcgtcccca cgtgaagacg cacaagctgc 1589341 ctgagatcgc ccatatgcaa ctccgcgcgg gcgcgcggcc tgacggtggc caccatcggg 1589401 gaagtcgagg tattcgtcga ccacggcgcc gacgacgtat tcatcaccta cccattgtgg 1589461 atcggcacac gccaagccga ccggctccgt cagctggctg accgcgctcg catcgctgtc 1589521 ggtgcgggca ccgccgaggg cgcttcgaac accggcgcac ggctcgcaga cgccgctggc 1589581 gcgatcgatg ttctcatcga aatcgacagt ggccatcacc gcagcggcgt ccgtgccgaa 1589641 caagtgttgg aggtcgccca cgccgtcggt gaggctgggc ttcacctggt gggggtgttc 1589701 accttccccg gtcacagtta tgcgccaggt aaacccggcg aagccggcga gcaagagcgg 1589761 cgcgctctca acgacgcggc gaacgcgctg gtcgcggtgg gcttcccgat cagctgccgc 1589821 agcggtgggt ccactcccac cgcattgctc accgccgcgg acggggcctc cgagacgtcc 1589881 cggcgtctat gtgctcggtg acgcccagca actggaactc gggcgctgcg cgccggcgga 1589941 catcgcgctg accgttgccg ccaccgtagt gagccgccag gactgcaggt ccggcttgcg 1590001 ccgaattgtc cttgactgcg gtagcaagat tctcggcagc gatcgtccgg cctgggcgac 1590061 tgggttcggc cgtctgatcg accacgccga tgcgcgcatc gcggcgctgt cggagcatca 1590121 cgccaccgtt gtctggcccg acgacgcccc gctcccgccg gtgggaacac gtctgcgggt 1590181 gattcccaac cacgtgtgcc tgaccaccaa cctcgtagat gatgtcgccg tggtgcgcga 1590241 cgcaaccctg attgatcgct ggaaagtcgc cgcccgcggt aagaaccatt gatcctgtcg 1590301 cacttggtca cggcaatacc gcctggctca atggttcata ctgaatggaa cacgtgggct 1590361 tcgcgtgcgg ccaggcctga cagctaggta gcaaagatga cgaggttgga ctccgtcgag 1590421 cgggcggttg ccgacattgc ggcgggtaag gccgtcatcg tcatcgacga cgaagaccgg 1590481 gagaacgagg gtgacctgat cttcgccgcc gagaaggcaa cgccggagat ggtggccttc 1590541 atggtccgct acacctccgg atacctgtgc gttccgctgg acggtgccat ctgcgaccgg 1590601 ctgggcctgt tgcccatgta cgcggtgaac caggacaagc acgggacggc atacaccgtc 1590661 acagtcgatg cacggaatgg cattggaact ggcatttcgg cgtccgatcg ggctaccacc 1590721 atgcggttgc tggccgatcc gaccagtgtg gccgacgatt tcacccgccc cggtcacgtg 1590781 gtccccttgc gggccaagga tggtggggtt ctgcgccggc ccggccacac cgaggccgcc 1590841 gtggacctgg cccggatggc cgggctgcaa cccgcggggg cgatttgcga gatcgtcagc 1590901 caaaaagatg agggctcgat ggcgcacacc gatgaattgc gggtgttcgc cgatgagcac 1590961 ggtctggcgc tgatcaccat tgctgacttg atcgaatggc ggcgcaagca cgagaagcac 1591021 attgagcggg tcgccgaggc gcggattccg actcgtcatg gggagtttcg cgccatcggc 1591081 tacaccagca tctacgagga cgtggaacat gtcgcgctgg tccgcggcga gatcgccggg 1591141 cccaacgccg acggtgacga cgtgctggtc cgggtgcatt cggagtgctt gaccggcgat 1591201 gtgtttgggt cacgccgctg cgattgcggg cctcagctgg acgccgcgct ggcgatggtc 1591261 gcccgtgagg ggcgcggcgt ggtgctgtac atgcgtggcc acgagggccg cggcatcggc 1591321 ctgatgcaca aactgcaggc ctaccaactg caggacgccg gtgccgacac cgttgacgcc 1591381 aatctcaagc ttggactacc tgccgacgca agggattacg ggatcggcgc acagatcctg 1591441 gtcgatcttg gggtacgttc gatgaggctg ctgaccaaca acccggccaa gcgggtggga 1591501 ctggatggat acggattgca catcatcgag cgcgtgccgc tgccggtgcg ggccaacgcg 1591561 gagaacatcc gttacctgat gaccaagcgt gacaaattgg ggcacgactt ggctgggttg 1591621 gacgattttc acgaatccgt gcatctgccc ggagaattcg gcggtgcctt gtgaagggtg 1591681 gcgccggggt gccggatctg ccgtcgctgg atgcgtctgg tgtgcggctg gcgattgtcg 1591741 ccagcagctg gcacggaaag atctgcgacg cgctgttgga cggcgcccgc aaggtggccg 1591801 ccgggtgtgg cctcgatgac ccgactgtgg ttcgggtgct cggcgcgatc gagattccgg 1591861 tggtggcgca ggaattggcc cgcaatcatg atgccgtcgt cgcacttggc gtcgtgatcc 1591921 gcggtcagac accacatttc gactacgtgt gcgatgcggt aacccaggga ctgacccggg 1591981 tatcgctgga ttcctcgacg ccgatcgcca acggcgtgct gaccaccaac accgaggagc 1592041 aggcgctgga tcgggcgggg ctaccgacgt cggccgagga caagggcgcc caggcgactg 1592101 tggcagccct ggccaccgcg ttgaccctgc gcgagctgcg cgctcactcg tgaccgccgc 1592161 accgaacgac tgggacgtcg tgttgcgtcc tcactggacg ccgttatttg cctacgctgc 1592221 agcgtttctg atcgcggtag cgcacgtcgc ggggggcctg ctgctcaagg tcgggtccag 1592281 tggcgtggtc ttccagaccg ctgatcaggt ggcaatgggt gccctggggc tggtcctcgc 1592341 cggggcggtg ctactgttcg cgcggccgcg gctgcgggtg ggttctgccg ggctttcggt 1592401 gcggaatctg ttgggtgaca ggatcgttgg gtggtctgaa gtgatcggtg tgtcgtttcc 1592461 cggcggtagc cggtgggcgc ggatcgacct ggccgacgac gagtacatcc cggtgatggc 1592521 gatccaagca gtggataagg accgcgccgt ggccgccatg gacacggtgc gctcgttgct 1592581 ggctcgatac cggcctgacc tgtgcgcccg ctgaagcgac ttcccgtacg atcgcgaaat 1592641 ggcatgtctt gggcgccctg gctgtagggg ttgggcgggg gcgagcttgg tccttgtggt 1592701 ggtgttggcc ctggctgctt gcaccgagtc ggtagcgggc cgcgcgatgc gtgctaccga 1592761 ccggtcgtcc gggctgccca catccgccaa gccggcgagg gcgcgcgacc tgctgctgca 1592821 ggacggggat cgcgctccgt tcggccaggt aacccagtct cgcgtcggcg acagctactt 1592881 caccagcgcc gttccacccg agtgctcggc ggcgctgctg ttcaaaggtt ccccgctgcg 1592941 gcctgacggc tcgtcggacc acgccgaggc ggcttataac gtcaccggtc cgctgccgta 1593001 cgcagagtcg gtcgatgtct acacgaatgt cctgaacgtc cacgatgtgg tctggaacgg 1593061 gttccgcgac gtgtcccact gccgtggcga tgccgtcgga gtgagccggg ccggcagatc 1593121 gacgcccatg cgactcaggt acttcgctac gctgtcagac ggtgtcctgg tatggaccat 1593181 gagcaatccg cgctggacgt gtgattacgg attggctgtg gtcccgcacg cggtgctggt 1593241 gttatcggcg tgtggcttca agcccggatt ccccatggcg gaatgggcgt cgaaacggcg 1593301 ggcccaactg gacagccagg tttaacgcca gcccccatgc tcttcgcggg cgggtttgaa 1593361 ccggccaaac ggggtcaaag tcacggcggc ctgggcatac tcaaatgtgt cccacggccc 1593421 accatcggat cccgacgacg gcccactgtg aactgcgccg ctcgtggtgc attacccaga 1593481 ccacgatgag agaatggcgg ggaaatgggt gaattacggt tggtgggcgg tgtgctccgg 1593541 gtccttgtcg tggtcggtgc ggtgttcgat gtggcggtgc taaacgccgg tgcggctagt 1593601 gccgacggcc cggtccagct gaagagccga ttgggcgatg tttgcctgga cgccccgagt 1593661 gggagctggt tcagcccgct ggtgatcaac ccctgcaatg ggaccgactt tcagcgctgg 1593721 aatctcaccg atgaccggca ggtcgagagc gtggccttcc ccggggaatg cgtgaatatc 1593781 ggaaatgctt tgtgggcgcg cctgcagccc tgtgtgaact ggatcagcca gcactggact 1593841 gtccagcccg acggcctggt caagagtgat cttgatgcct gcctcacggt tctcggcggt 1593901 ccggatcctg ggacctgggt gtccacccgc tggtgcgacc ccaatgcacc cgaccaacag 1593961 tgggatagcg tgccgtaacc ggcctgcccg gcgaaccccc gcctttctgg gcgccgtcga 1594021 agcgaccact agcctagata cgtgccagat cccgcaacgt atcgccccgc gcccgggtcc 1594081 atcccggtcg agccgggcgt gtaccgattc cgggaccagc atgggcgagt catctacgtc 1594141 ggcaaggcca agagcctgcg tagccggctg acgtcctatt ttgccgacgt ggccagccta 1594201 gcgccgcgga cccggcagct ggtgaccacc gcggccaagg tcgaatggac ggtcgtgggg 1594261 accgaggttg aggcactgca gctggaatac acctggatca aggagttcga tccgcgattc 1594321 aacgtccgct accgcgacga caagtcctac cctgtgctgg cggtcaccct gggcgaggaa 1594381 tttccccggt tgatggtcta tcgcggtccg cggcgcaagg gtgtgcgcta tttcgggccg 1594441 tactcgcacg cgtgggcaat ccgggaaacg ctggatctgc tcacccgggt gtttccggcg 1594501 cgaacttgct cggcgggggt gtttaagcgg cacaggcaga tcgatcgtcc atgcctgctc 1594561 ggctacatcg acaaatgttc cgcgccgtgt attggcaggg tcgatgcggc ccagcaccgc 1594621 cagatcgtgg cagacttctg cgactttctg tccggcaaga ccgaccggtt cgcccgcgcc 1594681 ttggaacagc aaatgaacgc cgcggccgag caactggact tcgaacgagc ggcgcggctt 1594741 cgcgacgacc tgtccgcact gaagcgtgcc atggaaaagc aggccgtggt gctcggggac 1594801 ggcaccgacg ccgacgtggt ggcattcgcc gacgacgaac tcgaggcggc ggtgcaagtg 1594861 ttccacgtgc gcggcggacg ggtccgcggc cagcgtggct ggattgtcga aaagccagga 1594921 gagccaggag attccggaat ccagttggtc gagcaattcc tgacacagtt ctacggcgac 1594981 caggcggcgt tggacgacgc cgccgacgaa tccgccaacc cggttccccg cgaggtgctg 1595041 gtgccctgtt tgccgtccaa cgccgaggag ctggccagct ggctgtccgg cctgcgcggc 1595101 tcaagggtcg tgctgcgggt gccgcgccgc ggggacaagc gggcactggc cgaaacggtg 1595161 caccgaaacg cagaagatgc actgcaacaa cacaagctga agcgggccag cgatttcaac 1595221 gccagatccg ctgcgctgca gagcattcag gactcgttgg gcctggcaga cgcacccttg 1595281 cggatcgagt gtgtcgacgt cagccatgtg cagggcaccg acgtggtcgg gtcactggtg 1595341 gtgttcgaag acggcctgcc gcgcaagtcg gactaccgcc acttcgggat ccgggaagcc 1595401 gcaggccagg ggcgctccga cgacgtggcc tgtattgccg aggtgacccg gcgccgcttc 1595461 ctgcggcacc tgcgcgatca gagcgatccg gatcttcttt ctccggaaag gaagtcgcgt 1595521 agattcgcct atccgcccaa tctgtacgtc gtcgacggcg gcgcgccgca agtcaacgcg 1595581 gccagtgcgg taatcgacga actcggtgtt accgacgtcg cggtgatcgg cctggccaag 1595641 cggctggaag aggtatgggt gccgtcggag ccggacccga ttatcatgcc gcgcaacagt 1595701 gagggactct atctgctgca gcgagtgcga gacgaggcac accggttcgc tatcacctac 1595761 catcgcagca agcggtcgac gcggatgact gcctcagcgc tggactcggt gccgggattg 1595821 ggggagcatc gccgcaaagc gctggtcacc catttcggat cgatcgctcg cctcaaggag 1595881 gccaccgtcg acgaaatcac cgctgttccc ggtatcggcg tggccacggc cacggccgtc 1595941 cacgacgcac tgcgacctga ctcatcgggg gccgcgcgat gatgaaccat gctaggggcg 1596001 tcgagaatcg ttcggaaggc ggcggtatcg acgtcgtctt ggtaaccggg ctgtccgggg 1596061 ccgggcgcgg cacggcggct aaagtgctgg aagacctggg ctggtatgtg gccgacaatc 1596121 tgccgcccca gctgattacc cgcatggtgg acttcgggct ggccgccgga tcacggatca 1596181 cccagctggc ggtggtaatg gatgtgcgat cgcgcggatt caccggcgac ctcgattcgg 1596241 tccgcaacga gctggccacg cgtgccatca ccccgcgtgt ggtgttcatg gaggcgtccg 1596301 atgacacgtt ggtgcgccgc tacgaacaga atcgccgcag tcatccgctg cagggtgagc 1596361 agactctggc cgagggcatt gccgcagagc gcaggatgct agcaccggtt cgcgccaccg 1596421 ccgacctgat catcgacacg tcgacactgt cggtgggggg cttaagggat agcatcgagc 1596481 gtgccttcgg cggtgatggc ggcgcgacca ccagcgtcac cgttgaatcc ttcgggttca 1596541 agtacggcct gccgatggac gccgacatgg tcatggacgt gcggttcctg ccgaacccgc 1596601 actgggtgga cgagttgcgg ccactgaccg gccaacatcc ggccgtgcgc gactatgtgc 1596661 tgcaccggcc gggcgcggct gagttcctcg agtcctacca tcggttgcta tccctggttg 1596721 tcgacggcta ccgccgagag gggaagcgct atatgacaat cgccatcggc tgtaccggtg 1596781 gtaagcatcg cagcgtcgcg atcgctgaag cactgatggg acttctgcgc tccgatcagc 1596841 aactgtcggt gcgggcgctg caccgggatc tgggtcgcga atgaccgatg gcatcgtcgc 1596901 gctgggcggc ggacacggct tgtatgcgac gctgtctgcg gcccgccggt tgacacccta 1596961 cgttaccgcc gtggtgaccg tcgccgatga cggtggctcg tcgggccggc tgcgcagcga 1597021 gctcgatgtg gtgccgccgg gcgatctgcg aatggccttg gcggcgttgg catccgatag 1597081 cccgcacgga cgcctgtggg caactattct gcagcacaga ttcggcggca gtggtgcgct 1597141 ggccggacat ccgatcggca atctgatgct agcgggcctg tccgaggtgc tggccgatcc 1597201 ggtcgcggct cttgacgaac tcgggcgcat cctcggggtg aaaggcaggg tgctgccgat 1597261 gtgcccggtc gcgcttcaga tcgaggccga tgtctccggt ctggaggccg acccgcgcat 1597321 gttccgcctg atccgtggcc aggtggcgat cgcgaccacg cccggaaagg tgcgccgggt 1597381 gcggctgctg ccgactgacc cgccggcgac ccggcaggct gtcgacgcca tcatggctgc 1597441 cgatctggtg gtcctggggc ccgggtcgtg gttcaccagc gtgatacccc atgtgctggt 1597501 gccgggtctg gccgcagcgc tgcgagcaac gtcggcccgc cgtgccctgg tgctcaacct 1597561 ggtggctgaa ccgggagaga cggccggttt ctcggtggag cgtcatctgc acgtgctagc 1597621 ccaacacgcg cccgggttca ccgttcacga catcatcatc gacgccgaac gagtgccgag 1597681 cgaacgggag cgggagcaac tgcgccgcac ggcgacgatg ctgcaggccg aggtccactt 1597741 cgccgatgtc gccagacctg gtacaccttt acatgacccg ggcaagctgg cggcggtcct 1597801 cgacggggtg tgtgcgcgcg acgtcggcgc gtcggagcct ccggtggcgg ccacacagga 1597861 gataccgatc gacggtggac gaccgagggg tgacgacgcg tggcgatgac gaccgatgtc 1597921 aaagacgagc tgagccgact ggtggtgaag tccgtcagcg cgcggcgcgc ggaggtcacc 1597981 tctctgctgc gattcgccgg cgggttgcac atcgtgggcg gccgcgtggt ggtcgaagcc 1598041 gagctggacc tgggcagtat cgcacggcgg ctgcgtaagg agatcttcga gctctacggc 1598101 tacacggcgg tggtgcatgt gttgtcggcc agcgggattc gcaagagcac ccgctacgtg 1598161 ctgcgggtcg ccaacgacgg cgaggcgttg gcacgccaaa ccggactgct tgacatgcgc 1598221 ggtcgtcccg tgcggggtct gccggcccag gtcgtcggcg gcagcatcga tgacgctgaa 1598281 gctgcgtggc gaggagcatt tttggcgcac gggtcgctga ctgagccggg acgctcctcg 1598341 gcgttggagg tcagttgccc gggcccggag gccgcgctgg cgctggtggg tgcggcacgc 1598401 cggcttgggg tcggcgccaa ggctcgtgag gtgcgcggtg ccgatcgcgt ggtggtgcgc 1598461 gacggtgagg cgatcggcgc actgctgacc cggatggggg cccaagacac ccggctggtc 1598521 tgggaggagc ggcggctgcg tcgtgaggtg cgtgcgacgg ccaaccggct cgccaatttc 1598581 gacgacgcca atctgcgccg ctcggcgcgg gccgcggttg ccgcggccgc ccgggtggag 1598641 cgtgccttgg agatcctcgg cgatacggtg cccgagcact tggcctcggc cggcaaattg 1598701 cgtgtcgagc accggcaggc gtcgctggag gagctgggcc ggcttgccga tcctccgatg 1598761 acgaaagacg ctgtagccgg acgtattcgg cgattgttgt cgatggcgga tcgtaaggcg 1598821 aaggtggacg gcatccccga tacggagtcc gtagtgacgc ccgatctgct ggaagacgcc 1598881 tagcgggctg acttacttcg gtgccacgca caccaattgg ctgcttgccg ggggtattgc 1598941 tggcccttcg atttcctcgg gcggctgcag agagactgac gcggaatcgc agcgccctcc 1599001 ggcaccgagg ctcttgatct cggtgacgac gaatcggctg aactcccggt ttgcagaacg 1599061 tgttccaggc acaagcgcgg tggctacccg cggtgaaggc agcgattcgt cgcacgccga 1599121 cggcgcgtac agcagcacgg atggcggctt gccgggggtc gtcaccgccg gatagcagta 1599181 tccgacccgc accaggtact tcaggcagta atacgcccga tggttctggg tgatcaattc 1599241 gtagtcgatc cgcatgcaac tcggagcgtt ggcatggaat ccgtcattgc ggatccgggc 1599301 ctcccggcta tcgcaagcaa cgacctgcgg acgagacggc gccagcttgg agtcgtaggt 1599361 gaaccggttc aggcacatcc ccaagtccat ggagaacacc aacttcagcg tcgcacgatc 1599421 gtagccccgc ggctcgttgt aacccgaagc ggaagcggtc tggcacgcac tcagcagcag 1599481 cgtgagaatc cccagcaaca ctgggaaaac gagcttctcg gctggcggtc gccggtacga 1599541 cgggaagcta taccgcctcg ccgatgtttg ggccgaagct tgcacacatt gacgataact 1599601 tggtcgcgag accgcagaag ctggcctcga cggcgcgccg gggactacgg tcataccatg 1599661 aagcggcttt cgagcgttga tgctgcgttt tggtccgcgg aaaccgcagg ctggcatatg 1599721 cacgtgggcg cactggcgat ctgcgatccc agcgacgcgc ccgaatacag ctttcagcgg 1599781 ctccgcgagt tgatcatcga acggctgccg gagatcccgc agttgcggtg gcgggtcacc 1599841 ggcgccccgc tcggactgga ccggccgtgg ttcgtcgagg acgaggaact cgacatcgac 1599901 tttcacatcc gccgcatcgg tgttccggct cccggtgggc ggcgcgaact cgaggagctc 1599961 gtcggacggc tgatgtccta caaactggac cgttcccggc cgctgtggga actgtgggtc 1600021 atcgagggcg tcgagggcgg ccgcatcgcc acgctgacca agatgcatca cgccatcgtc 1600081 gacggtgtct ccggtgccgg gctgggcgaa atcctgttgg acatcacacc agaaccacga 1600141 ccaccgcaac aggaaacggt cggcttcgtg ggattccaga ttccgggcct ggaacgccgg 1600201 gcgataggtg cgctgatcaa cgtgggcatc atgacgccct tccgcatcgt caggctgctg 1600261 gagcaaaccg tgcgtcaaca gatcgcggca ttgggtgtgg ccggcaaacc ggcgcgatac 1600321 ttcgaagcgc ccaagacgcg gttcaatgcg ccggtgtcgc cgcaccggcg ggttaccggc 1600381 acacgcgtcg agctggctag ggccaaagcg gtcaaggacg cgttcggcgt caagctcaac 1600441 gacgtcgtct tggcgctggt ggccggggcg gcccggcaat acctacagaa gcgtgacgag 1600501 ctgcccgcca agccgttgat cgcgcagatt ccggtctcca cccgcagcga ggaaacgaag 1600561 gccgacgtcg ggaaccaggt cagctcgatg accgcgtcgc tggcaaccca tatcgaggat 1600621 ccggccaagc gcctggcggc catccacgag agcaccctca gcgccaagga aatggctaag 1600681 gcgccctccg cgcaccagat catggggctg accgagacca cgccaccggg tctgctgcag 1600741 ctggccgccc gggcctatac ggccagcggg ctgtcacaca acctggcccc aatcaacctc 1600801 gtcgtctcca atgtccccgg tccacccttc ccgctatata tggccggcgc gcggctggat 1600861 tcgctggtgc ccctggggcc gccggtgatg gacgtggcgc tgaacatcac ctgcttctcc 1600921 taccaggatt atctggattt cggcctggtg accacacccg aggtggccaa cgacatcgac 1600981 gagatggccg atgccatcga accggcactg gccgagctgg agcgtgccgc ggaatagcaa 1601041 tagctggcct atagctgact acgtggccgg cgggttggtc gcgtacaccc aagacaggaa 1601101 gcgggccacg gcctcggcgg tgtgatgcgc ccgcggggag ccgaagacgt cgaaggcgtg 1601161 ttgggcgtgg ggcaggtccg cgtaggcgac gggcgacttc gacaccgccc gcagttcctc 1601221 gacgaacgca tgggcttcgg ccacggggat cagggagtcg tggcggccgt gcagaacgaa 1601281 gaacggtggg gcgtcggccc gcacatggtg gatcggtgag gcatcgacga agatgtcgcg 1601341 gtgcgtgctg aatttccgtt tcaccacgaa cgtttcgagc aacccgacga attcccgacg 1601401 ccccggcgca tcggtcgtaa accagtcgta acgcccgtat accggaaccg ctgccgccac 1601461 cgaggtgtcg acctgttcga acccgggctg aaatcgcgga tcgttggggg tcaacgccgc 1601521 cagggcgcac agatggccgc cggccgaacc gccgctgatg gcaacgaaat tcggatcccc 1601581 gccgtaggcg gcgatgtttt ccttgaccca cgccagcgcg cgcttcacgt cgacaatgtg 1601641 gtcgggccag gtgtggcgcg gcgacacccg gtagttcagc gacacgcata cccagccgcg 1601701 cgcagccaga tggctcatca acggatacgc ctgcgggcgg cgccacccca gtacccaggc 1601761 gccgccgggc acctgtacca gcaccggtgc cttggcgtcg cgtggcaggt cgcggcggcg 1601821 ccagatgtcg gccaggttgg cccgcccgta tgggccgtag cacacgacgt tcgtcgtctc 1601881 gacgtagcgc cggcgtgcca tggcggtacg cagcgggaga ttgcgacctc tgctacgcat 1601941 cggttccgtg ggcagggtag cgagttcctt agcgtagtcg ggcccgagct gttcggtcag 1602001 gcccgcttcg agcaccggtc caggggtggt ggcgccgcgg tagcggatca ccgcaaggat 1602061 cacccaggcc gctgccgtta aggccagtgc cgcctttcct ttcagcccac cgaagtcgcc 1602121 tcggcggccg cggcgcagtg cgtccagcac ggaggcgcct aggtacactc ctggcacttc 1602181 cgacgtcggc cagcccaacc aaaacgccag aaccgtgctg tagccgctac cggacagtgg 1602241 gcgtaatccg ttggcggcat tgagcaattc caccgctgca cgtgttaacg gtctcgggcg 1602301 tgccatccgc cgaaatcgca ttagctgccg acccgtgatt gcagctcggt gcgcaggatc 1602361 ttgccggtaa tgccgcgtgg cagctcgtcg aggacggcga tgtcgcgcgg taccttgtag 1602421 ttggccaggt tgtctcggac atgctgcttg agggtttccg gggtggccga aacaccgggc 1602481 ttgagcacca cgaaggccgc cagccgctgg ccgtactgct ggtcgtccac gccgatcacc 1602541 gcggcctcgg ccacgtcggg gtgggtggcc agcgtcttct ccacctcgat cgggtagatg 1602601 ttctcaccgc cggagacgat catctcgtcg tcgcgcccga cgacgaacag ccggccgttc 1602661 tcgtcgaggt agccgacgtc gcccgatgac atgaacccgg catggaaatc ctttgcggcg 1602721 ccagatgtat agccatcgaa ttggctgtcg ttgcggacgt agatggtgcc gacctcgccg 1602781 gtgggcacct cggtgaactg ctggtccagg atccggattt cggttccttc ggcgggccga 1602841 cccgcggtgt cgggtgcggt ccgcaggtcc gccggtgtgg cggtggcgat catcccggcc 1602901 tcggtcgcgt tgtagttgtt gtagatcacg tcgccgaatt ggtccatgaa tgcgatcacg 1602961 acatcgggcc gcatccgaga acccgacgcg gcggcgaacc gcaacgaccg gccgtcgtag 1603021 cggtttcgaa tctcggccgg caggtccatg atgcgatcga acatcaccgg caccaccacc 1603081 agacccgtcg cgtggtggcg gtcgatcagg tccagcgtcg cctccgggtc gaacctgcgt 1603141 cgcgtgacga tcgtgcaggc cagcgaggag gccagcacca gctgcgagaa gccccaggca 1603201 tgaaacatcg gcgccacgat cacggtgacc tcctcggccc gccacggcgt gcggtccaag 1603261 atcgccttca gtgtcccgat gccaccgcca gaatgcctgg cgcccttggg tgttccggtg 1603321 gttccggagg tcagcaggat cacttttccg tggctgccgg tgtgctcggg ccgccgtccg 1603381 gcgtgcgcgg ctacaagttt ctcaacggtc aggtcgtggt cttcgtcggt ccacgccacg 1603441 atacgggtgg cctgcggttt ttccgccagc gcgcgatcca ccgtcgcgct gaactcttcg 1603501 tcatagacga cagtgtcgac gccttcgcgg gtaaccacct cggccagtgc cggaccggcg 1603561 aaggaggtgt tgagcaacag gatgtgcgcg ccaatccggt tgaccgccaa cagcgcatcg 1603621 acgaagccgc gatgattgcg gcacatgatg ccgacgaccc tggggggtcc ggctggcagg 1603681 gcctgaagcg ccgcggccag cgcgttgccg cgttcgtcga gctggcgcca ggtcagcgtg 1603741 cccagttcgt cgatcaggcc ggggcggtcc gggcagcgtc gggccgcacc ggcgaacccc 1603801 gccgtaaacc ccatgccttc gcggcgcatg gcggcgacga tccgcaggta gcggtctggt 1603861 cgcagcggag cgatcaaccc tgcccggcgc atggtggcga tcaagccgaa tgcttgtctg 1603921 atacgcatgg cttagcccag aatcgggaag cggcgcttgg cggcgaggtc gttgagggct 1603981 tgctgcatca ccgaccgtac gtgctcgtcg accgcgtcga catcagggtc ctcgccgaac 1604041 tgcttggtga ggttgatcgg gtctaacacc tgcatgacga tcttggcggg cagcggcaga 1604101 ttgggcggga tcgcggcgga gaacccgaac ggaaagccga acgagatcgg caggatgtcg 1604161 ctgcggagca gtcgcttgag ccctagccgc cgggcgagcc aggtgccgcg ggacaggtag 1604221 agctggcttt cctggccacc gatggacacc gccggcacga tgggcacgcc agcttcgacg 1604281 gcagtgctga cgtatccctt gcggccgttg aagtcgatca cgttctccgc gaaagtcggc 1604341 cggtacgcgt catagtcgcc gccgggaaaa acgaccacca cacccccgga ccgcaacgcc 1604401 ttagccgcgt tttctcgggt ggcgcgaatg tagccggtgc gtcggaacaa gtccccggtc 1604461 aggcccatga acaagatgtc gtggctgagc gtgtagaccg gtcggtcgta gccgaacttg 1604521 tcgtagaagt cgacgctgaa gaccggcacg tccatcggga acatgccacc ggagtggttg 1604581 gccacgacca gtgcgccacc cggcgggaag gagtccaggc catgcacctg cgaccggtgg 1604641 taggtcttca agactggacg cagcacactt atcaggcgct gggttaggcc agggtcgaat 1604701 ttgccgatgt cgccgatacc tgcatcgtcc ccgttaccag ggctatcggt ttcgctcaac 1604761 tgttctccct cgaggcctcc gaggcctcat tgccgcgtcg ggtctttaga tggtagcgat 1604821 gcacggtgga taggcacacg cggcaggtct gctagcaagg acgagaggtg gtccagagtg 1604881 gctgaagctg gtggcgggcc catttcggtg atcgcccggc atatgcagtt gattcgcgat 1604941 gacttcatct ccgagttgtt tgacaagatg aaggcggaga ttcgggggct ggattacgac 1605001 gcgcggatgg cggacctgtg gcgggcgagc atcaccgaga atttcgtgac ggccgttcac 1605061 tatttggatc gcgatacgcc gcagtccttg gtggaggctc cagcggccgc gctggcatac 1605121 gcccgcgccg cggcgcagcg tgatattccg ttgtccgggt tggttcgggc gcaccggctc 1605181 gggcatgcgc gtttcttgga ggtggcgatg cagtacgtgt cgctgctgga gcccgctgac 1605241 cgggtgtcga cgatcatcga gctggtgaat cgctccgctc gcctcgttga cctggtggcc 1605301 gaccagttga ttgtcgccta tgagcacgaa cacgatcgct ggctgagtcg ccgcagcggt 1605361 ctgcaacagc aatgggtcag cgagctgctc gccgataccc cggtcgacgt tccgcgggcc 1605421 gagcgcgcgt tgggctatcg gttggacggt gtgcatatcg ccgcggtggt atgggtcgat 1605481 tcggcggtgc ccatcggtga tgtggtggcg caattcgacc aggtgcgctg cttgctggcc 1605541 ggggagctgg gccccgaact gggccccgtg gcgaactcgc tgatggtgcc gaccgatgag 1605601 cgcgaggcac ggctgtggtt ttcgcccgcg cccacgcggg ccttcgcccc gtcgcggatt 1605661 cgcgcggcgt tcgagtcggc gggaatccgg gcgcgtttgg cgtgcggtcg ggtaggggac 1605721 gggctgcgtg ggttccgggc gtcgttgaaa caggccgaac gagtgaaggc gttggccctg 1605781 gccggtggcg cccggcccgg cggccgggtc atgttttatg acgatgtcgc gccagtcgcg 1605841 ttgctggccg acgatctaga ggaactgcgg cggttcgtca ccgatgtgct gggtgacctg 1605901 agtgttgacg acgagcgcaa tagctggcta cgcgagacgt tacgggagtt cttgctgcgt 1605961 aaccgcagct acgtcgccac ggccgacgcg atgatcctgc accgcaacac cattcaatac 1606021 cgggtgatcc aggcgatgga actatgcgga cagaatctcg acgatcccga tgccgcgttt 1606081 cgggtgcaga tggcgctgga ggtctgccgc tggatggcac cggcggtgct ccgcgccaaa 1606141 caatagtgtc tcggtaaccg ccggtccgtt catgccgtgc gcacaatcgt ggtcgtgagc 1606201 ttcggtgtcg gcgcatatgg tctccgacgg attcggcgcc taacgtttgc ccacgtcaaa 1606261 caacccgacc agaaagccag ccgggtccgc cagagggggg cggacccggc gtatacccaa 1606321 ttcgcgtcgc tcggttctag ttgggcgcta tcatccgttg ccacggggtt ggtcggaagg 1606381 tcggtatgtc gttcgttttc gcggtgccag agatggtggc ggcaaccgct tccgatttgg 1606441 ccagcctcgg agcggcgctg agcgaggcca ccgcggcggc ggctatcccc accacacaag 1606501 tactggccgc ggccgccgat gaggtgtcgg cggccatcgc ggagttgttc ggtgcgcacg 1606561 gccaagaatt tcaagcgctc agcgcccagg catcggcgtt tcatgaccgg ttcgtgcggg 1606621 ccctaagcgc cgcagcgggc tggtatgtcg acgccgaggc cgccaacgcc gcgctggtgg 1606681 acaccgcggc caccggcgcg tcggagttgg ggtcaggtgg gcgcacggcg ctgattctgg 1606741 gctccaccgg aaccccgcga ccgcccttcg actacatgca gcaggtctac gaccgctaca 1606801 tcgcacccca ctacttgggc tatgcgtttt ccggcctgta cacgcccgcg cagtttcagc 1606861 cgtggaccgg catccccagc ctgacctacg accaatcggt cgccgaaggc gccggctatc 1606921 ttcacaccgc gatcatgcag caagtcgcgg ccggcaatga cgttgtggtg ttgggtttct 1606981 cgcagggcgc gtcggtcgcc accctggaaa tgcgccatct ggcaagcctg ccggccggcg 1607041 tcgcgccgag tccggatcag ctctcgttcg tattgctggg caaccccaac aacccaaacg 1607101 ggggcatcct cgcccggttt ccgggtctgt acctgcagtc gctcggcctg acgttcaacg 1607161 gtgcgacccc ggacaccgac tacgcgacca ccatttacac gacccaatac gacggctttg 1607221 ccgacttccc gaagtacccg ctcaacatcc tggcggacgt caacgcgctg ctgggtattt 1607281 actattcgca cagcttgtat tacgggctca cgcccgagca ggtcgcttcg ggtatcgtcc 1607341 tgccggtgtc ttcgccggac accaacacca cctatattct gcttcccaac gaggatctgc 1607401 cgctgctgca gccgctgcgc ggtattgtgc ccgagccgct gctggatctc atcgagccag 1607461 acctgcgcgc gatcatcgaa ttgggttatg accgaaccgg atacgccgat gttccgaccc 1607521 cggccgcact gttcccggtg cacatcgacc cgatcgcagt cccgccccag ataggcgctg 1607581 cgatcggtgg tccgctcacc gccctggatg gcttgctcga caccgtgatc aacgatcaac 1607641 tcaatcccgt cgtaacgtcg ggcatctatc aggccggtgc tgagctgtcg gtggccgcgg 1607701 ccggctacgg tgctcccgca ggcgtcacca atgccatttt tattgggcag caagtgttgc 1607761 cgattttggt ggaaggcccc ggtgccttgg tgacggccga cacccattac ctggtcgatg 1607821 cgattcagga tttggccgcc ggtgacctca gcgggttcaa ccaaaacctg caactcatcc 1607881 cggctaccaa catagccctg ctggtcttcg cggccggaat tcccgctgtg gcggccgtcg 1607941 ccatccttac cggtcaggat tttccggtat aggcccccgg cccccgctgt accgagctcg 1608001 gccagtgaag aacaacccca ggcgttgcca gtccgaatag attgtattcg tcagccggcg 1608061 caggacagga agcgaggccg ccatgggatt tctgaagccc gatcttcccg acgtcgatca 1608121 cgacacctgg ttgacccagc cacgccggac acgattgcag gtcgtgacac gggactgggt 1608181 agaacacggt ttcggaacgc cgtatgcggt gtacctgctc tatctgacca agattgcggt 1608241 gtacgtcgcc gccggcgccg cgatcatctc gctgaacccc ggactgggcg ggctgagccg 1608301 cataggcgac tggtggacac agccgatcgt gtaccagaag gtcatcgtct tcacgttgct 1608361 gttcgaggtt ttgggttttg gctgcggatc cggcccgctg accgggcggt tttggccacc 1608421 catcgggggc ttcctttatt ggttgcggcc caacacaatt cggctgcctg cttggccgga 1608481 taaggtcccg ttcacccaag gcgacacccg caccgtcgtc gacgtcgcct tgtatgccat 1608541 cgtgttgatc ggcggggtgt gggcgctgtt gtcacccggc tcgccaggtc cggggggaac 1608601 gccggtcacc gccgccggcg acgtcggcct gatcaacccg gtgctggtag tgccgacgat 1608661 cgtcgccctg ggcgtcttgg ggctgcgtga caagacgatc tttcttgccg cccgcggcga 1608721 acactactgg ctgaagctat tcgtgttctt ttttcccttc accgaccaga tcgcggcgtt 1608781 caagatcatc atgctgtgct tgtggtgggg ggcggcgact tccaaactca accaccattt 1608841 cccctacgtc gtcgcggtga tgaccagcaa caacgccctg ttgcgcagca gagtgttcaa 1608901 cccgatcaag cacctgcttt accgcgacca cgccaacgat ctgcggccct cctggctacc 1608961 gaaactcatg gcccacgggg gtggcaccac ggcggaattc ctggtgcccg ggattctggt 1609021 gctcgtcgcc gacggtcacc catggcggtg gttcctcatc gggttcatgg tgctctttca 1609081 cctcaacatc ctgtccaacc tcccgatggg ggtcccgttg gagtggaacg tgttcttcat 1609141 cttctcgctg tgctatctat tcggccacta cggcgcgatc actgccaccg accttcggtc 1609201 gccgttgctg ctggcgatcg tgatcgcggt ggttgccgtg gtgatcatgg gaaacctgtt 1609261 gcccgaaaag atttcgtttc tgcccgccat gcgctactac gccggcaact gggccaccag 1609321 catctggtgc ttccgaggtg atgcggaagc caccatggaa accagcgtcg tgaaaagctc 1609381 tgcgctggtg gtcaatcagc tggccaagct ctacgacggg gccacggccg aaatcatgac 1609441 cgacaaggtc gccgcattcc gggccatgca cacccacggc agggcgctca acggcctgct 1609501 gccccgcgct ctcgatgacg aagctcacta ccgcatccgc gagggcgaaa tcgtggccgg 1609561 gccactggtc gggtggaatt tcggcgaggg ccatctgcac aacgagcagc tggtggccgc 1609621 cgtgcagcgg cggtgcaact tcgccgacgg cgatctgcgg gtgatcattc tcgaaggtca 1609681 gcccatccac gttcagaagc agtggtatcg cattgtcgac gccaagaccg gtttgttcga 1609741 ggccggttac gtcacggtcg aggacatgtt gagccgccag ccatggcccg agcccggtga 1609801 cgagttcccg gttcacgtca cgacgcaacg cggcacgcca tcaaagccat gacgaccgcg 1609861 gtcgtcgtcg gagccgggcc caacggcctg gccgcggcga tccacctggc ccgtcacggt 1609921 gtcgacgtgc aggtgctgga ggcgcgcgac accatcggcg ggggagcacg ctccggtgag 1609981 ctgacggtgc ccggggtcat ccacgaccac tgttcggcgt ttcatccgct gggcgtcggg 1610041 tcgccattct gggcggcgat cgacctgcaa cgctacgggc tgacgtggaa gtggccggac 1610101 gtcgactgcg cacacccact cgatgacggc accgcgggcg tgctatatcg gtcgatcgaa 1610161 gccaccgccg ccggcctggg tcccgacggc aagcggtggc agcgcgccgt gggtgacctc 1610221 gccgccggat tcgatgagct ggccgaggat ctgctgcgcc cggtgctcaa catgccgcgt 1610281 cacccgatcc gcctggcccg ctttggtccg cgcgcggcgc tgccggccac cgccatggcg 1610341 cgtcggtttc acaccgagcg ggcgcgcgcg ttgttcggcg gcgccgcggc gcacgtctac 1610401 accaggttgg atcggccgct gaccgcgtcg ctggggttga tgatcctggc cagcggccat 1610461 cgccacggtt ggccggtcgc ccggggcgga tccgggtcga tcacgaaggc gctggccgcg 1610521 gccctggacg cgtacggcgg caccgtcgcc accggggtga ccgtcaccag ccgccgcgac 1610581 atccccgacg ccgacatcgt gatgctcgac ctcagcccgg ccgcggtgct cgggatctac 1610641 ggcgatgtga tgcccacccg catcaaccgg tcctatcggc gctaccgcgc cggatcgtcg 1610701 gccttcaagg tcgacttcgc catcgagggc gacgttgggt ggaccaaccc cgattgccgg 1610761 cgcgcgggca ccgtccacct gggcgggacc ttcgcggaaa tcgcagacac cgaacgtcaa 1610821 cgcgcccaag gcacgatggt gcagcgacca ttcgtgctcg tcgggcagca gtacctcgcc 1610881 gacccgtccc gctcggtcgg caacatcaac cccatctggg cctacgcgca cgtgccgttc 1610941 ggctacaccg gcgacgccac cgccgccgtc atcgaccaga tcgagcggtt cgcccccgga 1611001 ttccgcgacc gcatcgtggc aaccgtcagc acctccacca ccgaactgca aacgtacaac 1611061 cgcaacttca tcggcggaga cattatcggc ggcgccaacg accggctgca ggtcatcttc 1611121 cgcccgcgcg tggccgtcga tccgtatgcg atcggtgtgc cgggtgtcta tctgtgttca 1611181 cagtccgcgc cacccggtgc cgggatccac ggattgtgtg gctaccacgc cgccgaatcg 1611241 gcgctgaggt ggctgcgcaa gcgacgttga cgcaggtcat cgtcgagatc gacgttagcg 1611301 cgacgtccac tcgtgccgta gccaaaacgt gacggaggtt tgatcgaatt gctaaggcgc 1611361 gcctgcactt ccactcttca atgcacctct accatcactg gtgcaactgt gtcgttgaca 1611421 gggaattgga gccatgcggg cggtttttgg gtgtgctatt gccgtcgtcg ggatcgctgg 1611481 gagcgtggtt gcggggccgg ccgacataca cctggtggcg gcgaagcagt cttacgggtt 1611541 cgccgtcgcg tcggtgctac caacgcgcgg ccaggtggtg ggcgtggcgc accccgtggt 1611601 ggtgacgttc agtgcgccga taactaaccc agccaatcgg cacgcggccg agcgcgccgt 1611661 tgaagtcaaa tcgacgcccg cgatgaccgg caagttcgaa tggctcgaca acgacgttgt 1611721 gcagtgggtt cccgaccgct tctggccggc gcacagcacg gtggagcttt cggtgggcag 1611781 cctgtcgagc gatttcaaga cgggtcccgc cgtcgtcggg gttgccagca tctcccagca 1611841 cacgttcacc gtgagtatcg acggagtcga ggagggaccg ccgcctccgc tgccggcgcc 1611901 gcaccaccga gtgcacttcg gcgaagatgg ggtgatgccg gcatcgatgg gtagaccgga 1611961 atacccgacg ccggtcggct cctacactgt cttgtccaag gaacgctcgg tgattatgga 1612021 ttcgagcagc gtcggcatcc ccgtcgacga tcccgatggt taccggcttt cggtggatta 1612081 tgccgtccgc atcaccagcc gcggcctcta cgtgcattca gccccgtggg cccttccagc 1612141 actgggactt gaaaatgtca gccacggctg cataagcctg agccgcgagg acgcagagtg 1612201 gtattacaac gcggtcgaca ttggcgaccc ggtcattgtg caggaatagc agctgatgcg 1612261 ggcgtcgccc gcagagcgcg tcgacggcgc gtacgcgggt gcggggcctc acacccagtc 1612321 cgtcctggaa gaggaccagc gtcagcgcgc acctgcgggc gcagaggccg aaggaccggg 1612381 cagaaccggc tgaccaggca ccggtccgcc agctggcgcc ggatcggtca gcgcatcctt 1612441 gaccccggac atgccaatga tgggagcact gaccacacca tccccgggag caccagccag 1612501 gaccggccca agcgcaatca gcggagttcc gaccggtatc accggagccg gaacggcggg 1612561 taccggtacc ggtgcgcccg gtatcggtac cggtccgccg gggattggta ccggtgcgcc 1612621 cggtatcggt accggtgcgc cagggattgg taccggtgcg ccgatgggca ccggtgcagc 1612681 tgccggcact ggcccaggcg cgacgaacgg aacaccagcc atgtcagtaa gtgcggcact 1612741 gcacgctccc gcggctgccg gtccaccggc agccaccggg tcgccggcgg ctaccggcgc 1612801 gtcgcccgcc atgccctgga tgcacgcgta gccacccgtc atcagcgggt cagccgccgc 1612861 gtccgggctt aacgctatag cagctgcaaa caacccagcg ccggcaatta ctttgatgtt 1612921 gaaccgattg acgatcgcca tcagcgtcaa ctctcctcta ttcgcgcgca gatatttccg 1612981 caatcaattt ggttcagcag aaccgcatag ccgtatcgag ttccttttcg accatcggct 1613041 caattgtcag catcctatgg ggaacatgag ccccgccgca ccgggccgtt tccaaatggt 1613101 gacgtcacaa cggtgtcaca agccagcgca atgtccgcgg tagggacgcg gcggctggga 1613161 tcggtggggt gagcgcccgg cttctcaaag cgaggggagc cccgggactc ttaccggccg 1613221 aaggcggcgg gtgtcactga tctaggctga cggccagtgg ttgtttagcc aacaaggatg 1613281 acaacaaata agccgaggag agacaagtga cggtccgagt aggcatcaac gggtttggtc 1613341 gaatcggacg caacttctac cgggccttac tggcccaaca ggagcagggc accgccgacg 1613401 tggaggtggt cgccgccaac gacatcaccg acaacagcac gctggcgcat ctgctcaaat 1613461 tcgactcgat tctgggccgg ctgccttgcg atgtcggcct cgaaggcgac gacaccatcg 1613521 tcgtcggccg cgcgaaaatc aaggcgctcg cggtccggga ggggccggcg gcattgccat 1613581 ggggagacct cggcgtcgac gtcgtcgtcg aatccaccgg cctgttcacc aatgcggcca 1613641 aagccaaagg ccacctggac gccggcgcca agaaggtgat catctctgcg cccgccaccg 1613701 acgaggacat caccatcgtc ctgggagtta acgacgacaa gtatgacggc agccagaaca 1613761 tcatctccaa tgcgtcgtgc accacgaact gccttgcgcc gctggccaaa gtgctcgacg 1613821 atgagttcgg catcgtcaag ggcctgatga ccaccatcca cgcctacact caggatcaga 1613881 acctgcagga cgggccgcac aaggacctgc gtcgcgcccg cgccgccgcg ctgaacatcg 1613941 tgccgacctc caccggcgcg gccaaggcca tcggcctggt gatgccgcag ctaaagggca 1614001 agctcgacgg ttatgcgctg cgggtgccga tccccaccgg ctcggtcacc gaccttacgg 1614061 tcgacttatc cacacgggcc agtgtcgatg agatcaacgc ggcgttcaaa gccgcggccg 1614121 aaggcaggct caagggcatt ctgaagtact acgacgcgcc gatcgtctcg agcgacatcg 1614181 tcaccgaccc gcacagttcg attttcgact ctgggttgac caaagtcatc gacgaccagg 1614241 ccaaggtggt gtcgtggtac gacaacgagt ggggctactc caaccgcctg gttgatctgg 1614301 tcacgctggt cggcaagtcg ctctagccat gagcgttgca aacctcaagg atctactcgc 1614361 cgaaggtgtt tcggggcgtg gagtgctggt gcgctccgat ctcaacgttc cgctcgacga 1614421 ggacggcacc attaccgatg cgggccgcat catcgcgtcg gcgccgacgt tgaaggcgtt 1614481 gctcgacgcc gacgccaagg tggtggttgc cgcgcacttg ggacgtccca aggacgggcc 1614541 ggacccgaca ctgtcgctgg cgccggtcgc cgtggcgctg ggtgagcaac tcggccggca 1614601 cgtccagctg gctggagacg ttgtcggcgc cgatgcgctg gcccgcgccg aggggctcac 1614661 cggcggcgac atcctgctgc tggagaacat ccgcttcgac aaacgcgaaa ccagcaagaa 1614721 cgatgacgac cggcgggcac tggccaagca gctggtcgaa ctggtcggaa cgggaggcgt 1614781 tttcgtctcc gacggctttg gggtggtgca ccgcaagcaa gcctcggtct atgacatcgc 1614841 aaccctgttg ccgcactacg ccggcacgct ggtcgccgac gagatgcggg tactggagca 1614901 gttgaccagc tcgacccagc ggccctatgc ggtagtgctc ggcggatcaa aggtgtccga 1614961 caagctgggt gtcatcgagt cgctggcgac caaggcggac agcattgtga ttggcggcgg 1615021 aatgtgcttc acattccttg ctgcacaggg attttcggtt ggcacatcgc tgctggaaga 1615081 cgacatgatc gaagtctgtc gcgggctgct ggaaacctat cacgacgtgt tgcggctgcc 1615141 cgtggatcta gtggtcacgg agaagttcgc cgccgactcg ccgccccaga cggtcgacgt 1615201 cggcgctgtg cccaatggct tgatgggcct ggatatcggg ccgggatcga tcaaacggtt 1615261 cagcacgctg ctgtccaacg ccgggaccat cttctggaac gggccgatgg gagtattcga 1615321 gttcccggct tatgcggccg gcaccagagg cgtcgccgag gcgatcgtcg ccgccaccgg 1615381 caaaggggcg tttagtgtgg tcggcggcgg tgactccgcg gccgcagtgc gcgcgatgaa 1615441 catccccgag ggcgccttct cacacatatc caccggcggc ggtgcctcgc tggaatacct 1615501 tgagggcaag acgcttcccg gcatcgaggt actgagccgt gagcagccaa ccggaggagt 1615561 tttgtgagcc gcaagccgct gatagccggc aactggaaga tgaacctcaa ccactacgag 1615621 gcgatcgcgc tggtgcaaaa gatcgcgttc tcgttgccgg acaagtatta cgaccgggtt 1615681 gacgtcgcgg tgatcccgcc gtttaccgac ctgcgcagcg tgcaaaccct ggtcgacggc 1615741 gacaagctgc ggttgaccta tggtgcacaa gacttgtcac cacatgactc cggtgcctat 1615801 acgggtgacg tcagcggcgc ctttctggcc aagttggggt gcagttacgt tgtcgtcggg 1615861 cactccgagc ggcgcaccta tcacaacgag gatgacgcgc tggtggccgc caaagccgcc 1615921 accgcactca agcatggctt gaccccaatc gtgtgtattg gcgagcacct cgacgtccgc 1615981 gaggcgggaa atcatgtggc ccacaacatc gaacagttgc gtggatcgct ggccgggcta 1616041 ttggccgagc agatcggcag cgtcgtcatc gcctacgaac cggtctgggc gatcggcacc 1616101 gggcgggtgg ccagcgccgc cgacgcccag gaggtgtgtg cggcgatccg aaaagagttg 1616161 gcctcgttgg cctcgccgag gattgccgat acggtgcggg tgctctacgg cggctcggtg 1616221 aacgccaaaa acgtcggcga catcgtggcc caggatgacg tcgatggtgg cctggtcggc 1616281 ggggcgtcgc tggacgggga gcatttcgcg acgctggccg cgattgcggc cggtggtccg 1616341 ttgccgtagc ggatcgcggg cgtgctacac ccgtagacct tcgagtaggg ccataaatgc 1616401 gcgttcgacc tcgactctgg tccggtcttt gtccgtcgcg tccgcgatct gcagcgcgga 1616461 ttcggttagc gcggccagca gcagatgcga aagtggtggc aacggtacgc gctgaatcac 1616521 cccggcggcc atcccgcgtt cgagagcccc gaccagcaga ccaagcccta gcgcatgtcg 1616581 atccggcgcc attcgcccca cccgagcact gacgggccgt caatcgcaat gacctgcagc 1616641 gcatccggtt tggtcgccgc gtcaaggaag gcgtggaagc cgacgaccag cagatccagg 1616701 cgtcggtgac cttcgctatg gcggcttcga cgtcggcgac caggtcggct tcgacaacct 1616761 cgagtaccgt ctggaacaga tctttcttgc tgtcgaagtg gtagtccagg gcgccacggg 1616821 tgactcgggc acgggtgacg atgtcttcga tcgagacgtc accatagtcg cgccgcgcga 1616881 ataggtaacg gccagcgtcg acgagggctc gacgcgtcgc gtccgtgtgg tccgagcgcc 1616941 tgctggccgt catttcgacg tcaagcccgg cttcgcatgg ttgtcaacca gccacgccag 1617001 gccgacggat gcttgactac cttgatcaac agtgggagcg agtcgaaata gctcacgcgt 1617061 tctacggcct tgtcgccacg cagcaggaac cgatcgacga ctggccactc gacgacctcg 1617121 ctgccgagcc gtgctatcag ccggaactcg atgaacacca cgtcgcctgc ttggctccac 1617181 cggtcaactt ccccgtgcag gtcaggcagc aaacccagaa tccgagtgaa ctcccgctgg 1617241 gccgccccca ggccgtgcct cggcggtgac agtggccgta ccaggactac gtcgggatga 1617301 aggtgatcgg tcagtctatc cggcgacggc gccttccaga agtcggcgaa cccttcgacg 1617361 aatgcgttgg atgcgctcat ctgcatggcc ctttcggtgt ttgttcgctc gacagtctta 1617421 ctgcgtaagc ctgggggcga attcagcgga catcgttgct tatcggtagg aagctacggc 1617481 cgtcacagtg gtctcagcag cgggggaata cacattttgc ccgccccggc gcgacaactc 1617541 ggttgaagtc atgcccggat cggcatgttt ggccacgaac ggaatcgcga cagcgccacg 1617601 gcgtcgagcc tcgccatgca cctagccggc gcctttgaac tcgtgagcgg accgaagtgg 1617661 accgcctgtc gcttcgaggc gggcacagtg cgtattccct cgcaagggaa gcgccggtgg 1617721 caggcgtgac agccgcggtc agtgcacgcc tcaaagccga tgaggcgcga cggcctgggt 1617781 tctacgcggc aggcagcggt ccgctgccgc aggttcgggg gagtacgcta cccgtcatgg 1617841 aattggccct gcagatcacg ctgatcgtca cgagcgtgct ggtggtgttg ttagtactgc 1617901 tgcaccgggc caagggtggc gggctatcga cactgttcgg cggtggtgtg cagtcaagcc 1617961 tgtccggctc gacggtggtg gagaagaacc tggaccggtt gacgctgttc gttaccggca 1618021 tctggctggt gtccatcatc ggcgtggcgt tgctcatcaa ataccgctag cgctggtcgg 1618081 ctaccgccga ccggaccggg ggaagcggta gctcattgcc gattacgact tggtgcagcg 1618141 caggattctg ctgaccatga ccgggctggc cagcgcgctc agaaacagtg gttagtcggc 1618201 ctgaccggtc acccgtgctt tccttgcgcg ccattggcgc cgccgatccc gtcgggcaca 1618261 ccgacgccgc caggtccgcc ggtgccgccg tcgccgccaa agccgggatt gccgccacct 1618321 tggctgggcc cgccgtcacc gccgttgccg ccggcgccgc cgttaccgcc ggcgccggtg 1618381 ccgcctccgc ctgccccacc cgcggcgccg ttgccgccgt tgccgccgtt gccggcttgg 1618441 cctttgccgt cgaggctttc gatatagccg ccggtgccgc cggtgccgcc tgcgccgcca 1618501 gcgccgccgg cgccggcgct gctgccattg ccgatggtca atgcgctggc gccgccggtg 1618561 ccaccgacgc cgccgttgcc gccggtaccg cctttgccgc cgatcattga gctgccgccg 1618621 ccggtgccgc cggcgccgcc gtcgccgccg gcgccgccgg cgccggcgct gctgccgccg 1618681 atgccagctg tgccgccagt accaccggcg ccgccggtgc cgccgtcgcc gccgatgccg 1618741 ccagcgccta gcgccgtgcc gccgtcgcca ccttggccag cggtgccgcc gttgccgccg 1618801 gcgccgccat tgccgaacag ccggccacca gccccacccg cagcgccgtt gccgccgtcg 1618861 ccgccacggg cgccgttggc accgctgtta ggactgtcgc cggcaccgcc ggcgccgccg 1618921 tccccgccgg tcccaccggc gccgccggtg ccgaacatcc cagcagcacc accggcatca 1618981 ccgccaccgc cgttaccgcc agggctggcg gggacggggg ggaggccgcc gccgccgtcg 1619041 gcgccgctgg cgccagtacc gccgttgccg ccggcgccgc cgttgccgct tagccagcca 1619101 ccggctccac cggcgccgcc ggctccaccg gccgcgccgg ttccggccgc cgggctgtaa 1619161 ccggcaccgc cggccccgcc gttaccgaac attccggcat ccccgccgtt gccgccgttg 1619221 gggtgggcgg cgtcgccggc tccgccgttc ccgccgttgc cccacagcaa cccgccggcc 1619281 ccaccgtttt gaccgggcag cccgtcggcg ccgttgccga tcagcggacg ccccagcagt 1619341 gtctgggtgg gcgcgttgat ggccgcgagc acctgttgtt cgagggcctg caagggggag 1619401 gcgttggcgg cctcggcggc ggcatacgag cccacgctcg cggttaaggc ctgcacgaac 1619461 tgttgatgaa atctggccat ttgggcactg agcgcctgat agtcgcgggc gtagccagaa 1619521 aacaacgacg cgatggccgc cgacacctca tcggccccgg cggccaggac accagccgtc 1619581 gtgggtgccg cggctccgtt ggccgcgcta agcgccgcac cgatgctcgc cacatccgct 1619641 gccgccgctg acaacattcc cgggactacc atcacgttcg acatcgctgc agtctaaaac 1619701 ctggtgccat cgttgcgacg caaaacaatc gacatgctta ccatttctga gctcaactag 1619761 ctgctaggtt gccgcactag actgctgcaa atgcaggtct atacgtcggc aacgcactgg 1619821 ggcgtgttca ccgctcgggt gcacggcggc gacattgcgg ccgtggccgc gctcgccagt 1619881 gacaccaacc cggctccgca gctgcaaaac ctgcccggcg cggtacgtca ccgcagccgc 1619941 atcgccaacc ccgccgtacg gcgcggatgg ctgcagcatg gcccggggcc cagctcggct 1620001 cgcggcgccg aagagttcgt ggaggtcagc tgggacgagt tgatcgagct gctggcttcc 1620061 gagctgcgcc gtaccgtcga ccgctacggc aacgaggcga tctatggcag ctcctacggc 1620121 tgggccagcg ccggacggtt ccaccacgcg caaagccagg tgcaccggtt cctcaacatg 1620181 ctcggcgggt acaccgcatc ccggcacagc tacagcgccg gcgcgtccga agtgatcttc 1620241 ccgcatatcg tcggcgcggc cctgttcgaa gccctggccg agaccacgac ctgggatgtc 1620301 atcgtcgacc acaccgcgct gttggtggcg ttcggcggat tgccggtgaa gaacaccgcg 1620361 gtgatgcccg gcggtaccac cgctcatccg gaccgcgact acgtcggccg gtaccgggct 1620421 cgcggcggtc ggctggtgtc ggtcagcccg ctacgtgacg acatcgccgc gatcgccggt 1620481 ccgctcgacg atcgatgtcg ctggcttgcg ccggtgcctg gcaccgatgt ggcgatcatg 1620541 ctcgggctgg catacgtgct ggccaccgag tcgctggccg atcgcgcgtt ccttggcagg 1620601 tattgcaccg gctacgaacg cttcgagcgc tacctgctgg gcctggatga tgggattccc 1620661 aagacacccg aatgggccgc cgcgctgtcc gggctcgccg ccggcgatct gcgagatctg 1620721 gcccgccgga tggccgagca ccggactctg atcaccacca gtctgtcgtt acagcggata 1620781 gagcacggcg agcagaccgt gtggatggcc gcgaccctag cggcgatgct gggccagatc 1620841 gggcttcccg gagggggttt cggtcacggc tacagcagca acggcgtcgg caacccgccg 1620901 ttggcgtgcg gcctgccggc attgccgcaa ggcaacaatc cggtgtcgac gttcattccg 1620961 gtggcggcga tcagtgagct gctgcagcgg cccggccagc ggctggccta caacggccga 1621021 ttgctggagc tgcccgacat caagtgcgtc tactgggccg gtggaaatcc gttccaccac 1621081 caccagaacc tgccgcggct gcgtcgtgca ctgtctcggg tagacacgat cgtggtacac 1621141 gaacagtatt ggaccgcgat ggccaaacac gccgacattg tggtgccaac caccaccagt 1621201 ttcgagcgcg acgacttcgc cgccagcaag accaatccca ccttgatcgc aatgcctgcg 1621261 atggtgccgc cgtatgccaa cgcccgcgac gactaccaca cgttctccgc gttggcccac 1621321 cggctggggt tcggcaagca attcaccgag ggccgcagcg cgcgcgagtg gctcgagcac 1621381 atgtacgaca agtggtcggc cgagctggat ttcccggtgc cgtcattcgc cgaattctgg 1621441 cggaccggcc ggctggaact accgaccaga accggtttga cgtggcttgc cgatttccgg 1621501 gccgacccgg cggcccatcc gttggggaca cccagcgggc ggatcgagat cttctcggac 1621561 acggtcgacg cgtttgcctt gccggactgt gccgggcacc ccacctggta tgaaccgtcc 1621621 gaatggctag gcgggccgcg ggccgcgcgc tacccgctgc atctgatcgc caaccagccg 1621681 cggacccgac tgcacagcca gctcgatcac ggcggcgcca gcatggcatc gaaaatccgt 1621741 ggacgagaac cgatccggat tcacccggat gacgccgcgg cccgtgagct tactgacggc 1621801 gacatcgtgc gcgtgttcaa cgaccgcggc gcctgcctgg cgggtgtggt gatcgacgac 1621861 gggctacggc ccaaggtggt gcaactgtcc accggtgcgt ggttcgatcc cgccgatccg 1621921 cgcgacccgg actcgatgtg tgtgcacggc aatcccaatg cgctgagcaa cgattccggc 1621981 acgtcgtcac tggcccacgg cagcaccggc cagcatgtct tggtccagat cgagaggttc 1622041 actggcgaac tgccgccggt gcgcgcccac gagccaccgc ggctggctta gcgccggacg 1622101 tcgacttgtt gggcgcgaaa cgccgcaatg gaccgaacga ctcgacgtaa gtgtgccctg 1622161 ctggtgtcgg ctcgagtcgc agcacgggtg agcaccacgt gcgccactag ccctgagcga 1622221 agtgtcgctg caaccgccgg tgccgatgac cgaagagcgc gcgcaaccct gccgcgatga 1622281 gcggcgcggc aaacctgagt ccggcacgcg tctggaacgt gatgcggtcg cggacaatcg 1622341 ttttcgtgtc accctcgggc gtcacggtgc gttcgtgctg ccattgccgc atgctcagca 1622401 tcgtcgaatc ctcgcgaaac cgccgtcccg gctcgagctc ggcgatgctg agccggtcat 1622461 agtcgaatgg caacacaccg aacagtcgca gccaggcacg tccgatcggc gcgccgatcg 1622521 gcaccgtgtc gacggtcatc cctttcgcgc cgcgaggcac cgacatcgtc atccaggggc 1622581 gcaactcatc gttgatgccc tccggggtga cgacccgttg ccacacctgc tcggcaggtg 1622641 cggcgacgac gctttgccgt tcaatgagca ccggttcagc gtatccgacc acgcggcgcg 1622701 gtggggctac gtctccctcg cctcggtggc tgcctaaagg ccgttccgtc ccgggttgag 1622761 ttctgcgatg cagaggtggc agatcgtcaa tgcgggcgag aatttgttcc ggcctctgtt 1622821 gatgcgggtg acatcggaag gtgtgggtaa agggatcagc ccgagatcat gcaatcactg 1622881 tcctgacaac cagattcagc acggcctggt aatcgacagg attctgggac tatcagactc 1622941 cagcatcacg gttctcaccc gggcccaggt cgaggcgatg gtcgcggcgc tgccgcgaag 1623001 ctactgattc cgcgcagctg ctctgtcagg gccgctgact tttctctcgg tcatcgtggt 1623061 cgcaggcgcc gcactcggtg tcttcgggtg gggaagcgcg acctcgaagg ccactgaaac 1623121 gccttacgga gacgcgacga accaaatgcc gacgaatacg gcgaggccgg tggctaccgg 1623181 gagcctgcca cagaggatcg cccaacctgc ccagatcgtt gcctggccga ggaacatcgg 1623241 gttccgcgag aacgcgtagg gacctccagc tcctcgatga cccgcctcag ttcgtcggct 1623301 cgtgcacggt ccggtttcgg agccggtcca acacgccgcg aaccgcgtgc tcggtgaccg 1623361 acagcggtga catcaccgtt tcgccgaggc tcacgatgta gtcgatccga tcgacgatgg 1623421 cttccaccgg ctcgaccaac acgatgagcc gtttcgcgag atcgtccagg ctgtgcaggg 1623481 taccttcgag atggtccaga ccgtcctcca agcgctccac ggtgctgttc agctgtgaca 1623541 gcgagctgtt cagctcggcc atggtcttac ccagaccgtc caggacgtct tcgacctgct 1623601 ccaccgtctt gtcggcgttc aatgcggcct gggtgagggt tttcattcgc cgtcgcacgg 1623661 gcgcggggcg gccgcttctg tctgccatga cggtcattat gaccctgacg cggttaactc 1623721 ggaagcttgg cggcggcgtc gcggtccagc agccagagcg tgttctgacg cccgacggcc 1623781 ccggccgccg gtaccgaaac cggatcggcg ccgccgatgg ccgcggccac ggcgtcggcc 1623841 ttacccggcc cggaaaccag cagccacacc tcgcgggaac gctgaatcgc cggcagggtc 1623901 aaggtgattc ggcgtggcgg cggtttcggc gagtcgtcga ccgccaccac catgcgggtg 1623961 ctctcgagga cggcggggct gtgcgggaac agcgagttaa tgtggccctc gggccccatg 1624021 cccagcaggt ggacgtcgaa attcggcgcc gggtcacctg gtgcggcact ggcggccagc 1624081 acctgttcgt aggccagggc cgcggcgtcc agatcgccgc cgaagtcacc atcactggcg 1624141 gccatcgggt gcacctggtt cgatggaatg tcgacgtgat tgagcaacgc ccgccgggcc 1624201 tgcttgagat tgcgctcgtc atcgtcttcg ggaacgtagc gttcgtcgcc ccagaacagg 1624261 tgcaccttgg accattcaat ctgctgtgct tgggcgctga ggtagcgcag aagcgcaatc 1624321 ccgttgccgc ccccggtcag cacgatcagc gcctgccctc tggccgccac cgcggccccg 1624381 atggcgccaa ccaagcgctt acccgcggcc gcgaccagaa tgtcgctatc ggggaagatc 1624441 tcgatgctac tgctcaccgg tactgcacct tcttgattcc ctcgagcgcg gcgcagtaga 1624501 tttcgtcggg gtccagccgg cgcaggtctt cggctaggca ctcaccggtt accctgcgcg 1624561 ccaaaggaac cagagcgtcg ggcttgcccg tccgggtcag ggtggccgtg attccctcct 1624621 ggggacggct tagcacgatg gtctcgctgt tgcgcaccag ctcgactttg agttcgccga 1624681 ccgcccgtcg caccggacct tcgatccggc tggctagcca gccggctagg acgtcgagcg 1624741 ccggttcggt cttcaagccg gacaccagcg ccgactcgat cggctcgtgt cgcggctggt 1624801 cgacggccga cgtgagcagc gcacgccaat aggtgatgcg gctccaggcc agatcggtgt 1624861 cgccggcgcc gtagccggct agccggctct tgatggccga cagcgggtcg attgcgttgg 1624921 tggcgtcggt gatgcgccga attgctaact tgcccaacgc atcctgtgct ggcaccgccg 1624981 gtgcgatgtc gggccaccac gccaccaccg ggatgtcggg cagcaggaag gggataacga 1625041 cgctgtcggc gtggccggcc agtggcccgg acagccgcag caccacaaac tcgccggcgc 1625101 cggcgtcagc gccgacccgc agttgtgcgt ccagccgcgg tctgtcggcg tacggatcgc 1625161 cccgcatcgt tacgatgatg cggctgggat gctcatggct ggcgtcgttg gccgcctcga 1625221 tggactcttc cagcatggct tcgctgtccg gcgcaatgat gagcgtgagt acccggccca 1625281 tcgcgacggc gccgatcttt tcgcgcagct cgtcgagctt cttgttgacc gcggtggtgg 1625341 tggtgtcggg caagtcgaca atcatctgcg ccgctcctcc tcatcgcttc gctctgcatc 1625401 gtcgccggcg cggatcacta tggccgccgc cattcccggc cggtgcggcg cagcatctcc 1625461 aaggatgatt ccggacccca ggtacctgcc tcgtaggcgt cgggcgtccc gtgtgccgcc 1625521 caatgttcca acgctggatc gaggatctcc cacgccagtt cgacctccgc gttgaccgga 1625581 aacagcgagg gctcgccgag caggacgtcg aggatgagcc gctcgtaggc ctccggtgaa 1625641 tcttcggcga atgccgagcc gtaggagaag tccatgttga cgtcgcggac ttccatggcg 1625701 gtgcccggca ccttggagcc gaaccgcaat gtgacacctt cgtcgggctg cacgcggatg 1625761 accatcgcgt tggtgcccag ctcgtcggtc atggtggcgt cgaacggcag atgcggcgcc 1625821 cgcctgaaga ccagagcgat ctcggtcacc cggcggccca atcgttttcc cgttcgcaga 1625881 tagaacggca cgccggccca ccggcgcgta tcgacttcca gggtgatagc ggcgaaggtt 1625941 tcggtggtgg agtcctcggc gaacccctcc tcgtcgagca gcccaaccac cttctccccg 1626001 ccttgccagc cggcggcgta ctggccgcgg ctggtggtct ggtcgagtgg ctcggcaagg 1626061 cgggtggccg agagcacctt gatcttctcg gcctgcaacg ctgccgggtg gaagctgacc 1626121 ggctcctcca tcgcggtcag cgccagcagc tgcatgagat ggttctggat gacatcgcgg 1626181 gccgcgccga tgccgtcgta atagcccgcg cgcccgccca ggccgatgtc ttcggccatg 1626241 gtgatctgta cgtggtcgac gtagtgcgca ttccagatcg ggtcgaacag ctggttggcg 1626301 aaccgcagcg ccaggatgtt ctgcaccgtc tctttgccca ggtagtggtc gatgcggaag 1626361 accgcttcct ccgggaagac cgcgttgacc gccttgttca gctcgcgtgc gctggccagg 1626421 tcgtggccga acggcttctc tatcacgact cggctccacc ggtcgccttg cgggcgggcc 1626481 aggccggact tgtgcagctg ctcacacacc accgggaagg atttgggcgg gatcgccagg 1626541 tagaaggcgt ggttgccgcc ggtgccgcgc tcggcgtcga gcttctccag cgtctcggcg 1626601 agttgggcga acgcgtcgtc gtcgtcgaaa gtgcctggca caaaacggaa tccctcggcc 1626661 agccggtccc agttctgttg ccgaaacggt gttcggcagt gctcttggac ggcgttgtac 1626721 accacttgac cgaaatcctg ggtgctccag tctcggcggg caaaccccac cagcgagaat 1626781 gtgggcggca gcaggccgcg gttggccaaa tcgtagacgg ccggcatcac cttcttgcgg 1626841 gccaggtcgc cggtgacgcc gaaaatcacc atgccgcacg ggccggcgat tctgggtaat 1626901 cgcttgtccc gcttgtctcg tagcgggttg cgccacgacg ccgcggcgtg ggccggtttc 1626961 attgggcagc ggtgtcgaga tgcgcccggg tttcctggag tagctcgttc caggaggcct 1627021 cgaacttccg cacgccttcc tcctcgagga cggcaaacac gtcggtgagg tcgatgccga 1627081 tcgcccccag ctggtcgaac accgcctggg catcggatgc agttccggtg accgtgtcgc 1627141 cttggatcac gccatgatca gcgacggcgt caattgtctt ttccggcata gtgttcacgg 1627201 tgtgtggggc gaccaactcg gtgacgtaga gggtgtccga gtaatcgggg ttcttcacgc 1627261 cggtggaagc ccacaacggg cgctggaccc gggcgccgtc gaccttgagg gaccgataac 1627321 gatcgctgtc ttcgaagacc tcccggtagg tggcataggc caggcgggca ttggcgacac 1627381 cggcctggcc gcgcagttcg agcgcttgcc gcgagccgat tctgtccagc cgcttgtcga 1627441 tttcggtgtc cacccgggag acgaaaaacg atgccaccga atggatcttg gacaggctgt 1627501 gtccggcttg ccgggccttt tccatcccgg tcaggtaggc gtccatcacc tcgcggtacc 1627561 gctgcacgga gaagatcagc gtaacgttga ccgaaatccc ttccgccaga acggcactga 1627621 tggcgggcag accggcctta gtggccggga tcttgatgaa aaggttcggc cggtcgacga 1627681 tcttccacag ctcgattgcc tgttggatcg ttttttcggt ttcgtgtgcc agccgcgggt 1627741 cgacctcgat cgacacccgg ccgtcgaccc cgtcggagtc ctcccactgg gggaccagca 1627801 cgtcgcacgc gctgcgcacg tcgtcagtgg tgacggtgcg gatggtggca tccacgtcgg 1627861 cgccgcgcgc ggccagctcg gcgatctggg cgtcgtaggt gtggccctcc gacagcgcct 1627921 tctgaaagat cgacgggttg gtggtcaccc cgacgacgct cttggtgtcg atcagctcct 1627981 gcagattgcc cgagcgcagc cggtcccgcg acaggtcatc cagccacacc gatacccccg 1628041 cggcgctcaa tgcggccagg ttggggttct gagcggtcat cggtaatcac ccttcctcag 1628101 ttatccagcg ctcgttcggc ggcggcggcc acggcctcgg cagtgaagcc gtactcgcgg 1628161 aacaaggtct tgtggtccgc ggattcgccg tagtgctcga tcgagacgat ctcgcccgtg 1628221 tcgccaacca gctggtgcca gcattgcgcg acgccggctt cgacggccac ccgcgccgac 1628281 accgtcgggg gcagcaccgc gtcgcggtac tcgtagggtt gggcctcgaa ccactccagg 1628341 cacggcatcg acaccacccg agcgaggatg tcgttgtccg ccagcaacgt ctgcgccgcg 1628401 accgccagct gcacctccga gccggtggcg atgagaatga cgtcgggttc ctcgcccggt 1628461 tgcagaccac cggcgtcact cagcacgtaa ccgccgcggg caaccccctc ggcgtcggtg 1628521 ccgtccagca ccggcacacc ctggcgggtc aggatcaacc cgaccggccc gctgccgttg 1628581 cggcgggcca ggatcgtgcg ccaggcgtag gctgtctcgt tggcatctgc cgggcgcacc 1628641 accgacagcc gggggatcgc gcgcagcgcc gagaggtgct cgatcggttg atgggtgggc 1628701 ccgtcttcgc cgaggccgat cgagtcgtgc gtccagacgt agatggtgtc gatgtccatc 1628761 aacgccgcca gccgcaccgc cgggcgcatg tagtcggaga actgcaggaa ggtgccgccg 1628821 taagcccggg tgggtccgtg cagcacgatg ccggacagga tggcacccat cgcgtgctcg 1628881 cgaacaccga agtgcaaggt gcgaccatac cagtgcgcgg tgtactcctt ggtggaaatc 1628941 gagggcgggc caaaggagtc ggcgcccttt atcgttgtgt tgttgctgcc cgccaggtcg 1629001 gccgaaccgc cccacaactc gggcagtttc ggcccgagcg cggacagcac cgcacccgag 1629061 gccgcacggg tggccagcgc cttggacccc ggttcccagt ggggcaagtc ggcgtcccag 1629121 ccgtcgggca acttctgcgc gagcagccgg tccagcagcg ccttgcgctc gggttcacgc 1629181 cgcgcccagg catcgaattc gagctgccag cgttcgtggg cctgtttgcc gcgggccacc 1629241 agccctcggg tgtgggtgag gacgtcctcg cggacctgga acgtcttgtc cggatcgaag 1629301 ccgacgatct tcttgactgc ggccacctcg tcgtcgccca gcgccgcgcc gtgcgccttg 1629361 ccggtgtcca tcaggttcgg cgccggatag ccgatgacgg tgcgcagcgc gatgaacgag 1629421 ggccggtcgg tgaccgcctg cgcattggcg atggcctcct cgatgccgac gacgttctca 1629481 ccgccctcaa cctcttgcac gtgccagccg tacgcgcggt agcgggccgc ggtgtcctca 1629541 cacagcgcga tgttggtgtc gtcctcgatc gagatctggt tgcggtcgta gaacacgatg 1629601 aggttgccca gttgctggac cgcggccagc gacgacgcct ccgaggtcac cccttcttcg 1629661 atgtcaccgt cggaggcgat gacatagatg tagtggtcga aggggctggc gcccggttcg 1629721 gcgtccgggt cgaacaggcc gcgctcgtag cgcgaggcca tcgccatccc gaccgccgac 1629781 gccagtccct gccccagcgg gccggtggtg atctcaacgc cgggggtgtg gcggaactcc 1629841 gggtgtccgg gggtcttgga tccccaggtg cgcaacgact caatgtcgga cagttccagg 1629901 ccgaagccgc cgaggtagag ctggatgtag agggtcaggc tgctgtgccc ggccgacaaa 1629961 acgaaccgat cgcggcccag ccagtgtgtg tcgctgggat cgtgacgcat tgtccgctga 1630021 aacagcgtgt aggccaacgg agccaggctc atcgccgttc caggatgacc gttgccgacc 1630081 ttttggacgg catcggcggc caatacccgg atggtgtcga cggcagccga atcgatctcg 1630141 gtccagtagt cgggatggcg cggtcgggta agcgcggaga tctcttcgag tgtggtcaca 1630201 aattcagtcc tcgagtcagc aagatgatca gtcctcaccc tagtgcggga atcccggcgc 1630261 ttgcagtgcc gcatatccgg gtacccatcc gggccctgtg aaacgtaacc cgcgcgctac 1630321 ccacgcttcg cattcggtgc cgatatgccg aaaaatcacc gtcatcgacc ctgcggctct 1630381 gctgctgggg ctacgtcgaa caccgtacgt cgcagaagtg tggtgcgggt cgggcggccg 1630441 gcttaatcgc ggtgataatc ggttggtcgg cgatcaccgg catcatcggt tggccggcgc 1630501 tggtgatgct gttcgccggg cctcgcgtcg gcgagccggg caagccggtg cgcctgccga 1630561 tcccatggcg ggatgttggt gggtaccgcc cgaccggaag aagcatcgcg gcatgccggc 1630621 gtggcgagcc tcggggtcta cacgaattcg ccgccgccga gcccgccgaa gccaccgccg 1630681 ccaccggcgc cgccggcggt acctgtggcg atggaccccg ggctaccgag gccgccgaga 1630741 ccgccgagac caaggaggat gctgaagccg ccgccaccgc cctgcccccc gtggccaccg 1630801 gtcccaccgg tgcctgttcc aaagggcccc gcgtcgccgg tgccgccggt gcccccggag 1630861 ccacccatcc cgccccggcc accgacgccg gcaaaaccat tgccgccaaa gccgcccgca 1630921 cctccgttgc cacccatccc aggctgagag ccgttgtggc cggtgccgcc ggtgccgcca 1630981 gcgccgccgg tgccgccggt gttaccgttg ccgccgttgc cgccagtgcc gcctctgccg 1631041 ccggtgaggc cgccgttggc accctggccg ccggtgccgc cggtgccgcc ggtgccgccg 1631101 gtgccccagt cgccgggggt gccacctggg ccgctggaac cgccaagtcc tgcatcgcct 1631161 ccgcgtcctg catcgcctcc gcggcccccg ccgccgccgt caccaggtga ggtgacaagg 1631221 tcgccactgg cgccgttgcc accgttgccg ccgttgccgg gtgtcccgcc ggtcccaccg 1631281 ttgccgccgg ctccggtgag gccttggccg ccgttgccgc ctctgccgcc gttgccgcct 1631341 ctgccgccgt caccgccatc gccctcgttg gtgccgagga cgcccttggc gccggtgctg 1631401 ccggcgccgc cagtcccgcc gatgccaccg ttgccgccgt tggcgccggt gccgccgtta 1631461 ccgccgttac ccccgtggcc gccggggccg ccgtttccgc cgctggcagc gccgtggccg 1631521 ccgtgaccgc cgttgccgcc gtcgtgcagg atgctgccgg ccggccccgc cttgcctgcg 1631581 gtggagccgg tgccgccggg gccgccggca ccggcgttgc cggcgttgcc gccgtcgccg 1631641 cctcgcccgc cgccgccgcc ggcgaaggcc cctgctccct ggccgttgcc gccgttggcc 1631701 ccgtcaccgg gagcaccgcc gtcgccgccg gccccaccgg caccgcccgc gccgtcgctg 1631761 actacgcctt gaccgccgtt gccgccggcc ccgccgttgc cgccggcgcc gccgtgcccg 1631821 ccggcaccgc cgggttgtcc gggcgcaccc acggccacgc cgttggcacc ggcggcgccg 1631881 ttgccgccga atccgccgag gccgccgttg ccgccggcgc caccgttacc gccgttcagg 1631941 ccggccccgc cggccccgcc ggcgccaccg ttgccgccgg ggttaccgtt tggcccgttt 1632001 tcaccagggt tggtggcgtt ggcactcatg ccaccaaacg cgccgtcgcc gccgcggccg 1632061 ccgttgccgc ccgtgccggc gctgccgccg ttgccgccat tgccgccgtc gccgccgttg 1632121 ccgccgacca cttgggagtt gccgccgttg ccgccgtcgc cgccgtcgcc gccgctggtt 1632181 ggagtgaagc cgtgggcgcc cttggcgcct ggggtagagc cggcgccacc gctaccgccc 1632241 tgcccgccgg cgccggggtt accgccgtta ccgccgtgac cgccgttacc atcgccgaag 1632301 gcgaagttgc cgttggcgcc gttgccgccg tcaccggcga gcccgccggc cccccctttg 1632361 ccgccggacc cgccgacacc ctggattccg ttctggccaa agaggttccc cgccaaaccg 1632421 ccgggcccgc cttggccgcc gttaccgcct tgcgcgccgg gcccgccgtg gccgccgtcg 1632481 ccgcccttgg cgcccggcgt ggtggcgttg gcgccgttgg cgccgttgcc gccggcccca 1632541 ccggtcccgc cgtcgccccc gaagtctccg ccccggccgc cggccccgcc cgccccgcca 1632601 gccccgccgt tctggccgct cgtgccggat tcgcccgcgg tggtgggcga ggaaccggcg 1632661 acaccggcca tgccgtcccc gcctttgccg ccggccccgc cattaccaac aagcccgccg 1632721 ttgccgccct tgccgccggc cccgccggcc ccgccggcga cggtggcgtt cgcgccgttg 1632781 ccgccggtgc cgccgttgcc gccgctggtc ggggtggcgc cgcgggcacc gtctgcaccc 1632841 gcggtggatc cggcgccgcc gatcccacca gcaccaccga tgccgcggct accgccgttg 1632901 ccgccgttgc caccaactcc atcgccgccg ttatcgaacg tgcccttggc accgttgccg 1632961 ccatcaccgc ccatgccgcc ggcgccgccg tttccgccgg ccccgccggc acccatgctg 1633021 ccgtcctggt gggtggctgc aagcgcctta ccgccttgcc caccggctcc accgccaccg 1633081 ccggctccac cgttgccgcc cttgccgccg tcggtgccat ccgcgcctgc ccccaggccg 1633141 ttaaggccgg tggcgccggt ggcgccgttg ccgccgttgc cgcccttacc gccggcgccg 1633201 ccagcaccgc cgtcgcctgc ttgggctccg ccgtcgccgc ccttaccgcc agcgccgcca 1633261 gctccgccgc caccgccgtt agggtcgccg ccagaaggcg gggcaccggg ggcgccgttg 1633321 ccgccggcac ctccggcgcc gccattgccg accagcccgc cggccccgcc ggccccgccg 1633381 ttaccgccgg ctttgccgcc cgatgagaag tgggcgccgt tgccgccggc cccgccgttg 1633441 ccgccgctgg tggggctggc cccggccgcg ccgtgggcac cgatcgtgga gccggctccg 1633501 ccggtgcctc cggccccgcc ggcgccgggg tcaccgccgt tatccccagc gacaatcaag 1633561 gcacgagaaa atccggcccc gccggccccg ccggtcccgc caaccccacc ggccccgccg 1633621 gccccaccgg cgccggccag ccagccgccc cgccctccgg tgccgccatc gccggcgtcg 1633681 ccgccgaccc caccggacgt accgtgcggg gacaagtcct caccggctgc gccggccaca 1633741 ccctccgcgc cgtgtccgcc ggcaccaccg tgcccgccca cgcccaacag cccggccgca 1633801 cccccgacac cgccgtgtcc acccacacca ccgatcgggc cgggcccgcc ggcacctccg 1633861 tgcccgccgg ccccgtagag ggtcccgccc aggccaccgg caccaccggt accgccgacc 1633921 ccgccgggcc cgccgggccc gccgggcccg ccggttccgc cgaccccgaa cagtccggcg 1633981 ttgccgccgg ccccgccggt tgccccgccc agcaggctct gcccgccggc cccgccgact 1634041 ccaccattgc ccagcagcca gccgccgcta cccccggccc caccggcggc gccggcccca 1634101 ccggccccac cggccccgcc ggtgccgaac aacccggcgg ccccgccggc cccgccgact 1634161 tggccgggcg cgcccgagcc gccggcccca ccgttgcccc acaagatccc gccggccccg 1634221 ccggcctgcc cggtgccggg tgctccagcc gccccatcac cgatcaacgg gcgacccagc 1634281 aacgcctggg tgggcgcatt gagggcattg agcacgttgt gctccagcgt cgccaacggt 1634341 gcggcgttgg tcgcctccgc gctgacatac gagccgaccg cggcgcttaa cgtctgcgca 1634401 aatcggtcat gaaacgccgc cacctgcgtg ctgatcgcct gatactcccg agcatggctg 1634461 ccaaacagcg tcgcgatcgc cgccgacacc tcatcggcgc ccgcggccag cacgctggtg 1634521 gtcgaccccg ccgccgccgc attggccgcg ccgatcgatg acccgatgcg cgccacatct 1634581 aaggctgcgg ccgccaccgt ctccggggcc acgatcacca acgacatcac agacctcccg 1634641 ccacgcccct gccccttcgg caggtcacac tcctgccaga taagggtcgc gccgccacct 1634701 tgtccgattc caggtcaaaa tccccataac cagcacgaat ctgctgtgca cagtgcacat 1634761 tcgccctact atcggctcgt ggcattgcgg ctagcaacgg ttggtcttcg ggcccaatcc 1634821 ttagggcgtc acactgatca atcccagata gcgattttca tcgggctggt gtgaaaattg 1634881 tcctgaccgc ggttcgggct ggcgagcggt gccgatatgc cggcgaagtc gtgtgaatcg 1634941 accctgcggc tctgctgcca cagttacccg gtctaccatc gtgcgtagta gaagctgcgc 1635001 gcggctgcga ttcccgagga gttagtgcgt gaacgttcgc gggcgcgtcg cgccgcgccg 1635061 agtgactggt agggcaatga gcaccctgct ggcctacctg gcgttaacca agccgcgagt 1635121 catcgagctg ctgttggtca ccgcgatacc ggcgatgctg ctggccgacc gcggcgccat 1635181 tcatccgctg ctcatgctca acacgctcgt cggcgggatg atggccgccg ccggcgccaa 1635241 cacgctcaac tgcgtcgccg acgccgatat cgacaaggtg atgaagcgaa ccgcgcgccg 1635301 gcccttggcg cgggaagcgg tgccgacccg aaacgcgttg gcactcgggt tgacgttgac 1635361 ggtgatctcg ttcttctggc tatggtgcgc cacgaacctg ctggcggggg tgctggccct 1635421 ggtcaccgtc gcgttttatg tgttcgtcta cacgctttgg ctcaagcgac gcacgtcaca 1635481 gaacgtggtg tggggtgggg cggccggctg tatgccggtg atgatcggct ggtcggccat 1635541 caccggcacc atagcctggc cggcgctggc gatgttcgcg atcatcttct tctggacgcc 1635601 gccacacacc tgggcattgg cgatgcgcta caagcaggac taccaagtgg ccggggtgcc 1635661 gatgctgccg gcggtggcga ccgagcgtca ggtcaccaag cagatcttga tctacacctg 1635721 gctgaccgtg gccgcgacgc tggtgctggc gttggcgacc agttggcttt acggcgcggt 1635781 ggccctggtg gccggtgggt ggttcctgac gatggcccac cagttgtatg ccggggtgcg 1635841 cgccggcgag ccggtcaggc cgctgcggct gtttctgcag tcgaacaact atctggcggt 1635901 ggtgttctgc gcactggccg tcgactcggt gatcgcgctg cccacgctgc actgattggg 1635961 ggcccagttc cgctgcggtg ccggccctgc tcggccaacg tagtcagatg gttggatcgc 1636021 caccggcgcc accggcgccg cccgcgccac cagcaccgcc gctgccatct gggtccgtcg 1636081 agtcgccgag gacgccggcg ccgccattgt cgccaaatac cgtgagacct agcagggtgc 1636141 cggcgccgcc cttgccgccg gccccgccgt ttccgccgcc gccatcgccg atgatgtttt 1636201 ccccgccctt gccgccagcc ccagcgttcc cgccggctcc gccactggcg ccggtgccgc 1636261 cgggtgcaac ggcgttggcg ccgttaccgc cgttgccgcc tttgcccccg gtgtctgcaa 1636321 agtcgggggt cgcaccctgc gcggcgcggg tcacgccgtc accgctgagc cccccgagcc 1636381 cgccagcgcc gctgaagcca ggattgccgc cgttgccgcc atggccgccg ttggcaccgg 1636441 gtgcgacggc gttgccgccg gtcccgccga ccccaccgtt gccgccttta ccaccgtcct 1636501 ggccacgctc gcccgcggtg gtggcattgg caccctcggc accactacca ccgagcccgc 1636561 cgtctgcgcc gcggccgcca gtcccaccgg ccccgccatt gccggcgaga gttccgccgt 1636621 cgccgccggc gccgccctgg ccgccgttgc cgccgctatt gcctttgcca ccgactgcgc 1636681 ccgaatcgct cgcgttcgtc cctgcggcgc cgttggcgcc gttgccgccg gcgccgccgt 1636741 tgccgaccag cccgccatgg ccgccgggtc cgccgttggc gccgttggtg cccgcggtgg 1636801 tggcgttggc gccgttgccg ccggccccgc cgttgccgcc gctggtgggg gtggcgccga 1636861 tggcgccctg agcgccggtg atggagccgg ctccgccggt gcctccggcc ccgccggcgc 1636921 cggggtcacc gccatggccg ccggccccgc cggcacctgc gttgaaggcc tggttgccgg 1636981 ggccgccggc tccgcggtca ccgccgacgc caccagcgcc gccggtcccg ccggccccgc 1637041 cggcgccttg gccgcccagc aggctgatca ggccgccggc cccgccgggg ccgccagccc 1637101 cgccagcccc gcccatcccg ccgttaccac catcaccgcc gttatcccca gcgacaatca 1637161 aggcacgaga aaatccggcc ccgccggccc cgccggtccc gccaacccca ccggccccgc 1637221 cggccccacc ggcgccggcc agccagccgc cccgccctcc ggtgccgcca tcgccggcgt 1637281 cgccgccgac cccaccggac gtaccgtgcg gggacaagtc ctcaccggct gcgccggcca 1637341 caccctccgc gccgtgtccg ccggcaccac cgtgcccgcc cacgcccaac agcccggccg 1637401 cacccccgac accgccgtgt ccacccacac caccgatcgg gccgggcccg ccggcacctc 1637461 cgtgcccgcc ggccccgtag agggtcccgc ccaggccacc ggcaccaccg gtaccgccga 1637521 ccccgccggg cccgccgggc ccgccgggcc cgccggttcc gccgaccccg aacagtccgg 1637581 cgttgccgcc ggccccgccg gttgccccgc ccagcaggct ctgcccgccg gccccgccga 1637641 ctccaccatt gcccagcagc cagccgccgc tacccccggc cccaccggcg gcgccggccc 1637701 caccggcccc accggccccg ccggtgccga acaacccggc ggccccgccg gccccgccga 1637761 cttggccggg cgcgcccgag ccgccggccc caccgttgcc ccacaagatc ccgccggccc 1637821 cgccggcctg cccggtgccg ggtgctccag ccgccccatc accgatcaac gggcgaccca 1637881 gcaacgcctg ggtgggcgca ttgagggcat tgagcacgtt gtgctccagc gtcgccaacg 1637941 gtgcggcgtt ggtcgcctcc gcgctgacat acgagccgac cgcggcgctt aacgtctgcg 1638001 caaatcggtc atgaaacgct gccacctgcg tgctgatcgc ctgatactcc cgagcatggc 1638061 tgccaaacag cgtcgcgatc gccgccgaca cctcatcggc gcccgcggcc agcacgctgg 1638121 tggttgaccc cgccgccgcg ctgttggcta caccgatcga tgacccgatg cgcgccacat 1638181 ccgaggccgc cgcggccacc gtctccgggg ttacgatcac caacgacatc acagtccacc 1638241 cgccacgccc ctgccccttc ggcaggtcac actcctgcca gataagggtc gcgccgccac 1638301 cttgtccgat tccaggtcaa aatccccata accagcacga atctgctgtg cacagtgcac 1638361 attcgcccta ctatcggctc gtggcattgc gggaaacctc accgcgaata catgagctga 1638421 tccgcgaggc agcgcgaatc gccctcaacc cgacccagga atggctcgac gaattcgacc 1638481 gtgccattct ggccgccaac ccatccatcg ctgccgaccc cgccctggcc accgttgtca 1638541 agcgttccaa tcgggcgcat ctcatccatt tcgcggccgc caacctgcgc aatcccggcg 1638601 ccccggtgcc cgcgaacctt ggtcccgagc cgctgcgcat ggcccgtgat ctcgtgcgcg 1638661 tcggtttaga tgccttggcc ctcgacatct accgcatcgg acaaaacgtg gcctggcggc 1638721 gctggacgga catcgcgttc ggactgacct ccgaccccga cgagttgcac gaattactgg 1638781 atgtgccatt tcggacagcc aacgagttcg tcgacaccac ccttgcgggc atcaccaccg 1638841 agatgcaatt ggaacgcgac aagctcaccc gcgacgttcc tgccgaacgc cgcaaaatcg 1638901 tccagctgct catcgacggt gcccccatca gccgtgagca cgccgaagcg cgattgggct 1638961 accctctcga ccgatcccac accgccgccg tcatctgggg tgaccaggcc cagggcgacc 1639021 acagccacct ggaccgagtc gccgacgcgt tcggccatgc cggcggatgc ccgcacccgc 1639081 tggtcgtggt agccggcgcc gcgactcgct gggtgtgggt aaaagacgcc cccgggtttg 1639141 acatcgacct gattcacgag gtgctccatg acatacccga cgcgcgtatc gccatcgggg 1639201 ccaccgcgcc gggaatcgag gggttccggc gcagccaccg agacgcactc accaccgctc 1639261 ggatgattat ccggctggaa tcaccgcacc gagtcgcctt tttcaccgac gtcgagatgg 1639321 tcgcgttgct caccgaaaac gccgagggtg ccgacgactt catccaacgc accctcggaa 1639381 acctcgagtc ggccagcccg gctctgaaaa cgacgctatt gaccttcatc aaccagcagt 1639441 gcaacgcttc tcgggccgcg agacttctct tcacccaccg caacaccttg atgaaccgac 1639501 tcgagaccgc gcaacgactt ctgccccgcc ctctcgccga caccaccatt cacgtcgccg 1639561 tcgcactcga agcccagcag tggcgggaga agccaaccag cgatcctccg gcaaagaaag 1639621 agtcgaatgg caccaagatg cgttagcaag acagcgcagc acagaccgct acgctacggc 1639681 agcagcacga ccgagccgac cgtcttgcga gcctccaggt cctgatgggc gcgcaaggcg 1639741 tcggccagcg ggtaacgtcc gccgaccgcc acggtgatcg cttcgctgcc gatcgcgtcg 1639801 aacagctcag cggcccgcca gctgaactcc tcgccggtgc gggtgaagtg gaacagcgag 1639861 ggacgggtga ggtacaccga tccggcggca ttgaggcgct gcggatcgac cggtggaacc 1639921 ggaccgctgg cggcgccgaa cagtgctaat gtcccgcgga cagccaggct ggctaggctg 1639981 gcgtcgaagg tggtggcgcc gacaccgtcg taaacggctt gcacaccggt gccgccggtc 1640041 agttcgcgaa cccgcccggc gaactgccag gcatcctccg ggtagtcgag aaccacgtcc 1640101 gcgccggcat ccttggacag cttggccttc tccgccgtcg aaacggtggt gatcacccgc 1640161 acccccaggt gagtggccca ttgtgtcagg atcaagccga cgccgccggc gccagcatgc 1640221 accaagacgg tgtcaccacg cttcaccggg tacaccgact tcagtaggta atgcgccgtc 1640281 aggcccttca gcagcgccga agccgctacc tcagacgtga cgtcgtcggg gaccttggcg 1640341 gtcagagatg ctggcgctgt gcagaattcg gcgtaggcgc cgttggctga ggcgctgacc 1640401 acgcggtcgc cgacgctgat ggcggtgtcg gctgcggtaa cccctgggcc gacggcctcc 1640461 accgtgccgc atacctcgga gccgatgacg aacgggagtt cgcgcggata ttgcccggag 1640521 cggaagtagg tgtcgatgaa gttgacaccg atggcctcgg ccttgatcag gagctcgccg 1640581 tggccgggtt gaggttgcgg ctggtcgacg tggcgtaaga cgcctggccc gccggtttcg 1640641 gtgacttcga ttgcgtgcat gtggctatca tgcccgggca tgaagcttgc ccggccggac 1640701 gtcttccatc cgcgcgtcgt tttggcgggt tggccacagc agcccgccgg tgacggcgac 1640761 gatgctgggc tggttgcggc cctgcgccac cgcggcttgc atgctggttg gctgtcttgg 1640821 gacgatcccg aaatagtcca cgcggatctg gtgattttgc gggctacccg cgattacccc 1640881 gcgcggctcg acgagttttt ggcctggact acccgcgtgg ccaatctgct gaactcgcgg 1640941 ccggtggtgg cctggaatgt cgagcgccgt tacctacgtg acctgatgga tcggggggtg 1641001 ccgaccgtgc ccggcgaggt gtatgtgccg ggagagccgg tccggttgcc acgcaaaggc 1641061 caggtcttcg tcggtccgac catcggtacc gggacacggc gctgtagtgc ccggttcgct 1641121 gccgagttcg tcgcgcaact gcacgcggcc ggccaggcgg tgctcgttca gcccggaggt 1641181 tccggtgacg agaccgtgtt ggtcttcctt ggcggtgagc cgtcgcatgc gtttaccaag 1641241 caggccgaca cttggcgcca gaccgagccc gacttcgaaa tctgggacgt gggtgcggcc 1641301 gccgtggccg gcgcggccgc gcaggtgggt gttgacccag gtgagctgct ctacgcgcgg 1641361 gcccacatca caggtggaag ccgagatccc cggttgctgg aattgcaatt ggtggacccg 1641421 tcgctgggct ggcagtggct ggacccagac atccgcaatc ttgcccagcg tgacttcgcg 1641481 ctatgcgtcc agtcagcgtt ggagcggctg gggctgggcc cgttctccca tcgacgccca 1641541 tagcgcggcg gtggccgccg taaccgccgc ggcaccggcc acgtgaatgg cgaccagggc 1641601 ggcgggtacc ccggtgaagt attgcgtggt accgacggcg gcttgcgtgg caaccagggc 1641661 gagcagcacg gcgagtcgca ccagaatcgc ccgggtggca cccacggcca gcagcccgaa 1641721 acccaacccg atcagcagcg caaggtaggc aaccaacagc gacgaatgca tatgcaccaa 1641781 ggtggtgatt tcgactttca gccgcggcac ggtccggctg gggctgcgat ctcccgcgtg 1641841 cgggcctgcc gccgtgacta gcgtgcccgt caccagcacc gcggccaggt tcagcgcgct 1641901 gagcgccgtg agcgcacgca acgggctgac caccagttcg tggacgactc cgtcatcggg 1641961 ctggccgatc ttgacgtaga gcagcaccgc cagccacacc atcgtcatcg acgccagcag 1642021 gtggatggcc accgtccacc acagcagccc ggtgcgtacg gtgatgccac cgatcatcgc 1642081 ctgcaccacc gtcgacaccg gcatcagcca cgcgtaggcc aggacttccg tgcgccggcg 1642141 cgcccgggtg acgaccagca cggccagtgc cgcggctatc accaccgcaa acgtgaccat 1642201 ccggttgccg aactcgaccg cctgatggac ccgcggcacc tcggcgacca ccaccggggt 1642261 gaagctaccc ggaaaacact gcggccaggt cggacacccc aggcctgagg cggtaacccg 1642321 gacgattgcc ccggtgacgg cgatgccgcc ctgggtgagg atgacgattg cggcgatgac 1642381 ccgctggaca cgcaggctgg gagacaccgc ccgatcgtaa ggcaccaaaa actacacgct 1642441 gtagtacggg cggaccggtg tcgaaactgc aaccacgcac cgatgcgtcg gcgtgtcttg 1642501 tgcgtggttg cagtgtcgcg aagccgggcg gccggttcag gtgaaccgga accagcgcag 1642561 tgcggccagt gcggccagcg cgccccacac cgctaggacg acgatcccga accagtccac 1642621 cgacacggtc atggcctgcg acagcgcctc ggtgagcgcg cccgacgggg taacccgagc 1642681 cacccatttg aacgccgtcg ggatcacgtt cgactccaag gtcagcgcac cgaaaccggc 1642741 gaatacgaac cacatcaggt tggcgacggc gagaacgatc tcggctcgca aggtgccgcc 1642801 gagtagcagg ccgagcgccg caaagcccgc ggtacccagc gcgatgatcc cggcgcccaa 1642861 tgtcagggcc gtcagcgccg gccgccagcc gagcgcaaag ccgatggcgc ccaagatgat 1642921 ggcctgcaag aacaccacgg caaccactgc cagcgacttg ccggcgatga tcccccaaac 1642981 cggcagcggg gtagcaccga gtcgtttgag ggcgccgtag cggcgatcga acgcgaccgc 1643041 gatggcttgc ccggtgaatg cggtggagat caccgcaagc gccatgatga ccggaacaaa 1643101 ggtggcggcg cggttgtggc cgaacgagcc catcggcagc aaagtcagcc cgaccagcag 1643161 ggtgatcggg atgaacatgg tcaacagcag ttgctcgccg ttgcgtaaca gcagcttcaa 1643221 ttccaggctg aactgtgcgg caagcatcag ggggacggcg ttggggcggg ggtccgggct 1643281 gaaggtgccc gcgggaaaag cggggcgatt ggtttgggtc actgccgcaa cttcctgccg 1643341 gtgagatcca ggaacacgtc ttcgaggctg cgttgctcga cccgcatgtc ggtggctagc 1643401 acgtcgattt gtgcgcacca cgcggtgacc gtcgccagca cctgcgggtc aaccggacct 1643461 tcgaccaggt actcgcccgg ggtcagctcg gtggcctggt agccctcggg cagtgccgag 1643521 gccagcagcg acaggtcgag ccgcggcggc gcggtgaacc gcaactggtc tttggcgccg 1643581 ctgcgcatca gttctgccgg tgtgcctgcg gccaccgtca ccccgtggtc gatgatcacc 1643641 aaccgatcgg cgagttcctc ggcctccttg agatgatgcg tggtcagcac cacggtcacg 1643701 ccatcgcggc gcagcgcgtc gatcaactcc cacaccagta cccgggcatg ggcatccatg 1643761 cccgcggtgg gctcgtcgag gaacaccagt tggggacgcc cgaccagcgc gcaggccagc 1643821 gcgagtcgtt gctgctgccc gccggagagc cgtcgatagg tggtgcgggc ggcctcggtg 1643881 agacccaagg tgtccagtag ccagtgcggg tccagcgggt tggcggcgta ggacgcgacc 1643941 agatccagca tttcgccggc gcgtgccgcc gggtagccgc cgccaccctg caacatcacg 1644001 ccgatgcgtg cgcgcaggcg tgcgttgtcg gtgatcgggt ccagtccaag tacctcaatg 1644061 ctgccggcgt ccgggcggac gaagccctcg cacatctcga cggtcgtggt cttgcccgcg 1644121 ccgttggggc ccagcagcgc catcacttcg gcgtcatgca cgtcgagatc gaggttggaa 1644181 acggcggtta ttgacccgta tcgcttacat accccgcgaa gccgcagtac cacctcgggg 1644241 gtgtctgggg cgcggttcac gagcgccgct cctcctcatc gcttcgctct gcatcgtcgt 1644301 cggcgcggtt cacgagcgcc gctcctcctc atcgcttcgc tctgcatcgt cgtcggcgcg 1644361 gctcacgtgg aatcagcgta ggcgtcgggc gctgccgtcg gccggcgggt cgcaggggtc 1644421 ttgctggccg actccgcggc ggtgaccact tgctcggctg caagtggccg ccatggtaac 1644481 cgggtgtagg tcagggcaat caggaggatc acgatgatgg cgctggccgc ggtcgcatcc 1644541 acgatctgaa acagcgcgaa gcggtcacca ttggcggtcg gaccgaaaat tccgacgatg 1644601 agggtggcca ggattgccgc tacccgaaat cccgggcggg ttgcccaggc ggccagcgga 1644661 attatcgccc acagcaggta ccagggctgc acgacgggaa acagcagcac ggtgacagct 1644721 agcgcaacgc ccaggccgcc gatcgggtgc agccggccgc ggagcacggc caataacagc 1644781 cagcacacca tcaccgtgat gatcagcacg ccgatggcgc gggtgagtga caacacggcg 1644841 gtggtgtgat cacccaggcc cagcaggatg ccgacgtgcc cggtgcccag ggccagcagt 1644901 gtcggcggcg acatccagct gcgcaccaca ttggcggtgc ccagcgtgtt gatccagccg 1644961 aatccgagac cgctggccca acccaggatg gccattatcg ccagcgttag actcgccatc 1645021 acagcggcgg cgagcagcag tgctcgcaag ttgccacccc agcggtatgc cagcactgtc 1645081 gtgacgaagc ccatcgccag cagcgagggt agcttcactt gcgacgacag cgtgatcagg 1645141 atggaacccg ccagcagcat ggccaggggc ccccattccg gacggggttt gactgcccgg 1645201 ctcgcgcccg cacgcgggga tgcccccagc tcgggccgcc ggctggcccg tattgtggcg 1645261 gggcccaacc gccaggtttc gggcgacggg cgtggggtat tcgccatatc aaggccgcgc 1645321 agcgcgaatt cgacgccggt cagcatcagc ccgagcatca gcgcttcgtt gtggatgccg 1645381 gcgaccaaat gcatgatcag cagcggattg gccgcgccta gccacagcgc gctgacctcg 1645441 gcgacgccac agcgctgagc tagccgaggg gtcgcccaca cgatcagggt cacaccgatc 1645501 aacaccacaa gccggtggca gagcacggca gcgacgatgt tttccccagt cagcgacgag 1645561 attccgcggc cgatccacaa gaacagcgga ccatatggcg ccggtgtctc ccgccacagg 1645621 ctgggcaccg acagggtgaa cacgtggccg aggcccaagc cggacgccgg acccacccgg 1645681 taagggtcga gtccgtccct gccgatctca ctttgggcta gatatgagta gacatccttg 1645741 ctgtacatcg gtggtgcgat caatagcggc agcatccaga gcagcagggt gcggtccagt 1645801 ttgccgcgcg acatccgccg cctgcccagc gtgaaccggc cgagcatcag ccaggccagc 1645861 gccatcatga ccgccccggt cgtggtcatg gtcaacgaca ccgtttggat tcgtgacggc 1645921 agattgagca gccggacccc gaaggtgggg tcctggacga cgggtcgggc cccggcgccc 1645981 agggcgccga tggccatcag gacggtgccg gtggccccaa acaggcgggt gcgcgccagc 1646041 gcggtgagct cggtagtggt cagcggtgca cccaccgcct gctcgtcgcc atgcaggctg 1646101 gcgatcgacc agctcagcgt atggtggcgg gctgccattg gtgcagccta acggcatgcc 1646161 cgggaattgc ttaggcgatc tcaatgtgac cagcacaacc ctgccgcata gggcatccct 1646221 ggtagaccga tcaacggaat tttgtcacac tgatgttgtg aaaatcccgg cggtctctac 1646281 cactgtcccc gcggcagtct cggacggtca cactcgtcgg gccattgtgc gcttgctgct 1646341 ggaatccgga tcgatcaccg ccggcgagat cggtgaccgg ctgggcctgt cggccgccgg 1646401 tgtgcggcgt catctggacg cgctgatcga ggcgggtgac gcggaagcgt cggcggccgc 1646461 gccgtggcag caggtgggac gcgggcggcc cgccaagcgc taccggctga ccgcggccgg 1646521 ccgggccaag ctcgaccact cctatgacga cctggcgtcg gcggccatgc ggcagctgcg 1646581 ggagatcggc ggcgaggagg cggtgcggac gtttgcccgg cgccgtatcg acgccatcct 1646641 ggccgacgtc gcgccggccg acggtcccga cgacgccgcg ctcgaggcgg ccgccgagcg 1646701 gatcgcaacg gcgctcagca aagccggcta cgtcgccacc accacgcggg tgggcgggcc 1646761 gattcacggt gtgcaaatct gccagcacca ttgcccggta tcccatgtcg ccgaggaatt 1646821 ccccgaattg tgcgaaaccg agcagcaggc catggccgag gtgctcggca cccacgtcca 1646881 gcggttggcg accatcgtca acggagactg cgcctgcacc acccacgtac ccctgtcgcc 1646941 ggcgcccagc ccgcgcccac ccgccaccag caccgaagga gcgtcccgat gacactcacc 1647001 ccagaggcca gcaagagcgt tgcccagccc ccgacccagg ctcccctgac ccaggaagag 1647061 gcgatcgcgt cgctgggccg gtacggctac ggctgggcgg actccgacgt cgcgggtgcc 1647121 aacgcgcagc gcgggctttc cgaggcggtg gtccgcgaca tctccgcgaa gaagaacgag 1647181 cccgattgga tgctgcagtc gcggctgaag gcgctgcgca ttttcgaccg caagcccatt 1647241 ccgaagtggg gctccaacct cgatggcatc gatttcgaca acatcaagta cttcgtgcgc 1647301 tccaccgaga agcaggccgc gagctgggat gatttgccag aggacatccg caacacctac 1647361 gaccggttgg gaatcccgga ggccgagaag cagagattag tagctggagt agccgcacaa 1647421 tacgaaagtg aagttgtata tcaccagatc agagaggatc tggaggctca aggagtcata 1647481 tttttagaca ctgatactgg tttgcgagaa cacccggata ttttcaagga atatttcggt 1647541 acagtaatcc ctgccggcga taataagttt tctgcattga atactgcagt ttggagtggt 1647601 gggtccttta tttacgtccc gcccggtgtt cacgtcgaca ttccgctgca ggcctacttc 1647661 cgaatcaaca ccgagaacat gggccagttc gagcggacgc tgatcatcgc cgatgagggc 1647721 tcttacgtgc actacgtaga gggctgcctg cccgccggcg agctcatcac gaccgccgac 1647781 ggcgatttgc ggcccatcga gtcgattcgc gtcggtgact tcgtcaccgg ccacgacggg 1647841 cggccacacc gcgtcaccgc tgtacaggtg cgtgacctcg atggcgagct gttcaccttc 1647901 acaccgatgt cgcctgccaa cgcattctct gtcaccgccg agcaccccct tctcgctatt 1647961 ccccgcgacg aggtgcgtgt tatgcggaag gaacgcaatg ggtggaaggc tgaagtcaac 1648021 agcaccaagc tgcgtagcgc cgagccgcga tggatcgcgg cgaaggatgt ggccgagggt 1648081 gacttcctga tctaccccaa gccgaagccg atcccccaca ggacggtttt gccgctcgag 1648141 tttgcgcgcc tggcgggcta ctacctggcg gagggtcacg cgtgtctcac caatggctgt 1648201 gagtcgctga tcttctcgtt ccacagcgat gagttcgagt acgtcgagga tgtgcgccaa 1648261 gcgtgcaagt cgctgtacga gaagtcggga tcggtattga tcgaggagca caagcattcg 1648321 gcgcgcgtca ccgtgtacac gaaggcgggc tatgcggcga tgcgcgacaa cgtcggcatt 1648381 ggatcgtcga ataagaagct gtcggatctg ttgatgcgtc aagacgagac gttcttgcgt 1648441 gagctggtcg acgcctatgt gaatggagac ggcaacgtca cgcgccgtaa cggggcggtg 1648501 tggaagcggg tacatacgac atcgcgcctc tgggcgttcc agttgcagtc catcctggcg 1648561 cgtctgggtc actacgccac tgttgaactg cgccgaccgg gcggccctgg tgtgatcatg 1648621 ggccgcaacg tcgttcgcaa ggacatctac caggtgcagt ggaccgaggg cggccgcgga 1648681 ccgaagcagg cccgcgactg cggcgactac tttgcggtgc caatcaagaa gcgagcggtc 1648741 cgcgaagcac atgagcccgt ctacaacctc gatgtcgaga atccggacag ctacctcgcc 1648801 tacgggttcg ccgtgcacaa ctgcaccgca ccgatctaca aatcggattc attgcactca 1648861 gcggtggtcg agatcatcgt gaaaccccat gcgcgcgtgc gttacaccac catccagaac 1648921 tggtcgaaca acgtctacaa cctggtcacc aagcgggccc gcgccgaagc cggggccacc 1648981 atggagtgga tcgacggcaa catcgggtcc aaggtgacca tgaagtaccc ggcggtctgg 1649041 atgaccggcg agcacgccaa gggcgaagtg ctctcggtgg cgttcgccgg cgaagaccag 1649101 caccaggaca ccggcgccaa gatgctgcac ctggcgccga acacgtcgag caacatcgtg 1649161 tccaagtcgg tggcccgcgg cggcggccgc acctcctacc gtggcctggt gcaggtcaac 1649221 aagggggcgc atgggtcgcg gtccagcgtg aaatgcgatg cgctgctggt ggatacggtc 1649281 agccgcagcg acacctaccc ctacgtcgac atccgcgagg acgacgtcac catgggccac 1649341 gaggccaccg tgtccaaggt cagcgagaac cagctgttct acctgatgag ccgcgggctg 1649401 accgaggacg aggcgatggc gatggtggtg cgcggcttcg tcgagccgat cgccaaggag 1649461 ctgccgatgg agtacgcgct ggagctcaac cggctgatcg agctgcagat ggagggcgcg 1649521 gtcggatgac ggctccggga ctgacagcag ccgtcgaggg gatcgcacac aacaagggcg 1649581 agctgttcgc ctcctttgac gtggacgcgt tcgaggttcc gcacggccgc gacgagatct 1649641 ggcggttcac cccgttgcgg cggctgcgtg gcctgcacga cggctccgcg cgggccaccg 1649701 gtagcgccac gatcacggtc agcgagcggc cgggcgtata cacccagacc gtgcgccgcg 1649761 gcgatccacg actgggcgag ggcggcgtac ccaccgaccg cgttgccgcc caagcgtttt 1649821 cgtcgttcaa ctccgcgact ctggtcaccg tcgagcgcga cacccaggtc gtcgagccgg 1649881 taggcatcac cgtgaccggg ccgggggagg gcgcggtggc ctatgggcac ctgcaggtgc 1649941 gtatcgagga gcttggcgag gcggtcgtgg tcatcgacca ccggggcggc ggaacctacg 1650001 ccgacaacgt cgagttcgtt gtcgacgacg ccgctcggct gaccgccgtg tggatcgccg 1650061 actgggccga caacaccgtt cacctcagcg cgcaccatgc tcggatcggc aaggacgcgg 1650121 tgctgcgcca cgtcaccgtc atgttgggcg gcgacgtggt gcgaatgtcg gcgggcgtgc 1650181 ggttctgcgg tgcgggtggg gacgcggaac tgctggggct gtatttcgcc gacgacggcc 1650241 agcacctgga gtcgcggctg ctggtggacc acgcccaccc cgactgcaag tcgaacgtgc 1650301 tgtataaggg tgcactgcaa ggtgatccgg cgtcgtcgtt gcccgacgca cacacggtct 1650361 gggtgggtga cgtgctgatc cgtgcgcagg ccaccggcac cgacaccttc gaggtgaacc 1650421 ggaacctggt gctcaccgac ggcgcgcgtg ccgactcggt gcccaacctg gagatcgaga 1650481 ccggcgagat cgtcggcgcc ggacacgcca gcgccaccgg tcgcttcgac gatgagcaat 1650541 tgttctacct gcgttcgcgc ggtattcccg aagcacaggc ccgccggctg gtggtccgcg 1650601 gcttcttcgg tgagatcatc gccaagatcg cggtgcccga ggtacgcgag cgcctgaccg 1650661 cagccatcga acacgagctg gaaatcacgg aatcaacgga aaagacaaca gtctcatgac 1650721 cattttggaa attaaggacc tgcacgtcag cgtggagaac cccgcggagg cggaccacga 1650781 gatcccgatc ctgcgcggcg tcgacctcac cgtgaaatcc ggtgagacac atgccttgat 1650841 gggacccaac ggctcgggca agtcgacgct gtcctacgcc atcgcgggcc atcccaaata 1650901 ccacgtgacg tcgggcacca ttaccctcga cggcgcggac gtgctggcga tgagcatcga 1650961 cgaacgtgcg cgggccggcc tgtttctggc catgcaatat cccgtcgagg tgcccggtgt 1651021 ctcgatgtcg aacttcctgc gctcggcggc aaccgccatt cgcggcgagc cgccgaaact 1651081 gcggcactgg gtcaaagagg tcaaggccgc gatggccgcg ctcgacatcg acccggcctt 1651141 cgccgagcgc agcgtcaacg agggtttctc cggtggcgag aagaagcgcc acgagatcct 1651201 gcagctagaa ctgctcaagc ccaagatcgc catcctggac gagaccgact ccggcctgga 1651261 cgtcgacgcg ctgcgcgtgg tcagcgaggg ggtgaaccgc tacgccgaat cccagcacgg 1651321 cggcatcctg ctgatcacgc actacacccg catcctgcgc tacatccacc cggaatacgt 1651381 gcacgtgttc gtcggcggcc gcatcgtcga gtccggtggt tcggagctcg ccgacgaact 1651441 cgaccagaac ggctacgtgc gtttctcccc cgcaagcggg cggtaccccc accaacccgc 1651501 gccaaccgga gcctgacatg acggcctcgg tgaactcgct cgatctggcg gcgattcgcg 1651561 ccgatttccc catcctcaag cgcatcatgc ggggtggaaa cccgttggcg tatttggact 1651621 ccggcgccac ctcacaacgc ccgctgcagg tcctcgacgc cgagcgcgag ttcctgaccg 1651681 cgtccaacgg cgcggtccat cgtggcgcgc accagctgat ggaggaggcg accgacgcct 1651741 acgagcaggg ccgcgcggac atcgcgttat tcgtcggcgc cgacacggac gagctggtgt 1651801 tcaccaaaaa tgccaccgag gcgctcaacc tggtgtcata tgtgctgggg gacagccgtt 1651861 tcgagcgtgc cgtcggcccc ggcgacgtga tcgtcaccac cgagctggag catcacgcca 1651921 acctgatccc gtggcaggag ctggcccggc gcaccggggc cacattgcgc tggtacgggg 1651981 tgactgacga cgggcgcatc gacctggact cgctgtatct ggacgaccgt gtcaaagtcg 1652041 ttgcgttcac ccatcattcc aatgtgaccg gggtgctgac accggtgagc gagctggtct 1652101 cccgcgccca ccagtcgggt gcgctgaccg tgctggacgc ctgccagtcg gtgccgcacc 1652161 agccggttga cctgcacgaa ctcggcgtcg acttcgccgc gttttccgga cataaaatgc 1652221 tgggccccaa cggaatcggt gtgctgtacg gccgccgtga gctgctagcg cagatgcccc 1652281 catttctcac cggcggttcg atgatcgaaa cggtgaccat ggaaggcgcc acctacgcgc 1652341 cggcgccgca acggttcgag gccggtaccc cgatgacctc ccaggtggtc gggttggccg 1652401 ccgcggcccg ctatctcggc gcgatcggca tggccgcggt ggaggcccac gagcgggagc 1652461 tggtagccgc ggccatcgaa ggcctgtccg gcatcgacgg tgtgcggatc cttggcccga 1652521 cgtcgatgcg ggaccgaggg tcgccggtgg cgttcgtcgt cgagggcgtg cacgcgcacg 1652581 acgtgggtca ggtactcgac gacggcggcg tggcggtgcg ggtcgggcac cactgcgcgc 1652641 tgccgctgca ccgcaggttc ggtctggccg ccaccgcgcg ggcgtcgttc gcggtgtaca 1652701 acaccgcaga cgaggtggac cgcttggtgg ccggcgtgcg gcgatcccgg catttctttg 1652761 gaagagcgtg acgttgcgtc tggagcagat ctatcaggac gtgatcctcg atcactacaa 1652821 gcatccgcag catcgggggc tgcgggagcc gttcggcgcc caggtgtatc acgtgaaccc 1652881 gatctgcggc gacgaggtca cgctgcgggt cgcgttgtcc gaggacggca ccagggtcac 1652941 cgacgtttcc tatgacggac aaggctgttc gatcagccag gccgcgacct cggtgctcac 1653001 cgaacaggta atcggacaac gcgtgccgcg ggcgctgaac atcgtcgacg ccttcaccga 1653061 aatggtgtcc tcccgcggga ccgtgccagg cgacgaggac gtcttaggcg atggggtcgc 1653121 gttcgccggg gtggccaaat acccggcccg ggtgaaatgc gcgctgctcg gatggatggc 1653181 gttcaaagat gcgctggccc aagccagcga agccttcgag gaggttacag atgagcgaaa 1653241 ccagcgcacc ggctgaggaa ttgctcgccg acgtcgagga ggcgatgcgc gacgtcgtcg 1653301 acccggagct ggggatcaac gtcgttgacc tgggcctggt ctacggcttg gacgtgcaag 1653361 acggtgacga agggaccgtc gcgctgatcg acatgaccct cacgtcggcg gcgtgcccgc 1653421 tgaccgatgt catcgaggat cagtcgcgca gcgcgctggt cggcagtggc ctggtcgacg 1653481 acatccgcat caactgggtg tggaacccgc cgtggggccc ggacaagatc accgaagacg 1653541 gccgcgaaca attgcgggcg ctcggcttca ccgtctgaac cggcgcgtcg ccgaacgtga 1653601 actgagggcg gagaatccgg caaaataccg ccgtgagttc acgttcggcg ggcggtgcga 1653661 gcgaaacccg cctcagaagg cgtcttcggg cacgcgcatg atgtcgtcgt cgatgttttc 1653721 gatgacactg cgcaccccgg tcagtttcgg cagcatgttc ttcgcaaaga acgccgcgac 1653781 cgcgatcttg ccccgataga acgcttcatc gttctgcgat ggcccgtcgg ccagtgcggc 1653841 gtgtgcgacc ccggccagca cgagcagccg ccagccgatg agcaagtcgc ccacggcgag 1653901 caaatagcgc acggatccga gccccacctt gtagatgtcg ctggagtgct gcgcggcgga 1653961 catcaggtac ccggtcagcg cgcccgtcat tgccgtgatg tcgtcgagcg cggtgcgcag 1654021 cagctcggct tgcggtttta gcgacgggtc aatgttctcg acggtgtggg tgacctgagc 1654081 cagcacaaat tgcaaagcct tgccgtgatc gcgcacgatc ttgcggaaga agaagtccag 1654141 tgcctggatc gccgtggtgc cctcgtagag ggaatcgatc ttggcgtcac ggatgtactg 1654201 ctcgagggga tagtcgacca gaaagcccga gccgcccagc gtctgcagcg actcggtgag 1654261 gatttcgtag gcgcgttctg aacccacgcc cttgacgatg ggcagcagca gatcgtccac 1654321 gcggtgcgcc atgtcgtgat cggcacccga aacccgttgg gccacagcgt cgtcctggtg 1654381 agcagcggca tacaggtaca gcgcccgcag gccttcggca taggcctttt gggtcatcag 1654441 gctgcgccgc acgtcggggt ggtgcatgat tgtgacccgc ggcgccgtct tatccgtcat 1654501 ctgggtcaga tccgcgccct gcacccgctc cttggcgaag gcgagtgcgt tgagatagcc 1654561 cgtcgacaat gtgccggcgg acttaactcc gatggtcatg cgagcatgct caatcaccgt 1654621 gaacatctgc gcaatcccgt tgtgcacgcc gccgaccaga tagccaacgg cgggcacgtc 1654681 ggcaccgccg aacgtcaatt cgcatgtcgg agaggacttt aagcccatct tgtgttccag 1654741 gccggtcacg tagacgccgt tgcgggcgcc gagctcgaac gtatcggggt cgaagaggta 1654801 gttgggaacg tagaacaggc tcaacccctt ggtgcctggg ccggcgccct caggtcgggc 1654861 caacaccaaa tggaagatgt tctccgcggt attgccgaca tccccaccgg agatgaaccg 1654921 cttgacgccc tcgatgtgcc aggtgccgtc gggttgttcg aacgctttgg ttcgacccgc 1654981 gccgacatcg gaaccggcgt cgggctcggt gagcaccatg gtggcctgcc agccgcgctg 1655041 cacgccctcg gccgcccacc tgcgttgctc atcattgccc tcgatgtaaa gggactgggc 1655101 cagcaccggg cccaggttga aaaagcacgc cgacgggttg gcgcagtaga tcatttcgtt 1655161 gacggcccat gccagcggcg gcggcgctgg catgccaccg atctcctcgg ccaggcccag 1655221 ccgccaccag ccggcctcct tgattgcctg cactgtcttg gccaactcgt cgggcacgct 1655281 gatggagtgg gtgttcgggt cgaagaccgg tgggttgcgg tcggcgtagc cgaaggattc 1655341 ggcgatcgga ccctcggcca gccgcgccgc ttcggccaag atggtgcgga ccgtgtcgac 1655401 gtccagatcg ctgtagcgtc cggtgcccag gaccgcgccg atatcaagga cttcgagcag 1655461 gttgaactcg agatcgcgga cattggcgat gtagtgtccc aatgcggttc ccttcaggtg 1655521 gctgatcggc cctgatcggg cccagtctct ccgagcggga agaacgtacg caaccgtaac 1655581 ctgcggtggg agggcggaac tgcggcgact atgttccgtt cgcgccgggc aggccgagca 1655641 gcagcccgcc cctgccgccg agcccggggg cgccggcccc gccgccgtcg ccgccgtcac 1655701 cgccgttacc gatcagctgg gcgttgccac cgttgccgcc gttgccgccc aacgcgccgc 1655761 catcgccgcc ttccccgccg ttgccgaaca acccggcctg gccgccggcc ccgccgtggg 1655821 cgctcgatgc ccccccggct ccgctgccgc cggcgccgcc gttgccatag aagaacccgg 1655881 catcgccgcc acgcccagcg ctacccgcgg atagggctgc cccgccggca ccaccgtcgc 1655941 cgaacaggaa ggccctgccg ccggcgccac ctccgccgag gaagctgctg gcgccagcac 1656001 cgccgttgcc aaaaaacagc ccgccgttgc ctccagagcc accggctccc atgccgttgg 1656061 ggctgatgcc acccgcgccg ccggccccga agagcacggc ggagccgccg atgccgccgg 1656121 caccgccgcc accgccgcta ttgccgccgg ccccgccgtt gccgaacagc cacccgccgg 1656181 tgccgccggc gccgccgttg gcgcccaggg cgcccacgcc gccgttgccg ccgtggccca 1656241 gcagtccggc ggcgccgccg ttgccgccgg ggccggcagc gggcgagaag ccgttgccgc 1656301 cattgccgat caggagtccg ccggcctggc cgttggggtt cgccgcggtc ccatcggcgc 1656361 cgttgccgat cagcggacgg ttcaacagcg ccagggtggg cgcgttgatg acgtcgaata 1656421 gcggctgcaa ggggccaaag ttggtggcct ccgcggcggc gtacgcctgc gcgccgccgg 1656481 acagggcttg cacgaactgg ctgtgaaacg ccgcggcttg ggcgctgagc acctgatagg 1656541 tctggccgtg cgcgccgaac aacgccgcga ccgccgccga cacctcatcg gcgcctgcgg 1656601 ccgcgacagc ggtcgtctgg gccgccgcgg ccgagttggc cgcgctaatc atcgaaccga 1656661 ggcgcgcgag attccccgcc gctcccgaca cgaactccgt attcgcgacc acgaacgaca 1656721 tctggcacct ccgcaatgaa gagctagcga ccgacgtatc ttatcgcgat ccagcggccg 1656781 cttcacccgt ttcggggtaa cgcaccccgc cagaatggtt aatccgttag tggccccgct 1656841 tgccttgtgc cagtgaccaa ttcaatcgca taccgcaatg caatcgagat ttttggtcgt 1656901 tcctgcgtcc ctacactcgg ttcatcctga cgaattcgca cccctgtcgt gaggccgccg 1656961 gaatgacctt gaccgcttgt gaagtaactg ccgcggaggc tcctttcgac cgcgtttcaa 1657021 agaccattcc ccacccattg agctggggag ccgcgctgtg gtcggtagtc tccgtgcgct 1657081 gggccaccgt ggcgctgctg ctgtttctcg ccggactagt ggcgcaactg aacggtgctc 1657141 ccgaggccat gtggtggacg ctttacctgg cctgttatct ggccggcggc tggggctcgg 1657201 catgggcggg cgcacaagcg ttgcggaaca aggcacttga tgtggatctg ctgatgattg 1657261 ccgcggcggt cggagcggtc gcgattgggc agatcttcga cggcgcgctg ctgatcgtga 1657321 tcttcgccac gtccggtgcg ctggatgaca ttgccaccag acacaccgcg gaatcggtca 1657381 aaggcctgct ggacctcgcg ccggatcagg cggtggtggt ccagggcgac ggcagcgaac 1657441 gggtggtggc ggccagcgag ctggtggtgg gggaccgggt ggtggtgcgg ccgggggacc 1657501 ggatacccgc agacggtgcg gtgctgtcgg gggctagcga cgtcgaccaa cgctcgatca 1657561 ccggtgaatc gatgccggtg gccaaggccc gcggtgacga ggtgttcgcc ggcaccgtga 1657621 acggatcggg tgtattgcat ctggtggtca cccgtgaccc gagccagacc gtggtagccc 1657681 gcatcgtcga actggtcgcc gacgcttcgg cgacgaaggc caaaacccaa ctgttcattg 1657741 agaaaatcga gcaacgctac tccctgggca tggtcgcggc cacccttgcc ctcatcgtta 1657801 ttccgctgat gttcggcgcc gacctgcggc cggtgctgct gcgcgccatg accttcatga 1657861 tcgtggcatc gccatgcgcg gtggtgctgg ccaccatgcc gccgctgctt tcggcgatcg 1657921 ccaacgcagg ccgtcatggg gtgctggtca aatccgcggt ggtcgtcgaa cgcctggccg 1657981 ataccagcat cgtcgctttg gacaagaccg gtacgctgac ccgtggcatc ccgcgactgg 1658041 cttccgtcgc accgctggac cccaacgtgg tcgatgcccg gcgattgttg caattggcag 1658101 ctgccgcaga acaatccagc gagcacccgc ttggccgggc gatcgtcgcg gaagctcgtc 1658161 ggcgtggtat cgccataccg cccgccaagg acttccgcgc ggtcccgggc tgcggggtcc 1658221 acgccctggt gggcaacgat ttcgtcgaga tcgccagccc gcaaagctac cgcggtgcac 1658281 cgctagcaga gctggcgccg ctcctttctg ccggcgccac tgccgccatc gtcttgttgg 1658341 atggagttgc catcggtgtg ctcgggctca ccgatcagct tcgtccggat gccgtggagt 1658401 ccgtcgcggc gatggctgca ttgaccgccg caccaccggt gctgctcacg ggtgacaacg 1658461 ggcgagcggc ttggcgggtc gctcggaacg ccgggatcac cgatgtgcga gccgcattgc 1658521 tgcccgagca gaaggttgaa gtcgtgcgca acctgcaggc cggtggtcac caggtgctgc 1658581 tcgtcggcga cggcgtcaac gacgctcccg ccatggccgc cgcccgcgcc gctgtcgcca 1658641 tgggcgccgg cgccgatctg accctacaga ccgcagacgg ggtgaccata cgggacgaac 1658701 tgcacaccat cccgacgatc atcgggttgg cacggcaggc gcgccgggtg gtcaccgtca 1658761 acctggccat cgcggccacc ttcatcgccg tcctggtgct gtgggacctt tttgggcagc 1658821 tgccgctgcc actgggtgtg gtgggtcacg aagggtccac tgtgctggtg gccctcaacg 1658881 gcatgcggct attgaccaac cggtcgtggc gggccgcggc ttcggctgcg cgttaggctc 1658941 gatgtcgcag aactgaccag ggctgcgtta ggggtgcccg tgaccactcg agacctcacg 1659001 gcggcgtatt tccaacagac catctccgcc aacagcaacg tgcttgtgta cttttgggca 1659061 ccgctgtgcg ccccgtgcga cctgttcaca ccgacctacg aggcgtcgtc gcggaaacac 1659121 tttgacgtcg tgcatggcaa agtcaacatc gaaaccgaga aagatctggc ctcgatcgcc 1659181 ggggtcaagt tgttgcccac gctgatggcc ttcaagaaag gcaagctggt cttcaaacaa 1659241 gccggcatcg ccaatcccgc gatcatggac aatctggtgc aacaactccg ggcatacacc 1659301 ttcaagtccc cggccggcga aggtatcggc cctggaacaa agacttcatc ctgaggcgtt 1659361 gaggcaggcg tgactacccg agacctcact gccgcacagt tcaacgaaac catccaaagc 1659421 agcgacatgg tgctcgtcga ttattgggcc tcctggtgcg gcccgtgccg cgcgttcgcg 1659481 ccgacctttg ccgagtcgtc ggaaaaacac cccgacgtgg tgcacgccaa ggtcgacacc 1659541 gaagccgaac gagagcttgc agcggccgct cagatccgat ccatccccac gatcatggcc 1659601 ttcaagaacg gcaagttgtt gttcaaccag gccggcgcgc tgccgccggc agcattggag 1659661 agcctggtgc agcagctcaa ggcctacgag gtggaggccg gcgaagccac cacccagaac 1659721 gggcgagccc aacaagcctg accgggcgcc aggcgcccgg ctgtgcccca ccgctgcgcg 1659781 gcgcaagtcg tcgccgggta ccgttcaacg gtgagtttgg tcctcgtcga acacccgcgg 1659841 cccgagatcg cgcagattac cctcaaccgg ccggagcgga tgaactccat ggcattcgat 1659901 gtcatggtgc cgctcaaaga ggccttagcg caggtcagct acgacaactc ggtgcgggtg 1659961 gtggtgctga ccggcgcggg tcgagggttt tctccgggtg cggatcacaa gtcggcgggg 1660021 gtggtgccgc acgtcgagaa cttgactcgg cccacctacg cgctgcgttc gatggagctc 1660081 ctcgatgacg tcatcttaat gctgcgacgg ctgcaccagc cggtgatcgc cgcggtcaac 1660141 ggccccgcca tcggtggtgg gctgtgcctg gcactggctg cagacattcg ggtggcctcg 1660201 agtagcgcct acttccgggc cgccggtatc aacaacgggc tgaccgccag cgaattgggg 1660261 ctgagctacc tgttgcccag ggccattgga tcctcacgtg cgttcgagat catgttgacc 1660321 ggtcgcgacg tcagcgccga ggaagccgag aggatcgggc tggtatcccg tcaggtaccc 1660381 gatgaacagc tgctagatgc ctgctacgcg atcgccgcac ggatggcggg attctcgcgg 1660441 ccgggaattg agttgaccaa acgtacgctg tggagtggac tggacgccgc cagtctggag 1660501 gcgcacatgc aggccgaggg cttggggcag ctcttcgtcc ggctgctcac cgccaacttc 1660561 gaagaagcgg ttgccgcacg ggccgagcag cgggcgccgg tgttcaccga tgacacgtaa 1660621 cagcgcccaa gacaaccgac gaccagggag cgaatgtgat cacagctacg gacctcgagg 1660681 tccgcgctgg cgcgcgcatc ctgctcgcac ccgacggccc cgacctgcgt gtgcagcccg 1660741 gcgatcgtat cgggctggtc ggacgtaacg gtgccggcaa gaccaccacg ctgcgcattc 1660801 tggcggggga ggtcgaaccc tatgccgggt cggttacccg tgccggcgaa atcggctacc 1660861 tgccacagga tcccaaagtt ggcgatctcg acgtgctggc ccgtgaccgg gtgctgtccg 1660921 cccgcggact ggacgtcctg ctcactgatc tggagaagca gcaggcgttg atggccgagg 1660981 tcgccgacga ggacgagcgt gaccgcgcca tccgccgtta cggtcagctc gaggagcgat 1661041 tcgtcgcgct gggcggctat ggcgccgaaa gcgaagccgg ccgcatctgc gccagcctag 1661101 gcttgcccga gcgggtgctg acccagcggc tgcgtaccct ttccggaggt cagcgccgcc 1661161 gggtggaact agcccgcatt ttgttcgccg cgtccgagag tggcgctgga aattccacca 1661221 ccttgttgct cgacgagccg actaaccacc tcgacgctga ttcgctgggc tggctgcggg 1661281 acttcctgcg cttgcatacg ggcgggctgg tggtcatcag ccacaacgtg gacctggtgg 1661341 ccgatgtcgt caataaagtg tggttcctgg atgccgtgcg cggccaggtc gatgtttaca 1661401 acatgggctg gcagcgctac gtcgacgctc gggccaccga cgagcaacgt cgcatccggg 1661461 aacgcgctaa cgccgaacgc aaggcggccg cgctgcgtgc acaggccgcc aagttgggcg 1661521 ccaaggccac caaagccgtt gcggcccaga acatgttgcg ccgcgccgat cggatgatgg 1661581 ccgcactcga cgaggagcga gtcgccgaca aggtggcccg gatcaagttc cccaccccgg 1661641 cggcgtgtgg acgcacaccg ctggtggcca acggtctggg caagacgtat ggctcgctgg 1661701 aagtcttcac cggtgtcgac ttggccatcg accgcggctc gcgggtggtc atactcggac 1661761 tcaacggtgc cggcaagacc acgctgctgc gattgctggc cggtgtcgag cagcccgaca 1661821 ccggagtgct ggaacccgga tacggtttac ggatcggcta tttcgcgcag gagcacgaca 1661881 cgctcgacaa cgatgccacc gtttgggaga acgtccggca cgcggcaccg gatgccggcg 1661941 aacaggacct gcgcggcctg ctgggtgcgt tcatgttcac cggtccgcag ctcgagcagc 1662001 cggccggcac gctctccggc ggtgagaaga cccggctcgc gctggccggc ttggtggcct 1662061 ccaccgcgaa tgtgctgctg ctcgatgaac cgaccaacaa tctcgatccg gcctcgcgcg 1662121 agcaggtgct cgacgcgctg cgcagctacc gaggtgcggt ggtgctggtg acgcatgatc 1662181 ccggggcggc cgcggcgctc ggtccccaac gggtggtgct gttgcccgac ggcaccgagg 1662241 actactggtc cgacgagtat cgagatctca tcgagctggc ctgacctaga tgcggctgcc 1662301 gcgtaacgat ttcggccaaa gcaccaccgg ggcggcggcg ggttcttagg ctaggtgcct 1662361 gggatcgacg gagggtaccg atgcggaagt caaagaagac gcgcgatcag ctgctgcgcg 1662421 agttgcgcaa cgcctacgag ggcggggcca gtatccgcaa cctggcggcc accaccggcc 1662481 ggtcgtacgg atctattcac agcatgctgc gcgagtcagg caccacgatg cgcggccgcg 1662541 gcggccccaa tcgccgttcc cggccgcgtt gatccgccga ttgtgaatct gacgacgcga 1662601 cagcggcgtg tcgcgtcgtc agattcacag tcagcgcatg tcaagaccga cgcaccgagt 1662661 tctccaccag gtcgaggacg gcggctagcc gctgcgggtc ttcgccggag gccagccggg 1662721 ccagcaatcc gtcgagcacc aggtccaggt agcaccgcaa aacgtcgcta ggcacatcgt 1662781 cacgcactcg gttagcctgc ttttgccggc gcagccgatc ggtggtcgcc gccgccaatt 1662841 ccgcggagcg ctccgcccag ccgcggctga agtcagggtc gttgcgcagc ttgcgtgcga 1662901 tctccaacct ggtggccagc cagtcgaact ggtcgggcgc ggcaagcatg tcgcgcatca 1662961 caccgatgag gccttcgcgg gatgctacag ccgccattcg ctcggtatcc tcgcgcgcca 1663021 gcgcgaaaaa cagcgcgtcc ttgtcgcgga agtggtgaaa gatcgcaccg cgcgacatcc 1663081 cgattgcctg ttccaggcgc cggaccgtgg ccttgtcata gccgtattcg gcaaagcaac 1663141 ggcgcgcacc gtcgaggatc tgacggcggc gagccgccag atggtcctcg ctgaccttgg 1663201 gcacgggcgc tcggtcagcc tgacttcagt atgttgcgca gcacgtactg caggatgccg 1663261 ccgttgcggt agtagtccgc ctcaccgggg gtgtcgatgc gcaccacggc gtcgaactcg 1663321 atcgtggcgc cgtcgccctt ggtggcctgg acgcacaccg tcttgggtgt cttgccgtcg 1663381 ttaagcacgt cgataccggt gatgtcgaag acctcggtac cgtcgagtcc caacgacgac 1663441 gctgactttc cttcggggaa ctgcagcggg atcacgccca tgccgatcag gttggaccgg 1663501 tggatccgct cgaatgactc ggcgatcacc gcccgcacgc ccagtagcaa tgtgcctttg 1663561 gccgcccagt cccgtgacga acccgacccg tactctttgc cgccgaacac aaccagcgga 1663621 atgtgttgcg ccgcatagtt ctgcgcggcg tcgtagatga acgcctgcgg accgcccggc 1663681 tgggtgaagt cgcgggtata accgccggac acgtcgtcta gcagttggtt acgcagccgg 1663741 atgttggcga aggtgccacg aatcatcacc tcgtggttgc cgcggcgaga accgaaggag 1663801 ttgtagtcct tgcggtcgac accgtgttcg tcgaggtagc gcgccgcggg agttccgggc 1663861 ttgatggcgc cggcggggga gatgtggtcg gtggtcaccg aatcaccgag cagcgccagc 1663921 acccgggcac cgctgatgtt gccgaccggt tcgggtttgg ctgtcatccc ctcgaaatac 1663981 ggcggcttgc gcacgtaggt cgaattcggg tcccactcaa aggtgttgcc gctcggggtt 1664041 ggcaggttgc gccagcggtc gtcgcccttg aacacgtcgg cgtagttgcg ggtgaacatc 1664101 tcctggttga tcgccgcggc gatggtgtcg gagacatcct gctgcgatgg ccagatatcg 1664161 cggagaaaaa cgttcttacc gtctttgtct tgaccgagcg gctgggtttg gaagtcgaag 1664221 tccatggtcc cggccagcgc gtaggcgatg accagcggcg gcgatgccag gtagttcatc 1664281 ttcacgtctg ggttgatacg gccctcgaag ttccggttgc cggacagtac cgcggtcacc 1664341 gaaaggtcgt tgtcgttaac cgcttttgag atttcctcgg gcagcggccc ggagttgccg 1664401 atgcaggtgg tgcagccgta gccgaccaga tagaagccga gcttctccag atacggccac 1664461 aggccggatc tgtcgtagta gtcgttgacc acttgcgagc ccggggcaat cgtggtcttc 1664521 acccacggct tcgaggtcag tcccttttcg acggcgttgc gggccagcag cgccgcgccc 1664581 agcattactt cggggttgga ggtgttggtg caggacgtga tcgcggcaat caccaccgcg 1664641 ccgtggtcga gcacgaattc gccgagttcg tccgacttca cccgcactgg gttgctcacc 1664701 cggccatcgg catgcgcggc agccgagtgc acggtttcgt cagtggcgac gtcgtcgttg 1664761 gcgaacgtca gctgccccgg gtcgctggcc gggaatgtct cctcgactac ctcgtccagc 1664821 ttcgagtgcg ggtcgtgggg ggaatccggg gaaccattgc cgacatagtg gtaaatctgc 1664881 tcgcggaatg ttgatttggc ttgcgccaac gcgattcggt cctgtggacg ctttggtccg 1664941 gcgatcgacg gcaccacgtc ggataggttg agttcgaggt attccgagaa ctccggctcg 1665001 tgcttgggat cgtgccacat gccctgcgcc ttggcgtagg cctcgaccag tgcgacctgc 1665061 tccggcgtgc gaccggtaaa ccgcagatac ttgatggttt cttcgtcgat cgggaaaatc 1665121 gctgcggtgg aaccgaattc gggactcatg ttgcccaggg tggcgcggtt ggccagcggc 1665181 acctcggcca cgccctcgcc gtagaactcg acgaatttgc cgacgacgcc gtgctggcgc 1665241 agcatctcgg tgacggtcaa caccacgtcg gtggcggtga ctcccggctg gatctcgccg 1665301 gtcaacctga aacccacgac ccgcgggatc agcatcgata ccggctgacc cagcatcgcg 1665361 gcctccgcct cgatgccgcc gacaccccac ccgagcacac ccaggccgtt gaccatggtg 1665421 gtgtgtgagt cggtgcccac gcaggtgtcg gggtaggcca ctccgtcgcg agtcatcacc 1665481 acgctggcca ggtactcgat attgacctgg tgcacgatgc cggtgcccgg cggcaccact 1665541 ttgaagtcgt cgaaagcgcc ttggccccag cgcaggaatt ggtaacgctc accgttgcgc 1665601 tggtattcga tttcgacgtt gcgctcgaat gcgtcggcgc ggccgaacaa atcggcgatc 1665661 accgagtggt cgatcaccaa gtctgcgggc gccagcgggt tgaccttgtc cgggttgccg 1665721 cccagatcgg cgatcgcctc gcgcatggtg gccaagtcga cgatgcacgg tacgccggtg 1665781 aagtcctgca tcaccacccg ggcgggcgtg tactggatct cgatgctggg ctcggcctta 1665841 gggtcccagt tggcgatggc ctcgatgtgg tccttggtga tgttgctgcc gtcctcgttg 1665901 cgcaacaggt tctcggcgag cactttgagg ctgtagggga gtttcgcggt attggggacg 1665961 gcgtcgagac gatagatctg gtaactcttt tcgccgacct tcagggtgtc gtgggctccg 1666021 aatgagttca cagatttgct agtcacatca actcccaggg atttggttcg cccgccgacg 1666081 ggccgtgtcg acggcgtggt gtcagcctag cagtacgctt gtcctgcttt gttgccgtgt 1666141 gggtgcgcgc cgaagtgcga gcagcgcgta acgtgccagt agcacgtcgg caggaaggat 1666201 gcgatgaccg ggccatattt tcctcagacg atcccgttcc tgcccagcta cattccgcaa 1666261 gacgtcgaca tgaccgcggt caaagcggag gtcgccgcac tcggtgtcag cgctccaccg 1666321 gcggccacgc cgggcctgct cgaggtggtc cagcacgctc gcgacgaggg catcgatctc 1666381 aagatcgtgc tgctcgacca caacccgccc aatgacacac cgctgcgtga catcgcgacc 1666441 gttgtcgggg ccgactactc ggatgccacc gtcttggtgc tcagcccgaa ctatgtcggc 1666501 agttacagca cgcaataccc ccgggtcacg ctcgaggccg gggaagacca ttccaagacc 1666561 ggcaatccgg tgcagtccgc gcagaacttt gtccatgagc tgagcacacc cgagtttccc 1666621 tggagcgcgc tgaccattgt tttgctgatc ggtgtgctgg cagcggctgt gggtgctcgg 1666681 ttgatgcaac tgcgcgggag gaggtcagca acgtcgactg acgccgcccc aggggcgggg 1666741 gacgatctca atcaaggcgt ctagccagcc acatctatct cttctcgtgt tgccgcgcta 1666801 accgggcggt tgtttgcggc aaacgcgcga ggtcaccgtt gggtcacatt agtcgcacgt 1666861 accgggggca gtttgtgact tacgtttcca tagcgtcaga tgtgacgtac ggtgcaaatg 1666921 atgcttgtgg tgtcgttggc gttgacctgc gctgtccctc cgagttgagc cctaggagat 1666981 ctgagtcgaa tgagacggaa tcgccgtggc tcgccagcgc gaccggccgc acggtttgtc 1667041 cgtccggcaa ttccgtcggc tttgagtgtg gccctgctgg tatgcacacc ggggctggct 1667101 accgccgatc cacagacgga caccatcgcc gcgctgattg ccgacgtcgc caaggccaac 1667161 cagcgcctgc aagacctgag cgacgaggtt caggccgaac aggaaagcgt taacaaggcg 1667221 atggtcgacg tggaaaccgc tcgggacaac gctgccgcgg ccgaagacga cctggaggtc 1667281 agccagcgcg cggttaagga cgccaacgcg gcgatcgccg cggctcagca ccggttcgac 1667341 accttcgcgg cggccaccta catgaacggt ccctcggtca gctacctcag cgcgagcagc 1667401 cccgacgaga tcattgccac tgtgaccgcc gccaagaccc ttagcgccag ttcccaagcg 1667461 gtgatggcca acctgcagcg ggcccggacc gagcgggtga acacggagtc ggcggcgcgg 1667521 ctagccaagc agaaggctga taaggccgcc gccgacgcaa aggccagcca ggatgccgcg 1667581 gtggcggcgc tcaccgagac ccggcggaag ttcgatgaac agcgcgagga ggtccaacgc 1667641 ctggccgccg agcgcgatgc ggctcaagcc cgactgcagg cggccaggtt ggttgcctgg 1667701 tcctcggagg gtggtcaggg tgcgccgccg ttccggatgt gggatcccgg atcgggccct 1667761 gccggtgggc gtgcatggga tggcttgtgg gaccccacgc tgcccatgat ccccagcgcc 1667821 aacatccccg gcgacccgat cgcggtagtg aaccaggtgt tggggatctc ggcaacgtca 1667881 gcgcaggtca ccgccaatat ggggcgcaag ttcctggagc agctgggcat cttgcagccc 1667941 accgataccg gcatcaccaa cgctccggcg ggctcggccc agggccggat tccgcgagtt 1668001 tatgggcgcc aggcttctga atacgtgatc cgccgcggca tgtcacagat cggggtgccc 1668061 tattcctggg gcggcggcaa tgccgcgggc ccgagcaagg gcatcgactc cggggccggc 1668121 accgtcggct tcgactgctc aggcctggtg ttgtactcgt ttgctggggt gggcatcaag 1668181 ctgccgcact actcgggttc gcagtacaac ctgggccgca agatcccgtc ctcgcagatg 1668241 cgccgcggcg acgtcatctt ctacggcccg aacggtagcc agcacgtgac gatctacctc 1668301 ggcaacggcc agatgctcga ggcgcccgac gtcggtttga aggtgcgggt tgcgcccgtg 1668361 cgcacggctg gcatgacccc gtatgtggtc cgatacatcg agtactagac gaggattcat 1668421 gcgccacacg cgttttcacc cgatcaaact ggcctggatc accgcggtgg ttgccggcct 1668481 gatggtcggt gtggcaacgc ccgccgatgc cgaacccgga caatgggatc ccacgctgcc 1668541 ggcattggtc agtgcggggg cgcccggaga tccgctggcg gtagccaacg cgtcgttgca 1668601 ggccaccgcc caggccaccc agaccacgct ggatttgggc aggcagttcc tcggtgggtt 1668661 gggaatcaac ctcggcggcc ctgctgccag cgctcccagc gccgccacaa ccggcgcgag 1668721 ccggattccg cgggccaacg cccgtcaggc cgtcgaatat gtgattcgcc gggccgggtc 1668781 gcagatgggg gtgccctatt cgtggggtgg tggctcgctt cagggcccca gcaagggcgt 1668841 ggactcgggg gccaacactg tcggcttcga ctgctcaggt ctggtgcggt atgccttcgc 1668901 cggggtcggc gtgctgatcc cgcggttctc cggtgatcag tacaacgccg gtcgccacgt 1668961 tccgcccgct gaggccaagc gcggcgacct gatcttttac ggcccaggcg gcggccagca 1669021 cgtcaccctg tatctgggca acggccaaat gctggaggca tccggaagcg ccggcaaagt 1669081 cacggtgagc ccggtgcgaa aggccggaat gacgccgttc gtgactagga tcatcgaata 1669141 ctgagccagg tgtgatttgc cgggcaccac cgcggcgtcg acggaatcca ggaggcctgg 1669201 aatagttgaa cgcgggcgcg tcgctgcccc gcgacgttgg tcatgtcggc agtcgtgtcc 1669261 gattgagctg tggaggattt tgatgacatc agcaggtggg ttccccgcgg gcgccggcgg 1669321 ttaccagacc ccgggtgggc attcagcttc gccagcccac gaggcgcccc ccggtggtgc 1669381 cgaggggctg gccgccgagg tgcacacgct ggagcgggcc atcttcgagg tcaagcggat 1669441 tatcgtcggc caggaccagc tggtggagcg gatgctcgtc ggcctgctgt ccaaggggca 1669501 tgtgctgctt gagggcgttc ccggcgtggc caagacgttg gcggtggaga ccttcgctcg 1669561 ggtggtcggc gggacatttt cgcgcatcca gttcaccccg gatctggtgc ccaccgacat 1669621 catcgggacg cgcatctacc ggcaaggcag ggaggaattc gacaccgaac tcggaccggt 1669681 ggtggccaac ttcctgctcg ccgacgagat caaccgggct ccggcgaagg tgcagtcggc 1669741 gttgctggaa gtcatgcagg agcgccatgt gtccatcggc ggtaggacct tcccgatgcc 1669801 cagcccgttc ctggtgatgg cgacgcagaa cccgatcgag cacgagggcg tctacccgct 1669861 accggaggcg caacgggacc gcttcctgtt caagatcaac gtgggctacc cgtcgcccga 1669921 agaagagcgc gaaatcatct accgtatggg tgttaccccg ccgcaggcca agcagatcct 1669981 gagcacgggc gacctgctgc ggctgcagga gatagcggcc aacaacttcg tccaccacgc 1670041 gctggtcgac tatgtcgttc gagtcgtctt cgccacccgc aaacccgagc agttggggat 1670101 gaacgacgtg aagagctggg tcgcgttcgg cgcatccccg cgtgcttcgc tgggcatcat 1670161 cgccgccgca cggtccctgg cgctggtccg gggccgtgac tatgtcatcc cgcaagacgt 1670221 catcgaggtc attcctgatg tgctgcgaca ccggctcgtg ctcacctatg acgcgctcgc 1670281 cgacgaaatc tcaccggaga tcgtcatcaa ccgtgtgctg cagactgtgg cgctgccaca 1670341 ggtgaatgcc gttccacagc aaggccattc ggtgccgccg gtgatgcagg ccgcggccgc 1670401 ggcgagcggc cggtgaccga atccaaagcg ccggcggtgg tgcatccgcc gtcgatgctg 1670461 cgcggggaca tcgacgaccc gaagctggcg gcggcgctgc gcaccctcga gttgaccgtc 1670521 aagcagaagc tcgacggtgt cttgcacggc gatcacctcg gcctgatacc tgggccgggt 1670581 tcggagccag gggagtcgcg cctctaccag cccggtgacg atgtccgccg gatggactgg 1670641 gcggtcaccg ctcgcaccac tcacccgcat gtccggcaga tgatcgccga ccgggaactg 1670701 gaaacctggc tggtggtcga catgtcggcc agcctggatt ttggcaccgc ctgctgcgag 1670761 aaacgtgacc tcgcggtggc ggcggcggct gccatcacct tcctcaacag cggcggcggc 1670821 aaccggctcg gtgcgctgat cgccaacggc gccgcgatga ctcgggtgcc ggctcgcacc 1670881 gggcgccaac atcagcacac gatgttgcgc accattgcga ccatgccgca ggcccctgcg 1670941 ggggtccgcg gcgacctggc ggttgccatc gatgcgctgc gccggcccga acgtcgtcgc 1671001 gggatggcgg tgatcatcag cgattttctg ggcccgatca actggatgcg tccgctgcgg 1671061 gcgatcgcag cccgccatga ggtgctggcc atcgaagtgc tcgatccgcg cgatgtcgaa 1671121 ttgccggacg tgggtgatgt ggtgctgcag gacgccgaat ccggggttgt gcgcgagttc 1671181 agcatcgacc ctgcgctgcg cgacgacttc gctagggcag ctgcggcgca ccgggccgac 1671241 gtggcgcgca ccatccgcgg ttgcggggca cccttgctat cgcttcgcac cgaccgcgac 1671301 tggcttgccg atatcgtacg attcgtcgcc tctcgccggc gtggggcatt ggcgggacac 1671361 cagtgatggg tcagttatga cattgccgtt gctggggccg atgacgctat ccggcttcgc 1671421 gcattcatgg ttcttcctat tcctgtttgt cgtggccgga ctggtcgcgc tgtacatcct 1671481 gatgcagctg gcgcgccagc ggcgaatgct gcggttcgcc aacatggagt tgctggagag 1671541 cgtcgcaccc aagcggccat cccgctggcg gcatgtcccg gcgatcctgc tggtgttatc 1671601 gctgctgctg ttcaccatcg cgatggccgg tccgacgcat gacgtccgga ttccccgcaa 1671661 ccgcgcggtg gtgatgttgg tgatcgacgt gtcgcagtcg atgcgcgcca ccgacgtcga 1671721 gcccagccgg atggtggccg cgcaggaggc tgccaagcag ttcgccgacg agttgacccc 1671781 gggcatcaat ctgggattga ttgcctacgc gggcacggcg acggtcctgg tgtcgccgac 1671841 gaccaaccgg gaggcgacca agaatgcgct ggacaagtta cagttcgccg accgtaccgc 1671901 caccggggag gcgatcttca ccgcgctgca ggccatcgcc acggttggcg cggtgatcgg 1671961 tggcggcgac acgccgccgc cggcgcgcat cgtgctgttc tccgacggca aggagacgat 1672021 gccgaccaac ccggacaacc ccaagggcgc ctacaccgcc gcccgcaccg ccaaggacca 1672081 gggcgtgccg atttcgacga tctcgttcgg caccccatac ggcttcgtcg agatcaacga 1672141 ccagcgccaa ccggtgcccg tcgacgacga aacgatgaag aaggtcgccc agctctccgg 1672201 tggaaattcc tacaatgcgg cgactttggc cgagctgagg gccgtttact cgtcgctgca 1672261 gcagcagatc ggctacgaga ccatcaaggg tgacgccagc gtcggctggt tgcggttggg 1672321 tgcgctggcg ctggcgttgg cggcgctagc ggcgctgctc atcaaccggc ggttgccgac 1672381 ttagcttctc ccgcggcccc ggcagcccgc gagcgtaacc tggctgcgat ttccggcgcg 1672441 gattttcgca gtgcggttac gctcggaaag cgcgggcctc gcccacgcgg cggatgatgt 1672501 cagcggggtg gtcctcggcg acgacccgga ccacgatcca cccgtagcgg tgctggactt 1672561 tctcgtgccg gaggatgtct ttccggtagt ggtagcgact ggtcagatgg tggtcgccgt 1672621 catactcggc cgcgaccttg atgtcttgcc agcccatatc caaatgggct tccgcccagc 1672681 cccattcgtt gcgcaccgcg atctgcgtct gggggcgcgg aaagccggcg cggatcaaca 1672741 acaagcgcag ccaggtttcc ttgggggact gggcaccgcc gtcgacgagg tccagagcgg 1672801 ctcttgcggc cttcatgcca cggcggcccc gatagcgctc gatcagcggc tcgacgtcgg 1672861 ccaccttcaa atcggtggcc tgtatcaggg cgtcgacggc cgcgacggcg gggtccaatg 1672921 gaaatcgact ggtcaggtcg agcgccgttc gctccggtgt ggtcacgcgc atgccctcga 1672981 tgacgcagat ctcgtcgggc tcgatgcgct cttcccagac ttgcagcccc ggggcacggc 1673041 ggcggttggt gtcgatgatc gcggcgggaa gatccgcgtc gatccacttg gcgccatgga 1673101 aggcagaagc cgagtagccg gccagcacgc cgcggcggcg cgagcgcagc cacagcgctt 1673161 ttgcacgcaa ttgcgcggtc agttccacac cctgcggcac gtacacgtct ttatgtagcg 1673221 cgacatacct gctgcgcaat tcgtagggcg tcaatacacc cgcagccagg gcctcgctgc 1673281 ccagaaaggg atccgtcatg gtcgaagtgt gctgagtcac accgacaaac gtcacgagcg 1673341 taaccccagt gcgaaagttc ccgccggaaa tcgcagccac gttacgctcg tggacatacc 1673401 gatttcggcc cggccgcggc gagacgatag gttgtcgggg tgactgccac agccactgaa 1673461 ggggccaaac ccccattcgt atcccgttca gtcctggtta ccggaggaaa ccgggggatc 1673521 gggctggcga tcgcacagcg gctggctgcc gacggccaca aggtggccgt cacccaccgt 1673581 ggatccggag cgccaaaggg gctgtttggc gtcgaatgtg acgtcaccga cagcgacgcc 1673641 gtcgatcgcg ccttcacggc ggtagaagag caccagggtc cggtcgaggt gctggtgtcc 1673701 aacgccggcc tatccgcgga cgcattcctc atgcggatga ccgaggaaaa gttcgagaag 1673761 gtcatcaacg ccaacctcac cggggcgttc cgggtggctc aacgggcatc gcgcagcatg 1673821 cagcgcaaca aattcggtcg aatgatattc ataggttcgg tctccggcag ctggggcatc 1673881 ggcaaccagg ccaactacgc agcctccaag gccggagtga ttggcatggc ccgctcgatc 1673941 gcccgcgagc tgtcgaaggc aaacgtgacc gcgaatgtgg tggccccggg ctacatcgac 1674001 accgatatga cccgcgcgct ggatgagcgg attcagcagg gggcgctgca atttatccca 1674061 gcgaagcggg tcggcacccc cgccgaggtc gccggggtgg tcagcttcct ggcttccgag 1674121 gatgcgagct atatctccgg tgcggtcatc ccggtcgacg gcggcatggg tatgggccac 1674181 tgacacaaca caaggacgca catgacagga ctgctggacg gcaaacggat tctggttagc 1674241 ggaatcatca ccgactcgtc gatcgcgttt cacatcgcac gggtagccca ggagcagggc 1674301 gcccagctgg tgctcaccgg gttcgaccgg ctgcggctga ttcagcgcat caccgaccgg 1674361 ctgccggcaa aggccccgct gctcgaactc gacgtgcaaa acgaggagca cctggccagc 1674421 ttggccggcc gggtgaccga ggcgatcggg gcgggcaaca agctcgacgg ggtggtgcat 1674481 tcgattgggt tcatgccgca gaccgggatg ggcatcaacc cgttcttcga cgcgccctac 1674541 gcggatgtgt ccaagggcat ccacatctcg gcgtattcgt atgcttcgat ggccaaggcg 1674601 ctgctgccga tcatgaaccc cggaggttcc atcgtcggca tggacttcga cccgagccgg 1674661 gcgatgccgg cctacaactg gatgacggtc gccaagagcg cgttggagtc ggtcaacagg 1674721 ttcgtggcgc gcgaggccgg caagtacggt gtgcgttcga atctcgttgc cgcaggccct 1674781 atccggacgc tggcgatgag tgcgatcgtc ggcggtgcgc tcggcgagga ggccggcgcc 1674841 cagatccagc tgctcgagga gggctgggat cagcgcgctc cgatcggctg gaacatgaag 1674901 gatgcgacgc cggtcgccaa gacggtgtgc gcgctgctgt ctgactggct gccggcgacc 1674961 acgggtgaca tcatctacgc cgacggcggc gcgcacaccc aattgctcta gaacgcatgc 1675021 aatttgatgc cgtcctgctg ctgtcgttcg gcggaccgga agggcccgag caggtgcggc 1675081 cgttcctgga gaacgttacc cggggccgcg gtgtgcctgc cgaacggttg gacgcggtgg 1675141 ccgagcacta cctgcatttc ggtggggtat caccgatcaa tggcattaat cgcacactga 1675201 tcgcggagct ggaggcgcag caagaactgc cggtgtactt cggtaaccgc aactgggagc 1675261 cgtatgtaga agatgccgtt acggccatgc gcgacaacgg tgtccggcgt gcagcggtct 1675321 ttgcgacatc tgcgtggagc ggttactcga gctgcacaca gtacgtggag gacatcgcgc 1675381 gggcccgccg cgcggccggg cgcgacgcgc ctgaactggt aaaactgcgg ccctacttcg 1675441 accatccgct gttcgtcgag atgttcgccg acgccatcac cgcggccgcc gcaaccgtgc 1675501 gcggtgatgc ccggctggtg ttcaccgcgc attcgatccc gacggccgcc gaccgccgct 1675561 gtggccccaa cctctacagc cgccaagtcg cctacgccac aaggctggtc gcggccgctg 1675621 ccggatactg cgactttgac ctggcctggc agtcgagatc gggcccgccg caggtgccct 1675681 ggctggagcc agacgttacc gaccagctca ccggtctggc tggggccggc atcaacgcgg 1675741 tgatcgtgtg tcccattgga ttcgtcgccg accatatcga ggtggtgtgg gatctcgacc 1675801 acgagttgcg attacaagcc gaggcagcgg gcatcgcgta cgcccgggcc agcaccccca 1675861 atgccgaccc gcggttcgct cgactagcca gaggtttgat cgacgaactc cgttacggcc 1675921 gtatacctgc gcgggtgagt ggccccgatc cggtgccggg ctgtctgtcc agcatcaacg 1675981 gccagccatg ccgtccgccg cactgcgtgg ctagcgtcag tccggccagg ccgagtgcag 1676041 gatcgccgtg accgcggaca tccgggccga gcgcaccacg gcggtcaacg gtctcaacgc 1676101 atcggtggca cgctgagcgt ccgacaacga ctgcgttccg atcggcaatc gactcagccc 1676161 ggcactgacc gcgatgatcg catcgacgtg cgcggcattc tcgagcaccc gcaatgcgcg 1676221 cgatggcgcg tggtcgggaa cccggtgttg ccgtgacgat tcgagcaact gctcgacgag 1676281 gccacggggc ttggcgacgt cgctagatcc cagtccgatg gtgctcaagg cttcggcggc 1676341 cgagcgcacc gctgaccgca acgcgtattc ggcatcgccg agctcgtagt gttccaacac 1676401 tggggccccg ggaagtgaat acaccatcca agaaagtgca cacaattcgg gcgtgagtgg 1676461 ctcgctctgc gcggcttcgt cgacatcgcc ataggagaac tccgggacca ggccaacggc 1676521 gctgccggga tcctccgggt tggcgacgat caccgcctcg ccggcggcaa gggcgtcgtg 1676581 ctcgaactgt gttcccgcag ccagcccgcg cacatcgccc ggcaccggca acaccacatt 1676641 gatcgtcccc cgcagtcggc gccggcccac cgcggcgcgc agtgtctgca ggagcgagac 1676701 cgttccagca tcgtggacgt cgggccacgg cagcccggtg tggcccgcag ccacggcatc 1676761 ataagctgcg acggattgcg ttggcgccca aagtgataat gcatccaaca cgtcgtcggg 1676821 agcagccttg ccggcgagcc aagcgttagc ccagatcgac agcgaaacac tgggacacca 1676881 catgatcttg cagtgtagtt gttcgacccg gctgacgcgg atcacgcgta tcctaagcgc 1676941 atgcccgtcg ctttgatctg gcttatcgcg gcgttggtgc tcgtcggcgc agaggcactg 1677001 accggcgaca tgttcttgct gatgctcggc ggcggtgcgc tggccgcctc ggtaagcagc 1677061 tggctgctgg cttggccgat gtgggccgac ggggcggtgt ttctcctcgt ctcggtgctg 1677121 ctgctggtgt tggttcggcc ggcggtgcgg cgccggctga cgcagaccaa aggtgtgcag 1677181 ctgggcatcg aggcgctgga gggtaagaag gcggtggtgc ttggtcgggt ggcccgcgac 1677241 gggggtcagg tgaagctgga cggccaggtg tggacggcgc gcccgctcaa cgacggtgat 1677301 gtgttcgaac ctggtgactc ggtgaccgtg gtgcaaatcg acggcgccac ggcggtggtc 1677361 ttcaaggacg tgtagggact cgagaaagga attccggtgc aaggagccgt tgctggtctg 1677421 gtgtttctgg ccgtcctggt gattttcgcc atcatcgtgg tggccaagtc ggtggcgctg 1677481 atcccgcagg cggaggccgc ggtgatcgag cggctgggtc gctatagtcg tacggtcagt 1677541 gggcagttga cgctgttggt gccgttcatc gaccgcgtcc gggctcgggt ggacctgcgc 1677601 gagcgggtgg tgtcgtttcc gccgcaaccg gtgatcaccg aggacaactt gacgctgaac 1677661 atcgacaccg tcgtctactt ccaggtgacc gttccgcagg cggcggtgta cgagatcagc 1677721 aattacatcg tcggggtcga acagctcacc accaccaccc tgcgcaacgt tgtcggcggg 1677781 atgacgctgg agcagacgtt gacctcgcgt gaccagatca acgcccagct gcgcggcgtt 1677841 ctcgatgagg cgaccggccg ctggggtctg cgggtggcgc gggtggagct gcgcagcatc 1677901 gatccgccgc cgtcgattca ggcgtcgatg gaaaagcaga tgaaggccga ccgggagaag 1677961 cgagcgatga ttctgaccgc cgaaggtacc cgggaggcgg cgataaaaca ggccgagggg 1678021 caaaagcagg cgcagatcct ggccgccgag ggcgccaagc aggccgcgat cttggctgct 1678081 gaggccgatc ggcagtctcg gatgctgcgc gctcagggtg agcgcgccgc ggcctacctg 1678141 caggcgcaag ggcaggccaa ggccatcgag aagacgttcg ccgcgatcaa ggctggccgg 1678201 cccaccccgg agatgctggc ctaccaatac ctgcagacgc tgccggagat ggcgcgtggg 1678261 gacgccaaca aggtatgggt ggtgcccagc gacttcaacg ccgcactgca ggggttcacc 1678321 aggctgctgg gcaagccggg tgaggacggg gtgttccggt tcgagccgtc cccggtcgaa 1678381 gaccagccca agcacgcggc cgacggtgac gacgccgagg tcgccggctg gttctccacc 1678441 gataccgacc cgtcgatcgc tcgggcggtg gctacagccg aggcgatagc ccgcaagccg 1678501 gtcgagggtt cgctggggac gccccccagg ttgactcaat agagtggtcc gatgagtggt 1678561 ttgacctcac cgaaaaccta tgcggtactg gcagctctgc aggcgggcga cgcggtggcg 1678621 tgcgccatcc cgctgccacc tatcgccagg ttactcgacg acttggacgt tccggtcagc 1678681 gttcgcccgg tgctgccggt ggtcaaggcc gcctctgcgg tcggtttgtt gtcggtcacc 1678741 cgattcccgg ccttggcgcg gctgacgaca gcgatgttga cgttgtactt catcctcgcc 1678801 gtgggggcac atgtccgggt gcgagatcgc gttgttaatg cgattccggc ggcgtcattc 1678861 ctgacgttgt tcgcgctgat gacggcaaag gggccggagc gcacttaagc atggaggcgc 1678921 aactcgacct atggcagtgg tgtgtcggtc ggtgaggtcg aggtgctcaa ggtcgaaaac 1678981 agccgggtgc gcgccgagca gctggccaaa ctgtacgaat tgcgctcaag tcgggatcgg 1679041 gtcagggtcg acgccgcact agccgagctg agccgcgccg cggccgcccg cggttgtgcc 1679101 ggtactagcg ggctcggcaa caacctgatg gcgccggggc cgccccattc cctcctggga 1679161 cgggatcgct gacgccacaa tcgacctgct acgaaggctg gccgagcggc tggggtacac 1679221 actggattgg cgagcgatcc gtggagccga acccgttgcc accgccattc tgcgtcggtt 1679281 agtctctttt tcgaccttgg ggcgcggagg gtcgttatgg tgtgtcacag tgctttgctg 1679341 tcaaaggcat tggcggtgcc gaccaagcga cactgggcag tgcagaaatc ctcgtgaaat 1679401 acgctcaact cgctgacaaa cgcgctcggg tatatgtcct ggtgtcgacc tggttggtcg 1679461 tgtggggtat ctggcatgtg tattttgtcg aagctgtctt tccgaatgcc atcctgtggt 1679521 tgcattatta cgcggccagc tatgaattcg ggtttgtacg tcgcgggctg ggcggtgaac 1679581 tgattcgcat gttgaccggc gatcatttct ttgccggcgc ctataccgtt ctgtggacgt 1679641 ctatcacggt gtggctgatc gcccttgccg tcgtggtgtg gcttatcctt tccacgggca 1679701 accggtccga gcgcaggata atgcttgccc tcctcgttcc ggtgctaccc tttgcctttt 1679761 cttacgccat ctataatcca catccggaac tcttcgggat gaccgcgttg gtagccttca 1679821 gcatttttct gaccagggcc cacacctctc gaacccgggt gatcctcagt acgctgtacg 1679881 gacttacgat ggccgtgctg gcgctcatac acgaagcgat tccactggaa ttcgcactcg 1679941 gcgcggtgct ggcgataatc gtgttgtcga agaatgcgac aggtgcgaca aggcgaatct 1680001 gtactgcgtt ggccatcggt ccggggaccg tctcagtatt gttgctcgct gtggtcgggc 1680061 gtcgcgatat cgcggaccag ttgtgtgccc atatcccgca tgggatggtc gaaaatccgt 1680121 gggcggttgc aacgacaccg cagcgagttc tcgattacat attcggtcgt gtcgagagcc 1680181 atgcagatta ccacgattgg gtgtgcgagc atgtgacccc gtggtttaac ctcgactgga 1680241 ttacctctgc aaagctggtg gccgtggttg gcttccgcgc actattcggt gcattcctcc 1680301 tcgggttgct gttcttcgtt gccacgacat cgatgatccg ctatgtctcc gccgtgccgg 1680361 tcagaacctt ctttgccgaa ctgcgcggca atctggcgtt gccggtgctg gcatcggcat 1680421 tgctggttcc gctgttcatc accgctgtcg actggactcg ctggtgggtg atgatcacac 1680481 tcgacgtggc cattgtctac atcttgtacg cgatcgacag accggagatc gagcaaccgc 1680541 cgtcgaggag aaacgtgcag gtcttcgtct gcgttgtgtt ggtgctggcg gtgataccga 1680601 ccgggtccgc caacaacatc ggcagatgag gcaccccgcg ggaccacccg aaggcgggca 1680661 tggtgacgta ggccaaccgc cgctgacatg cttgggacgg tgatgctgtt gcaggcctat 1680721 taggggttgt cggatcggga gccttgtgac cggttggccc ttgatctgcg ttgggaggcc 1680781 gcggcggggt tgacggtgca cgcgccgtcg ttgcatccca cggtgttggt cgggatgcgt 1680841 aaccggctgc gggcttcgga tccgacgtgt tggtggatgc accattttca aatgccgtcg 1680901 cgggtcgaaa cttgggtgcc gtcgaagaat aaccccacca aaggccctac atcagcgccg 1680961 tcctacgttc gtgtgtcgga caatccttag tgccgatgcc ggatattcgg gcactaacgg 1681021 aaaagacgtc ctccgcgtag aggctccgtt gttcgaggcc cagttacagg ggcaaggtca 1681081 gtggccgtga cctctgcttc ccgacacgag aatgctggcc gaccgaacgt agcgcggtgc 1681141 gttgacggca tcgagctgcc acgccaaatt tgcacgcgct gatgcgctga ccccgaccga 1681201 aggtttatca aatgagagcc ggctcgcgca cagggtcgtc gtaacccggc atgcgtcggt 1681261 gctgccgtcg ataattgcgg atctcataga cgagcccagt cagacctagc gcgcccgtgc 1681321 ataccgacac caggatcagc agcgggctac cgctaccggc gaacgcgtcg cccaggatga 1681381 ccaccgccgc ggtgccgggg agcaagcctg ccaaggtcgc ccaggcgaag gacaggatcc 1681441 gcacgcccga ggcgccggcg gcatagttga tcgccgcgaa cgggacgacg ggaatgagcc 1681501 gcagcgacaa gatggccagc cagcctcgct cacgcagacg ctcgtccagc cggttgatcg 1681561 ctcggcggcg caccagactg ttcagctgcc agccggtggc acgcaccagc agcatcgcga 1681621 ttaccgcgct agcggtgctg ccgaccaccg cgatgaatac gcccaccaca gagccgaaca 1681681 acagcccggc ggccaacgtg aacgcggtgc gggggaatgg cggcaccgtg acgacggtat 1681741 gcaccagcaa aaatgccagc gggaaccacg cgcccagtga cttggcccag tcgcgcaatt 1681801 ccaccgcagt gggcaccgga accagcagcg cgaccactac cagtactgtg attcccacca 1681861 ctgttcccac gatgcgcggc agcgacgcct gacgcgcgac cgcgccgagc gaggtggcga 1681921 taccgtgcac ggtttcggtg gtgttgcaga tggcgggagc cgtcacgtct tcggagcgta 1681981 cggggtcaac atgaataact cgtttcccca ggctggcgtt tcgtcacact ccggccgcga 1682041 ttgccgcacc tgggcgtcta tatgggcgtc ccgatcaact agccttatta gttaagtgac 1682101 aatcccgaag caagcccaag caacatcgct aattgctggg aaaacaggag cagtcggtgt 1682161 ccattgatgt acccgagcgt gccgacctag aacaggttcg cgggcgctgg cgcaacgcgg 1682221 ttgccggtgt gctgtccaag agcaaccgta ccgactcagc acaactcggc gatcaccccg 1682281 agcggctgct ggatacccag accgctgacg ggttcgccat ccgggccctc tacaccgcgt 1682341 tcgacgagct cccggagccg ccgttgccgg gccagtggcc ctttgtgcgc ggcggagacc 1682401 cgctgcgcga cgtgcattcc ggctggaagg tcgccgaggc gtttcccgcc aacggtgcga 1682461 cggccgacac caacgcggcg gtgctggccg cgctcggcga gggggtcagc gcgctgctga 1682521 tccgggtggg ggagtcgggt gtggcgcctg accggctcac ggcgctgctg tccggggtgt 1682581 atctgaacct ggcgccggtc atcctcgacg ccggcgccga ctaccgcccg gcctgcgacg 1682641 tcatgctggc gctggtcgcc cagctcgatc ccggccagcg cgacaccctg tcgatcgacc 1682701 tgggcgccga cccgctgacg gcgtcgctgc gcgatcgtcc cgccccgccg atcgaggagg 1682761 tcgtcgcggt cgcatcccgg gcggccggcg aacgtgggct tcgtgcgatc accgtcgacg 1682821 gaccggcctt ccacaacctg ggcgcgaccg cggccaccga actcgcggcc accgtcgcgg 1682881 ccgcggtggc ctacctgcgg gtgctcaccg aatccgggct cgtggtgagt gacgcgctgc 1682941 ggcagatcag cttccggctc gccgccgacg acgaccagtt catgacgctg gccaagatgc 1683001 gggctctacg tcaactgtgg gcgcgggtcg ccgaggtcgt gggcgacccg ggtggcggcg 1683061 cggccgtcgt gcacgcggag acgtcgctac cgatgatgac ccagcgtgat ccgtgggtga 1683121 acatgctgcg ctgcacgctg gcggccttcg gcgccggtgt cggtggcgcg gacaccgtgc 1683181 tggtgcaccc gttcgacgtg gcgattcccg gcggctttcc cggcacggcg gccggctttg 1683241 cgcgccggat cgctcgcaac acccaactgc tgcttttaga agagtcgcat gtcggcaggg 1683301 tgctcgatcc cgccggcggg tcgtggttcg tcgaagagct caccgaccgg ctggctcggc 1683361 gcgcctggca gcgtttccag gccatcgagg cccgtggcgg cttcgtcgag gcccacgact 1683421 tcctggccgg ccagatcgcc gagtgcgccg cccgccgcgc cgacgacatc gcccatcggc 1683481 gcctggcgat caccggcgtc aacgaatacc cgaacctggg cgaacccgcg ctgccgcccg 1683541 gtgatccgac atcgccggtg cgccgctacg ctgccggatt cgaagcattg cgcgatcgat 1683601 ccgatcacca cctagcccgc actggcgcac ggccgcgggt gctgttgctg ccgttgggtc 1683661 cgctggccga gcacaacatc cggacgacct tcgccaccaa cctgctggcg tccggcggca 1683721 tcgaggcgat cgacccggga acggttgatg cgggcaccgt cgggaatgcc gttgccgatg 1683781 ccggttcgcc cagcgttgcc gtgatctgcg gcaccgatgc gcgctaccgg gacgaggttg 1683841 ccgacattgt gcaagcggcc cgagccgccg gtgtttcgag ggtgtacctc gcgggtcccg 1683901 agaaggcgtt gggagatgcc gcacaccggc ccgacgagtt tttgaccgcg aaaatcaatg 1683961 tggtgcaagc cttgtcgaat ctgctgacgc ggttgggggc ctagatgaca accaagacac 1684021 ccgtgatcgg cagcttcgcc ggcgttccgc tgcatagcga gcgtgccgcg caatcgccca 1684081 cagaggccgc ggtgcacacg catgtcgccg ccgccgcggc ggcgcacggg tacacgcccg 1684141 aacagttggt gtggcacacg ccggaaggca ttgacgtcac accggtatac atcgccgccg 1684201 accgggccgc cgccgaagcc gagggctacc cgctgcacag cttcccgggc gagcccccct 1684261 ttgtgcgcgg cccctatccg acgatgtatg tgaaccagcc gtggaccatc cgccagtacg 1684321 ccgggttttc caccgccgcg gattccaatg cgttttaccg acgcaacctg gccgccggcc 1684381 agaaggggct gtcggtggcc ttcgatctgg ccacccaccg cggctacgac tccgaccatc 1684441 cccgcgtgca gggcgatgtc ggaatggccg gtgtggcaat cgattccatt ctcgacatgc 1684501 gacagctgtt cgacggcatc gacctgtcga ccgtgagcgt gtcgatgacg atgaacggtg 1684561 cggtgctgcc gatcctggcg ctgtatgtgg ttgccgccga ggagcagggc gtggcgccgg 1684621 agcagctggc cggcaccatc cagaacgaca tcctcaaaga gttcatggtc cgcaacacct 1684681 acatctatcc gccgaagccg tcgatgcgga tcatctccga catcttcgcc tacaccagcg 1684741 ccaagatgcc caagttcaac tccatctcca tttccggcta tcacatccaa gaagccggtg 1684801 ccacggcgga tttggagctg gcctacaccc tggccgacgg cgtcgactac atcagggcgg 1684861 gcctgaacgc cggcctggac atcgacagct tcgcgccccg gctatcgttc ttctggggca 1684921 tcgggatgaa tttctttatg gaggtcgcca aactgcgggc cggccggttg ctgtggagtg 1684981 agctggtcgc acagttcgcg cccaagagcg ccaaatccct ttcgctgcgt acacattcgc 1685041 aaacatcggg gtggtcactg accgcccagg atgtgttcaa caacgtggcg cgcacatgca 1685101 tcgaggcgat ggccgccacc caggggcaca cccagtcgct gcacaccaac gccctggacg 1685161 aggcgctggc gctgcccacc gatttttcgg cccgcatcgc gcgcaacacc cagctggtgt 1685221 tgcagcagga gtcgggcacc acgcggccga tcgacccgtg ggggggctcc tactatgtgg 1685281 agtggctgac ccatcggctc gcgcggcgag cccgggcgca catcgccgag gtcgctgaac 1685341 atggcggcat ggcgcaggcc atcagcgacg gcatccccaa gctgcgcatc gaggaggcgg 1685401 ccgcgcgcac ccaggcccgc atcgactccg gtcagcaacc ggtggtcggg gtgaacaaat 1685461 accaggtgcc cgaggaccac gagatcgagg tgctcaaggt cgaaaacagc cgggtgcgcg 1685521 ccgagcagct ggccaaactg cagcggctgc gggcaggccg ggacgagccg gcggtacggg 1685581 ccgcgctggc cgagctgacc cgcgccgccg ccgagcaagg acgcgccgga gcagacgggc 1685641 tgggcaataa tctgctggcc ctggccatcg acgccgcccg ggcccaggcc accgtgggcg 1685701 agatctccga agcgctggag aaggtgtacg gacggcaccg ggccgagatc cgtaccattt 1685761 ccggggtcta ccgcgacgaa gttggaaagg cccccaacat cgcagccgca accgagctag 1685821 tggagaagtt cgccgaggcc gacggccgcc ggcccaggat tctgatcgcc aagatgggcc 1685881 aggacggcca cgaccgcggg cagaaggtga tcgcgaccgc gttcgccgac atcgggttcg 1685941 acgtcgacgt ggggtcgctg ttttccaccc ccgaggaggt ggcgcgtcag gccgccgaca 1686001 acgacgtgca cgtgatcggg gtgtcctcgc tggccgccgg ccatctgacg ctggtgccgg 1686061 cgctgcgcga cgcgttggcg caggtgggca ggcccgacat catgatcgtg gtcggtggtg 1686121 tcatcccgcc gggcgacttc gacgagctgt acgccgccgg ggccaccgcc attttcccgc 1686181 cggggacggt gattgccgac gcggcgattg acctgctgca caggctggcc gagcggctgg 1686241 ggtacacgct ggattagcga gaggcccgcg gtgccgtttc tggttgcatt atccggtatc 1686301 atctcgggcg tgcgtgatca ttcgatgacc gtgcggctcg accagcaaac tcgccagcgc 1686361 ctgcaagaca ttgtgaaagg cggataccgg agcgctaatg cggcgatcgt cgacgccatc 1686421 aacaagcgct gggaggcgct acacgatgag caactcgacg ccgcctacgc ggccgcgatc 1686481 catgacaatc cggcgtaccc gtacgagtct gaggccgaac ggagcgccgc gcgggcccgg 1686541 cgcaacgcca ggcagcagcg ctcggcacag tgaacgcgcc gttgcgtggt caggtctatc 1686601 gatgcgacct cggatacggg gccaaaccgt ggctcatcgt ctccaacaac gcccgcaacc 1686661 gtcacaccgc cgacgtggtg gctgtgcgcc tgacaacaac gcggagaacc ataccgacct 1686721 gggtcgccat gggccccagc gatccattga ccggatacgt caacgcggac aacatcgaga 1686781 ccctcggcaa agacgagctc ggtgactacc tcggtgaggt cacgccggcg acgatgaaca 1686841 aaatcaacac ggcgctcgcg accgcgctgg ggctaccgtg gccatgatgg ccgcatccca 1686901 cgacgacgac accgtcgacg ggttggcgac ggccgtgcgc ggcggtgacc gtgcggcgct 1686961 gccacgggcc atcacactgg tcgagtcgac ccgccccgac catcgtgagc aggcgcaaca 1687021 gctgctgctg cgattgctgc cggactccgg gaacgcccat cgcgtcggca tcaccggggt 1687081 cccgggggtg ggcaagtcga ctgccatcga ggcgctgggc atgcatctga tcgagcgcgg 1687141 gcatcgggtg gcggtgctgg cggtcgaccc gtcgtcgacc cgcacgggtg gatcgattct 1687201 tggtgataaa acccggatgg cgcggctggc ggtgcacccg aacgcctaca tccggccgtc 1687261 cccgacgtcg ggaacgctgg gtggggtgac gagggccacc cgggaaacgg tggtgctgtt 1687321 ggaggcggcc ggttttgatg tgatcctgat cgaaaccgtc ggggtgggcc agtccgaggt 1687381 cgcggtggcc aacatggtcg acacgttcgt gttgctgacc ttggcccgca ccggtgatca 1687441 gttgcagggc atcaagaagg gcgtgctgga gctcgccgac atcgtggtgg tgaacaaggc 1687501 cgacggggag caccacaaag aggcccggct ggccgcccgg gagctgtcgg cggcgatcag 1687561 attgatctat cctcgcgaag cactgtggcg cccaccggtg ctcaccatga gcgcggtgga 1687621 gggcagggga ctggccgagc tgtgggacac cgtcgagcgt catcgccagg tgctcaccgg 1687681 ggccggcgaa ttcgacgccc gtcggcgcga tcagcaggtc gactggacct ggcagctggt 1687741 tcgcgacgcc gtcctggatc gggtgtggtc caatccgacg gtgcgcaagg tccgctccga 1687801 gctcgagcgt cgggtccgcg ccggcgaact gaccccggcc ctggcggctc agcaaatact 1687861 ggagatagct aacctaacgg ataggtaaat aaatccgtgt ttgccgatgg tcgctgcgaa 1687921 atccacgtaa gttcgaccgt gtgatggttg acaccggagt cgatcaccgc gcggtttcgt 1687981 cccacgacgg accggacgcg ggccggcggg tgtttggtgc ggcggaccca cgctttgcgt 1688041 gcgtcgttcg agcctttgcc agcatgtttc cggggcgccg gttcggtggc ggagcgctgg 1688101 cggtgtatct cgacgggcag ccggtcgtcg acgtgtggaa ggggtgggct gatcgggccg 1688161 gatgggtgcc gtggtcggcg gattccgcgc cgatggtgtt ctcggcgacc aagggcatga 1688221 cggccacggt catccaccgg ctggccgacc gggggctgat cgactacgaa gctcccgttg 1688281 ccgagtattg gccggcgttt ggcgccaacg gcaaggcaac cctgacggtt cgtgacgtga 1688341 tgcgacacca ggccggcctg tccggattgc gtggcgcgac gcagcaagac ttgctggatc 1688401 acgtcgtgat ggaagagcgg ctggcggcgg cggtgcccgg gcggctgctg ggcaaatccg 1688461 cctaccacgc gctgacgttc ggttggttga tgtcgggcct ggccagggcc gtcaccggaa 1688521 aggacatgcg cctgctgttc cgcgaggaac ttgccgagcc gttggacacc gacggcttgc 1688581 acctgggtcg gccgccggcc gacgcgccga cgcgggtcgc cgagatcatc atgccgcaag 1688641 atattgccgc caatgcggtg ctgacctgtg cgatgcgccg gctcgcccat cggttctccg 1688701 gcggatttcg ctccatgtat tttcccggcg ccatcgcggc cgtgcagggc gaggcgccgt 1688761 tgctggacgc cgagataccc gcggccaacg gggtggcgac ggcgcgagcg ctggcgcgga 1688821 tgtacggcgc aatcgccaac ggcggcgaga tcgacggcat acggttcttg tcgcgggagc 1688881 tggtcacggg cctgacccgc aaccgacggc aagttctgcc ggatcgaaat ctattggtgc 1688941 ccttaaattt tcatcttggc tatcacggta tgccgatcgg caacgtgatg ccggggtttg 1689001 gtcatgtggg cttgggcggc tcgatcggct ggacagaccc ggagaccggg gtggcgttcg 1689061 cgctggtgca caaccggctg ctgtcaccgt tggtgatgac cgatcacgca ggctttgtcg 1689121 gcatctacca cctgatccgg caggccgccg cccaggcgcg caagcgtggt taccagccgg 1689181 tgacgccatt cggggcgccg tactcggagc cgggagccgc ggcgggctaa tctgcccgcc 1689241 taatcggcct gccggcagcg gcgctcggcg ccacggtgtc gcgatgcttc ccggatgccg 1689301 acctagctcg cggttttggt cgcgatgacg atgtcctgga agcttaggcg tggttcccgg 1689361 ccactccatg agccgtagtg caatggttcg tgcacggcga ggccgaactt gccatagaca 1689421 tccctgacga aggtctccgg caagccgatt gcttcttcgg gccgcttctt gtggattgtc 1689481 cgataacccg gtccctcatg ctggaagttg tgcgcactct ttccttccgc gatgtgggct 1689541 aacgactcgt cattgagcaa gaagtacgtg cacaggcatc gtccgccggg cttcagcacg 1689601 cgggagatct cgtccagata gtgctccacg tccggcggaa acatgtgggt gaacaccgag 1689661 gtaagaaaca ccacatcgaa cgacgcatcc ggatatggaa agcgaaagtc tagtgactgg 1689721 tatttccctt tcgggttgta cagcgagttg tagatgtcgg agacctcgaa ctggaagttg 1689781 gggtgcgccg aggtgatgtg ctcctggcac cacgcgatgg ctttctgcga gatatcgaag 1689841 ccggcgtagc gtccctcgct gttcagatag ccggtgagcg gcaacgccat ccgccccgag 1689901 ccgcagccga cgtcgagcac cgcttcgtcc ggctgcagcc cacacaggtc gaccagatac 1689961 ccgacgaatt cagcaccgac ttccttgtag gcgccgccga cgaattgtcg cagggatttt 1690021 ggaggcagcg cctcggcgga gccaccgtcg gctgaaccgc gtttcgagcg cgtcaggatg 1690081 ttctggaaaa gtcgcttaat gatgcacctc agttatcggc cgcgcttgaa ggttcaggaa 1690141 tcctccaggc ggaagccgac tttcatagtc acctggaagt gcgcgaccgc tccgtcgacc 1690201 aggtggcctc gaattgactg tacttcgaac cagtccagcg cgcgcatggt ctgcgcagct 1690261 cgggccagac cgccctggat tgccgcgtcg acgccgtcgg gcgaggtccc gacgatctcg 1690321 atcactcggt aggtgtgatt gctcatcgtg tcccctcaca ttcttttacc cgctcttacc 1690381 ggccagcggc acaccagaat agtccggtgc catcggggga gccctctacg gccggtcact 1690441 ttgagcactt gccgcgcggc agcttcggcc ggattctctc cgtcctcaat gccgctgccg 1690501 accatcatcc acgtgagttg ctcgtcgtcg ggattgcgac cttcgaccag aagcgcccgg 1690561 cggtgggggt cgatgagcac gacccgggtg gtgcggcgac gccggcggtg gtcatcaact 1690621 acgagagccg gtcgtcggct ggcggcacca tcggccattc aacaacgtca caggtagcgt 1690681 gctgtttgta tcagcagccg aaacgcccag cgctccggcc gaccaaggcg gcagcgacga 1690741 ccgcagcgac aacctggatc gaacgagtcc aaaaccgccg cggacgccac tcggccctcg 1690801 tatgatcccg aggagatacc ctacggggtg gattggggat ggatcggcga tgcgcctctc 1690861 gatcgtaacg actatgtaca tgtcagagcc ttacgtgctg gagttctaca ggagagcgcg 1690921 cgcggcggcg gacaaaatca cgcctgacgt cgagatcatc ttcgtggatg acggctcgcc 1690981 ggacgcagcg ctccagcagg ccgtctcgct gctcgacagc gacccctgtg ttcgggtaat 1691041 tcagctttcg cgaaatttcg gccaccacaa agcgatgatg accggcctgg cgcacgccac 1691101 gggggatctc gtctttctga tcgactcaga cttggaagag gacccggctc tcctagagcc 1691161 gttctatgaa aagctgatct cgacgggcgc cgacgtagta tttggttgcc acgcgcggcg 1691221 gcccggcggt tggttgagga atttcggacc gaaaatccat tatcgggcgt ccgccctgct 1691281 gtgtgacccc ccgcttcatg aaaatactct caccgtgcgg ctgatgacag ccgactatgt 1691341 acgcagcttg gtccagcacc aggagcgtga actttcgatt gccggtctgt ggcagattac 1691401 tggtttttac caggtgccca tgtccgtaaa caaggcatgg aaaggaacga ccacatacac 1691461 gtttaggcgt aaagtagcga cactggtcga caatgtcact tcatttagca acaaacctct 1691521 agtcttcatt ttctatcttg gtgcggccat ttttattatt tcaagctcgg ccgcgggcta 1691581 tctgatcatc gatcgaattt tctttcgcgc tctgcaagcg gggtgggcat ccgtgatcgt 1691641 atccatctgg atgctggggg gtgtgacgat tttctgcata gggctggtcg gaatttatgt 1691701 atccaaagtc ttcatcgaaa ctaagcagcg gccatacaca attatccgaa gaatctacgg 1691761 ttcggattta acaacccggg agccatcctc tctgaagacc gccttcccgg ccgcgcacct 1691821 gtcgaacggg aaacgcgtca catcagagcc agagggattg gcaactggca acaggtgaat 1691881 aagcgtagca tgattcctgt aaaggttgaa aacaatactt cgctcgatca ggtgcaagac 1691941 gctcttaatt gcgtcgggta cgcggttgta gaagatgtgc ttgatgaggc gtcactggca 1692001 gcgacccgtg atcgcatgta tcgtgtacag gagcggattc ttaccgagat tggcaaagag 1692061 cggctggcaa gggccggtga gctcggtgtt cttcgactca tgatgaagta tgaccctcat 1692121 ttctttacct ttcttgaaat acccgaagtc ctaagcatcg ttgatcgtgt gctatctgaa 1692181 acggccatct tacatctgca gaatggcttt atccttccgt ccttcccgcc cttctccacg 1692241 ccggacgttt ttcagaatgc gttccaccaa gactttccca gggttctgtc cggttacatt 1692301 gcctccgtca atattatgtt cgccatcgat ccctttacac gagacaccgg cgcaacgctc 1692361 gtagtgccgg ggagccacca gcgcatagag aaaccggacc atacctacct cgcgcgcaat 1692421 gccgttcccg ttcaatgcgc ggcgggctcg ttgttcgttt ttgactctac gctttggcat 1692481 gcggctggcc gaaacacctc cggcaaagac cgcttggcca taaatcatca gtttacgcgc 1692541 tcgtttttca agcagcagat cgactacgtc cgcgcgctgg gcgacgccgt ggttctggag 1692601 cagcctgcgc gtactcagca actgctcgga tggtacagtc gagtggttac caatctggac 1692661 gagtattacc agccgccgga caagcgattg tatcggaagg ggcaaggcta gttttgcgag 1692721 aattccgttg cgcctatttg aaagcccgac atgaaacgat cgcttttaag cgcatatgtc 1692781 tgttctgcaa aaatgtctaa tttttccgat aaaggttggt gggaaagctc gatgcgtgcc 1692841 gtgttttgta ggtggccgga tgatccactt agacaggccg tggaagcaga atttgcgcgt 1692901 cccgatggcg ttgcggtggc gtaatggcct ggcgaaagct cgggagaatt tttgctccgt 1692961 cgggcgaact cgactggtcg cgaagtcatg ctgcgctacc ggttcctgaa tggatcgagg 1693021 gtgatatttt ccgcatctat ttcagcggcc gcgatggtca gaatcgttcc agtatcggta 1693081 gcgtgatcgt cgatctcgcc gtgggcggca agattctgga cattccggcg gagccgattt 1693141 tgcgccccgg cgctcgagga atgtttgacg actgtggggt gtcaatcgga tcgattgtgc 1693201 gtgccggcga tacgcgactt ttgtactaca cgggctggaa tctcgctgtc accgtgccct 1693261 ggaaaaacac cataggcgtg gcgattagcg aagcaggtgc accattcgag cgatggtcta 1693321 cttttcccgt cgttgcgctg gacgagcgtg atccattctc gctttcttat ccctgggtca 1693381 tccaagatgg agggacatac cgtatgtggt atggctcaaa tctaggctgg ggagagggca 1693441 ccgacgagat acctcacgtg atcaggtatg cgcaatcaag ggacggtgtc cactgggaaa 1693501 agcaggatcg cgtgcatatc gacacaagcg gatccgacaa tagcgcggcc tgtaggccgt 1693561 acgtcgtccg cgatgcggga gtatacagaa tgtggttttg cgctcgcggt gcgaaatatc 1693621 ggatttactg cgctacatcg gaggatggtt tgacttggcg gcaactcggc aaagatgagg 1693681 gcatcgacgt ttcgccagat agctgggact cggatatgat cgagtatcct tgcgtgttcg 1693741 atcacagggg acagcgcttt atgctttatt cgggcgatgg ctacggtcgc accgggttcg 1693801 gtttggcggt gctggagaac tgatcagggc tgacaataga tgtttagcgg ctgatgatgc 1693861 gcttcccgct cgaataggct gagaccatta ttgccgcggt agcgatgatt tcccggatta 1693921 tcgtcgtcgc cgcgatcact cactgctcgt cgaggccctt taagggcttc attgtatcct 1693981 tcgcactgct tatcttcatg cgcgcaacgt caggatgcgc gtgagcgcct cgacaacgcg 1694041 gctctgatct acctcctgaa gtccaaccca catcggcaga cggattaggc gggaagccac 1694101 gtcgttggtg acggtcaggt tgccattggt gcggccgtag cgacgcccgg ccggcgaatc 1694161 gtgaagcggc acgtaatgaa agaccgcgcc tataccttcg ctcgtcagac gcgccagcac 1694221 ctcctcccga tcggcgctgg gcgctagtaa cacgtagtac atgtgggcgt tgtgagagca 1694281 gccctgtggg atgatcggac ggcgcaggag cccccgctgt tccaatgatt cgaagctttc 1694341 atgataccgg ttccataggt ccaatcggat acgcgtgatc cgctcggctt cctcgaactg 1694401 agcccataga aaggcagcga ctaattcgct gggcaaatag gaagaccctt tgtcctgcca 1694461 cgtatatttg tcgacctcgt tgcgaaggaa gcggctgcga ttggtgccct tttccctgag 1694521 aatctctgcc cggagcagga agtcttatga gttgacaagc agggcgccgc cttcgccgga 1694581 aatcacattc ttggtctcgt gaaatgagag cgctcccagg tcgccgatgc tgccgagcgc 1694641 ccgcccacga tacgacgcca tcgcgccttg ggccgcgtct tcgaccaccg ccaggttgtg 1694701 gtgcgtggcg atcttcatga tcgcgtccat ctcgcaggcc acgccggcat agtgaacggg 1694761 gacgatggcc ttggttcgcg gggtgatggc gtctacgatg cgagtttcat caatgttgag 1694821 cgtgtcgggc cgaatatcga caaagactgg cacaccaccg cgcaacacga aggcgttggc 1694881 ggtagagaca aaggtgtatg acggcagtat gacttcgtcc ccctcctcta tgtccagaag 1694941 cagcgccatc atttccagcg cggcggtgca tgagggggtg agtagtgcct tgcgacaacc 1695001 ggtctgctgt tcgagccatg catggctacg ccgggtgaag ggaccatcgc cggccaggtg 1695061 gccgcaagaa tgcgcttcgg cgatgtacgc gagctcccgg ccggtcatgt acggccgatt 1695121 gaatggaact ttgtgatctg acactcgacg ccaacttctc aaatcatcga acagggcgct 1695181 gaagtgttcg gtgatcgggg tcgaacatcc accagaattc tccttgtggc cggcggatcc 1695241 ctagcctttt caggtatccc aacatgcctt cactatttct tcatatcttc cgcaactccg 1695301 tgctgggcac cggacggcgc tccgtcttgg ttcctatata gacaccatcc gcgtcagcgt 1695361 cgccaaggag tagggcgccc gctccgacca cacaccgtga accgatggtg atatggtcgc 1695421 gtagcgttgc attgacgcca atgaaagatt gctcctctat taccacgcca ccggatacga 1695481 cgatatgaga cgctagaaaa cagtgatcgt gaatcgtcga gtgatggccg atatgattgc 1695541 cgctccacaa tgtgacgttg ttgccaatcg atacgaatgg ctggatagtg ttgtcttcaa 1695601 gcaggaagac attttcaccg atccgcccat cgttcaagac ggtagcgtgg gagctcacat 1695661 agctggcgag ttcgtagccg agagccttag cggcaagata tttttccttc cgcacaccgt 1695721 tcagtttggc gtaggccagc gccacgaaca tcgcgtggga ctccggcgga aagcgttgtg 1695781 cgacctcgtc gaaggccact aaaggcaggc cgcaaaactc ggacacgctt gcatagtctc 1695841 ggtcgactgt gaacgcgacg acctcatatt ccgaatccct tgtgaagtag taatgtgcga 1695901 gctgagcgat gtcgccgctc ccaaaaatta ccaatggttt ggtcatgacg ccttcctaac 1695961 cagaattgtg aattcataca agccgtagtc gtgcagaagc gcaacactct tggagtacct 1696021 gcgcttgcag agatcaaata gggcgcatgg gtcagcatag tacaggtcgt cgcgcatctt 1696081 tgatgcatcg gaataagatg tcaggcaatt aaaagagaag ccacggcgac tcgcggcatt 1696141 cagcatgtcg agcgtcgctt cgatgtgagc gcaccattcc gtgtccaacg atttcagacg 1696201 aacattgaat attccactcg cgacgctata gtccgcctcc cgatctatgc gcgccgcgca 1696261 gatgaagtct gcgttcgccc gaccttcgaa acgtagtgcg gccgcgcgca ccatttcggg 1696321 ggagacgtcg atgccggtgt aatcagtttt gaagccacgc gcatctaggt agtccagtag 1696381 agccccatag ccacagccta gatcgttgat cgaaaatggg tccgccgcat tgacaatgcg 1696441 caccagctgg tcaaagcgca acgcctgccc ggcttcgccg ttccaatcga cgccgcgcgg 1696501 gtgccgtgtg cttcgagttt cgatgcgtag taacgggcca cgtcagcgag catggtcgtt 1696561 gcgtcttccg ccatgaagct gcctcacgat ttgtgtgtgt gggcgtcggt gcgtgggtcc 1696621 gagactatac cttcaacagt tgcatgccga ggctgcggcg ggcaatgacc caaaaacccg 1696681 ccggcacggt tcgccgagca aggaagcgtg gagacgatag ataatttcac tggcgacagt 1696741 acctcaaata gtccggagcc tcggctccga cgttaaagag cagatccaga atcgacacgg 1696801 cgggctcgaa ccctccccac aattgcttat aatcgcggta gccgtcataa tcgaaccaag 1696861 ttacccggat gctaagttcg tcgaacacgc gctcatcgac atacgaacgg gctgaggggc 1696921 cagagacata ttcggtcgct gcggcctgtt ggcagaggtt ggccagtctc tcggtcttgc 1696981 cgtcggctaa ttcgtagtcc cacgaatttg ccagtcgcgt gctgataccg agataactgc 1697041 aaatcgcatt caatagacgc ctgttgagta aggaaagatt cgtgtgctgt tcttcgaggt 1697101 aaatcggcgc gagccagtca gcgatctccg caaaatgagc ggccgcgctg tagttgaatt 1697161 ctagtgcccg ccagtgcgct ttcgcccaat cggtgccgtc gatcagcgtc tcacgtatct 1697221 tttgatggaa acgtcccttc acctggacgg gaacagttat ccactgtaac ccctggctcg 1697281 ttttgatccg atttctgttt cgccaatcac gcttggtata ttgcatgtca tcatagatga 1697341 tgaattcatc gacgaatgca atcaggtcaa aatatcctcg ccaaggtatg taatttgatt 1697401 gaacaatcgc gactttcttc aacgcggtgt ctccaattta gaataacaaa tacgtcgcgc 1697461 ccgcgacagc tccgctggag cgagttcaag cgattctgcg acatattcaa tatggtgctc 1697521 gggaaggcca ggatgggccg cgacccgggg cgtccggtgc gcgatgaacg tcgcatcgtc 1697581 tcctgtgaga taattgcatc cgatcatata gggctggctg cggctaggtt gctggcaaaa 1697641 agatatcgcg gccgatccgt ttctggtttt gtcttgatga tcaaatccgc ttccgttcac 1697701 gagatcgatt cctggtcttc ccccagcgtc gcgatgtcga taggtgtcgc gctttgttcg 1697761 tacccgcact acgcggcggc gagaacctcg ccaccgaatc gggattgggg ggaggatacc 1697821 actcggtcga ggcccgtcac cggccttcta gcgggttgac catcagtgtt tgcagggccc 1697881 tatcccggta tggcgcacca cgggatcggc agcgttccgg ttgctggcgt ggtacctcgt 1697941 tgtggcgccg tggtccatgt cgattgagtg cgtggatcag tgtaaaccgt tgcgcgccat 1698001 gttctgtagg cactggttcg ggttgtggtt aggctgcacg gttggcaggt taccaaccac 1698061 tgagcccctg ggcggatgtg agctcggact ccgcctatgg ggtgtaattt tggcagattg 1698121 ggccgggtcc ccgtggtgag gactcctcaa ccggattggg taagcatgag gtggtgctgg 1698181 cagcggtgtc ctggtcgctc tcccgagtag gcccgttgtg actgtcatgt gggcgagcgg 1698241 gtttgcgcgc gtaggagacg atgattacta cgcacgtgac caaccacaag aacggtgccc 1698301 atgtcaccgt ggtgaaaacg agtggcgtgg taccgactac ccctttggct cccagctgtc 1698361 catagagcgg cacgtagaac ggctggcccg ggaccgcgac gttgacgatg ctcagcgcca 1698421 cggccaaact cacgcagacg ccgaccgcgc ggcggcggtc tccatgggct gcgagttggt 1698481 cgaatatccc agcaccagga ggcccgttgg ggtctcgggc taccagtgca gcgattggca 1698541 agacgaaaac gagatagtag aaggcgacgt ccgcggggga gaaggtggcg gtggcgagca 1698601 acacaatccc caccatgaca ggcgggatac ggcgtccgag cgccagcacg gcgaccacga 1698661 ctatgactag gacagcaaac ccgatctgcg ttcgcggacc agtgaggaaa ccctctggga 1698721 tcttgcccga ttgatagttc ttgatgctat cggggatcag caggagtgcc ttgccaaagg 1698781 acacgttccg cgggtctcga agccctccga acgaactatt gaacttgatg atgccgtgga 1698841 tcgactgtgc gatcgtcccc gggaagcctc gtggccacaa cagaaaggct gcgatattgg 1698901 acaccaccac gccggtgatc ccgataccag cccaccgcca ttgtcgagcc gccaacaaca 1698961 ccacgccgag aacgacgaac tgcggcttta ccaggacggc caagatcacc gtgatggtgg 1699021 cgaggcccca ccgctgtcgg gacaacgcca cgaagtaagc cagcgcgatc ggtaccacga 1699081 accctgtcga gttgcctcga tcgatgaccc cccacgccgg gatggccgcg gcgcccagtg 1699141 tcacgaagat gaccactcgc tccagaccac gtgccccccg ggccgcccag atggcgggag 1699201 atatgaccgc catcgttagg gcgaccaggt aacagatcag ccccaagcgc ggcgcaccca 1699261 gccaatggct gggtagtccg aaaatcgcat acggtatgcg ggcgggggcc catgcagcaa 1699321 ccgcggtcgg ctggtaatcg gcgggtagcg agatcaggta gtccgcggga ttgggttgaa 1699381 tcccggcggc ggcgaccatg gcgtagtcgc tgaagcagtg ccgaccgata ttcatgcccc 1699441 aatcaagcca acagtcccca gggactacca aaagagtgga aaagacgtcg accgcgtacc 1699501 actgactgag ggcgtacgcc gtcgccgccg aaatcaccga cgccagcagg atggtgccga 1699561 gcatgagggt gcgctcggat tgggagccga tcgcccagag ccgctcccgg ctcgcggtca 1699621 cggcaccgcg caacacctcc gggggtcgct tcatctggat tctcctcggt tctgcgcgaa 1699681 acggtagcag agcgccatgg ttgccaacgc ggtcgccggg cagtctagac cggatcttcc 1699741 tcgtggcaac cgacaacagg acgtcgttgc cgaaagggcg ctgggcaccg acatctagga 1699801 tgaacccaca gccacgcccc gacgttatgc catggcgaag agcgaccggc aggagcggga 1699861 acccagtgaa gcgagcgctc atcaccggaa tcacaggacc ggacggctcg tatctcgcta 1699921 agctcccgct gaagggatat gtggccgctg gtagcccggc cgaggtctat ttctgctggg 1699981 cgacacggaa ttatcgcgaa ttgtatgggt tgctcgcggt caacagcatc tggttcaatc 1700041 acgaatcacc gcgtcacggc gagacattca tgactcgtaa tcctgcacca tatcgcggtc 1700101 ggcaacgagg cgctgatcga tgcgcagacg ctgatgcgcc ggcccacccg gataggtatc 1700161 agtattgggg cgttccggcc agcgtacgag gcgtgatcga ccgcgcaatg ggtgtttgcg 1700221 ttgagtaata atctgaaccg tgtgaacgca tgcatggatg gattccttgc ccgtatccgc 1700281 tcacatgttg atgcgcacgc gccagaattg cgttcactgt tcgatacgat ggcggccgag 1700341 gcccgatttg cacgcgactg gctgtccgag gacctcgcgc ggttgcctgt cggtgcagca 1700401 ttgctggaag tgggcggggg ggtacttctg ctcagctgtc aactggcggc ggagggattt 1700461 gacatcaccg ccatcgagcc gacgggtgaa ggttttggca agttcagaca gcttggcgac 1700521 atcgtgctgg aattggctgc agcacgaccc accatcgcgc catgcaaggc ggaagacttt 1700581 atttccgaga agcggttcga cttcgccttc tcgctgaatg tgatggagca catcgacctt 1700641 ccggatgagg cagtcaggcg ggtatcggaa gtgctgaaac cgggggccag ttaccacttc 1700701 ctgtgcccga attacgtatt cccgtacgaa ccgcatttca atatcccaac attcttcacc 1700761 aaagagctga catgccgggt gatgcgacat cgcatcgagg gcaatacggg catggatgac 1700821 ccgaagggag tctggcgttc gctcaactgg attacggttc ccaaggtgaa acgctttgcg 1700881 gcgaaggatg cgacgctgac cttgcgcttc caccgtgcaa tgttggtatg gatgctggaa 1700941 cgcgcgctga cggataagga attcgctggt cgccgggcac aatggatggt cgctgctatt 1701001 cgctcggcgg tgaaattgcg tgtgcatcat ctggcaggct atgttcccgc tacgctgcag 1701061 cccatcatgg atgtgcggct aacgaagagg taatgacatg gcgcaagcga catcgggcat 1701121 tcgcgcggca ctttcgcaac ctgctgtgta tgaggcgtat cagcggattg cgggcgctaa 1701181 aagcgggctt gcgtggatca caaccgaccc catccagtcg ttgccaggca tgcgtactct 1701241 cgacctcggt tgctggccag cggtgataca cagctccccg ccagtggacg tgacatgtac 1701301 gagagacggc atgagcgcgg aatgtgcgac cgtgccgtcg agatgaccga cgtcggcgct 1701361 acggcagccc ccaccggacc tatcgcgcgg ggcagcgtcg ctcgggtcgg cgcggcgacc 1701421 gcgttggccg ttgcctgcgt ctacacggtc atctatctgg cggcccgcga cctacccccg 1701481 gcttgttttt cgatattcgc ggtgttttgg ggggcgctcg gcattgccac cggcgccacc 1701541 cacggcctcc tgcaagaaac gacccgcgag gtccgctggg tgcgctccac ccaaatagtt 1701601 gcgggccatc gtacccatcc gctgcgggtg gccgggatga ttggcaccgt cgcggccgtc 1701661 gtaattgcgg gtagctcacc gctgtggagc cgacagctat tcgtcgaggg gcgctggctg 1701721 tccgtggggc tactcagcgt tggggtggcc gggttctgcg cgcaggcgac cctgctgggc 1701781 gcgctggccg gcgtcgaccg gtggacacag tacgggtcac tgatggtgac cgacgcggtc 1701841 atccggttgg cggtcgccgc ggcagcggtt gtgatcggat ggggtctggc cgggtacttg 1701901 tgggccgcca ccgcgggagc ggtggcgtgg ctgctcatgc tgatggcctc gcccaccgcg 1701961 cgcagcgcgg ccagcctgct gacgcccggg ggaatcgcca cgttcgtgcg cggtgccgct 1702021 cattcgataa ccgccgcggg tgccagcgcg attctggtaa tgggtttccc agtgttgctc 1702081 aaagtgacct ccgaccagtt aggggcaaag ggcggagcgg tcatcctggc tgtgaccttg 1702141 acgcgtgcgc cgcttctggt cccactgagc gcgatgcaag gcaacctgat cgcgcatttc 1702201 gtcgaccggc gcacccaacg gcttcgggcg ctgatcgcac cggcgctggt cgtcggcggc 1702261 atcggtgcgg tcgggatgtt ggccgcaggg cttaccggtc cctggttgct gcgtgttgga 1702321 ttcggccccg actaccaaac tggcggggcg ttgctggcct ggttgacggc agcggcggta 1702381 gctatcgcca tgctgacgct gaccggcgcc gccgcggtcg cggccgcact gcaccgggcg 1702441 tatttgctgg gctgggtcag cgcgacggtg gcgtcgacgc tgttgctgct gctgccgatg 1702501 ccgctggaga cgcgcaccgt gatcgcgctg ttgttcggtc caacggtggg aatcgccatc 1702561 catgtggccg cgttggcgcg gcgacccgac tgatttgtgc cccaggtcga caaatcacgc 1702621 cgtctcgtca gtgagcactc cgtcctcggg tccgatcctt ccaggagacg ttgcaacctg 1702681 atttggctca aattggtgcg caccgagggt cgggcacatc gtagggtcgc aacagtcaca 1702741 tgtgtcactg caccgggcga cacccgatgt cccggctctc agcgacagct gtctgacctg 1702801 tggttttgtt cccaagttgg tcgtggctgt gcgggattgg aggtggcgtg ggggtcgcgt 1702861 cgtatggatt ctcctcctcg gttccgcgcg aaacggccgc aggcgcaatg gtcaccaact 1702921 tggccgcggt ggagtctagc ctcacatttt cctggtcgcc cccgacaacc aggaggtcgc 1702981 tgcagaacgg gcgttcccta cccacatcta ctatgaagcg acagcggcgc cccgctgtga 1703041 tggctgagca tgaccgacag aggcgggaag acagtgaagc gagcgctcat caccggaatc 1703101 accggccagg acggctcgta tctcgccgaa ctgctgctgg ccaaggggta tgaggttcac 1703161 gggctcatcc ggcgcgcttc gacgttcaac acctcgcgga tcgatcacct ctacgtcgac 1703221 ccgcaccaac cgggcgcgcg gctgtttctg cactatggtg acctgatcga cggaacccgg 1703281 ttggtgaccc tgctgagcac catcgaaccc gacgaggtgt acaacctggc ggcgcagtca 1703341 cacgtgcggg tgagcttcga cgaacccgtg cacaccggtg acaccaccgg catgggatcc 1703401 atgcgactgc tggaagccgt tcggctctct cgggtgcact gccgcttcta tcaggcgtcc 1703461 tcgtcggaga tgttcggcgc ctcgccgcca ccgcagaacg agctgacgcc gttctacccg 1703521 cggtcaccgt atggcgccgc caaggtctat tcgtactggg cgacccgcaa ttatcgcgaa 1703581 gcgtacggat tgttcgccgt taacggcatc ttgttcaatc acgaatcacc gcggcgcggt 1703641 gagacgttcg tgacccgaaa gatcaccagg gccgtggcac gcatcaaggc cggtatccag 1703701 tccgaggtct atatgggcaa tctggatgcg gtccgcgact gggggtacgc gcccgaatac 1703761 gtcgaaggca tgtggcggat gctgcagacc gacgagcccg acgacttcgt tttggcgacc 1703821 gggcgcggtt tcaccgtgcg tgagttcgcg cgggccgcgt tcgagcatgc cggtttggac 1703881 tggcagcagt acgtgaaatt cgaccaacgc tatctgcggc ccaccgaggt ggattcgctg 1703941 atcggcgacg cgaccaaggc tgccgaattg ctgggctgga gggcttcggt gcacactgac 1704001 gagttggctc ggatcatggt cgacgcggac atggcggcgc tggagtgcga aggcaagccg 1704061 tggatcgaca agccgatgat cgccggccgg acatgaacgc gcacacctcg gtcggcccgc 1704121 ttgaccgcgc ggcccgggtc tacatcgccg ggcatcgcgg cctggtcggg tccgcgctgc 1704181 tacgcacgtt tgcgggcgcg gggttcacca acctgctggt gcggtcacgc gccgagcttg 1704241 atctgacgga tcgggccgcg acgttcgact tcgttctcga gtcgaggccg caggtcgtca 1704301 tcgacgcggc ggcccgggtc ggcggcatcc tggccaacga cacctacccg gccgatttcc 1704361 tgtcggaaaa cctccagatc caggtcaacc tgctggatgc cgccgtggcg gcgcgggtgc 1704421 cgcggctgct gttcctgggc tcgtcgtgca tctacccgaa actcgccccg cagccgatcc 1704481 cggagagcgc gctgctcacc ggtccgttgg agccgaccaa cgacgcgtac gcgatcgcca 1704541 aaatcgccgg catccttgcg gtccaggcgg tgcgccgcca acatggcctg ccgtggatct 1704601 cggcgatgcc caccaacctg tacgggccag gcgacaactt ttcgccgtcc ggctcgcatc 1704661 tgctgccggc actcatccgc cgctatgacg aggccaaagc cagtggcgcg cccaacgtga 1704721 ccaactgggg caccggcacg ccccgacggg agttgctgca cgtcgacgac ctggcgagcg 1704781 catgcctgta tctgctggaa catttcgacg ggccgaccca tgtcaacgtg ggaaccggca 1704841 tcgaccacac catcggcgag atcgccgaga tggtcgcctc ggcggtaggc tatagcggcg 1704901 aaacccgctg ggatccaagc aaaccggacg gaacaccacg caaactgctg gatgtttcgg 1704961 tgctacggga ggcgggatgg cggccttcga tcgcgctgcg cgacggcatc gaggcgacgg 1705021 tggcgtggta tcgcgagcac gcgggaacgg ttcggcaatg aggctggccc gtcgcgctcg 1705081 gaacatcttg cgtcgcaacg gcatcgaggt gtcgcgctac tttgccgaac tggactggga 1705141 acgcaatttc ttgcgccaac tgcaatcgca tcgggtcagt gccgtgctcg atgtcggggc 1705201 caattcgggg cagtacgcca ggggtctgcg cggcgcgggc ttcgcgggcc gcatcgtctc 1705261 gttcgagccg ctgcccgggc cctttgccgt cttgcagcgc agcgcctcca cggacccgtt 1705321 gtgggaatgc cggcgctgtg cgctgggcga tgtcgatgga accatctcga tcaacgtcgc 1705381 cggcaacgag ggcgccagca gttccgtctt gccgatgttg aaacgacatc aggacgcctt 1705441 tccaccagcc aactacgtgg gcgcccaacg ggtgccgata catcgactcg attccgtggc 1705501 tgcagacgtt ctgcggccca acgatattgc gttcttgaag atcgacgttc aaggattcga 1705561 gaagcaggtg atcgcgggtg gcgattcaac ggtgcacgac cgatgcgtcg gcatgcagct 1705621 cgagctgtct ttccagccgt tgtacgaggg tggcatgctc atccgcgagg cgctcgatct 1705681 cgtggattcg ttgggcttta cgctctcggg attgcaaccc ggtttcaccg acccccgcaa 1705741 cggtcgaatg ctgcaggccg atggcatctt cttccggggc agcgattgac gcgccggcgc 1705801 gtcaatctat ttcgacattc gcgtgaagac gttttcccag aatcgactgt tgtaggcgta 1705861 gaactcccgg ccgcgtaggt aggcatgtga tattcgcctt cccccgaacg ggtagcggcg 1705921 atgaaggtcg cccatgcggc gcagatcacc gaagaccgcg cttggttccc ggtgcgagcc 1705981 gacgcccgtg gtgtcgaact cgcacagcac acaccgaatc gtgaccggct cgcataccag 1706041 cgcggcccgc aatatgaatt cctggtcggc ggcgatcccg aaatcaaggt cgtagccacc 1706101 gatcttggcc accagcgatg atccgaagaa cgatgcttga tgcggaacaa cctgcttgcc 1706161 ggccaggaat ttgcgcaggc tgaaaggtat cgggccgcgc acccgatcga gcccgacgag 1706221 acgatccatc ccgaagcccc acaattcgga caccggtccc ttgccggata gcgcctccac 1706281 ggcctgggct accacgtcgg gcccggaaaa acgatcggcg gagtgcaaga accacaacag 1706341 atcacccgat gcgtgcgcga tgccctggtt catcgcgtcg taccgcccgc cgtcgggctc 1706401 ggactgccaa tacgcgaagc ctggttcaca cccggacagg tatgccacca cgtcgtcgcc 1706461 gctgccaccg tcgattacga tgtgctcgat gcgtccccgg tagcgttgcg cccgcacact 1706521 tttcaccgtg cgctgcaacc cgtcgaggtc gttgaacgag atcgttatca ccgagacggt 1706581 cggagcagac gtcaccgagt tcccctaggt tgctggcggc gattgtggat caccgggtct 1706641 tgataccgat gaaggtgcct cgaagattcg ccgcatagga acctccgagc aacgactcgg 1706701 cgatgcttgg ttccaagttg tcgtactcct ccatcaccag gtcgacgccg acgtctttga 1706761 tggcctgaag taggtgctcg cgttgaatcc agaatgaccg gcgattgtcc caggacgccc 1706821 attttgcggt gtcgcgctgg ccaaacgagc ggtcgtcgga aaactcggta aaccacctac 1706881 cgggaagtcc ctcatgttcg gtgggcgccg agagcatgaa cttcaccggc gccggccgcc 1706941 gcagcaaccg atcggtcaat tgtcgtgccg tcgtgggcaa ccggagccat ttatcgctcc 1707001 ggttgatgat cgagaagtgc gtctggagaa tcagcagctt gttcgttacc gacgagaggg 1707061 tttccaggta ttgcttcgga ttctccaggt ggtagaagag gccgcagcag aagacggtat 1707121 cgaagagccc gtggttggcg atgttgaggg cgttgtcgtg gacgaaccgg agattcggca 1707181 ggttggtctt cgatttgatg tagttgcagg ccgccatgtt cagctcgcga acctcgatcc 1707241 cgaggacctg aaatcccatg cgcgcgaacc cgaccgcgta cccgccttcc aagcagccga 1707301 catcggccag gcgtaggtgg ctcttgtccc cgggaaagac ggtttccaga atcccgcgcg 1707361 ccgagatgaa ccaggacgat tcgtctaacg tgcgcgagga ctccggtatc gtcaaggttc 1707421 cgtcgtcgag gcgaacgttg tgggcggtga attgtaccgc gccggccgaa tgttcctgtg 1707481 ccatcacttg gttagcccct tcggctggtc ctgggtttgt cgacatggtc aggctcgaca 1707541 gccgcgtcgg agccgggagg gccacacatc cacgagcccc ctgcggctcg gcgtcgcggc 1707601 ggcgagcttg cgccactggg tcttgagccg ccgcgcgggt gtcgccccgc ggtgctgcag 1707661 cgccagcatg gcgatccggg gatggcgcgc gatggtttcc tgcagcgcgg cgcgcccctc 1707721 cgggcctgga acgttggcga tctggcgaag gatccagtcg gccatgacgg cgatgagctc 1707781 ctcgcgcgcg gggtctcccg ggaacaggtc gagcatcgcg tcaaacgtcg ccgcatgccc 1707841 cggaccctgc gtcaaccaga actttggcgg gtccaccacc tggttgtgcc acatgccttg 1707901 ggcgtggcgg cgatacacgg ccatggtgtc gggcaacatg gcgatgtcgc catgcaccgc 1707961 gtgccggacg tgcagatacc agtccagggg catgacgtcg gcaggaatgt cgtcgtagcg 1708021 ctcgaggcga cggtacacgg ccgagttggt ctggatgaag ttcatcaaga tcaacgcatc 1708081 caggctcaag ttgccccgca cccgaaccgg ggggaacttc gagtccttgg catggccgtc 1708141 ctcccatatc actcggacgg gatggaagca caccgtcgtc ttggggtgcc ggtcgaggaa 1708201 tgcgacctgt ttgcttagct tcagcggatc gatccagtag tcgtccgcct cgcacaacgc 1708261 gacgtactcg ccgcgagcgg ccgacagggc gccggtcagg ttcccattga ggccgaggtt 1708321 ttcggtcctg aagatcggcc ggaacacgtg cgggtaccgc tcggcgtact cacggatgat 1708381 cgccggggtg gcatcggtcg acgcgtcgtc ggcgacgatg atctccaccg ggaagtcggt 1708441 ttgctggtcg agaaagctgt cgaaggcctg acgggcgtag cccgcctggt tgtgagtggt 1708501 cgagacgatg ctcaccttgg ggcaaagctg gggactcacc gtcggccctt ttcctgcgcg 1708561 gccgcaaggg tattgcgatg gcgaacgtga atcgcctgtg cccgccggcc gtcggccgtc 1708621 gtggcctggt ggtcggcgga cgtacggcac acgctggcga agtatagcga gggtgcactg 1708681 acgttgggct cgaaccgcgt ggcgcgcggt gtgggcgcac cgtctcgagt cggtgctggt 1708741 tggctcgcgg cctacaacgg cgctctccgc ggcgcgggcg taccggatat cttagctggt 1708801 caatagccat ttttcagcaa tttctcagta acgctacggg gcgcgccgtg ccgtagtagc 1708861 gtccccactg atgtggacga tggtgctcct tttggggttg gggatggcga ttgacccggc 1708921 gcgtctggga ctcgcggtcg tcatgctgtc gcggcgtcgg cccatgctga atctgttcgc 1708981 cttctgggtg ggcggcatgg tggcgggtgt cggcatcgcg ctagccgtgc tggtgttcat 1709041 gcgcgatgtc gccttggcgg ccatacaagg cgtggtgtcc gcggccaacg agttcaggga 1709101 agcggtcggg atcctggcgg gtgggcgtct gcacatcgtc atcggtgtca tcatgctgct 1709161 gttggccgcg cgcatggtgg ctcgcgcgcg ggcgcaggta ggggtaccgg tagggccagt 1709221 gggggtagcc gacggtggaa tgtcggccct ggcgctagcg cagcgccccc cgggtcttgt 1709281 tgcgcggctg gaagtgcgta ctcaacagat gctgcagggc gacgttgtgt ggccggcgtt 1709341 cgtggtgggc gtcgcctcgt ccgcaccgcc cttcgagagt gtggtggcgt tgacggtcat 1709401 catggcatcg ggagccgaga tcggcactca gctcggcgca tttgtcgtgt tcaccctcct 1709461 ggtgcttgcg gtcatcgaga ttccgttggt cgcctacctg gcgataccgc agcaaaccca 1709521 gcaggttatg ctgcggtttc aggattgggt acggtccaat cgtcggcaga tctccctcac 1709581 catcctgata ggggtcgggt tcctcttttt gtaccagggc gtgactagtc tctgagtcgc 1709641 catgtggtgc ctggtgatgc atcaagcgtg gtatcggtga acccggcgaa accgcttatc 1709701 tcggtgtgca tcccgatgta caacaacggc gccaccatcg agcgctgtct gcgtagcatc 1709761 ctcgaacagg agggcgtcga gttcgagatc gtggtcgttg acgacgactc gtccgacgac 1709821 tgcgccgcga tcgccgcaac gatgctgcga cccggagacc gcttgctgcg aaatgagcct 1709881 cgcctcggcc tcaaccgaaa ccacaacaaa tgtctggaag tcgcgcgcgg cggacttatt 1709941 cagttcgtac atggtgatga tcggctgctc cccggagccc tgcagacact cagccgacgt 1710001 tttgaggatc ccagtgtcgg aatggctttc gccccccgac gggtggagag cgacgacatc 1710061 aagtggcaac aacggtacgg cagggtccat acccgtttcc gcaagctgcg cgaccgcaac 1710121 cacgggccgt cgctggtctt gcagatggta ttgcacggcg cgaaggaaaa ttggatcggc 1710181 gaaccgaccg ccgtgatgtt tcggcggcaa ttggcgctgg acgccggtgg ttttcgcacc 1710241 gatatctacc agctcgtcga tgtggacttc tggcttcggt tgatgctgag gtcggcggtc 1710301 tgcttcgttc cgcacgagct ctcggtgcgc cgtcacacgg cggcgacgga gaccacacgg 1710361 gtgatggcga ctcggcgcaa cgtgctggac cgacagcgca ttctcacctg gttgatcgtg 1710421 gacccgttgt cgcccaacag cgttcgcagc gccgcggcgc tgtggtggat acccgcatgg 1710481 ctggccatga tcgtggaggt ggccgtgctc ggaccgcagc ggcggacgca cttgaaggct 1710541 ttggcgccgg ccccattccg cgagttcgcc cacgcccggc gtcaactgcc gatggctgac 1710601 tagcagtcgc actctgcctg gccgtcgtcg gagccacaga caattccaac ccatttggcc 1710661 tggcggccaa gatgacattt ttacaaggta aggctagcct taagcgtccg cgtatccagg 1710721 acctcgggtc tgttgcgttg tggttgcctc gcatgcgacg gagtgctctg cgccaacggc 1710781 ccaggtcgtc cgagaaggcc agccttgacc tgtacagctg tggcgacccg aacgttgcac 1710841 agcttggcga cgaatgccga gttggtcgag tcggccgatc tgaccgtcac cgaggatatt 1710901 tgctcgcgaa tcgtgtcgct gccagttcac gaccacatgg ccattgccga cgttgcgcgg 1710961 gtcgttgcgc cgttcgggga agggttagcg cgcggtggtt gacccgacag cgacggattc 1711021 gcccaaggtg agtatcgtct cgatctccta caaccaagag gagtacattc gcgaggccct 1711081 ggacggcttc gccgcccaga ggaccgagtt ccccgtcgag gtgatcatcg ctgacgatgc 1711141 ctccacggac gccaccccga ggatcatagg agagtacgcc gcccgctatc cgcagctgtt 1711201 tcggccgatc ctgcggcaga ccaacatcgg tgtccacgcc aatttcaagg atgtgctgtc 1711261 cgccgctcgt ggcgagtacc tcgcactgtg cgaaggcgac gattactgga ccgatccgct 1711321 gaagctgtcc aagcaggtaa agtacctgga ccggcatccg gagacgacgg tgtgttttca 1711381 tcctgtgcga gtgatctatg aggatggcgc aaaagactcc gagttcccgc cgctcagctg 1711441 gcgccgcgac ctgagcgtcg atgccctgct cgcgcggaac ttcatccaaa ccaactcggt 1711501 cgtgtaccgc cgtcagccga gctacgacga catcccggcc aacgtcatgc cgatagattg 1711561 gtacttgcat gtgcggcatg cggtgggcgg cgagatcgcc atgttgcccg agacgatggc 1711621 ggtctaccgt cgccacgctc acggtatttg gcattccgcg tacactgacc gccgaaagtt 1711681 ttgggagaca cgaggccatg ggatggccgc gacgctcgag gcgatgctcg acctagttca 1711741 cggccaccgc gagcgcgagg cgatcgtcgg tgaggtgtcc gcctgggtgc ttcgcgagat 1711801 cggaaagaca cccggccgac agggtcgcgc cctgcttctg aagtccatcg cggaccatcc 1711861 gcggatgacg atgctgtcgc tacaacaccg gtgggcgcaa acgccctggc ggcggttcaa 1711921 gcgccggctg tccaccgagt tatcgagctt ggcggcgctt gcgtacgcca cccgacggcg 1711981 cgcactcgaa ggtcgggacg gcggttatcg cgaaaccact tctccgccga ccggtagggg 1712041 acgtaacgtc cgcggatcac atgcctagat cttgatagat cgcccgtctg gcctctatgg 1712101 atggagcatg cgggatcgga ccggttgccg ccgactcgac gaccgaaaga gccatcaaat 1712161 agccttgcgg cccatctttg agatctgtca acccgccggt cctgatgtcc tccaggctct 1712221 ggtcgggatg agctagtgcg gttcccgaac tcggcatctt cgtcagtcct ggagagaaac 1712281 aacaccagcg aaggtagtgt gatgtccgtg gtcgaatcct ctcttcctgg tgtgctgcgt 1712341 gaacgcgcca gttttcagcc caacgacaaa gcgctcacct ttatcgatta cgagcggtcc 1712401 tgggatggtg ttgaagaaac tctgacgtgg tcgcagttat atcggcgaac gcttaacctc 1712461 gccgcacagc taagagaaca tgggtcgacc ggcgatcggg cattaattct ggcgccacaa 1712521 agcctcgact atgtcgttag ctttattgcc tcgctgcagg ccggaattgt cgcggttccg 1712581 ctttcgattc cccagggtgg tgcccacgac gagcgcaccg tttccgtgtt cgccgatacc 1712641 gcaccggcga tcgttctcac ggcgtcctcg gtcgtcgaca atgtcgtcga atacgtccag 1712701 ccgcagcccg gccaaaacgc accggcggtg atcgaagtcg atcggctgga tcttgatgct 1712761 cggccgagct ccggttctcg ttctgccgct cacggccatc cggatatctt gtacttgcag 1712821 tacacctcgg gttccacgcg cacgccggcc ggtgtcatgg tctcgaataa gaatcttttc 1712881 gccaatttcg aacaaattat gaccagttac tacggcgtct atggcaaggt cgccccgcca 1712941 ggctccaccg tggtgtcgtg gttgccgttc tatcacgaca tgggtttcgt cttgggactg 1713001 atattgccga ttctggctgg catccccgcc gtgctgacca gcccgatcgg tttcctgcag 1713061 cgcccggctc gctggataca gatgttggca agcaacactc ttgcgtttac cgccgcgccg 1713121 aacttcgcat tcgatctggc gtctcgtaag accaaagacg aggacatgga gggcctcgat 1713181 ctcggtggcg tacacggcat cctcaacggc agcgaacggg tgcagccggt gacgctgaag 1713241 cgcttcatcg accggttcgc cccgttcaat cttgacccca aggcgatacg tccgtcgtac 1713301 ggaatggcag aggccacggt atatgtggcc acccgcaagg cgggtcaacc gccaaagata 1713361 gtgcaattcg atccccagaa gctgccggac ggccaagctg agcggaccga aagcgacggc 1713421 ggcacaccgc tggtcagcta cggcatcgtc gacacccagc tggtgcgcat cgtcgacccg 1713481 gacaccggca tcgagcgccc cgcgggaacg atcggtgaga tttgggtgca cggcgacaac 1713541 gtcgccatcg gctattggca gaaacccgag gcgaccgaac gcacctttag cgcaacgatc 1713601 gtcaatccct ccgaaggcac acccgcagga ccatggctgc ggacgggaga ttcgggtttc 1713661 ctctccgagg gtgagctgtt catcatgggg cgcatcaagg acctcttgat cgtgtacggg 1713721 cgcaaccact ctcccgacga tatcgaggcg acgattcaga cgatcagtcc gggccgctgt 1713781 gcggcgatcg ctgtttccga gcatggtgct gagaagctgg ttgccattat tgaactcaag 1713841 aagaaggacg agtccgacga cgaggcggcg gaacgactgg gtttcgtgaa acgcgaagtg 1713901 acctcggcaa tctcgaagtc gcacgggttg agcgtggcgg atcttgtgct cgtctccccg 1713961 ggctcaatcc caatcaccac cagcggcaag atccggcgag cacagtgtgt ggagctgtac 1714021 cgtcaggacg agttcactcg cctggacgca tagcacccac aggcgaggct cccgcaatgg 1714081 ggcgcaatgg ggatcgtcac accagtagca ccagcccctg gaggggcaac aggggaaaac 1714141 tgagttgagc gccaaccgtg cgcactgagg ctcaggtgct cagcttcgcg tcgggctttg 1714201 accccgcgtg accgactgcg ggttcgccga tagacgtgtc atcccaacgg tcgtagctcg 1714261 gtaggccggc aagaccgaac agcggcagcg agtggccgag tagatggtcg acgggttctt 1714321 taccgatgtg ggcgccgtcg ccgttgggtt tcgacacgcc atccacgaca tcgtaggccg 1714381 gcatggcggc atagccgaac agcggaagcg aatgtctgcg caggtggtcg atcaggtact 1714441 ccccgttgcc ttcggctagg tcagcgggct tgccgttggg aaccgacttg gttgccgcct 1714501 tgcttgcgtg cccgttggtg ttgcggacga ccttggtgtg gggcggcttg ggcgccggga 1714561 tcggggcctt gcgtcggtgg cctttcaccc gccgcagcca ccgatcggct ttggtcggcg 1714621 gcgtggatgg gtcgcgtccc agctcggacg gccaccagtt tgcccggcca atcatcgtgg 1714681 tcaaggccgg caccgtgacc gtgcggacca ggaaggtgtc cagcacgatc ccgatgccga 1714741 tggtgaaacc ggcctgagcc atcgtgttga tgctcgcgcc caccagaccg aacatcgacg 1714801 cggcgaagat gagacccgcc gaggtgataa caccaccggt ggagcccacg gttcggatga 1714861 cgccgatgcg tataccgtgt ggtgattcgt cgcggatgcg tgaaatgagc agcatgttgt 1714921 agtcagcgcc gatggcaacc aataatatga aggacagtcc cggcaggctc caatgcattt 1714981 cctggcccag tatcaattgg aaaacgagag ttcctatgcc tagggccgac aagtaagaaa 1715041 tcagcaccga gcctatcaga tatatcggag ccacaagtgc gcgcagcaga atgacgagaa 1715101 tcaagaatac gataacgatc gtcgcaatga cgatgaattt catatcgctg ttgtagtagt 1715161 cgcggatatc ccgcagcgca gtcggaaccc ccgccagacc tatcgtggca tcctcgagtt 1715221 cggtattcgg tcgcgcggaa tccgcaacac ggaggatatc gttgacctga tccatcgcct 1715281 cggtggtggc cggattcagc gcgctctgca cgaagtaccg cgccgcatga ccatcggccg 1715341 acaggaaaat ctgggcgccc ttcttgaact cgtccctcga aaaaatctgc ggtggaatgt 1715401 tgaagcccgc cattgacggc ttgtccgcat cccgcttgat ccccaacagg aagtcggcgg 1715461 cctcgttgag ccctgagccc atctttttga cctgatcgac caattcctgc acgcctgccg 1715521 ccagcgctgc gctgccgtcg gcgagagcgt tggctccttg ctgcatttga gccaatttgg 1715581 tgggtaggcc gtcgaccgct ttgagggtgc tgacgacttg cttcagttgc ccgtccagtg 1715641 tgctcaccgt ccgggcgagt gtctggtatt cctgcgtctg ttgcagggtg acggctagcg 1715701 ctctgatgga cctgagcagg ccgtcgtcct gcgcctggac aatcgccgcc aactgtgcgc 1715761 gcgacgtccg acaggcggga tcgctgttac acaccgggct ggagttgagg gcgttgacca 1715821 tagggctggc ccaagtggcg atttgttcgg catcggtgac ggtcccgctc agattgtccc 1715881 ccagagcccg catgcgcccg acatattggg acgcattttc cagttgtcgg atggtcttgt 1715941 cgccgcccat caggtccatc atggcctgca gggtgttgac tatcccgctc gagctggcca 1716001 cggccccatt gatttcgttg cgtatttggg cgagggcgtc ggccaactgg tgcgcaccgc 1716061 cggtcagctg gtccagctcg cctccgtgct cttcgagcag ggtggtcgct tcgtcgagct 1716121 tgccgcccac ttcaccagcc tgaaacgaga ccttggtctc cttcagaggt tccccgttcg 1716181 gtcgggtcaa gccccgcacc atcacgatgt tgggcaattc tgctatctcg cgggacatca 1716241 tctcgatgtc ggcaagcgcg ccgggtgtcc gcaggtctcg gggggatttg atgaacagca 1716301 ccatcggagt catcgcgttc atcgggaaat ggcggttcat cgcctcgtat cctttgacgc 1716361 tttcgacgtg ctgcggcacc gtcttgagat cgtcgtagtt gaatcggatc agcagcgtgc 1716421 agccggccag ggcgaccagc acaatgagac tgccgaccag gtggatggtg gaccgacgca 1716481 cgatgcgaac acccgaacgc cgccacattc gactggtcag gtcgcgtcgc ggcttgatcc 1716541 agccccgccg tccggtgagt gtcaggatgg cgggcagcag ggtgaccgca cccagcagcg 1716601 acaccgtgat ggcaaccgca attgccgggc ccaccgccga aaacacttcc agtttggtga 1716661 acaccatcgc cagaaatgtg acggcgacgg tggccgccga tgcggtgatc accttgccga 1716721 tggacatcaa cgccttcttg accgccatgt ccgatttttc gccgtggcgc acatagtcgt 1716781 gatagcgact tatcagaaag acggcgtaat cggttcccgc cccgatcatg accgcgctca 1716841 taaagacgat cgcctgcatg ttcacggcca ggccgaactc ggcgagcccg gacaacgtgc 1716901 cctgcgcagt gaccaccgac gctccgatgg tggccagcgg caccagcatg gtcaccaggt 1716961 tccgatagac gaggatcagg atgatcagca cgctgaccgc ggtgccgatc tcgatgatcc 1717021 gcacatcttt ctcgccgagc tccgtcaggt cggcgaccgt ggcgatcgga ccgctgaggt 1717081 ggacggtcag gctggttccc gcgactgttt gcttgacgat cgcggcgacg cgtttgaacg 1717141 ccgcttgtgt ctcaggcgac gcggcatcgc ccgcgaacgt gatgggcagg ttccaagcct 1717201 tgttgtcctt gctggccaac agctccttca tttcggggac ggcgagaaaa tcctgaaccg 1717261 atattttgtc ctgcgtgtcc gcccgcaggt tttcgatcag ttttcggtag acggcctcgt 1717321 cggcgggtcc cagcccgttc tcgttggtca agaggaccaa aaggagggcg gaggtctcaa 1717381 ttttttcctg gaaagccgcg ctcatctcct tttgcaggac catcgatggg gccccgggcg 1717441 gcaggggagc ttgctcgcgc tttgcggctt gcgcctgcag cgttgggagc aacagcgtca 1717501 gcgcggccgc caccgcgatc cagcacccaa tgacgatcag cggccatcgc accacgaagt 1717561 tgccgatacg gtcgaacagt cccccggctt tggcctcgtc atgccttgcc acccgataac 1717621 cgtacaagcc tggcaatcgg tggcgtgggg aaatgacgat aaccgcatta accgtgacgt 1717681 tgccgttact ttggcggcgt ttgaccactg cgggcgtcaa atacgcagat caggggcatt 1717741 tcgtgggatc ggctggcgtg cccgcagccg acgctggcgg gcgggatgcg gcgtccgaac 1717801 agatagctcg ctggactcaa acttgcacgg tcgtgctggt ttgcggtcac ggtccggcaa 1717861 agtgggcatt tcggtcctgg tgcacctcgc ggtcgtgcga cactctcccc gtggctctta 1717921 ggtatcgcct gcagtccaat ccgttggtcg gcaagctcac gaccaagtac ttcttgccgc 1717981 ttggcactcg ccaggtcggc gatcacgtgg tgtttttcaa cttcggctac gaggaggatc 1718041 cgccgatggc gttgccgctg tcggagtccg acgagcccaa tcggtattgc atccagctct 1718101 accaccagac ggccagtcag gtggacctca ccggcaagga ggtgctagag gtcagttgtg 1718161 gcgccggtgg cggggcctcc tacatcgccc gcaacctagg tccggcctcc tacacggggc 1718221 tggacttgaa tccggccagc atcgacctct gccgggcaaa gcaccggctg cccggcctgc 1718281 agttcgtgca gggcgacgcg cagaacctgc ctttccccga cgaatccttc gatgcggtgg 1718341 tcaatgtcga agcctcgcac cagtaccccg actttcgcgg cttcttggcc gaagtggcgc 1718401 gcgtgcttcg cccgggcgga cacttcctct acaccgattc ccgtcgaaat cccgtcgtcg 1718461 ccgaatggga ggcggcgttg gccgatgctc cgctgcgcac gatttcgcag cgggacatcg 1718521 gcgcgcaggc caagcgtggg ttggatgcga acacggcgcg ttcgcaagag gccatcggcc 1718581 gccgcgcacc cgtattgctg gccggcttga cccgctgtgc ggtgcgtgtg ctggactggg 1718641 atctacgtcg cggcggcggg ttcagctatc ggatctactt gttcgccaag gattgattcg 1718701 gcgagaccac acccatgaaa aactcatgaa atttgtcgtg gccagctatg ggactcgcgg 1718761 cgacatcgag ccctgcgcag cggtcggcct ggagctgcag cggcgcggcc atgatgtgtg 1718821 ccttgccgtg ccgcccaacc tgattggttt cgtggaaacg gccgggctgt ctgctgtcgc 1718881 atacggaagc agggactctc aggagcagct cgacgagcag ttcctgcaca acgcgtggaa 1718941 acttcagaac cccatcaagc tgctgcgtga agcgatggcg cccgtcaccg agggctgggc 1719001 ggagctgagc gcgatgttga cgccggtggc cgccggggcc gacctgctgt tgaccggtca 1719061 gatctaccag gaggtggtcg ccaacgtcgc cgagcaccac ggcattccgt tggccgcgct 1719121 gcatttttat ccggtgcgag ccaatggcga gatcgccttt cccgcgcggc tgccggcgcc 1719181 actggtccgc tccaccatca cggccatcga ctggctgtat tggcgcatga cgaaaggtgt 1719241 tgaggacgcg cagcggcgtg aactgggcct gccgaaggcg tcaactcccg cgccgcggcg 1719301 aatggccgta cgcgggtcgc tggagatcca agcctacgac gcgctttgct tcccggggct 1719361 ggcagcggaa tggggcggcc gacgcccgtt cgtcggcgcg ttgacgatgg aatcggcgac 1719421 cgacgcggac gacgaggtcg cttcatggat cgctgccgat acaccgccga tttatttcgg 1719481 ctttggcagc atgccgatcg gatccctggc cgaccgggtc gccatgatca gtgcggcctg 1719541 cgcggagttg ggcgagcgcg cgttgatttg ctcgggaccc agcgatgcga ccggaatccc 1719601 gcagttcgat cacgtgaagg tggtgcgtgt ggtcagccac gcggcggtct ttcccacctg 1719661 ccgtgcggtc gtccaccatg gcggcgcggg caccaccgcc gccggtcttc gagccggtat 1719721 ccccaccttg attctgtggg tcacctccga ccagccgatc tgggctgctc agatcaaaca 1719781 gctgaaagta ggccggggga gacgcttttc aagcgccacc aaagaatcgc tgattgccga 1719841 ccttcgaacg atacttgcgc cggactatgt cacccgagcg cgggagatcg cgtctcggat 1719901 gaccaaaccc gccgccagcg tcacggccac cgccgatctg ctcgaagatg cagcccgccg 1719961 tgcgcgctaa gcgagggtgg cgcttcggcg aatggccttc ggcgcgagga tgatcgttgt 1720021 acgctccgct tgtgtccctg atgattacgg tgccggtgtt tgggcagcac gaatacaccc 1720081 acgcactcgt ggccgacctg gaacgtgagg gcgccgacta tctcatcgtc gacaaccgcg 1720141 gtgattatcc taggatcggc accgagcgag tgagcacacc gggagagaac ctaggctggg 1720201 ccggggggag cgagctcggt ttccgacttg cgttcgcgga gggttactcc cacgcaatga 1720261 cgctcaacaa cgacacccgg gtctcgaagg gatttgttgc cgcgttgctc gactcgcggc 1720321 taccggccga cgccggaatg gtcgggccga tgtttgacgt gggttttccc ttcgcggtag 1720381 ctgacgagaa accagacgcc gaaagctatg ttccgcgagc gcgataccgg aaggtgcccg 1720441 cagtcgaggg aacggcgctg gtgatgtcgc gggattgctg ggatgcggtc ggcggcatgg 1720501 acctgtccac gttcgggcgc tacggatggg ggctcgacct ggatctcgcg ttacgggctc 1720561 gaaagtccgg gtatggcctg tacacaaccg agatggccta catcaaccat ttcgggcgca 1720621 agaccgccaa tacgcacttc ggtgggcacc ggtatcactg gggtgcaagt gcggccatga 1720681 tccggggatt gcgtcgaacg catggctggc ccgccgctat gggtatcttg cgggagatgg 1720741 ggatggccca tcatcgtaag tggcacaagt catttccgct cacctgcccg gcgagctgct 1720801 aggcgtgctc ccaggcgttt ggcgtgccgt cgcctccagc aggtccgcgg ccgcggtgac 1720861 ggcggctgtc ggccgggtca tccgtgtcga gatctcacgt gcccgcgcgg cgcattccgg 1720921 cgccaggatc gatcgtagct ccttgagcaa tgacccgcgg gtgatgttcg taaagcgttt 1720981 ggcagagccg actttgagtc gttggacggc accggcccag atcggttgat cggccacgtc 1721041 ccagagaatc agcgtgggca ttcccgctcg caggccggcg gcggtggtac cggcgccacc 1721101 gtggtggacg accgcgcggc acttgggaag gatggtcgaa tagttgacca ggccgacacg 1721161 tttcacgtgg tcggcatgac gaatgcgggt ggagttggct gccggagaat agatcagggc 1721221 tcgctcgccg agctgtgcgc agacatcgga gatcatggcg agcgtttgga cgggcgtttg 1721281 gacgggcgtg ctgccgaagc cgaagtagat gggtggtgtt ccggcggcga tccacgactc 1721341 gagttcttcg ttgggttcgc tgtgtaactc catggtcagc gggccgacaa acgggcggcg 1721401 gtcgctccat tcggccgcca gtccggggaa aaaaaccggg tcgtaggctt ggatttcggg 1721461 cgctccgcgt tccgccagcc gacgcaccgc cggcgccggt gctggcggta ggcccagttc 1721521 acgtcgttgc gcgcgatcgg catccttgct gacgtacgca tacagccgcc atgagacctt 1721581 catcgtcgcg cgcaccagag tcgccggcgt cggtatcgac gggatcgcga tttggccgtt 1721641 gacctgcatc ggaaagtgat gcagtgccgc agccggaatg tcgtagtact cggcgacgtt 1721701 ggctgccaca ccatgatatg tctggcccgt catcaccagg tcggcgccgt cggccaacgt 1721761 ggtcaacgtc gtgcccatct ccgcccagcc ttcgacgaat agttccttga cggcgcgggc 1721821 gaggttgagc ggattctggg ctctggtgag gttgcggacg aatgccgcga ccgtgttgat 1721881 ctgttcgtcc gagtccgggc cgtaggcgac gccggtcaga cctgccgact cgacgaactc 1721941 gatcaggttg ggcggcactg ccatatgaac tgcgtggcct cgccgccgca gctccacgcc 1722001 aaccgcggcg caaggttcga catcaccgcg ggttccgtgg accgccaaga caaacttcat 1722061 cagcgccttc ccgcgttcga cgtcaggcgg gtgccggcgc gtccctgtcg gccgccaact 1722121 tgtcgcacat cagatccgcc aggccacgaa cggtggtgtt gatttcggtg gcggaaatgc 1722181 ggatcccggt ttcggcttcc acccgcgcac gcagttcctg gctgctcagt gagtccaggc 1722241 cgtactcgct gagcagccgg tcggtgtcga tggtgcggcg taggattagg ccgacctgct 1722301 tggagagtag ccgccgcagc cggtctggcc attcctcgcg gggcaggtcc accagctcgg 1722361 caaggaattt gcttgtgcct gaacggtttt gccccaggga ttggaacttc tccgcgaatg 1722421 ggctgtgctg ggcgaaggct gtcagccagg gtgatccgat caccggggcg tagccgctgt 1722481 aggcgcggtt gtggcgcagc agggtctcga aggcgtaggc gccttcctcg ggggcgatgg 1722541 cgtcgccggt ttgttcggca aaggcgatcg cgcggccgat ctggccccag gcgccccagg 1722601 cgatggaggt ggctggtagg tcttgggctc gccgccagtg ggtgaaggtg tccagccagc 1722661 tgttggccgc ggcgtaggcg ccctgacccg gcgagcccac cagggcggcc gctgaggaga 1722721 atgagcagaa ccagtccagc ggctggtccg cggtggcccg gtgcagttgc caggcgccat 1722781 atgccttggg cgcccagtcg cgttcgatga gttcgtcggt gatgttggcc aaggtggcgt 1722841 cctcgaccac cgcggccgcg tgcagcacgc cgcgcagcgg caaacccgtc gcggtggccg 1722901 ccgtgaccaa ccggtcggcg gtgtccggct gggcgatatc gccgcactcc accactacgt 1722961 cagacccgat cgcgcggacg agttcgatgg tctccaacgc cttttggctg ggctgtgagc 1723021 gcgagctgag cacgatgcgg ccggccccgg cgttggccat cttctcggcc aggaataagc 1723081 ccagcccacc caggccaccg gtgatgatgt aggacccgtc tgaacggaaa acccgagcct 1723141 gttcgggggg aagcaccacg ctgctgcgcc cggcgtgggg gacgtcgagg atgagcttgc 1723201 cggtgtgctc ggccgcgccc atcacccgga tcgcggtggc cgcctcggcc agcgggtaat 1723261 gggtgctctg cggcatcggc agcacaccct cgacggtcaa ccgatacacc gtgctcaaca 1723321 gttcgcggac cgcagccgga tggctcaccg acatcaaccc caggtctaga ccgtagaacg 1723381 ccagattgcg ccggaatggc aagagttcca gtcgggtatt ggagtagatg tcgcgtttgc 1723441 cgatttcgat gaagcggccg cccagggcca gtagtttgag gccggccaac tgtgcggcac 1723501 cggtcacgga gttgagcacg atgtccacgc cgtagccggc ggtgtcgcgg cggatctgct 1723561 cggcgaactc gacgctgcgc gagtcataga cgtgttcgat gcccatgtcg cgcagcaggt 1723621 ctcgacgctt ttcgttgcct gcggtggcgt agatctgggc tccggccgca cgcgcgatcg 1723681 cgattgcggc ctggcccact ccgccggtgg cggagtggat gagcaccttg tcgccggcct 1723741 tgatccgcgc caggtcctgc agcccgtacc acgcggtggc gctggcggtg gtcactgccg 1723801 cggcttgggc gtcggtcagc ccctcgggca gtctggtggc caggcgggcg tcgcaggtga 1723861 cgaacgtggc ccagcagccg ttgggtgaca tgccgccgac ccggtcaccg accttgagtt 1723921 cgctgacccc gggcccgacc gcgctcacca ccccggcgaa atcggtgccc agctgcggct 1723981 gtcgcccgtc gagggtttgg tagcggccga aggtgaccag cacgtcggcg aagttgatgc 1724041 tggacgcggt gacggcgacc tcgatctctc ccgggcccgg cgggacccgg tcgaacgcgg 1724101 cgaactccaa ggtttgcagg tcaccgggag tacggatctg taggcgcatg ccggcctcgg 1724161 cgtggtcgac gacggtggtt tgccgctcct cggggcgcag cggggctggg cacaaccggg 1724221 cggtgtacca ctggtcgttg cgccaggcgg tctcatcctc gccgctggcc gccagcagct 1724281 gacgcgccac cgactccgcg ccggtctgct catccacatc gacatagctg gccttcaaat 1724341 gcggatgctc agcaccaatc acccgcaaca acccccgcat cccaccctgc tcaagattgg 1724401 gtcggtcacc agacaacacc gcctgagcat tgtgggtcag cacatacaac cgcggctctt 1724461 gggccgtgat ctctggaatc tcgcgggcga tacgcaccac atgtttgaca agctcgccgc 1724521 cgcgcacggg ggattccgcg tcggggtcgc cggtctgcgg cgcggtcaac acgaatacgc 1724581 cggtgaaccc gccggtgccg agctggtcgc gcagccgcgc ggcctgggct gcgtggtcgg 1724641 cgcgctgcgg ccaggacatc gttgtgcact gcgcgtcgtg caccttcagc gcgtcggtca 1724701 actgtgcggc caccaaatcc gtagcgtcac acgtgctgat cagcagccag gcgccgggtt 1724761 cggcgtggct gttttcgggc agctcacgtt cgtgccattc gatgctcagc agccgctcac 1724821 ccaaaacccg ggcacgttcg ctggcctgcg acgcgccggt acccaactgc agcccacgca 1724881 ccgccaacac caccgcgccg tgctcgtcca acacgtccag gtcggcttcc acgcccacac 1724941 cgcacgcggt caccgtcgtg cagcagtacc gggcatgacg ggccgaccca taggaccgca 1725001 accgccgcac acccaacggc agcaacaaac caccgtcggc cataccctgg acggcgggat 1725061 gagccgccac cgactggaag cacgcatcca gcagcacggg atgcacgccg taagctttga 1725121 cctgcgagcg aagcgggccc ggtaggttga cctcggccag caccgtgtcg ccggcccctt 1725181 cggcgatgta cgcgtcaacc agacccgcaa aagccggccc taagcgatga ccacgcttgt 1725241 ccagccattg ccgaacctcg gcgccgtcca ccttgtgggg atggctggcc agcagttcgg 1725301 cgatgttttt ctggggtggc tggtccgggg cgtcgtcggc ttcccggaca acgtgcagaa 1725361 ccgcggcgag ttgccgtgtg tacctaccat catggctggt ctctactgtg agtgggacaa 1725421 cgccgggggc ttctaccgtg gcggtgacgc cgatgggggt ttcgtcgtcg agcagcaaca 1725481 tctgctcgaa tcggatgtcg cggacttcgg aggcttcgcc gaggacggcg cgggctgcgg 1725541 ccaacgccat ctcgcagtag gcggctcccg gaagggcggc cgcgccgtgg atttggtgat 1725601 cggccagcca gggttgtgtc acggtgccga cctcgccctg ccagacgtgg cgttccggct 1725661 cctcgggcag gcgcacgtgg gagcccagca atgggtgtac ggcgacggta ttggcgtggg 1725721 cgatgcgacg ggtcgtgtcg tcgagcagca gacgacggtg gttccatgtg ggcagtggtg 1725781 cgttgatcag tcggccggtg gggtagagca cggcgaagtc gacggcggcg ccggcggcgt 1725841 agaggtcgcc ggccagtgcg cgcagcccgt ggggcagtgg ttgttcgcgg cgcatgccgg 1725901 ccagcgcagc cgcggacatg tcgaggctgc gggcggtctg gtcgaccgcg tgggtcagca 1725961 gggggtgggg ggtcagctcg gtgaagaccc ggtagccgtc ttcgagggcg gcttgcaccg 1726021 ccgcggcgaa gcgtacggtg tggcgcaggt tgtccaccca gtagtaggcg tcgcagtagg 1726081 gctcctcgcg cgggtcgaac gaggtcgccg agtagtaggg gatttccggt tgcagcgggc 1726141 tgatttcggc gagcgcttcg gccagttcgt cgaggatcgg gtcgacctgc ggggagtgcg 1726201 atgctacgtc gacggccacc tcacgggcca gcacgtcgcg ttgctcccag gcggccacca 1726261 ggtcgcgtac cgtctgggtg gccccgccga tcacggtgga ctgcggggag gccaccaccg 1726321 cgaccacggc gtcgttgacg ccgcgcgcca tcaactccga aagcacttgt tgagcaggca 1726381 gttccaccga tgccatggcg ccggcgccgg cgatacgggt catcagcgcc gaccgccggc 1726441 agatgacgcg cactccgtct tcgaggcaga gcgcgccggc gaccaccgcg gccgcggact 1726501 cgcccagcga gtggccgatg accgcgccgg gcgctacgcc gtaggacttc attgtggccg 1726561 ccagcgcgac ctgcatggca aacagggtcg gttgcacccg gtcgatgccg gtcacgacct 1726621 cgggggcggt catggcttcg gtcaccgaga agccggattc cgcggcgatc agtggttcga 1726681 tcgcggcgat ggtggcggcg aataccggtt cggtggccag caggtcggcg cccatgcccg 1726741 cccattgcga gccttgcccg gagaacaccc agaccggtcc gcggtcgtct tggccgaccg 1726801 cgggtgggta ggggggttcg ccggtggcga cttcccgcag cgcctcggtc agctccgcgg 1726861 tggtggcggc cagtacggcg gtgcgcaccg gccggtgtcc gcgccggcgg gccagggtgt 1726921 aggccagatc cgccggcgcc agctcgggtc cttgggcgtc gacccaatcg gccagccgcg 1726981 cggcggtctg ccgcagcgcg tcctgcgagc tggccgacag cgcgaacagc agcgcgccgt 1727041 cgataccggg tgtggccggg gtgtcgcctg gtgcaccgga ttcgggggct ggcaccggtg 1727101 cctgctcgac aatggcgtgc acattggtgc ccgtcatgcc atacgacgac accgccgcgc 1727161 gccggggcgt ttcttgatcg gcgccgggcc acggcgtaat ctcttgcggc acaaacaggt 1727221 tggtttcgat tgcggcaagc ttgtcaggca gggccgtgaa gtgcagattc tgtgggacca 1727281 cgccgtgttg gagggccagg accgccttca tcagtcccag cgctccagcg gccgactggg 1727341 tgtggccgaa attggtcttc accgatgcca gcgcgcaggg gccgtcgttg ccgtatacct 1727401 cggccaggct ggcgtattcg atggggtcac ccaccggggt gcccgggccg tgcgcctcga 1727461 ccatgcccac cgtagccggg tccacaccgg ccacatccaa cgcctcccga tacgccgcga 1727521 cctgcgcgga ccgtgatggt gtcgcgatat tgacggtgtg gccgtcttgg ttggcggccg 1727581 tgccacgaat tacggccagg atccggtccc catcggccag cgcatccggc aaccgcttga 1727641 gcgccaacat gacacaaccc tcaccggaga cgaaaccgtc cgcggaaacg tcgaacgcat 1727701 gacagcgccc ggtcgcagac aacatgccca acgccgagcc cgaggcgaac cgccgcggtt 1727761 cgagcatcac gtagacaccg ccggctagcg caatgtcgct ttcgccgtcg tgcaggctac 1727821 gacaagccag gtggatagcg gtgaggccag acgagcatgc ggtatctacc gtgatcgcgg 1727881 gaccctgcaa gcccatggcg tacgccaccc gcccggatgc gaagcaggca ttggtgcccg 1727941 tgttgccgta cggcccttcg aaagtctggt tgtcggcgtg taccaatatg tagtcggtat 1728001 gaaccaaccc cacgaaaacc cctgtccgcg aggccatctg gttcggtgtt aggccgccgt 1728061 gctccatggc ttcccaggag gtttccagca acaagcggtg ctgcggatcg atcgctatcg 1728121 cttctttctc cccgatcccg aagaactcgg gatcaaagtc gccgacgtta tcgaggtacg 1728181 cgccccattt gcagtcggtg cgtccgggca cgccgggttc ggggtcgtag tactcgtcga 1728241 tgtcccagcg gtcggcgggg atctcggtga ccagatcgtc gccccgcagc aacgcctccc 1728301 acaaccgatc gggtgagtcg atgccccccg gcagccggca ccccatacca atgacagcta 1728361 ccggcgtaac acgtgtccta tccacggtct ttgttctctc cttacccacg gttcaagctt 1728421 ttgccagcgg cgtatcgtcg aacttcggtc cgggttgata gaaccgcagc accaaacgca 1728481 cccaccgacc cccacgcttc acgccaaccc tttagttcat tggcgtgaac agcagcgtag 1728541 ccggttgccc cgatatatgt ggaaaaatcg ttcggacgta caaaaaaagt tcctgacgct 1728601 ggcgtcaact cgaaactgcc tcggaagtca tgattgattc atcagtcaat attaaagtcg 1728661 cagctcacaa ctataatacg ccggtgcagc ggacaattgc ggaagcgccg gacgcctcgc 1728721 ggtccgatgt cgcctttccc tgcctcgtcg tcaatatctg atggtggacg accgcccgtg 1728781 ccggaccggc ttaggtagcc agccgggctt cgcgccacgc aatttgccta gtcgtgaaag 1728841 acggattgcc gaagtgtcga aggcaacccg aactccgatg ttcaggttat gccaattggt 1728901 gcccggaaat ccccgaaatc gaaaatgtta cgtgcaggtt tcactggacg gatcaaggcc 1728961 gtcgtcgctg aagctgggcg gctggggcga catcgcgcga tccgccctcg gcgatgcgca 1729021 cgtacgccga ttgcatcgtc tctggatgcc gcgcgatcga gccctgcgcg atcggactac 1729081 tgggggacaa cgcggtgacg gtgctctcct cgtgaaactt gttgacccac atgcacgctt 1729141 gcgcgccgat ccagccatcg ccgaatactc tggcattcat ccggtccagt tgtattgcga 1729201 tgaccgcaga cagcagaagc gcgccggccg gcatcgaggc acgacgggaa cggaagccgc 1729261 cacctagagg atccaacgag catctatgct tttcccttcc cacggccgcg cgtgaggcat 1729321 cctcgctgtg cagcaccgcc aggtcaggga tcaacgcgcc gactatttct ccgtcgatgt 1729381 ggctggactg cacctgctcc gtctctcttt gctgccacca gcgccaggtt ggttgtggaa 1729441 gctgagtcac cgtcgggcga aaccgtcagc gttgacgaag cgttagaggt agtgtgctgc 1729501 cgtggtcgcg tcttcgattc ccaccgcgct gcgcgagcgc gccagtgtgc accccaatgg 1729561 tgcggccatc acctacatcg attacgagca ggactgggcc ggtgttgccg aaaccctgac 1729621 ctggtctcag ttgtatcggc gaatgctcaa tgtcgccgag ccgctccggc atgtgggggc 1729681 gaccggtgat cgggcagtga tactggcacc gcagggaatc gaatacgtcg ttggatttct 1729741 cggcgcgttg caggccggac gtatcgcggt tccgctgccg gttccacatg ccggcgccca 1729801 cgatgagcgt acgatttcgg tgctaagcga cacttcgccc gctgtcattc tgacgacgtc 1729861 gggggccgtt gacgatgtca gagaatgcgc tcagccacag ccaggccagt ccgcaccatc 1729921 aatcgttgag cttgatttgc tggacttaga ttctcggcag cgctcccgca gccctggcgc 1729981 gcgcccaacc ggcagggata cgccggaaac cgcgtatttg caatatactt cgggatccac 1730041 ccgtacgccg gccggtgtca tggtctcgaa caaaaatgtc ttcgccaatt tcgagcagat 1730101 cgtggccgac ttctttgcgc ccgagggggg cgtcgtcccg ccggacctca ctgtggtgtc 1730161 ttggctgccg ctgtaccacg acatgggtct tctattaggc gcgatcatgc cgatcctggc 1730221 gggtgtaccc accgtgttga cgagtccggt ggggttcctt cagcggccgg ctcgatggat 1730281 acaactgctg gcacgtaacg gtcgcacgat ttcggcagga ccgaatttcg ctttcgaatt 1730341 ggcggtgcgt aagacgtcag acgacgacat ggacggactt gacctcgccg gcgtgcacac 1730401 catcctcaac ggcagcgagc gagtacaccc ggcgaccctc aaacgatttg ctgaacggtt 1730461 cggccgcttt aattttgccg ccgcggcgct gcggcccgcg tatggcatgg cggaagcaac 1730521 ggtgtacata gcgacccgta atgtgaacga accaccagaa atcgtcgact tcgaatccga 1730581 gaaactgcct gcgggccaag cgatccggtg cccgagcgga agcggcacac cgctggtcag 1730641 ctacggcgtc ccacggtcac agctagtgcg catcgttgat ccagacacgt gtatcgagtg 1730701 tccgcaggga tcggtcggtg agatctgggt gcaaggtggc aacgttgcgt ccggctattg 1730761 gcacaaaccc gaggagagca agcgcacgtt tggcgccagg attgtcaccc cttcggcggg 1730821 cacacccgaa gcgccttggc tgcgaaccgg ggattcgggt ttcgtctccg gcggcgagct 1730881 gttcatcatc ggccgcatca aggacctctt gattgtgtat gggcgcaacc acgctcccga 1730941 cgacatcgag gcgaccatcc aggagataac ctccggccgc tgtgcggcga tcgcggtccc 1731001 cgaccacggc accgaaaagc tggtcgcgat tatcgaactc aagaaacggg gagactccga 1731061 cgaggatgtg gcggaccggc tgcgcatcgt caagcgtgac gtcgccgcgg cgatatttga 1731121 ttcgcacggt ctgagcgtgg ccgatctcgt tctggtgtcg cccgggtcga ttcccatcac 1731181 caccagcggc aagatcaggc gggcacagtg cgtccagctt taccgacggc gtgagttcac 1731241 ccggttagac gcttgactgc atcgttggag cttgttttcc attgtgctac aaccggtttg 1731301 ctgtctctgt ggcccagtgt tagtgggccg ctcggcattg actgagcacg acacgattcc 1731361 tagtgtgctg gtatgtcgga cggcgcggtg gtacgggcat tggtattgga ggcgccgcgc 1731421 aggctggtcg tgcgccagta ccggctgccg cgcatcggcg atgatgacgc actagtgcga 1731481 gtagaggcct gcgggctgtg cggcaccgat cacgagcaat acacgggcga gctggccggt 1731541 gggtttgcct tcgtacctgg ccacgagacg gtcgggacga ttgcggccat cggtccgcgg 1731601 gcggagcagc ggtggggcgt gtcggccggc gaccgagtag ccgtcgaggt attccagtcg 1731661 tgtcggcagt gcgctaactg tcgtggcggc gagtaccggc gttgtgtacg gcatggcctc 1731721 gctgacatgt acgggttcat cccggttgac cgagagcctg gcctgtgggg cggttacgcc 1731781 gaatatcagt acctggcgcc ggattcgatg gtgttgcggg tggccggtga cctcagcccg 1731841 gaagtggcca ccttgttcaa cccgctgggg gcgggaatac gttggggagt aacgattccc 1731901 gaaaccaaac cgggcgacgt cgtggcggtg ctgggtccag gaatccgggg gctgtgcgcc 1731961 gccgcggcgg caaaaggggc cggtgccggg ttcgtgatgg tgaccgggtt gggaccccgt 1732021 gacgccgacc ggttggcgct ggcggcacag ttcggagccg acctcgccgt cgatgttgcg 1732081 atcgatgacc cggtcgccgc cctgaccgaa cagaccggtg ggctggcaga cgtcgttgtc 1732141 gacgtgaccg ccaaggcgcc agcggcattc gcacaggcga tagcgctagc ccggcccgcc 1732201 gggaccgttg ttgtcgccgg cacccggggc gtgggcagcg gggcaccggg attttcgccc 1732261 gacgtcgttg tgttcaagga gctgcgtgtg cttggcgccc tcggcgtaga cgccaccgcc 1732321 taccgggccg cgcttgatct gttggtgtcc ggtcgatacc ccttcgcaag cctgcctcgc 1732381 cgctgcgtgc ggctcgaagg cgccgaggat ctgctggcta ccatggccgg tgaacgcgac 1732441 ggtgtcccgc ctatccacgg agtgctcaca ccatgacaac atcccgcgtg cccctgttgc 1732501 cggtcgacga ggccaaagct gctgccgacg aagcgggcgt gcccgactac atggctgagc 1732561 tcagcatctt ccaagtgttg ctgaatcatc cgcgactagc gcggaccttc aacgacctgc 1732621 tcgccaccat gctgtggcac gggaccctgg actcacggtt gcgtgagttg gtgatcatgc 1732681 ggattggttg gctcaccgac tgtgactacg aatggaccca acactggcgg gttgcttcag 1732741 ggcttggcgt gtcggccgac gatctgctcg gtgtacggga ttggcaaggg tacaacgggt 1732801 tcgggcccgc tgagcaggcc gtcctggcgg ccaccgatga cgtggtgcgc gagggcgcgg 1732861 tgagtgcgca gagctggtcg gcttgcgagc gggaattaca ttgcgacaaa gtggttctca 1732921 tcgaactcgt tacggtgata agcgcatggc gaatggtcgc ttcgatcctg cacagcctcg 1732981 aggtcccact ggaagacggc gtttccagct ggccgcccga cggcctttcg ccaaggtgac 1733041 tgcgccgagc gtgtaaccat ggcgagattc cgccggcgat ttttccgccc tgagtgcacg 1733101 ttcggcgcag aagcactaga cgatccggta ggtctgcaca gcgtgagcga cgatgttccc 1733161 gtcgggatcg gttgcggtga tctcggtaaa ggtgagttcc ttgcggcgtc gggcagtgcg 1733221 cgcatgacag agcaagtcac accgcttggc ggcgccggtg tactggatgc tcatcgcgac 1733281 cgtggcggcg cgggtgcccc tgtcgaagtc gtggttcgac caagcggcgg cggcaccggc 1733341 ggtgtccatc accgacgcga tcaccccacc gtgaaagtag gtgccgtcat tggtgaggtc 1733401 ggtgcgaaac gggagtcgga tcacgacgtc gtcgggttcg tagcgttcga acacgatgcc 1733461 gagcccgccg atgaacggcg tcctcggcat cagctcacgc accgcctggc gacgtttgtg 1733521 ctgctcttgg gcggtcaacg ggtcggacat ggcaggtaat ctaccctatt agattgacat 1733581 atcaatcaat aactcttagc gtcgtcgcaa tgcggaccag agtcgccgag ctgctcggtg 1733641 ctgagtttcc aatatgcgcg ttcagccact gccgggatgt ggtggcggcg gtgtccaatg 1733701 cgggcgggtt cgggatcctc ggtgccgtcg cacatagccc caaacggctg gagagcgagc 1733761 tgacctggat cgaggagcac acgggtggca agccgtacgg agtcgacgtg ctgctgccgc 1733821 ccaaatacat cggcgccgag caaggcggta tcgatgccca gcaggcccgg gagctcatac 1733881 ccgaagggca tcgcaccttc gtcgacgact tgctggttcg ctatggcatc cccgcggtca 1733941 ccgaccggca gcgttcgtcc tcggccggtg ggctgcacat ctcgcccaag ggttatcagc 1734001 cgttgctgga tgtggccttc gcccatgaca tccggttgat cgccagcgcg ctcgggccgc 1734061 cgccaccgga tctcgtggag cgcgcccaca accatgacgt gctggttgcc gccctagccg 1734121 gcacggcgca gcacgcgcgg cgacacgcgg ctgcgggtgt tgacctgatc gtcgcgcagg 1734181 gcaccgaggc cggaggccac accggcgagg tggcgaccat ggttctggtt cccgaagtcg 1734241 tcgatgcggt gtcgccaacg ccggtgctgg ccgcgggcgg gatcgcccgt ggccgccaga 1734301 tcgctgcggc gttggccctg ggggcggaag gcgtctggtg cgggtcggtc tggttgacca 1734361 ccgaagaagc cgaaacgccc ccggtggtca aggacaagtt tctggccgca acatcctcgg 1734421 acacggtgcg gtcccggtcg ctaaccggca agccggcgcg catgctgcgc acggcctgga 1734481 ccgacgaatg ggatcggcct gacagccccg acccgcttgg catgccgctg cagagcgcgc 1734541 tggtcagcga cccgcagttg cgcatcaacc aggccgccgg ccagcccggg gccaaggctc 1734601 gtgagctggc gacctacttc gtcggacagg tcgtcggctc actcgaccgg gtgcggtcgg 1734661 cccgctcggt ggtgcttgac atggtcgagg agttcatcga caccgtcggg caactgcagg 1734721 ggttggtgca aaggtgagcc gcgctagcgc gcggcggcgc cgagcggtca gcgatgagga 1734781 caagtcgcaa cggcgcgacg agatcttggc cgcggccaaa atagtgtttg ctcacaaggg 1734841 ttttcatgcc accaccgtcg cagacatcgc caagcaggcc ggcctggcgt acgggctgat 1734901 ctactggtac ttcgactcca aggacgactt gttccacgcc ttgatggccg gtgaagagga 1734961 ggcgctgcgc gcgcatgtcg cggccgaact ggcccgcgtt ggcgggtcta ccgaggcgcc 1735021 gcttcgggcc ctgttacagg ccgcggtaca ggccacgttc gagttcttcg aaaccgacaa 1735081 ggctaccgtc aaactactgt tccgtgacgc ttacgcgctt gggggccgat tcgaagagca 1735141 tctcggcgga atctacgagc ggttcatcga cgacatcgaa gccgtcgttg ttgccgctca 1735201 acggcgcggt gaggttgtcg aggccccgtc ccggatggcc gcgtacacgt tggcggcgct 1735261 ggtggggcag ttggcacacc gacggctgaa taccgacgat aacgtcaccg ccgcccaggt 1735321 agccgacttc gtggtgtcgc tggtgctaga cgggctgcgt ccgcgtgcac tggcggtcgg 1735381 ggcccgcggt ggtcgggccg cccgaacctg agcaaaggct gccaaataca tggtgaacgc 1735441 gtaaggattc gcgacacccg cccggatcac gttgaccgag acgggtaggt cgtgcatgat 1735501 cggtccggta agcacctcgt taggtgaggc ggctacacga acataggcca ctgaccccga 1735561 acgtcgagag acgccccggg tcaggacagc tcttcccggc ttaagggttg agcccaggtg 1735621 gcttccggct taccggacac gtcgtgtggt gccgaagctc tgacgagagg ggtgcggatt 1735681 tccggcagtt gccggcatct ctgtactcct gtgacgcgct ttatcgtgcg gacaaccgta 1735741 cgtgtcgtgg ccgtgaggag gtgagggacg catgagttcc ggtgacagtc cggaccgata 1735801 tccgggctct gtttcgtccc gatccggttt ccggcgcgac gttttgcgct gagtcgtcaa 1735861 accaagatca gccttcttgg atcggaaccg ctacgggacg ggaccaactc ggttcagtcc 1735921 atatgtgctc gttttgattt ccgtcctcgc ttgcaactcc gtctaggagg cgatcatgac 1735981 cgctgctctg cacaatgacg tagtaaccgt agcttcggcc cccaagctgc gggtggtgcg 1736041 ggatgtgccc ccggcccccg cgtccaagaa ggttgctcgc cggctcgacg cgcagccttt 1736101 cggcaccgga ggggacccgc tggtcgacgg ggcagctcgt ttgctgagca ttccgctgcg 1736161 ccacctctac gccgcgttgt ggcgcgtcgg gctgctcgag gtccaggcct agtccgatgg 1736221 gcaggcagcc gaccttgcgc cgcgatgtgg atttgcggcg ctgggcgaca atccccgtag 1736281 aatcagggga acggcatcga tccggcgatc accggggagc cttcggaaga acggccggtt 1736341 aggcccagta gaaccgaacg ggttggcccg tcacagcctc aagtcgagcg gccgcgcatc 1736401 ggcgtggcaa gcggggtggt accgcggcgt tcgcgcaccg gcgtggcgtc gtccccgagc 1736461 ctggattgca ggcacgcagt gccgaacggt gctggggcct ggggagacga cgcgcaaagt 1736521 gaccgataac gcatatccaa agctggccgg cggggcaccc gacctcccgg cactcgaact 1736581 cgaggtcctc gactactggt cccgtgacga caccttccgg gccagcattg ctcgccgcga 1736641 tggcgccccc gagtatgtgt tctatgacgg gccgccgttt gccaacggtc tgccgcatta 1736701 tgggcacctg ctcaccggct acgtcaaaga catcgtgccg cgatatcgca ctatgcgcgg 1736761 ttacaaggtg gagcgtcgct tcggctggga cactcacggg ctgcccgccg aactcgaagt 1736821 cgagcgccag cttggcatca ctgacaaatc ccagatcgag gccatgggta tcgccgcctt 1736881 caacgatgcc tgccgcgcat ccgtgttgcg ctacaccgac gagtggcagg cgtatgtaac 1736941 tcggcaagct cgctgggtcg acttcgacaa cgattacaag acgctcgatc tggcttacat 1737001 ggagtcggtg atttgggcct tcaaacagtt gtgggacaag ggcctggcct acgagggcta 1737061 ccgggtgctg ccgtactgct ggcgcgacga aactccgctg tcgaatcacg aactgcggat 1737121 ggacgacgac gtctaccaaa gccgccaaga tcccgcggta acggtgggct tcaaggtggt 1737181 gggtggccaa ccagacaacg ggctagacgg tgcctacttg ctggtgtgga cgacgactcc 1737241 gtggaccctg ccgtcgaacc tcgcagttgc ggtaagcccg gacatcacct acgtacaggt 1737301 ccaggcgggc gatcgccgtt tcgtactggc cgaggcacgg ctggccgctt acgcccgcga 1737361 actcggtgaa gagcccgtgg tgctcggcac ctatcgcggc gccgaactgc tgggcacccg 1737421 ctacctgccg ccgtttgcct atttcatgga ctggcccaac gcttttcagg tgctagcagg 1737481 cgactttgta acgaccgacg atggcaccgg catcgtgcat atggcaccgg cctatggtga 1737541 ggacgacatg gtggtcgcgg aggcggtcgg tatcgcgccg gtgactccgg tcgactccaa 1737601 gggacgcttc gacgtcaccg ttgccgatta ccaagggcag catgtctttg acgccaacgc 1737661 gcagatcgtc cgggacctga agacccaaag cggcccggct gcggtgaatg gcccagtgtt 1737721 gattcgtcac gaaacctacg agcaccctta cccacactgc tggcgatgcc gtaacccgct 1737781 gatctaccgg tcggtgtcgt cgtggttcgt cagggtgacg gacttccgag accgcatggt 1737841 ggagctaaac cagcagatca cgtggtatcc cgaacacgtc aaggacggcc agttcggcaa 1737901 gtggctgcag ggcgcccgcg attggtcgat ctcccggaat cgctactggg gtaccccgat 1737961 tccggtatgg aagtccgacg acccggccta cccgcgcatc gatgtctacg gcagcctcga 1738021 cgagctggag cgcgacttcg gcgtacgccc ggccaatttg caccggccct acatcgacga 1738081 gctcacccgt cccaacccag acgatccgac tggccgtagc acgatgcgac gcattcccga 1738141 tgtgctcgac gtgtggttcg actcgggatc catgccgtat gcccaggtgc actacccgtt 1738201 cgagaacctg gattggttcc agggacacta ccccggcgac ttcatcgtcg agtacatcgg 1738261 gcagacccgt ggctggtttt acacactgca tgtgttggcg accgcgctct ttgaccggcc 1738321 ggcattcaaa acctgtgtgg cgcatgggat tgtccttggt ttcgatggcc agaagatgag 1738381 caagtcgctg cgcaactatc cagacgtaac agaggtgttc gatcgcgacg gctccgacgc 1738441 catgcggtgg ttcctgatgg catcgccgat tctgcgcggc ggcaacctga tcgtcactga 1738501 gcaaggaatt cgcgacggtg tgcgacaagt cctgctgccc ctgtggaaca cctacagctt 1738561 cctggcgctg tatgcaccga aagtcggtac ctggcgcgtc gattcggtgc acgtgctgga 1738621 tcgctatatc ctggccaagc tggcggtgct gcgcgacgac ctcagcgagt cgatggaagt 1738681 ttacgatatt cccggtgcct gtgaacattt gcgtcagttc actgaggcgt tgactaattg 1738741 gtatgtgcga cggtcgcgtt cgcggttctg ggcagaagac gccgatgcca tcgacacgct 1738801 acacaccgtg ttggaggtga ccacgaggct ggccgccccg ctgcttccgc tgatcaccga 1738861 gataatctgg cgtggtctga cacgcgagcg atcggtgcac ctgacggact ggccagcgcc 1738921 cgacctgctg ccgtcggatg ccgacctggt cgccgcgatg gaccaggtcc gcgacgtgtg 1738981 ctcggcggca tcctcgctgc gcaaggccaa gaagctacgg gtgcgcctgc cgctaccgaa 1739041 actcattgtg gcagttgaga atccgcaact tctgaggccg ttcgtcgacc tcattggcga 1739101 cgagcttaac gtgaagcagg tcgaactgac cgatgccatc gacacctatg gccgattcga 1739161 gctcacggtc aacgcccggg tagccggacc acggctgggc aaagatgtgc aggccgccat 1739221 caaggcggtc aaggccggcg acggcgtcat aaacccggac ggcaccttgt tggcgggccc 1739281 cgcggtgctg acgcccgacg agtacaactc ccggctggtg gccgccgacc cggagtccac 1739341 cgcggcgttg cccgacggcg ccgggctggt cgttctggat ggcaccgtca ctgccgaact 1739401 cgaagccgag ggctgggcca aagatcgcat ccgcgaactg caagagctgc gtaagtcgac 1739461 cgggctggac gtttccgacc gcatccgggt ggtgatgtcg gtgcctgcgg aacgcgaaga 1739521 ctgggcgcgc acccatcgcg acctcattgc cggagaaatc ttggctaccg acttcgaatt 1739581 cgccgacctc gccgatggtg tggccatcgg cgacggcgtg cgggtaagca tcgaaaagac 1739641 ctgaggtcga ctgggcgacg agcgtaacgt cacggctgaa aatccgtgcc cgacttcgcc 1739701 gtggcgttac gctcgcggcg cggggacccg atctctaggg cgttgtcgcc cagatccacg 1739761 tcggccaagg ccgatggcag cggctgaggt tgatcgccat agcgaaaact agctcggtag 1739821 ccccaaatag catcacgggt gtggagtccc gctgggtgct gcacctggac atggatgcgt 1739881 ttttcgcctc ggtcgaacag ctcacccggc cgaccctgcg ggggcggccg gtgctggttg 1739941 gcgggctggg tgggcgaggt gtggtggccg gcgcgagcta tgaagcgcgg gcctacggtg 1740001 cccgatcggc catgccgatg catcaggccc gcaggctgat cggggtgacg gccgtggtgt 1740061 tgccgccacg cggggtggtg tacgggatcg ccagccgccg ggtattcgac accgtgcgcg 1740121 gcctggtgcc cgtcgtcgaa cagctttctt tcgatgaagc gttcgccgaa ccgccccaac 1740181 tcgccggggc agtggccgag gacgtcgaga cgttctgcga acggttgcgg cgacgggtgc 1740241 gcgacgagac cggcctgatt gcctcggtcg gagcgggctc gggcaagcag atcgccaaga 1740301 ttgcttctgg tctggccaaa cccgacggca ttcgggtagt ccggcacgct gaagagcaag 1740361 cgcttctcag cggattgccg gtacgacggc tgtggggcat cggcccggtc gccgaggaaa 1740421 agctgcatcg gctcggcatc gagacgatcg ggcagctggc cgcgctgagc gatgccgagg 1740481 cggccaacat cctaggcgcg acgattgggc ccgcgctgca ccggctggcc cgtggcatcg 1740541 acgaccgccc agtggtggag cgcgccgaag ccaagcaaat cagcgccgag tccacgttcg 1740601 ccgtcgatct gaccaccatg gagcaattgc acgaggcgat cgactccatc gctgagcacg 1740661 cgcaccaacg cctgctgcgc gacggccgcg gcgcccgcac catcacggtg aagctaaaga 1740721 aatccgacat gagcacgcta acccgctcgg cgacgatgcc ctacccgacg accgacgccg 1740781 gcgcgctgtt tacggtggcc cgccggctgc tgccggatcc actgcaaatc gggccaattc 1740841 gtcttctggg tgttgggttt tcgggtttga gcgacattcg ccaggagtcg ttgtttgccg 1740901 actcggactt gacgcaggaa acggcggcag cgcattacgt cgaaacaccg ggagcggtcg 1740961 tgccggccgc gcacgacgcc acgatgtggc gggtcggcga tgacgtcgcc caccctgagc 1741021 ttgggcacgg ctgggtgcag ggagcgggcc acggcgtggt caccgtgcgg ttcgaaacgc 1741081 gtggttcagg cccgggctcg gcgcggacgt tccccgtcga caccggcgac atcagcaacg 1741141 ccagcccgct tgacagcttg gactggccgg actacatcgg ccagctatcg gtcgaggggt 1741201 ccgccggcgc ctcagcccca acggtcgatg acgtcggcga ccggtgagtt ggccgccagc 1741261 gcggccatta gcagcacccg ggcctgggac ggcggcagtc gcggtaccat caccgcgcca 1741321 gcctccacca ggtcgtgccc gggaccatag cctgcgccga cccgcgcgcc ggcgacccgg 1741381 gtagacaccg cgatcaccac cggatcgctc ccgtctcgac agtggcgacg gactccctcg 1741441 atcacggcgg ccccggcatt gcccgagccc agcgcctcca gcaccacggc gcgcgcgccg 1741501 gctgccacac aggcgtccat cgccaccgcg tcacttcccg gatagacggc gacgatgtcg 1741561 actcgtggcg ccacggcagc gcccagatcg ccgagatagg gccgcgtctt ggtgcgcgtc 1741621 agccgcaccc cgcccgacgt gaagccaagc gactcgccgg cgaatccgca caggtccggg 1741681 ttggccacct tgtgcaggcc caaaggctgt aacacccggc cgccgaaact caccagcacc 1741741 ccgaggtcgc gggcggctgg gtcggcggcg accgcaagcg cgtcgcgcag attggccggg 1741801 ccatcggcgc cgggggcatc ggcgctgagc atggccccgg tcaacacgac cgggcggcta 1741861 cccgcatagg tgaggtccag ccacagagcg gtctcttcga gcgtatcggt gccgtgagtg 1741921 atgaccaccc catctgcgcc gccgcggaat gcctcctgca ctgcagcgcc tatccggtcc 1741981 caatcggccg gcgtcaactt tgagctgtcc agcgccatga ggtcgactac ttcgatgtcg 1742041 gagtccatgt cgagaccggc gatcagcgtc gccccgcaat gggttggccg tagcacccca 1742101 tcggggccgg cggtggtcga gattgtccct ccagtagtga tgacggtgag gcgggccatg 1742161 atgggatcat tgcgcacgtg gtttgctccc atccggccgc ggggtctggg cgggccatat 1742221 cggccctagg ggatgatgat ggtgtgcctg acgaaccaac aggatcggct gatccgctga 1742281 cctcgaccga ggaagccggg ggggcggggg aacctaacgc tcccgcgccg ccgcgacggc 1742341 tgcgcatgct gctgtcggtc gctgtggtgg tgctcacact cgacattgtc accaaggtgg 1742401 tagctgtcca actgttgccg cccggccagc cggtgtcgat tatcggcgac acggtgacct 1742461 ggactctggt gcgtaattct ggggcggcct tctcgatggc gaccggatac acctgggttt 1742521 tgacgctgat tgcgacgggt gtcgtggtcg gaattttctg gatggggcgg cggctggtat 1742581 cgccgtggtg ggcgctgggt cttgggatga tcctgggcgg tgccatgggc aacctggttg 1742641 atcgcttctt tcgggcaccg gggccgctgc gcgggcacgt cgtcgatttc ttgtcggtcg 1742701 gctggtggcc ggtgttcaat gtcgccgatc cgtcggtagt cggtggcgcc atcctgctgg 1742761 tcatcctgtc gatctttggc tttgacttcg acaccgtagg tcggcgacac gccgacgggg 1742821 acaccgtagg tcggcgcaaa gccgatggct gaccgctcaa tgcccgttcc ggatggattg 1742881 gcgggaatgc gtgttgacac cggactggcc cgcttgctgg gactgtctcg gaccgctgcg 1742941 gctgccctcg ccgaagaggg cgcggtcgag ctgaatggcg tgccggccgg aaagtccgat 1743001 cggctcgtct ccggcgcctt gctgcaggtg cggttgcccg aggcgcccgc gccgctgcag 1743061 aacaccccca tcgatatcga gggcatgacg attctgtatt ccgacgacga catcgttgcg 1743121 gtcgacaaac cggctgcagt tgccgcgcat gcgtcggtcg gctggaccgg accgacggtg 1743181 ctcggcggac tcgccgccgc cgggtaccgg atcaccacat ccggggtgca cgagcggcag 1743241 ggcatcgtgc atcgcctcga cgtcgggacc tccggggtga tggtagtggc gatctccgag 1743301 cgggcgtaca ccgtgctgaa gcgggcgttc aaataccgca cggtggacaa gcggtaccac 1743361 gcgctggttc aaggacatcc agatccgtcc agcggaacga tcgacgcgcc gatcggtcgt 1743421 catcgcggcc atgaatggaa gttcgcgatc accaagaatg gccggcacag ccttacgcac 1743481 tacgacacgc tggaagcgtt cgtggcagcc agcctgctcg acgtgcatct ggaaactggc 1743541 cgcacccacc agatccgggt gcacttcgcc gcgttgcatc acccatgttg cggcgacctc 1743601 gtttacggag ctgatcccaa gctagcgaag aggctcgggt tggaccgtca atggctgcac 1743661 gcgcgttcac tggcgttcgc tcatccggcc gacggccggc gggtggagat cgtcagcccg 1743721 tatccggccg atctgcagca cgcgctaaag atattgcgtg gcgagggttg accggcatca 1743781 cgaggtgcgg cagacgaacg tggcgccatg gaaatcgagg cgcacctcgc cctggtgttc 1743841 ccagtattca atgccttggc gaccataccg ggcgccactg ccggacagtt cgacgaacac 1743901 gatcacttgg tcgcctttcc agttgagcac ggccgtcttc gggtcgaatt ggttgtagaa 1743961 ctgtgcggtc aatgggccgt cctgcgtggg acaccggtag gtgaggacgg gtggagtcgc 1744021 ggtggcggga tcggcgattg ccaattggac cagccgggtc tggtaggcct cctgcacaca 1744081 ggtacgcggg tcggtgtctt gcgcacaagc atcacgcagc atcgtccagc tgctttgtgc 1744141 ggcttccagc gccgccgagc gtcgatgggc cagcgcctgt tgataggcgg tcgaaagccg 1744201 gtggtccaga ctggtcaact gccggtcgtg gcaaaccagt tgctgcacta tggttgccgg 1744261 tttggtgcag tcgagcgact gcccggcggt cggcgaggtt gtgttagccg gagggtttgc 1744321 ggcgcaggcg ctcagaacca gggcggtcac caggacgccg atccatctca tggaaacgga 1744381 ctacccggct accgacgcgg tgtccagcgc gacacgccac agggctcaga ctggtgccgt 1744441 ggtgctctcg cccgatgtga cgtcgaccgc cagcggcgcg atgacgccga ggatttccgt 1744501 gatcgtttcg gagggcacgc cggctgcggt cagcgcgtcg gccaagtgtc cggcgaccag 1744561 gctgaagtgg tgcatggtaa ttccgcgccc ctgatggact tgcttcatcg gcgcaccggt 1744621 atagggctcg ggcccgccaa gcgcggccgc gaaaaactcc acctgcttgc ccttgaggcg 1744681 gctcatgttc gtaccgctga agaaggccga tagttggtca tcggcaagca cacgaacata 1744741 gaagtcctcg acgacgactt cgatggcctc atgcccgccg atcttgtcgt agatgctgat 1744801 cggctcacgt ttgcgcaagc gtgacagtag tcccatcgtg ccaggggacc atgccggcgt 1744861 tgcctgccgg ttaggtcgcg atcacgctcg gattatcagc tgtaacaagc tgattgccgc 1744921 caacgtcgca cagatcccgt cgcaaacaga tccttggtcg ccgcaccggc cggtagtgga 1744981 ctccattcat cagctcatgt gctagtaggt tggcttcatg acccgtgtcc atcaccccac 1745041 gccgccatca ggagctaccc acgatgaatc ttggtgactt aacgaacttc gtcgagaagc 1745101 cgctcgcggc ggtgtccaac atcgtcaaca ccccgaactc ggccgggcga tatcggccct 1745161 tctacttgcg caacttgctc gatgcggtgc agggccgcaa cctcaatgat gctgtcaagg 1745221 gcaaggttgt cctcatcact ggtgggtcat caggcatcgg tgcggcggcc gcgaagaaaa 1745281 ttgccgaggc cggcggcacg gtggtgttgg tcgcacgcac cctggaaaac ctcgagaacg 1745341 tcgccaacga catacgggcg atccgaggca acggtgggac cgcccacgtc tacccgtgcg 1745401 atctatccga catggatgcg attgccgtga tggccgacca ggtgctcggc gacctcggcg 1745461 gcgtcgacat cttgatcaac aacgcgggcc ggtcaattcg gcgctcgttg gagttgtcct 1745521 atgaccggat ccacgattac cagcgaacga tgcagctcaa ctacctcggc gcggtccagc 1745581 tgatcctgaa gttcatcccc ggaatgcgag aacgccactt cgggcatatc gtcaacgttt 1745641 cctcagtcgg cgtgcagacc cgcgcgccgc gcttcggcgc ttacatcgcc agcaaggccg 1745701 cgctggacag cctgtgtgat gcgttgcaag ccgagaccgt gcacgacaac gtccgattca 1745761 ccaccgtgca catggcattg gtaaggactc caatgatcag cccgaccacg atctacgaca 1745821 agtttcccac gctgacgccg gatcaggcgg ccggtgtgat caccgatgcc atcgtgcatc 1745881 ggccccggcg agccagctca ccgttcggac agttcgccgc cgttgccgac gccgtcaacc 1745941 ccgcggtgat ggaccgggta cgtaaccgtg ccttcaacat gttcggcgac tcgtccgcag 1746001 ccaagggaag tgaatcccaa accgacacat cagaactcga caagcgaagc gagacgtttg 1746061 tgcgggccac ccgagggatc cattggtgac accatgagcc ttccgaaacc gaacaatcag 1746121 accaccgttg tgatcaccgg cgcctcctcc ggcatcggtg tcgaattggc tcgtggcttg 1746181 gccggccgcg gcttcccact gatgctagtg gcgcggcgcc gcgagcgcct cgacgaactg 1746241 gccgatcagc tgcgccagga acactgcgtc ggggtggagg tcttgccgct cgaccttgcc 1746301 gatacgcaag cgagggcaca gctggctgat cgcttgcgta gtgatgcgat tgccgggctg 1746361 tgcaacagcg caggtttcgg caccagtggg cgtttttggg agttgccgtt cgcacgcgaa 1746421 agcgaggaag tcgtcctcaa tgctctggcg ttaatggaac tcacccatgc cgcactgcca 1746481 ggcatggtca agcgcggcgc cggtgcggtg ctcaacatcg cctcgatcgc gggtttccag 1746541 ccgattccct atatggccgt gtattcggct accaaagcct ttgtgctgac gttctctgaa 1746601 gccgtgcagg aggagctgca cggaacgggc gtgtcggtga ctgccctgtg cccaggcccg 1746661 gtacccaccg agtgggccga gatcgccagc gccgagcggt tcagcattcc cctcgcccaa 1746721 gtttcgccgc acgacgtcgc cgaagccgcc atcgccggga tgctctccgg taagcgcacc 1746781 gtcgtgccgg gcatagtgcc aaagttcgtc agcaccagcg gcagattcgc tccgcgcagc 1746841 ctgctgctgc ccgcgatccg gatcggcaac cggctgcgcg gcgggcccag ccgctgatgt 1746901 gaggggcgtt ccggcctggt gccgaacgga gtgctgggcc tgggcaatcc cagccggcta 1746961 gccgcgttgt atgggttgca gctggcgcac gagtcgcagt gctgccagat gcacaatttg 1747021 ccctctgcag cgcgacaagt cactgttgcg tgtcgcgagg aggtgggcat aacgaccatc 1747081 cttgccggca gagacgaatg cggcgtgtgt gacaagacag ctgggttgga tggcgccgct 1747141 ccttagcggg ccatagcgca cggcccgctt cgtcgccggc gctagtctca tgcgatggcc 1747201 tctgttgagc tgtccgctga cgtccccatc agcccgcagg acacgtggga ccacgtttcg 1747261 gagctgtcag agttggggga gtggctcgtc atccatgagg ggtggcgcag cgagttgcct 1747321 gatcaactgg gcgaaggcgt ccagatcgtg ggtgtcgcgc gggccatggg catgcgcaac 1747381 cgggttacgt ggcgggtgac caagtgggac ccgccacatg aggtcgcgat gacaggatcc 1747441 gggaagggtg gaacaaagta cggagtcacc ctcaccgtgc gacccacaaa aggcgggtcg 1747501 gcgctggggc tgcgtctcga gctgggcggg cgtgcgctgt tcggcccgct gggttcggcg 1747561 gcggctcgcg ccgtcaaggg cgacgtcgag aagtcgctta agcagttcgc cgagctatac 1747621 ggctagccgc tagaagacac actttgcgac acgcccgaac ggtgtcggtc ctcggtcata 1747681 gactggcgtc cctatgagcg gttcatctgc ggggtcctcc ttcgtgcacc tgcacaacca 1747741 caccgagtat tcgatgctgg acggtgccgc gaagatcacg cccatgctcg ccgaggtgga 1747801 gcggctgggg atgcccgcgg tggggatgac cgaccacgga aacatgttcg gtgccagcga 1747861 gttctacaac tccgcgacca aggccgggat caagccgatc atcggcgtgg aggcatacat 1747921 cgcgccgggc tcgcggttcg acacccggcg catcctgtgg ggtgacccca gccaaaaggc 1747981 cgacgacgtc tccggcagcg gctcctacac gcacctgacg atgatggccg agaacgccac 1748041 cggtctgcgc aacctgttca agctgtcctc gcatgcttcc ttcgagggcc agctgagcaa 1748101 gtggtcgcgc atggacgccg agctcatcgc cgaacacgcc gagggcatca tcatcaccac 1748161 cggatgcccg tcgggggagg tgcagacccg cctgcggctc ggccaggatc gggaggcgct 1748221 cgaagccgcg gcgaagtggc gggagatcgt cggaccggac aactacttcc ttgagctgat 1748281 ggaccacggg ctgaccatcg aacgccgggt ccgtgacggt ctgctcgaga tcggacgcgc 1748341 gctcaacatt ccgcctcttg ccaccaatga ctgccactac gtgacccgcg acgccgccca 1748401 caaccatgag gctttgttgt gtgtgcagac cggcaagacc ctctcggatc cgaatcgctt 1748461 caagttcgac ggtgacggct actacctgaa gtcggccgcc gagatgcgcc agatctggga 1748521 cgacgaagtg ccgggcgcgt gtgactccac cttgttgatc gccgaacggg tgcagtccta 1748581 cgccgacgtg tggacaccgc gcgaccggat gcccgtgttt ccggtgcccg atgggcatga 1748641 ccaggcgtcc tggctgcgtc acgaggtgga cgccgggctt cgccggcgat ttccggccgg 1748701 tccgccggac gggtaccgcg agcgcgccgc ctacgagatc gacgtcatct gctccaaagg 1748761 tttcccatcg tactttctga tcgtcgccga cctgatcagc tacgcgcggt cggcgggcat 1748821 aagggtgggt cccggccgcg gctcggccgc cggctcgctg gtcgcctacg cgctgggcat 1748881 caccgacatc gacccgattc cacacggtct gctgttcgag cggttcctca accccgagcg 1748941 cacctcgatg cccgacatcg atatcgactt cgacgaccgg cgccgcggtg agatggtgcg 1749001 ctacgcagcc gacaagtggg gccacgaccg ggtcgcgcag gtcatcacct tcggcaccat 1749061 caaaaccaaa gcggcgctga aggattcggc gcgaatccac tacgggcagc ccgggttcgc 1749121 catcgccgac cggatcacca aggcgttgcc gccggcgatc atggccaaag acatcccgct 1749181 gtctgggatc accgatccca gccacgaacg gtacaaggag gccgccgagg tccgcggcct 1749241 gatcgaaacc gacccggacg tacgcaccat ctaccagacc gcacgcgggt tggaaggcct 1749301 gatccgcaac gcgggtgtgc acgcctgcgc ggtgatcatg agcagcgagc cgctgactga 1749361 ggccatcccg ttgtggaagc ggccgcagga cggggccatc atcaccggct gggattaccc 1749421 ggcgtgcgag gccatcggtc tgctgaaaat ggacttcctg ggcctgcgga acctgacgat 1749481 catcggcgac gcgatcgaca acgtcagggc caacaggggt atcgacctcg acctggaatc 1749541 cgtgccgctg gacgacaagg ccacctatga gctgctgggc cgcggcgaca ccctgggcgt 1749601 gttccagctc gacggcgggc ccatgcgcga cctgctgcgc cgcatgcagc cgaccgggtt 1749661 cgaagacgtc gtcgccgtta tcgcgctgta ccggcccggc ccgatgggca tgaacgcaca 1749721 caacgactat gccgaccgca agaacaaccg gcaggccatc aaacctattc acccggaact 1749781 cgaagaaccg ctgcgcgaga tcctcgccga gacctacggc ctcatcgtct atcaagagca 1749841 gatcatgcgc atcgcgcaga aggtggcgag ctactcgttg gcccgcgccg acattctacg 1749901 caaggccatg ggcaagaaga aacgcgaggt gctggagaag gagttcgagg gcttctccga 1749961 tggcatgcag gccaacgggt tctctccggc ggccatcaag gcgctgtggg acaccatcct 1750021 gccgttcgct gactacgcgt tcaacaagtc acatgccgcc ggctacggca tggtgtccta 1750081 ctggacggcc tacctcaagg ccaactatcc cgccgagtac atggccggtc tgttgacgtc 1750141 ggtcggcgac gataaagaca aggccgcggt ttatctggcc gactgccgca agctcggcat 1750201 caccgtgctc ccgcccgacg tcaacgaatc tggcttgaac ttcgcatcgg tcggccaaga 1750261 catccgctac gggctgggcg cggtgcgcaa cgttggcgct aatgtcgtgg gctcgttgct 1750321 ccaaacccgc aacgacaagg gcaagttcac cgacttttcg gactacctga acaagatcga 1750381 catctcggcg tgcaacaaga aggtgaccga atcgctgatc aaggcgggtg cgttcgactc 1750441 gctggggcat gcccgcaagg gtcttttcct ggtgcacagc gatgcggtgg actcggtgct 1750501 gggcaccaag aaggccgagg cactggggca gttcgatctc ttcggcagca atgatgatgg 1750561 gaccggcacc gcagatcccg tgttcaccat caaggtgccc gatgatgagt gggaggacaa 1750621 acacaaactc gccctagagc gcgagatgct gggactgtac gtctcggggc atcccctcaa 1750681 cggtgtggca cacttgctgg ctgcccaggt cgacaccgcg atcccagcga tcctcgacgg 1750741 cgatgtcccc aacgatgccc aagtgcgggt gggcggcatc ctggcgtcgg tgaaccggag 1750801 ggtcaacaaa aacggaatgc catgggcttc agcgcaattg gaggatctca cgggcggcat 1750861 cgaggtgatg ttcttcccgc acacctactc cagctatggt gccgacatcg tcgacgatgc 1750921 cgtcgtgctg gtcaacgcca aggtggcggt ccgtgacgac cgcatcgcat tgatcgccaa 1750981 tgacctcaca gtgcccgact tttccaacgc cgaggtggag cggccgctgg cggtcagctt 1751041 gcccacccgg cagtgcacct ttgacaaggt gagtgcgctc aaacaggtgt tggcgcgcca 1751101 ccccggcacc tcgcaggtgc atctgcggct catcagcgga gaccggatca ccacgctggc 1751161 acttgatcag tcgttgcggg tgacgccgtc gccggcgttg atgggtgacc tcaaggagct 1751221 gctcggccct ggatgtctgg ggagttagcg aggcgaccgc ccccagcggt ttccgcacga 1751281 tcgcccgtga gcgccgctaa tggatccagc ccgacgcccg actgtccccg ttgagatacc 1751341 ccgagacctc gtcgtcgaag ttggcgaagc ccgacatgag gctgccgaag ttgaagaagc 1751401 cagagatcga ggttcccgca ttgacgaaac ccgaaatgtt ggccgtaccg gtgatcgtgg 1751461 gtaccgagtt tctgaagccc gacagatggt cacccacgtt gatgatgccg gcgttgtagt 1751521 tgcccacgtt ggccaggccc gagttaccgg tgacaaattc ggcgtgttcg tcacgggcgt 1751581 tgttatcgaa gcccgagttg aaggtgccgg cgttagcgta gcccgagttg ttggtgcccg 1751641 tgtgcagcca gccggagttc tgtaccggct ggttgaccga gttgaacaat ccggtgttga 1751701 gatcgcccga gttgaagaag ccggtgttga tgttgccgga gttgaagtag ccggtgttca 1751761 cgttgcccga gttgaaatcg cccgtgttca tgctgccggc attgaagcta cccgtgttgg 1751821 tatggccgga gttaaacaga cccatgctgc cggtgaccag gtttccgccc gagtttccga 1751881 ttccggtgtt cgtggtgccc gagttgaacc agccggtatt ggtggtgccg gagttgaacc 1751941 agccggtgct ggtggtggcc gagttgaacc agcccgtgct gagctggccg gagttgccga 1752001 taccggtact gagctcgccg gagttgccga agcccgagct gcgctcggcc gaactgccga 1752061 actccacgct cagcgccccg actccattgc ccgagtttcc catgccgatg ttgttattgc 1752121 ccgagttgaa aaagccgata ttaccgttgc ccgagttccc gaagccgagg ttcccgctac 1752181 ctgaattgag ggcgccgaaa cctatctggt tatcaccggt gagcccgaag ccgatgttgt 1752241 tgttgccgct gttcccaaac ccgatgttat gagagccctt gttgccgaaa ccgatatttc 1752301 cactgccaag gttgccgctg ccaaagttgg tgtcgccggt gtttccgtta ccaaagttga 1752361 cgttaccggt gtttccgaac ccgaaattcg agttcccggt gttgccgcca cccacgttga 1752421 gatttccggt gttaccgccg ccaaagttgg tgtcgccggt atttccacta cccaggttat 1752481 agctgccgag gttgccgccg cccaggttat agctgccgat gtttccgcta ccccagttga 1752541 gcgtgcccgt gtttccactg tccgggttga gatccccggt gttgccgcca cccaggttgt 1752601 agctgccgat attgccgctg cccaggttga gatcaccgat attggcgttg cccaagttga 1752661 cgctgcccgt gttggcgcta cccgggttga agctgcccag gttgccgagg cccagattgc 1752721 cgctggccag attgaagcca ccgatggtga cgttgcccag atcgagaaac ggaaaccccg 1752781 cgatgatcac cgcaccgccg ggcgtggctg ccgcgctcgg cgacggtgtg aacggcgtca 1752841 gcgacaaggc gaccgccgac gcctcgccgt gataaccgag catcgcggcg acatcggcgg 1752901 cccacatctg ctcatagacg gcctcgacgg ccgcgatggc cggggcgttt tgccccaaca 1752961 gattcgaggc caccaaggag cgcaaccgac ctcgattggc gctcaccgcg cccggatgca 1753021 ccgtcgccgc cagcgccgcc tcgaacgccg ataccgccgc ctgggcttga cccgccgcct 1753081 gctcagcctg cgctgcggcc gtggtcagcc agcgagcata ggacgcggcc acgcctgtca 1753141 tggccgctga cgccggacct tgccaggacc cggtggccaa ctgcgaggtc accgccgaaa 1753201 atgaggccgc cgccgaaccc aaatcgccgg ccagcccggt ccaggccgac gccgccgcca 1753261 acatcggtcc cggcccggca ccagcgaaca tcagcgccga attgatctcc ggcggcaaca 1753321 ccgaaaaatt catcacaacc atcccgtcag ccggccacac ccaccgggct tcacggcgct 1753381 gtctggcccc aaccgcagcg aagcctacga aaaagccggg cgcttcggac gggcgcaggt 1753441 taaatccagg taacgcgtga cgaatctcgc gaggagcctc cttgcggcca tgggccgcca 1753501 cgggtctcgg tggtcgcggc cccgtgcttc cgcgtccttc gattgtggac gtacgctcac 1753561 cgatgtgacc tgggccatac tgatccgctg tcaaggagaa cggaaatgac cacaacagag 1753621 cgcccgacaa ccatgtgcga ggcgttccag cgcaccgccg tcatggaccc ggacgccgtt 1753681 gcgctacgga cccccggcgg taaccagaca atgacatggc gagactacgc ggcgcaggtg 1753741 cggcgggtcg ctgccggcct ggcaggtttg ggagttcggc gcggcgacac ggtctcgctg 1753801 atgatggcga accggatcga gttctacccg ctcgacgtcg gtgctcagca cgtcggcgcc 1753861 acctcgtttt cggtgtacaa caccctgccc gccgagcagc tgacctacgt gttcgacaac 1753921 gcggggacca aggtggtcat ctgcgagcaa cagtacgtcg atcgcgttcg cgccagcggt 1753981 gtgcccatcg aacacatcgt ctgcgtcgat ggcgcgcccc cggcacgctc tcgctgacgg 1754041 atttgtacgc ggccgcctcc ggcgacttct tcgacttcga gtcgacgtgg cgtgccgtac 1754101 aacccgagga cattgtcacc ctcatctaca cgtccggcac aacgggaaac cccaagggtg 1754161 tggagatgac ccacgccaac ctgctgttcg aggggtatgc catcgacgag gtgctcggaa 1754221 tccggtttgg cgatcgggtg acgtccttcc tgccatcggc gcacatcgcc gatcggatga 1754281 ccgggctgta cctgcaggag atgttcggca cccaggtcac cgcggtggcc gacgcgcgca 1754341 cgatcgcagc cgcgctcccc gacgtgcggc caaccgtgtg gggggccgtt ccccgggttt 1754401 gggaaaagct taaggccgga atcgaattca ccgtcgctcg tgagaccgac gagatgaagc 1754461 ggcaggcgtt ggcgtgggcg atgtcggtgg ctggcaaacg cgccaacgcc ctgctcgcag 1754521 gtgaatctat gtcggatcag ctggtcgccg aatgggccaa agccgacgag ttggtgttgt 1754581 ccaagttgcg cgagcggctg ggcttcggcg agctgcggtg ggccctgtcc ggagcggcgc 1754641 cgatccccaa ggagacgctc gcgttcttcg caggtatcgg catcccaatc gccgagattt 1754701 ggggaatgtc ggagctgagc tgcgttgcca ccgccagcca tccccgcgac gggcggctgg 1754761 gcaccgtcgg aaaactactt cccgggctgc agggcaagat cgccgaagac ggtgagtacc 1754821 tggtccgcgg tccgctggtg atgaagggtt atcgcaaaga accggccaag accgcggagg 1754881 cgatcgactc cgacggctgg ctacacaccg gagatgtctt cgatatcgac tccgacggct 1754941 atctgcgggt ggtggaccgc aagaaggagc tgatcatcaa tgcggccgga aaaaacatgt 1755001 cgccggccaa catcgagaac accatcctgg ccgcgtgccc catggtcggg gtgatgatgg 1755061 caatcggtga cgggcgaacg tataacaccg cgctgttggt cttcgacgcc gactctctcg 1755121 gtccgtatgc ggcccagcgt ggcctcgatg cctcgcccgc ggctctggcg gctgacccgg 1755181 aggtgatcgc gcgcatcgcc gccggcgtgg ccgagggcaa cgccaaatta tcgcgggtcg 1755241 aacagatcaa gcggttccgc atattgccca ccctgtggga gcccggcggg gacgagataa 1755301 ccctgacgat gaaactcaag cgccgtcgaa tcgccgcgaa atattccgcg gagatcgagg 1755361 agctctacgc cagcgagctg agaccgcagg tttacgagcc cgctgccgtg ccatcgacac 1755421 aaccggcatg acgggggcta gccagtgact gcacgggagg tgggccgcat cggactgcga 1755481 aagttgctgc agcgcatcgg tattgttgct gaatcaatga cgccgctagc gaccgacccc 1755541 gttgaggtta cccaactgct ggatgcccga tggtatgacg agcggctgcg tgcgctggcc 1755601 gacgagctcg gacgcgatcc ggacagcgtg cgcgccgagg cggcaggcta tctgcgggag 1755661 atggccgcct cgctggatga gcgggccgtg caggcatggc gcggcttcag tcgctggctc 1755721 atgcgcgcct acgacgtact ggtcgacgag gaccagatca cgcagctgcg caagcttgat 1755781 cgcaaagcca ccctggcgtt cgcgttctcg catcgttcgt acttggatgg gatgctgctg 1755841 cccgaggcga tcctggccaa ccggctctcg ccggcgctga ccttcggcgg ggcgaacctg 1755901 aacttctttc cgatgggcgc ttgggccaaa cgtaccgggg ctatcttcat tcggcgtcag 1755961 acgaaagata ttcccgtcta ccgcttcgta ttacgtgctt acgccgcgca gctggtgcaa 1756021 aaccatgtca acctcacctg gtcgatcgaa gggggtcgga ccagaacggg caagctacgg 1756081 ccaccggtgt tcgggatcct gcgttacatc accgatgcgg tcgacgaaat cgacggtccc 1756141 gaagtgtatt tggtgccgac ctcgatcgtg tacgaccagc tgcacgaggt ggaagccatg 1756201 accaccgagg cctatggcgc ggtgaaacga cccgaagacc tgcgctttct ggtccggttg 1756261 gcgcgacagc agggcgagcg actgggccgc gcctatctcg acttcggcga accgctgccg 1756321 cttcgcaagc gcctgcagga gatgcgcgcc gacaagtcgg gcaccggcag cgagatcgaa 1756381 cggatcgcgt tggatgtcga gcaccggatc aaccgcgcca caccggttac ccccaccgcg 1756441 gtggtgagtc tggccctgct gggcgcggac cgctcgttgt ccatcagcga ggtgttggcg 1756501 acggttcgcc cgttggccag ctacatagct gcccgcaact gggcggtggc cggcgccgcc 1756561 gatctgacga atcgctcgac gatccggtgg accttgcatc agatggttgc ttccggcgtg 1756621 gtgagtgtct acgacgcggg caccgaggcg gtgtggggca tcggcgagga ccagcacctg 1756681 gtggcggcgt tttaccgcaa caccgcgatc catatcctgg tcgatcgggc cgtcgccgag 1756741 ttggcgttgc tggcggccgc agagaccaca acaaacggct cggtttcccc ggcgaccgtg 1756801 cgtgatgagg cgttgagcct tcgcgacttg ctgaagttcg agttcttgtt ttctggccgt 1756861 gcccagtttg agaaagacct cgcaaacgag gtactgctga tcgggtcggt ggtcgacacc 1756921 tccaagcccg cggccgcagc cgatgtgtgg cgcctgctgg aatcggccga tgtgctgctg 1756981 gcccacctgg tgctgcggcc gtttctcgat gcctaccaca ttgtcgccga tcggctggcc 1757041 gcccatgaag acgactcttt cgacgaggaa gggtttctgg ccgagtgtct acaggtcggc 1757101 aagcagtggg agctgcagcg caatatcgcc agcgccgagt ccaggtcgat ggagctgttc 1757161 aagaccgcac tgcgcctggc tcgccatcgc gagctggtcg acggtgccga tgcgacggac 1757221 atcgccaaac gccgacagca gttcgccgac gagatagcca cggcaaccag gcgggtaaac 1757281 acaatcgcag aactggcccg caggcaatga gcgacaaatg cggccgccag ggccgctgcg 1757341 ccgtccagcg aacgggtcaa acggtggacg cgccatcccc ccgggcatag tctgaatgtg 1757401 atctaggtca cgtgccagca ccggaggagg cgggactatg gtcgcgacca ctacgcactt 1757461 cccgaagcaa aaagcgccct gcgggcacat ggttgacggc gatcaccaca tcgagcgcga 1757521 cgacgaaggc cttgcctacg acgacctcaa gttttcctgc ggctgccgcg aaatccggca 1757581 tttctaccac gacggatcca tgcgggtacg cacgattcga cacgacggca aggtgttgaa 1757641 ggacgagcac agcggcgatc acgaagcgtg aaccagcgcg atgaccgccc aacacaacat 1757701 cgtggttatc ggcggcggtg gtgcgggtct gcgcgccgcg attgcgatag ccgaaaccaa 1757761 tccgcacctg gatgtggcga tcgtttccaa ggtgtacccg atgcgcagcc acaccgtctc 1757821 ggctgagggc ggcgccgcgg cggtgaccgg tgacgacgac agcctcgatg aacacgcgca 1757881 cgacacggta tccggtggcg actggctgtg tgaccaagat gcggtcgagg ctttcgtggc 1757941 cgaggcgccc aaagagttgg tgcagctcga gcattggggc tgtccgtgga gccgtaaacc 1758001 agacgggcgc gttgccgttc gcccgttcgg cgggatgaag aagctgcgca cctggtttgc 1758061 cgccgacaag acgggatttc acctcctgca cacgttgttt caacggctgc tcacctattc 1758121 cgacgtcatg cgctatgacg agtggttcgc tacgacgctg ctggtcgacg acggcagggt 1758181 atgtggtctg gtcgctatcg agttggcgac cgggcgcatc gagacgatcc ttgccgacgc 1758241 ggtgattctg tgcaccggcg gatgcgggcg ggtatttcca ttcaccacca acgcgaacat 1758301 caagaccggc gacggcatgg cgctcgcatt ccgcgcgggc gcgcccctaa aagacatgga 1758361 attcgtccaa taccacccca ccggactgcc gttcaccggg atcttgatca ccgaggccgc 1758421 acgagctgaa ggcggctggc tgctcaacaa agacggctac cgctacctcc aggattacga 1758481 cctcggcaag cccacgcccg agcccaggct gcgcagtatg gagctcgggc ccagggaccg 1758541 actgtcgcag gccttcgtac acgagcacaa caaaggaagg acggtcgaca ccccgtacgg 1758601 ccccgtcgtc tatctagacc tgcggcacct gggggcggac ctgatcgatg caaagttgcc 1758661 gttcgtacgt gagctgtgcc gcgactacca gcacatcgac cccgtggtcg aattggtccc 1758721 ggtacgaccg gtagtgcact acatgatggg tggcgttcac accgatatca acggcgccac 1758781 aacgcttccc gggctatatg ccgcaggtga aacagcctgc gtgagcatta atggcgccaa 1758841 ccgcctgggg tcgaactcgc tgcccgagct gctggtgttc ggggctcgag cgggccgtgc 1758901 cgccgcggat tacgcagcgc gccaccaaaa gtcggaccgt ggcccgtcgt cggcagtgcg 1758961 ggctcaggcc cgcaccgagg ctctacggct agagcgtgag ctcagccgcc atggccaggg 1759021 aggcgaacga atcgcggata ttcgggcgga catgcaggcc accttggaaa gcgccgcggg 1759081 tatttatcgt gacggaccca ccctcaccaa agcggtcgag gagattcggg tgctgcagga 1759141 acgattcgcc acggcgggca tcgacgatca cagccgcaca ttcaacaccg agctgactgc 1759201 gctgctcgag ttgtcgggga tgctcgacgt tgcactggcg atcgtcgaat cgggtttgcg 1759261 ccgagaagaa tcccgtggcg cacaccagcg aaccgacttt ccgaaccggg acgacgagca 1759321 tttcttggcg cacaccttgg ttcatagaga aagcgacgga acgctgcggg tcggctacct 1759381 tccggtcact atcactcgct ggccaccggg cgaacgcgtg tatgggaggt aaggatgatg 1759441 gatcgaattg tcatggaggt ctcccggtat cggcccgaga tcgaatcggc cccgacattt 1759501 caggcctacg aggttcccct cacccgcgaa tgggcggtgt tggacggcct gacctacatc 1759561 aaggatcacc tcgacggaac actctccttc cgctggtcgt gccggatggg tatctgcggc 1759621 agtagtggta tgacgatcaa cggcgaccca aagctggcgt gcgcgacatt ccttgccgat 1759681 tacctacccg ggccggtgcg ggtggagccg atgcgaaact tcccggtgat ccgcgatctc 1759741 gttgtcgaca tcagtgactt catggccaag ctgcccagtg tgaagccgtg gctcgtccgg 1759801 catgatgaac cgcccgtcga agacggcgaa taccggcaga ccccggccga actcgatgca 1759861 ttcaagcagt tcagcatgtg tatcaactgc atgttgtgct actcggcgtg cccggtgtac 1759921 gcgctggacc ccgacttcct cggtccggcg gcgatcgcgc tggggcagcg gtacaacctg 1759981 gactcgcgcg accaaggtgc ggcggatcgc agggatgtcc tggccgcggc cgacggcgct 1760041 tgggcgtgca ccctggtggg cgaatgttcg acggcttgtc cgaaaggcgt cgatcctgcc 1760101 ggcgcgatcc agcgctacaa gctgaccgcg gccacgcacg cgctgaagaa gttgctgttc 1760161 ccttgggggg gcggatgagc gcctatcgcc agccggtcga aagatactgg tgggcgaggc 1760221 ggcgttctta cctgcgattc atgcttcgcg aaatcagttg catcttcgtg gcctggtttg 1760281 ttctctatct gatgctggta ttgcgcgccg ttggcgcggg cgggaattcc taccagcggt 1760341 ttttggactt cagcgccaat ccggttgtcg tagtgctgaa cgtcgtcgcg ttgagtttcc 1760401 tgctgctgca tgctgttacc tggttcggat cggcaccgcg cgcgatggtg attcaggttc 1760461 gcggccgccg ggtacccgct cgcgcggtcc ttgctgggca ctacgcggca tggctggtgg 1760521 tttcggtgat cgttgcctgg atggtgctgt catgactccc tcgacatcgg atgccaggtc 1760581 gcgccgacgc tcggcggagc ccttcctgtg gctgctgttc agcgccgggg gcatggtcac 1760641 cgccctggtt gcgcccgtcc tgctgttgct gttcggactc gcgtttccgc tcgggtggct 1760701 cgacgcgccc gaccacgggc acctactggc gatggtgcgc aacccgatca ccaagcttgt 1760761 tgtgctggtc ctggtggtac tggccctgtt ccatgcggcg caccggttcc ggttcgtgct 1760821 cgaccatggg ctgcaactgg gccggttcga ccgagtgatc gccctgtggt gttacggcat 1760881 ggccgtgttg ggctcggcga cggcgggttg gatgttgctc actatgtaaa gtcgctggcc 1760941 gggcgctttg gccgccggca cggtacggta cggacctgta ccaccacaac ggttctatgg 1761001 taggcgctgt gacccagata gcggatcggc ctacagaccc ctcgccctgg tcgccgcgag 1761061 agaccgagtt actggcggtg acactacggc tgctgcagga gcacggttat gaccggctaa 1761121 cagtggatgc cgttgcggcg agcgcccgcg ccagcaaggc aacggtctac cggcgctggc 1761181 cgtcgaaagc cgaattggtg ctggccgcgt tcatcgaggg catccgccag gtcgcggtcc 1761241 cgcccaatac cggcaacctg cgcgacgact tgctgcgact gggggagctg atctgtcggg 1761301 aggtgggcca acacgccagc accatccgcg cggtgctcgt cgaagtgtcg cgcaatcctg 1761361 ccctcaacga cgttttgcag catcagttcg tcgaccaccg taaggccctg atccagtaca 1761421 tcttgcagca ggccgtcgac cgcggtgaga tctccagcgc ggccatcagc gatgaactct 1761481 gggacctgct acccggctac ctcatcttcc ggtccatcat ccccaaccgg ccgcccaccc 1761541 aggacacggt gcaagccctc gtcgacgacg tgatactccc cagcctcacc cgatccaccg 1761601 gttgagtcag cggtgcgaat ggctgggcac cgttgtggtg tccggtcccg taccgtactg 1761661 ttgaatccgc ggatccccgc ctgaggtacg gggcgtggtc gcgccccggg caatagcgtc 1761721 gccggttatc gaaaggctaa cgggtgcagg ggatttcagt gactggcctg gtcaaacgcg 1761781 gctggatggt gagatccgtc tttgacacga tcgacggtat cgaccaactc ggcgagcagc 1761841 tggccagcgt gaccgtaacc ttggacaagt tggctgcgat ccagcctcaa ttggtggcgc 1761901 tgctaccaga cgagatcgcc agccagcaga tcaatcggga actggcgctg gctaactacg 1761961 ccaccatgtc cgggatctat gcccagacgg cggccttgat cgaaaacgct gccgccatgg 1762021 gacaagcctt tgacgccgcc aagaacgacg actccttcta tctgccgccg gaggcttttg 1762081 acaacccaga tttccagcgc ggcctgaaat tgttcctgtc ggcagacggt aaggcggctc 1762141 ggatgatcat ctcccatgaa ggcgatcccg ccacccccga aggcatttcg catatcgacg 1762201 cgatcaagca ggcggcccac gaggccgtga agggcactcc catggcgggt gctgggatct 1762261 atctggccgg cacggccgcc accttcaagg acattcaaga cggcgccacc tacgacctcc 1762321 tgatcgccgg aatagccgcg ctgagcttga ttttgctcat catgatgatc attacccgaa 1762381 gcctggttgc ggcgctggtg atcgtgggca cggtggcgct gtcgttgggc gcttcttttg 1762441 gcctgtccgt gctggtgtgg cagcatcttc tcggtatcca gttgtactgg atcgtgctcg 1762501 cgctggccgt catcctgctc ctggccgtgg gatcggacta taacttgctg ctgatttccc 1762561 gattcaagga ggagatcggt gcaggtttga acaccggcat catccgtgcg atggccggca 1762621 ccggcggggt ggtgaccgct gccggcctgg tgttcgccgc cactatgtct tcgttcgtgt 1762681 tcagtgattt gcgggtcctc ggtcagatcg ggaccaccat tggtcttggg ctgctgttcg 1762741 acacgctggt ggtgcgcgcg ttcatgaccc cgtccatcgc ggtgctgctc gggcgctggt 1762801 tctggtggcc gcaacgagtg cgcccgcgcc ctgccagcag gatgcttcgg ccgtacggcc 1762861 cgcggcccgt ggttcgtgaa ttgctgctgc gcgagggcaa cgatgacccg agaactcagg 1762921 tggctaccca ccgttaaggt ggtgggatgc cgctttcagg ggaatatgcg ccgagcccgc 1762981 tcgactggtc gcgcgagcaa gccgacacgt atatgaagtc cggcggaacc gagggcacac 1763041 agctgcaggg aaagccggtc atcctgctca ccaccgtcgg ggcgaagacc ggcaaactcc 1763101 gtaagacccc gctgatgcgc gtcgagcacg acggccagta cgcgatcgtc gcctcgctgg 1763161 gtggggcgcc gaaaaatccg gtctggtacc acaacgtcgt gaagaaccca cgggtcgagc 1763221 tgcaggacgg caccgtgacc ggcgactacg acgcccgcga ggtgttcggt gacgagaagg 1763281 ccatctggtg gcagcgcgcc gtggcggtct ggccggacta tgccagctac cagaccaaga 1763341 cggaccgcca gattccggtg ttcgtgctga ccccggtgcg cgcgggcggc tagccattgg 1763401 gatagggcgg cgtggcacca ttgaccggtg tccgccgaac tgagccagag cccgagcagc 1763461 tcgccgctgt tttcactatc tggggcagac atcgaccgtg ccgccaagcg gatcgcaccg 1763521 gtagtcacgc ccaccccgtt gcaacctagc gatcggttgt cggcgatcac tggcgccacg 1763581 gtctacctca agcgcgaaga cttgcagacg gtgcgctctt ataagctacg cggagcgtac 1763641 aacctgttgg tgcagttgtc cgatgaggaa ctggccgcgg gcgtggtgtg ttcttctgcg 1763701 ggcaaccacg cgcagggctt cgcgtatgcg tgtcgctgtc tgggtgtgca cggccgggtc 1763761 tacgtacctg ccaaaacccc caagcagaag cgtgaccgga tccgctacca cggcggggag 1763821 ttcatcgacc tgatcgtggg tgggtcgacc tatgatctgg ctgcggcggc ggcccttgag 1763881 gacgtggaac gcaccggggc cacgctggta ccgccgtttg acgacctgcg caccatcgcc 1763941 ggccagggca cgatagccgt cgaagtgctt ggccagctcg aggacgagcc ggacctggtg 1764001 gtggtcccgg tgggtggcgg cggctgcatc gcggggatca ccacctacct ggccgagcgg 1764061 acgaccaaca ccgcggtgct gggcgtcgag ccggctggtg cggccgccat gatggccgcg 1764121 ctcgcggcgg gcgagccggt gacgctggac catgtcgacc agttcgtcga cggcgccgcg 1764181 gtgaaccggg cgggcacgct gacctatgcc gcgctagccg ccgccggcga catggtttcg 1764241 ctcaccaccg tcgacgaggg tgcggtgtgc acggcgatgc tcgatctgta tcagaacgag 1764301 ggcatcatcg ccgaaccggc cggtgccctg tcggtcgccg gtctgttgga agccgacatc 1764361 gagcccgggt ccaccgtggt gtgcctgatt tcgggcggca acaacgacgt gtcccgttac 1764421 ggggaggtgt tggagcgctc gctggtccac ctgggcctca agcactattt cctggtcgac 1764481 ttcccgcagg agcccggtgc gctgcgccgg tttctcgacg acgtgctcgg acccaacgac 1764541 gacatcacct tgttcgagta cgtcaagcgc aacaaccggg agaccggtga ggcgctggtg 1764601 ggtatcgagc tgggatcggc cgcggatcta gacggtctgc tggcccggat gcgggcgacc 1764661 gacattcacg tcgaggcgtt ggaaccgggg tcgccggctt accgctatct gctgtagcga 1764721 ggcgtcggcg cgaccgtgcc gacaaacctc gcatgtgtat cgttggtgta tgtcgcgcac 1764781 caacatcgac atcgatgacg aacttgccgc cgaggtcatg cgcaggttcg gtctgaccac 1764841 caagagggcg gcggtcgacc ttgccctacg acggttggtc gggtcgccgt tgagccgtga 1764901 gtttctgctc gggctggaag gcgtcggctg ggaaggcgac ctggatgact tgcgaagcga 1764961 tcgcccagac tgatctcgat gatcctcatc gacacatcgg cctgggtgga gtacttccgt 1765021 gccaccggat caatcgccgc tgtcgaagta cgccggctgc tgtccgaaga agcagcgcga 1765081 atcgctatgt gtgagcccat tgcgatggaa atcttgagtg gcgcgctcga cgacaacacc 1765141 cacacgacgc tagagcggct cgtgaatggc ttgccgtcgt tgaacgttga tgacgcgatt 1765201 gactttcgtg ctgccgcggg tatctatcgc gccgcccggc gcgccggcga aacggttcga 1765261 agcatcaacg actgcctcat agcggcgctc gcgatccgcc acggtgcgcg tatcgtccac 1765321 cgtgacgccg actttgatgt gattgcccgg attaccaacc tgcaggccgc atcgtttcgg 1765381 tgagcatgcc gccccagcat caggccggct ccgcagcccg cagtatcgca agcgaatacg 1765441 ctgctagctc ggtggaatta tcgccgataa tcggcgactc ccaggccagc accagctcac 1765501 cgctgaccgg cacgcaggtt ggctcggcac caaggttgca ggcgatcatc agctggccgc 1765561 ggcgcatcac aacccagcgt tgctgctcgt cgtagtcgac cataaggtgg tccagccagg 1765621 ggtccgcaag gtcggcctcg ttgtgccgca aagcgatcag atcgcgataa aaccggtgca 1765681 acctggcgtg ttcgccggag ccggcttcgg cccagttcag cttgcagcgc tggaatgtct 1765741 gcgggtcctg cgggtccgga atgtcgtccg cggcccagcc atgttcggcg aactcctcct 1765801 tgcgtcctgc cacggtgcta tgggccagtt ccggttcggg atgtgagcaa aagaactgaa 1765861 acgggctgga ggccccccac tcttcgccca tgaaaagcat tgcggtatag ggagatccaa 1765921 gggtcaacgc cgccttgatc gcgagctggc caccggtcag gtattgcgat gggcggtcgc 1765981 cgagagcgcg gttgccgact tggtcgtggg tgcaggtgta ggcgagcagc ctggtggccg 1766041 ggatcgcaga agtgtccaat gcacgcccgt gccgacgacg ccggaacgac gaatacgtgc 1766101 cggcgtggaa gtagccgttg cgcagcgtgt acgcgagagt ggccagcgag ccgaaatccg 1766161 catagtagcc ttgccgctca ccggataccg cggtatggat ggcgtgatgg atgtcgtcat 1766221 tccattgggc ggtgatcccg tagccgccat ggctgggccg ggtgatcagc cgcgggtcgt 1766281 ttcggtcggt ttcggcgatc agcgacaacg gacggcccaa ctggcctgac agccagcggg 1766341 tcgcgttggc aagctcctcg aggacatgca cggcggtggt gtccaccagt gcatgcacgg 1766401 cgtccaaccg caagccgtcg gcgtggaagt cgcgcatcca tcgcagcgcg cagtcgatga 1766461 tatagtggcg aacctcgtcg gagtcggcgc cggcgatatt gatgccgtcc ccccacgggt 1766521 tgctggccga cgacaggtac gggccgaatc gcggcaggta gttgcccgat gggccgagat 1766581 ggttgaacac cgcgtcgatc aacacgccca aacgacgggc atggcatgcg tcgatgaacc 1766641 ggaccagacc gtcggggccg ccgtagggtt cgtgcacgct gtaccacagc acaccgtcat 1766701 atccccaacc gcgggttccg gcaaaggaat tgaccggcat cagctcgacg aagtcgattc 1766761 cgagatcgac caggtaatcc agcttttcga tggcggcgtc gaacgtgcca gccgtggtga 1766821 acgtgccgat gtgcaactcg tagatcaccg cgccctcgac cgaccgcccc ggccagccag 1766881 tgtcggtccg ggcagcacca aactggccgg gcggctccca ccgctgggag cgtgcgtgca 1766941 ccccgtcggg ttggcgggcc gatcgcgggt cgggtagcac ggtggggtcg tcgtcgagta 1767001 ggtatccgta gcgggcgtcc gccggcgccg ccaccgtcgt gtgccaccag ccgtcggctg 1767061 agcgggtcat cgcatgtacc gcaccgttca cgtcgagccg gaccagcgcg ggtttgggtg 1767121 cccatactcg gaattcaggc attgtcgcgc accagcagca ccacaggcag atccgcgaac 1767181 agctcgacgg ccggcgtgtg cccactggcc gtgaatccgg tgagggcatc tgtccacgac 1767241 ccgtcgggta ggggcagtac ggtgtggtcc cagccggttt gctgcaggcg caccgtccag 1767301 cgggtcaccg cgaccaggat gtcgtcaccg cggcggaacg caacgacgtg gtcggcggcc 1767361 ggcccggcgg cgaacaccgg atggtatgcg ccgcccagga agctctccgg atgggtgcgc 1767421 cgcagtcgaa gcgccgcggc caacacccga atcttagggt gctgcaaggc tttcagagcg 1767481 acacgccggg tgccgtagtc gacgggacgg cggttgtccg ggtcgaccag gctgtcgtcc 1767541 cacagttcgc tgccctggta gacgtcgggt acgccaggca cggtcaacgc gagcagctta 1767601 gcggccagcg cgtcgctttc ggcatgcgag ttgaggtggg ccacaagtcc ggtcagctcg 1767661 gacgccagcg gtccgtcgag caccagatca agccagccgt gcacgtcgtc ctcgaacgcc 1767721 cggttcgggt tgtgccacga ggtgtgccat gccgcctccc ggatcgcctt ctcggcgtaa 1767781 gtgtgcagcc ggccgcgcag cgcggcgctg acctctccac tcactggcca cactccgaag 1767841 acgttctgcc acagaaactg tccagtcacg gcatcagggg cgggcgcaat ggcttgggcg 1767901 tggccgatga acttggccca cagccacggc acttgggaca gcacgccgat gcgggcacgc 1767961 acgtcctcgc cgcgtttggt gtcgtgggtg gacagtgtcg tcatggaccg tggccacaac 1768021 cgagcacggg tggcggcccg gtgatgaaac tccgcggcgc ccacaccaaa ccggcgcggt 1768081 tctccgccca cttcattgag tgacaccagc cgggcatcac ggtagaacat acagtcttcg 1768141 acggccttgg cgctcaccgc gccgcacagt tgttgcaggc gtacggctgg ttcgccaccg 1768201 cgggccacag ctgcggcaat cagctgcagt ccaggtgcca attgtggtgt tgtcgaatgg 1768261 gtttcagcca acgcgcaggg taggacggcg gcttggccgg ggtaatcaca gcgatagcgt 1768321 ccgatgtggc gcagcagtgc agccaccgcc gcgggcaaca gcggatgatc ggcgccggcc 1768381 gccgccgcga tgcatcgccg caatcggcga agctcactgg ccaacgtatg gacggccgcg 1768441 tgcaccttga ggtcggccaa catcgccggc atctcctgat agtccacacc ggccgattcg 1768501 accagcgctg tcagtggtga ctctccttgg gggtcaacga ggacgccacc tatttcgcgc 1768561 agcacgtcat agccggtgga gccgtccact ggcagcgtgg gctctaacgc ctcgtcgacc 1768621 gccaggattt tttcgaccac gatccaggcg ttcgggccga gcagttcgcg cagctgggcc 1768681 aagtatccgc tgggatcgga tagtccgtcg aggtggtcga cgcgcacgcc gtcgacgagt 1768741 ccttcggtga accagcgagc gacctctgcg tggctggcgt cgaacacagc gcggtcttcc 1768801 tggcgcaggc cggccaacga ggtgatcgag aagaaacggc ggtagccgca cagcccgtgc 1768861 cgccatccga ccagccgata gtgctggcgg tcgtgcacag cggggccggt gccgtccccg 1768921 ctgccggggg cgacgggcag cgccaggtcg cccagccgca gcaggtcgcc gtcgactctg 1768981 aggttggcaa cgtcgctgtc ggagcccaat agcggcagga tgatccggcc atcacctagc 1769041 tcccagtcga tgtcgaagaa ctcggcatac gccgaggacc ggccgaactt caagacatcc 1769101 caccaccacg cgttctgctc gggcttgccg acgccgacat ggctgggcac gatgtcgacg 1769161 atcaggccca tgccccgcga ccgcgccgcc gcggataacc gcgctaggcc gtcagagcca 1769221 ccaagctcgg gtgacaccgt cgtcggatcg gtgacgtcat agccgtgggt cgacccgccg 1769281 accgccgtca aaatggggga caggtacaga tgcgataccc cgaggtcgtc gaggtagtcc 1769341 agcaggttct cggcatcggc gaaggtgaac ccgaatccgt tcgaccgacc gcgcatctgc 1769401 acccggtaag tggaaataac cggaaatgcc atatttcaca acgtcttacg caggaccagc 1769461 agcgagcgcg caggtaccga aaacgtgtca gtggcggtta ccgtcaggtc gatgtcaccg 1769521 acgggatcgt tggtatccag ctctccggtc cactgctgcg catagccgtc atgcggcatc 1769581 acgaactcca cgtcgtggtc atgggcgttg aagcacaaca ggaatgaatc gtcgactact 1769641 cgctcaccac gggcgtccgg tgcggtaatg gcttcaccgt tgagaaacac cgcaacacac 1769701 ctgtcgaagc ctctgcccca atcctcgtgc gtcatctccc gaccgctcgg tgtcaaccag 1769761 gcgatatcgc ggacttcgtc gccactgcgg atcggttcac cctcaaagaa ccggcgtcgg 1769821 cgaaacacct tgtggttctt gcgcaaggtc gtcgccttgc gtgcgaaagc tagcagatcg 1769881 gcattcttgt ccaccaatga ccaatccatc caagataatt cggagtcctg gcagtagacg 1769941 ttgttgttgc cgtattgggt gcgcccaatc tcgtcgccgt gggcgatcat cggcgtgccc 1770001 tggctgacca taagcgtggc ccacatgttg cgcatctggc gggcacgcag cgccaagatg 1770061 tcggggtcat cggtggggcc ctcgacaccg cagttccacg atcggttgta gctttccccg 1770121 tcgcggttgt tctcgccatt ggcctcgttg tgcttgtcgt tgtacgagac caggtcgttg 1770181 agtgtgaacc cgtcgtgggc ggtgacgaaa ttgatactgg cactgggccg gcggccggtt 1770241 gcttcgtaga ggtccgacga cccggtcagc cgggaggcga attcgcctag ggtggccggc 1770301 tcgcctcgcc agtagtcgcg cacggtgtcg cggtacttgc cgttccattc cgtccacagt 1770361 cctgggaagt tgccaacctg gtagccacct tcgccgacat cccatggctc ggcgatcagc 1770421 ttgacctgac tgaccaccgg atcttgttgc accagatcga agaatgccga cagccggtcg 1770481 acgtcgtgca gctcgcgggc cagcgtggac gccaggtcga accggaaccc gtcgacgtgc 1770541 atttcgatca cccagtagcg cagcgaatcc atgatcagct gcagggtgtg tgggtggcgg 1770601 gcattgaggc tgttgccggt accggtgaag tccttgtaga acctcaagtc gtggtccatc 1770661 agtcggtagt aggcggtgtt gtcgattccg cgaaagttga tcgtcggacc caagtggttg 1770721 ccttcagcgg tgtggttgta gacgacgtcg aggatgacct cgatgccggc ttcgtgcagg 1770781 ctgcgcacca tggttttgaa ctcggctacc gcgctgccgg cttgccgggt cgacgcgtat 1770841 tgatggtgcg gggcgaagaa tccgaaggtg ttgtaacccc agtagtttcg caagccgagg 1770901 tccagcagcc gggagtcgtg taggaactgg tgcaccggca tcaactcaac ggcggtgacg 1770961 ttgagctcgt tgaggtggtc gatgatcacc gggtgggcca ggccggcgta ggtgccccgg 1771021 agttcgggcg ggatactggg atgggtctgt gtcatgcctt tgacatgcgc ttcgtagatt 1771081 acggtctcgt ggtacggggt gcgcggcgac cggtcgtatg cccagtcgaa gaacggattg 1771141 atcacgacgc tggtcatagt gtggcccagc gagtcgacca tcgggggagt gctgtccggg 1771201 tcgacggcgt tgacgtcata ggaatacagc gcctgcccga aggtgaaatc gccgtggaac 1771261 gacttcccat acgggtcgag cagcagcttg ctggggtcac accgatggcc ggccgccggg 1771321 tcgaacggcc cgtgcacacg aaacccgtag cgctggccgg gggtgatgtt cggcagatag 1771381 gcatgccaga cgtacccgtc cacctcgtca agcgggatcc gcgactcgac gccgtcctcg 1771441 tcgatcagac atagctcgac cttctcggcg atctcggaga acaacgaaaa gttggtcccg 1771501 gcgccgtcgt aggtggctcc aagcggatag gcgttgcccg gccacaccgt gggtagagcg 1771561 ggcccggtcc cgtcggactc cccggcgttg ttcgacgaca tcacacgacc ttatccaggt 1771621 tctccggcgg gtgtaggcgt caccaccagt cggtgttcgc cgcgatttgc cgaccgagct 1771681 cgctggtcat cgtccgcatg taggtggggg tcaggtgatg actgtcgcgg tacaccagaa 1771741 catttccctc gaccgcgcgg caggtgtcgg tccggcatat cgcgtcggac atatcgagtg 1771801 gcttaagcag cgggaaccgc gcaacgaagt cgagggttgg attccgatcg accagcacct 1771861 tggaccgcgc gatcccacac gactgcggat tgccgccttt ggccaggcag tccgcaggga 1771921 tgaacggttg gccgtccttg accagccaag gggtatcccg catcgcgaga acgggaatgt 1771981 tgttgtcggc gaacgtttgc cagatcccga cataggttgc tggcatcaca tcgccgggtt 1772041 tgatgttcca cggtcgagtc gaggttgtga aaacgtagtc ggggtggtca gcgaccaact 1772101 tggccatcgc cgcttgcacc cactggtgac actgcggata gggagcgtta ttgcccatga 1772161 tcagcgggac ttcctcggtg gacaacgggc aacccatttt gaggtacgtc accaccttga 1772221 agtggtgcat gcgacccagc agatccagtg cggtcagcca gtgttcggcg tgtgaacccc 1772281 cggccagtgc gatggtccgg ggtgcgtcca catcgccgta ggtgcagttg atgatcgccg 1772341 ggttgacgaa gtcgctgatg cagccgtcct tggtcgaggt cggcaggtcg tgacggactt 1772401 ccaggacggt tgggcgcatc cgcagcttgg gcacccggac gtggtcgatc agggcccgcg 1772461 ccccgggata gtcgcgggag ctcaacccgc tcaactcttt gccggcggcg cgctggacga 1772521 tgacgtgctc acgccacgtg aacgaggtcg cggtaagagc gacgccaagc agtgccacca 1772581 cagatcccag cacgatcgtt ggccgacgca gccgcagccg ccagggaatc ggcgggaccg 1772641 ccgccggcga tctcacgccg gcgggtgccc gatagcgtaa tgggtcttcg acaagccggg 1772701 tggtcaggta tgccagcaac ccggatacca gcaggactgc cgcgccttcg acaaagttgg 1772761 cgtgccggtg cccggtgtag gagagccaga agatgagcag cggccaatgc cacagatacc 1772821 aggaataggc catcgcgccc agcgccacca acggagcggt ggctagcagg cgattgggca 1772881 gtggcagccg gtcgcgggta ccgggatggc cctgccggtt ggctccggca aggatcatca 1772941 gcatcgtggc tccgacgggt accagcgccc acggccctgg aaattccttg acaccgtcga 1773001 tcagggcgcc gcacgacagt atcgccgcca gcgcggcggt ggccaccgcg gtgcgcagcc 1773061 acatcggcca gcgcacatgg ggcaccacag cgccgaccag tgctcccgcc aacaactccc 1773121 aggcccgcgc gaaggtgttg tagtaagcgg tcgcctggta ggcgtgatgc gcaacgatgg 1773181 catagatgaa tgaggccaac gtcaacgtgc tcaataacac cacaaacatc gtccgcaggt 1773241 acggggcccg cgggccccga aacagtctgc gcagcaagta ggcgcacccg gcaacaagca 1773301 gcaggaaagc gagatagaac tgaccctgca ccgacataga ccagatgtgc tgcaaggggc 1773361 tcaccgcttc accggctcgc agatagttgg agaccgtgct agccagctcc caattctggt 1773421 aataccccaa gctggccagg ctctggttgg caaacgcttc ccaccgcgtc tgcggttgta 1773481 ttgcgatggt gagcagcgcg cagccggcga ggaccacaac cagtgccggg agcagccggc 1773541 ggatgagtcg gatcacttcg gctataggcg agagtgacag atccgggttg agggcggcgc 1773601 gaagtatttt cccgccaaag aagaagccgg acagcgccag gaacacgtct actccgccgg 1773661 aaacccggcc gaaccaaacg tggaacactg ccaccagggc gatcgcgaca ccgcgcaatc 1773721 cgtccaggtc gtgccggtaa aagccggtcg tacgggtccc catggtaacc gggggcaagg 1773781 ccggctccgg ggtcaaggcc ggtgggcgag gcggcgacag ggtcaacatg gttgacagtt 1773841 aatttaccca aaccagcctc ctgcttcgcg cgctgagcag cgggaagcag gaggcgggtt 1773901 tgggaggcga gaaagcaagc gggaccgtta gcgtgagcgc gcggtgccga agggaggcgg 1773961 ctggacgggc gcttgctgga cgggcgcttg ctggaccggc gcctgttgga ccggcgcctg 1774021 ttggacgggc gcctgttgga cgggcgcttg ctggacgggc gcttgctgga ccggcgctgg 1774081 ctggaccggc gcctgttgga cgggcgtcgg ctgggtcccg agaacccgga ccaggtaagg 1774141 cgtcatgccg ttggtgcgca ccggcgaaac ctggacgacg tcgcccacct ccagcatctg 1774201 gcccttcccg aggtataacg cgacgctttg cgtgccttcg gggccgtaga agatcaggtc 1774261 gcccttgcgc gcttgctgcg gcaggacctt ttgcccaacc ttgtacatct ggccggaaga 1774321 acgcggcagc tttagcccgg caccggcata ggcgtactgg atcaaaccgg aggcgtcgaa 1774381 cccgacggtg ttgatgccgg taccggtgcc gcgcgtgggg ccgctgatgc cgccgccggc 1774441 ccaggagaac ggcacgccgc gctgcgacag cccgcgcgcg atcacgacgt cggtgatctg 1774501 ttgataatcc accggccgcg tggccgggtc tgcggccgca agaccgggcg cggccaccat 1774561 cggggcgagc atcattgcca gaccgatcgc gaaggagccg cttttcatgc tgcgtttcat 1774621 ggggttgtaa cctccttggc actctcgggt ggtgtgtgcc tcagcacgtg acttcaccgt 1774681 ctgccattcc agccggaagt cactttattc acaccaatca ctacagacac tttgacaaca 1774741 gatgccggcc gcgtccatag ctggccagat ccaccagaag tctttttgcc gtaacgtgac 1774801 cggacggtga ctgccgcgct caatctttga tcggcagttg tgatttcagt cacgcgcgat 1774861 taatgccaat agcgttcgct gaatcccgct atcgcgtagc ccgcgatgga ggtgacggtg 1774921 atgacggcga tcgagatgat cggccagaag gacatgtacc agcccttcaa catggaaacg 1774981 aaggggccga tgacggctgt ggcgatggcc gcaccgatcc ctccccacat caccgggtag 1775041 atgtaatagt tgacgccgaa cggtaccagc gggcaagcat ccggcgggca cacgttgtcg 1775101 gtgaaggcga agagccgtga tggccagcta gtcatcgtga ccatgaccag aaatactgcc 1775161 aatatcgcta cggtacatac cacgtcccag ggcgctatcc gcagcgtgag tacccgtggg 1775221 ggtgtccgct cgtcgggctc gtctagagcc gaccgggatt cggtgccagc atcgggctga 1775281 gtgtcttcag gctgattcgg cggtgccatg catgcatgct ccccgatggc agaggttttg 1775341 gcgaccgtta ctgggatggg ccgtggcgtg gctgcattac cctcgatctc catggctgcg 1775401 gcgactggcg ggttgacgcc cgagcagatc atcgcggtcg atggcgccca tctgtggcac 1775461 ccttacagct ccatcggcag ggaagccgtg tcgccggtgg tggccgtcgc cgcccacgga 1775521 gcgtggttga cgctgattcg cgacggccag ccgatcgagg tgctcgacgc gatgagctcc 1775581 tggtggaccg cgatccacgg gcacggccac cccgctctgg accaggcgtt aaccacccag 1775641 ttgcgggtga tgaaccacgt catgttcggg gggctgactc acgagccggc ggcccggctg 1775701 gcgaagctgc tggtcgacat caccccggcg ggtctcgaca cggtgttctt cagcgactcc 1775761 ggctcggtgt cggtggaagt cgcggccaag atggcgctgc agtactggcg cggccgcggc 1775821 ctgcccggca agcgacggct catgacctgg cgcggcggct atcacggcga caccttcctg 1775881 gctatgagca tctgcgaccc gcacggcggc atgcactcgc tgtggaccga cgtcctggcc 1775941 gcccaagtgt tcgcgccaca agtgccacgg gactacgatc ccgcctacag cgcggcgttc 1776001 gaggcgcagc tggcgcagca cgccggcgag ctggccgcgg tggtcgtgga gccggtcgtg 1776061 cagggtgcgg gcggtatgcg ttttcacgac ccgcgctatc tgcacgacct gcgggacatc 1776121 tgccgccgtt acgaggtgct gctgatcttc gatgagatcg ccaccggctt cggccgcacc 1776181 ggcgcgttgt tcgccgccga ccacgccggg gtgagcccgg acatcatgtg tgtcggcaag 1776241 gcgctcaccg gcggctacct cagcttggcc gccaccttgt gcaccgccga cgtcgcgcac 1776301 accatcagcg ccggtgcggc cggggcgctg atgcacggcc ccaccttcat ggccaatccg 1776361 ctggcctgtg cggtctcggt ggccagtgtg gagctgctgc tcggccagga ctggcgcacg 1776421 cgcatcaccg aactggccgc cgggctgacc gccggcctgg ataccgcccg ggcgctgccc 1776481 gccgtcaccg atgtgcgggt gtgcggcgcg atcggcgtca tcgaatgcga ccgaccggtc 1776541 gacctggccg tcgcgactcc cgcggcgctg gatcgaggcg tgtggctgcg cccgtttcgc 1776601 aacctggtct acgccatgcc gccctatatc tgcacaccgg ccgagatcac gcagatcacc 1776661 tcggcgatgg tcgaggtcgc acggctcgta ggctcactgc catgaaagcc gccacgcagg 1776721 cacggatcga cgattcaccg ttggcctggt tggacgcggt gcagcggcag cgccacgagg 1776781 ccggactgcg gcgctgcctg cggccgcgtc ccgcggtcgc caccgagctg gacttggcct 1776841 ccaacgacta tctcggtctg tcccgacatc ccgccgtcat cgacggcggc gtccaggcgc 1776901 tgcggatctg gggcgccggc gccaccgggt cgcgcctggt taccggcgac accaagctgc 1776961 accagcaatt cgaggccgag ctcgccgagt tcgtcggcgc tgccgcggga ttgctgttct 1777021 cctctggcta cacggccaac ctgggcgccg tggtcggcct gtccggcccg ggttccctgc 1777081 tggtgtccga cgcccgttcg catgcgtcgt tggtggatgc ctgtcggctg tcgcgggcgc 1777141 gggttgtggt gacgccgcac cgcgacgtcg acgccgtgga cgccgcgctg cgatcgcgcg 1777201 acgagcagcg cgccgtcgtc gtcaccgact cggtgttcag cgccgacggc tcgctggcgc 1777261 cggttcggga gttgcttgag gtctgccggc gtcatggtgc gctgcttctg gtggacgagg 1777321 cgcacggcct gggtgtgcgt ggcggcggac gcgggctgct ctacgagtta ggtctagcgg 1777381 gtgcgcccga cgtggtgatg accaccacgc tgtccaaggc gctgggcagc cagggtggtg 1777441 tggtgctcgg gccgacgccg gtgcgggccc atctgatcga tgctgcccgg ccgttcatct 1777501 tcgacaccgg tctggcgccg gcggcggtgg gtgccgcacg ggccgcgctg cgcgtcttgc 1777561 aggccgagcc gtggcgaccg caggcggtgc tcaaccacgc tggtgaactt gcgcggatgt 1777621 gcggtgtggc tgcggtgccg gactcggcga tggtgtcggt gatcctgggc gagccggagt 1777681 cggcagtggc cgccgcggcg gcctgcctgg acgccggggt caaggtgggc tgcttccggc 1777741 cgccgacggt gcccgcgggt acgtcgcggc tgcggctgac cgcgcgcgca tcgctgaacg 1777801 ccggcgagct cgagctggcc cggcgggtgc tgacggatgt tctcgccgtg gcgcgccgtt 1777861 gacgatcctg gtcgtcaccg ggaccggcac gggggtcggc aagacggtcg tctgcgcggc 1777921 gctggcgtcg gccgcacgtc aggccggcat cgacgtggcg gtgtgcaagc ccgttcagac 1777981 cggcaccgcc cgcggtgacg acgacctcgc cgaggtcggc cggttggccg gggtgaccca 1778041 gctggccggc ttggcgcgat atccgcagcc gatggccccg gccgccgccg ccgaacacgc 1778101 cgggatggcg ttgcccgccc gcgatcagat cgtgcggctg atcgcagacc tggaccgtcc 1778161 cgggcggttg accctcgtcg agggggcggg cgggctgctg gtcgaactcg ccgagccggg 1778221 cgtcacgctg cgcgatgtcg ccgtcgacgt ggccgccgcg gctttggtgg tggtcaccgc 1778281 ggacctgggc accctcaacc acaccaagtt gacgttggaa gcgcttgctg cacaacaggt 1778341 ttcatgtgca gggctggtga tcggcagctg gccggacccg cccgggttgg tggcagcctc 1778401 gaatcggtcc gcgctggcgc gcattgctat ggtgcgggcc gctctgcccg ccggggccgc 1778461 gtcgctggat gccggggact tcgcggcgat gagcgcggcg gcgttcgacc gcaactgggt 1778521 tgccgggctg gtcggctgat ggtgcattcg atcgagctgg tcttcgacag cgataccgag 1778581 gcggcgatcc ggcgcatctg ggcggggttg gccgccgccg gcatacccag ccaggcgccg 1778641 gccagccgtc cgcacgtgtc gctggcggtg gccgaacgga tcgccccgga ggtcgatgag 1778701 ccgctgggtg cggttgcccg tcggctgccg ctggactgcg tgatcggcgc gccggtgctg 1778761 ttcgggcggg ccaatgtcgt gttcacccgg ctggtggtgc cgaccagcga gcttttggcc 1778821 ctgcatgccg aggtgcaccg gctctgcggc ccgcacctgg cgcccgcgcc gatggccaac 1778881 agcctgcccg gtcagtggac cgcccatgtc accctggccc gacgggtcgg tggtcaccaa 1778941 ttggggcggg cgctgcgcat tgcgggacgg ccgtcgcgga ttgacggtcg gttcgccggc 1779001 ttgcgccgct gggacggcaa cacgcgtgcc gagtacctgc tggggtgagg cgggcccaaa 1779061 aagcttgatg gcgaaggggt ttgatcgcaa cttcgtctta atggccagct cgcgggttcg 1779121 ggcgggtgct ggccaggtgg cgaggacgca cgtcgatgtg gggatgtcca aagatcttcg 1779181 cgggcggcga ttctcacgga tcgtcgtggt tgtcctcgtc gttgtggcgt agcagcttct 1779241 cgtggtggtg gaaggtgttg gtgcggggtt ggccgtggac tgctgaagaa cattccacgc 1779301 caggagatca accatgacca ccacaccagc acgtttcaac cacttggtga cggtaaccga 1779361 cctggaaacg ggtgaccgcg ccgtctgcga ccgcgaccag gtggccgaga cgatccgggc 1779421 gtggttcccg gacgcgccct tggaggtgag ggaagcgctc gttcggctgc aggccgcgtt 1779481 gaatcggcac gagcacaccg gcgagctcga agcgttcctg cggatcagcg tcgagcacgc 1779541 cgacgccgcc ggcggcgacg agtgcggccc ggcgatcctg gccggccgct ccgggccgga 1779601 acaagccgcc atcaaccggc aactcggact cgccggcgac gacgagcccg acggcgacga 1779661 caccccgccg tggagccgga tgatcgggct tggcggcgga agcccagcgg aagacgagcg 1779721 ctgacggtga acaccgcggc aacaggacgc tgggcggtcc cacgggcggg gcatggatag 1779781 cttccggccc atgggccgga agctatctcg gagaaacaaa tggcgccgct ggccgccgga 1779841 tcgcggagct ggagcggccg aaagccaagc agcggcagcg cgaggggcag gatcatggcc 1779901 gccaggctcg atattctggt ttggggccca tgggctacaa accagaatca gagcgtcatt 1779961 cgacgaaaac agacactgct atcggcgcag ccctcggcat ctccgccggc acctaccggc 1780021 ggctcaaacg aatcgacaac gcaacccaca gcgacgacaa agaaatccgc cggttcgcgg 1780081 agaaacaaat ggcgccgctg gtcgccggat cgccgagctg gaacgcccga aagccaagga 1780141 gcgccaacgc gagggtggtc gcctcggtgc atcgatcacc aatgccggct ttggtcccat 1780201 ggaaccaaag ccgtctcagc gccacactga caaggaggta ggcgcagccc tcggcatctc 1780261 cgccggcacc tacaagcggc tcaaacgaat cgacaacgca acccgcagcg acgacaaaga 1780321 aatccgcctg ttcgcggaga aacaaatggc gccgctggcc gccggatcgc cgagctggaa 1780381 cggccgaaag ccaagcagcg gcaacaggaa ggcggcgacc atggccgcca ggctcgatat 1780441 tctggcttgg ggcccatggg ccccaagcca gaatcggagc gtcgttcgac gaaaacagac 1780501 actgctatcg gcgcagccct cggcatctcc gccggcacct accggcggct caaacgaatc 1780561 gacaacgcaa cccgcagcga gttggcgcgt gggcggcccg gcacccctaa gcagaggccg 1780621 cccacgcctg gccctatcct acctacgcgg tagtctccac cttcagaact cgaaacgcgt 1780681 tgcgcaccag cacatctgat ccgaccctga accaggcgaa gaatccgcgc tgcccggtcg 1780741 gccggcgatt cggcccgaac aggtgaggca ccaactccac catggaccca actctgtcgc 1780801 cgatgaggaa ttgcttccag tcgccaagca ccagtggatg attcgtcgct gtcaccgccg 1780861 aatcaacggt gtccatgtgg gagacttcca ggacagactt cccggctagc atcggcggac 1780921 tgtcgtgcag cgatgggaat ttcagcgcgc cattcgaagt ttccgcctgc cgcaacgtgt 1780981 tgatggtgga caagttcgcc gcgaacgcgg cgctggcctg gaaccttggc ggcagcgccg 1781041 actgcaacgc gtaaacatcc gccgccacaa tcgcttctga ccccgcgccg acgaccacct 1781101 gatcggaggt gccggttagc gcgctgacga acccggtggg ctcgccgttg ccggagccgt 1781161 tgacgaacgc cgcggcctgc agttgctcaa cgctgtccgc gagaatcttg ccgatctcgc 1781221 caacgaagct cgccgcgtca ccctccagct cgatggagaa cggaatccag cagcttccac 1781281 ggtagttcgg caccgccggc tgggccaacg ctggcgaatc gtcggacacc tcctgggctt 1781341 cggagtacca acgagcttcg gcgccttcgg aagtcacgcc ccgccaaatc tcggaggtcg 1781401 tttgcaccac cctcgccacc tgccgaatcg ggttcgtcga cccatcaccc gacagcagga 1781461 tcgccgggtc cagcgccgcc gggatcagaa acccgccttg ggtgtccacc aggcccatcg 1781521 ctcgctgctc ggcggccacc gcggcagcct cacgccacgc ggccgcttcc cggtcggtcc 1781581 aaaccgtgtg ccccgcaaca ggattggaaa cccgcttgac gaacgcgccc aaatagtcgc 1781641 ggctgccggt ggccgccagc cagcgctgcg cccacgaggt ggactgcggc ggcccggtgc 1781701 ggcacaaggt ttccgcggtc tccgccgccc gcgacgacat caggccgtct cgcacacaag 1781761 aatccagtgt gcgaaacgcg gtgtcccgca acgagttgcc cggcggcgcg tcgccgtcgt 1781821 cgccgccggt gggagcgccg ggcaccaccc tcagctcacc ggcccggtag cggcgcagcg 1781881 cctcctcggc ttcgcggccg cggcggcgct gctccgcccg cagttcctcg gcgtggcgcg 1781941 tcagcgcctg aaaacgctgc gccgcctcac cggtcaggtc gccggcgaca ctgtcgagga 1782001 gctgcttcgc cgcgtcacgg gtttcaggta aagagaggtt tttgatgtcg tcgaattcgg 1782061 tcatagattg ttcaccaatc gagtagggac agccaggctt cggctgtcga acgggaaacg 1782121 actgtaagcg attccgcgcg caccccggcg atttgtgccc ccgaataggc cggaacgccg 1782181 gttagggaaa cctctaacag cgccgcttcg acgcgcacca gcacatcccc ttcgcgacgg 1782241 tcccggatcg gtcggaaacc caccgaaaac gagtcgacga caccagcttt tacgttcgcc 1782301 aaagcctcgt cgccgtccgg ggtgtccgca atctcgaacg ccccgaacaa gccgtgaggc 1782361 tcctcccgca actcaacggc ccggcccacc gggtagcggg ttcgagcgtc gtgagagacc 1782421 agcagcttca atttgtggcc gcgctcggcg atggagcgcc gaaaagcgcc aggagcgaac 1782481 atttcctgga actcgccgtc gaagtcgcgg acggtggtcg cctcgttgta gggcacgatg 1782541 gtgccgtgca cggttcggcc ttcgccagac cgcagctcgg ccatgcggaa aaggatgcta 1782601 ctcaaaattc ggccaccacc tagcagacgc aagaaacgcg cggaatcgct tgtggcgcat 1782661 ggcggccgct atccgggttc cagccgcccc gcggcgactg cccggcgtca gcggatgccg 1782721 agatgccaaa ctcgattgta tcacacacaa aaggtcatca ccggtccggg gcaaacgggt 1782781 tgagcccgtc gccgtcgtcg cccggcgcca ccgccagtcg ctgctcggcg gccggggtca 1782841 ggccaaactc ggaggccaag cgcagcagat gcatgcgcgc cgtctccgca accgtcaccg 1782901 ccgggttccg gtgcacgaca ccggatttcg gtgaggtaat tgtgaggcct tcggcgcgga 1782961 cccgctgaac cgccgcgacg tagacggacc aggtctcgca gtacgcggac aggagcgccc 1783021 gatcctcagg tttgagcagg tcaagccgct ccaaagtcgg tgcgacgcgc cgccattcgg 1783081 ccagcgcctc ggcgtcgagc cagtccgggg catccggtgc ctgacggata aacttcggcg 1783141 actcggggac tttccggccg ccggaatcgc ggccggggga gcggccctca accagtttga 1783201 gccgggccgg tttcggtggt cttggcatcg gtcctcccat caatttttag tctaggtaat 1783261 gagcgtgcat gcgcgccggc accgtggcgg tgtccgggct gggcctggtc acgatggcga 1783321 ccccgccccc tggtcgtcgt cctgctcgat aggtcgggcg tctcgcagcg ggtcgttgcc 1783381 gggatacgac gcgtggaagg caagccagtc acgatcggca tggacaagaa cattgcctcc 1783441 ggcttcgagg taggcgttct cggtgtcgcc gcggaggata agtgttccgt ctttcgcgaa 1783501 ggattcgagt gcttcgaccg ccgcatcggg tgagcaaccc gtctgcgagc agacaacctg 1783561 gactgccagc tcgtgcatca gttgtcgttc gtcgttggtc aggggccggt tgatcggggt 1783621 cactggtcga cctctatggt gtcgtcggtg ctgtcggcga caccctcggc gattaggaac 1783681 gggcacggct taccgacgtc gacgggacag ttcgcgcgct tccatttgtt gtcggcgaca 1783741 cccatgacgg gttcgccggt ctgcaggttc ggcaggatga tgtacggcac ggcgtcacca 1783801 cagcgtttgc aggtcgccat accgacatcg gcgttgaaag ccatgtcggc gattcgtcgc 1783861 cgcagttcgg cgtggtcggg ggtttcagcc atggcttgtg tcctttcaag cagggttggt 1783921 aagtgcggtt ctggcggcat tgagctgctg ttgcagtatc gggcatccgg ttggggcgtc 1783981 ggggtgcagc actttggata aagccctgta cacggcgggt gtccgctggg gtccgaccgc 1784041 ccggaacaac gctttggccc agtcggtgca ctgctgttgc gccgggtcgg cgggtccggt 1784101 gacggtgtgg ccgtggtagc gcagctcggc ggccagcagt ggggtccagt cagcgtcgat 1784161 gaaccagcag cgggtgtgcg cggaccagga gcgggcatag gcggggatcg tggacttgat 1784221 caacgacacg atcgcagagt cgtaggcgaa tcggacgctg tgccgaccgc cggatgccgg 1784281 ggtgatcgcg acagcggtca tgcggcaccg ccggtgtttg ctggtttggc ggtacagtcc 1784341 gcccggtggc ggccgtgaac ggccaggtag gtaccgcatc ctgggcagaa cggcacggca 1784401 ggccgtgacg catgtgacgt ttgcgcggta taaccgccat acgtgcgcgc gcgtaggcga 1784461 acctggaaat gcgtcacatg cgtcacgtta ggtgtgctaa tcatcgaaat catcggcccc 1784521 tctcaccgct attccggccc gccaacgacc atcacgggcc ttgtcagtga ccgggtatcc 1784581 gtgggtgtcg agcgactggc cgaacgcttt gcgcgagatt tcgggtacgc cttcttgcac 1784641 ccgccacctt tgccacgcct cgaacagatg cgtagtagtg gctttcagca ccggcgagct 1784701 ggtgacgcat tcgtcgtcga tgaacctctt tatcgtgtcg gagtcctcgc ggtaattcga 1784761 cgttgccgcg agcaccgcgt ccggctggga tagtccgatt cgctgatagt cgctccatcc 1784821 ggccaccgcc caggacagga tgctgtcggc ctccaactgc aaccgtgcgt ccagttcccg 1784881 gtcctgctcg tcggcaggaa tcactacttc aaacggcacc actcgaattc gccgccagat 1784941 ggccgtatca tcgccgggca ctctcggtag gtggttggtg atgagcagtg gggtatgtga 1785001 cggcgtgaat tccacgaagt cttgccgcat ctttcgggcg cggatggtgt cgccgccagt 1785061 cagccgtttt atcgttgatt cggccagccg gcgatctttt tcgctctcgg ataccgctac 1785121 ccatcgcacg ccgcggaggt ccatttcgcc tgttgggtga gcgttttccc ggtgcatgaa 1785181 aaggtcaggc tcagcggtgc aggcataatc gccaagggca tagcgaatcg ccttgtcgaa 1785241 cacagatttt ccgttggcac ctacaccgat aagaatcgcc aggacatgtt cgcggacggt 1785301 gcctagtagg ccgacgccgg ccaggcgttg cacgaacccg cgcacacctt catcgggcag 1785361 aacgcgggtc aagaacgctt gccagagagg cgattcggtg tcggactggt aggcaccgcg 1785421 gcatatcttt gtgatgcggt cagcgggcgc gtggggccgc aatttgagcg tgtgcaggtc 1785481 cagcgtccca ttcgcgacgt tgagcaagtg cgggtcgctg tcgaggtcgg ctaccgtcgc 1785541 ggcgaatggt accagtgcgg cggccaggtc gagcacgccg gccacgccgg acgccgattc 1785601 gcattttcgg acgtcggcgc gtaattcctt gtcgttgagg ctgtctgaga gcgcttggcg 1785661 cagctctgcc agcactgcac gtttggcttc gccgcggtcg tcggctgccc agcgtctgcc 1785721 gtcccaggag tgccagccga tcccggccac gtgcagcagc ttgtcctggt aacgttcggc 1785781 tagccggtag gcgattcggg cttggccgcg atgaacttgc gtcggtttgc caccgtcgtc 1785841 gatgagcacg tgcccgtccc ggtcgatcca gggggcgtcg ggatagtcgg tgccgtaggg 1785901 gatgtcggcc atcacgccac ccccgcccgc gggatgtaca cgccgcgccg tcggacgatc 1785961 tcgcgggcga tgccgggcca gtcggcggcc gcagatacgt cacgtgacgc ctgcgccatc 1786021 gcctcctggc acgtctctac cctcagagcc cagtgccggg ctgcgtcgca gatcgcggcc 1786081 catttgcgag gatcggcgtc gtcgagctga cgccaggccg gtgtgccggc catcggccac 1786141 gacccggcag catccaggac cggcgcgaca tgctcgtgca ccgaccacca cgacacggcg 1786201 cgtgacgcgg taggatcggc gctagacggt gtggcgactg tcgcgggtgc ccggtcctcc 1786261 gtggccgggc atcgtcgcgt cggcggcgac ccgccggcgc cggcggtcat cgggcaccgc 1786321 ctgaccgccg cacggggcgc agcagctcgg ccagccgggt gcgctgctcg tcagtcaggg 1786381 gcggcgctgc ggcgagggtg cggatgaggt agtccgcgat gttcgcggca acgagatcgg 1786441 ttttcgcggc gatgaactcg ggatcgtcgg atgcgcggga acgagacagt gcggctacgc 1786501 ggccgcgatg atggtagatg gtcgacacgt gcgactcctt ggggacacca aaaccccgga 1786561 gtcgaagccg gctacgtcgg agtctagcag ctaccacgcg ttggggtggc gcgtagtttg 1786621 ttcggcgtgt cgctttcgca gagcgtgcgc cacagccaca tggcgacgac caccgcgtcc 1786681 gactgcaccg caaaacccgg tgcgtagtcg gggttgccgg ccagtccggt cagtagccgc 1786741 ggaacgttct cgcacagtgt ttgcaggtcg ccgttgcgaa cccgcacggg ccgctgctcg 1786801 cacgtgcgca cgccggcggc gccgtacctc aggcaggcga attcccagtg cagccgcacc 1786861 cacccgtcgc ggcgcatttc aggtcgtagg cgagactcgt tgggcggcaa gccaataacg 1786921 gctcgcggct ggttgtcgtc gtcgagtaat gttgccaccg ctggcgcctg cggtaaccag 1786981 ccctgggtct cgtcgaccca aatgatcggc gcgacaagat cgcagcgctc gtgttcgatg 1787041 cggccatcgc ctttgcggcc gtggtcacag acgatcacga tgttgtggtg ccggctcatc 1787101 gccaattcac ctgcacccgt tcgggattga atatcctgcc gctcttgccg accggctgga 1787161 caacgacttc agcgaggacg tcgaggacgg cgcggaaccg gtccggcgac agctcggcta 1787221 tcatcccggc gacttgcggt gttcccaacg gtatcccgtc gaacactcgg agccgttcct 1787281 gatcctgttg gcgggcctga agtttcgtta tcttggcgtt gacgatgtcg gtgctgatct 1787341 tcacctggcg cgcggtcagt agcccttcgg cgcgttcgac ggcgagcctg tccagctccc 1787401 cgtagagggt ttccagttcc aggcggatgg tttcggcttc ggcggcgtcg tgaatctccc 1787461 ggcgcaacaa gtcaacggcg tcgggcatgg ccagccgctc ggccacgatg tgatacagga 1787521 tcggttcgat gttgtcggcc aggatggcca ccccgtggca cgccttgcac acgtagacga 1787581 cctggccgtc ggtgcggtag ctgccggcca ggtggttgcc gcatttgccg cagcctgcca 1787641 gcccggtcag caggtggcgg cgcacgcttt tgcggccggg ggcgcggccg ggggcgtcca 1787701 gcacggcctg ggcggcccag aacgtcgcct cgtccaccag cggcgaccac tgggccttgc 1787761 cgacaatcgc gtcgcggtcc accgggccgt agcgggcacc cttatatgcg cgtagtccgg 1787821 cgttgcgggg tttgcgcaag aatttcgaca gcgttgtagt cgtccacggg cggccggtga 1787881 tggtgaacgc cccggcgtcg ttccactggc ggcacacgtc gcccagggac gccccggcga 1787941 ggatgtcggc gtaggcctgt ttgaccagcg gcgctgtccg ggggtcgggt tcgggaccgt 1788001 tggggccggg caggtagccg aaggctttcg accagttggg gtggccgcgt tcagctttct 1788061 ggcgggcggc gcggcgctgt cgtgccttct tgtgctcggt ttcgtgagcg gccaccgacc 1788121 ccttcaggcg ggcgactagc cggccctggg gtgtcgccag gtcaacgtcg ccggcgacgg 1788181 tggccagggc cagccgcttc tcgtcggcta atgacatgaa ggcttccagc tcgatgggac 1788241 ggcgatggag ccggtccagg tcccaggcca ccacggcggc gatcttgccg gcggtgatgt 1788301 cggccaacat ctgctcgtag gcggggcggc gcttgccggt tgatgcgctg acgtcgttgt 1788361 cgaggtactc gacgggcacc cattttcgct gcccgcacag ctttaggcag tcctcgcgtt 1788421 ggcgggccac gccgagctgt tcgccggagc ggtcttctga gattcggagg tagacagcag 1788481 cacgcacagg tgtagtgtat ctcacaggtc cacggttggc cgtggtcgag gtggggtggt 1788541 ggtagccatt cggtgtggcc gtgggtgttg ttgtgggtgg tccagccttt ttcggcgagt 1788601 cggttgtcgg ggccgcaggc cagggtcagc tcggtgatgt cggtgcgtcc ggtgctggtc 1788661 caggcggtga cgtggtgggc ttggctgtgg taggccggtg cgtcacagcc gggtttggtg 1788721 cagccgcggt cgttggcgaa cagcatgatc cgctgggccg gggaggctag gcgtttggtg 1788781 tgatacagcg ccaggggtgt gccgtggtcg aagatcgcct gggggtacct cccgcttgcg 1788841 ggggagtagt ggtgggcgtg gctggtcatg cggatcacat cggccatggg tagcagggtg 1788901 ccgccgccgg tgaagccctt gccggcgccg gtttgcaggt cggtcagggt ggtggtgacc 1788961 acgatcgaga cgggaagacc gttgtgttgg cccagtttcc cggaggcgat cagcgcgcgc 1789021 agcccggcca gcagcccgtc gtggttgcgt tgggcttggc tgcgggtgtc gcggtcgatg 1789081 gcggccgcat cgggggtggt gtcgatgacc ggggtgtggt cgtcggggtt ggtcgcgccg 1789141 ggggcggcca gtttggctag cacggcttca aaggtggccc gcgcttgggg ggtcaggtag 1789201 ccacttagcc gtgacatgcc gtcgtattgc tggttgctca gggtgatgcc gcgtttgcgg 1789261 gcgcgttcgg tgtcggtgag gtcgccgtcg gggtgtagcc agtccatgac ccgctgggcg 1789321 tagcgggcca gctcgtcggg acgatattga gcggctttgc cggccaggtc ggcttcggcg 1789381 gcctggcggg tggacacatc caccgcggcg ggcaggtggg cgaaaaaggg cgcgaatcac 1789441 tttgacgtgc gcctcgccga tcaggccctg gcgttgggcg gtggcggtgg cggtcaactg 1789501 tggggctagc ggttcaccgg tgagtgctcg acgaggtccg agatcggcgg cgtcggcgat 1789561 gcgccgggcg gcgtcgggct tggtgatgcg taaccggttg gccagcgcgc agcacagcgt 1789621 gccgcccagt tcttcctcgc tggcttgggc gtcaagttgg ttgatcaacg cgtgacccac 1789681 cgccggtagc cggcgcacca agcattccag acgttccaga gaccgcagcc gttccggggt 1789741 ggtcaacacc tcaaaagaca cctcgtccaa gcggtccagc tcggcatcca gcgcatcaaa 1789801 gacctcgaca agctcctccc ggctattcgc taacatgttc gaatcataac gtcgggcact 1789861 gacaagaagt cgcgccgaca gctgctagaa ctggtgttag ctaagtgaat tcagtgactc 1789921 gagagccctc gcgagcttgg ccgcccacca ggtcggcggg gatgcctacc aggattcgat 1789981 cccgccaacc ggcaatctga ccaaccgggc ataacccccg ccggtgaacc gcagtttagt 1790041 gagcggcttg aggttgcggg atcgacgatt cggcgtctgg gccgctgtgt gggatgcctg 1790101 gcgggtcgag tgcgagtgct gatagctggg ccgctgccaa cgatccgtga cctccgccca 1790161 cgtcgcgttt gtccccgtgc gcaccgctac cgtagcctga acaccgtttc attcaggccg 1790221 ccgagcaggc ggcggatggg ttccgcgcgt gcggagatga cgaaggatgc aggggagtac 1790281 ctggtgacgc aagcggcaac gcgaccgacg aacgacgccg gccaggatgg cgggaacaac 1790341 tcggacattc tggtggttgc ccgccaacag gtgctgcagc gcggtgaggg cctgaaccag 1790401 gaccaggtgc tggcggtgct gcagctaccc gacgaccggc tcgaggagct gttggcgctg 1790461 gcccacgagg tgcggatgcg ctggtgcgga cccgaggtcg aggtcgaagg catcatcagc 1790521 ctgaaaaccg gtggctgccc ggaggattgc catttctgct cgcaatcggg gctgttcgcc 1790581 tccccggtgc gcagcgcctg gctggacata cccagcctgg tcgaggcggc caaacagacc 1790641 gccaagtccg gcgccaccga gttctgcatc gtggccgcgg tgcgcggacc cgacgagcga 1790701 ttgatggccc aggtcgcggc cggcatcgag gcgattcgca acgaagtcga gatcaacatc 1790761 gcctgctccc tagggatgct gaccgccgag caagtggacc aactggcggc gaggggggtg 1790821 catcgctaca accacaacct cgaaacggcg cgctcgttct tcgccaacgt cgtcaccacc 1790881 cacacctggg aagagcgctg gcagacgcta tcgatggtgc gtgacgcggg catggaggtt 1790941 tgctgcggcg gcatcctcgg catgggggag acgctgcagc agcgcgcgga attcgccgcc 1791001 gagcttgccg agctgggccc cgacgaggtc ccgctgaact tcctcaaccc gcggcccggt 1791061 accccgttcg ccgacctgga ggtaatgccg gtcggtgacg cgctcaaggc ggtggccgcc 1791121 ttccggttgg cgttaccgcg caccatgctg cggttcgccg gtggccgcga gatcaccctg 1791181 ggtgacctcg gcgccaagcg aggcatcctg ggcggcatca acgccgtgat cgtcggcaac 1791241 tacctgacca ccctcggccg gcccgcggaa gccgacctgg aactgctcga cgagctacag 1791301 atgccgctga aggcactcaa cgccagcctg taaatggtgg aaatcgtggc tggaaaacaa 1791361 cgcgctccgg tcgctgccgg cgtgtacaac gtgtacaccg gggaactggc ggatacggcc 1791421 acgccgacag cggctcggat gggtctggag cccccccggt tctgtgcgca gtgcggtcgc 1791481 cggatggtcg tccaggtccg gcccgacggc tggtgggcgc gctgttctcg ccacgggcag 1791541 gtggactcgg ccgacttggc gacacagcgg tgaccgagcc acccggtttt ggcggaccgt 1791601 ccgagccttc cggtgcaccg cggacgtcgc ggacacgggc ggtcctgttt gtgatgctgg 1791661 gtctgtcggc gaccggtgtg ttggtcggtg gcctgtgggc gtggatcgcc ccgccaatcc 1791721 atgccgtcgt ggccatcaca cgcgcgggtg agcgggtgca cgagtatctg ggcagcgaat 1791781 cccagaactt cttcatcgcg ccatttatgc tgctggggct cttgagtgtg ctggctgtcg 1791841 tggcatcggc attgatgtgg cagtggcgag agcaccgcgg accgcagatg gttgctgggc 1791901 tgtcgattgg gctgacgacc gctgcggcga tcgcggcggg agttggcgcg ctggtggttc 1791961 ggttgcgcta cggtgcgttg gactttgaca ccgtgccact ttcccgcggc gaccacgccc 1792021 tgacgtacgt cacccaggcc ccgccggtgt ttttcgcccg ccggccgctg cagatcgccc 1792081 tcactctcat gtggccggct ggcatcgcgt cgctggtata tgccctgctt gcggccggga 1792141 cggcgcggga cgacctgggc ggctatccgg ctgtcgatcc gtcgtcgaac gctcgtactg 1792201 aagccctgga aacccctcag gccccggtgt cctaggagag tcgcagccgc ccgccggcat 1792261 ccggagcgga ccgtgtctcc ggtcgggtgt cagcgcttgg attcaagcgg cagatcgtcg 1792321 aactggttta agtctggcgt gacgaggttg tgtgccaggt ccgagttcgc gccggtatgc 1792381 gcagagcgca ttggccaggt cagagcggac ggcggctcaa cttcctgccg gtgatcacct 1792441 tggccgcgat cacggccagt ctcgccatgc cggcgtaggt catcgggttg aagatggtcg 1792501 gccacgtggt ccggacgcgg tggtcggtca gtggcttgcc ggcgaaccgg tcggtgagcc 1792561 agcgaagcgt cattggggcc gacagcgggt gcagggacac atgttcgctg aacaggtcgc 1792621 ggtggtaggt gacgttggcg ccgccggctg tatagctgtc agcgagcgcg tcgatgtcag 1792681 agacgtcgat gaggtagtca tgcacggcct gcacgatcaa taccggcggg gtgggcaccg 1792741 cgctacccag cttggtgtcg ccgaagacat gggaaatttc cggcgtcgac agaatgtcct 1792801 caaggggttc gtcgaggaag tcacccatgt ccctgccggc catccggatc actgcgtcta 1792861 ccgttgtcat ctccgtcagt tgctccagca gctgacgtcc ttcgtcgttg gcgtgctcct 1792921 tgatcacccg ggccaggccg gggtagctgt gttgcagcgc ggccaccacc aacgcgggca 1792981 gaccggcaag aagagtgcca ttgagccggc ggaacgtgtg accaaggtca ccgacgggtg 1793041 atcccagcac ggcgccgacg atgtctaggt ccggtgcgta ctcgccgcat gcttcggcgg 1793101 cccacgcgct ggccagcccg ccgccggagt agccccacag cccgatcggc gttgccgggg 1793161 acaacccgac acgctcggaa ttcaaggcag cccggattcc gtcgaggact cggtaaccgg 1793221 gttcatacgg cgacccccac agccctttcg gcccttcatg gtcgggtact gataccgccc 1793281 atccttcggc aagtgcggcg ctgatcatca acagctccat ttgggtcagt gaccccaggg 1793341 ccttggcccg tcgtcgcagg gcatatgacg gaaaacagcg cgacgacatg gcatcgatcg 1793401 cacactggta cgacagcaag gggcaggtct gacccggggc aagctccgct gggacgatca 1793461 ccgtggtcac cgtcgcctcg gggttgccgt acatgttcgt ggtccggtac agcagctggg 1793521 tagcggtgac gggctgcgga atcaagccca taaacgccag ttcgacatcg cgcgagcgca 1793581 acaccgttcc gggcacggca tgctggtagc cggcaggtgg gaagtagaac ggatcgtcgg 1793641 atggcagcag cgggcgcact ttgcgctgca attcctcgtg cggtggccgg ccgatccatt 1793701 cggcgccggt cgcgcctgcc aaattgccgg gctctaccat taggctccct tcatggccat 1793761 ccggcatcct cgcgcgtgat cggtccctga cggggtagca gcgcggtttg cctgtcgcag 1793821 ttcagcgccg gcactcaagg tcagcgtcgg cactcgaatg gcgccagcgg ctcttatccg 1793881 gctcttaaag tctcatacaa gttacaggat ccaagggccg actccgaggc cagcgcggcg 1793941 tggcgcctat cacaggttgg gtacgccgag ttcccccatc gctggtgcga ccagattcaa 1794001 agctggccgg gaggccgcag tgcggcgaac tcgtcagtga ctcttagctg cgagtcggta 1794061 aaccggtaca acgccgccgg gcggccaccg ctgcggccgg actgcgcgat ggttccggtt 1794121 tgggtgatga ctctgcgacg ggccagtacc cgctgcaggt tggttgcgtc gacctggtag 1794181 cccagtgcgg cgccgtagat gtcgcgcagc gttgagagcg cgaattcttt tggggccaaa 1794241 gcgaatccga tgtttgtata ggacatcttg gcaatcagcc gggtgcgggc atgggtcacc 1794301 atcggaccgt gatcgaacgc cattggcggc aaggaactca ccgggtgcca gcgggtgtct 1794361 gctggcagct cgggggtggc gggggagggc accaccccca ggtaggtcga cgcgatcatc 1794421 cggatgcctg gcagccggtg tgggtcggaa aacaccgcga gctgttctag atgggccaac 1794481 tctcgcaggt cgactttctc ggccagttgg cgccgaaccg agctggtcat gtcttcgtcg 1794541 ttgcgtagcc gtccgcccgg cagcgaccac gcgccgcgct gcggctcctt cgcacgttgc 1794601 cacagcagca cattgagctg gggttttgcc gcaccgcggc tcatgccaac tccgcgcact 1794661 tgaaagacga cggccagcac ttcgtgggcg gtgctaccat gggccatgtt ttcgattata 1794721 agtcgaaaac ctgttggagc gcggaagggg cggcaatgac tgtgctgaat cgcacggaca 1794781 cgctcgtgga tgaactgact gccgacatca ccaacacacc gctcggctac ggcggggttg 1794841 acggtgacga acggtgggcc gccgagattc gccgtctggc gcatttgcgc ggggccaccg 1794901 tcctggcgca caactaccag ctgcccgcga tccaggacgt tgccgaccac gtcggggatt 1794961 cgctggcgct atcgcgggtg gccgccgagg caccggagga caccatcgtg ttctgcggag 1795021 tgcacttcat ggccgagacc gccaaaattc tcagcccgca caaaaccgtg ctgatcccgg 1795081 atcagcgggc cggctgttcg ctggccgatt cgatcacccc cgacgagctg cgcgcctgga 1795141 aggacgagca tcccggcgcc gtcgtcgttt cctacgtcaa caccacggcg gccgtcaagg 1795201 cgctcaccga catctgctgc acctcgtcaa acgccgtcga cgtggtcgca tccatcgatc 1795261 ccgaccgcga ggtgttgttc tgtccggacc aattcctcgg tgcacacgtg cgccgggtga 1795321 ccggccgcaa gaacctgcat gtgtgggccg gcgaatgcca cgtacacgcc gggatcaacg 1795381 gcgacgagct cgctgaccag gcccgcgcac atcccgatgc cgaactgttc gtgcatccgg 1795441 agtgtggttg cgcaacctcg gcgctatacc tcgccggcga aggagcattc ccagccgagc 1795501 gggtaaagat cttgtccacc ggcggcatgc tcgaagcggc gcacacgacg cgcgcccgcc 1795561 aggtgctggt cgccaccgag gtcggcatgt tgcaccagct tcgccgggcg gcaccggaag 1795621 tcgactttcg cgcggtcaac gaccgcgcct catgcaagta catgaagatg atcacccccg 1795681 cggccctgtt gcgctgcctg gtagagggtg ccgacgaagt ccatgtcgat ccgggaatcg 1795741 ccgccagtgg gcgtcgcagc gtgcagcgga tgatcgaaat cggccatccc ggcggtggcg 1795801 aatgatggcc ggtcccgctt ggcgggatgc ggccgatgtt gtcgtgatcg gcacgggcgt 1795861 tgccgggctg gcggcggcat tggccgccga tcgcgccggg cgcagcgtcg tggtgctcag 1795921 caaggctgcc cagacgcacg tgaccgcgac acactacgcg caaggcggta tcgcggtggt 1795981 gctgccggac aacgacgact cggtcgacgc tcacgtcgcg gacaccttgg ccgcaggcgc 1796041 gggcctatgc gatcccgatg cggtgtactc gatcgtcgcc gacggctacc gagcggttac 1796101 cgatttggtc ggagctgggg cacggttgga tgaatcggtc ccgggccgtt gggcgttgac 1796161 gcgcgaaggc gggcactcgc ggcgacgcat cgtgcacgcg ggtggcgacg cgaccggcgc 1796221 cgaggttcag cgggcgctcc aggatgccgc cgggatgctc gatatccgca ccggccacgt 1796281 ggcgttgcga gtgctgcacg acggtaccgc ggtgaccggg ctattagtgg tcagaccgga 1796341 cggatgcggc attatcagcg ctccgtcggt gatcctggcc accggcgggc tcgggcacct 1796401 gtacagcgcg accaccaatc cggcgggctc caccggcgac ggcatcgccc tgggattgtg 1796461 ggcgggcgtc gcggtcagcg atctcgagtt catccagttc caccccacga tgctttttgc 1796521 cggacgcgcc gggggtcggc ggccgctgat caccgaggcc atccgcggcg agggtgcgat 1796581 cttggtggac aggcaaggca attcgataac ggcaggcgtg catccgatgg gtgatttggc 1796641 gccgcgcgac gtcgtcgccg ccgccatcga cgcgcggctg aaggccaccg gcgatccgtg 1796701 cgtctacctc gacgcccgcg gcatcgaggg cttcgcgtcc cggttcccga cagtcacggc 1796761 atcctgccgg gctgccggca ttgaccccgt ccggcaaccg atcccggttg ttcccggtgc 1796821 gcactacagc tgcggcggca tagtgaccga tgtgtacggc cagaccgagc tgctcgggtt 1796881 gtacgccgct ggcgaggtgg cccgcaccgg gttgcacggc gccaaccgcc tggcctccaa 1796941 cagcttgcta gagggtttgg tggtgggcgg ccgcgccgga aaggccgccg ccgcccacgc 1797001 cgcggcggcc gggcgttcgc gtgcgacctc gtcagcgacc tggcccgaac cgatcagcta 1797061 caccgcactg gaccgcggcg acctgcaacg ggcgatgagc cgggacgcgt cgatgtaccg 1797121 cgccgccgcc gggctgcacc ggctgtgcga cagcctatcc ggagcacagg ttcgcgacgt 1797181 ggcttgtcgc cgcgatttcg aggacgtggc gctcacgctg gtcgcgcaga gcgtgaccgc 1797241 cgccgccttg gcccgcaccg aaagccgtgg ctgccatcat cgcgcggagt acccgtgcac 1797301 cgtgccggag caggcacgca gcatcgtggt ccggggagcc gacgacgcaa atgcggtgtg 1797361 tgtccaggcg ctagtggcgg tgtgctgatg gggttatccg actgggagct ggctgcggct 1797421 cgagcagcaa tcgcgcgtgg gctcgacgag gacctccggt acggcccgga tgtcaccaca 1797481 ttggcgacgg tgcctgccag tgcgacgacc accgcatcgc tggtgacccg ggaggccggt 1797541 gtggttgccg gattggatgt cgcgctgctg acgctgaacg aagtcctggg caccaacggt 1797601 tatcgggtgc tcgaccgcgt cgaggacggc gcccgggtgc cgccgggaga ggcacttatg 1797661 acgctggaag cccaaacgcg cggattgttg accgccgagc gcaccatgtt gaacctggtc 1797721 ggtcacctgt cgggaatcgc caccgcgacg gccgcgtggg tcgatgctgt gcgcgggacc 1797781 aaagcgaaaa tccgcgatac ccgtaagacg ctgcccggcc tgcgcgcgct gcaaaaatac 1797841 gcggtgcgta ccggtggcgg cgtcaaccat cggctggggt tgggtgatgc cgcgctaatc 1797901 aaggacaacc acgttgccgc cgccggatcc gtggtagacg cgctacgtgc ggtgcgaaat 1797961 gctgcacccg atctgccgtg cgaggtggaa gtggactcgc ttgagcagct cgatgccgtg 1798021 ctgccggaaa aacccgagct gatcctgctg gacaattttg cggtgtggca gacgcagacc 1798081 gcggtgcagc gtcgggactc gcgcgcgccc accgtcatgc tggagtcatc cggtgggctc 1798141 agcctgcaga cggcggcgac ctacgccgaa accggggtgg actacctggc ggtcggggcg 1798201 ctcacacact cagtgcgcgt gctcgacatc ggcttggata tgtagccggg cggccccggc 1798261 gcccattagg cggcgccgga tagggtaggc gccgtggcgc gaacgttcga agatctcgtg 1798321 gccgaagccg catcagcatc cgtcggcggc tggggttttt cctggttgga cggccgcgcg 1798381 accgaagaac gcccgtcatg gggctatcaa cgacaactca gtcagcggct ggcgaacgcg 1798441 acggctgcct tagatcttga gacaggcggc ggagaggtgc tagccggcgc gggcaacttc 1798501 ccgcccacca tggtcgctac cgaagcgtgg ccacccaacg cggctatggc cactaggcgg 1798561 ctgcatccgc tgggcgcggt cgtcgtcatc accggcgata aaccgccact gccctttgcc 1798621 gatgcggcgt ttgacctggt gaccagccgc caccccagca cccgatggtg gaccgagatt 1798681 gcccgggttc tccgggctgg cggcagttac ttcgcccaac acgtcggacc ggccacgctg 1798741 tgggacctgc gcgagcattt cctcgggccg cgagaacaca acggggccga tcagtacgcg 1798801 caggttgtgc gcacctgcat caccgacgcc ggcctcgaga tcgtcgacct gcagatggag 1798861 cggttgcggg tggaattctt cgacgtcggt gccgtcatct actttctgcg caaggtgatc 1798921 tggtttctgc cggacttcac cgtcgagggc taccacgatc ggctgcgtgc actgcatgag 1798981 cgcatccagg ccgaagggcc cttcgtcacc tactccaccc gcgcgctcat cgaggcccgc 1799041 aaaccgtcct gacgtcggcc ggggccttag gctcaggcga tatcgccgac gaagaccccg 1799101 atccggcgca gctgcaagcg cgccatccgc ggcagcccct gaccgtcgga gccggcgggc 1799161 acaatccggg ggttgatgac gtgaacaacg ccgcgggcga agtgcacgtc ggcttcgccc 1799221 gcggccaaca cgttcttgac ccaatccgtc ttaccgtgcg cgagcgcgat cgccagcaca 1799281 ccgtccttgc ggtaggcggt cacaatcgtt tggtatggct ttccagactt gcgaccgcgg 1799341 tgctcgatcg tggccgttcc gggtaggtag cgcgctatcg gtttgagcgc ccggttgatg 1799401 tacttgacct gcagacgctc gagccagagc gggaacacca tcggaacgcc cggggcgtta 1799461 ttcgggtgat cctttgcgga catggcggct cctctttgcc ggtcctttct actgcactgt 1799521 accggtcaga tatcgacttg agctgctctg ggagaatggt ctacgtgacc gcgccgccgc 1799581 ccgtgcttac ccgtatcgac ttgcggggag ccgagttgac agctgccgag ctgcgggccg 1799641 ctctgccacg cggcggcgcc gatgtggaag ccgtgctgcc gacggtacgg cccattgtgg 1799701 cggccgtcgc cgagcgcggg gccgaggccg cgctggactt cggcgcatcg ttcgacggtg 1799761 tgcggcccca tgccatccgg gtgccagacg cagcgctgga cgcggcgctg gccggactgg 1799821 actgcgacgt ctgcgaagcg ttgcaggtga tggtcgagcg gacccgcgcc gtgcactccg 1799881 ggcagcgtcg caccgacgtc acaaccacac tgggcccggg cgcgacggtc accgagcggt 1799941 gggttccggt cgagcgggta ggcctgtacg tgccgggggg caatgcggtg tacccatcca 1800001 gcgtggtgat gaacgtggtg cccgcccaag ccgcgggcgt cgactcgttg gtggtagcca 1800061 gcccgccgca ggcgcagtgg gatggaatgc cgcatccgac cattctggcc gcggcccggc 1800121 tgctgggcgt cgatgaggtc tgggcggtcg gcggcgctca ggcggtggcg ttgctggctt 1800181 acggcggcac cgacaccgac ggcgcagcac tgacaccggt cgacatgatc accgggcctg 1800241 gcaacatcta tgtcacggcc gccaagcgac tgtgccgttc gcgggtgggc atcgacgccg 1800301 aagcggggcc aaccgagatc gctatcctcg ccgatcacac cgccgacccg gtgcatgtgg 1800361 ccgccgacct gattagccag gccgaacacg acgagttggc tgccagcgtg ctggtcactc 1800421 cgagtgagga cctggccgat gccaccgacg ccgaactggc tggccagctg cagactacgg 1800481 tgcaccgcga acgggtgacg gccgcgctga ccggacgcca atcggcgatc gtcctggtcg 1800541 acgacgtgga cgccgccgtc ttggtggtga acgcttacgc cgctgagcat ttggagattc 1800601 agaccgccga tgccccgcag gttgccagcc ggatccgctc ggcgggagcc attttcgtcg 1800661 gcccgtggtc cccggtgagc ctcggcgact actgcgcggg atccaaccat gtactgccga 1800721 ccgcgggctg cgcccggcat tccagcggcc tgtcggtgca gacgttcctg cgcggcatcc 1800781 acgtcgtgga atacacggag gcggccctca aagacgtttc cggacacgtg atcacgctcg 1800841 ccacggccga ggacttgccg gcgcacggtg aggcggtacg gcggaggttc gagcgatgac 1800901 caggtccgga cacccggtta cattggacga cttgccgctg cgcgccgact tgcgtggtaa 1800961 agcaccatac ggtgcaccgc aattagctgt tccggtacgg ctgaacacca acgagaaccc 1801021 gcacccgcct acccgggcgc tggttgacga cgtggtgcga tcggtgcggg aagcggccat 1801081 cgacttgcac cgctaccccg accgcgacgc cgtggctctg cgtgctgact tggccggcta 1801141 tctcaccgcg cagaccggaa tccagcttgg tgtcgaaaac atatgggctg ccaacggttc 1801201 caatgagatt ctgcagcaac tgttacaggc gtttggcggt ccggggcgta gcgcgatcgg 1801261 tttcgtaccg tcctattcga tgcacccgat catctccgac ggcacccaca cggaatggat 1801321 cgaggcgtcc cgcgccaatg acttcggtct cgacgtggac gtcgccgtcg cggctgtggt 1801381 cgatcgcaaa cccgatgtgg tgttcattgc tagccctaac aacccgtccg gacaaagtgt 1801441 ttcgttacct gacctgtgta agctgctgga cgttgcgccc ggaattgcga tcgtcgacga 1801501 ggcctacggc gagttctcct cgcagcccag cgcggtgtcg ctggtcgagg agtatccgag 1801561 caagctcgtc gtcacgcgca ccatgagcaa ggcattcgct ttcgccggcg gcaggctcgg 1801621 atacctgatc gctacgcccg cggtgatcga cgcaatgctg ctggtgcggt tgccgtatca 1801681 cctgtcgtcg gtcactcaag ccgcggcccg ggccgcgctg cggcactccg acgacacctt 1801741 gagcagtgtc gccgcactga tcgccgaacg cgaacgcgta acaacctcat tgaacgacat 1801801 gggttttcga gtcatcccaa gcgatgccaa cttcgtgttg ttcggcgagt ttgccgatgc 1801861 gccggccgcc tggcggcgct atctggaggc cggcattttg atccgcgacg ttgggattcc 1801921 cggctatctg cgggccacca ccgggctggc tgaggagaac gatgcgttcc tgcgggcaag 1801981 cgcccggatc gccaccgacc tggtccccgt cacccgcagt cctgtaggag cgccatgaca 1802041 accacccaga cagccaaagc tagccggcgg gcgcgtatcg aacggcgtac ccgcgaatcc 1802101 gatatcgtca tcgagctcga ccttgacggt accgggcagg tggccgtcga caccggtgtt 1802161 ccgttctacg accacatgtt gaccgcgctg ggcagtcacg ccagcttcga cctcaccgtg 1802221 cgcgccacag gtgatgtcga aatcgaagcc catcacacca tcgaggacac ggcaatcgcg 1802281 ctgggcaccg cgctcgggca ggccctaggt gacaagaggg gcatccgccg gtttggcgat 1802341 gccttcatcc cgatggacga aacactggcc cacgccgccg tcgacttatc cggccgcccc 1802401 tattgcgtgc ataccggaga gccggatcac ctgcagcaca ccactattgc cggcagttca 1802461 gtgccctacc acaccgtcat caaccggcac gtgttcgaat cgttggcggc caacgcccgc 1802521 atcgcgctgc acgtccgcgt gttgtacggg cgcgacccgc accatatcac cgaagctcaa 1802581 tacaaggccg tcgcgcgcgc gttgcgtcaa gcggtcgagc cagatcctcg ggtgtcaggc 1802641 gtgccgtcca ccaaaggtgc tctgtgacag caaaatcggt tgtagtcctt gactacggct 1802701 caggaaacct gcggtcggcc caacgtgcgc tgcaacgagt aggcgccgag gtcgaagtaa 1802761 ccgccgatac cgacgccgca atgaccgctg acggactggt ggtgccgggc gtcggtgctt 1802821 tcgcggcgtg catggcgggc ctgcgcaaga tcagcggaga gcgaatcatc gccgagcggg 1802881 tggccgccgg ccgcccggtg ctgggggtct gtgtcggtat gcagattctg tttgcttgcg 1802941 gggtcgaatt cggtgtgcag acgccaggct gcgggcactg gccgggggcg gtcattcgac 1803001 ttgaggcccc ggtgattccg cacatgggct ggaatgtcgt ggattccgct gcgggcagcg 1803061 cgctgttcaa agggttggac gtcgacgccc ggttttattt cgtgcattcc tatgccgcgc 1803121 agcgatggga aggctcaccc gacgcgctgc tgacctgggc cacatatcgg gcgccgttcc 1803181 tcgctgcggt ggaggacggc gcattggccg ccacccagtt tcatccggag aagagtggcg 1803241 atgccggtgc agccgtactg agcagctggg ttgatggact ttaaaggata ctggtgatgc 1803301 cgctgatact tttgcccgcc gtcgacgtgg tcgagggtcg tgccgtgcgc ctcgttcaag 1803361 ggaaggccgg cagccaaacc gagtacggct cagcggtgga tgccgcgttg ggctggcaac 1803421 gcgatggcgc cgagtggatc catttggtgg acctggatgc tgcgttcggc cgcggttcca 1803481 accacgaact gcttgccgag gttgtcggca agctcgacgt acaggttgag ctatccggcg 1803541 gtattcgaga cgacgagtcg ctggccgcgg cgctggccac cggatgcgct cgggtcaatg 1803601 tgggcactgc tgccctggaa aacccgcagt ggtgtgcccg ggtgattggc gagcacggcg 1803661 accaggtcgc cgtcggcttg gacgtccaga tcatcgacgg cgagcatcgg ttgcgcggac 1803721 gcggctggga aaccgacggc ggcgacctgt gggacgtgct agaacgccta gacagtgaag 1803781 gatgttcgcg gttcgtcgtg accgatatca ccaaggacgg caccctgggc ggccccaatc 1803841 tggacctgct ggccggtgtt gccgaccgca ccgacgcccc ggtgatcgcg tccggaggtg 1803901 tgtccagcct cgatgacctg cgcgccattg cgactctcac gcaccgcggc gtcgaggggg 1803961 ccatcgtcgg caaggccctc tacgcccgtc ggttcacctt gccgcaagcg ttggccgcgg 1804021 ttcgggacta gatcggcgat gcacttggat tcgttggttg ccccgctggt tgaacaggcg 1804081 tcggcgatcc tggatgccgc aacggcgctc tttctcgtcg gtcatcgcgc cgattcagcg 1804141 gtccgcaaga agggtaacga cttcgccacc gaagtcgatc tagcgatcga gcggcaggtt 1804201 gtcgcagcgc tggtggcggc caccggcatc gaggtgcacg gcgaggagtt cggcggcccg 1804261 gcagtcgact cgcggtgggt gtgggtactg gaccccatcg acggcacaat caactacgcc 1804321 gccggatcgc cgttggctgc gatcctgttg ggcctgctgc acgacggagt tccggtggcc 1804381 ggcttgacct ggatgccatt caccgaccca cgctataccg ccgtggcggg tggtccgctg 1804441 atcaagaacg gtgtaccgca gccgccgctg gctgacgccg aactggccaa cgtgctcgtc 1804501 ggcgtcggca cattcagcgc cgactcacgg ggccagttcc cggggcgata tcgactggcg 1804561 gtgctggaaa agctcagccg agtgtcatcg cggctgcgca tgcacggatc caccggcatc 1804621 gatctcgtct tcgtcgctga cgggatactc ggtggtgcaa taagtttcgg aggtcacgtt 1804681 tgggaccatg ccgctggggt ggcgttggta cgagccgccg gtggcgtggt caccgacctg 1804741 gctgggcaac cgtggacccc tgcatcgcgt tctgccttgg ccgggccacc gcgcgtgcat 1804801 gcccagatcc tcgagattct tggcagcata ggggaaccag aggactactg agatgtatgc 1804861 cgaccgtgac cttccggggg ctgggggcct cgcggtacgc gtgatcccgt gtctggatgt 1804921 cgacgatggg cgggtggtca agggagtcaa cttcgagaac ctccgcgacg ccggtgatcc 1804981 cgtggaactc gccgccgtct atgacgcgga gggcgcggac gagttgacct ttctcgacgt 1805041 gaccgcgtcg tcgtccggaa gagccaccat gctggaggtg gtgcgccgca ccgccgagca 1805101 ggtgttcatc ccgctgacgg tgggcggtgg ggtacgcacc gtcgccgacg tcgattcgct 1805161 gctacgggct ggggctgaca aagtcgccgt caacacggcc gccatcgctt gcccggactt 1805221 gctggcggac atggcgaggc agttcggctc gcagtgcatc gtgttgtccg tcgacgcgcg 1805281 cacagttccg gtgggatcag ccccgacacc gtcgggttgg gaggtcacca ctcacggcgg 1805341 tcgtcgtggc accggtatgg acgccgtgca gtgggcggcc cgtggcgccg acctcggtgt 1805401 gggggagatc ctgctcaact cgatggacgc cgacggcacc aaagccggat tcgacctggc 1805461 tttgctgcgt gcggtccgtg ccgcggtcac ggtgccggta atcgccagcg ggggcgccgg 1805521 tgctgtggag cacttcgcgc cagcggttgc cgcgggggcc gatgcagtgt tggcggccag 1805581 cgtctttcac ttccgggagc tgacgatcgg tcaggtgaag gcggccctgg ccgcggaagg 1805641 aatcaccgtg cgatgacact cgacccaaag atcgcggcgc ggttgaagcg taatgccgac 1805701 ggactggtta ccgccgtcgt ccaggagcgg ggcagcggtg acgtgctgat ggttgcctgg 1805761 atgaacgacg aggccttggc ccgtaccctg caaacccgtg aggccactta ctattcgcga 1805821 tcccgtgccg aacaatgggt caagggcgcg acgtccggcc acacccagca cgttcactcg 1805881 gtgcgcctgg attgtgacgg cgacgccgta ttgttgacgg ttgaccaggt cggcggtgcc 1805941 tgccataccg gcgatcacag ttgcttcgat gccgcggtgt tgttagaacc cgacgactaa 1806001 cccgccgcgg aaagactggg gctagcggct cgcggcgcaa cagattgcag tggtcgcccg 1806061 cgaggcaaga gtgcccatcg acacgccgcc gagcgagcgc ggacatacca ccttgggatc 1806121 catgcagatg tcaagggggg ttgcccgtcc gggcgatggc gtcgatgaga atggcggtcg 1806181 atgctgaaac gagtgccctg gaccgttgtg ctgccttcgc tggcctttgt cgcgctggta 1806241 ttgacctggg gaaagcagat cggcccggtg gtgggcttgc tagcggcggt gctgttagcc 1806301 ggtgctgtcc tggccgcggt caaccatgcc gaggtggtgg cggcccgggt gggtgagcca 1806361 ttcggttcgc tggtgctcgc ggtcgcggtg acgaccatcg aggtggcgct gatcgttgcg 1806421 ctcatggtgt ccggcgggga cgatgcggcg acgctcgccc gcgacaccgt gttcgccgcg 1806481 gtgatgatca ccaccaacgg gatcgccggg ttgtccctgc tgctgggttc gctgcgctat 1806541 ggcgtgacgt tgttcaaccc ccacggcagc ggcgccgcgc tggccacggt caccacactg 1806601 gcgacgctga gcctggtgct gcccacgttc accaccagtc agtcgggccc cgagctatcg 1806661 cccggccagc tcatcttcgc cggcgccgcg tcgctgggac tctacgtgtt gttcctgttc 1806721 acccagactg tccggcatcg agacttcttc ctaccggtgg cgcaaaaggg cgcggtcgag 1806781 gatgacagcc acgccgatcc accgagcacc cgcgcggcgc tgctgagcct tggattgctg 1806841 ctcgtcgctt tggttgcggt ggtgggtctg gccaaggtgg aatcgccggt catcgaggag 1806901 gtcgtctcgg cggccgggtt tccgcaatcc ttcgtcggcg tggtcatcgc cacactggtg 1806961 ctgttgccgg agacacttgc ggcggcccgc gcggcccggc aaggccgcct gcagaccagc 1807021 ctcaatctgg cgtacggttc cgcgatggcg agtattggac tcaccatccc gaccatcgcc 1807081 cttgcttccc tgtggctcag tggcccgctg caacttggcc tcggtgccat tcagttggtg 1807141 ctgctggtgc tcacggttgt ggtcagcgtg ctgaccgtgg ttcccggtcg ggccacccgt 1807201 ctgcagggcg aggtgcatct ggtgttgctg gctgcttacc tgtttcttgc cgtcgtcccg 1807261 tgatgaatcc gtgcgcaagc gatggttttc gccgccgcta tccagatctg attgcccgca 1807321 gcgtcgctaa cgctttgtcg gcgtgggcgt ccatgctgaa ttcgctggag atcacgtcga 1807381 gcaccttacg gtcggtgtcg atgacaaagg tcgtgcgttt gaccggcatc aacttgccca 1807441 acagaccgcg cttgaccccg aattgggcgg cgaccgtgcc ttgggcgtcc gaaagcagcg 1807501 ggtagtcgaa acgccgcacc tcggcgaatt tggcctgctt tcgaacggga tcggtgctga 1807561 tgccgacccg gctggccctg acctcggcga attctttggc caagtcgcgg aagtggcagg 1807621 cttctttggt gcagccaggc gtcatcgccg ccggatagaa gaacaggacc acgggtccgt 1807681 cggatagcag gacgctaagc ctgcgaggag tcccggtctg atcgggcagt tcgaagtcgg 1807741 ctaccgtgtc accggttttc atagtcgtca ggctacaacc gattgcccga ctccttgcgc 1807801 gccgcttcgc ggctgggggt gcccccatgc gcgccgtttg cgcggcgtgc atcgtcgtcg 1807861 ggctacgccc gggccgatcg gcgtatctgg gaagatggtt cggtgcacgc cgacctcgca 1807921 gccaccacct cgcgtgagga tttccgcctc ctggcggccg agcaccgggt ggttccggtg 1807981 actcgcaagg tcttggccga cagcgagacg ccgctgtcgg cctaccgcaa gctcgccgcc 1808041 aatcgcccgg gtacgttcct gctggagtcg gccgagaacg gccggtcgtg gtcgcgatgg 1808101 tcgtttatcg gtgcgggggc gccaacggcg ttgaccgtgc gtgaggggca agcggtatgg 1808161 ctgggtgccg tgcccaagga cgctcccact ggcggagacc cgctgcgggc gctgcaggtg 1808221 accttggagc tgctggctac ggcggatcgt cagtccgagc cgggtcttcc gccgctgtcg 1808281 ggtggcatgg tcggtttctt cgcctatgac atggtgcgac ggctggaacg attgccggaa 1808341 cgggccgtcg atgacctctg cctgccggac atgctgctgt tgctggccac cgatgtggcg 1808401 gcggtcgatc accacgaggg caccatcacg ttgatcgcca acgccgtgaa ctggaacggc 1808461 accgacgagc gggtcgactg ggcctacgac gacgcggtcg ctcggctgga cgtgatgacc 1808521 gcagcgctcg gccaaccact accgtcaacc gtggccacct tcagccgacc cgagccgcgc 1808581 caccgtgcgc aacgcaccgt cgaagaatat ggtgcgatcg tcgaatactt ggtggatcag 1808641 attgcagccg gtgaagcgtt ccaggtggtg ccctcgcagc gcttcgagat ggacaccgat 1808701 gtcgatccca tcgacgtgta ccgaattctg cgggtaacca acccaagtcc ctacatgtat 1808761 ctactgcagg tgccgaatag tgatggtgca gtggactttt cgattgttgg atccagtccg 1808821 gaggcgctgg taacggtcca cgaaggctgg gcgacgacgc atccgatcgc cggaacccgg 1808881 tggcgcggaa ggacagacga cgaggacgtg cttctggaaa aagagctgct ggcggacgac 1808941 aaagaacgtg ccgagcatct gatgctggtc gacctcggcc gaaacgacct gggtcgggtc 1809001 tgcacgccgg gcactgttcg ggtcgaggat tacagccaca tcgagcggta cagccacgtg 1809061 atgcacctgg tgtccacggt gaccgggaag ctcggcgaag ggcgcaccgc gctggacgcg 1809121 gtgaccgcct gctttccggc cggcacgctg tcgggcgcgc cgaaggtgcg ggcgatggag 1809181 ctgatcgaag aggtggagaa gacacgccgc ggcctttacg gcggtgtcgt cggttacctt 1809241 gacttcgccg gcaacgctga cttcgccatc gccatccgca ccgcgctgat gcgtaacggc 1809301 acggcttatg tccaggcagg cggtggtgtg gtggccgact ccaacggatc ctacgaatac 1809361 aacgaggcga ggaacaaggc tcgggctgtg ctcaacgcga tcgctgccgc cgagacgctg 1809421 gccgctccgg gcgcgaaccg cagtggctgc taatgccggc agtgttcggc ccaaccgccg 1809481 ggccaggccg atgatcggca tcgcccagtt gctgttggtg gttgccgccg gggcgctgtg 1809541 gatggccgca cggctgccct gggtggtcat cgggtcattc gacgagctgg ggccgccgaa 1809601 ggaggtgacg ctgaccggtg cgtcgtggtc gaccgctttg ctgccgttag cgctgctgat 1809661 gctggccgcg gcggtggcgg cgctcgcggt gcgcggctgg ccgctgcggg cgctggcagt 1809721 gttgctggcc gcggccagct tcgcggtcgg ctacctcggc atcagtctgt gggtggtccc 1809781 ggatgtcgcg gcccgcggag ccgatcttgc ccatgtccca gtggtgacgc tggtcggaag 1809841 cgcccggcac tattggggcg cggtggcggc ggtgttggcg gcagtgtgtg ctttgctcgc 1809901 tgccgtcttc ttgatgagtt cggcggcgat tcgcgggtcg gctggcgagg acatggcgag 1809961 atatgcggcg ccccgcgccc gccggtcgat tgcccggcgc cagcactcga atgcggccgg 1810021 ccgggcggct ccgcaagacg acgggccgga tatggggccg cggatgtcgg agcgaatgat 1810081 ttgggaagct cttgacgagg gccgtgaccc gaccgatcgg gagcaggagt ctgacaccga 1810141 ggggcggtga cggaccgcgc gctgacggtc gctacccttc atggacgtcg tcgaaattga 1810201 cgagcgcgtg tgggtgacag tgggaaggga acggcaggca tgagtccggc aaccgtgctc 1810261 gactccatcc tcgagggagt ccgggccgac gttgccgcgc gtgaagcctc ggtgagcctg 1810321 tcggagatca aggctgccgc cgctgcggcg ccgccgccgc tcgacgtgat ggccgcccta 1810381 cgcgagcccg gcatcggcgt catcgctgag gtcaagcgcg ctagtccttc ggcaggcgca 1810441 ttggcgacca tcgccgaccc ggcaaagctg gcccaggcct accaggatgg cggtgcccgg 1810501 atcgtcagcg tggtgactga gcagcggcgt tttcagggat cgctcgacga cctcgacgcg 1810561 gtgcgggcct cggtttcgat tccggtgctg cgcaaggact ttgtggtgca gccgtaccag 1810621 attcatgagg cgcgtgcgca cggcgccgac atgttgttgc tcatcgtcgc cgcattggag 1810681 cagtcggtgt tggtgtcgat gttggaccgc accgaatcgt tgggtatgac agcactcgtc 1810741 gaggtccata ccgagcagga agccgaccga gcgctgaagg ccggggccaa ggtgattggc 1810801 gttaacgccc gcgacctcat gacgctggac gtggaccggg attgcttcgc gcgaatagct 1810861 cctggtttgc cgagcagtgt gatcaggatt gctgaatccg gcgtgcgtgg caccgctgac 1810921 ctgctggcgt acgccggcgc gggcgctgac gcggtgttgg taggcgaagg tctggtcacc 1810981 agcggcgacc cacgtgccgc ggttgccgat ctggttaccg cgggcaccca tccgtcctgt 1811041 ccgaaaccgg ctcgctagcc gtcgatgagc cgcttgcatc ttgagcctcg gtgatgacag 1811101 atctatccac cccggatctt ccgcgcatga gtgctgccat cgccgaaccg accagtcacg 1811161 atcctgattc cggcggccat ttcggcggcc ccagtggttg gggtggccgc tacgttcccg 1811221 aggcgctgat ggcggtgatc gaagaggtca ccgccgccta ccaaaaggag cgcgtcagcc 1811281 aggactttct ggacgaccta gacaggctgc aggcgaacta tgcgggccgg ccttcgccgc 1811341 tttacgaggc gacccggttg agccagcacg ctgggtcggc gcgaatcttt ctgaagcgag 1811401 aagacctgaa ccatactggt tctcacaaga tcaacaacgt gctcgggcag gcactgctgg 1811461 cgcgcaggat gggcaagacc cgggtgatcg ccgagaccgg tgccggccag cacggggtcg 1811521 ccacggccac cgcatgcgca ttgctcggcc tggactgtgt catctacatg gggggcatcg 1811581 acaccgcccg tcaggcgcta aacgtggccc ggatgcgatt gctgggtgcc gaagtcgtcg 1811641 cggttcagac gggctcgaaa acgctcaaag acgccatcaa tgaggcgttc cgggattggg 1811701 ttgccaacgc cgacaacacc tactactgct ttggtactgc ggccggaccg catccgtttc 1811761 caaccatggt gcgcgatttc cagcgaatca tcggcatgga ggcacgtgtg cagatccagg 1811821 gtcaggccgg tcggctgcct gacgccgtcg tcgcgtgcgt tggtggcggg tccaatgcca 1811881 ttggtatttt tcatgcgttt ctcgatgacc caggcgtacg gctggtcgga ttcgaggcag 1811941 ccggcgacgg cgttgagacc ggccggcatg ccgcgacatt caccgctggt tcgcccgggg 1812001 catttcacgg atcgttctcg tacttgctgc aagacgagga cggtcagacc attgaatccc 1812061 attcaatttc cgcgggtctg gattatccgg gggtgggccc ggaacatgcg tggctcaagg 1812121 aggccgggcg tgtcgattat cggccgatca ccgactccga ggcgatggac gcgtttggcc 1812181 tgctgtgtcg catggaaggc atcatcccgg ctattgaatc cgcgcacgcg gtggccggcg 1812241 ccctcaagct aggtgttgag ttgggaaggg gcgcggtgat tgtggtgaac ctgtcgggac 1812301 gtggcgacaa agatgtcgag acggccgcga aatggtttgg cttgctgggc aacgactgat 1812361 ggtggcggtg gaacagagcg aagcaagtag gctcgggccg gttttcgatt cctgccgtgc 1812421 aaacaaccgc gcggcattga ttggttactt gccgaccggg tacccggacg tgccagcgtc 1812481 ggtggccgcg atgacagcgc tagttgaatc cggttgcgac attatcgaag tcggggttcc 1812541 gtattcggac ccgggcatgg acggccccac catcgccagg gcaaccgagg cggcgctccg 1812601 tggcggggtg cgagtccggg atacgttagc cgcggtcgag gccatcagta tcgccggcgg 1812661 gcgtgcggta gtgatgacct actggaatcc ggtgctgcgc tatggggttg atgcattcgc 1812721 gcgggatctg gcggcggccg gaggactcgg cctgatcact cctgacctca ttcccgacga 1812781 ggcgcaacag tggctggcgg catccgaaga gcatcggttg gatcgcattt tcttggtcgc 1812841 gccgtcctcg acaccggagc ggttggcggc caccgtcgag gcttcacgcg ggttcgtcta 1812901 cgcggcgtcg acgatggggg tgaccggggc gcgggatgcg gtgtcgcagg cggcacccga 1812961 actggtgggc cgggtgaagg cggtgtctga cataccggtg ggcgtcggtc tgggtgtgcg 1813021 gtcgcgcgct caagccgcgc agatcgccca atacgccgac ggtgtcatcg ttggttccgc 1813081 attggtgacg gcgctaaccg aggggttgcc tagattgcgg gcactgaccg gagagctcgc 1813141 tgccggggta cgactaggga tgtccgcatg atgcggatgt tgcccagcta tatccccagc 1813201 ccaccgcgcg gggtttggta cctgggcccg ctacccgtcc gcgcctacgc agtttgcgtt 1813261 atcaccggca tcattgtcgc actgctgatc ggggatcgcc ggttgacagc ccgcggcggc 1813321 gagcgcggca tgacctacga catcgccttg tgggccgtgc ctttcggcct gattggcggc 1813381 aggctctatc acctggctac cgactggcgg acatatttcg gtgacggtgg tgccgggctg 1813441 gccgcggcac tgcgaatctg ggatgggggc ctgggcatct ggggtgcggt aacccttggt 1813501 gtcatgggcg cgtggattgg ctgccggcgt tgtggaatcc cgctgcccgt cttgcttgat 1813561 gcggtggcgc ctggtgtcgt gttggcgcag gctatcggtc ggctcggaaa ctacttcaat 1813621 caagagctct acggccggga aaccactatg ccgtggggtt tggagatctt ctaccgccgg 1813681 gacccctccg gattcgacgt cccgaattcg ctggacggcg tctcgacggg tcaggtggcg 1813741 ttcgtcgtgc agccaacgtt cctctacgaa ttgatctgga atgttttggt attcgtcgca 1813801 ttgatctaca ttgaccgccg gttcatcatc ggccacgggc gactgtttgg gttctatgtc 1813861 gctttctact gcgccgggcg attctgtgtt gagctgctgc gtgacgatcc cgccacgctt 1813921 attgccggca tccggatcaa ttcgttcacg tccaccttcg tgtttatcgg ggccgtggtg 1813981 tacatcatct tggcgccgaa ggggcgcgag gctcctgggg ccctgcgtgg cagcgagtat 1814041 gttgttgatg aggcgctgga acgtgaaccg gctgaactcg ccgccgctgc tgtggcctcc 1814101 gctgcgagcg ctgtggggcc ggttggcccg ggggaaccga accaacccga cgatgtggcg 1814161 gaagcggtga aagccgaagt cgccgaggtc accgatgaag tggccgcgga atccgttgtc 1814221 caagtagcag accgggatgg tgagtcaacc cccgctgtcg aggagacctc cgaagccgat 1814281 atcgagcggg aacaaccggg cgacctcgcg ggccaggcgc cagccgcgca ccaggtcgac 1814341 gccgaagctg catcggccgc gcccgaggag ccggcagcgt tggcttcgga ggcacacgac 1814401 gaaaccgagc ccgaggtgcc cgagaaggcg gcgcccatcc ccgatccggc caagccggat 1814461 gaattggcgg tcgccggacc tggggacgac cctgctgagc cggacggcat tcgacggcaa 1814521 gacgatttca gctcgagacg ccgccgttgg tggcggcttc gacggcgtcg acaatgacga 1814581 cccacgacgg cactgcctgg tcgccggtgc tggactcaat agaccgccga tcgggcggcc 1814641 gttgccgcag ccggaacgat gcgccgacga agttcccggt cacaaaatgg ccaccggctg 1814701 gaacggtaat cagccgaacc ccgacgcgct tacgagccag accactaagc ccagtaggct 1814761 agcaagcccg gcaggttcca tattttttcg caacccggac gcgcacgcga cgccggggcg 1814821 ctgcctccga tgcccgaccg ccacatgaat atctgtccgt accgctcttt cgtcacgtcc 1814881 gcaacactgg ccttcgccgt cggcgatggt cgctgtgccc agctaagcgc gacaactcgg 1814941 tttctgcagg tcaacgcccg cctccaatcc cgcacagccg cgaccaactc gggaacaaaa 1815001 ccgccggtca ggcagctgtc gctgagagcc gggcacatcg ggtgtcgccc ggtgcagtga 1815061 cacatgtgag agttgtggcc gtgcgatgtg cccgaccctc ggtgcgcacc aatttgagcc 1815121 aactcaggaa atgaatctct gagcggaggt gcaccggttg cccgcctcac aacgacatgc 1815181 tgaggcgcac acggtcgctc gcagccgggc acaacgaaca ctcctgctct gccgcgccga 1815241 tgttgggaac gcatgggcct acggccggca cgggtcgtgc gcccggctcg atctggcatg 1815301 ctgaaaggcg tgaccgatcc cctgcagcac ggtgccttcg agccgggctg gcaatccgca 1815361 ccacccggat atccaccgcc ttatccgcaa tatccggggc ctggctctta ctttgacccg 1815421 ttcgcgccat atggtcgcca tccggtcacc ggccaaccat tttccgacaa atcgaagact 1815481 gttgccggcc tgttgcagtt gcttggactg ttcggcatcg ccgggatcgg gcgaatctac 1815541 ctgggccata ccggcctggg catcgcgcag ctgctggtgg gctgggtgac gtgcggtttg 1815601 ggcgccgtca tctggggcgt cattgacgcc ctgctgatat tgaccgacaa agtcggcgac 1815661 ccttggggtc gtcccttgcg cgatggaagc tagcgggcgt caacgtcgct acgccgcggc 1815721 cggttcggtc gtgctattgg ccggcgcgct tggctacatc ggacttgtcg acccgcacaa 1815781 ctcgaattcg ctatatccac cgtgcctatt caagttgctt acgggctgga actgccccgc 1815841 gtgcgggggt ctgcggatga tccacgatct gctacacggt gagctggcgg ccagcatcaa 1815901 cgacaatgtc tttctgcttg tcggcgtccc agtgctggcc agttgggtcc tgctgcgccg 1815961 ccgccacggc gacttggcgc tcccgatacc ggtgatgatt gctgtggcgg tcgcggtgat 1816021 cgcgtggacg gtgctgcgca acctgccagg cttcccgtta gtgccgacga tcagcggata 1816081 gccgcgccta cccgcggtct ggttggctgg gctgcccgcg gtggtgttga ccggtgtgcc 1816141 gacccggcgg tgccggccct accgccgtcg cgactatgct gagtcgtcgt gacgagacgc 1816201 gggaaaatcg tctgcactct cgggccggcc acccagcggg acgacctggt cagagcgctg 1816261 gtcgaggccg gaatggacgt cgcccgaatg aacttcagcc acggcgacta cgacgatcac 1816321 aaggtcgcct atgagcgggt ccgggtagcc tccgacgcca ccgggcgcgc ggtcggcgtg 1816381 ctcgccgacc tgcagggccc gaagatcagg ttgggacgct tcgcctccgg ggccacccac 1816441 tgggccgaag gcgaaaccgt ccggatcacc gtgggcgcct gcgagggcag ccacgatcgg 1816501 gtgtccacca cctacaagcg gctagcccag gacgcggtgg ccggtgaccg ggtgctggtc 1816561 gacgacggca aagtcgcatt ggtggtcgac gccgtcgagg gcgacgacgt ggtctgcacc 1816621 gtcgtcgaag gcggcccggt cagcgacaac aagggcatct cgttgcccgg aatgaacgtg 1816681 accgcgccgg ccctgtcgga gaaggacatc gaggatctca cgttcgcgct gaacctcggc 1816741 gtcgacatgg tggcgctttc cttcgtccgc tccccggccg atgtcgaact ggtccacgag 1816801 gtgatggatc ggatcgggcg acgggtgccg gtgatcgcca agctggagaa gccggaagcc 1816861 atcgacaatc tcgaagcgat cgtgctggcg ttcgacgccg tcatggtcgc tcggggcgac 1816921 ctaggtgttg agctgccgct cgaagaggtc ccgctggtac agaagcgagc catccagatg 1816981 gcccgggaga acgccaagcc ggtcattgtg gcgacccaga tgctcgactc gatgatcgag 1817041 aactcgcggc cgacccgagc tgaggcctcc gacgtcgcca acgcggtgct cgatggcgcc 1817101 gacgcgctga tgctgtccgg ggaaacctcg gtagggaagt acccccttgc tgcggtccgg 1817161 acaatgtcgc gcatcatctg cgcggtcgag gagaactcca cggccgcacc gccgttgaca 1817221 cacattcccc ggaccaagcg tggggtcatc tcgtatgcgg cccgtgacat cggcgaacga 1817281 ctcgacgcca aggccttggt ggccttcact cagtccggtg ataccgtgcg gcgactggcc 1817341 cgcctgcata ccccgctgcc gctgctggcc ttcaccgcgt ggcccgaggt gcgcagccaa 1817401 ctggcgatga cctggggcac cgagacgttc atcgtgccga agatgcagtc caccgatggc 1817461 atgatccgcc aggtcgacaa atcgctgctc gaactcgccc gctacaagcg tggtgacttg 1817521 gtggtcatcg tcgcgggtgc gccgccaggc acagtgggtt cgaccaacct gatccacgtg 1817581 caccggatcg gggaagatga cgtctagccg ggtcgtgccg gacggtaaac ccatgtccga 1817641 cttcgatgaa ctactggcgg tattggacct caacgccgtc gcaagcgacc tgttcaccgg 1817701 atcccacccc agcaaaaacc cgctccggac atttggtggc cagctcatgg cgcagtcatt 1817761 cgtcgcgagc agccgaacgc taacccgcca ccacctaccg cccagcgcat tctcggtgca 1817821 cttcatcaac ggcggtgaca cggccaagga catcgagttc caggtgatac gactgcgcga 1817881 tgagcggcgc ttcgccaacc ggcgcgtcga tgcggtacag gacggcacgt tgctgtcctc 1817941 ggcgatggtg tcttacatgg ccggtggtcg cgggcacgag catgcgctgg atccgccgca 1818001 ggtggccgag cctcataccc ggccgccgat cggtgagctg ttgcgcggtt acgaggagac 1818061 cgtcccgcat tttgtcaacg cgctgcaacc gatcgaatgg cgctacgcca acgacccggc 1818121 ctggataatg cgggacaagg gcgatcggct tgcctacaac cgggtctggg tcaaggcact 1818181 aggggagatg cccgacgacc cggtgctgca cacggcgaca ctgttgtact cctcggacac 1818241 caccgtgctg gactcggtca ttaccaccca tggtctgtcc tggggcttcg atcgcatctt 1818301 tgcggcctct gccaaccact cggtgtggtt tcaccggcag gtcaacttcg atgattgggt 1818361 gctctactcg acgtcgtcac cggtggccgc cgattcacgt gggttgggtt cggggcactt 1818421 ttttgatcgc tcggggaagc tcatcgcaac tgtggtgcag gaaggtgtgt tgaagtattt 1818481 tcccgccacc cctgacagtg cggcaggacg ctcgtaggat tccgggtcag cacggctgtg 1818541 atcaggcgta acgttcctgg tagccagatg accgatggtg gcagcggccg gcgagccgct 1818601 gaattgccag cgagcgaacc cggaggtgac tgtgaagctg ccgtcggccg atgtggtacc 1818661 gaggctccgt ggtcgccagc gtgtagtcgt gcacgtcgat tcccgcacgg cccgctgtgt 1818721 cggcgcgctg gcgctggtgt gcgcggcctg ctggctgatc gcgctgctcg ccggcgacta 1818781 ccggcacgcc cagtgggcgg tcgccggccg gttgggctgg tcgctgacgg tcctggctgc 1818841 ggtggcattc attgctcgcg gcatcttcct gggccgcccg gtcacggcca tgcatgcgac 1818901 cgcggccggc ctatttttgc tcgccggact ggctgcccac gtgttggtcg cagatctgct 1818961 cggtgagatt ctgatagccg gttcgggatg ggcactgatg tggccgacgt cggcgcatcc 1819021 gcgacccgaa gatctgcccc gcgtgtgggc gttgatcaat gccacccgcg cggactcgct 1819081 tgctccgttt gccatgcagg cgggcaagag ccatcacttc agcgcggccg gcaccgcggc 1819141 tctggcgtat cggacccgta tcggctatgc ggtggtcagc ggcgacccga tcggcgacga 1819201 ggcgcaattc ccccagctgg tcgccgactt cgcggccatg tgtcacatgc acggctggcg 1819261 aatcgtggtc gtgggctgca gcgaacgacg gctcggcctg tggagcgacc ccatggtggt 1819321 cggacaatcg ttgcggccca taccgattgg ccgggatgtc gtcatcgacg tgtctaactt 1819381 tgagatgacc gggcgtaggt ttcgcaacct gcgtcaggcg gtgaaacgca cccacaattt 1819441 cggcgtcacg accgagatcg tcgctgaaca gcaactcgac gaccagcggc aggcggagct 1819501 ggccgaggtg ctggcggcgt cacctagcgg cgcccgcacc gatcgcggct tttgcatgaa 1819561 cctggacggc gtgctggagg gtcgataccc cggaatacaa ctgatcatcg cgcgagacgc 1819621 atcgggtcgg gtgcagggtt tccaccggta cgcgaccgcc ggcggcggca gcgacatgtc 1819681 tctggatgta ccgtggcggc gccgcggggc cccgaacggg atcgatgagc ggctcagcgc 1819741 tgacatgatt gcggccgcca aagatgctgg ggtacaacgg ttgtcactgg cattcgccgc 1819801 gttccccgac cttttcggcg ccaaccagct cggccgcctg cagcgtgtct gccgtgcgtt 1819861 gatccatatc ctcgatccgt tgatcgctct cgagtcgtta taccgatacc tgcgcaagtt 1819921 ccacgcgctg gatgagcggc gttacgtgct gatatcgatg actcaggtct ttgcgctggc 1819981 gttggtgttg ttgtcgctgg agttcgtccc gcggcggcga catctctgat ccgtcgctat 1820041 ggacagctcg gcgcattgaa tgtcgttggg caggtggtgg gtggctacca ccacggtccg 1820101 catagcgctc atgatcccgg agttcggggc cagcagatcg cgcagaaggt cggcgttggc 1820161 ggcgtcgagg tgttcgacag gttcgtcgag caacacgatc cgagccgggg aaagcaccgc 1820221 ccgggcgagc agcaaccttc tgcgctgacc cgccgagacc gcttgcgcgc caccgatcaa 1820281 caccgtcgac aacccctcgg gcaggccggc gagccagccg cacaggccga cccgatccag 1820341 ggcctcgatc agttcgtcat cggggcagtc tcctcgggcg gtcagcaagt tgtcccgaac 1820401 ggtggtagca aagatatgcg catcttcagc gaaaaagctg acagcgctgc gtaattcatc 1820461 ctcatcgaag tcgctcaggt tagttccgtc cagcaacacc cggccgtgca ccggcggcag 1820521 caagccggcc agcgtcatca acagcgtcgt cttgccggcg ccgctcgcgc cggtgacggc 1820581 cagccgggca cccggcggta ggtcaatcgt cacccggatc gactgcgcct cttggtgacc 1820641 gcaacacacg tcggccgcta gcaccccggt acctaccggc agtcgcgccg acaccgtgga 1820701 ttcggtctcg cggacccggt ttgacccagt caggtcgagc agacgagccg ccgcgatgcg 1820761 cgaccgtgtc aactggacgg cggcggcggg tagtgcaacg gtcgcctcga atgcggacag 1820821 cggcaacaac atcaggatgg ccagtgttgt gggcgcgacc gtgggggcca tgccgatccc 1820881 ggccaccacg gcgcccagca ggctggcccc gatcgccgcg gtcggcatgg cctcggcgat 1820941 cgcccccgtt cgtgcggcgg cgtcgagcgc atcggcccag gcatgttggc gccgttgtga 1821001 gtcggcgatg acgttgcgta gggcaccggc gacacgaagc tcgggggcat gctcaagggc 1821061 gatcatcgcc gacgtgtcgc gcatgccccg atgttggcgg gcgatcgctt cctgcgctgc 1821121 ggcggttctg ccggcaagcc agggcgcaac aacgccggca accaaaaggc agaccgccag 1821181 taccacggcg gctggcaccg aaacggccgc gacgaccgcg gtcgcggcta ctgccagcac 1821241 cgctgcgacg gctatcggca ccagagcacg caccagcatg ttggccagtt cgtcgacgtc 1821301 cgcgccgacg cgtgctgcca ggtccccgct gtgcagcccg acggcggccg ccgccggtcc 1821361 gtgggccagc cggtgataga taagggtgcg ggcccggccg gcggcccgca acgcggtgtc 1821421 gtgggtggcc agtcgctcgc agtagtgcag cacgccgcgc gaaatcgcga acgcccgcac 1821481 cgccacgacc gccaccgaca ggtccaggac gggcggcatc tgccaggccc gagtgatcag 1821541 ccaggccgac accccggcca gggccagcgc gctgcccagc gacagcacgc ccagcgcgac 1821601 ggccgccaag atccggggca accggggacc caacagccca gacgcggcca gcaggtcccg 1821661 ctggcggcga ctcacagcac tcggtcggtt catcgtcgga aaccatccga gttcacttcg 1821721 acgacccggt caccggccgc ggcgacctgc tggcgatggg cgacgaccag caccgtcgca 1821781 cccgcgcggg cacgctcgac aatggcgccc aacacgtgtt gttcggtgcg ggcgtccagg 1821841 tgcgcggtgg gctcgtcgag cagcagcacc gcagccggtg atccgagcgc gcgggccagg 1821901 cccagccgtt gccgctgccc cagggataac ccgacaccac cgcgccccag cacggtatcc 1821961 agcccgcggg gcaactcgtc tagtacagcg tcgaatccgg ctgctgcgca ggcacgctcg 1822021 agatcatcca cagggcccag cagaaccagg ttgtggcgga cggttcctgg gaccagcacc 1822081 ggccgctgcg gcagccacga cagttgccgc caccaggcag ccggtgccag gttggtgacg 1822141 tcgactccgg cgaccgtgat tcgtcctgac gacggtgcgg tgagcccggc gatcgcttgc 1822201 agcgtagtgc tcttgccggc gccgtttcgg ccggtcagca ccgtcacccg accgggttcg 1822261 atgtctgcgg tgagatcata cggtgcgcgg ccgtcgcggc ctctgacact gagtctctcc 1822321 aggcgaatca ccccgccgcg cgcggtgacc gttcgtcggc cgggtgttgg tgagggtgac 1822381 tcgccgagga gggcgaatgc cttgtcggcc gcggttctgc cgtcagctgc ggcatgaaac 1822441 tggaccccaa cgcgacgcag cggccagtac acctccggcg ccaatagcag caccgtcaaa 1822501 ccggccgtca ggctcatctc cccgaagacc agccgtagcc cgatgcccac cgcgaccagg 1822561 gccacgccca gcgtggccag caattcgagc accagggccg acaagaacgc gatccgcagc 1822621 gtcgccatcg ccgaccgccg gtggtcagca gacagttccg cgatgcgttg ttccgggccg 1822681 gaagcacggc ccagcgcccg cagggtgggg atgccggcaa tcaggtctaa caaccgggcc 1822741 tggacggcgg tcatggccgc cagcgcggcc gccgaggggt tagtggtagc cagcccgatc 1822801 agcaccatga agatcggtat caggggcagt gtgatcacca caatggccat tgacttcaag 1822861 tcatagagcc cgatcacggc gacggtggcc ggggtcagga tcgcggccag cagcaacgtg 1822921 ggcaaatagc cggtgaagta gggccgcaag ccgtccaggc cccgggtaat cagcaccgcg 1822981 gcggcgtctc gctgcgcagc cagttggctg ggtcggcggg cggttaccgc ggtcagcacc 1823041 tgaccggaca ggtcggcgat cactgcgctg gcgccgcgct gggccaggcg cgcttgtagc 1823101 cactgaatcg acgcacgcaa cccccacagc accaacagga ttgacagtgg ccctagccaa 1823161 cgacgcaggc cagccatccc agggttggcg gggtcgatga cgccggcgac gatgcttgcc 1823221 aacacgatcg ccgagccgat ggcgcagccg gagatcccga ccccgcaggc caccgtgctg 1823281 agtagatagc ggcgcagcgc cgccgatgcc tgccacagcc gcggatccag gggcgcccgg 1823341 gttccccggg ccttggtgct cagggcgcgc gcctcgccag accggtgggt ggaggtatcc 1823401 gttcagctga gatccgttgc cggaaaaccc aatacgtcca tgtctggtac gccaccgtca 1823461 gtggagcgaa gaacgcggtc acccacgtca tgatcttgag ggtgtacggg gtcgacgacg 1823521 cgttatggat cgttaggctc cactgcgggt tcagggttga gggcaccagg ttcgggtaca 1823581 gcgcgccgaa cagcagcacc accacagccg ccacgactat caacgtgcac atgaacgccc 1823641 agccgtcgga cacccgccgc cacactaaga ccgtcgccgc cgcctgcgcg caccccgcaa 1823701 ctgccagcac cagccacgtc cagtctttgc cgtatgccag ttgcgtccaa agtccaaagc 1823761 ccgcaaccag tcccgccaca ggaagcgaaa gccatacggc gaatcggtag gcatcgtcgc 1823821 ggatcggccc ggaggttttc aaagcgatga acaccgcgcc gtagagcgag aacagtccgg 1823881 cggtcgccag accgcccagc agggtgtagg cgttgagcac gtcgggaatc gacagggcaa 1823941 catgaccgtt cgcgtctacc gggagtccgc ggaccagaat ggcgaacgcc acaccccaca 1824001 acagggcagg cagccaggat cccgccgcga tcccgaagtc tgccccggtc cgccatttcg 1824061 ggtcgtcgat cttgccgcgc cattcgatgg cgacggcgcg caggatcata ccgaacagga 1824121 tcgccagcag cggcagatac agcgcggaga acacggtcgc gtaccagccg ggaaacgcgg 1824181 cgaatatggc cgcgccggcg gtgatcagcc agacttcgtt gccgtcccag accggtccga 1824241 tggtgttgag tgccgtgcgc cggtgggtct ccggatcgcc cataccgaca tgagcgaacg 1824301 gcgccatcag catgcccacg ccgaagtcga acccttctag gatgaagaaa ccgaggaaca 1824361 gcgctgcgat gacaccgaac cacaattctt ggagtaccac cggctgctcc tttccggggt 1824421 cagttggcct cagtaagcaa acgacaatgg tgctacctcg tcgtcgcggg gtgccccgtg 1824481 cgcagccggt tccgcgtcgt gttccagggg gccttcgacg atgtaacgct tgagcagcca 1824541 gcaccagatg accgcaagta ccgcgtagac caaggtgaac atcagcaaag acgtggcgac 1824601 cacggtggcg gagtgatccg agacgcctgc tttgacggtg agtcgaacca gctgatcacc 1824661 ggtcgggtta gggacgacga cccagggctg gcgccccatc tcggtgaaca cccatccggc 1824721 gctgttggcc aggaacgggg cgggcatggt tagcagcgcc agccaggaga accagcgttg 1824781 attggggatc tggccgccac gggtgagcca gagcgcaatc agtgcgaaca gcaccgggat 1824841 cgccatcaac ccgatcatca tgcgaaatga ccagtaggtg acgaagaggt tgggccggta 1824901 gtcgtttggt ccgaagcgct gctggtattc ctgctgcaga tcgcggatac cctgcaacgt 1824961 cacaccgctg atccggccct cggcgaggaa cggcaacaca tagggcactt cgatgacacg 1825021 ggtgaggctg tcgcagttgt tttgccggcc gaccgtcagg acagagaagt ttggatctgt 1825081 ctgggtatcg cacaacgatt cggccgacgc catcttcatc ggctgctgct ggaacatcag 1825141 cttgccttgg tggtcgccgg tgaacaacaa cccggccgtg gcggccaacg caacccaaca 1825201 ccccaggatg gtcgcgggac gatacatggc ttgggtatct gagtcggcgt gcgtggtgct 1825261 cgaacggacc agccaccagg cgctcaccgc ggcgacgaag gtcccggcgg tcagcagcgc 1825321 accgctgaca gtgtgggtaa acgccgcctg tgcggtgttg ttggtcagca gcacgacgat 1825381 gctgctcaac tcggcacgcc cggtggtcgg gttgtagtgc gcgccgaccg gatgctgcat 1825441 gaaggagttt gccgcgatga tgaagaacgc ggacacgttg accgcgattg cgacgatcca 1825501 gatgcaggcc agatgcacca gccggggcag cctgttccag ccgaagatcc acaacccgat 1825561 gaaggtggat tcgaagaaga aggccgccag gccctccatg gccagcgggg cgccgaagac 1825621 atcgccgacg aatcgggagt actcgctcca gttcatgccg aactgaaatt cctgcacgat 1825681 tccggtcgcc acgccgatgg caaagttgat caggaacaat ttgccgaaga atttggtgag 1825741 gcgataccag gcggggttat cggtgacgac ccacagcgtt tgcatgaccg cgatcagcgg 1825801 ggccaggccg atggtcagcg gtacgaaaat gaagtgatag acggtggtga taccgaactg 1825861 ccaccgcgaa atgtcgacga cattcatctg tcatctccgg agatactacg gggccgactg 1825921 atttggctac gacgaagtgt agtaggcacg agtgggcccg cgctactggc aatcgtgggt 1825981 gcaccgcgat tctgcggtca gccgagcgtc tgcgaagcct tgcggatggc gaacgacgac 1826041 gcgatctcgc atgtgccgat caccacgagc cagatgccga cgaccaacgc cagtatccag 1826101 atggactcga acggcgatgc catcaccaca atgccggcga tgaggctgat cacgccgacg 1826161 aagatggacc atccccgtcc cggcagcatc ggatcactaa tcgccgaaac cgtggtggcg 1826221 acgccgcgga agatgaaccc gatgccgatc cagatggcca gcaacagaac cgcgtcaccg 1826281 aaatggcgaa aggccagcac agccaggatg agtgaggcgg caccgctgat gaacaacagg 1826341 atccggccgc ccgccgaaac atgcaggctg aacgcgaacg caacctgagc gacaccggta 1826401 atcaggaggt agacaccgaa cgccatggca gcaacgagaa tggatattcc tggccaggcc 1826461 agcaccagga cgcccaggat cagcgacaga attcccgatg ccagagtgga cttccagaga 1826521 tgcggcaaca accttgggag agggctcacg acagggcttg gttccatggg cgcagtgtga 1826581 cacatgtagc ggccccggga tagcgcttgg cggtcagacc cctgccgtgc ggggttcggc 1826641 cccgcgcacc tcgccgggat ctgcggctac cttgcggccg atgaggtacc acgtgcgcat 1826701 tacgcctttc cccttgacgt ttatgtggcc gcgctcgcgc aacacgaagt cgtccttgag 1826761 acgctcgtaa acctcgtctg gcacctgaat ttgccccacc gaatcggtgg attccatccg 1826821 cgacgcgaca ttgaccgcgt cgccccacac gtcgtagaag aaccgtcgag aacccaccac 1826881 acccgccacc accgggccgg tggccaggcc cacccgcagc ggcaccgggt tgccgcgtgg 1826941 atccttcaat tgcgctgcga cattggtcat gtcgagcgca aagtccgcca gtgcttgcgt 1827001 atggtcaggc cggggccgcg gaacgccgct gacaaccatg taggagtccc cgctgacctt 1827061 gattttctcc agcccgtgct ggtcgaccag ctcgtcgaaa gcgctgtaga ggcggtccag 1827121 gaaccggacc aggtccgccg gcgcggtgct actggcgcgt tcggtgaacc cgacgatgtc 1827181 ggcgaacagc accgaggcct cgtcgtattt atcggcgatg atgtttcgct cgggttcttt 1827241 aagccgctcg gcgatgctgg ccggcaacat gttggccagc agtgcttcgg agcggtcgtg 1827301 ctccgcctcc atgaccgcct ccgcgcgcgc agtatcacgc agcgcgaacc acaccgttgc 1827361 gaccgctacc ccgcaggcgg agacggtcgt gaggacgaaa cttaccgaca tggcccaggg 1827421 cggctgaagc ccagtatcgg gcgggaccag gaactccagg gcaatcacca gaccggcggc 1827481 gaccgccgct aggcccaccg ctaacgcggt gtgttcgatg ccgaccagca acaccaccaa 1827541 cgcggcggct accaagaaga agaactgggc acccgcgtcg gtgcccacat cccagccgat 1827601 ggcgaagatc gccacatagg cggtgccgat gaacgtaagc ggtgccacca atcccccgaa 1827661 gcgatgtagc aggggcacga tcgcgaaagt aaccgcggtg aagacgttga tcagggcgat 1827721 gtaccagccc ccggccccgg tcgctagttg cattagcgcg aagctcccgg ttaccacgac 1827781 agcgagccag gcggtgatgg taagcacgcg ctgccgccgc gcgacgcttt cggcgtagtg 1827841 ctgcgtggga gcgcgggcct gagtgcgcac ggccgtgaca cagtctgggc gtcgtgtcga 1827901 gccatccgct gctatcggtg gggcgccgca ttttcttgcc gccacgaact aaagcctaat 1827961 cggtgagtta gcgtttaccg actctgtcgg cgctttccgg gtgcgttcgc ttggtgccct 1828021 cggtgggatt cgaacccaca ctggacgggt tttgagtccg tttcctctgc cagttgggat 1828081 acgagggctt gatccggtct cctactctag aggagccacg tcccgactca ccgccccccg 1828141 aggttcccga tcgcgcccgc tcacgacaca atgtccgtca tgaccggccc caccaccgac 1828201 gccgatgccg ctgtcccacg tcgggtcttg atcgcggaag atgaagcgct catccgcatg 1828261 gacctggccg agatgttgcg agaggaggga tatgaaattg tcggcgaggc cggcgacggc 1828321 caggaagccg tcgagctggc cgagctgcac aagcccgacc tggtgatcat ggacgtgaag 1828381 atgccgcgcc gggacgggat cgacgccgca tccgaaatcg ccagcaaacg tattgccccg 1828441 atcgtggtgc tgaccgcgtt cagccagcgt gatctggtcg aacgtgcgcg tgatgccggg 1828501 gcgatggcat acctggtaaa gcctttcagc atcagcgacc tgattccagc gattgaattg 1828561 gcggtcagcc ggttcaggga gatcaccgcg ttggaaggcg aggtggcgac gctatctgaa 1828621 cggttggaaa cccgcaagct ggtggaacga gcaaaaggcc tgctgcagac caaacatggg 1828681 atgaccgagc cggacgcttt caagtggatt caacgtgccg ccatggatcg gcgcaccacc 1828741 atgaagcggg tggccgaagt cgtgctggaa accctcggaa cacccaaaga cacctgaggg 1828801 cgagcagacg caaaatcgcc catttcgtac ccgaaatggg cgattttgcg tctgctcgcg 1828861 gaacctagcg cgcgacgatc accgacgagc cgtgcccgaa caggccctgg ttggcggtga 1828921 cgccgacctt ggcgtccgcc acctgccggc cggtggcctg accgcgcagc tgccaggtca 1828981 gctcgcagac ctgcgcgatc gcctgggcgg gaatcgcctc accgaaacac gccagcccgc 1829041 ccgacgggtt gaccgggacc ctgccgccga gggtggtcgc gccgctgcgc agcagcgcct 1829101 cggcctcacc cttggggcag agccccaggt gttcgtacca gtcgagttcc aacgcggtgg 1829161 acaggtcgta gacctcggcc aggcttaagt cttctggacc aataccggcc tccgcgtagg 1829221 cagcgtcgag gatctgatcc ttgaacaccc gctccggagc cggcaccgcg gcggtggaat 1829281 ccgttgcgat atccggcaat tcgggcaaat gttgcgggta tttcggggta acggtgctga 1829341 tcgcgcgcac cgacggcacg cccgccaccg agccaaggtg cttctcggtg aaagacttgc 1829401 tggccacgat gagtgcggcc gcaccgtcgg aggtggcgca gatgtcaagc agccgaagcg 1829461 gatccgagac caccgggcta gccagcacgt cgtcgatcga gttctctttg cggtagcggg 1829521 cgttcgggtt gtctaggccg tgccgggagt tcttgacctt cacttgagcg aagtcctcga 1829581 ctgtggcgcc gtacaggtcc atgcgccggc gcgccagcag cgcgaagtac accgtgttcg 1829641 tcgccccgat cagatggaag cgctgccagt cggggtcgcc cttgcgctcg ccgcccacgg 1829701 gcgcgaaaaa gcccttcggt gtggtgtcgg cgccgatcac cagcgccacg tcacagaaac 1829761 cggccaagat ctgcgcgcga gcactctgca gcgcttggga accgctggca cacgcggcgt 1829821 agctggagct gaccggcaca ccggtccagc cgagcttctg ggcgaacgtg gcaccggcga 1829881 cgaagcccgg atacccgttg cggatggtgt ccgctccggc gaccagctgc acgtgccgcc 1829941 agtccacgcc ggcgtcccgc aacgcggcgc gggcggcgac cacgccatac tcggtgaagt 1830001 cattacccca tttcccccac gggtgcatac cggcacccag gatgtaaacg ggttccggcg 1830061 cgctcatcct catcggcgcc gctcctcagc atcgctgcgc tctgcatcgt cgccggcgcg 1830121 cgatgggatc cgccacgcgt agacgatgcg ctgcacaccg tcgtcgtcgg cgaacagcgg 1830181 catggtcgtc agctccatct ccatgccgac cttcagatcg gcggccagcg tgccatcgac 1830241 cactttgccc agcacgatca gtccctcgtc ggccagttcc accgcggcca cggcgaacgg 1830301 ctcaaagggg tcgggtgccg ggtacggcgg tggcggggcg taccggtttt cggtgtagct 1830361 ccaaagcttt ccgcgggtcg acagtccgac cgactctagt gtgtcgctgc cgcaagccgg 1830421 attcggacaa ttgtccgccc ggggtgggaa gacgtacgtg ccgcactggg gacacttgcc 1830481 gccgagcaga tgcgggttgc cggccttatc ggtggtgaac catccatcga ttgccggttc 1830541 ttcacgggtg acctctggca ccggtccagc ctaccgagcc cgggcgtaaa actgaaacgt 1830601 gttgcagttc tgctggcacc tgcgcccgca ttccacgtca gcgtcggtgc ataaagtgtg 1830661 agccgtggtg actactgcca gtgcccccag cgaggatcga gccaagccga cgctgatgtt 1830721 gctggatggc aattcgctgg cgtttcgggc gttctacgca ctgcccgcgg agaacttcaa 1830781 gacccgcggc gggctgacca ccaacgccgt ctacggcttc accgccatgc tgatcaacct 1830841 gctgcgcgat gaagccccga cgcacatcgc ggcggctttc gacgtgtccc ggcagacctt 1830901 ccgcttgcaa cgctacccgg agtacaaggc caaccgatcg tcgacccccg acgagttcgc 1830961 tggccagatc gacatcacca aagaagtgct gggcgcactc ggcatcaccg tgctctccga 1831021 gccggggttc gaggccgacg acctcatcgc cacgctggcc acccaggccg agaacgaggg 1831081 ctaccgggtg ctggtggtca ccggggatcg tgacgcactg caactggtca gtgacgatgt 1831141 gacggtgctc tacccccgca agggcgtcag cgaacttacg cgcttcacac cggaggccgt 1831201 cgtcgaaaag tacgggctca cccctaggca gtacccggac ttcgccgcgc tgcgcggcga 1831261 ccccagcgat aacctgcccg gcatacccgg ggtgggggag aagaccgccg ccaaatggat 1831321 cgccgagtac ggctcgctgc ggtcactggt ggacaacgtt gacgccgtgc gcggcaaggt 1831381 gggcgatgcg ctgcgggcga acctggccag cgtggtgcgc aaccgtgagc tcaccgacct 1831441 ggttcgcgac gtgccgctgg cccagacccc ggacacgctg cggctgcagc cctgggatcg 1831501 cgaccacatt caccggctct tcgacgacct ggagtttcgg gtgttgcgcg accggttgtt 1831561 cgacacgttg gccgcggccg ggggacccga ggtcgacgag gggttcgacg tgcgcggcgg 1831621 cgcgttggcg cccggcacgg ttaggcaatg gttggccgag cacgccggcg acgggcgccg 1831681 agcgggcctg acggtggtgg gtacccatct gccgcacggt ggggacgcta ccgctatggc 1831741 cgtcgccgcc gccgacggcg aaggcgctta cctcgatacc gcgacgctga cgcccgacga 1831801 cgacgccgcg ttggcggcct ggctagcgga tccagctaaa cccaaagcct tgcatgaggc 1831861 aaaggcggcc gttcatgacc tggcgggtcg tggttggacc ttggagggcg tcacctccga 1831921 caccgcactg gcggcctacc tggtgcggcc ggggcagcgc agcttcaccc tcgacgacct 1831981 ctcgctgcgc tatctgcgtc gcgagctgcg tgcggaaaca ccgcagcagc aacaactttc 1832041 actgctcgat gacgacgata cggacgccga gaccattcaa acgacgatcc tgcgggcgcg 1832101 ggcagtcatc gacctggccg acgcgctgga cgccgagtta gcgcgtatcg actccaccgc 1832161 gctgctgggg gagatggagc tgccggtcca gcgggtgctg gcgaagatgg aaagtgccgg 1832221 tatcgccgtc gacctgccca tgttgaccga gctgcaaagc cagtttggcg accagatccg 1832281 cgacgccgcc gaggccgcct acggcgtgat cggcaagcaa atcaacctgg gctcacccaa 1832341 gcagctgcag gtcgtgctgt tcgacgaact gggcatgccg aagaccaaac gcaccaagac 1832401 cggctacacc acggatgccg acgcgctgca gtcgttgttc gacaagaccg ggcatccgtt 1832461 tctgcaacat ctgctcgccc accgcgacgt cacccggctc aaggtcaccg tcgacgggtt 1832521 gctccaagcg gtggccgccg acggccgcat ccacaccacg ttcaaccaga cgatcgccgc 1832581 gaccggccgg ctctcctcga ccgaacccaa cctgcagaac atcccgatcc gcaccgacgc 1832641 gggccggcgg atccgggacg cgttcgtggt cggggacggc tacgccgagt tgatgacggc 1832701 cgactacagc cagatcgaga tgcggatcat ggcgcacctg tccggggacg agggcctcat 1832761 cgaggcgttc aacaccgggg aggacctgca ttcgttcgtc gcgtcccggg cgttcggcgt 1832821 gcccatcgac gaggtcaccg gcgagctgcg gcgccgggtc aaggcgatgt cctacgggct 1832881 ggcttacggg ttgagcgcct acggcctgtc gcagcagttg aaaatctcca ccgaggaagc 1832941 caacgagcag atggacgcgt atttcgcccg attcggcggg gtgcgcgact acctgcgcgc 1833001 cgtagtcgag cgggcccgca aggacggcta cacctcgacg gtgctgggcc gtcgccgcta 1833061 cctgcccgag ctggacagca gcaaccgtca agtgcgggag gccgccgagc gggcggcgct 1833121 gaacgcgccg atccagggca gcgcggccga catcatcaag gtggccatga tccaggtcga 1833181 caaggcgctc aacgaggcac agctggcgtc gcgcatgctg ctgcaggtcc acgacgagct 1833241 gctgttcgaa atcgcccccg gtgaacgcga gcgggtcgag gccctggtgc gcgacaagat 1833301 gggcggcgct tacccgctcg acgtcccgct ggaggtgtcg gtgggctacg gccgcagctg 1833361 ggacgcggcg gcgcactgag tgccgagcgt gcatctgggg cgggaattcg gcgatttttc 1833421 cgccctgagt tcacgctcgg cgcaatcggg accgagtttg tccagcgtgt acccgtcgag 1833481 tagcctcgtc aggtaccaat ctgtccctac gacccaaccc tgtccggagc aacccaacaa 1833541 tatgccgagt cccaccgtca cctcgccgca agtagccgtc aacgacatag gctctagcga 1833601 ggactttctc gccgcaatag acaaaacgat caagtacttc aacgatggcg acatcgtcga 1833661 aggcaccatc gtcaaagtgg accgggacga ggtgctcctc gacatcggct acaagaccga 1833721 aggcgtgatc cccgcccgcg aactgtccat caagcacgac gtcgacccca acgaggtcgt 1833781 ttccgtcggt gacgaggtcg aagccctggt gctcaccaag gaggacaaag agggccggct 1833841 catcctctcc aagaaacgcg cgcagtacga gcgtgcctgg ggcaccatcg aggcgctcaa 1833901 ggagaaggac gaggccgtca agggcacggt catcgaggtc gtcaagggtg gcctgatcct 1833961 cgacatcggg ctgcgcggtt tcctgcccgc ctcgctggtg gagatgcgcc gggtgcgcga 1834021 cctgcagccc tacatcggca aggagatcga ggccaagatc atcgagctgg acaagaaccg 1834081 caacaacgtg gtgctgtccc gtcgcgcctg gctggagcag acccagtccg aggtgcgcag 1834141 cgagttcctg aataacttgc aaaaaggcac catccgaaag ggtgtcgtgt cctcgatcgt 1834201 caacttcggc gcgttcgtcg atctcggcgg tgtggacggt ctggtgcatg tctccgagct 1834261 atcgtggaag cacatcgacc acccgtccga ggtggtccag gttggtgacg aggtcaccgt 1834321 cgaggtgctc gacgtcgaca tggaccgtga gcgggtttcg ttgtcactca aggcgactca 1834381 ggaagacccg tggcggcact tcgcccgcac tcacgcgatc gggcagatcg tgccgggcaa 1834441 ggtcaccaag ttggttccgt tcggtgcatt cgtccgcgtc gaggagggta tcgagggcct 1834501 ggtgcacatc tccgagctgg ccgagcgtca cgtcgaggtg cccgatcagg tggttgccgt 1834561 cggcgacgac gcgatggtca aggtcatcga catcgacctg gagcgccgtc ggatctcgtt 1834621 gtcgctcaag caagccaatg aggactacac cgaggagttc gacccggcga agtacggcat 1834681 ggccgacagt tacgacgagc agggcaacta catcttcccc gagggcttcg atgccgaaac 1834741 caacgaatgg cttgagggat tcgaaaagca gcgcgccgaa tgggaagctc ggtacgccga 1834801 ggccgagcgc cggcacaaga tgcacaccgc gcagatggag aagttcgccg ccgccgaggc 1834861 ggctggacgc ggcgcggacg atcagtcgtc ggccagtagc gcaccgtcgg aaaagaccgc 1834921 gggtggatca ctggccagcg acgcccagct ggcggccctg cgggaaaaac tcgccggcag 1834981 cgcttgatct tgcagctgat cgcgttcacg taatgctgcg catcgggctg accggcggca 1835041 ttggcgccgg gaagtcgttg ctgtccacga cgttctcgca atgcggcgga atcgttgtcg 1835101 acggcgatgt gttggcgcgt gaagtggtcc agccgggcac cgaggggctg gcctcgctgg 1835161 tcgacgcgtt cggtcgcgac atcctgcttg cagacggagc gctggaccgg caggcgttgg 1835221 cggccaaggc gtttcgagat gacgagtcgc gcggtgtgct caacggaatc gtgcacccgc 1835281 tggtcgcccg gcgccgatcc gagatcatcg cggcggtttc gggggacgcg gttgtggtcg 1835341 aagatattcc actgctggtg gaatccggga tggcgccatt gtttccgctg gtggtggtgg 1835401 tgcacgccga cgtcgagcta cgggtgcgac ggctggtcga gcaacgcggc atggccgaag 1835461 ccgacgcccg ggctaggatc gctgcgcagg ccagcgacca gcagcgtcgt gccgtcgccg 1835521 acgtctggct ggacaactcg ggcagcccag aggatttggt gcggcgggcc cgcgacgtct 1835581 ggaacacgcg cgtccagccc ttcgcgcaca acctggccca acgtcagatt gcgcgcgcgc 1835641 cggctaggtt ggtgccggcg gatccaagct ggccggatca ggcgcggcgc atcgtcaacc 1835701 ggctaaagat cgcgtgcggg cataaggcct tgcgagttga ccacattggg tcaaccgccg 1835761 tgtcgggctt ccccgatttt ctagccaagg atgtcatcga catccaggtc accgtcgaat 1835821 cacttgacgt ggccgacgag ctggccgagc ccttgctggc cgccggctac ccacgcctcg 1835881 agcacatcac ccaggacacc gaaaagaccg acgctcgcag caccgtcggc cgctacgacc 1835941 acaccgacag tgccgctctg tggcacaagc gcgtgcacgc ctcggcggat cccggtcggc 1836001 cgaccaacgt gcacctgcgg gtgcacggct ggcccaacca acagttcgcc ctgctgttcg 1836061 tcgactggct ggcggccaat cccggcgcga gagaagacta tttgacggtc aagtgtgacg 1836121 ccgacaggcg cgccgacggt gagctcgcgc gctacgtcac cgccaaggag ccgtggttcc 1836181 tggatgccta ccagcgggca tgggagtggg cggatgcggt gcactggcgt ccctgaacga 1836241 gggcctgccg cactgggcga tgacgccatc gatcgagcag gccgcgcagc tgtcatcccc 1836301 ggccagcctc atctgaggct tccagctcgg gggcgccggc gcccggggcg gtgggcgctt 1836361 ctgctacccg agccggcacg cgcgcttcat gagccgctgc gccaggtcag ctccatcccc 1836421 ttggtggcca gccagcgggt gaggtcatag ccgttgcggg ccaggccctc gacggcgtcg 1836481 actgcgtgcc gcaccgcctg ctcggcgacc gtcggggtca acaggccgtg gcggacggcg 1836541 tcgagtagtt cgtcgacgtc ggcgagctcg gccccgccgc cggtgcggac ttcgatgtcg 1836601 aggtagtggt cttcggaacg ccatacggaa gggcccggtg tgtattcgcc gacgtccaga 1836661 tagtagtcgt gatcgcgttt gtggctggga ttgaagtgaa agacagtggc gcgtaggccc 1836721 aacgacggca acagccacga ctcgaggtag tggaattggg cacggcccgg ggtgggccgg 1836781 gccaggtaga gcccccacgg atgcaccgtg tactcatcga ccgcccgcac tatgcccttc 1836841 ggatcggtat tggtgtgggc gatcaggtcg aacgtctcgt gcttgggtgg gtgaatggct 1836901 caccctatct ggtcgcacga ggcgtgccgg tacatcgaca cgccggtact ggtggcattc 1836961 tgcgcacgct cgccgcacgg tgtgtccgcg ggtggctcta ggctggttgg cgtggctttc 1837021 gctaccgagc atccggtggt cgcgcattcg gagtatcgcg cggtcgagga gattgtgcgc 1837081 gccggcggtc acttcgaggt ggtcagtccg catgctccgg ccggcgacca gccggccgca 1837141 atcgacgagc tggagcggcg gatcaacgcg ggggagcgtg acgtggtgtt gctcggcgcc 1837201 accggcaccg ggaagtcggc gaccaccgcg tggctgatcg aacgcctgca gcggcccacc 1837261 ctggtgatgg cgcccaacaa gacgttggcc gcccagctgg cgaacgaact gcgagagatg 1837321 ttgccgcaca acgccgtcga gtacttcgtc tcgtactacg actactacca gccggaggcg 1837381 tatatcgcgc agaccgacac ttatatcgaa aaggatagct ccatcaacga cgacgtggag 1837441 cggctgcggc actccgcgac ctcggcgctg ctgtcgcgtc gtgacgtggt ggtggtggct 1837501 tcggtgtcct gcatctacgg cctgggcaca ccgcagtcct acctggaccg ctccgtcgag 1837561 ctgaaggtgg gcgaggaagt gccgcgcgat gggctgctgc ggctgctggt cgacgtgcaa 1837621 tacacccgaa acgacatgtc ctttactcgc ggctcgtttc gggtgcgcgg cgacaccgtc 1837681 gagatcatcc cctcctacga agagctggcg gttcgcatcg agttcttcgg cgacgagatc 1837741 gaggcgctgt actatctgca cccgctgacc ggcgaggtta tccgccaggt cgactcgctg 1837801 cggatctttc ccgctaccca ttacgtcgcc ggtccggagc ggatggcgca tgccgtctcg 1837861 gccatcgagg aagaactcgc cgagcgactc gccgagcttg agagccaggg caagctgctg 1837921 gaggcgcagc ggctgcggat gcgcaccaac tacgacatcg aaatgatgcg gcaggtcggg 1837981 ttctgctcgg gcatcgagaa ctactcccgc cacatcgacg gtagggggcc cggcacgccg 1838041 cccgcgaccc tgctcgacta tttccccgag gatttcctgc tcgttatcga cgagtcacat 1838101 gtcaccgtgc cgcagatcgg cggcatgtac gagggcgaca tctcccgcaa gcgcaacctg 1838161 gtggagtacg gtttccggct gccgtcggcg tgcgacaacc gtccgctgac ctgggaggag 1838221 ttcgctgacc ggatcgggca gacggtgtat ctgtctgcca ccccggggcc ctacgagctc 1838281 agccagaccg gcggcgagtt cgtcgagcag gtgatccggc cgaccggtct ggtggacccg 1838341 aaagtggtag tcaagccgac caaagggcag atcgacgacc tgatcggcga gatccgcaca 1838401 cgggcagacg ccgaccagcg ggtgctggtg acgacgctga ccaagaagat ggccgaagac 1838461 ctcaccgact acctgctgga gatgggcatt cgggtgcgct acctgcattc ggaggtcgac 1838521 acgttgcgcc gggtcgagtt gttgcgccag ctgcgtctgg gtgactacga cgtgctggtc 1838581 ggcatcaacc tgctccgcga gggcctagac ctgcccgagg tgtcgctggt ggcgatcctc 1838641 gacgccgaca aagaaggatt cctgcggtca agccgcagcc tgatccagac catcggacgc 1838701 gccgctcgca acgtgtccgg cgaggtgcac atgtacgccg acaaaatcac cgactcgatg 1838761 agggaagcca tcgacgagac cgaacgccgg cgggccaagc agatcgccta caacgaggcc 1838821 aacggaatcg acccacagcc gctgcgcaaa aagatcgccg acatcctcga tcaggtctat 1838881 cgggaggccg acgacaccgc cgtcgtcgag gtcggcggat ccgggcgcaa cgcatcccgc 1838941 ggccggcggg ctcagggtga gcccggccgg gcggtcagcg ccggcgtgtt cgagggccgc 1839001 gacacctccg ccatgccgcg cgctgagctg gccgacctaa tcaaagacct caccgcacag 1839061 atgatggcgg ccgcgcgcga cctgcagttc gagctggcgg cccggttccg cgacgagatc 1839121 gccgacctca agcgggagct gcgggggatg gacgcggccg gcctgaagtg accgaaacag 1839181 cgagcgagac cggcagctgg cgtgagctac tgagcaggta tctgggcacc tccatagtgc 1839241 tggccggtgg cgtcgcgctt tacgccacca acgagtttct gacaatcagc ctgctgccga 1839301 gcacaatcgc cgacatcggg ggtagccggc tgtacgcctg ggtgacaacc ctgtatctgg 1839361 tcgggtcggt ggtggcggcg accaccgtca atacgatgtt gctgcgcgtc ggggcgcgct 1839421 cgtcgtatct gatggggttg gccgtcttcg gtctggccag cctggtatgt gcggcggcgc 1839481 cgagcatgca gattctggtg gccgggcgta ccttgcaagg aatagccggt gggctgctgg 1839541 ccggcctagg ctacgcgctg atcaactcga ccttgcccaa gtcgctgtgg acccgtggct 1839601 cagcactggt gtcggcgatg tggggggtcg cgacgctgat cggaccggcg accggaggcc 1839661 ttttcgcgca gctcgggctg tggcgatggg cgttcggcgt gatgacgttg ctgaccgcgt 1839721 tgatggccat gttggtgccg gtcgcgctcg gtgccggggg ggtcggcccg ggcggcgaga 1839781 cgccggtggg cagcacacac aaggtgccgg tgtggtcgct attgctgatg ggggccgccg 1839841 cactggcgat cagcgtcgcc gcgcttccga actacctcgt ccagacggcc gggctgctag 1839901 ccgccgccgc gctgctggtt gcggtgtttg tggtagtcga ctggcggata cacgcagcgg 1839961 tgttgccgcc cagcgtattt ggctccggac cgttgaaatg gatttacctg accatgtcgg 1840021 tgcagatgat tgcggcaatg gtcgatacct acgtgccgct gttcggtcag cgactgggac 1840081 acctgacccc ggtggcagcc gggttcttgg gtgccgcgct ggcggtgggc tggacggtcg 1840141 gtgaggtcgc cagcgcctcg ttgaacagtg cacgagttat cgggcatgtc gtggcagccg 1840201 caccgctggt gatggcgtcg gggttggcgc taggcgccgt cacccagcgc gccgatgcgc 1840261 cggtggggat catcgcgctg tgggcgctgg cgctgctgat catcgggacc ggcatcggga 1840321 tcgcctggcc gcatctaacg gtgcgcgcta tggattctgt cgccgacccg gccgagagca 1840381 gcgcggcggc cgcggcgatc aatgtcgtac agctgatctc cggtgctttc ggcgccgggc 1840441 tggccggtgt ggtggtcaac actgccaagg gcggcgaagt ggcggcggct cgtgggctat 1840501 acatggcatt tacggtgctg gccgccgctg gtgtcatcgc ctcctaccag gccacgcacc 1840561 gcgaccggcg cttaccgcgt tgacttgacc acctgcgagt agtggaactg ccagcgctcg 1840621 acgatgcgga agccgaggta gctcgggaac cggtatacgg gcgtgcgccc gaagcctgtt 1840681 cccggtgaca acatttcgcc gacctgatga tcgggcaacg acttgtcacg attggctatc 1840741 gtccacagcg tggggcactt gtcgatcttg gccgtcgtaa gccacacagc gacatggcca 1840801 tcccacaaag tgccgacctt ggggccgtag gtgccgcgct cgacgtcaat cagcgaccgg 1840861 aacgccgccg gccgggtggc cagcagggcg cggatgggcc cgggtcgcca acccgcggtg 1840921 ttgtccacca gcaggcaatc cccgggcttg gcatgggcgc tgatgacatc tgccacctgg 1840981 ctgtaatccc agccctcttt cgcgtacggc ccccgctgtg tgaagaagta gttcggaaac 1841041 gctgcggcgg caaggagaaa cacgaccccg gcgatgagcc acggcttgcg ggcgatggtg 1841101 acgacgcaaa ccgccaggat gacggccgcg gcgggggcgg tgaggatcag gtagcgcggg 1841161 tagtagatcg gttcgacggt cgccgagtag atgaggacga cggcggtggg cacgacgatc 1841221 caggctgcgc tgacgagcac gagccggtgg gtatcgccac cgggtccacg agctccggcc 1841281 agatgcgccg cgatgccggc agcgacgatg aggcccgcga ggatggcgaa cggaacactg 1841341 tgatcgaaat actggcggtg tatgacgtcg agaatgatgt ttctgttcaa ccctgcgatc 1841401 cacccgacct gccaaacctg gccgtgggcg aacagtatga acggtgtcat ggccccgagc 1841461 gcggctgccg tgacgaccgt ccaccagatc acgggagatt tgcgtgattt cccggacgcc 1841521 agcagcggca ccatcgtcgc ataggccggt accaacaggg ccaggttgat actgaccaag 1841581 atcgacagca tcaaaaccag cgcgtagagc agccaccgcc gctgggtgtt gcaccgcacc 1841641 gcggccacga gtaatacggt cagccagacg gcggctgcta ccgacagcgc ggaggagcgt 1841701 gcttcgattc cggcccacgt caccctgggc agaatcgcga acacggctcc cgcacacacc 1841761 gccgtggtgc gtcccgaaaa ctgtttggca aaaaccacca cgccggcggc ggccgctcca 1841821 atggccaggc agctgggaag ccgcgaccat aattcggtgg gcggaaatat ggcgaaccag 1841881 ccatgcatca acaggtagta caggccgtgc acggcgtcga tatggcccag cagactccat 1841941 agctctggca atgtccggct ggctgaagcc gagatcgttg ccccctcgtc gaaccacaac 1842001 gatggcctgc ttgcccaggc gccgctgatg accgcggcca gcactgcaat cgccagcggg 1842061 tcgagcagcc ggccgcgcat ccgcgccacc aactcgtcga cgtgtgctgc cgcgggctgc 1842121 tccagagtgg aggcggacat gatgcgggtc accttagggt ccgcgcgatg atcctggtca 1842181 ccggcggttc ggcgactggg cagcccggcg tgcggcggtg cgccgggacg actcgcatgc 1842241 atttcccaaa aagccttgca cagcaacatt ttccgcgatc agcgtgcgta ttgaatcgtc 1842301 gtgtcatcgc caccattgtc ggctggttca ccgcgatcgg gcaaatgagg gttgcgccac 1842361 gccgttgcgg tgtgattaat ctgacctatc tatatccggc aacgcgatac tgtctggggt 1842421 tggcgtagca accgacacct gggagggtaa atgagcgcct ataagaccgt ggtggtagga 1842481 accgacggtt cggactcgtc gatgcgagcg gtagatcgcg ctgcccagat cgccggcgca 1842541 gacgccaagt tgatcatcgc ctcggcatac ctacctcagc acgaggacgc tcgcgccgcc 1842601 gacattctga aggacgaaag ctacaaggtg acgggcaccg ccccgatcta cgagatcttg 1842661 cacgacgcca aggaacgagc gcacaacgcc ggtgcgaaaa acgtcgagga acggccgatc 1842721 gtcggcgccc cggtcgacgc gttggtgaac ctggccgatg aggagaaggc ggacctgctg 1842781 gtcgtcggca atgtcggtct gagcacgatc gcgggtcggc tgctcggatc ggtaccggcc 1842841 aatgtgtcac gccgggccaa ggtcgacgtg ctgatcgtgc acaccaccta gcggccgtta 1842901 ccagccgcgc gcacgccatt cgctgaggct ggggcgttcg gcacccagct ccgtgtcgtc 1842961 accgtggccg gggtagatga cggtggagtc ggcgtacacg tcgaaaaccc gggtggtgac 1843021 gtcgtcgagc agttgggtga agtcggcagg ttgccaggtt ttgccgacac cgccggggaa 1843081 caagcagtcg ccggtgaaga gctgtgtgac gcctccggtc accggcccgc cgagggccag 1843141 cgcgatcgat ccgggtgtgt gtccgcgcaa gtggatgacg tcgaatgtca gctcgccgat 1843201 gcgcacgctg tcgccgtggg tgagcaaccg gtccggtttg accggcagcg ggtcggcgtc 1843261 gatcggatgg gccgcggtcg gcgccccggt ggccgcggcc accgcttgca gcgcctgcca 1843321 gtggtcgaag tgctggtgac tggtaacgat cagggccagc ttcggcgcgt accgccggac 1843381 caggtcgatg aggacctccg cgtcattggc ggcgtcgatc agcagggttt ctccggtcgc 1843441 tgaacacgtc accaggtagg cgttgttgtc catcgggccc accgatgcct tgaggatcgt 1843501 ggcgccgggc aggaagcgac gcgccgcctt gccgcgttcg acgtgtccgg tgtagttgtc 1843561 gtcgactgtt gtcatatgcg ccactgctcc tatgccggct gcgccggcat catcgtcgtt 1843621 ggcgcgggtc atatgcgccg acgttacgac gttaccggtc ccctgatggt tgtcggtacg 1843681 ggcacatagc atgggatacg gcctttggcc ggcgagatga gtttcagtga aagggacagc 1843741 gtggctgacc gcctgatcgt caagggtgcg cgcgaacaca atctgcgcag cgtcgacctc 1843801 gacctgcccc gcgacgcgct gatcgtcttc accgggttat ccggatcggg caagtcctcg 1843861 ctcgcgttcg acaccatctt cgccgagggg cagcggcgtt acgtggagtc gctgtcggcc 1843921 tacgcccgcc aatttctcgg gcagatggac aagccggacg tcgacttcat cgaggggctg 1843981 tctccggcgg tgtccatcga ccagaagtcg accaaccgca acccacgatc gacggtcggg 1844041 accatcaccg aggtgtacga ctacctgcgg ctgttgtatg cgcgcgcggg cacgccgcac 1844101 tgcccgacct gcggggagcg agtcgcgcgc caaaccccgc aacaaatcgt cgatcaggtg 1844161 ctggccatgc cggagggcac tcggtttctg gtgctggccc cggtggtgcg tacccgcaag 1844221 ggcgagttcg ccgatctgtt cgataagctc aacgcccagg gctacagccg ggtgcgggtc 1844281 gacggtgtgg tgcatccgct gaccgatccg ccgaagctga aaaagcagga aaagcacgac 1844341 atcgaggtgg tggtggaccg tctcaccgtc aaggccgccg ccaagcggcg gctcaccgat 1844401 tcggtggaaa ccgcgctgaa tttggccgac gggatcgtgg tgctcgaatt cgtcgatcat 1844461 gaactgggtg caccgcatcg cgagcagcgg ttctccgaga agctggcctg ccccaacggg 1844521 cacgcgctgg ccgtcgacga cctggagccg cggtcgttct cgttcaactc gccctacggc 1844581 gcctgccccg aatgcagtgg tctgggcatc cgcaaggagg tcgacccgga gctggtggtg 1844641 cccgatccgg atcgcaccct ggcgcagggt gcggtggcgc cgtggtcgaa cggccacacc 1844701 gcggagtact tcacccggat gatggccggc cttggcgagg cgctcgggtt cgacgtcgac 1844761 acgccctggc gcaagctgcc ggccaaggcc cgcaaggcga ttctggaagg cgccgacgag 1844821 caggtgcacg tgcgctaccg caaccgctac ggacgcaccc ggtcgtatta cgccgatttc 1844881 gagggtgtgc tggcgttcct gcaacgcaag atgtcccaaa ccgagtccga gcagatgaag 1844941 gagcgctacg agggtttcat gcgggacgtg ccctgcccgg tgtgtgcggg cacccggctc 1845001 aagcccgaga ttctggcggt gacgctggct ggggagtcca agggggagca cggcgccaag 1845061 tccatcgccg aggtgtgtga gctgtcgatc gccgactgcg cggacttcct gaacgcgctc 1845121 acgctgggtc cgcgcgagca agcgatcgcc gggcaggtgc tcaaggagat ccggtcgcgg 1845181 ctcgggtttc tgctcgacgt cgggctggag tacctgtcgc tgtcccgggc ggcggccacg 1845241 ctgtccggcg gtgaggcaca acgtatccgg ctggccaccc agatcggctc cggcctggtg 1845301 ggtgtgctct acgtgctcga cgagccgtcc atcgggctgc accagcgcga caaccgtcgt 1845361 cttatcgaaa ccctcacccg gttacgggat ttggggaaca ctttgatcgt cgtcgagcac 1845421 gacgaggaca ccatcgagca tgcggactgg atcgtcgaca tcggcccggg ggccggtgag 1845481 cacggtggcc gcatcgtgca cagcgggccc tacgatgaac tgctacgcaa caaggattcg 1845541 atcaccggcg cctacctgtc cggccgggaa agcattgaga taccggcgat tcggcgttcc 1845601 gtcgaccccc gtcgtcaact caccgtcgtc ggcgcccgcg agcacaactt gcgcgggatc 1845661 gatgtgtctt tcccgctggg tgtgctgacc tcggtgaccg gtgtctcggg ttcgggcaag 1845721 tcgacgttgg tcaacgacat cctggccgcg gtgctggcca accgcctcaa cggcgcccgg 1845781 caggtccccg gccggcacac ccgggtcacc gggctggact atctggacaa gctggtgcgg 1845841 gtggaccaat cgccgatcgg gcgcacaccg cgatccaacc cggccaccta caccggtgtg 1845901 ttcgacaaga tccgcaccct gttcgccgcc accaccgagg ccaaggtccg cggctatcaa 1845961 cccggacgat tctcgttcaa cgtcaagggc ggtcgctgcg aggcctgcac cggcgacggc 1846021 accatcaaga tcgagatgaa cttcctgccc gacgtgtacg tgccgtgcga ggtctgccag 1846081 ggggcccggt acaaccgcga aaccctcgag gtgcactaca agggcaagac cgtctcggaa 1846141 gtgctggaca tgtccatcga ggaagcggcg gagttcttcg agccgatcgc cggcgtccat 1846201 cgctatctac gcaccctggt cgacgtgggc ctgggctacg tgcggctcgg ccagcccgcg 1846261 cccacgctgt ccggcggtga ggcccagcgg gtcaagctgg cctcggagct gcagaagcgc 1846321 tccaccgggc gcaccgtcta catcctcgac gagccgacga cgggactgca cttcgacgac 1846381 atacgcaagc tgctcaacgt gatcaacggc ctggtcgaca agggcaatac ggtgatcgtc 1846441 atcgaacata acctggacgt gatcaagaca tcggattgga tcatcgacct gggcccggag 1846501 ggcggtgccg gcggcggaac cgttgtcgcc caaggcactc cggaggacgt tgccgcggtg 1846561 ccggcgagct acaccgggaa gtttctcgct gaggtcgtcg gcggcggtgc ctcggccgcc 1846621 acatcgcggt cgaacagacg gcgcaacgtc agcgcctgag ctggactatc gccgcgcgtc 1846681 aagtctgtgc tcacggcggc gaactgggtg cggtctcact catcggtgtg catcgactca 1846741 cggatctgag ctagccgttc ggctgccgcg cgctgccgct gcgcgtactg atcttcgagc 1846801 cggcggcctt gcgggctctc ggcgtcgagt tcggttgccc ccagggccgt tccgtatcgg 1846861 gtttcgatct tctcgcggac ggattcgaag gtcggtaccc cagcgctgtc gtaccgcgga 1846921 tcggactcgg aattcggcgt cgtggcttcc ggtggtgtcg gttcgtcggg catgctctgg 1846981 caatgctcct atctgccggt accggcgatc tgctgtgtcg tacccggcaa cgggatctta 1847041 ggcactcccg gagtggccag ttggccggcc agccatggca gcgccgcagc gaaaacccgg 1847101 tcggcgaaag gccagtcgtg cttgcccggt tgtggaacca cggcgcagta gatgccgttg 1847161 gcgcggccga gggcgcacag tgcattggcg gcagcggcct ggttgcctgg gttggcggcg 1847221 gcatcgcgac cggccagccg catcgtggtg gtatcggcga cagcgttgtc gggcgagggt 1847281 ggacccggcg aagagatcgc gaaccaaccc gacagtccgg tgtagctgcc atgccgggtg 1847341 atcaccgtcg tcgggtcaaa cgccgaccag gcgtcttcgt tgccgccgaa caacctgacg 1847401 atggtttgcg tcttgttgcc agcgttcggg tagaaatcac cggcgatgtc gacaaacgcg 1847461 ctaaacagtg tcgggtgcat gacggtcaga tccaccgcgc aggtcccacc catcgaccaa 1847521 cccacgatgc cccagctggt ctgttcggga ctgacgccga atttcgagac catgtagggc 1847581 acaacatctt tagtcaagtg gtcggccgcg ttgccacgcc gtccattgac gcattcggtg 1847641 tcgttgttga acgcgccgcc ggaatccacg aataccacga cgggagcatt gccgctgtgg 1847701 gcggccgcaa agtcgtcgag cgtcttcacc gcgttaccgg ctcgcgccca atcggcgggt 1847761 gtgttgaatt gaccgccgat catcatcacc gtcggcagct gcggcggcgg agggttctcg 1847821 gaacgatgct ctcggtcgaa ccaggccggc ggcaggtaca ccagttcgcc gcgatgcttg 1847881 aagtgtgatg cgtcggaagg gatcaccact ggcaacaacg tgccgtgcga cggccgcacc 1847941 ccactgtgcg ccagtgcggc aacagcggcc tgatcggcct ggtcgggcaa cgggccggag 1848001 gtgagctggt tccacgcggt ctgcacggtc gggaagtagc caacccacag gttgagcgtc 1848061 aaggtcgcgc tgagcagaca gaggggcacg gccagcagcg acgcgccgcg gcgccaccac 1848121 cgcgcgctgc gccagcccag gatcaacacc gtcgccgccg cgccggtcaa cgcgacccag 1848181 atccacagcg tgctcggcgg ccgttcgttg gccaggccgt tgccggtgac ataccagcgc 1848241 gtcccccatg ccagggtggc cccgatagcg gcggccgtcg gcagccaccg ccgttgccag 1848301 tgacgtgatc gccaccctgc cgccagcacc agcacgaccg cggtcacgac ctggacagcg 1848361 agcggcaccc aaccgtgcat cagcgatgtg tggcctactg ctaacggctg cgtcgcggct 1848421 ggcggcgtcg acgcggtcac cagttcattc tgagccattt cgggcggtga tttgttgggg 1848481 gtttcctgtg atccgacgga cgcccaccgg ctccggctaa tgcggttttg ccaacggaaa 1848541 gggcagtgtt tcgcgaatgc tgcgcccagt gatcagcatg accacccggt caatgcccat 1848601 gcccaagccg ccggtgggcg gcatggcgta ctccatcgct tgcaggaagt cttcgtcgag 1848661 ttccatcgcc tcggggtctc cgccggcggc cagcagggac tgctcctgca ggcggcgccg 1848721 ttgctccacc gggtcggtca gctcgctgta ggcggtgccc agctcgatac cccacgccac 1848781 caggtcccaa cgctcggcga caccgcgctt gctgcgatgc ggtcgggtca acggtgacac 1848841 cgatgtcgga aagtcgatgt agaacgtcgg ttgctcggtg cggcactcca ccaggtgctc 1848901 gtatagctcg agcacgaccg cgccggcatc ccattgggtc cgatagggga caccggcggc 1848961 gtcgcacagc ttgcggagag tggtcaagcc ggtatcggcg tcgatgcgtt caccgagtgc 1849021 ttccgagatc gcatcatgca ccgtccgcac cggccatatc ccggagatgt cgaccggttc 1849081 gaggtggtgg cgggtgccgt cggaaccctt gtccgtccgg ggccgcatgg cgatgggcgc 1849141 cccgttggcg gcctgggcgg cgttctggat gagttcgcgg cagccgtcaa tccactcaag 1849201 gtagtcggcg tgtgcttgat aggcctccag tagggtgaac tccgggttgt ggctgaagtc 1849261 gacgccctcg ttgcgaaagg cacggccgag ctcgaatacc cgttccacgc cgccgacgca 1849321 caggcgcttg aggtagagct ctggtgcgat gcgcaggaac agatccatgg aatacgtgtt 1849381 gatgtgcgtg acgaacggtc gggcggtggc gccgccgtgc agctgctgta ggatcggcgt 1849441 ttcgacctcg acgaatccct ttgcgaacag cgtctcgcgc acagcgcgca gcacgctgct 1849501 gcgagcggtg atcagcgcac gggactcagc gttgaccgcc aggtcgaggt aacgggtccg 1849561 gactcgggct tcgggatcca gtagcccctt ccacttattc ggcaacggtc gcaaacactt 1849621 accgatcagg cgccagccgc tgacgatcaa cgatggagtt ccggtcttgc tggcgcccat 1849681 gtgtccggtc atctccacca gatcacccag atcggtcgcc gcgttgaagt cggccgcgca 1849741 gccctggtcc aggcgtgaat tatccagcag cacttgcatt tcgcccgacc agtcgcgcag 1849801 ctgggcgaac aacacaccac cgtagttacg tattcgcatg atgcgtccgg acaccgacac 1849861 gctagcctgg tggtctgcgg ccagcgcctg tgccaccgtg tgactgggcg gccggcccac 1849921 gggaaaggcg tcaatgccgc tgctccgcag cttctctagc ttgtcgaacc gaactcgcac 1849981 ctgctcgggt agccgccgct cgaccccgtc gccattggtg aggcctactt gccgcagccc 1850041 gctcacgtcc ggtgccgagc cgtcgtgatg caataggccg gtggccgcca accgctcggg 1850101 cactgccgga tgatgccccg tgtgtactcg gttgcgccgg ctgaacggca gcacgaggaa 1850161 cccctctgcg atcaccgagg cgacgcccac tcggggaatc actcgggcgt cttcgtagca 1850221 ggcgtagcgc ggtacccatt cgggttggta cttcatgttg gagcggtaga gcgtctcgag 1850281 ctgccaccac cgtgagaaga agaccagcag cccccgccac aaccgggcaa ccgggccggc 1850341 gccgagttgg gcgccctgct cgaaggccgc gcgaaacacc gcgaagttca acgaaatacg 1850401 agtgatacca aggctttcag cgtgcaaggc gagttcgctg accataagtt cgatagtgcc 1850461 gttcggggat tgtggagaac gacgcatcaa atccagggag acaccggtgg ttccccacgg 1850521 caccagcgac agcattgcca gcacctggtt gtgcggatca atcgcctcca ccagcaggca 1850581 gtcggagtcc gcggggtcgc cgaggcggcc cagcgccatc gagaagccgc gctcggtctc 1850641 ggtgtcgcgc caggaatccg cccgtgtgat ggtctgcgcc atctcgtctt cggcaatgtc 1850701 gcgatgccgc cggatgcgca ccgtcaaccc cgcccgccgg gcccgcgtca cggcctggcg 1850761 caccccgcgc atctccgggc cggacaactt gaaatcggct ggccgcagga tggcctcatc 1850821 gcccagctcg agcgcggtta ggcccgcttc gcgatatgtc tgagcccctt gtgaactggc 1850881 gcccatcacg ccgggtgccc agccgtaggt ctggcacagc cgcagccacg cgtcgacggc 1850941 ctgcggccat gctctgtggt cgcctaccgg gtcgccgctg gctaggcaga caccgacctc 1851001 gacacggtag gtgatacagg cgcggccgct ggatgcgaat accaccgact tgtcgcgacg 1851061 ggtggcgaag tagcccagtg agtcgtcctt cccatacaaa tccaataacc cgcggatagc 1851121 ggattcgtcc tctccggtca gcgcattgtc agcgcgctga gataggaaca agacgatcgc 1851181 agccccgatc aacgcgaacg cgccgaacaa cccgaagatc gcgttgagga agacgtgcgg 1851241 tctgccggtg aacagatcgg gatcggcgag ggcgaatccg accacccggt tggccgcgta 1851301 acccaaccgc tcgtccggcg ctagtgatcc cggaaacagt tcgaccagac cccaagacgc 1851361 cacgattccg accaccgcgc cggcaagcca caccgcagcc gcccgaaaca gcgcgcccct 1851421 gcggaccttg gcccagaact cccgatagcc cagcaccaga acgacgattg ccacaacatg 1851481 cacggcgaat ccgagattct ccccgaagct ctcggcggcg gtgttgccgc ccgctgcgat 1851541 ctcggcggcg ttgaccacgg cggccaggac catatttgcc agcaagacca accaggcaat 1851601 gcgtttgcgt gccgttaacg cggcggccag caatgccagc acgaaggacc acgcgaagtt 1851661 ggtgtcgggg aagttgaaca gataatcgtt gatgaattcg cgcggaacct tgatgatcca 1851721 ccgaatcaac ggcgacacac tggccagtag tgacagggtc gcgatcacgc cgacggtcca 1851781 gccggctgcc gcgggaaccc agtgataccg ggagtttccc ctggtggccg agcgaggttt 1851841 ggtgagtgtc acagaccgcg aggatattcc caaaagccgg gaaatgcccg gcgttgcagc 1851901 cctttgtagc cccgcatcgg tgtgctgagg gcaccggctg atgtcggccg ttgtcttaga 1851961 tgacgtgtca tggctgttag actggacgcc gcgaccatcc cggcgaaggc cagggacagt 1852021 taagtggagt cccactccca ccgctagcca cgagatcgtt tcacaccttc tcaaggttca 1852081 gcggtccggt cacaggcatc tcggatgcct gttctgcgtg cagcgtgggc ggctttggcc 1852141 gcgatcggtc ggcattgggc cctgcttgtg cagggctttt tttgctgatg gtttgggtgt 1852201 gttccccacc tgattccggc cgggtccaac aagctggtcg cgcctggaac agcagccaac 1852261 gagggaggcc ccatcagcac tgaaacccgc gtcaacgagc gcatccgcgt acctgaagtc 1852321 cgattgatcg gcccaggggg ggagcaggta ggcattgtgc gtatcgaaga cgcacttcgc 1852381 gtcgccgcgg acgcagatct cgaccttgtc gaagttgctc ccaatgccag accgccggtc 1852441 tgcaagatca tggactacgg caagtacaag tacgaggccg cgcagaaggc gcgcgaatcc 1852501 cgcagaaacc aacagcagac cgtcgtcaaa gaacaaaagc tgcgaccaaa gattgacgat 1852561 cacgattacg agaccaaaaa gggtcacgtc gtccgcttct tggaggcggg atcgaaggtc 1852621 aaggtcacca ttatgttccg tggacgtgag cagtcgcggc cggagttggg ctatcgattg 1852681 ctgcagcggc tgggtgcgga cgtcgccgat tacggattca tcgagacgtc cgccaagcag 1852741 gacggacgca acatgacgat ggtgctggca ccgcaccgcg gtgcgaagac ccgcgctagg 1852801 gcccgccacc cgggtgaacc ggccggcggg ccgccgccca agcccacggc cggtgacagc 1852861 aaagccgcac cgaactagct cgccagcaag acacgcagaa cctagaaatt ctagaaattg 1852921 aggaaacatg cccaaggcca agacccacag cggggcctcg aagcggttcc ggcgcaccgg 1852981 taccggcaag atcgtccggc agaaggccaa ccgtcggcac ctgctcgagc acaagccgag 1853041 cacccgcacc aggcgcctgg acggccgcac cgtggtggca gccaacgaca ccaaacgggt 1853101 cacgtcgttg ctgaacggct gaccgtaccg ccggccggct ccggcacctg accaatcacg 1853161 tccgaacgag agtaggaaga tccatggcac gcgtaaagcg ggcggtcaac gcccacaaga 1853221 agcggcgcag catcctgaag gcatcgcgag gctatcgcgg ccagcgatcg cggctttacc 1853281 gcaaagccaa agagcagcag ctgcattcac tgaactacgc ctaccgtgac cgccgggcgc 1853341 gtaagggcga gttccgcaag ttgtggatcg cacggatcaa cgcggctgcg cgcctcaacg 1853401 acatcaccta caaccggctt atccaggggc tgaaggccgc cggcgtcgag gtggaccgga 1853461 aaaacctcgc cgacattgcg atcagcgacc cggcggcgtt caccgcgctg gtcgacgtcg 1853521 cccgggcggc actgcccgaa gacgtcaacg ccccctccgg ggaggccgcc tgatccggat 1853581 tccggcctga ggcagggcta cgccggtgct caccgaacgc tcggccaggg tggccacggc 1853641 ggtcaaactg catcgtcacg taggccggcg ccgggcggga cgttttctcg ccgaaggccc 1853701 caacctggta gcggcggcgt tggcgcgcgg gctggtacgg gaggtattcg tcaccgaagt 1853761 tgcggcgcgg cggcacgagc tcttgttggc cgcgcacgag gcttcggttc atctggtgac 1853821 tgagcgggcc gcgaaggcgc tctctgatac ggtcacgccg gccgggttgg tggcggtgtg 1853881 cgatctgccg gcgacccgac ttgaggatgt attggccggc tcacctcagc tgatcgcggt 1853941 gaccgtcgag atccgcgagc cgggcaacgc gggcacggta atccgcatcg ccgacgccat 1854001 gggtgccgcg gcggtgatcc tcgccgggcg cagcgtcgac ccatacaacg gcaagtgtct 1854061 gcgcgcgtcc accggtagca tcttcgcgat cccggtcgtc gtcgcgcccg atgtcggtgc 1854121 cgccatcgcc gacctgcgag cggccggact gcaggtgctg gccaccgcag tggacggcga 1854181 gatggctctc gacgatgccg atcggctgct tgccgagccg acggcatggc tgttcgggcc 1854241 cgaagcacac gggttgtcgg ccgagatcgc ggccttggcg gaccaccgcg tacacatcct 1854301 gatgtcggga ggggcggaga gcctcaacgt cgcggccgcg gccgcgatct gtctgtatga 1854361 gagcgctcgg gcgttgggcc gccgctgatt gtccggccct acgcagcgcg gctggggccc 1854421 cgcgccggcc gcacgccggc cagcgaaagt gtggaatgga ccagcgcccc ggcgcgttcc 1854481 atcagggcct tggcgggatc gagagccgac cgggtaaagc gatgggaacg gggtgggaag 1854541 tagtgcatcg gcagcccctg ccgatcccgt ggtggcgcca tgaagccggg cgcttgcacc 1854601 aggggcgcgt catggcgagt acggcctgcc gctcgccggt aggcggccag cgcgcgcggg 1854661 tgcaaccgga tttcgtcggg caccgccaaa aacgccagtt ctaccacctt gccgaacact 1854721 cgcagcaaca cctcgtcacc cggtgtccag tgcattcccg ccttctcccg cacggccggg 1854781 tcgaacaggc cggccgcgat ccagcgctga ccggctatta gcggcttgaa cagctgatcc 1854841 cagatcggcg ttggcatgag tacaaacctc ggtttgggaa tccgcatctg gaggatgtcc 1854901 acggtcgcct gattgatctc gagcttgtcg cggcacaccc ggtcccaata gtcctgaaag 1854961 tcttcccacg acttgggcac cggtcgcatg ctcatcccat acatccggta ccagcgcacg 1855021 tgctcctcga agagctggtg tttttcggcc tcggtcaagc ctccgcagaa gtattcggcg 1855081 accttgatga caagcatgaa aaacgtcgca tgcgcccagt agaacgtatc tggattcagc 1855141 gcgtgatagc gacgcccctc agcgtcgact cccttgatgg ttcggtggta gcccttgatc 1855201 tgctggccgg tctgggccgc tcggtcaccg tcatagacca cacccatgat cgggtacacc 1855261 gagcgggcta cccgctgcaa gggttcgcgg agcaggattg aatgctcctc gacaccggca 1855321 cctagctcgg gatacatatt ttggatcgcg ccgatccaca cacccatcat cccggtgcgc 1855381 aggtctccga aatatttcca ggtcagcgaa tcgggcccga gcgggtcggc ggatgtcctc 1855441 gatgcgacag tcatgactgc ctccgtgcca ggttagtctg cgcccacgat aggcattgac 1855501 aacgcgcgtt gtccacgatt tggtccgccg atatcgcgcc gtgtcaccca gtgcctcctc 1855561 cgggtggcaa cgagcgtgga cgaggactgc agctgcatag cttggcccgc ggtgcgtgcg 1855621 ggggcaggga gtccaatgaa aaatgttgct tagaacgcca gaaagttttt aactagatca 1855681 ggattgctta gctgtagact ttatttctca atgaccacgt aaggattgct gcggccagta 1855741 caacgtgtac aaggagtcgg gctatgtcgt ttctcaccgt ggcgccggac atggtaacgg 1855801 cggccgccgg gaatttggaa agcgttggct cggcactgaa tgaggccgct gcggcggcgg 1855861 cgccagccac ggttgggctg gcggccccgg ccgcggatcg ggtgtcggcg gtcgtcgcgg 1855921 cgatgttggg ggcatatgcc cgggattttc aaggcatcag tgctcagatc gcgggttttc 1855981 ataaccagtt cgtgggcgcg ttgcggggcg gtgcggccgc ctacgccagc gccgaagccg 1856041 ccaacgtcca gcagaccgtg gtgaacgccg tgaatgcgcc cgcccaggcg ctgttggggc 1856101 acccgttgat cgggcccgag acggtcggct ccagcgccgc cgcggtctcc ttcggcttcg 1856161 gcccgttgct cctcgctggt agcgatccgc tgctggccgt gccattcagc tatccggcca 1856221 gtctgcccac cccattcggt ccagtaacga tgacgctcaa cgggtcgttt gatccgctta 1856281 cccaacaggt tgttttcgac tcgggatcac tcaccgcgcc cgctccgttc gtgtacggtc 1856341 ttggtgcggt aggtccagct ctcaccacca tgaccgcgct gcaaaacagc ggcacagcat 1856401 tttccggcgc ggtgcaaagc gggaacctgc taggggccgc gggcgcgctt ctgcaagctc 1856461 ccggcaacgc ggtgaccggc ttcctgtttg gccaaacagc gatatcgcag tcgataccgg 1856521 ggccatcgaa tctgggctac gagtcggtgg gtatcagcgt tccggtcggg gggctcttgg 1856581 ctccgctgca gcccgtgacg gtcacgttga cgcccacatc tggtatgccg actgccattc 1856641 aattgagtgg tacgcagttt ggcggccttc ttcccgccct actcaacggt ttctaaccgt 1856701 ctgcggacag ccgccgcaaa ccgcgtgatc agcgtgtttg atgcgacttg tgccacaaac 1856761 accgaggtcg tcattggcgg gctcagcccg caccacctac ccttgccacg tggaggtcgg 1856821 gccgcaggat tcggagtccg gcgcgcccga cgagacggca accgccatgg cgtcgccagt 1856881 acctcgacaa cggtccgcac tacgctggct gcgcaccgtg aaccgcagcc ctggcctggt 1856941 gtcattcatc caccgggcgc gccgcctgtt gcctggcgat ccggaattcg gcgacccgtt 1857001 gtccaccgcg ggtgagggtg gtccacgtgc cgcggctcga gctgccgatc ggctgctgcg 1857061 ggatcgcgat gcggcctcgc gcgaggtcgg cctgagtgtg ctgcaggtgt ggcaggcgtt 1857121 gaccgaggcc gtttcccgcc ggccggcaaa cccggaggtg acgttggtgt tcaccgacct 1857181 ggtcggcttt tccacgtggt cgttgcacgc tggtgacgat gccaccctca cgctgctgcg 1857241 gcaggtggcc cgggctgtcg aatcccccct cctggacgcc ggcgggcaca tcgtcaaacg 1857301 gctgggcgac gggatcatgg cggtgttccg caatccgacc gtcgcgctgc gagccgtgct 1857361 cgtcgcccaa gatgctgtga agtcgcttga agtgcaaggc tatacaccgc gaatgcggat 1857421 cggtatccac accggccggc cgcagcggct ggccgccgac tggctcggcg tcgacgtcaa 1857481 catcgccgcc cgggttatgg aacgtgccac caaagggggc atcatgatct cgcaaccgac 1857541 cctggacctg atcccgcaaa gtgagttgga cgcgctgggc gtcgtggccc ggcgggtgcg 1857601 taaacccgtg tttgccagca agcccaccgg cattccgccc gacttggcga tctatcgcat 1857661 caagactgtt agcgagtcga cagctgccga taacttcgat gagatgagtc ccgatgcaca 1857721 gtagaacgcg atgatctacc gcgtcgcctg cctgctggcc cggatccggt tcaccgtggg 1857781 ctacgtggcg gctcttgcat cggtcagcac caccatcctg atgcatggtc cgcaggtgca 1857841 cgcccaggtg attcggcatg ccagtacgaa cctgcacaac ctggcccatg gacacctggg 1857901 aacgctgtgg aacagcgcct tcgtcatcga cgagggcccg ctttatttct ggttaccctg 1857961 cttggcgtgt ctgctcgcgg tcgcggagct gcagctgcgc agcttgcggc tgaccgtggc 1858021 gttcgtcgtc ggtcatattg gggcgacact gttggtggcg gccgtgcttg ccggggcgat 1858081 cgagatcggc tggttgccat ggtccattag ccgggtcagc gatgtcggga tgagctacgg 1858141 tgccctcgcg gcgctcgggg cgctgaccgc ggcaatccct gggcggtggc ggccggcatg 1858201 gattggttgg tgggtatcgc tgggcttggc gactgcgacc atcggcggtg gtttcaccga 1858261 tgccggccac acggttgcgt tgctgttggg catgttagtg actgcctgct tcacccggcc 1858321 cgcgcgctgg acactcgggc ggtgtgcctt gctggcggtg gcgtcggggt tctgcttggt 1858381 gctgctagcc catagctggt ggagcttggt gagtgggtcg gccttgggtc tactcggggc 1858441 cctgggtgcc gccgggtttg cgcgttggac cagagcgcgc gccacatcgc tgccacccgg 1858501 cgcgctggcg attccgcagc cggcgctaag tcgctgagtc ccgcacaacg cgtgccgagc 1858561 cgggccgacc gaatcaccta tgatttgcac ttgcgtcacg ccgttagcgg gcaagtcggg 1858621 tacgtccatc agtccagttt ccgctccgcg acgatgcggg cggtccgaat agcctcgtca 1858681 gcaaggagag tggcgccgcg tgggtgatcc ccccctcgag tcgattgtgt cgatgttgtc 1858741 gccggaggca ttgaccacgg cggtcgacgc cgcccagcag gccatcgccc tagcggacac 1858801 cctggacgtc ctggcgcgcg tcaagacgga gcatctcggc gaccgctcgc cgttggcgct 1858861 ggcgcggcag gcgctggccg tgctgcccaa agaacagcga gccgaggccg gtaagcgcgt 1858921 caacgccgcc cgcaatgccg ctcagcgcag ctacgacgaa cggctggcga cgctgcgtgc 1858981 cgagcgcgac gcggccgtgc tggtggccga aggtatcgat gtcacattgc cctcgactcg 1859041 ggtgccggcc ggcgcccggc acccgatcat catgttggcc gaacacgtcg ccgacacgtt 1859101 catcgcgatg ggatgggaac tggccgaggg gcccgaggtg gagaccgagc agttcaactt 1859161 cgacgccctc aacttccctg ccgaccaccc tgcgcgcggc gaacaagata ccttctacat 1859221 cgcgccggag gattcgcggc agctgctgcg cacccatacc tcaccggtgc agattcgcac 1859281 cctgctagcg cgtgagctgc cggtctacat catctcgatc ggtcgtacct ttcgcaccga 1859341 cgaactcgac gccacccaca cgcccatctt ccatcaggtg gaaggcctag cggtggaccg 1859401 cggtctgtcg atggctcacc tacgtggaac gctggacgct tttgcgcgcg ccgagttcgg 1859461 gccgtctgcg cggacccgga tccggccaca cttcttcccc ttcaccgaac cgtccgccga 1859521 ggtcgatgtg tggtttgcca acaagattgg cggcgccgcc tgggtggagt ggggcgggtg 1859581 cggaatggtg catccgaacg tgttgcgggc caccggcatt gatcccgatc tctactccgg 1859641 tttcgcgttc gggatggggt tggaacgcac cctgcagttt cgcaacggca ttcctgacat 1859701 gcgcgacatg gtcgaaggcg acgtccgatt ctcgttgccg ttcggggtgg gtgcctgatg 1859761 cggctaccct acagctggct gcgcgaggtg gttgcggtcg gcgcttcggg ctgggacgtt 1859821 accccaggcg aactcgagca gacgctgttg cgcatcggcc acgaggtcga agaggtcatc 1859881 ccccttggtc cggtggacgg cccggtgacc gtggggcggg tggccgatat cgaggagctc 1859941 accggctaca agaagccgat ccgggcctgc gcggtagata tcggcgatcg gcagtatcgc 1860001 gagattattt gtggtgcaac caatttcgcg gttggtgatc tggtggtggt agcgctgccc 1860061 ggtgccacgc tgcccggtgg attcaccatt agcgcccgca aggcctacgg tcgcaactcc 1860121 gacggaatga tctgctcggc agccgaactc aatttgggcg cagaccattc cgggatcctg 1860181 gtgttgcccc ccggagccgc cgagcccgga gctgacggcg cgggcgtgct ggggctcgac 1860241 gacgtggtct tccatctggc catcacccca gaccgcggtt actgcatgtc ggtgcgcggc 1860301 ttggcccgcg agctcgcgtg cgcctacgac ctggacttcg tcgaccccgc cagcaactcg 1860361 cgggtgccgc cgctacccat cgaggggcca gcctggccgc tgacggttca gcccgagacg 1860421 ggggtgcgcc ggttcgcgct acgcccggtc atcgggatcg accccgccgc ggtatcgccc 1860481 tggtggttgc agcgccgact gctgctctgc ggtatccgcg cgacctgtcc ggcggtcgac 1860541 gtgaccaatt acgtgatgct cgaacttggc caccccatgc acgcccacga ccgcaaccgg 1860601 atcagcggaa ccctcggagt gcggttcgcc cggtccggcg agaccgccgt gaccctcgac 1860661 ggtatcgagc gcaagctcga taccgccgat gtcctgatcg tcgacgatgc tgcgacagcg 1860721 gcgatcggcg gcgtgatggg ggcggccagc accgaagtgc gggccgactc caccgatgtc 1860781 ctgttggagg ccgcgatatg ggacccggct gcggtatcgc gtacccagcg gcggctgcac 1860841 ctgcctagcg aggccgcccg tcgttacgag cggacggtgg acccggccat ctccgtggcc 1860901 gctttggacc ggtgcgcaag gctgctcgcc gacatcgccg ggggggaggt ttctcccacc 1860961 cttaccgact ggcggggtga cccgccgtgt gatgactggt caccgccgcc gatccggatg 1861021 ggagtcgatg tgccggaccg catcgccggg gtggcctatc cgcagggcac tactgccagg 1861081 cgcttggccc agatcggcgc ggtggtgacc cacgacggcg acaccttgac cgtgaccccg 1861141 ccgagttggc gacctgatct gcggcaaccc gcagaccttg tcgaggaggt gctgcggctt 1861201 gaggggctgg aagttatccc gtcggtgctg ccaccggcgc ccgcgggtcg tggactcacc 1861261 gctgggcagc agcgccgtcg cacgatcggc aggtcgctgg cgctgtcggg ctatgtcgag 1861321 attctgccga ctccatttct gccggccggt gtgttcgatt tgtgggggct ggaagccgat 1861381 gactcacggc gcatgaccac gcgggtgctc aacccgctgg aggccgatcg tccgcaactg 1861441 gcgaccacgc tgctgccggc cctgctggaa gccttggtgc gcaacgtgtc ccgagggctg 1861501 gtcgacgtcg cgctgttcgc catcgcccag gtggtccagc cgaccgagca gacgcgcggt 1861561 gtcgggttga tcccggttga ccggcggccg accgatgatg agatcgccat gctggatgcc 1861621 tcgctgcccc ggcaacccca gcacgtcgcg gcggtgctgg ccggactgcg cgagcctcga 1861681 ggcccctggg gcccgggccg cccggtagag gcggctgatg cgttcgaggc ggtgcgaatc 1861741 atcgcgcgcg ccagccgcgt ggacgtgacc ctgcggccgg cccaatatct gccgtggcat 1861801 ccgggccggt gcgcgcaggt gttcgtcggg gaaagctcgg ttggtcacgc cgggcagctg 1861861 catcccgccg tgatcgagcg ctcgggtctg ccgaaaggca cctgcgcggt ggaactgaac 1861921 ctagatgcga ttccgtgcag cgcgccgctg ccggcaccca gggtgtcgcc gtatccggcc 1861981 gtgttccaag acgtcagcct ggtggtggcc gcggacatcc ccgctcaggc ggtggccgac 1862041 gccgtgcgcg cgggggcagg cgacctgctg gaggatattg cgttgtttga cgtgttcacc 1862101 ggcccgcaga ttggtgagca ccgcaagtcg ctgaccttcg cgctgcggtt tcgtgcgccg 1862161 gatcgcacct taaccgaaga cgacgccagc gccgcccgcg atgccgctgt gcaaagcgca 1862221 gccgaacggg tgggtgccgt gctgcgtggc tgaaccgact cagcacgcgt tcaacgaaaa 1862281 tttgacgacg gcatttcagc gcgccgcgtt tatacctcgc cgccctgtcc gggtagcggc 1862341 gccgccctaa ggggcaattg cctgcgctag ctgtgtggga gcgtagttca ccaacgcggg 1862401 aacgatgccg ccggcgggcg taccttcgag cgtaacggta accggcccga tgaccggtat 1862461 taccgccgtg gcctgaaacg gctgcagagg cgcaagaatg ccgccgacgg gaacctcgac 1862521 cgtcaccgga atcccccctg tcgccgatgt tggcagggcc agcggcagcc tggcctcgcc 1862581 attgaggaag ccgttggcga cgttggcggg agcaccgacc agggccgccg ctgccgcctg 1862641 caggtttccg gcctgcacgg cgctgacgaa cgctgtcgtg ctctcggcga atgcgattgc 1862701 cgtcgtgatc ggcgaaccca ccgcattaag ggtcatcgcc agcggcaatc caaacgtcat 1862761 caccccggtc aagttcgtgg tatcgatcga aaaggcgatg gttgtgtccg tgaccgtcat 1862821 caccacattg gtgaagtttt gtgacatggc gccggggatg ctcaggatgg ggaacaggtc 1862881 tcccaccggc ccgagcagca ggatgttcga caagtcactc gcgtcaacac cgctgacgaa 1862941 gaccttcacc accgccccta acacgtcggt caccgcgccg ctgacgtcgc ctgccgcgag 1863001 ggcttgcaag gccgattgca ggctcggcgg tatgccagcc agcccaatag cgaagtccct 1863061 ggtggcatct gtcagcgcgg ttagggtcag ctggccgtag ccgaactggt tggcgaggta 1863121 ctgctgcagg aacggcgccg ggtcggcaag ccaggtattg ccgatgctcg ccaggttggc 1863181 gaccgtgttg gcgatgaggt cttcgtatgg cccgaggatg ggaacactgc tgctcaggct 1863241 agggaaggcc agcgctgccg cgcccggtgg accggagctg ccactttggc cgaacaacac 1863301 cccgccggtg ccaccggtgc cgccattacc cggggcaccg gccgggctgc cggcgccgcc 1863361 ggcgccaccg gcgccaccgt caccaccgtt gccgatcaag gtggcgttgc cgccgtgacc 1863421 gccgtcgcta ccggtgccgc cggccttggt accggaaccg gcgccgcccg cgccgccggt 1863481 tccgccggtc ccgccgtcgc cgaacacttg gccggcgttg ccgccgtttc cgctggcacc 1863541 gcccttaccg ccgataccgt tgccgccact gtgatcccca ccggtaccac cggcgccgcc 1863601 ctgcccgcca ttaccgaacg cgatggcgct gcctccggtg ccgccgatac cgccggtgcc 1863661 accctcaagg gcgccatcgg cggtggtgcc accgttgccg ccgttcccac cggccccacc 1863721 gttgccccat atcagcccgc cggcaccgcc gtgaccgccg gcaccgccgg taccgccggg 1863781 acttgcgaag agggagccag agttagcccc accagtgccg ccgttcccac cggccccacc 1863841 gctgccgaga agcaacgcgg tgccgccgct gccgccggca ccgccgacac cggagctaaa 1863901 tagcgctgca gccccaccgg cgccaccggc cccaccgttg ccgccattgc cgatgaagct 1863961 actgcccgcg gcaccaccgg cgccgccggc accggcgttg gcgagtatgt tgatagcagc 1864021 cccgccgatg ccaccggccc ccccgttccc gccgttgccg tagagcagcc cgccgacgcc 1864081 gccggccccg ccggccccgc cggctccgct ggtagcgctg gccagatcgc tgctcgtccc 1864141 ccccttgccg ccgacgccac cggtcccacc gttaccgaac aagctggcgt tgccgccagc 1864201 acccccggca ccgccgacgc cggagtcgaa caatggcacc gtcgtatccc caccattgcc 1864261 gccggcccca ccggcaccgc cgttgccgta cagcaggccg gcgttgccgc cggccccgcc 1864321 agcgccggcg ttcatgccga cgcccaacaa tgacgtggcg gcgccgccgt cgccgccggc 1864381 accgccggag ccccacaggc cgacgctgcc gccggccccg ccggccacgc cgctaccggt 1864441 gagaccgctg gtgccgccag cgccgccggc accgccattg ccgaccaggg tattcccgcc 1864501 cgcacccccg gcggcgacgg tgctcgatcc gccgtccccg ccgttgccga acagtgcatt 1864561 tccacctgca ccgccagcct tcgaggtgct ggaaccaccg tccccgccat tgccgaacaa 1864621 cccgccgtcc gcgccggcta gccccgatcc ggccccagca ttgccgccgt taccaaatat 1864681 cgtcccggcg tggccggcgg ctccgccgga agccccactt ccgccgttcc cgccgttgcc 1864741 gaacagcagg gcgttgccgc cggccccacc ggccgcagcg gcactgcccc cgttgccgcc 1864801 ggcaccgccg ttgccataca acagtccgcc ggtgccgccg gccccgccag cgccgcctgc 1864861 acctccagca ccgccggcgc cgccggcgcc gccgttgccg atcaatccgg cggccccgcc 1864921 gttaccgccg gccccgccgt tgccgccgtt gccgtacaaa attccaccgg gcccgccgtt 1864981 gccgccggca tttgacccgg tccccgccac tccatcggcg ccgttgccga tcagtgggcg 1865041 ccccagcagc gtctgcgtgg gcgcattcac cgcgtcgagc agggcctgca tcgacgacac 1865101 gctggcggcc tcggcgccgg tataggccgc cgcgccgccg ttcaacaagc tcacgaactc 1865161 ggcgtgaaac gtcgccgccc gggcgttgag cgcttgaaat tgctgaccgt aggcgccgaa 1865221 tagtcgcgag acagccgccg acacctcatc ggcgccggcc gatgccagcg cggtcgtggg 1865281 ggtcgatgcg gcggcagcgg cttcgctcag tgccgagcga ataccagcta aattggcggc 1865341 cgctgctgtg accaagtccg gctccacgag taagaacgac atggcggtcc cccttcgact 1865401 cggcgcagct agtggacatg tgtcacggga aattcagcct agttgggtct tatgtcatgt 1865461 gagggaaaac gcacgttttc gcggacgcaa cttcgagtcc catcggcgcc gcccggcggt 1865521 gtgtcaagtc ccggcgcagt caccgcggaa tgagtttgca aactgttgca taacgatgca 1865581 aaatcggcag gtggccaatg cgacgaaggt ggcggttgcc ggtgccagcg gatatgccgg 1865641 tggtgagatt ctccgcctgc tgctcgggca tccggcgtac gccgacggcc ggctgaggat 1865701 cggtgcgctg accgcggcga ccagcgccgg cagcacgctc ggcgaacacc atccgcacct 1865761 gacgccgctg gcccatcgag tagtcgaacc caccgaagct gccgtgctcg gtggccatga 1865821 cgccgtcttc ttggccttgc cgcacgggca ttcggcggtg ttggcgcagc aactgagccc 1865881 cgagacactg atcatcgact gcggggcgga ctttcggctc accgacgccg ccgtctggga 1865941 gcggttctac gggtcgtcgc acgccggtag ctggccgtat gggttgcccg agctgccggg 1866001 cgcgcgggac caattgcgcg gcacccgccg catcgcggtg cccggctgct atccgaccgc 1866061 ggcactgctg gcgctttttc ccgcgctggc cgcagacctt atcgagcccg cggtgaccgt 1866121 ggtcgccgtg agcggtacct cgggggcggg tcgtgcggcc accaccgact tgctgggcgc 1866181 ggaggtcatc gggtcggcgc gcgcctacaa catcgccggc gtccaccggc acacccccga 1866241 gatcgctcaa gggctacgcg cggtcaccga ccgcgacgtc tcggtctcgt ttaccccggt 1866301 gctgatcccg gcctcccgtg gcatcctggc cacctgcacg gcacgcaccc gatcacccct 1866361 gtcgcagctg cgggcagcct acgaaaaggc ctaccatgca gagcctttca tttatctgat 1866421 gccggagggg cagctgccgc gcaccggcgc ggtgatcggc agcaacgcag cgcacatcgc 1866481 cgtcgcggtg gacgaggacg cgcagacgtt cgtggcgatc gccgcgatcg acaacctggt 1866541 caagggcacc gccggcgccg cggtgcaatc gatgaacctg gcgctgggct ggccggagac 1866601 cgacggcctt tcggttgtgg gggtggcgcc gtgaccgacc tggccggcac cacccggctg 1866661 ctgcgcgctc agggcgtcac cgccccggcc ggctttcggg ccgccggcgt cgccgccggg 1866721 atcaaggcct ccggtgcgct ggatctggcg ctggtgttca acgagggacc cgactacgcc 1866781 gccgccgggg tgttcacccg caaccaggtc aaggcggcgc cggtgctgtg gacccagcaa 1866841 gtgctgacca ccgggcggct gcgcgcggtg atcctcaact ccggcggcgc caatgcctgc 1866901 accgggccgg ccggcttcgc cgacacccac gccaccgcgg aggcggtggc cgcggcgttg 1866961 tcggactggg gaaccgagac cggggccatc gaggtcgccg tctgctccac cgggctgatc 1867021 ggcgaccggc tgccgatgga caagctgctc gccggcgtcg cccacgtggt gcacgagatg 1867081 catggcgggc tggtcggcgg cgatgaagcc gcccacgcca tcatgaccac cgacaacgtg 1867141 cccaaacagg ttgcgctgca ccatcacgac aactggacgg tcggcggcat ggccaaaggc 1867201 gcgggcatgc tggcgccgtc gttggccacc atgctgtgcg tgctcaccac cgacgcggcc 1867261 gccgagccgg ccgcactcga gcgggcgctg cgccgcgccg ccgcggccac gttcgaccgg 1867321 ctcgacatcg acggcagctg ctccaccaac gacaccgtgc tgctgctgtc gtccggggcc 1867381 agtgaaatcc cccctgccca ggccgatctc gacgaggccg tgctacgggt ctgcgacgat 1867441 ttgtgcgccc agctgcaggc cgacgccgaa ggcgtcacca aacgcgtcac cgtgaccgtg 1867501 accggggccg ccaccgaaga cgacgcgctg gtcgccgccc gccagatcgc ccgcgacagc 1867561 ctggtcaaga ccgcgctgtt cgggtccgac ccgaactggg gacgggtgct cgccgccgtc 1867621 gggatggcac cgatcaccct cgacccggat cgaatcagcg tgtcgttcaa cggtgccgcg 1867681 gtgtgtgtgc acggtgtcgg cgctcccggt gcgcgcgagg tggacctgtc ggacgcggac 1867741 atcgatatca ccgtcgacct cggcgtcggc gacgggcagg cgaggatccg aaccactgat 1867801 ctgtcgcatg cctacgtcga agagaactcg gcctacagct catgagccgc atcgaagcac 1867861 tgcccaccca catcaaagcg caggtgctgg ccgaggccct gccctggctc aagcagttgc 1867921 acggcaaggt cgtcgtcgtc aaatacggcg gcaacgcgat gaccgacgac acgctgcggc 1867981 gcgcgttcgc cgccgacatg gcgtttctgc gcaactgcgg catccatccc gtcgtggtgc 1868041 acggcggggg gccgcagatc accgccatgc tgcggcggct cggcatcgag ggcgacttca 1868101 agggcggatt ccgggtcacc acacccgaag tgctcgacgt ggcccggatg gtgctgttcg 1868161 gtcaggtggg ccgggaactg gtcaacctga tcaacgcgca cggaccgtat gccgtcggga 1868221 tcaccggcga ggacgcgcag ctgttcaccg ccgtgcggcg cagcgtcacc gtcgacggcg 1868281 tggccaccga catcggcctg gtcggcgacg tcgaccaggt gaacaccgcg gcaatgctgg 1868341 atctggttgc ggcgggccgg atcccggtgg tgtccacgct ggccccggat gccgacggcg 1868401 tggtgcacaa catcaacgcc gacaccgccg ccgcggcggt cgccgaagcc ctgggcgccg 1868461 aaaagctgtt gatgctcacc gatatcgacg gcctgtacac ccgctggccg gatcgcgact 1868521 cgctggtcag cgagatcgac accggcacac tggcgcaact gctgccgacg ctggaatcgg 1868581 gcatggtccc caaggtcgaa gcgtgcctgc gggcggtcat cggcggggtg cccagcgcgc 1868641 acatcatcga tgggcgggtc acacactgcg tgttggtgga gttgttcacc gacgcgggca 1868701 ccggcaccaa ggtggtgcgc ggatgaccgg cgcttcgacc acgacggcga ccatgcggca 1868761 gcggtggcaa gccgtgatga tgaacaacta cggcaccccc ccgatagcgc tggccagcgg 1868821 tgacggcgcc gtggtcaccg acgtggacgg cagaacctat atcgacctgc tcggcggcat 1868881 cgcggtcaac gtgctgggcc atcgccaccc cgcggtcatc gaggccgtca cccggcagat 1868941 gtcgacgctg gggcacacct ccaacctgta tgccaccgaa ccgggcatcg cgctggccga 1869001 ggagctggtc gcgctgctgg gggccgacca gcggacgcga gtgttcttct gcaactccgg 1869061 cgccgaggcc aacgaggcgg cgttcaagct gtctcggctc accggacgca cgaaactggt 1869121 cgccgcccac gacgccttcc acggccgcac catgggctcg ctggcgctca ccggacaacc 1869181 ggccaagcaa acgccgttcg cgccgctgcc cggcgacgtc acgcacgtcg gctacggcga 1869241 cgtcgacgcg ttggccgccg ccgtcgatga ccacaccgcc gcggtgttcc tggaaccgat 1869301 catgggggag agcggggtcg tcgtcccgcc cgcgggctac cttgccgccg cccgcgacat 1869361 cacggcgcgg cgcggcgcgc tgctggtgct cgacgaggtg caaaccggga tgggccgcac 1869421 cggagcgttc ttcgcccacc agcacgacgg catcaccccg gacgtggtga ccctggccaa 1869481 gggtctgggc ggcgggctgc cgatcggtgc ctgcctggcc gtcgggccgg ccgccgaact 1869541 actgacccca ggcctgcacg gcagcacctt cggcggcaac ccggtctgcg ccgcggcggc 1869601 gctggcggtg ctacgggtgc tggcgagcga cggcctggtc cgccgcgccg aagtcttggg 1869661 caaatcgttg cggcacggca tcgaagcgct cggccacccg ctcatcgacc acgtgcgcgg 1869721 acgcggactg ctgttgggca tcgcgctgac cgccccgcac gccaaggacg ccgaggccac 1869781 cgcccgcgac gccggttacc tggtcaacgc ggccgcaccc gacgtcatcc ggttggcgcc 1869841 gccgctgatc atcgccgaag cacagctcga cggctttgtc gccgccttgc cggcaatcct 1869901 ggaccgcgcc gtgggggccc cgtgatcagg catttcctgc gcgacgacga tctgtccccg 1869961 gccgaacagg ccgaggtgct cgagctcgcg gccgagctga agaaagaccc ggttagccgt 1870021 cgtcccctgc aagggccgcg cggggtggcg gtcatcttcg acaagaactc cacccgcacc 1870081 cggttctcct tcgagctggg catcgcgcag ctgggcgggc atgccgtcgt cgtcgacagc 1870141 ggcagcaccc agctgggccg cgacgaaacc ctgcaggaca ccgcaaaggt gttgtcccgc 1870201 tacgtcgatg ccatcgtctg gcgaaccttc ggccaagagc ggctggacgc catggcgtcg 1870261 gtcgcgacgg tgcccgtgat caacgcgctc tccgatgagt tccatccgtg tcaggtgttg 1870321 gccgacctgc agaccatcgc cgaacgcaag ggggcgctgc gcggcctgag gttgtcctac 1870381 ttcggcgacg gcgccaacaa catggcccac tcgctgctgc tcggcggggt caccgcgggt 1870441 atccacgtca ccgtcgcggc tcccgagggc ttcctgcccg acccgtcggt gcgggccgcg 1870501 gccgagcgcc gcgcccagga taccggcgcc tcggtgactg tgaccgccga cgcccacgcg 1870561 gccgccgccg gcgccgacgt tctggtcacc gacacctgga cgtcgatggg ccaggaaaac 1870621 gacgggttgg accgagtgaa gccgtttcgg ccgtttcagc tcaactcgcg acttctggcg 1870681 ctggccgact cggatgccat cgtgttgcat tgcctgccgg cccatcgcgg cgacgagatc 1870741 accgacgcgg tgatggacgg gccggccagc gcggtgtggg acgaggccga aaaccggctg 1870801 cacgcgcaga aggcgctgct ggtgtggctg ctggagcgct catgagccgc gccaaggccg 1870861 cgcccgttgc ggggcccgag gtcgccgcaa accgcgccgg ccgccaggcg cgcatcgtgg 1870921 cgatcctgtc gtcggcgcag gtgcgcagcc aaaacgaact ggcggcgctg ctggccgccg 1870981 agggcatcga ggtcacccaa gccacactgt cacgcgatct ggaagagctc ggcgcggtga 1871041 aactgcgcgg cgcggacggc ggcaccggca tctacgtggt gcccgaggac ggcagcccgg 1871101 tgcgcggcgt ctcgggcggt accgaccgga tggcgcggct gctcggtgag ctgctggtgt 1871161 cgaccgacga cagcggcaac ctcgcggtgt tgcgcacccc gccgggcgcg gcgcactacc 1871221 tggccagcgc catcgaccgc gcggccctgc cccaggtcgt cggcaccatc gccggtgatg 1871281 acaccatcct ggtggtggcc cgcgagccga cgaccggcgc gcaactggcc ggcatgttcg 1871341 agaaccttcg gtaaggagag tcatgtcaga gcgcgtcatc ctggcctatt ccggcggtct 1871401 ggacacctcg gtggcgatca gctggatagg caaggagacc ggccgtgagg tggtggcggt 1871461 ggcgatcgac ctcgggcagg gcggcgagca catggacgtc atacggcagc gggcgctgga 1871521 ctgcggcgcg gtggaggctg tcgtcgtcga cgcccgcgac gagttcgccg aaggctactg 1871581 cctgcccacc gtgctgaaca acgcgctgta catggaccgc tacccgctgg tgtcggcgat 1871641 cagccggccg ctgatcgtca aacacctggt cgccgcggcg cgcgagcacg gcggcggcat 1871701 cgtcgcgcac ggctgcaccg gcaagggcaa cgaccaggtc cggttcgaag tcgggttcgc 1871761 ctcgctggca ccggatttag aggtgttggc gccggtgcgc gactacgcgt ggacgcggga 1871821 gaaggcgatc gcgttcgccg aggagaacgc gatcccgatc aacgtcacca aacgttcgcc 1871881 gttctccatc gaccagaacg tctggggccg cgcggtggag accggcttct tagagcacct 1871941 gtggaatgcc ccaaccaagg acatctacgc ctacaccgaa gaccccacga tcaactgggg 1872001 ggtccccgac gaggtgatcg tcggcttcga acgcggcgtg ccggtgtccg tcgacggcaa 1872061 gccggtgtcg atgctggcgg cgatcgagga gctcaaccgc cgcgccggag cgcaaggtgt 1872121 cgggcgcctc gacgtcgtgg aggatcggct ggtgggcatc aagagccgcg agatctacga 1872181 ggcgcccggc gcgatggtgc tgatcaccgc gcacaccgaa ctcgaacacg tcaccctgga 1872241 gcgtgagctg ggccggttca aacgccagac cgaccagcgc tgggccgaac tggtctacga 1872301 cgggctgtgg tactcgccgc tgaaggccgc gctggaggct ttcgtcgcca agacccagga 1872361 gcacgtgtcc ggcgaggtgc ggctggtgct acacggcggc cacatcgcgg tcaacggccg 1872421 gcgcagcgcg gaatcgttgt acgacttcaa cctggccacc tacgacgagg gcgacagctt 1872481 cgaccagtcc gccgcccgcg gcttcgtcta cgtgcacggg ctgtcctcca agctcgccgc 1872541 ccgccgggat ctgcggtgac ggttctcccg cgagcagacg cagaatcgca ccgccacgcc 1872601 cgtcggcgtg cgattctgcg tctgctcgcc acagaaaagt gagcaccaac gaggggtcgc 1872661 tgtggggcgg gcggttcgcc ggcggcccgt ccgacgcgct ggccgcgctg agcaagtcca 1872721 cccacttcga ctgggtgctg gccccctacg acctcaccgc gtcgcgggcg cacaccatgg 1872781 tgctgtttcg ggccgggctg ctcaccgagg agcaacgcga cgggctgctc gccggcctgg 1872841 acagcctcgc ccaagacgtc gccgacggca gcttcggccc gctggtcacc gacgaggacg 1872901 tgcatgccgc gctggagcgg ggcctgatcg accgggtcgg accggacctg ggcggccgac 1872961 tgcgggccgg gcgctcgcgc aacgaccagg tggccgcgct gtttcggatg tggctgcgcg 1873021 acgcggtgcg ccgggtcgcc accggtgtgc tcgacgtggt cggtgcgctg gcagagcagg 1873081 ccgccgcaca cccgagcgcc atcatgcccg gcaaaaccca cctgcagtcc gcccagccga 1873141 tcctgctggc acaccatctg ctcgcgcacg cccaccccct gctgcgcgac ctggaccgca 1873201 tcgtcgactt cgacaaacgc gcggcggtgt ccccgtacgg ctcgggcgcc ttggccggct 1873261 cgtcgctggg cctggatccc gacgcgatcg ccgcggacct cggtttctcg gctgccgcgg 1873321 acaactccgt cgacgcgacc gccgcccgcg acttcgccgc cgaggcggcg ttcgtgttcg 1873381 ccatgatcgc cgtcgacctg tcccggctgg ctgaggacat catcgtctgg agctcgacgg 1873441 aattcggcta cgtcacgttg catgactcgt ggtccaccgg tagctcgatc atgccgcaga 1873501 agaagaatcc ggacatcgcc gagctggccc gcggcaagtc cgggcggctg atcggaaacc 1873561 tggccgggct gctggccacc ctgaaagccc agcccctggc ctacaaccgc gacctgcagg 1873621 aagacaagga gccggtgttc gattcggtgg cccagctgga gctgctgctg ccggcgatgg 1873681 ccgggctggt ggccagcctg accttcaatg tccagcggat ggcggagctg gccccggccg 1873741 gctatacgtt ggccaccgat ctcgccgaat ggcttgtgcg gcaaggtgtt ccgtttaggt 1873801 ccgcgcatga ggccgcgggt gcggcggtgc gtgcggccga acagcgcggc gtggggctgc 1873861 aggaactcac cgacgacgag ctggccgcca tcagccccga gctgaccccg caagtccgcg 1873921 aggtgctgac catcgaaggc tcggtgtcgg cccgcgattg ccggggtggc accgcgccgg 1873981 gccgggttgc cgagcaactg aacgccattg gtgaagccgc cgagcggctg cgccgccagc 1874041 tggtgcgctg agggggcctc gaaactttgc cggccagttc caggcgggct aaacttcggg 1874101 ctctaggcga cccggttgaa ccattcggcc tcgatgtgcg tgtcaaaggg gtgggaccag 1874161 tgagcgtcat cgcaggtgtg ttcggcgcgt tgccgccgta tcgctattca caacgcgagc 1874221 tcaccgactc gtttgtcagc atcccggatt tcgagggcta cgaagacatc gttcgccagc 1874281 tgcacgccag cgccaaagtc aacagccgcc acctggtctt gccgctggag aaatacccga 1874341 agctgaccga cttcggcgag gcgaacaaga ttttcatcga aaaagccgtg gacttgggcg 1874401 tgcaagccct ggcgggggca ctcgacgagt ccggtctgcg acccgaggat ctcgacgtgt 1874461 tgatcaccgc cacggtcacc ggactggcgg tgccgtcgct ggatgcccgg atcgccgggc 1874521 ggctggggct gcgcgccgat gtccggaggg tgccgctgtt cgggctgggc tgcgtggccg 1874581 gggcggccgg ggtcgcccgg ctgcacgact acctgcgcgg ggccccggac ggcgttgccg 1874641 cgttggtctc ggtcgagctg tgttcactca cgtatccggg atacaagccg acgctgccgg 1874701 gccttgtcgg cagtgcgttg tttgctgacg gcgccgcggc ggtggtggcc gcaggtgtga 1874761 agcgcgccca ggacatcggc gccgacgggc cggacatcct ggattcgcgc agccatctgt 1874821 accccgactc gctgcgcacc atgggatacg acgtcggctc ggccgggttc gagctcgtcc 1874881 tatcacggga cttggcggcc gtggtcgagc agtatctggg caatgacgtc accaccttcc 1874941 tggcttcgca cggcctgagc accaccgacg tcggcgcctg ggtcacccat cccgggggac 1875001 ccaagatcat caacgccatc accgagaccc tcgacctgtc gccgcaggct ctcgagctga 1875061 cgtggcgctc gttgggcgaa atcgggaatc tgtcgtcagc gtcggtgctg catgtgctgc 1875121 gtgacaccat cgccaaaccg ccccccagcg gaagtcccgg gttgatgatc gccatgggcc 1875181 caggcttctg ttccgaactc gtgttgctgc gctggcactg atgctggatt ccgcgagcgt 1875241 aacgccactg cgctattcgg atcgcaatct cgcagtgacg ttacgctcgg cggacctcgt 1875301 gccatgaaca gcactcccga agacctcgtc aaggccctgc gcagatcgct caagcaaaac 1875361 gagcgactga agcgagagaa ccgggatctt cttgcccgga ccaccgagcc ggtggcggtg 1875421 gtggggatgg gatgccgcta tccgggtggg gtggattcgc cggagacgct gtgggagctg 1875481 gtggcacacg gccgtgacgc ggtttcggag ttcccggcgg atcgcggctg ggatgtggcg 1875541 gggttgtttg accccgatcc cgacgcggta ggcaagtcgt atacccggtg cggcgggttc 1875601 ttgacggatg tcgccggttt tgacgccgag tttttcggga tcgcacccag cgaggcgctt 1875661 gcgatggatc cccagcagcg gttgctgttg gaagtgtcgt gggaagcgtt ggagcgggcg 1875721 ggcatcgacc caatcacgtt gcggggttcg cagacgggcg tgttcgccgg ggtgttccac 1875781 ggctcgtatg ggggccaagg ccgggtgccg ggtgacctgg agcgctacgg gctgcgtggc 1875841 tcgacgctga gcgtggcctc cgggcgggtg gcgtatgtgt tgggcctgca gggcccggcg 1875901 gtgtcggtgg ataccgcgtg ttcgtcgtcg ttggtggcac tgcatttggc ggtgcagtca 1875961 ctgcgcctcg gcgaatgcga cctggcgctg gtcggtgggg tcaccgtgat ggccaccccg 1876021 gcgatgttca tcgagttcag caggcagcgg gcgctgtccg ccgatggtcg ttgtaaggcc 1876081 tatgcgggtg ccgccgatgg gaccgcgttt gccgagggcg ccggggtgct cgtgctggcg 1876141 cggttggctg acgcgcgccg gttggggcat ccggtgctgg cgctggtgcg cggatcggcg 1876201 gtcaatcagg acggcgcctc caacgggctg gccacgccga atgggccggc gcagcaacgg 1876261 gtgatcactg cggcgctggc cagtgcgcgg ttaggtgtcg ccgacgtgga tgtggtcgag 1876321 gggcacggga cgggcaccac gttgggggat cccattgagg cgcaggcgat tttggcgacg 1876381 tatggacagc ggccggccga tcggccgttg tggctggggt cgatcaaatc gaacatcggt 1876441 catacgtcgg cggctgcggg ggtcgccggg gtgatcaaga tggtgcaggc gatgcgccac 1876501 ggcgtgctgc ccaagacgtt gcacgtggat gtgccgacgc cgcatgtgga ttggtcggcg 1876561 ggggcggtgt cgttgttgac cgagccgcgg ccgtggcacg tgccgggccg gccgcggcgg 1876621 gccggtgtgt cgtcgttcgg gatcagcggc accaacgcac atgtgattct ggaagaggca 1876681 ccggcagtgg aaccggttgg cgcggcccat ggcaacgacc cggtggcggt gccgtgggtg 1876741 ctgtcggcga ggtcggcgca agcgttgacc aaccaggcgc gacggctgtt ggcctgggtg 1876801 ggcgccgatg agaacgtgcg cccgctcgat gtggggtggt cgctggtcaa cacccggtcg 1876861 ctgtttgatc atcgggccgt ggtcgtgggc gccgaccgca ctcagctgat ggaagggctg 1876921 acgggtctgg cggccggcgt gcccggcgcc gacgtggtgg cgggccgcgc ccagacggtg 1876981 ggcaagacgg cattcgtgtt cccgggccag ggcgcgcagt ggctgggcat gggagcccag 1877041 ttatgtgcta ccgcaccggt gttcgccgaa catatccatc gctgcgaacg ggcgctgcgt 1877101 gagcacgtgg agtggtcgct gctcgacgtg ctgcgcgggg cacccggcgc accggggctg 1877161 gatcgggtgg atgtggtgca gccggcgttg tgggcggtga tggtgtcgct ggccgaattg 1877221 tggcggtcgg tgggtgtggt tcccgacgcg gtcatcgggc attcgcaggg ggagatcgcg 1877281 gcggcatatg tggcgggcgc cctgtcgctt cgggacgcgg ctgcggtggt ggcactgcgc 1877341 agccggttgc tggtgcggtt gggcggtgcc ggcggcatgg tctcgttggc ctgtggccag 1877401 ccgcaggccg agaagttggc gtcccaatgg ggagaccgac tgaatatcgc tgcagtcaat 1877461 ggtgtctcgt cggtcgtgct ggccggcgag acggatgccg tgacggagct gatgcagcga 1877521 tgtgaggccg aaggcattcg tgcccgcagg atcgacgtcg actacgcgtc acactcggcg 1877581 caggtggacg cgatccggga ggagctcatc gcggcgctgc gaggtatcga accccgtact 1877641 tccacggtgg cgttcttctc cactgtcacc ggcgaactca tggataccgc cggtgtgaac 1877701 gccgagtact ggtaccgaag catccgccag ccggtgcagt tcgaacgcgc cgtccgcaac 1877761 gccttcgacg gcggataccg ggtgttcgtc gaatccagcc cccatccggt cctgatcgcc 1877821 ggcatcgaag agacgttggt cgactgtgat cgcggcgcta cgggtgaacc gattgtcatt 1877881 ccgacgctgg gtcgcgatga cggcggggtg ggccggtttt ggctgtcggc ggggcaggcc 1877941 cacgttgcgg gcgtgggtgt tgactggcgt gccgcgtttg ccgacctggg aggccgccgg 1878001 gtggagttgc cgacgtacgc gtttgcgcgc cagcggttct ggctagacgg cctaggtgct 1878061 gttggcggcg atctgggtgg tgtcggcttg gtgggcgccg agcatggatt gttggctgca 1878121 gtggtgcaac ggcccgactc gggtggggtg gtgttgacgg gccggatatc ggtggtcgct 1878181 gcgccgtggc tggccgatca tgcggtgggc ccggtggtgc tgttcccggg cacggggttt 1878241 gttgagttgg ccttgcgggc cggtgacgag gtgggttgtt cggtgctgca ggagttgacg 1878301 ttgcaggcac cgttggtgct gccggcagat ggggtgcggg tccaggtggt ggtgggcggc 1878361 gtcgagcagt cgggtactcg gaatgtgtgg gtgtattcgg ctgccggcca ggcggattcg 1878421 agtccgggat ggacgttgca cgcgcagggc gtgttggggg ttggctcggt gcagccggcc 1878481 gcggagctgt cggtgtggcc gccggttggg gcacgggcga tggacgtcgc cgacgggtat 1878541 caggtgttgg cggcgcgggg gtatgggtat gggccggcgt ttcggggttt gcaggccttg 1878601 tggcggcggg gggccgaggt gttcgccgac gtcactctcc ctgagggtgt gccgatacgg 1878661 gggtttggga ttcatccggc ggtgttggat gcggcgttgc atgcgtgggg aattgtcgag 1878721 ggtgagcagc agacgatgtt gccgttctcg tggcaggggg tgtgtttgca cgcaagcggg 1878781 gctgcgcggg tccgtgtgcg actggcgccg gtgggccggg gggcggtgtc ggtggagttg 1878841 gccgatccgc aggggttgcc ggtgttgtcg gtgcggcagt tgatggttcg tccggtctca 1878901 gcggccgcgt tgtcgaggtc gaccgccggc gaccggggat tgctggagat gatctggaca 1878961 ccggtgccgt tggagggcgg cgacattggc gacgacgccg tggtgtggga gctgccgcct 1879021 cacgccggcg cgcaggccgg cggggatgtg ctggcagcgg tgtaccgggg tgtgcacgag 1879081 gtgttggagg tgttgcagtc gtggttggct agcgatgcga ccggtctggg tgtggtggtg 1879141 acgcgtgggg cggtgggtcc ggttgatgac gatgtcaccg atttggcggg tgctgcggtg 1879201 tgggggttgg tgcgctctgc ccaggctgaa catccgggcc gggtggtgtt ggtggatacc 1879261 gatgggtcgg tcgctgtcga ggatgcggtt ggtttcggcg cacgctcggg tgagccgcag 1879321 ctggtggttc gtcgaggccg ggtatatgcg gcacggttgg ccccggtagc ggccgggttg 1879381 actttgcctt cggcgtcggc tgggggctgg cggttggttg ccggtggtgg ggggactttg 1879441 gcggatgtgg tggtggcgcc cgttgctccg gtggagctgg cgacggggca ggtgcgggtg 1879501 gccgtgggtg cggtgggggt caatttccgg gatgtgttgg tggcgttggg gatgtatccc 1879561 ggcggcgggg aactgggtgt cgacggggca ggggtggtcg ttgaagtcgg cccgggggta 1879621 accggtttgg ccgttggtga ccgggtgatg gggttattgg ggctggtggg ttcggaggcg 1879681 gtggtggatg cgcggttggt aaccatggtg ccggcgggct ggtcgttggt ggaggcagcg 1879741 gccgtgccgg tggcgtttct gacggcgttt tacgggctgt cggtgttggc ggaggtcgcg 1879801 gcggggcaga aggtgttggt gcatgccggc accggcgggg ttggtatggc agcggtgtcg 1879861 ttggcgcggt attggggtgc agaggttttc gtcacggcga gtcgcgccaa gtgggataca 1879921 ttgcgggcga tgggttttga cgatatccat atctccgact cgcgatcgtt ggagttcgag 1879981 gaggcgtttc tgcgggccac cgagggcagc ggtgtggacg tagtgctgaa ctcgctcgcc 1880041 ggtgagttca ccgatgcctc gctgcggcta ctgcccagcg gtggccgctt tatcgagctg 1880101 ggtaaaaccg atattcgcga cgggcagacg gtggccgagc ggcatcgggg ggtgcggtat 1880161 cgggcgttcg atttggtcga agccggccca gaccgcattg cggcgatgct ttccgaggta 1880221 gtggggttgc tagcggccgg agtgttggcg cggttgccgg tcaagacttt tgatgcgcga 1880281 tgcgccccgg cggcctaccg gtttgtcagt caggcccgtc atatcggcaa ggtcgtgttg 1880341 accatccccg atggtccggg tgggcagtcc gggttggcgg ggggcaccgt ggtggtcact 1880401 ggggggaccg gcatggccgg ttcggcggtg gctacccatt tggtccggcg acatggggtg 1880461 gccaatctgg ttctggtcag ccgaagcggt gagcaggccg acagggcggc agaagtcgcg 1880521 gccctgttgc gcgagggcgg ggcccaggtg gcggtggtct cctgtgatgt ggctgatcgt 1880581 gatgcgctgg cggcattgtt ggcgggtctg gatccgcgct atccgcttaa aggggtgttt 1880641 catgccgctg gggtgttgga cgatgccgtg atcacgggct tgacaccgga tcgggtggat 1880701 acggtgttgc gggccaaggt cgatggggcc tggaatctgc acgagctaac cgaggacatg 1880761 gatttgtcgg cgtttgtggt gttttcgtcg atggccggga ttgtgggcac accggctcag 1880821 gggaattatg ctgcggcgaa tgcgtttttg gacgggttgg tggcctatcg gcgctcgcgt 1880881 gggctggccg gattgtcggt ggcgtgggga ctgtgggagc aggcctcggc gatgacccgg 1880941 cacctcggcg agcgggatcg cgccaggatg acgcaggccg ggctcgctcc gctaaccacc 1881001 gagcaggcgc tagggttcct ggacactgcg ctgcaggccg atcgcgcggt ggtagtggcg 1881061 gcccggctgg atcgtgccgc gctggccggc gctggtgctg cgctaccggc attattcagc 1881121 cagttggctg ccggtccgac ccggcggagg atcgacgccg ccgatacggc ggtgtcgatg 1881181 tcgggcttag tcagccggct gcatgcgctc acgcccgagc ggcggcagcg cgaactcacc 1881241 gatttggtga tcagcaatgc cgcggcggtg ttgggtcgtt ccagcagtgt cgatatcaac 1881301 gctcacaaag cattccaaga tctcgggttc gattccttga ccgccgtgga gctgcgcaac 1881361 cgactcaaga ccgccaccgg gctcacgttg tcgcccacgc tgatcttcga ctaccccacg 1881421 ccggccacgc tggccgaaca cctcgacagc cggctagtca ccgccagcgg tagcgatcaa 1881481 caaagcctgt cagaccgtgt tgacgacatc acccgcgagc tagttgtgct gcttgaccaa 1881541 cccgacttga gcgccaacgt caaagcgcac ctgcgcaccc gcctgcaaac catgttgacc 1881601 agcctgacca ctgaagacga cgacatcgcc gccgcgaccg aaagccagct tttcgccatc 1881661 ctcgacgagg aactcggctc ctaacccccc gcaaggaaca ccaatgtcgg gaaccaccac 1881721 gcatgttgac tacctgaagc gtctcacggc agatctgcgg cgcacccgca gacgcctgtc 1881781 cgacttggaa gccaagttgt ccgagccggt tgcggtggtc ggaatgggat gccgttatcc 1881841 aggtggggtg gattcgccgg agacgttgtg ggagctggtg gcccagggcc gtgatgcggt 1881901 atcggatttt ccggcggatc gcgggtggga tgtggacggg ttgtttgatc ctgacccgga 1881961 tgcatgcggg aagatgtata cccgccgcgg gacgtttctg gagcatgcgg gtgacttcga 1882021 cgccggattc tttggaatcg gtcctagcga ggcgctggcg atggacccgc aacagcgcct 1882081 gctattggaa gtgtcgtggg aagcgttgga gcgtacggga attgacccga ccaagttgcg 1882141 gggttcggca acgggtgtgt tcgccggtgt tatccatgcc ggctatgggg gccagctatc 1882201 cggcgagctg gaaggctatg ggttaacggg ttcgacgctg agtgtggcct ccgggcgggt 1882261 ggcgtatgtg ctggggttgg agggtccggc ggtgtcggtg gacacggcgt gctcgtcgtc 1882321 gttggtggcg ctgcatttgg cggtgcagtc gctgcggtcg ggggaatgcg atttggcgct 1882381 ggccggtggg gtgacggtga tggccacccc cgccgcattc gtcgagttca gccggcagcg 1882441 ggcgctggcg cgcgacggtc ggtgcaaggt atacgccggt gccgccgacg ggaccgcgtg 1882501 gtcagaaggc gccggggtgc tggtggtgga gcggctggtg gatgcacggc ggttggggca 1882561 tccggtgctg gccctggtgc gcggatcggc ggtcaatcag gacggcgcct ccaacggttt 1882621 gacggcaccc aatgggccat cccagcagcg ggtgattcgg gcggcgttgg ccagtgcgcg 1882681 actgcgcgcg gttgaggtgg atgtggtcga ggggcacggg accgggacca tgctggggga 1882741 tccgattgag gcgcaggcgc ttttggcgac ctacggtcag gaccgcgttg agcccctgtg 1882801 gttggggtcg atcaaatcga acatcggtca tacatcggcg gcggcggggg tggccggggt 1882861 gatcaagatg gtgcaggcga tgcggcatgg ggtgatgccc aagacattgc atgtggatgt 1882921 tcctacgccg catgtggatt ggtcggtggg ggcggtgtcg ttgttgactc aaccgcgggc 1882981 gtggtcggtt cacggccggc cgcggcgggc cggggtgtcg tcgttcggga tcagcggcac 1883041 caatgcgcat gtgattcttg agcaggcacc ggtagttgaa agtgttgtgc cagaagttgc 1883101 atccccaaca gcggcgtccg ccgtgccgtg ggtgctgtcg gcccggtcgg agcaggcgtt 1883161 ggccggtcag gcgcagcggc tgctggcttt cgtcgcggcc aacccggatt tggatccgat 1883221 cgatgtgggg tggtcgttgg tcaagacgcg ggcgatgttc gagcatcggg cggtggtcgt 1883281 gggtgctgat cgcggggccc tgctggcggg gttggcggcg ttggccgctg gtgagtcggg 1883341 tgcgggcgtg gcagtgggtc gagcgcggtc ggtggggaag acggtgttcg tgtttcccgg 1883401 gcaaggggcc caatgggtag gcatgggagc gcagttatat gccgaattac ccctgttcgc 1883461 cctggctttt gacgcggtgg ccgaagagct ggatcggcac ctgcggctgc cgctgcgaaa 1883521 cgtgctctgg gaaggtgacg aggcgctgtt gactagcacc gagttcgccc agccggcgtt 1883581 attcgcaatc gaagtggcgt tggcaacgtt gttgcagcac tggggtatca gcccggattt 1883641 cctgatcgga cattcggtgg gcgagatcgc ggcagcacat ttggccgggg tgttgtcgtt 1883701 gaccgatgcg gcgggtttgg tggctgcccg cggcaggttg atggcggagt tgcccgccgg 1883761 tggggtgatg gtggtggtgg ccgccagcga agaagaagtg ctgccagtgc tggtcgacgg 1883821 ggcgaatctc gcggcggtca acgcgccgca ctcggtggtg gtttcagggt gcgaggcagc 1883881 ggtcagcgat attgccgatc actttgcccg caggggccgc cgggtgcatc ggctagcggt 1883941 atcacatgcg tttcattcgt tgctgatgga accgatgctt gccgagttca cgcggatcgc 1884001 tgccggtatt tcggtgtcga aaccgcggat tccgttggtg tccaatgtga ccgggcagat 1884061 ggccggcgca ggctacggcg atggacagta ctgggtggag catgcgcggc gccccgtgcg 1884121 atttgccgag ggcgtccagt tgctgaatgc ggttggggcc acaaggtttg ttgaggtggg 1884181 tcccggcggt ggcctgacag cattggtcga gcagtcgctg cctttaggcg aggcgctatc 1884241 ggtggcgatg atgcgtagag agcaccccga agtgtcgtcg gtgctcggcg ccgtggcgac 1884301 attgttcact gcgggtgccc aaatggattg gccggcggtg tttggcagtc cgggtcgacg 1884361 gatcgaattg ccgacctatg cgtttcagcg gcagcggtat tggttgccgc ctacgtcggc 1884421 gggttcggca gacatcagcg gtgttggtct gctggcagcc cggcatggtt tgttgggtgc 1884481 ggttgtggag caaccggatt cggacgtggt ggtactgacc ggccggctat cggtggggga 1884541 gcagcggtgg ttggccgatc acgtgatcgc tggagtggtg ttgctcgccg gtgcggcttt 1884601 cgtggaactg gcgctgcgag ccgccgacca ggtggattgt ggggtggtcg aggagctgac 1884661 ggtggtgact ccgttggttt tgccgacggt gggcggggtg cagctacagg tggtggtggg 1884721 tgtcggtgag atgggtcagc ggccagtgtc gatatattca cgcaacgctg agtcggattc 1884781 cgggtgggtg ttgcatgccc ggggcgtatt gggggcaaag gcggttgccc cggcagcgga 1884841 tttgtcggtg tggccgccgc tgggtgctgc cccggttgat gtcgatggcg cctatcagcg 1884901 attcgccgaa ctgggctatg aatatggccg ggcgtttcag ggtctgacgg ccatgtggcg 1884961 gcgggaatcg gagctcttcg ccgatgttgc cgtccccgac gatgtcgatg tgacgttgag 1885021 tgggttcgga attcacccac tggtgctgga tgcggccttg catgcaatgg gcatggtggg 1885081 cgagcaggca gctaccatgc tgcccttctc ctggcaaggg gtctccctgc atgccgcggg 1885141 tgcgtcccgg gttcgggcgc ggatcgcgcc ggccggtgat ggcacggtgt cggtggagtt 1885201 ggccgatcag gcggggttac cggtgttgtc ggtacaggca ttggtcatgc gttcggtgtc 1885261 gtctcagctg ttgtcggcgg ccgtcgccgc tgccgatgcc gcaggtcgcg ggttgttgga 1885321 agtggcgtgg ttgccagtgg aattggcgca caacgacatc agcgccgacc tcgtggtctg 1885381 ggagttggag tctttccagg acggtgtggg tccggtgtat tcggctacgc atcgggtgtt 1885441 ggtggcattg cagtcctggc tggcccagga gcgggccggc cgactggtgg tgctgaccca 1885501 agggtcggtc ggccaggatg ccacgaactt ggccggcgcc gcggtgtggg ggttggtgcg 1885561 gtcggctcaa gccgaacatc cgggtcgggt gatgttggtc gattcggacg gctcgatgga 1885621 tgttggagat gtcattggct gtggtgaaga gcaattgatg atccggaacg gcacagccta 1885681 tgccgcccgg ctggcacagc ttcgaccaca gccgatcctg cagttgcccg ataccaactc 1885741 gggctggcgg ttggtcgccg gcggcgcggg cgcccttgag gatttgacgt tggcatcatg 1885801 ccctgcaaag gaattggcac ctggacaggt tcgaatagag gtgcgggctt tgggtgtcaa 1885861 tttccgggat gtgttggtgg cgttgggaat atatcccggt gccgcggagt tgggggccga 1885921 aggggcaggg gtggtcaccg aagtcggtcc aggcgtgacc ggtttagcag ttggtgatcc 1885981 ggtgatgggt ctgttggggg tggcggggtc ggaagcggtg gtcgatgcgc ggctggtggt 1886041 caagctgccg aaccggtggc cgctgaccga tgctgcgggt gtgccggtgg tgtttctgac 1886101 ggcctactac gcgttacgcg tgctggcgca ggtgcagccg ggcgagtcgg tgctggtaca 1886161 cgccgctgcg ggcggggtgg gtatggcggc agtgcaactg gctcggctgt ggggattgga 1886221 ggttttcgct actgccagtc gcggcaagtg ggacacgttg cacacaatgg gatgtgacaa 1886281 cacgcatgtt gccgattcac gcacactggc attcgaggag acgttttggc tgaccaccga 1886341 gggtcgcggc gtggatgtgg tgctcaactc gctggccggt gagttcaccg acgcatcgtt 1886401 gcggttactg ccgcgaggcg gtcgcttcat cgagatgggc aaaaccgagt tcgggacgcc 1886461 caggtcgttg cccaggacca tcctggggtg gcctaccggg ctttcgactt gatggaggcc 1886521 ggaccgcagc ggattgcgca gatgctggcc gagttagtcg agttgttcaa aactgaagcg 1886581 ctgcatcggc ttccagtcaa gtcatgggat gtgcggcacg ctcgggaggc gtatcggttc 1886641 ttgagccagg cgcgccatgt cggcaaagtg gtgctgacca tgccggacgc gtgggccgcg 1886701 ggcacggtgc tgatcaccgg tggcactggg atggcaggtt ctgcggtggc gcgtcatctg 1886761 gtgagtcgat acggggtgcg gcaggtggtg ttggccagtc gtgctggtga gcacacggag 1886821 agcgtcgcag cattggtgga cgagctcggc tcggccggcg cccgagtgca ggtggtgtct 1886881 tgcgatgtgg ccgatcgtga tgcggtggcg ggtttggtgg caagccaacc agatctgact 1886941 gcagtgtttc atgcggctgg ggttcttgac gatgcggtaa tcaccggatt gacgccggag 1887001 cgggtggata aggtattgcg ggccaaggtc gatggggcct ggaatttgca tgagctcacc 1887061 cggcacctgg atgtgtcagc gtttgtgttg ttttcgtcga tggccgggat tgtgggtgcg 1887121 ccgggccagg ccaattatgc tgcagcgaac gcgtttttgg acgggttggc ggcctatcgg 1887181 cgatcacgtg gactggccgc gttgtcggtg gcgtggggat tgtgggagca ggcttcggcg 1887241 atgaccgagc atttaggcga gcgggatcgg gtccggatga gtcgggttgg actggcgccg 1887301 ttgcctacca accaggcgat gggattcctg gatgccgcgt tgctggcgga tcggcccgtg 1887361 gtggtggctg ctcggctgga tcgtgccgcg ctggccggtg ccgagctgcc ggcactattt 1887421 agccagttgg ttgccggtcc gatccgacgg atcatcgacg gcgccgatga ggtgtcgggg 1887481 tcgggattgg cgtcgcggct gcacgggctg actcccgagc agcggcaccg cgaactcacc 1887541 gagttagtat gtagcaacgc cgcgatcgtg ttggggcatt ccggcactga gatcgacgcg 1887601 cacaaggcat tccaggatct cgggtttgat tcgctgacag cggtggagct gcgcaaccgg 1887661 ctcaagactg cgaccgggtt gaccttgcca ccgaccttga tctttgacta ccccacggcc 1887721 gccgagttgg ccgaacacct cgacatccag ctggcgaacg cccctgccgt cacggtcgac 1887781 caacccaacc cgtcgactcg tttcaacgag gtcacccgcg aactacaagc attgctcgac 1887841 caacccaact ggaaccccga cgacaaaacg cgcctgatca agcgattgca agcgattttg 1887901 accgattgca ccgctccacc ggccagctcc ggcccgtcta ccacccatga cgacgaggac 1887961 atcaccaccg ccactgaaag ccagcttttt gccatcctcg acgacgaact tggaccttag 1888021 cgcacgtgca accgacaggc atcgcaatca tcgggctggc atgcaggttt cccaccgtcg 1888081 tcagccccgg cgacctctgg gacctgttgc gcgacgggcg agaggctgct ggatccattg 1888141 acaacgtcgc cgatttcgac gccgactttt tcaacctatc cccccgcgag gcgagcgcga 1888201 tggaccccag gcaacgactg gcgctcgaac tcacctggga actgctcgaa gacgctttcg 1888261 tggtgccgga aacgctgcgc ggacaaccga tcgcggtcta cctcggagcg atgaacgacg 1888321 actacgcagt actgacgctc gcggcggacc gtgttgacca tcacgcgttc gctggcacta 1888381 gtcgggcaat catcgcaaac cgcgtgtcgt ttgctttcgg gctgcgtgga ccaagcgtga 1888441 cgatcgactc cggtcagtcg tcatccctgg tagcggtgca tctggcatgc gaaagcgtgc 1888501 gaacaggcga agcgccgctg gcgattgccg gtggtgttca cctcaacttg gcacgcgaaa 1888561 cagccatgct ggaacaagaa ttcggcgcgg tatcgccgtc cggccatacc tacgcattcg 1888621 atgaacgtgc cgacggctac gtaccaggcg acggcggtgg cctcgttctg ctgaagccgg 1888681 tgcaagctgc cctggacgac ggagatcgaa tccacgcgat catccgcggc agcgcggtcg 1888741 gcaacgccgg gcacagcgct accgggctga ccgtgccgtc ggtcgccggc caggtggacg 1888801 tcatcaggcg ggcgatgtcc ggcgcggggg tggattgcca tcaggttcac tacgtcgagg 1888861 cacacgggac cggcaccaag atcggcgacc cgatcgaggc gcgggcgctg ggtgagatct 1888921 tcgcggcgcg gcaacgtcgc ccggtgagtg tggggtcggt caagaccaat attggtcata 1888981 ccgggggagc cgctggaatc gccggattac tcaaggcggt gttagcgatt gaaaatgccg 1889041 tgattccacc cagcctcaac tacgtcggtg ccgcaattga tttggatagc cttgggcttc 1889101 gggtcgacac cgcgttgacg ccgtggccgg tggcggatga gccgcgacgg gctggggtgt 1889161 cgtcgtttgg catgggtggg acgaacgcgc atgtgatcct ggaacagggt ccgacgcagt 1889221 cgccagagat agtggaatct gttgccgcag cgggtagtaa cgctccggtg gcggtgccgt 1889281 gggtgttggc tgcgcggtcg ccgcaggcgc taaccaacca ggcggggcgg ttgttggcgc 1889341 acctgactgc cgacgacggc ctgaccgcgc tcgatgtggg gtggtcgttg gtgagtaccc 1889401 ggtcggtgtt cgaccatcgc gcggtggtgg tgggcgctga tcgggggcgt ctgatggcgg 1889461 ggttggcggg gttggccgcc ggtgagccgg gcgcgggtgt ggtggtgggt cgtgcgcggt 1889521 cggtgggcaa gacggtgttt gtgtttcccg gacaggggtc gcagtggctg gggatgggcc 1889581 ggcagttgta cggccggtac tcggtgtttg cccgggcttt tgacgaggtc gttgcggtgt 1889641 tggatgggca gctgcggctg tctgtgcggc aggtgatgtg gggcgccgat gccgggctat 1889701 tggaaagcac agagtttgct cagccggcgt tgtttgtcgt ccaggtggca ttggccgcgt 1889761 tgttgcaaga ctggggtgtg ctgcccgatc ttgtgatggg tcattcggtg ggtgagattg 1889821 ctgcggcgta tgtggccggg gcgttgtcgc tggtggatgc cgcgcgggtg gtggcggcgc 1889881 gcggccggtt gatgcaggcg ttgcccgctg gtggggtcat ggtggccgta gcggccagcg 1889941 aagacgaagt ggcaccgttg ctcaccgagg gcgtgtgcat cgctgcggtg aacgcgccgg 1890001 aatcggtggt gatttcgggt gagcaggctg ccgtgggtgt ggtagtggat cgattggtgg 1890061 ggttgggtcg gcgggtgcgg cggttggcag tgtcgcatgc gtttcattcg gtgttgatgg 1890121 accccatggt cgaggagttc tcgaaggtgc tggctgatgt ctgcgtgcgg gcgccgcgga 1890181 ttgggttggt ctcgaatgtg acaggtcagc tggccggtgc tgggtatggg tcgccggcgt 1890241 attgggttga acatgtgcgc aagccggtgc ggttcttcga cggtgtggga ttggctgaat 1890301 ccctcggggc cagggtgttt gtggaagtgg gtcccggtgc cgggttggag gcgtcggtgg 1890361 cgctgctagc cagggatcgg cctgaggtgg agtcggtgct ggccggggtg gggcgactgt 1890421 tcgccgaagg ggtggcggtt gattggtctt cggtctttgc gggtttgggc ggccggcggg 1890481 tggagttgcc gacgtatgga tttgcccggc agcggttttg gttaggtgac aatggcgagt 1890541 tgtcggtgga ccagacgggc aaagacgccg gcgcaattgc gcgattgcaa agcctagccc 1890601 caccggaact gcagcgccag ctggtagagt tggtgtgctt ccatgcagca atcgttttgg 1890661 gtcgcaagag cagccatgac atcgaccccg aatgtgcttt ccaagacttg ggatttgatt 1890721 caatgagcgg ggtcgaacta cgcaatcgtc tccagatggc tatcggtttg cccggcttgt 1890781 cgctgccgcg cactttgatc ttcgactatc ccactgcgag tgccctcgcc gaatgccttg 1890841 gccagctctt aggcggccaa cacgaatcat ccgacgacga gagtatttgg cagctgctga 1890901 aaaacattcc tatccaccag cttcgacgca ccggcttgct ggacaaattg ctgctgctgg 1890961 ccggccagcc cgaggagtcc ttggctggtc ggaccgtcag cgacgaggtt atcgactcgt 1891021 taagccccga agctcttatc gggctggcgc tcgatgagga cgagaacgat attcgatgac 1891081 gaaatccgtc ctggcaggct caaattatgc tatcggcata ggtgcaaata cgacaggcgt 1891141 tgaatagcga tgtttttgcg agatcgcgta atgtggctta aactttgggc ttcgagggtg 1891201 gcaagtaact taagtgggca ggggcatgag cgtcatcgcg ggtgtgttcg gtgcgttgcc 1891261 gccgcatcgc tatagccaaa gtgagatcac tgattcgttt gtcgagtttc ccggccttaa 1891321 ggaacacgag gagatcattc ggcgtttgca tgccgccgcc aaggtcaacg gtcgacacct 1891381 ggtgctgccg ctgcagcaat acccgtcgct gaccgacttc ggcgacgcca acgagatctt 1891441 tatcgagaag gctgtcgacc ttggcgtcga ggccttgctg ggcgcgctcg atgatgccaa 1891501 cctgcgcccc agcgacatcg acatgatcgc caccgcaacc gtcaccggcg tcgcggtgcc 1891561 gtctttggat gcccggatcg ccgggcggct tggtctgcgc cccgacgtgc ggcggatgcc 1891621 gttgttcggt ctgggctgcg tggcaggggc ggcgggcgtg gcccgcctgc gcgactacct 1891681 gcgtggcgcg cccgacgacg tcgcggttct ggtctcggtt gagctttgct cgctgacgta 1891741 tcccgcggtc aaaccaaccg tgtcgagtct ggtcgggacc gcactgttcg gcgacggagc 1891801 agccgcggtg gtcgccgtcg gcgaccggcg cgccgagcag gttcgcgctg gcggaccgga 1891861 catcttggac tcgcgcagca gcctgtaccc cgactcgctg cacatcatgg gttgggatgt 1891921 cggttcccat ggcctgcggc tgcggctttc cccggacctg acgaacctga tcgaacggta 1891981 cctagccaat gacgtcacca cgtttcttga tgcccatcgg ctgaccaaag acgacatcgg 1892041 cgcctgggtg agccatcccg gtggtcccaa ggtcatcgac gccgtcgcca cgagcctcgc 1892101 gctgcctccc gaggcgctcg agctgacctg gcgctcgctg ggcgagatcg gcaacctttc 1892161 gtcggcctcg atactgcata ttttgcgcga caccatcgaa aagcggccac ccagcggaag 1892221 cgccgggctg atgctggcga tgggtcctgg tttctgcacg gaactcgtct tactgcgctg 1892281 gcgctgactt cctgatttca acggtcaatc ccggccaggg gcgcagcgcg gcaaagttgg 1892341 ccgcccgaat gcggtgagtc cgctgagcgg gcaactgcag catggccctg gcgaccagcc 1892401 gagcgagaat cacggtcatc tcggtggtgg ccatgacggc tccgatgcat cggtgcagcc 1892461 cgccgctgaa cgggatgaat tcatgtggcg cgggtttgcg gtagtccgct gcgttgggat 1892521 cccagcgcag cggacggaat tcggttggct cgggccagat ttctgggagc cggtgggtga 1892581 cgtaggcgct gaagatcaac aggcgtcccg cccggatgcg atgcccgtcg aaccagaggt 1892641 cacgcagcac cctgcgggcc gagatcacgc cgggcgagta caggcgcagc gtctcgtgaa 1892701 caactccgtt gaggtaggtg agcgcgctca ggtcatcggc ggcggggact ctgccaccca 1892761 gcacgcgcgc gacctcgctg gccgcactct cccaggtgcc gggcacggtc agcagtgcgt 1892821 agatcgccca ggccagcgcg ccgctggtgg tctcgtaccc cgcggtgatc agcgaaacga 1892881 tcgaatcgcg aatctcgttg tcgcttaacg tagtaccctc ttcagagcag ccactaatca 1892941 acgtcgtcaa catgtggtcg tcgggtctgg gtgccgtgcg cgcgtcggcg atctgagcgt 1893001 cgatgaggtc gtcgatgcgt ttgcgggctg ccatggcccg tcgccacccg ggcgagttga 1893061 cccgctgctg cagccgcatc acctgaggcg gccgtcgggt taggtccagc aggggctgca 1893121 gttgctcacc gagaaaatcg gaatgtacgg cgaggcgctg gccgaacaga ctctcggcgg 1893181 tactgcgccg gaccgccgag cgcaactctt ggtagatgtc cagccgctgt ccgggctgcc 1893241 aaccgtcgat caccgtgtcg atattggaca ccatcgttgc cacatagcgc tggacgtgat 1893301 ggtgccgcag ccccggtgcc accacactgc ggcggcgccg gtggtccgcg ccgtcgctga 1893361 cgatcagcgc ggtcggcccg tcgacgggaa ccaggctctc aaacgtttgg ctccagctga 1893421 acgcgtcggc attggcgaac acgaatctgt tggcctctgc tcccaggaga taggtgtagc 1893481 catgcccacc gactccggcg ttgatcagcg gaccgcgcca tcgatacagc gccagcagcg 1893541 cttcgccaag cgggtagcgc accgtccgat acgtcctcat tcgagcatct ccgaaagctc 1893601 cagccagcga ttctccatcg ccgcgacgtg gtcttgcagg acacgtagtt gctgggtcag 1893661 ccgggtgatg ccgacgtggt cggactggtc atgctcggcc agttcggtat gtttggcggc 1893721 cacccggtcg gccaggcggg cgagttgacg gtcgactgcg gccaactctt tttcggtggc 1893781 acgtcgctgt gcgcccgaca tcgccggcgg cgctggccgc tcggccggtg ctggggcgct 1893841 aacgcgggca gccagctgca ggtattcgtc gatgccgccg ggcaggtgcc gcaaccggtc 1893901 atcgagaatc gcgtactgct ggtcagtgac ccgctcgagc agataccggt cgtgtgagac 1893961 gacgatcaac gtacccgccc acgagtcaag caggtcttcg gtcgccgtca gcatctcggt 1894021 gtccacgtcg ttggtgggct cgtcgaggag cagcacgttc ggctcggaca acagcgtcag 1894081 catgagctgc aaccgccgac gctgaccacc ggagaggtcg tcgactcgcg cggacagctg 1894141 gtcccggcgg aacccgagac gctctagcag ctgggtcggg gtaacctcgc ggccttcgac 1894201 ctgatagccg ccacgcagcc tgcctagcac atcggcgatc cggtcgtcgg caaacggtgc 1894261 cagatcgtcc ccgtgctgat cgagcactgc cagccggacg gcttgacacg tccgacaccg 1894321 ggctggacgg tgccggcgat caagcccagc agggtcgact tgccggcgcc gttagccccg 1894381 acgatgccga tacgttcacc cgggccgatc cgccattcga tatcgcgcaa caccgggcgg 1894441 cccccagaag gctggtacga gaccgacacg ccgagcaggt cgacgacgtc ctttccgagc 1894501 cgagcggccg ccagcttggc cagctccacg gtgttgcgcg gtggcggcac gtctgcgatc 1894561 agttggttgg cggcctcgat ccggaacttg ggcttgcagg tccgcgccgg tgcgccgcgg 1894621 cgcaaccaag ccagctcctt gcgcagcagg ttctgccgct tggcttcggc cgcggcggtc 1894681 agccggtccc gctcgacgcg ctgcagcacg tacgccgcgt agccgccttc gaaaggttcg 1894741 acgattccgt cgtgcacttc ccatgttgtg gtggcgacct cgtcgaggaa ccagcggtcg 1894801 tgggtgacca cgagtaggcc gccggtattg cgggcccagc gccgccgtag gtggtcggcg 1894861 agccaggtga tgccttggat gtcgaggtgg ttggtgggct cgtcgagagc gatcacgtcc 1894921 cattcgccga ccagcaggct ggccagttgc acccgtcggc gctggccacc gctgagggtg 1894981 ctgaccgggg tgtcccaggc gatgtcggat accaggccgg cgaccacgtc ccggatacgc 1895041 gggttgcccg cccattggtg ttcgggttgg tcaccgatga gcgtccagcc gacggtgcgg 1895101 ttggggtcga gggtgtctgt ttggctgagc gcgttcaccc gcaatccgct acgccgggtg 1895161 acccgaccgg agtccggccg cagttgaccg gtgagcaggc ccagcagact ggatttgccg 1895221 tcgccgtttc gcccgacgat gccgatgcgc gccccgtcgt tgaccccgag cgtgactgcc 1895281 tcgaacacca cctgagtcgg ataggccagg tgcacggcct cggctccgag taggtgcgcc 1895341 atggggccga ccctagcgtg gcgacgatgc gggctgggat gggccgctga ggagccgcgc 1895401 ggtcgagctc tagcgtggcg acgatgcggg ctgggatggg ccgctgagga gccgcgcggt 1895461 cgagctctag cgtggcgacg atgcgggctg ggatgggccg ctgaggagcc gcgcggtcga 1895521 gctctagcgt ggcgacgatg cgggctggga tgggccgctg aggagccgcg cggtcgagct 1895581 ctagcgtggc ggcccagccg cagtgcagtt gattggcggc ggggcttgcc gggtggggtg 1895641 gaggtcgttg taggcgtcga ttgggctggg tgtgattgag gtgtttgaac atttcgtggg 1895701 tttgcacccg gttgaacggg gtcaatgtca cggcgaccgg gatactcgaa tggacgtgcc 1895761 ggggccagcc ggcaagctgc tcgtggcggc tcggcagggg cgtcgtcggt agctttctcc 1895821 agccagccca actgcggact gactgaatca gtcttgggcc accaagtcac tggaatatgc 1895881 ttgggcacaa tacatcttga tgccatgcag tggccatggt cgtcggcgta ccgattggag 1895941 ccggccgttg ctaccacatt aatcggcatc agtgcctggt gggcgaatgg cagcgtgaag 1896001 caatacgccg gtgatctgac tgatcgtgtc gccacgatga cagtttgccg gcgcacgccg 1896061 gctccgcgag tgcattatcg acagtgacac gtttggcagg acccaaggag gccgagtcca 1896121 tgattcgtgc tgtgtggaat ggaacagtgc tcgctgaggc gccgcgaacc gtacgggtgg 1896181 aaggcaacca ctactttccg cccgagtcgc tgcaccgcga gcatctaatc gaaagcccga 1896241 ccacgtcgat atgcccatgg aagggtctgg cccattacta caacgtcgtc gtggacggcc 1896301 cctatggtcc ggttaacccg gacgctgcct ggtactaccg ccggcccagt ccactggctc 1896361 gccggatcaa aaaccatgtt gcgttctggc acggtgtgac ggtcgaaggt gaatccgaga 1896421 gtcggcatgg cttggcgcgc cgggttgtgg cgtggctcgg caaatagcgg cgtgatgcca 1896481 acggtcggac ccgcggacca cgcggcgggc ctagatcggc gcgcgacgcc tgaccagctg 1896541 ccgatatggc gtatcggcat catcagtggg ctggtcggca tgctgtgctg tgtcgggccg 1896601 accatcctgg cgttggttgg gattattagt gcggcaacgg ctttcgcgtg ggcgaacgac 1896661 ctctatgaca actacgcgtg gtggttccgc gtgagcgggc tcgcggtgct tgccattctg 1896721 gtgtggtggg cgctacgaca tcgaaaccga tgtagcgtca acgcaatccg ccggttacgg 1896781 tggcggctga tggcagtgct ggcaatagcg gttggtactt acggtgtctt gtccgctgtg 1896841 acgacgtggt tcggtacgtt cgtatagttg cagtattaga cgaacggggt cgccggcgac 1896901 gggtgcagca tgatttcgga gtcgttggcg catactgtcc ggccagcgtg ccgcagtagc 1896961 aagctacaag ccgccgcggc agcaagtacg gcggcgacgg tcagcaacgc gagatggtaa 1897021 gtgccggtgg cgtctttgag gtggccggtg gcgtagggac cggcgaagct cgccagactg 1897081 gccacggcat tgaccgtcgc gatggccacg gcgacccggg gaccggccag cgcggcggtg 1897141 caacggctcc agaaagcggg catcgcggca aggattccgg cgacggcgat ggtcagccaa 1897201 ctcagcgtca ctatcggtga catcggactc aatgccgcac cgagcgcggc gctgcccgcg 1897261 gccgttgttg gcagtgtgat atggcccgct tgggcgcccg agcggtcgat gctgcggtgg 1897321 ctccaggcca acatggccag cgcggcgaca ccgtacggca gggccgccaa cgtggcagcg 1897381 gtcagcgtgg cggtgccgtg tgccagcgac gcaactagtt ggggcagaaa gaactgcaac 1897441 gcatacagcg cgaaatacag gcccccgtag acgacagcga aaaggacaag atcccaaccg 1897501 gctccactcg accgaccggt cggggcaggg gtgtcctcgg tcagccgggc cgacagctct 1897561 gcacgttcct cgggggtgag ccagcttgcc cgttgcgggt tatccggcaa caggcgccga 1897621 agaagcggcg ccagcagcag tgcaggcaat gcctcgatca caaacattgc ccgccagccg 1897681 ggtagcccgg ccatgtgaac gtggccgacg atcagcccag acagcggcag gccgaccgtg 1897741 ttggcgaccg gaatggccag cagaaaggtg gctacggcgc gggctcgctg cgcgcacgga 1897801 aaccacaccg tcagatacgc gatgacgccg gggaagaagc cgccctcggc gacgccgagg 1897861 gcgaagcgcg ccagatacaa ggtgtgcgcg ctggtgacca aggccgtggc cgccgagcac 1897921 acaccccaag ccaggacgac cgccgtgagc gttcgaccgg caccgaagcg cgccaacgcc 1897981 gcgttggcgg gaacctggaa caggacgtag ccgaggaaga agacgccggc ggcggtgccg 1898041 tatgcggtgg cgctcaggcg caggtcggcg ttcatcgcca gggctgcgac cgagatgttg 1898101 gcccgatcaa cgaagttgat cacatacaac acgaacagca ggggcaacag ccggcgcgcg 1898161 gccttgccca gggcattgtg cgtggggctt gccgcgattg tcgccacctg cggctccttc 1898221 cgtgggcctg tcgaacaatt gcatcatgaa atgaccccaa cccggtcttt gtagtccggc 1898281 gtgtcactaa cacgatcggt tatgtcattg cagtaaaacg gatttggcgt tgcgccggat 1898341 gtgtttcgcc gtcaatctcg gcgtaggggc cggcgaagaa caggctccgg cccgcccgct 1898401 gtggtggggc gagcaggatg tcgcggccga tcgaccacgc gatgtggttg gcctgcaggt 1898461 tcgcgaacag gccgtgggtg ccgtacttcg tcgcacagga ggcgtccgcc ggtagccaac 1898521 ccagcccagc gacgaagaat tccgcccagc agtggtagcc gcacacctcg caatcctgcg 1898581 caccgggctg cggtagctcc aaggcctgac cgagcacaaa tcgtgcgggg atgtcgaccg 1898641 atcggcacag cgagacgaac aatgcgtgga tgtcgttgca gttgcccacc gagcaggtca 1898701 gggcatgctc ggtgctgccc aggaaagact gcttcgtcgc gtcgtagtcc atggcgccgg 1898761 tgacgtagtc gtagatgcga cgggcctgtt cgagcgggtt ggtctcgggg ccgacgacgt 1898821 cttgggccaa cgtacgggtg cgctcatcga catcgacatg tgcttcgggg atcaaggcgc 1898881 ggctgaacaa ttgcgccgtg gccaacgggc gggcccgtgc cggatccgga gcatgcccga 1898941 tcgcccggcg ttccacaaca tagcggatag accaactcgc cgccgtcgcc aagcgcagcc 1899001 ggctgtacaa catcaggttc ccgaactccg gctcacgcgt gaggtcatag ggatcctcgc 1899061 tggtcacctc gacgtccaga acgcgttgaa acgcgccgtc accgatgacc gggcaccaca 1899121 tctcgacggt gtgggcacct tgggtggaat cgatcgtgat gtgatcggtg atttcgaaca 1899181 gcccgatcgt cgcatccgcg tgtgcggata ccgcggggtc ggtgatcgtc atcggttagc 1899241 tccttccgct gagactggtt tatgttcgaa caaccggcag atcggctgcc agccattcgg 1899301 agaacccgcc gtcgagtcgg cgggcagaaa atccgttggg gcgcaacagt tctagcgcgt 1899361 cataggcata cacgcagtaa ggtcctcggc agcaggcgac gatgtcgatg ccggacggga 1899421 gttcatcaag ccgctcggcc agttcgtcga ggggaatgct cactgccccg ggcagatgcc 1899481 cggcggcgta ttccatggcc ggccgcacgt cgaggaccag caccgacccg gcggccaccc 1899541 gagcttgcaa ctcgtctcgg ctgatcggtt ccaggctgtc tctgtcggtg tagtactgcc 1899601 gcaccaggga gccgaccgag gccagattgc gttcggccac agcgcgcacc gcgcgcacta 1899661 cgtcccacac ctgcggatcc gacagtgcgt aaatcacccg tttgccgtcc cggcggctgg 1899721 tcaccaggcc ggcgcgccga agttgcaaca agtgctggga ggcattggca aacgtcaacc 1899781 ccgacgcacg agccagcgcg tccacactgc gttcaccctg caccagcaga tccaacagct 1899841 ccaatcgatg gccgctggac agcgcttgcc cgaccagggc gaactgctcg aagatcagct 1899901 tctttgcacc ggacatgccg ccgctccatt cctcgattca gatgttcgta tattcaattg 1899961 attgtttgat catgtcattc cgacacgctg ctgcggtttc gccgccgggg cgtcgcaccg 1900021 ctactcggtg ccggctacgg cctcacccgc ggccgcgggt tcgcgaccgg gccctgcgcc 1900081 gcgccctcgg ggtgggcgga atgtcctccg cggtcagcac cggtgcattc ctgaccaccg 1900141 tgtgcctcgc gcacctggtg ctcggcgcgc ttatgggtgt actagtgcac gaattcggcg 1900201 ccgacatgct gtcgttgtgg cccgtgggac cggcgctgtg tcattgagcc cgggcgcgta 1900261 atccgtgttg gtcggtgatc tcgatgaccg catacccgac ggtgatcaat cggtcgcgct 1900321 cgaactcctt gagaatcttg ttgatcgatg ggcgctgcgc tccaagcatt gcggcgaggg 1900381 tgcgttgggc aagttcgata cgggcatcga ttgcctcgtc gagcaggagc tgcgcaacct 1900441 gcgcgggcag cgggcggcca agcatgccca ttaaccgaat ctgcgcagtc gacacccgtt 1900501 gcgccacact cgacagccac cgccgtgcga tggccgggtg ggtagctagc agccgctcga 1900561 acgcctgccg gtccaggaac aggcaggtcg cttgggtcaa ggcgcgcccc gtgtagacca 1900621 tcggcatctc cagtagcagc gggatgtcgc catcgacatc gccgggatga aggatgttca 1900681 ccacggcgcg gcgccgcctg gagccgaccg cgagctcaat taatccgtgt cgcacaatcc 1900741 acaccccgtc cgcggtttga tcggcgtgga ataccactgc cccgggggca aactccttga 1900801 cttgtaacgt ttcggccaat gccgacacat cgtcacggtg cagtggcgcc gagcctccgc 1900861 gaccgacgca ccgcgcaatc caggctgcct gtcggacctg ggcctcggaa ggcggttggc 1900921 ccccagtcac cgcatgaacg agatgccgca gcgggcgcac cgaccgatct gccatggccc 1900981 ctccttgaga gcaggcgatg ccgtcatcgt gctgccaatt gtcagcgcgc gtggattgcg 1901041 tgcgggttgg cttgccctga atgggaaatt agtcgatcga agagaacacg caagcccgtt 1901101 ctgcgcccca ggcactctgt cagcacgctg acaaaccgat tcttggcgga gttttgccat 1901161 cggtatggta ttggggtgcc tactcgattg gcccgcggtg cgaccgtgcc gactcgccgt 1901221 ctgcaggaca tcaacgatca accggtggac gtcccggctg cgaccggaag gacacacctg 1901281 cagtttcggc ggttcgcggc ctgtccgatc tgccacctgc acctgcgcag cttcgccaac 1901341 cggcaccaag aggttgcgga cagtggaatc accgaggtgg tgttttttca ttcggcggcc 1901401 gacgcgctgc gcggatacca gtccttgcta ccgttcgccg tgatcgccga ccccgaccga 1901461 gtgcagtacc gcgagttcgg cgtagagaaa agtctgggcg ccatcactca tccgcgggca 1901521 ttgtgggctg ccgttcgggg gtcggcggcg atgttgcatc gcaacgatcc ggaacgggcg 1901581 ggcgtcggat tcggtgacgg cacaacgcat ctgggattgc ccgccgactt tctcctggat 1901641 gccgatggaa ctgtcgccgc tgtgcactat gggcgtcatg ccgacgacca atggtcggtg 1901701 gatcagctca tcgacatcaa ccgctcgctt ggaggtaagg gcactcagtg actcattccc 1901761 gtctgattgg cgcacttacc gtagtcgcaa ttatcgtcac tgcatgtggt tcgcaaccga 1901821 aatcccagcc cgcagtggca cctaccgggg acgcggccgc tgccacccag gtgccggcgg 1901881 gccaaaccgt tcccgcccag ctgcagttca gcgccaaaac ccttgatggg cacgactttc 1901941 acggggaaag cctgctgggt aagcccgcgg tgctgtggtt ctgggcgccc tggtgtccga 1902001 cgtgccaagg cgaagcgccg gtagtcggcc aggtcgccgc gtcacacccg gaagtgacgt 1902061 tcgtcggggt ggccggcctg gatcaagtac ccgcaatgca ggagttcgtc aacaaatacc 1902121 cggtgaaaac gtttacccag ctggctgata ccgacgggtc ggtctgggcg aatttcggtg 1902181 tcacccagca gcctgcgtac gcgttcgttg acccgcacgg caacgtcgac gtcgtcaggg 1902241 gtcggatgtc gcaggacgaa ctgacgcggc gcgtcacggc gttaaccagc cgttgatcga 1902301 cgccacgccg gtcggcttgg cgttggccca cgcagaaatg cctggccttc gcgacgagtt 1902361 ggggcttgcg ccgcgtgtga tactgccctc atgacgatgg ctcgggtgcg tcgcggcacg 1902421 gaactgttgt tgtcacctca gtcgccgccg gccaccggcg ggctgatcgt gttgaccggt 1902481 ctgcggctgt tggctgggtt gatctggctc tacaacgtgg tctggaaggt gccgccggac 1902541 ttcggtgagc gcggccggcg ggacctgtat cacttcacgc atctggcggt tgaacacccg 1902601 gtgttcacac cgttcagctg ggtgatcgag catgccgtgc tgccgtactt cacggcattc 1902661 ggttgggggg tgttgttcgc ggagtccgcg ctggcggtgc tgctgctgac cgggacggcc 1902721 gtgcggctgg ccgcgttgat cgggatcggg cagtcggtcg cgatcgggct gtcggtggcc 1902781 gagtcacccg gggagtggcc gtgggcgtac gcgatgctgc tgggcatcca cgtcgtcttg 1902841 ctgttcacct gctcgacccg gtacgccgcc gtcgacgcgg tgcgcgccgc cgccacgggg 1902901 tcggccgctc ggacggcggc gcagcggctg ctggccggtt ggggaatcgt gcttgggctg 1902961 atcggacttg tcgcggtatg gcgtggcctg ggcgatgatc gacccgccta tgtcgggata 1903021 cgggcgttgg agttctccct cggggaatac aacctgcgcg gcgcactggc gctgatcgcg 1903081 atcgcgctgg caatgttggc ggccgccaaa cgcggctggc gcaccgtcgc gttggtcgcg 1903141 gcggtggtcg cggtggccgc cgcggccgcc atctacctgc aagtcggccg gaccgcggtg 1903201 tggctcggcg ggacgaacac caccgcagcg gttttcgtgt gcgcggcggt ggtgagtctg 1903261 gcaaccgaat tccggatcgg acgggtggaa ggggcgtgat ggccacaccg ggcgttgtgc 1903321 aggaagtcgt ttccgtcgct gcagaacacg ccgagcgggt cgacaccgac tgtgctttcc 1903381 cggccgaggc ggtcgacgcc ctccgcaaga ccggcctgct gggtctggtg ctgccccgcg 1903441 agatcggcgg aatgggttcc ggaccagtgg aattcaccga ggtggtcgcc cagctgtcgg 1903501 ctgcatgtgg atcaacggcg atgatctatt tgatgcacat ggcggccgct gtcacggtag 1903561 ccgcgtcgcc tccgccgggt ctgccggatc tgttggcgga catggcttcc ggaaaacaac 1903621 ttggcacctt ggcattcagt gaaccgggtt ctcgttcgca cttctgggcg cccgtgtcca 1903681 cggcgagcgc cgacggtgac ggcatcgcgg tgcgggccga caagagctgg gtgacctcgg 1903741 cggggttcgc cgacgtctat gtggtgtccg tcggttcggc cgacggtgcc gcgggcgacg 1903801 tcgacctcta cgcggttccg gcggacacac cgggcctgcg ggtagcgggc accttcaccg 1903861 ggatgggtct gcgggggaat gcctccgcgc caatggccgt cgacattcgc atcccggatt 1903921 cgtatcgtct cggggaggcc ggcggcggat tcggcatcat gatgcaaacg gtactgccct 1903981 ggttcaatct cggaaatgcg gctgtctcac tgggtttggc gaccgcagcc accggtgccg 1904041 cggtcaagca cgtcgggacc gcccggttgg aacacctcgg tggcagcctg gccgagctgc 1904101 ccacgatccg cgcccagatc gctcggatgg gcaccacgct ggccgcgcaa aaggcgtacc 1904161 ttgaggtcgc cgccaacagt gtcagctcgc ccgacgacac caccttgacc cacgtgctgg 1904221 gtgtgaaggc ctcggtcaac gacgccgcgc tgaccatcac cgaatcggcc atgcgggtgt 1904281 gcggcggggc cgcgttctcc aagcatctgc ccatcgaacg cgccttccgc gacgcccggg 1904341 cggggtcggt gatggcgcca accgccgacg cgctctacga cttctacggc agggccgtca 1904401 ccgggctgcc gctgttctag gaggcgatat gtcaaccgaa ccgctcgtcg tgggagcagt 1904461 cgcatacaca cccaacgtgg tcccgatttg ggaaggcatc cgcggctact tccaagactc 1904521 cgaaagcccg gacacccaaa tggatttcgt gctctactcc aactacgcgc ggctggtcga 1904581 ttcgctgatc gccggccaca tcgacatcgc ctggaacacc aacctggcct acgtgcggac 1904641 cgtgctgcaa accggcgggc ggtgcacgcc attggcccag cgcgataccg acgtcgacta 1904701 caccaccgtg ttcgttgcac atgccggcag cgatctgcac ggcgctaaag acattgccgg 1904761 aaagcgcctt gcgctcgggt ccgccgactc tgcgcacgcg gccatcttgc cgctctatta 1904821 tctgcgccgg gcgggcatcg ccgagtctga cctgcaggtg atccgcttcg acaccgacat 1904881 cggcaagcac ggcgacaccg gtcgcagcga actcgacgcg gtggatgcgg tgctcgccgg 1904941 tgaggccgac gtggcggcga tcggcagctc cacgtgggcc gcgatgggcg ccgcggagct 1905001 gatgggggag tcgttgaccg aggtgtggcg caccgacggc tactgccact gcatgttcac 1905061 cgcgctggat acgctgcccg ccgaaagata ccagccgtgg ctcgaccggt tgctggcgat 1905121 gagctgggat gactccgagc atcgaaagat cctcgaactc gagggtttac gacgttgggt 1905181 gcctccgcac ctggacggct acaagccgct gttcgaggcc gtgcaggagc agggcatcga 1905241 cccgcgatgg tgatcataga gctgatgcgc cgggtggtag gtctcgcaca gggagctacc 1905301 gccgaggtcg ccgtctatgg cgaccgagat cgtgatctcg cggagcgatg gtgcgcgaac 1905361 accggaaaca ccctggtgcg cgccgacgtg gaccagaccg gcgtcggcac cctggtggtg 1905421 cgccgcggcc atccgcctga cccggcaagc gtgttgggcc ccgaccggct acccggggtc 1905481 cggttgtggc tgtacaccaa cttccactgc aacctgtgct gcgactactg ctgcgtctcg 1905541 tcgtcaccaa gcaccccgca tcgcgaactg ggggcggagc ggatcggccg aatcgtcggt 1905601 gaagcggcgc gctggggagt gcgcgaactg ttcctcaccg gcggtgagcc gttcctgctg 1905661 cccgacatcg acacgatcat cgcgacctgt gtgaagcagt tgcccaccac cgtcctcacc 1905721 aacggcatgg tgttcaaagg gcggggtcgg cgcgcgctgg aatccctacc tagagggctc 1905781 gccttgcaga tcagcctgga ctcggccacc ccggagctgc acgatgcgca ccgcggcgcg 1905841 gggacgtggg tcaaggcagt agctggtatc cggttggcgc tctcacttgg cttccgggtg 1905901 cgggtggccg cgacggttgc cagccccgca cctggcgagc tgacggcgtt tcacgacttc 1905961 ctcgacgggc ttggcatcgc acccggggat cagctggtcc ggccgatcgc gctggagggc 1906021 gccgcgtcgc aaggggtggc gctcacccgc gaatcgctgg ttcccgaggt gaccgtcacc 1906081 gccgacggcg tgtactggca cccagtggcc gccaccgacg agcgcgccct ggtcacccgt 1906141 accgtcgaac ccttgacccc ggcgctggac atggtaagcc ggctattcgc cgaacagtgg 1906201 acacgagccg ccgaagaggc cgcgttgttc ccgtgtgcgt agtgcccagt ctgccggccg 1906261 cgaacccagg attaattgct gatgacaagt attgccctac tgcactatag ttctgcttgc 1906321 acttgaaaac aacgaaccgt gatgcgggtc gtaagggatt ccggtaagga acacagtcaa 1906381 gttcttgcac gcgtcggcgg cagtgttgcc tcaacgccca aactgcacca aactgtttcg 1906441 cccacggcgg ggcgtgtctg agaggtatcg cgtgaccacc gcccataacg gatccgctcc 1906501 gcgttttcaa cgtacccgct ctggctacga cccggtcgca gtcaatcatt acatcgccga 1906561 actcgtgctg cgtcagcagg cgcagcactg tgagattgaa acgctcaagg cagaaatagc 1906621 cagtctgaag gacgaaaacg ctgccctgaa ggacacctcg ccgtcagcac aggcggtgac 1906681 cgatcggatg gcgaaaatgc ttcgactcgc tgtcgacgag gtcttccaga tgcagtcgga 1906741 ggcacgggcc gaggccgcaa cattagtttc tgcggctagg gatgaggcgg aagcggtccg 1906801 aacgcagaag cgagaaatgc tggcggatat gaacgcccgg caaagagcgc tggagtccga 1906861 gcatgccgac gtgatgcgcc gcgctcgtga agaggctgaa cagcttgtgg cgcaggcaac 1906921 cgccgaggtg gagcggatgc gtgtcatcga tgccagacgc cgtgagaaag ccgagcagga 1906981 acttgatgcc gaaatcatca ggcttcgcac cgatgcccaa tttcagatcg acgatcagct 1907041 gcaggccaca cagcaggagt gtgagaagcg gcttggcgaa gccaaaatcg aggccgatcg 1907101 acggctgcat gttgccgacg agcagattga gcacggcctc agcgaggctc ggcgaacgtt 1907161 ggaagagatc agccagcggc gagtcggcat cctcgaacaa ctagcgcgta ttcacgcaca 1907221 gctcgagaat attccagcgc tcctggaatc ggctcgacat agcgagacgg agccactgca 1907281 gtccataaac ggcgcggtcg ctgagctacg ggccatttag cgatcgcgtg cctgagcgcg 1907341 actcatctgt gacagttccg tcacggctgg gtcaggtgcc ggtgtcctgg cgacgccgac 1907401 tgcgcacaga ccgaaacagc acggtgtgga tgtgccatga tgtgcacgct gtcaaggcca 1907461 gtcgggtgac gatgcgggcc ggtgtggtcc gaggaggagc ccgacaattt aagctagtcg 1907521 ggtgacgatg cgggccggtg tggtccgagg aggagcccga caatttaagc tagtcaggga 1907581 gccctcagga gcggtggtgg atctcaattt ttcgatggtc acgcgaccaa tcgagcgcct 1907641 ggtggccacg gcgcagaacg gtctggaagt cctgcgactc gggggcctgg aaaccggcag 1907701 tgttccgtcg ccgtcccaaa tcgttgagag cgtaccgatg tacaagctgc ggcggtattt 1907761 tccgccggac aaccgcccgg gacagccacc ggtgggtccg ccggtgctga tggtgcaccc 1907821 gatgatgatg tcggcggaca tgtgggacgt cacccgtgaa gacggcgcgg tggggatcct 1907881 gcacgccagc gggctagatc cctgggtcat cgacttcggc tcacccgacg aggtcgaggg 1907941 cggaatgcgc cgtaacctgg ccgaccacat cgtcgccctc agcgaggcgg tcgataccgt 1908001 caaggacgcc actggccacg atgtgcactt cgtcgggtat tcgcagggtg gcatgttctg 1908061 ctatcaggcc gcggcatacc ggcgttcgaa ggacatcgcc agcgtggtcg cgttcggctc 1908121 gccggtggac accctggccg cgttgcccat gggcatcccg gcgaacatgg gcgctgcggt 1908181 cgccgatttc atggccgatc acgtcttcaa tcgcttggat atcccaagct ggatggcgcg 1908241 catgggtttt cagatgatgg acccactcaa aaccgcgaag gcccgggtgg acttcgtgcg 1908301 tcagttgcac gaccgcgagg cactgctgcc gcgggaacaa cagcgccggt tcctggaatc 1908361 cgaaggatgg atcgcctggt cgggcccggc gatctcggaa ctgctcaagc agttcatcgc 1908421 gcacaaccga atgatgacgg gtggtttcgc catcagcggc cagatggtga cgcttaccga 1908481 tatcacttgc ccgatactgg cgttcgtcgg tgaggtcgac gacatcggcc agccggcgtc 1908541 ggtacgcggc atccggcggg ccgcgcccaa ctccgaggtc tacgaatgtc tcatccgggc 1908601 agggcatttc ggtctcgtcg tgggatcccg agcggcacaa cagagctggc cgaccgtggc 1908661 cgactgggtg cgctggatct ccggcgacgg caccaaaccg gaaaacatcc acctgatggc 1908721 cgatcagccg gccgaacaca ccgatagcgg tgtggctttc agctcccggg tcgcgcacgg 1908781 catcggggag gtctcggagg ctgcgttggc gctggctcgc ggcgcggccg acgcggtcgt 1908841 tgcggccaac agatcggtgc gcacgctggc ggtggagacg gtgcggacgc tgccgcgact 1908901 agcccggttg ggtcagctca acgaccacac ccggatctcg ctgggccgca tcatcgacga 1908961 acaggcacac gatgccccga agggtgaatt cctgttgttc gacgggcgcg tgcacaccta 1909021 tgaggcggta aaccggcgga tcaacaatgt cgttcgtggc ctcatcgcgg tcggggtgcg 1909081 gcagggtgac cgtgtcggcg tgctgatgga gactcggccc agcgcgctgg tcgccatcgc 1909141 cgcgctgtct cggctgggag cggttgccgt ggtgatgcgg ccagacaccg acctgtccgc 1909201 gtcggtccgg ctcgggagag tgaccgagat cctgaccgac cctaccaatc tggatgctgc 1909261 gcgccagttg cccggacagg tgctggtgtt gggtggtggt gaatcgcgtg atctggatct 1909321 gccggccgac gcacttgaac agggccaagt catcgacatg gaaaaaatcg acccggacgc 1909381 cgtcgagttg ccggcgtggt atcgaccgaa tcccggattg gcgcgggatc tggcgttcat 1909441 cgcgttcagt tcggccgacg gcgacctggt ggccaagcag atcaccaact accgctgggc 1909501 ggtgtcggcc ttcgggaccg cctcgacggc ggccctcggc cgcagagaca cggtgtactg 1909561 tttgacgccg ctgcaccatg agtccgcact gttggtcagc ctgggcggcg cggtcgtggg 1909621 cggaacccgt atcgcattgt cccgcggctt gcgcccggac cggttcgtgg ccgaggtacg 1909681 ccagtacggc gtcaccgtcg tctcctacac atgggccatg ctgcgtgacg tggtcgacga 1909741 tccggcgttc gtgttgcacg gcaaccatcc ggtgcggttg ttcatcggct cgggcatgcc 1909801 gaccggattg tgggagcggg tcgtcgaagc gttcgcaccg gcgcacgtcg tcgagttttt 1909861 cgccaccacc gacggacagg cggtgctggc caacgtggct ggcgccaaga tcggcagcaa 1909921 gggccgtccg ttgcctggcg ccggacgtgt cgaacttggg gcctacgacg ccgaacatga 1909981 cctgatcctg gagaacgacc gcggcttcgt gcaggtcgcc ggtgtcaacc aggtcggggt 1910041 gctgctcgca caatccagag ggccgatcga tccgaccgcg tcggtcaaac gcggtgtctt 1910101 cgctcccgcc gacacctgga tatctaccga ctacctattc tggcgtgacg acgatgggga 1910161 ctactggctg gcgggtggac gcggctcggt ggtgcgcact gcgcgcggga tggtttacac 1910221 cgagccggtc accaacgcgt tgggcctcat caccggtgtc gacctcgcgg tgacctacgg 1910281 tgtattggtg cgcggtcgcc acgtcgcggt gtcggcggtg acgttgctgc ctggagcgac 1910341 catcacagcc gccgacttga ccgaagccgt ggcgagcatg ccggtggggc tgggacctga 1910401 catcgtgcac gtggtgccgc agctaacgct cagcggtact taccggccaa cggtcagcgc 1910461 gttgcgggcc aacgggattc ccaaggcggg ccgtcaggca tggtatttca actccggcgg 1910521 caacgagtac cggcggttga cgccggcggt ccgcaccgag ttgaccggcc agcatcggcg 1910581 cggcaatgct tgacgaggcg ctgctcgcca tcctggtgtg cccggcggat cgaggtccgc 1910641 tcgtcttggt cgaggacggc gacatccagg tgctctataa cccgcggctg cggcgcgcct 1910701 accgcatcga ggacggtatc ccggttctgc tggtcgacga ggcccgcgag gtcgacgagg 1910761 acgagcacgc ccgcctcatg gcgcgaggtc gtccggcagc tccccagtga ggtagcgctg 1910821 caggttgggc gcgatggttt gcacgatctg ttcggccggc aacgaagcaa acggttcgat 1910881 tctgacgatg tagcgcgcca tgaccacacc catcagttgc gacgcgacga actgggtacg 1910941 gatcttgccg gttcccggcg ggttgtcgac gcgggaccca agctccacgg tgaccacttc 1911001 ctcaaggaag gagcgcgcca ggcccacgtc ggagcctgag atcaaggatc tcagcgtcgc 1911061 gatcaacccg gcacccagtt cggaatccca aatcggcagc aacaaggacg gcagcttgta 1911121 accgagttcc tcgacaggcg cctcgcgaat cggaccgatg atgaccatcg ggtcgatcgg 1911181 aatgtggatc gcggcggcga aaagctgctg tttggtgccg aagtagtgat gcactagtgc 1911241 ggcatcaaca ccggccttgg cggccacggc tcggatcgat gttctgtcaa tgccgttgtg 1911301 cgcaaagagt tctcgggcac tggacaggat tcgctcccta gtgtcagagc tgccggcggg 1911361 tcgcccgggc cgtctgcggc tgttgtccgg cgccgccacg ctatgacgtc cgtcgccgca 1911421 gtgtcaccgc cgccagacac agcgacgcga ccgcgaaact cagcacgacg acgacgtcgc 1911481 gcaccgcgat accggtcagc tccggatgcg cacccacctg ttgtagcgcc tcgagcgcgt 1911541 agctggccgg catcacgtta ctgatccact ccagccacgt cggcatcagt gcccgcggga 1911601 cgatgatgcc ggcgagcagc agctgcggca ccatcaccag cgggatgaac tgtacggcct 1911661 gaaattcggt gcgggcgaag gcactacaca atagaccgag cccgacaccc aagacggcgt 1911721 tgacgatcgc gatcgcgaac acccacaccg ggctgcccgc cgtgtcaaag ccaaggaacc 1911781 agaacgccac aatgcaggcc agcgtggcct gcgccgccgc ggcgatcgag aacgcggtcc 1911841 cgtagccggc gagcagatca agccggcgta gcggggtggt caggatgcgc tccagcgttc 1911901 ccgaagccct ttcgcgttgc atggtgatcg ccgtgatcac aaacatcaca aagagtggga 1911961 acaggcccag tagcaccagg caagcggtgt tgaacccgga tggggtaccg gggcgatgcg 1912021 ggacgttctc gaacatgaaa tacatcagcg tgatgatcag gatgggtacc agcaagatca 1912081 tcgcgacact gcggtgatca gcggcaagct gccggagaat ccgcgccgta gtggccgtgt 1912141 agttctgcag cgttagccgg ccgcgggcac ggtggtggtg cgtcggacga tggacagaaa 1912201 cgcttcctcc agtgatgtgc atccggtttc ctttcgtaga cggtgcggcg ttgtgtgggc 1912261 cagcagctgc ccctggcgca gaagcaacag atcgccgcag cggtcggcct cgtccattac 1912321 gtggctggac accaacagcg tggtgccacg ccgcgccagc gccgtgaacc gatcccataa 1912381 ttcgacgcgc aataccggat ccaggccgat ggtcggctcg tcgagcacta gcagatcagg 1912441 ccggccgacc agcgcacacg ccagcgagac ccgggcccgc tggccgccgg acaggttggc 1912501 acaacgggcg gtgcggtgat cgcgcaggtc caccgcttcg atcacctcat cggcggcttg 1912561 cctgtcgacg ccgcagagtt cggcgaagta gcggatgttg tcgatcaccc gcaggtcgtt 1912621 gtaaatggtc gggtcctgag gcatgtatcc aacccgatgg cgtagttcgg ctgacccagc 1912681 cggttggccc agcacgctca ccgaacccga ggcaatgatt tgggagccaa cgatgcagcg 1912741 aatcagtgtt gtcttgcccg acccggacgg accgagcagg ccggtgatcg tgccgcaggc 1912801 gacccggacc gaaacatcct gcagggcaag gcgtttacca cggatgacgc gcagctggtc 1912861 gatgatgacc gcggggtcgg caccgtcgcg aagtaattca tcacttgatg aaatcatcat 1912921 gtgatgaata tccgccagtc gtgcgggttt gtcaagggcc ggtgcacaat cgtctctgat 1912981 gaacgctgag gaactggcga tcgacccggt cgcggccgcg catcggctgc tcggcgcaac 1913041 tattgccgga cggggtgtgc gtgcgatggt ggtcgaggtc gaggcgtatg gcggggtgcc 1913101 cgacggtccc tggccggacg ccgcggcgca ctcttaccgc ggccgcaatg gccgcaacga 1913161 cgtcatgttc gggcccccgg ggcggcttta cacctaccgc agccatggga tccatgtctg 1913221 tgccaacgtc gcgtgcgggc ccgatggcac ggctgccgct gtgctactta gggccgccgc 1913281 catcgaggac ggcgccgagc tcgccacgtc tcggcgcggg cagacggtgc gcgctgtcgc 1913341 actggcgcgc ggcccgggaa acctctgcgc tgccctcgga atcaccatgg ccgacaacgg 1913401 gattgacttg tttgatccgt ccagtccggt gcggctgagg ctcaacgaca cgcaccgtgc 1913461 caggtcgggg ccgcgcgttg gggtcagtca agccgctgac cggccgtggc gattgtggct 1913521 cacgggtcga ccggaggtgt cggcctaccg gcgaagctcg cgggcaccgg cccggggagc 1913581 cagcgactag agtcttgcgg gatgtctggc atgatcctcg atgagctcag ctggcgcggg 1913641 ttgatcgcgc agtcgaccga cctcgacacg ttggccgccg aagcacagcg cgggccgatg 1913701 acggtgtacg ccggcttcga tcccaccgcg cctagcctgc atgccggaca tttggtgccg 1913761 ctgctgacgt tgcggcgctt tcagcgcgcc ggtcatcgcc ccatcgtgct ggccggcggg 1913821 gccaccggca tgatcggtga tccacgtgac gtcggcgagc gcagtctcaa cgaggccgac 1913881 accgtcgccg aatggaccga acggatccgt gggcagctgg agcgcttcgt cgacttcgac 1913941 gactcaccaa tgggcgcgat cgtcgagaac aacctggaat ggaccggctc actatcggct 1914001 atcgagtttc tacgtgatat cggcaagcac ttctcggtca acgtgatgct ggcccgcgac 1914061 accatccggc ggcgtctggc gggggagggg atctcttaca ccgaattcag ctacctgttg 1914121 ctgcaggcca acgactacgt cgaattgcac cggcgccacg gctgcacgct gcagatcggt 1914181 ggtgcagatc agtggggcaa catcattgcc ggcgtccggt tggtgcgcca gaagctcggt 1914241 gccaccgtgc atgcgcttac cgtccccttg gtgaccgctg ccgacggcac caagttcggc 1914301 aaatcaaccg gcggcgggag cctgtggttg gatccccaaa tgaccagccc ctatgcctgg 1914361 taccagtact tcgtgaacac cgcggacgcg gatgtgatcc gctacctacg gtggttcacc 1914421 ttcttgtcgg ccgacgagtt ggccgagctg gaacaggcga cagcgcaacg cccgcaacaa 1914481 cgggccgccc agcgccggct cgccagcgag ctcaccgtct tggtgcatgg cgaggcggcg 1914541 accgcagccg tcgagcatgc cagccgggca ctcttcggtc ggggcgagtt ggcccgtctg 1914601 gacgaggcga cactggctgc tgcgttgcgg gaaaccacgg tcgccgaact caaaccgggc 1914661 agtcccgacg gaatcgtcga cttattggtg gccagcggcc tgtcggccag caagggcgcg 1914721 gcgcggcgca cgatccacga gggtggggtg tcggtcaaca acattcgggt tgataacgag 1914781 gaatgggtgc cgcaaagttc ggacttcttg cacggccgct ggttagtgct acgtcgtgga 1914841 aagcggagta tcgccggggt ggaacggatt ggctgagccg agccaccacg tcctcgacgt 1914901 cctcgggtcc caaggtgata tgcgacgtga gcggcccatg gaatatcgct gggcggtagg 1914961 ggagggccag cgggggatct tatctcgagg gatggggtgg ggatgcatcg ataagccccc 1915021 cgctgaagcc tggggttcga cggggatctc agacttgggg ggattgggag gtgatgagac 1915081 ccccgtcgaa gtctagtgcg ttgacctcac tcggcggtgt cgccggcgtg gaacaacggg 1915141 atcgagtacg tggtctcgct ctcactaaac agctgtgcgt gtgacaacgg gtcatcatcc 1915201 tttcatgtga caggcgagcg gcgttgcgtt gtagtcgatt tccacttcct gacttatctt 1915261 tggcgggttt ggactccgct ggtatcccac gactagtcgg tggccggggg aaatgccgaa 1915321 tcccgcatcc ggtggatcgt gaagtccacc aatcggggga cgatcggccc gcggtgcccc 1915381 cctacccggt taacgcgcac acattccaca cgaaacgcgt tagtgtgcaa acctttatcc 1915441 cactgtgctg tgaacgtgac tcttgttggc cactgttgtc gaggtgcctt aaatgacgca 1915501 agtgcgacaa caacgagaag cgggagatga cggcacacac acacgacggg acacggacct 1915561 ggcgaacggg ccggcaggcg acgacgttgc tcgcgttgct ggccggggtg tttggtggtg 1915621 ccgcgagctg cgcggcgccg atccaggccg acatgatggg taacgcattc ctgacagcgt 1915681 tgaccaacgc cggcattgcc tatgaccaac cggcgaccac ggtggcgcta ggcagatcgg 1915741 tttgtccgat ggtggttgcg ccgggcggga cgttcgaatc gatcacgtcc agaatggctg 1915801 agatcaatgg catgtcgcgt gatatggcga gtacgttcac cattgtcgcg attgggacgt 1915861 attgcccggc ggtgattgcg ccgctgatgc ctaaccggtt acaggcctga tagttacggg 1915921 gcgcagcaac ccccgtaacc tctaccgagt ggtcgacgac aggcaagggc gcaggggcgg 1915981 gcgacgaccg cgctcggctg ccgccgacaa ccgacctgcg ttccgggatg ggcccgcgat 1916041 tccgccgggt atccacgcca ggcaactggc gcccgagatc cggcgcgaac tgagcacctt 1916101 ggaccgtgcc acggccgacg cggtggcatg tcacctagta gctgccggcg agttgatcga 1916161 cgacgaccca gaagccgctc tgcgccacgc gcgggcggcg cgggttcggg ccagcaggat 1916221 cgccgctgtg cgcgaagctg tcggaatcgc cgcctaccgc tgcggcgatt gggcgcaggc 1916281 gttggccgaa ttgcgggcag cccgaagaat ggggagcaag tcccccctgc ttgcgctgat 1916341 cgcggattgc gaacgcggtc tgggccggcc gcagcgggcc atcgaattgg cgcgcgggtc 1916401 cgaggcggtc gagctcagcg gtgacgccgc cgacgagttg cgcatcgtcg ccgccggcgc 1916461 gcgcgccgat ctcgggcaac tggagcaggc gttgacggtg ttgtccacgc cgcagctcga 1916521 cccgggccgt acgggttcga ccgcggcgcg cctgttctac gcctacgctg aaatactgct 1916581 ggcgttgggc cgtggcgacg aggccctgca atggttccta cggtccgcgg cggcggacat 1916641 cgacggcgtc accgacgccg aagatcgggt agacgagcta ggcgcacgag aacagaaatg 1916701 aaaagcattg cgcaggaaca tgactgtctg ctgattgacc tggacgggac ggtgttttgt 1916761 ggccgtcagc ccaccggcgg cgcggtgcag tcgttgagtc aggtgcgcag ccgcaagctg 1916821 tttgtcacca acaacgcgtc gcgtagcgcc gacgaggtgg cggcgcactt gtgcgagctc 1916881 ggcttcaccg caaccggtga ggacgtcgtc accagcgctc agagcgctgc ccacctgctg 1916941 gccggccagc tggcgccggg tgcgcgggtg ctcatcgtcg gcaccgaggc gttggccaac 1917001 gaagtcgccg cggtcggatt gcgtccggta cgacgctttg aggatcgacc cgacgccgtc 1917061 gtacagggcc tttcaatgac caccggatgg tccgaccttg ccgaagccgc gctggccatc 1917121 cgggcgggcg ccctgtgggt ggcggccaac gtcgacccca ccttgcccac cgaacggggc 1917181 ctgctgcccg gcaacgggtc catggtggct gcgctgcgca cggccaccgg catggacccc 1917241 cgagtggcgg gcaagcccgc gcccgccttg atgaccgagg cggtggcccg gggcgacttc 1917301 cgggcggcac tggtggtcgg tgaccggctg gacaccgaca tcgagggtgc caacgccgcg 1917361 gggttgccca gcctgatggt gctcaccggg gtcaacagcg cctgggatgc ggtgtacgcc 1917421 gaacccgtgc gccggcccac ctacattggc cacgacctgc gctcgttaca ccaggacagc 1917481 aagctgctgg cggtggcacc gcagccgggc tggcagatcg acgtcggtgg tggtgcggta 1917541 acggtctgcg cgaacggcga cgtcgacgat ctggaattta tcgacgacgg gctatccatc 1917601 gttcgggctg tggccagcgc ggtatgggag gcgcgggccg ccgatcttca ccagcggcca 1917661 ctgcgcatcg aggccggcga cgagcgggcc cgtgcggcct tgcaacgctg gtcgttgatg 1917721 cgcagcgatc atccggtgac tagcgtagga acgcaatgac catcgatcct gaccagatcc 1917781 gtgccgaaat cgacgcccta cttgcttcgc tgcccgaccc cgccgacgcc gagaacggac 1917841 cgtctctggc cgaactcgaa ggcatcgcac gtcgtctttc cgaggcgcac gaggtgttgt 1917901 tggccgccct ggagtcggcg gagaagggtt gagtgcggcg tggcacgacg tgcccgcgtt 1917961 gacgccgagc tagtccggcg gggcctggcg cgatcacgtc aacaggccgc ggagttgatc 1918021 ggcgccggca aggtgcgcat cgacgggctg ccggcggtca agccggccac cgccgtgtcc 1918081 gacaccaccg cgctgaccgt ggtgaccgac agtgaacgcg cctgggtatc gcgcggagcg 1918141 cacaaactag tcggtgcgct ggaggcgttc gcgatcgcgg tggcgggccg gcgctgtctg 1918201 gacgcgggcg catcgaccgg tgggttcacc gaagtactgc tggaccgtgg tgccgcccac 1918261 gtggtggccg ccgatgtcgg atacggccag ctggcgtggt cgctgcgcaa cgatcctcgg 1918321 gtggtggtcc tcgagcggac caacgcacgt ggcctcacac cggaggcgat cggcggtcgc 1918381 gtcgacctgg tagtggccga cctgtcgttc atctcgttgg ctaccgtgtt gcccgcgctg 1918441 gttggatgcg cttcgcgcga cgccgatatc gttccactgg tgaagccgca gtttgaggtg 1918501 gggaaaggtc aggtcggccc cggtggggtg gtccatgacc cgcagttgcg tgcgcggtcg 1918561 gtgctcgcgg tcgcgcggcg ggcacaggag ctgggctggc acagcgtcgg cgtcaaggcc 1918621 agcccgctgc cgggcccatc gggcaatgtc gagtacttcc tgtggttgcg cacgcagacc 1918681 gaccgggcat tgtcggccaa gggattggag gatgcggtgc accgtgcgat tagcgagggc 1918741 ccgtagtgac cgctcatcgc agtgttctgc tggtcgtcca caccgggcgc gacgaagcca 1918801 ccgagaccgc acggcgcgta gaaaaagtat tgggcgacaa taaaattgcg cttcgcgtgc 1918861 tctcggccga agcagtcgac cgagggtcgt tgcatctggc tcccgacgac atgcgggcca 1918921 tgggcgtcga gatcgaggtg gttgacgcgg accagcacgc agccgacggc tgcgaactgg 1918981 tgctggtttt gggcggcgat ggcacctttt tgcgggcagc cgagctggcc cgcaacgcca 1919041 gcattccggt gttgggcgtc aatctgggcc gcatcggctt tttggccgag gccgaggcgg 1919101 aggcaatcga cgcggtgctc gagcatgttg tcgcacagga ttaccgggtg gaagaccgct 1919161 tgactctgga tgtcgtggtg cgccagggcg ggcgcatcgt caaccggggt tgggcgctca 1919221 acgaagtcag tctggaaaag ggcccgaggc tcggcgtgct tggggtggtc gtggaaattg 1919281 acggtcggcc ggtgtcggcg tttggctgcg acggggtgtt ggtgtccacg ccgaccggat 1919341 caaccgccta tgcattctcg gcgggaggcc cggtgctgtg gcccgacctc gaagcgatcc 1919401 tggtggtccc caacaacgct cacgcgctgt ttggccggcc gatggtcacc agccccgaag 1919461 ccaccatcgc catcgaaata gaggccgacg ggcatgacgc cttggtgttc tgcgacggtc 1919521 gccgcgaaat gctgataccg gccggcagca gactcgaggt cacccgctgt gtcacgtccg 1919581 tcaaatgggc acggctggac agtgcgccat tcaccgaccg gctggtgcgc aagttccggt 1919641 tgccggtgac cggttggcgc ggaaagtagc ggcgcgccga aggtgttgac tgaattacgg 1919701 atcgagtcgc tgggcgccat cagcgttgcc accgctgagt tcgatcgcgg ctttaccgtg 1919761 ctgaccgggg agaccggcac cggcaagacc atggtggtga ccgggctgca cctacttggt 1919821 ggtgcccggg ccgatgcaac tcgcgttcgg tccggtgctg accgtgccgt tgtcgaaggg 1919881 cgttttacta caaccgatct cgacgacgcg accgtcgcgg ggctgcaggc ggttctcgac 1919941 tcgtcggggg ccgagcgcga cgaggacggc agcgtgatcg cgttgcgctc gatcagtcgc 1920001 gatggaccgt cgcgcgccta cctcggcggc cgcggtgtac ccgccaaatc gttgagcggt 1920061 ttcacgaacg agctgcttac tctgcacggg cagaacgacc agctgcggtt gatgcgcccg 1920121 gacgaacaac gtggtgcact ggaccgcttt gcggccgctg gcgaagccgt ccagcgttac 1920181 cgcaagctgc gggatgcctg gctaacggcc cgacgcgacc tcgtcgaccg tcgcaaccgg 1920241 gcccgggaac tagcgcaaga ggccgatcgg ctgaaattcg cgctcaacga gatcgacacc 1920301 gtcgacccgc agccggggga ggacgtggcg ttggtcgccg acatcgcccg gctttccgaa 1920361 ctggacaccc tgcgggaggc cgcgactact gcacgcgcga cgttgtgcgg gacaccagac 1920421 gcggacgcat tcgaccgcgg cgccgtcgac agcctcgggc gggcacgtgc ggcactgcaa 1920481 tcgagcgatg atgccgcgtt gcgggggttg gccgaacagg tcggtgaggc gttgacggtg 1920541 gtcgtcgatg cggtcgccga gctcggcgcc tacctggacg agctgcccgc cgacgccagc 1920601 gcgctggacg ccaagctggc gcgccaagcc cagctgcgaa cgttaacccg caagtacgcc 1920661 gccgacatcg atggcgtgct ccggtgggcg gatgaggcga gggcaaggct ggctcaactc 1920721 gacgtctccg aagaagggct ggcagcgctg gaacgccgta ccggtgagct cgcccacgaa 1920781 ttaggccaag ccgcagttga tctcagcacg atccggcgga aggcggccaa gcggctggcc 1920841 aaggaggtca gcgcggagct gtccgccctg gcgatggccg atgccgaatt caccatcggt 1920901 gtgaccacag agctggccga ccacggcgat cccgtcgcct tggccctggc gtcgggcgaa 1920961 ttggcccggg ccggtgccga tggcgtcgat gcggtcgagt tcggtttcgt cgcacaccgg 1921021 gggatgacag tgctgccgct ggccaagagc gcatccggcg gcgaactgtc ccgggtgatg 1921081 ttgtccctgg aggtggtgct ggctacttcg cgaaaacaag cggctggcac cacgatggtg 1921141 ttcgacgaga tcgacgccgg cgtcggcggc tgggctgcgg tacagatcgg gcggcggctg 1921201 gcgcggttgg ctcgcaccca ccaggtcatc gtggtcaccc atctgccgca ggtcgccgcc 1921261 tatgccgatg tgcacttgat ggtgcagcgc accgggcgcg acggtgccag cggtgtgcgg 1921321 cgcctgacca gcgaggatcg ggtggccgag ctggcacgga tgctggccgg gcttggtgat 1921381 tccgacagtg gtcgcgcgca cgcgcgggag ttactcgaga ccgcgcagaa cgacgagctc 1921441 acctagcaag gctgtgactg aagtgatgtc atataacttg tgaggctaat gttacggcgc 1921501 gcctccacgc acctgcccag cttcaccgcc agaatccccc catgaggatg tcagcgcttc 1921561 tgtcccgtaa cacctcccgg ccgggcctga tcggcatcgc ccgggtcgac cggaatatcg 1921621 accgattgct gcgtagggtc tgtcccggcg acattgtggt tctcgacgtc ctggatctgg 1921681 accgcatcac cgccgatgca ctggtggaag cggagatcgc cgccgtggta aacgcatcgt 1921741 cgtctgtctc gggccgctat ccgaacctcg gtccagaggt gttggtcacc aacggtgtca 1921801 cgctgatcga cgagaccgga ccggagattt tcaaaaaggt caaagacggt gccaaggttc 1921861 gcttgtatga aggcggggtg tacgccggcg accgccggct gatccgcggt accgagcgta 1921921 cggatcatga catcgccgac ctgatgcggg aggccaagag cgggttggtc gcccacttgg 1921981 aggcgttcgc cggcaacaca attgagttca tccgcagtga aagcccgcta ttgatcgacg 1922041 gcatcgggat tcccgatgtc gacgtcgatc tgcggcgtcg gcacgtggtg atcgtcgccg 1922101 acgaacccag cggacccgat gacctgaagt ccctcaagcc gttcatcaag gagtaccaac 1922161 cggtgctggt tggtgtgggc accggcgcgg acgtgttgcg caaggcgggg tatcgcccgc 1922221 agctcatcgt cggcgaccct gaccaaatca gcaccgaggt gctcaagtgc ggtgcccagg 1922281 tggtgttgcc cgccgacgcc gatggacacg cgccgggcct ggagcgaatc caggatctcg 1922341 gtgtcggcgc catgacattc ccggccgcgg gctcggcgac ggatctggcc ttgttgctgg 1922401 ccgaccatca tggcgcggcg ctactcgtca ccgccggcca cgctgccaac atcgagacgt 1922461 tcttcgaccg cacgcgtgtg caaagcaacc cttcgacctt cctcaccaga ctccgggtag 1922521 gggagaagtt ggtggacgcc aaggcggtgg ccacgctcta ccgcaaccac atctcgggcg 1922581 gcgccatcgc attgctggca ctgaccatgc tgatcgccat catcgtggca ctgtgggtat 1922641 cccgcaccga cggcgtggtc ctgcattgga tcatcgacta ctggaaccga ttctcacttt 1922701 gggtgcagca cttggtctcc taggttttct tggacggtgg gttcatgatc tcgttgcgtc 1922761 aacatgcggt ctcactggct gcggtcttcc tggcgctggc catgggcgta gtgttgggtt 1922821 ccggcttttt ctccgatact ttgctgtcca gcttgcgtag cgagaagcgg gacctctaca 1922881 cgcagatcga ccgactcacc gatcagcggg atgcacttcg cgaaaagctc agcgcggcag 1922941 acaatttcga tatccaagta ggcagccgaa tagtgcacga cgcgctagtc ggcaagtcgg 1923001 tggtcatctt ccgcaccccg gatgcccacg acgacgatat cgctgcggtg tcgaagatcg 1923061 tgggacaggc cggcggtgcg gtcaccgcaa cggtctcatt gacccaggag ttcgtcgaag 1923121 ccaactccgc cgagaaactg cgctcagtgg tgaactcgtc cattctgccg gccggtagcc 1923181 agttgagcac caaactcgtt gaccaaggtt cccaagccgg cgacctgctc ggcatcgcct 1923241 tgctgagcaa cgccgacccg gcggcgccga ctgtcgagca ggcgcagcgg gacactgtgc 1923301 tggcggcact gcgcgaaacc ggcttcatca cctatcagcc ccgcgaccgc attgggacgg 1923361 caaacgccac ggtggtggtc accggcggag cgctctctac agacgccggc aaccaggggg 1923421 tcagcgtggc tcggttcgcc gcggcgctgg cgccgcgcgg gtctggcacg ctgcttgccg 1923481 gccgggacgg ttcggcgaac cgacccgccg ccgtcgccgt gacccgcgcc gatgccgaca 1923541 tggcggccga aatcagcacc gttgacgaca tcgacgccga gcccggacga atcaccgtga 1923601 tccttgccct gcatgacctg atcaacggag gccacgtggg gcactacggc accggtcacg 1923661 gggcgatgtc agtcacggtt tcccagtagg cccgcgttag ggcgtgttcc ccgcggtgag 1923721 gcgccgtgga tgttagggtg ggtttccgtg ggtcggcagg cccagcaagg ccagagaaat 1923781 cttggcagcg tcaagaacag ccctgcccgt cttcacggag gtcgctcagt gcgaaagcac 1923841 ccgcaaaccg ctaccaagca cctcttcgtc agcggcggcg ttgcttcctc gctcggcaag 1923901 ggactgaccg ccagcagcct aggacaattg ttgacggctc gtgggttaca cgtcacgatg 1923961 caaaagctcg acccgtacct caacgtcgac ccgggtacca tgaacccgtt ccagcacggc 1924021 gaggtcttcg tgaccgagga cggtgccgaa accgatctcg acgtcggcca ctacgaacgg 1924081 ttcctcgatc gcaatttgcc cggctcagcg aatgtgacta ccgggcaggt gtattcaacg 1924141 gtgatcgcga aggagcgccg cggcgaatac ctgggcgaca ccgtgcaggt gatcccccat 1924201 atcaccgacg agataaaacg gcgcatcctg gcgatggccc aaccggacgc cgacggtaac 1924261 cgcccggacg tggtcatcac cgaaatcggg ggcactgtcg gcgatatcga gtcacagccc 1924321 ttcctggagg cagcgcggca agtccggcac tatctcggcc gggaggacgt gttttttctg 1924381 cacgtgtcgc tggtgcccta cctggcgccg tcgggtgagc tcaaaaccaa gccaacacag 1924441 cactcggtgg ccgcactgcg cagcattggg attaccccgg acgcgttgat cctgcgctgc 1924501 gaccgcgacg ttcccgaagc gctgaaaaac aagattgcgt tgatgtgtga cgtcgatatc 1924561 gacggcgtta tctccacccc ggacgcgccc tccatctacg acatacccaa ggtattgcac 1924621 cgcgaggagc tcgatgcgtt cgtggtgcgc cgactcaatc tgccgttccg cgacgtcgat 1924681 tggaccgaat gggacgacct gctgcgccgg gttcacgaac cacatgagac agtgcgaatt 1924741 gctttggtgg gcaagtacgt cgaattatcc gacgcttacc tctcggttgc cgaggcattg 1924801 cgtgccggcg gattcaagca ccgggccaag gtcgagatct gttgggtggc atccgacggt 1924861 tgtgaaacga ccagtggtgc cgcggcggcg ctcggcgatg tgcatggggt gctcattccg 1924921 ggcggattcg gcatcagggg catcgagggc aagatcggtg ccattgcata cgcgcgggcg 1924981 cgcgggttgc cggtgttggg gctgtgcctc ggtttgcagt gcattgtgat cgaggccgcg 1925041 cgatcggtcg gtctcaccaa cgccaattcg gccgaatttg atcccgacac accagatccc 1925101 gttatcgcca cgatgcccga tcaagaagaa atcgtggccg gcgaggcgga tctgggcggt 1925161 accatgcgtc tcgggtccta ccccgccgtg ttggagccgg attcggttgt tgcccaggca 1925221 taccaaacta cccaggtgtc cgagcggcat cgccaccggt acgaggtcaa caacgcgtac 1925281 cgagacaaga tcgccgaaag cggcctgagg ttttccggga cgtcacctga cggacacttg 1925341 gtagagttcg tcgagtatcc gccggatcgg catccgttcg ttgtcggcac ccaggcccac 1925401 cccgagttga agagccgacc cacccggccg cacccactgt ttgtcgcatt cgtcggggca 1925461 gccatcgatt acaaggcggg tgagttgctg cctgtcgaga tccccgagat ccccgagcac 1925521 acacccaacg gtagctccca tcgggacggc gtgggccagc cgctaccgga acctgcgtct 1925581 cgtggctgag catgatttcg agacgatatc gtcggaaacc ttgcatacgg gagccatttt 1925641 cgcattacgt cgggaccagg tgcggatgcc tggtgggggt attgtgacgc gtgaggtcgt 1925701 cgagcacttc ggtgccgtag ccattgtggc gatggacgac aacggcaaca tcccgatggt 1925761 ttatcagtac cgccacacct atggtcggcg gctttgggaa ctgcccgcgg ggttgctcga 1925821 cgtcgctggg gagccacctc atctcacggc cgcccgggag ctgcgggagg aggtcgggct 1925881 gcaagccagc acctggcagg tgctggtcga tctggacacc gcgccgggct tcagcgacga 1925941 atcggtgcgg gtctatctgg ccaccggact gcgcgaggtg ggccggcccg aagcccatca 1926001 cgaagaagcc gacatgacga tggggtggta tcccattgcc gaagcggctc gccgggtgct 1926061 gcgtggcgaa atcgtcaatt ccattgccat tgccggtgtt ttggccgtgc acgcggtgac 1926121 gaccgggttc gcccagccac gcccactcga taccgaatgg atcgacaggc caacggcgtt 1926181 cgccgcgcgg agagccgagc gatgaagacg ctggcactgc aattgcaggg ctacctcgac 1926241 catctgacga tcgaacgagg tgtcgcggca aacacattga gctcctaccg acgtgatctg 1926301 cgccgctact ccaagcacct ggaagaacga gggattaccg atctggccaa ggtcggcgag 1926361 cacgacgtca gcgagttcct ggtggcattg cggcgcgggg atcctgattc cggcacggcg 1926421 gcgttgtccg cggtgtcggc ggcacgggcg ctgatcgcgg tgcgcgggct gcatcgcttc 1926481 gctgccgcag aagggctggc cgaactggac gtggcgcgcg ccgtccggcc accgacgccg 1926541 agccggcgat tgcctaagag cctgacaatc gacgaggtgc tatcgctgct cgaaggtgcg 1926601 ggcggcgata aaccgtccga cggcccgctg acgctgcgaa accgtgcggt gctggaactg 1926661 ctgtactcga ccggggcgcg gatctccgag gccgtcggcc ttgacctcga cgacatcgac 1926721 acccacgcca gatcggtgtt gttgcgcggc aagggtggta agcagcggct ggttccggtg 1926781 ggacgcccgg cagtgcacgc gctggacgcc tatctggtgc ggggacggcc cgacttagcg 1926841 cggcggggcc gcggaacggc ggcgatcttt ctcaacgcgc gcggcggccg gttgtcacgg 1926901 caaagcgcgt ggcaggttct gcaggacgcg gccgagcgtg ccggcatcac cgccggtgtt 1926961 tcgccgcata tgttgaggca ttcgttcgcc acgcatctgc tggagggtgg cgccgatgtc 1927021 cgggtggtgc aggaattgct ggggcacgcc tcggtgacca cgacgcagat ctataccctg 1927081 gtcaccgtcc atgcactgcg cgaggtgtgg gcgggagctc acccgcgggc acgctaagcg 1927141 atgaccgtca ctagcggtag cggttgctgg tcacttggct cgcccgcgac acagaggttg 1927201 cgcctctcgc tcatggatcg tcttcgtcgc tgtcgtgcag gagtttttcg gggtgaaagt 1927261 aactgttggt gcggggttgt ccatggtcga ggtgggctgg gggaagccat tcggtggtgc 1927321 cgtctttgcg tttgcgggtg atccagcccc cggtggtggc cagttggtgg tgggggccgc 1927381 agccctgggt gagttcgttg atgtcggttt cttggcattg ggcgaagtcc gtcacatgat 1927441 gcacctcggt gagatagccc ggtacgtcgc agttggggaa cgagcagccg cggtccttgg 1927501 cgtagaggac gattcgctgt ccgggtgagg ctagccgctt ggtgtgatag agggccagct 1927561 cgcggccgtg gtcgaagata cgtaggtagt ggttggcgtg gctggccagc cggatcacgt 1927621 cgctcatggg cagcagggtg ccgccgccgg tcagcgcgtg gccggcgcgt gattgcagtt 1927681 cggtcaggct ggtggacacg atgatggccg cgggtagccc gttgtgttgg cccagctccc 1927741 ccgagcacag cagggcccgc agcgcggcca gcaggccgtc gtggtggcgt tggccggcgc 1927801 tgcgggtgtc ggcctcgatc gcggcctgtg acggggtgcc ggccaggcag ggggtgtcat 1927861 cggcggggtt ggccatgccg ggggcggcca gcttggccaa cacggcgtcg acggtggcgc 1927921 gggcttcggg ggtcaggtag ccgctgatcg ccgacatgcc gtcggggcct tggttgccca 1927981 ggatgatgct gcggcggcgg gcgcggtcgg tgtcgttgta gttgccgtcg gggttcaaac 1928041 agtcggcgag tttggtggct agtttgtgta gttggtcggg gcgaaaccgg ccgcctaggg 1928101 tggccagctc ggcttcggct ttctcccggg tgggtaggtc cacatggtgg ggtagctggt 1928161 gcaggaagca gcggatgacc tgcacgtggg cggggccgag gtggccggcg cgttgggcgg 1928221 cggcggtggc ggtcagcaac gggggcaggg gttggccggt tagcgtgcgg cgtgggccca 1928281 ggtcggcggc ttcatggatg cgtcgggatg cttcgccgcg gctgatgtgt agccgttcgg 1928341 ccagggcgaa gggtagtttg ccgcccagtt cggtttggtc ggtttggtcg gcgagtttgt 1928401 tgatgaaggg gtgttcggcg gcgggtaggc gccgacggat cttttcgcag cgctgcagca 1928461 ttgccaggca ttccgggatg gtcaggtcgt caggggagac cttcaggacc cggttaaggg 1928521 cggtgtcgag gttgtcgaac gcggcgacgg cctcctcccg gctactcgaa tacatgttcg 1928581 aatactatca cggttagccg gccgatgcca tgctgattgt gggttaatcc aatgtggtgc 1928641 agttgaattc aggagcatcg ccagccgcga ggccacgcct attcggcgag cataatggtc 1928701 ggctcggaga catccagcaa catgaggcga tgaagacatc acgtgcgatg ggtggtcacg 1928761 gtgggcagct ctgacgcgct gtttcgcgta gtcgacggcg tgcaggtagc cccggccttg 1928821 acacgttccg gcccgctcaa gcgagtagtc cgcggatgtc gtcgacggtg ggtacggagc 1928881 cgaaggcgtt gccgtcgtcg acgacgctgg cgaataggtt tgaggtccag cccgaagccg 1928941 cgggcttgag gctgatgagg aaaaccggcg cgttgcgctc gttgaactgc tggatcacat 1929001 tggggcgtca tcgaggtcga tcgacggata catcagggaa tgcatggccg caccgtatcg 1929061 actcggtctg acagccatcc gcagccacac cgcaaccgca cgcgatgacc aatcgacgac 1929121 taaccgtcga ctaacccagg tattcggact ccaataccaa gtcgggcacc agggtctggt 1929181 attcgaggtg cgtcttgtgc tcaatggtgt tccatgacat gccttgttgc cggcgcatat 1929241 atgcacggta cttcggcgcg ccgggcaccc tgacattgtc ggcaaccacg atcgagcccg 1929301 ggtgcaacca gccccggtct aggatgctct gcagatcggg caggtaagcc ttcttgtcat 1929361 ggtcgaggaa cacaaaatcg agtgtgccag ttgcgaatcc gtgctcggtt agcgcgtcca 1929421 gggtgcgccc accgtcgccg atggtgccga ccacgcacac caccctgtca tcgacgccgg 1929481 catgcgccca tattcgccgg gcgttgctgg cgttggcttc ggcgagttcg acggagtaca 1929541 ccctggcctc cggagcggcc cgggcgatcc gcagcgcgcc gtagccgagg taggtgccca 1929601 actccagcgc caatgccggg tcggcgcgcc gaaccgccgc gtcgagcagc gtccctttct 1929661 cgtcaccgac gttgatgagc atcgacttct cataggcgaa cttgtcgatg gtggccagca 1929721 cgtcgtcgat gttgccggcc ccggcgtggg cgaggacata gtcgacggcc gccgcttcgc 1929781 gtccatcacc gatctggccc gtcgtggtga tattgcggat cccggccgcc atccgccaga 1929841 ccgaccaccg caacggggca atgcgcgctt tgcgaatcat cgctcgctag cttacgcaca 1929901 gatttcgcgg acctgcgggc acctggttca cctgctgaca ctggctcgac gacgaccgca 1929961 cttcggagtt tgggccgcgc gtggattttc attgcaagcc tggccatacc gcggccgagc 1930021 tgctgacgaa ccccgacgac ctggcagtga aaaccaaagc tgcggcggct ctgccggcgc 1930081 tgggtgacga gccaacccac ggcgagcagc acgaaccata gcgggaacca cgccaacgcg 1930141 gttgcggttt cggtttcggt ggtaagtgtc cagatcacga acgcgaaaaa caccagcacg 1930201 gcccagcaca tcaccacgcc accgggcatc ttgtacaccg agtcggtgtg acgctgtggg 1930261 tgtcggcgac ggtagacgag gtagctgatg atgatcattg cccacacaaa catgaacagc 1930321 agggatgaga ccgtcgtgac gagtgtgaac gccccaatca ccgaccgacc ggcatagagc 1930381 agcgggatgg aggtcagcag tagcggagcc gtcagcagca gggcgggtgc gggcacgccg 1930441 ccgcgattga gttggtggaa agcggccgga gcgtggcctt cgtcggcgag gccgaaaagc 1930501 attcgcccgg tggagaagaa gccggagttc gctgacgagg ccgctgcggt gaccacgacg 1930561 aagttgacga ccgacgccgc agcggcaagt ccggctaggg agaacatcgt cacaaacggg 1930621 gactcgccac tggcgaactg ccgccacggc acgacggcca ggatcgccag cagggcaccg 1930681 atgtagaaca ccgcgacccg caacggcacg gcattgatcg cgcggggaag ggtgcggcgc 1930741 gggtccgctg tctcagccgc ggcggtgcca acgagctcca caccgatgta tgcgaaaaac 1930801 gcgatctgaa agccactgac cacgcccagg aaacccgttg ggaagaaccc gttgtcgttc 1930861 cacaggttct cgatggtcgc gtgcacacca tgaggggaga cgaagttggt tgccaccagg 1930921 atcgcgccga cggcgatgag gcacacgatg gcagcgacct tgatcaatgc gaaccaaaac 1930981 tccagctccc cgaagtggcg gacgctgaac aaattgacag cgagaatcag ggcgaccgtg 1931041 accagggccg ggacccagat tggcaagccg ggccaccaaa acctggcata gccggtgatc 1931101 gcgacgaggt ctgcgatccc ggtgaccacc catgcgaacc agtacgacca ccccacgaaa 1931161 aagcccgccg ccgggccccg gaggtcggcg gcgaagtcaa cgaacgactt gtagttcagg 1931221 ttcgacagca gcagctcgcc catcgcgcgc aacacaaaaa acacaaaaaa cccaatgatc 1931281 ccgtagacca ccatgaccgc cggaccggcg agcgagatcg ttcgcccaga tcccatgaat 1931341 aggccggtgc cgatcgcgcc tccaatcgcg atcaactgaa tatggcggtt ggcaaggtcc 1931401 cgacgcaggt gcggctgggt gtctgtcggg tcggcagccg cgatatcgtc cggcatatat 1931461 ggcgtcctcg agttctgggg tagggaaggc ctcgcgttat ccggcaaacg gcggccggga 1931521 catcaccgta acccggaacc cgtagcgggg acccgcaccc cccgtaccgg tgcccgaacc 1931581 ggctagcggc atgccgccca acaggtttcc cgccgcaccg gcctccggtt gctcgacgat 1931641 atcgctgacc aggggtgcgg aggccgaacc cacggtcggg gctaggctcg gactggcccc 1931701 tgcccagttg ggcggcagcg ataacttgcc gatggtggcc gcgttgccta gacccgcgga 1931761 tacgggtccg gtaccgccaa ccgcggcgcc gacggccgcc ggcgccgcgg cggcagcttc 1931821 ggcggcctcg ggaccgatcc atcccagcgc ccgccacgat gtaataaggc tgttgccaat 1931881 accaatggcg aaatatggca aacccacggt gttgtaaaac agctgtgata tcggcagata 1931941 ccagttgatg aaccattcca gccaccccgg ggtcgcggcg gcggtcaacg cggacgacag 1932001 gggcgaggtg aggcccagca gcgtgttggg caagtgggcg atcagctccg ctattgcgct 1932061 ctgcgccgcg ccggctgagg tgccggcggc tttggcgact gcggacaact gcgtcgccgc 1932121 ggcggatggg ctggtggtgt tcggcggcgg ggcaaacggc gtcactttgg tcgcggtcgc 1932181 cgaggagccc gcgtaaccgt acatggccat ggcgtcttgg gcccacattt cagcgtattg 1932241 agcttcggtg gccgcgattg atgcggtgtt ttgaccgaac acgttatgcg tgaccagcga 1932301 cgtgagccgc gcgcgattgg ccgcgatcag cggcgggggc acaatggcgg caaacgcggt 1932361 ttcgtaagcg gccgccgccg cacgcgcctg actggctgcc tgctcagctt ggatggcggt 1932421 ggctcgcatc cacgccacat acggggcgac cgcttcgacc atcaacgtcg acgccggacc 1932481 cagccattct tcggtttgca gcgtcgtgat cacccgctcg tagccgacgg cggccacact 1932541 gagctcggcg gccagcccgt tccacgcgga cgctgcggca accatcggtg ccgagcccgg 1932601 gccgcaatac atgcgcccgg agttcacctc cggtggcaac gccccaaaat ccatcgctat 1932661 gaactcctta cctcgtcacg ggttttcggt gggctatccg acgttcggcc ggtcagccat 1932721 cacggtgagt cgtcttccat atcggcgtcc catatgggcg gcgcgactcc tgcccggagt 1932781 cggtgccccc cggagtagga ccgatgtttc agccgcctcg gcggcgctgc gaataccggg 1932841 aatcgatcgc gcgacggttt gcgcctgggg cgtggcgggc ggtgcggacc agcccggcgg 1932901 caccgacatc ggtccgatct tggcggccag agtcgcgctc gccgccaccg gtccagctcc 1932961 cagctgcgac cacgccgacc atccagcggc tccacccgcg ccggccgctg cctcggccgc 1933021 accggcttct gccaatgcgg tgctccacaa catcccgccg acgaactgca gagcattcag 1933081 cgttaaccca ccgctgtcgt aaatgaaccc ttcggcagtg gcgagggcgc ccaggaacat 1933141 catccagtat ttctgtatgt cgctccatgg aatcgccgcg gcccagctgt gctgcccccg 1933201 cagcgcggcc accgctgttg cgtggccgac gaggccggtc gcgttggtgg tttgcggcgg 1933261 tggtgcgaac ggagtcaaaa ccgtggcggg tgccgcggcg ctggcatagc cgtacatcgc 1933321 ggcggcgtct tgggcccaca tctcggcgta ttgggactcg gtggtggcga tcgccggcgt 1933381 gttttgcccg aaccagttgg tatcgacgag cgtcatcaac aaggtccggt tggccgcgat 1933441 cgccggcggg ggcaccgtca tggcgaaggc ggcttcaaag gccgctgcgg ccgccctagc 1933501 ctgcatcgcg gcctgttcgg ctagcgtcgc ggtggtactc agccagccga caaagggcag 1933561 gacggcggcc accatcgaat ccgatgccgg ccccgaccac caccgcatgt ttgtcagctc 1933621 cgagatcgcc gcaccgtagc cagtcgctgc cgacgacaac tctgcggcca gcccgtccca 1933681 ggccgccgcg gcagccatca gtggcccgga tcccggaccg ctatacatac gacccgaatt 1933741 gatctcggga ggtaacgccc caaagttgga cagggaatgc ccggcgatgc cgtcagcaac 1933801 ggcggtgacc ccaacaaggc agcaggcgac gctgcccggg gggacatgcc cctggttgac 1933861 cgggacatcg agggtcatcg aaaaccgcct cgttatgggt gggctggctc gacaccgtcg 1933921 tcgatacgat agctatgact agggcaacag tgacctagca cgttaatctc cataagagat 1933981 cttctgcgaa aaaggtttcg gccgtgtgac gcgcgtgtta ataccccata ggggtataat 1934041 cgttactgtt ggcaacgtct ggcgtcctgg ctcgggcgac acaccgtccc gatacatgtc 1934101 agcaaccggg tcgatcgtgg tgaatgcaca ggcgggcaag gcgaatgccg atgcgacccc 1934161 gacgaagtaa gagggtacgt aatcgataca ccatggggac atttgccctc catggcctca 1934221 cccatcgcct accgtcggcc tcgttgcaga cgacggctgc ccgccacccg gatgtgacgc 1934281 aattctcaat gcctgggcac taccgataac gccgacctgc cgcagctcgc gcatgtggac 1934341 gctgaaagcc cggaaggagc acaccggcat atccggcaag cccaccgcac ggaccgatcg 1934401 ccatggctct actcggtccg gagattctga gctacaagct agtgcgcggc gtttttctcg 1934461 attgccggat cgctgtggcg ctcagggcgt tacgtgaaag gttcggcagc ggtgctgccc 1934521 agcctggccg gtggcgaaca cggtcaacat ggtgaggccc tgcggcaccc gaaatgcggt 1934581 gagcagaacg acgtttggtg ccatcgcgga tagcagccag ccaagcttga acgctgcgag 1934641 cgagcccatg tagagcgttt ggtaccaaac cgatcggtgg gccaacttgc catgggctca 1934701 cagcggctat cgcgagcgtg tagccgatca tcgtccaggc gacggtggcc tgagcggcag 1934761 gggttgcctt attcatcctc ttgcggcatg gttgccgcag ggagtgccgg taagtctggt 1934821 cggcaacctg gcccgctgcg ggttgggttc ggattcgctc ggctagtaag gtgctcgcct 1934881 ggtgttacaa cgaatcgcta gagagctctt atcgggagtg gccgtcgcga tcgttgcgct 1934941 gccgctggcg atcgcgttcg gcattaccgc caccggaacg tcccaaggtg cgctcatcgg 1935001 gctctacggc gccatcttcg ccggattctt cgcggccgtg ttcggtggga cacccggaca 1935061 ggtgacgggc cccaccggcc ccatcaccgt cgtcgctacc gcaaccatcg ccgaacacgg 1935121 actcgagggt gccttcttcg cgtttatcct cgccggcgtc tttcagatcc tgttcggggc 1935181 gtgccggctc ggttcactca tccgctacgt gccccacccc gtgatctctg gattcatggg 1935241 gggaatcgcg atcctcatca tcatgaccca gctggatcag gtgcgcagca gctccctgct 1935301 cgtgttggta acggtcgtcc tgctgctggc tagcggccgg tttatcaaag cgattccacc 1935361 gagcctgctc gtcctggttc tggtcagctc ggtgctgccg ctcgcggcgc catggctgcg 1935421 cgacctgcgc gctgggccgg tctcgatcaa caggacggtc gactacatcg gcgagatccc 1935481 acaggccatg ccgtctttcg acttcccgca agtcgccaat tcgacgatgc tgcaggtgct 1935541 gctgtcggcg gtggccatcg cgctgttggg atccctcgat tcactgctga cgtcgctggt 1935601 catggacaac atcaggggca cccggcaccg gagcaacaaa gaactgatcg gccaggggat 1935661 tggaaatatc gccgccgggc tcttcggcgg gctgtccggt gccggcgcga ccgtccgatc 1935721 ggtggtgaac gtcagaaatg gtggtcagac cgccctgtcg gcggccactc acagtgtcgt 1935781 tttgttcgtt ttcgttgccg ggcttggtgc cgtggtgcag tacatcccgc tcgccgtgct 1935841 gtcggggata ctgatattgg ttgccgtcgg catgttcgac tggcacgcca tgcgcaaagc 1935901 gcatgtgtca cccaggggcg acgtcatcgt catgttcacg acgatgatca tcaccgtcgt 1935961 cgtcgacctc accatcgcgg tgatggtcgg aatcgccctc tcgctgctgg tccataggct 1936021 ccgatcccgg caacgcaaag ccaaggtcac ccaggacgac accggcacct atcgcatcga 1936081 cggtccgttg tcgttcctgt ccgtcgacgg tgtatttggc tccctgcgcg acggtcgtga 1936141 ggacgtgtcg ctggacctcc agcacgtcac ctacctcgac acctctggtg cccgggccct 1936201 gctgtatttc atcgaccact ccgagaagga cggcgtcgcg gtaagcatca agcggatccc 1936261 cccacgcctc gaaagccaac tcaccgcact cgccgacaac gagcaacgtg acaagctgag 1936321 aaccgtcctc gaatccgcct gacgcattgg ctggttgatt tgcctgcggg tctcccgggc 1936381 caggcgtcgg tagccgttag actttcctgc gatgtccccc ctgacgcccg tcaccacgag 1936441 ccacgaccgg gtatgaccga ccaccccgac accggcaacg ggatcggcct caccggacgg 1936501 ccaccacggg caatccctga ccccgcgccg cgcagctcgc acggcccggc caaggtcatc 1936561 gcgatgtgca accagaaggg tggcgtcggg aagacgacgt cgacgattaa cctgggtgcc 1936621 gcgctcggtg agtatggccg gcgggtgctg ctggtggata tggatccgca aggagcgctg 1936681 tccgcgggcc tgggcgtgcc gcactacgag ctggacaaga ccatccacaa cgtgctggtg 1936741 gagccccggg tgtcgatcga cgacgtgctg atccactccc gggtgaaaaa catggatctg 1936801 gtccccagca atatcgatct gtccgcggcg gagatccaac tggtcaacga ggtgggtcgc 1936861 gagcagacgt tggcccgggc gctgtacccg gtgctggacc gctacgacta tgtgctgatc 1936921 gactgccagc cgtcgctggg cctgctcacc gtcaacgggc tggcctgcac ggacggcgtg 1936981 ataattccga ccgagtgcga gttcttctcg ctgcgcggcc tggcattgct caccgacacc 1937041 gtcgataagg tgcgcgaccg gcttaatccg aagctggata tcagcggaat cctgatcacc 1937101 cgctacgatc cgcggaccgt caactcgcga gaggtcatgg cccgtgtcgt ggaacggttc 1937161 ggtgacttag tgtttgacac cgtgatcacc cgcacggttc gtttcccgga gaccagcgtc 1937221 gcaggcgaac ccattaccac ctgggcgccg aagtcggcgg gtgccctggc ctaccgtgcg 1937281 ctggctcgcg agttgatcga ccgatttggc atgtgaacgg ccttcagaac agcctggcga 1937341 acggtgggac ggcacccgag aacggctact cggctggttt tcgggtccgg ctgaccaact 1937401 tcgagggccc gttcgacctg ctgctgcagc tgatctttgc gcaccaactc gacgtcaccg 1937461 aagtggcgtt gcaccaggtc accgacgact tcatcgccta caccaaagcg atcggcgctc 1937521 ggctggaact agaggagacc acagcgttcc tggtgatcgc cgcaaccttg ctcgatctca 1937581 aagcagcccg gctcctgcca gccggacagg tcgacgacga ggaagacctc gcgcttctgg 1937641 aggtacgcga cctgctgttt gcccggctgc tgcaataccg ggcgtttaag cacgtcgcag 1937701 agatgttcgc cgaactggag gccaccgcgc tgcgcagcta tccacgggcg gtgtcgttgg 1937761 aggacgggtt cgtcggtctg cttcccgagg taatgctcgg cgttgacgct caccggttcg 1937821 ccgaaatcgc tgcgatcgca ttaaccccgc ggccagcccc gacggtggcc accgagcacc 1937881 tgcacgagtt gatggtctcg gttcccgagc aggccgaaca cttgctggcg atgctgaaag 1937941 cgcggggcag cggccagtgg gcgtcatttt cggagctggt cgccgactgc acggcgccca 1938001 tcgagatcgt ggggcgcttc ctggcgctgc tcgaactgta tcggacccgg gcggtagcat 1938061 tcgagcagtc agagccgctt ggcgcgctcc aggtttcgtg gaccggtgac gatgcagagc 1938121 gcagcgatga gaaggagcgg cgcttgtgac cgaacatatg cccgaacacg atccgagcta 1938181 tggcatcccg gatatcgctg agcccgcgga gctggatgcc gacgagctta agcgtgtgct 1938241 agaggcgctg ctgttggtga tcgacacccc agtgacagcc gacgcgttgg ccgcggccac 1938301 cgaacagccg gtctaccggg ttgcggcaaa gctacagttg atggccgacg agctcaccgg 1938361 gcgtgacagc ggcatcgacc tgcgccacac gagcgagggt tggcggatgt acacccgcgc 1938421 ccgattcgcg ccctatgtcg agaagctgtt gctggacggc gcgcgaacca agctcacccg 1938481 ggccgcgctg gagaccctgg ccgtggtggc ctaccgccag ccggtcacac gagcgcgggt 1938541 tagtgcggtg cgcggggtca acgtggacgc cgtgatgcgt acgctgttgg cccgcggcct 1938601 gatcaccgag gttggtaccg acgccgatac cggcgcggtg acgttcgcca ccaccgagct 1938661 cttcctggag cgcttgggat tgacgtcgct gtcggagctg cccgatatcg caccgctgct 1938721 tcccgacgtc gacacaattg acgacctgag cgaatccctg gacagtgagc cacgtttcat 1938781 caaactcacc ggtgagctgg cgtccgagca gacgctgtcg ttcgacgtgg accgtgattg 1938841 atggccgagc cggaagagtc ccgggagccc cggggcatcc gcctgcagaa agtgttgtct 1938901 caggctggaa tcgcgtcgag gcgagccgcc gagaagatga tcgtcgacgg ccgcgtcgaa 1938961 gtggacgggc acgtggtgac cgagttgggt actcgggtcg accctcaggt cgcggtggtc 1939021 cgtgtcgacg gggccagggt ggtgctcgac gactcgctgg tgtacttggc gctgaataag 1939081 ccgcgcggca tgcactcgac catgtccgac gatcgcggcc gcccgtgcat cggcgacttg 1939141 atcgaacgaa aggtccgggg caccaagaag ctttttcatg tcggacgcct agacgcggac 1939201 accgagggac tgatgctgct gaccaatgac ggcgagttgg cgcaccggtt gatgcatccc 1939261 tcccatgagg tgcccaagac gtatctggcg acggtgacgg ggtcggtgcc gcgtgggctg 1939321 ggccgaacgc tgcgagcggg aatcgaattg gacgacggac cggcgttcgt cgacgatttc 1939381 gcggtagtgg atgcgatccc cggcaagacg ttggtgcggg taacgctgca tgagggacgc 1939441 aatcgcattg tgcgccgact gctggcggcc gccggcttcc cggtggaggc attggtgcgt 1939501 accgatatcg gcgcggtgtc actgggaaag caacgcccgg gcagcgttcg ggccttgcgg 1939561 tcgaacgaga tcgggcaact gtaccaagcg gtgggcctgt gagtcgccta agcgcagcgg 1939621 tagtcgcgat cgacgggccg gccggcaccg gaaaatcctc ggtgtcaagg cgattagcgc 1939681 gcgagctggg cgcacgcttt ctggacaccg gggcaatgta tcggatcgtg acgttggcgg 1939741 tgctgcgtgc cggtgctgat ccgtccgata tcgctgccgt cgagacgatt gcgtcgacgg 1939801 tgcagatgtc gttaggctac gatcccgacg gagacagctg ttaccttgcc ggagaagacg 1939861 tttcggttga gatacgcggt gacgcggtca cccgtgcggt ctccgcggtg tcgtcggtgc 1939921 cggccgtacg cacccggctg gtcgagctgc agcgaacaat ggctgagggc ccgggcagca 1939981 tcgtcgtgga gggccgcgac atcggaaccg tggtgtttcc ggatgcgccg gtgaaaatct 1940041 tcttgaccgc ctcggccgaa acgcgggccc ggcggcgcaa cgcccaaaac gtcgcggcgg 1940101 gtttggccga cgactatgac ggggtattgg ccgatgtgcg ccggcgcgac cacctcgatt 1940161 ccacccgggc ggtgtcaccg ctgcaagccg ccggtgatgc cgtcatcgtg gacaccagcg 1940221 atatgaccga ggccgaggtg gtcgcccatc tgttggagct ggtcacgcgg cgaagtgagg 1940281 cagtgcggtg acccaggacg gcacgtgggt ggacgaaagc gattggcaac tagacgattc 1940341 ggagatcgcg gagtccggag cggcgcctgt ggtggcggta gtcggccggc ccaatgtcgg 1940401 caagtccacc ctggtcaacc ggatcctggg ccgccgcgag gcggtggtgc aggatattcc 1940461 cggcgtgacg cgtgaccggg tctgctacga cgcgctgtgg accggacgcc ggttcgtcgt 1940521 acaggacacc ggcggatggg agcccaatgc caagggcctg cagcggttgg tggccgagca 1940581 ggcctcggtg gccatgcgca ccgcggatgc ggtgatcctg gtggtcgacg ccggtgtcgg 1940641 tgccaccgcc gccgacgagg ccgcggcccg tatcctgttg cgatccggca agccggtgtt 1940701 cttggccgcc aacaaggtcg acagcgaaaa aggcgaatcc gacgccgcgg cgttgtggtc 1940761 gctgggcctg ggtgagccgc atgcgatcag cgcgatgcac ggtcgggggg tggccgacct 1940821 gctcgacggg gtgctcgccg cgctgcccga ggtgggggag tccgcgtcgg cgagcggcgg 1940881 tcctcgccgg gtggcgctgg tcggtaagcc gaacgtcggc aagagctccc tgctgaacaa 1940941 actcgcgggt gatcagcgat cggtggtcca tgaggcggcg ggcaccaccg tcgacccggt 1941001 ggattcgctg atcgagttgg gcggtgacgt ctggcggttc gtcgacaccg cgggattgcg 1941061 gcgcaaggtc ggccaggcca gtgggcatga gttctacgcc tcggtgcgca cgcacgccgc 1941121 catcgactcc gccgaagtgg ccatcgtcct gatcgacgcg tcgcagccgc tcaccgaaca 1941181 ggacttgcga gtgatatcga tggtcatcga ggccggacgg gcgctagtcc tggcctacaa 1941241 caagtgggac ctggtcgacg aggaccggcg cgagctgctt cagcgcgaga tcgaccgaga 1941301 gctggtgcag gtgcgctggg cgcaacgggt caacatctcc gccaagacgg gccgggcggt 1941361 gcacaagctg gtgccggcca tggaggatgc gctggcgtca tgggacacca ggatcgcgac 1941421 cggcccgctg aacacctggc tcacagaggt gacggcggcc acaccgccgc cggtgcgcgg 1941481 cggcaagcag ccacgcatct tgttcgcgac ccaggccacc gcgcggccac cgacgttcgt 1941541 gttgttcacc acgggttttt tggaggccgg ctatcggcgg ttcttggagc ggcggctgcg 1941601 tgagacgttc gggtttgacg gcagcccgat ccgggtcaac gtgcgggtgc gagagaagcg 1941661 ggccggcaag cgccgctgag cgcacctcga acgtgtgacc cgggtaaccg gggatggaca 1941721 gcgaggccgg ttctgctgtc ccataatgcg gctatgttca gctgcattac gggatttagg 1941781 tgttgacacc cgagcgctcg gcgcttacgc tttctcgtat aacgggtgat aagtaccgta 1941841 ttgcgggagt aggtggagga aatggcgctg gctcagcagg tgccgaacct gggtctggcg 1941901 cgcttcagcg tgcaggacaa gtcgatcctg atcaccggcg cgaccggttc gttgggccga 1941961 gttgccgccc gggcgctggc cgacgcggga gcgcggctga cactggccgg cggcaactcg 1942021 gccggtctgg ccgagctggt caacggcgcc ggcatcgacg acgccgccgt cgtgacctgc 1942081 cggccggaca gcctggccga tgcccagcag atggtcgagg cggcactggg ccgatatggc 1942141 cgtttggacg gagtgttggt ggcctcgggc agcaaccatg tggcgcccat taccgagatg 1942201 gccgtcgagg acttcgacgc tgtgatggac gcgaacgtgc ggggtgcctg gctggtgtgt 1942261 cgggcggccg gacgggtgct gctcgagcag ggtcagggcg gcagcgtggt gctggtgtcg 1942321 tccgttcgcg gcgggttggg caatgccgcc ggttacagcg cgtactgccc gtcgaaggcg 1942381 ggcaccgatc tgttggccaa gacattggcg gccgaatggg gcggtcacgg cattcgggtg 1942441 aacgcgctgg cgccgacggt gtttcggtcc gcggtgaccg agtggatgtt caccgacgat 1942501 ccgaagggcc gggccacccg ggaggcgatg ctcgcccgga tcccgttgcg ccgcttcgcc 1942561 gaaccggaag acttcgtcgg cgccctgatc tatctgctca gcgacgcctc gagcttctac 1942621 accggccagg tgatgtatct ggacggcggg tacaccgcat gctgacctcg cacgggttct 1942681 cccgtgccgc cgtcgtgggt gccgggctga tgggccggcg catcgccggc gtgctggcct 1942741 cggcgggcct ggatgtcgcc atcaccgaca ccaacgctga gattctccac gccgcagcgg 1942801 tggaggccgc ccgggtagcc ggtgctggcc gtggctcggt ggccgcggca gccgacctag 1942861 ccgcggcgat accagacgcc gacctggtga ttgaggccgt cgtcgaaaac ctggccgtca 1942921 agcaggaact cttcgaacgg ctggcgacac tcgcgcccga cgcggtgctg gccaccaaca 1942981 cctcggtgct gccgatcggc gctgtcaccg aacgggtcga ggacggcagc cgagtgatcg 1943041 ggacacactt ttggaacccg ccggatctta tcccggtggt cgaggtggtg cccagcgcgc 1943101 gcaccgcccc agatacggcg gatcgcgtcg tggcgctgct gacccaagtc ggcaagctgc 1943161 cggtgcgggt cgggcgcgac gtgccgggtt tcatcggcaa ccggctgcag cacgcgctgt 1943221 ggcgcgaggc gatcgcgctg gtcgccgagg gtgtctgcga cccgaagacg gtagatctcg 1943281 tggtacgcaa caccattggg ctgcgactgg ccaccttggg gccgctggaa aacgccgact 1943341 acatcgggtt ggacctcacc ctggccatcc acgacgcggt gatcccgagc ctcaaccacg 1943401 acccgcaccc cagcccgctg ctgcgggaac tggtcgccgc cgggcaactc ggggcgcgta 1943461 ccggtcacgg ctttctggac tggcccgcag gagcccgcga ggccaccacc gcccgacttg 1943521 cccagcacat cgccgcgcaa ctccaagcca acgaaaaagg aagggggaca tagccatgac 1943581 gttcgcctgg cccctcggtg ccgccgaatc gacgttggag ttctacgacc tgtcccaccc 1943641 ctggggacac ggcgcgccgg cctggccgta cttcgaggac gtgcagatcg aacgactcca 1943701 cggcatggcc aagagtcgtg tgctgaccca aaagatcacc accgtcatgc attccggcac 1943761 ccacatcgac gcgccggcgc acgtggtgga aggaacaccg tttctggacg agatcccgct 1943821 gagcgccttc ttcggcaccg gcgtcgtcgt ctcgatcccg aagggcaaat gggggatggt 1943881 caccgccgag gatctgcaaa acgctacccc cgacatccgg cccggtgaca tcgtcgtcgt 1943941 caacaccggc tggcaccaca aatacgccga cagcgccgag tactacgcct attccccggg 1944001 cttcgacaag aaagcgggcg agtggtttgc ggccaaaggc gtcaaggcgg tcggcaccga 1944061 cacccaggcc ctggaccatc cgctggccac ggccatcgcc ccgcacagtc ccgcggaggc 1944121 acagggcggc ctattgccgt gggcggtacg cgaatacgag gcgcagaccg gccgcaaggt 1944181 gctcgacgac ttcccggact gggaaccgtg ccatcgggcg atcctgtcgc agggcatcta 1944241 cggctttgaa aacgtcggcg gtgacctgga caaggtcacc ggcaagcgcg tcactttcgc 1944301 ggcgttcccg tggcgctggg tgggtggcga cggctgcatc gtgcggctgg tggcgatcgt 1944361 cgaccccacc gggagctatc gcatcgagac cggaaaggcg gtctgatgaa actgacacga 1944421 gcgtcgcagg cccccaggta tgtggcgccg gcgcatcacg aggtgtccac catgcggttg 1944481 cagggccgcg aggcggggcg caccgagcga ttctgggtgg ggctgtcggt ctatcggccc 1944541 ggcgggacgg ccgagccggc gccgacccgg gaggagaccg tctacgtcgt gctcgacggc 1944601 gagctggtgg tcaccgtcga cggcgccgaa accgtgttgg gctggctcga cagcgtgcac 1944661 ctcgccaaag gcgaactgcg atcgatacac aaccgcacgg atcgtcaggc gctgctgctg 1944721 gtgaccgtcg cgcacccggt tgccgaggtg gcgtgatgag ctgcaccggc gacgatgcag 1944781 agcgaagcga tgctgaggag cggtgcgaat gagcatcgtc atcaccgtcg cacccaccgg 1944841 ccccatcgcc accaaggccg acaacccggc gttgccgacg agccccgagg aaatcgcgac 1944901 agccgtcgag caggcctacc atgccggtgc cgcggtggcc cacatccacc tgcgcgacga 1944961 aaacgaaagg cccacagcgg atccgaacat cgcgcgccgg gccatggacc tcatcggcga 1945021 gcggtgtccg atcctgatcc agctgtccac cggggtcggc ttgacggtgc ccttcgagca 1945081 gcgcgagcaa ctggtcgagt tgcgcccgcg gatggccacg ctgaatccgt gctcgatgag 1945141 cttcggcgcg ggcgaattcc gcaacccgcc gcaagcggtt cgtcggttgg cggcacgcat 1945201 gcgggaactg gacatcaaac cggaactgga aatctatgac accgggcatt tggaggcgtg 1945261 cctgcgactg tgggcggaag acctgctggc cgaacccttg cagttcagca tcgtgctcgg 1945321 ggttcggggc ggaatggccg ccaccgccga taatctgctc acgatggtgc gccggctgcc 1945381 ccccggggcg atctggcaag tcatcgcgat cggtaaggcc aacatggaac tgaccgccat 1945441 gggcctggcg ctgggcggca acgcccgagt cggcttggag gacaccttgt acctgcgcaa 1945501 gggcgagctg gcgccgagca atctggcgct ggtatcgcgc acgatacgtc tcgccgaagc 1945561 cttggacctg ccgatcgcct cggtcgaaga agccgaggcg gcgctgcagc tgcccggcac 1945621 gtcctgagag gagctcgctt gtgtccgccg aagagcagga cacccgcagt ggtggcatcc 1945681 aggtgatcgc gcgggcggcc gaactgctgc gggtgctgca ggcgcacccc ggcggtctca 1945741 gccaggccga gatcggcgag cgggtgggca tggcccgctc gaccgtgagc cggatcctca 1945801 acgcgctgga ggacgagggg ctggtggcct cgcgcggggc ccggggaccc tatcggctgg 1945861 gcccggagat cacgcggatg gccaccacgg tacggctggg tgtcgtcacg gagatgcacc 1945921 cgttcttgac ggagttgtcg cgcgagctgg acgagacggt ggacttgtcg atcctggacg 1945981 gggatcgggc ggacgtcgtg gaccaggtcg tgccgccgca gcggctgcgg gccgtgagcg 1946041 cggtggggga gtcgtttccg ctgtactgct gcgccaacgg caaggcgctg ctggccgcgt 1946101 tgccgcctga gcggcaagcc cgcgcgctgc cgagtcgact ggcgccgctg acggcgaaca 1946161 ccatcaccga ccgcgcggcg ttgcgggacg agctcaatcg catccgggtg gacggtgtcg 1946221 cctacgaccg tgaggagcag accgaaggca tctgcgcggt gggcgcggtg ctacgggggg 1946281 tgtcggttga gttggtggcg gtgagtgtgc cggtgcccgc gcagcggttc tacggccgtg 1946341 aagccgagtt ggccggtgct ctgctggcct gggtttcgaa ggtagacgcg tggttcaacg 1946401 gcactgagga tcgcaaatga cagaagcgtt gtgcgacaag ctcgttgggg cctgggacct 1946461 ggtgtcctac gtggagcggg ccgcggcttt ggcgttggga tacctggcct acggcggacg 1946521 gtagttcgtc gacaaggcgt agggcgtggc cgggtttgca ggccggctgc ggtaggcttt 1946581 cgacctgccg ccggtggtgt cgccggtggc accgggctgt ggcgcagttt ggtagcgcac 1946641 ttgactgggg gtcaagtggt cgcaggttca aatcctgtca gcccgactta cgtttccgca 1946701 ggtagaccgc cctgctggcg gtcctcggct gccgctgagg cagtaccgcc aaggggtatg 1946761 tacagcaacc ggtacagcaa cccggtcaaa tccccagagc accgctgaga ccttccactg 1946821 cggctcgcgc cgcttcgtcg ctggtatgac cgcgccaccg tgctggacac cgcctaccga 1946881 gaccacctcg agcggttcgt tcgcaaacca cccgagccac ccgcgctacc ggccttcagc 1946941 gcgatcaacc caccaccaaa ggaggaccag ccgactcaat gaatccccga aaatcgtgtc 1947001 tcagaaatgt tgacaggttc cgcggtagat caggcgacaa gctcgatctc cgcattatgg 1947061 ccatgggatt gggcaagtcg cccgtcgcaa gtgataagcg gtacgccgag gccctcggcg 1947121 agggcgacgt aggctccatc ggccacggta tgagtggacc gaagttggta ggcacgctgg 1947181 gtgaatggct ttaacggcca acgccgaacg ggcaggctaa ggaagttgac aaccacgacg 1947241 agtccttcat gatcgctgat cagctgacgc acgaccgctt gacgtatcgc cccgatcacc 1947301 tcgacatcga aatgtgcagg ggcgtgcacg gtttcgcccc gcaagcgccg ggcgaccgcc 1947361 gcacccgccg gcgtcgtgag catgagctcg acggccgccg aggcgtccaa cacgatcact 1947421 cagatcgagc ctcgtcaaca agctcggctg cgctcgcgcc gagatcccgg cgcggcagtg 1947481 ccgctagacg gtcgagaacg tcgtcgaggg ccggttcttc cgcgatctcg gcaagccgtg 1947541 ctaggaggaa atcgctcagg ctcatccgtt gcgccgctgc gcgggccttc agctcgtgga 1947601 gaagctcgtc gggaacgttg cggatctgaa ccatggcgga catgttgtaa gcatatcgga 1947661 catgtgaaac acatgtccgg ttgccggtgt gaccggctgg ggcgtgtagg cgtcaaccca 1947721 cgccgtgcac gcggccatgg gcgggtgcag acttttgcca tgcaaccatg tgagctcacc 1947781 gccgtcgcgc tgaccgcaac gcccccgccc gcgcctccgt ccctgcgccg ggcaccggcg 1947841 tcgacgtcac cgcggctggc gtgatcgtgc ccgcccgcga gcctgagccc cagccgcgcc 1947901 gcgtgctgaa cggcctttcg gacgtacgcg cgttctttca caacaacacc gtgccgctgt 1947961 acttcatctc gccgacgccg ttcaacctgc tgggcatcta tcgctggatc cgaaacttct 1948021 tctacctgac ctactacgac tctttcgagg gcgaacattc gcgcgtgttc gtgccccggc 1948081 ggcgcgaccg cagggatttc gacggcatgg gggatgtgtg caaccacctg ctgcgtgatc 1948141 ccgagacact cgagttcatc aagaacaggg gtcccggtgg caaggcctgt tttgtgatgc 1948201 tggacgaaga gacccaggcg cttgcgcgcc aggcggggct cgaggtcatg caccccccgg 1948261 cggagctgcg tcatcgcctg gaatccaaga tcgtcatgac gcgcctggcc gacgaggcgg 1948321 gcgtacccag cgtgccgcac gtgatcgggc gggtgagctc ctacgacgaa ttgtcggcgc 1948381 tcgcgcacgg cgcagggctg ggagacgacc tcgtcgtcga ggccgcctat ggcaacgccg 1948441 gcagcgcaac gttctttgtg cgcggattgc gcgactggga ccagtgcgcc ggtggcatag 1948501 tggggcagcc ggaaatcaag gtcatgaagc gcatccgcaa tgtcgaggtg tgcatcgagg 1948561 ccaccgtgac ccgccacggc accgtgatcg gcccggcgat gacgagcctg gtcggttacc 1948621 cggagctgac tccgtaccgg ggcgcctggt gcggcaacga tgtttggcgt ggggcgctac 1948681 cacccgcaca gacccgcgcc gcgcgagaga tggtggcaaa gctgggcgac gtcttgagcc 1948741 gcgagggcta ccgcggctac ttcgaggtgg acctgttgca cgacctggac gccgacgagc 1948801 tctacctcgg cgaggtgaac ccgcgcctct ccggtgcaag cccgatgacg aacctgacca 1948861 ccgaggccta cgccgacatg ccactgttcc tcttccacct gctcgagtac atggacgtgg 1948921 actacgagct ggacatcgag gcgatcaact cgcgctggga gcggggctac ggcgaggacg 1948981 aggtctgggg tcagctgatc atgtcggaga cctcgccgga cctcgagctc ttcaccgcga 1949041 ccccacgcac cgggatgtgg cgcctgaacc acgacgggcg cgtctccttt gcccgccagg 1949101 gcaacgactg ggccacgatg ctcgacgagt ccgaggcctt ctacatgcgg gtcgccgcac 1949161 cgggcgacct acgctgcgag ggcgcccaac tcggtgtgtt ggtcacccgc gggcacctgc 1949221 agaccgacga ctaccagctc accgagcgcg gccggcgctg gatcgacggc ctcaaggcgc 1949281 agttcgcctc gacgccgctg acgcccgccg ccccgatcgt ctcgcggctc gtcgcacggg 1949341 cgtgagcggc ggcgtcccgg ccggtctcgc actggacaac tggctgtcgt cgccgtattc 1949401 gcattgggca ttccagcacg tcgaagactt catgccgacc acggtcatcg cgcgcggcac 1949461 cgagccggtc gtgacgttgc ccgcggacaa tgcgccgatc gccgacatcg gcttgaccag 1949521 cacggacggg atcgccacca ccgtgggcgc ggtgatggcc gccaccgcta ccgacgggtg 1949581 ggcggtcgcg catcgcggtg cgctggtggc cgagcagtac ctcgacggcc tgggaccccg 1949641 gacccgccac ctgctgttct cggtgagcaa gtcgctggtg gcggctgtgg tcggcgcgct 1949701 gcacggggcc ggggcgatcg agcttgacgc gccggtcacg gcgtacgtgc ccgccttggc 1949761 ggactgcggc tacgccggtg cgacggtgcg ccacctgctg gacatgcgat cgggtgtcgc 1949821 cttctcggag aactacgacg acccggccgc cgagattcac gtgcgcgagc aggtgatcgg 1949881 gtgggcgccc aagcgcggtc cggacctgcc cgccacgctg cgcgactacc tgctgacctt 1949941 gcggcggaag tcggcgcacg gcggcccgtt cgaatatcgc tcgtgtgaaa ccgacgtcct 1950001 cggctggatc tgcgaggccg cggccggaca gccgatgccc gaactgatgt cggaactact 1950061 gtggagccgc atcggggccc agtgcgatgc caccatcgcc ctagacgtag ccggcgcggc 1950121 gggcaccgga atattcgacg gcggcatcag cgcctgtctg accgacatga tccggttcgg 1950181 gtcgctgtac ctgcgcgacg gtgtctcgtt ggccggccag caagtggtgc ccgcggcctg 1950241 gatcgccgac accttcgacg gcggccccga ctcgcgtcag gcgttcgccg ccagccccga 1950301 cgacaacccg atgcccggcg ggatgtaccg caaccaagtg tggtttccct acccgggcag 1950361 caatgtcgcg ttgtgcgtgg gcatgtgcgg ccagctgatc tacgtcaacc gcgccgcgga 1950421 ggtggtcgcc gccaagctgt ccacccagcc gcactcccat gagccgcaca tgttagacac 1950481 cctgcgcgca ttcgatgcgg tggcacacga attgtcagga atcagatcga gttcgaccaa 1950541 cgacccgcag cggccttccc cgccagccca ggaggccagt ccggggtaac ggcttgtgcc 1950601 cacgtaaccg agttccaggg cgatgggctt attagcggaa atatgactcg tcccaggtat 1950661 ccatacgacg cttgcgtacc tcggcgagct tgtggtcaag cgccgcctgc tcattttcga 1950721 tggcacgacc ggcgtttctc acggcgttgt agacggcatc gtccagtttg catagatcct 1950781 ttgcggacac gtcggtcgat acgaaaaccg agaaccgaat acggtcgtcg agcagcgaaa 1950841 tgtcgatctt tggtgatggt ttgagttggc gctggaagtg tttggctagt gcttggatcc 1950901 aatactttgg ccaatgcggc accggtctca gatcgtagac gatgatggct tgctcgccgt 1950961 tgtagacgcc gatgggcgct tccgctcggc tgaagcatgg tcgccgcagg ttccgcaggt 1951021 cctggagttc gttctcttca ttacccacca tgagcctccg gcatctggtc tacggacacc 1951081 acggcttgcc gcatggctgg ggcgaaggga ctccaagcca tccaggatgg gaacgcgcgc 1951141 cgcatcgccg gcaggccgtc cagttcgatg cgctcggcgg agatctcggc ggccagtgtg 1951201 ctgcggccgc tgtagacccg atacagatct cgaggatggc cgcgcaccgt gaggtcgaca 1951261 ggtaggcatg gatcgtgcag gcacaccgag atgtccccag gttccaacac gagccaggcc 1951321 cacagtggcc gctcgccgtg gtagcggaac tccaccacca cccgccggcc gggaagggcc 1951381 tcggtgttga cgcgccggga gatccacaac gtgagtagtt cggggtcgca ttcggcggga 1951441 gtggggtcgg ccatcaacca acgggagacc cagtccccca gggtctgcag cacggggcgt 1951501 agctcctcgc cggccaccgt gaaccgatag cccccgcccg tgtgttcggg gaccgcttcg 1951561 atgatgcggt cgtgctgaag tcggcgtagc cgctgggcca gcaccgagcg ggagatgccg 1951621 ggcaggcccc gctcgatttc ggtgaaccgc agcgggccga agagcagctc ccgcacgatt 1951681 agcagcgtcc agcggtcccc cagcagctcc gccgcccgcg ctaccgggca gtactggccg 1951741 tacggctgca cgacaccagg ctagtcgcca tccctggctg cgtggttcgg aattcgaact 1951801 tcccgcaccc cctgtgggag gcgtaacgct tggtgctgga ggtgagaggc gatgaccgcg 1951861 acgctgacca agacgctggg ttccctcgac gatttcaggg gaacgctttg tgtccccggt 1951921 gatccggact accccagggt gcgggccatc tggaacgggc aggtggcccg cgaaccggcc 1951981 ttgatcgcca cgtgccacga cgcgtgcgat gtccgaacgg tgctgcggcg cgcggtggac 1952041 gccgggatgg tgaccgcggt acgtggcggc gggcacaacg tggccggcac cgcgctgtgc 1952101 gacggcggcg tggtgatcga cctctcggcg atgcgggccg tctcgctgga tccagcgact 1952161 gggcgggtac gggtgcaggg tggtgccacg ctcgccgatt tggaccacgc cacggtcccg 1952221 ttcgcccggg tggcccccgc cgggatcgtc accaccaccg gtgtcggcgg gctgacgttg 1952281 ggcggcgggg tgggttggac gactcgacgt ttcggactga gctgcgacaa cctggtcgcg 1952341 gtgcggctag tcaccgccgc cggcgactac ctaagcgtcg acgacgagcg cgacccggag 1952401 ctgatgtggg gcctgcgggg cgggggcggc aatttcggca ttgtcactga attcgaattc 1952461 gccacccatc cgttcggtcc ggtcgccgtg gccggcttcg tcgtctaccg gctggatgac 1952521 gggcccgcgg tgcttcgcgg ctaccggcag ttcgccgctg cggcacccga ggaggtgacc 1952581 acgatcgtgg tcttgcgcca cgccccgccg gcaccgtgga ttcccgttga ccagcgcggc 1952641 aagccggtgg tcatgatcgg cgccgtccac accgggagca tccagaccgg gatcgaagcg 1952701 ctgcgaccgg tcaagtccct cgccagaccc gtcgccgaca ccgtgtggcc gaccccgttc 1952761 ctggcccacc aggcggtgct ggacgcctcc aacccggccg gtcaccgcta ctactggaaa 1952821 tccgaccact tggccgagct gaacgacgag gccatcgact tgctagttga gcagacggcg 1952881 cagctgtcct cgccggacag cctcatcgga atcttccagc tcggcggcgc cgccgctcgc 1952941 ggcggtgagc gttcctgctt cccgagccgg cacgcgcgat tcatggtcaa ctacgccacc 1953001 cattggaccg aggcccgcga ggacgacctt caccgccaat ggacccgcga cgcgatcgag 1953061 gcgctggccc cgtacgggct gggcaccgcg tatgtgaact tcaccgccga cgacgcaccg 1953121 atgcacgtcg aaacacttta cagcacaacg gagttcagtc gtttggtgac cctcaagaac 1953181 cgactcgacc cggacaacgt gttccgcaat aaccacaaca tccgcccctc ggcatgaggg 1953241 ggcccaagtt gaccgtagga aggacgatca tggacctcta ttcaaacctc gtcgaagccg 1953301 aacaacgcct ggtcgcgctg gtttcgtcga tagaagccga cagctactcc tcgccgacgc 1953361 cgtgcgaccg ctgggacgtg cgggcgctgc tcagccacgc gctggcctcg atcgacgcct 1953421 tcgcggcggc cgtcgacgga gcacccggac cggacatggc gcaggtgttc agcggtgccg 1953481 acatcgtcgg ggacgacccc ctcggtgcga cgcagcggat cacccggcgg tcgcaggcgg 1953541 cctggtcgac cgtgcgcgat ctgaacgcgg agctgtcgac cttcatcggc gtgatgccgg 1953601 cggggcaggc tcttgcgatc atcaccttct ccaccgtcgt ccacggttgg gacctagcgg 1953661 tggccacggg ccaggccggc gaactcccgg agcacctggc cgaagcggcc caacaggtgg 1953721 cggccgaact ggttcccgtc ctgcgtccgc ggggcctgtt cgcacacgac gtcgacctag 1953781 cgggggaagc cacgcccact cagcggctcg tcgcccttac cggacggaaa ccgcggtgag 1953841 ctgcgtttgg ttgtcgcgtt cgatcattct ggcggcgtag ggctcatgga tccgacgtag 1953901 taggtttccc gcccggttgg gtatccgccg ccgtcggtgg cgatatggac gtggtcgtag 1953961 tggttgaggg tttctgagcc gtagtccgcc gtccagctcg gcgcgccgat gcctgggtag 1954021 tagccctgcc gccagatcac atggagcact ccccatcgtt tcgcattcgc caaggcaagt 1954081 ccggcgactt ggttgccgag ctggataccc tcgtcgctgt gatggttcgg gatcatcacg 1954141 tcgatcgcta acccgttggg atgccacttc aagggatcct gcctatagcc aaagatgttg 1954201 gtgatctgag gaaatagcac agagacggca cgggctaccc agatcgtctt gacctgcaac 1954261 ccctcttccg acgcaacgcc agcaggtagc gcgaactgga attgctgggc agcgacaggc 1954321 gcgcttgccg ccaacaagtc cgcttccgtg ggactggcga tgcgcggtgc gttggccggc 1954381 gcagagtcgg ggcctgtcgg gattgccgcc ggggtctcgc ggcagcacgt atgctctgcg 1954441 ccttgggcat agaggatggc ggcagagacg acgagcgagg ccgcgattgc caaccagcgg 1954501 ccccggccgt tggccaacac gcctttgctc acgaacagca ctttagtgtg tcgtgtgcga 1954561 cgcgtgtggc aacctttgct atcgattggt tgcagacccg cgttgtgcgc accgggcaag 1954621 ccgttcacgc tcatcgccaa cccgctgccg tcggcggtga aatggaagag tggtcggtca 1954681 ggcagccgct gatgaagatg gtgtcgtcgt ctgcagcgcg cacatcgaga ccgctgcgcc 1954741 ggaacagttc tgccagcgac acgccttcgg gttgccagcc cttggcggcg aggtagtcga 1954801 ggacgtggtt gcgtgggccg gtgtagacca acgaagccaa gtcgacgtcc acgccatgac 1954861 agcggaacgg gttggatatg gtccgcgctc gttcggcgct gaaatccgct ataccggtaa 1954921 caaattcggt ggctaccatg ctcccgggcg cgctgagtgc ggtgatgttg tcgaacagcc 1954981 tgtcttgagt ttgcggctta agatatatca gtaacccctc ggccagccac gccgtcggtg 1955041 ctgccgagtc aaacccggcg gcttgtaatg ccgtcggcca gtctgcgcgc aagtcgatgg 1955101 gcacggcgcg ccggatggcg gagggttcgg cgcccaggtc ggctaaggtt gtcgtcttga 1955161 actccatcac ttttggttgg tcgatctcgt ataccaccgt cctggtcggc cacggcagcc 1955221 ggtaggcccg ggagtccaac ccggacgcga ggatagcgac ttgccgaatc cccccagcgg 1955281 tggcgttaag gagatagtcg tcaaagtatt tggtgcggac cgcgtttccg tacaccattg 1955341 cctgcgccac ggccggtgaa acgtccgcga tcgtcgacat gtcgagttca ccgtccatca 1955401 tcttggtgaa caaatccagc cccaccgcac ggaccagggg ttcggcgaac ggatcgttga 1955461 tcaaaccgcg cggatccttg gtggccagcg cacgcccgac cgcgacaatg gtcgcggtga 1955521 cgccgacgct agacgtcaga tcccagttgt cgtcgtcggt gcgggccacc agcccaccct 1955581 agtctgattg cccggttcct cctcgcgccg caaacggcgc gcatcgtcac cgggcgtcgt 1955641 ctgattgccc ggttcctcct cgcgccgcaa accaagccgg ctggtgctgt gctattggcg 1955701 tcggaacaga cggccgtgct ggctacagaa ccaggcgatg ttgccgtccg gcccgcgcac 1955761 gaagttggag cgactgccgg tgggcttgtt gtcgggtcca aggtcgagcc catagtcggg 1955821 ccgatagaag gcgaggccca gattggcgct gttttggcca tccgggttgg catcgtcggt 1955881 gctcatgctt ccagcaagct ggccgtccct ggcccggaag tcgatgaccg ttgtctcgag 1955941 gtcgccattt tgggcgactt gcttggcgat gtaccggccc tcgtagggcg ccaggtcgac 1956001 ggcaccaagg cgttgcggcg tggccggaag attgctgagc ccggcgaatc tctgcaatgc 1956061 ccagtcggat gcgaaaaggt cgttgatcat atgaaatccg ccatcagagt tagtgagcac 1956121 ggtcatggcg aagtttcgat cgggcaccat gacgaaccca gagcgctgcc ccttccaggt 1956181 gccgccgtgc tcaacgatgg tcacattctc cgcggagggc cgcagcatcc aggtcacgcc 1956241 catcccggtc agttccaccc aaagtgttcc gcccgcccca gggttagagc gcattgcctt 1956301 cagcgattgt cggctcagaa tctgctcacc gttaggcgcc ctgccgtcgc cgaggtggaa 1956361 ctgtgcgtaa cgcagctgat ctcgcgctgt ggacatcaac ccaccggtgg ggttgcagct 1956421 gcgcgggaat gtccaaaagt cagtaacggc aatcggtttg ccgtcgacca cgctatgcga 1956481 tgcggccaca ttcagaccga ttatttggtc ggaaaagtag cgcgtgtgag caagctgcag 1956541 cgggtcaagc aacagcctct gaaccgtaga ttcgtaggtt gttccggcga caagctcgat 1956601 gatgcggccc gcaaccacaa gacctgaatt gttgtacgcg aacgcggttc ccggaggggt 1956661 gagctgcggt aggcgtgtca tcgccttgac atagagcgcc accgcgtcat cgccgcgccc 1956721 aaagtcctgc ccattgcgac catcccagcc tgcggtatgg ttgagcagtt ggcgaacggt 1956781 aaccgtagcg ctggctgatt cgtcggctac cgcgaagtcg gggatgtagc ggcgcacagg 1956841 tgaatccagg tccaccttgc ctcgctcgac cagccgcatc atcaccgtac ctgtgaaagt 1956901 ctttgtggtg gaaccgattc tgaagacagt gtcgccgtca acaggcatcg gatggtcgac 1956961 attggtgacc ccgtagcctt tgacgtattc ttgcccgccg gcccagacag caaccgcgac 1957021 gcccggaatc gcataggcct tcatgcccgc gttgattttt gcatcgagtt cgtcgaacgc 1957081 tgcaccaggg tctgcgcagt tgacagtttc aaccactgca gtggcgattt cgtgcggcag 1957141 tcgatctagc gcacgcacgt attcggtgac gaccgcgcgc ccatggcgcg tcccgcaccg 1957201 cgtgccggtc ggcgtcgcgg aactcaagat gatcggcgga cacaaggacc gcggcgaccc 1957261 ggccggtggc ggccgatctg aacagcttcg tggggggatc cgcttcgtca accaacgcgg 1957321 aaagcatggc tttggccttc cgcggtcgcg tccacatgag tgtcaatata gctggactaa 1957381 catgaacatc gcgaggccgg ttcttcgtgg taacgtgccg ggatcccaag ggactgccgg 1957441 aagcgaattt ggttgcgccg cttggggcgt cgcgagagat tcggcaatcc cctggctgga 1957501 ggatcccgtt cagccagggc gtaggcgctg cggcgtgcac ggcttggccc cacaacccgt 1957561 attgatgcca cctgaacaag aagaacccgg cattcgtcga gaatgccttt ggtcaccaat 1957621 cgcaggccga tactctgtgc cctagacacc cgcatttctt cgaaagaggt gacgatatgc 1957681 ctgcaccctc ggccgaggtt ttcgatcgct tgcgtaacct ggccgcgatc aaggacgtcg 1957741 ccgcacgtcc gaccaggacg atcgacgagg tcttcaccgg caagccgttg actacgattc 1957801 cggtcggcac ggccgcggac gtcgaagcgg cattcgccga agctcgcgcg gcgcagaccg 1957861 actgggcgaa gcgtcccgtc atcgagcgag ctgcagtcat ccgccgctat cgcgacctgg 1957921 tcatcgagaa ccgcgagttc ctcatggacc tcctgcaagc cgaggcgggc aaggcccgat 1957981 gggcggcgca agaggaaatt gtcgatctga tcgcgaacgc gaattattac gcacgagtct 1958041 gtgtggacct gctgaagccc cgtaaggcac agccgctgct gcccgggata ggcaagacca 1958101 cggtgtgcta tcaaccgaag ggcgtggtgg gggtgatctc gccgtggaac taccccatga 1958161 cgcttacggt gtcggactcg gtgcccgcgc tggtggccgg taacgcggtg gtgctcaagc 1958221 cggacagcca gacgccgtat tgtgcgctcg cgtgtgccga gctgctgtat cgggcgggtc 1958281 tgccgcgagc gctgtatgcg atcgtgcccg gtccgggctc ggtggtgggc accgccatca 1958341 ccgacaactg cgactacctg atgttcaccg gttcatcggc gaccggcagc cgcctcgccg 1958401 agcacgccgg ccgccggctt atcggtttct cggccgaact tggcggcaag aaccccatga 1958461 tcgtggcgcg gggtgccaac ctcgacaagg tcgccaaggc ggccacccgt gcctgcttct 1958521 cgaacgccgg ccagctgtgc atctccattg agcggatcta cgtcgaaaag gacatcgccg 1958581 aggagttcac ccggaagttc ggcgatgcgg tgcggaacat gaagctcggc accgcatacg 1958641 acttctcggt cgacatgggt agtttgatct ccgaagcaca gctgaaaacc gtgtccggtc 1958701 acgtggatga cgcgacggcc aagggcgcca aggtgattgc gggcggcaag gctcgacccg 1958761 acatcgggcc gctgttctac gagccgaccg tgctgaccaa cgtcgcaccc gaaatggaat 1958821 gcgcggccaa cgagacgttc gggccggtgg tctcgatcta cccggtcgcc gacgtggacg 1958881 aagccgtcga aaaggccaac gacaccgact acgggctcaa cgccagcgtc tgggccggct 1958941 ccaccgcgga gggccagagg atcgccgccc ggctgcggtc ggggacggtg aacgtcgacg 1959001 aggggtacgc gttcgcctgg ggcagcctca gcgcgccgat gggcgggatg ggcctctcgg 1959061 gggtcggccg ccggcacggt ccggagggct tgctcaagta caccgaatca cagacgatcg 1959121 cgaccgcccg cgtgttcaat ctcgatccgc ccttcggcat cccggccaca gtctggcaga 1959181 agtcactgtt acccatcgtg cgcaccgtga tgaagcttcc cggccgcagg tgacggcgcg 1959241 gcctagcgcc acttgatgcc gcacccgatc gacggtcgtt ggtcggggtt gactggccgc 1959301 ccggcgagca gggcgtcgac cgcggcccgg acgtcggcgg ccgtcaccgg tcggccattg 1959361 cccgggcggg agtcgtcgag ctgaccacgg tagacaagtc ggcgctggcc gtcgaagacg 1959421 aacgtgtcgg gtgtgcaggc cgcggagaag gcgcgggcga cgtcttgggt ttcgtcgtag 1959481 agatacggga acgtccagcc gtggcggcgg gcctcggcga ccatctgatc gggcccgtcc 1959541 tgcgggtagg tgacgacgtc gttactggag ataccgacca tcgggacgcc ttgatcggcg 1959601 aggtcccggc cgagcgtggc caatccggcg gcgacgtgtt gcacgtacgg gcagtggtta 1959661 cagatgaagg tgacgacgag ggcgggaccc gtgagctcgt cgaggctgac cgtggcgccg 1959721 gtcgccggct ggggcagtgt gaacgacggc gcgggggtgc cgagggcgag catgctggat 1959781 tcaacggcca tgccgtccag agtacggtcg cggtccagct tggcggagcc ctggttgccg 1959841 ctaccggacg gttgtcaccg ctgcgtgcag aacaggctgt cgatgtcgtg ttgccaactg 1959901 gcgttgcgaa cgcggatcag aatcgcccga gtgagcgcca gcagggcgcc cgcaaccgcg 1959961 gcgacgctca accagagtcc caaggcggcc agggccgcat ccgcaatggc acgggccggc 1960021 ggagctggtt catcgaccag ctgaccggca ctgtcgaccc aaatgccgac gcggtcaccg 1960081 gatttggttc ccggcttcgc gttgacctca ccgctgcgtt ctattccgtt cacgacccat 1960141 cgggcaggca cggtgatctt cgtgcgcggc ggcgctgacg tggcggtcgt gttgctgtcg 1960201 atcaccccct cgtgatcgat cacggtcgcg gttgcgggat ggcgggtctg ggcctggtgg 1960261 gcatagacgt ggctgcggga atcctggact gcggtgccgg ccgcggcggc gaacgggata 1960321 gtcagcagcg agaccgtgac ggccagcagc atgacgaccg cctcgagtcg atccgtccca 1960381 cgcaccagcg gattgcggct gaacacccgc agtatcgtcc ggcacggcaa gcgcagccta 1960441 aacgtgatca tggtggctcc ttcacgatcg cgggttgtgg cgatcatcgc tgtgaattgc 1960501 tcgtggctcc tagggtcgtt cggccttggg gctggggacg tcggtcacga atggctgggc 1960561 gccgtgcata tcgggtgaac cgggcgtcga acaagcgaag ttttattgtc ggataaggga 1960621 ctttcgcccc ttcccgcctg ctgtgtttgg tggcagtatt ggtgataccg gggaaacccg 1960681 gtgatctgcc cgaagtgctg ggcgattgag cgggtatgta cacccggttt gacctaccgt 1960741 cccaagacgg ggctaccgcc ttcgggcaga tcctcatcct gttactgcgg cgcaccgcgt 1960801 cagctcgttg atcgacagga agaacagcgc gccgcgatgg tcatcgctgc agccgtggtc 1960861 agcgggcagc gtagccagca cggtcgtcat gacgtggatc gcgccgtcga cggcgcaaac 1960921 tcgttgtgcc ggcttgccga aactgaccag cgcgacctga ggtgggtaga tcaccccgaa 1960981 gaccgcgtca accccctggt caccgacgtt ggtcatggtg atcgtgaggt ccgatagctc 1961041 cgagcccggg gacttcggtc cccagagccc ccgacttgga tcagtggtcg gatatcgctc 1961101 gatgacagcg atgagctcta cacaactggc cgaggccaga acacgaggtt cgcccgcgtg 1961161 ctcggaccat atctggtcgt tgtcaccgcc acggcgctag cgcacgcgtc gtcgcggacg 1961221 tcccgcttgt tacgggcgat tggtggccag gcggtcatgg tgctgatggc attgtcgggc 1961281 ggtatctcag ctacatcggc tggtttcgac gaaacgctcg aacttggtgg tcgaacgacc 1961341 gcggcgggcg gcagatgatg gcatgggtgt catcagcggc cccgatggcg tgcgatgacc 1961401 ggccgctgcg gccgatggtg gcggctaggt ggtgcagcat ggcaacgaag gtgatcgtcc 1961461 acaccgccaa cgcgacccaa ccttcgaatt cgccgatgct ttcgacgatg ggcagatggg 1961521 cggccaggcc gagacggtat gcgcccacgc cgtacatgcc gagcgggaac acgacgctcc 1961581 acaacgttgc ctcgtagcgc agcgggacac ggtggacgac atgtttccat atgctggcgg 1961641 cgaccagcgg tgggatcagc cacggtccga aggcccagaa caccaccgac gctcccgcaa 1961701 cgagtccgct ggtgacgata gccattggtg catcagccat ttcgacgatg tgggcgccgg 1961761 ccagcacggt gatagccgtg gcgcccatcg ccacccaata gggcggggtg agatccgcgg 1961821 gccgcagcgg gtagagcagc aggcgggcga cgaccaggct gccgacagcg acgtacagaa 1961881 acacgcctac tgaccaacta atcgcgtgcc gagcacgtcg gacgcagcga caaaggtgaa 1961941 catcccaaat ccccggcgcg gatcagctag gtcgtcggcg aattctttgc ggaagatgac 1962001 gattcgtgtc gtgctcaccg cgatcaagac agcataggcg gtgcaggtca cccacagcag 1962061 gacgacggaa agggcatacg tccacccgca caggcgggtg acgacagggc cgaaagtcgt 1962121 gtctcgcatc gaaatcgacg ccagcgcgga cttgttcgac gagtagacgt gtcgctaacg 1962181 tcgatctcga tgggcagtcc tgtccgctcg ccgaagacgc actcccgtca ccacccgcgc 1962241 cgccgcggcc gcgttagcac cagctcctcg cggctgcggt agatgatgta cgggcggaac 1962301 agatagccga tcggggcgct gaacgcgtgt accagccggg tgaacggcca caacgcgaac 1962361 aacgccaacc cgatcagcac atggatctgg taatacagcg gagcctcggc catcaggtcc 1962421 ccgcgcggtt gcagtaccca caccgagcgg aaccacaccg acaccgtctc gcggtagttg 1962481 tacgcctcgc cgacaacgcc ggagcccaac gccgtcgcac ccagtcccgc gacgatcgcc 1962541 gccaccagca cgaggtacat caccttgtcg ttgacggtgg tagccatgaa caccggcccg 1962601 cgggtgcgcc gccggtagat cagcagggta acgccggcca aggtggtgat gccggcgatc 1962661 gaccccagca cgacggcctg cacgtgatat gcgccctcgc tcaaaccggc ggcctgagtc 1962721 cacgactgcg ggatcacgag cccgataccg tggccgacga tgaccaccag gatgccgaaa 1962781 tgaaacatcg ggctggcgat ccgcagcagc cgcgactcgt acagctggga cgagcgggtg 1962841 gtccagccga atttgtcata gcggtagcgc caccaggagc cgaccgcgac gatcgtcatc 1962901 gtcacatacg gcacgacggt ccagaagagt tcgcccatca tgtcacccgt ccggcatacc 1962961 gcggccaccg tgtgtgcgta tggcaatgcg gcctcggtca gggcattgca cagcgcggcg 1963021 atgggcaccc ggtacccgct cagcaaccgt cgccccgcct cggggtcgac ggtcgcggcg 1963081 aattcgagca ccaccggcag gaagtccggg gtctcgccgc gcggtggtgc gacgtcggtg 1963141 ctgcggtagg tctgggcgaa ggccagcatc tcccggccgc ggttgcgggt gtcgccggcg 1963201 gtccagtagg tcaggtacag ggtggcgcgg cctcgcaggt cgaaggtgtc gacgtagcgg 1963261 gtcgccgcgg tcagcggatc ggcacggcgc agctcagaga ccgtgcgccc caacagatcc 1963321 gcggccggac cgtcgatgtg ggccagcaat tcctctgcgg tgccgagttg ccgtgagttc 1963381 gggtaggtca gcagcaccga ggcgcattgc cacaccacgt cccaccaatc tccggactcc 1963441 ggcacgtcgg tctggtcgcc gaacacctgc ggggaggcca ccggcaggtc ggcgtaccag 1963501 tcgtagaacg acgtcatcac cccgccgatt agctccacga accgcgaccc cgcggcgtgg 1963561 ctcaccatgg acatcgccgg gatgggggag aagccggcaa cccggtccgg gccgtatgtg 1963621 gagatggtgt gcacgtgggc ggcggcgatc atctcggtgg cctcggccca gctgacccgg 1963681 accagcccgc ccttgccgcg ggcgcgctgg tagcggcggc gccgccgcgg gtcggcctgg 1963741 atgtcggccc aggccgccac cggatcaccc aaacgtgcct tcgcctcccg atacatctcg 1963801 acaagcacgc cgcgggcgta cggatggcgc acccgcgtcg gcgaatacgt gtaccaggaa 1963861 aacgccgcgc cgcgcgggca gccgcggggc tcatactcgg gccggtccgg gcccaccgac 1963921 ggatagtcgg tctcctgcgt ctcccaggtg atgatgtcgt ctttgacgta gatcttccaa 1963981 gaacacgacc cggtgcaatt caccccgtgt gtggagcgga ccaccttgtc gtggctccac 1964041 cggtctcgat agaacacgtc gccgtcgcgg ccgccgcggc gggtcacggt acgcagatcc 1964101 gccgagatct cacccgggat gaagaaccgg ccgctgcgtg caagcagctc ctcgatgcgg 1964161 ctgccggtcc gtggtgtcac cgtcacctgg acgcctcctc actcaccggc tcccgcgcgt 1964221 gcagcgcggt gtaggtacac gcgaccagcg cggtcgccac cagcagcagc aacccgaccg 1964281 tgtagtcgtt gtcgaccggg tcgtaggtcg cgcccatcac cagcggcggg aagtaaccgc 1964341 ccaatccgcc tgccgcggcg acgattccgg tgaccgagcc gaccgatgcg gccggggcgc 1964401 ggcgggccac ccacgcgaac acgccgccgg tgcccacgcc gaggcagacc gccagggtga 1964461 tgaaggtggc cgccgaccac acctccggcg gcggctgcaa cgccgcggcg aacgccagca 1964521 gcgcggtccc ggcgagcgag gccagcacca cgtgcctcgg tgcgatccgg tcggagagcc 1964581 acccgcccac cggccgggcc agcaccgccg ccagggcgaa cccggcggtg cgagcgcccg 1964641 cgtcgaccgt ggagaacccg tagatcgtgg tgatgtaggt gggcaggtag ttgctgaacg 1964701 ccacgaaccc gccgaacacg atcgcgtaca gaaacgacat ctcccaggtc accggcaacc 1964761 gtgccgcggc cttgagcctg ggcagcaccg ggtcggcgtt gggccgaaag tagggtgcat 1964821 cacgaagcac gaccatggcc accacggcgg tcgacgcgag cgcggccgcg acgatggcgt 1964881 gggtggtgaa caggccgaac caccgtacaa accgcggggt gaagaacgcc gagagcgcgg 1964941 tgccgaccat gcccataccg aacacgccgg tggagaaacc gcgccgcgcc ggctggtacc 1965001 agttgttggc gaacgggatg ccgacggcga agatcgtgcc ggcaacgccc aggaagagcc 1965061 cgaaaaacac cagcaacgcg taggagccca tggttgccgc gaccccgacc gcgagcaccg 1965121 ggaggatcga cgccagcgtc accgcgatga gcatggcgcg cccgccgaag cggtcggtga 1965181 gcggcccggt gacgatgcgg ccaagggcac ccaccaggat cggggtggcg acgagcagcg 1965241 acgcctcggc gctggacagt gacatgtcac gcgcgtagct ggtcgacagc gggccgatca 1965301 ggttccacgc ccagaagttg accaccgaga tccaggtggc cagcacgaga ttggccgctt 1965361 gccctctcat cgacacgatc cggggtctcg gactccggcg aactccgcgc cccgcccgga 1965421 cagccatgcg ctaaccctgg cttcgatggc gccggctcag ttagggccgg aagtccccaa 1965481 tgtggcagac ctttcgcccc tggcggacga atgaccccag tggccgggac ttcaggccct 1965541 atcggagggc tccggcgcgg tggtcggatt tgtctgtgga ggttacaccc caatcgcaag 1965601 gatgcattat gaccagcgag ctgagcctgg tcgccactgg aaaggggagc aacatcatgt 1965661 gcggcgacca gtcggatcac gtgctgcagc actggaccgt cgacatatcg atcgacgaac 1965721 acgaaggatt gactcgggcg aaggcacggc tgcgttggcg ggaaaaggaa ttggtgggtg 1965781 ttggcctggc aaggctcaat ccggccgacc gcaacgtccc cgagatcggc gatgaactct 1965841 cggtcgcccg agccttgtcc gacttgggga agcgaatgtt gaaggtgtcg acccacgaca 1965901 tcgaagctgt tacccatcag ccggcgcgat tgttgtattg agggtgccgg cgcgttagcg 1965961 ccgacggaac gcctgcactg cggtaggcaa tgtcataaag atatggtctt cgccaatctt 1966021 atcgagaaga ctggcggccc tgagtgattc acgcaagtct tgtttgaccc gggccatggc 1966081 gaacactatt ccccgacgca gcagctcggt gcggagttgg tcgagcgcat ccagcgcagt 1966141 caggtcgacc tccacattgg attcggcgtt gagtacgaac cactcgactt gccccggatc 1966201 ctgatcgacc acggtcagtg ctcgcctgcg gaagtcttcg gcattggcga agcacaacgg 1966261 cgcgtcatag cgatacacca ccagcccggg cacgcgcttg gcctgcggat agtcatcgat 1966321 gtcgtgcatg ccggcaatgc ccggcacgaa cccgagaacg ctgtcatgcg gatgtgcgac 1966381 ccgacgaagc agttcgagga tggacagggc aaccgcggcg aggactccat agaacactcc 1966441 taggcctaac acggctgctg tggtggctag tgccagcatg agttcgctgc gccgaaaccg 1966501 cgccagtcgc cggaattctg acaagtcgat caagcgtagc gcggcatata ccaccaaagc 1966561 gcccagagcg gcgatcggaa acatggccag cagcccactc gcgaaaacca tcacgatgac 1966621 aacaagcccc aacgcgatca gcgagtacag ctgggtgcgg ccaccgacga cgtcggcgag 1966681 ggcggtacgg ctgctgctgg aactcaccgg aaaaccgtgt gtcagcccgg cggcgatgtt 1966741 gcaggccccg accgcgcgca gctcggcgtt ggcattgact tcctgacctc gacgagcggc 1966801 gaaggcgcgt gcggtcaaca caccgtcggt gaaggtaaca atcgcgatcc cggcagccgg 1966861 aatgatcagt gcccgcaagt cttccaccga aacgggcggc acacccggcg tcggcagacc 1966921 ggaaggtatc cgacccacaa tcgcaatacc tttggcatcc aaggacataa cggccactag 1966981 catcgtggcc gcaagaaccg cgatgatcgg tccgggggcg cgcggcgccc accgcgtgag 1967041 catagttagc agcgctagga cagacatggc taacacaaaa gtcggccagt gaactcgcgt 1967101 gacgctagtc gcgaaagagt gtacttcgct gaagaattcg ttgccttcga ccgaggtgcc 1967161 ggtgatagtg ccgagttggc tggagatcat gacaagcgcg atgccggcca tgtatccgac 1967221 gagcaccggc cgcgatcgca ggctggcgag gaaacctagt cgcgccgtgc cagcgagtag 1967281 gcagataagg ccgactagca atccgagggt tgccgccaga acggcatagc gtcgaagatc 1967341 cccggcggcc atcggagcga gcacggccgc cgtcatcaag gcggtggcgg attccgggcc 1967401 gattgaaagc tgccgggacg atccgagcag tgcgtaaatg gcaagcggcg cgatcgacgc 1967461 ccacagcccg gctgccggcg gtaggcccgc cacggtcgca tacgccatcg cttgcgggat 1967521 cagataggcg gccacggtca ggccggcgag gacatcgccg cgcagccaac gccgttggta 1967581 ttcgcggaac tgcaccaccc ctggtgccca gccggccgat gtcatcgtgg gaatcattgt 1967641 ccgacggctg gccgcttagc tagagtcggt ctagaacccg cccaatcttt atagaatcct 1967701 gaccatggaa ttggcggctc gaatgggcga gactttgaca caagcggtcg tagttgcagt 1967761 gcgggagcaa ctggcccgcc ggaccgggcg caccagatcc atttcgctac gcgaggagtt 1967821 ggccgccatt ggccggcgct gcgcggcctt accggtgctc gacacccgag ccgcggacac 1967881 gattctcggc tacgacgagc gcgggttgcc cgcctgatgg tgatcgatac ctctgcgctg 1967941 gtcgcgatgc tcaacgatga acccgaggcg caacggttcg agatagccgt ggcagcagac 1968001 cacgtttggc tgatgtcgac ggcgtcatat ccggagatgg cgaccgtgat cgaaacacgc 1968061 ttcggggaac cggggggacg tgaacccaag gtcagcggcc agcctctcct ctataagggt 1968121 gacgatttcg catgtatcga tattcgcgcg gttctcgccg gctgagccgg cgatgagcgc 1968181 cctgctggat ggggtgttgg acgcccacgg cgggctgcag cgatggcgcg ccgcggaaac 1968241 ggttcatggg cgggtacgca cgggagggct gttgcttcga acccgggtgc cgggcaaccg 1968301 cttcgcggac taccgcatca cggtgcatgt ccaacaggcc cggacggtct tggatccgtt 1968361 cccgcgtgac gggtaccgcg gagtcttcga gagcgggcag gtgcggatcg aaagccacga 1968421 tggcgcggtc atcagctcgc gcgcgcaccc gcgagcggcg ttcttcggac gctcgggcct 1968481 gcgccggaac atccggtggg acccgctgga ctcggtctat ttcgccggtt acgcgatgtg 1968541 gaactacctc accacgccgt acctgttgac gcgcgaaggc gtggcggtcg aggagggagc 1968601 gccctggcag caggagggcg agacctggcg gcgcctgatt gtgagcttcc cgccggatat 1968661 cgacacccac tcgcctcgcc agacctttta cgtcgatgcc agcggtctct tgcgccgcca 1968721 cgactacgtc ccggaggtcg ttggccactg ggcacgggca gctcattatt gcgccgaccc 1968781 cgtggatgtc gacgggtttg tattcccgac ttgccggtgg gtccacccga tcggcccggg 1968841 gaatcgctca ctgcccttcc caactctggt atcgatcctg ctgaccgaca tccgggtcga 1968901 gaccgattag gtttcgccgg aagtcgccgc acctcgcggt tgctgaaacc attagcctta 1968961 tgcctgtcac accaccgcgg ttggcggggt gaggagtcgg gcgatggatg gcaccgcgga 1969021 atcgcgggag ggtacgcagt tcgggccgta tcggttgcgg cggttggtgg gtcgcggcgg 1969081 catgggcgac gtctatgagg ccgaagacac ggtgcgcgag cggatcgtgg cactaaagct 1969141 gatgtcggag acgctctcca gcgatccggt cttccgcacg cgtatgcagc gcgaggcccg 1969201 caccgcgggg cgcctgcagg aaccgcacgt cgtgccgatt cacgacttcg gtgagatcga 1969261 cgggcagctc tacgtggaca tgcgcctgat caacggcgtg gatctggccg cgatgctgag 1969321 acgccagggg ccgctggccc caccgcgagc ggtcgcgatc gtgcgccaga tcggctcggc 1969381 gctcgacgcc gcgcacgctg ccggggcaac gcatcgcgac gtcaaaccgg agaacattct 1969441 ggttagcgcg gatgacttcg cctatcttgt cgatttcggg atcgccagcg ccaccaccga 1969501 cgaaaagctg acccagctcg gcaacacggt gggcaccctc tactacatgg cgccagagcg 1969561 gttcagcgag tcgcacgcaa cttaccgcgc cgacatttat gcgttgacct gcgtgttgta 1969621 tgagtgcttg accggatcac cgccgtatca gggagaccag ctcagcgtga tgggcgcgca 1969681 catcaaccag gcgatcccgc ggcccagcac ggtacggccg ggtattccgg tcgccttcga 1969741 tgcggtgatc gcccgtggca tggccaaaaa tccggaggac cgctatgtca cctgcggtga 1969801 tctgtcagcg gcggcgcacg cagccctggc caccgcggat caggatcgtg ccaccgacat 1969861 cttgcggcgc agccaggtgg ccaagctgcc ggtgccatcg actcacccgg tgtcaccggg 1969921 tacccggtgg ccgcagccga cgccatgggc tggcggggcg ccgccatggg ggccaccgtc 1969981 gtctccgctg ccccggtcag cccgccagcc ctggttgtgg gttggtgttg ccgtcgccgt 1970041 cgtggtggcg ctggcgggcg gcctgggtat cgcgcttgcc catccgtggc ggtcatctgg 1970101 accccgcacg tcggcaccgc cgccaccgcc gcccgcagat gcggtcgagc tccgcgttct 1970161 caacgacggt gtctttgtgg gtagctcggt ggcgccgaca acgatcgaca ttttcaacga 1970221 acccatctgt ccaccctgcg gcagtttcat caggtcgtat gcgagcgata tcgataccgc 1970281 ggtggccgac aagcagctgg cggtgcgcta ccacctgctc aacttcctcg acgaccagtc 1970341 gcacagcaag aactattcga cgcgagcggt ggccgcctcg tactgtgtag cggggcaaaa 1970401 cgacccgaaa ctctacgcca gcttctactc cgccctattc ggcagcgact ttcagccgca 1970461 agagaacgcc gcatcggatc gcaccgatgc cgaactggca catcttgctc aaacagtcgg 1970521 cgccgagccc acggcgatca gctgtatcaa gtcaggagct gatctgggca ccgcccaaac 1970581 gaaggccaca aacgccagcg agacgctggc cggcttcaat gccagcggta cgccgttcgt 1970641 gtgggacggc agcatggtcg tgaactatca ggatccgagc tggctcgcga ggctgatcgg 1970701 gtagcgcggg tggtgtggcc tcgtcccgga caattccgct tgctctcgca gcatgtccgc 1970761 agcggtgcgc ggttgtgacg gtgaattcac gatgctcgcc gttgatgtcg gcaggtacca 1970821 ccgcggtgtg gcttgcgtcg cggacggtgc ggtcagattc ggcgatggtc ccgagggcgg 1970881 cagctactat gccaacgaca ggcgcccaca aatatcctgc ggttgagttg cagaccgggt 1970941 gggtcgttca ccgatccact gtagggccgg tgactcagaa cgtggccgtt aattcgaaac 1971001 ccggcccagg ttgccaaccc gaagatttcg ggcgccgacc acattccgca gtcccgaaca 1971061 attcacgcac cacaaacacc ccacacagtc ggtgcagcgc acgcagccga tacaggccac 1971121 gcaccgggtg caggtgatgc atgctaggca tgccacacac tgccggacag ccacgcacaa 1971181 tacggtcagc agactgccga ttatcccgac gctgcccgcc gtggctgccg ccccggctat 1971241 cgcgacgctg cccgcggtcg cgaccgagcc ggcgactgcg acgctgcccg cggtcgccac 1971301 cgagccggcg actgcgacgg cgcccgtggt cgcggccgat cccgcgacgg cgatgctgtc 1971361 gatgctggcg atcgagcggt taattaccat gtgcggcttt cggtagccgg cagtcgtcgg 1971421 ccacgggcca ctgtgccgga catggtccaa gtttggtcag gtagcccagt tgtgagcggc 1971481 accaagggga taccggggcg attacgccgg cggtaacatc gcgcacgaat tgttcccagg 1971541 acaaccagcg gatcgcgtcg acctcgtccg agttcggccg gggctgttgg tcaacctgga 1971601 ctcggtagac ggggcagatc tcgttttcca cggtgccatc ggccatagcg gcccggtagc 1971661 ggaaccccgg caggatcaga tcgacccgat ctggggtcag tccgagttcg gcagcgagcc 1971721 gccggcgtat ggcgccgggt agcgattcgc caggcagggg gtgcccgcag caactgttgg 1971781 tccataccgc cggccacgtc ctcttggtgg cggcccgccg cgtgatcaac agctgatcgt 1971841 gcagatcgaa cacatagctg gagaacgcga ggtgcaaagg ggtgtcgccg gtgtgcacgg 1971901 tggccttgtc ggccacacct gtcgcgtcgc cgcggtcgtt gagcaaaacc acccgctcga 1971961 tcggtggagc tggccggtag ctgcgggtca tgccagacct ccttacgctt gcttgcgagg 1972021 gtcggttcgc ggccccaacg ctggcaaact accggagagt cacttgtcgc gtgcggagtt 1972081 ccacgattct cgtcgagtgt cgcaagccct gccctcctgg cgggctacga tgccgccatg 1972141 ccgctcgcgg aaggttcgac gttcgccggc ttcaccatcg tccggcagtt gggatccggc 1972201 gggatgggcg aggtgtacct ggcccggcat cccagactgc cccgccagga cgcgctcaag 1972261 gtactgcggg ccgatgtgtc agccgacggc gaataccggg cacggttcaa ccgcgaagcc 1972321 gatgccgcgg cgtcgctgtg gcatccacac atcgtcgccg tccacgaccg cggcgagttc 1972381 gacggccagc tctggatcga catggacttc gtcgacggca ccgacaccgt atcccttctc 1972441 agggatcgtt atccgaacgg gatgcccggc cccgaggtca ccgagatcat cactgcggtg 1972501 gccgaagcgc tcgactatgc ccacgaacgt cggctgttgc accgcgacgt caaacccgcc 1972561 aacatcctga tcgccaatcc tgattcacct gatcgtcgaa tcatgttggc cgacttcggg 1972621 atcgccggct gggtcgatga tccaagcgga ttgaccgcca caaacatgac tgtgggcacc 1972681 gtgtcatacg cggctccgga acagcttatg ggcaacgagc tcgatggacg ggccgaccaa 1972741 tacgcactag ccgcgacggc gtttcacttg ctgaccggct ccccgccctt tcagcacgcc 1972801 aaccccgccg tggtgatcag ccagcatctc agcgcgtcac ccccggcgat cggcgatcgg 1972861 gttcccgagc tgacaccgct ggacccggtc ttcgccaaag cgctggccaa gcaacccaag 1972921 gaccgttacc agcggtgtgt cgacttcgcg cgcgcactcg gccatcgtct gggcggcgcg 1972981 ggtgatcctg acgacacgcg ggtgtcgcaa ccggtcgccg tggccgcgcc cgcgaaacgc 1973041 tcgctgctgc ggaccgccgt catcgtcccc gcggtgctgg cgatgctgct ggtgatggcc 1973101 gtcgcggtcg ccgtgcggga gttccagcgt gctgacgacg agcgtgcagc gcagcctgcg 1973161 cggacgcgga ccaccacatc ggccggcacg accacttcgg tagcccccgc gagcacaacg 1973221 cgcccggccc ccacgacccc gaccacgact ggcgccgccg acaccgcgac tgcatcgccg 1973281 accgctgcgg ttgtcgccat cggcgccctc tgcttcccgc tcggcagcac cggcaccacc 1973341 aagaccgggg cgacggccta ctgctcgacg ctgcaaggca ccaacaccac catctggtcg 1973401 ctgaccgagg acaccgtggc cagtccgact gtgaccgcca ctgctgaccc gacggaggcg 1973461 ccgctgccca tcgagcagga atcgccgatt cgagtgtgca tgcagcagac cggccagacc 1973521 cgacgggaat gtcgcgagga gattcgcaga agcaacggct ggccgtgatg gtcggcttgc 1973581 ctgaccgggt gcacccgccc cggcgtcggc tgcggtcccg atacagttgg tgccgatgag 1973641 ccaaccagcc gccccgcccg tgttgaccgt gcggtatgag ggatcggagc gcacgttcgc 1973701 cgcaggacac gatgtcgtcg tcgggcgtga cctgcgcgcg gatgtccgcg tcgcacaccc 1973761 cctgatctcc cgggcacacc tgctgctgcg attcgaccag ggtcgctggg tcgccattga 1973821 caatggcagc ctcaatgggc tctacctcaa taaccgtcgg gtgccagtcg tggacatcta 1973881 cgatgcccag cgagtccata tcggaaaccc cgacggtccg gcgctggact tcgaagtggg 1973941 ccgccaccgg ggttcggccg ggcgaccacc ccagacgacg tcgatacgcc tgcccaacct 1974001 gtccgcggga gcgtggccca ccgacggccc gccgcagacc ggcacgctcg gctccggcca 1974061 gctacaacag cttccaccgg ccaccacccg gatacccgcc gctccgccat cgggaccaca 1974121 gccgcgatac cccaccggtg ggcaacagtt gtggccaccc agcggaccgc aacgggcgcc 1974181 gcagatttac cggccaccca cggccgcacc gccgccggcg ggtgcccgcg gcggaactga 1974241 ggcgggaaac ctcgcgacat cgatgatgaa gatcctgcgg ccaggcaggt tgacggggga 1974301 gttgccgccc ggtgccgtca ggatcggccg ggcgaacgac aacgacatcg tcattcccga 1974361 ggtgttggcc tcacgtcacc acgccaccct ggtcccgacg cctggcggca cggagattcg 1974421 ggacaaccgc agcatcaatg gcaccttcgt caacggcgcc cgggtcgacg cggcgctgct 1974481 gcacgacggc gacgtcgtga ccatcggcaa catcgacctc gtcttcgccg acggcaccct 1974541 ggcgcgccgt gaagagaacc tgctggagac ccgcgtcggc ggcctcgacg tgcgcggggt 1974601 gacctggacc atcgatggcg acaagacact gctggacggc atctcgttga cggcgcgccc 1974661 cggtatgctc accgccgtca tcggtccgtc gggcgctggc aagtcgacac ttgcccggtt 1974721 ggtggctggg tatacgcacc cgacggatgg cacggtgacg ttcgagggcc acaacgttca 1974781 cgccgaatat gcctcgctgc gcagcaggat cggcatggtg ccacaggacg acgtggtgca 1974841 cggtcagctg accgtgaaac acgcgctgat gtatgccgcc gaactacggc tgccgccgga 1974901 caccaccaaa gatgaccgca cccaggtagt tgcccgggtg ctcgaagaac tcgagatgtc 1974961 caagcacatc gacaccaggg tcgacaagct gtcgggtggt caacgcaagc gggcgtcggt 1975021 ggcgcttgag ctgttgaccg ggccgtcact gctgatcctc gacgagccga catccggcct 1975081 agatcctgcg ctggaccggc aggtcatgac catgctgcgg cagttggccg acgccggtcg 1975141 ggtggtgctc gtggttaccc actcactgac ctacctggac gtctgtgacc aggttctgct 1975201 gttggccccc ggcggcaaga ccgcgttctg tgggccaccg actcagattg gtccggtcat 1975261 ggggaccacg aactgggccg acatcttcag caccgtcgcc gacgacccag acgcggccaa 1975321 agcccgctac ctggcgcgga cgggtccgac cccaccaccg ccaccggtcg agcaacccgc 1975381 cgaactgggc gatccggccc ataccagctt gtttcggcag ttctccacga tcgcgcggcg 1975441 acagttgcga ttgatcgttt ccgaccgagg ttacttcgtc tttctggcgc tgttgccgtt 1975501 catcatgggt gcgctgtcca tgtcggtacc gggcgacgtg ggcttcgggt ttcccaaccc 1975561 gatgggtgac gcgcccaacg agcccggcca gatcctagtg ttgctgaatg tcggtgcggt 1975621 cttcatgggg accgcgctga ccattcgtga cctcatcggt gagcgagcca tcttccggcg 1975681 cgaacaggca gtcggcctgt ccactaccgc ctacctgatc gcgaaggtct gtgtctacac 1975741 cgtgctcgcg gtggttcagt cggcgattgt gacggtgatc gtcctggtcg gcaagggcgg 1975801 tccgactcag ggtgccgtag cgttgagcaa gccagatctg gagctgttcg ttgatgtcgc 1975861 ggtgacctgt gtcgcctcgg cgatgctcgg attggcgctg tcggcgatcg ccaagtccaa 1975921 cgaacagatc atgcccctgc tggtcgtggc ggtcatgtcg cagctggtgt tctccggagg 1975981 catgattccg gtcaccggac gtgttcccct tgaccagatg tcctgggtca caccggcgag 1976041 atggggtttc gcggcgtcgg ccgctacggt cgacctgatc aaattggtgc ccggtccgct 1976101 gaccccgaag gattcgcatt ggcatcacac cgccagcgcg tggtggttcg acatggccat 1976161 gctggtagcg ctcagcgtta tctacgtcgg ctttgtgcgc tggaagattc gcctcaaggc 1976221 gtgctaggcg gcagttcact gcccaaccca ggtggaatta acgggaatgg ctgtctcact 1976281 caccggctca acaggtggcc ttgggcgcgc gacgcgaccg cacccgccga ccgtgacgtg 1976341 cgactgattc tgagctaacg cacgcagggg gaactcgagc ccggtgacca gctcgagcgc 1976401 ggcgccgggc gggtgagatc gacgtgtggg tcgccaacgc cgtgctgcca gcctccggca 1976461 agctcgacag catcaccgcg gagccggttg gccgcgcgct gcggggacgg cgcgcttgac 1976521 ggcgaacgcg cccgagatcg ccctcctcgg cgtcgccgac caggtcgcgg ccggtcagat 1976581 tgacaagcgg tgaagccggt tgccgggtgg tgtctgctcc ggccgaccct ggggccgtcc 1976641 atggtggcat cctggcctgg tggggctact gattcggcta gccgagttgc tcgttgtgat 1976701 gctgccgctc atcggagtgc tatatgtcgg catcaaagcg ctgtcgtcct tcacgcggcg 1976761 gctaggggag gcgtctggcg atcttgcgtc ggatagcccc gcgatgccac gcccaaccac 1976821 tgtcgaaaac gacgcagcgc ggtggcgggc gatcactcgc gcggtcgagg cgcacgagcg 1976881 aacggatgca cgctggttgg aatacgagct cgacgccgcc aagctgctcg acttcccggt 1976941 catgaccgac atgcgggacc cgctcacgac ggcatttcac aaggccaagc tacaagccga 1977001 ctttcacaag ccgttgcggg cggaagatct tctcgacgac ccggacgccg cgggccacta 1977061 tctcgatgcg gttcgggact atgtgaccgc gttcgacacc gcggaggccg aggcgatgcg 1977121 cagacgcaga accggctttt cccgcgagga acagcagcgg ctggcaagag cgcaaagcct 1977181 gctgcgggtg gcatccgacg ccggcgcgac ggcccaggaa cgcgagcgcg catatcgttt 1977241 ggcgcgcacc gaactcgacg gactcatcgt gttgccggac cgtacgcggg ccggcatcga 1977301 gcgggggatc gccggcgagc tcgatgacta aggctgacct ttcggcaccg cgtcgccgtt 1977361 gctgtgccac gaccacgcat agagcgccca catgacgatg ggtagcagga tgtcggtcca 1977421 cagcgggacg ccgatgttgt atgggttggt gttgttctcc accacccagt agtagatgtg 1977481 gccggccgcg tctccgacgt actggatggt gagcaccacg attgtcgcca gccagaagtg 1977541 cccgcggaag cggtacgcca tcaggccgac caccccgatt gccaggtcgc ccattgcgtt 1977601 ctcccattgg aacccgccgt cgccgcgcgt atagccgatc aactcggcgg tccgctcgcc 1977661 gtcgaagacg tggtatcccg cgccgatgat cgataccacg cccacgatca gcaccatcca 1977721 ccacagcata tggatgtccg cggctgggcg gtgccggtga cgccggctct gcacgaacgc 1977781 accgattagc gcgacgatta ccccgacaat ggtgaacatt ccaacaccct tccctagctt 1977841 tagggtcccg tcatgctgtc gaatctcatt gaccgcacgc aacactagcg gacgggctgg 1977901 cgctcaccgc tgttgcgggc gtcccgagaa cgccggccga gtaatggggg agcggacctt 1977961 tccgtacttc atatcgcttt tgccggtccg gacgcgtggt ggtaagcgct gcctcgtggt 1978021 tcgcgcaccc acagggtgtc cgctttgccg accgcggttc cctcgtcgat caactggcgc 1978081 ttgagcacct tgtgtgtggc ggtgctggga aggtcggccg cgatgcggat gtatcgtggc 1978141 cgggctttag tggataggtc aggctgggcg tccagaaatg cttcgaacgc gtcagggtcg 1978201 aaggtgtcac ctgctcgcaa gaccaacgcc gccatcacct gatcgccgac gtattcgtcc 1978261 gggacggcat acaccgcgac acggttaata gccttgtatc gtaatagaat tcgctcgatt 1978321 ggtgccgctg tcaggttctc gccgtctacc cgcatccagt cggcggtgcg gccagcaagg 1978381 tagatccagc cttcagagtc ccggtatgcg aggtctccag accagtacat gccgtggcgc 1978441 atgcgctcgg cgttggcttc ggggtcattg tagtagccgg tgaagaagcc cgaccccgtc 1978501 gtgttgacca actcacctat ggcttcatcg gcgttggtga gtgctccgtg agcgtcgaac 1978561 cgcgcgacgg cgcactcggt gacggtttcg ccgttgtaca ccgcgacccc gtgggctccc 1978621 cggccgatcg agcccggtgg cgtgccgggt tcgcggatca cgatgaccgc gttctcggtc 1978681 gagccaaagc cgtcctcgac ctggactccg aagcggcgtg agaattcctc gatgtctttg 1978741 tcattggcct cgttgccgaa agccacccgc agcggattgt cggcatcgtc gtcgcgttcg 1978801 ggggtggcaa ggatataggc gagcggcttg ccgacgtagt tcatataagt ggcgtggtat 1978861 cggcggacgt cgtcgaggaa gccggtcgcc gaaaacgtcg ccggcgcgat cgcggcaccg 1978921 gagaccaccg ctggcgccca tcccgcgacc accgcgttgg agtgaaacag cggcatggat 1978981 acatagcagg tgtcctgttc ggtgagcccg aagcgctcgg tgaggctacg cccggcgaac 1979041 gtggccatta ggtgtgacac cggtaccgct ttgggatttc cgctggtgcc ggacgtgaag 1979101 atcatcatga acggatccat cgtgtcgact tctcgatagg ggacaaaggc gccgtcacca 1979161 gccaccaatt cagcccaccg cggtgtcgag gtatcaagga tccgcgcgcc cgcgaggtct 1979221 aaaccgtcca acagcgctcg gtggtcggca tcggtcacca cgatctggca atcggctcgc 1979281 ctgacgtcag cggccagtgc atcgccacgt cgcgttgtgt tcaggccaca cagcacatag 1979341 ccgcccaacc cggccgcagc cagctgggcc agcatctcgg gcgtattccc cagcagagag 1979401 ccgatatgcg tcggacgttg cggatcggcg attgtgatga gggccgccgc gcgggccgcc 1979461 gactccgcca ggtactgact ccaagtccat tgcagaccac cgtatttcac ggcaatcgtt 1979521 ggatcggata cgtgctggcg caagagcgat tgaatcgtgt cggtcatgaa ttcgctccca 1979581 tgtcgagtcg cgggctttgg ccgcgacgct gtcatccagc atgatcgcca cgatgccatc 1979641 aatggccagg aggtcgcgac atgacaacaa gatcaccacg ccggcagtgg attgcctcac 1979701 gatcgaacgt ctagattctc ccgcgtccgg cgcccctcag gtcacccctt atgctagggc 1979761 gctaatgggc gagacaacca cgtgcgcgat catcggcggc ggcccggccg ggatggttct 1979821 gggcctgctg ttggcgcggg caggtgtgca ggtcaccctg ttggagaagc acggagactt 1979881 cctgcgcgac tttcgtggcg acacggtgca tccgacgacg atgcggctac tcgacgagct 1979941 tgggctgtgg gaacgctttg cggctttgcc ctacagcgag gtccgcacgg ccacattgca 1980001 ttcgaatggt cgcgcggtga cctacatcga cttcgagcga ctgcatcagc cctaccccta 1980061 tgtcgcaatg gtgccgcaat gggacctgct gaacctgctg gcggaggccg cccaagcgga 1980121 accgagcttt acgctgcgga tgaaaaccga ggtgaccggg ttgctgcggg agggcggcaa 1980181 agttacgggg gtgcgctatc aaggagccga gggcccgggt gaattgcggg cggaattgac 1980241 cgtggcgtgc gacggccgat ggtcgatcgc ccggcacgag gctggactga aggcgcgtga 1980301 attcccggtg aactttgacg tgtggtggtt caagctgcca cgtgaaggtg acgccgagtt 1980361 ctcgttcctg ccgcgattct ccccgggcaa ggggctcggc gtgatcccac gcgaaggtta 1980421 tttccagatc gcctacctcg ggcccaaggg aaccgacgct cagttgcgcg agcgaggtat 1980481 cgaggaattc cgtcgggacg tcagcgaact gctgcccgaa gcgacggcat cggtggcggc 1980541 gctagcgtcc atggacgagg tcaagcacct caacgtcaag gtgaatcggt tgcgtcgttg 1980601 gcacattgat gggctgctgt gcatcggcga cgcggcgcac gcgatgtcac cggtggcggg 1980661 agtcggcatc aacctagcgg tccaagatgc ggtcgcggca gcgaccatct tggccgaacc 1980721 gctgcgtgag catcgagtca gcagccgcca cctggcagcg gtacggcgtc gtcgcgcatt 1980781 tcccaccgcg gtgacccaag cggtgcagcg ggtgttgcac cgaaggctgc tcggcccgct 1980841 gctgcagggc cgggacccca cgccgccggc ggccctgctt ggcctggtcg aacggctgcc 1980901 atggctctcg gcggtgcccg cctactttgt gggagttgga gtccggcctg agcatgctcc 1980961 ggccttcgca cgtcgcgggc ccggcaaccg caaaggccct tgagccgaca tgcgcgccgc 1981021 cgcgaatcgg cgtcttgggt atagcccgga tagcgccgtt ggcgctcatc aagccggtca 1981081 gcgggagcgt cgtggtggca gcacgtgatg tgtcgcgggt ggcgcgacca tggacgctgg 1981141 ctgctatgcc gtccacatgg cccacacgtt cggtggggcc acgccggaag tggtttcggc 1981201 gcaagccaaa ttacgcgatc cagcggtcga tcgggccatg acggccgaac tgaaatttcc 1981261 aggcgggcac accggcggga tccgctgttc aatgcggtcg tcggatctgt tgaatgtgag 1981321 cgctcgagtg gtcggcgacc gtggcgagtt gcgcgtgctc aatccggttg tgccccaact 1981381 cttccaccga ttgccgcccc tcgcatgcgt atcagctcga cgctttcgct gccgcagtgc 1981441 tgcgcgggca agcggtcaag acgacgccca aggacgcggt cgagaacatg agcgcgatcc 1981501 acgcgatcta tcgggccgcc gggctcccat cgcgcaaccc gagctgaata tggtcgccgc 1981561 gagcgggtcc gccgcctgac aggccaatgg cgtcggtcgc ttacccgcca gggttaggac 1981621 gtggtgcctt ggaagaaacc cgccaggttg gtgccgatat tggcaaagcc ggaaacgacg 1981681 ctggctaccg agaacggcag gatgcccctg ttggcgaagc ctgagacgcc gctgccaagg 1981741 ttggaaaagc ccgaggatag cccgccgaag ttctgatagc ccgagccgcc caacagcccg 1981801 gccgggttgg tgttgaacca acccgagagg cccgagccgt tgttgccgaa gcccgagttg 1981861 ccgcccgcac cggaattgaa gaagcccgac gaaggcgcgg tgctcgagtt gaagtagccc 1981921 ggccccccgg ggatcgcgaa ggccccgatc gtggtgctgg gcaggtggat gccgggaacg 1981981 gtgagcgggg gcgtggtgaa gccccccacg ccgatcggct cgatggtgag cggtggggtg 1982041 gtgatgggtg gggtggtgat ttgggggagg gtgaagccgg tgaggttgat ggggtcgatg 1982101 gtcagcggtg gggtggtgat gggtggggtg gtgatttggg ggagggtgaa gccggtgagg 1982161 ttgatggggt cgatggtcag cggtggggtg gtgatgggtg gggtggtgat ttgcggcagg 1982221 gtgaacccgc cgacgccgat cgagttgatg gttagctccg gggtgatgat ttcctgggtg 1982281 gtgatctgcg gcagagtgaa gccgcccacg ccgatcggag ggatcgcgaa ctccggggtg 1982341 gtgatagctg gggtggtgat ctgcggcagg gtgaagccat cgacgttgat agcggggaca 1982401 tcgatcccgg gtatgttgaa ggcgggcaga aagaatgaac cgatgacaat agggccggtc 1982461 aatgtgtatg ggtgaaccac caattgtggt aagtcaaact caccgaagat gagggcgcca 1982521 ttggtgaaag tactaagccc gccgccgggc ggctgaagcg caggcacatt ggtctggaat 1982581 tgtagggtaa agggtattcc aaaagccggt actgttatcc taggtgtgct taggaaaaca 1982641 tcccagccta tggagggcag gccaaattgg cccacgccaa tctggccgac cgttatcggt 1982701 tgagtatgta tcgcaggtag actaaagcca ccgattgtga tacccgcggg tatcgtcagc 1982761 tgcggaatag ttacttccgg aatctgcaat ggcggcaaat taaaagcacc caccgtaatg 1982821 ggcgggaccg tcaccggcgg aatggctacg gaaggaatac tcagcggagg caactgaaag 1982881 ccgcttacgg tgatgttggc gggtgtggtg gcggccggga tgttcaacga cggcaacgtc 1982941 aacccgggca ggctgaaggc gccgacggtg atgttggctg gtgtggtggc ggccgggatg 1983001 ttcaacgacg gcaacgtcaa cccgggcagg ctgaaggcgc cgacggtgat gttggctggt 1983061 gtggtggcgg ccgggatgtt caacgacggc aacgtcaacc cgggcaggct gaaggcgccg 1983121 acggtgatgt tggctggtgt ggtggcggcc gggatgttca acgacggcaa cgtcaacccg 1983181 ggcaggctga aggcacccac ggtgatgttg gctggtgtgg tggcggccgg gatgttcaac 1983241 gacggcaacg tcaacccggg caggctgaag gcgccgacgg tgatgttggc cggtgtggtg 1983301 gcggccggga tgttcagcga cggcagcgtt attgccggca gactgaaggc gggaaccgat 1983361 atccccggta tttgcagcgg cggcagagtc agatcaggtg tcgtaatact gaactgcagg 1983421 ctgccctgcc ccacgccccg gtagaagacg ccattgttca tgtcacccgt gttgaacagc 1983481 ccattattca tgtggccaat attgaagaca ccagtgttga tatttccggc gttgaggaaa 1983541 cccgtgttag catttcccgt gttgaacgtg ccggtgttgg acgaccccgg attgaagtcg 1983601 cccatgttat aactgccggt gttcaggctg cctgtgttcg cgttgccgac gtccaacata 1983661 ccggtgttaa acgagcccgc attgaagaag cccgtgttcc cgtgtccaga attccagcca 1983721 ccggtgttga aattacccga gtttccgatg ccaaagtttc cattgccgga gttgaagaag 1983781 ccgacgtttc cgctgcccga gttgaacaat ccgaaattcc cggtgcccga gttgagcccg 1983841 ccgattccga tctggttgtt gccggtaaga ccgatgccga tgttgttgtt gccagtgttg 1983901 ccaaagccga agttgcccaa gccggtgttg gcaaacccgg tgttgagatt gccaaggttt 1983961 cccacgccga cattgttgct gccgaggttc ccgaagccga tgttgttatt acccaggctt 1984021 gctgagccga tattggagtt accgaaattt ccggacccga aattgtagtt gccaaggttg 1984081 gcgttgccga tgttggcaag gccgttgttg gcgttgccga cgttgccacc gccgacgttg 1984141 gctatgccca ggttgatggc ggtgggtccg cccgcaagcg ccggtatgcc tgcggctgcg 1984201 gtcatggcgg ccgcaggcgc gccgctggcc aaccaagccg gcaagccagc caggttctgc 1984261 agcggtttac tgaacgggga cagcgccgag gcgatcgccg atgccccggc atggtaggca 1984321 gacatcgccg acacatcggc agcccacatt tgctcgtacg tggcttcaat ggcagcgatc 1984381 gccggagcgt tctgtccaaa caggttcgac atcaccagcg acaccaggtc ggcacggttg 1984441 gccgccacca gcatcggctg caccaccgcc gtcttgaccg cttcaaactc ggctatcatc 1984501 gccgcagcct gagcggccgt ctgctcggcc tggaccgccg ccgcggcaag ccacgccgca 1984561 tagggggctg ccgctgccgc catcgccgac gacgacgcgc cctgccacgc cccgcccacg 1984621 agtccggatg tcactgagcc gaaagaggct gcggccgagg ccaattccat ggccaacccg 1984681 tcccaggccg tcgcggccgc cgccatcggt tccggccctg ccccggcgaa tatcagcgct 1984741 gaattgatct ccggcggcag tacagaaaaa ttcatcgtcc agccttccct gcgtgccccg 1984801 cgtgatcagc ggtaaaccgt ggccggtgag tggctcttgg cccacaagct agacgctgaa 1984861 ccgtcgtggc cacataaata tcgcgcacaa atggccacga ctcataggtt tcgtaaattt 1984921 gatttacaaa aggcgctctc gggtcatgcg gaccgcaagc ggcgtccgaa cgcaggggct 1984981 atggcagcac ggtgtgcatc aacatcacgt tgtatgccga ccacaaagac aggttaaagt 1985041 agacgtcttt gcccgtcgac cagggatgca tcatcggcgc gtagatgccg ccgggcatct 1985101 gccatgacga caccagcatt tgctctgcgc tccacggtcc ttgcggagcc ggcgcggtcc 1985161 ttgccaccac gtcgttcata ccgttggtgt agagcgccag gtattgcttg aggtaggtgt 1985221 tgtattggac ggacatttcg cccaccgggc ccggaataac gggtgttgcc gcgtccggct 1985281 tgtttggaac ccaggagttc gagtcgccgt tccagtactg gtacttggtg aggtcgggca 1985341 caaagcgctg cggaactcgt gccagatatg ccgaaccgcc tcgcccgggc ggggtcccga 1985401 acgagtagag gtaaccgtcg ttggacttga ggtacgcccc catctggaag ttctcatttc 1985461 ccggaacgaa cctggctttt ccgccgctgt ccggtccgga cgcgcggatg gtgcccggga 1985521 agacccccca ggtctgacca ttgtccttgg acaccgcgat gcccgagtag ttcgtcgtcc 1985581 attccccatc acggccccaa ttcctgatgg acatgaagtt gacgtattgg gttttgccga 1985641 cggcgatgcc cgcggtcgga atgatccccg tctcgtcgcg cgcccatttg atgctgttga 1985701 tgagctgttt ggagaagccc ggttggcgta ccggtgagcc ggaatatctg ttggaagcgt 1985761 caccggatgt cacatgaact ccgttgccca ggtcgcggtc ttggctgcgg aacagcgtgt 1985821 tgtatcgcca ttgatggcca tcgacagcgc agtagccgaa tgtgtcgccg aagatcatga 1985881 gcacctgacg gttggcggga tcgccgttat cccaaggaat tccgaggtcg gtcccggaga 1985941 tgccgaagcg ttccagggtc ttgttggggc tgtccggtcc ggtcacccac tcggcgaggg 1986001 atgtggtagc cccggcgagc gtggcaccag gatccggcgc cgccgccgga gcagggtcgg 1986061 gtgctggggc tgggttcgga gttagctgag tggcattcgg gggttgtggg cccgtggctg 1986121 gcggattggg tgccggattg ggcccaggat tggccctggg gactagcgct tgctgttgta 1986181 gcggcgcggc atttctagca cccgggttga gcaatgcgga tatcagtgga cccagcttgg 1986241 gtagcggtgc acggtcgttg gcaccgcgag gcttgcgtcc ggtcggtatc ggaccgtgtc 1986301 caggtcgtac cgggccgagc gccgtggccc cggggtcggt cacaatggcg ctcggtggcg 1986361 gcggagcgtt cgcggcgtct ccgctgcacg gcgccgccat cgctggtggc gccaggccta 1986421 ttggaaccat gagtccaata gcggccgccc acgccagcga taccgacacg attcgaggaa 1986481 tcggcgacat gtcacacctt cccgggctgg acgttgcaat tgacgtccgc agttcgctga 1986541 tgtgacgata gtgatctctg ggactcttgt gatcagtgat ccactgatag gtatgcctcc 1986601 gtgaccgtgt cgcaacccat ctgttcatct ccgacctgcg ctgctgcact cggacttggt 1986661 accggtacat tcaaggccca tcggggccgc ggataccacg accaccggtg ccgaacatcg 1986721 acgatccgat caatttgcgt ccactgtcgc ccggacaggt caacaaggtg tggctctggc 1986781 aatcgctacc cggtccctgg atcgggtccg cacggaatac cgtgtacctg accggatttg 1986841 agttcctcga gccttagcac ggaccgctcg gaataccacg ggtaggcgtg gtttcctgcg 1986901 tgggcatgat ctgtggatca ggaacccgat acgggattcc acggtttatc gtgcccagcg 1986961 ccgcgttggg cacgcactgc ggcaccgttg atagcgcgtg cagcccggga taatccaggt 1987021 tgggccatga tgagttgggc gggacagcga agttgaacgt tgacgtcatg tcgccggtca 1987081 cactgcgccg ccaagccgtg aggttgggaa ctggcacccc gaaccgagtt tcgagcaatc 1987141 tcagctgtga ggtgtggtca aacgtgtcgt gaaccatctg cgggccacgg ctgtacggcg 1987201 aaatgacgaa gcagggaacg cgaaagccca aaccgatcgg cccgcgtatt ccgccggagc 1987261 ccggcacctg atcgatgtca ggcaccgtga catattcgcc gggagtcccg gccggcgcgg 1987321 tagcaggaac aacgtggtcg aaaaagccgc cgttttcgtc gtagctgacg atcagcgccg 1987381 tcttttccca caccgcagga ttggcaagca atattcttaa gatgttgacg attgcgaaag 1987441 ccccggccgc ggctggaacc gcaggatgtt cggattcgag aacattggga atcacccagg 1987501 agacccgcgg cagtctattg gctaagacgt cggccgcgaa gctcgcggga tagcttggtg 1987561 ccacgccaaa gcggacaaga tctgacctgg gatcggctga ctgtttgaaa gacgtcacaa 1987621 gcgagccgta agtaagaacc gaggagatgg gcccgagtgt cttgttgcga tacaccttcc 1987681 agctgacgcc ggcatcgcta agtgaaccgc cccggtgagt ccggagactc tctgatctga 1987741 gacctcagcc ggcggctggt ctctggcgtt gagcgtagta ggcagcctcg agttcgaccg 1987801 gcgggacgtc gccgcagtac tggtagaggc ggcgatggtt gaaccagtcg acccagcgcg 1987861 cggtggccaa ctcgacatcc tcgatggacc gccagggctt gccgggtttg atcagctcgg 1987921 tcttgtatag gccgttgatc gtctcggcta gtgcattgtc ataggagctt ccgaccgctc 1987981 cgaccgacgg ttggatgcct gcctcggcga gccgctcgct gaaccggatc gatgtgtact 1988041 gagatcccct atccgtatgg tggataacgt ctttcaggtc gagtacgcct tcttgttggc 1988101 gggtccagat ggcttgctcg atcgcgtcga ggaccatgga ggtggccatc gtggaagcga 1988161 cccgccagcc caggatcctg cgagcgtagg cgtcggtgac aaaggccacg taggcgaacc 1988221 ctgcccaggt cgacacatag gtgaggtctg ctacccacag ccggttaggt gctggtggtc 1988281 cgaagcggcg ctggacgaga tcggcgggac gggctgtggc cggatcagcg atcgtggtcc 1988341 tgcgggcttt gccgcgggtg gtcccggaca ggccgagttt ggtcatcagc cgttcgacgg 1988401 tgcatctggc cacctcgatg ccctcacggt tcagggttag ccacactttg cgggcaccgt 1988461 aaacaccgta gttggcggcg tggacgcggc tgatgtgctc cttgagttcg ccatcgcgca 1988521 gctcgcggcg gctgggctcc cggttgatgt ggtcgtagta ggtcgatggg gcgatcggca 1988581 cacccagctc ggtcagctgt gtgcagatcg actcgacacc ccaccgcaaa ccatcggggc 1988641 cctcgcggtg gccctgatga tcggcgatga accgggtaat tagcgtgctg gccggtcgag 1988701 ctcggccgcg aagaaagccg acgcggtctt taaaatcgcg ttcgcccttc gcaattcggc 1988761 gttgtcccgc cgcaagcgct tcagctcagc ggattcttcg gtcgtggtcc cgggccgtgc 1988821 gccggcatcg acctgcgcct ggcgcaccca cttacgcacc gtctccgcgc agccaacacc 1988881 aagtagacgg gcgacctcac tgatcgctgc ccactccgaa tcgtgctgac cgcggatctc 1988941 tgcgaccatc cgcaccgccc gctcacgcag ctccggcggg tacctcctcg atgaaccacc 1989001 tgacatgacc ccatcctttc caagaactgg agtctccgga catgccgggg cggttcagag 1989061 aggacttcat cgatgcgctg cgttccaaga ttggcgagaa gtctatgggc gtttatgggg 1989121 tcgactaccc ggcgaccacg gatttcccga cagcgatggc cggtatttac gacgcgggca 1989181 cccatgtcga acagacggcg gcgaactgtc cccaaagcaa gctggtgctc ggcggatttt 1989241 cccaaggtgc ggccgtgatg ggctttgtta ccgcggcggc gattccggat ggggcgccgt 1989301 tggacgcgcc caggccgatg ccgcccgaag tcgccgacca cgtggccgcc gtcacactct 1989361 tcggaatgcc ctcggttgcg ttcatgcact cgatcggcgc gccgccgatc gtcatcggtc 1989421 cgctatatgc agaaaagacc atccagctgt gcgccccggg cgaccccgtc tgttctagcg 1989481 gaggcaattg ggcggcgcat aacgggtacg ccgacgacgg catggtcgag caggccgcag 1989541 tgtttgccgc cggtcggctc ggttaaggca gtgtcagcca ctcgccactc agcccgacac 1989601 cgatcggacg tcgtgaccgg cgggaccgag aactgctcga tccgcaacaa cgccgcgacg 1989661 tggattgtgt cccatggtga gctgtgactt ggagtgcggg tggtgagctg aaggcccgtt 1989721 gtcgaccgaa acggggcgac gtccgcgact tcctgtacaa cctgatgctc tgggatttgg 1989781 gctgcggatg cgcggcgggg gttcgctctg gtgtcgtcgg tgttccgccg cgctacgtca 1989841 agccgtgctg cccatcccgg ccgagtacca gcccaccggc gccgccggca cccgccgtgc 1989901 ccccggcttt tccggcattg ccgccgttgc cgccgttgcc gatcaccacg gcgttgccgc 1989961 cggctccacc cttgccgctg gtggcgccat ccccgccggc gccaccgtca ccgccgttgc 1990021 cgtacagccc ggccttgccg ccggcgccgc cgttcccgcc ggcgccggta tcgctggcgc 1990081 cgccggcgcc gccggcgccg ccgaagccgc tgcgaaggcc ggtgatttgg ccggccccac 1990141 cggtgccgcc atcaccgcca gtgccattga ggctgtagcc cccgttgccg ccggccccgc 1990201 cggagccgta gaacaatccc gcgctgccgc cggcgccgcc agcaccggcc ttgcctgaca 1990261 ggctggagcc gccgctgccg ccggcaccgc ccgacgcatt gagggtgagc gagccagcat 1990321 tgccgcctac accgccaccc ccgccggcca tgcccccatg accgccggcc ccgccagagc 1990381 cgccggcacc gtacagccca ccgggcccgc cggcaccgcc tgtccctccg gcccccgtgg 1990441 agccgccgtt cccacctggt ccgccggttc ccccgtgggc gtacagcccg ccggccccgc 1990501 cggccccgcc ggcgcccccg gcggtgctgc cggtcccgcc ggcgccgccg gcccccccgt 1990561 tggcgaacaa cccggcagcg ccgccagtcc cgccggcgcc gccagtggta acgcctgcgg 1990621 tgaaagcgcc gccgccacac ccgccgagcc cagccgcgcc gatgagcaag ccggcgttcc 1990681 cgccggcccc gccgacgccg ccggtggtgg tggcggcccc gccgacacca ccggtaccgc 1990741 cggaaccgat caagaaggcg gatccgccgg cgccaccggc cccgccggca cccgccgttc 1990801 cgacgccgcc ggccccgccg gcgccaccgg tgccaaacag gatcccgcct gccccaccgg 1990861 cgccgcccgc gctgccgttg gtgccggccg caccggcccc gccgttgccg ccgttgccga 1990921 acaaccagcc gccggcaccg ccatcgtccc cggttcccgg cgtcccactg tcgccgttac 1990981 cgatcagcgg gcgtccggtc aatgcctcgg tgggttcgtt gatgaaactg agaatgtcct 1991041 gctgcaggtt gtgccatggc gaggtgctct cgggagcgtt atatccgtcg gcgcccagca 1991101 gcaacccgcc gaagccgccg aagccggact tgccggcgag cgcgccgatg ccgccctcgc 1991161 cgccgttgcc gatcagcacg gcatttccac cggccccgcc gacaccaccg gtgccgccac 1991221 tctcgccgcc gttgccgccg ttgccgccgt tgccgatcaa cccgggcgcc ccacccgccc 1991281 cacccgcccc acctgcggcg gtgcccgcgg ggcccccaga gccgccagca ccgccggagc 1991341 cgccggagcc gctgagcatg ccggcgctgc cgccgacccc accctgcccg ccggcggcga 1991401 agccgaaccc gccggtgccg ccacccccgc cggagccgaa gagcatgccg gcgttaccgc 1991461 cggctccgcc ggcgccgccc ttaccaccac cgaagacagt gccgccagcc ccaccggtgc 1991521 cgccggcgcc accggcggca cccagggaaa gcgtcccggc gttaccaccg ttaccggcag 1991581 cgccgccggt ggtcagtcct gacccgcctg ccccgccgtc cccgccggcg ccgaacaaac 1991641 cgccgccccc accgtccccg ccggccccgc cggtgccgag cgttccgtga tccccgaatc 1991701 cgcccgcccc gcccatgccg ccggcaccaa acaacccgcc ggccccgccg gcgccgcccg 1991761 ccccgcccgt gtgaccctgc ccaccggcgc cgccgacacc gccggtggtg aacagcccac 1991821 cggccccgcc ggcgccgcca gccccaccgg cagtgctgaa gctgaacccg ccggcaccgc 1991881 cggccccggc ggcgccggcg agcataccgg cgttgccgcc ggttccgccg gtaccgccga 1991941 tgccaccgac aagagacgtc gcagccccgc cggcgccgcc ggcgccgccg gccccaaaca 1992001 gcatggcgga cccgccagcg ccaccggccc cgccgatccc gttgttggcg gtggcggttc 1992061 cgccggcacc gccggccccg ccgttgccga acagcccggc ggccccacca gggccaccag 1992121 ccccgccgtt ggcgcccttt gcaccggatc cgccggcgcc accgttgccg atcaaccagc 1992181 cggcatcccc tccgttggcc ccggtgccgg gagcaccgtt agccccgtta ccgatcagcg 1992241 gacggccggt agcggccagg acgggcgcgt tgatcgagtt gagcagcggc gtcacggcgg 1992301 cggcctcggc ggccgcatag gcgccccccc cggtggtcag cgcctgcacg aaccgaccat 1992361 gaaacgccgc cgcctcggcg ctcgccgcct gataggcccg gccgtgcgcg ccgaacaacg 1992421 cagcgattgc cgccgagatc tcatcggcac cggcggccag caggctcgtc gtgttggccg 1992481 ccgcagccgc gttggcccca gcgatcgtcg agccgagatc ggctagatcc gtcgccgccg 1992541 ccgcgatagt ctccggcacc gcgatcacaa acgacatctg aaaacctccc acgaccgctg 1992601 accaccaggt aatgccgacg acccaggaag cctcggcgcc gggtgaatcg gtgccaatca 1992661 gcgtatgggc gggcaggcga cccaaccggt gttccagccc gactcatacc cgctgtcaaa 1992721 tgacctgaca atcactcggt ggtcacacgc tgcgtgcttc acattggtag cttgggcacg 1992781 tcggcaaccg tcacagctgt cacacgggtc cctgtggggt tggtcggcca ccggcgacaa 1992841 cgtttcctgc gcgccttgat ctgtcgccgc tgggcaggca tcgccgcgac ggccgtatca 1992901 ggcttggtcg gtgtgagccg ccaaatcggt attgacgaat tcgtcatcga actcccggcc 1992961 aagaccactt aggtctgatg gcctggttct cgtcctcaag ccgcgttagc accacttcgg 1993021 gacgccacgc ggttcagccc gttctcctcg aatagcagcc tgccggtgcc accggcgtct 1993081 gggcacccca gactttcgcg ccgctgtcac ccgttgcgaa ggcccccgca atggcacggt 1993141 caccgacatg tgatgccgag gggctgcgcc ggggctagat tcgcgtgcaa tgcgtgccta 1993201 aactttttgg cggggttggg gatttctgaa ccgatcagtc ccgggtgggc ggctatggag 1993261 cgactaagcg gactcgatgc tttcttcctc tatatggaga caccgtcgca gccgctgaac 1993321 gtgtgctgcg tcttggagtt ggacacctcg acgatgccgg gcggctacac gtacggccgg 1993381 tttcatgccg cgttggagaa gtatgtcaag gcggcgcccg aatttcggat gaagctcgcc 1993441 gataccgagc ttaacctgga tcaccccgtg tgggtggacg acgacaattt tcagatccgg 1993501 caccacctgc gccgggtcgc tatgcccgcg cccggagggc gtcgcgagct ggccgagatc 1993561 tgtgggtaca tcgccgggtt gccgctggac cgtgaccgcc cgctgtggga gatgtgggtc 1993621 atcgaaggcg gtgcccgtag cgacaccgtg gcggtgatgc tcaaggtcca ccacgccgtg 1993681 gtcgacggtg tcgccggtgc gaacctgctg tcccacctgt gcagcctgca gcccgatgcg 1993741 ccggcaccgc aacctgtccg gggcaccggt ggcggcaatg tgctgcagat agctgcgagt 1993801 gggctggagg ggttcgcgtc gcggccagtg cggctggcga cggtggtacc ggcgacagtg 1993861 ctcacattgg tgcgcacatt gctgcgtgcc cgtgagggcc gtaccatggc cgccccgttt 1993921 tcggccccac cgactccgtt caacggcccc ctcggtcggc tgcgcaacat cgcgtataca 1993981 cagctcgaca tgcgcgacgt caagcgtgtc aaggaccggt ttggggtgac catcaacgat 1994041 gtggtggtgg cgttgtgtgc cggagcgcta cggcgcttcc tactcgagca cggcgtgctg 1994101 cccgaggccc cgttggtggc caccgtgccg gtttcggtac acgacaagtc ggaccgaccc 1994161 gggcgcaacc aggccacctg gatgttctgt cgggtaccga gccagatcag cgaccccgcc 1994221 cagcgcatcc gcaccatcgc cgccggaaac accgtcgcta aagaccacgc cgcggccatc 1994281 ggccccaccc tgctgcacga ctggattcag ttcggcggct cgacgatgtt cggagcggcc 1994341 atgcggatct tgccgcacat ttcgataacg catagccccg cctacaatct gatcctgtcg 1994401 aatgtgcccg gaccccaggc ccagttgtac tttctgggtt gccgaatgga ctcgatgttt 1994461 cccctcggcc ccctccttgg caacgcgggc ctcaacatca ccgtcatgtc cctcaacggg 1994521 gaactgggtg tcggcattgt ctcctgcccc gacctgctgc cggacctgtg gggcgtggca 1994581 gacgggtttc ccgaggcgct caaagagctg ctggagtgca gtgatgacca gccggaaggc 1994641 agcaaccacc aggactcctg agtcgtacgt tcagaaccgg tagtcggtgc cggtgcccag 1994701 aacttcgatg gctgcgttga tgttcgggat cactgtggcg ccgtatcggc tgacgatctg 1994761 cccaagcgcg cgagcaaggt gcggacccac ggcctcggcg atgagggcgt cctcggcgat 1994821 gacgatgccg ttcaccatgt gggcagcgag cagccggccg tcgtgggcga cctcgtaacg 1994881 ccagtcaccg atctggattc gctcgggatt agaccgaaaa aagccacgtc gtgcgggggt 1994941 atgaatcact cccggaagtc cggcgaacac tttgaccacc aacgcgacac cgccgggacc 1995001 gacgagcgcg gccgcgacgg cgcggctcac tcgctcggta tcgaaatcag acatcagctg 1995061 tccatcggca ggacgaatga cggtgtgatc gtttccccgc tgccggtgcg gcgcacggcg 1995121 gttcccgcgg tgtagaactc caccgtgtgc actccccacg cgtagttcga gatggcgaag 1995181 tgcacgccca ccacgccggt tgcgccgtct cgctcggcct cgctctgcat gcgtgacatt 1995241 gccagctcac gcgcttggta gttgccttgc gtccactgtg gcatctccat gttgcggccg 1995301 atctggcgaa gcgtttgcat gaatccctgc acggcgatgt ggaatacgca attgcccatc 1995361 acgaacgcca ccggcgcaaa cccggatcgc agcagcgtca ccatgtcctg gccggataga 1995421 tgactggaga atgcttggcc gttgggacgc cgaaatgctc cgggcttggc ggtgtatcgc 1995481 actgcggtac cgaccgccat gaactcaagg tgttccccgc cctccccatg gtggcgccag 1995541 ttgagccgga caccgacgat cccgtccgct ttgagggcat cggcttcggc ctgcatgcgc 1995601 gccatcgcat tccagcgcgc ccggtatgtc gcctcggtga ggacacccag ttcctgttgc 1995661 tgcctcatgc cgctgaattg gaagccgacg tgatagaccg agacacccat gaccagctcg 1995721 atgggctcaa acccggcccc atgcagcaat gcgaactcgt tgatcgacaa gtcggacgtg 1995781 aatgacttct cagcgtgcga cagccgttcg ctggctactg gatcgagcga gcttgattgc 1995841 atcgttgtgc gtccttcctg tggtgtgtgt cagcgtacga cgcgcaaacc atgcagcgtc 1995901 tgccatcagc gtccccaggg catcggcggc gtcttggcgc cggcaacgct gttgtctggc 1995961 agtcgcgccg gggagtcgac gctaccggtc ggcaccgcgc cggccgcgca tgagtgaggt 1996021 ggcagcgcgt aacgcgccgc gtagtgcgta gacggcagtc accgccgcca acaggatcaa 1996081 caggacaaag gtcggcaacc tgaaccgccc cggcatgtcc ggagactcca gttcttggaa 1996141 aggatggggt catgtcaggt ggttcatcga ggaggtaccc gccggagctg cgtgagcggg 1996201 cggtgcggat ggtcgcagag atccgcggtc agcacgattc ggagtgggca gcgatcagtg 1996261 aggtcgcccg tctacttggt gttggctgcg cggagacggt gcgtaagtgg gtgcgccagg 1996321 cgcaggtcga tgccggcgca cggcccggga ccacgaccga agaatccgct gagctgaagc 1996381 gcttgcggcg ggacaacgcc gaattgcgaa gggcgaacgc gattttaaag accgcgtcgg 1996441 ctttcttcgc ggccgagctc gaccggccag cacgctaatt acccggttca tcgccgatca 1996501 tcagggccac cgcgagggcc ccgatggttt gcggtggggt gtcgagtcga tctgcacaca 1996561 gctgaccgag ctgggtgtgc cgatcgcccc atcgacctac tacgaccaca tcaaccggga 1996621 gcccagccgc cgcgagctgc gcgatggcga actcaaggag cacatcagcc gcgtccacgc 1996681 cgccaactac ggtgtttacg gtgcccgcaa agtgtggcta accctgaacc gtgagggcat 1996741 cgaggtggcc agatgcaccg tcgaacggct gatgaccaaa ctcggcctgt ccgggaccac 1996801 ccgcggcaaa gcccgcagga ccacgatcgc tgatccggcc acagcccgtc ccgccgatct 1996861 cgtccagcgc cgcttcggac caccagcacc taaccggctg tgggtagcag acctcaccta 1996921 tgtgtcgacc tgggcagggt tcgcctacgt ggcctttgtc accgacgcct acgctcgcag 1996981 gatcctgggc tggcgggtcg cttccacgat ggccacctcc atggtcctcg acgcgatcga 1997041 gcaagccatc tggacccgcc aacaagaagg cgtactcgac ctgaaagacg ttatccacca 1997101 tacggatagg ggatctcagt acacatcgat ccggttcagc gagcggctcg ccgaggcagg 1997161 catccaaccg tcggtcggag cggtcggaag ctcctatgac aatgcactag ccgagacgat 1997221 caacggccta tacaagaccg agctgatcaa acccggcaag ccctggcggt ccatcgagga 1997281 tgtcgagttg gccaccgcgc gctgggtcga ctggttcaac catcgccgcc tctaccagta 1997341 ctgcggcgac gtcccgccgg tcgaactcga ggctgcctac tacgctcaac gccagagacc 1997401 agccgccggc tgaggtctca gatcagagag tctccggact caccggggcg gttcaggccc 1997461 cgatggtgtg cccggtggtg atacgggcac accagcacca ggttggccag ctcggtggcc 1997521 ccaccgtcct gccaatgtcg gatgtggtgg gcgtgcaaac cccgggtggc cccacaaccg 1997581 ggaaccacac acgtgcggtc gcgatgctca agcgcacgac gcaaccgacg attgatctga 1997641 cgagtcgttc gaccgcagcc aatgacctgc ccgtcacgtt caaaccaggc ctcaaaggtg 1997701 gcatcacaga gcagatatcg gcgttcggac tcgctgagca gcggacccag gtgcaggcca 1997761 gcggcacgct cctgcacgtc tagatgcatc accacggtgg tgtgctgccc atgtggccga 1997821 cgagccacct cggcgtccca gccggcctca accagacgca gaaacgcctc aacattgccc 1997881 ggcaacgggg gccgctgatc cgacacaccg tcgctgttgt cgtgatcacg cttgtactcg 1997941 gcgatcaacg catccagatg agactgcaac gccgcatcga acttcgccgc ctccacgtgc 1998001 ggaagcttga ttcgccaaca actgaactgc tcatcggcgc tcctggtgat cgagggccgc 1998061 ggttccggcc gaaaatccgg ttcgggttcg ggtcgcggtt ccaacttgag cgcggtccgc 1998121 agctgattca ccgtggcaac gccggccaac tgcgcataat gcgcatccga accctcaccc 1998181 gcccgccccg cgatcacccc aacctgatcc aacgacaacc gcccctcccg cataccccgg 1998241 gcgcagcgcg gaaactccgg caaccgccgc gccaccgtgg cgatcgtgtg ggcgttgcct 1998301 gacgagcagc ccatcttcca ggccaccaac cccgccaccg accgcgcccc cgtcacaccc 1998361 cacaacccgt cgcgatccag ctcagccacg atctccacaa tgcgcccatc aatcgcattg 1998421 cgctgaccgg ccaactccgc caactcctca aacaacacct ccacacgctc ggcaggactg 1998481 actaccgctg cgccagacgt cgcggtcgag gacatgagtt catcatcgca gcagggtctg 1998541 acaactccgg ccaacccgaa tccacgcccg gggccgtgcc gtcatcaccc cgcaaagaga 1998601 tgctcggctc cgccggtacg ggcaccccac gatccaacac cgcctgctca gccgccgacc 1998661 actcaacaac cacaaccgtc aatgcagtta acccggcccc accacggccc caactacggc 1998721 gctcgatcca gcgcgatcca acaacaccaa aaccacacga tccgcaccgc actcgccccc 1998781 cgaaacggtc ctcacgatgc ccacgatggc cacctgaact atcccaggct ttgttcctag 1998841 tcggtgcgag ggccggggtt ggctggctcg cggggtgtga ggtgccggtg agggcggcct 1998901 cgtactcggc ctggactccg gtagcctagg gcttcgtgca ggcattcctg gttgtaccag 1998961 ccagccattc ggcggttgcc agtttgacgt cgtcgatgca ccgccagggt ctgccgcggt 1999021 tgatcaactc ggacttggag gcgacgttga ccgcgagggc gttgtcataa cagtcgccac 1999081 gagacccgac cgaaggggcg atcccgagct cagccagtcg gtcggtatag gtcagcgata 1999141 gttactgcga tccggggtcg gaatgatgca ccaactcaga aagatctgaa tttgattgcc 1999201 aaacagcatg attgaatact tgtacgggca gatcttcggt gcgcatcgtc gccgagacgg 1999261 cccaaacgac gatctttcgg gtgcacacgt cggtgacgaa cgcggtgtag cagaacccct 1999321 gccaggtccg cacgaacgtg atgtcggcga cccacaaccg gttgggctta ctgccttgaa 1999381 ttgccggttt accagatcag ccggccgtgg tcggctacgt cggtgacggt ggtgaacacg 1999441 gccgttgcac gccgcacagc ccggccttgc gcatcaacgg gcgggtttgt tctctgccga 1999501 ggtgccaacc cttgcgtttc atggcctggt gcatcttgtt aatcccgtag accgagtagt 1999561 tgtcgcggtg cgccgtgcgt aggcgaactt gagactatga ctgtgttttc cggccagtcg 1999621 gatgcgccct ggcatggccg gcggtaggag atccaatcgt gcattgtttt cgtgcagcca 1999681 tccaataccc ccctgggtac tatggcggtg ccacttcaac gagatagagg gtgcatgtga 1999741 ttggtgatca agacagcatc gccgcggttc tcaacaggtt acgccgtgct cagggacagc 1999801 ttgccggggt gatttcgatg atcgagcagg gccgcgactg ccgggacgtg gtcacccagc 1999861 tcgccgcggt atcgcgcgca ctcgaccgcg ccggattcaa gatcgttgcg gcagggttga 1999921 aggaatgcgt gtccggggcc acggccagcg gcgcggcacc gctgagtgca gctgagctag 1999981 aaaagctgtt cctggcgctc gcttgaatgg gcccgaagcc atcaataacc aaggccgccg 2000041 tccgtgtata cccatagggg tatattggac gccatgtcgg accagccacg tcatcaccag 2000101 gtcctcgacg acctgctgcc ccaacaccgc gctctacgtc accagattcc ccaggtgtac 2000161 cagcgatttg tagccctggg cgacgccgcg cttaccgacg gcgctctcag ccgcaaggtc 2000221 aaggagcttg tggcgctggc gatcgcggtt gtgcaggggt gcgatggctg cgtcgcatca 2000281 cacgcccaag ccgcggtacg ggccggcgct acagcgcaag aagccgctga ggccatcggg 2000341 gtcaccatct tgatgcacgg tggaccggcc accatccacg gtgctcgtgc ctacgcggca 2000401 ttttgcgaat tcgctgacac aacgccgtcc tagtcgtcgc ggccaccgag cggaccgcgc 2000461 tgacccgggc tgaaacgttc cgaggcggac tggcgaaacg catggtaggt cacgcggaaa 2000521 tgcggggcgt gttggcgcga tggcgatagc ctttgccgag ggttcaatgg tgaccgggcg 2000581 cccgccgggt ttccatgagg cgggaggtcc ctgatgtcct atctcgtcgt ggtgccggag 2000641 ttggtcgcag cggcggcaac agatttggcg aacatcggtt cgtcgattag tgcagccaac 2000701 gcggccgcgg cggcaccgac cacggcactg gtcgcagccg gcggcgacga ggtatcggcg 2000761 gccatagccg cgttgttcgg agcgcatgct cgggcatatc aagcgttgag tgcccaggcg 2000821 gcgatgtttc atgaacagtt tgtccgggcc ctcgccgccg gcggtaactc ctacgccgtc 2000881 gctgaggcgg caaccgcgca atcggttcag caagatctgc tcaacctgat caatgcgccc 2000941 acccaggcgc tgttggggcg tccgctgatc ggcaacggcg ccaacgggct gccgggtacg 2001001 ggccagaacg gcggcgacgg cgggattctg tacggcaacg gcggcaacgg tgggtccggc 2001061 ggggtcaacc aggccggtgg caatggcggg aatgctgggc tgtggggcaa tggcggatcc 2001121 ggcggagccg gcgggaacgc caccactgcc ggccgcaacg gcttcaacgg gggcgccggg 2001181 ggaagcggcg gtttgctgtg gggcaatggc ggtgccggcg gggccggtgg gaacggcggt 2001241 ccggctccgc tcgtgggcgg ggtgggcacc accggtggcg ccggcgggaa cggcggcggc 2001301 gccgggttgt tctacggttt cggcggcgcc ggtgggaacg gcgggatggg cggggtggca 2001361 ccgagcaccg gcccctcgat gggcatcctc ccggccggcg gtgtcggcgg gcctggtggc 2001421 tccggcgggg cgagcgcgct tgccttcggc tccggcggcg tcggcggtgc cggtggcttg 2001481 ggcgggccga ccgatggcac cgtccagggg gtgggcggct tcggcggtca gggcggcaac 2001541 ggcgggcaga gcggcttgtt gtttggcaac gcgggagccg gcggggcagg cgctgccggc 2001601 ggagccggca ccggcgacac cgagagcttc ggcggccacg gcggggccgg cggtgatggc 2001661 ggcgctgttg gcttgatcgg taacggcggg gccggcggca ccggatctcc cggcgctgtg 2001721 gtgggtggta acggcggcgt cggtggtctg ggtggcgccg gcagtcccgg gggtctgttg 2001781 tacggcaccg ggggggccgg cggcaatggc ggaccgggtg gtgacggtgg tactggcgcg 2001841 acggtgggct ttgccggctc cggcggtttc ggcggtgcgg ggggcatcgc ccagctgttt 2001901 ggcacgggtg gcatgggtgg tagcggcggt ggtataggcg ctggcaccac gaccgtggtg 2001961 ccgcccgacg tcgccccggt gggtggcaca ggcggcaatg gcggtcgcgc cgggctgctg 2002021 ttgggtgtgg gtggcatggg cggtaatggc ggtgccacca gcgtcggcgg gacgctctac 2002081 gccgccggtg gaaacggcgg cgacggcggg ttggtgtggg gcaacggtgg caccggcggg 2002141 agcggtggcg ccggcggggc gggcagcgtc ggcaacggcg gtgcgggtgg caacgcggca 2002201 ctgctgttcg gcaacggcgg ggcgggcggg gccggcggcg ccggcggcat cggtgccggc 2002261 ggagccggcg gcttcggcgc ggttctgttt ggcaacggcg gggctggcgg gagcggtgcc 2002321 cccggtggca tcggcgccgg tggcaatggc ggaaacgcgc tgctggtcgg caacggcggc 2002381 aacggtgggg caggtaccgg tggggctgct ggcggtgccg gtggctcggg cgggttgcta 2002441 ttcggccaaa atgggatgcc cgggccgtga gcgccccaac ccaggccaac cccctatggg 2002501 caatctgcac atcaattggc caggtcgaca gcagaccgca cacatctacg agattggttc 2002561 ccgatccgtg ggtggggccg ggaaaagcgg ctgtaagagt tggctaggtt cagtagggtg 2002621 gcggcgtgca tgaggtggct gctcgtgagc aacgttcgga cgggccgatg aggctggatg 2002681 cgcagggccg actgcagcgt tacgaggagg cgttcgctga ctacgatgca ccgtttgcgt 2002741 tcgtagatct cgacgcgatg tggggcaatg ccgatcaact gcttgcgcgc gccggcgaca 2002801 agccgatccg ggtggcgtcg aagtcgctgc gttgccgacc actgcaacgc gaaatccttg 2002861 atgccagtga gcgattcgac gggctattga cgttcacgct taccgagacg ctgtggcttg 2002921 ccggccaagg tttctcgaac ctgttgttgg cctacccgcc gaccgaccgg gcggcattgc 2002981 gtgcgcttgg cgagctgacg gccaaggacc cggacggggc gccgatcgtg atggtggaca 2003041 gcgtggagca ccttgacctg atcgagcgca cgaccgacaa gccggtacgg ctgtgtctgg 2003101 atttcgatgc cggctattgg cgcgccggcg ggcggataaa aattggttcc aagcgctcgc 2003161 cgctgcacac cccggagcag gctcgcgcac tcgcggtgga gatcgcgcgg cggccggcgc 2003221 taacgttggc ggcgttgatg tgctacgagg cccacattgc gggcctcggt gacaacgtcg 2003281 ccggcaagcg ggtccacaac gcgatcatcc gtcggatgca gcgcatgtcg ttcgaagagc 2003341 tgcgcgagcg tcgtgcccgg gccgtcgagc tggtgcgcga ggtcgccgac atcaagatcg 2003401 tcaacgccgg tggcaccggc gacttgcagc tggttgcgca ggagccgttg attaccgaag 2003461 cgaccgccgg ctcgggtttt tacgcgccga cactgttcga ctcgtattcg acgttcacgc 2003521 tgcagcccgc ggcgatgttc gcgctgccgg tatgccgtcg tcccggtgca aagaccgtga 2003581 ccgcgctcgg gggtggctat ttagccagcg gggtcggggc gaaggaccgc atgccgactc 2003641 cctacctgcc ggtcgggctg aagctcaatg cgctggaggg aacgggcgaa gttcagacac 2003701 cgctatccgg tgatgcagcc cgacggctga agcttggcga caaggtctac ttccgccaca 2003761 ccaaggccgg tgagctgtgt gagcggttcg accatctgca tctggtccgt ggcgctgaag 2003821 tagtcgacac cgtccccacc taccggggtg aagggcgcac cttcctctaa tgctgaaatg 2003881 gacgaggccc acccggctca cccggcagat gcggggcggc ccggtggccc aattcaaggc 2003941 gcgcgaagag gagctgccat gacaccgatc accgccctgc cgaccgagtt ggcggccatg 2004001 cgcgaggtag tcgagacgct cgcacccatt gagcgtgccg cgggcgagcc gggtgagcac 2004061 aaggcggccg agtggatcgt cgagcgcctg cgcacggcgg gcgcgcagga cgcgcgcatc 2004121 gaggaggagc agtacctcga cggctacccg aggctgcacc tcaagctgtc ggtgatcggg 2004181 gtggcggccg gcgtcgcggg cctgctcagc agacgtttgc gcatccccgc cgcgctggcc 2004241 ggggtgggtg cggggctggc aatcgccgac gattgcgcca acgggccgcg cattgtgcgc 2004301 aaacgaacgg agacgccccg gacgacatgg aacgcggtag ccgaggccgg tgatcctgct 2004361 ggtcagctaa cagttgttgt gtgcgctcac cacgacgccg cgcacagcgg caagtttttc 2004421 gaggctcata ttgaggaggt aatggtcgag ctgtttcccg ggattgtgga gcgcatcgac 2004481 acgcagctgc cgaactggtg ggggccgatc ctcgcgcccg cactcgccgg tgtcggcgcc 2004541 ctgcgcggca gccggccgat gatgatcgcc ggaacggtgg gtagcgccct ggccgccgct 2004601 ttgttcgccg acatcgcgcg cagtccggtc gtccccggtg ccaacgacaa tctctccgcg 2004661 gttgcgctgc tggtcgcgct ggccgagcgg ctgcgcgagc ggccggtgaa gggcgtgcga 2004721 gtgttgctcg tgtccctggg ggccgaggaa acgttgcagg gcgggatcta cgggttcctg 2004781 gcgcgacaca aacccgagct ggaccgcgac cgcacatact tcctgaactt cgacaccatc 2004841 ggctcacccg agctcatcat gctcgagggc gagggcccga cggtcatgga ggactacttc 2004901 tatcggccat tccgggatct ggtcatccgg gcggccgagc gcgccgacgc gccgctgcgg 2004961 cgcggcatcc ggtcgcgcaa cagtaccgac gcggtgttga tgagccgcgc cggctacccg 2005021 accgcgtgct ttgtgtcgat caaccggcac aagtcggtgg ccaattacca cctgatgtcc 2005081 gatacacctg agaatctctg ctatgagacg gtgtcccacg ccgtcaccgt cgccgaatcc 2005141 gtgatcaggg agctggcccg atgagcccga tatggagtaa ttggcctggt gagcaagtct 2005201 gcgcgccgtc ggcgatcgta cggccgacct cggaggctga gctggccgac gtgatcgcgc 2005261 aggcggcgaa aagaggcgag cgggtacgcg cggttggcag cgggcattcg tttaccgaca 2005321 tcgcctgcac ggacggggtc atgatcgaca tgaccggcct gcagcgggtc ctcgacgtgg 2005381 accagccgac tggcctggtg acggtcgagg ggggcgcaaa gctacgtgcg ctgggacccc 2005441 aattggcgca acgacggctc ggcctggaga accagggtga cgtggatccc caatccatca 2005501 ccggcgcgac cgcgaccgcg acgcacggaa ccggggtgcg tttccagaat ctgtcggcgc 2005561 ggatcgtttc gctgcggctg gtcaccgcgg gcggggaagt gctcagtctg tccgaaggtg 2005621 acgattacct ggcggcacgg gtttccctcg gcgcgctagg agtgatctca caggtcaccc 2005681 tgcagacggt tccgctattc acgttgcatc gccatgatca gcgacgctcg ctggcgcaga 2005741 cgctggagcg cctcgacgag ttcgtggacg gtaatgacca tttcgagttt ttcgtattcc 2005801 cttacgcaga taaggcgttg acgcgcacca tgcatcgcag tgacgagcag cccaaaccca 2005861 cgcccgggtg gcagcgcatg gtcggcgaga acttcgagaa cgggggattg agcctgatct 2005921 gccagaccgg ccgtcgtttt cctagtgtgg cgccgcgact gaaccgcctg atgacgaaca 2005981 tgatgtcgtc ctccaccgtg caagaccgcg cctacaaggt ctttgcgacc caacgcaagg 2006041 tcaggttcac cgagatggag tacgcgatcc cgcgtgaaaa cgggcgcgag gcgctccagc 2006101 gtgtcatcga ccttgtgcgc cgtcgcagct tgccgatcat gtttccgatt gaggtgcgat 2006161 tctccgcccc cgacgattcc ttcctgtcga ccgcatatgg gcgcgacact tgctacatcg 2006221 cggttcatca atacgccggt atggagttcg aaagctactt ccgcgccgtc gaggagatca 2006281 tggacgacta cgccggtcgg ccacactggg gtaaacgtca ctatcagacc gccgccacgc 2006341 ttcgtgagcg ctatccgcag tgggatcggt tcgccgcggt tcgcgatcgc ctcgatccgg 2006401 accgggtgtt tctcaacgac tacacccggc gcgttctcgg tccctgacaa cgaatcaacg 2006461 aaccctcgtg gtgttcggcc gatatcgaca cggtcacaac cgcgtaccga tatcagcggt 2006521 ggtatggcgt aacgggcacg atgcacaaat catggcagca tgcgcgttgg gagccaccgt 2006581 cgcgaaccaa gcgtgcgcgt tcacggattc gtccgcctga gttggcggat atcggttggg 2006641 ttcaacagga ggtagccaac ccatgacggc gaatcgaggg cccgctgcaa tctcgagcgg 2006701 ctcgaactct ggccgcgttc tcgacaccgc ccggggtatc ctcatcgctc ttcggcggtg 2006761 ccccgcagag accgcgttcg acgagttgca caacgccgct caacggcaca gattgccggt 2006821 cttcgaaata gcttgggcac tagtgcattt ggcggtcgag ggaagcacgc catgccggag 2006881 cttcgtcgat gcccagtcgg cggctcggcg ggagtggggt cagctttttg cgcatgcggc 2006941 ggcgtaatgc cagcttggcg gtggtgtggg gaagcaccgc cgccagctaa acggatcggc 2007001 ttcgaatcca ggagcccaat cagcgagtcc agtccggcga gtccgcggcg gcgcgcaacg 2007061 cggcgattat gcgctgctct ttttccagaa atcgtgcggt gggcgccggc accgagatcg 2007121 cgatcacgtt gtcgcccagg gcgcgtcgtg cgatcgcagc cgcggatatc cctggggtgt 2007181 gctcgttgcg gtcgaaagcg ataccggtgc gccggatctc gacgatctcg cgccgtagac 2007241 cttcggccac catgggatcc agacggcaga gcgcggcctc ggcgtcggcg tcgtcgagag 2007301 cagccagcgc cgcttttcca ttcgcggttc cgttcaacgg gaagcggagc ccgacggctg 2007361 agaccgcacg cagccggtaa gacgattcga tctggtcgac aaaccacatt cgctggccgc 2007421 gcagtaccga caggtcgacc gtttcgccgt cggtcgcgcg ggcaactcgc tcgacggtcg 2007481 gccggaacgc cgcggctatg tgggctccgg tgacacttcc gaatcccagc aaacgctcgc 2007541 ccagtgcgaa gcggccgtgc gaatcgacac taaccagccc cacctcgacc aggccgacca 2007601 gcaagcgtcg agtcgtcgat ttggccagcc ccagccgctc gcagagatcg actaggcgca 2007661 ggtgtcccgg ttcggcagct atttcgtcca gcgcggcgac ggcgcgacgg agcacctgga 2007721 tgccttcgtc gcgattcgtt gtcgactttc cttccgtagg cggcacaact gcaatatagt 2007781 gaaccgaaat acggatcaca atgattcgaa atacggacca ggagttttgc tatgagggcg 2007841 ctaccggccg ggcggcactt cttccggggc agtgacgggt acgaggcggc tcgccgcggc 2007901 accgtgtggc atcggcgcgt accggatcgc taccccgagg tgatcgttca ggctgtcagt 2007961 gctgacgaca ttgtcagcgc catccgctac gccacggtca atggccataa ggtgagcgtc 2008021 gtgtccggtg ggcacagttt tgccgccagc catctgcgcg atggcgctgt gctgctcgac 2008081 gtgagccgga tagaccacgc ctccatcgac gccgataagg gccgcgcggt cgtcggtcca 2008141 gggaagggcg gcagcgtgct catggccgaa ctggaggcgc agggcctgtt cttcccgggt 2008201 ggccactgca ggggagtctg tctcggaggt tatctgctgc agggcggata cggctggaac 2008261 agccggatct acggcccggc gtgcgagagc gtgattggcc tggacgtcat caccgccgac 2008321 ggcgcgcaga tccattgcga cgcagacaat cacgccgatc tgtactgggc cgcccgcggc 2008381 gccggtccgg gcttttttgg cgtcgtcacc tcgttttacc tgaagctgta tccgaggccg 2008441 gccacctgtg gcaccagcgt ctatgtctac ccattcgacc ttgccgacga ggtctttacc 2008501 tgggcccgcg cggtcagcgc cgaagtcgac cctcgggtcg agctgcaagc ccttgcctcc 2008561 cgcggtgaac cgagcatggg catcgacgtc cccgtcatct cccttgcctc gcccgctttc 2008621 gctgactcgc ccgaagaggc cgaacaggcc ctcgccctgt tcggcacctg cccggttgtc 2008681 gagcaggcac tggtcaaagt cccttatatg ccaaccgatt tgcctgcctg gtatgacgtc 2008741 gcgatgaccc actacctgtc agaccatcac tacgcggtgg acaatatgtg gacgtcggcg 2008801 tccgctgagg acctgctgcc gggtatccgc tcaatcctgg acacgctgcc cccgcatccg 2008861 gcgcacttcc tctggctgaa ctggggtcca tgccctcccc gtcaagacat ggcctatagc 2008921 atcgaagccg acatctactt ggcgctctac ggctcctgga aggatccggc cgacgaggcg 2008981 aagtacgccg actgggcgcg gtcccacatg gccgcgatgt cgcatctggc ggtcggcatc 2009041 cagctcgccg acgagaacct cggtgcgcgt ccggcgcgct tcgccagcga cgcggccatg 2009101 gccaagctcg accgggtgcg cgccgaatac gaccccgacg gtttgttcaa cagttggatg 2009161 ggaagaatct gatggccagc gatctgtacc tgggctaccg caacgacgac gcggacacgc 2009221 cgttcggcaa gttcttcaaa cccgagatgg ccccgctgcc acagcatgtc gtggtggcgt 2009281 tgcagcatgg cccccaggcc gggatggcgt tgctcgcctt cgacgacgcc gcgagcatcg 2009341 ttgatgaggg ctatcagcag accgagaacg gctacgggat tctcggcgac ggcagcatgc 2009401 aggtatccgt gcgcaccgac atgcccgggg tcactcccgc gatgtgggca tggtggttcg 2009461 gctggcacgg cagcgacacc cgccgctaca agctgtggca cccgcgggcc catctatcgg 2009521 cgcggtggaa ggacggcgac caggacagcg gggccggccg tcggggcgcg cagcgttacg 2009581 tcggccgctg gtcgatgatc agcgagtaca tcggctcgac gaaactgggt gccgcaatac 2009641 aattcgtcga gccggcggcc atgggtctgc ccgacgacag cgacgatacg gtgtcgatct 2009701 gtgcgcggtt gggctctgct gacgccccgg tggatgcggg ctggttcgtc catcaggtcc 2009761 gatcgacgcc gggcgggtcc gagatgcggt cacggttttg gatgggcgga ccgcacatcg 2009821 cggtgcgcaa ggcacccgag gtcgcgtcca aggcggtgcg tcccatcgcg tcgaagctaa 2009881 tcggcgtctc ggaatcgacc gcgcgtaatc tgctggtgta ctgcgcgcag gagatgaacc 2009941 acctggcggg gttcttggcg gacctgtggg aaagcttcgg tgacgagtga ggtttcagct 2010001 ttgctcggca aacgctggcg ccacgtattt ttcgaccagc cggcgttcgg cttcgtcgtt 2010061 ctcagctggc caatacatca gtgagagcac cacgcgtacc acccatttcg cgccttgcgg 2010121 atcaccgccg gctatgccgg tgagctcggt agcaaagtcc gcaagcaacg gtgactcggt 2010181 gagccaggcc aattcaccgg cgccaccgtg gatcgagccg aacatgagct tgcccagcgg 2010241 gtcggatcgg attcgctgaa gcgataacag gatcgccgcg acgactcgct cccgcccccg 2010301 cagagtttcg acatccgagc gcacgccgtc ggcgatccgg gccgcggccc gggtcagaac 2010361 gacatcccgg atctgggcct tgccgccggc acggcggtag atggtcgctc gggagcagtg 2010421 gacctcgcgg gctaatttgt cgatgtcgag tgcgttgagc ccgtagcgcg taatgaggtc 2010481 ggttgcggcg gcgtagatcc gttcggcagc gatcgtgcgg cggttgccgc ccacgatcca 2010541 atcgttaccc ggcactggtc aggcgcattt ccatcgagag gcgaagagcg attcttctca 2010601 tagtgagaca caagccttac ttattctcat cgtagttgca ggtccgcctc ccgcggtgag 2010661 acgttcgccg aaaggctccc cgggcgcagt tctcgacttg cagcgacgcg ttgaccaggc 2010721 ggtatccgcc gatcacgctg aactaatgac aattgccaag gatgccaaca cgttctttgg 2010781 tgccgaatcc gtgcaggacc cctacccgct gtatgagcgc atgcgcgccg caggctcggt 2010841 ccaccggatc gctaactcgg acttctatgc cgtgtgcggt tgggacgctg tcaatgaggc 2010901 catcggtcgt ccggaggact tctcctcgaa tttgaccgcc acgatgacct atacggccga 2010961 gggcaccgct aaaccgttcg agatggaccc actcggcgga cccacacacg tgttggccac 2011021 cgccgacgat cctgcccacg ccgtgcaccg caagctcgtg ctgcgtcact tggcggccaa 2011081 gcggatccgc gttatggagc agttcaccgt acaggctgcc gaccggctgt gggtcgacgg 2011141 catgcaggat gggtgcatcg aatggatggg cgccatggcc aatcgcctac cgatgatggt 2011201 cgtagctgag ctcatcggcc tgcccgaccc cgacatcgcc cagctggtga agtggggata 2011261 cgcggccact cagctactcg aagggttggt cgaaaacgat cagctcgtcg ccgcgggtgt 2011321 ggcgttgatg gagctcagcg gttacatctt cgagcagttt gaccgtgccg cggccgatcc 2011381 gcgggacaat ctgctcggtg agcttgccac cgcctgcgca tcgggggagc tggacactct 2011441 caccgcccag gtcatgatgg tcaccttgtt cgccgccggc ggcgagtcca cggcggcgct 2011501 gctgggcagc gcggtatgga tactggcgac acgtcccgat atccagcaac aggtgcgcgc 2011561 gaaccccgag ctgctgggag cgtttatcga agagacgctg cgttacgagc cgccatttcg 2011621 cggccactac cgccacgtgc gaaacgccac caccttggac ggcacggaac tgcccgcgga 2011681 ttcgcacctg ctgctgttgt ggggcgcggc caaccgcgat ccagcccagt tcgaggcacc 2011741 cggcgagttc cgtcttgacc gtgcaggagg caaaggccac atcagtttcg gaaaaggggc 2011801 ccacttctgt gtcggcgctg cactggcacg cttggaggct cgaatcgtct tgcgtctgct 2011861 gctcgatcgc acctcggtaa ttgaggcagc cgatgtcggc gggtggttgc ccagtatcct 2011921 ggtgcgccgc atcgagcggc tagagctagc tgtacaatag gcgctcgacg actcctattg 2011981 cagcacaacg gatatcagca acagcaggtg ccaaccgcgg cgatcggatg cgtgagaata 2012041 gtgaaagtgg ttgtcgcggt caggatttct gcgatcaacc ctacccgcat gacgccggcg 2012101 ggtggccccc gccggccacg ataaatgctt cgaccgccgt ggcccgctcg taaccttcga 2012161 cctccacgcg ccactcgtag attcctggtt ccaagggaat tcccgcagga atgttgaggg 2012221 tgagcggcat gcgaaccgag gtgccgtgga ttgcgccagg agcgcggccc gcctcggcgg 2012281 cggcttcaaa gaggatccgc tgcggcccgt gtggtcccgg cacgaccacc ggatcgccgt 2012341 cggcggtgag caactggcat ttcagctggt gctgcttatt ggtctcatcc cagtcgatgt 2012401 caaggaacag taccaaagcg aatggggggg tcggtgtttg gcattgccgc cagcccagcc 2012461 cgagcgcatg gaccttcccg gactgggcat cagcctgcgc cgcgtccgac aggaacagac 2012521 tgaccctcat gtcgccgccg ctgcgatcga actcccgggt tccgattcca cccctgtcct 2012581 tccccaatgt gcaactagcc gaaggtcggt caataccgca cccacttaga ctgactccat 2012641 cccgacggca ggataatacg tggcgaccgg tagatctatg ttgtgctatc tgggcggtgg 2012701 cagctggcgc ggaccgtcgg gggaacgcag ttcatgcgga ccctcccgtt gggtcagctc 2012761 cccggtcgac ccaactgata ggctcgccag gtctcgcgga ggcacctgcg ctaacggcgg 2012821 gtgatcatcg ttggtagcca gccggctacg gcgcgcagag tccgacgatg cgatcgggtt 2012881 gctttcggca ggggcagccg gggagtggga ttccagcggg ggctggtcgt tggggtcgga 2012941 accttcggca ttgaccgtca ccttgtgggt gcgcttgaac gaaaacgcga tctcctcgac 2013001 ctcttcgaac acctgacgcg cggtgcgcaa cggcgcagtg gtgttgtcga gcattcgtgc 2013061 gacaaacggc ggcaccagtg ggcgaatcca ccgagccgca gccttggtgg cgtctgggat 2013121 cgaccggatc actggggttg cacgcttctc gcgtggttca accgcaccct ccggttggac 2013181 ctgtgccggc aggtttttcg cgataccggg gcggtgatgg gctgccccag caggcaattg 2013241 ggcgaccgtc cggctggcag cttcggtttc ggcggcgatg ggcagataca cctcgtcctc 2013301 cactggctgc aaggttcgct ccgacgaagc gcgcactgga ccttcgagcg cttcgacgac 2013361 tcggcggcgt tgctgttcgc ggtcgatttc ggcctgggcc tcgatggcga ggcgggtctg 2013421 ggtgagttgg tgctcggccc acatgatttc ggcggcccgg cgaacctccg cccgcttgat 2013481 cgcgatcgcg gtgtccgcct ccaactcggc gcgctcccgt tccgctcgcg cggccgcgtg 2013541 gcgatcgtgc gttgtgtcac cgcgccatag ccggagaatc agcggcaaca ggtacagcag 2013601 cgcgaagaaa gcgatcgcca gcattcgcgc cgtcaaggcg ccggcgctgg ccaatgtcag 2013661 atcgttcatg gcgacccagc gcgagcccaa accacgaccc gcatccgcga ccacggcctg 2013721 gcgcacctcg gcaagagcct gctcgtcgtg tgccattttg gcgtccaagg caggtgcttg 2013781 gtgatcacga gccgccagcg cgttgtccag ctcacgctgc gcgtcggcga gaagctggtt 2013841 cgccgttcgt gtttcgggcc ctcggccggg aacgccggtg atccgggtct gcgggcaggc 2013901 cggagttggg tggtattcgc agcgtgcgac gaccagcgca tcgtccagtc gtccgcgcgc 2013961 ccgctcgacc gcgctgtcca gcgcagtgcg cgcattgcgg gcctgttgca gggaggccga 2014021 cgcttgcacg gccgccggcg tcgcgtcggc gctgtgcata gcttgttcat cgagacggcg 2014081 gtcgatggca ccggaaaaca tgaccagcgc agcgagttcg ccgacgacga aaccgacggc 2014141 gaccgcgacg gacgcgcgtc ccgtaacgcc ggcccgaccg cgagctgggc cactggccgt 2014201 accgcgggtc accgcgccga ccagcaggcc gagcaccagg gcgagcgagg cagccccgat 2014261 gggggacgag atcggcccct gggccgcctc gctcaccgcg aggctcgcga ggagtccggc 2014321 cagcgcggcg cccacggcca caatcacgcc ggccacggcg tgcgtggacc gctcgtgacg 2014381 ctcgccgagt tcgcgccagt gtccgccgcc gagccaggta agcagcccct cgattccgga 2014441 cacggccgag cgctgctcag catattcgtg ggcgcacatg agactgaaaa cacctcctgc 2014501 tggtcaagcc tggcaggccc ccgcccgaca caccgaatcg aagcggcccc ttgtggtgtt 2014561 gttcacaact gcgcgagaga tgacgcagat cacgtcgcgg ctgcccagcc gaatcctcag 2014621 cgagttcaat gtcaaaatta ccgcggcgcg agcggatcag cggccattat ggcaggtgac 2014681 gtgagacggt atacacctat gcaaaatcac gactacgtta cctacgaaga gttcggccgc 2014741 agattcttcg aggtagcagt taccccggac cgcgtcgccg ccgcgtttgc cgacatcgcg 2014801 ggcagcgagt tcgcaatgga accgatctcc cagggccccg gcgggatcgc caaggttagc 2014861 gcgaacgtca agatccgaga gccccgggtg acgcgaaagc tgggtgacct gatcacgttt 2014921 gtcatccata tcccgctgtc gatcgatctc cttcttgacc tgcgcctcga caagcagcgg 2014981 tttatggtcg ccggcgacat cgcgctgcgc gccaccgcac gcgccgccga gccgctgcta 2015041 ctgattgtcg acgtcgccaa accgcggccc tctgatatca cggtcaacgt gtcgtcgaag 2015101 tcgatccgcg gtgaggtgtt gcgcatcctc gcaggcgttg acggtgagat tcggcgattt 2015161 atcgcccagt acgtctctgc cgagatcgac tcgcccaaat cccaagccgc tcaagtcatc 2015221 aatgtggccg aacaattgga ctctacctgg agcggcccgt agccagctct ggatgcagtc 2015281 tggctgccgg ccaccgaaag ctcaccaaca gctcatcggt gaggtcgtcg cagcgcgcac 2015341 cgcctcggcc aaggtggctg ccctgcgatc ggtgaagatg tcctcgagca gcatcggctg 2015401 accgtcgggg ccggtgagcg gcacccgcca gttcgggtac tcgtcggtgg tgccaggctg 2015461 gttttgcgtc cggcggtcgc cgaccgcatc ggtcaacgcc actgccaaca gccgcgaggg 2015521 cgttcggccc aggtagcggt agagagccag gacggcctcc tccgagtcgg gctcggcacc 2015581 gtccgccagc agtccgaccc ggcgcagctc ggccatccag gctgcccggt cggcccgggc 2015641 ggattcgagt tccgcctcca cggggttggt taacaaccca agggactcgc gcagccgtac 2015701 ctggtcgccg gccaggtagc cggcggtcgg cggcagatca tgggtggtca ccgacgacaa 2015761 gcagtactcc cgccagcgtt cggccggcaa tggtgttcca gccggcccgc aatctcgatc 2015821 ctgctcaaac cagagaattg aggtgcccag caggccccgc aatagtagat agtcgcgtac 2015881 ccacggctcg acggtgccga gatcctcacc gacgacaacc gccccggccc ggtgggcttc 2015941 cagggcgacg atgccgatca tcgcgtcgtg gtcgtagcgc acataggtgc cttgggtggg 2016001 cggtgcgccg tcggggatcc accacaaccg gaacagcccg atgatgtggt cgatgcgtac 2016061 cgcaccggcg tgccgcaacg cggcctggat cagcgcgcga aacggtcggt actcctgctc 2016121 agcgagccgg tccggccgcc acggtggctg cgaccagtcc tggccgagtt ggttgaactc 2016181 atccggcggc gcacctgcgg tcacaccttg ggccagcacg tcctgcagag cccaggcgtc 2016241 ggccccgttg gggtgcacgc caacggcgag gtctgccatg atgcccagcg acatgccggc 2016301 ccggagcgcc tgcgactgcg cactggcgag ctgctcgtcc agctgccact gcagccagcg 2016361 gtggaaatcg acggcatcgg cgtgtttgtc gacgaaatcg gcgacacctg aggcatcggg 2016421 atgccgcagc gatttcggcc atcgatgcca atcatcgccg tacgtctcgg ccagcgcgca 2016481 ccaggtggcg aagtcgtcga gggcgcggcc ctcgcgggta cggaaggcgg cgtaggccag 2016541 ctcgcgaccc gccgaccgcg gcacccggtg cacgagcttg agtgctgcgc gtttggccgc 2016601 ccaggcgctg tcgcggtcaa tggtgtcgag ctggtcggcg tgctgttgca cgttggtgcg 2016661 caaccgttgc acccggccac gcttgggcag atcgacgagt tccggaatgg cctccacccg 2016721 aaggtagaga gggttgacga agcgtcgcga tgtcggcagg tagggcgatg gttcgattgg 2016781 cttcgagcgc ccagcgggcc cgggaagcgt agccgcatgc aggggattga ccagcacata 2016841 gccggcaccg tgcgcagacg ccgaccacag cgcgagattc gccaaatcgg tgagatcccc 2016901 gatgccccat gactgccggg accgcacgct gtagagctgg acggccaggc cccaggcacg 2016961 acggcctgcc agcttgtccg gcagccccaa ccaatccggc gtcacgacaa cagcggcgct 2017021 ggcctgcgag tcgcccgaac gcagattcac ccggtggtag ccgaggggca ggtcggcggg 2017081 caacacgaag ctggcctcgc cgatccagcg tccgtcaaga tcgaatggcg gggtgaaatt 2017141 gtcgacctgc accacctcgg cacgtgtcgt gccgtcctcg agctgcaacc acacgtcggc 2017201 cggagcgcca tcggtcacat gcaccctgaa ctgcgtctgc tctccggcgc gcatgacgat 2017261 ggtcgccggc aatggacgcg cccagtagga tcgcagctgc gcggccaggg cgtcattgcg 2017321 ttgctgttcg gtctgggcgg gaacgccgag ggcggcaaga gcagccacca atgtagcctc 2017381 ggagaccagc acctgccggc cagtccagtc cgtgtactcg gtggcaatgc cgaatcgtcg 2017441 ggcaagttcg accagcgaag gcgcgagctc ggtcatgtcg cccatcttgc gtccggcacc 2017501 cgtgtgcggg cgagcgcagg aatctgagcc ttccgtcagc acagcacggt tggctaccga 2017561 acaccactac gttgcaggtc aacgaggtag actgcggagc ggacagttcc acaggcggac 2017621 tcggtcattc gccgctacca tgcccagtga agacacgacg aatccttggg ggatccgcgc 2017681 agtggcaaat acccaggtca atgtccaggt gttctgagca gaccggaagg tgatctagcg 2017741 tggctgaaga gagccgcggg cagcgggggt cggggtatgg ccttgggttg tccacgcgga 2017801 cccaggtaac cggttatcag ttcctggcgc gtcgaaccgc aatggcgttg acacgctggc 2017861 gtgtgcgtat ggagattgag ccgggtcggc ggcagacgtt ggcggtggtg gcgtcggtgt 2017921 cggcggcgtt ggtgatctgt ctgggggcgc tgttgtggtc gttcatcagc ccgtccggcc 2017981 agttgaatga gtcgccgatc atcgcagacc gcgattccgg tgcgctctat gtccgtgtcg 2018041 gtgacaggtt gtacccggcg ctgaatttgg catcggcacg gctgatcacc gggcggccgg 2018101 acaacccgca cctggttcgg tcaagccaga ttgccaccat gccgcgcggt ccgctggtgg 2018161 gtatcccggg tgcgccgtca tcgttctcgc caaagagtcc acccgcgtcg tcttggctgg 2018221 tctgcgacac ggtagcgacc tcgtcaagca tcgggtcgct gcaaggcgtg acggtgacgg 2018281 tcatcgacgg gaccccggac cttaccggtc accggcagat tttgagtgga tcggacgcgg 2018341 tagtgctgcg ctacggcgga gatgcgtggg tcatccggga ggggcgccgg tcacgaatcg 2018401 agccgacgaa tcgagcggtg ttgttgccgc tggggttgac gccggagcag gttagccagg 2018461 cgcgtccgat gagccgggca ttgttcgacg ctttgccggt cgggcccgaa ctgttggtgc 2018521 cggaagtgcc gaatgcgggt ggtcctgcga cgttcccggg cgctcccgga ccgatcggga 2018581 cggtaatcgt cacaccgcaa atcagtggac cacaacagta ttcgttggtc ctgggcgatg 2018641 gagtgcaaac gctcccgccg ttggtggccc agatcctgca gaacgctggt agtgcgggca 2018701 acaccaagcc gttgaccgtg gaaccctcaa cgctggccaa gatgccggtg gtgaatcggt 2018761 tggatctctc tgcgtatccg gacaatcccc tggaagtggt ggacattcgc gagcatccgt 2018821 cgacctgttg gtggtgggag cggacggccg gtgaaaaccg ggcccgtgtg cgggtcgtgt 2018881 ccgggcctac cattccggtc gcggcgaccg agatgaacaa ggtggtgtcg ttggtgaagg 2018941 ccgacacgag tggccgccaa gccgatcagg tctacttcgg ccccgaccat gcgaacttcg 2019001 tggccgtcac cggcaacaac ccgggggccc aaacgtccga atcgctatgg tgggtgaccg 2019061 atgcgggcgc gcggttcggg gtggaggaca gcaaagaagc gcgtgacgcg ttggggttga 2019121 ccctgacgcc gagcctggcg ccgtgggtgg cgctgcggct gctgccacag ggccccacgc 2019181 tgtcacgagc ggacgcgttg gtggagcacg acacgctccc aatggacatg acccctgcag 2019241 agttggtggt accgaaatga agcgtggttt tgcccgcccg acaccggaaa agcctccggt 2019301 catcaagccc gagaatattg tcctatcgac accgctgagc attccgccgc cggagggcaa 2019361 gccctggtgg ctgattgtgg ttggcgtcgt ggtggtgggc ctgctgggcg gcatggtcgc 2019421 catggttttc gccagcggat cacacgtgtt cggcggcatc ggctcgatct tcccgctctt 2019481 catgatggtc gggatcatga tgatgatgtt ccgcggcatg ggcggcggcc aacagcaaat 2019541 gagccggccg aaattggacg cgatgcgcgc tcagttcatg ttgatgctgg acatgctgcg 2019601 cgagacggcc caagagtcgg ccgacagcat ggacgccaac tatcggtggt tccacccggc 2019661 gcccaatacg ttggcggccg ccgtggggtc accccggatg tgggagcgca agcccgacgg 2019721 taaggacctg aacttcgggg ttgtccgcgt cggcgtggga atgacgcgtc ccgaagtgac 2019781 ctggggtgag ccgcagaata tgccgaccga catcgagctg gagccggtga caggtaaggc 2019841 gctgcaggaa ttcgggcgct accaaagcgt cgtgtacaac ctgccgaaaa tggtttcgct 2019901 gctggtcgaa ccctggtatg cgctggtcgg ggaacgcgag caggttctgg gtttgatgcg 2019961 ggcgatcatc tgccagctgg cgttctccca cgggcctgac catgtccaga tgatcgttgt 2020021 cagttccgat ctagaccaat gggactgggt gaagtggcta ccgcatttcg gtgactcgcg 2020081 gcggcacgac gcggcgggta acgcgcggat ggtctacacc tcggttcgtg agtttgccgc 2020141 agagcaagcc gaattattcg cgggccgtgg ttctttcacg cctcgacacg cgagttcgtc 2020201 ggcgcagacc ccgaccccgc acaccgtgat catcgccgac gtcgacgatc cgcaatggga 2020261 gtacgtgatc agcgccgagg gtgtcgacgg ggtgacgttc ttcgacctga ccggctcttc 2020321 gatgtggact gacatcccgg agcggaagct gcagttcgac aagaccggcg tgatcgaggc 2020381 gctgccccgc gaccgcgaca cctggatggt gatcgacgac aaggcttggt tcttcgctct 2020441 caccgaccaa gtcagcatcg ccgaggcaga agagttcgcg cagaagctgg cgcagtggcg 2020501 gctggctgag gcctatgaag agatcggcca gcgggttgcc cacattggtg cccgagacat 2020561 ctagtcctac tacgggattg acgatcctgg caacatcgac ttcgactcgc tgtgggctag 2020621 ccggaccgac accatgggac ggtcgcgatt gcgggcgccg ttcggtaatc gctccgacaa 2020681 cggcgagctg ctgttcttgg atatgaaatc gctcgacgaa ggcggcgacg gcccgcacgg 2020741 ggtcatgtcc gggacgaccg gttccggtaa gtcgacgttg gtgcgaaccg tgatcgaatc 2020801 gctgatgctc agccatccgc cggaggagtt gcagttcgtt ttggcagacc tcaaaggtgg 2020861 ctcggcggtc aagccgttcg cgggagtgcc acacgtgtcg cggatcatca ccgacctcga 2020921 agaagaccag gcgctcatgg agcgctttct ggatgcgctg tggggcgaga tcgcccgccg 2020981 caaagcaata tgcgacagcg ccggtgtcga cgacgccaaa gagtacaact cggtgcgagc 2021041 caggatgcgt gcgcgcggtc aggacatggc gccgctgccg atgctcgtgg tggtcatcga 2021101 cgagttctac gaatggttcc gcatcatgcc gacggcggtc gacgtcctcg actcgatcgg 2021161 ccggcagggc cgcgcctact ggattcacct gatgatggcg tctcagacca tcgagagccg 2021221 agccgaaaag ctcatggaga acatgggtta ccgcttggtg ctgaaagcgc gtaccgcggg 2021281 agcggcgcag gcggccgggg tgcccaacgc ggtgaatctg cccgcacagg ccggtctggg 2021341 ctacttccgc aagagcctcg aggacatcat ccgattccag gcggaattcc tgtggcggga 2021401 ctacttccaa cccggcgtca gcatcgacgg cgaggaagcg cctgccttag tacacagcat 2021461 cgactacatt cgcccgcaat tgtttaccaa ctcgttcaca ccgctggaag ttagcgtggg 2021521 gggtcccgat atcgagccgg tagttgccca gcccaacggt gaggtgctcg agtcggacga 2021581 cattgaaggc ggcgaggacg aggacgaaga gggggtgcgc accccgaagg ttgggacggt 2021641 gatcattgat cagctgcgca agatcaagtt cgagccgtac cggctctggc aaccgccact 2021701 aacccaaccc gtcgccatcg acgacttggt caaccggttc ctcggccgcc cgtggcacaa 2021761 ggagtacggt tcggcgtgca atctcgtgtt cccgatcggg ataatcgatc gcccctataa 2021821 gcatgaccag ccaccgtgga cggttgacac ctccgggccc ggtgccaacg tgctaatcct 2021881 gggcgccggc ggttcgggca agaccactgc gctgcagaca ctcatctgct cagcggcact 2021941 gactcacacc ccgcagcagg ttcagttcta ctgcctggcc tacagcagca ccgcgttgac 2022001 cacggtctcc cgcatccccc acgtgggcga ggttgccggt cccaccgatc cctacggtgt 2022061 gcgccggacg gtggccgagt tgctggcgct ggtgcgcgag cgcaaacgca gcttcctgga 2022121 atgcggaatc gcgtcgatgg agatgttccg gcgccgcaag ttcggcggag aggccgggcc 2022181 ggtacccgac gacggcttcg gtgacgtcta cctggtgatc gataactacc gggccctggc 2022241 cgaagaaaac gaggtgctga tcgagcaggt gaacgtgatc atcaaccagg gcccctcgtt 2022301 cggggtgcac gtggtggtca ctgccgaccg cgaatcggag ctgcggccgc cggtgcgcag 2022361 cggcttcgga tcccgtatcg agctgcgctt ggcggcggtt gaggacgcca agctggtgcg 2022421 ttctcgattc gccaaggacg ttccggtcaa gccggggcgc ggcatggttg cggtcaacta 2022481 cgtccgcctg gacagcgacc cgcaggccgg cctgcacacc ctggtggctc gaccggcgtt 2022541 gggcagcaca cccgacaatg tcttcgagtg cgacagcgtg gtcgcggcgg tgagccggct 2022601 caccagcgcc caggctccac cggtgcgccg gttgccggcg cggttcggcg tggaacaggt 2022661 gcgggagctg gcctcgcggg acacccgcca aggcgttggc gctggcggaa tcgcctgggc 2022721 gatatcggaa ttggatctgg cgccggttta tctgaatttc gccgagaatt cgcacctgat 2022781 ggtgactggt cgacgcgaat gtggccgcac caccacgctg gccaccatca tgtccgaaat 2022841 cgggcggctc tacgcgccgg gcgccagtag cgcaccgcct cccgcccccg ggcggccctc 2022901 tgcgcaggta tggctggtcg acccgcgccg tcagctgctg accgcgctcg gttcggacta 2022961 tgtggagcgg ttcgcctaca acctcgacgg ggtggtggcg atgatgggtg aacttgcggc 2023021 ggcgttggcc ggtcgtgagc cgccaccggg cctgtccgcc gaagagttgt tgtcgcggtc 2023081 gtggtggagc ggcccagaaa tcttcctgat cgtcgacgac atccagcagc tgccgccggg 2023141 cttcgattca ccgttgcaca aggctgttcc gtttgtgaac agggccgccg atgtcggctt 2023201 gcatgtgatc gtcacgcgca ccttcggtgg ttggtcgtca gccggcagcg acccgatgtt 2023261 gcgggccctg catcaggcca atgcgccact gctggtgatg gacgccgatc ccgacgaggg 2023321 cttcattcgc ggcaagatga agggcggccc gctgccccgc ggtcgaggcc tgttgatggc 2023381 agaagacacc ggtgtgttcg tccaagtggc agccaccgag gtgcgtcggt agttcggcca 2023441 aaccgatcag ctccagcgta gcggcaagtt cttaagcgcg aaggacttgg acgggaaccg 2023501 tatttcgggc gcgtagtccg gcgcgagctc gaagtcggga atttgattca gccactcgcc 2023561 caccagcagg gtgagctcta aacgggctag atgcgaaccc aggcaacggt gtggaccgcc 2023621 gccaaatccc cagtgccggt gcacctttcc atccatcacc aactcgtcgg tggacatcgc 2023681 gtcgctgccg tcgcggttga ctgcggccat gcataaccgc actggtgacc ccgcaggcag 2023741 tgtcatgccg ccgacggtga cgggctcggt ggtaactcgc ggcgccaccg gcgccgatgg 2023801 ctccagccgg acgatctctt cgatgaaaac cctgatctgc ttgggattgt cgcgcagcat 2023861 ggcgcgcagc tgtggtctgc gggcgagctc gagcagcgaa aagcctaccg ctgccgtcac 2023921 ggtgtccagt cccgccagta tcaggaggtg gctcaaaccc aaaacctcga tctcgctcaa 2023981 cgggtcctcg ccgatctgca cttgcgacaa gacgtccggc cctgggtttc gccggcgttc 2024041 ggcgaccatg gccgtgagat actcgagcag ctcgcgcgcc gcagcgacat cggcttcggt 2024101 cgggtgaggt cgatccgaca tggcgatgac ggcgtctttc cagccgatca gacggtcacg 2024161 gtcttcgagc ggcaggccgt acaggacgag aaacaactga aacggaaaca gattcgcgag 2024221 atcggccatc gcctcgcact cgccccggcc tgcgatggcg tcgatcatag cgacagtgtg 2024281 acggcgcagc gacggtagcg ccttgctcaa agcggccggg ctgaagtatg gctgcaggat 2024341 cctgcggtat cgggtgtgct cgggcgggtc gaacgcgagc ggaaccaccg gcagcggatt 2024401 tcccggaggt tgcagcgctt tccgcgacga gaaaaccttc ggattccgca gcgccgcgag 2024461 cacatcttcg cggcgcgtca ggtagtacca gccgttcatg aacaccacgg gccccgcgtc 2024521 gcggagggtc ttccagccga caccccggtc aacggccatc ggtaacgtcg aatattcgag 2024581 ccgcggtaga taaaacgagc cggcgtggtc ctcgccgggg gtggtcatgc gctcaagtct 2024641 ttcgtgtctc cgttcttgtc gcaggtcgca gacgtagcca agcggtgccg acctagccaa 2024701 tatcgcacgt gggcgtgcac ccaccattgt ggtgtcgagc gcatctgggg gctcagcggc 2024761 taatcttcga agcgaactgt ccggtccaag ctggcgtgtg ctttgggcgg taaagggagg 2024821 aaatcccgtg aaagtccgtc tcgatccatc gagatgcgtg ggtcatgcgc agtgctatgc 2024881 cgtcgatccg gacctgttcc cgatcgacga ctcgggcaac tcgatcctgg cagagcacga 2024941 ggtgcggccc gaggacatgc agctgaccag agacggtgtg gccgcttgcc ccgaaatggc 2025001 gctcatcctc gaggaggacg acgcggactg acgattccgg gtcataccac aaaattaacg 2025061 ctggccaaac gatcgtttac gaggaatgaa tatttggcgt catcggcgct ggaggccggt 2025121 attgcaatct aatgtgtttt ctatgcaaca gttgcgcagc gacgccgtta tcgactagcg 2025181 gtgctatatt cggcgccttt tcgatgccga gcgcgcgtct cgttggccac gtttggtggc 2025241 aatgctcatc agggctcatc cggatcgcca acgcgatcgt gtgtggagag ggaggactgg 2025301 ttggacttcg gggcgttacc gccggagatc aattcgggcc gtatgtattg cggtccgggg 2025361 tcggggccga tgctggctgc ggccgcggcc tgggacgggg tggccgtgga gttggggttg 2025421 gctgcgaccg gttatgcgtc ggtgatagcc gagctgaccg gtgcgccgtg ggtgggtgcg 2025481 gcgtcgttgt cgatggtggc ggcggccacg ccgtatgtgg cctggctgag ccaagccgcg 2025541 gcgcgggccg agcaggcggg gatgcaggcc gcggcggccg cggcggctta tgaggccgct 2025601 tttgtgatga cggtgccgcc gccggtgatt acggcgaatc gggttttggt gatgacgctg 2025661 attgcgacca attttttcgg tcagaactcg gcggcgatcg cggtcgctga ggcgcagtac 2025721 gccgaaatgt gggcgcaaga cgccgttgct atgtatggct atgcggctgc gtcggcgagc 2025781 gcgtcgcggt tgattccgtt cgcggcgccg ccgaagacca ccaactccgc tggggtggtc 2025841 gcacaggtgg ctgcggtcgc ggcgatgcct ggactgctgc aacgactttc gtcggctgca 2025901 tcggtcagct ggtcgaatcc caatgattgg tggctcgtgc ggttgctggg ctcgattacc 2025961 cccacggaaa ggacgacgat cgttcgtttg ctcggtcagt cgtacttcgc gacgggcatg 2026021 gcgcagttct tcgcctcgat cgcacagcag ctgaccttcg gcccaggggg cacaacggct 2026081 ggctccggcg gagcctggta cccaacgccg caattcgccg gcctgggtgc aagccgggcg 2026141 gtgtcggcga gtttggcgcg ggccaacaag attggggctc tgtcggttcc gccgagctgg 2026201 gtcaaaacga ctgcactgac cgaaagcccg gtcgcccacg cggtgagcgc caaccctacc 2026261 gtcggttcgt cacacggacc gcatggcctg ctccgcggac tgccgctagg gtcgcggatc 2026321 actcggcgta gcggcgcctt tgcccaccga tatgggttcc gtcacagtgt ggttgcccgc 2026381 ccgccatcgg ccggataacg ccatgacctc agctcggcag aaatgacaat gctcccaaag 2026441 gcgtgagcac ccgaagacaa ctaagcagga gatcgcatgt cgtttgtgac tacccaacca 2026501 gaagcactgg cggcggcggc cggcagtctg cagggaatcg gctccgcatt gaacgcccag 2026561 aatgcggctg cggcgactcc cacgacgggg gtggtcccgg cggccgccga tgaagtgtcg 2026621 gcgctgacgg cggctcagtt cgcggcacac gcccagatct atcaggccgt cagcgcccag 2026681 gccgcggcga ttcacgagat gttcgtcaac actctacaga tgagctcagg gtcgtatgct 2026741 gctaccgagg ccgccaacgc ggccgcggcc ggctagagga gtcactgcga tggattttgg 2026801 ggcgttgccg ccggaggtca attcggtgcg gatgtatgcc ggtcctggct cggcaccaat 2026861 ggtcgctgcg gcgtcggcct ggaacgggtt ggccgcggag ctgagttcgg cggccaccgg 2026921 ttatgagacg gtgatcactc agctcagcag tgaggggtgg ctaggtccgg cgtcagcggc 2026981 gatggccgag gcagttgcgc cgtatgtggc gtggatgagt gccgctgcgg cgcaagccga 2027041 gcaggcggcc acacaggcca gggccgccgc ggccgctttt gaggcggcgt ttgccgcgac 2027101 ggtgcctccg ccgttgatcg cggccaaccg ggcttcgttg atgcagctga tctcgacgaa 2027161 tgtctttggt cagaacacct cggcgatcgc ggccgccgaa gctcagtacg gcgagatgtg 2027221 ggcccaagac tccgcggcga tgtatgccta cgcgggcagt tcggcgagcg cctcggcggt 2027281 cacgccgttt agcacgccgc cgcagattgc caacccgacc gctcagggta cgcaggccgc 2027341 ggccgtggcc accgccgccg gtaccgccca gtcgacgctg acggagatga tcaccgggct 2027401 acccaacgcg ctgcaaagcc tcacctcacc tctgttgcag tcgtctaacg gtccgctgtc 2027461 gtggctgtgg cagatcttgt tcggcacgcc caatttcccc acctcaattt cggcactgct 2027521 gaccgacctg cagccctacg cgagcttctt ctataacacc gagggcctgc cgtacttcag 2027581 catcggcatg ggcaacaact tcattcagtc ggccaagacc ctgggattga tcggctcggc 2027641 ggcaccggct gcggtcgcgg ctgctgggga tgccgccaag ggcttgcctg gactgggcgg 2027701 gatgctcggt ggcgggccgg tggcggcggg tctgggcaat gcggcttcgg ttggcaagct 2027761 gtcggtgccg ccggtgtgga gtggaccgtt gcccgggtcg gtgactccgg gggctgctcc 2027821 gctaccggtg agtacggtca gtgccgcccc ggaggcggcg cccggaagcc tgttgggcgg 2027881 cctgccgcta gctggtgcgg gcggggccgg cgcgggtcca cgctacggat tccgtcccac 2027941 cgtcatggct cgcccaccct tcgccggata gtcgctgccg caacgtatta acgcgccggc 2028001 ctcggctggt gtggtccgct gcgggtggca attggtcggc gccgagatct cggtgggtta 2028061 tttgcggtgg gattttttcc cgaagccggg ttcagcaccg gatttcctaa cggtcccgcg 2028121 actcaacggc accgcgccgt cagcaagttc cggtggtgtt gatcgcggta tccatgcagg 2028181 tggtgatggc gcggcgagac tggtcgtgtg cgctgaagca cagggtactt ggcggttgtg 2028241 gctcccggga tgtagctggc cgcccaacgt cccgcagcgt cggggtcagc ggcggagcag 2028301 cacggcgatt tagcctcaca accgagcagc tagctcgcgt ttcccagcgg ctcaatcccc 2028361 gtcgagccat tgaaaggcac ctcagatgtc gtttgcgact ccgcaaccgg agaaagggtt 2028421 cggaatggac ttcggggcgt taccgccgga gatcaattcg ggccgtatgt attgcggtcc 2028481 ggggtcgggg ccgatgctgg ctgcggccgc ggcctgggac ggggtggccg tggagttggg 2028541 gttggctgcg accggttatg cgtcggtgat agccgagctg accggtgcgc cgtgggtggg 2028601 tgcggcgtcg ttgtcgatgg tggcggcggc cacgccgtat gtggcctggc tgagccaagc 2028661 cgcggcgcgg gccgagcagg cggggatgca ggccgcggcg gccgcggcgg cttatgaggc 2028721 cgcttttgtg atgacggtgc cgccgccggt gattacggcg aatcgggttt tggtgatgac 2028781 gctgattgcg accaattttt tcggtcagaa ctcggcggcg atcgcggtcg ctgaggcgca 2028841 gtacgccgaa atgtgggcgc aagacgccgt tgctatgtat ggctatgcgg ctgcgtcggc 2028901 gagcgcgtcg cggttgattc cgttcgcggc gccgccgaag accaccaact ccgctggggt 2028961 ggtcgcacag gcggttgcgt cggtcagctg gccgaatccc aatgattggt ggctcgtgcg 2029021 gttgctgggc tcgattaccc ccacggaaag gacgacgatc gttcgtttgc tcggtcagtc 2029081 gtacttggcg acgggcatgg cgcggtttct tacctcgatc gcacagcagc tgaccttcgg 2029141 cccagggggc acaacggctg gctccggcgg agcctggtac ccaacgccac aattcgccgg 2029201 cctgggtgca ggcccggcgg tgtcggcgag tttggcgcgg gcggagccgg tcgggaggtt 2029261 gtcggtgccg ccaagttggg ccgtcgcggc tccggccttc gcggagaagc ctgaggcggg 2029321 cacgccgatg tccgtcatcg gcgaagcgtc cagctgcggt cagggaggcc tgcttcgagg 2029381 cataccgctg gcgagagcgg ggcggcgtac gggcgccttc gctcaccgat acgggttccg 2029441 ccacagcgtg attacccggt ctccgtcggc gggatagctt tcgatccggt ctgcgcggcc 2029501 gccggaaatg ctgcagatag cgatcgaccg cgccggtcgg taaacgccgc acacggcact 2029561 atcaatgcgc acggcgggcg ttgatgccaa attgaccgtc ccgacggggc tttatctgcg 2029621 gcaagatttc atccccagcc cggtcggtgg gccgataaat acgctggtca gcgcgactct 2029681 tccggctgaa ttcgatgctc tgggcgcccg ctcgacgccg agtatctcga gtgggccgca 2029741 aacccggtca aacgctgtta ctgtggcgtt accacaggtg aatttgcggt gccaactggt 2029801 gaacacttgc gaacgggtgg catcgaaatc aacttgttgc gttgcagtga tctactctct 2029861 tgcagagagc cgttgctggg attaattggg agaggaagac agcatgtcgt tcgtgaccac 2029921 acagccggaa gccctggcag ctgcggcggc gaacctacag ggtattggca cgacaatgaa 2029981 cgcccagaac gcggccgcgg ctgctccaac caccggagta gtgcccgcag ccgccgatga 2030041 agtatcagcg ctgaccgcgg ctcagtttgc tgcgcacgcg cagatgtacc aaacggtcag 2030101 cgcccaggcc gcggccattc acgaaatgtt cgtgaacacg ctggtggcca gttctggctc 2030161 atacgcggcc accgaggcgg ccaacgcagc cgctgccggc tgaacgggct cgcacgaacc 2030221 tgctgaagga gagggggaac atccggagtt ctcgggtcag gggttgcgcc agcgcccagc 2030281 cgattcagct atcggcgtcc ataacagcag acgatctagg cattcagtac taaggagaca 2030341 ggcaacatgg cctcacgttt tatgacggat ccgcatgcga tgcgggacat ggcgggccgt 2030401 tttgaggtgc acgcccagac ggtggaggac gaggctcgcc ggatgtgggc gtccgcgcaa 2030461 aacatttccg gtgcgggctg gagtggcatg gccgaggcga cctcgctaga caccatgacc 2030521 tagatgaatc aggcgtttcg caacatcgtg aacatgctgc acggggtgcg tgacgggctg 2030581 gttcgcgacg ccaacaacta cgaacagcaa gagcaggcct cccagcagat cctgagcagc 2030641 tagcgccgaa agccacagct gcgtacgctt tctcacatta ggagaacacc aatatgacga 2030701 ttaattacca gttcggggac gtcgacgctc atggcgccat gatccgcgct caggcggcgt 2030761 cgcttgaggc ggagcatcag gccatcgttc gtgatgtgtt ggccgcgggt gacttttggg 2030821 gcggcgccgg ttcggtggct tgccaggagt tcattaccca gttgggccgt aacttccagg 2030881 tgatctacga gcaggccaac gcccacgggc agaaggtgca ggctgccggc aacaacatgg 2030941 cgcaaaccga cagcgccgtc ggctccagct gggcctaaaa ctgaacttca gtcgcggcag 2031001 cacaccaacc agccggtgtg ctgctgtgtc ctgcagttaa ctagcactcg accgctgagg 2031061 tagcgatgga tcaacagagt acccgcaccg acatcaccgt caacgtcgac ggcttctgga 2031121 tgcttcaggc gctactggat atccgccacg ttgcgcctga gttacgttgc cggccgtacg 2031181 tctccaccga ttccaatgac tggctaaacg agcacccggg gatggcggtc atgcgcgagc 2031241 agggcattgt cgtcaacgac gcggtcaacg aacaggtcgc tgcccggatg aaggtgcttg 2031301 ccgcacctga tcttgaagtc gtcgccctgc tgtcacgcgg caagttgctg tacggggtca 2031361 tagacgacga gaaccagccg ccgggttcgc gtgacatccc tgacaatgag ttccgggtgg 2031421 tgttggcccg gcgaggccag cactgggtgt cggcggtacg ggttggcaat gacatcaccg 2031481 tcgatgacgt gacggtctcg gatagcgcct cgatcgccgc actggtaatg gacggtctgg 2031541 agtcgattca ccacgccgac ccagccgcga tcaacgcggt caacgtgcca atggaggaga 2031601 tgctagaggc aacgaagtcg tggcaggaat cggggtttaa cgtcttctcc ggcggagatc 2031661 tgcgccgaat gggcatcagt gccgcgacgg tggccgcgct ggggcaggcg ttgtcggatc 2031721 ccgcggccga ggtcgcagtg tatgcgcgac agtaccgaga cgacgccaag ggccccagcg 2031781 cctcggtgtt gtcgctgaaa gacggctccg gtggacgcat cgcgctgtat cagcaggcgc 2031841 gaacggcagg ttccggcgag gcgtggctgg ctatctgccc ggctaccccg cagttggtgc 2031901 aagtaggagt gaagaccgtt ttggatacac tgccctacgg cgagtggaaa acacacagca 2031961 gagtatgacg ccagggcgtg aaacccgaag tacaacaaca aatttgagca tcagatacaa 2032021 cccagatacg tacagggcaa attgctctag aatcgactgc aatactgcaa ggcaaggtca 2032081 accacaacga tttggtcgcg aggcaaggca aatgaaatcg gagttagtcg agccgcagct 2032141 cccggtgggc taccgcgcct cggtgcctac accgacggag ctccccgcgc cactgaagcc 2032201 acggtgtaac acgtttgcca tggcaggggg tacaggacga tgaccgcagt agctgacgca 2032261 cctcaggctg acattgaggg tgtggcatcg ccccaggctg tcgtcgtggg cgtcatggcc 2032321 ggcgaaggcg tccagatcgg cgtcctgctg gatgccaacg ccccagtttc ggtgatgacc 2032381 gacccgctgc tgaaagtggt taatagtcgg ctcagagagc tcggtgaggc tccactggaa 2032441 gccactggac gcggccgatg ggcgctgtgt ctggtggacg gcgcgccgtt gcgtgctacc 2032501 cagtcgctga ccgaacaaga cgtctatgac ggcgaccggc tgtggattcg gttcatcgca 2032561 gacaccgaac gtcgctccca agtcatcgaa catatctcca ccgcagtcgc ctcggatctc 2032621 agcaagcggt tcgccaggat cgacccgatc gttgctgtgc aggtcggggc gtcgatggtg 2032681 gcgaccgggg ttgttcttgc caccggggtg ctcggctggt ggcgctggca tcacaacacc 2032741 tggttgacca ccatctacac cgcggtgatt ggtgtgctgg tgctggcggt cgccatgttg 2032801 ctgttgatgc gtgccaagac ggacgcggat cgacgcgtcg ccgacatcat gctgatgagc 2032861 gcgatcatgc ccgtgacggt ggcggcggca gcggccccgc ccggcccggt gggctccccg 2032921 caggccgtgt tgggcttcgg agtgctgacc gtcgctgcgg ccctggccct gcggttcacc 2032981 ggtcgccgcc tggggattta caccacaatc gtcatcatcg gtgcgctgac aatgcttgca 2033041 gccttggcgc ggatggtcgc ggccacaagc gcggtgacgc tgttgtcgtc cttgttgttg 2033101 atttgcgtag tggcctacca cgcggcgccg gcactgtctc ggcggctggc cggcatccga 2033161 ctgccggtgt tcccgtccgc caccagccgg tgggtcttcg aggctcggcc cgacctaccg 2033221 accaccgtgg tggtgtccgg tggcagcgca ccggtcttgg aagggccgtc atcggtgcgt 2033281 gatgtgctgc tgcaagctga gcgcgctcgg tcgttcttga gcggcctgct aacgggactt 2033341 ggcgtgatgg tggtggtgtg catgacatcg ttgtgcgacc cgcacaccgg gcaacgttgg 2033401 ctgccgctga tactggccgg atttacctcg ggcttcctgc tgttgcgggg ccgctcctac 2033461 gtcgaccgtt ggcagtcgat taccctggcc ggaactgcgg tgatcatcgc tgctgcggtg 2033521 tgtgtgcggt acgcgctgga attgtcctcg ccgttggctg tgtccattgt cgccgcgatc 2033581 ctggtgctgc tgccggcggc gggcatggca gctgctgcac atgtgcccca caccatctac 2033641 agtccgctat tccgcaagtt tgtggaatgg attgaatacc tctgcctgat gccgatcttc 2033701 ccgctggcgt tgtggttgat gaacgtctat gcagcgattc ggtaccggta gcagcaggtc 2033761 gtggtgtggt cgcgcgggta ccgcgaccat tgccgcagtc ttgctagctt cgggcgcgct 2033821 gaccggcctt ccgccagcgt atgcaatttc gcctccgacg atcgatccgg gcgcgctgcc 2033881 acccgacggg ccgcccggac cgctggcgcc catgaagcag aacgcctact gcaccgaggt 2033941 cggggtcttg cccggcaccg actttcagct gcagccaaaa tatatggaga tgctgaacct 2034001 gaacgaggct tggcagttcg gccgcggcga cggtgtgaag gtcgctgtca tcgacacggg 2034061 tgtgactcca catccccggt tgccgcgtct gatccctggc ggcgactacg tgatggccgg 2034121 tggcgacggt ctgtcggact gcgacgccca cggcaccctg gtggcgtcga tgatcgcggc 2034181 ggttccggcg aacggggcgg taccgctgcc gtcggtaccg cgcaggccgg tcaccattcc 2034241 cacgaccgaa acgccgccgc cgccacagac ggtgaccctt tcaccggtac cgccgcagac 2034301 cgtgaccgtg attccggctc cacctcccga ggaaggagtt ccgccgggcg caccggtgcc 2034361 aggaccggag ccgccgccgg ctcctggtcc acagccgccg gccgtggacc gcggtggcgg 2034421 cacggtgaca gtacccagct actccggggg ccgcaagata gccccgatcg acaacccgcg 2034481 taatccgcac ccgagtgcgc catcgccagc gctgggacca ccgccggacg cgttcagtgg 2034541 gatcgccccc ggtgtcgaga taatctccat ccgccagtca agccaggcct tcggccttaa 2034601 ggacccttac actggggacg aagacccgca gacggcgcaa aagatcgaca acgtcgagac 2034661 aatggcgcgc gcgatcgtgc atgctgccaa catgggtgct tcggtgatca atatctccga 2034721 tgtgatgtgc atgagtgctc gtaatgtcat cgaccagcgt gcactgggtg ccgcggtgca 2034781 ctacgccgcg gtcgacaagg acgcggtcat cgtggctgca gcgggcgacg gcagcaagaa 2034841 ggactgtaag cagaacccga tttttgatcc cttgcagccc gacgatccac gcgcttggaa 2034901 cgcggtcacc acggtggtga caccctcgtg gttccacgac tacgtcctga cggtcggagc 2034961 ggttgacgcc aacggtcaac cgctcagcaa aatgagtatc gcgggaccct gggtctccat 2035021 ttcggcgccg ggaaccgacg tcgtcggact ctcgccccgt gacgacggcc tgatcaatgc 2035081 gattgacggc ccggataatt cgttgctggt tccggctggc accagttttt ccgccgcgat 2035141 cgtgtccggg gtggctgcgc tggtacgtgc taagttcccc gaattgtcgg cgtaccaaat 2035201 catcaatcgg ctgattcata ccgcccggcc acccgctcgc ggcgtcgaca accaggtcgg 2035261 ctacggtgtg gtcgacccag tggcagcact gacttgggat gtgcccaaag gcccggccga 2035321 gccgcccaag cagctgtcag cgccgttggt ggtgccgcag ccgcccgccc cccgcgatat 2035381 ggtgccgata tgggtggccg ccgggggatt ggccggggca ctattgatag gcggtgcggt 2035441 gttcggtacc gcgaccttga tgcggcgatc acggaagcag caatgaaggc tcagcgcagc 2035501 ttcgggttgg cgttgtcgtg gccgcgggtg accgcggtgt ttctggtgga tgtcctgatc 2035561 ttggcggtgg ccagtcattg cccggattcc tggcaggccg atcatcatgt ggcgtggtgg 2035621 gtcggcgtcg gcgtggcggc cgtagtgacg ttactgtcgg tggtcagtta ccacggcatc 2035681 acggtgattt cgggtttggc gacgtgggtg cgggattggt cggcggatcc gggcacgaca 2035741 ctgggtgcgg ggtgcactcc ggcaatcgac caccagcgcc gttttgggcg tgacacggta 2035801 ggggtgcgtg agtataacgg ccggctggtc tcggtgatcg aggtcacctg cggtgagagc 2035861 ggcccgtcgg gtcggcattg gcaccggaaa tcgccggtac ccatgttgcc ggtggtcgcg 2035921 gtcgccgatg gtttgcgcca gttcgacatt cacctcgatg gcatcgacat cgtgtcggtg 2035981 ctggtgcggg gcggggttga tgctgctaaa gcttcggcct cgctgcagga gtgggagccg 2036041 cagggctgga aatccgaaga acgagccggt gatcgcactg tcgccgatcg gcgccgcacc 2036101 tggttggtgt tacggatgaa tccgcagcga aatgtggctg cggtggcgtg tcgtgactcg 2036161 ttggcgtcga cgctggtggc agccaccgag cggttggtcc aggatctgga tgggcaaagt 2036221 tgtgcggccc ggccggtgac ggccgatgag ctgaccgagg tcgacagcgc cgtgttggct 2036281 gacttggaac cgacatggag tcgccccggt tggcgtcacc tcaagcattt caatggttat 2036341 gcgaccagtt tttgggttac gccgtcagac atcacgtcgg agaccttgga tgagctgtgt 2036401 ctgccagata gccccgaagt cgggacgacc gtggtcacgg tgcgtctgac cactcgggtc 2036461 gggtcgcccg cgctatcggc atgggtgcgt tatcacagcg acacgcgcct gcccaaggag 2036521 gtagcggccg gactcaaccg gctcaccggt cgccagttgg ccgcggtgcg tgccagcctg 2036581 ccggccccga cgcaccgtcc actcctggtc atccccagtc ggaacctgcg tgaccacgac 2036641 gagctcgtgc tgccggtggg ccaggaactc gagcacgcga caagctcgtt tgtggggcaa 2036701 tgacacgccc gcaggccgcc gccgaagatg cccgcaacgc catggtcgcc ggtctgctgg 2036761 catcggggat ctccgtcaat ggactgcagc ccagccataa cccgcaggtg gccgcccaaa 2036821 tgttcaccac ggcgaccagg ctggatccca agatgtgtga tgcctggctg gctcggctgc 2036881 tggccggcga ccagagcatc gaagtgctcg ccggcgcatg ggctgcggtg cggactttcg 2036941 gctgggaaac ccgccgcctc ggcgtgacgg atctgcagtt ccgccccgag gtgtccgacg 2037001 ggctattcct gcgactggcg attaccagcg tagattcgct ggcctgcgct tacgcggcgg 2037061 tcctcgccga ggccaagcgt taccaggagg cggcagagct gctcgacgcc accgatcctc 2037121 gccatccgtt cgacgccgag ctggtgagtt acgtgcgggg cgtgctgtac ttccgcacca 2037181 aacgctggcc tgacgttctt gcgcagttcc ccgaggcaac gcagtggcgt caccccgagc 2037241 taaaggccgc gggggcggcg atggccacca cggcgctggc gtcgctcggg gtgttcgaag 2037301 aggcctttcg gcgcgctcag gaagcaatcg aaggtgaccg ggtgccgggc gcggctaaca 2037361 tcgccttgta cacccaaggc atgtgcctgc ggcacgtcgg ccgtgaggag gaagctgtcg 2037421 aactcctgcg ccgcgtgtat tcgcgcgatg cgaagttcac cccggcccgc gaggcgctgg 2037481 ataaccccaa ctttcggctg atcctcaccg acccggaaac gattgaggcg cgcacagatc 2037541 cgtgggatcc ggacagtgcg ccaacccgcg ctcagaccga ggccgcccgc catgccgaga 2037601 tggccgcgaa gtacttggcc gaaggggatg ccgagctcaa cgcgatgctt ggcatggagc 2037661 aggccaagaa ggagatcaag ctcatcaagt cgacgacgaa ggtgaattta gcgcgtgcca 2037721 agatggggct tccggtcccg gttacgtcgc gccacacctt gttgctcggg ccgcccggta 2037781 ccgggaagac ttcggtcgca agggctttca ccaagcagct gtgcgggttg acagtgctgc 2037841 gcaagccgct ggtggtggag accagccgca ccaagctgtt gggccggtac atggccgacg 2037901 ccgagaagaa caccgaggag atgctcgaag gggcgttggg cggtgcggtc ttctttgacg 2037961 agatgcacac tctgcatgag aagggctact cccagggcga cccgtacggt aacgcgatca 2038021 tcaacacgct gctgttgtac atggaaaatc accgtgacga gctggtggtg tttggtgcgg 2038081 gttacgccaa agcgatggag aaaatgctcg aggtgaatca gggtctgcgc cggcgctttt 2038141 cgacggtgat cgagttcttc agctacaccc cgcaggagct gatcgcactg acccagctga 2038201 tgggtcggga gaacgaagac gtgatcactg aggaagagtc tcaagtgttg ttgccgtcgt 2038261 ataccaagtt ctacatggag cagagctact ccgaggacgg cgacctgatc cgcgggatcg 2038321 atctgttggg caatgccggc tttgtgcgca acgtggtgga gaaggcccgc gaccaccgta 2038381 gtttccgttt ggacgatgag gatctcgacg ccgtactggc cagcgatctc accgaattca 2038441 gcgaggatca gctgcgccga ttcaaggagt tgactcgcga ggacctggcc gaagggctgc 2038501 gcgctgcggt cgcggagaag aagacgaagt aggcactctt ttcgtcggtg tcactggcta 2038561 ctttgacctg aacagtcggc ggtgggtgag tggtctgtgg ttggcgaatg aggcggggcg 2038621 gggcggagac tggtccagat ggtgtccgtg cacgcggggg agggtgtggt gttcagccgc 2038681 tcagggcggg gtacgtgccg tctcaatccg tgctgtgtcc aaattgttta caattaacgg 2038741 tggtgccaca ccttaaattc caaatgtaaa tatatttgac gtcggtcaaa aatcccacgt 2038801 ttggcacaag tatcggtggc gcgttgccaa gtcattaggc aatcgagcgg actcccgggc 2038861 atggaaatgc gtgtctttcg tttgtgggtg tccggtatcc agacagcatc gcttgcgcct 2038921 cgactacagg tttgctacta aaattcctat gcgccatagt gattgagaag ggccacgccc 2038981 ccttcgtgtg acgcacggcg ggcgacggcg gcgccgtgcc cggcattggt tgggtgtcaa 2039041 tgaggcttca aggatatcta ccaaatttcc cagaaatatt tcacggaggc cgcaatggag 2039101 ctagcattta atcggcgtac ggtcaggcca atatatcgaa acatgagagg aatgatcgat 2039161 gagcgtcaag agtaagaacg gtcgtctcgc cgctcgggta ctggtggcac tggcggccct 2039221 gtttgcgatg atcgcgctga cgggctcagc atgtctggca gagggtcccc cgcttggccg 2039281 caaccctcag ggggcaccgg ctccggtggg tggcactgtg atcgtcgcgc cgatgcacag 2039341 cggcgtctga ccgccccgtt cgggatctgt acgcactttc atccgactgc gcggttgttt 2039401 gttagcgcat cggatgaaag tgtgccgtct cggctgagga aggaccgtcg cgatgctgcc 2039461 gaatttcgcg gtgctgcccc ccgaggtcaa ttcggcgagg gtgttcgccg gtgcggggtc 2039521 ggcgccgatg ttagcggcag cggccgcctg ggatgatcta gcctccgagc tgcattgtgc 2039581 tgcaatgtca ttcgggtcgg ttacgtcggg attggtggtt gggtggtggc agggatcggc 2039641 gtcggcggcg atggtggacg cagccgcgtc gtacatcggg tggctgagca cgtcggctgc 2039701 ccacgccgag ggcgcggccg gtctggctcg ggccgcggta tcggtgttcg aggaggcgct 2039761 ggccgcgacg gtgcatccgg cgatggttgc ggcaaatcgc gcccaggtgg cgtcgctggt 2039821 agcgtcgaac ttgtttgggc agaacgcgcc tgcgatcgcc gcgctcgaat ccttgtatga 2039881 gtgtatgtgg gcccaggatg cagcggccat ggcgggttat tacgttgggg cttcggcggt 2039941 ggccacacag ttggcatcgt ggctgcaacg gctacagagc atccccggcg ccgccagtct 2040001 tgatgcccgt ctgccgagct cggccgaggc accgatggga gtcgtccgcg cggtcaacag 2040061 cgcgatcgcc gccaatgcgg ctgcggcaca aaccgttggc ctggtcatgg gaggcagcgg 2040121 cacgccaata ccgtcggcca gatatgtcga gctcgcgaac gcgctgtaca tgagtggcag 2040181 cgtcccgggt gttatcgcgc aggcgctctt cacgccccaa gggctctacc cggtggtcgt 2040241 gatcaagaac ctcactttcg attcctcggt ggcgcagggt gccgtcattc tcgaaagtgc 2040301 gattcggcag caaattgccg ccggcaacaa cgtcaccgtc ttcggctact cgcagagcgc 2040361 cacgatctcg tcactagtga tggccaatct tgcggcttcg gccgacccgc cgtctccaga 2040421 cgagctttcc ttcacgctga tcggcaatcc caacaacccc aatggcgggg ttgccaccag 2040481 gttcccgggg atctcctttc caagcttggg cgtgacggcc accggggcca ctccgcacaa 2040541 tctgtacccg accaagatct acaccatcga atacgacggc gtcgccgact ttccgcggta 2040601 cccgctcaac tttgtgtcga ccctcaacgc cattgccggc acctactacg tgcactccaa 2040661 ctacttcatc ctgacgccgg aacaaattga cgcagcggtt ccgctgacca atacggtcgg 2040721 tcccacgatg acccagtact acatcattcg cacggagaac ctgccgctgc tagagccact 2040781 gcgatcggtg ccgatcgtgg ggaacccact ggcgaacctg gttcaaccaa acttgaaggt 2040841 gattgttaac ctgggctacg gcgacccggc ctatggttat tcgacctcgc cgcccaatgt 2040901 tgcgactccg ttcgggttgt tcccagaggt cagcccggtc gtcatcgccg acgctctcgt 2040961 cgccgggacc cagcagggaa tcggcgattt cgcctacgac gtcagccacc tcgaactgcc 2041021 gttgccggca gacgggtcga cgatgccaag caccgcaccg ggctcgggta cgccggtccc 2041081 cccgctctcg atcgacagcc tgatagacga cctgcaggtg gctaaccgca acctcgccaa 2041141 cacgatttcg aaggtggccg cgacgagcta cgcgacggtg ctcccaaccg ccgacatcgc 2041201 caatgcggcg ttgacgatcg tgccgtcgta caacatccac ctttttttgg agggcatcca 2041261 gcaagcgctc aagggcgacc cgatgggact cgtcaacgcg gtcggatacc cactcgcggc 2041321 cgacgtggca ctgttcacgg ccgcaggcgg tcttcagctc ttgatcatca tcagcgcggg 2041381 ccgaacgatt gccaatgaca tctcggccat tgtcccctga tcgtgttttg cgtgaacttt 2041441 aaagcgttgt gctgaggtat gttccgctcg cgtgtggggc ggcccgcgcg accacctatg 2041501 catgagcgcc aatggtcgag acaactacct gcgcggtcat cgggcggcca cccagagggc 2041561 atggttctcg ggctgctact ggctcgcgtg cttccatcga gcgtgaatac atgccgccaa 2041621 atcggcagtc ggcgccgctg gcgtgccgct agctgatcac aaagcgccga taccgatgcg 2041681 gctggccata gcaatgccaa tgttggcgaa tagatctcac gcgcggccca agccaacagc 2041741 gaggtgatgg tgatcattct ttacgttgcg attacctcgc cggaacgtga cacgagcaat 2041801 actcgccaac catgatcgcc agatatttgg aacgggtttg ggtccagcgg ccgccaaaaa 2041861 ccgactcgcc gccgtccctg acaactcagc ggcgagaggt gaacacgggt gatttgtcac 2041921 tacgggccgc tgcggttcct gcgctgccag ggggccgcga gtgcgattcc ggcgagccac 2041981 gcgattaggg attaagcgaa atggatttcg ggttgttacc gccggagatc aactcaggca 2042041 ggatgtatac ggggccgggg ccggggccca tgctggccgc cgcgacagcc tgggacgggc 2042101 tggctgttga gctgcacgca acagcggctg gctacgcctc ggagctatcg gctttgaccg 2042161 gggcatggag cggtccttcg tcgacgtcca tggcatctgc agccgcaccc tatgtggcat 2042221 ggatgagcgc caccgcagtg catgccgagc tggcgggcgc gcaagccagg ttggcgatag 2042281 ctgcctatga agctgcgttc gctgccaccg tgcctccgcc ggtgatcgcc gctaatcgtg 2042341 cccaactgat ggtgttgatc gcgacgaaca tcttcgggca gaacacgccg gcgatcatga 2042401 tgactgaggc ccaatacatg gaaatgtggg cgcaggatgc cgccgcgatg tacgggtacg 2042461 ccggctcgtc agcgaccgcc tcgcgaatga cagcgttcac tgagccgccg caaaccacta 2042521 accatggtca gttgggggcc cagtcctccg ccgtcgcaca aaccgccgcc accgcggccg 2042581 gcggcaacct gcaatcggca ttcccgcagc tgctctccgc ggttccccgc gccctgcaag 2042641 gcctggcatt gccgaccgca tcacagtcgg catcggcgac gccgcagtgg gttaccgacc 2042701 tggggaacct gtccaccttc ctgggcgggg cggtcaccgg cccgtacacc tttcccgggg 2042761 tattgcctcc ctccggggtg ccatacctgt taggcattca gagcgtcttg gtaacccaaa 2042821 acgggcaggg ggtaagcgcc ttgcttggca agatcggggg gaaaccaatc accggagcgt 2042881 tggctccgct ggccgaattt gctttgcata caccaatttt gggttcggag ggcttgggtg 2042941 gtggatcggt ttccgcgggt attggccggg caggcttggt cggaaagcta tcggtgcctc 2043001 agggctggac ggtggccgcc ccggagatcc catcgccggc ggcggcgttg caggcgacgc 2043061 gcctggccgc cgcgccgatt gcggccaccg acggcgcggg tgcgttgctc ggtggcatgg 2043121 cgctgtcggg cttggctggc cgcgctgccg ccggttctac cggccacccc atcggcagcg 2043181 ccgcagcacc cgccgtcggt gccgctgccg ctgccgtcga ggacctggcc accgaagcca 2043241 acatcttcgt gataccggcc atggacgact agcgccatgt cacgggagag aaggttgtcg 2043301 acacttttgc gaccagcgcc ggttcggtat gtggccaccg gggctgccaa tggggttacg 2043361 gcccgttaag gagggatgcg gtaatggatt tcggggtgtt accaccggag atcaattccg 2043421 ggcgcatgta tgccggtccc gggtcgggtc cgatgctggc cgcggcagcg gcctgggacg 2043481 ggctggccac cgaattacag tccacggcgg ccgactatgg ctcggtgatc tcggttctga 2043541 ccggcgtgtg gtcgggacag tcgtcgggga ccatggcggc tgcggccgca ccgtatgtgg 2043601 cgtggatgtc ggccacggcg gcgctcgctc gggaagcggc cgcccaggcc agcgcggcag 2043661 cggcggccta cgaggcagcg tttgcagcca cggtgccgcc gccggtcgtc gcggccaacc 2043721 gcgccgagct ggcggtgttg gcggcgacca acattttcgg tcagaacacc ggtgcgatcg 2043781 cggccgccga agcccgctat gcggaaatgt gggcgcaaga cgcagccgcg atgtatggct 2043841 atgccggctc gtcgtcggtg gcgacccagg tgacgccatt tgctgcaccg ccgccgacca 2043901 ccaacgcggc cggactggcc acccaaggcg ttgcggttgc ccaggctgtc ggcgcgtcgg 2043961 ccggcaacgc gcgctcactg gtgtccgagg tgctggaatt cctggcaacg gccgggacga 2044021 actacaacaa gacggtggcc agcctgatga acgcggtcac cggggtgccg tacgcatctt 2044081 cggtgtataa cagcatgctc gggcttggct tcgctgagtc aaaaatggtc ctgccggcta 2044141 acgacaccgt aatatcgacc atcttcggca tggtgcagtt ccagaagttc ttcaatccgg 2044201 tgacgccctt caatcccgat ttgatcccga aatctgctct aggggccggg cttggcctgc 2044261 ggtctgcgat ctcgagtggt ctgggctcga ccgcgccagc gatatcggcg ggtgcgagcc 2044321 aggccggctc ggtcgggggg atgtcggtgc cgccgagctg ggcagcggcc accccggcga 2044381 tccggacggt tgccgctgtg ttctcgagca ccggacttca ggctgtcccg gcggccgcaa 2044441 ttagcgaggg cagtctgctc agccagatgg ccctggcgag tgtggccgga ggggcccttg 2044501 gcggcgccgc tgcacgcgcc actggtggtt tcctcggcgg aggccgagtc accgcggtca 2044561 agaaatctct caaggacagc gactcaccgg acaagctgcg gcgggtggtc gcgcacatga 2044621 tggagaagcc cgaatcggtg cagcactggc acaccgacga ggacgggctc gatgatctac 2044681 tcgcggaatt gaagaagaaa ccgggcatcc acgccgtgca catggccggc ggcaacaagg 2044741 ctgaaattgc accgacgata tcagaatcgg gctagggcag ggttagggcg tgtcttccaa 2044801 ttgataggcc ccgaggcaga cacgagtcgc cagaccgcac cattgcttga gttggttgat 2044861 gcccttgaga tcggaacccg aatcccacag caggagaatt agtttcgtcc ccagaccggc 2044921 ggctacggct gcccgttctg cccaggcaaa ccgatcaatc cgcccttgcc gccttggccc 2044981 ccgggtgcgg gtgggacacc atctccgccg tcgccaccgg taccgatcag cagggcggcg 2045041 ttaccaccgt caccgccggc accgccagtg cccgcactgc cgccggttcc gccggcccca 2045101 ccggtaccgc cacttccccc gggtccgccg ttgccgatcc ccaggccgct tgccccgcct 2045161 tggccaccat cgccgccgtt gccgccgtcg ttgcccgacc cgccgacgcc gccggccccg 2045221 ccgatgccgc cggccccgcc gctaccgaac agtaggcctc cgctgccgcc cgcgccaccg 2045281 tcgctgccgt cacctccggc ttcctgaata atgttgccta ctccggtccc accggtcgcc 2045341 ccattccctc cggccccgcc gttgccgatc agaatggcgg cgccaccctg gccggcgctg 2045401 ccttcgaagc cggtacctcc gccgctgccg gcgttaccac cgttgccgcc attgccgtat 2045461 agcaccccac cttggccgcc gtcaccccct gatgcaccaa attgaatgct gaggctgccg 2045521 gccccaacgt caccaccgtt gccaccgtca ccgccgtgac caaacaaccc aaaaccctgg 2045581 acactaatcg gtccgaagtt aacggcacct acccagccgc caccgccagc agagccgcca 2045641 aagcccgcaa atccgttggt gcctgcgtca ccgccctgac ccccgttgcc gccggacgcg 2045701 ccgctgccga acaaccaccc gccgttgccg ccgtcgccgc ctgagccacc gaccccgccg 2045761 ctcccgccga agatggtagt acccagcgca gacccggccg ctccattccc gccgttgcca 2045821 ccggccccgc cgttgccgac taggcccgcg tcaccgccgt taccggctat cccggccgaa 2045881 ccgccggaag cgccgttcat gacgcccgtt tgagtggagt cggtgccgcc gccggcgccg 2045941 ctgtccccac cttgcccccc gttgccgatc agccacccgc cctgggcgcc gttgcccccg 2046001 ttgcctccta atccgccgct gccggccgaa tctccggctg tgtcattagt gccttgtcct 2046061 cccatgccac cgaccgagcc gttgccgccg ttgccgaaca acagtccacc gcgaccaccc 2046121 gaacccccgt ttccgccggc gcccccatcg aagcctgccg acgcaccccc gggcgctgtg 2046181 gcgttgcccc cgtttccgcc ggccccacca tgtccaccgt gaccatagat ccatccaccg 2046241 gcgccgccgt tgccgccgga cccgcccgct ttcccgggtt gaccggctgc gccgttggca 2046301 cccgccccgc cctggccgcc attgccgccg ctaccccata gcccagccga cccgccgttg 2046361 ccgccatttt ggcccgtccc gccggacccg ccgttgccac cgttgccgta tagccatccg 2046421 ccatcgccac cgttttgccc ggttcctgcg accccgttgg caccgttgcc gataagcgga 2046481 cgcccggtca gcgtctgcac gggtgaattg atggcatcga gggcggtttg gcccacgatt 2046541 tgcaatggtg acgagttggc tgcctcggcg ctggcgtact gggccgcccc cgcgctcatg 2046601 agctggacga actgctcatg gaatgcgacc gcgtgggcac tgagctcctg gtatgcctgc 2046661 ccgtgcgttg cgaacagcgc cgcaatcgca gccgacacgt cgtcagcgcc cgcaggcagt 2046721 aacgccgtta tcgggaccaa cgcttcggcg ttggcccggc taatcgccga accaatagtt 2046781 gctaggtcct ttgccgctgc atcgacaaac gccggcgcca cgatcatctg cgacgtccac 2046841 acctcctggc cgttgtcgtc gcatggggaa tccatacgac cgccaaagga attttggaac 2046901 cgacgccaac gttacagttt tgcggacccg ctatggggtg cattcaccag attcactggc 2046961 aacgatgtga accccgtgtc accccaagcg gggtcaatcc actgattact ctctagccca 2047021 aactatttcg cgctgacgct ggttttagtg atctggtggg ggcaatagac atgcgcggag 2047081 atcgcagcga acttgcaaca accgtccatc gaaaacccgg gattgcgggt ccgcagctcg 2047141 ttgacgacct gaagacccga ttcgccgctt tcgactaacg cgcatacggc cttgcccgat 2047201 gctatggctt gatccgggtg gctgtaggta atgcctgccc gctctagcga ggcaagaaag 2047261 accgcgtcgt caccgctggg ccccgcgtgg gccggaaccg ccaagccgat catcaacgga 2047321 atgctgagta gcgttgacac aactctcata gacaacgatt ctcccggaat tgcgcttctc 2047381 ttgcggtgca accggttacc gcgtcattcc aatacgttac ggctgcgcta acttcccgtc 2047441 tcagggtgtt cgggttgcgc tggacctgaa ggtcgtctgc tgaccggcgt tgtctgctcg 2047501 ctggctaaca gccgatcttg atagcctccg gggcatcgga tgagtcaagc cgttgggttg 2047561 acgcgcgtcg ctacgagtgt cacgattacc cttgcaagca cctcgctagg tgaggcgtct 2047621 gcgcggatat aggccactga cctcgaacgt cgaaagacgc ccagggtcag gacagctctt 2047681 cccggcttaa gggttgagcc caagtggctt ccggctggac cggccggata cgccgtgtgg 2047741 tgccaaagct ctgacgagag gggtgccgag ttcggtggtc tgctgggctg tcatcccttt 2047801 gtgctgtgca tcggcatccc cgtgtgcccc ggccgtgagg aggtgagagc gaaatgagtc 2047861 ccggcgatag tccgtatccg agatcgacga ccgtttcgtt ccgatccgac cccggcgccg 2047921 ttttcgcact ctgaatcggc cttccggttc gaaatccgtt atttcgcaag ctcgttgctt 2047981 cgcggccttg tgtgagtgac gttcacggga agtagccacg acagaagcgg tcataggcct 2048041 ccgggttcgg tcgtctgtca ggagaagacc catggcgttt gttcttgtct gtccagatgc 2048101 gctggccatc gcggccggtc agttgcgcca tgttggatcg gtgatagccg cgcggaatgc 2048161 ggtcgcggca ccggcaactg ccgaattggc cccggcggcc gctgacgaag tatcagcttt 2048221 gactgcaaca caattcaact tccatgccgc catgtaccaa gcggtcggcg cccaggcgat 2048281 cgccatgaat gaggcgttcg tcgcgatgtt gggcgccagc gcggattctt acgcggctac 2048341 cgaagccgcc aacatcattg ctgtgagcta acgaggagat caacgatgac tgccgcactt 2048401 gacttcgcca cgctaccgcc cgaaatcaac tcggcgcgta tgtattccgg cgcgggctcg 2048461 gccccgatgc tggccgcagc gtcagcctgg cacggcttgt ccgcagaact gcgcgccagc 2048521 gcactgtcat acagctcggt gctttcgacg ctgaccggtg aagaatggca cggtccggcg 2048581 tcggcatcga tgacagccgc ggccgccccc tacgtggcct ggatgagcgt caccgccgtc 2048641 cgggccgagc aggccggggc acaggcggag gctgccgctg cagcgtacga agccgcgttc 2048701 gcagcaacgg tgcccccgcc ggtcatcgag gccaaccgcg cccagctcat ggcgctgatc 2048761 gccaccaatg tgctaggcca aaacgccccc gcgatcgcgg ccaccgaggc ccagtacgcc 2048821 gaaatgtggt cccaggacgc gatggccatg tacggctacg ccggcgcctc ggcagccgct 2048881 acccagctga ccccgttcac cgagccggtg cagactacca acgcgtccgg cctggcggcc 2048941 cagtcggctg cgattgccca cgccaccggc gcctcggctg gtgctcagca aacgacgctg 2049001 tcgcagctga tcgccgccat accgtctgta ctgcaaggac tttcgtcatc gactgcagcc 2049061 acgttcgcgt cggggccgtc cggattgctg ggcattgtcg ggtctggatc ttcctggctc 2049121 gacaaactct gggcgttact ggaccccaac tccaatttct ggaacacgat agcttcgtcc 2049181 ggactgttct tgccgagtaa cacgattgcg ccctttttgg gtctactcgg cggcgtggca 2049241 gctgcggatg cggccgggga tgtgttggga gaggccacca gtggcgggct cggtggcgcg 2049301 ctggtggcgc cgcttggctc agcgggcggg ctaggcggca ctgtcgcggc cggcctgggc 2049361 aacgcggcca ccgtcggaac cttgtcggtg ccgccgagct ggacggcggc cgcaccacta 2049421 gccagcccct tgggctccgc gttgggaggc acaccgatgg tggcaccgcc cccagcagtg 2049481 gcggccggca tgcccggaat gcctttcggc accatgggcg gtcaaggctt cgggcgtgcc 2049541 gtgccccagt atggcttccg ccccaacttc gtcgcacgac cgcccgccgc cgggtgatcc 2049601 cgtagggggt gggttccctg gaaagcgcca gggtcacgat ggcgcagccg aatagccgac 2049661 agtgcttttc tctgcgaata ccggagttgg tcgcgcgaaa tcatttccgt ttagcgcgtt 2049721 caccagcgca ggcgggccag gctcaataag cggaaatttc tcgggcgaag cacccgtgca 2049781 gcagcgcaaa tagatgggat cggcaggacg tagacattgg gatatctggt gaagttcata 2049841 agagcttgac cagttggtgg gcagaactac gcgagcgtga ttagcatggc ggccatcgag 2049901 gggaccggag gtcagggatg ttggatttcg gggcgctacc accggagatt aattcggggc 2049961 gaatgtacgc gggtccggga tccggaccgt tgctggccgc cgcagcggcc tgggatgcgc 2050021 tagccgccga gttgtactcc gcggcggcgt cctatggctc aacgattgag ggcctcaccg 2050081 tagcaccgtg gatgggtccc tcctcgatca cgatggccgc cgcggtcgct ccatatgtgg 2050141 cgtggattag cgtcaccgcc ggccaggccg aacaggcagg ggcccaggcc aagatcgctg 2050201 cgggcgttta tgagacggca tttgcggcaa cggtgccgcc accggtaatc gaggccaacc 2050261 gcgctttgtt aatgtcgctg gtcgccacga acatcttcgg gcagaacaca ccggcgatcg 2050321 cggccaccga ggcccactac gcggagatgt gggcgcaaga tgcggccgcg atgtatggct 2050381 atgccggctc gtcggccact gcgtcgcagt tggcgccgtt cagcgagccg ccgcaaacga 2050441 ccaatccgtc ggcaacggcc gctcaatcag ccgtcgtcgc ccaggccgcc ggcgccgcgg 2050501 ccagctctga catcacagcg cagctgtccc agttgatcag cctgctaccc agcaccttgc 2050561 aaagcctggc gacaacagcg accgcgacgt cggccagcgc tggttgggac accgtcctgc 2050621 aaagcatcac cactatcttg gcgaacctca ctgggccgta cagcatcatc gggctgggcg 2050681 ctatacctgg cggctggtgg ctgacgttcg gccagatcct cggcctagcc caaaacgccc 2050741 caggtgtggc cgccctactg ggcccgaaag ccgccgccgg cgcgttgtcg ccattggcgc 2050801 cgctacgggg cgggtatatc ggagatatca cgcctctcgg tggtggggcc acagggggca 2050861 tcgcccgtgc gatctacgtc gggtcgctct cggtcccgca gggctgggcc gaagccgcac 2050921 cggtgatgag ggcggtcgca tcggtattgc cgggcaccgg cgccgccccc gccctggccg 2050981 ccgaggcacc aggtgccttg ttcggcgaga tggccctgtc gagtctggcc ggacgcgcgc 2051041 tggcaggaac cgcggtgcgc tctggtgccg gagctgctcg cgtcgcaggc ggttccgtca 2051101 ccgaagacgt cgccagcacg accaccatca tcgtcatacc cgcggactga caggactttc 2051161 gagatggcac ttgaactggg tgttagcccc caccggagag gagagaagga cggtgtcatc 2051221 gccactgtgg ccggtggctg gcggccagcc agttagcggc cggttgagga aaggtgtggc 2051281 aatggatttc ggattgcagc caccggagat cacctccggg gagatgtacc taggtccggg 2051341 cgccggtccg atgttggctg cggcagtggc ctgggatggg ttggcggccg aattgcagtc 2051401 catggcggcc tcctacgcct cgatcgtcga gggcatggcg agtgagtcat ggttgggtcc 2051461 gtcgtcggcc ggtatggccg ctgcggccgc accatatgtg acctggatgt cgggtacctc 2051521 ggcacaggcc aaggcggccg ctgaccaggc cagagccgcg gtggtcgcct acgaaaccgc 2051581 gttcgcggcg gtggtgccac cgccgcagat tgcggccaac cgcagccagc tcatatcgct 2051641 ggtggcgacc aacattttcg gacaaaacac cgccgcgatc gcagccaccg aagccgaata 2051701 cggcgaaatg tgggcccagg acaccatggc gatgttcggc tatgctagct cctcggcgac 2051761 cgcctcgcgg ctgaccccgt tcactgcacc gccgcagacc accaacccgt ccggacttgc 2051821 cggccaggcg gccgcaacgg ggcaagcgac cgccctagcg agcggcacca atgcggtgac 2051881 aaccgcgctt tcgagtgcag cggcgcagtt tccgttcgac atcatcccga ccctgctgca 2051941 gggcctggcc acactcagca cccaatacac ccaactcatg ggccaactca ttaacgccat 2052001 cttcgggccg acgggcgcaa cgacctatca gaacgtgttt gtcaccgcag ccaacgtcac 2052061 caagttcagc acgtgggcca acgacgccat gagcgcgccc aacctgggaa tgacggagtt 2052121 caaggtgttc tggcaacccc cgccggcgcc cgagatcccc aaatcgtcgt tgggtgccgg 2052181 acttggcctg cggtcagggc ttagcgcggg cctggcccac gccgcatcgg cgggtctggg 2052241 tcaggcgaac ctggtgggag acctgtcggt accgcccagt tgggcctcag ctaccccggc 2052301 ggtcaggcta gttgccaaca cattgccggc caccagcctg gctgcggccc ccgcgacaca 2052361 gatcccagca aacctgctcg gtcagatggc tctggggagc atgaccggag gtgccctcgg 2052421 tgccgccgcc cccgccatct acacgggcag tggcgcccgg gcccgcgcca atgggggaac 2052481 gcccagcgct gagccggtca agctggaggc tgtcatcgcg cagctacaaa agcaaccgga 2052541 cgcagtgcga cactggaatg tcgataaggc cgatcttgat ggcctgctgg atcgattgtc 2052601 gaaacagccc ggcatccacg cggtacacgt gtcgaacggc gacaaaccca aggttgcctt 2052661 gcccgatact cagttgggtt cacactgaac gtgattcgaa atccacactg atactggagg 2052721 tgattaccgg ctgaagcaaa gcgcattgga aatccaggct tagaccattg ccatgtggcc 2052781 gtgagattcg tcacgtcttg acatccgcgt ccggcgggtc accttcgacc gcggtcaatg 2052841 tcattggtag gtaagggctt tgctgtactg atggccgaat tttgactcga aaagtatgtc 2052901 gggccctcgc agcagatctg ccgcaggacg cgatgcaatt acaacgcacg atgggacaat 2052961 gcagacctat gagaatgcta gtagcgctcc tgctgagcgc cgccaccatg atcggcctag 2053021 ccgcacccgg gaaagccgat ccaacaggcg acgatgccgc cttccttgcc gcgttggacc 2053081 aggccggcat cacctacgct gacccaggcc acgccataac ggccgccaag gcgatgtgtg 2053141 ggctgtgtgc taacggcgta acaggtctac agctggtcgc ggacctgcgg gactacaatc 2053201 ccgggctgac catggacagc gcggccaagt tcgctgccat cgcatcaggc gcgtactgcc 2053261 ccgaacacct ggaacatcac ccgagttagc ggggcgcatt tcctgatcac cgcggtggtg 2053321 cgcggtggtg tggtgcgtcc gagggggttg cgatgcaccc ggttcgccta ggctcaaact 2053381 gctgttaacc tgcgcgtggt tggctgccgt ggccgtcttg cgatcgggaa ggactcggcg 2053441 tcatgcaaac gctgactgtc gccgatttcg ctctccggct ggccgtcgga gtgggttgcg 2053501 gggccattat cgggctcgag cgccagtggc gggcgcggat ggctgggttg cgcaccaacg 2053561 ctctggtggc gaccggtgct accttgttcg tgctgtacgc ggtcgccacc gaggacagca 2053621 gccccacccg agtggcgtcc tacgtggttt ctggaattgg attcctgggc ggcggggtca 2053681 tcctgcggga ggggttcaac gtccgcggtc tgaacacggc tgccacgctt tggtgctcgg 2053741 ccgcggtcgg agtgctggcc gcctccgggc atctggtgtt caccctgatt ggcaccggaa 2053801 ccatcgtcgc tgtccatctc ctggggcgcc cacttggccg gctggtcgac cgcgacaacg 2053861 ccgtcgaaga cgaagggctg cagccctacc aggtacgggt gatttgtcgg cccaaagcag 2053921 agacctatgt acgtgcccat atcgtgcagc gcaccagcag caacgacatc acgctgcggg 2053981 gtatacgcac ggggccggcc ggagacgaca acatcacgtt gacggcccac ctattgatgg 2054041 ttggccatac cccggccaag ctagagcggt tggtggcgga actgtcgctg cagccgggcg 2054101 tttacgctgt gcactggtat gccggtgagc acgcgcaggc cgaatgaccc acgacactag 2054161 gggcggggct gtactcgcgg cgcggccgca gccagcaagt ctgcccgact gccgttcagc 2054221 ggcgggtaga tccgccgggt attgattgac tgcttggtgg tcttggccgg tgcgccctgc 2054281 gataccactt tgcgttccca tccctcggtg tacaccgcgc ccgccgatcc tagatcgaga 2054341 accgtgacat accaagggat ccgaagagcc agcaacggtt ggtcgaacag atcgttgatg 2054401 acgttgcagc cggcatagcg gcccatcggg cgcccatgct gacacgacat gaccgacagg 2054461 tgctcgtcat ccatccgggc cgcggccaca tcgccagcag caaacatcgc aggcaccccg 2054521 atcacccgca ggtagtcgtc gacttgcagg cgtcccagcc gatcacgggc taccggcagc 2054581 tgctcggtca ggcggctggc ccgcatgccg gcgcaccaca ccacggtggc cgctgccagc 2054641 cgttcccccg atgacagcgt tacaccgccc gggctgacgg cggcaacgct cacgccggtt 2054701 ctggtctcga cgccgttgtc caacagcgcc tgttcgatca ccggccgcgc cgataaaccc 2054761 atatcggagc cgacgaaggg gttgtggtcg atgagtacca cgcggggggt gacaccatca 2054821 ccacgggcga acaacgcgtg cagtcggccc ggcaactcgc aggccgtctc gataccggtc 2054881 agcccggcac cgacgaccac gacggttgcc gccgccgatg tcagcggccc gccggccagt 2054941 ccttgcagat gctgctgtag cctgaccgcg ccgtcgtacg tgtcgacatc aaaaccgaac 2055001 tctgccagtc ctggcaacgc gggtttgacc acgtgactgc ccgacgcgag gaccagtcgg 2055061 tcatagctat atgaggcacc ggtcgacgtg gtgacgcggc ggccgtcggc gtcgatcgcg 2055121 gtcacctcgg cggtgacatg cgcaacgccg gcagggccga gcacgtcgcc gagcgggatg 2055181 cggcaggcgc tcagatcagc ctcatagttg cgaacccgga tatcatgaaa cggtttgttg 2055241 ctcaccacca tgacgtcgac cgtgcccgct aggacggcga gctcgtcgag tcgtcgggcc 2055301 gcaccgagcg ccgcccacag gcccgcgaac ccggagccga tcaccaccac ccgggtcaac 2055361 ggctaaacac ctgacgactc tggggtatcg ccgccgccgc gtggcgaccg ggcaggaaca 2055421 tccacacgtg ccaacctcct tcgagcccgg gccatccgat aaccccgtta gccgtcgcga 2055481 gcttacagaa ggtgcaggca tcgggattga gtgcatcatg ggataccggt gaataccgtc 2055541 agccggggca gccagggtag gggacacccc ccgctcgggc tgccagcgga gtatcgagcg 2055601 gatcgccatc ggcgtagcag ataccgggtc agagcagcgt acgctggcac attcggcttc 2055661 ggctcgctgg ttagcgattg ttagttgcac gcccagttga cgatccgccc gccttcgagt 2055721 cggttcacgg cgtcgtcttc tgccgcgcgg cgcgtgagtc cggttccgcc ttggtatttc 2055781 gagccgttgt aggcgaccgc gccgcacctg gtgaagcgac taaccacttt gcaagtcttg 2055841 tcaccgcact tttctagtgc gacttgctct gctcgcgccg gtgtgcgctg gtgccacgct 2055901 ttgcccgacg cgccgctggg ggcataggca atcgccccgt aatggataat cggagggata 2055961 ggcaacccgg caatttccga catcatgact tccgacatcg aaccgttggc gagatgggcg 2056021 tccaccgtcg gaaccagcag gatgcccagc ccgagagcag cccctaggcc ggcggctgcc 2056081 atcgcggttc ggcgtcggag gtttgtgatc atgtcctgcc ccctttctgc ggtcggtaat 2056141 ccagcggttt gaaagggttg agccgactta cgcgcagtgg atgcgtcgaa gggtcaatga 2056201 ggctgggtac tgagacggcc acggttggaa gcccggcgcc ctggccgatg atcgatcagg 2056261 tcatcgctgt atggaggctg cccacccacg gtgctcggtt cggtccggga ttctggcgct 2056321 tgtgtgtcat gtgcccaagt gtgcgataaa tatacctgac ccgggtaggg cataaagtct 2056381 ctaacagcac cgaccggata gggaacaacg gccttcgggc aagcggcttc actgtcaagt 2056441 cgtcacctgt cacgcatgcg agtcgtagcc tgtctgatgt ggatgccgtc gccggattct 2056501 tctcagcgct gcccgaggaa atgcgggacc cggtactgtt cgccattcca tgttttctat 2056561 tgctgctgat tctcgaatgg acggcggccc gcaagctgga aagcatcgag accgctgcta 2056621 ccgggcagcc acggcccgcc tcgggcgctt acctcacccg cgactcggtg gccagcatct 2056681 cgatggggct ggtttcgata gccaccaccg ccggctggaa gtcccttgcc ctgctcggtt 2056741 atgccgcaat ctatgcctac cttgccccct ggcagctgtc cgcccaccgg tggtacacct 2056801 gggtgatcgc gatcgttggt gtcgatctgc tgtactactc ctatcaccgc atcgcccacc 2056861 gagttcggct gatctgggct acccaccagg cgcatcactc cagcgaatac ttcaacttcg 2056921 ccaccgcgct gcgccagaag tggaacaaca gcggcgagat tctcatgtgg gttccgctgc 2056981 cactgatggg gcttccccct tggatggtgt tctgcagttg gtcgctgaac ttgatctacc 2057041 agttctgggt gcacaccgag cggatcgaca ggctgccgcg gtggttcgaa ttcgtcttca 2057101 ataccccgtc gcaccaccgg gtccaccacg gaatggaccc ggtgtatctg gacaagaact 2057161 atggcggcat cctcatcatc tgggaccgcc tgttcggtag ctttcagccg gagctattcc 2057221 gaccgcatta tggcctgacc aagcgggtcg acacgttcaa catctggaag ctgcagaccc 2057281 gcgagtacgt ggcgatcgtg cgtgactggc ggtcggcaac acgtctgcgg gatcggctgg 2057341 gctacgtctt cggaccgccg ggctgggaac cgcgcaccat cgataaatcc aatgccgccg 2057401 cctccctggt cacgtctcgg taacgtcgcg acccgacatt gcgaaagtat taccgtcggg 2057461 ttttggtacg ccttagccgt aaccggcggc gggcgatgcg cttggccccg acggatggga 2057521 gttcaaggtg gtccgcctgg taccacgcgc attcgcagcg acggtcgccc tattggcggc 2057581 cgggttttcg ccggcgaccg ccagtgccga tccggtcttg gtgttccccg gcatggaaat 2057641 ccgtcaggac aaccacgtct gcaccctggg ctacgtcgac ccagctctga aaatcgcgtt 2057701 taccgcgggg cattgtcggg gcgggggagc ggtcaccagc cgggactaca aggttatcgg 2057761 ccatctcagg gccatccggg acaacacacc cagcggctcc accgtggcca cgcacgagtt 2057821 gatcgccgac tacgaggcga ttgtgctggc tgacgacgtc acggcaagca acattttgcc 2057881 gagcgggcgt gcactggaat ccagaccggg tgtggttctt cacccgggcc aagcggtctg 2057941 ccatttcggc gtcagcacag gcgaaacctg tgggaccgtc gaaagcgtca acaacggctg 2058001 gttcaccatg tcccacggcg tgctcagtga gaagggggat tcggggggcc cggtctacct 2058061 ggcccccgat ggcggccccg cgcagatcgt cgggatcttc aacagcgtct ggggcggctt 2058121 tcccgcggcg gtgtcctggc ggtcgacgtc cgagcaggtt cacgcggatc tcggcgtgac 2058181 gccccttgct tagcaagcac cccgttagcg gccaccaggt tgatcgccgt gtgtttgcta 2058241 gagcggtgat ctcggttgtg tcagacttgc cgcgtgggca aacgccggga tgcgagggaa 2058301 cagatcgagg cgaaaattgt cgaactcggc cgtcgccagc tgctggatca cggcgcggcc 2058361 gggttgtcgc ttcgggcaat tgcccgcaac ctgggcatgg tgtcctcggc cgtataccgc 2058421 tatgtgtcca gtcgtgatga gctgttgact ttgctgctcg tcgacgccta ctccgacctg 2058481 gccgataccg tggaccgagc ccgcgacgac accgtcgccg actcgtggag tgacgacgtc 2058541 atcgcaatcg ctcgagcggt gcgcggttgg gcagtcacta accccgcccg ctgggccttg 2058601 ctatacggta gcccggttcc tggttatcac gcgccgcctg accgtaccgc gggcgtcgcc 2058661 acccgcgtgg tcggagcgtt cttcgacgcg atcgccgcgg gaatcgccac cggagacatc 2058721 aggttaaccg atgacgttgc gccgcagccg atgtcatcgg acttcgaaaa gatccggcag 2058781 gagttcggct ttcccggcga cgatcgtgtc gtcacaaagt gctttctgct ctgggcgggc 2058841 gtggtgggcg cgatcagcct ggaggtattc ggtcagtacg gggccgacat gctaaccgat 2058901 ccaggagtgg ttttcgatgc ccagacacgg ctgctggtgg ccgtgctggc cgagcattga 2058961 agctgctgca atcggcgtgt ccagccggaa ttagaacgtg ttcactcaag gctaccagtg 2059021 ctgacacttg cggtggtggc aaatgcaatc tgagcccttt ctggcctctg gcaagctggg 2059081 ctgtcctgcg agacgctcat ccttctcgtt ctgtcgctga tacagatcgc aggggttacc 2059141 cccggaccta gaagccgccg aaacggctct caccggcttg ttaggcgtcc ggaagcggat 2059201 tcggatgcgc gatgtccgct ttgcgcacga cacctgtagc agtctgggca agcccgcgat 2059261 gtcgtcgcga gtatctcgtt gagctatctc ggagagatgc ccttcgagtt agtatcgtcg 2059321 gttcgtgtag agaatatcta tagtgacttt tgcgggactg tgggccgggt ctacaccagg 2059381 ggctcgaagc cgcattggcc gaagcaagcg gaggtgcaag tgccgacatg agcggcgcca 2059441 atgagccgcg ccggcgacga tgcagtgggg gtaccgcccg cttgcggggg acgaagcgat 2059501 gacgaggagc ggcgccaatg agccgcgccg gcgacgatgc agtgggggta ccgcccgctt 2059561 gcgggggacg aagcgatgac gaggagcggc gccaatgagc accgacatac ccgccaccgt 2059621 tagtgcggag accgtgacgt cctggtcgga tgacgtcgat gtaacggtga ttggtttcgg 2059681 catcgccggc ggttgcgcgg cggtcagcgc ggccgccgcc ggcgcccggg tactggtgct 2059741 cgaacgtgcc gccgcggcgg gcggcaccac cgcgcttgcc ggggggcact tctacctggg 2059801 gggcggaacc acggtgcagc tggcgaccgg tcatcccgat tcacccgagg agatgtacaa 2059861 gtacctggtc gcggtctccc gagagcccga tcacgacaag attcgcgcct attgcgacgg 2059921 cagcgtcgag catttcaact ggttggaggg cctgggtttt cagttcgagc gtagttactt 2059981 tcccggcaag gctgtgattc aacccaacac cgagggcttg atgttcaccg gaaatgagaa 2060041 ggtgtggcca ttcctggagt tggcggtgcc ggcaccgcgc gggcacaagg tacccgtgcc 2060101 gggcgacacc ggcggtgccg ccatggtgat cgacctgctg ctcaagcgag ccgcaagcct 2060161 ggggatacag atccgctacg agacgggcgc caccgagctc atcgtggacg ggaccggcaa 2060221 ggtaaccggg gtgatgtgga agcggttctc cgaaaccggt gcaatcaaag cgaagtcggt 2060281 aatcatcgcg gccggcggat tcgtgatgaa cccggacatg gtggccaaat acactccgaa 2060341 actggccgag aagccgttcg tgctgggcaa cacctacgac gacgggttgg gcatccggct 2060401 gggtgtatca gccggcggcg ccacccaaca catggaccag atgttcatca cggctccgcc 2060461 gtacccgccg tcgatcttgc tcaccggcat catcgtcaac aaactcggac agcggttcgt 2060521 cgccgaggac tcctaccatt ccaggaccgc tgggttcatc atggaacagc cagacagcgc 2060581 ggcgtatttg atcgtcgacg aagcccacct ggagcacccc aagatgccgc tagtcccgtt 2060641 gatcgacggc tgggaaacgg ttgtggaaat ggaagccgcg cttggcattc caccgggcaa 2060701 cctggcggcg acgctggacc gctacaacgc ctacgccgcg cgcggcgcag atcccgattt 2060761 ccacaagcag ccggaattcc ttgcagcaca agacaacggg ccgtgggggg cgttcgacat 2060821 gtcgctgggc aaggcgatgt atgccggatt cactctgggc gggctggcca cgtcggtgga 2060881 cggtcaagta ctgcgcgacg acggcgcggt ggtggccggc ctgtacgcgg tcggggcatg 2060941 cgcgtccaat atcgcccagg acggcaaggg atatgccagc gggacccagc tgggtgaggg 2061001 gtcgtttttc gggcgtcgcg ccggagcgca tgcggcagcc cgagcgcagg gcatgtaagc 2061061 ctcctcgcgc cgcgactggg aatcctgcga cgcgacacgc cgacaaggcg tcgtgagatt 2061121 cacagtcgca gcgcggcttc aggtaagacg ccgggagcgc ggtagccggc ctcccggcta 2061181 cggtaacccg ttcatcccgt tcttacccaa cagcccgccg gcaccgccgg tgcccgcgct 2061241 gccgttaggt gtgccactcc cggcgttgcc gccgttgccg ccgttgccga ccaggatggc 2061301 accgccgcca gcgccgccgt caccgccctt ggcaccggtg ccgtttcctc cggcgccgcc 2061361 gtcaccgccg tcgccgatca gcccggcttt gccgccgagc ccaccggcgc ccccggcacc 2061421 gccgaagccg aatccgccgg cgccgccggc accaaacagc aggcccgcag tgccgccgtt 2061481 tccgccggcg ccgcccaccc cggtagcgcc accgccgagt gcgccggcgc cgccggcccc 2061541 gccggcgcct accagcaggc cggcgttgcc gcccgccccg ccggcaccgc cggtagtgga 2061601 cccgacccca cccgcgccgc cggcaccgcc gtcgccccag agcagggcgg acccgccgga 2061661 ccccccggca ccgccgttcc cgaccaatcc gattccgccg gcgccgccgg ccccaccgac 2061721 gccgaacagc ccaccggccc cgccggcacc accgggcccg ccgggggcgg tgcccaggaa 2061781 tgccacaccg tcaccgccaa caccgcccac cccgccggcg ccgaacagga gcccgccatt 2061841 gccgccggcc ccgccggcac cgccggtgac attagtgccg gtgccgccgg ccccgccggc 2061901 accgcccacg ccgaagaaca acccgccgtc tccgccggcc ccgccgtcac cggcgtcagc 2061961 cgcgagtccg ccgacgccgc cggccccgcc ggcgccgaac agcagcccgc cattgccgcc 2062021 ggccccgccg gccccaccaa taccgcccac cccaccaccg gcgcgtccgc cggcgccgcc 2062081 ggccccgccg gcgccgtaga gcagcccgcc ggccccgccg gccccgccga accctgcggt 2062141 gccggacgct acgttccccc cggcgccgcc ggccccgccg ttgccgaaca ggccagcggc 2062201 tccgccgttg cccccgggca tgccggccgc gccggagccg ccggccccgc cgttgccgat 2062261 caagattccg ccgtcgccgc cgtttgcccc ggtccccggg gccccgttgg ctccgttacc 2062321 gatcagtggg cgccccaaca gcgccagggc gggggcgttg atcacgtcga gcacaccctc 2062381 tagcggggcc gcgctggcgg cctcggcggc cgcatacgag cccgccccgg cggtgagcgc 2062441 ccgcacgaac tgctcgtgaa acagcgccgc ctgggcgctc agcgcctgat aggcctgggc 2062501 gtgtccggag aacaatgccg ccatcgccgc cgacacctca tcggcggcgg cggccaacac 2062561 cgtcgtggtc gggaccgcgg cggccgcgtt ggcggtgccg atcgtcgacc cgatacccgc 2062621 caaatcggtc gccaccgccg ctagcgcctc cgggatcgtg accacaaatg acatctggca 2062681 cctcgtcaac accctgtggc cccggcgcgg ggccgctacc gatcgcctgg tcactcccca 2062741 gagatcgacg gattcagcgt atcgcgatca cggaagcggc cacgccgatt tgggaagctc 2062801 gtcccggctt acacttcggc gggcgccgcc tcgactgggg ccagccgcca ttggccgcca 2062861 ccgagtagtt cgagctggtt ttcgtgcagc cgctcgaggg cggggcgatg gctgacgctg 2062921 atcacgatgc agtccggcag ctcgctgcgc agcaattggt agagcgcaaa ctccagcccg 2062981 gtgtccagcg ccgaggtact ttcgtcgagg aagaccgcct tgggtttggt gagcaggatg 2063041 cgagcaaagg caacacgttg ctgctcaccg ggggagagca ccttggccca gtcgcgttcc 2063101 tcgtccagcc ggtcacacag tggggccagc gccaccttgg tcagcgtgtc ccgcagggtg 2063161 gcgtcgggga tggcggccgc agagttgggg tagcacacca cgtcacgcag cgtccccagc 2063221 ggcacatacg gcaactgcga caagaacatc gtctcgttct cgccgcccgg ccggtgcagg 2063281 gtccccgatg cgtagggcca cagttccgcc agactgcgca gcagcgtggt cttgccggcc 2063341 ccagaacgcc cggtgatcac cagcgagcct ccgcggtcca gccgcacatc gagcgggtcg 2063401 atcaaccgat cgccggcagg cgtacgcacc tcgatgtcgt tgagctcgac ggactcgtcg 2063461 tcgctcggtc gggtcaggac cgcgggcagg gcgcggcctt tctcgttggc gtcgaccagc 2063521 ccatgcaatc ggatgattgc tgcgcggaag gacgcaaacg cgtcgtagtt gttgcggaag 2063581 aacgacaacg agtcgtgaat gttgccgaag gaagtcgccg tctgcccgac atcgccgaag 2063641 tcgatctgcc cggcgaataa tcgaggcgcc tggatgaccc acggcaacgg aacaattgtc 2063701 tggctcaccg acagattcca tccattgaat gcgatgctgc gccgaacgta gcgacggtaa 2063761 ttgtcgatca ccggcgtgaa ccgccgctgt agctgggtac cttccacccg ctcgccgcgg 2063821 tagaaaccca ccgcctcggc ggcgtcgcgt agccgaacca gcgcgtaacg gaaagcggca 2063881 ttgagctttt cattgcggaa gctgagccag atcaggggcc gcccgatgat gaacgagatg 2063941 accgtggcca cgaacacata gaccagcacg gtccagaaca ttgcgcgcgg gatggacacg 2064001 ccgaagatat tcagggtgcc cgagagattc cacaggatcg ctgtgaaaga aatcaccgaa 2064061 atgatcgact gcacggcccc gaaaagcagc gtgctggccg tcccgttgga gggagcattc 2064121 ggagtgccgc ctgccccggc ggtgaagata tcgacgtctt gctgaatgcg ctggtcgggg 2064181 ttgtcgatcg tttcgtcgat gaacaggtct cggtagtagg ccctgccgtc gagccagtct 2064241 tgtgtgaggt ggtgggttag ccagaccctc caggcgatga tgaagcgctg cgtcaagtag 2064301 atgtcggcca tgacccgggt cacgtgcagc acggccatca cgctgaaaac cccgatcgac 2064361 atccaaaatc ctcgcacgcc tgagcgtttg accgtgccat cgccagaggc gatgccctcg 2064421 aaggccttct gcaaggccgt gtacatgtcg ttgccttggt agctgaatag cacattcagg 2064481 cgcactgcca gcactaccga aagcaacaac acgccgagca tcagccacac gcgaacgctg 2064541 ttggggccaa cgaagtatgc gcgggtgatc cgccagaact gccggcccca gggcgtcaaa 2064601 tacctgagca gaaccaatat cgcgagcaca cagatggcac tgatcgtcca ggctttgccg 2064661 acccaataca cggaatccgg gaatgctcta gaccaatcga tggacggctt aaacaatttc 2064721 gggcccaagg tcgacgtctc ctcacaaaca gaaatccttc gggcgaaggt acccgaaggt 2064781 tgtcgatagg ctgccgatat gagcaccgac accgccccgg cccagaccat gcatgctggc 2064841 cggcttatcg cgcgccgact taaagccagt ggtatcgaca cggtcttcac gttgtcgggc 2064901 ggccacctgt tttccatcta cgacggctgc cgtgaggagg gcatccgcct gatcgacacc 2064961 cgccacgaac aaaccgccgc ctttgccgcc gaaggctggt cgaaggtgac cagggtgccg 2065021 ggcgtggccg cgctcaccgc ggggccgggg atcaccaacg ggatgagcgc gatggcggcg 2065081 gcccagcaga accagtcacc actggtggtg ctcggcggcc gggcgccggc gctgcgctgg 2065141 ggtatgggct ccctgcagga gatcgatcac gtgccgtttg tggcgccggt ggcccgcttc 2065201 gccgctacag cgcagtcagc cgagaacgcg ggcctgctgg tcgatcaggc gttgcaggcg 2065261 gcggtgagtg cgccgtcggg tgtggcattc gtcgacttcc cgatggatca cgcgttctcc 2065321 atgtcctcag acaatggccg ccccggcgcg ctcaccgagc taccggccgg tcccacccca 2065381 gccggcgacg ccctggaccg ggcggcgggc ctgctttcga cggcccagcg tccggtcatc 2065441 atggcaggta ccaacgtctg gtggggccat gcggaggcgg cattgctgcg tcttgtcgag 2065501 gaacggcaca ttccggtgct gatgaacggg atggcgcgcg gcgtggtgcc cgccgatcac 2065561 cggttggcct tctcacgggc gcggtcaaaa gcgctggggg aggctgatgt cgcgctgatc 2065621 gtcggtgtgc cgatggattt ccgtctgggc ttcggtgggg tattcgggtc gacaacgcag 2065681 ctcatcgtgg cagaccgcgt cgaacccgca cgcgaacatc cgcgaccagt cgcggcgggg 2065741 ctctatgggg atctgaccgc caccctttcg gcgctggccg gatctggcgg caccgaccac 2065801 cagggctgga tcgaggagct cgcgacggcc gagaccatgg cgcgtgatct cgagaaggcc 2065861 gagctggtcg atgaccggat cccattgcat ccgatgcggg tgtacgccga gctggccgcg 2065921 ctgctggagc gggatgctct agtcgttatc gatgcgggcg atttcgggtc gtacgccggc 2065981 cggatgatcg acagctatct gccaggctgt tggctggaca gcggtccgtt tggctgcctg 2066041 gggtcgggtc ccggctacgc cctggctgcc aaactggcgc ggccgcagcg ccaggtcgtg 2066101 ctcttgcagg gcgacggcgc gttcgggttc agcggcatgg aatgggacac gctggttcgg 2066161 cacaacgtgg cggtcgtgtc agtgatcggc aacaacggca tctggggttt ggagaagcac 2066221 ccgatggaag cgttgtacgg ctattcggtg gtggccgaac tgcgcccggg aacccgctac 2066281 gacgaggtgg tgcgcgcact gggcggccac ggcgagctgg tgtcggtgcc cgctgaactt 2066341 cggccggcgc tggaacgggc ctttgccagt ggcctgcccg ctgtggtcaa cgtgctcacc 2066401 gacccaagcg tggcttatcc acgccgatcc aacctggctt gacgtccagc cgggccgtga 2066461 acgtgcacgg ttgtccacga attgcggcct gtcggtgtac agacacgcac cctcgcggcc 2066521 ggccggcatt cgcgtaccgt tggtttgtgc ccaagaccac ccgcgctcaa cccggccggc 2066581 tgagcagccg attctggcga ttgctcggcg ccagcaccga aaagaaccgg agccgctccc 2066641 tggcggatgt aaccgcttcg gcagaatacg acaaggaagc tgccgatctg tccgacgaga 2066701 agctgcgtaa ggcggcaggc ctgctcaacc tcgacgacct cgcggagtcc gccgatatcc 2066761 cgcagtttct cgcgattgcc cgggaagccg ccgagcggag gaccgggctg cgaccatttg 2066821 atgtgcagtt gcttggcgcg ttgcgcatgc tcgccggaga cgtgatcgag atggccaccg 2066881 gtgagggcaa aacccttgcc ggggcgatcg cggccgccgg ttatgcgctg gccggccggc 2066941 acgtgcacgt cgtgacgatt aacgattacc tggcccgccg cgatgcggag tggatgggcc 2067001 cgctgctgga cgcgatgggc ctgacggtcg gctggatcac cgcggactcg acccctgacg 2067061 agcgccggac cgcatatgac cgtgatgtca cctatgcctc ggtcaacgag attggcttcg 2067121 atgtactgcg cgatcagttg gtgactgatg tcaatgacct ggtatcgccc aatccagacg 2067181 tggctctcat cgacgaagcc gactccgtgc tggtcgacga ggcgctggtg cccctggtgc 2067241 tggccggaac cacacatcgt gagacgccgc ggctggagat catccggctg gtcgctgagc 2067301 ttgttggcga caaggacgcc gacgagtact ttgccaccga ttccgataac cgcaatgtcc 2067361 acttgaccga gcacggggca cgcaaagtcg agaaagcgct cggtggcatc gacctgtact 2067421 ccgaggagca cgtcggcacc acactgactg aggtcaatgt cgcgctgcac gcgcatgtgc 2067481 tcctgcaacg cgacgtgcac tacatcgtcc gcgacgacgc ggtgcacctg atcaacgcgt 2067541 cgcgtggccg tatcgcgcaa ctgcagcgct ggccggacgg gttgcaagct gcggtcgagg 2067601 ccaaggaagg tatcgagacc acggaaactg gggaagtgct cgacaccatc acggtgcagg 2067661 ccctgatcaa ccggtatgcg actgtgtgcg gaatgacggg aaccgcgctg gccgccggtg 2067721 agcagctacg gcagttctac cagctcggtg tctcaccgat accaccgaac aagccaaaca 2067781 tccgcgagga cgaggccgac cgggtctaca tcaccactgc agccaagaac gacgggatcg 2067841 tcgagcacat caccgaggtg caccagaggg ggcagcctgt gctggtcggt acccgcgacg 2067901 tggccgaatc cgaggaactg cacgaacgcc tggtgcgccg cggtgtgccc gccgtggtgc 2067961 tcaacgcgaa gaacgacgcc gaggaggccc gggtcatcgc cgaggccggc aaatacggcg 2068021 cggtcacggt gtcaactcaa atggccgggc gcggcaccga catcaggctc ggcgggtccg 2068081 acgaagctga ccacgacagg gtcgcggaat tgggcggcct gcacgtggtc ggcactggcc 2068141 gtcaccacac cgagcggcta gacaaccagc tgcgcggtcg ggccgggcgc cagggagatc 2068201 ccgggtcgtc ggtgtttttc tcaagctggg aagacgatgt cgttgcggcc aacctcgacc 2068261 acaacaagct gccgatggca accgacgaaa atggccggat tgtcagcccg aggacgggta 2068321 gtctgctcga ccatgcccag cgcgttgccg agggccggtt attggatgtg cacgccaaca 2068381 cgtggcgcta caaccagctg atcgcccagc agcgcgccat catcgtcgaa cggcgtaaca 2068441 cgttgttgcg caccgtaacc gcgcgtgagg aactcgccga actggcgcct aagcggtacg 2068501 aggagctgtc cgacaaagta tccgaggaac gcctcgagac gatttgtcgg cagatcatgc 2068561 tgtatcacct cgaccgtggc tgggccgatc acctggcgta tctggccgac atccgggaga 2068621 gcatccatct acgcgcgctg ggccggcaga acccactcga cgagtttcac cggatggctg 2068681 tggacgcgtt cgcgtcgctg gccgccgacg ccatcgaggc ggctcaacag acgttcgaaa 2068741 ccgcgaacgt ccttgaccac gagccggggc tggacctgtc caaactggcc cggccgacgt 2068801 cgacatggac ctacatggtc aatgacaacc cactgtccga tgacacgctt tctgccctca 2068861 gtctgcccgg ggtgttccgc tgagctgccc agcgtaagcg ccgagcgtaa cgccactgcg 2068921 aaatttcggg cagaaaatcg cagtggcgtt acgctcgcgg ctaggggtgc ccccacagcc 2068981 cgccgtttcg gcgcgcatcg tcgccaggct agatccgatt gcccggctcc tcagcccgcc 2069041 gtttcggcgc gcatcgtcgc caggctaagg tcacggctca tggagccggt gctcacgcag 2069101 aatcgggtgc tgactgtccc caacatgttg agcgttattc gcctcgcgct catcccagca 2069161 ttcgtctacg tcgtgctcag cgcgcacgcc aatggctggg gggtagcgat cctggtgttc 2069221 agtggcgttt cggactgggc tgatggcaag attgcacggc tactaaacca gtcatcgcgg 2069281 ctgggcgcgc tgctggaccc ggccgttgat cgcctctaca tggtcactgt tcctatcgtg 2069341 tttggcctga gcggcatcgt gccgtggtgg tttgtcctta cgttgctgac ccgcgatgcg 2069401 ctgctggctg ggacgctgcc gctgctatgg agccgtggac tgtcagcgct accggtgacc 2069461 tacgtcggta aggcagcgac tttcggcttc atggttggct ttccgaccat tctgttgggg 2069521 caatgcgatc cattgtggag ccatgtgctg ctggcctgtg gttgggcatt cttgatctgg 2069581 ggtatgtatg cctacttgtg ggccttcgtg ctgtatgcag tgcagatgac gatggtggtg 2069641 cggcagatgc ctaagctcaa gggcagggct catcggccgg cggcccagaa cgctggtgaa 2069701 cgtggctgag tctgaccggc tgctcggcgg ctacgacccc aacgccggct acagcgccca 2069761 cgcaggggcg cagccacaac gcatcccggt tccgtcgttg ctgcgcgcgc tgctatcaga 2069821 gcatctggat gctggatacg cggcggttgc cgccgagcgc gagcgtgctg cggcaccacg 2069881 gtgttggcaa gcccgcgccg tcagctggat gtggcaggca ttggccgcga ccctagtcgc 2069941 cgccgtgttc gctgccgcgg tagcgcaggc gcgctcggtg gcacccggcg tgcgcgccgc 2070001 ccaacagttg ctcgttgcga gtgtgcgatc aacccaggcc gccgcgacca cgttggctca 2070061 acggcgcagc acactctcgg cgaaagtcga cgacgtgcgg cggatcgtac tcgcagacga 2070121 cgccgaggga cagcggctgc tggcccgtct cgacgtgctt agcctggccg cggccagcgc 2070181 accggttgtc gggcctggtc tgacggtgac cgtgaccgat cccggtgcga gccctaatct 2070241 ttccgacgtg tccaagcagc gggtcagcgg tagccagcaa atcatcctcg accgcgattt 2070301 gcagctcgtc gtcaactcac tgtgggaaag tggcgccgag gccatctcga tcgatggcgt 2070361 ccggatcggg ccgaacgtca cgatccggca agccggcgga gcaatcttgg tcgacaataa 2070421 tcccacgagt agtccctaca ccatcttggc ggtcgggccg ccacatgcca tgcaggacgt 2070481 cttcgatcgc agcgccgggc tgtaccgcct gcggctgctg gagacctcct acggtgtcgg 2070541 cgtcagtgtg aacgtcggcg acggtctggc attgcctgcc ggtgcgaccc gggatgtcaa 2070601 gttcgccaaa cagattgggc cctagtgaga gaagtcctgg tgaataggaa accatgggga 2070661 gcgatacggc ctggagtccg gcgcgcatga tcgggatcgc ggcgctcgcc gttggaatcg 2070721 tgctgggttt ggttttccat cccggcgtgc cagaggtcat ccagccgtat ctgccgatcg 2070781 cggtggtcgc cgcgctcgac gcggtgttcg gtggcttgcg cgcctatctc gagcggatct 2070841 ttgacccgaa ggtcttcgtg gtttcgttcg tgttcaacgt tttggtggct gccctaatcg 2070901 tctatgtcgg tgaccaactg ggcgtcggca cacagttgtc caccgcgatc atcgtcgtgc 2070961 tgggcatccg catcttcggc aacaccgcgg ccttgcggcg gcggttgttc ggagcgtgac 2071021 ggagatgaga tcaccgtgag tgagaatcgc ccagaacccg tggcagccga gacttccgcc 2071081 gccacaactg cgcgtcactc ccaagccgac gcgggcgctc acgacgccgt gcgacgtggt 2071141 cgtcacgaac taccagccga ccatccgcgc tccaaggtcg gaccgctgcg gcggacaaga 2071201 ttgaccgaaa tactgcgggg tggtcgctcg cgtctggtgt tcgggacgct tgcgatcttg 2071261 ttgtgcttgg ttctgggggt tgccatagtc actcaggtcc gtcagaccga ctccggtgat 2071321 tcattggaaa cagcccgtcc tgcagaccta ttggtgttgt tggattcgtt gcggcaacgc 2071381 gaggccacgt tgaacgccga agtgatcgac cttcagaaca cgctgaacgc gttgcaggca 2071441 tccggcaaca ccgatcaggc agcgttagaa agcgcccagg ctagattggc cgcgttgtcc 2071501 atcctggtcg gcgccgtggg tgccaccggg ccgggcgtca tgataacgat cgacgatccg 2071561 ggacccggag tagcgcctga ggtgatgatc gacgtgatca acgaactgcg tgccgctgga 2071621 gccgaggcga tccagatcaa cgatgcacac cggtcggtgc gggtcggggt tgacacctgg 2071681 gttgtcggtg tgcccggctc actgacagtc gacaccaagg tcctgtcccc gccgtattcg 2071741 attctggcga ttggtgatcc tccaacgctg gccgcggcga tgaacattcc tggtggtgca 2071801 caggacggtg tcaaacgcgt cggcgggcgg atggttgtgc agcaggccga ccgtgtggac 2071861 gtgaccgcct tgcggcaacc aaaacagcac caatacgctc agcccgtcaa gtgaactagc 2071921 ccaactccga gccgaccaga ataggattac cgtgagcgat atcccgtccg atctgcacta 2071981 caccgccgaa cacgagtgga ttcgccgcag tggcgacgac accgtccggg tggggatcac 2072041 cgactatgca cagtcggcgc ttggcgacgt cgttttcgtt cagctacccg ttatcggcac 2072101 cgcggtcacc gccggcgaga ccttcggcga agtggaatcg acgaaatctg tgtcggatct 2072161 ctatgcgccc atttcgggta aggtgtctga ggtcaacagc gatctggacg gcactccgca 2072221 attggtgaat tccgacccct acggagccgg ctggctgctg gacatccagg tcgacagctc 2072281 ggatgtcgct gccctggagt cagctttgac gacactgctc gacgctgagg cctaccgcgg 2072341 cacactgacc gagtgacgat tgctaaggtc cctgccagcg tcacgtggga ggtcgcgggt 2072401 ctgcacggat ccgggccggg cagggcaatc gagcctggga tccgctgggg tgcgcacatc 2072461 gcggacccgt gcgcggtacg gtcgagacag cggcacgaga aagtagtaag ggcgataata 2072521 ggcggtaaag agtagcggga agccggccga acgactcggt cagacaacgc cacagcggcc 2072581 agtgaggagc agcgggtgac ggacatgaac ccggatattg agaaggacca gacctccgat 2072641 gaagtcacgg tagagacgac ctccgtcttc cgcgcagact tcctcagcga gctggacgct 2072701 cctgcgcaag cgggtacgga gagcgcggtc tccggggtgg aagggctccc gccgggctcg 2072761 gcgttgctgg tagtcaaacg aggccccaac gccgggtccc ggttcctact cgaccaagcc 2072821 atcacgtcgg ctggtcggca tcccgacagc gacatatttc tcgacgacgt gaccgtgagc 2072881 cgtcgccatg ctgaattccg gttggaaaac aacgaattca atgtcgtcga tgtcgggagt 2072941 ctcaacggca cctacgtcaa ccgcgagccc gtggattcgg cggtgctggc gaacggcgac 2073001 gaggtccaga tcggcaagtt ccggttggtg ttcttgaccg gacccaagca aggcgaggat 2073061 gacgggagta ccgggggccc gtgagcgcac ccgatagccc cgcgctggcc gggatgtcga 2073121 tcggggcggt cctcgacctg ctacgaccgg attttcctga tgtcaccatc tccaagattc 2073181 gattcttgga ggctgagggt ctggtgacgc cccggcgggc ctcatcgggg tatcggcggt 2073241 tcaccgcata cgactgcgca cggctgcgat tcattctcac tgcccagagg gaccattacc 2073301 tgccgctgaa ggtgatcagg gcccagctgg acgcccagcc cgacggtgag ttgccaccat 2073361 tcggatctcc ttacgttcta ccgcgattgg tgcccgtagc cggcgacagt gctggcggcg 2073421 tcgggtcgga caccgcgtcc gtgtcgctca cgggtatccg gctcagtcgg gaagacctcc 2073481 tggaacgatc ggaagtggcc gacgagctac tgacggccct gctcaaagcc ggtgtgatca 2073541 ccaccgggcc gggcggcttc ttcgacgaac acgccgtcgt gatcctgcaa tgcgcacgag 2073601 cgctggccga atacggcgtc gagccgcggc atctacgcgc cttccgctcc gcggccgacc 2073661 ggcagtccga cctgattgcc cagattgccg gcccgctcgt caaggccggc aaggccggtg 2073721 cccgcgaccg ggccgacgac ttggcccgtg aggtggccgc gcttgctata actttgcaca 2073781 cgtcgctgat caagtctgcg gttcgcgacg ttcttcaccg ctgaggacta gacttcgttc 2073841 gacagcttgg tgttcgacgt cacggtagag acgtggcgcc caccgcgtcg tcgcaccgag 2073901 cgtgagtcgg acaccggttg catgtgcgga gggcagacgc agatgggtga agttcgtgtt 2073961 gtcggcattc gcgtcgagca gccgcagaac cagccggtgc tgttattgcg cgaggccaac 2074021 ggtgatcgat acctgccgat ctggatcggc cagtcggagg ctgccgctat cgcgctggag 2074081 cagcaaggcg tcgagccgcc acgtccgctg acccatgatc tgatcaggga tctcattgct 2074141 gcgctggggc attcgctcaa agaggtgcgc attgtagacc tgcaggaagg aactttctac 2074201 gctgatctga tcttcgaccg caatatcaag gtgtccgccc gtccctcgga ctcggtggca 2074261 atcgcattgc gagtgggtgt tccgatctac gtcgaggagg ccgtactagc ccaggccggt 2074321 ctgctgattc ccgacgaaag tgacgaggag gccaccaccg ctgttcgcga ggacgaggtg 2074381 gagaaattca aagagtttct cgacagtgtg tcacctgacg atttcaaggc cacctagcgc 2074441 ggcgacgatg cgcgccggga cggcgggctg aggaggcgcg cgataaggcc gagcgcggcg 2074501 acgatgcgcg ccgggacggc gggctgagga ggcgcgcgat aaggccgagc gcggcgacga 2074561 tgcgcgccgg gacggcgggc tgaggaggcg cgcgataagg ccgagcgcgg cgacgatgcg 2074621 cgccgcgacg gcgagcatcc attatttgcc ggccagcaac gtcacggctg cgtctcatct 2074681 ctggctgcaa ttgtcgacac gcctagcggt tagtgcctaa tgcgcccggc gaccgcgata 2074741 ctttgatcac gacctgatag ttaaccggga gcatcgcgcc catcgaacag cgtatgctct 2074801 ctaacactcg ggccctcagt aatggctgtc gggggagcca gtgacgcagc tagtgacaag 2074861 agcgcgatcg gcgagaggaa gcaccttggg cgagcagcca cgtcaagacc agctcgactt 2074921 tgctgaccac acgggcactg ctggtgatgg taacgacggc gccgctgcgg ccagcggacc 2074981 cgtgcagccc ggcctgttcc ccgacgattc cgttcctgac gagttggtag gttatcgcgg 2075041 accgagcgcc tgccagatcg ctgggatcac ctaccgccag ctcgactatt gggcgcgcac 2075101 atcgttggtt gtgccgtcga tccgtagtgc ggcaggatcc ggcagccagc ggctgtactc 2075161 gttcaaggac atcttggttc tcaagatcgt caaacggttg ctcgacaccg gtatctcgct 2075221 gcacaacatc cgggttgcag ttgaccatct gcgccagcgt ggcgtccagg atctggccaa 2075281 catcaccttg ttctccgatg ggaccaccgt gtacgagtgc acgtcggccg aggaggtcgt 2075341 cgacctcctg cagggcggcc agggtgtgtt cggcatcgcc gtctcgggcg cgatgcggga 2075401 gctgacgggt gttatcgccg acttccacgg tgagcgcgcc gacggcgggg agtcgattgc 2075461 tgcccccgaa gatgaactgg cctcccgacg caagcatcgc gaccgcaaga tcggctagcc 2075521 gagagttccc ccgcgaacag acacagaatc gcacgcggca ggctcctcgg atgcgattgt 2075581 gtgtctgctc ggcagtagac tggacaacgc atcgctctag tgcgggagag ttctgtggct 2075641 gccagctacg gacgccgaag gagcaatacc tctccgtcaa cctctcaggc acccggaccg 2075701 cgcgagacta cgatgcctct ggaaagcggt ggcgacccct ggcggtcctc acccgccgat 2075761 ggggaaaggc gattcacctg acggtggaca gagtcgccga atctctcagg cgcctggcgt 2075821 gcaggtgaag acagagggag agggccgcta gtcctctgct ttgtcaggag ttcaccgtgt 2075881 ccgaccattc gacgttcgca gaccggcaca tcggtctgga cagccaggcc gtcgcgacca 2075941 tgctcgccgt gatcggggtg gattcgctcg atgacctggc agtcaaggcg gtcccggcgg 2076001 gcatcctaga cacactcacc gacaccggag ccgcaccggg tttggacagt ctgccaccgg 2076061 ctgccagcga agccgaggcg ctggccgagc tgcgagcgct ggccgacgct aacaccgtcg 2076121 ccgtgtcgat gatcgggcaa ggctactacg acacacacac ccccccggtg ctgttgcgca 2076181 acatcatcga gaacccggcc tggtataccg cctacacgcc gtaccagccc gagattagtc 2076241 agggtcggct ggaagccttg ctgaacttcc agaccctggt caccgatctg accggcctcg 2076301 agatcgcgaa cgcgtcgatg ctcgacgagg gcaccgcggc ggccgaggcc atgactttga 2076361 tgcaccgcgc ggcccgcggg ccggtgaaga gggtggtcgt ggacgccgac gtgttcaccc 2076421 agaccgcggc ggtgctggcc acccgcgcca agccgctggg tatcgagatc gtcacggccg 2076481 acctgcgcgc cggtctgccc gacggcgaat ttttcggcgt catcgcccag ctgcccgggg 2076541 ccagcggccg gatcaccgac tggtctgccc tggtgcaaca ggcccacgac cgtggcgcac 2076601 tggtggccgt cggcgccgac ttgttggcgc tgacgctgat cgcgccgccc ggagagatcg 2076661 gcgctgacgt cgcctttggc accacacaac ggttcggagt gccgatgggg tttggcggcc 2076721 cgcatgccgg gtaccttgcg gtgcacgcca agcatgcgcg tcagctgccc ggccggctgg 2076781 tcggtgtgtc cgtcgacagt gacggcacgc cggcctatcg gttggcgctg cagactcgcg 2076841 agcaacacat ccgccgcgac aaggccacca gcaacatctg caccgcacaa gtgctgttgg 2076901 cggtgcttgc cgcgatgtac gcgagctacc acggcgcggg cgggctgacc gccatcgcac 2076961 gccgggtgca tgcccacgcc gaggctatcg ccggtgcact gggcgatgcg ttggtgcacg 2077021 acaagtactt cgacacggtg ttggcccggg tgcccggtcg tgccgacgag gtgctggcca 2077081 gggccaaggc caacggcatc aacctgtggc gtgtcgacgc cgaccatgtg tcggtagcct 2077141 gcgacgaagc caccactgac acccacgtgg cggtcgttct ggacgcgttc ggtgtagcgg 2077201 ccgccgcacc cgcccatacg gacatcgcaa cgcgcacatc ggagttcctg acgcatccag 2077261 cgttcacgca ataccgcacc gagacgtcga tgatgcggta cttgcgtgcg ctggcggata 2077321 aggatattgc cctcgaccgc agcatgattc cgctcggctc gtgcacgatg aaactcaacg 2077381 ccgccgccga gatggagtcg attacctggc ctgaattcgg gcgtcagcat ccatttgccc 2077441 cggcatctga taccgctggg ctgcgtcaac ttgttgccga cctacagagt tggctggtgc 2077501 tgatcaccgg ttatgacgcg gtgtcgctgc aacctaacgc gggctcgcaa ggcgagtatg 2077561 cgggcctatt ggcgatccac gagtaccacg ccagccgggg tgaaccgcat cgcgacatct 2077621 gcctgatccc gtccagcgcg cacggcacca atgccgcgtc agccgccttg gccggcatgc 2077681 gcgtggtggt ggtggactgc cacgacaacg gcgacgtcga cctcgatgac ctgcgcgcta 2077741 aggtcgggga gcatgccgag cggttgtcgg cgctaatgat cacctacccg tccactcacg 2077801 gcgtgtacga acacgacatc gccgagatct gcgctgccgt gcacgacgcg ggcggccagg 2077861 tatacgtcga cggagccaac ctcaacgccc tggtcggcct ggcccggccg ggcaagttcg 2077921 gcggtgacgt cagtcacctc aacctacaca agacattctg cattccgcac ggcggcggtg 2077981 gcccaggcgt cggcccggtg gcggtgcggg cgcacctggc accgtttctg ccaggtcacc 2078041 ccttcgcccc cgagctgccc aagggctatc cggtgtcgtc ggcaccatat gggtcggctt 2078101 cgattcttcc gatcacctgg gcatacatcc ggatgatggg ggctgaggga ctgcgggcgg 2078161 catcgctgac agcgatcacg tcggctaact acattgcgcg ccgccttgac gagtattacc 2078221 cggtgctgta caccggcgag aacggcatgg tcgcccacga gtgcatcctg gacttgcgcg 2078281 gtatcactaa gttgaccggt atcaccgtcg acgatgtcgc aaaacggctg gcagactatg 2078341 gttttcacgc accaacgatg agttttccgg tggccggtac gctcatggtg gagcccaccg 2078401 agagcgagag cctggccgaa gtggacgcct tctgcgaggc catgatcggc atccgcgccg 2078461 agatcgacaa agtcggggcc ggggagtggc ctgtcgacga caatccgctg cgcggcgcac 2078521 cgcacaccgc gcagtgcctg ctggcgtctg attgggacca cccgtatacg cgggaacagg 2078581 ccgcctaccc gctcggcacc gcattccgac ccaaggtttg gcccgcggta cgtcgcatcg 2078641 acggcgccta cggggatcgc aacctggtct gctcatgccc gccggtagag gcttttgcct 2078701 aaacgctcgt cgaccggccc ccggtcgagc tcgaggcccg ggtgctactg ggtgggtagc 2078761 tgacgtgtcg gctgctatgg gtcgttgtcg gggttgcgga gtttttcggg gtggcggcag 2078821 gtgttggtgc ggggttgacc gtggtcggag gtggggtggg gagctattcg gtgtcgccac 2078881 ccgcgctcca acaatgccag ctgttgcggg gtgctcagcg acaaaggttc agccgaagcg 2078941 ctcaatgatc gcggcggcga tccggtcggg ggcgtcctcc tggatgaagt gtttggcgtt 2079001 gggcagctcc accaggacgt ggtcgggaaa tgtcgcactc agtctgggga taatcgtttt 2079061 cggcctgaat gcgacatcct tcatccccca aatcaacagg gtgggcttgg tgcccagcgt 2079121 ggctggcacc tcccgggcga gccgtgccag caggggacgg gcggccagga tctgtttggg 2079181 catctcggct acgcctcggc gtgccgcggc gttgggctgc accgcccggt agtgcgccat 2079241 caccgcgcta ctcggccggt gctcggttcc cgcgggtatc aagcgctcga caaagaagtt 2079301 gcgccgtaag atcgcgtact gcactggcgg gctggacatc accctgctga aggccttcat 2079361 cgccagcgtg tccgccggcc agaaccacgt gttgcccaac acgacgccgc ggacccggtc 2079421 ggcacgctcg acagcgaccg ccatgctgat cgggccaccc cagtcctgac ccatgctcag 2079481 gtagcggtcc aggcccaggt gatcgacgaa ttcgccgatc acccgcgcgt gctcgtcgat 2079541 ctggtacccg aatcccgagg gacgctccga taacccgaaa cccagataat ccggagccac 2079601 acaacggaaa cggtcccgca gtgcgacgat gatgtcccga tacaggaaac tccacgtcgg 2079661 gttgccgtga cacaacagga tcggcggacc cgtgccctcg tcgacgtagt ggatgcgtcc 2079721 acgcgagctg tcgaaccagc gcgactcgaa cgggtacagc tgcggatccg gcgtgaaatc 2079781 gatgctcatt accctcctcc gatcgcgctc atgatggtat gcccgaaggg tgacatcacc 2079841 gagtgtccgg gagtggcgtg acggtggccg ctggctgccg acggctgtcg gaaaggtgtt 2079901 cgtccggtcg gggccgggcg acacgccaac aatgctcctg ctgcatggct atccgtccag 2079961 ttcgttcgac ttccgggcgg tgattccaca cctgaccggc caggcttggg taacgatgga 2080021 ttttctgggc tttggcttgt ccgacaagcc gcgcccgcac cggtacagcc tgctggagca 2080081 ggcccacctg gtggaaacgg tggtcgccca caccgtgacc ggcgcggtcg tcgtgctggc 2080141 ccacgacatg ggcacgtcgg tgaccaccga gctgctagcc cgtgatttgg acggccggtt 2080201 gccgttcgat ctccgacgtg cggtgctgag caacggcagt gtgatcttgg agcgggccag 2080261 cctgcgtccg atccagaaag tactgcgcag cccgcttggt ccggtcgctg cccggctggt 2080321 cagccgcggt ggcttcacac gagggtttgg ccggatcttc tccccagcgc acccgctgtc 2080381 ggcgcaggag gcccaagccc agtgggagtt gctgtgctac aacgacggca accggatccc 2080441 gcacctgctg atcagctacc tcgacgagcg gatacggcac gcgcagcgct ggcatggcgc 2080501 ggtccgcgat tggcccaaac cgcttgggtt cgtgtgggga ctcgacgatc cggtggcaac 2080561 aaccaacgtg ctcaatggac tacgggaatt gcgccccagc gccgccgtcg tggaactgcc 2080621 agggttgggc cactacccgc aggtcgaggc tcccaaagca tatgccgagg ccgcgctatc 2080681 gctgctcgtc gactagccgg ctacggctgt atcacgggca gatcgatgcg agaggcatgc 2080741 atccggctac ggtagacgcg cacggtcggt gcgcaaccgg gaaggatggc gaagtggctt 2080801 gcgtccgcgc cggcgatggc gatgcggatg cggtgccccg gttggaacag atacgacgtc 2080861 ggcagcaggt cgaatgtcag ccgggcaatc tcgcccggga ctaggggcca cgcgtccccg 2080921 ctcgcgaacg ttcggtaggg gaccacctgg cggtacggcg gcggcccgtc gctgagccgg 2080981 cggtggatgg cgcgtagctg gccctcggtg atgtaggcga cacggccgcg cggatcgacg 2081041 tcttccagat agacgaagaa ggtgccgtcg ctcgacgtcg acgtgataaa cagcgtgacc 2081101 accacatgac cggtcacctc caggggatgg tcgagcggtg cggaggtata ggtcagcagc 2081161 ttggcatcct gggccttgcg gtccgggtag caaacgtgtc caccgatgcc cacttgcgag 2081221 cgccagcgtg agcgctcgcc cgttccggcc gtctgatcca ccacgtattc gtctgcaccg 2081281 ctgtcgcaat cgggtgcgtc cgggcgcagc tgtcggtctg cggacaggta gtagctctgc 2081341 gtggtggcgg gcggcggcca ggtgtcggcc gacttccagc ggttctcgac catggtgaag 2081401 tagtgcaccg gcggctcgga gccgatgccc gtatcggccc ccttgacgtg atggtcgatg 2081461 aacctcaaca gctcgccgtc gtgatcgaag tcgggtctgc tgagcccgcg cagtgggtcg 2081521 acgcgccagc cgccggtgtg gttccatgga ccgaggatca agtggctgcc cggggtggag 2081581 acggtcagaa aacgtttgat tgcggcatgc gcatacccgc cgtcgaacca gccgctgtag 2081641 ctgtagatgg ccgctcccga cgcctgcacg tcacgccaat aattgtgcgg gctgatcagg 2081701 ttgatgctgc ccgactcgat cggtgtaccg atcggctcga gccgggcgtc aggttggcca 2081761 cgatagggat ccgaggcgga tacgtcgtcc cggaacgtca atgaccccgc gatctggtga 2081821 acgtcgtagt tgccgcgatg cgcggcgatg gccccgtccc gcagcgagcg atcacggtcc 2081881 tcctgcaccg gctgcatgcc ggtcaccggg agcttcgccc accacccgac cacttcgtgc 2081941 agggcgttgc ggtcgagcgc ctcgttgtag cgtccccagg tgtcggtgaa ccaggcggcg 2082001 tggatgccgc cggggaacgc gatgtcggtg tagacgtcga acagcgagaa gcacggggcg 2082061 atcacccgca ccgcgggatg ctggttgacc agcagtaact cggccgacgt gccgtcgtac 2082121 gaatttccca gcgcagcgac cgttccgttg caccaaggct ggcgcacgat ccagtcgacg 2082181 atctcggcgc cgtcccggat ctcgtcggag gaccattcgc acacgcgggc gccgaacgac 2082241 gcgcccgatc cgcgcacatc cacatcgacc caggcgtagc cgctggcgac gaaacgtctc 2082301 cgacgacgct tatctgcggc gatgtgctgg aggggcttgc ccccgagcaa catccgcaac 2082361 ggccagcgca actgcagcga ccggtagtag cgggtctgat gcaggatcgc gggcagcctt 2082421 gcggcactcg tcaggcccgc gggcaggtag aggtcgatgg cgatgcgcac cccgtcgcgc 2082481 atcgtcacat agcacgagga gtagcgcatc ccacgatatc tcgggtaggc ggatcgttgg 2082541 tccggcgcgg agtaccaggc cgcatccgag ccgccgcgtc tggtcatcgg gtagccaggc 2082601 gatcagctca agaagatgtt gaccgcggtt gccaggtcgg gggatgccga tgtctccagg 2082661 ttttggtagc tgccgccgct gagctgtgcg acggcttccc aggttgcccg atcgggatca 2082721 gcaccgaagt cgatgatgtt gaccgcgatc ggcttggccg ggtctgcgct cttgcggatg 2082781 aaatcctgca ggcccggccc gtcgagggtt tggtccgtat gcggccccgc ggtaataacc 2082841 agcacagaat tagcctggcc aacacggtaa ttggctagca tctcctgata gatcaagcgc 2082901 agagtggtga acgacaccgc gccaccgccc gaggagtatt gcttgcccaa cgcggccgtc 2082961 aaggccgcgg ggcggggctg gccgttgacc gggtcggcca atggcccggc cggcacctct 2083021 gttcggccct cgcggccgtc gaatgtccac agtccgacga ccgaactggg cggcatcgcc 2083081 ttgatccggt tctcaagcgc cgcaacgaca ttgctaagcc ggctattgcc gccttcatca 2083141 ttgggcatcg attggtcgag catgatggtc gcggccactc cggccgacgc ggtgaccatg 2083201 gtgtccgcca gggtcgcgcg catggagtcg tcacccaccg acaaagtcga aggcagcgct 2083261 gggaaactgg tgacggggct gctcggcggt ttgacgtcgc tgactcggaa accagctctg 2083321 gccagtttgg ccagttgctc gggcttgtgc aaatacctgg caaacgcgct ggccgccgac 2083381 gtttgctcct gcgatagcca tgcaccactg agcagcaccg tcggatagtc agcgaccgca 2083441 gccggccccg gcggcagcca ggaacccaag gtgttctcgg catctgaaag tgactggccg 2083501 cgctggaaca actgttgttc ggtggtgacc accgcgtgca cgggtgccgt ggcgacatcg 2083561 ccgggcttga gcagcgtgtc catcgccgcg gtcaaggagt cgtcggcgag cttaggtcgt 2083621 gcgcccatca gggtgcgcac cgcgccgata cccgctgttg ctggcgcgcc agcaggtgct 2083681 gacgcggcag ccaccgcctc gccggccaaa tacgcggcat cgccgttgcc actgctcggc 2083741 attgccagcc gcagtgatcc ccaggcaggc aagtccaagc cggacaacga gttcggattg 2083801 gtttgcaggc cgggcaacgc cgcccagttc tggttggcga gggcctgctg caattcgggc 2083861 cgcacggcga gcaacaccgg cgatatcacc agtgagcggc tatcgctaat ggcttggctg 2083921 cccgcggccc cggtaagccg cgccgccgag atggagctac tcggaatcca caatcccggc 2083981 tggccgccca gttcggtcgg ccatttgccg atgaaaccat tgatgacggc atcggagccg 2084041 gccgaggtga cagccactgc cacacaacgg tcgccgaccg ggcccgccga cgcgttgtag 2084101 ctgtcggctg actcctttac ctgatcggcg attgatgggt cggctataac agcgacggtg 2084161 tccttgccgc ccacgcagcg ggcggcagcc gtatgcgagc ggttggacaa cgcgtcaccg 2084221 aagaagcgcc acaagatcac cccggccacc attaccacca ctgcgacaag ggccacgatc 2084281 acgccgatac tgactccccg ccgcccgtcc gcgctacggt gcccggcctg ccagtcgccg 2084341 ggcccccgat gtccgaagcg aaacagcggc ggcggggcgg ccgctatggg ctcggcaccc 2084401 gttggctccc agtcggggcg gggtggaatg tcagggtagt cttctgagcc gctagccgag 2084461 tagccgccga cggcggagta gtggccctcg ctggataacg ggccgtcatc gggctgatct 2084521 acaccgggat agtcgtagct acccgatatg tcctcccagt gctgttgttc cgccgcatgc 2084581 ccgtcggaca ggtcgtcaac ggaatcctcg gggtcgggct tgctgtgcct acccataccg 2084641 gcgtctgcgt cctctccgtc gaaggccggc gcctgtcaag cacgagctac gcaccggctc 2084701 tgcccgatgg ggccggctct ctcccgcaag cgggcggtgc ccccacagcg gcccgctagc 2084761 gggccgcatc gtcaccggcc ctgtccgatg gggccggctt ctcagcggcc cgggccttaa 2084821 actcccgacg acgtcggtgc aggatcggct cggtgtagcc gttgggctgc tgggccccgg 2084881 acaagatcag ctcctgcgcg gccaggaagg cgatactgtc gtcgaagttg ggtgccatcg 2084941 gtcggtatgc cacgtcgccc gcgttttgtc gatcgaccaa cggcgccatc cgctccaagc 2085001 tggcccgcac atccgcgctg gtgatcacac cgtggcgcag ccagttggcc aacaattggc 2085061 tggagattcg cagcgtggcc cggtcctcca tgagcgcgac gtcgtggatg tcgggcacct 2085121 tcgagcagcc gacaccttga tcaacccagc gaaccacgta gccgaggatg gattgacagt 2085181 tgttgtcgac ctcttcgcgg atctcgtcgg gagcccaggc caattccttg gccagcggaa 2085241 tggtcagcaa ttgttcgatg gtggcgcgac gcttccccgc cagtccttgt tgcaccgcgg 2085301 cgacgtcgac ctggtggtag tgcagcgcat gcagggtggc cgcagtggga gagggaaccc 2085361 aggcggtgct ggccccggcg cgcggctggg cgatttttgt ctcgaccatg tcggccatca 2085421 gctcggtcat tgtccacatg cccttgccga cctgggctcg gccgctgaac ccggcggcca 2085481 ggccggcatc gacgttgtgg tcctcgtagg ccaagatcca cggctggctc ttcatggtgc 2085541 ccttgcgcac catcgggccg gcctccatcg aggtgtggat ttcatcgccg gtgcggtcca 2085601 ggaacccggt gttgatgaac accacgcggt ccgcggcagc tttgatgcac gccttgaggt 2085661 tgaccgtggt ccggcgttcc tcgtccatga tgccgatctt catggtgttt tgcggcaacc 2085721 ccagcacatc ttcaacccgg ctgaacagtt cgcaggtaaa cgccacctcg gccggaccgt 2085781 gcatcttcgg cttgacgatg tagatggagc cggtgcggct gttgatcagc ggcccgttga 2085841 cgtcgctggc ctttagcccg tggatggcga tcaggccggt gaatagggca tccatgatgc 2085901 cttcgaacac ctcgctgccg tcagtgtcga cgatggcgtc attcgtcatc aagtgaccga 2085961 cgttgcggac gaacatgagg ctgcgtccag gcagcgtgaa ctggccaccg ccgggtgcgg 2086021 tgtagttccg gtccctattg agcacccgca ggaaagcggt gccgtccttg tctaccgctg 2086081 ctgccaggtc gcccttgttc aggccgagcc agttccgata acccagcacc ttgtcggcgg 2086141 cgtccacggc ggccaccgag tcctcgaagt ccatgatcgt ggtgatcgcg gattccagga 2086201 tcacgtcctt gacgccggcc cggtcggtgg tgccgacctg cgactccgga tcgatcagga 2086261 tctcgatgtg caaaccgtga ttgattagca gcaccgatgt cggcgactcg gctgcgccgg 2086321 tgtagccggc gaactggccg gggttggcca ggccggtgga cttatccggc aaggcaacca 2086381 cgagctggcc atcctgcact gtgaaaccgg tggcgtcgcc aaaggaaccc gacgacagcg 2086441 gaacactgtc gtcgaggaac ttgcgggcat acgcgatcac cttgtcgcca cgaaccttgt 2086501 tgtacgtggg gcctttttcg gcgccgtcgg tctcggggat gacatcggtg ccatacaagg 2086561 cgtcgtagag ggagccccag cgagcgttgg ccgcgttcag agcaaaccgc gcgttgagca 2086621 ccggcaccac cagctggggg ccggcggtcg tggtgatctc agcgtcgaca ccggacgtgg 2086681 tgatggtgaa gtcatcaggt tcgggaagca ggtagccgat ctcggtgagg aactggcggt 2086741 aggcatccat gtcgatgggc tcgatcaccc gacgccggtg ccacttgtcg atctgcgcct 2086801 gcagctcgtc gcgggcgttc aacagagctt ggttctgcgg ggtcaggtcg gcgacgacct 2086861 tgtcgacgcc cgcccagaag ctgtccgggt cgatatcggt gccaggcagg gcttcattgt 2086921 tcacgaagtc gtagagcacc cgagcgatgc gcaagttgcc caccgacacg cgatctgtca 2086981 ttgcttcctc ccttactggc aattgctcag cctaccggcc gacaagacga ctactacatc 2087041 cggcgacccg caaccgcagg tcacgtcaag ctctgtcagc acctcggcac ccggcatgct 2087101 cgctggctgg caacgcgacg cagtggccgc agcgatcata cgggtggggc ggtctgccta 2087161 ctacaatccc gttggatccg ttctggccgg acagcatccc gccgggagcg gctccggcca 2087221 cgtcggtgcc gctcattgcg gcggtgtgat tccgaatcag gccagacgct tgatccccgg 2087281 ataggagtcg aacccacggt cgaagctcat cagccgggta atgtcgtggt gagccatgac 2087341 ggcgatgtgt agtgcatccc tggccgacaa cgtttgatag cgcaacaggg catccctcgc 2087401 gtgttcgaca tcggtgcgct cgatcggcag cacttcgtcg accacgccga taattgcatc 2087461 gaaagccggc tgaatcgcct cacggcgttt gattgccaca taccggtggc atatctcctg 2087521 cagcacctcg gcgtcggtga ctaggcgttc accgcccgac agcgccgact ccagcagacg 2087581 ttgcgcgtcc agcttatgcg ggtgcgaggc acccaccaga tacatgggaa tgttggagtc 2087641 aacgaggatc accgtgatcc ttcgcgctcc gcaccgcgtc cgcgttcgat ttcctcgagc 2087701 atctgctcga cgtcggctgt cgggaactca tggcgtgcgg cggcacggac agatcgcagc 2087761 ttcatgtcta gatcgccgcg cggttctcgc tcccgcgcct cccgcagcgt ccggcggacc 2087821 cactcggaca ctgtcgtgcg gtgccggcgt gcaatctctc ggagttcttc ccactcgtcg 2087881 gggtccagca gaacctgcag gcgcttactc atagcatgag tgtatacagc tcatacgggt 2087941 gtatgaatcc agctcgcctg cgcgcgggag ctatcccccg ggggacccgt tctggccggc 2088001 cagcgttccg cccgtaccgc cgctgccgcc cggcccgcca gagtcgccgg gcccaccgtc 2088061 gccgccgtcg ccgatcaact gggcgtagcc gccgttaccg ccggtgccgg gggtgccgtc 2088121 cccgctgacg ccggcggcgc cgccattgcc accgttgccg atcaacccgg ccgtgccgcc 2088181 gttacccccg gtgccgccgg cgccggtgac gggtaccgcg actgagggaa tctgggttgg 2088241 cccaccggag ccgccggcgc caccgttgcc gaccagcagc gcgccgttgc ctccgttccc 2088301 gccactgcca ccgccggggg cgaagacgcc ggtaccggcg gacccggcgc ctccggcgcc 2088361 accgttgccg atcagcccga cggcgttgcc gccgtcgccg ccgtggcctc ggagaaagcc 2088421 gtcggtttgc acgctgttgc cgccgtcgcc gccgttgccg atcagcgtcc cgccggtgcc 2088481 gccgtcgccg ccgttgccat cgaagaagct gaacccgccg ttaccgccgt cgccgaacat 2088541 cccggcgtta ccgccggtac caccgtcgct gaaaccttgc agggtgctgc ctccggaacc 2088601 gccgtcgccg tacagccacc cgccgttgcc gccgttgccg ccgttgccga tgccggtggg 2088661 ggcggcaccg acggagccgc cgtcgccccc gttgccgatc agccgggcgt caccgcccgc 2088721 tccgccgtcg gcggctcccg atgggacatt gccgccgttg ccgccgttgc cgtacagcag 2088781 tccgccggtg ccgccggtgc cgccggcccc gcccgcgaag ccggccttgc cgagcccgcc 2088841 ggccccgccg gccccgccat ggccgaacag cccgacggcg gctccaccgg gcccgccgat 2088901 cccaccggta gcgccgacgg gtccggtacc gagtccgccg gcaccgccgt tgccgccgtc 2088961 gccgaagagc agtccgccga ccccgccggc accaccggca aggccggtcg ccccgggccc 2089021 gccgatgcca ccgttgccgc cgttgccgat caaccctgca tccccaccgg cgccgccggg 2089081 ctggccgatc ccgccgttgc cgccgttgcc gccgttgccg tacagcaacc cgccgggccc 2089141 gccgggctgt cccggcgcgc cattggcgcc atcaccgatc aacgggcggc cgaacaacgc 2089201 ctgaaacggc ccgttgagca cgtccagcgc ggcggcgtcc gccgcctcgg caccggcata 2089261 ggcccccgcg ccggcggtca tggcatgcac aaactgctcg tgaaacagcg ccgcctgcgc 2089321 actcaatgcc tgataggcct gggcgtgcgc gccgaacaaa tccgcaacag ccgccgacac 2089381 ctcatcggcg cccgcggcca gcacccccat cgtgggcacg gccgcagccg cattcgccgc 2089441 gccgatcgcc gacccgatgc cggccaaatc cgacgccgcc gccaccacca cttctggggc 2089501 cgccaccaca aacgacatga cgcgctcctc acgggaccgg gtgcgcagtc ccagcggtta 2089561 cagcgtattg acgtcccgcc accacgtccg gcgttcgggc caactgatcc gaaacgattg 2089621 tcagcggcag cagcccccga ttacgctcgg tgtcccgtca gacaccgatc cctgcgtcag 2089681 tcaacgatgc gtcccgtcgc gcatggtgcc aaccaggtcc tccaccacgt cctccagcgc 2089741 caccatcccc acgacagaac cgttgtcggc ggttaccaag gccagatggc tgttgatgcg 2089801 ccgcatccgc gacagggcgt cggccagcgg caacgattgg ggaacccgcg gcagcgggcg 2089861 cacaacggcc agatcgatca cggtttgcgg attgtcaccg agggtcagca cgtccttgat 2089921 gtgcagatat ccgatgaacc ttccaccgcg atccaccacc ggaaagcggg agtagccggt 2089981 ttgcgccaag gcctgttcga ccccgccgat ggtgggcccg gaccctaccg ccgacacctg 2090041 cactgcccga atgttgacca gcggcaccgc gacatcggca accaggcgag ttcgaatccg 2090101 aagggctcgg gttagccgcg tgtgctcctc gtgatccagc aggccttcgg atagcgattc 2090161 ggcgatcatc tcggacagtt ccgcagtgga gacggcgatg tcgagttcat ccttcggctg 2090221 caccccaacc agccgcagta tcgcgttggc gcagttgttg tagaacgcga tgaacggccg 2090281 ggcgaggcgc acgtagacca ggtacggcgg gaccagcaac atcgctgttc gctccggacc 2090341 agccaaagcg atgttcttcg gcaccatctc accgagcagg acatgcagcg ccaccacgat 2090401 cgccaacgac aaggtgtgca gcagcgccgg cggtacaccg ctcagcccga acgacagctg 2090461 tagcagcttg acgactgccg gttcgccgac ccggccaagc aggatcgagg acaccgtaac 2090521 ccccagctgt gcgccggtca gcatcgccgg gagctgttcg cccgcccgga tcacggtgac 2090581 ggcagtggcc ttgccctgct cggccagcgc ttcgaggcgg tcacgacgcg ccgagatcaa 2090641 cgcgaattcc gcgcccacga agaacgcgtt ggcgccgatc agcaaaagcg ccagcaacac 2090701 cgcggacagc acatccatca gcggccccgc cccgacccgg ggtcggcatg gccgcccatt 2090761 ttgatcaact ccaacaagtc gatccggcgc ccgtccatct ggatcacggt ggctaaccac 2090821 cgcatcgagt cgtcgggaag tccgtcctgg tccaaggcag tcagctcgac cgtttcgccg 2090881 gccaccggga tgtggccgag ctctcgaagc accaacccgc cgatcgtctc gtacggaccg 2090941 tcgggggctc gatagccggt ggcgctggcc acctcgtcga tgcgtagcag acccgagacc 2091001 cgccatccgt tgccggctgc caccacatcc ggtgtcgcat cgtcgtgttc gtcgcggacg 2091061 tcgcccacga tctcttcgat caagtcctcc agggttacca tgcccgcggt gccgccgtac 2091121 tcgtccacaa ccatggcggt ctgtagcgca ctggcgcgga cctgcgccat caccgcatcg 2091181 ccgtcgagcg tcgagggcac caccgcgacc ggctcggcga ccgtcgttag cagcgtgtgc 2091241 gcgcgatcgc cgggcggaac ctcgaacacc tgcttgacgt gcacgatgcc gacggtcgca 2091301 tcgagatctc cctcgaccac cgggaagcgc gagaatcccg atgcggccgc ggccgcaacc 2091361 aggtcggcga tggtgtcatc ggtctgcagc gccacgatct tcgaccgtgg cgtcatcagc 2091421 tcctcggccg tcagggcgcc gaactgcagc gagcggcgca tcagccacgc cgtggcgtca 2091481 tcgagtgcgc cgctgcgcgc ggaactacgc accaacgaca ccagctcctg cggtgtgcga 2091541 gctgagcgca gctcctcggc cggctcgatg ccaagtcgac gcacgatcca gttcgccgct 2091601 ccgttcgtga gacggatggc cggggtgagc agcagtgaga acagcacctg gccggccacg 2091661 actgagcgcg cggtgcgcag cgggcgcgcc accgcgagat acttggggac cagctcgccg 2091721 aagaccatcg acagcgatgt cacgatcacc agggcaaaaa acgtgataag accgtcggcc 2091781 acccgatcag acattccgac tgcgaccagc ccaggatgcg gtagctcggc caccagcggt 2091841 tcggtcaggt agccggtagc caaggtggtg atcgagatac ccaactgagc acccgaaagc 2091901 tggaacgaca gccggtggtg tgcgcgctgg atgaagcggt cccgactggt gccgccgcgg 2091961 gcgttggcct ccacggtgct gcggtccagc gcggtcagcg agaattcggc cgcgacgaac 2092021 acccccgtgc ctgcggtgag cgccaagatc gccaggatgg tggcgacggt atcggtgagg 2092081 ttcacgggcg gctcggtcgt cgcgctatat cgggccgagc cagtaccggc cgctcgcctg 2092141 gaaaaccgac ggtgtggacg ggtgcccgcg gcacgtcatc cctttcgctc gcaaccgcgc 2092201 agcgcgatac tgcgggtttg aagtacacat cgtagcgaga tagctcgtgg cgccagcttc 2092261 accagccggc gggcagcgga tggccctcgg caaagcccgc tcccgactgc acgcccacca 2092321 cggcccgctc atgcagctcc gccaggttcg aggcacccac ataggtgcag gtgctgcgca 2092381 cgccagaagt gatgtggtca attaggtcct ccacacctcc gcggtcgggg tcaaggccca 2092441 tccgcgacgt cgagatgcct tcctcgaaca acgccttacg agctcggtcg aacgggttgt 2092501 ccgcgccggt ccgggccacc accgcccgct tggatgccat gccgtagctc tccttgtacg 2092561 gctgatcgtc gcggtcacgc atcaggtctc cgggggattc gtaggtgccg gcgaaccacg 2092621 atccgatcat cacgttcgag gcgccggcgg ccagcgccag agccacgtcg cgtggatgcc 2092681 ggatcccgcc gtcggcccag atatgaccac cgagctgcct tgccgcagaa gcgcattcga 2092741 gcacagcgga gaactgcggg cggccgacac cggtcatcat tcgggtggtg cacatggcgc 2092801 cggggccgac accgaccttg acgacgttcg ccccggcttt cagcagatcc cgggtgccct 2092861 ccgccgacac cacgtttccc gccgccagcg gcaaacccaa gtccagtgcc gagaccgcct 2092921 tgatcgcgtc caaggtcttg acctggtgtc cgtgtgcggt gtcgatgacc agcacgtcga 2092981 cgccggcttc ggcgagcgct cgggccttag cgcccacgtc gccgttgatg ccgacggccg 2093041 cgccgatccg cagccggccc gcgctatcgg tggccggggt gtagataccg gcgcggatag 2093101 ccccggtgcg gcttagcact cccgccaacg tgccgtcggc gtcggtcagc accgcaacgt 2093161 cgaccggggc gtgctccagc aggtcgaaga tcttgcgtgg ctcggttccc gctggagcgg 2093221 tcacatagtc cgtcacggcg atatcgcgca cccgggtgaa gcgatccacg cccaggcagg 2093281 acgattcgcg caccaatccg atcgggcgac cctcgaggat gaccaccgcg acgccatgtg 2093341 cgcgcttgtg gatgagcgcc atggcgtcgg acaccgaatc gtcgggtgcc agcgtcactg 2093401 gggtgtcgag caccaggtcc cggcttttga cgaacgccac cgtctgcttt accgccggga 2093461 tcggcagatc ctgcggcagg attacgatgc caccgcggcg ggcgaccgtc tcggccatcc 2093521 gccgcccggc taccgcggtc atattggcga ccactaccgg aatggtggtg cccgagccgt 2093581 cggcggtgga caaatcgacg tcgaagcgcg acgcgacctc ggatcggttc ggaacgatga 2093641 acacgtcgtt gtatgtcagg tcgtacccgg gtgggtgccc gtctagaaat ctcatcactt 2093701 acccctttac ccccttttag ttctagcccg ctacaccggt acttcggtgc ggtctgaact 2093761 ccatagtgtg tggaacttgc ctggttcgtc gatccggccg taggtgtgtg cgccgaagaa 2093821 gtcgcgctgg gcctgggtga gtgcagcggg cagccgcgcg gtgcgcagcg cgtcgtaata 2093881 cgacagggcc gacgagaatc ccggggtcgg gatacccagt tgggccgccg tcgacaccac 2093941 acgccgccaa ctgtcgatcg ccgattcgac ggcgccgcgg aaatacgggg ccacaatcag 2094001 actggccagg ttcgggctgg cgtcaaaggc ttccttgatg tggttgagga acttcgcccg 2094061 gatgatgcag ccgccacgcc agatggtggc caggtcgccc ggcgtgatgt cccagccgaa 2094121 ttcggcgctg ccggcctgga tctggttgaa gccctgagcg taggccacga tcttggaggc 2094181 gtacaacgcc tggcggacgt cttcggtgaa cgtggcgggg tcggcgggct gctcgccgag 2094241 cttgcccgaa gccagaccgc tggcggccga gcgttgcccc acggatcccg agagagcgcg 2094301 ggcaaacacc gcttcggcga tgccggtcac cggcacaccc aggtccagcg cggacttgac 2094361 ggtccaacgg ccggtgcctt tctgctcggc ccggtccacg atgacgtcga cgagcggttt 2094421 gccggtcttg gcatcggtct gccgcagcac ctcggcggtg atctcgacca ggtagctgtc 2094481 cagatcgcca ttgttccact cggtgaacac atcggcgatc gccggcgcgg tcagacctag 2094541 cccgtcgcgc atcagctggt aggcctcacc gatgagctgc atgtcggagt actcgatgcc 2094601 gttgtggacc atcttgacga agtgcccgga gccgtccggg ccaatgtggg tgcagcacgg 2094661 cacgccgtcg acatgcgcgg agatctcctc gagcagcgga cccagcgatt ggtatgactc 2094721 ggcgggtccg ccgggcatga tcgacggccc gttcaacgcg ccctcttcgc cgccggagat 2094781 cccggccccg acgaagtgca agccccgctc acgcatcgct ttctcgcggc gcatggtgtc 2094841 ggtgtacaac gcattgccgc cgtcgatgat gatgtcgccg ggttccatgg cgtcagcaag 2094901 ttcgttgatg acagcgtcag cgtcagtggc ctctccggcc ttgaccatga tcagcacccg 2094961 acgcggtttt tccagtgcgg caagaaattc ggggatcgtt tcactgcgca cgaacttgcc 2095021 gtctgagctg tgctccttaa gcagcgcgtc ggtcttggcg accgaccgat tgtgcactgc 2095081 cacggtgtag ccgtgccggg cgaagtttcg ggcgatgttg gaacccatca cggccaggcc 2095141 agtgacgccg atctgcgcga tgccggctgg cgattccgac gaactcatgt cctgcctttc 2095201 agttgggccc ggcttcgcta ggcgatgaac agccgctgca gctgcgtgag ccacggtacg 2095261 gccagcgcga cggtgggcac caccaggacg gcggccgcag ctagatatgc ggccgcggac 2095321 agaaccgcgc tatttccacg ccccgacagc cggcgcacgc ggagcaccgt gctgggacct 2095381 ccgacggcca acgcacccga cggcgcccgc ccggacgcac aggcgaccaa tgcccgagcc 2095441 aggggagtgc gcccggcggc gcgcaccgcg gcgtcatcgg ccaggagctc gacgagtagc 2095501 tgcaccgccc ccagcgcatt ggcgctgcgg accaaccgcg ggaaagccgc gtgcaccgcg 2095561 gtaaacgcct ccaggacaag atcgtggcgg gcgcgtagat gagcccgctc atgggtaagg 2095621 atcgccgcga cctcggcgtc ggcgagcgcg gtcagtgtgc cttcgctgac cacaacccgg 2095681 ctacgcacac cgggcagaca gtaggcaagg ggctgcgcga cgtccaagac ccgaaggtcg 2095741 cgggcccgcg cgcacggctg ggcaagcgcg ccattgtgtc cgaccccgac gagatcgacc 2095801 accatgcggt ggtgtgcccg tcgtcgtcgc gtggcggtgg cgacgcgcac cacggcgacc 2095861 gccagccggg caccgaccag cacagtcaac gcaaagacgg tgatgtaggc cgcccacagc 2095921 ggccagccga ggcggccggc cgcgccgacg aagctggtcg tagggcgtcc gtcgggaccg 2095981 ggcatgagca gcctgctagc gatcgcgatt ccggcgctga acgacgacag caccgcggcc 2096041 agggcaatcg cctgccacag caccatggcg gcgcgcggtg cgcgcagtgg ccacgttgcc 2096101 cgggctagca gggctggggt cgggccagcc agcagcaccg cgaggatggt gaaggccagc 2096161 gcggacacgc cgttagtctc cctcaagtct ccgttgccgc gccagccggt ggccgattgc 2096221 catgaccggc ttccaattcg gcgagcgcac gtcgtagcgc atccgcctcg tcggcaccga 2096281 ctcgctcgac gaagtgcacc agcgcggctt gcctgctgcc ggagtcctcg gcctgagcca 2096341 atgcatcgac catcagcccg gcgaccaatt cgtcgcggcc gtgcacggga gcgtagcggt 2096401 gggctcgatc gtcgcggatc tgcagcacga ggttcttctt tgccaaccgt tgcagcacgg 2096461 tcatcaccgt cgtgtaggca aggtcgcggc gcgccgacaa cgcttcgtgg acttggcgaa 2096521 cggtttgggg ttccgtcctg gaccacaaat ggtccatgac cgcgcgttcc aaatccccca 2096581 accgtgtcag cttggccatt gttcgttcat ctcctgcggg ttgaaaccag cgtactccgg 2096641 cttactactc gctgtcgtat ccaaaccggc gggcggccgt accgggccta tgcacccggc 2096701 tcgcaaacat tacacgctaa cgcttgctaa attagggcag ccttgcctat cattacttcg 2096761 tcgagccaca acgaccgcgg ccgagtcctg agggctgcag tgacccccgg tcgactcgat 2096821 cggcgagccc cgtgccttgg tgcacggggc tcgcccgttg gtgtagacac aaggacgtgc 2096881 agccatcgcc ggactcaccc gctccgctga atgtcaccgt gccgttcgac agcgagttgg 2096941 gtttgcaatt caccgaactg ggtcccgacg gggcccgagc gcagctcgac gtccggccca 2097001 agttgttgca gctgacgggc gtcgtgcacg gcggtgtcta ctgcgcgatg atcgagagca 2097061 tcgccagcat ggcagccttt gcctggctca attcgcacgg cgaaggcggg agtgtggtcg 2097121 gcgttaacaa taatacggat ttcgtgcgct ccatcagctc agggatggtg tatggcaccg 2097181 ccgaaccgct gcatcggggt cggcggcaac agctgtggct ggtcaccatc accgacgaca 2097241 ccgaccgggt ggtcgcccgc ggccaagtgc ggctgcagaa cctcgaggcg cggccttaac 2097301 ccgctcgaaa ccgttgaacc tgccgcggcg tggcaggatc gcagagcatg cgcctgacgc 2097361 cgcacgaaca ggagcgtttg ctgttgtcct acgccgccga gttggcccgc cggcgtcggg 2097421 cccgcggcct gcgcctcaat catccggaag ccatcgcggt gatcgccgac cacatcctgg 2097481 aaggcgcgcg tgacggccgc accgtcgcag agttgatggc atccgggcgt gaggtgctcg 2097541 gccgtgacga tgtgatggag ggagtgccgg agatgctcgc cgaggtacag gtggaggcga 2097601 cgtttccgga cggcaccaag ttggtcaccg tgcatcagcc gatcgcatga ttcccggaga 2097661 aatcttttac ggcagtggtg atatcgagat gaacgccgcg gcactctccc gcctgcagat 2097721 gcggatcatc aacgccggcg atcgtccggt gcaggtcggt agccacgtcc atctcccgca 2097781 ggccaatcgg gcgctgtcat tcgaccgtgc gacggcccac ggctaccgtc tggacatccc 2097841 ggcggcgaca gcggtgcgct tcgagccggg cattccccaa atcgtcgggt tggttccgtt 2097901 gggcggacgg cgcgaggtac ccggtctgac gctaaatccg cccggacggt tggaccgctg 2097961 atggcgcgac tgtcaaggga gcgctacgca cagctgtacg gacctaccac cggcgaccgg 2098021 atacggctgg ccgacaccaa cctgctggtt gaggtcaccg aagaccggtg tgggggaccg 2098081 ggactggccg gtgacgaggc ggtgttcggc ggcggcaagg tgctgcgcga gtccatgggc 2098141 cagggccgtg cgagccgggc cgacggtgcc cccgacaccg tgatcaccgg tgcggtgatc 2098201 atcgactact ggggaatcat caaggccgac atcgggattc gcgatggccg catcgtcggg 2098261 atcggaaagg ccggcaatcc cgacatcatg acaggtgtgc atcgggatct cgtcgtcggg 2098321 ccgtccaccg aaatcatcag cggcaaccgt cgaatcgtca ccgcaggcac cgtcgactgt 2098381 cacgtgcact tgatctgtcc gcagatcatc gtcgaagcct tggccgcggg caccaccacg 2098441 atcatcggcg gtggcaccgg acccgccgag ggcaccaagg ccaccacagt cactcccggc 2098501 gagtggcacc tggcccggat gctggagtca ctggacggtt ggccggtgaa cttcgcgctg 2098561 ctcggcaagg gaaacaccgt gaatcccgac gcactgtggg aacagttgcg cggtggcgca 2098621 tcgggtttca aactccacga agactgggga tcgaccccgg cggccatcga cacctgcttg 2098681 gcggtcgccg acgtggccgg ggtgcaggtt gcgctgcact ccgacactct caatgagacc 2098741 ggattcgtcg aggacaccat cggcgcgatc gccggacgtt cgattcacgc ctaccacacc 2098801 gagggcgccg gcggcgggca cgcaccggac atcattaccg tcgcggcgca accgaatgta 2098861 ctgcccagct cgaccaatcc gacccgcccg catacggtga acacccttga cgagcatctc 2098921 gacatgctga tggtgtgcca ccacctcaac ccccggatcc cggaggacct cgcgtttgcc 2098981 gaaagccgga tccgaccgtc caccattgcg gcagaagatg tgttgcacga tatgggggca 2099041 atctcgatga ttggcagcga ttcccaggcg atgggccgtg tcggcgaggt ggtgctgcgc 2099101 acctggcaga ccgcgcacgt gatgaaagcc cgccgcgggg cactggaagg tgacccgtct 2099161 ggtagccaag ccgccgacaa caaccgggtc cgccgctaca tcgccaaata caccatctgc 2099221 ccggccatcg cacacggcat ggatcacctg atcggttcgg tggaggtggg aaagttggcc 2099281 gacctggtgt tgtgggagcc ggcgtttttc ggggttcgcc cgcacgtcgt gctcaaaggt 2099341 ggggcgatcg cctgggcagc gatgggcgat gcgaacgcgt caatcccgac cccgcaaccg 2099401 gtgctcccgc gaccgatgtt cggcgcggcc gcggcaaccg cggcggcgac ctcggtgcac 2099461 ttcgtcgcgc cgcaatccat cgacgcgcgc ctggcggacc ggctcgcggt caatcgggga 2099521 ctagcgccgg tggccgacgt gcgcgcagtg ggcaagaccg acctgccgct caatgatgcc 2099581 ctaccgagca tcgaggtcga tcccgacacc ttcaccgtgc gaatcgacgg ccaggtgtgg 2099641 caaccgcagc cggccgccga actacctatg acacaacggt atttcctgtt ctaatgacct 2099701 cgctggccgt gctgctcacc ctcgccgact cgcggctgcc cacgggtgcg cacgtgcact 2099761 cgggcggcat cgaagaagcc atcgccgccg gcatggtgac cggcctggcc accctggaag 2099821 cgttcctgaa acggcgggtc cgcacccacg gcctgctgac ggcgtccatc gcggccgcgg 2099881 tgcaccgggg cgagctggcc gtcgacgacg ccgaccggga aaccgacgcg cgcacaccgg 2099941 ctcccgcggc cagacacgcc tcacgcagcc agggccgcgg gctgatcagg ctggcacggc 2100001 gggtgtggcc cgattccggc tgggaggaac tgggcccgag gccgcatctg gcggttgtgg 2100061 ccggacgggt cggcgcgctg agcgggctgg cgcccgagca caacgccttg cacctcgtct 2100121 acatcacaat gaccggctcg gccatcgccg cccagcgact gctggcgcta gatcccgccg 2100181 aagtgaccgt ggtgaccttc cagctgtccg aactgtgcga gcagatcgcg caggaggcca 2100241 cagccggact ggcagacttg tctgatccgc tgctggacac gctcgcccag cggcatgacg 2100301 agcgcgtgcg tcccctgttc gtttcctgaa aggtaaggca tggcaacgca ttcccatccc 2100361 cactcgcaca ccgtgcccgc tcggccaagg cgggtccgca aaccgggcga gccactgcgc 2100421 atcggcgtcg gcggcccggt cggctccggc aagaccgcac tggtggcggc gctgtgccgg 2100481 caattgcggg gagagctgtc gctggcggtg ctgaccaacg acatctacac caccgaagac 2100541 gccgacttct tgcgcacaca tgcggtgctg ccagacgacc ggatcgcggc cgtgcagacc 2100601 ggcggctgcc cgcacaccgc gatccgcgac gacatcaccg ccaacctgga tgcgatcgac 2100661 gagttgatgg ccgcccacga cgcgttggac ctgatcctgg tcgaatccgg cggcgataac 2100721 ctcacggcca ccttctcttc ggggctggtg gatgcgcaga tcttcgtcat tgacgttgcc 2100781 ggcggcgaca aggtgccgcg caagggcggg ccgggggtga cctattcgga tttgttggta 2100841 gtcaacaaga ctgacctggc tgcattggtg ggcgccgacc tggcggtgat ggcccgcgat 2100901 gcggacgcgg tgcgcgacgg ccgcccgacg gtgctgcaat cgttgaccga ggacccagct 2100961 gccagcgatg tcgtggcctg ggttcgtagt caactggccg ccgatggagt ctagtgttct 2101021 ggtggtcgcg tcgccgaatc ggttgccgcg catcgactgt cggggcggtg tccaggcacg 2101081 ccgaaccgcg cccgacacgg tgcacctggt gtcggcggcc gcgaccccgc tgggcggtga 2101141 caccatgaga atccgggtga tcgtggaacg gggtgcccag ctacggctgc gtagtgccgc 2101201 cgcgacggtg gccttgcccg gcgtggatac cctgacgtcg catgctcact gggagatcga 2101261 cgtgaccggc accctggatg tggacctgga gccgacggtc gtcgccgcct cagcccggca 2101321 tctgtcgcat gccaccttgc gcctgcacga cgacggtcgg gtccgcttgc gcgagcgcgt 2101381 gcagattggc agatgcaatg agcgcgaagg attttggtcg tcatcgctgc aggccgatcg 2101441 gcatggtcgt cccctgctgc ggcaccgggt ggaactgggt gccgggtctt tggccgacga 2101501 cgtcattgcg gcgccgcgcg ccactatcag cgagctgcgc tatccggcga cggcattcac 2101561 cgacgccatc gacgcacggt cgaccgtttt ggcgttggcg ggtggcggaa cactgagtac 2101621 ctggcaggct gaccggttgc ctggctaacg ctagctggcc accttagcgc ttgccgctga 2101681 gccctgcgcc tcggcggcca gctcggccag ctgttcgagc cgcgttcgcg caaatgcctg 2101741 ctggtcggtg atggtcagct ggccgcggcg agtactgagg aaagtcaccg tccacgacag 2101801 cagagtggtg atcttggtct tgaacccgat caggtacgcc aggtgcagca ccagccaaat 2101861 cagccaggcg ataaagccgc tgaactcaac gggaccgatc ttggccaccg ccgaaaacct 2101921 cgaaaccgtg gccatcgatc ccttgtcgaa gtactggaat ggctcacgct ccgccgggtt 2101981 ggcgccggcc agttcggcct tgatcgtgct ggcgacgtat ttcgccccct ggatggcgcc 2102041 ctgcgccaca cccggcacac cctccacagc ggccatatcg cccaccacga acacgttcgg 2102101 gtacccggga atggacaggt cgggcagcac ttggacccgg ccggcccggt cgagctcaac 2102161 ccgtgattgc tcggcaaggt ccctgcccaa ccgactggcc gaaaccccgg ccgaccagac 2102221 cttgcaggcc gactcgatgc gccggacggt gccgtcggag tccttgacgg tgatgccgtt 2102281 gcggtcgacg tcggtgacca tcgcacccag ctggatttcc acgcccagct tctgcaaccg 2102341 ggcagccgcc cgctgaccga gctttgcgcc catcggtggc agcaccgccg gggcggcgtc 2102401 aagcagaatc acccgcgcct tggtcgagtc gatgtgccgg aatgcgccct tcaacgtgtg 2102461 ctcggccagc tcggcgatct gtccggccat ttcaacaccg gtggggccag ccccgacaac 2102521 ggtgaatgtc agtagcttgg cccgccgttc cggatcgctg gaccgttcgg cttgctcgaa 2102581 agcgctcaat atgcggccac gcaactccaa cgcgtcgtcg atggacttca tgccgggtgc 2102641 gaattcggcg aaatggtcgt tgccgaaata agactggcca gcacccgcgg cgacgatcag 2102701 gctgtcgtag ggggtttggt aggtgtgacc gagcaattcc gagacgacgc actgcccggc 2102761 caggtcgatg tgggtgacgt tgcccaacag tacctggaca ttgcgctgct tacgcagcac 2102821 gacccgggtc ggcggagcga tttctccctc ggagataatc ccggtggcca cttggtacag 2102881 cagcggctgg aacaggtgat gggtggtgcg cgcgatcagc ttgatgtcaa cgtcggcccg 2102941 cttgagcttc tttgccgcgt ttagcccgcc gaacccagat ccgatgatca caactcgatg 2103001 cctacgaggt ggttgcgctg tgggttcttg ctggggactc atgttccgct gctcctgacg 2103061 gggtcacctc gatgagcgag ttcagttagc tactacggta gtcaacccga ccgctgcagg 2103121 cccagttgag gacatgtgtc atcagccaca ccacagcgtg cctgcgtcac cggcccccgg 2103181 tggctacaca cccagcagcg ggcgcagcgc ttcagcggcg gtggtgatga ccccgggcag 2103241 atagccgtgc ggagccaagt tgatgattaa tccgtcgaca ccggcatcga gcaccttggc 2103301 ctgaatttgg tcggcgatct gtgccgggct gcccaccacc acgcgaccgc tcatctccgc 2103361 gggaatcgca tctggcgaga gtgtctcgtc gatcatcacc gtcaacagca ggctggtctg 2103421 aagcgtcgac cggtcccggc cggcctcgtc gcaccgcgcg gccagcgccc gcatcttgcg 2103481 cggcagctcg tcgaccgccg ccacgatgtt gagatggtcg gcaaagcggg cggcgatcgc 2103541 gaatgtcttt ttctcaccac cgccgccgat caagattggg atgcggtcgc gataccgcgg 2103601 ctcggccatc gccgattcgg tggtgtacca atcgccgaaa aacgttgggc gctcaccctt 2103661 gaccattggc tcgaggatct gtagcgcctc ttcgagccgg ttgaaccggt cactgaaagt 2103721 gccgaactcg aagccgagct ggcggtgttc cagctcaaac caaccggctc caatgccgag 2103781 gatcgctcga ccggcgctaa ccacgtcgag cgtggtgatg atctttgcca gcagggtcgg 2103841 gctgcggtag gtattgccgg tcaccaacgc gcccagttgc agccgctcgg tcgccgtggc 2103901 cagcgcacca agggccgtgt aggcctccag catcggctgg tcgggcgtcc ccaacatggg 2103961 cagttggtag aagtggtcca tcacaaacag ggagtcgtaa ccagccgctt cggcctcacg 2104021 cgcttgagcg atgacggacg ggaaaagctt ctccacccct gtgccgtagg agaagttggg 2104081 gatctgtaga cccagccgaa tagtcacact acctaccgta gcgatcggcc ggtgaagcga 2104141 aaggttcagc cgaagtgagc cagcgcgccg tggctgacgt gcagcgtctg gccggtgatg 2104201 tggcgagccg caggggtggt aaggaacagc gccagccgcg caatctcggc cgcgacgggc 2104261 gcgggtgtgc gcgaaagccc ttcgtaaccg gtctgcacgc tgcggccgca agcgactgta 2104321 ttgatggtga tcccgcgcgt gccgaaaacg gcggcctggc ccgcgatcca attcgagagg 2104381 gccgctttga tcgcggactc ggcgccaccg gcaggcgggt tctccgccac cacgctgaca 2104441 atcgagccgc cggagcgcag gtgatcgccc acggattgca ccgtcagcac caccgagagc 2104501 accgtcgcgt cgagcgcatt gcgccaggcg ttggccgtgt cggacaccga gtaggcgcgc 2104561 gggtcaccgg catcccagga cggcgctggc acgttgacga tggtgtccag gtgacggggg 2104621 aacagtcccc gtgcctcggt gaggctggtc gggtcggtgg tgtcgcacac aacggcgtcc 2104681 acgtcgagtt ccttcgcggc gacctcgagg tcgccgcggc gggcacccac cagggtgacc 2104741 ttgtggccgt cgttgcgaaa gccttcagcc attgtgcgcc cgagatcggt atccccgccg 2104801 gtgaccagca cctccactgc catgacctcc tcgtgttcaa cgctgaaccc agaccctgga 2104861 ccgttgcctg gaatcgcatc gtgatggcgt aagctccggt agatgttact ggacagtagc 2104921 tattcgggga aactccgcac cgccacgacg cgcagacgat cttggtaacc attaggtttg 2104981 gccagtgcgt tggatcggac tgtcaactgg cctagtgtca gcgatgctgg tcgcgggcct 2105041 ggtggcatgt ggatcgaatt cacccgcatc gtcgccagcc gggccgacgc agggtgcccg 2105101 gtcgatcgtg gtgttcgcgg ctgcctcgct gcagtctgcg ttcactcaga tcggtgagca 2105161 gttcaaagcc ggcaacccag gggttaacgt caacttcgct ttcgctggtt cttctgagtt 2105221 ggccacccag ctgacccagg gcgcgaccgc cgacgtcttt gcatctgcgg acaccgcgca 2105281 aatggacagt gtggccaagg cggggttgct ggccggtcat ccgacaaact tcgccaccaa 2105341 cacgatggtc atcgttgccg ccgcaggcaa tcccaagaag atccgatctt ttgccgacct 2105401 cacgcggccg gggctcaacg tggtggtctg ccagccgtcg gtgccatgcg gatcggcgac 2105461 ccggcgcatc gaagatgcaa ccgggattca tctcaacccg gtcagtgagg aacttagcgt 2105521 gaccgacgtt ctgaacaagg tcatcaccgg gcaagccgat gccgggctgg tctatgtcag 2105581 tgacgcgctc agcgttgcca ccaaagtgac gtgtgtcaga tttcccgaag ccgcgggtgt 2105641 ggtcaatgtc tacgccatcg cggtgctaaa gcggacctcc cagcccgctc tggcccggca 2105701 gttcgtggcc atggtgaccg ctgcggcagg tcggcggatc ctggatcagt cgggtttcgc 2105761 caagccctga cgatgcaccc gcctacggat ctgcctcgtt gggtatatct cccggcgatc 2105821 gcggggatcg tgttcgtggc aatgccgctg gtcgcgatcg ccatccgggt cgattggccg 2105881 cgtttctggg cgctgatcac tactccgtct tctcaaacgg ccctgctgtt gagcgtgaag 2105941 accgccgcgg ccagcacggt gctgtgcgta ctgctgggcg tcccgatggc gctggtgctg 2106001 gcccgcagcc gcggacgact ggtgcggtcg ttacgaccgc tgatcctgtt accgctggtg 2106061 ctgccgccgg tagtcggggg tatcgcgttg ctctacgcgt tcggccggct cggcctgatc 2106121 gggcgctacc tggaggcggc cggcatcagc atcgcattca gtaccgcggc tgtggtgctg 2106181 gcgcagacct ttgtctcgct gccgtatctg gtgatttccc tagagggtgc agcccgcacc 2106241 gccggagccg actacgaggt ggtggcggcg acacttgggg cgcggcccgg cactgtctgg 2106301 tggcgcgtga ccctgccgtt gctgctcccg ggcgtggtgt ccggatcagt actggcgttt 2106361 gcccgctcgc tcggagagtt tggcgcgacc ctaacctttg ccggttcccg gcaaggggtc 2106421 acccgtaccc ttccgctgga gatttacctg cagcgggtga ccgatccgga cgcggcggtg 2106481 gcattgtcac tgctgctcgt tgtggtagcg gcactggtgg tgctgggtgt gggtgctcgt 2106541 acgccgatcg ggaccgatac caggtagccg gtcatgagca agctgcagct gcgcgcggtc 2106601 gtcgccgacc ggcgtttgga cgtcgaattc tcggtgtccg cgggcgaggt gcttgcagtg 2106661 ctcgggccca acggtgcggg caagtccacc gccctgcatg ttatcgcggg gctgcttcgc 2106721 cccgacgcgg gcttggtacg tttgggggac cgggtgttga ccgacaccga ggccggggtg 2106781 aatgtggcga cccacgaccg tcgagtcggg ctgctgttgc aagacccgtt gttgtttcca 2106841 cacctgagcg tggccaaaaa cgtggccttc ggaccacaat gccgtcgcgg gatgtttggg 2106901 tccgggcgcg ctaggacaag ggcgtcggca ctgcgatggc tgcgcgaggt gaacgccgag 2106961 cagttcgccg accgtaagcc tcgtcagcta tccgggggcc aagcccagcg cgtcgccatc 2107021 gcgcgagcgt tggcggccga accggatgtg ttgctgctcg acgagccgct gaccggactc 2107081 gatgtggccg cggccgcggg tatccgttcg gtgttgcgta gtgtcgtcgc gaggagcggt 2107141 tgcgcggtag tcctgacgac ccatgacctg ctggacgtgt tcacgctggc cgaccgggta 2107201 ttggtgctcg agtccggcac gatcgccgag atcggcccgg ttgccgatgt gcttaccgca 2107261 cctcgcagtc gtttcggagc ccgtatcgcc ggagtcaacc tggtcaatgg gaccattggt 2107321 ccggacggct cgctgcgcac ccagtccggc gcccactggt acggcacccc ggtccaggat 2107381 ttgcctactg ggcatgaggc aatcgcggtg ttcccgccga cggcggtggc ggtgtatccg 2107441 gaaccgccgc acggaagccc gcgcaatatc gtcgggctga cggtggcgga ggtggatacc 2107501 cgcggaccca cggtcctggt gcgcgggcat gatcagcctg gtggcgcgcc tggccttgcc 2107561 gcatgcatca ccgtcgatgc cgccaccgaa ctgcgtgtgg cgcccggatc gcgcgtgtgg 2107621 ttcagcgtca aggcgcagga agtggccctg cacccggcac cccaccaaca cgccagttca 2107681 tgagccgacc cgcgccgtcc ttgcgtcgcg ccgttaacac ggtaggttct tcgccatgca 2107741 tcaggtggac cccaacttga cacgtcgcaa gggacgattg gcggcactgg ctatcgcggc 2107801 gatggccagc gccagcctgg tgaccgttgc ggtgcccgcg accgccaacg ccgatccgga 2107861 gccagcgccc ccggtaccca caacggccgc ctcgccgccg tcgaccgctg cagcgccacc 2107921 cgcaccggcg acacctgttg cccccccacc accggccgcc gccaacacgc cgaatgccca 2107981 gccgggcgat cccaacgcag cacctccgcc ggccgacccg aacgcaccgc cgccacctgt 2108041 cattgcccca aacgcacccc aacctgtccg gatcgacaac ccggttggag gattcagctt 2108101 cgcgctgcct gctggctggg tggagtctga cgccgcccac ttcgactacg gttcagcact 2108161 cctcagcaaa accaccgggg acccgccatt tcccggacag ccgccgccgg tggccaatga 2108221 cacccgtatc gtgctcggcc ggctagacca aaagctttac gccagcgccg aagccaccga 2108281 ctccaaggcc gcggcccggt tgggctcgga catgggtgag ttctatatgc cctacccggg 2108341 cacccggatc aaccaggaaa ccgtctcgct cgacgccaac ggggtgtctg gaagcgcgtc 2108401 gtattacgaa gtcaagttca gcgatccgag taagccgaac ggccagatct ggacgggcgt 2108461 aatcggctcg cccgcggcga acgcaccgga cgccgggccc cctcagcgct ggtttgtggt 2108521 atggctcggg accgccaaca acccggtgga caagggcgcg gccaaggcgc tggccgaatc 2108581 gatccggcct ttggtcgccc cgccgccggc gccggcaccg gctcctgcag agcccgctcc 2108641 ggcgccggcg ccggccgggg aagtcgctcc taccccgacg acaccgacac cgcagcggac 2108701 cttaccggcc tgaccggatc cggccgcacc ccaagtgata cccctgggcg gggtgtcagc 2108761 gcggccgggc gctcttgagc cggcgcagcg gcgtccatgg agcgccgccg gccaacgcgg 2108821 cgttcttggc gccggcgcga acgttgttca ggtgccaacc ggtggtgggt cgtggttggc 2108881 gacttgtaca gcttccggtt ctccataggt cgcgccgggg acgggcagcg ggtcgtgtgc 2108941 gcgtctttca gtgcaccgtg cgaaacgccg acaccgttga actccacctg aaagcaccgc 2109001 tgaacagcag aaaagcgccc acgaaaacac cgtggggcgc cacacacgtt tgatcacgcc 2109061 acaacccacc gacaccgtca ctaccctcaa atcgttacgc agaagcggta taccgatatc 2109121 acggccctgt gctgggctaa gccagcgtct gcaaggagaa ccgcatggac atcacggcaa 2109181 caaccgaatt ttccgccatg aacctcgacg gcaagacggg tataggttgg ctcggctaca 2109241 tcgtcatcgg cggtatcgcc ggctggctcg ccagcaagat cgttaagggg ggcggctcgg 2109301 gcatcctgat gaacgttgtg atcggcgtcg tcggggcatt cggcgccggc ttggtcctta 2109361 acgcgctggg cgtcgacgtc aaccatggcg ggtactggtt caccttcttc gtcgccctgg 2109421 gcggggctgt cgtcctgctg tggatcgtcg gcatggtgcg caagacctag cgccaaactg 2109481 ttgtcggcca tgcaaattga gtgtgactgc ggcggccggc gacggtagcg gcatgatgga 2109541 gtgatggtct caccggcgac cacggcgacg atgagtgcgt ggcaggtgcg tcggcccggc 2109601 ccgatggaca ccggcccgct cgaacgagtg accacccggg tgccgcgccc ggcgccatcg 2109661 gagttgctgg tggccgtgca cgcatgcggg gtgtgccgca ccgatctcca cgtgaccgaa 2109721 ggtgacctgc ccgtgcaccg cgaacgggtg attcccggcc acgaggtagt gggagaggtc 2109781 attgaggtgg gctcagcggt gggcgcggct gccggtggcg aattcgaccg aggagaccgg 2109841 gtgggtatcg cctggctgcg tcacacttgc ggggtctgca agtactgccg gcgcggcagc 2109901 gagaacctct gcccgcaatc ccgctacacc ggctgggacg ccgacggggg atacgccgaa 2109961 ttcacgacgg ttcctgcggc tttcgcgcac catctgccga gcggctatag cgacagcgag 2110021 ctggcgccgt tgttgtgcgc cggcatcatc ggatatcgat cgctgctgcg caccgagcta 2110081 ccacccggtg gccggctggg tctctacgga ttcggcggca gtgcccacat caccgcccag 2110141 gtcgcgttgg cgcaaggcgc cgaaatacat gtgatgacac gcggggcccg cgcgcgcaag 2110201 ctggcgctgc aacttggcgc tgcatcggct caggacgccg ccgaccggcc acccgtgccg 2110261 ctggacgccg cgatcttatt cgccccggtc ggggatctgg tgctgcccgc gctggaagcg 2110321 ctggaccgtg gcggcatctt ggcgatcgcc gggatccacc tgacagatat tccggacctg 2110381 aactaccagc agcacttgtt ccaggagcgt cagatccggt cggtcacgtc gaacacccgc 2110441 gccgatgcgc gcgcgttctt cgacttcgcc gcccagcatc acatcgaggt caccacgccg 2110501 gagtacccgc ttggccaagc cgatcgtgcg ctgggcgacc tgagcgccgg ccgcatcgcc 2110561 ggtgccgccg tgctgctgat ctgaccgagc tcaggtcgac aggtgccaga ccagggcagc 2110621 ggccagggca cccatcccgt tcagcgacca atgcagtgcg atcggtgcga tcaggctgcc 2110681 gctgcgccgt cgcagccagc tgaacacgaa tccggccact ccggtggcca acaccgccag 2110741 catgacaccg gccaccagcc cgatgatccc gccaccgaac agtcgagtga agccgacatt 2110801 gctgctcgtg agccccagcg acgtcgcaat atgccacaga ccgaacagca ccgaacccgc 2110861 caccgcgaca ccccggaatc cccaagcccg attcagcgcc ccatgcaaca caccgcggaa 2110921 ggccagctct tcggggatga cggtttgcag cgggatcatg accatcgagg cgatcaccgc 2110981 gccggagatc gtcgcgtagt gatggttcat gaacatcggc cgggttatcg gcagcaggac 2111041 acctaccgag atcaccgcca ccaccagggc aacggccgct agcgcataga cgagcccgga 2111101 tttccagtgt tggcggctca gtccgagttc agcccagccc aggcctctac tccgcaccaa 2111161 gatcaccagt ccgaccgcgg cggccgggac ggtggcgatg ctcgcccacg gtgtggtgaa 2111221 atgcgcgatc aggttcgtca gtaccagcac caggacgacg acggcgatgt cgacatatat 2111281 ccggaaccgg tgcatcaccg agaggtgcga caccagtgga cctggatgaa cggctgcgca 2111341 agcagtcaag tggtcagaca tcgtcagcag agtctaccgg cggagggctc ggtgtccgct 2111401 ctcgcgcgta ggccttgagc tcggctgcga gcgcgtctgc cgccaacagc tggggaagca 2111461 gctccgattc agaggttcgg gcgcgaaaca cgagcccgac ggtcacgttg tgctcagggc 2111521 ggtaatcgac ggtgattgtg tcgccggcgc gcactgttcc gggagcgatc acccgtaggt 2111581 aggcgcctgg tttggcggcc cgggtgaagg tcttgatcca ataacgcaaa tccaggaagg 2111641 ccgcgaaggt ccggcacggg atccggggcg ccgagacttc caacaccaat ccgtcggagc 2111701 cgatgcgcca gcgttcacca atccgcgcgt acgtcacgtc gacgcccgag gtggtcagat 2111761 tctcgccgaa cattccgttg tgaagggtgc ggtgaagctg ggtttcccac gcgtcgaggt 2111821 cttctcgcgc atacgcatag acggcctgat catcaccgcc atggagcttc gggttgccga 2111881 cggtgtcgcc aaccaggccg ctgccgacac ccgcatgcat cgacccgggt gcccgcacca 2111941 tgaccgcctc agatgccgcc actttgtcga ttccggtcaa cttcgactgc gcgcgcggat 2112001 cagggttcgc ccgaacacga gccaggttga ccgacaacac atgcgccacc cgcacagggt 2112061 agctctgacg cgcgttggtc cacgccagcc ggcgcggcgc aacggtcact cctcgccgcg 2112121 agcccgagcc tcgtaggtcc tgcgcttctc catgtcgaca tcgtcggtga agacatgctc 2112181 gccgccgagg agtcggttca agccctcgga aacctggcgc ggcatgaacc gctgtgccac 2112241 gatcatcgag ccagccgctt tcgtgacccg cacccgcggt ttgggatgaa caatcagccc 2112301 gacgatcgcg tcggcgatat cggccggctc ggcgttcttg aatcctttga tcccaccggt 2112361 gcccgcaatg agctcggtgt tgacaaacga cggcaacacc atcgagaact tcacgccggc 2112421 cgaacggtat tcaagcctgg ccgaatcggt gaacgcgacc accgcgtgct tgctggcaca 2112481 gtaagtggcc acgcctacgg cgtagatttc cccggcaagc gaggcgacat tgataacgtg 2112541 tccccgcccg cgcgggacca tccgctgcgc cgccagcttg ctacccaaga tcaccccgta 2112601 gacgttgatg tccaggattc ggcgggttac cgggtctggt tcgtcgacaa tccgccccac 2112661 gggcatgatg ccggcgttgt tgaccagcac gtcgatcggg ccgagttggc gctcgacggc 2112721 gtcgaggaat cccgaaaacg aatccgggtc ggtgacatcg agtttgccgt acatgtcgag 2112781 gtcgagatcg gcacccgact ctttcgccat cgcctcatcg atgtcgccga tagcgacctt 2112841 ggctcccaag ttgtgcagcg cggccgctgt ggccaatccg atcccccggg cgccgccggt 2112901 gatggcgatt actttgtcct ggaccttgtc ccggatcttg acgccgatgg atgtcctgcc 2112961 tggcactgtc gtcccttcgc tcggcgggcc ttagccgccg tccaatgcgg tcgcgcccgt 2113021 gtagtcacgg tagccgcgaa cgccgatgaa acagctacgg tgtgcacgtg cccgaacgat 2113081 tgctcgatgc cgtgcgtgtg ctcgacttgt ccgacggctg ttctgctgga ggcaccgata 2113141 tggtgacacg actgctcgcc gacctgggcg cagacgttct caaggtggaa ccccccggcg 2113201 gcagcccagg acgccacgtg cggcccacgc tggccggcac cagcatcggg ttcgccatgc 2113261 acaacgcgaa caaacgcagc gcagtgctca acccgctcga cgagagcgac cgtcggcggt 2113321 tcttggacct cgccgccagc gccgacatcg tcgtcgactg tggtcttccg ggacaggccg 2113381 ccgcgtacgg ggcatcgtgt gccgagttgg ccgatcgcta ccgacacctg gtggcgctgt 2113441 cgatcaccga ctttggcgct gccggtccgc ggtcgtcatg gcgcgcgacc gatccggtgc 2113501 tgtacgcgat gagtggtgct ctctcgcggt cgggccctac cgccggcacg ccggtactgc 2113561 cgccggacgg tatcgcttcg gcaaccgcag cggtgcaggc agcctgggcc gtactggtcg 2113621 cctatttcaa ccgattacgt tgtggtactg gggattacat cgacttctcc cggtttgacg 2113681 ccgtcgttat ggcgttggat ccccccttcg gggcgcacgg gcaggtcgca gccggcatcc 2113741 gcagcaccgg gcgatggcgg ggacggccca agaaccagga cgcttacccg atttatccgt 2113801 gccgggacgg ctacgtacgg ttctgcgtga tggcgccgcg gcagtggcgc gggctgcgcc 2113861 gctggttggg ggagcccgaa gattttcagg accccaagta cgacgtgatc ggcgcacgtt 2113921 tggccgcatg gccgcagatc agcgtgttgg tcgcgaagtt gtgcgccgag aagaccatga 2113981 aggagttggt ggcagccggc caagcgctcg gggttcccat taccgcggtg ctgacaccgt 2114041 cgagaatcct ggcctccgaa cacttccagg cggtgggtgc gatcaccgat gccgagctcg 2114101 ttccgggggt gcgcaccggg gtgcctaccg gatacttcgt tgtcgacggg aagcgcgccg 2114161 gtttccgtac tccggccccc gccgcggggc aggacgaacc gcgctggctc gcggatccag 2114221 cgccggtgcc cccaccctca ggccgggtcg gcggctatcc attcgaaggt ctgcggattc 2114281 ttgatctggg catcatcgtg gccggcggcg agctcagccg gctgttcggc gacttgggcg 2114341 ccgaggtcat caaggtcgaa agtgccgacc accccgacgg gttgcggcag acccgagtcg 2114401 gggatgcgat gagtgaatca ttcgcgtgga cccatcgcaa tcacctcgcg ctgggcctgg 2114461 acctgcgcaa cagcgagggc aaagcgatct tcggtcgcct ggtcgctgaa tccgacgcgg 2114521 tgttcgccaa cttcaaaccg ggaaccctta cctcacttgg gttttcctac gatgtactgc 2114581 acgccttcaa cccccggatc gtgctcgccg ggagtagtgc attcgggaac cgagggccgt 2114641 ggagcacccg gatgggctac gggccactgg tgcgcgccgc caccggggtc acccgtgttt 2114701 ggacatccga tgaggcgcag ccggacaact ctcggcatcc cttctacgac gcgacgacga 2114761 tcttccccga ccacgttgtc gggcgggtcg gtgccctgct cgcgctggcg gccctgatcc 2114821 accgcgatcg aactggcggc ggagcccacg tccacatctc ccaggccgaa gtcgtcgtca 2114881 atcagctaga caccatgttc gttgccgagg ccgcccgagc gaccgacgtt gccgagatcc 2114941 acccggacac cagtgtgcat gcggtctacc cttgtgctgg cgacgacgaa tggtgcgtca 2115001 tctcaatccg ctccgacgat gaatggcgtc gcgcgacatc tgttttcggc cagcctgaat 2115061 tggcgaacga cccacgcttc ggggcaagcc ggtcacgcgt ggccaaccgt tcggagttgg 2115121 tggccgcagt gtcggcctgg accagcaccc gtaccccggt gcaagcggcc ggcgcgctgc 2115181 aggcggccgg agttgcggcc ggcccgatga atcgcccgtc ggatatcctc gaggatcccc 2115241 agctgatcga gcgaaacctg ttccgcgaca tggtgcatcc gctgatcgcc cgtccgctgc 2115301 ccgccgagac gggtccggct ccgtttcgtc acattccgca ggcaccccaa cgcccggcgc 2115361 cgctgcccgg acaggacagc gttcagatct gccgcaagct gctcggcatg accgcggacg 2115421 agaccgaacg cctaatcaac gagcgcgtaa tgttcgggcc ggccgtcact gcctaagtgg 2115481 tctcgccggt gtcgttcgtc gacggtcggc tgattgccct tccggctccg agatcgacgt 2115541 tttgcccgcc tgttcgtgct ttatctgcga agccccgatc tgggcgcatc ggggtgacgc 2115601 attcgggcag ctaaagcttt tcgacccgca agccggcggt gcccctcctc gttccgctgc 2115661 ccggtctgct cgatcggttc ggggtcgccg cgctaggccc aattgcccgg ctcctcctcg 2115721 ggccgttcca cgacccgcat cgtcgccggg ctaggttcaa gccatgccgg tagaccccag 2115781 gacgccagtg ctgatcggct atggacaggt caaccaccga ggcgacatcg acgccgagaa 2115841 gcagtccatc gaacccgtcg acctgatggc cgccgcggcc cggaaagccg cggattcgac 2115901 ggtgctcgag gcggtggatt cgatccgtgt ggtgcacatg ctgtcggcgc attaccggaa 2115961 tcccgggcag ctcctcggcg aacgaatcaa ggcgaggacc ttcaccaccg gttacagcgg 2116021 ggtgggcggc aacatgccgc aatccctggt caaccgggca tgcctggaca tccagcgcgg 2116081 gcgggccggc gtggtgctgc tggctggcgc cgaaacctgg cgcacccgaa cgggcctgcg 2116141 cgccaagggc agcaaactgg agtggactgt gcaggacgaa tccgttccgc tgccggacat 2116201 ggccggcgac gacgttccga tggccggtgc ggctgagctg cggatcaacc tggaccggcc 2116261 ggcctacgtg tacccgatat tcgagcaggc gctgcgcatc gcctacggcg agtcgatcga 2116321 gaaccaccga aagcggatcg gcgagctgtg ggcgcggttc agtgccgtag ctgctgacaa 2116381 cccgcacgcg tggatccgca acccggttac ggctgacgag atctggcagc ccggcccaca 2116441 gaaccggatg gtcagctggc cctacaccaa gcttatgaac tccaacaaca tggttgacca 2116501 gggtgccgcg ctgctgctga cgtcggtcga acgtgcgaca cgtctgcgaa taccggccga 2116561 acgctgggtt tatccacagg ctggcaccga cgcccacgac acaccggccg tcgccgaccg 2116621 ccaccgactg catcggtcga cggccattcg gatcgccggt gcccgggcgc tggaactggc 2116681 tgggctgggg ctcgatgaca tcgaatacgt cgacctgtat tcgtgctttc cctccgctgt 2116741 ccaagtcgcc gcaatcgaac tcggcctgga caccgacgat cctgcccgcc cgctgaccgt 2116801 caccgggggc ctgaccttcg ccggcgggcc gtggagcaat tacgtcacgc actccatcgc 2116861 caccatggct gaactgctgg cggccaatcc cgggcgccga ggcctgatca ccgccaacgg 2116921 cggttacctg accaaacaca gtttcggggt ctacggcacc gagccgccgt cggaattccg 2116981 ctgggaggac atgcaacccg cggtcgatag ggagcccacc ggagatgggt tggtcgagtg 2117041 ggaaggcatc ggcaccgtcg aagcgtggac cacaccagtc aaccgggacg gacaacccga 2117101 gaaggcgttc ctggcggtgc gcacgcccga cgggtcgcgc agcttggccg tgatcaccga 2117161 tcccgcatcg gtgcaagcaa cggtgcgcga ggacatcgcc ggcgtcaagg ttgccgtcgc 2117221 ccccgacggc accgcgaccc tgcgatagcc ggcgggcagc acgagtcacg ttccagaagc 2117281 aatggtcgcg caagcgacac tgacgtgcct attgtcatga ggagacgttg ggggaggtga 2117341 ggccgggtgc agatcctggt taccgacgcc acgggtgccg tcgggcggtc ggtcactcgg 2117401 cagttgatcg ctgccggaca cacggtgagc ggtatagccc agcacccgca cgatgctctg 2117461 gacccccgcg tcgactatgt ttgcgcgtcg ttgcgcaacc cagtgctgca agagttagcc 2117521 ggcgaagccg acgcggtgat ccatctcgcc ccggtcgaca ccagcgcccc gggcggtgtt 2117581 ggcatcaccg gactggcaca tgtggccaac gcggccgccc gcgccggtgc ccggctgctg 2117641 ttcgtttctc aggccgctgg gcgacccgaa ctatatcggc aggctgagac gctggtgtcc 2117701 accggttggg cacccagctt ggtcatccgt attgcgccac cggtcggccg ccaactcgat 2117761 tggatggtgt gccggacagt ggccacgctg ctgcggagca aagtctcggc acggccgata 2117821 cgagtgctac atctcgacga cttggtccgc ttcctggttt tggcgctgaa taccgaccgc 2117881 aacggtgtcg ttgacctggc cacccctgac accaccaatg tggtcaccgc gtggcggctg 2117941 ctccgatccg tggacccgca cttgcgaaca cgtcgggtcc gcagctggga gcaattgatt 2118001 cccgaggtgg atatcgctgc cgtgcaggag gattggaact tcgagttcgg ctggcaagcg 2118061 accgaagcaa ttgtcgacac cgggcggggc ctcgtcggcc gcagactgca cccggcaggc 2118121 gcgaccaacg gatcgggtca actagcactg ccggtggagg cgcccccgcg gtctgtgcct 2118181 tcccacgggg aacccttggg cagcgcggct ccagaagggt tggagggaga gttcgacgac 2118241 cgtatcgacg agcggttccc ggtcttcagc tcggccagtc tcgccgaagc gctgccgggt 2118301 ccgctgaccc cgatgacgct ggatgtccag ttgagtggac tgcgcgcggc cggtcgggcg 2118361 atgggtcggg tactggcgct tggcggtgtc gttgccgatg agtgggagag aagagccatc 2118421 gcggtgttcg gtcaccgccc gtatatcgga gtgtcggcca atattgtggc cgccgcccaa 2118481 ctgccggggt gggacgcgca ggccgtagcc cggcgggcac tgggcgagca accgcaggtc 2118541 actgagctgc ttccgtttgg tcgaccgcaa cttgcgggcg gaccgctcgg ctcggtcgcg 2118601 aaggtggtcg tgacggcgcg gtcgctggcc ctgctgcgcc atctccggag cgacacacac 2118661 cactatgttg ccgccgcaga tgccgagcac ctcgctgccg ggcagcttgc ctcgctaccg 2118721 gacgccggct tggaggtccg gattcggctg ttgcgtgatc gcatccacca aggctggatt 2118781 cttacggtgc tgtgggtgat cgacacgggc gtcacagcgg cgacgttaga gcacacccgc 2118841 gcaggctccg cggtgtccgg agggggcatg atcatggaaa gtggcagaat cggcgccgag 2118901 attgctccgc tggctgcggt gctgcgcgcc gacccgccgc tgtgcgcgct ggccaacgac 2118961 ggcaacctcg ccagcatccg cgcgctgtct gctcccgccg ccgccgcagt tgacgcggtc 2119021 attgcccgga tagggcaccg cgggttaggc gaagccgagc tggctaacct gacgtttgcc 2119081 gacgatccgg cgctactgct gaagacagcc gccgaaatcg ccgcgcggcc cgccgggcca 2119141 gctcacccag cgacgttgat ccagcgactg gctgccggca cgcgcagtgc ccgggagctg 2119201 gcgcacgaca ccaccatccg attcacccat gagctccgga tgacattgcg ggagttggga 2119261 tctcgacgag tcgcggcgga tgtgatagac gtcgttgacg acgtgttcta cctgacctgc 2119321 gacgaactga ttaccacgcc ggccgacgct cggctgcgaa tcaaacgtcg gcgcgccgaa 2119381 cgagaacgcc tgcaggcaca gcgcccgcca gacgttatcg atcatgcctg ggtacccgtg 2119441 gagtagcggt caacacacgt caattcgtcg tcaggtccgc caacggccac tgcggatcaa 2119501 ccagcctgtc aacgtcgacc gggttcccgg accggatcag gcccttgacg tcgtccacca 2119561 cgtcccagac gttgacattc atcccggcta gcacccggct gtcgccgtcg agccagaagg 2119621 agaggaactc gcggccggca acgttgccac ggaacaccac ccgatcacag ctgggggcgt 2119681 ggccgacgta ctccatgccg aggtcgtatt gatcggtgaa caaatagggc agttcagcgt 2119741 attcgcccgg ccggcccagc atgccggcag ccgccaccgc gggttgtttg agcgcgttgg 2119801 cccagtgttc ggtacggacg cgggtaccca atagcgggtg ttcagcggcg gcaatgtcgc 2119861 cgactgcgta gatgtcggga tcgctggtgc gcagcgatgc atcaaccaac acaccgccct 2119921 cgcccatcgc cagcccggcc tgttgggcga gttctacgtt gggcttcgcg cccacagcga 2119981 ctagcacggc gtcggcggca accgtcgacc cgtcacgcat cttgagcccg gtcgccttgc 2120041 cgtcggctgc agtgatctct tcgagctggg tctgcaaccg taagtccacc ccttgatctc 2120101 gatgtaggtc ggcaaacact ttgccaaccg cttccccgag cgcggccagc agcggttgta 2120161 tggcggtctc gacgacggtg acgtcgacgc cacgttgacg cgcactggcg gccacttcca 2120221 ggcctatcca gccggcaccc accactgcga gggaagaccc ctgcaccaga acggagttca 2120281 atgccacggc gtcgttgtag ctgcgcaggt agtggacgcc ggcggcatcg gatccaggta 2120341 ttggtgggcg ccgtggggcc gatcccgtgg ccaacagcag cttgtcgtag cgcaccgcag 2120401 cgccgtcggg aagctctacc gtgtgtgcgg accgatccaa tgacgacacc cgcacgccga 2120461 gccgcacatc cacgtcatgg tcgcggtacc aatcggaggt ctggatggtg aagtcgctca 2120521 gcgacttttt gccggccaga aactccttgg aaagcggcgg ccggtcgtag ggcaggtgct 2120581 cttcgtcgcc gaacaagata atccgaccgc cgaagtcgct gcggcgcaac gcctctacgg 2120641 ctttagcccc ggcaagtccc ccgccaacaa tgacgaacgt ggttgagctg gccataattg 2120701 ctgctccgtc ctgttgtgtg cggtgccgct tgacagccta cgagccggtc gcgtacctgg 2120761 gtcaaccggt cacctgcagg cgcagctcgt cgtcttacgc cactcgcact aacgcagcag 2120821 cgagcagcgc attggagctg ggtgccaccg acgccagctt cttcgggtca gtgggcaagc 2120881 cgagctgctt cgccgcggcg gtggctcgat cgtcgaaata cggtcgtacc cagatccaga 2120941 cgtcttggac ctcgcgtaag aaaatgtcgg caccggtgtc gccgattccg ttgaaagtct 2121001 tgagcatacg tttggcggcc gaaacgtcgg gtcgtgtgcg ctgggcgagt tcccgcaaat 2121061 caccggagta ctcgtcgcga acccggtgag cgatagcggt gagccgggtg gctgagctct 2121121 cgtcataccg cacgtagtgg gcacggccaa acgcactgat catcgtttgt cgctctgctg 2121181 acagcacagc tttgggtgtc cgcaggcccg agcagaacaa ttcccgggcg gcacgtgctg 2121241 ccgtggcggc accgatcggc ttgctggcca gcatgcacag caccagcagc tgaaacagcg 2121301 gcatcggttt gtccctgatc cggattcccg cctccgccgc gtaagtggtg ccggcgagtt 2121361 taagcagtcg tcgtgccagt ggctccggct tgatcacaag caaccgcata cccgcaatgc 2121421 gtggcggcaa accgcgacta ttgctcgggc aagcgcgctc cggcggccta agccccggtt 2121481 ccggccaacc cctgtcagtc caaatccacc cggatggtca gcaagtcggt gcccatcgcg 2121541 cgtacgccgg cactgttcag ccggggtagg ccgcgcagcc gctgcctcgg atcgtcgtcg 2121601 ggtagcaggt aggcggtccc actgcgccat cggccgccga tgcggacccg cacggcgggg 2121661 ttggccttga tgttgtagac gtaatcggaa tgctcgccgt gctcggacac catccagaac 2121721 tggttgtcta cgacgcgccc gcccaccgcg gtacgccgcg gctgtcccgt tttgcggccg 2121781 atggtttcga gcatggtcat cggcagttgc cggccgattg gattgaccac gaaccgttgc 2121841 acgcgatgga cgaattcccg cttgagattc atagctgcat tcaacgctac cgatctggcc 2121901 gcggcctcac gttggtgccc cgatagggcc gagccgccgc agttgtgtca cgtgccgagg 2121961 tgacagctcc tcaaggcagg tcacgcccag tagccgcatg gtccggatca cacctgtctg 2122021 aaggatctcg atcgcgcggt tgacgcccgc ctcaccaccg gccatcagcc cgtaaaggta 2122081 ggcccgcccg atcagcgtgc accgtgcccc caacgcgatc gccgcgacga tatcggcgcc 2122141 cgacatgatg ccggtgtcca ccaggatttc ggtgtgtttg cccagttcgc gtgccacgtg 2122201 gggcaacagg tggaagggta ccggggctcg gtcaagctgg cggccgccgt gattggacaa 2122261 cacgatgccg tcgacgccgc ggtccaccac ggcgcgggcg tcgtcgagtg tttggatccc 2122321 tttgacaacg agcttgcccg gccactgcga cttgatccag gccaaatcgt cgaaggtgag 2122381 gctggggtcg aacacggtgt tcaagtactc gccgacggtg ccaggccagc gatccagtga 2122441 agcgaaggcc agcggttcgg tggtcaacaa gtcgaaccac caccgcgggt gtcccatcgc 2122501 gtcgagaacg gttcgcagcg tcagcgccgg cgggatggac atcccgttgc ggacatcgcg 2122561 tagccgggca ccggcgaccg ggacgtcgac cgtgaccagc atggtgtcaa atcccgcggc 2122621 ggcgacgcgc cgcaccaatg ccatcgagcg gtctcgatca cgccacatat acagctggaa 2122681 ccatttgcgg ccctgcggca cagcgatgac gaggtcttcg atggcacagg tggccagggt 2122741 ggatagcgaa aacgggatcc cagccgcggc cgccgcccgc gcgccggcga tctcgccctc 2122801 ggtgtgcatc aagcgggtga acccggttgg cgcgatcccg aatggcaaga cggtgggctg 2122861 accgaggacg ttccagccgg cgcacacggt ggtgacgtca cgcaggattg tcgggtgaaa 2122921 ctcgatgtcg cggaaccctt gtcgagcacg cgcgatggac agttcgtcct cggcaccccc 2122981 gtcggcgtag tcgaacgccg ccctaggggt acgccgtttg gcaatgcgtc gcaggtcctg 2123041 gatggtcagc gcggcgccca ggcggcgctt ggaggtgtcg aactgcggcc tgttgaactg 2123101 gagcaggggt gccagatcgc gcactctggg cactcgccgg ttgaccgcca tccgtttatc 2123161 taaccagttt gatatgaagt cagcaagcga cccgttcgac ctgaagcgtt tcgtgtacgc 2123221 gcaggctccg gtctaccgca gcgtcgtcga ggagctgcgc gccggacgaa agcgcggtca 2123281 ttggatgtgg ttcgtcttcc cacaactccg cgggctaggt agtagcccac tggcagtgcg 2123341 ctacggcatc tcctcgctcg aggaagccca ggcctatctg cagcatgacc tgctcgggcc 2123401 ccgcttgcat gagtgcaccg ggttggtcaa ccaggtgcaa ggccgctcaa tcgaggaaat 2123461 cttcggcccg cccgacgacc tcaagctgtg ctcgtcgatg accctgttcg cccgtgccac 2123521 cgacgccaac caggactttg tcgcgctgct cgccaagtat tacggcggcg gagaggaccg 2123581 gcggacggtg gcattactgg cggtcacata gaccgcgcga tccaccgggg cgtcgacgcc 2123641 tgacagcgga tgtaggttcg ggctcatgga gaaggtgatc gccgtgctca tgcggcccga 2123701 gccagacgac gactggtgtg cccgccaacg agctcaagtc gccgacgccc tgctgggact 2123761 gggcgttgct gggctgtcga tcaatgtccg ggacagtacc gtgcgcgact cactgatgac 2123821 cctgacaacg ctgtacccac cggtcgcagc ggtggtcagc ctgtggaccc agcagtgcta 2123881 tggcgagcag gtagcagccg ccctcaggct actggctcag gagtgtgatg aactcggcgc 2123941 atacctggtg accgagtcgg ttccgctgac cttcccatcg ctcgtcgagt ccggttctcg 2124001 tacaccgggt ctggccaaca tcgcgctcct gcgccggccc gatggcctgg accaggcgac 2124061 ctggctgacc cgctggcagc gcgaccacac gcaagtggct atcgaggcac aggcgacatt 2124121 cggctacacc cagaactggg tggtacgagc cctcacccca gaggcaccgg gaatcgcggg 2124181 cattgtcgaa gagttgtttc ccgtggcggc gacaaccgat ctgaaagcct tcttcggagc 2124241 cgccgacgac aacgatctgc ggaatcggat aagccggatg gtcgcgagca catctgcatt 2124301 cggtgccaac cagaacatcg acaccgtgcc aaccagccgc tacgtgttca gaacaccgtt 2124361 caaggattga ggaacgtgag atgacaacac tcaacgaagc cgcggcactg gcggcggcag 2124421 aacgtgggct tgcggtggtt tccaccgttc gtgccgacgg caccgtgcag gcgtcgctgg 2124481 tcaacgttgg actgttgccg catcctgtca gcggcgaacc atctctggga ttcaccacct 2124541 atggcaaggt caaactcggc aaccttaggg cgcgcccaca actggccgtc acgttccgca 2124601 acggttggca gtgggcgacc gtcgaaggcc gagcacaact tgtcggcccc gacgatccgc 2124661 ggccgtggct ggtcgacggc gagcgattgc ggctgctact ccgcgaggtc ttcactgcgg 2124721 cgggtggcac gcacgacgac tgggacgagt acgaccgggt gatggcgcag gagcagcgcg 2124781 ccgtggtgct gatcacgccc acccgcatct acagcaacgg ctgagggact cagcaaacgg 2124841 cgtcgctcgt gcgacctgcg gggtcgagtt gggttgggtt gagtcgggcg gctgcgatga 2124901 tagctcgcag tgtgcgccgg cagcgtccgc agtcgccgcc agccccgcac acagcggcca 2124961 cttctttgga ggtcgacgca cctcgcgcca cggcgtcaca cacggtttgg ttggtgacgc 2125021 cgacgcacaa gcacacgtac atcagcaaac ccccagcaga tgctgcgtcg gcgaacgatc 2125081 aagccgcata ttagtggagt ctagcctaag ctgattagtg gagtctaacc taacaatgac 2125141 ccgcggcttg gactttgcgc cggcgagacg cgccgacgcc gcaacaaacc ctgcgccgac 2125201 ccgtactcgc tgcactagat tgagacgcgg cacgcaaacg tgctgttatc agcccaagac 2125261 gagcccgaca ccggtgcgct ccagccctgc ccacctggcg cggttcgcca cgacagcctt 2125321 atatcccata ggagtggtca tgcaaggtga tcccgatgtt ctgcgcctgc tcaacgaaca 2125381 attgaccagc gagctcaccg ctatcaacca atactttctg cactccaaga tgcaggacaa 2125441 ctggggtttt accgagctgg cggcccacac ccgcgcggag tcgttcgacg aaatgcggca 2125501 cgccgaggaa atcaccgatc gcatcttgtt gctggatggt ttgccgaact accagcgcat 2125561 cggttcgttg cgtatcggcc agacgctccg cgagcaattt gaggccgatc tggcgatcga 2125621 atacgacgtg ttgaatcgtc tcaagccagg aatcgtcatg tgccgggaga aacaggacac 2125681 caccagcgcc gtactgctgg agaaaatcgt tgccgacgag gaagaacaca tcgactactt 2125741 ggaaacgcag ctggagctga tggacaagct aggagaggag ctttactcgg cgcagtgcgt 2125801 ctctcgccca ccgacctgat gcccgcttga ggattctccg ataccactcc gggcgccgct 2125861 gacaagctct agcatcgact cgaacagcga tgggagggcg gatatggcgg gccccacagc 2125921 accgaccact gcccccaccg caatccgagc cggtggcccg ctgctcagtc cggtgcgacg 2125981 caacattatt ttcaccgcac ttgtgttcgg ggtgctggtc gctgcgaccg gccaaaccat 2126041 cgttgtgccc gcattgccga cgatcgtcgc cgagctgggc agcaccgttg accagtcgtg 2126101 ggcggtcacc agctatctgc tggggggaac tgtcgtggtt gtggtggctg gcaagctcgg 2126161 tgatctgctc ggccgcaaca gggtgctgct aggctccgtc gtggtcttcg tcgttggctc 2126221 tgtgctgtgc gggttatcgc agacgatgac catgctggcg atctctcgcg cactgcaggg 2126281 cgtcggtgcc ggtgcgattt ccgtcaccgc ctacgcgctg gccgctgagg tggtcccact 2126341 gcgggaccgt ggccgctacc agggcgtctt aggtgcggtg ttcggtgtca acacggtcac 2126401 cggtccgctg ctggggggct ggctcaccga ctatctgagc tggcggtggg cgttttggat 2126461 caacgtgccg gtttcgatcg cggtgctgac agtggcggca accgccgtcc ctgcgttggc 2126521 ccgaccgccc aaaccggtca tcgactacct tgggatcctg gtcatcgctg tggccacgac 2126581 cgctttgatc atggccacaa gttggggcgg aaccacctac gcctggggct cagcgaccat 2126641 tgtcgggctg ttgatcgggg ccgcagtggc gctgggtttc ttcgtgtggc tggagggccg 2126701 cgccgctgcg gccatcctgc cgcccaggct gtttggcagc ccagtatttg ccgtgtgctg 2126761 cgtcctgtcc ttcgtggtcg gattcgcgat gctgggtgca ctgaccttcg taccgatcta 2126821 tctggggtac gtggacggcg cgtcggcgac cgcgtcaggt ctgcgcacgt tgccgatggt 2126881 gatcggcctg ctgatcgcct cgaccgggac gggtgtcctg gtcggccgga cgggccgcta 2126941 caagatcttc ccggtcgcgg ggatggcgct gatggcggtt gcgttcctgc tgatgtcgca 2127001 gatggacgag tggacgccac cgctgctgca atcgctgtac ctggtcgtcc taggtgccgg 2127061 catcggattg tccatgcagg tgctcgttct catcgtgcag aacacgtcgt ctttcgaaga 2127121 cctcggcgtc gcaacatcgg gtgtgacctt cttccgggtg gtcggcgcct cgtttggtac 2127181 cgcaacattc ggtgcgttgt tcgtaaactt cctggaccga agactcggtt ccgcgctgac 2127241 gtcgggcgcc gtgcctgtcc cggcagtgcc atctccggct gtcttgcatc agctgcccca 2127301 gagcatggcc gccccgatcg tgcgggcata tgccgagtcg ctcacccagg tgttcctttg 2127361 cgcggtctcg gtcacggtgg tcggtttcat cctggcgctg ttgctgcgag aggtaccgct 2127421 caccgacatc cacgatgacg ccgacgacct cggcgacggg ttcggtgtgc ccagagccga 2127481 atcgccggag gatgtgttgg aaatcgcggt tcggcgtatg ctgccgaacg gggtgcgact 2127541 gcgcgatatt gcgacacaac ccggttgcgg actcggcgtc gccgagctgt gggcccttct 2127601 gcggatctat caataccagc ggctgttcga ggcagtacgg ctgaccgata tcggtagaca 2127661 cctgcacgtg ccctatcagg tctttgaacc cgtcttcgac cgtctggtcc agaccggcta 2127721 cgcggcacgc gacggcgaca tcttgacgct aaccccgtcc gggcaccgtc aggtcgactc 2127781 cctcgcagtt ttgatccgtc agtggctgct cgaccacttg gccgtggcgc ccggcttgaa 2127841 gcgacagcca gaccaccaat tcgaagccgc tctgcagcac gtcaccgacg cggtgctcgt 2127901 tcaacgagac tggtatgaag atctgggcga cctgtcggaa tcacgccaac tcgcggctac 2127961 aacgtagcga tgcttgccgc gcgtagccgc gcgagctgat ccgcgctgca gaatgactgc 2128021 catgacagcc acaccgcttg ccgcggccgc gatcgcccaa ttggaggcag agggcgtcga 2128081 caccgtcatc ggcaccgtcg tgaaccccgc cggactcacc caggccaaga ccgtgccgat 2128141 acgccggacc aacacattcg ccaatcctgg cctcggcgcc agtccggtgt ggcatacctt 2128201 ctgtatcgac caatgcagta ttgcattcac cgcagacatc agtgtggtcg gcgatcaacg 2128261 tctccgcatc gatctgtccg ccttgcgcat catcggcgac gggttggcgt gggcgcccgc 2128321 cgggttcttc gagcaggacg gcacaccggt ccccgcctgc agccgaggaa cactgagccg 2128381 gatcgaggcc gcgcttgctg atgccggcat cgacgcggta atcggccacg aagtcgaatt 2128441 cctcttggtc gacgcggacg gccagcggct gccttcgacg ctgtgggcgc agtacggtgt 2128501 cgccggggtg ctcgagcacg aggcgttcgt ccgcgatgtc aacgccgcgg caacggcagc 2128561 aggcatcgct atcgagcagt tccatcccga atacggtgcc aaccaattcg agatctcgtt 2128621 agcgccgcag ccgccggtcg cggccgccga tcagctggtg ctgacccgcc tcatcatcgg 2128681 ccgtaccgcc cgccggcacg ggttacgcgt gagcctatcg ccagcgccct tcgccggaag 2128741 tatcggatcc ggtgcccacc aacacttctc gctgactatg tcggaaggga tgctgttctc 2128801 cggtgggact ggagcagctg gcatgacctc ggccggggag gccgcggtgg caggagtgct 2128861 tcgcggacta ccggacgccc aaggcatcct gtgcggatcg atcgtgtccg gtctgcgaat 2128921 gcgacccggt aactgggccg gaatctatgc atgctggggt accgaaaacc gggaagcggc 2128981 ggtgcgattc gtcaagggcg gggctggcag cgcgtacggc gggaacgtgg aggtgaaggt 2129041 cgtcgacccg tcggccaacc cgtatctcgc gtcggcggcg atcctcggac tggcactcga 2129101 cggcatgaag accaaggcgg tgttgccgtc ggaaacgacc gtagacccga cacagctgtc 2129161 tgacgtggat cgtgaccgtg ccggcattct gcgacttgct gccgatcagg cggatgcaat 2129221 tgctgtactg gatagttcga aactgcttcg gtgcatcctt ggcgatcccg tggtagatgc 2129281 cgtggtcgcg gtacgccagt tagagcatga gcgctacggt gacctcgatc ctgcgcagct 2129341 ggccgacaag ttccggatgg cttggagtgt gtaacgatgg ccgactccgc cggttcggac 2129401 ctgacgcggc acacggccga agtgccgttg atcgatcagc acgtccacgg atgctggctg 2129461 accgagggga accggcggcg gttcgagaac gcgctcaatg aggccaacac cgaacccctg 2129521 gcagacttcg actcgggatt cgactcacaa ctcgggttcg ccgtgcgcaa ccactgcgct 2129581 cccatccttg gattgcctag gcacgttgat ccgcagactt attgggatcg ccgcagtcaa 2129641 ttcagtgaag ctgaattggc tcgcagattt ctgcaggccg ccggggtaac cgactggctg 2129701 gtggagaccg gaatcggcta cgacgtgtcc ggaatggcaa gcgtcgccgg cctcggcgaa 2129761 ctgtcgggca gccacgctca cgaggtggtt cgtcttgaac aggtggccga acaggccgtg 2129821 caggcatccg gcgactacgc ctcggcgttc aacgagatac tgcgccggcg cgcagccaca 2129881 gcggtggcaa ccaagtccat cctggcctat cgaggtggat tcgacggtga tctgaccgag 2129941 ccacccgcgg cgcaggtcgc cgaggccgcc aagcgctggc gcgaccgtgg cggtgtccga 2130001 ttacaggatc gggttctgct gcgcttcggg ttgcatcagg cgttgcgcct gggcaagccg 2130061 ctgcagttcc acgtcggatt tggcgaccgg gacgctgatc tgcacaaggc caatccgctg 2130121 tatctgctcg acttcctgcg gcagtccggc aataccccaa tcgtgttgct gcactgctat 2130181 ccctacgaac gagaagccgg ttatctggca caagccttca acaacgtcta tcttgacggc 2130241 gggttgagtg tgcactacct gggggcccgg tcgccggcct tcatcggccg actactggag 2130301 cttgccccct tccgcaagat cgtgtactcg tcggacggat tcggccccgc ggaactgcac 2130361 tttctcggtg caacgttgtg gcgcagtgga attcagcgtg ttctgcgtgg ctttgtcgag 2130421 cgcgacgact ggtgcgagac cgatgccctg cgggtggtcg acctaattgc ccatggcact 2130481 gccgcacgca tctatcgcct tggcgatcgg tagctttcag gtggcgcaag tgtggccccg 2130541 tcacgggcta accatggacc gtgccggacc cagtgtcacc ggcagcgtcg accaaccgcg 2130601 cagcacccgc gtgtcacgcc gacttccggc acccgcggcc cgcacatcgg ggaagcggtc 2130661 gaagaacgtt ctcagcccga cctcgccttc ggcgcgggcc agggcggccc ccaggcagaa 2130721 gtggcggccg gtagagaacg caagatgtcg tccggcattg gggcgttcga tgtcaaagcg 2130781 gtgcggatcc gggaacacag cgggatcgcg gttggcggct gctaggtaga tcaccacgac 2130841 ttcgccgcgt ttgattcgca caccagccac ctcgacgtca cggcaagcca cccgggcggt 2130901 gagctgaacc ggcgaatcca gccgcaggat ttcttcaacc gtattcggcc acagctccgg 2130961 atgttggcgc agtgtggcca gatgttcggg ggtatccaac aacatgcgaa tcccgttgcc 2131021 taacaggttc actgtggttt cgaatccggc gaccaaaacc agtccggcga tcgcccgaag 2131081 ttcggtctcg tcgagctgtg tctcgttgtc cccgctttcg gcgatctgga tcaactgact 2131141 catcaggtcg tcacccggag cgtgccgcaa ctgctgcaga tgcccttcca gccagcagtc 2131201 gaatcctcgt atcccctgct gcacacgcag gtactgccgc cacggaatcc cgatgtctag 2131261 actcggcgct gccaactcac caaattccag gacgcgcggc ctgtcatgct cgggcacgcc 2131321 caaaatttcg ctgatgacca cgatcggcag ttgcgagcaa tagcgtccta cgacgtccac 2131381 aatcccgggc tgctcagcga accgatccaa gagattgatc gcggtctgtt cgaccagatc 2131441 gcgtagcgcg ctgaccgccc gtgaggtgaa caccgccgac accgttttgc ggtagcgagt 2131501 gtgatcgggc ggctcgacgg ccagcagcga aggttctcgc agggggtgaa gttgatcgcc 2131561 gcgggtccgc cgctccagcc agcgcagcgg tggtggcaga ttctcgccga aggagacgac 2131621 gcggaagtcg tccgatcgca gcaggtcatg ggcgagccga tggtcgacgg tcaggtagtt 2131681 ggcgcggttg cgcaccaggg cgccgtggga ccggacttcg tcgtaaaagg gcaccggatc 2131741 ggtggcgacg gccggatccg cgatcagccg ggcctgcaag tcgccacgcc gaatcccgat 2131801 tgccgcaatg ccgcggatca ccccgtgcat cgccaaccag tgcagcttgt ccttcaccgc 2131861 gcctccgtcg atcgagtggc ttttcttcaa gactagaacc cgcaattcaa cattcggcga 2131921 ggatgttgaa gtctgttgac accaccgtgt tgggtttttt gctgctgatg ccgtaggcac 2131981 tgccggcaac tgtgtatgtg ttgcgggcgc ggtcggcgcg ggcgttgccc accccgccgt 2132041 gccagaagct gccgttgaac ccgtcgacgt tacggatctt cacccactgc gggattaccc 2132101 tgtcaccgct gagcaatacc acggcttgga cggtgctgtc gtggttgcgg atgtcgatgg 2132161 tccggtagga gtgctcctgg ctgcatgttg cgggacgtgt cgtgtgagtg acgccgtcaa 2132221 tagtcaggcg tgcggctttt cggggtaccg tctgagcttg cccgcacgcg gagagaccag 2132281 ccgcgacgac cattgcaact ccggtcactg tgaccaaccg attgcacacc agccacctcc 2132341 attcgggcct gagcattgtg ctcgggacat tacttccgtt ttggctccaa cgtggccagg 2132401 gacttggcaa tgtgacgtcg gacgaactcc ggactgacgc ccttgagccg atcaatccag 2132461 cgaatgcttc ggggcacata ccaatgcaac cgtgtggggt gctggtaggc ccgccaggct 2132521 gcctcggcga cgctggacga gggcatcagc cggaacatgc ccttcttggg cgcggcagcg 2132581 cggatctgct ccgcggagat cgtgtagggg ccctcgtcgg aatgctggcg cgtcgaggtg 2132641 aggatagcgg tgtcgatcag accgggcagc acgtcggcga cgcgaacccc atgacgctgc 2132701 cactcaacgc tcaacgcctc ggtcaacccc ttgacggcgt gtttggtcgc cgagtagacc 2132761 gcgatacgcg gcatgccata ggtgcccgag gacgacgacg tcgagaacat cagacttccc 2132821 ggtgctttct tgaggtaagg cagtgcggcg taggcgccag tgagcaccgc cttgaagttc 2132881 acgtcgacga cgcgcacggc ggcctcgtac ggcacgtcct cgaaccaacc gccttcgccg 2132941 atgccggcgt tgttccacat catgtcgaga ccgccgccga cattgccggc gcagaaatca 2133001 gcgagcgcac cctcaagggc cgccttgtcc gtaacgtcga cggcgcgggc ccacagccgt 2133061 tcggcaccaa gctgtacgcg cagggcagcc agcccatcct cattgcggtc tatcgcacct 2133121 actcgccagc cgttggcgtg gaaaagcgtt gcaccctcgc ggcccattcc actgccggcg 2133181 ccggtgatga atatcgcttt catgcggaat ccggaatagc cgaaccgccc tcagcctgct 2133241 tcaaccagat ctttgatgcg ctgcaacgtc ttggtcatgt ctcggatgtt gcggcgctga 2133301 cgcagccagc ccccgaacac ccggtagtac acggtggtca acacggacgg ggggagccga 2133361 aacgactcag tgacctcggt gccgtcggcg gtgggcgtca aacgataatg ccaattgttc 2133421 accggtctgt cgccgagcag cacagcaaac ccgaactcac ggcccggttc gcataccgtc 2133481 cagtagaccg gcccgatccc gttgcgccgg acatgcccgc ggaatcgagc gccaagcgcg 2133541 gggccggtgg caccgtcaag ccactcggcc tcgaaggttt ccggcgagaa ccggccggta 2133601 ttgcggacat ccgcgatcaa tgtccagatc ttgtccggcg gcgctgccat gtgaactgtg 2133661 gccgaacctt ccatgacctg atccaaacac atacgtcgac ctggtcatag accgcacacg 2133721 ccgccaaccg tcagcgcgga atacttgcct gaatgcctgc ccaaatgatc tcgttgatga 2133781 tttgcttgat gccctgcgcg ggtttcgacc acagtgcgat cggaaggcca gaggcggcgc 2133841 cgcacgtcgg ccacgcgtcc aatccctgtt cggcgagaac ccgattggca actgcgattt 2133901 gttgttcccg agaggcagct gctgggttgc cgacaccgcc gaatgcggcc caggtggccg 2133961 gcttgaactg cagtccgccg tatttgccgt ttccggtgtt ggccgcccag ttgcccccgg 2134021 attcgcactg cgcgacggcg tcccagttcg ggctgggacc ggcgtgggca acggcggtgg 2134081 agagcgacat ggatgccgtg acgagtcctg cggccatggc ggacttgatg agcggcttgg 2134141 cgattcttgt catgctcgac atatcgccgg aagtggccga agcgttaccg attagagaga 2134201 gtggtgagat cgggtgtcta ttgcaccgcg accggccgtg gtcggccggc aaaggatgca 2134261 caaccggatt gatcaggccg gcggtagggc ctggcaatac gactgtgttg ctgtcgtcag 2134321 ggcccgttga tagaggctat cgaggtggcg ggaccgcact atgtcgcgtt tggcgcggtc 2134381 gagttgggcg gcgcaggacg gcgcggacag caaactccag tgactccaaa tctgcgacag 2134441 catccgatta ttcagggagt cgatcgccga tcgcgatgcc gatagatccg gcggctccgg 2134501 gggcgcgctg gccgggttga gcttccagtc cgagaaccgg ctgtactcga ttgcctcggt 2134561 ggcgcgaatc tggtcgtcga agacgcgggt gacgtagtcg gggtcgatgt gctgcgagcg 2134621 ggcatcttcg cccaactttg cgagttgctg ttcgactcgg ccggaatcct caatgggcag 2134681 ctgagcacgc cacttgaagg ctgccaccgg gtcggcgacc tccaaccgct cagcggcggc 2134741 gtcgaccaac tcggctaact ggctggtgcc gtcggctcgc gccagcgggg ggcctagtgg 2134801 tgcaatcagc gacaacagga tgccgatcga gacggcggtc gcgaggtata tctcacgtgg 2134861 acgggtaagc aacccttcgg ttgatcccgt cagccggcgc ctaacgaact ctgcaggtca 2134921 cccttcatgg cgttgagctg agcgccccag tactcccagc tgtgcgtgcc gttgggcggg 2134981 aagttgaaca cggcgttgtg cccgcccgcg gcgttgtacg catcctggaa cttcaggttg 2135041 ctgctacgaa cgaagttctc caagaactcg gcgggtatgt tggcaccgcc caactcgttc 2135101 ggggtgccgt tcccgcaata aacccatagc cgggtgttgt ttgcgaccag cttggggatc 2135161 tgctgcgtag ggtcgttgcg ctcccatgcc gggtcactcg agggacccca catgtctgcg 2135221 gccttgtaac cgccggcgtc acccatcgcg aggccgatca ggctaggccc catcccctga 2135281 gaggggtcca gcagggccga cagcgagccg gcgtagatga actgctgggg gtggtaggcg 2135341 gccaagatca ttgccgacga gccggccatc gacaagccga ttgcagcgct gccggtgggc 2135401 ttcacggccc tgttggcgga caaccattgc ggcagctcgc tggtcaggaa ggtttcccac 2135461 ttgtaagtct ggcagccagc cttaccgcag gccgggctgt accagtcgct gtagaagctg 2135521 gactgcccgc cgaccggcat gactatcgac agtcccgact ggtagtacca ctcgaacgcc 2135581 ggggtgttga tatcccagcc gttgtagtcg tcttgggcgc gcaggccgtc gagcagataa 2135641 accgcaggtg agttgttccc accgctctgg aactgaacct tgatgtcgcg gcccatcgac 2135701 ggcgacggca cctgcaggta ctcgaccggc agccccggcc gggagaacgc gcccgcggtt 2135761 gccgctccgc cggcaagccc caccaggccc ggaaggacta cagccgctgc cgtgccgatc 2135821 atcaatcggc gtccccaagc tcgaatcttt cggctcacgt ctgtcatact tgtgcccctt 2135881 tgtcctgtat gtcgtcgtgt gctcgggcca gaacataccg tgtgtggagg ccaaatgtcg 2135941 attcgggcgc aaagtcgtct catttccgta tcggttaccg ccgcggacag agcaagtgtg 2136001 cttagggggc tcacaaacgg tatggcggta tggatctatc gcggatttct cagaatcgcg 2136061 gcccggggct accggctgtg ctcccccagg gaggccgaac ttgcgttcac cgcgtaggct 2136121 cgctcgaagc aagccgacga agaccacgct atcccggtct gttccggcgt ccgcgtaaca 2136181 ccgcactggg gtttgtggcg tgcgatggtg cgggctgagg gcatcggagg ttccgggaac 2136241 gattgaggtg cgagaatttg gacacggtac ttgggctctc gataacgcct accaccctgg 2136301 ggtgggtcct cgctgaagga cacggcgcag acggcgccat cttggaccgc aacgaattgg 2136361 agctacatag cggtcgtaac gcgcaggcca tacataccgc agagcagctg gcggcggaag 2136421 ttctgctcgc ccatgaagtg gccgctgcag gcgatcatcg gttgcgcgtc atcggagtga 2136481 cctggaacgc cgaagcttcg gctcaggcgg cgctgctggt agagtcgctg accggtgcag 2136541 gtttcgacaa tgtggtgccg gttcggcggc tacgtgccat cgagacactg gcgcaggcta 2136601 tcgcacccgt tatcggctac gagcaaatcg cggtatgcgt tcttgagcat gagtcggcga 2136661 ccgtcgtcat ggtcgacacc cacgacggaa agacgcagat cgccgtcaag catgtgtgcc 2136721 gcggattatc aggactgacc tcctggctga ccggcatgtt tggtcgcgat gcctggcgcc 2136781 cggccggcgt ggtcgtggtc ggctcggata gcgaggtcag cgaattctcg tggcagctcg 2136841 aaagggtcct gccggtgccg gtctttgcgc aaacgatggc gcaggttacg gtcgcgcggg 2136901 gtgcggccct ggcggcggcc cagagcaccg agttcaccga tgcgcagcta gtggccgaca 2136961 gcgtcagcca accaacggtc gcgcccaggc gatcccggca ctacgccggg gcggcggcag 2137021 cgttggccgc cgcggccgtg accttcgtgg cttcgctgtc cctagcggtg ggcatccagc 2137081 tggctccgca caacgatacc gggacggcga agcacggagc gcacaagccg acgccacgta 2137141 tcgcaaaggc cgtggcgccg gcggtgccgc ctccgccgac ggtcacgcca ccagtccctg 2137201 ctcgggcacc ccggccggct gcgcagcacg aaccacccgc tcgcgtcacc tccggcgaag 2137261 cgctcacgga gccgaacccg cctgaggagc aaccgaatgc ttctgcgccg caacaggatc 2137321 ggaatgacag ccagccgatc actcgagtgc tagagcacat acccggcgct tacggtgact 2137381 cggcaccccc agctgagtag tcggaggccg ccgtagccgg ttgcgaaacc tgttcgcgcg 2137441 gacccatgtc gaggcgaagc ggtgggtact cgtcgcgcat cagcgtggtg tatgcgcgga 2137501 cccgcaacgc ccaccggttt acaggttgta tagccctatg gggtaccggc cggtgaacag 2137561 gagcgccacc acggcgacga gcagcaggat caccagtagc gaaggccaca ttatgccgac 2137621 gcggtcgtgg gggtcgatca ggaagactcg ccaaccgctg gagaggaaga ccgccaggat 2137681 caggtagtgg gggatagcaa gtagccacca cttaatcagc accaggccgc ggctcaaccg 2137741 ctccggatag tcaacctcca agtcagccgg atactccgcc tttgtctgca ggctgaaggg 2137801 cgggtaccgg tcggttccca gcgccgacag cgcatagaag gcaacccgcc agcgccaccg 2137861 catgacgccg acattgaagt cgaacagcgt ccggggatat ctgcccgtga acaggatggc 2137921 aaagaacgcg atcacggtga ccaccacggc ggcaacgtgc aagaagaaca agacaatgta 2137981 gtgcgggatg gccaaaaacc acttgactag ccactgccaa cgtgacaacg caggatcgag 2138041 gtcaccccgg acccggactg gataggcgtc aggttgcatg atcgacggct cctttacatg 2138101 cgcgtcggct cgatccacga gccaggccca ttgtctctca tctgccgcgc atgggcgaag 2138161 ccatcgtcgt gcgctacgga caccggatcg acgtgcagta atagccttgg gctgtaggca 2138221 gctttccggg cgatgacggc ggcactggta atccattgtc ggccaacaat ttactgagag 2138281 gggtcggtac agattgccag ccgtggctat ccaggtacgt ggcaacgtcg ttgcgctcgc 2138341 cgtcgctgta tagcaggttg ggtgatttct atgtcgaacc cgtggacgct ccagcgctgc 2138401 gaagcggtgt caaggaggtc ggatgccggt ccgccatcgt catctaccga ctccccggcg 2138461 cgctgagcgc ggtgatgttg tccagcaagc gatcctgagc gtcgggcggc aggaatgcca 2138521 gcaggccctc ggcgagccag gcactgggtt ggttggggtc aaagcccgct tggcgcaacg 2138581 gggtcggcca gtcacgtctg aggtcgacgg gaaccacgcg aggtcagctg ttgcagtggc 2138641 atccagatca gcaagtgtcg tcatcggtat gcgcgcgcgt caagacctga ggccaggatg 2138701 acagcctgcc gaatccccgc agacgtcgcg tcacggaaaa acgcatcgaa gtaccgcgtc 2138761 cgggcggcca acaggtcggc caatcgctgc agtccccacg tgccgtcggg gtcgtcgacg 2138821 tcggtcgcct tgatgtttcc ggcggcccag cgagtgaaga agtcaatccc cacggctctc 2138881 accagtggtt cggcgaacgg atcatcgatc agcgggttat cggccctggt ggccaccgcg 2138941 cgggcggcag ccaccatcgt cgccgttgct ccgacgctag ttgccaggtc ccacgcgtcg 2139001 ttgttggtgc gcggcatcgg gatcctttcg gctcggccag cgatatacag ccttcgaagt 2139061 ccaccgcttg tgggatcaat cgtcctttgc ccgaaccgcg gtgaatgcca cgctcacttc 2139121 gtcggcgacc cgtattgaac ccatcaacag cgagtagggt ttgacgccgt agttggactg 2139181 gcgaaccgtg gtgtcggcag agatgcgcca cgcagcacca agatcctctg tgtgcaagtc 2139241 gatgacgtgt tctcgcgact ttccccggat gtgcagtttc ccggtcaggc ggtacccatt 2139301 cccggtctgg gcaatggctt ccgtggtaaa gcgaatatgg gggaagcggc tggcgttgag 2139361 cgttttcagc gcgttcgccc gcaccagagc tttctcaggc tcggacagcc ccttcacgcc 2139421 accctcaccg cgcatcacct cgaaggaatc cacctcagcc acaagctcgc cggcgacggg 2139481 atcggtgccg gaccagttca ccagggcctg ccaccgtgtc atcgcgatgg tcaggcgatg 2139541 acccaagcgc gcggctctgc caacgactcc ggtgcgaagt accagctcgc cgtcggaagc 2139601 atcaagagtc cacaccgcgt cgctcacgcc acgactgtat tcagacgacc tgcctgcccg 2139661 cccctcccgc cgcgtcttgt gggccacgac acaatcgtta tgcttggtga ggctcgccgg 2139721 tgccgttgga ggggtgcaac atgattcgcg aactggtcac caccgctgcg atcacgggtg 2139781 ccgcgatcgg tggggcgcca gtcgcgggcg cagacccgca gcgttatgac ggcgatgtgc 2139841 cggggatgaa ctatgacgct tcgctgggcg ccccatgctc cagctgggag cgcttcattt 2139901 ttggacgagg cccctccggt caggccgaag cctgtcattt tccgcctcct aaccagttcc 2139961 cgccggccga aaccggctac tgggtgatct cctacccgct atacggcgtc cagcaggtcg 2140021 gtgcgccgtg tccgaagccg caggcggccg cgcagtctcc ggatgggttg ccgatgctgt 2140081 gtctgggagc ccgtggatgg cagccgggat ggtttaccgg ggccgggttc ttccctccgg 2140141 agccataacc ggtgggcgtt tctcatgatc atgtgcgaag gccggcccac cgaatcaccg 2140201 atcccacggt ggctgcgctt cgtgcttacg tctgaccgtg ccggctcggc atggtatatc 2140261 ggggcaggct tcttcttcgc gccagtgctg gcggtgcttt cgccatggcc gaccatcacc 2140321 gcggtgctgt ggtggatcat cggactggcg ggactatggc tcggactgct cggaatcgcg 2140381 atggcagtcg gactggcccg ggtgttgcgt tccggcgccg aaataccgga agcctactgg 2140441 cgcacgctgg tcgactaccg atccgccaac gaataggaga ctccgatgag cttcaatccc 2140501 aaagatgcgg tcgacgctgt ccgggacatt gcggccaatg ccgtcgagaa ggcctcggac 2140561 atcgtggaaa acgccggcca catcatccgc ggcgacatcg ctggcggggc cagcggcatc 2140621 gtcaaggact ccatcgacat cgccacccac gcggtcgaca gaacgaaaga agtgttcacc 2140681 ggcaagacgg acgacgaagg ttagtcgaga ctagtcggcg cgcgcttgtc gtccgttgtc 2140741 aaacggacgc ggcagcattg agtgcgtcca accgggcggt cgcctcgagg tactcctgca 2140801 cccagcgttc gataacggta gccgtctttt ccaccttggt gaactgccca acaacctgcc 2140861 ccaccgggtt gaacgcgacg tcgacggtct cgttcgggta tttatgtgtg gctttgacgg 2140921 ccatgccgga gaccatgtat tgcaacggca taccgagcgg cttcgggctc tccggttgct 2140981 cccaggcctc agtccagtcg ttgcgcagca tccgggccgg cttacccgtg aaggaacgac 2141041 tgcgcacggt gtcgcggctg gtcgccttga cgtatgcggc ctgttgaacc gcggtgtttg 2141101 cggcttcctc gaccatcagc cactgcgaac cggtccatgc cccttgggtc cccagcgcca 2141161 acgctgcagc gatctgctga ccgctgccga tgccacccgc cgccaacacc ggaaccggcg 2141221 ctacctcctt gacgacctga ggccacaaca caatggagcc cacctcgcca cagtgcccgc 2141281 cggcctcgcc gccctgggcg atgatgatgt cgacgcccgc atcggcgtgc ttgcgggcct 2141341 gcgagggtga gccgcacaat gcggccacct tgcgacccga gtcgtggatg tgcttgatca 2141401 tgtccgctgg gggggtgcca agcgcgttgg cgaccatcgt catcttgggg tgcttcagcg 2141461 ccgcgtcgac ctgtggggtg gccgtcgcct cggtccaacc gagcagctgc agactgtcct 2141521 cgtcggcgtc ctcgaccggg acaccatgat cggcgaggat cttgcgggcg aagtccagat 2141581 gctcctgcgg gaccatcgac cgcagcgtct tggcgagctc atccgccgac agctgggagt 2141641 ccatgccctc gtacttgttc gggatcacga tgtcgacccc gtaggggtgg tcgccgatgt 2141701 gttcatcgat ccagttgagc tcgatctcca gctgctccgg cgtgaaccca actgctccga 2141761 gcacaccaaa accaccagct ttgctgacgg cgaccaccac atcgcggcag tgagtgaagg 2141821 caaaaatagg aaactcgata ccgagctcgt cgcaaatggc agtgtgcatg cctgctcctg 2141881 gaatgctagc ggacgcaaat agaactgaaa cgtgttctag tttagtaccc gtcttggtaa 2141941 ggtggccaac agcccaggtt ccggtcgggt ttcggcgcgc accccggcga agctgacgag 2142001 gcggtctaag gtcaccttca cccgcgcatg gccggccagc aacaacgacg gctgtcccac 2142061 cgagcagaag tactgggcga tggtgtgcac cgcggtcggc taccaccgcg acgaccccgc 2142121 cgcagaactg ctgttacgca acgaaggctt ggcagctgca gtccaaactg gccacctacg 2142181 tctacccgcc acagaaacta gtcgccaagg tccgtgcggg cgccaaagtg tccgacaacc 2142241 acgaccaggc gaccactctg ttccaccacg cgatcgatca cccaaccgtg accgtgcagc 2142301 agacctactc cctgatcaac cctcaatcgg ccccggggcg atggaccttg atccgctggg 2142361 gccccgccgg tagcctagtg ctgcgaatta cgctatgccg agtctcggaa ttgccggccc 2142421 gccgttcacc acgttcaaac gcccgagacc ggtgccaggc aggtacgcga acctcatggg 2142481 tctcaattcg ttctgccaca aagaaagtga gtaagccagc atgcgtgcgg tagtcatcga 2142541 cggggccggc agcgtcagag tcaacaccca gcccgacccg gcactgcccg ggcctgacgg 2142601 agtggttgtc gccgtgaccg ccgccggcat ctgcggatcc gatctgcatt tctacgaagg 2142661 cgaatatccg ttcaccgagc cggtggccct cggtcacgag gcggtaggca ccatcgtcga 2142721 ggccgggcca caggtgcgca ccgtcggagt tggcgacctg gtcatggtgt cttcagtggc 2142781 cggctgcggc gtctgcccgg gatgcgaaac ccatgatcca gtcatgtgct tctccggccc 2142841 gatgatcttc ggcgccggcg tgcttggcgg cgcacaggcc gatctgctgg cggtgccggc 2142901 cgccgatttc caggtgctca agatccccga aggtatcacc accgagcagg cactgctgct 2142961 cacggacaac ctcgccaccg gttgggcggc agcccaacga gccgatattt cattcggctc 2143021 cgccgtggcg gtcatcggcc tgggagccgt cggcctctgc gcgctgcgca gcgccttcat 2143081 acacggtgcc gcaacggttt tcgctgtcga ccgagtaaag ggacgcttgc aacgcgcggc 2143141 cacctggggt gctacgccga taccgtcacc ggcggccgag acgattctgg ccgcgacgcg 2143201 gggtcgcggc gcagactcgg tgattgacgc cgtcggcacc gacgcctcga tgagcgacgc 2143261 gctcaatgcg gtgcgccctg gcggcaccgt ctcggttgtc ggcgtgcacg atcttcagcc 2143321 gtttcccgtg cccgcactga cgtgcctgtt gcgaagcatc acgctgcgaa tgaccatggc 2143381 accggtacaa cgaacctggc cggaactgat cccgttgctg cagtcgggcc gactcgatgt 2143441 cgatggcatc ttcactacca ccctgccgtt ggacgaagcg gccaagggct atgcaaccgc 2143501 gagggcgcgc tcgggtgagg agctaaggtt ctgcttacgc cctgacagcc gtgatgtact 2143561 gggagcgcat gaaactgtcg atctttacgt ccacgtccgg cggtgtcagt ccgtagccga 2143621 cctgcagctc gagggtgctg cggacggggt cgacggccca tccatgctca actagccact 2143681 ccacgggatc ggtcttgtcg tcgtaggtga gcgcggagaa attcacgtca ccagacatat 2143741 tgacccccgg gtgtgcggtt tccagcgcgg cgagctgctc gtgatccaac cgggacccta 2143801 aggcgcccaa ggcaactcgg ctgccaggcg cacacaactc atcgatccgg gcgaacagag 2143861 catattgcgc atcgccggtc aggtagggca gtagtccctc gaccgaccag gcgctgggtc 2143921 gttgcggatc gaacccggcc gctgtcagcg gcgtgggcca gtccgtacgc agatctgctg 2143981 gcaccgccac ccggtgagct ttgggtacag caccccgctc acttagcacc cgtgctttga 2144041 attccaggac cttcggcaca tcgatctcga aaaccgttgt cccgggctgc cagtcaaggc 2144101 gataagcgcg gcagtccaga ccggcggcga cgatcaccgc ctgtcgtatg ccagcctcat 2144161 cagcgcagtt gaagaagtcg tcgaaaaacc gggtttgcac gccgtagagc cgagggaaag 2144221 cggtgccgtc ctccgacgtt ctcgggtttg ctaacagacc ctccagatac gggtcggccg 2144281 aagcggtgat gaaatgcttc gcgtattcgt cttggaccag cggtttaggg cccgtggtgt 2144341 gcagtgcacg ccaacccgca accagtagcg cggtgtagcc cacgttgctg acaatgtccc 2144401 agtggtcgtc atcggaacga agcgagccat actcaggtgt agtcatctca tcagccttcc 2144461 agcattacgg tcaccggacc gtcgttgacc agttcgacct gcatgtgggc accgaacacg 2144521 ccggcttcca cgtgcgctcc caactggcgc agcgctgccg cgaacgctgc tatcaggggc 2144581 tgcgccaccg cacctggcgc cgcggcgttc caggacggtc gccgaccctt cgcggtgtct 2144641 gcgtagaggg tgaactggct gattaccagg atcggtgcgt gcatgtcgga ggcggatttc 2144701 tcgtcggcga gaacccgcaa attccagagc ttttcggcga gacggcgcgc cttgtcgaga 2144761 tcgtcgccgt gggtgacacc gacgaacgcg accaggccct gcccgtccgg ccggatagcg 2144821 ccgaccaccc gaccatcgac cctcaccgca gccgatgaga cccgttgcac cagaacccgc 2144881 acgagcctcg atgctgccag gccggctatg cagtcgctgg ggctgggtag gctcattgtg 2144941 tgtctgtgct ggtcgcgttt tccgtcaccc cgctgggcgt gggggagggg gtcggcgaga 2145001 tcgtcaccga agcgattcgc gtggtccgtg attccggcct gccgaaccag acagatgcca 2145061 tgttcaccgt gatcgaaggc gatacctggg cggaagtgat ggccgtcgtg cagcgcgcgg 2145121 tggaggccgt ggccgctcgg gcaccgcgag tcagcgcggt gatcaaggtg gactggcgtc 2145181 ccggggtcac cgacgcgatg acccagaagg tcgctaccgt cgagcggtat cttctccggc 2145241 ctgaatagca gcgctaaacg cccgctcggc cgcatcccca tggaccgcaa ataccaccct 2145301 ttgcagcgac cccggccggt gccgacggac ggcgccgacc atcagccgcg cagcgtcgtc 2145361 gagcggaaag ccgcccacgc ccgtgccgaa agccaccagc gccagcgagc ggcaaccgag 2145421 ctcgtcggct ttccgcaggg tagcagcggt ggctgcggtg atgatctcgc ccgaggtcgg 2145481 acctcctagc tccatcgtcg ccgcgtggat cacgtagcgc gccggcatgt caccggccgt 2145541 ggtctcgacc gcttccccaa gcccaatcgg cgccttctcg gtggactcgc gctgcagctc 2145601 ggggccgccg gcgcgggcga tggccgcagc gacaccaccg gcatgccgca gtcgggtgtt 2145661 cgccgcattg gtgatggcgt cgagctcgag cttggtcacg tcggcctgat gtacctccaa 2145721 ctcgatcatc gacacattgt cccccctgca agtactcggc ggccgcggtg atgcacccct 2145781 tgttgtgttg gaccgtcgcc accatcgccc acacaatcga accctcgccc ggcgccactg 2145841 cggcccatcg agactccggc ccatcgagcc cagattcagg gtagcttgag gtgaacgagg 2145901 acaatcaagc ggctggcaag gacacagacc gatggctcgt gatccagttg taccgagggt 2145961 gcgaacagca tgagtggcga cgacgccggg ccgggcgagg tcagccatgc ccgcggcgtc 2146021 ggtgggccgg gcggagccgg aggcgccggt ggccggggtg gtgccggcgg tcgcggcggg 2146081 gcgggcggta gaggcggaga tggcggcata ggcggggcag cgggccccgg cggtcaaccc 2146141 ggccagggcg gggtgggcgg cgcacccggc cccggtggaa cccccggcga accaggtcag 2146201 cccggcaaac caggacaacc ggggcaaccc ggcagcccgg gacattagcg cgtgcgggtg 2146261 gcgtcgtcgc gcatgagcac gcatagccgc catctgcccg gtacgccctt gagttcctgc 2146321 tcaccacgct cggcgaaccg gtgccgtgat ccggcgacga tgtctcgcac ggtcgaggac 2146381 accagcacct cactgggtcc ggccagcgcg cagacgcgcg caccgatatg cacggccacg 2146441 ccggcgacgt cggtaccgtg cgaggcatcg cgcacctcga cctcgcccgc atgaataccg 2146501 atccggacct caatacccag cgcggcgacc gcgtcgacga tgtcgtccgc gcacgcgatc 2146561 gcggcactcg gactggtgaa cgtcgcgacg aaaccgtcac cggccgtgtt cacttcgcga 2146621 ccgccgaacc gctggatttc gtggcacacg atggtgtcgt ggttgtccaa caggtcgcgc 2146681 catcggtcgt cgccgagcgc ggcggcgtgc tgggtcgagc cgacgatgtc ggtaaacatg 2146741 atggtggcaa gcatgcgctc ggcgtcagcg ccgccgcgca cgccggtgat gaattcctcg 2146801 atttcatcga gcatcggccc ggtgtcgcca acccagtaca gggtatcggt gccgggtagt 2146861 tcgaccaagc gggatccagc gatgtgctcg gcgaggtagc gaccatgtcc caccgggatg 2146921 tacgtcgatc cgacacggtg caagatcagt gttggagcct cgatgtgtcc caagacatct 2146981 cgtacgtcgg cctcggctat gacctttgaa acggcacggg caatgctcgg cggtccggca 2147041 cggttgccgg cgagatccca ccaggctcga aacacgtcat ctccggccac ggtaggagcc 2147101 acgatgctca gcacgtcgaa gccccgctcg acggcatccg gttccagcgc caccgtcagg 2147161 aacgggtcag ctcgacgaac ctgggcgcct accgggtagt cgggcgccca tagtgggcgc 2147221 gccgagccgt tgacgacgat caggctgcgc acccgctcgg ggtagtcggc ggcgagaaca 2147281 agtccgttca tggcgtggaa actgggcgcg aaaattgtcg cctgctcgca tccgaccgcg 2147341 tccatcaccg cgatcgcgtc ctgggcccag aacttcggcc ccagcgtggt tatcgcggcg 2147401 agccgtgacg acaggccgac cccacgatgg tcgaggcgga tcaccctgct gaatgacgca 2147461 agacggcgat ggaaacggta cagcgatggc tcgtcgtcga tcgagtcgat cggcacgaac 2147521 ggccccggca acaccagcag atccgtcgga ccgtcaccca gcacctggta ggcgatatcc 2147581 atgtcgccgc attttgcgta gcgggtcctg tgaatgtggg gagcctgcgc cacggtccta 2147641 cgttagttca tgcgtaggct catggcggtg agcgcacgtg cgggcatcgt gatcaccgga 2147701 accgaggtcc tgaccgggcg ggtccaagac cgcaacggcc cctggatcgc cgatcggctc 2147761 ctggagctcg gggtcgagtt ggcacacatc acgatctgcg gcgaccgtcc cgccgacatc 2147821 gaggcacagc tgcgattcat ggctgagcag ggtgtggacc tgatcgtcac cagcggcggc 2147881 ctggggccga ccgccgacga tatgaccgtc gaggtggtgg cgcgctattg cgggcgcgag 2147941 ctggtgctgg acgacgagct ggagaacagg atcgccaaca tcctcaagaa gctgatgggg 2148001 cgaaatcccg ctattgaacc cgccaacttc gactccatac gcgccgccaa ccgcaaacag 2148061 gccatgattc cggccggatc gcaagtgatc gatccggtgg gcaccgcccc cggtctggtt 2148121 gtgccgggac ggccagcggt gatggtgctt cccgggccac cgcgcgagct gcagccgata 2148181 tggagcaagg ccatccagac ggctccggta caggatgcga ttgccggccg gacgacctac 2148241 cgacaggaga ccatccggat cttcggcctg ccggagtctt ctctggccga cacactgcgt 2148301 gacgccgagg cagccatccc gggttttgac ttagtcgaga tcaccacctg cctgcggcgc 2148361 ggcgagattg aaatggtcac tcgctttgaa ccgaacgccg cgcaagtgta cacgcaattg 2148421 gcacggttat tgcgcgaccg gcacggccac caggtctatt cggaagacgg tgcgtccgtg 2148481 gacgagctgg tcgcaaaatt gctaactggc cgccggatag cgaccgccga atcctgcacc 2148541 gcagggttgc tggcggcacg gctcaccgac cggcccgggt cgtccaagta cgtggcgggc 2148601 gcagtggtgg cctactctaa cgaggcgaag gcacagcttc tcggtgtgga tccggcgctg 2148661 atcgaggccc acggggcggt ttccgagccg gtcgcccagg caatggcagc gggggcgctg 2148721 caaggcttcg gcgccgacac cgccaccgcg atcaccggaa ttgcgggtcc gagtggggga 2148781 acgccggaaa agcctgtggg aacagtgtgc ttcaccgtcc tgctggacga tggccgaaca 2148841 accacccgaa ccgtgcggct gcccgggaac cggtcagaca ttagggagcg ctcgacgact 2148901 gtggcgatgc acctgctgcg gcgcaccctg agcggtatcc cgggctcacc ctagcgacgg 2148961 cgaaatcgac agcagcgcga caaagttcga cgagaagaca ccgcgctaat gtcgatttcg 2149021 atgacgaaca agaaaagcag tttccgtagt accaaagcgg attccggtgg catccttgcc 2149081 aatcgccgtc agcaccgcta cgaccaatag cacgggcacg atcgtcgcgg ccaaggcgaa 2149141 ggggtagcca tgggattcgg ccagacgctc ttgaatagga aggttgaacg ccgccagcag 2149201 attaccgagc tggtaggtta cgccggggta gacgccccgg atagcgtctg gcgacatctc 2149261 ggtcagatgc gcggggatca caccccaggc accctgtacg aagacttgca tcaaaaacga 2149321 acccaggcac aacatcgccg cagtgcgcga gtaagcgaac agcggcacga tcggcagtcc 2149381 cagcgccgca cagaaaacga tggtgtaacg gcggctgaac cgctgggaca acgtgccgaa 2149441 cgccagaccg ccgatgatgg cgccgatgtt gtagatcacc actatccacc tggcggtcag 2149501 gctggacaaa ccggcaccat gatcggtagt cgcggtcagg aaggtcgggt agacatcctg 2149561 ggtgccgtgg ctcatccagt tgaaggcggt catcaacagc actaggtaga caaaccggcg 2149621 cacaattgcg gggttaccca ggacatcgcg gattcgggtc ttggtgagcc gcatgcggtc 2149681 ctgcgcggct tcccagactt cggattcctt tacccggtac cggatgatca agctgatcag 2149741 agccgggatg atgcttaggc cgaacaacca ccgccacgac agccctagcc agttcatcac 2149801 caccagcgct gccacactgg ccagcagata gccgaacgcg tagccctcct gcagcagccc 2149861 ggagaagacg ccacgccgct cggctggaac cttctccatg gacagcgcgg cacccagccc 2149921 ccactctccg cccatgccaa tgccgtagag cagtcgcagg atcaccagca cggtgaagtt 2149981 gggtgcgaat gcgcacagaa atccgatcac cgaatagaac gacacgtcga ccatcagcgg 2150041 gacccgccgg cccacccggt cggcccatag cccgaacagc aacgcaccca cggggcgcat 2150101 ggccagggtg gcggtggtga gaaacgcgac gtcggtcttg gtgtggtgga aggtcgttgc 2150161 gatgtcggca tagaccagca ccacgagaaa gtaatcgaac gcatccatcg tccaacccaa 2150221 gaaagatgcc ataaaagcgt ttcgctggtc gccggtcaac cgcggtgctg ccacgtctgc 2150281 atcgtggcgt accgggcgcg gcaccgcgag tccggggaca tggcgaacag cggcggctcg 2150341 catgtccgtg gcaggatcgg gcaatggtgc cttttctgat gcgcgccgca gtgaccggat 2150401 tcgcattatg ggtggtgact cttttcgtcc cgggcatgcg gtttgcgggc ggcgacacaa 2150461 cgctgcagcg ggtcgccatc atcttcgtcg tcgcggtgat cttcggtctg gtcaacgcgt 2150521 tcatcaagcc catcgtgcag atcttgtcga tcccgttgta catcctgact ctcggtcttt 2150581 tccatgtagt cgttaacgcg tcgatgctgt ggcttaccgc gtggatcact gagcacacca 2150641 cccactgggg actgcagatc gaccacttct ggtggaccgc gatctgggcg gcgatcttgt 2150701 tgtcgatcgt cagctggatc ctgtcgctgt tggctcgtga ctttcgacgt gtcactcgcg 2150761 cacactagag ccacaaattt tggtgggggg acatcctagg ttttcggggc atgttccact 2150821 tatgcttact cacactgctt gccaacctcg tccaagacag gcaccctgtc ttcggcgtga 2150881 tgacgctgac ctcccgccct ccaatacgcc ggacggcagc acctaacagc acacgacgac 2150941 gggactgcaa atgatgcgca ctgtcgcgat tggaccaggt gccggtcctt cgagcacacg 2151001 gccgagttcg caacccagtg acctgcatag cggcctacgc gcggttaccg agtgcaccgg 2151061 ctcagcggtg gtcgttcatg tgggcggcga catcgacgcc agtaacgagg tcgcttggca 2151121 gcgtctggtg agcaagagcg ccgctatcgc catcgcgccg ggtccgttcg tcatcgacat 2151181 tcgggacctc gacttcatgg gatcatgtgc atacgctgtg ttggcccagg agtcggtgcg 2151241 gtgtcgccgg cgcggggtga atatgcggtt ggtgagtaac cagccgatcg tggcccgcac 2151301 cattgccgcg tgcggactgc ggcgactaat tccgctgtat gcaacggtcg agaccgcact 2151361 ggcgccgcct cccagcgcgc attgaccgac ccattaaccg accggtgcca cccaacccgc 2151421 catggtgtcg ggttaaccgc cgccgacaag attgaccacc tcccgcgcac aaccccatga 2151481 cagggtcacg ccgtcacctc cgtggccata gttgtggatg cacagcgctc gcccgatcgg 2151541 ttcagcttcc acccgcacgg acggccgatc aggacgcagc ccggtaatcg tctcaatcac 2151601 tgccgcctcg gcaagccgtg gttgtatgcg gcgacaccgt tgcaggatcc gctcggttat 2151661 ctccggctct ggggtggggt cccacctgcc agggatactg atgccgccgc agactacacg 2151721 ctgcgggtgg gcaaagtagc agatccattc cgagccgccg gtgcgctcga taaacagttg 2151781 ctctagacct ggattggtga ggacgacgtg ctggccgaac cgcggccaga ccgtggcgtc 2151841 gccggccagt tcccgagcgc ccagaccagc acagttgatc actatgggcg ccgcctcagc 2151901 ggcctcggcc agcgaccgta gcgggcgcgt ttcgatttca cagccagtcg ccgccaatcg 2151961 ctgggtcaga cagtcgaggt actggggcat atcgatcatc ggcaaggtgg catgaaaccc 2152021 agcacggaag cccccgggca cgtcggccgg gtcagccggc cgcacgtcgg ggatcagctc 2152081 caacccgggc ggcatcgcac cggtctcgat acgatcgccg acactcagcg ccggcgtcat 2152141 gcgcacgccg gtggcgggat ccttggccaa gtcgcgaaac acgtgcaatg actgttcgat 2152201 ccacccgcgt accttggcaa cgggttcctt cggccgcggc ccccagaccg cacccgccac 2152261 cgccgatgtc gtttgctgcg gcaatgcggc cgcccatacc cgcaccggcc accccgcctc 2152321 ggccaggcat atggccgacg tcagtccgct gacgccggcc ccaatcacga tgacctgttg 2152381 ctcacctatt gccacagcag gaccgtagcc gaagccagcg tcagttaggg ctgaggcact 2152441 cgccctccag tcggtccgag taagccgttg aggatgccga gctgattttg tagttgggcc 2152501 cccgcttcag gtccaggaac tccggcaggg gcagcgcctt cgctgcccgt gttctgccag 2152561 ggttggcagc cgtgcgtctt gaacgccttg tcggtcggct caatcgtcac tacctgtggt 2152621 ttcttgctga gtgcgttatc gatgagcgcg ccatcggggt tacccatccg cttccaatag 2152681 caggtgccgt cgccgacggg tcccgcggag ctgtacgtgc cgggagcgat gtcaatcccc 2152741 accgcatagg tgccgtcgct atcaattgcc gtcttcggtg tcggtgccgg ctccggatcg 2152801 gcgccggcga ggcccacgga tccggcccag cctgcgagga tcaggccggc gacggcaaag 2152861 gctgcagcag gagatggggc tggcttcaag cgcatcacac aatagcctac tggggcctac 2152921 cggtatccgg aactcactcg gcctggaagc aatcactcgt tctcccgccg ccgatgggct 2152981 tgttcgatcc ccatatgcgc ctgcgagcgc acggacggcg cgccaccgac gcagtgtccg 2153041 gcaatgatgc ggtaaatcgc ggacggcgcc aacgcttcca ccgagtcaca gccttgtccg 2153101 ccagcacacc gcccagaccg catgtatcgg aggatgtccg gaagccgttg gccacctccg 2153161 tgtcgagcaa ccaccgctgt cactgcattg ctgtcactaa atcgttgtcc ggcaacacgt 2153221 ttagagcgct cgcgtcaggc tgacctcctg gtggctcgca tcccgagcac cggctgggta 2153281 ccgcgacctt cgtcgaagtc cgccgcccac ggccagcgac cacgccggtc ggcccacacc 2153341 aactgcaagg ccgtcacctt gtcgccaaag atggcgatcg cacaatacaa atgcgcgtcc 2153401 ggatgtgtaa cctggaccgt ttcgacaaga gggccggctg ggagggtggt ctgcataccg 2153461 ggagtcagca agtcaccgac cagagccctg cgagcggcga tgttcaacaa ccgctgccca 2153521 cgtcgtggcg agaggccagt caccaccagt tcgggcaagc cgcgccgggt tagaccaacc 2153581 gtgtaggcaa atggccgtcg ctcgcactcc acgtgctgta ccgcccagcc atgcatgagc 2153641 attatcccgt acacctcgtc gaggtactcc tcggcggtgg cttccgggtg atcgcacatc 2153701 cagcacattt cggcgccctt tctcctcatc cccgtctcgt catccccgtc tcgtcgtgcc 2153761 tgcgaccacc atgcacgcgg ggtctgacaa atcgcgccgg gcaaacacca gcaccccgcg 2153821 agccggtcag ctcgcggggt gctgcggcgg gttgtggttg atcggcgggc agggccgatc 2153881 aacccgaatc agcgcacgtc gaacctgtcg aggttcatca ccttgtccca ggcagcgacg 2153941 aagtcctgca cgaacttcgg ctgcgcgtca tcggcgccat agacctcgac aagcgcccgc 2154001 aactccgagt tggacccgaa gaccaggtcc acgcggctgc cggtccactt caccttgcca 2154061 ctgccatcct tgccctggta ggtcccgtca tctgctggcg agggctccca ggtgataccc 2154121 atgtcgagca ggttcacgaa gaagtcgttg gtcagtgact cggaggcctc ggtgaacacg 2154181 cccagcggta agcgcttgta gtttgcgccg aggacgcgca ggccacctac cagcaccgtc 2154241 atctcagggg cactgagcgt aagcaggttc gccttgtcga gcagcatgta ctcggccggc 2154301 aacgggttgc cctttccgag gtagtttcgg aagccatctg ccttgggctc cagcacggca 2154361 aaggattcca cgtcggtttg ttcctgcgac gcatccgtgc ggcccggggt gaagggcacc 2154421 gtgatgttgt ggccagccgc ctttgctgct ttctctatgg cggcacagcc accgagcacg 2154481 acgaggtcgg cgaaggacac tttgatgttc cccggcgccg cggagttgaa tgactcctgg 2154541 atctcttcca gggtgcgaat gaccttgcgc agatccccgt cggggtcgtt gacctcccac 2154601 ccgacttgtg gctgcaggcg gatgcgacca ccgttggcgc cgccgcgctt gtcgctacca 2154661 cggaacgacg acgccgccgc ccatgcggtc gaaactagct gtgagacagt caatcccgat 2154721 gcccggatct ggctcttaag gctggcaatc tcggcttcgc cgacgaggtc gtggctgacc 2154781 gcagggaccg gatcctgcca cagcagggtc tgcttgggga ccagcggccc aaggtatctc 2154841 gcaacgggac ccatgtctcg gtggatcagc ttgtaccagg ccttggcgaa ctcgtcggcc 2154901 aattcctcgg ggtgttccag ccagcgacgc gtgatccgct catagatcgg atccacccgc 2154961 agcgagaggt cagtggccag catcgtcggg gagcgccctg gcccgccgaa cgggtccggg 2155021 atggtgccgg caccggcgcc gtccttggcg gtgtattgcc aagcgccagc agggctcttc 2155081 gtcagctccc actcgtagcc gtacaggatc tcgaggaaac tgttgtccca tttcgtcggg 2155141 gtgttcgtcc atacgacctc gatgccgctg gtgatcgcgt ccttaccggt tccggtgcca 2155201 tacgagctct tccagcccaa gcccatctgc tccagcggag cagcctcggg ttcggggccg 2155261 accagatcgg ccgggccggc gccatgggtc ttaccgaaag tgtgaccgcc gacgatcagc 2155321 gccgctgttt cgacgtcgtt catggccatg cgccgaaacg tctcgcgaat gtcgaccgcc 2155381 gcggccatgg ggtccgggtt gccgttcggc ccctccgggt tcacgtagat cagccccatc 2155441 tgcaccgcgg ccagcgggtt ctccagatcc cgcttaccgc tgtaacgctc atcgccgagc 2155501 caggtggctt ccttgcccca atagacctca tcgggctccc actggtcgac ccggccgaag 2155561 ccgaacccga acgtcttgaa gcccatcgat tccagcgcgc agttgccggc gaaaacaatc 2155621 aggtccgccc atgagagctt cttgccgtac ttcttcttga ccggccacag cagccggcgc 2155681 gccttgtcca agctggcgtt gtcgggccag ctgttaagcg gcgcgaaccg ctgcatgccg 2155741 cccccggcgc cgccgcggcc gtcgtggatg cggtaggtgc cggcagcgtg ccacgccatc 2155801 cggataaaca gcggcccgta gtggccgtag tcggcgggcc accacggctg cgaggtggtc 2155861 atcacttcct cgatgtcccg cgtcagggcg tcaacgtcga tggtcgcgac ctccgcggca 2155921 tagtcgaacg ccgcacccat cgggtcagcg acggccgggt tttggtgcag taccttcaga 2155981 ttgagccggt tgggccacca gtcctggttt ccgccgccct cgacggggta tttcatatga 2156041 cccacgacgg gacagccgtt gctagcggct ccggtggtgg tttctgtaat gggtgggtgt 2156101 tgctcgggca cagcattcct tccaggagtt ggtgttatcg ggctgtgatc acggatgtga 2156161 tcgcgaagtg tcggatatcg aacaatcagg acatagaccc cagtagatga cctccgcctc 2156221 gtccaacagg aagccgttat ggtccgaggc cgtcagacag ggtgcctcgc caacagcaca 2156281 gtcgacatcg gcgataaccc cgcaagaccg gcagacgatg tgatggtggt tgtcgccgac 2156341 cctggactcg tagcgcgcga cggagcccga gggttggatc tttcgcacca agcccgcggc 2156401 ggtcagggca tgcagcacgt cgtacacggc ttgccgggat acgtcgggca gcgcaaaacg 2156461 cacggcaccg aaaatcgttt ccgtgtcggc gtgtggatgc gcattcactg cttccaggac 2156521 ggcgacgcgc ggtcgggtca cgcgcaggtc ggccgtccgg agctgttcgg cgtagtccgg 2156581 tatagaggac acactagaca atatgactcc cttttctgga atcagtcaag actttggcta 2156641 gcgtgacagg cgtctgctag gacccgatcg ccccggggcc gctggatcgt gggatggcgg 2156701 gtggatcagc cttcgtatgt tccgatgagc cgggcctgca tggtggcggc ctgcgcgatc 2156761 acccgcgccg cttgtgtccc agccagtccc gcgagtggag gcacggcagg aaggtggtag 2156821 agggtaaacc ggtagtggtg tgtcccggtg cccgccggcg ggcaggggcc ggtgtatgcg 2156881 ggctgaccgc tggagttcgg caggctgatt ccgccaccgg gagtctcacc atcggcggtg 2156941 ctgccagcac caggggcgat cccgatcacg atccaatgga cgtaaggttc gcgaggtgcg 2157001 tccggatcat cgacaacgag tgcgccgcca aacggcgccg accaggtcaa cggaggcgcg 2157061 atattggctc ctttgcaggt gtactgttcc gggatcggcg caccgtcggc gaatgccgga 2157121 ctgctgattg tcagtacatc gccggtaggc gtttcgggca tactccgacc gagcgctgct 2157181 gctttcggcg ccagcggcgc cgcctttcga ctgtcaccgt tgccaccgta ggcaactagc 2157241 gccacgggga gcgccagccc caagatggcc agtgcgaacc ggtgaaatgc gtgcgccact 2157301 gtcgattcca tattgatcat tgtcgccagg cgcaattgga gaagccaggg tttcgaccac 2157361 ctcgccaggg atgccgcggc gtcagccttc gaatgtgccg acgagccggg cctgtccgct 2157421 ggcggcctgt gctatcgcct gtgccgcttg gactcccgtg gctcccggtg gcagctggag 2157481 cgcgacagga aggtggtaga gggtaaaccg gtagtggtgt gtcccggtgc ccgccggcgg 2157541 gcatggaccg aagtatcctt gccgaccacc agaattcggc acgctgtgcc caccagcagg 2157601 agtctgacca tccgccgtgc tgccagagcc aggggcgatt ccggtcacga tccagtgcac 2157661 gtacagtccg ccgaccgcgt cggggtcatc gacgacgagt gccagttcgg ctgcgcccgc 2157721 gggcgacgac cacgtcaacg gtggcgccac gttggccccc ttgcagctga attgcaccgg 2157781 gatcggggcg ccgtcggcga acatgggact ggcgatcgtc agtggctcgg cggccggcgc 2157841 cggcgttgtt gcgtcgacgg tcgtcgcttt cggcacgtat ggcggtgtct ctcgactgtc 2157901 accgcccccg cccccgcagc cacccagcgc cactacgagc gccagccccg cggtggctaa 2157961 tggggttcgg tgaagtgtgc tcgtcattgg agattccata gcacattgtt actaactggg 2158021 attcgagagt acagctgttt tgcggccgcg cttaccagac agccgggccc cgggccaccc 2158081 atcgcctcac ggtaccagca ccaccttgtc gacgttctcc cgtgcggcca gaatccgatg 2158141 tgcttcagga gcttcggcga acggcacgat tgcatgaacg atcggcagga tcgttccgtc 2158201 gttgagcgcc ttggtcagcg gcgcgatcca gggttcaagg gtgcggcgat cgtcccacaa 2158261 ccgcagcatg ttaagaccga tcacggtttt cgactcctcg agttgtttca tcaggttaaa 2158321 gccgcgcagc attgacaacg cgtggggcgc caccctgcgc atcgatcgtt tctcgccgtg 2158381 ctgcatattc gaaatcccgt agccaaccag ccttccaccc gggcgcagca gagtgtagga 2158441 ccgccgcagc gaggtgccgc cgagcgcgtc aagcacgacg tcatacgggc ccaatccctg 2158501 ccaccagccg tcccggcggt agtcgatcgc gcggtccaca ccgaactcgg ccagcttctg 2158561 atgtttttgg ggtgatgcgg tgccgtgcac ttcggccttg gctgctttcg cgaattggac 2158621 cgccgcgatg ccgactccac cggccgcggc gtgaatcagc acccgctcac cggcgcgcaa 2158681 cgatccgtag ccgtgcagcg ccgcccaggc ggtcgcgtaa ttcaccggga ccgcggcacc 2158741 ctgttcgaag ctcagcgcat cggggagcac aaccgagtcg gtggccgcaa cgttgacgat 2158801 ctcgcagtag ccaccaaatc gtgtaccggc caggactcgt tcgccgaccc ggttcgggtc 2158861 gaccccatca ccgacagcct cgaccgtccc agcgacttcg tatccgacca ccgccggaag 2158921 tttcggcgcg tctgggtaca ggccgacgcg ggcgagatgg tcagcgaagt tcacccctgc 2158981 tgcgcggacg gcgacccgca gctggcccgg gcccggtggc ggcgggtccg gtcgctgccg 2159041 cacctgcaag accgatgggt cgccatgttt ggtgatgacc actgctcgca taatgttctc 2159101 cttgtcaggc ttgacgggtc gcacccgcga acacccctct gtgatagcac gagttatcag 2159161 gaggttcggc ggggcgttac ctttgcggtt gtgcacttcg actgggagcg cctgaccgac 2159221 agcgtgcatc gctgccggct gccgttctgt gacgtcaccg ttgggctggt ccggggccgc 2159281 accggaatac tgctcgtcga caccgggacc accctcggcg aagcaacagc aatcgcggcc 2159341 gacgtcaagc agatcgctgg ttgccaggta acgcatgttg tgttgacaca caagcatttc 2159401 gaccatgtgc tgggttcctc ggtgttcgac caagcggagg tgttctgcgc tcccgaggtc 2159461 gtcgaatacc tacggtcggc taccgaccgg ctccgcgaag atgccctgag ctacggcgcg 2159521 gacacagctg aggttgaccg cgcgatcgcg gccctgaaac cacctcagca cgggatctac 2159581 gatgcagccg tcgatctcgg ggaccgcacc gtcaccatca ctcaccccgg cagcggccac 2159641 accacagcag atctcgtcgt ggtggcgccg gccaccggcc atgcagacgg cccaacggtg 2159701 gtcttcacgg gtgatcttgt cgaggagtca gccgatcctg atatcgacgc cgattccgac 2159761 ctggcggcct ggccggcaac gcttgatcgg gtacttgcga tcggcggccc tgacgccagc 2159821 tacgtcccgg ggcacgggaa ggtcgtcgat gcgcagtttg tccgtcgcca gcgcgcctgg 2159881 ttgcgaacac gtgcgagccg ccagcctcgt gaaacgccag ctactttgcc gtgcaagcgg 2159941 tgacgagcgc atccgggtcg gtaacgctga cccacaattc gcgcaccgtc atcgacttct 2160001 tccacatctt tgcctgttcg ggcggatcga tcgtcagtgc caccaggccc ttacgtgacc 2160061 cgttgaccag ccagcggccg aatccaaagt gcaccccggc tgcgtagacc cttgcgttgg 2160121 tcgcctctgc cttcgtgatc gacgtcaacg ggatgtcggc ggcaaatgcc catcccatct 2160181 tgacgtgcag gctccccgcc ccaacccata gctcgctgtt cttggggccg agcccgagcg 2160241 gcaccgcaag cgggagaaac caacggtcaa agcgcaactg ggtcggcacc aagatgaccc 2160301 taccggtgct agtgcggctc agtaccatgt aggagttagt ctcgaaccgc cccagtggcg 2160361 ttgcggaatt tgcgagccgt catcggtcag tgatctaggt cgcccgtccg gggatacact 2160421 cggtccgtca ggtgaatcgg ggctgcagag gagcgcaagg ccatggccat cgccgaaacg 2160481 gacaccgagg tccacacacc gttcgagcag gactttgaga aagacgtagc cgccactcag 2160541 cgatacttcg acagctcgcg ctttgctggg atcattcggc tctacaccgc ccgccaagtc 2160601 gtggaacagc gcggcacgat ccccgtcgac cacatcgtgg cgcgagaggc ggcgggcgcc 2160661 ttctacgagc gtctgcgcga actctttgca gcccgcaaga gcatcacgac gtttggcccc 2160721 tactcgccgg ggcaggcggt gagcatgaag cggatgggta tcgaggcgat ctacctcggt 2160781 ggttgggcta cctcagctaa gggctccagc accgaagatc cggggcccga cctcgccagc 2160841 tacccgctga gccaggtgcc tgacgatgcc gcggtgctgg tgcgcgcctt gctcaccgcg 2160901 gaccgcaacc aacactatct acgcctgcag atgagcgagc gacagcgtgc ggcgacaccg 2160961 gcttacgact tccgcccgtt tatcatcgcc gacgccggca ccggccacgg cggcgatccg 2161021 cacgtacgca acctgatccg ccgcttcgtc gaggtcggtg tgccgggcta ccacatcgag 2161081 gaccaacgac ccggcaccaa gaagtgcggc caccagggcg gcaaggtcct ggtgccgtcc 2161141 gacgaacaga tcaagcggct caacgccgcc cgcttccagc tcgacatcat gcgggtgccc 2161201 ggcatcatcg tcgcacgcac cgacgcggag gcggccaacc tgatcgacag tcgcgccgac 2161261 gagcgtgacc agccgttcct tctcggcgcg accaagctcg acgtaccgtc ctacaagtcc 2161321 tgtttcctgg caatggtgcg gcgttttacg aactgggcgt caaggagctc aatggtcatc 2161381 ttctctatgc gcttggcgac agcgagtacg cggcggccgg cggttggctt gagcgccaag 2161441 gcattttcgg cttggtctcc gacgcggtca acgcgtggcg ggaggacggc cagcagtcga 2161501 tcgacggcat tttcgaccag gtcgagtcgc ggttcgtggc ggcctgggag gacgacgcgg 2161561 gcctgatgac ctacggagag gccgtggcgg acgtgctcga attcggtcag agcgagggcg 2161621 aacccattgg catggctccc gaggagtggc gggcgttcgc cgcgcgtgca tcgctgcatg 2161681 ccgcccgggc aaaggccaag gagctgggcg ccgatccgcc atgggactgc gagctggcca 2161741 agaccccgga gggctactac cagatccgcg gcggcatacc gtatgcgatc gccaaatcgc 2161801 tggccgcggc accgtttgcc gacattcttt ggatggagac caagaccgcc gatctcgccg 2161861 acgctcgaca gttcgccgag gcgatccatg ccgagttccc cgaccagatg ctggcgtaca 2161921 acctctcacc atcgttcaac tgggacacca ccggcatgac cgacgaggag atgcggcgct 2161981 tccccgagga gctcggcaaa atgggcttcg tcttcaactt catcacctat ggcgggcacc 2162041 agatcgacgg tgtcgcggcc gaggaattcg ccaccgcgct gcgccaggac ggcatgctgg 2162101 cgctggctcg gttgcagcgc aagatgcgct tggtcgaatc tccctatcgc acaccgcaaa 2162161 cgctagtcgg cgggccgcgc agtgacgccg cattggctgc ctcctccgga cgcacggcga 2162221 ccacgaaggc aatgggcaag ggctccaccc agcaccagca cttggtgcaa actgaggtgc 2162281 cgcgcaagct gctagaggaa tggctggcca tgtggagcgg tcactaccag ctcaaagaca 2162341 aactgcgcgt acagcttcgg ccgcagcggg ccggctcgga ggtgctcgag ctcggcatcc 2162401 acggcgaaag cgatgacaag ctcgccaacg tgatattcca accgatccaa gatcgccgcg 2162461 gccgcaccat cctgttggta cgcgaccaga acacgttcgg tgcggaacta cgccaaaagc 2162521 ggctgatgac cctgatccac ctctggctcg tccaccgctt caaggcgcag gcggtgcact 2162581 acgtcacgcc caccgacgac aacctctacc agacctcgaa gatgaagtcg catggaatct 2162641 tcaccgaggt caaccaggag gtgggcgaga tcatcgtcgc cgaggtgaac cacccgcgca 2162701 tcgccgaact gctgacgccc gatcgggtgg cgctgcggaa gttgatcacg aaggaggcgt 2162761 agccagcgct gccaactgtc ttgggggcca accgggtgtg cgtcgaggtg gcgcacatcg 2162821 cgaaacgcga aggatgctgt cagacggcgt ctgcggtggc ctgtcgaaga tccagcgcac 2162881 cggcgttcac ctgcgtcggc ccgcggtcgc gactaccatc gccgcccccg tttacggccc 2162941 ggcacccggt gagaagaagc ccaggagcat ttggccgatg ttgttgacgc ccgagttaaa 2163001 cgcagcggtg aggtgaccaa cggtgctcgt gttgttgaag cccgagacgg tgttgcctag 2163061 gttcgccacg cccgacgcca gctgcccgac gttgtagatt cccgagactc cgccttgcag 2163121 cgcgttcggc acctggttcc agaggcccga aatgccgggc ccgacgttgc cgaagccgga 2163181 tgcgcttcca tcgccactgt tgaagaagcc cgaagacggg gtggtggtgg agtttccgaa 2163241 gcccggggcg ctcgtgatgt tgatcgggat gttgatcggt cccaagccgc cgttggcggt 2163301 caagttcagg ggggatccgg gaatggtgaa gccggggatc gtaaccgggc tcgtgccccc 2163361 gctcaacgga acattcaacc caaacggatt aatcgcgaaa ccagggatcg taaccgggct 2163421 cgtgcccccg ctcaacggaa cattcaaccc aaacggatta atcgcgaaac cagggatcgt 2163481 gacagcgttg gtagcaccgc tcagcggaat attcaaaccg aacggattaa cactgaatcc 2163541 ctggatgcca gactccaggg tgccgccggc cagcgtgacg cctaatacga atgtgctaag 2163601 cgggatgggg ccgatgtagc ccgtgaagat accagcgacg ttaaacggaa gttcgttgag 2163661 agtgatgttg accggtatcc tgatgttaat cgtaaggggg atgcgggaaa tagggacgcc 2163721 gggaacggtg atcggaccga caccacccag cgcgttcagg ctcaacggaa taccaggaat 2163781 agtaatatca ggcaccacaa tcggaccgac accacccagc gcgttcaggc tcaacggaat 2163841 accaggaata gtaatatccg gcaccacaat cggaccgaca ccacccagcg cgttcaggct 2163901 caacggaata ccaggaatag taatatccgg caccacaatc ggaccgacac cacccagcgc 2163961 gttcaggctc aacggaatac caggaatagt aatatccggc accacaatcg gaccgacacc 2164021 acccagcgcg ttcaggctca acggaatacc aggaatagta atatccggca ccacaatcgg 2164081 accgatgcca ccattcactt cgacgctcag tgggatggcg ggaatgctga gtgtgtctga 2164141 gtagccaatc agaccctggt aatcgcccct ccacagtatg ccgttgctgt agctgcccga 2164201 gatcagggcg ccggtgttaa ggtcgccaat gtttccccag ccggtgttga ggtcgccgag 2164261 gtttaggtac cccgtgttgg cgttgcccgg gttgaggtcg cccgtgttgg tgtcgccggc 2164321 gttgtagctg cctgtgttgt agcttcctgc gttgccgatt ccagtgttga cgttgccggt 2164381 gttgaacagg cccgtgttgg cgttgcccac gttacccagg ccggtgttgt agttgccgga 2164441 gttgccgatg ccgacgtttc cgttgcctga gttgaagaag ccgatgttgc cgttgccgga 2164501 gttgaagaag ccgatgttgc cgctgccgga gttcagcgcc ccgaatccga cctgattgtc 2164561 gccggtgagc ccgataccaa tatttccagt gcccgtgttg ccgaagccga tgttgccgtt 2164621 accgatgttc gcgaagccgt agttgttgcc gccgatgttc ccaaagccaa tgttgtgcag 2164681 ggcctccgtc aaccccggac ccgtgtttgc aaacccaagg ttgttgctgc cgacgtttcc 2164741 aaaaccgaag ttgttgcttc cgatgtttcc gaaaccgaaa cttccgttgc cgatgtttcc 2164801 gctaccgaag ttgtagctac cgacgtttcc gctacccacg ttgtagtcgc cgaggtttgc 2164861 gttgcccaag ttgagtgtgc cgtcgttggc gaagccgaag ttgaataacg tcccacctgc 2164921 ggcgttgcgc atgaagccgg cgagttggct gtcggtgtta ccgacgccgg agtgaaaggc 2164981 cgatgtcgct aggcccagcg tgctggtgtt gtagaggcct gagactgtgt tgccgaagtt 2165041 caagattccc gatgtcagtg gcccgacgtt aaggaatccg gagttgccga gattcccagc 2165101 aatgttccag aagccagatc cgcccgaacc gacgttcccg aaacccgatg tgccgcccgt 2165161 accgctgttg aagaagcccg atgacggggt ggtggtcgag tttccgaagc ctggggtgcc 2165221 cgcgatttcg atcgggatgt tgatcggccc gaggctgccg gacacgtcga tgcccaacgg 2165281 gattgagggg atcgtgattg gcggggtagt gagggggccg atggcgccgc ccacatcaat 2165341 acccaacggg attgccggaa gtgagtagcc atccgggaac accgtaaacg ggcctaaccc 2165401 tccgcccaca tcaataccca acgggattgc cggaagtgag tagccatccg ggaacaccgt 2165461 aaacgggcct aaccctccgc ccacatcaat acccaacggg attgccggaa gtgagtagcc 2165521 atccgggaac accgtaaacg ggcctaaccc tccacccaca tcaataccca acggaatagc 2165581 cggcaaacta taaccacccg ataagaaggt gatgggaccg atttgaccac tcactgtcac 2165641 gtaatctgga gggaatccgg ggaaaaatgg cggaatcgcg ggaatctcag gagtgcctag 2165701 ctgtatcgat atgctacccg ggcctatgct gccaacggtg ggatttacgc cgaataagcc 2165761 gatcgcaagc ggagacgcgg ggatcgaaat cgatcccacg ttaatgacct ggaacgccga 2165821 tagctctagg ccaatagaat ttagagtgat cggcgggatg ttgatggggc caacgagtgc 2165881 cccggtactg ttgatgccca gcccgatggc gggaacagta ataggcggaa cattgatcgg 2165941 ccccaccaac gctccggaac tgttaatgcc caggccgatt tcgggaatgg tgatggacgg 2166001 gatggtgatg gggccgacgg agccgaggcc gttgaggtct aggccagcag cgggaatggt 2166061 cagtgtgccg gagaagccga tcaagccctg gtagtcgcct cgccagaaga agccgttgct 2166121 gtagttgcca gagttgaatc caccggtgtt gacgttgccg gtgtttccca cgccggtgtt 2166181 gaggttgccg gggttgaaga agcctgtgtt ggagctgccc gtgttgaagt cgcccgtgtt 2166241 gaagctgcct agattgaagc tgcccgtgtt gtagttgccc gtgttgccga tgccagtgtt 2166301 ggcgatgccg gcgttgaaga agcccgtgtt ggcttggccc gtgttgccga tacccgtgtt 2166361 gtagctggta cctgagttcc cgatgccgaa gtttccggtg cccgtattgc cgatgccgat 2166421 gttgccggtg cctgagttga acaagccgat attgccggtc cccgagttcc agccgccgaa 2166481 cccctgctgg ttgtcgccgg tgaggccgaa gccgatgttt ccgttgcccg tgttgccgaa 2166541 gccgatgttt ccgttgccgg tgttgcccaa gccaacgttg ttgctgccga catttccaag 2166601 gccgaagttg ttgccgccgg gatttaggct gcccaagttc aaaatgccaa ggttagcggc 2166661 gcccatctgt ccgaagcccg agtttgccag gcctaagcta agatttgcca gcacaccctt 2166721 ggaactggtg atcgccgcgg tgacgacggc cgccggagcg gccgccaact gggcgggcag 2166781 gtctgtcaga ttctgcggcg gcgcagtgaa cggcgtcagg gccgacgcca ccgccgatgc 2166841 cccggcatgg taggccgaca tcaccgacac atcgatggcc cacatctgtt cgtatgcggc 2166901 ctcaatcgca gcgatcgccg gagcattctg cccgaagaag tttgagaaca ccaatgacac 2166961 caggtcggag cggttggccg ccaccagcgc cggttgcacc atcgccgccc ggacagcctc 2167021 gaactcggcc acgagggccg cggcttgggt tgccgaccgc tgggcctggg ccgctgccgc 2167081 ggcaagccat cctaggtacg gcgctaccgc agcggccatc gccgccgatg acggtccgag 2167141 ccacgattcg ccgaccaggc ccgacgtcac ggagttgaaa gaggccgctg ccgaggccaa 2167201 ttccatcgcc agttggtccc aggcgaccgc ggccgccgac atgggttctg atcccgcccc 2167261 gccgaatatg agggccgagt tgatctctgg tggcaatgtt gaaaaattca tggccccgac 2167321 tttccctggg tgcaccgaat tcatggcggc tcaccaaccc gcggtcggcg agcgccgtgt 2167381 cgctcgacgc tactcggcga tcttcgcggc cgtatgcata tcacccgaat agggccatga 2167441 ttcatagatc tcgtcaaact gatttacggc gggcgctttt tagccgcttt aggaatcgac 2167501 gccaaaccca acgaacgagc ctcagccaag gccgaaatcg attaattccc cgatgatttc 2167561 atcgttgtgg aggtcgtcgc aggcgtcgtt gatctgatcg tggcgattac ggctggtgat 2167621 cctctccgcg gggcggggtc cgcacggatt atggcgtggt gctctggaag aacaggcccg 2167681 acaggttgtt gccgatgttg gccaaaccgg agaccaagct ggtcacggca aacggcaggg 2167741 tgccggtgtt ggcgaagccc gatatgccgc tgccaaggtt ggagaagccg gagataagac 2167801 caccgtagtt ctggtagccc gagccggcta gcagcccaac aggacttgtg ttgaaccaac 2167861 ccgacagtcc cgagccgctg ttgccgaagc ccgagttccc accgattccg gcgttgaaaa 2167921 agcccaacga gggcgttgcg ctcgagttga agtatcccgg ccccgctggg attgcgaatc 2167981 cgcccatggt ggtgctcggc aggtggatgc tggcgatggt gagtgcgggt gtggtgaagg 2168041 ccgccaagcc caccggctgg atggtgaact ctggcgtggt gatctccggg atattgacct 2168101 gggggagggt gaaaccgcta agtccgatcg ggtcgatggc gaacggtgga gtcgttatct 2168161 cgggcgtcat gatctgagga agcgtgaaac cacccagcgc tatcggatcg atcgtgaacg 2168221 ccggggtggt aatcgccggg atgctgagct gcggcagcgt aaacccaccc agcgtgatcg 2168281 ggtcgatggt caactccggg gtcgtgaact gttgagtagt gatatccggc aggctcaatg 2168341 caccgacacc aatcggactg atcgtcaacg ccggagtggt gaattcttgg gtgctgatct 2168401 ctggcagggt gaacccgtcg accgagatcc ccccgaggga ccacggttgg atgacgacgt 2168461 tggggagggt gaagggggtg acgttgattg cgccgatcga gaagccgacg ccgttgattt 2168521 gacctccacc cacggtaatg gtcccagtat taataaaggc aggaggtgta ttagcgaagc 2168581 cgccaatctg cgggaatacc ccgggcatat tggtttgcaa ggcagtgatg ttgttcggaa 2168641 tgaacaccac caaattagtt atcgtaatgc cgttaaggct aaaggtggga agattgatga 2168701 caccagaatt tgcttgcgtg gctatgccgg gagtgctaaa gccgcctata cttatttggg 2168761 gtgtacttat taacggggtg tgtatcgtgg gtagcgtaaa tccgccgaca gtggtgccag 2168821 ccggaatcgt gatcggcgga accgtcaccg acggaatact cagcgtcggc agattgaacg 2168881 cacctagcgc tgtgccagcc ggaatcgtga tcggcggaac cgtcaccgac ggaatactca 2168941 actgaggcaa gttaaacgca cctaccgtga tgttggctgg tgtcgttgta gctggaatcg 2169001 tcaacgacgg caccgtcaac cccggcaaat caaacgcacc caccgtgatg ttagctggcg 2169061 tcatcgccgc tggaatcgtc aacgacggca ccgtcaaccc cggcaaatca aacgcaccca 2169121 cggtaacgtt ggccggcgtc gtcaccgccg gaatcgtcaa cgacggcaag gttatcgcgg 2169181 gcaggctgaa cgcgggaacc gagattccgg gtatttccag agacggaagc gtcaaatcag 2169241 ggctggtgat ggcgaactgc aggctgcctt ggcccacacc acggtaaaag acaccattgt 2169301 tcatgtcgcc cgtgttgaac aagccgttat tcatgtcgcc tatgttgaag gcgccggtgt 2169361 tgatgcttcc tgtgttgaac cagccggtgt tggcgccgcc cgtgttgaag gtacccgtgt 2169421 tcgacgggcc cgggttgaag gcaccgaagt tgtagtggcc gacgttgaag ctgccggtgt 2169481 tcgcgttccc aacgtcgaac atgcccgtat tgaaagagcc cgcattcagg aatccggtgt 2169541 tgccgtgtcc ggggttgaac aggccagtgc tgaagttacc ggagttcccg atgccaaagt 2169601 tgccattgcc ggagttgaag aaaccgatat tggcgctacc cgcgttgaat aatccgacgt 2169661 taccgttgcc ggaattgagt ccgccaatgc cgatttggtt gtttccagtc aggccgttgc 2169721 cgatattgtt gttgccggtg ttcgcaattc cgaagttgcc gatgcctgca ttggcgaatc 2169781 ccgtgttcaa attgcccagg tttgcaagtc cgaagttgtt ggcgcccgcg tttcctatgc 2169841 cgatgttgtt gccacctagg ttggccgagc cgatattgaa gctaccgaag ttgccggacc 2169901 ccagattgtt gttgccaagg ttggcgttgc cgatgttgcc gagcccattg ttggcattgc 2169961 cgaggttgcc accaccgacg ttggccaagc cgaggctagc ggcgatcgcc cggccggcaa 2170021 aagtcggcat gcccacggcc gtggtgagcg cggtcaccac ggccgcgggc ccggccgcca 2170081 gacccgccgg aagccgcagc gggagggcga atgccggtag cgccacagcg accgccgacg 2170141 ccccggaatg gtaggccgcc atcgccgata catccagagc ccacatctcc tcgtatgcgg 2170201 cttcggcggc cgcgatcgcg ggagcgtttt gaccaaaaag gttcgatatc accagcgata 2170261 tgaggccgga acggttggcg gccaccagcg ccggttgtac catcgccagc cgcacagcct 2170321 cgaactcggc caccatcacc tgggcctggg tggccgcctg ctcggcctgg gtcgccgccg 2170381 cggccagcca ccccgcatac ggggctgccg ccgctgccat cgccaccgat gaccgaccct 2170441 gccacgaccc gccgaccagc ccggctgtca ccgagccgaa agagacagca gccgaggcta 2170501 attcggttgc cagcccgtcc caggccgacg ccgccgccag catcggtccg gagcccgccc 2170561 cggcgaagat caaggccgag ttgatctccg gcggcaacac tgagtaatgc atcgctcccc 2170621 accttccggg gtgagcctgg tgctgatgaa aggtcacacg cccgtcgtcg ctgactcgtt 2170681 cgtagcgcat gagagtacgc ggagatcttg aattgtgtat ccgagcaaat gaaaccgtta 2170741 tctatttgtt atagacatat cgggcacgga tgcaaagttc ttttacacgc tatgcgtaat 2170801 cacgatccgt gcccgtctga tgtaaaccac cgacgtaggc gcactgatat aaatgcattt 2170861 attaccaagg tgattgggtg aaataattac cccggaaaac tgtgctcaat aggaacgatt 2170921 attagtttga atcactgcca taatccaccc tatgtgcaac ccggatgaat tccgatcgcg 2170981 tgcttattcc tgccaaacat tcgggcttta gccctggccc accacgcggg caccaatccg 2171041 acgctgcccc tacagcgaaa tcaccggcgc accgcctccc gctcggccgc cttcaccagt 2171101 tgacccgcga agaacctgac cgcgccaccc agcgccgccc gcatcaccgg ccccgtccca 2171161 cgaacctttt cggtaaacga gccactccag cggagatcgg taccgcccga cgcatttggt 2171221 gtaaggacca cctcgccgaa gtagtcctgg acgggtgtcc tcgcgccaac cagcttgtag 2171281 acgtggcgac ggtcctgctc atactcgacg gtctcttcct gcacgaacac cggccacatg 2171341 cctagtttgc ggatggcccc gatgccgccg ggcgcgggat caccgcgtcg cgcccaactc 2171401 gattgagcaa cgatgggctt ggcccaggtc gcccagttgc caccgtctgt cacgagccga 2171461 aacaaggttg cagccggcgc gctgctggtc ttggtgacct cgaacgaaaa tttccgaccc 2171521 gacatgcgcg actcccgaaa cgacaactga agcggcccga tatggtgctg ccgcgtaccc 2171581 taccgcgcag ccgtccgtgc cggccgtagt ggaccagcca aggtgttccc gcgctggccg 2171641 cagcaggcgc ataatcacga ggtgtcccgc gcagataccg tctcagtgcc ccgtgcgccc 2171701 acccaggctg aggtcgccgc agtgctgcgc atcatgacgc cgctgcgcaa ggtgattaaa 2171761 ccaaaggtct atgggatcga aaatgtgccg accgaacgcg cattgctggt tggcaaccac 2171821 aacacgcttg gcttggtcga cgcgccattg ctggccgccg agctctggga gcgggggaga 2171881 atcgtccggt cccttggcga ccacgcccat ttcaagattc cggggtggcg cgacgcgctg 2171941 acacgaacag gggtcgtcga aggcaccaga gagatcacct cggagttgat gcgacgcggc 2172001 gagctcgtca tggtctttcc cggcggcgcc cgtgaggtca acaagcgcaa gaacgagcgc 2172061 tacaagctgg tgtggaaaaa tcggctgggg ttcgcgcgct tggcaattca gcacggctat 2172121 ccgattgtgc cgttcgcttc ggtgggtgct gaacacggca tcgacatcgt gctcgacaac 2172181 gaatccccac tgctggcacc ggtccagttc ctcgccgaga agctgctcgg caccaaagac 2172241 ggtccggcgc tggtccgtgg tgtcggactg acaccggtac cgcgccccga acggcagtat 2172301 tactggttcg gcgagccaat cgacaccaca gagtttatgg ggcagcaagc cgacgataac 2172361 gccgcacgca gggtgcgcga gcgtgccgcc gccgctatcg aacacggcat cgagctgatg 2172421 ctggccgagc gcgcagccga tccaaatcga tccctggtcg gacggctctt gcgctcggac 2172481 gcctaaggcg cccctgaggc gttcccgggg cctgattcag aagtcagaag accgagtcga 2172541 cttgatcggg gattggggtg ccgtcgttgc gcaataccgg ttgtttcgat ccgtcggggt 2172601 tgatgaatgc ctccccgcat acgtaaggag cgtgctgggg cagcgggtcg ataaacatcg 2172661 ggttgatcgc ccacttaccg cccctggtga acaggccgtc gtaggcccgg cacatgaggt 2172721 cgtcctggtt gcggttgatc acgagtgaca ccacggtggc gtcgccgacg aaggtggcgt 2172781 cgctgttcga gtcgccggcg gcgaggactt gacgacgatc cgccgcgagc tgattgaagg 2172841 cttgcgggcc agtcaccccg aagatgacct gattggccca acaccgtttg ccatcaaggt 2172901 aggtcatgac tgaatcgtcg ccgtcgcgga cgcctccgca accgacgagg tgagcggtga 2172961 gtttcccgga ctggtcggcg acgctgcgga ctccgacgac atgctgatcg tctagaccta 2173021 cctcgcccgc ccacaccttg acgatcggtt cgggtgacgc tgacaccacc caggtgtcga 2173081 taccgtgtgc ctgcagagta ccgatgaggt ctttcatttg tggatagacg cggatgtaac 2173141 catcgacctg ctgtgttccg acctgctggg tggcgccgac atcggcggca aggttctgtt 2173201 tcttggcctg gtctgcgaat ccggcgagct cctcagcggt gtagcccgcc gacagtgcgt 2173261 tgctccacgc gtacggaccc gccaaccggc gcacgttgtt acccacgaaa gccggctgtc 2173321 ccgtggtggt ttcgccgtcg agaagggaaa ggatctcgtt cgcgcacaac gcattgctgc 2173381 cggtcggcag cggcttgccg gcaggtacaa ccttgccgca tgccacgctc agcgcgttcg 2173441 ccgccgcgtc ggtcaggtat cggctggcgg catgccaatc ctggttggct ggctgcagca 2173501 ccaggctgtg ctgcagcatg tagtagttcg tggcgtagcc gatgtcgttc ttgacgacgg 2173561 tgttgtccca gtcaaagatg gcgaccttgc gcgcagaacc gtccgcggtg ccggtgcacc 2173621 tgctgttggc atcgatcgcc gactgcagga attcacgaac tccgtggtgc cacttcagaa 2173681 acgcgtcgag ctgacgacag ccggacgctg gggtcggggg ttggtgggcc gagcagccga 2173741 tgacgccacc gagcacggtt gccattgcca acagcgacgg tatgagtcgc accatgtaag 2173801 cccttcgtca gcccttggtc gtgccagcat gcgccggatg gaagggggat gggaactgaa 2173861 tggttgcctg ctgaactgaa cgctgagcaa attcgatgcc gacgaaacat tatgggtttg 2173921 tttctcgacg gcaacccgtg cgcgattcga cagtcaccgc gatgctgccg acgccggccc 2173981 gcgctcccgg gcgatccgcg tgagcagcgt aatctcgtgc gcacggattt gcggcccgga 2174041 ctagcgcgaa agatactgtt gaacagatgg attcgactgt aacggcctcg atccgacgca 2174101 tgctgggact gctcgccgcc acattgctgc tcggcggctg caccggccag cacacgacac 2174161 gcacagcggc gagcaccaca tacacgcccc acatcaaggc cagcagtcag gacgtactgg 2174221 acggcgccat caatgccgac gagccaggtt gttcggccgc ggtaggagtc gaggggaaag 2174281 ttatctggtc aggcgttcgc ggcattgcgg atctggcatc cggcgccaag atcaccacgg 2174341 acaccgtgtt cgacatcgcg tcggtgtcca agcagttcac cgccaccgcg atcctgctgc 2174401 tcgtcgaagc cggaaagcta acactcgacg acccgatatc ccaatacgta cccgagctac 2174461 ccgactgggc ccaaaccgtc accgtcgagc agctcatgca tcaaaccagc ggcatccctg 2174521 attacgtcgc attgctggca gccagggggt atcaggtcag cgaccgcacc atcgaggccg 2174581 aagcccggca ggcgttagcg gccgcccccg agctgcaatt caagcctggc accaggttcg 2174641 attactccaa ctccaactac ttgctgctcg gcgagattgt ccaccgcgca tcgggacaac 2174701 cgctgcctga gttcctcagc gccgagatct ttcaaccgct tggtctggcc atggtggtgg 2174761 atccggtcgg gaaggttccc aacaaagccg tgtcatatga gaagggcact ggtggaaacc 2174821 ggtccgagta ccgggtgggc aatccggcct gggagcagat cggcgacggt ggcatccaga 2174881 ccacgcctag ccaactggcc cggtgggcgg acaactaccg gacaggaagc gtcggcggcc 2174941 tgaaactgct cgaagcacaa cttgccggtg cggtggaaac cgaacccggt ggcggcgacc 2175001 gctacggcgc cggaatcgtg tcgcgcgccg acggaacact cgaccacgcg ggcgcctggg 2175061 ccggattcgt cacggcattc cacatcagca gtgaccgacg gacttcggtg gccatcagct 2175121 gcaacaccga caagccggac ccggtggcca tggccgatgc gctggggcgc ctttggatgt 2175181 agcggggcta ccgcggttgg ccgccggtac ccaggctgca atcattcacg gtatggcgca 2175241 accaccgtca ctcctcacaa ctgacaatgg cctacccttc ggcgtgcaag gtgcctgcga 2175301 ctcccgtttc accggagtca tccgtgcctt tgctgggctg taccccggcc gcaagttcgg 2175361 gggtggggca ctgtcggttt atatcgacgg tcgccaggtc gtcgatgtct ggacggggtg 2175421 gtccgatcgg cagggcaaag taccctggac ggccgatacc ggggcaatgg tgttctccgc 2175481 gaccaaaggg ttggccgcaa cggtgattca ccgtttggtc gatcgcggcc ttttgtccta 2175541 cgacgcgccg gtcgcggagt actggcccga gttcggagct aacggcaagt ctgaggtcac 2175601 cgtcagcgat gtgttgcgac atcggtccgg actggcgcac ctcaaggggg tggacaagga 2175661 cgaggtcatg gaccacctcc tgatggagca gaagttggcg gctgcgccgc tagaccgcca 2175721 gcacgggaag ttggcttacc atgcggtgac ttacggatgg ctgctgtccg gcttggctcg 2175781 tgcagtgacc ggcaaaggca tgcgtgaact gttccgcgaa gaactcgctc gcccgctgaa 2175841 caccgatggt attcatctcg gccggccacc ggccgactcg cctaccaagg cggcacagac 2175901 acttctgccc caagccaagg tccccacccc actgctcgat ttcatcgcac caaaggttgc 2175961 ggggctgtcg ttctccgggc tgctcggcgc cgtctacttc ccgggcatcc tgtcgttgct 2176021 gcaagacgat atgccgttcc tcgacggtga ggttccggcg gtcaacggcg ttgtgaccgc 2176081 gcgcgccctg gccaagacgt atggggcgtt ggccaatgac ggtgtgatcg acggcacccg 2176141 actgctgtcg tcgcaggcgg tacgtggatt gacggggaag tccgagctat ggccggacct 2176201 taatctcggt cttcctttta cctaccacca gggttaccaa tcgtctccgg tgcctgggct 2176261 gctggagggg tacggccaca tcgggctcgg tggcacgatc ggatgggccg acccggagac 2176321 cggcagcgca ttcggatatg tgcataaccg cttgctgacg ctactgttgt tcgatattgg 2176381 ctcgttcgca gggctggctg cgctgctgaa cagcgccgtc gtggcagcac gtcgcgatga 2176441 ccccctggaa gtgccgcatt tcggtgcgcc ctatagcgaa ccgcgtcatg agcaggcggc 2176501 ctcgggtgca taactgctcc cgttatgccg cgagcgcgag cccgacgggc tagaactcgt 2176561 aaacgagtag ccagacgaga gcgacggccg ccaagaacag accaaccagg atagccgcgc 2176621 gggtaaccag tacctggcga tggaaccact ctcgcagctg ggtgaatcgc cagtcggtcc 2176681 aggcgtaggc gcgcacagcc cactgcgcct cgaccgcgag cagtcgaaac gcgaccagca 2176741 gggccgggat gccgagttcg gggagcagca cgatcatcgg cagggatacg acgaatagcc 2176801 cgccaccgac cacagcgagt gtcgcgcgaa tcagtagcgg cctggcccgt acccgctgtc 2176861 ggtatgcgag cactcgggcg agcgcggcgt cgcgggtgga agtcgggttg atgacgtcgg 2176921 ccgggtccat gactgctcct agtgtgcctg cctcgacgcc tagcggacgg ctgtgtcggg 2176981 ggtggtttgg ttcggactct agtggagccc ggttgcgcac tcgggtccga ccaatgcggg 2177041 gccgcgcctc atacgcacga taagcgtggg tgtatagact gcggttatga atgacggctc 2177101 ccggcaggaa ctcagggttc gtagcggcct actacaaatc gaggactgcc tggatgctga 2177161 cggcggcatc gcattgccgg caggcaccac gctgatctcg ctcatcgagc gcaacatcaa 2177221 gtatgtcggc gacctcgtgg cgtatcgcta cctggaccac gcccgttcgg ccgccggatg 2177281 cgccctggaa gtgacctgga cgcaattcgg tatgcgatta gcggccatag gtgcacacgt 2177341 gcaacggttc gcaggccccg gcgaccgcgt tgcgatcctc gcaccacagg gcatcgacta 2177401 tgtttgcggg ttctacgctg caatcaaggc aggcaccgtc gcggtgccgt tgttcgcacc 2177461 cgaactgccg ggtcacgccg agcgtcttga tacggcactt cgcgattcgg agccagcggt 2177521 catactcacg acggcggcgg cgaaaaacgc cgttgaaggt tttctgaaca acgttccgcg 2177581 cctgcgaaag ccgacagtcc tcgtcatcga tcaaataccc gaccgcgagg gggagctgtt 2177641 cgtcccggtc gagatggaca tcgacgccgt atcccacctg cagtacacct cgggctcgac 2177701 gcgacccccg gtcggtgtcg agatcaccca ccgcgcggtc ggcaccaacc tggtgcaaat 2177761 gatcctgtcg atcgacctgc tcaaccgaaa cacccacggc gtcagttggt taccgctgta 2177821 ccacgacatg ggcctatcca tgatcggctt tccggcggtc tatggcggac actccaccct 2177881 gatgtcgccc acggcgtttg tccgcaggcc actgcgatgg atccaggcgt tgtccgaggg 2177941 gtcgcggacc ggacgcgtgg tcaccgcggc gccaaacttc gcctacgagt gggccgcaca 2178001 gcgtggacta cccgcgcaag gcgacgacgt cgacctcagc aatgtcgtgc tgatcatcgg 2178061 ttccgaacca gtcagcatcg atgcggtgac cacgttcaac aaagcgttcg cgccctatgg 2178121 tttaccgcgt acagcgttca aaccctcgta cggcatagcc gaggcgaccc tgctcgtcgc 2178181 gaccatcgac catgccgctg agccgacggt tgtttatctt gacccagagc agttgggcgc 2178241 cggacacgcg acgcgcgtcg cgccggatgc gcccaacgcc gtcgtgcacg tgtcgtgtgg 2178301 ccatgtggcc cgcagcctgt gggccgtgat cgtcgacccg gataccggcc ccgaggcggg 2178361 cgccgaactg cccgacggtg agatcggtga ggtttggtta caaggcgaca acgttgctcg 2178421 ggggtattgg ggacggccgg aagaaacgcg gatgacgttc ggtgcccgct tgcaatcacc 2178481 gctcgccgaa ggcagccacg ccgacgggtc cgcgatcgac gacacctggc tgcgcaccgg 2178541 agacctcggc gtgtacctcg acggtgagct ctacatcacc ggtcgaatcg cggatctgct 2178601 gaccatcgac ggccgcaacc actatccgca ggacatcgag gccacggccg ccgaggcctc 2178661 gccgatggtg cggcgcggat acataaccgc tttcacggtg ccggccagcg acggggacga 2178721 ccgcaatcag cgactggtga tcatcgccga acgtgcggca ggcaccagtc gcagcgaccc 2178781 gcggccggcg ctcgacgcga ttcgcgcagc ggtttgcaac cgccacgggt tatccgttgc 2178841 ggacctgagt ttcctgccgg ccggcgccat tccacgcacc accagcggga agctggctcg 2178901 ccaggcctgc cgcgcccaat acctcagcgg tcgcctgggc gtgcattagc tacgatctac 2178961 ggctcccaaa tcagcagatc ctccatgccg ttgttcatcg cgacgatggt tggcgatggg 2179021 ccggtgacat cgaagtagat tttgccggtc gattgttcgc cttgggggat agtggctccg 2179081 ctaatggtgt cggggcccgc ggcttgccac agcacccggt agttgatgcc gtcggcggtg 2179141 cgggcattga actgcgagac cgcgggcgtg acgctgccgc gaatcgcatt gaccgtggca 2179201 gtggcctccc agacctggcc ggccaccgga tagccgggga tgactgccgt gctggatttg 2179261 agatcactga ccttccagcc gagcacgact tggccaacgg tgtcggtcat cgttagctca 2179321 ctgccaagtt ttccggtgat gggataggca gccaacgcga ccggtgccgc aaaggtcgcg 2179381 atggccgcca tggccacgac cgctactgcc gtcttgatca ttgtggtgag cttcattggt 2179441 ccctacctcc actacttgtt ggggcgatta cctggttcga acctcgccga cgtcattacc 2179501 ttaagccgca aatgacccgc tgctaactcc agattcgata ggaaccgtgg ggcagacgat 2179561 gccgttcaca tccgtagccg gcgcaccgac gacgggcgtg gccatgaatg cttgatggcc 2179621 gagtcgtagg cgaccagcgc aagggagcca aaccgcatgt caggatggtg tggtgaccgc 2179681 catacccggc ccgtcgggcg ccgaacccgg tgagagccgc gcgctcgcgg gttacccggt 2179741 gacgccgccg gcgctgcccc gcccggtgat cttcgaccag cgctggactg acctgacctt 2179801 catccactgg ccggtgctgc cggagagcgt ggcaggcagc tacccgcccg ggactcgccc 2179861 cgatgtcttc gccgatggga tgacttacgt gggtctggtc ccgtttcgca tgagcagcac 2179921 caaactcggc accgcactgc cgatcccgta tgtcggcacc ttcccggaga ccaatgtccg 2179981 gttgtactcc attgataacg ccggccggca cggggtgctt ttccggtcgc tggaaacagc 2180041 tcgactgact gtcgtaccgc tcacgcggat aggactcggc atcccgtacg cctggtcgag 2180101 gatgcggatg atgcgctctg gtaagcacat tacgtatcac agtgtccgcc gctggccacg 2180161 gcgcggactg cgcagcctat tgacgatcac catcggtgac ctggttgagc cgacgccgct 2180221 ggaagtctgg cttaccgcac ggtggggtgc gcatacccgc aaggctggcc ggacttggtg 2180281 ggtgccgaac gagcataagc cgtggccgtt gcgggccgcg gagatcgccg agttgaacga 2180341 cgagttgatc gacgcaagtg gcgtgcaacc cactggcgat cggttgcgcg ccctgttttc 2180401 accgggtgtg catgcccgat tcggccgtcc gtgtgtcgtt cagtgacgtt taggggcagg 2180461 tgtatccacc atcaatcacg atgtcggaac cggtcatata gctggaagcc tcgctagcca 2180521 gatacaggta gaggccagcg agttcttcgg gccggcccaa ccggcccaac ggaatcttgg 2180581 gctcccatag cggctggtat tccgtgtacg gttcgacgag ctcggtcagg atatagcccg 2180641 gactgacact gttcacccgg attttatgcg gcgccaactc cacggccatg gctttggtta 2180701 gatgaatgac cgccgccttg gaggcgcagt agtgggaaac ctgctgcggg acgttgatga 2180761 tgtggcctga catggaagca gtgttgatga tgaccccgcc ttggccttgt ttgaccatcg 2180821 ccttggcagc ggcctgcgcg gtaaggaaga cgcctgtcac attggtgttt tggaggcgct 2180881 ggaactcttc cagcggcatg tccagcatcg gagtgaccgt gatgatgccg gcgttgcaga 2180941 ccgcgatgtc gatcccaccc agctccgcgg tcacctgatc caacatgctg gtcacctgct 2181001 ggtgctggct cacatcgcag cagacgggca cgaccttgcc acctgatgtg ccaatctcat 2181061 ccgccaactt ctctaaggca tccaaatgcc gtgcggcgat cgccacttga gccccggctt 2181121 cgacgtatgc cagggcaact ctcttgccga tgccggtgga tgccccggtt atcagcgccc 2181181 tcttgccgtg caagtcgaac aggtccaaca cgctcattcg tgatcccctt tcgcgcgacg 2181241 cagggccgat acctgatgga atcacatgcc gaaatgcgtt cgatgaactg ccgcaatggc 2181301 ttccagtggt ccgctcactt cgacccgcgc tacggctcgg cgtccaaaga cgtacagcag 2181361 caactcgccg ggcggtccgg tcaggcgagc cgtcggctcg cctgaccgga ccctcacccg 2181421 cttaccggtt ccaacccact cgatctcaag cccgcaaccg tgcagccgcc gactcaggaa 2181481 gtggctgccg cgccgaacat ttcgccatag ggcagcatcc atttcgggcg tgaggcttcg 2181541 gggccctcgt ccgctggcgc ggcgaacgtc ctcgtgatgg acaaagaatt cgttgaggtt 2181601 cgccaaggta cgaacccatc cgatgcggaa gaaccccatc ggtggaccgg accgaatccg 2181661 agcgacgagc cacgtgaagt ctttactctg agccaatctc gctctacggc gttcggcaaa 2181721 ccgctggaag ggacccggta gaacgatgca aaggccagca acgagatcgc gttcacgcag 2181781 cacgatgtga gcggccaggt cgtgagcagt ccagccctcg atcagtgtag caaccgcagg 2181841 accgagctcc tcaaggagat cacagagctc caagcgttct tgcgcgtcca acgggacatc 2181901 agccacgccg cgggagtcta cgggcgacgt gcctgcgcgc caacgggctg ccgcttgcgc 2181961 cgtcgcgact gcacagcagc cagcgcccgc tcccaggcga gcagcgttgc ggccgtcaga 2182021 ttggccggtt tggcgctgtc cttggacagc agcgcggtcg cggcggcttt ggtggtcggc 2182081 gacgccttcg acatgtgacc ggagtcgaac ggcggctgcg ggtcgtactc gatcgccagc 2182141 tgaatcgcct tggcccgggc ctccccgccc agctgtccgg ccagccagag ggcgagatcg 2182201 agcccggcgg acacgcccgc gctcgtgaca atgttgtcct ggtgcacaat ccgctcgtcg 2182261 gcgaccggga tagcgccgaa tgccttgagc gcgggaagcg tcagccaatg cgaggtcgcg 2182321 cgccggcccc ggagccacac gaaccgcacc tgggcgtgcg gcaggtttcg cagcacctcg 2182381 tacgggccga ccacgtccag cgcggtaacg ccggggtagg ccacgaatgc gatttgcgtc 2182441 atcggtgttc tccctagtgt caggcgaagg ctttgcggta ttggtcgggt gatatcccga 2182501 cgcggcgaat gaagctgcgg cgcatggttt ccgcggtccc gaagccgcat cgggcggcaa 2182561 ttgccaccac ggtgtcgtgg gtctcctcca actggcggcg cgcagcctcg gtgcggatgc 2182621 gttcgacgta ccggccgggc gcctcgccga cctcgtcgct gaacacccga gtgaaatgac 2182681 gcgggctcat ggccgcacgt tgagccagtt cgccgatgcg gtgcgcgccc ccggctcggc 2182741 ctcgatggcc tcctgcaccc ggcggatcga ggtccgtttg gcgcgtggca tccacaccgg 2182801 agccgcgaac tgggtctgcc caccgggtcg gcgcagatac aggacgagcc agcgggcaac 2182861 cgtctgggca atctcggtgc cgtggtcgtc ttcgaccagt gccagcgcga ggtcgatgcc 2182921 ggcggtgact ccagccgcgg tccacacctt ctgcgaactg cgcatgaaga tcgggtcggc 2182981 atcgacccga acggccggaa attcgcgggc gaaatgttcg gcaaaggccc agtgcgtcgt 2183041 cgctcggtgt ccgtcccaac aaccccgctt cggccgcaag aaacgcgccc gtgcacacgg 2183101 tgacgacgcg gcgggcggtg ccggagacgg ctttgaccca gtcgatgagg gccggttcgg 2183161 accgtgcggc atcgactccg gcgccaccgg gcaggatcac ggtgtcgacg gggtcgccgg 2183221 ggaatcccac gataaccact cttcgcgcca tgaatgccag tgttggccag gcgctggcct 2183281 ggcgtccacg ccacacaccg cacagattag gacacgccgg cggcgcagcc ctgcccgaaa 2183341 gaccgtgcac cggtcttggc agactgtgcc catggcacag ataaccctgc gaggaaacgc 2183401 gatcaatacc gtcggtgagc tacctgctgt cggatccccg gccccggcct tcaccctgac 2183461 cgggggcgat ctgggggtga tcagcagcga ccagttccgg ggtaagtccg tgttgctgaa 2183521 catctttcca tccgtggaca caccggtgtg cgcgacgagt gtgcgaacct tcgacgagcg 2183581 tgcggcggca agtggcgcta ccgtgctgtg tgtctcgaag gatctgccgt tcgcccagaa 2183641 gcgcttctgc ggcgccgagg gcaccgaaaa cgtcatgccc gcgtcggcat tccgggacag 2183701 cttcggcgag gattacggcg tgaccatcgc cgacgggccg atggccgggc tgctcgcccg 2183761 cgcaatcgtg gtgatcggcg cggacggcaa cgtcgcctac acggaattgg tgccggaaat 2183821 cgcgcaagaa cccaactacg aagcggcgct ggccgcgctg ggcgcctagg ctttcacaag 2183881 ccccgcgcgt tcggcgagca gcgcacgatt tcgagcgctg ctcccgaaaa gcgcctcggt 2183941 ggtcttggcc cggcggtaat acaggtgcag gtcgtgctcc cacgtgaagg cgatggcacc 2184001 gtggatctga agagcggagc cggcgcataa cacaaaggtt tccgcggtct gcgccttcgc 2184061 cagcggcgcg accgtctgga gttcgtcacc gttggccgcg ctcatcgcgg cgaacatcac 2184121 cgtcgcccgg gtggcgtcga tctcgatcat catgtcggcg caggcgtgct tgaccgcctg 2184181 gaaggaaccg atcggtcgat cgaattgcgt tcgccgcccg gcgtattgca ccgccaggtc 2184241 gaggcaggcc tcggcgccgc ccagcatctc ggcggccaac agcacccggg ccacgtcgag 2184301 cacccgctcc atatcgtcgg gcgtcccggc ggtcagcggc tcggcggggg accccgccag 2184361 ccggagcgtg gcgaccggac gggtgatgtc aaacgagggc aacggtgtga cggtcacccc 2184421 gggggcgtcg gcggccacga cgtgcagaac gatcgacccg tcggccaccg cgggcaccac 2184481 gaacaggtct gcgacgtgac cgtgcagcac cggggtgcac tcgccggtga gtgcgggccg 2184541 accgtcgcgc cgaacggccc gaacggtggt agccgacgcg acgtcgtggc cactgacggc 2184601 gatcgttccg atccgcgcgc cggtaagcag accggcgagc aggcgcttgc gctgctcgtc 2184661 gtcgcccatg cgcagaatcg cttcgatcgc aaacaccgtg gccgcaaagg gaattggggt 2184721 gagcgcccgg ccgagttcgg caaacgcgat cgcggtctcg actaaggtgg cacccaatcc 2184781 gccgtgctcc ggcgggacgt gcagcgcggg taattcgagc tcggtgcaaa gccgttgcca 2184841 cagcctgcgg tcggatccgt ccgcggcagc catctcccgc acgggcgcgc cccggccaag 2184901 gaagccgcgc agcgaggcgc ggaaatcgtc ttgttcggtg ctgtatcgga agtccacgtc 2184961 agcagagcac ttcgggccgc ggctccttgg ggaggccgag cagccgctcg ccgatcacgt 2185021 tgcgctggat ctgcgagctg ccggcataga tcgtcgcggc ccgtgcgtag agcagctcat 2185081 ccatccagca ggccggggag tttggcgtac ccgcctccgg gaccagccgc gcaccgccgt 2185141 tgccgggccc ccgcgggccc agcgcctcga gccccaggat ttcgacggcg agatcggtgt 2185201 accggcggaa atattcgctc cagatgacct tcgtgatcgc ggcttccgcg ccgggcggcc 2185261 gtccggtcag ggccagggtg aggtcacggt agccccgata ccgcatgatc tgaacccggg 2185321 catagcacca cgccaagccg tctcgtaccc gtggatcggt gtgtaatccg cggtcacggg 2185381 ccagctcgca cagccgctgc aggtcccgct caaaatcgat ggcggcggtg gcgatgtgcg 2185441 atccgcgttc gaagccgagc agcgtcatgg cggtcgacca gccgtcgccg acccggccga 2185501 cgacattgcc ggcgctggtg cgggcatcgg tcaggaagac ctcgctgaac gaggagtgcc 2185561 cggccgcgtt gacgatcggc cggaccacga cgccgggctg gtccatgggc accagcagaa 2185621 acgacaggcc ccggtgtttc gcagcgctgg gatcggtccg cgccagcagg aagatccagt 2185681 ttgcggtggt gccggccgac gtccagattt tgtggccgtt gatcacccat tcgtcaccgt 2185741 cgagcacccc cctggtgcgc accgaggcca ggtcggagcc ggcctccggc tcggagaagc 2185801 cctggcacca ccgatgctcg ccgctgagga tgcgcggcag gaaatgccgc ttctgcgcct 2185861 cggaacccag ggcgatcagg gtgttgccca gcaggtcgat tccgagcagg tcgttttccg 2185921 cgcgttcggg cgcgccggcg cgggcgaatt cctcggcgag caccacttgt tccatcgggg 2185981 acaggccacc acccccgtat tccgtcggcc aggacaccgc gaccaggcca gcgccggcca 2186041 gggcccgccg ccagtgccgg gcgaactctt cccgctcgtg gggcggcagc gccccgggtc 2186101 cgggccaccc gggcggcagg tgctcggcca caaactcccg gatccggtcg cggaacgctt 2186161 ccgcttcggg tgggtagctg acgtccactg cgcgccccgg cctcagggcc gctgcttgat 2186221 cgcgggccgg atctgcggtg cggcgcgcca gtcctccagg ccgtactcga ccgttccgta 2186281 ggacagcttg ccgccggtga cttcgcccca gtgcgcgtga ttgagctggt ggatcttgaa 2186341 gcaaccgtcc agcgcggcgg aaaaccccat ggcatcgacg gtttggttca ccgattcctt 2186401 gatcagcagt gccgccatcg tcggcacctt cgcgatccga cgcgcgaatt cgattgtgct 2186461 ggtcgcgagt tcgtcagcgg gaaacacctt gctgaccatc cccagcgcgt gggcctcgtc 2186521 ggcgcctatg cagtcgccgg tgagcagcag ttccttggtc ttgcgcggcc cgaactccca 2186581 cggatgtccg aagtactcga ccccgcacat gcccagccgg gtgccgacca catcggcgaa 2186641 cacggtgtcc tcgctggcga cgatcagatc gcagcaccag gccagcatca accccgccga 2186701 cagcacggcc ccgtgcacct gggcgatggt gatcttgcgc aggttgcgcc accgcttggt 2186761 gttttcgaag tagtagtgcc actcctggcg gttgcgtgac tcgaccccgc cgaaggtcgc 2186821 cccgttgcac cggtagctgg ggtgctggtc cggcccgggc gagcgttccc ggatatcgtc 2186881 agcggatccg aggtcgtgac cggcggagaa ggcggggccg gcggcccgca ggatcaccac 2186941 ccggacggtg tcgtccgcct cggcaagttc gaaggcggcg cccagctcga ccagcatgcc 2187001 gcgggtctgg gcgttgcgtt gtttcgggcg gtccagggtg atcgcggcga tgcgcccatc 2187061 gtcgatggtt tcgtagcgga tgtattcgaa ctcccggggc cgtcgggagc gttccccgtc 2187121 cgaccggcga tcgaccggac cgaccctgcc gacgaacatg tccgctcctt actggacgtg 2187181 aacggctgac ctgtgcgagg ttacccgtcc cttagccaac atgtccatag ccaatacgca 2187241 catgagagtg atcgatatag acaaattccc atgcaaagaa gcacttgtgt acaacgaagt 2187301 atcttggtag tactgtgata tacgcaaagg gcgccaccgc agcgcgccgg gcatccgacc 2187361 ggtacaacca ggaagggttg acgatggaga tcggaatatt cctcatgccg gcccatccac 2187421 cggagcgcac cctctacgac gccacccggt gggatctgga cgtcatcgag ctggccgatc 2187481 aactcggcta cgtggaggcc tgggtcggcg aacacttcac cgtgccgtgg gagccgatct 2187541 gcgcccccga tctgctgttg gcgcaggcgc tgctgcgcac ccaacagatc aagctcgccc 2187601 cgggtgcgca cttgttgccc taccatcatc cggtcgagtt ggcccaccgg gtggcctatt 2187661 tcgaccacct cgcccagggt cggttcatgc tcggcgtggg cgccagcggc atcccgggtg 2187721 actgggcgct gtatgacgtg gacggcaaga acggcgagca tcgcgaaatg acccgggaag 2187781 cgctggagat catgctgcgc atctggaccg aggacgagcc ctgggagcat cgcggaaagt 2187841 actggaacgc caacggaatc gcgccgatgt tcgagggtct gatgaggcgc cacatcaagc 2187901 cgtaccagaa gccccacccg cccatcggcg tcaccgggtt cagcgccggc tcggagaccc 2187961 tcaagctcgc cggcgaacgg ggttacatcc ccatgagtct ggacctcaac accgaatacg 2188021 tcgccaccca ctgggacgcg gtggaggaag gcgcgctgcg cagcgggcga accccggatc 2188081 gccgcgattg gcggctggtg cgggaggtgc tggtggccga gaccgatgag caggcgttcc 2188141 ggtatgccgt ggacggcacg atgggacgcg ccatgcgtga gtatgtgctg ccgacgtttc 2188201 ggatgttcgg catgaccaag ttctacaaac acaatccgtc ggtgcccgac gacgaggtga 2188261 caccggagta tctcgccgag aacaccttcg tggtcggctc ggtgcagacc gtggtcgaca 2188321 agctcgaggc cacctacgac caggtcggcg ggttcggcca cctgctgatc ctcgggttcg 2188381 actacagcga taacccgggc ccgtggaagg agtcgttgcg gctgctggcc cacgaggtca 2188441 tgcccagact caacgcccgc ctcgccacca agcccgccac cgcggtggtg tagccatggc 2188501 ggttcgtcag gtcaccgtcg gctattcgga cggcacgcac aagacgatgc cggtgcggtg 2188561 cgaccagacg gtcctggatg ccgccgagga acacggcgtg gccatcgtca acgaatgcca 2188621 aagcgggata tgtggcacct gcgtggccac ctgcaccgcc ggccgctacc agatgggacg 2188681 caccgaggga ctgtccgatg tcgagcgggc ggcgcgaaag atcctcacct gccagacgtt 2188741 tgttacctcc gattgccgga tcgagctgca gtatccggtc gacgacaacg ccgccctgct 2188801 ggtcaccggt gacggtgtgg tgaccgcggt cgagttggtg tcgcccagca ccgccatcct 2188861 gcgggtggac acctctggca tggccggcgc gctgagatac cgggccggcc agttcgccca 2188921 attgcaggtt cccggtacca acgtatggcg caactactcc tacgcccatc cggccgacgg 2188981 ccgcggtgag tgcgagttca tcatcaggtt gctgccggac ggcgtgatgt cgaattatct 2189041 tcgcgaccgc gcccagcccg gtgaccatat cgcgctgcgc tgcagcaagg gcagctttta 2189101 tctgcgcccg atcgtgcgac cggtgatcct ggtcgccgga ggaaccggcc tgtcagcgat 2189161 cctggcgatg gcccagagcc tggatgccga tgtcgctcac ccggtctacc tgctctacgg 2189221 ggtcgagcgc accgaagacc tgtgcaagct cgacgaactc accgagctgc gccgccgcgt 2189281 tggccgcctg gaggtgcacg tcgtcgtcgc tcgcccggac cccgactggg atgggcgcac 2189341 cgggctggtc accgacctgc tcgacgagcg gatgctggcg agcggtgacg ccgacgtgta 2189401 tctgtgcggt ccggtcgcca tggtcgacgc agcccgaacc tggctggacc acaatggctt 2189461 tcaccgtgtc gggttgtact acgagaagtt cgtggccagc ggggcggcgc gccgccgcac 2189521 cccggctcgg ctggattacg cgggcgtgga cattgccgag gtgtgccgcc gcggccgcgg 2189581 caccgcggtg gtcatcggcg gcagcatcgc gggcatcgcg gcggcgaaaa tgctcagcga 2189641 gaccttcgat cgcgtcatcg tgctggagaa ggacggcccg caccgtcgcc gcgagggcag 2189701 gccgggcgcg gcacagggtt ggcacctgca ccacctgctg accgccgggc agatcgagct 2189761 ggagcgcatc ttccctggca tcgtcgacga catggtgcgc gagggagcgt tcaaggtcga 2189821 catggccgcg cagtaccgta tccggctggg cggcacctgg aagaagcccg gcactagtga 2189881 catcgagatc gtctgcgcgg gaaggccgct gctcgaatgg tgtgtgcgcc gccggctcga 2189941 cgacgaaccg cgcatcgact tccgctacga atcggaggtg gccgatctcg ccttcgaccg 2190001 cgccaacaat gccatcgtcg gcgtcgccgt ggacaatggc gacgccgacg gaggcgacgg 2190061 tttgcaggtg gtgcccgccg agttcgtcgt ggacgcgtcg ggcaagaaca cccgcgtgcc 2190121 ggagttcttg gagcgtctcg gtgttggcgc tcccgaggcc gagcaggaca tcatcaactg 2190181 cttctactcc acgatgcagc accgggttcc gccggagcgg cggtggcagg acaaggtgat 2190241 ggtgatctgc tatgcgtacc gccctttcga ggatacctac gccgcgcagt actacaccga 2190301 cagctcccgc accatcctgt ccacctcact ggtggcctac aactgctatt cgccgccgcg 2190361 taccgcccga gaattccgcg cgttcgccga cctgatgccg tccccggtca tcggggagaa 2190421 catcgacggg ctggagccgg catcgcccat ctacaatttc cgctatccca acatgctgcg 2190481 gctgcgctac gagaagaagc gcaacctgcc gcgggctttg ctggcggtgg gcgatgccta 2190541 caccagcgcc gacccggtgt cgggtctggg tatgagcctg gcgctcaagg aagttcggga 2190601 gatgcaggcg ctgctggcta aatacggcgc cggtcaccgg gatctgccgc gccggtacta 2190661 ccgggcgatc gccaagatgg ccgacacggc ctggttcgtg atccgcgagc agaacctgcg 2190721 cttcgactgg atgaaggacg tcgacaagaa gcgcccgttc tatttcggtg tgctgacctg 2190781 gtacatggac cgcgtgctgg agctggtgca tgacgatctc gacgcgtacc gggaattctt 2190841 ggccgtcgtc catctggtca agccgccgtc ggcgctgatg cgacccagga tcgccagccg 2190901 cgtcctcggc aaatgggcac gaacccgatt gtcgggccag aagacgttga ttgcccgcaa 2190961 ctacgaaaat catccgatac cagccgaacc cgcggaccaa cttgtaaacg cttaggagag 2191021 cccaacgtgt cgcaggtcca tcgaatcctg aactgccggg gcacccgcat ccatgccgtg 2191081 gcggacagcc cacccgacca acagggaccg ttggtggtgt tgctgcacgg gtttccggag 2191141 tcctggtact cgtggcggca tcagattccc gcgcttgccg gcgcgggcta ccgcgtggtg 2191201 gccatcgacc agcgcgggta tggccgctcg tcgaaatacc gggtgcaaaa ggcctaccgc 2191261 atcaaggaat tggttggcga cgtcgtgggc gtcctcgact cctatggtgc ggagcaggct 2191321 ttcgtggtgg gccacgactg gggtgcgccg gtcgcctgga ccttcgcctg gctgcacccc 2191381 gaccgatgcg ccggcgtggt gggaatcagc gttccgtttg ccggtcgcgg cgtgatcggc 2191441 ctgccgggca gcccgttcgg cgagcgccgt cccagcgact accacctgga gctggccggg 2191501 cccggaaggg tctggtatca ggactatttc gccgtgcagg acggcatcat caccgagatc 2191561 gaggaagact tgcggggctg gctgctcggg ttgacctaca ccgtttccgg tgaggggatg 2191621 atggcggcga ccaaggcggc cgtcgacgcg ggcgtcgacc tggagtccat ggacccgatc 2191681 gacgtgatcc gtgccggacc gctgtgtatg gccgaaggcg cgcggctcaa ggacgcgttc 2191741 gtctacccgg agaccatgcc ggcctggttc accgaggccg atctcgattt ctacactggc 2191801 gaattcgaac gttccgggtt cggcgggccg ctgagcttct accacaacat cgacaacgac 2191861 tggcacgacc tggccgacca gcaaggcaag ccgctcaccc cgccggctct gttcatcggc 2191921 ggccagtatg acgtcggcac catctggggc gcgcaggcca tcgagcgtgc gcacgaagtc 2191981 atgccgaact accgcggcac ccacatgatc gccgacgtcg gacactggat ccagcaggaa 2192041 gcgcccgaag agaccaaccg gctgttgctc gacttcctag gcgggctgcg gccgtgagct 2192101 gcaccttcga catggtcccg gagaccgtcg atcatctcga cgaggtcggg ctgcggcggg 2192161 tcttcggctg ctttccgtgc ggcgtgatcg ccgtctgcgc gatggtcgac gaccagccgg 2192221 tcggcatggc ggccagctcg ttcacgtcgg tttcagttga cccgccgctg gtatcgatct 2192281 gtgtgcagaa ctgttcgacg acgtggccga agttgcgcga ccgcccacgg ctcggtgtga 2192341 gcgtgctcgc cgaggggcac gacgcggcct gtatgagcct gtcgcgcaag gaaggtaacc 2192401 ggttcgccgg ggtgttctgg agcgaattgt ccagcggggg tgtggtgatc gccggggccg 2192461 gcgcctggct ggattgccgc ccgtacgcgg agatcccggc gggggatcac ctgatcgccc 2192521 tgctggagat ctgcgcggtg cgcgccgatc ccgagacacc gccgctggtg tttcacggta 2192581 gccggttccg ccggttggag tctcgatgaa gacgaccgat gtgcgggtac gtcgtgcgat 2192641 cacggcgatg gcgggcggtc acgccgtggt cctgaccggc gaccccaatg gcgatggcta 2192701 tctcgtcttc gccgcccagg ccgcgacgcc gcggctggtt gcctttgcgg tccggcacac 2192761 ctcgggttat ttgcgcgtcg cgctgccggg cgccgaatgc gagcgactgc acctgccgcc 2192821 catgtgtgac cgagacacca cgcattgcgt gtcggtcgac gttcgcggca ccggcaccgg 2192881 aatctcggcg agcgatcgcg cctggaccat cgcggcactg gcttcggcca cctccgtcgc 2192941 cgccgatttc caacgtccgg gccatgtggt gcccgtgcag gcgcaagccg acggtgtgct 2193001 gggtcggcgg ggacccgccg aggcggccgt cgacctggcc cgcctggcgg aacggcggcc 2193061 ggccgccgcg ctctgcgaga tcgtctcgcc cgataatccc gtccagatgg cgcaccacgc 2193121 cgagtcggtc gaattcgccg tcgaacacgg actggccatg gtctcgatcg gggagctggt 2193181 ggcgtatcgc cggcggatcg agccccaggt ggtccggttt acggcagcga cgctgcccac 2193241 ctgggccggc gcctcgcgtg tcatcggctt tcgtgacgtt tacgacctcg gcgagcattt 2193301 ggcggtcatc gtgggtgcgg tcggtgccgg ggtgcccgtg ccgctgcacg tccacatcga 2193361 gtgcctgacg ggcgacgtgt tcggctcgac ggcgtgccgc tgcggcgagg aactcaacgg 2193421 cgcgctggcg aggatgtcgg ctcagggcag cggcgtggtc ttgtatctgc gtccgcccgg 2193481 acccgcgcaa gcgtgcggct tgttcgcccg gggcgatgcg gcgaccgatg tcatgccgga 2193541 gaccgtgaca tggatcctgc gcgatcttgg ggtgtatgcg atccgacttt ccgatgatgt 2193601 gccaggattt gggcttgtca tgttcggggc gatccgagaa gccagcacgt tggcggccgc 2193661 aggttgaacc atccagacct ggccggcaag gtcgcgatcg ttactggggc gggcgccgga 2193721 atcggtctgg cggttgcccg gcgactcgcc gacgagggct gccatgtgct gtgcgcggac 2193781 atcgatggtg atgccgcgga tgccgcggcc accaaaatcg gttgtggcgc agcggcctgc 2193841 cgggttgacg tcagcgacga acaacagatc atcgccatgg tcgacgcctg tgttgccgcg 2193901 ttcggcgggg tggacaagtt ggtcgccaac gccggtgtcg ttcatctggc ttcgctcatc 2193961 gacaccaccg tcgaggactt cgatcgggtc atcgcgatca atctccgcgg cgcctggctg 2194021 tgcaccaagc atgcggcacc gcggatgatc gagcgcggcg ggggagccat tgtcaacctg 2194081 tcgtcgttag cgggccaggt agcggtgggc ggcaccggcg catacggcat gtcgaaggcc 2194141 ggcatcatcc agctcagccg catcaccgcc gccgaactgc gctcgtcggg catccgctcc 2194201 aacacgctgc tgcccgcatt cgtcgacacc ccgatgcagc agaccgccat ggcaatgttc 2194261 gacggggccc tgggcgcggg gggtgcgcgc tcgatgattg cccggctgca gggccgcatg 2194321 gccgcaccgg aggagatggc cggcatcgtg gtgttcctgc tgtccgacga tgcgtcgatg 2194381 atcaccggca ccacccagat cgccgacggc gggacgattg ccgcgctgtg gtgatcccct 2194441 cgggtcaggc ggtttcgaaa gatcacgcga gacattgcct gcgacggcat gctacatatg 2194501 tgattccggt gtattcgggc ctctgcgcat tgctttcgat cacaatgagc ttggccgcga 2194561 gccgtcttgt tcgttgagcc acggggccgt tcgaatgcgt tcgtcagaac tccggctcgg 2194621 attctcgcta gtttgctgac gtgtcatcga gagcaatcga cggcgacctc gagggccgtg 2194681 cagatggcgc gcatccggat gtcggcgagg cggccaagcc gattcaccaa taccgcgacc 2194741 gagacacttt cgactgagtc caaattcacc gcggaacggc gcgggatcgg gtcggaaccg 2194801 ggttcaagaa caacctcact ggctagccct cggatggtcg tggtgcaggg cgcgacaagt 2194861 gcgcgtcgca gccgagggat cgcggcatcg cgcgacagca cgacgactgg tcgccgaccg 2194921 atctcagcca tctcacacca ccacacctct ccgcgcgccg gaagtgcggt cacgagtctc 2194981 cagccgcccg ccgccacgac gctagatcgc cccactcgtc gggctcatcg accgggtgct 2195041 tgtcgtaggc cgcatagctg gcatccacct cggccgatcg atgacgagcc agtaatgccg 2195101 caagggcctc atcgatgagg gctgcgtcag tgattcctgc ccgcatgtcg cgcgcacttg 2195161 tcaagagtgc ggcgtcgaca gtagtgctca gccgtatgcg attcatgcca ctactatgcc 2195221 acactccggg gcgtggatcc gcctgatcgg acgcaacgtg ctcgatacgg gcgaaacatt 2195281 ggtcgctgga cgaattgatg aggtctaccg cgcagcgcaa cgtcacctgc aaccgggccg 2195341 tcttcacggt gcgggttccg tgtcgatgaa cgacgctgcg gcacaacact ttttgtactt 2195401 gtgccccgag ccgcaccagc actgttggtt gcggcccggt ggccaggcca tcacatcgtg 2195461 gtcaccgtgt gctgtcaggt acgcggcata ctcggcgcgg gcctccggcg agtccggctc 2195521 ctgaccctgt tcggcgcacc aggcagcgaa gggtgccacg cggatcgcgg cgaccgccag 2195581 tcctgggaaa ccagcctcgg cgaattcgac cagcttttgc tgcatcctcc ggcagtacag 2195641 cgggtgcgcc accggcccgt ccggaccggt caccaggtcg ctgccggcga agtctggcca 2195701 caggtcgagc gcccgctcgt agtcaccggc aggcagccac gccaatgaca ccgcggtgat 2195761 cggttcggcg gattccgccg cgggtgtctc atcgacggga ggcacccggc tggctccgtt 2195821 gtcactcatg gtccaacatc ctgccgcatc accaccgcac gcggcatatg atgctcgcag 2195881 tcgcggtggt gcggccttat cgccatgagc gaaatcttct gtatcactga tcattccgag 2195941 cctatgacgg cccggttctt gtcagtggtg cttcgtagaa tccgaggcat gaggtcggac 2196001 acgcgcgagg agatctccgc ggcgttggat gcctaccacg cctcgttgtc gcgggtgctc 2196061 gatctcaagt gcgatgcgtt gaccaccccg gaattgctgg cctgtttgca gcgactcgag 2196121 gtcgaacggc gccgccaggg cgccgccgag cacgccttga tcaaccaact cgctgggcaa 2196181 gcctgcgagg aagagctcgg cgggacgctg cgcacggcgt tggccaaccg gctacacatc 2196241 actcccggtg aggccagccg ccgcatcgcc gaagccgaag acctcggtga gcgccgcgcc 2196301 ctgaccggtg aaccgctgcc agcgcagttg accgcgaccg cggccgctca acgtgagggc 2196361 aagatcggcc gagaacacat taaggagatc caggccttct tcaaggagtt gtccgccgcg 2196421 gtggatctgg gtatccgcga ggccgccgag gcccagctgg ccgaactggc caccagtcgg 2196481 cgtcccgatc acctgcatgg cctggccacg cagctgatgg actggctgca ccccgacggc 2196541 aacttttccg accaggagcg tgcccgcaag cgcggcatca cgatgggtaa gcaggaattt 2196601 gacgggatgt cacgtatcag cggtctgctg accccggagt tgcgggccac catcgaggcg 2196661 gtgttggcca aactggccgc accgggggcg tgcaaccccg atgaccagac cccggtcgtg 2196721 gatgacacac cggatgcgga cgcggtgcgc cgcgacaccc gcagccaagc ccaacgacac 2196781 catgacggtt tactggccgg gctgcgcggg ttgttggcct ccggtgagct agggcagcat 2196841 cgggggttgc cggtgaccgt cgtggtgagc accacgctta aagagctgga agccgccacc 2196901 ggcaaggggg taaccggtgg tggttcgcgg gtgccgatgt cggaccttat ccggatggcg 2196961 agcaacgcgc accactatct ggcattgttt gacggcgcta agccgttggc gttgtatcac 2197021 accaagcggt tagcttcccc ggcgcagcga atcatgttgt acgccaagga tcgtggctgc 2197081 tccaggccgg gttgcgacgc cccggcctac cacagtgagg tccaccacgt aacgccgtgg 2197141 acaaccaccc accgtaccga catcaacgac ctcacgctgg cctgcggccc cgacaatcgc 2197201 cttgtcgaaa aaggctggaa aacccgcaag aacgccaaag gcgacactga atggctaccg 2197261 ccggcccact tggaccatgg ccaaccacgc atcaatcgat accaccaccc cgagaaaatc 2197321 ctgtgcgaac ccgacgacga cgaaccacat tgacacccaa tgaccgtggc attgccggtc 2197381 acgtcgcaac caagtactgc gaccgtagcc gcgctcaagg ctcggggtag acgagcgcgg 2197441 agagaggcac gttgccgagc tgcctgccga cgacgagtat cccaatatcg tgctcaccca 2197501 tagcgtttca gcgggcaacc aacgattgcc ggccagcgaa tctcggtggc ggtagccagc 2197561 atgaaggacg cagatgacct cgccgactac gggctgagca tagagcaggt gcgtgcagcc 2197621 gtcgactcgc atgtggacgt ggaccattct gtctcagcgc tgtgaccgca cggtagagtt 2197681 cgccatcgtg gctgacgatg acgtcaccgg tcaggatggc tccggcgacg gcaccgatcc 2197741 gcgcaccatg ctgggccggt ttgccaacca gcacaacgaa tgggtgcgcc tgagcgtgcg 2197801 ccacgtgctc gatgcgggcg aagcattgga tgccggacag attgattagg tctaccgcca 2197861 ctttcggcag gaaaaggcac tggacacacg ccaccgagcc ggccgtacca ccgttgacac 2197921 tcggcatcag caacccggaa acagccgaac ccctgatcat ctggccgacc tcgcccctgg 2197981 ccgcaccgcg accatcgggc tgcgggattc cagctgcctg cgcgtggacc gctacaacga 2198041 ccaggcgtcc gggcgagcgc tcatcgagat ccggttgtgc aacgaacgtg ccacgccgat 2198101 gccaatcccg atcgggctgt ggatgtttca gaccaagctc cacgtcaacg ccggcggcgc 2198161 tgacgtgttc ctgccggtct gcgacgtgct ggagcaagac ctcgccgagc gcgacgagga 2198221 ggtacgccag ctgaacctgc agtaccgcaa ccggttggag tatgcgatcg ggcggacttg 2198281 ctcggcggcc tggtcggtga acggctcgcg gcgcccgtcg gcagtgtgga ccacctggct 2198341 gccggtcgcc gaaacacccc acacccgggc ccggtcggtg gagaacgcgc tgttgtccat 2198401 ggacagtcgc ggaggggtta cgtagcggac tggcgtcgtt cgtcgcggga tatggaagct 2198461 ggtttcaggg tcaggcggct gtcgcggccg agctgcccga gcacctgcac ccgaccgccg 2198521 acgagaggct ggctcatgtt gcggccgaaa aggaagcgct gcgctgcttc cagttcatga 2198581 accaggtgat gcgcgatcac cgtaaaagct tgtcagaggt gcagtgaaca ctgtttccat 2198641 gaccaagagc aacgggcact gttgagacac agcgcgtcgc caacgggcgc tgcctgtggc 2198701 cgaacatcgt aaatcaagca tattcgtcaa cagatatcat caatgtcggc gccggactat 2198761 tcaaatcatc gatatactgg tggcctggtc cttcgccatc gatcaatggc gatagcttat 2198821 cgaggatttc taccaacttc gtgtcatcga agcgccatac aacggtttgc gatcccagtt 2198881 ccatatccgc agttccgctt tctcgaacta tccgttgctg tacaccatct atgtcgaaag 2198941 ttgcctgacc actctcatgg gccgatcgca cggcgtactg gaaaatgcga agcccatccc 2199001 ggtctgcggc cgccagaacc acgtcaccga agtagttatc cggcttgata ccgaaaacgg 2199061 tcattctggt ccaatcactt gtgagtcgga aggtccccga tgggaatatt ctgccacctg 2199121 gcggtcggcg aatcgtgggg gttgtaatcc caatgcggat agcggtaatt gtctcccgga 2199181 aaatatcgcc actcgccgcc gttctcgtcc cagaggcttt cgccgccgcg agcctgttgg 2199241 ataggacgac tgggcggtcc aaccgttagg ttgctctcgg cgggcgggct gacaccgggc 2199301 ggaggtaagc cttcgttcgg ttgtggtcca gcggggtcgg gagcaggagg gggttcgcct 2199361 tcgaccggca cgcccaattc gcctaaccgg gctctgattg cggcttgtct ttcaaagaga 2199421 gaacccttgt ccgcgatgca cgcgtcgtag gctgcctgct cgttgggcag aacgaaggtg 2199481 cgtccgcatc gggcgttgta ccgagcgatg tcagcgttga cggcgtccca ggctgcgcgt 2199541 gcctgtacgg ctgtcatgtc tttcgggtcg cccggcatcg gtgagggtgg atcttgtttc 2199601 cagctgcggt cgaccgcgtg gatttgcggt ttctcgttgt ggggcaccgg tgtcggtagc 2199661 gacggtgcga ttgggggttc gtggaagccg acggtgttaa ggggtgcggt agcggtggct 2199721 attttggcgg ccacttcgtg ttcgactccg atgagttgtg tcgcgcgttg gcggatatcc 2199781 ccggccaatg cttgtgcttg ggcctggcga gctgcttgtt cggcgaaggt gcggctggtt 2199841 cgggtgtcgg tgaccgagag gtcctcttcg acgttgaagc ccgcgttgtg ggcatcttga 2199901 acggcataga tgaccctgcg ctgggctgcg ccgatggttc cggcgccttc acgggcaagc 2199961 ccactcgctt ggcgcaaatg ctcggctatg ccactgacta tctgtaggtc agcgccggtt 2200021 cgctgtcgca gccatcaccg cctgcgcctt cccacgcgat gaagtgggat cggttacgca 2200081 tctctaggaa cacgtcttcc cactgatcgg cgaccttcgt ccagtagtag gccgcctcga 2200141 tgagatgttc ggtgtcccag gcgtggatat gcgacagggt cggcagcaat tacaccagcc 2200201 tcgtttgcgg cacggccgcc atctcggccg ctgcggccgc ctcgttgttg gcatactccg 2200261 ccgcggccgc ctcgaccgca ctggccgtgg cgtgtgtccg ggcggtaaac gccgccacgg 2200321 caagaccaac cgctgcgtgg gcaccgccca cagccgccgt ggtgggttgg aacggctgcc 2200381 ccagcggagg tggtgcaagg acgctgagtt cagtgcttcg cccgctccat tggctggccg 2200441 tagccgctac ctgttggata ttgacccgca gctcaccggc tttcatcctc ggaaagttta 2200501 atagcgagct acagggtggc aactcatcgc aggtcgagcc aactactgcc gggccgggtg 2200561 accgcagctc gtgctgaggc agcaccgagg ctggctgact caagcagtct cggcgtatgc 2200621 cagcctgatc gcgaacacgg gagtcaaccg gggcaaccgc cgtccgccgg acaacctcga 2200681 tccgatatca attaagcgat atcgtcatct ccgatggagc agatcgtgat ccgcaacctt 2200741 cccgagggga ccaaggcggc actacgggtc cgtgctgcac gtcatcacca ctccgtcgaa 2200801 gcggaagccc gcgcgatcct caccgcggga ttgttgggcg aagaagtccc catgccggta 2200861 ctgctggccg ccgacagtgg ccatgacatc gacttcgagc ccgaacgtct cggcctgatc 2200921 gcccgcaccc cgcaactgtg acctacgtcc tggacaccaa cgtggtgtcc gctttgcgcg 2200981 tgccgggacg ccaccccgcc gtggcggcgt gggcggactc ggtgcaagtc gccgaacagt 2201041 tcgttgtggc gataacgctg gccgagattg agcgaggcgt gatcgccaag gaacgcaccg 2201101 acccgaccca gagtgagcac ctacggcgct ggttcgacga caaggtgctg cgcatattcg 2201161 tgttcgcccg ccggggcaca aacctcatca tgcagcccct agctgggcat ataggttaca 2201221 gcctatattc tggtataagc tggttttaga cgaaaaggac cccacctcgg ggtctgatgg 2201281 ccaggggcag ggtcgtgtgc attggggatg caggttgcga ctgtacaccc ggcgtgttcc 2201341 gcgcgacagc gggtgggatg ccggtgctgg tggtcatcga gtctgggaca ggaggtgatc 2201401 agatggctcg taaagctacg tccccgggta agccggctcc gacgtcggga cagtatcgcc 2201461 cggttggcgg tggcaacgag gtgaccgttc cgaagggaca ccgtctgcct ccctcgccca 2201521 agcccggtca gaagtgggtg aacgtcgatc cgacgaagaa caagagcggc cgcggctgag 2201581 cttgtgccgt cgggatgggt gtcgcaccgt ctcggcgggt cgcccaagtg cataagtgct 2201641 ttgtcgctgc cctccggtac cgtcggagcc ccgtccaagc cggacaacga cgccactcga 2201701 ggcaggacaa gaccaactgt gccgccccct gatccagccg ccatgggtac ctggaagttc 2201761 ttccgggcat ctgtggatgg ccggccggta ttcaagaagg agttcgacaa gcttcctgat 2201821 caggcccggg ccgcgctgat cgtgctaatg cagcggtatc tcgtcggcga cctcgccgca 2201881 gggagcatca aaccgattcg tggcgacatt ctggagttgc gatggcatga ggcgaacaac 2201941 cacttccggg tactgttctt ccgctggggc cagcatcccg tagcgctgac agcgttctac 2202001 aagaaccagc agaagactcc caagacgaag atcgagacgg ccctggaccg gcagaaaatc 2202061 tggaaaagag ccttcggcga caccccaccg atctgaacaa cgcccaacca ctgttacgag 2202121 gctaggagag cacaaccatg agcattgact tccctttggg tgacgacctc gccggctata 2202181 ttgccgaggc gattgcggct gatcccagct tcaaaggcac tctcgaagac gccgaggagg 2202241 cacgcaggct ggtcgatgcg ctgattgcgc tgcgcaagca ctgccagctg agccaggttg 2202301 aggttgctaa gcgtatgggg gtgcgccagc ccaccgtgag cggtttcgag aaggaaccca 2202361 gcgaccccaa actgtctacg ctgcaacgtt atgcccgtgc attggacgcc cggctgcggc 2202421 tggtgctcga agttcccacg cttcgcgaag tgcctacgtg gcatcggctc tcctcttatc 2202481 ggggctccgc acgggaccac caggtccggg tgggtgcaga caaggaaatc ctgatgcaga 2202541 cgaactgggc ccgccacatt tcggttcggc aggttgaggt ggcatgactg accgaaccga 2202601 cgccgacgac cttgacctgc aacgcgttgg cgcgcggctg gcagcccgcg cacagatccg 2202661 cgatatccgg ctgctgcgca ctcaggccgc tgtccatcgt gcgcccaagc ctgcgcaggg 2202721 cctgacctac gacctcgagt tcgaacccgc tgtggatgcc gatccggcca ctatctcagc 2202781 atttgtggtg cggatttctt gccacctgcg cattcaaaac caggcggcag acgacgacgt 2202841 caaggaaggc gataccaaag acgagacaca ggacgtagcc accgctgatt tcgagttcgc 2202901 ggcactgttc gactaccact tgcaagaagg tgaagacgac cccaccgaag aagaacttac 2202961 ggcatacgcc gccacgaccg ggcggttcgc gctttatccg tacatccgcg aatacgtcta 2203021 cgacctcacc ggccgtctcg cactgccacc gttgaccctt gagatattgt ctcggccgat 2203081 gccggtttct cccggcgccc aatggccggc aacgagagga acgccctgac caaacgaggg 2203141 tgaatcaagc tgcccgacga ccatggtttc cacacctacc gccagatgca gcgctggact 2203201 gtcagcccag cggcacgggt cgagatcctg ggccgctact ggtggagaat ccgccgccgt 2203261 gccaccgaag gggcgaaggc gaaatccaaa ggcaaggccc gccgcggctc tcagttcaag 2203321 gttctcgaac acgggtgatg cggttcgagc ccgggaaggt ggagcgttag ccgcagggga 2203381 gggaatcttg gcgggtcggc cgacaagagg ttgaacttga ctgcgggaca gcagtttacg 2203441 gctcttgtcg ccacgcctac agcggattcg cataccgccg gggttcattg acaaccggcg 2203501 ggggttcgtt ccgccgtgtt tccgaggtag gtatcggcgg gggtgtatgt cggtaggcct 2203561 cgggaatgtc cgacaggcgc gatgggagat cttcgcgttg atcaccgcgc caatggatgg 2203621 tgtcgggatc atcccccggc tgacgggaaa tgcggccggc cattcttcct caagatcgag 2203681 tcagaggttc cggtcgacgt ccatccgttg gtgcaggact cgcacgacgt cgatggtgcc 2203741 ttcgccagtc acccgataga acaacgtgtg tgacccggcc gagagcttgc gatagccggg 2203801 gcgaatctcg tcgcacgctc gtccgatccg cgggtttgcc gcagcacggt cgatagcgtg 2203861 ttgaagttcg cgcaggtact gctcggcctg atcgacaccc caacggtcat aggtgcagtc 2203921 ccagatctct tccagatgtg cctgcgcggc aggcgagaga aggtatcggc tactcaccgg 2203981 ccacgcgagg cgtcagcccg cttacgaccg aggaatccgt cgaagtcgaa cggtgtcgag 2204041 ctgccgctgc gttcgccggc ctcgagagcc tcacgaagcg cgcgcagctg ggtttcacgg 2204101 tcctcgagca gtcgcaacgc ggagcggatg acttcactgg ccgaccggta gcggcccgcg 2204161 gcgatctcgc cgtcgatgaa ggcgctgtag tgctcgtcga ggacgaagga cgtgttctta 2204221 cccacgaacg cacaatacca attgttggta gtaggtgtta gcccctggga caccccaagc 2204281 cccagcggca gaatctcctg gggatcggca tggccgcacc aggcgcggcg cgcccagaca 2204341 tgtcagaggg tgaggcgaca ctggatgatc gacaccaccg aagcggcata tcggctgacg 2204401 tatcagccgg acggcacgtc gatcaccgtc cgggagaacc tggtcgacat cctggcgcgt 2204461 gagctgctcg gcccgatccg cggcccgcag gaggtgttgc cgttcagccc gcgctcgcaa 2204521 tacctggtcg ggcacctcgc cccggtaaag ctgaccggcg ccgcgctcat cgacgacaac 2204581 gcggtccagg cccgtgccaa cgccgaggcg ctcgccgagg gcggtggcgt gccggcctac 2204641 gcggccgacg aaacgacgcc gacaccgacg acgacgccca agaccgcgca cccaagcagg 2204701 gcctgatgat cccggcatca atgggtttac ggtttcaggt gccacccgat ctggtgtcgt 2204761 tcaccatcac cgcgtcatgg ataacctacg agaccgtcga gagcgggagg tgaccaaggc 2204821 cggccgtacg atagccagcg cgatagcagt gatctcgtcc cggcttcatc gcgcttgtcc 2204881 gggtgcgacg accgccaacg acagggcctc ggcggcttcc ttaaggcggt tgtcgtaggt 2204941 aaccagcgcg gtcaatggtg caacggatcc ggcggtttga gcagtggcta ggtgtatcgc 2205001 gtcgagcgag cgcagtgctg ggttggggta ggccgccgcg gtggagcgta tgaccgcgtc 2205061 gatttcgaaa cggtccagcc tggctagcac ggagggcacc gccggtagcc cttctgggga 2205121 gactgcgcgg atggctctgg atagctcaac ttcggtcaaa gccgatgtga tccaccgtag 2205181 ttcggtgcgg tcatcgagcc aatcagctaa agcgtcagat tcgacctcga tccgaattag 2205241 cttgaccagc gccgaggttt ccaggtagat cacgcgctag taccgctcct cggcgcgcat 2205301 gcgctccaac agcgttcccg agtcgagacc gccgcgcatc ggaattgtgg gccgaggcgc 2205361 cgggccatgc actctcgccg gttgcacact gccggtgctg atcagtgagt cgagagggcc 2205421 ggcagaagcc gggattattc gggcgataac cttgccgcgc tcagtcaggt tgatctcttc 2205481 accgcgcttg acgcgggcca ggaccttgga cgtctcctgg ttgagcgttc gtatggacac 2205541 ctcattcaca ccgataatgt actacctatt tgttctacat gctatgcgcg caagaggtta 2205601 cctgccccgc tggtcaggat cgccagcgcc aggccactga tctcgtcggc gactccggcg 2205661 tagcgcgtga gatgccaggt gcgagcgacg tcttcgatga agctaatcgc cgccgcgacc 2205721 agcagtcgcc cctgggcgac actggtcgcg ggtaccagct tgccgatgag gtcgatccac 2205781 acggcctcgc ggtcgccctg atttcgcagg tagccgtcgc gtacttcgac agaggcgtgc 2205841 gacagttcgg tgaccgacac tgccaccaga tccggagcgt ccaagctgat ccgaacgtgc 2205901 ccttggacaa ggccgcgcaa ccgttgtgcc gcttgctgat tcgctcgtag cgctcggatg 2205961 cactccaggc agcgccactc gtcgaggcgg cggatgagcg cgtccaggat ggcctgtttg 2206021 gaagaaaacg aacggtacag ccccgggccc gcgatgccgg ctcccttgcc gatttcgctg 2206081 gtgttgacgg ccggatagcc ctgcgcacgg aacagccgcg cgcccgcggc cagcagggtc 2206141 tcgtagcggg agaacagcac gtcggcctcg tcgcgtgcgg catcaccggc cggcagtggc 2206201 ggcaattcgc agacgggagg cgtccttgcc gcggccatac acgcctggta gagaagcttt 2206261 ttcagttcct cgcccggcag gcttaggctg tgccggccca ggctggtcaa agtgctggac 2206321 accgcccacg cccgcaactc cgaatgctgt ggactcagat cgggcacctc cagcagcacg 2206381 ctgtcacgca tgccggcgac gatcgcgttg atgcggcgcc ggaccgccgt gcggtcgtcc 2206441 tcgttgaggt agcgggcctc gcgctgccac agcaccgtca acgcccgaga ggcgaccgcc 2206501 gcggcgatca ggtcttccag atcggcgttc aacggccgcg gcgtcggctc cgtctcgccc 2206561 tcggtgagac gacgcgcgct ctggtactga tcctggccgg ttcggatcgc ttcggcgagc 2206621 aacgcctgct tgttgtcgta gtggcgatac aacgcgcgcg cggtcacccc ggccgcctcg 2206681 gcaatgtcct ccaatttgac cgaatggaag ccacgttcga tgaacagtcc aacggcctga 2206741 tccaaaatct gcttcttccg gtcctttggg cggcgcctaa cgggttgggc gacggatgcc 2206801 atcggctcga acccccttct tgcgcaccgg aatcacaaat cctgctagca gcatcgcctc 2206861 agcttcaccc cgctcattct tcacctcgaa tgcgccggtc accgggtgcg acacttaccg 2206921 gccgtcgttc atggtgacgt ttcgaggctg tgctgctgcc aagaccccag gaagtctcgg 2206981 acgagagact cgctagcctc cgtggtatcg ggcatcccta tcacccctgc tcgatcctca 2207041 atatcggact aacaaaatac atcatcgcgc ctgtatacgc gattacattg caatttatcc 2207101 ttatcaccct tcttagagtg catatcagta atagacatat cgcgctcctc gcgccccagg 2207161 aggcggtcga cgaattcgcc gtgcgcaacg acatgagccg tcgctgagcc tgaaaacctg 2207221 cagacaaagc gcgagtgggg gctggcaaaa ctacaggctc gttagcagca agttgcttcg 2207281 acgaccatgg tggcaacctc gccggtcgcg aaggctctgg tcggcgggcc cgaatcgagg 2207341 cggtcaggat gcggcatccg atcaccgccc gtcgggcgcg ctgttgatgc ctgatcgtgg 2207401 tgcctcgcca gcgtgactcg agccaacggc ttgaccggtg atgcgcctgt cggccgccaa 2207461 ggcagcagag cacatcgccc cgcgctatag gatactagca agatacatca tagccaatat 2207521 atgccagttt gcattgctat ttaccgatca gttgtccaag caatcgcgta ttggctatgg 2207581 acatcagcgg ttctgccgcg tacgctcacc aatgtcaccg atcgtcgacc tgtccggggg 2207641 gccagcgtgc gccacctcac ccaacggccc agcatcgaat ccagctggtg cgccgcgcca 2207701 tggtaatcgt ggccgacaag gcggccggtc gggtcgctga tccggtcttg cggccggtgg 2207761 gcgcgctggg cgatttcttc gcgatgacgc tcgacacgtc cgtgtgcatg ttcaagccgc 2207821 ctttcgcgtg gcgtgaatac ctacttcagt gctggttcgt ggcgcgggtg tcgacgctgc 2207881 ctggggtgtt gatgacgatc ccatgggcgg tgatctcggg gtttctcttc aacgtcttgc 2207941 tgaccgacat cggtgccgcg gacttttccg gcaccggctg tgcgatcttc accgtgaacc 2208001 aaagcgcccc gatcgtcacg gtcttggtgg tcgcgggcgc gggcgccacc gccatgtgcg 2208061 ccgatctggg tgcgcgcacc atccgtgagg aactcgacgc actgcgggtg atgggcatca 2208121 acccgatcca agcgctagcg gctccgcgcg tgctggcggc caccacggtg tcgttggcgc 2208181 tgaattcggt ggtgaccgcg acggggctga tcggcgcgtt cttttgctcg gtgtttctca 2208241 tgcacgtctc ggcgggggca tgggtgaccg ggcttaccac gctgacccac accgtggacg 2208301 tcgtcatttc gatgatcaag gcgacgttgt tcgggctgat ggccggactg atcgcctgct 2208361 ataagggcat gtcggtcggt ggcggcccgg ccggagtcgg ccgggcggtg aacgaaaccg 2208421 tggtgtttgc cttcatcgtc ttgttcgtga tcaacatcgt cgtcaccgcg gtcggcatcc 2208481 cattcatggt gtcctgaggt gaacccatga cggcagcgaa agcccttgta agcgaatgga 2208541 atcggatggg atcgcagatg cggttcttcg tcggcacgct ggccgggatt cccgacgccc 2208601 tcatgcacta ccgcggcgag ctgctgcggg tgatcgcgca aatggggttg gggaccgggg 2208661 ttcttgcggt gatcggtgga acggtcgcga tcgtcgggtt cttggcgatg accaccggcg 2208721 cgatcgtggc cgtgcagggc tacaaccagt tcgcttcggt gggtgtggag gcgctgaccg 2208781 gcttcgcgtc ggccttcttc aacacccgcg agattcagcc cggaaccgtg atggtcgcgc 2208841 tagcggccac cgtcggtgcc ggtaccaccg ctgcgctggg ggcgatgcgg ataaacgagg 2208901 agatcgacgc gctcgaggtg atcggcatcc gcagcatcag ctacctggcg agcacccggg 2208961 tgctggccgg agtggtcgtg gccgtccctc tgttctgtgt gggactgatg acggcctacc 2209021 tggccgcgcg cgtcggcacc accgccatct atggccaggg gtcgggcgtg tacgaccact 2209081 acttcaacac gttcctgcgc ccgaccgacg tgctctggtc gtcggttgaa gtcgtcgtgg 2209141 tcgctctgat gatcatgctg gtgtgcacct attacggcta cgccgcacat ggcgggccgg 2209201 ccggggttgg cgaggcggtc ggccgggccg tgcgtgcctc gatggtcgtc gcgtcgatcg 2209261 caatccttgt catgacgctg gccatctacg gccagtcgcc caactttcac ctggcgacct 2209321 agtgacatga gacgcgggcc gggtcgacac cgtttgcacg acgcgtggtg gacgctgatc 2209381 ctgttcgcgg tgatcggggt ggctgtcctg gtgacggcgg tgtccttcac gggcagcttg 2209441 cggtcgactg tgccggtgac gctggcggcc gaccgctccg ggctggtgat ggactccggc 2209501 gccaaggtca tgatgcgcgg tgtgcaggtc ggccgggtcg cccagatcgg tcggatcgag 2209561 tgggcccaga acggggcgag cctcagactg gagatcgacc ccgaccagat ccggtacatc 2209621 ccggccaatg tcgaggcaca gatcagcgcc accaccgcat tcggtgccaa gttcgtcgac 2209681 ctggtgatgc cgcaaaaccc aagtcgtgca cggctgtccg ctggggcggt actgcattcg 2209741 aagaacgtca gcacggaaat caacaccgtc ttcgaaaacg tcgtcgacct gctcaacatg 2209801 atcgacccgc tgaaactgaa cgccgtgctg accgcggtcg ccgacgccgt tcgcgggcaa 2209861 ggtgaacgga taggccaggc caccaccgac ctcaacgagg tgctggaggc actcaacgca 2209921 cgcggcgaca ccatcggcgg caactggcga tcgctcaaga acttcaccga cacctatgac 2209981 gcggccgccc aagacatcct gacgatcctg aacgccgcca gcaccaccag tgcgaccgtc 2210041 gtgaatcatt cgacgcagct ggatgccttg ctactcaacg ccatcggact atccaacgct 2210101 ggcaccaacc tgcttggcag cagccgagac aatctcgtcg gcgcggccga catcctggcg 2210161 ccgaccacga gcctgctgtt caagtacaac cccgaataca cctgcttcct gcagggcgcc 2210221 aagtggtatc tcgacaacgg cggctatgcg gcctggggcg gggccgacgg gcgcacgcta 2210281 caactcgatg tggcgctact gttcggcaac gacccctatg tctatccgga caacctgccg 2210341 gttgtcgcgg ccaagggggg tcccggcgga aggccgggat gcgggccatt gccggatgcc 2210401 acccacaact tcccggtgcg ccagctggtc accaacaccg gatggggaac cgggctggac 2210461 atccggccca accccggcat cgggcatccc tgctgggcca actacttccc ggtgacccgc 2210521 gcggtgcccg agccgccgtc gatccgtcag tgcatccccg ggccggcgat cgggcccaac 2210581 cccgcggcgg gggagcagcc atgagggaga acctgggggg cgtcgtggtg cgcctcggcg 2210641 tcttcctggc ggtatgcctg ctgacggcgt tcctgctgat tgccgtcttc ggggaggtgc 2210701 gcttcggcga cggcaagacc tactacgccg agttcgccaa cgtgtccaat ctgcgaacgg 2210761 gcaagctggt gcgcatcgcc ggcgtcgagg tcggcaaggt caccaggatc tccatcaacc 2210821 ccgacgcgac ggtgcgggtg cagttcaccg ccgacaactc ggtcaccctc acgcggggca 2210881 cccgggcggt gatccgctac gacaacctgt tcggtgaccg ctatttggcg ctggaggaag 2210941 gggccggcgg actcgccgtt cttcgtcccg gtcacacgat tccgttggcg cgcacccaac 2211001 cggcgttgga tctggatgcc ctgatcggtg gattcaagcc gctgtttcgt gcgctgaacc 2211061 ccgagcaggt caacgcgctg agcgaacagt tgctgcacgc gtttgccgga caggggccca 2211121 cgatcgggtc attgctggcc cagtccgcgg ccgtgaccaa caccctggcc gaccgtgatc 2211181 ggctgatcgg gcaggtgatc accaacctca acgtggtgct gggctcgctg ggcgctcaca 2211241 ccgatcggtt ggaccaggcg gtgacgtcgc tatcagcgtt gattcaccgg ctcgcgcaac 2211301 gcaagaccga catctccaac gccgtggcct acaccaacgc cgccgccggc tcggtcgccg 2211361 atctgctgtc gcaggctcgc gcgccgttgg cgaaggtggt tcgcgagacc gatcgggtgg 2211421 ccggcatcgc ggccgccgac cacgactacc tcgacaatct gctcaacacg ctgccggaca 2211481 aataccaggc gctggtccgc cagggtatgt acggcgactt cttcgccttc tacctgtgcg 2211541 acgtcgtgct caaggtcaac ggcaagggcg gccagccggt gtacatcaag ctggccggtc 2211601 aggacagcgg gcggtgcgcg ccgaaatgaa atccttcgcc gaacgcaacc gtctggccat 2211661 cggcacagtc ggcatcgtcg tcgtcgccgc cgttgcgctg gccgcgctgc aataccagcg 2211721 gctgccgttt ttcaaccagg gcaccagggt ctccgcctat ttcgccgacg ccggcgggct 2211781 gcgcaccggc aacaccgtcg aggtctccgg ctatccggtg ggaaaagtgt ccagcatctc 2211841 gctcgacgga ccgggcgtgc tggtggagtt caaggtcgac accgacgtcc gactcggaaa 2211901 ccgcaccgaa gtggcaatca aaaccaaggg cttgttgggc agcaagttcc tcgacgtcac 2211961 cccccgcggg gacggccgac tcgattctcc gatcccgatc gagcggacca cgtcgcccta 2212021 ccaactgccc gacgcccttg gcgatttggc cgccacgatc agcgggttgc acaccgagcg 2212081 gctgtccgaa tcgctggcca ccctggcgca gacctttgcc gatacgccgg cgcacttccg 2212141 caacgccata cacggggtgg cccggctcgc ccaaaccctc gatgagcgcg acaaccaact 2212201 gcgcagcctg ctggccaacg cggccaaagc caccggggtg ctggccaacc gcaccgacca 2212261 gatcgtcggc ctggtgcgcg acacgaatgt ggtcttggcg cagctgcgca cccaaagcgc 2212321 cgccctggac cggatctggg cgaacatctc ggcggtggcc gaacaactgc ggggcttcat 2212381 cgctgagaac cgccagcagc tgcgcccggc gctggacaag ctcaacgggg tgctggctat 2212441 cgtcgaaaac cgcaaagagc gtgtgcggca ggccatcccg ctgatcaaca cctatgtcat 2212501 gtcgctgggt gagtcgctgt cgtcgggccc gttcttcaag gcatacgtgg tgaacctgct 2212561 gccgggtcag ttcgtgcaac cgttcatcag cgccgcgttc tccgacctgg ggctcgaccc 2212621 ggccacgttg ctgccgtcgc agctgaccga cccaccgacc ggtcaacccg gaaccccgcc 2212681 gttgccgatg ccctacccgc gcacgggcca gggcggtgag ccgcggctga cgctgcccga 2212741 cgcgatcacc ggcaatcccg gcgatccgcg ctatccgtac cggccggagc cgcccgcgcc 2212801 gccgcccggc gggccgccgc ccggcccgcc cgcgcagcag ccgggagacc aaccgtgaca 2212861 acgaaactca gacgtgcccg ctcggtgttg gcgaccgccc tggtgctggt cgcgggcgtg 2212921 atcctggcca tgcgcaccgc cgacgccgcc gcccgcacga ccgtggtcgc ctacttcgac 2212981 aacagcaacg gtgtgttcgc cggtgacgac gtgctcattc ggggcgtgcc ggtgggcaag 2213041 atcgtcaaga tcgaaccgca accgctgcgc gccaagattt cgttctggtt cgaccgcaaa 2213101 taccgagtcc ccgccgatgc cgccgcggcg atcctgtcgc cgcaactggt gaccggccgg 2213161 gccatccagc tgacaccgcc gtatgccggc gggccgacca tggccgacgg cacagtaatc 2213221 ccgcaagagc gcaccgtggt gccggtggag tgggacgact tgcgggcgca acttcagcgg 2213281 ctgaccgcat tgctgcagcc cacccggccg ggcggcgtca gcacgctggg tgcgctcatc 2213341 aatactgccg ccgacaacct gcgcgggcaa ggcgccacca tccgcgacac catcatcaaa 2213401 ctgtcacaag cgatttcggc tctcggtgac cacagcaaag acatcttctc caccgtgacg 2213461 aacctgtcga cgctggtcac ggcgctgcat gacagcgctg acctgctcga acggctcaac 2213521 cacaacctgg ccgcggtgac ctcgctgctg gccgatggcc cggacaagat cggtcaggca 2213581 gccgaggacc tcaacgcggt cgtagccgac gtcggcagct tcgccgccga gcaccgcgag 2213641 gcgatcggca ccgcatcaga caagctcgcg tcaatcacca ccgcgctggt cgacagcctc 2213701 gacgacatca agcagacgct gcatatcagc ccgacggtgt tgcagaactt caacaacatc 2213761 ttcgaaccgg ccaacggcgc gctgaccggc gcgctggcgg gcaacaacat ggccaaccca 2213821 atcgccttcc tgtgcggcgc gatccaggct gcctcccggc tgggcggcga gcaagcggcc 2213881 aaattgtgcg tgcaatacct ggcgccgatc gtgaagaacc gccagtacaa ctacccgccg 2213941 ctgggggcga acctgttcgt cggggcgcag gccaggccta acgaggtcac ctacagcgag 2214001 gactggctgc ggcccgatta cgttgcacca gttgcggaca cgccgccaga tccggccgcg 2214061 gccgtgaccg tcgatcccgc gaccggcctg cgcggcatga tgatgccgcc ggggggtggc 2214121 tcgtgaggat cggcctgacc ctggtgatga tcgcggccgt ggtagcgagc tgcggctggc 2214181 gcgggctgaa ttcgctgccg ctgcccggca cgcagggcaa cggcccgggg tccttcgcgg 2214241 tccaggcgca gctgccggat gtcaacaaca tccagccgaa ctcgcgggtg cgggttgccg 2214301 acgtgacggt cggccacgtc acgaaaatcg agcgccaagg ctggcacgcg ttggtgacca 2214361 tgcggctgga tggcgacgtc gatttgcccg ccaacgcaac ggccaagatc ggcaccacca 2214421 gcctgctggg ttcctaccac atcgagctgg cgccaccgaa aggcgaagcg cggcaaggca 2214481 agctgcgcga cggttcactc attgcgctgt cacacggtag cgcctaccca agcaccgagc 2214541 agacgctggc agcgctgtcg ctggtgctca acggcggcgg actgggccag gttcaagaca 2214601 tcaccgaggc gttgagcacc gcgtttgccg gccgtgagca cgatctgcgc gggctgattg 2214661 ggcagctgga caccttcacc gcatacctca acaaccagtc cggtgacatc atcgcggcca 2214721 ccgacagcct caaccgcctc gtcggcaagt tcgccgacca gcaacccgtc ttcgatcggg 2214781 ccctggccac catccccgac gcgctcgcgg tgctggccga tgagcgggac acgctcgtcg 2214841 aggctgccga gcagctgagc aagttcagcg ccctgaccgt cgactcggtc aacaagacca 2214901 ccgcgaacct ggtcaccgaa ctgcggcaac tcggaccggt gttggagtcg ctggccaatt 2214961 ccggtccggc gctgacccga tcgctgtccc tgctggccac gttcccgttc ccgaacgaga 2215021 cgttccaaaa tttccagcgc ggcgaatacg ccaacctgac cgcgatcgtc gacctcacgc 2215081 tcagccgcat cgaccagggc ctgttgaccg gcacccgctg ggagtgtcat ctgacccagc 2215141 tcgagctgca gtggggtcgc accattgggc agttccccag cccgtgtacc gcgggctatc 2215201 ggggtacccc gggcaatccg ctgacgatcg cctaccgctg ggatcagggg ccctagatgc 2215261 tgcatctacc gcgccgagtg atcgttcagc tggccgtctt taccgtgatc gcggtgggcg 2215321 tgctggccat cacgttcctg catttcgtga ggctgccggc gatgcttttc ggcgtcggcc 2215381 gctacacggt gacgatggag ctggtcgaag ccggtgggct gtatcgcacc ggcaatgtca 2215441 cctaccgcgg ctttgaggtg ggccgggtgg cagcggtgcg gctcaccgac accggggtgc 2215501 aagcggtgct ggccctgaaa tcgggcatcg atatcccgtc ggacctcaag gccgaggtgc 2215561 acagccacac cgcgatcggc gaaacctacg tcgagttgtt gccgcgcaac gccgcctcgc 2215621 cgccactgaa gaacggcgat gtcattgcgc tggccgacac ctcggtgccg cccgacatca 2215681 acgacctgct cagcgcggcc aacaccgcat tggaggcaat acctcacgag aacctgcaga 2215741 ccgtcatcga cgagtcgtac accgcggtgg ccgggttagg gctcgaactt tcccggctga 2215801 tcaagggctc ggcggaactg gcgatcgatg ctcgcgcgaa tctcgatccg ctggtggcgc 2215861 tgatcgaccg ggcaggaccg gtgctggatt cgcagaccca cacctcggat gcgatcgcgg 2215921 cctgggcggc acagctggcc gcagtcaccg gccaattgca gacacacgac tcggcggtcg 2215981 gcgatctcat cgaccggggc ggtccggcgt tgggggagac gcgccaactg ctcgagcggc 2216041 tacaacccac cgtgcccatc ctgctggcca acctggtcag cgtcggccag gtcgcactca 2216101 cctatcacaa cgacatcgaa cagctgctgg tggtgttccc catggccatc gccgccgaac 2216161 aggccggcat cctggccaac ctcaacacca agcaggccta ccggggccag tatctgagct 2216221 tcaacctcaa cctgaacctg ccgccgccgt gcaccaccgg ctttctgccg gcccagcagc 2216281 ggcgcattcc cacgttcgag gactacccgg atcgcccggc cggtgatctg tactgccggg 2216341 tgccccagga ttcgccgttt aacgtgcgcg gcgcccgcaa catcccctgt gaaaccgtgc 2216401 cgggcaagcg cgcacccacc gtgaagttat gcgagagcga cgcgccatac ctgccgctga 2216461 acgacggcta caactggaag ggcgacccca acgccacggt gccgggtttg gggtccggcc 2216521 aggacatccc gcagacatgg caaacgatgc tgctgccgcc gggcagctga cggtgatgga 2216581 gggaggacac gatgtcggta gcagtggatt ccgacgccga ggatgacgcc gtatcggaga 2216641 tcgctgaggc agccggcgtg tcgccggccc cagccaaacc atccatgtcg gcgccgcggc 2216701 gcatgctgct gttcggcctg gtcgtcgtcg tcgctttggc ggtgctgttg tgttgctggg 2216761 gatttcgcgt ccagcgggca cgccatgcgc aggaccagcg tggtcacttc ctgcaagcgg 2216821 cccggcagtg cgcgctgaac ctaacgacca tcgactggcg caacgccgag gcggatgtgc 2216881 gccgcattct ggacggcgcc acaggcgagt tttacaacga cttcgcccag cggtcccagc 2216941 ccttcgtcga agtactgagg cacgcaaagg ccagcacggt cggcacgatc accgaggccg 2217001 ggctgcagac gcagaccgcc gacacggccc aggcgctggt ggcggtgtcc gtgcaaacgt 2217061 cgaatgccgg cgaagccgac ccggttccac gagcgtggcg aatgcgcatc accgtgcagc 2217121 gggtcggcga ccgggtcaag gtgtccgacg tcgggttcgt gccgtgagct ggtcgcgggt 2217181 gatcgcctac gggctgctgc ccgggctggc gttggcgctg acgtgtggcg cgggcttgct 2217241 gaaatggcag gacggcgccg tccgcgacgc cgcggttgcc cgtgcggaat ccgtgcgggc 2217301 cgcgaccgac ggcaccaccg cgctgctgtc ttaccggccc gacaccgtgc agcatgacct 2217361 cgagagcgcg cgaagcaggc tcacgggcac gttcctcgac gcctacacac agctgaccca 2217421 cgacgtggtg atccccggcg cacagcagaa gcagatctcg gccgtggcca ccgtcgcggc 2217481 cgcggcgtcg gtgtcgactt ccgccgaccg cgccgtcgtc ctgctgttcg taaaccagac 2217541 catcaccgtc ggcaaggacg cgccgaccac cgccgcttcc agcgttcggg tgaccctcga 2217601 caacatcaac gggcgttggc tgatctcgca attcgaaccg atctgacggg gggcaccagt 2217661 gcagcgccaa tcattgatgc cccagcagac ccttgccgcc ggcgttttcg tgggtgcgct 2217721 gctatgcggt gtcgtgacgg cggcggtgcc accacacgca cgcgccgacg tggtcgccta 2217781 tctggtcaac gtgacggtac gcccgggcta caacttcgcc aacgccgacg ccgcgttgag 2217841 ttacggacat ggcctctgcg agaaggtgtc tcggggccgc ccttacgcac agatcatcgc 2217901 cgacgtcaag gctgatttcg acacccgcga ccaataccag gcctcgtatc tgctcagcca 2217961 ggctgtcaac gaactctgcc ccgcgctgat ctggcagttg cgaaactccg cagtcgacaa 2218021 tcggcgctcg ggctgaggta aggggactga catgtcgcgt cgagcatcgg ccacgtgtgc 2218081 cttgtccgcg accaccgccg tcgccataat ggctgctccc gccgcacggg ccgacgacaa 2218141 gcggctcaac gacggcgtgg tcgccaacgt ctacaccgtt caacgtcagg ccggctgcac 2218201 caacgacgtc acgatcaacc cgcaactaca attggccgcc caatggcaca ccctcgatct 2218261 gctgaacaac cggcacctca acgacgacac cggttctgac ggatccacac cgcaagaccg 2218321 cgcgcatgcc gccggcttcc gcgggaaagt cgctgaaacc gtggcgatca atcccgccgt 2218381 agcgatcagc ggcatcgagt tgataaacca gtggtactac aaccccgcgt ttttcgcgat 2218441 catgtccgac tgcgccaaca cccagatcgg ggtgtggtca gaaaacagcc cggatcgcac 2218501 cgtcgtggtg gccgtttacg gacagcccga tcgaccttcc gcgatgccgc ccaggggagc 2218561 ggtaaccgga ccgccgtccc cggtggccgc gcaagagaac gttcctatcg accccagccc 2218621 cgactacgac gccagcgacg agatcgaata cggcatcaac tggctgccat ggatcctgcg 2218681 cggcgtgtac ccgccgcccg caatgccgcc gcagtaggcg gtcgctagcg caccgctgag 2218741 ttccgcggct gccagatctg ggccgggcac cggagattaa ccgcgtggga gaccggcagt 2218801 tccagcagcg catctgaggc gtcttcgatc gccggagccc taatcactgc gtgcggcggg 2218861 ccgcgttcga cccgcgcggg tcgataaggt cacggaaccg ttctgccggg tagactgccg 2218921 cacccaagtc tcggacccgg tcggtcaacg ctttgtccga tgtcaccaca cgaatctctt 2218981 gtggctgggc gccggatcgg accagccgga cgatctcgtc gtcggccgag ttggcggccg 2219041 ccttgggcgc atgcgccact tcgaccaccg atgacgggat ggcggtcgac ggcggccgct 2219101 cgaacaccac cgtcacgtcg tcgccccgag ccttggtgat ggcccacccc tcgagccttt 2219161 ccaccagcat caccatcgcg cgatggcggt cgcgccacca accatccgga cgacttccga 2219221 tcacgttcat accgtcgaca atccaccgca cacctcacgg tacgacggcg ccacctcacc 2219281 gcgtgtgtcg acgccggcta tgcgtttgcc gcactaccac catctgcgct ttcggtgctt 2219341 cttcagctct tgctggaact tctggtaatg ctccagcgcg aatcgctctt ccaaagcccc 2219401 aagggcgtta atgacctcgg gatctttgac cccaggggtc gatggccaat ctcaggttgg 2219461 taaatcgggt gctcagatcg gccctccgga ccaggttgtc gcctgggcag atgtgcgctc 2219521 gctaaccgcc aactcacttt caaactacgc tgcgagttgt gagcgtaatg tcagtgatct 2219581 gacggcaaag gtcacggatt tcgtcgagca gatggacggt atttcgcgaa aagcggttcg 2219641 acctactggc tcctggtgtg tggcctccca gggtgctggg ctgcggtttc gccaaccaac 2219701 ctgctggtcg gcgcgccgta ttctgaagac cggaccaacg aggggaccga gccatgtctc 2219761 agacacccgc tacaacccgc aaaacgtttc ccgagatcag ctcaagagcg tgggagcacc 2219821 ccgccgaccg gaccgccctt tccgcgctgc gccggctcaa aggcttcgac cagatcttga 2219881 agctgatgtc ggggatgttg cgggaacggc agcaccggct gctgtacctg gccagcgcgg 2219941 cacgggtcgg gccgcggcag ttcgccgacc tcgacgcgct gctggacgaa tgcgtggatg 2220001 tgctggacgc gtcggcgaaa cccgaactct acgtgatgca gtcaccaatc gcggatgcct 2220061 tcaccatcgg catgggcaag ccattcaccg tgatcacctc ggggctgtac gacctggtga 2220121 cacacgacga gatgcggttc gtgatgggcc acgagctcgg ccacgcactg tccggccacg 2220181 cggtgtaccg cacgatgatg atgcatctgc tgcggttggc ccggtcattc ggcgtcttgc 2220241 cggttggcgg ctgggcgctg cgcgcaatcg tggctgcgct gctggaatgg cagcgcaaat 2220301 cggagctgtc cggcgatcgc gctgggttgc tgtgcgcgca ggatttggac accgcgctca 2220361 gggtggagat gaagctcgct ggcggctgcc ggctggacaa gctggactcg gaggccttct 2220421 tggctcaggc ccgggaatac gagacatccg gcgatatgcg cgacggggtg ctcaagctgc 2220481 tcaacctgga gctgcagacc catccgttct ctgtgctgcg ggctgccgcc ttgactcact 2220541 gggtggacac cggcggctat gccaaggtga tagccggcga gtacccgcgt cgggccgacg 2220601 acggcaacgc caaatttgca gacgaccttg gcgcggccgc ccggtactac cgggacggct 2220661 tcgaccagtc caacgacccg ctgatcaaag gtatccgcga cggattcggt ggcatcgtcg 2220721 agggcgtggg acgggcagcc tcgaacgcgg ccgattcatt gggccgcaag atcaccgagt 2220781 ggcggcagcc ctcgaagtga cggcccctct gctacgtagc taagcacgcg cgaccggcgg 2220841 gctggggagc ccggtcagcg gtctcatagc attgcgaaca cgggacgtcg agaggggaag 2220901 agctgccatg ggtgaggcga acatccgcga gcaggcgatc gccacgatgc cacggggtgg 2220961 ccccgacgcg tcttggctgg atcgtcgatt ccagaccgac gcactggagt acctcgaccg 2221021 cgacgatgtg cccgatgagg tcaaacagaa gatcatcggg gtgctcgacc gggtgggcac 2221081 cctgaccaac ctgcacgaga agtacgcccg gatagccctg aaacttgttt ctgacattcc 2221141 caacccgcga atcctggaac ttggtgcggg ccatggcaag ctctcagcga aaatcctcga 2221201 gctacacccg acagcgacgg tgacgatcag cgatctagat cccacctcgg tggccaacat 2221261 cgccgcggga gagctgggaa cacatccgcg agcacgcacc caagtgatcg acgccaccgc 2221321 aatcgacggc cacgaccaca gctatgacct ggcggtcttc gcgctggcat ttcaccacct 2221381 gccgcctacg gtcgcctgca aagcgatcgc cgaggccacc cgggtgggga agcgctttct 2221441 gatcatcgac ctcaaacggc agaaaccgct gtcgttcacg ctctcttcgg tgctgctact 2221501 gccgctccac ctactgctgc tgccatggtc gtcgatgcgc tcgagcatgc acgacggctt 2221561 tatcagcgca ctacgtgcct acagtccctc ggcgttgcag acgcttgccc gcgccgccga 2221621 tccgggaatg caggttgaaa tcttgcccgc accgaccagg ctattcccgc catcgctcgc 2221681 cgttgtgttc tcccgttcga gctcagcgcc aacggaatct agcgagtgct cggccgatcg 2221741 ccaacccggc gaatgattcg gtagtagtgc agataagcca tcgccggtac cacgacgaac 2221801 gtgatcacga tcaaagcaat cgagaagtag ttcggaccac cccgcactag aaagatgcag 2221861 cggtagtcgt aggacactgc cagcccaacc gagaccacga tcgcaacaag cggtaacacc 2221921 ttgtcggtga acgcatttcg ccgcacagca gcatgttcta ctgcctgaga cctcgccaat 2221981 gcgatgagag cgatcggcac gatgatgaac tggacgaatc gggcgatcac cgccaggccg 2222041 gtcaggtgca ggttgtcgaa ccgcagcgcc aacgggaatg cgagcgccaa cgacgccgta 2222101 attgcgaagg agaccatcgg cacgtcgtat tggttcttgc gtgacaagcg tgtcggcaga 2222161 accccgctgt ccgctaacgc ggtccaaagc cgcggtgcac cgaacgaggc cgcgacattg 2222221 atgccgaaca tcgatatcag ggctccgacg acgatgatcg ttcggaaggt agcgtttccg 2222281 atggccgcgg ccagtttcac ggtgtcgtcc gacgcggcga tcttgttcga tccgagcagc 2222341 atcgctaccg ttagggtgag caagtagatc gcgccaaccg agaagatcgc gatcggtata 2222401 gctctcggca ggttccggtc cggcgcgtcc atttcttcgg cggcgttcgc gatcgattcg 2222461 aaaccggtga atgcgtacaa cgcgacaatc gtggccagcg ccatactcga gaacgtgccc 2222521 ttgccaattt cggcgacgcc aagcaacgag tacggggtcg cgctgtatgc cgaccacgcc 2222581 gttgcgtagt tgttcacgtg ctgggtggtg atgatccaca gcccgccgac aatgaatgcc 2222641 gagagcgcga atgccttgcc taccgttgac gttccgttgg cccacttgat cgcccggttg 2222701 ccgaagaggt tgatggccaa cagcacgccg ataaagccga gaaacgtcag cgtcttcaca 2222761 ctgaacagtt gctcggcgtc ggcccaggcc ttgtcgggga aggccactcg caacagcgtc 2222821 gagacgaaaa aagaagccaa caccccccaa gcgatggacg cggtaatggc gtgggtgaca 2222881 ccgacataga tgccgatccg gcgcccaaat gcggccgttg tgtaggcgta ggaggcaccg 2222941 tttgttctga cgtaccttgc cgccgtcgcg aagacgatcg ccacgacacc cgcgaaaatg 2223001 ccagctaaaa cataggccat cggcgcgaag ggtcctgcga gcccgatcac ctcacctgga 2223061 gttaggaaga taccggcgcc gattatcgag ttgatcccga gcatgacgac gctgcagaaa 2223121 cccagcttgt ggatcgcata tcctctcgtc cgcgggccga ccaccgcacc aaggctgtct 2223181 agcagggaat cctctaacgc accatagatt ctctagcgac gattcttgag ctcccggcct 2223241 gtcgatgccg gcgctgcagg tgagtcaccg cagtgggcgc accgaacact catttccgcc 2223301 gccccaaatc cgcgcagtga ccaccgcgcg gtcctcgcga gtctaggcca gcatcgagtc 2223361 gatcgcggaa cgtgggacca atacctgggt tgggccggct gcttcgggca gcaactcccc 2223421 cgggttgaag aagaaaatca ccccgtcgtt cgtgactgcg aagttctgat aattcaccgg 2223481 gtccaagccg gcattcggcg ctatcgatac ctgttgtccg gtctgcttgc tcagttcacc 2223541 ttgcacaatg gggaagacga ctggcagcgg atcggtgtca gcctgccaca gcgtgtcata 2223601 ggtgattggc ttgcgatagg cctggtccca atcgaaggcc ttgtacgtgg tcgttgggtg 2223661 cgtgccgccg gcgttctggt agaccttgag caccacggcc tgcgtaccac gcggcggtat 2223721 cgcggactgg tatgtggccg aggtgatatt caattcgtag ggggcttcgc gtggagtgga 2223781 cgatgtggcc gcgctgagga acttgtcgcg cgtctgggcg atgtaatttt ccagcgactt 2223841 ctggtcgggg tagtaactgg gcaggctgat gttgatgttg taggccgggt cggacatttg 2223901 aatctggcac gcctggccgg tatcggtgcc tttcaactcc tcgcagtagg tcttgggcgc 2223961 ggccgtggcc acacccgaac aacagagcaa aacgacagcc gtgaccagca tgaagatctt 2224021 gatgcgcacg tcgaaattcc tccgggagta gtttgcagca ccgccggccg caggcgggag 2224081 attggattgc cgcgatatct gagtcgacga caaacatagg gcatcgcgct gctgacgacg 2224141 atgcctgacc agactcaagc tagcagatcg atcgggcccg gtgtcgcgtg gtgctcgacg 2224201 cccccgacgc gctgggcggt tagaagtccc agtcggtgtc ggtggtgggt tggtgggtgc 2224261 ccattacgta tgagcttccg gagccggaga aaaagtcgtg gttctcccct gcaccggggt 2224321 cgagagctgc gcgcacggcc gggttcacct ggcaggtgtc acgatcgaat gcaggctggt 2224381 atcccaggtt ggctagcgcc ttgttggcgt tgtaacgcat gtagggcaaa acgtcgtcgg 2224441 tccagcccaa ctcgtcgtac aagtcgtgcg catagtcgat ctcgttcgcg tagagcgtgt 2224501 gcagcagctc gcaggtgtat tcgcggtggt cggcccgctc ggcgtcggtc aggtcggcca 2224561 aacctcgttg acatttgtag ccgatgtagt agccgtggac ggcttcatct cggatgatca 2224621 gccggatcag atcggcggtg ttggtgagct taccccgcga cgaccagtac atgggcaggt 2224681 agaagccgga gtagaacagg aaggactcca gcattaccga cgatgctttg cgcttgagcg 2224741 cgtcgtcacc gcggtagtag tcgacgatga tctgcgcttt tcgctgcagg taagggttct 2224801 gttccgacca gtcgaaggca tcgtcgatct gcttggtcga gcacagggtc gagaagatcg 2224861 agctgtagct cttggcgtgc actgactcca tgaacgccat gttggtcagg accgcctctt 2224921 cgtggggggt gaccgcgtcg tcgatcatgg ccactgctcc caccgtcgcc tgcgcggtgt 2224981 cgagcagggt caagccggtg aacacccgga tcgtcgtctg ctgctcggtg gaactcaacg 2225041 tttgccaaga tgccaggtcg ttggagagcg gaatcttttc cggcaaccaa aagttaccgg 2225101 tcaaacgttc ccagacctgc aaatctttag catcgagcaa ccggttccaa ttgattgcgt 2225161 gcacccgctc aacgagcttg ccggtcatcg agggccgtcc tgccttgcca tggtcatgcc 2225221 gctgttggcc ggtgcgtacg ctcctgtggg cgtcaagtcc ggcagtcggt ccttgggcat 2225281 ttcggccgtc ctccttgtca ttgacggtct ttcatggcgt gcaccagcac tgtagcttag 2225341 tgatttcggc tacccatatt ttattcttcg tgtcgctgaa ctcattacaa acagcgatca 2225401 ccgcgcatac ggttacgcga cgcctggcca gtagccgacg acgccgcgga actcaaggtc 2225461 ggtttgcggg aagtcgttgc cgacggccag cagtggttgg tggcccagct gggcggtcgc 2225521 gtacgtcata cagtctccga agttgagagc cgcgcggtgg cgccccttgc cgtatcgcag 2225581 aaaggctcgt tgcgtggcag cggcatgctc ggcggtgaaa gatgacacgc tcaagccgat 2225641 ttcgctgcga agtcgttcga agatcgtgcg cgcaacgggg ccgtgacggg cggtcaagac 2225701 aatcaggcat tcggcgacgg tgggtgcaga catgacgggg ctatgggcgc cggccagggc 2225761 ggccgcgacc agggtggcgt gcggccgctc gccttgaacc agggccacca cggcgcttgt 2225821 gtccacgatc attgcggtgc tcagactccg gttgcggggt cgtagccgag gatttgttcg 2225881 cgctcgagct tggtgatggg ggagcggtcg gcaagcaggg gccagatttc ggtacgcaag 2225941 atgtcgagaa gttgtgcctc acggtcgccg gcgcgcgact ccaaaaacgc cagctgggca 2226001 gacagggcat gccggatggc ggcagtcttg ctggtgtgca gccggtcagc gagttcggcg 2226061 gctagtcggt ctacctcagg gtctttgata ttcagcgcca caggtagatg gtaccagcaa 2226121 atagccacta tctacctaac gcgtgctgtg ccgtgcggta gctactgaaa atccgagatg 2226181 tcaaaggcag cgtctggata cgctgtatgc gcgcagggat ggtgatcgag gcggaggggc 2226241 ggcgtgtcat ttctggtcgt ggttcccgag ttcttgacgt ccgcggcagc ggatgtggag 2226301 aacataggtt ccacactgcg cgcggcgaat gccgcggctg ccgcctcgac caccgcgctt 2226361 gcggccgctg gcgctgatga ggtatcggcg gcggtggcag cgctgtttgc caggttcggt 2226421 caggaatatc aagcggtcag cgcgcaggcg agcgctttcc atcaacagtt cgtgcagacg 2226481 ctgaactcgg cgtcaggatc gtatgcggcc gcggaggcca ccatcgcgtc acagttgcag 2226541 accgcgcagc acgatctgct gggcgcggtc aatgcaccaa ccgaaacgtt gttggggcgt 2226601 ccgctaatcg gcgacggagc acccgggacg gcaacgagtc cgaatggcgg ggcgggtggg 2226661 ctgctgtacg gcaacggcgg caacggttat tccgcgacgg cgtcgggggt cggcggcggg 2226721 gccggcggtt ccgcggggtt gatcggcaat ggcggcgccg ggggagccgg cggacccaac 2226781 gcccccgggg gagccggcgg caacggtggc tggctgctcg gcaacggcgg gatcggcggg 2226841 cccgggggcg cgtcgagcat ccccggcatg agtggtggag ccggcggaac cggcggtgcc 2226901 gcaggacttt tgggctgggg agcgaacggc ggagccggcg gcctcggtga tggagtcggt 2226961 gtcgatcgtg gcacgggcgg cgccggaggc cgcggcggcc tgttgtatgg cggatacggc 2227021 gtcagtgggc caggcggcga cggcagaacc gtcccgctgg agataattca tgtcacagag 2227081 ccgacggtac atgccaacgt caacggcgga ccgacgtcaa ccattctggt cgacaccgga 2227141 tccgctggtc ttgttgtctc gcctgaggat gtcgggggaa tcctgggagt gcttcacatg 2227201 ggcctcccaa ccggattgag catcagcggt tacagcgggg ggctgtacta catcttcgcc 2227261 acgtatacca cgacggtgga cttcgggaat ggcatcgtca ccgcgccgac cgccgttaat 2227321 gtcgtcctct tgtccatccc aacgtccccc ttcgccattt cgacctactt cagcgccttg 2227381 ctggccgatc cgacaacaac tccgttcgaa gcctatttcg gtgccgtcgg cgtggacggc 2227441 gttctgggag ttgggcccaa tgcggtggga ccaggcccca gcattccgac gatggcgtta 2227501 ccgggtgacc tcaaccaggg agtgctcatc gacgcacccg caggtgagct cgtgttcggt 2227561 cccaacccgc tacctgcgcc caacgtcgag gtcgtcggat cgccgatcac caccctgtac 2227621 gtaaagatcg atggtgggac tcccataccc gtcccctcga tcatcgattc cggtggggta 2227681 acgggaacca tcccgtcata tgtcatcgga tccggaaccc tgccggcgaa cacaaacatt 2227741 gaggtctaca ccagccccgg cggtgatcgg ctctacgcgt tcaacacaaa cgattaccgc 2227801 ccgaccgtca tttcatccgg cctgatgaat accgggttct tgcccttcag attccagccg 2227861 gtgtacatcg actacagccc cagcggtata gggacaacag tctttgatca tccggcgtga 2227921 tcgagcctgt tcgccgcgaa tgtcgccgcc tggcttgtca tccccgactg aacatacgaa 2227981 acatgcgcca taatattgcc gcctccggtg catattggat cgtcgggagc acacaagttt 2228041 atggtcttag agctatacag cggaccgatt gtcggcaacg acccgccgcc ccacaacatg 2228101 ctggagaaac cactggatgg ctcgccgaaa agggcgacag cggcgacatg atctgccacc 2228161 gcgggcggca tcgccgaggt ggacaaatcg atgaccgtcg caccctgcga atagccacca 2228221 agcacaatcc tggtgttcgg gcagctggcg acggtgcgct ggatgtgggc gctcgcatca 2228281 tcggaaccgt ttgacgcgct cgcgcggtag tcgtcgcttg ctgggtagtt caccgcgtag 2228341 accccaatcg accgcccgcc aacttgcgag gtaagcgagt cgacgaacgc ctcaccgacg 2228401 tcgccaagac cagaagcctg atgcgtgccg cgagcgaaaa cgaccgcgat gtccgaacac 2228461 ggatccgcat gcgcggcacg accgccggcg ggtgcgctca ccagcgccaa ggtcgtcgca 2228521 accacgacac caacgatgcg aacaaggctg cgtggagtca tctgcacatg ctgacatact 2228581 gccggcgacc gaggtggcgg tgggccgctg agacatgacg tgcctcacgt cgtcggcgcc 2228641 cacgcagccc caggtcagaa cggtagcctt aggcgatgac cgactctgtg gtcgtccgcg 2228701 tcaagcccgg cagtcacaaa ggacccctgg tcgaggtcgg tcccaacggt gagctgatta 2228761 tctacgtccg cgagccggcg attgatggca aggccaacga tgcggtcacc cggctgctcg 2228821 cagctcacct tcaattgcca aagagccgag tcaaattggt gtccggagcg acgtcgcggt 2228881 tcaagcgttt ccgtctgagt cgttaagttc aacctgtttg aggaagcggg tccagcaagg 2228941 ccgggacatc gagaccaagc cgcgctaaca caacaacatg ctggcgtcgg tcaacccggt 2229001 cggcggcggc gttgctggcc ccggtacaga ccgcttgccg ccgccctcac cgtgtcggta 2229061 attcgcgcga tgatcggact gtccagtttc cagcattgcc aatagagagg gacgtcgagg 2229121 tgtatgtcgc agacccgtac gaacgatcca tcggcaagcg gagatgctgc cagcttctcg 2229181 gggaacatgc cccatcccag cccggcgcgc gctgcggcgg tgaagccctc tgtggtcggg 2229241 acaaagtgcg tcggtctggt gatggcgcga cgaaaggcct tacgcaccaa catgtcctgc 2229301 agcccatcgt cacgattcca cgccagtgac ggagctttag ccgccgcggc ggcagtgaac 2229361 ccgtcggata gatggcgctg gacgaatggc ctgctggcca ctggtaggta gcgcatttca 2229421 cccagcgggt gcacccggca gcccggcacc gggttccgct cggtggtcac cgcgcccatc 2229481 gccacaccct cccgtagcag ccgcgcggaa tggtcctggt cctcgatccg aacgtcgagc 2229541 aggacgtcgc cgagaccgtc gaacacggcc gaaaaccatg tcgccatgga atcggcgttt 2229601 accgcaatgg tgatccgcgt gcgtttcagc gacgcgttgc cacccatttc agcgagcgcc 2229661 tcggactcga gcaacgctgt ttgcgcggcc aaccgcaaca gcgggatacc tgcggtcgtc 2229721 gcccgacatg gcttttccct gaccaccagc acctggccga cctgctgctc caacgacttg 2229781 atgcgctgac tgacagccga cggggtgaca tgtaggcgct ccgcggccgc atcgaagctg 2229841 cccagttcga ccacggcagc caatgcggcc agctgtggac cgtcaagctg cggatccacc 2229901 atctcaggtg tagaccatct gcggagcgtc gcactgcaca ttaataatgc taatgtaaat 2229961 gaagaattat tagctatact gacccataca aactgcctag tgtcgattgc gtgaactcac 2230021 cactggtcgt cggcttcctg gcctgcttca cgctgatcgc cgcgattggc gcgcagaacg 2230081 cattcgtgct gcggcaggga atccagcgtg agcacgtgct gccggtggtg gcgctgtgca 2230141 cggtgtccga catcgtgctg atcgccgccg gtatcgcggg gttcggcgca ttgatcggcg 2230201 cacatccgcg tgcgctcaat gtcgtcaagt ttggcggcgc cgccttccta atcggctacg 2230261 ggctacttgc ggcccggcgg gcgtggcgac ctgttgcgct gatcccatct ggcgccacgc 2230321 cggttcgctt agccgaggtc ctggtgacct gtgcggcatt cacgttcctc aacccacacg 2230381 tctacctcga caccgtcgtg ttgctaggcg cgctggccaa cgagcacagc gaccagcgct 2230441 ggctgttcgg cctcggcgcg gtcacagcca gtgcggtatg gttcgccacc ctcgggttcg 2230501 gagccggccg gttgcgcggg ctgttcacca accccggctc gtggagaatc ctcgacggcc 2230561 tgatcgcggt catgatggtt gcgctgggaa tctcgctgac cgtgacctag tacagcacgt 2230621 gtgcacacgc gggttggacc acgtgatcgt cgatgggcac ataccgttcg gcaggagggc 2230681 gcgcggtcag tctgcacaac tcagtcacca gctgacacgc cgacggcggc ctcgcccggg 2230741 cgtgtcggcg ccaccagtgc acattcggcg tgacgcggcc ctacggatcg tgttggagct 2230801 gtagcccgtt gataccggtc gcgaacggtg aacggcgcta atcgggggag tggggtcgag 2230861 gctgtctggc cttccccgtc cgcaagttcg cgttcggccg ggccgatatc tggttcaggg 2230921 tgggtcgagg ccaaatttca tcacggttgc ggttgagcaa agttgctgta gcttgctcgc 2230981 gaggagacgg ccgatatcgc ctcattggca ttagtgttgg ctgtcatggc cggactgaac 2231041 atttacgtga ggcgctggcg gacagcgctt cacgcaaccg tgtcggcatt gatagttgcc 2231101 atcctcggac tcgccatcac cccggtcgct agtgcggcga cggccagggc gacgttgtcg 2231161 gtgacatcga cgtggcagac cggtttcatc gcccgcttca ccatcacaaa ctcgagcacg 2231221 gcgccgctaa ccgattggaa gcttgaattc gacttgccgg caggagaatc cgtcttgcac 2231281 acatggaata gcaccgttgc acgatctggc acgcactacg ttctcagccc agcgaattgg 2231341 aatcgcatca ttgcccccgg tggttcagcc acgggcggcc taagaggcgg gctgaccggt 2231401 tcttactcgc cgccgtcgag ttgtctgctc aacgggcaat atccttgcac ctagacgcga 2231461 ctgcgcactg aggctcgccg actgcaacaa tgcggctact gccaggtggg tctagtgggt 2231521 cgtcacggcc aacgtcatct cggagttgat gcggacggcg ccagagccct ggggctggtg 2231581 atgaccagaa ggttgcctga accgagaaat tggattgatc gcagtgccgg tggcgggcta 2231641 cggtcgggcg cgtgggcatc tacgcagtga cggtacgtcg tgtccgccct cggacggtcg 2231701 cgacgggcat ggggctggca ccggctccat gacgaatggg cagcgcgggt agtcagcgcg 2231761 gccgcagtgc ggcccggtga gctcgtgttt gacatcggcg ccggcgaagg ggcactgacg 2231821 gcgcatctag tgcgagcggg ggcgcgggtg gtcgccgtgg agttgcaccc gcgacgagtc 2231881 ggtgtcctcc gcgagcgatt ccctggcatt accgtggtgc acgcggacgc cgcctcgatc 2231941 cggttgcccg gccggccgtt ccgggttgtg gcgaacccgc cgtacgggat ttcgtcccgc 2232001 ctgctgcgga cgctgctggc acccaacagc gggcttgtcg cggccgatct cgtgctgcag 2232061 cgagccctcg tatgtaaatt cgcttctcgc aacgcgcgaa ggttcaccct gaccgtcggc 2232121 ctcatgctgc cacggcgcgc gttcctgcca ccgccgcatg tggattccgc ggtgctcgtc 2232181 gtccgccgcc ggaagtgcgg tgactggcag gggcggtaaa cccgcggccg ccagtaggtg 2232241 taccaccttt gctagaagtg gcacacttcg ttctatgtcg accactcgtc cgcgctacca 2232301 aataaccgaa accccggagg tagctcaggc attggaccgg gccgcccagc gatggcctgg 2232361 cgagccccgt tccaaattat tgcggcgcct gatcatcgat gctcgacgat ccgcgttccg 2232421 cgggtagcgt cgttgcgccg tacgacgatg gcgagctgct gcgtctcgcc gaactacgcg 2232481 ctagcagcgg gctaaaacta cctgattgct gcgtgccgga tgtggcaatt catcaccagg 2232541 caagcctcgc aacctttgac gacacgctcg ctgccgcagc acgcacaagg agcgtgcccg 2232601 ctagcacaaa cggcgcagct aacccaatac gaccagcttc acttgacata atgtcgctta 2232661 tcggcttata agtgatgcga gttgctcctt acgatgacca tggcacagcg gcatccttct 2232721 ctgcgccaag ctggccagct acgtggctcg aagttcttgg taaagagcag gcgtcagatc 2232781 gacgctttgt cgcagttgta gttggcccgg ccgagttcgc tgttcatacg cggtgacaac 2232841 gaggccgaca ccgcccgccg ccggcacgag gacaccttgc atgtgcaaga accaggccgc 2232901 atgtccgacc gcctggcacc ctgaccagtc gtcgccatag atgtcgtcgt tctcgagccc 2232961 cacggcttcc cgagcttgcg gggttgtcag atcgaggacg gccaggtccg tgacgtcgat 2233021 cgtgtgtagt cggtaggccg cctcgagcat cttctctgcg gtcgttgaag ccgcttgcgc 2233081 cgcccgttcc acctcaacca tgcaggcttg ggcggaatca gcaagataga tcgccggaaa 2233141 gagcagcggc ggattccacc tgcctccgaa tctgcgcgcg ccctcaccgg acaaggcgtc 2233201 acggtgcgcg ccggtatacc ggtagcacgt ttccgaccac tcaattgttc cgcgtgcgtc 2233261 gatacgctgg acgagccctt catcgagggc atcgctcaca cgaacactcc ctccgccatc 2233321 gcgtcgatga gcgccaacac gcgttggtac tcgccgtctc gcacgaggtc ggcaggcttg 2233381 cggtgttcca gtaaccgatt cggcgaaaac atccacacgt tcgcctggtc acgcggcagc 2233441 acttccgcga gggcgtcggc gacataggcc agctcgataa gtcgttgctt gttgaggcgt 2233501 tggggaacca cctgacctgc ggtccatcgc gccacggaac gcggcgaggc atcgacgatg 2233561 tcaccgactt cctcgtaggt caatcccaag cgctcgatcg cacccgacac ggtcgaggcg 2233621 agcacattta ctcccatggg cagcctgtct tccttttgtc tattgatttg tcatgtatta 2233681 tgacacgaac cgaggcgtcg atgcgagagg aacttcacga cgatgggcat tcagtttcgg 2233741 ctcgggccgg gtgatcacaa accggtcgag gacttcctgt cccgcgacca cgccggcacc 2233801 actgcgatca cgctggacac caacgccact cgtcaccagc acgacgctgc cgcagccgca 2233861 gtcgacgcag gcctagatgt ctactgggag ccagcagccg agcgcctcgc cgcgcacccg 2233921 gcttcgggct cgacaagttc cctctgtgaa acgggcagcc ctacgacacg gatgccctga 2233981 cgcgcgacgc ggcggcacgc gccgaactcg tcggcaggac tctcgacaaa cacccgtcga 2234041 tcgtcacgca cgtcacggcc ccacacttct acctcaccaa cgagcgcacc gcacgcctca 2234101 acatcgacct tgccgagcgc acgcgcttgg ccgtcggcta ggcggaccgc atgcgaacgg 2234161 ccttgacccc gagccacgcc cgtaatgaat gcaaccttgc cctcaagcct gcccacaaca 2234221 ccacctccgg cgagtagttc ccccggcggg ggggcttaca ccaagcagga acgtcaccgt 2234281 gacgaattgt cgcgtggcgc agtgtcaaag gtccagtacg cgacgaagtc ctcggtcaac 2234341 ctcgtgcatc aagctcgctg gcacctcccc aactcggtcg gtgaggtcag tcttgttgag 2234401 cgtgacaatc gccgtgacgt tgacgaccga gtcacgtggc agtcgcgttg tggtcgcggg 2234461 caagaacacg ttgccgggca ttgccgccag cgccgtattg gacgtgatca ccgctgcgat 2234521 cacagtggca aggcgacttg cgttgtacgg atctgactgg attacgagca ccgggcggcg 2234581 cttcgccggc tgactgcctg atggcggccc gaggtcagcc cagtagatct cggcacgact 2234641 aatcaccact catcgtccat ggtttctagc acgcggtatg cgttggccac ggcgagggcc 2234701 tccgcttcgt cggtgccatg gatgctctct agagccctgt cgatctggcc cgtgagcaat 2234761 tgggcgtcca gctcgtgcag gtagcgctgc gcagccttcg tgaagaactc ggaccgactc 2234821 atgccgagct cactcgcacg ccgcgatacc cgatcgaacg tctcatccgg cagagaaata 2234881 gctgtcttca tacagatagt ataaccgggt ataacttcca gaagacggcg gctgtttcgt 2234941 cacagtgacg ctattgctgg tccaaacaca ctccacgatt ccgcgcgtcg ctaccccggg 2235001 atagtccgat caggtgtctt gggtggcccg gcaagtggtt tgatgcgtcc ggcccgcacg 2235061 ccgttggcga tgacgatgac ctcggtgaac tcgtgcacaa gcacgaccgc ggccagtccg 2235121 aggatcccga acaacgccag cggcatcagc acggtgatga tacttaggga caatccgacg 2235181 ttttgcacca tgatctgccg cgagcgccgg gcatggtcta gggcttgggg cagatgccgc 2235241 aggtcttggc ccatcagggc gacgtcggcg gtttcgatgg cgacgtcggt tcccatggcg 2235301 cccatcgcga ttcccaggtc ggcggcggcc agggccggag cgtcgttgac tccgtcgccg 2235361 accatcgcgg tgggttgccg agcccgcagc tgtgcgacca gatgagcctt gtcctcgggc 2235421 cgcaattcgg catgtacctg ctcgatgccg gcttgggctg ccagggcggc agcggtggca 2235481 tggttgtcgc cggtgagcat cgtcacctgg tagccgccgg tgcgcagccc ggccaccacc 2235541 tcggcggctt ccgggcgtag ttcgtcgcgc acggcgatgg caccaagcag ctgctggtcg 2235601 cgttcgacga gaaccgctgt ggcgccggct tgttgcatgc acgccacatg atctgcgagc 2235661 tcggcggcat cgagccagcc gggtcgcccc agtcgcacca cccgcccgtc gaggcggcct 2235721 atcagcccgg cgcccgggac ggcttgcacg tcgctggcgg cggtcgtcgc ttgggtcgcg 2235781 gcaagcacgg ccacagccag gggatgttcg ctgcgggctt ccagggcggc tgccaccgcc 2235841 aacacttcct cgcgggtagc gccgtttgtg gtggcgacgt cgatgacgac gggccggttg 2235901 gcggttaacg taccggtttt gtccagggct accgcgcgga tggtgcccag ggtttccagc 2235961 gcggcgccgc ccttgatgag cacgccgagt ctggaggcgg cgccgatgga cgcgaccacg 2236021 gtgaccggaa cggcgatggc cagcgcgcac ggggcggcgg cgactaatac cacgagcgcg 2236081 cgttcgatcc agaccagcgg attacccaag acgctgccgg tcccggcgat cagcgccgcg 2236141 gcgatcatga tgctgggcac caacggtcgc gcgatacagt cggctagccg ctgactagca 2236201 ccttttcgga cctgttcggc ctccacgatg tgcacgatgc gcgccagcga gttgttggcc 2236261 gcggtagcgg tgacccccac ctgcagcacg cccaagccgt tgatcgaccc ggcgaacact 2236321 tcgtcaccgg gtccaacctc gaccggcacc gattcgccgg tgatcgcgga gacatccagg 2236381 gcggtgcgcc cggcacgaat gatgccgtcg gtggccaggc gttcgcccgg tttaacgatc 2236441 atctggtcac cgacgtgcaa ttcggttgag gccacgatgg tttcggtgcc ctcccgcaga 2236501 actgtggcct gatccggcac cagcgacagc agggcgcgca ggccacggcg agtgcgcgcc 2236561 gtcgcgtatt cctccaagcc ttcgctgatc gagaacagaa acgccagcgt agcggcctca 2236621 cccagctcgc caagtgcgac agcgcccagc gcggcgatgg tcatcagggt gcctacgccg 2236681 acgcggcctt cggccagtcg tttgaggctg gagggcacga atgtcgaggc cccaaccgcc 2236741 agcgcaaggg ccttcagtcc cagtacgacc ggccacagcg gataagccca tgcggcaact 2236801 agcgacgcgg tcagcaacac tccggagaat gcggctcgcc gcagtttggc gacttgccag 2236861 agctgctccg gctcgcggtc ctcgttgtcc tcgccgtcgc agcaggcatc gctcgtctcc 2236921 cccgatggct gcgcggccac gtcacgccga actcctgata gtgttcgcgt gctccagtcg 2236981 atgattttct gcactacccc ggccttgcgg ttactggccg agcgcgaagc atacgcgggc 2237041 accgccgccg cagggacggt ctcggcatcg atgattgccg acaggatggc agcggtgtcg 2237101 cagattgcgc gtgaatacca gatcacaatg gatgccgtcc gcggataggc atgcacggcc 2237161 tgcacaccgg ccaccttgcc gacggtgtcc tcgatcgcaa cggcccgtcc cgcgtcgaac 2237221 tgaaacccgg tggcctgcac acgcatccgc ccggctgcat cggatacaac ggtcagctgg 2237281 acctcggcgt caactacagt cgtcactcgt cgaccctggc gccagcgggc aggggcgcct 2237341 cctcaccgat gcgcccgcga gcctcggcaa cgacgtcggc gactgtcagc cgggccgact 2237401 cggccgccgc ctccgcgcgc cgggttccgc gcaggcccca ctccatcacg gtcaccgacg 2237461 cccggcgaat gggcgccgta cccagcgctt tgcgcagcgt ttcgtaggcg ctcaccccga 2237521 ccagtccggt gagcaccgcc ccggccgcct taaccaatag ctcatgcgta accacggtca 2237581 gttctccttt gctttgtcct gtaaccacaa gtcgtgtcgt ctgctgctca gctacctgtc 2237641 atctcgaccg cctccccgga cgcggcgcgc tcggcgacac agggttggtc ggtatccacc 2237701 gcgagaacga cctggaccaa ctcgcccaag gctcgcgcca ggtgactgtc ggccagcgca 2237761 taccgaacct gccggccctc ataggttgcg actaccagcc cgcagccccg caaacacgac 2237821 agatggttgg acacattcga tcgggtcaac ccgaggtgcg cagctagctg gccgggatag 2237881 caaacgccat ccagcaacgc caccagaatc cggcaccgcg tcggatcagc cagagcccgg 2237941 ccgagtcgag ccagggccga ttcccgcatc tcacacgtca gcatagatca aatagtacac 2238001 catatactgg tataacagca agagctgaat tgtacatcca tagcagatat gatcggcgcg 2238061 cgtcacaagc ttccggccgc agagccgcca actcacgata tcgttaaccg atatcccgag 2238121 ccgatagctg gcgggctcgg gtggtggcca gcggcgctgc gacgaaaggt gtgaccgtca 2238181 tgaaacagac accaccggcg gccgtcggcc gtcgtcacct gctcgagatc tcagcatccg 2238241 cagccggtgt gatcgcgctt tcggcgtgta gtgggtcgcc gcccgagccc ggcaaaggcc 2238301 ggcccgacac aaccccggaa caggaagtcc cggtcaccgc gcccgaggac ttgatgcgcg 2238361 aacacggagt gctcaaacgc atcctgctga tctatcgcga ggggatccgc cgcctccaag 2238421 ccgatgatca gagtcccgct ccagcactga acgaaagcgc gcagatcatt cgacgcttca 2238481 tcgaggacta ccacggacag ctggaagagc aatacgtctt ccccaagctg gaacaagccg 2238541 gcaagctcac ggacatcacc tcggtcttgc gcacccagca tcagcgcggc cgggtgctca 2238601 cggaccgggt actcgccgcc accactgcag cggctgcatt cgatcagcct gcgcgagaca 2238661 ccctggccca agacatggca gcgtacatcc gaatgtttga gccgcatgag gcgcgcgagg 2238721 acacggtcgt tttcccggcg ttgcgcgacg tgatgtccgc tgtcgagttt cgcgacatgg 2238781 ccgagacctt tgaagacgag gagcaccggc gctttggcga ggccggtttt caatcggtgg 2238841 tcgacaaggt cgccgatatc gaaaaaagcc ttggcatcta cgacctgagc cagttcaccc 2238901 ccagctaaag acactaatgc ccttgggtta gggaccatcg cctcctgacg cgatcgcgac 2238961 agctggctaa cgtcggtagt acacccatgc agaggggacg ccaatgtcag cccaacaaac 2239021 gaacctcgga atcgtggtcg gtgtggatgg ttcaccctgc tcgcatacgg cagtcgaatg 2239081 ggccgcgcgc gatgcgcaga tgcgcaacgt tgcgctccgc gtggtgcagg tcgtgccccc 2239141 ggtaataacc gccccggaag ggtgggcatt tgagtattcg cggtttcaag aagcccaaaa 2239201 gcgcgaaatc gtcgaacact cgtacctggt cgcccaagcg caccaaatcg tcgaacaggc 2239261 ccacaaggtc gccctcgagg catcctcctc aggtcgcgcc gcgcaaatca ccggcgaagt 2239321 gctgcacggc cagatagtgc ccacgctggc caacatctcc aggcaggtcg cgatggtcgt 2239381 gctgggctac cgaggtcagg gcgccgtagc cggcgccttg ctgggatcgg tcagctcaag 2239441 cctggttcgc cacgctcatg gccctgtcgc cgtaataccc gaggagccgc gaccggcgcg 2239501 cccgccgcac gcgccggttg tggtgggcat cgacggctcg cccacctcgg gattggcggc 2239561 cgagatcgcc ttcgacgagg catcgcgccg cggcgtggac ttggtggcgc tgcacgcgtg 2239621 gagcgacatg ggccccctcg actttcctag gctcaattgg gcgccgatcg aatggagaaa 2239681 cctcgaagac gagcaggaga aaatgctcgc ccggcgtctg agcggatggc aagaccggta 2239741 tcccgatgtc gtcgtgcaca aagtcgtggt gtgcgatcga ccggcacccc gcctgctcga 2239801 attggcacaa accgctcagc ttgtggtggt tggcagccac ggccgcgggg ggttccccgg 2239861 catgcatctc ggctcagtca gcagagcggt ggtcaattcc ggtcaggctc cggttatcgt 2239921 cgcccgaatc ccccaagatc cggcagtgcc ggcctgaggg cctgtgcgat ctgctcgggt 2239981 ggtgcccacc cgcgcggaaa gccccgtccg aaccgtgatt gggcaacgtc gggccgggcc 2240041 agcagcgctg gaccgtaggt ccctgcagtg gatgacttac ggccctgatc cacaccggcg 2240101 accgttaggc agggttgagc caaccgtcgg ttgagcgtct ggctgcgagg tgaggtgatt 2240161 gtcggcgtca gtgtctgcca cgacggctca tcatggcttg ccagcacatg aagtggtgct 2240221 gctgctggag agcgatccat atcacgggct gtccgacggc gaggccgccc aacgactaga 2240281 acgcttcggg cccaacacct tggcggtggt aacgcgcgct agcttgctgg cccgcatcct 2240341 gcggcagttt catcacccgc tgatctacgt tctgctcgtt gccgggacga tcaccgccgg 2240401 tcttaaggaa ttcgttgacg ccgcagtgat cttcggtgtg gtggtgatca atgcgatcgt 2240461 gggtttcatt caagaatcca aggcagaggc cgcactgcag ggcctgcgct ccatggtgca 2240521 cacccacgcc aaggtggtgc gcgagggtca cgagcacaca atgccatccg aagagctggt 2240581 tcccggtgac cttgtgctgt tagcggccgg tgacaaggtt cccgccgatt tgcggctggt 2240641 gcgacagacc ggattgagcg tgaacgagtc agcacttacc ggcgagtcga cgccggttca 2240701 caaggacgag gtggcgttgc cggagggcac accggtcgct gatcgtcgca atatcgcgta 2240761 ttccggcaca ttggtaaccg cgggccatgg cgccgggatc gtcgtcgcga ccggcgccga 2240821 aaccgaactc ggtgagattc atcggctcgt tggggccgcc gaggttgtcg ccacaccgct 2240881 gaccgcgaag ctggcgtggt tcagcaagtt tctgaccatc gccatcctgg gtctggcagc 2240941 gctcacgttc ggcgtgggtt tgctgcgccg gcaagatgcc gtcgaaacgt tcaccgctgc 2241001 gatcgcgctg gcggtcgggg caattcccga aggtctgccc accgccgtga ccatcacctt 2241061 ggccatcggc atggcccgga tggccaagcg ccgcgcggtc attcgacgtc tacccgcggt 2241121 ggaaacgctg ggcagcacca cggtcatctg cgccgacaag accggaacgc tgaccgagaa 2241181 tcagatgacg gtccagtcga tctggacacc ccacggtgag atccgggcga ccggaacggg 2241241 ctatgcaccc gacgtcctcc tgtgcgacac cgacgacgcg ccggttccgg tgaatgccaa 2241301 tgcggccctt cgctggtcgc tgctggccgg tgcctgcagc aacgacgccg cactggttcg 2241361 cgacggcaca cgctggcaga tcgtcggcga tcccaccgag ggcgcgatgc tcgtcgtggc 2241421 cgccaaggcc ggcttcaacc cggagcggct ggcgacaact ctgccgcaag tggcagccat 2241481 accgttcagt tccgagcggc aatacatggc caccctgcat cgcgacggga cggatcatgt 2241541 ggtgctggcc aagggtgctg tggagcgcat gctcgacctg tgcggcaccg agatgggcgc 2241601 cgacggcgca ttgcggccgc tggaccgcgc caccgtgttg cgtgccaccg aaatgttgac 2241661 ttcccggggg ttgcgggtgc tggcaaccgg gatgggtgcc ggcgccggca ctcccgacga 2241721 cttcgacgaa aacgtgatac caggttcgct ggcgctgacc ggcctgcaag cgatgagcga 2241781 tccaccacga gcggccgcgg catcggcggt ggcggcctgc cacagtgccg gcattgcggt 2241841 aaaaatgatt accggtgacc acgcgggcac cgccacggcg atcgcaaccg aggtggggtt 2241901 gctcgacaac actgaaccgg cggcaggctc ggtcctgacg ggtgccgagc tggccgcgct 2241961 gagcgcagac cagtacccgg aggccgtgga tacagccagc gtgtttgcca gggtctctcc 2242021 cgagcagaag ctgcggttgg tgcaagcatt gcaggccagg gggcacgtcg tcgcgatgac 2242081 cggcgacggc gtcaacgacg ccccggcctt gcgtcaggcc aacattggcg tcgcgatggg 2242141 ccgcggtggc accgaggtcg ccaaggatgc cgccgacatg gtgttgaccg acgacgactt 2242201 cgccaccatc gaagccgcgg tcgaggaagg ccgcggcgta ttcgacaatc tgaccaagtt 2242261 catcacctgg acgctgccca ccaacctcgg tgagggccta gtgatcttgg ccgccatcgc 2242321 tgttggcgtc gccttgccga ttctgcccac ccaaattctg tggatcaaca tgaccacagc 2242381 gatcgcgctc ggactcatgc tcgcgttcga gcccaaggag gccggaatca tgacccggcc 2242441 accgcgcgac cccgaccaac cgctgctgac cggctggctt gtcaggcgga ctcttctggt 2242501 ttccaccttg ctcgtcgcca gcgcgtggtg gctgtttgca tgggagctcg acaatggcgc 2242561 gggcctgcat gaggcgcgca cggcggcgct gaacctgttc gtcgtcgtcg aggcgttcta 2242621 tctgttcagc tgccggtcgc tgacccgatc ggcctggcgg ctcggcatgt tcgccaaccg 2242681 ctggatcatc ctcggcgtca gtgcgcaggc catcgcgcaa ttcgcgatca catatctacc 2242741 cgcgatgaat atggtgttcg acaccgcgcc aatcgatatc ggggtgtggg tgcgcatatt 2242801 cgctgtcgcg accgcaatca cgattgtggt ggccaccgac acgctgctgc cgagaatacg 2242861 ggcgcaaccg ccatgatgcc ccgtccgtga gtacggtgtg cgtgcggtcg atccggccag 2242921 agttaccagg tcggaactag ccagttacgt tgtactcgtg cggttctcgt agtcaaccaa 2242981 gcgtgcctgc agttcggcgt acggtacgga ccgtggcagc tgctctccgt cgctcacggc 2243041 ccgagccgcg tgggccgctg catacaaccc cgcgctgtag ggcactgaac cggttgacac 2243101 ccgggccacc ccgagctcac caaggtcggc gatcgtcaag ccgggcacgg gcaacgtgtt 2243161 aaccgggcac ggaatgttgc gagtgagctc agcaagttcg tcgggatcgt tggccagtgg 2243221 gacaaagacg ccgtcggcgc cggcatcgac gtagcgaagt gcgcgctgga tcgtgctggt 2243281 ggtatcggcg tgctggcgca accaataggt gtcgacgcgg gcgttgacga acacctcggg 2243341 gttacgttgt ttgatcgcaa cgattttagc ggctgccagg gcggggtcga tgagcttttc 2243401 ggcgctactg tcctcgatat tgattccggc tgtcgacagt tgtgcgacgt agtcagcaat 2243461 ggcgtcgggt tcgtcgctgt atccgtcctc gatgtcgacg ctgacgtagc attgcagcgg 2243521 tgccagggcg gccgccagtg cgatgttggc gccgcgagtg gcgcggtgcc cgtccgggtg 2243581 cccgccgctg gacgagaccc cgaaactggt tgtgccgata gccgtgaagc cctccgcgag 2243641 gtaggccagg gccgacggca catcccaggc gttgggcaac acgaacggaa caccttggtg 2243701 atgaagatcg tggaaactca ttccctacct ccctgctggc ggatgggcct gattgtatgt 2243761 gtgacccgcg tcagcagggt cagtcggtga gacccgtcgc cgctggccga ttcaactagg 2243821 ttgcggacgg atgaccactt cgttgggtat caccagaatc agtctgtcgt gctcgacgag 2243881 tgatgatgcg gcgcacaccg tatgccgcca caccgacacc gagcaccgcg gccccggcgg 2243941 ccaccgagga gagtggcagc gcgaacgcca ggactacgca gccgatcagt cccaccagcg 2244001 gaatcaggcg gcggggccgg ccctcgtcga gccccagagt caaggcggag gcgttggcga 2244061 tcgcgtagta gaccagcaca ccgaaggacg aaaagccgat cgcaccacgg atatccgctg 2244121 tcgccgccag cgccgccacc accgcgccaa ccaccagttc ggcacgaaag ggcaccttga 2244181 acctagggtg cacggcggcc agccagcgcg gtaggtgccg gtcgcgtgcc atcgccaagg 2244241 tggtgcggga gaccccgaga atcaaggcca gtagcgagcc caatgcggcc accgcggccc 2244301 ctatctgcac gacgggaatc agccagttca cccccgcgac ccgcatggcc tccgacaacg 2244361 gggcggcggc ccgcgcgagc cgctgcggac ccaacacagc gatcacggcc acggcgacca 2244421 gggcatacac cgccagggtg atgcccagcg ccagcgggat ggcgcgtggg atcgtgcggg 2244481 ccgggtcgcg gacctcctcc cccagcgtgg cgatgcgggc atagccggcg aacgcgaaaa 2244541 acagcaggcc ggccgcctgc agcatccccc agacgtgtgc atctacaccg atatcgagtc 2244601 gcgccgggtc cgcagcgccg gagccatagg cggcgaccac gactgcggtc aagaccacca 2244661 acaccacggc gacgatcgac cgggtgagcc aggcggactt ctgtatcccg gcgtagttca 2244721 ccgcggtcag tgccaccacc acggcgacgg ccaccgcgtg cgcttgcgcg ggccacacat 2244781 agaagccgac cgtcaacgcc atcgccgcac acgatgccgt cttgccgacc acaaagcccc 2244841 agcccgccag gtatccccag aagtcgccca gccgcatccg gccatacaca taggtgcccc 2244901 ccgaggccgg gtagcgcgcg gccagccgcg ccgacgagat cgcattgcag taggccacca 2244961 ccgcggccac tgccaacccg agcaacaacc cagaaccggc cgcgtacgcg gccggggcca 2245021 gggcggcaaa gattccggca ccgatcatgg acccaagccc gatcaccacc gcatccaaga 2245081 gccccagccg tcgccgcagc tcatctggaa tatcgcgtgg gtctagcggg cgtctcatgc 2245141 ctcgataagg ctacggcatc cgatatcggt atacgatatc tacccggaat ttgacgcccg 2245201 agacccgcat gcgtccaggg tttgtgggtt tggggtttgg tcagtggccg gtctacgttg 2245261 ttcgctggcc taaactccac ctgacgccgc ggcagcgaaa gcgtgtcttg catcggcgac 2245321 gattgctcac cgatcgcccg atttcgttgt cacaaattcc aatccgcaca ggagggccca 2245381 tgaacgaccc gtggcccagg ccaacgcaag ggccggcgaa aaccatcgaa accgactacc 2245441 tggtgatagg tgccggagcg atgggaatgg cattcacgga taccctcatc accgagtccg 2245501 gtgcgcgcgt cgtcatgatc gaccgcgcat gtcaacctgg tggacattgg accaccgcct 2245561 acccgttcgt gcggctacac cagccatcgg cctattacgg cgtcaactca agggcactag 2245621 gcaacaacac cattgacctc gtcggttgga accagggact gaacgaactg gcaccagtcg 2245681 gcgagatatg cgcctacttc gatgctgtat tgcagcagca actgctcccc accgggcggg 2245741 ttgactactt cccgatgagc gaatacctgg gcgacggccg gttccggaca ctggcaggca 2245801 ccgaatacgt cgtcaccgtc aatcggcgca tcgtcgatgc cacctacctg cgtgccgtcg 2245861 taccgtcgat gcggccggcg ccgtactcgg ttgcacccgg cgtcgactgc gtcgctccaa 2245921 acgaactgcc caaactcggc acccgggatc gctacgtggt cgtcggtgcc ggcaagaccg 2245981 gcatggacgt ctgcctatgg ttgctccgaa acgacgtctg ccctgacaag ctgacctgga 2246041 tcatgccgcg tgattcctgg ctgatcgacc gagcgacgct gcagcccggg cccacattcg 2246101 tcaggcagtt cagggaaagc tacggtgcga ctctcgaggc catcggggcc gcgacctcga 2246161 ccgacgatct gttcgaccga ctagagaccg ccggaaccct gctgcgcatc gacccctcgg 2246221 tgcgtccgag catgtatcgc tgcgccactg tgtcgcacct cgaactcgag cagctgcgcc 2246281 gtatccgcga catcgtcagg atgggccacg tccaacgcat cgagcccacc acgatagtgc 2246341 tcgacggcgg atcggttccc gccacaccca cggccctcta tattgactgc accgccgatg 2246401 gagcaccaca acgtccagcc aagccggttt tcgacgcaga ccacctaacc ctgcaagccg 2246461 tgcgcggatg ccaacaggtg ttcagcgccg cgtttatcgc gcacgtcgaa ttcgcctacg 2246521 aggacgacgc ggtgaaaaac gaactctgta ccccgattcc acacccggac tgcgatctgg 2246581 actggatgcg tctgatgcac tccgatctag gcaactttca gcgctggtta aacgaccccg 2246641 atctgacgga ctggctgagc tcggcgcggt tgaacttgct cgccgacctg ctgccgccgt 2246701 tgtctcacaa gccgcgggtg cgcgagcggg tggtgtcgat gttccaaaag aggttgggca 2246761 ccgccggcga ccagctagcg aagctgctcg acgccgccac cgcaacaacc gaacaacgct 2246821 aaggatcggc cgtgcaccat aaccgcgatg tcgacttggc gcttgtcgag cgacccagct 2246881 cgggatacgt ctacacaacg ggttggcgac tggccacaac ggacatcgac gagcaccaac 2246941 aactgcgcct cgacggtgtg gcgcgctata tccaagaggt cggtgccgag catctcgccg 2247001 atgcccaatt ggcagaggtc catccccatt ggattgtcct gcgcacggtc atcgatgtca 2247061 tcaacccgat tgagctaccc agcgacatca cctttcaccg gtggtgcgca gcgctttcca 2247121 ccaggtggtg cagcatgcgt gtgcagctgc aaggatccgc cggcggccgc atcgaaaccg 2247181 aagggttctg gatctgcgtg aacaaagaca ccctgacgcc gtcccgtctc accgatgact 2247241 gcatcgcacg tttcggcagc accaccgaaa accaccggct caagtggcgc ccatggctca 2247301 ccgggccgaa catcgatggt accgagacac catttccctt gcgtcgcacg gatattgacc 2247361 cgttcgagca tgtcaacaac accatctact ggcacggtgt gcacgaaata ctctgccaga 2247421 tacccaccct gacggcaccc taccgcgccg tgctcgagta ccgcagcccc atcaagtccg 2247481 gcgaaccgct gaccattcgt tacgagcagc acgacgacgt cgtgcgcatg cacttcgtcg 2247541 tcggcgacga cgtgcgcgcg gcagcgctgc tgcgcaggct ataaccgtct ggacgaatcg 2247601 gcggtatgcc gaccaccatg aaccaaggtc cgcaacgcat cgaagcacga ggagaatcca 2247661 tgtctggacg gttgatagga aaggtcgcac ttgtcagcgg cggggcgcgc ggtatgggtg 2247721 catcccatgt gcgggcgatg gtggccgaag gcgcaaaggt tgtgttcggc gacatcctcg 2247781 acgaggaggg caaggcggtg gccgccgaac tggccgatgc ggcccgctac gtccatctcg 2247841 acgttaccca acccgcgcaa tggacggctg cggtggacac cgcggtcacc gcattcggtg 2247901 gcctgcacgt gctggtcaac aacgccggca ttctcaacat cgggacgatc gaggactacg 2247961 ccctcaccga atggcagcgc atcctcgatg tcaacctgac cggagtcttc ctgggcatcc 2248021 gcgctgtcgt caagccaatg aaagaggctg gtcgcggctc catcatcaac atttcgtcga 2248081 tcgaggggct ggccggcacg gttgcttgtc atggctatac cgccaccaag ttcgccgtgc 2248141 gggggctgac caagtccacc gctctcgagt tggggcccag cggaattcga gtcaactcga 2248201 ttcaccctgg gttggtcaag acgccgatga ctgactgggt ccccgaagac atcttccaga 2248261 ccgcgctggg ccgcgcggcc gaacccgtgg aagtgtccaa cctcgtcgtc tacctggcca 2248321 gcgatgagtc gagctattcc accggcgcgg aatttgtggt cgacggcggg accgtagctg 2248381 gcctggcaca caacgacttc ggtgccgtcg aggtgtcctc gcagccggaa tgggtgacgt 2248441 aaacgccgat tggcaggcaa tgcccgaccg gtctggcgat gacgatcgcg tccgcgctca 2248501 accgcaatcg gatacccagc cggcctgtcc cgcacccggc ccaaggaacg gcgtcgtggt 2248561 ggctattccg actcgagtgg gtgatcatcc ttaggctcgt gcgcttggtc gaccgccgag 2248621 atagcaacga agccggcgcc ggcttggata ccgtcatggg cggcttcgat gtcgtaccgg 2248681 gcgagtcccg gcggttggtg cagcgtgcag cggcgggcga tgacccggaa tcccgagtct 2248741 gcgagcagtt gttcgagttc ggccgcggtg tagaagcggg cgtcgcggta gcctggctgt 2248801 ccgcgggccg cgcgcagagc gtacaggtcg gcccacggtg tcccgcgagg caagaacccg 2248861 ataacaaggc cgccgccgtc ggcgagcaga cgccgcgttt cccggaatat ggcggccggg 2248921 tcggtgacga aacagagcgt gaatgccatg aggaccgccc cgaagtgccg gctgacgaaa 2248981 gggaccgcct cgccgacggc attggcgacc aggacgccgc gccggcgtgc gaacatcagc 2249041 gcatcacggg atggatcgag tccgaaccgc acgccgagca ggtcggcgaa acgtcctgta 2249101 ccgacaccga tttccaagcg tggctgggca aagacctcga tgagcggccg caacgcggcg 2249161 acctcggtcg ccaggatcgg ccgcccggtg ggtgagtcat accaggcgtc gtaggccgcc 2249221 gcgtcgcgcc cggcggccga cgatgccggc atccgggtgt caggcgtcac cgcgagctga 2249281 ttccagcaac aatcggcgtt cggcggccgc gaccgacccc ggggtagcag caatcgcgcc 2249341 cgaatggacc gacactgagg tgattcccat ccggaccaga tgctcggcga aagtcgggtt 2249401 gcccgagagc gcttgaccac acagcgacga tgtgctgctg acagtgatgg ttggcatcgg 2249461 ttttcctttc ggcgttctca gatcgcgctg cgccagatgt ggtaggcctg tcccacggag 2249521 cgctcacgcg gccccgccgt gtcgatccgg tgcccggtgt cccagtccgc ttgccgggcg 2249581 gccaaggccg ccgcgatctc ggcggtggcg tcggagttgc ccccggctct agcaacgatt 2249641 ctgtcggcca tcacgtcaac cgtcgccgaa cacctgaatt cgacaatcgc cgagtgcgtg 2249701 tccgccgcga gacgccgggc gcaggcgcgc atctgcggat caccccaggt accgtcgagg 2249761 atcactgagt gcccactacc caagagcagg cgggctttgc gcagcgcctc ctggtagacc 2249821 gccacaacgt tggcacgact gtagagcccg gagtccaaaa cgccgggctc cccggtgatt 2249881 actccgcaat cgcgtagccg ccggcgcaca tcgtcggttg agatcacctg cgcccccacc 2249941 agttcggcga ccccgcgggc cagggtcgac ttgccggtgc ccggattgcc accgaccagc 2250001 gccaaccgga ccgtagcgtg ctgtaggtgt tgggtggcga tgatcaggtg gcgcacggcg 2250061 tccgcagcgg cctccggttt gccctgggag aatcgcacgc actcgacttt cgcgcgcacc 2250121 accgcgcgat aagcaatgta gaagtcgcgc agcgacgccg gggcggtatc acccgaacgc 2250181 accgcatagc cggccaggaa gtagtcccca agatctttgc ggcccaagaa ctccagatcc 2250241 atggccaaaa aggcggcgtc gtcgatgcgg tcgaggtagc gaagctcgtc ttcgaactcc 2250301 aagcaatcca gcagcgccgg ttcgccatcc accaagaaga tgtcatcggc cagtagatcc 2250361 gcgtggccgt ctacaataca accttctttg atccggccgg cgaacaaaac ctcgcgcccg 2250421 gaaacgaatt cgtcgaccat gtgttcaatc cgccgaatca catccccgga gaccactttg 2250481 tccgcgtggt ggcgaagttc ggccaggttt tcgtgccaac gccgcgccac cgcaccgacc 2250541 tcgccttgag tatcgatgca ccggttacgc tgtgcgcgct ggtgaaaccg ggccaacacc 2250601 tcagcgatcg cgtccagggc accctcgacc ggcaggccgg cggtcaccat cgacgccagc 2250661 cgctgcttgt cgcggtaacg ccgcatgacg acgaccggtt cggcgtgccc gccgcttgga 2250721 tcgctgagat gggcaatgcc caagtagctc tgcgcggcca gccgactatt caactcgaat 2250781 tcccggatac aggcgcgctc acgctgttcc gccgtgcgga agtcgcagaa atccgtcacc 2250841 acaggctttt tcgccttgaa cgcccggtcg ccggccaaca caaccactgc ggtgtgggtt 2250901 tcgcgcacat cgatgaaagg ctcatctgtc acaggatggg cgtcacacgt gccgtcgttg 2250961 gtcggtgagt ccatggcggt agccaagcca agtagtcacg actgccgtgc cacgatcact 2251021 ggcacccgcg cggcgtgtaa gaccgcgtta ctgaccgacc ccagaagcat gccggtcaag 2251081 ccacctcggc catgactgcc aacgacgaca agctgggcgg acgccgactt ttgcaccagc 2251141 ttccgcgccg ggcgatcgca aacgacaacc cggctcaccg gcacatcggg atagcgttct 2251201 tgccaacctg ccaagcgttc ggcgagacta agctccgctt cctgctgtac agccgagaag 2251261 tccaaacccg gaagttccac cacttcgacg tcactccacg cgtgcacggc gatcagttcg 2251321 acgccgcggc gcgacgcctc gtcaaatgcc accgccgtcg caagctccga aaccggcgaa 2251381 ccgtcgattc ccaccagcac gggagcgtgc tgcggatcag ggatcaccgc atcatcgctg 2251441 tggatgaccg cgaccgggca cccggcgcgt cgcaccaggc tcgagctgac cgaaccgagc 2251501 aagcctcggg ccagcgctcc ccggcccgag ctgcccaaca ccaccatctc tgcctcgttg 2251561 gagatttcaa ccatggtagg taccggcgtg gaaaatacga gctcgctctt tacgctgagc 2251621 tttcgatccg ctccaaccgc ctctttggcg agcttgacgg cgttggcgac gatctggcga 2251681 ccctcgtcct cctgccaaac cccccaggtc tccggatacg gcatcggcgg ccacgtcgct 2251741 acatcggcgt tcaccacgtg gaccacggtc agcggaatgt tcctcatcgc cgcatcggtg 2251801 gcaccccaac aggcggcggc atccgattcg agcgaaccat ctaccccgac gacaactccg 2251861 tgctgcttgc ggggtttaga catctcattc tcccttcgcc tcgagcaacg ctatgaaccg 2251921 ggacagtcac cggtcatgag gctttagtcc ccaatcggac ggccaaccga ccatgattgg 2251981 attcgacgcc cgaatccaag cgtgcgctgt ggcatcgtcg tcaatgtgac cggaccgccg 2252041 cccaccatcg accggcgcta ccacgacgct gtcatcgtcg gcctcgacaa cgtggtcgac 2252101 aaggccacgc gagtgcacgc cgcggcatgg acgaagttct tggatgacta cctcacccga 2252161 cgaccccagc ggaccggcga agaccattgc cccctcaccc acgacgacta ccgccgcttc 2252221 ttggccggca aacccgacgg tgtagccgac ttcttggccg cccgcggaat caggctgccg 2252281 ccgggctccc cgactgatct caccgacgac accgtgtacg ggctgcaaaa cctcgagcgc 2252341 cagacattcc tgcaactgtt gaacaccggt gtccccgagg gcaagtcgat tgcctcgttc 2252401 gcacgtcggc tgcaggttgc cggtgtccgc gtggccgccc acacctccca ccgtaactac 2252461 gggcacacgc tggatgccac cggcctggca gaagtgtttg ccgtctttgt cgacggcgcc 2252521 gtcaccgccg agctcgggct accggccgag cctaacccgg ccggcctgat cgagacggcg 2252581 aagcggctgg gagcaaaccc cggtcgctgt gtggtcatcg acagctgcca gaccggtctg 2252641 cgcgccggcc ggaacggcgg attcgcgctg gtgattgccg tcgacgcgca cggcgatgcc 2252701 gagaacctgc tgtccagcgg agccgacgcc gtggtcgcag acctggccgc tgtcacggtg 2252761 ggaagcggcg acgccgccat ctccacgatt cccgacgccc tgcaggtcta cagccaattg 2252821 aaaagactac tgaccggccg acgaccagcg gtgtttctcg atttcgacgg cacgttatcc 2252881 gatatcgtcg agcgccccga agcggcaacg ctcgtcgacg gcgcagcaga agcgttgcga 2252941 gcgctggcgg cccagtgtcc ggtggcggtg ataagcggac gcgacctggc cgacgttcgc 2253001 aaccgggtca aagtcgacgg gctgtggctg gccggcagcc acggcttcga attagtggcg 2253061 ccagacggca gccatcacca aaacgccgcc gccactgcag ctatcgacgg attggccgag 2253121 gcggcagcgc aattggccga cgcactccgc gaaatcgccg gagcagtagt ggaacacaaa 2253181 cgcttcgcag tcgcagtgca ctatcgcaac gttgccgacg acagcgtcga caacctgatt 2253241 gcggcggtgc gccgactcgg acacgcagca gggctgcgtg tcaccaccgg ccgcaaagtc 2253301 gtcgagcttc gcccggatat agcctgggac aagggcaaag cactcgattg gatcggtgag 2253361 cggctcggcc cggccgaagt cggccccgac ctacggttgc cgatctacat cggcgacgac 2253421 cttaccgacg aagatgcctt tgatgccgtg cgtttcaccg gtgtcgggat tgtggtgcgc 2253481 cacaacgaac acggtgatcg acggtctgcc gctacctttc gtctcgaatg tccttacacc 2253541 gtttgccaat tcctctccca gctggcttgc gatctgcagg aggcagtgca gcacgacgat 2253601 ccgtggactc tggtcttcca cggctacgac cccggccagg agcggctgcg tgaagcgctg 2253661 tgcgcggtgg gcaacggcta cctgggttcg cggggctgcg cacccgaatc agcggaaagc 2253721 gaggcacatt acccgggcac ctatgtggcc ggggtgtaca accagctcac tgaccacatc 2253781 gaagggtgca ccgttgacaa cgaaagcctg gtcaacctcc ccaactggtt gtcgctgacc 2253841 ttccgtatcg acggcggagc atggttcaac gtcgatacgg tcgagttgtt gtcctaccgg 2253901 cagacgttcg acctacgccg tgccacgttg acccgcagct tgcgattccg agacgccggc 2253961 ggacgagtga ccacgatgac ccaggagcgg ttcgcgtcca tgaaccggcc caacctggtc 2254021 gcactgcaaa ctcggattga atccgaaaat tggtcgggca cagttgattt ccggtcacta 2254081 gtcgacggag gtgtgcataa caccctggtg gaccgctatc ggcaactatc cagccaacac 2254141 cttaccaccg ccgagataga agtcctggcg gactcggtgt tgttgcgcac ccagacgtcg 2254201 caatcgggta tcgcgatcgc ggtcgccgct cgcagtaccc tgtggcgcga tggccaacgg 2254261 gtcgacgcgc aatatcgggt cgccagggac accaaccgcg gcggccatga catccaggtc 2254321 accctgtcag cggggcaatc ggtcacgctg gaaaaggtcg cgacgatctt cacgagccgg 2254381 gacgccgcga cattgacagc ggcaataagc gcacagcgct gtctaggtga ggccggtcgc 2254441 tatgccgagc tctgtcaaca gcacgtccgc gcgtgggcac ggctgtggga acgatgcgcc 2254501 atcgatttga ccggcaacac cgaggaattg cggctcgtgc gactgcacct actgcacctg 2254561 ctacagacca tttcgccgca taccgctgag ctcgacgccg gggtcccagc gcgcgggctg 2254621 aacggagagg cctaccgcgg gcatgtcttc tgggatgcgc tgttcgtcgc tccggtgctc 2254681 agcctgcgga tgccgaaggt ggcgcgatcg ctgctggact atcggtaccg acgactaccc 2254741 gcggcccgcc gagcggcgca ccgggcgggc caccttggcg cgatgtatcc ctggcagtcg 2254801 ggcagcgacg gaagcgaagt gagtcagcag ctgcacctca atccacggtc cgggcggtgg 2254861 actcccgatc ccagtgatcg tgcccatcac gtcggtctag cggttgccta caacgcgtgg 2254921 cactactacc aagtgaccgg tgaccgccag tatctcgtcg actgcggggc agagctgctg 2254981 gttgagatcg cacgcttctg ggtaggcctg gccaagttgg atgacagtcg cggccgctac 2255041 ctgatccggg gagtaatcgg tcccgacgaa ttccattcgg ggtatcccgg caacgagtac 2255101 gacggaatag acaacaatgc gtacaccaac gtgatggcgg tatgggtgat cctgcgggca 2255161 atggaggcgc tggacctgct accgctgacc gatcgccgcc atctgatcga aaagctcggg 2255221 ctgacaacgc aggagcgcga ccaatgggac gacgtgagcc gacgcatgtt cgttccattc 2255281 cacgacggcg tgatcagcca gttcgagggc tattcggaac tggcggaact ggattgggat 2255341 cactatcggc accgatacgg aaacatccaa cgactcgacc ggatcctgga agccgagggc 2255401 gacagcgtga acaactacca ggcgtccaag caagccgacg cgctgatgct gctctacctg 2255461 ctgtcttccg acgagctgat cggcctgttg gcccggcttg gctaccgctt cgcgcccaca 2255521 caaatcccag gcaccgtgga ttactatctt gcccgcacct cggatggatc taccctgagc 2255581 gctgtcgtgc atgcgtgggt tctcgcccgc gccaaccgga gcaatgccat ggagtacttc 2255641 cgtcaggtcc tgcgctccga tatcgccgac gtccagggcg gcacaaccca ggaaggaatt 2255701 cacctggcgg ccatggctgg cagcatcgac ctgctgcagc gttgctattc cggattggaa 2255761 ctgcgcgacg accggctggt gttgagcccg caatggccgg aagcacttgg accacttgag 2255821 tttccgtttg tgtaccgccg ccaccagctg agcctgcgaa tcagtggccg aagcgccaca 2255881 ttgaccgcag aaagtggaga cgccgagcca attgaggtcg aatgccgtgg ccacgtgcag 2255941 cggctacggt gcgggcacac catcgaagtc ggttgcagca ggtgaccaat gtcgcacatg 2256001 gtgggtcgac gatctctcct ggaaaggacg gccggccgcg gtctccctta ttgcgttggg 2256061 tgttgtgtgc tcgtcgcctg cgactaaggg cactccaccg ggatagccgc gaccagaggc 2256121 gtgtcgactc cgatcgggcc caccgctgcg gcaccacccg gcgaacccag cggagccact 2256181 cggcccggca ggacttggtg gaaaaaggcg gcgttgtccc ccagatgctg gtgttgatcg 2256241 tcgggtagat cgccttccca gtagatcgcc tcgacgcggc aggccggttt gcacgcacca 2256301 caatccacgc actcgtcggg gttgatgtag agcattcggg cgccctcata gatacagtcg 2256361 accggacact cctgcacaca ggacttgtcc atcacatcca cgcactcact accgatcaca 2256421 taggtcacaa acggcaagct accggcccga tgccgaggat cgcgcctatc caaagacccc 2256481 taccggaaag gaccaaaggc cttattcgtc aagttcgtca ctggcacgtc gacgcggggt 2256541 gcaagaaaac cggggcggtt cacccgaccg ccagcgggat tcacgctccc ccaggccata 2256601 aacttacgat agcccgtcat ttcaagagcg cgagaagttc atcgacactc ccggtggtca 2256661 agatctgatc cgcgggaacc gcaacgaccg tgtcgctcaa gggaaagcgg tgttcgccag 2256721 ggtagacgat tgccaacctc gccagttgga ggtcgacaag agccgagcgc atcgaccggg 2256781 aaatcgacgg tgtagacgtc cgcttgatct cgaatccata gggacggcca gataattcga 2256841 catagagatc gagttcggcg tcttgctggg tgcgccagta atacagcgga ttcggggcga 2256901 gcagggccgc aagctgctcg agcacgaacc cctcccagct cgcgccgagc ttcggattgc 2256961 gttcgagggc aagccgatcg tcgataccga gcaacctgtg caacaaaccg gtgtcccgga 2257021 tgtagatctt gggtgatcgg cgttgtcgct ttccgatgtt ggcgaaccag ggcgtcagct 2257081 gacggacgac gagtgcatcg gtgagcgcat cgaggtatcg ccgcgccgtc gtctgagcaa 2257141 cgtcgagtga gcgggcaagt tctgcgccgc tgaagagctg gccatggtag tgggcgagca 2257201 tcgtccacgc gcgccgcatc gtcgcggccg gaatgcgcac accaagctgg gcgagatcgc 2257261 gctccagaaa cgtggtgatg tagccgtcgc gccacgccgc ggagtcctcg ttggagcgtg 2257321 ccgtgaacga gggcggtaga cccccacgca accagaggcg atcggcggcc gaggatccga 2257381 cgtcgcggac cgtcaggccg gacaactcca ccaactcgac gcgtccggcc aaactttcgg 2257441 acgccagccc gacaagatcg ggtgaggcgc tacccaggat aagaaaccgg gccggcatga 2257501 caggcctgtc gacgagcacg cgtaggaccg gaaacagatc cggaatccgt tgcgcctcgt 2257561 cgatcgtgat caacccgcta aggccggata aagccaacat cgggtcggca agccgtgtcg 2257621 cgtcgacggg attttcggcg tcaaacgtac attcgggtgc ggacttgccc accagccggc 2257681 taagggtggt cttgccggct tgacgaggtc cggtaagcaa caccaccggc gctcggtgta 2257741 gcgcgcgtcg caaccgcgcg gcggcgtcgc ggcgttcgat caacatgcat gaaattctag 2257801 cggtaggcgc tgatatttca tggttagccg cccccgggag actcggtggt gggtcccaca 2257861 cgcctagaaa gtcgccggcg ataacgaccg gccaggtcag cggggttggc cgcagcccga 2257921 taaggctctc gatctcgtcc atcaggcatg ctccacatcg cctgcaccag ggcaaagctg 2257981 caccggtcgt gcgagccggt tagcaaatag cacgttcata cacataaatg tgtatagtgg 2258041 tgttgtgtca cggaccaaca tcgagatcga cgacgaactc gtggccgccg cacagcggat 2258101 gtaccgactc gattccaagc gaagtgccgt cgacctcgcg ctgcgccggc tcgtgggtga 2258161 accgttgggc cgcgatgagg ctttggcgct gcagggcagc ggtttcgact tcagcaacga 2258221 tgagatcgaa tcgttctcgg atacggaccg caagctcgcc gacgagtcgt agatgatcgt 2258281 cgacacctcg gtctggatcg catatctctc cacgtcagag tcgttggcca gtcgctggct 2258341 agccgatcgc attgccgctg actcgacggt gatcgtgccc gaggtggtga tgatggagct 2258401 gctgatcggt aagaccgatg aggacaccgc cgcactgcgc cgacggctcc tgcagcgatt 2258461 cgctatcgaa ccgctggccc cggtccgcga cgcggaagat gccgccgcca ttcaccggcg 2258521 ctgtcgtcgc ggcggcgaca ccgtacgcag cctgatcgat tgccaggtgg ccgcgatggc 2258581 gttgcggatc ggggtcgccg tggcgcatcg tgatcgcgac tacgaggcga tccgcacaca 2258641 ttgcggacta cgcaccgagc cgttgttctg actgcggaca cccggacgat ttcgtgtctc 2258701 acatctgacc cgtggccgtc gtcgtccgcc gccgggtaca tcgacatagt ggaccaggga 2258761 acatcgccag cgcatgagtg agcgcggata ccacccggtc cggggacgcg ttggcgctgg 2258821 ccgaagccga ccggcccagc gatgacatcg acttcaagga cgttcggctt tcagcgcgac 2258881 gatcatccgc ctcaggctgt cgcgggtcgc ttgcagcgcg gccgggtcta cggcgtcagc 2258941 aagtcggttg ttgacgacga tggcgcgttc ggtgattgcc tggagaacgc ggcgaccatt 2259001 ctcggttagc accagcagtg gtgatgtgcg gtggtcgggg ttgtgtctga gctcggccaa 2259061 gccgcaaacg accagatcgt tggccactcg ctgcaccccc tgacgggtaa caccaaggcg 2259121 gcgagcggct tggggcacgg tcagcgctcg atcggagacc acgctcagca gctgccatcg 2259181 cgcctgcgtg tgcccctctc tggcagcgac cacctcacct gagcgccgta gcaggccagc 2259241 gagctcgaat acgtctgcta ccagccgagc gatctcatcg gacatcccgc ctccaacttt 2259301 gacaatatat tgtcatcatg gttcgatgct gtcaaaatcg aaacggtcct gtcgtcgtcg 2259361 tgaaaccctt cgcatcggag aaaagatgag cgctccaatt acgaatcttc aagccgcaca 2259421 gcgtgatgcc atcatgaacc gaccagcggt caacggcttc ccccatctgg ccgagacgct 2259481 gcgccgcgcc ggtgtccgaa ccaatacctg gtggctaccg gcgatgcaaa gcctgtacga 2259541 gactgattac ggtccagtcc ttgaccaagg cgtgcccctg atcgacggcg tggccgaggt 2259601 cccggcattc gaccgcacgg ccctcgtcac tgcgctgcgc gccgatcagg cgggtcagac 2259661 gtctttccga gagttcgccg cggcagcctg gcgagccggt gtgctccgct acgtcgtgga 2259721 cctcgagaac cgcacctgca cctacttcgg cctgcatgat cagacgtata tggagcacta 2259781 cgcggcagtg gagccttccg gtggtgcccc tacgagttga gctgcgcccg tcgcagcgac 2259841 attccagcag accgcgacgt cagtcttggg cggcctgact atcgcgatga tccgtcgccc 2259901 gctcatcaac ccggttcgtg gtcaagactt ttcaccgggg cgacgtttcc tggggctagt 2259961 aaggcggttg ccgatcttcg tgaagcggcg gtgtccgaga cccacgacac caaggacgtg 2260021 ttagccgctt tggccgcgcg caagtccccg gtgcgacctt tctgatgcga tcgacgatgt 2260081 aggtgggatc tcgtgctctc cgcaccagtc gttgggatcc tgggcgattc cggacgcttt 2260141 gtcggtggtg acgcggtcga tgatccagcc tagcgccgaa cccgagccga gcaggcaacg 2260201 cccggcccca agtggtgcgc accgccgccg tggatcttga tgggagcacg cgaagctcac 2260261 tggtgcacca tccttgtgtc ggtgaccttg gatggattgc cgatgcaccc aaggcgccgc 2260321 tgggttatcg ccctgctcgc tcgacagccg tgatgtccac gatgagttct gcggagtccg 2260381 gcggtagccc cggacgcgcc gaccgtcgac aggactgagc gccgacgagc gccgaacagt 2260441 gagcggccca aaccactacc ctgcccgacg agccgcggaa cggcgtcacg ggtggaatcg 2260501 attgggcgcg agatgatcac gcggcgtcga tcgtcgatgc gcgtgggcgc gaggttcgcc 2260561 gcgccacgat cgagcacaac gccgccggac tgcgcgagct gctcgagctg ctgagccggg 2260621 ccggtgcccg cgaggtcgcc atcgaacgcc cggacggccc ggtcgtggat accctgctcg 2260681 aggccgggat cacggtggtg gtgatcagcc ccaaccagct gaagaatctg cgcggtcgtt 2260741 acggctcggc tggcaacaag gacgaccggt tcgacgcgtt cgtgctcgcc gacacgttgc 2260801 gcaccgaccg gtcccggctg cgccccctgc tgcccgacac cccggccacg gccaccctgc 2260861 gccggacctg ccgcccccgc aaagacctcg tcgcccaccg ggttgcgttg gccaatcagc 2260921 tgcgcgcgca cctgcgcgtc gtctttccgg gtgtggtcgg gttgttcgct gaccttgact 2260981 cgccgatcag cctcgcgttt ttgacgtttt tgccccgttt cgactgccag gaccgcgcgg 2261041 actggctgtc ggtcaagcgc ctggccggct ggctggccgc cgctggctac tgcggccgtg 2261101 ctccacgacc ggctcaccgg tgccccgcgc ggcgccaccg gtgacgaggg tgccgccaac 2261161 gcccacatca cccgggccat ggtcgccgcg ctcaccagcg tcgcgaccca gatcaagacg 2261221 ctcgacgcgc agatcgccga acagctctcc ttgcacgccg acgcgcatat cttcacctcc 2261281 ctgccccgct ccggcaccgt ccgcgccgcc cggctgctcg ccgagatcgg ggactgccga 2261341 gcccgtttcc ccacgcccga atcgttggcc tgcctggctg gcgtcgcccc ctccacccgt 2261401 cagtccggca aagtcaaaca cgtcggattc cgttgggccg cagacaaaca actccgcgac 2261461 gccgtctgcg acttcgccgg tgacagccgc cgagccaacc tctgggccgc cgaccgctac 2261521 aaccgcgcca tcgcccgagg acacgaccac ccccacgccg tgcgcatcct ggcccgcgcc 2261581 tggctctacg ccatctggca ctgctggcaa gacggcgccg cctaccaccc tgccaaccat 2261641 cgcgccctcc aggcactgct caaccaagat caagaccggg cggcttgaca cagggctact 2261701 catcggccta gcgggtgggc gccaccagcg ggtagcacga acgaaatcct tgatgcccca 2261761 aaccgtttaa gcgttactgc agggtacagg taccgagcgg gacccgctgc cgggcctagt 2261821 tgcttatcgg tggtggttgc ggctggaagg gttcatacca ccaccagtcg gcgcgctcgc 2261881 cggtgggccc aggccacggc gctaccgccg gcggcggctt cgtcgacgcc cgcgccaacg 2261941 atcccgcgct caaaggtcgg cccgcgctgt cggcgacggt gaggttgtct gccggtccgg 2262001 taatggtgat caggccccga tggtgtgccc ggtggtgata cgggcacacc agcaccaggt 2262061 tggccagctc ggtggcccca ccgtcctgcc aatgtcggat gtggtgggcg tgcaaacccc 2262121 gggtggcccc acaaccggga accacacacg tgcggtcgcg atgctcaagc gcacgacgca 2262181 accgacgatt gatctgacga gtcgttcgac cgcagccaat gacctgcccg tcacgttcaa 2262241 accaggcctc aaaggtggca tcacagagca gatatcggcg ttcggactcg ctgagcagcg 2262301 gacccaggtg caggccagcg gcacgctcct gcacgtctag atgcatcacc acggtggtgt 2262361 gctgcccatg tggccgacga gccacctcgg cgtcccagcc ggcctcaacc agacgcagaa 2262421 acgcctcaac attgcccggc aacgggggcc gctgatccga cacaccgtcg ctgttgtcgt 2262481 gatcacgctt gtactcggcg atcaacgcat ccagatgaga ctgcaacgcc gcatcgaact 2262541 tcgccgcctc cacgtgcgga agcttgattc gccaacaact gaactgctca tcggcgctcc 2262601 tggtgatcga gggccgcggt tccggccgaa aatccggttc gggttcgggt cgcggttcca 2262661 acttgagcgc ggtccgcagc tgattcaccg tggcaacgcc ggccaactgc gcataatgcg 2262721 catccgaacc ctcacccgcc cgccccgcga tcaccccaac ctgatccaac gacaaccgcc 2262781 cctcccgcat accccgggcg cagcgcggaa actccggcaa ccgccgcgcc accgtggcga 2262841 tcgtgtgggc gttgcctgac gagcagccca tcttccaggc caccaacccc gccaccgacc 2262901 gcgcccccgt cacaccccac aacccgtcgc gatccagctc agccacgatc tccacaatgc 2262961 gcccatcaat cgcattgcgc tgaccggcca actccgccaa ctcctcaaac aacacctcca 2263021 cacgctcggc aggactgact accgctgcgc cagacgtcgc ggtcgaggac atgagttcat 2263081 catcgcagca gggtctgaca actccggcca acccgaatcc acgcccgggg ccgtgccgtc 2263141 atcaccccgc aaagagatgc tcggctccgg ctccgccccc gccggggcca agggcacacg 2263201 agacaacgaa atcagcgaac ccaccatgga aacgctcaac ggcgtgggcc gcgaagccgg 2263261 cgaaatgctg ggagcagctg gtggacatcg catagatagg ccccagaccc agccagcacg 2263321 gctccaaccg tcgacgcgcc tagctgcaaa atcgcatgct tgtcagcgga taccggtata 2263381 ttttccggta tgttttcaga gccttatccg accgatggcg aagtcatgac ggaactcggc 2263441 gacaagttcc ttgctgctct tgttggcacc atcagggata cgcgcttcga catcgccgac 2263501 atgcggaact ggcggccggg atggtttccg accatgcata gccggtgtct gtccaacctc 2263561 atccacgaca gaatctgggc acacctggtc accctcatcg cgagcaatcc aggcaccagc 2263621 atcaaggaca agggtgccac ccgcgagatt gtggttggcg cacacctgcg gttgcgaatc 2263681 aaacgccacc acgcaggtga cgagatcagc acctacccga cccgaaccgc catcgaattc 2263741 tggcaacagg gcagccagcc cgccttcccg gggctggaag aggttcgcat tgcggtgggc 2263801 tatcggtggg accctgatac ccgcgagatc ggagcccccc tgctgtcgct tcgcgacggg 2263861 aaagatcacg tcatctgggt agtcgaactc gacgagcctg cggccggcgt gaagatcacc 2263921 tggaccccga tcgagccgac actaccgtcc atcgacttcg gtgacttggg tgaagactct 2263981 ggagcatcgg gggaacgatg aacggcctgg gagacgtgct cgcggtcgcc cggaaggctc 2264041 gtggactcac ccagatcgaa ttggccgagc tggtgggact cacccagccg gcgatcaacc 2264101 ggtacgaatc aggcgaccgt gaccccgacc aacacatcgt ggccaagctg gccgaaatcc 2264161 tcggtgtgac cgacgatctg ctcatacacg ggaacaggtt tcgaggtgcg ctcgcagtcg 2264221 atgcgcatat gcgccgccac aagaccacga aggcgtcggc ctggcgtcag ctggaggccc 2264281 ggttgaacct gttgcgcgtg cacgcgtcat tcctcttcga ggaagtggct atcaatagcg 2264341 agcaacatgt gcccgcgttc gacccggagt tcaccgccgc cgaggacgcc gcccggttag 2264401 tccgtgccca gtggcgcatg ccgatgggcc cggtcgtcaa cctgacccgg tggatggagg 2264461 ccgcgggctg cctggtgttc gaagaggact tcgccaccca gcgcatcgac gggttgtcgc 2264521 agtgggtcga cgactacccc gtcatgctga tcaacgccaa cgcagcaccc gaccgaaaac 2264581 gcttgaccct tgcccacgaa ctcggccacc tcgtgctgca ttccaccaac cccacggaga 2264641 acatggagac cgaagccacc gccttcgccg ccgagtttct catgcccgag agcgagattc 2264701 ggcccgagct gcgtcggctc gatctcggca agttgctcga actgaaacgg gaatggggcg 2264761 tctcgatgca agccctcctg gcgcgggcat atcgcatggg cctggtatcg gccgaggctc 2264821 gcaccaagct ctacaaggcg atgaacgcgc gcggctggaa aaccaaagag ccaggcatcg 2264881 agtccatcgt gcgagaaaaa ccgagcctac ccgcccacat cggcatgaca ctccgaagcc 2264941 gcggattcac cgaccagcaa gccgccgcca tcgccggata cgccaatcct gcggacaatc 2265001 cattccgccc cgaaggtggc cgcctccatg cgatttgact tccgattgac gctgggtttt 2265061 catgccgacg gcgccaggtg cggtcacaca aggcggccgg aacaggcatc gattcttggc 2265121 gacgccgttg ctgtaccgat agcgactgcc ccgtatcgat cccagggaac gtgaccatgg 2265181 tcgtagggat gacttgacag tttcaacggg gtgcgaccac cgttgcgctc agaaggcata 2265241 cgttggtgga acacgtcgga aagctgggag gtgaatctga tggctggcga ccaagagctg 2265301 gaactgcggt tcgacgttcc tctttacacg cttgccgagg catcgcggta cctggtggtt 2265361 ccccgcgcca ccctggctac gtgggctgac ggctacgagc gtcggccggc caacgcaccg 2265421 gcggtccagg ggcaaccgat catcacggct cttccccacc cgaccggcag tcacgctcgg 2265481 ctcccattcg tcggaatcgc cgaggcgtat gtgttgaacg ccttccgccg agcgggcgtc 2265541 cctatgcagc ggatccggcc atccctcgac tggctaatca agaatgtcgg gccacacgcg 2265601 cttgcgtccc aggatttgtg cacggacggt gccgaggtgc tctggcggtt cgctgaacgg 2265661 tccggggagg gcagtcctga tgatctggtg gtcagggggc tgattgtccc gcgatccggg 2265721 cagtacgtct tcaaggagat cgtcgagcac tacctgcaac aaatcagctt tgccgacgac 2265781 aacctggctt cgatgattag gttgccgcag tacggcgatg ccaacgtcgt cctcgatcca 2265841 cgccgcggct atgggcaacc ggtgttcgac ggaagcggcg tccgggtagc tgacgtgctc 2265901 ggcccattgc gcgccggcgc gacgttccag gctgtcgccg acgactacgg tgtgaccccg 2265961 gaccagcttc gagacgcgct cgacgccatt gcagcctgat cggaatctcc tcgccgacct 2266021 cgatcacatc tttgtcgacc ggagtttggg cgctgtgcaa gtcccgcaac tccttcggga 2266081 tgccggattc cggctgacaa cgatgcggga gcactacggc gagacgcagg ctcagagtgt 2266141 cagcgaccac aagtggatcg caatgaccgc cgagtgcggc tggattggat ttcacaagga 2266201 tgccaatatc cggcgcaacg ccgtcgagcg acggacggtg ctcgacacgg gagcccggct 2266261 attctgtgtg ccgcgggccg acatcctggc agagcaagtc gcggcacggt atattgcgtc 2266321 ccttgcggcg attgcccgtg ccgcacgatt tccgggacca ttcatctaca cggttcaccc 2266381 gagcaagatc gttcgcgtgc tctagtcgtt catcgctccg ttaaccgccg gcgaggccgt 2266441 cgacgatctt catggtctcg acgctgacgg tggtcacctt cttgatgagg tcgacgatgt 2266501 aggtgggatc gtcgtgttcg tcgcaccagt cgttggggtc gttgacgatg cccgacgctt 2266561 tgtcggtggt gacgcggtag cgctcgatga tccagccgag cgccgagcgg gagcgagcag 2266621 gtagcgctcg gcctcgtcgg gaatgccggc gatggtgacg cgggagtaga acgatcgcca 2266681 agtggtcggt cttggctgcc cacttcatcc ccggcgccac cggcaggtct cgcggtcatc 2266741 tcgaccaacg gagggccgtc ggtggttcgt atccggccaa gaacggcgag aacggtttgt 2266801 gcctctatgc cagggtgaat gtctcatctc ccaggcggac ggtgatatcc agttctccgc 2266861 caagagcgga cacgtatttg cgcagtgtgt tgacctgtgc ggagccgatg tcgccgttct 2266921 cgatgctgga tacccggctc tgccggatgt gcgccagcgc agccacctgg acctgggtga 2266981 gtgactgagc cgcgcgcagc tcccggagcc ggaatgcccg cacttcatcg cgcattcgtg 2267041 ccttgtgccg gtccaccgcc tcccggttaa cgggacgtac ggcgtccatg tcccgtagtg 2267101 tcatcgccat cgtgccactt accctttctt gcgcttgcgc ctctttggct tcgtgtcctc 2267161 gaactgtgcg agatgttcgg caaacatctc atcggccgct ttgatcttct cgtcgtacca 2267221 ctgggtccac cgcccggcct tgttaccggc ggccagcatg atcgcctgcc gcgccgggtc 2267281 gaaggcgaac agaatgcgga cctcggaccg cccttgtgat cctggacgca gctccttcat 2267341 gttcttgtgg cgcgacccac gcaccgtgtc caccagagga cagccaagtg cggggccctc 2267401 ttcctcgaga acctcgatag ctgcgaacac caattcgtag gtctctcggt ccaagccgtt 2267461 gagccaggcg gagatgcgct ccacatccgc cgtccacccc acagagtcgc agagtagcgc 2267521 gatacgcgat atcacacaag ggtgatattc ctccgggtaa gagcagcggg cgacggggct 2267581 accgtcgagg aaatgccggc aggcgaggac ggactctgcg cacccgggcc gttgaaacag 2267641 tagcctgtgc caggccgaga attcatcccc acgtatgagg cagtacagtg cgccgccgtg 2267701 cgcgttctcc catggaacgt tcacgggctc ccgtggatga caggcgtttc atgaacgcca 2267761 gcgccgccgc aacccgaccg aaagcggttg accccaagga gagctggaag tcgaggccac 2267821 caccttcgcc gcggagttgc tcatgcccga gagcgagact cgtcccgaaa tacgccggct 2267881 cgatttcggc aagttgctcg aactgaagcg ggaatgggcg tcgacccgct cgaccagccc 2267941 cagccgggtg accagcccca gccgggtgac cagccgatgc accgcggcga tcccaccgaa 2268001 gccggtggca tcgatgttgg cgccgacctc gtagcgcacc gcgcccgaac ccagcatcgg 2268061 cctgggctgc gccgcccagc gtccagcccg cgcgtgccgc gccgccaccc tgcgccctcg 2268121 gcgtgtgatg tttcgccgac tctgttcatg ggttatcttc ttcaccacaa aggcctttcc 2268181 tgctgggctg tgttgaggtc gcaaacccag ccagggtaag gcctttggcc tctcctaccc 2268241 ggccgacacg cttactgaag gcctagtcta ggcaggccat tcaatctgcg gaatcgaaaa 2268301 attcggttcc agcctgctcg tttcctttcc gacagcgatc tgacgttgcg taacgtcatt 2268361 tgtacggact cttttagcgg cattgatttc agatgccaac gccgtctgtg ctgtagcgcc 2268421 gattggccga aactgtaaat ttgtatgatt atttaaatct ttgacgaaca cgcgccacaa 2268481 acgtactatc tctttggcaa agtccaccgg catctcattc aacggttttg tttgcgcgtg 2268541 gtcgtcatat gttggtaact gtgtaaccgg ccgcctatct tgcgcgtgca tcatatgact 2268601 atgaatcggc cttctccagt gaaattgata caagatcgat ccgataagcg gtaccttgta 2268661 cacagtgcaa ttgtagtaat tcgcgttttg tcctacgctt gtattctgcg tgaagaattc 2268721 aaacacgcca ggcccgggcc gtcgtcaacc aattcgcggt atgcctcaac cactttcggg 2268781 aacagctcgg caacctgctt ggacgtcttg atgtccttgg cgaacgccac cgcccgacgc 2268841 atcggcggct caccggcgac aatgccggta ccggaccgct tggccaggcc attccagcag 2268901 ccgacgatct tggaggcgtc gtcgagcatc agctcgccgg aaaccccgga gagttcctgc 2268961 tgcaaccggg gcgcgatcac gccctgatcg acggtgagca ccatcacctt gtagtcggtg 2269021 agcagcccgc gctccaccgc ctcgccgaac gacagccggt gaaactccgg cccgaacgtc 2269081 agctcgtcgt ccatcgacac caactcggcg gagtgctggt cggccctgtc cttgatgctc 2269141 tcggtgaaaa tccttggcgt ggcggtcata tacagccgcc gggccgcctt cagatactga 2269201 ccgtcgtgca cccgcacgaa gttcgactca tcgtcccccg ccagcgtcac gccggtggtg 2269261 cggtgggcct cgtcgcacat caccaagtcg aactcgtcga cccccagccg ttgggccttg 2269321 gccaccgtgg gcagcgactg gtaggtgcaa aacaccacgg tcaggccctg ggcgcgcctg 2269381 cggtgcgcca tttcgtgcag caatacccgc gcgtcggtgg tgaccgggat cggcacatcg 2269441 tggacgtggt agtcctcggc cgagcgcgac accttggtgt ccgagcacac cgcgaacgcc 2269501 cgcacatcca gctcactctg tgcggtccac tcccgcagcg tctggctcaa cagcgaaatc 2269561 gagggcacca gcaacagaat ccgcgcgctg ccgccgttgt cggcggcgat gcgctcggcg 2269621 atcttgagcg cggtgaacgt cttgccggtg ccgcaggcca tgatcagctt gccgcgatcg 2269681 ttgcccaccg cgaacccgcg gaacaccgcg tcgatcgcct gctgctggtg cggccgcagc 2269741 tcgtggcgtt tggccggggt caggttcacc tgcaggtcgt cggccggcca ggcgatgtcc 2269801 cagtcgatcg gcgattcggc gatctcggcc atgccgatgc gctgcaccgg gaccaactga 2269861 tcggccagcg cgtcctcggc attgcggccc caccgatccg tcgtggagat gatcacccgg 2269921 ttggtgaagc ccgtcttgcc cgacgcggtg aaaaacgagt cgatgtcccc cttggccagt 2269981 gtgtgcgtcg gctcgtagaa cttgcactgg atcgcggtgt agttgccggt gtcacgttcg 2270041 cgggcgacca ggtcgattcc ggtgtcggtc ctgccccgcc gctccggcca gtcgatccac 2270101 caccacaccg cgtcgtactg ctgggccatc gtcgggtcca gctcgaaata gcgcaccatc 2270161 aactgctcga acttggtccc gcgctccgcg ttcgacggag ccttccggaa cgcctcgatg 2270221 acgtcgtgca ccgaccccat agttcaatga ccatactggc ggcaaccgac acgtggcggg 2270281 atccctcgcg ttcgatccaa cccaaccagc tcggccaacc gcatcgcggg ccggcatctt 2270341 cgccgtccta actcgggaaa tagcggttgt cactatctga gcgcagctat ctcatttgcg 2270401 gagaactagc cctgatcaat tcctgcctcg gttacgtgtg tcatgatcag ccggccagtt 2270461 cgaggttgag gtgaccttca catagtgaag cctcccgggt ttcgtgcgca ccttctttcg 2270521 agggaaggac gccacgctga gctgcgagtt cgtcgccgag catcgagccc ggttcgaggt 2270581 cgctgcgatc tgtcgcgtgc tgtgtgggca gggctgcaga tcacccggag aaccttctac 2270641 gcctgggcag cgtcggccgc cgtctaggcg tgccctgcgg gagatgacgg tcaccgagcc 2270701 cctggccggt tacgacgggc ccgataccga tggccgccgt aagcccgagt cactctacgg 2270761 tgcggccacg atctgggatc gacgagccat gttcagccgg ataggcgtgg atgagggcgg 2270821 tggtcagctt gggaacggtg tgggtgagtt cgtgttcggc gtcgtgggcg atgcggtgag 2270881 cttgcgcgag gtccagggcg gggtcgacgt cgagttcggc atcggcgtgc aagcggtgtc 2270941 cgatccagcg catccgcacg ctgcgtaccg cctgcacgcc gggccgggcc gccagggctt 2271001 gttcggcggc atcgaccatc gctgggtcga cgccgtcgag caggcggcgg aacacatctc 2271061 gcgcggcagt tcgtagcacg gccagaatcg ccgccgtgat gagcaggccg acgatggggt 2271121 cggccagtgg gaacccaagt gcgacaccgc cggccgagca cagcacggcc agcgaggtga 2271181 atccgtcggt tcgagcgtgt agtccgtcgg cgatcagggc ggccgagccg atgcggtgcc 2271241 caaccctgat gcggtagagg gcaacccact cgttgccgat gaatccgacc agcccggcca 2271301 gggcgaccca gccgacatgc tcgatctgct gcgggtggat caggcgggcg atggcttcgt 2271361 aaccggcgat gatggccgac atcgtgatca tcgcgaccac gaacgacccg gccaggtcct 2271421 cgacgcgacc gaatccgtag gtatatcggc gagtggcggg cttggcgccc aacgcgaacg 2271481 cgatccacaa cggcaccgcg gtcaacgcat cagcgaagtt gtggatggtg tcggcggcca 2271541 gcgcaaccga ccccgacatc accacgatca caatctggat gagcgcggtc aacccgagaa 2271601 ccaacaagct gatcttgacc gtacggatcc ctgccgcagt ggattccagg gtgtcgtcga 2271661 cgctgtcggc ggcgtcgtgg gagtgcggcg cgaagatctc cttgatcatc gccggcacac 2271721 ctcgtgaatg agcgtggtcg tgggtcatcg ggcgcaggcc ctttgtgaca gcaggccaga 2271781 tcggccgcgt tcgaccacca agcaagctct tttatctgca ttcatacgca gataatagcg 2271841 gatgctctcg ccggttccag tactagctgg gacggacgac gatcaccggg attctcaccg 2271901 aatgggctac cgcggaactc accgaaccca acagcatgcc ggaaaacccc ccgcgcccat 2271961 ggctgccgac caccaccagc tgagcttgct cagaatgctc gagcagccac cgagcgggct 2272021 tgtcgcacac cagcgatcgg tgcacgcgga catccggata ctgctcttgc cagccggcga 2272081 ggcgttcagc gaggacctca gcctctctct tctcgcgctc tcgccaatcc atccccagaa 2272141 ccggaaacat ccccagatcg gtccaggcgt gcaacgccac caggtccacc cttcggcggg 2272201 aggcttcgtc gaaggctagg gccgttgccg cctcagaggc tggcgatccg tcgatgccca 2272261 ccaacaccgg tgcatcggag tcgggagtcg cgccattacc ggaatgaatg atggccactg 2272321 gacaccgcgc atggtggagc aacgcggtgc tgatcgagcc gagcagcagt cgacccaatg 2272381 cgcccatccc ctggctgccg acgaccatca accaagcctg ttgggatgca tcgataagcg 2272441 tcggcacaac attggaaaag accaactcgg tatgcacctg cggcggtttg gactcaccca 2272501 agctgttggt gagcgcctcg cgggcctgct caatgacctg ctgtgcgttg tccttttgcc 2272561 actcagtcat attcgcgtac agctggccca ccggccagcc gacaaccaca ggggcaacaa 2272621 tgtgcagcag ggtgatgggc agctggcgca tgacggcctc acgggcggcc caggctaccg 2272681 ccgcgttgga ttgcgctgat ccgtcgacgc caacgagtat tccgtatttc gctgtcgcag 2272741 cagacatttc acgctccttg cggtcggaac acagtccatc aatccatcag cgcagcggtg 2272801 cagaccaccg cagcaaggtg cctccggtcg gcatgttctc gactgtgaat tcgccgcccg 2272861 cgtcgtcggc acgctggcgg agattgcgca ggccgctttc ggtgatgtcg ccggagatgc 2272921 cgacaccgtc gtcgacgacc tcgacccgca catcatcctc gacgctgacg ttgatggcca 2272981 ggctggtcgc gttcgcgtgc cggacagcgt tgctaaccgc ctcccgcaga accgcttcgg 2273041 cgtggttggc caggacggtg tcgacaacgg acagcgggcc cgtgtactgg accgtggtgt 2273101 gcagcgcggg gatcgcgagt tggtcgatga ccttgtccag tcggtggcgc agacccgtcg 2273161 cccgggaggg cccggcgtgt aggtcgaaga tcgcagatcg aatctcctga atgatttcct 2273221 ggagatcgtc gatgctgctg tagatggatt cccggacggc ggggacacgt gctcgcggag 2273281 cggcaccctg cagggtgagc ccgactgcga agagccgctg gatgacgtgg tcatgcagat 2273341 cacgtgcgat ccggtcgcga tcggtcagga tctccacttc tcgcatctgt cgctgcgcgg 2273401 tcgccagccg ccaggcgagc gcagcctggt cagcgaaggc ggccatcata tcgagctgtt 2273461 tgtcgctgaa cggctgttca tcggcactgc gaagtgcgac cagcacaccg gcaacagtgt 2273521 cggcggcacg cagcggcagc accagggcgg gcccgggctc caccgggccg tcgaccgcga 2273581 ggtcaagccg gtcgaaccgg cggggcgtac ggtcgtgaaa gactcccccg atcgacgttc 2273641 cgctgacggc aaccgtcatt tgcttgaccg ccggggagat ctctccggcc acctctacga 2273701 tgaccaggtc gtcgacctcg caagccggcg cttcgtcgtc gagcggcacc gccaccaagg 2273761 tggctgcccc agccatcaac gtcaacgctt cctcggcgat gagccgaaac accatggccg 2273821 ggtccgcacc ggccagcatc tgcgttccga tgtcgcgggt tgcctcgatc cacgcttccc 2273881 gggtccgtga ttcctcgaag agacgggcat tgtcaacggc aatcccggcc gcggcggcca 2273941 gcgcctgcac cagcacctcg tcgtcatcgc tgaacggctg gccatctgcc ttctcggtca 2274001 agtaaagatt gccgaacacc tcgtcgcgga tgcgcactgg aaccccgagg aaggtccgca 2274061 tcggcggatg gtgcagcgga aatccaaccg atgcgggatg ccgcgagata tcgtccagcc 2274121 ggatcggctt tggctcctcg atcagcgcgc cgagaacacc tcgcccctcc ggcaatgagc 2274181 cgatgaggtg ccgggtctct tcgtcgatcc cctcgtagac gaattcgacc aatctatggt 2274241 cgtaaccgcg caccccgagc gccccgtagc gggcatccac caactcggcg gcggtatgca 2274301 caatggcgcg cagggtggcg tcgagcttga gtcccgatgt gatcgccaag atggcgtcga 2274361 tcagaccatc cagccggtcg cggccttcga cgatctgttc aatccggtct tggacttcca 2274421 gcagcagctc tcgcaaccga agctgcgaca gtgtctcgcg caatggcggg ctgccagggt 2274481 taacgttcgc cctgtcaggg tgtgtcacat agctatgttg acaccggagc tgcgctcaac 2274541 caactggtct ggctacccag cggcacagtc acagatactg ctgaccgacg accagcaggg 2274601 tgcagccggc ctcctgcaac acggcgttgc ccggcgctcc cacaagttgc tccacatgct 2274661 cctggtcgct cgcgctgagc accaccatgt gtaccgatcg acccagccca gccagataat 2274721 ccagcagctc gccgtgcact gccgccgatt gcacccgcac atcgggatac cgtggttgcc 2274781 aacgggcaag ccagcggtcc aggctggcac ggacgtcgtc cccggtatcg cccactccgg 2274841 attgccggca ggtgaccacc cgaaccggcg agtcgcgcag ccgtgcttcg gccatcaccg 2274901 cccccagcaa aacaccgata tcggacgacc cgtccgcctc gacgacgatc catgcggcgt 2274961 cgcgtccgat ggggacccgg tggggtcgca cgatcgccac tgggcactgc gccgataacg 2275021 ccagggccgc tgcggtagat cccacccgct ccggtcggaa gtggtgcacg ccgatagcgc 2275081 caacgcacac cagggcagca gccgccgaag cgcggatcaa cgaggtgacc ggccgctcct 2275141 gggtgatctc cacctcgacc ttgaccggcc ggtccgccgc ctcgaccgct gtgaacgcgt 2275201 agcgcaccgc gttctcggcg gcggcgagtt tgcgagccgc cgcgccgtgt gcggcgtacc 2275261 cgggatcgtc gggttcgatc gcgtacagca gacgcagcgg gatgtcacgg ctggctgcct 2275321 cgtcgaccgc ccacagtgcg gcttgcacgg ccggcttcga gccatcaata ccgacgacga 2275381 tcgatggggg tttgtgtgat tggttcatgg cgaggcttcc gggttaacga tcgggtgcca 2275441 aacgtattga tcctgcccga cttcggtggg ttcggccgcc agctcgaaga acctctccac 2275501 atcgtcgcga ttgcaggccg cggtgcctgg cgtcagcagc atggctgcac ctgccgcgtt 2275561 tcccaagcga acggacttga tgagcgacca gccacggctg aggcccacgg taatcgcggc 2275621 caccatcgcg tcgccggcgc cgacaccgct aaccgcggtc atcggaatcg acgaaaatcg 2275681 atggctcgca tgtcgtgtgg ccaatagcgc gccctgagat ccaagcgaga ccaccacgac 2275741 ctcggcgcgc ccacggtcaa tgagttcgtg tgcggcggcc agttgttcgg gctcggtcag 2275801 cagttcggat ccgacgcact cgcgcagttc ccgcacgctc gccttgagaa gaaacacccc 2275861 ggacgaaatg tgctgcaacc cgccaccaga tgtatccagg atcagcggag tgctcgatcg 2275921 gcggcagatg tcggcaaccc gctgatagta gtcggcagcc acacctggcg gcaggctgcc 2275981 actggccacc acaaaggcgg ccgaagccgc cgcaccgcgc agttcgtcga ggcattgctc 2276041 ctgctccgcg acggtcagcg acggccccgg aagcacgaaa cgatactgct tggcggtcct 2276101 ggactcgttg accgtgaagc tctcccgcgt cgaggccgcg atcggaatga cgcgaaatgg 2276161 cactcccgca tcaccgagca gcgccatcag caggctcccg gtcgacccgc cggccgggaa 2276221 cagtgctgtc gagcaaccgc cgaggacatg cacaatgcgg gcgacattga taccgccgcc 2276281 gccgggatcg tagcgaggtg cgccacaacg cattttctcg gtcgggcgca ccacgtcgac 2276341 gctcgtcgtg atgtccaagg cggggttcat ggtcaaagtg atgattcgcg gcttgccttc 2276401 gtcccacgcc gctggctccg tcatcgtcgt ggactctgcg ctacagaccg gtcgggtagg 2276461 tttccgggtt ctcgccggcg atccaccggc tcgtcacctc gagaggttcc agggcacggg 2276521 tctgatcgat gtggatcatg gcgtcgaact ggtcggcggg ccgcacgtgc aagtagtgac 2276581 tttgccgttc cgttgccggt agataaacga cgccgatggc acgtcccaac cggacaacgt 2276641 ccagcggggc ttcggcgtcg cggcttagcc gcgctgacac caggaaactg tctgcagtct 2276701 ggtggaagag ctcctcgaca ctgccgtgca gtgccggccg aaccgctttg cgttgggcga 2276761 taccacccca ttcgctggcc gcggtgacgg tgcccgtgta cgtgctgaat ccgatgctgc 2276821 gcgactcgtc accgtatcgc tcacggacta tctggccgag ggtgagctgc ccgtcggccc 2276881 acacctcggt agcgcgtgcg tcacccacgt gggagttatg agcccacacc actattcgcg 2276941 ccggcggcgc atcgaggtgt cggtccaaat gcgtcagcaa actgccaagg gtctgcgcca 2277001 tgtgctggtc gcgcaggttc cacgaggtaa cgcgtccact gaacatggcc cggtaataca 2277061 cctctgcgtc gcgcaccgtc tgcgcgtttt gctgggcgta gaacagttcg tcctcggcaa 2277121 gcagcccgtc ttggcgcgca tacgccaggg cattgcgctg aacgtcgacc agttgctcga 2277181 cggcttcacg ttcgcacgac ggaccggcgc cgaatgcggc cgcgaatccg tacgcctgac 2277241 cgtcatcggc gcaggcatgg tcgaagcacg cataccgggc ccgcgcccgt gccgccgcac 2277301 gcgggtcgac cttgtcgaga tagctgatca cctcttggat cgaccgatgc aggctgtaaa 2277361 gatccagacc gtagaagccg gcttgccgca gcgcgcccga ctcgtagcgc tggttgcgtg 2277421 tgcgcagcca ttccacaaaa tctcggacca cggtgttgcg ccacatccag gcgggaaacc 2277481 gctcgaatcc gctaagcgcc tcgtcagcgt tggtgtcctc gccgaggccg cgaacgtacc 2277541 gattgacccg gtaggcgtcg ggccagtccg cctcggcggc taccgcacca aagcccttct 2277601 cctcgatcag ccactgtgtc atggcggccc gggcctggta gaactcgtgt gtgccgtgcg 2277661 agctttcgcc gatcaacacg attcgtgcat cgccgaccag ctccgccaac acctcgtgcg 2277721 tcggaacacc cccgggggcg tcgatcgcga ctctgcgcag aacatcggcc gccgttgacg 2277781 ccgcgggccg gcgcagcgac ggcccagcgg tcggggtggc caggagccgg cggacctcct 2277841 cgtcggtgac ctgccggaag tcccaaaacg actcaccgac ggccaggaac ggggtcggca 2277901 tggtcgcgca cacaacgtcg tcgacgaggc cggcgaactc ccggcacgtg gactccggcg 2277961 ccgccggcac ggcaatcacg atctgcgctg gttgcgcatc gcgcaatgcc tgtaccgccg 2278021 cgaacatgct tgcgccggtg gccaaaccgt catcgacgac aatgaccgtc ttgccggtga 2278081 tatcggtggg cgggcgctcg ccgcggtagg cggactcgcg ccgaagcagt tcccgaccct 2278141 cacgttcggc gatgtcgcgc agttgctgcg gtgtgatccg caggccccgc acgacgtcgt 2278201 cattgaccac gacgcggccg ccgctggcca gtgcaccaac ggcgaactcg tcatgccccg 2278261 gggcaccaag tttgcgcacg acgaaggcgt ctagcggggc atgcagtgcc gcggcaacct 2278321 cccatgcgac cgggaggcca ccccgggcca agccgagcac aatcacgtcc ggctggtccc 2278381 gataggcggc gagtaattcc gccagcaccc ggccggcctc gcggcggtca cggaacacgc 2278441 gccgcggcga gcgccgggtg acatcagccg ctgcggtcat cagcacggac ccagtggtca 2278501 gttggtggac cggatctgaa tgtgcttttc ggttggcttc ccttccgaaa ccgccaccga 2278561 cacagtaaga atgcccttgt cgtaggtggc cttaatgtcg tcctcgtcag cacctaccgg 2278621 cagcgacacc gtgcgaacga aggaaccgta cgcgaattcc gagcgaccgt cgaagtcctt 2278681 ctgctcggtg cgctcggcct tgatggtcag ctgaccatcg cggaccataa tgtcgacgtc 2278741 cttgtcgggg tcgaccccgg gaagctccgc gcgtacctcg tagcgcccct ctttcatctc 2278801 gtcttccagc cgcatcaacc gggtgtcgaa ggtgggccgg agtccggcga atgacgggaa 2278861 ggccgcgaac agctcagaaa actcggggaa gagggaccgc gggtggcgct gaacgggaag 2278921 ggtggtggcc atttgatgcc tcctaatcga tggaaacgga tgcctttgat ccgaccagcc 2278981 catcgtggcc agggctaggg acagaagtcc ccgaagcgcg ggccatttgt ccgcgcccgt 2279041 cggtgatcca cttggggacc attgaccctg ttgtctgcca accgccgttc agaaagatcg 2279101 gggtgatatc gaacagcgga ggttgatcat gccggacacc atggtgacca ccgatgtcat 2279161 caagagcgcg gtgcagttgg cctgccgcgc accgtcgctc cacaacagcc agccctggcg 2279221 ctggatagcc gaggaccaca cggttgcgct gttcctcgac aaggatcggg tgctttacgc 2279281 gaccgaccac tccggccggg aagcgctgct ggggtgcggc gccgtactcg accactttcg 2279341 ggtggcgatg gcggccgcgg gtaccaccgc caatgtggaa cggtttccca accccaacga 2279401 tcctttgcat ctggcgtcaa ttgacttcag cccggccgat ttcgtcaccg agggccaccg 2279461 tctaagggcg gatgcgatcc tactgcgccg taccgaccgg ctgcctttcg ccgagccgcc 2279521 ggattgggac ttggtggagt cgcagttgcg cacgaccgtc accgccgaca cggtgcgcat 2279581 cgacgtcatc gccgacgata tgcgtcccga actggcggcg gcgtccaaac tcaccgaatc 2279641 gctgcggctc tacgattcgt cgtatcatgc cgaactcttt tggtggacag gggcttttga 2279701 gacttctgag ggcataccgc acagttcatt ggtatcggcg gccgaaagtg accgggtcac 2279761 cttcggacgc gacttcccgg tcgtcgccaa caccgatagg cgcccggagt ttggccacga 2279821 ccgctctaag gtcctggtgc tctccaccta cgacaacgaa cgcgccagcc tactgcgctg 2279881 cggcgagatg ctttccgccg tattgcttga cgccaccatg gctgggcttg ccacctgcac 2279941 gctgacccac atcaccgaac tgcacgccag ccgagacctg gtcgcagcgc tgattgggca 2280001 gcccgcaact ccgcaagcct tggttcgcgt cggtctggcc ccggagatgg aagagccgcc 2280061 accggcaacg cctcggcgac caatcgatga agtgtttcac gttcgggcta aggatcaccg 2280121 gtagcgggcg ccgccgggac cgcgtctaag caccgcagct gaatcgggcg gatgatgtgt 2280181 cgatgagcgg atccggcgat ggcgacggtg tcgcgcggtt gggcagacat cttccgcggc 2280241 tattcgtccc cggccggctg agtgacgaag tcgatcagtt cttccacccg gccgatcaac 2280301 gccggctcta ggtcggtcca gtcgcgtact tgcgaacgga tgcgccgcca cgccgcggcg 2280361 atgtcggcct ggtcggcgtg cggccagccg agcgcatcgc acacgccgtg cttccactcg 2280421 atgtgccgcg gcaccctcgg ccaggcggcc agcccaactc gttgtggctt gaccgcctgc 2280481 caaatgtcga cgtaggggtg cccgacgacc aaagtgtcgg agccgcccgg tccgcggcgc 2280541 accacctcgg cgatgcgcgc ctctttcgac cccgcgacca aatggtcaac gagaacgccg 2280601 agccgacgcc gcgggccggg ccggaacttg gcgacgatct ccaccaggtc gtcgacgcca 2280661 ccgagatgtt cgacgacgac accttcgatt cgcaggtccg ctccccatac cgccgcgatg 2280721 agttcagcgt cgtgtcggcc ctcgacatag atccggctgg cccgggccac ccgggcacgc 2280781 gcgcccggca ccgcgaccga gccggatgcc gttcgcctcg ggccggcagc cgctgcgcac 2280841 cgcggcgcgg tgaggatcac cggcaggccg tcgagtagat acccggggcc cagcggaaac 2280901 ccgcgggtct tcccgtagcg gtcttccaag tcgatgcggc catattcgac tcggaccacc 2280961 gcaccgacgt agccggtctc ggcgtcttcg acgaccatgc cgagctcgac cgggtgctca 2281021 accgagcggg gccggcgccg cccgcctgcg gcaagcacgt cggttccata gcgatccagc 2281081 acgccgcaat actagggagc ctctctgccg gtcatcgccg cgacgcgccg catgggttct 2281141 cggaaaatgc ttgtaccagt cgactttccg gcgggccaac gtcgccaacc gatactcggc 2281201 tccaacgcca tgggtgacgg gatgcccgga tcacgtgtca caccacccgc gcacccttgc 2281261 ggaagaatat ccgtaagtct aaacttacgg ttcgtgtcca cttacagatc accggatcgc 2281321 gcttggcagg cgctggcgga cggcactcgc cgggccatcg tggagcggct ggcgcacggc 2281381 ccgctggccg tcggcgagtt ggcccgcgac ctgcccgtca gccgacccgc ggtgtcacag 2281441 cacctcaaag tgctcaagac cgccaggctg gtgtgcgacc gccccgcggg aacacgccgc 2281501 gtctaccagc tcgacccgac aggccttgcg gcattgcgca ccgacctcga ccggttctgg 2281561 acacgcgccc tgactggcta cgcgcagctc atcgactccg aaggagacga cacatgacac 2281621 gcccgcgaac cgatgccatc caccaccacg ttgtcgtcaa cgccccgatc gagcgtgcgt 2281681 tcgccgtgtt caccacgcgg ttcggcgact tcaagcctcg cgagcacaat ctgcttgcta 2281741 tcccgatcac cgagacggta ttcgaatgcc atgcgggagg ccatatctac gatcgcggtg 2281801 ttgacggaag cgtgtgcaaa tgggcgcgcg tgctggtcta tgaaccgccc agccgggtgc 2281861 tattcacgtg ggatatcggc ccgacttggc ggccggaaac cgatctggcc aagaccagtg 2281921 aggtcgaagt ccgcttcacc gcgcagtccg ccgagacgac acgcgtcgac ctcgaacatc 2281981 gccatctcga ccgacacggt ccgggctggg agtcggtcgc cgacggcgtt gacagcgagg 2282041 ccggatggcc gttataccta cgccgctata ccgacctgct ctgcatccag gtgcagccat 2282101 gatcgcggca gacgacgata ccgagaagtc catgatggac atggcccgcg ccgagcgggc 2282161 cgaactagcg gcgtttctga ctaccctcac actgcagcaa tgggaaacac ccagcctgtg 2282221 cgccgggtgg agcgtcaaag aagttgtcgc acatatgatc agctacgaag atctcggcgt 2282281 tttcgggttg ctcaagcgct ttgccaaagg ccggatcgtc cgggccaatg aggtgggtgt 2282341 cgacgaattc gctgggctca gcccacagga gttggccgac tatgtcggcc ggcatctcca 2282401 accgcgtggg ctgacagcgg gtttcggcgg aatgatcgcc ctcgtcgatg gcatgatcca 2282461 ccaccaggat atccgccgcc cgctcggtca gccccgcacc atccccgcgc agcgacttga 2282521 ccgcgtgttg cggctgatgc cgaagaaccc caggctgcga gctcggccac gcatcaaagg 2282581 gctgcgactg cgagccaccg acctcgactg gacaatcggc accgggcccg aagtaaccgg 2282641 gcccggcgaa gccttgctca tggcaatggc cggcaggcca gcggcggtca gcgacctctc 2282701 cggccccgga aagcccacgc tagccggacg actcggttaa cgacagctac agcgacggcg 2282761 tgaacgggcc gccgcagtca gccagacaat cggcgtaatt ccagttcgcc aagaactttt 2282821 gacccgcctg aaatccgcgt tggtaaagag cctcgcgttg ttcggcggtg atgtcgaagt 2282881 cgatcggact cacgtcgtgg gcgggcacga agatggtgcg ccgaacggta cacggatcgt 2282941 cgatgtaggc gttgtcctga ttgctcacca gtgtttcgat cgccgcgatg cccaacgaca 2283001 ctggcccttg gaccggccgg gtaggtggaa tgcccggacg cgctgacaac ctgatcccga 2283061 acgtgggcca tcgcggttca gcgtcggttc ggtcgaacag cgccaccgga aagttcgaca 2283121 gcaagccacc gtcgacccag gtagcgccgc gcacccgaac aggctcgaac acaaacggga 2283181 tcgccgatga ggcgtgcacc gcacgcgcca ccgagaagtc gtccgggtgg atgccgtagg 2283241 agtccaggtc ccacgggatg cgaacgagtc ggcgacggga taggtcgctg gcggtgacca 2283301 ccagcgacca ggcgaactgt tcgggtgcct cgccggtgcg caagtcgcca aaggtgtgca 2283361 cgcctaggtc agcgagcaaa ccgccgagca gctgttccag ataggccccg cggtaaacgc 2283421 cgtccgacaa cagcagagaa agtcccccgc cgatcaacgg cacgtgtcct atcagattgc 2283481 ggtcgaggaa cttcgggtag tcgatgctgc gcatcatctc ggcaagccgc gtcaccggct 2283541 caccggccgt ttgtagggcc gcgaccagcg acgcgacgat cgcacccgcg ctgctgcccg 2283601 ccaccctggg aaatcggtaa ccggcatcgg ccagcgcgtc caccgctcca accaacccta 2283661 tcccccggac cccgccgcct tcacacacca ggtcgacgcg tgctgtgctc accagcgcca 2283721 cgttagcccg gaatccgacg cccgtcgacg gcgaagaagt gcaggtgtcc cggtgtggga 2283781 catagccgca cgcgactacc ccgctcgggc gggccgcggc cgtccactcg agcgacgatt 2283841 gactggtcca tttcgcagcc gcccgacacg attcggccat acaagtaggc gtccgctcca 2283901 agttcttcga ccatgtcgac gtccatctcg atgccggcgc cgcccagctc caaatgttcg 2283961 gggcgaacac cgataatgac ctcggctgcc gtaccgacga ccgcacgcgg cagcaggatc 2284021 tgccaatcac ccagtgacac cgtggaatcg gcgatggaaa gcctgaacag gttcatcgcc 2284081 ggggaaccga tgaaccccgc gacgaacacg ttgcccgggt tgcggtagag ctctcgaggc 2284141 gaagcacact gttgcagcac accgtcagac agcaccgcga cgcggtcacc catcgtcatg 2284201 gcctcgacct ggtcgtgagt gacatacacg gtggtcgtac ccagttgccg ttgtaacgcg 2284261 gcgatctgat tgcgggtttg cccgcgaagt ttggcgtcaa gattggacag cggttcgtcc 2284321 atcaggaata cctgtgggcg ccgcacgatc gcacgaccca tcgccacccg ttgccgttgg 2284381 ccgccggaga gatctttcgg cttgcgatcc agataagatt gcagatcaag caatttcgct 2284441 gcggcaagca cccgctcgcg gatctcggcc ttgccgatct tggcgacctt caacgcgaag 2284501 cccatgttct gcgccaccgt catgtgcggg tagagggcgt agttctggaa caccatggcg 2284561 acatcacgat ccttgggatc gacctcggtg acgtcgcgct cgccgatccg gatacgccca 2284621 cagtccagcg tctccaagcc agccaccatc cgtaacgacg tcgtcttgcc acatccggac 2284681 ggccccacca ggacaacgaa ctcgccatcg ccgacgatca ggtcgagccg atccagggcc 2284741 ggtcggtccg tgccgggata gcgccgggtt gcctgctcaa aactcaccga agccatggtt 2284801 acccgccgag cccagtcacc gcgataccac ggacaaagga acgttgtgcg accgcataaa 2284861 ggatgaccaa cggcaccagc atcagcatcg acgccgccat cagcaccggc caccgggcga 2284921 cgtattcgcc ccgcaatcgg accaggccaa gggtgagcgt cgccaggctg tttcgctgga 2284981 tcatcagcag cggccacaga aagtcgttcc acacgttgac ccaggtgagc acacccagca 2285041 ccagcaccgc gggacgtgaa tgcggcagca gaatccgcca gtagatctgc cacggcgagc 2285101 aaccgtcgag aatcgcggct tcctcgagat cggtcggcag cgtgcggaag aactgccgca 2285161 tcaggtaggt accgaacgcg ctaccgaaca atcccggcac gatcatcgcc cacggcgtat 2285221 ccacccaccc cacgatccgc atgagaatga cctgtgggat gacggtcacc gtcaacggca 2285281 ccatcaaagt gctcaagtac aagacgaaca acgtatcgcg gccccggaac tgcagtcgcg 2285341 cgaaggcata accggccaac gagcagaaga agacctgccc ggcggtgaca catccggcat 2285401 acagcacggt gttgaagaac atccgccaga acggcatcaa cgcgaacacc tcgcggtagt 2285461 tggaccattg cggatgcgac gggaacagcg tcggctcggt cacctcgccg tccgccttca 2285521 gggagcccga cagcgcccag atgataggga acagcgcgca ccaagcgatc ccgatcagtc 2285581 ccgcgtacag ggcaagccca cgaatgaagt ggcggtggac tattcgatca gcccagccca 2285641 cgggacgcct cccaggagcg ccggtgcgta attcgcaact gcagcacggt caacaccagc 2285701 aagatggcga acatcaccca cgccaacgcg gacgcatagc cgaattccag gaacgaaaac 2285761 gcgtgctgga acagcatgat gcccaaaaca taggtagccg tctcgggacc accgttggca 2285821 ccggtaagga cgtagacaag gtcaaacgcc tggaacgcgt ggatgatcga tatgacaacc 2285881 acgaatgaca atgccccccg gatcagcggt accgtgatgg acacgaactg gcgaatctcg 2285941 ccggcaccat cgatcctggc cgcctcgtac acagtctccg gaaccccctg catcgcggcc 2286001 agcaggacga ccgtggcgaa gggcacactg cgccagacgc tgaccaggca aagcgagacc 2286061 atggcccatc ggggttcgat tagccatggg atggggccga ttcccagcca gccgagcatg 2286121 atgttgagta ggccattgtc ggtgttgaag acgaactgcc agacgaccgc catcaccacc 2286181 gaggaaatcg ccaacggcaa gaagacgacc gtccgaaaga ggctgatgcc tttgattttc 2286241 cggtttagaa aggcggcgac gacgaggctg acgataacgg tcggtaccac ggtgccgacg 2286301 gtgtaaaccg cggtgttgac cacggcgatg agaaacagcg gatcagaagt gaagaggttt 2286361 ctgaaattgt ccaacctcac gaacgtcgca tgcgtaaaca agtcccactt ctgaaagctc 2286421 atgtacagcg agaatcccag cggaaacagc atgaacacca caacggcagc caagttcggc 2286481 gcgacgaaca tacgccccgc ccacgcgcgt cgccccctgc gccgtgtcat ggattgcgca 2286541 gcacttcatc gacggcctgt gatagcccgg tcagcgaggt cgccggccgg gatccacgca 2286601 gcacgggtcc gaagtagcgg tccatcaggg cggcgatctt ctcccaggcc ggggtcaccg 2286661 gcaagccttc cgaataggcc ggcccctcgc tgagcacggc aagattgcct accctgcggt 2286721 gggcgttggc gaatccgtgc gagttgatcg ccgatctcag caccggcacg aacaggcggg 2286781 attcgccgat caatgcctgc cccaccgggc cggtcgcgaa ctttacgaat tcccacgcct 2286841 ggtccttgcg tcgactggtc gccgcaatgg ccagcccggt gacaccgata tctgaacagg 2286901 cggctcgtcc gcgcggaccg atgggcagtg gggcgacgtc gaagtccaga ccgtcggcac 2286961 ggtcgaacgt ctgatatcgc cagtgcccgg ccaacgcgat cccggccttg cccacagaaa 2287021 acaggtccgc cgtcgacatc gactgctgct cagcagcgct gggggccacc ttgtgcttgt 2287081 tggtcaggtc ggcgtagaac tgcaccgctt cgaggaaccc atcgtggtcg aaattgaggt 2287141 gggtgggatt catccgcgga accgaccacg gtacaccgtt attcatggcg aacaacccgg 2287201 cagcgtagaa cgagacccac gcgttgacga agccccattg cctgtcccgt cccgaccggc 2287261 cctgcttggt aagcgcctgg gcggcatcca ggaattcggc gaagctccat ggccgttccc 2287321 agctaccggg cggcggtggc acgccggcgt cgtcgaatag ctgtttgttg tagaacaaga 2287381 agttgccgga ccattgctcc ggaaaggcgt actggcctcc gttgaacgtg aaagtctcat 2287441 acagggcccc gatgctgtcc gatttcagct ccgcggcgaa agcctggtcg cgcgccaata 2287501 gcgtgttcag gtcaagcaac accccccggt cggccagttc ggcataggtc agttcccatg 2287561 ccatcagcac atccggacac ttgccacccg cgcaaaacgt tgcgagctgc tgcatgacgc 2287621 cgggtccgga caacagggcc cgtaccttga tatcgggata gcgccgctgg aattcgttga 2287681 cgacgcgcat ccggggacgg agctcgtccg gattggctgc aaaaaagaaa gtcaacgcgt 2287741 catcgtcatc ggcagcacac ccagcggccc agggagccag cgaggccgca gtaagcgcgc 2287801 ccgcaccccg taacagactg cgccgctcga acggcttatt gaccatcgtg ctcccgattt 2287861 tgggtcctgt ggtacaacga ccgtcaggct gggaagtacc gaatccgatt gatccggttg 2287921 ccgcgccacg gcacgtcggc gaacatgatg ccgcgccggt ggtccgacgc gagcgacacc 2287981 gcgacggtgg atccggcacc ggtcaccttg gtccagctag ccccgcgcag ctgctcgaac 2288041 agctcgacaa tgtcgagtag ctcatcctcg cctaaagtca ttgtggcagt gcgtgataac 2288101 gcgtgatacg cagcggactt gtctgcccgg gacgcggcgt tgaggaacgt ttccaccagc 2288161 ttcttgtgcc gccggcccgc ccggcgaaag ccggtcagga atcctgcggt gccgcccaac 2288221 ccttgattgc ctagcagcgc tcgcgacagt tgcagggcgg gtcttgtggc ccccgatccc 2288281 gtgcgcagaa actgcagcat catcgccggc aactcccagt acgcccgcag tgcggcaatc 2288341 tgccactcgc cggtaaccgg tcgtaggtca tagcgtagga aggcgggaat gaacaccgtc 2288401 acagccgagt ccatcgcgac ctcgagttcg agatcgcgca gcaccaccgt gccggagacg 2288461 atatccagat cgcgatggaa cgtgatatcc cgcggcccga tgaaggtgtc gtagaagcgg 2288521 ccgatggcct catgccccac ctgcggctgc gaacccaccg ggtcttcgac ccgcgcgtca 2288581 ccggtgaaca acccgaccca gccggcgcgg tcgtgcgcgg cggccgcttg cggcgagcgc 2288641 tccaccgccg ccaacagttc atcccggttc ggcggtgcca tcaggagctg caaaccaact 2288701 cgacgctggc ggtgcgcatc tcctccagcg cggcgacggt ggtatcggcc gacacacccg 2288761 ctgtcaggtc caccagcacc ctggtggcca agccattgcg taccgcgtcc tcggccgtct 2288821 ggcgcacaca atgatcggtg gcaataccga ccacatcgac ctcatcgacg ccgcgttgcc 2288881 gcagccaatt cagcagtggc gtgccgttct cgtcgactcc ttcgaagccg ctgtacgctc 2288941 cggtgtaggc acccttgtag aacaccgcct cgattgccga cgtgtccaga ctgggatgga 2289001 agtccgcgcc gggagtaccg ctgacgcaat gcggtggcca cgacgaggaa tagtccggtg 2289061 tgccggagaa gtggtcaccc gggtcgatgt ggaagtcctt ggttgccacg acgtgatggt 2289121 agtccgccgc ttcggccagg tagtcgctga tggcgcgggc cagcgcggcg ccaccggtta 2289181 ccgccagcga gccaccctcg cagaagtcgt tctgcacgtc gacgatgatc aacgcccgca 2289241 tacgtccacc atacgttcgg gcgactgccc gggcagtttg cctaccgacg cggcagccac 2289301 agatataggg tccatgacgc cgcgacgatc gcgaacatga ccagctgagc ggcggccacc 2289361 caaccggcgg gatagatcac gccggtgatg tagtgagcga caaatccgtc cggtgacaga 2289421 ggtgtcatcg cggccttggt gcgagcccag cgctccaccc aggtcagcgg gcagtcgacc 2289481 cgcttagcgg cgatgccgat cccccatatc accgccggaa catgcagcca catcgtgcgt 2289541 cgccaccgca gggcaaggaa accgccggca aggacgtaag cgatgaaagc gaagtgcatt 2289601 accaccgttg atacaacgac ggtttcgtac atctctcggg ttgcctttcc aggtcgcggc 2289661 gctccggcca ctgacagaaa aggttcaatt cgccagcgaa aacccgtccc atgcgatccg 2289721 gcggtgctga tgcggatcga actcgatgcg gcacctgcgg tcgaaaacca ggacggcacg 2289781 gtcgtcttgg gtgtaagccg gccagtcgtc gcccggaaca ccaatttggc tgaaacaacg 2289841 ccagcggcgt tgcacctcgt tgctgacccg aagggcggca cggcggtcgg cggcggcggt 2289901 cagcaatgcg ccaaatctgg tgcgatagat gtcgaagacg gcaaacagtt cggtggcatg 2289961 ggtggcgccg aaacccgacc agcgcagcgt ccgtggcgcg tagtcatatc ggtataggta 2290021 ggtgggcgca ttggcgccgt gagcctcggc gatctgccag gccgccgagc taaaggcgaa 2290081 gtcaccaccg agctggatgc acgccgaggg cgcagggtaa ttcgggtagg cggcggtaat 2290141 gcgttcacga tcggccggtt tcatgcccga cagtagctct tcaaccatcg gttcgttggt 2290201 cggcagcatc cccagaaagc gggtgaacaa ccgaccctct tcggcgttgg ttcccacgat 2290261 cagcggaacc gcgtgcaccc ggccggaccg catcgcctcg acggggtcca tgggcaggta 2290321 gtcgtcgccg aacaccggac caatcgggaa ggcgcccagc cttttccgca ttccctggcg 2290381 aatcaggtgg tgttgggctt ccaccagctg cgcgggggac gcctgcatca acgcattggc 2290441 ggcatcctgg gtacgcgcgc cgatcagatt ggcaaagcgt gccgcgaact cggcggccac 2290501 ctcgcgcgaa cgcaccatgc ccgccgctgg gctttccgag atcgccctgg cgaataggcc 2290561 tttggcggct ggcaccgcca acagtgtggc ggtgatatgc gcgcccgcgc tttcgccgaa 2290621 aatggtgaca ttgcctgggt caccgccgaa ctccgcgatg ttgtcgtgga cccaacgcaa 2290681 cgccaacacc aggtcgcgca ggtacacgtt gctgtcgagg gtgatctgcg gtgtcgacaa 2290741 ggacgacagg tcaagacacc ccaacgcgcc cagccggtag ttgaccgaca cgtacacgca 2290801 gccgcggcgt gccaacgctg cgccgtcgta tatcggggtt gccgagctgc ccaggatgta 2290861 gcccccaccg tggatgaaca ccattaccgg cagcggctgg gtggctggct cttcgggtgt 2290921 gacgacgttg agggtgagac agtcctcgct gcgggtctgg tacctgccga tgcccatcac 2290981 ggtgtagcgg cgctgctgag gagcacagtt ggcaaacgtg tggcagtgcc gtacgcccgg 2291041 ccagggctgc gctggctgcg gcgcccggaa tcgcagcgag cccaccggcg ccctggcgta 2291101 agggattgat cgccaacggt gcacaccgtc gcgcgtgaag ccttcaacga tgccggtggc 2291161 cgtgcgggcg cgcacggtgc gctcgtgcat agacccgacg gtagccgact ccagggccac 2291221 gcggcatgcg cagtgcagga atgggggcgg ggcggctagc ctgtcgggat gcggatcgcc 2291281 gcgctggtcg cagtgtcgtt gctgattgcg gggtgctcgc gcgaggtcgg cggtgatgta 2291341 gggcagtcgc agaccatcgc cccgccggcg cccgccccgt cggcggcgcc gtcaacacca 2291401 ccggccgcag gagcgccgat caccactatc gtgtcttgga ttgaggcggg tcacccggtt 2291461 gatcccgccg cctatcacgt cgccacccgc gacggcgtca ccacccagct tggcgacgac 2291521 gtcgcgttca gcgcttcgtc gggcacggtg gcctgtatga cggatgccag gcacactagc 2291581 ggcaccctgg cctgcctggt ccgactcgcg aacccaccac cccggcccga gacggcctac 2291641 ggcgaatgga agggcggctg ggtcgacttt gacggcatcc acctgcaggt cgggtccgcc 2291701 cgcgccgacc cgggcccgtt cgtctacggc aatggacccg agctggccaa cggggacacg 2291761 ctgtcgatcg gggactaccg ctgccgctcc tatcaagcgg gcctgttctg cgtgaactac 2291821 gcccatcagt ccgcggtccg gttcgccagc gccgggatcg agccgttcgg ctgcctgaag 2291881 ccggcgccgc cacccgacgg cgtgggcgtt gcgttcggct gctgaggtgc acccgtcaca 2291941 agctgacacg acgaactagg ttcagcgact gagatcgctt cccggaagcg ccggcccatc 2292001 ttcggacgcc agctcaacca catgaatttc cccggtagcc ccgtcgacct ccaccaatgc 2292061 tcctggtggc agaaaccggg tagctccctg ggcgtcgacc acgcaaggga atccgaactc 2292121 gcgggcgacc accgcggcat gtgacatcgg gccgccgagc tcggtcacca cggcggcggc 2292181 gtagcagaag gccgcggtgt atccgacgtc ggtgacctcg gcgaccagaa tctcgccggg 2292241 ctgcaaatcg tcgatggtct ccggacgcac gatccgcacc cggccgcgca cccgtccgcc 2292301 gcagacgccg actccgcgta gagtgtcccc ggctgccagc gccgccgccg acgaaggcga 2292361 cggttcccag cttccgctga acaccgtggg cggaacgatg ccggcaagcc tgcgctgttc 2292421 ggcacggcgc cgagccacca gccccgacac gtctgccggc agcgcatcga tttcatcgac 2292481 caagaggtag aacacatcgt ccggggtgtc gaagacgccg gcctcggtca gccggcgccc 2292541 gtactcccgc agcagagcac gcagcaccca gatggcacgc accatcctgt cgcggcggac 2292601 ctcgcggtcg cggagctggc gggccgccag caacgcaacg ggcttggccc gcaacggaat 2292661 caccggcgtc ggcggttgcg gcgctggcac cgcacgtagc gtcttggcta ccatccgcac 2292721 cagcaactcg gggttgtcgg catagctggt ggcggccatc tcgacttccg ccggaccgcg 2292781 gtgcccgatc agcgtcagct cggccagcac cgcggaatgg aactccggcg cctcgacagc 2292841 tagcttgtcc agacgctccc ccggctcggc cagcaaccga atcacgaccg gatcccgccg 2292901 tgccgcggcc accagccgct gcaccgcctc caccgatcgc gcgctgacca actccggccc 2292961 ggccgccggt gcggtgtccc gcccgcacaa tcctcgcaac aacacgttga acgccgcaca 2293021 cagcatgaac gaccccgagg ccagcaccca gccgtgcacg acgtggtcac gtgccaacaa 2293081 gatcaggctc aacaaccggc ggtcgtcgtg ggtagcgagg ttatcgaagg cgagacgctc 2293141 caggcgatcg acgtcggcga cataggcatc ggtgtcgcgg ggtgagccgg cggacaggcc 2293201 caccaggttg acgccgaaca ccccgatatt gcgtagcgta cgtaaccacc tgcgggcacg 2293261 gctggattcc gatggcggtc gctgcgcgcc aaagatgggc agcgaagcca tgctgggtcc 2293321 gaagaacccg ctgttgctga cgatcgtcgc cggcttggcg aaggggacgg ttgctgccat 2293381 gaaatgcgcc gacgtgatgg ccccgtacag ccggtgggcg aacaccgcga cggtccgcat 2293441 ggcgatttcg cgctggatca ccccgctggg ccgcagccgc tcggcgatgc ccaccccgcc 2293501 ggcacgcagg ccccgcacag tcaccgatgc cgacgacggc gagaacgggc cgggcagcgc 2293561 ctccgagagg ttggtggcca gataggtcgg gaagcgcggg tcgatcggcg tgtcgaactc 2293621 gccgttggcc ccttctgggc cggccaatct gggtgcgaca ccgtcgtctg ccggggagtc 2293681 gaccgccgga aggtcctgga tgttagccag ccgccaaggc agggaaaatg ttcgctttcc 2293741 cagaccgatc cggccacgca ccgccagggt gaagtcttcg agacactcct cggcgttcca 2293801 ggccggctgg aatccccagc ggtcacgcag gagcgtgaca tccatcaatg gcgcgctgtg 2293861 caggagttcg agttcggcga acgaggtgac acgtcgtagc actggggagc caataggcac 2293921 catgggccgc ccgagcgcgg ccgcaatgcg ccgaaacgtc aactcgccag gggcggcgag 2293981 attaacaggg ccgctgtcga ttaccgtgtc cagtagcgcg cgaaccaaca gccgctgcgc 2294041 gtcgtcggag tggacgactt gtacgacgcg atcagcatac ccggcgggta acaccggcag 2294101 agcaaacagc cgctgcaccc agttgtcgac atttcgaccg aaaatgagcg cgcagcgcac 2294161 ggcgacccat tccaggccgc agtcggccag catctgctcg acgcggggtt ggtgaccgct 2294221 ggacgtgaaa acgatgcgcc cggttccggt ctcggccatc gccttgagga cattggcggt 2294281 gccgtcgata ttgatgtggt cgtttcggcc acgcacccac gcacaatgcg cgaccacatc 2294341 cgcacctgtc atagcacttt cgacggcggt ggcatcccgg atatcggccg caatgaaatc 2294401 cgctgagctc ggccagctgt ccggtcgatg acgtgcgatt ccgacgacct cgtgaccctg 2294461 actcagcaat ctggcggtca ggccgcggcc gagaactccg ctggccccgg tgacggcgat 2294521 tctcacggtc ctactcgtcg tcgttccgaa acgccgcgtt gaccaggtcg tcgaggtcca 2294581 tgtccgcgat ctcttgttct gcggtcggcg ccaacgccgg gtcctggccg ctggtttcgg 2294641 tttcatttgc cagcgcgagc aacagatcca gcactcccgc ctgccgtaag cgcttgaccg 2294701 gaatggacgc cacaatgcgt tgtagttcgg cttccccggc cgccacggct gaagtgtctt 2294761 gcggtgatga gccgagcagt tctcgacgca tatagccggc cagcgccgcg gagttggggt 2294821 agtcgaagat gagcgtgggt gaaagcgcca ggccggtggc ggatttgagc cggttgcgca 2294881 tttcgaccgc ggtgagcgag tcgaaaccca actcctggaa tgccctatcc gggtcgatgg 2294941 cttcggggct ggcgctaccc agcacggtgg cgatgtgcga gcgcaccagg tccagcagga 2295001 cggcgtgttg ctcgtcttcg ggcagtcctt ccaggcgttg cagcagagcc gatttcgatt 2295061 tcgccgcggc caacgagtca tcgacctggc gcctggtcgg cgcgttgatc agatcgacga 2295121 acatcggcgg caacgtgccg ccatcgaact tgaccttcaa cgccgcaaag tcgatgtggg 2295181 cgggcagcat gaatggctcg tcgacgatca ttgcggtgtc gaacaattgc agggcgtcag 2295241 cagacgacat cgccacgatg ccgtcgcggg cgaagcgttt gaagtccacc gtcgccaggc 2295301 cgccggtcat ggcgctggcc tgatcccaca gaccccagcc cagggagatg gccggcagcc 2295361 catgggcccg ccggtgggcg gccagcgcat ccaaaaacga attggcggcc gcatagttgg 2295421 cctggcccga cgatccgacc agcccggcca tcgacgaaaa catgacaaac gccgacacat 2295481 ccaggtcgcg agtcaactcg tgcaggtgcc acgccgcgtc caccttggac cgcaacacca 2295541 catccacccg atccggtgtc agtgacatca ccaccgcgtc gtcgagtgcg ccggcggtgt 2295601 ggatcacgcc cgacaatgga tgctgaaccg gaatatcggc gatcaccttg gccaacgccg 2295661 ctcgatccgc cgcgtcacag gccaccacct gcacctgcgc accggcggcg gccaactcgg 2295721 ccaccagctc cgcagccccg ggagcatccg ggccgcgccg gctcaccaac accagattgc 2295781 gcaccccatg acgagccacc acgtgacggg ccaccgccga acccgccatc ccggtgccac 2295841 cggtgatcaa caccgtgccc gccgcccacg agccgggcat cagcatgacg accttgccgg 2295901 tgtggcgcgc ctggctcaga taacgcaacg ccgcaggcgc gcaccgcacg tcaaaagtgg 2295961 tgaccggcaa cggccgcagc accccatcgc cgaacagcgt ggcgagctcg gcaaggatct 2296021 gcgcaatgcg gtccggtccc ggttcgaata ggtcgaaggc gcggtagcgc acgcccgggt 2296081 actgctgggc gatcacgccg gggtcgcgga tgtcggtctt gcccatctcc aagaacaccc 2296141 cacccggtgc caccagacgc agcgacgcat ccacgaattc accggccagc gagtccaaca 2296201 ccacgtcgaa ccctcgaccg ccagtggccg cgcggaactt gtcctcgaac tctaggctac 2296261 gtgaatcgga tatgtggtcg tcgtcaaagc ccatggcgcg caaggtgtcc cacttaccct 2296321 tgctcgcggt cgcgaacacc tccaacccca gatgccgagc cagctgcacc gccgccatgc 2296381 ccaccccgcc ggtgccggca tggatcaaca cgcgctggcc cgacctagca gcggccaaat 2296441 ccaccagcgc gtagtgggcg gtggcgaaca ccaccgaggt ggtggcggcg gccgtgtgcg 2296501 accaccccgc cggcaccttg accagcagcc gctggtcggt gctggcgacg gttccggtgc 2296561 cctcggggaa caggcccatt acccggtctc cgaccgcgaa agatcccttg ttcaagctgg 2296621 tttcgataac gacgccgcag gcctcaacgc ccatgaccgc gtccggatcg ggatacagac 2296681 ccagcgcgat catgacgtcg cggaagttgg cggcaatcgc ggacaccgca actcgaacct 2296741 gcccggggcc cagcggcgcg tcggcatcgg gaatcagctc cagccgcaga ttctcgaagg 2296801 tgccggcggt gctcatcgcc aaccgccacg gccggtcact cggaggaacc aacagcccgc 2296861 ccaccgcgcg gctaccgtgc acccgcgccg tataaacctc cccgcgccgc cacaacacct 2296921 gcggctcgcc tgtcgtcact accgccgcca gggccgaatc gtcgagcggc gcatcggaat 2296981 cgaccagcac gatccggccc ggatgctcgg tctgcgccga ccgcaccaat ccccatacgg 2297041 cggcacccgc caaatcggtg acatcttcgc ccggcaatgc caccgcaccg cgggtcatca 2297101 ccaccaaaac ccctgcccca tcacgggtta gccacgactg caacacatca agcaccgaac 2297161 tcgtggcggc atacacgccc gccactacgt caccggccag aggcaccgac tcaaacacca 2297221 ccgccgccga gtcctccgtt gtcccccagg cgcacaccgg tagcggctcc accgcggccg 2297281 atggctgcgg cgaccaggtg acctcgaata gccggtccgg acccgagctc gacaccgccg 2297341 cccgcaattg ctgatcggtc accggtcggg ccagcatgga agcgactgac aacaccggca 2297401 atcccaaccc atcggccagc tcgatcgaca ccgccgacgg acccactggc gcgatgcggg 2297461 cccgcaccgc cgacgccccc gctgcatgca acgagacccc ctgccaggag aacgggacca 2297521 acaccgaacc ttggccacgc tcggcgcttt ccgcgctcaa caccaccgcg tgcaaggccg 2297581 catccagcag caccggatgc accccgaagc cggtgaccga gaccccggca tcggcgggca 2297641 acgccacctc cgcgaacacc tcatcacccc ggcgccacat cgcggtcagt ccccgaaacg 2297701 ccggcccgta gccgtatccg cgctcggcca gctgctgata gccgtccgcc acctcaaccg 2297761 ggacggcgcc cgccggcggc cacatcgcta gatccgcggt cggttccgcc gacccggcgc 2297821 gcagcgcgcc ctcggcgtgc aacacccagc cggtaccgac gtcaccacgc gaatacaccg 2297881 acaccccgcg cacgccggac tcgtcgggac cattgacgac cacctgaacc gccaccgaac 2297941 cggatgcggg caacaccaac ggcgcggcca gcgttgattc gtcgacaacg ccacaaccca 2298001 cttcgtcgcc ggcgcggatc gccaactcca caaatcccgc tcccgggaag atcgtcacgc 2298061 cggcaacgga gtggtcggcc aaccagccct gcacgctggg cgacagccga cccgtcaaca 2298121 ccaccccgcc cgaggccggc agatcgatca ccgcgcccaa gagcgcgtgc tcactggccg 2298181 ccaaccccaa gccggccgcg tccgccgcga caccatcacc ggacagccaa aaccgccgcc 2298241 gttggaaggc atacgtcggc aactcgacaa actgcgcctc gcctaccaca gcgcgccaat 2298301 ccaggtccat accggtgaca aacccttgcg cgacggcgtt ggtcaacgtc gccggctcgg 2298361 ggcgatcctt gcgcagcgca gacatcgttg tcaccgcaac gtcgggcaac gactcttcga 2298421 tcgacgcaac aaggccaccg ctgggcccga cttcgaggaa tcggctgcct ccggccgcct 2298481 gcgcgaagcg cacactgtcg gcgaaccgca cggcttgccg gatgtgacgt cgccagtagg 2298541 ccgctgatcc gaaatcgtcg cccgccaact gcccggtcac gttggagatg actccgatgg 2298601 tgggccggcc gatggcgatt ccggcagcga cggctgcgaa ttcgtcgatc atcggatcca 2298661 tcaacggcga gtggaacgcg tgggaaaccg ccagctggtg gactcgtcgt ccgtcggcgc 2298721 gcagctggtc ggccaccgcg gccacggcgt tttgtgcacc cgaaatcacc agtgacgctg 2298781 gaccgttgac cgcagcgatg tcaacctcag cgctcagcag cggccgcacc tcttcctcgg 2298841 cggcttgcac ggcgaccatc gccccaccgg ccggcaacgc ctgcatgagc cggccgcggg 2298901 cagccaccaa caccgcagcg ttctccaacg acaggacacc ggcgacatgt gccgcagaca 2298961 actcaccgat cgagtggccc atgacaaaat ccggtcgtac accccaggat cccagcaacc 2299021 ggaacagggc aacttccacc gcgaacagcg cgggctgcgc gaattccgtg ctgttcagta 2299081 ggttttcgtc gtgaccccac atcacttcgc gcagtgggcg cagcagatgc cggtcaagtt 2299141 cgcccactac ggtgttgaac gcctcggcga acaccgggta tccggcgtgc aatcccattc 2299201 ccatgcccag ccattgggag ccttggccgg ggaagacgaa caccgtctta cccgccgcag 2299261 tcgccgtgcc ccgaacaacc gagccgccca actggtcacc cgccagctca tcgagcccgg 2299321 ccaacaaccg atcacggtcc ccgccaacca ccaccgcccg atgctcaaaa accgaacgac 2299381 ccgccaacga ccaccccaca tcggcaacat cgaggccatc atcgccacgc acgtacgcgg 2299441 ccaaccgagc cgcctgcccc cgcaacgccg actccgactt cgccgacacc acccacggca 2299501 ccaccggccc cgcccaacca gcctcccgcc gcggcaccac cggcaccgcc tcgataatca 2299561 catgcgcatt agtgccacta atcccaaacg acgacacccc cgcacgacgc gtccgagcac 2299621 cagcaggcca cacccgcggc gcggtcaaca actccaccgc ccccgccgac caatccacat 2299681 gcgggctagg cacatccacg tgcaacgtcg ccggcaacag ctcatggcgc atcgccaaca 2299741 ccatcttgat caccccggcc acccccgccg cggcctgcgt atgacccata ttcgacttca 2299801 ccgaccccaa ccacaaaggt tctcccggct ccccccgatc ttgcccataa gtggccaaca 2299861 acgcctgagc ctcaatcgga tcccccaacg tggtcccggt cccatgcccc tccaccacat 2299921 ccacctcggc cgcgctcaac ccggcattgg ccaacgccgc ccgcaccacc cgctgctgcg 2299981 aaggaccatt aggcgcggtc aacccattcg acgccccatc ctgattaacc gccgacccga 2300041 ccaccaccgc caacaccgga tgacccaacc gccgcgcatc cgaaagccgc tgcagcacca 2300101 acatcccacc gccctcggag aatccggtgc cgtcggccgc cgcggcgaat gccttgcagc 2300161 gcccgtccgg ggataatccg cgccagcggc tgaattccac gaagatgtcg ggtgtggcgt 2300221 tgacggtgac gccgccagcc agcgccagat cgcactcccc cgaccgcagc gatcccaccg 2300281 ccatatgcaa cgccaccaac gacgacgaac acgccgtatc caccgacacc gccggaccct 2300341 ccaaccccag cacataggcc acccgacccg aggcgacgct ggacaattgg ccggtcagcc 2300401 ggaagccttc taccggctcg gcggcgaaca tgccgtagcc ttgcgtcatt accccggcga 2300461 ataccccggt ggcgctgccg cgcaatccgg tcggatcgat accggcccgc tccaacgcct 2300521 cccaggacaa ctccagcaac atccgatgct gtggatccat cgcgagggcc tcgctcggcc 2300581 ccaccccgaa gaaggcgggg tcgaagtcgc cgaccccgtc cacaaagccg ccggtgcggg 2300641 tgtagcacgc acccgcggcg tcggggtcgg ggttgtatag cccggccagg tcccacccgc 2300701 ggtccgccgg gaattcggag agcacgtcgc ggccctggat cagcatgtcc cacatgtcgt 2300761 ccggggaatt caccccgccg ggatagcggc acgccatgcc cacgatcgcg atcggatcct 2300821 cgctcgtggt gcgtaccgcg ggtgtgtgct tgatttcctg tgggaggccg gcaagttcgg 2300881 tgcggatata ggaggccagc cgattgggtg tcgggtagtc gaagatgagc gtgggtgaaa 2300941 gtgaaaggcc ggtggcggat ttgagccggt tacgcatttc gaccgcggtc aacgagtcaa 2301001 aacccaggtc ctggaacgcc ttgtcggggt cgatggcttc tggcgtgatg ttgcccagca 2301061 cggtggcgat gtgcaaacgc accaggccta gcaagacggc gtgctgttcg gcttcgggca 2301121 gcccgtgcag gcgatgcgcg agcgccgatt tcgactttgc ggcggccacg gagtcgtcga 2301181 cctgacggcg ggtcggcgcg ctggctaggt cggagaacat gggcggcacc gccaccgcat 2301241 gggctcgcag tgcggtgagg tcaatgcggg cgggcgccag gaatggctcg tcgacgatca 2301301 ttgcggtgtc gaacagttcc agcgcctcag cggtggacag cgccagcacc ccttcacgac 2301361 ccagccgggc caggtctgcg gcgtccaggc cgccggtcat ggcgctggcc tgatcccaca 2301421 gaccccagcc cagggagatg gccggcagcc catgggcccg ccggtgggcg gccagcgcat 2301481 ccaaaaacga attggcggcc gcatagttgg cctggcccga cgatccgacc agcccggcca 2301541 tcgacgaaaa catgacaaac gccgacacat ccaggtcgcg agtcaactcg tgcaggtgcc 2301601 acgccgcgtc caccttggac cgcaacacca catccacccg atccggtgtc agtgacatca 2301661 ccaccgcgtc gtcgagtgcg ccggcggtgt ggatcacgcc cgacaatgga tgctgaaccg 2301721 gaatatcggc gatcaccttg gccaacgccg ctcgatccgc cgcgtcacag gccaccacct 2301781 gtacctgcgc accggcggcg gccaactcgg ccaccagctc cgcagccccg ggagcatccg 2301841 ggccgcgccg gctcaccaac accagattgc gcaccccatg acgagccacc acgtgacggg 2301901 ccaccgccga acccgccatc ccggtgccac cggtgatcaa caccgtgccc gccgcccacg 2301961 agccgggcat cagcatgacg accttgccgg tgtggcgcgc ctggctcaga taacgcaacg 2302021 ccgcaggcgc gcgccgcacg tcaaaagtgg tgaccggcaa cggccgcagc accccatcgc 2302081 cgaacagcgt ggcgagctcc agcatgtact gatgcatccg gggacgtccc ggttcgaata 2302141 ggtcgaaggc gcggtagcgc acgcccgggt actgctgggc gatcacgccg gggtcgcgga 2302201 tgtcggtctt gcccatctcc aagaacaccc cacccggtgc caccagacgc agcgacgcat 2302261 ccacgaattc accggccagc gagtccaaca ccacgtcgaa ccctcgaccg ccagtggccg 2302321 cgcggaactt gtcctcgaac tctaggctac gtgaatcgga tatgtggtcg tcgtcaaagc 2302381 ccatggcgcg caaggtgtcc cacttaccct tgctcgcggt cgcgaacacc tccaacccca 2302441 gatgccgagc cagctgcacc gccgccatgc ccaccccgcc ggtgccggca tggatcaaca 2302501 cgcgctggcc cggttgtacg tcggccaaat gtatgaatgc gtagtacgcg gtggtgaaga 2302561 cagccgagat ggcggcggct tcggcgtagg accagtcggc gggcatcggc agcagcagcc 2302621 ggacgtcgcc ggccaccagg gtgccgctgc cgtcggggaa gaatccgaac accgaatcac 2302681 cgaccgagaa ttcggtgaca ccggggccga cctcgacgac cacgcccgcg ccttcgccgc 2302741 cgagcagcgc gtcgtgggtg aacatgccta gggtgatcat gatgtcgcgg aagttcgcgg 2302801 cgatggcgcg catggccacc cggacctggc cgggccccaa cggtgcgtcg gcgttgggaa 2302861 ccggctcgag ccgcagattt tcgaaggtgc ccgcgctgcc cagacccaac cgccatggcc 2302921 catcgcccgg cggcaccaag atggcatccg ccgcgcggct gccgcgcacg cgcgcggtgt 2302981 acacctgtcc gccccgcagc actacctgcg gctcgccagt cgccaacgcc atcgcgatcg 2303041 ccgcgtcgtc ggtggccgca tcggaatcga ccagcacgat ccggcccgga tgctcggtct 2303101 gcgccgaccg caccagcccc cacacggccg cgcccgccag atcggcgacg tcttcgcggg 2303161 gcagcgccat cgcgccccgg gtcgccacca ccagcacccc ggattcatgg tcggtcagcc 2303221 acgactgcac tgcggccaga gcctggtggc tgcgcacgta gctgccggct accggatctt 2303281 ggtcagccgc aaccgattca aagatctggt aggcgggggt aggccccggg gacgtggccg 2303341 ccgacgcggg cgaccagatc acttcgaaca gccggtcggg acccgagccc gacaccgccg 2303401 ccagcagctg ccgctcggtc accgggcggg ccaccatcga ggccaccgac aataccggca 2303461 gacccagccc gtccgccaac tccaccgaca ccgccgacgg ccccgccggc gcgatccggg 2303521 cccgcaccgc cgaggccccc gtggcatgca acgacacgcc ctgccaagcg aacggcaatg 2303581 cgagttcgtc cgggtcgccg gcgatcacga ccgcatgcaa gacggcgtcc aacaaagccg 2303641 gatgcacacc gaacccaccg actcccccgg ccgcctccgg cagcctcacc tcggcgaata 2303701 tttcctcgcc gcgggcccac atcgcggtca gcccgcgaaa cgccggtccg taccggtagc 2303761 cgcgtgtcgc caaccgctca tagccatcgg ccacgtccac cgtcacggca cctgccggtg 2303821 gccacaccga taggtccgcg cctggttcaa ccgacccggg ccgcaggata ccctcggcat 2303881 gcaaaagcca gcccgcttgc gcgtcagctc gggaaaatat cgacacacca cgggaattcg 2303941 aatcccggcc agcgtcgact accacctgca ccgcaacgga gccggtggcg ggcaacagca 2304001 ggggtgcggc cagcgtcagc tcgtcaagca ccgagcagcc gacttcgtcg ccggcgcgga 2304061 tcgccagctc cacgaatccg gtgcccggga acagcaccac gtctgaaacg gcgtggtcgg 2304121 ccaaccacgg ctgcacgttg ggcgacaacc gacccgtcaa caccaccccg ccggaggcgg 2304181 gcaggtcgac caccgcgccc agcaacgggt gttcgctcgc acccaacccc aaaccggata 2304241 cgtcggcgcc tgagccctcg gccgagagcc aaaaccggcg cttgtcaaag gcatacgtcg 2304301 gcagctccac atagcccgct ccgtccagcg tgccccgcca gttcacagcc acccccgcca 2304361 caaacgcgga cgccgccgag agcaggaatc ggtgcagccc accatctcca cgccccagcg 2304421 tggggacgac aatggcctcg ctgtcaccgt cggtgcacgc ggcgaatgtt tcctcgacac 2304481 cggtaatcaa cgccggatgc gggctggatt cgatgaacgt gcggtagccc tgctcgcagg 2304541 cgttgcgcac cgcctggtcg aatagcacgg tctggcggac gttgcggtac cagtagtcgg 2304601 cgtccaaacc agctgtatcc aaacgatttc cggtcaccgt agagaagaag acggtacgcg 2304661 tggatcgcgg ttcgatgccg gacagagctt cggcgagtgg gccacggatc gcctcgacct 2304721 ccaccgaatg cgaggcatag tccacctcga tccggcgggt ccgcagttcc ttggtggagc 2304781 acaccgcgat cagctcctcc agcgcgccca cttcgcccga caccaccacc gccgaggggc 2304841 cgttgacgac ggcgatgctg acccgatcgc cgaagggcgc caacaaatcc cgcgcctggt 2304901 cggcaccgca cgcgatggac accatgccgc ccgggccggc cagtccggcc agcaacttgc 2304961 tgcgcagcgt gaccacccgt gcggcgtcgc gcagcgacag cgcgccggca acgtaggcgg 2305021 cagcgatctc gccttgcgaa tgaccgatca ccgcatccgg atgcactgcg accgacttcc 2305081 acagctcggc cagtgacacc atcaccgcga acagcacggg ctgcaccaca tccacgcgat 2305141 ccagtcccgg tgcaccgggg gcgccacgca gcacgtccac cagcgaccag tcgacaaatt 2305201 ccgcgaacgc ctcggcacac gcgtcgatct gctgcgcgaa tgccggtgcg gtatcgagca 2305261 gttcgattcc catgcccagc cattgggagc cttggccggg gaagacgaac accgtcttac 2305321 ccgccgcagt cgccgtgccc cgaacaaccg agccgcccaa ctggtcaccc gccagctcat 2305381 cgagcccggc caacaaccga tcacggtccc cgccaaccac caccgcccga tgctcaaaaa 2305441 ccgaacgacc cgccaacgac caccccacat cggcaacatc gaggccatca tcgccacgca 2305501 cgtacgcggc caaccgagcc gcctgccccc gcaacgccga ctccgacttc gccgacacca 2305561 cccacggcac caccggcccc gcccaaccag cctcccgccg cggcaccacc ggcaccgcct 2305621 cgataatcac atgcgcatta gtgccactaa tcccaaacga cgacaccccc gcacgacgcg 2305681 tccgagcacc agcaggccac acccgcggcg cggtcaacaa ctccaccgcc cccgccgacc 2305741 aatccacatg cgggctaggc acatccacgt gcaacgtcgc cggcaacagc tcatggcgca 2305801 tcgccaacac catcttgatc accccggcca cccccgccgc ggcctgcgta tgacccatat 2305861 tcgacttcac cgaccccaac cacaaaggtt ctcccggctc cccccgatct tgcccataag 2305921 tggccaacaa cgcctgagcc tcaatcggat cccccaacgt ggtcccggtc ccatgcccct 2305981 ccaccacatc cacctcggcc gcgctcaacc cggcattggc caacgccgcc cgcaccaccc 2306041 gctgctgcga aggaccatta ggcgcggtca acccattcga cgccccatcc tgattaaccg 2306101 ccgacccgac caccaccgcc aacaccggat gacccaaccg ccgcgcatcc gaaagccgct 2306161 gcagcaccaa catcccaccg ccctcggacc agccgacccc atcagcccgc ccggcgtaag 2306221 gcttgcaccg gccgtcgggt gccagcccac gatgcctgct gaattccacg aagaccgtcg 2306281 gtgtggcgtt gacggtgacg ccgccagcca gcgccagatc gcactccccc gaccgcagcg 2306341 atcccaccgc catatgcaac gccaccaacg acgacgaaca cgccgtatcc accgacaccg 2306401 ccggaccctc caaccccagc acataggcca cccgacccga ggcgacgctg gaggtcatcc 2306461 cggtcagccg gtagccctcg atctcctcgg ccaacattcc gtagccgccg acgatgagcc 2306521 cggcgaatac cccggtggcg ctgccgcgca atccggtcgg atcgataccg gcccgctcca 2306581 acgcctccca ggacaactcc agcaacatcc gatgctgtgg atccatcgct aacgcctcgc 2306641 tgggcgaaat accgaagaac gcgggatcga aatccgcgac gccatccacg aagcccccag 2306701 tgcgcgcgta cgacttatgg cgcacgtcgg gatccgggtc gaacaacccg gccagatccc 2306761 acccacggtc ggtgggaaat tctgacatca cgtccctggc gtcggccacc atctgccaca 2306821 gcccttccgg ggaatcgacg ccccccggga agcgacacga catgcccacg atcgcgatcg 2306881 gctcgctcga gcgctccagc aacgcacggt tggtgcgctt caggcgttcc acctggacca 2306941 gcgctttgcg cagcgcttcg gtcgcatgct ggagttgatc aaccattact aacctcgcct 2307001 aactctcgct aatattggcc gtcgccgacc gccggatgcg gctcccgccg agtcaccgaa 2307061 gttgctgcac aaaacgacgc cgtcgtacgg cgctctggcg caagttcgct ggtgagtatt 2307121 gccaactccg gcaggatttc aaagcgtcca atactccctg ggcaccagtg cgcccgtgca 2307181 aagcctgccg tccatggcgc gactgtaccc gcccgcccgt caacgccgga tgggcgcatg 2307241 tcaatgcggt gctagcggtg gtcttcacaa cacagccgca cgaatgcagc gactaggcgc 2307301 cggctcggcg ccacccatcg gcagccctgg cggcccggat cagctcgtcg cacagatcgc 2307361 gcagttcggt cgccgcggct ccttcgtcga gcgcggtgac gacatcctcg gcggcgcatc 2307421 gcacctggta aacacgatcc gacagatcgg ccgcgtcgtc ggctgacaac acgaccgcat 2307481 cggcgggcag cgccctcacc tcaccccggg tcagcatggc gcgctgctcg taagcccgct 2307541 gccggcaaga ctgccggcaa taccggcggc gacggcccat gccgacgtcg gtcacgtcac 2307601 ggccacacca cccgcacggc tgcggacggg cacgacgagt catgcctgca gacattagtc 2307661 cgcccgggtg tccgatcccg gtatcattga tggtcgcgcc gcgcgcgtcg cgtgccggga 2307721 actacgcaga cggccgcagc gtttgccaac cggagccagt cgccagtacg caacctacca 2307781 gcagagccca gggctcacag gacctaaagg agtagcgccc atggctgatc gtgtcctgag 2307841 gggcagtcgc ctcggagccg tgagctatga gaccgaccgc aaccacgacc tggcgccgcg 2307901 ccagatcgcg cggtaccgca ccgacaacgg cgaggagttc gaagtcccgt tcgccgatga 2307961 cgccgagatc cccggcacct ggttgtgccg caacggcatg gaaggcaccc tgatcgaggg 2308021 cgacctgccc gagccgaaga aggttaagcc gccccggacg cactgggaca tgctgctgga 2308081 gcgccgttcc atcgaagaac tcgaagagtt acttaaggag cgcctcgagc tcattcggtc 2308141 acgtcggcgc ggctgacccg ggaaccccct gctcccggcc gggcaatgtc cggtcgtgcg 2308201 cgtgcgtggt ccgagcgcga aaggcgtccc tcgatgcccc agcgggcgac tttgaccagc 2308261 gcctcacgaa tgttggaccc gctcatcttg gacacaccga gctcgcgctc ggtaaaggta 2308321 atcggcacct cggtgacgac gaacccgttg ctcaccgtgc gccaggtgag atcgatctgg 2308381 aagcagtagc ccttggagtc cacgccgtcc aggtcaatcg cttcgagtgc ttcgcggcgg 2308441 tacgcgcggt agccagcggt gatgtcgtgg atcccgattc cgagcgccag gcgcgaatag 2308501 gtgttagcgg ttttggacag gactagccgc cgccaaggcc agtttcgtac cgtccccccc 2308561 gcgacatagc gcgaaccaat cgcaagatcg gcaccagcgt cgacggcgtc cagcaggcgc 2308621 tgcagctgtt cgggcgcgtg gctgccgtcg gcatccatct cgaccagcac cgaatactcc 2308681 cggctcaacc cccaggcgaa acctgccagg tacgccgcgc ccaaaccgtt cttggcggtg 2308741 cggtgcatca cgtgggtgcg gccgggatcg gcctgcgcca gctcgtcggc gagctggccg 2308801 gtgccgtcgg ggctgctgtc gtcgacgacc agcacgtgca cggcggggca tgcttgcgtc 2308861 agccgccggt ggatcaccgg aaggttctcc cgctcgttga acgtaggaat gatcaccagg 2308921 acgcgctggc tgggacggtt acccggggct gggggcgccg gctggccggt ggtcatgtaa 2308981 ctcctcgatg ttgctctgtg tcgtccgaaa ccggatgagt gtcggccgcc ctgctcgggc 2309041 tgaatgagtt cgtcgtcgga ttcactcagg gccggcggac cggaggcctc agatctgccc 2309101 gggggcgcat cggaatcgtc attttcgccc tttggctccg agcgcctcgg acgcgggaac 2309161 cacccattct gccgcatggc gacgagaacg accgctgcgg ctgccccgac gagaatccat 2309221 tgcaggattg gaccccatcg agttgccggt gtcagcctcg tcttgaggcg cacctggctg 2309281 tccaggtatg cgggctggaa aaagtcggtc cggatcagct cacccccgtc tggtgctatc 2309341 accgcactga tcccagtggt accggcaacc accacgtatc tgtcgtgctc gacggcccgt 2309401 accttggcga atgccagctg ctgttcgctc attgtcttgt tgaaggtggc gttgttgctg 2309461 ggcacggtca acagctgcgc gccgcccaga atcgacttcc gcggggcgcg gtcgaagatc 2309521 acctcccagc aggtagccac cccgaccggg accccagcga tgcgcaccac accggtgccg 2309581 ttgccgggca cgaagtggcc ggcgcggtcg gcgtagccgg agaggtgccg aaacagccac 2309641 ggcatgggca ggtactcgcc gaagggctgc acgattgcct tgtcgtggcg gtcggccggc 2309701 ccggtgccgg gattccagac aatggccgta ttggtccact ccggattttc acgaggacgg 2309761 cccggaacat ccatcagggt gccgatcagg atcggcgcgc cgatcgcttc ggccgctgcg 2309821 gagatccgtt gaccggcgtc ggggttgacg aacgggtcga tgtccgacga gttctccggc 2309881 cagatgacga actggggttg ctgcgccagc cccgcatgca cgtcggcggc cagccgcaac 2309941 gtctcctcaa cgtggttgtc tagcaccgcc cgacgttgcg cattgaagtc gagaccgagc 2310001 cggggcacat tgccctggac caccgcgacg gtgaccgtgg gttcgccgcc cgatccgcta 2310061 cccgcatgcc gcacctgcgg ccagacgacg atggcggcga acaagaccag gcatatgcac 2310121 gcggccggca gcaccaccgc cggcggcgca tccccctgac caccggttcg ccaccacttc 2310181 tcgatttcca gcgcgatcgc ggtcaagccg catccgacca gcgctacccc cgttgacagc 2310241 agcgccacac cgccgagctg gaccaacggc aacagcgggc cttcggcttg accgaaggcg 2310301 accgaccccc acggaaatcc accgaacgga aggatcgact tcaaccactc ctgcgccgcc 2310361 caccccaccg cgaaccagat cggccaaccc ggcaacaggc gtaccacgac ggcgaacaga 2310421 ccgaagatgc cggggaacag cgcgcacgtc gtcgccagtg ccaaccaggg cccggggccc 2310481 accagctcgc cgatccacgg caacaacgag acgtagaaca ccaggccgaa tagcaggccg 2310541 tagcccagcc cacccaccgg tgtcgtcgcg cggtgggtca gcacccaggc cagcaatgcg 2310601 agcgcaacca ccgccgccca ccagcagttg cgcggcggga agctggcata caacagcaga 2310661 ccggccacga tgctgaccac caggcgcgtc agccgcgtcc gcaccgcggt ccgtgtggtg 2310721 ggcagctgcg ctgccaccca ggcgccaagc ttcaccaggc gccggcgggc cgcggcgccg 2310781 agccaggcag ccgcgctcgg cgcgtcgggg ccttccgccg gctcggccga cagttcgatc 2310841 tctggatcgg cggggctctc cgggccggcc tcggcgacct cagcgggccg cgccttccgg 2310901 ccgaaccatt ccctagccat agatgaccgc acctcgatgc acggtttggc ggcaacgcgg 2310961 caaggcgtcg gtcgggccca gccgcggcaa tgcgggtacc cgggagcgcg ggtcggtaga 2311021 ccagcgctgg actgcgtcgc gcggtgcgtc gacgtcaaag tccccggcgt cccatatcgc 2311081 gtaggacgcg ggcgcgcccg gcaccagggt gccgatccgg ccgtctcgaa caccaccggc 2311141 ccgccagccg ccgcgggtcg cggcagcaaa cgccgcccgc gccgataccc cgctgcccgg 2311201 cgtgcggtga ttgaccgccg cgcgcacgct ggcccaggga tcaaagcccg tgacgggcgc 2311261 gtcggagcca agcgcgaggg gcacgccttg ggatgctaac agcgccagcg ggttgagttc 2311321 gctgcctcgc tgggcgccca ggcggcgagc gtacatgccg tcgccaccgc cccacagctc 2311381 atcgaagttg ggctgcacac tggcgatgac cccccaagcg cccagcttcg cggcctggtc 2311441 cgcggtgacc atctccacat gctcgaggcg gtggccgcag cgggcgacgg caaccacgcc 2311501 gagatctgcc accacccgtt cgaaggcggc gactgcggcc gacaccgcag cgtcgccgat 2311561 gacgtggaag ccggcggtca cttcggcctt ggtgcatgct cgtacgtgcg cttcgatgcc 2311621 gtctacgtca aggtggcagg tgccgatgca gtcgggggcg tccgcgtagg gctcgtgcag 2311681 ccaggcggtg cgcgacccga gcgccccgtc gacgaacaaa tcaccggcca gccctcgagc 2311741 cccggtctcg gtcaccaggt cacgggcctg ggccggcgtg gccacggcct caccccagta 2311801 cccgatcacc tcgactccgt gctcgagtgc acgcagccgc aaccagtcgt cgagcccgcc 2311861 gatttccgga ccggcgcatt cgtgcacggc gacgacgccg gccgcggcta tggcctgcag 2311921 cgccacggcc cgggcgtcgg caagctggac gtcggtcaag aggtagcgtg cggcggcccg 2311981 ggctaggtgg tgggcatcac cggtcagcgg ccgctgggcc gtgtaaccgg ttgccgccgc 2312041 cagctcgggg accagccgcc gcagtccgga ggagaccaac gcggagtgcg agtcgatcct 2312101 ggccaggtag gcgggacagt caccgagaac cgcgtctagg tcggcggtgc tgggcgcagc 2312161 attctccggc caggccgact catcccaacc gtgaccccac agcggctgac ccggatggtc 2312221 ggccgcatag tcggcgacca tccgtaggca ctgcgcgcgt gaggtcgcgg gccgcaagtc 2312281 cagcccgctg agcatcagac cggtcgcggt caggtggatg tggctgtcca cgaaccccgg 2312341 cgccacgaat cggccgtcga gatcctgcac gtcagcgtct gggaactggt cgcggccgac 2312401 gtcgtcgctg cccaaccagg cgacgacatc gccgcgcacc gccatcgcgg tggcttcggg 2312461 gtgggtgggg ctgtacaccc ggccgttgac caggagtttg acgggaatct ggctcacacc 2312521 gctaattcga ccccggcgat ggaggttctg cggctacccg agggggctga agggtcaacg 2312581 gctcgacatc tatgacgtcg atgacctcgc catcaataaa gtccgggtcg gtgccgctct 2312641 cgccgaaggc cccggccatg ttggccgccg catcggccgt cagtggcacg ttccgcagga 2312701 aaccgcgcac ggcgatcgcg gtcagcccgg gtcgagcgag cgcccggatc ggcggcacca 2312761 gcagcaacag ccccatcgtc gtggtgacca gaccaggaac aagcaccaag accgaggcaa 2312821 cggtgaccag cgcgccgtca ctcagtgcgc ttcgtggttc cgccaagccg gatcgcaacc 2312881 acaggagccg tcggccgagc tgccagccac cgagcggcgc cagcagaccg aacccgagga 2312941 cgaacgtcgc cagcaacacc agcaaagtcc agccaaaccc gatcgtcgcc gccagcgcga 2313001 aaaccaccgc gagctcgacg acggcgtagc tgagcagcag ccgcgacacc acgtgacgcc 2313061 aacgtctgcg ggctaggccc gagttcctcg ggggcggaca tcgaggctgc agttagatga 2313121 cgctatgaca acgatagaga tcgacgctcc cgccggaccc attgatgcgc tgctgggcct 2313181 tccccccggc cagggcccgt ggccgggtgt ggtggtggtg cacgacgcgg tcgggtatgt 2313241 ccccgacaat aagttgattt ccgagcgtat cgcccgggca ggctatgtgg tgctcacccc 2313301 gaacatgtac gcccgaggcg gccgcgcccg atgtatcacc cgagtctttc gcgagctgtt 2313361 aacgaagcgg ggccgcgcgc tcgatgacat cctggccgcc cgcgatcacc tgctggccat 2313421 gccagaatgc tccggtcggg ttggcattgt gggcttttgc atgggcggtc agtttgcgct 2313481 tgtcttgtcg cccagaggtt ttggcgccac cgcgcccttt tacggcactc cactgccgcg 2313541 ccacctcagc gagacgctaa acggggcatg cccgatcgtc gccagcttcg gcacccgcga 2313601 cccgctgggt atcggcgcag ccaatcgact acgtaaagtg accgcggcca aaaacatccc 2313661 cgccgatatc aagtcctacc cgggcgccgg gcacagcttc gcgaacaaac tgcccggtca 2313721 gccgctggtg cgcatcgcgg gattcggcta caacgaggcc gcgaccgaag acgcgtggcg 2313781 tcgggtcttt gagttcttcg gccagcactt gcgcgccggc tcgcctggtg agccttaggt 2313841 acgacttcga ctccccgcgg atgccgatga ccttgtcccg tcggagggcg gcggggctgt 2313901 catgtccgcg tgcaccccga aggcgagatg aacatgattg tcatcatgaa gtagtgggcc 2313961 acagctgcgg gtgtcagctg gcgaaaaatg cgcgcggcgc cctcttcgtt gcctgacgtg 2314021 tgcggcgcgc cgacatgggt ttggcgagca tggcctcggt aagttccccg gcttgccgga 2314081 tgcgggtcat gggcacagtg cagcgcgtcg ctgcctgtcc tggcccgggt agggcagcag 2314141 cgccatctcg cgggcgttct tgatcgcctg ggcgacttgg cgttgctgct ggactgtcag 2314201 gccggtcact ccccgggagc gaatcttgcc tcggtcagag atgaacaccc gcaatgttgc 2314261 ggtgtctttg taatcgacgc tctcgacgcc gaggctatcg agcaggtttt tcttcgcctt 2314321 cgtcgggccc tttcgcgcgg atttggcggc catctaccag ctggccttcc ggacaccggg 2314381 caggtgtccg tcgtgggcca gttggcggac ccgcacacgg gagagcccga atttgcggag 2314441 atgtccgcgc ggccggccgt cgatggcgtc gcggttgcgt aaccgcacgg gactggcgtc 2314501 gcggggctgg cgggcaaggg ctcgctgggc ggtactgcgc tgttcggggg cgctcgatgg 2314561 ggatcggatg atgtctttga gcgcggtgcg acgcgatgcg taacgggcga cggtggccgc 2314621 ccgccgctga ttcttgacga tcttggactt cttggccacg tcagcgttcc tcgcgaaagt 2314681 ccacgtgacg ccgcaggatc gggtcgtatt tgcgcaagat gagacggtcg gggtcattac 2314741 ggcggttctt gcgggtggtg taggtgtagc cggtgcccgc cgtggaacgc agcttcacaa 2314801 tcggccggat gtcggtgcgc gccatcagat ccgctgcccc tggcgacgca ggcgggccac 2314861 gaccgcttcg ataccgtcgc ggtcgatgac ctttataccc ttcgtggaca cccgcagccg 2314921 aatgcgacgg ccctcggagg gcaggtaata cgttcgttgc tgaatgttgg gcgaccatcg 2314981 ccgacggctt cggcgatggg agtgcgagac ggtgtttcca aatcccggct tgcggccggt 2315041 gacttggcag tgggcggaca aggggcaccc ttccttcgaa gctcggctta ttgaaaatca 2315101 ttttcgacaa cagctaggtg gcactgtacc gtcgacgtcg caataatgaa aactgttatc 2315161 gataaggagg acggtggcca ccccggtgat ccttgtcacc ggacacgagg gcaccgccgc 2315221 cgtgaccgct gacctgctgg gcctgctcac cgatcacggc actgcgacac ttcggtcagt 2315281 ggcaccagga tccgtgcggc gagccgatcc ccgcccacgg tgtcaccgcc gagaacaacg 2315341 acgacgacac cgggcatcca tgaaatccgc catccatccc gaccaccacc cccgtcgtct 2315401 tccacggtgc ccggtcctcc gccgcgacca agttgtactg gaaatgattg tcattacgat 2315461 ggtcgggcgg ccgagcgggc cgggcgaaag gaaatgggat gtgtggggca gcgtggcacg 2315521 cgcggtcacc ggcgggcatg tacccgtcaa atccatcctc accggcgccc atgccgaccc 2315581 gcattcgtac caggccagcc ccgcggacgc cgccgcgatc gtcgacgcgg agctggtgat 2315641 ttacaacggc ggcgggtacg acccgtgggt cgaccaggtg ttggccggcc atcctggtgt 2315701 ccaggcggtc gatgcctact cgctgctcgg cgccgtgggc gacgacgacg cgcccaacga 2315761 acacgtcttc tacgacccca atgtcgccaa ggcggtcgcg gcaacgatcg ccgaccggtt 2315821 ggcggacctc gacccgtcca attccgggaa ctatcgagcg aacgccgccg agttcagccg 2315881 cggcgccgac gcaatcgcaa tttccgaaca cgcgatcgcc accacctatc ccgacgccgc 2315941 ggtcatcgcg accgaacccg tcgtgcacta cctgctggcg gcagccggcc tgaaaaatcg 2316001 aaccccggct accttcatcg cggccaacga aaacggcaac gaccccaccc cggccgatat 2316061 ggcggccgtg ctcgacatga tcgccggccg tgaggtcgcg gcgttgctgg ttaacccgca 2316121 gacacctacc gcggcgaccg acgaactgca ggtggccgcc cggcgggcag gagtgccaat 2316181 caccgagttg accgagacct tgcccagcgg aaccgaccgg gaccagtttt gcgctgctga 2316241 ccggccagat cgtcggggtc ggtcactccg ggctgaccat gctgaccgtg gtttgtctgc 2316301 tcgtggtcac cgtgttggcg atctgctacc gaccgctctt gtttgccacc gtcgatccgg 2316361 aggtcgcggc cgcccgcggc gtgccagtgc gcgccctggg aattgtgttc gccgcactga 2316421 tgggcgtggt agccgcccag gctgtccaga tcgtcggggc actcctcgtg atgtctttgc 2316481 tgatcacccc cgccgcggcg gccgcccggg tcgtggttgc cccggtcgcc gcgatcgcga 2316541 cctcggtggt cttcgccgag gtttccgccg tcggcggcat cctgctgtcg ctggcgcctg 2316601 gagtcccggt gtcggtgttc gtggccacca tctcgtttgt gatctacctg atttgctggt 2316661 tgctccggcg gcgccgctaa ctagccggtc tcgctttcgg ccactttgag ctctaggcca 2316721 atgttgttcc gcatgccgcc gcgcagctta ctgacgaagg tgaacagctt gccctggatg 2316781 ccgtagcgct tgacgatcgc gtcgtagacg gcgcccgttt gggattcgtc gaggatggcc 2316841 gcggtggctt cgacggcctc gctggtcggc cggccgcgca aggtgcaggt cgccagcgtc 2316901 acccgcggcg tgttgcggat ccgcttgacc ttccacgatt tcttctcggt gatgaccagc 2316961 agtcgatccc cgcggtcggt gtccaaggcg gcccagatgg gaaccggctt gggccggccg 2317021 tccttggtga aggtggtcag cagcaggtac tgcgcctcgg caaggtcaga aaaggtaggg 2317081 gtcacgggtg ccaacctacc gcgcgagcag acgcagaatc gcactgcgcg gggtcccgcg 2317141 catgcgattc tgcgtctgct cgccgtactc aggcttccag gtcgccctcg gtttccagca 2317201 gcacctggcg caacccgtcc agggtttccg gtgccggctg tgcccacagg ccgcgaccgg 2317261 ccgcttccaa cagccgttcg gccatgccgt gcagcgccca cgggttggac tcggtcatga 2317321 acgtgcggtt ctgcgcgtcc aggacgtaac gctgcgtgag ctgctcgtac atccagtccg 2317381 ccatcacccc ggcggtggcg tcataaccga acagatagtc gacggtggcc gccatctcga 2317441 atgcgccctt gtagccgtgc cggcgcatcg cggccatcca cctcggattg accacgcggg 2317501 cgcgaaacac ccgcgtggtc tcctccgaca gcgtgcgggt gcggatcgcg tcgggtcggg 2317561 tgttgtcgcc gatataggcg gccggtgctt ggcccgtgag cgcccgcacg gtggccacca 2317621 tgccgccgtg atactggaag tagtcgtcgg agtcggcgat gtcgtgttca cgggtgtcgg 2317681 tattcttggc ggccaccgca atacgccggt actggcggtt catgtcgtcg atcgcctcgc 2317741 ggccatccag gtcgcgcccg taggcgaatc cgccccaggc ggtgtacacc tgggcgaggt 2317801 cggcgtcgtc gcgccagctg cggctgtcga tcagctgcag cagcccggcg ccgtaggttc 2317861 ccggtttgga tccgaaaatc cttgtggtgg ctcgccgttg atctccgtgg tgggccagat 2317921 ccgcttgggc gtgcgcgcgc acgtagttgt cctcggcggc ctcgtcgagg tcggcgacca 2317981 accgcaccgc gtcatcgagc atggtcacca catgcgggaa ggcatcacgg aaaaagccgg 2318041 agatccgtac cgtcacgtcg atgcgcgggc ggcccagctc ggccggctgc atgggcgcca 2318101 ggtcgatgac ccgccgcgag gcgtcgtccc ataccggccg aacccccagc agcgcaagca 2318161 cttcggcgat gtcgtcgccg gccgtgcgca tcgccgaggt gccccacacc gacagcccca 2318221 ccgaccgcgg ccaccgccca tgctcatcgc ggtagcgcgc cagcagcgaa tcggccagtg 2318281 ccacaccggc ttcccacgcc agccgggacg gcaccgcctt gggatccacg gagtagaagt 2318341 tgcgcccggt gggtagcacg ttgaccaggc cgcgcagcgg cgaccccgac ggcccggccg 2318401 ggatgaaccg gccgtccaaa gctcttagca cctgctcgat ttcggttgcg gtgccagcca 2318461 accggggtat cacttcggtg gcggcgaacc gcagcaccgc ggcggcgtcg gcgttgccgg 2318521 tgagtcggtc ggcggcggag gggtcccagc cggtggcctg cagggccgcg accagttcgc 2318581 gggctttcgc ctcggtctgg tcgactgtcg cgcgttcgtc ggtgccatcc tcggccaggc 2318641 cgagtgcctg ccgcaggccg gggatcgcgt gcgcgccgcc gaacagctgg cgggcccgca 2318701 agatggccag caccaggtcg agttcttgct cccccgttgg gttttgcccg aggatgtgca 2318761 gcccgtcgcg gatctggacg tccttgatct cgcacagcca gccgtcgacg tgtagcagca 2318821 tgtcgtcgaa cgagtcctct tccgggcgtt cggtcagtcc caggtcgtgg tccatcttgg 2318881 cggcgcggat cagcgtccag atctgctggc ggatggcggg cagcttgccg ggatccagcg 2318941 cggcgacgct ggcatgctcg tcgagcaact gttccaaacg cgcgatgtcg ccgtaggttt 2319001 cggcgcgggc catcggagga atcaaatggt cgactagcac cgcgtgcgcg cgccgcttgg 2319061 cctgggtgcc ctcgccgggg tcgttaacca gaaacgggta gatcagcggc agatcgccca 2319121 gcgcggcgtc gggtccgcag gacgccgaca tgcccagcgt ctttcccggc aaccattcca 2319181 ggttgccgtg cttgcccaaa tgcaccacgg cgtgcgcccc gaaaccgttc gagaatccgg 2319241 tatcgagcca gcggtaggcg gccaggtagt ggtggctggg cggcaggtcc gggtcgtggt 2319301 agatcgccac cgggttctcc ccgaagccgc gcggcggctg aaccatgagc accaggttgc 2319361 ccgctcgcag tgcggcgatg acgatctcgc cgtccgggtc gtggctacgg tcgacgaaca 2319421 gctcaccggg tggcgggccc cagtacgctg ttaccacgtc tgtcagttcg gcgggcaggg 2319481 tggcgaacca gtcccgatac tccttggccg acacccggat ggggttgccg gccagctggc 2319541 cttcggtgag ccagtcgggg tcgtgtccgc cgcattcgat caacgcgtga atcagcgcgt 2319601 cgccgtcgtt tgattcgaca cccggcagat cacccacccg atatccgcgc tgccgcatcg 2319661 cttgcagcaa ggccaccgcg ctggccgggg tgtccaggcc caccgcgttg ccgatgcggg 2319721 cgtgtttggt cgggtaggcc gagaagacca gggccacccg cttgtcggcg ggggcgacct 2319781 ggcgcagccg tgcgtgccgg accgccaggc ccgcgacccg ggcgcagcgc tccgggtcgg 2319841 ccacatagga gatcagcccg tcgtcgtcaa tctccttgaa cgagaacgga accgtgatga 2319901 tgcggccgtc gaactcgggc accgccacct ggctggccac gtccagcggc gacaggccgt 2319961 cgtcgttggc gcaccactga tcccgcgggc tagtcaaaca caggccttgc aggatcggga 2320021 tgtccagcgc cgccaggtgc tcaacgttcc agctgtcatc gtcgccgccg gccgaggcgg 2320081 cggccggctt gactcccccg gcggccagca cggtgaccac catggcgtcg gcgccgccga 2320141 gcctttccag cagccgcggc tcggcggtgc gcagcgacgc gcagtagagc ggcagcgggc 2320201 gtccgccggc gtcttcgatc gcccggcaca gcgcctcgac gtagccggtg ttgccggcca 2320261 ggtgctgggc acggtagtag agcaccgcga tcgtcgggcc ggtcttgccg gcgtccggac 2320321 gctccagcac cccccaggtc ggggtggcga ccggcggcgt gaacccgaag ccggtcatca 2320381 gcacggtgtc gcacaggaag gcgtgcaact cgcgcaggtt gtcgacgccg ccgtgggcca 2320441 ggtagatgtg ggcctgcagc gcggtgccgg ccgcgaccgt ggagcggtcg gtcaactcgg 2320501 catcggcggc ctgctctccg ctgaccagta cggccggtac cccgccggcg atcaccgtgt 2320561 cgattccgct ctgccaggcg cggtagccgc cgagaatccg gatcaccacg atcgacgctt 2320621 cggccagcag gtcggtcagt tccaggtcag acagccgcga gggattcgcc caccggtagt 2320681 tcttgccgct ggaccgggcg ctaatcaggt cggtgtcgga cgtcgacaac agcagaacgg 2320741 tcggttccgg caccaattct tcttaccgga gcaggactcg agcggtggcg tcgggcccgc 2320801 gagctttgta gccacgccta gactacaaac atgtctacat ccacgacgat tagggtttca 2320861 acccagactc gggatcgtct ggccgcccaa gcccgcgaac ggggaatctc gatgtcggct 2320921 ctgctcaccg aactggccgc ccaggccgag cgccaggcaa tcttccgcgc cgaacgcgag 2320981 gcctcgcacg ccgagacgac cacccaggca gtccgcgacg aggaccgcga gtgggagggc 2321041 acggtaggcg acggccttgg ctgagccacg gcgaggagac ctttggctgg tcagcctcgg 2321101 cgccgctcgc gcgggtgagc ccggcaagca tcggcccgcg gtggtcgttt ccgtggacga 2321161 gctactcacc ggaatcgacg acgaactcgt tgtcgtcgtg ccggtgtcaa gctcgcgctc 2321221 ccgcacccca ctccggccac ctgtcgcgcc ctcagaaggt gtagctgccg atagcgtcgc 2321281 ggtgtgccgc ggcgtccgcg cggtcgctcg tgcccgactc gtggagcgac tcggcgccct 2321341 caaacccgcc acgatgcgcg caatcgaaaa cgccctgacc ctgatcctcg gcctcccgac 2321401 gggacctgag cgcggcgagg cggcgaccca ttctcccgta cggtggacgg gtggccggga 2321461 cccgtgacgc ggacgcctgc cccggtgcgt tgcggccgca ccaggccgcc gacggggcgc 2321521 tggcgcggat ccggctgccc ggcgggatga tcaccgcggc acaactggcg acgctggcca 2321581 gcgtcgccag cgacttcggc tccgcgacac tggaactgac cgcgcgcggc aatgtccagt 2321641 tgcgcgggat ccgcgacgtg gcagcggtcg cggacgcggt cgccaaagcc gggctgctgc 2321701 cgtcggcaac acacgagcgg gtgcgcaata tcgtcgcctc gccgctgtcc ggccgggccg 2321761 gcgggctagc cgacgtgcgg gcatgggtcg gtgagctcga cgcggcgatc cgcgccgagc 2321821 cccggctggc ggaactgggc ggccggttct ggttcggtct cgacgacggc cgcgccgacg 2321881 tgtccggcct gggtgccgac gtcggcgtgc aggtgttccc cgacggtccc cgactgctgt 2321941 tgaccggacg tgacaccggc gtgcgggtgg ccgatgtcgc cgagaccctg atcgaggtcg 2322001 cgttgcgttt cgtcaagatc cgcgaaaccg cctggcgagt aacggaatta gccgatatcg 2322061 gcgagctgca gtccggtgtc gagctgggcc catccgttcg gcccgtcacc aaaacgcccg 2322121 tcggctggat accccaggat gacagccggg taacgctggg cgccgcggtg ccgctggggg 2322181 tcttgcccgc ccgggtcgcg gaatgcctgg ccgcgatcga ggccccgctg gtgatcacgc 2322241 cgtggcgatc ggtgctgatc tgcgacctcg acgacgcgac ggccgacgcc gcgctgcggg 2322301 tgctggcgcc gctgggcctg gtgttcgacg agaactcccc ctggctgaac atcagcgcct 2322361 gcaccggcag ccccggctgc gcgcactcgg ccgccgacgt acgggccgac gccgcgcggt 2322421 cactgaacgt ggagtcagcc gggcatcggc atttcgtcgg ctgcgagcgg gcctgcggca 2322481 gcccaccggc cggcgaggtg ctggtcgcca ccggcggtgg ataccggcga ttgcggccgt 2322541 agggtgagcg agtgctcgac tacctacgcg acgccgcgga aatctaccgg cggtcattcg 2322601 cggttatccg cgccgaggcc gatctggcgc gcttccccgc cgacgtcgcg cgggtggtgg 2322661 ttcggttgat tcacacctgc gggcaggtcg acgtcgccga gcatgtggcc tacaccgacg 2322721 acgtcgtcgc gcgggcgggt gccgcgctgg ccgccggtgc cccggtgctg tgcgattcgt 2322781 cgatggtggc cgccgggatc accacctcgc ggctgcccgc cgacaaccag atcgtctcgc 2322841 tggtcgccga tccacgcgcc accgagctgg ccgcccgtcg ccagaccacc cgatcggcgg 2322901 ccggggtcga gctgtgtgcc gagcggctgc ccggcgcggt gctggccata ggcaacgcgc 2322961 ccaccgcgct gtttcggctg ctcgaactgg tcgacgaagg ggcaccccca ccggcggccg 2323021 tgctgggcgg accggtgggt ttcgtcggat cggcacaggc caaagaggag ctcatcgagc 2323081 ggccccgcgg gatgtcctac ctggtggtgc gcggtcgccg cggcggcagc gcgatggccg 2323141 ccgccgccgt caatgcgata gccagcgacc gcgaatgagc gctcggggca cgctgtgggg 2323201 agtcgggctg gggcccggcg atccggagtt ggtgaccgtc aaggccgccc gggtgattgg 2323261 cgaggccgat gtggtggcct atcacagcgc cccacacggt cacagcatcg cccgcggcat 2323321 cgccgaaccg tatctgcggc ccggtcagct cgaggagcac ctggtctacc cggtgaccac 2323381 cgaggccacg aatcatcccg gcggctacgc cggtgcgctc gaagacttct acgccgacgc 2323441 gaccgagcgc atcgccacgc acctggacgc cgggcgcaac gtggcgctgc tcgccgaagg 2323501 cgacccgttg ttctacagct cctacatgca tctgcacacc cggctgacgc ggcggttcaa 2323561 cgccgtcatc gtgcccggtg tgacgtcggt gagcgccgcg tcggcggccg tggccacacc 2323621 gctggtggcc ggcgaccagg tgttgtcggt gctgccgggc acgctgccgg tcggcgagct 2323681 gacccgccgg ctggccgacg ccgacgcggc cgtggtggtc aagctgggcc gttcgtatca 2323741 caatgtgcgg gaggcgcttt cggcgtccgg cctactcggc gacgcgttct acgtggagcg 2323801 ggccagcacc gccggccaac gggtattgcc ggccgccgac gtcgacgaga ccagcgtgcc 2323861 gtacttctcg ctggccatgt tgccgggcgg gcggcgtcgt gcgttgctga ccggcaccgt 2323921 cgcagtggtg ggcctggggc ccggcgacag cgactggatg acaccgcaga gccggcgtga 2323981 gctggccgcc gcgacggatc tgatcggcta tcgcggctac ctggaccggg tcgaagtccg 2324041 cgacggccag cggcgccatc ccagcgacaa caccgacgaa cccgcccggg cgcggctggc 2324101 ctgctcgctg gccgatcagg gccgggcggt ggcggtggtg tcctccggcg acccaggggt 2324161 attcgcgatg gccaccgccg ttttggagga agccgagcag tggccggggg tgcgggtccg 2324221 ggtgattccg gcgatgaccg ccgcccaggc cgtcgccagc cgggtcggcg cgccgctggg 2324281 acatgactac gcggtgatct cgttgtccga ccggctcaaa ccctgggacg tgatcgccgc 2324341 gcgcctgacc gccgcggccg ccgccgacct ggtgctggcc atctacaacc cggcttcggt 2324401 gacccgcacc tggcaggtcg gcgcgatgcg cgagctgctg ctggcccatc gcgaccctgg 2324461 cataccggtg gtgatcggcc gcaacgtctc cggaccggtt tccggaccga atgaggacgt 2324521 tcgggtggtg aagttggccg acctgaaccc cgccgaaatc gacatgcgct gcctattgat 2324581 cgtggggtcc tcgcagaccc ggtggtattc ggtggattcg caggaccggg tgttcacccc 2324641 gcgccgctat cccgaggcgg gcagagctac cgcgacaaag tcgagccgcc acagcgactg 2324701 aaagagcttg cggccgaatt cctcaaggtc ggccaggctg cctccggaag gctcgccagt 2324761 tcgcgccacg cacccggcaa tctcccgaat cgtgcggcga ccgtcaacct gctgcagaaa 2324821 ggccaactgg gcggggctgg gcgccatgcg ccaacccggc caaaacatat cggtgccgga 2324881 gacaccgcaa cgcgtgcgca tcagcggtac gtaatcgagc gcggcaaccg tcgaaaaatc 2324941 gatcgtgtac tgctccttgg gtcggtcacg acggcacgcc ataaagagat gggtagcgtt 2325001 caaggtctcc agacgttcca tcacggacca ggccttgacc tcgggtaacg tgttcacggc 2325061 cgcataaaac tcgctgttcg ggacgaaaaa atcgtgcggg taatacggcg ccttgtggaa 2325121 ccatccctga aataccagtc cggcggacgt gaccagatcg acgcattcct cgacggtgta 2325181 actgcgttgg cgaccatgca agaacgtatc gacgagggcg ctatcggaaa gtaaatcccg 2325241 agctttcgtg agatagtttc ggagcggatg atacgtcggt agtaacgaga ttgcttcctt 2325301 cgccaatttg atcgatgcat cgtcctgccc taatccaaga tcacgaaaga ccgaaccgag 2325361 cagttcgact ccgatccgac cgtacttccc gtagagcatc gccgccacga cgccatcccg 2325421 gcgcaggcag tgggcgagtt ctttcatgcc cgcccgcgga tctgccaggt gatgtaaaac 2325481 gccggtcgat accacgaggt cgaagtcgcg tcccagcgtc gccagctctt cgatcggaag 2325541 cagatgcaac tccagattcg ccagcccgtg cttgtctttc agatattgct gatggtccag 2325601 tgccggtcga ctgatatcga tcgccactac tttcgccgca cgattggtga atgcgaaaat 2325661 cgccgcctgg ttggttccgc aaccggcgat cagaatatcc agatcgggcc ggtattcgcg 2325721 gtccggccat aatatccggt gggagtgcac cgggtcgaac cattcccaat tcgctgtggt 2325781 ccacgcctca agatcggcga tcgggtgcgg gtacaaccac cggtggtact gccgggacac 2325841 aatgtcggcg cgcggatgat cgtcggtcac ttcggtccca cgagcctatg caagcacacc 2325901 ggcaacgcac gtcgccgcct cggcgagcag cgcctcacgg ggctcggcgt catacccgcc 2325961 gccggcacga tcggacatga cggccaccac gtagggcacg ccggtcggtg accacacgac 2326021 cgcgatgtcg tttgctcgtc cgtagtcacc ggtcccggtc ttgtcgatca ccttccaatc 2326081 ggcgggaaag cccgctcgga tccgcttggc tccggtggtg ttgcgcgcca tccaatcggt 2326141 gagcagtgcc cgcttgtcgg gcggcaacgc gttgccgaga acaagctgct gcaacaccag 2326201 ggcgatggcg tgcggtgttg tggtatcccg ttcgtccccg ggcggatcgc ggttcaactc 2326261 cggttcctcg gcgtccaacc ggctcacggt gtcacccaag ctgcggaggt agccggtaaa 2326321 tgccgcggtg ccgcccccgg gaccgccaag atcggccagc aacaggttgg cggcggtgcc 2326381 gtcgctatag cgtatcgccg catcgcaaag ctgcccgatc gtcatcccgg tctgaacgtg 2326441 ttgttgggcc accggggaga tcgaccgaat gtcgtcactg gtgtaggtga tcagtttgtc 2326501 cagatgcgtg agcgggtttt ggtgcagcac cgccgccacg agcggcgcct tgaacgtgga 2326561 gcagaatgcg aaccgctcat cggcgcggta ttcgatcgcg gcggtggtgc cggtggcggg 2326621 cacatacacc ccaagccggg catcgtatct gcgctccagc tcggcgaagc gatccgccag 2326681 atccgctccg gccggcaagg ttgtcgatgc cggacgggcc ccgctcgcat gccgtgcaca 2326741 ccccgtcacg gaaaccagca ttgccatcgc taccagcagt tcgcgacgac cgaatcctct 2326801 gttgcgcatg ccgtagtatc acacgcgcgc agatggcagg cgccaaagcg cattcgacgc 2326861 cgcgctcccc cggctgctcg gcggcgggat ctacgacgac cggtcgtaga ctgaccggac 2326921 ctgccgggct atggtttatg cccatgaccg cgacggcaag cgacgacgag gccgttaccg 2326981 cactcgcctt gtcggcggcc aaggggaacg ggcgggccct tgaggcgttt atcaaagcca 2327041 cccagcaaga cgtgtggcgg ttcgtcgcct atctgtccga cgtgggcagt gcggacgatc 2327101 tcacccaaga gacattccta cgagcgatcg gcgccatccc gcggttttcc gcacgctcca 2327161 gcgcccgaac ttggttgctg gccatcgcgc gccatgtcgt cgccgatcac atccgccacg 2327221 tccgatcccg gccccgcacc acccgcggcg cgcgtcccga acatctcata gacggcgacc 2327281 gccatgcccg cggattcgaa gacctcgtcg aggtaaccac gatgatcgcc gacctaacca 2327341 ccgaccaacg ggaagcgctg ctgctgaccc agctgctcgg gctgtcctat gcggacgccg 2327401 cggcggtgtg cggctgcccg gtgggcacca tccgatcccg tgtcgctcga gcgcgcgatg 2327461 cgctgcttgc cgacgcggag cccgacgacc tcaccggcta ggcagaccgg ccacccacat 2327521 ggcggcccgg tggacagaat cgaccgccgc taccccagcc ggcagcagcg ggcgcgctat 2327581 catgaccacc gaaataccca gcgcagcagc ggcatccagc ttcgctcggg tcatcttgcc 2327641 accgctgttc ttggtgacca atgcgtcgat gcgctgctca cgcagcagtg cgaactcatc 2327701 gtggtaacca tatggcccgc gagatagcac cagtttgtgc cgccgcggca gggcggtgcc 2327761 atcgggcgcg gtaaccacgc ggatcaaaaa ccacgcgtcg ctgttggcga aggccgcaat 2327821 acccgagcgt ccggtggtca ggaacactcg cgaataacct tgttcagcaa caacgtctgc 2327881 agcctcgatg tccgataccg cgatgatggc ggtaccggga tcccacggcg ggcgagccag 2327941 taccaggtac gggagcccga gctcaccgca cacctgcgcg gcgtgcgcgg tgatggttac 2328001 cgcgaagggg tgggtggcgt cgacgacggc atcgatgcgc tcctctcgca gccaaccgcg 2328061 cagcccctcg acaccgccga acccgccgat gcgcaccgga ccgatcggca gggcagggtt 2328121 gggtacccgg ccggccagcg agctgacgat ctcaacgtgt gggtgcaact ctttcgccag 2328181 cgcacggccc tcggcggtgc cgccgagcaa caacacccgg gtcactgtgc ataccgaccg 2328241 tgccgtgcca ccgaatatag gtagctgtcg gtaaagccct cagcggtcag cacgtcgcca 2328301 acaacgatca cggcggtcct ggtgatcttg gcatcgtgca tccgcgcggc gatatcggcc 2328361 aacgtgccgc gtagcgtccg ctgttgcggc caactcgcga aagccaccac cgcaaccggc 2328421 gtttcgggtc ggtaaccacc gtctagcagt cgcggaacga tggcgtcgat ctgggctgcg 2328481 gccaggtgca agaccagagt ggcgcgggat cgggcgagcg cggccaggtc ctcaccgggc 2328541 ggtatgggtg tggacagcgt cgccacccgg gtgagcgtca ccgtctgcgc cacgcccggc 2328601 acggtgagtt cgcgctttag cgccgccgcg gctgcggcaa aagccggtac gcccggcacg 2328661 atttcgtagc cgatgcccag cgcgtcgagt tcgcggcact gttcggccag cgcgctgtac 2328721 agcgacgggt cgccggaatg cagccgggca acgtcgcggc cgtcggcgtc ggcgtcggca 2328781 agtttgcgca cgatttgttc gagggtcagc ggaccggtgt cgacaatcgt cgcgccgggc 2328841 ggacactgcg ccaacaggtc gtcgggcatg atcgaacccg catacaggca caccgggcat 2328901 cgttgcagga gccgttggcc gcggacggtg attaggtcgg cggcgccggg gcccgctccg 2328961 atgaaataga ccgtcatcgc ttggtcaccg accactgggt gaccggcagc tgtgggcgcc 2329021 aaccggtgaa gccgcccagc ggttcgccga gatagtgctg gaatcgtcgt agctcgccac 2329081 cgaggcgcga atatgcatgc gccagagcgg cttccgattc gacggtgaca gcgttggcga 2329141 ccaagttccc gcctgcgggc aggctgtcca ggcaggcctc aagcaggcct ggctgggtta 2329201 caccaccgcc aagaaaaatc accgacggcc gtgcggcgtc gtcgaacgca tcgggcgcgt 2329261 cgccgcgcac gtcgacgctc accccgaagg ccgcggcatt gaacccaatg ttgcggcggc 2329321 gccgttcgtc gcgctcgaac gccaccgcgg tgcagcccgg ccagctccga caccactgga 2329381 ccgcgatggc gcctgagccc gcgccgacgt cccataaccg ctgcccgggc cttggcgcca 2329441 gcgcagccag ggtcagcacg cggatcgggt gtttggtgat ctgcccgtcg tgcgcgaatg 2329501 cctcgtcggg tgcccacgac gtgcgctcgt cgagcaggta gcgcacggcg atcacgttga 2329561 gctcatcgac atcgaggggt gggtcgcagg cccatgcccg ggccgtaccg tcgcggcggc 2329621 gttcggccgg gccgccaagc tgttcgagca cgctgaactt ggagtcaccg cgaccgtgct 2329681 cggtcagcag caccgccagc gcctgcgggg tggaccgatc gccggacagc acgatggccc 2329741 ggccgccgcg gcgcaccgcg gtgtgtggtt gcgcggtgac caggctgatc acctcggtgt 2329801 catacacgtt ccagcccatc cgggcgcacg ccaacgtcac cgcggacacg tgcggcaaca 2329861 cggtcacgtt gtcgtggccg aacagccgga tcagggtgga gccgatacca tgcaacaacg 2329921 ggtcgccgct ggcaaccacg tgtaggtcag ccccatccgg tgacaggcct tgcaccgcgg 2329981 gcagcatcgg cgtcggccac tcccagcgct cggcggtgac ggtatcgtcg agcagggcaa 2330041 gttgccgttt cgagccgtaa attactgtgg ccctgcgcaa ttcggagcga gaatgctcgg 2330101 agagaccggt catgccgtcg gcgccgatcc cgacaacgat gatcatcggc gccgctctcc 2330161 cccgcaagcg ggcggtaccc ccaccgcatc gctgcgctct gcatcgtcgc ggatcatcgc 2330221 ggcatcctgc gccagacgaa ccggggaagc aaccgcagcg caacaaacat tggccgcagc 2330281 gcccacggaa tccacaccac gcgcttaccg ttgaccagcg cacgcgcggt cgcggcggcc 2330341 acccgctccg gggtgaccga caggggtgcg ggcgtcatgc cctcggtcat gcgcccgatg 2330401 acgaatcccg gccgcgcgat cagtaaccgc accccggtgc cgtgcaacgc atcggccagg 2330461 ccgctggcga agccgtccag gccggctttg gccgatccgt agacatagtt ggcgcggcgc 2330521 acccgaatcc cggcgaccga ggagaacacc accagcgatc cccgtccggc ggtgcgcatc 2330581 gccgctgcca gatgagtcag caggctgacc tgggcgacgt agtcggtgtg cacgatggcc 2330641 accgcgtgcg ccgcgtctgt ctcggcgcgg gcctggtcgc cgagtatccc gaaggccagc 2330701 accgcggtgc cgatggggcc gtgctcggca acgagcgaag cgaccaacgg gccgtgtgcg 2330761 gccaggtcgt cggcgtcgaa ctcccgggtg tgcaccgcta tagcgccagc tgcgcggagt 2330821 gcggcggcct ggtcggcgag ttgatcggcg ttccgcgcgg ccagcaccat cgtcgccccg 2330881 gcagccaggc gtcgcgcgag ttcgccgccg atctggctgc ggccgccgaa aattactacc 2330941 ggagcagcgc ccgtgtcgtc cacggctgcg attattgcct gcgctagcgt gagtggcgat 2331001 ggtcaacacc actacgcggc ttagtgacga cgcgctggcg tttctttccg aacgccatct 2331061 ggccatgctg accacgctgc gggcggacaa ctcgccgcac gtggtggcgg taggtttcac 2331121 cttcgacccc aagactcaca tcgcgcgggt catcaccacc ggcggctccc aaaaggccgt 2331181 caatgccgac cgcagtgggc ttgccgtgct cagccaggtc gacggcgcgc gctggctctc 2331241 actggagggt agggcggcgg tgaacagcga catcgacgcc gtgcgcgacg ccgagctgcg 2331301 ctacgcgcag cgctatcgca ccccgcgtcc caatccacgc cgagtggtca tcgaggtcca 2331361 gattgagcgc gtgctgggat ccgcggatct gctcgaccgg gcctgacaac cgaggtcatg 2331421 gcggcagtag gtaatgcacc caggcgccac cggcgggccc ggccacggcg tgcagacggg 2331481 cgttctgatt gcccgttcgg ggcagggtaa agtccgcgcc gatggctgtg caggctaggg 2331541 cagccccggc gaagaccacg ggtgccggcg tcacggtcca cctgcctgcc gcgtcccgac 2331601 aggccgcagg gtgtgggtca ccgcacgatg cggcgaccca gcggccatcc gcgccctgca 2331661 gggcgcatgc tccggcaccg gcacgcggtt cgtccggtgc ccagctccac aacgacgcct 2331721 gaatgcggcc gtcttcgggg agcagctgat cgaagccgaa cagattgacc ccgcaatcgg 2331781 tcatcgccgg caccttcggc ggggtaagcg cctgcggatt ggccggtgga cgggtcgggt 2331841 tggccaacgc cgtggccagc gtggagtcct cgtaatagcg gaccagtcgc caagcgtaga 2331901 caccgcggcc ataggtggca tcgcaggccg ggtatggccg gtagccggag ttcgagccgc 2331961 tttccagctc aacgccgctc cagtcgaaga cggcggccga ccaacctggc gcacaagacc 2332021 cgacgagcac ggctcgtgcg ccggatgcgc ggatttcctc ccgcgacacg tcgagtggaa 2332081 gcgggacaca gccgttggtg gcacgccggg ccgggttggg acggtagata aggcttgttc 2332141 cgtccgcacg ccgcaacact tggtcgaggg tagccaccac cgactcatac gccgacgcgt 2332201 tcttcagctg gtcctccagg tagagcagga tgacctcctc ggtatgcccg ggtgcgttca 2332261 accagttggc gatctgcggc agcactgtgg ccagcagagg ttcgacggtg cagcctaggt 2332321 tcgcgttctt cggtcccagc ccgtgacaca cggtgacgcc gggggcgccg tggccctcga 2332381 ggcggggcaa gtagtgcagg tctagctcga gcgcgcggac gtcgatgtcg agctgttggg 2332441 ccaacgacag ctgctggttt gagtctgcgt gcgagaccgt gaacgaatcg ctgaggctgt 2332501 tgaacgagtt gtgcgtgccg agccactgag tttcccgcag cggcaccggg tcttgcaacg 2332561 catcctggaa ccgcgcggtg cgatgcaccc aagactgtag gtaggcatca cgcgcggcct 2332621 gggtcacccg gtgcgcgagc ggaagcacgc accgcgcatc gggcacaccg acgcggcgac 2332681 actccgcagc gaccgcgtcg gcgaacttgc cgagcgccac gcaggggatc gcaaccgggc 2332741 ttattacgtc acaggatgcg gtgggcgagg gcggagcggg cacctggtag gcatcggcgg 2332801 ccaccggtgc cgcggttatc aacaccacgg ccaaggcgcc catgagggcc gcgctctgca 2332861 gccatcgggc gcggggcatg cgctactttg gcacgtcgat acaccgctta ccaggggtgt 2332921 tgtcgaagtg ttgcgtggtc tcgtcgaagc cgtcacgtaa ctccaagccg ccccgcgtcg 2332981 atgacgagac actagggctg cgaccgccag ggccgtgtag acgttgctct acaaggtcac 2333041 cggtcctggt cagaacttat ccgacggctc ctgcgcattt tcccgtacac aaccgcgggg 2333101 atgaggacca gcaacccgag actccagata tcccaccaca gtgacccctt cacggcattg 2333161 gcgattgcac tgatggccag cagaacccag gcgacaccca taaaggcgaa atagcacccg 2333221 cccctgagcc gtcccgctgg ccgaggccac agggagcctg cgacaccgcc gatgaggcag 2333281 acaaccacga cggcaacgct gaagacgaca acgggagtcg cgctacttgg tggcacagtt 2333341 gaccaccgcc gctcccgatc cgccaacccc cagtaaggcg gcgccaagta gtgcccagtc 2333401 ggcagggccg gccggaattg ctagggcggt cccaactacg ccagccgatg atcctgcgaa 2333461 acccgcaaca gccgccgtcc actccgcgct gttacatggc cggtgcagag cttgaaacgc 2333521 cagccggcgc gcctccaccg cgatggggtc atccggcggc ctagcggcca gctcgttgac 2333581 catgttgtcc acccaccgtt tggtcgcgtc tgatacgggt gcctctgtcc cggccggtgg 2333641 cagcatcatt gaagtgatcg gatcggagta cggtccgtcg gctccaccgg acgggtgtgg 2333701 cgctcctggc ggtggaggtg ttgggccgtc ttgtttgaag aagtcgacta gctgaaccgc 2333761 gctgtgggca acaaccgggg cgcccgcgaa gcggacattt cccacatcac cggtggtcgc 2333821 ggcgagctga ccggacacct cgttctccgc cgcaaccagt tgcccgacac gtaagcggat 2333881 gtcaccggcc aacgcctgag cctgagctag tcgagcggcc tgcactgcag ccggctgcgt 2333941 cgttttggtg tcggtgaccg ataggtcttc accgacgttg aaaccggcgt cctgggcgtc 2334001 ctctacagca tacataactc ttcgttgtgc cgcgtcgata gtgccggcgc cgttgcgcgc 2334061 gatcgtggct gctctccgca gctggtcggc tatgccactg accgttgaga agtcagctcg 2334121 ggttcgttgt cgcagcccgt cgcccccggc gccattccag gcgatggcat gggcctggtt 2334181 tcgcatctgc agaaacacgt cttcccaccg atccgcggtt tcggtccagt agccggccgc 2334241 atcgataagg tgctcggtgc tccatgcccg gatttgggac agggtggcca gcatctaaac 2334301 caccgtcacc tgcgtcaccg cggccatctc gctcgccgca gttgcctctt gatgggcgta 2334361 agctgcggcc gcggcagcca ccccggtagc cgtagcctgc gtccgggtgg cgaattccgc 2334421 tgccgcgcag cagattgctg cgttgatacc acttaccgcc accgtcgtgg cttggaatgg 2334481 ttggccggat tctggcggcg ttgccgaggc ggcgaactgg gcgccaaggc cctgcgattg 2334541 actggccgca acctcaagct gaccaagtac aacctgtagt tcattcgacc ccacccgcgg 2334601 gagtctaaat cgagaccacg cagagggcta ttcacgccga ttcaaagccg tcgaagaaac 2334661 gacaccaccc gcgggccgat gagacaggaa cgatcacaca ggtgcttgcg aagatccgtc 2334721 accacgtatg cgggcgaacg gtgtgttcgg cctgttggcg gccgccgcgt gcggtgttcc 2334781 catccccgtt atcgacaacc gcgccgagga gatgacgggc cggcacgcca caacggcaac 2334841 gagtttcagc atcacggacc agtcgtgcgc atcatgagga ctgccgcgcc gctgcgctca 2334901 ccgcggtcgt caaagcattg gatccaatga cgccatcgcg gtggcgcccg gtgaggtgcg 2334961 tgaccgtggt ctccggttat accttcgagc cgaccgcagg gtgacttgat cgtcaaatcc 2335021 acgacagtag ccttacacca agtccgaagg gagtagcggt gtttgtcgat gttgaacttt 2335081 tgcattcggg ggcaaacgag tctcactacg ccggtgagca cgcccacggt ggtgctgatc 2335141 agctgtcgcg gggacccctg ctgtcgggga tgttcggtac atttcctgtc gcccagactt 2335201 ttcacgacgc ggtcggcgcg gcccacgcac agcagatgcg aaacctgcac gctcaccggc 2335261 aggcgttgat cacggtgggc gagaaagcgc gccatgccgc gacggggttc accgacatgg 2335321 acgacggcaa cgccgctgag ttgaaagctg tggtatgcag ctgcgccaca taaacatccg 2335381 ggcgctgatc gccgaggccg gcggcgatcc ctgggcgatc gagcacagcc tgcacgcggg 2335441 tcggccggcc cagattgccg agctggcgga ggcgtttcac gcggcgggtc gatacaccgc 2335501 cgaggccaac gcggccttcg aggaagcccg tcgccgcttc gaagcgtcct ggaatcgaga 2335561 aaacggcgag cacccgatca acgactccgc cgaagtgcag cgcgtgaccg cggcgctggg 2335621 tgtgcagtct ttgcaattgc ccaagatcgg tgtcgatttg gagaacattg cggccgacct 2335681 cgccgaggcg caacgggctg cggccgggcg gattgcgacg ctcgaaagtc aactgcagcg 2335741 gatcgacgat cagcttgacc aagcgctgga actcgagcac gacccccgac tggccgcggc 2335801 cgaaagatcc gaacttgatg cgctgatcac ctgccttgag caagatgcca tcgacgacac 2335861 ggcgtcagca ctgggccagc tgcaatcgat acgcgccgga tactcggatc acctgcagca 2335921 atcgctggcc atgttgcgtg ccgatggcta cgacggggcg gggctgcagg gattggacgc 2335981 accgcaatcg ccggtgaaac ccgaagagcc gattcagatt ccgccaccag gcaccggggc 2336041 accagaggtg catcggtggt ggacgtcgct gacgtctgag gaacggcagc gtctgatcgc 2336101 cgagcacccg gaacagatcg gcaatctcaa cggcgttccg gtcagcgcgc gcagcgatgc 2336161 caacatcgcg gtgatgacgc gggacctgaa tcgggtacgt gacatcgcca ctcggtaccg 2336221 cacgtcggtt gacgacgtcc tgggtgatcc ggcgaaatac ggtctgtccg ccggcgatat 2336281 cacccgctac cgcaacgccg atgagaccaa gaaaggcctc gaccataacg cccgtaatga 2336341 tccccggaac ccctccccgg tatacctgtt cgcctacgat ccaatggcat tcggcggtaa 2336401 gggacgagcc gcgatcgcta tcggcaaccc cgacaccgca aaacacaccg ccgtgattgt 2336461 gcccggcacc agcagcagcg tgaaaggcgg ctggttgcat gacaatcacg acgacgcgct 2336521 gaacctcttt aaccaggcca aggccgccga cccgaataat ccgaccgcgg tgatcgcctg 2336581 gatgggatat gacgccccga acgacttcac cgacccgcgt atcgccactc cgatgctggc 2336641 ccgaatcggt ggtgcggcac tggccgagga cgtcaacggt ttgtgggtaa cgcatctcgg 2336701 cgtcggccag aatgtcaccg tgttgggcca ctcgtacggc tcgaccaccg tggccgacgc 2336761 gttcgccttg ggcggcatgc atgccaacga tgcggtgcta ctgggctgcc cgggaaccga 2336821 cctggcccac agcgccgcga gctttcacct ggacggaggc cgggtgtatg tgggtgcggc 2336881 ctctacggat ccgatcagca tgctcgggca gctcgacagc ctcagccagt atgtgaaccg 2336941 tggcaacctt gcgggtcagc tgcaaggttt agccgtcggc ctgggcaccg accccgccgg 2337001 cgacggattc ggttcggtga ggtttcgcgc tgaggtgccc aactctgatg gcatcaaccc 2337061 ccacgaccac tcctattact accaccgggg cagcgaggcg ttgcgcagca tggccgacat 2337121 cgcctccggt cacggcgacg cgctagcatc cgatggcatg ctggcccaac cacgtcacca 2337181 acccggcgtc gagatcgaca ttccaggtct tgggtcggtg gaaattgaca taccgggcac 2337241 gccggccagc attgacccag agtggagccg ccctccggga tctatcaccg acgaccatgt 2337301 tttcgatgcc ccactccacc gctgatcgac ggcttcggct gacgcggcag gctttgctcg 2337361 ccgcggccgt ggtgccgttg ctagcaggat gtgcgctggt gatgcacaaa ccccattccg 2337421 cgggttcgtc taatccctgg gatgattccg cgcacccgct caccgacgat caggccatgg 2337481 cccaagtcgt cgagccagcc aaacagatcg tcgccgccgc cgacctgcag gctgtcagag 2337541 cgggattctc gttcacctcg tgtaacgacc aaggcgatcc gccttatcag ggcaccgtca 2337601 ggatggcctt tctgttgcag ggcgatcacg acgcgtactt tcagcacgtc cgtgccgcca 2337661 tgctgtcgca cggctggatc gacggccccc caccgggaca gtacttccac ggcataaccc 2337721 tgcacaagaa cggagtgacc gcgaacatga gcttagcgtt ggaccacagt tacggagaga 2337781 tgatccttga tggtgagtgc cgcaatacga ccgaccacca ccatgacgac gagaccacca 2337841 acatcaccaa ccaactcgtt cagccatgaa ggcgtcgggt gccttcactg ttcccacatc 2337901 gatgtcagtg atcaccaacc cgtgtggcac gtggcgaccg gcgaccggcg agcccgcatc 2337961 gcaccaggta tcgaggaact cggacccacc ctggtcgaaa cggtacgccg ccgcgacgca 2338021 ctgccccgca tcgcccaagc cgtagtagtg gccgccaccc gcaactacgg cgtccccgac 2338081 aacgaaaccg acctactgcg gtcgcccagg ccaaggtggc caccaaacgc tgctggcatg 2338141 caggtggagt gcacagacac ggcagctgca atagccttac gcgggtgacc aacacccccc 2338201 ccacccacca caggacaatg gacaccaacc caccccccag cgccgccgcg ttcacgcaat 2338261 tggccgttgg cggcggtggc cagcgtcgcg attgccgcgg ttgtgctggg tgccgcagct 2338321 ttaatcgtgg cactgacgcg cccgacgaac agcggtccag ccaccgccgc tggaacgacc 2338381 gccgagccga catacaccgc agcagaaacc gccgccgcgc accaaaagtt atgcgaggtg 2338441 tacaaactgg cagcgcgggc ggtccaaatc gcgacaaacg gcgacaaccc ggcgttcgca 2338501 aacattgcca cagtcaatgg tgcggtgatg cttcagcaga cactgaatac gaccccggcg 2338561 ctcgtgcccg gcgagcgcac cgatgcactt gcactagcag aagcatatgg ccaagctaca 2338621 gcctttgcga tggagcaaga ccatccagcg tggcagtcag cagccaatga tgtcaatgcc 2338681 aaggatgcgc gcatgaaggc catctgcggt ggcgggtgat ctgccacccg gtcggtggtc 2338741 ggcgctcttg gtgggtgcgt ggtggccggc gcggcccgat gcgccgatgg ccggggtgac 2338801 gtattggcgt aaggcggccc agctcaagcg caacgaggcc aacgacctgc gcaacgagcg 2338861 atccctgtta gcggtaaacc aagggcgcac cgccgacgat ttgttggagc gatattggcg 2338921 cggcgaacag cgactagcca ccatcgcgca tcagtgcgag gtcaaaagcg accaaagcga 2338981 gcaagtcgcg gatgcggtga actatttgcg ggatcggctg accgagatcg cacaatccgg 2339041 caatcagcaa atcaaccaaa tcctggccgg caaagggccg atagaggcca aagttgccgc 2339101 ggtgaacgcc gtcatcgagc agtcgaatgc catggccgac catgtgggag caaccgcgat 2339161 gtccaacatt atcgacgcga cgcaacgagt gttcgacgag accatcggtg gtgacgccca 2339221 cacctggttg cgtgaccacg gtgtaagcct cgacactccc gcgcggccac gcccagtgac 2339281 cgctgaagac atgacttcta tgacggcgaa ctcgcctgca ggatccccat tcggtgctgc 2339341 tccgtctgcg cccagtcatt cgacgacaac cagcggcccg ccgacagctc caacaccaac 2339401 atcaccattc ggcactgctc ccatggtgct aagttcatct tcaacaagta gcggcccgcc 2339461 gacagctcca acaccaacat caccattcgg cactgctccc atgccgcccg gcccaccccc 2339521 accgggtacc gtctcaccac ccctaccccc cagcgccccc gccgttggtg ttggtggccc 2339581 gtcagtaccg gccgctggca tgccaccagc agcggcggcg gcaacagcgc cgttatcccc 2339641 acagtcgttg ggccagtcgt tcaccaccgg gatgacgacg ggcacgccgg ccgcggccgg 2339701 tgcacaggcg ctgtcggcag gggcgctgca cgcggcaacc gaacccctgc cgccaccggc 2339761 gccacccccg acgacaccca cggtcaccac accgacagtc gcgaccgcca ccacggccgg 2339821 gattccccac atccccgaca gcgcgccgac ccccagcccg gcaccgatcg cgccaccaac 2339881 caccgacaac gccagcgcca tgacacccat cgcgcccatg gtcgctaatg gcccgccagc 2339941 atccccggcc cccccggccg ccgcccccgc ggggccactg cccgcctacg gcgccgacct 2340001 gcgcccaccg gtaaccacac cccctgccac gccacccacc ccaaccggac ccatctccgg 2340061 tgccgcggtc acaccctcct cacccgcagc aggcggctca ctaatgtcac ccgtcgtcaa 2340121 caaatccacc gcaccagcca ccacccaggc ccaacccagc aacccaacac caccgctagc 2340181 cagcgccacc gcggccgcca ccaccggcgc cgcagccgga gacacctccc gccgagccgc 2340241 cgaacaacaa cgcctacgcc gcatcctcga caccgtcgcc cgccaagaac ccggattatc 2340301 gtgggctgcc gggctacgcg acaacggcca aaccaccctg ctggtcaccg acctcgccag 2340361 cggctggatc cccccacaca ttcgcctacc cgcccacatc accctgctcg aaccggcccc 2340421 ccgacgccgc cacgccaccg tcaccgacct actgggcacc accaccgtag ccgcggcaca 2340481 ccacccccac ggctacctca gccaacccga ccccgacaca cccgcactca ccggcgaccg 2340541 cacagcacgc atcgcaccca caatcgacga actcggaccc accctggtcg aaacggtacg 2340601 ccgccacgac acactgcccc ccatcgccca agccgtagta gtggccgcca cccgcaacta 2340661 cggcgtcccc gacaacgaaa ccgacctcct acaccacaaa accaccgaga tccaccaagc 2340721 cgtactgacc acctacccca accacgacat cgccacggtg gtcgattgga tgctgttggc 2340781 ggcgatcaac gcactgatcg caggcgacca gtcgggggcg aactatcacc ttgcctgggc 2340841 gatcgccgcg atatcaacga ggagatccag atgacgtcaa tcgaatcgca tcccgaacaa 2340901 tattgggcgg cggccggcag gccagggccg gtgccgctgg cgctgggacc cgttcatccc 2340961 ggtggaccga cgctgatcga cctgctgatg gcgctgtttg gcttgtccac gaacgccgat 2341021 ctgggaggcg cgaacgccga catcgaggga gatgacaccg atcggcgggc acatgcggcc 2341081 gatgccgcgc gcaagttctc ggcgaacgag gccaatgcgg cggagcagat gcagggggtg 2341141 ggcgcgcagg gaatggcgca gatggcgtca ggcatcggcg gagcgctcag cggcgcgctc 2341201 ggcggcgtca tggggccgct gacccagctc ccgcaacagg cgatgcaagc cgggcagggc 2341261 gccatgcagc cgctgatgag tgcaatgcaa caggcccaag gcgctgacgg actggcggcc 2341321 gtggacgggg cgcggctgct ggacagcatc gggggcgagc ccggtcttgg cagcggtgca 2341381 ggtggcggtg acgtcggggg cgggggcgct ggcggcacta cccccaccgg ctatctgggt 2341441 ccaccacccg taccgacgtc gtcaccgccg acgactcccg cgggggcacc gaccaagtcg 2341501 gcgacgatgc ccccgcccgg cggcgcttca cctgcctcag cgcacatggg tgcggccggg 2341561 atgccgatgg tgccgccggg cgcgatgggc gcccggggcg aagggagcgg ccaagaaaag 2341621 ccggtcgaaa agcgcctgac cgcgcctgcg gtccccaatg gccagccggt caagggccgc 2341681 ctgacggtgc ccccgagcgc accgaccacg aaacccaccg acggcaagcc cgtagttcgc 2341741 aggcgcatcc tgctgcccga gcacaaggac ttcggacgca tagctcccga cgagaagacc 2341801 gatgccggtg agtgacgatt cgtcgtcggc gttcgatctg atttgcgccg agatcgaacg 2341861 ccagttgcgc ggcggcgagc tgctcatgga tgccgcagca gcatccgaat tactactcac 2341921 cgtgcggtat cagctcgata cccagccgcg gccacttgtc atcgtgcatg gaccgctgtt 2341981 tcaggccgtc aaagcggccc gcgcacaggt gtacggacgc ctgatacagc tgcgacacgc 2342041 gcgctgtgag gtgctcgatg agcgatggca gctacggccg acgggtcagc gcgatgtgcg 2342101 cgcactgctg atcgatgtgc tgaacgtgtt gttggcggcc attaccgccg caggcgtgga 2342161 acgggcatac gcgtgcgcgg agcggcgggc gatggccgcc gcggttgtcg ccaagaatta 2342221 ccgggacgcg ttgggtgtcg agctgcagtg caattccgta tgccgagccg ccgccgaggc 2342281 gatccacgcg ctggcgcacc gcacaggggc taccgaggat gccgactgcc tcccgccggt 2342341 tgatgtgata cacgccgacg ttactcgccg catgcatggc gaggtggcga ccgacgttgt 2342401 cgcggccggc gaactggtga tagcggcgcg acacttgctg gaccccatgc ccaggggcga 2342461 gctcagttac ggcccactcc acgagggggg aaatgcggcc cgtaaatcgg tctatcgacg 2342521 cctggttcag ctatggcaag cgcgccgggc tgttaccgac ggtgacgtcg acctgcgcga 2342581 cgctcgcacg ctgctgaccg atctggacag cattttgcgt gagatgcgca cggccgcaac 2342641 cattcaacag agcggaacgg cgggcgatgg cggcggcggt cgtcgccaag attcgcggcg 2342701 acgcaatggg cctcgacgcc cagcgcgacg cggtacatcg cgcggccgcc gatgcgctcc 2342761 acgcgttgca atcggttggc atacaccaat aggcgaccct ttggcagttg agggtgtaga 2342821 ggagatcggc gcgtcgttgc cggggcggga gtcgacgcct tccgatgatg gaggttccct 2342881 acacccatca ggaagacctc gacgcgtcca tcgccgccgg tggtgcgggc ttggcctgtg 2342941 ctgacacatg accgctttcc gccgccttga ttgttgaccg gcactgggtt tgggggcggc 2343001 cgcgtcactg taggtgagta tgggacgtga gcgacatgtg cgacgtggtg tcgttcgttg 2343061 gcgccgccga gcgtgttctg agggcgagat ttcggccgag cccggaatct ggccccccag 2343121 ttcacgctcg gcggtgcggt tggtctctgg ggatcagcgc ggagacgctg cgccggtggg 2343181 caggtcaagc cgaggtcgat agcggtgtgg tggccggcgt gtccgccagc agaagtggga 2343241 gcgtaaagac cagcgagctt gagcaaacca tcgaaatact caaggtcgca acgagtttct 2343301 tcgcgcggaa gtgcgacccg cgacaccgct gatctgtgcg ttcggcgaca agcacaagca 2343361 cacctacggg gtcacaccga tctgtcgggc actggccgtg cacggcgtgc agatcgcctc 2343421 gcgcacctat ttcgcggatc gcgcggcagc gccttcgaaa cgcgcactgt gggacaccac 2343481 aatcaccgaa atcctggccg gctactacga acccgacgcc gagggcaaac gcccaccgga 2343541 atgcctgtac ggcagcctga agatgtgggc gcacctgcag cgccagggct tccggtggcc 2343601 ctctgccacg gtgaagacga tcatgcgggc caacggttgg cgcggagtgc ccctcgcagc 2343661 gcacatcaca caccaccgaa ccagacccgg ccgcggccca ggccctagac ctggcgggtc 2343721 ggcaatggcg ggctttagca acgaacctgc tggaagcggc cgacttcacc tacgcgccga 2343781 tgacgtggag ttccggctac accgcgttcg tggtcgacgc ctacgccggt gtgatcgcgg 2343841 gctgggaatg ctcgctgacc aaagacgcag cgttcgtcga acgcgcatta cgccacggcc 2343901 ttccagactc acctaggtca cccgtttggc ggagctattc atcatcgcga cgccggaagt 2343961 cagtatactg caatatattt cggcaagaca ccgatgctag ccgggctgcg gccgtcgata 2344021 ggcattgttg gcgacgccct cgacaacgcc ttatgtgaaa ccacgacagg gccccacagg 2344081 accgaatgca gccacggcag cccgtttcgt agcgggccga tccgcaccct ggctgacctg 2344141 gaagacatcg cctcggcgtg ggtggagcac acctgtcaca cacaacaagg tgtgcgaata 2344201 cccgggaggc ttcaacctgc gtagtgggcg gaagcgtttc acgacgcgat cggcttagcg 2344261 tatgcgcggg ccgataccac gggtgcacgc gatcacctgg aactggtgag ttggctatcg 2344321 tggtttggtg attacttgcg cttgggggct tgccgacggt tgcgccgggc gcaagtgggg 2344381 tgcggttttg cggttgatgg atggtagctg gtggcccacg agttgagtgc gggttcggtt 2344441 tttgccgggt accggataga gcggatgcta ggtgccggcg gaatgggcac cgtatatctg 2344501 gcgcgtaatc ccgatctgcc gcgtagcgaa gccttgaaag tccttgctgc ggagttgtcg 2344561 cgtgacctcg attttcgggc acggtttgtc cgcgaagccg atgtggccgc ggggttggat 2344621 catcccaaca tcgtggcggt tcatcagcgc ggccagttcg agggtcggct atggattgcg 2344681 atgcagttcg tcgatggcgg gaacgctgag gatgcgctgc gggcggcgac catgaccaca 2344741 gcgcgggcgg tgtacgtgat cggcgaggtc gccaaggcgc tcgactatgc gcaccaacaa 2344801 ggcgtgatac atcgcgatat caagccggcg aacttcttgt tgtcgcgagc cgctggcggc 2344861 gatgaacgag tgctgctaag cgattttggg atcgcgcgtg cgctcggcga cacgggactg 2344921 acgtccaccg gttcggtgct ggccacgttg gcctatgctg cgccggaagt tcttgcaggg 2344981 caaggttttg atggccgggc cgatttgtat tcgttggggt gtgccctatt tcggctccta 2345041 accggtgagg cgccgtttgc cgccggtgct ggagcggcgg tggcagtggt ggcgggtcat 2345101 ctgcaccaac cgccgccgac ggtcagcgat cgcgtgccag ggctgtcggc ggcgatggat 2345161 gcggtgatcg ccactgcgat ggccaaggat cccatgcgtc ggttcacctc agcgggtgaa 2345221 ttcgcacatg ccgccgccgc agccctgtac gggggagcca ccgacggatg ggtgccgccg 2345281 agccccgcgc cgcacgtcat atcgcaaggc gccgtgccag gttcgccgtg gtggcagcat 2345341 ccggtcgggt cagtgaccgc gttggccacg ccgcccggtc acggttggcc gccaggcctg 2345401 ccgccgctgc cgagacgacc gcgccgctac cgtcggggcg tggcggcggt ggcggccgtg 2345461 atggtggtgg ccgccgcggc cgtcaccgcg gtgaccatga catcgcacca accgcggacc 2345521 gcgacgccgc caagcgctgc agccctttct cccacctcgt ccagcacaac accaccgcaa 2345581 ccaccgatcg tgacaaggtc gcgcctaccc gggttgttgc cgccccttga tgacgtcaaa 2345641 aacttcgtgg gcatccagaa cctggtcgcc catgagccaa tgcttcaacc ccagactccc 2345701 aacgggtcaa tcaaccccgc ggagtgctgg ccggcggttg ggggtggcgt tcctagcgcc 2345761 tacgacctgg ggaccgtcat cggcttttac gggttgacaa tcgacgagcc gcccaccggg 2345821 actgccccaa atcaagtggg gcaactgatc gtggcctttc gcgacgcggc cacagcccaa 2345881 aggcatttgg ccgatttggc gtcgatctgg cgccgatgcg ggggtcgaac cgtaacactc 2345941 ttccgtagtg agtggcgaag gcccgttgaa ctgtcgacga gcgttcccga agtcgttgat 2346001 ggcatcacca ccatggtgtt gacggcgcag ggaccggtgc tacgagtccg cgaagaccat 2346061 gcgatcgccg cgaagaataa tgtgcttgtc gatgtcgaca tcatgacgcc cgacaccagc 2346121 cgcggccagc aggcggtcat cggcatcacc aactacatcc tcgccaagat acccggctga 2346181 gcgcgacacc attggcctag gacaccggca ccacgatcaa ctcgtgcggg cagttgttga 2346241 cagacacagc accgtcctcg gtcacgatca cgatgtcctc gatgcgggcg ccccaccggc 2346301 ccgggaaata gattcccggc tcgatggaaa acgccatgcc gggaaccaac accaggtcat 2346361 tgccggcgac gatatagggc tcctcgtgca cgcacagccc gatgccgtgc ccggtgcggt 2346421 gcacaaaata ctccgcgagc ccggcctcgg cgagcacgtc acgcgcggcg gcgtccacct 2346481 gctccgctgt cacccctggg cggatggcct cgaacgccgc ccgctgggct cgctgcaaca 2346541 tcgaatatga ctgcgctaca tcagaatcag gctcgccgat gctgtaggtt cgggtggagt 2346601 cggagtggta tccaggccca tacgtgccgc cgatgtcgac gacaacgatg tcaccctccc 2346661 gcaattcgcg gtccgaatat ccgtgatgcg ggtcggcgcc gtgcggcccg gaacccacga 2346721 tgacgaacgc tacctccgaa tgcccttcgg cgacaattgc ttcggcgatg tcggcggcta 2346781 cgtcggcttc cgttcggccc gggaccagaa actccggcac tcgggcatgc actcgatcga 2346841 tcgccgcgcc ggccttacgc agcgcgtcga tctcggtttc ctccttgacc atccgcagcc 2346901 tgcgcagcac gtcggtggcc aataccggca gcacacccag tgcgtcggcc agcggcaaca 2346961 tgtgcaacgc cggcatggaa tcggtgaccg cggtcgctac cggagctccg cccaacacgg 2347021 cactcaccaa cccgtagggg tcgtcaccgt cgacccaatc gcacacgcgc agacccaatt 2347081 ccgctgcggc ggattgcttg agggcggcga gctccagccg cggcagcaca accgccggcg 2347141 caccggcggc cggcaacacc aacgcggtga gccgctcgaa cgtctccgct cgcgacccga 2347201 tgaggtaaca caggtcgtag ccgggagtta tcaccagacc cgccagaccg gcgtccgccg 2347261 tcgcggccgc cgctaaagcc agccgccgtg cataaacctc ggcgtcgaat cggcgagaac 2347321 ccatgtcagc caggttaacc gcgcgttcgc gagcgctggc aagatagccc gcatgcccgc 2347381 acccgatccg atgcgtggcg acccgccgca cccggctccg ccgcgcttgc gatcgccact 2347441 ggacccaaca agtggcgacc cgctgcaccc ggctccgccg cgcttgcgat cgccactgga 2347501 cccaacaagt ggcgacccgc tgcacccggc tccgccgcgc ttgcgatcgc cactggaccc 2347561 aacaagtggc gacccgctgc acccggctcc gccgcgcttg cgatcgccac tggtgctact 2347621 ggacggcgcc agcatgtggt tccgctcgtt cttcggtgtg ccatcatcga tcaccgctcc 2347681 ggatggccgg ccggtcaacg ccgtacgcgg cttcatcgac tccatggcgg tggtgatcac 2347741 acagcagcgg ccaaaccggc tggcggtctg cctcgacttg gattggcgcc cgcagttccg 2347801 ggtggacctg atcccgtcat acaaggcaca ccgggtggct gagcctgagc ccaacggcca 2347861 gcccgacgtc gaggaggtgc ccgacgagct gaccccgcag gtcgacatga tcatggagtt 2347921 actggacgcg ttcgggatcg cgatggcagg cgccccggga ttcgaagccg acgacgtgct 2347981 gggcacgctg gcaacccggg agcgccgcga cccggtaatc gtggtcagcg gagaccgcga 2348041 cctgctgcaa gtggtcgccg acgatccggt cccggtccgg gtgctctacc tgggccgcgg 2348101 ccttgccaag gccaccttgt tcggaccggc cgaggtcgcc gagcgctacg ggttgccggc 2348161 acatcgcgcc ggcgcggcct acgccgaact cgcgctgctg cgtggcgatc cgtccgacgg 2348221 cctacccggc gtgccaggcg tcggcgagaa gaccgccgct accctactgg cccgacacgg 2348281 ctcgctagat cagatcatgg cggccgccga cgaccgcaag accacgatgg ccaagggcct 2348341 acgtaccaaa ctgcttgccg cgtcggccta catcaaggcc gccgaccggg tggtgcgggt 2348401 cgccaccgac gcaccggtca cgctgtcgac acccaccgac aggttcccgc tggtcgcagc 2348461 tgacccggag cgcaccgccg agctggcgac ccgattcggg gttgaatcct cgatcgcgcg 2348521 actacaaaaa gcgctcgaca cgctgcccgg atgacgatta ctgtggccgg ccgacctcgt 2348581 aggtgccctt gttgtcctgg aaggtcacgg tcacgcgctt tgaggtgccg tcgatgctca 2348641 ccgtgcattc gaaggtggcg ccctttttga ccgtggggtc tgaaccgttg ttgcacttga 2348701 cgtctttgac gttcttggcg ccgtaccccg tggtctcatc ggtgagaacc tgctgcacac 2348761 cggcctgcgc cttaatgacg tccagcttgg tggtgacgaa gaatccgggt gcccagaagc 2348821 cgagtattag aaccgcgccg atgaacagca cggccatcac ggcgatcacg ccgccgatca 2348881 ccgcaaccga acgcttcgac ccctgacccg actggccata cgggccgtat tgcccggggt 2348941 actgaccggg cggtgcgtac tggccgggct ggccgtactg tcccggctgg ccatattggc 2349001 ccggctgctg gtattggccg tactgaccgg gcacgccgag ctgggtgggc tgtgcaccga 2349061 actgttcggg ctgcgcatag ccgggtgtgg gctgcgggta ctgctgcggg tacgccgggt 2349121 cagccggctg ttggtactgc ggtgtgtacg ccggggcctg ccacgtcgcc tcctgggtcg 2349181 gctgctgctg ccagggatat cccgcggcca cggtggggtc cgaggaatgg tcggcgccct 2349241 ggccgggcgg ctgccacggc tgccttgggt ccgatccctg cggtccgctc atcgcttctc 2349301 ctcagtctgt gttaaccgta actctggccc agcctacccg gcgtcaaccg cgacgacgcc 2349361 gcgccgaatg tcaccgatag cgcgctttgc ggtagcccgc agttcggggt tgggcgcagc 2349421 gttacgaact tggtccagca gatcgagcac ctgacggcac caacgcacga aatcccctgc 2349481 caataacggt gatccgctgc cgttcacgtc ggcagcggcc aatgccgccg ctagatcacc 2349541 ggttcgcgac cagcggtaga tgactctgac aaagccatcg tcgggttcgc gactcggggt 2349601 gatgcggtgt gcctgctcgt cggcgcgcaa tgtcgtggac agccttgatg tctgagtcag 2349661 agcctgccgt aaccgcggtg tgggcacatc ggctccgaac ggggcgccct ggccgtcacc 2349721 accgcgcgtc tcgtagacca ccgccgacac cacccccgcc aattcggccg gctttaaacc 2349781 ctcccacgca cctgtacgta ggcactcggc caccaacagg tcgctctcgc tgtaaatccg 2349841 cgccagcagc cggccgtcgt cggtgaccac gggatcagtg gccgggccat cgatgaactc 2349901 ccgttcggtg agcagcccga cgaatcggtc gaacgtgcgg gccaacgagt tggtggcggc 2349961 ggcgaccttc ctctctaatt gcgcgttgtc gcgttcgatg cgtaagtaac gctcggcctg 2350021 gcggatctgg tcctcgagcc cgggcgaggt atgcaccgga tgacggcgca attgttcgcg 2350081 cgacgactcc agctccggat cgtgaaaccc gccggcctcg ctgacgcgcc gggcggctgg 2350141 aataaccaga cccgcggctg ccgatcgcag cgccgaggcc aggtcacgcc ggacccgcgg 2350201 ctggcggtgc tccacccgct tgggcagcgt catcgacccc accggcgtcg tgcccgagta 2350261 gtcggccgag gagatccgtc ccgcccatcg gtgttcggtt agcaccagcg gacgcgggtc 2350321 gtcgcggtcg cgggctgatt ccaggacgac ggccagacca ccgcggcggc cgtgggtgat 2350381 ggtgatgatg tcaccgcggc gcagcgcggc cagcgcatcg gtggccgcct gccgtcgctg 2350441 taaccgcgac gcgcgggcct gcgcacgttc cagctcggac acccgcgcgc gcaatcgagc 2350501 gtattcgagg atgggcgcat cagatccgcc cagttcggct gcgatctcgc cgagtatcct 2350561 gttgccccgc tcaattccgc ggaccagtcc gaccacggat cggtcggcct gatattgggc 2350621 gaacgactgc tcgagcagtc ggtgcgcctg ttgcggaccc atccggtgca ccaggttgat 2350681 cgtcatgttg tacgacgggg caaacgagct gcgcagcgga aaggtgcggg tggaggccag 2350741 gcccgccacc tcggacggtt caatttccgg gtgccagatc accaccgcgt gaccctcgac 2350801 gtcgataccg cgccggccgg cgcgaccggt cagttgggtg tactcccccg gcgtcagcgg 2350861 catgtgctgc tcaccgttga acttcaccag ccgctccagc accaccgtgc gggccggcat 2350921 gttgataccg agcgccagag tctcggtggc gaatacagcc ttgaccaaac cggcggtgaa 2350981 cagctcctcc accgtgtgcc ggaaggccgg caacatgccc gcgtggtggg cggccagacc 2351041 gcgcagtaac ccttcccgcc attcgtagta gccgagtacc gccaggtcgg agtcggccag 2351101 gtcaccgcag cggtggtcga tcacctcggc gatccgtgcg cgctcctctt cgctggtcaa 2351161 ccgcagcggt gaccgcaggc attgggtgac cgcggcgtca caaccggccc gggagaacac 2351221 gaaggtgatc gccggcaaca gcccttcagc gtcgagtttg gcgatcacct cgggtcggcc 2351281 gggtggccgg tagaagccgg gccggcccga gcctcggcgc cgaggctgcc aatcggccat 2351341 ccggtcggcc tcacggcgat gcgcgatgtg gcgcagcaac tcgcggttga cttggggctg 2351401 cccttcggct tcgccgatcc ggtaatcgaa caggtcgaac atgcgcttgc ccaccaagac 2351461 gtgttgccac aacggcaccg gccgatgctc gtcgaccacc accgtggtgt cgccccgcac 2351521 cgtctggatc caaccgccga actcctcggc gttgctcacc gtcgccgaca ggctgaccac 2351581 ccgcacgtcg tcgggcagtt gcaggatcac ctcctcccac accggacccc gcatccggtc 2351641 ggcgaggaaa tgcacctcat ccatcaccac ataggaaagc ccctgcagcg caggcgaatc 2351701 cgcgtagagc atgttgcgca gcacttcggt ggtcatcacc accaccggcg cgttgccgtt 2351761 gaccgacagg tcaccggtca gcagcccgat ctggtcacgg ccgtagcgtg ctgtgagatc 2351821 ggtgtgcttt tggttgctca gggctttcag cggcgtggtg tagaaacatt tactgccggc 2351881 cgccagcgcc aggtgcacgg cgaactcgcc gaccaccgtc ttgccagcgc cggtcggcgc 2351941 gcacaccagc acaccgtggc cgcgttccag cgcgctgcaa gcccgctgct gaaagtcgtc 2352001 gagcgagaac ggtagttccg cggtgaaccg gtccagctcg gccagctcag tcacgtcgcc 2352061 gccgcctcgc cagttgaccg cgcccgctcg cggctagcgg gcctacgtga cgtcgtcatg 2352121 agatccgatg accgatggcg ccggcaccgg cgagggcggg tcgatgaccg aagcttcgtc 2352181 gtcgggaatc gcggcttcgc gcttggcttt tcgcttgtca tgcacgcggg cgatctgaat 2352241 ggcgagctct agcagcacgg tcaacgccgc accgagcgcg gtcatcgaga acggatcgga 2352301 tccgggcgtg aagatcgccg cgaagacgaa catcgcaaag atcaacccgc gccgccaaga 2352361 cttgagccgc tcataggtca gcaggcccgc caggttcagc atcacgatca gcagggggaa 2352421 ttcgaagctg accccgaaca ccaccagcag gttgagcaga aagccaaagt agcggtcgcc 2352481 agacagcgcg gtcacctgca cgtcgctgcc gacggtcaac aaaaagccca acgccttgga 2352541 caacaccagg taggccagta cggcaccggc gacgaacagc accgctgctg ggatcacgaa 2352601 ggccaccgcg aagcggcgct ccctctggta gagaccaggc gtgatgaacg cccacagctg 2352661 gtagaaccac accgggcaag ccagcacaat gccggcggcc atcccgacct tgagccgcaa 2352721 catgaactgg tcgaacggcg cggtggccaa caaacggcac tctccgtcgg cgctgatatc 2352781 cgcccgggcc gactgcggca gggcacagta gggatgccgc agccactctc cgaggctgtc 2352841 caacccgaaa atcgaatgcg aataccagac gaacccgaag attgtggtga ccaagatcgc 2352901 ggccagggag atcagcaacc tggtgcgtaa ctcggtcagg tggtcgacca gcgacatcgt 2352961 cgcgtcagga ttgacgcggc tgcgcctgtt acgtgggttg agccgtttga gaagaccggc 2353021 ggcgcgcact gaagcgacgc ccgagctaag ccggccgagc ctcggtgctg tcttgaccag 2353081 acgccgctga ggggtcgaca cgctgcgatt gcaccggcgt gggggtctcg atagacgctt 2353141 ccgctttgtt ctcgttctgc agttcacgga cctcggactt aaagattcgc aatgacttgc 2353201 ccaacgagcg cgccgcatcg gggagcttct tggcaccgaa caacacgatc accacgacag 2353261 cgaggatcgc ccaatgccac ggactcagac tgcccacttt gattacctcc agacgttgac 2353321 ccgatgctac cgcagcggcc gcggcacccg gagatttcgc gccgtcacgg cggcgcagct 2353381 gcctggtatg catccagcgc ggccgtcgcg gcgtcgcgaa cccgctgagc gagcgactcc 2353441 ggcgccagaa cgcgcacgtc cgaaccgaag cccagcaata ggcgcgtcat ccaatcctca 2353501 gaggcgtagg tcatggccac ctcacaggag ccgtccggca gctgtcgtag ctcccgaatc 2353561 gggtagtact ccagcatcca cgaggccgac ggtgccaccc gcaacgtcgc cgacggcagc 2353621 gataggtcac cgtcgaacag cgacgtgtcc ggtggcgcct gccgtgccga ttccggcgga 2353681 accgcgggct cgcccaactc ggcggcatcg acaatccggt cgaaacggaa caggcgaacc 2353741 ccttcggcct cacgcgacca ggcctccaaa tagctgtgcc cgccgatcaa cagcacccgg 2353801 atgggatcca cgatccgagt ggtgagggtg tcatgcgacg cggcgtaata gtcgatggtc 2353861 agcgcccgac tgttccgcac cgcggcccgt acggccgcgg cggccgggct ttctgtgggt 2353921 gcctgttcgg caacggcggc caccgcgccg gccgcggcgg cgatcttggc gatggcgctg 2353981 cgcgccgcct gcgggtcaac cacgccggga atgtccgcta gcgcccgcaa cgccaccagc 2354041 agcccggtgg cctccggcga tgtgagcttt aacggccggt cgatgcccgc cgagaacgtc 2354101 acctcgatgg tgtcaccgca gaattcgaag tcgatgaggt cacccgggga atagcccgga 2354161 aggccgcaca tccacagctg gttgaggtcc tcctccagct gcttggcggt gacacccagc 2354221 tcggcggcgg cctcggcgcg ggtgatccgg gggttggcct ggaagtacgg caccatgttg 2354281 agcagccgca ccagccgggt ggacagggcg ctcatgccag tgctccggct tgcgcgcgta 2354341 gtcgggccag cacatcgtcg cgcagagacc cgggctgcag cacgattgcg tcggccccat 2354401 agccggtgat ctcacgcgcc agccggtcgc tggatcgaat ctcaagctcg atcacctcgc 2354461 catcgcgacc accaagttgt cgcggcccgg cggaccgccc ggcacgtcgc aacgcggtgg 2354521 cccgaccctc ggctacccat accgtggctt gctcaccggt cggcacctcc gtcaccttct 2354581 gcgccacgat gctgcgtagg tccacaccgg caggcacggt ggttgcgccg gccggcccga 2354641 ttggcgtcac ctgcgctccg atccgggaca gccggaagac gcgggttgca tcccggtcgc 2354701 ggtcgtggcc gaccagatac cagcggccct tctcggtaac cacaccccac ggctcgacgg 2354761 tccgaacggt gtacggctct gcgcgcgacg atcgatgaga gaactgcacc acctgcccgg 2354821 aatcgatggc cgacaacaag attccgagaa cgtcctcaga gccgcgcagt cccgaaacgg 2354881 ccgccgccga cgcgatggcc accggtgccc cggtatccaa gggatcgacg tccaccccgg 2354941 cggcccgcag cttcagcaac gcgccctggg tcgcggtgat caactccggt gactcccaca 2355001 gctgggtggc gacggctacc gcggccgcct catccggggt cagctcgaca ggcgacaggg 2355061 cgtaggcgtc gcggttgatg cgatagccct cggtgggctc caacgccgag accctgccga 2355121 cctcgagcgg aatgccgagg tcacgcagct cgttcttgtc gcgctcgaac atccgggaga 2355181 acgcctcaac gctggggctg tccgaatagc ctgccacgct ggacctgatc ttctccgcag 2355241 tgatgtagcc acgagtggac agcaaggcta tgacgagatt gaccagccgt tcgactttcg 2355301 aggtcgccat tggtggtgct acatgctcgc gatcagccgc ttaacccgct catcgaccgc 2355361 ccggaacggg tctttgcaca gcacggtgcg ctgcgcctgg tcgttgagtt tgagatgtac 2355421 ccagtcgacg gtgaaatcac gtcccgcctc ctgcgcggcg ctgatgaact caccgcgcag 2355481 ccgggcccgg gtggtctgtg ggggctgatc gacggcctcc gcgatttctt cgtcggtggt 2355541 gacgcgcgcg gccaaccctt tgcgctgcag gagatcaaag atcccgcgtc cgcgcttgat 2355601 gtcgtggtag gccagatcca gctgagcgat cttcgggtgg gacaactcca tgtcatagcg 2355661 gtcctgataa cgctgaaaca gcttgcgttt gatcacccag tcgatttcgg tgtcgacctt 2355721 ggcgaaatcc tggctttcga cggcatcgag ttggcggccc cacaggtcga cgacctgctc 2355781 gatctgcgcg ttgggctccc gagtctgcaa gtgctcgact gcgcgggtgt agtactcccg 2355841 ctggatgtcc agcgcgctgg cctgacggcc tccggccaac cgcaccggcc ggcgaccggt 2355901 gacatcatgg ctaacctcac ggatggcgcg gatcgggtta tccagggaaa aatcacggaa 2355961 ggcgactcca ctttcgatca tttccagcac gagcgccgcg gtgcccacct tgagcatggt 2356021 ggtggtctcg gacatgttgg agtcgccgac gatgacgtgc agccgccggt acttctcggc 2356081 gtcggcatgt ggctcgtcgc gggtgttgat aatggggcgg gatcgggtcg tggcgctaga 2356141 gacgccctcc caaatgtgtt cggcgcgttg gcttaagcag taggtggcgg ccttgggggt 2356201 ctgcagcacc ttgccggccc cgcagatcag ctggcgggtg accaggaagg gcagcagcac 2356261 gtcggagatc cgggagaact caccggcccg cacgatcagg tagttttcgt ggcagccgta 2356321 ggagttgccc gccgaatcgg tgttgttctt gaacaggtag atgtcgccgc cgatgccctc 2356381 gtcggccagc cgctgctcgg cgtcaacgag caggtcttcc agcacccatt caccggcccg 2356441 gtcatgggtg accagctgca ccaggctgtc gcattcggcg gtggcgtact cgggatgact 2356501 gcccacgtcg agatacaggc gcgcaccgtt acgcaggaag acgttggagc tgcggcccca 2356561 ggacaccaca cggcgaaaca ggtagcgggc cacctcgtcc ggggacagcc gacggtgacc 2356621 gtgaaatgtg caggtgacac cgaactcggt ttcgatgccc atgattcgac gctgcacgta 2356681 tttgagggta ctggttgttg gttggcggcg gcgcgatagc cacgcccgtt acccgtccgg 2356741 gccggacggg ccggggactc cgaacagcag cccgccggtg ccgccgctgc cgccgggccc 2356801 cgcggccccg tccggagtac cgggtccgcc ggcggcgcca gccccaccgg cgccaccgtc 2356861 gccgaacaag atggcggtgc cgccgtgccc gccgacaccg cccggcccgc cgggcgaggt 2356921 ggtgttcatg ccgggccccc cttggccggc ggccccgccg gcgccaccgt tgccgtacca 2356981 cacgccgccg ttgccaccgc tgccgccggg gttgcccgcg cccccgacgc caccgctgct 2357041 gaccgagcca ggcgcgccgc tcccgccgct accgccggca ccaccgttgc cgatgagccg 2357101 cgcgctgccg ccgttgccgg cgttggtgga gccaataaat ggcagccccc cattaccgcc 2357161 gtcgccgccg ccgccgccat cgccgtacag ccacccgccg accccaccgt cgccgccgcg 2357221 actacctacc tggaacaggc gcgcaccgag ccctgggtcg ccgccgttgc cgccgccgcc 2357281 gccgccgaca ccaatcagcc ccgcgtcgcc gccccggcca ccgctgccgc ccaacccggc 2357341 gaaaccgtcg ctggagacgc cggtacctgc atcgccaccg ttgccgccgg aaccggccgc 2357401 ccctccattg ccgtacagca gcccgccggc gccaccgaac tggccgaagc cgccactgcc 2357461 gccactgccg ccggccttgc cgctcccgcc gtgcccaccg tccccaccgt ttccgccggc 2357521 accgccgtga ccgatcagcc ccgcccgtcc acccaaaccg ccatcgcctc ccccgccgcc 2357581 agcaccgagg tctcccacac cgttgtcacc ggtaccgccg actcccccgg cacccccgtt 2357641 gcccagcagc aatccgccca ggccgccatt gcctcctgca ccgccgggcg caccgttgcc 2357701 gcccctgccg ccgttgccga tcagccccgc tgacccgccg gctcccccgg caaccccggg 2357761 gctcgtgctg tcgccaccgt tgccgccgtt gccccacaag atgccgccgt ccccgccggg 2357821 ctgtcccacc ggaccactgg ccccgtcggc gccgtcgccg accagcggac gccccagcag 2357881 cgtctgggtg ggcgcgttca ccgcgttcag caggttctgc tgcgcgttgg caatctcggc 2357941 gctcgcatat gaacctccac cggcgttaag cagttggacg aaccggtcat gaaacgccgc 2358001 cgcctgggcg ctgaccgctt gatagctctg gcctgggcgc caaatagcgc cgctatgccc 2358061 gccgacacct catcggcggc gggcgccaac gcccccgtcg tcgggaccgc cgccgccgcg 2358121 ttcgctgccc tgatggtcga acggatggcc gctaaatccg tggccgcagc cagcaatgcc 2358181 tccgggctcg caatcacaaa cgacattgcg cacctcccac caacccgcga taacccggct 2358241 gcgccggaac cgtcgatgcg tatggcagga atatcgtatt gcgatccccc accctcagtc 2358301 ggggtgttcg ccagattcgt cgcagctcag cgctgcgccg gcgccagcat tggcgatggc 2358361 tggtggttaa cgcgagtggt cgaaggtgat ggccggggca ctgttcgaac cgtcgttcgc 2358421 cgcagcgcac ccagcggggc ttctcagacg acccgtgacg cgaaccgtcg tgctgtcggt 2358481 ggccgctact agtatcgcac acatgttcga gatatcgctg ccggacccga cggagctgtg 2358541 ccgatccgat gatggcgcgc tggtggccgc gatcgaggac tgcgctcgtg tggaggcggc 2358601 tgcgagcgcc cggcggttgt cggcgatcgc cgagctgacc ggccggcgca ccggcgcgga 2358661 ccagcgggcc gactgggcgt gtgacttctg ggactgcgcg gccgcggagg tggctgcggc 2358721 gttgactatc agccacggca aagcctccgg acaaatgcat ctgagccttg ccctgaaccg 2358781 gctgccccag gtggcggcgt tgtttttggc cgggcatctt ggtgcgcggc ttttctcgat 2358841 catcgcctgg cggacctacc tcgttcgcga cccgcacgca ctgagtctgc tcgatgccgc 2358901 cctggccgaa cacgccggcg cgtgggggcc gctgtcggcc cccaaactgg aaaaggccat 2358961 cgactcctgg atcgatcgct acgatcccgg ggcgctgcgg cgcagccgta tctcggcccg 2359021 cacccgcgac ctatgcatcg gtgatcccga tgaggacgcc ggcaccgccg cgctgtgggg 2359081 ccggctgtat gccaccgacg ccgcgatgct ggatcgccgg ctcaccgaga tggcccacgg 2359141 cgtgtgcgag gatgacccgc gcaccctggc ccagcgccgc gccgacgcgc tgggcgcgct 2359201 ggccgccggc gccgaccacc tggcgtgcgg ctgcggcaag cccgactgcc cctccggtgc 2359261 cggcaacgac gagcgggccg ccggtgtggt catccacgtc gtcgccgacg cctcagcact 2359321 tgacgcacaa cccgacccac acctatccgg cgacgaaccc ccttcgcggc ccctcacccc 2359381 ggagacgacc ctgttcgagg cgttgacacc cgaccccgaa cccgatcccc ccgccaccca 2359441 cgcgccggcc gagctgatca ccaccggcgg cggtgtggtg cccgcgccgc tgctggccga 2359501 actcatccgg ggtggggcca ccatcagcca agtgcgccat cccggcgatc tcgcagcaga 2359561 gccgcactac cggccgtcgg ccaagctggc tgaattcgtc cggatgcggg atttgacgtg 2359621 ccggtttccc gggtgtgacg tgcccgccga gttttgtgat atcgaccatt cggcgccctg 2359681 gccgttgggg ccgacgcatc catcaaatct gaagtgcgcg tgtagaaaac accacctttt 2359741 gaaaactttc tggacgggct ggcgggatgt gcagttaccc gatggcacgg tcatctggac 2359801 cgcgcccaac ggccacacct acactaccca tcccggcagc cgcatcttct ttcccacctg 2359861 gcacaccacc accgccgaac taccccaaac atcaacggca gcagtcaacg tcgacgcacg 2359921 cggcctgatg atgccgcgac ggcgccggac ccgagccgcc gagctggccc accgcatcaa 2359981 cgccgaacgc gccctcaacg acgcgtacat ggccgaacgc aacaagccac catcgttctg 2360041 atgggcggct attcccacct catgtcaaac accccttctg gatgtcacgc cccttctgga 2360101 caccaccgac gagttctcgt gtcgccgcac ctatccaaga agaccaaccg ctacgatcgg 2360161 tcgatgtcgc ggcgccgcag tcgacgcagg agaaccgcga aacgtgccgg ccgctccgtc 2360221 gacaagagag aaggactgca tgctggtttt gcacggcttc tggtccaact ccggcgggat 2360281 gcggctgtgg gcggaggact ccgatctgct ggtgaagagc ccgagtcagg cgctgcgctc 2360341 cgcgcggcca cacccgttcg cggcgcccgc tgacctgatc gccggcatac atccgggcaa 2360401 acccgcaacc gccgttttgc tgttgccgtc gttgcgatcg gcgccgctgg actcgccgga 2360461 gctgatccgg ctcgccccgc gcccggccgc gcgaaccgat ccgatgctgt tggcgtggac 2360521 ggtaccggtg gtggacctgg accccaccgc ggcgttggcc gccttcgacc agcccgcccc 2360581 cgacgtccgc tacggcgcgt ccgtcgacta cctggccgag ctggccgttt tcgcgcgcga 2360641 gttggtcgag cgtggtcgcg tgctgcccca gctgcgccgc gacacccacg gcgcggccgc 2360701 ctgctggcgt ccggtgttgc agggacgcga cgtggtcgcg atgacctcgc tggtctcggc 2360761 gatgccgccg gtctgccgcg ccgaagttgg tgggcacgac ccgcacgaac tggcaacctc 2360821 ggctctggac gcgatggtcg acgccgccgt gcgcgcggcg ctgtcaccga tggacctgct 2360881 gcccccgcga cggggtcgct ccaaacggca tcgggccgtg gaggcttggc tgaccgcgtt 2360941 gacctgcccg gacggccggt tcgacgcgga gcccgacgaa ctcgacgcgc tggccgaggc 2361001 gttgcggcca tgggacgacg tcggtatcgg caccgtcggc ccggcgcggg cgacgtttcg 2361061 gctgtccgaa gtcgagaccg aaaacgagga gacgcccgcg ggctcgttgt ggaggctgga 2361121 gttcttattg cagtcgacgc aggaccccag cctgctggtc cccgccgagc aggcatggaa 2361181 cgacgacggc agcctgcgcc gctggctgga ccggccgcag gagctgctgc tgaccgaact 2361241 gggccgggcc tctcggattt tccccgagct cgtcccggcg ctgcgcaccg cgtgcccgtc 2361301 cgggcttgag ctcgacgccg acggcgccta ccgattcctg tcgggtacgg ccgcggtgct 2361361 cgacgaggct gggtttggcg tgctgctgcc gtcctggtgg gaccgccgcc gcaagctggg 2361421 cttggtcctg tccgcatata ccccggtcga cggcgtggtg ggcaaggcca gcaagttcgg 2361481 ccgcgagcag ctcgtcgagt tccgctggga gctggccgtg ggcgacgatc cgctcagcga 2361541 ggaggagatc gcggcgctga ccgaaaccaa gtccccgctg atccggctgc gtggccagtg 2361601 ggtcgcgctc gataccgaac agatgcgccg cgggctggag tttttggagc gtaagccaac 2361661 cggccgcaag accaccgccg agatcctcgc gctggccgcc agccaccccg acgacgtgga 2361721 caccccgctc gaggtcaccg ccgtacgcgc cgacggctgg ctcggggacc tgctcgccgg 2361781 ggccgccgcg gcgtcgctgc agccgttgga cccgcccgac ggattcaccg cgacgctgcg 2361841 tccctaccag cagcgcggtc tggcgtggct ggcgtttttg tcctcgctcg gtttgggcag 2361901 ctgcctggcc gacgacatgg gcctgggcaa gacggtgcag ctattggccc tggaaacctt 2361961 ggaatccgtt cagcgccacc aggatcgcgg cgtcggaccc acactgctac tgtgcccgat 2362021 gtcgttggtg ggcaactggc cgcaggaagc ggccaggttt gcacccaacc tgcgggtgta 2362081 cgcccaccac gggggcgccc ggctgcacgg cgaggcgttg cgcgaccacc tcgagcgcac 2362141 cgacctggtc gtgagcacct ataccaccgc cacccgcgac atcgacgagc tggcggaata 2362201 cgaatggaac cgggtggtgc tggacgaggc ccaggcggtg aagaacagcc tgtcccgggc 2362261 ggccaaggcg gtgcgacggc tacgcgcggc gcaccgggtc gcgctgaccg ggacaccgat 2362321 ggagaaccgg ctcgccgagc tgtggtcgat catggacttc ctcaacccgg gcctgctcgg 2362381 atcctccgaa cgcttccgca cccgctacgc gatcccgatc gagcggcacg ggcacaccga 2362441 accggccgaa cggctgcgcg catcgacgcg gccctacatc ctgcgccggc tcaagaccga 2362501 cccggcgatc atcgacgatc tgccggagaa gatcgagatc aagcagtact gccaactcac 2362561 caccgagcag gcgtcgctgt atcaggccgt cgtcgccgac atgatggaaa agatcgaaaa 2362621 caccgaaggg atcgagcggc gcggcaacgt gctggccgcg atggccaagc tcaaacaggt 2362681 gtgcaaccac cccgcccagc tgctgcacga tcgctccccg gtcggtcggc ggtccgggaa 2362741 ggtgatccgg ctcgaggaga tcctggaaga gatcctggcc gagggcgacc gggtgctgtg 2362801 ttttacccag ttcaccgagt tcgccgagct gctggtgccg cacctggccg cacgcttcgg 2362861 ccgtgccgcc cgagacattg cctacctgca cggtggcacc ccgaggaagc ggcgtgacga 2362921 gatggtggcc cggttccagt ccggtgacgg cccgcccatt tttctgctgt cgttgaaggc 2362981 gggcggtacc gggctgaacc tcaccgccgc caatcatgtt gtgcacctgg accgctggtg 2363041 gaacccggcg gtcgagaacc aggcgacgga ccgggcgttt cggatcgggc agcggcgcac 2363101 ggtgcaggtc cgcaagttca tctgcaccgg caccctcgag gagaagatcg acgaaatgat 2363161 cgaggagaaa aaggcgctgg ccgacttggt ggtcaccgac ggcgaaggct ggctgaccga 2363221 actgtccacc cgcgatctgc gcgaggtgtt cgcgctgtcc gaaggcgccg tcggtgagta 2363281 gcacctggta tccaccaccg tcccggcccc gtccggtcga gggtgggatc aaggcgcgca 2363341 gcacccgcgg cgcgatcgcg cagacctggt ggtcggagcg gttcattgcg gtgctggagg 2363401 acatcggcct gggtaaccgg ctgcagcgtg gccgcagcta tgcgcgcaag gggcaggtga 2363461 tctcgctgca ggtggatgcc ggcttggtca ccgcgctggt gcagggcagc cgggcccggc 2363521 cgtaccggat ccgcatcggg attccggcgt tcggcaagtc gcaatgggcg cacgtcgagc 2363581 gaaccctggc cgaaaacgct tggtacgcag caaaattgct gtccggcgaa atgcccgaag 2363641 acatcgagga cgtcttcgcc ggcctgggcc tgtcgctatt ccccggcacc gcccgagagc 2363701 tatcactgga ctgctcctgc cccgactacg cggtcccatg caagcacctg gccgccacct 2363761 tctacttgct ggccgagtcc ttcgacgagg atccgttcgc catcctggcg tggcgtggcc 2363821 gcgagcggga ggatctgctg gccaacctgg ccgctgcccg cgccgacgga gcggcaccgg 2363881 ccgccgacca cgccgaacaa gtggcccagc cgctcaccga ctgcctagac cgctattacg 2363941 cccggcaggc cgacatcaat gtccccagcc cgccggcaac cccatcgacg gcattgctcg 2364001 accagctgcc cgacaccgga ctcagcgccc gcggacggcc gctgaccgag ctcctgcgac 2364061 ccgcctatca cgccctgacg caccatcaca acagcgcggg cggctgatcc cagcgcaccc 2364121 cttcgaatcg gccgaagtca ctgtcgtagg acacgatgct ggcgcgatgc tcgacggcaa 2364181 gcgcggccag atgcgcgtcg ttgaccaggt tggcaccggt tcccacgtac gtcagcattc 2364241 tcgccaggat atcggcgtgc cggacggtcg gattcaccaa gacggcgctg ggtgcggcta 2364301 gccaatccgc gacctgggtg atggccgcct cccgcggaag cggacggggg aacaacccca 2364361 ccttggtcgc caatcgcacg aacgccaaca acggcaccca ggcgaacccg acgcggtcgg 2364421 cgcccgacag cgcaccgtca agccagcgca gcgacggctt gtggtgctca cttgtggtgt 2364481 tcacggcgta gagcaagacg ttcgcgtcga cgatcttcat caaccgctat gacccgcggc 2364541 gttgacggcg cacaagctct tcgtcctcga ggtcggccgc aagctgcaag gcccggtcga 2364601 ggttgaccgc agggacgccc aagtctgccg tgcgggtgct gaagtgactc ggcgcaggtc 2364661 gcccggaggc gccgtcgcga atcgcgtcgt tgagggcctt cttgaaggac acttgccgct 2364721 cggccatccg gcgccttacc aactgctcga cgtcgtcatc caatgtgaca gtcgtccgca 2364781 ttttgatagc atagcatcaa gattgtcgac agcatctcgt caatcggcgc gcgggcccgt 2364841 cactaatccg gcgattcgcc gtcggactgg gagtctttgg cgcccgtgga acccctttgt 2364901 gtcccttggc atctttgcga tccagttccc gcagccgttt tcccaacgcg gcacccgcga 2364961 tgcgccgaaa cgcgcgccgt agccggtcgg cgtcgagcac ggccacctcc agggtggcca 2365021 ggggtgggtt gagcaccacg gtcgtgaacg tcattcgcgg tagcccgact cggcgacctc 2365081 gagcagtcga cacgccttct gcacgggaag tccttctgcg gccatcgttg ctatggccgc 2365141 ttactgcctt ctagtccgtg cggctctcgc aacagctcac gggacctttt tgaggatcgc 2365201 cacttcaggt cttcaactcg cggatgccct cattggcaac gtttgcgcct gccttggggc 2365261 ggccggcagc caccaagtcg agcactttgc ggcggaacta ctcggggtaa cacttcggca 2365321 cggacacggc tcgttcgacg gacgtcgtga ccagaagtcg agcaaaccga ctccactcta 2365381 gctagtgata caagcttttt tgtagccgcg cgatgaaccg ccccggcatg tccggagact 2365441 ccagttcttg gaaaggatgg ggtcatgtca ggtggttcat cgaggaggta cccgccggag 2365501 ctgcgtgagc gggcggtgcg gatggtcgca gagatccgcg gtcagcacga ttcggagtgg 2365561 gcagcgatca gtgaggtcgc ccgtctactt ggtgttggct gcgcggagac ggtgcgtaag 2365621 tgggtgcgcc aggcgcaggt cgatgccggc gcacggcccg ggaccacgac cgaagaatcc 2365681 gctgagctga agcgcttgcg gcgggacaac gccgaattgc gaagggcgaa cgcgatttta 2365741 aagaccgcgt cggctttctt cgcggccgag ctcgaccggc cagcacgcta attacccggt 2365801 tcatcgccga tcatcagggc caccgcgagg gccccgatgg tttgcggtgg ggtgtcgagt 2365861 cgatctgcac acagctgacc gagctgggtg tgccgatcgc cccatcgacc tactacgacc 2365921 acatcaaccg ggagcccagc cgccgcgagc tgcgcgatgg cgaactcaag gagcacatca 2365981 gccgcgtcca cgccgccaac tacggtgttt acggtgcccg caaagtgtgg ctaaccctga 2366041 accgtgaggg catcgaggtg gccagatgca ccgtcgaacg gctgatgacc aaactcggcc 2366101 tgtccgggac cacccgcggc aaagcccgca ggaccacgat cgctgatccg gccacagccc 2366161 gtcccgccga tctcgtccag cgccgcttcg gaccaccagc acctaaccgg ctgtgggtag 2366221 cagacctcac ctatgtgtcg acctgggcag ggttcgccta cgtggccttt gtcaccgacg 2366281 cctacgctcg caggatcctg ggctggcggg tcgcttccac gatggccacc tccatggtcc 2366341 tcgacgcgat cgagcaagcc atctggaccc gccaacaaga aggcgtactc gacctgaaag 2366401 acgttatcca ccatacggat aggggatctc agtacacatc gatccggttc agcgagcggc 2366461 tcgccgaggc aggcatccaa ccgtcggtcg gagcggtcgg aagctcctat gacaatgcac 2366521 tagccgagac gatcaacggc ctatacaaga ccgagctgat caaacccggc aagccctggc 2366581 ggtccatcga ggatgtcgag ttggccaccg cgcgctgggt cgactggttc aaccatcgcc 2366641 gcctctacca gtactgcggc gacgtcccgc cggtcgaact cgaggctgcc tactacgctc 2366701 aacgccagag accagccgcc ggctgaggtc tcagatcaga gagtctccgg actcaccggg 2366761 gcggttcacg attgggccgc ccgtaaggaa tgcgtcatga gcgacttcgc atcacgggcg 2366821 accaatcatt aatttgtcaa accctttgac atgcactact tgtccacatt ttgtacacga 2366881 aatacctaac acactatggt gcacatcacg cacttccacg ttccgtattc ggtgtacgat 2366941 tttgtcacgc aactaagcgt tcaagaggga gtactatgac tcatccaaaa gtaaaagatg 2367001 acatagaaat agaagagtcg tggttccggt gcgggtagct cccgatggct tgactgtggt 2367061 aagcaccagt ggcgtgttcc ccgtggttga gaccaggaag ttttaaagtc ctacagcccg 2367121 cggtattccg cagaggacat tgtgtgcatt tcgcaccttc gggtgggaga aatcgggatg 2367181 atctcaccac cggccaccgg tgggcgcact ttgtaccctt cgattccgtt attcggcgga 2367241 tttaagcagt tcgcaccatt accaagcagc caatgaggaa gagcgcaggt gactaggtcg 2367301 cttgatcttt ccctgtgcag tagctcgggt tctttgagtt tcgaggagga gaaaccacat 2367361 gtcctttgtg aatgtagacc catttgggat gttggcggca gctgcgacac tggagtccct 2367421 tggttcccac atggcggtaa gcaatgccgc ggtggcctcg gtgaccacca aggttcctcc 2367481 cccggccgcc gactacgtat caaaaaagtt atcgctgttc tttagtagcc acgggcagca 2367541 gtaccaggtg caagccgctc ggggcacggc ctttcatcga aaattggtcc ggaccctggc 2367601 gaatggcgcg cttgcgtatg aggaagtcga gatcgccaac aacgaaggtt tctaacgtgt 2367661 cgccagttac gcacgagtgg ctaccagcga gtacaaggga gtaacgaatt atgcccaatt 2367721 tctgggcgtt gccgcccgag atcaactcca cccggatata tctcggcccg ggttctggcc 2367781 cgatactggc cgccgcccag ggatggaacg ctctggccag tgagctggaa aagacgaagg 2367841 tggggttgca gtcagcgctc gacacgttgc tggagtcgta taggggtcag tcgtcgcagg 2367901 ctttgataca gcagaccttg ccgtatgtgc agtggctgac cacgaccgcc gagcacgccc 2367961 ataagaccgc gatccagctc acggcagcgg cgaacgccta cgagcaggct agagcggcga 2368021 tggtgccgcc ggcgatggtg cgcgcgaacc gcgtgcagac cacagtgttg aaggcaatca 2368081 actggttcgg gcaattctcc accaggatcg ccgacaagga ggccgactac gaacagatgt 2368141 ggttccaaga cgcgctagtg atggagaact attgggaagc cgtgcaagag gcgatacagt 2368201 cgacgtcgca ttttgaggat ccaccggaga tggccgacga ctacgacgag gcctggatgc 2368261 tcaacaccgt gttcgactat cacaacgaga acgcaaaaga ggaggtcatc catctcgtgc 2368321 ccgacgtgaa caaggagagg gggcccatcg aactcgtaac caaggtagac aaagagggga 2368381 ccatcagact cgtctacgat ggggagccca cgttttcata caaggaacat cctaagtttt 2368441 gattcgggaa catcctaaga aacggggggc gtcgccgttg gagacgtcgc aacgtgtccg 2368501 cagtcccaag ggcaacagtg aagggcccac ggtgcgatcc ccaacacccg gctagagtgc 2368561 gcataatatt ttcccgcctc ggctcaaggc gtgcaccccc atcaccgcta accatgctgt 2368621 gtatcaacag atttcattgt cccggccgtc gcgcgaccga ccaatagggt gagttccatg 2368681 tgcgatatcg cctaacagcc ggctcccgta ctcccgtggc cgatgtgatt attgattacg 2368741 tggatcacca tgtgggtgat cgcggtcgac agctttggta ccgagcacat cgccacaacg 2368801 cgcggtacga atctagtaca caaatccgca ccagccgcca tgcgacttcg caggtcatag 2368861 ccccgcagag tcgccgaacc tgccgcagtg acaaaagtca ggacggccgg cgacgcgtcg 2368921 agccggggtt aggcgcagtt aacgtcgcag cggggtccca gacacgcgtc ggactttcgg 2368981 actcagcccg acgattcgcc gtcagactgc gggctttcct ggtctaccag caacgcttgc 2369041 agggcggagc cggtgatgcg ccggaacgcg cgccgtggcc ggttggcatc gagaacggcc 2369101 acctctaagc tggccacgcc aagggtgggt tgatcaccac ccgaggtgtc ggcactgccg 2369161 gcccgcaatg cagcgaccgc gatacgcagg gcgtcggtca ggctggcgtt ctcggcatac 2369221 gactctttga gcgcgttggc gatcggctcc gtggtgccgc ccatcaccac gaaatgcggc 2369281 tcgtcggcga tcgacccgtc gtaggtaata cgatacaact cagggcgttt cgtctcgccg 2369341 taatgcgcca cctcggccac acacaactca acctcgtagg gcttggcctg ttcggtgaag 2369401 atggtgccta gagtctgcgc gtagacattg gccaactgcc gacccgtgac gtcacgacgg 2369461 tcataggcgt aaccgcgggt gtcggcgaac tggatcccgc cgcggcgcaa attgtcgaac 2369521 tcgttgaact tgcccgcagc cgcaaaaccc acccgatcgt agagctcact gatcttctgc 2369581 agcgaccgcg acggattctc cgcgacgaac agcacaccac cggcataggc cagcgccacc 2369641 acgcttttgg cccgcgcaat gcccttacgc gccaactcgc tgcgctcgcg catcgcctgc 2369701 tcaggcgaga tgaaatacgg aaaactcact tctcaccgcc atcggagccg aaagtatccg 2369761 cacccgaacg gctttcgatg atcgcgcggg ccaattcggc aatccggctc tccggcacgt 2369821 caaccgcccc gtcggcgtcg atgatcaccg ccgtcggaaa gatgccccgc accaggtccg 2369881 gaccgccggt ggcggagtcg tcgtcggcgg cgtcgtagag cgcctcgacc gccacccgca 2369941 gccccgaatc accgtcggta acctgcgaat acaacttctt catcgacgac ttcgcgaaca 2370001 gcgaacccga gcccaccgcc tgatagccct cttcctcgat gttccaaccg ccggcggcgt 2370061 cgaacgaaac gatacgaccc gcgctctgcg ggtcagacgc atgaatgtcg tagcccgcca 2370121 gcaacggcaa cgccagcaga ccctgcatcg cggccgccag attgccacgc accataatcg 2370181 ccagccggtt gattttgccg gcaaacgtca gcggcacacc ctcgagcttc tcgtagtgct 2370241 caagttccac ggcatacagc cgggcaaact caaccgcgac cgcagccgtg ccagcgatgc 2370301 cggtagcggt gtagtcatcg gtgatataca ccttgcgcac atcacgccca gaaatcatgt 2370361 tgccctgcgt cgaacgccgg tcacccgcca tgacaacacc gccggggtat ttcagcgcga 2370421 caatggtggt gccgtgcggc agttgcgcat cgccgcctgc gagtggcgca ccgccgctga 2370481 tgcttgccgg cagcaactcc ggcgcctggc ggcgcaggaa gtcagtgaaa gaagataggt 2370541 ctacagcggg tgttccagag agtgaattaa tggacaggcg atcgggcaac ggccaggtca 2370601 ctgtccgccc ttttggacgt atgcgcggac gaagtcctcg gcgttctcct cgaggacgtc 2370661 gtcgatttcg tcgagcagat cgtcggtctc ctcggtcagc ttttcgcgac gctcctggcc 2370721 cgcggcggtg ctgccggcga tgtcgtcatc atcgccgccg ccaccgccac gcttggtctg 2370781 ctcttgcgcc atcgccgcct cctgcttcct catggccttt caaaaggccg cgggtgcgcg 2370841 tcacacgccc gctgtctttc tctaccctac cggtcaacac caacgtttcc cggcctaacc 2370901 aggcttagcg aggctcagcg gtcagttgct ctaccagctc cacggcactg tccaccgaat 2370961 ccagcaacgc accaacatgc gccttactac cccgcaacgg ctccagcgtc gggatgcgaa 2371021 ccagcgagtc gccgcccagg tcgaagatca ccgagtccca gctagccgcg gcgatatcag 2371081 ccccgaaccg gcgcaggcat tcgccgcgga aatacgcgcg ggtgtcggtc ggcgggttct 2371141 ccaccgcact cagcacctgg tgttcggtga ctaaacgctt catcgagccg cgcgcgacca 2371201 gccggttgta caggcccttg tccagccgga catcggagta ctgcaggtcg acgaggtgca 2371261 gccggggcgc cgaccagctc aggttctccc gctgccggaa accgtcgagc agccgcagtt 2371321 tggccggcca gtccagcagc tccgcgcaat ccatcgggtc acgctcgagc tgatccagca 2371381 cgtgtgccca ggtttccacg atgtcggccg cccgcgggtc cgggtcgcgg ctatccacca 2371441 acttagccac tcggtccagg tagatccgtt gcagcgcaag accggtcagt tcccggccgt 2371501 cggccagcgc aacggtcgct cgcagcgacg gatcgcggga gattgcgtgc accgcatgta 2371561 ccgggcgggc cagcgccagg tcggtcagat ctattgcgtg ggctggtcct tcttcgatca 2371621 ggtcgagcac cagcgccgtg gtacccaact tcagataggt cgacgtctcg gcaaggttgg 2371681 cgtcgccgat gatgacgtgc agccggcggt acctgtcggc gtcggcgtgc ggttcgtcgc 2371741 gggtgttgat gatgccgcgc ttgagcgttg tttccagccc tacctcgacc tcgatgtagt 2371801 ccgaacgctg ggatagctgg aagccgggct catcacccga gggcccgatg ccgacccggc 2371861 ccgagccggt caccacctgc cgggatacca gaaagggggt cagcccggtg atgatcgccg 2371921 agaacggtgt ctgccgcgac atcaggtagt tctcgtgcga cccgtaggag gctcccttgc 2371981 cgtcgacgtt gttcttgtac agctgcagtt tcgcggcccc gggcacgctg gcgacatggc 2372041 gggcagcggc ctccatcacg cgttcgcccg ccttgtccca gatcactgcg tccagcgggt 2372101 cggtgcattc gggcgcggag tattccgggt gcgcgtggtc gacatacagc cgcgccccgt 2372161 tggtcaggat catgttggcc gcgccgacct cgtcggcgtc gaccaccggc ggcggcccgg 2372221 ccgagcgact caaatcgaag ccccgggcgt cgcgcagcgg cgattccacc tcgtagtccc 2372281 aacgggtgcg tttggcacgc tgaatgccgg cggcggcggc gtatgccagc accgcctgcg 2372341 tcgaggtgag gatcgggttg gcggtcgggt ccgacggcga ggaaatgccg tactcgacct 2372401 ccgttccgat aatccgctgc atgccgtaga gcctaggccc gccgacgatg cgggccgcgc 2372461 agcgggccgc tgaggaggcg ggcatcaagc aacgcccgcc gacgatgcgg gccgcgcagc 2372521 gggccgctga ggaggcgggc atcaagcaag gcccgccgac ccagaacatc ggagcgggcc 2372581 gcgcaggagg tggacaatca agcagggccc ggcgctaggg taggccggca tgagcctttc 2372641 cgtccgtcgc cccccggcgg cccgagcagc ggccattgtg gaggctgaaa gctggttctt 2372701 gaagcgtggt ctgccctcgg tgctgaccat gcggggccgg tgccgtcggc tgtggccgcg 2372761 gtcggctccg atgttggccg cctgggcggt ggtcgagggc tgcctcatgg ccgtcttctt 2372821 cgtcaccgac ggcggcgaag tcttcatcag cgcgacgccg acgacagcgc aatgggtgat 2372881 cctggcgctg ctcgcggttg ctcttccgct ggcctccctc gtcggctggt tggtgtcgca 2372941 gatatcaagc gggcgtggcc aagcggcggt ggcgaccatg gcggtggcct tcgcggccgc 2373001 atccgacgtc atcgaatccg gcccgatcca gctgttgcgg accgccgtcg tggtgggcct 2373061 ggtgctgctg cagaccggct gcggcgtcgg gtcggtgctt ggctgggcgg tgcggatgac 2373121 gctggagcac cttgcgacgg tcggcacgct ggcggtccgg gccctgccga tcgtgctact 2373181 gacggcattg gtgttcttca acacctatgt ctggctgatg gccgccaaca tcaacggcga 2373241 gcggctgacg ctggcgatgg tttttctgct cgccatcgcc ggggcgttcg tcgtgtccaa 2373301 gacggtggaa cgggtgcgtc cgctgcttcg ctcaacgacg gtgatgcccc aaggcagcca 2373361 aagcctggcc ggcacaccct tcgcgaccat gggcgacccc tctcccggct tccccctcac 2373421 ccgggccgaa cgcctcaacg tggtcttcct gctggcggcc tcgcaactcg tcgagatcct 2373481 ggtagtggcg tcggtcggcg ccgcgatata cctcgttctg ggcatgatca ttctcactcc 2373541 gccgctgctt cgggaatgga cgcactacga ttcgatgacc acgacggtgc tcggcatgac 2373601 gttcccggcg ccggattcgc tcatccgtat gtgtcttttc ctgggcgcgc tgacgttcat 2373661 gtacatcagc gcccgcgcgg tcgacgacgc cgagtaccgc gcgatgttcc tcgaccctct 2373721 gatcgacgac ctgcacaccg cgctgctcgc gcgcaaccgc taccgcaaca acgtggtgac 2373781 cgcgccgtgc gccggtgttg acgccggtca cgtcgatgac taggttcacc ctgatgtcgg 2373841 ctcccgaacg ggtaaccggc ttgtccgggc aacgttacgg ggaagtcctt ctcgtaacac 2373901 ccggggaggc cggtccacag gccaccgttt acaacagctt cccgcttaac gattgtccgg 2373961 ccgagctgtg gtccgcgctc gatccgcaag ccctagccac cgaacacaaa gcggccaccg 2374021 ccctgctcaa cggtccgcgc tattggttga tgaacgccat cgagaaggcg ccccagggcc 2374081 cgccggtgac gaagaccttc ggcgggatcg agatgctcca gcaggccacg gtgctgctgt 2374141 catcgatgaa ccctgcccca tacaccgtca gccaggtcag ccgcaacacg gtctttgtgt 2374201 tcaacgccgg cgaagaggtc tacgaactgc aggaccccaa gggacagcgc tgggtgatgc 2374261 agacgtggag tcaagtggtg gaccccaacc tgtcccgagc cgacctgccc aagctgggtg 2374321 aacggctcaa cctgccagcc gggtggtcct atcatacccg cgtgcttacc agcgagttgc 2374381 gggtcgacac taccaaccgg gaggcccgcg tcctgcaaga cgacctcacc aacagctact 2374441 cgctggtgac cgcctgagcc ctacaggtac tggccgaggt tggactcggt atcaatagcc 2374501 ctgctggccg acgaactctt tccggtgacc agggtgcgga tgtagacgat ccgctccccc 2374561 ttcttgcccg agatccgcgc ccagtcatcg gggttggtgg tgttgggcaa atcctcgttc 2374621 tcggcgaact cgtcgacgat cgaatcgagc agatgctgta tacgcagtcc cggttggccg 2374681 gtctccagca ccgatttgat ggcgttcttc ttggctcggt cgacgacgtt ctggatcatc 2374741 gccccggagt tgaagtcctt gaagtacatg acttccttgt cgccgttggc ataggtgacc 2374801 tccaggaacc ggttgtcgtc gatctcggca tacatccggt cgacaacctt ctcgatcatc 2374861 gccttgatgc aggccgaacg gtcaccgtcg aactcggcga gatcgtcggc gtgcaccggc 2374921 aagaactcgg tcaggtactt cgagtagatg tcctgcgccg cttcggcatc aggccgctcg 2374981 atcttgatct tcacgtcgag gcgcccgggc cgcaggatgg cagggtcgat catgtcctct 2375041 cggttggagg cgccgatcac gatgacattc tcgagtccct ccaccccgtc gatctcgctg 2375101 agcagctgcg ggaccaccgt ggtctcgacg tccgaggaaa cgccggtgcc acgggtgcga 2375161 aagatcgagt ccatctcgtc gaaaaacacg atcaccggag tgccttccga cgccttctcg 2375221 cgggcccgtt ggaagatcag ccggatgtgg cgttccgttt ccccgacgaa tttgttcagc 2375281 agctcggggc ccttgatgtt gaggaagtac gacttcgcct cgtgggcatc gtcgccgcgg 2375341 acctcggcca ttttcttggc caacgagttg gccacagcct tggcgatcaa cgtcttacca 2375401 cagccgggtg ggccatagag caacacaccc ttgggcgggc gcagcgagta ctcccggtac 2375461 aactccttgt gcaggaacgg cagctccacg gcgtcgcgga tctgctcgat ctggcggctc 2375521 agaccgccga tgtcggcgta gctgacgtcc ggcacctctt ccagcaccag gtcttctacc 2375581 tcggctttgg ggatgcgttc gaaggcatag ccggctttgg tgtcgaccag cagcgagtcg 2375641 ccggggcgca gcttgcgcgg ccgggtgtca tcgttgaggg cctcagggag gccgtctggc 2375701 aggtcctcgg cgatcagggg atcagccagc caaacaacgc gttcctcgtc ggcgtggccg 2375761 acgaccagag cccgatgacc gtcggccagg atctcgcgca aggtggatat ctcgccgacc 2375821 gcctcgaatg tgccggcctc cacgacggtc agggcctcgt tgagccggac cgtctgcccc 2375881 ttcttcagcg atgcagcgtc aatattcggt gagcacgtca ggcgcatctt gcgacccgat 2375941 gtgaacacat cgaccgtgtc gtcgtcgtgc gtggccagca ggacgccgta gccactgggc 2376001 ggctgcccca gccggtcaac ttcctcgcgc agcgccagca gttgttgacg ggcttcttta 2376061 agagtttcca ttaatttgga attgcgggca gcaagtgagt cgatacgggc ttcgagttga 2376121 tgtatatcgc gggcagagcg cgtcggggca tgtgatccga cggcgttctc aagttgctcg 2376181 cgcaggaccg cagcctcgcg ccgcagctgt tctaattcgg cggcatcacc actggacagc 2376241 gggctatccc gggggatgcc gaatgcctca gaacgctctg actcacccat gttgcgctcc 2376301 tttcccacgc caggaatcgc gcggcggata ctccaacgct accggcgatc ggcgcttcat 2376361 gttggcagtc gaatgccgat ggaaagtaac aacttgtatc gctggtaatc tcggccccga 2376421 atccagccga tcaaacggac cttgagagga gcactgtgac cgcgaaatcc ctagccacag 2376481 gcgtagtggg cgacgcggcg atcagtgcgg cggccgccgc cgagacttct gctgcattcg 2376541 caagcggccg gtagccgagc gtgtcgctgg atgcgccgaa acatccgtgt aaccctgggc 2376601 gccgccacca tcgtggcggc gttagggctc tccgggtgtt cacaccctga gttcaagcgt 2376661 tcgtcgccgc ctgccccgtc actgccgccc gtcacgtcga gcccgctcga ggccgcgccg 2376721 atcacgcccc tgcccgcacc cgaagccctg atcgatgtgc tgtcccggct cgccgacccg 2376781 gccgtgccgg gcaccaacaa ggtgcagctc atcgagggcg cgacccccga aaacgccgct 2376841 gccctggaca ggttcaccac cgcactgcgt gacgggagct acttgcccat gaccttcgcg 2376901 gccaacgaca tcgcatggtc ggacaacaag ccgtccgacg tgatggccac cgtcgtcgtc 2376961 accactgccc atccggacaa ccgcgagttc acgtttccca tggaattcgt gtccttcaag 2377021 ggcggctggc aattgtctag gcagaccgcg gaaatgctgc tggccatggg taactcaccg 2377081 gattcgactc cgtcggctac cagcccggcg ccggccccat caccgactcc ccctggctga 2377141 gctcccgatg tggattggct ggctggaatt cgacgtgctg ctgggcgacg tgcgctcact 2377201 caagcagaag cggtcggtga cccgccccct ggtcgccgag ttgcagcgca aattcagcgt 2377261 gtcggccgcc gagaccggtt cgcatgatct gtaccggcgg gcgggcatcg gtgtggccgt 2377321 ggtgtccggt gaccgcagcc acgccgtcga tgtcctcgac aacgccgaac gtctggtagc 2377381 cgcacatccg gagttcgagt tgctgtccgt gcgccggggc ctgcaccgca ctgacgacta 2377441 agtggactgg ctcccagctg tgtctcccgc tacccgtcgc gtccctcgcg cttacgacct 2377501 agcggcgccg gagccacagc ccccggcgcc aaccggcgcg ttgctaccag gaacgcggta 2377561 tgcccgcgca tcgaatgctg cggccgaacc gccaacccta cgacgttcca gccccgctgc 2377621 agcgtctccc aggctctcgg ttcggtccag cactgcttgg cccgcagtgc ctccacgatc 2377681 ctcgacagct gagtgacggt ggccacgtag accatcagca ctccgccggc gaccagcagc 2377741 cgcgataccg cgtcgagcac ctcccacggc gccagcatgt cgagcacggc ccgatcaacg 2377801 gatccgtcgg gcagttcgga gtcggcgagg tcgctgacga ccagtcgcca gttgtccggc 2377861 ggctggccgt agcagccgct cacattgcgc cgggcgtgtt cggcatgatc ggcgcgctgt 2377921 tcgtaggaga tcacctgtcc ggccggccca accgcccgca gcaaagacaa ggtcagagca 2377981 ccggatccgg ctcctgcctc cagcacccgc gcgccgggaa atatgtcgcc ctcatgcacg 2378041 atctgggccg catctttggg atagatcacc tgcgggccgc gcggcatcga catgacgtag 2378101 tcgaccagca gcgggcgcag caccaggaac agggcgccgt tgctggattt gaccacgctg 2378161 ccttgctcca acccgatcac cgcgtcgtgg gcgatcgagc cacgatgagt gtggaattcg 2378221 gcaccgggag tcagcgacat ggtgtagcgg cgccccttag cgtcggtgag ctgaacacgt 2378281 tcgccgatgc tgaatgggcc ggttgctgac acgccgtcta gcgtgccagc cgactcgccg 2378341 cgatcggtgc tcggggttgt cggcgcccaa ccctaagctg cggacatggc cgaccagccg 2378401 gacccgccca caccacggcc ggcgttatca ccgtcacggg cgacggactt caagcaatgc 2378461 ccgctgctat accggtttcg cgcgatcgac cggctacccg aggcgacgtc ggcggcgcag 2378521 ttacggggtt cggtggtgca cgccgcgctt gagcagctct atgggctacc cgcggggctg 2378581 cgcagcccgg atactgcgag gtcactggtg cagcgcgctt gggaccagat ggtcgccgcg 2378641 gagcccgaac tggccggcga actggacccc ggacaaccaa cccagctgct ggaggacgcc 2378701 cgcgcgttgg tgtccggcta ctaccggctg gaagacccga ctcggttcga cccgcaatgc 2378761 tgcgaacagc gggtggaggt cgaactggcc gacggaactc tgttgcgcgg ctacatcgac 2378821 cgcattgacg tcgccgccac cggcgagctg cgggtggtcg actacaagac tggcaaggcg 2378881 ccgccggcgg cgcgggcgtt ggcggagttt aaggcgatgt ttcagatgaa gttctacgcg 2378941 gtggcgctat ttcggtcgcg cggcgtgccg cccacccggc tgcggctcat ctatctggcc 2379001 gacggccagc tgctcgacta ttcaccggac cgcgacgagc tattgcgttt cgaaaagacg 2379061 ttgatggcga tttggcgtgc tatccaatcc gcaggcgaga caggcgattt ccgccccaac 2379121 ccatcgcggc tctgcgattg gtgcccgcat caacagcgct gcccggcctt cggcggaaca 2379181 ccaccgccct atccagggtg gcccaccgag ccggcggcat aaacgatcgc gtcgaagtgc 2379241 ggtgtcatag ggccgccgcg gcggcgacga tggcaaaccc gcccaacacc gcgaccgaat 2379301 cctcgagcag cgcgatcggc aggtcgtggc cgccacgggc agccaccagc ctcgtacgtg 2379361 cctgatagcc gcccatggtg ccgagcacgg cgccgataac cccagcgcca agcccgcccc 2379421 accggtagcc ccacgcggtg ccgatgaccg cgccggcgaa cgcgcccaaa atgatccgga 2379481 cagcgaacac cggcgtcacg gtacgcggcg gtgttttggg acgtttgtcg ttaacgagtt 2379541 cggcgaccgc aagaacgctg acgatcacca cggtcacgaa attgcccatc caggatgccc 2379601 aggttccatg caggttgatc cagccgagaa aggcggccca ggagaccacg gccggggccg 2379661 tcagggaacg caacccggcg acgacaccga taagcagcgc cagcagcaga acaaggacat 2379721 gcgtcacagc gatccctcct gacacagacg ttatgggcaa tcaggcccca gcggacgcta 2379781 acacagcgtg ggccccgcca caggatcaga atcggcagaa cctgatgtcc gacgccagaa 2379841 tcgctttggc cccgatcgca gcgagctcat ccatgatgcc gttgacgtcc cggcgcggca 2379901 ccagggcgcg gattgccacc cagtccgggt cggccagcgg ggcgatggtc ggtgactcca 2379961 gccccggcgt gatcgccgtg gccttcttca acgccgagcg cgggcaatcg tagtcgagca 2380021 tcagatactg ctggccgaag accaccccct gcacccgagc gaccagttga tcgcgcgcct 2380081 cggtctggtc ttggccgtcc gtaccggccc gctcgatgag caccgcctcc gaatcgcaca 2380141 gcggctcacc aaaggccacc aggtcgtgct ggctcagcgt gcgacccgac cccaccacat 2380201 cggcgatggc atcggccacc ccgagctgca ccgagatctc cacggcacca tcaagtctga 2380261 tgaccgttgc ttcgattccc ttggtggcca gatctttccg gaccagattc gggtaggcgg 2380321 tggcgatccg catcccggct aggtcggcag tcgtccagtt ccgcccggcg ggagcggcat 2380381 agcggaagct ggacgacccg aagcccagcg ccaggcgttc ccgaacctgt gcaccggaat 2380441 cgcacaccag gtcgcgtccg gtgatcccga agtcgagctc tcccgaaccg acatatatgg 2380501 caatgtcttt gggccgcaag aagaagaact cgacgttgtt gaccggatcg atgacggtca 2380561 agtctttgga atcggtgcgg cggcggtagc cggcctccgc gaggatctcg gtggccggct 2380621 cgctcagcgc acccttgttg ggaaccgcga cccgcagcat gctcacagct ttcgatagac 2380681 gtcgtcgagg gacagtccac gggagatcat cagcacctgc gtccagtaca gcaactggct 2380741 gatctcctcc gccagtgcgt cgttggattc gtgctcggca gccagccaca cctcgccggc 2380801 ctcctcgaga agcttcttac ccagagcatg aaccccgccg tccaatgccg ccaccgtggt 2380861 gctgtcggcc ggccgggtgc gggcacgatc gccgagttcg gcgaacagat cctcgaaggt 2380921 cttcacggcc agcgattgtt gcacgtgtca gccagccaag tcacggtggt ttgacgccac 2380981 acgttcgcca ccgccgcgcc gcgcattagg gcatcctaat ataggttagg ctaccctagt 2381041 tattcctgtg gtcgaaggag gcagccgaac gtgaccttcc cgatgtggtt cgcagttccg 2381101 ccggaagtgc cgtcagcatg gctgtccacc ggcatgggcc ccggtccgct gctggccgcg 2381161 gccagggcgt ggcacgcgct ggccgcgcaa tacaccgaaa ttgcaacgga actcgcaagc 2381221 gtgctcgctg cggtgcaggc aagctcgtgg caggggccca gcgccgaccg gttcgtcgtc 2381281 gcccatcaac cgttccggta ttggctaacc cacgctgcca cggtggccac cgcagcagcc 2381341 gccgcgcacg aaacggccgc cgccgggtat acgtccgcat tggggggcat gcctacgcta 2381401 gccgagttgg cggccaacca tgccatgcac ggcgctctgg tgaccaccaa cttcttcggt 2381461 gtcaacacca tcccgatcgc cctcaacgag gccgactacc tgcgcatgtg gatccaggcc 2381521 gccaccgtca tgagccacta tcaagccgtc gcgcacgaaa gcgtggcggc gacccccagc 2381581 acgccgccgg cgccgcagat agtgaccagt gcggccagct cggcggctag cagcagcttc 2381641 cccgacccga ccaaattgat cctgcagcta ctcaaggatt tcctggagct gctgcgctat 2381701 ctggctgttg agctgctgcc ggggccgctc ggcgacctca tcgcccaggt gttggactgg 2381761 ttcatctcgt tcgtgtccgg tccagtcttc acgtttctcg cctacctggt gctggaccca 2381821 ctgatctatt tcggaccgtt cgccccgctg acgagtccgg tcctgttgcc tgccgggctg 2381881 accgggcttg ccgggctcgg tgcggtatcg gggccggccg gaccaatggt cgaacgtgtg 2381941 cactccgatg gtcccagccg gcaaagctgg cctgcggcca ccggagtcac cctggtgggt 2382001 accaacccgg ctgccctggt taccacgccc gcacccgctc cgaccacgtc cgcggcaccg 2382061 acggcaccgt cgactcccgg atccagtgcc gcccaaggcc tttacgcggt cggtggtccc 2382121 gacggggaag ggttcaaccc gatcgccaag acgacagcac tcgccggtgt taccaccgat 2382181 gccgccgcac ctgccgccaa actgcccggc gaccaagctc agagcagcgc cagcaaagca 2382241 acaagactgc ggcgacgtct ccggcaacac cgcttcgagt ttctggccga cgacggccgc 2382301 ctgaccatgc caaacacacc ggagatggca gacgtcgccg ccggcaaccg tggattggat 2382361 gcgctggggt tcgccggcac gatcccaaaa tcggcgcccg gatcagcgac cgggcttact 2382421 cacctaggcg gcggattcgc cgacgtcctg tcgcagccga tgcttccgca cacgtgggac 2382481 gggtcagatt aaacgttgaa gtacttggct tccggatggt gcaggacgaa cgcgtcggtc 2382541 gactgttcgg gatgcagctg taattcctcg gataacgtca caccgatgcg ttcgggctcc 2382601 agcagcgcca tcatcttggc gcggtcctcc agatccgggc atgcgccgta gccgaaggca 2382661 aagcgagcac cgcggtagcc gagcttgaaa tagtcttctt tcgcctccgg atcctcggcc 2382721 gccatcgccc gatccccgga gaacttgagc tcctcacgga tccgccggtg ccagtactcg 2382781 gccagcgcct cggtgagctg cacgccgata ccgtgcacct ccaggtagtc gcggtaggcg 2382841 ttggacgcga acagctcgtt ggcgaaatcc gcgatcggct gacccatggt caccagctgg 2382901 aacggcagca cgtcaacctc gccacgctcg gcggccagct cccgcgagcg gatgaaatcg 2382961 gcaatgcaca aaaaccgacc gcgctgctgg cgcgggaagt gaaaccggta gcgcaccggg 2383021 gcgtcgggct tgggctcggt gagcaccacg atgtcgttgc cctcggacac cgccgggaaa 2383081 tagccgtaca ccacggcggc gtgcgccaag atgccgtcgg tggacagccg gtccaaccag 2383141 taccgcagcc gcggccggcc ctcggtctcg acgagatctt cgtaggacgg accctcaccg 2383201 ccgcgctggc cgcgtaaacc ccactggccc aaaaacaatg cgcgctcatc gagcagaccg 2383261 gtgtagtcgg ccaccgccag gcccttgacg atccgcgaac cccagaacgg cggcgccggg 2383321 acctcgatgt cggccgcgac atcggagcgt tcgggcacct cgactggttc ttcggcggct 2383381 ttgcgctgtg cggcaatgcg tttggatcgc tggtggcggg ccttacgttc ggcttctttc 2383441 tcacgcgcct taatggcttc cgggctgttt tcgtcgggcg cctcgccgcg cttggcgctc 2383501 atgatggtgt ccatcaactt caggccctcg aaagcgtctc gcgcgtaatg cacttcgccc 2383561 tggtagatct cggccaggtc gttttcgaca tagctgcgcg tcaacgccgc gccgccgagc 2383621 agcaccggga acttttcggc gactccccgg gtgttcatct cctcgaggtt ttccttcatc 2383681 accacggtcg acttcaccag caggcccgac atgccgacca cgtcggcgct cttgtcctcg 2383741 gcgacttcga ggatggtggc gattggctgc ttgatgccga tgttgaccac ttcgtagccg 2383801 ttgttgctca agatgatgtc gaccaggttc ttgccgatgt cgtgcacgtc gcccttgacg 2383861 gtggccagca cgatgcgtcc cttgcccgaa tcgtcgtccg agcgctccat gtgcggttcc 2383921 agatacgcga cggcggcttt cattacctcc gccgactgca gcacgaacgg cagctgcatc 2383981 tggccggagc cgaagagctc gccgaccgtc ttcatgccgg ccagcagatg ttcgttgatg 2384041 atctgaagcg gcggcttttg cgtcatcgcc tcgtcgagat cggcgtccag gccgttgcgc 2384101 tcgccgtcga cgatgcgttg ggccagccgt tcgaacagcg gcagcccagc tagttcagcc 2384161 agtcggtcct ctttcgagga ggccgccgac acgccttcga acagccgcat cagctcctgc 2384221 agcggatcgt agtcctcgcg gcggcggtcg tagaccagat ccagggcgac gttgcgttgc 2384281 tcctcgggaa tccggttcat cggcaggatc ttcgacgcgt gcacgatcgc cgaatccagc 2384341 cccgcttctt ggcattcgtg caggaacacc gagttgagca cctggcgcgc tgcgggattg 2384401 agaccaaacg agatgttgga cagaccaagt gtggtctgca catccgggtg gcgctttttc 2384461 agttcgcgga tcgcctcgat ggtctcgatg ccgtcgcggc gggactcctc ctgaccggtg 2384521 gcgatggtga acgtcaaggt gtcgatgagg atggatgatt cgtcgacgcc ccagttgccg 2384581 gtgatgtcgt tgatcagccg ctcggcgatc tcgaccttct tctgcgcggt gcgggcctgg 2384641 ccctcttcgt cgatggtcag cgcgaccacc gccgcgccgt gctcggcgac cagcgccatg 2384701 gtcttggcaa agcgcgattc cgggccgtcg ccgtcctcgt agttcaccga gttgatcgcg 2384761 caacggccac ccagatgctc caaacccgcc tgcagcaccg cggtttcggt ggagtccagc 2384821 atgatcggca gcgtcgagga cgtggccagc cggctggcca gcgccttcat gtcggccaca 2384881 ccgtcgcggc ccacgtagtc cacacacagg tccagcaggt gggcgccgtc gcgggtctgg 2384941 tccttggcga tgtccaggca cttctggtag tcctcggcga tcatcgcctc acgaaaaccc 2385001 ttggagccgt tggcgttcgt tcgctccccg atcaccagaa ccgaggcgtc ctgggcgaac 2385061 gggattgcgg tgtacagcga cgacaccgac ggctcgtagc tgacctgtcg ctcgggacgc 2385121 ttgatgttcg caaccgcggc agccacttcg cggatatggg ccggggtggt gccgcagcag 2385181 ccaccgacca gcgagagccc gaactcggcg atgaagccgg ccagcgcctc ggccaattcg 2385241 tcgggcagca acggatattc ggcgcccttg gcgcccagca ccggcaaccc ggcgttgggc 2385301 atcaccgaca ccgggatgcg ggcgtgccgg gacaggtggc gcaggtgctc gctcatctcg 2385361 gccggacccg tcgcgcagtt caagccgatc atgtccacac cgagcggctc gacagcggtc 2385421 aacgccgccc cgatctcgct gcccagcagc atggtgccgg tggtctcgac ggtgacgtgg 2385481 gcaaacaccg gaatgtgccg cccggcccgc gtcatcgccc gccgcgaccc caacaccgcc 2385541 gccttcagct gcagtaggtc ctggcaggtt tccaccagga tggcgtcggc tccgccgtcc 2385601 agcatgccca gcgcggcctc ggtgtaggcg tcgcggatca ccgcgtattc ggtgtggccc 2385661 agagtcggca gcttggtgcc cggccccatc gaccccagca cgtagcgctt gcggtcggga 2385721 ctgcccagct cgtcggccac ccggcgtgcg atcgcggtgc ccttctgtga tagatcgcgg 2385781 atcctgtcgg cgatgtcgta gtcgccgagg ttggacaggt tgcagccaaa cgtgttcgtc 2385841 tcgacggcgt cggcgcccgc ttcgaaatag ttgcggtgaa tggtttccag cacgtcaggg 2385901 cgggtttcgt tgaggatctc gttgcagccc tccaggccgc ggaagtcgtc gagcgtgagg 2385961 tccgcggcct gtagttgggt tcccattgca ccgtcgccga ccatcactcg ctgcgacaag 2386021 acgtcgagca gatcggtgtc gtagaggtgc ttgtcggccg cagtcacatg gcaaggatag 2386081 tcggcctatg aaatttcctc agtcgttgac agcgctctgc caggtaccgc gacgtcgcat 2386141 cggtcacagc tgccacaaga gtctcagctg aggcaggcac acaacgtgcc cacctcagcg 2386201 cgacaaagcg tggccatcgc tactagccgg gccgcctcag acgacgtgca cggttcgcat 2386261 cgtcgcccgg gtggacgccg taggctgacc aggtgacccc atcggagggc aacgcaccgc 2386321 tgcccgaact gcacaacacc gtcgtcgtgg ctgcgttcga gggctggaac gacgccggcg 2386381 acgcggccgg cgatgccgtg gcacacctgg cggccagctg gcaagcactg ccgattgtcg 2386441 agatcgatga cgaggcctac tacgactacc aggtcaatcg gccggtcatc cgccaagtcg 2386501 atggggttac ccgggaactg cagtggccgg ccatgcggat ctcgcactgc cgcccacccg 2386561 gcagcgaccg cgacgtggtg ttgatgtgcg gggtggagcc gaatatgcgc tggcgcacgt 2386621 tttgcgacga gttgctggcg gtcatcgaca aactcaacgt ggacaccgtg gtgatcctgg 2386681 gggcgctgct ggccgacacc ccacacaccc ggccggtgcc ggtctcgggc gcggcctact 2386741 ccgcggcgtc ggcgcggcag ttcggccttc aagaaacacg ctacgagggc cccaccggca 2386801 tcgccggcgt cttccaatct gcctgtgtgg gggccggcat cccggcggtg acgttttggg 2386861 cggcggtgcc gcactatgtg tcgcacccac cgaacccgaa ggcgacgatt gcgttgctgc 2386921 gccgggtcga ggacgtgctc gacgtcgagg tgccgttggc ggacctgccc gcacaggccg 2386981 aagcgtggga gcgcgagatc accgagacga tcgccgaaga tcacgagctg gccgagtacg 2387041 tgcagacgct ggaacagcac ggcgacgccg cggtggacat gaacgaggct ctcggcaaca 2387101 tcgacggcga cgcgctggcc gccgagttcg agcgctatct gcgccggcgc cgcccggggt 2387161 tcgggcgcta gagggaggtt gcgctgcggc ggacgacggt gtcagccggg cggcccagga 2387221 tcgccggaat caccctgagt gcccggagcg ccggctttgc cgggattcgt gcctgtcgac 2387281 gtaccaccgc cggcaccagc ctcgccgcgc gcaccgccgc cgccgccggc cccgccctcg 2387341 ccgccgctgc ccatggcgcc gtgggcgccc gagtggccac tgagccagcc gcccgccccg 2387401 cccgcgccac cggcaccgcg ggcacctccg gcgccgccgg tgccaccatc gccaccatcg 2387461 ccgccgtcac cgccgcggcc accgaaggcg ttaccacccc caaaggcgct ggcaacgccg 2387521 ccgccaccgg tcccgccggc ccctccccca ccgcccacac cgccggcacc gccaatgccg 2387581 ccggcaccgc cagccccgcc gtcaccgatc agcagcccgc cggccccgcc agccccgccc 2387641 gcgccgccgg caccacccat gccgccagag gctccggtat tcccgttgcc gccgaagcca 2387701 ccgcggccac cttgggcaaa gcccccagtg aattcgtcgg cgaacccacc tttcccgccg 2387761 tcgccaccga ggccgccggg agcgccggcg cctccgatgc cgccattgcc accggcgccg 2387821 ccggccccgc cgttaccgat caatcccgca ctgttgccgg ccccacggtc ctgaccggcg 2387881 gcacccgccc caccgtgccc gccattgcca tacagcaatc cacccggccc accgggctgg 2387941 cccggcccgc cgttagcgcc gtcgccgatc aacggccggc caacaatgcc tgggtgggcg 2388001 cattgacggc attgagcact tcgtgtgcca acgtggcgtt ggcggtctcg gcggccgcgt 2388061 accacccgcc acccgaggtt agggccgcga caaactgcgc gagatacgcg gccgcctgcg 2388121 cgctgatctg ttggtattcc tgcgcattcg cgccgaacag cgccgcgata gccatcgaca 2388181 cctcgtcggc ggcggtcggc cgccaacgcc gtcgtcgggc ctgccacgag cgcattggcc 2388241 gcgctgatcg ccgaaccgat ccccgccaca tctgcggcgg ccgcggtcag aaagaaggtt 2388301 gcgcgattac gaacgacatg tagtctccaa ccgtttacgg ccgcccggca aggacctaac 2388361 gaaccgttaa gtaggcggcg acagcgcgaa cgctaccgtg accgcactcg cgcgacccca 2388421 cactaggaag cagcactaat gattttctta tcttctccgc agcatcgacg gcgccagccg 2388481 acgttgcggt gtgtgcgggt acgattccgg tggagttgcc gccaccccta gagtgggcga 2388541 ggatcgcaag agcaaattcc gcgccggtag gacaacgata ggaccgccat tacgaagccg 2388601 cccgagactc ctgaattgag cgcggcctca cagcgtgtcg gcgccttcgg cgaagaggcc 2388661 ggctatcaca aaggcctcaa gccccgacaa ctgcagatga tcgggatcgg cggcgcgatt 2388721 gggaccggcc tgttcctcgg cgccggcggc cggcttgcca aggccggacc tgggttgttc 2388781 ttggtgtacg gcgtgtgcgg ggtttttgtc ttcctgatcc tgcgggcgct gggtgagctg 2388841 gtgctgcacc gtccgtcgtc aggctcgttt gtgtcgtatg cacgtgaatt tttcggcgag 2388901 aaggccgctt acgcggtggg ctggatgtac ttcctgcact gggcgatgac gtcgatcgtg 2388961 gacaccaccg cgatcgccac ctacttgcag cgttggacga tcttcacggt ggtcccgcaa 2389021 tggattcttg ccctgatcgc cttgacggtg gtgttgtcga tgaacctgat ttcggtcgaa 2389081 tggttcggcg agctggagtt ttgggccgcg ctgatcaagg ttctcgcgct gatggcgttc 2389141 ctagtggtgg gaaccgtttt tttggccggg cgataccccg tcgacggcca cagcaccgga 2389201 ttgagcttgt ggaacaacca tggcgggctg ttcccgacaa gctggctgcc gctgctgatc 2389261 gttacctcgg gagtggtgtt cgcgtactca gcagtcgaat tggtagggac ggcggccggg 2389321 gagaccgccg agccggagaa gatcatgccg cgggcgatca attcggtggt cgctcgcatc 2389381 gcgatctttt atgtcgggtc ggtggccctg ctagcgctgt tgctgccgta taccgcctac 2389441 aaggccggcg agagcccgtt cgtcacgttc ttttccaaaa tcggtttcca cggtgccggt 2389501 gacttgatga acatcgttgt gcttaccgcc gcgctgtcga gcctgaacgc ggggctgtat 2389561 tcgaccggcc gcgtcatgca ttcgatcgcg atgagcggca gcgccccaag gttcaccgcg 2389621 cgaatgtcga aaagcggtgt gccctacggc gggatcgtgt tgaccgcggt catcaccctg 2389681 ttcggtgtcg cgctgaacgc cttcaagccc ggtgaagcct tcgagattgt gctcaacatg 2389741 tccgcgctgg gcatcatcgc gggttgggcc accatcgtgc tgtgtcagct tcgacttcac 2389801 aagctggcca acgccgggat catgcagcgg ccgcggttcc gcatgccctt ctccccctac 2389861 agcggctacc tcaccttgct cttcttgctt gtcgtgctgg ttacgatggc gtccgacaaa 2389921 ccgatcggca cctggacggt ggcgacactg attattgtca ttccggccct gaccgcaggc 2389981 tggtacctgg tacgcaagcg tgtcatggcc gtcgcccgcg aaaggctggg tcataccggg 2390041 ccatttccgg cggtcgccaa cccgcccgtg aggtcaagag actgatgctt cgaagaggtg 2390101 aatcgatcat ccgcaaccgt tacgccagta agccaccact gtacggaatg gcaatggtct 2390161 tcttggccat ggccgtcgtc gccgtgaccg cgtactttcg catgggctgg tggtcgatca 2390221 tcggttacgc cgccgctgcc attatcggag tgatcgggtt cgcactcgcc ttccgcgacc 2390281 tgtcctgaat cgagcgcgac agaacctcta ggaattctcg agtgattcgg tgtaggcgct 2390341 ggcaaagcgg ccgagcgcgg cgacctcggc atccatctgg ggcatcagct tggcaacggt 2390401 gttgcggatg ggacgttggc ctacccgggt ggacaacaac ggcttgagcc aacggaacag 2390461 ggccacccag cccgggcagt acacgcgatc ttttcggccc tcaatgccgt tgacgaatgc 2390521 ggccgcacac ttgttgaccg acgtggtctt gttcaacggc caagggaggc gcgccagcaa 2390581 ttcggcgaac gcaggcaggt cggccttggt atcgcgaacc aacgcggtgt cgatccacga 2390641 catgtgcgcc gagccgacgc tgacgcccag gtgtgcgacc tcgagtcgca acgcgttggc 2390701 gaagtgctcg ttacccgcct tcgacatgtt gtagggcgcc atcccgggcg gcgccgcgaa 2390761 cgcggcaagc gacgagacga tcaatacgta accgcggcgg tcgatcagcg cgggcaacgt 2390821 cgcccgcacc gtgtggaagt tacccagcaa attgacgtcc aacacccgcc ggaacgcctg 2390881 cgggtcgacc ttcagcacgg agccgtagct ggcgatgccg gcgttggcca cgacgacgtc 2390941 gatgccgccg aatcgttcga cggccgtctc ggctgcggcc tgcatggcgg gcaggtcgcg 2391001 cacgtcggct accacggtga gtaggcggtc gtcgccgccg agttcggcgc ccatcaccgc 2391061 cagctctgat ttgctcaggt cggtcagcac cagtttggcg cccttgttgt gcagccgacg 2391121 ggcgacctca gccccgattc cccgggcagc accggtaatg aagacgacct tgccttgcag 2391181 cgatgtcatg gccgaaaacg taccgccgcg ccggctacag gtccaccccg agcagggcat 2391241 cgatcgccgt cgccaccaac ttcggcgccc cggcatcgtg gccgccgtac tccaccgcat 2391301 cggtgaccca accatccagt gcggcaatcg ctttgggcgt atcgagatcg tcggccaggt 2391361 agcggcgcac ccgagcgaca acgtcaactg cggccggacc ggcgggaagt gcggttgcgg 2391421 tgcgccaacg gtgcagccgg gcggtcgcct cgtcaagcac ctgctggctc cagaaccgat 2391481 cggctcggta gtgtccggcg agcaaaccca gccgaaccgc cgatggctca acgtcctgcg 2391541 cacgcagcgc cgacaccagc acgaggttgc cgcggctctt tgacatcttg tgcccgtccc 2391601 agccgatcat cccggcatgc acgtagtgcc gcgcgaatcg ccgttcgccg ctgacacatt 2391661 cggcgtgcgc agcggtgaac tcgtggtgcg gaaagatcag atcgctacca ccgccctgga 2391721 tgtcgaggcc gcttccgata cgactgagcg cgatggctgc gcactcgaca tgccagcctg 2391781 gccggccagg cccgaacggg gacggccagc tgggctcacc gggccgcgcg gcccgccaca 2391841 acaacgcgtc gagttcgtcg ctcttgccgg ggcgccgcgg atcgccgcca cgttcctcgc 2391901 acagccgcag catggtgtca cggtcatacc ctgactcgta gccgaactgc agggtggcgt 2391961 cagcgcggaa gtagatgtcc tggtactctc ccatttcccg gtctatgaca taggccgccc 2392021 cgcacgccag cattttttcg atgagctcga ccatttcagc aatcgcttcg gtggccccca 2392081 cgtagtcttg cggtggtagc acccgcagcg ccgccatgtc ctcacagaac agggcgacct 2392141 cggcttgggc aaggtcacgc cagtcgacac cgtcgcgatc cgcgcgctca aatagtggat 2392201 cgtcgatgtc ggtgatgttc tggacatagt gcaattcatg accgagatcc agccacagcc 2392261 gatggatcag gtcgaacgtc acataggtgg cagcatggcc cagatgcgtg gcgtcgtagg 2392321 gcgtgatccc gcagacgtac atggtggcct tagatccggg cgccaccgga cggacctgcc 2392381 ggtcggcgct gtcgtacagc cgtagctgcg ggcctcgtcc cggcaacacc ggaaccggtg 2392441 ggcaatacca cgactgcatg tcctcgactc taaacggccc ggtgactcca gcctttctga 2392501 gcagcccgcg cgccgatcag cgccacgcgt cggcgatggc accgagcagg atcggcgcca 2392561 cctcggctcg acacatcagc agatccggca ggtaggggtc cagttggttg tatcgcagcg 2392621 gcgagccatc gagtcgtgac gcgtgcatgc cggcggccaa catcacccca gccggcgccg 2392681 cggaatccca ctcccattgg cctccggcgt gcaggtaggc gtcgacgtag ccgtcaatga 2392741 cggccatcgc tttggcgccc gccgaaccga tcgacaccgg ttggatcgcc agcgtctggc 2392801 ggatgcggtg caggactgcc ggtggccggg tggcgctgac ggcaatccgc aaggtgccag 2392861 gaacgccggc cggcgcggcg ccggaagtca ccgtatcggt gcggtacacc acgttgccac 2392921 gggccggcaa cgccaccgcg gcgtcggtga tctcgggctg gccattggag gaacgccgcc 2392981 acagcgcaat gtgtaccgcc cagtcgtcgc gacccggtgt ggagaactcg cgggtgccat 2393041 ccaacgggtc aataatccac acccgatcgg atttcagccg ggccagatcg tcgtgggcct 2393101 cctcactgag cactgcgtca cccggccgtt cggcctgcag ccgtcgcaac agcagcgagt 2393161 tagcctggcg gtcaccggct tccccgagcg tccatggctg atcgaaaccg atctccgcac 2393221 gcacctggag caacagcttt cccgcgtccg ccgccaggtc ggcggccagc tcggcgtcag 2393281 tcaggtcgtc cgtcagatca ggtgcggcag ggctcaccac ttcagtatcg ccgagctgaa 2393341 cgcaggcatt tgacaatggg gcttcacatc atgatgctct ataattcgca tcttgatgca 2393401 caatagtggg atgcgaacca cggtcagtct cgccgacgac gttgccgctg ccgtgcagcg 2393461 cttgcggaag gaacgctcga tcgggctgag cgaagccgtc aacgagttga tccgtgccgg 2393521 gctcacgaaa cgacaggtcg caaatcggtt ccagcagcag acgtacgaca tgggcgaggg 2393581 aatcgactac tccaacatcg gcgacgcgat cgaaacactg gacggcccgg caagcggcta 2393641 atgctcattg acgcaaacct cctgctctat gccgtcgacg agcgtgccgc gcggcaccgc 2393701 gccgcggttg gctggctttc ggaacaactc aacggctccc gtcgggtcgg cttgccgtgg 2393761 cagagcctgg ccgccttcct gcggatcggg actcatccac gtgcgttccc gcgaccactc 2393821 acacctgccg cggcattcga catcgtcgac ctaaaacgcc ggccagggaa tgggacgatg 2393881 cccattcggc ccgggcatca ccggttggtc cagcagcgat tgcgcgcgtc ggcgcagcgc 2393941 gccgatttcg gctgcggcga ttcgcccggc caacgcctcg gcaagcgggc cgccaagagc 2394001 atcggcgagc ccggcaaccg cctgcagaat ttggtcgtca atcggcttgc cggcccaacc 2394061 ccacagcacg gtgcgcagct tgttctcgac gtgcagacac aatccatggt cgaccccgta 2394121 gacctggccg tcgatgccgc acaggatgtg accgcccttg cggtcggcgt tgttgataag 2394181 cacgtcgaac accgccatcc ggcgcaaccg gatgtcgtcc gcgtgcatca aaacgacctc 2394241 gtcaccggcg tagtcgtagg cccgcagcac cggcagatag cccggccgcg gccggtgggc 2394301 gggaaacagg tcgaccaggt cgggcccggg cagagggtcg gagtcgaccg cgtcgccggg 2394361 ttgctgcacc cagagctgta gcatgcctat gcccgccgga ccgtctcgga tgatggtgtg 2394421 cggcaccagg ttccagccca actgtgtcga caccagatag gcgctgagtt cgcggccggc 2394481 cagcgttccg tcggggaaat cccacaacgg ccgctcgccc gagaccggct tgtagacgca 2394541 atgcaggctg cgcagaccca gcgtggactc acacaaaaag gtggcgttgc tcgccgagcg 2394601 gatccgcccg aggactgtca gctcgccgtc ggccaacacc gcatgctcgt catcccgcag 2394661 ggtcatcgcc agacccgagc agcacatcgc gccggtaacc gttggtgcgc gcacagatgt 2394721 gtccctcggg atccagcggt tcatcgcaga gcgggcacgg cgggcgtccc gcagagatga 2394781 cgcggtagga ccgagtagcg aactgtcggg cggactccgg cgtcagaaat acccgcaccg 2394841 cgtcgggccc ttcctcggtg tcgtcgagca ccacggaagc gtcgaactcc gcgtcggtga 2394901 cggccagcag ttcgaccacc acgctctgcg cctccgaatc ccagcccagc cccatcgtcc 2394961 cgacccgaaa ctcggcatcc accggcatga tcagggggct gaggtcgtcg atctcagtgg 2395021 gttccggggg caccggggtg ccgaaccggc ggttaacctc gaacagcagc gctccgatgc 2395081 gctcggcgag caccgcaacc tgctgcttct ccaggaccac cgacaccacc cgggagtcgt 2395141 gcaccgcctg taggtagaac gtgcggtttc cgggctggcc aacagtcccg gcgacgaaac 2395201 ggtcgggtgt gcggaatacg tgaattgcgc gggccatggc acctccaaaa taccgcgcag 2395261 acgccgttgc cgcgttcttc gtcgacggtc accccacacg ctagtcggtg gaaccgccga 2395321 tcaccgcgtc gccgggtggc accgccgcgt tcggctccgg tgacgcaccc tgggcggaag 2395381 ccgcggcctg caaagccggg gccagccggg cgccggtgtg gttgacgtgc agcacgaacg 2395441 ggcgcagctg ggtgtagcgg acgacactca ccgaacccgg gtcggcggtg attcgctgaa 2395501 agctgtccag atgcataccg aacgcgtctg cgatcaccgc cttgatgaca tcgccatggg 2395561 tgcaggccag ccacagcacg tcgtggccgt gctgatcggc cagccgccgg tcgtgttcgc 2395621 ggacggctgc cacagcgcga gtctgcacct gcgccaaacc ctcaccgccg ggaaacaccg 2395681 ccgcgctggg gtgggcctgg actacccgcc acaacggctc gtcgaccagg tcaccgattt 2395741 ttctgccagt ccattcgccg tagtcgactt cggagaaccg gtcatcgatg agcggctcca 2395801 ggcacagcgc ctcggccagc ggttcgacgg tgcgttgaca ccgcagcatt ggagaagacg 2395861 cgaccgcccg gatcggcagg tcaccaattc gatcgatcaa cccggtggcc tgctcgcgcc 2395921 ccttctcgtc gaggtcgacg ccggaccggc cggccagcac gcccgcggtg ttcgaggtgg 2395981 aacgggcatg gcgtagcaag atgacggtca tgtcgcggct accgtcccgg tagccagcag 2396041 cacgagcatg cccgtcccga cgagcacccg gtagccgacg aaccagtaca tgttgtgtcg 2396101 caccagaaac cgcagcagcc aggccaccgc ggtcagaccg aggacgaacg cgatcagggt 2396161 ggccaccagc aactgcgggc cagtagcgct catgccctcg gttaccgggt ggaatgcgtc 2396221 gggcaacgag aacaacccgg aggcgaacac cgctggaatg gccagcagga atccgaatcg 2396281 ggcggccagt tcacggtcga gtccgagaaa cagtccagcg ctgatggtcg acccggacct 2396341 ggataccccg gggaccagcg ccagggtttg ggcaatacca accaccacgg catcccgcca 2396401 ggtcaaccgc tcaatgtgac gactctggcg ccccacgtat tcggcgagtg cgatcacccc 2396461 ggaaaacacc accagcgcgg tcaccacgac ccacaggttg cggacgcccg accggatgtc 2396521 gtctttgaag aacaggccca gaatgcagat cgggattgtg ccgatgatga cataccagcc 2396581 cagccgataa tcggtgtttc gatgtgcctt cacgaccagg ccgtgcaacc aagcgctcag 2396641 gatgcgcaca atatcgcgcg caaagtagat cactacggcg gcctcggtgc ccaactggct 2396701 cacggcggtg aacgaggcac cggcgtcgcc gctgaagaag atccgcgaca cgatcgccag 2396761 atgtcccgag gacgacaccg gcaggaactc ggtcaaaccc tgggccgcgg ccaacacgat 2396821 gacttgccac caagacatcg ccggagccgc ggtcacgacg acgacggtac ccggttaccg 2396881 gcggccggta gacggatgcg cctaacgcga caccggcgcc gcggatgcca ccgagtcgcg 2396941 cacggccgca gcaaggctgc gttcgtcggt caggtcaatg tcgaccaggc cacggaccgc 2397001 catcgcgacc acatcctcag cctgcggcac cggaccggcc acgcgcggtc gatagatttc 2397061 gacgacgagc gagcgatgct cgatgtggaa ggagaaactg cgtccatcgc cgacttgccc 2397121 atacccgctg gcgaaaattc ccgtcgatat atcttcaaca gcaaactctt cgcgcttgtc 2397181 tgcgagatgc cggtcagcgg tgacggtcat gcccagagaa tacctctgga gtaccatttc 2397241 ccgtgggcga catgacgaga ttgaaagcaa cttgccagat tcggattcgt gagaggttga 2397301 cttcatgttt cgcatccgaa ggctgaccgt tgctaacagg gaataaacca gcagttcaac 2397361 gccgcttcat cggcctgttg atgttgtcag tcctggtcgc aggctgttct tcgaacccgc 2397421 tggctaactt cgcacccggg tatccgccca ccatcgaacc cgcccaaccg gcggtgtcac 2397481 cgcctacttc gcaagacccg gccggtgcag tgcgaccact gagcggccac ccccgggcgg 2397541 cactattcga caacggcacc cgccaattgg tggctctgcg cccgggcgcc gattcggcgg 2397601 cacccgccag catcatggtc ttcgatgacg tgcacgttgc accgcgcgtc atttttctgc 2397661 cgggcccggc agccgcgttg accagcgacg accacggcac ggccttcctt gccgcccgcg 2397721 gcggctactt cgtggccgac ctgtcctccg gtcacaccgc acgagtgaat gtcgctgacg 2397781 cagcgcacac cgatttcacc gcgatcgccc gccgctccga cggcaagctg gtgctgggca 2397841 gcgcagatgg cgccgtctac acgcttgcca agaaccccgc agttgacccg gcgtccggcg 2397901 ccgccaccgt agccagccgg accaagatct tcgcgcgcgt ggatgccctt gtaacacaag 2397961 ggaatacaac cgttgttctg gatcgtggcc agacctcggt gaccacgatc ggcgccgacg 2398021 gtcatgccca gcaggcactg cgcgccggcc aaggtgcgac gaccatggcc gccgatccgc 2398081 tgggccgggt gctgatcgcc gacacccgtg gtggccaact actggtgtac ggcgtcgacc 2398141 cgctgatctt gcgccaggcc tacccggtgc ggcaggctcc gtacgggctg gccggatccc 2398201 gcgaattggc gtgggtgtcc caaaccgcgt ccaacaccgt cattggttac gatctgacca 2398261 ccggaatacc cgtagagaag gtgcgttacc caaccgtgca acaacccaac tcgttggcct 2398321 tcgacgaaac gtcggacacc ttgtacgtgg tgtcgggatc cggtgccggg gtccaggtca 2398381 tcgaacacgc ggcgggcacc cgatgagcag ccgacccgcg gcgcggcgga cctggttgcc 2398441 taccggctgg gattccgaga tgtccgacga gtacgagtgg gcgccattgc gcctaccgcc 2398501 agaagtgacc agggtcagcg cgtccacccg gctgtccatc gaggccgaat accgcggctg 2398561 ggagctagca cgggtacgcc tctataccga cggcagcagg cgggtattgt tgcgccgcaa 2398621 gaaatctcgc tgggcagacg cagaggcgaa ccgccggcca gaccagccgc agctgtggct 2398681 ctgaaggccg gggccagccc gcgcgcagac cgctatcgga tgtatcccct ggtgcgtcgg 2398741 ctgttgttcc tgatcccacc cgagcacgcg cacaagttgg ttttcgccgt gctgcgcggc 2398801 gtggccgccg tggcgccagt gcgccggctc ttgcgccgac tgctgggccc gacggatccg 2398861 gtgctggcca gcacggtgtt cggggtgcgc ttcccggcac cgctcgggct ggccgcgggg 2398921 ttcgacaagg acggcaccgc actatccagt tggggtgcga tggggttcgg ctacgccgag 2398981 atcggcaccg tcaccgctca tccgcagccc ggcaacccgg ccccccgcct gttccggctg 2399041 gccgacgacc gcgccctgct gaaccggatg gggttcaaca atcacggtgc ccgggcactg 2399101 gcgatccgac tcgcgcggca ccgacccgag atcccgatcg gggtgaatat cggcaagacc 2399161 aagaaaacgc cggccggcga cgcggtcaac gactaccggg ccagcgcccg gatggtcggc 2399221 ccgctggcgt cgtatctggt ggtcaacgtc agctctccga acacaccggg gttacgcgat 2399281 ctgcaggcgg tcgaatcgct gcggcccatc ctgtctgccg tccgcgccga gacttcgacg 2399341 ccggtgctgg tgaagatcgc gccggacttg tccgattccg acctcgacga catcgcggac 2399401 ctggccgtcg agctagacct ggccggcatc gtggcaacca acaccacggt gtcacgcgac 2399461 ggcctgacca caccgggggt cgaccggttg ggtcccggcg gcatctcggg gccaccgctg 2399521 gctcagcgcg cggtccaggt gctgcgtcgg ctctatgacc gggtcggtga tcgattggcg 2399581 ctgatcagcg tgggcgggat cgagacggcc gacgacgcgt gggagcgcat cacagcgggc 2399641 gcatcgctgc tacagggcta taccggcttc atctacggcg gggaacggtg ggccaaggac 2399701 atccatgaag gcattgcccg caggctgcat gacggcgggt tcggctcgct gcacgaagcg 2399761 gtcggctcgg caagacgtcg gcaacccagc taaagcgcta acgctgctcg taggtgccga 2399821 agatgaccgc tcgtgcaatc gcgtgctgga acaggttgaa tcccagatat gcaggactcg 2399881 cgtcctcggg gaggtcgagc ttttcgacct tcaccgcgtg taccgcgacg tagtagcgat 2399941 gcaccccatg accgggaggc ggcgccgcac ccacataccg gcgcataccg gcgtcgttga 2400001 ccaatgtcag tgccccgccc ggcagttcgc ggccatcgcc gacaccctcg ggcaactcgg 2400061 tgacgttggc aggcaggttg gccaccgccc agtgccagaa cccggacagg gtgggggcat 2400121 cagggtcgta gacggttacc gcgaagctgc gggtctcgct gggaaatccc gaccacctca 2400181 gctgcggact ggcatccgcc ccgcccgcac ccatgatccc gctgacctgg ggtgtagcca 2400241 gcggctgccc atcggtgatc gaggttgacg tcaggctgaa ggacggcagc ttgggcagcg 2400301 cggcatacgg gtcgggtgaa gttgtcatgg tcagtcctct cgtgtgatcg acgttgcgac 2400361 tagcctcgtt ttcgactagc agtgtgtcag caagtgcgtt agcacctcgg tgccgaaccg 2400421 caacccatcg atgggtaccc gctcgtcgac gccgtggaac aacgaggtga aatccaagtc 2400481 cggcggcaag cgcagcgggc tgaagccaaa gcaccgaata cccaagcgcg cgaacgcctt 2400541 cgcgtccgtt ccaccggaca gcatgtacgg caccgtgcga ccgtctgggt cgaccgccaa 2400601 caccgcggcg ttcatggcgg cgaccagatc accgtcgaag gtggtctcat atgatggcag 2400661 atcgctgacc cactcccggg tcacgtcggg tccgatcagc gcgtcgactt cggcctcgaa 2400721 cgccgcccgg cgacccggaa gcacgcggca gtccacaact gcctccgcgg tcgccgggac 2400781 gacgttggcc ttgtatccgg ccttgagcat cgtagggttc gcggtgtcat gtagcactgc 2400841 cttcaacatg cgggccatcg ggccaagctt gtcgatcgtc ccggccaggt ccggcgagtc 2400901 aaggtcgaag gccagtccgg tctcctctcc gactacggcc aagaactggg cgacggtgtc 2400961 agtgcagacc agcggaaact ggtggcgccc taggcgagcg accgcctcac aaacggcggt 2401021 gaccgcgttc tggtcgtgca ccatcgagcc gtgcccagcc cggccgcgtg ccgtcagccg 2401081 catccactgg atgcccttct cggcggtttc aatcaggtac aggcgacgtt cgccaccatc 2401141 gtgccggggc acggttagcg agaaaccgcc gacttcaccg attgcctcgg tgatgccgtc 2401201 gaacagatcg ggcctattgt cgaccagcca gtgcgacccg tacttgccgc cgtgctcctc 2401261 gtcggcaacg aacgcgaaca ccagatcccg tggcggcacg atagcggcct gacgaaggtg 2401321 gcgggcaacc acaatcatca tgcccaccat gtccttcatg tcgaccgcgc cacgacccca 2401381 gacgtagccg tcttcgatgg cgccggaaaa cgggtgcaca ctccattcgg ccggttcagc 2401441 cggcaccaca tcgagatgcc cgtggatcag cagcgcgccg cgagaactat cggcgcccgc 2401501 cagccgggcg aacacgttgc cgcggccggg cgcaccggat tcaacgtatt caggttggta 2401561 gccgacttcg gcgagctgct cggcgaccca gcgtgcgcac tcggcctcac ccttggtggt 2401621 cccgggttcg ccactgttgg tggtatcgaa ccggattagc ctgctgacga cctgggcgac 2401681 atcatcgctg tggtcgcttg aagccccggt ctcatctgtc acagtcacct ttcctaccac 2401741 tcgtaaccct ggcgagccga tcgcccctgg cgcgccgggc ccgcgtcgtc gccgagctgg 2401801 atttgcttac gtgggctgat tgcctggctc ctcctcaccc cgttacccgg ggcgcatcgt 2401861 cgccgagctc gatttgattg cccggctcct cctcaccccg ttacccgggg cgcatcgtcg 2401921 ccgagctagg ttgggccggt gcggggcaat ccgatagcct tagctgccag ccccggtggt 2401981 tggttggtcc gagtggcgga atggcagacg cgctagcttg aggtgctagt gccctactaa 2402041 tgggcgtggg ggttcaagtc ccccctcgga cacaacttct tagctctata gatcaaaacc 2402101 aagccttgac ctcgtcaagg actaacgtta tgagtttgct cataccaacg atggtcatct 2402161 cgttgatgtc ctaataccta agaactcacc gatcactcga aggtgcggcc agagatctca 2402221 gcctcgaccg cgttcgggtt ctccattccg tgccgaacag ccagtatgtc gatagcctcg 2402281 tcggtcgtcc gataggcaac gtagtacctg aagggccgga ggtagatgtg tcgatagtgc 2402341 ttgaataacg gcgcaaacgc gttcggagcc tgcggaatcc gcttcgtcac ggcatcgaca 2402401 aacaagttgt aaagccgatc gatctgatct ggcgccgcgt ccgcgtagta ggaaaacgcc 2402461 tcgaataggt cgtcttcaac cccgttatgg acgcgcagcc tgcgcgtcat ccgagccggg 2402521 cgcggatccg cttgtcgaag tcatcaatgg tggaccaatg agcatcgtcg gtgtcattgg 2402581 cccgcgcttc gatgagcgcc tggttggcct cgctgatatg catgccctcg gctaggtttc 2402641 cgttgatgtg ctcgacgagc tcaatctgct catcacgcga cagtgcgtcg acgctcgcca 2402701 gcaatgcccg gttgaccacc actcaacgat acccaaggca gccaacgccg gcagcgcagc 2402761 attcggaggc caggctgaac ttcaagctgg caggtgtcat ccgctcagtt gaaagacctc 2402821 aacccgggtc gcaggtggcc aagtcccccc tcggacacca catgtgacgg gtcgaagacg 2402881 aggcacgccg cggacgactt cgagggtaag cagccgtatc ccggggaggc ctgccatgac 2402941 cacttctggg cctatggcag ctagcaaacg tcatgaatgg aagcgccacc atatgccggc 2403001 gatccgacgt tcgagcgttt gcggcgctcg tttcagccag ccgatctact gccggaactg 2403061 caagcggcag gagtgcatta cacaatcgct gtcgaggcgg cggacgatcc ggccgagaat 2403121 gagtctctgt tggccactgc gcgccaccat gattggatag cgcgcgtgat cggttgggtc 2403181 ccactcgccg atccggatga ggttaccgag agctcgacgc acgggcggca ccgcccggac 2403241 gcctcctggc gacgagatct gcggtgcccc ggcctgctgc cgcccgggtg ccaccagcca 2403301 gtcttggtcg taggcttggt aggtcagcag ccggaaatgc gaccgatgaa tccaccaagt 2403361 ggttttctcc ggcggacgcc gacccgcagg tttcgcgacc gccgcgatgc tggtcgcgta 2403421 ttggccgacg aacttgcgtc ctatcgcggc agggaccggt tgctcgtcct cggccttgcc 2403481 cgcggtggcg tccccgtcgg ctgggaagtc gcgtcggcgc taggcgccga attggatgta 2403541 tttctggttc gcaagctcgg cgtgccgcag tggcgcgagc tggcgatggg cgcgttggcc 2403601 agtgggggcg gggtcgtgat gaacgacgac gtggtttcca gcttgcgcat caccgaccag 2403661 caggtgcgtg cggcgatcga cagcgagacg gcagagctgc agcggcgcga gctggcgtat 2403721 cgcggcggac gccctgtcgt cgatccgcgc gccaggatcg tgatcctggt tgacgacggc 2403781 atcgccaccg gcgcgagcat gctggcggcg gtgcgcacca tccgtgccac cggaccggag 2403841 tcgatcgtcg tcgcggtccc ggtcggtccg gccacagcct gccgcgagct cgcggcggaa 2403901 gccgacgacg tggtgtgcgc aaccatgccg gcagcgtttg aggccgtcgg ccaggtctat 2403961 aacgactttc atcaggtcac cgacgacgag gtccgcgagc tgctcgcgac gccaaccaca 2404021 ggcgcagcga cctaacgaga ggattctcgt gaggtgactg ggatggtcag gatgcgtggt 2404081 cgagggtcta gatccggagc tgggcgacaa accacccgat aacctcccac gacgccccta 2404141 ccgaggtcgg tgtcgctgac gaccctactt ggcgctgtcg tcgcttcggt ccgccgatac 2404201 cgccgactcc tcggtcgctt cgctggcctc ctccgatggc tcctcactgc cgatgacacc 2404261 ggcgtcgacg gcttggctct cctcgggggc ttcctccggg tagtcgacgt cggcttcctc 2404321 cgcgacaccc gtttccccag ccccatcagc ttcgtccgcg ccaccttgct ggcgttctcg 2404381 caacgcatcg acgatcagca acgccacacc cagcacgctg gccccgatgc atacccaggc 2404441 cactagctgg ttgctggtga ccaccgcgaa caccaaggcc aggagcccaa tcagggccaa 2404501 gaccagcgca atgatcagca tcggtcatcc tccaaccggc tagcagcgac tgcccaacct 2404561 accaggatct ggctgccgac ctcgaaaact ggcgcgtgtc cggcacgcct ggtggctagt 2404621 ttttgccccg gttgaattga tcgaagccac cggcatccgc attggaatcg accggcgccg 2404681 ccgatccacg ctggccgagt tcctccagct gcgattccag gtaggtcttg agcctggtgc 2404741 ggtactcacg ttcgaaggta cgcagctgct cgaggcggcc ttcaagcacc gcgcgctgct 2404801 ggttgatggt tcccatgatc tcggagtgct tgcgttccgc atcggcctgt aaggcatcgg 2404861 ccttctcctg cgcctggcgc aactgggcct cggatcggga ttgggcatcg gccagcatgg 2404921 catcggcacg ctggcgggcc tcggcgaccg tggcgtcggc ggtgtgtcgg gcctcaccga 2404981 ggatctgctc cgcattggca cgggcatcgg ccagcatctt gtccgactcg gctttggcgg 2405041 tgtttgtaag ccggtcggcg gtgtcttggg ccagactcag cactcgcgcc gccttcaggg 2405101 cctgttcctc gttcatcccc gccgagaccg ccgccggcgc cggcttgccc ggttcgggct 2405161 catacgccgg gattgcctgg gtggcctgcg gcgtaacgcc ggcaccgccg cccgcggcga 2405221 gctcttgatc cagctcgttg atcctctgac gcagatcgga gttctcttcg atcaggcggg 2405281 tcagctcgtt ttccaccagg tcgaggaagg cgtcgacctc atcttcgttg tacccacgtt 2405341 tgccgatagg cggcttactg aacgccacat tgtggacgtc ggcaggtgta agcggcattg 2405401 tttgtcccct cgagttcctg gacggtcaaa cgatctggaa gtgtagaacg gagtggtagc 2405461 cgtggtgcaa ctaccgtcca tcctgtcaca ccagactcgg cggttgccga ttggactaag 2405521 taaataagga ccaatttcaa actctaagac caaataaatc acaatcctta gatttgaaat 2405581 cgtgcgcgcc aaacttgtcc ccaaatcgtg gccgaaccgt ctctcaatcc tcgtcatgca 2405641 cccggccgtg tgaccgcgcc gcggctcagg ccgcagcacc aaacgccagt tgcataccga 2405701 tgaacgcaac cagcagcagc accatgatcg acaggtcgaa ccggaccgcg ccgatcgtga 2405761 gttgcgggat cagccggcgc agcaccttca ccggcggatc agtgatcgac atgatgatct 2405821 ccaagatcac cacggtgaca ccggtgggac gccagtcacg gctgaacgag cggatgaact 2405881 caacgacgac ccgagcgatc agcagcagcc agaagatgaa cagcgcgaac ccaaggatct 2405941 gaaaaaacac caccaacgag agccccgacc ttactgagga ggatgaagaa atgttgcgtc 2406001 gccaccgatg cgggcggcaa cgccagccta ccgactcggg tggcgtgccc acatctcatg 2406061 gcgggccacg ccccgcccag cgtggatgcc caatgggtct acaggcgacc gtcgcgtcta 2406121 ttggtaggcg tagaacccgg tttcggcgat cctgcggcgc tcctcggggg acacatcgac 2406181 gtctgcaggc gagagcagga acaccttggt cgcgaccttg tcgaacgagc cgcgcagcgc 2406241 gaaggccagg ccggccgcga aatcgaccag ccgcttggca tcggcgttgt ccatcgacac 2406301 cagatccatg atgaccgggc tgccgtcgcg gaaccgctca ccgatggtgc gagcctcgct 2406361 gtagtccttg ggccgcagcg tggtgatctt cgagagcgga tggccatcct cgaacatcat 2406421 cgccatccgg cgggggtcca tcgctagcgc gccgcgggtg gagttgcgca gccacgatcc 2406481 gaagcgcggc cgtgtcatct ccgcgcggtc gaactcccgg ggccggaaac gtggttcgtc 2406541 cgcgtacccg ccgcgatatc ccggtggtgg atagtcggcc ggctcaccgc gcaggtcacc 2406601 gcgtgaatcg ctgcgcgcgt cgtcgtagtc gcgcccatcg tagcggccgt agtcgtcgtc 2406661 gaatcggggc cgcgcatacc cgcgcgaggg agcgcggtcg tcgtagtact cgtcgtcgta 2406721 atcctccatg ggagccatac cgaagtaggc cttgaccttg tgcagtgtgc tcattgcgtg 2406781 accccttcta gccctgggag atctgttgtc tgtgatgaag gtgtgactac agtgactatt 2406841 cacggtgacc gtaaccgccg cggacccaat agcgcggtac cgacacgcac acaggtcgaa 2406901 ccatgtttga cggcgacttc aaggtcgttg gacatgcccg ccgacagacc gatcgcgtgc 2406961 gggaacatcg cacgcacccg gttgtgctcc gattgcagcc ggtcaaaggc ctcgtccggg 2407021 tcccaatcca gcggcggaat gcccatcaac ccgaccagtt cgaggccctc tgactcctgc 2407081 acctgcgcgc aaatccggtc tacggcgccg ggcgtcgtgc tgtcgacgcc gccccgggat 2407141 ccgtcaccgt cgaggctgac ctggacgtaa acccgcagcc gctcgccacg acggtgttcg 2407201 gccagcgccg caacaaccgc ccgatccagc gcggtcacca accgcgagct gtccaccgag 2407261 tgagcggtgt gcgcccagcg agccagcgac ccggctttgt tgcgttgaat ccggcccacc 2407321 atgtgccagt gcacaccccc cgagtgaccc aactcggcag ccgccaacaa ccgattaagt 2407381 tcggccatct tggctgaagc ttcctgttcg cgcgattcgc caacggaccg acaacccaat 2407441 cgaaacaaaa tcgcaacatc ggttgctgga aagaatttgg taatcggtag aagttcaatt 2407501 tcgccgacat tgcgacccgc cgcctccgcg gccgccgcaa gtcgcgatcg cattgccgcc 2407561 aacgcatgcg tcaattccga ttcgcggtct ggatacgccg aaagatccgc cgccatcgcg 2407621 gtcattccat ccacaccaac gacgcgaacc gtccggtggg cgcatcgcgg cggtggctga 2407681 acaacgtcgg atcggccacc gtgcagcggg gatcgacgtc gatagactca acacccaaat 2407741 cgcggagctg gcaagcgatt ccggcgcgca ggtcgactcc gggagtgccg gcagcggtgg 2407801 tggtgcggct gcccggcaac gccgcctcga cctcatcggc catcgctgcg ggcacttcgt 2407861 agttgcgacc actgaccgcg ggacccaaca gtgccgagat gtcgcggacc tgggcaccca 2407921 ggctcaacat cacctccagc gcgcgaacca ccacaccgcg ctgcgcgcct gcccgaccgg 2407981 catgaaccgc ggcggcgata ccggcccgtg cgtcggccat cagcaccggc acgcagtcgg 2408041 cggtcacaac cgccagcgcc aatcggggtg tagcggtcac caatccgtcg gtgtcatcga 2408101 gtgccgtatt gcgcggctgg tcgaccagct cgacccgatc cccgtgcacc tggttcatcc 2408161 acaccactcg gttgccgggc agtccgatgg ctgcggccag ccgagcgcgg tttgccgcca 2408221 ccgcggccgg gtcgtcacca acgtggtcgc cgaggttgaa ggtgtcgaac ggtggggccg 2408281 acacaccacc tgcccgggtg gtggtgaccc gacggatgcg aacactcacg ttcccagtat 2408341 cgccgcgggc gatgtgccgc gtactggcga gcaagccgat gctctcagcg gcgcatgaag 2408401 ggcggcacgt cgacatcgtc gtcatcaccg ccgatgctca gggttgcgcc gttggtgtgc 2408461 aacggcacgc tgacggcgtc gaccggctcg aacaaggtcg aggtgagctt gcctgccttg 2408521 gctgactcga tccggtgggc gccgccggtc tcgcccatca ccggcttgcg gccgggaccg 2408581 ctgacgtcga agccggccgc gatcacggtc acccgcacct cgtcaccgag cgaatcgtcg 2408641 atgacggtgc cgaagatgat gttggcatcg gggtgagcgg cgtcttgtac caacgaggcc 2408701 gcctcgttga tctcgaacaa gcccaagtcg ctgccgccgg cgatcgacat cagcacgcct 2408761 tgcgcgccct ccatcgaggc ttccagcaac ggcgagttga tggcgatctc ggccgctttg 2408821 agcgaccggc cttcgccccg ggccgagccg atgcccatca gtgcggtgcc ggcaccggac 2408881 atgatgccct tgacgtcggc gaagtcgacg ttgattagac ccggggtggt aatcaggtcg 2408941 gtgatgccct gcacgccgtt gagcagcacc tcgtcggcgc tacggaaagc atccatcagc 2409001 gataccgcgg catctcccat ctgcagcaac cggtcgttgg gaatcacgat gagggtgtcg 2409061 caactctccc gcagcgccgc gatgccattt tcggcctgat tgctgcgtcg cttgccctcg 2409121 aacgagaacg gccgggtgac cacaccgacg gtcaacgcgc ccagcttgcg ggcgatgctg 2409181 gcgacgacgg gtgccccccc ggtgccggtt ccgcccccct cgccggcggt gacaaacacc 2409241 atgtcggcac cgcgcagcag ctcttcgatc tcgtccttgg cgtcctcggc ggccttacgg 2409301 ccgacctccg gatcggcgcc ggcgcccagc ccgcgggtgg agtcgcggcc gacgtcgagt 2409361 ttgacgtcgg catcgctcat caacaacgcc tgggcgtcgg tgttgatcgc gatgaattcc 2409421 acgcctttga ggccctgctc gatcattcgg ttgacggcgt tgacaccgcc accaccgata 2409481 cccacgacct tgatgacggc caggtagttg tgcggggggg tcatcgttcg gcttcctccc 2409541 tggtggggct cggttcttcg gtgtgtctgc tggcaaactc tcaacctcaa ccataggctt 2409601 agagttatgt caagtagttg ctcgtagtca gaaccgtatg gctacgacgg ttgctaaccg 2409661 tgcaggcgcg ccgatacgcg gcgggcattt tttcggctat ttcacggtcg gcaggtcggg 2409721 gctggacacg tcgtacgttc tgcctggctg ggtcaacagc gccgccagct tttcggcctt 2409781 ctcttcgcag cggtcggtgg ttccccagat caccacgcgg ccatcggcca acgtcagggt 2409841 gatcgaggcc accgacgggg ccgcgatccg ccccacctgg cttgcaactt caggatgcag 2409901 cgcggtcaac acctgcagcg ccgccttggt cgtcggatcg ctaggaccgg gattgtccac 2409961 atcgaaataa ggcaacgccg gcggtggcgg atcggtcgcg aagtcgacgc cgtcgcggtc 2410021 aaaaaggtgc gggccgtccg aaaaatcctt gaccaccacc gggacccgct cgacgatggt 2410081 gatccgcaag gccgacgggt actgccgctg cacccgcgca ctggccaccc gccggatcgt 2410141 ggccactcgg tcagcaacct gttgggtgtc gatctgcagc aacggcgttg ccggccgcac 2410201 tctggcggcg tcgagaacct cctcgcggct caccgccccg atcccgatga tcacgatctc 2410261 gcgggccgac atcgccggcg tgaagtacag cgcgagccca agcccgatcc cgacgacggc 2410321 cagcacgacc gtcgcgagca gcgccttcag ccctcgaaca acacctcggg cggccggttt 2410381 ggcggggttc tgctcactga cgatctgccc gcgggctcgc cgtttggccg cgcggcgagc 2410441 ctgctcgatc gcggtagctc gagcctgcgc ggcgcgacgt tcggcacgtt cgcggcgggc 2410501 gcgccgacgc ggcccttcga attctgggtg ctcggccggt tcgtccttcg attcggtggc 2410561 caacggctcc gtaaccgcct cctcgtcggc ggcgtcgtcg gccacgcgct cgatctgtgg 2410621 gtcctcgttg tgttccgtca tcccagcacc cccggacggc cgggggcgct tcggttggcc 2410681 cggacccgaa gggcggtcag gatttccggg cccagcaagg tcacgtctcc ggcacccatc 2410741 gtgacgatga cgtcgcccgg actagcggcg gcggccactt gctgtgcgac cgccgaaaaa 2410801 tccgggacgt agcgcatcgg cacagtgacg tgctcagcga cgctggctcc gctgacaccg 2410861 gccagcggtt gttcacgagc tccgtagacg tcgagtacga acacctcgtc agcggcattc 2410921 agcgcacgcc caaactcagc agcgaatgcc tttgtccgcg aatacaaatg gggttgaaac 2410981 acaaccatgc agcggccacc gtcgccctgt tcgagcacca tgcgcgccgc cgccagtgtc 2411041 gcgctgatct ccgtcgggtg gtgggcgtag tcatcgaaca cgcgcaccga cgcctttccg 2411101 acgccgcagg tcccaaccag ttcgaatcgt cgccgcactc cttcgaagcc ggccagcccg 2411161 tcgagcacct cgtcggccgg ggcgccgatc tgcaccgcgg ccagcagcgc tcccagcgcg 2411221 ttgagcgcca tgtgtcgccc gggcaccgac agccgcatca cgcggggacc ctgtgctgtg 2411281 gctagttctg aggccaaccg gatatgtgcg accgcgccga ccccctgttg ctgccacgag 2411341 accaacgtgg ctgccatggt ctcacccggc accgacccgt atcgcagcac tcgaattccc 2411401 agctcagtcg cgcgctgagc cagcgcggcc cctccggggt cgtcagtgca caccaccagc 2411461 gcacccccgg ggacaatgcg ctccacgaag gagtcgaaca ccgcaacata cgcctcgacg 2411521 ctgccgtaga agtccaggtg atcggactcg atgttggtga tcaccgcgac gtggggtgtg 2411581 tactgcaaca gcgagccatc gctttcgtcg gcttcggcga cgaaacagtc gccactgccg 2411641 tgatgggcgt tggtaccggc ctcccccagc tcaccgccga ccgcaaagga cgggtcaagc 2411701 ccgcagtgct gcagggcgac gatcagcatg gacgtcgtcg ttgtcttgcc gtgcgtgccg 2411761 gtgaccatca atgtggtgcg cccggccatc aacttggcca gcacggccgg ccgcagcacc 2411821 acgggaatgc cgcggcgcct cgcttcgacg agctcggggt tggttttggg gatggcggca 2411881 tgggtagtga cgaccgccgt ggcgccaccg ggcaacaggt ccagcgacga cgcgtcgtgt 2411941 ccgatccgga tcaacgcgcc ccgcgcccgc agcgcatgca caccgcgcga ctccttggcg 2412001 tctgacccgg agaccagccc gccgcggtcc agcaggattc gggcgatgcc cgacatgcca 2412061 gctccgccga tgccgaccat gtgcacccgc cgcagatcgg gcggcaactg ctcggtgctc 2412121 acgtcgttgt cctggcaccg gccccggtgg cgacggccag cgcggcccgg gccacctggc 2412181 ccgcggcatc gcgatgtccc accctggctg cggccgcggt catcgcggcc agccgcgcgg 2412241 ggtcggtgag cagcccggca acctggcggg ccaccaactc gggggtcagg gcggcgtcgg 2412301 cgaccaccat gccgccgccg gcattgacta ccggcaacgc attcagccgc tgttcaccgt 2412361 tgccgatcgg cagcggcacg tagatggccg gcagaccgac ggcggatact tcggcgaccg 2412421 tcatcgcccc ggcccggcag atcaccagat cggcggcggc gtaggccagc tccatccggt 2412481 ccaaataggg caccgccacg tacggtgggt caccttgagc ccgacggcgc aactccagca 2412541 cgttctgggg tccatgggca tgcagcacgc aaacaccggc ggcggccagg tcggcggcgg 2412601 cgccggacac cgcccggttg agcgagaccg cgccctgcga acccccgaac accagcagca 2412661 cccgcgcgtc gtcggggaag ccgaagtgtg cccgcgcctc ggctcgcagc accgcgcggt 2412721 ccagcgcggc gatcgacgca cggaccggga ccccaaccac ctcggcgcgc cgcagcccgg 2412781 aatccggcac cgcggagagc acccggtccg cggtatgggc gccgacccgg ttggccagtc 2412841 ccgccctggc gttggcttcg tggatcacca ccgggatccg gcgccggcgc cggggcggca 2412901 aaggcaggcc gcgagcggct aggtaagccg gtagcgcgac gtacccaccg aaaccgacga 2412961 cgacgtcggc gtcgacatcg tcgagcacgt cccgggcctc ccggacggcg cgccacaccc 2413021 gcgacggcag ccgggccagg tcgccgccgg gcttgcgcgg catcggcacc gccgtgatca 2413081 gctccaggtg gtagccgcgc tggggcacca gcctggtctc tagtccacgg agggtgccca 2413141 acgcggtaat ccggacgcgc ggatccaacg cgaccaaggc gtcggcgacg gccatggcgg 2413201 gctcgacgtg cccggcggtc ccgccgccgg cgagaacgac cgacacggaa tcagcagacg 2413261 gcgaggaacc acaagacggc gaggcggcat cggcgggccg gggcgccgtt gccccgcgcc 2413321 cgccggccgg ctggctgacc gtgtccttca cccgtaacgc tgaccttcca atgcccgaac 2413381 gcgccgtgtg cgacgctggc ccgcgtaccg ctggccagct ccatgatgca ctgatcgacg 2413441 aaccggcgga tcggccgtgc ggggcgagcc gggtcgcggg ggcaggccca tctgccgggc 2413501 aggctgtccg ggcgccgtgc ggggggtctt ccgcgcgggc tgcgtttggg ccggttgcgg 2413561 gttggcgcgc ttgcggtcac gaaacgcctc gagacgaggg ggcagatacg gctcgggcag 2413621 cggcagccgc agcaaccggt tcaccttgtc gtcgcgccca gcccgcagcg cggccaccgc 2413681 ctccggttcg tggcgagccg cgttggcgat gatgcctatc agcgaaagtg ttgcggccgt 2413741 ggaggttcca ccggcggaga tgagcggcag ctgcaggccg gtgacgggca gcagcccgat 2413801 cacatagccg atgttgatga acgcctgtcc cagcacccac agtgtcgtgg tggcggtcag 2413861 cagccgcagg aacgggtcgg cggaccggct agcgatgcgc atgccggtgt aggcgaacaa 2413921 tccgaatagc cccagcagtc cgagcgcgcc gacgagaccc agctcttcgc cgatgatggc 2413981 gaaaatgaag tcgttgtggg cgttgggcaa gtagttccac ttggccacgc cttggcccag 2414041 accgtcgccg aaaatgccac cttgagccag cgcgaacttt gcctgtcggg cctggtagcc 2414101 ggagtcttgc ggatcgtttt cggggttgag ccacgaccgc acccggtcgg atcggtagcc 2414161 cgcggacacc gccaggatgg cggccgagac gacgaccgcc gccagtgagc tgaggaagac 2414221 gcgcagcggc agccccgcat accacagcag gcccaacaag atgatgccca tcgacacggt 2414281 ctgtccgagg tcgggctggg ccacgatcag cgccagcgca acgacggcgg ccggcaccag 2414341 tggaatcagc atctcgcgca gtgaagcccg ttccatgcgc cgggcggcca gcagatgcgc 2414401 tccccagatg gcgaacgcca tcttagccag ctcagagggc tgcatcgaga agcccgcgac 2414461 cacgaaccag ccgcgcgagc cgttggcctc cttgccgatc cccggcacca gcaccagcac 2414521 cagcatcacg atggtgatcg cgaaaccgga gaaggcgatg cgccgcatga accgcaccga 2414581 catccgcaga cagacatagc cgccgataag acccacaagc gtccacaaga cctgcttgcc 2414641 gaagatcacc caagccgatc cgtcgtcgtc gtaggaccgc accgccgatg ccgacagcac 2414701 catgatcagt ccaagggtgg tcagcaatgc ggcaacggcg atgatgaggt gaaacgaggt 2414761 catcggacgg cccagccagg caccgaaacg ggtgcggggc ctcgccgaac ccgggttaga 2414821 ggcttcttcc gggcccgtcc gctgcccctc gaccggctcg gcccctcgag tctgggagcc 2414881 gtcggtgtcg ctggtgcccc gacgcagcaa ccgggttagc acgctgcccc cgcctaccgg 2414941 atcaccgcgc ggaccgcggt cgcgaatgcc tcgccccggt cggcataacc ggtgaactgg 2415001 tcgaatgagg cgccggccgg tgccagcagc acggtgtcac cgggttgggc catccgccgg 2415061 gccgcggcca ccgcagcggt catcacggca gcgccaacgg tctcaccggc tttgtcatct 2415121 tttgccacat ctagaacaca agcaacagga acctcaacag tcgcaggcat accagtatcc 2415181 tcgcctgcca caacctgaac gactgggaca tcgggcgcgt gtcgtgataa cgcctcggca 2415241 accgctgcgc gatcccggcc gatcagcacc gcaccgacca gccgcgacgc catcgccgca 2415301 acctcggcgt gaagcgacgc gcccttgagc aggccaccgg cgatccatac caccctcggg 2415361 tatgcaagca ccgaagcccg cgcggcgtgc gggttggtgg ccttggagtc gtccacgtag 2415421 gtgatgccgt cggcaacggc caccacctcg gcgcggtgtc ggcccactcg aaacgacgtg 2415481 accgcgtcgg cgatcgcacc ggcgggcacc ccgaccgagc gggccagcgc cgccgcggcc 2415541 agggcgtcaa gcacgccgac cggacctggc accggtatcg acgcgaccgg cagcagcgtc 2415601 aagtcgtcgg agaaggcgcg atcgaccagg tgggcgtcgc gcacgcccag ttcccgcgcg 2415661 gccggctcgc cgagccggaa gccgacccgc acctgcgccg gtgagccgtc cagcagtgcg 2415721 gccgctcggc tgtcatccag cccggccacc gctaccccgc cggtcagcac ccgggccttg 2415781 gccgcggtgt attcggccat cgtggcatgc cagtccaggt ggtcttcggc aatgttgagc 2415841 accgcgccgg cctcgggccg cagcgacggc gcccagtgca gctggaaact ggacaactcc 2415901 acggccagca gctcggccgg ctcgtccagc acatccagca ccgcactgcc gatattgccg 2415961 cacagcacgg cgcggcggcc accggcgatc agcatggcgt gcagcatcga cgtcgtggtg 2416021 gtcttgccgt tggtgccggt caccaccagc cagctgcgcg gcggtccgta gcagcccgct 2416081 gcgtctagcc gccaggctaa ctccacgtca ccccagatcg gcacccccgc cgccgcggcc 2416141 gcggccagta gcggggttgc gggcgagaag ccgggactgg cgaccaccag cgcatacccg 2416201 gttatctgct gcaccgcgtc cgaggaacta acggtcggca gcccacgttc ggcgtgcggt 2416261 cgcagcatga ccggatcgtc gtcgcacacc gtcggcgtcg caccaaaccg agtcagcacc 2416321 gcggccaccg cctgaccggt cacccggcca ccggctacca acacgggcgc acccggcccc 2416381 agagggtcaa gcacgtcagg caccgaccgc ggcaagccac tcaccgtaga acaaggccac 2416441 gcccagaccg caggtgatcg cggtgagcag ccagaaccgg atgatgaccg tggtttcagc 2416501 ccaaccgacc aactcgaaat ggtggtggaa gggcgccatc cgaaacatcc ggcgcccggt 2416561 ggtccggaag gtcaggattt gcaacaccac cgaggtgatc tcggcgacga acagcgcacc 2416621 cagcaccacc gcaaggatct cggtgcggct ggtcaccgac aaccccgcga tgacgccgcc 2416681 caacgccagc gacccagtgt cacccatgaa gatcttggcg ggcgcggcgt tccaccacaa 2416741 aaaaccgatg caggcgccag cggttgcggc cgcgatgagc gccaggtcca gcgggtcgcg 2416801 cacgttgtag cagcccaggc ccggcgccgt cacgcacgcg ttgcggtact gccagaaggt 2416861 gatcagcacg taggcggcgg tgaccatcgc catggtgccg gcggccagcc cgtccaggcc 2416921 atcggtgaag ttgaccgcgt tcgaccaggc gctgacgatg accacgcaga acaacacgaa 2416981 cagcaccggc gccaatgtga cggtggcgat ctcacgcacg taggacagat ccgcgctgcc 2417041 cggtgtcagg ccggcagcat tccggaactg cagcaccagc acgccaaaca gcacggcgga 2417101 ggtgatctgc ccgacggtct tggccgtctt gttcaacccg agattgcgcg acctgcggat 2417161 cttgatcaga tcgtcgatga acccgacgcc gcccaaagcg gtggctaggc ccagcaccaa 2417221 cagacccgat gcgccgatgc cttcaccgtc aaacgccagg cccgctaggt gggcgcccag 2417281 gtagcccgcc cagatgccgg ccagaatcgc caccccgccc atcgacggcg taccgcgctt 2417341 ggtgtggtgg ctgggcgggc catcctcacg gatctggtgg ccgaagccct gcttagtgaa 2417401 caaccggatc agcaccgggg tcagcaagat ggacaccgtc accgctacgg caacggcgat 2417461 aaggatctgc ctcatgggcg cacactcccg catgtgtcgt ctgcgaccaa tgcatcggcc 2417521 accgcaccca gcccggccgc gttcgaggcc ttgaccaaga ccacatcccc gggtcgcagc 2417581 tcggcgcgca gtagtgccag ggcggcgtca ccgtcggcca cattgacggc cgtgcgatcc 2417641 gcaccgtgat cagcagtggc ttcccccgag ccccacgccc cctccaggac cgctccgtgg 2417701 tgcatggcgc tgatcgacct cccggttccc acgacaacga gtcgagacac atctaagcgc 2417761 accgcgagcc ggccgatgcg atcgtgctcg gctatcgcgt cctcacccag ctcggccatc 2417821 tcacccagca ccgcccagct gcggcgggtg gcctcgggtt ggtgcgcgat ccaggccagc 2417881 gcctgcagcc cggcccgcat ggagtcgggg ttggcgttgt aggcgtcgtc gatcaccgtc 2417941 accccgtcgc cgcgggtggt cacctgcatc cgatgccgcg acaccggcgg cgccgcggtc 2418001 agcgcggccg cgacctgttc aacgctggcc ccacactcca gcgcgaccgc cgcggcgcac 2418061 agcgcgttag tgacctggtg gtcgccgcag accccgagtc ggacctcggc ttgggcatcg 2418121 tgggcatgca gcgtaaagcg cggcctggcc aattcgtcca gcgacaccgg ccccgcccaa 2418181 acgtcaccgg tgttgtcccg gctgacccgc accacccggg ccgcggtcag cttggccatc 2418241 gccgccaccg cggggtcatc agcgttgagg acgaccgctc cggaatgcgg aacagcctgc 2418301 ggcagttcgg ctttggtctg tgcgatgacc tcgcgggagc cgaactcacc caaatgtgcg 2418361 gtgccgacgt tgagcacgac tccgatcgac gggggcgcga tctcggcgag cgcggcgatg 2418421 ttgccgtgat ggcgtgccgc catctccaaa atcaggtagt cggtgcgccg cgtcgcgcgc 2418481 agcaccgtcc acgggtgacc cagctcgttg ttgaacgatc cgggcggggc caccacctcc 2418541 cccagcgggg ccagcacggc ggccatcagg tccttggtcg acgtcttgcc cgacgagccg 2418601 gtgatcccga tgatggtgag cccgccggcc accaactgcg cggccaccgc ggtggccagc 2418661 ttggccagcg cggccagcac cgccgccccc gacccgtcgt tgtcgtgctc gaggacgccg 2418721 gccaatacgt tcggcgcggc cactggcgga accacgatgg ccggcacccc caccgggcgg 2418781 gcggccagca cgacggcggc gcccgcggct accgccgacg cggcatggtc gtggccgtcg 2418841 gcgcgcgccc ccggcagggc gaggaacagc ccgcccgggc cgatggcgcg cgagtcgaac 2418901 tcgacggtcc cggtgacgcg gcggtgcgcg gcgtcttgcg gggagatatc ggccactgcg 2418961 cccccgacga tctcggcgat ctgcgcgacg gtcagctcga tcatgcgcgc cgctcgaggg 2419021 cctctagcgc ggcagccagc tccacccggt cgtcgaacgg gcggacccgc ccgccgccgc 2419081 gttgcccggt ctcgtggcct ttgccggcga tgagcaccac gtcgccgggg cgcgcccagg 2419141 caaccgcgtg ccggatcgcg tcccgccggt ctgcgatctc gacgacctgg gcatcaccgc 2419201 cgacttcggc cgccccagcc aggatttcgc ggcggatcgc cgtgggatct tcgtcacgcg 2419261 ggttgtcgtc ggtgacgacc accaagtcgg ccagctgcgc ggctatccgg cccatcgggg 2419321 cccgcttgcc cgggtcacga tcgccgccgg cgccgaacac caccgccagc cggcggtccg 2419381 ggtgcgccaa ggtggtcagc accgaccgca gcgcttccgg tttgtgcgcg tagtcgacca 2419441 gcgcgagaaa gccctggccg cggtcgatct gctcgagccg ccccgggacc cggatctcac 2419501 gcaggcccgg caccgcctgt tccggggaga ccccgacggt gtccagaatc gccagggcga 2419561 ccaggcaatt ggcgacgttg tagcggcccg gtagccggat tccgatgtga tgccctacgc 2419621 cggcggggtc gatggcggtg aattgttgcc cgcccgcgtc cgtgggcgcc acatccgtgg 2419681 cgcgccagtg tgcgggccgg tcggcggcgc tgacggtgat cgcgtcggcg gcccgcgccg 2419741 ccatcgcgcg cccggcgtcg tcgtcgatgc acaccacggc ggtgcgggcg cgcagtgccg 2419801 agtccggatc gaacaatgac gccttggcct cgaagtagtc ggccatgctg gggtggaaat 2419861 ccaggtggtc acgggagaga ttggtgaagg cgccgacggc gaaccgggtg ccgtccaccc 2419921 ggcccagcgc cagcgcgtgg ctggacacct ccatgaccac ggtgtccacc ccgcgttcga 2419981 ccatcgccgc cagcatcgcc tgcagcgtgg gggcctccgg ggtggtcagc gcgctgggaa 2420041 ggtcggcgcc gccgacgcgg atgccgatgg tgccgatcag cccggcgacg cgtccggcag 2420101 cccgtaaccc ggcctcgacc agataggtgg tggtggtctt gccggacgtt ccggtgatcc 2420161 cgataaccgt caaccgctcg gacggatgcc cgtacacggt ggcggccaag ccgccgagca 2420221 cgccgcgggg tgcggggtgc accaacacgg gcacggccgc tcgtccggcg atctcggcga 2420281 ccccggcggg gtcggtgagc accgcgacgg cgccgcgtgc gatcgcgtcg ccgacgtggc 2420341 gggccccgtg ggtggtcgag ccggtcaggg cggcgaacag gtcaccgggt gacacgtcct 2420401 gggcgcgcag cgtgaccccg gtgaccgtcc ggtcctcggt gacggcacgc tgagctggac 2420461 cctcggccag ggccgcgccg acctgatcgg ccagtgcggc caaccgaacg cccacgacgg 2420521 cgttggggcg caagccagtg ggcgcagcct ccacctgtgt cgccacctcc gttcgccgcc 2420581 gcgagatccc tcgggccagc gatgacaccc taccgacagg gcgcgcacac tcacccagtc 2420641 gggttttgcc gcgacacctg gccctcggcg gcggcgccga tccaggtgcc gatgcgccgc 2420701 gcggcggtga aggcggccca ggccagggcg ccaaccagcg ccgcatcggt gtcgagcagg 2420761 gatcgggccg cggcgacgtc gtcgtcggtc acctgatgcg gggccaggcc ggtcagcagg 2420821 gcaagacggg tgggcgcgtg caggtcggcg ggcagctcgg cggtgtgctc gttcgtccag 2420881 cgactgctca tcggcattgg ctcgccgtgc cacgacccca cgacccgcct gaccacctga 2420941 cgagtcggtg gcggcaggtg cggcgcggtg tccaggtggt ggctgagcgc ggcgaacgcg 2421001 gttgctatgg gctcggacgg tgttgcccat gccagatcgt cgggcagcgt tcgcggctcg 2421061 agccggcggg tggagcggcc cggccgatgc tccgcgcgca ccttgcgggc gaacaccagt 2421121 ccaccggcgc ggcgcatgag ctgttgggcg cgcgggcccc ccggcaggaa ggtttcgtcc 2421181 agcagcacca ggaccaggcg tgcgatgaag tggaattgca ccgcggtgcc caggtattcg 2421241 gcggcgacat ccgggccgaa cggtgccggc ggtcccgccg gtgtcccggt tcctgccgcc 2421301 cacgccacat acggcgcgtt cgggtcaccg gcggcaggtg ctgtgccggc caagatcgcc 2421361 gcggcggtgt cggtttggcc tgccgcgtac agcatggtgg tgtgtgcgtc gacgcaccag 2421421 gggcagcgca ggctggccgc gacggcggcg gcgacggctt ccttgcggcc acgcggcacc 2421481 tggcccacca gcagtgtctc gcgcaacgtc gcccagccgg cggtgagcag tccctcgtcc 2421541 ggggacagca tggcgagcgg ctcgggcagc cggccgaact cgcggcgggc ctcggcatag 2421601 acctcggcga ccgcgccgcc ggctcggcgg ggcgcgacgg gctcaatatg gttgacaaat 2421661 ttcatgattc gactccctcc tgggtggtgc cgactctggc cagggccgcg tcgatcgccg 2421721 ttcgcgcccg ctctccggcg ccgtcgtcgt cgagcagcag cagcgcccag ttggcctcca 2421781 tcgcgtaggc gtgcagctcg aacgcgagtt ggcgcacttc gatatccgcc cggatctcgc 2421841 cccggcgttg cgccgtttcg acgtcggccg tgatggcggc gattccggcc cgcccggtcg 2421901 cggcgatgcg gtcgcgcacc gggccaggct gtgagtccac gtcggcggcc gcggccgcga 2421961 aaaagcagcc gccggcacgt cgcgttccag gtatccgacc cacgcatgca tgagggcgcg 2422021 cacccggtcc accccgggcg gcgctgccat cgcgggagcc acgacctcgg cttcgaacac 2422081 gctcacggcg gcctcgacgg tcgccagctg cagctgctcc ttggcgccga aatgccggaa 2422141 caggcccgac ttgctcatgc ccagccgccc ggcaagctcg ccgatggaca gccccgagag 2422201 ccccttcacc gaggcgatat ccatcgcggc gcgcaggatc tgcgcccggg tttggcggcc 2422261 gacgtcggcg ctaggcatgg cttttgacct cccggtcgtc tccggcgaac gcatccacca 2422321 gcggggccga ccggtccagg gcggccagca cctggtcgcg gcccgccgag ggcagcgcga 2422381 gcgccacctc cgcgacaccg gcccggcggt actcgtgcag ggtcgccggg tcgccggccg 2422441 acgagtacac acagacctgg gcggtcgccg gatctcgccc ggcacgctcg aacgcggcgt 2422501 gcagcatcgg caacgcgccc aggagctcgc cgtacccctc gatcggctgc caaccgtcgc 2422561 cgtggcgggc gatcacctcg aacgcccgcg cactgggccg gcacccgaac agcaccggcg 2422621 gcgccacggc cggtttcggc cacgcccacg acggcggcac cgacgcgtgc gtgccctcgt 2422681 agtggaccgg ctctgcggcc catagcgccc gcatggcggc gagcttgtcc accgtcaccg 2422741 cgatccggtc ggcgaacggc acgccgtggt cggcgagctc ctccacgttc cacccgaaac 2422801 ccacccccag cacgaaccgc tcgccggaca tggcgcacag cgaggcgatc tgtttggcca 2422861 gcaggatcgg atcatgcacc gccaccaggc aggccccggt gcccacgcgc agccgcgtcg 2422921 tgaccgccgc ggcggcggcc agcgccacca ccgggtcata gcagcggcga taccagtccg 2422981 gcagctctcc accgggccac ggcgtgctcc tgctgatcgg cacgtgcgtc ttctccggca 2423041 catacaggcc cgcgaagccg cgctcctcgg cccacaccgc gaccaactgc gggggtgggg 2423101 tcaggtcggt gacgaactgc atgagcgaga cgagcatcgg cggcggcttt cattaagcac 2423161 gaacgttcgt gtttaacgat ggtccgcctg gggcgtgctg tcaatgccgg attgcgtgac 2423221 cgctcgctcg gggcccgggt cagccgtcgg cgccgtttgc tccaggggtg ccgaacagca 2423281 caccacggct gccgccggca ccaccacttc cgacgggtat gccgaagccg ccggccccgc 2423341 cgttgccgcc gttgccgatc accacggcgt tgccgccgtt gcccccgtta ccaccgtcgc 2423401 ctttaacgcc tggggggccg tcgccgcctt ccccgccgtt gccgccgcgc ccgccgtcac 2423461 cgatcagcct ggcgttgccg ccggcgccgc cgtcaccggc attacccggc gtgggagcct 2423521 gtccgccgtt gccgccgtcc ccgccgccgc cgccgctgcc gatcagcccg ccgatcccgc 2423581 cggcaccgcc gatgccgccg tccccgccgc tggtgccggc cgacgagaag ccgccgttgc 2423641 cgccgcgccc gccgacgccg ccgttgccgt agaacagccc gccgttgccg ccggccgcgc 2423701 cggcgccgcc ggaccccgca tcggcaaggg tggagctgtt gtcgttgccc ccgttgccgc 2423761 cggcaccgcc gttgccgccg ttgccgatca gcccgacgtg cccaccggcc ccaccggccc 2423821 caccggcccc accggcgccg ccggccccac cgctgttgcc gtggccaccg ttgccgccgt 2423881 gccccccgtc gccgccatta ccgatcaggg cggccccgcc ggcaccaccc gccccaccgc 2423941 tggcgccggt gttgctgagc ccgccgttac cgccgtcacc gccggcgcca ccgttgccga 2424001 tcagcccggc ggcgccgccg gcaccgccgt ttccgccgtg caccccggtc accccgttgc 2424061 ttccgttccc accagccccg ccggccccgc cgtcgccgta cagcagcccg ccgcgccctc 2424121 cgtcaccacc ggcgccgccg gcccccggtg ccccggatgc gctgccgccg gccccgccgg 2424181 ccccgccatg gccccacagc ccggcggccc cgccggcccc gccggcgggg ctgacgcccc 2424241 cggcggtgcc gagcccggcc gcacccccgg gccccccgtt gccgtacagc catccgccat 2424301 tgccgcccgc acctcccgct ccggtggccg cacccgcgcc tccggccccg ccgttgccga 2424361 tcagccccgc cgatccgccg ttgccgccgt tgggattggc ggtgtcaccg gccccgccgt 2424421 taccaccgtt gccgtacaac aagccgccgg gcccaccgtt ttgcccggga cccccgtcgg 2424481 cgccgttgcc gatcaacggg cgccccagca acgtttgggt gggcgcgttg atcgcgttga 2424541 gcagggtctg ctgcacgttg gcggcctcgg cgctggcata cgagcccgcg ccggcgttca 2424601 gggcccgcac gaactggttg tgatacgccg ccagctgagc gctcaacgtc tgatagccct 2424661 gggcgtgcga cccgaacaac gccgagatcg ccaccgacac ctcgtcggcg ccggccgcca 2424721 ggattcccat cgtggggcct gccgccgccg cactggcggc gctgatcgac gacccgatgt 2424781 tcgccaaatc cgtggcggcc gctgccatca cctccggcgc cgcaatcaca aacgacatcc 2424841 cgcacctccg accagctcag cacaacttca cgaatcccag acctgcgaca ccgtcggcag 2424901 ggctttcgat cctataacaa tctgaaaaca ggatgtcgca ctttccttaa aagagcttcc 2424961 gccaacccga tcgtcagcgc gcacatgttg cgcaaaagtt gttggagccg aaacgaaccg 2425021 gcgcgcgccg ttaccggcgc cgccgcccta ggtggcctgc aagaccaaag gaggcccggg 2425081 atcgggtgac agcgggacgt tttcgcgctg catcagccag cccgcgatgt tgtggaacag 2425141 cggggcggcc gagtgcccag gcgcgccgtc ggagttgcgc gccgggttgt ccaacatgat 2425201 gccgatcacg tagcggggat tgtcggcagt ggcgattccg gcgaaggtga tccaatacac 2425261 gtcgtcgaag tagcagccgc agccagggtt gatctgctgc gcggtaccgg tcttgccggc 2425321 catctgatag ccgggcaccc cggccgtcgg cccggtaccc tgctggtagc ccatcggatc 2425381 gcgttgcacc acggcacgca gcatctggcg cacggtctgg gcggtctgcg ccgacaccac 2425441 gcgaatgtcg tcggggcgcg gttcttcggt tcggctgccg tcgggtgcga cggtggcctt 2425501 gataatgcgt gggggtaccc gcactccatc gttggcgatg gcctggtaca tgccggtcat 2425561 ctgcagcaaa gtcatcgaaa gaccttggcc aataggaaga ttagcgaacg tactgcccga 2425621 ccactggtcg attggcggca ccagtccggc gctctcaccg ggcaggccca cgccggtgcg 2425681 ctgtcccaac ccgaacttgc ggagcatatc gtaatagcgt tccggtccga cacgttggga 2425741 aagcatcagc gtgccgacgt tggaggactt tccgaacacc cccgtggtgg tatagggcat 2425801 cacgccgtgc tcccaagcgt catgcacggt aacaccgccc atctggatcg agccaggcac 2425861 ctgtagcacc tcgtcggggc tgctcaaccc gtgctcgatg accgcggacg cggcgacgat 2425921 cttgttcacc gagcccggct cgaagggcga cgacaccgcc gggttgccca actgcttgtc 2425981 gccctggcgc ccgatgtctt gcgacgggtc gaaggtgttg tcgttggcca tcgcgagcac 2426041 ctcgccggtc ttggcgtcca ggacgacggc cgagacgttg tgagcccccg ataggttctt 2426101 ggcctgctgc acctgctgct gcacgtagaa ctggatgtcg ttgtcgaggg tgagcacgac 2426161 ggtggaaccg tggaccgcct tgtgccgatt ccggtagctg ccggggatga cgacgccgtc 2426221 tgacccacgg tcgtaggtga ccgatccgtc ggttccggcc agcaccgcat ccagggagtc 2426281 ctccagaccc agcagcccat gaccatccca gtcgatgcca ccgacgacgt ttgccgccag 2426341 cgacccaccc gggtactgac gcagatcctg tctttccgca ccgacctcgg gatacttcgc 2426401 gcagatcgcg ctggcgacag ccgggtcgac cgcacgcgcc aagtagacga aggtctcgtc 2426461 gctttgcagc ttcttcagca cggccgcggc atctggcttg ttgttcagct tgccggcgac 2426521 ctcctgggcg atatcgcgca ggcgctgctg cgggtcgggt gcagccgacg tcttcttcct 2426581 ggcctcttcc aattgccgcc gaatccgctt cggctggaac gtcagggcac gcgcctcgat 2426641 ggtgaacgcg agccggtcat tgttgcggtc gacgatgctg ccgcgagccg ctggctggac 2426701 gtcggtgacc ttgagttggc cggccgcctg cgcacgcagg cccgcggcat gtgatacctg 2426761 cagaaagaac aattgtgttg ccgcgaccaa catcaacacc aagatgaccg cgtttccggt 2426821 ccgatgccga aagacgaacg acgcaccgcg cgtcccgacg tccaccacct gccgggtgcg 2426881 cctcgcacga gtcgagcgac ccgcgggtgc gacgtctgac cgtgtcgcag ggcgggattt 2426941 cgtggcttcc tgggcttgcc gggctttctg cgttttgccg ggccgtttgc gttgcccaac 2427001 ctcctgggct cccggtggcc ggcgcaaacc gcgcgccggt cgcgtcgact gcgactgact 2427061 ggcccgcctg ggggcggcgc ggctcacctg ggagcccccg gcgccgttgg caccggcgcc 2427121 gtgacgggac cgaactgttc gccgttggcc ggcactggag cgggtggtgc caccatgggt 2427181 tgcgacccac ccgacagccc gggcgtcgca gccaccggtg ctggtccagg gagcccggcc 2427241 gggggcgccg cacccacctg gagcggcacc ggattttccg ctggtgccgg ggatggcact 2427301 gcgccgagcg gaggagccgg catcggaccc ggcgccccag gtatcggcac cggaccgggc 2427361 agctgcgggc cggcctgggt gggcaggtgg gttgcgccgc ccagcgtcgc tgtgccgtct 2427421 ggggtacgca ccagcacctc cgggccagac cgggcgggcg gagcgggatc atcggggcct 2427481 ggtgtcaccc ggaccggcac ctcgaggggc accgccgcgg gtttcggggg cggcggcgga 2427541 tcttcgggca acttcgtgtt cagcggcggc ggtggaactc cgtcagccgg cttgggtgta 2427601 ccgaccacca cccaattgcc gtccggatcc tgaaccaggt gggcggtatc cctcgtcggg 2427661 atcatgccct ggcgacgagc cgcctcggcc agcgccggcg ccgacgcagc ctcgcgtacg 2427721 tcgcgttcca gcgcttcctt gtgctgctgc agcatccggg tccgctcccg ggcgttgctc 2427781 agctggtagg acctctcggc ggcatcggtg gacaaccaca gtgtgaggcc tagtccgacg 2427841 ccgagcgaac cgataaccag caccacaaac ggaaccttgt ttgccaacgt gcgcggccgc 2427901 aggtcgatcg acgtgagccg ggcggcgaga cgctccatcg gcgtaggacg gaccagcttg 2427961 ggcgccttgg cttttcgggc cttggcccgc gccttggcct ggctggtgtt ctttgcgggg 2428021 gccggccggt cgaacgggct gagcatcggg ctggtttgcg gtccagggcg cgacacccgg 2428081 gcctgccggc cgggtgccga ggtcttgccg gcacggctcc ggatgcggcg cgacggcgcc 2428141 gagttcgtag tcgttcgcct cgtcgccgcg gcaggactgt cggctctcct gcgacgatcg 2428201 ctgctgcggc ttttcggtgc ctcacgcttg gccctcatga atcacccttc tcggttgccc 2428261 attgctgcga ttgcgcccgg tgctcgactc gttgcagggc ccgcaaccgc actggagtac 2428321 tgcggggatt gcgttcgatc tcagccacac tcgctcgttc ggcgccgtgc gttaacgaac 2428381 ggaatcgcgg ctcatggccg ggaagttcga ccggaagtcc cgcaggggtg gccgacgcga 2428441 ctgcctcggc gaacacccgt ttgacgatcc tgtcctctag cgactggtag gccagcaccg 2428501 cgatgcgccc accgatagcg agggcatcca gcgcggcagg aacggccgtg cgcagcgatt 2428561 ccagctcatc gttgaccgcg atgcgcagcg cctggaatgt tcgcttggct ggatgcccgc 2428621 cgacacgccg ggccggagct ggaatcgcct ggtacagcag ggcaaccagt tcggcggtcg 2428681 aggtgaacgg ggtttttgcg cgtcggcgga cgataccggc agcgatgcgc cgagcaaacc 2428741 gctcctctcc gtagcgacgc aggatgtcgg ctagtgccgc ctcgtcgtaa gtgttgacaa 2428801 tgtcagctgc ggtcaacggc gtcgtcgggt ccatccgcat gtccaatggc gcgtccgtgg 2428861 cgtaggcgaa gccccgctcg gcgcggtcga gctgcatgga tgagacgccg agatcgaaca 2428921 ggattccgtc gactgatccc actgcggcat aaccggattc agccagcgct gcgcccagac 2428981 agtcatagcg ggtgtgcacc agggtaagtc ggtcagcgaa tcgcaccagc cgagaccgcg 2429041 cgacgtccag agcggttggg tcacggtcga gcccgatcag gcgcagaccc ggcaatccct 2429101 ccaaaaaccg ctccgcatgc ccgcccgcgc cgatggtcgc gtcgagaagg accgcctgcg 2429161 agccgtctgg atagtagcgg gttagtgcgg gggtaagcag ttcgaagcaa cgttgcgcca 2429221 ataccggcac atgaccgaaa ccggttggcc ccgaacctgg atcagccacc gtgatacctc 2429281 cccaggtctg gcaagccgta cttcgggacg cggctattcc aggcgccgcc cctgcaccga 2429341 ggtccctgtc cgaagacacg aacctggcgt tggggaagta cgccagggtc gcttcgggca 2429401 gagaccacgg tgcacgggtt tgcacctcag aagatgtcac cgagtgcttc atcgctggcc 2429461 gcggagaagt tctcttcatg gatttgttgg tagttctgcc aggcttgcgc atcccagatc 2429521 tcgagatagt cgaccgcgcc gatcaccaca cagtccttgg aaaggcttgc gtagcggcgg 2429581 tggtcggccg acaaggtgat ccggccttga ctgtcgggat gctgttcgtc ggtaccggcg 2429641 gcgagattac gtaggaacgc tctcgcctcg gggttgcttc gtggcgcctt gctggcccgg 2429701 cgcgccagct gctcgaacgc cgcccgcggg taaacggcca ggctgtgatc ttggctcttg 2429761 gtgaccatca acccccctgc caacgcgtcg cgaaacttgg ccggcagcgt cagccgcccc 2429821 ttgtcgtcga gtttgggcgt gtaggtgccg agaaacatgg ggcacctccc tgccaaatcc 2429881 atctcaccca aacacctcag ccaccatacc ccacaatccc ccactttgcc ccataactgg 2429941 ggtatcaaag cggcgttttg ccgtctctgt accactgaag cgcgcggcta gcccggctac 2430001 gacctcagaa aaccgcatgt cgccgggcaa atgggtggca agtggggcca agtggggcac 2430061 aactggggct caaaccggac tcaatatcgc cgacagccgg tgacgacccg gctgggtgaa 2430121 ccgccccggt gagtccggag actctctgat ctgagacctc agccggcggc tggtctctgg 2430181 cgttgagcgt agtaggcagc ctcgagttcg accggcggga cgtcgccgca gtactggtag 2430241 aggcggcgat ggttgaacca gtcgacccag cgcgcggtgg ccaactcgac atcctcgatg 2430301 gaccgccagg gcttgccggg tttgatcagc tcggtcttgt ataggccgtt gatcgtctcg 2430361 gctagtgcat tgtcatagga gcttccgacc gctccgaccg acggttggat gcctgcctcg 2430421 gcgagccgct cgctgaaccg gatcgatgtg tactgagatc ccctatccgt atggtggata 2430481 acgtctttca ggtcgagtac gccttcttgt tggcgggtcc agatggcttg ctcgatcgcg 2430541 tcgaggacca tggaggtggc catcgtggaa gcgacccgcc agcccaggat cctgcgagcg 2430601 taggcgtcgg tgacaaaggc cacgtaggcg aaccctgccc aggtcgacac ataggtgagg 2430661 tctgctaccc acagccggtt aggtgctggt ggtccgaagc ggcgctggac gagatcggcg 2430721 ggacgggctg tggccggatc agcgatcgtg gtcctgcggg ctttgccgcg ggtggtcccg 2430781 gacaggccga gtttggtcat cagccgttcg acggtgcatc tggccacctc gatgccctca 2430841 cggttcaggg ttagccacac tttgcgggca ccgtaaacac cgtagttggc ggcgtggacg 2430901 cggctgatgt gctccttgag ttcgccatcg cgcagctcgc ggcggctggg ctcccggttg 2430961 atgtggtcgt agtaggtcga tggggcgatc ggcacaccca gctcggtcag ctgtgtgcag 2431021 atcgactcga caccccaccg caaaccatcg gggccctcgc ggtggccctg atgatcggcg 2431081 atgaaccggg taattagcgt gctggccggt cgagctcggc cgcgaagaaa gccgacgcgg 2431141 tctttaaaat cgcgttcgcc cttcgcaatt cggcgttgtc ccgccgcaag cgcttcagct 2431201 cagcggattc ttcggtcgtg gtcccgggcc gtgcgccggc atcgacctgc gcctggcgca 2431261 cccacttacg caccgtctcc gcgcagccaa caccaagtag acgggcgacc tcactgatcg 2431321 ctgcccactc cgaatcgtgc tgaccgcgga tctctgcgac catccgcacc gcccgctcac 2431381 gcagctccgg cgggtacctc ctcgatgaac cacctgacat gaccccatcc tttccaagaa 2431441 ctggagtctc cggacatgcc ggggcggttc aggggcttcc cgagactgcg attcccaaac 2431501 gatgacgccc aaacaaaaag cgggaccgcc gatggctgcc ccgctgccgc tggttgcgtt 2431561 cggcttactc gtcgaagcgg cgccggaacc gatcttccat acggctggtg aatgagcccc 2431621 cggccccctt ggtacgacgc tggcgcgaag ccccagcagc cgatccgcca cgatccatcc 2431681 tgccggacaa ccgaggaccg gtgatggcat acaccacacc accgaacatc acgacaaaac 2431741 cgaaaacgct gagtatcggg aaacttccga tcatggtctc tttgaacgcc acgccggaaa 2431801 ccaacatccc cagaccgatg atgaacaacg ccgcgccctg caggcgccgc cgcgcggtcg 2431861 gtgcgcggaa gcccccgcca cggacactcg atgcgaactt gggatcttcg gcgtagagag 2431921 cgctctcgat ctggtcaagc atccgctgct catgatcgga gagtggcatg cgtccctcct 2431981 tgccgacaga ctgtcacgta ataccgataa cacgcggatg cccattgcgc gggcaactaa 2432041 ctcagatgat acgaggtcaa tctgcgccgt accactggtt cgcgggcgat tctatcccgg 2432101 cggcgccgca gcgacgagct gagcggaaac ggccatacgc tacaagcccc gtccagcgcg 2432161 ggcggcctca tcggcttgtc cgatactggt gcgcaagcac gcatcggttc atcacatgag 2432221 gaggacaccg cgcgttggcg atattcctca tcgatctgcc gcccagcgat atggagcgcc 2432281 gcctcggtga tgccctgacg gtgtatgtcg acgcgatgcg ctaccccagg ggcaccgaga 2432341 ctttgcgcgc cccaatgtgg ctggagcaca tccggcggcg cggctggcag gcggtcgcgg 2432401 ccgtcgaggt aacggcagcc gaacaggccg aggccgccga caccacggcg ctgccgtcgg 2432461 ccgccgaact gagcaacgcg ccaatgctcg gagtggcgta cggctatccc ggggcgcccg 2432521 gccagtggtg gcaacagcag gtggtactgg gcttgcaacg cagcggcttt ccgcgcctag 2432581 cgatcgcccg actgatgacc agctacttcg agttgactga attgcacatc cttccccgcg 2432641 ctcaaggccg tggcctcggg gaggcgttgg cccgccgact gctagccggt cgcgacgagg 2432701 acaacgtcct gctctccaca ccggagacca acggtgagga caatcgggcg tggcggttgt 2432761 accgccggtt gggcttcacc gacatcatcc gcggctacca cttcgccggt gacccccgag 2432821 cattcgccat cctgggtcgc acgctaccgc tctaacccgc gcccgacagc ttgccgacgc 2432881 ggcatgcccg gtctggcacg atgacctggt gcgcgctagc tatgccccac cgtcatccca 2432941 aggatcgcga gtggcaagga cccgacggcg cggcatgctg gccatcgcga tgttgctgat 2433001 gctggtgcct ctggctaccg gatgcctgcg ggtccgagcc tcgatcacca tctcgccgga 2433061 tgacctggtg tccggggaga tcatcgccgc ggccaagccg aaaaacagca aagacaccgg 2433121 ccctgcgctc gatggcgatg tgccgttcag ccagaaggtt gcggtctcga actacgacag 2433181 cgacggctac gtggggtcgc aagcagtgtt ttccgatttg acctttgccg agctgcccca 2433241 gttggccaat atgaactccg acgccgccgg agtgaacctg tcactgcgcc gaaacggcaa 2433301 catcgtgatc ctggaaggcc gagcggatct gacatcggta tccgatcccg acgccgacgt 2433361 cgagttgacc gtcgccttcc ccgcagcagt gacttccacc aacggcgacc gcatcgagcc 2433421 cgaggtagtg cagtggaagc tcaagccggg cgtggtgagc acgatgagcg cacaggctcg 2433481 ttataccgat cccaacaccc ggtcgttcac cggagccggc atctggctgg gcatcgccgc 2433541 gttcgcggcc gccggtgtgg tggccgtgct ggcgtggatc gaccgggacc gctccccacg 2433601 gttgaccgct tcgggcgacc cgccaaccag ctagtccggc ttgcccggct cggcaggtga 2433661 ccagtaggca agcatttccg cgaaggtctc gaaagccgcg gccgaaacgc catacgtcgc 2433721 ctcgagatgg atgcttagcg gaaaacccag atcggcgacg ccgtctagca cacgcttgta 2433781 caagtcgacc atgagccggc gccgccgtgc gggttcactg ccggccaact tctgcacgaa 2433841 cgcctgctcg tcggccaccg cggcgtttcc cgggtcctgg atcagccagt tgatcaggcc 2433901 gatgcgggtc tcgaccttcg ggacaaagcc gaacgacagc agaatctcgg gtcggtgttc 2433961 ggtggtcctg gcgaactcgc gcaggaagcc cacgatcgcg tcggaataca acagctgggt 2434021 catgccgtag gttgcgcccc gactgcactt gaaattgagc cggccctgct cgccgtctcg 2434081 ggtggggatc acgatcacac cacggttggc caccagctgg cgatacagcg acagggcatc 2434141 cgtcggcgcg actccggagc cctcgccgtc ctgcatcgtg cgcggtacac cgacgaatac 2434201 gatgccctcc atgccggcat cggacagatc gaccagccgc cggtgcaacg atggctcgtc 2434261 catgaacgcg gttacctgcg tacacaggcc atggactccc gccaactccg gtttgatgat 2434321 cgaccagaaa tcgagtacat ccagcttcgg ctgcatcggg atgggcctat cgtcatcctc 2434381 ggcgatcatc cccggcatca ttacgtgccg tatccggccg tcaagcccgg atgcagccga 2434441 gtactgcacc accttgcgag catcttcgat tgcccgctcc ttgccaccct cgaggttcgg 2434501 tggcaccagc tccagcgcga tcgtgttgag ggtcacacgg ctcctcttcg tcaaacgagt 2434561 acttccatgg ccgccaatgg ggccaccggt gggccgcgcc gcgtcgcgca aatcgccatc 2434621 ctgggccggg ccggaccagc caacccaagg gcgctgaaga cagcataaac acgaaatagt 2434681 cagttagtcg aagcaacttg tgtggtttcc gcgagcccac ccgccgaatc atcgatagcg 2434741 gccactcgcg ccggcgcgga atacactgtc gggccatagg cacgccaaat gagaaagggg 2434801 cgccgcgctg agcctgaatg caccggcagc accggcagcg gtccagttgg ccggcgccat 2434861 caccgaccag ctgcggaggt atttgcacgg ccgccgccgt gcggccgccc acatgggcag 2434921 tgactacgac ggcctgatcg ccgacctgga ggatttcgtt ctcggcgggg gcaagcgcct 2434981 acgaccgctc ttcgcctatt ggggctggca cgccgttgcc agtcgggaac ccgatcctga 2435041 tgtgctgctg ctgttttccg cgctggaact gctgcacgcc tgggcgctgg tccacgacga 2435101 cctgatcgac cgttccgcca cccgccgggg ccgcccgacc gcccagctgc gctacgcggc 2435161 gctgcaccgc gatcgggact ggcgggggtc accggaccag ttcggcatgt cggcggccat 2435221 cctgctcggc gacctcgcac aggtctgggc tgacgacatc gtctcgaagg tctgccagtc 2435281 cgccctggca cccgatgccc agcggcgagt gcatcgggtg tgggccgata tccgcaacga 2435341 ggtgctgggc gggcaatacc tcgacatcgt cgcagaggcc agtgccgccg agtcgatcga 2435401 gtcggcgatg aacgtcgcga cgctcaagac cgcctgctac acggtatcgc gaccgctaca 2435461 gcttgggacg gccgccgcgg ccgacagatc cgacgtagcg gccatcttcg agcatttcgg 2435521 agcggacctc ggcgtagcgt ttcagttgcg cgacgacgtg cttggcgtgt ttggcgaccc 2435581 agccgtgacg ggcaagccgt ccggtgacga cctaaagtcg ggcaagcgta ccgtgctggt 2435641 agccgaagcg gtggaattgg cggacaggtc agaccccttg gcggccaaac tattacggac 2435701 ctcgattggc acccgattga ctgatgcgca ggtacgtgaa ctgcgcacgg tcatcgaggc 2435761 agtgggcgcg cgcgccgccg cggagagccg catcgccgcg ctcacccagc gagcactggc 2435821 caccctggcg tccgcaccca tcaacgcaac agccaaggcc gggctgtccg aactggccat 2435881 gatggctgcg aaccggtccg cctaaccgat gactactccg agccatgctc cagcggttga 2435941 tttggctaca gcgaaagatg ctgttgtcca acacctttcg cgacttttcg agttcactac 2436001 cggtccgcag ggcggaccgg cgcggctggg cttcgccggc gcggtgctga tcaccgcagg 2436061 cgggctggga gccggcagcg tccgccaaca tgacccgctg ctggagtcga ttcacatgtc 2436121 ctggctgcgc ttcggccacg gactcgtgct gtcgtcgatt ctgttgtgga caggtgtggg 2436181 tgtgatgctg cttgcgtggc tgggtctagg ccgacgggtc ctcgccggcg aagccaccga 2436241 gttcaccatg cgggcaacca ccgttatctg gctggcgccg ctactgctgt cggtgcccgt 2436301 cttcagccgg gacacttact cgtatctggc ccaaggggcg cttctgcgcg acggtctgga 2436361 tccttacgct gttggcccgg tcggtaatcc caatgcgctg ctggacgacg taagcccgat 2436421 ctggacgatc accaccgcgc cctacggtcc tgcgttcatt ctggttgcga agttcgtcac 2436481 ggtaatcgtc ggcaacaatg tcgtcgccgg aaccatgctg ttgcgtttgt gcatgctgcc 2436541 cgggctggcg ttgctggtct gggccactcc acgcttggcc agccatctcg gcacccacgg 2436601 cccgaccgcg ctgtggatct gcgtgctgaa cccactggtc ctcatccatc tgatgggcgg 2436661 ggtgcacaac gagatgctga tggtgggtct gatgaccgcc ggtatcgcgt tgaccgtcca 2436721 gggccgtaat gtcgcgggga tcatcctgat caccgttgcg atcgcggtga aggccaccgc 2436781 cggaatcgcg ttgcccttct tggtctgggt ttggctgcgt catctgcgtg agcgacgggg 2436841 gtaccggccg gtccaggcgt tcctggcagc cgccgcgata tcgctgctga tcttcgtcgc 2436901 ggtgttcgcg gtgctgtctg cggtagccgg cgttggccta gggtggctga ccgcgctggc 2436961 cggctcggtg aaaatcatca actggctgac ggtgcccacc ggggcggcca acgtgatcca 2437021 cgcgctgggc agagggctct tcacggtcga cttctacacc ttgctgcgga tcacccggct 2437081 gatcggaatc gtgatcatcg cggtgtcgct gccgctgttg tggtggcggt tccggcgcga 2437141 cgaccgggcc gcgctgaccg gggtcgcatg gtcgatgctg atcgtggtgc tgttcgtacc 2437201 cgccgccctg ccgtggtact actcctggcc gctggcggtc gctgccccgt tggcccaggc 2437261 acgacgggcg atcgcggcca tcgccgggct ctcgacttgg gtgatggtga tcttcaaacc 2437321 cgacggatcg cacgggatgt attcgtggct gcacttctgg atcgccaccg cctgcgcact 2437381 gactgcgtgg tatgtcctgt atcggtcacc ggaccggcgc ggagtgcagg ctgcaacccc 2437441 ggtggtcaat acgccatagc ctgggcccgg cgcaccacct cgcgagcctg gtgggcatgc 2437501 aatgcatcga cgggacgggc gttgctgacg gcgtcacgcg agccgtcgcg ggtgatggtc 2437561 agcgacggat cgggggtaaa cagccagcgc ataatctcgg tgtcgcgata gcccccgtcg 2437621 tgcaggatgg tcaacagccc cggcaggctc ttgaccacct gaccggagtt ggtgaagaag 2437681 acctgaggga tcaccacgcc accagcgcgc cgcacggcca ccagatgacc ttcccgcagc 2437741 tgctgggcca ccttgctgac cggaacgccg agcagctcgg cgacccgggg caggtcgtac 2437801 gtcggttcgt cagggtccaa aacgtcatcg ccagcgggaa tgctgcccac ccgcgcaagt 2437861 gtagagcctg gtgcgcggcc aggcatgcgc gttaggcttc cgttctgcat ccaatcgcgg 2437921 cggccaccta cgatgacccc gtggtcgaag ctggcacgag ggacccgttg gagagcgcgc 2437981 tgctggacag ccgctatctg gtccaggcca agatcgccag cggcggcacc tcgacggtct 2438041 accggggcct ggatgtccga ctcgaccggc ccgtcgcgct gaaagtgatg gattctcgct 2438101 acgcgggcga tgaacagttt ctgacccgct ttcgactgga ggcccgtgcg gttgcccggc 2438161 taaataaccg cgcgctggtc gcggtctacg accagggcaa agacggcagg cacccgtttc 2438221 tggtgatgga gctcatcgag ggcggtaccc tgcgcgagct gctgatagaa cgtggtccca 2438281 tgccgccaca tgccgttgtg gcggtgctgc gcccagtgct tggcgggctg gctgccgccc 2438341 atcgagccgg tctggtgcat cgcgatgtca agcccgagaa catcttgatc tccgacgacg 2438401 gcgacgtcaa actcgccgat ttcgggttgg tccgcgcggt cgccgccgct tcaatcacgt 2438461 ctaccggcgt catcctgggt accgcggcct acctgtcccc tgagcaggtc cgtgatggaa 2438521 acgccgatcc tcgaagcgac gtctactctg tcggcgttct ggtctacgag ctgctaacgg 2438581 ggcacacacc gttcaccggc gactcggcct tgtcgattgc ctaccaacgg cttgatgctg 2438641 acgtgccgcg tgccagtgct gtaatcgacg gtgtaccgcc acaattcgat gagttggtgg 2438701 catgtgcaac tgcccgcaac cctgccgacc gatacgccga tgcgatcgcg atgggcgccg 2438761 atctggaggc gatcgccgag gagctggccc tgcctgaatt ccgggtaccg gcgccgcgca 2438821 actccgctca acaccggtcg gccgcgttgt accgcagccg gattacccag caagggcagc 2438881 tgggtgccaa accggttcac caccctactc gccagctgac tcgccaaccc ggcgactgct 2438941 ccgagccggc ttcagggtcg gagcccgaac acgagccgat caccggccaa ttcgccggca 2439001 tcgcaatcga ggaattcatc tgggcgcgac agcacgcccg tcgaatggtg cttgtctggg 2439061 tgtcggtggt gctggcgatc accgggctag tggcgtccgc ggcatggacg atcgggagca 2439121 acctgagcgg cctgctctaa ggcaggcgag cagtcgcaaa agcccccatt tcggcacgaa 2439181 aatgggggct ggtacgtgaa ttaaggtgac cacggcaagc gtgacccgcc ggcgactgca 2439241 gcgaagccgg gtctgttggt gacagtgtgt atgtcggggt ttcaggcggc aggttcgagg 2439301 gtgaccccca atccttgggc ttcgagtttg gcgacgaggc gacgtcgttc tttgtcggga 2439361 tccatgcggg tggtgaagta gtcggcgccg agatcctggt aaggccggcc ggtggccagc 2439421 acgtgccaaa tgatgacgat cagcttgtgg gcgacggcga tgatcgcctt cttgttggca 2439481 gcgggactgc ggaagccacc gaacttgcgg acctggcggc ggtagtactc gcgcaggtag 2439541 ccatcggtgc gcacggcggc ccacgcgcac tcgaccagga ccggctgcag gtgctggttg 2439601 cctgtgcggc gggcaccgtg atggcgtttg ccggccgatt cgtggttgcc cgggcacagc 2439661 cgcacccacg aggccagatg ctcagccgag gggaaccagg ccgccgggtc ggcgccgatt 2439721 tcagagatga ccgtcgccga ggcacccacc ccgatccccg ggatcgatgc aatcagctcg 2439781 cgtcgggcac aaaagggatg catcagctgc tcgatctgct cgtcgagagc accgatcatc 2439841 gcatcgagct gatccagatg agccaggtgc aacctacaca tcagggcatg gtgatcatcg 2439901 aagcgccctt ccagcgcccg ctgcagatcg gggatcttcg agcgcataag gtgcgcttga 2439961 tcgtcttgcg cccgtcagga ccctcgcccg gcacccacac cgccaccgcg atgatgtcct 2440021 ggcccacatc aacaaaggcg caccgctcgt acagaatatg catcccacca gcccctttcc 2440081 ggctcagcgt cgcaaccaac aacgcgcgct gcgaagggag cccccaaaca tgaactaaag 2440141 agactggtac tcgcgctcgt agcagcaacc gggacacacc cgaaagtggg ggggctccaa 2440201 cgtcagtctc ttgcacggcc acacacagcc aagcccctac gacgtcgaca ccgcaacgca 2440261 cgcaccgatt ctcattcacc atgagcgggc gcaccagcgc ccatcatgtt cttttacgac 2440321 tgctcgccga gctagtcccg cagcatctcc gcgaccagga acgccaactc cagcgactgc 2440381 tgggtgttca gccgcggatc acatgccgtc tcatagcggc cggccaagtc cgtctccgaa 2440441 atgtcttgcg cgccaccaag acattcggtg acgttctcgc cggtaatctc gacatggatg 2440501 ccgcccggat gggttccgag ggcacgatgc acctcgaaaa aaccctgcac ttcatcgaca 2440561 atgcgatcga agtgacgggt cttgaacccc gtggacgact cgtgggtgtt gccgtgcatc 2440621 gggtcgcatt gccagatcac ctgatgcccg gtggcctgga ccttctccac gatcggtggc 2440681 aacagatcgc ggaccttgtg gttgcccatc ctgctcacca acgtcagccg gcccggctta 2440741 ttgtgcgggt cgagccgctc gacgtactcc acggccagtt ccggggtcat gttggggccc 2440801 aacttgaccc cgaccggatt agcaatcacc tgggcaaacg cgatgtgcgc gccatcgatt 2440861 tgtcgggtcc gctcgccgat ccacacggtg tgtgcggaca ggtcaaacag ttgtggttca 2440921 ccgtcgtcac cgtcggacaa cctcaacatg gcgcgctcgt agtcgagcac caaagcttca 2440981 tggctggcat agatttcggc ggtctgtaga ttgcggtcgg ccaccccaca ggcactcatg 2441041 aaccgcagcc cacgatcgat ctcggtggcc agcgcctcat agcgcgcgcc ggccggcgag 2441101 gtccggacga attcccggtt ccagtcgtga accagatgca gcgacgccag gcccgacgaa 2441161 gtcagcgcac gcaccaagtt catcgccgca ctggcgttag cgtaagcccg gaccagccgc 2441221 gacgggtcgt gctcgcgcgc cgcggcgtcc ggggcgaagc cgttgatcat gtcgccgcgg 2441281 taagaccgca gacccagcgc gtcaatgtcg gctgaccgag gcttcgcgta ctgaccggcg 2441341 atgcgggcca ccttcaccac tggcatgctg gcgccgtagg tcagcaccac ggccatctgc 2441401 aacaaggcac ggacattgcc ccgaatatgg ggttcggtgt tgtccatgaa tgtctcagcg 2441461 cagtcgccgc cctgcagcag gaaagcctca ccctttgcca cctgggccag ctgctcttgc 2441521 agccggacga tctcggacgg caccgtcacg ggtggcacgc tctccaacac cgtgcgcatc 2441581 gccaacgcct ggtcggccgg ccaggtgggt tgctgggccg ccggcttggc cagcgcggcg 2441641 tccagtcgtg ttcgcaggtc agtcggcagc ggcggaagcg acgggagctg gtcgatcggt 2441701 atgtcgacgg tccagttcat cggtccatgg taaccgggga tttcctgacg gctgctcagg 2441761 gcgaggttcg ctcggaggtc ctcgccggcg ggatctgact gtccgtctcc tcagcgggcc 2441821 gcgccgcggc ccgcatcgtc tgtggacgtg atgagacgaa accggcgcag ctgatctcgg 2441881 gcatcgacca gcgcgtcgtg gacgtcgcgt ggccgcggcg gcatccgggg gcatccccgg 2441941 tcctcccaca actgccgcag ttcccgggtg aaacggggca ctgtgggtgg caaggcagtc 2442001 atcgggcccc acaattgaca cagcgctaca tggtcgtagg cccccaccca ggcccacaac 2442061 tcgatcgaat ccgtgccgtc gatgcggagg aattcttcca ggtcaagacg aatctgctgg 2442121 cgcgagcgcc acagttgcga ggcgggcggc ggcagcttgg gcagcacatg ggtgcgcacc 2442181 cagctgccgg cccgctcggg atcgaattcc gtggatactg cgtagtattc gcggccgtct 2442241 tctgcgacca ccccgatcga gatcaactcg atggtgtgcc catcctcgat gaattcggtg 2442301 tcgtagaagt accgcaccgc cgcagcctaa tccgaccaga ccgagccgct gatcagaatg 2442361 ggcgcggttc tctccggcgg tggtgcgggg cgcacgtcct ggtcgagctg ggcgtcgacc 2442421 gcgcgctcgt cgggcatccg tggcgtcccg gcgatcacgt attgcagcca cagcttgatc 2442481 cgcaccaccg ggcggcgcca tgtccgttcc cgctgtagcg cacggcgcat cttttccggg 2442541 tggcgggtgt agcgccaccg ggcccacgga gcgtgcggac gcgacagccg gactgcgccg 2442601 acgaccaaca gcacgacaac gaacatgcca agcaacccgg tccaaacctt gcccttgagc 2442661 agcaccacca ccgccaacgg caacgtcaag accaatcctg cgatcagggt ggtttgcagc 2442721 accacccagt tggcgccttg ccgaaccggc aggaagaaga tcagcgggtg taggcccatg 2442781 atcaacagcc ccgcgacggc cactgcggca aagacggcgt ctaccgacgt gcgtccgtct 2442841 tcctcccagt agacatcgga cagatgcagg atcagtgcgt actcgtcgag caccaaggcg 2442901 gccccgactc cgaaaatgct cgccgctatg gtgaattcgg gttctcgacc gtcgactgac 2442961 aaggtgacca gcgtcagccc ggagatcatc accagcacca ccccaaacgc gacgtggtgg 2443021 atgtgcaccg acccgatgtg gacatttcgc ggctgccacc acctggccgg ccgaccgtcc 2443081 gcggcgcgac ggtggataaa ccgtacaaaa ctacgcgtga cgaggaaggt caggacaaag 2443141 gcgaccaagc agcacaacaa cggcagccgg ccacggtcga cgatgtcgtg ctgcagccag 2443201 tggaacacct ccaaaaagct acgcccacct tgactgcata tgcaggcgcc gtacagcgcc 2443261 accatgcgcg cctacgcgaa actactaggc tgttctgcga catgagtgca tggcgggcgc 2443321 ccgaggtggg cagtcgactc gggcggaggg tgttgtggtg cctgctgtgg ctgctggccg 2443381 gcgtggcgtt gggctacgtg gcctggcggt tgttcggcca cacgccgtat cgcatcgata 2443441 tcgacatcta tcagatgggc gctcgagctt ggctggacgg gcgtccgctg tatggcggtg 2443501 gtgtgttgtt ccacacaccc atcgggctga acctcccgtt cacctatcct ccactggcgg 2443561 ccgtcctgtt cagcccattc gcctggttgc agatgccggc tgccagcgtc gcgatcacgg 2443621 tgctaaccct ggtgctgctg atcgcgtcga cggcgatcgt gctgaccggc ctcgacgcat 2443681 ggccaacctc ccgactggta cccgcgccgg ctcggttacg ccggttgtgg ttggccgtgc 2443741 tcatcgtggc tccggcaacg atttggctgg agccgatcag ctcgaacttc gctttcggtc 2443801 agatcaatgt ggtgctgatg accctggtga tcgtcgactg cttcccacgc cgaacgccat 2443861 ggccacgcgg gctgatgttg gggctgggga tagccctcaa actcaccccc gcggtgtttc 2443921 tcctctactt cctgctacgt cgggacggtc gggccgcgct gacggcgctg gcgtcgttcg 2443981 cggtcgccac gctgctcggt ttcgtcctgg cgtggcgcga ctcctgggag tactggacgc 2444041 atacccttca ccacacggac cggatcggcg ctgccgcctt gaacacagac cagaacatcg 2444101 cgggcgcact cgcgcggttg acgattggcg atgacgaacg cttcgcactg tgggtggccg 2444161 gatccctgct cgtgttggca gcgaccatat gggcgatgcg gcgagtgttg cgggccggcg 2444221 agccgaccct ggctgtgatc tgcgtcgccc tgttcgggtt ggtagtttcg ccggtctcgt 2444281 ggtcacacca ttgggtgtgg atgctgccgg ccgtgctggt gattgggcta ctgggttggc 2444341 gtcgccgcaa cgtcgcgttg gccatgctca gcctggccgg ggtggtgctg atgaggtgga 2444401 caccgatcga cctgcttccc caacaccggg agacgactgc ggtctggtgg cgtcaactcg 2444461 cggggatgtc ctacgtgtgg tgggcgctgg cggtcatcgt cgttgccgga ctcaccgtta 2444521 ccgccaggat gacgccgcag cgctcgctta cgcgcggact gaccccggcg ccgacggcca 2444581 gctgactagc cagcggctgt ctcggggatt cgtgcggcgt ccgttgaatt gggatttgca 2444641 ccggcaccgc ccgcgttgcg gccgtctttg acactggcgg catagatgtc gacgtactcc 2444701 tgacccgaga gccccatcag ctcatagatc acttcgtcgg taacggcccg ctcgatgaaa 2444761 tggttaccgg ccaacccctc gaaccgggag aagtccatcg gcttgccaaa ccgaacggtg 2444821 accctgccga acctcagcat cttcctgccc ggcgggttga cgacgttggt accgatcatc 2444881 gccaccggaa tcaccggaac cccggtgtgc aatgccaacc gggctaggcc ggtcttgcct 2444941 ttgtagagcc gaccgtccgg cgagcgagtg ccttctggat acatgcccag cagcttgccc 2445001 tgacccagca acaccactgc cgtctgcagt gcgccctgcg cggagtcggc attggtgcgg 2445061 tcgatgggaa cctggccgga gacgctgtag aaccagcggt tgatccagcc tttcagtccg 2445121 gtgccggtga agtattccga tttcgccagg aaccagatac gacggcgaac taccaacgga 2445181 aggtagaagc tatccgccac cgcaagatgg ttactggcga ggatggccgg acccgaactc 2445241 gggatgtatt ccagtccttc aactttcggg cgaccaagca acgtaaagag cggacccatg 2445301 aaaatgtact tgaacaggta gtaccacatg gccctccctc tcgcccacac cggatggtgt 2445361 ctgcgccaac tgtacccatc cgcgatggct gcgactacct gcgcgggcag cggctcactc 2445421 ctcgatggta accgggatgg cctggtagtg ggatttcatg ctgccctcgc cgttggtatt 2445481 ctcgccgcct gacgcaccgg tctggccgcc gcccggcggg ccttcgggtg gcggtttcgc 2445541 cgaccggtca atgtcgtcga ctatggcgcg gatcacctcc agcagagcca gactgtgatc 2445601 cgcgatgacc gtcagcagcg gatgctgctc gccggttacc aacgctgcca acgcgcacaa 2445661 cggacaccac acttgctggc acttgccggt cccgggacct ccacccgaag ccatcgccgc 2445721 cgccacccgc accgcggggt cgatcccatc gaggattgcc tgcgccagct tgcgcagctc 2445781 gggacgaacg tcggtatggg ccccgctcac gtcggccaca cctccggatt tggtcggaaa 2445841 cgaactgtta actcaccacc ccgcagatgc gcgtccagca ccgtgcacct ccgcaatacg 2445901 gacgccaacc gaactcggcg ccgcatcccc ccagcactga cgatcaagtc gtcgtcggcc 2445961 cggcccaacg tcagcgtccc gggatcgagc tggggcaacg ctagccgcag tcggtatatc 2446021 gacgccaatc ccgacccgga ttccaggtcc acaataggct gtagcgggcc tggcggcgcg 2446081 cttccttggc gacgacgggc actatcgagc aacccgccca gggccttggg gccgatcggt 2446141 tcgccggcca agtgcggcac cagcaccagt gccacgtcac cgatggtggc atcgaggtcg 2446201 tcgaggacgg cacgctgctc accgatgcgt tcggcatacc agtggaaagc cggatggtcg 2446261 ggcagactgc gatactcata gttctcgtct tgcaccagaa gctgattgac gagcagctct 2446321 tcgacccgga cccccatgag cgccaacgac cccagcgtcc ggaccgcctc agcggcgacc 2446381 acccgctccg gagtcagcac caggtgggca ctgaccaggg caccgtcggt cagcaatgtg 2446441 cttagccgct cgacgctggc gcggatgcgc tccagcagtt ccgccagcac ggctgacctg 2446501 ccgtcgtcgg cgccgatgga caacctgcga tgccgcggcc aggcacgttc gacgtacagc 2446561 ccgaaggtgg cgggtagcgt caacatccgc aaggcgtccg ccgtcgaggc gcagtcgacc 2446621 acaatccgat cccatcgtcg ggctgccgca agctcgccga cggcgtgcag ccccagcacc 2446681 tcctggatcc cgggcagcgc gcagagttct tcgggcgcaa tgctgctcaa ctcggagccc 2446741 ggaaatctgc ggtccagggt ctcgaccacg tgcaaccacc ggccctcgag cagggccagg 2446801 gtatccagcg ccagcgcgtc gagaaatccg cccccggctt cggggtcgta ggcgagcacg 2446861 cgaacaggat cgccctgacc ggtaggcggg accgcgatgc ccagcacgtc gcccagcgag 2446921 tgcgcctggt cggtggatac caccaacact cgctggccgg ccccggcatc acataccgcg 2446981 gtggcggacg ccagagtgga ctttcctacc ccgcccttgc cgacaaagag actgatccgg 2447041 gcctgagccg gcgtaccgga atcactcagc cctcgactcg tttcttcaga tccttcaacg 2447101 cgccgtctat caacctgcgt tccgccttac gcttgagcat cccgatcatg gggacagcaa 2447161 ggtcgacggc aagctcgtag gtgacctcag tgccagaacc cttgggcgcc aagcgatacg 2447221 tgccttcgag ggactttagc agcgagctgg attcgagagt ccagctaagc gattggcggt 2447281 cttccggcca ctcgtaggac atgatcaagg tgtctttgaa gatggctgcg tccatcaaca 2447341 ttcgcgctcg tttcgggtag ccctcgtcgt cggcctctag gatctcgact tccttatact 2447401 ccgaaatcca ttgcgggtag gcttcgatgt cggcgatcgc cttcatcacc tcgcctggat 2447461 ccgcgtcgat gtaaatcgtc tgtgtcgtct tgtccgccac ctggctactt ccctttcccc 2447521 gcaagcgggt cggccccgat catctgcggg agctcccgat ctcccgggga gaaacggtac 2447581 tccctcgtgc caaccttgac ccggttaagt taccggagaa accccgatgg ggcgtgaccg 2447641 ttctagcact gtcttgacct cgaaggccat ttttttgccc gcgacccgtc ggtggtgcgt 2447701 cattctggcc aggttcatcc gggccagctg ccaggctgct accccggtcg gttcggcgtg 2447761 caggaaatag tgcagtagca ccccatccat cgacggttcc agccagatct ccatggtgcc 2447821 ggtcagggcg ccggtaaccg tccacctgat tcccttgtcg gcacgatcct cggtgacctg 2447881 tagccgtagg tcaggccacc accgacgcca gctgcatcga tccgcgaccg cggctgaaac 2447941 ccgcgcggcg tcggctgcga cataggtctc gtcagcgatc tggatgctgt tcatcgcctc 2448001 agcttcacat acccgaggcc gtgggcaagc cggaccccga agggcaccaa ccaacggaca 2448061 cgcgatatcg gtctattccg caccggcatc aacccctcta ggcttgacga cagcaaaccg 2448121 gacccggaag acggcaacag gtcaagtgag gtgttgatcg tgcgtgagat tagcgtcccc 2448181 gccccattca ctgtcggcga gcacgacaac gtcgcggcca tggtgttcga gcatgaacgt 2448241 gacgatcccg actacgtcat ctatcaacgc ctgatcgacg gcgtctggac cgatgtcacg 2448301 tgtgcggagg cagccaacca gattcgtgcc gcggctctcg gtttgatttc actgggggtg 2448361 caggccggcg atcgggtagt catcttctct gccacccgct acgagtgggc gatcctcgat 2448421 ttcgcgattc tggctgtggg tgcggtcacc gtaccgacct acgagacctc gtcagcggag 2448481 caggtgcgct gggttttaca agactccgaa gcggtggtgt tgttcgccga aaccgactca 2448541 cacgcgacaa tggtcgccga actctccggc agcgtgcccg ccctgcggga ggtactgcag 2448601 atcgccggtt cgggtcccaa cgcgctcgat cggctcacgg aggcgggcgc ctcggtcgac 2448661 ccggccgagc taaccgcccg cctcgccgca ctacggtcga cggacccggc gacgcttatc 2448721 tacacctcgg gcaccaccgg acgacccaag ggctgccagt tgacccaatc caacctggtt 2448781 cacgagatta agggcgccag ggcatatcac ccgacgctgc tgcgcaaggg tgagcggctg 2448841 ctggttttcc tgccgctagc tcatgtgctg gcgcgcgcga tcagtatggc cgccttccac 2448901 tccaaagtca ccgtgggatt caccagcgac atcaagaatc tgctgccgat gttggcggtg 2448961 ttcaagccga cggtggtggt gtcggtgccg agggtgttcg agaaggtgta caacaccgcc 2449021 gagcagaacg ccgccaacgc cggcaaaggg cgaatcttcg cgatcgccgc gcagaccgcg 2449081 gtcgactgga gcgaagcttg cgaccgcggc ggaccggggc tgctactgcg cgccaagcac 2449141 gcggtgttcg accggctggt ctaccgcaag ctgcgtgcgg cactgggtgg caactgccgc 2449201 gccgccgtct ccggcggcgc gccgctgggt gcgcggcttg gtcacttcta tcgcggcgcc 2449261 ggtctcacca tctacgaggg atacggcctg agcgggacca gtgggggcgt cgccatcagc 2449321 cagttcaatg atctaaagat cggaactgtc ggaaagccgg tgcccggcaa cagtctacgc 2449381 atcgccgacg atggcgagct gctggtgcgc ggtggcgtgg tattcagcgg ctactggcgc 2449441 aacgagcagg ctaccaccga ggcattcacc gacggctggt tcaagaccgg tgatctcggt 2449501 gcggtggacg aagacgggtt cttgacgatc accggccgca agaaagaaat tatcgtcacc 2449561 gcgggcggta aaaatgtcgc ccccgctgtg ctggaagacc agctgcgggc ccacccactg 2449621 atcagccagg cggtggtggt tggggacgcc aagcccttca tcggcgcgtt aatcaccatc 2449681 gaccctgagg cattcgaggg ctggaagcaa cgcaacagca agacagctgg cgcgtcggtg 2449741 ggcgatttgg ccaccgaccc cgatctgatt gccgagatcg acgcggccgt caaacaggcc 2449801 aatcttgcgg tgtcacatgc cgagtcgatc cgcaagttcc gaatactgcc cgtcgacttc 2449861 accgaggaca ccggcgagct gaccccgaca atgaaggtca aacgcaaggt ggtggccgag 2449921 aagttcgctt ccgatatcga ggcgatctac aacaaggaat agccgactgt gcccggctcc 2449981 tccccggccc gctcaacggg ccgcatcgtc gccgcgcaga aaatctgcta gcttggcggc 2450041 cagcgtgtcc caacgccact gcgccgtgac ccattctcgg ccggcggcgc ccatcgcgac 2450101 ggcccgatcc cgatcgatca gcaactcggc cacggcgtcg gccacccggt ccaccgacct 2450161 accgtcgacc actagcccag tcttgttgtg ctgcaccgtt tccggcgctc cgccagaatt 2450221 gccggcgatt accggcacgc cggcggcgga ggcttcgagg aacacgatgc ccaagccctc 2450281 gacgtccatc ccggcgccgc gggtgcggca tggcatggcg aacacgtcgg ccagtgcgtg 2450341 gtgggcggga agttcgtcgg ttgccacgcc gccggtgaac gtcacgtggt cggccacccc 2450401 acagtcgtga gccagcttgc gcaacgtctc tagatatgga ccgccgccga caatcaccaa 2450461 cgcggctcca tcaacgcgac gccggatcga cgggagcgcc gtgaccaggg tgtcctggcc 2450521 tttgcgcggc accaaccgcg acagacacac taccgtgggc cgctcgccta gccgatagcg 2450581 cttccgcaac tcggcgcgtg cggccggatc ggggcggaac cggtcggtgt ccactcccgg 2450641 cggtaggtat tccaacgaag ccgcgggccc gaacgcagaa gcaaaccggg accgcgtgta 2450701 gctgctgacg aaagtcacca cgtcggtgcc gtcgccgatg cggcgtagca ccgatcgagc 2450761 gaccggaagc atcgaccagc ccacttcgtg gccgtgcgtg ctggccaaca cccggctagc 2450821 tccagccagc cgggcacgcg gggccagcag ggccagcggt gcggccgcac cgaaccagac 2450881 ggtttcgatg tcgtgctcgg cgatcagccg gcgcatccgg acatcgaccg ttggacccgg 2450941 cagcatcacc gtgctgggat ggcgcaccac ccggtaaccg gcagcacggg ctgcgtcgtc 2451001 gaaggcgtcg gcgcctttcc actgcggtgc atacactgtc atcgcatgcg ctcgggagcc 2451061 gaccagccga ccgacgaact cccccagata ggactggatg cccccgcgtc ggggtggaaa 2451121 gtcgttagtt accaacagga cccggctcac ctgggtcagg ctagcgggtc caccttgcgt 2451181 gagcagacgc aaagtcgccc aaaatcgccg gtttccgggt gattttgcgt ctgctcgcgg 2451241 cggaagctag cccattagcc accgctgcca gcgcgcgagc aggccagcgg cgtcaatgcc 2451301 gaggacgtcg tgggcggcgg tagccaggtc gaagtgtccg acaccacaag tggccagata 2451361 aagctcacgc agcttggcgg taccgtaggc ggccgcgacg aaccgagcga accaccacgc 2451421 gcggtcatat gccagcgagc gctgtggccc cggagtgtcc aggtcggtgt ccgacggtaa 2451481 cgacagcgcc acagacaccg catccgcggg cggcggggtc ttgggcctgg caacgaaatc 2451541 ggccaccccc tcggccagcc atcgaggtgc atccagggcc gtgtcggccc gggccgcata 2451601 gtgaaaaagc tcgtggccca acactattcg tagcgccgct gggctcatgt gtgccgcgcc 2451661 cggcgcgaac acaatccgtt ggccgaccac cgtgcgacga gcaggatcga cacggtcgac 2451721 caccgtgatc gcggcgatgt ccgcccattg cgacgccaaa cccccgcctg cggcggcatg 2451781 aaactgctcg tcgctaccgg cggcaaccac aaagatgtcg tgcgaccaat cggtgcccca 2451841 gaatgccacc acctcgtcga ccgcggcgtc gatgcccgcc gcgatgcgcg acagcaagcg 2451901 gtcggtggcc gcgccaccaa ggctgagcag ccgcaccgtg cggtcgtcgg cgacccgcag 2451961 cgcgacaaac ccatcggctg gcgcgaccac ctgtgccggt gctgcaggtc catcgcgcac 2452021 cggattacca gacagggccg ccgcgccaat cagctccgca acaaacagac aggccagcaa 2452081 gatccgaggg caaagccgcc gcgcgacgag ccggtcagta acggcgggcg tcgtagatcg 2452141 gccccgacga gtccatcggc accacccgca ccggaacacc gtaggtggag gaatgaacca 2452201 taagaccatc accgatgtag atgcctgcgt gtgacgcgtc ggaatagaag gtcaacacgt 2452261 cgccgggctg cagatccgac aacgcgaccg gctgaccacc gtgagccagc gcctggctgg 2452321 agtgcggtaa cgcgatacca gcctgctgga acgcccacat caccaagcct gagcagtcga 2452381 acccgccggg cgcggcacca ccccacgcgt agggcgcgcc gacctgcgtc aacgccgctt 2452441 ggacaacggc cgtacggtcg ccgccagcgc cgtcgggctg cacgaaaggc aatccgggca 2452501 tcccaccagg cggcggcgcc acgccaggcg ccgggccgtc gccaggcggc gcaccgggcg 2452561 gcaacgccgc aggtggggcc ccgggggcga tcgcagcaac cgccgggacc ggtcctggat 2452621 cagcgagggc cgtgcgctcc tccggcgtca acgcgacgta ttgcgacttg acgacggcaa 2452681 tctgcacctg cagctggctc tgtttgtgct gcagattcgc tcgtaccgcg gcagcttgct 2452741 cggccgcgga cctggcatcg gccgccgatt tggctgcagc ctgctcggcc ttgacggcct 2452801 gttctccagc ggccttgaaa cgggccatct gcgtggacat ttgatgcgcc atcacccgct 2452861 gtaccgatag ccgatcgatc aacagttgcg gggactccgc cgtcaggatc gcatccatgc 2452921 cgtgggtacg accacccatg taggtagcgg ccgcgacctt gttcaccgcc gtctgaaaag 2452981 tcgccaagcg tgctctcgca gcatccaagg ccgttctgtt gtccgcaagc ttctggtcgg 2453041 cggcccgctg ggcagcgagc ttttcgttga gatccagctg cgcactgtgc agcgcctcgg 2453101 tggtctgctc ggcctgccgg gataactcgt tgagcttggc cagcgcgtcg tcggccggat 2453161 cagccagcac attcgcggcc aggacgccgg aggagacggt gaagctcgca aagaaaccta 2453221 tggcggaccg catgattaca cgcgcgatca accacctctg gtcgagcctc aaaatttgct 2453281 tccttaaacg ggccatcgac ggatgacgtc gagctggttt aggtctcaaa caggttacga 2453341 aacgatctcg gaattgtcca aaaggggaag ttaagaaaat ggatagattt ctaccatttc 2453401 gctgtggacg atcgtacttc tgctataggg ctccaggggc atcgacacgc aacgacctta 2453461 cgcgacaccg gatccgcgct ggcggcggac cggcaccagg cgcaaccgag gggccaatcc 2453521 gacatcggcg agcacttcca acgcagcacg ctcgtcatgc gacaggcttt ccggtacccc 2453581 gatgagcacg ctgaccacac agtcgcggca tccagatccg cgcgccgcgc aatcgtcaca 2453641 gtcgattacc accggcgccc ccggcccggg ggctgtgccg ccgctagtgt ctggtccgct 2453701 gcgtgccatg gggtcgttcc tctcggcttg gctcatgagg tcgtccgaac gctaatcgcg 2453761 agcaccgaca tccgttgccg ccgcgtgcgc gctcggcgta gggagcgttt gcgtgtcagt 2453821 gcaggggcct aacgtcgcgg ccatgggtgc aaccggtggg actcagctga gtttcgccga 2453881 cctggcacac gcccaggggg cagcctggac cccagccgac gagatgtccc tgcgcgagac 2453941 caccttcgtc gtggtcgacc tggaaaccac aggtgggcgc acgacgggta acgacgcaac 2454001 accgccggac gcgatcaccg aaatcggggc ggtcaaggta tgcggcggcg cggtgctcgg 2454061 tgaattcgcc accctggtaa acccgcaaca cagcattccg ccccagatcg tgcggctcac 2454121 cggtatcact acggcgatgg tgggtaatgc cccgacgatc gacgccgtcc tgccgatgtt 2454181 cttcgagttc gccggcgact cggtgctcgt ggcccacaac gctgggttcg atatcggatt 2454241 cctgcgcgcc gccgcgaggc ggtgcgatat cacctggccc caaccacagg tgttgtgcac 2454301 gatgcggctg gcccggcggg tgctgagccg agacgaagcc cctagcgtgc gtctggccgc 2454361 gctagcgcgg ctgttcgccg tcgccagcaa ccccacccac cgcgccctcg acgacgctcg 2454421 cgccaccgtc gacgtgctgc acgcactcat cgagcgagtg ggcaaccagg gcgtgcacac 2454481 ctatgccgag ctgcgctcgt atctgcccaa cgtgacccag gcgcagcgct gcaaacgggt 2454541 actggcggaa acactgccgc accggccggg ggtgtacctg ttccgcggac cgtcgggcga 2454601 ggtgctctat gtcggcaccg cggcggactt gcgccgccgg gtaagccagt acttcaacgg 2454661 caccgaccgc cgcaagcgga tgacggagat ggtcatgctg gccagctcga tcgatcatgt 2454721 cgaatgcgcg caccccctgg aggccggtgt ccgtgagctg cggatgctgt cgacgcatgc 2454781 cccgccgtat aaccgcaggt cgaagttccc ataccggtgg tggtgggtgg cgctcaccga 2454841 tgaagcattt ccacgcctgt cggtcatccg ggccccgcga cacgaccgcg tcgtcggccc 2454901 gttccgatcc cgctccaagg ccgccgagac ggcagcgctg ctggcacgct gcacgggact 2454961 gcgaacctgc accactcggc tgacacgttc cgcccggcac ggacccgcct gccccgagct 2455021 ggaagtgtcg gcctgcccgg ccgcccgcga cgtcacggcc gcgcaatacg ccgaggcggt 2455081 actgcgcgcg gcggccttga tcggcggatt ggacaacgcc gcgctggccg cggccgttca 2455141 acaggtcact gagctcgccg agcgccgtcg ctatgagagc gctgcccgac tgcgtgacca 2455201 cctcgccacc gccatcgagg cgttgtggca tggccaacga ttgcgagcac tggccgcgct 2455261 gcccgagttg atcgccgcca agccggacgg ccccagggag ggcggctacc aactggccgt 2455321 cattcgccac ggccaactcg ccgctgccgg cagggcaccg cgcggggttc ctccgatgcc 2455381 tgtggtcgac gccatccgcc gcggcgctca ggcgatcctg cctacgccgg caccgctcgg 2455441 cggggcactg gtggaggaga tcgcgctcat cgcccgctgg ctggccgagc cgggagtgcg 2455501 catcgtcggg gtctcgaacg acgccgcagg gttggcctcc ccagtgcgct cggccggccc 2455561 gtgggcagcg tgggcggcaa cggcgcgctc ggcccagttg gccggcgagc agctcagcag 2455621 aggttggcag tcagatctgc cgaccgaacc gcacccatcg cgcgagcaac tgttcggccg 2455681 caccggtgtc gattgccgca ctggcccgcc gcaacccctc ctcccaggcc ggcagccatt 2455741 cagcacggct ggataatccg gcgtgggcga cgatcgcacc ggcggcgttg agcaccacag 2455801 cgtcccggac cgggcccctg gcaccgccca acaccgcgcg caccgcggcc gcgttggctt 2455861 gcgcatcgcc tccagccagc tggtcaagct gggcgcgcgc aaacccgaat ccggcgggat 2455921 caaacgtcaa cttatccacg ctgcccgccg caacgcgcca gatcgtgctc gtggtggtgg 2455981 tggtcaactc gtccagccca tcgtcgccgt gtaccaccag cacactggac cggcgcgcag 2456041 caaacacccc ggccatcact tcggcgaggt cggcgaacgc gcatccgatc agtccagccc 2456101 ggggccgggc cggattggtc agcggcccga gaagattgaa cacggtgggc acaccgatct 2456161 cgcggcgtac cgcggccgcg tgccggtagg agggatggaa ccgcggcgcg aagcagaacc 2456221 cgatcccaac ctccgcgagg ctgcgcgcga ccaggtcggg tcccaggtcg atgcgcaccc 2456281 ccagcgcctc cagcgtgtcg gcgccaccgg acaacgagga cgccgctcgg ttgccgtgct 2456341 tgaccaccgg cacacccgca gccgccacca caatcgccgc catggtggat aggttcaccg 2456401 tgttgactcc gtcgccaccg gtgccgacga cgtcgacggc gtcgtcgggg accgtatcgg 2456461 cgggcaacgg atgcgcgtgg ctgagcatga cgccagcgag ctcaccgact tcgtcggcgg 2456521 tcggagcctt catcgtcatc gccaccgcga aggcggcgat ctgcgccggc cgcgcattgc 2456581 cggtcatgat ctggtccatg gcccaggcag cctggccccg cgccagatcg cggttgtcgg 2456641 tcaaccgccc caaaatctgc ggccaggacg gcaccgatgc ggcttctgct ttcggcgagc 2456701 ccccgcgaga tccccccgaa gaaccctcag ctgacagcgc cacgcgctga tggtcccatg 2456761 aggatcaacc aaccccaacc gcgccctgaa cacgtcgacg acttgcgcta accaaacggc 2456821 cgggcgacac gcggaactga cttaccgaaa tttccgaccc gggtagagtt cgacaactac 2456881 aaagcgtcat acttgcggat gtgacgagtg ctgttgggac ctcgggtact gccatcacat 2456941 cgcgcgtgca ttcgctgaat cggcccaaca tggtcagtgt cggcaccata gtgtggctat 2457001 ccagtgaatt aatgttcttt gctgggctgt tcgcgttcta tttctcggca cgagctcagg 2457061 ccggcgggaa ttggccgccg ccaccgacag aactgaatct gtaccaggcc gtcccggtca 2457121 cgctggtcct gattgcctcg tcgttcacct gccagatggg cgtgttcgcg gccgaacgcg 2457181 gcgacatctt cgggctgcgc cgctggtatg tgatcacatt cctgatgggc ctgttcttcg 2457241 ttctgggcca ggcctacgag tatcgcaacc tgatgtcgca cgggacgagc atccccagca 2457301 gcgcatacgg cagcgtgttc tatctggcca ccggattcca tggactgcac gtcaccggcg 2457361 gcctcatcgc cttcatcttc ctgctggtac gcactgggat gagcaaattt actccggcgc 2457421 aggccacagc cagcatcgtc gtctcttact actggcattt cgtcgacatc gtgtggatcg 2457481 cgctattcac cgtgatctat ttcatccgat gagccggcgt ccgacgaaca tcccacgaac 2457541 aggagtgctc ggttgacgaa actggggttc acccgatccg gtggcagtaa gagtggtcgc 2457601 acgcgacggc gcctgcgccg ccgattgtcc ggcggagtgt tgctgctgat agcgctgacc 2457661 atcgccggtg gattggcagc tgtgctgacc cctaccccac aggtggccgt cgccgacgaa 2457721 tcctcctcgg cgttgctgcg caccggcaaa caacttttcg acacctcgtg tgtgtcctgc 2457781 catggcgcca acctgcaggg cgtgcccgac cacgggccga gtctgatcgg ggtcggcgag 2457841 gccgccgtct acttccaggt gtcgaccggc cggatgccgg ccatgcgcgg cgaggcacag 2457901 gcgccgcgca aagatccgat cttcgacgaa gcacagatcg acgcgatcgg cgcctacgtg 2457961 caagccaatg gcggtgggcc gacggtggta cgtaaccccg atggcagcat tgcaacgcag 2458021 tcgctacgtg gcaacgacct gggccgcggc ggcgacttgt tccggctcaa ctgcgcctcg 2458081 tgtcacaact tcaccggcaa gggcggagca ttgtcgtccg gcaaatacgc acccgacctt 2458141 gcgcccgcca atgaacagca aatcctcacc gcgatgctga cgggtccaca gaacatgccg 2458201 aagttctcca accgccagct ctccttcgaa gcgaaaaagg acatcattgc ctacgtgaag 2458261 gtcgccaccg aggcgcggca gcccggtggt tacctactcg gcggattcgg acccgcaccc 2458321 gaaggcatgg ccatgtggat catcggaatg gtcgccgcga tcgggctggc actgtggatt 2458381 ggggcgcgat catgagccgc gccgacgacg atgcagtggg ggtaccaccc acttgcgggg 2458441 gacgaagcga tgaggaggag cggcgcatag tgcccggacc taacccgcaa gacggggcca 2458501 aagacggggc taaggcaacc gccgtccccc gtgaaccgga cgaagccgcg ctggccgcga 2458561 tgtccaacca ggagctgctc gcattgggcg gcaagctgga tggtgtccgg atcgcctaca 2458621 aagagccccg ctggccggtc gagggcacca aagccgagaa gcgcgccgag cgttcagtgg 2458681 cggtgtggct tttgctaggt ggcgtgttcg gactggcgct gttgctgatc ttcctgttct 2458741 ggccgtggga gttcaaggcg gcggatggcg aaagcgactt catctactcg ctgactaccc 2458801 cgctctacgg cctgactttc ggattgtcca tcctgtcgat cgccatcggc gccgtgttgt 2458861 atcagaaaag gtttattccc gaagagattt caatccagga acgtcacgat ggcgcttcgc 2458921 gggagatcga ccgcaagacg gtggtggcga acctgaccga cgcgttcgag ggctcgacga 2458981 tccgacggcg caagctgatc gggctgtcct tcggcgtggg catgggtgcg ttcgggctag 2459041 gcaccttggt cgcgtttgct ggtggcctca tcaagaaccc ctggaagccg gttgtcccca 2459101 ccgccgaggg caaaaaggcg gtgctctgga cgtcgggttg gaccccccgc taccagggcg 2459161 agacgatcta tctggcgcgc gccaccggca cggaggacgg accaccgttc atcaaaatgc 2459221 gcccggagga tatggacgcc ggtggaatgg agaccgtttt tccctggcgg gagtccgacg 2459281 gcgacggcac caccgtcgaa tcacaccata agctgcagga aatcgcgatg ggtatccgta 2459341 acccggtgat gctcatccgg atcaaaccca gtgacctggg ccgcgtggtc aagcgcaagg 2459401 gccaggagag tttcaacttc ggcgaattct tcgcgttcac caaggtctgc tctcatttgg 2459461 gttgcccgtc atcgctgtac gagcagcaga gctaccgaat cctgtgccct tgtcaccagt 2459521 cgcagttcga cgcattgcat ttcgctaagc cgatcttcgg tccagcggcc cgcgccttgg 2459581 cgcaactgcc gatcacgatc gacacggacg ggtatctggt cgccaacggt gactttgtcg 2459641 agcccgtcgg accagcattc tgggagcgaa caacaacatg agtccgaaac tgagtccgcc 2459701 gaacattggt gaggtcctgg cccgccaagc cgaagacatc gacacccggt atcacccctc 2459761 ggcggcgctg cgtcgtcagc tcaacaaggt cttcccgacc cactggtcgt tcttgctcgg 2459821 cgagatcgct ctgtacagct tcgtggtcct gctgatcacc ggcgtgtatt tgacgctgtt 2459881 tttcgatccg tccatggtcg acgtcaccta caacggtgtc tatcaaccgc tgcggggcgt 2459941 cgagatgtcg cgtgcctacc agtccgcgct ggacatttcc ttcgaggtgc gcggtggcct 2460001 gttcgtgcgc cagatccatc actgggccgc tttgatgttc gcggcggcaa tcatggtgca 2460061 cctggcacgc atctttttca ccggagcgtt ccggcggccc cgcgagacca actgggtgat 2460121 cggttcgctg ttgttgatcc tggcgatgtt cgagggctat ttcggctact cactgcctga 2460181 cgacctgctg tcgggactcg gtctgcgcgc ggcactctcg tcgatcacgc tgggtatgcc 2460241 ggtaatcggg acctggctgc actgggcgct gtttggcggt gacttccccg gcaccatctt 2460301 gatccccagg ctctacgccc tgcacatttt actgttgccg gggatcatct tggcgctgat 2460361 cgggctgcat ctggcgttgg tgtggttcca gaagcacacc cagttccccg gcccgggccg 2460421 caccgagcac aacgtcgtcg gcgtgcgggt gatgccggtg ttcgcgttca agtccggcgc 2460481 atttttcgcg gctatcgtcg gtgttctggg cctgatgggc ggcctgctgc agatcaaccc 2460541 gatctggaat ctggggccct acaagccatc acaggtgtcg gcgggctcgc agccagactt 2460601 ctacatgatg tggaccgagg gtctggcccg gatctggccg ccgtgggagt tctacttctg 2460661 gcatcacacc attcccgccc cggtctgggt cgccgtgatc atgggcctgg ttttcgtcct 2460721 gctacccgcc tacccattcc tggagaagcg gtttaccggc gactacgcgc atcacaacct 2460781 gttgcagcgg ccacgggacg ttccggtgcg caccgcgatc ggcgccatgg cgatcgcctt 2460841 ctatatggtg ctcactctcg cggcgatgaa cgacatcatc gcgttgaagt tccatatttc 2460901 gctgaatgca accacgtgga ttggccgcat cggcatggtg attctgccgc cgttcgtcta 2460961 cttcatcaca tatcggtggt gtatcggatt gcagcgcagc gatcggtcgg tgctcgagca 2461021 cggcgtcgag accggcatca tcaagcggct gccccatggc gcctacatcg agctgcatca 2461081 gcccctcggc ccggtcgacg agcatggcca cccgataccg cttcagtatc agggagcgcc 2461141 gctgcccaag cgaatgaaca agctgggctc ggccggatcg ccgggtagtg gcagttttct 2461201 gttcgccgac tccgcggcag aggatgcggc gctgcgcgag gcagggcacg ccgccgaaca 2461261 acgtgccctt gccgcactgc gcgaacacca ggacagcatc atgggttcgc cagacggcga 2461321 gcactagccc ggcgacgacc cgggtcggca cgacccggga aggaaccggg caaatcaagc 2461381 acagcccggc gacgacccgg gtcggcacga cccgggaagg aaccgggcaa atcaagcaca 2461441 gcccggcgac gacccgggtc ggcacgaccc gggaaggaac cgggcaaatc aagcacagcc 2461501 cggctaactg gactggggcg ccaccacccg gcgcagctgc cgagcgtata gccactcgat 2461561 caccggcatg cccgcggtga ccaccccggc caacccgtag ctgatccaag atggcccgtc 2461621 gtgaccgacc gccatgaggt aggtcgccgc cgccacggca atcaacgcaa tgccaatcgc 2461681 actggtcaac acgactgtcc cgcgcaacca gatccggtcc accgcctcac tggaccactc 2461741 ggcggccacc tcgaatgcat ccgcgtgctg tacgggtgcc gactcggcca cagcgcgttt 2461801 cgccggatgc ccggatccga tcgatcgccc gccccgcacg gatgcacccg tcggcctcgt 2461861 cgcgggctcg gcctcagcca tgcggcgagc tcgcaacagc accggtatcg cgcccacgat 2461921 gaccagtgcg gagaccacaa ttacggcgta cagcacccac gtggtgtgcg ggtttccggc 2461981 catctcgtgg aagcccctac ccaggtccat cagggcgaca gcggcggcca ccgacacgcc 2462041 ggtgaacacc agccacaccg cggcacatgc cccaaccagg atgcgatcga tgacgtccgg 2462101 cgagattaca tccggcccac gccggtatgc ggaatatctg ctcaccatca gcagctcgtt 2462161 tgcggtccat cgttggagtt cgatgagagc accgttccgt cgctcgtggt gatcgagcag 2462221 ttgagtttgc tgacccggaa aaggctggag gcctccaccg agccaacgtc ggattgcgag 2462281 atcggggtga ccgtcatgga ccacgggatg tacacattgt gctgtgtccg tcggcgcccg 2462341 gcggcatcga cgtaagtcac cgagataatg tcacccggcg ccttggtacc ggtcaccgaa 2462401 taggtgactt gccgcggacc ggtcggcgtg gtggtcgtgg gcggcggtgc cgccgccgtt 2462461 gtggtggtcg ccggcggcgg cgccgtggtt gtcgccggtg ggggcggtgg tggcggcgtc 2462521 acagtgaccg tctgtgtctc cgtcgctgtc gggatctcgg tggtgggcgg tggggctggt 2462581 ggcggcggtg gcggcgccgg cttggtggtc gtgatttcgt cctgcacggg cggtgcagag 2462641 gacgtagtgt cgccggtggc gagtttgctg gtatgtggtc gcgtgacgag caacgacacc 2462701 gaaaccacga gcgcaacggc ggcaattatg gcggcgacac cgaccaccca cggccagcgc 2462761 ggagcggcca gttcgtcgtc caggtcggac gactcctcat agtcgtcgta gtcatagagc 2462821 ctgagatcgg ctggcacata cgggccgccg gtgacgtgct cggattccgg ggcagagtat 2462881 gcccgagaat atgcgtcggt ctcgcccgtc tggtcactgg gcagtttgtc gccgcccccg 2462941 gcgacgggcg gcaagtggtt gccggaagcc cgttcgtcgc ccgtgtcgct gacgggttcc 2463001 gattcgggtt cgtcaggttc ccgtcccggg ggattcggcc cgctcatgtt tgcctaccct 2463061 gtccaactgc ctcaccaaca cgcgtggctt tccgcctgca tccttgcccg cgcgctcggc 2463121 gcattcttca ttggtgccac ggaaacccta cccaaccggg caggaccgag aagtctgggc 2463181 aaccgtgcta ctggtcaact gatgccctga ttgtgacctt cccggcgccg gatcagtgct 2463241 tctcaggacc gacgtaatat tcgaagacca atccggccgc cgaggcgagg atgaatgcca 2463301 caccggcggc gatcagccac gggagccaca acgcgatgcc gaccgctgcc accgagccgg 2463361 acaacgcgac catgatcggc caccagctat gcggactgaa gaatccaagt tctcctgcgc 2463421 cgtcgctgat ttcagcgcct tcgtagtcct cgggccggga atctaaccgg cgggccacaa 2463481 accggaagaa ggtggcgacg atcaacgcca tgccgccggt aagcgccagc gcagtggtgc 2463541 cagcccactc gacaccaccg gtggcgaaca tcgaggtcaa cacgccgtac agcaccgccg 2463601 tcaccacgaa gaacgcggcg acaaactcaa acagtcgggc ttcgatatgc atgagcgtcc 2463661 taacctacgg gctgcggggc caattcaccg cggcgagtat caaacgggtg ggtggtcacc 2463721 gcaaggggcg gctggttgat cgcccgcagg gcctcggcgt ttgtcttccc gtcgatgcgt 2463781 tgctgcaggt aggccttgaa atcgttgggg gtcacgacgc ggacctcgaa gttcatcatc 2463841 gagtgatacg tgccacacat ctcggcgcag tggcccacga atgctccggt cttggtgatt 2463901 tcttcgatct ggaagacgtt gaccgagttg tttgccaccg ggttaggcat cacgtcacgc 2463961 ttgaacaaga actccggcac ccagaatgcg tgtatcacat cggctgaggc catttggaat 2464021 tcgatacgct tgccggacgg cagcaccagc accggaattt cggtgctggt gcccaacgtc 2464081 tcgaccttgt cgaaattcag gtaggtccgg tcctcggtgt tgagcccgcg caccggcccg 2464141 accagctctt cgccgtactt gtccttgccc tctggcttgg aaaccatggc gcgcttgcgc 2464201 tccggatcgg caccatcata ggtcagtgtg ccgtctttga agttcaccct ttgatagcca 2464261 aacttccaat tccactggaa agacgtgata tcaatcacga cctcgggatc cttggctatc 2464321 tgcagcatct tctcctgcac cacgacggtg aaataaaaca gcaccgagat gatgaggaac 2464381 ggtatgacgg tgagaaccag ctctagcggc atgttgtagc cgaactggcg gggcaactca 2464441 gtgtcggtgt tcttcttccg gtgaaatacc gcggaccaga agatgagacc ccacacgatt 2464501 accccaaccg ccagggaggc gatcaccgcc ccgatccaca gttctcgatt gaggtgtgcc 2464561 tccggggtaa tgccctccgg ccaaccgatg cccagggctt ccgaccagct gcatccactg 2464621 acggtgacgg ccaatgcccc cagcattgct gcgagcgcca gctgtcgaag accacgggca 2464681 ggccctccgg agccgcgctg aggcctgcac tgcgacaagc gttgcaaacg acctggcccg 2464741 cgaggtgtca ctgttggcgc ctcctgtatc acaagctggg ccgactggga tagcaccggc 2464801 tgcggcgaga accatcggct aactcagaca tcgaatacta cgcagcgtag accacgccgc 2464861 ccgcgcgggc gacgatgcgg gccgaaacgg cccgctgagg agccgcgcca tcagccccgc 2464921 gggcgactgc ctggtcgtcg cgacccgccg gacgaggcat ccacaagagt cgccaagtgg 2464981 ggcatactgg ggcgccgtgt gtggactgct ggccttcgtc gcggccccgg ccggtgctgc 2465041 ggggcccgaa ggtgccgacg ctgccagcgc catcgcccgc gcatcgcatt tgatgcgcca 2465101 ccgcgggccc gatgaatcgg gcacctggca cgccgtcgat ggcgcctccg gaggcgtcgt 2465161 gttcgggttc aaccgactgt ccatcatcga catcgcgcac tcgcatcagc cgctgcggtg 2465221 ggggccgccg gaggctccgg accgctacgt gctggtgttc aacggcgaga tctacaacta 2465281 cttggagctg cgtgacgagc tgcgcaccca gcacggcgct gtgttcgcca ccgacggcga 2465341 cggtgaggcg atcctcgccg gctatcacca ctggggcacc gaggtgctgc agcggttgcg 2465401 cggcatgttc gcattcgcgc tgtgggacac cgtcacccgc gaattgttct gcgcgcgaga 2465461 tccgttcggc atcaagccgt tgtttatcgc caccggagcc ggcggcacgg cggtggccag 2465521 tgagaagaaa tgcctgctgg acctcgtcga gttggtgggg ttcgacaccg agatcgacca 2465581 tcgggcgttg cagcactaca ccgtcctgca gtacgtgccg gaacccgaga cactgcaccg 2465641 tggggtacgt cggctggaat caggctgctt cgcccggatc cgtgccgacc agctcgcgcc 2465701 ggtgatcacc cgttatttcg tgccgcgatt tgcggccagt ccgatcacca acgacaacga 2465761 ccaggcccgc tatgacgaga tcacggcagt gcttgaggac tcggtggcca agcatatgcg 2465821 cgccgatgtc accgtcggcg cgtttctgtc cgggggtatc gactccacgg ccatcgcggc 2465881 gctggccatc cggcacaatc cgcggctgat caccttcacc accggtttcg agcgcgaggg 2465941 cttctccgag atcgacgtcg cggtggcttc ggcagaggcc atcggtgccc gtcacatcgc 2466001 caaggtggtc agcgccgacg agttcgtcgc cgccctgccc gagatcgtct ggtacctcga 2466061 cgagccggtc gctgacccag cgctggtacc gttgttcttc gtcgcccgcg aggcccgaaa 2466121 gcacgtcaaa gtggtgttgt cgggcgaagg cgccgacgaa ctgttcggcg gctacacaat 2466181 ctatcgagaa ccgctgtcgt tgaggccgtt tgactacctg cccaagccac tgcgccggtc 2466241 gatgggaaaa gtttccaagc cactgccgga gggcatgcgc ggcaagagtc tgctgcaccg 2466301 cggatcgctg acactcgaag agcgctacta cggcaatgcc cgcagtttct ccggcgcgca 2466361 gctgcgcgaa gtactgcccg ggttccggcc ggactggacc cacacagatg tcacggcgcc 2466421 ggtctacgcc gaatcggccg gctgggatcc ggtggcgcga atgcagcaca tcgacctgtt 2466481 cacctggctg cgcggcgaca ttctggtcaa ggccgacaag ataacgatgg ccaactccct 2466541 ggagctgcgg gtgccgttcc tggacccgga ggttttcgcg gtggcctccc ggttgccggc 2466601 gggcgccaag atcacccgta ccaccaccaa gtacgcgctg cggcgcgcgc tggagcctat 2466661 tgtgcccgca cacgtgctgc accggcccaa gctcgggttc ccggtcccga tccggcattg 2466721 gctgcgtgcc ggcgagctgc tggagtgggc gtatgcgacg gtgggctcgt cgcaggccgg 2466781 tcacttggtt gacatcgccg ccgtgtatcg catgctcgac gagcaccggt gcggcagcag 2466841 cgaccacagc cgccggctgt ggaccatgct gatctttatg ctgtggcacg cgatcttcgt 2466901 cgagcacagc gtggtgcccc agatcagcga gccgcagtac cccgtccagt tgtaaccgcc 2466961 ccttcgcgag cagacgcgga atcgcatcgg cggggcccac acggtgcgat tccgcgtctg 2467021 ctcggcggtg ccgcggctag gccaagccgc ggctaggcca gcacggcgac gatctcggcg 2467081 gccgcgtgct cgccgtaagc accagccagc ctgctggccg cggcctcgta gtcccactgc 2467141 cactcctgag ttccggtcga ctccagcacc agcacggcaa ccagcgagcc cagctgcgcc 2467201 gaacgctcca ggcctagtcc ggcactgcgg ccagtcagga aaccggcgcg gaacgcgtcg 2467261 ccgacgccgg tggggtcggt ctggctggtt tcggggacca cgccgacgtg gatggtggtg 2467321 ccgtcaggtt ctaccaaatc gacaccctta ggacccaatg tggtcacccg caggtcgatc 2467381 tgcgccatca catcggcctc tgaccagccg gtcttggaca gcagcagatc ccattcgtag 2467441 tcgttggtga acaagtaagc agcaccgttg acgagcctgc gaatttcctc acccgacagc 2467501 ctcgccagct gctgagacgg atcggcggcg aaggccagcc ccagcttgcg acactcctcg 2467561 gtgtgcaaga acatcgcctc ggggtcgttg gcgccgatga tcaccaactc cggcttgccg 2467621 atggccgaca ccacgtcggc aagcttgatg ttacgtgcct ccgacatagc cccggggtag 2467681 aacgatgcga tctgggccat gtcgacatcg gtggtacagg taaaccgcgc cgtgtgcgcg 2467741 gtctcggaga tcagaacgtg gtcgcagttg acaccgcggg ctttcagcca gtcgcgataa 2467801 tcggcgaagt cggcgcctgc cgccccaact agcgcgacct cgccacctag cacaccgatg 2467861 gcgaaggcca tgtttccggc cacgccgccg cggtgcatca ccaagtcatc gactaggaag 2467921 ctaagcgaca ccttgtgcag gtgttcgggc agtagctgct cggaaaatcg gcctggaaac 2467981 cgcatcaaat ggtcggtcgc aatcgaaccg gttaccgcga tcgtcacaaa atctccgtcc 2468041 ttcgttccta aggttgccta gtctttcaac attatcggcg ccgcggcccg ccccgtcgcg 2468101 ttgagagctg acggcagctg ttgcgctagc ctgcctaggg agctcacctg attgccgatg 2468161 ctgccggctg acgcgacggg cggttgtcgc cctagcagct ggtcccgtcc accaccctag 2468221 gagaaccaca atgcccggtc cccactcgcc gaaccccggt gtcggcacca acggaccggc 2468281 gccgtacccc gagccctcat cccacgaacc ccaagccctg gactaccccc acgacctcgg 2468341 cgccgccgaa ccggccttcg ccccgggacc ggcagacgac gcggcgctgc cgcccgccgc 2468401 atatcccggc gtgccgccgc aggtgtccta cccgaagcga cggcacaagc ggctgctgat 2468461 cggcattgtg gtagccctcg cgctggtgtc ggctatgacg gcggcgatca tatacggggt 2468521 ccgcaccaac ggagccaaca cggcaggcac attctcggag ggaccggcca aaaccgcgat 2468581 tcagggatac ctcaacgcgc tggagaaccg cgatgtggac accatcgttc gcaatgcgct 2468641 gtgcggtatc cacgacggcg tgcgcgacaa gcgctccgat caggccttgg ccaagctgag 2468701 cagcgacgcg ttccgcaagc agttctccca ggtcgaagtg acctcgatcg acaaaatcgt 2468761 gtactggtcg caatatcagg cccaggtgct gttcaccatg caggtgacac ctgccgccgg 2468821 cggcccgcca cgcggtcagg tgcaaggcat cgctcagttg cttttccagc gcggtcaggt 2468881 cttggtgtgc tcgtacgtgt tgcgcaccgc ggggtcgtac tagcgtttta tcagttgaac 2468941 gaatccccgc acgcgcagga gccggtggcg ttgggattgt cgatggtgaa gccttgcttc 2469001 tcaatagtgt cgacgaaatc gatcgacgcg ccttccacat acggcgcgct catccggtcc 2469061 acgatcaacc tgacaccacc gaactccgcg gtttggtcac catccagcgt ccggtcgtcg 2469121 aagaaaaggt tatagcgcaa tccagcgcac ccccccggct gaaccgcgat ccgcagcgcc 2469181 agatcgtccc gtccctcctg gtccaacagc gacttcgcct tggcggcggc cgcttcggtc 2469241 aggatcacgc cgtgggtctt ggcgctcggc tcgttctgca ccgtcatgac ttctcctaga 2469301 tgtctcatcg ttgggtgggc cccgcccact agcgtttcag cctgcggaat ccagtctggg 2469361 gtctgcttgg ggaaaatccc acttcctcaa cggtaccctg aaggaccgct attcccgagt 2469421 cgcgccgcta cctgagacgc caagcccatg agctgattgg ccgcatcggc cagcgccaac 2469481 cgcaccgaac cggcgtactc agcgatggac aatgcggcca taatgcccgc cgaccgcaac 2469541 gcggacttgt ccagcgacac ctggccggcc aacactatca ccggaattgc gagcgggcgg 2469601 gccgcagccg cgatcgcacc aaccaccttc ccgtgcaggg attgctcgtc gaatcggccc 2469661 tcaccggtga cgatcagctc cgcatcggca aggtcgtcgg caaaatgcgt gtgctctgcg 2469721 atgattgccg cacccgactg gtaccggccg ccaaccgcga gcagcccagc cccgatacca 2469781 ccggcggcgc ccgcgcccgg ctcggcgctc accccgcgcc cggcggccgc gtccagttca 2469841 atcgcccatg ccgccagacg gccttccaac actgcgacgg tggccatgtc cgcgcccttc 2469901 tgcggcgcga acaccctggc cgtgccccat ggtcccagca atgggtattc gacatccgag 2469961 gcggcgatca cctcgacgtc ggccaactgt cggcgggccg cgtccaggcc gccaagctcg 2470021 gcaatcatcc ccttcccccc gtcggtacat gcgctgcccc ccaaccccac cacgatccga 2470081 gccgccccgg cccgcagtgc cgcggcgatg agctggccga cgcccttgct gtgggccgcc 2470141 agcgcggttt cgggcgtggg cgggccgcca agcaacccca gaccacaagc ctgcgcacac 2470201 tccaaatacg cggttgccga gcccggatcg aacacccacg ccgcgttcac gacggtgttc 2470261 agtggcccgc aaacacgcag ccggcgggtc tctcctagcc ggctgcccag cacctcaaca 2470321 aaacccggac cgccatcgga ttggggggcg acgatgaacg aatcgcctgg tcgcgaccgc 2470381 gtccagccgg tcgcaatggc cgcggcggcc tccaccgcag acaggctgtc gccgtagcag 2470441 tccggtgcca ccaacacccg catggcgggc agctggagtc ggccgggccc caagctaccg 2470501 gtcgcgtcat ccgaggcctg cgagcctttc atcactggcc agagtaggtc tgcgcaccca 2470561 cacgcgtacc taaacgcacg caaattccaa acgggccccg ccgcgaagta gcctggcgac 2470621 tgtgaagctg ctgggccacc ggaagagcca tggacaccaa agggccgacg catcacccga 2470681 tgccgggtcg aaagatggtt gccggcctga ttccggacgc acgtccgggt cggacacatc 2470741 gcgcgggtcg caaaccaccg gccccaaggg ccggcccacg cccaagcgca accaatcccg 2470801 tcgccacacc aagaagggcc cggtcgcacc ggcaccaatg actgcggccc aggcacgggc 2470861 ccggcgcaag tcgcttgccg gccccaaact tagccgcgag gaacggagag ccgaaaaggc 2470921 cgcaaaccgg gcccggatga cggaacgccg ggaacgcatg atggccggcg aagaggccta 2470981 cctgctcccg cgcgaccggg gcccggtacg ccgctacgtg cgcgatgtgg tggactcccg 2471041 gcgcaacctg ctcgggctgt tcatgccctc ggcgttgacc ctgctgttcg tcatgtttgc 2471101 cgtgccgcag gtgcagtttt acttgtctcc ggcgatgttg atactgctgg ccttgatgac 2471161 gatcgacgcg atcatcttgg gtcgcaaagt tggccggctg gttgacacga agttcccgtc 2471221 taacaccgaa agccggtgga ggctgggtct ttacgccgcc ggccgagctt cccagatacg 2471281 ccggttgcgg gcgccccgac cccaagtcga gcgcggcggc gatgttggct aacggacgcc 2471341 ggaagtcatc tcacccggtg tacaccctag tgctcagcgg gcggaccgaa ccgatcaagc 2471401 cggcgaaagg atgatcggct tcgcgccggt gtcgacgccc gatgcggctg ccgaagcagc 2471461 cgcccgcgcc cgacaagaca gcttgaccaa gccgcgggga gcgctgggca gtctcgagga 2471521 cctgtctgtc tgggtcgcgt cgtgccagca gcgctgtccg ccgcggcaat tcgagcgcgc 2471581 ccgggtggtg gtgttcgccg gtgaccatgg tgtggcccgg tccggggtgt cggcgtaccc 2471641 gccggaagtc accgcccaga tggtcgccaa catcgacgct ggcggggcgg cgatcaacgc 2471701 gctggccgat gtcgcgggcg cgaccgtgcg ggtcgcggac ctggccgtgg acgcggaccc 2471761 gctgtctgag cgcatcggcg cgcacaaggt gcgccgcggc agcggcaata tcgccaccga 2471821 ggacgcgttg accaacgacg agaccgccgc cgcgatcaca gccggccagc agatcgccga 2471881 cgaagaggtt gatgccggcg ccgacttgct catagccggc gatatgggaa tcggaaacac 2471941 taccgcggcc gcggttcttg tggcggcgct gaccgatgcc gagccggtcg cggtggtcgg 2472001 gttcgggacc ggtatcgacg acgccggttg ggcgcgtaag acggccgcgg tgcgcgacgc 2472061 cctgtttcgg gtgcgcccag tgttgcccga cccggtcggg ttgctgcgct gcgccggcgg 2472121 cgctgacttg gccgcgatag ctggcttctg cgcgcaggcc gcggtccgac gcaccccgct 2472181 gctgcttgac ggggtggcgg tgacagccgc cgccctggtc gctgagcgtc ttgcgcccgg 2472241 cgctcaccgg tggtggcagg cgggtcatcg atccagcgaa ccgggccacg ggctggcgct 2472301 ggcagccctc gggctggacc cgatcgtgga ccttcacatg cggctgggcg agggaaccgg 2472361 cgccgcggtg gcgttgatgg tgttgcgcgc cgcggtcgcg gcgctgtcgt cgatggcgac 2472421 cttcaccgag gccggcgtgt ccacccggtc cgtcgacggt gtcgaccgga ccgcaccccc 2472481 ggcagtctca ccgtgatgcg ttcgctggca acagctttcg cattcgcaac ggtgataccc 2472541 acaccgggct cagcgaccac cccgatgggc cgtggcccga tgaccgcgct gccggtggtg 2472601 ggcgcggcgc tgggtgcact ggcggcggcg atcgcatggg ctggcgcgca agtgttcggc 2472661 ccgtccagcc cgctgtccgg catgctcacg gtggcggtac tgctggtcgt cactcgaggc 2472721 ctgcacatcg atggcgttgc cgataccgct gacggactgg gctgctatgg gccgccgcag 2472781 cgtgcgcttg cggtgatgcg cgacgggtcg accggaccgt tcggggtggc ggccgtggtc 2472841 ttggtcatcg ccttgcaggg cctggccttc gcgaccctca ccacggtcgg gatcgctggg 2472901 atcacgctgg cggtcttatc cggccgggtc accgccgtac tggtctgtcg ccggttggtg 2472961 ccggcagccc acggcagcac cctgggctcg cgggtcgccg gtacgcaacc cgcgccggtg 2473021 gtggcggcct ggctcgccgt cctgctcgcc gtttcggtgc cggccggtcc ccggccttgg 2473081 caaggaccga tagcggttct ggtagcggtg acggccggcg cggccctggc ggcgcattgc 2473141 gtgcaccggt tcggcggtgt caccggtgac gtgctgggca gcgcgatcga gctgagcacg 2473201 acggtcagcg ccgtgacgct tgcgggcttg gcccggcttt agcaggcggc gagcgggacg 2473261 ctgcagtaga ctcatgtccg ccgtcccttc caacacaggg ctcccctccg tgtccccaga 2473321 ttaggggaca tgaaattcaa ccgacggtgt ccgattggcg gatcgttttg gccgcgcggc 2473381 atatatagcg tcgttaatca tgcccgcatc acgactggtc agacaagtgt ctgcgccacg 2473441 gaacctgttc gggcggctgg ttgcccaggg gggcttctac acggccgggc tgcagttggg 2473501 cagcggtgcg gtggtactgc cggtcatctg cgcacatcag ggcctcacct gggcggctgg 2473561 gctgttgtat ccggcgttct gcattggcgc cattctggga aattcgctgt cgccgctgat 2473621 tctgcagcgc gccggccagc tccggcacct gctgatggcg gcgatatcgg cgacggcggc 2473681 ggcgctggtt gtgtgcaacg ctgcggtccc ctggactggc gttggcgtcg ccgcggtttt 2473741 tttggcgacc acgggggccg gtggtgtcgt caccggagtc tccagcgtcg cctacaccga 2473801 catgatctcc agcatgttgc ccgcggtacg gcggggcgag ctactgctca cccaaggtgc 2473861 cgcggggtcg gtgctggcca ccggcgtcac attggtgatt gtgccgatgc tggcccatgg 2473921 caacgagatg gcgcgctatc acgatctgct gtggctgggc gccgcaggtc tggtttgctc 2473981 cggcatcgcg gcgctgttcg tcggcccgat gcggtctgtg tccgtcacaa ccgccacccg 2474041 aatgccactg cgggaaatct attggatggg cttcgcgatc gcccgctccc agccgtggtt 2474101 tcgccggtat atgacgactt acctgctgtt cgttccgatc agcctgggca ccacgttctt 2474161 cagcctgcgc gccgcccagt ccaacggcag tctgcacgtg ctggtgatcc tttccagcat 2474221 tggattggtc gtcggttcga tgctgtggcg acagataaac cgcctgttcg gggtgcgtgg 2474281 cctgctgctg ggcagcgcac tgctcaacgc cgctgctgcg ctgctgtgca tggtggccga 2474341 gtcgtgtggg cagtgggttc acgcctgggc gtacggcacg gcgttcctgc tggctacggt 2474401 ggccgctcaa acggtggtcg ccgcatcgat atcgtggatc agcgtcctcg cgcccgagcg 2474461 gtaccgcgcc accctgatct gcgttgggtc gaccttggcc gccgtcgaag ccaccgtgct 2474521 gggagttgcg ctcggcggaa ttgcccaaaa gcatgccacc atctggccgg ttgtcgtcgt 2474581 gctgacactg gccgtaatcg ccgcggtggc gagtctgcgc gcaccgacac gaatcggggt 2474641 gacggcggac acgagcccgc aagcagcgac cttgcaagcc taccgcccgg ccactcctaa 2474701 ccccatccat agcgatgaac gttcgacgcc gcccgaccat ctctcagtcc gccgcgggca 2474761 gttacgacac gtatgggaca gtcgccggcc cgcgccaccc ctgaaccggc caagctgtcg 2474821 ccgcgcggcc cgccgtccag cgcccggcaa acccgctgcc gcactacccc agccgcgcca 2474881 tccagccgtg ggtgtccgcg aaggtgcccc gctggatgcc ggtcagcgta tcgcgtagtg 2474941 ccatggtcac ctcacccggc tgaccgtcgg cgattctgaa ctcgctggca ccgtgccgca 2475001 cccgcgcgac cggggtgatg acagcggcgg tgccgcacgc aaacacctcg gtgatctcgc 2475061 cggcggcggc tttcttctgc cactcgtcga tatcaatcct gcgttcctcg accgcgaatc 2475121 cggcatcaat agccaactgc aacaacgaat cccgtgtgat cccgggcagc agggaaccgg 2475181 acagctccgg ggtgaccagc cgcgccgatc cgccgctgcc gagcacgaag aagatgttca 2475241 tgccacccat ctcttcgata tagcggcgtt ccacagcgtc cagccacacc acctggtcgc 2475301 atccgttctc ggcggcttcg gcctgcgcca gcaacgaggc ggcgtagttg ccgccgaact 2475361 tggccgcacc ggtgccgccc ggacaggccc gtacatactc cgtcgaaacc cagacgctga 2475421 caggggcgat gccgcccttg aagtacgcac cggccggcga ggcgatcaac aggtaacggt 2475481 attgggtggc aggccgcacg cccagtcccg gctcggtggc gaagatgaac ggccgcagat 2475541 acagcgcctc ctcaccgccg gcaccgggca cccaagcttt gtcgacagcg attagctggc 2475601 gcagggattc gatgaacacc gcgtcgggca gttcgggaat cgccaaccgc cgcgccgacg 2475661 aacgcaacct ggcggcgttg gcgtcggcgc gaaacgacac gatggacccg tcggcccagc 2475721 ggtaggcttt gagcccttcg aacacctcct gcgcatagtg cagcacgatc gccgagggat 2475781 ccagctcgat cgggccataa gggattaccc gcgcgttgtg ccaaccacgg ccctcggcat 2475841 agtcgatcga caccatatgg tcggtgtggt atttgccgaa acccggctcc cgcagcatcg 2475901 attcacgctg cgcgtcggtg gccggattga ccgcacgtaa caccgtgaat tgaagggagc 2475961 cgctggtcat gggccgattc tatccgtggg cgaacggtta ttgacggccc ggaggccact 2476021 ccgctgccac caagtggtga ctcagcgcgt tttcacggca acgaacggcg gacacaccac 2476081 ttgacattcg acagcacggc cgcggacgtc gacattgatt tgctggccgt cttcgatgcc 2476141 ggcatcactg tcgatcagcg ccagcccgat gccgacctgc aacgtgggag aaaacgttcc 2476201 cgacgtggtg accccaaccg tctcatcccc gacaagcaca gccagcccgg ggcgcagcac 2476261 accgcgaccg accatgcgca gcccccgcag cagccgccgc ggcccggccg ctttctcggc 2476321 caacaacgcc gcacgaccaa agaaggcgtc cttccgccag ccgaccgccc agccgcatcg 2476381 ggcctgcagc ggcgagatgt ccagcgaaag ctcgtgcccg tgcagcggat agcccatttc 2476441 agtgcgcagt gtgtcgcgag caccgaggcc ggcgggctcg ccgcccgcgg ctgataccgc 2476501 cgccaacagt gcgtcgaaca ccacacccgc cgactcccat ggcggcagca gttcgtaacc 2476561 gtgctcaccg gtgtagccgg tgcgacagac acgcaccggc acccccgagt acgaagcgtc 2476621 ggcgtagccc atgtagtcca tctcggttgg cagccccaac gcggtgagca cgtcggtcga 2476681 acacggcccc tgtacggcca gcaccgcgta ggaccgatgc agattggtga tgctcagacc 2476741 gcccggtgcg gcagcttgta gcgcgccgac caccgcggcg gtattggcgg cgttgggcac 2476801 cagaaagatc tcgtcgtcgc tgacgtagta ggcgatcagg tcgtcgatca caccgccgga 2476861 ttcggtgcag cacaaggtgt attgcgcctt gccgggcccg atacgaccca ggtcgttggt 2476921 gagcgcggag ttgacgaact gcgccgcacc cggtccacgg accagtgcct tgcccaggtg 2476981 gctgacgtcg aaaaggccga cggcggtgcg ggtggcgttg tgctcgctga cggttccggc 2477041 atacgagacc ggcatcagcc agccgccgaa ctcggcgaaa ctcgcaccca gctcgcgatg 2477101 gcggtcttcc agcggtccgt gtatcagctc tggcacatcg ctcacggcgt cccaccctaa 2477161 tgggcgtccc tgctggcaca cttaggcagg tgtacgattc cttggacttc gacgccctcg 2477221 aggccgccgg aattgccaac ccacgcgagc gggccggctt gctcacctac ctggatgagc 2477281 ttggcttcac ggtcgaagag atggtgcaag ccgaacgccg cggccggttg ttcgggctgg 2477341 ccggtgacgt cctgctatgg tccgggcccc cgatctacac cctggcgacc gcggctgacg 2477401 aactggggtt gtcagccgac gacgtcgcac gcgcgtggag tttgctcggc ctcaccgtcg 2477461 cgggtcccga cgttcccacg ctgagccagg ccgacgtcga cgccctggcg acctgggtcg 2477521 cactgaaggc gctggtgggt gaggacggcg cattcggcct gctgcgagtg ctcggcactg 2477581 ccatggcccg actcgccgag gccgagtcga ccatgatccg cgccgggtca ccgaacatcc 2477641 aaatgacgca cacccacgac gaacttgcca cggcacgggc ctatcgcgcg gctgcggagt 2477701 tcgtcccccg gatcggtgcg ctgatcgaca ccgtccaccg tcaccacctg gccagcgcac 2477761 gaacctactt tgaaggcgtc attggcgaca cgtcggcaag cgtgacgtgc ggtatcggct 2477821 ttgcggatct gtccagcttc accgcgttga cccaggcgct cacccccgcg cagttgcagg 2477881 acctgctcac cgaattcgac gccgccgtca ccgacgtggt gcatgccgac ggtggccggt 2477941 tggtgaagtt catcggcgac gccgtgatgt gggtgagctc gtcgcccgaa cgactggtgc 2478001 gggcggcggt ggatctcgtc gatcatccgg gtgcgcgcgc ggccgaactg caggtccgtg 2478061 ccggtcttgc ctatggcacg gtgctggccc ttaacggtga ctacttcggc aacccggtca 2478121 acctggctgc gcgcctggtg gcggccgcag cgccagggca gatcctggcc gcagcgcaac 2478181 tccgcgacat gttgccagac tggcctgccc tcgcccatgg cccattgacg ctcaaggggt 2478241 ttgacgcccc ggtgatggcc ttcgaactgc acgacaaccc tcgtgcgagg gatgctgaca 2478301 cgccaagccc cgccgccagt gattagggtg gttgcccgtg accaccgaac cgggttacct 2478361 atccccctcc gtcgccgtcg cgacctcgat gccgaaacgt ggtgtcggcg ctgcggtgtt 2478421 gatcgtgccg gtcgtctcga ccggcgaaga ggatcggccc ggcgcggtcg ttgcctcggc 2478481 cgagcccttc ctgcgcgccg acacggttgc cgaaatcgag gcgggcctgc gagcgctgga 2478541 cgccaccggc gccagtgacc aggtgcaccg gctggcggtg ccgtcgttgc cggtgggcag 2478601 cgtcctgacg gtcggcctgg gcaaaccgcg gcgcgaatgg ccggccgata ccatccgctg 2478661 cgccgccggc gtggccgcgc gtgcgctcaa cagttcggag gcagtgatca ccacgctagc 2478721 cgaattacct ggcgacggca tctgctcggc caccgtcgag gggctgatcc tgggcagcta 2478781 ccgattcagc gccttccgca gcgacaagac cgcgcccaaa gacgccggac tccgcaaaat 2478841 caccgtgctc tgctgtgcaa aggacgccaa gaagcgcgcg ttgcacggtg cggccgtcgc 2478901 gaccgcggtg gccaccgccc gggacttggt caacactccc ccaagccacc tgtttcccgc 2478961 cgagttcgct aagcgcgcaa agactttgag cgaatctgtc ggcctcgacg tggaagttat 2479021 cgacgaaaag gcgctgaaga aggccggcta tggcggggtg attggtgtcg gccagggctc 2479081 gtcgcggccg ccgcgactgg tgcggttgat tcatcgggga tcgcggctgg ccaagaaccc 2479141 ccaaaaggcc aagaaggtgg ccttggttgg caaggggatc accttcgata ccggcggcat 2479201 ctcgatcaag ccggcagcgt cgatgcacca catgacctcg gacatgggcg gagcggccgc 2479261 ggtgatcgcc actgtcacgc tggctgcccg gctgcgactg ccgattgacg tgatcgccac 2479321 ggtgccgatg gccgagaaca tgccgtcggc gacggcgcag cgcccgggcg acgtgctgac 2479381 ccaatacggt gggaccaccg tcgaggtgct caacaccgac gcggagggcc ggttgatcct 2479441 ggccgacgcc atcgtccggg catgtgagga caagccggac tatctgatcg agacatccac 2479501 gttgaccggt gcgcaaacgg tggcgctggg gacgcgcata ccgggtgtga tgggcagcga 2479561 cgagttccgc gaccgggtcg ccgcgatctc gcagcgggtg ggcgagaacg gctggccgat 2479621 gccgctgccc gatgacctca aggatgactt gaaatccacg gtggccgacc tggccaatgt 2479681 gagtggccag cgtttcgcag gcatgctggt ggccggggtt ttcctgcgtg agttcgtcgc 2479741 cgaatcggtg gattgggcgc acatcgacgt ggccggcccg gcctacaaca ccggcagcgc 2479801 ctggggttac acgcccaagg gcgccaccgg tgtgcccacc cgcaccatgt tcgcggtgct 2479861 cgaggacatc gcgaagaacg ggtaggcggc cgcccggacc caaagcactt cacgagtagc 2479921 ggttagatca cccgcagccg cgcggtactg cgcagcgcct gcggcagcac ccgggagatg 2479981 ccgtatagcg cataggcttc cggcgcgacc ggtctgatcg gcttcttctt cttgaccgcg 2480041 gacacgatcg cgtcggctac cttgtccggc ccgtagctgc gcagcgcaaa catcttgtcg 2480101 atctgccccc gccggccgtc gatcttctcc tcgtcggttc cgggcgcgtg gaaaccggtg 2480161 gtagcgacga tgttggtgtc aatgacaccg gggcagatgg tggtcagtcc gacaccggcg 2480221 gcatcgagtt cggcccgcaa acagtcggag aacatgtagg tcgccgcttt ggaggtgcag 2480281 tacgcgctga gcgactgcag cggcgcatag gcggccatcg acgacacgtt gacgatgtgc 2480341 ccgccagtcc cccgctcgac cagacgctgc ccaaaagcgc ggcaaccgtt caccacgccg 2480401 cccaggttga cggccagcac ccggtcgaac tgctcagccg gggtgtccag gaaccgaccc 2480461 gcctggccga tgccggcgtt gttgacgaca atgtcgggga ccccgtgttc ggcgctgacc 2480521 cgctcggcga atgcctcgac cgcctcggcg tcggacacgt cgagcacata ggggtacgcg 2480581 atgccaccac gtgcggcgat ctcggcggcg gtgtccttga cggtggcctc gtcgatgtcg 2480641 ctgataacga tctctgcacc ctcacgagca aaggcgagcg cggtctcgcg gccgattccg 2480701 ctgcccgccc cggtaaccga caccagcgtg tcaccgaagt acccgcgggg ccgtccgacc 2480761 tgggcgcgta acagcgcgcg gctcggctgc ttgccgtcgg ccaggtcggc gaagtcgtgc 2480821 acggcggccg ccatcacctg cgggtgcgac atcggcgaaa agtgaccagc tttgatgtca 2480881 cgccgccaga gccgcggcac ccagcgcgcc gtctggtcgt atccgtaggg ccgcacgtag 2480941 gggtcctggg aattgacgat cagctgcacc ggcacatcaa ctatcggaat ggcccggccg 2481001 cggcggctgc tggaaaacga ccgaaagtag tttgcggggt aagtcttgac cgagtgggcg 2481061 gcatcacggg ccagcgtctc cgagtgatga atctggtcga cgggaatgtc gccgaccatg 2481121 ttgcggcgga cggccgcact cgacagcgca acccgaagca gcagcggtgc gaccaccggt 2481181 accgagaaca aggccatgta gctcaaccgc agtgtctggc tgatcgcccg tagaaaggtt 2481241 cgcggacgcc aaggccgccg cagaccgcca taaacgtagt tgaccaggtg gtcttgactg 2481301 gggccggaca ccgacgtgaa cgaggcgacc cgatcactgg ctccgggccg gcgcaggtac 2481361 tcccacaccc ccaccgaacc ccagtcatgg gccagcacgt gcaccggctc accggggctc 2481421 agctcgccga tgacggcgtc gaaatcgtcg gcgaaatggg ccatggtgta ggccgaaatg 2481481 ggtttgggca ccgatgagcg accgacacca cggttgtcgt agcgaacgat ccggaaccgt 2481541 tcggccagca gcggaacgac accgtcccac agcacgtgcg agtccggaaa gccatgcacc 2481601 agcacgacgg tcgggccgtc gggattgcct tcgtggtaga ccgcgatgcg aacgccatcc 2481661 gggctgtcga ccagacggga catctgttgt gttgccggca tcgcacctcc gcccaccggg 2481721 acttgctgtt gcaaccagtc gcccaaaccg tagcaaggac ggccgactgc accgatgtcc 2481781 ccgccgaggt gtcggcaacg gccgccgggg ccaccaactc gccgcgccct ggatgtgtgt 2481841 cgctccgggc gcagtgacag gataggtttc gacatccacc tgggttccgc acccggtgcg 2481901 cgaccgtgtg ataggccaga ggtggacctg cgccgaccga cgatcgatcg aggagtcaac 2481961 agaaatggcc ttctccgtcc agatgccggc actcggtgag agcgtcaccg aggggacggt 2482021 tacccgctgg ctcaaacagg aaggcgacac ggtcgaactc gacgagcccc tcgtggaggt 2482081 gtcgaccgac aaggtcgaca ccgaaatccc ctcgccggcc gcgggtgtgc tgaccaagat 2482141 catcgcccag gaggatgaca cggtcgaggt cggcggcgag ctcgctgtca ttggcgacgc 2482201 caaggatgcc ggcgaggccg cggccccggc acccgagaaa gtccctgcgg cccaacccga 2482261 gtccaagccg gcacccgaac caccaccggt ccaaccgacg tccggagcgc ctgctggtgg 2482321 cgatgccaag ccggtgctga tgcccgagct cggcgaatcg gtgaccgagg ggaccgtcat 2482381 tcgttggctg aagaagatcg gggattcggt tcaggttgac gagccactcg tggaggtgtc 2482441 caccgacaag gtggacaccg agatcccgtc cccggtggct ggggtcttgg tcagtatcag 2482501 cgccgacgag gacgccacgg tgcccgtcgg cggcgagttg gcccggatcg gtgtcgctgc 2482561 cgacatcggc gccgcgcccg cccccaagcc cgcacccaag cccgtccccg agccagcgcc 2482621 gacgccgaag gccgaacccg caccatcgcc gccggcggcc cagccagccg gtgcggccga 2482681 gggcgcaccg tacgtgacgc cgctggtgcg aaagctggcg tcggaaaaca acatcgacct 2482741 cgccggggtg accggcaccg gagtgggtgg tcgcatccgc aaacaggatg tgctggccgc 2482801 ggctgagcaa aagaagcggg cgaaagcacc ggcgccggcc gcccaggccg ccgccgcgcc 2482861 ggccccgaaa gcgccgcctg cccctgcgcc ggcgttggca catctacggg gcaccaccca 2482921 gaaggccagc cggattcgtc agatcaccgc caacaagacc cgcgaatctt tgcaggcaac 2482981 ggcacagctg acacaaaccc atgaggtcga catgaccaag atcgtggggc tacgggcccg 2483041 ggccaaggcg gcgttcgccg agcgtgaggg cgtgaacctg accttcctgc cgttcttcgc 2483101 caaggccgtg atcgatgccc tcaagattca cccgaacatc aacgctagct acaacgagga 2483161 caccaaggag atcacctact acgacgccga gcacctagga ttcgctgtcg acaccgagca 2483221 gggcctgctc tccccggtca tccacgacgc cggcgatctg tcactggccg gtctggcgcg 2483281 ggcgatcgcc gatatcgcgg cccgtgcccg gtcgggcaac ctgaaacccg acgagttgtc 2483341 cggcggcacc ttcaccatca ccaacatcgg tagccagggc gcgttgttcg acaccccgat 2483401 cctggttccg ccgcaggccg ccatgctggg caccggggcg atcgtcaagc ggccgcgggt 2483461 ggtcgtcgat gccagcggca acgagtcgat cggggtgcgc tcggtctgct acctcccgtt 2483521 gacctatgac catcggctca tcgacggcgc cgacgccgga cgtttcctca ccacgatcaa 2483581 gcaccgcctc gaagagggag cgttcgaggc cgatttagga ctgtgatggc caacgccgtt 2483641 gtcgcgatcg cgggttcgtc tggcttgatc ggctctgccc tgaccgcggc gctgcgcgcg 2483701 gccgaccaca cggtgctgcg gatcgtgcgc cgggcacctg cgaattccga agaactgcac 2483761 tggaatcccg aaagcggcga attcgatccg cacgcgctca ccgatgtcga cgccgtggtc 2483821 aacctctgcg gcgtcaacat cgcccagcgt cggtggtcgg gggctttcaa acagagcctg 2483881 cgcgacagcc ggatcacacc caccgaggtg ctatccgccg cagtcgccga cgccggcgtc 2483941 gctaccttga tcaacgccag cgcggtgggc tactacggaa acaccaagga ccgggtggtc 2484001 gacgaaaacg actcggcggg aacaggtttt ctggcccagc tgtgcgttga ctgggaaacc 2484061 gccacgcggc cggcgcagca gagcggtgcc cgcgtggtgc tggcccggac cggagtggtg 2484121 ctgtctccgg cggggggcat gctgcgacgc atgcggccac tgttttcggt gggcctgggc 2484181 gcgcggctgg gcagcggccg gcaatatatg tcatggatca gcctggagga cgaggtgcgg 2484241 gcgctgcagt tcgctatcgc gcagcccaac ctgtccggcc cggtgaactt gaccgggccg 2484301 gcccccgtta ccaacgccga attcaccacc gcgtttggcc gcgccgtcaa ccgccctacc 2484361 ccgctgatgt tgcctagcgt cgcggtacgc gcggcgtttg gtgagttcgc cgacgagggg 2484421 ttgctcattg gtcagcgcgc catcccctcc gcgctggagc gagccggatt tcagttccac 2484481 cacaacacca ttggcgaggc gctcggctac gccaccaccc ggcccggcta ggcttgaccc 2484541 cgtctgccca gccgtgcgct ggcggccgag tagcctagct atcgtgacgg gttctatccg 2484601 gtcgaagctg tccgcgatcg acgtccgcca gctggggacc gtcgactacc ggaccgcgtg 2484661 gcagctacag cgagagctag ccgacgcccg ggtcgccggc ggcgccgaca cgctgctgct 2484721 gttggaacac cccgcggtct acaccgccgg acggcgtacc gagacacacg agcgacccat 2484781 tgacggcact ccggtcgtcg acaccgaccg cggcggcaag atcacctggc acggtccggg 2484841 gcaattggtc ggctacccga tcatcgggct ggccgaaccc ctcgacgtgg tcaattacgt 2484901 tcggcgcctt gaagaatcgc tgatccaagt ctgcgccgat ctgggcctgc acgccggccg 2484961 cgtcgacggc cggtccgggg tctggctgcc cggcaggccg gcgcgcaagg tcgcggccat 2485021 cggtgtccgg gtgtcgcggg cgacgacact gcacgggttt gcgctcaact gcgattgtga 2485081 tttggctgcc ttcaccgcca tcgtgccatg cggaatcagt gacgccgcag tgacatcgct 2485141 gtccgccgaa ctcggccgta cggtcaccgt cgacgaggtc cgcgcgacgg tcgccgccgc 2485201 tgtctgcgcc gctctggacg gcgtcctacc ggtcggtgac cgcgtgccct cacacgccgt 2485261 accatcgccg ttatgagtgt cgctgccgag ggccggcgcc tgttacgcct ggaggtgcgc 2485321 aacgcgcaga ccccaatcga gcgcaaaccg ccgtggatca agacacgagc ccgcatcggg 2485381 ccggagtaca ccgagctgaa gaacctggtc cgccgcgagg ggctgcacac ggtctgcgag 2485441 gaggccggct gccccaacat cttcgaatgc tgggaggacc gagaagccac cttcctgatc 2485501 ggcggtgacc agtgcacccg ccgatgcgat ttctgccaga tcgacaccgg aaagcccgcc 2485561 gagctggacc gcgacgagcc acgccgagtc gccgacagcg tgcgcacgat gggcctgcgc 2485621 tatgccaccg tcaccggcgt ggctcgcgac gacctgcctg acggcggggc ctggctgtac 2485681 gccgcgaccg tgcgcgccat caaggaactc aatccgtcga ccggcgtcga actgctgatt 2485741 cccgacttca acggcgaacc aacccggctg gccgaggtct tcgagtccgg cccggaagtc 2485801 ctggcacaca atgtcgaaac cgtgccccgt atcttcaagc ggatccggcc ggcgttcacg 2485861 taccggcgca gcctgggtgt gcttaccgct gcgcgcgacg ccggcctggt caccaagagc 2485921 aacctcatcc tcggcctggg cgaaacctcc gacgaggtgc gcaccgccct gggcgatctg 2485981 cgcgacgccg gctgcgacat cgttaccatc acccaatacc tgcggccgtc ggcgcgccac 2486041 catccggtcg agcgctgggt gaagcccgag gagttcgtcc agttcgcgcg attcgccgaa 2486101 gggctgggct tcgccggggt attggcggga cccctggtta ggtcgtcata tcgggcgggc 2486161 cggctctacg aacaggcacg taactcacgg gccttggcat cccgctagcc agcgtttacg 2486221 tattctggac gattatggcg aaaccccgaa atgccgctga aagcaaggcc gccaaagctc 2486281 aggcaaacgc tgctcgtaag gctgccgccc gccagcgccg cgctcagctg tggcaagcgt 2486341 tcaccctgca gcgcaaggag gataagcgcc tgctgccgta catgattggt gctttcttgc 2486401 tgatcgtggg cgcatcggtg ggggtcgggg tgtgggctgg cgggttcacc atgttcacga 2486461 tgatcccgct gggggtgctg ctgggtgcac tggtggcgtt cgtcatcttc ggccggcgag 2486521 cccagcgaac ggtttaccgc aaagccgaag gccaaaccgg cgcagccgcc tgggcgctgg 2486581 acaacctgcg gggcaagtgg cgggtgacgc ccggggtggc cgccaccggc aacctcgacg 2486641 ccgtgcaccg ggtgatcggc cggcccggtg tcatcttcgt cggcgaggga tcagcggccc 2486701 gcgtcaaacc actgctggct caggagaaaa agcgcaccgc gcgactggtc ggggacgtgc 2486761 cgatctacga cattatcgtc ggcaacggcg atggcgaggt tccgctggcc aagttggagc 2486821 gccacctcac ccgccttccg gccaacatca cggtcaagca gatggacacg gtggagtcgc 2486881 gactggcggc gctgggttcg cgtgccggtg cgggcgtcat gcccaaggga ccgctaccca 2486941 ccacggccaa gatgcgcagc gtccagcgca cggtccgccg taagtaacgc ggctcagcgt 2487001 cgcaccaccg ccgtagcagt gagccgatcg tgcagcccac gcccgtccga gtcggtgaac 2487061 agcggcggaa ccaccagccc gatcagcagg ccacgcacca ccagacggcc gatccccacc 2487121 ggccgccggc cacccactgc caccacgacc agacccagca tcaactgccc gggtgtgaat 2487181 ccgaacaagc ggaccgccgc caccccgagc agcagccaaa tcaccaggac aaccgtcgac 2487241 agcatcgggg tcgaccaaac accgaattcc acgcccagca acgccagacc gtaggcgatc 2487301 agccagtcga tcagcagagc cgccagccgg cgccccatcg gagccagcga acccggtccg 2487361 gtgtccggca agcccagcgt cttgccggga tagtcgggcg gcgatttcgc cgtcatcggg 2487421 cagacccgat aaccaggttc ccgttcggca tgccaccggt tacgatcttg ccgaccatgg 2487481 ccccacaata gggccgggga gacccggcgt cagtggtggg cggcacggtc agtaacgtct 2487541 gcgcaacacg gggttgactg acgggcaata tcggctccat agcgtcggcc gcggatacag 2487601 taaaggagca ttctgtgacg gaaaagacgc ccgacgacgt cttcaaactt gccaaggacg 2487661 agaaggtcga atatgtcgac gtccggttct gtgacctgcc tggcatcatg cagcacttca 2487721 cgattccggc ttcggccttt gacaagagcg tgtttgacga cggcttggcc tttgacggct 2487781 cgtcgattcg cgggttccag tcgatccacg aatccgacat gttgcttctt cccgatcccg 2487841 agacggcgcg catcgacccg ttccgcgcgg ccaagacgct gaatatcaac ttctttgtgc 2487901 acgacccgtt caccctggag ccgtactccc gcgacccgcg caacatcgcc cgcaaggccg 2487961 agaactacct gatcagcact ggcatcgccg acaccgcata cttcggcgcc gaggccgagt 2488021 tctacatttt cgattcggtg agcttcgact cgcgcgccaa cggctccttc tacgaggtgg 2488081 acgccatctc ggggtggtgg aacaccggcg cggcgaccga ggccgacggc agtcccaacc 2488141 ggggctacaa ggtccgccac aagggcgggt atttcccagt ggcccccaac gaccaatacg 2488201 tcgacctgcg cgacaagatg ctgaccaacc tgatcaactc cggcttcatc ctggagaagg 2488261 gccaccacga ggtgggcagc ggcggacagg ccgagatcaa ctaccagttc aattcgctgc 2488321 tgcacgccgc cgacgacatg cagttgtaca agtacatcat caagaacacc gcctggcaga 2488381 acggcaaaac ggtcacgttc atgcccaagc cgctgttcgg cgacaacggg tccggcatgc 2488441 actgtcatca gtcgctgtgg aaggacgggg ccccgctgat gtacgacgag acgggttatg 2488501 ccggtctgtc ggacacggcc cgtcattaca tcggcggcct gttacaccac gcgccgtcgc 2488561 tgctggcctt caccaacccg acggtgaact cctacaagcg gctggttccc ggttacgagg 2488621 ccccgatcaa cctggtctat agccagcgca accggtcggc atgcgtgcgc atcccgatca 2488681 ccggcagcaa cccgaaggcc aagcggctgg agttccgaag ccccgactcg tcgggcaacc 2488741 cgtatctggc gttctcggcc atgctgatgg caggcctgga cggtatcaag aacaagatcg 2488801 agccgcaggc gcccgtcgac aaggatctct acgagctgcc gccggaagag gccgcgagta 2488861 tcccgcagac tccgacccag ctgtcagatg tgatcgaccg tctcgaggcc gaccacgaat 2488921 acctcaccga aggaggggtg ttcacaaacg acctgatcga gacgtggatc agtttcaagc 2488981 gcgaaaacga gatcgagccg gtcaacatcc ggccgcatcc ctacgaattc gcgctgtact 2489041 acgacgttta aggactcttc gcagtccggg tgtagaggga gcggcgtgtc gttgccaggg 2489101 cgggcgtcga ggtttttcga tgggtgacgg tggccggcaa cggcgcgccg accaccgctg 2489161 cgaagagccc gtttaagaac gttcaaggac gtttcagccg ggtgccacaa cccgcttggc 2489221 aatcatctcc cgaccgccga gcgggttgtc tttcacatgc gccgaaactc aagccacgtc 2489281 gtcgcccagg cgtgtcgtcg cggccggttc aggttaagtg tcggggattc gtcgtgcggg 2489341 cgggcgtcca cgctgaccaa cggggcagtc aactcccgaa cactttgcgc actaccgcct 2489401 ttgcccgccg cgtcacccgt aggtagttgt ccaggaattc cccaccgtcg tcgtttcgcc 2489461 agccggccgc gaccgcgacc gcattgagct ggcgcccggg tcccggcagc tggtcggtgg 2489521 gcttgccgcg caccaacacc agcgcgttgc gggcccgggt ggcggtcagc caggcctgac 2489581 ggagcagctc cacgtcggct gcgggaacca gatcggcggc cgcgatgaca tccagggatt 2489641 gcagcgtcga ggtgttgtgc agggcgggaa cctggtgcgc atgctgtagc tgcagcaact 2489701 gcacggtcca ttcgatgtcg gccagtccgc cgcggcccag tttggtgtgt gtgttggggt 2489761 cggcaccgcg cggcaaccgc tcggactcga tacgggcctt gatgcggcga atctcgcgca 2489821 ccgagtcagc ggacacaccg tcgggcggat accgcgtttt gtcgaccatc cgtaggaatc 2489881 gctgacccaa ctcggcatcg ccggcaaccg cgtgtgcgcg tagcagggcc tggatctccc 2489941 atggctgtgc ccactgctcg tagtatgcgg cgtaggaccc cagggtgcgg accagcggac 2490001 cgttgcggcc ctcgggtcgc aaattggcgt cgagctccag cggcggatcg acgctgggtg 2490061 tccccagcag cgcccgaacc cgctcggcga tcgatgtcga ccatttcacc gcccgtgcat 2490121 cgtcgacgcc ggtggccggc tcacagacga acatcacgtc ggcatccgac ccgtagccca 2490181 actcggcacc acccagccga cccatgccga tgaccgcgat ggccgccggg gcgcgatcgt 2490241 cgtcgggaag gctggcccgg atcatgacgt ccagcgcggc ctgcagcacc gccacccaca 2490301 ccgacgtcaa cgcccggcac acctcggtga cctcgagcag gccgagcagg tccgccgaac 2490361 cgatgcgggc cagctctcga cgacgcagcg tgcgcgcgcc ggcgatggcc cgctccgggt 2490421 cggggtagcg gctcgccgag gcgatcagcg cccgagccac ggcggcgggc tcggtctcga 2490481 gcagcttcgg gcccgcaggc ccgtcctcgt actgctggat gacccgcggc gcgcgcatca 2490541 acagatccgg cacatacgcc gaggtaccca agacatgcat gagccgcttg gccaccgcgg 2490601 gcttgtcccg cagcgtggcc aggtaccagc tttcggtggc cagcgcctca ctgagccgcc 2490661 ggtaggccag cagtccgccg tcgggatcgg gggcatacga catccagtcc agcagcctgg 2490721 gcagcagcac cgactgcacc cgtccgcgcc ggccgctttg attgaccaac gccgacatgt 2490781 gtttcaacgc ggtctgcggt ccctcgtagc ccagcgcggc cagccggcgc cccgcggcct 2490841 ccaacgtcat gccgtgggcg atctccaacc cggtcgggcc gatcgattcc agcagcggtt 2490901 gatagaagag tttggtgtgt aacttcgaca cccgcacgtt ctgcttcttg agttcctccc 2490961 gcagcacccc ggccgcatcg tttcggccat cgggccggat gtgggccgcg cgcgccagcc 2491021 agcgcactgc ctcctcgtct tcgggatcgg gaagcaggtg ggtgcgcttg agccgctgca 2491081 actgcagtcg gtgctcgagc agcctgagga actcatacga cgcggtcatg ttcgccgcgt 2491141 cctcacgccc gatgtagccg ccttcgccca acgccgccaa tgcgtccacc gtggacgcca 2491201 cccgtaacga ctcgtcgcta cgggcatgaa ccagctgcag tagctgtacg gcgaactcca 2491261 cgtcgcgcaa tccgccgctg ccgagtttga gctcgcggcc gcggacatcg gcgggcacca 2491321 gctgctccac ccgccgccgc atggcctgca cctcgaccac aaagtcttcg cgctcgcagg 2491381 ctcgccacac catcggcatc aaggcggtca ggtaacgctc gccaagttcc gcgtcgccaa 2491441 cgactggccg tgctttcagc aacgcctgaa actcccaggt cttggcccag cgctggtagt 2491501 aggcgatgtg cgactcgagc gtacggacca gctccccgtt gcgcccctcc ggacgcaggg 2491561 cggcgtccac ctcgaaaaag gccgccgagg ccacccgcat catctcgctg gccacgcgcg 2491621 cgttgcgcgg gtcggagcgc tcggcaacga atatgacatc gacgtcgctg acgtagttca 2491681 gttcgcgcgc accgcacttg cccatcgcga tgaccgccag gcgcggtggc gggtgctcgc 2491741 cgcacacgct cgcctcggcc acgcgcagcg ccgccgccag agcggcgtcc gcggcgtccg 2491801 ccaggcgtgc ggccaccacg gtgaatggca gcaccggttc gtcctcgacc gtcgcggcca 2491861 ggtcgagagc ggccagcatt agcacgtagt cgcggtactg ggttcgcaat cggtgcacga 2491921 gcgagcccgg cataccctcc gattcctcga cgcactcgac gaacgaccgc tgcagctggt 2491981 catgggacgg cagtgtgacc ttgccccgca gcaatttcca ggactgcgga tgggcgacca 2492041 ggtgatcgcc caacgccagc gacgagccca gcaccgagaa cagccgcccg cgcagactgc 2492101 gttcgcgcag cagagccgcg ttgagctcgt cccatccggt gtctggattc tccgacagcc 2492161 ggatcaaggc gcgcagcgcg gcatcggcgt ccggagcgcg tgacagcgac cacagcaggt 2492221 cgacgtgcgc ctgatcctcg tgccgatccc accccagctg agccagacgc tcaccagcag 2492281 gggggtcaac taatccgagc cggccaacgc tgggcaactt cggccgctgc gtggcgagtt 2492341 tggtcacgac cacgacggta gcgcaaagcg cgtcggcgtc ggatcaaccg gtagatctgg 2492401 gctacagcga caggtaggtg cgcagctcgt atggcgtgac gtggctgcgg tagttcgccc 2492461 actccgtgcg cttgttgcgc aagaaaaagt caaaaacgtg ctcccccaag gcctccgcga 2492521 cgagttcgga ggcctccatg gcgcgcagcg cactatccaa actggacggc aattctcggt 2492581 accccatcgc tcggcgttcc tcgggtgtga ggtcccatac gttgtcctcg gcctgcgggc 2492641 ccagcacgta acccttctct acaccccgca atcccgcggc cagcagcacg gcgaatgtca 2492701 gatagggatt gcacgccgaa tcagggctgc gtacttcgac ccgccgcgac gaggtcttgt 2492761 gcggcgtgta catcggcacc cgcactaggg cggatcggtt ggcggccccc cacgacgcgg 2492821 ccgtgggcgc ttcgccgccc tgcaccagcc gcttgtaaga gttgacccac tgatttgtga 2492881 ccgcgctgat ctcgcaagcg tgctccagga tcccggcgat gaacgattta cccacttccg 2492941 acagctgcag cggatcatca gcgctgtgga acgcgttgac atcaccctcg aacaggctca 2493001 tgtgggtgtg catcgccgag cccgggtgct ggccgaatgg cttgggcatg aacgacgccc 2493061 gggcgccctc ttccagcgcg acttctttga tgacgtagcg gaaggtcatc acgttgtcag 2493121 ccatcgacag agcgtcggca aaccgcaggt cgatctcctg ctggccgggt gcgccttcgt 2493181 gatggctgaa ctccaccgag atgcccatga attccagggc atcgatcgcg tggcggcgaa 2493241 agttcaaggc ggagtcgtgc accgcttggt cgaaatagcc ggcgttgtcg accgggacgg 2493301 gcaccgaccc gtcctcgggt ccgggcttga gcaggaagaa ctcgatttcg ggatgcacgt 2493361 agcaggagaa gccgagttcg ccggccttcg tcagctgccg ccgcaacacg tgccgcgggt 2493421 ccgcccacga cggcgagccg tccggcatgg tgatgtcgca aaacatccgc gctgagtggt 2493481 ggtggccgga actggtggcc cagggcagca cctggaaggt cgacgggtcc gggtgcgcca 2493541 ccgtatcgga ttccgagacc cgcgcaaagc cctcgatcga ggatccgtcg aagccgatgc 2493601 cttcctcgaa ggcgccctcg agttcggctg gggcgatggc gaccgacttg aggaaaccga 2493661 gcacgtctgt gaaccacagc cggacgaagc ggatgtcgcg ttcttccagg gtacgaagaa 2493721 cgaattcctt ctgtcggtcc atacctcgaa cagtatgcac tgtctgttaa aaccgtgtta 2493781 ccgatgcccg gccagaagcg ttgcggggcg gcccgcaagg ggagtgcgcg gtgagttcag 2493841 ggcgcgcacc gcagactcgt cggcggcaag gtcccgtcga gaaaatagtg catcaccgca 2493901 gagtccacac actggttgcc atcgaacacc gcagtgtgtt gggtgccgtc gaaggtgatc 2493961 agcggtgcgc ccagctggcg ggccaggtct accccggact gatacggagt ggccgggtcg 2494021 tgggtggtgg acaccacgac gaccttgcca gccccggccg gcgccgcggg gtgcggcgtc 2494081 gacgttgccg gcaccggcca cagcgcgcac agatcgcggg gggcggatcc ggtgaactgc 2494141 ccgtagctaa ggaacggggc gacctgacgg atccgttggt cggcggccac ccaggccgct 2494201 ggatcggccg gtgtgggcgc atcgacgcac cggaccgcgt tgaacgcgtc ctggtcgttg 2494261 ctgtagtgcc cgtctgcatc ccggccgtca tagtcgtcgg caagcaccag caagtcgccg 2494321 gcgtcgctgc cgcgctgcag ccccagcaga ccactggtca ggtacttcca gcgctgaggg 2494381 ctgtacagcg cgttgatggt gcccgtcgtc gcgtcggcgt agctcaggcc acgtggatcc 2494441 gacgtcttac ccggcttctg caccagcggg tcaaccaggg cgtggtagcg gttgacccac 2494501 tgggccgagt cggtgcccag agggcaggcc ggcgagcggg cgcagtcggc ggcgtagtca 2494561 ttgaaagcgg tctgaaatcc cgccatttgg ctgatgcttt cctcgattgg gctaacggct 2494621 ggatcgatag cgccgtcgag gaccatcgcc cgcacatgag taccgaaccg ttccaggtaa 2494681 gcggtgccca actcggtgcc gtagctgtat ccgaggtagt tgatctgatc gtcacctaac 2494741 gcttggcgaa ccatgtccat gtcccgtgcg acggacgcgg taccgatatt ggccaagaag 2494801 ctgaagccca tccggtcaac acagtcctgg gccaactgcc ggtagacctg ttcgacgtgg 2494861 gtgacaccgg ccggactgta gtcggccatc ggatcgcgcc ggtacgcgtc gaactcggcg 2494921 tcggtgcgac accgcaacgc aggggtcgag tggccgaccc ctctcgggtc gaagcccacc 2494981 aggtcgaagt ggcggagaat gtcggtgtcg gcgatcgcgg gtgccatagc ggcgaccatg 2495041 tcgaccgccg acgccccggg tcccccagga ttgaccagca gtgctccgaa tcgctgtccc 2495101 gtcgcgggga cgcggatcac cgccaacttc gcttgtgtcc caccgggttg gtcgtagtcg 2495161 acggggacgg acaccgtcgc gcagcgtgca gtgcgaattt cgctggtgtc ggcgatgaac 2495221 tcgcggcagc tgttccaact ctgttgcggc gccacgaccg gcgcacccgg ggtttggccg 2495281 gcgccgggtt cttcagtcgc gccggccaac gggggcgctg ctaggggcag tccgccgagc 2495341 agcaacccga aggacagcag cgccgagctc aacggtctgc ggcgccacat ggccgccatc 2495401 gtctcaccgg cgaatacctg tgacggcgcg aaatgatcac accttcgttt cttcgccccg 2495461 ctagcacttg gcgccgctgg gcggcgtggt gccgccgatt aaatacgccg tcacgtactc 2495521 gtcaatgcag ctgtcgccct ggaataccac cgtgtgctgg gttccgtcga aggtcagcaa 2495581 cgaaccgcga agctggttcg ccaggtcgac cccggccttg tacggcgtcg ccgggtcatg 2495641 ggtggtggat accaccaccg tcggcactag gccgggcgcc gagacggcat ggggctgact 2495701 tgtgggtggc accggccaga acgcgcaggt gcccagcggc gcatcaccgg tgaacttccc 2495761 gtagctcatg aacggtgcga tctcccgggc gcggcggtct tcgtcgatga ccttgtcgcg 2495821 atcggtaacc gggggctgat cgacgcaatt gatcgccacc cgcgcgtcac cggaattgtt 2495881 gtagcggccg tgcgagtccc gacgcatgta catgtcggcc agagccagca gggtgtctcc 2495941 gcgattgtcg accagctccg acagcccgtc ggtcaagtgt tgccacagat tcggtgagta 2496001 cagcgccata atggtgccca cgatggcgtc gctataactc agcccgcgcg gatccttcgt 2496061 gcgcgccggc ctgctgatcc tcgggttgtc cgggtcgacc aacggatcga ccaggctgtg 2496121 gtagacctcg acggctttgg ccgggtcggc gcccagcggg cagcccgcgt tcttggcgca 2496181 gtcggcggca tagttgttga acgcgtcctg gaagcccttg gcctggcgca gctccgcctc 2496241 gatgggatcg gcattggggt cgacggcacc gtcgagaatc attgcccgca cccgctgcgg 2496301 aaattcctcg gcatacgcgg agccgatccg ggtgccgtac gagtagccca ggtaggtcag 2496361 cttgtcgtcg cccaacgccg cgcgaatggc atccaggtcc ttggcgacgt tgaccgtccc 2496421 gacatgggcc agaaagttct tgcccatctt gtccacacag cgaccgacga attgcttggt 2496481 ctcgttctcg atgtgcgcca caccctcccg gctgtagtca acctgcggct cggcccgcag 2496541 ccggtcgttg tcggcatcgg agttgcacca gatcgccggc cgggacgacg ccaccccgcg 2496601 ggggtcgaac ccaaccaggt cgaacctttc gtgcacccgc ttcggcaatg tctggaagac 2496661 gcccaaggcg gcctcgatac cggattcgcc gggtccaccg ggatttatga ccagcgaacc 2496721 gatcttgtct cccgtcgccg gaaagcgaat cagcgccagc gccgccacgt caccatcggg 2496781 gcggtcgtag tcgaccggta cagcgagctt gccgcataac gcgccgccgg ggatctttac 2496841 ttgcgggttt gacgaccggc acggtgtcca ctccaccggc tggcccagct tcggctccgc 2496901 catacgagcg cgtcccccga ccacgcggat gcagcccaca agaaccaacg ccacggcggc 2496961 gagcgcggcc cagatcaaca gcatgcgcgc gatcttgtcg cggcgagaca gcctcatgcc 2497021 cacaatgctg ccagagcaga cccgagatcc tggccagcgg ccaccgtcgg ccgactaacc 2497081 ggccgctgcc agcagtcctg ccatcgccga tggcgaactc gtcggccatc ccccatacgt 2497141 ccggtaacag atccgggcaa gacaccgacc cgtcgaccgg atccggcacg ggcgcgtcgg 2497201 cctcggcggt gcacaactgc gacatcaggt tggcgctggc accccgtcca cgccggcatg 2497261 gtgcaccttg gccatcgccc gagggcgatc cccgatgccg tccacccctt cgacgaaccc 2497321 atctcccacg gcggtcgccg gcagcgacgc gatgtggccg cagatctccg agagttcggc 2497381 ccgcccgccc ggcgacggca acccgatgcc gtgcaagtga cgatcgatgt gaggttcaag 2497441 gttcagcgca ctgctggcaa gctttttccg aaaccgcggc ctcgccttga tctggagtca 2497501 gaacgcgtca cgcagccggt caaaggcgta acccatgctc gagcaaacat gcatgggctg 2497561 agtggacgtt tccagacaca gcaactggcg tccaggccac tgagccgctg catgcgcgat 2497621 ggtatgccga tgggggcccc gggcgcgtct gaggggaaga agtggcagac tgtcagggtc 2497681 cgacgaaccc ggggacccta acgggccacg aggatcgacc cgaccaccat tagggacagt 2497741 gatgtctgag cagactatct atggggccaa tacccccgga ggctccgggc cgcggaccaa 2497801 gatccgcacc caccacctac agagatggaa ggccgacggc cacaagtggg ccatgctgac 2497861 ggcctacgac tattcgacgg cccggatctt cgacgaggcc ggcatcccgg tgctgctggt 2497921 cggtgattcg gcggccaacg tcgtgtacgg ctacgacacc accgtgccga tctccatcga 2497981 cgagctgatc ccgctggtcc gtggcgtggt gcggggtgcc ccgcacgcac tggtcgtcgc 2498041 cgacctgccg ttcggcagct acgaggcggg gcccaccgcc gcgttggccg ccgccacccg 2498101 gttcctcaag gacggcggcg cacatgcggt caagctcgag ggcggtgagc gggtggccga 2498161 gcaaatcgcc tgtctgaccg cggcgggcat cccggtgatg gcacacatcg gcttcacccc 2498221 gcaaagcgtc aacaccttgg gcggcttccg ggtgcagggc cgcggcgacg ccgccgaaca 2498281 aaccatcgcc gacgcgatcg ccgtcgccga agccggagcg tttgccgtcg tgatggagat 2498341 ggtgcccgcc gagttggcca cccagatcac cggcaagctt accattccga cggtcgggat 2498401 cggcgctggg cccaactgcg acggccaggt cctggtatgg caggacatgg ccgggttcag 2498461 cggcgccaag accgcccgct tcgtcaaacg gtatgccgat gtcggtggtg aactacgccg 2498521 tgctgcaatg caatacgccc aagaggtggc cggcggggta ttccccgctg acgaacacag 2498581 tttctgacca agccgaatca gcccgatgcg cgggcattgc ggtggcgccc tggatgccgt 2498641 cgacgccgga ttgccggcgc ggacgcgcca gcgggaccca tcggcgtcgc gttcgccggt 2498701 tgagcccggg gtgagcccag acattcgatg tgcccaacac catccgccac agcccaattg 2498761 atgtggcact ctatgcatgc ctatccccga ccaaccacca ccgcggcgac gcatcatgac 2498821 cggaggcgaa gatgccagta gaggcgccca gaccagcgcg ccatctggag gtcgagcgca 2498881 agttcgacgt gatcgagtcg acggtgtcgc cgtcgttcga gggcatcgcc gcggtggttc 2498941 gcgtcgagca gtcgccgacc cagcagctcg acgcggtgta cttcgacaca ccgtcgcacg 2499001 acctggcgcg caaccagatc accttgcggc gccgcaccgg cggcgccgac gccggctggc 2499061 atctgaagct gccggccgga cccgacaagc gcaccgagat gcgagcaccg ctgtccgcat 2499121 caggcgacgc tgtgccggcc gagttgttgg atgtggtgct ggcgatcgtc cgcgaccagc 2499181 cggttcagcc ggtcgcgcgg atcagcactc accgcgaaag ccagatcctg tacggcgccg 2499241 ggggcgacgc gctggcggaa ttctgcaacg acgacgtcac cgcatggtcg gccggggcat 2499301 tccacgccgc tggtgcagcg gacaacggcc ctgccgaaca gcagtggcgc gaatgggaac 2499361 tggaactggt caccacggat gggaccgccg ataccaagct actggaccgg ctagccaacc 2499421 ggctgctcga tgccggtgcc gcacctgccg gccacggctc caaactggcg cgggtgctcg 2499481 gtgcgacctc tcccggtgag ctgcccaacg gcccgcagcc gccggcggat ccagtacacc 2499541 gcgcggtgtc cgagcaagtc gagcagctgc tgctgtggga tcgggccgtg cgggccgacg 2499601 cctatgacgc cgtgcaccag atgcgagtga cgacccgcaa gatccgcagc ttgctgacgg 2499661 attcccagga gtcgtttggc ctgaaggaaa gtgcgtgggt catcgatgaa ctgcgtgagc 2499721 tggccgatgt cctgggcgta gcccgggacg ccgaggtact cggtgaccgc taccagcgcg 2499781 aactggacgc gctggcgccg gagctggtac gcggccgggt gcgcgagcgc ctggtagacg 2499841 gggcgcggcg gcgataccag accgggctgc ggcgatcact gatcgcattg cggtcgcagc 2499901 ggtacttccg tctgctcgac gctctagacg cgcttgtgtc cgaacgcgcc catgccactt 2499961 ctggggagga atcggcaccg gtaaccatcg atgcggccta ccggcgagtc cgcaaagccg 2500021 caaaagccgc aaagaccgcc ggcgaccagg cgggcgacca ccaccgcgac gaggcattgc 2500081 acctgatccg caagcgcgcg aagcgattac gctacaccgc ggcggctact ggggcggaca 2500141 atgtgtcaca agaagccaag gtcatccaga cgttgctagg cgatcatcaa gacagcgtgg 2500201 tcagccggga acatctgatc cagcaggcca tagccgcgaa caccgccggc gaggacacct 2500261 tcacctacgg tctgctctac caacaggaag ccgacttggc cgagcgctgc cgggagcagc 2500321 ttgaagccgc gctgcgcaaa ctcgacaagg cggtccgcaa agcacgggat tgagcccgcc 2500381 aggggcggac gagttggcct gtaagccgga ttctgttccg cgccgccaca gccaagctaa 2500441 cggcggcacg gcggcgacca tccatctgga cacaccgtta ccgggtgcct cgagcggcct 2500501 acccgcaggc tcgggcgagc aaccctcaag cgcctgcgcg gccgcacttt cggtgcggcc 2500561 ttcttggcct tgcttcgggt ggggtttgcc tagccacccc ggtcacccgg aatgctggtg 2500621 cgctcttacc gcaccgtttc acccttgcca ccacgaggat ggcggtctgt tttctgtggc 2500681 actttcccgc gagtcacctc ggattgccgt tagcaatcac cctgctctgt gaagtccgga 2500741 ctttcctcga ctcgacgctg aacctcgtga atccacacaa gccctacgcg agccgcggcc 2500801 gcccagccaa ctcatccgcg acgaccacgc taccccgctg ggcggtgtcg cggccagtgt 2500861 gaccgctgga cgacacggct agtcggacag ccgatccggc gggcagtcct tatcgtggac 2500921 tggtgacacg gtgggacaaa cgcgtcgact ccggcgactg ggacgccatc gctgccgagg 2500981 tcagcgagta cggtggcgca ctgctacctc ggctgatcac ccccggcgag gccgcccggc 2501041 tgcgcaagct gtacgccgac gacggcctgt ttcgctcgac ggtcgatatg gcatccaagc 2501101 ggtacggcgc cgggcagtat cgatatttcc atgcccccta tcccgagtga tcgagcgtct 2501161 caagcaggcg ctgtatccca aactgctgcc gatagcgcgc aactggtggg ccaaactggg 2501221 ccgggaggcg ccctggccag acagccttga tgactggttg gcgagctgtc atgccgccgg 2501281 ccaaacccga tccacagcgc tgatgttgaa gtacggcacc aacgactgga acgccctaca 2501341 ccaggatctc tacggcgagt tggtgtttcc gctgcaggtg gtgatcaacc tgagcgatcc 2501401 ggaaaccgac tacaccggcg gcgagttcct gcttgtcgaa cagcggcctc gcgcccaatc 2501461 ccggggtacc gcaatgcaac ttccgcaggg acatggttat gtgttcacga cccgtgatcg 2501521 gccggtgcgg actagccgtg gctggtcggc atctccagtg cgccatgggc tttcgactat 2501581 tcgttccggc gaacgctatg ccatggggct gatctttcac gacgcagcct gattgcacgc 2501641 catctataga tagcctgtct gattcaccaa tcgcaccgac gatgccccat cggcgtagaa 2501701 ctcggcgatg ctcagcgatg ccagatcaag atgcaaccga tataggacgc ccgacccggc 2501761 atccaacgcc agccgcaaca acattttgat cggcgtgaca tgtgacacca ccagcaccgt 2501821 cgcgccttcg tagccaacga tgatccgatc acgtccccgc cgaacccgcc gcagcacgtc 2501881 gtcgaagctt tccccacccg ggggcgtgat gctggtgtcc tgcagccagc gacggtgcag 2501941 ctcgggatcg cgttctgcgg cctccgcgaa cgtcagcccc tcccaggcgc cgaagtcggt 2502001 ctcgaccagg tcgtcatcga cgaccacgtc cagggccagg gctctggcgg cggtcaccgc 2502061 ggtgtcgtaa gcccgctgta gcggcgagga gaccaccgca gcgatcccgc cgcgccgcgc 2502121 cagatacccg gccgccgcac caacctggcg ccaccccacc tcgttcaacc ccgggttgcc 2502181 gcgccccgaa tagcggcgtt gctccgacag ctccgtctgc ccgtggcgca acaaaagtag 2502241 tcgggtgggt gtaccgcggg cgccggtcca gccgggagat gtcggtgact cggtcgcaac 2502301 gattttggca ggatccgcat ccgccgcagc cgattgcgcg gcggcgtcca tcgcgtcatt 2502361 ggccaaccgg tctgcatacg tgttccgggc acgcggaacc cactcgtagt tgatcctgcg 2502421 aaactgggac gccaacgcct gagcctggac atagagcttc agcagatccg ggtgcttgac 2502481 cttccaccgc ccggacatct gctccaccac cagcttggag tccatcagca ccgcggcctc 2502541 ggtggcacct agtttcacgg cgtcgtccaa accggctatc aggccgcggt attcggcgac 2502601 gttgttcgtc gcccggccga tcgcctgctt ggactcggcc agcacggtgg agtgatcggc 2502661 ggtccacacc accgcgccgt atccggccgg tccgggattg ccccgcgatc cgccgtcggc 2502721 ttcgatgaca actttcactc ctcaaatcct tcgagccgca acaagatcgc tccgcattcc 2502781 gggcagcgca ccacttcatc ctcggcggcc gccgagatct gggccagctc gccgcggccg 2502841 atctcgatcc ggcaggcacc acatcgatga ccttgcaacc gcccggcccc tggcccgcct 2502901 ccggcccgct gtctttcgta gagccccgca agctcgggat caagtgtcgc cgtcagcatg 2502961 tcgcgttgcg atgaatgttg gtgccgggct tggtcgattt cggcaagtgc ctcgtccaaa 2503021 gcctgctggg cggcggccag gtcggcccgc aacgcttgga gcgcccgcga ctcggcggtc 2503081 tgttgagcct gcagctcctc gcggcgttcc agcacctcca gcagggcatc ttccaaactg 2503141 gcttgacggc gttgcaagct gtcgagctcg tgctgcagat cagccaattg cttggcgtcc 2503201 gttgcacccg aagtgagcaa cgaccggtcc cggtcgccac gcttacgcac cgcatcgatc 2503261 tccgactcaa aacgcgacac ctggccgtcc aagtcctccg ccgcgattcg cagggccgcc 2503321 atcctgtcgt tggcggcgtt gtgctcggcc tgcacctgct ggtaagccgc ccgctgcggc 2503381 agatgggtag cccgatgcgc gatccgggtc agctcagcat ccagcttcgc caattccagt 2503441 agcgaccgtt gctgtgccac tccggctttc atgcctgatc tctcccagtt tcgtgatcga 2503501 ggttccacgg gtcggtgcag atggtgcaca cacgcaccgg cagcgacgcg ccgaaatgag 2503561 accgcaacac ttcggcggcc tggccgcacc acgggaattc gcttgcccaa tgcgcgacgt 2503621 cgatcagggc cacttgcgaa gctcggcaat gctcgtcggc tggatgatgt cgcagatcgg 2503681 ccgtaacgta cgcttgcacg tccgcggcgg ccacggtggc aagcaacgag tccccggcgc 2503741 cgccgcagac cgcgacccgc gacaccagca ggtcgggatc cccggcggcg cgcacaccgg 2503801 tcgcagtcgg cggcaacgcg gcctccagac gggcaacaaa ggtgcgcagc ggttcgggtt 2503861 ttggcagtct gccaatccgg cctaacccgc tgccgaccgg cggtggtacc agcgcgaaga 2503921 tgtcgaatgc cggctcctcg taagggtgcg cggcgcgcat cgccgccaac acctcggcgc 2503981 gcgctcgtgc gggtgcgacg acctcgaccc ggtcctcggc cacccgttcg acggtaccga 2504041 cgctgcctat ggcgggcgac gccccgtcgt gcgccaggaa ctgcccggta cccgcgacac 2504101 tccagctgca gtgcgagtag tcgccgatat ggccggcacc ggcctcaaag accgctgccc 2504161 gcaccgcctc tgagttctcg cgcggcacat agatgaccca cttgtcgaga tcggccgctc 2504221 cgggcaccgg gtcgagaacg gcgtcgacgg tcagaccaac agcgtgtgcc agcgcgtcgg 2504281 acacacccgg cgacgccgag tcggcgttgg tgtgcgcggt aaacaacgag cgaccggtcc 2504341 ggatcaggcg gtgcaccagc acaccctttg gcgtgttggc cgcgaccgta tcgaccccac 2504401 gcagtaacaa cgggtggtgc accaatagca gtccggcctg gggaacctgg tccaccaccg 2504461 ccggcgtcgc gtccaccgca acggtcaccg aatccaccac gtcgtcgggg tcgccgcaca 2504521 ccagacccac cgaatcccac gactgggcaa gccgcggcgg gtaggcctgg tccagcacgt 2504581 cgatgacatc ggccagccgc acactcatcg gcgtcctcca cgctttgccc actcggcgat 2504641 cgccgccacc agcacgggcc actccgggcg caccgccgcc cgcaggtacc gcgcgtccag 2504701 gccgacgaag gtgtcaccgc ggcgcaccgc aattcctttg ctctgcaaat agtttcgtaa 2504761 tccgtcagca tcggcgatgt tgaacagtac gaaaggggcc gcaccatcga ccacctcggc 2504821 acccaccgat ctcagtccgg ccaccatctc cgcgcgcagc gccgtcaacc gcaccgcatc 2504881 ggctgcggca gcggcgaccg cccggggggc gcagcaagca gcgatggccg tcagttgcaa 2504941 tgttcccaac ggccagtgcg ctcgctgcac ggtcaaccga gccagcacgt ctggcgagcc 2505001 gagcgcgtag cccacccgca atccggccag cgaccacgtt ttcgtcaagc tacggagcac 2505061 cagcacatcg ggcagcgagt catcggccaa cgattgcggc tcgccgggaa cccaatcagc 2505121 gaacgcctcg tcgaccacca ggatgcgtcc cggccggcgt aactcgagca gctgctcgcg 2505181 gaggtgcagc accgaggtgg ggttggtcgg attacccacg acgacaaggt cggcgtcgtc 2505241 aggcacgtgc gcggtgtcca gcacgaacgg cggctttagg acaacatggt gcgccgtgat 2505301 tccggcagcg ctcaaggcta tggccggctc ggtgaacgcg ggcacgacga ttgctgcccg 2505361 caccggactt aggttgtgca gcaatgcgaa tccctccgcc gccccgacga gcgggagcac 2505421 ttcgtcacgg gttctgccat gacgttcagc gaccgcgtct tgcgcccggt gcacatcgtc 2505481 ggtgctcgga tagcgggcca gctccggcag cagcgcggcg agctgccgga ccaaccattc 2505541 cgggggccgg tcatggcgga cgttgacggc gaagtccagc acgccgggcg cgacatcctg 2505601 atcaccgtgg tagcgcgccg cggcaagcgg gctagtgtct agactcgcca cagcgtcaaa 2505661 cagtagtggg ccggtgtgcg ggccaagaat ccagagcacc gccgacgcgt tgtctacgcg 2505721 gcgacaaccg cgacatcaca ggcagctaac agggcgtcgg cggtgatgat cgtcaggcca 2505781 agcagctgtg cctgggcgat gagcacacgg tcgaatggat gtcgatggtg atccggaagc 2505841 tctgcggtgc gcagtgtgtg cgtggtcaac tgacagcggc gacgtgccgc agcggcgcat 2505901 tcgatcgggc acgtaagaag ccgatggctc gggcggcggg agcttgccga ggcggtagtt 2505961 gatcgcgatc tcccaggcac tggcggccga caagagaatg ctgttgcgga cgtcctgaac 2506021 aatcgcccgt gtttcgttga cggcatccgc agccaaacgt gggtgtcgat gaggtagcgc 2506081 ttcaccggtg aaagcgttcg agcacgtcgt ctgacaacgg agcgtccaaa tcgtcgggca 2506141 cgcggtacac gccatggtca atgcctaacc gccgagtctc atgaggatgc agcggcacaa 2506201 gctttgctac cggctcgccg cggcgggcaa tctcaacctc tgcccgccgt agacgagccg 2506261 cagcagctcg gacaggcgtg tcttcgcctc gtgaacgccg acccgcttcg caggcgccca 2506321 gactttcgcg tcgaccacct gctcaccaaa cttcgcgatc atcgcctgat accacagcgc 2506381 caacgggtag cggtttgtcc aaccgcttcg tcaacgacaa tgggatcgtg accgacacga 2506441 ccgcgagcgg gaccaattgc ccgcctcctc cacgcgccgc cgcacggcgc gcatcgtcgc 2506501 cgggtgaatc gccgcagctg gtgatcttcg atctggacgg cacgctgacc gactcggcgc 2506561 gcggaatcgt atccagcttc cgacacgcgc tcaaccacat cggtgcccca gtacccgaag 2506621 gcgacctggc cactcacatc gtcggcccgc ccatgcatga gacgctgcgc gccatggggc 2506681 tcggcgaatc cgccgaggag gcgatcgtag cctaccgggc cgactacagc gcccgcggtt 2506741 gggcgatgaa cagcttgttc gacgggatcg ggccgctgct ggccgacctg cgcaccgccg 2506801 gtgtccggct ggccgtcgcc acctccaagg cagagccgac cgcacggcga atcctgcgcc 2506861 acttcggaat tgagcagcac ttcgaggtca tcgcgggcgc gagcaccgat ggctcgcgag 2506921 gcagcaaggt cgacgtgctg gcccacgcgc tcgcgcagct gcggccgcta cccgagcggt 2506981 tggtgatggt cggcgaccgc agccacgacg tcgacggggc ggccgcgcac ggcatcgaca 2507041 cggtggtggt cggctggggc tacgggcgcg ccgactttat cgacaagacc tccaccaccg 2507101 tcgtgacgca tgccgccacg attgacgagc tgagggaggc gctaggtgtc tgatccgctg 2507161 cacgtcacat tcgtttgtac gggcaacatc tgccggtcgc caatggccga gaagatgttc 2507221 gcccaacagc ttcgccaccg tggcctgggt gacgcggtgc gagtgaccag tgcgggcacc 2507281 gggaactggc atgtaggcag ttgcgccgac gagcgggcgg ccggggtgtt gcgagcccac 2507341 ggctacccta ccgaccaccg ggccgcacaa gtcggcaccg aacacctggc ggcagacctg 2507401 ttggtggcct tggaccgcaa ccacgctcgg ctgttgcggc agctcggcgt cgaagccgcc 2507461 cgggtacgga tgctgcggtc attcgaccca cgctcgggaa cccatgcgct cgatgtcgag 2507521 gatccctact atggcgatca ctccgacttc gaggaggtct tcgccgtcat cgaatccgcc 2507581 ctgcccggcc tgcacgactg ggtcgacgaa cgtctcgcgc ggaacggacc gagttgatgc 2507641 cccgcctagc gttcctgctg cggcccggct ggctggcgtt ggccctggtc gtggtcgcgt 2507701 tcacctacct gtgctttacg gtgctcgcgc cgtggcagct gggcaagaat gccaaaacgt 2507761 cacgagagaa ccagcagatc aggtattccc tcgacacccc gccggttccg ctgaaaaccc 2507821 ttctaccaca gcaggattcg tcggcgccgg acgcgcagtg gcgccgggtg acggcaaccg 2507881 gacagtacct tccggacgtg caggtgctgg cccgactgcg cgtggtggag ggggaccagg 2507941 cgtttgaggt gttggcccca ttcgtggtcg acggcggacc aaccgtcctg gtcgaccgtg 2508001 gatacgtgcg gccccaggtg ggctcgcacg taccaccgat cccccgcctg ccggtgcaga 2508061 cggtgaccat caccgcgcgg ctgcgtgact ccgaaccgag cgtggcgggc aaagacccat 2508121 tcgtcagaga cggcttccag caggtgtatt cgatcaatac cggacaggtc gccgcgctga 2508181 ccggagtcca gctggctggg tcctatctgc agttgatcga agaccaaccc ggcgggctcg 2508241 gcgtgctcgg cgttccgcat ctagatcccg ggccgttcct gtcctatggc atccaatgga 2508301 tctcgttcgg cattctggca ccgatcggct tgggctattt cgcctacgcc gagatccggg 2508361 cgcgccgccg ggaaaaagcg gggtcgccac caccggacaa gccaatgacg gtcgagcaga 2508421 aactcgctga ccgctacggc cgccggcggt aaaccaacat cacggccaat accgcagccc 2508481 ccgcctggac cacccgcgac agcaccacgg cgcggcgcag atcggccacc ttgggcgacc 2508541 ggccgtcgcc caaggtgggc cggatctgca actcatggtg gtaccgggtg ggcccaccca 2508601 gccgcacgtc aagcgcccca gcaaacgccg cctcgacgac accggcgttg gggctgggat 2508661 ggcgggcggc gtcgcgccgc caggcccgta ccgcaccgcg gggcgaccca ccgaccaccg 2508721 gcgcgcagat caccaccagc accgccgtcg cccgtgcgcc aacatagttg gcccagtcat 2508781 ccaatcgtgc tgcagcccaa ccgaatcgga gataacgcgg cgagcggtag ccgatcatcg 2508841 agtccagggt gttgatggca cgatatccca gcaccgcagg cacgccgctc gaagccgccc 2508901 acagcagcgg caccacctgg gcgtcggcgg tgttttcggc caccgactcc agcgcggcac 2508961 gcgtcaggcc cgggccgccc agctgggccg ggtcacgccc gcacagcgac ggcagcagcc 2509021 gtcgcgccgc ctcgacatcg tcgcgctcca acaggtccga tatctggcgg ccggtgcgcg 2509081 ccagcgaagt tccgcccagc gctgcccagg tggccgtcgc ggtggccgcc acgggccagg 2509141 acctgccggg tagccgctgc agtgccgcgc cgagcaagcc caccgcgccg accagcaggc 2509201 cgacgtgtac cgcaccggcg acccggccgt cacggtaggt gatctgctcc agcttggcgg 2509261 ccgcccgacc gaacagggcc accggatgac ctcgtttggg gtcgccgaac acgacgtcga 2509321 gcaggcagcc gatcagcacg ccgacggccc tggtctgcca ggtcgatgca aacactccgg 2509381 cagcgtcgca cacgtggtct acgctcagct atttatgacc tcatacggca gctatccacg 2509441 atgaagcggc cagctacccg ggttgccgac ctgttgaacc cggcggcaat gttgttgccg 2509501 gcagcgaatg tcatcatgca gctggcagtg ccgggtgtcg ggtatggcgt gctggaaagc 2509561 ccggtggaca gcggcaacgt ctacaagcat ccgttcaagc gggcccggac caccggcacc 2509621 tacctggcgg tggcgaccat cgggacggaa tccgaccgag cgctgatccg gggtgccgtg 2509681 gacgtcgcgc accggcaggt tcggtcgacg gcctcgagcc cagtgtccta taacgccttc 2509741 gacccgaagt tgcagctgtg ggtggcggcg tgtctgtacc gctacttcgt ggaccagcac 2509801 gagtttctgt acggcccact cgaagatgcc accgccgacg ccgtctacca agacgccaaa 2509861 cggttaggga ccacgctgca ggtgccggag gggatgtggc cgccggaccg ggtcgcgttc 2509921 gacgagtact ggaagcgctc gcttgatggg ctgcagatcg acgcgccggt gcgcgagcat 2509981 cttcgcgggg tggcctcggt agcgtttctc ccgtggccgt tgcgcgcggt ggccgggccg 2510041 ttcaacctgt ttgcgacgac gggattcttg gcaccggagt tccgcgcgat gatgcagctg 2510101 gagtggtcac aggcccagca gcgtcgcttc gagtggttac tttccgtgct acggttagcc 2510161 gaccggctga ttccgcatcg ggcctggatc ttcgtttacc agctttactt gtgggacatg 2510221 cggtttcgcg cccgacacgg ccgccgaatc gtctgataga gcccggccga gtgtgagcct 2510281 gacagcccga caccggcggc gtgtgtcgcg tcgccaggtt cacgctcggc gatctagagc 2510341 cgccgaaaac ctacttctgg gttgcctccc gaatcaacgt gctgatctgc tcgagcagct 2510401 cacgcatatc ggcgcgcatc gcatccaccg cggcatacag gtcggccttg gtcgccggca 2510461 gctggtccga cgtcattggc cgcaccggcg gtgctgtctg tcgcgccgcg ctgtcgcttt 2510521 gaaacccagg tcgctcaccc acgaccacga cactgccata tccggcgccc cgccgacaac 2510581 gaagcacagc tagccggtgg gcgcggacgg gatcgaaccg ccgaccgctg gtgtgtaaaa 2510641 ccagagctct accgctgagc tacgcgccca tgaccgccgc aggctacacg ccttgcggcc 2510701 aagcacccaa aaccttaggc cgtaagcgcc gccagagcgt cggtccacag ccgctgatcg 2510761 cgaacttcac ccggctgctt catctcggcg aaccgaatga tccctgaccg atcgaccaca 2510821 aaggtgcccc ggttagcgat gccggcctgc tcgttgaaga cgccgtaggc ctgactgacc 2510881 gcgccgtgtg gccagaagtc cgacaacagc ggaaacgtga atccgctctg cgtcgcccag 2510941 atcttgtgag tgggtggcgg gcccaccgaa atcgctagcg cggcgctgtc gtcgttctca 2511001 aactcgggca ggtgatcacg caactggtcc agctcgccct ggcagatgcc cgtgaacgcc 2511061 aacggaaaga acaccaacag cacgttcttt gcaccccggt agccgcgcag ggtgacaagc 2511121 tgctgattct ggtcgcgcaa cgtgaagtca ggggcggtgg ctccgacgtt cagcatcagc 2511181 gcttgccagc ccgcgatttc ggctgtacca atctgctggc gctccagttg cccagattga 2511241 ccgacgaggt cggcatcagc ccagctgtgg gcgccgcctc ggcaatctcg gcgggcaata 2511301 catggccggg ctggccggtc ttgggcgtca ccacccaaat cacaccgtcc tcggcgagcg 2511361 ggccgatcgc atccatcagg gtgtccacca aatcgccgtc gccatcacgc caccacaaca 2511421 ggacgacatc gatgacctcg tcggtgtctt catcgagcaa ctctcccccg cacgcttctt 2511481 cgatggccgc gcggatgtcg tcgtcggtgt cttcgtccca gccccattcc tggataagtt 2511541 ggtctcgttg gatgcccaat ttgcgggcgt agttcgaggc gtgatccgcc gcgaccaccg 2511601 tggaacctcc ttcagtctcc gcgggccatg tgcacaccgt cgcgatgggc attatcgtcg 2511661 cacagccaga accggtccac ccgcccgcct cagaaggcgg ccacgcacat tgtcaatgcc 2511721 tttgtcttgg tgtcgttgag ccgatcaacc cgccggttga attccgctgt cgacgcgtgc 2511781 gcaccgatgg catttgccac cgcgcgggcc gcgtcgacat atgcgttgag cgcatccccc 2511841 agttgcgcgg acagcgcggc gctcagactg cctgagaccg tcgaggcact gttgttgagc 2511901 gcgtcgatgg ccggaccttc ggtcggcccg gtgttgcggc cctgattgaa cgcggccacg 2511961 taggcgttca ccttgtcgat ggcgtccttg ctggtggccg ccagcgcgtc acacgaggtg 2512021 cgaatcgcct tggtcgtcag cgattgttgg cgctgcgact cccggatgct cgacgtcgcc 2512081 gccgaagccg acaccgacgc ggacaccgac gagcggtagg ccggtgcgac gttggtgtcg 2512141 ggcatggccg taccgtcggt gacagtggta catccgacga tccccatcag cagcagcgcg 2512201 atgcagccga gcgccagggc gcctcgcctg gggagctccc ccccgtgcct gcgaggcacg 2512261 gcgcgccatc cgatgagcac ggcatgtgag gttacctggt cgcagcgcga ccgcgctggc 2512321 cgtggtgtgt cgcgcatccg cagaaccgag cggagtgcgg ctatccgccg ccgacgccgg 2512381 tgcggcacga tagggggacg accatctaaa cagcacgcaa gcggaagccc gccacctaca 2512441 ggagtagtgc gttgaccacc gatttcgccc gccacgatct ggcccaaaac tcaaacagcg 2512501 caagcgaacc cgaccgagtt cgggtgatcc gcgagggtgt ggcgtcgtat ttgcccgaca 2512561 ttgatcccga ggagacctcg gagtggctgg agtcctttga cacgctgctg caacgctgcg 2512621 gcccgtcgcg ggcccgctac ctgatgttgc ggctgctaga gcgggccggc gagcagcggg 2512681 tggccatccc ggcattgacg tctaccgact atgtcaacac catcccgacc gagctggagc 2512741 cgtggttccc cggcgacgaa gacgtcgaac gtcgttatcg agcgtggatc agatggaatg 2512801 cggccatcat ggtgcaccgt gcgcaacgac cgggtgtggg cgtgggtggc catatctcga 2512861 cctacgcgtc gtccgcggcg ctctatgagg tcggtttcaa ccacttcttc cgcggcaagt 2512921 cgcacccggg cggcggcgat caggtgttca tccagggcca cgcttccccg ggaatctacg 2512981 cgcgcgcctt cctcgaaggg cggttgaccg ccgagcaact cgacggattc cgccaggaac 2513041 acagccatgt cggcggcggg ttgccgtcct atccgcaccc gcggctcatg cccgacttct 2513101 gggaattccc caccgtgtcg atgggtttgg gcccgctcaa cgccatctac caggcacggt 2513161 tcaaccacta tctgcatgac cgcggtatca aagacacctc cgatcaacac gtgtggtgtt 2513221 ttttgggcga cggcgagatg gacgaacccg agagccgtgg gctggcccac gtcggcgcgc 2513281 tggaaggctt ggacaacttg accttcgtga tcaactgcaa tctgcagcga ctcgacggcc 2513341 cggtgcgcgg caacggcaag atcatccagg agctggagtc gttcttccgc ggtgccggct 2513401 ggaacgtcat caaggtggtg tggggccgcg aatgggatgc cctgctgcac gccgaccgcg 2513461 acggtgcgct ggtgaattta atgaatacaa cacccgatgg cgattaccag acctataagg 2513521 ccaacgacgg cggctacgtg cgtgaccact tcttcggccg cgacccacgc accaaggcgc 2513581 tggtggagaa catgagcgac caggatatct ggaacctcaa acggggcggc cacgattacc 2513641 gcaaggttta cgccgcctac cgcgccgccg tcgaccacaa gggacagccg acggtgatcc 2513701 tggccaagac catcaaaggc tacgcgctgg gcaagcattt cgaaggacgc aatgccaccc 2513761 accagatgaa aaaactgacc ctggaagacc ttaaggagtt tcgtgacacg cagcggattc 2513821 cggtcagcga cgcccagctt gaagagaatc cgtacctgcc gccctactac caccccggcc 2513881 tcaacgcccc ggagattcgt tacatgctcg accggcgccg ggccctcggg ggctttgttc 2513941 ccgagcgcag gaccaagtcc aaagcgctga ccctgccggg tcgcgacatc tacgcgccgc 2514001 tgaaaaaggg ctctgggcac caggaggtgg ccaccaccat ggcgacggtg cgcacgttca 2514061 aagaagtgtt gcgcgacaag cagatcgggc cgcggatagt cccgatcatt cccgacgagg 2514121 cccgcacctt cgggatggac tcctggttcc cgtcgctaaa gatctataac cgcaatggcc 2514181 agctgtatac cgcggttgac gccgacctga tgctggccta caaggagagc gaagtcgggc 2514241 agatcctgca cgagggcatc aacgaagccg ggtcggtggg ctcgttcatc gcggccggca 2514301 cctcgtatgc gacgcacaac gaaccgatga tccccattta catcttctac tcgatgttcg 2514361 gcttccagcg caccggcgat agcttctggg ccgcggccga ccagatggct cgagggttcg 2514421 tgctcggggc caccgccggg cgcaccaccc tgaccggtga gggcctgcaa cacgccgacg 2514481 gtcactcgtt gctgctggcc gccaccaacc cggcggtggt tgcctacgac ccggccttcg 2514541 cctacgaaat cgcctacatc gtggaaagcg gactggccag gatgtgcggg gagaacccgg 2514601 agaacatctt cttctacatc accgtctaca acgagccgta cgtgcagccg ccggagccgg 2514661 agaacttcga tcccgagggc gtgctgcggg gtatctaccg ctatcacgcg gccaccgagc 2514721 aacgcaccaa caaggcgcag atcctggcct ccggggtagc gatgcccgcg gcgctgcggg 2514781 cagcacagat gctggccgcc gagtgggatg tcgccgccga cgtgtggtcg gtgaccagtt 2514841 ggggcgagct aaaccgcgac ggggtggcca tcgagaccga gaagctccgc caccccgatc 2514901 ggccggcggg cgtgccctac gtgacgagag cgctggagaa tgctcggggc ccggtgatcg 2514961 cggtgtcgga ctggatgcgc gcggtccccg agcagatccg accgtgggtg ccgggcacat 2515021 acctcacgtt gggcaccgac gggttcggct tttccgacac tcggcccgcc gctcgccgct 2515081 acttcaacac cgacgccgaa tcccaggtgg tcgcggtttt ggaggcgttg gcgggcgacg 2515141 gcgagatcga cccatcggtg ccggtcgcgg ccgcccgcca gtaccggatc gacgacgtgg 2515201 cggctgcgcc cgagcagacc acggatcccg gtcccggggc ctaacgccgg cgagccgacc 2515261 gcctttggcc gaatcttcca gaaatctggc gtagctttta ggagtgaacg acaatcagtt 2515321 ggctccagtt gcccgcccga ggtcgccgct cgaactgctg gacactgtgc ccgattcgct 2515381 gctgcggcgg ttgaagcagt actcgggccg gctggccacc gaggcagttt cggccatgca 2515441 agaacggttg ccgttcttcg ccgacctaga agcgtcccag cgcgccagcg tggcgctggt 2515501 ggtgcagacg gccgtggtca acttcgtcga atggatgcac gacccgcaca gtgacgtcgg 2515561 ctataccgcg caggcattcg agctggtgcc ccaggatctg acgcgacgga tcgcgctgcg 2515621 ccagaccgtg gacatggtgc gggtcaccat ggagttcttc gaagaagtcg tgcccctgct 2515681 cgcccgttcc gaagagcagt tgaccgccct cacggtgggc attttgaaat acagccgcga 2515741 cctggcattc accgccgcca cggcctacgc cgatgcggcc gaggcacgag gcacctggga 2515801 cagccggatg gaggccagcg tggtggacgc ggtggtacgc ggcgacaccg gtcccgagct 2515861 gctgtcccgg gcggccgcgc tgaattggga caccaccgcg ccggcgaccg tactggtggg 2515921 aactccggcg cccggtccaa atggctccaa cagcgacggc gacagcgagc gggccagcca 2515981 ggatgtccgc gacaccgcgg ctcgccacgg ccgcgctgcg ctgaccgacg tgcacggcac 2516041 ctggctggtg gcgatcgtct ccggccagct gtcgccaacc gagaagttcc tcaaagacct 2516101 gctggcagca ttcgccgacg ccccggtggt catcggcccc acggcgccca tgctgaccgc 2516161 ggcgcaccgc agcgctagcg aggcgatctc cgggatgaac gccgtcgccg gctggcgcgg 2516221 agcgccgcgg cccgtgctgg ctagggaact tttgcccgaa cgcgccctga tgggcgacgc 2516281 ctcggcgatc gtggccctgc ataccgacgt gatgcggccc ctagccgatg ccggaccgac 2516341 gctcatcgag acgctagacg catatctgga ttgtggcggc gcgattgaag cttgtgccag 2516401 aaagttgttc gttcatccaa acacagtgcg gtaccggctc aagcggatca ccgacttcac 2516461 cgggcgcgat cccacccagc cacgcgatgc ctatgtcctt cgggtggcgg ccaccgtggg 2516521 tcaactcaac tatccgacgc cgcactgaag catcgacagc aatgccgtgt catagattcc 2516581 ctcgccggtc agagggggtc cagcaggggc cccggaaaga taccaggggc gccgtcggac 2516641 ggaaagtgat ccagacaaca ggtcgcggga cgatctcaaa aacatagctt acaggcccgt 2516701 tttgttggtt atatacaaaa acctaagacg aggttcataa tctgttacac cgcgcaaaac 2516761 cgtcttcaca gtgttctctt agacacgtga ttgcgttgct cgcacccgga cagggttcgc 2516821 aaaccgaggg aatgttgtcg ccgtggcttc agctgcccgg cgcagcggac cagatcgcgg 2516881 cgtggtcgaa agccgctgat ctagatcttg cccggctggg caccaccgcc tcgaccgagg 2516941 agatcaccga caccgcggtc gcccagccat tgatcgtcgc cgcgactctg ctggcccacc 2517001 aggaactggc gcgccgatgc gtgctcgccg gcaaggacgt catcgtggcc ggccactccg 2517061 tcggcgaaat cgcggcctac gcaatcgccg gtgtgatagc cgccgacgac gccgtcgcgc 2517121 tggccgccac ccgcggcgcc gagatggcca aggcctgcgc caccgagccg accggcatgt 2517181 ctgcggtgct cggcggcgac gagaccgagg tgctgagtcg cctcgagcag ctcgacttgg 2517241 tcccggcaaa ccgcaacgcc gccggccaga tcgtcgctgc cggccggctg accgcgttgg 2517301 agaagctcgc cgaagacccg ccggccaagg cgcgggtgcg tgcactgggt gtcgccggag 2517361 cgttccacac cgagttcatg gcgcccgcac ttgacggctt tgcggcggcc gcggccaaca 2517421 tcgcaaccgc cgaccccacc gccacgctgc tgtccaaccg cgacgggaag ccggtgacat 2517481 ccgcggccgc ggcgatggac accctggtct cccagctcac ccaaccggtg cgatgggacc 2517541 tgtgcaccgc gacgctgcgc gaacacacag tcacggcgat cgtggagttc ccccccgcgg 2517601 gcacgcttag cggtatcgcc aaacgcgaac ttcggggggt tccggcacgc gccgtcaagt 2517661 cacccgcaga cctggacgag ctggcaaacc tataaccgcg gactcggcca gaacaaccac 2517721 atacccgtca gttcgatttg tacacaacat attacgaagg gaagcatgct gtgcctgtca 2517781 ctcaggaaga aatcattgcc ggtatcgccg agatcatcga agaggtaacc ggtatcgagc 2517841 cgtccgagat caccccggag aagtcgttcg tcgacgacct ggacatcgac tcgctgtcga 2517901 tggtcgagat cgccgtgcag accgaggaca agtacggcgt caagatcccc gacgaggacc 2517961 tcgccggtct gcgtaccgtc ggtgacgttg tcgcctacat ccagaagctc gaggaagaaa 2518021 acccggaggc ggctcaggcg ttgcgcgcga agattgagtc ggagaacccc gatgccgttg 2518081 ccaacgttca ggcgaggctt gaggccgagt ccaagtgagt cagccttcca ccgctaatgg 2518141 cggtttcccc agcgttgtgg tgaccgccgt cacagcgacg acgtcgatct cgccggacat 2518201 cgagagcacg tggaagggtc tgttggccgg cgagagcggc atccacgcac tcgaagacga 2518261 gttcgtcacc aagtgggatc tagcggtcaa gatcggcggt cacctcaagg atccggtcga 2518321 cagccacatg ggccgactcg acatgcgacg catgtcgtac gtccagcgga tgggcaagtt 2518381 gctgggcgga cagctatggg agtccgccgg cagcccggag gtcgatccag accggttcgc 2518441 cgttgttgtc ggcaccggtc taggtggagc cgagaggatt gtcgagagct acgacctgat 2518501 gaatgcgggc ggcccccgga aggtgtcccc gctggccgtt cagatgatca tgcccaacgg 2518561 tgccgcggcg gtgatcggtc tgcagcttgg ggcccgcgcc ggggtgatga ccccggtgtc 2518621 ggcctgttcg tcgggctcgg aagcgatcgc ccacgcgtgg cgtcagatcg tgatgggcga 2518681 cgccgacgtc gccgtctgcg gcggtgtcga aggacccatc gaggcgctgc ccatcgcggc 2518741 gttctccatg atgcgggcca tgtcgacccg caacgacgag cctgagcggg cctcccggcc 2518801 gttcgacaag gaccgcgacg gctttgtgtt cggcgaggcc ggtgcgctga tgctcatcga 2518861 gacggaggag cacgccaaag cccgtggcgc caagccgttg gcccgattgc tgggtgccgg 2518921 tatcacctcg gacgcctttc atatggtggc gcccgcggcc gatggtgttc gtgccggtag 2518981 ggcgatgact cgctcgctgg agctggccgg gttgtcgccg gcggacatcg accacgtcaa 2519041 cgcgcacggc acggcgacgc ctatcggcga cgccgcggag gccaacgcca tccgcgtcgc 2519101 cggttgtgat caggccgcgg tgtacgcgcc gaagtctgcg ctgggccact cgatcggcgc 2519161 ggtcggtgcg ctcgagtcgg tgctcacggt gctgacgctg cgcgacggcg tcatcccgcc 2519221 gaccctgaac tacgagacac ccgatcccga gatcgacctt gacgtcgtcg ccggcgaacc 2519281 gcgctatggc gattaccgct acgcagtcaa caactcgttc gggttcggcg gccacaatgt 2519341 ggcgcttgcc ttcgggcgtt actgaagcac gacatcgcgg gtcgcgaggc ccgaggtggg 2519401 ggtccccccg cttgcggggg cgagtcggac cgatatggaa ggaacgttcg caagaccaat 2519461 gacggagctg gttaccggga aagcctttcc ctacgtagtc gtcaccggca tcgccatgac 2519521 gaccgcgctc gcgaccgacg cggagactac gtggaagttg ttgctggacc gccaaagcgg 2519581 gatccgtacg ctcgatgacc cattcgtcga ggagttcgac ctgccagttc gcatcggcgg 2519641 acatctgctt gaggaattcg accaccagct gacgcggatc gaactgcgcc ggatgggata 2519701 cctgcagcgg atgtccaccg tgctgagccg gcgcctgtgg gaaaatgccg gctcacccga 2519761 ggtggacacc aatcgattga tggtgtccat cggcaccggc ctgggttcgg ccgaggaact 2519821 ggtcttcagt tacgacgata tgcgcgctcg cggaatgaag gcggtctcgc cgctgaccgt 2519881 gcagaagtac atgcccaacg gggccgccgc ggcggtcggg ttggaacggc acgccaaggc 2519941 cggggtgatg acgccggtat cggcgtgcgc atccggcgcc gaggccatcg cccgtgcgtg 2520001 gcagcagatt gtgctgggag aggccgatgc cgccatctgc ggcggcgtgg agaccaggat 2520061 cgaagcggtg cccatcgccg ggttcgctca gatgcgcatc gtgatgtcca ccaacaacga 2520121 cgaccccgcc ggtgcatgcc gcccattcga cagggaccgc gacggctttg tgttcggcga 2520181 gggcggcgcc cttctgttga tcgagaccga ggagcacgcc aaggcacgtg gcgccaacat 2520241 cctggcccgg atcatgggcg ccagcatcac ctccgatggc ttccacatgg tggccccgga 2520301 ccccaacggg gaacgcgccg ggcatgcgat tacgcgggcg attcagctgg cgggcctcgc 2520361 ccccggcgac atcgaccacg tcaatgcgca cgccaccggc acccaggtcg gcgacctggc 2520421 cgaaggcagg gccatcaaca acgccttggg cggcaaccga ccggcggtgt acgcccccaa 2520481 gtctgccctc ggccactcgg tgggcgcggt cggcgcggtc gaatcgatct tgacggtgct 2520541 cgcgttgcgc gatcaggtga tcccgccgac actgaatctg gtaaacctcg atcccgagat 2520601 cgatttggac gtggtggcgg gtgaaccgcg accgggcaat taccggtatg cgatcaataa 2520661 ctcgttcgga ttcggcggcc acaacgtggc aatcgccttc ggacggtact aaaccccagc 2520721 gttacgcgac aggagacctg cgatgacaat catggccccc gaggcggttg gcgagtcgct 2520781 cgacccccgc gatccgctgt tgcggctgag caacttcttc gacgacggca gcgtggaatt 2520841 gctgcacgag cgtgaccgct ccggagtgct ggccgcggcg ggcaccgtca acggtgtgcg 2520901 caccatcgcg ttctgcaccg acggcaccgt gatgggcggc gccatgggcg tcgaggggtg 2520961 cacgcacatc gtcaacgcct acgacactgc catcgaagac cagagtccca tcgtgggcat 2521021 ctggcattcg ggtggtgccc ggctggctga aggtgtgcgg gcgctgcacg cggtaggcca 2521081 ggtgttcgaa gccatgatcc gcgcgtccgg ctacatcccg cagatctcgg tggtcgtcgg 2521141 tttcgccgcc ggcggcgccg cctacggacc ggcgttgacc gacgtcgtcg tcatggcgcc 2521201 ggaaagccgg gtgttcgtca ccgggcccga cgtggtgcgc agcgtcaccg gcgaggacgt 2521261 cgacatggcc tcgctcggtg ggccggagac ccaccacaag aagtccgggg tgtgccacat 2521321 cgtcgccgac gacgaactcg atgcctacga ccgtgggcgc cggttggtcg gattgttctg 2521381 ccagcagggg catttcgatc gcagcaaggc cgaggccggt gacaccgaca tccacgcgct 2521441 gctgccggaa tcctcgcgac gtgcctacga cgtgcgtccg atcgtgacgg cgatcctcga 2521501 tgcggacaca ccgttcgacg agttccaggc caattgggcg ccgtcgatgg tggtcgggct 2521561 gggtcggctg tcgggtcgca cggtgggtgt actggccaac aacccgctac gcctgggcgg 2521621 ctgcctgaac tccgaaagcg cagagaaggc agcgcgtttc gtgcggctgt gcgacgcgtt 2521681 cgggattccg ctggtggtgg tggtcgatgt gccgggctat ctgcccggtg tcgaccagga 2521741 gtggggtggc gtggtgcgcc gtggcgccaa gttgctgcac gcgttcggcg agtgcaccgt 2521801 tccgcgggtc acgctggtca cccgaaagac ctacggcggg gcatacattg cgatgaactc 2521861 ccggtcgttg aacgcgacca aggtgttcgc ctggccggac gccgaggtcg cggtgatggg 2521921 cgctaaggcg gccgtcggca tcctgcacaa gaagaagttg gccgccgctc cggagcacga 2521981 acgcgaagcg ctgcacgacc agttggccgc cgagcatgag cgcatcgccg gcggggtcga 2522041 cagtgcgctg gacatcggtg tggtcgacga gaagatcgac ccggcgcata ctcgcagcaa 2522101 gctcaccgag gcgctggcgc aggctccggc acggcgcggc cgccacaaga acatcccgct 2522161 gtagttctga ccgcgagcag acgcagaatc gcacgcgcga ggtccgcgcc gtgcgattct 2522221 gcgtctgctc gccagttatc cccagcggtg gctggtcaac gcgaggcgct cctcgcatgc 2522281 tcggacggtg cctaccgacg cgctaacaat tctcgagaag gccggcgggt tcgccaccac 2522341 cgcgcaattg ctcacggtca tgacccgcca acagctcgac gtccaagtga aaaacggcgg 2522401 cctcgttcgc gtttggtacg gggtctacgc ggcacaagag ccggacctgt tgggccgctt 2522461 ggcggctctc gatgtgttca tgggggggca cgccgtcgcg tgtctgggca ccgccgccgc 2522521 gttgtatgga ttcgacacgg aaaacaccgt cgctatccat atgctcgatc ccggagtaag 2522581 gatgcggccc acggtcggtc tgatggtcca ccaacgcgtc ggtgcccggc tccaacgggt 2522641 gtcaggtcgt ctcgcgaccg cgcccgcatg gactgccgtg gaggtcgcac gacagttgcg 2522701 ccgcccgcgg gcgctggcca ccctcgacgc cgcactacgg tcaatgcgct gcgctcgcag 2522761 tgaaattgaa aacgccgttg ctgagcagcg aggccgccga ggcatcgtcg cggcgcgcga 2522821 actcttaccc ttcgccgacg gacgcgcgga atcggccatg gagagcgagg ctcggctcgt 2522881 catgatcgac cacgggctgc cgttgcccga acttcaatac ccgatacacg gccacggtgg 2522941 tgaaatgtgg cgagtcgact tcgcctggcc cgacatgcgt ctcgcggccg aatacgaaag 2523001 catcgagtgg cacgcgggac cggcggagat gctgcgcgac aagacacgct gggccaagct 2523061 ccaagagctc gggtggacga ttgtcccgat tgtcgtcgac gatgtcagac gcgaacccgg 2523121 ccgcctggcg gcccgcatcg cccgccacct cgaccgcgcg cgtatggccg gctgaccgct 2523181 ggtgagcaga cgcagagtcg cactgcggcc ggcgcagtgc gactctgcgt ctgctcgcgc 2523241 tcaacggctg aggaactcct tagccacggc gactacgcgc tcgcgatccc gtggcaccag 2523301 accgatccgg gtccggcggt cgaggatatc gtccacatcc agcgccccct catgggtcac 2523361 cgcgtattcg aactccgccc gggtcacgtc gatgccgtcg gcgaccggct cggtgggccg 2523421 ctcacatgtg gcggcggcag cgacgttggc cgcctcggcc ccgtaccgcg ccaccagcga 2523481 ctcgggcaat ccggcgcccg atccgggggc cggcccaggg ttcgccggtg cgccgatcag 2523541 cggcaggttg cgagtgcggc acttcgcggc tcgcaggtgt cgcagcgtga tggcgcgatt 2523601 cagcacatcc tctgccatgt agcggtattc cgtcagcttg ccgccgacca cactgatcac 2523661 gcccgacggc gattcaaaaa cagcgtggtc acgcgaaacg tcggcggtgc ggccctggac 2523721 accagcaccg ccggtgtcga ttagcggccg caatcccgca taggcaccga tgacatcctt 2523781 ggtgccgacc gccgtcccca atgcggtgtt caccgtatcc agcaggaacg tgatctcttc 2523841 cgaagacggt tgtggcacat cgggaatcgg gccgggtgcg tcttcgtcgg tcagcccgag 2523901 atagatccgg cccagctgct cgggcatggc gaacacgaag cggttcagct caccggggat 2523961 cggaatggtc agcgcggcag tcggattggc aaacgacttc gcgtcgaaga ccagatgtgt 2524021 gccgcggctg gggcgtagcc tcagggacgg gtcgatctca cccgcccaca cgcccgccgc 2524081 gttgatgacg gcacgcgccg acagcgcgaa cgactgccgg gtgcgccggt cggtcaactc 2524141 caccgaagtg ccggtgacat tcgacgcgcc cacgtaagtg aggatgcggg cgccgtgctg 2524201 ggccgcggtg cgcgcgacgg ccatgaccag ccgggcgtcg tcgatcaatt gcccgtcgta 2524261 cgcgagcaga ccaccgtcga ggccgtcccg ccgaacggtg ggagcaatct ccaccacccg 2524321 tgacgccggg attcggcgcg atcggggcaa cgtcgccgcc ggcgtacccg ctagcacccg 2524381 caaagcgtcg ccggccagga aaccggcacg caccaacgcc cgcttggtgt gacccatcga 2524441 cggcaacaac gggaccagtt gcggcatggc atgcacgaga tgaggagcgt tgcgtgtcat 2524501 caggattccg cgttcgacgg cgctgcgccg ggcgatgccc acgttgccgc tggccagata 2524561 gcgcagaccg ccgtgcacca acttcgagct ccagcggctg gtgccgaacg ccagatcatg 2524621 cttttccacc aaggccaccg tcagaccgcg ggtggcagca tctaaggcaa tgccaacacc 2524681 ggtaatgccg ccgcctatca cgatgacgtc gagtgcgcca ccgtcggcca gtgcggtcag 2524741 gtcggcggag cgacgcgccg cgttgagtgc agccgagtgg ggcatcagca caaatatccg 2524801 ttcagtgcgt gggtaagttc ggtggccagc gcggcggaat cgaggatcga atcgacgatg 2524861 tccgcggact ggatggtcga ctgggcgatc agcaacacca tggtcgccag tcgacgagcg 2524921 tcgccggagc gcacactgcc cgaccgctgc gccactgtca gccgggcggc caacccctcg 2524981 atcaggacct gctggctggt gccgaggcgc tcggtgatgt acaccctggc cagctccgag 2525041 tgcatgaccg acatgatcag atcgtcaccc cgcaaccggt cggccaccgc gacaatctgc 2525101 tttaccaacg cttcccggtc gtccccgtcg aggggcacct cccgcagcac gtcggcgata 2525161 tggctggtca gcatggacgc catgatcgac cgggtgtccg gccagcgacg gtatacggtc 2525221 gggcggctca cgcccgcgcg ccgggcgatc tcggcaagtg tcacccggtc cacgccgtaa 2525281 tcgacgacgc agctcgccgc tgcccgcagg atacgaccac cggtatccgc gcggtcatta 2525341 ctcattgaca gcatgtgtaa tactgtaacg cgtgactcac cgcgaggaac tccttccacc 2525401 gatgaaatgg gacgcgtggg gagatcccgc cgcggccaag ccactttctg atggcgtccg 2525461 gtcgttgctg aagcaggttg tgggcctagc ggactcggag cagcccgaac tcgaccccgc 2525521 gcaggtgcag ctgcgcccgt ccgccctgtc gggggcagac cacgatgcgc tggcgcgcat 2525581 cgtcggcacc gagtatttcc gcaccgccga tcgcgaccgg ctgctgcacg ccggcggcaa 2525641 gtccacccca gacctgctgc ggcgcaaaga caccggtgtc caggatgcgc ccgacgcggt 2525701 gttgctgccc ggcggcccca acgggggagg acgccgtcgc cgacatcttg cactactgct 2525761 ccgaccacgg cattgccgtg gtcccgtttg gtggcggcac cagcgtcgtt ggtgggcttg 2525821 accccgttcg caacgacttt cgcgcggtga tctccctgga tatgcggcgc ttcgaccggc 2525881 tgcaccggat cgatgaggtg tccggcgagg ccgaactgga ggccggtgtc accgggccgg 2525941 aagccgaacg tctgctcggc gaacatggct tctcgctcgg gcacttcccg cagagcttcg 2526001 agttcgccac catcgggggg ttcgcggcca cccgctcgtc aggccaggac tcggctggct 2526061 atggccggtt caacgacatg attcttgggc tgcgcatgat cactccggtg ggggtgctgg 2526121 atctgggtcg agtgccggcg tcggcggccg gcccggacct gcgccagctg gcgatcggct 2526181 ccgaaggcgt cttcggcgtc atcacccggg tgcggctgcg ggtgcaccgg attccggaat 2526241 cgacgcgtta cgaggcgtgg tcgtttcccg atttcgcgac cggggttgcg gcgctgcgca 2526301 ccatcaccca aaccggcacc ggccccaccg tcgttcggct ctctgacgag gccgaaaccg 2526361 gcgtcaacct cgccaccacc gaggcgatcg gggaaaccca aatcaccggc ggctgtttgg 2526421 ggatcaccgt gttcgagggc acccaggaac acaccgagag caggcacgcc gagacgcgcg 2526481 cgttgctggc ggcccgaggc ggcacctcgt tgggcgaagg accggcgcgg gcctgggaac 2526541 gcggcaggtt cgccgcgccg tatctgcgtg actccctgtt ggccgcggga gcgctctgcg 2526601 agaccctcga gaccgccacg gtgtggtcca acacccccgt gctgaaggcc gccgtgaccg 2526661 aagcgctcac cacctcgctg gccgcatcgg gtacaccggc gctggtgatg tgccacgtgt 2526721 cgcacgtgta tcccaccggc gcgtcgttgt acttcaccgt tgtcgccggg cagcgaggcg 2526781 atccgatcga gcagtggctg gccgccaaga aggcggcgtc ggatgcgatc atggccaccg 2526841 gaggaacgat cacgcaccac catgcggttg gttccgacca ccgcccctgg atgcgcgcgg 2526901 aggtgggtga tctgggcgtg acattgttgc gcacgatcaa ggcgacgctg gatccggccg 2526961 gaattctcaa ccctggcaag ctgattccat gagcgccggg cagctgcgcc ggcatgagat 2527021 cggcaaggtc accgcgctga ccaatcccct gtcaggccat ggcgccgccg taaaggctgc 2527081 acacggcgcg atcgcccggc tgaagcatcg gggggtggac gtcgtcgaga tcgtcggcgg 2527141 ggacgcccac gacgcacgcc atctgctcgc cgcggcagtc gcaaaaggca ctgacgcggt 2527201 gatggtgacc ggcggtgacg gagtcgtctc caacgcgcta caggtcttgg cgggcaccga 2527261 cattccgtta ggaatcattc cggccggcac tggtaacgac cacgcacgcg aattcgggct 2527321 tcccacaaag aatcccaagg cagccgcaga tatcgttgtt gacggctgga cggaaaccat 2527381 tgacctgggc cggattcaag acgacaacgg tatcgaaaag tggttcggta ccgtggcggc 2527441 taccggattc gactccctgg tcaacgatcg cgccaaccga atgcgctggc cacacgggcg 2527501 gatgcgctat tacatcgcga tgctcgccga actgtcgcgg ctgcggccgt tgccgttccg 2527561 gctggtgctc gacggcaccg aagagatcgt cgccgacctc acacttgccg acttcggcaa 2527621 tacccgcagc tacggcggcg gattattgat ctgccccaac gccgaccact cggacggcct 2527681 gctcgacatc accatggccc agtcggattc ccgtaccaag ttgctccgcc tgttccccac 2527741 cattttcaaa ggcgcccatg tcgagcttga cgaggtgagc accacacgag ccaagacagt 2527801 ccacgtcgag tgccccggta tcaacgtcta tgccgacggc gacttcgcct gcccgttacc 2527861 agccgagatc tccgcggtgc cggccgccct tcaggttctt cgcccccgcc acggataagc 2527921 gggtggtaac gactcggtcg taaagcgcga catccttcca aacccgctgt acgggaggaa 2527981 cagatgtccg gacaccgcaa gaaggcaatg ctcgccttgg cggctgcgtc gctggcagcg 2528041 acgctggccc cgaacgcagt cgcggccgca gaaccgtcgt ggaacgggca gtacctcgtg 2528101 acgttgtctg ccaacgcgaa aaccggcacc agcatggcgg ccaaccggcc agagtatcca 2528161 cacaaagcga actacacgtt cagctcgcgc tgcgcgtccg atgtctgcat tgccaccgtg 2528221 gtcgacgctc cgccaccaaa aaacgagttc atcccgcggc caatcgaata cacctggaat 2528281 gggactcaat gggtacggga gatcagctgg caatgggact gcctgctacc cgacggcaca 2528341 atcgaatatg ccccagccaa atcgatcacg gcctacacgc ccggtcagta cggaatcctc 2528401 accggcgtct ttcataccga tatcgccagc ggcacgtgta aaggcaatgt cgacatgcca 2528461 gtgtcggcca aaccgatcgt tggctgacgt tgccagccct gccgagcatg ggcggcacat 2528521 cacgcaaacg catggacgac cagcacagcc ccgaatgcgg cgataacggc gttgccggca 2528581 aggactgtca tccgacggac gcgggcggtc gcccgggacc tgagaaacgc tcccgccgag 2528641 acaagcagca actgccagag caacgacgcg agcgctaccc cgaccacaac agcgatcgcg 2528701 gtcgttgcgc gcaacgcgcg cgccagcgtc acggctactg cggtgaagta cacgaacgtg 2528761 gccgggttga tcgccgttag gccgaagatc aacgcaaacc gaacacagcc cagctgtttt 2528821 tgtggggccg gaaccggctc cggcgatgga cgcaacccgt gcccgattcc catcgcagcg 2528881 atgaccagca gcacgatcgc accgacgatt tcgggccaaa ccctcaacac gttgatcgtc 2528941 ggtgccgcaa ctgtctccaa atcgcggtag cgcaatacgc tacgtcgaca agggcgaccg 2529001 ccgcggcggc cggtattcca cgacgccagc cgcgctcaac acctgcttgc cgaggaaagc 2529061 gttcccggtg gcggcatcga ggcgtttgtc gatgtgcgac ttcgaccggc ggaggacagc 2529121 gacgactttc tggcatggtc gagcacggac accacgatcg acgatgccgt ccacgtcacc 2529181 ggaccctacg actacctgct acacattcgg gtctgcgaca cagcggacct ggaccgcctg 2529241 ttacgcaggc tcaagacctc cgcggaagct gcgcaaaccc aaacgcgcat tgcgctcagg 2529301 tcccggcgtt gacaccgcgc cagcaggcgc caccaaaccc ttagccaact ccccgactca 2529361 gccaagtcac ctcgccggcg tcgccgccgt cacgatacac ctcgagcgcc tggtcccagg 2529421 ccgttcccag caccgaatcc agttcggcgg ccagtgtgtc cgcaccttgg gccatcatcg 2529481 cccgcagccg catctccccc accatgatgt cgccgttggc gctcatcgcc ccgctccaca 2529541 gacccagttg cggggtgtga ctgaatcgct ggccgtcgac tccagggcta gggtcttcgg 2529601 tgacctcgaa ccgcagcacc gaccaggaac gcaaggcgtt ggctagtcgc gccccggtgc 2529661 ccaccggccc gacccaatta gtgaccgcac gcaactgcgg cggcagggcc ggttgcggcg 2529721 tccagaccag gttcgccttg gcctgtaggg tcgacgacaa cgcccactcg acatgcgggc 2529781 acaccgccgc gggcgaggcg tggatgtaca ccacaccgga cgtcacgtcg gcgaattggt 2529841 tcgacgctcg catctgctgc tccttcggtt ccacgaggga cgtcttcccc aacgacctgg 2529901 tgaacccgac aagcaggatg cctgctgtga aatttcgaat ttttgtgtcg tgcgtttcta 2529961 ttgtgccttg tgatacccgt gttgcgctag tgtgcggttc tgcctaggtg tactcggcta 2530021 gaaccgcgtc ggaaatcgcg ggccacaagt ccaacgccca gtcgccgaaa tcgcgggccg 2530081 tgaggaccac cagagccagg tccgccttgg gatccaccca gatgaaaccg cctgattggc 2530141 cgaaatggcc gaatgtccgc gtcgagttgc actcgccggt ccagtggggc gatttcgaat 2530201 tcctgatctc aaagcccagc ccccagtcat tgggccgctg cacaccgtac ccgggcagta 2530261 caccgtccag gccgggaaac tgcaccgtgg tcgcgtcggc atgcatctgc gccgagaccg 2530321 tcgatggacg cagcagatca cccgcgaaca ccgccaagtc cgcgaccgtc gaggtcgccc 2530381 cgaacccggc ggcagcgggg cccccgtcca gccgggtggt caccatgccc aggggttcgc 2530441 acaccgcctc ggtcaggtag cgcccgaact cgatccccga ctcccgctgc acgctctcgg 2530501 ccagcacggt gaaaccgtag ttcgaataca tccggcgggt gccggggcgg gccagcgcct 2530561 gatcggaatg catcgccaac cccgatgtgt gcgccagcag gtgacggacc gtggagccgg 2530621 gcgggcctgc cggggtgtcg agattcacca ccccctcctc aacggcgacc tgtgcggctc 2530681 gggccaccag cggcttggtg accgacgcca gcgcgaacac ccgcgcggta tcgccgtggg 2530741 tggctagcac ccctgcgggt ccgatcaccg cggcggccgc agccgggacc ggccagccac 2530801 caagcacttc gagagcggtc atcgactccg gcgcgtcact tccgggcgat gtagtagttg 2530861 ttcaacacgt ccgactcgat ctcggccacc gtcacgtcgg taaaaccggc gtcggcgagc 2530921 atcgaggtgg ccaactgcct gccccacacc gtccccaacc cggccccgtc aagcgccagc 2530981 gacaccgtca tgcagtgcat tagcgaggtc gtgtacaggt aggtgctcag cggaacgccg 2531041 acattgtctt ccagttgact cgatgccttg atgtcgacca tcagcagcac accaccgggt 2531101 cgcagcgcac gatagatgtt ctgcaggacg cgcgccggct gcgcctggtc gtgaatcgcg 2531161 tcgaacacgg tgatcacgtc gtaggccccc accttgtcca gctctgccag gtcatggcgc 2531221 tcgaaggtcg cgtttgccag gcccaaccga gccgcctcct cggtccccgc cgcaacggcc 2531281 tcgtcggaaa agtcgatgcc ggtgaatcgg ctcgcgccga acgcctgcgc catcagcttg 2531341 accgcgcgac cactgccgca accgaaatcg gccacgtcgg ctccggaccg caagcggtcc 2531401 ggaaggccgt cgaccagcgg gagcaccacg tcgatcaagg cggcatcgaa caccatgccg 2531461 ctcatctcgg ccatcagctt gtggaagcgc gggtattcgc tgtagggcac accgccgcct 2531521 tcccggaagc agcgaatgac cttttgttcg acctcgccga gcagcgaaac gaactgtgct 2531581 atcacggcga ggttgtccgg cccggccgca cgggtcagca tgccggcgcg gtgggcaggc 2531641 agcgagtagg tcgagctccc cgcgtcgtat tcgacgatct gcccggtggt catgccgcct 2531701 agccactccc gaacgtagcg ctcttccaac cccgcagcct cagcgatctc catgctggtg 2531761 gctggcggaa gtccggccat ggtgtccagc agcccggtct ggtgtccaac gctcaccagg 2531821 atcgccaaac cggcgctgtc gatggccgca acaaaacggt tgccgaattc ttcggtggtc 2531881 tcgagtgctc cgctcatctg cgccgctcct cctcatcgct tcgctctgca tcgtcaccgg 2531941 cgcgactcat ctgcgccgct cctcctcatc gcttcgctct gcatcgtcac cggcgcgact 2532001 catctgcgcc gctcctgctc atcgcttcgc tctgcatcgt caccggcgcg actcatctgc 2532061 gccgctcctg ctcatcgctt cgctctgcat cgtcaccggc gcgactcatc tgcgccgctc 2532121 ctgctcatcg cttcgctctg catcgtcacc ggcgcgactc atctgcgccg ctcctcctca 2532181 tcgcttcgct ctgcatcgtc accggcgcgc atggtcagcg acgctacacc gtaggttgga 2532241 caccatgagt cagacggtgc gcggtgtgat cgcacgacaa aagggcgaac ccgttgagct 2532301 ggtgaacatt gtcgtcccgg atcccggacc cggcgaggcc gtggtcgacg tcaccgcctg 2532361 cggggtatgc cataccgacc tgacctaccg cgagggcggc atcaacgacg aatacccttt 2532421 tctgctcgga cacgaggccg cgggcatcat cgaggccgtc gggccgggtg taaccgcagt 2532481 cgagcccggc gacttcgtga tcctgaactg gcgtgccgtg tgcggccagt gccgggcctg 2532541 caaacgcgga cggccccgct actgcttcga cacctttaac gccgaacaga agatgacgct 2532601 gaccgacggc accgagctca ctgcggcgtt gggcatcggg gcctttgccg ataagacgct 2532661 ggtgcactct ggccagtgca cgaaggtcga tccggctgcc gatcccgcgg tggccggcct 2532721 gctgggttgc ggggtcatgg ccggcctggg cgccgcgatc aacaccggcg gggtaacccg 2532781 cgacgacacc gtcgcggtga tcggctgcgg cggcgttggc gatgccgcga tcgccggtgc 2532841 cgcgctggtc ggcgccaaac ggatcatcgc ggtcgacacc gatgacacga agcttgactg 2532901 ggcccgcacc ttcggcgcca cccacaccgt caacgcccgc gaagtcgacg tcgtccaggc 2532961 catcggcggc ctcacggatg gattcggcgc ggacgtggtg atcgacgccg tcggccgacc 2533021 ggaaacctac cagcaggcct tctacgcccg cgatctcgcc ggaaccgttg tgctggtggg 2533081 tgttccgacg cccgacatgc gcctggacat gccgctggtc gacttcttct ctcacggcgg 2533141 tgcgctgaag tcgtcgtggt acggcgattg cctgcccgaa agcgacttcc ccacgctgat 2533201 cgacctttac ctgcagggcc ggctgccgct gcagcggttc gtttccgaac gcatcgggct 2533261 cgaagacgtc gaggaggcgt tccacaagat gcatggcggc aaggtattgc gttcggtggt 2533321 gatgttgtga tggccgccat cgagcgcgtc atcacccacg gcaccttcga actcgatggc 2533381 ggcagttggg aagtcgacaa caacatctgg ctggtcggcg acgactccga ggtggtggtt 2533441 ttcgacgccg cccaccacgc ggctcctatc atcgacgccg tcggcggccg caaggtggtt 2533501 gcggtgatct gcacgcacgg ccacaacgac cacgtgacgg tggcccccga actgggcacg 2533561 gcgcttgacg caccggtgct gatgcatccc ggcgacgccg tgctgtggcg aatgactcac 2533621 ccggacaaaa gctttcgcgc cgtttcagac ggtgatgcgg tgcgggttgg cgggacggag 2533681 ttgcgtgcgc tgcacacccc ggggcactcc cctggatcgg tgtgctggta tgcgccagag 2533741 ctgggtcccg gaacaggcac cgtgttcagc ggagacacgc tgttcgctgg cgggccgggt 2533801 gcaaccggcc gctcgtattc cgacttcccc acgatcctgc ggtcgatatc cggacggctc 2533861 ggcgcattac cgggcgacac cgtcgtgcac accggccacg gcgacagcac caccatcggc 2533921 gacgagatcg tccactacga ggaatgggtg gcccgtgggc attgatcccg cgggcgcgcg 2533981 cagaatgccg gtcgtagcgg cgtgtcggtg tacaagcacc gcgcggtcca tgagccgagc 2534041 gctacttatc cgcgcaatct gacactcgag ccaagctgcg gcgcagaaac accgcaaagc 2534101 cggcacccat gaccacaaat gccgtcactg gcacccagtc acccaaccga aggtagagcg 2534161 tgacattcga tgccaacgga acgttcacca cgatggcacc gttgaattcc gccgagcacc 2534221 aggccagccg acggccccgg gtatcaaagg ccgagctgtc gcccgacaag ctggcgtgca 2534281 ccgctgggat gccggcttcg acggcgcgca ccgcgggctg ggcggccaac tgcggctgcg 2534341 cccaactccc ttggaacgtc gaggtggaac tctgatacac cagcagcgcc gccccgagcc 2534401 gcgcggcgtg ccgggtcaga tcggagaagg tcatctcgta gctgatcaac ggggcgatat 2534461 gcaaggagtt caccgccaac accaccggcc cggcgccgcg ctgccgatcc tttgcggcgg 2534521 ccttgctgta gcgggtgatc cagccgaaaa gcgggcgcag cggagcacat attcgccaaa 2534581 cggaaccaac cgggtcttcc ggtagctgcc cacagcttcg tgcgcgccga caagcaccgc 2534641 cgacttgtag attcccccgt ccggtgccgg ggcgtcgacg ttgaccaaca aatccgcgcc 2534701 cacccgctgt gacagctcgg ccaggcgagc caggacgtca ggatggcggg tgaggtcttg 2534761 tccgacgctg ctttcccccc agaccaccaa gtccggccgc tggtccgcaa cggccgcggt 2534821 gaactcttca ccggccgcca gtcgagccgc cgcatcggct atgtcgccgg cctgtaccag 2534881 cgccacgcgc accgtcggac cgccgaccgg caccgagccc agcaggtagg aagccgggcc 2534941 gagtcccgca cacccaatca cgcatcccag cgcgaccagc cggccgcccg ttgcccggca 2535001 cacgagcacg ctcgcgatgg cggtattggt cgcaaccaga agaaaacttg tcagccacac 2535061 cccacccagc gacgccgacg ctagcgtcac gggctggctc cattgcgatg cacccagcaa 2535121 cgcccacgga ccgcccagcg attgccagga ccgcaccgct tcggctgcca cccacgcgct 2535181 gggcaccacg accagggcgg caccgacgcg gcatgtggtc accggtaccg acaacagccg 2535241 gtgcgccaac cacccggccg gcagccacag cacacccagg ccggcggcca acagcaccag 2535301 catcggacca gcactggtca ccagccagta ctgggttgcc agcacaaatc cgcccatacc 2535361 cgtccacgcc cgcagcgcgc cctcccacga cgtcggcgcg gcccgcacca ctaacagcag 2535421 tgggaccaag ccgaaccagg ccagccacca ccaagacggc gcgggaaagg ccagcgcggg 2535481 taacccgccg aacaccaacg ctgccgcaca accaatgacc ggttgtcgcc gggctcccgc 2535541 gcgcaacgcc atgccgatca gcatgccggc cacattcgcc tgcgtcgagg aaaagagcag 2535601 actaagaccg gcagtccccg ccagaaaggg agtgatttgc atggccaagg atctggtcgc 2535661 cacggtgccc gatctttccg ggaagctggc aatcatcacc ggcgccaaca gcggtctagg 2535721 cttcgggctg gcccggcggc tgtcggcggc tggcgccgac gtaatcatgg cgatccgcaa 2535781 tcgcgccaag ggcgaggcgg cggtcgagga aatccggacc gcggttccgg atgcgaagct 2535841 gaccatcaag gccctcgacc tgtcatcgtt ggcgtccgtc gccgcgttgg gggaacagct 2535901 catggctgac gggcggccga tcgacctgct gatcaacaac gccggcgtca tgaccccacc 2535961 ggaacgcgtt accactgccg acggcttcga attgcagttc ggcagcaacc atctcggaca 2536021 cttcgcgcta accgcacacc tgctgccgct gttgcgcgcg gcacagcgcg cgagggtcgt 2536081 ctcgttgagc agcttggcgg cccgccgcgg ccgcatccac ttcgacgacc tacagttcga 2536141 gaggtcgtac gccccgatga cggcctatgg ccagtcgaag ctggcggtct tgatgttcgc 2536201 ccgcgagctg gaccgccgca gccgcgcggc cggctggggc atcatctcca atgccgcgca 2536261 tcctggcttg accaagacca acctgcagat cgcgggaccg tcccatggcc gcgacaagcc 2536321 ggcgctgatg gaacgcttgt acaagacgtc ctggcgtttc gcaccgttcc tctggcagga 2536381 gatcgaagag gggatcttgc ccgcgctgta tgcagccgcc accccgcaag ccgacggtgg 2536441 cgcgttctat ggcccccgcg gccgctacga ggtcgccggc ggtggtgtgc gagaggccaa 2536501 ggttcccgca gccgcccgca acgacgccga tagcaagcga ctttgggagg tctccgagca 2536561 gctcaccggt gtcagctacc cgaaatcgcg ctgaactgcc cgatcccggg aacctgaggt 2536621 attccggggg ggagctgcgg aatctccgga atcggtggga tcggcgggat cggtggaggg 2536681 ctgggggacg tggtcgccgg cggctgcgtg gtcgccggcg gctgcgtggt cgccggcggc 2536741 gcggaagcgg gggtcgtcgg tgccggagtg atgacatcgg tggtcaccgc cggttgcgta 2536801 ttcgtcgttg tcggcggagg cggcaacggc tgctgcagcg gcggcgcggg cccaccggtg 2536861 gctggcgcct gtacgggagg tgcgggctcg gtagtggggc catcggatgc cggcgctggg 2536921 gacgggggcg gtgcggcggt cgtggtcaca cccggcctct gcggggtccc cggcgccgtc 2536981 ggctggtcgc cggtggacaa cccgatcgcc acggcggcac ccaccagcaa caccgccacc 2537041 gtcgtgccgg tgatgatcac ggccggcagg cgataccacg ggattggcgg ggacttgggc 2537101 tcgggctccg catgggcatc gtggtcgaag ctcagcgacg ggcgggccgc tgtgtagcca 2537161 ggggccggcc cgatgtggga gtcctcgtcg gcctccgacc aggccaaagc gggctgcagg 2537221 accgacgccg gcgcatcggc cggcgccgtc gccgtcgccg aggtgaccgc ggtcagcacc 2537281 gttgcgctgg tgtcgccggg tctgcgtgcc gcccacaacg cgccgccgaa agcggccgtc 2537341 aattgcggac gaggcgtcct gaccaccggc acgcagaaac gtccggacag cgtcgtggtg 2537401 actgccggga tatttgcacc accacccacc gaaacgatcg ctaccagctc ggccgtgcga 2537461 attccgctgc gggccagggt ttgttccaag gccctgccca cgctgtccag cgagtcacgg 2537521 attgtgtcct cgagctcgtt gcgggtcaac cggatatccc cgcccaacgc gtcggtcagc 2537581 gtggtcaccg tgcttgacga aagccgttcc ttggctttgc gacattcgat ccgcagctta 2537641 gtcagtgagc cgatcgccga ggtgccggct ggatcgaacg cgcccgtgcc cggtagttcg 2537701 gacatgacgt agctcaacag cgactgatcg atcagatcgc cggagaaagc ctgatggcgc 2537761 accgtcgcgg ccaccggccg atactcgtct gcggcgtcga cgagcgtgat gccggtcccg 2537821 ctgccaccga agtcgcatac cgcgacgatc ccacgggccg gtatgcccgg gtcggcccgt 2537881 atcgcgtaca gcgctgccgc ggcgtcaggg agcagtgaca gtggctgggc cgtactcgaa 2537941 gtcccgtgcg accattccga ggcccgacgc agcgcgctat ccaacgctgc taccgcagcc 2538001 ggcccccagt gggcgggata ggtcaccgtg acacttccgg gaagagcacg accgccggta 2538061 gcggtgtagg ccagcgccag cagtgcgtca gccactagcg cctcgctgcg gtacaccgag 2538121 ccgtcggcag ccacgatgcc gaccgaatct cccacccggt ctacgaagtc ggtgatcacc 2538181 aggcctggct cgtccagcct cgggttctcc gatggcacac cgacctcggg cgggcgctgt 2538241 cgatacagcg tcagcacggg tttacgtgtg atggagtgat cggcagccac agccgctagg 2538301 ttggtgacac cgatcgacaa gcctaatgcc ggtctcgccc ctgttgccat atggcccaat 2538361 ccccgtgtcc ggcggctcgt cgcaaccgcc tacctcgaat tttccgtcat acctatagcc 2538421 aatgtgggcg ccggtgatct ggatagcgac attgccgcaa cgcccggttg gtcagcaaat 2538481 ggtgcccatg ctggcgacca acgggacctc cggcgcggta aggcagccgg gctccagtaa 2538541 tcccagcggc taggccaagg cctcgatgtc gtcggtggcg acgatgccta ccggcttgga 2538601 gccgtgttga gaaatgagtt cggccgtcgg cagcaacctc cccactcagc aatcccagct 2538661 tcaccctaaa cctggcgttc gtacgccacc tagcatctgg tgggtgcgaa cggtgatgtc 2538721 gcgttgagcc gcatcggcgc cacccgtccg gcattgagcg cgtggcgatt cgtcacagtg 2538781 ttcggggtgg tcggcctgct cgccgacgtc gtgtatgaag gggcccgttc gatcaccggc 2538841 ccgctgctgg cttcgttggg agcgaccgga ctggtggtcg gagtcgtcac cggcgtcggt 2538901 gaggccgccg ccttgggctt gcggctggtg tcggggccat tggccgatcg aagccgacgg 2538961 ttttgggcct ggaccatcgc cggctacacc ctgacggtgg taacggttcc gctgctcggc 2539021 atcgcgggcg ccctgtgggt ggcgtgcgcg ttggtcatcg ccgagcgagt cgggaaagct 2539081 gtgcgcggcc ccgccaaaga caccctgctg tcgcacgcgg ccagtgtgac cggccgaggc 2539141 cgcggtttcg ccgtgcacga ggcgctggac caggtcggtg cgatgatcgg ccctctcacc 2539201 gttgccggga tgctcgcgat caccgggaat gcctatgcgc ccgcgctcgg cgtgctgacc 2539261 ctgcccggcg gtgccgccct tgctctgttg ctgtggctgc agcgtcgggt gccccgcccg 2539321 gagtcctacg aggactgtcc ggttgtcctc ggtaatcctt cggcgccgcg accctgggcg 2539381 ctgccggcgc agttctggct gtactgcggg ttcaccgcga tcaccatgct ggggtttggc 2539441 acgttcgggt tgctgtcgtt tcacatggtc agccacggcg tgctggccgc cgccatggtc 2539501 ccggtggtct atgcggccgc aatggccgca gatgcgctga cggccttggc ctcaggcttc 2539561 agctatgaca gatatggcgc gaaaaccctt gccgttctgc cgattctgtc gattctggtg 2539621 gtgctattcg ccttcacgga caacgtcaca atggtggtca ttggcacgtt ggtgtggggc 2539681 gcagcggtcg gaatacaaga gtccacgctg cgcggcgtgg tggccgacct ggtcgccagc 2539741 ccacggcggg ccagcgccta cggcgtgttc gccgcagggc tgggcgctgc gaccgccggg 2539801 ggcggcgccc tcatcggctg gctgtacgac atctccatcg gcacgctcgt tgtggtggtg 2539861 atcgcacttg aactgatggc cctggtgatg atgttcgcga tccgactacc ccgcgtagca 2539921 ccgagctaaa gaagcgatca ggcggcccaa cggaacagca ggttggtatg cgacaacatg 2539981 cttgaccggc acgccaacaa gcacgactgc caccgatcca ggtaagtggc ggccaaggac 2540041 ggtcaaccgg tctaggctcg ccagtattac cccttcaagg gcgaaggggg caggaggatc 2540101 tcgatgggcc tcaacacggc gatcgcgact cgggtgaatg gcacgccgcc gccggaggtg 2540161 ccgatcgccg atattgaact gggttccctg gatttctggg cactcgatga cgacgttcgc 2540221 gatggcgcct tcgccacctt gcgccgcgag gcgccgatct cgttctggcc cacgatcgag 2540281 ctgcccgggt ttgtcgcggg caatgggcat tgggcgctca ccaagtacga cgatgtcttc 2540341 tacgccagcc gtcatccgga cattttcagt tcgtacccca acatcacgat caacgaccag 2540401 acaccagagt tagccgaata cttcggctcg atgatcgtgc tcgacgatcc gcgccatcag 2540461 cggctgcgct cgattgtcag ccgagccttc accccgaagg tggtagcccg catcgaagca 2540521 gccgtgcgtg accgggccca tcggttggtc tcatcgatga tcgccaataa tcccgaccgg 2540581 caggccgatc tggtcagcga actcgcaggt ccactgccgc tgcagattat ctgtgacatg 2540641 atggggattc ccaaggcgga ccatcagcgc atttttcact ggaccaacgt cattctcggc 2540701 ttcggcgatc ccgatctggc caccgatttc gacgagttca tgcaggtttc ggcggacatc 2540761 ggcgcctacg ccaccgcgct ggccgaagac cgccgggtca accaccacga cgatctgacc 2540821 agcagcctgg tcgaagccga ggtcgacggc gagcggctgt cgtcgaggga gatcgcgtcg 2540881 ttcttcatcc tgctggtggt ggccggcaac gagacgacgc gcaacgcgat cactcacggc 2540941 gtgctggcac tgtcccgcta tcccgagcaa cgggacaggt ggtggtctga cttcgacggc 2541001 ctggcgccca ccgcggtcga ggagatcgtg cggtgggcct ccccggtggt ctacatgcgc 2541061 cgcaccctga cccaagacat tgagttgcgc ggcaccaaga tggccgccgg tgacaaggtc 2541121 tccctgtggt attgctcggc caaccgggac gagtcaaagt tcgccgatcc ctggacattc 2541181 gacctagcac gcaaccccaa tccgcatctc ggtttcggtg gcggtggcgc ccatttctgc 2541241 ctgggcgcca acctagcgcg tcgggagatc agggtcgcgt tcgacgaact acgcaggcag 2541301 atgcccgacg tcgtcgcgac cgaggagccc gcacggctgt tgtcgcagtt cattcacgga 2541361 atcaagacgc tgccagttac gtggtcctga aaggccgaac gtggctcggc gggtatatgg 2541421 tgcgccattc ccggtggctg tgggatttgc actacacagg aagcgttgtc gcccacccac 2541481 tggcggaccg gtaggcaccg atcggtgccg gcctgttttg ggtagcggat caagcgcaca 2541541 aacgactcgc ggtggccgaa caggatgatg ttggcgagac gccccgtctg gcatgaccgc 2541601 tgccgacgcg ttcgagtgcg gtcgagagcc aaaggcggct tgatcagccg ccaaccgcag 2541661 gccgaagacg tgccggctca ggtgtgtgac gatcgtagcc gtagcggtcg atgatctcgc 2541721 cccagtgctc atcgacaatc gcacgctgct cgacggtcag ttgatagctg ttggttttgt 2541781 agtccgcatg gtcagctagg tattgccgca gacgcggcag gtaacactcg aagtcgccca 2541841 gtcccaggtg ctggtatagc cggcgcagct gtccctcggg atcaccgatc aaatcctcat 2541901 aacgcaattc gtaaaagcgt gtggggtcaa cgagttctcg gccttcgtcc aactttcggt 2541961 ataggtcgac gtaggtcgac acgaccttgt cgtccaaccc gtcgaacgtc ggttgttgca 2542021 agccatgtat gcggtacagc gccttatgaa gatggatggt tgatggatag accacatagg 2542081 gatctcggac gatgtggatg aacttcgctt gcgggaatac ctccagcagc accttgattc 2542141 gaaaactatg cgttggattc ttgaggatca ccgtcttgcg acggcggaag tacacctgct 2542201 gaacgaaccg gaacagggtc cgtttccaga tttctagttc tcgcggtgcc acctgctcta 2542261 gatccaggta ctcctcatac tggggcggcc ggttcgggaa tgcgatggtc agatacggcg 2542321 acggcaggcc ctgcatacac cacacgaact cgtcttcctg cgggtgatgc aagctcaaat 2542381 ccatgttgtc cattgcccga tgcttcgata ccaggaattc cacatatggc gcaaaccact 2542441 cggtcagtag aaaatggtgt ggcgcaaggc attcgtagcc ggtgggaccg gtgtggcgat 2542501 catcgacgac caacagttca tgcagcaagg tggtgccggt acgccaatgc ccaacaatga 2542561 agattggcgg atcggcgatc accgtttcgg ccactcgcct accgaaaacg atcttctgcc 2542621 acaaccccag acaggaattg accatgctga gaaacgtata gaggaccgcg aagtgccagc 2542681 ggctgtgatg cacggcgaag cggttacgga tcaaaagccg catccaggcc gagaagttgc 2542741 agccgaccca cagcggtgcg gcccactcgc gccaccggga aagtcgagac gacgaacgga 2542801 gagccttcat ggtgcgacgc ggggggtaac ggcgacccgt aaccgggtca agccgcgaag 2542861 gttggcgttt gtcgtccacg tcggcggctc gaccacctct attcggtcga tattggcgac 2542921 gatctcgcgc aagatcgcct gaccctccat gcgcgccagc tgggtccccg gacacaggtg 2542981 gatgccggag ccgaacgcga gatgcccgac cgggttgcgg tcggcgcgaa agacatccgg 2543041 gtcttcgtac tggcgcgggt cacggttggc tgcaccccat gccagcagca ccagtgagcc 2543101 tgccgggatg accgcttgac cgaccgaata gtcgacgcgc gttgtgcggc agatgttttg 2543161 gattggcgat ataaagcgga ggtgctcctc gatcgccgac gggatcaggt ctggttgctg 2543221 cgcaaggagt gtcagctgat ctggatagtc ggccagcgtc agaaacaatg tgctaatcat 2543281 atgagcagtg ctctcatagc ccgcaaccag cagcaacacc gcgaagaaga acaattcgtc 2543341 atcgctgagt cgaccttgct cggcatgggt ggcaagcttc ccgagaacag tgcattccct 2543401 aagcagcccg ttgtcacgcc gatgagtgaa gagtgcacgc aatcgccgga atccggcaaa 2543461 gccctgcaca agcgaaatca acccggaggc tgacaaggca acgtcggtga tccgtaccgc 2543521 ctggttggac aaacggcaga aggcggcctc gtccggtcca tctacgccga gcacactggt 2543581 gatagcgcgc atcggcatcg gtgcggccac ggtggagacg acgtccgcgg gcgtctgggt 2543641 cagtaacccg ccgaccagtt ctcgggcaag ctggtcgacc atcgggcgcc acgtctccaa 2543701 cgcgccacgc gccatacctg gtgccagttg cttgcgcatc cgggtgtgcg ccggcggatc 2543761 ggacgtcggc agaaacggca gccacccccg tgagaaggtg accccacggg cgctggacaa 2543821 cgtgtcgtgg ttacgcgcag cctcgcggac gtcggcgtat cggctcaaaa tgtagacgtc 2543881 gcgcttgggg ttgtactgca cccgctcgcc ggccaacagc tctcgataat gcgggtaagg 2543941 atcagcggca atcgcgggat cgaacgggtc aaagtcggtg agctgcataa atttccggca 2544001 atgccggccg gtcaacctgg accgagcctt cccggcgacc ctcagcgcaa gtgctttcgc 2544061 gaccgcgggc ccgtaggttc gcacagtttg cgcgtcgcgc cacatgctgg tggctaccgg 2544121 gatgccacca gatgacgcgc gccggcgcgt gggaacgccc agagccgtgg tcgcgtcctg 2544181 cgcggtcaga ccaacgtcgg gcgtgcccgc taacgggcac ccggccagcc gcactcggtc 2544241 cggcgcgggc tcgggagggg actgtgtcgc ggtcatgacc ctccgaactc agagaggcgt 2544301 agaacagtca cagggtaacg gcgggcatcg caataattgc gcagtttcgc aaagcgtttc 2544361 gcaacgcaat aagatggtta cccggagttc ggacaggcga atctgcccag cgcaaggctg 2544421 gtgatagcgc cgaccaacgg cgccgtgatc ggtaaccgtt tccgaccggc cgataccggc 2544481 ccggccacca tagcggaggt caaccccacc tgttggcgga acgcccaaaa ctgggccgac 2544541 tgtgtaggca tgcgtcgcac ttgattggtc gccgacccgg caattcgcta gccgcgctaa 2544601 gggtcgcgca tcgttggcca caacaggcgc gacttgcgcg aatgtgcttt ctcgccggca 2544661 tcgcgatgcc taactttatg ttttcgagga gactgcgatg cggcttccag gccgtcatgt 2544721 gttatacgcc ctgtcggcgg tcaccatgct ggcggcctgc tccagcaacg gtgctcgtgg 2544781 cggcattgcg tcgacgaaca tgaatccgac aaacccaccc gcaactgcgg agaccgctac 2544841 cgtctcaccg acaccggctc cgcagagcgc gcgaaccgag acctggatta accttcaagt 2544901 cggcgactgc ctggccgacc tgccgccggc ggatctgagc cggataaccg tcacgattgt 2544961 cgattgcgcg acagcgcatt cggccgaggt atacctgcgt gctccggtgg ccgtcgatgc 2545021 cgccgtcgtt tccatggcca atcgtgattg tgctgccgga tttgcgccct acacaggcca 2545081 atccgtcgac accagcccat actcggtggc gtatctcatc gactcgcatc aggatagaac 2545141 cggggccgat cccaccccga gcaccgtcat ctgtttgctg cagcccgcca acggtcagtt 2545201 gctcaccggg tcggcccgtc gctgaccgga cgacccgttg ttcgggtgcg tggcacacga 2545261 caccaaccgg tatcgtctgt tgccgtgact tctccgattg ctccgaatac caaaagcgac 2545321 ggttctcgct gatgactacc ccacccgaca aggcgcggcg ccggtttctt cgcgacgcct 2545381 acaagaacgc tgagcgcgtc gcacgaaccg ctttgctcac aatcgaccag gaccagcttg 2545441 agcagctgct cgactacgtc gacgagagac tcggcgaaca gccttgtgac cacaccgccc 2545501 ggcatgcgca acgatgggcc caatcacacc gcatcgaatg ggagacgctg gccgagggcc 2545561 tacaagagtt tggtggctac tgcgattgtg agatcgtaat gaatgtcgaa cctgaggcga 2545621 tcttcggcta gtcctctgcc ggcgatgttc tcataacgac atggcaagcc acgcgcttga 2545681 ctaaactcag ccgacgtcaa accgcctgtc cccgatatgc cctgcgaggt tgcctcgtgg 2545741 ctgatgactc aaacgacacc gcgaccgatg tcgaacccga ctaccggttc acccttgcca 2545801 acgagcggac cttcctggcc tggcagcgca ccgctctagg cctgctggcc gcggcggtcg 2545861 ccctggtgca gctcgtcccg gaactgacga tccccggcgc acgccaggtg ctcggtgtgg 2545921 tgctcgcgat tttggcaatc ctcaccagcg gaatgggtct gctgcgctgg cagcaggcgg 2545981 atcgcgccat gcgccggcac ctgccattgc cccgtcaccc cacaccgggc tacctcgcgg 2546041 tggggctctg cgtggtcggg gtcgtcgcgc tcgcattggt ggtagccaag gcgatcaccg 2546101 ggtgaaccgt cactcgacgg cagcgagcga tcgcgggctg caggccgaac ggacgacgct 2546161 ggcctggacc cggacggcct ttgcgttgct ggtcaacggc gtgttgctga cgctcaagga 2546221 cacgcaaggc gccgacgggc cggctgggct gatcccggcc ggcctagctg gtgctgcggc 2546281 ctcgtgctgc tatgtgatcg ctctacaacg ccaacgagca ctttcgcacc gcccgctacc 2546341 ggcacgaatc actccccgcg gccaggtcca catcctcgcg acagcggtgc tggtgcttat 2546401 ggtcgtcacc gcctttgctc aactgctcta gcgcggcgaa cagacgcaaa agcccccgca 2546461 cgcacggagt gtcgggggct tttgcgtcta ctcgccaaat gcgatcgtgg ccgatggcgg 2546521 cgcggacctt cctgtaaatt gccggaattc acgattttgt gcggctagac caacgccggg 2546581 agccagcgtg cctgcgagga taggagcgcc tcggccgatc cgccggcgca gccgttcggt 2546641 cacaacggat ctgacctgct cagcctgcaa gtcaaccaca agaccggtcc aggctgatac 2546701 gcaaaatatg tgagtgtacc cgccgccaca gcggcagcag ctggatcccc cttttggtgg 2546761 acacgagatc cacccaatag gctgggccga tcgggcgata gacattgtca gttcgtgccg 2546821 gcaccctgat cactgacctc aacaccgagc gtcgaccccg tccctatggt ccaaggaaaa 2546881 caatgtcata cgtggctgcc gaaccaggcg tgctgatctc gccgacggac gacttgcaga 2546941 gcccccggtc agccccggca gcgcatgacg aaaatgcgga cggcataaca ggcgggacca 2547001 gagacgactc tgctcccaac tcacggtttc agctaggcag gcgcattccg gaagccaccg 2547061 cccaggaagg gtttctggtt cggccattca cccaacaatg tcagatcatc cacaccgaag 2547121 gagatcatgc tgttatcggg gtatccccgg ggaacagtta cttctcccgc cagcgcctac 2547181 gggatctcgg gctttggggt ctcacgaatt ttgatcgtgt ggacttcgtc tacaccgatg 2547241 tccatgtcgc cgagagttac gaagcgctag gcgattccgc aatcgaagcc cggcgcaagg 2547301 cggtcaaaaa catccgcggc gtccgcgcca agatcaccac cacggtgaac gaactcgatc 2547361 cggccggggc ccggctgtgc gttcgtccga tgtcggagtt ccagtccaac gaggcatacc 2547421 gggagctgca tgcggacctg ctcacgcgcc tgaaagacga cgaggacttg cgcgccgtct 2547481 gccaggacct agtgcggcgc ttcctgtcca cgaaagtggg tccgcggcag ggggcgacgg 2547541 ctactcaaga gcaggtgtgc atggactaca tttgcgccga ggccccgcta ttcctcgaca 2547601 cacctgcgat tctcggagtg ccgtcgtcgt tgaattgcta ccaccaatca ctgcccctcg 2547661 ccgaaatgct ctacgcccga ggatcgggac tacgggcatc gcgcaatcaa ggccacgcca 2547721 ttgttacccc tgatgggagc cccgccgaat gaccgcgacc gttctgctcg aggtcccgtt 2547781 ctctgcacgt ggggatcgga ttcctgacgc cgtcgcagaa ttacgaaccc gcgagcctat 2547841 ccgcaaggta cggaccatta ccggcgccga agcctggctc gtctcctcgt atgcactgtg 2547901 cacacaggtg ctcgaggatc ggcgtttttc catgaaggaa accgccgctg ccggcgcccc 2547961 ccgcctgaac gcgctgactg ttccacccga agtggtcaac aacatgggaa acatcgccga 2548021 cgcgggactg cgcaaggcgg tgatgaaagc gatcacaccc aaggcacccg ggttggagca 2548081 attcctacga gacaccgcga actcgctgct ggacaacctg attaccgagg gcgcaccagc 2548141 cgatctgcgc aatgacttcg ccgacccgct ggccactgcc ctgcactgca aggttctggg 2548201 catcccgcaa gaagacggcc cgaagctgtt ccgtagcttg agtatcgctt tcatgagttc 2548261 ggccgacccg atccccgccg cgaagatcaa ctgggatcgc gacatcgaat acatggccgg 2548321 aattctggaa aacccaaaca tcacgaccgg cctcatgggt gagctcagcc gcctccggaa 2548381 agatcccgcc tactcgcacg tctccgacga actattcgcg accatcggcg tcactttctt 2548441 cggtgccggc gtcatctcaa ccggcagctt cctcaccacc gcgctgatat cgctgataca 2548501 acgcccgcaa cttcggaact tgttgcacga gaagccggaa ctgatcccgg ccggtgtaga 2548561 ggaactgctg cggatcaatc tctccttcgc cgacgggtta ccgcgcctgg ccaccgccga 2548621 catccaggtc ggcgacgtgc tggtccgcaa gggggagctg gtgctggtgc tgctcgaggg 2548681 cgccaacttc gatcccgagc acttccctaa cccgggcagc atcgaactcg accggcccaa 2548741 ccccacctcg cacctcgcgt tcggccgcgg ccaacacttc tgtcctggat cagctctcgg 2548801 tcgccgccac gcacagatcg gcatcgaagc gctgttgaaa aagatgcccg gcgtcgacct 2548861 ggctgtgccc atcgaccaat tggtctggcg cacccgattc caaagacgca tccccgaacg 2548921 ccttccggtg ctctggtagg cttccggaaa ctcacccgag ccatcaccgc aagatttggc 2548981 aagcgttggg acagaacaat ttcgaccttg caccggccga aggcgctgcc ttctaccgaa 2549041 taaaagtacg ggcctccccc aaactccgaa atcgtcagta ccgcacgcaa ttcaaatgaa 2549101 ccgcaccctg acagcgagcg acgttaatga cgccattgtt gggccgccag cggcgagtcc 2549161 acaagtaccg catcgagtcc gattttgtga gccaggcggt agtcgtcgac agttttcacc 2549221 gcgaaaccca tgaccttcat gccggactgc gatctgaaac agtcgaccga ggcctcgtcc 2549281 cacaactcgg cattcaccgc ggagataccg gaccccaacg tgaattcttc ggtgacggtg 2549341 acatcgcgat gcaactcgaa tccggcccac ttcccaggat ccggctgcgg atcacagtga 2549401 tggttcaatg ccatgttgaa aaggcgctgg cgggtcacgt cacgactttc ggcgacctgc 2549461 agtccctcct gccgcgaggc tgcagccgtg atgtcagcgt tggtggaata tacgatcgac 2549521 cgcccggcag caccagtcct ggtcaacacc tgcgcgaccg ctgagaccag cggctgtggc 2549581 ggagtctgct tgaggtctag aaacagagtc atatcgggcg gagtcgcgcc aatggcttgc 2549641 tccagtgtcg gtatcggggt cgcccgttgc cggtagggat ggccctcgac gcccggcgtg 2549701 gtgaaattcc atcccgcgtt gagctgctgg agttgctgaa ccgtcttcga attcaccggg 2549761 ccggcgccgt cggtcaacgt tgccagatcg gacggacgat acagcaccgg cacgccatcg 2549821 ctgctgacct ggacggtcag ccacatgcca tccacaccag ctgcgactgc gttggtaatc 2549881 gccagaacgg tgttctcggg aaaatcgcgc gtacccgcgc gatgcgcgac aatcatcggg 2549941 tcgtcagtct ggcccagcgg caaagcatcc gccacaccgc aagtccctcc caaggcgatc 2550001 accagcgcca ccgtgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg 2550061 ggtcatgtca ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg 2550121 gatggtcgca gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc 2550181 ccgtctactt ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt 2550241 cgatgccggc gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttgcg 2550301 gcgggacaac gccgaattgc gaagggcgaa cgcgatttta aagaccgcgt cggctttctt 2550361 cgcggccgag ctcgaccggc cagcacgcta attacccggt tcatcgccga tcatcagggc 2550421 caccgcgagg gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc 2550481 gagctgggtg tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc 2550541 cgccgcgagc tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac 2550601 tacggtgttt acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg 2550661 gccagatgca ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc 2550721 aaagcccgca ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag 2550781 cgccgcttcg gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg 2550841 acctgggcag ggttcgccta cgtggccttt gtcaccgacg cctacgctcg caggatcctg 2550901 ggctggcggg tcgcttccac gatggccacc tccatggtcc tcgacgcgat cgagcaagcc 2550961 atctggaccc gccaacaaga aggcgtactc gacctgaaag acgttatcca ccatacggat 2551021 aggggatctc agtacacatc gatccggttc agcgagcggc tcgccgaggc aggcatccaa 2551081 ccgtcggtcg gagcggtcgg aagctcctat gacaatgcac tagccgagac gatcaacggc 2551141 ctatacaaga ccgagctgat caaacccggc aagccctggc ggtccatcga ggatgtcgag 2551201 ttggccaccg cgcgctgggt cgactggttc aaccatcgcc gcctctacca gtactgcggc 2551261 gacgtcccgc cggtcgaact cgaggctgcc tactacgctc aacgccagag accagccgcc 2551321 ggctgaggtc tcagatcaga gagtctccgg actcaccggg gcggttcacc gcgcccaaca 2551381 tagccgtctt caccatcggt ccccttcagg ctttccccac cgtagaaacg tgcgcaatgc 2551441 gcggcgcaca gtatcgaacc gtaccgctga gagccaacca cgatgatttg cccgcaccgg 2551501 cagcgataaa gtaagtcgcg gtcgggcacg cagcgcagcg ttggaaagtg aggcctccga 2551561 tgagtgaaat gacagctcgg ttttccgaaa tcgtcgggaa cgccaatttg ctgaccggcg 2551621 acgcaatccc cgaggactac gcacacgacg aagagttgac ggggccgccg cagaagccag 2551681 cctatgccgc caagccggcc acccccgaag aggttgccca actgctgaag gccgcctctg 2551741 aaaacggtgt gccggtgacg gcccgcgggt ccgggtgcgg cttgtcgggg gccgcacgac 2551801 cagtcgaggg tgggctgctg atctcgttcg accggatgaa caaggtcctc gaggtcgaca 2551861 ccgccaacca agtcgccgtc gtgcagcccg gggtggcgtt gaccgacctg gacgccgcta 2551921 ccgccgatac cgggctgcgg tacacggttt acccgggcga gctgtcctcc agcgtcggcg 2551981 ggaatgtcgg aaccaacgcc ggcgggatgc gcgcggtcaa gtacggagtg gcccgccata 2552041 acgtgctcgg gttgcaagcg gtattgccca ccggcgagat catccgaacc ggcggcagga 2552101 tggccaaggt gtccaccggc tacgacctca cccagctgat catcggctcg gagggcaccc 2552161 tggccttggt caccgaggtg atcgtcaagc tgcatccgcg gctcgaccac aacgccagcg 2552221 tgctcgcccc gttcgccgac ttcgaccaag tcatggcggc ggtgcccaag atcctcgcca 2552281 gcggcctggc acctgacatc ctggagtaca ttgacaacac ttcgatggcc gcactcatct 2552341 ccactcagaa cctggagcta ggtattccgg accagatccg cgacagctgc gaagcttatc 2552401 tccttgtggc gcttgagaac cgcatcgccg accgactgtt cgaggacatt cagacggtgg 2552461 gtgaaatgct catggaattg ggagcggtgg acgcctacgt gctcgaagga ggctcggcgc 2552521 gcaagctgat cgaggcccgc gagaaggcat tctgggcggc aaaagcactc ggcgccgacg 2552581 acatcatcga caccgtcgtc ccacgcgcgt cgatgccaaa attcctgagc accgcgcgcg 2552641 gtctggcggc ggcagcggac ggtgccgcgg tcggttgcgg gcacgccggc gacggcaacg 2552701 tacacatggc catcgcgtgc aaggatccgg agaaaaagaa gaagctcatg accgacatct 2552761 ttgctctcgc aatggaattg ggtggcgcga tctctggcga acacggcgtc ggccgggcca 2552821 aaaccggcta tttcctcgag ctggaagacc cggtcaagat cagcctcatg cgccgtatca 2552881 agcagagctt cgatccggcg ggcatcctca acccaggcgt tgtcttcgga gacacctgag 2552941 cacggacaag agccggccgg accaaggccg gtcatcggcc ggccaacagg cctgcaagtc 2553001 tcgagcgcaa catcttcgtg gacagctcgg tccgccggtc gtcaaagccg atttccccgc 2553061 atctgtccgg tcagtccgat gcagcgtcgg tcaccgttat tcatccggcg tttacccgtt 2553121 gctagccgcc atgacgtagc ctgctgacgc tcgatcgcca acacaagccg acatgagcga 2553181 caatgccaaa caccacaggg atgggcattt ggtggctagc ggacttcagg atcgcgcagc 2553241 gcgcacaccg caacacgagg gcttcctcgg gccggaccga ccatggcacc tgtcgttcag 2553301 tctgctgctg gcgggttctt tcgtgctgtt ctcgtggtgg gcattcgact acgcagggtc 2553361 cggcgcgaac aaagtcatcc tggtgctcgc caccgtcgtc ggcatgttca tggccttcaa 2553421 cgtcggcggc aatgatgtcg ccaactcgtt tggcaccagc gtcggcgcgg gcacgttgac 2553481 catgaaacag gcgcttctgg tcgcggcgat cttcgaggtc agcggcgcgg tgatcgccgg 2553541 cggcgacgtc accgagacca tccgcagcgg catcgttgat ctgtccgggg tgtccgtcga 2553601 cccacgcgac ttcatgaaca tcatgctgtc ggcgctatcg gcagccgcgc tctggctgct 2553661 gtttgctaac cgtatggggt acccggtgtc gaccacacac tcgatcatcg gcggcatcgt 2553721 cggcgcggcg atcgcgctgg ggatggtgag cggccagggc ggtgccgcac tcaggatggt 2553781 ccagtgggat caaatcggcc agatcgtggt gtcctgggtg ctgtcgccgg tgttgggcgg 2553841 cttggtgtcg tacctgctct acggcgtcat caaacggcac atcctgctgt acaacgaaca 2553901 ggccgaacga cggctaacag aaattaagaa agagcgcatc gcacaccgcg agcgccacaa 2553961 ggcggcgttc gaccggctca ccgagatcca gcagatcgcc tataccggcg ccctggcgcg 2554021 cgacgccgtc gcggcaaacc gcaaggactt tgatcccgac gaactggaat ccgattacta 2554081 ccgcgagcta cacgaaatcg acgccaagac atcgtcggtc gacgcgttcc gggccctgca 2554141 gaactgggtt ccgctggtcg ccgccgccgg atccatgatc attgtcgcga tgctgctgtt 2554201 caaggggttc aagcacatgc acttgggcct taccacgatg aataactact tcatcatcgc 2554261 gatggtcggt gcagcggtgt ggatggccac ctttattttc gccaagacac ttcggggcga 2554321 atcactttca cggtcaacgt ttttgatgtt cagctggatg caggtcttta cggcctcggg 2554381 cttcgccttc agccacggca gcaatgacat tgccaacgcc atcgggccgt tcgcggcaat 2554441 cctggatgtg ctgcgcacgg gcgccattga aggcaacgca gcggtgcctg ccgcggccat 2554501 ggtaacgttc ggcgtcgcgt tgtgcgcggg gttgtggttc attggacgac gggtgatcgc 2554561 caccgttgga cacaacctca ccacgatgca cccggcatcg gggtttgctg ccgaattgtc 2554621 ggccgccggg gtggtcatgg gagccacggt cctgggtctt ccggtttcca gcacgcacat 2554681 tcttatcggc gccgtcctcg gcgtcggcat cgtaaaccgg tccaccaact ggggactgat 2554741 gaaaccgatc gtgctagcgt gggtcatcac gctgccttcg gcggcgatcc tcgcctcggt 2554801 cggtcttgtc gcgctacgcg cgattttctg acgacgccgg gtccatcaac cccagcgcaa 2554861 cctccgcgag cagtcgctaa agcccccgac acgccgtgcg tgcgggggct tatgcgactg 2554921 ctcgccggac ggaggtccta cgtgctgcgg gaagtgatgt ggctgagcag gtctcgtatc 2554981 gcacccgccg gcggggtgcg cccaccgacc cagatggctc gaagctggcg ccgcaggttc 2555041 aacgcgggga tgtcgaccgc gagtaatcga ccgaacgcca ggtcatcggc tatcgctagc 2555101 cggctcatcg cagccggtcc agcgccggcc aagaccgcgg cccgcacggc cgcagccgat 2555161 gataattcca gcaccggtgg cgcttgctgc atgtcctccc cgagcgtgtc acgtaacgcc 2555221 gcggtgagtg aatcgcggat gccagagttc ggttcgcgag tcaccaaagg cgtctgagcg 2555281 agctcccggg cgctcactac tcgtgaccgt cgggcccact tgtgacccgg cggcacgacg 2555341 acgaccagtt cgtcgcgtgc aaccacaacg ctgcctaatc ccgtgggagg acaggggttt 2555401 tcgatgaatc caagatctgc gatgccgtca cgaacggctg cgatcgcatg ctcgctattg 2555461 gtggcggtca ggattacctc agggacagta ccaccgcggc gcatgtcggc ggcccgcaag 2555521 gacagcatcc aatgcggcat cagctgttcg gctatcgtct ggctggccac cactctgatg 2555581 cgctggcggc cttcggtgcg cagcgagccg aggccggcat cgatctcgtc ggcgacttcg 2555641 agcaagcggg ccgcccattc ggcgacgacg atgccggcag gcgtgagttg ggagccacgt 2555701 gtcgtccgga tggccaatcg caccccgatc tgggcctcca tcgatgcgag ccgccttgac 2555761 acagcttgtt gagtcaaccc gagttcgcgt gcggcgccgc caagactgcc ggcctcagcg 2555821 atggccagaa agatttcgaa gcaggtgagt ccgggcatac gagagctgag cggcatgcct 2555881 gatcaaatca caaccaatgg ttgttcccaa caacattcag acccctagtg acgacggccc 2555941 atgctcgaaa aatgccccca cgcgagcgtc gactgcggtg cctcgaaaat cggcatcacc 2556001 gacaacgacc ccgcgaccgc caccaaccgc aggctggcga gcacaattcg caagccgccg 2556061 atcgagcacg cggccgggcc cttagggtcc acatcacgcg ctggccaccg ttcgtacggc 2556121 ggggtggcct cgtaaggtaa ccacatgggc gctcctcgac tcatccacgt catccggcaa 2556181 atcggggcct tggtggtagc ggcagtgacc gccgccgcca cgatcaacgc atataggccg 2556241 ctggcgcgca acggattcgc atcgctgtgg tcgtggttta ttggcctggt ggttaccgag 2556301 tttccgttac cgacgctggc gagccagctc ggcgggctgg tgttgacagc ccaacgcctg 2556361 acccggccag tgcgggcggt ctcctggctg gtagcggcct tctcggcgct ggggctgctg 2556421 aacctcagtc gcgcaggccg tcaggccgat gcccagctca ccgccgcatt agacagcggc 2556481 ctggggcccg atcgccgcac cgcctcggcc ggtctgtggc gccgcccagc cggcggtggt 2556541 accgccaaga cccccgggcc gctgcgcatg ctgcggatct accgcgatta cgcacacgat 2556601 ggcgacatca gctacggcga atacggcagg gccaaccacc tcgatatctg gcgacgtccc 2556661 gatctagatc tgaccggaac agcgcccgtg ctgtttcaga tccccggcgg tgcatggacc 2556721 accggaaaca aacgcggaca ggcgcatcca ctgatgagcc acctcgccga gctaggctgg 2556781 atctgcgtgg cgatcaacta ccgacacagc ccgcgcaaca cctggccgga tcacatcatc 2556841 gacgtcaagc gcgccctggc gtgggtcaag gcgcacatca gcgaatacgg cggcgatccg 2556901 gacttcatcg ccatcaccgg tggttcggcc ggcggccacc tgtcgtcact ggccgcgcta 2556961 acgccgaatg acccacgatt ccaaccggga ttcgaagagg cggacacccg ggtgcaggca 2557021 gccgtgccgt tctacggcgt ctatgacttc actcgtctgc aggacgcgat gcacccgatg 2557081 atgctgccgc tgctggagcg aatggtggtc aaacaaccgc gcacggcgaa catgcagtcc 2557141 tacctcgacg cctcaccggt cacccacatt tccgccgacg ctcccccatt ctttgtgcta 2557201 cacggccgca acgactcgct ggttcccgta cagcaggcgc gtggcttcgt cgatcagctg 2557261 cggcaagtca gcaagcagcc ggtggtatac gccgaattgc cctttaccca gcacgctttc 2557321 gacctgctcg gctcggcacg tgcggcacac acggcgatcg ccgtggagca attcctggcc 2557381 gaggtctacg caacgcaaca cgcgggcagt gagccgggcc ccgcggttgc gatcccatag 2557441 cttttggggt tgaggtcgct agggttggcc ttgtgaagct gctcagcccg ctggatcaga 2557501 tgttcgcgcg catggaggcg ccgcgcacgc caatgcacat cggcgcgttt gcggtcttcg 2557561 acctgcctaa gggagcaccg cgcaggttca tccgcgacct gtacgaggcg atctcacaac 2557621 tggcgttcct gcccttcccg ttcgacagcg tgatcgccgg cggcgcgtcg atggcgtact 2557681 ggaggcaggt gcagcccgat ccgagctacc acgtccgctt gtccgcccta ccttatccgg 2557741 ggaccggccg cgatctcggc gcgttggtcg agcggctgca ttcgacccca cttgacatgg 2557801 ccaagccgct atgggagttg cacctcatcg aggggctaac cggccgtcag ttcgccatgt 2557861 acttcaaggc ccaccactgc gcggtcgacg gattgggtgg ggtgaacctg atcaagagct 2557921 ggctcaccac cgatcccgag gcacccccag gctcgggcaa gcccgagccg ttcggcgatg 2557981 actacgactt ggccagcgtg ttggccgccg ccacgacgaa gcgggcggtc gagggcgttt 2558041 ccgcggtcag cgaactggcc ggaaggctat ccagcatggt gctgggcgcc aacagctcgg 2558101 tgcgggcggc cctcaccacc ccgcgtaccc cgtttaacac ccgcgtcaac cggcatcgac 2558161 ggctagcggt gcaagtgctg aaactgccgc gcctcaaggc agtggcccac gccaccgact 2558221 gcaccgtcaa cgacgtgatc ctggcgtctg tcggcggggc ttgccgacgc tacctgcagg 2558281 agctgggcga cctgccgacg aacaccctga ccgcctcggt gccggtcggc ttcgagcgcg 2558341 acgcagacac ggtcaacgcc gcctcgggtt tcgtcgcgcc gctgggcacc tcgatcgaag 2558401 acccggttgc gcggctgacc acaatctcgg cgtcgaccac ccgcggcaag gccgaactgc 2558461 tggcgatgtc accaaatgcc ttgcagcact actccgtatt cggcttgctg ccgatcgcgg 2558521 tggggcagaa gaccggcgca ctcggggtga ttccaccgct gttcaacttc accgtctcca 2558581 atgtggtgct ctcgaaggac ccgttgtatc tttcgggcgc caagctggat gtgattgttc 2558641 cgatgtcgtt cctgtgtgac ggctatggcc tcaacgtgac gctggtcggc tacacggaca 2558701 aggtcgtcct cggctttctg ggctgccgtg acaccttgcc gcatctgcag cggctagcgc 2558761 agtacaccgg cgcggcattc gaggaactcg agaccgccgc cttgccatag cgaccaaacg 2558821 acgacaacgc tccgcccatc gccggcagta cccgccaatc accacggtgt agccgctcag 2558881 gagcggcccg ccagccggtc gatatcaacg atctccccgc gattgatgct cacccaatcg 2558941 cgcccatcga ggtaggggcg tagctgctgg gcaatgagtt caacatcggc tggtgacttc 2559001 ggtcgctgaa gttcgtagac gtgcggcagc cccgccatgc ccgtgacgac gctccacagg 2559061 ttcagggcgg ccggtccagc aggcgggtcc accagcaccg gcccaaaaag gcattgtccg 2559121 tccaaaaaga gcgtcggcac gccgtatccg cccgcggcga caacccgttg gtggtcggcg 2559181 cggacgtcgt cgtgggtcgt cggatcatcc agcgccgcgt ccaaaatcgc cgcattgacg 2559241 ccgacgtcgc acagtaggcg tcgcgccacc gcgggatcat gcggtttgcc gcccagggtg 2559301 tgcagctcat gaccgatcgc tgcataccac cgatcaagca acgacatgtt cgttcgacgc 2559361 agcagcgcac cgatccgcat caacgaccag ccataggacc agtctcgctc ccacgggtgc 2559421 ttcttgcccg ctaccaggtt gatctcctcg aggctgaaaa accgccagtt gatcgtgatt 2559481 cccaattgcg cgcgcacatc acggatccac accgaggtct gataggcgaa cgggcacaaa 2559541 gggtcaaagt ggaaatccac ggtggtcatc agacctgagt cctccagctg atcgagtcga 2559601 cacctcgatg acattgtgcc gtgcgccacg ttgtcagcgg actgagtcga cccaacatct 2559661 cgcggtgttc gccagggtgc cgaaacaggt caacgcggcg gtatgaatgg tcgacgcacc 2559721 ataggcgagg atgggctggt gttcgggctc gtcgttatcg ttgcgctggt cgccgccgtg 2559781 gtcgtgggga ccgtcctggg ccaccgctat cgcgtgggcc ctccagtgtt gctcatcctg 2559841 tccggttccc tgctgggtct gattccccgt ttcggtgacg ttcagatcga tggcgaggtg 2559901 gtgctgctgc tgttcctgcc ggcgatcctt tattgggaga gcatgaacac cagctttcgc 2559961 gagatccgct ggaacctgcg cgtcatcgtc atgttcagta tcgggctggt gattgccacc 2560021 gcggtcgcgg tgtcgtggac ggcacgagcg ctgggcatgg agtcccacgc cgcggctgtc 2560081 ctcggtgccg tgctctcccc caccgatgcc gcggcggtgg ccggcctggc gaaacggttg 2560141 ccgcgccggg cgctgacagt gctacgcggc gagagcctca tcaacgacgg gaccgcgctc 2560201 gtgctgttcg ccgtcaccgt ggcggtcgcg gaaggtgccg ctgggatcgg cccggccgcg 2560261 ctggtcggcc ggttcgtcgt ctcctatctc ggcggaatca tggccgggct gctggtcggc 2560321 ggcctggtga cattgctacg ccgcagaatc gacgcaccat tggaggaggg agccctgagc 2560381 ttgctgacgc cgttcgcagc gttcttgctc gctcaatctc tgaagtgcag cggtgtggtt 2560441 gcggtgctgg tttcggccct ggtcctcacc tacgttggtc cgacggtgat acgcgctcgt 2560501 tcccgcctgc aggcgcatgc gttttgggac atcgccacgt tcctgatcaa cggctcgttg 2560561 tgggtgtttg tcggcgtcca gatcccgggc gcgatagacc acatcgccgg cgaggacggg 2560621 ggactaccac gggccacagt cctggccctg gcggtgacgg gtgtcgttat cgccacccgg 2560681 atcgcctggg tacaggcaac cacggtcctg ggtcacaccg tggaccgggt cctgaagaag 2560741 cccacccgcc acgtcggctt ccgtcagcgt tgcgtcacaa gctgggccgg tttccgcggc 2560801 gcggtatcgc ttgccgcagc gctggcggtg ccgatgacca ccaatagcgg cgctccattc 2560861 ccagaccgca acctgatcat cttcgtcgtc tcggtcgtca ttctggtcac cgtgctggtc 2560921 caagggactt ccttgcccac cgtcgttcgg tgggcgagga tgcccgaaga cgtcgcgcac 2560981 gccaacgaat tgcagctggc ccgcacccgt agcgcccaag ccgccctcga cgctttgccg 2561041 acggtcgccg acgaactcgg ggtcgccccc gatctcgtca aacacctgga aaaggaatac 2561101 gaagaacgcg cggtgctcgt catggccgat ggcgccgact ccgcgaccag cgatctggcc 2561161 gagcgcaacg atctggtccg gcgcgtgcgt ctaggcgtgc tgcaacacca gcggcaggcc 2561221 gtcaccacgt tgcgcaacca aaacctcatc gacgacatcg tgctgcgcga gctgcaggcg 2561281 gcgatggatc tagaggaagt gcaactcttg gaccccgccg acgccgagtg agccggcgcc 2561341 gcccgctgat cgaaccagca acggttcagg ttttggccat tgctttcaca gactcattca 2561401 gcgtttcatt gcactggccg cagcgcgagc agggctgccg cacagcgatc ttggcgccta 2561461 tgcgaaggtg gtgcgatggt gatgtggacg ggcgaaagtt actgccaccg gcacgccgca 2561521 ctggcaccca acagaggagg atcaggcccg ccgcacccag ggtctacacg accggcgaca 2561581 tcctgcgtga tcggaagggc atagcgccat ggcaggaaca acgcgaaccg ggctgggcgc 2561641 cgttcggttg gctgcacgag ccctcgggcg caaggtgccc aaaagccgac gggcagtcag 2561701 tctaagtgtc ttgataggtg cggtgatagc agctcttgcc ggggcgctga ttgcggtaac 2561761 cgtaccggcg cggccgaatc gccctgaggc cgaccgtgaa gcactgtgga aaatcgtgca 2561821 cgaccgttgc gaattcggct atcggcgtac cggtgcgtac gctccctgca cattcgtgga 2561881 tgaacagtct ggaacggcgt tgtacaaagc ggattttgat ccgtaccagt tccttttgat 2561941 cccgcttgct cgtatcaccg gaatcgagga tcccgcccta cgggagtcag cgggtcgcaa 2562001 ttacctctac gacgcttggg ccgcacggtt cctcgttacc gcgcgcctga acaactcact 2562061 tccagagtca gacgtagtcc tcaccatcaa cccgaagaac gcgcgcactc aggatcagct 2562121 gcacatccac atatcgtgtt cgtcaccaac aacatcggca gccctgagga acgtggatac 2562181 ctcagagtac gttggctgga agcagctccc catcgacctc ggtggtcgca ggtttcaagg 2562241 attggcggtt gacacgaagg cgttcgaatc caggaacctg ttccgggaca tctacctgaa 2562301 ggtaaccgct gacggcaaga aaatggaaaa tgcatcgatt gcggttgcca acgtagcgca 2562361 ggaccaattc ctgctgctct tggcagaggg aactgaggac cagcccgttg cagccgagac 2562421 tctccaagac cacgactgct ccatcaccaa gtcctgatag cacgatgcca gcgggccaca 2562481 cgacagggcg cagtgtgcga acctgacccc gccacggcgg gccgttgatg gcattttgct 2562541 agtgtcggag cggcaatccg cctatatttc tcctcgccta ccagtgaggg agccgggctt 2562601 gactgatccg cgccacaccg ttcgaatcgc tgtcggagct accgcgctcg gcgtgtcggc 2562661 actcggggca actctgccgg cctgctccgc acacagcggg ccgggttctc cccccagtgc 2562721 gccgtcagct cccgcggccg cgaccgtcat ggtagaggga catacgcaca caatttccgg 2562781 agtggtcgag tgccgcacct cgccagcggt aaggacggcg acgccgtcgg agtcggggac 2562841 tcaaactaca cgggttaacg cacacgacga ttcggcctcg gtgacactgt ccctgtccga 2562901 ctccacgccc ccagacgtca atggttttgg tatctccctt aaaatcggaa gcgtcgacta 2562961 ccagatgccc taccagccgg ttcagtcccc aactcaggtc gaagcgacca ggcagggcaa 2563021 gagttacaca ctgaccggga cgggtcacgc ggtgatcccg ggccaaaccg gcatgcgtga 2563081 gctgccgttc ggggtacatg taacctgtcc gtaactacac tgattgcgcg acaagggaat 2563141 tagccgcgtt ggcaggcaac acggaggtga ccggtgcaag cccgtggtca ggtcctgatc 2563201 accgccgcgg aactggctgg catgatccag gccggcgatc cggtgtcgat cctggatgtg 2563261 cgctggcggc ttgatgaacc tgacgggcat gcggcctacc tacagggtca cctgccggga 2563321 gcggtatttg tgtcactcga ggacgaactg agcgatcata cgatcgccgg ccggggccgg 2563381 cacccgctgc cgtcgggggc tagtctgcaa gccaccgtcc gccgatgcgg aatccgacac 2563441 gatgtgccgg tcgtggtcta cgacgactgg aatcgagccg gttccgcgcg agcgtggtgg 2563501 gtgttaactg cggctgggat cgcgaatgta cgcattctag acggcggctt gcccgcgtgg 2563561 cggtccgcag gcggcagcat cgagaccggc caggtcagcc cgcagctcgg gaatgtgact 2563621 gtgctgcacg atgatttgta tgccggacag cggctaaccc taacggcgca gcaagccggt 2563681 gcgggtggtg tgacgctgct cgatgcgcgc gtaccggaac gtttccgcgg cgatgtcgag 2563741 cccgtggatg cggttgccgg tcacatcccc ggcgccatca acgttcccag cggtagtgtc 2563801 ctggccgacg acggcacgtt ccttggcaat ggcgccctta acgcactgct gtccgaccac 2563861 ggcatcgatc acggtggccg cgtgggtgtc tactgcggct cgggtgtcag cgcagctgtc 2563921 atcgtcgcgg cactggcagt gatcggccag gatgcggagc tgtttccagg gtcatggtcg 2563981 gagtggagtt cggatccgac ccgtcccgtc ggccgtggca ctgcatagtc agacgccggc 2564041 ccagttctgc aggaaggctt cggtgacccg ggcggcgttg ttggccgcaa tctgcttgta 2564101 aacgaagaac tggacgggga agcccggcag atgcagcggg tcgccgggcc cgtcggacat 2564161 accgcgaatt cccaggaacg ggacgccgtg tgcatcggcg accgcctgcg cggctgccgt 2564221 ctcctggtca accgcgtcga agccggggtt caccgtcgat acgatgttca ggttgctgat 2564281 cagagcgttc ttcagccagg gtcccgccgc ctggaaaaag ttaccggtat agccaagtga 2564341 gcgatcgggt gcactacagg gttggcagca aaaacgctgc cgccgttcgg gatgcaagga 2564401 aaagcctggc cgttgttctt gtcggagcta gacccgtcac cgccgacgaa cagttgcggc 2564461 tggcgcccca ggtggttcaa ccggacgacc ggaacgttcc tgcacagaca gacaggattg 2564521 ccgagcgtgt tgatgttgtc cagtacaaca gaaagcgtct gggcagtagc cagcatgccg 2564581 ggatcgaccc cacggaatgt tgccccgttg tccagggtcc accgtgctgg tattgccacg 2564641 tccccaatgc tggtgcggcc ggcaccaccg gcgacgcccg agaacatcac ggcggcaatg 2564701 gcaatggaag aagcacaggt aaagcgtgcg aaggcggtct cggtggtgtt ggtagcgttc 2564761 actaggccga tgccggtcat cgccacaatc accttcttgc cgctgatcga gcccaggtag 2564821 tagcgacgac ggtcggcgac caccaccggg ttggcgtcca gcgcggtgtg cgccagcacc 2564881 gcgtcggcct cagccggaaa cgccgacaag accagcgtgc gctgttcgca cgggatcaca 2564941 tttgccacgt atccgggatc ggccgccgcc acgccacagc ccagcgacaa cgcggccgcc 2565001 accaaaagac agtgccgcaa aggcgcgccc acaatccctt atccccaaaa atcgtgattt 2565061 gacatggatg ccggaactct ctgtcattta gccgtggccg atttggggct tggccctgat 2565121 tttcgcgcac catcggcgac ggacgaatat ttgttatcgt ttttttcgtc tagcgattcc 2565181 tcggcgttat ttcatcgcgg cggaacgagc cgccctatga ccaactgtgc aagcgtgatt 2565241 ggtcgatagc cccggtcggg ctatgttccc cggtgtggct agaccagttg accggtgcgg 2565301 gacgcggata cggctagtct gccggagtga tacctaaccc actcgaggag ctaacgctcg 2565361 agcaactgcg aagccaacgc acgagcatga agtggcgtgc gcacccagcc gacgtcttgc 2565421 cgttgtgggt cgcggagatg gacgtgaagc ttccgccgac ggtggccgat gccctccgta 2565481 gagctatcga cgacggcgac accggatatc cctatggaac ggagtatgcc gaagccgtcc 2565541 gcgaattcgc ttgccaacgt tggcaatggc acgacctgga agtgagccgc acggccatcg 2565601 ttcccgacgt catgctcggc atcgtcgaag tgctgcgtct gatcaccgac cgcggtgacc 2565661 ctgtgatcgt caactccccg gtatatgcgc cgttctacgc tttcgtgtcg catgacggcc 2565721 gccgagtgat cccagcgccg ctgcggggag acggccggat cgatttggac gcgctgcagg 2565781 aagcgttctc gagcgcgcgt gcttcaagcg gctcgagcgg caacgtcgcc tacctcctgt 2565841 gcaatccgca caacccgacg gggtcggtgc acaccgccga cgaactgcgc ggcatcgcgg 2565901 aacgcgccca acggttcggt gtccgggtgg tgtccgacga gattcatgcc cctcttatcc 2565961 cgtccggggc acggtttacg ccctatctga gcgtccccgg tgcggaaaac gcattcgcac 2566021 taatgtcggc ttccaaggcg tggaatctcg gcggactcaa ggcagccctg gccattgccg 2566081 gtcgcgaggc ggcggccgac ctcgctcgga tgcccgagga ggtcggtcac ggccccagcc 2566141 acctgggtgt catcgcgcac accgcggcgt tcaggactgg tggcaactgg ctcgacgcgc 2566201 tgctgcgcgg tctggaccac aatcgaacgt tgctaggcgc tctggtcgac gagcatcttc 2566261 ccggggtgca ataccgatgg ccgcagggta cttacctggc gtggctggat tgccgagaac 2566321 tcggcttcga tgacgcggct agcgacgaga tgaccgaagg cctggcggtg gtgtcagatc 2566381 tgtccgggcc agcccgctgg ttcctcgacc acgcgcgggt tgcgctcagt tctggtcacg 2566441 tcttcgggat tggcggtgcc gggcatgtgc gcatcaactt cgcgacctcc cgagccattc 2566501 tcatcgaggc ggtatcgcgg atgagccggt cactactcga gcgccggtag cgcgtccaga 2566561 gaaccgctag cgccaacacg atcacctcgg gtgacggtct tgtccgctcg gcggcccttc 2566621 agtgcccagc caatgcggcc gaccccgcgg cggccgcatt cggtagacaa aggaagtctg 2566681 acaccgtagg cgcctcgttg atcgcgtttt cgccgagaaa cgtgaaggcc gtttgcccgc 2566741 ccgtgcggat cagctacgat caaggcgaca catggaccag tcggccaacc atgcgtgtct 2566801 gcccaccccg ctggcgagca caacagggcg cgggcaagat catgagatgc ctgtcgaaga 2566861 gacctccacc ccccagaagc tgccccaatt tcgttatcac cccgatcccg tcggcaccgg 2566921 ctcgatagtc gccgacgagg tgagctgcgt gagctgcgag caacgtcggc cctacaccta 2566981 caccggcccg gtgtatgcgg aggaggagct taacgaggcc atctgtcctt ggtgtatcgc 2567041 agatggcagt gcggcgagtc gcttcgatgc cacgttcacc gacgccatgt gggcggtgcc 2567101 cgacgacgtt ccagaggacg tgaccgagga agtgctgtgc cgaacacccg ggttcacggg 2567161 ctggctgcag gaggaatggt tgcatcactg cggggacgcc gccgccttcc ttggcccggt 2567221 gggcgccagc gaggtggccg acctccctga cgccctggat gcgctgcgca atgagtaccg 2567281 cggctacgac tggcccgccg acaaaatcga ggaattcatc ctgacgctcg atcgaaacgg 2567341 gctggcgacc gcctacctct tcaggtgcct gagctgcggc gtccacttgg cctacgccga 2567401 tttcgcttaa cctcggcggc gactgagtcg acgcgagcgc ggatatcgga cgcttttgca 2567461 caacaatggt tccgacgtgg cacagctcag agaggagcag atcatggatg tcctacgcac 2567521 cccagactcc cggttcgaac acctggtggg ctacccgttt gcaccgcact atgtcgatgt 2567581 gacggccggc gacacccagc cgttgcgaat gcactacgtc gacgagggcc cgggcgacgg 2567641 tccgccgatc gtcttgctgc acggcgagcc cacctggagt tatctgtacc gaaccatgat 2567701 tccgccgctc tccgccgccg ggcaccgtgt gctcgcgccc gacctgatcg gcttcggccg 2567761 ctccgacaag ccgactcgca tcgaggacta cacctacctg cggcacgtcg agtgggtgac 2567821 gtcctggttc gagaatctcg acctgcacga cgttacgctc ttcgtgcagg actgggggtc 2567881 attgatcggt ctgcgcatcg ctgccgagca cggtgaccgg atcgcgcggc tggtggtcgc 2567941 caacgggttt ctccccgccg cgcaggggcg caccccactc cccttctacg tgtggcgggc 2568001 gtttgcgcgc tattctccgg tgcttcccgc tggccgtctg gtgaacttcg gcaccgtcca 2568061 cagggttccc gccggggtcc gagccggcta cgatgcacct ttccccgaca aaacgtatca 2568121 agccggcgcc cgggcgttcc cacggttggt gccgacctca cccgacgatc cggcggtacc 2568181 ggccaaccgc gcggcatggg aagccctggg ccggtgggac aaaccgttcc ttgccatctt 2568241 cggttatcgc gacccgatac tcgggcaagc ggacggtccg ctgatcaagc acattcccgg 2568301 cgcggcgggt cagccgcacg cccgcatcaa ggccagccac ttcatccagg aggacagcgg 2568361 aaccgaactc gccgaacgca tgctctcctg gcagcaggca acgtaaccgc gacggctgcg 2568421 gacgaaggat cggcagaatg gcgatggaga tggcgatgat gggcctgctc ggcaccgtgg 2568481 tgggtgcctc ggccatgggc atcgggggga ttgcgaagtc gatcgcggaa gcgtatgtcc 2568541 cgggggtcgc ggctgccaag gaccgtaggc agcagatgaa cgtcgatctg caagcacggc 2568601 gctacgaggc ggtgcgagtg tggcggtctg ggttgtgcag tgccagcaac gcctaccggc 2568661 aatgggaggc cgggtctcgg gacacccatg cgcccaacgt cgtcggcgac gagtggttcg 2568721 aaggtttgcg gccgcacctg cccaccactg gggaggcagc gaagttccgt accgcttacg 2568781 aagtccgttg cgataaccca actctcatgg tgctttcgct tgagattggc cgtatcgaga 2568841 aggaatggat ggtggaggcg agcggccgga caccaaagca ccggggatga ctgcgaagac 2568901 tcgcggttgg tagcgcaccc ggctggtgcg gcgccgacaa gctgcccaca ttcggtgaca 2568961 ctgaatttct gcagcaaaag cgcgagtgac caacggtctg cgaaattacc ggctcggggt 2569021 cggctacacc gtcgagcgac gcggtcgccg ccgcgccgag cccctcggta cggtggcaga 2569081 catgaaatat ctggacgtcg acggaatcgg acaggtcagc cggatcgggt tgggcacttg 2569141 gcagttcggc tcgcgtgaat ggggatatgg ggaccggtac gccaccggcg ccgcccgcga 2569201 cattgtcaaa cgcgcacgcg ccttgggggt cacgctgttc gataccgccg agatctacgg 2569261 cctgggcaaa agcgagcgta ttctcgggga ggccctcggc gacgaccgca ccgaggtggt 2569321 ggtggctagc aaggtcttcc cggtcgcgcc gtttccggcg gtgatcaaga accgcgagcg 2569381 cgccagtgcg cggcggctgc agctgaaccg tatcccgctg tatcagatcc accagcccaa 2569441 cccggtggtc cccgattcgg tgatcatgcc ggggatgcgt gacctgctgg acagcggcga 2569501 cattggcgcg gccggtgtct ccaactactc actggcgcga tggcggaagg ccgacgccgc 2569561 gcttgggcgc ccagtcgtca gcaaccaggt acatttctcg ctcgcccacc ctgatgcgct 2569621 cgaagatctg gtgccgttcg ccgagctcga gaaccgcatc gtgatcgcct acagcccgct 2569681 ggcgcaagga ctattgggtg gcaagtacgg actcgagaat cgtcccggtg gcgtgcgcgc 2569741 gttgaacccg ctgttcggca ccgagaacct gcgccggata gagccgctgc tggctacgtt 2569801 gcgcgccatc gccgtcgacg tcgacgccaa gcccgcccag gtggcactgg cctggctgat 2569861 tagcctgccg ggggtggtcg ccattcccgg agcgtccagt gtcgagcaac tcgagttcaa 2569921 cgtcgcggcc gctgacatcg agctcagcgc gcaatcccgc gacgcgctca ccgacgccgc 2569981 ccgggcgttt cgcccggttt ccaccggccg cttcctcacc gacatggtgc gtgagaaggt 2570041 cagccgtcgt tgagctcgct acaaggtacg cgcgagacgt tcggccagca gctcggcgaa 2570101 cctcgccgga tcctcgagtg cgccgccttc ggcgagaagc gctgtgccgt aaagtaattc 2570161 cgcggtttcg gccaatgatt tctcggcatc gtctgcgcgg tcctggtggg cttggcgcag 2570221 gccggtcacc aacggatggc tcgggttgag ctcaagtatc cgcttgccga ccggaacctc 2570281 ctggccggaa gcccggtaga tgcgcgcgag cgcgggtgtc atcccgaagg catcggtgat 2570341 cagacaggcc ggtgactcgg tcaggcgggt ggacagccgc acctccttga cgtgatcgct 2570401 caacgtctcc tgcaaccagg tcagcaggtc ggcaaattcc ttctgccgct cctcgcgctc 2570461 ggcctcgctg gtgtcctctt cggaactcaa gtccacctcg cccttggcaa ccgactgcag 2570521 cggtttgccg tcgaactccg gcaccattcc cacccagacc tcgtcgaccg ggtcggtgag 2570581 cagcagcact tcgtacccct tggccttaaa cgcctccagg tgcggtgact tcagcagttg 2570641 ttggcgcgtc tcgccggtgg cgtagaagat ctgttgctga ccgtccttca tgcgctcgac 2570701 gtattcggcc agcgtggtgg gttcctcctc gctgtacgtg gagacaaacg aagaaatacc 2570761 gagcagggtc tcccggttat cgatgtctga cagcagtccc tctttgagga ccctgccgaa 2570821 ctgtgtccag aacgtgcggt agtcctccgg ccggctggac tgcacgtcct tgatcgtgga 2570881 cagcaccttc ttggtcagcc gccggcggat ggccttgatc tgccggtcct gctgcaggat 2570941 ttcgcgagaa acgttgagcg acatgtcctg cgcgtcgacc acacccttga caaaacgcaa 2571001 gtactcgggc atgagctggt cgcagtcgcc catgatgaac acccgcttga cgtagagctg 2571061 gataccgacg tgggcgtccc ggtcgaacag atcgaacggg gcatgagacg ggatgaacag 2571121 cagggcctgg tactcgaagg tgccctcggc cttcatcgcg atgatctcga gcgggtcgtc 2571181 ccaggcgtgc gcgacgtgtt tgtagaactc cttgtactcc tgctcagaca cctcttcttt 2571241 gggcctcgcc cacagcgcct tcatcgagtt gagggtttcg gtttcgatgg tgacggtctc 2571301 ctcgccgcct tccccccctt cttcctggga ggctggggtg cggcgctcga cgtccatccg 2571361 gatgggccag gcgatgaagt cggagtattt cttgaccagg ttacggatct tccattccga 2571421 ggtgtagtcg tgcaggtcgt cctcggcgtc ttccggcttg aggtgcaggg tgaccgacgt 2571481 gccctggggg gcatcctcga cggactcgat ggtgtaggtg ccctcaccgc tggactccca 2571541 tctggtggcc gcgctctcgc cagccttgcg ggtaagcagt tggaccttgt cggccaccat 2571601 gaacgacgag tagaagccga tgccgaactg accgatcagt tcctcggagg cggccgcgtt 2571661 cttggcctca cgcagctgtg cgcgcagctc ggcggtgccc gacttggcca gcgtgccaat 2571721 cagatccacc acctcctcgc gcgccatccc gatgccgttg tcacgaacgg taagagtcct 2571781 tgcagctttg tctgcgtcga tctcgatgtg cagatcggag gtgtcgacct ccaggtcctt 2571841 gttccgcagc gcctcaatcc gcagcttgtc tagcgcatcg gaggcattcg agatcaactc 2571901 ccgcagaaac gcgtccttat tggagtagac cgagtggacc atcaaatcca gcagttgccg 2571961 ggcctccgcc tgaaactcca actgctcgac atgggcgttc atgagattcc ttccgacgac 2572021 atagcgactc gaatttagcg agctgcgatc cggcgccgag ctgggggtgg cctggctagg 2572081 ccgtatcgcg agcaagctga tagaggtcgg gatcgtgtgc gcagacgatg agtagatccg 2572141 ggtcgtggcg tcgatggagt tcgacgattc gggcctggtt gtcgcgcagt tggttgcggt 2572201 tatacgacaa cagcttttcc tcggcccgca tcacgaaggg cacccggaac cggccatcga 2572261 gggtgccgcg atgatagaag gcgtcgccgc agtgcaaaac ccagcggtga ccggcatcga 2572321 cagctaccgc ggcgtgcccg cgggtgtgac cgggcatcgg caccagaacg acaccggtgc 2572381 cgatggaatc gaggggtttg gccgatgcga atccgcgcca gggttccccg tcgggaccgt 2572441 gctccaccag cttcgggccg tgggcccact gtccgcgtcg atatcgcagt cgctcgcgga 2572501 gcgaaggggc gtggatggca ccgcgggctt cggcggcggt gacgtggagg tgagcctcgg 2572561 ggaagtcggc gatcccgccg atgtggtcga agtcgaagtg ggtgagcaca atgtgtcgaa 2572621 cgtcggacgt gcggtagccg agctgttcga tctggcgggc cgcggtttcg gcctgcaaga 2572681 atgccggccg caggacatga cggaatagac ctacccggcc ggggtcaagg cagtcctgga 2572741 taccgaagcc ggtgtccacc agcaccaatc catcgtcggt ctcgacgagc agaacgtggc 2572801 ataacagagc gatgccaaat gcattcatgg tgccgcagtt gaggtggtgg accttcaccg 2572861 gcggtccctt cgcttcgggg gcgacaccta acatactggt cgtcaaccta ccgcgacacc 2572921 gctgggactt tgtgccattg ccggccactc ggggccgctg cggcctggaa aaattggtcg 2572981 ggcacgggcg gccgcgggtc gctaccatcc cactgtgaat gatttactga cccgccgact 2573041 gctcaccatg ggcgcggccg ccgcaatgct ggccgcggtg cttctgctta ctcccatcac 2573101 cgttcccgcc ggctaccccg gtgccgttgc accggccact gcagcctgcc ccgacgccga 2573161 agtggtgttc gcccgcggcc gcttcgaacc gcccgggatt ggcacggtcg gcaacgcatt 2573221 cgtcagcgcg ctgcgctcga aggtcaacaa gaatgtcggg gtctacgcgg tgaaataccc 2573281 cgccgacaat cagatcgatg tgggcgccaa cgacatgagc gcccacattc agagcatggc 2573341 caacagctgt ccgaataccc gcctggtgcc cggcggttac tcgctgggcg cggccgtcac 2573401 cgacgtggta ctcgcggtgc ccacccagat gtggggcttc accaatcccc tgcctcccgg 2573461 cagtgatgag cacatcgccg cggtcgcgct gttcggcaat ggcagtcagt gggtcggccc 2573521 catcaccaac ttcagccccg cctacaacga tcggaccatc gagttgtgtc acggcgacga 2573581 ccccgtctgc caccctgccg accccaacac ctgggaggcc aactggcccc agcacctcgc 2573641 cggggcctat gtctcgtcgg gcatggtcaa ccaggcggct gacttcgttg ccggaaagct 2573701 gcaatagcca cctagcccgt gcgcgagtct ttgcttcacg ctttcgctaa ccgaccaacg 2573761 cgcgcacgat ggaggggtcc gtggtcatat caagacaaga agggagtagg cgatgcacgc 2573821 aaaagtcggc gactacctcg tggtgaaggg cacaaccacg gaacggcatg atcaacatgc 2573881 tgagatcatc gaggtgcgct ccgcagacgg ctcgccgcca tacgtggtgc gttggctggt 2573941 aaacgggcac gagacaacgg tgtaccccgg gtcggacgcg gtcgtcgtca ccgccaccga 2574001 gcacgcggag gccgaaaagc gcgctgccgc gcgggccggg cacgcggcga catagccggt 2574061 gaaaagctct gctggcgatg tggggcctac aggtctcacg tgtcgagccg cagcacacgt 2574121 gtggcgttac gccatagcca gtcctccagg acttcccggg ggaccggcag cgcacgcatc 2574181 tcgtcgcaca attgcaggta agggcggttg atgagaaatc cgccggtacc gtaaacgatc 2574241 ttgttgcgga ttgttgtctg cccaaaccgc atcagcggct cccatccagc gcccggtgaa 2574301 gcgaagtact tgggacggtg cgcggccaat tccaggtaga cgttcgggtg tttccaggcg 2574361 atcaggcatg cctgcagcac ccacgggtag ccgccgtggc tcatcaggat cgttaactca 2574421 gggaagcggc aggcaacgtc gtcgatgtgg cggggatggc cgagatcgct gagccgtgtc 2574481 cgagtccaat cggcggaggt gtggatggaa acgggcacac caagctcgac gcatttggcg 2574541 tagcaaggga agtaggcggg gtcggatgcg ggccgtccaa tcatgaacgg acgcaagctc 2574601 aacccgcgga aaccgtgctc gaccacccag cgctcgaact cgtcgactgc cgagtcgccg 2574661 gccaggatgt cggcaccggc gaagggtagg aaccgatctg gatagcgggc cgcgacggcg 2574721 gccaccgagg cattgtggac aaaggtgaca ccacacgtgg accgttcatc gaatcccgtg 2574781 atcagactgc gggtaatccc ggcgtcgtcc agggagtcca gtatttggtc gtctgtcctg 2574841 cgtagcgact ccgcgtaggc accgaactgc tcggcgctga tcgtcgtctt ggtgaagacc 2574901 tcgaaatacg acagcagctc gacgggaaat ccttcccgaa gatcgtcaat gacctcggcg 2574961 gacggaacga acggtgccca catatcgatg accggcaccc gcggttcggg cgcggtcatg 2575021 gggtgctccg cgggccaacc ggaccgtgca ggaagtcatc gaatccggca tcgcgctcca 2575081 cggcgaatgc ctgctcgaac gtcgtggcgg ggcggccggt gcacgtcgcc tccttgacat 2575141 cgactcgctt cccggtgagg tcgcacagga cacaacggtc cagcgcgccg tcgtcggctt 2575201 cctcggtagc aatgtcgtgg ctcatcgctc ctccgttgac tgtgtcgacc agctgagcat 2575261 gcgctcttat gcgattacgc caagtcaact gaccccgccg acgcttcgca tacctagtgt 2575321 cggccagggc cacctggccc gcccggacct cccggcccgc ctggtccgcc cggacccccg 2575381 ggtccgcctg gtcggtttac ggcggggagc cagaacacgc attgattcaa gtcggggctc 2575441 cacacccagc cgggcgcgca atcatcgtcg gcccggctgt tcgccggctc gacaagtccg 2575501 gtcaccgcca acaccgtcaa caccgcagtg aacgcgcaaa gcgcgcagcg tcgtagaaaa 2575561 tgcctcatcg cagacctcac ggtttgtcgt ccggcgctgg acctaggtta tcgccacgac 2575621 cgccgcggcg gcagcacacg tggcgactca ccgcggccgt agaaccggtt gagcagcaag 2575681 ccactgcgcg ttggtaagag cggatccaag cgccggcaac ggatggtcgg cgagggcgct 2575741 gatcgggcaa cgatgcccag gccaggcggc cccagcgaac gccgcaccgg ctggaggaag 2575801 atagccccat gacccaaacg ctgcgcctta ccgcgctgga cgagatgttc atcaccgatg 2575861 acattgacat cgttccttcg gtgcagatcg aggcgcgggt gtccggtcgt ttcgacctcg 2575921 accggcttgc cgctgccctg cgcgccgccg tcgccaagca cgccctggct cgggcgcggc 2575981 ttggccgcgc cagcctaacc gcacggacgc tgtattggga ggtacccgac cgcgcggatc 2576041 acctcgccgt ggagatcacc gatgaacccg tcggtgaagt tcgcagtcgc ttttatgcgc 2576101 gggctcccga actgcaccga agcccggtct ttgccgtcgc ggtggtacgc gagaccgtgg 2576161 gcgaccgcct cctgctcaac ttccaccacg cggccttcga cggcatgggc gggctgcgtc 2576221 tgttgctctc actggcccgg gcctatgcgg gcgagcctga cgaggtcggt ggccctccga 2576281 tcgaggaagc ccgcaacctt aaaggcgtcg ccggctcccg cgacctgttc gacgtcctga 2576341 tccgcgcccg cggcctggca aaaccggcca tcgaccggaa gcggaccacc cgggtcgccc 2576401 cggatggcgg ctcgcccgac gggccgcgct tcgtgttcgc cccactcacc atcgagagcg 2576461 acgagatggc aaccgcggtt gctcgtcgac ccgagggggc gacggtgaac gacctggcga 2576521 tggccgcgct ggcgttgacg atcctgcagt ggaaccgcac acacgatgtc ccagccgccg 2576581 attccgtgtc ggtgaacatg ccggtgaact tccggccgac cgcgtggtcg accgaggtca 2576641 tctcgaactt tgccagctac ctggcgatcg tgctgcgggt cgacgaggtg accgatctcg 2576701 agaaggcgac cgccatcgtc gccgggatca ccggaccatt gaagcaatcc ggcgccgccg 2576761 ggtgggtcgt ggatctgctc gaagggggaa aggtgttgcc ggcgatgctc aagcgccaac 2576821 ttcagctgct tctccccttg gtcgaagatc ggttcgtcga aagcgtctgt ctgtccaacc 2576881 tgggccgcgt cgacgtcccc gctttcgggg gcgaggccgg ggacaccact gaggtgtggt 2576941 tcagtccgac ggcggccatg agcgtcatgc cgatcggggt tggcctcgtc ggcttcggag 2577001 gaacgctgcg cgccatgttc cgcggcgacg ggcgaaccat cggcggcgag gcgctgggcc 2577061 gcttcgccgc actgtatcgc gacacactgc tgacctgagg gcccggcatg accgacaacg 2577121 agtgcccggc cgacagccga cggcgccatg tcctgcggct cgccctgttc gccgggattt 2577181 tgctggggct gttctacctg gttgcggtgg cacgagtcat ccacgtcgac ggggtccgta 2577241 gcgcgatcgt ggtggcgacg ggtccgatcg cacccctggc gtacgttgtg gtgtcggccg 2577301 cactcggcgc gttgttcgtc ccgggcccga tcctcgccgc cggcagcggg gtgctgttcg 2577361 ggccgctact agacaccttt gtgaccctgc cagctttctc ggccggcgcg caggccggaa 2577421 tgacgcccag gcgctgctgg gtgtcgatcg cgcccatcgc ctcgatgcac agatcgaacg 2577481 gcgcggattg tgggcggtgg tcggtcagcg cttcgtcccc ggcatctcgg atgcgctggc 2577541 ctcgtacacc ttcggggcgt tcggagttcc gttgtggcag atggtcgttg ggtcgttcat 2577601 cgggtcggcg ccacgggtgt tcgtctacac cgcgctgggc gcgtcgatca ccaacctgtc 2577661 gtcgccgctg gtttactcgg cgatcgcggt gtggtgcgtg accgccatca tcggggcgtt 2577721 cgccgcgcgg cgttggtacc ggaagtggcg tgcgcgcccg cgccggcggt gcggcctggc 2577781 tcagctcacg accggtagtc agcaacgcca cacgagtcac cggacaccgg cgggcgtcgt 2577841 catgcccggt tcactgtccg agcaccgccg tctccgtcaa gaagcgccgg atcgcatcga 2577901 gcatcacccg cccatcgagt agttccgggt cgttgtgacc cacaccggga accaccacgt 2577961 atcgcttagg ctcggcggcc gctgcgacca gccgctcact aagcgtagcg gggacgatgt 2578021 cgtcgctgcc gcccgcgatg accagcaccg gcgcgtgtac agaggcgatg cgctcgatcg 2578081 acgggtagtg gtccagcagc aaccggcgca gcggcagcca cgggtagtgc accgcgccga 2578141 cctcggccag cgacgtgaac ggagatctca gcacgagtgc cgccggcggc cgttgcacgg 2578201 ccagcccgac cgccaccgcc gcgccgaggg attcgccgaa ataggcaatg cgcgcggggt 2578261 cgacgtcgga ctggccggac agccactcct gcgcggcccg agcgtcggcg gccaggccct 2578321 gctcagacgg ccgacccggg ttaccgccgt agccgcgata gtcaaacagc aacaccgaca 2578381 ggcccaggcc atgcagcgcg acagccagct ccgcacgcat cgaccggtcg ccggcgttgc 2578441 cattgcacac cagcaccgcg ggcccactac cgcccgaagt atgcgggaag taccagccac 2578501 ccaagcgcat tccatcttgt gtttcgacca cgacatcgcg gccggcgggc aaaacggagg 2578561 aagccgatgg caccggaccc gcagacggga agtagattag ccgacgctgc tgcgaccaga 2578621 tgaacataat cacgcccgat gccaccagcg cgacgatagc gaccaccggc aacgcgcgac 2578681 acctctttag cgacatctag ccccgcaccg gtgcgacgca tcgaaagcgg ggtccccgcg 2578741 accagtggat taccgaaacc accgttccaa acagaaaatc gacacgaaat tcaacgacgc 2578801 ggcgggccgg cgatggccac gagacaccca caaccagcaa ccgccccaat catcacgcca 2578861 accagctcag tacaccgccg tggcgcgaac acgtgcctga ccggtgtgtg ctgaacgagt 2578921 acgacccgtc cctacaaatt gcggtggcgc cgggtggcgc ccccgaacct ggcggcactt 2578981 gccggggagc aggtatgcac tgaccgtcca cgttctcgta gtagccgcta ggacaggcaa 2579041 acaccgaagt cggcgtcgac ggagaaatgg ccgggacgaa gccgaaacca actgccgccg 2579101 caacaacgcc gacggcaaac cgcctccgag cagacactgc tagccttcga tcatcacgct 2579161 tacgactccg cgtcccagca aagcgtaccg agtacatcgc cagccgggaa gggatatggt 2579221 cccgcgacta gcggatcagc agagtgcgca gttccagtgc tctggcaaac caacacgtat 2579281 tgctcgccga tccaacatat tcgttgaacc ttgagaaagg cttgcggcgc atcgcccagc 2579341 ccagcgccac tgccaccacg ggaggagaaa tccaaccgtc accacgacac cacggatagc 2579401 gaagatcaac aaatgccacc cacgttcggg cgcaccaagg aagccaccgt cgcgatactt 2579461 acctatcgtt gcatccgttc tggcgatatt tttcaactcg cattcatgcg ccccctccgc 2579521 aagagccggg agcggctaat ggtggcaccg ggctaccatc gtcaataaca cacgacaagg 2579581 taagcgtcgt accaacaaac ggcgctggta cccgcacttg atgccaatag ctgccgtctg 2579641 gatatctgat tccgtcacaa tatccccacc cggtaatccc accaaagcca ccgccagggc 2579701 aatatcccat cgcactattc ggcatgtgcg gatcttgtcc cggcggtggc gcgggttcgg 2579761 cgttggcaga aaccatagaa aattcaactg ccatagtcaa tgtaccgatt gcgatagcaa 2579821 tactattttt atacattttc tcaacacctg aattcattcg tgtggggaat gcagcctttt 2579881 ggcccccaca tgcccggtgt cccatcgctg gcgggccagt agggacttct tccacggccg 2579941 gaagatcatt gcgcgttggt tgtgcgagcg ggcggctgac ggcttcgcat aatggcgtgg 2580001 acgggctgtc atcgttgtcc ctcagcgcta caacaagtca gggaaactct tcacaggcgg 2580061 tgccgtcgtc gccgtggtcg aggccaagac ggtaacccgg ctcaccccat agagcggggc 2580121 cacccccgcg tcccgccttg cagttctggt agtaccggaa ccacgcgggt atcggcgttg 2580181 gggctgcatg agccacaggt ggcgccacat cgccgaccgc gatcacagct gcgaggaccg 2580241 gtggacgctg catgatgagc cctacgtgta gtaccagacg gctttggttg tgactggctg 2580301 gtcagtcgcg taaaccgtgg acctggctac tgctgaaagt accatgacgc ggggcaacga 2580361 aacagcagca acgtcgacag acagcggaac tgtcggctac cgccgataac gttgtgtcat 2580421 gcgtgcggac atgtccgtca cctcgatgct cgaccgagag gtctacgtat acgccgaggt 2580481 cgataagctg atcggcctcc ccgccggcac cgcgaagcgg tggatcaacg gctacgagcg 2580541 tggcggcaaa gatcacccgc cgatcctccg cgtcacgccg ggagctacgc cgtgggttac 2580601 gtggggcgag ttcgtcgaga ctcgcatgct tgctgaatac cgcgaccgcc ggaaagtgcc 2580661 aatagtgcgg cagcgcgcag cgattgaaga actgcgtgcg cggttcaatc tccgataccc 2580721 gctggcacat ctgcggccgt tcttgtcaac gcacgagcgg gatctgacga tgggcggcga 2580781 ggagattggt ctgccggatg cggaagtgac gatccgtact gggcaagcgt tgcttggtga 2580841 tgcccggtgg ctcgccagca tcgcgacacc cggtcgggat gaggttggcg aagccgtgat 2580901 cgtcgaactg cccgtcgaca aggcctttcc cgaaatcgtc atcaacccaa gccgatatag 2580961 cgggcagccc acgttcgttg ggcgtcgtgt gtcgccggtg acgatcgccc aaatggtaga 2581021 cggcggtgag gaacgcgagg acctggccgc cgactacggt ctcagcctga agcagattca 2581081 agacgcaatc gactacacca agaagtaccg gctggcccga ctggtggcgg cataaggccc 2581141 ggcgatgctc gaagtcgaca aagtcaccca tgttgtcgat gaaaacctgc ttcggcttgg 2581201 tgtggccttg tcgccgtcag aaaagacacg gcccggtttg gccgcccgcc cgtcgacgac 2581261 ctgctaccgc aaggcatcct cgacaccgac tggatcccca tcgtcggggg tcgggtgggt 2581321 ggtcatcagc aacgacaggc atctccggac gcggccagtg gaggccgagc tggcggtcgc 2581381 ccacaagctc aaagtcgtgc acttgcatgg ccgtgtgggc ggactagtcc gcgtgggcac 2581441 agctgacgcg gctggctgcg cggtggccgg ccattgagca ccaatatgag aaggcaccgg 2581501 aagggccttg gtggttgtcg gtgcggagga gcaggaccgc cgtaatggag ttcgcgcccg 2581561 gcgccgtcga caccatagcg tcggacaaca tggctgccca aaatgtccac gatacggctg 2581621 tgaagacctc gaggtgatgg ccgaaaggtg accacctcgc agtggtagga cgacagcgac 2581681 ccgatcgaag gcaatgccgc cgcatcgagc gcgactttgg gcatgacagg atttcgagta 2581741 agcgcatcaa cgtgtccgaa atgtggggcg ggcggggctc gaacccgcga ccaacggatt 2581801 atgagtccgc ggctctaacc aactgagcta ccgccccttg tgctaactag ctgcagatat 2581861 gttctccacc gcgactgaat cagggtcgga ataccgcagt gatgccgcag cactcttgat 2581921 ggcctgcaca agcaaacctg ccacgcccgc caggtcgtcg ctgagcaggt ggccatggcg 2581981 gtccaaggtc atggccgctg tggcgtgtcc gagaagcctc tgcacgactt tgacattagc 2582041 gcccgcactg atcgccagcg acgccgtggt gtgcctcagc ccgtgcggga ccaggtcggc 2582101 aatgccaacc gccttgcatc ccttgtcgaa ggctctgcgg tactcctcga taggtaggtg 2582161 cccgccgcgg tagcttggga acacgagggc attgggctcg gttggcagtt catcacgcag 2582221 gcgctccgat accggctcgg ggacaggcac gtgacgcacc cggttggtcg tcgtctcgac 2582281 aatcccggcg ccggtcacac agatgagcga atcgtcaact cccggtcccc acgttcttgc 2582341 gacgcagggc cgctgcctcg ccgaagcgca gtccgcagta gccgagaacc agggtcagcg 2582401 tcagtaccgc aacaatccgt ggtacgaatc cgttactcac ccactccccc gcgctcggtc 2582461 ttggcagctt ccgcctctac ccacgcatcc aggtcggcga tatcgcaaaa cgtgtgtcgg 2582521 ccaaggcgat agctgcgcgg actggtgccc agtaacgcca gtacctcagc gtcgactcag 2582581 gtaggccgcc gaggtattcg gctgcggcct tggtgcccag ccgaaccaca gatgtcgccg 2582641 tcatcgctct acttcctgtc gtcgctcaac gcgcttatgt cccaatccct ttggcagtcc 2582701 cagggccgac cgcaaaatcc tttccattga ccgcacagta accattagcc cgatggcatc 2582761 taacaaccga agaacgccga gaagtcgaca ccaagatccc gatgtttgcc gtagcaggaa 2582821 cggcggtcac actcgggctg attcgagcca gtcgtacatg tcgcgccgcg tccagcgccg 2582881 atcccgcccg ccgaggcgca cataatgggg tcccggtgcg tcaattccgc gctctcgctt 2582941 tgcagcccag ccgtgcaggg tcgacactgg cacaccggtg atctcccgca cctagatggt 2583001 ggtaagcatc tccgcgtggc tttcgttgtc ttccatcatg tgctttggcc accagtagcg 2583061 acgacatcac cataaatcga caccctccgt tgaattgcgc cgtaaatcgc cacgacgaaa 2583121 gccgacggtc tccgctgcgc cggggcctac tcgccaacgg cctaagagag aggcaagctg 2583181 gggcattatt cgaacgttac gaaagccagt tcgattcatt cggatatatc gagaaggtgc 2583241 ggtatcgggg ctcagggtat cgagtcgaag acgtttatgc ccgagcggac agtggaccta 2583301 gcgccggtgc tgagcttcct gtcggcccat gagcggcggc gcggccgcac gctggccccc 2583361 agctacgcgc tggtgggcgc cacgagcacg accgcgtcga gctgccgcgc gaggttcatc 2583421 aggcgctaag gcaggtggtg gctgcgctgc acgccggcaa ggcggtgacc atcgcgccgc 2583481 agagcatgac gctgaccacc cagcaggccg ccgaccttct cggggtgagt cgtccgaccg 2583541 tggtgcgtct gatcaagagc ggcgagctgg ccgccgagcg catcgggaat cgccaccggc 2583601 tcgtgctcga cgacgtgttg gcctaccggg aggcccgccg gcagcgccag tacgacgcgc 2583661 ttgccgagag cgcaatggac atcgacgccg acgaggatcc cgaggtgatt tgcgagcagt 2583721 tgcgtgaggc gcggcgtgtt gtcgccgcgc gccgtagaac tgagcggcgg cgcgcctgag 2583781 accatcgctg catgctcgac acgtcgctgc tgtggtcaag ccggcagcgc gactttctgt 2583841 tgtcgttggc gacgtcgccg cgaactacga cgggcgggtg gtggtggcgc cgacaggcca 2583901 ggccgtcgac gtcgcggtac gtgaaggcgc cggcgatgtc ggctacagcg tcgagcgaga 2583961 gaatcttccg gccgacgatc cggtgcgcaa cggcaaccgc tggcgggtca tcgcggtcga 2584021 caccgaacac caccggatcg ccgcccgccg cctgggcgac ggcgcacgcg ccgccttcag 2584081 cggcgactac ctgcacgagc acatcaccca cggatatgcc atcaccgtcc acgccagcca 2584141 gggcaccacg gctcactcca cccacgctgt gctgggcgac aacaccagcc gagcaacgct 2584201 gtacgtggca atgacgccgg cacgcgagtc gaacaccgct tacctatgcg agcgaacggc 2584261 gggcgaaggc gcgcgagtgg atctcgccgg atgggacctt tgggtgagtg ggaaagctga 2584321 ggcaatgagt gacgagaaat ccgcatcgcc agtttggtgc cgtgtcggag ctcggtgcga 2584381 tcatcgggga aagcgttcct gctggtgagg gcagaattgt tgtgcacgtc gtgcgctata 2584441 ccgtggtgac gactcgccga agcatggact aaggaggtag ctgcgatgat gaaggagatc 2584501 gagctccatc tggttgacgc tgccgccccc agcggcgaga ttgcgatcaa ggacctagcc 2584561 gccctcgcga ctgctctgca ggaattgacg actcgaatca gccgcgaccc aatcaacacg 2584621 cccgggcctg gtcgcacaaa acagtttatg gaagagctct cgcaactggc cagcgccccc 2584681 gggccagaca tcgacggcgg gatcgaccta actgacgatg aattccaggc gtttcttcag 2584741 gcggcgcgtt cgtgaatcaa gtagcggcga cggtggtcga caccgacgtc ttcagcctga 2584801 tctaagacac cgactcgcgt gacctcggct gccgcgccca gacgccgtgg cacttctgcc 2584861 gttcggtcgc cgcctggctg gccggagtca tcacagcacg ctccaatagc gcctcatgga 2584921 atcagccggc gccctcgaat cgagccttac ccgcccgaaa cgacacgcct cgacggtacc 2584981 tggcgcgctg acctggccct acatcagtca acgtatacga accacagcgt cgcggagctg 2585041 ccagaccgcc gtcaaccgaa caccgtctga ccgtcaagcc caatgcgata ccgttcggtg 2585101 ccctgctgca ccctgggcgc atcagcaccc aacgacactg caaccttgtt gctggcgttg 2585161 cgcatgatgt caaaggtcag ctcgacggcc tcgtcgtccg agaaccggga gcgcacctcg 2585221 acggcaacgt cgacggcgag gtgcgcaggg gtccaaatta acgcatctgc atacctcagg 2585281 gcggcttttg cgcgaacgtc gagcaagacc gaggtatcga aacgctcgat ctcgccatac 2585341 aacgtctccg aaccgcccgc atcaagcgcg gagacctccc gcaacgactt gcacacccgg 2585401 caattgtgct gcgcagctcc acgcagccgc accagctcag aggtgaccgg gtccagtgcc 2585461 cgcatccggg ccaccgccgg cagaaatccg ttgaacaccg cagcggacag atcggtgttg 2585521 tgatcccagg agatcggccc ggttacccag cccagatact ccttgccgac gcccaatgct 2585581 tccaacccgg cgcgcacccg cggcacaaag tcggcgatgt acatcgccac aaccgcaccg 2585641 aaagcgtctt cccccaaatg cgtccacagc agggatcgct gctcgccggt gatcgctgag 2585701 acatcgacgc tgaactgctc ggcgaactcg gcaacgacgg cctcggccgg cgactccggc 2585761 tcgttcaccg caacctcaca cggcaacgac ggtagcgaca gcgcccgcgc gcacacctgc 2585821 ctcaccagcc ccgcaatccg gccgtcgccc ggcgatagcg ccaccaaccg acacagatcg 2585881 tcacgaaccg aaaccggggc cggcaccctt cacacgctac tgcgcctggc tcaccgagga 2585941 catgtggaag tcgggaatcc gcagcggcgg catcgcggta cgggtaaccc aatctgacca 2586001 ttcgcgcggc agtgtcggct cgctgacacc tgcttcggtg gcccgtcgca gcaggtccag 2586061 tgggctttcg ttaaaccgga agttgttgac cgccgcgctg acctcgccgt cttcgaccag 2586121 gtagacaccg tcgcgggtca gcccggtgag cagcagcgtg gtcgggtcga cctcgcggat 2586181 gtaccacagc gtggtcagca acagtccgcg ctcggtgccc gcgatcatgt cggcgagatc 2586241 ggccgacccg ccggtcatga tcaagttgtc ggcggcgacc gcaactgggg cgtcgaattt 2586301 ggcggcagtg gcccgtggat acgccagcgc attgatcaca ccgctgcgga tccagtccac 2586361 ctggctgatt tccatgccgt tgtcgaacac cgattgcgtc tccgaggagt tgctcaccgc 2586421 cacaaacggc gtacacgcca gacccggcgc agccggatcg gtgaacaacg tcagcggcag 2586481 ctcggtcaac cgctctccca cccgggttcc accgccagga gccgagaaag cggttcggcc 2586541 ctcctgcgcg ccgcgcccgg ccatcgacca acccaggtag atcatcatgt cggccaccgt 2586601 cgacggaggc atgatggtct ggtagcgccc ggccggcagc tcgacggtgc gttgcgccca 2586661 ccgcagccgc gtcgacagcc gctcgagcat cagatcgatg ggcacctcga cgaaatcggg 2586721 tgtgccgatc cccacccaag cgctggcgtc gccgcgtttg gcgttgatct cgatcgcccc 2586781 ggtgggctgg gtgtagcggc ggcgcagacc cgtcgacgat gccagaaacg tcgtggacac 2586841 actgcggtgc gcgtagccgt acaagcggtc ggccccgcgg aagcccctgc tcagtgagcc 2586901 ggcgataccg gtgaaaaccc ctgccccggt gcccggaacc ggggcatccc agtcgtcggg 2586961 ctctccggta tcggcaagca gcggcgcggc atcaccggcc tccggcgcgg agcgggccgc 2587021 gtcctgggag gacaccacca gaccgggcag caccgacggg tccacttcgg cggagaccac 2587081 ggagccgacg aaggcgctat ctccccgtcg gacgatcgaa atcacggtga cgtttcggct 2587141 gtgggaaacg ccgttggtgg tcatcgaatt gcccgcccaa cgcagtgtcg cctcgacctt 2587201 ttcggtgacc agcaccatgg tctcgtccgc ccggccagac ctggccgctt cctttaaaac 2587261 gatgttgacg gcgtgctgcg gctcgatcat cgaccacctt cagtacgagt attgagcaca 2587321 ttgacgcccc ggaacaacgc cgacggacag ccatggctga ccgcggcaac ctggccgggc 2587381 tgggccttgc cgcagttgat ggctccgccc attcgccagg tcgacggccc gcccacggct 2587441 tccatggcat tccagaaatc ggtggtgctc gattgatagg cgacatcacg cagctgcccg 2587501 tacagctggc cacctcggat gcggaagaaa cgctggccgg tgaactgaaa gttgtagcgc 2587561 tgcatgtcga tcgaccatga cttgtcgccg acaatataga tcccgtcgtc gacccggccg 2587621 atcaggtccg cggtgctgag gtcttcgatg cccggctgca gcgatatgtt ggccatccgc 2587681 tggatcggca cgtgatgtgg cgagtcggca tacgagcagc cgttggaacg tggctccccc 2587741 aaccgtgggg cgaacgcccg gtcgagctgg taaccaacga acaccccgtc acgcactaga 2587801 tcccagctct gcgcggccac tccctcgtcg tcgtaaccga cggtggccaa gccgaattcg 2587861 gcggtacggt cggcggtcac gttcatcacc ggcgagccgt agcgcagggt gccgagtttg 2587921 tctggggtgg caaacgatgt cccggcatag gcagcctcgt agccgatggc acggtcgtat 2587981 tcggttgcgt ggccgatgga ttcgtgaata gtcagccata ggttagtggg gtcgatcacc 2588041 aggtcggtgg gccccggcat cacgctaggc gctcggacct tctcggccaa cagcgatggc 2588101 agctgcgcga gctcgtcggt ccagttccag atctcgtcgc cggccaccac ttcccagccc 2588161 cgggcggtcg gcggagccaa cgtccgcatc gattcgaagt tgcccgccgc ggaatcaaca 2588221 gcaaccgcat ccaggcacgg cagcagccgc acccgctgtt gggtaatcga tgacccgaag 2588281 gtgtcggcgt agaaggtctg ctccttgacg gcgttcaagc tggccgatac gtggtcgatg 2588341 ccgtcggcgt ccagtaaccg cccggagtag tcgcgcagca cggcgatctt ctcggaggcg 2588401 ggaacgccga acggatcgat ccggtagttc gagacccact ccgcgtcggt gtatacgggc 2588461 tcgggcgcca atctgacccg ctcggtgttc agcgccgcca gcacggtagc cacgtgtacc 2588521 gcatggcgag cggtcgcggc cgcgacgtcg ggtgccaact cagcatggga ggcgaatccc 2588581 cacgtgcccg cgacgattac ccggacggcc aggccgagct cacggctgat caccgcggtc 2588641 tccagctcac cgtcacgcag ttggatgatc tcggtgctaa tgcggtgaac ccgcaggtcg 2588701 gcgtggctgg ccccggccgt ggcggccgcc gacaatgcgg cgtcggccaa ctgctggcgc 2588761 ggcaggtcca ggaagtcttc atcgatcccc cggttcggtg tcacgactcc accgtaacga 2588821 ccagctttaa tacacccatg cgcgacgcgc cacgtcggag gacggcactg gcatatgccc 2588881 tgctggcgcc cagcctggtg ggcgtggtcg ccttcttgtt gctgcccatc ctggtggtgg 2588941 tatggctgag cctgcaccgg tgggacttgc tgggcccact gcgctacgtc ggcctgacca 2589001 actggcggtc ggtgctgacc gattccggct tcgcagactc attggtggtc accgccgtct 2589061 tcgtggcgat cgtggtcccg gcgcagacag tactgggact gctggccgcg tccctgctgg 2589121 cccggcgact gccgggcacc ggcctgttcc gcacgctgta cgtgctgccc tggatctgtg 2589181 caccgctggc gatcgcggtg atgtggcgct ggattgtggc gcccaccgac ggcgcgatca 2589241 gcactgtgct cggacaccgc atcgaatggc tcaccgatcc aggcctcgcg cttcctgtgg 2589301 tttcggccgt cgtggtgtgg accaacgtcg gatatgtctc gttgttcttc ctagccggat 2589361 taatggcgat tccgcaggac attcacaacg ccgcacgcac cgacggcgcc agtgcctggc 2589421 agcgcttctg gcgcatcacc ctgcccatgt tgcggcccac catgttcttc gtcctggtta 2589481 ccggaatcat cagcgccgca caggttttcg acaccgtcta cgcgctgact ggcggtgggc 2589541 cgcagggcag caccgacctg gtggcccacc gcatctacgc cgaggcgttt ggggccgcgg 2589601 caatcgggcg ggcatcggtg atggcggtgg tgctgttcgt catcctggtc ggtgccaccg 2589661 tggtgcagca tctgtatttc cggcggcgga tcagctatga gctcacctag tcgcgtctcc 2589721 aacactgcgg tctacgcggt gctgacgatc ggcgcggtaa tcacgctgtc ccccttcttg 2589781 cttggcctgt tgacctcgtt cacttccgca caccagttcg cgacgggtac tccgctgcag 2589841 ttgccgcgac cgcccacgct ggccaactac gccgatatcg ccgatgccgg atttcgccgc 2589901 gcggcggtgg tgaccgcgtt gatgacggcg gtgatcctgc tgggccagct gacattttcg 2589961 gtgctggccg cctacgcgtt cgcgcggttg caatttcggg gacgtgatgc gttgttctgg 2590021 gtctacgtcg caaccttgat ggtgccgggg acggtgaccg tggtgccgct gtatctgatg 2590081 atggcccagc taggcctgcg caacacgttc tgggcgttgg tgctcccgtt tatgttcggt 2590141 tcgccgtacg cgattttcct gctacgcgag cactttcgcc tcatcccaga tgacttgatc 2590201 aatgccgcgc gcctcgacgg tgccaacact ttggacgtga tcgtgcatgt ggtgatccca 2590261 agcagccggc cggtcctggc cgccttggcg atgatcaccg tggtctcgca gtggaacaac 2590321 ttcatgtggc cgttggtgat caccagcggc cacaaatggc gtgtcctaac ggtggcgacg 2590381 gctgacctgc agtcgcggtt caacgaccag tggacgctgg tgatggcggc gaccacggtg 2590441 gcaatcgtgc cgctgattgc gctcttcgtg accttccagc ggcacatcgt cgcatcgatt 2590501 gtggtctcgg ggctcaagtg acccggcccc gccagtccac gctggtcgcc accgcccttg 2590561 tgctggtggc gatcctgctg ggtgtgacgg cggtgctatt ggggctctcc gccgaaccgc 2590621 gtggcggaaa gatcgtcgta acggtgcgac tctgggacga gccgattgct gcggcgtatc 2590681 gacagtcgtt tgcggcattc acccgcagcc atcccgatat cgaggtgcgc accaatctgg 2590741 tggcctattc gacctacttc gaaaccctgc gcaccgacgt ggctggcggc agcgcggacg 2590801 acatcttctg gctatccaac gcctacttcg ccgcctacgc tgacagtggc cggctaatga 2590861 agattcagac cgatgccgcc gactgggagc cggcggtggt tgaccagttc actcggtccg 2590921 gcgtcttgtg gggtgtgccg caactgacgg acgccggaat tgccgtgttc tacaacgccg 2590981 atctgctggc tgccgccggt gtcgacccca cgcaggtgga caacttgcga tggagtcgcg 2591041 gcgatgacga caccttgcgc ccgatgctgg ctaggctcac cgtcgacgcc gatggacgca 2591101 ccgccaacac gccaggattc gatgctcggc gggtccgcca gtggggatac aacgccgcca 2591161 acgatcctca ggccatctac cttaactaca tcggctcggc cggcggtgtg ttccagcgcg 2591221 acggcaagtt cgcgttcgat aaccccggcg ccatcgaagc cttccgctat ctggtcggcc 2591281 tgatcaacga cgaccacgtc gcaccgccgg cctcggacac caacgacaac ggcgatttct 2591341 cccgtaacca gtttctggct ggcaagatgg cgctattcca gtccggcacc tacagtttgg 2591401 cgccggtagc ccgtgacgcc ctcttccact ggggtgtggc gatgcttccc gccggccccg 2591461 caggccgggt aagcgtcacc aatggtattg ctgcagctgg taattcggcg tccaaacatc 2591521 cggatgcggt gcgtcaggtg ctggcctgga tgggcagcac ggagggcaac tcctacctgg 2591581 gccgccacgg tgcggccatc cccgcggtgt tgtctgcgca accggtctac ttcgactact 2591641 ggtctgctag gggcgtcgat gtcacgccgt tcttcgcggt gttgaacggt ccgcgcattg 2591701 cggcccccgg cggcgccggc ttcgccgccg gacagcaggc cctcgaaccc tacttcgacg 2591761 aaatgttcct cggccgtggc gatgtcacga caaccctgag gcaggcacag gcggcggcca 2591821 atgctgccac acagcgctag ttgcgatcta gcccggtagt actagcacgg ggaccgggct 2591881 gtagcgaatg atcttgccac tccaggagcc gaggaatact ctcgcgacat caccgaacgg 2591941 cgaggtgccc aaggccagga tctccccgtc ctgccagtcc gcagcgtcca gcgcctgcgc 2592001 ccagccgttc ccggtgacca cttgcagcac aacgtcttca ctcacgacgc cgttaattct 2592061 tagtttttcc aacagttctc gcgcttgcgc cgcccatgcc tccagaaccg aagcctcggc 2592121 atgcagcccc acttcgggcg gatacatggt ccggccgcgg accgcgaatg tgatcacccg 2592181 catcggcacg ccataccggc tggccaggtg gccgcatcgc ctcaccacgt cgaccgaacc 2592241 cgacgtcgcg gagtagccgc agctgagccg tgtcaaccgg tcggtgtagc aacggtagcg 2592301 gcggggggtg atcgccaccg gtaccggcga cgaatgcagc agccggtcgg cggtcgagcc 2592361 gatcaacacc cgcgcgcgcc gcccgctggg aaacgacccc agcaccagca cctcggcttc 2592421 gagttcctcg acgacgtcga gcagaccagc cgacaccgat cggtgtgcgc ggtggtggta 2592481 gctgacctcg atcccgtcgg ccagtctgcg caggtagcgc tgggcctctc gcgcggaggc 2592541 ggcagccagc tgctcagacc agagctcgta ctcggcgtcg acgcgggcga gcgacggtgt 2592601 cggccagtgc ctgcgcacga tggtggccac tgtgagcgac gtcttgtgca tccgcgcgac 2592661 gcggacggct agatgtaatg cggacggacc gaccttgcca gccaaatacc cgacgacgat 2592721 ggtcacggca cttcctcgtt gagcgcactg tggtgccgac cccacatcag gtaaaagatc 2592781 actgccaccg ccacccatcc gctgaacgcc agccaggtgt accagtgcaa gctggccagg 2592841 atatacccgc aggccagcac cgaaagaaca ggcgtcacag ggtaaccggg taccttgaac 2592901 cctcggggta agtcgggctc gcgcacccgt agaacgatca cacccacagc caccacgctg 2592961 aacgcggtga gcgtgccgat ggacaccatg tccgccaagc tatccagcgg tatgaaggcg 2593021 gccagcgtcg atgcgaagat cgcgacgatc accgtgttgt gcaccggcgt catggtgcgc 2593081 ggattcacct tcgcgaaccg cgccggcagc agcccgtcgc gccccatcgc gaacaggatc 2593141 cgggtctggc cgtacatggt gaccagcgtg acggtgaaaa tcgagaccac cgcaccggcg 2593201 gccagaatcg tgctggccca ttcgccatgc gtgacgttgt ccaagatgat ggccagcccg 2593261 gcggtttcct gctctgcgaa gtcctgccac ggttgggtgc ccagcgcggc cagtgcgacc 2593321 agcacgtaga caccggtgac gaccaccagc gctgcgatca gcgcacgcgg catggtcttc 2593381 tgcgggtcct tcacctcgtc gccggcggtc gacaccgcgt caaggccgat gtatgagaag 2593441 aagatcgtgc ccgccgcgga gccgatgccg gcgacgccga atgggacgaa atccttgagg 2593501 tggtcggcgc tgtacgcgct gaacgcgatg atcatgaaca tgcccagcac gccgagcttg 2593561 atcagcacca tgatcgcgtt gaccctcgcc gactcgctgg cccctcgaat caacagcagc 2593621 gcgcatagcc cgatcaggat gacggcgggc aggttcaccc aaccgggatg ggtgtcccac 2593681 ggcgccgccg acaatacgtg cggcatctga aatccgaaca gattactcag cagcttgttc 2593741 acgtagccac tccagccgac cgcgaccgct gcggtggcta ccccgtattc cagcagtagg 2593801 caggccgcca ccaccatcgc gaccgcctcg cccagcgtcg tgtacgcgta ggagtacgcc 2593861 gacccggaaa tcggcacggc ggaagccagt tccgcgtagc agatagccgc gagcccagcg 2593921 gcgatgccgg cgatgatgaa cgaaacaatc acgcccgggc cggcctctgg aactgcctgg 2593981 gcaagcacga aaaagatgcc ggtacctatc gtcgcgccaa ccccgaacat ggtcagctgg 2594041 aaggtgccga aactccgctt gaggttcccc gatgccccgg atgcgaccgg ggcgccgctc 2594101 accgggcggc gccgcagcat cagttctcga aggctcatcg acgttgtcgg caattatgaa 2594161 cccgcctccc atagcgcgtc ggcgaaccgg cgaaccgcgc agtcgatctc ctgcgcggtg 2594221 atcactaacg gcggcgcgaa ccgcagggcg gcgccgtagg tgtcttttaa cagcacaccg 2594281 cgatcggcca accgcatgct catgtctgtg ccaatggcaa gcgcccgttc gatgtcgacg 2594341 tcagcccacc atccgaggcc gcgcagggcc accgcaccat cgccgatcag gtccgccagg 2594401 cgctgatgca gatgcgcacc caatttagcg gagcgagctt gacattctcc ccagacgacc 2594461 atggaaacca cgggggtacc gatcgcggcg gccaacggat tgccgccgaa cgtcgacccg 2594521 tgttcgccgg gatgcaccac gccgaagatt tcgcggtccg cgaccatcgc cgacaacgga 2594581 accgcaccgc caccaagtgt cttgccgagc aggtaaatgt ctggcagcac acccccgtgg 2594641 tcgcaggcga acgggtaacc cgtacaggcc agccccgatt ggatttcgtc ggcgatcatc 2594701 agcacgttgt gctcgacgca gccggcaggt agtcgtcggc cgggacgatg atgcccgcct 2594761 ggccgggaat cggctcgagc aggtcagcga cggtgttgtc gtcgattgtc tgcgccggtg 2594821 ccgcagcatc gccaaacggt accgagcgga gtcccggggt agaaggttcg acgccgctgc 2594881 ccgcagccgg gtccgacgag aagctgacga cactgctggt gtggccatga aagttgttgt 2594941 ttgccaaaat gatatcgtgc cggcccgcgg ggaggccgtt gacgtcggct ccccacttgc 2595001 gggcgaccct aagaccgctc tccaccgctt cagcatcaga gttcattggc aacaccacgt 2595061 ctttgccgca cagctgggca agcgcggcgc ccaacggccc gagtcggtcg gcatgcaagg 2595121 cccgattcag cagggtgacg gtgtcgactt gggcatgagc cgtggcggtg ctcgcggggt 2595181 tgcgatggcc aaggttgacc gccgagtacg cagccagcca gtccaggtag cgcaggccgt 2595241 cgatatcggc gatccacgca ccctcagcgc tggccgccac cacaggcagc ggcgaataat 2595301 tgtgcgctgc atgcctttcg accagtgcca tagtggcctg agtggcatcc gcgagatttg 2595361 tcatgggtgt atctccagcg tgcagcactt gacggaaccg ccgcccttga gcagctcgga 2595421 cagatcgaca ccgaccggct cgaagccggc tgcgcgtaac tgcgccgcaa aacccatggc 2595481 cgcgaccgga agcactacgt tcagaccgtc agagacggcg ttgagtccga acacgaacgc 2595541 gtcggcactg ccgaccacaa tcgcgtcggg gaacagcgcc gacaactgtt cctgcgctgc 2595601 cgtactgaac gccggcgggt agtaggcgat cgtgtggtcg tcgagcacgg ccagcgcggt 2595661 gtccaggtga tagaaccgtg ggtcgaccaa ctcgagggag accaccggca gaccaagcac 2595721 cgcggcgatt tcggcgtgtg cgcgctggtc tgtgcgaaag ccgtagcccg ccaacaccct 2595781 ttcgccaacc atcagcaggt cgccctgtcc ctcgttgacg tggcgggtgg tcaccgggcg 2595841 atatccgacc gaggacatcc agctggcata ggctctagac tcaccagctc gttcggggaa 2595901 ccggaaccgg gcgaccacgg cgatgtcgtg cgcgatgaac ccaccgttgg cggtgtacac 2595961 catgtccggt aacccggaaa tgggctcgat cagatccacg ctgtggccta gccgaagata 2596021 ggtctggtgg aggtgctccc actgtgcttg cgcgacttgg acgtcgactg gcgcggtgac 2596081 gtccatccag gggttgatcg cgtatgcgac ggcaaagaag gccggcgggg tcattgcata 2596141 ccgccgcgtc cggggggtgc ggcgtgcagg tgaccctaga cgggcagcag cgacgtagga 2596201 atccgtcata aaccaacgat atttggctct gatttcacaa tcaaacgatg gtcgttgcgt 2596261 attttccatt gatacattgc gttaacctcg aatctgtggt gattcgttgc gtgcttagaa 2596321 cggaggaggg ccgatggacc gcctggatga caccgacgaa cgcatcctcg ccgagctggc 2596381 cgagcatgca cgggccacct tcgccgagat cggtcacaag gtgagtttgt ccgctccggc 2596441 ggtgaagcgc cgcgtcgacc ggatgctcga gagcggcgtc atcaagggct tcaccacggt 2596501 ggtcgaccgc aacgcgctcg gctggaacac cgaggcttac gtgcagatct tctgccacgg 2596561 caggattgcg cctgatcagc tgcgtgccgc ctgggtgaat atccccgagg tggtcagcgc 2596621 ggcaacggtg actggcacgt ccgacgcgat cctgcacgtg ctcgctcatg acatgcggca 2596681 tctggaggcc gccctcgagc gcatccggtc cagcgctgac gtcgaacgca gcgaaagcac 2596741 cgtcgtgctg tcaaacctca tcgaccgcat gccgccctag tgttccgcgc caatgctaga 2596801 aaaggcctgc tgagctacgt agacgcagca tgagcaggtc ctcgcgccgc caacccgcga 2596861 aacggcgcgt ctgtacaccg acacgccgtt agggcgcgcg cccacgccca gctatcgccc 2596921 aagctcacca tcgcgttggg cggcggcggt ggcggccaac atcggggcta tagcggctgg 2596981 ccggtccgcg cgccgcccgc gccgccacct aggagtgcaa tatcaggctc tctatcgcca 2597041 ccgctgtccc gctggccatg gcagtgatcg caagcgtcac ccagtcggca agtttgggtc 2597101 gcccaggatg cgctgacagc tggccggtgc cgccacgggc ggtgatcgcg tcgcccatct 2597161 cgtcggcacg gcgcagggtc accgtgatgg cagcggcaag caggtcgatc agctcgcgcg 2597221 catgccgctg gcgccgagcc ttgcggcttg gcggcatccg cttgggccgc agccggcgcg 2597281 cggcgtagag cacctggaat tcgtcgatca acatcgggaa ggcgcgcagc gcgagcgcca 2597341 acgccaccgc ccattcgtcg accgggatcc gcaacacccg aaacggccga cccaaagtgg 2597401 ctaccgcagg gctgatttcg gcaacattgg tggtccagga caccatcgcc cccagcgcca 2597461 ggagcacaac cgacagcgcg gtgatccgca ggaagtgcag tgcgccgccc aatccgagct 2597521 gcactccgcc cacggcgacc actggagtgc caccggctag cgcagcggtc agaaagccga 2597581 tcgcgaggac gatccacagc cagcgaggta ccgacggcag cgcgccgcgc ggaatgtgcg 2597641 cgattcgggc cgcggccagc accaaagccg ccatcatccc gatcgtcacc catcccgggt 2597701 agaacgtcag caacaccgaa atgccgaaaa ccaccaataa tttggtgccg gcccacaggt 2597761 cgtggatgac cgagctaccc ggcaccggaa tcaacagcac aatcggacgt gacgggcgac 2597821 gagtcccgtt gcgtgccggg gccgaagttg tggtcatgac attccccccg cctccgacgc 2597881 cgccgccgat tccagcacac cgtcgcgcag atgcagggta cgcgggcaaa gctcctccat 2597941 ccccgcgaag tcgtgcgaaa ctacgaccac cgtcaggccg cgcgcccgac gcaagtcttc 2598001 cagcagccgc agcaggccgc gctggctggc cgcgtccaac cccgccaacg gctcatcgag 2598061 gatcaacgcc cggggtgcac gcgcaagcag cccggccagc accacccgac gcatctggcc 2598121 cccgctgagc tggtcgattc gtcgcgcgcc cagcgcgggg tccaacccaa cgacagtcag 2598181 cgccgcagcc acccggtcct gctcgctagc cgaaaaacct gctgcggaag caacttccag 2598241 gtctacacgg ctgcgcatca gctgcagccg ggccgcctga aaagacaacg ccaccgcgcc 2598301 gacctgctcg tgggtgggcc gaccgtcaag taggcaggct ccggtcgtgg ggatcgtcag 2598361 cccggccatg atccacgcca gcgtcgactt ccccgagcca ttgccgccgt ggatcagcac 2598421 cccgtctccc tgctcaacaa cgaagttgat atcgcgcaac gcggtctttg cccacggggt 2598481 gccgctagcg tattcgtggc cgacgcccac cagttcgagc gccggcgcgt gctggggctg 2598541 atccaccccg atgaccgggg ccggcatcgc ggcggtgtgg accatatcgg tgttatccgg 2598601 cgaatcgctc aggctgagcg tgcggtcggc ggaatcggct tcgttgtcgt agtgcgtgat 2598661 gtgcaccaag gcggtccggt gccgctgcgt cagacccgac agcacggcca gcaaagcgtc 2598721 cctgccctgc tggtcaacca tggtggtgac ctcgtcggcg atgagcatcg ccggctcccg 2598781 ggccagcgct gccgccagcg ccaggcgctg cagctcacca ccggacaggc ttccggtgtc 2598841 gcgttcggca agcgcttcca agccgacctc gctcagcaac cggccaacgt cagcggtggt 2598901 acccagcggc agcccccaca ccacgtcgtc ggcaacccgg gtgcccagga cctggctttc 2598961 cggatgctgc aagacgacag cggtgccgcc cagctttccc aaacccaccg tgcccggacg 2599021 atccacggtg cccgacgtcg gtgcccggcc ggccagtatc agcatcaagg tggtcttccc 2599081 tgatccgttg gccccgatga tcgctaggtg ctcgccggcc cggacgtcga ggctgacctc 2599141 ccgcagcgca tcttggccgg cgcgggggta acggaagcgg accttgtcca accgcaccgg 2599201 caccggcccg atcagagcgt ccacgtcatc tcctggcggt gggtcaagtt tgtgtacatc 2599261 ggggattccg cgcatccgct ccagcaggcg cgacaacgcc caccacccaa tcagcgacac 2599321 gatcatgatc ccaatgttga aatagcccag cagcacccac ggccagtact gcagtccctc 2599381 ggcgaaatac cgcttgacgt cggcggctgc cccctgcatg tgcatccggg ccaaggtggc 2599441 ggcgataccg tccacgtttg cggtcatgac cttgaaaatc agatgccgca gtcggaccat 2599501 ggcggccaac atcccgacca tcgccgcgcc gaacacgaat ccgccgatca gcgacgagac 2599561 gaccaccgtc ggggtgcccc ggcccctgcg tttgacgatt ccggtcagcc caccgatgta 2599621 ggcactgtgg accaccccca tgaaaccgcc cagccccgcg atcaggaagg cgatcatccc 2599681 ggccgcaacc gtcgcggccg ccagcacgcg gagacggtag cggtaggcca gcaggccggt 2599741 gggcacggtg cccaacagcg ccagaccggc cgcgaacgga acgacgacgg agatgatcgc 2599801 ggtcaccgcg cacagcgccg ccatcaccga cgcctgcgcc aattcactcg gccgcagcgg 2599861 cccgccccga tgttgcgcgg ggcaagggcc gagcggggtc acttcaccga ttctgccagg 2599921 ctcaggcccg cacacggcgc agcacatcga ttagcctcgc atagcaaagc tatgcaacga 2599981 tggggggatg agtccctccc ccgccgccgc caaccgcagc gaggtcggcg ggccactacc 2600041 gggcctggga gcggatctgt tggcagtggt cgcgcggctc aaccgcctag ccacgcagcg 2600101 catccagatg ccactgcccg cggctcaagc cagactgctg gccaccatcg aagcccaggg 2600161 ggaagcccgg atcggcgact tggccgccgt cgatcactgc tcgcaaccaa cgatgaccac 2600221 gcaggtacga cgactcgagg acgctggact ggttacccga accgccgacc cgggagacgc 2600281 ccgggcggtc cgcatccgca tcacgccgga aggcatccgc acgttgaccg cggtgcgggc 2600341 agaccgcgcg gctgcgatcg agcctcagct ggccctgctc ccaccggcgg accgccgggt 2600401 gttggcggat gcggtagacg tgttgcgccg gctgctcgac catgccgcca ccacgccggg 2600461 ccgggcgacg cggcaatagg catcgagatg tcgaacgccg cgccgttggc ggtgtgggtc 2600521 ggatcgatgc gcccgaaaac gcaaagggaa tcgcttggcg gctcctgctg ctggagttgt 2600581 ccggaccatc ccgactactc cgaaaggcca atgcgagccg gctgattgac ggcgaacgcc 2600641 aacttggccc gaaaagaccg gcatttcact actatcaatg tgcctcgatc gtcgttggat 2600701 aacaaccgta gtgagtcgag aggaaccagt atgcagttcc tgagcgtgat tccagagcag 2600761 gtcgagtccg cggctcaaga tttggcgggc attcgctcag cgctgagcgc gtcttacgcg 2600821 gccgcagcgg gacccacaac agcggtggtt tccgctgccg aggacgaggt gtcgaccgcg 2600881 attgcgtcga tattcggcgc ctacggtcga cagtgccagg ttctcagcgc ccaggcctcc 2600941 gcgtttcatg acgagttcgt caacctgttg aaaactggcg cgactgcata ccgcaacacc 2601001 gaattcgcca acgcccaaag caacgtgctg aatgcagtga acgcaccggc ccgatcgctg 2601061 ttggggcacc cgagcgcggc tgagagcgtg cagaactcgg ccccaacgct aggcggtggc 2601121 cacagcaccg tgaccgctgg gcttgccgca caggccggtc gtgccgtcgc gacggtcgaa 2601181 caacaggctg cggctgcggt tgccccgttg ccaagcgccg gcgccggact ggctcaggtt 2601241 gtcaacggcg tcgtgaccgc cggacagggt tccgccgcca aacttgccac cgcgctgcag 2601301 agcgccgcgc cctggctggc caagagcggc ggcgagttca tcgtggctgg gcagagcgcg 2601361 ctgaccggtg ttgctttgct gcaacctgcc gtggtcggcg ttgttcaggc gggcggtacg 2601421 ttcttgaccg ccggaacgag cgctgctacc ggactgggtc tgctcacact tgctggtgtt 2601481 gagttcagtc aaggcgttgg caaccttgcg ctggcttcag ggaccgccgc gaccggactt 2601541 ggtctgctgg gcagtgccgg tgtgcaactg ttcagtcctg cctttttact ggctgtgccc 2601601 accgcgttgg gtggagttgg ctcgctcgcg atcgcagtag ttcagcttgt gcaaggcgtc 2601661 caacacctgt cgttggttgt gccgaacgtt gttgccggga tcgctgcact gcagaccgcc 2601721 ggtgcccagt ttgcccaggg tgttaaccac acgatgctgg ccgctcagct cggtgcccct 2601781 gggatagctg tcttacagac cgccggtggc cattttgctc aaggcattgg ccacctgacg 2601841 acggctggca atgccgctgt cacggtgctg atctcctagc cgggcggtcg agcttcatcc 2601901 cggagccgct acgttacgcc gagatgctgc acccggagaa tcggtccgat tgagttctgg 2601961 gaccgataag ttcggctggc gtcgatgccg gctgccgcac caaggccgcc tgcaacatcc 2602021 ccatgtcggt gaccgttcgg cggtcgtaca ctttccaagt cagaacggcg gcagcggcgt 2602081 agcacatcat gaagatccag aacgcatcgg tcccactgcc ggtgctgagg taggactcac 2602141 gcagcgccat attgattccg accccgccga gcgcgccgaa ggcggccaca aacccgatga 2602201 ctactcctga gatgatgcgt gaccagtcgc ggcgttcggc ttcactgaga tccagggagc 2602261 ggctgcacgc ctcaaaaatc gtcggaatca tcttgtacac agacccgttg cccaacccgg 2602321 ataggacgaa caacgcgacg aagcagacga agtagccgac catggtagcg ccccgatgct 2602381 ggccgacatg tcggccttcg agggtgctgg cactgatcag cagcccagcg gcgagcgtca 2602441 tcgccacaaa gactataagg gtcaagcggc ttccaccgac tcgatcggcc agccggccac 2602501 cgtaaatccg ggccaccgcc gccagcaacg gcccgacaaa cgccaactcg acggcatgca 2602561 gcgtcgcgcg cgccgggctt tgtccgcacg ccaggaagtt ggtctgcaac acctggccaa 2602621 acacgaagga gaagccgatg aatgagccga aagtgccgag gtagagcagc gagagcaacc 2602681 acgtgtcgcg ggtcgacaga accgcggaaa cgatcggccg aagccggttc acctgcaccc 2602741 ggtgctgttc gacattgttc atgaacagcg acactccgat taccgcgatt gccaccagaa 2602801 ccacatacag tgcgcagacc aggtaaggct tccgctcacc gacagtggcg attgccaaca 2602861 acccaactag ctggatcgcc ggcaccccga gattgcctac cccaccggca attccgagcg 2602921 ccgaaccctt gagccgatgt ggatagaaag cattggcgtt gctcattgac gacgcgaagt 2602981 tgccgccgcc taagccggtc agggccgcac acaccagata cggccacagc ggtagccccg 2603041 gatgggtcaa caacaccgtt gtgccaatgg ccggaattag caacacgatt gccgaaaaag 2603101 tcgcccagtt gcgaccgcca aagatcgcgc tggccaacgc gtagggcatc cgcaggaacg 2603161 cgccaaacag cgtcgcgatg gtgccgagca gaaacttgtc actggttgaa aagccgtaga 2603221 cgtcctgggg catcagcaac tccagcaccg gccagagcgt ccacaccgag taacccaggt 2603281 gaaccgtcac gaccgaccaa agcagattgc gtcgggcaat gcccttgttg cctgcctccc 2603341 acgctcctag atcctcggga tcccaatgcg tgatgtgacg tgagccaccc aggcgcctga 2603401 gcgaaggggc cgcgggactg cgcggcgact cctcgcgttg cagcagcgtg tgctgttcca 2603461 tcaccctcct tgttcccacc ctggtgcgaa tgcgggccgg cctaccaggg tgccagcctt 2603521 gcgtgtacga agttgtttcc tggcagcctg aaactcctgt agaactcctg taaaagtgct 2603581 gaaggcaata cacaattggg ctcgcccttg agccgagaag acctaaaccc tacatgtaaa 2603641 gctgcgctgt tgtcctcgca gcaagaaaac agcgaaagct attgtgctcg agtactactg 2603701 atgggggatc gagccgagcg cctcgagctt gccatctgat ccgatgtgga atcgcaccgt 2603761 gccgatgccg gtgggacagc actcctggtc gctgccgatc tgccattggt actgaaccgt 2603821 cacggtgtca tcgcctgcag gcaatacggt gatgtagggc ttcggattcc gagtcggcga 2603881 gcccagcggg atgttgcggt cgaagaacaa cagctgttgg ggagtcgact gggaggcaat 2603941 tgtcgggatg atttgcaccc aatgcaagcg gcagttgcgg gtatgtcctc gggtgatttc 2604001 gacccatttg gagcccggta ccacgatcgg gaccgcagcg atggcctgcc gtaccgtgtc 2604061 agcggtcggc ccgtcggaat ccttgcaggt gttcggtggt gacggtcgtg ttgtcggcgg 2604121 cttccaggcg caaccggagg cgcccagccc cacaatcagt gccaacagtg ccaacagtgc 2604181 caacagtgcc agaatcggga cggcgctacg ctgacgacgc acgtcacgag cttagcgaaa 2604241 actgggaatt tcccctacgt ttcatcaacg cctcaggtgt cgatcctaaa gcgcgggtgc 2604301 cgccggtatt cttgccccaa atcggtcggt tgacacccga tgcggtcggc gaagccatcg 2604361 gcatcgcggc cgacgacatc ccgatggcgg cacgctggat cggcagccga ccatgctcgc 2604421 tcatcggcca gcccaacacg atgggcgacg aaatgggtta cctgggacca ggtctagcgg 2604481 gtcagcggtg cgttgatcga ttggtcatgg gcgccagtcg atccacctgc tcccgattgc 2604541 cggtcatcgc gtccgtcgac gaacggctgt cggtgctcaa accagttcgg ccgcgcctgc 2604601 attcaatctc attcatcttt aagggccgcc ccggggaggt gtacctgacg gtcaccggtt 2604661 acaactttcg cggtgtgccg tagttcgggg tgtgctcgac ctgcctcgcc gagcgccccc 2604721 gacaatcggg tcgccatcta tgaaaggaca tctagcaaca ttcggccacc cagcgcttcc 2604781 gacataccga ggatcatggt tgagtcggga accgggatcc ccctaccggc ttcctgctgg 2604841 agccggacga gatcgaggcg atgcatgccg aaggattcct cgccgcactg gatctggcac 2604901 tcttctgcgg ccagggcagc gctgtacgtt cgcggcaaac gccgacccga tggccaaggg 2604961 cgtcgatcgt gcgctctgcg aaatcgtggc cgaacgccgg caactggacc tggacctggc 2605021 caaagcccaa gtccggtcgg cgctcgccaa ccagcgttac catcgcgacg tccattaaac 2605081 ccagcacggt cacgaacgga ggttgtgatg agcgacgccc gcgtgccacg gatcccggcc 2605141 gcgttgtccg caccaagtct caaccgtgga gtcggcttca cccacgcgca gcggcggcgg 2605201 ctggggctga ccggccggct tccgtcggcc gtgctcacgc tcgaccaaca ggccgaacgc 2605261 gtatggcatc agttgcagag cttggccacc gagctgggcc gcaacctgct tctcgaacag 2605321 ctgcactacc gccacgaggt gctgtacttc aaggtgctgg ccgaccattt gcccgaactg 2605381 atgccggtgg tgtacacgcc caccgttggc gaggcaatcc aacgcttctc cgacgaatac 2605441 cgcgggcaac gcggactgtt tctgagcatc gacgaacccg acgaaatcga ggaagccttc 2605501 aacacgttgg ggctggggcc cgaggacgtc gacctgatcg tgtgcaccga tgccgaggcg 2605561 atcctgggta tcggtgactg gggtgtgggt ggcatccaga tcgctgtggg caaattggcc 2605621 ctctacaccg ccggcggcgg cgtcgatccg cgccgctgcc tcgcggtgtc tctggatgtc 2605681 ggcaccgaca atgagcagct gctggccgat ccgttctatc tgggcaatcg ccacgcccgg 2605741 cggcgcggtc gggaatacga cgagttcgtc agtcgctata tcgaaacggc tcaacggtta 2605801 tttccgcgtg ccattctgca tttcgaggac ttcgggccgg cgaacgcgcg gaagatccta 2605861 gacacatacg gcacggatta ctgcgtgttc aacgatgaca tgcaaggaac cggcgcggtg 2605921 gtcttggccg ccgtatacag cggtctgaag gttaccggta tcccgctgcg cgatcagaca 2605981 atagtcgtct tcggcgcagg caccgcaggg atggggatcg ccgatcagat ccgggacgcg 2606041 atggtggcag acggtgccac gctcgagcag gcggtgtccc agatctggcc gatcgacagg 2606101 ccgggcctgt tgttcgacga catggatgac ctgcgcgact tccaagtgcc gtacgcgaaa 2606161 aaccgccacc agctcggtgt ggccgtcggg gatcgggtcg ggctgagcga cgcgatcaag 2606221 atcgcatcgc ccactatcct gctcggctgc tcaacggtct acggagcgtt caccaaagag 2606281 gtggtcgagg cgatgacggc gtcctgcaaa cacccgatga tctttccgct gtccaacccg 2606341 acgtcgcgca tggaagccat ccccgccgac gtgctggcgt ggtcgaatgg cagggcgctg 2606401 cttgccaccg gcagcccagt cgccccagtg gaattcgacg aaaccaccta cgtcatcggt 2606461 caggccaaca acgtgttggc gtttcccggc atcggactgg gcgtcattgt cgctggtgcc 2606521 cggttgataa ccaggcgcat gctgcatgca gcagcgaagg ccattgcgca ccaggccaat 2606581 ccgacaaatc ccggagactc gctgttgccg gatgtccaaa atctgcgggc catctcgaca 2606641 acggtcgccg aagctgtcta tcgggccgcc gtccaagacg gggtggcttc caggacgcac 2606701 gacgacgtca ggcaggccat agtcgacacc atgtggctcc cggcatatga ctaaccgcgc 2606761 actcgacggt catcgctgta ggcagcctct cgcttaggtc gctgcccgcg gtgtgcacgt 2606821 cacgcggaaa ccatcgccag ccggcgagaa acacgacagc cagtgttgca gtggcgacga 2606881 gcaacgccac ccgaatgcct tcgatgaaat cctcctccgc aatcgcgacg gggtcgcgat 2606941 gctcaatgtg ccgccggggg acgattccgc ccacatgcgc tcgcggattg gcactgtcga 2607001 taatgatctc ggcaaggacg tggcgctgga ccgggtcggg caccgcgcgc tccagatggg 2607061 gctcgagtgt ggccgaaagc caggcggcaa ggacggagcc caaaaccgcg aacccgatcg 2607121 tcgagccgat cgcccgctga gcactcatga tgccggacgc catgcccgca cgctcggcgg 2607181 ggaccgcggt catggcgacg gtcgtgatcg gcgtcaggca caacgcgacg ccgctcccgc 2607241 acaagcccag cccgaccagg accagggccg agctccggtg ctcgctgaag atgagcatga 2607301 gcagacccag catcaacatg cacagccccg ccaggatggg aacgcgtgct ccgatccggc 2607361 caaccaggtg cccaacaagt ggcgacacga tggccacggc cgcactgaac ggaaggatca 2607421 tcaggccggt cacgctcggg gtatagccgc gcacgttctg caggaactgg gtggtgagca 2607481 gcagcatccc atagacggcg aagaacaccg tgcagatggt cgcgatggcc agggcgtatg 2607541 aggtgtcgcg gaacagggtc agatccatca tcggattcga tgatctgcgc tcaagccaga 2607601 cgaacagggc gcagccgacg gcggctgtcc agagcatcac gatggtctgg acagacgtcc 2607661 agccgatctg ggggccttcg atgaccgcat acaccagggc acccacggca acgatgaaca 2607721 gcagctgccc ggacagatcg aagcggcgtg cccgctcgtt acacgactcc tcgacgtagc 2607781 acaaagtcag gaagaggacg agtgcgccca tgggcaggtt gacatagaag atgctgcgcc 2607841 acccccactg gtccaccagc agaccgccca gtgtcgggcc cgtcgtcgta ccgatgctcg 2607901 cgatggcggt ccagatcccg atggcgcgcg ccttctcctt cgcctccgga aaggccgcgc 2607961 tgaccagggc gagcgaggtt acgctgacgg ccgccgcacc taggccctgc gcgccccgcg 2608021 cggtggtgag caccgcgatt gagggcgcca acccgcaggc gatagatccc agcgtgaaca 2608081 acgaaacacc tatcaagtac cagcggcgcc gaccgtcgag gtcggcaagc gtcgccgccg 2608141 acatgatgaa gaccgccatt ccgaggctgt aggacgccac cacccactgc aggccgtcct 2608201 cccccaccgc gaaactgcgc tggatgtcgg gcagcgccac gttcacgatc agtgcgtcga 2608261 gaaagatcat gaacaggccc aggccagtgg cgatgagcgt gaggagctgc gtgcggttca 2608321 tgcgggcccc gatctacatg gatttcggtg gcgatctgtg accagacact aggctgcgcc 2608381 agcgacggcg tcagccgctt cggtcgattc gagccgaatg gtcgacggct gcggaaccga 2608441 ccgcaaaact ggggcaaaag gttcaccgcg ggtgtaagcc agctaggtga accgatcccg 2608501 ctggcccatg gcctatagtg ggcccatgca acaggccata cagctgcgct ttatcctccc 2608561 gcgccgcctc gccgtgggct gttgttgttg ttgattcctg gcgtccacag caatcctcgc 2608621 gctcttgccc gcaaacgggt ggaaatcggt gttcgcccgc ggcgtacagc cgccgcgcac 2608681 tcacgagtcg ttcagaaaga tcaacagcca tgaccgtgcc cacggatgca gccatcgact 2608741 tcgacgtcag ctgggaggcc aactgggcct ggaccgacac tgttgggcgt agcagatgag 2608801 catcgccgag gacatcaccc aactcatcgg gcgcacaccg ctggtccgac tgcgccgagt 2608861 caccgacggc gccgttgccg acatcgtcgc caagctggaa ttcttcaacc cggccaacag 2608921 cgtaaaagac cgtatcgggg ttgccatgct ccaagcggcc gagcaggcag gtttgatcaa 2608981 gccggacacg atcattctcg aacccacgag cggtaacacc ggcatcgccc tggccatggt 2609041 ttgcgcggca cgcggctacc ggtgcgtgct gaccatgccc gagacgatga gtctggagcg 2609101 ccggatgttg ctgcgcgcat acggtgctga actcatcctc actccgggtg cggacggcat 2609161 gtcaggtgcc atcgccaagg ctgaggagct ggccaagacc gatcaacgct acttcgtgcc 2609221 ccagcaattc gagaacccgg cgaacccggc catccatcgc gtcacgaccg ccgaggaggt 2609281 ctggcgtgac accgacggca aggtcgacat cgtcgtcgcg ggagtcggca ccggtggcac 2609341 catcaccggc gtcgcgcagg tcatcaagga acgcaagccg tcggcccggt tcgtggccgt 2609401 agagccggcc gcgtcgccgg tcctttctgg tggccagaag ggaccgcacc cgatccaggg 2609461 catcggcgcc gggttcgtcc cgccggtact cgaccaggac ctagtcgacg agatcattac 2609521 cgtcggtaac gaagacgcgc tcaacgtggc gcgccggctg gcccgggaag agggcttgct 2609581 ggtcggcatc tcctcgggcg ccgccacagt ggccgctctt caggtggccc gccggccaga 2609641 gaacgccggg aagctaatcg tcgtagtgct ccccgacttc ggcgaacgat atctgagcac 2609701 accgttgttc gccgacgtgg ctgactaagc catgctgacg gccatgcggg gcgacatccg 2609761 agcagcccgg gagcgggatc cggcggcccc taccgcgctg gaagtcatct tctgctaccc 2609821 gggcgtgcac gccgtgtggg gccaccgcct cgcccactgg ctgtggcagc gtggcgccag 2609881 gctgctcgcg cgggcagctg ccgaattcac tcgcatcctg accggtgtag atatccaccc 2609941 cggtgccgtc atcggtgctc gcgtgttcat cgaccacgcg accggcgtgg tgatcggaga 2610001 aaccgcggag gtcggcgacg acgtcacgat ctatcacggc gtcactctcg gcggcagtgg 2610061 catggttggc gggaaacgcc atcccaccgt cggtgaccgc gtgatcatcg gcgccggggc 2610121 caaggtcctc ggtccgatca agatcggcga ggacagccgg atcggcgcca atgccgtcgt 2610181 ggtcaagccc gtcccgccga gcgcggtggt ggtcggggtg cccgggcagg tcatcggcca 2610241 aagccagccc agtcccggcg gcccgtttga ttggaggctg cccgatctcg tgggagccag 2610301 cctcgattcg ctgctcacca gggtggccag gctggaggcc ctcggcggcg gcccgcaagc 2610361 agcaggagtc atccggccac ccgaagccgg gatatggcac ggcgaggact tctcgatctg 2610421 aggcaatacc cggccgccga caatgccttc ttcggcgccg cccaccgacg cgcatcatcg 2610481 gctgctagcc cccgcaccgg gttccgtcct cgccgaattc acctcgggcc ggaggttgag 2610541 ctgcttgggc ttcggcagcc gaaaccgggg cgatacaaac gtgggttgcg gatacgaccg 2610601 ctttgcgacg cggtttgtcc aacgcaggct tggaaaactt ctccaagcac gagcgagatt 2610661 actgattcga attggctctt gacagcaccg gcgaagaggt gtagagatgc gaatcactat 2610721 gtggacagca atctttggaa agctcttgct gtcaaatccg tcacgaacct atgcttagcg 2610781 ataccttgcg ccaaacatgc agtcgcttga ccgttgagat cgctgaggta tcggccatgg 2610841 atgtccctca cgagcagcca gccctctctt cgagcaaatc gaatcgcttt acttcgcaaa 2610901 ggcaaacaac tggtgtggga accaccactg ttgaacggct cgaaccgcgg ttatctcccg 2610961 cgtcccgcca catcactgag gctaaagctt tcggcaccga gtgccacgta agttccttta 2611021 cccgtgagca ggatcccgac agggcggtcc gtgtggagca gatccacggt gaagcgtatg 2611081 tcgccgccgg ccatgtgtac gaatctgcgc tcgatgaatt gggccggctg gacaattcca 2611141 acgccgagtt catcctcgac aaggcacgcg gtagcacccg agaaaccgag gtcatatacc 2611201 tgcatgcggt tcccgcggag cccctctccg gcagccaagg cgaaggaggc ctgcgaatag 2611261 tcggcatttc cgctgtgggg tcaattgacg acctcagtgc atttaaggcc gccaaaccgt 2611321 cgatgggcct ggcgcatcaa cgcaagcttt atgacgcgat cgaagacctg ggtcacggcg 2611381 gggtcaagga gattgcggca ttatcggtta cggccgatgc ccctcccacg gtgtcgtatt 2611441 cgctcatccg ggaggttttg cgcttgtacc accgaaccgg cgaaaaattg ataatcacat 2611501 ttgccatgcc agcatacgcc aagatggtga tgaattttgg tcgatttgcg atgcctcaag 2611561 tgggcgaacc gttctatgcg cacagaaata atgaccctag gacatcgaat gatctcttgc 2611621 tggttccctc aatagtcgag ccatcgaatt ttctcgagaa tatttcccgc ggggtcgtga 2611681 cagcggatga cggcccgacc gcgagaaggc gattcgccac cctatgctat atgaccgacg 2611741 gccttgatga ctatttcatg ccgttgactc ggcaggtcct tagcgaagga atccaagaca 2611801 tctgagttct ggaagcggta atgggcggtc gggcgtgcgc aactccggca acaaacagct 2611861 tggagctttt acgcgaagcg ggattcacta tccgaaccag accgctcggc aggggcatag 2611921 caataagctt caaccgattg acgcattgtg cgaactgacg gcgcccgcgc atggccaatc 2611981 cggaagacca tcattggcca gtggccgggc gctaacaggt tccagccccc caccagtgcc 2612041 gctcgaacat gcggtgcaac ccattcgcag gccggcaggg aaagcaccgc ggaagccgca 2612101 aagggctgca gttccgcgcc caatagtgtc gtccgcaacc agatgcgctc gaaaaccgcg 2612161 ccggcagtca gcgcacccga cgcgaggtcg agagacgtcg tcagcgcgcc cacatggggt 2612221 gccaatcggc acggcaggta ggccgcgcgc aacccgagcg cgtggtgcat gcccacggtc 2612281 cgcaggaggc gcagcacccg ccaatgccga agcccacgaa acatcgggcg catccacgct 2612341 tcaacctcaa gagacccggg cggcaaccca tcgtcgctgc tcgcggtcca gccaatgtcg 2612401 aagcggacgg ccgaaaagag ttcttcgtgt agttcacgag atcgaaagcg ctcagtttcg 2612461 gccaatctga ccaaccgaag gatctgtttc ctggtctctg gcgagtcaaa ccaatgcagt 2612521 tggatcccgt caatgccggt ggcctccgcc gagagtgcac ccaattcgcc ctgcgaaagt 2612581 ggcggtccac gaaagcgaac acgccggttg gtgcgccttc gctcgatcgc gccctcgatt 2612641 ggatccaccc tggtttgtgg caggcgatcc acgtcgatct ccgccaccag ccccggattc 2612701 ccgctatccg gaaaccagca caccttcgtt tcaaaaccaa ggcgtccagc gcgcagcttc 2612761 acgttttcga cagccgcacc gatcgcgacc aaactcatga tgcggcggtg ctcgggggcg 2612821 gacctccaag tctgatcgcc ccacaaccgc acccgcctac cggcatgttc gagctggact 2612881 tcgcgccggt tgtccgcgga tggcgccagc gccgccgcct cgacgagcga caggaattca 2612941 gcaggatcca gacccgtcat acccgggccc cagcggccgg cacgcattgc cgtggcagag 2613001 tggtcgcgcc gacgaacagt gcggaagcga tatgtctatc ccatgttcgc tcaaacagcg 2613061 gtgcgctggc agtatctgag tacaccattc taggtgcagc tcccaactag tagctcggtt 2613121 ccgtcctgtg ataccgcagt cccggtatta cccccgccga tcgtcgattt atgtagcggg 2613181 ccagcaatcg ccgcttgact ctctgtagag ggtggcgatt tccgcaccgt aaacgcttcc 2613241 gaacatagat gctgcggtaa gcatcgaact gatgaaagta gggagcagcg taaacgcgcc 2613301 catgtccgag cagaatcttc aacacctcgg ccgccaccac accggaagcc agatgacagg 2613361 cgaggccaac cgatggaccc gtgcgatttt cgatgtcgac gtaggacaga tctatggagc 2613421 gccgatgcgt cgcggatggt gctattccag ctataaatgc gacgaactta tccaccgtgt 2613481 tcatcgcatc agacagatcg aaataccgat cgaacgtcat acccttagga tcgaaaacga 2613541 cccaggccgt actgaacccg agcgggccag cgcctagcgc gtagattccc cgctgctgtg 2613601 cttcacgata gagcaggcga cgcaaatcga tttcgaacgc gtcgatgccg tccaccaaaa 2613661 catctgctcc ctctagaaag gtagctgcat tctctttccc aataggttcg cagaaagcac 2613721 ggatttctgc ttcagggtta atatcatgaa cgatattgcg catgacctct gccttggcct 2613781 ggccgttggt cgagcgcata gcgccgtact gccgattcga gttgcgtatt tcgaagacgt 2613841 ccgggtctgc aatggtgaac tttcctattc ccatccttgc gagggcgacc atgtcaattc 2613901 ccccaacccc acccatccca gcgattgcaa cgcgactatt ccgaagccgt tgttgttcgg 2613961 ttgggctaat caatccaagg ttgcgacaga aagcttcgtc ataagaccat ggtgcgcttt 2614021 ctttcacccg tccagagtcg ggggcatccg caccggctcg catcgcatca tcctcccacg 2614081 acgggccgct catcagcttg ggccatttca atgtacttga taccccgcgc tgcgggtagg 2614141 ccactgcgac gattcaaaca cggtgtcaca cggtgaatag tgtcgagatg ggctctgatc 2614201 aaccgtcgca aacccggttt cgcatcgata gcggaatcgc accgggttgc atggaggctg 2614261 ctgaccttgg aaaacaagat gtattcatta cgacaaaaca agcgccgcgg aaactttgca 2614321 cgctcgagca ttccgccgcg gctcacgcac atcctggccg ccttcccgca accgtccccc 2614381 ggaattactg atcaaaccct gggtttacca acttccgggc atggggcgaa ggtcgacagc 2614441 cagaacatgg ccgtgcgtga tatgggcatt cacgggacgg agccgctaag gagaccggta 2614501 cgattcaatc tccatatgag cggtgcggcg gctgttgtca ggtacgttga acaccggtgg 2614561 cgatcgggtg ccggcaggtt ggtcttctcc tgtgatgcga gcgcgcctcc gcgccaacca 2614621 ccgcgtgcga agcaggtgct gatgccacag tgctgatgtc acaaggaacc gcgagggggt 2614681 cccggaccct acatggtgcc gggcgaagtc cacatgagtg atacgccgtc aggcccgcac 2614741 ccaatcatcc cgcggacgat tcgcctggcc gcgattccca tcttgctgtg ttggctggga 2614801 tttaccgttt tcgtcagcgt cgccgttcct ccgttggagg cgatcggtga aacccgggcc 2614861 gtggcagttg cccccgacga tgcgcaatcg atgcgtgcga tgcgacgtgc cggaaaggtg 2614921 ttcaacgaat tcgattccaa tagcatcgcg atggtcgtcc tggaaagcga tcaaccacta 2614981 ggcgagaagg cccataggta ttacgaccac ctggtcgata cgctcgtact ggaccagagc 2615041 catatccagc acattcaaga cttttggcgt gatcccctga cggcggcggg tgcggtcagc 2615101 gcagatggta aggcggcgta cgttcaactt tacctcgccg gcaacatggg tgaagcactc 2615161 gcaaacgaat ccgttgaagc cgtccggaaa attgtggcga atagtacacc gccggaaggc 2615221 atcagaacct atgtcaccgg accggcggcc ttgtttgccg accaaatcgc cgccggtgac 2615281 cgaagcatga agctgatcac cggattaacg ttcgcggtaa tcaccgtgtt gctgctgctc 2615341 gtctatcgct cgatcgccac cacgctgctg attcttccca tggtgtttat tggactcggc 2615401 gcgacgcgtg gcaccattgc ctttcttgga taccacggaa tggtcggcct ttcgactttt 2615461 gtggtcaata tcctcacggc acttgccatt gctgccggta cagactacgc gatcttcctg 2615521 gtcggccgct atcaagaagc ccgccatatc ggccagaatc gcgaagcctc tttctacacg 2615581 atgtacaggg gcaccgctaa cgtcattctc ggatcgggac tgaccatcgc cggcgcaaca 2615641 tattgtctga gtttcgcccg gctgacgctg tttcacacca tggggcctcc gttggcaata 2615701 ggcatgctgg tttcggtcgc ggccgcgctg accctggcgc ccgccatcat tgccatcgcc 2615761 ggccgcttcg gcttgctcga ccccaagcga agactgaaga ccaggggctg gcgtcgtgtg 2615821 ggtaccgcag tcgtgcgctg gcccgggcca attctggcca cgtcggtcgc gcttgccctg 2615881 gtgggattgc tcgcactacc gggctaccgg cccggctata acgatcgcta ctacctgcgc 2615941 gctggcacgc ctgtcaaccg cgggtatgcg gccgccgacc ggcactttgg cccagcccgg 2616001 atgaaccccg agatgctgct ggtcgagagc gatcaagaca tgcgaaatcc ggccgggatg 2616061 ctcgtcatcg acaagatcgc caaggaggtc ctgcacgtgt ccggggtcga gcgggtgcaa 2616121 gcgatcaccc ggccgcaggg ggtgcccctt gagcatgcgt cgattccctt tcagatcagc 2616181 atgatgggtg ccacccagac gatgagcctg ccctacatgc gcgaacgcat ggccgatatg 2616241 ttgaccatga gcgacgaaat gctggttgcg atcaattcca tggaacagat gctcgacttg 2616301 gtgcagcagc tcaacgacgt tacccatgag atggcagcca cgacgcgcga gatcaaagct 2616361 actaccagcg aactgcgaga tcaccttgcg gacatcgacg atttcgtcag gccgttgcgt 2616421 agctatttct actgggagca ccattgcttc gacattccgt tgtgctcggc gacgcgatca 2616481 ctgtttgaca ccctagacgg cgtcgacacg ctgactgacc aattgcgggc ccttaccgac 2616541 gacatgaata agatggaggc gctcacaccg caatttctcg cactgctgcc gccaatgatc 2616601 acgaccatga agaccatgcg gaccatgatg ttgaccatgc gatcaacaat aagtggcgta 2616661 caagatcaaa tggccgatat gcaagaccat gcgactgcga tggggcaggc cttcgacacc 2616721 gcaaaaagcg gcgattcatt ctatcttcct ccggaagcct tcgataatgc agaattccag 2616781 caaggcatga agttgttttt gtcgccgaat ggtaaggcgg tgcgcttcgt aatttcccac 2616841 gagagcgatc cagcaagtac tgaaggtatc gatcgcatcg aagcgataag ggccgcgacc 2616901 aaagatgcca tcaaggcgac accattgcaa ggcgctaaaa tctatatcgg tggcacggct 2616961 gcgacctacc aagacattcg agacggtacc aagtacgata tcctcatcgt tggtatagcc 2617021 gcggtatgcc tggtatttat tgtcatgctc atgattaccc agagcctgat tgcgtcactc 2617081 gtcattgttg gcacggtact tctgtcattg ggtactgcgt tcggactgtc cgtgctcatc 2617141 tggcagcact ttgtcggtct ccaggtgcat tggacgatcg tcgcgatgtc tgtcatcgtc 2617201 ttgctggccg tcggttctga ctacaacctc cttttggtgt cccggttcaa ggaggaggtc 2617261 ggcgctggat taaagaccgg gatcatccgg gcgatggccg gcaccggcgc agttgtcacg 2617321 tcggccggtc tggtattcgc gttcaccatg gcgtccatgg ccgtcagcga actccgcgtt 2617381 atcggacagg tcggcaccac catcgggctc ggtctacttt tcgataccct ggtggtccga 2617441 tcgttcatga cgccatccat cgcagcgctg ctaggtcgct ggttctggtg gccgaacatg 2617501 atccactcga gacccaccgt cccggaggcg cacacacgcc agggcgctcg ccgaattcag 2617561 ccgcatctgc accggggttg atatgcactt cggtgccgtg atcggcgccc ggggtgttcg 2617621 tcgaccatgc gaccggcaac gcggccttgc gcacaggcgc gatcgctcat tcgtgcccgg 2617681 gcggtcgaag accaagagcg cgcagcagtt ggtcgcggtc ccacggccgg ccgctgccac 2617741 tagcattgtc gccggatgct gtcagcagcc catttcgagc tcgaagcccg gacaacttct 2617801 ttagcgtgtg gcgcaacccc cgaagactcg tcaacggaag aagcagtagc tgctcatcgc 2617861 gcccgccatc agcccggcgc gccgagttgc ccgggtcggc cgggttgccc tggtgtgccg 2617921 cgttgccggg gttggtcgtc gcgtgcatcg cctgcgcctt ggtcgccggc gtcgagccgg 2617981 attcggctgc caccgcagac gtggtagccg gcgacaccgc aggtgccgtg gctaccgcag 2618041 gtgccacctg ttcacccact ccgccgatag caccggaatg gccttcaccg ccgagccccc 2618101 agtgcccgcc aacccacgga gccccaccga agccgggcat caacccgccc tcagcagaag 2618161 cgcccgcagc gccggtgctc agaccgccgt caccgctggc catgccgact ccgccgtccc 2618221 cgcgaacaga cccgacgatg tccctgccag tcccggcacc accgactgcg tctgcgctgc 2618281 cggccccagt cccattgccg gctgcgctac cgaccccagc accactgcca cccgggtccg 2618341 cgacaccggc ggcacctgtt gcgccgctgc cgtgcgaagc agaaacgccg ctgccgtgcg 2618401 cggcgccagc cactccgggg ttaccgtcgg tgctgccgag ctggccggcc ccatgctgct 2618461 cgccgacctg accgttgccg atcggtccgc cgtcccagcc ctttccgcca gcctggccga 2618521 caccgccaaa cccgccggca ccgcgagatt gcccggtgcc cgtcccccca gtgcgatctt 2618581 gggcgagttc gctgaccacg ccctgcgctt tctgcagcgg caaggcgttg gcggcctcag 2618641 ccatggcata cgccgccgcg ccctcttgca ggatctgtac aaaccggtcg tgaaacagcg 2618701 ccgcttgagc gctaatggcc tgataggcct gcgcccgcgc gccaaacaac gccgcgatgc 2618761 cagccgacac gtcatcgccg ccggcagcca gcactcctgc cgtcggggcg gccgcagcgg 2618821 cgttggccgc gcgcatagtg gagccgatcg cggctagctc cccggccgac gccgccagaa 2618881 cgttcggggc cgcggtaacg tgcgacataa gcgagcacct gcccgtgttg ccaactcgct 2618941 gtgaccggat cgctggtcga cccgcgttgt caccgcgaat cctatcgcga tcgaccagga 2619001 acatcccagc attcaggcat gcctactgcg cctcacactg aagtgtcgag gtcggcggag 2619061 tcccggcatc atcaggcgag tggcatgcac tcaccaaccg cggccagctc ggcaccagct 2619121 tggtgtcggc gcacagagct gttcgggccc atacgtcgac gtagccgaac ccgccccgac 2619181 tctcgtcgga cacgttctgc tgtttggcgt ggccgaacga tcagatctcg tcgcgccgaa 2619241 cgtgtattgc cgggccggtg gaagagtctg tcgggagaaa aaggaaaagc cctgcagaga 2619301 ctggtgtgac acgccttgcg cagccacgcg gtcggaaaac cgaaccttag ctcatcagaa 2619361 cccaacacaa gaggcgggac aagccgagtt caagccgaac gccctgctcc cccgggagga 2619421 ctcgaacctc caacccttcg gttaacagcc gaacgctctg ccaattgagc tacaggggac 2619481 cgcctggtcc gtgcgaacgc tggcgcagtc gcgggacgac tctagcgtac tggtgtgacg 2619541 gcgcccaact agggagattc cttaccgatg ggagcaggct gatggcagca ggcacgatgc 2619601 cagtaggtgg tcggcagcac gttttcgaga agctggccag catcctgggc ttggtcgccg 2619661 cgccgctcat gctccttgga ttgagtgcct gcggccgcag cgccggcaag accagcgaac 2619721 cgacctgccc cacggagccg atcgatgcgg ccgacagctc gacaacaccg gacccctcgt 2619781 gtgtggtgcg ggccactgag atcaacggca acgggtcgcg catccagacc tggaccggca 2619841 gctatgatgc ggccgcaacc cagtccggtg gtgtgtgtgg tggcacctgc aacttccacg 2619901 ccacagtgcg gttcacggtc gacgaaggcc agatctcggg cagcgtcgat caggtctatc 2619961 aagcggcgat ggttgctatc gcaacacgcc ccacttcgcc atctctggca ccatgacgat 2620021 gacgcggtga ccatcgcgtg atccaagacg tacctgacgg gcaataagcc gataccaaag 2620081 ccgagcccgc atcacgccga aacaaccgcg gagtatctgc tcggcgtcgt gaattgggtg 2620141 accaagtgga acctcgattg cgtcgaaggc tgcaaatagg acatcgggta ccgcataacc 2620201 ggatcgggcg cgcgtagcca ggcgtgtaag gcaggatgga tgcaaccgca ccgttagtcg 2620261 gagggaccgc attgatcggg tatgtcgccg tgttgggact gggttacgtg ctgggcgcaa 2620321 aagccgggcg ccgccgctac gagcagatcg cgagcaccta tcgcgcactc accggcagcc 2620381 ccgtggccag gtcgatgatc gaaggcgggc gtcgcaagat cgccaatcgg atctcacccg 2620441 atgctgggtt tgtgaccctg gccgagatcg acaaccagac cgccgttgtc cagcgcgggg 2620501 tcgagcggca gccgaaaacc gcgcgctgac cctcacgcgg tgagatcgtc gccgctggcc 2620561 tgctctaaca ggctgcgccg ataggcctcc atggcgacca ggtcgccgaa cagcgcgtgg 2620621 tattcgtcgc cttgctcgat cggcgacatg cgctgcagtt tggacttcac ctcggcgatc 2620681 tgccgcccca accaaacctc ctgcagacgg gccagcacgc cggcgatata gcgcggcagc 2620741 ttgtcgtcgt cgacctgaat cgcctccacc cccagctcgc tgatcaaagc cgaggtcacg 2620801 gttgatgtcg tctgctggcg caccatatcc agccactgcg caccgctaag gccagccgag 2620861 gtaccgcccg ccgtgtcgat ggccgcgcgc acagccgcgt actcggggtg cgtgaagcct 2620921 tcgacggtca gcgcgtcgaa caccgggccg gccaacgccg ggtactgcaa cgccgatttg 2620981 agtgcctcac gctgtggcca cagggtcggg tcacgcggat caggtcggac tgcgagttcg 2621041 gtcggggggc cggcggtggg ccgctgcgct gcccgggcga tggtcgtcga tcccagtctg 2621101 cccagcctgg ggtgcttggt tcgtttggcc tcaccccgca cccgaccgat gacctgtgcg 2621161 acgtcggccc acccgaccca gccggcgagc tgacgggcgt attcgtcacg cagcgtgggg 2621221 tctttgatct ggcccaccat cggtacgcaa cggcgcagcg cggccaccct gccctcggcg 2621281 ctatccaggt ccatctcggc aatcgcggcg cgaatcgcga actcgaacaa tggggttcgt 2621341 cgtgccacga ggtcgcgcag ggcagcgtcg ccgcacttca gtcgtaggtc gcaggggtcc 2621401 atgccgtcgg gagccaccgc gacgaaagac tgaccagcca gcttctgctc accgtcgaag 2621461 gccttgagcg cggcggcgcg gccggcctcg tcgccgtcga aaacgtagat cagctcgccg 2621521 cggaagaagc tgtcgtccat catcagtctg cgcagcatcg ccaggtgctc gccgccgaat 2621581 gcggtcccgc acgacgccac cgcggtggtg accccggcca gatgcatggc catgacatcg 2621641 gtgtagccct cgacgacgac ggcctgatgt cccttggcga tgtcgcgttt ggccaagtcg 2621701 atgccgaaca tcaccgatga cttcttgtac agcaatgtct cgggcgtgtt gacgtacttg 2621761 gcctccatcg cgtcgtcgtc gaacagtcgc cgggcaccga acccgaccac ctcgccggcc 2621821 gaggtgcgga tgggccacag cagccgacgg tgaaaccggt ccatcgggcc gtgccggccc 2621881 tgccgggaca gtcccgcggc ctccagttcc tcgaactcaa aacccttgcg ctgcagatgt 2621941 tttgtcaatg agtcccagcc cgacggggcg aacccacagc cgaatttacg agcggccgcc 2622001 gcgtcgaagc tgcgttcggt caggtactgg cgagccggtg ccgcctcgtc ggactgcagc 2622061 gcctgcgcat agaacgctgc cgcggccgcg ttggcggcca gcagcctgct gcgactgccg 2622121 cggtcgcgct gcacgctggt ggccgcaccg gtgtagctga tcgtgtggcc gatccggtcg 2622181 gcaagcaact caaccgcctc gacgaagctg acgtgctcga tcttctggat gaacgcatac 2622241 acgtcgccgc cctcgccgca gccgaagcag tggaagtggc cgtggttggg ccgcacgtga 2622301 aaggacgggg acttctcgtt gtgaaacggg cacagcccct tcagcgaatc ggcaccggca 2622361 cgcctgagct ggacatagtc gccgacgaca tcctcgatac gggccccctc gcggattgcc 2622421 gcgatatcgc gatcggagat ccggccggac atcggctcag tctaaagcgt tcctgctgac 2622481 gccaagctga tcggcatcga tgcgttccaa ccgaccctcg gtataggagg cgatctgatc 2622541 aacgacgacc cgcaaccggg cagcgtcgtc ggcggcggta ttgaacgcag cggcataaac 2622601 cgggtcgagc gtctgcggcg cccccgagta cagcctgtgc gccacccggt gaatacgttc 2622661 gcgctgccgt gcctgggttt ccagatgccg agggtcggac atgatgaact gcagcgcgag 2622721 gattttcagt accgcgacct cggcacgtac cagatcgggc acctgcaggt cggcccggaa 2622781 gcgcaccaac ggtcccggac cggccgcggc ccgggtggtc gcgatcgcgg ccgatgcaaa 2622841 gcggcccacc agctcgctgg tcaaccgctt gagcgcgacc gatgccgaca aggtggcgtc 2622901 atacttgccg acggcggcca ccacgggcag ccgcgacagc cgccgcgcgg ccgccatcaa 2622961 ctcgtcggcg ctcacccggg agaactcgcg ctcccctaac ctggccagcg cggcagcgtc 2623021 ctcttcggcg gccagcacac gcaggtcgat gcgttcggag acaacgccgt cctcgacgtc 2623081 gtgaaccgag taggcgacgt cgtcggccca gtccatcacc tgcgcttcca ggcacgcccg 2623141 ctccgggggc gcgccttgcc gaacccatac cgccgattcg cggtcgtcgt cgtagaagcc 2623201 gaacttcctc cgctggctgc caagcccgtc accacgcatc cacggatact tggtgaccgc 2623261 gtccagggac gcgcgagtta ggttcagccc cgcactaagt ccttgtgcgt caactacttt 2623321 gggctcaagg ctggtcaaga tacggaagtt ctgcgcgttg ccctcgaaac cgccgtggct 2623381 ggctgcgact tcatcaagcg cccgctcacc gttgtgtcca tacggcgggt gcccgatgtc 2623441 atgggctaga ccggccaatt cgaccagatc aaggtcgcag cccagcccga tcgccattcc 2623501 ccgtccgatc tgagccactt ccagcgagtg ggtcagccgg gtacgcggcg tatccccttc 2623561 ccggggtccg accacctggg tcttgtcggc tagccggcgc agtgcggcgc tgtgcagcac 2623621 ccgggcccgg tcccgggcga agtcggagcg gtactgaccc tcagtgcccg gcagaccggc 2623681 agtctttggc gcttcggcta cccgccgctg gcggtcgaag tcgtcgtagg ggtcgtgctc 2623741 actcgcgctc accgacccac agtctgccag ggtggtcgcc gcacgcccgt atccgccggc 2623801 acagcgtcta aattgacggt atgcgtctcg ttcgcctgct cggcatggtc ctgactatcc 2623861 tcgccgccgg gctgctgctg gggccgcccg ctggcgcgca accacctttc cggctgtcga 2623921 actacgtgac cgacaacgcg ggcgtgctga ctagctccgg tcgcaccgcg gtgacggcgg 2623981 ccgtcgaccg gctctatgcc gatcgccgca tccgactgtg ggtggtctac gtcgagaact 2624041 tctccggtca gagtgcgctc aactgggcgc agcgcacgac gcggactagc gagctgggta 2624101 actatgacgc gcttctggcc gtggccacca ccggtcgcga atatgccttt ctagtgccat 2624161 ccgcgatgcc gggtgtcagc gaggggcagg tcgacaacgt gcggcgctat cagatcgaac 2624221 cggcgctgca cgacggcgac tacagcggcg cggccgttgc ggcggcgaac ggactcaacc 2624281 ggtcacccag ttcgtcgagt cgagtggtgt tgttggtcac ggtcggcatc atcgtcatcg 2624341 tcgtcgcggt cctgctggtg gtgatgcgcc accgcaaccg gcggcgccgc gccgacgagc 2624401 tggccgcggc acgccgcgtc gaccctacca acgtaatggc actggccgcc gtgccgcttc 2624461 aggccctcga tgacctctcc cggtcgatgg tggtagacgt cgacaacgcc gtgcgcacca 2624521 gcaccaacga gctcgcgctg gccatcgagg agttcggcga acggcgaacc gcaccgttta 2624581 cccaagcggt gaacaacgcc aaagcggctc tgtcccaggc gttcaccgta cgccaacaac 2624641 ttgatgacaa cacgcccgag acgccggcgc agcgacgtga gctactcacc cgagtgatcg 2624701 tgtcggcggc gcacgccgac cgtgaactcg cgtcgcaaac cgaggccttc gagaagctac 2624761 gcgatttggt gatcaacgcc ccggcccggc ttgatctgct cacccagcag tacgtcgaac 2624821 tgaccacccg gatcggcccg actcagcaac gcctggccga gctgcatacc gaattcgacg 2624881 ctgcggcgat gacgtcgatc gccggcaatg tcaccaccgc caccgagcgg ctggcgttcg 2624941 ccgaccgtaa catcagcgcg gctcgggatc tggccgacca ggcagtgagc ggacggcaag 2625001 ccggactggt ggatgcggtg cgtgccgccg agtcggcact cgggcaagcc cgggcgctgc 2625061 tcgacgcggt ggacagcgcc gccaccgaca tccggcacgc cgtcgcgtcg ctgccggcgg 2625121 tcgtggccga catccagacg ggcatcaagc gagccaacca acacctacag caggcgcaac 2625181 aaccccaaac cgggcgcacc ggtgacctga tcgcagcccg cgatgcggcg gccagggccc 2625241 tcgatcgcgc gcgcggagcc gccgatccgt tgaccgcatt tgaccagttg accaaggtcg 2625301 acgctgacct cgaccggctg ctcgccaccc tggccgaaga acaggcaacc gccgatcggc 2625361 tcaaccgctc acttgagcag gcgctgttta ccgcggagtc gcgggtgcgc gccgtctcgg 2625421 agtacatcga cacccgccgc ggcagcatcg ggccggaggc ccggacccgg ctggccgagg 2625481 cgaaacggca gctggaagcc gcacatgacc ggaaatcgag caacccgacc gaagcgatcg 2625541 cctacgctaa cgcggcatcg acgctggccg cacatgcgca gtcgctggcc aatgccgacg 2625601 tgcaatccgc ccagcgcgca tacacccgtc gtgggggcaa caacgccggc gcgatcctcg 2625661 gtggcatcat catcggcgac ctgcttagcg gaggcaccag aggcgggttg ggtggatgga 2625721 tccccacgtc gttcggcggt tcgtcgaacg cgccgggaag ttcacccgac ggcgggttct 2625781 tgggcggcgg cgggcggttc taagccacgc gccagcgcac ggggataccc gtacgctggc 2625841 gcgtgtggcc gtcgacctag gcttcttcct agggttcgtc gaccctgtca ggcccagctg 2625901 gagccgacgg cgctgtcggt ttgtgccatg ttgttgccgg cagcctgcac cttctgcccg 2625961 tgggcgttgg cctgctcgta gatcacctgg aagttacggc ccagctgggt gatgaactcc 2626021 tggcaagcca ccgaaccggc gccgccccaa aagtcacccg cggccaacac atcacgaacg 2626081 atggcctgat gctccgcctc cagcaacccg gcctgagcgc ggatcatggc gccatgagcg 2626141 tcgacatcac cgaactgata gttgatggtc atcgaacctg ttctccttcg cttgtaaaag 2626201 tattgtgctg cagcggctga cgttagctgc tgaggatctg ctgggaggcc tgctcttgct 2626261 gctcgtagtt gttggcgtcg cgaaccagcc cgtcacgcac cccgtgcagc atgttcacga 2626321 tgttgcgaaa cgcctgattc atctgggcca tggtgtctag cgaggtcgcc tcggccatgc 2626381 cactccagcc cgcgcccgag atgttttgcg cggacgccca catccggcga gcctcgtcct 2626441 ccaccgtctg ggcgtgcacc tcaaaacggc ccgccatgtc ccgcatcgcg tgcggatccg 2626501 tcataaaacg tgttgccatg ttgcctgtct ccttgttgaa cctggaccta atacctgtaa 2626561 cttgtcatgc acattgactg ttgtcatagc cggccgcggg aacaccgaga ccgccgatca 2626621 ctggtcaaat aacgacagtc tgcgccccct ctcctagccg gccgccggag aatgcggaat 2626681 cacgctgctg ctactccgtg gcacctcaaa gcggggttca gcgttctccg ccacccactc 2626741 gttccacgcc tgccactcat cccactcgtc catggccgcg gcctctgcgt ccgccccttc 2626801 ggccgcctgg acatcgacat cggcgccgtc ctccggcgca ggcgcccagt ggtcataatc 2626861 atcgggtggc acagccactc cccagctgag cggaaccgac aacccgccga gcattcccga 2626921 ctcagcccgt ttcgccacca ccgcgtcggg cggcaaaggc ggaccaagag gcaaaagcac 2626981 gttgtcccct tccaggccag ggtctcaaca catccacact caatggctaa acacgaacca 2627041 ccaagcactc agcatcgtat gacaatccgc ggacaatatc ccgggttttc taatttcgct 2627101 gccatgagcc cgtccagcgg ccctggcgcg ggtttcgacc tagccaaccg ttacgtctga 2627161 accatccccg gctagcagat gccgctggga atcggccgcg cgggaccgga ttcctggact 2627221 ggcgttgttt gggggtaggg cacccgatac ggcaggcctt cgttcaagaa acccagcacc 2627281 acattgggca cgcacttggc gaccttcggc aattgacgga ccgggtggtc cagattgggt 2627341 ggcgacgggt ccggcggggc cgcgaaattg aatgccgacg tcatatcgcc ggtgacactg 2627401 gcacgccagg gtgtcaagtt gggaaccggc accccgaaac gcttgccgat caattgcagc 2627461 tgcgatgtgt ggtcgaaccg atcatggacc atcagcccgc cgcgactgta aggcgaaatg 2627521 acgaagcagg gcacgcgaaa gcccaagccg atgggtccac gtattccgcc ggagccgtcg 2627581 accttgtcga tgtcaacact gttgggaatc cattcgccgg gtgtgccctc cggcgcggtg 2627641 agcggtgtga cgtggtcgaa gaagccgcca tgttcgtcat aggcgatgat caacgcggtt 2627701 ttctcccaca ccgccggatt gcgcagcaac acccttatca agttcacgat cgtcaccgca 2627761 ccgactgcca ccgggaatga cggatgttcg gactcgacgg tcaacggaac gacccaggac 2627821 acctggggca gcgtgttgtt gatgacgtcg cggatgaaat cccacgggta ggccggggcg 2627881 atgccataac gggccaggtc cgacctcgga tctgcggcct gtttgaaact gcccacatac 2627941 ccgttacggc tcaaggaagt gtcgttgagc ccgccgagca gcttgctgtt gtacaccttc 2628001 caactgatgc cggcgtcact gaggttctgc ggcatgatgc gccaggtgaa ggtcaacttc 2628061 ggctggatgg cgggttcgac gatctgcggc ccaccttgat ccccgtcggg attgacggtg 2628121 gcgctgatcc aatagagccg gttaggcatc gtcccgccaa gaagcgacga gaagtactgg 2628181 tcgcagatcg tgaaggtatc ggccaacaag tagtggatcg gtatgtcagg acgtgcgtaa 2628241 tagcccatca ccacgggcgt gttggccacc gaccgggtcc gcgcctgcgc cggcagccag 2628301 ccgtcattgg cgccgccgtt ccatgacaag tgcgcggcaa tccactggtg gtctgggtcg 2628361 ttgacgcact cgccaacccc gttgggaccc ccggtggtat tgatgcggta gggcagcgta 2628421 atgccggtgg ggtccagcgc ctgcgtctcc gggttccagc ccttttgttg aaacagcggc 2628481 gtcggagtgt cgaacccgtc gacggcagaa agcgtgccga aatagtgatc gaacgacctg 2628541 ttctcctgta ggcacagcac gatgtgctcg atatcggtca aatgacccga gcagggaccg 2628601 gcaccatagg ccttttcgat caccggtgcg gcccagtccg tcaaaaccgc cgctgccccg 2628661 gctccagccg ccttagccag gaatgctcgg cgtgacattc cggcgaatgc accttggctc 2628721 accacatcgg ctctccctcg tgtatttcgg cttaccgtcg cggccatcgc cgactgtggg 2628781 tcaacagaga ccgctgggaa tcccccgagt gggtgcggtt tcttgggtgg gcatcgactg 2628841 tgggaatggc acccgatagg gaattgccgt cttggtcacc gttcccagta ctgcgttggg 2628901 cacgcactgc ggcaacttcg gaagcgcatt gagccgcggg tggtccagat tgggtttcga 2628961 cgggttcggc ggagcggcga agttgaacgt cgaagtcata tcgccgaccg tcgcgtcccg 2629021 ccaagcggtg agattgggaa ccgggacgcc gaaccgcgcg cggatgagct tcagcgttga 2629081 cgtgtgatcg aaggtgtcgt ggaccatcag tgggccgcgg ctatagggag agatgacgag 2629141 gcaagggacg cgaaacccca gaccgatcgg cccacgaatg ccacccgagc cgggcaccga 2629201 gtcgatgtcg gggaccgtga cgaattcgcc gggcgtcccc ggcggcggtg tcggcggcac 2629261 gacgtggtcg aagaacccgc cgttctcgtc gtagttgacg atcagcgcgg tcttttccca 2629321 caccgcaggg ttggacagca agatccgcag tgcatcgaca attgcaacgg cgccgacgtt 2629381 caccgggaat gcgggatgtt cggacagcag aaacccgggc agcacccagg agaccttggg 2629441 taatcggttg tttctgacgt cggcggcgaa gtccagcggg taggtcggtg agatgccgaa 2629501 acgggccaga ttcgacctcg gatccgcggc ctgcttgaag tcattgacca gcccgttgta 2629561 gccgacgacg gtgttgttga gagcccccag caatttgttt tggtacacct tccagctgac 2629621 cccggcatct tcgaggttct ccggcatgat gcgccagctg tagtgctgca gaggttggat 2629681 attgggctcg atcagcaccg gcccgccgtc agtgccgtcg gggtcgatcc aggcgctcat 2629741 ccagtagagc cggttgggcg tggtcccgcc cagcagcgag caaaaatagc cgtcgcagac 2629801 cgtgaacgtg tcggctagca ggtagtgaat gggcaggtca cgacgcgtgt agaaacccat 2629861 cgtgaccggc acgttgccct gcaacggact gaacgggacc tgcgccggca gccagttgtc 2629921 gttggcgccg ccgttccacg agttgtgcat gccgatccag ctgtggtccg ggtcgttgac 2629981 gcattcgccg gcgaccagcg ggccccgggt ggtgtcgaag cgatatggaa gggtgacgcc 2630041 ggcggggtca accgcctgtg tcatcgggtt ccagcccgac tgcgcgaata ccaccggcgg 2630101 ggtggtgtca tcgaacccgc gggtgtcaga aagagtgccg aagtagtgat cgaatgaccg 2630161 attctcctgc atcaacaaca cgatgtgctc gatgtcggtc aaatgtccgg ggcaaggccc 2630221 cgctccgtag gctttttcga taatcggacc agccaaggac atgaaggccc cggcggtggt 2630281 agcggcggcg gctttggcaa aaaattgtcg gcgggtcatt ccgtcgacgg ggtgttcgct 2630341 ccccacgcgc cctccttgac ggcccacacg gccattgctg atcacggtat agttgcggcc 2630401 gcgatcggct atgccttgcc gaccggcgtg tcgtgttctg attccgcctg cctgccgggg 2630461 cgggcgcggg attggtgcgg gcgatttgct cgcgcacatg caagcaaatc gaacgccggg 2630521 agattaccgg gaaatttcag ctgcacagcc cgctgggagt cccgcggacg ggtgtggttt 2630581 cctgagttgg catcacctgc ggatagggca cccgataggg aatgctcggc aacgcgccgt 2630641 cggtggttcc caacaccacg ttagggatgc actgcggcag cttcggcagc gctcccagca 2630701 acgggtggct caagttgggt ctggtcgaat tcggtggagt cgcaaagttg aacgctgagg 2630761 tcatgtcgcc aaccacgccg tcgcgccagg cggtcatgtt gggaaccggc acgccgaacc 2630821 gggcgcgaat caacttcaat tgcgaggtgt ggtcgaacgt gtcggagacc atcagcgggc 2630881 cgcggctgta cggcgaaatg acaatgcagg gaacgcgaaa acccagaccg agcggaccac 2630941 gaatgccacc ggacccgggt actgcgtcga tgttgggcac cgtgacgaat tcgccgggtg 2631001 tcccgggcgg tgccgtgggg ggcgtgacgt ggtcgaagaa gccgccgttc tcgtcatagc 2631061 tgacgataag tgcggtcttt tcccacaccg cgggattgga cagcaagatc cgcagcgcgg 2631121 tcaccatgga caccgcgcca agcgctaccg gcagggcggg gtgttcggac tgcaggatgt 2631181 tgggaactaa ccaggagacc ttgggtagcc ggttggccct gacgtcggca gcgaagtccc 2631241 cagggtaggt cggggcgata ccgtagcggg ccaagttcga cctcggatca gctgcctggc 2631301 ggaaggcctg caccagcccg ttattgctga tgggcgtgtt gatgaatcgc ccgaggccct 2631361 tgttctggta caccttccag ctgaccccgg catcttcgag gttttccggc atgatgcgcc 2631421 aactgaattg ctgcagcggc aggaagcccg gctctaccaa ttggggtccc ccgtcggtgc 2631481 cggcggggtc gatgttggcg ctcaaccagt agagccggtt gggcagggtg cccgtcagca 2631541 gcgagcaatg gtagccgtcg cagatggtga acgtgtcggc cagcagatag tggatcggga 2631601 tgtcttggcg cgtgtagtaa cccatggtca aagggacata tggtcctgcg cgggtggtcg 2631661 cctgcgccgg cagccagttg tcgttggcac caccgttcca ggccaggtgc atccccaccc 2631721 actggtgctc ggggtcgttg acgcactcgc cgtccaggaa ggggcctcgg gtggtgtcca 2631781 agcggaacgg aatggtgacc ccggcggggt ccaacgcctg cgtcatgggg ttccaaccca 2631841 tttgttggaa tgccggcgac gcggcgttga acccattggt gctggaaagc gttccgaaat 2631901 agtggtcgaa tgaccggttc tcctgcatca gcaacacgat atgctcgatg tcggtcaaat 2631961 gtccgggaca aggcccggcg ccgtaggcct tttcaatcac cggtgcagcc cagtccatca 2632021 ggaatgccgc tgcgcctgcg ccagtgagct ttgtcaaaaa ctctcgacgt gacattccga 2632081 ggagtgggct tgcgctcact tgccctgcct tcctgcactc agctcagatc acgttatagt 2632141 gacgacagcg gtccatcgcg atacgccaac cggcgtgtcg cacgcggatt ttcgcgttcc 2632201 agcaaccgca accgcaccgt ttggcgcggc cgacggccgt ctaggggata tcgcagcggg 2632261 aagggtgccg taaccatgat tgtcgctggg tatcgggcac tcgccgacag taaaaaatta 2632321 ttcgaatccc gcattcctga caaaacttga tatgaccgat ctcaccggcc ggcttcggcg 2632381 cttaagtcac tagacagttc gaggtcagcg acgggatatc gcgctatcgg taaactaatt 2632441 tcgtatctgc ccaaccgcgc cgccaatgca gcgtccgtac catgtggact acggtgctga 2632501 tgttgactct ggtggcgacg gctgacaccg tccggatccg aactggcgtt cttttgtccg 2632561 cccattgctt gcattctggc tccggggcat agcgacaagt gttgccctgg ctgttgacgt 2632621 gctcttcggg caagcggact tcacgctttc aagcgtgcac tcggccgaac ttgccagcgc 2632681 gaactccacc agcggacacc ttcagatcgc gatggttgtg ctggcgctgc tgatcgccgg 2632741 gctcacggcc ggaggggctt tccgcatggc cagcggactg ggccacgcct aaagacttag 2632801 ctctctttcg cgagcgcgac cgcttcggtg cacttcattt cgccgacaat cacggcacca 2632861 aggccaggga tttccaacga cgtcgccgcc gcgatgactc ccgcgtcgac gcgccctgcg 2632921 cgctatccga tcccgacccg cggcaccaca ctgggccgag cctgcaccac atgcggattt 2632981 gcgccaccgc ggcccatcat cccggccggc atacccgccc cagcaccccc catacccatc 2633041 ggcatcggca tcatccccat gggcccgcct gccgccggca cctcagcagg catagccccc 2633101 aaacccgcca tcgccgaact ggccatccgc gcaggaaccg acccctccca ggtcggaggc 2633161 accgacatcg cccccaccaa ccgcgcctta cccaactccg ccgacatccc cgcacccaga 2633221 ccaccggcac cacctaggcc cgtccccgaa gcgatatcac cggcaaacgt cggcacatcc 2633281 gccgcagcca accccgcagc ctccgcaccg gccaacccag ccgtgttggc cgtagtcccc 2633341 atctgcgcca actgcatcat cggcccaatc aacatactgg ccggatacgc cgcggcctgc 2633401 cccacctgca gcgccgcatc caccggcaac cccgccgcca ccgactgcgc cgcagccacc 2633461 acagccggca ccccctccac cgcaccctcc gcgatcggag acaacgcagc cgaaaccgac 2633521 gtcgccatcc cggtcaactg cgcaccggcc tgggaagcca accccgccaa atccagcggc 2633581 ggcacactaa acggcgtcaa cgtctcagcc accgccgccg cccccgcgtg ataccccacc 2633641 atcgcaccca cgtcctgagc ccacatctcc acataatcga actcagtggc cgcaatcgcc 2633701 ggcgtgttct gacccaaaat gttcgtcgcc accaacgccc ccaacaacac ccgattcgcc 2633761 gtcaccgccg ccggatgcac cgtggccgcc aacgccgcct caaacgccgt cgccgccgcg 2633821 gtagcctgac cagccgacaa ctccgcctgc ccggccgccg cactcaacca ccccacatac 2633881 ggcgccgccg cccccgccat cgccaccgac gccggacccg accacggccc agccgccaac 2633941 ccggcgatca ccgcatcaaa cgaggacgcc gaggcccgca aatccgcagc caacccctcc 2634001 cacgccgccg ccgccataaa caacggcccc gaccccgcac cggcatagat ccgcgccgag 2634061 ttgatctccg gcggcaacca cgaaaaatcc aaaatcatcg caaccccaaa ccagccagcc 2634121 gcctcaacgg ctccgcctac cactctccag acacaaacca gcccacgggc ggatggtaag 2634181 acaatccaca ccgaaaatcc gcacttttac caaaacttta ttcatgaatt cggcatgagc 2634241 cgttcacgcc ggcacgtcac cgccgccagc caccgggcaa gtgtctagta actggacacc 2634301 ggaaggcagc caccgggcag gcctcgccgc aatccgcagc tacacggctc gcgatatttc 2634361 cgggccagag ttttagccac cgcgagccat cagcaactcg cgtaaagact gcgcgaagcc 2634421 aacgaaaaaa taaggcggca aaaatatccc gtcagacggt cacgtcatac cgagtgaggt 2634481 aaccgtgatt agaccaacta catcgcacta ccgaacggaa accaccacta tccgaacaag 2634541 ttcttgaaga aacccgaaag cccattgccg ctgaccagca ggcccgagtt gcccgtccca 2634601 aaattgaaaa atcccgaact catcacgccg gtcacaaaaa tcccggtgtt gttgacggcc 2634661 gcgttataga aacctgagtt gccgtagccc gtgttgagca cccctgagtt accgccgaac 2634721 atgcttgtgg tgctggtatt gacatagccc gagttgccga agcccgagtt ctgaatgccc 2634781 gagttgccac tgccagccgg gtcgttgtgc ccgaaaccag agttaccggt accggcgttg 2634841 aagaagcccg aattcggacc agcttgggtc atcgcgctga agaagcccgt gttgagcgtg 2634901 cccggattaa acccaccagt attgatgttg cccgagttcc ctaagccagt gttgacatca 2634961 cccgcattgc ccacaccgga gttgacgtcg ccagcattga ggaaaccact gttgccgtca 2635021 cccgaattcc caaaactcga gtttatgttt ccggcgttaa gactgccgaa gttgtagttg 2635081 ccagcgtcga aaaagcctgt gttgccggcg ccggcgttag cgaggccagt gtttgtgctg 2635141 ccgccgttcc aaaatccggt gttgacgttg cccgcgttcc cgaatccagt gttcgcagta 2635201 ccggagttcc cgaagcctac gttgccggtg ccggagttga acaaaccgac gtttccggtg 2635261 ccggagttcc cgaaaccgat gtttccgctg cccgagttca gtccgccgat gccgatctga 2635321 ccatcgccgg tgagcccgat accgatgttg ttgttgcccg tgtttccgaa accgaaattc 2635381 ccgctgccgg tgtttccgaa gccgatgttg ttactgccgg tattgccgct accaaagttg 2635441 aagttgccgt tgtttccgtt accgaagttc gtgtcgccga tgttgccgct gcccacgttg 2635501 gtgctgccga tgttgccgct gcccacgttg gtgctgccga tgttgccgct acccaggttt 2635561 tggctaccga agtttctgaa ccgccccggc atgtccggag actccagttc ttggaaagga 2635621 tggggtcatg tcaggtggtt catcgaggag gtacccgccg gagctgcgtg agcgggcggt 2635681 gcggatggtc gcagagatcc gcggtcagca cgattcggag tgggcagcga tcagtgaggt 2635741 cgcccgtcta cttggtgttg gctgcgcgga gacggtgcgt aagtgggtgc gccaggcgca 2635801 ggtcgatgcc ggcgcacggc ccgggaccac gaccgaagaa tccgctgagc tgaagcgctt 2635861 gcggcgggac aacgccgaat tgcgaagggc gaacgcgatt ttaaagaccg cgtcggcttt 2635921 cttcgcggcc gagctcgacc ggccagcacg ctaattaccc ggttcatcgc cgatcatcag 2635981 ggccaccgcg agggccccga tggtttgcgg tggggtgtcg agtcgatctg cacacagctg 2636041 accgagctgg gtgtgccgat cgccccatcg acctactacg accacatcaa ccgggagccc 2636101 agccgccgcg agctgcgcga tggcgaactc aaggagcaca tcagccgcgt ccacgccgcc 2636161 aactacggtg tttacggtgc ccgcaaagtg tggctaaccc tgaaccgtga gggcatcgag 2636221 gtggccagat gcaccgtcga acggctgatg accaaactcg gcctgtccgg gaccacccgc 2636281 ggcaaagccc gcaggaccac gatcgctgat ccggccacag cccgtcccgc cgatctcgtc 2636341 cagcgccgct tcggaccacc agcacctaac cggctgtggg tagcagacct cacctatgtg 2636401 tcgacctggg cagggttcgc ctacgtggcc tttgtcaccg acgcctacgc tcgcaggatc 2636461 ctgggctggc gggtcgcttc cacgatggcc acctccatgg tcctcgacgc gatcgagcaa 2636521 gccatctgga cccgccaaca agaaggcgta ctcgacctga aagacgttat ccaccatacg 2636581 gataggggat ctcagtacac atcgatccgg ttcagcgagc ggctcgccga ggcaggcatc 2636641 caaccgtcgg tcggagcggt cggaagctcc tatgacaatg cactagccga gacgatcaac 2636701 ggcctataca agaccgagct gatcaaaccc ggcaagccct ggcggtccat cgaggatgtc 2636761 gagttggcca ccgcgcgctg ggtcgactgg ttcaaccatc gccgcctcta ccagtactgc 2636821 ggcgacgtcc cgccggtcga actcgaggct gcctactacg ctcaacgcca gagaccagcc 2636881 gccggctgag gtctcagatc agagagtctc cggactcacc ggggcggttc aacaccgaaa 2636941 aattcaccac taccgcccct cctctaacaa atcattctca accgcacccc cgcgcgttac 2637001 cccaaacgac acgcggacac ccgtcaccga gacgtcctac gttgtctggg cgccaaaccg 2637061 gctcgatccc cgacttggct cacgattcgc ggctcagcat taatagagcc cgttgacctg 2637121 tgagtttgct tggtgacggg tcgaaaattg tgcacttgat gcactcagga gtacctggac 2637181 gcccggacgg ccaaccgggg cgccgccgaa ccacggtggc gcgccagatg actcaattga 2637241 cccgagtgct gctcccgctg tccgtaccgc tctttcgtca cgtccgcaac actggccctc 2637301 gccgtcggcg atggtcgctg tgcccacctt agcgcgacaa ctcggtttct gcaggtcaac 2637361 gcccgcctcc aatcccgcac agccacgacc aactcgggaa caaaaccgcc ggtcaggcag 2637421 ctgtcgctga gagccgggca catcgggtgt cgcccggtac agtgacacat gtgaccgttg 2637481 cgaccgtgcg atgtgcccga cgctcgatgc gcaccaattc gaaccaactc aggtcttacg 2637541 ctgcctggac gccgaactag ctcgatccag cgccgacccg caccccacta ccggcatctg 2637601 aaggtgagcc agagacgcgt cgaccaggaa gaaccgtggc cgcacgggtc acccgggcac 2637661 acccaaccgg gccgtggcaa gtgccgacta cctgaagaat cccgaaagtc ctacacccgc 2637721 attgaaagca ccggagttct ggctacccga atttaccgca cccgaactgt cgtcacccga 2637781 gttggagata ccggcgaggt tgttacccga gtttgcaatt cctgcattga aagagccaat 2637841 gtttgcaaac ccggagttga agccaggaag catggctggg ccggcgttgg agaagcccga 2637901 attaccgttg cctgtgttga agaacccgga gttgccggtg cccgaggggt cactgttccc 2637961 ccaacccgag ttgccggtac cgaggttccc gaagcccgag ttggcacctg cttgggtgag 2638021 cgcgctgcca aaaccggtgt tgacgttgcc gccattgaaa ccaccggtgt tgatatctcc 2638081 accgttaaag aaaccggtgt tgacgttacc tgcgttcgcg aaaccggtgt tcgagtcacc 2638141 cgcattgaag aagccggtgt ttccagcacc cgaatttccg aagccggtgt tctgaaagcc 2638201 cacgttgaag ctgcccgagt ttgagttccc gccgtcgaag ataccgacgt ttccgttgcc 2638261 ggcactcccg aagcccgtgt ttaaattgcc tgcgttccag aaaccggtgt tgatatttcc 2638321 ggcgttcccg aaacccgtgt tgccgtcacc cgagttgaag aagccgatgt ttccatcacc 2638381 cgagttgaag aagccgatgt tgttgttgcc ggagtttccg aagccgatgt ttccagtgcc 2638441 tgagttcagt ccgccgatgc cgatctgacc atcaccggtg agcccgatac cgatattgtt 2638501 gttgcccgtg tttccgaaac cgaaattccc gctgccggtg tttccgaacc caaagttgag 2638561 ggtgccatta ttcccgccgc caaagttgaa gtcgccggtg ttcccgccgc cgaaattgac 2638621 atcaccgtta tttccgttgg cgaggttgag cgtgccgaag tttccgctgc ccacattgag 2638681 gctgccgata tttccgctgc cgaagtttcc gctgccgaag tttccgctgc cagggttgta 2638741 gtcacccgtg tttccgctgc cggcattgcc ggtaccggtg tttccgctgc cccagttcag 2638801 gctgccgtag ttcccgctgc ccaggttggt gccgccgaca ttgccactgc ccacgttggt 2638861 accgccgatg ttgccgctgc ccaggttgag gctgccgatg ttgccgacac ccaaattcaa 2638921 ggtcagctcg gcgaggcctt gtgcagcgcc ttgtgcagcg gccgccggtg cgttagccgc 2638981 accgcctagc aagcccgaca agcccggcac cgcctgctgc catggcgcca acgccgccgc 2639041 cgccgccgat gccccaccgt gataacccac catcgccgcc acatcggcag cccacatctg 2639101 ctcataggtg gcctcagcag cggcaatcgc cggcgcattc tgcccaaaca cattcgacag 2639161 caccaactgc acaaacgcac tgcgattagc cgccaccacc accggatcca ccatggccgc 2639221 ccgcgccgcc tcaaacgcgc cggccaccgc cttagcctga accgcagccc ccccagcccg 2639281 cgccgccgca gcagccaacc accccgcata cggcgccgcc gccaccacca tcgccgccgc 2639341 cgccgcaccc tgccacgcct gacccgaccc acccgccaga cccgaggtca ccaacccaaa 2639401 cgactccgcc gccaacccca actcagccgc caacccatcc caggccgccg ccgccgccaa 2639461 catcggcccc gaccccgcac caaaaaacat ccgccccgaa ttaatctccg gcggcaacac 2639521 cgaaaaattc accactaccg cccctcctct aacaaatcat tctcaaccgc acccccgcgc 2639581 gttaccccaa acgacacgcg gacacccgtc accacggcgc cgcccaccca gcggccacca 2639641 cagctcaccg ggtcgtgccc ggaccggggc tgctagctgc ccttgagccg caccgcgaga 2639701 tagtcggcca cgctgctcat cgcaacccgg tcctgcgtca tggcgtcacg ctcccgcacg 2639761 gtgacggcat tgtcctgcag cgagtcgaag tcaaccgtca cacagaacgg ggtaccgacc 2639821 tcgtcctggc gccggtaacg ccgcccgata gcgccggcat catcgaaatc gatgttccag 2639881 catttccgta attcggcgcc caggtcccgg gccttcgggc tcaggtccgc gtgccgggac 2639941 agcggcaaca ccgccgcctt gaccggcgcc agccgcgggt ccaatcgcag caccgtgcgc 2640001 ttatccatcc cacccttggt attcggggcc tcgtcctcgg tgtacgcgtc gatcaaaaac 2640061 gccatgaatg accgggtcaa gccagctgcc ggctcgatga cgtacggcgt gtaccgaaca 2640121 tcgttgatct ggtcgtagaa agacaggtcg acgccggaat gccgcgcatg cgtcgatagg 2640181 tcaaaatcgg ttcggttggc cacaccttcc agttcacccc atggattgcc catgaagccg 2640241 aacttgtact cgatgtcgac ggtgcggtcg gagtaatgtg acaacttgtc tttggggtgc 2640301 tcccacaacc gcaggttctc ccgacgaata cccaggtcga tataccactg cagccggttg 2640361 tcgatccagt actgatgcca ttccttggca gtcgccggct cgacgaagaa ctccatctcc 2640421 atctgctcga actcgcgggt ccggaagatg aagttgcccg gagtgatctc gttgcgaaag 2640481 ctcttgccga tctgtccgat accgaatggc ggcttcttac gagcagttgt caccacgttg 2640541 gcaaagttca cgaagatgcc ctgcgcggtt tccgggcgca gatagtgcag cccctcctcg 2640601 gtctcgatgg gtccgaggta ggtcttgagc atcatgttga actcgcgtgg ctgcgtccac 2640661 tggccgggtt cgccggtttc cgggtcgcga atgtcggcca acccgttagg cggcggatgc 2640721 ccgtgtttgg cttcgtaggc ctcgatgaga tggtcggccc ggtagcgctt atgtgtgatc 2640781 agcgactcga ccagcgggtc atgaaagaca tcgacgtgac cggaagccac ccacacctca 2640841 cgcggcagga tgatcgacga atcgattccg acaacgtcgt cgcggccagt caccaccgat 2640901 cgccaccact ggcgcttgat gttctctttg agctcaaccc ctagcggacc atagtcccac 2640961 gccgactttg tgccgccgta gatctcgccc gacggataga cgaagcctcg ccgtttggct 2641021 aggttgacca cggtgtcgat gacgggcgcc acggggtggt gcactccctt cgagggatcg 2641081 ggcagacgcg cgcagcccga cacgactacg cgcaaaacat cagtcatggt agcgatcggg 2641141 acctgggtct cctattgcct ttgacatgca tcatcatgca tgtgacagtg gaggtcagtg 2641201 gcaggtcctt cctaatacgg cacttctcga ggtgaagact ccaatatggt gacgtccccc 2641261 tcaacgccga ccgccgccca cgaagatgtg ggtgccgacg aagtaggcgg tcaccagcat 2641321 cccgcggata ggttcgccga atgccccacg ttccccgcac caccgccgcg ggagatccta 2641381 gacgctgccg gcgagctgct gcgtgcgctg gccgcaccgg tgcggatcgc catcgtgctg 2641441 caattgcgtg aatctcaacg ctgcgtgcac gaactggtcg acgcactgca cgtgccccag 2641501 ccgttggtca gccaacatct gaagatcctc aaggcggcgg gcgtggtcac cggggagcga 2641561 tcgggccgag aagtgctgta ccgacttgct gaccaccacc tcgcgcacat tgtgctcgac 2641621 gccgtcgcgc acgccggtga ggacgcaata tgagtgcagc cggtgtccgc tctacccgcc 2641681 agcgggcagc catctcgaca ctgttagaga cgctcgacga ctttcgttcg gcccaggaac 2641741 tgcacgacga actgcgccgg cgcggcgaga acatcggtct gaccaccgtc taccgcacac 2641801 tgcagtcgat ggcatcctcc ggactggtgg acacactgca caccgacacc ggtgaatcgg 2641861 tctaccgcag atgctcggag caccatcacc accatctggt gtgccgcagc tgcggttcca 2641921 ccatcgaagt aggtgaccac gaggtggagg cgtgggcggc ggaggtggcc accaaacatg 2641981 gattctctga cgtcagccac accatcgaga tcttcggcac ctgctcagac tgccggagct 2642041 aggacaccac cgaggtcgag cgaccccaca cgccgaacgt gcaaccatgg cggctccgcc 2642101 cggcgtgtcg ccgccaccag ggcacgttcg gcgcacagcg agcacactcc tagccaacga 2642161 gcgcgctgcg gatcgtggcg cccgtctcca gcaccaaaag gatcaacgtg cgcaacgcgt 2642221 cgtcggtcaa accggtgccg ggaaagttgt atcgcagcat cacgtccgcg gtgttggacg 2642281 cgggccgacc agagcttcgc cgcgcagcct tctcgctgac cttttcccgc aggctcaccg 2642341 agccgaagtt gatgtcgcgt gcctgcttgg ccacttgctc ggcgagcctc ttggtcaacg 2642401 gcagatccca cgccaggatc tgggtaaggg acaccagctc gaggtcctca gcgatgctca 2642461 ctacccgcaa cgaggcaaac gtaccgtcat ggcgaaccgt cagcgcgccg tcgggttcct 2642521 cctcggcagg aaggacatcg cgcagtatcg atgccagccg gtccggtagg gatggcacta 2642581 ggcgctcccg aaccgccgag tgcgcgacgc gtattcctcg caggccgccc acaagtcgcg 2642641 gcggtcatag tcgggccaga gcttgtcctg gaatatgtat tcagcgtagg ccgcctgcca 2642701 cagcatgaag ttgctggagc gctgctcacc cgaggtccgc aggaagaggt caacgtcggg 2642761 aatgtcgggt cgctgcaggt ggcgggcgat cgtggattcg gtgatccgct ccgggttgag 2642821 cctgcccgcg gcgacctcac gagcgatttc gcgggtggct tcggtgattt cggtgcgtcc 2642881 gccgtagttg acgcaatagt tgatggtgat gacgtcgttg cttttggtca tctcctccgc 2642941 gaccgccaac tcattgatga cgctacgcca cagccgtggt cgtgaaccca cccaccggat 2643001 ccggacccct agcttcttta gggtgtctcg gcgccgtcgc accacgtcgc ggttgaagcc 2643061 catcaggaag cggacttcct cgggcgaacg cttccagttc tccgtggaga aggcgtagag 2643121 gctgagccac ttgatcccaa gttcgatagc accgcaagcg atgtcgatca ccaccgcctc 2643181 gcccatcttg tgaccttcgg tgcgggccag cccacgttgg gtggcccagc ggccattgcc 2643241 gtccatgaca atggcgacat ggttgggcag ccggtcggcc ggtattcgtg gcgcggccgc 2643301 tttcgaagtg tgctgcggtg gccggcaggg gcctccgtag ggcgctgccg gcaactccgg 2643361 gaagacgacg ggccacgtcg acgtatcagg aaaggtcggg tagtcgtcgg gggccggagg 2643421 cagctgcggg aagttgctgg acgtccgctt ccgtgcatcc ctagccaccg gctatatcct 2643481 gcccgatcag cgcggcgcga cgttcggcaa ccgatcgatc ggcctggtag aaccgctcca 2643541 ccagcggcaa cgttttcagc tgccgttcca gatgccattg caggtgtgcg gccaccaacc 2643601 cgctgacatg gctgcgggcc gattgcggcg ccgcctcggc ggcctcccaa tcgccgtcgt 2643661 acagcgcgga catcaggtct acgacgccca gcggcggtgt ggtcgagccg gccggacggc 2643721 agtgtgcgca gacactgccc ccggtcgcga tgtgaaacgc ccgatgcgga ccaggcgtgg 2643781 cgcagcgggc gcactcggtc aacgctggtg cccagccggc gatgcccatg gcgcgcagca 2643841 gataggcgtc caacaacagg tcccgaggcc gctgtccatc ggccaccgcc cgcagcgcgc 2643901 ccaccgtgag ccggtgcaga gccggagcgg gcgcccgctc ctcaccggcc aggcgttcgg 2643961 cggtttccag tatcgcgcat ccgcaggtgt agcggccgta atcggcgacg atgtcggtgg 2644021 cgaacgcgtc gacagagaca acctgggtga cgatgtcgag gttgcggcca gggtgcagtt 2644081 gcacctcgat atgcgcgaac ggctccaggc gcgcgccgaa tttgctgcgg gtgcgtcgaa 2644141 cacctttggc caccgcgcgg accaacccgt gatcgcgggt cagcagggtg acgatccggt 2644201 cggcttcgcc gagcttgtgc tggcgcagca caacagcccg gtcccgatac agccgcatca 2644261 caatagtttt gcaccccgcc acgacatcgc gggtatccgc gccgatagtc tcgtaccccg 2644321 tggttggcgc ttctgggtcg gatgctggag ccatttccgg ctctggcaac cagcgcctgc 2644381 ccaccctgac cgacctgctc taccagctgg ccacccgcgc agtgacgtcc gaagagttgg 2644441 tgcgacgttc cctgcgcgcg atcgatgtga gccagcccac attgaacgcc ttccgggtag 2644501 tgctcaccga atccgcgctg gccgacgcgg cggccgccga taagcggcgg gcggccggcg 2644561 acacggcgcc gctgctgggc attccgatcg cggtcaagga cgacgtcgac gttgctggag 2644621 tgccaaccgc cttcggcacc cagggctatg tcgcgcctgc taccgacgac tgtgaggtcg 2644681 tccggcgcct caaggcggcc ggagcggtga tcgtcggcaa gacgaatact tgtgaattgg 2644741 gccagtggcc gttcaccagc ggacccgggt tcggacacac ccgcaacccc tggtcgcgcc 2644801 ggcacacgcc gggtggatcc tcgggcggta gcgcggcggc ggtggccgcc ggcctggtta 2644861 ccgccgctat cggctccgac ggcgccggca gcatccgcat ccccgcagca tggacacacc 2644921 tagtgggcat caagccacaa cgcggtcgga tctccacctg gccgctgccg gaggcgttca 2644981 acggcgtcac ggtcaacggc gtactggccc gcactgtgga ggatgcggcg ctggtgctgg 2645041 acgccgcgtc cggcaacgtc gagggcgacc gccaccagcc acccccggtg acggtgtccg 2645101 atttcgtcgg catcgcccct ggaccgctga agattgcctt gtcaacccac ttcccgtaca 2645161 ccggctttcg ggccaagttg catcctgaga tcttggccgc gacccagagg gtgggcgacc 2645221 agctcgagct gctcggccat acggtggtga aaggcaatcc ggactacggc ctacggttgt 2645281 cgtggaactt tcttgcccgg tccaccgcgg gcctctggga atgggcggag cggctaggcg 2645341 acgaggtgac cctggatcgt cgcaccgtat ccaacctgcg catggggcac gtgctgtcgc 2645401 aggcgattct gcgcagcgcg cgccgccacg aagccgccga ccagcgtcgg gtcggctcga 2645461 tcttcgacat cgtcgacgtg gtgctggcac cgaccacagc acaaccaccg ccaatggcgc 2645521 gcgcgtttga ccggttgggc agcttcggca ccgatcgcgc catcatcgcc gcgtgcccgt 2645581 cgacctggcc gtggaacctg ctgggctggc cgtcgatcaa tgtgccggcg gggttcacct 2645641 ccgacggttt gccgatcggt gtgcaactga tgggaccggc caacagcgag ggcatgctga 2645701 tctcgctggc cgccgagttg gaagccgtca gtggctgggc gaccaagcag ccgcaggtgt 2645761 ggtggacgag ctaaaacccc agtcggccaa gctgtttggg gtcgcgctgc cagttcttgg 2645821 cgaccttgac ccgcaagtcg agatagacct tggtgcccag caggttttcg atctggctac 2645881 gggccgcggt acccacctcc cgcagccggg caccaccctt gccgatgacg atgcccttct 2645941 gactatctcg ctcgacgtac agcgcggcgt gtacgtcgat caggtcgtca cgcccctcac 2646001 gtggactgac ctcgtcaatc accaccgcca gcgaatgggg cagctcatcg cgcacgccct 2646061 gaagggccgc ctcgcggatg agctcggcca tcagaacctc ctcgggttcg tcagtcaact 2646121 caccgtcggg gtaatacgcg gggccggccg gcaatgccgc ggccagtacg tcgatcaaca 2646181 ggtctacccg gtcgccggtc atcgccgaaa ccgggacaat ctcggccgca ttcgtgacga 2646241 gttcgctgac cgctaccagc tgggcgacca ctttttcttt cggcaccttg tcaatcttgg 2646301 tgacgatgac caccagtgtc gtattggcag ggccggtcga acgaagctgc tcgacaatcc 2646361 accggtctcc cggaccgatc gcctcgtcgg cggggatgca tagcccgatg acgtcgaccg 2646421 ccgcgtaggt ttcgcggacc aagtcgttga gccgcttgcc cagcagagtg cgcggccggt 2646481 gcagaccggg agtgtcgacg aggatgatct ggaagtcgtc gctatgcacg atcccacgaa 2646541 tggcgtgcct ggtggtctgc gggcgcgtcg acgtgattgc cactttcgcc ccgaccagcg 2646601 cattggtcag cgtggacttg ccggtgttcg gccggccgac caaacacaca aagccagaat 2646661 ggaattcggt catgccggtt tcctcgccga acgtgaacac agggagactt ttcccgcttt 2646721 tttccgccgt gaatgcacgt tcggcgtcat agcgggttac ctgcccgatc ggtgacgatg 2646781 atcgcagcgg tcggggcgag ttcgcggacg gcggcaatgc ccggatcgtc aacggacccg 2646841 gccaccaaga cggcggcctg aagaccggtc gccccactgg acacggccgc ggccaccgcc 2646901 gcctgcagac cggtcagctc gagcgccgac agggccaccg gcgccgccgc gtacgtgcgg 2646961 ccgtcgacat cgcggaccgc cgcgccggca ccggcctcgg cacgtgccat cgccgcccgt 2647021 gccaacacaa ccagctttgc gtcctcggca tctagctgct cagccagggt gatcggcctc 2647081 ctcatcatcg gcgccgtcgg gttcggccgg actcagcaac acggtgccga ttcgtacccg 2647141 tccccgatga tcggtgccac cctcggcatg cagccgcagg ccatgcgata tcacctcagc 2647201 gccgggcagc ggcacccggc ccagttctag ggccagcagc ccgcccaccg tgtcgacgtc 2647261 aaggtcgtcg tcgaactcca cgccgtacag ctcgccgacg tcttcgatgg gcaggcgcgc 2647321 cgatacccgg aaacgcttgt cgcccaagtc ttccaccggc gccgtctcgg cctggtcgta 2647381 ctcgtcggca atctcgccga cgatctcctc cagcacgtct tcgatgctga cgaggccggc 2647441 tatcgcgccg tactcgtcga ccagcagggc catgtggtta cggtcgcgct gcatttcccg 2647501 cagcaatgcg tccagcggct tggagtccgg cacgaacaca gctggccgca tcacccgcgc 2647561 gacggtcgtt tcgcggccgc cgttcgtcga gcagaacgtc tgctcgacaa ggtctttcag 2647621 gtacaccacg ccgacgatgt cgtcgacgtt ctcgccgatc accgggattc gggaatgtcc 2647681 gctgcgtacc gccagggtca ttgcttgacc ggctgtcttg tcgctttcga tccagatcat 2647741 ctcggtgcgc ggcaccatca cctcgcgggc tggggtgtca cccagctcga agaccgactc 2647801 gatcatccgg cgctcgtcgg cagcaaccac gccccgctgc tgggctaggt cgacaacttc 2647861 gcgcagctcg atctcggatg caaacggccc gttgcgaaag ccgcgcccgg gggtcagtgc 2647921 gttgcccagc aacaccagca agcggctgat cggcatcaac aaccacgaga tcagccgcag 2647981 cggaagggcc gtggccaacg agatggaata tgcgttctgg cgcccaaggg tgcgtggccc 2648041 cactcccacg acgacaaagc tggccaaaac catgatgccc gcggcaagat acaaccccca 2648101 caccatgctg aagtggtatc ggatgaaaac caccagcagc gcggtcgcgg tgatctcaca 2648161 gctggtccgc agcaacacga ccaggttgac gtaccgcggc cggtcggcca tcaccttacg 2648221 cagcgacccc gcgcccggcc gctggtcgcg tactagctca tccacccggg ccggagacac 2648281 ggtgctgatg gcggcgtcaa tcgcggcgaa caacccaccc aaaccgatca atacgatcga 2648341 gccgagcagc tggtagtacc cggtcaaagg tcaaaatacc ttgacttgtc gagcaaccgg 2648401 cggtccttct cgtcctgccg gtcgtgctgg taggcctcaa cctggtcggc tacccactct 2648461 tcaagcaacc ggtcctgcag ggcgaacatc tctttttcct cgtctggctc ggcgtggtca 2648521 tagccgagca ggtgaagcac accgtggatg gtcagcaggg ccaattcgtg gcccaggctg 2648581 tggccggccg cagccgcctg ctcagcggcg aattccgggc acagcacgat atcgcccagc 2648641 atggacggtc ccggttcggg ggcgtcgggg cgaccacccg gctcgagctc gtccatcggg 2648701 aagctcatca cgtcggtcgg cccgggaaga tccatccagc gcatgtgtag gtcggccatc 2648761 gccgcggtgt ccagcagcag catcgacaat tcggcgcacg gattgacgtc catcttggcg 2648821 atgacaaacc gtgcgacact gactagttcc gcttccgaga cgtcgatgcc tgactcgttg 2648881 gctacctcga tgctcataag atgctcacgc acccatcatc ggcgaccgcg ggcgccggac 2648941 gcccgccgag ccgcccgatt cagccccgac ccgggctcct cgtaccgcgc ataagcgtcc 2649001 acgatctccg agaccagacg gtggcgtacc acatccacgc tggtcagctc cgcgatatgg 2649061 atgtcgtcga tgtcttcgag gatgtcgacc gccgcccgca gacccgaccg ggcgccgccc 2649121 ggcaggtcga tctgggtgac atctccggtg accacgacct tggatccgaa gcccaggcgg 2649181 gtgaggaaca tcttcatctg ctcggccgtg gtgttctgcg cctcgtccag gacgatgaac 2649241 gcgtcattca gggttctacc ccgcatgtac gccagcggtg ccacctcgat gactccagcg 2649301 gacatcagct tcgggatcag ctcggggtcc atcatgtcgt acagcgcgtc atagagcggt 2649361 cgtaggtacg gatcgatctt ttcgctcagc gtgcccggca gaaatccaag gcgttcaccg 2649421 gcttccaccg cgggtcgggt caagattatg cgggtcacct gcttggtctg cagcgcgtgg 2649481 accgctttgg ccatcgcaag ataggtcttt ccggtgccgg ccgggccgat tccgaagacg 2649541 atggtgttgg cgtcgatcgc gtccacgtag cgtttctggt tgagcgtctt gggccggatc 2649601 gtcttccccc gacgcgacaa aatgtctaga gtgagcactt cggccggtga ctcgttgcct 2649661 gtgccgacca gcatggcaac gctgtggcgc actacctctg gggtcagcga ctggccgctg 2649721 gccacgatcg caatcagttc ggagatcacc cgttcggcta gcgcgacatc cgccggctca 2649781 ccgcagaggg tcaccgcgtt gccgcgcacg tgcaggtcgg cactcagcgt gcgttcgagg 2649841 gcacgcagat tttcgtcggc cgaaccgagt aagcccacga cgaggtcagg cggaacgtcg 2649901 atgctgctgc gaacttgagc gtcggcttgc cgggctccag ccgcgtcagc agcgcgggtc 2649961 tcgcgggacg tcacctggct tctgatgcct gctttctggc ctatcgactg gaacctgtcg 2650021 aactgacgag tgttgaagtt tcattctaac gccggtcagg gacggcgtcg gagcacaacg 2650081 cacaacgccg agcccgtgcg cgctcacctt tatccgcgat gaggcctgtc tgtgtccgcc 2650141 cgttcgatgc cgacgaacgg cagccactct cgggcctgcc agctgtgcct gccggtgcgc 2650201 ggcaacatcc cgaccgtgcc catgccggtc cgccaagccg acgatcaccg ctcaagctgg 2650261 gccagccgtg agcgtcggcg ccccaatgat tcgggtggcg ggctagtaat cccttcgacg 2650321 ggggtttcca cggggtcgct ggtctgactg ccgcgccatt ggagggcgct gatggccacg 2650381 gcgacctgga tgatcgtgtg gtcgaggggt cggggtagga gtcgttgggc ggtttcgagt 2650441 cggcgtagca gggtgttgcg gtgggtgtgt agtacgtgcg cggcgcggga ggcgttgcat 2650501 tgttcgttga tgtaggtcaa tacggtggtg agcagttgag ggctggccga ttcgaggtcc 2650561 ccgagggtgc tggtgatgaa atcggctgcg ctgtccgggt tttcggtgag taccgcgatc 2650621 atgtggatgt cggcgaagaa agccaggcgc tgttgggatc gaagccgggc cagcatgcgt 2650681 tgggtggcca gggcgtcgcg gtggctgcgc cgaaacccgt cgattcctcg tgcggtggtc 2650741 ccgaccgcga tgcgggcatg tggtgcgtgg tcgagcacct ggtggattcg gtcggtgtcg 2650801 agggttgccg cgtcgctgac ccatacccag cgggtggccg cgctggcgac cgcgatcagg 2650861 ggctgtgggc atcccagtgc gcggccgaac gcgcgtgcgg tgtggtcgag gtggttttgg 2650921 ttgtcgtcgg gatcgtcata ccagatgatg gcggcggtgt gggatcggtc tagggggtag 2650981 cccagtttgg cttcggcgct ttggcggctg atgggggcgc cgtcgaggat cagttcgacg 2651041 atgcggcggt gttcggcgtg gacgtcgcgg gtcagttcgt cgtattcgag ctgcatttgt 2651101 gcggccaggc cggccagggt ggcgtcgatg aattcggagg ccgagcgaaa cggcagggtg 2651161 agcagttcgt gcagttcttg ggggtcggtg gtgagtccga acgcgatttc ggtccatcgt 2651221 tgccaggcga cgttttgtcc gacgcggtag acgtccagcg ctgaggcgtc tagtccgcgg 2651281 cgcaccaggt cgcgggccat gcgtagcggg tcggggccga ggtttgccgg tacgggttgg 2651341 ccgggtttgc gcaggttggc ggtggcgaag tggatcaggt gggagcggtt ggcgcggctg 2651401 accactgttg ctagggcggg gtcggcggcg atggatgggt gggcggcgag ggtggcgcgg 2651461 tcgagttcgt cgagccattc cggggtgggg tgcagggcga cttttgctgc ttggcggatg 2651521 agttcacgtc cgcgcggtgt gggtttgggc aataccacga gatgagacta gttgcctagg 2651581 tgcgttgtgc accacgttct ggggaatgtt ggtgaggttt actccttcag ccgtggtgga 2651641 cgtttagccg gtgtggcgcg ttcgggatta ttgggatgaa cggttaccca ccgcggcggc 2651701 agcgggccgt gcgcctgccg agtcgtcgac atttagcgtt caggaggtct cgatgtcgtt 2651761 ggtcagcgtg gccccggagt tggtggtgac ggcggtaccg gatgtggcgc gcatcgggtc 2651821 gtcgatcggt gcgcccgaca ccgcggcggc ggcgagaccg accaccagcg tgctggccgc 2651881 cggcgccgat gaggtgtcgg cggacgtcgt ggcgctcttt ggctgggtcg cccgttgatg 2651941 gtgatggggc cgctggggcg cccgagaccg ggcaacggcg gggccggcgg ctcgggtgcg 2652001 cccggccaag ccggcgagtg ggattctgac gaccggctac cggcgtgtca cgtcgcagta 2652061 ttcacagtcg ctcgctgatg catcccaacg agatgtgagc acaccgacag cacccaatgc 2652121 caccgcggcc gcggtcgatg tccgcagcac cgtcgggccc agccggaccg cgacggcgcc 2652181 ggcatcggtc agcgcggcaa gctcgtccgg tgcgatccca ccctcgggtc caaccacgag 2652241 catcaacgaa ccagcttgcg ccgcagcgat atccacaatc cgctcggtcg cctcctcgtg 2652301 caggaccagc accgccgcgc cggcggccac ctcttctcgg acacgctgta caagcattgg 2652361 cgtcgacaac acgccgtcga ccggcgggat gcgcgcccga cgagattgcc gggccgccga 2652421 gcggaccacc gctcgccacc gacgcaaacc cttgtcgaca cgcgccccgt cccagttcgc 2652481 cacgcagcgc gccgcctgcc atgccaggaa cgcgtcggct ccggcttcgg tggccagctc 2652541 gattgccaat tcggagcgtt cggatttggg cagcgcctgc accacggtca ccggtggccg 2652601 cacgggcggg acgctccagc gcctaagcac ccgggcccgc agcccgccac gtccggcctg 2652661 ctccaccaca cagcgggcca ggcgaccgac accgtcacca agcaccaact gctcgccggg 2652721 acggatccgc cgcacggtgg cggcgtgaaa tccttcgtcg ccgtctacga ccgccaccgc 2652781 accggtgtcg ggcagtgtgt cgacgtaaaa cagcatcgcc accatgtgcg ggccgtgatt 2652841 agcgcccggt gaaggtctcg cgcaaccggc tgaacagtcc gccggcggcg gcgtgggtcg 2652901 aacggacctc ggccacctcg cggtcgcggc gacccttcag ctcgcgcagc agttcgatgt 2652961 cctggtgatc cagccgggtc gggaccacca cctccacgtg aacgtgcagg tcgccacgcg 2653021 tgttggaacg caggtgcggc attcctcgac cgcgcagcgt gatcaccgaa cctggctgcg 2653081 tgccgggtgg aatggtgatc tcgctcaggc cgtccaggat ggcgtccacc gtgaccgtaa 2653141 cacccagcgc cgcgtcgacc atgggcaccg aaaccgtgca atgcagatgg tcaccttcgc 2653201 ggacaaagac gtcgtgcgcc tgctcatgga cctcgacgta gaggtcaccc gccggccctc 2653261 ccccgggccc gacctcgccc tgagcggcga gccgaactcg catcccgtcg ccgacaccgg 2653321 ccgggatctt gacgctgatc tcccgacggg cccggatccg gccatcgccc atgcattgct 2653381 ggcacgggtc ggggataacc accccgacgc cgcggcaggt gggacacggc cgcgacgtca 2653441 acatctgacc caacagcgat cgctgcacgg tctgcacctc cccgcggcca ccgcaggtgt 2653501 cgcagggtat cggaaccgaa tcgccgttgg tgcccttgcc ctggcaccgg tcgcacaaca 2653561 ccgcggtatc gacggtgacc tgcttggtga cacctgttgc gcactcttcg agatccagcc 2653621 gcattcgtag cagcgagtcc gaacccggcc ggacccggcc gatcggccct cgggacgccg 2653681 cgcccccacc gaaacccccg ccaaagaacg cctcgaacac gtcgccgagg ccgccgaagc 2653741 caccgaaccc attgccgccc gcagcggcgc tctccagcgg atccccgccc aggtcgacga 2653801 tgcgacgttt gtccgggtca ctgagcacct cgtaggcgac gctgatttct ttgaatttcg 2653861 cctgcgcagc ctcgtccggg ttgacgtcgg gatgcagctc gcgcgccagc ttgcggtagg 2653921 cgcgtttgat gtccgcgtcg ctggcgttct tgctcacgcc gagcagcccg taataatcgc 2653981 gtgccacgct tgattctcct atgccgcgtc tttatgccgc ttctcaagcg gctatccaca 2654041 aaccctgcag caggtgcgcg ttcatcgagc acccaggacg tcgccgatat aaagagcaac 2654101 cgcagccacg ctggcgatag ttcccggata gtccatccgg gtggggccca ccacacccat 2654161 accgccgtag acggtatggg cggtaccgta ggccgtcgac accatcgagg tgcccaccat 2654221 ctgctcagac gccgtctcat gacctatgcg aaccgtcacc ttgccggctt cctgctgagc 2654281 cgccagcagc cgcaacacca ccacctgctc ctcaagtgct tccaatattg accgcagtga 2654341 accaccgaag tccgcagcgt tgcgggttag gttggcggta ccgcccagca aaaggcgttc 2654401 ctcggtgtgc tccactagcg actccagcaa tacggtcgcc gcgcggccca cggcgtcgcc 2654461 caatccgccg gcgccgccca gctggctggc gaggtcggcg accgccaccg aagccgctga 2654521 aagcttcttg ccttccagcg cctggccgag tatttcacgc agctgggcta gctggtgatc 2654581 gtcgatgaca tcgccgagtt cgacgatgcg ctgatcaacc cggccggagt cggtgatgac 2654641 caccatcagc agccgggccg gtgtcagcgc gatcacctcc aagtggcgaa cggtcgacgt 2654701 tgacaacgtc gggtactgca cgacggccac ctggcgggtc agctgcgcca gcaatcgcac 2654761 ggcacggcgc agcacgtcgt cgagatcgac accggattca aggaagctct ggatcgcccg 2654821 gcgctcggcc gacgataggg gtttgacgtc ctcgagccgg tcgacgaact cgcggtagcc 2654881 cttctccgtg ggcacgcgtc cggaactggt gtgtggctga gtgatatagc cttcggcttc 2654941 cagcaccgcc atgtcattgc ggactgtggc cgacgagact cccaggttat ggcgttccac 2655001 cagggatttg gagccgatcg gttcctgggt tgcaacgaag tcggcgacga tggcacgcag 2655061 cacctcaaag cgacgctcgt cggcgcttcc catcgactgc tcacctcact tcttacgctg 2655121 cctgaccggc ttcattttac gttctcggcg gccactgacc gtcatctagc aggcgtctgc 2655181 cgatggtcag ggggcgtgtg ccccgactaa cgtgtccagc atgatctcgt cgggcttcag 2655241 ccactcggta gggaagatcc gccaatgatc ttcaaagggg tgcgggaagg caagccgtat 2655301 cccgagcatg gactgtccta tagggactgg tctcagatac cgccgcaaca gatccggctc 2655361 gacgagttgg tcaccacgac tacggtgctc gcgctggacc gcctgctgtc agaggactcc 2655421 acgttttacg gtgacctttt cccccacgcg gtgaagtggc gaggcaccac ctatctcgag 2655481 gacggcttgc accgggcggt gcgtgcggcc ctgcgcaacc gcaccgtgct acacgcgcga 2655541 gtgttcgaca tggacgcgtc accaggcggg cggcgtagct gaacagcggg ctgaagccgg 2655601 cccgccaatc agttccctgc ggcctgcagc aactccatcg ccgatgcgcg tgacagcatc 2655661 cagccgcctt gattcacgaa cgtgacgttc tgcgtgaccg gcgacgagag cttcggaccc 2655721 gagacggaaa cgtcggcggt ggccgaaccg gcggccgccg gctggatgtt cgtcacgctg 2655781 aacgacagcg gcagatcccc gtgctcggcg gccttcttca gcttgtggtc ggcgatgcgc 2655841 gcctcggtgc ccccgatgcc gccctcgacc agactgccct tgttcgcaaa cgacacgttg 2655901 ggatcggcga ggctgttgag caggctggtc aactgggcgg cggtcgggac gtcaggggcg 2655961 gatgccgggt ccaacggcag tggcgcgccg aagacgaccg gctgcatctg gtatacgacc 2656021 gggccgccag ccatgatcga agtcacaccg gccgcagcgg cgccgattgc agccgcggcg 2656081 gtcagacctg cggcgatcga tttcaccatc ttcatggttg tgttcttccg ttcgtttgcc 2656141 cgtcgattgt gcgtttggtt caaactaccg gtgcacgcgc cgggcaagtc tgtgtcgtag 2656201 ctgtgagcga gcggtcagtc ctcgaccatg gcgtcacgca ggctcttcgg ccgcagatcg 2656261 gtccagttct tttccacgta gtccaggcag gcggcacggc tggcttcgcc gtgcaccacg 2656321 cgccagccgg ccgggatatc ggcgaacacc ggccacaggc tgtgctggtc ttcgtcgttg 2656381 accagcacga agaatgcgcc gttgtcgtca tcgaaaggat tggtgctcac cgcttctcct 2656441 tgtgtcgttg tgtttggtcg gcgtaaagat gccggcgccg agcacccggt ccgacaacaa 2656501 gccgaggcag ctcaggttgg ggaacccggg cccctgggtg agtccggaca gggtgggcag 2656561 gaacagcttg ggcgtgacgt cggtgactgc caagtcgtag ccgatcgctt cctgcaggcg 2656621 gtcggcggtc agcggtccac ccagtcccag ctcgagcagg tcgagggtgt gctgactgaa 2656681 cagtgaggtg aaccacagcg gatcggcgcc cgagccgtcg atgacgagat cgaatccgtg 2656741 cacggtctcg aagttctcgc tgccccggtt ggtgctcagc gtcaaccgga tctgcccctg 2656801 acggcccacc gcgtgggcga cccggccacg cagatgatgg atgcggtcat cggccagcag 2656861 cgcttcctgc acggtcgccg agaacactcc tcggtcggtg cgggccagcg cgtcgcgccg 2656921 ttcgtcgaac gtcaaggccg cccagtcggt cggatcggaa aacagtgagt tctcgaagaa 2656981 tccctcgccg cgggtgaaca gggttacctg cggggagatg acggtgatgg ttgagacccg 2657041 atgccggaac agctcgttga gcatcgatgc ggccgtctct ccgccaccga tcaccgcgac 2657101 ccgctcggcg ttgatccggt cgtggccggc ggcacggtcc cagaactgtg cgattgagag 2657161 cacgcgcggg tttccgggca gtagcgactt ttcagcctgg ccgggcccgg tgatcatcaa 2657221 cgcgtcggcc tgcacggtgg tctcgtgggt gcacaacgcc cagcggtcac cggtgacggc 2657281 gagccgttcg acctcgccgt ggatcacctt gaggccaatg tgatcggcca cccaggctag 2657341 gtactgactc cacctgcgat gggtgggcgc cgggcggccc cggtcgatcc attccgcgaa 2657401 cgacgcggtg gcgatcagat acgactgcca gctgtagcgg gtcatccgct cgtccaattc 2657461 tgcgttgcgc cgtggcacca gcgccgaccg gtagggaaaa ccgacatcct tttctgggct 2657521 ggtgcccagc cggtgggctc cgtcggtcca gccaccgctg gcctgccagt tggccccgac 2657581 cccgatgcgt tcgacggcga tcacgtcggg cacgtcgacc cccatgtcac gcagcacgga 2657641 tgccttggcc gcgaccgcca ccgccttggc tccagcgccc aggaccgcga gcgtcggatt 2657701 catgctgtta tctccgccag cgcgccctgc cacagtgatt gcagcgtggc gacgtcgtcg 2657761 gcggacagga tgtcgggcag cgtgcgccac cgcgtggcta gcacgggagc gtcggcgggc 2657821 ccgaggagcg ccgccagcac cgtcagttcg tggcgcaccg gctgttcggg ttcaggcagt 2657881 tgccccacat cagccagtag tgcgcggtcg accgccagat ctcccacccc gacgtgcagg 2657941 ctacccagat agttcagcag cagctggggt tcgcggtggg cgcgtagtcg ctccgcggta 2658001 tcggcgcgca ggtaccgcag caggccgtaa tcgatgccgc tgccgggtat ccgcgcgaag 2658061 tcggtcgcgc cgtcgcagtg gatgcgcagc ggatagatcg cgctgagcag cccgaccgtg 2658121 tcgctggtgt cggcagtctt atcgacgtgg acgtccgcgc ggccatgcgt ctccaacgcc 2658181 aacagcggtg ctggtgtttg ttgaccgcgt tgccggcgcc aggcggtcac catccgcgca 2658241 gcggcggtag ccagcagatc ggtcatcgac cgtcccgtcg aaagcagccg cgcggtcaga 2658301 tcggcgtcgg agatcgacat ggtgatcgct agctcaccaa cccggtcggt ctgcggcgcc 2658361 accctgcggg cacccaacgg cggatcggcg ccctcgagtt cggcgaccca gaaatcaacg 2658421 ctatccagcg ccttagcccg ctgcgccagc agccgcgacc actgccggta gctggtgttc 2658481 tcgcgcgctg ggctgggcgc gcgcccggcc gccagcgcgt gcaggccggc gtcgagttca 2658541 cccagcacaa tccgccagga ggctgggtcc atcgccagca catgggcggt cagcaccaga 2658601 acaccgggcc cgtcgggttc gcgcagccac accgccgaga gcagtcggcc ggcctggggg 2658661 tcgagactcg ccagcgcgcc aagagtctgc tcggccaccg cggtgaccag ttcaccgctg 2658721 acccaaacct cgctgagaat gtccgttttc ggttgtgcga caagggccat cgcatcccgg 2658781 tcgaaccggc accgcaacac ctcgtgtccg tcgacgaccg cggccaacac ggcatccagg 2658841 cgttcgcggg tgatccggtc gggcaacctg atgacctcgg tttgtgccag ccggcgcggg 2658901 tcgccgtact cgtagagcca atgagtgttg ggtagcaccg ggatcggctc gccggcatcg 2658961 ttggccggtg cctgccatgc ggcatcggag tcaatggccg ccgcgagttc acggatggtg 2659021 tcgcactcca ccatcagcct ggcccgcaac gcaatcccac gacggcgcgc ggcctgcacc 2659081 accgacagcg ccacgatgct gtctagaccc atctgcaaaa agcccgcggt gacatcgacg 2659141 ttcgaggttt ccatgacatc ggcgaacgcc tcggccagca ccagctcggt cggtgtctgc 2659201 ggcggagttg ccggtccttc ggtgacattg attgccgcca aagcgttttc gtcgatcttg 2659261 ccgtgtggag tcagcggtaa ctcgtcgagg acgacgatat ggtgcgggac tagataacgc 2659321 ggcaaccgct ctagcagcat cgcccgcaat tcggccaccg gtggcggttg tggtccgcct 2659381 gccacatacg ccgtcagccg ggggccactg gcatggccgc gggccgtcac atggcaaccg 2659441 tgcaccgcat ggtggccgtt gagcaccgcg gcaatctcac ccggctcgac gcggaaaccg 2659501 cggatcttca cctggtcatc gctgcgcccg aggaactcca gtccaccgtc gggcaggcgg 2659561 cgcaccacat ctccggtgcg gtacattcgg ctaccgcgcc cgtttggctc agcgacaaag 2659621 cgcgccgcag tctcggccgg gcggccgagg taaccgcggg tcaactgggc gcccgccaga 2659681 tacagctcgc cggcgacgcc atcgggcacc ggccgcagcc aggagtccat gacgtaggcg 2659741 cgggtggtgc aggtcggacg tccgatgacc ggtcgcgcat gctcagcaac ggcggcgacc 2659801 acggcttcga ccgtggtctc ggtaggcccg tagcagttga aggccgtcat ggccgtgcgc 2659861 gcgcagttct gctggatcat ccgccacgtc gcggcgccca aggcttcgcc gccgagcgca 2659921 agcaccgcca acggcgcccg gtcgagcagt ccagcgttgt gcagctgggc gaacatcgac 2659981 ggcgtggtgt caatcatgtc cagaccgaat cggtcgatcg cttcgaccag cgcccctgcg 2660041 tcccgctgac gatggtcgtc gacaatgtgc accgcgtggc cgtcaagcag tgcgaccaac 2660101 ggctgccacg ccgcgtcgaa ggtgaacgac caggcatgcg cgattcgcag cgggcgcccg 2660161 agccgctggg ccgccggccg caacacgcgc tcgatgtggt cgtcggcgta ggccgacagc 2660221 gcccgatggg tgccgatgac acctttcggg gtaccggtgg tgccggaggt gaaaatcacg 2660281 taggccgcct ggtccaccgg caccgtgatg gcacggtcct cctcgagtat gtcagcgcca 2660341 accgaagcgg cgaacacgcc ctcatcgatg accaccggag ccgatgtctg gcgcaagatc 2660401 tcggcgacac gctcaccggg catcgccggg tccagcggca cgatcatgcc acccgccttg 2660461 aggaccgcca gcatggcggc cacgtagcgc ggaccacggg acagcgcgac ggccaccggg 2660521 gtctcgcgac tcacgtccgc gcggcgcagc ccagtggcca gccggtcggc caatgcatcc 2660581 agctcccggt acgtcagctg accatccgcc caactgaccg ccaccgagtc aggctgtgcc 2660641 gcagcgattt cggcgaaccg ggtatgcacc gcgggtgccg acgtcgtcac atccggcagg 2660701 ccgggtgcgg tcggatcgtg ctcgccgtcc agcagaatgt cgacgtcgcg cagcggccga 2660761 tcccaccggc tgaccaagcg ctgtaacaca gccagcaccc gcctgccgag gctttcgggc 2660821 gccatcgtgc ccagcgcacc gtcgagcacc tccactagca gcgtgagctc accggtgctg 2660881 cggtgcgcgg cgacggtcac cggaaagtgc gacaaactct ctagcgccac cggacggaac 2660941 gtcaccccgt ttgcgacgaa ctccgcggtg cccaccacct cgccgggcgg gaagttctca 2661001 tacaccagta gggtgtcgaa catctcaccg ataccggcga tggcacgaaa ctcgttgaaa 2661061 ccgagatagc tgtggtcgcg caacatggcg aattgacgtt gtaggacagc gcattgcccg 2661121 ccgacggtag cgcgggcgtc caggcggacc cgcagtggca ccgtattgat gaacaggccg 2661181 atcatcgttt ccacgccgga cagttcgctg ggcctgccgg acaccgtcac accgaacgtc 2661241 acatcgccac gaccggtgaa tgctgaaagc gtggtagccc aagccatttg aacaagtgtg 2661301 ctgatcgtga cgccacgggt gcgggcggca tcggccagct ccgcggtggc ttcacggtca 2661361 aggcgcactt cggtgcgtcc cggaataccc ggctgcacag gagtgtcggc gagtgccggc 2661421 gataacagag tcgggccgtc caggccattg aggtggtccg cccacattgc gcggctagcc 2661481 gtctgatcgc ggccggccag ccagccgatg tagtcgcgat acggccgcgg cgctgccggc 2661541 aacgcggcga cgtgaccacc agcccgatac aaggcgagca gctcggagac gaacagcggc 2661601 aacgaccatc cgtcgatgac gatgtggtgc gcgacgatga ccagatgcca acattcgtcc 2661661 ggtagttcga tgagcaggaa ccggatgagt ggtccgcggc cgacgtcgaa gcggcgccgg 2661721 cgctcttcgg ctgccagcgc cccgacctca ctggggtggg cgcgcacgtg acgccaaagc 2661781 acctcggcac tggatggtat tacctgcacg ggccggctca ggttcccgtg taggaagctc 2661841 gcccgcaggt tggggtgccg ggtcagcatc gcggcagcgc agtcgcgaag caaggcgatg 2661901 tcgagcgggc cggccgcgtc ggccgccatc gcgatcacat acgggtcggc ctctgcggcc 2661961 tcagagccgg actccgcggc gaccagtgtc gccctagaaa acagtccctg ttgcaatggg 2662021 ctgagcgcca tcacatcgtc gatggcgccc cgcgcgtcgg ctcgcgtcac ggccactggt 2662081 cccatgacgc ggtcagggcc gacagttcgt ctggggaaag ccctgatgtg ctcatcggcg 2662141 cgtgatgctt gtcgtccggc tcggcctcga cgtgaggctt ggcgtcgacc gcggcggcga 2662201 gctcacacaa aacgggatgc tcgaaaacca tccgcgcggt cagcggtatc ccgccatctc 2662261 gagcccgggc agccacctga gtcgcgagga tgctgtcgcc gccgaggttg aagaagtcgt 2662321 cgtagcgtcc gacctccccc acctcgagca cgtcggcgag gatggcagcc agcgcgcgct 2662381 cggtttcggt gtcggcgggc tcggccggca ccggtgccgc ctgcgccgtt ggcagccgtt 2662441 cgatttcggc cagcagctcc agttggccgt cggccttcca cactccgcgc tcgccgttgc 2662501 ggtagagccg agaacccggt tgcgcggcga acggatcggc aacgaatcgg gtcgcggtct 2662561 ccgatggccg ggccaaccgg gcaccgacgg cgggaccgcc accgtaataa acgtcaccca 2662621 ccacgcctac cggaacgggc ttaagtgcgt cgtcaagcag gtacacccgg gccgttccgg 2662681 cgcccgcatt ggaccggtcc aaaatgcgcc gtctggcttg cgcgctgacc atctcgacct 2662741 cgcgcagcgg ttggtccgga cggtcggcga acgcctcgac gacacggact agccagtcgg 2662801 cgaagcgttg tgcggtggcg cgctcataca actcggtgcg gtagatgacg tggccgcggt 2662861 actcgtcgcc gcaggcgaag aagttgaccg atagatcggc ttgcgcggca tcgaatgtcg 2662921 gctccagcac gcgcaacgtg gtgtcaccgt cgggcccggt gtcgatgacg tggtcttgcg 2662981 gcatttgttc gcgaacgtgc acaacaatgt cgaacaacgg attgcgggac agcgaccgct 2663041 gggggttgac cgcctccacc acctggtcga acggcaggtc ctgatgtgca tacgctgcca 2663101 gcgccatctg cctggtgcgc tgcagcacct cgcgcagcgt ggggttcccg cgcaggtcgt 2663161 tgcgcaacac cacgatgttg atgaagaacc cgatgagctg gtccaggttg gcctcgctgc 2663221 gaccggccac cggggcgccg atggggacgt ctaccccgcc gccggccttg tgtaacacca 2663281 ccgcgacggc ggcctgtagc agcatgaact cggtgacacc gaggtctcgg ctcacggcag 2663341 ccaatttgtc gcggatcgcg gcgccgagac gaaattcgac cgcgtcaccg gcaccgctga 2663401 gcagggccgg gcgcgggaag tccgggcgca gaccggtttc gcctgccagg ccccccagct 2663461 ggcggatcca gtagtcgcgt tgcggaccga cgatgcccgc accgtcgtcg agtagcgccg 2663521 actgccacac gctgtagtcg gcgtactgca ccggcagcgg tgcccacgac ggccgttgtc 2663581 cggtgctgcg ggcccggtat gcggtcagca gatcggtgaa caacacccca gccgaccagt 2663641 ggtcgccggc gatgtgatgc accaccagcg acaacacggt ctgctccggc gtgctcagca 2663701 gcgccgcccg gatcggccag tcggtttcca ggtcgaaaac gtaacctcgc tcgttgttca 2663761 gttcggctcg cagccacgcg gcgtcggacc cggcggcgca ccgcaccggc acctcggcgg 2663821 gcggctggat gatctggtgt ggcacgccgc cgatctcgcg gtagacggtg cgcaggatct 2663881 cgtggcgtgc caccacatcg gtgatggccg ccgcgaacgc gttggtgtcg cagggcccat 2663941 gcaatgccgc ggcgaaggga atgttgttga cggcgttggg cccgtcgaag cgatagttga 2664001 accagctacg catttgagac gacgacaatc gcactggccc gtcatgatcc acccgggtca 2664061 gccgcggcct cgccgaatcc gaatccaacg tatcgatgtg tccggccaac gcggtcaccg 2664121 tggcgaattc gaagatctcc cgcacaccga catcgacgcc gaacgcgttg cgcacggccg 2664181 caacgagttt ggttgccagc agcgagtgac cgccgaggtc gaagaacgag tcgtcagcac 2664241 ccactcggtc gcggccgagc agctcaccga acagttgggc aaggcgccgc tcggtggcgg 2664301 tctgcggcgc gcggaactcg gtgtccgacg cgatctgcgg ttccggcagc gcggcgcggt 2664361 cgattttgcc atgcgcggtg atcggaatct catccagcac aacataggcc gcgggcagca 2664421 tatattcagg cagtgccgcg gccacccggg cgcggatgcg gtcgagatcg acgccgacat 2664481 cggcgggtcc gtcgccgccc gcggcgggtg tcacgtagcc caccagactc ttgcccagcc 2664541 gcggcaggtc gctaaccacc acaacggcct gcccgaccgt agggtcgacc gcgatggccg 2664601 ctgctacgtc accgagttcg attcggaatc cgcgaatctt gacctgctcg tcggcacggc 2664661 ccacgaactc gatgtcaccg tcagcattgc ggcgcgccag atccccggac cggtacatgc 2664721 gggaaccggg attaaacggg tcggcaacga atcgctccgc ggtcagcccg gcgcggcgat 2664781 ggtatccgta tgcgacatgc gtccctccaa tatagatctc gccgatcaca ccggtcggca 2664841 ccggctgcaa cgaatcgtcg agcaggtgca tggtggtgtt gatcttgggc cggccgatgg 2664901 gcacgatgcg ggtgccctgt gggcccacca ctttaaaccg gctggcgttg atcacggttt 2664961 cggttggacc gtagaagttg tgcagcagcg catcgaatgt cgcgtggaac ttgtcggcca 2665021 cctcaccggg tagcggctcc ccgccgatgg gtacccgctg caacgtccgc cactggctca 2665081 cacccggcag cgacaggaac agcccgagta gggacggcac gaaatgcatt gccgtgatgc 2665141 cctcgtcgcg caacagggcg gtgagatatc caatgtcggt gagtcccccg gggcgtggta 2665201 tcaccatccg cgcgccacag gccagcgtgc cgaagatctc ggcgatcgag acgtcgaagc 2665261 tgggtgaggc gacctgcagt agccggtcgg tgtcgtcgac gtcgtattcg cccttgaacc 2665321 agacgaagta ctcggcgacg gggcggtgtg gcaccgcgac acctttgggc aatccggtgg 2665381 taccggacgt gtagatgaga taggccgtgt tgtctggccg tagcggccgg attcgatcgg 2665441 cgtcggtggg gtcgtcgctg cggtatccgg ccagctcacg tactggcgtg cgcagcacca 2665501 gtttcgcgtc gcagtcggcg aggatgaaat ccagccggtc ttgcgggtag ctgggatcca 2665561 cgggcacata caccgccccg gacttgacca cccccaaggc cgtgacgatc aggtccggcg 2665621 atttgtcaag aagtaccgcg acccggtctt cgctgcctat cccctgctcg atcagccagt 2665681 gccccaaccg gttcgacgcc tcattgaggt cgtggtaggt gaagtgttgg ccctcataca 2665741 ccacggcggt ggcgtcggga gtccgcgtgg tctgctcgtt caccaggtcc acgagggttt 2665801 tgacaggggt atcgaaccgc tcgccgcgcg acacctcgcg cagcctggcg gcgtcgcgct 2665861 catccatcag cgccagcccc gacaacgtgt tgtcgggggc ggccagcgca ttgtcgagca 2665921 gcacaccgaa gtgtcgaagc atctgcttgg ccagggcggg ttccaggatc tccaccaggt 2665981 gttcggcctc gaccagcaca cccgcgcggt cgaattcgac catgaagccc aacggcagct 2666041 gcgtgatgtt gctgcgcagg tcgtagcgct cgcactcgat gcctggcggg ttgaatccgc 2666101 cgccgtcggg ctcccggaaa ccgaagctga cccgggtcat gcgctcggca ccgtgccggc 2666161 gatcggggtt cagttccctt accacgcggt cgaggttgat ccgttggtgt gcgaacgccc 2666221 cgctggcgat gtcgcgggtg gcggtcagca actcccggaa actcatcgcc gattgcggtc 2666281 gcagccgcat cgctaccgtg ttgccgaaat agccgatggc atcttcggtt ccggcgccac 2666341 ggttgagcac cggagccgcc acgaggaagt cgtcactgtg ggtgtagcga tgcaccaggg 2666401 caccgaacgc ggccagcagc accatgtagg gagtgcaacc ggtgttcttc gccatcgtgg 2666461 ccacccgcgc agcggtgtcg gcgggcagcc gcaacgtggc gcgcgcggca cgccaactgg 2666521 tcggcacaca cgttccggct gggccgggaa gttccagcgg ctctggcgga tcggccatga 2666581 tcgcgcgcca atagttgagg tcggcctcgg tagtgtcggg tccggatgcg gccgacggac 2666641 ggtgttctgg ccccagatcg gcccctaggt cagctcgcga gtacgcctgg gtgagatcgg 2666701 tgaagaacac ccgccacgaa ccatcatccc aggcgatgtg gtgggccacc aacagcagca 2666761 cgtgttcgtc ggcagccgtg cgcaccaccg tgattcgcaa tggcgcgtcg cgggaaagct 2666821 cgaagggagc gcagaattcg cgctgagcca acacctccag gcgcagccgc tgggcgcgtt 2666881 gggacaggtc cgtcaggtcg tattgtgtcc agccggggcg aagatccgcg tgcacggtcg 2666941 gctgggcgac tccgtcgtcg ccgacagggt aggtggtacg cagtatccga tggcgacggg 2667001 cgacggcgtt gactgcgtcg cgcaacctgg ccagatcgat gtcaccggtg atgcggtagg 2667061 acacacagat gttgagtaac gcaccgctgg ggtcggccat ctgcacgaac cacatccggg 2667121 cctggccgtc ggagagccga tcgtcagtgt gcgggccaat gtcctgcgca gccgaggaca 2667181 ggccgcggtc ggcgagcctg cgacgcagca gctccaatcg ggcctcgtcg aggcgggcgc 2667241 cgatgtcggc ggtattagtc acgcgaaatg tccactttct gtgcggtgtg tgagcgctcg 2667301 tcggcatctt cgagtttcgc gacaagtcca tcaccggtga tgtcgcccat gagcgtggcc 2667361 agcgacaccg tcgcgccgat tgatcgtttg agtcggttac gcaagtccag tgccagcatg 2667421 gaatcgacac cgagatcgaa cagcgattcc tgcaggttca cctcgccggc ctgcgggatc 2667481 ccgagcacgg ccgccaattg ggtgcgcacc gcgtccacga tcgtcaggtt ggggtcggtt 2667541 ggaccctcgt accgttcgaa ttgcctgctg tccaacaaca tctgcaaccg ggccgcgtcg 2667601 gcggcgaaca ctagcgggtc gacagtgaat tcgtgcaggc tcgcctcgat cgcctgctgg 2667661 ggcgccatct ggcggagtcc agaccgctcg acgcgggcga tcgtaaccgc atccgcgatt 2667721 ccccgagctg gttcgccggc cttgggggcc tgccataggc cccatttcac cgccacgcag 2667781 tgcctgccct gggcgcgcag ctgggcggcc atcacgtcga gcagccggtt ggccgccgag 2667841 tacgcgacca ccccgtgtcc accccacacc cccatcaccg aggaacacag cagggttcgc 2667901 acatccgggc gcagcggcca cagctcgatc atctgggcca ggccgagcac cttggccgcg 2667961 aagttgtcaa cgacggcggc cgacgtcacc cccggtgcgg taccagagat cacgctgcct 2668021 gccgcgtgca cgatcaacga ggcgccgacg ccaccgtatt cggctgcaat cgctgacaac 2668081 tgggtgggat cggtgatatc gcacggcggc gacacgatca cggtgccatg ttgctttctg 2668141 agcatggcca ccgtcgcctg atccgcggcg cgccggctga gcagcacgat gcgccgtgcg 2668201 ccatgctcgg cgagataccg cgcgtagtgc atcccgatgg cacccgcgcc accggtgacg 2668261 acgacatcgt cgagcacgcc ggagtccaac gaccagttcg ggacggccgg ggcatcggcg 2668321 agggttcgct cgaacagcgt gtacccgttt accgagccgc gtagcgcggt ctcaccgaag 2668381 ccccgcagta ccgccgttat gaccgagacg ccgaggaccg ggtccaagtc ccacgacggc 2668441 aagtccaggt ggctgaaagt ctgttcggga tgctcgaatc cgatgcttcg atgcatcgcg 2668501 gccagcgcgg cctggccggc cgacggcacc gcgtccgctg cgtcgacctg ctcggcgccg 2668561 acggtgacca gacataccga ttggcaacgg gcaccgatat gcatcggata gtccagcaaa 2668621 ccggccccga cgaggtcggc gagtgcaccg gcggcccgga cggcgtcggt gtgttcgaag 2668681 tcgggcgcga tcaccaggat caactcggcg tcccgcgcag cactcagctc ggtatcgggg 2668741 tgcgaatcaa ttgctgcgca cagtgtttga gccagcgcgc ggtgagcacc gagatcgagc 2668801 actgcgaggt gacggtgccg cccagcgacc ggtgtcgacg gcaccatccg ttcccaccgc 2668861 tcaaccgcaa tggtcagtcc ggacaccggc ggcagcggtt cggggtgcgc ccacatcggc 2668921 accgcacgca tcggcgcgtt cgggaacccg gacagatcga cgtcgccgtc gagtgggtca 2668981 ccgcccaggt caccccacgg gtagccaggg tcagcgaccg ccgcgctaac aatattcgcc 2669041 gacaacgcat caacaaaccg ctcgccacga cgtgccgacc cgaccagcac agcgggaccg 2669101 tccggcaggt tggcggcgcc ctcacagttc tgaccgatcg caaacaacag cgcgggatgg 2669161 gccgatatct cgatgaacgc ccgtgctcca cagcggattg ccgattcgac agcgcggtcg 2669221 aaacgcaccg tatggcgcag gtttgcgtac cagtagtcgc cgaaagtggt gcctggcgcc 2669281 accacgtcgc cggtggttcc gccgatgaat tgcactggcg cttccataaa ttcggagtca 2669341 ggcagctgct cgcataattc atcgcggagc gattcgagca cgctggtatg caccgggaag 2669401 cccacggtga tcccgcgggc gaagtgaccg ctggaccgga ctgtgtcaac gatggccgct 2669461 accgcttggc gctcaccgga cacggcgacg gtcgaggagg cattgaccac agacagttcc 2669521 agccagccgc cggtggtcgc gatcagcgcg ctcgcgtcct gttcaccgat gcccagcgcc 2669581 gccaccgcat agcgaccagg caagcggccc accacgttgg cgcgggccgc caccacggcc 2669641 acagcatccg acaaggtgat acttcctgcg agataggccg ccgctacttc gccgaggcta 2669701 tgaccgactg ttagatcggg cagcacaccg caggaacgcc atacctccgc cagcgcaacg 2669761 gcatggacga actgcgcgcc ttcgatctcg atctcgcaga acgcttgccg ctcatcggtt 2669821 ccgggcgggg cgatcaggta tggcagcggc gagtcgacac cagcggccgc aaatgcggcg 2669881 gcgcacgtgt cggtcgcggt ccgataggtc ggcagctcgc ggtaggcgac ggcgcccatg 2669941 cccggccaat gaccaccctg gccgggaaag acgaacgcct ggcgcggggc cgagcccaac 2670001 gacgaccgcg cgatgagcgg atgctcgcgt ccggcggcca gcgcgcgcaa gccctcggcg 2670061 agttccagcc ggtcggcggc ccgaagcacc gcccgatgcc gacggacccg tcgggtcttg 2670121 cgcagctgcc gagccacttc ggtcacggtc gtagccggaa agcgctcgag gtagtcggcg 2670181 atggcccgag cgtccggccc gatcagttcc tcggcatggg cgctgagcaa aaccgcaacc 2670241 cgcccatcgg gcagctgttt gggggccatc acacctcccc acactcgggg ccacgctcgg 2670301 gcgcggaaac ggtgtccggc atcgaaacga tcacgtggct attggtaccg ctcatcccga 2670361 acgcggacac cgccgcggtg cgccatccgt caacggcccg ccacggcgtg agtttgtcgg 2670421 ccagccgcag accctgtttc tcccaatcga tttcgcggct gggctcgtcg acgtgcagtg 2670481 tcggcgggat cgcggcgtgc tgggcggcca gaatgacctt cacaaggccc agcccgcccg 2670541 ccgccgcctg agcatgcccg atgtttgact tgaccgatcc caacagcggc ccgcgtccgg 2670601 ccggggcggt gccgtagctg gctgccagtg accgcaattc ggtgcgatcg ccgagccggg 2670661 tcgcggtgcc gtgcccttcg accatcccga catcggcggg cacaactgct gcctgcgcga 2670721 tggcgcgccg gagcagtcgc gtttgcgcgt cgccgctggg cgcggtcagc ccgtcgctaa 2670781 gtccatcgga gttcaggcaa ctggcacgca cctcggcgag gacacgacgc cggtcagcgg 2670841 ttgcccgcga ccggcgctgc aggaggaaca tggcggcgcc ctctgcccag gcggttccgc 2670901 tggcgtgcgc gctgtagggc cggcagtggc cgtcgtcgga tagcgcgtgc tgcttggaga 2670961 actcgacgaa atagccgggc gtacccatca cgcacacgcc gccggcgagt gccaggtcgc 2671021 agtcgccggc ccggatagct tgaaccgcgg tgtgaaaggc cgccagcgcc gacgaacacg 2671081 aggtatcgac ggtcagcgcc ggcccggcca ggtcaagggt gtaggcgatg cgcccggaga 2671141 tgacacccag cgacgtcccg gtgatcagat ggccactgtg gtgggagaat tcggtcaaag 2671201 cgggaccgta ttcgagcgcc gaggcaccga cataacagcc cacatcgtga ccggccaggt 2671261 catcgggatt gatcccgctg ttctccaggg tgcgccatgc tactcgcagc cccacccgct 2671321 gctgcgggtc catcgccgtc gcctcgcgcg gtgagatgcg gaagaactca ggatcgaatg 2671381 tagttgcgct ggaaaggaat ccgccaaggt tgtggatcgg tttgaatccg tttcgacgcg 2671441 acccgtcgaa cagctcgcga agtgcccaac ctcgatcggt ggggaacggt ccgagtccct 2671501 cgcgctgttc ggagagcagt gtccagtagt cgtcggcggt ttcgacacca ccgggtgcct 2671561 cgatggccag cccgacgatg acgaccgggt cgttatcgga catcggcact caccatccgg 2671621 gccacggcgt cgaggtgatc gttgagatag aagtgaccac catcaaagtg cgacagcgtg 2671681 aagcgaccgg aggtgtgagt ctcccaactg gtcaacatct cccggctgat gcggtggtcg 2671741 cggttgccgc cgaccgcgtg gatgttggcg cggatgcgca cgtcgggtgg acatgaatag 2671801 ccgctgaggg cccgatagtc ggccttgacc gccggcacca gcagttcaac gaattcctcg 2671861 tcctcgagca gcacgggatc ggtgccgcca agatccacca tgtcggccag gacgtcacgg 2671921 tcggcggtcg gcaacggtcc ggacgcggcc accgtcgacg gagcctgacc ggaggaagcc 2671981 cacagtgcac gtaccggcac gccattgcgc tcggcgaggc gagcgaactc gaaggccact 2672041 atcgcaccca tgcaatggcc gaacagcgtc agcggagccg tcaggtgcca gtcgcccgcc 2672101 tcgaacagct cgagcgccag cgcctcgatg ctgtctgccg ccgggtggct gcgccggtca 2672161 gcccgctgcg ggtactgcac cacgaacgtg tcaacgtcgt tggccactaa cgattttgcc 2672221 aaccaccggt aagccgcggc agcgccgccg gcgtgtggaa acaccagcac cgcgccgggc 2672281 ttgtcagtac cggtgaaccg cttcacccac ggtttgaagg ctggctgtgc gggctgctcg 2672341 atcggatcga gcgccgccat cacgtcggca cttgtcatat tcgcgatttc taagtacacc 2672401 tcggcgacca gttcgagtcg gtcggcgttg gcttcccgac cggtgagcaa ctgggccaac 2672461 gcggcaatgg tcctggcggc aaacatgtcg gcgaccatca ggctcggcga atccagccac 2672521 cgccggatac cggcgacgac ctgggtcgca agcacggaat cgccgcccag ggcaaagaag 2672581 tcgtcgtgca cgcccacggc atcgttggca cggcccagga tgtccgcgac gatgcggcgc 2672641 agtgcccgct gaagcaccgt tcgcggcgcc gcatagggtg ccgatctgtc gccagaccgc 2672701 tcgacctcgg cggcaagcag ggcgccaacc tccgcgcggt cgatcttgcc gctgtcggta 2672761 aaggggatgc ggtctagcag cgtgacgtgg cgcggaatca tgtgcgcggg caccagatcg 2672821 gcgagctgct gtcgaatcga ctccgcggtc acgccggcat cgtcgacgca gaccgccgcg 2672881 gccagcacat cggacccgcc aggaagcacg gtggccgccg ccgcgtgcac accgggcaag 2672941 cgctgcagcg cggcttcgat ctcgccgagt tcgacgcggt acccgctgat cttgacgcgg 2673001 tgatcggcac ggccgacgaa ctccagggtg ccgtcgtgcc agtagcgggc cagatcaccg 2673061 gtgcgatacc aggtgcggcc gtcatgctcg acgaagcgct ccgcggtcag ctcgggacgg 2673121 ccacggtaac cccgggcgat tccgcgaccg gacacccaca actcaccggc cacccaatcg 2673181 gggcagtcgt cgccgctgtc ggccactacc cggcaggcgt tgttgggaaa cgggacgccg 2673241 tatggcaccg aggcccagtc cggtggcaga ttggccgcgt cctggacctc gaaaatggtt 2673301 gcgtggaccg cggtttcggt ggctccaccc aaccccgcga accgtgcgct cggcgcttgc 2673361 acctgcaggc ggcgggccag gtcgggacgc acccagtcgc cgccgacggc caccgctcgc 2673421 agcgacgaca gccggccccc gccgacttcg agcagcatgt ccaaccagcc cggcatgaaa 2673481 ttcaacgccg tgacctcgta agtgtcgata agccgggccc aggcgtcggg atcgcggcgc 2673541 tgcgcttcgt cgaccaccac gatcgctccg ccggagcgca gggcggcgaa gatgtccagc 2673601 accgacatgt cgcactccag cgtcgccagg gcaagccagc gatctgcggc gcctagctcg 2673661 aagtgccgga tgaaggtctc cacggtgttc atcgcggcgt cgtgcgccac ctcgacaccc 2673721 ttgggttccc cggttgagcc cgaggtgaac aacacatagg cgagcgcggt gggatcgcta 2673781 ggcccgggca cgaattctgc cggcgcggcg gcaagcacgt cagccagcaa cagcgtcggg 2673841 accggcaccc gcacttggca tggcgggccg caaacgagcg ctaagttgac cgaaccggtc 2673901 gccaggatgc gctccgcgcg gtcgcggggc tggtcgacgc cgatcggcag atagaccccg 2673961 ccggcggcca aaatccccag cacagccgcc acttgttcgc ccgttttcgg acccagcacc 2674021 gcgacggtgt cgccgactcg taggcccgca gcacgcagcg ccgcggccac cgccgatgcc 2674081 tggtcgcgca gttgggcgta gctcaagtcg ccggaactgg cgaacaccgc cggcgcgtcg 2674141 ggctgctgtt gggcctggcg gaaaaacccg tcgtgcagcg cctcggtgct gggggcggcg 2674201 gtgcgaccgt tcagcgccgc gcgcaccgcg cgttgcgcgg cgggtagcgc ggacgggctc 2674261 ggcgcatccc aggcgtcgtc cccggcggcc aaccggagca attcgtcgac ctggtgggtg 2674321 aacatggcgt cgatgacgcc gggtgcaaag accccctcgc ggacatccca gttcaccagc 2674381 acaccgccgt cgaactcggt gacctgggcg tcgagcagca cctggggccc ctgcgaaatg 2674441 atccatccgg gtgtgccgaa ttgctcggtg acgtccgggc agaaaaggtc gccgagcccc 2674501 agcgcgctgg tgaataccac cggtgccagc acctgggtgc cacggtggcg gctgaggtca 2674561 cgcagcacag acagcccggg gtatgcactg tggcctgcgg cgctgcgcag ggcttcctgc 2674621 accgcctgcg cccgcgccgc cgccgtgcgc gcaccggtca gatcgacgtc gagcaacagc 2674681 gaggaggtga agtcaccgac cagcaggtcg acgtctggat gcagggcctg gcgactgaac 2674741 aacggcaggt tcagcaggaa ccgcgacgac gctgaccaac gcgccagcac gttggcaaag 2674801 gccgcggcca gcgtcatcgc cggggtgatg ccgcgggccc gggctcgggc gaacaacgcg 2674861 tcgcgggtct gcgggtctag ccagtgccag cgccgggtgc tgcggcgccg gtcgcgttcg 2674921 ccgccggccc gggtaggcag cgcgggcgga tccggcagct gcgggatgcg ctgcgcccac 2674981 cagtcccggt cggcgtcgcg aaccggttgg ggcagcgtct cctccgcctc gatagcctgc 2675041 cggtattccc ggtaggtgta gcccagtgcc ggcggttcac ggccgtcata gagggccgcc 2675101 aggtcggcca gcaagatgcg gtagctcatc gcgtcagcgg cctgcatgtc caggtcgaca 2675161 tgtaggcggg tgcgctcccc cggtaataac gtcaacgcaa gttcgaatac cgcaccgtcg 2675221 agctgctggt gcgatttggc gtcgcggatc cccgccaacc gctgatcgac gacatccggg 2675281 gccacgtgac gcaggtcggc aacactgatg ggaaagtcgc gagatcccgc cgccggcggg 2675341 atgcgctggg tgccgtcggg caagaactgc acccgcagca tcgggtgccg cagcgccaac 2675401 cgggtggccg ccgcgcggag cctgtccgga tcgacccggg caccatcgaa ctcgacgtag 2675461 aggtgcccag ctaccccgcc gagctgttgg tggtcgtggc ggccgaccca catcgcgtgc 2675521 tgcatcggcg ccagcgggaa aggctcgcct tcctgggata acccggcatc ccctggtgcg 2675581 gcaactgccg tgggcgcgac gccggtgccg gcggacacca gttgggacca ggcctcgatt 2675641 gtgggtgtgg cggccagtgt ggcgaagtcg acggcgatgc ccttccggcg ccagcgcccc 2675701 accagcgaca tcatccggat cgagtccagg ccctgaccaa cgaggttggc gccggggtgc 2675761 agagcatcgg cgcggacacc gagcaactct gcgacctcgg cgcgaatgat ctccgagcac 2675821 gccgtagcat gcaccacaaa ccctcccctg ttagcacagg ctgccctaat tttagtggtt 2675881 accctatctt cgaaccacgc acctgcgcta ccagcccccc tgttaaggag cccacatgcc 2675941 accgaaggcg gcagatggcc gccgacccag tcccgacggc ggactgggtg gctttgtacc 2676001 gttccccgcg gatcgggccg cgtcgtaccg ggcggccggc tattggtcgg ggcgaaccct 2676061 ggacaccgtg ctctccgatg ccgcgcggcg ctggcctgac cgcctcgcgg tggccgacgc 2676121 cggtgatcgt cccggccacg gcggcctcag ttacgccgaa ctcgaccagc gggccgaccg 2676181 ggccgccgcg gcgctgcacg gcctgggcat cacgccaggc gaccgggtac tgctccagct 2676241 gccaaacggc tgccagttcg cggttgccct gttcgcgtta ttgcgggcgg gagcgatccc 2676301 agtgatgtgc ctgcccggtc accgcgccgc cgaattgggc cacttcgccg ccgtcagcgc 2676361 ggccaccggg ctggtggtcg ccgatgtggc cagcgggttc gactatcggc cgatggcgcg 2676421 cgaacttgtt gccgatcacc ccaccctgcg ccatgtcatc gtcgatggcg atccgggacc 2676481 gttcgtgtcg tgggcgcagc tgtgcgccca ggccggcacc ggttcgccgg caccgccggc 2676541 cgatcccgga tcgccagcgc tgctgctggt ctccggcggc accactggca tgcccaaact 2676601 cattccacgc acccacgacg actacgtgtt caacgcgacg gccagcgccg cactctgtcg 2676661 gcttagcgcc gacgacgtct atctggtggt gctggccgcc ggccacaatt tcccgctggc 2676721 ctgcccgggc ctgctcggcg cgatgaccgt cggggccacc gccgtgttcg cccccgatcc 2676781 cagcccggag gccgccttcg ccgccatcga gcgccacggt gtcaccgtca ccgcgttggt 2676841 tccggcactg gccaaactgt gggcccaatc ctgtgagtgg gagccggtga caccgaagtc 2676901 actgcggttg ttgcaggttg gcgggtccaa gctagaaccc gaggacgctc gccgggtacg 2676961 caccgcgctc accccgggcc tgcagcaggt gttcggcatg gcggaggggc tgctgaactt 2677021 cacccgcatc ggcgacccac ccgaagtggt ggagcacacc caggggcggc cactatgccc 2677081 ggccgacgaa ctgcgcatcg tcaacgccga tggtgagccg gtggggcccg gggaggaagg 2677141 cgaactcttg gtgcgcgggc cctacacgct gaacggctat tttgctgccg aacgcgacaa 2677201 cgagcgctgc ttcgatccgg acggcttcta ccgcagcggc gacctggtcc gccgccgcga 2677261 cgacggcaat ctggtggtca ccgggcgcgt caaggatgtc atctgccgtg cgggagaaac 2677321 catcgccgcc agcgacctcg aagaacagct gctgagccat ccggcgatct tctcggccgc 2677381 ggcggtggga ctacctgacc agtatctggg ggaaaaaatc tgcgctgcag tcgttttcgc 2677441 tggagctccg attacgcttg cggagttgaa cggctacctt gaccggcgtg gtgtggccgc 2677501 gcatacgcga cccgaccagc tggtcgcgat gccggcgctg cccacaacgc cgatcgggaa 2677561 gatcgacaaa cgagcgatcg tccgccagct cggcatcgcg acgggtcccg tgacgaccca 2677621 gcgctgccat tgactgacgt caacaagttg aattgactgc gttgcatgac cgacggtgtt 2677681 ccggcccgcg ggtcacttcg atcacgcggc gcggtagcgg tgagctcgat ggtgttgcgg 2677741 cccatcaccg gggcgattcc gccagacggg ccgtggggga tatgggcctc gcgccggatc 2677801 atcgccggac tcatgggcac gttcgggccc tcgctcgcgg gcacccgagt ggaacaagtc 2677861 aactccgttc tgccggacgg acgccgggtc gtcggcgaat gggtgtatgg accgcacaac 2677921 aacgcgatca atgccggacc cggtggcggc gccatctatt acgtacacgg cagcggttac 2677981 acgatgtgtt cgccccgaac ccaccggcgg ctgacatcct ggctgtcgtc attgaccggg 2678041 ctaccggtat tcagtgtcga ttaccgactg gcgccgcgct accgtttccc gaccgcggcc 2678101 accgacgtgc gggcagcctg ggattggtta gcgcacgtat gcggcttagc cgcggagcac 2678161 atggtgatcg ccgcggattc cgcgggtggc catctgaccg tcgacatgct gctgcaaccc 2678221 gaggtcgccg cccgacctcc ggcggcggtg gtgttgtttt cgccgctgat cgacctcacc 2678281 ttccggctgg gcgccagtcg tgagctgcag cgccccgatc ctgtcgtgcg cgctgaccgt 2678341 gcggcccggt cggttgcgct gtactacacc ggagtcgatc ccgcccacca ccggctggcg 2678401 ctcgatgttg ccggcgggcc accgctgcca ccgacgctga tccaggtggg tggagccgag 2678461 atactcgagg ccgatgcgag acaactcgat gccgacatcc gcgctgccgg cggcatatgc 2678521 gagttgcaag tgtggcctga tcagatgcat gtgttccagg ccctgccgcg gatgacgccc 2678581 gaagcggcca aagccatgac ctatgttgcc cagttcatcc gcagtacaac agcacgtgga 2678641 gacctctgaa cgttactggc gtgcaaccag ataaggcgtc aatgtggata gcttttcgca 2678701 agtctcctcg aattcgcgct ctggctccga ttcttcgatg atgccggcgc cggcccgcag 2678761 ccaagtccgc ccgccgacct ggtatgccgc ccgcagcgtc agcgcggcgt ctagcccgcc 2678821 atccgccgaa agcatcacca ccgcaccgga atacagccca cgtgggcact catcgaggcg 2678881 aaagatggcc tcaacgccag ctgctttcgg gattccggat gcagtgacag caggaaaaag 2678941 ggcttccagg gcggccatcc ggtcgctcga tggatccaac cgtgctctga tggtggagcc 2679001 gaggtgctgc acactgccgc gctcgcgcac cgtcatgaaa tcgatgaccg cagcactccc 2679061 tggttcggcg atgtcggtaa tctcctcaag cgaagagcgc actgaaatgg cgtgctcgac 2679121 aatttctttg gagtttgatt ccaggtcatc acgagccagt cggtcaatgg cgggaccacg 2679181 gcccaaggcg cgggtaccgg ccaacggctc ggtgatcacc actccgtcgg cgcgcaccgc 2679241 cgtgacgagt tcggggctgt aacccagagc acggattccg cccaactgca acaaaaacga 2679301 cctcaccggg gtgttgtgcc gacgccccag ccggtaggtc aacggaaagt cgatcgcgaa 2679361 aggcacttcg acacaacggg acagaatcac cttgtggtag cggccggcag cgatttcatc 2679421 gacggctacc gccacccgac ggcggaagcc ggatggatcg tcggagacgt cgacggagcg 2679481 ggactgcggc acctctcgca ccccggtggc gagtaatcgg tcgatggcct cgcggtggcg 2679541 aatcccagca tcgaacaggc gaatctcctt ttcgctcacc atgatccggg ttcggggcga 2679601 aaacacccgg gccagtgggg tgtgcggcgc cagccgctgc tgcaacccat agcggtgcac 2679661 gccgaattcg aaggcgaccc agccaaaagc ttgatcggtt tccagcaaca gccgatcgac 2679721 ggcttcgccc agggccgctc ccgggcgacc cgaccattgc tgtcgccgcg taacgccatc 2679781 acggatgacg cgcagttcgt cgctgtctag ctccaccatc gcctgcacac cggcggccag 2679841 gacccattgg ccgtcgcact cgtagagcag gtaatcctcg tcgacggact cggtaaccac 2679901 cgccgccagc tccgctgcca ggtcggcggg gttgacaccg gcgggcatcg ggatggacga 2679961 cgacgcggtg ctgacggcgc ctgtcgcgac gctgagctcg gacacagcta gtaaatgtag 2680021 cctaacctac ttaatgggtc gcagcccccc ggggtcgtcg catgtccaac gtgctcgact 2680081 ggaagaaaat gctcgtcggg agcaaatggc accagccggg gcggcgacag gacccaccca 2680141 cggccggacg gtccgcggac tgcgtttcgc agcgtaatca tttccgcagg cagaggcggt 2680201 cgcggccggt gctcgccggt taccatgccc gccaactcac gcacacgaaa tcgtgaaacc 2680261 tttgccaacc gtttactggc tagctacaaa gcaaggtttt gccttcgccg gaattctcct 2680321 aacatcactc actaaccacg tagaccatcc ggtcgacgac gtagtcgcgg tacgcgtggc 2680381 tcgccaagct cggtatgtcc gctgggtctg cccaggcatc gcccgatcgt tagccagtca 2680441 acagagagga cccgacgatg ttcgtaatcc ggctcgccga cggcgaagaa gtccacggcg 2680501 agtgcgacga gctgacgatt aacccagcaa ccggcgtcct cacggtctgc cgggtcgacg 2680561 ggttcgagga aaccaccacg cactactcgc cgtcggcgtg gcggtcggtg acacaccgca 2680621 agcggggggt cggcgttaga ccatccctgg tctcaactgc tcaataagcc cgagccacac 2680681 tttctagatt cgacttgata ttcctggtcg ctcccctgac gctgggtgct tcctggatcg 2680741 ccgcaccagg tatgggaggc gccaatgctg catgagttct gggtgaactt cactcacaac 2680801 ctgttcaagc cgctgctgct gttcttctat ttcgggttct tgatcccgat cttcaaggtg 2680861 cgattcgagt tcccctatgt gctctaccag ggcctaaccc tgtatctgct gctggccatc 2680921 ggttggcacg gcggcgaaga actcgccaag atcaagccgt ccaacgtcgg cgccatcgtt 2680981 gggttcatgg tggttggctt cgccttgaac ttcgtgatcg gcaccttggc atacttcctg 2681041 ctgagcaagc tgaccgccat gcgccgggtc gacagggcga cggtcgccgg ctattacggg 2681101 tcggactcgg cagggacatt tgccacctgt gtagcagtcc tgaccagcgt cggcatggcc 2681161 ttcgacgcct acatgccggt catgttggcc gtcatggaga tccccggctg cctggtggcg 2681221 ctgtatctgg tggcgcggct gcggcaccga gggatgaacg aggcggggta catggccgac 2681281 gagcccggct acaccacagc ggcgatgatc ggagcggggc ccggcacgcc cgcccggccc 2681341 gctcacagcg acagcctcac ggcccaagcc gagcgcggca tcgaagaaga gttggagctc 2681401 tcgctggaaa agcgcgagca tccaaattgg gatgaagacg gcgtcaaaga cagcggcacg 2681461 aatgcgtcga tcttctcacg cgagttgctg caggaagttt tcctcaaccc ggggctcgtt 2681521 ctcctcttcg gcggcatcgt catcggcctg atcagtggac tgcagggaca gaaggtccta 2681581 cacgacgacg acaacttctt tgtggcggca ttccagggcg tactttgcct gttcctgttg 2681641 gagatgggca tgacggcgtc gcgtaagttg aaggatctgg cgtcggcggg cagtgggttc 2681701 gttttcttcg gcctgctggc accgaatctg tttgcgacgc ttgggatcat cgtggcccac 2681761 ggctacgcat acgtcactaa caacgacttc gcgccgggca catatgtgct gttcgcggtg 2681821 ctctgcggcg cggcgtccta tatcgccgtc ccggccgtgc aacggcttgc gatccccgag 2681881 gccagtccga ccttgccgct ggccgcgtcg ctgggtttga cgttctccta caacgtcacg 2681941 atcgggatcc cgctgtacat cgagatcgcc cgcatcgtcg ggcaatggtt ccctgccacc 2682001 ggggcttcga tcggttagcc cagcagagtg cgcaccaccg cgtcggccag caatcgcccc 2682061 cggccggtga ggaccagtcg gtcgccgtgg tagtccagca atccgtcggc caacaccgcc 2682121 tcggcacgtt cccgttcggc agcccctagc cgggcgagcg gtagcccctg gcgcagccgg 2682181 accttcagca acacgtcttc ggtgtgcaaa gcgtcggcgc ccagctgctc gaagcccgct 2682241 accggcaacg tcgccccggc cagtatctcg gcgtaagtgt tggggtgctt gacattccac 2682301 cagcgtgtca cgccaatgta gccgtgcgcg cccggacctg cgccccacca ctggccaccg 2682361 tcccaataac ccaggttgtg ccggcactcg ccgcccggtc gacaccaatt ggacacctcg 2682421 taccaggcaa acccggccgc cgacagccga gcatcgacca actcgtagcg atgcgccagc 2682481 acgtcgtcat cgggcgcggc cagctcacca cgccgaaccc ggcgagccag tgccgtgccg 2682541 tgctcgacga ccaaggcata cgcggacaca tgatccacac cggcctgcac cgcggcgtcc 2682601 actgagcgca ccaggtcgtc gtcggactcc cccggggttc catagatcag gtcgaggttg 2682661 acgtgtgtga agccctccgc tatcgcctcg gtggccgcgg ccgccgcccg gcccggcgag 2682721 tgcacccggt ccaaggttgc cagcaccctc ggggccaccg actgcatgcc gagcgacacc 2682781 cgcgtgtaac cggccgcgcg gatcgtggcg aagaactccg gccacgtcga ctcggggttg 2682841 gcctcggtgc tgacttcggc gtcgggcgcc agcacaaagt ggtcccgcac catgtccagc 2682901 aacgtggcca ggcgctcccc cccgagcagc gatggcgtcc cgccacccac atacacggta 2682961 tgcaccgtcg gtgcgtccag cttggcggcc gccagttcga gctccgcccg cagcgccagc 2683021 agccaacggt ccgggctgac gccacccagc tgggccgggg tgtaggtatt gaagtcgcag 2683081 tacccgcaac gggtcaggca gaacgggacg tgcaggtaga ccccgaacgg ttgtccgggc 2683141 atgggcgcca ggccgggcag ctcaactggt gcctgccgaa ataccatgcc aaatcatcgc 2683201 atagcgcgta ccagctaggg tggccagcaa tgtaacgcag gcacacctca atcgtccctg 2683261 ctccccgaac aacctccagt ctcggccgcg aggaacgtca ggatgtgggt gagcgagccc 2683321 agcggtgcgt ctccctgact acaagaacta catttcggcc acgcacccgg gccttgggtt 2683381 ttcataatgt tgtctgcgac ctcgatctgt tgctggggac tcgcggccgc cggcgacccg 2683441 acaccaccgt tggaatccca cgtcgcctgg ctgatctgca gaccaccgta taacccgtta 2683501 ccggtgttgg ccgcccaatt gccgccggat tcgcattgcg cgatggcgtc ccaatcgatg 2683561 tcgtcggctt tcgagctgat ggtggacaga cccaacaacg cgacaaacat ggtcgcgaca 2683621 acggcggttt cgatgaacac cgtgcatacg atcctggcgc acctgtcacg tggtcggcca 2683681 gcacccgcag tagtaagcaa acccggtgtc atagcagctc caccttgctg gccagccagc 2683741 ggccgttcac cttgtccatg atcaccttga tccgactgcg gtcgatctgc ggcgttgggc 2683801 tgttccggtt gctgaccgac tggtcgatga acatcaggac cactaccttg ttcgtggtgg 2683861 ctgatttgac cgacgccgcc acgacggtcc cgtgggtggc cacccgattg tcggccagca 2683921 gttggcgaag gtgcgcactg gatttgccgt acttatcttt gaactcgccg gtcgaaccct 2683981 cgagaatgtc cctcatgttg tggtcgatcc gctcacagtc catggtggcc agcttgacga 2684041 catagctgcg tgcggcctgc agtgcctggc cggcggcgac gtctgtctga tgcttctcaa 2684101 agagcaccca tccgcaccat ccagacccgg ccaacgacac aaccaccgcg acggcgccaa 2684161 cccagccaat caccgatctg gttaaccgac cgcggccggg agtttcggcc ggttcgccgg 2684221 tgccgcctgg ctcacttgca ccgtggcccc ggccgaagat ggccattctg cgcacgattg 2684281 acctcgatca ctatccgcta agacaactat ctcagtagtc atatttggtc acatctgtca 2684341 ctcctgtcaa cgtcaggtgc gcgtctccca gcggattccc gggtcggcct atccatccat 2684401 ccaggcttgt tgcgtagttt tgatcatcgt gaaaagaaat ttgaccaggt cgcgcagctg 2684461 cacgccatcc atggcagaat gtcaccgtga ccgccgccaa gaacccgcgc cccgatctgc 2684521 gaatcgcgct ggtggctcgg cggcacatcg acctcaagcg ggtctgcagc tgtggctgtc 2684581 ggccttgacg ccgtaaaccc agcccacctg tatctgcagc cggcgaccgg atctgcccct 2684641 cccggaacaa gcggcgttta gcgcgtccta ggtcggcgat gtccgcgaag gagaaccccc 2684701 aaatgaccac tgcacgtccc gccaaggctc gaaatgaggg ccagtgggcg ctgggacatc 2684761 gcgagccact caacgccaac gaagagctga agaaggccgg caacccgctc gacgtgcggg 2684821 agcgcatcga aaacatctac gccaaacagg gtttcgacag catcgacaag accgacctgc 2684881 gagggcgctt tcgctggtgg ggcctgtaca cccagcgtga gcagggctac gacggcacct 2684941 ggaccggtga cgacaacatc gacaagctcg aggccaaata cttcatgatg cgggtgcgtt 2685001 gcgacggcgg cgcgctctcg gctgccgcgc tgcgcacgct gggccagatc tcgacggagt 2685061 tcgcgcgcga taccgccgat atctccgacc ggcagaacgt gcaataccac tggatcgaag 2685121 tggaaaacgt ccctgaaatc tggcgacggt tagacgatgt cggactgcag accaccgagg 2685181 cgtgcggtga ctgcccgcgg gtagtgctgg gctcgccgtt ggccggcgag tcgctcgacg 2685241 aagtgctcga cccgacctgg gcgatcgagg agatcgtgcg tcgctacatc ggcaagcccg 2685301 acttcgccga cttgccgcgc aagtacaaga ccgccatctc tggcctgcag gacgtcgcgc 2685361 acgagatcaa cgacgtcgcc ttcatcggcg tcaaccatcc cgagcacgga ccaggcctgg 2685421 atctgtgggt gggcggtgga ctgtcgacca acccgatgct ggcccagcgg gtcggcgcct 2685481 gggttccact gggcgaagtg cccgaggtgt gggcggcggt cacctcggtg tttcgcgact 2685541 acggctaccg gcgactgcgc gccaaggccc ggctgaaatt tctgatcaaa gactggggca 2685601 tagcgaagtt ccgcgaagtg ctcgaaaccg agtacctcaa gcgtccgctg atcgacggtc 2685661 cggcccccga accggtcaag catccgatcg accacgtcgg ggtgcaacga ctcaagaacg 2685721 ggctcaacgc cgtcggagtc gcccccatcg ccgggcgggt atcgggcacc atcctcacgg 2685781 cggtcgccga cctgatggcg cgggccggtt ccgaccggat ccggttcacc ccctaccaga 2685841 agctggtcat cctcgacatt ccggacgcct tgctcgacga cttgatcgcc ggtctggacg 2685901 cgctggggct gcagtcgcgc ccgtcgcatt ggcgccggaa cttgatggcg tgcagcggga 2685961 ttgagttctg caagttgtca ttcgccgaaa cccgggttcg agcacagcat ttggtgcccg 2686021 agctggaacg ccggcttgag gacatcaact cgcagctcga cgtaccgatc accgtcaaca 2686081 tcaacggctg cccgaactca tgtgcgcgaa ttcaaatcgc cgacatcgga ttcaagggac 2686141 agatgatcga cgacggacac ggcggctccg tcgaaggctt ccaggtgcat ctgggcggac 2686201 acctcggcct ggatgccgga ttcggccgca aactgcgcca gcacaaggtc accagtgacg 2686261 aactcggcga ctacatcgac cgggtggtgc gcaacttcgt caaacaccgc agcgaaggtg 2686321 aacgcttcgc gcagtgggtc atccgggccg aggaggacga cctgcgatga gcggcgagac 2686381 aaccaggctg accgaaccgc aactacgtga gctggccgcg cgcggagctg ccgaactcga 2686441 cggcgccacc gccaccgaca tgttgcgctg gaccgacgaa accttcggcg acatcggcgg 2686501 cgccggcggc ggcgtgagcg gacatcgcgg gtggacaacg tgcaactacg tagttgcttc 2686561 caacatggct gatgcggtgc tggtggatct ggccgccaag gtgcgaccgg gcgtaccggt 2686621 catctttctt gataccggct accacttcgt cgaaacaatc ggcaccagag atgcgatcga 2686681 gtccgtctat gacgtccggg tgctcaatgt cactccggag cacacagtgg ccgagcagga 2686741 cgaactgctg ggcaaggact tgttcgcccg caacccccat gaatgctgcc ggttgcgcaa 2686801 ggtcgttccc ctgggcaaga cgctgcgtgg ctactccgcg tgggtgaccg ggctacggcg 2686861 ggtcgatgca ccgacccggg ccaatgcccc gctggtcagc ttcgatgaga cgttcaaact 2686921 agtgaaggtc aacccgctgg cggcgtggac cgaccaagat gtgcaggaat acattgccga 2686981 caacgacgtg ctggttaatc cgcttgtgcg ggaaggctat ccgtcgatcg gttgcgctcc 2687041 gtgcacagcc aaacccgccg aaggcgccga cccgcgcagc ggacgctggc aggggctggc 2687101 caagaccgaa tgcgggttgc acgcctcgtg accgcgccgg cgacgatgca gagcgcagcg 2687161 atgctgagga gcggcgccat cgaagcaccg ccggcgacga tgcagagcgc agcgatgcgg 2687221 tgggggcacc tcccgcttgc ggaggagagc ggcaccatcg cgcctcagct cgtcctcacc 2687281 gcacacggca gcaaagatcc gcgatcggcc gccaacgcac gggctatcgc gggccggctg 2687341 gcgcgcatgc ggcccgggct cgacgtgcgg gtcgcgttct gtgagctcaa ctcgcccaac 2687401 ctggtcgacg tgctcaaccg ctgtcgagga gcagctgtgg tcaccccgct gctgctggcc 2687461 gatgcctacc atgctcgcgt cgacatccct gcccagatcg ccagctgccg cgttggtcac 2687521 cgggtacgcc aggccagtgt gctgggtgag gacattcggc tggtgtcagc gctgcatgag 2687581 cgcctcaccg agctgggggt ttcgccgttc gaccacacac tgggggtggt cgtgctcgcg 2687641 atcggctcat cgcatcccgc ggccaatgcg cgcacctcga cggtggcgtc aaggctggcg 2687701 gaggggaccc agtgggccgc ggtgacgacc gctttcatca cccgaccgga ggcttcgctg 2687761 gccgatgcca ccgatcggtt gcgacgccac ggtgcccgtc ggatggtcat cgcgccatgg 2687821 ctgctcgccc ctgggatact gtctgaccgg gtacgcggat acgcacggga agccggcatc 2687881 gcgatggcac aaccgctggg tgcacacccg atggtggccg cgaccatgtg ggatcgctac 2687941 cgacaagccg tggccggtcg gatcgcggcc taggtcttct cgaaggtctg ctggaacgga 2688001 tgtcctctgg tgagtgtttg gttgcgagcg ggcgccttgg tggctgcagt gatgctgtcg 2688061 ctgagcggat gtggcggctt ccacgcgggt gcgccaagca cggccggtcc gtgcgagatc 2688121 gtccccaatg gcacgccggc gcccaagaca cccccggcta ccgtgccttc gtcgcgcaac 2688181 ctcgcgacca accccgagat cgccaccggc taccgccggg acatgaccgt ggtgcggacc 2688241 gcccactatg cggcagccac cgccaatccg ctggccactc aggtggcctg ccgagtattg 2688301 cgcgacggtg gtaccgccgc cgatgccgtc gtggccgccc aggcggtgct ggggttggtc 2688361 gaaccgcaat cctccgggat cggcggcggc ggatatctgg tgtacttcga cgcccgcacg 2688421 ggctcagtgc aggcctacga cggccgtgag gtggccccag cggccgccac cgagaactac 2688481 cttcgctggg tcagcgacgt cgaccgcagc gcgcccaggc ccaacgcccg agcctcggga 2688541 cggtcgatcg gagtaccggg catcctgcga atgctggaga tggtgcacaa cgagcacggg 2688601 cgcacaccct ggcgcgacct cttcggcccc gcggtaacgc tggccgatgg cggttttgac 2688661 atcagcgcca ggatgggcgc ggccatctcc gacgctgcgc cgcaactgcg agacgacccg 2688721 gaggctcgca agtatttcct caatcccgac ggcagcccga aacccgcggg aacccggctg 2688781 acgaaccccg cgtactcaaa aaccctgtcc gccatcgcct ccgccggcgc caacgccttc 2688841 tattccggcg acattgccca cgacatcgtg gcggcggcga gcgacacatc gaatggccgc 2688901 acgccgggcc tgttgaccat tgaggacctg gcgggttacc tcgccaagag acgccaaccg 2688961 ttgtgcacga cctatcgcgg ccgggagatc tgcggcatgc catcgtcggg tggcgtcgcc 2689021 gtggccgcaa ccttgggcat cctcgagcac ttcccgatga gcgactacgc gcccagcaag 2689081 gtcgacctca acggcggtcg cccgaccgtg atgggggttc acctgatagc ggaggccgaa 2689141 cggctggcct atgccgaccg cgaccaatat atcgctgacg tcgattttgt ccggctgccc 2689201 ggcggctcgc tcaccacgct ggttgacccg ggctacttgg cagcacgcgc cgcgctaatc 2689261 tcgccgcaac acagcatggg cagcgccaga ccgggggact tcggcgcacc gacggccgtc 2689321 gccccgccag tgcctgagca tggcaccagc cacctcagcg tcgtcgattc gtacggcaat 2689381 gcggccacgt tgacgacgac ggtggaatct tcgttcggct cctaccacct ggtggacgga 2689441 ttcatcctca acaaccagct gagcgatttc agcgccgagc cacacgctac tgacggatca 2689501 ccggtggcta accgggtcga gcctgggaag cgaccgcgca gttcgatggc accgacgttg 2689561 gtgttcgatc actcgtcggc ggggcgcggt gcgctgtacg cggtgctcgg ttctccgggc 2689621 ggctccatga tcatccagtt cgtcgtgaaa acacttgtgg cgatgctgga ttggggtctg 2689681 aatccgcagc aggcggtttc cctggtcgat ttcggcgccg cgaactcgcc gcacactaac 2689741 ctcggcggtg agaatcccga gatcaacact tccgacgatg gtgatcatga cccgctggtg 2689801 caaggcctgc gcgcgctggg gcatcgagtt aatcttgccg agcaatccag tgggctctcg 2689861 gcgatcaccc gcagcgaggc gggttgggcc ggcggcgccg acccacgccg cgaaggcgcg 2689921 gtcatgggcg acgatgcctg agccgttcgc cggcgggcgg ccaaacgaac gcggaccact 2689981 tcgagccgat aattttgccg gccctctcgg gctttgtctg cggttttacc ggctcggtgc 2690041 attcgcgcgc tagccgatag ggtctatcgc catgtccggt gccacggtgg gtgcgcgcga 2690101 aatcaccatc cgcggagtcg tcctgggcgc attgattacc ttggtgttca ccgcggccaa 2690161 cgtgtacctg gggctaaggg ttggattgac attcgccact tccataccgg ccgcggtgat 2690221 ctcgatgggc gtgctgcggt tgttcgccaa ccactcagtg gtggagaaca atattgttca 2690281 gacgatcgcg tcggcggccg gcacgctgtc gtcgatcatc ttcgtgttac cggcactgct 2690341 catgatcggc tggtggagcg ggtttccgta ctggacaacg gcggcggtgt gtgcactggg 2690401 cgggatcctt ggcgtcatgt actcaattcc gttgcgccgc gcactcgtca ccggatcaga 2690461 cctgccgtac ccagaaggcg ttgccggagc cgaggttctc aagatcggtg actccgcacg 2690521 ggagatggag cacaaccgta ggggaattgg ggtaatcgcc ctgggcgcgg cagcggcggc 2690581 gggatatgca ctgctggcat ccctgcgggt gatcaacaac tcactgtcgg ccaccttccg 2690641 agtaggttcc ggtgcgacga tgatcggtgc cagcttgtcg ctggcgttga tcggcgtcgg 2690701 tcatcttgtt ggcgtcaccg tcggtgtcgc aatgatcgtc ggattggcta tcgcctttgg 2690761 ggtaatgctg ccaatacgga cagccggcca actgccgccg gacggggact acgccgtcgc 2690821 cgtcgccaga attttctcga cggacgtgcg gttcatcggg gcgggcgcca ttgcggtggc 2690881 ggccgcctgg acgttcttga agatcctggg gccgattctg cgtggcatcg ccgacgccgc 2690941 ggtctcagct cgaacccgac gccgagggca agcggttggc cagaccgagc gcgacatccc 2691001 gatccacatc gtggccatgg tggttcttct ctcgctgatc ccaatcggat ggctgctcgc 2691061 ggactttacc gacgggacac cgctcgatga ccgcaggccc ggcgccatcg ccgccggggt 2691121 actgctcgtc ttggtcatcg ggttgatggt cgctgcggtc tgcggttaca tggccgggtt 2691181 gatcggctcg tcgaacagcc cgatctcggg cgtgggcatt ctggtggtgg tgctggccgg 2691241 tctgctgatc aagactgcgt atggtccggc caccggctcg cagattccgg ccctggtggc 2691301 ctacaccgtg tttaccgctg cattggtctt cggcgtggcg actatttcca acgacaatct 2691361 gcaggacctc aaaaccggcc aactcgtcgg cgctacccca tggaagcagc aggttgcact 2691421 gatcatcggc gtgctcgtcg ggtcggtggt gatggcgccg atcctgcagc tgatgcaggc 2691481 tggattcggg ttccaggggg cgccgggcgc aacggccaac gcattggccg ccccgcaagc 2691541 cgcgctcatg tccgcgctgg ccaagggagt atttggtggc tcgctgaact ggtcgctggt 2691601 cggtgtaggg gccttgaccg gcgtgatagc ggtcgcgctc gacgagacac tggccaagac 2691661 gacaaccaac cttcggctgc cgccactagc ggtgggtatg ggtatgtacc tgtcggccgc 2691721 actgacgctg atgatcccga tcggcgcatt cctcgggcgg atctatgact cctgggcgcg 2691781 gtggtctggg gatgacgacg agcgcaagaa acggttgggc gtcatgctcg cgacgggcct 2691841 gattgtgggc gaaagcctat acggggtgct ctttgccgtc atcgtcgcga caactggcaa 2691901 agaggagccg ctggccatgg tcggcgacgg attcaggttt gcctcccagc cgctgggagc 2691961 catcgtcttt gccggcctcc tcgcttggct ctaccagcgc acccgggtca cagcgtcgta 2692021 ccggctggca gcgccggccg gcagctccaa gccactgccc gatttgcctg ggtaaccgca 2692081 ttgcgcccga ggggtccggc ttttcacagc aacttcacgg ttgacatcca ccttggctcg 2692141 cagctctgcg aggcagcctg aggtgacaaa gccggcggcc cgacacatgc agccgagttg 2692201 gctggctcgg aagggggaca gagttgacca tgacagcgag tgtggccaag gtgacagctg 2692261 cacgcccgga gccaagcgcg gcgtgggctg aagcccggcg gcgggtacgc caacgccgcg 2692321 aggacatgct gcgccatcct gcatttctgt ccaagcagct ccctgccgaa ccagcagacg 2692381 acgacggcgt cgcggccgtc tacgacatcg cgattgcgcg tcggcgccga cctgcttgag 2692441 cgggtcccgg cgggtcaacg tcggcggctg ccgggtaaac cggcaatcga cgaccgggcc 2692501 ttggcgggcg cgtcgcgttc tgccagctga actcgccgag cctggtcgat gtgcctgggc 2692561 tggtgcccgc gatgcccttg gacgcgctcc ggccggcgag acagccgacg agtggcttgg 2692621 gcgaatgcgc cacgatgcgt cggccagagg cgggtaacga gaaggtggcg gtgatctggg 2692681 aaagcctgga tgtcgttccc cccgagtcgc tatagtcaac tgcgccgatg ggtcaatgct 2692741 ggccaggcga tgctctggtc gacatggctt agcaatcctg acattttgga ggtgccggat 2692801 gtcgttcctg attgcttcgc cggaggcgct agcggcgaca gccacatatt tgacaggtat 2692861 cggttcggca atcagcgcgg cgaacgcggt cgcggccgcc ccgacaacag agatcctggc 2692921 ggcggggacc gacgaggtgt ccaccgccat ctcagcgctg ttcggcgctc atgcccaggc 2692981 atatcaggcg ctcagcgccc acgtggcggc atttcacgac cagttcgtgc ataccttgac 2693041 cgccggtgcc ggctcataca tggccgccga ggccgccgcc gcctcgcctc tgcaggcttt 2693101 gcagctggag ctgctcaacg ccatcaatgc acccaccctg gcgctgttgg gacgcccgtt 2693161 gatcggcgac ggcaccgatg cggcgccggg gagcgggggg gccggcgggg ccggcggcat 2693221 cttgatcggc aacggcggga ccggcggcgc cagcgactta gccgggaccg gccgcggcgg 2693281 ggtcggcggg gcgggcggcg ccggcgggct cttcggcatc ggcggcgccg gcgggggctg 2693341 cgggtccgcg gtggcgatcg ggggtgacgg cggggctggt ggcgccggcg gcgtgttcag 2693401 cggcggcggc gccggcgggg ccggcgacgc catcgggggt agcggcggcg cgggcggcac 2693461 cggtgggctg ttgggtggtg gcggcggcgc gggcggcgcc ggcggcgccg gcggcaatgg 2693521 cgggggcgcc agcaacagcg caagtatcgg gggtgacggt gggtccggcg gcgcgggcgg 2693581 catgctctac ggtgccggcg gcgtcggcgg caacggcggg gccgcggtcg ctatcggggg 2693641 tgacggcggg gccggcggca gggccggagc gatcggcaac ggcggtgacg gcggcaacgg 2693701 cgggacttcc aacacccccg gcggtagcgg cggcgacggc ggcaatggcg ggaacgccgg 2693761 actgatcggc aacggcggta acggcggcaa cgccgagatt gtcatctccg gcggtagcgt 2693821 cgccggcacc ggtggcaacg gcgggttgct gttgggcttc aacggcacga acgggctgcc 2693881 gtagcgggcg agcccgccgg cctctggatc acgtcgatgt gactttgacc cgttccacgc 2693941 cggcatcgtc gacgcccgat acgccaccgg caatcggcgg cacccgggtg gcacgcacgt 2694001 agacggtgtc accctcgcgt agggccagcg cctcggcatc gccgcgggtg atctgggcgg 2694061 tgaaggcccc gccggtggcc gcgctggtca actccacgcg gacctcgaag cccagcacca 2694121 ccacccgatc cacaacagcc cgtagcacac cggtggaccc ggcggtgccg tcagcggcgg 2694181 ccacggccat attgggagtc cggccgaccc ggatgtcgtg cgggcgcacc agggagccgt 2694241 tcaacgtgga aaccgctccc aagaaggaca tcacgaaggc gttcgccggg gcgtcgtaaa 2694301 cgtcggtcgg ggatccgacc tgctcgatac ggcccttgtg gagtacggcg atgcggtcgg 2694361 ccacatccag cgcttcggcc tgatcgtggg tgaccagcac cgtggtgaca tgcacctcgt 2694421 cgtgcaggcg gcgcagccag gcacgcagct cttcgcgcac cttggcatcg agtgcgccga 2694481 acggctcgtc gagcagcagc acctccggat cgaccgccag cgccctggcc agcgccatcc 2694541 gctgtcgttg cccaccggag agctgattgg ggtagcggct ctgaaatccg ctcaggccca 2694601 ccacctgcag cagattgtcg accttggcct tgatctcggc cttggggcgc ttacggatct 2694661 tcaacccgaa cgccacgttg tcacggacag tcaggtgttt gaacgccgcg tagtgctgga 2694721 agacgaatcc gatgccacgc cgctgtggcg gcacccgggt gacgtcgcgg ccgttgatcg 2694781 tgatggttcc ggtgtccggt tggtcgaggc cggctatggt gcgcaacagc gtcgacttgc 2694841 ccgaaccgct ggggcccaac aatgcggtca gcgaaccggt cggtacgacg aaatccacgt 2694901 ggtcaagtgc gacgaagtcg ccgtagcgtt tggtggcgtc ggccacgacg atggcgtagg 2694961 tcattttcac cgtctccttc tcagccctcg ctgaccgctc gtgcccggcg ggcgtctagc 2695021 accatctgga cgatcagcac caccacggaa accgccatca gcagcgtcga cagcgcgtag 2695081 gcaccgtact cggccccacg gtggtagcgg tcggagacca agagggtcag tgtttgcgat 2695141 gtccctggaa ggttcgacga gacgatgatg accgccccat attcgccgag ggttcgagcg 2695201 acggtcaata cgatgccgta cgtcaggccc caccggatgg agggcagcgt gattcgccag 2695261 aatgtctgcc accaaccgga acccagcgtc gccgccgcct gctcctggtc ggtgcccaat 2695321 tcgtgcaata cgggttccac ttcgcgcacc acgaatggac aggtgacgaa catgctgcca 2695381 agcacgattc ccggcagccc gaagatgatc ttgaagccaa ggtcctgctc gacgaagccc 2695441 agggcgccgg ccgatcccca cagcaagatc aacgagacgc ccacgatgac gggtgaaacc 2695501 gcaaaaggca gatcgataat cgcctgcaag acgcccttgc cgcggaaccg gttgcgggcc 2695561 agcaccaatg ccgtcgtgac tccaaagatc acgttcagcg gtaccacgat agccaccacc 2695621 agtagcgaca ggttcagcgc tgatatcgcc gccggggtac tgatccaggc gtagaactgg 2695681 ccaaagcccg gttcgaaggt ccgccacagg atcagcgcta ccggaacgat caacagcaca 2695741 aagacgtacc ccagcgcgac cgatcggacg aggtagcgag ccgccggcaa ggaggtcatg 2695801 cggccatctc ctcacgtttg gccgcacgcg cgccgacgac acgtaggatg agcagcacaa 2695861 tgaacgaaat cgagagcaat acaaccgata tcgcggccgc tccggtgcgg tcgtcgttct 2695921 cgatcagggt gcgaatccat tgcgaggaca cctcggtctt gcccggcacg gccccgccga 2695981 tcagaaccac cgaaccgaac tcgccgatag cgcgcgaaaa cgccaggccc gcaccggata 2696041 acaatgccgg cgtcagcgac ggcaacacca ccgaagtgaa gattttggca ccattagcgc 2696101 ccagcgacgc cgccgcctcc tcggtctcgc gatcgatttc cagcagcacc ggctgcacgg 2696161 cgcgcaccac gaacggcaat gtgacgaacg ccaacgccac cccaacaccg gtcgcggtgt 2696221 gttgaaaatg aagccccacc gggctgttgt tcccgtacag tgccaacatc accaggctgg 2696281 cgacgatggt gggcaacgca aacggcagat cgataatcgc atcgacgatc cgcttgccag 2696341 cgaagtcgtc acgcaccagc acccaggcga tcagcaagcc gaacaccagg ttgatgaccg 2696401 tgactgcggt cgaaatcgtc agcgttaccc ggaacgactc catcgcggca tgcgacgaga 2696461 ccgccagcca gaaggcccgc caaccaccgc ccgcggcctg ccagacgatg gcggccagcg 2696521 gcaacagcac gatcaccgaa agccacacca ctgccatacc gacccgaacg gaaggggggc 2696581 ccgcggggcc ggaaaggcgc gcgcggaact gcggcgcgcg gcgttcgccg accaacgatt 2696641 ccgtcatccg gtggcccgca gataaatctt ggtgatgctg ccggtcgcct tgtcgaacag 2696701 ctgaggatcc acgctgcccc agccaccgag gtcggcgatc gtccacagtt tcgccggcac 2696761 cggaaacagg tcggcaaaat cggcggcgac cgccggatcg accggccgga aaccggcctg 2696821 cgcccataac ttctgcgcct gcacggtgta ctggaagttt ctgaatgcgg tcgccgctcc 2696881 aaggtgtgtg ctggtcgcca ctacggccaa cggattttcg atcttgaacg tctgcggcgg 2696941 ggtgacgtgc tgcaccggtt tgcccgcccg ctcggtggcg atggcttcgt tctcgtagct 2697001 gatcaacacg tcaccgctgc cctggacaaa aacatcggtg gcttcccgcc ccgacccggg 2697061 gcgcaatttg acgtgttcat tcaccaatgt attgacaaag tcgatccccg cttggttatt 2697121 ccggccaccg tcacttttcg cggcgtaggg ggctagcaga ttccacttgg cagaacccga 2697181 actcagcgga ctgggcgtga tgacctcaat acccgggcgc aacaggtcat cccaatctct 2697241 gatgttcttc gggttacccg cgcggaccac aaacgtcacc accgacccga acgggatgcc 2697301 cttggtggca tcggcgtccc agtccttgtc aaccttgccg gccttgacca ggcgagcgat 2697361 gtccggttcg accgagaagt tcaccaggtc ggccggttta ccgtcggcaa caccgcgcga 2697421 ctggtcggcc gacgcgccat atgaggtaat cacctggact ccccggccct gttcggaagc 2697481 gttgaacgcg ggaatcaccg cactccagcc gggttccggg acggcgtagg cgaccagggt 2697541 gatgctcgta tgcgcacggt ccggtcccgc acggccgacc acgtcgctgg gaccgccatg 2697601 acaccccacg ccgataccgg cgatcaatgc gcacaccacc ccggcaggga taatgtgccg 2697661 ccagcgggat gcgctagcga tgcagctcgc ttcagaaagc gtcaaggaga gcattggcga 2697721 ccttccggtg cgggactttg gacaacgttc ccgtagcggc ggaaaggcga tcgctgaaca 2697781 ttgcaggact cacgaactcc acatcagacc gcgcacgggt ggggagtcag cgacaacagt 2697841 gcaggttggc cgcagcgccg caaacgagcg cgccgacata gcgccccgaa aaacccgatg 2697901 ctgcgtgcac gtggcgaagc ctaacagaat tcggctggcc gaccagttgg cgcgcagctc 2697961 aatgggtgag aagccaggtc acgatcacca gcgcaaccag cgtgaccaga accaacgtga 2698021 cgtgcgacct cggcatccgg gctacctggg cgcctgatcg gggcggcggg cgcggcgaat 2698081 caactgaatg acccggccga gcagggcatc cagcaatgcc gcggtgaaat aggccaaagc 2698141 cagcacgacc ggcatgactg ccaacgcctg cagcgcgaac gggagtccgg acagccacag 2698201 ctcgacgccg tcccaccaac tcaggaaccc gttcatcggg cccacactat agcgccggca 2698261 ggcaaaaccc caggtgtgtc gcgattacgg tgaccgccga cgccaaaccg cgacacggca 2698321 cacggctgct aggcccacct gagcacgcac ccaactacgc cgggcgccgg gcgtgaagtg 2698381 gacgccgagc aagtcgacag atgatgatgt cggcatggtc ctgcacgctc aaccccccga 2698441 ccaatcgacc gaaacagccc gcgaggctaa agcgttggcc ggggcaacgg acggggcaac 2698501 ggccacatcc gcggatctgc acgcacccat ggctctatcg tccagttcgc cactgcgcaa 2698561 cccgtttccg ccgatcgccg actacgcgtt cttgtccgat tgggaaacga cgtgcctgat 2698621 ttcgccggcg ggttcggtgg agtggctgtg tgtgccacgg ccggactccc ccagtgtgtt 2698681 cggcgcgatc ctggaccgca gcgccggcca ttttcgtctg ggcccctacg gtgtttcggt 2698741 gccttcggcg cgacgctacc ttccgggcag cctgatcatg gagaccacct ggcagaccca 2698801 taccggctgg ctgatcgtgc gagacgcgct ggtgatgggt aaatggcacg atatcgaacg 2698861 gcgatcgcgg acccaccgcc gcaccccgat ggactgggac gccgagcaca tcctgttgcg 2698921 cacggtgcgc tgcgtcagcg gcaccgttga actgatgatg agctgcgagc cggcgttcga 2698981 ctatcaccgc ttgggcgcca cctgggaata ctcggccgag gcttacggcg aggccatagc 2699041 ccgcgccaac acggagcccg acgcgcaccc gacgctgcgg ctgaccacca acctgcggat 2699101 cgggctggag ggccgggaag cacgcgcacg cacccggatg aaggagggtg acgacgtgtt 2699161 cgtcgcgctg agctggacca aacacccgcc gccgcagacc tacgacgagg ccgccgacaa 2699221 gatgtggcaa accaccgagt gctggcggca gtggatcaac atcggcaact tccccgacca 2699281 cccatggcgg gcgtacctgc agcgcagcgc gctaaccctg aaggggttga cctactcccc 2699341 caccggggcg ctgctcgcgg cgagcaccac gtcgctgccg gaaaccccgc gaggcgaacg 2699401 caactgggac taccgctatg cctggattcg cgactcgacc ttcgcgctgt gggggctcta 2699461 caccctggga ttggaccggg aagccgacga cttctttgcg ttcatcgccg acgtgtccgg 2699521 cgccaacaac aacgaacgcc atccgctgca ggtgatgtac ggggtgggcg gtgaacgcag 2699581 cctggtcgaa gcggagctgc accatttgtc cggctacgat catgcccgcc cggtgcgcat 2699641 cggcaacggc gcctacaacc agcgccaaca cgacatctgg ggttcgatcc tggactcgtt 2699701 ttacctgcac gcaaagtccc gcgagcaagt cccggagaac ctatggccgg tgctgaagcg 2699761 gcaggtggaa gaggccatca agcattggcg tgagcccgac cggggaatct gggaggtgcg 2699821 cggcgagccg caacacttca cgtcgtcgaa ggtgatgtgc tgggtcgcct tggaccgggg 2699881 ggccaaactg gccgagcgtc agggcgagaa aagctacgcc cagcagtggc gggccatcgc 2699941 cgacgagatc aaggccgaca ttctggaaca cggggtggac tcgcgcggcg tgttcaccca 2700001 gcgctacggc gatgaggcgt tggacgcctc actgctgctg gtggtgctga cccgattcct 2700061 gccgccggac gacccgcggg tgcgcaacac cgtgctggcc atcgccgacg agctgaccga 2700121 ggacggcctg gtgttgaggt accgggtgca tgagaccgac gacgggcttt ccggcgagga 2700181 aggcacgttc accatctgct cgttttggct ggtatcggcg ctggtcgaga tcggtgaggt 2700241 gggccgcgcc aagcggctgt gcgagcggct gttgtccttc gccagcccgc tgctgctcta 2700301 cgcggaggag attgagccgc ggagcgggcg tcacctgggc aacttcccgc aggcgttcac 2700361 ccacctggca ctgatcaacg ccgtggtcca cgtgattcgc gccgaggagg aagccgacag 2700421 ctcggggatg tttcagcccg ccaacgcccc catgtaggac ttccgatgcc gagcagacgc 2700481 aaaatcgccc aaattcgggc cgaaatgggc gattttgcgt ctgctcggca agcgtcaact 2700541 caattcgctg atcctgtcca tcatcgcgtg tgcgatatcg acggcgctgg tgctgatgtc 2700601 ggccgacccc tgatccgacg ggtgggtgat gccaaagaag gtgaccgcga cctcgaccac 2700661 gcaattgccc cgtacgccga cggcacgggc ctgagggacg gacgccagta tggagtgcgt 2700721 gccgcgtcgc agcgagaccg ttgccgcgac aactgaatcc gcaacccgga cgtcggtgat 2700781 ggagcgttga ccgaacgcgc tggcgggcac cgtcagcgtt gtgccatcac attccttcca 2700841 ctgcgcagaa aacctcgcga acagatcatc ggcggctgcc gcggaaggca gggcgacgac 2700901 accctcatcg acgtcatcca ccttcaccga ggaaccgtcg tgtcgccacg acacccgggc 2700961 gacgcttttg acctcgacgg accggtaaac gttccgctgc gtcaggtaac cgacgcccac 2701021 gcagtcagcg ggccgagccg atacatcact gtctcccaaa ctgtcgctgc ccccgaacac 2701081 cggcgggaaa ggtggaaggg cctgaaacgg ctggttgagg agcgttgaca gcgcagcgcc 2701141 gtcgagcggt acccgctgga tcagtgaacc catcagcgga cgcggcactg cgttcggcgc 2701201 cagacctgct ttcccggtcg tcgttgtggt gcacccggca gcgaggaaca cggcaaacag 2701261 cggaaccacc cagcgccagc ggtttgtcac ttcttgcctt tgtccccggc ggcatcggtg 2701321 gacaatgccg cgacgaaagc ctcctgtggc acctcgacgc gcccgatggt cttcatccgc 2701381 ttcttgcctt ccttctgctt ctccagcagc ttgcgtttgc gcgtgatgtc gccgccgtag 2701441 cacttggaca acacgtcctt gcggatcgcg cggatgtttt cgcgggcaat gattttcgat 2701501 ccgatggcgg cctgcaccgg cacctcgaac tgctggcgcg ggatcagctc cttgagtttg 2701561 gtggtcatct tgttgccgta ggcatacgcc gtgtccttgt gcacgatcgc gctgaacgca 2701621 tccaccgcct cgccctgcag caggatgtcg accttgacca gcgcggcctc ctgttcgccg 2701681 gcctcctcgt agtcgaggct ggcatagccg cgggtgcgcg atttcagtgc gtcgaagaag 2701741 tcgaagatga tctcgccgag cggcatggtg tagcgcagtt ccacccgctc gggggagaga 2701801 tagtccatgc cgcccaactc gccgcggcgc gactggcaca gctccatgat ggtgccgatg 2701861 aactcgctgg gcgcgatgat ggtggtcttg acgacgggct cgtagaccgt gcggatcttg 2701921 ccctccggcc agtccgacgg attggtcacc cggatttcgg tgccgtcgtc tttgtgcacc 2701981 cgatacacca cattgggtga ggtcgagatc aggtccaggc cgaactcgcg ctcaaggcgc 2702041 tcacgggtga tctccatgtg cagcaggccc aagaaaccgc accggaaccc aaaacccagc 2702101 gccaccgagg tttccggctc ataggtcaag gccgcgtcgt tgagctgcag cttgtccagg 2702161 gcgtcgcgca ggttcgggta gtccgaaccg tcgaccggat acaaccccga gtagaccatc 2702221 ggtttgggct cacggtagcc ggtcaacgct tcggcggcag ccccgcgggc ccgggagagg 2702281 ctggtcacgg tgtcgcccac cttggactgg cggacgtcct tgacgccggt gatcaggtaa 2702341 cccacctcgc cgacaccgag gccctcacac ggtttcggct cgggtgagac gatgccgacc 2702401 tcaagcagct cgtgggtggc gccggtggac atcatcatga tgcgctcacg ggggctgatc 2702461 ttgccgtcga cgacgcggac gtaggtcacc actccgcggt agatgtcgta aacggagtcg 2702521 aaaatcattg cgcgggtagg tgcctcggcg tcgccctgag ggggcggcac ctgtcggacc 2702581 acctcgtcga gcaggtcgga cacgccttcg ccggttttgc cggacacccg caacacctcg 2702641 gccggctcgc agccgatgat gtgtgccatc tcggcggcgt aacggtccgg gtcggccgcg 2702701 ggcaggtcga tcttgttgag caccgggatg atgtgcaggt cgcggtccaa cgccaggtag 2702761 aggttcgcca gcgtctgcgc ctcgatgcct tgcgcggcat cgaccaacag caccgcaccc 2702821 tcgcaagcct ccagcgcacg cgagacttcg taggtgaagt cgacatggcc cggggtgtcg 2702881 atcagatgca gcacgtagtc ggtcttgtcg acccgccagg gtagccgcac attctgggcc 2702941 ttgatggtga tgccgcgttc ccgctcgatg tccatccgat ccaagtactg ggcccgcata 2703001 gagcgttcgt cgaccacgcc ggtgagctgc agcatccggt cggccaacgt tgacttgccg 2703061 tggtcgatgt gggcgatgat gcaaaagttc ctaatctgcg ccggcgcagt gaaggttttg 2703121 tcggcgaaac tgctgatggg aatctcctgg agcgggggtt gacgggtatc cagggtatcc 2703181 gcgtcgggca gctgcgaccc aatcgcgctc ggtcgatcgc gtctatgctg cgagcatggc 2703241 gtccgcacgg aagtcacagt ggaaaacgtt gcagcgcttc gcggagaacc tggtgttcac 2703301 tgaggctcct aagctggtgc gtcacctgca aaacacgcag gaaacgcttc gcacaatccg 2703361 gcaagccgtc aagatcaccg cgaacatcat gaccaccgcc gtgccgtcgc caccggccga 2703421 aattgccgcg ggccggccgg tgaccagcac cagctgtccc accgcagcgc gagcccgcag 2703481 acttgtctac gccccggacc tcgatggccg ggccgatccc ggcgagatcg tgtggacttg 2703541 ggtggcctac gagcaggacc ccacccgcgg caaagaccga cccgtgctcg tcgtgggccg 2703601 agaccgcagc gttctgttgg ggttgctggt gtccagccag gagcgccatg ctgccgaccg 2703661 ggactgggtg ggaatcggtt ctggcgcttg ggactacgag ggccgagaaa gctgggtacg 2703721 gctggaccgg gtgctcgacg tacccgagga gagtatccgc cgcgaaggcg cgattctgga 2703781 acgcgaggtc ttcgacgtgg tagccgcccg gctgcgtgcc gactacgcct ggcgctaaac 2703841 cgggccgggc ggccagcgca atcggctggg caacgagccc cgatcaggcc ccaatcagcc 2703901 ccgcctggcg acgacgcggg ccgcccagcg gcccgctgag gagccgggca gtcagccccg 2703961 cccggcgacg atgcgggccg cccagcggcc cgctgaggag ccgggcaatc agccctgagt 2704021 gatgtaggac tgaagctgct gctgctcggc ctcgagttct cccatgcgcg atttcaccac 2704081 gtcaccgatg ctaacgatgc cgatcagttt cttcccgtcg agcaccggca cgtggcggac 2704141 ccggttttcg gtcatcagca cactgatctt gtcgaccgtg tcggattttg tacaggtggc 2704201 gacggtggtc gacataatct tggcgaccgg gcgagacagc acgctggcac catacgtgtg 2704261 tagctggcgc accacgtcgc gttccgacac gataccgacc acgccttcgg cgccgaccac 2704321 taccatggcg ccgatgttct gctcagcgag gccagcgagc agctccccga ccgtggcgtc 2704381 ggggttgatc gtcaccaccg ccgccccctt gttccgcaag acgtccgcga tgcgcatcaa 2704441 ggcctcccgc cggtggtgag ctggttcaca ccaggctacg gcgaactcgg gcggcgggaa 2704501 agccgatacc ggaatatgcg gcatctagca cccgaacccg caggtgcccg gcggtcggta 2704561 gctgcgtagc ccgggcagga attcggccgc cgacaacgcc catgtcggcc gcatcctcga 2704621 ggctaaaact cgttggccat cagccgaatc ggtcgatcgg ggccgctgga tccatcgagc 2704681 ttgtcaggat agggccatgc ttgagatcac gttgctcgga actgggagcc ccattcccga 2704741 cccggaccgt gccggaccat ccactctggt gcgggccggc gcgcaggcgt tcctggtgga 2704801 ctgcggtcgc ggcgtgctgc aacgcgcggc ggccgtcggt gtgggcgccg caggattgtc 2704861 ggcggtgctg ctcacccatt tacacggcga cgtgcttatc accagttggg tcaccaactt 2704921 cgctgctgat cccgcgccct tgccgatcat cggaccgccg ggcaccgccg aagtggtgga 2704981 ggcgacgttg aaggcattcg gtcacgacat cggctatcgg atcgcccacc acgccgatct 2705041 gacgacacca ccaccgatcg aggtgcacga atacaccgca ggcccagctt gggatcgcga 2705101 cggcgtgaca atccgggtgg cccctaccga tcatcggccg gtcacgccga cgatcggatt 2705161 ccggatcgaa tccgacggtg cttcggtggt gctcgccggt gacaccgttc cttgtgacag 2705221 cctcgaccag ctggccgccg gagcggatgc gttggtacac acggtgatcc gcaaagacat 2705281 cgtcacgcag atcccgcagc aacgggtcaa ggacatctgc gattaccact cgtcggtgca 2705341 ggaagccgcc gcaaccgcga accgcgcagg ggtgggaacc ctggtcatga cgcactatgt 2705401 gccggctatc gggcccggac aagaagaaca gtggcgggcg ctggccgcga ccgagttcag 2705461 cgggcggatc gaggtcggca acgacctaca ccgagtcgag gtgcacccgc ggcgctagca 2705521 cgccagctat gaccaaccag ccccgacacc agggcgatcg ataaggcaag aagtagatcg 2705581 cccgaaccag cgccgggtcc gtgctgaccc tcgggcgcca cacggtcttg cccagcaaac 2705641 cggtcagccc ggacgctccc gcccgccacg gtgccgccgg ccaacgccga tcgtcgaacc 2705701 ccacccggtc actgaaagct gccgcaggcg gttggctgat gcaacaccgc ggtggcaata 2705761 cgtgcagcgc gaccggctca tcgcggatct acggcgcaac cgcggtgatc ggcgtcacgc 2705821 cgcgggtgcg acccccacgg gaccccggtt cccactgctg tttggcggtg aatcgctgac 2705881 accgtggacg gcgcccagcc gcggctgttc gcggtggtgc agccgacccg atttcacgga 2705941 aacacaggct gtcatcagcg agggaaacta ttcgccgtgc aaagcatttc catggcgcca 2706001 caccgatagc cggcttgtgc tgatcgcacg tcccgatatc ttatgcagtc gcggtccgga 2706061 ggcaatgcgg gccaaagccg ccgatttgga cttggctgcg gcggcaaaga cggtcggagt 2706121 gcagcccgcc gccgatcagg tggcggcggc aattgccgca atattgctgt cacacgccca 2706181 gatctaccag gacatcagca cacagatggc ggcattccac gaccagctcg tagagaaccg 2706241 cacggcagat agcacgtcgt acgccagcgc cgaggccaac gcccagcaga gcctgctcaa 2706301 tgcgatggat gcaccgagct ggcaacagcg ccgagaaacc gtcggcgagg tggggctccc 2706361 agcggaccca gcgggatccg gcacggcgac ggcggcagtg gcggcggcga cgacggcgcg 2706421 ggcaggaagc cgttcggccg cccaggcaac cgtggcgcct atcggcgggc tgaaactccg 2706481 ccgcgaatct gcgctaagcc agccgggtga tctccaccac cacgtcgagg tcggtgacgc 2706541 cctccccaga gtagatccct ttcagcgggg aaacgtcggt gtagtcgcgg cctacaccca 2706601 cactgatgta ttgctcggtg atctcattgt cattggtggg gtcgtagtgc caccatccac 2706661 cggtccaggc ctgaacccag gcatggctgc gcccgtctac cgtctttccc accacggcat 2706721 cacgcttagg gtgtagatac ccagacacgt accgacaggg aattcccatg ctgcgcaaca 2706781 ccatcagcga caagtgcacg aagtcctggc agacgccctt gccttgttcc agcgcatcga 2706841 gcccggacga gtgcacactg gtggtgcccg gaatgtagtc cagctcgctg cgcgcccacc 2706901 gggcggcggc gactacggcc tcgctgggct catggcattt cctgatccgc ctgccgacgg 2706961 catcaacgcg ggcgcttgcc ggggtgtgcg gggttgggcg gagcacttcg tcgaacctgt 2707021 cgatcacggc cgtcgattgc aggtcggccc aggttgcctt ggcggccaac ggctccgggc 2707081 gctcggtctc caccaccgac gaggacgtca ccgtcagttc ggtgtgcggc gcatgcaagt 2707141 caaacgccgt cacggcagta ccccaataat cgatatagcg gtaggagcgg gtggccggga 2707201 tggtttcgac tcggttgagg acgaggttct gccgcgaact cgaccgaggg gtcagccggg 2707261 cttcgttgta tgaggccgtc accggcgact ggtagacata tccggtggtg tgcaccaccc 2707321 gggttcgcca catcaggatt cctcttggct tccgacgagt tggccacgct ggcctgcatc 2707381 cgaccacgca acccagggag ctgcgtgaaa gtactgcagc gccaatgcat ctccgacatc 2707441 acgacaggtc gtctgcaagc ccgccaggcg gctctccaag gtctcgagca ggacgccggg 2707501 ttgcacgaat tccagctcgc tgcgtgcttg ccctaacaac cgctgtgctt cggtggtcgc 2707561 cccgatccgg ctgtgcggat tgtgcatcaa ctcggcgaga ttgtgttcgg ccagcttcaa 2707621 cgagtgaaag accgagcgcg ggaaaagccg gtcgagcatc atgaactcca ccacccggcc 2707681 cgcgtccagc acaccgcggt aggtgcgcag gtacgtgtcg tgcgcacccg ccgagcgcag 2707741 cagcgtcacc caggccggcg acgatgcgct atcccccacc cgtgacagca acagccgcac 2707801 cgtcatgtcg acccgctcaa tcgcgcgccc aagcaacatg aagcgatatc cgtcgtcacg 2707861 caaaagcgtc gaatcggcca ggccggcaaa catcgccgca cggccctcga tgaacgacag 2707921 aaactcgtgc ggcccaaggc gtttggcagc gcgttcgcgt tcaggcaggg cgttataggt 2707981 ggtgttgaga cactcccacg tctcgctgga ggtgacttcc cgcgccgatt ttgcgttttc 2708041 ccgtgccgcc gagatcgcgt cgacaatgga agaaccaccc tggctattgg tgctgaaagc 2708101 caccaggtcc gtcaaggacc agacatccag ctcgtggtcg ggcggctcga tgcccagcac 2708161 ccgcagcagc agccgggagg cctggtcggg atcgacactg gaatcctcga gcaattgatg 2708221 caccgcgacg tcgagaatgc gcgcggtgtc gtcggcgcgc tcgacgtagc gaccgatcca 2708281 atacagtgct tcggcgttgc gggcgagcat cagtggaacg cctgctgttg ttgttgctgc 2708341 tgctgttgcg gttgttggtc gtgcggttca tacccggacg cgtccaccgt tgggtcgcac 2708401 agcggctgcg gcagcgaacg cacaatctgt gcagcgccca actcgcgggc ggccgccgaa 2708461 gcgcgcgggg ccagcaccca ggtgtccttg gagccgccgc cttggctgga gttgaccacc 2708521 cgggaaccct caaccaacgc cactcgggtc agcccgcccg gcagcaccca tacctcgtta 2708581 ccgtcgttga ccgcgaacgg ccgcaagtcc acgtagcggg gcgccagcgt gccttcgatc 2708641 cgggtcggca cggtcgacag ttccatcatc ggctgcgcga tccagctgcg gggatcgtcg 2708701 cggatctttt ggctaacggc cgccaattcg gcctgagagg cttccgggcc gaacacgatg 2708761 ccgtaaccac cggatccctc gaccggcttg aggaccaatt cgcggatccg gtccaacacc 2708821 tcttcgcgtt cgtcatccag ccagcatcgg agggtttcca cgttcgccag cagcggcttt 2708881 tcgtggaggt agtactcgat catggtcggc acgtacgtgt agacgagttt gtcgtcaccg 2708941 actccgttgc cgatcgcact ggacagcacg acgttgccgg cccgggcagc gttgaccaat 2709001 ccggccaccc cgagcaccga atcggcacgg aactgcagcg gatccaggaa ggcgtcatca 2709061 atgcgccgat agatgacgtc gacctggcgc tccccctcgg tggtgcgcat gtatacctgg 2709121 ttgtctcgac agaacaggtc gcggccctcg accaattcga cacccatctg ccgggccagc 2709181 aatgaatgct cgaaatacgc cgagttgtag accccagggg tcagaaccac gaccgtgggg 2709241 tcggcctcgt tggtggccgc cgagttgcgc agcgcgcgca gcaggtgcga agcgtagtca 2709301 tcgaccgccc gcacccgatg ggtggcgaac aggttcggaa agacccgcgc catggtgcgc 2709361 cggttctcca tcacatacga cacccccgac ggcgagcgca ggttgtcctc gagaacccga 2709421 aagtcgccgc ggtggtcgcg gatcaggtcg atgccggcga cgtggattcg cacaccgttg 2709481 ggtggcacga tcccgactgc ctgacggtga aagtgctcac aggaggtcac caaccggcgc 2709541 gggatgacac cgtcgcgcag aatctcctga tcaccataga tgtcgtcgag gtagcactcg 2709601 agggccttga cccgctgggt gatgccacgt tccagtcggg tccactcggg ggccgaaatg 2709661 acccgtggca ccaggtcgag cgggaacggc cgctcctggc ccgacagcga aaacgtgatg 2709721 ccctggtcga tgaacgcacg ccccagcgca tcagcgcggg ccttgagttc ggacgcgtcc 2709781 gacggcgcca gctcagcgta gatacctttg taggggccgc ggacaatgcc ctgggcatcg 2709841 aacatttcgt cgaaggccat cgcatagacg tccgacgtgt tgtagccgcc gaagatgcgt 2709901 tcgccgcgtg tgggcgaccg ccgccgggtc tcgttgagtt ggtttggcag actcacgcgt 2709961 ctcatgctgc ctcaaattcg acattccggc agaccacaga ttccgctttt gggcgaaaac 2710021 gtaaccgact gataacctgg gcagccgaat cacaccgaca aagggaactt gcacgtggcc 2710081 aacatcaagt cgcagcagaa gcgcaaccgc accaacgagc gcgcccggct gcgcaacaag 2710141 gcggtgaagt cctcgcttcg taccgctgtc cgtgccttcc gcgaagctgc ccatgcaggc 2710201 gacaaggcaa aggccgcgga actgctggcg tcgaccaacc gcaagctgga caaggcggcc 2710261 agcaagggcg tgatccacaa aaaccaggcc gccaacaaga agtcggcact ggcccaggcg 2710321 ctcaacaagc tctgacagcc acctgccgac tcatcggccg cggtcggcca ccaactcggc 2710381 gacctgccgg accgcggatt ccagcgcgta gtccgcatcc gcgacggcgc ccttgacgtt 2710441 agcattgagt tcggccacca acctcatcgc ggtcgccacc gtgtcacgcg accaccgccg 2710501 agcctgcttc tgggctttct gcacccgcca gggcggcatc cccagttgtg cggccaggcg 2710561 gtacgggtcg ccggactgcg gcccgacccg gccgatggtg tgcacggctt cggcgagcgc 2710621 atcggccaac accactagcg gctcaccgcg catcatcgcc caccgcaacg cttcggcagc 2710681 tcccgccacg tcgccggcta ccgccttgtc ggcgatgtcg aagcccctca cctcggcttt 2710741 gccgctgtga tagcgccgta cagcggcggc gtcgacggct cctccggtat cggcgaccag 2710801 ctgtgaacag gccgaggcga gttcgcgcac gtcggagccg acggcgtcca gcagggcggt 2710861 cacggtctcg tcgtcgacct tgacccgcag cgacgcgaac tcgctacgga tgaagtcggc 2710921 gcgctcactg accttggtga tccgcgcgca cggatgaacc tgcgcaccca tcgaccgcag 2710981 ctggttggcc agcgatttgg cgcgcccgcc acccgagtgg accactacca gcacggtgcc 2711041 ggccggaaga tcggcggcgg ccgactcgat taccgcggca gcgtccttgc ccgcctccgc 2711101 agcggccccc agcacaacga tccgctcctc ggcgaacagt gacgggctca gcagttcggc 2711161 gagctcatag gcaccgacgt cacccgcgcg cattcggctc accgggacgt cggctgtacc 2711221 tgcccgctgc cgagccgagc gcaacacgtc ggccaccgcc ctttcgacca gcagttcttc 2711281 gtctcccagg accaggtgca acggcttagc ctcgctcacc ccacgatggt gtcacgaagg 2711341 gccgaccagc ccggacagcg accaggcaag cagacatatg acggccaccg ccatcgtttt 2711401 gcacatggcc gcgcgaaacc agcgccagcg ccactgcgca accgtgaaca cggtggcgcc 2711461 accgaccagc agtacgccgg gcagacctgc ggccaccgga acggtcgccg cgggcacacc 2711521 cgacgcccaa tgcgccacgc gcaacaccca ccacacttcg ggcccggtga accggatcag 2711581 cacctgcgcg ccggccggcc acggcacgac cagcacggcc gcaacgctgc ccagcacggt 2711641 gatcggcgcg atcacggccg ccaccgccag attggccacc acggccacca gactgacccg 2711701 gccggagatg gcggccacca gtggcgccgt caccagctgc gcggccgccg cgactgcgag 2711761 ggcatcggcc agcaccttcg gacatccgcg gtcgaccaag cggcgtgacc aaaccggcgc 2711821 gatgacgacc agtgcacccg tggccgccac ggacagcgcg aagccgatgt ccacagcaag 2711881 atggggagcg gcagccagca aaaccagcac gctacccgac aaagctggaa tcgcctgccg 2711941 ccggcgcgca gacagcatcc ccacgagggc aatggcgccc atcacagctg cccgcaacac 2712001 gctggccgtc ggctgcacca ggatgacgaa tgccaccaac gcgacggccg cgcacaccac 2712061 ggccgcacgc ggtccgatca accgtgccga aaccagcgcc gccgcacaca cgatcgtgac 2712121 attggccccc gagaccgccg tcaagtgcgt caggcccgcc gcacggaact cgcggctggt 2712181 taaggcggtg accgtcgagg tatcgccgag aaccagggcc ggcaacatcg tggcctggtc 2712241 agcgggcagc acctcacgaa ccgcggccgc gaatcgatgg cggacgatgt gagcggcgcg 2712301 gtgtaccggg ccggcacggc ccacggtcgg ccgaccggtc gcattgaaca ccgcgaccgt 2712361 caggtcgtga cgcgccgggc gactgatacg cgcgcggaac tggacgggct gtccgaccat 2712421 cagctcgccg aagtccagcg ctcgcgcgaa aaccactacc cggccggatg tctcgtcatc 2712481 ccgcagccgt tgaaccgtcg cccggaacat caaccggccc cgccccagcg acactgggct 2712541 ctcgctgggg gtgaccgtga ccagcgcgga ggtgccaaat gccacggtga ttgggtggcg 2712601 atcgaccgcc tcggagcgca acgcgaccgc aagcccgtac cccgcgccca ccataccgac 2712661 cgcgaccagg ccggcgctga tcgaacccag tcgcggagcg tgccacgacc ggcgcgccac 2712721 acaccaccac agtgcgccgc cgccgagggc caccacgacg cagcacaagg cacacacgtt 2712781 gccgatcggc cacacgatcc cggccgccgt cacaatccag ctgaccagcg ccgccgggac 2712841 caggcgtacg tccaaacggg acgcgccgaa gcccatatgg cgcaccggta tcagacacgg 2712901 accagattgc gccgcttgtc cagccgcgcc ggaccgatgc cgtcgacgtc ggcaagctgg 2712961 tcgacgctgg tgaacctacc attgcgctgc cgccacgcca caatcgctgc ggcggtgacc 2713021 ggcccgatgc cgggcagggc gtccagctgc tccacggtcg cagtgttgag gtcgagcacc 2713081 tcagctgtct taggagctgt cttagggcct gtcgtggctg tgcccgaggt acccgccggt 2713141 cccggcgtcc ccgcaccgac cgagctgccc agcaccctcg gctgtcccga gggcggagct 2713201 agcccgacca cgatctgctc accgtcacca agctgccgag ccatgttcag tccgacggtg 2713261 tccgcgccgt ctaccgctcc gccggcggcc tgtagcgcat cggcgatccg cgcgcccggc 2713321 gccagggtga cgagtcctgg ggtgtgcacc aggccaacca cgctgaccac caccggcagg 2713381 ccggaacggt ccggcgagcc cgggcttgcc gacgacctag ggttcgtcgg cgaaaccggc 2713441 tctaccggag gaagtttggc tgacattacc ggctcagtcc ggtcgcggat caaggtgaat 2713501 accgtcacca gcaccgcgag ggcggcgatc accgccaatg cgacggcgcc ggcacggccc 2713561 ggatctgcgc gtatcctgtc cgcccaacct tgcccacggg aagtgtcggg aagccagcgc 2713621 ggcagcagcg agttcggatc gtcgcgtggc tcgtcgtggt ctggaccgtc gtccgttgga 2713681 tcgtgtggct ccgggtctaa gtgtgcagat gcggcgtgcg agtcgatatc cgggacggca 2713741 ccgagccgcc tttgcagtcg ctcggcgggc agttctgttc gcatgggccg accgcagctg 2713801 cggggaccgc cagaaccggc gcgcacgacg gcgtcgcgct gccctgctgt ggatcaatcc 2713861 gaggctgtgg acaagccgct ttggcgatgg atcaagatgg gacaaaccgc gccaacatcc 2713921 ccgaacaacc agcaccgggc tgcgacgtcc atccggactc ggctcaccgc gatcgagagt 2713981 gtactcggca acgcgatccg cgagtgctga gccgcgcggc cgatccccat ccatggcgtg 2714041 tggcgaccga agccggcgcg cagcacgcgg gtcgctgacc acgccgaaaa gcccgtcagc 2714101 ctagcgccgg cctagcacgg ccttcagaac tcgaacgcgg tctggacggg aacatcactg 2714161 gcaaacgccg cgtcgagtcg acgaagcagc tgggaatctt tggtgcgcaa ccggttagcg 2714221 gcggctaacg tcgaagcgcg gtgcgctcca aggtaaaggc tgcccagtac gtcccgatcc 2714281 atttcgatct cggctgccgc atcggtcggg gtacaccgcg cacggccgtc accgatcttg 2714341 agcgcgaacc ggccgccatc ggatacctcg aggaccgtgg aaaactcgcc aacttcgtga 2714401 gcgtaaccac gcgcctcgag tgcggccggt acgttcatga tgcgcaacca caggccgtcc 2714461 tggcgccagg tagtgcgggc cagtcgggta tcggtgagca ggtggggtaa cgggtcctgt 2714521 ggatgggtga tgatgctgat tcgctccatg gagtcgaggc caatcagggc ccgccacaac 2714581 gcacaatgcg catctgcggt taccgccctg agttcgctga cgcgcgctag cttgagatcg 2714641 gtgcgatcca cccggtacag cgcgtacccg tcgggatgca gtaacgcgaa cgattcacgg 2714701 tctccaccgg gcgcggcttt gcattctgcc agcagctcgt cccagagcac ctgcgggcgt 2714761 agcagcccgc ccggcacctg ctggcgccat cgctcgtaga tcgcctcaaa ctcgccgcga 2714821 tgctcggtgg gtctgaccaa ccggacgctg ctgccaccta ggccgccgcc cggtgcgtcg 2714881 gcgtgaaagc gcgcgaagcg tcggtcgacc gtcagctcat gcaaggtggt agcgggcccg 2714941 tagccgaacc ggccgtagat gccgccctcg ctagcatgca gtgccgcgac cggatagccg 2715001 gaatcggcta tgcggcggtg cagttcggcg cacatcgcgc gcagcaagcc gcgccggcga 2715061 tgcgtcggcg ccaccgcgac gaaactgaga ccggcggtcg ggagcaccac ttcaccaggc 2715121 accgtcaacc gcagatccat gtacagcgcc atcccgacca cctcagaacc cgggccggca 2715181 ccatcgcgga ccaccaccgc tccgtcggtg ggcaccaggg tccgccaggc ggtcgctgat 2715241 tcagggccga tgaaatcggt gaaactggcc gcggccagta ggaacatccc cggccagtcg 2715301 tcctcggtcg ggctacacag ggtcacagtc acagaatccg actgtggcat atgccgcggc 2715361 cacgtgcacg tgaatattac gacgacagtg tctggcaaag gatcacgcga tgcgggtagc 2715421 cccgccagcg tgacgccact gcgagaatca gcgacgaatt tcgccgtgac gttacgctgg 2715481 cggcgacgct cccacgtcga cgcatacccc gacggctccg gcaccgacgt gcagagcaag 2715541 taccggtccc atggcggtca ccatggccgg ctcacacgcc ggcagccgct ccgccagcgc 2715601 cgccgccacg tcgttcgcag ctgccgggtc ggcgacgtga tgcaccgcga gagcggcggg 2715661 gcggtcgccg acaagctggc aaacccggtc gatcatcacc gccgtcgcgt tgctcacagt 2715721 gcgaacccgt tggaccagaa caagttttcc gtcgtcgact gacagcagcg gcttgagcgc 2715781 cagcgcggtg cccaaccatg ccttggcccc actgatgcgc ccgctgcggc gcagattgtc 2715841 caaccgcgct acagcgacga acgcgtgaat ccggcttacc gccgcagccg ctgcgcgcgc 2715901 gaccgtatcc agctcatcgc ctgcggcggc tgcccgcccg gccgccagtg ccgcgaaacc 2715961 gacgcccatc gcggccgacc tcgagtcgat caccctaacg gcgggaccta gttccgccgc 2716021 ggtcagctcg gcggctcgaa aggtacccga cagcgccgac gaaatgtgca ccgccactac 2716081 cccgtcgccg ccactgtccg ccaacgcccg ttggtaggcg gcggacagct caaccggggt 2716141 cgccccagcg gtggtggcgt ggcgcttgtg gatgtcatcg gggatttcgt ccacaccgtc 2716201 gcgcaggtcg aggccgtcaa gcaagatatg cagcgggacc tggcggatcg accactgttc 2716261 gcgcaggtcg gccggcagtc gacacgacgt atcggtcacc accacaacgg tcaccggcgc 2716321 cgctctcccc cgcaagcggg aggtgccccc acctcatcgc ttcgctctgc atcgtcgccg 2716381 gcgcggggca tgtctcagcc gcgcgatttc tcgttcggca ccccggcttc ggccagtgcc 2716441 ttgagcatca gttcggcgac cgcctggtgg gcttcaaaat tccagtgaat gccatcacga 2716501 ttaccatatc cactcaatat ctgttctgcg acagcggctt tgagatcaac tagaggaatg 2716561 tcatggtgct gtgcccattc cgtgatcgcc gccaccgtgc ctgcgcggcc gtgatgggcc 2716621 ttgccgtagg tctcggcgat atgcaccgag ggcagcgatg cgatgatcgg tatgcccgga 2716681 cgattgaaat caattgcacc acgggtcttt tcaaggtact cagcggtcag gtgcggcggc 2716741 aacgccgcac gggccactgg cgacagtcgc ggttgaaccc aggcgtagcc gtcgcggacc 2716801 caccgtcgca gccaagacgg acgtacatag cggatgagct cacgcagcgc cgtcggtaat 2716861 accgacggca gcgaatccat tccgccggtc gcgaagatca ccgctccggc cctgggtaac 2716921 gccgcccaag cgcgcggatc ctgggttgcc gcccaccaga catcccgaca ggtccagccg 2716981 atgcggccaa tcagctctaa atcccaatct agttgggaag caacaatatt gggccagata 2717041 cgggggtcat cggcaggcag gccgccggtg ggcccgtagt aggccagcga gtcagcgaag 2717101 accaacaatg cgggcctgcg cccgcgccta gaggacatcg ctggagacct gcgccgaagc 2717161 attccacaca tcaaggcgcc accggatgct ctcgaagtcg gagcccgggg cccaatggcc 2717221 actcagctga gtccaactgg cattgcccat gccgcccaaa gccggccagt tggccaccgg 2717281 caacttcagc agcgccgccg acaacgcggc gatcagaccc ccatgggcta ccagcaccac 2717341 cgggcgatcc ggctcgtcag cgccacccca ttccggttcg ctggcaacca actcggcaac 2717401 caacggccga cttcgggcag ccacgtcaac cctgctttcc ccgccgtgcg gcgcccaggt 2717461 cgcatcctcg cgccaggcca accgggcgcc cggggcatca gcgtcgatct gagcgtgggt 2717521 taagccctgc caatcgccaa ggtgagtttc ccgcaatcgg gtgtcgaccc ggaccacaag 2717581 gccggtgcgc tcgcccagct tgaccgccgt gtcatatgcg cggcgcaggt ccgacgatac 2717641 gatcagtagc ggctgccgct tgcccagcac ctcggcggcc gcgaccgctt gggtgcggcc 2717701 aagttcgctc aactcagtgt ccagctggcc ctgcatccgg ctaccgacgt tgtagtccgt 2717761 ttgtccatgc cgcagcatca ccagtcgccg cgctctcatt gcgcacccgc tgagttcgcc 2717821 gataaatcaa ccggcaccac cgggcagtca ccccacaacc ggtccagggc gtagaaattg 2717881 cggtcgtcct gatgctggat gtgcaccacg atgtcccggt aatccaacag cgtccagcga 2717941 ccctcgcggg caccctcacg gcgggccggc cggtaacccg cctgtcgcat tttctcctcg 2718001 acctcatcga cgatggcgtt gacctgccgc tcgttggagc ccgaagcaat gacgaagcag 2718061 tcggtgatga ccagctgccc ggagacatcg atgaccacga cgtcatcggc gagcttggcg 2718121 gcggccgcgc cggcggccac cctcgccatg tcgatggctt cccggttggc ggtcataggc 2718181 cattcccagc ggccaggctg gtcgttgaac gcgcgcccgc gtcgcaggcg ccacagtaga 2718241 gccggcactt ggagacatac tgcacgacgc cgtcgggcat caggtaccac agcggccggg 2718301 actgctcggc gcgctgacgg cagtcggtcg acgaaatggc cagcgccggg atctcgacca 2718361 gagtcaacgc atccttggcc agctgaccca gcaggctagt gatgtgttcg ttgcgcaact 2718421 cgtagccggg ccggctgacc cccacgaacc gcgccaattc gaacagctcc tcccagccct 2718481 gccaggacat tatggaagct agcgcatcgg cgccggtggt gaagtacagc tcagagtccg 2718541 ggtgcaaagc atgcagatcg gccagcgtgt ccttggtgta ggtgggtccg ccgcggtcga 2718601 tgtcgacccg gctcacagag aatcggggat tggaggcggt ggcgatcacc gtcattaggt 2718661 agcggtgctc ggcggcggag acctgtcgac ccttttgcca gggttgcccg ctgggcacga 2718721 ataccacttc gtcgagatcg aacaggtcgg ccacctcgct ggcggcaacc aggtggccgt 2718781 agtggatggg gtcgaacgtc ccacccatga ctcccaatcg acgcccatgc acgattggcc 2718841 agcttactgg attatcttgc cgcagttccg ttcgcggcaa ctgccagcca gcctaagcga 2718901 gcagccattg ataaggcagc acgattggtt attcctaagc ctttgcgtga tcatcttggt 2718961 ctcgttccgg gtgaggtcga ggtcgtcgcc gacggggcgg gactgggtgt cgcgaccctc 2719021 gccggtgact ccctcggcga gcggcatggc ctaccggtga tacccgcggg cagtgcggcg 2719081 cgatgccggc cagcgttagc accgtgctcg tggacacgag cgtcgcggtc gcaccggtgg 2719141 tcgccgatca cgaccaccac gaagatacct ttcaagcgct acgtggccgc accctcggtc 2719201 tggccgggca cgcggctttt gaacgcagga cgctggcgac cgtggcgaag ctgcttgcac 2719261 acacattccc ggcgaccagg ttcctcggcg ctggggcggc gatgtcgctg ctacccgaac 2719321 tcgcaccggc cgaaatcgcc ggcggagccg tctaggatgc gctgatcggt acggctgcca 2719381 acgagcatcg gctccccctg gcaacccgcg accggcaggc gctgaaggtc taccgcgcgc 2719441 tcgcaatgga agccgagctg ctggcctgag cgtcgcggtt gcgcggccaa tcacacccgc 2719501 gccgctgcca ggccaacggc tgcccagctg cccggtccct cacgtttttc acccgatgta 2719561 cccgcaacga tctactcggt cgtgtagaag gggtctgtgg ataatttgcc gatcgaatca 2719621 gccgagtcga cgcggttggc gaaggcggcg atgacccgac ggttttacac ccgctcggtg 2719681 gtgaaaggcg agatcacgct gccggccgtg ccgagcatga tcgacgagta cgtgacaatg 2719741 tgcgccggcc tttttgcggg tgtgggcaga aagttttccg acgaagaact tgctcatctt 2719801 cgcgcggtgc tccagggtca gctggcagag gcgtacgcgg cctcccagcg ttcgaccatc 2719861 gtcatctcat acaacgcccc catgggcccg accttgcact accaagtccg agcccaatgg 2719921 cggacggtgg cgcaggaata cgagaactgg atcgccaccc gtgagccgcc gctcttcggt 2719981 accgaaccag acgcacgtgt gtgggcgctg gccaacgaag cagccgatcc tacgacgcat 2720041 cgggtgctcg aaattggcgc cggaaccggg cgtaacgccc tggcgttggc acggcgcgga 2720101 cacccggtcg acgtggtgga gatgaccccg aagttcgccg acatcattcg ctccgacgcc 2720161 gaacgagatt ccctcgacgt gcgcgtcatc atgcgtgacg tcttctcgac catggacgac 2720221 ttgaggcagg actatcagct gatggtgctc tccgaggtgg tgccggactt ccggacgacg 2720281 cagcagctgc gcaatctgtt cgaactcgct gcccagtgcc ttgctcccgg tgcccgcttg 2720341 gtgttcaacg ccttcctggc gaacggagat tacgcacccg accaagccgc gcgtgagttc 2720401 gggcagcaga tgtataccgg gatgtgcacg cgggccgaga tgtctgctgc agcggccggc 2720461 cttcctctcg aactcgtcgc cgacgactcg gtatacgact acgagaaaac gcacctgcca 2720521 ccgggcgcct ggccgcccac cagttggtac gccgactgga tccgtggcct cgacgtgttc 2720581 accaccaacg ttgagagctg cccgatcgag atgcgctggt tggtgttcca gaggaggcgg 2720641 tgagcagtcg caaaagcccc cgaaaccggt cggatttggg ggctggtacg tgaattaggg 2720701 tgaccacggc aagcgtgacc cgccggcgac tgcagcgaag ccgggtctgt tggtgacagt 2720761 gtgtatgtcg gggtttcagg cggcaggttc gagggtgacc cccaatcctt gggcttcgag 2720821 tttggcgacg aggcgacgtc gttctttgtc gggatccatg cgggtggtga agtagtcggc 2720881 gccgagatcc tggtgaggcc ggccggtggc cagcacgtgc caaatgatga cgatcagctt 2720941 gtgggcgacg gtggtgatcg ccttcttgtt ggcagcggga ctgcggaagc caccgaactt 2721001 gcggacctgg cgacggtagt actcgcgcag gtagccatcg gtgcgcacgg cggcccacgc 2721061 gcactcgacc aggaccggct gcaggtgctg gttgcctgtg cggcgggcac cgtgatggcg 2721121 tttgccggcc gattcgtggt tgcccgggca cagccgcacc cacgaggcca gatgctcagc 2721181 cgaggggaac caggccgccg ggtcggcgcc gatttcagag atgaccgtcg ccgaggcacc 2721241 caccccgatc cccgggatcg atgcaatcag ctcgcgtcgg gcacaaaagg gatgcatcag 2721301 ctgctcgatc tgctcgtcga gagcaccgat catcgcatcg agctgatcca gatgagccag 2721361 gtgcaaccta cacatcaggg catggtgatc atcgaagcgc ccttccagcg cccgctgcag 2721421 atcggggatc ttcgagcgca tactgccgcg cgccagatca gccagcaccg ccgggcggcg 2721481 ttcaccgtcg atgagcgcct ccaccatcgc ccgcaccgac ttgggggtga ccgaggacgc 2721541 cacgctgtcg gccttgatcc ccgcgtcttg aagcacattg cccaggcgct gcagcttcga 2721601 ggtgcgatgc tcgaccagct tgcggcggta gcggatcacg tcgcgggcgg ccttgatgtc 2721661 ggcgggcgga atcaaccaac cccgcagcag accgcattcc agcaggtgca ccaaccactc 2721721 ggcatccaag aggtcggttt tgcggcccgg ccgttcttca cgtgcccggc attgcacacc 2721781 agcagctcac tcgccgtggg ccaacaacgc gtgataagcg ggcgcaccag cgcccatcat 2721841 gttcttttac gactgcccgc ccggcctaca ccggcagtag ctggtcgatc accgtggcca 2721901 gctgcttggc ggatcggcat tcgtgcatcg tgatcacctc ttggtagcgc ggcaccgccg 2721961 agtcaccgct gccccacaga tgcttgggct ccgggttgag ccagtgcgcg tgccggctgg 2722021 cggtcaccat gtcggccagc acgtcggtgg ccgggttgcg gtagttggtg cgcccgtcac 2722081 caagcaccag cagcgagctg cgcggcgaca gcacatttgg gaagccctgc atgaacgaga 2722141 cgaacgcgtt gccgtagtcg gaatggccgt cgcgggcata cacaccagcc tcccgggtga 2722201 tccgctggat cgctatggcc aggtccgatt ccggcccgaa catatgggtc acctcgtcgg 2722261 tggagtcgat gaaggcgaag acgcgaaccc gggagaactg ttggcgcagc gcgtgtacca 2722321 gcagcagcgt gaagtggctg aagcccgcga ccgagcccga cacgtcgcac aacacgacga 2722381 gttccgggcg cgccgggcgg ggtttgtgca acaccaggtc gatcggcacg ccgccggtgg 2722441 acatcgactt gcgcagcgtc ttgcgcagat cgatcgatcc cgcgcgggcg cggcgccgcc 2722501 gggcggccaa ccgggtcgcc agggtgcggg ccaacggggc caccacccgg cgcatctggc 2722561 gcagctgctc acccgaggca cgcagaaact cgacgttctc ggaaagctgt ggaattccgt 2722621 acatctggac gtgctcgcgg ccgagttgct cggctgtgcg ccgcttggtc tcggcgtcga 2722681 ccattctgcg cagctgcgcg atcttttgtg cggcaagcgc tttggcaatc tgttcctggg 2722741 tggctgtggg ctcatcgccg tagggagcaa gcaggcccgc cagtagcttg ccctccagtt 2722801 cgtccagcgc catggccttg agtgcctgat acgacgagaa cgacggaccg cggctggaac 2722861 tgtacttgcc ataggcctca acgatccgcg cgatcatctc caccaaccgc tcgtccttgc 2722921 cggccaggtc ttggttgttg gccagcagat ccagcagcag ctgccgcata gcctcgacat 2722981 catcgggcgg caaacccccg gagcctgccg actcgtcttc cgtggtgatg accgcccgag 2723041 cccccagtgc cgcgggaaac cacaggtcga acatggcgtc ataggtatcg cggtggtcag 2723101 gccggcgcag caccgcacaa gcaatgccct cccgcaacac ctcacgatca cccagcccga 2723161 gggtggccat cacccggccg gcatccaccg tctctgacgg gcccaccgaa atcccgctgc 2723221 cacgcagcgc ttccacaaag cccaccaagt gtccgggcag cccatgcggg gcgagtggcc 2723281 gggcagcacg aatacgacgg gcggccacta gttcaacctg agctctccgg tggcccgttg 2723341 ctggtcggat tggtgcttga gaaccacgcc gagcgtggcg gcaacgaccg catcgtcgat 2723401 ggtgtccagt cccagtgcca agacggtgcg accccagtcg atggtctcgg cgatcgatgg 2723461 caccttctta agctgcatgc cgcgcagcac gccgatgatg cgcactaact cctcggcgaa 2723521 gtgctcgggc agctcgggaa ctcgggataa caggatgcga cgctccagct cgggggtcgg 2723581 gaagtcgatg tgcaagtaca ggcagcgacg cttgagcgcc tcggacagct cacgggtggc 2723641 gttggaggtc agcagcacga acggcgcccg ggtggcggtc agggtgccca gttcggggac 2723701 ggtcaccgcg aagtcggaca gcacctccag cagcaggccc tcgatctcga tgtcggcctt 2723761 gtcggtttca tcgatcagca gcacggtggg ctcggtgcgc cggatagcgg tcagcagcgg 2723821 acgctgcagc aggaactctt cgctgaacac atcggttttg gtggcctccc aatctcccga 2723881 gccggcctgg atacgcagga tctgcttagc gtggttccac tcatacaggg cgcgagcctc 2723941 gtcgacgccc tcgtagcact gcagccggac cagaccggat ccagtggcct gcgccacggc 2724001 gcgcgccagc tcggtcttgc cgaccccggc ggggccttcc accagcagcg gcttgccgag 2724061 ccggtcggcg agaaagaccg ccgtcgcggt ggcagtgtcg ggcaggtagc cggtctcggc 2724121 cagccgccgc gagacgtcgg cgatgtcggc gaacagcggc gtgggccggg cgggcacggt 2724181 cacgatcggg tctcctctag cacgatcggg tctcctctag ccaacggcgt caggccggac 2724241 gggtgtggcc ggctccccat gcgatccact tggtcgacgt caattccggt agtcccatcg 2724301 gtccgcgggc atgcagtttc tgggtggaga tgccgatctc ggcgccgaag ccgaattgct 2724361 cgccgtcggt gaacgccgtt gatgcgttca ccatcaccgc ggccgcatcg atctgttcgg 2724421 taaagcgttg ggccgcatca agattggtgg tcacaatcgc ttctgtgtgc ccggtgccgt 2724481 attcgttgat atgggcgatg gcagcgtcga caccgtcgac caccgccacc gcgatgtcca 2724541 gcgacaggta ttcgcggcgc aggtcggcct cgtccgggtc gagatgtacg gtgacaccgg 2724601 cgtgctgcag ggcggccagc aatcgaggca acgccgtttc ggcgatcgct gcgtcgacca 2724661 gcagcgtctc ggcggcgttg cagacgctgg gccgccgcgt cttggagttc agcaagatac 2724721 gctcggccac gtccaggtcg gccgcttggt gcacgtagac atggcagttc ccgacgccgg 2724781 tctcgatggt gggcacctgg gcatcgcgta cgaccgcctc gatcaggccc gctcccccgc 2724841 gtggaatcac cacatcgacc aggccgcggg cctgaatcag gtgagtgacg gtggcgcggt 2724901 cggcagccga cagcagctgg accgcgtcgg ccggcagctc caggccgacc agcgcggtgc 2724961 gtaacaccgc caccagggcc tcgttggact ttgcggccga cgagctgccg cgcagcaatg 2725021 cagcgttacc cgacttgagt gtcagcccga aggcatccac ggtgacattg gggcggccct 2725081 cgtagatcat gccgaccacg cccaggggga cgcgctgctg gcgcagctgc agcccgttgg 2725141 gcagggtata gccacgcagc acttcaccga ccggatcgcg cagtcccgcg acttgccgca 2725201 acccggcggc gataccgtcg actcgttgcg ggttcaagga caaccggtcc agcatggcgg 2725261 ccggggtgtc cgcctcgcgc gccgcgttca ggtcttcggc gttggccgcc aggatctggt 2725321 cgcggtgagc cagtagctcg tcggcagccg cgtgcagcgc gcggtctttg acagtcgtcg 2725381 gcagcgatgc cagccggcgg gcggccaccc gggcgcggcg tgcggcgtcg tgcacctctt 2725441 gacgcaagtc gagctgcgac ggtgctggca cggtcattgc cccagggtaa cgggcttgcg 2725501 ctggccaggt aagacgaccc gctccggacg ggccgcgcag cgatccggct gggtggttgc 2725561 tatgcgatca ggcgtacttg acggtcgccc ctgatcagct tgccgataat cccggcaaga 2725621 cgctggtagg acttctcgcg gccgccgaaa gagctaaaca ccaaaccgat tcgtcgcgcc 2725681 gggcaggggc gacgaatcgg gcgagttcca gccggcttcg cgtggtctcg acggcggccg 2725741 cggtctgcgg aatcagtgtc acccccagcc cgccggtcac gcactgcacg acggtggcca 2725801 gccacaccgc ccgggtgttg ggcagcatcg agcgtctggt cgcgtaggca gtgcccctca 2725861 tgcagtcaca acaaagtcag ctctgacagc gcggtcagcg gcacccgctg cttgccggaa 2725921 agacatgccc tgggggtgca ccgagaccgg cttccgacca ccgctcgccg caacgtcgac 2725981 tggctcatat cgagaatgct tgcggcactg ctgaaccact gctttgccgc caccgcggcg 2726041 aacgcgcgaa gcccggccac ggccggctag cacctcttgg cggcgatgcc gataaatatg 2726101 gtgtgatata tcacctttgc ctgacagcga cttcacggca cgatggaatg tcgcaaccaa 2726161 atgcattgtc cgctttgatg atgaggagag tcatgccact gctaaccatt ggcgatcaat 2726221 tccccgccta ccagctcacc gctctcatcg gcggtgacct gtccaaggtc gacgccaagc 2726281 agcccggcga ctacttcacc actatcacca gtgacgaaca cccaggcaag tggcgggtgg 2726341 tgttcttttg gccgaaagac ttcacgttcg tgtgccctac cgagatcgcg gcgttcagca 2726401 agctcaatga cgagttcgag gaccgcgacg cccagatcct gggggtttcg attgacagcg 2726461 aattcgcgca tttccagtgg cgtgcacagc acaacgacct caaaacgtta cccttcccga 2726521 tgctctccga catcaagcgc gaactcagcc aagccgcagg tgtcctcaac gccgacggtg 2726581 tggccgaccg cgtgaccttt atcgtcgacc ccaacaacga gatccagttc gtctcggcca 2726641 ccgccggttc ggtgggacgc aacgtcgatg aggtactgcg agtgctcgac gccctccagt 2726701 ccgacgagct gtgcgcatgc aactggcgca agggcgaccc gacgctagac gctggcgaac 2726761 tcctcaaggc ttcggcctaa ccgggatctg gttggccggg aatcaatgag tatagaaaag 2726821 ctcaaggccg cgctccccga gtacgccaaa gacatcaagc tgaacctgag ctcaatcacc 2726881 cgcagcagcg tgctcgacca ggaacaacta tggggaaccc tgctggccag cgccgcagcg 2726941 acacgaaatc cgcaggtatt agctgacatt ggcgctgaag cgaccgacca tctgtcggct 2727001 gcagcccgcc acgcagccct cggagccgcg gccatcatgg gcatgaataa cgtgttctac 2727061 cgtggccgcg gcttccttga aggccggtac gacgacctgc gccccggact gcggatgaac 2727121 atcatcgcca atccgggcat accgaaagcc aacttcgagc tctggtcctt cgcagtgtcc 2727181 gcgatcaacg ggtgctcgca ttgcctcgtc gcccacgagc acacgctgcg tacggtaggt 2727241 gtggaccgag aggcgatctt tgaagcgctg aaagccgcag caatcgtttc aggcgttgca 2727301 caagcgctgg ccacaatcga ggcactaagc ccaagctaag tgtctgtacg cgatgacgcc 2727361 gtgctgggtg acaccggtgc gaccaacacg gtactgtggg cgatcggcgg cggcgccttc 2727421 cacggagtca acttcgacaa cgcatccgac acccgaagcc tgtagtccct catcacctct 2727481 ccgtcctcgt cccagaagtc gtcatattct tggtcgaggt cggcgatctg cgcagtgaat 2727541 tgcccaagcg cgttgttgtc gatcagaatc tgcctctcag cacggttgtt gtagatctgc 2727601 gccaggggca ccatatcgtg atgtgcccat tcataggccc gcacgatctc gtggatctgc 2727661 ctctcgacct cagacagctg cacacagagg tcggtcagcc acctgacaaa cggcttggct 2727721 gcctccatca actgcatcac cactggaccc gcccaggcgt ccatcagaga cagcagcgtt 2727781 cggttgaacg acctctgcac ggccgtcatt tccacatcca acgacctcca cgccctggcg 2727841 gcagccaaca tcgagtcagg accggggccg gcatatatgt tggcggagtt gacctccggt 2727901 gggtacgctt cgaaatgcat ctgtttgttc gttgttccgt cggctcgtga cgactgtagt 2727961 gactcgttaa ctaaaggtct tgatgttgtc ggcctcggcg gtcgcatact tatcggcgcc 2728021 agtggtcaag gcgtgcgcaa actcctcaag aaccaccgcc gccgcagcga tggtctgccg 2728081 atacttcctt gcgtactcga ccaggaacgt cgccgccttc tccgacacca gatccgcagc 2728141 cggaggccgc acagcggttg tcatcggggc cacctgagca tcgctttgga tcgcacgatc 2728201 gcgaatccgt cgtacctccg tggccgccac ggtcaacgcc tcgggatttg tgatcacaaa 2728261 agacatgccg actccactcc gcgattaacg aacccccggc actgcaccgg gctgatcaac 2728321 caccggttgt ttgcgctgcg actgcccacg ttgagaaaac gcaacgactt cactggcata 2728381 attatccaac agacgaaggg atcaattccg ggtaagccgc tgcaaatcaa gccgactcaa 2728441 ggcacacaac gccacccgac gatacccccc atgctgacgt tcaaagcagt agggatgtag 2728501 cttaccatgc cccaatggcc gtcggaggcc agccaccagc gcacatgcct gcacggtgca 2728561 cctaatcggt gccggcttcc agcggccggc agttgacccg cgatgacaga cacgcccagg 2728621 ctcgttgagg ccgatcgcgt tctcacacag cgcattacgg ctcgtgacag atgagggagt 2728681 agagcgggtc ggcaagggaa acccatcatg gcgccgggct ccgccccggg ttgggccagt 2728741 gcgatggctg cgacgtcgtg gaaaagcgcg gtggggtgct cgggcatgac ggatgggcac 2728801 tcatcacatc ctcctgctcc acggcagtgt tcggctcgca ccgtcattgt gctgatggat 2728861 cagaacgtcc gctcgcacgc cggacaccgc gccgcgcacc gaggtagccg acgctagcca 2728921 ccaggaacgg gaccaaataa ttgatcacca tccgtaccca cgtgccgatc gtcgcggcgc 2728981 cctcggcaag ggtggcaccc tgattcaccg cacatagcac ggtacccacg atcagcgccg 2729041 tcggagctgc ggtgcgcagg gtgtggccgc gcagaaacag accgatcgct tggccaaccg 2729101 tgtcccaccg ctcgtcagcg tcgcgcaggc ccacgatgct cgccggccgc cgctggggat 2729161 tggtgcaagt cgcggcgaat tgcctgttgc gccttccttt gccgctcgtc gataacccga 2729221 cccagctcct gcagcagcat cggcttgttc atcacgacct gctcaaggtg ctcccggccg 2729281 atctgcagcg cggtcacctc ttccagcgct accgcaccgg cggggtcggg ttgccgagtc 2729341 agcgcagtca agcccagaaa cgtgcccttt ttgagggtgg caatcgcaac cacggatccg 2729401 tcatcggtcg taaccgtcag ccgcacgctg ccggcgatca cgaaggtgat acccatggga 2729461 accacacccg cgtgctgcac gatctcatca gtgccgtagc gcaccagcct cgcgtaccga 2729521 gccagcgact gctgatcgct agagctcagt cgcagctcgg gccccaccac cgtgcgcagg 2729581 gcggactcca cacgttcggc cgtcgagaac tcgtcgtcgg cctcgtcgag gtgtagccct 2729641 tcccgacgcg cggcgtacca gacccagcgc agaaacgttg cctgcgttgg gccttcgtcg 2729701 gccggtgatg tcagcctgac cgtcgttcga tactcggcgg ctcctcgggc aattgtggcg 2729761 ggcacgaccc cgggcttaac atgcggtagc gcgctggcag ccctgttcag catggcgcat 2729821 accttgtccg ggggatcgga cgtggaaaat gtggtcgtga tcgagcattc gtgcgccccg 2729881 gccggccggc tgagattggt aaacgcggtg gtggccaaca tcgagttggg catgatctgc 2729941 agtccgctgc cggtgtcgat atggacagcc cgccagttca cctcgacgac tcgtccgcgg 2730001 gctgtgggtg tttccaacca atcatcgatc cgaaagggct gttcgaacag catgaacaag 2730061 cccgacacga tctggccgac ggagttctgc agcatcaggc cgatgacgac tgacgtcaca 2730121 cctaacgcgg cgaacagtcc accgacccgc accccccaga tgtaggacag gatcaccgcc 2730181 aaacctatgc cgatcagcgc gaagcgcgcg acatcgacga agatggcggg tagccgcttg 2730241 cgccagctct gttggggcgc accctgaaac agggtggcat tcagtaacga cagcagcagc 2730301 accagcacca gaaatccgaa cgctgtcgtg agcacccgca cggtggggtc ctcggccggg 2730361 acttcagatg ccttgacaag cagcagcaaa accgcgccca ggggtagcag gtagtttcgc 2730421 agcagacttg cctgcctggc cagatggctg ttccgtcgga cgagtatgtt gtgcagttcg 2730481 gtgagaacga ttagcccggc cggcaatccg atcgcaatgc caacggccca gtagaaccat 2730541 gtcgagtcga gcaggttcat gatcgctccg acaatcggta gatcggctct tctaaccctc 2730601 cgacagaaat cgtgcccgca gccgtgaact gccacacgtc tcgcatcgcc tcatacacct 2730661 gcgaggtgac atagatgccg ggctgtggtg aaccgctgtg catttggtag gccagactca 2730721 ctgccgcgcc ccacatgtcg tagacgacac ttgatctgcc gaccagcccg ctaatgacgt 2730781 ccccggtgtt gataccgact cgcaggtgca gatcgttacc ggtttggcaa ttgaaccgat 2730841 cgacgatgcg ccgcatctct agggcgaagt cgacggttcg gggaatattg tccagccgtg 2730901 gcgtggttac cccgcaaccg gcgagatagc cattgtgcag cgtgcgaatg cgttcgacac 2730961 caaggtgttc ggcggccgaa tcgaactggc ggaccagctc gtcgacaatt ttgaccagtt 2731021 cgttacccga caggccgctg gaaatctcgt cgacacccag gatgtcggca aacaggacgg 2731081 tgacatcttg gtgctcctgc gcaatggtct gctccccaag gcggtaccgc tcgacaactg 2731141 gctcgggcat catcgatagc aataaccggt cgttttcctt gcgttgctcg ttgagcagct 2731201 cctctttggt ttgcagattc cgactcatct cgttgaaagc ggctgtaaga tcaccgattt 2731261 cgtcgcgtga ctttaccgga atgttgactt cgtagtcgcc tgcgctgatc ttctgggtgc 2731321 caacctcgag ccgccggatt ggccgcacca tcgcatgggc gatcagcatc gacgccacac 2731381 agatgacgac aatgatgcca actgtaacca gcacaagcgc cctgctgaac gacgcgacgg 2731441 ccgcgaacgc ctcagaatcg ttccgcgttg ccaggatcga ccagtgcaga tcggagtccg 2731501 gcacattcag cggcgcgtag gcctccagtt ccctgctacc cgtgtagtcg gtggaggtga 2731561 cggttccggt ctgtccgcgt tgggcggcgc gcagtccttc ggtcgcaaca ggctgcagca 2731621 gcgtcgtccc accgaactgg atcgctctgt tgaccacatc aagtgacgtg cctgctgcca 2731681 caacctgttt ccggtattcc tccgggtctt gcaggaagag ccgagaatcg gaccgcatca 2731741 gactgtccgg accggcgaga taggtttccg tcccactacc catgccagcc gcttgccatt 2731801 gcctgtcggc ggtcatgatc ttattgatct tgtcgatcgg caacggcagc gccaaaacgc 2731861 cctgagtttt gccgcccgct tcgaccggtg ccaccaacca cgcggtcggc acgccgagtt 2731921 gaggctgata cggcttgaag tcggtaatcc aggtaaagtc gacggcgttg gcgcccaacg 2731981 ctttaaggta ggcgtcacgc agattggatt cgcgatacgg cccggtcaga atgttggtac 2732041 cgaggtcggg gtccttgctc agggtataga cgatattgcc ccgggtgtcc agcaataccg 2732101 cgtcgtcgta atcgaaccgg gtgacgattt cccggaaata gctgttgaat tgcgcgttgg 2732161 cggccgacca tgcactgccg tcgccggcat cgtccagccg catcgcatct tggtccgacg 2732221 tgaatggtgc agtgtagtac gcctgaagat acctttgggc cggagaagtc ggcagcagcg 2732281 cggtgatgtc gagtttatcg ccggtcgtgc gttcgacggg tgtgatgaat tcgttgttgt 2732341 agtagttgac gatcgcctgt tgttgggcgg ggctgatcgt ggcgtcagcc agctggtcaa 2732401 agccggccgt gaaccgcacg acggcatcga caaccgtgag tccacgttcg taaatgacca 2732461 gcgaattcgt caggtcagaa aatagtgtct caactgcccg cttctgcgac tcgcgcaact 2732521 gggtcaaccg ctcgtaggcg gctgctctta gcgaagtgcg accagattga tagacaatgg 2732581 ccgcaatcgc cgcgacggac acgatactcg tcaacagcag cagcaccatg agcttggact 2732641 ggatgctggc ccggaaacgc ggccgacgcc ggagcacatt cttatggcgc ttcttagccg 2732701 gggtggattc actctcggct accgagtcca gtgcctcacc cgacgtcaac cggcttcccc 2732761 tctgctgtcg gtcgcaggct gctctacgac gcgccgatta cgcagcagac taacgtgccc 2732821 agcccacgat catcgcgttt gccgaaaagc caccagcgac caatcgagga atgcgccgcg 2732881 tacgggtcct cgtcatcgtc ctcgtcgcgc acgcgcttct tgaggggtgg tagcgattgg 2732941 ttctgggtgg tccgtaatcg aggtgagcca ggatagctcg gccgtatcgc aatgttcagc 2733001 gagtcgatgg atcaacacat gccccggcac cggcaaccgc tgcggagtgg tcaacgcatc 2733061 aaacgacgac gcactcagac catcgaccga cagcatcgag ccggtccagc atcccgacca 2733121 cccgtcccga tccgcaagca tgttcgaata ctgggacgcg ccaccgacag gacacccact 2733181 gatacccttg ggacaaaagt gacacaagtg atttcagcca acagcaagca tggcaaacgc 2733241 cagtgagact aacgtcggcc ccatggcgcc ccgggtgtgc gtggtaggca gcgtgaacat 2733301 ggacctgacg ttcgtggtgg acgcgcttcc gcgccccggc gagacggtgc ttgcggcgtc 2733361 gttgacccga acgccaggcg ggaagggcgc caaccaggcg gtggccgcag cgcgcgcagg 2733421 cgcgcaggta cagttctccg gtgcattcgg cgacgatcca gccgccgccc agctgcgggc 2733481 ccacctgcgc gccaacgccg ttggactgga caggaccgtc acggtgcccg gaccgagcgg 2733541 gacggcgatt atcgtggtcg atgccagcgc cgagaacacc gtgctggtgg cgccgggtgc 2733601 caatgcacat ctgactccgg taccctcggc cgtcgccaac tgcgatgtac tgttgaccca 2733661 gttggagatt cctgttgcaa ccgcgctggc agccgcgcgg gcagcccagt cggccgatgc 2733721 ggttgtcatg gtcaacgcct ccccagccgg ccaggatcga agctccttgc aggacttggc 2733781 cgctatcgcc gacgtggtga tcgccaacga gcatgaggca aacgactggc cgtcgccacc 2733841 aacacatttc gtgatcaccc tgggtgtgcg cggtgcccgg tacgtcggcg cggacggggt 2733901 gttcgaggta cccgccccaa cggtaacgcc agtggatacc gccggcgccg gcgacgtatt 2733961 tgccggggtc cttgctgcga attggccgcg caacccaggt tcgccggccg agcgactgcg 2734021 cgcattgcgg cgggcctgcg ctgcgggtgc gctggcaact ttggtgtccg gtgtcggcga 2734081 ctgcgcaccg gccgccgccg cgatcgatgc ggccctgcga gccaaccgcc acaacggttc 2734141 atgaccactg ctacgcaccg aaggagaccc gctgatgcga acgaccaccg cggccgactt 2734201 agcgctggca ctcttcgcgg ttttcagtgt ggtcggattc ggctgacgca gttggctgca 2734261 gcaccgacgc accggatcca ccggctttcg cggcgtcagc ggccgggtcg gttcgctgga 2734321 gtggattacc gggacgtgct ttgtcatcgc cctgatcgtg acggtggtcg ctgcggtgct 2734381 gcagcggacc aacgttgtcc aaccgctgaa tactctgcgc atggtctgga ttcaggttgc 2734441 cggcataatc ccggcgacgg ccgggatcgc ggccacggtt tacgcccagc ttgcgatggg 2734501 cgattcgtgg cggatcgggg tggacgagca ggagaacacc actctggtgc gcaccggccc 2734561 gtttaaatgg gtgcgtcacc ccatctacac ggccatgatg gcgtttggcc tcgggctgtt 2734621 gctggtgact ccgaatctcg ttgccctcgc cgggtttatc ctgctcgttg ccacgctcga 2734681 ggtgcatgtc cgccgcgtcg aagaacccta cctgttgcgg acgcacagtg ccgtctaccg 2734741 cggctacacc gccagcgtcg gccggttcgt cccgggtgtg gggttgatcc gctagccctt 2734801 gggcacctca cggtcgatct gatcgagcca gattcgcgct gacatatccg acggggcccg 2734861 ccaatcccca cgcggcgaca acgcgccccc gtgggacacc ttggggccgt tgggcaatgc 2734921 cgaacgcttg aactggctaa acgaataaaa ccgctggacg aaaatctgca gccaatgccg 2734981 gatttcggcc aatgaatagg acgggcgttc gctctttggg aagccgggcg gccagttgcc 2735041 ccgctccgca tcgttccacg catgccaggc caaaaacgca atcttcgacg ggcgaaatcc 2735101 gtagcgcagt acctgaaaaa gcgaaaagtc ctgtagggcg aaaggtccga ccttggcctc 2735161 gctgctctgc agctcctcct cgccggtcgg aatgagttcg ggggtgatct cggtgtcgag 2735221 caccgactgc aatacctcac ccaccttctc accgaactca cccgccgaaa tgacccaccg 2735281 gatcaggtgc tggatcagcg tcttgggcac accggcgttg acgttgtagt gcgacatctg 2735341 gtcgccgaca ccgtatgtcg accaacccag tgccagctcc gacaggtccc cggtgcccag 2735401 tacgattccc ccgcgctggt tggcgatacg gaaaagatag tcggtgcgca acccggcctg 2735461 gacgttctcg aaggtgacgt cgtacacttt ttcgccaacc gaatacggat ggccgattgt 2735521 gtgcagcatc aaccgagcgg tgtcgccgat atcgatttcg gagaaggtaa cccccagcgc 2735581 acgtgccagc ttgatcgcgt tgttcttagt gtgctccccg gtggcgaatc cgggcaacgc 2735641 aaacgccaga atgtcgctgc gcggccggcc ctcgcggtcc atggcatggg tcgcgacgat 2735701 cagcgcgtgc gtcgagtcca atcccccgga cacaccgata acgaccttcg gatagtccag 2735761 cgcccgcaac cgttgctcga gtccagacac ctggatgttg taggcctcgt agcaatcctg 2735821 ttgcaatcgt tgcggatcgg ccggaacgaa cgggaaccgc tcgacctcgc gcagcagtcc 2735881 gatgtcgcct gccggtgggt cgagtgcgaa gtcgatgcgc cggaacgatt ccgttaactc 2735941 ccggtggtga cgccggttgt cgtcgaacgt gcccatccgc agccgctccg accgaagcaa 2736001 ctcggtgtca acgtcggcga cactgcggcg cactcctttg gggaaacgtt cggactccgc 2736061 gagcagtgcg ccattctccc agatcatcgt ctgaccgtcc caggccaggt ccgtcgttga 2736121 ctccccctcc cccgcggcgg catagacata ggcagccaga caccgcgccg acgccgagcg 2736181 cgcaagcagc cggcggtcct cggcacggcc gatggtgatc gggctgccgg acagattcgc 2736241 cagcaccgtc gcgcccgcca gggccgcctc ggcgctgggc ggcatcggca caaacatgtc 2736301 ctcgcagatc tccacatgca acacaaagcc gggtagatct gacgcggcga acaacaggtc 2736361 cgtgccgaag gccacgtcgg cgccaccgat gcggatcgtg ccccgctccc cgtctccggg 2736421 cgccatctgg cgccgctcgt agaactcgcg ataggtgggt agatacgact tgggcaccac 2736481 gccgagcacg gcgccgcggt gaatgacgac cgcggtgttg tagatgcggt gtcgatgccg 2736541 cagcggagcc ccgaccacca gtacaggtaa caggtcggcg gattcggtca ccaggtcgag 2736601 cagcgcgtcc tcgacggcat cgagcagaga gtcctgcagt agtacgtcct cgatggagta 2736661 gcccgacagc gtcagctcag gaaagaccgc caacgctgcg ccatcgtcgt ggcacgcacg 2736721 ggccatgtcc aataccgacg cggcgttggc cgccgggtca ccgatggtgg tgtggtgagt 2736781 gcaggcggca acgcgcacga acccgtgctg gtaggcggag taaaagttca tcgtcctttc 2736841 attgtcgccc agcgacgtca gaacgcccga atcacccgcc gagtatccac gctcgacacc 2736901 gtggaatccc ccgcgctgct ggcagatggc ggcattgacc ggcgtgggga tgctaccgac 2736961 tgggccgctg ccgaccctgg gccctgattg gccgccgagc agtcccatga cgatccgcta 2737021 gttcacctcg gatacccgct cggccgcaat gcgcagctag cggccatgtt gatcgaaatc 2737081 atttggggta caccgcatct cggagcaata tggtagctaa acttgcttag cttgcttcgc 2737141 cgacaccgcg accagatcgt cggcgtgcac caccgggcgg cgcagctcgc cgggtagctc 2737201 agaggtggac cggcccacca tggtggccag ctcggacgcg tcgtaggcaa ccaccccgcg 2737261 ggctaccatg gccgcgtcgg gtgcacgcag ttcgaccaca tcgccgccgc aaaaccggcc 2737321 ggacaccgcg gtgatacccg ccgccagcag tgaccggcgt tgtcgcacca cagcgcgcac 2737381 cgcaccggcg tcgagagtca gtgcgccggt tgcttcggcg gcataacgca cccagaaccg 2737441 ccgggccgac agacgcgcgg gccgggccgc aaacaccgtg cccaccgacg cgtcggcgag 2737501 cgcggtcgcg gcgtcggccg cgggggccag cagtaccggc accccggcgt cggcggccaa 2737561 cagcgccgcc gccaccttgg acgccatgcc gccagtaccc aggtggctac tgcggccggc 2737621 gaccacaccg tccagatccg ccggcccgga cacctccgga atgaacgtcg cgtccgcggt 2737681 tttgcgcggg tcgcagtcgt agaggccgtc gatgtccgac agcagcacca aagcgtcggc 2737741 gccgaccagg tgcgccacca gtgcagacag ccgatcgttg tcaccgaacc ggatctcgtt 2737801 ggtggccacg gtgtcgttct cgttgacaat cgccaccgcg tgcaacgcgc gcagccgatc 2737861 cagcgtgcgt tgggcgttgg tgtgctgcac ccgcatcgaa atgtcgtgcg cggtcagcag 2737921 cacctggccc accgtgcggc cgtagcgggc gaacgccgcg ctccacgagt tcaccagcgc 2737981 gacctgcccg acgctggccg ccgcctgctt ggtcgccaga tctttgggac gacgggacag 2738041 cccgagcggc tcgatgccgg cggcgatggc gcccgaagac acgatgacga cgtcggaacc 2738101 cgccttcatc cgccgctcga ccgcctcggc cagtccggcc agccggccgg catcgaacat 2738161 cccggacggt gtggtaagcg ccgtggtccc gaccttcacg acaaggccgc gcgcggtccg 2738221 gattgcgtcc cgatgcggac ttctcatcag ccatccccgt gttcgcgacg ccgactccga 2738281 gcggcctttc gctcggccgc gcccacccgc ttgttgctgt ccagccgcgg atcggtgccc 2738341 cggccggaca tcgcgaccgg ctcacccgca ggcgtttgcg gctcccaatc gaacgtcatc 2738401 tcgccgatgg tcaccgcgca tcctgaccgc gcacccagcc tcagcaattc ctcctcgaca 2738461 cccaggcgcg ccagccggtc ggcgagatag ccgacggcct cgtcgttgtc gaagttggtc 2738521 tggtcaatcc aacgctcggg ccgggcaccg ctgacgacaa agccaccatg cccgtcgggt 2738581 tcgacggtaa aaccgctgtc gtccaccgga atcggacgaa tcaccggccg ccgtggcacc 2738641 gccaccggcc gcgcagcgtt gtagtccgag atcatctgcg acagcccaaa gatcaacggc 2738701 tgcaggtttt cccgggttgc ggtcgacacg cagaacaccg gccagccgcg ctgggcgatg 2738761 tcgtcacgga cgaactccgc gagctcgcgg gcctccggca catcgatttt gttgaggacc 2738821 accgcacgcg gccgtgcggc gagatcgccc agagccgcgt ccccttgcag cgtgggcgtg 2738881 tagcacgcga gttccgtttc cagcgcgtcg atgtccgaga tggggtcgcg gcccggctcg 2738941 gcggtagcgc aatccaccac atgcaccagt acagcgcagc gctcgatgtg ccgcagaaag 2739001 tccagcccca gaccacggcc ccgggatgcg cccgggatca accccggcac gtcggcgacg 2739061 gtgaacgcgt gctcgccagc cgagaccaca ccgaggttgg gcaccagggt ggtgaacggg 2739121 tagtcggcga tcttcggctt ggccgccgaa atcgccgaca ccagcgagga ttttccggcc 2739181 gacggaaacc cgaccaggcc gacgtcggcg acggtcttga gttccaaggt gaggtctcgg 2739241 gactgtccct tttcgccgag gagtgcgaaa ccgggggcct tacgcacgcg ggaagccagc 2739301 gcggcgttgc ccaaaccgcc acggcctccg gcggcggctt caaagcgggt gcccgcgccg 2739361 accaggtcgg ccagtagccg gccgttctcg tccaatacca cggtgccttc gggaactttc 2739421 acttccaaat ccgcgccggc ggccccgtcg cggttattgc ccatcccgtg cttgcccgaa 2739481 gccgcggtga gatgcgggcg gaaatggaag tcgagcaggg tgtgcacttg cggatcgacg 2739541 acgaagacga tgctgccgcc ccggccgcca tttccgccat cggggccgcc cagcggcttg 2739601 aatttctcgc gatggaccga agcgcagccg ttaccgcccg aacccgctct ggtgtggatg 2739661 acgacccgat cgacaaaccg aggcaccgag ctccccttca tctgcggagt gtgcagctac 2739721 tgcgggtttt gcccctcgtg aatcttcgca gtgggcgcac acgcgcgacg ctcaggcagt 2739781 ggtcgaaccg acgatgctca ccgtcttacg tccgcgtttg atgccgaact cgaccgcccc 2739841 ggccgtcttg gcgaacaagg tgtcatcgcc gccacgcccg acgttgacgc cgggatggaa 2739901 tttggtaccg cgctggcgga ccaggatctc gccggccttg acgacctggc cgccgtaccg 2739961 cttaaccccc agccgctggg cggcggaatc gcgaccgttg cgcgagctgg aagccccctt 2740021 cttgtgtgcc atgtctgtcg cctccgttat gcgatgccgg tgaccttcag gaccgtcagc 2740081 tgctgacggt gtccctgccg tttgtggtag ccagtcttgt tcttgaactt gtggatacgg 2740141 atcttggggc ccttggtgtg cccgagcacc tcaccggtca ccgcgacctt ggccagtgcc 2740201 ttcgcatcgg tggtgacggt ggcgccgtcg acaaccagag ccaccggcag ggacaccttc 2740261 tccccctgct cggattccag cttttcgacc ttgaccacat ctccgacagc gactttgtac 2740321 tgcttgccgc cggtcttgac gattgcgtag gtcgccatca ttgctcctgc ctcttcatac 2740381 ttccgctgca tgcgttgcgc ttcgcgcgcg ggccagcggc gggacgcgtg ctgggtcttg 2740441 ggcgggcacc tacaacggac cccgcatcgt ctccagccgt cagcctggcg acaactggtc 2740501 aagggtacgt gacctgcaac tacggggtca aaccagcggg gcctcagcga gatcgacgcc 2740561 agcacacgaa agtgcgccgg tagcgtcgat ctcgacgcta ccggcgcact ccggggcccg 2740621 ggtggtgacg tcatccgggt tggaccgctg atggctgcgg ctaacatcgt gccgaatcgc 2740681 gtccgatgtc gatctggagg aaccgccgat gaccgccccc ttggatcgtg cgccggtcac 2740741 ggatttgccg gctaacaaca aaggccgaga ccgcacccac tggctgtatc tcgcggtcat 2740801 tttcgcagtg atagccggtg tgatcgtggg gctgacggcg ccgtcgaccg gaaaaagcct 2740861 cacggtgctc gggacggtgt tcgtcaacct gatcaagatg atgatcgcac cggtcatctt 2740921 ctgcacgatc gtgctcggga tcggctcggt gcgcaaagcc gcggccgtgg gcaaggtcgg 2740981 cgggctggct ttggcctact ttctaacgat gtcatcggtg gcgctcggga tcgggttgat 2741041 cgtcggcaac ctactcagtc cgggtaggga tctgcacctt aggcctggtg cggtcggaag 2741101 cggcgcagca ttggccggcc aggctgcgga gtcacacgga atcgctgggt tcatccagca 2741161 gatcattccg aggtcgctcc cctcagccct tactgaaggc aacgtgctgc aggtgttact 2741221 cgtcgcgctg ctggtcggtt tcgcggtcca aggcctgggc cccgcaggcg agtccatcct 2741281 gcgtgccgtc gagaacctgc aaaagctggt gttcaaggtg ctcgtgatgg tactgtggct 2741341 ggctccgatc ggcgcgttcg gtgcgatcgc caatatcgtc gccacgactg gcttcaacgc 2741401 cgtcaccaac ctgctgctgc tgatggccgg cttctacctg acgtgcgtgg tgttcgtttt 2741461 cggcgtcctg ggagtgctac tgcgcatcgt gtcgggtttg tcgatctttc ggctgctgcg 2741521 ctatctagcc cgcgagtact tgctgatctt cgcaacatcg tcgtcggagg tggtgctgcc 2741581 cagactgatc accaagatga aacacttggg cgtgcaatcc agcacggtcg gcgtggtggt 2741641 gccgaccggc tactcgttca atcttgacgg caccgctatc tatctgacca tggcgtcgct 2741701 gttcatcgcc gacgcgatgg gacatcgctt gacatggggc gagcagatcg cgctgctggc 2741761 gttcatgatc atcgcgtcca agggcgctgc cggggtcagc ggtgcgggcc ttgcgacgct 2741821 ggccggcggc ctgcaggctc atcgccccga gctgctggac ggtgtcgggc tgattgtggg 2741881 gatcgaccgg ttcatgtcgg aagcccgttc gctcacgaac ttctccggca acgccgtcgc 2741941 aaccatcctg gttgcctcgt ggacaaagac cattgacctg tccaaagccg acgaggtgtt 2742001 gcgcggtcgt gatcccttcg acgaatcgac catggtcgat ccccacgatg aggagccacc 2742061 cgccgccaca ccccacgggg gcggcgtccc gacgaaccct gcgctgtgcg atttcgagca 2742121 ggtcagtcta ggcggattgg tgggccggcc ggccggcccg caacgcgccg acgtggacgg 2742181 gtaggggcca gctccgtgac accggggacg tcgacttcgc ccggggaacc gtccaagccg 2742241 gctgcatcct cctcgtcgac gtcggcatcg gcggcgtctt cgtcggagtc ttcgtcgtcg 2742301 gaatcggagt cttcaacgtc gagatcctcg tcgaggtcct cgtcgtcgag gtcctcgagg 2742361 tcctcgtcgg cgtcgagctc gtcctcgtct tcgtcggtgt cctcggtgtc ctcgaagtcc 2742421 gcttgggcgg tgtcgtcgag gtcagtgggc ggttgatcgc cggcctgctc ggcgagttcg 2742481 gcagcgggct ccccggattc ctcgtcaccg cgaccagcca gcgaggacaa gcccgctgcc 2742541 atcgccttga acatgggatg ctcaccggga gcgtgcacgg ggaccttggc gaccatgctc 2742601 ctatcactgg actcttcgga ccggctcttt ttcgatcgct tgccccgccg agcaccgggc 2742661 tcagactttc gcccagtcgc cgcggccgaa tcgaccgggt cggcgtgcag caggatcccg 2742721 cggccactgc agttcggaca cgatgtggag aacgcttcga tcagtccggt tcccaaccgc 2742781 ttgcgagtca actgcaccag ccccagcgac gtcacctcgg acacctggtg gcgggtgcga 2742841 tcgcgggcca gcgactcggt caaccggcgc aacaccaagt cgcggttgga ctccagcacc 2742901 atgtcgatga agtcgatgac cacgatgccg ccgatatcgc gcagccgcag ctggcgcacg 2742961 atctcctcgg ccgcttccag attgttcttg gtgaccgtct gctcgaggtt gcccccggct 2743021 ccggtgaatt taccggtgtt gacgtcaatg accgtcatgg cttcggtccg gtcgatcacc 2743081 agcgtcccgc ccgacggcaa ccacaccttg cggtccatcg ctttggccag ctgctcgtca 2743141 atgcggtgca ccgtgaagac gtccggcgcg gactggccat ccggcccgtc agcggactcg 2743201 tacttggtca acttcgaaac caattcggga gcaacagaat tcacgtattc attgatcgtg 2743261 ttccaagcct cgtcgccgga aacgatgagg ccgacgaagt cctcgttgaa caggtcacgg 2743321 ataaccttga ccagcacgtc cggttcttcg tacagcgcca ccgcagcgcc cgcggccttc 2743381 tccttggtct cttgtgcctt ggcctcgatc tgctcccagc gttcccgtag ccgagcgacg 2743441 tctgcgcgaa tgtcgtcctc tttgacgccc tcagacgcgg tacggatgat gaccccagcg 2743501 tcagacggca ccacctcgcg caggatctcc ttgagccgct gacgttcagt gtcgggcagc 2743561 ttgcggctga tcccggtcga cgacgcgccc ggcacataaa ccagaaatcg accggccagc 2743621 gacacctgcg tggtcagccg cgcgccctta tgccctaccg ggtccttgct gacctgcacc 2743681 acgacatagt cgccgggttt gagggcctgc tcgatcttgc gatcggcccc gcccaacccc 2743741 gctgcatccc aattgacttc accggcgtag agcactccat tgcgaccgcg cccgatgtcg 2743801 acgaacgccg cctccatcga cggcagcacg ttctgcacaa ttcccaggta gatgttgccc 2743861 accagggaag ccgaggccgc agacgtcacg aaatgctcca cgacgatacc gtcttcgagc 2743921 accgcaatct gggtgtaccg cgtgcccggc agcggtggct cggtgcggac ccggtcgcgc 2743981 accaccatca cccgctcgac cgcctcacgg cgagccagaa actcggcctc actcaacacc 2744041 ggtgggcggc gccggccggc gtcgcgcccg tcgcggcggc gttgccgctt ggcttccagg 2744101 cgggtcgagc cgtcgatgcc cttgatctca gtggagccag agccgccatc ctgcgagttg 2744161 ccggccttgt cacccgcgcg gggcacgcgt tcgtgtacga cagtgttggg cggatcgtca 2744221 ggcaacgggc cctctaacgc agcgtcgttg tcgtcaccag aagccgactt acgccgtcgc 2744281 cggcggcgcc ggcgacgatt gccggcctcc agcgaaccgt tttcgtcctc gccgttgtca 2744341 ccggcttcgg tatcttcgga atctcgatcg tcgccgtcgt cggtttcggc ggcgtccgcg 2744401 ctggtaaatt gttgggcccg gggctcggat tgctggtcaa ccggatcacc gtcggatcca 2744461 ccctgctccc cgcgtccgcg accgcgaccc cgacggccgc gacgtcgccg ccggttcgcc 2744521 ggccggtcta gctgcccttc gtcgtcagcg tcggaatcgt cggcgacgta gtcggggcca 2744581 tcgtcgacgt cctcgtcgtc cgctaacggc tcgggaatcg gctggggcgc gacgaacagc 2744641 ggcatatagt gcggccgctc cacgtcggca ttccgagtct cctgggtctc tagcatcagc 2744701 cgggactcgg gttcctcgga cgcttcgggc gcatggaccg aggccgccag cacgccggca 2744761 gtctcgagat gagtggccag cagatcgcgc acccggaccg catcgacgcg atccaccgtg 2744821 gaatgtgcgc tgcggacccg tccgtcgagc gcggtgagcg catccagcac ccgcctgctg 2744881 gtggttccca gcgttcgtgc cagcgaatgg actcttaggc ggtccggcag ttcctcatgc 2744941 tggctcggtt ctggtggatc tgaaggtggg gcaccgtcta tcacgtattc tcctcaagcc 2745001 cccgggcgcg tcttgatcga cgcggccacg cgagggcttc gctatctgcc cgggtcactt 2745061 gtctcccgag cttgtgatgg tcttgtcccg agcagctcat gacgaaccca ctcggcaccg 2745121 tgctgaatga cggcccgaca tgccgcgccg catcgaagga tggcgatggt cgcggttgcc 2745181 taagtcttca ttcgggcgtc cgacaccgct tcggcgacgt tcacccgtca tcagtatccc 2745241 acatcactgg gccgagtcac cttcccttga gctggggtgc tgcccaagcc gcccgggatc 2745301 ggcgcacccg gagctaggcg ccgggaaacc agagcgcgat ttcgcgctgc gcggattcgg 2745361 ccgaatcaga cccgtgcacc aggttgaact gcgtctctag agcgaagtcg ccccggattg 2745421 tgccgggcgc cgccgcctgc accgggtcgg tgccgccggc gagttggcga accgccgcga 2745481 tggctcgggt tccctccacg atcgccgcta ccaccggacc cgacgtgatg aactccagca 2745541 acgatccaaa gaatggtttg ccttcatgtt cggcgtagtg ctggctggcc aactccgcgc 2745601 tgacggtcct gagctgcagc gcagcgatgg tgaggccttt gcgctcgatg cggctgatga 2745661 tctcgccgat cagctgcctt tcgatgccat ccggcttgat cagtaccaga gtccgttcgg 2745721 tcacggtgcc caacactaga tgccgcaaga tgtatgccca aaccggtcat tgcgacaccc 2745781 ggtaatcccg acgccgccgc acctcggcac gcaaatacgc gatcaggacc cacaacgcgg 2745841 cgaacagcac gccaatgaaa cccacacccg ggtacacggc gaagccggca accagcaccg 2745901 gttgtgcgcc caggttcacc cagattgccc agggtctgcg ctgcagcccg gtcagcagta 2745961 tcaacagcac ggccagaccg accaaatagc ccagcgaggc cggacgcagc ccaccgccga 2746021 ccgcgtccac taccggtatt gccagcagca ccacgatcgc ctcgaggatc agcgtcgccg 2746081 ccatcaccgc gctgaatccc ttccacgggt cagccggctc acgcgaccgg tcggtcattg 2746141 cggatcacga ccgaacaagg tccgagccgc ccctgcggtg acaaccgagc cggtgatgac 2746201 gatcccggtt ctcgagaatg cgtccccggc cacatccggg tcggcggcgg cgtcgtcgac 2746261 cagtgaggtg gcaacgtcga tagcatcgcg caggttctcg gcggtgcgca cccggtcggg 2746321 tccgaaccgc tcgccggccg ccagcgccag ggcctcgaca tccagcgccc gcggcgaccc 2746381 gttgtgggtc acgacgacgg aatcgaacac cggctccagt gcggccagga tgccgtccac 2746441 gtccttgtcg cccagcacgc tgagcacccc gaccagaaat cggaagtcga actcatgcgc 2746501 cagcgtttgt gccagagcac tcgccccggc cggattgtgc gcggcgtcga tgaacaccgt 2746561 gggtgcgctg cgcatgcgct ccaaccggcc gggactggtg acggcggcaa agccggcccg 2746621 gacggcgtcg ccgtcgagct gacgctgcgc accggcaccg aaaaaggcct cgacggaagc 2746681 gagggcgagc accgcgttgt gcgcctggtg ttcaccgtgc agcggcaagt agatgtcgga 2746741 gtaaaccccg ccgaggccct gcagttgcag tacctgaccg ccgaccgcga tctgtcgccg 2746801 tagcaccgcg aattcggaat cctcccgggc caccgacgcg tcggcgcgca ccgattcggc 2746861 cagcagcacc tccatgacct tcgggacctg acgcccgatg accgcgacgg tgtccggcga 2746921 accgtcgggg gcccgagtga tgatgcccgc cttctccccg gcgatcccgg cgatatcggc 2746981 accgagatag tcgacgtgat caatgctgat cggggtgatg acggcgaccg gtgcgttgat 2747041 cacgttggtg gcgtcccaac gtccgcccat gcccacctcg accactgcca cgtcgacggg 2747101 cgcgtccgca aaggccgcga acgccatcgc ggtgagcacc tcgaacttgc tcatcgccgg 2747161 gccaccctta cccgcagaag cctgcgactg ctggtcgatc agcgccacca acggctcgat 2747221 ctcccggtag gtcgccacat actgcgccgg gctgatcggc ttgccgtcga tcgaaatgcg 2747281 ttccaccggt gactgcaggt gtgggctggt ggttcggccg gtgcgccggt gcagcgcggt 2747341 gaccagcgcg tcgaccatgc gcgccaccga ggtcttgccg ttggtgcccg cgatatggat 2747401 cgacggatag ctgcgttggg gcgagcccag caggtccatc aacgcgctga tccgggtcag 2747461 gctcggatcg atgcgggtct ccggccagcg ttggtcgagt agatgctcaa cctgcagcag 2747521 ggacgcgatc tcgtccggag tgggcacgac gccggtggcc gatcccgagt caggcgggcc 2747581 ggaattcgtc gaattcattg cagcgcagcc aaccgggtgg tgatgcgctc ggtttcctgc 2747641 tgcgccacgc gctggcggtc ccggatcttg gcaatgacgg cgtcgggcgc tttggccaga 2747701 aagtccgcgt tggccaactt ggcggcggtc gacgccagct ccttttgggc gccggccaac 2747761 tccttttcca ggcggcgacg ctcggcggcc acgtcgatgg tgcccgaggt gtcgagctcg 2747821 acgacgacgg tgcggttcat ctcggggccg agccgaacct ccaacgagac cgacggctca 2747881 aaatccgggc ccggctcggt gagccacgcc agcgaggtca cggcggccac ctggttgctc 2747941 agatccgagt cccgcacacc gtgcattcgg gccggaacct tctgccggtc ggccagacct 2748001 tgatcgctgc ggaaccgccg cacttcggtc accaacttct gcatatcgtt aatccgttgc 2748061 gcggcaacaa ggtccacgct aatcccggaa ggctccggcc agtcggcgct gaccagcgat 2748121 tccctgccgg tcagcgccag ccatagcgcc tcggtgagga agggaatcac cgggtgcagc 2748181 aggcgcagca gcgtgtccag cccggcggcc agcacggcgg tggtgtgtgt gagtccctgg 2748241 gcaagctgcg ttttggccag ttcgaggtac cagtcgcaga attcgtccca ggcgaagtga 2748301 tacagggact cacaagcgcg gctgaactcg tatccgtcga aggccgaatc aacttcggcc 2748361 cgaacctctt ccaaccttcc gagaatccag cggtcggcgt cggtcagctc gttcggcgat 2748421 ggcaggggtg ctggcgcggc gccattgagc agtgcgtacc gagtggcgtt gaacagcttg 2748481 gtcccgaaat tgcgcgacgc ccgcacggca tcctcgctca ccgccaagtc accaccggga 2748541 ctggccccgc gggccagcgt gaaccgcagc gcatcggccc cgaacatttc cacccaatcc 2748601 agcgggtcga tgacgttgcc cttggacttg ctcatcttgc ggccagactc gtcgcggatc 2748661 agcccatgca gaaacacgtc ggtgaacggc acctgcgggc cccggcggcc gtcgagggtg 2748721 atggcggcgt cgtcgccgac gaaggtgccg aacatcatca ttctggccac ccaaaagaac 2748781 aagatgtcat agccggtaac cagaacgctt gtcggataga acttttccag ctccgccgtc 2748841 ttgtccggcc aacccagcgt ggaaaacggc cacagcgccg acgaaaacca ggtatccagc 2748901 acgtcaggat cctgttccca gccctgcggg ggtgtttcgt ccgggccgac gcacacctgt 2748961 tcgccgtcgg gtccgtacca gatcgggatc cgatgccccc accagagctg tcgcgagatg 2749021 caccagtcgt gcatgtcgtc gacccaggag aaccagcggg gttccatgct ggccgggtga 2749081 atcacggtgt ccccgttgcg caccgcatcc ccggccgctt tggccagcga ttccacccgg 2749141 acccaccact gcagggatag ccgcggctcg atcggctcgc cgctgcgttc ggagtgtccg 2749201 acgctgtgca ggtagggtcg cttttcttcg accacgcggc cctgggccgc gagcgcttgg 2749261 cgcaccgcga cccgtgcctc gaagcggtcc atgccgtcga atcgcgttcc ggtgtcgacg 2749321 atccggccct tggtgtccag gatcgagggc atcggcagct ggtggcgcac cccgatttcg 2749381 aagtcgttgg ggtcgtgggc gggtgtgact ttgaccgcgc cggtgccgaa ttcagggtcc 2749441 acgtgctcgt cggcgacaat ggccagctcc cggtcgacga atgggtgcgc caggctggtg 2749501 ccgaccaggt gacggtagcg ctcgtcatcg ggatggacgg cgatcgcggt atcgcccagc 2749561 atcgtctcga cccgggtggt ggcgaccacg atgtggggtt gcgagtcgtc aagcgagccg 2749621 tacctaaacg acaccagctc gccttcgacg tcgcggtagt tgacctcgag gtcggagatc 2749681 gcggtctgca gcaccggcga ccagttgacc agccgctcgg cccgatagat cagcccggcg 2749741 tcataaagcc gcttgaagat cgtgcgcacc gcccgcgaca gaccttcgtc catggtgaac 2749801 cggtcgcggc tccagtccac cccgtcaccg agtcggcgca tctggccgcc gatggcaccg 2749861 ccagactctc gcttccaatc ccacaccttg tccacgaaca gctcgcggcc gaggtcttct 2749921 ttagtcttgc cgtcgaccgc cagctgctgc tcgaccacgc tctgggtggc gatcccggca 2749981 tggtcggtgc ccggctgcca gagcacctca tagccctgca tccgcttgcg ccgcgtcaag 2750041 gcgtccatca tggtgtgttc cagcgcgtgg cccatgtgca ggctgccggt cacgttcggc 2750101 ggcggcagca cgatcgaata ggccggcttg gtgctggtcg ggtccgcggt gaagtagcca 2750161 gcgtccagcc acttctgata gatggcgctc tccatcgcgg ccggatccca cgacttgggc 2750221 agcatatcgg cggcagggtg agggctggcg gtcaccgatc aattctagga accgcttcac 2750281 accggcatga aagcgcccga aaccgcccgg attcagctag ccagtcgcgt ggtctgcagc 2750341 gacacaccgg cggccggcaa acgctccagc agggcgtcac ccattgcggc cgcgggggtt 2750401 aacacaccac gcatgtcgga cagcttgtcg cgatccagtg ccagcgccag accacactcc 2750461 cccaacaaca ccgacgtcgc cttgtagccg gggtcaccat cttgggccat gcgcgccagg 2750521 taccgggctc cggtggttgt ggtggtgtag gtctcgatgc ggtagtagcc gcgctcgcga 2750581 gccgccgcac tggggccggt gccgggtttg gggacgacac gctttaccag tccccgcggc 2750641 agcaggcgga tgtagcggct ggccaagccg aacatcgcgt tgccgacacc gccgccgaca 2750701 accgatacca ccggcgccag caccgtggac cctacgctca tggtttcgct gtagcggaac 2750761 cgccggccgt aggcccagtc caggagcgcg ttgctgcggc gcacgatccg ggtgttggtg 2750821 ggcgccatga tgaatcccgc ggtccacaca ccggccagtt ccggcgcgag ccgacggcca 2750881 cgacgcgacg gcaggtcagg ctgtgggccc agttcgggtt cggcgccgcg gtctgggctc 2750941 agcatgtagg ggtcggatag ctggcggcgc gcatcgggat cgttagaagc ggtgctcaac 2751001 acctccagca tcgatgcgat ggtgccgccg gagaacccgc ctttgaagga acgcaccacg 2751061 cagttggtgt cggtcagctc gccggcgccg tcttctcgtg ccgcgtggta tagggcgtac 2751121 acgctcagat cagatgggac ggagtcgaat ccgcaggcgt gcacgatgcg tgcaccggtg 2751181 tcggcggcct gcttgtggta caagtcgatg ctgttgcgca tgaacatcgg ctcgccggtc 2751241 aggtcggcgt agtcggtgcc ggcggcagcg catgcggcca ccagcggcag cccgtagcgg 2751301 gtgtagggcc caacggtggt gaccacgacc tgggcgcggg cggccatggc ttgcagcgtc 2751361 gacggcaacg acgcgtcggc ggtcaggatc ggccaggtct gcgcggattc gcccagggct 2751421 tcgcgaacgg cgagcacccg ttgcgtcgac ctgccggcca gcgcgatccg ggcatctccc 2751481 ccggcccggg ccaggtattc ggcggtcagc ttgccgacga agccggtcgc cccgtacaac 2751541 acgatgtcga attcacgcgg cgtagcggtc acgggtttga cgctactccg gggtgcgcga 2751601 gcagacgcaa aagctcccaa atccgaccgg atttgggagc ttttgcgtct tttcgcggtg 2751661 gtcagccgcg gcggccgcag accggccagg cgcggatacc ctgcgaacgc agcacgttct 2751721 cagccacccg gatctgctcc tcccggctcg cgttggccgc ggaccccgag ccaccgttgg 2751781 cacgccaggt gccggcggtg aaccgcaggc cgccgtagta accgttaccg gtgttgatcg 2751841 accagtttcc accggactcg cactgcgcga tcgcgtccca gttcacgctg taggccacgg 2751901 gcacgggagg cgcttcctcc gcaggcgggg acaggaagtc cggggccagc ggcgggggga 2751961 ggttgggatc aaagcccgcg tcctccggag ccggcggagt atcgacgggt gcagcgtccg 2752021 gggccggcgg caggttcggg tcaaagccca cggcatccgg gccggctgcg gcgtttgggt 2752081 ccaagcccgc gtcgtcggca ttggcgatac cggctggtga cgtggtcacc aacgtcccgg 2752141 caatcgcggc ggcgatgagc gtcgtacggg cgttcttcaa cgttgttcct ttcgcggtgc 2752201 gcgcgcgcca aagccaaccc acgggcatgg gttagctgcc aggtgcattc gagggtgctg 2752261 cgtgggacgt gccgtctcgg tccggcacgg cagcggagcg cttgatctgc ccggcgcggc 2752321 tgctagccgc cgcctgcggg tcggccaacc gattcagccg tccccggctc cgctcgcacg 2752381 cgggtccgta gattcaattg ttgagatttc ttgctgcccg tctgccgggc caaggggacc 2752441 gtacgataac gatttggatt cgtcatctcc ggcaaaccga gatatcagtt caatcacaag 2752501 ccgatcacgg cgcggtggca caattgttgt tgcaggtcag aagtgcggtt ttggctcagc 2752561 tgtatctttg cgaccgcggc gctatcgtga gccgaatcac gcaaatattg tgaccccgga 2752621 cacggatttg tcaccatcgt ggccctggtc cgggatctga tccacacgcc gtggtgacct 2752681 gcgccacaac gacttgccca ccccgacgtc caccacacct cgaatcagct agactgctcc 2752741 caataatccg ccctaatact aagtgccgca ctgtgattca taggtaacct ggggcaccac 2752801 caaatagcag tctgccgtaa cagccggatc ctctaccgtc agcagactca aatgtcctcc 2752861 accccaacgc aatacgtgat caaccgcgca ccagagacgc caactgtagt caaggcagta 2752921 ctagaagcgg cggccatggc caatgttaat aacgtcttca ttgaaaacaa gacgagaata 2752981 tctcgaaagg ccaccagaaa attaatacgg aatagtatta gcgtccgggc tgcatcggtg 2753041 cgtgcaagcc tgcggccgaa ttgacgttgg tcagcggtcg ggaatccgcc atcacgatcc 2753101 gcagtgcatc cgaagcgtcg accagggcgc tcatctttcg ctcgccggca ccgaccaacg 2753161 tgtcgacgcg gtcggccaga tccgtccgat acaccgcggc caggtagtgg ttacggccat 2753221 cccagggcaa caccacttcg gcatcggtct gcaccgcgcg gcgcgcgaga tcctcgatca 2753281 attccactgt cagataaggc atgtcgaccg cacagacaaa cgcgagccgg acaccggcct 2753341 ccgcagccgc acgcaacccg cgaccggtcg ccggcagcgg ccccagcccc ggcagctcat 2753401 cacgcagaac ggggaccggc agcgtgggca acggttgtcc cggagcggcc atcacgaaaa 2753461 ccggcgcgca gcgctggccg agaatgccga ccatatgctc caccagcgtg gtggttcccc 2753521 cggggagggg cagggtggct ttgtcgcgac ccattcggcg ggattcacct cccgcgagaa 2753581 caaccccggc cagcggcact gtgtcgggcg cgagctcagc cacgtcagtc gacggtccaa 2753641 gtgtcgcgcc cgtgcaacag tgactgcagt gcagcggtgc cggacggggc ggcgtttcgg 2753701 gccgcgacta cctgcgagcg ggccgcatca tcgtaggttg gccggctgat gtgccgaaag 2753761 attcccagca cggtgtggtc caggttctga tcggacagcc gggacagcgc gaaggcgtag 2753821 gccgggtcgt cgacctgcgc atcgtgcaca atgatctcgt cgatggccac atcggccgtc 2753881 ttggccactt cgaggccgaa tccggacttg accacgcagt attcgccgtt ggccccgaag 2753941 acgatcggct cgccgtggcg gaccttgatg acccgctcct cggcgccctc cttgcgcagc 2754001 gcatcgaacg agccgtcgtt gaagatcggg cagtcctgca ggatttcgac cagggcagca 2754061 ccgcgatgct gggccgcggc acgcagcact tcggtcagcc cgttacggtc tgagtccagc 2754121 gcgcggccaa cgaacgtcgc ctctgccccc agcgccaacg acaccggatt gaacgggtga 2754181 tccagcgagc ccatcggtgt cgacttggtg accttgccga cctccgatgt cggcgaatac 2754241 tgtcctttgg tcagcccata gatccggttg ttgaacagca gaatcgtcac gttgatgttg 2754301 cggcgcagcg cgtggatcag gtggttaccg ccgatcgaca aggcgtcacc gtcgccggtg 2754361 accacccata ccgacagatc ctcgcgagcc agcgccagac cggtcgctat cgcgggcgcg 2754421 cggccgtgaa tcgaatgaaa gccgtaggtt tccaggtaat aggggaaccg gctggagcat 2754481 ccgataccgc tgatgaacac gatgttctca cgccgcagcc ccagttcggg caggaagttt 2754541 cggatggtgt tgaggatgac gtagtcgccg cagcccgggc accagcgcac ctcctggtca 2754601 ctggtgaaat ccttgccctt ctgcggctga tccgtggtgg gcaccccagc gttcttggtc 2754661 aagctcggag tcaggccgag ctcggtgccc gccaaatcac cggtcacgcc ggtcatgagc 2754721 tgtgcttcat cgccggagcg ggtcatccgt ttgctcccgc tcccgccgtg gccgccgaca 2754781 atctggcgac caacgtcttg tcttgctcaa gctcggccaa tctcccggca agtgcggccc 2754841 ggataaagcg cccaatctcg tcggccagga acgagacacc cttaaccttg gtgaccgatt 2754901 gcacgtcgac caggtactta ccgcgcagca cctgggccag ctggcccaag ttcaactccg 2754961 gagccaccac cttggggtaa cgccgcagca cctcacccaa attggccggg aacgggttga 2755021 gatagcgcag atgggcgtgc gctaccttgg tgcctcggcg acgcgcgcgc cggcacgctt 2755081 caccgattgg gccgtaggag ctgccccacc cgatcaacaa cagctcggcg tccccggtcg 2755141 gatcatcgac ttccagatcg ggaacatgga taccgtcgat cttggcttgg cgcaaccgga 2755201 ccatgaggtc atgattagtc ggctcgtagg agatgtcgcc cgagccattg gcagcttcca 2755261 gcccgccgat gcggtgttcc agaccggggg tgcccggaat ggcgaactgg cgggcaaggg 2755321 tttcccggtc acgggcataa ggctggaagg gctcgccggg tttggcgaag gtgtgcttaa 2755381 tgggcggtag cgcattgaca tccgggattc gccatggctc cgagccgttg gcgatggcgc 2755441 cgtcggacaa caagatcacc ggggtgtggt aggacaccgc gatgcgcacc gcctcaaggg 2755501 cggtttcaaa gcagtcggca ggagagcgcg gcgccagcac cgccaccggt gactcgccat 2755561 tgcggccgta gagcgcctgc agcaagtcgg cctgctcggt cttggtgggt agaccggtcg 2755621 acggcccgcc ccgctgcacg tctatgacca gcaacggcag ttcggtcatc acacccagtc 2755681 ccagcgcttc ggacttcagc gaaattcccg gtcccgatgt gctggtgact cccaacgcac 2755741 caccgtaggc ggcacccagc gcagcgcaga tgccgccgat ctcgtcttcg gcctggaagg 2755801 tgacgacatt gaagttcttg tgcttggaca gttcgtgcag gatgtccgac gccggagtaa 2755861 tcggataact gccgagcacg accggaaggc cggcgagctg accggccacc acgatcccgt 2755921 aggccagcgc ggtattgccc gagatctgcc ggtactcgcc gggcggcaaa gtcgcgggcg 2755981 gtatctcata ggtcgtgccg aaggcctcgg tggtttcgcc gtagttccag ccggccttga 2756041 gggccaacac gttggcctcg gcgatttcgg gcttgcgggc gaacttctcc ctgatgaagg 2756101 cctcgctgtg ctcgagctcg cgcccgtaca tccacgacag cagacccagc gcaaacatat 2756161 ttttggcgcg ctggccatcc ttcttggacg cgccgatcgc ctcgacggca cccagggtca 2756221 gtgtggtcat ggcgacggtg tgcaccacat agtcggacag ctcgccggac tccagcgggt 2756281 ttgtcacgta gcccactttc gtcaggttgc gcttggtgaa ctcgtcagag ttcacgatca 2756341 ccattccgcc aagcggtagg tcgccgatat tggccttcaa cgctgccggg ttcatggcga 2756401 cgagcacgtc gggacggtca ccggcggtca ggatgtcgta atcggctatc tgaatctgaa 2756461 aagacgacac tccgggcaac gtgcccgccg gtgcccggat ctctgcgggg tagttcggct 2756521 gggtcgccag atcgttgccg aaaagcgctg cctccgaggt gaatcggtcg ccggttagct 2756581 gcatgccgtc gccggagtct ccagcgaacc ggatcaccac attttccaag cgttgccgat 2756641 caggcgcggc atgaaatgcc gcgtcatgag actctggccc ggccccgctg ccgttcggat 2756701 ccacgtctcc gccttccatg tgttatcgga caggcactcc gcgctgcagc ttcaggttac 2756761 gcgtcgtcgg agcgacaccc ccgcgccgca cggcttgtgt cactggcggt agcgattatg 2756821 acatttcatt tcgggtgtaa ggcggtctcc gatgccatat atgcggccgg taaccgacca 2756881 aaaggcgaag tcagcgaggg ctggcggtag cgacgacaca gaactgtggt attggtcact 2756941 tccccccgag ggttggccgc gaccgcaccc ggacatccga atccacggtt tccggcatcg 2757001 cgaccaggta caggaggaag ccggccccgg cgagcgcacc cagcgacatg aatgccgcgt 2757061 catagcccgc gacgaccacg atccagccgg caacaagatt agacagcgcg gcaccaatgc 2757121 ccgttgccgt ggttaccgcc ccgaggctga tattgaaatg tcccgttccg tgtgtgacgt 2757181 cctgtacgac aaggggaaac aacgccccga aaatgccggc tccgataccg tcgagcaact 2757241 gcacgcccac cagccagtag gagttatccg acaacgtgta gaggaacccg cgagcggtca 2757301 agacagcgaa ccccaccaaa aagatcggct ttcgccccca cgcgtcggcc ctggtcccga 2757361 ccacatacgc caccggcacc atcacgacct gcgccgcgac gatgcacgac gacatcagcg 2757421 ccgttccttc gtctcgattg tgcaacgcca acagctcgcc gaccagcggc agcatcgccg 2757481 cgttggcgaa gtggaacgcg acaaccgccg ccccgaagat caccagttcg cggttgtgcg 2757541 ccaacacggt gaaccgcgac ggctgcggat gcggctcgcc gggcgcatgg tccataccac 2757601 gcgctaaatc gtggtcgacc gcgtccggcg ggatccgcag tgtcgccagc acgctgatca 2757661 acgccatgcc ggccagcacc cagaacacca ccaccggccc gaagaagtac gccagcgcgc 2757721 cggtcgcccc agccgccgac gcgttaccgg cgtggttgaa cgcttcgtta cgcccaatcc 2757781 gtctggcgaa aaactgagga ccgacagcac ccaacgtgat cgccgccaac gccggagcga 2757841 aaaccgagct ggcgatcccg gtgacggcct gcagcaccga gatggaatac aagcccgcaa 2757901 acagcggcat cgccactgcg gcggcggtga ccagcaccgc gccggcgacg accagcgccc 2757961 gcttggccgt ggtccggtcc accagggcgc caatcggcgt ctgggccacg atggccgcaa 2758021 tgccgccgac cgccatgacg aacccgatcg aggcttgatc ccaatcgtgg atcaacagga 2758081 ggtatatcga cagatagggg cccagaccgt cgcgaacatc agccaacgag aaattcagca 2758141 ggtccagcgc acgcgccacc cgtggcggca ctgccacaac ggtgcccgac atgcagtcgt 2758201 cgcggggcta cgcgctcttg tcgcggcgct ccgaacggga cggcttgcgt ggcacgattg 2758261 tcggcaacac gttgtcctgc acggtctcct tggtgaccac cactttggcg acatcgtcgc 2758321 ggctcgggat gtcgtacatc accggcagca ggacttcttc catgatcgcc cgcaggccgc 2758381 gggcaccggt gccgcgatgg atcgcctggt cggcgatcgc ttccagcgca tcgtcggtga 2758441 actccaactc cacgccatcc atctcgaaca gccggatgta ctgcttgacc aaagcgttct 2758501 tcggctcgga caggatcttg accaacgact ctttgtccag gttggtgacc gaggcgacca 2758561 ccggcaggcg gccgatgaat tccgggatca ggccgaactt gatcagatcc tccggcatca 2758621 cgtcggcaaa gtggtcggtg gtgtcgatct cggccttgga acgaacctcg gcgccaaagc 2758681 cgaggccccg cttgccgacg cgctcgtaaa tgatcttctc cagcccggcg aacgctcccg 2758741 cgacgatgaa cagcacgttg gtggtgtcga tctggatgaa ctcttgatgc gggtgcttac 2758801 ggcccccctg cggcggaacc gacgcctgag tgccctccag gattttcagc aaggcctgct 2758861 gaacgccctc accggagacg tcgcgagtaa tcgacgggtt ctcactcttg cgggcgatct 2758921 tgtcgacctc gtcgatgtag atgatgccgg tctcggcgcg tttgacgtcg tagtcggcgg 2758981 cctgaataag tttgagcaag atgttctcga cgtcctcgcc gacgtaaccg gcctcggtca 2759041 gcgcggtggc gtcggcgatg gcaaacggca cgttaagcat cttggccagc gtctgggcca 2759101 ggtaggtctt gccacaaccg gtgggtccga gcatcaagat gttcgacttg gtcaactcaa 2759161 cgggctcaca tcgggagtca cggcccttct ccccggcctg gatccgcttg tagtggttgt 2759221 acaccgccac ggccagcgtg cgtttggcgg tatcttgccc gatgacgtag ccctcgagga 2759281 actcccggat ctcggccggc ttgggcagct cgtcgagttt cacatcgtcg gcgtcggcga 2759341 gttcctcttc gatgatctcg ttacacaggt cgatgcactc atcgcagatg tacacgccgg 2759401 ggccggcaat gagcttcttg acctgttttt ggctcttccc gcagaacgag cacttcagca 2759461 ggtcaccacc gtctcctatg cgcgccataa tgctgatggc ctacttcctg atcgccgttc 2759521 gtgttgccgt gccccgtgta tgccccgacg ctacccgctt gctccggccc cccgcgaccg 2759581 ttagcaccga atagcgtcct agagatttca gggtgttcac gcctctcgtc tgaatgaaac 2759641 atatagcccg actgcgcccc actcgccgag acgcgcgatc cgtgtctctg gcgtgtcgcg 2759701 gtcgtaaccc caccgaggcc cgcgcgtcgc ggacccggca gcggcccgac cgccagctga 2759761 ccaccctaca gtggcgttgt ggaattggtc agcgattccg tgctgatcag cgatggcggc 2759821 ctggccaccg agcttgaggc gcgcggtcac gacctgtccg acccgttgtg gtcggcgcgg 2759881 ctgctggtgg acgctccgca cgcgatcacc gcggtgcata ccgcgtactt tcgcgctggg 2759941 gcccagattg ccacgactgc cagctaccag gcctcgttcg agggcttcgc ggcgcgcggc 2760001 ataggtcatg acgacgccac cgtgctgctg cgccgcagcg tcgaactcgc ccaggctgcg 2760061 cgcgacgagg tcggcgttgg cggtctatcg gtcgcagcct cggtcgggcc atacggcgcc 2760121 gcgctggctg acggatccga ataccgcgga tactacggcc tgtccgtcgc agccttgatg 2760181 aagtggcatc tgccacggct cgaggtgcta gtcgatgccg gcgctgacat gctcgccctg 2760241 gaaaccatcc ccgatatcga cgaagccgaa gcgctggtca acctggtgcg gcggttggct 2760301 acgccggcct ggctcagcta cacgatcaac gggacgcgga ctcgcgccgg gcaaccgctc 2760361 accgacgcgt ttgcggtggc cgcaggagtt cccgagatcg tcgccgtcgg cgtcaactgc 2760421 tgcgcacccg acgacgtgtt gccggccatc gctttcgccg tcgcccacac aggcaaaccg 2760481 gtgatcgtgt acccgaacag cggtgagggt tgggatggtc ggcgccgcgc ctgggtaggt 2760541 ccgcggcggt tttccggatc ttccgggcag cttgcgcggg aatgggttgc ggcgggcgcg 2760601 cgcatcgtgg gcggatgctg ccgagtacgg ccgatcgata ttgccgaaat cgggcgagcg 2760661 ctgaccaccg cgccgccccg aggctgaaag cgaaaattgc ctctactgcc tcatcgaggc 2760721 gttacctagg gttagttctt gtgaccgcga agcccggcta cgcatgagta agaaccgcat 2760781 tatgggcaac caaccggaga agtcagatgt gactgcggca cccgacaccg tggagggcga 2760841 ttcccacact gcaatgacac cgcgccagcg gctgaccgtg ttggcaacgg ggctgggcat 2760901 cttcatggtg ttcgtggacg tcaacatcgt caatgtcgca ttgcccagca tccaaaaggt 2760961 gtttcacacg ggcgaacaag gtctgcagtg ggcggtcgcc gggtacagcc tgggcatggc 2761021 ggccgtgctg atgagttgcg ccctgctggg cgatcgctac ggtcgcaggc gcagttttgt 2761081 gttcggggtc acgctcttcg tcgtgagctc tattgtctgt gtgctaccgg tcagcctggc 2761141 agttttcacg gtcgcacgag tgatccaagg tttaggagcg gcgttcatct cagtgctctc 2761201 gctggccttg ctaagccact cctttcccaa tccccgaatg aaagcacggg cgatatccaa 2761261 ctggatggcc ataggcatgg tcggtgcggc atctgccccc gcgctgggcg ggctcatggt 2761321 cgacggcctc ggttggcgca gcgtgttcct ggtgaacgtt ccgctcggtg ccatcgtgtg 2761381 gctgctgacg ctagtcggtg tcgacgagtc acaggatccc gagcccactc aactcgactg 2761441 ggtgggacag ctgacgctta tcccggccgt cgccctgatc gcatacacca tcatcgaggc 2761501 tccccggttc gaccggcagt ccgccgggtt cgtggcggcg ttgctgttag cggctggggt 2761561 actgctgtgg ctgtttgttc gacacgaaca ccgcgccgct ttcccgttgg tcgatctcaa 2761621 actgttcgcc gagccgttgt accgatcggt gctgatcgtc tacttcgtgg tgatgtcctg 2761681 ctttttcggg actctgatgg tgatcaccca gcacttccaa aatgtgcgcg acctatcgcc 2761741 gctgcacgcg ggtttgatga tgttgccggt ccccgcggga ttcggggtgg cgagtctgct 2761801 ggcgggtagg gcggtcaaca aatggggtcc tcagctcccg gtgctgacgt gcctggcggc 2761861 catgttcatc gggttggcga ttttcgcgat ctcgatggac cacgcgcatc cagtggccct 2761921 tgttggcctg acgatctttg gcgcgggagc cggcggctgc gccacaccgc tgttgcatct 2761981 tggaatgacc aaggtcgatg atggccgtgc cggcatggcc gccgggatgc tcaatctgca 2762041 gcggtcgctg ggcggcattt tcggcgtcgc cttcctgggc accattgtcg cggcctggtt 2762101 gggtgccgcg ctgccgaaca ccatggccga cgaaattccc gatcccatcg ctcgcgcgat 2762161 cgttgtcgac gtcatcgtgg acagcgcgaa tccgcatgcc cacgcggcat ttatcgggcc 2762221 aggacaccgg ataactgcgg cgcaggagga tgagatcgta ctggccgccg acgcggtctt 2762281 cgtgagcgga atcaagctcg cgttgggcgg cgccgccgta ttgctgaccg gcgcgttcgt 2762341 ccttggttgg acgcgcttcc cccggacccc cgccagctaa gtggtctcgc tcggtgcgcc 2762401 cccacagtcc ctgcgccgag atcgacgtta gcgtcacgcc ttatggtgat tttccgctct 2762461 ggcgtggatc tcggcgcatg tcgggtggcg accaccaagc cacgccacgg ccgcaccacc 2762521 cgcccatggc tcaggcggtt tgcgcggaga gcttccggta ctcgagcacc gtgtcgatga 2762581 tgccgtagtc cttagcctct tccgcggtca agatcttgtc ccggtcagtg tctttgcgga 2762641 tcactccggc gtccttgccg gtgtggcggg ccagcgtggt ttccatcagg gtgcgcatcc 2762701 gctcgatctc ggcggcctgg atctccagat cggagaactg tccctggatc acgcccgaca 2762761 acgacggctg atggatcaac acccgcgcat tcggcagcgc catgcgcttg cccggtgttc 2762821 cggcggccag cagcaccgca gccgccgagg cggcctggcc cagacacacc gtctggatat 2762881 cggcccgcac gtattgcatg gtgtcgtaga tcgccatcag cgaggtgaac ccaccgcccg 2762941 gcgagttgat gtacatggtg atatcgcggt cgggatccaa cgactccaac accagcaact 2763001 gtgccatgat gtcgttcgcc gacgcgtcgt cgacctggac gccgaggaag atgatgcgtt 2763061 cctcgaacag cttgttgtat ggattggact ccttgacccc gaagctggag tgctcgatga 2763121 acgacggcag gatgtagcgc gcctggggct ggatctgaga attttgggaa ttcactgtgc 2763181 ttctccattg acgtgggcgc gggtgatgat gtgatcgacg aaaccgtatt ccagggcttc 2763241 ggcggcggtg aaccagcggt cgcgatcgga atccgcctca atgcgctcga tcggctggcc 2763301 ggtgaattcg gcgttgagcc ggaacatttc tttcttgatc acggcgaact gctcggcctg 2763361 gatggcgata tcggccgcgc tgccggtcac cccgcccaac ggctggtgca tcaggatgcg 2763421 agcatgcggc agcgcgtagc gcttgccctt ggtacctgcc gccagcagga actcgcccat 2763481 cgaggcggcc atgcccatcg cgtaggtggc gatgtcacag ggcgccagca ccatggtgtc 2763541 gtagatcgcc atgccggcgc tgatcgatcc acccggcgaa ttgatgtaga ggctgatgtc 2763601 cttgctggcg tcttcggcgg ccagcagcag aatctgagcg cataaccggt tggcgatctc 2763661 gtcgttcacc tccgagccca ggaagatgat gcgctcggag agcaagcgct cgtagaccga 2763721 atccgtgagg ctaagaccct gcgagttcga acgcatgtca gtcacttggc tcacagtggg 2763781 gcacctgctt tcctcgagtt cttctatgct ccgacactaa ccaaccaggc tggctgtttc 2763841 gcggtcacgc accccctgaa accggcgcgt tcgcttacag cgtcatacgg tcacgttgtc 2763901 gcttcgtcgg acgccgcccg cgcggcaccc tcgtctgccg gttcggcctc ctcagcctca 2763961 ccggccgaca cacgcttgcc gaagaactca ctggtatcga tcgtgtttcc gtcactgtcg 2764021 gtgaccgtcg ccgcctccac tgcggccctg atcgccagct cgcgccgcac gtcagcgaac 2764081 atggtcggca gctggttgcg ctcttggagg tagccgaaca gctgctgcgg ctcgatgccg 2764141 tattgccgag acgtcgtcac cagtcgttcg gtcagatcat cctggccaac ttggacctgc 2764201 agctcatcgg ccagggcgtc tagcaacagc tgcctcttga cgtccttttc tgaggcggtg 2764261 cgcgcctcgg catcgaacgc cgcgcgtgac gagccttgct cgacgagcaa ctcattgaac 2764321 cgggcttcgt cgtgattaag accgctgagc gcgctgtgca gcacgctgtc gaattgggcc 2764381 tgcacatacg actccggcaa cggcacgtcg acctgttcga gtagcgcatc gatggtggcg 2764441 tttcgaatct gctcggcctg ctgggcgcgc ttggcctggc gcacctggtc gctgaggctg 2764501 gcccgcaatt cgtcgatgct gtcgaactcg ctggctaact gcgcgaattc gtcgtcgggc 2764561 tctggtagtt cgcgctcctt aaccgacctg accgtgacgg taacctgagc ttcctgcccg 2764621 gcgtgctcgc cggctgccag cttggcggtg aagacccggg actcgtcggc ggacagacca 2764681 acaaccgcgt cgtcgagacc tgcgatgagc cggccggagc cgacctcgtg ggagagtccc 2764741 tcagcggctg cgttcggtat gtcctctccg tcgaccgtgg cagacaagtc gatcgagacg 2764801 acgtcgccga cggccaccgg ccggtccacc gcggtcaggg tgccgaaccg ggtacgtaac 2764861 gactgcagtt cggcgtcgac gtcgtcctca ccgatttcga tcggatccac cgagaccgtc 2764921 agcgcgctca ggtccggggg actgatcttc gggcggatgt cgacctcggc ggtgaattgc 2764981 aggtcctggc cgtactcctt cttggtcacc tcgatgttgg gccggccgag cggttggaca 2765041 tccgactcgg ccaccgcctg tccgtaccgg ctgggcagcg catcgttgac gatttgatcc 2765101 agcatggcct cccggccgat gcgggcttcg agtagtttgg ccggcgcctt cccgggccgg 2765161 aagccgggca gccgcacctg tttggccagc tctttgtagg cccgctggaa atccggctca 2765221 agctcggcga atggcacctc cacgttgata cgaacccggg tggggctcaa ctgctcgacg 2765281 gtgctcttca cgggtgtgct ccttggtagt cgataacggc ggtcggctgg tcggggtgac 2765341 aggatttgaa cctgcggcct tccgctccca aagcggatgc gctaccaagc tgcgctacac 2765401 cccgcgctga cctcgcgatc ctacggcccg gcgacaccgg caccgcaatg acctcttgag 2765461 acctcacggg aaggtctcaa aacgactccg attagatttg atgtctgtca ccacgtacag 2765521 tcgcgctcga ctaaatacat gcgggcgtag ctcaatggta gagccctagt cttccaaact 2765581 agcgacgcgg gttcgattcc cgtcgcccgc tcgggccatg cgtttgttcg gcagaaaggc 2765641 gccatgcgcg acccatgaat cagcctgaca tcaagggctc gtgcgcgtcg gagttcacca 2765701 aggtacgcga cgcgttcgag cgcaactttg tgctgcgcaa cgaggtcggc gcggccgtcg 2765761 cggtgtgggt cgacggggat cttgtcgtca acctgtgggg cggctccgcc gacgccggcg 2765821 gtacccggcc ctggcagcac gacacgctgg ccaccgtgct gtccggtacc aaggcactaa 2765881 cggccacgtg tgtgcatcag ctcgtcgatc gcggtgagct tgacctgcat gcgccggtgg 2765941 cacgctactg gcccgagttc ggacaggcgg gtaagcaggc catcacgctg gcgatggtga 2766001 tgagccaccg ctccggggcg atcgggccgc gcggacggct gggctgggag caggtcgccg 2766061 attgggattt tgtctgcgag caactggccg ccgccgaacc gtggtggcag ccgggtgccg 2766121 cgcagggcta ccacatgacc accttcggtt tcatcctcgg cgaagtgttc cgccgcgtca 2766181 caggccgtac ggtcggtcaa tacctgcgta ccgagatcgc tgagccgctg ggtgcggacg 2766241 tccacattgg cttgcatccc ggcgaacagc tccgctgcgc cgatctagtt gataagccgc 2766301 acatccgcca attgctggcc gacgtccaag cccccggcta ccccaccagc ctaaacgaac 2766361 atcccaaggc tgcattgtcg gtgtcgatgg gcttcgcccc cgacgacgaa ctcggctcca 2766421 acgacctgca gctgtggcgt cagatcgaat tccccggcac caacggccag gtgtctgcgc 2766481 tggggctggc gacgttctac aacgggcttg cccaggagaa gctgctcagc cgcgagcaca 2766541 tggagctggt ccgggtctca cagggcggct tcgacaccga tctggtgctc ggcccgaggg 2766601 tcgccgacca tggctggggt ctgggctaca tgctcaacca gcgcggcgtc aatggaccca 2766661 acccacggat tttcgggcat ggtggcctcg gcggctcgtt tgggttcgtc gacctcgagc 2766721 accggatcgg ctacgcctac gtgatgaacc gcttcgacgc caccaaggcc aacgcggatc 2766781 cgcgcagcgt cgtcctgtcc aacgaggtct acgccgcgct cggggtaaac cgttcctaga 2766841 cggctagcca ccaggcggtc aggtctgaca gaccgggcac cagaaaacat tgcggccctc 2766901 gagcagtgcc gtgcggatca ctcccccaca cacccgacac ggctcgccgg ctcggcggta 2766961 cacataggtg cggggccggt cgggcagata tgacggcaga ccatggtcat gttcggggcg 2767021 caccacgatg atcttgccgc ggcgcaagcc caccttcatc aacgacacca gatcgttcca 2767081 ggccgcgtcg aattccggct caccgatccc gcggccgggc cgctgtgggt cgatccggtg 2767141 ccgaaaaagc aactcattac ggtagacgtt gccaacaccg gcgatcaccg tttggtccat 2767201 caagagcgcg cctatgggcc tgcgagactt ggtgatccga gaccatgccg acgacgggtt 2767261 ggcgtcgcta cgcaacgggt cgggtcccag cctggcaacc acgtccgcaa cctcgccgtc 2767321 gtcgatcgac tcacacaccg tcgggccgcg caagtcggtg ccgaattctg ccccgaccat 2767381 ccgcatccgc acctgccccg cgggttcggg tagccaccca tctgtggggc gtgcccattc 2767441 ggtgaaggtg ccatagagcc cgagatgcac gtgcaccacg gggccgccga cgtagtgatg 2767501 gaacaggtgt ttgccccagg cactggcccg ccgcaacacc cgaccgttga gcgcggaagc 2767561 cgaatcggcg aaccggccct gggggctgga caccgagacc ggcgcaccgg cgaaccggcg 2767621 ctggtgcagc cgggccagcc gatgcagcgt atgcccctca ggcacgggag tcaggccgga 2767681 gcgccgggca ccggcggcgc ttcgtgggtc cgttcgtact cggcgagaat gtcgatacgc 2767741 cgttggtggc gttgcgcttt cgaccacggc gtggtgacga aggcgtcgac tatcgccagt 2767801 gcctcggcca ccgtgtgcat gcggccgccg atgccgatca attgggcgtt gttgtgctcg 2767861 cgagccagcg ccgcggtctg cacactccag gccagcgcgc agcgagcgcc gggcaccttg 2767921 ttggcggcga tctgctcccc gttgcccgat ccgcccagca cgatgcccag gctgcccgga 2767981 tcggcgacag tgcgcgtcgc tgcggcaatg cagaatgccg ggtagtcgtc gtcggcgtcg 2768041 tagcgcaacg cgccgcagtc gatcggctcg tggccggttt gcttcaggtg ctcgatgatc 2768101 cgctgcttga gctcatatcc ggcgtggtcg gcccccaggt agacgcgcat gcccgacatt 2768161 gtgcccgaca cactgccggg cgccggcgcg ggcgcccgcc gatagtgaat tcggcgacaa 2768221 gaacccgggc gtgttccggc gccgaattca ctatcggcgg ctagtcgaac tgaggcggct 2768281 cggtgcgggt ccgcttgagc tcaaaaaagt gcgggtagga agcgaaggta accgaggcat 2768341 cccagagctt gccggcttcc tcgccgcgcg gaatcttcga gagcaccggc ccgaagaacg 2768401 ccacaccatt gacatggatc gtcggcgtac cgacgtcctc gcccaccgcg tccatcccgg 2768461 cgtggtggct tttgcgcagg gcgttgtcgt aagcgtcgct ggtagcggcc ttggccaact 2768521 ccgcgggcag accggcgtcc gccagcgact gggtgatgac ctcgtcgagt tcgtggttgc 2768581 cctggttgtg aatccggttg cccatcgcgg tgtacagcgg gtccaggact ttcgccccat 2768641 gggcctgctc ggcggcgatc gccacccgta ccggtcccca tgccctcgcc atgccttcgc 2768701 ggtattgctc gggcaggtcg tcacggtttt cgttgagtat tgccaggctc atgacgtgga 2768761 agttcacctc gatgtcgcgg acctttgcca cctcgaggat ccagcgcgac gtgatccagc 2768821 accacgggca cagcggatcg aaccagaaat cggcgacaga cttctggggg gccttctcga 2768881 gcatggcgcg gtcctctcgt tggagtcagc agcggtgagt acaccgccca gcacaaccac 2768941 ggccgccccg cacctgttcc cgccgacccg gttaagttgg acgccgtggc ccttccaaac 2769001 ctcacgcggg accaagccgt cgaacgcgcc gccctgataa ccgtggacag ctaccagatc 2769061 attctcgatg tgaccgacgg taacggcgct cccggcgaac gcaccttccg gtcgaccacc 2769121 accgtggtgt tcgacgcact ccccggcgcc gacacggtca tcgacatctc cgcccacacc 2769181 gtgcgccgcg ccagcctcaa cgaccaagac ctggacgtct cgggatatga cgaggcggcc 2769241 gggatcccgt tgcgcggact ggcccagcgc aacgtcgtcg tcgtcgacgc cgactgccac 2769301 tactccaata ccggcgaggg cctgcatcgg tttgtcgatc cggtggacgg cgagacctac 2769361 ctgtactcgc aattcgaaac cgccgacgcc aagcgcatgt tcgcctgctt cgaccaaccc 2769421 gacctcaagg ccacgtttga cgtgcgggtg accgcgcccg cgcactggaa ggtgatctcc 2769481 aacggcgcgc cgctggccgc ggcaaacggc gtacacacct tcgccactac cccgcggatg 2769541 agcacctatc tggtggcctt gatcgccgga ccatacgcgg cctggacgga cacttacatc 2769601 gacgaccacg gggaaatccc actcggcatc tattgccggg cctcgcttgc cgaatacatg 2769661 gacgccgagc ggctgttcac ccaaaccaag cagggattcg gcttctacca caagcacttt 2769721 ggcctgccat acgcgttcgg caagtacgac cagctcttcg tccccgaatt caacgccggc 2769781 gcaatggaaa acgccggcgc ggtgaccttc ttggaggact acgtcttccg cagcaaggtc 2769841 acccgggcat cctatgagcg gcgcgcggag accgtgctgc acgagatggc ccacatgtgg 2769901 ttcggcgacc tggtcaccat gacctggtgg gacgatctgt ggctgaacga gtccttcgcc 2769961 accttcgcct cggtgctgtg ccaaagcgag gccaccgaat tcaccgaggc ttggacgacg 2770021 tttgcgaccg tggagaagtc ttgggcgtat cgccaagacc agctgccgtc gacgcacccg 2770081 atcgccgccg acatccccga cctggccgct gtcgaggtga acttcgacgg gatcacctac 2770141 gccaagggcg cctcggtgct caaacagctc gttgcctacg tcgggctgga gcgctttctg 2770201 gccggcctgc gtgactactt ccgcacgcac gcttttggca atgccagctt tgacgatctg 2770261 ctggccgcgt tggaaaaggc ctcgggccgc gacctgtcga attggggcga gcagtggctg 2770321 aagacgaccg ggctcaacac cctgcgacca gatttcgagg ttgatgccga gggcaggttc 2770381 acccggttcg cggtgacaca gagcggtgcg gcacccggcg caggtgagac cagggtgcat 2770441 cggttggcgg tgggcatcta cgacgatgat ggttccaaga gttccggcaa gctggtccgg 2770501 gtgcaccgcg aggaactcga tgtctccggt ccgatcacga acgtccctgc gctggttggc 2770561 gtttcgcgcg ggaaactgat tctggtcaac gacgacgacc tgacctactg ttcgctgcgg 2770621 ctggacgagc ggtcgctaca gaccgcgcta gaccgcatcg ccgacatcgc cgagccgctg 2770681 ccgcgcacgc tggtgtggtc ggccgcctgg gaaatgaccc gtgaagccga actgcgtgcc 2770741 cgcgacttcg tgtcactggt gtccggcggc gtgcacgcag aaacggaggt cggggtcgcg 2770801 cagcggctgc tgctacaggc gcagacagcg ttgggttgct atgccgagcc cggctgggcc 2770861 cgggagcggg gatggccgca gttcgccgac cggctgctgg agttggcgcg cgaagccgag 2770921 cctgggtcgg atcatcagct ggcctatatc aactcgctgt gttcgtcggt gttgtccccc 2770981 cggcatgtgc agaccctagg ggcgttgctc gagggtgagc ccgccgcatg tggattggca 2771041 ggcttagccg tcgacaccga cctgcgctgg cggatcgtaa ccgcgctggc caccgcgggc 2771101 gccatcgacg ccgacgggcc ggagacaccg agaatcgacg ccgaggtgca gcgcgacccg 2771161 actgccgccg gaaagcggca tgccgcccag gcccgcgcgg cgcggccaca gttcgtcgtc 2771221 aaggacgagg cattcaccac ggtggtcgag gacgacaccc tggccaacgc cactggccgc 2771281 gcgatgatcg ccggcattgc cgcacccgga caaggcgagc tgctcaagcc gttcgcgcga 2771341 cgctactttc aggcgatccc cggagtatgg gcacggcgat ccagcgaagt cgcgcaatcg 2771401 gtggtgattg gcctgtatcc gcactgggac atcagcgagc agggcatcac cgccgccgag 2771461 gagttcctca gcgaccccga ggttccgccc gcattgcgcc ggctggtgct cgagggccag 2771521 gccgcggtgc agcgatcgtt gcgggcccgc aacttcgacg ctgacggcta gccctcaccg 2771581 cgagggcgcg tgtctgtaca acgacacgcc gcatcgggcg tacattcggg cgtgctcgcc 2771641 gggtcagccc ggcgcgatcc ccgcgctgag cacgcggatc gcgctgatca gcccatctac 2771701 cagctcaccc tgctcgaacg ctgaggaagc ggcggcaacc ccgagcggag ccgccgactc 2771761 ggcaccgcgg ccgcggactt gcgagccgta gaccacttcg atggcgcact ggttgggcga 2771821 gaccgcgagc agcacagcat tgtccggcgt gggcaccttg cccaagatct cgcgggcccg 2771881 cgcggcggtg tcacgaccca agtcgccgag gtagatggcg aacctcacct gacacgcccg 2771941 cgagctgtag gtcagcgcgt cgtccagggc gacgagatct gcgatgggga acgggtagtg 2772001 cacggacagt tccccgggct cggtgacccc cgagatccgt ccgctggtgg tcagcaccca 2772061 acccggcggc agctcggcgt gctcaatcgt cgcaacgtca ccacgtgcca ctggccccac 2772121 ctccaaccgt gaactccgat gcgtcatgcc cgtgtccgcc gtgcgcgctg ccaacgacct 2772181 cgtcggtggc ggcccacagg atgggcgggt gtgtccaagg ctccgacagt ttgtaggttg 2772241 ccgggtgagg tcccttgcgc gaccagatca gcacagacag cacaaccacc agcaacaacg 2772301 ggataccgac aaagaagagg tggatctcca tagcactcac gacgcaaacc gtatcccacc 2772361 gggttttcag gccgcaccct caccgaggta tcgcgcccag gacgggtcca gctccttgac 2772421 cgccgacagc agtcgccagt gcggtcccgt gggcggcagc ggcgcccggc ggagtgccca 2772481 gccaagctcg gtcaacagcc tgtcaccctt gcggtggtta cacggcgagc agcacgcaac 2772541 gcagttctcc caggagtggg caccgccccg gctgcggggt accacgtggt cgacggtgtc 2772601 ggccttgccg ccgcagtagg cacaacagaa ccggtcccga tgcatgagcg cggcccgggt 2772661 catcggaacc cgggcacggt agggaacccg gacataggag cgcaactgga tcaccgacgg 2772721 gaccaggatc gatctggtcg ccgagtggat gaccggcccg gacgggtctt cgtgcaccac 2772781 gtcggccttg ccacagatca ccatgacaat cgcccgccgc atcgacaacg cggtaagcgg 2772841 ctcgtaggtg gagttcagga gcagcacccg ccggcggttc cagatcgatg cgctctcgtg 2772901 acggttcggt ggatgggtct cgacgcctga cgcgagtcgg tgggagtgga cactgtgcag 2772961 gcatgaagcg ggcccggtta cgcctgccgc gacaccggaa ctgcggtggc cgcggcgctt 2773021 cttgccgtgc gccataggtc ctccgccgaa cagtccacca tgattcgcgg ctaatcgcac 2773081 gccaaatgcc acgtccacac cgtgtcgctc cggtgaacaa accgggggct ggctggtcgg 2773141 ccacgacaaa tagaccacaa tggaggggat ggatcagatg ccgaagtctt tctacgacgc 2773201 ggtcggcggc gccaaaacct tcgacgcgat cgtgtcgcgt ttctatgcgc aggtcgccga 2773261 ggacgaagta ctgcggcggg tgtaccccga agatgactta gccggcgccg aggaacgatt 2773321 gcggatgttc ctcgagcagt actggggcgg cccacgaacc tactcggagc agcgcggcca 2773381 cccccgattg cggatgcggc atgccccgtt tcggatctcg ctcatcgaac gcgacgcctg 2773441 gctgcggtgc atgcatacgg ctgtggcctc catcgactca gaaacgctcg atgacgagca 2773501 ccgtcgagag ttgctggatt atctggagat ggccgctcac tcgctggtca actccccgtt 2773561 ttgatggacc aacaccagcg accggatcca atgggccccg gctctcctcg cgccagcgct 2773621 cgtcgaccgg agccagatcc gatgggcgag ccgtggtggt cgcgagccgt gttctaccag 2773681 gtctatcccc gatcgttcgc cgacagcaac ggcgacgggg tgggcgacct ggacgggttg 2773741 gcgagccggc ttgaccacct gcaacagctc ggtgtcgacg cgatctggat caacccggtc 2773801 accgtctcgc cgatggcaga ccacggatac gacgtcgccg atccccgcga catcgaccca 2773861 ctcttcggcg ggatgccggc gttcgaacgg ttggtcgctg cggcacaccg gcagggcatc 2773921 aaagtcacca tggacgtggt gcccaaccac accagttcgg cgcacccatg gtttcaggcc 2773981 gcgctggctg acctcccggg tagcccggcg cgggatcgct atttctttcg cgacgggcgg 2774041 ggccccgacg ggtcgctgcc gccgaacaac tgggagtcgg tgttcggcgg gccggcctgg 2774101 acccgagtgc gcgaaccgga cggcaacccg ggccagtggt acctgcacct tttcgacacc 2774161 gaacagccgg acctgaactg ggacaacccg gaaatccttg acgacttcga gaaaacactg 2774221 cgcttctggc tggaccgcgg cgtggatggc ttccgcatcg acgtggcgca cggcatggcc 2774281 aagcccccgg gcctgccgga ctcaccggac ctgggcatcg aggtgctgca ccaccgcgat 2774341 gacgacccgc gcttcaacca cccgaatgtg cacgcgattc accgcgacat ccgcacggtg 2774401 atcgacgagt accccggagc ggtaaccgtc ggcgaggtgt gggtacacga caacgcccgc 2774461 tgggcggagt atctgcggcc cgacgaactg catctcggct tcaatttccg gctggcgcga 2774521 accgagttcg acgccgccga gatccgcgac gcggtggcga actccctggc cgccgcggcg 2774581 ctgcagaacg cgaccccaac ctggacgctg gccaatcacg atgtgggacg ggaggttagc 2774641 cgctacggcg gcggcgagat cgggctgcgc cgggccaagg cgatggcggt ggtgatgctc 2774701 gccctgccgg gcgtggtctt cctctacaac ggccaggaac tgggtttgcc cgacgtggac 2774761 ctgcccgacg aggtgctgca ggatccgacg tgggaacgct cgggacgcac cgaacgcggt 2774821 cgcgatggct gccgggtgcc gattccctgg tcgggcaaca ttcccccgtt cgggttctcg 2774881 acgtgtccag acacctggtt gccgatgccg ccggaatggg cggcgctgac cgccgaaaaa 2774941 caacgcgctg atgccggctc gaccttgtcg ttttttcgac ttgcactcag attacgtagg 2775001 gaacgaaatg aattcgacgg cgacgtcgac tggctggccg cgcccgacga tgcgctgata 2775061 ttccggcgtc acggcggggg tttggtgtgc gcgctcaacg ccgctgagcg tccgctggcg 2775121 ctgccggcag gtgaacccat cctggccagc gcaccgttga ccgacgccac gttgccaccc 2775181 aatgccgcgg cctggctggt gtagcggcat tccgagctat gcctgcccga catataagcg 2775241 catacgcatc ctaggcgggc accgtctagg tatgatgatg cggatcgccg tgcggctacc 2775301 cggggaagtc atcaccttcg tcgatagcga ggtcagccaa atccgcatac ccagccggcg 2775361 cgccgcagtg gtgttgcgtg cctcgaacgc gagcgacgcc gcgattctta ccgccaccga 2775421 acccaatcac cacctcgacg cactcgccgg acaggccgca aagctagcac caacatcgat 2775481 tgatgcggct catccagctc gcccagctag acgagacccg tgcctttacc cgcgaactgg 2775541 ccaggcctta cctcgcaccg ggtaaccgtg gcacccacct cgagcagcgt agccagcgaa 2775601 ctgctcatgc cctggccgag cgctgccgct agcggtgtgg tcggctggcg caccaccgcg 2775661 accgccagtc agcgatacca tcggccgatg tcggatactc cgttcgccga gccctatccc 2775721 gagcagcggc ccccctgggg tgtcccgcca ccaggttggg acggatcgtc gcggccagcg 2775781 ccctcgacga ctcctcgatc gcccgggcgg tggtctctag tggcggccct agcccttgcg 2775841 gtcgtctcat taggcgtggg catcgtcgga tggtttcatc ggcaaccgca cgacaagcca 2775901 tcaccggccc catccgcgcc gacgttcacc agccaacaga tttccgacgc gaaagaaaac 2775961 gtctgcgccg cacaccggat cgtgcgccag gcggccgtgc tgaataccaa tcaggccaac 2776021 ccggtacccg gagacccgac cggcgatttg gcggtggcag ccaacgcccg cctggcgctg 2776081 tatagcggcg gcgactacct gctgaggcgt ctcaccgccg agccagcgac tcctgccgag 2776141 ttgcgcgatg ccgtccgctc gctcgccaac gctctacaag agcttgcagt gaactatctc 2776201 gctggagctc ccgattccgt ggtaactccc ctgcggctgg cgctggaaag ggacaccaga 2776261 gccgtggatc cgctatgcgt gtgacggcga tccggaaatg aaccatcctc gcccatcagc 2776321 gcagcaccag cgccgcgtgg ccgcgatgcc gataaaccga gccgaagcgg gcgtccagac 2776381 gcaaccacgc cggcgatatc cggacccgga tcagctcgtc ggcgctgatc gtttctgcgg 2776441 actggggaag gaaacccatt gcggtcaaag cgaacacaca gcgcatgggt agcccaacga 2776501 cgacatcggc cgagctgacc tggatgacct cctgatcgag cagcgatacc ggcggaccag 2776561 ccgaactccc gtgctccttg gccagccgcg cgccacggtg cgccaggtcc aacatcaccc 2776621 gggccggtac gtcgtcgaga taggtgaagc cggactccgg cggcaaccca ccccgccacg 2776681 cggagtccat cgagtaaccg ggatcgacat agcccgaggc atccgttgtg gccagaccgt 2776741 gcgcgagtga ccgtgcggcc accgacagat cgtcgggtcg caccttgccg gccaccaccc 2776801 gactggccag cacgtcgaag cccgttgcta cccaagccga tagcaatccg gtagaccgcg 2776861 cgcgaatacg gataacggcg gcatcgtcga gccgaagcgc gtgatccacg aacgtggcca 2776921 gatccgcgcg gtgagccggg tcagggagcc acaacccacg ctcaaccacc ccgcctatcc 2776981 ccgaaaccac cgttgcaggt actcgcgatg gtgtggcgat agtcgaacca accgctgttc 2777041 ctcgatatgg aacgcggcca gctgcgactc ggcgatgacc gcaggcctcg agtctggctc 2777101 cgcgttgacc gaccgcacct cgtacccgag cgtgaagtcg accgcccgca gccgcttggt 2777161 ccagatcgtc acctgtagcg gcgagtcgga caaccgcagt tgacccttgt aggtcacccg 2777221 gacatcggcg atcagcagcc cggtggacgt gatgtcggct ccgaaagcat ccttaagaaa 2777281 cgggacccgt gcctcttcga gaatcgtgac catggtggcg tggttgacgt gctgatacat 2777341 gtcgatgtca gaccagcgca cccccaccgg cgtgacgaac ccgacgctca ccccgagatt 2777401 cctcgcccgc ttgtccgtgt catgcggcgg atctgccgcg cggcgaccga caacgtcgcc 2777461 agatccttct ggccgctggc acggatgtcg tcgagtgtcc gacgtgcccg cgccacccgg 2777521 gaggcgctga ggtgttccca ctcggcgatc ttttgctcgc tactctcgcc gggttccccc 2777581 acggccagca cgtcgaaaca caacgaccgt agcgcaccgt aaatatcgtc gcgaatcgcc 2777641 aagcgcgcca acgaatgcca gcggtcgtgt cggggcagct gggataccgc ggtcagcagg 2777701 ccatcggtgc ccagccggtc catcagggcg aaataggtgt cagcgacctc ggcggcgtcg 2777761 atgtcggcga tgtcggcgat gtcgatgatg tcgagcaggc tgtaccggta caggccggtc 2777821 gagacacggt aggccaagtc ttcaggcaca ccctgcgatg cgaattccgc agctgtcttt 2777881 tcgacgatgg ccttgtcatc accacgcaac cactccgaca tgcgcggtgt cagtgccttg 2777941 accatggccg cgaatcggtt gatctcggcg ccgacggcca agggctgcgg acggtagttg 2778001 agcagccagc gtccggcacg gtcgatcagc cgacgggtgt ccagcgtcaa cctgtctgac 2778061 agcgcgattg gcaggttcgc cgcacggatc cggcgccaaa tgtgaccgac accgaagatg 2778121 gcatcggtgg cgacataggt gcgcacggca tcgatcggcg tgacaccaac gtcttcggcg 2778181 atccggaacg cataggtgat gccggcggta tccaccagat cgttgatcag catggtggtg 2778241 acgatctcgc ggcgcagctg gtgggaacgg atctccgggg tgaaccgttc gcgcagcgcc 2778301 gtcgggaaat aacgaggcaa cctggaagcg aagacatcct gatccggtag ttcggtggct 2778361 agcacctcct ctttgagccc cagcttgacg tgcgccatca gcgtggcgag ttcgggcgag 2778421 gtgagcccga tgccggcctc ggagcgccgg gcaatctcct tctccgacgg cagcgcttcc 2778481 aattcgcggt tgaccccgcg ctcagccacc aaatacttga tctgcattgc gtgcaccggc 2778541 agcaggctgg ccgcgttggc gcgactggtg cccatcaagt cgttctgatc ttcgttgtcg 2778601 gcgagcacca gttgcgctac ctcgtcggtc attgactcga gcagctgtgt gcgttcgtcg 2778661 gctttgaccg tgccggcgct caccagcgag tcgatcagga tcttgatgtt gacctcgtgg 2778721 tccgagcagt ccacgccggc ggagttgtcc agcgcgtcgg tgttgatccg gccgccggac 2778781 agatcgaatt cgacacggcc caacgccgtc actccgagat tgccaccttc gccaatgacc 2778841 ttggcgcgca cttgattcgc gttgactcgc accggatcgt tggcgcgatc gccgacatca 2778901 gcatccgact ctgactcggc cttgatgtaa gtgccgatgc cgccgttgaa cagcaggtcc 2778961 accggcgccc gcagaatcgc ccgaataagg ttgggcgggg ccatctcggc ggcccccccg 2779021 tcaactgagc cgtcgatgcc gaggacggcg cggacctgcg cgctgagcgg gatggctttc 2779081 tgttcgcggc tgtacacccc gccgccctcg ctgatcagag acctgtcata gtcgctccag 2779141 ctggaccggg gcaactcgaa catccgccgg cgttcggccc acgacaccgc ggcatcgggg 2779201 ttggggtcga ggaagatgtg gcggtggtcg aaggcggcga tcagccggat gtgcttgctc 2779261 agcaacatgc cgttgccgaa tacgtcgccg ctcatgtcgc cgattcccac gacggtgaaa 2779321 tcctgggtct gggtgtcgat cccgatctct cggaaatgcc gttttacggc ctcccaggcc 2779381 ccccgggcgg tgatgcccat ggccttgtgg tcgtagccca ccgatccgcc cgaggcgaac 2779441 gcgtcgccca gccagaaccc ataggacttg gcgacatcgt tggcgatatc ggaaaaggtg 2779501 gcagtacctt tgtcggcggc cactaccaag taggcgtcgt cgccgtcacg tcgcaccacc 2779561 tcgggcgggg ggttgacgct tgcggtcgca tgatcgacgt tgtcggtgac atcgagcaac 2779621 ccggagatga acagctgata gcaggcgacc ccttcggcgc gggtggcgtc gcggtcggcg 2779681 gcggggtcgc cggtgggcag cgggggacgc ttgaccacga acccgccctt ggccccgacc 2779741 ggcacgatga cggcgttctt caccgcttgc gccttgacca atccgagaat ctcggttcgg 2779801 aaatcgtcac ggcggtccga ccagcgcaac ccgccacgcg caactgggcc gaacctcaga 2779861 tgcacgcctt cgacgcgggg cgaatacaca aaaatctcgt accggggacg cggcagcgga 2779921 agttcgtcga tcaactgggc attgagtttc agcgccaata catcacggca gcgggccgaa 2779981 ccctggcgtg tcacaaagta attggtgcgc aacgtggcct gaaccaacga cgcgaaggcg 2780041 cgcaggatcc ggtcggtgtc caggctcacc agcgcgtcga tgtccgcggc gacagcggca 2780101 gcggccgctt gggcatcgcg attgctcgcc gaccccgacg gcaccggaac gaaaagcgct 2780161 tcgaacagat cgaccaaaga ccgaacggta gcagggtgct cgttgagcac cgattcaatg 2780221 taggactggc tgtacgggaa gcccgcctgg cgcaggtact tcgcgtaggc acggagcagc 2780281 acgacctgct gccaagtcag cccggcacgc atcaccagct cgttgaatcg gtcgatttcg 2780341 acccggccgt gccagatcgc ggtcaccgcc tcggcgaatc ggtgcgcggt cgcggcccgc 2780401 tcggcaaccg tcggggccaa cgggatcgtg ggatgcggcg agatcttgaa ctgatagatc 2780461 cagaccggca gaccgtccgg ccgggtgacg gagaacggtc gctcttcgag caccacgact 2780521 cccatgcttt gcagcatcgg cagcagctgg ctcagcgaag cggtgcgccc accgaggaac 2780581 caggtcaact gggcgacacc ctgctcgtcg cgttcggaaa acaccagctt gaccgaatcg 2780641 tcggtcagct ccgtgatgac cgcaatgtcg ccaatggcat cggccggggt gacggcctgt 2780701 ttgtaggcct cggagaaggc ggcagcgtaa tgcatagcgt cggcctgtcc gacggagcca 2780761 gccgccgccg ccgcgccgat caaacggtcg gcccaggttc gcgcggcttc ggtcagcaga 2780821 ccctggatcc ggatccggtt ggcttcggaa acgtccaccg gcggggcggc cgccccttct 2780881 cctgccacac ccacttcggg tagccgcacc atgaaatgca tgagtgccca aggtgattca 2780941 ctgacccgag cggtgaactc cagtcgtgtt cccccgaact cgcggacaag gatgtcctcg 2781001 aattgcatgc gcacggcggt ggtgtagcga tctcggggca tgtagaccag gcacgacacg 2781061 aagtactgca accgatccgc gcgcaggaac aacaacgcct gccgttgcga tcccaagtcc 2781121 accacggccc tggccatggt cagcaggcgc tgcgcgctca gggtgaacag ctccggtcgc 2781181 gggacggtct ggatgacgtc gagcagcaat tggcctgggt ggctgggatc gctttcggcc 2781241 atcgccagcg cctcgcggac ccggcgcgag atcgtcggga tctccagcac gtccgcattc 2781301 atggccgcga cgctgaagag cccgacgaag cggtgctcga ccacgctgcc gtcgacgtat 2781361 tcgcggaccg cgatggcata gggataggcg ccgtaacgca ggtagctgcc gacccgcgct 2781421 tgggccaaca ccagcagttt gtcgtcgtcg gtcagccggg gacgcgaacc ggtgcggccc 2781481 cgcaggacgc ccataccgct tgacccctcg ccgtagacca tcccgtcagc cacccggcac 2781541 cgttggtagc ccagcagcag gaagttcccg tcacccagcc aacgcaacag ttccccgacg 2781601 tcttgtcggt cgggcgcgga aaatcggccg ccggcattgg attcgacttc tcccgccagc 2781661 tcgctcaggg tggcgatcag cgctgtggcg tcggtggcca cccgctggac gtcggccagc 2781721 accttgggca gcaaccgctc cacctcggcg aggcctttgt gatcaacggc gggcgagagc 2781781 gctacgtgca tccaggcctc acccaggtgc ggcgacgtgc cctcggcctt cggttcgatg 2781841 cgcagcagct ctcccgtggg gctgcggtgc acgtcgaaca ccggggtcag aatcgccgcg 2781901 taggcgattc caagccggtg cagcagcacc gtaacggaat ccatcagcat gccgccgtgc 2781961 tcggcgacca cctgcagcgc cggaccgaac cccgcgggat cgtccgcccg atagacggcg 2782021 acacagcttt caccggccgc gcggtgccgg ccaagccgat aatgtgcgcc cagcatggcg 2782081 ggcgtcagca gggaggctgg aagccaactg gcctcggcgg ccttggtggc ttccgacgag 2782141 tcgtcgcgcg gtcctcgata gctgtcgatg taggccttcg agatccagtc aggaatgtcc 2782201 gcactcgcgg tgaacgtggt ccacgcctca acatcctgct tagccccggg atcgatcgtc 2782261 atgccgattg ctcccaactc acgacgggta ccgctcgatt caattttccc gctcctgggt 2782321 gcggcgttcc ggacgcatcg tcacggggcg tgggcgaagc taacattagc cgcgcgtcag 2782381 cttgcggtgg gtgaccctat gcggtcgagc ggcgtcgaca ccgagccgtt ccaccttgtt 2782441 ctcctcgtag gcgccgaagt tgccctcgaa ccagaaccac ttcgcctcgt tgtcgtcgtc 2782501 accctcccac gccaggatgt gcgtgcacgt gcggtcaaga aaccagcgat cgtgcgaaat 2782561 caccacggcg cagccgggga agttcagcag agcattctcc agcgaaccca gagtctcgac 2782621 atccaggtcg ttcgtcggtt cgtcgagcag aatcaggttg ccgccctgtt tgagcgtcaa 2782681 cgcaaggttg agcctgttgc gctccccgcc ggatagcaca ccggccggtt tttgctggtc 2782741 cggtccctta aacccgaatg ccgacacgta ggcccgtgac ggcacttcgg tttgaccgac 2782801 ctggatatag tccagaccgt ccgagacaac ctcccagacg gtcttccgcg gatcgatgcc 2782861 agcacgggcc tggtccacgt aactcagctt gacggtctcg ccgaccttga cgctgccgct 2782921 gtccggtgtc tcgagcccga cgatggtttt gaacagtgtg gtcttgccta ccccgttggg 2782981 cccaatgacg ccgacgatgc cattgcgggg caagctgaac gacaggtcct tgatcagggc 2783041 gcgcccgtcg tagcccttat cgaggtggtc gacctcaacc accacgttgc ctaggcgggg 2783101 cccgaccggg atctgaatct cctcgaagtc gagcttgcgg gtcttctccg cctcggctgc 2783161 catctcctcg tagcgctgca ggcgcgcctt gcttttggcc tggcgcgcct tggccccgga 2783221 ccggacccaa gccaactcct cggtcaaccg cttttgcagc ttcgcgtcct tgcggccttg 2783281 caccgcgagc cgctcggctt ttttctccag ataggtcgag tagttgccct cataggggta 2783341 ggcgcggcca cgatcgagct ccaggatcca ttccgcgacg ttgtccagga agtaacggtc 2783401 gtgggtgacc gccaggatcg caccggggta gctggccaga tgctgttcga gccactgcac 2783461 actttccgcg tctaggtggt tggtcggctc gtcgagcaac aacaggtcgg gtttggacaa 2783521 cagcagtttg cacagcgcca cccggcgacg ctcgccaccg gataggttgg ttaccggctc 2783581 gtcggccggc ggacagcgca gcgcatccat ggcctgctcg agctgcgcgt cgaggtccca 2783641 cgcgtcggcg tggtccagtt cctcttgcag ccgacccatc tcttccatca gctcgtcggt 2783701 gtagtcggtg gccatcaatt cggcgacctc gttgaagcgg tcgagcttga tcttgatgtc 2783761 ccccatgccc tcttccacat tgccgcgaac ggtcttgtcc tcgttcagcg gcggttcctg 2783821 ttgcaggatg cccacggtgg cgccggtggc caggaaggca tcgccgttgt tcggcttgtc 2783881 caaaccggcc atgatccgca agacgctcga cttaccggcc ccgttggggc cgacgacacc 2783941 gatcttggcg cccggataga aactcaacgt cacgtcgtcg aggatcacct tatcgccgtg 2784001 cgccttgcgg accttcttca tcgtgtagat gaactcagcc atgccgcggt gttgcctttc 2784061 tggtccttcg ggttacctcg cgaaccatcc taggcaccgc cggggcagca tcgaggcgac 2784121 ccctaagccg atatgggcag ggggttgtgg ccagtgatgg cgtcgtcgac cacgacatcg 2784181 gaaaccgagt cggctgccga cgctggggcg tcggcggcac cggccgcccc ggtccccgtg 2784241 gcggccggga gatcaccggc gcttggaccg gtgtaggccg gcttttcgat gcgcacgatc 2784301 acgcgcgaca aatccggccc taccgacgtc gcccgcatct ccagcgacga gcgacgaatg 2784361 ccgtcccggt cctcatattc actggtgtac acgtgtccca ccacaatcac cggtgcgccc 2784421 ttgcccaatg ctgcgcccac cccggtgacc agccttcccc agcaattgac ggtgataaac 2784481 agcgagttgc cgggctccca accgccgtcg ctggtgcgcc ggcgcgaatt gctggccacc 2784541 cggaacttga cgacctcttg atcaccgact ttgcggcgct gcaaatcgtt gacgatgtga 2784601 ccgaccacgg tcagtgaacc gccccggtga gtccggagac tctctgatct gagacctcag 2784661 ccggcggctg gtctctggcg ttgagcgtag taggcagcct cgagttcgac cggcgggacg 2784721 tcgccgcagt actggtagag gcggcgatgg ttgaaccagt cgacccagcg cgcggtggcc 2784781 aactcgacat cctcgatgga ccgccagggc ttgccgggtt tgatcagctc ggtcttgtat 2784841 aggccgttga tcgtctcggc tagtgcattg tcataggagc ttccgaccgc tccgaccgac 2784901 ggttggatgc ctgcctcggc gagccgctcg ctgaaccgga tcgatgtgta ctgagatccc 2784961 ctatccgtat ggtggataac gtctttcagg tcgagtacgc cttcttgttg gcgggtccag 2785021 atggcttgct cgatcgcgtc gaggaccatg gaggtggcca tcgtggaagc gacccgccag 2785081 cccaggatcc tgcgagcgta ggcgtcggtg acaaaggcca cgtaggcgaa ccctgcccag 2785141 gtcgacacat aggtgaggtc tgctacccac agccggttag gtgctggtgg tccgaagcgg 2785201 cgctggacga gatcggcggg acgggctgtg gccggatcag cgatcgtggt cctgcgggct 2785261 ttgccgcggg tggtcccgga caggccgagt ttggtcatca gccgttcgac ggtgcatctg 2785321 gccacctcga tgccctcacg gttcagggtt agccacactt tgcgggcacc gtaaacaccg 2785381 tagttggcgg cgtggacgcg gctgatgtgc tccttgagtt cgccatcgcg cagctcgcgg 2785441 cggctgggct cccggttgat gtggtcgtag taggtcgatg gggcgatcgg cacacccagc 2785501 tcggtcagct gtgtgcagat cgactcgaca ccccaccgca aaccatcggg gccctcgcgg 2785561 tggccctgat gatcggcgat gaaccgggta attagcgtgc tggccggtcg agctcggccg 2785621 cgaagaaagc cgacgcggtc tttaaaatcg cgttcgccct tcgcaattcg gcgttgtccc 2785681 gccgcaagcg cttcagctca gcggattctt cggtcgtggt cccgggccgt gcgccggcat 2785741 cgacctgcgc ctggcgcacc cacttacgca ccgtctccgc gcagccaaca ccaagtagac 2785801 gggcgacctc actgatcgct gcccactccg aatcgtgctg accgcggatc tctgcgacca 2785861 tccgcaccgc ccgctcacgc agctccggcg ggtacctcct cgatgaacca cctgacatga 2785921 ccccatcctt tccaagaact ggagtctccg gacatgccgg ggcggttcac agtggcgttt 2785981 cgaacatttg ctcattcctt tcctagttgc gttggcacag ttgcgttggc accgggtgat 2786041 tccgcgaact gcccacgcat atgccgagtg ctattcacct cggccacacc gacatttgcc 2786101 gggatgagac cgtcgccgcg cgcaatcctg tgaatgaagc ggtaactgtg gattaaccaa 2786161 ttaattggcc gcttggcctg caaacctggg aaccagaccg aaacctcgct cagtattcac 2786221 aaaacggtcc aatggggcag ggtgacggcg ataacatccc aatgaccgtg attcttcgaa 2786281 ccatggcgac gtacgggcca cgacaacctg ccatcgaagg ggcgacgaca atgaagacaa 2786341 ggaacccacg gacgctgcta acctggctgc tcggcgcgat agttactggg ttgtacgtgg 2786401 ttttcgctac gggctgccaa ttgcaagcgc ccgcgcctcc cactccggaa ataggttggt 2786461 cgggcccgca ggctccactg ccggcgccgg atgcggcgcc aacgcacctc ggcgtctagc 2786521 cgatcgcggc ggacaagtcg cccggcaccc agggcgagca gggcttcacg acagctagtg 2786581 agctatagac gacttcgtgt tagcgccgct ggcggggacg ttggcgctga tggggatcga 2786641 gttcctcagc tgcccgtgga caagaaccgc accccgcagc gagtcggcat ggacactcgc 2786701 gacgccgcca cgagtctggc ggtgtacgcc cattgcgcgc gtcacgcgcc cactgaccca 2786761 gttcactggg gtgccgttcg ccgtgctcgc ggcggcgctc acggcgctgc atctgacggc 2786821 atggcgcacc gcattcggtt tttctgagcg ctgggaaaat ggccagccgt ctggctcatg 2786881 gcgtctacgc aacgccacgc ccccaacacg ttcttagatt cggtcgcgtc cttgacgcgc 2786941 tttgaactcg caggcgacga actggttgcg cgcgatctgc tcgacatagt cgaaatcccg 2787001 cagaatgttt cgtaactccc gccggaaggc gaccctacgt tcggcgaggt cggccgccgg 2787061 cgctatcagc tcctgatcga cggcgacctg gcgtgcagtg gcgaacagca gcgtcgatac 2787121 cggttcgctg ctgcggaccc ggccctgtgc cacaaactga cggccgaggc cgagcgccag 2787181 ctccgtcaac tcctcaggac cgatgtcagg cggagcatcg cgcaacacgt cggcaacgat 2787241 ctcataggct tcgaagaaga cccgcaacat cgcgtccgac atcagcggcc gtttggcata 2787301 cagcatcgcg tcgatctcat tgcccccgac gccaagatga tcctcccagt cttggtgcca 2787361 ggccatctct tgggcgatgt tggcccgaaa cgccgtggaa tccgcgaaat agaagtcgaa 2787421 cttcagcaga tcccgcaacc gcatcgcctg ggcccagaac gcggcgacgc ggtcaccttc 2787481 ggcgtgcttg gcatgggcca gcgcgagctc gacgatcgag gtctccaaaa acgcatggat 2787541 caccgagttc cggtagaacg ccgcggcgtg ctcgtcgtca ggcgctatgt accataccgg 2787601 ctcccggcca ctgtcgaccc gagtgaccgg gtggccgttg gacaacgcgt ccgccgccgc 2787661 acggacgcct tcgcgcgagc gcagtcgcaa tgcgcttgtc gaaaccggcg attgtttgcg 2787721 ttccagatag tccagtgagt cctgcaacgt gtggtgcagc tggtcgagcg tcaacgcggt 2787781 gccgcgggtg gtgagcagca gtgcggacac caaacccgtc gcggtcaccg gcgtcgcctg 2787841 caaaatcctc caggccacct cgaacgacat cttctgcaac gcaagccgtt tcgcggccgg 2787901 atcctgggtc agctcgccgt gcggtgcgcc gaggtactgg cgcatcgaga ccgcttcggg 2787961 gaagcgaacg tagatcttgc cgaagttgcg ttccccctgc gccttgatga agttgtagag 2788021 ccagcgcaaa ccttcgggcg tcttctccgc gccacgcgcg taggcggcgt attcggtgat 2788081 ctcgtgcagc tgatcgaagc aaatcgaaac cccctgcagc aggatgtcgt cactgcggcc 2788141 gtccaggtaa gcatcggcca cgtagctcat caaaccgagc ttgggcggca acatctttcc 2788201 ggtgcgcgac cgggtgcctt cgatggacca gctcaggttg aaccgcttct cgaccacgta 2788261 gcccacgtac tccttgagca cgtacttata cagtgggtcg ttgccgatat tgcgccggat 2788321 gaagatcatc cccgagcgcc gcatgagggg tcccatgaga ccgaacgaca ggttgatgcc 2788381 gccgaacatg tgcaccggcg gtaaccggtt gtcctgcatg gccaccggta ccaccacgcc 2788441 gtcgatgtag gaccggtgcg agaacagcag gaccgccgga tgagcctcca gtgcggcgcg 2788501 catcgccgcg acctgatact cgtcgtagtc gaattccgga tcgaagccgc ggctagccag 2788561 cctgccgagg acggaaacca ggtctaccga cacctggctc catccggtgg agagttcgtc 2788621 gagcatcttc ccggcatctt cgaccgtggc gcccggaatc cggtccaggc cggcacgaaa 2788681 tcgtgcggac gccaacatct ccggcttcac cagccgggga gatttgtatt gcggtccaag 2788741 gatccgatat tcggcgcgcg ccagcgccaa cagcgctcgg cggctgacga actgggcgaa 2788801 atcgcgcttg tgctctgcca ccgtggtatc gcgccactgc tggcgcagtt cggacacctt 2788861 ggccgactcg ccggccacca cccgcgcgcg cctgggatcg gtacgcagga tgcgacgctg 2788921 ctgacgctgg ctgggatggt agggatcccg acccgggagc agtgcggcca ccttgcccgc 2788981 ccggctgcga tcggcgggag gcagccagat cacccgaacc ggcacgatag aacggtcctc 2789041 gccagattgc gggctggatg cgaagccggg ctcgagctgc tcgaccagtg ccgtcagcgc 2789101 cgccggcgga gcgttgcgcg gtggcagctt caatatgtcg aacttcgagt ccggatggcg 2789161 tgcacgctgc tggcccagcc agcccatgat cagctccatc tcgaccggcg tcgccgtgga 2789221 agccagcacc agtgtgtcct cggcagtaag caccgcgctg gcatcggccg ccggtttggt 2789281 cacgaccgtc ctttggcgct agagcttggc gatgcggagg cctcaccatc cttgccagcg 2789341 atcttagatt cgctgggttt ggccttcggc gatgccttct ttgtagccgc cttcgtcgcg 2789401 gccgcgcctt tattggcggc gcttttggcg ggagccttct tagccggcac ccttttcgcg 2789461 gtggccttgg cgacctgagc cctggctttt ctggcggcct tctgctcggc gtacagatcg 2789521 accgcgggca acccatcgac cggccagtcc gccagcgtgt ccagatacag ctggcgcacc 2789581 tcggcgatac gatccggcag ggcgtccagg gtccagtcat cgaccggaat cggcggaaac 2789641 accgcgacgt cgaccgtgcc cggattgatc gtggtggagt tgcgcgaggc gacgatctcc 2789701 gcattgcgga tcacgatcgg cacgatcggg atcttcgcgg ccatggcgat acggaagggc 2789761 cccttcttga atgacccgac ttcggtggta tccaaccggg taccttcggg agcgatcacg 2789821 atcgatagtc cattgcgggc gcgctcctca accgtgtgca gtgtctccac cgcggcgacc 2789881 ggatcatcac ggtcgatgaa cacaccgtcc agcaacttcc ccagcgtgcc catgatcggg 2789941 tcgctcgcca gttccttctt gcccacccca acccagttgt cgcgcaccag cgcaccggca 2790001 atgaccgggt caacctggtt gcggtggttg aagataaaga cggcgggccg ctgggcggtc 2790061 agattctctt ttccgatcac attcaggtgc acgccgctgg tcgccagcag cagctgagag 2790121 aaggtggagg taaagaaatt cacgccgcgg cgccggctac cggtcagcac accgatccct 2790181 accgcgccgg ccgcgaccgg gacgatggtg ctcagaccgg caagtgtccg caactgccgc 2790241 cggatgccca caccgccgcg actgttgaac ttcaagatcg gccagccccg tcgcttggcg 2790301 accgcggcca tctttccttc cggattggtc ggtcgcggat tgcccaccag atacatcagg 2790361 gcgacgtcct cgtcaccgtc ggcatagaag taactgtctt tgagatcgat gtcgtgctcg 2790421 gccgcaaagc gttgcaccgc agtggctttg cccggacacc acaaaattgg cttcagcaca 2790481 cccccggtga gtatcccgtc ctcgttggtc tcgaacttgt tggtgagcat gttgttgatc 2790541 cccagaaaac gtgcgactgg gccaacttgg atggtcagcg ccgacgagct gaggaccacg 2790601 gtgtggccgc gggccacgtg agcccggacc agttcccgca tttccgggta gatccgggac 2790661 tcgatccgct gggcgaatag ccgctcgccg atttcttcca ggtcggtcaa gagccgcccg 2790721 gccagcgccg cggcggcctt tccgataagg tcttcgaact cgattcgccc gagcgtgtga 2790781 ttcaggccgg cctgaaccat accgagcagc tcgcccacgc ccatatcgcg gcgccgcagc 2790841 ctctcctggg tgaggatgac ggccgtgaag ccggcgacca gcgtgccgtc caggtcgaaa 2790901 aacgcaccga ccttcgggcc ggcaggactg gccagaatct cggctaccga accgggtagg 2790961 cgcaaatccg gcgccgactt ccgcgtcgcc cgctcttccc cctgctcgtc agcggcgctc 2791021 atgagcccga caccgatcga ggcactgaac cggctccttg agtatcgaac gacgccggca 2791081 gcacacgcgg tgccggacca ccggccagcg caaggatttc gtcgaaaccc gcctgcaggc 2791141 attgagcgaa caactcgtcg tttcgcaccg acgccctgtc gtagcgcacc gtgacggtgc 2791201 accacccgcc ccgggaaatt agcactacca tcatcgccac accgggcaac ggtccaatac 2791261 cgtactgccg cagtatcttc gcgccggcaa ggtaggtatc ccctgggtag accggaacat 2791321 tgctggcttg cacatcggaa ccgatcaccg aaccggtgat cccctccagc acggccgtcg 2791381 gcaagacact cagcaccggt gcaatggaac cgatgatgtt catcgcgggc tcgtcgcgac 2791441 gctgggtcat ctgcgcccgg atcttcttca tccgagccac cggatcgata gtgcccaccg 2791501 gcgccgccag gttgacaccg gtgaactggt tgccgccggc cgcatcgccc tcggcccgca 2791561 ggttgaccgg caccgccatc ggcagcgtgc tgatcggcac gcccagggcc tcgtggtagc 2791621 ggcgcagcgc gccacacaga cccgcaaggt aggcgtcgtt gatcgacccg ccgccggcct 2791681 ttgcggcctt gtgcaggtcg gcgagccgga tgtcgatggc ctcggtacgg gtggtcaggc 2791741 tgcgccggcg cagtaggggt gagggttcag cagctcggtt cagcacccgg atgcccgacc 2791801 tggcgtagcc caagatcccc gacacggtgg acaccggttc cagaacagcc cgcccggcca 2791861 tcgataccgc cccggacagc gcgtccagga caccgccgac gacagcaatt ggcaggtggt 2791921 tgatgccccg gcgcatcagg tcattggggg acagatcctc cggaatgggt tgcggcggcg 2791981 tcgacctagg tggtggatcg cgctcgaggt catagatctg cgcgaacatc tccacgccgc 2792041 cgacaccgtc ggtgaccgca tggctgacgt gcagcagcat cgccgctctg ccgtcagcca 2792101 taccctccac cagggtggcc gtccacagcg ggcgcgatat gtccagcggc gactgcagaa 2792161 tcacctcggc gagatcgagc acttcgcgca acgtggcggg tccggacaca cgcacccgac 2792221 gcacatggaa gtccagattg aagtccggat ccaccaccca gcgcggggcc gcggtcggca 2792281 aggtcggcac caccaccttc tgccgcagcc gcaacacccg tcgcgaggcg ttttcgaatc 2792341 gggtccggaa gcgatcccag tccggcgtgc cgtccagcag ttccagcgcc atgatccccg 2792401 aacgagtccg cggatttgcc tcgccccgat gcatcaaata gtcgaccggc ccaagctcgt 2792461 cggacaacct gggggactcg ccggactcag ccatggccac gaccccgcgc gggttgggca 2792521 actcgacgca caaactctgt caccgccgat cagacctcct gcttcaaacc cgccaccgcc 2792581 acgcaccaca gtgccaacac aacgctagtc gcgatgacgc ggtggtgaaa gccgatgcgg 2792641 gccatgatcc acccgcagca ccgatgccgc ggccacgacc gacgaaacct cgtgttgggc 2792701 agccgagttg gaacggccaa gctcagctgg ccggaggtga cgacagcgcc agcgaaccct 2792761 tgcgagcacc catccgtcgc ccgtagatca cacccaagaa gtccgagacc gcttcggcga 2792821 ccatccgcga tcggaccgtg gcggcgaggt cgaacgcgtg gtgggcgttg gggagctcag 2792881 cgtaggacac cgtcgcggca cccgcgtcgc gcagcgccgc gctgaaggcg cgagattgcg 2792941 cgctcggcac catcggatcc ttctcaccgt gcaacacgaa gaacggcgga gcctcgctgt 2793001 ggacgtacga aatcggcgac gccgccttga acagccccgg gttgtcgacg tagcggctac 2793061 gcatcacgaa gtgctccagg aacggcatca tcatttcgtg catattctcg gcgttggtga 2793121 ggtcgtagac gccgtagtag ggcgccgcgg cttgtaccgc cgtgtcggcg ctttcgaagc 2793181 ccggctgcag cgccggatca ttcgccgaaa gcgcggccaa cgcggccagg tgcgcaccgg 2793241 cggacccgcc ggtgatcgtg atgaaatccg gatcgccgcc atagtcggcg atgttctcgc 2793301 gaacccacgc aatcgccctc ttcacgtcca caatgtgcgc cggccacgtg caccgtgggc 2793361 tcttgctgta gttgatcgac acacagatcc agccgagttc caccatccgg ctcatcaacg 2793421 ggtaagcctg agggcgtttg ccgttgatgg tccacgcccc gcccgggacc tggatgagga 2793481 ccggagcccg gcggccgggc gctaaatcgg gacgccgcca gatgtcgagt agattctcgc 2793541 ggccgccggg cccgtacggg atgtcggagg tctgggccgc atagcggcga tggggtccgg 2793601 gaatgtgcgg taggttcagc agcccgctgc gccgggcagc ctctgactgt tcgccggtcg 2793661 gatgccacac taggtcacgg aaatccgggc cgaaagcgtc cacgagcgcc gcgtgcagga 2793721 tttgatccgc ccgctgcgcc gcccagctgg tgccaaaccg gccgatcgag cgtggtgata 2793781 tgcgggacag cgcgtggccg gtgacgacgc gggccggaaa ctccgcggac aaccatcctg 2793841 cgacccaacc gatggcgcac ggtgatccac gcagcagcag ggcgccggcc cggcaggtgt 2793901 ctctggcgtc ggccgccagc tgcgctccct ggcgcaatgc ctcggcgccg gcccgcgagc 2793961 accgcgaagt cacgctggcg atgtgcataa caaagcccac ccctcgacgt caggcacacg 2794021 catcgttgcg gtaaacggct ggttgccagc cggttttgta cgtgtgtcga ggatcacaca 2794081 ataaccaata attgacgtgg cggtagacct ttcgcgcgtg tggcgtctgg aaaaattcct 2794141 cgacggccac cgttagataa actgacctgc gcatcgcctc cgtagctcag gtggatagag 2794201 caagggcctt ctaatcccta ggtcgcacgt tcgagtcgtg ccgggggcac tgtggaaata 2794261 gcaggtcagc atggtggcgt ggcttgacac cgcctcgtta tgggtcgacg cccagagtcg 2794321 ccttcaaact caaaccacgg aggtgcccga tggcccaata cgacccggtc ttgctcagcg 2794381 tcgacaagca cgttgcgctc atcacggtca acgacccgga ccgacggaac gccgtcaccg 2794441 acgagatgtc ggcgcagttg cgtgcggcga tccaacgcgc cgaaggcgac cccgacgtac 2794501 acgccgtagt cgtgaccggg gcgggcaagg ccttctgcgc cggggccgac ctgagtgcgc 2794561 tgggcgccgg ggtcggcgat ccagccgagc cgagattgtt acggctctac gacggtttca 2794621 tggccgtcag tagttgtaat ctgcccacca tcgccgcggt caacggcgcg gctgtgggcg 2794681 ccggactcaa tctggcgttg gccgccgatg tgcgcatcgc cggaccggcc gcattgttcg 2794741 acgcccgctt ccaaaagctg ggactgcatc caggtggcgg cgcaacctgg atgctgcagc 2794801 gagcggtggg tccgcaggtc gcccgtgcgg ccttattgtt cggcatgtgc ttcgacgccg 2794861 aatccgctgt gcggcacggc ttggcgctaa tggttgccga cgatcccgtc accgcggcgc 2794921 tggagctggc cgccgggccc gcagccgccc cgcgcgaggt cgtgctggcg agcaaagcca 2794981 ccatgcgcgc cacagccagc cccggatcgc tggaccttga gcaacacgaa ctcgccaaac 2795041 gcttagaact tgggccgcag gcgaaatcgg tccagtcgcc cgagttcgcc gctcgcttgg 2795101 ctgccgctca acacaggtag cgcctaccag cctcgagggt ttccatggcg tgccccagtc 2795161 cgaagctgct gctgcttgac tccgcgcgct gggcccgagc gcgcgctgtt gtacggccca 2795221 aacggcgtgt cggtgtacag tcgcgcgctc gcggcttcag tccggccccc cgactccggc 2795281 aggcccgacg gcgcccagcg ctagccgggc gcgccggcca tgccttcggt gccggaaacg 2795341 ccaggggacc cggggccgtt ggtgaggccc cccgcgcctg cctcaccgcc gctaccgccc 2795401 gcgccaccgg caccgcctgc gccgcccgcg ccaccgatac cgtcagcgcc gctgactcct 2795461 gcggcaccgc tgaggaaccc tccggaccca cccgcaccgc cggcaatacc gccagcgcca 2795521 ccgttaccgc cgtttgcgcc gttgcccccg ttgccgcccg tcccgccggc cccgccgatg 2795581 gagttctcat cgccaaaagt actggcgttg ccaccggagc cgccgttgcc gccgtcaccg 2795641 ccagccccgc cgactccacc ggccccaccg actccgccgc tgccaccgtt gccgccgttg 2795701 ccgatcaaca tgccgctggc gccacccttg ccacccacgc caccggctcc gcccaccccg 2795761 ccgacaccaa gcgagctgcc gccggagcca ccatcaccac ctacgccacc gaccgcccag 2795821 acaccagcga ccgggtcttc gtgaaacgtc gcggtgccac caccgccgcc gttaccgcca 2795881 accccaccgg caacgccggc gccgccatcc ccgccggccc cggcgttgcc gccgttgccg 2795941 ccgttgccga acaacaaccc gccggcgccg ccgttgccgc ccgcgccgcc ggtcccgccg 2796001 gcgccgccga cgccaaggcc gctgccgccc ttgccgccat caccaccctt gccgccgacc 2796061 acatcgggtt ctgcctcggg gtctgggctg tcaaacctcg cgatgccagc gttgccgccg 2796121 cttcccccgg gcccccccgt ggcgccgtca ccaccgatac cacccgcgcc accggcgcca 2796181 ccgttgccgc catcaccgaa tagcaacccg ccggcgccac cattgccgcc agctccccct 2796241 gcgccaccgt cggcgccgga ggcggcactg gcagccccgt taccaccgaa accgccgcta 2796301 ccaccggtag aggtggcagt ggcgatgtgt acgaaagcgc cgcctccggc gccgccgcta 2796361 ccacccccac tgccggcggc tacaccgtcg gacccgttgc caccatcacc gccaaaggcg 2796421 ctcgcaatgt cgccctgcgc gactccgccg tcgccgccgt tgccgccgcc gccaccggca 2796481 gcggcggtac cgccgtcacc accggcaccg ccggtggcct tgcccgagcc tgccgtcgcg 2796541 gtggcaccgt cgccgccggt gccaccggtc ggcgtgccgg cagtgccatg gccgcccgtg 2796601 ccgccgtcgc cgccggtttg atcaccgatg ccggacacat ctgccgggct gtccccggtg 2796661 ctggccgcgg ggccgggcgt gggattgacc ccgtttgccc cggcgaggcc ggcgccgccg 2796721 gtaccaccgg cgccgccatg gccgaacagc ccggcgttgc cgccgttacc gcccgcaccc 2796781 ccgatgcctg cggccacgct ggtgccgccg acaccgccgt tgccgccgtt gccccacaac 2796841 caccccccgt tcccaccggc accgccggcc gcgccggtac caccggcccc gccgttgccg 2796901 ccgttgccga tcaacccggc cgcgcctccg ctgccgccgg tttgaccgaa cccgccagcc 2796961 gcgccgttgc caccgttgcc aaacagcaac ccgccggccg cgccaggctg cccgggtgcc 2797021 gtcccgtcgg cgccgtttcc gatcaacggg cgccccaaaa gcgcctcggt gggcgcattc 2797081 accgcaccca gcagactccg ctcaacagcg gcctcagtgc tggcataccg acccgcggcc 2797141 gcagtcaacg cctgcacaaa ctgctcgtga aacgctgcca cctgtacgct gagcgcctga 2797201 tactgccgag catgggcccc gaacaacccc gcaatcgccg ccgacacttc atcggcagcc 2797261 gcagccacca cttccgtcgt cggcatcgcc gcggccgcat tagccgcgct cacctgcgaa 2797321 ccaatactcg ctaaatccaa agccgcagtt gccagcagct gcggcgtcgc gatcaccaac 2797381 gacacctcgc acctcccgat accccatatc gccgcaccgt gtccccagcg gccacgtgac 2797441 ctttggtcgc tggctggcgg ccctgactat ggccgcgacg gccctcgttc tgattcgccc 2797501 cggcgcgcag cttgctgcgc gagttgaaga cgggaggaca ggccgagctt ggtgtagacg 2797561 tgggtcaagt gggaatgcac ggtccgcggc gagatgaata ggcggacgcc gatctccttg 2797621 ttgctgagtc cctcaccgac cagtagagcc acctcaagct ctgtcggtgt caacgcgccc 2797681 cagccacttg tcgggcgttt ccgtgcaccg cggcctcgtt gcgcgtacgc gatcgcctca 2797741 tcgatcgata acgcagttcc ttcggcccag gcatcgtcga actcgctgtc acccatggat 2797801 tttcgaaggg tggctagcga cgagttacag cccgcctggt agatcccgaa gcggaccgct 2797861 cccatgcgcc cccgggccgc gtcggccgcg ccgaacagcc gcaccgcttc ccggttgctg 2797921 ccggcatccg ccatcaccga ggcgaggcac tcgagaatgt cggggaccca taggtatgcc 2797981 ccaatggacg cggccacgcc gagggcgtcg tgggcatcgc gctcggcccg gtggcgatcc 2798041 ccttgggcga tctcgatgcg gcaacgggta gtcagggcgc gggcgcggtg cacgccacga 2798101 gtgatcgacg ctgcgccgtc ggccaatcgg tgcgccgcgt tcagatcacc tcgcgcacac 2798161 gatatttgag ccgaactggt ggggtcgttg atgatcgccg ccgcgctggc accaaagaat 2798221 cgcgttgccg attcgcgggc gtgttcggcg gccgcgacgt caccggcggc cagggtcgcg 2798281 aagaccagcg cggagcaggc cgagcccgac agcaccgggc tgagtccaac ggcggtgtcg 2798341 atgctggctt gggcggcggc ggccgcctcg gtgtcgccgc ggtgcgctaa cgcgtgcgcc 2798401 aagcaagcct ggcccgcgca gctgctaacc atgtcgtgcg cggcgtcgga ctcgccgatc 2798461 acctcgcgcg acaggccgac cgctgcctcg aggttgccct gccagagatt cgccgcggcc 2798521 agcgcccagc gacatgaacg tgaaaggaat gcatcaccaa tctcgtcggc gaggcttcgt 2798581 gcctcctcgc ccgccgcgcg ggtcgcgccc gggtcaccct cgccggcgaa cccgacatag 2798641 gcctgccagg ccagaacctc ggccaaccgc cacttgtcgc ccaccgcccg ggccaggccg 2798701 acggcctcgg ccagccacgg tcgcgccaga tccgcgttgt aggcggcgac acccccgcac 2798761 gcggtcagcg cccgcgccag cagggccgga tcctcgatgt cgcgcgctat agccagcgcc 2798821 ttctgggcat catctaggcg gtcggtgatg ccggccacgg catctatcag ggcccggtcg 2798881 gccagtgccc gcgcatacaa cccagggtcg gcccccgccg gatgtgcatc gtggtcggcc 2798941 agggcggcgg cgaaccaggc cagcccctct tgcaggcggc cccgggcacg ccacaacggc 2799001 tgcagacatg atgccaacag caacgcgtgg ccggtatcgc cattctcgcg gctgaacgcg 2799061 aaagcggccc gtaggttgtc gatctcgagc tcggcctggt tgagccggcg ttcatggccg 2799121 gccaccgagg gggcgtcaag cccggcggca acggccgcgt agtggtcgcg gtgtcgcgca 2799181 cgcacggcat cggcatcgcc ggattcacgc agcttctcca acgcatactg gcgcaccgtc 2799241 tctagcaggc ggtagcgcgt tcggccgtcg ctgtcgtcgg tcaccaccag agacttgtct 2799301 gccagcaggc tgagcagatc gaccacctcg tagcgctgaa cgtcaccgcc ggcggctgcc 2799361 gcttgggcac cgtcgagatc aaacccgctc gggaaaaccg ccagtcgccg aaacagcacc 2799421 tgctccggtc cggtcagcag cgcatgtgac cagtcgacgg aagcccgcat cgtctgctgg 2799481 cggcgcaccg caatacgcga tccaccggtc agcaggcgga accggtcatg caagctgtcg 2799541 acgatttcgg tcagcgccag ggcacgcacc cgcgacgctg caagttcgat cgccagcgga 2799601 atgccgtcga gtcggtggca gatctcggtc accagggcga ggttgtcggc agtgatctcg 2799661 agttcgggcc gcgcctcacg agcgcggtcg gtgaacaact cgatcgcctc gccgtgcccc 2799721 agcgggggaa cccgccaaat ctgctcaccg gccaccgcga tcggttcccg gctggtcgcc 2799781 aataccctca gcgctgggca cgccccgagc aacgcgacga tcagagccgc gcacccgtcg 2799841 agcaagtgct cgcagttgtc cagcactacc agcatgcgcc ggtcgccgat acgccgcaca 2799901 atggtgtcca ccgtcgagcg gcccggctga tccggcaacc ccaaaacccg cgccgccgcg 2799961 atcggcacca gcgccgggtc ggtgatcggc gccaggttga cataccaaac cccgtccgga 2800021 taaccgtcgg caacggcgct cgcgacctgt gtcgccaggc gtgtctttcc gaccccgccg 2800081 acaccggtaa gggtgaccca ccgtttgacg tccagcagcc cacggacttg cgccacttcg 2800141 tcgacgcgcc ccaccagccg agtgagctgg gccggaagac agtgcgcacc aacgactttc 2800201 cgggtccgca gcggcgggaa cgcgttgtgc agatcagggt gacacagctg caccacccgt 2800261 tccggtcggg gcaggtcgtc cagccggtag gtaccgaggt cgttcagcca cgcgtccttg 2800321 ggcagcaggt cagcaaccag atcgctggta gttcccgaca acacggtctg gcccccgtgg 2800381 gccagctcgc gcagccgggc ggtgcggtcg atggtcggcc ctacgcagtt gccctcgtcg 2800441 ggtgacgaca cctccccggt gtgcatgccg atgcgcagcc ggatcggtgc cagcggcgcc 2800501 cgctgcaagc ccagggcgca cgccacggcg tcggatgcgc gggcgaacgc caccaagaag 2800561 ctgtcgcctt cgccctgttc gaccgggcaa accccgcggt gctcgcgaac caattcggtc 2800621 agcgttcggt ccagtttggc gatcgccgtc gtgtcaagct gagaccccgg caggtgggtc 2800681 gcgccctcga tatcggccag cagcaacgtc accgtgcccg tcggtacaag ctcgctcaca 2800741 ccatctgcgc tccagtccac aggtaccacg tcgacgccgg ggtgaatctt gctcatgcta 2800801 gccagcatcg agccagcgcg tagcgcatta catcggcacc tgcgcctaga ttgctcgaaa 2800861 tctcttggcc gccggtccat gtgttctacg cgctttagtc gatgcattcg gcgaccggcg 2800921 tgccatcgcg gcggacctac agtgcccgtg ctgtccgctg gcaattgtga gtcccccagt 2800981 gctggcagca tcgcccgcaa gaaccgacac gaccgcatcg tgggcggtgc cgtcgaagtc 2801041 gccggctgac cgatcggcgg agtcaccggc ccgatggggt ttccgaaggc tagggaatga 2801101 tgacgatggg gcggccgcct cggccgcctt cgccgtaacc cccaaccatg cggaaaacga 2801161 gcctagcgtc gcccggccgc gcagagcgag ccatcgcggt ggcgccaacg acaggaagcg 2801221 atccggattc tctgaccatg gtgggtgttc tggctacgtg acgttaacgg agatggaggg 2801281 gccgccttcg ccgccttcac cgccggaacc gccggagcca gggtcgcccc tcccgttgcc 2801341 ggagccaccc gactcgcccg acgagccgac gccgccggag gtcaagccac cggcaccgcg 2801401 tccgccgtca cctccgcgcc cgccgtcccc gccgtcaccg ccgccgatgc tgcgaggcgg 2801461 aggggcgccg aagccgccgg agccgccggt cccgccgtcg cctccgtcac caccgggggc 2801521 gccaccgtct cgcccggccc cacccaagcc gccgttgccg ccgttgccac ccggcccgcc 2801581 gtcgcctgca tcagcaaagc tgccgttgcc gtccccaccg tgaccgccgt tcccgccgtc 2801641 gcctccgtca ccgccggggg cgccgaagcc ggccttgccg tgcgcgccac ttgtggaacc 2801701 gaaaccgcct tgtccgccgg ggcggcccca cccgccgtcg ccgccgtcac ctccgtcgcc 2801761 gccaggctct ccgtcaaaat ccgcgagata ggtaaagccg tcaccgccca agccaccatt 2801821 accagcgtcc ccgcccgacc cgccgtcacc gccgtccccg ccaacgcctc gattgccgac 2801881 ctcgccggcg ggtgccgacc cgccggcccc gccgtttccg ccggcgccgc cccacccgcc 2801941 gtagccaccg tcgccgccgt cgccgccgtc gcggcccgtc gtttcgttaa tgtcaaagcc 2802001 gtcaacgccg ttaccgccga ccccaccagc cccgcctagg cctccggccc cgccgtcacc 2802061 accgtcgccg gtctgagttc cgccggcgcc accggccccg ccgtcgcctc ccgccccacc 2802121 gctgccgccg tcgaagccgt cgaagccctt taggtcggag tcgggcgacc aacccgcgcc 2802181 accggccgcg ccgttgcctc cctggccgcc ggttccgccg ccgccgttca tcccggcgtc 2802241 gccgcccgcc ccgccgtgtc caccaacccc gccgccgccg ccggggctgc cgccccggcc 2802301 agccccacct tggccgccgg ctccgccgtt cccgccgtcg cccagaaatg ctccgccggc 2802361 gccaccagcc ccaccggcgc caccagcccc accgttgccg ccagcaacgg tgagccctcc 2802421 gagggcaccg tgcgcgccgt cgccaccctt gccgccgtca ccgccgtcac cgatgtcgcc 2802481 ggcgtcaccg cccttgcctc cagccccacc ggccccgcca tcaccgccga gagcttcggc 2802541 agcggtgccg tcggccccat caccaccggc tccgccgtcc ccgaatagcc cggcgttgcc 2802601 gccgtcaccg ccctggccgc cgtcgccgcc ggccgcggcg gccttggcac cgttgccgcc 2802661 gacgccgccg tcgccgccgg tcagtggccc gtgtttgctg gcgtccacgc cgttggccgc 2802721 ggaggtgccg ttgccgctgt caccccccag accgccgcga ccgcctgcgc cggggtcacc 2802781 gccgttaccg cccgctccgc cggcgccgcc gacggtgata ccaatgccgc cgttgccgcc 2802841 ggccccgcca acgccgccgg cgccgccgag tccgccgtcg ccaccgaccc caccggtgcc 2802901 gtgactgccg accgtccccg aaggtgcggc cccgccgacc ccaccgtccc cgccatgtcc 2802961 accgaccccg ccggcaccgc catcgccgcc accaccgccg gccccaccgg tgccgccgat 2803021 actgtcgata ccgttggcgc ccctggcccc ggccccaccg ctagcgccca caccgccgtt 2803081 gccgccggcc ccgccgttgc cgccggcacc gccgtcaccc gacaccgacc caccggcgcc 2803141 accggcacca ccggcaccac cggcaccgcc ggcctgcccc gcgtcgccct gacccccgtt 2803201 gcctcccggt tggccgaggg cgagggcatc tgaaccaggc gcgcccgaat tggccccgtt 2803261 ggcgccggcc gcgccatcgc caccattgcc gccggcgcca ccatcgccga cccggccggc 2803321 attgccgccg tcgcctccgt tgcccccggc gccgcccgcg acgctggctt gcgcaccgtt 2803381 gccaccgtta ccaccgttgc cgccgctgcc gggcccgtgg tcgctggcgt ccacaccgct 2803441 ggccgcgcgg gtgccgttgc cgctgtcgcc gcccaagccg ccgaggcctc ccgcgccggg 2803501 gtcaccgccg tcaccgccgt ccccgccatc actgccatga ccgccgtcac cgccgttgcc 2803561 gccggctccg ccgagcccgc cgtcaccgcc aacgccgccg acaccgtggc tgccgacctg 2803621 acccgcgggt gcggccccgc cggcgccgcc atcaccgccg ggcccgccgt caccgccaac 2803681 gccgccgaca ccgccgtcgc cgcctttccc gccggcgcct ccaacggcct cagcgctgtc 2803741 ggcgccggag gcgcccttgc cgccgccgcc accactagct ccggcaccac ccgcaccgcc 2803801 ggccccgccc ttgccaccgg gtccaccgtc gcccgacacc gacccaccgg cgccaccggc 2803861 tccgccggca ccgcccgcgc cgccggcctg ccccgcatcg cctcgaccgc cgttgccgcc 2803921 actacctaac gccgaactcc cggcaccacc gtcaccgccg gcaccgcccg cgccgccacc 2803981 gccaacgccg ccttgaccgc cgttggcctc gcttccttcg cccggttgcc ccgagagggt 2804041 gccgtcggcg ccgtcctcac ctacgttcgc gccattcgca ccggccgcgc cgctaccacc 2804101 gtcgccgccg gccccgccgt caccgaccaa cccgccgtta ccaccggcac cgcccttgcc 2804161 gccgttcgcg ccagccacgg tggcgtcggc gccgttaccg cccttgccgc cgttgccacc 2804221 actgtgcacg gtagcgccgg tcgcgccggt cacaccctcg gtggcgccgg cgccgccggc 2804281 gccacccttc ccaccggcgc ctgggtcacc accatcgccg ccggccccac cgtcaccatc 2804341 cttgaaagcc atgtcgccgc ggccaccgct gcccccgtta ccgggcgccc caccagcccc 2804401 gccggcacca ccgtccccgc caacaccctg gctgcccgcc cgacccgcag gtgcgtcacc 2804461 gccagcccca ccggccccac cgtcaccgcc gcgaccgccg gctccgccat caccaccgtt 2804521 gccgccgtca gatacgagca cagcattgaa accgtgagct ccgttaccac cggccccgcc 2804581 ggccccaccg ttgccgccgg caccgccggc cccgccatcg ccggcgtggg ccccacccgc 2804641 gccgccggcc ccaccggccc cgccgttacc tccatcctca ccgggggtac cggatgaacc 2804701 caggaagatc gccgtcatat cggcatagcc ggcacccgcg gctccgtcac cgccatgacc 2804761 gccggcccca ccgtcaccga ccaacccgcc gttgccaccg gcaccgccgt taccgccatg 2804821 accgccggcg accggtgcgt gggcgccatt gccacctttg ccgccgttgc cgccgctgac 2804881 cggcccgtcg ttgccggccg ccagaccatt ggcccccgcg cgagcaccgg cgccgcccga 2804941 cgcccccccg gctccgccag ccccacccag gcccgggtcg ccaccgtgac cgccggcccc 2805001 gccgtcaccg gcggccagcc aactgccacc gttgccgccg gcaccgccgt caccaggcgc 2805061 tccacccagc cccccacccc cgccagcccc gccgtctccg gcccggccga cataccccaa 2805121 tagtccagcc gacccgccag cacccccggc gccgcccacg ccgccgtttc cgccggcacc 2805181 gccattgccg ccggcgccgc cgtcaccgcc ggccgccgag ataccggccg gcccatttat 2805241 tccggtagcc ccggcaccgc cggcaccgcc ggccgcaccg gcaccaccgg ccccgccgac 2805301 accgccaacg ccaccggcgc cgccgttacc gagaagccac cctccccgac caccgttgcc 2805361 gcccacccca ccggcaccgc catcgccccc gtccgaccct gccaaaccgt caccgccggc 2805421 accgtcgtcc gacccggcaa caccagccgc cccatcctga ccgggcgtag caccgttggc 2805481 cccggccgca cctacaccac ccacgccgcc agcgccccca tgaccaaacc accccgcgtt 2805541 accgcccgcg ccgccggcgc caccaccagc cccaccggca ccaccggcgc cgccgttgcc 2805601 gaacaggccc gcgttgccac ccgccccgcc ggcgccacca ccagccccac ccataccgcc 2805661 gacgccgccc ccacccgcca gccatccacc cgtcccgccg gtacctccgg gtgcgcccgc 2805721 cccgcccgcg ccaccggcgc cgccgatccc aaataacccg gccgccccgc cggcgccgcc 2805781 cacctgcccg acggcaccgg cagcgccgtt gccgccattg ccgaacaaca atccaccggc 2805841 cccaccgggc tgcccaggcg ctgtcccgtg cgcaccatcg ccgatcagcg ggcgtcccag 2805901 tagcgtctgg gtgggcgcat tcacggctcc gagcacgact cgcatcgccg cggcgttggc 2805961 gatctcggtg gccgtgtacc acctcgcggc cgcggtcagc gtgtgcacga actggtcgtg 2806021 aaacgccgcg gcctgcgcac ttagcgcctg atactcctga gcgtgcgcgc tgaacaacgc 2806081 cgcgatgccc gccgacacct cgtcggcgcc ggcggccacc acttccgtcg tcggcatcgc 2806141 cgcgaccgca ctagccgcgc tcacctgcga accaatacgc gccagatcaa aagccgcagt 2806201 tgccatcatc tccggcgtcg cgatcacata tgacatctcg cacctaccca atagcccgac 2806261 cgtcgccgcg ccgctcccgc tgcgactagt gaccccttgg tctcttgagc cagcgacccc 2806321 aactaccgcc gcgacaggcc ttgttctgat ttgccgcgac gacctcccag gtgggtcgaa 2806381 cccactctgt cggccagcag caatgccacc gaacccgccg ccaccggatt gcccacgtcg 2806441 tctttgctga ctttgctgca gtccagaggt gccacgtcgg cacccgggtc aatctcgcgc 2806501 atgccagcca gcatcgagcc agcgggcacc gcaatacatc agcacgtgtg actagattgc 2806561 tccgaattct gtcgaacgcg ggtccgcgtg atctgcgcgt ttcggtcgat gctttcggca 2806621 gcccggcctc cgatcattaa cgaacccgag acgaggagag cgccatggtc gacacgagcg 2806681 cgcccgccag ccggctggac accgatccgc gccgcgctca tgtgagtctt agtaagcacc 2806741 cctaccagat tggagttttc gggtccggaa caattggtcc gagagtctac gaactggcct 2806801 atcaagtcgg tgccgagatc gcaaagcaag gccacattct catcagtggc gggatgactg 2806861 gcacaatgga agcctcctca cggggtgcgt cggacgccga cggccttgtc gtcggcgtcc 2806921 tgccgggcga caagtttacc gatggcaatg cctattccac gataaagatt ctgagcggta 2806981 tgcagtttgc tcgtaactac ataacaggtt tgagctgcca cggagcaatt gtcgtcggcg 2807041 gctcgagcgg cgcctatgaa gaagcccgtc gtgtctggga aggccgtggc cccgtggtgg 2807101 ttctagcgaa cagcggatcg ccaacgggtg cgtctgcgca aatgctgtcc atgcaggaaa 2807161 tctttggggt cgcctttccg gaggacaaac ccaagccctg gcgagtcttt tcggcggcaa 2807221 cccccgccga atcggtgtcg cttgtcattg gcctgatccg gaaaggatat gcccaacatg 2807281 agccgtagga taattaacga gttcggagta cagatctacg gggccacgat aggtgacacc 2807341 tgggccgggc tggtcagggc ggtgcttgac cttgggtctc agtgttttga cgaagaccga 2807401 gagcgtatag cgctgtccaa cgtccgcatc aagtcttcgg tgcagaatta tcccgatctc 2807461 actattgaag aacattgcaa cagcgcccaa ctaaaggcca tgctagattt catgttcaac 2807521 accgatacca tggaggatat cgatgtggtc aagagcttca gtcgtggcgc aaaaagctac 2807581 catcgccgga taaaagaagg acgaatgatt gagttcgtaa ttgagcgact gagtctaatt 2807641 ccggaaagca agaaagcagt ggtcgtgttc ccgacttacg aggattacgc ggcggtcatg 2807701 cgtaatcatc gagacgatta cttgccttgc cttgtttcga tacagttccg cttgttgcca 2807761 gacggcaaag attacgtctt ccacacgacg ttctattcgc ggtccatgga cgcctggcaa 2807821 aaaggtcacg gcaatctttt gtctatcgcc aagctatcgg attgggtgcg agagaacgtc 2807881 agtgcgcgca ttgggcgcaa gatcatgctt ggcccgcttg atggcatgat ttgtgatgtt 2807941 catatctaca aggagacgta tgcagaggct tgcaagcgtt tggccaacct cgaccttagg 2808001 cgaacacaat ttgacgcggt gcggaattag tgaggacgct aagcctcccc agctgatgcg 2808061 ttgatgcgct agcatcaggg ctgtgcgaac gacacttgac cttgacgacg atgtgatcgc 2808121 cgcggcacgt gaacttgcct ccagccagcg ccgctcgctc ggctcggtga tttccgaact 2808181 cgcacgccgt ggtctcatgc ccggacgcgt cgaggctgac gacgggctgc cggtgatccg 2808241 cgttccagcc gggaccccgc cgatcacacc ggagatggtc cgtcgcgcgc tcgatgagga 2808301 ctgacgcggg tggcgctgct cgacgtcaac gcattggtcg cgctggcgtg ggactcacac 2808361 atccaccacg cccggatccg cgagtggttt accgccaacg ccacgctcgg ctgggcgact 2808421 tgcccgctca ccgaagccgg cttcgtgcgg gtgtcgacga acccaaaagt acttcccagc 2808481 gcgatcggga tcgcagacgc tcgacgggtc ctcgtggcac tacgcgccgt gggaggccac 2808541 cgcttcctgg ctgacgacgt atcgctcgtc gatgacgatg ttccgttgat cgtcggttat 2808601 cgccaggtga ccgacgccca tctgctgaca ctcgcccgcc ggcgcggcgt ccgcctggtc 2808661 accttcgacg ccggtgtctt caccctcgcc caacaacgcc ccaagacgcc agtggagctg 2808721 ctgaccatcc tctaaccaaa gctgccagcc cgcccggcta cagatccaac agcgcggtct 2808781 ccggcgactc gatcagatcc cgcagctcac acatgaactg ggccacctga gcaccatcga 2808841 caacgcggtg gtcgaacaca caagtcaacg tcatcgtcgg ccgtgcgaca acctcgccgc 2808901 cgacgaccac cgggcgcggc ttgatcgccc ccagacccag gatcgccgct tcgggatggt 2808961 tgatcaccgg cacgccgtcg tcgactccca gcgccccgaa gttcgacacc gtgaacgtcg 2809021 aaccgcgcag ctccgcgggt gtgagagtgc cttcacgtgc gccggtgatt aattccgcta 2809081 cgcgggaggc aagttcgcgg gtgttcttgt cctgggcgtc ggtcaccacc ggcaccagca 2809141 atccacgctc agtggccgcg ccgaacccca gatgcacacc gcgatgcacg tgtacttgcg 2809201 ggccttcgcc cgagtcgacc cacgtcgagt tgagaattac gttgtgtttc aatgcaataa 2809261 ccagcagccg cagcgtcagc gcgaacggtg taatctcggg cgccgccgaa acgaaccggt 2809321 cgcgcagccg cagcagttcg gcgcaaatta cctcaacgct ggcctttgcg gtcggaatct 2809381 ccttgtggga caacgtcatt ttttcggcca tccgcgcgtg cacgccgtgg accggccgca 2809441 cgtccggccc ggctccgacg ccgcctcgag cagcggccag cacatcggcc cgggtgatca 2809501 caccgccggc gcccgaccca cgctgcaatg cggccaggtc gaccgccaac tctttggcca 2809561 gcttgcgcac taccggtgcc gccagcggcc ggcttgtccg tctactggtt tcgatcgcgg 2809621 tgtcggcacc gtagccgacc aacgtgggga ccgctccttc accgttaggc tgcgcaactg 2809681 ccgtgggccc ggtgtcgatc cgaactagct ccgcgcccac tttgagcaca tcgccttcgg 2809741 cgccgcctaa ctcgacgatc cggccggcat acgggctggg gatttcgacc tcggccttgg 2809801 cggtctccac cgaacacagc gtctggttga tctccacatc gtcgccgacg gcgacgctcc 2809861 aacacgtcac cgtcacttcc tgcagtccct cgccgaggtc gggcaccggg aaagacctga 2809921 tgctgtcctc accgctcatg gctgacgcag cacacgttcg acgcagtcca acagccggtc 2809981 ggggccgggt aaccacaatt tttccaaccg cgcaggcggg tagggtgtgt caaaaccgca 2810041 ggcacgcaac accggagcct ccaattggta gaacatctct tcctggatgc gcgcggccag 2810101 accggcacca tagccgaggc tgcgcggccc ttcgtgcatc accacgcaac gcccggtgcg 2810161 ctggatcgac gcagcaatgg tgtcgaagtc cagcggcgcc aacgaccgca gatcgataac 2810221 ctccagactc caatcatgtt gctgctctgc agtatccgcg ctagacaggg cggtgctcac 2810281 caggtttccg tacgttacca cggtcacatc ggtgccggac cggcgcacca tcgcgtgccc 2810341 gatcggcggt tccggccggc tagtgtcgac catcccgcgg ccgtggtagc ggcgtttggg 2810401 ctccagatac atcacggggt ccgggcaggc gatagcgtgc cgcagcagcc agtaagcgtc 2810461 accgggtgtc gacggcacca ccaccttgag gcccgcggtg tgcacccagt aggactccgt 2810521 ggagtccgaa tgatgttcgg ccgcaccgat accgccaaac gaggggatcc ggacggtcac 2810581 cggcatgtcc acctcaccgc gggtgcgagt ccggtacttg gccagatggc tcaccacttg 2810641 gtcgaaagcc ggataggaaa agccgtcgaa ctggatttct ggcaccggca caaagccacg 2810701 tagtgccaac ccgacggcta ttccgatgat cgcggactcc gccagtggcg tgtcgaagca 2810761 ccggtctgca ccgaacgtat cggccagtcc ctcggtcacc cgaaacaccc caccctcgac 2810821 cgcgacatcc tcgccaaaca ccaatacccg ctcgtcggcg gccatcgcgt cgtacagggc 2810881 gcggttgatc gcctggacca tggtcaacga ctgcgtgatg tcgctcaccg ctaccgcaag 2810941 cgtctcatcc ggcctggccg gacggtctgc gatttgagtc atgcccgcct cctcagtcag 2811001 tccgcgccag ttcggcacgc agctgttcgc gctgcgcctg caacccgggt gtgatttcgg 2811061 cgtacaccgt ggtgaacacc tcatcgacgt cgaagtcagg cgcatcaaag accgcgtcgc 2811121 gtagctcgga ccgcacgtgt tttgcccgag ccgtcacctg ttcctcgagg cgttgcgacc 2811181 acaggccctg atcttgtaag taagtgcgat agcgcggaat cgggtccagc gtcgcccagc 2811241 ggtccacctc ctcctggctg cggtaccggg ttggatcatc ggcggtggtg tgcggaccaa 2811301 gacggtaagt gaccgcctcg atcagcgttg gaccgtcgcc ggcccgagcc cgagcggcag 2811361 cttcggccat caccgcatag catgccagca cgtcgttgcc gtccacccgg atgcctggca 2811421 tcccgtagcc aatcgccttg tgcgcgatag atggtgcggc ggtctgcctg gataccggca 2811481 tcgagattgc ccactggttg ttctgcacgt agaacacgca cggtgtggtg aacaccgccg 2811541 cgaaattgag cgcctcatgt acgtcgccct cgctggtggc gccgtcgccc agaaaggcca 2811601 ccgtcacgga gtcctcgtcc aggcgttgcg cggccatcgc cgcgcccacc gcgtgcaagg 2811661 tctgggtgcc gatgggaacc gacatcggtg cacagcactt cgtggtgaat tgcagcccgc 2811721 cgtgccaggt tccacgccac gcgaccccaa catgtccagg cgggatgcca cgcactaggt 2811781 agacgcccaa ttctcggtat tgggggaaca accagtcggt tttgcgtagg caagccgccg 2811841 cacccacctg cgcggcttcc tgcccgcgac agggcgtgta caacgccagc tccccctggc 2811901 gctgcagatt gacgaattcg gtatccagct cgcgggtgac caccatcatc tcgtagagcc 2811961 aacgcagcgt ttcctcagga aggtcacggt ggtagcggcg ttcggccgtc ggcgtaccgt 2812021 ccgggccgac gagttgcacc ggctcaagat cgacagacat caacatccca gatggcctcc 2812081 gagaaccctc ccccataccg tctcctcagc tcgcgatcac aacgcggtta cgcgtcagaa 2812141 gatgccgtgc gttccatcct tagcgtcggc gctggtggtg cggcgctcac agcacatccc 2812201 gcctgggaaa ccgctgcgag ccgaagttga tcgccggccg caccaaagct tccgctcgcc 2812261 gcaacactgg cgcttcctcc attatgcccc caaatgtgaa gagtccggac cgatcgcgaa 2812321 cgcatcgcaa ccgtgtcgcg ggggatctgc gccctcattc ggaggtggct tccccggctc 2812381 gccgcaacat cgtttccgcg tgcgtgagca ctggagagtc gaccatctgg ccttcgaacg 2812441 cgaacgcccc acgctcgctt cgcgacgcgg ccaaaacccg ccgagcccag gccagcttct 2812501 cgtggctggg tcgataggcc ttgcgcacca ccgggatctg actcgggtga atgcacacgg 2812561 tcacgtcaaa gcccaccgcc gcggcgtctc tggcctcttc ctgcaagccc tcgacatcga 2812621 ggatatccag atgtacggca tcgagcgcga gacggccgaa cgcggacgcg gcgagcagga 2812681 tggtcgagcg gacatgtcgg gccacgtcac gataggcacc gtcggcccgc cggctcgagc 2812741 taccgccaag ggtggcgatc aagtcttcgg caccccacat cattcccacg gtgggatcgg 2812801 ccgcggcgat ttcggcggcg cacacggcac cgcgcgcggt ctccaccagc gcgatgacat 2812861 cacgcggcgc aagctcgatg acttgggccg ccgattcggc cttgggcagc atcaccgtgg 2812921 tataggcggt gcctgcgagg gcctccagat cgcgggcctg atcagcagta ccgcccgcat 2812981 tgatacgcac caccgtgcgt tccgggtcca gcggggtgtc ccgcaacgca ttgcgcgcgg 2813041 caggcttctg cgcctcggcc acgccgtcct cgaggtcgag aatcaccacg tcggccgcgg 2813101 cggcagcctt cgcaaagcgt tccggacgat cggcagggca gaacagccac cccggaccgg 2813161 cggcacgcag gttcattgcg cctccttaat ggactgcttt tggaccagcg tcgtgcgcac 2813221 cgcgcgggcc accacctcac cgtgctggtt gcgggcgatg tgctcgagtg tgacgatgcc 2813281 ctcgccgggc cggcttttcg actcacgttt accggtacag acggtctctg cataaagcgt 2813341 gtcgccgtgg aagaccggtt tgggaaacga cacctcggag aagccgaggt tggccacgat 2813401 ggtgcccaac gtcaactgcg caaccgacag accgaccatc gtcgagagag tgaacatcga 2813461 gttcaccagc cgctcgcccc gaaaacccgg ctgctgccca gcccacgccg cgtcgaggtg 2813521 cagtgactgg gtgttcatcg tcagcgtggt gaacaacacg ttgtcggcct cggtgaccgt 2813581 gcggccgggc cggtgcaggt atgtggtgcc gatctggaac tcttcaaacc acaagcctcg 2813641 ttgaagaatc cttctgccga ctgtggatcc tgcgacgcga cacgccgata cggcgtcgtc 2813701 agattcacgg tcgccggcgt gctttgtcac tgcagtccca acgatcgcgc gataagcatc 2813761 agctgcactt ccgtggtgcc ctcaccaatc tcgagcacct tgctgtcgcg gtaatgacgc 2813821 gccaccggat attcgttcat aaagccgtat ccgccgtgta tctgggtggc atcgcgggag 2813881 ttgtccatcg ccgcctccga ggagatcatc ttcgcgatcg ccgcctcctt cttgaagggc 2813941 ttgcccgcca acatctttgc ggcggcatca tagtacgctg tgcgggcaac atgggcgcgt 2814001 gcctccatcc gcgcgatctt gaagccgatc gcctgataag cgccgatcgg ctggccaaac 2814061 gactgacgct ggttggcgta cttgacgctc tcgtcaacac agccctgcgc cgcgccggtg 2814121 gccagcgctg caatcgcaat ccggccctcg tccaggatgg acaagaagtt ggcatagccg 2814181 ctcccccggg ctcccagcag gttctccctc gggacccgcg catcggcaaa tgtcagtggg 2814241 tgggtgtccg aggcgttcca gccgaccttg ttatagaccg gttccacggt gaatcccggt 2814301 gtgccgctgg gcacgatgat cgtcgaaatc tctttcttgg catccgcagc ggttccggtg 2814361 gtcccggtaa ccgcagtgac ggtgaccagc gatgtgatgt cggtgcccga gttggtgata 2814421 aattgcttgg agccgttgat gatccactcg tcaccttcga gacgcgccgt ggtgcgggtg 2814481 ctgcccgcgt ccgatcccgc tcccggctcg gtgagaccaa aaccggcgag cgcacggcca 2814541 gacgtcaagt cgggcaacca cttctgtttc tgctcctcgg taccgaaccg gtagatcggc 2814601 atcgcaccca ggcccaccgc ggcctccagc gtgatcgcta ccgattggtc aaccttgccc 2814661 agctcctcaa gtaccagcga cagcgcgaag tagtcgccgc ccatgccgcc gtactcctcc 2814721 ggaaacggca gcccgaacag gcccatctct cccatcttgg cgacaatttc gtatgggaag 2814781 ctgtgttccg catcgtgttt ggccgatacc ggcgcgacca cggtgcgcgc aaaatcggcc 2814841 accgtatccc gaagatcttg gtattccttg ggtaatatcc ccccagaaat cgttgtagtc 2814901 gttgtggtca tgatcctagt ccttgatcct cgccagtacc tgttcgactt tcacctgatc 2814961 gccaacggac accaacacct gtacccgtcc cgaaaccggc gcctccagcg agtgctccat 2815021 cttcatcgct tccaccacca ccaccacatc acccgcagag atctgggagc cggactcgac 2815081 ctgcacggcg atcacgctgc caggcatagg gctgacgacc tccgccggcc gcgcacccac 2815141 ggcgcggtga atcttgtgct cctcggcctc gcgcaggtgc caagtcccgc gctcgtcggc 2815201 gatccacagg tgccggtcag cctctgccca ccgataatcc cggcgcagcc cgcttatcgt 2815261 cacgctcatc tgttctcggg tgacctgcac gctcgcacaa tcgatctcac catcgccaac 2815321 ctgaacctgc gccgactcgg gtggccccca caccgaaacg gtctcgctgc gcagcggggt 2815381 gcgcatggcg gtgcggaccg gtgccatatg gcccccgccg cgccatccgg acggcgcggc 2815441 ccacaggtcg ccctgtgcgc gccgggccag ggcccactgg cggtagaggc cgccggcagc 2815501 tagcacgtcg tcaggcgccg gccgcgcagt gaaatcggcc gatcgctcgt ccagtacagc 2815561 ggtgtccaaa tccccgaccc gcacccgctc gtcggcgagc agaaagcgaa ggaactcgac 2815621 attggtctgc actcccagca ccgcagtccg cgccagcgcc tggtccagcc gatccagcgc 2815681 ttcctcgcga tcggccccgt gcgcaatcac cttggtgagc aacgggtcgt aatcactgcc 2815741 gaccaccgtg ccgcctagca gtgacgaatc cacccgcacc ccggggccgg cgggttcgaa 2815801 caccgccagc acccggccgc cggtgggcag gaattcccgc gcgggatcct ccgcatacac 2815861 ccgagcctcg atcgcgtgcc cacgcagctc gatgtcgttt tgggcgaagc ccaacttttc 2815921 gcccgcaccc acccgcaact gccactcgac caggtccaat ccagtaatcg cctcggtgac 2815981 cgggtgttcc acctgcagcc gggtattcat ctccatgaaa aagaactcgt cggggcgctg 2816041 cgcggagacg atgaactcca ccgtgccggc gccgacgtag tccacgcagc gggcggtgtt 2816101 gcaggccgcg accccgatgc gctcgcgggt ctgcgggtca agcagtggcg acggcgcctc 2816161 ctcgataacc ttctggtggc gccgctggag gctgcactca cgctcaccca gatgcaccac 2816221 gttgccgtga gcgtcggcaa gcacctgcac ttcgatgtgc ctgggccgca acacaaaccg 2816281 ctccaggaat agcgtatcgt ccccgaacga agacatggct tcgcgccggg cactcaccag 2816341 cgcctcaggc agccgcgccg gatcttgcac taaccgcatc cctttgccgc cgccgccggc 2816401 cgacggtttg atcagcaccg gatagcccac ctcagcggca gcggtgacca gcgcgtcgtc 2816461 cgtcagcccg gcgcgcgcca caccgggcac caccggaaca tcgaaagcgg cgaccgcgtt 2816521 cttggcggcg atcttgtcgc ccatcacctc gatcgcgcgc gccggcggac ccaggaacac 2816581 cacccgggcg cgttcacacg ccgcagcgaa atcggcattc tcggcaagaa acccgtagcc 2816641 cggatggatc gcctgggctc cggtgcgcgc cgcagcatcg agcaccttgc cgatatcgag 2816701 gtagctttcg cgtgctgggg cgggccccag ccgcaccgca gcgtccgcct ccaagacgtg 2816761 gcgggcatcg acgtcggggt cgctgtagac cgcgaccgac cggatgccta gccggcgcag 2816821 cgtccgaatc acccgaaccg cgatctcacc gcggttggcc actagtacgg tgtcaaacat 2816881 cgcctcacat ccggaagacg ccgtagccaa cctgatccag cggagcgtgg gcacacaacg 2816941 aaagggcaag cccaacaacc gttctggtgt ccgcagggtc tatgataccg tcatcccaca 2817001 gccgggcagt tgaatagtag gggttaccct ggtcttcgta ctgcgctcgg atgggcgcct 2817061 tgaacgcttc ctcctcgtcg ggtgaccagg gtgtgccggc cgcggacagc tgctcgccgc 2817121 gcacggtcgc caacacggac gcggcctgct caccgcccat caccgagatc cgcgcgttcg 2817181 gccacatcca caggaaccgg ggcgagtacg cgcgtccgca catcgaatag ttacccgcac 2817241 cataggatcc gccgatcacc acggtcaact tgggcacccg cgcgcaggcc accgcggtga 2817301 ccatcttggc gccatgcttg gcgattccgc cggcctcgta gtcgcggccg accatgaagc 2817361 cggcgatgtt ctgcaggaac agcagcggaa tcttgcgttt gtcgcacagc tcgatgaaat 2817421 gcgctccctt gagcgcggat tcgctgaaca acacgccgtt gttggcgacg atcccgaccg 2817481 ggtggccgtg gacgcgtgca aacgcagtca ccagagtctt gccgtattta gccttgaact 2817541 cgctgaattc gctgccgtca acaatccgca cgacgacctc atgaacgtcg taagggaccc 2817601 ggggatccgg gggcaccaca tcgtagagct cggcctgcgg gtacttgggc tcgaccgaac 2817661 ggcgcacatc ccattgggcg ggttcgcacg ggccgaaggt gtccgcgatc gcgcgcacga 2817721 tccgcagcgc gtcctcgtcg tcgtcagcca gatggtcggt gacaccggac gtgcgcgagt 2817781 gcaagtcgcc accgccaagt tcctcggccg agacgatctc gccggtggcc gccttcacca 2817841 gtggcggacc gccgaggaag atcgtgccct gctcacggac gatgacggcc tcgtcactca 2817901 tcgccggcac ataagcgcca cccgccgtgc aggagccgag aaccgccgcc acctgcggaa 2817961 tgcccttggc gctcatcgtc gcctggttgt agaagatccg cccgaaatgc tcgcggtcgg 2818021 gaaacacctc gtcttggcgg ggcaggaagg cgccgccgga gtcgaccaga tagatgcacg 2818081 gcagcatatt ctgcagcgcg acctcctggg cgcgcaggtg cttcttgacc gtcatcgggt 2818141 agtaggtacc gcccttgacc gtcgcgtcgt tggcgacgat cacgcactgg cgtccggata 2818201 cccggccgat cccggtgatg attcccgcgc ccggggattc gtcgccgtac atgccgccag 2818261 cggccagcgg agccagctcg aggaaagggc tgcccgggtc gagcaggcgg tccacccgtt 2818321 cgcggggcaa cagcttgccg cggctgacgt ggcgtttccg ggcgcgttcg ttgccgccca 2818381 gggcggcggc ggcgagctta ttgttcaatt ccgccaccag ccggcggtgc tcgtcggcga 2818441 acgagggggc tattgctatc gacggggtgg tcactgggtc gccaggtccc gaagcacaag 2818501 gggcggttga gtcttcgcga ctacctcgtc gaccgacacg ccaggagcgg tctggaccag 2818561 gtgcaggccg tcagcgcaga catcgatgac cgcgagttca gtgacaatgc ggtcgacgca 2818621 gcccacaccg gtcaacggca atgtgcaccg ctctaggatc ttggggctac cgtccttggc 2818681 ggtgtgctcc atcatcacga tcaccttgcg agcgccgtgt accagatcca tcgcgccgcc 2818741 catgcccttg accatcttgc cggggatcat ccagttggct aggtcaccgg tgaccgaaac 2818801 ctgcatcgcg ccaagcactg cgacatcaag gtggccaccg cggatgattc cgaacgaagt 2818861 cgacgagctg aagaatgcgg cacccggcag cgtggtgacc gtctccttgc ccgcgttgat 2818921 caaatcggca tccacgtcct cccgccgcgg gtaggggccg acgccgagga tgccgttctc 2818981 cgagtgcagg acgacatgga cgccgtcggg aatgtggttg ggaatcaggg tgggcatgcc 2819041 gatgccaagg ttgacatact gaccgtcttc gaactccgcg gccacccgtg cggccatctc 2819101 gtctcggctc cagcccgggg cgctcattgc cgcaccgtct ccctctcgat cttcttggcg 2819161 gggttgggca catgaaccac ccggtgcaca aacacgcccg gggtgtgtac ggtggcaggg 2819221 tcgatctcac ccggctcgac caagtgctcg acctcggcga tcgtgatcct gcctgcggat 2819281 gcgcactccg ggttgaagtt ggccgcggcg tggcggtaca tcaggttgcc gtgccggtcc 2819341 ccctgccagg catgcaccag tgcgaagtcg gtccggatcc cccgctcgag gacataggtg 2819401 acaccatcga actcccgagt ctccttggcc ggcgacacca ccgccacccc gcccgaggcg 2819461 tcgtagcgcc acggcaaccc gccgtcggcg acctgggtac cgacccctgc cggtgtatag 2819521 aaggccggta tgcccatccc tccggcccgc aaccgctcgg ccagcgtgcc ctgcggggtc 2819581 agttccacct cgagctcgcc cgcgaggaac tggcgggcga actccttgtt ctcccccacg 2819641 taggaggaga ctgtccggcg aattcgcttg tgttgcaaca atagtcccag accaacaccg 2819701 tcgattccgc agttgttcga gactgtttcc aggtcggtga caccgctatc caccaacgct 2819761 gcgatcagtg cttcggggat gccgcaaagc ccgaatccac caaccgcaag cgacgacccg 2819821 ttggctatgt ctgcgaccgc ctccgcggcg gtggccacca ccttgtccat accgcagagc 2819881 ctcctagcat ttcagttaat tatcattaac tgaggtgaga ataccattgc ccccgcggtg 2819941 cgtctaggga cctcactgtt ggccgcggag gtattcgagc gcctgttgtc gcatctccac 2820001 tttgcgtact ttgccggtga cggtcatcgg gaactcgtcg acgatccaca ggtaccgcgg 2820061 gatcttgaat cgcgcgatgc ggcccatgca gtactcgcgc agccgctcga tggtcagttc 2820121 cggcgcgtcg tttctcagct tgaccaccgc catgagctct tcgccgtatt tggcgtcggg 2820181 caccccgatg acgtgaccgt cgacaatatc gggatgcgtg tggaggagtt cctcgatctc 2820241 ccgcggcgag atgttctcgc cgccccggac gacgaggtct ttgatccggc cggcgatccg 2820301 cacgtacccg gacgggtcca tctcagccag atctccggtg tgcatccagc cgtcggcgtc 2820361 gatcacctcc gcagtcttct gcgggtcatt ccagtacccg gccatcaccg aatagcctcg 2820421 cgtgcagaac tcgccgacca ccccgcgcgg gaccgtctcg cccgtggccg gatccaccac 2820481 cttgatctca aggtgtggac ccacccgacc gaccgtgccg acccgtcgat ccaccgagtc 2820541 gtcggcgcgc gtctgcgtgg aaaccggtga cgtttcggtc attccatagc agatcgagac 2820601 cccgggcata tgcatgcgtg agatcacctt gcgcatcacc tcgaccgggc acgcggcgcc 2820661 ggccataatc ccggtgcgca gactgcccag ttcgtagtcg gtgaagtccg gcaggcccag 2820721 ctcggcgatg aacatcgtcg gcacgccgta caagctggtg catcgctcgt cctgcaccgc 2820781 gcgcagcgtg gccgcagggt caaagcccgg cgccgggatc accatggccg ccccgtgact 2820841 ggtggccgcc agatttccca ttaccatgcc gaagcagtgg tagaagggca ccgggatgca 2820901 aatccgatct tgtgcggtgt acccgagcag ctcgcccacc aggtagccgt tgttgaggat 2820961 attgcggtgg cttagcgtga cacccttcgg gtatgccgtt gtgccggagg tgtattggat 2821021 gtttaccgga tcactgccgt ctagcctcgc cgcggtctgc tgcagcgcag gcagatcggg 2821081 ctcggcaccc gccagcgcgt cccagcgatc gctttccagc aaaatcacgt cggccagatc 2821141 ggggcatcgc ggcccaacct cggccagcat cgcggcatag tccgcatcct tgaaactcgc 2821201 tacggcaatc accatcgcga caccggactg cctaagcgca tactccactt cgcggacccg 2821261 ataggcgggg tttatggtca ctaggatcgc gccgatctca gcggtcgcgt actggacgag 2821321 cacccactcc caccggttcg gcgcccagat gccgacccga tcgcccgggc cgatccccgc 2821381 ccgcaccagc cccgtcgcca gccggtgcac gtcagtcagc agttcgctgt aattgaaccg 2821441 tcgccgggcc accatgtcca cgagtgcttc ccgatgtccg tacctggcag cggtcgctgc 2821501 gaggttggcg ccgatggtcg actcgagcaa tgatggcgca ctcggaccgc gatcatagga 2821561 aagccgattg gggtctacga cttccgcggc tgccacggtt cctccgcctg gtgcctaccg 2821621 catgtctgac tcgcgttaac atcgaatagc tcgtgctacg ttagtgacga ttaaccgaag 2821681 tgtccagcat gagtcgtgta cggagaccgt cgtgacagcg tccgccccgg acggtcggcc 2821741 cggccagccc gaggccacaa atcgtcgcag tcagctgaag tccgaccgac gattccaact 2821801 cttggcagcc gccgaacgat tgtttgccga acgaggattc ctggcggtgc gactggagga 2821861 catcggcgcc gccgcgggcg tcagcggtcc ggccatctac cgacacttcc ccaacaaaga 2821921 gtcgctgctg gtggaattgc tggtcggcgt cagtgcgcga cttcttgccg gcgcacgcga 2821981 tgtgacgacc cgcagcgcta acttggccgc ggcactggat ggcctcatcg agtttcacct 2822041 tgacttcgca ctcggcgaag cagacctcat ccggatccag gaccgggacc tagcgcacct 2822101 gccggccgtc gctgagcggc aggtgcgtaa ggcccagcga cagtacgtgg aggtctgggt 2822161 cggggtgctg cgcgagctga acccaggcct ggccgaagcc gacgcccggc tgatggccca 2822221 cgccgtgttc ggactgctga actccacccc gcatagcatg aaagcggccg acagcaagcc 2822281 ggcacggacg gtgcgtgcac gcgccgtcct acgggcgatg acggtcgccg cgctatcggc 2822341 cgcggatcgt tgtctatagc tcgccaggct gcgatgtcgc cgggtacatc agcgcacccg 2822401 cacccagcgc gggtaccctg catgccatga ggtggacatg aacgatccac gtcgccccca 2822461 gcggtttggt ccccctctat ccgggtacgg gccgaccgga ccgcaggttc cccccaatcc 2822521 gccgaccgcc gacccggctt acgccgacca gtcgccgtat gcatccacgt acggcggtta 2822581 cgtttccccg ccgtggtctc caggagggcc cccgccaagg cctccccagt ggcccccagg 2822641 cccccacgag gccagtccga cccaacagct gccgcagtac tggcaatacg accagccccc 2822701 accgggcgga tttccccccg acgggctgac tcccccgcca ccgcaagggc cgagaacgcc 2822761 gcgctggttg tggttcgccg ccggctcagc cgtgctgctc gtcgtcgcgt tggtcatcgc 2822821 actggttatc gccaacggct cggtcaaaaa gcaaaccgcg atcgagccgt taccccccat 2822881 gcccgggcct agcccgacac gtccgaccac gaccacaccg accccaccct cacccagcgc 2822941 cgcaccggca ccgacaacta cgaccggtac gcccagtgag acggtcgccg gcgcgatgca 2823001 aaccgttgtc tacgacgtca cgggggaagg ccgggcaatc agcatcacgt acatggatag 2823061 cggcaacgtc atacagaccg agttcaacgt cgccctgccg tggcggaaag aggtcagcct 2823121 gtcaaagtcg tccttgcatc ccgctagcgt cacgatcgtc aacatcggcc acaacgtcac 2823181 ctgctcggtc accgtggccg gggttcaggt acgccagcgc accggggcgg ggttgaccat 2823241 ctgcgacgct cccagctagg aggattgcgc cgtcgtcagc gcaccgccgt gccgcgacac 2823301 ctgtacccgc agcatgagca gcaggccggt tgtcaacacg aggcacacgc cgccgagccc 2823361 ggcacggacc gtgtggaaca cgtcgacgaa gaccgaaaac aaccacggcc ccagaaacga 2823421 caccgcccgg ccggtcatcg tgtagagccc aaaggccaca ccctccttgc cgtgctgcgc 2823481 catatgcagc agcagagcgc gtgccgacga ctgcgccggc ccgatgaaca cacacaacag 2823541 cagcccgcac gcccagaacg ccgttgggcc cgacaacgtc agcaacgtga gcgccgcggc 2823601 gatgatggcg gccagtgatc cgacgatgac cggtttggac ccgatccggt ggtcgacgaa 2823661 cccacccagc acggccccca ccgcagccac cacgcttgcg gccgcaccaa agatcaggac 2823721 atcggcctgg gtgagcccgt atgcgttgac gccaagtacc gcgccgaagg cgaaaatggc 2823781 cgccagcccg tcgcggaata tcgcgctggc caccaggaag tagaccaagt tgcggtcgcg 2823841 ccgccactcc gcgctgatct ccgtccacag cttgcggtag ccgcccagca ggccggtcga 2823901 aggatgagac gccgcaccgg aatcgggtag tcggtgcgcg accaacaaca atggcaggcc 2823961 cagcaacgcc aaccaggccg ccgcaaccag catcgccatt cgcacgttga gtccgttcgc 2824021 gacgggtagc tgcagcaggc cgcgctgcga accgctacct gacatgaaac ccagatagat 2824081 caccagcaag agcgcgacgc tgccgacata gcccgacgcc caaccgaagc cggagatccg 2824141 gcccgccgtg ctgggtgtgg acagttggcg cagcatcgcg ttgtacggaa cgctggacaa 2824201 atcgctggac gccgcggtgg ccgcgagcaa aaccagcccg gcccacaggt agcgggggtc 2824261 gtcgcggatc aggaacattg cgcaggtcag cgcgaccgcg gtgccggtca gcacagacag 2824321 tgccacccga cggcggtgcg gagactccac ccacacgccg acgacgggcg ccagcacccc 2824381 gatggtcaac ccggcgaccg cccccgcacg acccaaccaa ctcgccggtg aggtgccgcc 2824441 cggcagaccc tgacccacgg cgctggtcag gtagacggag aacacaaagg ttgtcacgat 2824501 cgcgttcaga ccggtggaac cgcaatccca catggcccac gccaccaccc ggaagtgcag 2824561 gagggtgccc gcgcgcgacc ccgggttatt catgtccggc actttattgc ttttggcagc 2824621 gacccgctgc gcccggctcc gccgcgctcg cgatcgctac gtgtctacga ttggcgcatg 2824681 ccgatacccg cgcccagccc cgacgcacgt gccgttgtca ccggggcttc gcagaacatc 2824741 ggcgcggcgc tggccaccga actggccgca cgcgggcacc acctgatcgt caccgcacga 2824801 cgcgaggacg tgttgaccga gttggctgcc cggctggccg acaagtaccg cgtcacggtc 2824861 gacgtgcgac cggccgatct ggccgatccg caagaacgat cgaaactggc cgacgagctg 2824921 gctgcccggc ccatctcgat cctgtgcgcc aacgcgggta ccgcgacatt cggcccgatc 2824981 gcatcgctcg atcttgccgg cgaaaagacg caggtgcagt tgaatgccgt ggcggtgcac 2825041 gaccttacgt tggcggtgtt gccgggcatg atcgagcgca aggccggcgg catcttgatt 2825101 tctggttcgg cggccggcaa ttcaccgatt ccctacaacg ccacctatgc cgcgaccaag 2825161 gccttcgtga acaccttcag cgaatctctg cgcggtgagc tacgcggctc cggcgtgcac 2825221 gtcacggtgc tggccccggg cccggttcgc accgagctac cggatgcctc cgaagcgtca 2825281 ctggtcgaga agctggtgcc ggacttcctg tggatctcga cggagcacac cgcccgggta 2825341 tcgctgaatg ccttggagcg caacaagatg cgcgtcgttc cgggtctgac gtcaaaggcg 2825401 atgtcggtgg ccagccaata cgctccgcgc gccatcgtgg cgccaatcgt gggtgccttt 2825461 tacaagaggc ttgggggcag ctaggcatca cttccggcgg cggcgcccgg tgccgaagat 2825521 gctgcgggtg atctcgcgtg cggtggtgtt gaggacgctc ttgacggtcg gattcttgag 2825581 tatctcctcc cacaccgcgg ggccctgcgg ctccaccgga gcgggcatcg gcggaacttc 2825641 aaaatcgtcc ggccagggca gcggatcgta ctgccccctt ggggctgggg cctcctgggc 2825701 cggggcctct tgcgccggcg cgagtttggc gctcagtatc tcgtgggctg acgggcggtc 2825761 gatggtctgg ccatatacgg cctgcaacga gcttgcctgg gccgcggcgc caatcgcttc 2825821 ggctccgatc gcggccatca gcgaccgtgg cgctcgcatc ctggtccagg cgaccggcgt 2825881 cggtgcgccc ttctccgata gcacggtgac gacggcctcg ccggtgccca gcgacgtcag 2825941 cgcggactcc aagtcgtaga catcggtttt cgggtaggtg cgcacggtct tgcgcagcgc 2826001 cttgtggtcg tcgggggtaa acgcgcgcag cgcgtgctga attcgggctc ccagctggga 2826061 gaggacatcg ttgggtagat ccgtgggcag ctgggtgcag aagaacaccc caacaccctt 2826121 ggaacggatc agcttcacgg tctgctcgac ctgctcgaga aaggccttcg aggcatcggt 2826181 gaacaacagg tgcgcctcgt cgaaaaagaa caccagtttg ggcttgtcca ggtcacccac 2826241 ctcgggcagg aaggtaaaca ggtccgccag cacccacatc agaaaagtgg agaacatcgc 2826301 cgggcgcaac gcctggctcc cgaactccag caacgagatg atgccccgac cctggctgtc 2826361 gacgcgcagc aggtcctcgg gcctcagttc gggctcaccg aagaatgtgt cggcaccttc 2826421 ggcttccagg ttgaccaaag cccgcaggat gaccccggcc gtcgtgggcg acaccgcccc 2826481 aagggatttc agctctacct tgccctcatc actggtcaga tgggtaatga ccgcccgcag 2826541 atccttcagg tccagcagcg gaagtcctcg ttggtcggcc cagtgaaaga tcaggcccag 2826601 tgtagattcc tgggtagcgt tgagccccaa cacctttgcc agcagaatcg ggccgaagct 2826661 ggagatggtc gcacgcaccg gaaccccgac gccactggca cccagcgaca ggaactccac 2826721 cgggaaggcc gtcggcaccc agtcgtcacc ggtgtctttc gcacgggcgg ccgtcttgtc 2826781 ggcggcctcc cccgggcggg ccagaccgga caaatcgccc ttcacgtcgg ccatcagcac 2826841 tgccaccccc gccgcactga gctgttcggc gatcagctgc agcgtcttgg tcttgccggt 2826901 tccggtggcc ccggcgacca gaccgtgccg gttgacggtg gccagcggaa tgcgaatctg 2826961 cgcgctcggg tcgggttcgc cgtcgacgac gacggtgccc aactgcaggg cctggccttc 2827021 gacggtgtaa cccgccgcga tccgctgcgc gggcccgcca ggtccaccgg ccgccgattc 2827081 ggtgcccata gctggatcac actacttgcc cgggggagac agccgcgacg gctcgcatgc 2827141 gcctacgctg agcgctgtgc aagacgaact ggtgtggatc gactgcgaga tgaccgggct 2827201 cgatctgggt tcggacaagc tgatcgagat agccgccctg gtcaccgatg ccgatctgaa 2827261 cattctcggc gacggggtgg acgtggtgat gcacgccgac gacgccgcgc tgtcgggcat 2827321 gatcgacgtg gtcgccgaga tgcactcgcg gtcggggctg atcgacgagg tgaaggcatc 2827381 cacggtcgac ctagcgaccg ccgaggccat ggtgctcgac tacatcaacg agcacgtcaa 2827441 gcagcccaag accgccccac tggccggcaa ctcgatcgcc accgaccgcg cgttcatcgc 2827501 ccgcgacatg cccacgctgg actcgtttct gcactaccga atgatcgacg tcagctcgat 2827561 caaggaactg tgccggcgct ggtatccgcg gatctacttc ggccagccgc ccaaggggct 2827621 gacgcaccgg gcgctggccg acatccacga atccatccgc gaactgcggt tctaccgccg 2827681 caccgcgttc gtgccccagc ccggcccttc taccagcgaa atcgcggccg tcgtcgccga 2827741 gctttccgac ggggcgggcg cgcaggaaga aacagattcg gccgaggcgc cccagagcgg 2827801 ttaatatcga cgtcgccgct cattagcccc cgcgggggcg gccggcggcc atggtgagtg 2827861 tagttcagtt ggtagagcac caggttgtga tcctgggtgt cgcgggttcg agtcccgtca 2827921 ctcaccccaa cagggcggca gggtgtttat ggccctgggc cctttgctgt ccccgccgag 2827981 ggcgtgcacc tgcaaccttc gtgtctatga tctggtcctg tggcgaattc gaccactcgc 2828041 cgcgactgca cctggccgcc cgctccaaca cccgccggtc aaactgccat cggacagcat 2828101 gttccccgtc gccagggcct tggcaggtgt cggtttgccc ggtctatttg cctgccgcgc 2828161 aactatcgca cctccggcgt ggcttgttcg gactcactcg gtgtttcgtg ccatggttga 2828221 tgtgcaggac gtttgagacc ccaaccagct agaccaggat gagcgcttct gcgtcagccg 2828281 acaaggtcgt atgcgagtgc tgcgagctct gtgttcctaa acagctcgcg tcagcgattc 2828341 gcaacccata cggactcgtc cgtgggtggc gctgtcgcat ctgtaacgag caccaaggcc 2828401 agccggtcaa gatggcgcaa gaccacgaag aggaggtccg catccgttgg ggcgagacgg 2828461 tggacgaact ccacgctgcg ctggaccgcg ccgggccaag gccagggacg tggtgtacga 2828521 gtgaaggttc ctcgcgtgat ccttcgggtg gcagtctagg tggtcagtgc tggggtgttg 2828581 gtggtttgct gcttggcggg ttcttcggtg ctggtcagtg ctgctcgggc tcgggtgagg 2828641 acctcgaggc ccaggtagcg ccgtccttcg atccattcgt cgtgttgttc ggcgaggacg 2828701 gctccgacga ggcggatgat cgaggcgcgg tcggggaaga tgcccacgac gtcggttcgg 2828761 cgtcgtacct ctcggttgag gcgttcctgg gggttgttgg accagatttg gcgccagatc 2828821 tgcttgggga aggcggtgaa cgccagcagg tcggtgcggg cggtgtcgag gtgctcggcc 2828881 accgcgggga gtttgtcggt cagagcgtcg agtacccgat catattgggc aacaactgat 2828941 tcggcgtcgg gctggtcgta gatggagtgc agcagggtgc gcacccacgg ccaggagggc 2829001 ttcggggtgg ctgccatcag attggctgcg tagtgggttc tgcagcgctg ccaggccgct 2829061 gcgggcaggg tggcgccgat cgcggccacc aggccggcgt gggcgtcgct ggtgaccagc 2829121 gcgaccccgg acaggccgcg ggcgaccagg tcgcggaaga acgccagcca gccggccccg 2829181 tcctcggcgg aggtgacctg gatgcccagg atctctcggt agccctcggc gttgacgccg 2829241 gtggcgatca aggtgtgcac tccgacgacg cggcctgcct cgcgcacctt gagcaccagg 2829301 gcgtcggcgg cgaggaaggt atacgggccg gcatcgagcg ggcgggtccg aaacgcctct 2829361 acggcttcgt cgagctcttt ggccatgatc gacacttgcg acttggaaag ctttgtcaca 2829421 ccaagtgttt cgaccaggcg ctccatccgg cgagtggata ctcccagcag gtagcaggtc 2829481 gccaccacgc tggtcagtgc gcgttcagct cgcttgcggc gctgcagcag ccagtccggg 2829541 aaatagctgc cctggcgcag cttggggatc gcgacgtcga tggttgcggc acgggtgtcg 2829601 aaatcacggt ggcggtagcc gttgcgctga ttggaccgct catcgctgcg ttcgcggtag 2829661 cccgccccgc acagggcgtc ggcttcagcc cccatcaagg cggcgatgaa cgtcgagagc 2829721 agcccgcgca gcagatccgg gctcgcctgt gcgagttggt cagccagaag ctgctcggtg 2829781 tcgataagat gagaagaggt cattgcgtca tttccttcga ttgacttttg ctggtcgttt 2829841 cgaaggatca cgcgatgacc gcccactact gggctacgac acgcccaccg gccttacctg 2829901 cccgtacacc acacccctgg acgtaactcc gcgccgatga ctacaaggca aagatgctgg 2829961 ctgcgtttag gtctcacgat gccgtgttaa gagagttcga aaagctcggc cgctatcatc 2830021 agtcaaccgg gcacggctgc ctctgcggca aacgaaactg tgcaacgctg tccatcatcg 2830081 atagcaacca gatatatggc cacattgacc gaatgaatcg ccgcgacgag cttggctaag 2830141 ccacaacaga gagaaacaag gtggacgaca tcgcagcatt caagctcgac agcctgccgg 2830201 acataacctt cacggtcacg cgggccataa gttcgggtgg ggaaaatccg gcggggtttc 2830261 tcaatttcgc ggcgcgccga gagcaaccgg agatcctggg tggtggaggc cgtcctggac 2830321 cggtgggccc ggaagcggtc gatactccac gtattcgcgg cgggaaggtg ccgttcgtct 2830381 tccggacgct accgggttac accttctacg ccagccaaat cgagccgaga gtgggcgacc 2830441 cggaagggcc cacactcctg gctggattcg gcaatatccc tgagacttcg cagcggtcgc 2830501 cgggatggat ccgcatcacc tgcacggggc cagacgacga tgaggagctg gaattctttg 2830561 gattcgccgg gccagagtcc taaccaggcg atgaacgaag gatcggcgac ggctacgaac 2830621 ctggataggc aagaatggcg caccgaagcg tcactcgacg tcggccggcc ggagaacgca 2830681 ccacgaaacg aaacacttgt gaggaccaag attctccgat cttcgggtag cacccgagag 2830741 catgtcgtta ggcctgtcgg catgggcgcc ggcaaggtcc tccagccggg tgatgggcgt 2830801 cgcagagtac agacgtggct gctgtccgca ggctgaagcg gatgaagtga cagcccagcg 2830861 gcgcggccag aagctctcag aaagtccatc cctgcgcctc gatatagccc atcagagtta 2830921 gccacggcac gccaagagcg tcgcagacat cggggatgcg cggcttttcg atattgccgc 2830981 tcgcagtctc ctgggtaacc accgtggcgt tgttcaccat cgcgagcgcg atgacgaacg 2831041 ggtcggcggc gcttcgcctg ccaccctgcc ggaccatgtt cgggtgcaac cgcaagatgt 2831101 gccgcgccgc ctgctggatc tgttcatcca gaggacagaa caagccagtt tgcccgtccg 2831161 cccaccgctt cgcgtcatca tcacgcctgg cgagttcgcg ctgaacctca tcgaccgacc 2831221 tgatctgacc ggcgctgatc gcatcctcaa cccggcccca cagactgcga aacaccgctg 2831281 gccgaaacag atcacgccgt ccgttcagga tggcgctggt atcgaaggaa tagagcacag 2831341 cggttagacc acgctccgca gttcggctga ctcagccaac ttcggaatct ggctgacctt 2831401 ggcgtcgagg tagatcgcag cggtgttgct gtcgatgacg cggcggcggt gggcgtcggt 2831461 caccgcccgc acgtagccct taccgaggtc tcggacggta ttgcggtacc agttgccgcc 2831521 cccagccgat cgagcccgtt cggcctcgtc ctcgtgagcc gcgatgaact cggcgcggcg 2831581 ctgtcggtag acctcgaccg gcacgattcc aagcgtgctt agccgccgca ggaacgcctc 2831641 ggcactcacg ccaaaatgcg ccgcgaccgg ccgcagcgat tcgtaatccc acgaagacgg 2831701 agtctcgctg cgaacgatga cctccggccg cgctcgcacc acgtcggcag gcatcagcac 2831761 agcggcggcg atcgcgttgc atcgagcctc cagcgatcgg tcctgggtgc tcggatgagc 2831821 atcggcgatc acgtcacaca agccctcggt gtgcagcacc acgtgcacga actcatgcag 2831881 cagcgagaac aggcgagggc gggggtggtc gctgccattg agcacgatca ccggcaattc 2831941 gtcgaaatac agacacatac cgcgcatctc gtcgatagcg accttgccgc cgcgggtcgc 2832001 gagcaccaga acgccggacg tttcgatggc cgacacccag gcgttcagat gctcgtaagg 2832061 gtcaaccgag gccacgggga taggcaacgg gctgacctcg atcaaggcct tgcggattcg 2832121 tgccgcgata tccgcgtcgg cctcgtcgcc ggataggggc aaacgccagg cgcccggtat 2832181 ctcccggtcc tcggcgtcgg ccagctctag cgcgaagtcg cgttgcgtgt gtgcgcgacg 2832241 gaactcctcg tgaagccccg gcgtccattg acccgacgcg gcaccgtcca atcgtcggaa 2832301 gtcgcgtaag gtgtcaaacc cctcgggcgg ctcggacagg aagaacaccg ccagcgagcg 2832361 cttgtagacc tcggcggcct tgcgcagctg cgcgatggtt ggcacaacct cgcccacctc 2832421 ccaagccgcg acgcgatcat caggcaggcc gagtttgcgg gccgcggcta cctcggtcag 2832481 gccacacgac tcgcgagccc aacggagcac cgagctctcc accgaagcgg gaatcgaccg 2832541 catggcaatg atgatgcacc accccaccca cattggatgg ccgataccca cgcttggttc 2832601 ccgaccagcc gattaaccgc tcccccgcaa cctggcgaga cggtactcgc cgcgttcggc 2832661 gtctgggacg gtgtgccgtg agaccggctg cggtgtaacg ccttacgaac tagtgagcag 2832721 ggtgcaacgg gacggccgcc cactcgtcct gtccagccca acggacgtat agctgatttg 2832781 gaaggggatg gccccaagcc gctatcaaga ccatgttgag cccctctccg gggcgaatca 2832841 ccgtcttctt cgggacattt cgagtgatcg catcgattcg cgagaggtca atctcgacat 2832901 cttctgcgat atcgtcgccg atgttgcgca acacaaagcg gattttgtct gggttctcga 2832961 cacgccaccg gacgttaggt gccttaccgg acctgccgac cgccggtccc accgcccacg 2833021 agtacgcgaa cttggcagtc ccggtatgcg gccgaccggg ctttcgctcc caggtctccg 2833081 caaacctgcg caccgctgcc gcatcccaca ccgcgcctcc acgcaaatct gccaacggag 2833141 cgggaaaccc tgctgtcgac ctcaattggt gcaccctctg acgcgaaacc cccaactcat 2833201 ccgcgatctc agccgcagac atcaactcgg gcgttgtgaa cgcctcagcg cgcagacgat 2833261 gctctggctc gctaatgatc tgcacagcaa tgggactctt ggcttgaact accggcataa 2833321 cctcgccagc catcttggcg agcgcgtcga acacactcca atcgccgggc gcatagaccg 2833381 tgacgtcaat gccgtgtcct gggacccgag ataccagtgc gtcgaagccc tcgagctgcg 2833441 tctcccaggc gtccatggtc tccatcgaag ggtcagcatc aaacgtgaag gtgacgaccc 2833501 agtcggctgt cactgtgcgc cttccttcct gtgctgtgcc cgccgttcct tcttgctcgg 2833561 cggtggccac gtcaggcccg ctttcttcaa cgcgcccaat aggtctcgca tccggcggta 2833621 ctcgttgcta ggtgttgccg gaaaccgagc aatatagacg ccctgggggt tgtagaagcg 2833681 ggtgtagccg ctggcgtcat cctcaaccgt ccattgttgc gattgcgccc acttcgcgat 2833741 cttgatgatt gcgctgttca catcgtctcc ccaacacttg cgatgtgtca agagtaatgg 2833801 caagacgcga catcgtaaag gttttagccg gactcattcg aatatttgag cgatgtagcc 2833861 agtgagtggg tgctccgatg atcacggctt cgcgcgagct cgccccggct ggcctcatga 2833921 tcgccgaccg gctgggcttc acccggtctc agtggttctc ccagtcgcga aggaacaccc 2833981 gagtgtcgtc atggtccgcg cggttgggca ctgcggccat cggatgtcat cgtcgtacaa 2834041 cgaaccatgc ggtcgttgca gggcgtgtat caggcgctgt tggttgtctg gcgttcctcg 2834101 cggcgcgctt acgccttggc gttaccggcg cgccactggt cccacggaat gttccaatcg 2834161 ccgagcccgt cgatccccgg cagggtgcca cccacggtat tgaccacctc gacgatgtcg 2834221 ccgcgcttga catggtcgta gaaccactgc gcgttgctcg ggctgacgtt caggcagcca 2834281 tggctggtgt tggtgtggcc ctgagccccc accgaccacg gcgctgagtg cacgaagaca 2834341 ccgctgtagg agatctgggt ggcccagtcg acatcggtgc gatatccgtt gggcgagttg 2834401 acgggtacgc cgtaggtgga cgagtccatg atgatgtgct tgtaccgcga gccgacgatg 2834461 tatatgccgt tggccgtcgg ggtgctgtcc ttgcccatcg acgtcggcat ggactttacg 2834521 acctcgccat tcacccgcac ggtcagtatc ttggtgttgt cgtcggcggt cgcgatcacc 2834581 tcgtcgccga tggtgaagtg cgtctgcacg ttgtcctcgc cgaacattcc ctcgcccaag 2834641 tcgacgccgt aggtgttgac cgccacatca acggccgtac ctggcttcca gaaatgctct 2834701 gggcgccaac gcacttcacg gttattcagc cagtagaacg cgccctccac gggcgggttg 2834761 gtggtgatct tgatggcctt ctcggccgcg ccccggtcag cgatgttctc gtcgaatcgg 2834821 atcgccaccg gctcgccgac acccacgacc tccccatcac cgggcatgac gtagggcatg 2834881 gtcaggtgcg cgggggaact ggtctggaag gtcagctggc gggtcgccgc gccacccagt 2834941 ccaagcgccg tcgcgttcag cgtgtagcgc ctgttgtagc cgagctgctc agtggtcgac 2835001 cagcgcagtc cgtcggggct gagtcgaccg gccaccggcc tgccgttgtc gttgaccatg 2835061 gtgacggccg ccagcacacc gtcggcggcg gtcaccgaca ccggtgcatc cacggtgacg 2835121 ccgacggcgc cgtcggtgac cgacgcggtg agcttgggca ccagcagatc ggcgaacggc 2835181 gtgcccttgt ccgcgatgac cttgatcggt gcgggtccgc ggccgctgcc gcatgcgacg 2835241 gcaccgatca tcacggcggt catcatcagc gcggttaacc aggctctccg aaccctggtc 2835301 ctacccgcct gagctgcaat ccccaccttt ggcatgcctt ccctcacctc ccccactgcg 2835361 tcgtgaccga gctagactcg gctgtagtct aggtcctgac tggccgccac gctgcgatgc 2835421 tgataccaag ttcagtgtga gatttcacgc gagagcgcaa ggcctgttaa tgtgccttgg 2835481 ctaggtaatc gaggcgccgt tagctcagtt ggtagagcag ctgactctta atcagcgggt 2835541 ccggggttcg aaaccctgac ggcgcacagg tcaacgcgtt atttcggatg caccagccgc 2835601 agctgtcccg ttgggcgacg atttccgtat tcggaaggtg cacgccggtt accggatttg 2835661 ggcagcggat cggatcggag ccacggggat agctcgacga gacagccggg gaagccgcag 2835721 aaaattgggt tgtaggcgcg tgcaatagct acgctgcatg tggacagcgg ggaagaggtt 2835781 agttgtgtcg cgtctgatcg tggctccgga ctggctggcg tcagcagcgg cggaggtgca 2835841 aagcatcggc tcggcgctga gcgcggcgaa cgccgcggcc gcggccccca ccaccctatt 2835901 ggtggccgcc gccgaagacg aggtatccgc agcggccgca gcgctattcg ccaactacgg 2835961 ccgggagtat cagacgctga gtgtgcggtt cgcctcgctt gatcagcagt tcgcgcaagc 2836021 actgaactcg gcggcagcgt cgtatcagac ggccgaagcc acgggtgcgt cgctcgtgca 2836081 gaccgcgaca caaggtgtac tgggtgtgat caatgcgccc accgagttca tgttcggacg 2836141 ctcgctgatc ggcgacggag ctgacggcac ggctgccagc cccatcggcg agcccggcgg 2836201 aatcctgtac ggcgacggcg gaaacggcta ctcccagacc acgcccggag ctgtcggcgg 2836261 agccggcggg tcggccggat ttatcggtaa cggtggcgcc gggggcgccg gcgggcccgg 2836321 cgccggcggc gggactggag gcctcggcgg ctggttatgg ggcaacaacg gcgccgctgg 2836381 caccggcgac ccagttaacg ttgccgtccc cctgcgcgtg gaaaacaact ttccgctggt 2836441 gaacctcttg gtcaaccgcg ggccaactgt ccccatactg ctggacacgg gatcctcgag 2836501 tctcgtcatc ccattctgga aaatcgggtg gcagaacctg ggcttgccca ccgggttcga 2836561 tgtcgttcac tacggcaatg gcgtgagcat cgtctacgcc gacgtgccca cgacggtcga 2836621 tttcggtggc ggcgccgcta ccacaccgac ctccgtccat gtcggtatcc tgccgtaccc 2836681 gcgaaacctt gacagcctgg tcctcatcgc ttccggcggc gctttcggac ccaacggaaa 2836741 cggcatactg ggcatcgggc cgaatgtggg gtcgtatgcc gtcagcgggc ccggcaacgt 2836801 tgtcacgacc gatttgccgg gccaactcaa cgaaggcacc ctcatcgaca ttcccggcgg 2836861 ctacatgcag ttcggcccca acacgggcac tccaatcacc tccgtgaccg gggcaccgat 2836921 caccgtgctg aacgttcaga tcggcggcta cgaccccaac gggggctact ggtcactccc 2836981 ctcgattttc gattcgggcg gcaaccacgg aacgcttccg gcggtgattc tcggcacggg 2837041 ccagacaacc ggttacgccc cgccgggcac ggttatctca atctcaatac atgacaacca 2837101 gacgctgctg tatcagtaca cgacaaccgc gagcaacagc ccagtggtca cggcagaccc 2837161 ccgactcaac accggtctaa ccccgttcct gctgggaccg gtatatatct cgaacaaccc 2837221 tagcggtgtc gggacggtgg tgttcaatta cccgccaccg tagctttccg ccgggtccag 2837281 aaccgccgcg ccataagggc gtcacgttcg tccagaacct cggctaagtg cggagtgcgc 2837341 aatcatggtg cactgcaatg ggtttcccat cggtaactcc gggttggtca gcgattcctg 2837401 atcttgtgga tgaccacgac gacgaccaca gacccgatcc cgaccagtga cacggtcacg 2837461 atgggcttcc tgaggaaggc gatcacccga gtttttgcgt cgtcggcgag gcggcggggg 2837521 ttggcgcgct cggcgaggga atcgatggtc gccgccagtt ggtcgcgggt ttggtcgatc 2837581 tcctgcttga tggtattggg atcgcggtcc accacgtgct gtcctccaag ttctccagtc 2837641 gcccactgcc ggcctgcgtc gcccgccgaa ctaccctaga tcagtgacca aaaccacgcg 2837701 tctgaccccc ggagacaaag cccctgcctt caccctgccc gatgccgacg gcaacaacgt 2837761 gtcgctggcc gactaccgag gacgccgcgt catcgtgtac ttctacccgg cggcctcgac 2837821 accgggatgc accaagcagg cttgtgattt tcgcgacaat ctgggcgatt tcaccactgc 2837881 cggcctcaac gtcgtcggta tctcccccga caagccggag aagctcgcta cgttccgcga 2837941 tgcccagggc ctgacgtttc cgctgctgtc tgatcccgac cgcgaggtgt tgacggcctg 2838001 gggtgcctac ggggagaagc agatgtatgg caagacggtg cagggggtga tccggtccac 2838061 cttcgtcgtc gatgaagacg gaaagatcgt cgtcgcgcag tacaacgtca aggccaccgg 2838121 ccacgtcgct aagcttcggc gcgacctgtc ggtatagccg cgagcttggc cagcagcagc 2838181 gcttcggcgg tcgccgcgcg ttccagcaca cccagatgca ggctttcatt gacactgtgc 2838241 gcctgcgttc cggggtcttc taccccggtg acaaggatgg tcgcctgcgg gaacgcggcg 2838301 gcgaactcgg cgatgaacgg gatcgacccg cccattccca tatcgatcgg atcggcaccc 2838361 cacgcctgcc gaaacgccga ccgcgccgca tcatagacag ggccgctcgc ctcgatggcg 2838421 tagggctgtc cgacctcgcc gcgcgtgaca gtgacctggg cgccccaggg ggcgtgccgc 2838481 cgcagatggg cctccaccgc gtccaggtgc gccgtggcat cgcctccagg cgccacccga 2838541 atactgatct tggcccgggc ccgcgggatc agcgtattgg acgctgccgc aacggatgtg 2838601 gtgtcgatgc cgattacggt gatcgccggc ttcgcccaga gccgctgcgg caccgagccc 2838661 gtgccgattt ccgatactcc gtccagtaga cccgactcag cgcgtacccg tccagccggg 2838721 taatccacac gcgccgcggt gctttcgtgc atgcccgcca cggccacgtt gccgtcgtcg 2838781 tcgtgcaggc tggccaacag ccgcactagc acggtcagcg cgtcgggaac gacgccgccc 2838841 cacaacccgg agtgcagccc gtggtcgagg gtggcgacct cgacgacgca gtcggccatt 2838901 ccgcgtagcg acaccgtcaa agccgggatg tcggtgctcc aattgtccga gtcggcgatg 2838961 acgatcacgt cggctgccag cgcgtcacgg tgggcggcga gcaaccggcc cagtgacggc 2839021 gacccggatt cttcttcacc ctcgacaaag accgtgacgc ccaccggcgg tctgccgccg 2839081 tgtgcccaga atgcggccac atgcgtggcg atacctgcct tgtcatcggc ggtgccccgc 2839141 ccgtagagcc gcccaccacg ctcggtcggc tcgaacggcg gcgacaccca ttgcccgcgg 2839201 tcaccctcgg gctggacgtc gtggtgggca tagagcagca ccgtcggcgc ccccggcggc 2839261 gccgggtacc gcgcgatcac cgccggcgca ccgcgctcgc tgacaatccg cacgtcgtca 2839321 aaaccggcct gcgacaacag gtctgccacc gcacgcgcgc tgcggtgaac ctcgtcgcgc 2839381 cgatctgggt cggcccacac cgattcgatg cggaccagct cctcgagatc acaccgcacc 2839441 gacggcaaca cctcacggac gcgctcaacc agctcgcgag cagacgcaga gtcgcatgaa 2839501 aatccggatt tcgatgcgat tctgcgtctg ctcgcgctca cggggcctcc aggatggcga 2839561 ccgcggccgc ggtatcccct tcgtgggtca gcgacacatg gatcgtcacg tcggccaaat 2839621 actcagcgat ggccccggtc agcctgaccc gcggcctgcc ccacatatcg gtgaccacct 2839681 cgatatcgcg gtggatgtcc tccggcaaca ccggccgctg cgcgaaccgc gatccggacc 2839741 aggccttgat caccgcctcc ttcgcggccc agcgggccgc caggtgccgg gccgccgacg 2839801 aactcttgtc cgaggcgtcc cggcgctcac ccggggtgaa ggtctcggcg aacaccgttc 2839861 cgggctggtc gacctgctcg gcgaaatcgg gaatggagac caggtcgatc cccacaccga 2839921 cgatgcccat gggcggccac gttaatcgat ggcccagtcc ggcgacgatg cggtccgcgt 2839981 tgggggcacc tcccgcttgc gggggacgga ccgaagagat gccgggcagt caggccaagg 2840041 agcacgcggc gagcgtgtat ccatggcggc gacacgccga acaccgtcgc cctgagcgca 2840101 cgttcggcgc ccaacggcag ggtcagccga tatacgcctc gccgtcaccc agccgggccg 2840161 ccggattcag cagcatcgac gcctcctgcg gccgctcggg cgcgtggtgg tcgaagcgac 2840221 ggtcaccggg ccgctggtac atcggcgcac caccggcaat cgccgaggcc agccggcgct 2840281 gaccggccag caggcgggcg tcggcacgcc gctggtagtc cgcgcgctgt gcgggatcca 2840341 gcgaggcgat gaacgcctgc ggatgcacca acgcgaccag gcccgacaca tggccgaacc 2840401 cgaggctggt cagcatgccg gccttgagtg ggaacttgcc gccgagccgc aacgtgtcac 2840461 gcacccacac gaaatgcgcg gagccggcca gctcgtcgtc gacgcagtcg aggctgcggt 2840521 tgggtgggat caccccatcc cgcaatatct ggcagagccc catcatctgg aagaccgccg 2840581 cgccgccctt ggcgtggccg gtcaggctct tctgcgacac cacgaacagc ggggcgccct 2840641 cggaacggcc cagggcgtcg gcgagccgtt catgcaactc ggtctcgttg ggatcgttgg 2840701 ccagcgtcga ggtgtcgtgc ttggagatga ccgccacgtc gtcggcggcc acgcccagct 2840761 tggccagcgc ccgcgccagc ggtgaatcct tgccgccgcg gcccgccccc agcgcgccca 2840821 ggcccggggc cgggatcgag gtgtgcacgc cgtcgccgaa cgactgcgcg aacgccacca 2840881 ccgccagcac cggcagcccc atccgcagcg ccaggtcccc gcgggccaac aggatcgtcc 2840941 cgccgccttg ggcttcgacg aagcccagac ggcggcggtc gttgggccgg gaaaacttcg 2841001 agtcgtggat gccgcggccg cacatcatgg acgtgtcggc ggtggcggcc atgtcaccga 2841061 atccgatgat gccctccagc gtcaggtcat ccaggccgcc ggccaccacc agttgagcct 2841121 tgcccaaccg gatcttgtcg acaccttcct cgaccgacac cgcggcggtg gcgcacgcgg 2841181 ctaccgggtg gatcatcgca ccgtagctac cgacgtagga ctgaaccacg tgcgcggcaa 2841241 tgatattcgg caagacttcc tggaagatgt cgttcggctt gttgcggccc aacagattgc 2841301 cgtggtacat cgtctgcatc gacgtgccgc cgcccatgcc ggtgccctgg gtgttggcca 2841361 ccaaactcgg gtgcacgtaa cgcatcacct cggccgggct gaaaccggac gacaggaacg 2841421 cgtcgacggt cgccaccatg ttccataccg ccaaccggtc gatggaaccg gccatgtctg 2841481 cgctgatgcc ccacaccgtc gggtcgaacc cggtcgggat ctggccgccg acgacgcggg 2841541 acagcttggt ctttcgcggc acccggatct cggtgccggc cttgcggatg acctgccagt 2841601 cggtggagtc gggcaccggc cggatgaccg tgtgctcggg atcgaactcg acgaaggcgc 2841661 gcgcatcggc ctccgaggac accacgaacg cgaagtcctt ctccaggaac accgacacca 2841721 gcagcggcga ggcgtggtcg gggtcgatcg cgccgtcatc aacgaattcg cgaatgccga 2841781 cgcgctgcac cacggcgtcg tggtagcgct gcaccaactc ggattcgtcg accatttcgc 2841841 cggattcggt gtcgtaccaa ccgggttgcg ggtcgtcctc ccagcggatc aacccagtgg 2841901 tccaggccag ctccagcacg ccggccgccg acagctcgtt ttcgacctcc atctcgaacc 2841961 gggtgcgtga cgagccgtac gggccgattt cggcgccgcc gacgatcacc accaggtcgg 2842021 ccgggtcgac atcgaggtcg tcccattgcg gcggcggtgc gggggtgaaa ccccggggcg 2842081 gcgacggcag cgcggcgatg gcgccagggg cctcggcgtc ctcgtcgacg gccgccgctg 2842141 ccgacatctg ctcgcgcgcc ttggccgcca gctcggccat gtcgaggttg gcctcggcca 2842201 ggcccccggt caggtcggcc ttgatcggcg aacgcgccgc agccaccttg gattccgcat 2842261 cacacaggtc gagcagcagc gccgccatct cgtcggtcga gtaggtggtg accccggcct 2842321 cttcgacggc ggccacgatg gcatcgttgt ggcccatcag cccggtgccg cgggtccagc 2842381 cgatgagcgc gtgcgccagg ctgacccgtg ccgcccagga cgactcggcg tgccagcggc 2842441 tcaccacggc atccagcgcg gacttggctt cgccgtaggc gccgtcgccg ccgaacatgc 2842501 cacggttggg cgagccgggc agcaccacgt gcagccgcga cgcgatgtcg cgttcggcgc 2842561 cgatcgtcga caggccgccg atcagccgtt gcacggccca cagcagcact ttcatctcca 2842621 tctcggcgcg cgaaccggcc tccgacaggt ccccgaccac gcgtggcgcc gcgaacggga 2842681 acagcagcgt cggggtctgc gcgtctttga tgtgaatcga ctgcggccca aggctttcgg 2842741 tctgttcggt gccgatccat tcgaccaggg cgtcgacgtc ggagtaggac gccatgttcg 2842801 ccgcgaccag ccacagcgcc gcgccgtaac gggcgtggtc gcgatacagc gtgcggtaga 2842861 acgccagccg ctcctcgtcg agcttggagg tggtcgcgat gacggtggct ccgccgtcga 2842921 gcagccgagc caccaccgac gcggcgatcg aacccttcga agcgccggtc accacggcaa 2842981 cttcgccgcc gtagcggccg ggttcggggt tctcggcgcc ggcggcgatg cggccgtaca 2843041 gcgatgcatg gatctgccgg cccgcggcca gcgacttacc ttgccaccag gtagcctggg 2843101 tcgccacgac gtggccggca ccctcgaagc gctccgccag gcgcggccag tcggcgtcga 2843161 tgtcgccctc gtcggtcagc cacagcttca ccaggtcctc gcgggcgctg gcccagcggt 2843221 cgtcgaatac gacggccttc ttggggtcga acaccggtgc caccaaccgc ggccagtccg 2843281 ctcccagttc ggcggtgacc aagtcgatca gctcggaatc gggggcggcc ggcaaggcgt 2843341 tgacggggtc gtccagtccc agctgcccca gcaccaggcg ggccgcggag gccagcacgc 2843401 cctcacggcc ggtgatttgg tcggtgaact cgctgagcgc ggccgcgtcg atggtggcgc 2843461 cgccaccact accggccgac ggcagcgcta ccgaaacgcc ctggcgcgcg gccaccgatg 2843521 cgaccgccgc gtcgatgacc ttgtcgacgg aggcggcatc ggccagcgcg ccctcgtgca 2843581 ggtggcccat ggcgccgccg cgaacgctgc tgccctcgcg ggtgcccagc gcgacctcga 2843641 cggtgacatg cttggcccag ccctcaccga gctcccaggt cttcttcacc cgctcggcga 2843701 tggcgccggg ccgcttgccc gacggtccga ggacggtgcg aagctggtcg ttgatggcgt 2843761 cggaaagcac tgggccgtaa ggcttgtagg tgcgcgccag tttggtcacc tgtgagcgca 2843821 gaccggccag gtccgattcg gcggcgccgt caatggcacc gaggttcagc tcggagccca 2843881 ggtccaccag cagctggttg cgccgcgacg acgcaccgtc ggtgatggac tcgatggagt 2843941 cgagttcttc gatctggtcg atgcgcatct tggccgagag cgcgatcagc gccagcgtgg 2844001 catcggcggc gtcgaaaacc agatcgtcgg gacgcgggcc cgccgacgaa gcggccggcg 2844061 cgacgggggc ggcttccgag acgacgtccg gcgcgggcga ttccgcgacc ggctcgtctt 2844121 cctccggctc cggctccggg tcggtgtcgg tggcgaacag caccgcggca tcacgctcgg 2844181 cgttgagcac ttccactgtg ctgtgggcgt attcgggcag tttgagggtg ttggtggcaa 2844241 gacccgccac cgtcggtgag ctcttcacac cgatctcgac gaatcgctcc acacccagcc 2844301 cgccggcggc ctcctcgatg aacagcagat cctgcgtctc gatccagcgc accgggctgg 2844361 cgaattgcca tgccagcagc tcgatgaaca ccgtgcgcgc catctcgcgc ggacgctcgc 2844421 gaagccaggt gtcgtagtcg gcgaggatct cgtcgagcgg ctcggcgggc accaaatccc 2844481 ggatttcctg gatgaagtcg cggtccaggg tgaacaaccg cggcaccagg ttgggaatgt 2844541 agcgcccgat gatcaggtcg gggtccgcgt cgcgcggcat gacccggtcc agcgagcgcc 2844601 ggaattcggc caccccgacc cgcagcactc gcgagtggaa cggaacatcg atgccgggca 2844661 ccaaaatgaa cgaccgtcgg ccgccggtga gctcgcggcg ccgctccacc tcggcctcga 2844721 gcgcctcgag gccgcgtacc gtgcccgcga tcgcgtattg cgagccacgc aggttgaaat 2844781 tcacgatctc caggaattca ccggtgctct ccgcgatccc ggcgacgaac gcgggcacgt 2844841 cggcgtcgtc gaggtcgatc tgggacggcc ggatggccgc cagccgatag ttggagcggc 2844901 cgagctcgtc gcgcggaacg atgtcgtgca tcttcgaccc gcggtgaaac accatctcca 2844961 gcaaggcttc cagttggtag atgccggtca cgcaggccag cgcggtgtac tcgccgaccg 2845021 agtggccgca cgcgatggcg ccttcgacga aggctccctg ttcacgcatc tcggcgacct 2845081 gcgcggccgc caccgtcgcc atcgcgacct gggtgaactg cgtcaggtag agcaccccgt 2845141 cggggtggtg gtagtgcaca ccgctggcga tgatgctggt cgggttgtcg cggaccacgt 2845201 gcagtaccga gaagcccagg gtgtcgcggg tgaacttgtc cgcggtgtcc cacaccttgc 2845261 gggccgcctt ggagcgggcg cgcacctcca tgcccatgcc cttgtgttgg atgccctggc 2845321 cggggaatgc gtagaccgtc ttgggtgcgg ccagtcgcgc ggaggccgac atcactagat 2845381 ccgacccgac gcgcgcggcc acgtccacaa tctctgcgcc ctggtcgatt ccgacgcgct 2845441 cgacgcggaa gtccacctcg tcgccggggc gcaccatgcc caaaaaccgc gcggtccagc 2845501 cgaccagccg ggccggtggc cgggcctgcc cgtcggtggc ggtcaccgcg tgttgcgccg 2845561 cggccgacag ccacatgccg tgcacgatcg gcgactccag gccggcaagc agcgcggcgg 2845621 cccggtcggt gtgaatgggg ttgtggtcgc cggacaccac cgcgaacggg cgcatgtcga 2845681 ccggcgcggt gatcgtgacg tcgcggcggc gacggcgcgg ggtgtcggtg gcgttcgccg 2845741 acaccgcgcc accggctcgc gccgggtcgg cgagctcggc ggaaccggtg cgacccagga 2845801 tcgcgaatcg ctcctcgaga gtggcgatca cggcgccatc ggcgccggta acgacgaccg 2845861 agaccggcac gacgcggccc atgtccgtat cggttgcgtt ggcagccgtt gcggtgacgg 2845921 tcaattgggc cgggaccgtg ggcagctgac cgaccacgcg ggcggcgtgg tccagatgca 2845981 ccaggctcag caggccttcc accaccggct caccggtgtc ggtgaccgcc gatccgatgg 2846041 ccgcgaaaac cgctggccaa caagggccga cgagcgcgtc gggcacgttg gtgaggctgg 2846101 gtgccagcgg ctcaccgaac gtggcggtga cgccggtgtg gtcggcaaca cgctcggggt 2846161 gccagtccac cgtcaaagtg gccgtcccgt tggccaccgc aggcaagaac tccgggctgt 2846221 cgacaccggc ggcgatcgcc agcaccgtgc gcatggcgct ggtggcgtcc tcggtggcga 2846281 tcaccggggt gccgccatcg acggtgttgg ccggcaacgt gaatcggatg tcgacccagg 2846341 tgcccgagac gggcacgctc aaggcgacgt cgtcgccgtg cgtctgcagc cgggcgccgg 2846401 tggatgagtg tgtggcgcgc gggttttcgg gtccatcgtg cacctgccat tcggccgggt 2846461 cggcgatccg atgcaccggg ttggtcacgg tgcgaccggc ccagcgcaca tcgggtgcgt 2846521 cgaggacgac agccaacggt ccggccacgt cggcgcggcc cagccggcgc gacgcgacat 2846581 ccttcggctc gacaccggcg ccgagcactt catcgattgc ggcttgctcg aaacggtcca 2846641 gcaactcacc gacgggttca tccatccggg tgatgccggc taccgacgcg gtgcccggaa 2846701 tgatgcacac cgcatcggcg tcgtagcggg cgtcgtgggc ctgccacagc gagtcgctgc 2846761 gccaccagcg ccgcacgtcc tggtcgatca ccggcacgaa gttgaccggc ttgcccagcg 2846821 tcttgcacaa cgtcacgaaa aagggcacat ccgcgggatg caactgcacg gtctcggcgt 2846881 cggggtagcg cgccagcagg gcggcgatcg cctgctgcgg attgtccagc aggccagcat 2846941 cggtgaatag cgtctggatc gggccgaaat cctgtgggtg caaccgggct tcggcacgct 2847001 gcagcatctg ctcgaagcgg tcccgccagg tgtcggccag ccacgggctg cccaccgagg 2847061 cggtgtcggc ggtcgagttg ccttccccga tggccagttc gacgtagcgc cgcagccact 2847121 gcaggtaggt catgtcggcg acgtcgccga agtagggctt ggcggtcttg gccatcgccg 2847181 cgatgatctc gtcgcgacgc tccgcgaccg cctccgcgtc accggccacc tcgtcgagca 2847241 gccgcccgca ccgggatgcg ctgttgtcga tctcgtggat atcggcaccg agctgactgc 2847301 ggctggaggc catgccgccc tgcgcttttc cggcgctgat ccattggtcg gtgccctgag 2847361 tgtcgacgag catccgcttg accgatggcg acgtggtgga ttccttggtg gccatcgccg 2847421 cggtgccgac caggatgccg tcgatcggca tcaatgggaa gccgtaggcc tgcgcccagc 2847481 gcccggacaa atattccgca gcccttctcg gggtgccaat gccgccgccg acgcacaccg 2847541 tgatgttggc gcgtgagcgc aactccgagt aggtagccag cagcaggtcg tcgagatcct 2847601 cccaggaatg gtgcccgccg gcgcgcccgc cctcgacgtg catgatcacc ggcttggtgg 2847661 gcacctcggt ggcgatgcga atcaccgagc ggatctgctc gatggtcccg ggtttgaaca 2847721 cgacgtggct gatgccgatg tcgcccagtt cgtcgatcag ctcgacggcc tcgtcgaggt 2847781 ctgggatgcc ggcgctgatc accacgccgt cgatcgcggc gccggactgg cgggccttct 2847841 gcaccaaccg cttgccgccc acctgaagct tccacaggta gggatcgagg aacagcgcgt 2847901 tgaactgata ggtgcggccc ggctcgagca ggccggccat ttgttcgatg cggttaccga 2847961 agatctcttc ggtgacctgc ccgccgccgg ccagctcggc ccagtgcccg gcgttggccg 2848021 ccgcggcgac gatcttggcg tccacggtgg tcggggtcat gcccgcgagc aggatcggcg 2848081 agcggccggt cagccgggtg aacttcgtcg agagcttgac cctgccgtcg gggaggcgaa 2848141 ccacggtcgg tgcgtagctc gaccaggccc gggcaacctc gggggtggcg ccgacggtga 2848201 acaggttgcg ctggccaccg cgggtagccg ccggcacgat gccgatgccc aggccgcgga 2848261 tcaccggtgc ggtcagtcgg gtcaggatgt cgcccggccc caggtcgagg atccagcggg 2848321 cgccggccgc gtggacacgg gtgatctcgt cgacccagtc gacctttctg atcaagatgg 2848381 catcggccag ctcccgagcc aaggcgacat cgaggcccgc cttctcggcc cagcccgcga 2848441 cgatgtcgat cccgtcggat agccgcgggg tgtgaaagcc cacctccacc tgcaccggct 2848501 cgaagaccgg cgagaagacg tcgccgccgc ggaccttgtt cttgcggtcg gcttcttcct 2848561 tctcggagat ctggcggcaa taaagctcga aacgcgacag ctgctcgggg gtgccggtga 2848621 tgacgacggc acgccggccg ttgcggatgg acaacaccgg tggcagcacc gtgcgcacgt 2848681 cctgggcgaa ctcgtcgagc aaccggccga tgcgctcggg gtcggcgttg gtgaccgata 2848741 ccatcggcgg gcgatcgccc aggacggaaa ttccgcgccg gcgggccacc agcgttccgg 2848801 cggcaccgat caactgggcc aaggcaaaca gctcgacgtc gcgtgcccca ccagccttga 2848861 gggcttccac cgccagcaca ccttgcgaat gccccgccat ggcgaccggc ggggtggcca 2848921 cgaggtccat gccttgacgg gccagcgccc gggtcgccgc gatctgggta agcaacacgc 2848981 cgggcaccga cacggcggcc gacgtcaggt gcttgtcgga cggaaccggg tcctcggccg 2849041 ccagtgcgcg tacccattgc agcggctcga aaccgatcgg gcgcaccaca atcagctcgt 2849101 cggtgaccgg atcgagcaac agctctgcct caccgaccaa cgtcgccaac tcggtttcta 2849161 tcccggtggc cgacaccagc tcttcgaggg tttccagcca ggcgctgccc tggccaccga 2849221 atgcgacagc gtagggctca ccagccatga ggcgatcgac cagagcgtgg gtggtatgcg 2849281 ggctgtcccc gccgcgatca gcggacaccc ggtcgtgctc gtggatcgtc acggtctatg 2849341 tctccctatg tgcatcggta cgtgtcagtt cgtacagcgg cccaggctgc cgtgcggggc 2849401 atccccgact ccgcaccgac tcccagccga aatcctctga ccggtgtgtt gtcggtgggc 2849461 cggcccgtgg gtcgagcagc gcgacgggct gcatcggcct tataagagtc tcataaggat 2849521 cggtccacct tgtttacaca gatcggttac tggcgagttc tacgtacggg taaccgtgtc 2849581 gtgggtaacg ccgggttcga cggccggcgc gtatgtgttg accaaacgtc ctgcgtgcag 2849641 gtggttacgg tggagtagct ataactgcgc tgatcaaggc agttttgtta tcaaatcgtt 2849701 atgctgggaa ttcgctctac gccgggcgcg tgccgacgcg ccgacccaaa ggccgcgcca 2849761 ttggcggcgt tggcccgggg ttggcaatgc cgtgcagcgg gcgaacgagt gtttgctgta 2849821 gtgcagcggg ggccaggctc ggggcggcag gctaagccca ctgcccgaat tggggcttca 2849881 ggatttggtt gacgtccacc ccgaccccac caaccttgcg cttatcgatc tccacctggt 2849941 gcagatgcgc ggccgggtgg gtgtatccct tgggcgagcc ccagttgtgc tgccagaagt 2850001 acgagcccaa gccatcgttg acggcccagt cgatggtttt ggagttggcg tacacgccgg 2850061 tccgctggtg tccgatcacc gactcccagg accgcagata tggcacgatc tggttcttgt 2850121 actgctcata tgatgggttg tcgtcgatcg aggcgtagat cggggcgctc gtcgggccgc 2850181 cggcagcggc atgcagctcc gacccccgtc tggcgtgctg cacgccggcg ctggcaccgc 2850241 ccagccagtc ggcagtgctc cccttgccgt attgataaca ggacacgatc ttgagcccat 2850301 tgccgctcag gtcacgggcc tcgctgagct ggatcggctt gccaagcatc caggcgccgc 2850361 caggccgccg atcggacacg taccggattg cccccaccgc gccggcagcc ctgatctggc 2850421 tggcggggat gacaccggcg gcgtagtcca acagggtgcc cagcgaaccg gccgatgccg 2850481 gcgcggcgcg caacgacgac gcaacgacgc caagacccag cacgcccgga gtcgccgccg 2850541 cgaatttgag cacatcacgc cgagagaccg acatatgcca cagggtacga caaaaacaac 2850601 aactgtcaca ctggtttcag tggtcacgga tgcatcacac tggcagaaca catgcatgcg 2850661 gccataccga caccggtgcg gtctcgggca ggccgcctct ccctgcgacc actactacgg 2850721 tgtgatcgcc tacgctccca acggcgcaat gggcaaaatc gtcgcgccac cgcactcgag 2850781 gccaggcgga tatcgacgca taagaacttt gcggcgtctt agctgcaaag tgctcagcaa 2850841 cttcaccaac taccacgggg gagtccgacg atcgcgcccg ctggcagaac ctggacgtgc 2850901 aaccagttga gtagttccca cactgcgcgc cgagcgtggg ctggctgcgc cgaatgtgca 2850961 ctggtggcgg cgacacgccc gggcgacgcc gccgtggttg cacgttcggc gtaggcagcc 2851021 ccgtgcgctt gccgggcagg tgtcctcaaa ggtccaacta gacacacata tcagacacta 2851081 gtatgtacat atgaccgtaa agaggaccac gattgagctg gacgaagatc ttgtgcgggc 2851141 agcccaggcc gtcaccgggg aaacattgcg agcgacggtc gagcgcgcgc tgcagcagct 2851201 ggtggccgcg gctgccgagc aggccgccgc gcgccggcgg cggatcgtcg accatctcgc 2851261 gcacgccggc actcacgtgg acgcagacgt gctgctctcc gagcaggcgt ggcgatgacc 2851321 acctggattc tggacaagag tgcccacgtg cgactcgtgg ccggcgccac gccgccagcc 2851381 ggcatcgacc tcaccgacct cgccatctgc gatatcggcg aacttgaatg gctgtattca 2851441 gcacggtcag ctaccgacta cgacagccaa caaacgtcac tgcgcgccta tcaaatcctt 2851501 cgcgcaccca gcgacatctt tgaccgggtt cgccaccttc agcgcgacct agcccaccac 2851561 cgtgggatgt ggcatcgaac gccgcttccg gacctattca tcgccgaaac cgcgcttcat 2851621 caccgggccg gcgtgttgca ccacgaccgt gactacaaac gaattgccgt cgtacggcct 2851681 gggtttcaag catgcgaact ctctcgcggg cgctagcttc gcccgaatcc gtgagcggag 2851741 gcgataatcc ttacaggcca tcaaaaaagt cctcgtcgag ccgtaagagt tcgacggtct 2851801 gcaccgcctg gacaccgact cgataccgca cgagcagctc ggccagccga gcgccgtcga 2851861 tgagttcgat ccgggcgttg atccgctcag cttcctcgcg ggcaccgcgg gaaaacgatg 2851921 acgtggtgat gtagacgccc cggtcgccct gcttgcccag gagggcgccg gcgaactcgt 2851981 ggatcttcgg ccggccaatc gtttggtcga cggcgtatcg cttggcctgc acgtagatgc 2852041 ggtccagccc gagcgggtcc tggctgatga ttccgtcgat gccagcgtca ccggaggcac 2852101 tcgtccgttc caccgcgccg gctcgcccgt aacccatcgc ctccaaaagt ctgataacca 2852161 gatcttcaaa cccggtgggc gacaacgtga gtgccttctt caggatctcc ccctcgacgg 2852221 ctgcccggtt ctccgcaagc gcagcgtcga tgagatcctc gggtgagacc tgcacatcgt 2852281 ccccggacgg tcgcttggcg gtcgcgtcga ctggctgctt ggctttggtt cgctcacgaa 2852341 aagcgatgta cgacgggaac tcccgcagca cagccatgtc gacgcgctcg ggatgcgcct 2852401 tcaggacttg acggcccgtg tccgtgacct ggacgtggcc ccgcgtggga cggtcgagca 2852461 atccggcctg cgacatgtga gtgagagacc agtgcaccct gtcgtacatg gtcctttgcc 2852521 gaccgctggg caacatctgc gcccgctcgt cgtcggacag accgaactcg tcggacatcg 2852581 ccgcgatgac gtccttggcc gacttcgctt gtccatcggc aagatacgcg agaatcggcc 2852641 gcatcaacgt ctgggcatca gggatcgtca tggggagcca ttatccagct ggcttgtcag 2852701 ccctccgaac cggccaagtt gggtaagtcc atccggggct ccgtgttctg acaggcccgc 2852761 tgcaggcgtc gcatcttcct catctgcccc acgtgtaccc ggtcccgccg acctaaaagg 2852821 tcggcatatc cctgccatgc cgggacgcgt gaggcgggtg agacacaagg gaacgtgcac 2852881 ctcgcgcacc gggtcgccag cagccgcgac acgccgtcgt ccagtgccac accgaatgcg 2852941 gtgtcgggct cggcgtcaaa cgctgccgat cggccttgcc tcgtcaggcc gccgacagca 2853001 ccgccctggg ctcacggtcc gcggctccgc cgggatccga ccggcggcgg ctcaaccccc 2853061 tcgatcgtct tgagccggtc gacagaccga tcgaaagacg gccaccggat cgtcccggca 2853121 ggggcgagga agtccggcgt ccgagcaagc accgggcgat tgccctcaac gcggaagaca 2853181 acccgatcac ccgattgcag gccgagcgcg tcgcgcaccg ctttcggaac cgtcacctgc 2853241 cccttcgacg tgacgatggg ttcgtcggag tgcctgcttc accgttgccg tacgccgccc 2853301 gtaccctcac actctgtgga gctgctcgtc gccgccaacc ccgctgaaga ctcgcgcctg 2853361 ccctacctga tccggctgcc ggtgggcgcg ggactggtct tcgccacctc agacgtgtgg 2853421 ccgcgcacca aggcgctgta ttgccatcgc ctcgacatcg ccgactggcc cgccgacccc 2853481 gtcgtcgtcg accgggtcga gctacgcagc tgcagccgcc ggggcgcggc catcgacgtc 2853541 gtcgccgccc gcgcgcggga gaaccgatcg caactggtgc acaccatggc gcgcggccgc 2853601 caggtggtgt tctggcagag ccccaaaacg cgcaaacagt cgcggccggg cgtgcgcacc 2853661 cccaccgccc gcgccgccgg catccccgag ctgcacatcg tcgtcgacgc ccacgaacgc 2853721 tacccctaca cctttgccga caaacccgcg aagacgacgc gggaagccct gccctgcggc 2853781 gactacggcc tgaaagtggc cggccaactc gtggcggccg tcgagcgtaa agcgttggcg 2853841 gaccttactt ctggcgtgct gaacggcaac ctgaaatacc aactgaccga actggccgcg 2853901 ctgccacggg ccgccgtggt ggtcgaggac cgctactcgg agatcttcgc gcactccttc 2853961 gcccgcccga cggcgatcgc cgatgggctg gccgaattgc agatcggctt tcccaacgtg 2854021 ccgatcgtgt tctgccaaac ccgcaagctc gcccaggaat acacctaccg ctatctagcc 2854081 gccgccctca cctggttcgt cgacgatgcc gacgccacca cggttttcga gccggctgcc 2854141 gccgagcccg agcccagcag cgccgagctg cgcgcgtggg ccaaaagcgt cggcctgccg 2854201 gtgtccgacc gggggcgcct gcgcccgcag atcctgcagg cctggcgagc cgcccatccc 2854261 cggtgactac aacacctcga cgaggcctgc ggatgctgaa tcggccagtg cggcatcgaa 2854321 tgtgaccaac cggcccccgt agcgcgcggc caaggcgatg agatggcagt cggtgacccg 2854381 acggtggttg gacaccgcat cgcgatcgcc ggcgctccca acgatcagtg gcacatcgtc 2854441 aggccaaaac gtgtgcccgg caagagaagt catcgccgcc aactgagcga tcgcgatagc 2854501 cggcgtggtc gacacctgca tcacactgcg attgcttgaa attcggacat accctgcctc 2854561 ggtgatcggc gtggtggccc acccattcga ggagaactgc gtgaaccatc gctgcgcggc 2854621 cgcatggtga acgtgattcg gccagcccag cgcgatcagc acattgacat cgagcagtgc 2854681 cgtcacacgt cgtcctcgag cgcgcggacg acatcctcgg aagtcaccgt cggcgcatcc 2854741 ggcggaacat caaaaaccgg aaatccgtca acctcgacaa tcccaaccgg acggagcgac 2854801 ctacgcgcca actcagaaat taccgcgccg actgacttgc cctccgaccg cgcgatgcta 2854861 cgagcatctt ctagaacatc atcatcaatc tgcaacgtgg tgcgcatagc atcatgttac 2854921 ggggcttggg ccagctttca cgcgtcttcg gcgaccccct gcagcacact gtcgccgttg 2854981 acggtgccat tcaaagccga agcgtcccgc ggtacctcga aggccggcag cgcggcacct 2855041 accgtggcga cggcgttgcg cgcggcctcc atccgggcca atgccgcctg ggtgaacacc 2855101 gacaacccca ggtcggggtt gtatccgtgg atctctttga cgtcgagctg ggcaagaaag 2855161 tagatgatct ccttggaaac cagttgaccc ggcaccagta ccgggaagcc gggcgggtag 2855221 ggcaccacga acgtggtgga taccagagtc ttgccctcag ccagccggcg cccggccaag 2855281 ccgatctgca cgtactcacg gtcggcctct tcgtagccgg cgtagaaagc cgaccgcatg 2855341 tcaccgaaag agctggcgtc gtcggggcgg aaggcaaggt cgaactcgct gaaatctggt 2855401 agatgcggca gatcctgcgt gatctcctcg acgtggcgtc ggtgtagagc aaggtcggcc 2855461 ccgctggccg ccttctggct gcggtccaga tcgatcgcca cccgacgcaa cacatcgagc 2855521 agatagtgca cgctcgacca ggtgacgccg atcgtgaaga tcagcaacac gctgttgata 2855581 gacgttttgt tgatctggat gccgaatcgc tccatcagga tcttctcgcg gaagtcgtac 2855641 ccgttcatcc cggtcgcccc gataaacagg gtgagccgcg tcggatcgag cacgaattga 2855701 tcggaccgcc aggcttcgtt ccaatcggcc agagccccct gcctgacctg acggtacgag 2855761 ctgaccgtcg aggaccgaaa ggcatcggga accaggtcgg actcgtcaag gatgcggaac 2855821 cacttgctga tcagccggtc tttgcggacg cgatggcgga acaccagcgc catgttgtaa 2855881 acatggcgga ccagctcgaa cccttcgatg tcaacctgtc ggcgcgccaa gtccaacgag 2855941 gcgagaagtt gctggttggg cgaggtcgag gtgtgggtca agaatgcctc accgaacgcg 2856001 tcccgggtga gcgctttgaa atcctggtcg cgcacgtgga tcatcgatgc ctgccgtagc 2856061 gcggacagcg acttgtgagt cgaatgcgtc gcatacactc ggacccgagc gcggttgggg 2856121 tctggcaaca gccggtgatc aacccactcg gagcggtcca ctccgtccat cgacgcacac 2856181 caattccggt attcctcagc gtattccgca gtggacaaca tctgctcgag tcgctcggca 2856241 gcaatcatcg cggtccgctg ccgggcccag ggcaccgccg tcgcaaacgc ataccacgcc 2856301 tcgtcccaca aaaagcagat gtccggtttg atcgctagca cctcctccat cacccggcgc 2856361 gggttgtaca ccacgccgtc aaacgtgcag ttggtgagca acagcatgcg cacccggtgc 2856421 agctgtccgg cggcctcgag gtccagcagc gcctgcttga tggtgcgcaa cggcacggca 2856481 ccataaatcg cgtactgcgg cagcggatat gcgtcgaggt acatcgggta cgcgccggca 2856541 agtaccaggc cgtagtggtg cgacttgtgg caattgcggt cgatgagcac gatgtcgccg 2856601 gggcgggtca gggcctgcac gacgatcttg ttggcggtcg atgttccgtt ggtgacgaag 2856661 taggtctggt tggcgttcca ggtcaccgcg gctttgtcca tcgccgtctt gatgttgcca 2856721 tgcgggtcca gcagcgagtc cagtccacca gaggttgtcg aggtctcggc catgaagatg 2856781 ttgcggccgt agaactcgcc catgtcgtgc agtgacttgg agttgaagat gctggcgccg 2856841 cgcgcgacgg gaagggcatg aaattggccg accggcgccg ccgcataggc ccgcagcgca 2856901 tcgaaaaacg gtgtggcata acggtttcgt aaacccgcga gcaccgtgct gtgcaggtcg 2856961 gtgacgtcgt tgagccggta gaaggtgcgg tcgtagacgt cgggctcgtc ctgggtctcg 2857021 gcggcgatcg actcgtcggt gagcagatag aggtcgatgt ggggccgcaa ctcacggatc 2857081 cactcggcgc attccaccca gtcgtgggtc tcgtttgcca ccgcttcgtc gccatcggtg 2857141 cccagcagcg tggtcatcag cggcacccgg tcgcgggacc gcagcggcag gtcgtgacgg 2857201 atgatcgccg cctgaatctc gccattcagc gccaccgcgg tgatggcatc ttcgatgctg 2857261 gccaccacga gcaactcgaa ctgcacctcg tcggccggat tgcgcaactg ccgcaggcac 2857321 tcggccaagc tgtccggagc cgtcgccggg gagtcgtcgg cgagcagcac ggtgtagaac 2857381 tgctgctgtt tggcctgcgc taccagctcc tgctccgcca gtgacgcgga ggtgtcgaac 2857441 agcgctgtgc ggtcgccgta ttcggacagc agtcgtacgg ccaacgacac ttcctcggta 2857501 agccgcaccg tggaatgact atccagatga gcgcggaaag tcgccagatt ctgtgccccc 2857561 ggatacagcc agtaccgctc ataggcgccg atgcggtcca tcagccgctt cgcccgagcc 2857621 acgtcgtgtg tggtgtcgag cccggcgagg tcgacctccg ccaggtgacg acacgcgtca 2857681 tcgagcaggt tccaggtgtc caggcgggtg taggacgggt tggccaccgc ggccagcgcg 2857741 gagacatgca gccgtcgcgg gcggacgctg tttgggttca tgtcgtcacc tgttctctgg 2857801 tgcgggtagc gccgtagagt gcaaccaggc aattatcgcg cgcaggaccg ggtcagtcag 2857861 ctaagtcgtc gctgtccgcg atccgccgat tagcccgatt cccggagttg tccacccagc 2857921 gcagcaccgg cagctgcgaa agctcccggc ggcgtgccgg cagatcggtg acgtcaccca 2857981 gcgagcgcac cacggcctgc gtcaggccca tgatggcggc catcaccggt agcagcggct 2858041 gcaacggctt tgccgccgtg cgaacagtgc tgatgcggcg tgccaagatc aggcgttcga 2858101 tcgcgtgata cagccaggcg ccgacaccca gcgcgtagat cgacattgcc gaggcgacaa 2858161 tggtcagccc gtaggcggcg cacaccacgg caccgagcac caccgtcgca gccaggacgg 2858221 ccgcgaccac gacgcgcagc tcgagccggg tcatcacgcc cccccacgca ccgcttgagc 2858281 ggccgcacgc agctgcgggg tcaccagcat gacctggccc agcaccccgt tgacaaagcc 2858341 cggcgagtcg tcggtcgaca gctccttggc cagctggacg gcctcgtcga cgaccaccgg 2858401 ctccggcaca tccgccgcgt ggagcagctc ccataccgag acgcgcagaa tggcgcgatc 2858461 cacggcgggc aaccggtcca gcgtccagcc ccgcagatgc gcggtgatca ggtcgtcgat 2858521 gtgggcggcg tgttcactga cccctcgagc caccgcggcc gtgtacggat gtagccgggc 2858581 aatgtcgggc ttcgcttcgg ccagcgcggc acgggtgtcg accacctcgg ccgcgctgat 2858641 gccgcggacc tcggcctcga acagcagggc caccgcgcgc ttacgggcct gatgtcgtcc 2858701 gcgaaccggc tttctgtccg acatcgtcag gcgttgaccc ggcccaggta gctaccgtcg 2858761 cgcgaatcca cctttagttt gtctccggta ttgatgaaca gcggcacgtt gatctgggct 2858821 ccggtctgaa gggtggccgg cttggtgccc gcgctggacc ggtcgccctg caagccgggc 2858881 tcggtgtgag tgacctcgag ctcgacggtc accggcagct cgatgtatag cggcacgccg 2858941 ttgtggaacg ccacctgcac cggcatgccc tccagcagga accgtgccgc gtccccgacc 2859001 agggcctccg gcagcgggtg ctgctcgtag tcttggctgt ccatgaacac gaagtccgag 2859061 ccgtcgcggt aaaggtaggt ggtatcgcgc cggtcgacgg tggcggtgtc caccttcacc 2859121 ccggcgttga acgtcttgtc gacgaccttg cccgagagca cgttcttcaa cttggtgcgc 2859181 acgaacgccg gacccttgcc cggtttgacg tgctggaact cggtgattgt ccacagctgg 2859241 ccgtcgatta ccaggaccag cccgttcttg aagtcagcag tggtcgccac gtgggtctcc 2859301 tacagaatgg ccagttcttt ggggaaccgg gtcaacaatt ccggggtctg cccggcggtt 2859361 tcaggcattt tcggcgtccc gccagccact accaatgtgt cctcgatgcg gacaccgccg 2859421 cggccgggta aatagacacc gggctccacg gtcaccacgg agcccgccag tagtgtaccg 2859481 gcggatgtga ccccgatgcc cggcgcttca tgtatctgca ggccaacacc gtgtcccagt 2859541 ccgtgaccga agtgctcgcc gtagccggcg tcggcgatca gctggcgcgc tgcagcgtcc 2859601 accccccgca gctcggcacc cggcagcaac gcctgccgac cggcctgttg cgcctcggcc 2859661 accagctgat agatctctag ctgccagtcg gcggccttgc ccaacacgaa ggtgcgggtc 2859721 atatcggagt ggtacccggc gaccagggcg ccgaagtcga tcttcacgaa atcgccgacc 2859781 tgcagcaccg cgtcggtcgg ccggtggtgc gggatcgccg aattggcccc ggcagccacg 2859841 atcgtctcga atgacaccgc gtcagcgcca tgatcgagca tcagggcctc cagctcgcgg 2859901 ctcacctgcc gttcggttcg gcccggccgc aggccgccgc gggccaccaa gtcggtcagc 2859961 gcggcatcgg ctgcttcgca ggctagtcgc agcagcgcca gctcgccggc gtctttaacc 2860021 tcgcgcagtg actccacagt tccggatgcc cgcaccaact cggtgttctt gccctccagc 2860081 gcgcccgcca aggcgtccag gccgtccacc gtgaccacgt ggctctcgaa gcccagcttt 2860141 cccacgccgg cctcgccggc ccggccggcc aggtagcgcc cgaccgcgcg ctcgatagcc 2860201 acttcgaggt cgggcgcttg cgaggcggcc tgagtgcggt accggccgtc ggtggccaac 2860261 acggcatcgc gctcatcggc gaacaccagc aatgcgccgt tggacccgct gaagcctgat 2860321 agatatcgca cgtttatcag gtcgctgatc agcatcgcat ccaacccgga ggcagcgatt 2860381 tgtgctttca gcttgtctcg acgctgggaa tgtgtcacga cccttgacgg tactcgctac 2860441 gctgaatgcc catgactaac tggatgctgc gcgggttggc gttcgccgcc gcgatggtgg 2860501 ttctccgcct gttccagggg gcattgatca acgcgtggca gatgctgtcc gggctgatca 2860561 gcctggtgct actgctgctc ttcgcgatcg gaggggtggt gtggggtgtg atggacgggc 2860621 gcgccgacgc caaggcgagc cctgaccccg accgccgcca agacctggcc atgacctggc 2860681 tgttggccgg cctggtagcc ggcgcgctca gcggcgcggt ggcctggctc atttcgctgt 2860741 tctacaaagc gatctacacc gggggcccaa tcaacgagct gaccacgttc gcggccttca 2860801 ccgcgctcat cgtctttctg gtcgggatcg tcggggtagc cgtgggccgg tggctggtgg 2860861 accggcagct ggcgaaggca ccggtgcgac accacgggct tgccgctgaa cacgagcggg 2860921 ccgccgacac cgatgtattc tccgccgttc gcgccgacga cagtccgacc ggggagatgc 2860981 aggtcgcgca gcctgaggca caaaccgcgg ccgtcgccac ggtcgaacgt gaggcaccca 2861041 ccgaggtgat ccgcaccacc gaaagcgata cacccaccga ggttatccgc accgacaccg 2861101 aggcggacca gaccaagccc ggcgacgagc ccaagaagga ttaaccctca cgtcccgaca 2861161 tgctcagcta ggtaccgcag ggccagcagg tagccctgga tgccgagccc gacgatcacc 2861221 ccggtcgcga tggggctgag gtaggagtgg cggcggaact cctcacgcgc atgcacgttg 2861281 gagatatgca cctcgatcag cggagcgctc agctccgcgc aggcatcgcg cagtgccacc 2861341 gacgtgtgcg tcagaccgcc ggcgttgagg atcacgggtt cggccgcatc ggcggcctga 2861401 tgaatccagt ccagcagctg ggcttcgcta tcactttgcc gcacaacggc tttgagtccg 2861461 agctcggcgg cctcacgctc gatcagagcg accagctcgt cgtgggtggt gccgccatag 2861521 acggcgggct cgcgccggcc caaccggccc aggttggggc cgttgatcac gttcacgatc 2861581 agttcgctca tggggcgcaa actccggcgt aggcggttac cagcagaccg gggtccggtc 2861641 ccaccattcg gcccggcttg gccaatccgt cgagcaccac gaaccgcaac acacccgccc 2861701 gagtcttctt gtcgccggcc atgatttcca gcagctgggg cagcgcgtcc gggtcgtagc 2861761 tgaccggcaa tcccaacgag gacaggatgg tgcggtggcg ctgcgcggtc gcgtcgtcga 2861821 gccgcccggc aagcctggcc agctcggccg cgaacaccag ccccaccgac acggcggcgc 2861881 cgtggcgcca ccggtagcgt tcccggcgct cgatcgcgtg gcctaatgtg tggccgtagt 2861941 tgaggatttc gcgcagctcg gattcctttt cgtcggcggc gaccacctcg gccttgacgg 2862001 tgatcgcgcg ccggatcagc tcgggcagca cgtcgccggc cgggtcgagt gcggcctgcg 2862061 ggtcagcttc gatgagatcc aggatcaccg ggtcggcgat gaagccggcc ttgaccactt 2862121 cggccatgcc gcagatcatt tcgtcgcgtg gcaaggtttg cagcgtcgcc aggtccacca 2862181 ggaccgccaa cggctgatga aacgccccga ccaggttctt gccggcgtcg gtgttgatgc 2862241 cggtcttgcc gccgacggcc gcatcgacca tgcccagcag tgtggtgggc aggtgcacaa 2862301 tcgagacgcc gcgcagccag gtggccgccg cgaacccggc gacgtcggtg gcggccccgc 2862361 cgccgaggct gaccagggcg tctttgcggc cgattccgat gcggcccaac acctcccaga 2862421 tgaatcccac gacgggcagg tccttgccgg cctcggcgtc ggggatctcg atgcggtgcg 2862481 cgtcgacgcc cttgccggcc aagcgctttc ggatctcttc cgcggtctcg gctagtccgg 2862541 gctgatgcac gacggcgacc ttgtgccggt cggccagcag gtcttccagc tcgtcgagca 2862601 ggccggtacc gatgaccacc gggtatggcg gatcgacggc cacctgcacg gtcacgggtg 2862661 cgccgatatc ggtcatgtgg ccgcctcgct ggggctggga acctgcagcc gcgacaggat 2862721 atggcggacc accgccccgg ggttgcggcg attggtgtcc actcgcatgg tcgcgacgcg 2862781 ccggtacagc ggtgcccgct tggccatcag cgcgcggtat ttttcggcgc ggtcggggcc 2862841 ggccagcagt gggcgcacgg tgttgccgcc ggtgcggcgc acgccctcgg cggcgctgat 2862901 ctccaggtag acgacggtgt ggccggccag cgccgcgcgc acaccggggc tggtcaccgc 2862961 gccgccgccg agcgacagca caccgtcgtg gtcggccagt gccgcgcgca ccacgtcctc 2863021 ctcgatacgt cggaactcct gctccccgtc ggtggcgaag atgtcggcga tgctgcgtcc 2863081 ggtccgctgc tcgatcgcga cgtcggtgtc gagcaggccg accccgagcg ccttggccag 2863141 ccggcgcccg atggtggact tgccggagcc cggcaggccg acgagaaccg ctttgggtgc 2863201 catctgttaa ccggagaccc gcgcggccgg tgcttcgcgg tcggcgacgc tgcgctggta 2863261 ggcggcgatg ttgcgctggg tttcggccag cgaatccccg ccgaattttt ccagcgccgc 2863321 ccgggccagc accaacgcca ccatggtctc caccacgacc ccggccgccg gcaccgcgca 2863381 cacatccgag cgctgatgga tggcgacggc ctcatcgccg gtcgccaggt cgacggtggc 2863441 cagcgcgcgc ggcaccgtgg agatcggctt catcgccgca cgcacccgca gcggctgccc 2863501 gttggtcatc ccgccttcca gccccccggc ccggttggtg gagcggacga cgccgtcggg 2863561 cccggggtac atctcgtcgt gggcgcggct gccgcggcgg cgcgcggtct ggaatccgtc 2863621 gccgatctcc acgcccttga tcgcctggat gcccatgacg gcggcggcca gctggctgtc 2863681 gagccgatgg tcgccgctgg tgaacgaccc cagccccacc ggcaggccca gcgcgaccgc 2863741 ctccaccacg ccgccgaggg tgtcgccgtc tttcttggcc gcctcgattt gggcgatcat 2863801 gtccgcctcg gcggccttgt cgtaggcgcg taccgggctg gcgtcgatgg cgggtaggtc 2863861 ctcggcccgc ggcggcggac cctcgtaggg tgccgacgcg ccgatcgaga tgacgtggga 2863921 gagcacctcg acacccagcg cctgcctcag gaatgcccgt gcgaccgtgc ccgccgcgac 2863981 ccgggcggcg gtctcgcggg cgctggcccg ctccagcacc ggccgcgcgt cgtcgaagcc 2864041 gtatttgagc atgcccgcgt agtcggcgtg gcccggccgc ggccgggtga gcggggcgtt 2864101 gcgtgcgacg tcggccagct cggcggggtc gaccgggtcg gcggccatca cggtctccca 2864161 tttgggccat tcggtgttgc cgatctcgat ggcgatgggc ccgcccaggg tgctgccgtg 2864221 gcgtatcccg gacagcacgg tcaccgcgtc gcgctcgaac gtcatccgtg cgccgcggcc 2864281 gtagcccagc cggcgtcggg ccagctggtc ggcgatgtcg gccgaggtga cgtgcacgcc 2864341 ggcgaccatg ccttcgacca cggccaccaa ggcgcggccg tgtgactccc ccgcggtgat 2864401 ccagcgcaac acctgaccat cttcccatgc gccgccggcg gccaccgcac gtcaacgcac 2864461 ccactccgtg cgatcgcggt gatgtgcggc cccccggatg ccccgctagc atccctggcg 2864521 tggaagtggc tggcggcacc cgggcccggc tgcgggtcac agccgatggt ttgcaggcgc 2864581 tggccgggcg gtgcgcgacc ctggccggcg aattgtcggc cgcggtcgcg ccgtcggggg 2864641 cggtgttgtc gtggcaggcc aacgcggtcg cggtgaacgc cgcgcatgcc cgcgcgggtg 2864701 cggccgccgc ggctgtgagc gcccgaatgc gggccaccgc cgccgcgctg gggcaggccg 2864761 cccgccggta cgcgggccag gacaccgcag cggcggccgc cctgggggcg gtacgcccgt 2864821 gggggaccca ctgatggcta cgtcggggct gccgccgctg tcggcggtgc agtcgacgag 2864881 ctttgcgcat ctgagcgagg ccgccgccca ctggcggcgg ctggccacgc ggtgggagcg 2864941 cgccttagcc gaggtgcgcg attcgatgcg ccgacccggc ggcaccgact gggagggcca 2865001 ggccgcggcc cgcgcccact accggtcgac cgtcgacgtg gtgacgatcg gtcgcgcggt 2865061 ggaccggctg catgacgccg ccgccgtcgc cggccggggg aagaccagct ggaggccaac 2865121 cggcgggcgg tgctggacgc tgtcagcgac gcccgccggg acgggtttgc cgtcggtgag 2865181 gattacacgg tcaccgaccg ctccacgggt ggctcacgcc agcagcgggc ggcgcgtctg 2865241 ggccaagccc aggggcacgc cgactttatc cggcatcggg tgggcgcgct gctggccacc 2865301 gaccgcgata tcgcgacccg ggtcagcgcc gccacccaag gcctcgatga gctggcgttc 2865361 gaagacgtgc ccggggtcga caccccggcc gaggatgggg tgcaggcggt ggatttccgc 2865421 caggccccgc caccgggagc ccccgggggc atgtcctccg gcgacatcga cgcgatcgac 2865481 gcggccaatc gcgccctgct gcaagacatg ctggcggagt acagccggct gcccgacggg 2865541 caggtgaaaa ccgaccggct ggccgacatc gcggccatcc aagaggcgct gagggtgccc 2865601 gactcgcatt tgatctatgt ggccaggccg gacgaccccg ccgacatgat cccggcggtc 2865661 accgcggtcg gcgatccgtt caccgccgat cacgtgtcgg tgacggtccc cggggtgtcg 2865721 ggaaccaccc gtcagaccat cgccaccatg acccaagaaa cccgtgggct acgagaagaa 2865781 gcgagagtga tcgcccacag cgtgggtgaa agtgagaatg tggcgaccat agcgtgggtg 2865841 gggtatcagc cgccgccggt gctcgcgtcg tggaacaccg ttgatgacga tctcgcgcag 2865901 gccggcgctc cgaagttgga ggcgtttttg cgggatctgc aggcgggatc gcacaatccg 2865961 ggtcacacga cggcgttgtt cgggcattcc tacgggtcgt tgctgtcggg gatcgcgttg 2866021 aaggatggcg ccagttcact ggtcgacaat gcggtgctgt atggctcgcc ggggtttgac 2866081 gcgacctcac cggccaagct gggcatgaac gaccacaact tcttcgtgat gaccacaccc 2866141 gatgacccca tccggtatcc ggcgcgcctg gcacccctgc acgggtgggg atcagacggc 2866201 gccgacacca tcggcactgt aggccgccaa ggcacccctg cacgggtggg gatcagaccc 2866261 caacgagatc atcgccggat ccccggaccg ctaccgcttc acccatctgc agaccgacgc 2866321 gggatccact ccgctgggtg atcacaagac cgccgccagc gggcactcgc aatacggcca 2866381 agacccgctg caacggatga ccggctacaa cctggcgacc atcctgctca accggcccga 2866441 tctggcggtg cgcgaaagcc cacagcagtg atcgcaccac aaccgatttc ccgaacgctc 2866501 ccgcggtggc agcgcatcgt cgcgctgacc atgatcggca tatcaaccgc cctgataggt 2866561 ggctgcacca tggatcacaa ccctgacaca tcacggcgcc tgaccggcga gcagaagatc 2866621 cagctcatcg acagcatgcg caacaagggc tcctacgagg ccgcccggga gcgcctaacc 2866681 gccaccgccc ggatcatcgc cgaccgcgtc agtgcggcca tcccgggcca aacctggaaa 2866741 ttcgacgacg atcccaacat acaacagtct gaccgaaacg gagcactgtg cgacaagctc 2866801 accgcggata tcgcgcggcg gccgatcgcc aacagcgtaa tgttcggcgc cacgttctcg 2866861 gccgaggact tcaagattgc cgccaatatc gtgcgggagg aagccgccaa gtacggtgcg 2866921 accaccgagt cgtcgctatt taacgaatcg gccaagcgcg actacgacgt gcagggcaac 2866981 ggctacgaat tccgactcct gcaaatcaaa ttcgccacac ttaacatcac cggcgattgt 2867041 tttctgttgc agaaggtgct cgacctgccg gccggacaac tccccccgga accacccatc 2867101 tggccaacga cctcgacgcc acattgatcg caccacaacc gattccccga acgctcccac 2867161 ggtggcagcg catcgtcgcg ctgaccatga tcggcatatc aaccgccctg ataggtggct 2867221 gcacaatggg ccaaaacccc gacaaatcac cgcacctgac cggcgagcag aagatccagc 2867281 tcatcgacag catgcgccac aaaggctcct acgaggccgc ccgggaacgc ctcaccgcca 2867341 ccgcccagat catcgccgac cgcgtcagtg cggccatccc gggccaaacc tggaaattca 2867401 acgacgactc ctacggccaa gacttctata gaaatggatc gttgtgtaag gaactcagtg 2867461 ccgatatcgc ccggcggccg atggccaaac cggttgactt cggtagcaca ttctcggcgg 2867521 aagacttcaa gattgccgcc aatatcgtgc gagaggaagc cgccaagtac ggtgtgacca 2867581 ccgagtcgtc gctgtttaac gaatcggcca aacgcgacta cgacgtgcag ggcaacggct 2867641 acgaattcaa cctgggccaa atcaaattcg ccacacttaa catcaccggc gactgttttc 2867701 tgttgcagaa ggtgctcgac ctgccggccg gacaactccc ccccgaacca cccatttggc 2867761 cgacgacctc gacgccaacc ccgtgagcac caccatcgtt gctggcgtga tccagggtca 2867821 cctgccggtg atcctgccca cgcgcaggcg ggctcgcgat ctcgggcaca cgacggcgtt 2867881 atttcgggcg caaacgctcc aatgcatata tctcagtatc gaatacctat atgtttgctc 2867941 catgtctcgg cgtacaacga tcgacatcga tgacatactg ctggcccgcg cgcaagcggc 2868001 gctcggtacc accgggctga aggacagggt cgatgccgct ttgcgagccg cggtgcgcta 2868061 gtcggcgcgc actcggctcg ccgcgcgaat cgcctcgggt gccggcatcg atcggtccga 2868121 ggcgctgctt gcccagacgc gtcccgcgcg gtgatggtgt tctgcgtcga caccagcgcg 2868181 tggcatcacg cggcgcggcc ggaagttgcg cgccgatggt tggcggcctt gtccgcggac 2868241 cagatcggca tctgcgacca cgtgcggttg gagatcctgt actcggcgaa ctccgctacc 2868301 gactacgacg cgctcgccga cgaactcgac ggcttggccc gtataccagt cggtgccgaa 2868361 acctttacgc gcgcatgcca agtccagcgt gagcttgccc acgtcgccgg tctgcatcac 2868421 cgcagcgtga agatcgccga tcttgtcatc gccgcggcgg ccgaactttc aggcaccatc 2868481 gtgtggcatt acgacgagaa ctatgaccgg gtcgccgcca tcaccggcca acctacggag 2868541 tggatcgtgc cgcgcgggac cctttaaccg ctgataggcg ccatcactgg atgtatggtg 2868601 atgtcatgcg gactcaggtg accctgggca aagaggagct tgagctgctc gatcgtgccg 2868661 ccaaggcgag tggcgcatcg cggtccgaac tcatccgacg cgcaattcac cgtgcctacg 2868721 ggactggatc caagcaggaa cggctcgccg cgctcgacca cagccgtggc tcgtggcgag 2868781 gacgggactt caccggcacc gagtatgtcg acgccattcg gggcgacctc aacgaacgac 2868841 ttgctcggct cggtctggcg tgaagctgat cgacaccacc atcgcggtcg accaccttcg 2868901 cggcgaaccc gcggcagccg tgctgctcgc cgaactgata aacaacggtg aggagatcgc 2868961 ggccagcgag ctggtccgat tcgaactcct cgccggtgtg cgggaaagcg aactcgcggc 2869021 gctcgaggcc ttcttctcgg cagtggtgtg gaccctggtg accgaggaca ttgcccggat 2869081 cggcggacga ctcgcccgtc gataccggtc cagccaccgc ggtatcgacg acgtggacta 2869141 cctgatcgct gcgaccgcca ttgtggtcga cgccgacctg ctcaccacca atgtgcgcca 2869201 cttcccgatg ttcccggatc tgcagccgcc gtactgagca ctccctgggg catcagcctt 2869261 ggtcggcgat gagttgttcg atgagctcga cgatgcgctg ttggccggcg gcggccccgt 2869321 ccagcttgcc tcgcatctcg gtgaatccgt cgtcgactcg actaaaacgt tcttctacgt 2869381 gactgaaacg ttcggtcatc tcttcccgca gggcggtgaa atcttctcgc agggcgttga 2869441 agctaccgat tgtagctcgc cggaagtcgc ggaactcgcc aacgaactcc gtgacatcgc 2869501 gatcggccgc gccggctagc acgcgagcgg cggcggcatc ctgttcgctg gcccgcacgc 2869561 ggtcagccag ctcacgcact tgggattcca gcgcggtgac ccgttgttcg aggttctcgg 2869621 gcagcacgag cgaatcctac cgcgattcaa cgcaacgcag ccctgtcccg ggcggacacc 2869681 ggcattgggt gcacgtcgga taagcagggc tgagcggggc tcggctctac tcgggtctta 2869741 cctcgacaaa tccggccgcg ctgaagtcac catcgaaggc atacgcattt tggatgcctt 2869801 tctttcgcat caccgcgaag ctcgtggcat cgacgaacga gtactctcgc tcgtcgtggc 2869861 gtacaagcca ttcccatgcc tgctcttcca ggtcggctgt tacgtgctcg acgcgaacga 2869921 cggtgctcaa gcggattgca gcggcggcaa ccgccgcgcg gtgaccgcag cgccggttga 2869981 gcagcgtcca ggtctcgccc aggacatggt tggaggtcat caccacgggc ggtttgctgg 2870041 cccacaacct cttcgcggtg ccgtgccgag cgtcgccggc gttgccaagt gcagcccaga 2870101 aggacgtgtc gacgaagatc attcgtgctt tccgtaaacc acgtcgtcga cggacgcgga 2870161 caagtcggct tcccccacga acgatccgac gaaggcatcg accggatctg ggcccggctg 2870221 ccggaggtgc tcagcgacgt actcccggat cagcgccgcc ttcgacgtcc gccgccgtcg 2870281 cgcttcaaca gcaagcgctc ggtcaacgtc ttcgtcgatg tagatctgca gccttttcac 2870341 atggcaaata tacgccacta gcataatgct gtatacatcg gtagccgaga atcggatgct 2870401 tgccgctggc tgccgagttt gttgaactcg ccgccgtggt gacctggatg aagtgtgccc 2870461 gccgaaactg ccgccgcccg actaggcgac tggccaaagc gatgacagtg ctgacttctg 2870521 tacaggggcg aagcgagtgt ccgccccttt acgcgcgtcg taatcagccg ctgagttcgc 2870581 catggttccc atgcaagatc gctcacgttc gagcccggcg cgcggtgacc gacatgaaac 2870641 tcccgttacg agcaagcatg cggcaacgcc gcctcgacgg cgacggtcca agctgtcctc 2870701 tgaacgaatc aggttgcgct aagccaagat tcgttgtcaa acgacctctg gtctacactg 2870761 atatcgcgcc aatctcagcc cagcagcgcc aaccccaccg cccccaggct ggccacacac 2870821 atcgacggcc cgtgcggcag ggtgcggaca ccccatggcg tcaccatcac gccgcacacc 2870881 gcggtcagca gcggcgcggc cagcgccgcc agaaaccaca cctcgacccc gaagcagccg 2870941 gtcagcccgc ccagaccgat cgccagcttg acgtcaccgg cgcccatcgc ggcgggcaaa 2871001 gccaggtgca ccagcaggta caccccggcc aaggcggccg ccccggccag cgccggcaca 2871061 ccgcggccgg caaggcccgc gaagagcagg atcacccccg ccccgggcag ggtgagccag 2871121 ttgggtagcc ggcgctgccg gacgtcgcaa acgcacaaca ctcccatcca ggccaacacc 2871181 gccgccgcca gcatgctggg gcacgctagt ccaacgcggc cagcgcgcaa gtcatcgctt 2871241 cgcggggggc gggtagcccg gtgaactgct ccacctgcgc gaacgcctga tgcagcaaca 2871301 tctgcagccc gctgatcacc cgcccgcccg ccgatccgac cgcggcggcc agcggtgtgg 2871361 gccacggatc gtagatggcg tccaacagca ccgggatcgc ggccaaggtg ccggcatacc 2871421 ccgcggccac ctccgctgga atggtgctga ccagcacttc cgcggcggcc accgcatcgg 2871481 ccaacccacc gctgtcgaac gcgcagaacc gggtcgccac gccgacccgt gtgcccaggt 2871541 ccaccagccg ggccgccttg tccgagttgc gcgccaccac ggtgatgtcg gtgaccccga 2871601 gttcggccag ccccaccacg gccgccggtg cggtcccccc ggaccccagc accagcgcgt 2871661 gtccagcagc cgcccccaac gccccggcca ccccgtcgat gtcggtgttg tcggcccgcc 2871721 agccatgcgg cgtccgaacc agggtgttgg ccgaaccgac aaggtccgcg cgtgcggtgc 2871781 gctcgtcggc gaaccgcagg gcggcgaact tgcccggcat ggtcaccgaa acaccgaccc 2871841 actccggtcc gaaaccaccg accacgacgg gcaactcggc cgcaccgcat tcgatgcgct 2871901 cataggtcca gtcgtgcagc cccaacgccc ggtaggcggc caggtgcagc tgcggggagc 2871961 gggaatgcgc gatcggcgaa ccaagcacgc cggctttttt gggaccttcg ctcatcgcgc 2872021 gctgtcgagg acaccgttgt gtttggccag ctcgatgttc gccagatgct gctgatagtc 2872081 cctggtgaac agcgtcgtgc cctgggaatc gatggtgacg aagtacagcc agtcgccagg 2872141 tactggatgc tcggcggcgc gcagcgcgtc gacgccgggc gaacagatcg cggtggccgg 2872201 cagcccctgg gccatgtagg tgttccacgg tgtgcgctgg gcacggtcgg tgtcgctggt 2872261 ggccacctca cggcgatcca gcggatagtt cacggtcgag tcgaactcca acgtgcggtg 2872321 ttcgtgcagc cggttgtaga tgacccgggc caccttcggg aaatcctggg tgttggcttc 2872381 ctgctgcacc agcgaggcca ccacgagaat gtcatagggc gacaggccca gcgactttgc 2872441 ggtgtctacc aacccggatt tcatgtactc cacggcgccg gcgctgatca aggtcgccaa 2872501 gatggtttca gccgatgccg acgggtcgat gttgaaggtc cccggtgcga tcagcccctc 2872561 gatccggcga tggtcagtgc ccagctccat caccggccca accgcccagc gcggcactga 2872621 cagcatcgtc ggcgtgctcc tgctcgccgc cgcgcggagg tcggccaccg agacgcagcg 2872681 ttgggtaccg tcgagatcca cacaggtggc acgggagatc agcgcgaata tgccaggatt 2872741 caccacgttg gtcttcatgt cggtggtgtc gtcgagctga cgcccttccg gtatgaccaa 2872801 cttccccacc cggttgtgcg gatcggtaag ccgcgcgaca gcggaagccg ccgaaatctc 2872861 ggttcgcatc cgatagaacc cgggttggat cgaggaaatc gcggtgttgc cgtgcgcggc 2872921 atcgacgaat gctcggacgg tggccactac accgtgtttg agcagcgtct ccccgaccgc 2872981 cgtggtcgag tcaccggccc tgatctgaat cacgatgtct cgcttgccgg gaccggtgta 2873041 gtcgttaccg aagcccaaca tggtctgcca caacttggcg ccgacgacga cggccaccac 2873101 caccaccacg acgagcaggc tcagggcaaa tccgccggcg acgcgccgtc gccggcggat 2873161 ttgttgggcg tgtcggcgct gagcgcggct gactcgggtc ctgcggtgcc ggttcggtct 2873221 taccgacacc ggctgggcgc ggtggcggtg gccaccgtca ggcatcggag ccttcttgag 2873281 tcccggccat cgccgcgaga cgttcatcca gccagctctg cagtattgcc actgcggccg 2873341 cttggtcgat caccgcacgc tgctcggagg cccgcacccc cgcctgccgc aaagatcgtt 2873401 gagcactgac cgtggtgagc cgctcgtcgg ccagccgcac cggcgtagga gaaacacggc 2873461 gtgccagcgc ctcggccagt tcgattgcgt cttgggccga gcggccgatg cggtcggcca 2873521 gcgtgcgcgg gagcccgacg atcacctcga ccgcctccaa ctcggcggcc agcgcagcca 2873581 gcctgcgcag gtgcttgccg gaacgatcgc ggcgcaccgt ttccaccggg gtggccaaga 2873641 tcgcgtccgg gtcgctgcaa gccacgccga tacgcgcggc gcccacgtcg ataccgaggc 2873701 gtcgtccccg tccagggtcg tgcgctggat cgccgggccg gtcgggcggg cggtgctgtg 2873761 ctgggaccac tcaaccgacc cgcgctatca cggcgatctc ggagcggacc gcgtcgagcg 2873821 cggcgtcgat accggtcgga ttctttcccg agccctgcgc caggtccgcc ttaccgccac 2873881 cgcggccttc gaccgccacc gcaagttgtt tgaccaggtc gttggcacgg attccgaggt 2873941 cctgggcagc gggattggcc gcgaccgcat acggcacagt ttggctttcg ccctcggcaa 2874001 tcagcgccac caccgccggc tcgctaccca gcttgccgcg gatgtcgccg atcaacgacc 2874061 gcaggtctgc cgcggtcatc ccgccggaca ttcgctgcgc caccaaacgg acgttaccga 2874121 tccgctgagc cccggcggcg gcattggtgg cggctgcccg ggcgctggcc atccggacac 2874181 gttcgagttc cttctcggcg gcccgcaggc gctccactag attggccacc cgggccggta 2874241 cctcttcgga cggcaccttc agtgacgagg ccaacccggc catcaacgca cgctccttgg 2874301 ccaggtgacg aaacgaatcc aaccccacgt aggcctccac ccggcgcacc ccggagccga 2874361 tcgacgactc gcccaggatc gtcacgggac cgatctgcgc cgtgttgctc acatgggtgc 2874421 cgccacatag ctccagcgag aacggtccac ccatctccac cacccgcact tcgtcggggt 2874481 agctctcgcc gaacagcgcg atggcaccca tcgccttggc cttgtcgagc tgttcggtga 2874541 acgtgcgcac ctcgaagtcc gcttgcacgg cctcgttggt gacctcttcg acctgggtgc 2874601 gctggtcgtc ggtcaacgga ccctgccagt taaagtcgaa gcgcaaatat cccggccggt 2874661 tcagcgatcc cgcctgaacc gcgttgggcc ccagcacttg tcgcagcgcg gcatgcacca 2874721 tgtgggtgcc cgagtggccc tgcgtggcac cccggcgcca cccgggatcc accgccgcga 2874781 ttacggtgtc accctcgacg aattccccgg attccacgtt gactcggtgc acccaaagcg 2874841 ttttggcgat cttctgcacg tcggtaaccg cggcccgggc agcttcgctg gaaccggttc 2874901 cgctgatggt gccctcatcg gcgatctgcc cacccgattc ggcgtagagc ggggtgcgat 2874961 ctaagacaag ttcgacacgc tgcccttccc cggctccgcc ggctacaccg tgcgccacca 2875021 ccggaacccg cttaccgtcg acgaagatgc ccagaatccg cgcctgggaa cgcaactcgt 2875081 cgaatccggt gaactcggtg gcgccggcgt caaccagctc gcggtaggcg ctcaggtcag 2875141 catgcgcgtg tttgcgcgcg gcggcgtcgg ccttggcacg gcggcgctgc tcggccatca 2875201 gctcacggaa cccgatttcg tctacctgca gaccggtttc ggccgccatc tccagcgtga 2875261 gctcgatcgg gaacccgtag gtgtcatgca acgtgaaagc gtccgatccg gacagcacgg 2875321 tggctccgga tttcttggtg gagctagcca cctcctcgaa cagcctggaa cccgacgcca 2875381 gcgtgcggtt gaacgccgtc tcctcggcga ccgcgatccg gctgatccgc tcgaagtcgg 2875441 cgacgagttc gggatatgac gggcccatcg cgttgcgcac cgtggccatc aggtcgccaa 2875501 cgatcgcagc gtcgatgccc agcagcttgg cggagcggat cacccgacgc agcagccggc 2875561 gcagcacata accgcgaccg tcgttgccgg ggctgacgcc gtcaccgatc aggatcgcgg 2875621 cggtgcggct gtggtctgcg atgatgcggt accgcacgtc gtcttcgtgg ttgccgacgt 2875681 cgtaggcacg cgcggcgacc ctggccacgg tatcgatgac cggcctgagc aggtcggtct 2875741 cgtagacgtt gtgcacgtct tgcagcacca gcgcgatccg ctcgacgccc atgccggtgt 2875801 cgatgttctt gcggggcagc ggcccgagga tctggtagtc ctccttggtg gttccctctc 2875861 cgcgctcgtt ctgcatgaac accaggttcc agacctcgag gtagcggtct tcgctgacga 2875921 tgggaccgcc tgcgggaccg aattcgggtc cgcggtcgta atagatctcc gatgacggcc 2875981 cgcacggtcc gggaatgccc atcgaccagt agttgtcggc catgccgcgg cgctggattc 2876041 gctccgccgg cagcccggca acctcctgcc atagccggac agcttcgtcg tcgtcgaaat 2876101 agactgtcgt ccagattctt tccgggtcca ggccgtagcc gccggcggcg aggctgttgg 2876161 tcagcagtgc ccaggccagt tcaatggccc cgcgtttgaa atagtcgccg aagctgaaat 2876221 tgccggccat ctgaaaaaac gtgttgtgcc gggtggttat gcccacctcg tcgatatcgg 2876281 gggtacggat gcacttctgg atgctggtgg ccgtcgggta cggcggcgtg cgctgtccca 2876341 agaagaaagg cacgaactgg accatcccgg cgttgacgaa caacaggttg gggtcgtcga 2876401 ggatcaccga ggcgctgggc acctcggtgt ggcccgcctt cacgaaatga tcgaggaacc 2876461 gcttcctgat ctcgtgtgtc tgcactctac gttcttcctt gatccgtggt taagtccatt 2876521 accagcctat tcgccggatt atgagaaggc tgtccgacgg cccaattcgg cccgctcagc 2876581 cttccacaaa gctcaatcgc accgaccgcc gcggattgtc ctggttgagg tcgaccagaa 2876641 cgatgctttg ccaggtgccc agcaggggct ggccccccga gaccggcacc gtcaccgacg 2876701 gcgcaacaaa agccggtaac aagtggtcgg cgccgtgacc gtaggacccg tgcgcgtgcc 2876761 ggtagcggtc gtcgcgcggc aacaaccgca ccagcgtgtc caccagatcc tcgtcggaac 2876821 cggcgccggt ctcgataatc gcaacgccgg ccgtagcgtg cgggacgaac acgttgcaca 2876881 ggccatcatc atgggcggtg cagaaggcgc gcacggcgtc ggtgagatcg acaatgcggc 2876941 gacgcgcggt gtccacatcc agcacatcgg tatccacccg tcccagccta cggtgggggc 2877001 gcgccaacct gccaatccat tgacgtcgga ttgcccattg ccccggccgg cccgtcggag 2877061 gaaggtaatg attgaccggt ggcgccaccg gggcgctgcc ccgaacaatg aaagaggggt 2877121 ggatcgtgta cgcgcgctct accactattc aggcgcaatc cgagtgcatc gacaccggaa 2877181 ttgcgcacgt tcgcgatgtg gttatgcccg cactgcaggg gatggatggg tgcatcggcg 2877241 tatccctttt ggtcgaccgg caatccggca ggtgcatcgc caccagtgcc tgggagaccg 2877301 cggaagccat gcatgcaagc cgggaacagg taacgccgat ccgcgatcgg tgcgcggaga 2877361 tgttcggcgg cacgccggcc gtcgaggagt gggagatcgc ggcgatgcat cgcgaccacc 2877421 gctcggccga gggggcgtgt gtgcgggcga cctgggtcaa ggtgccggcg gaccaagtag 2877481 atcaaggcat cgagtactac aagtcgtccg tcctgcccca aatcgaaggc ctcgacggat 2877541 tctgcagcgc cagcctgttg gtcgaccgca cctccgggcg cgcggtgtct tccgcgacct 2877601 tcgacagctt tgacgccatg gagcgcaacc gggaccagtc gaatgcgctc aaggccacat 2877661 cgctgcgtga ggcgggcggc gaggaactcg atgaatgcga gttcgagctg gcgctagcgc 2877721 acctacgggt acccgagctg gtctgatcaa cccgccggcg gcagtaccgg cccgagcccg 2877781 acgctgggcc ggcactgctg tcgtgcgtcg agcggcgctc gcggtaggca ttgccaggct 2877841 cagccggttg gaggaaggta tttggtggga ccggtggcgc caccggggcg ctgccccgac 2877901 acgggagggg gtcgatcgtg tacgcacgct caaccaccat tgaggcgcaa cctctgtcgg 2877961 tcgacattgg aatcgcgcat gttcgtgacg tcgtcatgcc cgctttgcag gagatcgacg 2878021 ggtgtgtcgg ggtgtcgctg ttggtcgacc ggcaatccgg ccggtgcatc gccaccagcg 2878081 cctgggagac cttggaggcg atgcgcgcca gcgtcgagcg ggtggcaccc atccgcgacc 2878141 gcgccgcgct gatgttcgcc ggtagtgccc gggtcgagga atgggacatc gccctgttgc 2878201 accgcgacca cccgtcgcat gagggggcat gcgtgcgcgc cacctggctc aaagtggtgc 2878261 cagaccagct cggtcggtcc ctggagttct accgcacgtc cgtacttccc gagctggaga 2878321 gtctggacgg gttctgcagc gccagcctga tggtcgacca ccccgcttgc cggcgtgcgg 2878381 tgtcgtgctc gacgttcgac agcatggacg cgatggcccg caaccgcgac cgggcgagcg 2878441 agctgcgcag caggcgcgtc cgggaattgg gagccgaggt cctcgacgtc gccgaattcg 2878501 aactggcgat cgcacatcta cgggtacccg agctggtctg agcggacctg cttcccgcag 2878561 agcgcagcgg tcacccccgt ttcttgcgga tgattgcccg caggcggtcc aggcggccgg 2878621 cgatctcgcg ttcgccgccg cgaccagtgg gccggtagta gtccacgtcc accaactcgt 2878681 cgggcgggta ttgctgggcc acaacgccat ccgggtcgtc atgggaatat ttgtagccct 2878741 gtgcattgcc cagcgccgcc gccccggagt aatgcccgtc acgcagatga gccggcacca 2878801 gaccggcctt gccggccttg atgtcgttca tcgccgcggc caacgccgtg gtgacggcgt 2878861 ttgacttcgg tgcggtggcc aggtggatgg tggcgtgcgc cagcgtcagc tgggcttcgg 2878921 gcatgccgat cagcgccacc gtctgtgcgg cggcgaccgc cacctgcagc gcgctcgggc 2878981 cggccatgcc gatgtcctcg ctggccagaa tcatcagccg gcgggcgatg aaccgcgggt 2879041 cctccccggc gaccagcatg cgggccaaat agtgcagcgc ggcatcgacg tcggaaccgc 2879101 gcaccgattt gatgaaggcg ctgacgacgt cgtagtgctg gtcgccgtca cggtcgtagc 2879161 gcaccgcggc tttgtccacc gaccgctcga tggtttgcac gctgaccagc tcgccggccg 2879221 cctgggctgc ctcggccgct acttccagcg cggtcagggc gcgccgggcg tcgccggccg 2879281 cgagttgcac cagcaggtcg acggcctcag gcgctaccgc gactgccctg cccaggccgc 2879341 gggggtcatc gatcgcgcgt tgtactaccg cgcgggtgtc ctcggccgtc agcggccgca 2879401 gctgcaggat cagcgaccgc gacagcagcg gtgccaccac cgaaaacgac gggttctcgg 2879461 tggtcgccgc caccaacagc accacccggt gttccaccgc cgacagcagg gcgtcttgtt 2879521 gggtcttgga aaatcggtgc acctcgtcga tgaacagcac ggtctgctcg ccgtgaagca 2879581 gcgcttttcg cgaattctcg atgaccgccc gcacttcctt gacgccggcc gacaatgccg 2879641 acagggcctc gaaccggcgg ccggtggcct gcgagatcaa cgccgccagc gttgtcttgc 2879701 cgctgcccgg gggaccgtag aggatcaccg acgccacccc cgagccctcg accagccggc 2879761 gcaacggcga accgggcgcc agcaagtggt cctggccgac cacttcgtcc agcgacgccg 2879821 gacgcatccg caccgccagc ggtgccccgg ccgaagcgcc caggtcatgg ccggacgtca 2879881 tcggtacgcc gggcacgtca aacagaccgt cggacacggc ttcaggcata ccacgcccac 2879941 ctgacgacgc gaacgttcgc cgaagacgcc acacgaataa tccgcgcgcc ttcggcaaat 2880001 atttgctaag ttccggtttg cttagcgtcg cgcgggtacc gataaaagcg aactacgaag 2880061 cgattgggac agcgatgagc cagccgccag aacatccagg caatccggcc gacccccagg 2880121 gcggcaatca gggcgctgga agctacccgc cgcccggcta cggagcgcct cccccgccac 2880181 caggctacgg cccacccccg gggacctacc tgcctcccgg ctacaacgca cccccgccgc 2880241 cccccggcta tggcccaccg ccgggcccgc cgcctcccgg ttacccgacg catctgcaat 2880301 cgtcgggttt tagcgtgggc gacgcgatca gttggtcatg gaataggttc acgcagaacg 2880361 ccgtaacgct cgtcgtcccg gtgctcgcct acgctgtggc gttggccgcg gtcatcggcg 2880421 cgacggccgg gctcgttgtc gccctatcgg accgtgctac taccgcatac accaacacct 2880481 ccggcgtctc tagcgaatcc gtggacatca cgatgacccc ggccgcgggc atagtcatgt 2880541 tcctcggcta catcgctcta ttcgccctgg tgctctacat gcacgccgga attctgaccg 2880601 gctgccttga cattgccgac ggaaagccgg tgaccatcgc gacgttcttt aggccgcgca 2880661 atctgggcct ggtgctggtc accggactgc tgatcgtcgc cgtcaccttc attggtggcc 2880721 tgctctgtgt cattcccggc ctgatctttg gcttcgtcgc ccagttcgcc gtcgcttttg 2880781 ccgtcgaccg ttccacttcg ccgatcgact cggtaaaggc cagcatcgag acggtcgggt 2880841 ccaacatcgg tggcagtgtg ctgtcgtggc tcgctcagct cacggcggtg ctcgtcggcg 2880901 aactgctgtg ctttgtcggc atgctgatcg gcattccggt cgccgcgctc atccacgtct 2880961 acacctaccg gaagctgtcg ggtggccaag tcgttgaggc agtccggcca gcgcccccgg 2881021 tcggctggcc gcccggcccc cagctcgcat agtcggcacc cgccgacgcc ggctggccgt 2881081 cttggcccgc tggatttgtc acgcgctcac ccgaattggc atccggggcc tggaacgcgt 2881141 tagggcagtg gctttcccac aggttgacgt aaatgacctc caagataggt atcgaaccaa 2881201 ggttgcggcc gatgtgtacg tagttcgaga gttcgctgat ctgatcactc gcgtggtcga 2881261 tgcagtcgac ggaaccggca gccgccaccc aagggtgcgc aggtggttag caaatcgccg 2881321 acgaacacga cgccaccgcg tcatgcgcca tcgccgaccc cgccttggtg gctgagagcc 2881381 gctcgccggc gttaagctgc ccaacatcat gggcattcaa cgcgccgttc tcctcattgc 2881441 cgacatcggc ggatacacaa attacatgca ctggaaccgc aagcacctgg cccacgcgca 2881501 gtggacggtg gcacagttgc tggagtccgt catcgacgct gccaagggca tgaagttggc 2881561 gaagctggag ggcgacgccg cgtttttttg ggcaccaggg gggcaacacc agtgtcctgg 2881621 tatgcgaccg gcccccgcag atgcgccaga ggttccgcac gcggcgcgag cagatcaaaa 2881681 aagaccatcc ctgcgactgt aagagttgcg agcagcggga caacctgtcg atcaaattcg 2881741 tcgcccatga gggcgaagtg gccgaacaaa aggtgaagcg caacgtcgaa ctcgctggcg 2881801 ttgatgtcat cctggtgcac cgcatgctga aaaatgaggt gccagtgtcg gaatatctat 2881861 tcatgaccga cgtcgtagcg cagtgcctcg acgagtcggt gcgaaaacta gcgacgccgc 2881921 tgacacatga cttcgagggc atcggagaaa cgtcgacaca ctacatcgac ctcgccacgt 2881981 ccgacatgcc gccggcggtg ccagaccaca gcttcttcgg cctgctgtgg gcggatgtga 2882041 agttcgaatg gcacgcgtta ccgtacctgt taggtttcaa gaaggcctgt gcaggtttcc 2882101 gcagcctggg ccgcggcgcc accgaagagc ccgccgaaat gggctaatcg ggttcgcttg 2882161 gctcgatcgc cgatgatctc gaccgccacg accgaccccc tcacctcggt cgaacctcgg 2882221 cgaaccaacg cggcaacgcc agcccatgat catttgattg ggtccacgga agcaggtagc 2882281 ttccgtcgca tgctttttgc ggctttgcgt gatgtccaat ggcgaaaacg acgccttgtc 2882341 atcgcaatcg tcagcaccgg cctagttttc gcgatgacgc tcgttctgac cggacttgtg 2882401 aacgggtttc gggtcgaggc cgagcgaacc gtcgattcca tgggtgtcga cgcattcgtg 2882461 gtcaaggccg gcgcggcagg accgttcctg ggttcgacac cattcgccca aatcgacctg 2882521 ccccaggttg ctcgtgcgcc tggcgtcttg gctgccgccc cactagcgac tgcgccgtcg 2882581 acgatccggc agggcacgtc agcgcgaaac gtcaccgcgt tcggggcacc agagcacgga 2882641 cccggcatgc cgcgggtctc ggacggtcgg gcgccatcga cgccggacga ggtcgcggtg 2882701 tcgagcacgc tgggccgaaa cctcggcgac gatctgcaag tgggtgcgcg cactttgcgg 2882761 atcgtcggca tcgtgcccga gtcaaccgcg ctggcaaaga ttcccaacat cttcctgacc 2882821 accgaaggcc tacagcagtt ggcatacaac ggacagccga caatcagttc gatcgggatc 2882881 gacgggatgc cccgacagct cccggacggc tatcagaccg tcaatcgagc ggatgctgtc 2882941 agcgatctga tgcgcccgtt gaaggtcgcg gtggatgcga tcacggttgt ggcggtcttg 2883001 ctgtggatcg ttgcggcgtt gatcgtcggc tcggtggtct acctctctgc gttggagcgg 2883061 ctgcgtgact ttgcggtgtt caaggcgatc ggcgtgccga cgcgctcgat tctggccggg 2883121 ctggcgctgc aggcggtcgt cgtcgcgctg ctcgcggcgg tggttggcgg catcctttcg 2883181 ctgctgttgg cgccgttgtt cccgatgact gtcgtggtac ccctgagtgc cttcgtggcg 2883241 ctaccggcga tcgcgactgt gatcggtctg ctggccagcg tcgcaggact gcggcgcgtg 2883301 gtggcgatcg atccggcact agcgttcgga ggtccctagc catgggcggc ctaaccattt 2883361 ccgacctggt cgtcgagtat tccagcggcg ggtacgccgt gcggccgatc gacgggttaa 2883421 gcctcgacgt ggcgccgggg tcgctggtga tcttgcttgg gcccagcggc tgcgggaaga 2883481 cgaccctctt gtcctgcctc ggcggcatcc tgcgcccgaa gtccggctca atcaagtttg 2883541 acgatgtcga catcacgacg ctggagggcg ccgcgctggc gaagtatcgg cgtgacaagg 2883601 tagggatcgt cttccaggcg ttcaacctgg tctcgagcct taccgccctg gagaacgtga 2883661 tggtcccgct gcgcgcggcc ggcgtgtcac gagcggccgc gcgtaagcgt gccgaggacc 2883721 tgctgatccg agtcaatctc ggcgaacgaa tgaaacaccg cccgggtgac atgagcggcg 2883781 gccagcagca acgcgtcgcg gtcgcccgcg cgatcgcgct ggacccgcaa ttgatccttg 2883841 ccgacgaacc gaccgcgcac ctggacttca tccaggtgga ggaggtgctg cggctgatcc 2883901 gctcgctagc gcagggcgac cgtgtggtgg tggtcgcgac ccacgacagc cggatgctgc 2883961 cgctggccga tcgcgtcctt gagctgatgc cggcgcaggt gtcgccgaat cagccacccg 2884021 aaacggtgca cgtgaaagcc ggcgaggtgc tgttcgagca gtccacaatg ggcgatctga 2884081 tctacgtggt gtccgagggc gagttcgaga ttgtgcgcga attggccgac ggcggtgagg 2884141 aattggtcaa aaccgccgcg cctggggact acttcggtga aatcggcgtg ctgtttcacc 2884201 tgccacgctc ggcaacggta cgggctcgca gcgacgcgac agccgtcggt tatacggcgc 2884261 aggcgtttcg ggagcggctg ggtgtgacgc gggtggccga cctgattgag caccgcgagc 2884321 ttgccagcga atagttcggc accaagtcgc gatccctgag ggttgcgatg ggcgcggcgc 2884381 cgccgctgaa tcgaccgccc cccactgagc cgccgtggaa tactcgatga atcctgcggg 2884441 cgtgtccgca ctgcgtgtgg ctatggagtt ggggaacatg ttgcttggga taagaacgtg 2884501 aatgagggac cgctcttcac aatgtcaggc actgccgtga gaagtccgct actcgatcgg 2884561 gtgtatgtga gcagtcctgg catgggccga gatgccaaga gccgcatctc atgaccaccg 2884621 cgcgacgacg gcccaagcgg cgtggtaccg atgcgcgaac cgcgctgcgc aacgttccga 2884681 tactcgccga tatcgacgac gaacagctcg aacgactcgc aaccaccgta gaacgccgcc 2884741 acgtgcccgc taaccagtgg ctctttcatg ccggagaacc agcggactcc atctatatcg 2884801 tcgactcggg gcggttcgtc gctgttgccc cagagggaca cgtatttgct gagatggcat 2884861 ccggcgactc gatcggagac ctgggggtga tcgccggggc tgcccgctca gcgggagtgc 2884921 gagctctgcg agacggcgtg gtgtggagga tcgccgcgga gacgtttacc gacatgctcg 2884981 aggcaacccc gctactgcaa tcggcgatgc tgcgagcgat ggcgagaatg ctacgccagt 2885041 cacgacccgc caagacggct cggcgtccgc gggtcatcgg cgtggtatcg aacggggaca 2885101 ccgccgcggc cccgatggtc gacgcgatcg ctacttcact ggactcgcac ggtcgaactg 2885161 ccgtgattgc gccgcccgtc gaaaccacct ccgccgttca ggagtacgac gagctcgtcg 2885221 aggcgttcag cgaaaccctc gatcgcgcgg agcgaagcaa cgattgggtc ttggtggtcg 2885281 ccgaccgagg cgccggcgac ctgtggcggc actacgttag cgcgcaaagc gaccgactcg 2885341 tggtcctggt ggatcaacgg tatccgccgg atgcggtcga ttcgcttgct acccaacggc 2885401 cagtgcacct gatcacatgt ctggcagaac cggatccaag ttggtgggat cggttggcgc 2885461 cggtttcgca tcatccggcc aactccgacg gcttcggtgc ccttgctcgc agaatcgccg 2885521 gccgatcgct cggcctggtg atggccggtg gcggagcccg gggactggcg catttcggtg 2885581 tttaccaaga gctcaccgaa gccggcgtcg tcatcgatcg gtttggcgga acaagttcgg 2885641 gtgcaatcgc ttccgcagcg ttcgcgctgg ggatggacgc cggggatgcg atcgccgcgg 2885701 cgcgagagtt catcgcagga agcgacccac tcggcgacta cacgatccca atatccgccc 2885761 tcacgcgagg tggacgcgtc gatcgtctgg tgcagggatt cttcggcaac acgttgatcg 2885821 aacatctgcc cagagggttc ttctccgtct ccgccgacat gatcaccggc gatcagatca 2885881 tccatcggcg gggatccgtc tcgggcgccg tgcgcgcatc gatctcgatc cccggtctca 2885941 tcccgccagt gcacaatggc gagcagctgc tcgtcgacgg tgggctgttg aacaatctgc 2886001 cggccaacgt gatgtgcgcc gataccgatg gcgaagtcat ctgcgtcgac ctccgccgaa 2886061 cgttcgtgcc gtcgaagggc tttggcctgc tgccgccaat cgttacgccg cccgggctcc 2886121 tccggcggct tttgaccggc acggataacg cgctaccacc gctgcaagag acgttgctgc 2886181 gcgccttcga ccttgccgcc tccaccgcaa acctgcgcga gcttcctcgc gttgcggcca 2886241 tcatcgagcc cgacgtgtcg aagatcggag tgttgaactt caagcagatt gatgccgccc 2886301 tagaggctgg gcggatggca gcccgtgcgg ctttgcaagc acagccggac ctggtgcgct 2886361 gaacccgacc aagtgccgct acggcccact caggtgtccg gcaccgggcg tacgcgctgc 2886421 gccgggcggt ccggtgtgat ctcatcagca gctatgagca tcaaagttgc gctggagcac 2886481 cgcaccagct acacctttga ccggctggtg cgggtgtatc cgcacatcgt gcggctacgc 2886541 ccggcgccgc actcccgcac ctccatcgaa gcctactcgc tgcgcatcga gcccgccgac 2886601 cacttcatca actggcagca ggacgcgctg ggcaactttc tggcgcggct ggtctttccg 2886661 aatcccatgc gccaactgcg tattaccgtc gggcttatcg ccgacctcaa ggtgatcaac 2886721 cccttcgact tctttatcga ggactgggcc gagatatggc cctgcgcagg gatggcctac 2886781 cccaaggcgc tcgccgatga cctgaggccg tacttgcggc cggtcgacga agacggcgac 2886841 ggttcgggcc ccggcgagct cacgcaggcc tgggtgcgca acttcacggt gcccgatggc 2886901 acccgcacca tcgacttctt ggtcgcactc aaccgcgcga tcaacgccga cgtcggctac 2886961 tgcgtgcgca tggagcccgg agttcagaca ccggatttca cgctgcgcac cggcgtcggc 2887021 tcgtgccggg actcggcgtg gctgctggtc tcgatcctgc gtcagttcgg gctggccgcc 2887081 cggttcgtgt ccggctacct ggttcagctg gcatccgaca tcgaagcgct cgacgggccg 2887141 tcggggcccg ccgccgactt caccgacctg cacgcgtggg ccgaggcata catcccgggt 2887201 gccggctgga tcgggctgga cccgacgtcg gggctgttgg ccggcgaggg ccacattccg 2887261 ctggcggcta cgccccaccc cgccagcgcg gcacccatca gcggcggcac cgacgtgtgc 2887321 gacaccgtgc tggagttctc caacaccgtc acccgcgtac acgaagaccc acgtgtcacg 2887381 ttgccctaca ccgacgagtc ctggaagacc atctgtgagg tgggccagcg cgtcgatgag 2887441 cggctggccg ccgccgacgt ccggctgacc gtcggcggcg aaccgacgtt cgtgtcggtg 2887501 gataaccagg tcgccgaaga gtggcggacg gcggccgacg gcccacacaa acgcgaacgg 2887561 gcatccgacc tggccgcccg cttgaaggcg gtgtgggccc cgcagggact catccaccgc 2887621 ggtcagggca ggtggtatcc cggagagccg ttgccgcgct ggcagattgc gctgtattgg 2887681 cgcaccgacg ggcggccgct gtggaccaac gacgcgctgt tggccgaccc ctggggcgcc 2887741 ccgcccgccg accccgtcga cgacgacgcg gcctaccggg tgctcgccgg gatcgccgac 2887801 ggcttggggc tgccgatctc gcaggtgcgg cccgcctacg aagacccgtt gagccggctg 2887861 gctgcggccg tgcgaatgcc agccggcgac ccggtggaat ccggtgacga cctcggctgc 2887921 gacaccaacc ccgacacccc caccggccgc gccgcgctgc tggcgcgcct cgatgaggcc 2887981 atcacctctc cggctgcgta cgtgctgccg ctgcaccgcc gcgacgacgg gcaaggctgg 2888041 gccagcgcga actggcggct gcgccgcggt cgcatcgtgt tgctcgaagg ggattcgccg 2888101 gcgggcctgc ggctgccgct ggattcgatc agctggcgcc caccccgggc atcgtttgac 2888161 gccgacccgg tagctgtgcg atccacattg ccggcggagc tccacaccga ccgggccgta 2888221 gtggaggatc ccgagacggc tccgaccacc gcgttggtcg ccgaggtccg gggtgggctg 2888281 gtgcacatct tcttgccgcc caccgacgcg ctcgagcact tcatcgacct tgtcgcccga 2888341 gtcgaggccg cggcgacgac ggccaactgc ccggtggtga tcgagggcta cggcccaccc 2888401 ccggacccgc ggctgacgtc caccacaatc acccccgacc ccggcgtcat cgaggtcaac 2888461 atcgcgccca ccgcctcttt tgcagaacaa cggcaacagc tggaaaccct gtatcaacaa 2888521 gcgcgcctgg cccgactcac caccgaagcg ttcgacgtcg acggcacgca cggcggcacc 2888581 ggcggcggca accacatcac gcttggcggc gtcacacccg cggactcacc gctgctgcgc 2888641 cggcccgacc tgctggtttc actgctgacc tactggcagc gacacccgtc gttgtcctac 2888701 ttgttcgccg ggcgtttcgt cggcaccacg tcacaggcgc cccgggttga cgagggccgc 2888761 gccgaggcgc tctacgaact cgagatcgcg ttcgccgaga tcctccggct gtcgccgtcg 2888821 tccgggggcg gccggcccca accgtgggtg accgaccgcg cgctgcggca cctgctcacc 2888881 gacatcaccg gcaacaccca tcgcgccgaa ttctgcatcg acaagctcta cagccccgac 2888941 agcgcccggg gcaggctcgg cctgctggag ctccgcgggt tcgagatgcc gccgcacctg 2889001 cacatggcga tggtgcagtc gctgctggtg cgctcgctgg tggcgtggtt ctgggaccaa 2889061 ccgctgcgcg ccccgctgat ccgccacggc gccaacttgc acggtcgata tctattgccg 2889121 cacttcttga ttcatgacat cgccgacgtc gcagccgacc tgcgcgcgca cggcatcgcg 2889181 ttcgagacta gctggctgga cccgttcacc gagttccgct tcccgcgcat cggcaccgcc 2889241 gtattcgacg gcattgagat cgagctgcgc ggggccatcg agccatggca cacccttggc 2889301 gaggaggcca ccgcggcagg caccgcgcgc tatgtcgact cgtcggtcga gcgcatccag 2889361 gtccgcatca tcggcgccga ccggcaccgc tacgtggtga cctgtaacgg ctacccgatg 2889421 ccgttgctgg ctaccgacaa ccccgacatc cacgtgggtg gtgtgcggtt caaagcgtgg 2889481 cagccgccca gcgcgctaca cccgaccatc acggtcgacg gcccgttgcg gttcgagctc 2889541 atcgacatcg ccaccgctac ctcgtgcggc ggctgtacct accatgtcgc ccatccgggc 2889601 ggccgcgcct acgacgagcc cccggtcaac gctgtggagg cggaggcccg ccgcgcccgg 2889661 cgcttcgagg cgaccggctt caccccgggc aagctcgacc tgtccgacat ccgggagaaa 2889721 caggccagga tatccaccga tatcggcgcg ccgggcatcc tcgacctacg acgcgtgcgt 2889781 accgtgcaac agtaatggca ccctcagctt ctgccgctac caacggctac gacgtcgacc 2889841 gcctgctggc cggataccgc accgcgcgtg cccaggaaac actgttcgac ctgcgggacg 2889901 gcccgggagc cggctatgac gaattcgtcg acgacgacgg caacgtgcga ccgacctgga 2889961 ccgagctcgc cgacgcggtc gccgaacgtg gcaaggcggg gctggaccgg ctgcgctcgg 2890021 tggtgcacag cctgatcgac cacgacggca tcacctacac cgcaatcgat gcacaccggg 2890081 acgcgctgac cggcgaccat gatctggaac cggggccgtg gcgcctggac ccgctgccgc 2890141 tggtgatttc cgcggccgat tgggaagtgc tggaggccgg cttggtgcag cgatcgcgct 2890201 tgcttgatgc catcctcgcg gacttgtacg ggccccgcag catgctcacc gagggtgtcc 2890261 tgccgccaga gatgctgttc gctcatcccg gctacgtgcg tgccgctaac gggatccaga 2890321 tgcctgggcg ccaccaactt ttcatgcacg cctgtgatct cagccggttg cccgacggga 2890381 cttttcaggt caacgccgac tggacgcagg cgccctcggg ctccggctat gcgatggccg 2890441 atcgacgtgt cgtcgcgcac gccgttcccg atctgtacga ggaactggcg ccgcgaccca 2890501 ccacaccgtt cgcccaggcg ctccggctgg cactgattga cgcggcaccc gatgtcgccc 2890561 aagaccccgt cgtggtggtg ctcagcccgg gcatctattc agaaaccgct ttcgaccagg 2890621 cgtatctcgc aacgctgctg ggtttcccgc tagtggaaag cgcggacctg gtggtgcgcg 2890681 acggcaagct gtggatgcgt tcgctgggca cgctgaaacg cgttgacgtc gttcttcgcc 2890741 gcgtcgatgc ccactacgcg gatccactgg atctacgcgc cgattccagg ctcggtgtcg 2890801 tcggtttggt ggaagcgcag caccgcggaa cagtgaccgt cgtcaacacg ctgggcagcg 2890861 gcatcctgga gaacccaggc ctgttgcgct tcctgccgca gctatccgag cgcctgctcg 2890921 acgaaagccc gctgctgcac accgctccgg tctactgggg cggcatcgcc agcgaacgct 2890981 cacacctact ggccaatgtc tcgtcgctgc tgatcaaaag cactgtcagc ggggaaactc 2891041 ttgtcggacc gacactttcg tctgcacaac tggccgatct ggcagtgcgt atcgaggcga 2891101 tgccgtggca gtgggtgggc caggagctgc cgcagttctc gtcggcgccc accaaccatg 2891161 ccggggtgtt gtcgtccgcc ggggtaggca tgcgactgtt caccgttgcc cagcgcagtg 2891221 gttacgcgcc gatgatcggc ggcctcggct atgtactggc gcccggccct gccgcatata 2891281 cgctgaaaac cgttgcagca aaagatatct gggtgcgccc aacggagcgt gcgcatgccg 2891341 aggtgataac ggtgccggtg ttggcgccgc cggccaaaac cggagcgggc acctgggcgg 2891401 tcagctctcc gcgcgtgctg tccgatctgt tctggatggg ccgctacggc gagcgcgcgg 2891461 agaacatggc ccggctgctg atcgtcaccc gcgagcgcta ccacgttttc cggcaccagc 2891521 aggacaccga tgaaagcgag tgcgtgccgg tgctgatggc cgcgctgggc aagatcaccg 2891581 gatatgacac cgcaactggc gccggcagcg cttacgaccg ggccgacatg atcgcggtcg 2891641 ccccgtcgac actgtggtct ttgaccgtgg atccggaccg gccgggttcc cttgttcagt 2891701 cggtggaggg gctggcactt gccgcccagg cggtgcgcga ccagctgtcc aacgacacct 2891761 ggatggtgct ggccaatgtg gaacgcgcgg tggagcacaa gtccgacccg ccgcagtcgc 2891821 tggcagaggc ggacgccgtg cttgcgtcgg ctcaggcgga gacgctagcc ggcatgctga 2891881 cgttgtccgg ggtggccggc gagtcgatgg tgcacgacgt gggctggacg atgatggaca 2891941 tcggcaagcg tatcgaacgc ggcctgtggc tgaccgcgtt gctacaagcc acgttgagca 2892001 ccgtgcgcca ccccgccgcc gagcaagcca tcatcgaggc aaccctggtg gcgtgtgaat 2892061 cgtcggttat ctatcggcgc cgcaccgtag gcaagttcag tgtcgccgct gtgaccgagc 2892121 tgatgttgtt cgacgcccag aacccgcgct cgctggtgta tcagctggaa cggctgcgcg 2892181 ccgacctgaa agacctgcct ggctcgtcgg gatcgtctcg tccggaacgg atggtggacg 2892241 agatgaacac ccgcctgcgc cgctcacacc cagaagagtt ggaagaggtc tccgccgacg 2892301 ggctgcgcgc cgagttggcg gaactgctgg ccgggataca tgcctcgctg cgtgacgtgg 2892361 ccgacgtcct caccgccact cagttggcgt tgcccggcgg catgcaaccg ctgtggggtc 2892421 cagaccaacg gcgggtgatg ccggcctaaa cggtgcgacg gctgtgagcc ggctcgaaat 2892481 ccggggccac ctcgtcgacg acggtgtgga tgaaccgcat cttctccagc acagcggccg 2892541 gcagcacaaa ggggtatagg tcgtcgtggc ccatcgagcg attgaccatg ttcagcgacc 2892601 acgacagcgg cagccacttg tcgatgatgg tattaaaagc gctggggccc aacgccggcc 2892661 ggtcgaaggt tgccgacgcc ggtgccaggc cgcaccaggc cgcggtgtcc agggcgtcgc 2892721 ggatatgcag gtaatgagcg aacgtctcgg cccaatcctc actcgcgtgc atggtcgcat 2892781 acgacgagac aaagctgtcc tgccaacctt ccggcgggcc gccacggtaa tgccgatcca 2892841 acgcctggga gtagtcagcg tccgggtctc cgaacaactc gttgaaccgg gacagatagt 2892901 cgcttgacga ggcgatgagt cgatagaagt agtagtgccc gatctcgtgg cggaagtgcc 2892961 caagcagggt ccgatacggc tcgtccatct cgacccgcag ctgctcccga tgcacatcgt 2893021 cgccttcggc gagatccagt gtgatgactc cgttctggtg tccggtggtc acgttctcgt 2893081 gcgcgctgga caatagccgg aaggccaacc catggtcagg atcctggtcg cggccgacga 2893141 tcggcagctt cagctcgtgt agctcggcga tcagccgccg cttggcacct tcggctcggg 2893201 cgaactccgc cagcccggcg gtgttggtat cgctgggccg ctcgatggtc agcacacaag 2893261 aactgcaaag tccgccgagc tgatcactgg gcaccagcca attgcattgc gcgaggtgga 2893321 gattggcgca gagttggaca tcggcgtcgt cggcgatgac cagcagcgcc atccgcccaa 2893381 gagaaaaccc cagcgcgctg ccgcacgaca ggcaggcgga gttctcgaat gccaggcgct 2893441 gcccgcaatt tggacagtgg aagtcacgca tgcagcgcat caccttcgaa gggcacgaca 2893501 tcgacagaaa cgtcgatcac actgttctcg gagttggtgt agatgatgcc gcgtagcggc 2893561 ggcacgtctg cgtagtcgcg gccgcggccc acgacgatgt agcgctggtc gaccaactgg 2893621 tcattggtgg gatccagccc cagccactcg aaccgcccgg gctgctgcgg agtccacacc 2893681 gaggcccagg catgcgtcgc gtcgatgccg atcatccgat cctttccggg cggcgggtcg 2893741 gtggccaggt agcccgacac ataacaggcc gccaaaccgt tggcccgtag gcaggcgatc 2893801 gccagcctgg cgaaatcttg gcatacccct tcgcgggcca gcagcacctc gttgactcct 2893861 gtggaaatcg tcgtggaacc cgagcggtag gtgaagtcgg tgtagatccg cgacgcgaga 2893921 tcgcgcaata cctcgaccag ggggcgtttg ggcaggaagc taggagccgc gtactcacgc 2893981 accgcatcgg tgatctccgg cgggttcaag tccagggtga actcggtggc tagcgatccg 2894041 ggcagcccgg cgggccgggc cgcctcccac ggttgcagcg ccggcccgct ggtgtaaagc 2894101 ccgggcggcg gcggggacac gtcgacgatg gaatcgctgg tgatcgtcaa ggtgcggtgc 2894161 ggttcggtga cgtggaaata ggagctgatg ttgccgtacc cgtcgcggct ggtggaccgg 2894221 tcggcggggg ccgggtcgat ggtcagccgg tgtgcgacac aacgctgccg cagcgaattc 2894281 cgcggcgtga gaaacccgcg gccataggag ctggtcacca cgtcggagta gcggtattcg 2894341 gtgcggtgtg ttactcgata gcggtgagtg cccgacaacg gcaacgacaa cgagctatct 2894401 gctgacaaaa agctacctcc tggctgatca catcacacgc cggcggctcg tccggcgcga 2894461 tcgtcgcgca atgtggcgcc aagcgcacca tagccggagc acaattaaag cgtggctacc 2894521 tgggacgacg tcgcccgtat cgtgggtggg ctgccgctga ccgcggagca ggcaccgcac 2894581 gactggcgtg ttggccgcaa gctgctggcc tgggaacggc cgctgcgcaa gtccgaccgc 2894641 gaagccctga ccagggccgg atcggagcca ccgtccggcg acatcgtcgg tgtccgagtg 2894701 tcggacgagg gggtgaagtt cgccttgatt gccgacgagc cgggcgtgta cttcaccacc 2894761 ccgcatttcg acggctatcc agcggtgctg gtcaggctgg ccgagatcga ggttcgcgac 2894821 ctcgaggagt tgatcaccga ggcctggctg atgcaggcgc cgaagcagct ggtgcaggcg 2894881 tttctcgcca attcaggctg acatgcccga cgggcccggg cgttcgatta cccgttgtag 2894941 atcggtgaca cacgcttgga cgatatcggc gcgcaccact tcgttgctgc cacaagcagc 2895001 cgattgcagt gtcgacgcgg ttgcgcgggc ggcggccgcg tgctcgttcg ctgccgtcgg 2895061 atccgcgtcg gccaggccgg ttcccgcggc gaggtcggtg agcacggcgt gcacgggcgt 2895121 tggcagctta tcgccaccag gcccggcaat ggtgcgagcc agatgcaaca ccgaactgac 2895181 cagcagggcc aggtagacgg cctgttgatc gagatcgcgg acagtgctgc gcacccccca 2895241 tcggcggggc gctcgccgcg ccaccatggc agcgttggcg cgcacctcga tgagcccgtt 2895301 cagctgctga tgcagtcgat cagcggctgc catcggccag tcgggcgggg cgctggtggg 2895361 atcgctcacc gtgttcacca gctcggcgag gatgtcgcgc acagcggcca acacgtcggc 2895421 gcgcgcactg cacagcatga ccaccgggtc gggcgggaag agcagaatgc tgaacacgat 2895481 agccagccca ccaccgacca gcgcgtcgaa gaggcgttcg aaaaccacac tgccgttgga 2895541 cgcgaagacc aagaccagca ccgcggagac ggcggcctgg ttgatgaaca ttaagccttg 2895601 cgcgaccaac ccgcgtgcgc acagcaccgc gaccgacaac gcgatgaaca ccaccacacc 2895661 catggcgatc ggtccggaac caagcagagc atgcacgcca gcacccagca cgatccccag 2895721 cgccaccccg acgatcatct gttgggcacg tcgtgcgcgc agcacgttgg tcgccgacat 2895781 gcacaccaca gccgaaatcg gcgcgaagaa cgcctgcgga tggttgaaca cgtcatgggt 2895841 gagataccac gcgaggccgg cgacgaccga tgtctgggtg atcggccaca gcacggtgcg 2895901 caaccgttgg gcgaccgcac ggccgccgca ggccgtcctg actagcagcg aagcgctcat 2895961 gaacgcctat ttattcacac tcgggtgcga cgtcgtaacc gcaaagatct ggtcatgcct 2896021 gctggacccg cttgggctgg gcatctattc cggactcctt acgttgctga gcggtaatgg 2896081 gcgccggcgc gtcggtgagc ggatcgacgc cgccgccggt cttcgggaac gcgatcacct 2896141 cacggatcga gtccatcccg gccagcagcg cggtggtccg gtcccacccg aacgcgattc 2896201 cgccgtgcgg cggtgcgcca aacatgaacg cctccaacag gaatccgaac ttttcctccg 2896261 cctcggcctt gtccaggccc atcaccgcga acacccgttc ctggatatca cggcggtgga 2896321 tacgcaccga gccgccaccg atctcgtggc cgttgcagac gatgtcgtac gcgtcggcca 2896381 gcacgctgcc ggtatcggat tcgatgcggt cctcccattc cggtttcggc gcggtgaagg 2896441 catggtgcac cgcggtccag gcccccgagc cgaccgcgac ctcaccggcg gcggtcgctt 2896501 cgtcggccgg ctcgaacagc ggcgggtcaa cgacccagac gaatgcccac gcatcggggt 2896561 caatcaggcc cagccggttg gcgatctcga cgcgggccgc gcccagcagt gcccgcgacg 2896621 atttgaccgg accggccgag aagaagatgc aatcgccggg tttggccccg acatggtcgg 2896681 ccagtccggt gcgctcggcc tcggtcaggt ttttggccac cggaccgccc agcgtgccgt 2896741 cttcggcgac cagcacgtag gccagtccgc ggtggccgcg ctgcttggcc cagtcctgcc 2896801 agccgtccag cgtgcgccgc ggctgcgacg ccccgccagg catcaccacc gcgcccacat 2896861 acggtgcctg gaagacacga aatgtggtgt cggagaagaa atccgtgcat tcgacgagct 2896921 ccagcccgaa ccgcaggtcg ggtttgtccg taccgaatcg gcgcatcgct tcggcatagc 2896981 cgatccgcgg gatgggcgtc ggaatccggt agcctatcag cgcccacagc tcggtcagaa 2897041 cttcctcgga gatcgcgatg atgtcctcgg cgtcgacgaa gctcatctcc atatcgagct 2897101 gggtgaattc gggctggcgg tcggcgcgga agtcctcgtc gcggtagcag cgggcgatct 2897161 ggtagtagcg ttccatcccc gccaccatca gcagctgctt gaacagctgc gggctctgcg 2897221 gtagggcgta aaacgaaccg gggtgcagtc gggccggcac caggaagtcg cgcgctccct 2897281 ccggggtcga gcgggtgatc gtcggcgtct cgatctcgac gaagtcgtga cgcgccagca 2897341 ccgcgcgcgc agcggcattc acccgggaac gcagtcgaat cgccgcagcg gggtcgtcgc 2897401 ggcgcagatc gaggtagcgg tacttcagtc gcaactcctc acccgccggt tcgtccagct 2897461 gaaacggcag cggcgcacat tcgcccagca cggtcaacga cgtggcgttg acctcgatct 2897521 cgccggtggc gatctccggg ttggcgttgc cttccgggcg gatctcgacg acgccggcca 2897581 ccgatacgca gaattccgca cgcagccggt gagcctgcgc cagcacctca gtgtcctggg 2897641 ggtcgcggaa caccacctgt gcgatgcccg aagcgtcccg cagatcgatg aagatcacgc 2897701 cgccgtggtc gcggcggcga gccacccagc cggccaatgt cacctgctgc ccggcgtcgc 2897761 cttcccgtag caaacccgcg gcgtggctgc gcagcacaaa cactcccctt caaccggatt 2897821 aaccgactgc tcagtctaga ggtgcccgcg gcgcacatcg gtcacgcagg ataatttcgg 2897881 ctcatctcaa caaacattgc aacaggcatt gccctagtcg gacccggtgc cgtcggaacg 2897941 acggtcgccg cgctgttgca caaggccggg tattcgccgc tgttgtgcgg ccacactccg 2898001 cgcgccggga tcgagctccg gcgagacggc gcagacccca tcgtggtgcc cggtccggtg 2898061 cacaccagtc ctcgggaggt tgccggcccg gtcgatgtgc tgatcctggc ggtcaaggcc 2898121 actcagaacg acgccgcacg tccctggctg acccgcctgt gcgacgagcg caccgtggtg 2898181 gccgtgctgc aaaacggtgt cgaacaggtc gagcaggtcc agccgcattg tccgtcctcg 2898241 gccgtggttc ccgcgatcgt gtggtgttcg gccgagaccc agccgcaagg gtgggtgcgc 2898301 ttgcgcggtg aagccgcact ggtcgttccc accgggcccg cggccgagca gttcgccggg 2898361 ctgctgcgcg gtgccggcgc cacggtggac tgcgaccccg acttcaccac ggcggcctgg 2898421 cgcaaactac tggtcaacgc gctggcggga tttatggtgc tgtccggacg gcggtcggca 2898481 atgttccgcc gcgacgacgt cgcggcattg tcgcgccgct atgtcgccga atgcctggcg 2898541 gtggcgcgcg ctgagggtgc ccgactcgat gacgacgtcg tcgacgaagt ggtccgcctc 2898601 gtccggtcgg ccccgcagga catgggcacc tcgatgctgg ccgaccgggc agcccaccgg 2898661 ccactggaat gggatttgcg caatggggtg atcgtccgca aggcccgcgc ccacggcctg 2898721 gccaccccga tcagcgacgt gctggtgccg ctgctggcgg ctgccagcga cggtcccgga 2898781 tagcaatgta gctaatgtct agatcatgta cccctgcgag cgggtaggcc tgagcttcac 2898841 cgagaccgcg ccttacctct tccgcaacac cgtcgacctg gccatcacgc ccgagcaact 2898901 cttcgaagtg ctcgccgacc cgcaggcctg gccacgctgg gcaacggtga tcacaaaggt 2898961 gacctggacc agtcccgaac cgttcggcgc cggcaccacc cgcatcgtcg agatgcgcgg 2899021 gggtatcgtc ggcgacgaag agttcatttc gtgggagcct ttcacccgca tggcatttcg 2899081 gttcaacgaa tgctccacca gagccgtcgg cgcgttcgcc gaagactatc gggtgcaggc 2899141 catccccggt ggttgccggc tgacctggac catggcgcag aaactcgccg gcccggcgcg 2899201 gccggcgctg ttcgtcttcc ggcccctgct gaacctggcg ctgcgccggt ttctaaggaa 2899261 tctgcgcagg tataccgacg ctcggttcgc cgctgcgcag cagagttagg ctggatcggc 2899321 cgatttcggg agcgtgcgat gaccttcaac gagggtgtgc aaatcgatac cagcaccacg 2899381 tcgacctcgg gtagcggtgg cgggcggcgc ttggccatcg ggggcggcct cggtgggcta 2899441 ctggtggtgg tggtcgcaat gctgctcggc gtcgatcccg gtggcgtgct gagccaacaa 2899501 cctctcgaca cccgcgacca cgtagcaccc ggtttcgacc tgagccagtg cagaaccggg 2899561 gccgatgcca acaggttcgt gcagtgccgg gtggtggcca ccggtaactc cgtggacgcg 2899621 gtatggaaac cgctgttgcc cggctacacc cgcccacaca tgcggctgtt cagcggccag 2899681 gtaggcaccg gatgcggacc ggccagcagc gaggtcgggc cgttctactg cccagtggac 2899741 aaaacggcct acttcgacac cgacttcttc caggtgctgg tcacccaatt cggttccagt 2899801 ggcggcccat tcgcggaaga gtatgtggtg gcccatgaat acggccatca cgtgcagaac 2899861 ctgctggggg tgctcggccg cgctcagcag ggtgcgcaag gtgctgcggg cagtggcgtg 2899921 cgcacggagt tgcaggcgga ctgctacgcc ggggtgtggg catactacgc gtccaccgtc 2899981 aagcaggaga gcaccggtgt gccttacctg gagccgttga gcgacaagga catccaagac 2900041 gccctcgcgg ccgcggcagc ggtgggcgac gaccgtatcc aacagcagac gaccggacgc 2900101 accaaccccg agacctggac gcatggctcg gccgcgcaac ggcagaagtg gttcactgtc 2900161 ggataccaga ctggcgaccc caacatctgc gacacctttt ccgccgcgga cctggggtag 2900221 gcgaattacc agggacgagt cgagcactgc acgccgctgc cgccgtcctg cgacaccacc 2900281 acctggccgt ctacaacaat ctcgcagtgg aactccggat tgacccgcag gccgccgctg 2900341 gcggtgacga tcgcccactg gctcgggttt gccagcgtgg cggtatagac cagcggctga 2900401 ccgccagcga tcggagtgtg caaggtaatc atgtacttcg atgaatcggc attgaaagcc 2900461 gccatgctgg gcggatcggc gctcatgtac cgaatgttgg ccatcaggtc gctggtggtc 2900521 gtgacggtgt aggtcacctg atgcccgacc ggatccgcgc gggcaatcgc cgggatgacc 2900581 ccgctgagcg cggctccggc aaacgtcacc agcgcgacgg cgcttggcac tgtgcgcacg 2900641 gacgtcatat ctaaaacgct accggatgcg ttaccgacgc cggccggcac tgcatgcgat 2900701 gaccgtcgcc cgccatccgg gcaagccgaa ttgcgtgagc cgcaccgcca ttagcagccg 2900761 aaagctgtcg ttggcctcgg gcttcgcgct ctggaggcga tcgctggtgt gagcgtctac 2900821 gcagttcaga aagcctttcc gagcaacgcg ccgaggtaac ttcagatttc ggcagccggt 2900881 ttacccgcag gtaaaccagg gcgggtatga aacgtgagtg ggcgccgatc tgaagcagcc 2900941 gcaggatgcc gattcacccc cgaaaggggt tagccgccgt aggttcctga cgacgggcgc 2901001 ggcagcggtt gttgggacag gtgtcggcgc gggcgggacc gcgctgctgt cgtcacaccc 2901061 ccggggtcct gccgtctggt atcaacgtgg tcggagcggc gcgcctccgg tgggtggtct 2901121 gcacctgcag ttcggccgga atgccagcac cgaaatggtg gtgtcctggc ataccacgga 2901181 caccgtcggc aatccgcgag tcatgctggg cacgccaacc tctggcttcg gcagcgtcgt 2901241 ggtggccgag acccggtcgt accgggatgc gaagtccaat accgaggtgc gcgtcaacca 2901301 cgctcacctg accaacctga cacccgatac cgactacgtc tacgccgcgg tgcacgacgg 2901361 tacaactccg gagctcggga ccgcacggac cgcaccgtcg ggtcgaaaac cgctacgctt 2901421 caccagcttc ggtgatcagt ccactcccgc gttgggcaga ctggccgacg ggaggtacgt 2901481 cagcgacaac atcggatccc ccttcgccgg tgacatcacg attgcgatcg agcgtattgc 2901541 cccgttgttc aacctgatca acggtgacct gtgttacgcc aacctggcac aagaccgaat 2901601 tcgcacctgg tcggactggt ttgacaacaa cacccgctcg gcgcgctacc ggccgtggat 2901661 gccggcagcg ggcaatcacg agaacgaagt cggtaacggg ccaatcggtt atgacgccta 2901721 tcagacctac tttgcggtac ccgactcggg atccagcccg caactgcgcg ggctatggta 2901781 ctcgttcacc gccggctcgg tgcgggtgat cagcctgcac aacgatgatg tgtgctacca 2901841 ggacggtggc aactcctacg tacgcggcta ttcgggcggc gaacaacggc gctggctgca 2901901 agccgaactc gccaacgctc ggcgcgactc ggaaatcgac tgggtggtcg tctgcatgca 2901961 tcagaccgcg atctccaccg ccgacgacaa caacggtgcc gacctcggaa tccggcagga 2902021 atggctaccg ctgttcgacc agtaccaggt cgacctggtg gtgtgcggcc acgaacacca 2902081 ctacgagcgg tcacatccgc tgcgcggggc cctgggcacc gatacccgaa caccgatacc 2902141 cgtcgacacc cgcagcgacc tcatcgactc aacccgggga accgtgcacc tggtaatcgg 2902201 tgggggcggc acgtcgaagc cgaccaacgc gctgctcttc ccgcagcctc ggtgccaggt 2902261 gataaccggc gtcggggatt ttgatcccgc gatccggcgt aagccgtcca tattcgtgct 2902321 cgaggatgcg ccgtggtcgg cgttccgcga ccgcgataat ccttacggct tcgtggcctt 2902381 cgacgtcgac ccgggtcaac ccggcggcac tacctcgatc aaggcgacgt attacgcggt 2902441 gactgggccg ttcgggggac tcaccgtcat cgaccaattc accttgacca agccgcgcgg 2902501 cggatagctc agaacagggt cgcctgaacg ggtaccagtg ccgcttcggt ctccggcggc 2902561 gccgggcgat gatcacccgc caaccgatac tttgcgatca gcggtgccac ccgttcccgc 2902621 agcatctcgc ggtagctcgg cggtagatat ggcccgcgcc ggtacagttc gcggtaccgg 2902681 ctgaccagtt cgggatgcgc gcgggccagc cagcacatga accagccgcg cgtcgaaccc 2902741 cgcagatgca ggccaaagac cgttacaccg gtggcgcctg cggccgcgat ctggcccaac 2902801 agttggtcaa ggtgctcgcc ggagtcggtg agttgtggca gcaccggcgc gaccatcacg 2902861 tgacagtcca agccggcggc gcgaattgcg gtaatgagcg ccagccgcgc ctgcggtgtt 2902921 ggcgtacccg actcgacatc ccggtgcagc tccgggtcgc caacggccag cgacaccgcc 2902981 accgacaccg gcacttgttg ggcggcctcg gcgatcaacg gcaagtcccg tcgcagcagg 2903041 gtgcccttgg tcaggatcga cagcggcgta ccggatgccg ccagcgcgcc gatgatgccc 2903101 ggcatcaggg cgtagcggcc ctccgcgcgc tggtaggggt cggtgttggt gcccaacgcg 2903161 acggtctcgc gccgccagga cggccggcgc aactcgtgac gcagcacagc ggcgacgttg 2903221 gtcttgacca ccacctgggt gtcgaagtcg gtgcccggat tgaagtccag gtactcgtgg 2903281 gtggggcggg cgaaacaata gcgacaagca tgcgagcagc cgcggtagcc gttgacggtg 2903341 tagcgaaacg gcaacgcggc cgcgttgggc accttgttca gcgctgattt gcacaacacc 2903401 tcgtggaagg tgatgccgtc gaattgtggc gcgcgaacgc tgcggaccag gccgatccgc 2903461 tgcaaccccg gcagcgcccc gtcgtcaacg ggcatcccgt tcaccgcgac ggcttgccgg 2903521 gcccaacgca taccattatt cgaacaaccg ttctatactt tgtcaacgct ggccgctacc 2903581 gagcgccgca caggatgtga tatgccatct ctgcccgcac agacaggagc caggccttat 2903641 gacagcattc ggcgtcgagc cctacgggca gccgaagtac ctagaaatcg ccgggaagcg 2903701 catggcgtat atcgacgaag gcaagggtga cgccatcgtc tttcagcacg gcaaccccac 2903761 gtcgtcttac ttgtggcgca acatcatgcc gcacttggaa gggctgggcc ggctggtggc 2903821 ctgcgatctg atcgggatgg gcgcgtcgga caagctcagc ccatcgggac ccgaccgcta 2903881 tagctatggc gagcaacgag actttttgtt cgcgctctgg gatgcgctcg acctcggcga 2903941 ccacgtggta ctggtgctgc acgactgggg ctcggcgctc ggcttcgact gggctaacca 2904001 gcatcgcgac cgagtgcagg ggatcgcgtt catggaagcg atcgtcaccc cgatgacgtg 2904061 ggcggactgg ccgccggccg tgcggggtgt gttccagggt ttccgatcgc ctcaaggcga 2904121 gccaatggcg ttggagcaca acatctttgt cgaacgggtg ctgcccgggg cgatcctgcg 2904181 acagctcagc gacgaggaaa tgaaccacta tcggcggcca ttcgtgaacg gcggcgagga 2904241 ccgtcgcccc acgttgtcgt ggccacgaaa ccttccaatc gacggtgagc ccgccgaggt 2904301 cgtcgcgttg gtcaacgagt accggagctg gctcgaggaa accgacatgc cgaaactgtt 2904361 catcaacgcc gagcccggcg cgatcatcac cggccgcatc cgtgactatg tcaggagctg 2904421 gcccaaccag accgaaatca cagtgcccgg cgtgcatttc gttcaggagg acagcccaga 2904481 ggaaatcggt gcggccatag cacagttcgt ccggcggctc cggtcggcgg ccggcgtctg 2904541 accgcaaccg ggcctcatgc taggccaccg gcgaccgacg gacttcccgc gcgagccgct 2904601 ccaaaagcct cagccgctcg gggtggtcgg ctcgtcaaac gacagcccta tcagccgaga 2904661 caccacgttg tgcagcgcgt caaacacctc caggatctct tctcggctac tcgaaaccca 2904721 tgtttgaaac gtatgacgcc caccgacaag aatggccgcc ttgaggccct gcggccacgg 2904781 tggcgcaagt gatttcggtg actccggctg gaagcggcga ctacccagcc agccgcgaaa 2904841 ttacttcggc cacaaccgaa tccatcgaga ccgaaacttg ctcacccgtc gtcaagtcct 2904901 tcactgcgac cgtcccggcc tcgatgtcgc ggtcgcccgc taccaacgca acacgggcgc 2904961 cggaacgagc ggccgcgcgc atcgcgcctt tgagcccgcg atcaccatag gcaaggtcaa 2905021 cccgcacccc ggccgcgcgc agtcgtccag ccagcaccgc cagcctgagc ttggccgcct 2905081 cgccaagcgg cacgccgaac acgtcgcacc gggcgctgtc ccccgccgtc ttgccctcgg 2905141 cccgcagcgc cagcacggtc cggtccacgc ccagcccgaa cccgatgccc gacaagtcct 2905201 gcccgccaag ctggtgcatc aggccgtcgt agcgcccccc gccgccgatc cccgattgcg 2905261 caccaagccc gtcatggacg aactcgaagg cggtcttggt gtagtagtcc aggccgcgca 2905321 ccatgcgcgg gttgatgaca tagggcactc caagcgcgtc cagatgggcg agcacggtgt 2905381 cgaaatgctg cttggcgaca tcagacagat gatccagcaa caccggcgcc gacgccgtca 2905441 tcgcacgcaa ttcgggtcgc ttgtcgtcga gcacccgcag cggattgatc cctgcgcgcc 2905501 tgcgggtgtc ctcgtcgaga tcgagtccaa acaagaactc ctgcaacagt tcccggtact 2905561 gcggacggca actctcgtct cccagggagg tgatttccag ccggaacccg tcgagaccca 2905621 acgagcggaa cccggcgtcg gcaatggcga tcacctcggc gtccaacgcc gggtcgtcga 2905681 cgccgatcgc ctccaccccg acttgctgta actggcgata ccggccggcc tgcggacgct 2905741 cgtagcggaa aaacgggccc gcataacaca acttcaccgg cagcgcgccg cgatccagcc 2905801 cgtgttcgat caccgcacgc accaccccgg cggtgccctc gggccgcagc gtcaccgagc 2905861 ggtcgccacg gtcggcgaac gtatacatct ccttggacac cacgtcggtg gattcaccca 2905921 cgccccgggc gaacagggcg gtgtcctcga agatgggcag ctcgatgtgg ctatagccgg 2905981 cttgacgggc cgccgcgagc agcccgtcgc gcaccgcgac gaactgcgcc gagtcgggcg 2906041 ggacgtagtc cggtaccccc ttgggggccg aaaatgacga gaattccgtc accggctcaa 2906101 gccctcaagg aacggattga agcgccgctc ggccccaatg gtggtggagt tgccgtgccc 2906161 gggcagcacc accgtgctgt cgtcgagcac caggagtttg tcgacgatgg agcgcaacag 2906221 gtcgcggccg ctgccgccgg ccaagtcggt gcggcctatc gcacgctcga acagggtgtc 2906281 accggtgaac acgatgtcct tgtcgttgtt ggtcgcctgc aggacccgga agaccaccga 2906341 cccgcgggtg tgacccggtg tgtgatcgat gttgaccgag atgccgccga ggtcgatctt 2906401 gtcgccgtct cggtccagct ccacaacctg tttaggctca cgaaagaacg cacccgcaac 2906461 cagctgcgct atccgcgggc ccaggccgta gatggggtcg gtcagcatga accggtcggc 2906521 gggatgcaca taggtggggc agccgaaggt gtctgagacc ttctgcgcgg accagatgtg 2906581 atcgatgtgt ccgtgggtga gcagcaccgc ggcaggggtc agccggttct tgtcgaggat 2906641 gcgacgcagc gtgcccatcg caccctggcc cggatcgacg atgacggcgt cggttccggg 2906701 ccgctcggcc agcacataac agttacacgc cagcaacccc gcaggaaatc cggtgatcaa 2906761 cacggttccc agtttcccat ccccggcgtc cggggacgag gcgggccgcg aacatgggcc 2906821 acttgacacc ggtcgcggcg ccccgattag cctgtgcttt cgtgccgacc aatgctcagc 2906881 gacgtgccac agccaaacgc aaactcgaac gacaactaga gcgccgcgcc aagcaagcca 2906941 aacgccgtcg catcttgact atcgtcggtg gctcactcgc agcggtggcc gtgatcgtcg 2907001 cggtagtcgt cacggtggtg gtcaacaagg acgaccacca gagcaccacg tcagcaaccc 2907061 ccaccgactc ggcctcgacc agccccccgc aggccgcgac cgctcccccg ctgccgccgt 2907121 tcaagccgtc ggccaacctc ggcgccaact gccagtaccc gccgtcgccg gacaaggccg 2907181 tcaaaccggt caagttgccc cggaccggca aggtacccac cgacccggcc caggtcagcg 2907241 tgagcatggt gaccaaccag ggcaacatcg gtctaatgct ggccaacaac gaatcgccgt 2907301 gtacggtcaa tagtttcgtc agcctcgcgc agcagggttt cttcaagggc accacttgtc 2907361 accggctgac cacctcacca atgttggcgg ttctgcaatg cggcgaccct aagggcgacg 2907421 gcacgggcgg tccgggctac cagttcgcca acgaataccc caccgaccaa tactcggcga 2907481 acgaccccaa gttgaacgag cccgtcatct atccgcgcgg gacactggcc atggccaacg 2907541 ccggccctaa taccaacagc agccagttct tcatggtcta ccgggactca aagctgccac 2907601 cccaatacac cgtgttcggc acgatccagg ccgacggact gaccaccctg gacaagatcg 2907661 ccaaggccgg cgtcgccggt ggcggcgaag acggcaagcc cgccaccgaa gtcaccatca 2907721 cgtcggtgct gctggattag cccgacgctc gccgagcaga cacagaatcg cacgaaatca 2907781 gcccgcccaa tgcgattctg cgtctgctcg gcggagaaaa gcgcgctacg cggccgaggt 2907841 cacccggtag acgtcgtaga caccttcgac gttgcggacg gcgttgagca ggtgcccgag 2907901 gtgcttgggg tcacccatct cgaaggtgaa tcgactgatc gccacccggt cccccgaagt 2907961 ggtgaccgac gcggacagga tattgacctt ctcgtcggcc agtgcgcgcg tcacatccga 2908021 cagcagccgg tgccggtcga gtgcctcgac ctggattgcc accagaaaca ccgacgacgg 2908081 cgacggcgcc catagcacct cgatgatgcg ctcggcctgc tgctgcagcg atgcggcgtt 2908141 ggtgcagtcg gtgcggtgca cactgacccc gccgccacgg gtgacgaacc ccataatcac 2908201 atcgcccgga accggcgtgc agcacttggc cagcttggtc agcacgcccg gggcgccggg 2908261 gacggagacc ccgacatcgt cggtgctgcg tgggcgccgc ggcatggtcg ccggcgtgga 2908321 ccgctcggcg agttcctctt ccgcctggtc gataccgccg agctcggcca acaaccgctg 2908381 cacgacgtgt ttcgccgaca cgtgcccctc accgatggcg gtatagagtg ctgacacgtc 2908441 cgcgtagtgc agctcgcggg ccaccgccgc catggactca ccattgacca agcgctgcaa 2908501 cggaagtcca ccgcggcgca cctcgcgggc catcgcatcc ttaccggtct ccaacgcctc 2908561 ctcacgccgc tccttggcga accactggcg gatcttcgtc tttgcgcgcg gcgacaccac 2908621 gaactgctgc cagtcccgcg acggcccggc gttcggcgcc ttggacgtga aaacctcgac 2908681 aacttctccg ttttccagct tgcgttccag cgctaccaac cggccgttca ctcgggcgcc 2908741 gatgcagcgg tggcccacct ctgtgtgcac cgcgtaagcg aagtccaccg gcgtcgaacc 2908801 ggttggcagc gtgatcacgt cgcccttggg ggtaaacacg aaaatctctt gcaccgcaag 2908861 gtcgtagcgc aatgattcca agaactcacc ggggtcggcc gcctcacgtt gccagtcgag 2908921 cagctgacgc atccaggcca tgtcgtcgat ctccgcggcg gcatgcggat gaagaacacc 2908981 gttgcggccc ttggcttctt tgtagcgcca atgcgcggcg atgccgtatt cggcggtgcg 2909041 gtgcatgtcg cgggtacgga tctgcacttc cagcggcttg ccctcaggcc cgaccacagt 2909101 ggtgtgcagt gactggtaca caccgtatct gggctgggcg atgtagtcct tgaaccgacc 2909161 cgccatcggc tgccatagcg aatgcactac gccgacagcc gcgtagcagt cccggatttc 2909221 gtcgcacagg atgcgcacac cgaccaggtc gtggatgtcg tcgaagtcgc ggcccttaac 2909281 gatcatcttc tggtagatcg accaatagtg cttggggcgg ccctccaccg tcgccttgat 2909341 cttcgacgcg gtcagcgtgt tgacgatttc ggcacgcacc ttggccaggt aggtgtcccg 2909401 ggacggcgcg cgaccggcga ccagccggac gatctcctcg tacttcttgg gatgcaggat 2909461 cgcgaaggac aggtcctcca actcccactt gacgctggcc atgcccagcc gatgcgccag 2909521 gggtgcaatg acttccaacg tctcacgggc cttgcgggcc tgcttctccg gcggcaagaa 2909581 gcgcatggtg cgcatgttgt gtaaccggtc agccaccttt atcaccagca cccgcggatc 2909641 gcgggccatc gcggtgatca tcttgcgaat agtctcgcct tcggcggcgc tgcccaacac 2909701 cacccgatcc agcttggtca ccccgtcgac gagatggccc acctcttcgc cgaattcctc 2909761 ggtcaacgcc tccagggtgt aaccggtgtc ctcgacggtg tcgtgcagca gcgcggccac 2909821 caaagtggtg gtgtccatgc ccaactcggc cagaatgttg gcaacggcca acgggtgggt 2909881 gatgtaggga tcaccggact gccgcaactg gctggcatgc ctttggtcag cgacctcgta 2909941 ggctcgctgc aagatcgaca ggtcggcctt gggatagatc tcccggtgca ccgccaccaa 2910001 cggctcgagc accggattgg tggtgctgcg ctgggcggtc atccgccggg ccaatcgggc 2910061 ccgcacccga cgcgacgcgc tgatgctggt cttaagagtc tcgaccggcg actcgggcgt 2910121 ctcgagagcg ggctcgagag ccgcagaagc ctccgtgggc ggtgcaaccg cttgcgccgt 2910181 gagctggtcc tcggccacgt tcgtcacctc cgacctagag gatatccctc acaggcggct 2910241 caggctgtgc accggcagcg gtgcgagcgc cgcgcgaccg ctcaaccccg caagttccac 2910301 cactacggcc gccccggcca cgttggcgcc accgcgctca agcaggcgtc gcgtcgcgcc 2910361 gatggtgccg ccggttgcta acacgtcgtc aatgatcacg acacggcggc ccgcaacctc 2910421 gatgccctca gcgagaatct ccagagtggc ggcgccgtac gccctgtagt actcctcgct 2910481 gagcaccggc cggggcagct tgccgccctt gcgaacggcc agcacaccca cttcgagccg 2910541 ggtggcgacc gcggctgcca ccagaaaccc gcgggcgtcg acgccggcca ccaggtcagc 2910601 tccggacgcc cgatcggcca gcgcttcggt taccgcggcc aatcctcttc ggtcggcgaa 2910661 tagcggggtg aggtccttga actcgacgcc gggaaccgga aagtcggcca catcccgggt 2910721 cagcgacgca accacgtcgg ccacagatat ggctgagctc cggcgggact caccgagcgc 2910781 caatacccgc ccgtcgtcga cccaacgctg ccggcggcgc ttcccccgtg cctttaagga 2910841 gagccccgtc gcgatcacgt tcaacacgta gtcaccagcc catgtaccgc catggcacac 2910901 atcctctccc agacagcccg gagcacctgc gacactacgc tccgataggt ccgcttctcg 2910961 tcgtggaatt ctgtcaatta cctgcagatg gcactggcca tcgtcaccgc gccagcgccc 2911021 agcgatccat gttccaccct gccccccatc gcgtcggatt cctgctcacc gcatacattt 2911081 tcgtcgacat caacaacgtg cgctgctgcc ggtacaacgg caaggttggc atctcatccc 2911141 agagcaccgg cgcggcctcg gcaagcaacc tggcccgctc ggcggggtcg gccgacaccg 2911201 cgagcgcgct gatgatgccg tcgatctgag cgtttgcgta ccccgataga ttgtttccgt 2911261 tgccgctgtg caagtcatag gcatccatcg cacacgatcc gctcgatccg ctgccggtgg 2911321 ccccaccggt gctcgccaac aatacgtcaa tctttccgtc ccgcagcgct tgcggtccgg 2911381 gtgtgtccac cgtcacatcc gaaacggtga tcccggccgg ggcgcaggcg tcggcaatgg 2911441 ttccgatggt ggccgccaac cgagcgttgg gcctgccgta gccgatccgc acggtcagcg 2911501 gcgtaccacc cagcgcgtcg cgagcggcgg cggggtccac ccggccgaac tgacgtgctt 2911561 cggcggcgcc gtcggcatcg gtgagggcat cgtcggtcgc cggggacagc cgcgagttgg 2911621 caatcggaac cccggcatcc cgagcgatcg cgtcccgggg tacacacaac gcgagcgcgc 2911681 ggcgggtgcg gctttgcgcg agtgaacctt gtggtgcgaa gatcagctgc tcgatcccgg 2911741 ccgacgggta gtcggtgcgc tggtagctgt cgggggttac cagggatccc gatgaaccgg 2911801 ccgcgacgtc gaccacgtcg acgctgcggt tgttgacccg gtcttggata tcggctccct 2911861 gcggccagac ggtgatccgc ttcgtgatcg ccttggtgcc ccaccaacga tcattggcga 2911921 cgagcaccac ggcgccatcg tccaggacgg attcgatctt gtacggtccc gacgagggga 2911981 agcggctgcg gacttcgtcg tggctgcggc ccggcttgag gtcccacgtg gaattccaca 2912041 gtcgcgcaat ctgttccacc gctgacacgt tgttgcttag caacgccgcg gtaacatcga 2912101 tgtgcagctg gtcggcgatc acgtgcgacg gcatcagcga cgtcgcggtg aacagctggg 2912161 agtggtcaac gacactgcga tccgggatga acgacacccg ggcctttttc tgccccgccg 2912221 tgcactcgat gttggcgatg tcgacatagc cggcctgcgt agcagcgtcg aagccgggaa 2912281 agcggccgga ttgtgccgcc caggccaata ccaggtcgtc acaggtcacc ggcctgccgt 2912341 cggaatagac ggcgtcgtcg gagatctggt agtcgaggat caacggcgac ccctccacca 2912401 ccgagaccgt tccgaagtcg cggtcagcca ccacttggcc gtcggggccg tgatagccaa 2912461 acccggtgag agtccgggcg aatgcctgcg ccccggccga cgcggcaccg atgacggtat 2912521 tggtgttgta ggtgaccagc gcgccgtcga ccacgtagtc gatctgagcc gcggcgctgc 2912581 ccgaacacgc ggtcagcgtg gttgcggcga ccaacgtcgc ggtaccaacg actcgcaggc 2912641 cggcgatgcg cgtatgacgc cggcggcggg gggccaccgc gcctaccgcc gaccggcgtt 2912701 ccgcttgccg gtcggacgcc tggtaccgac gggacgcact gggcgcgccc ccggcgccgg 2912761 cttgctggag ccctgggccg cccgcggggc ggattggctg ctggcctgcg tgatccccac 2912821 cagcgactgt tcatcagcgg ctgccggctg ctcgccgcca tccgtgctgg cgtcctctga 2912881 tcccgccggc gagccggagt tacgccgttt gagcacccga cgggtgtggt tgcgcaccaa 2912941 ctccgtgcgc tcacggaggg taaccaacag cggcgtggcg aagaagattg acgagtaggt 2913001 gccgatgatg atgccgatca gctgcaccag cgccaggtct ttgagagtgc cgacgcccag 2913061 cagccagacc gccaccacca tcagcgccaa caccggcaac acgccgatca ggctggtgtt 2913121 gatcgaccgc atgaacgtct ggttgatcgc caggttggcc tgctcggcga aggtgcgccg 2913181 ggtggtgtgc tggaagccat gggtgttctc ctcgaccttg tcgaacacga tgacggtgtc 2913241 atagagcgag aacccgagaa tggtcagcag gccgatgacc gtggccgggg tgacttcgaa 2913301 acccaccagg gaatacacgc cggcggtgac ggtcaggtcg aagagcatgg ccgttatcgc 2913361 cgagatggtc atgtagcgct cgtagcgcac ggtaatgtag agggcgacca gcaccagaaa 2913421 caccaccagc gcgatcaccg ccttcttggt gatctgaccg ccccaggtct ccgacaccgc 2913481 cgagtcgctg atggcctgct tgctgggctg accgtcggtt cccttgggcc cgaaggcctc 2913541 gaatagggcg tcccgcagct tggccgtctg gtcgctggtc agcgtctccg aacgaatctg 2913601 caccgtcgcc gaagcaccgg ccccgacgat caccaccgac tggggctcac tgccgagggc 2913661 ccggtagtag acgtcttcga cctgcgcgac ttgggtgctg ccacgcggga acgacaccgt 2913721 ggtaccgcct ttgaaatcga tgccgaaggt gaacccacga aagacgatgc tggcgatggc 2913781 caccgcgacg atcgcaccgc tcacgccaaa ccacaaccgg cggcgtccca ctacctcaaa 2913841 cgccccggtg ccggtgtaca ggcgcgaaag gaagctatgg tgccccagct tcgaggcggt 2913901 gtctgtggtg ctgtcgccgt cggtccgcgc cacagcactc tcggtggcct cggtgagttc 2913961 gaccgccgac gtggcttcgt cgtcgcggcc ggtctttgct ttcgacgcca tcggctatcc 2914021 ccgtcccgtc cgagccatgg cccggcgttc gcgtgcgacc tgctgcaccg ctcccaggcc 2914081 gttgtatgcc ggcttggcca gcagcgacga tttggacgcc agatacacca acggccacgt 2914141 caccaagaac accacgacga ggtccaggat cgtggtgagg cccagggtga acgcgaaccc 2914201 cttcacctga ccgatcgcca gaaagtacag cacggcagcg gccaggaaag tgacggcgtt 2914261 gcccgacacg atcgtcttgc gggcacgcgc ccaaccgcgc ggcactgccg accggaacga 2914321 acggccttcg cggatctcgt ctttgatgcg ttcgaagaac accacgaacg agtcggcggt 2914381 ggtcccgata ccgatgatca ggcccgcaat accagccaga tctagggtgt agttgatata 2914441 tcggcccaag agcaccagga tcgcaaaaac cattgagcca gaagccacta gcgacaaggc 2914501 cgtgagcagt cccagcactc ggtagtagag cagcgaatac accagcacca acagcaggcc 2914561 gatcgcaccc gcgatcatgc ccgcgcgcag cgatgacaac cccaaggtcg ccgaaaccgt 2914621 ttgggcttcc gacggttcga aggacagcgg cagcgacccg tacttgagga cgttggcgag 2914681 ctggcgtgcg gtcgccgcgg tgaatggcgg atccccaccg ctgatctggg ttcggccgcc 2914741 ggggatcgct tcctggatct gcggtgcact gacaacctgc gagtccaggg tgaacgccgt 2914801 ctgggtgccg atatgggcgg cggtgtagtc ggcccagatg ttggccgccg gacccttgaa 2914861 ctgcaggtcg acgacgtagc cgatgccgcg ctggtccata cccgaggtgg cgttttggat 2914921 ctggtcgccg ctgatgatcg acggcgccag caggtacgcg gtcttgtggt cggtcgagca 2914981 ggtcaccaac ggcagtttcg ggtcgtcgtt gccggccaaa atgtcgtcgc tctcgcagcg 2915041 ggtcgcctgg aattgcagtg caaccatctg catgtattgg ttggtgctct gccgcagctt 2915101 cttctcctgg gcgatgcgct cggcgagatc cttgcgcgga tccgtggccg gcgcctcagc 2915161 gggcggcgcc ggcggcgggc tggccggtga ggtcgggttg ggcgatggcg ccgggtcctg 2915221 cggatagggc cgcggttggg ccccaggttg cggtgaagcc ggcgcccccg attgggctgg 2915281 cggcggtgcg gcgggttgac cgggcggctg cggttcggcg ctgggtgccg gctgcggttc 2915341 ttcggctgcg ggctgcgccg gcatcgagtt gagcaccggc cggatgtaca gccgagcggt 2915401 ctgtccgagg ttgcgtgcct cgctgccgtc gttgccgggc accgtgatga ccaggttgtc 2915461 accgtcgacg accacctccg acccggacac tcccagcccg ttgacccgcg cgctgatgat 2915521 ttgctgcgcc tgtgccagcg cttcccggct cggggccgag ccgtccggtg tgcgcgcggt 2915581 cagcgtgacc ctggtgccgc cctgcaggtc aatgccgagt ttgggggcgg tgtgcttgtc 2915641 cccggtgaaa aacaccagca aatagatgcc gatcagcatc accaggaaca ccgacaggta 2915701 acgggcaggg tgcaccggcg ccgaagacga tgccacgttc cttgtatctc ctcgagaatc 2915761 agttttctac ccccgacaga gcctacgtgt cgcgccgggg cgcgtcgcgc aagcggctcg 2915821 tcggttccgg tcggccggtt gccggtcagg aatcgttggt cacccggcgc tcgccggcca 2915881 cgtcgtcaac atccttgtca aggtcctcgt tgagctcctc gtcgatgtcg tcgtccggca 2915941 gaattcggtc acgaatcgcc aacttcatcc acgtggtgac caccccgggc gcgatctcga 2916001 ggtcgatggt gtcgtcggca atggcgacga tggtggcttc cagcccagaa gtcgtgtgta 2916061 cccgctcccc gggctgcaac gagtcgtgca gatcgatggt ggcttgcatg gcccgtcgct 2916121 ggcggcgcga cgcgaagtac atgaacccac ccatgatgag caggaacggc aagaacaaaa 2916181 cgaaactctc catcaacccg tctttcgtat tggtattgcg atcacggtgc caggcctacc 2916241 cgcgggccgc gcacctggta acagtccagt gtgcccgtcc agtctggcag gccggaaaca 2916301 tcggtcagca gataggcttt accagcgatg tgaaccggcg agccgggtga ggaggatctg 2916361 tggccagcct gcagcagagt cggcgcctgg tcaccgaaat ccccggtccc gcatcgcagg 2916421 cactgactca ccgccgggcg gcggcggtgt ccagcggtgt tggggtcacc ctgccggtgt 2916481 tcgtagcccg cgccggcggc ggcatcgtgg aagacgtgga cggtaaccgg ctcatcgacc 2916541 tgggttcggg catcgcagtg acgacgatcg gcaactcgtc gccacgcgtg gtggatgcgg 2916601 tgcgcacgca ggtggccgaa tttacccaca cctgcttcat ggtgacgcca tacgaggggt 2916661 acgtggccgt cgccgagcaa ctcaaccgga ttaccccagg ttcgggcccc aagcgctcgg 2916721 tgttgttcaa ttccggcgcc gaggcagtcg agaacgccgt caagatcgca cgctcctaca 2916781 ccggcaagcc cgcggtggtg gcgttcgacc acgcctacca cggtcgcacc aacctaacga 2916841 tggcgctgac cgccaagtcg atgccctaca agagcggctt cggtccgttc gcgccggaga 2916901 tctaccgagc gccattgtct tacccctatc gggacggcct cctcgataag caactggcta 2916961 ccaatggtga gctagccgcg gcccgagcca tcggcgtcat cgacaagcag gtaggcgcga 2917021 acaacctggc cgccctcgtc atcgaaccga tccagggcga aggcggtttc atcgttccgg 2917081 ccgaagggtt cctacctgcc ctcctcgatt ggtgccgcaa gaaccatgtg gtgttcatcg 2917141 ccgacgaggt gcaaaccggc tttgcccgta ccggggcgat gttcgcctgc gagcacgagg 2917201 gccccgacgg tctagagccc gacctgatct gcacggccaa aggcatcgcc gatggattgc 2917261 cgctgtcggc ggtcaccggc cgcgccgaga tcatgaacgc cccgcacgtg ggcggcctgg 2917321 gcggcacgtt cggcggcaac ccggtggcct gtgcggccgc gctggccacc atcgcaacca 2917381 tcgaaagcga cgggctgatc gagcgggccc gccagatcga acgcctggtg accgaccggt 2917441 tgacgacgct gcaggccgtc gacgaccgga tcggcgacgt gcgtggtcgc ggcgccatga 2917501 tcgccgtaga gctggtcaaa tccggaacca ccgagcccga cgccgggctg accgagcggc 2917561 tggcgaccgc ggcccacgcc gccggcgtca tcattttgac ctgcggcatg ttcggcaaca 2917621 tcatccggct actgccgccg ctgaccatcg gcgacgagct gctgagtgag gggctggaca 2917681 tcgtgtgcgc gatcttggcc gacctctgac ggcctgccgg ccccgactgc gtcatcccgt 2917741 gccgcatctc acagccgatc agcagcaggc ttgcattgtg taatatattt actttagcta 2917801 acgttctatt ggtcgggcgc agcgccgcgc cgtcgatttc ccaccctttc cggcacgccg 2917861 aggtgaccgc atgtcgatca acgatcagcg actgacacgc cgcgtcgagg acctatacgc 2917921 cagcgacgcc cagttcgccg ccgccagtcc caacgaggcg atcacccagg cgatcgacca 2917981 gcccggggtc gcgcttccac agctcatccg tatggtcatg gagggctacg ccgatcggcc 2918041 ggcactcggc cagcgtgcgc tccgcttcgt caccgacccc gacagcggcc gcaccatggt 2918101 cgagctactg ccgcggttcg agaccatcac ctaccgcgaa ctgtgggccc gcgccggcac 2918161 attggccacc gcgttgagcg ctgagcccgc gatccggccg ggcgaccggg tttgcgtgct 2918221 gggcttcaac agcgtcgact acacaaccat cgacatcgcg ctgatccggt tgggcgccgt 2918281 gtcggttcca ctgcagacca gtgcgccggt caccgggttg cgcccgatcg tcaccgagac 2918341 cgagccgacg atgatcgcca ccagcatcga caatcttggc gacgccgtcg aagtgctggc 2918401 cggtcacgcc ccggcccggc tggtcgtatt cgattaccac ggcaaggttg acacccaccg 2918461 cgaggccgtc gaagccgccc gagctcggtt ggccggctcg gtgaccatcg acacacttgc 2918521 cgaactgatc gaacgcggca gggcgctgcc ggccacaccc attgccgaca gcgccgacga 2918581 cgcgctggcg ctgctgattt acacctcggg tagtaccggc gcacccaaag gcgccatgta 2918641 tcgcgagagc caggtgatga gcttctggcg caagtcgagt ggctggttcg agccgagcgg 2918701 ttacccctcg atcacgctga acttcatgcc gatgagccac gtcgggggcc gtcaggtgct 2918761 ctacgggacg ctttccaacg gcggtaccgc ctacttcgtc gccaagagcg acctgtcgac 2918821 gctgttcgag gacctcgccc tggtgcggcc cacagaattg tgcttcgtgc cgcgcatctg 2918881 ggacatggtg ttcgcagagt tccacagcga ggtcgaccgc cgcttggtgg acggcgccga 2918941 tcgagcggcg ctggaagcgc aggtgaaggc cgagctgcgg gagaacgtgc tcggcggacg 2919001 gtttgtcatg gcgctgaccg gttccgcgcc gatctccgct gagatgacgg cgtgggtcga 2919061 gtccctgctg gccgacgtgc atttggtgga gggttacggc tccaccgagg ccgggatggt 2919121 cctgaacgac ggcatggtgc ggcgccccgc ggtgatcgac tacaagctgg tcgacgtgcc 2919181 cgagctgggc tacttcggca ccgatcagcc ctacccccgg ggcgagctgc tggtcaagac 2919241 gcaaaccatg ttccccggct actaccagcg cccggatgtc accgccgagg tgttcgaccc 2919301 cgacggcttc taccggaccg gggacatcat ggccaaagta ggccccgacc agttcgtcta 2919361 cctcgaccgc cgcaacaacg tgctaaagct ctcccagggc gagttcatcg ccgtgtcgaa 2919421 gctcgaggcg gtgttcggcg acagcccgct ggtccgacag atcttcatct acggcaacag 2919481 tgcccgggcc tacccgctgg cggtggttgt cccgtccggg gacgcgcttt ctcgccatgg 2919541 catcgagaat ctcaagcccg tgatcagcga gtccctgcag gaggtagcga gggcggccgg 2919601 cctgcaatcc tacgagattc cacgcgactt catcatcgaa accacgccgt tcaccctgga 2919661 gaacggcctg ctcaccggca tccgcaagct ggcacgcccg cagttgaaga agttctatgg 2919721 cgaacgtctc gagcggctct ataccgagct ggccgatagc caatccaacg agctgcgcga 2919781 gctgcggcaa agcggtcccg atgcgccggt gcttccgacg ctgtgccgtg ccgcggctgc 2919841 gttgctgggc tctaccgctg cggatgtgcg gccggacgcg cacttcgccg acctgggtgg 2919901 tgactcgctc tcggcgctgt cgttggccaa cctgctgcac gagatcttcg gcgtcgacgt 2919961 gccggtgggt gtcattgtca gcccggcaag cgacctgcgg gccctggccg accacatcga 2920021 agcagcgcgc accggcgtca ggcgacccag cttcgcctcg atacacggtc gctccgcgac 2920081 ggaagtgcac gccagcgacc tcacgctgga caagttcatc gacgctgcca ccctggccgc 2920141 agccccgaac ctgccggcac cgagcgccca agtgcgcacc gtactgctga ccggcgccac 2920201 cggctttttg ggtcgctacc tggcgctgga atggctcgac cgcatggacc tggtcaacgg 2920261 caagctgatc tgcctggtcc gcgccagatc cgacgaggaa gcacaagccc ggctggacgc 2920321 gacgttcgat agcggcgacc cgtatttggt gcggcactac cgcgaattgg gcgccggccg 2920381 cctcgaggtg ctcgccggcg acaagggcga ggccgacctg ggcctggacc gggtcacctg 2920441 gcagcggcta gccgacacgg tggacctgat cgtggacccc gcggccctgg tcaaccacgt 2920501 gctgccgtat agccagctgt tcggcccaaa cgcggcgggc accgccgagt tgcttcggct 2920561 ggcgctgacc ggcaagcgca agccatacat ctacacctcg acgatcgccg tgggcgagca 2920621 gatcccgccg gaggcgttca ccgaggacgc cgacatccgg gccatcagcc cgacccgcag 2920681 gatcgacgac agctacgcca acggctacgc gaacagcaag tgggccggcg aggtgctgct 2920741 gcgcgaagct cacgagcagt gcggcctgcc ggtgacggtc ttccgctgcg acatgatcct 2920801 ggccgacacc agctataccg gtcagctcaa cctgccggac atgttcaccc ggctgatgct 2920861 gagcctggcc gctaccggca tcgcacccgg ttcgttctat gagctggatg cgcacggcaa 2920921 tcggcaacgc gcccactatg acggcttgcc ggtcgaattc gtcgcagaag ccatttgcac 2920981 ccttgggaca catagcccgg accgttttgt cacctaccac gtgatgaacc cctacgacga 2921041 cggcatcggg ctggacgagt tcgtcgactg gctcaactcc ccaactagcg ggtccggttg 2921101 cacgatccag cggatcgccg actacggcga gtggctgcag cggttcgaga cttcgctgcg 2921161 tgccttgccg gatcgccagc gccacgcctc gctgctgccc ttgctgcaca actaccgaga 2921221 gcctgcaaag ccgatatgcg ggtcaatcgc gcccaccgac cagttccgcg ctgccgtcca 2921281 agaagcgaaa atcggtccgg acaaagacat tccgcacctc acggcggcga tcatcgcgaa 2921341 gtacatcagc aacctgcgac tgctcgggct gctgtgatcg ggcctggccg ccgcggcgcc 2921401 gggtaaccaa gcagcccgtt acgcccagtt cgcctatgag aaggcagtaa gaagcgcgaa 2921461 aaatggcaga ccccgacgga ggccctctga aagagtcttg atcatcaggg cgcgtgacat 2921521 gtgtcacatg acgggttggg agggtggctg atgtcgtttg tcacggcagc tccagagatg 2921581 ctggcgacgg cggcgcagaa tgtcgcgaat atcggcacat cgctgagtgc ggcaaacgcg 2921641 acggcagcgg cgtccacgac ctcggtgctg gcggccggag ccgacgaggt atcgcaggct 2921701 atcgcaaggc tgttcagtga ttacgccacg cactatcagt cgctgaacgc tcaagccgcg 2921761 gcatttcatc acagcttcgt gcaaacgttg aacgccgccg gtggcgccta ttcgagcgcc 2921821 gaggcggcca acgcttcggc gcaggcgttg gaacagaatc tgttggccgt gatcaatgcg 2921881 cccgcccagg cgttgttcgg gcgtcccctg atcggcaatg gcgcgaatgg aacagcggcc 2921941 agccccaacg gcggtgatgg tgggattttg tacggcaacg gcggcaacgg cttctcccaa 2922001 acgaccgccg gggtggccgg cggcgccggt ggttccgcgg gcctgatcgg caacggcggc 2922061 aatggtggcg ccggtggggc cggtgctgcc ggcggggccg gcggcgccgg cggatggctg 2922121 ctcggcaacg gtggcgccgg cggtcccggc ggcccaacgg acgttcctgc cggcacaggt 2922181 ggagccggcg gggccggcgg cgacgcccca ttgatcggct ggggcggcaa cggcgggccc 2922241 ggcggtttcg ctgcttttgg aaacggtggg gccggcggca acggcggcgc cagcggttcg 2922301 ctctttggcg tcggcggcgc cggcggcgtc ggcggatcga gcgaagacgt cggcggcacc 2922361 ggcggggccg gcggcgctgg ccgcggtcta ttccttggcc tgggcggtga tggcggcgcc 2922421 ggcggcacca gcaacaacaa cggcggtgac ggtggcgccg gcggcaccgc gggaggtcga 2922481 ttgttcagcc tgggcggtga cggtggcaac ggtggtgccg gtaccgcaat cggatccaac 2922541 gccggtgacg gtggcgccgg cggtgacagc agcgccctga tcggctacgc ccagggcggc 2922601 tccggcggcc tcggcggctt cggcgaaagt accggcggcg acggcggcct gggcggcgcc 2922661 ggcgctgtgc tcatcggcac gggcgtcggc ggtttcggcg gcctcggtgg cggctccaac 2922721 ggcaccgggg gcgcgggcgg cgcgggcggc acgggcgcca cgctgatcgg cctgggcgcc 2922781 ggcggcggcg gcggcatcgg cgggttcgcc gtcaacgtgg gcaacggcgt cggcggtctg 2922841 ggcggccagg gcggccaggg cgccgcgctg atcggcctgg gcgccggcgg tgccggcggt 2922901 gccggcggcg ccacagtcgt tggacttggt ggcaatggcg gtgacggcgg tgacggtggc 2922961 ggcctgttta gtatcggcgt cggtggggac ggcggcaacg ccggcaacgg cgccatgcct 2923021 gccaatggcg gcaacggcgg caacgccggg gtcattgcca acggctcctt tgccccgtcg 2923081 ttcgtcggct tcggcggcaa cggcggcaac ggcgtcaatg gcggcaccgg cggcagcggc 2923141 gggatccttt ttggcgccaa cggcgcgaac ggaccgtcgt agcgggtcct ccagcgcact 2923201 actcgaacaa ccccggttga ctcgctccga ccggtggcgt catgcccagg tgcgtccagg 2923261 ccagggcggt ggccacccgg ccgcgcgggg tgcgcgcgac catacccgcg cgcaccagaa 2923321 atggttcgca cacctcctcg accgtggcgg cctcctcccc gaccgccacc gccagcgtcg 2923381 acacacccac tggaccaccg ccgaagctgc gggtcagcgc cgagagcacc gctcggtcca 2923441 gccggtccag acccagctcg tcgacgtcgt agacctccag tgcggccttg gcgacgtcgc 2923501 gggtgatgac gccgtcggcg cgcacctcgg cgaagtcacg cacccggcgc aacaaccggt 2923561 tggcgatccg cggcgttccc cgagaacggc gggcgatttc ggcgccggcg tcggcgccca 2923621 gctcgatacc cagaattccg gcggagcggg ccagcacccg ctccagctcg gcgggctcgt 2923681 agaaatccat gtgcgcggtg aagccgaacc ggtcgcgcag cgggccggtc aacgcgcccg 2923741 accgggtagt cgccccgacc agggtgaacg gcgcgacctc cagcggaatc gacgtggccc 2923801 caggaccttt gccgaccacc acatcgacgc ggaagtcttc catcgccaga tacagcatct 2923861 cctcggcggg ccgggcgatg cggtggatct cgtcgataaa caacacgtcg tgctcgacca 2923921 ggttggacag catcgccgcc aggtcaccgg cgcgttccaa cgccggcccc gacgtcaccc 2923981 gcagcgagga ccccagctcg gcggcgatga tcatcgccaa cgacgtcttg cccaagcccg 2924041 gcggaccgga cagcagaatg tgatccggtg tgccgccgcg gtttttggct ccctcgatga 2924101 ccagctgcag ctgttcgcgg acccggggct ggccgatgaa ttcgcgtaac gagcgcggcc 2924161 gcaggctgac gtcgatgtcg ccctctccga cggtgagtgc gggcgaaacg tcgcggtcgg 2924221 accgctcggt catcgggcct tccccagcaa cgacaaggca gaccgcagcg cgctggatgt 2924281 cgtcgcgtca tggttggcgg ccagcaccgt atcggtggcc tcctcggcct gtttggccgc 2924341 aaagcccagg ccgaccagag cctcgaccac gggactgcgc accgcgtggc cgttggtcga 2924401 gagtgcgccg ccggtggctg ccaccccaac cttgtcgcgt agttccaaca ccatgcgttc 2924461 ggcgccccgc ttgccgatcc caggcacccg ggtcagggcg gcgacgttgc cgtcggccag 2924521 cacctgccgt agcgccggag cgtcgtgcac ggccagtgcc gccatcgcca gccggggccc 2924581 aacgccggag accgacagca gcgtcaggaa taggtcgcgg gtttccccgt cgggaaaccc 2924641 gtacagcgtc atcgagtcct cgcgcacaat catcgcggtg atcagccggg cctcggtgcc 2924701 ttgccgcaac gtcgccagcg tcgccggtgt cgcgttcact cggtagccca caccggcggc 2924761 ctcgatcacc acatggtcaa gcgccacctc gagcacctca ccgcggaccg aggcgatcat 2924821 cgggcggcct tcagcttggc taggtacgca tgacgctgct gcgctgctcg tgcttccgcc 2924881 ctcgacgtgg cctcagccat ccgggcgatc gtcggcgccc gccaacagtg acagatcgcc 2924941 agcgccaaag cgtcggccgc gtcggccggt gtcggtttag cttgcagcgc aaggattttg 2925001 gtgaccatcg cggtgacctg agccttgtct gcggaaccgt tgccagtgac cgccgccttg 2925061 acctcgctgg gggtatggaa atgcacgtcg acaccacgtt tggccgccgc cagggcgatc 2925121 acgccgccgg cctgcgcggt gcccatcacc gtggtcacgt tgagctgaga gaacacccgt 2925181 tcgatagcca ccacctccgg atgatgggtg tccagccagt gctcgacggc atcgctgatg 2925241 gccaacaggc gctgcgccaa ggccgcatcc gacggtgtgc gcaccacgtc gacatccagc 2925301 gcggtgagct gccgaccacg cccactctcg ataagcgaca gcccgcatcg ggtcaacccg 2925361 ggatcgacac ccatcacccg caccgcacgc tccctcagcc atttccgaac aatcgttcga 2925421 tacgctagcg gatcgtcccg acatcccgcg caggacacgc ctatggaacg tgcgatggta 2925481 aatttcctac catgcgaaca accatcgatg tcgcaggacg tctggtgatt cccaagcgga 2925541 ttcgcgagcg ccttggcttg cgcgggaacg accaggtgga gatcaccgag cgcgatgggc 2925601 gcatcgagat tgagccggcc ccgaccggtg tcgaactcgt tcgggaaggc tcggttctcg 2925661 tcgcacggcc agaacgtccc ctgcccccgt tgaccgacga aatcgttcgg gaaacgctcg 2925721 atcgcacacg gcggtgatcg caccagacac cagcgtgctg gttgccggat tcgcgacctg 2925781 gcacgaaggg cacgaggccg ccgtgcgcgc gctcaaccgt ggcgtccatc tgatcgcgca 2925841 cgcggctgtg gaaacctatt cggtcttgac ccggctacca ccgccgcatc gtattgcccc 2925901 tgttgccgtc cacgcctact tggcggacat cacctccagc aactacctgg cactggatgc 2925961 ctgctcatat cgcggcttga ccgaccacct cgccgagcac gatgtcaccg gtggcgcaac 2926021 ctacgatgcc ctggtcggct tcacggcgaa agctgccggc gcaaagctgc tgactcgcga 2926081 cctgcgcgcg gtcgaaacgt acgagcgatt gcgggtcgag gttgagctgg tgacctgaga 2926141 aaccgttgcc gttgagtgtg tttgagttgc acgctcaccg acacccggat ggtgcaccag 2926201 tgagctgggg tgaccgcggc cgagacctgc cgggttcccg gccggacaac tcgcccgttg 2926261 tgacccccgg tcccgcgaaa gctgttacgt taaacggcgc catcgatatg cgaccgatcg 2926321 accaaccgcg gcgcagcggt acgagagggt atgcgtggga aatctgctgg tcgtgattgc 2926381 cgtggcgctg ttcatcgccg ccatcgtcgt tctcgtcgtg gccatccggc ggcccaaaac 2926441 accagccacg ccgggcgggc gccgggatcc gctggccttc gacgcaatgc cgcaattcgg 2926501 cccccgccaa ctcggacccg gcgcaattgt cagccacggt ggcatcgact atgtggtccg 2926561 cggatcagtc acctttcgcg agggtccctt cgtgtggtgg gaacacttgc tggaaggcgg 2926621 cgacacgcca acctggctga gcgtgcaaga ggacgacggg cgtctcgagc ttgcgatgtg 2926681 ggtgaaacgc accgatctgg gcttgcagcc cggtggccag cacgtgatcg acggcgtgac 2926741 gtttcaggag accgagcgcg gtcacgccgg atataccacc gagggcacga cgggcctgcc 2926801 ggccggcggt gagatggact acgtcgactg cgccagtgcc ggtcaggggg ccgacgagtc 2926861 catgctgctg tcattcgagc gctgggcacc ggacatggga tgggagatag cgaccggcaa 2926921 gtccgtactg gccggcgagc tcaccgtcta ccccgcgccc ccagtctcgg catagggccg 2926981 aatcggtgcc acttcatcag ctcgccatag cgccggtgga cgtatcaggg gcattgcttg 2927041 gactcgtgct gaacgcaccc gcgccgcggc cactggccac ccaccgactg gcccacaccg 2927101 acggcagcgc actgcagctc ggcgtcctcg gcgcgtcgca tgtcgtcacc gtcgagggac 2927161 gcttctgcga ggaagtctcc tgcgtggccc gcagccgggg cggcgatctg cccgagtcca 2927221 cccacgcacc cggctaccac ctccaatccc ataccgagac gcacgacgag gcggcgtttc 2927281 ggcgactcgc gcgccacctg cgtgaacgct gcacgcgggc aaccgggtgg ctgggcggtg 2927341 tgtttcccgg tgatgacgcc gcgctgaccg cactcgccgc cgaacccgat ggaaccgggt 2927401 ggcgttggcg gacttggcat ctgtacccga gcgcgtccgg cgggacggtg gtccacacga 2927461 cgagccgatg gcgtccatga gccgcaaccg cctgttcctg gttgccggca gcttggcggt 2927521 tgccgccgcc gtgtccttga tctctggaat cacgctgctg aacagggacg ttggctcgta 2927581 tatcgcctcg cactatcgcc aagaatcccg tgacgtgaac ggaacgcgat acctgtgcac 2927641 cggatcgccc aaacaggtgg ccaccacgct cgtcaagtac cagaccccgg cggcgcgcgc 2927701 gtcgcatacc gacaccgagt acctgcgtta ccgcaacaac atcgtgacgg tcggacccga 2927761 cggcacctat ccgtgcatca tccgcgtcga aaacctcagc gccggatata accacggcgc 2927821 atatgtcttc ctgggccctg gattcacccc tgggtccccg tcgggcggtt cggggggcag 2927881 cccgggcggt cctggcggca gcaagtaagg cgatgacgca aaggagagag tcatgtatta 2927941 ggccggagtc gatttcggga ccatcagcct taccccgatc ctgcatgggg tggtggccac 2928001 cgtcttgtac ttcctagtgg gcgccgccgt gctagtcgca ggctttctga tggtcaacct 2928061 gttgaccccg ggcgatctgc gtcgcctagt gttcatcgac cgccgcccca acgccgtggt 2928121 tctggccgcc acaatgtatg tggcgctggc catcgtcacc atcgccgcca tctacgccag 2928181 ctccaatcag ctggcccagg gcctgatcgg cgtggcggtg tacggaatcg tcggtgtcgc 2928241 gctgcagggg gtggcactgg tgatcctcga gatcgcggtg ccggggcgat tccgtgagca 2928301 catcgacgca cctgcgctgc atccggcggt gttcgctacc gccgtcatgc tgctggcggt 2928361 agcgggggta atcgccgccg cgttgtcatg acgtccaccc ggcaggcggg cgaagccacc 2928421 gaagcttcgg tacggtggcg ggccgtgctg ctggccgcgg tcgcggcgtg cgcggcctgc 2928481 ggtctcgttt acgagctcgc gctgctgaca ctggcggcga gcctgaacgg cggcgggatc 2928541 gtggccacct ccctgatcgt cgcgggctac atagccgcgc tgggagcagg cgccttgctg 2928601 atcaagccgc tacttgcaca cgcggccatc gcgttcatcg ccgtggaggc ggtgctgggc 2928661 atcatcggcg gattgtccgc ggcggcgctg tatgcggcgt tcgcgttcct ggacgagctc 2928721 gacgggtcga cgctggttct tgcggtgggc accgccctga tcggcgggct ggtcggcgcc 2928781 gaggtgccgc tgctgatgac gctgttgcag cgcggccgcg tggcaggggc cgccgatgcc 2928841 ggacgcaccc tggccaacct caacgcggcc gactatctgg gcgcgttggt cggcgggctg 2928901 gcctggccat tcctgctgct gccgcagtta gggatgatcc gcggtgcggc ggtcaccggc 2928961 atcgtcaatc tggcggccgc cggggttgtg tcgatcttcc tgctgcgcca cgtcgtgtcc 2929021 ggccggcaac tggtgaccgc cttatgcgcg ctcgccgcgg cgctcgggct gatcgccaca 2929081 ctgctggtgc attcccacga cattgagacc accggccgcc aacagctcta cgccgacccg 2929141 atcatcgcct accgacacag cgcctaccag gaaatcgtgg tcacccgccg cggcgatgac 2929201 ctgcgcctct acctggacgg aggtttgcag ttctgcaccc gcgacgaata ccgctacacc 2929261 gaaagcctgg tctacccggc agtctccgat ggcgcgcgtt cggtgctggt gctcggtggc 2929321 ggcgacggac tggcagcccg cgaactgctg cgccaacccg gcatcgagca gatcgtgcag 2929381 gtggaactcg accccgcggt catcgaactg gcgcgcacca ccctgcgcga cgtcaacgcc 2929441 ggttcgctgg acaacccgcg cgtacacgtc gtgatcgacg acgccatgag ctggctacgc 2929501 ggcgccgcgg tccccccggc tggcttcgac gcagtgatcg tcgaccttcg cgaccccgat 2929561 actcccgtgc tgggtcggct gtattccacc gagttctacg cactcgccgc ccgcgcgctc 2929621 gcgcccggcg ggctcatggt cgtgcaggca ggcagcccgt attcgacccc gactgcgttc 2929681 tggcgcatca tctccacgat ccggtccgcc gggtatgccg tcacgcccta ccacgtgcac 2929741 gtgcccacct tcggcgactg gggattcgcc ctggcacgcc ttacagacat cgcgcccacc 2929801 cccgctgtgc cgagcactgc ccctgcactg cgcttcctgg accaacaggt gctcgaggcc 2929861 gcgaccgtgt tttccggcga catccggccc cgcacgttgg acccgtcgac cctggacaat 2929921 ccgcacattg ttgaggacat gcggcacggc tgggactagc gcacccatct agggcggcca 2929981 gggtttgcac aacgcagcac gggttccgaa cggaaccggg gcccgctcgt agcccggcca 2930041 taaaagcata aaaacagtat gctgggtaaa tgaagaccac gctcgacctg cctgatgaac 2930101 tgatgcgcgc tatcaaggtc cgcgcggcgc agcagggccg caagatgaaa gatgtcgtga 2930161 ccgaactgct cagatccggt ctgtcccaga cgcacagcgg ggctccaatc ccaacgccgc 2930221 ggcgcgtgca gcttcccctg gtgcattgcg gtggcgcggc tacccgcgaa caagaaatga 2930281 cgccggagcg tgttgccgcg gccttgctcg accaggaggc ccagtggtgg tccggacacg 2930341 acgatgctgc tctgtgacac caacatctgg ctggcgttgg cgctttccgg acacgtgcac 2930401 cacagggcct cgcgcgcatg gctagacacc atcaacgcgc ccggagtcat ccacttttgc 2930461 cgcgcaaccc aacagtcgct ccttcggctg ttgacgaatc ggacggtgct gggcgcgtat 2930521 ggcagcccac cactgaccaa ccgcgaagcg tgggcggcct atgccgcgtt cctggatgac 2930581 gaccgcatcg tgctggccgg cgccgaacct gatggtttgg aggcccagtg gagagccttc 2930641 gccgttcgcc agtcgccggc gcccaaggtt tggatggatg cctacctagc tgctttcgca 2930701 cttaccggtg gattcgagtt ggtgacgact gacaccgcct tcacccagta cggcggaatc 2930761 gagctgcggc tcctggccaa gtgacagcgc aagccccgca gtgctcactc gtcgtcgagg 2930821 gcggccagca cctcgtcgga cacgtcgacg ttggtccaca cgttctgcac gtcgtcactg 2930881 tcttctagcg cgtcgacgag cttgaacact ttccgtgcgc cgtccaggtc cacgggcacg 2930941 ctgaccgagg gttgaaagct ggcctcggcc gattcgtaat cgatgccggc atcttgcaaa 2931001 gcgctacgaa ccgcgaccag ttccgcgggc tcggagatga cctcgaaact gtcgcccagg 2931061 tcgttgacgt cctcggcacc ggcttccaga acagccgcca gcacatcgtc ttcggtcaag 2931121 ccgttctttt ccagggtcac cacgcctttg cgggagaaca ggtaggacac cgaccccgga 2931181 tcggccatgg tgccaccatt gcgcgtcatc gccacccgca cctcgctggc ggcgcgattg 2931241 cggttgtcgg tcagacactc gatcagcacc gccaccccgt tgggcgcgta gccctcgtac 2931301 atgatggtct gccagtcggc gccgccggcc tcctcgccgg cgccgcgctt gcgggcccgt 2931361 tcgatgttct cgttgggaac cgagctcttc ttcgccttct gaatcgcgtc gtagagcgtg 2931421 gggttgccgg ccggatcacc gccaccgaca cgcgccgcca cctcgatgtt cttgatcagc 2931481 cgggcgaaca tcttgccgcg gcgggcgtcg acgacggcct tcttgtgctt ggtggtggcc 2931541 cacttggaat ggccgctcat cgcagtgatt tacctcttct gttgctcgtt cgccagacga 2931601 gtctacgtgg gggttgtggg cggcgagcca accggcacga gcagacacaa aagctccaaa 2931661 tttcggcctg aaacgggtgc ttttgcgact gctcacgccg cggaggtgac gatgtcgacg 2931721 aacaactgat gaatgcggcg atcgccggtc atctccggat gaaacgcggt ggcaagcacc 2931781 gcaccctggc gcaccgcgac gatgtgcccc gccgcgcggg ccagcacctg cacaccgtca 2931841 ccgactcgct caacccatgg cgcccggatg aacaccgcgc gcaccggatc gtctagacca 2931901 gcgaactcga tatcgccttc aaacgagtca acctgacttc caaaagcatt gcgccgcacc 2931961 gtcatattca tcgcacgcag gggcagcgcc tggcggcctg ccgcaccggc gtccaggatc 2932021 tcgctggcca acagaatcat gcccgcgcac gaaccatagg ccggaagccc atcggcgagc 2932081 cgggcccgca gcggtcccag caggtcgagg tcgagcagca ggtggctcat cgtggtggat 2932141 tccccgcccg ggatgaccag cgcgtccacc gcgtcaagtt cgtcgcggcg ccgcaccgtc 2932201 atcggctcgg ccccgcattc gcgcagcgca gccaggtgct cccgggtgtc gccctgcagc 2932261 gccagcaccc cgacccgtgg aacgctcaca gcccgctcac tgccccaccg accggtgacc 2932321 gcgccggtgg cgggtcagcc cctcctgcat gaccgccgcg accatctccc cggaccgtgt 2932381 gaaaatctcg ccccgagtca gcgcacgacc gccgctggcc gacggcgacg actggtcgta 2932441 cagcaaccac tcgtcggcgc ggaagggtcg catgaaccac atcgcatggt ccagcgatgc 2932501 cacctgcagc tggtcgcgca catcgaggtg gttgacttgt gccgatccca gcagcgtgag 2932561 gtcgctcatg taggcgagtg cacagatgtg caacaccggg tcgtcgggca acgggtcacg 2932621 gtggcgaagc cacacctgct gctgggaagc cttgcccggc aaaagccgca ggcgctcccg 2932681 gggcacgatg cacacgtccc actcgtcgaa ctgccggaac ccggcatcat cgaaaacctt 2932741 gatcgagttc aaccccggca ggccgtcggg cggcggcgcc gctggcataa cgtcttggtg 2932801 ggtaatgccc tcctgttcgg tctggaacga cgccgccatg ctgaatatgg tttccccgtg 2932861 ctggactgcg ttgacccgcc tggtgcagaa cgatccaccg tcgcggatgc gttcgaccag 2932921 aaaaaccgtg cgctccttgg catctccagg ccgaagaaaa tagccgtgca gcgagtgcac 2932981 catgtaccgc gggtcgacgg tgcgcaccgc cgacaccagc gactggccgg ctacatgacc 2933041 accgaaagtg cgttgcagga agcccgattc ggggctgaac acgcttcctc ggtagatgtt 2933101 gacctcaagt tgctcaagat caaggatctc ttcgatcgac acgcgatgac cgtctgctcg 2933161 tcgcgggttc tcaccagccg cgctgggcga gccgatgacc gacagcgatc tcgtccacgt 2933221 tgatgcccac catcgcctcg cccagcccgc gcgacacctt ggccagcaca tcgggatcgt 2933281 cgaagaacgt ggtggccttg acgatcgcgg cggcgcggtg ctcaggggcg ccggacttga 2933341 aaataccgga acccacgaag acgccctcgg cgccaagctg catcatcatc gccgcgtcgg 2933401 cgggcgtggc gatacccccg gcggtgaaca gtgtgaccgg caacttgccc gcccgagcta 2933461 cctcggcaac gagttcatag ggcgcttgca attcttttgc cgcgacaaac aattcgtcct 2933521 ccgacatcga cgtcaaccgg cggatctcac caccgatggc ccgcatgtgt gtggtcgcgt 2933581 tggagacgtc tccggtcccg gcctcgccct tggaccggat catggccgct ccctcgctga 2933641 tgcgcctcaa cgcctcaccg agattggtcg ccccacacac gaaaggcacc gtgaagttcc 2933701 acttgtcgat atggtgggcg tagtcagcgg gcgtcagcac ctcggactcg tcgatgtagt 2933761 cgacgcccaa cgtctgcagg atctgcgcct cgacaaagtg gccgatgcgc actttagcca 2933821 tcaccgggat ggtgaccgcg gcgatgatgc cctcgatcat gtcggggtca ctcatccgcg 2933881 acaccccgcc ctgggcgcgg atatcggcgg gcaccctttc caacgccatt accgcaaccg 2933941 caccggcgcc ctcggcgatg cgggcctgct ccggggtgac aacgtccatg atgacgccgc 2934001 ccttgagcat ctcggccatg ccgcgcttga cccgcgccgt accggtcgct gggttacctg 2934061 caggatccat ggtgcctcct cttgtcccca ctacgatacg accgctaccg cgccggtctg 2934121 ctagccactc aggggcgtgg ccaggacgcc gattggtaaa ttacgaatcc ctcagccgtg 2934181 cagcaccgga ggccggaatg gacgatgacg cccaaatggt cgcgatcgat aaagaccaat 2934241 tggcaaggat gcgtggcgaa tacggcccgg agaaggatgg ctgcggagat ctggacttcg 2934301 actggctcga cgacggctgg ctcacgctgc tgcggcgctg gttgaacgat gcacaacgcg 2934361 ccggagtgag tgaaccgaac gcgatggtgc tcgccaccgt tgccgacgga aaaccggtga 2934421 cccgttcggt actttgcaaa atcctggacg agtccggtgt cgcgttcttt accagctaca 2934481 cctccgccaa aggcgagcag ctcgccgtga caccatacgc atcggcaacc tttccctggt 2934541 accagctagg tcgccaggca cacgtacagg gcccagtcag caaggtcagc accgaggaga 2934601 tattcacgta ttggtccatg cgcccccggg gcgcgcagct gggtgcgtgg gcctcgcagc 2934661 agtcgcgccc ggtcggttct cgcgcccagc tcgataacca gctcgccgag gtgacgcgtc 2934721 gcttcgccga ccaggaccag atcccggtgc ccccaggatg gggcggctac cgcatcgctc 2934781 cggaaatcgt ggaattctgg cagggccggg agaaccgcat gcacaaccga atccgcgtcg 2934841 ccaatggccg gctggaacgg ttgcaaccct gatcgtcgag tctggccacc tcgcgggcga 2934901 agtttgacgg aacctcgcag atcttgccgg acatgccata gagtctttga ccggaatgcc 2934961 cgctgacccg tgacgacgcg gtcaccgggg atacccgccg cggtggtggc caaccgataa 2935021 cggccaaccg agaaagtaca cagcgatgaa tttcgccgtt ttgccgccgg aggtgaattc 2935081 ggcgcgcata ttcgccggtg cgggcctggg cccaatgctg gcggcggcgt cggcctggga 2935141 cgggttggcc gaggagttgc atgccgcggc gggctcgttc gcgtcggtga ccaccgggtt 2935201 ggcgggcgac gcgtggcatg gtccggcgtc gctggcgatg acccgcgcgg ccagcccgta 2935261 tgtggggtgg ttgaacacgg cggcgggtca ggccgcgcag gcggccggcc aggcgcggct 2935321 agcggcgagc gcgttcgagg cgacgctggc ggccaccgtg tctccagcga tggtcgcggc 2935381 caaccggaca cggctggcgt cgctggtggc agccaacttg ctgggccaga acgccccggc 2935441 gatcgcggcc gcggaggctg aatacgagca gatatgggcc caggacgtgg ccgcgatgtt 2935501 cggctatcac tccgccgcgt cggcggtggc cacgcagctg gcgcctattc aagagggttt 2935561 gcagcagcag ctgcaaaacg tgctggccca gttggctagc gggaacctgg gcagcggaaa 2935621 tgtgggcgtc ggcaacatcg gcaacgacaa cattggcaac gcaaacatcg gcttcggaaa 2935681 tcgaggcgac gccaacatcg gcatcgggaa tatcggcgac agaaacctcg gcattgggaa 2935741 caccggcaat tggaatatcg gcatcggcat caccggcaac ggacaaatcg gcttcggcaa 2935801 gcctgccaac cccgacgtct tggtggtggg caacggcggc ccgggagtaa ccgcgttggt 2935861 catgggcggc accgacagcc tactgccgct gcccaacatc cccttactcg agtacgctgc 2935921 gcggttcatc acccccgtgc atcccggata caccgctacg ttcctggaaa cgccatcgca 2935981 gtttttccca ttcaccgggc tgaatagcct gacctatgac gtctccgtgg cccagggcgt 2936041 aacgaatctg cacaccgcga tcatggcgca actcgcggcg ggaaacgaag tcgtcgtctt 2936101 cggcacctcc caaagcgcca cgatagccac cttcgaaatg cgctatctgc aatccctgcc 2936161 agcacacctg cgtccgggtc tcgacgaatt gtcctttacg ttgaccggca atcccaaccg 2936221 gcccgacggt ggcattctta cgcgttttgg cttctccata ccgcagttgg gtttcacatt 2936281 gtccggcgcg acgcccgccg acgcctaccc caccgtcgat tacgcgttcc agtacgacgg 2936341 cgtcaacgac ttccccaaat acccgctgaa tgtcttcgcg accgccaacg cgatcgcggg 2936401 catccttttc ctgcactccg ggttgattgc gttgccgccc gatcttgcct cgggcgtggt 2936461 tcaaccggtg tcctcaccgg acgtcctgac cacctacatc ctgctgccca gccaagatct 2936521 gccgctgctg gtcccgctgc gtgctatccc cctgctggga aacccgcttg ccgacctcat 2936581 ccagccggac ttgcgggtgc tcgtcgagtt gggttatgac cgcaccgccc accaggacgt 2936641 gcccagcccg ttcggactgt ttccggacgt cgattgggcc gaggtggccg cggacctgca 2936701 gcaaggcgcc gtgcaaggcg tcaacgacgc cctgtccgga ctggggctgc cgccgccgtg 2936761 gcagccggcg ctaccccgac ttttctaagc ggtccacaaa ccgtgcacgt cagcggatgg 2936821 gctgaggaac gccggcatcg cgcgcggctc cgttgtccag cgcgacgtcc accagccggt 2936881 tggctgccgg caacagctcg cctagttgca acgggtacac ccgctcgccc gccgccacca 2936941 gctgcgcgat gtcgttcgcg tcacaccagc gggcatcgcg aatatagcgg cgttccaact 2937001 cggttcgccc ctgcacagca ggctcgaacc gacgcgtccg gtgcaccagg tagaactcct 2937061 cgctgtcgat cagcgacccg ttgaactcga agacctcgtc gcgtcgccag ataggtccga 2937121 tcatgtcggc cggggccacc cgcagaccgg tttcttcggc cagctcccgg gcggcggcct 2937181 gggccagccg ctcacccggt cgcacttggc ccccgacggt gaaccaccac ttcggcgccg 2937241 cgccgtcccg aaacgccggg ttcgccggat ccgatccgca cagcaacaac acggcaccgc 2937301 tgtcatccaa tagcaccacc cgcgccgagg tgcggcgacc ggacgcaccc tgatcgccgt 2937361 gcaccaatgc gtggggtcgc tcgacgatct cgaaataggt tggcagcaca gcggttccac 2937421 caagccgcag caatcgcacc agccgtcgtt cccccagagc gagggtgtcg cgaacggcgt 2937481 cgttgtggaa gcggcgggcc agcaggacgc gggcttccgc gtcggctaac tcggcgatca 2937541 gggccgcggg cagcgacgcg gggttgacca tcgccaacgc ggccgaaagc tcgttctccg 2937601 cattctcgcg cgcatgccgg ggcgcgccct ccgcggcgtc ggctaaggcg gccagccgac 2937661 tgccctgggg ggcaccgccg tacgcgtcga tcgccaccgc acgtgccacc accgctcgtc 2937721 gcgcgagcgc gctgtccagc gactgccacg acaagtcata gcgcacgttc aaccggttca 2937781 accggttggc cgtctgatat ccccaggcgc cgaacgcaac cagcacaacg agcagcactg 2937841 cgccggccag gaccagccac gtcatcagct ggccacctga accttggcgc ccgacccggc 2937901 gaccgtctcg tacactcgca tgatctggct ggccaccacc gaccagtcat accggcggac 2937961 ggccgcgttg ccggccgcca catagcgctc ccgcaggaca tcgttctcca gcaccgcaat 2938021 cagtccatcg gccaacgcgg cggcctgcaa gtctggcggg tccaccggca ccaggtgccc 2938081 gacctcaccg tcgcgcagca cacgccggaa ggcgtcgagg tcgctggcca ccaccgcagt 2938141 gccggcggcc atcgcttcga ccagcacaat gccgaaactc tcaccgccgg tgttgggcgc 2938201 acagtagacg tcggcgctgc gcatcgccga agcttttccg gcgtcgtcca cctgacccag 2938261 aaagcgcagg tgcgccgcca aacggcccgc ctggccgcgc aactggtcgg cgtcgccgtg 2938321 gccgacgatc agtagctgga catccggaaa ccgctgcacc accttcggca gcgcgtcgag 2938381 caaaacggcc atgcccttgc ggggctcgtc gtagcgaccc aggaacaaca ccgttttacc 2938441 ctggcgcggg tacccgtcca gccgcgctgc cgaggcgaag gaatcaacgt ccaccccatt 2938501 ggggatctcc accgcatcgg atcccaacgc ctccatctgc cagcgccggg ctaggtcgga 2938561 caccgcgatc cggccgacga tcttctcgtg catgggccgc agaatgccct ggaacaccgt 2938621 cagcgtcagc gacttggtgg tcgaggtgtg aaatgtcgcc acaatcgggc cctcggcaat 2938681 gttcagggcc agcatcgaca ggctcggcgc attcggctcg tgtagatgca gtacgtcgaa 2938741 atcaccatgc gcaagccact ttttgacctt gcggtgggtc gccggaccga accgcagccg 2938801 ggccaccgag ccgttgtagg gaatcggaac cgccctacca ccggagacaa agtaatcagg 2938861 cagtgcggca tgcggggagg ccggcgcgag cacactgacc aagtggccgc gggtgcgcat 2938921 cacctcggca agctgtagca catgcgactg caccccgccc gggacgtcga acgagtacgg 2938981 acaaatcatg ccgatccgca tcaggctttc ctcatctgga cctcagttgc gcccgccgcg 2939041 attcggataa gtcggccagc cactggggct gcagcatgtg ccaatccgcg ggatgggcgg 2939101 caatgttctg cgcgaagcgg tcggccagcg cctgtgtgat ggcagcgacg tcaccgctgg 2939161 tgcaatccag cgccggatac acctggaaac cccagccgcg gccctcgaac cagcaatgtg 2939221 tgggcagcaa tgccgcaccg gtctcgaccg ccagcttcgc cggccccacc ggcatccggg 2939281 tgggctcgcc gaagaagtcg acctcaacac cggtgcgggt gagatcgcgc tcggccatca 2939341 ggcagaccac tcggttgttc ctcagccgct cagagagcac ctcgaacggc ggccgttcgc 2939401 cgccggacag cggcagcacc tcaaatccca ggctttcgcg gtagtcgata aagcgctggt 2939461 acagcgattc gggttttagg cgctcggcga cggtggtgaa ggtgccgtgc cgctgcacca 2939521 gccacatccc ggccatatcc cagttgccgc tgtgcggcaa cgccagcacg gcaccgaggc 2939581 ccgcggccag cgccgcgtcc aggtgatcca gtccaccgat cacgcggtcg agctggcggg 2939641 ccagcttgcg gtggtttatc gtcggcagcc ggaacacctc acgccagtag cgcccgtagg 2939701 actccagcga ggcgcacatc agcgggtccg gcaccgcggc tggcggcaca cccaggacgc 2939761 gggccaggtt cttgcgcagc tgctcgggcc cgccgtggcg ggcaaagtag cgcgctccgg 2939821 tgtcgaatgc gttgcgtacg gcgaactctg gcagcgcccg tacggccatc cagccggccg 2939881 catacgccca gtcggtcgcg gtgcgcgtca cggaactgcg cggatctttg ggcagcttca 2939941 agcccttaag gccggcaatc accggtcgcc ctttccagga atcgccatcc gatcgatggc 2940001 tccgggtgaa gtccagaccg tgtgcaaccg ctgcacgcag gtgatcacgc tggcgacggc 2940061 cagcagccac atccccaccg acaacgccgg cggccagggc acaaacggga agtccgacac 2940121 cccggcgccg gtcagcacga tgatcaaccg ttccggccgt tcgatgaagc cgccgtcgcc 2940181 gcgcagcccg ctggcctccg cccgggcctt gatgtaagag atcacctgcg aggtgaccag 2940241 acagatcaag gtcgcgatca ccagcggtcg gtcgcgcatg tgaaacgcta tccaccacag 2940301 cagaccgcag aacaccgcgc cgtcactgat gcggtcacag gtggcgtcca gcaccgcgcc 2940361 gaagcgagtg ccgcccccgc gctcccgggc catcgccccg tccagcatgt cgaacaacac 2940421 gaagaaccac accacacacg cacccgcgaa cagcttgccc atcgggaaca gcgtcagcgc 2940481 tcccgccacc gacgcggtgg tgcccaggat ggtgacgacg tccggcgtga ggccgacccg 2940541 cagcagtccc ctggcgatcg gggtggtaat ccgggcgaac gccgcccggg acaggaaggg 2940601 cagcttgctc atggttgccg agcccactcg gtggcaagca gccgacgggt gtcgcgcagc 2940661 agctgcggaa tcaccttgga gcccccgatg atggtgatga aattcgcatc gccaccccac 2940721 cgtggcacca catgcacgtg caggtgctcg gccagcgacc cgcccgccga tgtccctagg 2940781 ttcaggccga cattgaagcc gtgcggacgc gacacgttct tgatcacgcg aatcgccttc 2940841 tgggtgaacg ccatcaactc ggcgctctcc aaatcggtga gatcctcgag ttcggatacc 2940901 cgacgatagg gcaccaccat caagtgcccg gggttgtacg ggtacaggtt gagcacggcg 2940961 tagaccagct tgccacgagc gaccaccaga ccctcttcgt cggacagctg cgggatctcg 2941021 gtgaacggct gcgcagggct ggccgaggaa ttggggtcac gcttcactgg cgcttcggcc 2941081 aggtagttca tccggtaggg ggtccataac cgctgcagct ggtcgcgctg gccgacaccc 2941141 cgatcgaaga tggtgtggtc ctcggtggcc cgatccgtgc ggtcctcgtc actcacgacc 2941201 ggccactttc accagttccg ctgtaggaac cgcattttcg cggtcagcga tccaggcgac 2941261 aatggccgcc accgcatcgt cacgggccac accgttgatt tgggtgcggt caccgaaccg 2941321 gaaactcacc gcgccggcgg cgacgtcacg atcacccgcc aacaccatga acggcacctt 2941381 gtggttggtg tggtgcacga tcttcttggc catccgatcg tcgctggcgt ccacctcggc 2941441 ccgcaccccg tgcgacttca gttgcgtggc aacctcttcc agataggcga cgtgctcatc 2941501 ggcgaccggg atgccgacca cctgcacggg cgccaaccag gccgggaacg cccccgcgta 2941561 gtgctcggtg agaatgccga agaaccgctc gatcgaccca aatagcgcgc ggtggatcat 2941621 caccgggcgg tggcgggttc cgtcggcggc ggtgtactcc aggccgaaac gttccggaaa 2941681 gttgaagtcc agctggatgg tcgacatctg ccaggtgcgg cccagcgcgt ctttgacctg 2941741 cactgaaatc ttgggcccgt agaacgccgc gccgcctgga tcgggcacca gctccagccc 2941801 ggattcggcg cccacctcgg ccagcacggt ggtggcttcc tcccagacct cctcggcgcc 2941861 gacgaacttc tccgggtcct tggtggacag ttcgaggtag aagtcggtga ggccgtagtc 2941921 ggcgagcagg tcgagcacaa accgcagcag cgaccgcagc tcgtcgcgca tctggtcgcg 2941981 ggtgcagaag atgtgcgcgt cgtccatggt cagcccacgc acccgggtca acccgtgcac 2942041 cacaccggac ttctcgtagc gatacaccgt gccgaactcg aagagccgca acggcagttc 2942101 ccgataggat cgcccgcgcg cgcggaagat caggcagtgc atcgggcagt tcatcggctt 2942161 gaggtagtag tcctggccgg gtttgcgcag cgagccgtcg gcgttgtact ccgcgtcgat 2942221 gtgcatcggg gggaacatgc cgtcggcgta ccagtccaga tgtcccgagg tgtggaacaa 2942281 ctgggccttg gtgatgtgcg ggctgttgac gaactggtag cccgcctcgg tgtgcttgcg 2942341 ccgcgagtag tcctccagtt cgcgacgcac gatgccgccc ttggggtgga aaaccgctag 2942401 gccggaaccg atttcgtcgg ggaagctgaa caggtccagc tcgacaccca gcttgcggtg 2942461 gtcgcggcgc tgcgcctctt cgatgaactc caggtgcctg tcgagcgcct cctgggattc 2942521 ccacgcggtg ccgtagatcc gttgcaggct ggcgtttttc tgatcgcccc gccagtaggc 2942581 ggccgagctg cgggtgagct tgaacgccgg gatgtgtttg gtggtcggga tgtgcggtcc 2942641 gcggcacagg tcgccccaga cgcgctcgcg ggtgcggggg ttgaggttgt cgtaggcggt 2942701 gagctcgtca ccgccgacct ccatgatctc ggcgtcaccc gatttgtcgt cgacgagttc 2942761 cagcttgtag ggctcgttgg ccagctcggc gcgggcctgt tcggtggatt cgtagacccg 2942821 ccggtcgaac agctggcctt ccttgacgat ctggcgcatc cgcttttcca gcgccgccaa 2942881 gtcctcgggc gtgaacggct cgggcacgtc gaagtcgtag tagaagccgt cggtgatggg 2942941 tggtccgatg ccgagcttgg cctgcggaaa cagctcttgg acggcttggg ccaacacgtg 2943001 cgcggtcgaa tggcggatca cgctgcgacc gtcgtcggtg ttggcggcca ccggcgtgat 2943061 atcggtgtcg acgtcgggca cccagctcag gtcgcgcagg ttgccgtcgg cgtcgcgcac 2943121 gacgacgatc gcatcgggcg taccgcgccg cggtaaaccc gcttcgccga cggcggtggc 2943181 cgcggtggtc ccggcaggaa cccgaattcg ggcttgcgac gggtcgccgc catcgactcc 2943241 cggggcgggt tgtgcggggg cgctcatcgg gtcggtctcc aaggcttgga cgtgtcgaaa 2943301 cgatcgcgac catgctatcg gggcgcacgt cgacgaccgt aagccgagtg accggatggg 2943361 ttttcgatca ccggtgtggg cgatcggtac cgggcaggtg accgggtgct ttacggcggc 2943421 tcgatgagcc caaaggatgt tgacgacctg gctacccagc aggacgtcga cgacggacag 2943481 tcgatagagc gtcgctggac ggggagcggt cagcgacgct ggcggcggtc gccgccgacg 2943541 ggccgctacc gtagcaactc gcaaatccag gtctggattt ccggcgccgg ccggctccgt 2943601 tagccgtcgg ctccgttggt gccggccagg ccgggtggcc ccaacagtga tgcaccgccg 2943661 ctgccgccgg gcccgccgaa gcctggagcg ccattgaaga ggctcccggc gccgccgttt 2943721 ccgccgtttc caccgttgcc gatcagtccg acgctgccgc cgttcccgcc tttcccgccg 2943781 tcaccgccgg acccgccagc agtgccggcg ctcgctccgt tcccgccggc cccggcgctc 2943841 ccaccggccc ccgcgttgcc gatgagggca ttgccgccgt ttccaccgct gccgccgcta 2943901 ccgccattcc ctccgaaggc cgtaacagac ccgggcgacc cggcggcacc gccgtttccg 2943961 ccggccccgc cgttaccgta gagtagcccg ccggcgccgc cgttaccgcc ttgcccgcca 2944021 aacccaacgc ccccgaaatc ggcggacaca tcaccaccgg ctccgccggc tccgccattg 2944081 ccgccgttgc cgatcagtcc ggtggttccg gcgtgaccac cgttaccccc gaatccggac 2944141 gctggaccgt tctgagaaat gcctgccccg tttccggcgg ccccgccgtc cccgctgttg 2944201 ccgatcagta ggccgccgtt cccgccgttc ccgccgttgc cgccgctagc ccccgcggag 2944261 ggctcgccgc cgccggtgcc cccggccccg ccggtcccgc cggcgccgat cagcccggcg 2944321 ttgccgccgt tcccaccatg cccgccgata gcgaggttgg tgccggcccc accgatcccg 2944381 ccgttcccgc cggccccgcc gttgccgaac agccatccac cggcgccgcc ggctccgccg 2944441 ttcgcgccgg cctcaaaggg taggccctgg ccgccagctc cgccggcccc accgttgccg 2944501 atcaacccgg ccgcaccgcc ggccccgccg gcctgcccgg gtgcccccga cccgccgttg 2944561 ccgccgttgc cccacagcca cccgccgtta ccgccggctt gcccggtccc gtcgatcccg 2944621 ttcgcgccgt cgccgatcaa tgggcgcccg gtcagcgact gaacgggtgc gttgatcgca 2944681 tcgagcacgt tctgcagcgg tgttgcgctg gccgcttcgg cgaccgcgta ggtgctgcca 2944741 gcttggctta aggccagcac gaaccgttgc tgataggccg cgacctgcgc gctgatcgct 2944801 tgatagtgct ggccgtggct gccgaacagc gcggcgatcg ccgttgacac ctcgtcttgg 2944861 gcggcggcca acacctgggt ggtcgccgcc gccgcggtgt tggcggtgtt gatcgccgag 2944921 ccgatccgcg ctgcatcggc cgcggctgtg gacactaact gtggggccac gttgacaaac 2944981 gacatcgaaa tcctcctgac cgccacgatg ttgagatgcg ggcggcccac cgcctgttac 2945041 cgccgcggtg ggtaaccgtt tattcggacg atccctgccg ttccacgcct gggcgcaggc 2945101 acaaaccgca ccaacattgg tggaacgtgg tgcacactgc acctggggtt ctgccctcat 2945161 cgtgtggcag caggcgaaac ccgcgcggac gagaactctt ccgccaagca gcacaaatcg 2945221 ccctacaccc cagtgaatct ccggacgcca ctacgacagc gcgcaacggt cgcctcatcg 2945281 actgtgtgca cgcgcgcttc gcgatgcgct gccgtggcaa gctggccagg tggacctcaa 2945341 tgcgctggcc gatctgccgc tgacctatcc ggaggtgggc gcgacagcga ccggacgact 2945401 gcccgcgggc tacaaccacc ttgacgtgtc gacgcagatc ggcaccggcc gccagcgttt 2945461 tgagcaggcc gccgacgccg tcatgcattg gggcatgcag cgcaacgccg gcctgcgggt 2945521 gcgggccagc tccgaaaccg ccgtcgtgtc cgcggtggtg ttggtgggaa tcgctttcct 2945581 gcgtgcgccg tgccgagtgg tgtatgtcat cgacgaaccc gacgtgcgcg gattcggtta 2945641 cggcactttg ccgggccatc cggtgtccgg cgaggaacgg ttcgcggttc gctgcgaccc 2945701 gatgacctcc gtggtgtttg ccgaggtgtt gtcgttctcc cgtccggcga cctgggcgag 2945761 caaagccgcc gggccgctgg gcgcggtgac ccagcgcttc atcgcccagc gctacctgcg 2945821 cgcggtgtga ggcgccggcg ccctggttaa ggccgcccga tgcctccgct gtgcacgccc 2945881 tgcgccagcc gggcgagcgc gatggcgcca accagcaaac cgaagtcgcg cagcgcgatg 2945941 tcgtagaaac cgggtccggt gaccaggttg agaatgatcc cggccagcca ggccgcgact 2946001 acccaggcgc cgatgcgcgg tgcgaccgca accaatacgc cggccacaat ctcgattgcc 2946061 ccgaccaagt acatgcattg gtcggcggtg ccgggcacga gatcgttgat ccagccggcc 2946121 agatacatgt tccagtgctg cggatgggtc agcagattga agaacttgtc cagcccgaac 2946181 aggatgggcg cgaccgtgaa cagcgtgcga agcaatacgt atgcagagta tgccggatcc 2946241 ttcagctggt ctgcgagagc agggctggtc gttggtctga tgctcatagc tgcctcccga 2946301 cttctaacag acaacaattt gaacgttaga tcctatagac tgtatcgtca agtgttttgt 2946361 ctgttagaga tggcttgctg aagtggacgg ccgagcttcc ttcgaacgcg acgtcgccgg 2946421 gatcggggca ctcgtggatc cggtgcgtcg ccagctctac caattcgtgt gctcacaatc 2946481 gatgccggtg agccgagacc aggcggccga cgccgtcggc atcccgcgcc accaggcgaa 2946541 attccatttg gaccggctca ctgccgaagg cctgctggat accgagtacg cgcgcctgac 2946601 cggccggtcc ggccccggcg ccgggcggac cgccaagctg tatcgccggg ccggccgcga 2946661 catcgccctc agccttccac agcgggagta cgagcttgct gggcggctga tggccgcagc 2946721 catcgtgctg tcggccacca ccggggagcc gaccgtggaa gtgctcaacc ggatcgccca 2946781 tgactacggc caagccatgg gcgccgccgc caccacccgg ccgcccgcag accccgcggc 2946841 ggcgctggag ctgacgctgg atgtgctgcg caagtacggt tatgaacccc gccgcccggc 2946901 tggccctggc gacgatgagg tcgagctggt gaactgcccg ttccacgcac tggcccggga 2946961 gcagaccgag ctggcctgca atatgaacca cgccttgatc acaggcgtgg ccgacgcgct 2947021 ggcaccgcac agcccggccg ttcggttggc acccggaccg gcccggtgtt gtgtagtact 2947081 caagcgatgt tcggctcacg accccgagtg agcatcgggc agggatttca gcacggtcag 2947141 catgatcacc gaatcctcga cggcgtgcag cgcatgccgt gtcggcggaa tcgcgacgta 2947201 gtcgccggcc ctgccgttcc acgcgtcctc accggcggta aggcacacat ggccctgcag 2947261 cacttgcagc gtcgcctcgc ccgggctgtc atgctcggac aggtcgtggc cggcaagcaa 2947321 tgccagcacc gtctgccgaa gctcgtgggt gtgaccaccg tggatggtgt gggcagcccg 2947381 tccgctgtgt gtctgttgcg cctcggccag cttttcggcg gccaggctgg tcagcgaaat 2947441 ggattccatc ggcgcgtcct ttcagccgtt cagtagcagt atccccgcga cgagcaacgc 2947501 aaccaccttg actatctcca aaccgacata gatgtggtga ccgcgggagc ggggagcctg 2947561 cagcccggcc aatacctgat tggaccgtcg agtcaatcga ggacgcaccg caatcaactg 2947621 gacggccaac gcagccaacg cgaccgaaaa cgccgcggcg atccgcgccg gcgtcgagcc 2947681 gaccaccacg atcgcgagga tgacaagggc gaaaccgacc tcaacggtat tgagcgcacg 2947741 gaagaccaac cggccgatgc cgagcccgat ctgcagcgtc actcctgccg cccggaactt 2947801 cagcggagct tccagaaacg agatcgccac caccattccc agccagacga acgcgacggc 2947861 gacctcgatc gccggtccgg cgctcaccga atggctcctt ccagcggcgt gaagtgggcc 2947921 aggcacaggt cgggttcgac gaacgcgtcg agccggtcga cggtgactgg cgccccccat 2947981 gtttgcaagg ctccccgcat gatccctaag tggacggggc agacgacacc ggcttgagtt 2948041 tcggcgagtt ccagaaacgg acagtgccgc agaccgacct gttgcctgcc gttggatgcc 2948101 cggcgctcgg gagcgaagcc aaggtcgtca agcaccgcga ccaagtggtc gatcgtctcc 2948161 tcggtgtcgg caccggccgg cggcgcttcg agctggcgcc cccacgcccg gcccgcggac 2948221 aacgccatgg cccgcgaatc ccgttcggcg gcaaggccac tggcgaggat ctcggcaagc 2948281 agccggtaac gccgcgtccc agtgctatcc gtccgccgga ccgcccgaaa catcagcggc 2948341 gggcgccccg gtcggccgcg gccgggctcg acccgctcca cctggccatc agcgaccagg 2948401 ttatcgaggt ggaagcggac ggtgttggga tgcacgccca acttgccggc gatcgcggcg 2948461 atgctcatcg gaacccgcga cgcacacaat gcccgcagca ccgcacgacg gcgccccacc 2948521 ggctcttgca gtgacctgat gatgacactc acccccataa ggctcgtcgg ctgcgcctga 2948581 gcaatgcagt aagtttacac aaacggactt gtaaaaacct gcggaggtgg ggtctatggc 2948641 caacaaacgt ggcaatgccg ggcagcctct gcccttgtcg gatcgagacg acgaccacat 2948701 gcaggggcac tggctgctgg cccggctggg caagcgggtg ctgcgtcccg gcggcgtcga 2948761 actcacccgg acactgctgg cccgcgccga ggtgaccgac gccgacgtgc tcgagctggc 2948821 accgggcctg ggccgcaccg cagccgaaat cttggcccgc aacccgcggt cgtacgtggg 2948881 ggcggagagc gatcccaacg cggccaacct ggtccgacac gttctcgccg gccgcggcga 2948941 cgtccgggtc accgacgcgg ccgataccgg attatccgac gccagcgccg atgtcgtcat 2949001 cggcgaggcg atgctgacca tgcaaggcaa cgcggctaaa cacacgatcg tcgccgaggc 2949061 ggcgcgggtg ctgaggccgg gtggccgcta cgcgattcac gaactagcgc tggtgccgga 2949121 cgacgtcgca gagcaggtcc gcaccgacct gcggcagtcg ctggcccgcg cgctcaaggt 2949181 caatgcgcgt ccgctgaccg ttgcggaatg gtcgcacctc ttagcgggcc atggactggt 2949241 cgtcgaacac gttgtcaccg cttccatggc gttgttacaa ccgcgacggg tgatcgctga 2949301 cgaaggcctc ctgggtgcgc tgcggttcgc cggaaacctg ctcatccatc gtgccgcgcg 2949361 tcggcgagtc ctgttgatgc gccacacatt ccgcaggcat cgtgaacgct tgacagccgt 2949421 cgccattgtc gcgcacaaac cgcacgtcga ttcgtgatcc attgaggacc taagcccgtt 2949481 gggctagtga caaacgcctc ctgagcaaaa ccctcctccc ccgttaccgt cgtgcggtag 2949541 ggacaagcca catcggccga gcgggcgatc agccaacgac aggaggaccg cgatgtcatc 2949601 gggcaattca tctctgggaa ttatcgtcgg gatcgacgat tcaccggccg cacaggttgc 2949661 ggtgcggtgg gcagctcggg atgcggagtt gcgaaaaatc cctctgacgc tcgtgcacgc 2949721 ggtgtcgccg gaagtagcca cctggctgga ggtgccactg ccgccgggcg tgctgcgatg 2949781 gcagcaggat cacgggcgcc acctgatcga cgacgcactc aaggtggttg aacaggcttc 2949841 gctgcgcgct ggtcccccca cggtccacag tgaaatcgtt ccggcggcag ccgttcccac 2949901 attggtcgac atgtccaaag acgcagtgct gatggtcgtg ggttgtctcg gaagtgggcg 2949961 gtggccgggc cggctgctcg gttcggtcag ttccggcctg ctccgccacg cgcactgtcc 2950021 ggtcgtgatc atccacgacg aagattcggt gatgccgcat ccccagcaag cgccggtgct 2950081 agttggcgtt gacggctcgt cggcctccga gctggcgacc gcaatcgcat tcgacgaagc 2950141 gtcgcggcga aacgtggacc tggtggcgct gcacgcatgg agcgacgtcg atgtgtcgga 2950201 gtggcccgga atcgattggc cggcaactca gtcgatggcc gagcaggtgc tggccgagcg 2950261 gttggcgggt tggcaggagc ggtatcccaa cgtagccata acccgcgtgg tggtgcgcga 2950321 tcagccggcc cgccagctcg tccaacgctc cgaggaagcc cagctggtcg tggtcggcag 2950381 ccggggccgc ggcggctacg ccggaatgct ggtggggtcg gtaggcgaaa ccgttgctca 2950441 gctggcgcgg acgccggtca tcgtggcacg cgagtcgctg acttaggttc agcggcgaac 2950501 gacaagcacc gaacactcgg cgtgacggaa caccggatgt ccggatggcc cgaccagccg 2950561 cgctagctga ccggcctcac caccgccgat cactgccagc tgtacgcgct cgtcgtggtc 2950621 ggccaggaac cgggcaatac ccgtgtgagt ggtgatcggg tagacgcgca catcgggatg 2950681 acggtggtgc caatcctgca cgcgacgttc gaattcgccg tccggaatct cccggagctc 2950741 ctccggtcgc ccgccgagtg ccagtatggg cgcttgccgc aacttcgctt cccgggcagc 2950801 gtattccagc acggcctcgt tatccggtgc gtcggtcatg cgcaccacga tccagttgat 2950861 gtcagacgct ggctggtcca cttttgagcg catgacggcg accgggcaat gcgccttttc 2950921 ggccagctcg gttgccgtcg aacccaagat cgagctggcg tagcgcccga ttcccacgga 2950981 gccgacgcag atcatctcgg cgtcgcgcga tgcctccaca agcaccgggc cggctggccc 2951041 gcgggggatg tcggtttcga tcttgacgag cttgcccgcg gcctcaacag cggactgcgc 2951101 ttcccgaagc gatctttcag catgcgcaag gtcgcggtcg tagtcgtccg gggacggatg 2951161 tgtcggcttg atcactgaga ccagtcgcag cggcaccgct cggctgatgg cctcgtcaac 2951221 cccccacaat gcggccgtaa tcgccgcgtg cgaaccatcg ataccaacaa tgattgtttt 2951281 catcgtcggc tctcctctcc cagacatttc ccgatgctcg atcaccccgc atcggaaaac 2951341 ctgtccgcat cttggggact cgtggtaaag gtcggttccg gctgggccaa ccggtagacg 2951401 tcaatcagcc gcgcgacatc gctgggagtg acgatgccga ccaccgcgct cccttcggtg 2951461 accagcgcac ggctgcgcgg gccgagcggt gccatccgct ctaggagcgc ggtcagcggc 2951521 tcttgtggtc gggcggtcgg cacgctgtgc agcggcagcg caatgtcacc tacgctggta 2951581 gtgctgcgcc ggctaggcgc aacatcgcgc agctgccgca atgccaccag gcccgtgatc 2951641 gatccgtccc gatcggcaac cggatatgcc gagtgccgtt caccaagcac gtaacgctgg 2951701 atgaaatcct cgacattgat ccatccggga gccgtatgcg gttgggcggt catcgcatcg 2951761 gccacacgca ccccggcaaa cagctgctgg gtcgaaatcc gggtctcctc ctcgcgagcg 2951821 gcagcgaaga taaaccagcc aatgaaggct aaccagaccc caccgacgag gccaccagcc 2951881 acaaactcgg ccaatcccaa cgcgatcaag accagcgcaa ccacccgtcc ggcccgcgcc 2951941 gcaccgatcc cggcgcgcac actatcgccg tggcggcgcc acagataggc ccggaccaac 2952001 cgcccaccgt ccaacggcgc gccaggcagc agattgaaca gccccagcag caggttgaca 2952061 gtagccaacc accaagcaac gctgatcacg atggccgggg tccgcacgcc ggcgagcgtg 2952121 atggccaacg caccgaatgt cgccgacagc gccaggctgg tagccggacc cgcgaacgcg 2952181 atccggaaag cggctttggg cgtctttgcc tcgccgccaa gcgcggtcac cccgccgaac 2952241 agccacaacg tcacgctctc aacggatacc ccggcgcgac gagcgacgac ggcgtgcgcg 2952301 agctcatgag ccaacagcga cgccagcaac atgaccgcgc cacctgcgcc gagaagccaa 2952361 tagaccacgg ccgggtagcc tccgacggta cccggcaaca tggtcgccag actccaggtg 2952421 aacaaccaca ggatcaccaa cacgctccag tggacgttca ccacaaaccc ggcgatccgc 2952481 ccaagcggga tcgcatcacg cattgggtac ctccgatgct ggcggataaa gcctttcgtg 2952541 ccggcggatg atccgaggtc gctagctggc gagggccatg ggcgagcaga ttgccttgac 2952601 gaactgcaca atggcgtgct cgggcaggtg tcgggcgatg tcggcttcgg tgacgattcc 2952661 gaccaagcgg tgctctgaga tgaccggaac acggcggacc tgatgttctt ccatgacgtt 2952721 gagcatctcc tggatgcttg cgttcgcatc gacgtagtag atgctgtccc gggccaactc 2952781 gccagccgtg gcggtattcg ggtctaggcc cgcagccagg cctttgatca caatgtcgcg 2952841 gtcggtgagc atgccgtgca gccggtcgtc gtccccgcag atcggcaacg cgccgatgtc 2952901 gtgctcacgc atgtattgag cggcagcggt tagcgtctcg tgttcgccaa cacaggtcac 2952961 acctgcgttc atgatgtcgc gtgcggtggt catcgggatc ctcctcgagt cggggtgcta 2953021 ttgctgatct gctgccgaag gtacgaccac gtcgtagcga acactagggt cgtttgaccc 2953081 gtgggccgcg ggtcgatgga cccgtactgg cgcgcgttga ggcagctggc ttgcctggct 2953141 tgtcctcgcc gtaggccacc tcaaagtcga aggttgtcaa ttgatttcac cagccggata 2953201 tagcgctatg ggcggccgca ggaccgatag tgatgccgat cggccccgat cggggtaacc 2953261 ggcaatggaa caactgacaa ccatgaaggc tcgtttcgac ggaagcggaa gacgccgaca 2953321 ggcacatgag cctcgcgacg gggccaatcc gttggctttg cgaccgtggt cgtaggtcct 2953381 ggcggagccg ggttgccaca tccgtcacaa gctgacacgc cgaacgtgca accagggcgg 2953441 catcgcctgg gtgtgtctcc gccaccagtg cacattcggc gcagccagcc cacgctcggc 2953501 gcggagttag gcggaacggt cgcgctgtgt ccgtggcgcg tccaacaggc ccgactgctc 2953561 cagcgcagcc tggacaaacc gtcgtaccgg ccgcgactgg aagaagccag tgtgaccgcc 2953621 tggataccac acgatttcgg gtttgcccca gtgctcccag aggcgagtca cctgttcgcg 2953681 tggatgcacg agtcggtcgg caatgcccgc gtagataaag cggcccggca tgggcaccag 2953741 tggcgtaagt gagagcggcg agatcattcg gccgatcggt tcggccatct tgacggtgtg 2953801 gcggcggggg tctttgtgcc gaagaccgca gtggcggccc aacaactcga tcagatcagc 2953861 cactgggaca ccgagaatcg cgcaggcgag accttcttcg aggctggcga ccaatgacgc 2953921 gatgtagccg cccagcgaga gaccgttcaa cccgatcagc gactcctcct cctgcgatcg 2953981 tatccaggac aacagccgcc ggatatccca caccgcttga gccgtcccat gcacatcgtc 2954041 gagaacatct tctccgggaa aaacggcgcc cttcggcaga ccttgcccgc ggggaccatg 2954101 catcggaaga accggcatga caatgttcag gccgagttcg tcatgcagct tccaggcgcg 2954161 gaacaccgcg agatccaacg gggccctgcc catctcggtg ccgtgtacac aaaccagcca 2954221 gggacgcggc tctgggtgcc gcagtaacag ggcgtactcg cgattgttcg cagtgtatga 2954281 gagccaccgt tggctgcccg gttcacccgg atgcggcgta aacccactgt cgaagaagat 2954341 gcgataaaag gagcgtctgc ggtccttgac ctttcggacc gcgacctcgg tgagcggtgg 2954401 gggctgggca aaaaatccgc taggcttctc cagccatctg cgattcccat agaactccag 2954461 tccagcggcc acttcttggc tgatgcgctc gaacactcga tgattgctga ccggacgtcg 2954521 tgccttgagg cccagcagga cgatttcgtc tcgaaaggct tgcgccgcta aggcaatagt 2954581 gggccgtgcg atcggcagtt tatcgggctg ttgacccaga tagtcgcgcc acgattgagc 2954641 gacgtacaga ccggtgtgca tgaacggtcc catggcgccg ctcaagaccg gtggactcag 2954701 gcgaaaagcc gagcgttcgt gggtgccgtc gctcgcagaa cttgccatgg cagcaaagct 2954761 aaccgcgtgc ggaacgacgc gttagggact tacgtcccgc cggaagtcac ctgtgtggtg 2954821 gtggccactg tcgagaccgg cggcccgttg tggtggccca agtgccctaa ggtgatcagg 2954881 tgccgcagcc cggccagcac gccgtcagag tttcacgggg cttggtcgcg gccgatggcg 2954941 tcctcatcgt ggggtcgatg accgaggtgg acgcggcgcg accgggcaca tcgacggtcc 2955001 ccggggcttt gtgggccagt gaagtgacga aagaccccag tggacacgga cttcggcatg 2955061 tccacgcaac gaccgaggca ctccggtatt cgggctgttg gcccctacgc atgggccggc 2955121 cgatgtggtc ggataggcag gtggggggtg caccaggagg cgatgatgaa tctagcgata 2955181 tggcacccgc gcaaggtgca atccgccacc atctatcagg tgaccgatcg ctcgcacgac 2955241 gggcgcacag cacgggtgcc tggtgacgag atcactagca ccgtgtccgg ttggttgtcg 2955301 gagttgggca cccaaagccc gttggccgat gagcttgcgc gtgcggtgcg gatcggcgac 2955361 tggcccgctg cgtacgcaat cggtgagcac ctgtccgttg agattgccgt tgcggtctaa 2955421 gcaccaccta acggtgtcgt cccgaaggga cgattgccga tccggtggat gactttggtc 2955481 cctatgcctt cccgctggac cgcacaacga tcgaaggtgc cacgacgcat agaagacatg 2955541 gccatgccac accctgatag cattgcagca agctacatgt actgctctac caggatcctt 2955601 atgggcaaca gtgggtttga gttatgaaac ccgtgggcac atacccttcc gcgtcgtact 2955661 ggtcagtctc gacagcgaag agatcaccgg ttgatccacc aagcatgcat tggcgggcat 2955721 ctgcataaac ggtgacgtat cagcacaaaa cagcggagag aacaacatgc gatcagaacg 2955781 tctccggtgg ctggtagccg cagaaggtcc gttcgcctcg gtgtatttcg acgactcgca 2955841 cgacactctt gatgccgtcg agcgccggga agcgacgtgg cgcgatgtcc ggaagcatct 2955901 cgaaagccgc gacgcgaagc aggagctcat cgacagcctc gaagaggcgg tgcgggattc 2955961 tcgaccggcc gtcggccagc gtggccgcgc gctgatcgcg accggcgagc aagtactggt 2956021 caacgagcat ctgatcggcc caccaccggc tacggtgatt cggctgtcgg attatccgta 2956081 cgtcgtgcca ttgatagacc ttgagatgcg gcgaccgacg tatgtatttg ccgcggttga 2956141 tcacaccggc gccgacgtca agctgtatca gggggccacc atcagttcca cgaaaatcga 2956201 tggggtcggc tacccggtgc acaagccggt caccgccggc tggaacggct acggcgactt 2956261 ccagcacacc accgaagaag ccatccgaat gaactgccgc gcggtcgccg accatctcac 2956321 ccgactggta gacgctgccg accccgaggt ggtgttcgtg tccggcgagg tgcggtcacg 2956381 cacagacctg ctttccacat tgccgcagcg ggtggcggtc cgggtgtcgc agctgcatgc 2956441 cggaccgcgc aaaagcgcct tagacgagga agagatctgg gacctgacat ccgcggagtt 2956501 cacccggcgg cggtacgccg aaatcaccaa tgtcgcacaa caatttgagg cggagatcgg 2956561 acgcggatcg gggctggcgg cccaagggtt ggcggaggtg tgtgcggctc tgcgtgacgg 2956621 cgacgtcgac acgctgatcg tcggagagct aggcgaggcc accgtggtca ccggtaaagc 2956681 gcgtactacg gtcgcgcggg atgccgacat gttgtccgaa ctcggcgaac cggtagatcg 2956741 cgtggcaagg gccgatgagg cgttgccatt cgccgcgatc gcggtaggtg ccgcattggt 2956801 ccgtgacgac aaccggatcg cgccactaga tggggtgggc gcattgctgc gttatgccgc 2956861 caccaaccga ctcggcagcc atagatccta ggatgctgca ccgcgacgat cacatcaatc 2956921 cgccgcggcc ccgcgggttg gatgttcctt gcgcccgcct acgagcgaca aatcccctgc 2956981 gcgccttggc gcgttgcgtt caggcgggca agccgggcac cagttcaggg catcggtccg 2957041 tgccgcatac ggcggacttg cgaatcgaag cctgggcacc gacccgtgac ggctgtatcc 2957101 ggcaggcggt gctgggtacc gtcgagagct tcctcgacct ggaatccgcg cacgcggtcc 2957161 atacccggct gcgccggctg accgcggatc gcgacgacga tctactggtc gcggtgctcg 2957221 aggaggtcat ttatttgctg gacaccgtcg gtgaaacgcc tgtcgatctc aggctgcgcg 2957281 acgttgacgg gggtgtcgac gtcacattcg caacgaccga tgcgagtacg ctagttcagg 2957341 tgggtgccgt gccgaaggcg gtgtcactca acgaacttcg gttctcgcag ggtcgccacg 2957401 gctggcgatg tgcggtaacg ctcgatgtgt gaattgagac ctgattcatg aaaatcgtcg 2957461 aggagacccc ataccggttc cggatcgaac aagagggcgc gatgcgggtg cccgggatcg 2957521 tgttcgcgtc caggtcgttg ctgcctcgtg acgaaggcga catggccctt gatgcaagtg 2957581 gtcaacgtgg ctacgctgcc ggggattgtc cgggcctcgt atgcgatgcc cgatgtgcac 2957641 tggggatatg gtttcccaat cggcggcgtg gccgcaaccg acgtcgacaa tgatggagtc 2957701 gtttccccag gcggtgtcgg cttcgatatt tcgtgcggcg taagactctt ggtcggcgaa 2957761 gggctggacc gcgaggagct gcaaccacgg ttgccggcgg tcatggaccg gcttgatcgc 2957821 gcgataccgc gcggagtggg cacggcgggt gtgtggcgac tacccgaccg gaacacgctg 2957881 caggaggtgc tcaccggtgg tgcccggttt gcggtggaac aggggcatgg cgtcgcgcta 2957941 gacctcgagc ggtgcgaaga cggcggtgtg atgacaggag cggacgcggc caaaatcagt 2958001 gaccgggccc tccaacgcgg gcttgggcag atcggcagcc ttggctcggg caaccacttc 2958061 ctggaagtcc aggccgtgga ccgcgtctac gatccggttg cggccgcgcc gatgggtctg 2958121 gcggaaggga ccgtctgcgt gatgatccac accggctcac ggggcctggg ccatcagatc 2958181 tgcacggatc acgtccgcca gatggaacaa gccatgggcc gatacggaat cgcggtgccc 2958241 gatcgccaat tggcttgtgt gccggtgcac tcccccgatg ggcaggccta tctcgccgcg 2958301 atggcggcgg cggccaacta cggacgcgcc aaccgccaac tgctgaccga ggcgacgcgt 2958361 cgtgtgttcg ctgatgcaac cggaacacct ctggacctgc tctacgacgt gtcgcacaac 2958421 ctggccaaga tcgagacgca tccgatcgac ggtcagctgc gctcggtgtg cgtgcaccgc 2958481 aagggcgcca cccgctcgct gccgccgcac catcacgagc tgccggccga actggcagcg 2958541 gtcggccaac ccgtgctgat acccgggacg atgggtacgg cgtcatatgt gcttgccggg 2958601 gtcaccggca acccggcgtt cttttccacc gcgcatggtg ctgggcgggt actgagccgt 2958661 caccaggccg cccgccacac cagcggtgaa gcgatacgcg ccagcctcgc aaaacgtggc 2958721 atcatcgtcc gcggtacctc tcgtaggggt atcgccgagg aaaagccgga ggcctacaaa 2958781 gacgtcgacg aggtcatcga agccagccat cagagtggcc tcgcgcgcaa agtggctcgc 2958841 cttgttccct tgggctgtgt caaaggatga atcaacggcg aacattccag ccgtcgcgac 2958901 cgccttcttc agtggtgcag acccgtgacc ggctgatggg tactggcttc gatatccgac 2958961 gacgtcaaag cgaatagctg attcgccaaa tccgacaagg cccgggcgat cgcaagttcg 2959021 tcgccgatct gggccaccgg ctcatcggcc ggatcgagtc gcgccaaacc aacacccacc 2959081 atctgcctgc ctgcccagga cagccgcgcc ttcgcccggg tgcgctcgtc gtgttcctca 2959141 atcagcacat caatttggca ggtttttcca acgtgctcgc tgtctgtcat cgcggcctcc 2959201 ctgtcggatt tgcgcttacg cccgccgatc tgccccgcta gctgaacgcg gtatctatcc 2959261 aatcaccaca atcggtcgtg gagtaggcca gaattctttt cgcccgaccc gggcccgcct 2959321 agcactgaca accgctagat ggccttcagg aggtctgctt tgcccttggt acggagtgtg 2959381 tacagaggtg agccgcgcaa ctgctcaatg cgagccgcca tcttgtcacc gagctcctcg 2959441 agctcggcat cggtgatgtg caccggtgta ggagcgggga tcatgtcgcg ttcctctacg 2959501 tcggcgtgcg cctccaacac ggtccggaac acgttccact cttcttcata cccgggcgcg 2959561 cgctgcggag tgcgcagcag cgtcgcgagc tgatcaacca cctgacggtg ctcggcgtgg 2959621 gtacccgtga ttggtttgcc ggccgcggaa agggcagggt agtacaggtc atcctcgatg 2959681 cggaagtgaa tgtccagctc gatgagcatc tcgtcgaaaa ggacatggcg ctcttcgcta 2959741 ttcaccggcg cctcgccgac tttgcggccc agtcctttaa gcacggtgtg gtggcgcttt 2959801 aatacgtcgt aggcattcac ttcgttgctc tattccgtat tcgggatcaa cgagacaacc 2959861 gtaacctcgc gccgcggccc attaatgtga ggtagctgtg aatcagcaca aagaagcctg 2959921 tgcagtagcg cgacgctcgg cgtaccggca cgagtccgac ggcccgcatg tccatgcggc 2959981 cgccggcacc agcgccgagg cccccgcagg ataccgggat ctgcagctcc tcgtgcggaa 2960041 acagttgccg cagttcgggt tcgggcagtt cggcgagcgt gatgttcgcg cctaacggca 2960101 acaactatcc gtcggcgccc tgggtgccgg gcgggcccat attgccctgg atgccggagc 2960161 tgccacccgg tgacccaccc gcgccgccgg cgcccccgtt gccgcccagc gcgaaatcgc 2960221 cgccctgacc gccggtcgcg ccggtcccgc cgttgccgcc gttgccgccc tggccgccga 2960281 ggccaccttg cccacccgtg ccgcctgcgc cggtgccgcc ggcagctcct gcccacccga 2960341 tcagcccgcc ggctccgccg ctgccgccgg tggtcccgcc ggcgccgccg gtaccgccag 2960401 tgccaccagc gccccccacg ccgcctgtac cgccgccacc gccaattgtc gctcccccgc 2960461 cggtggtggt acccgcgccg ccggcgccac cgttgccgcc ggcaccaccg atgccgccga 2960521 tgccaccggt gccgccgaca ccgccggcac ccccgccacc accaagcccg atgagcgacc 2960581 cagccgcccc gccgttgccg ccgacaccgc cgctgccacc cataccgccg gtaccgccga 2960641 caccgccgag gccccccaga ccgccggtgc cgccttccgc ggtacccgca ccgtcggtga 2960701 gaccctctcc gcccgcgccg ccgacaccgc ccgcgaagcc ggcggcgcca ccaccaccgg 2960761 tgccgcccgt cccgccggcc ccaccggcgc cgccgttgcc gattaacatc ccgccacgtc 2960821 caccgtttcc accggcacca ccggtgccgc cgttaccgcc cgccgcgcct agtgccccgt 2960881 taccgtcacc gccgattccg ccgtcaccgc cgaaagcgtc acctacaccc gtgttgtgcc 2960941 cctgcccccc cttgccgcca gcaccaccca cgccaccgtc gacccctccg gtggcaccgt 2961001 cacccccctc acccccggta gccacgccgc cggcgctgcc gtcggtctca cctatgccgc 2961061 cagcgccgcc agcgccgccg gcaccgccat cggtaccggc agtacccccg gctccaccct 2961121 taccgccggt gccgtcgttg ccgtcgagcg actcccccag cccgccctgc ccgccgacgc 2961181 cgccagcctc gccgacgcca ccggcggggc cgggaccccc gttcccgcca gtttgattcc 2961241 cgttgccgct gttgtcggta ccgttcgcac cggtgttggg gttcgcaatc gagccggggt 2961301 tgaccccgtt tgtcccggcc agaccggtgc caccctgccc gccggcacca ccggacccga 2961361 accagttggc attaccgccg ttgccgcccg cgccgggcat cccgcccagg acacccgcca 2961421 cggccggccc accctgtccg ccggcaccgc catcgcccaa caacatcccg ccggcaccgc 2961481 cattaccacc ggccgccccg gccccaccca gacccccaac accgccattg ccgatcaata 2961541 gcggacccgc accgccgtca ccaccgggcg caccgtcccc accaacgccg ccaccgcccc 2961601 cggtcccgaa gtagctggcc gcccctccga cgccgccggc ggcgccaagg ccaccggccc 2961661 cgccgaatcc accattgccg aacacgccgc caacgccacc gctcccgccc acgccgccgg 2961721 tggtgcccac ccccgccgcg gccccgccag caccgccgaa gcccccgctg ccgatcaacc 2961781 ccgtcgcccc accgacaccg ccagtaccgc cgaccaaagt ggcccctgca gccccaccag 2961841 ccccaccggt cccgccatta cccagcaacc atccaccgcg accaccgaca cccccggcag 2961901 caccggaccc gaccagcccg tccccacctt taccgccagt cccgccgtta ccgatcaacc 2961961 ccgcatcccc gccagcacca cctggctgac ccggcgcacc cgatccgcca ttcccgccgt 2962021 tgccaagcaa cagcccgccc ggcccaccgg gagcccccgt cccgtcggcc ccgttagcgc 2962081 cattgccgat caacgggcgc cccaacaacg cctgggcggg ggcatttacc acgcccaaca 2962141 aatcctgcaa cggcgcagca ctggtggcct cggcaactac gtatgagcgc gcgccgttcg 2962201 taagggactg cacgaactgg gcgtgaaacg ccgacagctg cgcaccaaaa gcctgatagc 2962261 tctgcgcgta cgacccgaac aaggccgcaa ctgccgccga aacctcatcc gcggcagccg 2962321 ccaccacccc cgtggtcggc aatgccgccg ccgcattcgc cgcgttgatc gtcgacccaa 2962381 tgttggccag atccgaagcc gccatcgtca atgcttccgg caccgcaatc acaaatgaca 2962441 tctgcgacct cctggaccgg acaacccgca tggtcgccgc ggatcatcga gcactcggca 2962501 gcaacaaatc ctatcccgcc tcgcagacgg cggaggccat ttggccgccg gcgcgtactc 2962561 ttcgctacga ccgccagagc ccttggttag cgaccggatt cgaccgccgc atgagccaaa 2962621 ctgttaccgg tgtgggtgtg cagaactgcg cagttagcaa acgccgatgc agcgcggtgg 2962681 accacagcag ccgcacaccg taccggcgct gagtgataaa cccgacccgg gcccggcgga 2962741 tgcgatatcg tcttgcggct atggcgggta tgccagaggg caaactcatc ctcctcaacg 2962801 gcggatccag cgcgggaaag acgtcgctcg ccttggcgtt tcaggatctt gccgccgagt 2962861 gttggatgca cattgggata gatctgttct ggtttgcgct gccgccagag cagcttgacc 2962921 ttgcgcgggt gcggcccgag tactacacat gggacagcgc ggtcgaggcc gacgggctgg 2962981 agtggttcac cgtgcacccg ggccccatct tggacctggc catgcattcc cgctaccgcg 2963041 ccatcagggc atacctggac aacggaatga acgtcatcgc cgacgacgtg atctggacac 2963101 gtgagtggct ggtagacgct ctgcgggttt ttgagggctg ccgagtctgg atggtcgggg 2963161 tccacgtatc cgacgaggag ggtgcccgcc gggaattaga acgcggcgat cgccaccccg 2963221 ggtggaaccg aggcagtgcg cgcgctgccc acgccgacgc cgagtacgac ttcgagctgg 2963281 ataccaccgc gaccccggtc cacgagctgg ccagggagct gcatgagagc tatcaagcct 2963341 gcccgtaccc catggctttc aaccggttac gcaaacgctt cctatcttga aatggagcca 2963401 aaagtcgtgc gcaactggaa ctttcactcc tggcaaacgc tggggcgacc cgtcaccgcg 2963461 cgcttgggtt cgggtcgaat cgtcggccgc gcgggtcgtg cggaacattg cacccgacgc 2963521 ggcggaatcg gagttgagaa gtacatggcg ggacgcaccc ggcaccggtc aggcattctt 2963581 tacccatgga tgtggaggcc ctgctgcagt cgatcccgcc gctcatggtc tacctggtgg 2963641 tcggcgcggt ggtagggatc gagagcctgg gcatccccct tcccggcgag atcgtgctgg 2963701 tcagtgccgc ggtgttgtcg tcgcaccccg agctggccgt caacccgatc ggcgtcggcg 2963761 gcgctgcggt gatcggcgcc gtggtcggcg attcgatcgg ctactcgatc ggccgccgct 2963821 tcggcttacc gctattcgac cggctgggcc ggaggttccc aaaacacttc ggccccggtc 2963881 atgtcgcgct tgctgaacgg ttgttcaacc gatggggagt ccgagccgtg ttcctcggtc 2963941 gcttcatcgc gctgctgcgg atattcgccg gaccgctcgc tggcgccctg aagatgccct 2964001 acccgcgctt cctggccgcc aacgtcacag gcggcatctg ctgggccggc ggcaccactg 2964061 cactggtcta cttcgccggg atggccgccc agcactggtt ggaacggttc tcctggatcg 2964121 cgctggtcat cgcggtcatc gccggcatta cggccgcgat cttgctgcgc gaacgcactt 2964181 cgcgcgcgat cgccgaactc gaggccgagc actgccgcaa agccggtacc accgcggcgt 2964241 gaccgaccgg cttgaatccg gtacccacgc tcacaggagc tgcaatctag acagatctcc 2964301 agtcatgtca taaaaatgag atctgaaatt acttgacaag cttgtcttcg gacagtgcgg 2964361 ggcatccgcc gcggtggctg tacgccgtcg attaggagcg caccatgggc ctgatcacta 2964421 cagaaccacg ctctagtccc cacccgctca gcccacggct cgtccacgag ctaggcgacc 2964481 cacacagcac gctgcgggca accactgacg gcagcggggc agcgttgttg atccacgcgg 2964541 gcggcgagat cgatggccgc aacgagcatc tctggcgtca attggtcacc gaggccgccg 2964601 ccggcgtcac ggcgcccgga ccgctcatcg tcgacgtcac cgggctcgat ttcatgggct 2964661 gctgcgcttt cgccgcactg gccgacgagg cacaacgatg tcggtgccgc ggcatcgacc 2964721 tgcgtctggt gagccaccag ccgatcgtcg cccggatcgc cgaagcgggt gggctgagcc 2964781 gagtgctgcc catctacccg accgtcgata ctgcgctcgg caagggcacg gccggtccag 2964841 cccgttgctg atcccggccg taagagcacc gagccgaccg ccggtggccc caccgctagg 2964901 gccgatcgca ccgccgcgcg acgatgttcg cgtcaggcgc gcatgcggta tcgcttgcct 2964961 tgcaaggtaa tccacttcgg acatccacga tgcaggtcgc gatcaagtcg ggcgcgccgc 2965021 agcagtcagt ggccgcgagg ggcgtacatg atcacggcta ccccggccat gcagccaagg 2965081 gcaccgatga catcccaccg gtcgggccgg aacccgtcca gggccatgcc ccaggcgagc 2965141 gaaccggcga caaacacacc accgtaggcg gccaagaccc gaccgaaatg ggcgtccggc 2965201 tgcaatgtgg cgaagaaccc atagacccca agcgcaataa ctccgagtcc cgcccaaagc 2965261 caaccccgtt gctcgcggac gccctgccat accagccacg cgccaccgat ctccgcaacc 2965321 gccgccagga cgaatagcag gattgaccgc accaccatgg ttgcgagcct acgagatccg 2965381 ctgccctgcc gccccccaac caatcgcgca ccccaaatgc ttcccgtcac ccgcgctcag 2965441 ccagacaccg gtgttggcta caactatggt tcccggatca ggcgcagcag ttcgggttga 2965501 gcacggtaca cagcgcttgc agggcttcag gatgtacccg atggaagacg tgcatgcccc 2965561 ggcgatcgga aatgaccagg ccggccttgc gcagctgggc caagtggtgg ctgacggtgc 2965621 catcgctgag gctgagcgcc gccgctagtt ggccgctgac ctgctcgccg gccggcgagc 2965681 tgaacaggta ggacatgatc ttgactcgtg ccgggtcggc cagggccttc agccgcagcg 2965741 ccaccgccaa ggcgtcgccg tcgctcatcg gccccgccgc caccggggcg cagcacacgg 2965801 gagcggagat gtcaatcacc ggcagcgact tgggcatagg cccaccctgc cagatacctt 2965861 gacatatatc aaagagatgt tgcacactgg gttcggcgcc attttgatat aagtcaaaca 2965921 actgggaggt gtctaccaat gtcccgcgtt cagctagccc tcaacgtcga cgacctggag 2965981 gccgcaatca cgttctactc caggctgttc aacgccgagc ccgccaaacg caagcccgga 2966041 tacgccaact tcgcgatcgc cgatccgccg cttaagttgg tgctgctgga gaaccccggc 2966101 accggcggta ccctcaacca tctcggtgtg gaagtcggct cgagcaacac cgtgcatgcc 2966161 gaaatcgccc ggttgaccga agccggactg gtcaccgaga aggagatcgg caccacgtgt 2966221 tgctttgcca cccaggacaa ggtgtgggtg accggcccgg gtggggaacg ctgggaggtt 2966281 tataccgtgc tggccgactc cgagaccttc ggcagcggtc ctcggcacaa cgacaccagc 2966341 gacggcgaag caagcatgtg ctgcgacggc caagtcgccg ttggcgcaag cggctaactg 2966401 taggcctgac cccggggtgc gtctccaagc cgcggagccc accccgggcc actcaatgcc 2966461 ccctaacccg cgtagcgccg ttcaccgcgt ggccgcttgc ggacctgatt cgatatttgt 2966521 caatattgat gtatgtcgaa tctgcatccg ttaccagagg tggcgagctg cgtagtcgcg 2966581 ccgctggtgc gcgaaccgct gaatcctccg gccgcggccg aaatggcggc ccggttcaaa 2966641 gccctggccg atccggtgcg attgcagctg ctgagctcgg ttgccagtcg cgccggcggc 2966701 gaggcctgcg tctgcgacat ttccgcggga gtcgaggtga gccagcccac gatttcgcat 2966761 catctcaagg tgctgcgcga cgcgggtttg ctgacctcgc ggcgtcgggc ctcgtgggtg 2966821 tactacgccg tggtccccga ggcgctgacc gtgttgtcga acctgctcag cgtgcatgcc 2966881 gatgccgcac ccgccctggg ggcaccggca tgacggagac ggtcacccgc accgccgccc 2966941 cggcggtggt gggcaaactc tcgacgctgg accgcttctt gccggtgtgg atcgggtcgg 2967001 caatggccgc cgggctacta ctgggccggt ggattcccgg cctgcacacc gccctagaag 2967061 gggttcagct cgacgggatt tcgctgccga tcgcgctagg cctgctgatc atgatgtatc 2967121 cggtgctggc caaggtgcgc tacgaccgcc tcgacaccgt caccggtgac cgcaagctgc 2967181 tactcagctc gctgctgctg aactgggtac tgggcccggc gttgatgttc gcgctggctt 2967241 ggctgctact ggcggatctg cccgagtacc gcaccgggct gatcatcgtg ggcctggctc 2967301 gctgcatcgc catggtgatc atctggaacg acctggcctg cggggatcgc gaagccgccg 2967361 ccgtgctcgt cgcgttgaac tcgatctttc aggtggccat gttcgccgcg ctcggctggt 2967421 tctacctgtc ggtgctaccg ggttggctgg gcctcgagca gaccaccatc gccacatccc 2967481 cgtggcagat cgccaagtcg gtgctgatct tcctcggcat cccgctgctg gccggctacc 2967541 tgtcgcggcg gatcggcgaa aagaccaagg gccgcaactg gtatgaatcc cgcttcctgc 2967601 ccaaggtggg accgtgggcg ctctacggtt tgctgttcac catcgtgatt ctctttgcgc 2967661 tgcaaggaga tcagatcacc ggccgaccgc tggacgtcgc acgcattgcg ctgccgctgc 2967721 tggcctactt cgccatcatg tgggtaggcg gctacctact gggggcggcg ctgcggctag 2967781 ggtatcggcg caccaccacg ctggcgttca ccgccgcgag caacaacttc gagctggcca 2967841 tcgcggtggc catcgccacc tacggcgcca cctccgggca agccctggcc ggagtcgtcg 2967901 ggcccctgat cgaggtaccc gtcctggtgg ggttggtcta tgtgtccctg gcgctgcgca 2967961 accgcctcgc cggtcccaac gcgacccacg atgccgacaa acccagcgtc ctattcgtct 2968021 gtgtgcacaa cgccggacgt tcccagatgg ccgccgggct attgacccac ttggccggtg 2968081 accgcatcga agtccgttcg gccggaaccg agcccgccgg tcaggtcaat ccgacggctg 2968141 tggccgcgat ggccgaaatg ggcatcgata tcaccgccaa tgcccccaca ttgctcaccg 2968201 gcgggcaggt ccagtccagc gacgtcgtca tcacgatggg ctgcggcgat gcctgccctt 2968261 acttcccggg tgtctcctac cgcaactgga aactacccga tcccgccggc cagcccctcg 2968321 acgttgtgcg catgatccgc gacgacatcg cagaccgcgt ccaagccctg atcgccgagc 2968381 tgctggccac cgccaagacc agatagcgtg tgccacgctc ggtgctgcgc cgatacgtga 2968441 ggtcccggct gggatcggat tttccgcgtg tacggcggct aggcaccagc ggatcgcatt 2968501 tgtactggtt agagacttgc cgagtggccg cattagcctg cgtggagcgc ttggtcaaaa 2968561 agctcggccc tgttcggccc tatgggttcc tgttgatctg ccctgttcgt agtctcgaca 2968621 aagcggctgc ccgagatcgc gtgcgacgat atcgggagcg gctgcggcaa cgaggtctgc 2968681 ggccgataca gatctgggtt cccgatgtga acgcacccga atttgtcggc gaagcacacc 2968741 gtccgtcggc gctcgtcgcg gcccgcgaat acgaggacga cgatcaagcc ttcgtcgatg 2968801 cggtatcggt cgactgggac gacgccacct gacgtgcggc gcggcgacat ccacaccgcg 2968861 gcggcgcgtg gtgcctacac cggcaagcca cgccggtcgc ggtcatccag aatgaccggt 2968921 tcgattcgac ggcctcggtt accgtcgtgc cgtttaccac gcgtgatgtc caggcatccc 2968981 tgatgcgaat cccggcccca gcgtccaaca ccaccgggct gaccgagacc agtcgcctga 2969041 cggtcgacaa ggtgacaaca tcccccgcac cagcctgacg cggcaggttg gtcggttatc 2969101 ggccaaaaac atggtcaggc tcgaccgtgc attgctggtt ttcctggccg gctgacaatt 2969161 gcgccacctg gtcatcagaa ctgatcgggc ggggaaacga aacggggctc ccagcggagg 2969221 tcatgagttg gcgcgccggt ttcgccgcga tctctccgaa cttgaccgct aaacctcggg 2969281 gcagaagtca tgaacaagcc cgttaggagg cgtttgaggc cgtaaatgtt gatgagggcg 2969341 gggaaagtgt cgtcatggcc gtcgcgctga attcaccacg cccccacgac ggagctcgtg 2969401 ggcacccagc attcactgct taccactacg atctcgctca cgaggttcga gcagccactg 2969461 tcgcctgccg ccaacgaata atgctccctg acctagtggt cccggctggg atcgaaccag 2969521 cgaccttccg cgtgtgaagc ggacgctctc ccactgagcc acgggaccgg cgccgaggag 2969581 atgaacgagg tcgaagatta gcacgtgcaa gacatcgtca gcagcagtct acgtgcgctt 2969641 cacatagggg ctgcgatagc ctagagccgc aacgtaccaa gagatttgtg tgggcccgct 2969701 cacctcgact atcgtcgtgc ttcgcaccgg gcgacgatct cgttcgttgc gcgcggatgt 2969761 agcgcagttg gtagcgcatc accttgccaa ggtgagggtc gcgggttcga atcccgtcat 2969821 ccgctcgaag gtgctagtgg catcaaatcc cagcggtgga gtggccgagt ggtgaggcaa 2969881 cggcctgcaa agccgtgcac acgggttcga ttcccgtctc cacctccagg ttcaaccccc 2969941 agcgcgatta gctcagcggg agagcgcttc cctgacacgg aagaggtcac tggttcaatc 2970001 ccagtatcgc gcaccagtgt tcgagcaggt caggcctggt ttttaccggg ccttcgccgt 2970061 ttccgcgcaa taaacgcgca atagtgccgc cgctgggtgc gccccacgga ggagtttgct 2970121 aaatgaccac cacgccccga caacccctgt tctgcgccca cgccgacacc aacggcgacc 2970181 cgggccgctg cgcctgcggc cagcagctcg ccgacgtcgg cccggccacc ccgccaccgc 2970241 cctggtgcga accgggcacc gaacccatct gggagcagct caccgaacga tacggcggcg 2970301 tcacaatctg ccagtggaca cgatattttc cggccggcga cccggtggct gccgacgtgt 2970361 ggatcgccgc cgacgatcgt gtcgttgacg gccgggtgct gcgcacccaa ccggcgattc 2970421 actacacgga accgcccgtg ttggggatcg gcccggcggc ggcccgccgg ctggccgctg 2970481 agctgctcaa cgccgccgac accctcgacg acggccgccg gcagctagac gacctcggcg 2970541 aacaccggcg gtgaacaccg cgacccgggt ccggctggcc cgcaaacgcg ccgaccggct 2970601 caatctgaaa ctaatcaaga acggccacca cttcaggttg cgtgacgccg acgagatcac 2970661 gctggcggtc gggcacctag gggtggtgga agccttcctg gcggcggcca agtcgcaaaa 2970721 caagccgccc ggtccgccgc cgagcctcca cgccccgcca tcctggcggc gcgacatcga 2970781 cgactacctg ctcaacctga acgccgccgg tcaacgccca gcgacgatcc ggctacgcaa 2970841 gacggtgctg tgcgcagccg cccacggcct cggccgccca cccgccgacg tcaccgccga 2970901 acacctcctg gactggctag gcaaacagca gcacctctcc ccagagggcc gcaaaaccta 2970961 tcgcagcacg ttgcggggct tcttcgtgtg ggcctacgaa atggaccggg tgcgcgacta 2971021 tgtcgcagac tccctgccta aggtgcgctg cccgaaacag ccgccccgcc cggccggcga 2971081 cgacgtctgg caagcggcgc tggccaaggc cgaccgtcga atcgagctga tgatccgcct 2971141 agccggtgag gccgggctgc gacgcgccga agccgcccag gcgcacaccg gcgacttgat 2971201 ggacggcggg cttctcctcg ttcacggcaa aggtggtaaa cgccgtattg tgccgatcag 2971261 cgactacttg gccgcgctca tccgcgacac cccgcacggc tacctgttcc ccaacggcac 2971321 cggcggccac ctcaccgccg aacacgtggg aaaactcgtc tcccgggcat tacccggtga 2971381 cgcgaccatg cacaccctgc ggcaccgata cgccacccgc gcctaccgcg gctcccacaa 2971441 cttgcgagct gtacaacaac ttctcggtca cgcctcgatc gtgacaacag aacgctacac 2971501 agcgctgtgc gacgacgagg tgcgcgccgc agcagcagcc gcatggtgag tcgccctggc 2971561 gtttgctgca gccgatcggc gtcacccccg acaggcggct cgtattcggc cagcggcggc 2971621 tcgaggctgc acggctgctc ggatgggagc gcatcccggt gcacgtgtgc cacacgatcg 2971681 ccgacgtggt cgaccgggcc aaagccgaac gctccgaaaa cacgcttcgc aaggatttca 2971741 ccccctcgga gctgctcgcc gctggtcgcc ggatcgccga gctggaacgg ccgaaagcca 2971801 aacagcggca acgcgaaggc ggcgaccatg gccgccaggc tcgatattct ggcttaggct 2971861 ccatggagcc taagccagaa tcagagcgcg atgcccacaa agccgacact gccatcagcg 2971921 aagccctcgg catctcccgc ggccactacc agcggctcaa acgaatcgac aacgcaaccc 2971981 gcagcgaagc tggctaccgg gatggtttaa acggttggag cggctgaccg ccggtgcccg 2972041 ggatgggccc cggcggcaac ttgtccaacg ggcgacgctc acgtccacgc ttgcgcagct 2972101 catcttcgtg aaccgccccg gcatgtccgg agactccagt tcttggaaag gatggggtca 2972161 tgtcaggtgg ttcatcgagg aggtacccgc cggagctgcg tgagcgggcg gtgcggatgg 2972221 tcgcagagat ccgcggtcag cacgattcgg agtgggcagc gatcagtgag gtcgcccgtc 2972281 tacttggtgt tggctgcgcg gagacggtgc gtaagtgggt gcgccaggcg caggtcgatg 2972341 ccggcgcacg gcccgggacc acgaccgaag aatccgctga gctgaagcgc ttgcggcggg 2972401 acaacgccga attgcgaagg gcgaacgcga ttttaaagac cgcgtcggct ttcttcgcgg 2972461 ccgagctcga ccggccagca cgctaattac ccggttcatc gccgatcatc agggccaccg 2972521 cgagggcccc gatggtttgc ggtggggtgt cgagtcgatc tgcacacagc tgaccgagct 2972581 gggtgtgccg atcgccccat cgacctacta cgaccacatc aaccgggagc ccagccgccg 2972641 cgagctgcgc gatggcgaac tcaaggagca catcagccgc gtccacgccg ccaactacgg 2972701 tgtttacggt gcccgcaaag tgtggctaac cctgaaccgt gagggcatcg aggtggccag 2972761 atgcaccgtc gaacggctga tgaccaaact cggcctgtcc gggaccaccc gcggcaaagc 2972821 ccgcaggacc acgatcgctg atccggccac agcccgtccc gccgatctcg tccagcgccg 2972881 cttcggacca ccagcaccta accggctgtg ggtagcagac ctcacctatg tgtcgacctg 2972941 ggcagggttc gcctacgtgg cctttgtcac cgacgcctac gctcgcagga tcctgggctg 2973001 gcgggtcgct tccacgatgg ccacctccat ggtcctcgac gcgatcgagc aagccatctg 2973061 gacccgccaa caagaaggcg tactcgacct gaaagacgtt atccaccata cggatagggg 2973121 atctcagtac acatcgatcc ggttcagcga gcggctcgcc gaggcaggca tccaaccgtc 2973181 ggtcggagcg gtcggaagct cctatgacaa tgcactagcc gagacgatca acggcctata 2973241 caagaccgag ctgatcaaac ccggcaagcc ctggcggtcc atcgaggatg tcgagttggc 2973301 caccgcgcgc tgggtcgact ggttcaacca tcgccgcctc taccagtact gcggcgacgt 2973361 cccgccggtc gaactcgagg ctgcctacta cgctcaacgc cagagaccag ccgccggctg 2973421 aggtctcaga tcagagagtc tccggactca ccggggcggt tcatcggcgg ccttgcgtgc 2973481 ctgctcagcc tggcggcgcc aagcctcata gcgacgccga atctccctct caatcgcgcg 2973541 ctgcacaccc atccggaact gatcctggac acgctgctgc tgccgaaccc aacgctcaag 2973601 ctcacgccgg tagtcgttga ctgatctcgc cacccaaaat cacccctctt gaccctcttg 2973661 gttctctttt tggcggcgtg ggcgacccgg cacccctaag tctccgggcc gtgcgggccg 2973721 ctgggagccg aaaggttgct aaagttctcc ctttttgccc gcacgacccg aaaagggccg 2973781 cccacgcctg gcacctacgc ggtggtctgc accttcagca cgcggaacgc attgtccacc 2973841 agcacatcag aaccgactcg gaaccagcag aagaatccgc gctgtccggt cggtcggcgg 2973901 ttgccgccga acacgtgcgg caccagctcc accgtcgacc cgacccggtc ggtgatgatg 2973961 aactgcttcc agtcgccaag caccagcggg taattggtgg cggtcaccgc cgcgtccacg 2974021 gtgtccatgt tcgacacctc ccagatgtgt ttcccggcca gcatcggcgg gctggcgtgc 2974081 agcgatggga atttcagcgc cccattcgcg gtttccgcct ggcgcagcac gttgatggtg 2974141 gacaagttcg ccgcgaacgc gctgttggat tgaaagcgcg gcggcaacgc cgactgcagc 2974201 gcgtaaacgt cggcggctac aacggcttcc gtccccgcgc cggtgacggt gtagtccgcg 2974261 gtgccggtca gtgcggagac gaatccggtg ggctcgccgt tgccggagcc gctgacgaac 2974321 gccgccgcct gcagctgctc aaccgaatcc gctaggacgc ggcccacctc tgcgacgaat 2974381 ccggcggcgt caccctcaat ctcgagactg aacggaatcc agcaggagcc acggtagctc 2974441 ggcaccgccg gctgggccag cgttggcgaa tcgtcggaca cctcctgggc ttcggagtac 2974501 caatgagcct cggcgccttc ggaggtcacg ccccgccaaa cctcggaggt cgtttgcacc 2974561 accctcgcca cctgccggat cggattcgtt gaaccatcac ccgacagcag aatcgccgga 2974621 tccagcgccg ccgggatcaa aaacccgccg gcggtgtcca ccaagcccat tgctcgctgc 2974681 tcggcggcca ccgcggccgc ctcacgccac gcggccgctt cccggtcggt ccaggtcgtg 2974741 tgccccgcaa cagggttcga aaccctcttg acgaacgccc ccaggtagtc gcggttgccg 2974801 gtggccgcca gccagcgctg cgcccacgac gtcgactgcg gcggcccggt gcggcacaag 2974861 gtttccgcgg cttccgccgc ccgcgacgac atcaggccat cgcgcacaca aacgtccagt 2974921 gtgcgaaacg cgatgtcgcg caacgagttg cccggcggcg cgtcgccgtc gtcgccgccg 2974981 gtgggagcac cgggcaccac cctcagctca ccggcccggc agcggcgcag cgcctcctcg 2975041 gcttcgcggc cgcggcggcg ctgctccgcc cgcagttcct cggcgtggcg tgtcagcgcc 2975101 tgaaaacgtt gcgccacatc accggtcagg tcgccctcga cggagtcgag gagctgtttt 2975161 gccgcggaac gggtttcgtc gaggctgagc tgtttgatgt cgccatcgtc agcgaaatgt 2975221 tgttcattag tcatgagaga gttaccaatc catcagggct aacctggctt cggctagcga 2975281 acgggaaacg actgcaagcg attccgcgcg cacaccggcg atctgcgcgc ccagataggc 2975341 cggaacgccg gtcaaggaga cctccaacag cgccgcctcg acccgcacga tcacatcccc 2975401 ttcccggcgg tcccggatcg gccggaaacc caccgaaaac gcgtccacca caccagcttt 2975461 cacattcgcc agggcctcgt cgccgtccgg ggtgttcgca agctcgaacg ccccgaacaa 2975521 gccgtgaggc tcctcacgca gctcgacggc ccggccaacc gggtagcggg ttcgagcgtc 2975581 gtgggagacc agcagcttca ccttgtggcc gcgctcagcg atggagcgcc gaaaagcgcc 2975641 aggagcgaac atttcccgga actcgccgtc gaggtcgcgg acggtggtca cctcgccata 2975701 aggcacgatg acgccgtaca cggtgcggcc ctcaccaggc cgcagctcgg ccgtgcggaa 2975761 aaggatgcta ctcaaaattc ggccaccgcc tagcagacgc aagaaacgcg cggaatcgct 2975821 tgtggcgcat ggcggccgct atccgggttc cagccgcccc gcggcgactg cccggcgtcg 2975881 gcggatgccg agatgccaaa ctcgattgta tcacacacaa aaggtcatca ccggtctggg 2975941 gcgaacgggt tgaactcgtc gtcgtcgggg tcccccgccg ccgccagcac agcagccaaa 2976001 ttcgcctcag cgcttggcgt gcaccccaat tcgcgcgcga gcaccaaaac gtccctcgtc 2976061 gcggcccggg ccgcggccac ggcaggatgc accgtcaccc gtcggctgcg ggcgttcgtc 2976121 gcgatgaaac cctgttcacg gtaggctgtt acagcctgca tgagctgatc ccaggcgacg 2976181 cagaaggagg tcagcacccc aaggtcggac tccttcagca ggtttaatgc cgcaagctcg 2976241 ggaacgacgc gcccccacat gtctttagcg cctggcggca accaatccgg gcattccggc 2976301 gcaacacgct cgaacgccgc cggtggtgta acccgccggc cgccagaatc acggcccggc 2976361 gagcggccgc cgaggagttt caactgcgcc ggcgccggcg cgggaccacg cctacccatt 2976421 ttcaacacca ccctcctctt tccgggtttc gggtcgcgaa tgccatgatg ccaaaaaacg 2976481 ccccataaaa cttgagcgcg cacacgctct cccaccgtgg cggtgtccgg tcgggcggtt 2976541 gctggcgatg gcaacccacc ccccctacct gcgggtttcg ggttttcact gtttgctgtc 2976601 gggttcgtcg gggaagtgat acggatgcca gccgagcttc atccccgcct caagccggcg 2976661 ctgctcatcg tcgagatgca tcgacaacag cgccagttcg gctgtgtcgt cgtcgatttc 2976721 gtgtgatgtc gccaccatct cggcgtacgc tcggcggata gcctcgaggt cgcgctgccg 2976781 ttgggcgcgg cgctgctcgg cggacggaac gtcggccggc caaccatgtt gccgcccaac 2976841 gcgattccga cgcggggcgt tgagccctgc ggcgatggct ggctggcgtt tagtgcgctt 2976901 gtgggtcaat gatgggctcc tttctccctg gaaaatgatg tgatcgacgg tgttccgggt 2976961 gtccgacagt cgggttctct cggcgggctc acggcggatc accccggtcg acggccgccg 2977021 ccgcggcggc cgtcgcggcg aacaaaacgg ccgcgacgcc gtgcgactcc gccacagcgc 2977081 ggaccaacgc tcgcgcaagc tcgacggccg cggtccgcca tgcgcccgcg gcgtcgccgg 2977141 ccagcgcggc ggccgaagcc tcgactggcg ggccgccgac aagctcgtcc gcggcggcca 2977201 gcaacgtccg agcagccaac gcgtggccgc tcatcgggcg ccgtcccgag cgctagccgc 2977261 cgctcgacct cggcagggcc ggcatttgcc ggcggccttg gcctcagtac tgaggagctt 2977321 gttggggcat cccggcccgg agcacagcgg cgcgtcgccg ttggggtgcc cgttgggcgc 2977381 cggcggctcg tacggcaagt cgcccgcctc cgggagatcg gttgcatcgg ttgcgccggt 2977441 tgcatcaccg ggatcgccaa ccggcggtga aaccgcggaa accgcggaaa ccgataaatc 2977501 tcgttcctcg ggggtttcgt cgtcggcaga gagataccgg gaccacgcat cctcgaactg 2977561 ggtccgcgaa taccctttgt agggtggttc gccaccactg tgctggaact tcggcccgat 2977621 gccgtatctg ccgagccggg tcgcgaggcc gcgcgcgtcg agcgggtcgc cgcggcggat 2977681 ggagccccac ggtccctcct ccatccggtt cagtccggtc aggatgtcgc tggtgcgcat 2977741 ccggtcccgg tcgctgaaga ctcgacggat atcccgcagc agcagcacgc ctatgctggg 2977801 cttggctcct cgatttgcgg ttgcatccgt ttctgcggtt gcacgggcgg ttttgggcca 2977861 gtgcccgccc gcggtgtcag caaccgcaac cagggactcc cagacgtcgg cgcgccggtc 2977921 ggtcaccccg tccggcatcg ccggccaacc gctttccagc gggttaatgg cggccgccca 2977981 gttcgccaac cggtcgtgca gcttctcggc ctcggggccg ttgacgcggg ggcgccacgg 2978041 ctccacgggt tcggttggtg ccctcctgcg catcctcacc acgatcgacc gagacatgat 2978101 ggtgtcgggc aggtcgtcga ggccggccaa ggcgaccgca cagtacgctg gcagttcctc 2978161 ggtctcaacg atcttgccgc ggatgacgca gcggcccgcg acggctccct tgcggtggcc 2978221 ggcgttgatc acgccgcgaa tttcctcgtg ttctttagct ttcgggccaa acagggtgtc 2978281 acactcgtcg tacaggacgg tcggccgccc gaccggatcg gccacccgac ggaacaggta 2978341 ggccggtgtg cagttgatgg catgcaccgg ccggggcact agcggttccg tgacttcgag 2978401 tgcgcggctc ttgccagagc cgggttccgg tgacaaaaaa gcgattcggg gcgttgagtc 2978461 ccacgcctcc ataaaccagc aatgcgcaat ccagagggtg tgcgcgatca gttcatggtc 2978521 gcttggatag actacgaacc gccgcaagaa tgccctaatg tcgtcgagca attcggcgcc 2978581 gaccggcggc atcggctggc cgtcctcgtc acaccagatc gggtcgggat agtcacggcc 2978641 gtaggggatg tcagccatct cagaccacca cccgccgaat gtaggcgtca cgccgacgct 2978701 ggatctcccg agagaccgcc ggccagtcgg cggcggcgga tacgtcacgt gatgcctcgg 2978761 ccgacgcggc ctggcacgtc tccacccgga gtgcccaatg ccgagcagcg tcgcagatcg 2978821 cggcccattt gaccgggtcg gtgtcgtcga ggtcgcacca cgccggggtg ccggccatcg 2978881 gccattccac ggcggcggcc agggtcggtg cgacatactc gtgcaccgac caccacgaca 2978941 cggcgcggga cgcggtagga tcggtgctag acggtgtggc gactgtcgcg ggtgcccggt 2979001 cctctgtggc cgggcatcgt cgcgtcggcg gcgacccgcc gacggcggtc atgcggcacc 2979061 accgaacggg tgcatggcgc cgtcgacctc atcgcggcgc agacggacga ggcgggtgcc 2979121 ggagcggtat ccgcgtaggc ggccgtcggc gatcatctgg cggaccgtgc ggtcggtgac 2979181 cgctagatat tcggcggcct cactgatcgt gatgtaccgc cgtgacaacg ggggagcgtc 2979241 tgccatgccg ggcctttcgg tctcgtgaga gaccgtccac ccgagactcg gcgacgggaa 2979301 cgcgcacatg cgcgcaccgg aaaatttacc cgcctagctg gctcaagcgc aagcataatg 2979361 cgctgaacgg aattacgtgt cgcgcctctg ctattgatgg atcgtcagcg tcggggatgg 2979421 tcgacgttct cagctcgtga agcttcgccc cgaaaccgtc gaggatcgcc gcggcggtca 2979481 tctcggatat cggcgcatac tttcggcatg cccggcattc gagtttccac tcgatatggc 2979541 ggccccactg tgtttcggtg aagcgggtgt tcgaccgcat ctgtatgtgc ccgccggcgg 2979601 gtcgttcgtc gtcgatccag gcgatgatga gcgctcccgg ttcgtcgtcg cagttgcaca 2979661 taactacgta cttaaccgca tcggccatca tcacatctcc tggttctcgg ccagtttgct 2979721 taacagtgcg gcgatttcgc ggtcccggcc cttggcggcg tgctggtagc ggagtgcggc 2979781 gccggctgtg ctgtgtccta gccgctgcat cagttcggcc agtgtggcgc cggtggatgc 2979841 agccaacacg gcgccggagt gtcgaaggtc gtgcacccgt aagtctggtc ggccggcggc 2979901 ttttcgggcc ttgtagaaca tgcggtacag cgccgagggt gctaggtgac ggttggggtc 2979961 gttgaccgat gggaacagca gggactcccg gccggggttg acgtgtttgt gaaggtggtc 2980021 ttcgatggcg ggtatcagat gtggcgggat acttatgtcg cgcactcccg catcgctttt 2980081 cggtgtcgtc accttgaagc cttcgcccac ccgaacgaca gcccgccgca cccgcgcaac 2980141 ctcgccgtgc aggtcgatgt ctttgcggcg taattcggtc agctcgccgt agcgcatggc 2980201 cagccatgcc gccatcagca cgaacgcctg gtaggggtcg ggcatggctt tggtgatggt 2980261 ttccagctcg tcgagggtgg cgggcctgat cttgtggacg cggcgggcgg tggacgcgcc 2980321 tgagatgcgg caggggttgg agtcgatcag gtcgtcggcc aaggcggtct gcatgattgc 2980381 gcgcagcaag ctgtaggagt gtgcccgcat ggtcggtgtg cccacggcgg tggtggcgta 2980441 ccagcggcgc acggcggccg gggtgatgtc gcgtaggtcg gtgtcagcga aggtggccag 2980501 gatgtggttg tccagcagtt tgcgatagtg ggcgcgggtg cggtccttga ttccacgctg 2980561 cttcagccat ccttcggcgt actcaccgaa tggggctccg gggcggtctt cctgacccga 2980621 tgccggggac catagttgtc ggtcgatttc gcggcggcgg tcggtgagcc atgcttcggc 2980681 gtcgatcttg gcgttgaagg ttttgggggc gatgtacacg cggccgtcgg ggccggtgta 2980741 gctggcttgc cagcggccgg agttgaactg tcggatgcga ccgaatttgc gtctctgacg 2980801 cttgccggtt tgcgtcactg tcgtcccctg tcccgcgcaa taaacgcgca ataagagact 2980861 acatcagatg ccgcttgctt ccgcacgctt ccgggggtac tgttgtctat gtcgcctggt 2980921 cagaggcttt ctgtacaggt cagacagtat cccaccggcc cactagtgaa actggttcaa 2980981 tcccagtatc gcgcaccacg attgacctgc ggtttcatcc acaaaatctg ggctgcgtga 2981041 actaaatgtg aactgactcg gtgcaaccac cgaaaggttc ctctgttccg tgcccacgcc 2981101 gacaccgacg gtgaccccac cagatgcgcc tgccgcccgc tggctagcct ggcctgttgc 2981161 tgcaagcgcc tggtcgacgc ccgctatcac gctgttgtcg cgtccaccga actcaccgag 2981221 gcacgccgca cccgcgcaac cgagctgacg gagctgatca ccaccgcgct cgccttctgc 2981281 gaacggctgc aaacggtcgt tgagggtgac cggcgggctg aggtgacccg atgagcggcg 2981341 gctggctcgc cgagcacctc ggcctgtcca caaaccggct ccggcacgaa ctcgcagacc 2981401 ggctcgacgc gcactacggg ccacccgcac agaacaggga gctcgcgcgg ccgagcctgc 2981461 ggattatcaa cgagggcact gatggatgac ctgacgcggc tccggcgcga gcttctggac 2981521 cgattcgacg tgcgggactt cacagactgg cctccagcat cgctgcgagc cctcatcgcg 2981581 acctacgacc cctggatcga catgacggcc agcccgccac agcctgtatc gcccggaggg 2981641 cctcgactcc gactcgtgcg attaaccacc aacccatccg cgagagcagc ccctatcgga 2981701 aacggtgggg actcttctgt ttgcgctggt gagaaacagt gccgcccacc gtagcggcct 2981761 gcgcgtggca attgaccgac ctgacccgag tagccgccag tgggctgtaa gccattcttt 2981821 acggcagcct gttgtaaagg taacgtttac acgtggaggt gagggctagc gcccgcaagc 2981881 acggcatcaa cgacgacgcc atgctccacg cataccgcaa cgcgctgcgc tacgtcgaac 2981941 tggaatacca cggcgaagtt caactgctgg tgatcggccc cgaccaaacc gggcgccttt 2982001 tagagctggt catcccagca gacgaaccac cccggattat ccacgccaac gtactacgcc 2982061 cgaagttcta cgactacctg aggtgatgag ataagagtga agcacaagac cgacattgac 2982121 gagtggctcg acacgatcga gcccaacccg gccgacgccc acgatgccag ccacctgcgg 2982181 cgcatcatcg ccgcgaaaga agcggtccaa acagccgaat ctgagttgcg ggccgcagtg 2982241 aatgctgccc gcgccgccgg cgacacctgg gcagccatcg gcgtcgccct cggcatcacc 2982301 cgccaggccg cgttccaacg gttcgggcca cacagcacag cgagccccta aaccggcgcg 2982361 cctccgcggt ggagttgacg acgaccagac agggccgaag cggagtcaca gcgtctggcc 2982421 gacacacgtg gcgtcgtgtt tgctaggcat gggttttgtg tttgctgtcc cccacaaccc 2982481 cagacccgta caaatcccca gacccctaca cacagcgaca cggcgacccg ccgtctcctg 2982541 agtgtgtttg ctaaaatttc gtttgttctg gtcgatcact tattgtgttt gccggttttg 2982601 gcgatgggct tgattcctct gacagcaaca ccagttggcc ccttcctggc caggacgtga 2982661 tagaccacgc tggtgggtca tgcgcaccgg agcacccgat gatcgtcgtc cgtacggccg 2982721 aggcggccga gcaggccctg actgagggcc agctggtctg cccccgccgc ggatgtggcg 2982781 acaccttgcg gcggtggcga tatggacggc gccggcatgt gcgcagcctc ggctcgcagg 2982841 tgatcgatgt gcggccccag cgggtgcgtt gccgcagatg cgaaagcacc catgtgctcc 2982901 tgccagcggc gctacagcca cgcctagggc gcggcggcgg cggccagtta cgtccagggg 2982961 tgtggtgtac gggcaggtaa ggccggtggg cgtgtcgtag cccagtagtg ggcggtcatc 2983021 gcgtgatcct tcgaaacgac cagcaaaagt caatcgaagg aaatgacgca atgacctctt 2983081 ctcatcttat cgacaccgag cagcttctgg ctgaccaact cgcacaggcg agcccggatc 2983141 tgctgcgcgg gctgctctcg acgttcatcg ccgccttgat gggggctgaa gccgacgccc 2983201 tgtgcggggc gggctaccgc gaacgcagcg atgagcggtc caatcagcgc aacggctacc 2983261 gccaccgtga tttcgacacc cgtgccgcaa ccatcgacgt cgcgatcccc aagctgcgcc 2983321 agggcagcta tttcccggac tggctgctgc agcgccgcaa gcgagctgaa cgcgcactga 2983381 ccagcgtggt ggcgacctgc tacctgctgg gagtatccac tcgccggatg gagcgcctgg 2983441 tcgaaacact tggtgtgaca aagctttcca agtcgcaagt gtcgatcatg gccaaagagc 2983501 tcgacgaagc cgtagaggcg tttcggaccc gcccgctcga tgccggcccg tataccttcc 2983561 tcgccgccga cgccctggtg ctcaaggtgc gcgaggcagg ccgcgtcgtc ggggtgcaca 2983621 ccttgatcgc caccggcgtc aacgccgagg gctaccgaga gatcctgggc atccaggtca 2983681 cctccgccga ggacggggcc ggctggctgg cgttcttccg cgacctggtc gcccgcggcc 2983741 tgtccggggt cgcgctggtc accagcgacg cccacgccgg cctggtggcc gcgatcggcg 2983801 ccaccctgcc cgcagcggcc tggcagcgct gcagaaccca ctacgcagcc aatcacggtc 2983861 gacacaatgc ataacgtcaa cctactgttg acgtcatgcc ggagcccaca cccaccgcct 2983921 accccgtccg cctcgacgag ctcatcaacg ccatcaaacg ggtgcacagc gacgtgttgg 2983981 accaactcag cgacgccgtc ctggccgccg agcatctcgg cgaaatcgcc gatcacttaa 2984041 tcggccactt cgtcgatcag gcccgccgct cgggcgcctc ctggtccgat atcggcaaga 2984101 gcatgggcgt caccaaacag gccgcgcaaa agcggttcgt cccccgagcc gaagccacca 2984161 cactggattc aaaccagggc ttcaggcgtt tcacgccgcg ggcccgcaac gccgtggtcg 2984221 cggcccaaaa cgccgcgcac ggagccgcca gcagcgagat cacccccgat cacctgttgt 2984281 tgggagtgct cactgacccg gccgcactgg ccacggcgtt gcttcagcag caggagatcg 2984341 acatcgcaac cctgcgtacg gcggtcacgc tccccccggc agtcaccgag ccgcctcagc 2984401 cgatcccgtt cagcggcccg gcgcgcaagg tcctcgagct caccttccgc gaggcgcttc 2984461 ggctgggcca caactacatc gggaccgaac acctgctgct ggcactgcta gaactcgagg 2984521 acggggatgg gccgttgcat cgatccggcg tcgacaagag ccgcgccgag gccgacctga 2984581 tcaccacgct cgcatcgctc accggcgcca acgctgccgg cgcaaccgat gccggcgcaa 2984641 ccgatgccgg ctgaggcgag cgacccctcc ccttcgcggc gccgcgtgtg caatcatgcg 2984701 aaggtccccc accgggagcc gaggaggcac agatgcgcca ctggctgatc gtcctcgcta 2984761 cgctgctcgt cgccgccgcg ggcgttgcgg ccgccaacga cgtgccccgt gcgtgggccg 2984821 gcgacgcgcc gatcggccac atcggcgaca cgctgcgtgt ggacaccggc acctacgtcg 2984881 ccgacgtcac cgtcagcagc gtcgtaccgg tcgatccgcc gccgggattt ggctataccc 2984941 gcagcggcgt cccggtcaaa agcttccccg acagctcagt gacccgcgcc gacgtgacgg 2985001 tccgcgcggt ccgggtgccc aactccttca tcttggccac caatttcagc ttcaccggag 2985061 taacgccgtt tgccgacgcg tacaagccgc ggccgtgcga cgcatccgat tggctcgacg 2985121 ccgcgttggg caacgcgcca cagggctcga tcgttcgcgg cggggtgtac tgggacgcct 2985181 accgcgaccc ggtgtcggtt gtcgtgctgc tggacgagaa aaccggccag cacctcgcac 2985241 agtggaacct ttgacctgcg cctcgagatc gccacggccg acgtgaccga cgccgacgag 2985301 ttggccgccg tcgccgcacg caccttcccg ctggcgtgcc caccagcggt cgccccggag 2985361 cacatcgcgt cgttcgtcga cgccaacctg tcgtcggccc ggttcgccga gtatctgacc 2985421 gatccgcggc gcgccatcct caccgcccgc catgacggcc gaattgtcgg ttacgccatg 2985481 ctcattcgcg gtgacgaccg ggacgtggag ctgtccaagc tgtacctgct gccgggttat 2985541 catggcaccg gagccgctgc ggcattgatg cacaaggtgc tggctaccgc cgccgactgg 2985601 ggcgcgctcc gggtgtggct gggtgtcaac cagaaaaacc aacgcgcaca acgcttctac 2985661 gcgaagactg gtttcaagat caacggcacc aggacgtttc gactgggagc ccaccacgag 2985721 aatgactacg tcatggttcg cgagcttgta tgacccccgc cgtcagggcc agcaggcgag 2985781 atgtggcccg caggtacttc tttcggtatc caccggccag catttcctcg ctgaagatgg 2985841 tgtccagctt agcgccggac gccaccaccg gaatgccggc gtcatagagc cgatcaacga 2985901 gcgccaccaa ccgcagcgca acgttctggt cgtcgatgcc gtgcacgccg gtcagaaaca 2985961 ccgcggtcac accttcgatc agggtcagat atcgcgacgg atgcatggtg gccaggtgcg 2986021 cgcacagcgc gtcgaagtcg tcaagggtcg ccccctcaac acgtgcggca cgcgcggcca 2986081 cctcctcgtc ggacagcggc gccggtgccg gcggcagatc acggtgtcgg tagtccggac 2986141 cctcgatcct caccgtggtg aaaatgcttg ccagggtgtt gatctcgcgt agaaagtcct 2986201 gggcggcgaa gcggccctcg ccgagctgtt cgggcagtgt gttggaggtg gcggccaccg 2986261 aaaccccccg ctcgaccaga gccgaaagca gccgggagat cagcgtggtg ttgcccggat 2986321 cgtccagctc gaactcgtcg atacataaag cggtgtaatt ggccaacaga tcgatacagt 2986381 cggcgaagcc gaacacaccg gccagctggg tcagctcacc gaacgtcgcg aatgcctttg 2986441 gacatgtcgg cgcgtccggg ccggttccag gcagctggta gtaggcagag gccagcaggt 2986501 gcgtcttgcc taccccgaac ccaccgtcca ggtacagccc cacaccgggc aacacgtcgc 2986561 gcttgccgaa ccatttcttg cggcctgcac gccgctcgac ggcctgccgg caaaagtcct 2986621 ggcacgccac gacggcggcc gcctgggtgg gttcaaccgg gtcaggtcga tacgtcgcga 2986681 agctcacctc ggcgaacgtc ggaggcggcc gcagttgggc gatcagccgc accggagaca 2986741 cggtcggatg cctgtccacc aggtggtcca ccgaaccgca agcttcggag gcagacccgt 2986801 gcatggtggc actgtagcga cgtgctgcaa tcaaggtcat gcccgactct ggtcagctcg 2986861 gagccgctga caccccgcta aggctgctca gctcggtgca ttacctcacc gacggcgaac 2986921 tcccccagct ttacgactat ccggatgacg gcacctggtt gcgggcgaac ttcatcagca 2986981 gcttggacgg cggcgctacc gtcgatggca ccagcggggc gatggccggg cccggcgacc 2987041 gattcgtctt caacctgttg cgtgaacttg ccgacgtcat cgtggtcggc gtgggcaccg 2987101 tgcgcattga gggctactcc ggcgtccgga tgggtgtcgt ccagcgccag caccggcagg 2987161 cccgaggcca aagcgaagtt ccgcaactgg caatcgtcac caggtccggt cgccttgacc 2987221 gtgacatggc ggtattcacc cggaccgaga tggcaccgtt ggtgctcacc accacggcgg 2987281 tcgccgatga cacgcgccag cggctcgcgg gcctcgccga ggtgatcgcg tgctccggcg 2987341 acgatccggg cacggtcgat gaggcagtgc tcgtgtccca gctcgcggct cgcggtctgc 2987401 gccggatcct taccgaaggc gggccgacgt tgctcgggac attcgtcgag cgtgacgtgc 2987461 tcgacgagct gtgtctgacg atcgccccct acgtcgtcgg cggcctggcg cgccgcatag 2987521 tgacgggacc cgggcaggtg ctgacccgga tgcgctgtgc ccatgtcctc accgacgact 2987581 ccggctacct gtacacccgc tacgtcaaga cctgaaacag ctggacgtga atgcccgcct 2987641 cctcaccgac ccactacgcg gcccgcatcg tcgccgggtg aatggctact gtggtcggca 2987701 tgagtcggcc catgacgtca accgcgatgt tggtcgcgct gacctgctcg gcgacagtgc 2987761 tggccgcatg cgtcccggcg ttcggcgccg acccgcggtt cgcgacctac tcgggcgcag 2987821 gaccgcaagg cgcagccacc acgacaccac cgccggctgg cccaccaccg ctcgccgcac 2987881 ccaagaacga cttgtcgtgg cacgactgca cgtcacgggt gtactcgaat gctgggatcc 2987941 cagcagcgcc cggcgtcaag ctggaatgcg caagctatga caccgacctc gacccgctcg 2988001 tcggcgggtc cacagcggta agcatcggcg tagtgcgcgc gcgctccaac cagaccccga 2988061 gcgacgcagg acccctggtg ttcaccaccg gctccgacct accctcgtcg acgcagttgc 2988121 cggtctggct ggcacacgcg ggcatcgatg tgctccgcag ccaccccatt gtcgccgtcg 2988181 accgccgcgg catgggcatg tcgagcccaa tcgactgccg cgatcacttt gaccgcgacg 2988241 agatgcgtga tcaggcgcaa ttccaggctg gcgacgatcc ggtggccaac ctttccgaca 2988301 tctccaacac cgccaccacc gactgcaccg acgccatcgc gccaggcgag tccgcctacg 2988361 acaacaccca cgccgcctcg gatatcgagc gcttacgcaa actctgggac gtccctgccc 2988421 tcgccttcgt cggcattggc aacggcaccc aagtggcgct ggcctacgca gcatcgcgtc 2988481 ccgacaacgt cgccagactg atcctcgact ccccaatcgc gttgggggtc tctgccgaag 2988541 ccgccgccga gcaacaggtc cagggccaac aggcggcgct ggacgcattc gctgcgcaat 2988601 gtgtcgcggt gaactgcgcg ctgggctccc atccgaaagg cgcggtcagc gcgctgctgt 2988661 cggccgcccg gtccggtgat gggcccggcg gcgcgtcggt ggcggctgtc gccaacgccg 2988721 tcgccaccgc gttgggcttc cccgacagtg gccgggtcga tagcaccacg aaattggccg 2988781 acgcgctggc cgcggcccgc tccggggaca tgaacttgct gtccgccctg atcaaccgcg 2988841 ccgataccac ccgggatacg gacggtcagt tcatcagctc gtgcagcgat gcggtcaacc 2988901 gcccgacacc ggaccgggtg cgcgagctgg tggtggcttg ggggaagctc tacccgcagt 2988961 tcggcgccgt cgcggcgctc aacctggtga aatgcgtgca ctggcccagc agttcgccgc 2989021 cgcagccacc gaaagacctc aaggtcgacg tgctgttgct cggtgtgcaa aacgacccga 2989081 tcgtgggcaa cgaaggggtc gccgcgaccg ccgccacggc catcaacgcc aacgccgcca 2989141 gcaagcgggt gatgtggcaa ggtattggcc acggcgccag catctactcg tcctgcgcgg 2989201 tgccgccact cgtcgcctac ctggacactg gcaagctgcc tgacaccgac acctattgcc 2989261 ccgcctgata ttcggggcgg gcgggacgcg gtgtacggtg cgctggtgac ggcagctgac 2989321 tccatccgaa ccggcctagg cgcatccttg ttggccggat tccgtccgcg caccggcgcc 2989381 ccgagcaccg cgacgatcct gcggtcggcg ctctggccgg ccgccgtcct gtcggtgctg 2989441 caccgcagca tcgtattgac gaccaacggc aacatcaccg acgatttcaa gccggtctac 2989501 cgcgcggtgc tgaacttccg gcgcggatgg gacatctata acgagcactt cgactacgtc 2989561 gacccgcact acctgtatcc ccccggtggc accctgctga tggcgccgtt cggctacctg 2989621 cccttcgccc cgtcgcgcta tctgtttatc tcgatcaaca ccgcggccat cctggtcgcc 2989681 gcctacctgc tgctgcggat gttcaacttc acgctgacct cggtggccgc acccgccctg 2989741 attctggcca tgtttgctac cgagaccgtg accaacacgc tggtgttcac caacatcaac 2989801 ggctgcatcc tgctgttgga ggtgctcttt ctgagatggc tgttggacgg ccgagccagt 2989861 cgtcagtggt gcggcggcct ggcgatcggg ctgaccctgg ttctcaaacc cctgctcggt 2989921 ccgctgttgt tgctgccgct gctgaaccgc cagtggcggg ctctggtggc cgccgtcgtc 2989981 gttcccgtcg tcgtcaacgt ggccgcgctg ccgctggtca gtgacccgat gagcttcttc 2990041 acccgcacgc tgccctacat cttgggcacc cgggactact tcaacagctc gatcttgggc 2990101 aacggcgtct acttcgggct gcccacctgg ctgatcctgt tcctgcggat cctgttcacc 2990161 gcgatcacct tcggcgcatt gtggctgttg taccgctact accgcaccgg tgacccgctg 2990221 ttttggttca ccacctcgtc gggtgtgctg ctgctgtggt cgtggctggt gatgtcgctg 2990281 gcccagggct actactcgat gatgctgttc ccgttcctga tgaccgttgt gctgcccaac 2990341 tcggtgatcc gcaactggcc ggcgtggctg ggagtctacg gcttcatgac gttggatcgc 2990401 tggctgctgt tcaactggat gagatggggc cgcgcgctgg aatacctcaa gatcacctac 2990461 ggttggtcgt tgctgttgat cgtgacgttt accgtgctct atttccgcta tctggacgcc 2990521 aaggcggaca accggctgga cggcggtatc gatccagcct ggctgacgcc cgagcgggag 2990581 ggccagcggt gatcgcaagc gcggcgagcc gggcgcagcg ggtcaccgcc atcgggacta 2990641 gcggtgatcg caagcgcggc gagccgggcg cagcgggtca ccgccatcgg gactagcgtg 2990701 gacccatgac gcgcccaaag ctagaactgt ccgacgacga gtggcgtcag aagctcaccc 2990761 cgcaggaatt ccatgtgcta cgtcgcgccg ggaccgagcg gcccttcacc ggtgaataca 2990821 ccgacaccac aacagcgggc atctaccagt gccgggcctg tggcgccgaa ttgttccgca 2990881 gcaccgagaa attcgagtcg cattgcggct ggccgtcgtt cttcgacccg aaaagctccg 2990941 atgcggtgac cctgcgccct gaccactcgt tggggatgac gcgtaccgag gtgctgtgcg 2991001 cgaactgcga cagccacctg ggccacgtgt tcgccggcga ggggtatccc acgccaaccg 2991061 acaagcgcta ttgcatcaac tccatttcgc tgcgcctggt ccccggtagc gtgtagcgcc 2991121 gagattgacg ttttgcagac gccctctcgc actttcactg caaaacgtca gtctcggtga 2991181 aagtcagtcc acccgggtgg cgtgcacttc ccagaacggg gcatgtacgc ggccgcccac 2991241 cagccacggc ttaatcgccc ggaaccgctc cagcacgcac cgcacttgat cggccatatc 2991301 gggattgcgc gcggccatca gctctagggc ctcgacgctc aagttgacct ggtaggtggt 2991361 cgtccccagg taggtgatct cccaaccgcc gaccggaagc acctggcgaa aatcgtcctc 2991421 ggacaacgac cgcggcatgc tgaacccgtt gacgttgtgc tcgccgaatt cgaacatgta 2991481 cagccgtgca cccggcttgc tggcccggcg cagcgcccgc acatagcacc tttgcagctc 2991541 gggcgcggtg ctgaaggtgt ggtagaaggc gcaatcgacg acggtgtcga accggccgtc 2991601 cagcccgtcg agcgtggtgg cgtcgccgac ctggaagttc accgacaccc ccgccttacg 2991661 cgcgttgtcc cgagcccgct cgatggccgc gaccgacccg tcgatcccgg tggccgcata 2991721 tcccttggcg gcgtagtaga tcgcgtggtg cccgggcccg gtgcccgggt cgagcacctc 2991781 acctcggatc gcgcccaacg caaccagctg ttgaaccacc ggctggggac ccccgatgtc 2991841 ccatggcgtg gcggccggca acccgtgggc gacccgatca tcgcgataca tctcctcgaa 2991901 ccgggtggga tcggcaggat cgaactgggc cgtcatggca gcgagtgcac caactgctcc 2991961 accggcactc gcggaccggt gaaaaacggt gtctccgcgc gggtatggcg gcgcgcgtcg 2992021 gtggcgcgca gctcacgcat taggtcgacg atgcggtcca gctcgggcgc ctcgaacgcc 2992081 aggatccatt cgtagtcgcc cagcgcgaac gccggcaccg tgttggcccg gacgtccttg 2992141 tatccgcggg cggccatgcc gtgttcggcg agcatgcggc gacgttcctc gtcgggcagc 2992201 aagtaccact cgtaagaccg cacaaacgga tagacgcaga tgtaggcgcc gggctcctcg 2992261 ccggccagaa acgccgggat atgacttttg ttgaactccg ccggccggtg caggcccaca 2992321 ccgctccaca ccggcgtgca tgcccgcccc agcgtggtgg tgcgccggaa gtcggcgtag 2992381 gtggcctgca gggcctcgac acgttcggcg tgggtccaga ccatgaaatc ggcgtcggcc 2992441 cgcaggcccg cgacgtcgta gaggccgcgc accacaaccc cgcgctcttc ctgctgtttg 2992501 aaaaacgtgg acgcgtcgtc gatgatcgcg tcacgctggt caccgagcgc accgggactc 2992561 accgagaaca ctgagaacat caggtagcgc agcgtcgcat tcaacgcgtc atagtcaaga 2992621 cgggccatgg catctatcgt gccacctgcg catctaaggc ctcgatgacg ctggtgacgg 2992681 cccggcccgc cgcgccgacg caggccggca cgccgatccc gtcgaggtag ctgcccgcaa 2992741 cggccagcgt cggtggcagg ccggcgcgca gctcggcgac cacatcggca tggccgggac 2992801 cgtactgcgg catcgcctcg atccagcgcc ggacccgaac gtcgaccggg tcgacggcca 2992861 caccgaacac cgtgaccaag tcgtccgctg cccaggccag gagttggtcg tcggaggccg 2992921 tcagggccgg ttcgtcgccg aaccgaccga acgacagccg caacagcgcg acgtcgccgc 2992981 gctgacccca tttgcgcgac gacaatgtga tcgccttggc atgcggtgac tcgtcgccgg 2993041 ccaccagcac gccggaacag tgcggaaacg cggtgccgcc gggcaccgcc agcgccacca 2993101 ccgccgacga cgcgctcacg atctgccggg cggcggcatg tgtgcgcggc gcgatgccat 2993161 cgacgaggcg cgccaaccgc ggcgccggaa ccgccaggat gacggcgtcg gcctgccagc 2993221 ggccgccggt ttcgtcgcgc agcacccagc cgcgttcgag ctggaccacc ctggcccgca 2993281 cccagtgcac ccggctgcgc cggacgagcc cgtcgagcag cacctgatac ccgccgtcca 2993341 gcgcgccgaa caccggcccg ccgcttcccg gcggcagcgc ctgccggacc gcgtcggtca 2993401 cactggtcgc cccgcgatcc agggccgcgg ccacgctcgg ggcggccgcg cgcagcccga 2993461 tcgtcgccgc cgagcccgcg tataccccgc ttaacagcgg gtccaccgac cgggccacga 2993521 cttggtcgcc gaaccggtca gccaccaagt cggccaccgc gggatcgctg cccacctgcc 2993581 aggtgaacgg acgagcggct tcggcgtcga tccgcgccag ggttgcgtcg tcgaccagcc 2993641 ccgccatgga gcccgccgac gacgggatcc cgacgaccgt ctgcggcggc agcgggtgca 2993701 agcgctgctg gctgtagatg agcggccgcg cgccggtgct ggcgagttgg cggtccgaca 2993761 ggcccagctc ggccaaaagc gccggcatct cgggcctacg cagcacgaac gcctccgcgc 2993821 cgaggtccat tggctgtccg ccgatatgct cggtgcgcaa taccccgccg agcctatcgg 2993881 ccggttcgaa caaggtgatg gtcgcgtcat cgccgacagc ctgccgcagc cggtacgccg 2993941 aggtcaatcc cgaaatcccg cctcctacaa cacaatacga gcggggagtc atagcgagtg 2994001 tacgagcgag accaggtcgg ccagcaccgc gggatcgctt tctggcagca ccccgtggcc 2994061 gaggttgaag atatggccgg ccgcaccggc gtcgacggcg cggcgtccgt cgtcgacaac 2994121 ggcacgtgcg gcacgttcca ccgccggcca gcccgccagg accaccgccg gatcgaggtt 2994181 gccctgtaac gccgtgccgg gcaccacccg ggcggcggcg tcggtcagcg gggtccgcca 2994241 gtccacgccg acgacggccc ctcggcctgg ccgctccccg gctgtcacgg cctccgacat 2994301 cgcgcccagc aattcggcgg tcccaacccc gaagtgcgtc atcggcacgc catgctcgcc 2994361 cagcgcagcg aacacccggg cgctgtgcgg caacacgtac tggcggtagt cgatcggcga 2994421 gagcgccccg gcccaggagt cgaatacctg gatggcgtcc acccccgcgt cgatttggcc 2994481 gaccagaaac gcgatggtga ggtcggtcag cttggccatc agcgcgtgcc agctcgccgg 2994541 ctcggccaac atcatcgcct tgacgtgggc gtgatggcgg ctcggtccgc cctccacgag 2994601 gtaggaggcc agcgtgaacg gcgcgccggc gaaaccgatc agcggcacgt cgccaagctc 2994661 agcgaccaac aacgaagccg ccaccaatac cggttgaatc gcttgtggat caagtggttt 2994721 catggcggcg acatcggcgg cggtgcgcac cgggtccgcg atcaccggcc caacgtcggc 2994781 gacgatgtcc aaatccacgc cggccgcccg tagcggcacc acgatgtcgg agaacaggat 2994841 ggccgcgtcg acgtcgtagc ggcgtatcgg ctgcagggta atctcacagg ccacgtccgg 2994901 ttcgaaacag gccgccagca tgctgtaccg ctcgcgcagc gcccggtatt cgggcaacga 2994961 gcgcccggcc tgccgcatga accacaccgg cacccggctg ggcttgcggc cggtgacggc 2995021 ggccagatac ggcgactgcg gaaggtcgcg acgggtactc atcgaactca atgctgccac 2995081 gaccgccacc ccgcacctgc gtaacatcga cccaatgcca gttacctacg acgacttccc 2995141 cagcctgcgc tgcgaaatcc acgaccaacc tggtcacgaa ggcgtgctgg agctggtgct 2995201 ggactccccc gggctgaact cggtcgggcc gcacatgcac cgcgaccttg ccgacatctg 2995261 gccggtgatc gatcgcgacc cggccgtgcg cgtggtcttg gtccgcggtg aaggcaaggc 2995321 cttttcctcc ggcggcagtt tcgacctgat cgccgaaacc atcggcgact accagggccg 2995381 gctgcgcatc atgcgcgagg cccgcgacct ggtgctcaac ctggtcaact tcgacaagcc 2995441 ggtggtgtcg gcgattcggg gcccggccgt cggtgcgggt ctggttgtcg cgctgctcgc 2995501 cgacatttcg gtggcgggcc gcgccgcgaa gatcatcgat gggcacacca aactcggggt 2995561 cgccgcgggg gatcacgcgg cgatctgctg gcccctgctg gtcggcatgg ccaaggccaa 2995621 gtactacctg ctgacctgcg agccgctgtc cggggaggag gccgaacgca tcggtctggt 2995681 ctccatctgc gtcgacgacg acgatgtgct ccccaccgca acacgcctgg cggagcggct 2995741 cgccgctggc gcgcaaaacg ccatccgctg gaccaaacgc agcctcaatc actggtatcg 2995801 catgttcggt cccgccttcg aaacgtcgct cgggctggag ttcatcgggt tcggtggtcc 2995861 cgacgtccgg gaaggcctgg ccgcgcaccg cgaaaagcgc cccgcgcggt tcggcgccga 2995921 ccccgatccc ggcgccggca gctgagcaca gttcggcgcg cctgtgcaca cgtgtcggcg 2995981 gataggtcta ccgtcgaaat ctgtgacctc cgccggcgac gatgcagagc gcagcgatga 2996041 ggaggagcgg cgcttgacct ccgccggcga cgatgcagag cgcagcgatg aggaggagcg 2996101 gcgcttgacc tccgccggcg acgatgcaga gcgcagcgat gaggaggagc ggcgcttgac 2996161 ctccgccgag ccggccctat tccgcgaggc agttgcggcg atgaacgctg tcaccgtgcg 2996221 gccggaaatc gaactcggcc ctatccgacc gccgcagcgg ctagctccgt acagctatgc 2996281 gctgggagcc gagatcaagc atcccgaact cgacgtcatt ccggagcgtt ccgagggcga 2996341 cgccttcggc cggctgatca tgctgtatga cccggacggc tccgatgcat gggacggcac 2996401 tattcgcctg gtcgcctatg tccaggccga cctggactcg agtgaagccg tcgaccccct 2996461 gctgcccgag gtggcatgga gttggctggt ggacgcgctg acagcgcgca ccgaccaggt 2996521 gagggccctg ggcggcactg tcaccgccac cacatcggtg cgatacggcg acatctccgg 2996581 gccgccgcgc gctcaccagc tggagctacg ggcgtcatgg acggcgacca cccccgatct 2996641 gggcgcccat gtccaggcgt tctgcgacgt cctggagcac gcggccggcc tgccgccagc 2996701 cggggtcacc gacctgggct cgcggtcacg cgcctgacat gtgccccgag ccgtctcacg 2996761 cgggagctgc tgagtccgaa ggcacggaat cggaacccac ccccttgctc cggcccgccg 2996821 gtgggatacc ggatctgtgt gtgaccgtcg gtgaaatcgc cgctgccgca gaactactgg 2996881 accgcgggcg cggaccgttc gcggtagacg ccgagcgggc gtcgggtttc cgctactccg 2996941 gccgcgccta cctgattcag atccggcggg ccgaggccgg caccgtactg atcgacccgg 2997001 tcagccacgg cggtgacccg ttgaccgtgc tggcgccggt cgccgaggtg ctcagcacca 2997061 acgagtggat cctgcactcc gccgatcagg atctgccctg tctcgccgag gtcggtatgc 2997121 gaccgccagc gctatacgac accgagcttg ccgggcgcct ggccgggttc gatcgagtga 2997181 acctggcggc catggtcgag cggttacttg gactgggatt gaccaagggc cacggcgcgg 2997241 ccgactggtc caagcgcccg ctaccctcgg cctggctgaa ctacgcggcg ttggacgtgg 2997301 aactgctcat cgaactacgc gcggcgatct cgcgggtgct ggccgagcaa ggcaaaaccg 2997361 attgggctgc gcaggaattc gagcacctgc ggtcgttcga atcaaggcca cccccagcgg 2997421 ccgcccggca ggaccgctgg cgacgaacct cgggtatcca caaagtgcat gaccggcggg 2997481 ggctggccgc ggtccgcgaa ttgtggacag cgcgtgaccg aatcgcccag cgccgcgaca 2997541 tcgcgccccg ccggatcttg ccggactcgg ccattatcga tgccgccatc gccgacccaa 2997601 agtcagtcga cgaccttgtc gcgttaccgg tgttcggcgg acgcaaccaa cgtcgcagcg 2997661 cggctgtgtg gtgggcggca ctggcagccg cacgcgaaag cccagatccg ccggagatcg 2997721 ccgaaccggc aaacgggccg ccgccgccgg ggcggtgggt cagacggaaa ccggcagccg 2997781 ccgcacggct ggatgcggcg cgcgcggcgc tgacggaggt gtcgcaacgg gtgcgggtac 2997841 cgaccgagaa cctggtctca cctgatctgg tgcgacggct gtgttgggaa tgggaggaca 2997901 tctcgcagag ttctccagac ccgattgccg ctgtcgaggc gtacctgcgc accggccagg 2997961 cacgggcctg gcagctcgaa ctagtggtcc ccatcctgac cgcggcgttg acaggggctc 2998021 cggacgccgg cgcccagggc gatgatggct cttagtcgag atgttctgga atcgcgtcgg 2998081 acgcacacac cccggtaccc agcgcggcga cccagccggt gatccgccgg gccacgtcct 2998141 ggtcggtaag ccccagatcg gccagcacct cgcttcgaga cgcgtgctcg tagaactcct 2998201 gcggcaaccc gacatcgcgg cagggcacgt cgatctccgc gcgccgcagc gcggccgaca 2998261 ccgctgaccc cgccccaccg ttgaccccgt tgtcctctag cgtgacgagc agcttgtgct 2998321 gcaccgccag ttcgcgcaca ccgtcagaca ccggcaacac ccagcgcggg tcgatcaccg 2998381 tcacaccgat cccctggttg tgcagccgct tggccaccgc caacgccatc ggtgcgaacg 2998441 cgccgatggc caccaacagg acgtcgtggt tcaaaccatc ggcgggcgcc gccagcacat 2998501 ccacgcctcc acgccgctcc aaagccgaaa tatcttctcc cacatcacct ttggggaacc 2998561 gtaacgccgt cgggccgtcg tcgacgtcga gcgcctcgcc gagttcttca cgcaaccggg 2998621 tggcgtctct gggcgctgcc acccggatgc cgggcacgat acccagcatc gacaagtccc 2998681 acattccgtt gtggctggcg ccgtcgctac cggtgatccc ggcacggtcc agcaccatgg 2998741 tgaccggcag cttgtgcagc gccacatcca tcatgatctg gtcgaacgcc cggttcagga 2998801 acgtcgagta gatcgccacc acggggtgca gcccacccat cgccaacccg gccgccgacg 2998861 tcatcgcgtg ttgctcggcg atcccgacgt cgaacaatcg atccgggaag cgctgcccga 2998921 acgcggtcag cccggtgggg cccggcatgg ccgcggtaat ggccacgatg tcacggcgtt 2998981 tctgggcgta gccgataagt gcatcagaga aggtcgccgt ccagcctggg ccggccacct 2999041 tggtggcttg tccggtggcc ggatcgatcg ggaccgtgga atgcatctgc tcggcctggt 2999101 cggcctcggc cggcgggtag cccatgccct tgcgggtgac gacgtgcacg atcaccggtg 2999161 caccgaagcg ccgcgcgctg cgcagcgcga cctccaccgc ccgctcgtca tggccgtcga 2999221 ccgggccgac gtacttcaac ccgaggtcgg tgaacagcaa ctgcggcgac agcgagtcct 2999281 tgatgccggc cttgacgctg tgcaggaatc gaaaccacag accgccgaca agcggcaccg 2999341 cgcgcaccag gtcgcggccc gtctccagcg cctgctcgta ggccggctgc agccgcagcg 2999401 tggccagatg gtcggcgacg cccccgattg tgggcgcgta gctgcgccca ttgtcgttga 2999461 ccacgataat caccggccgg cgggatgcgg cgatattgtt cagcgcctcc cagcacatac 2999521 cgccggtgag cgcaccgtca ccgaccaccg cgaccacatg ccggttgcgg tgtccggtca 2999581 actcgaacgc cttggccaac ccgtccgcgt acgacagcgc cgcgctggcg tggctcgact 2999641 ccacccagtc gtgctcgctc tcggcacgag acggataccc cgacaacccg cccttcttac 2999701 gcagggttgc gaagtcctgg ctgcgtccgg tcaacatctt gtggacgtag gcctggtgac 2999761 cggtgtcgaa gatgatcgga tcgtgcggcg agtcgaatac ccggtgcagc gccaaggtga 2999821 gttccaccac tcccaggttc ggccccagat gcccccccgt ggcggcaacc ttgtggatca 2999881 ggaactcacg gatctcggcg gccagctccc gaagctgcgc ctgggaaagg tgctgcagat 2999941 cagcgggccc gcggatctgt tgcagcattt cgctagtgta cgcagcaacc cccccattgg 3000001 cccagcatgc ggccgccgat caaaagggcc gaaccacttt gatagcgtcg gtggccggcg 3000061 cgccgggaag cctggtcggc gactcattgt catccaactc cggagttcga tatgaaggta 3000121 aacatcgacc caaccgcgcc cacctttgcg acgtatcgtc gggatatgcg tgccgagcaa 3000181 atggcggagg actatcccgt cgtaagcatc gattccgacg cgctggatgc tgcccgcatg 3000241 ctcgcagagc atcgtctgcc tggactattg gtcaccgccg gagcgggcaa acagtatgcg 3000301 gtactccctg cctcacaggt cgtgcgcttc atcgtgcccc gctatgtgca agacgatccc 3000361 ttactggccg gtgtgctcaa cgaatcgacg gccgaccggt gcgccgagag attgagcggc 3000421 aaaaaggtcc gcgacgtgtt gcctgaccac ctggtcgagg ttcccccggc taacgccgac 3000481 gacaccatca tcgaggtggc cgcggtgatg gcacggctgc gcagcccatt gctcgcggtg 3000541 gtcaaagacg gctcgctgct cggggtggtc accgcatcgc gcctgcttgc tgcggcactg 3000601 aagacttgac ctcgtgagcg tcgtcgcggt caccatcttc gtggcggcct acgttctgat 3000661 tgccagcgat cgcgtcaaca agacgatggt ggcgctgacc ggcgcggcgg ccgtggtcgt 3000721 cctaccagtg atcacatccc acgacatctt ctattcccac gacaccggaa tcgactggga 3000781 cgtcattttc ttgttggtgg gcatgatgat catcgtcgga gtgctgcggc agacgggggt 3000841 gttcgaatac accgcgatct gggccgccaa gcgcgcccgc ggctcgccgc tacgcatcat 3000901 gatcctgctg gtattggtga gcgcgttggc gtcagccttg ctggataacg tcaccacggt 3000961 gttgttgatc gcgccggtca cgctattggt gtgcgaccgg ttaaacatca acacgacgtc 3001021 gttcctgatg gccgaagtct tcgcctccaa cattggtggc gccgcgacgt tggtgggtga 3001081 cccgccgaac atcatcgtgg ccagccgggc gggattgacg ttcaacgact tcatgctgca 3001141 cttgacaccg ctggtagtca ttgtgctgat cgccctcatc gctgtgctgc cccgcctgtt 3001201 cggctcgatc acggtcgaag ccgatcgaat tgccgatgtc atggcgctcg acgagggtga 3001261 agccatccgc gaccgcggac tgctggtcaa atgtggcgcc gtgctggtgc tggtgttcgc 3001321 ggccttcgtc gcccatccgg tgctgcacat ccagccttct ctagtggcgc tgctgggcgc 3001381 tgggatgctg atcgtggtct cgggtctgac gcgatccgag tatctatcca gcgtcgagtg 3001441 ggacacgctg ctgtttttcg ccgggctgtt cattatggtc ggagcgctgg tcaagaccgg 3001501 tgtcgtcaac gatctcgcgc gggcagcgac ccagctgacc ggcggcaata ttgtggccac 3001561 cgcgttccta atcctcggcg tctccgcccc gatctcggga attatcgaca acattcccta 3001621 cgtcgccacg atgacgcccc tcgtcgcgga gctggtcgcg gtcatggggg gtcaacccag 3001681 caccgacacc ccctggtggg cgctggccct gggtgccgac ttcggcggca acctgaccgc 3001741 aatcggcgcc agcgcgaacg tcgtcatgct cggaatcgcc cggcgcgcag gagctcccat 3001801 ctcgttctgg gagttcaccc gcaaaggggc ggtggtcacg gccgtctcga tcgcgctcgc 3001861 ggcgatctac ctgtggttgc ggtacttcgt gttgttgcac tgaccatctg tattgccgac 3001921 agacctgtag caccagacga cgccgcgatg agcggcctac gagaagattc ggaggatggc 3001981 cgatgagcat catcgccatc acggtgttcg tagccggcta tgcacttatc gcaagcgacc 3002041 gagtcagcaa gacccgggtg gcactgacgt gcgcggcgat catggtcggc gccgggatcg 3002101 tcggatcgga cgacgtgttc tactcgcacg aagccggaat cgattgggac gtcatctttc 3002161 tgctcttggg catgatgatc atcgtcagcg tgcttcggca caccggcgtc ttcgaatacg 3002221 tcgcgatttg ggccgtcaaa cgcgcaaacg ccgcgccgtt gcgcatcatg atcctgctgg 3002281 tgctggtgac cgcgctgggg tcggccctgc tggacaacgt caccacggtg ttgttgatcg 3002341 cgccggtgac gctactggta tgtgatcgac tgggggtcaa ttccacgccg tttttggtgg 3002401 ccgaagtctt cgcgtccaat gtcggcggcg cggccacgct ggtcggcgac ccgccgaaca 3002461 tcatcatcgc cagccgggcg ggactgacgt tcaacgactt cctgatccac atggccccgg 3002521 ccgtgctcgt cgtcatgatc gccctgatcg gtctgctgcc ctggctgctg ggctccgtca 3002581 ctgccgagcc cgaccgagtt gccgacgtgc tgtcgctcaa cgagcgcgaa gccatccacg 3002641 atcgcgggct gctcatcaag tgcggtgtcg tcttggtgct ggtgtttgcg gccttcatcg 3002701 ctcatccggt gctgcacatc cagccgtctc tggtggcgct gctgggcgcc ggtgtgctcg 3002761 tacggttctc ggggctggag cgatccgact acctgtccag cgtcgagtgg gacaccctgc 3002821 tgttcttcgc cgggctgttc gtcatggtgg gggccctggt gaagaccggt gtcgtcgagc 3002881 aactggcgcg ggcagcaacc gagctgaccg gcggcaacga gttactcaca gtcggtttga 3002941 ttctcggcat ctcggcaccg gtgtccggca tcatcgacaa catcccctac gtcgccacga 3003001 tgacgcccat cgtgaccgaa ctggtcgccg cgatgccggg ccacgtccac cccgacacgt 3003061 tctggtgggc actggcgcta agcgccgact tcggcggcaa cctgaccgcc gtggcagcca 3003121 gcgccaatgt cgtcatgctc ggaatcgccc ggcgctcggg cactcccatc tcgttctgga 3003181 agttcacccg caagggcgcg gtggtgaccg cggtctcgct cgtgttgtcg gcggtctacc 3003241 tgtggctgcg gtacttcgtg ttcggctaag cgccaacgct cacgcgtgct tagcgcgaaa 3003301 gcgccgaaac agcacccaga cgatggccag gttgtagacg gcacccccga cgagatacgg 3003361 ccaccaggtg ccgtgatcgc tggcgaccca aaatgccttg gcagcccagt acggtggtaa 3003421 gacgccgaac gcgaggttcc agttggaact gatgaaccac ggcaggcagg gcagcccggc 3003481 gatgagcatg cccagcgcac ggaccatcgc caggccctga atcttgttgt tcgccaccgc 3003541 aagaatcagc agcagcgtga ccaccgccga caggccggcc accagtccga tgggaatcag 3003601 tgaagacacc aggcccggtt cgaggatccc gctgcacgac atcgtcgcga cgacgtagat 3003661 ggtggtcacc accatcacgg tggccgcacg atagccgaaa aagaccgaca gcggcaccgg 3003721 ggttactcgc agcgccgtca tcgtgcccgc gtctacgtcg tccagcacca agaacgcggc 3003781 cagcgcaccg gcgacgatga tgctggtcaa caacaggaac gcggtgagga tcagtgggta 3003841 gtatccgacc aggtcgaatc cataacgccg cgccagcatc tcggtgaaca gcggcgtgag 3003901 cagcgcgact ccggtggtcc agatgaccgg tgcgatgacg agcatgacca gcagcggatc 3003961 gcggtaggtg cctcgaatgt cgttgcggcc gaacgcggcc aacgcccgtg ggcccgcaag 3004021 gctcgatatc gctctcacag cacacccgat ctttgcacga cataacggcc gaatagcgcc 3004081 ttggccgccc ggcacaatcc cgccgcacac acgattgggt agaccaccgc atacccgacc 3004141 tgccagggcg ccaagctcac ctgatcgaac gccgcgccga gcaagagcag cggcccctgg 3004201 gtggggatga ggtaaagcac cgggttgggc cacaggccgg agtagtgcac caccggcggc 3004261 gccagcatga tcgcgagcgg gatgaccgcc gccaggaacc aatcggtcac cgaggcgaac 3004321 ggcaacgagg aactgaagcc gaccagcagc atcagcagtg tgcccagcac gatgccggcc 3004381 accagcggca gcaggtggta accaagcccg tgaacgatgg tggccacgac aaccgcaacg 3004441 aacagcgaga tcgccagcag cacagttagt ttggcagcca ggtactccca gaaccgcagc 3004501 ggcgtcgaga cgatcgcgcc gatcgtgcgc tcctgcttct cgaagaacac ggtcccgccg 3004561 acgaagaaga acccgatgat cgcgatatca cccaccagga catagggttc ggcgaccggg 3004621 cgcaggctga ccggcatcgg cagcagcact gccagccaaa tcagtccgga gaaaacggcg 3004681 gcatgcaaga acttctgccg cacctgtagc gtcagctcga gccgcagcgc aggcaccaac 3004741 cgggtcatgt cagctgcctg ccggtgacct cgacgaagac atcgtcgagg ctggcctcgc 3004801 ggctatgaat ggtctcgacg tggtggtttc gcagcacgga gtggaacgcc gggtcgtcgg 3004861 caaggccgtc catgccgaac tcggcggtct cgagtccccc gccgtcgccc cggtattcca 3004921 cccgcacccg ccgccggctg cgagcgatct tcagttcggt gggactgtcc agtgcgacga 3004981 tcctgccgtc gacgacgaac gccacccggt cgcacagctc gtcggcggtg gccatgtcgt 3005041 gcgtggtgag aaagatcgtg cggccgcgcg ccttcaggtc cacgatgatg tccttgatct 3005101 tgcgggcgtt caccgggtcc agcccggagg tgggctcgtc gaggaacagc agctccgggt 3005161 cgttgatcag cgacctggcg aagggcagcc gcatctgcat gcccttggag tacttgccca 3005221 ctagggtgtg ggcgtcatcg gccaggccga cggcggccag cagctgcatc gggtcggccg 3005281 tcgcgccggc gtacagcgag gcgaagaagc gcaggttctc atacccggtg agcttttggt 3005341 agtggttggg cagctcgaag gagaccccga tgcgctcgta gtaatcgggt ccccactcgg 3005401 ccggctcttt gtcccacacc gtggcctggc cgccgtggtc gcgcagcagc ccgatgagaa 3005461 gcttctgggt ggtggacttg cccgcgccgc tgggacctag aagcccgaag atttcgccgc 3005521 ggccgacggt gaactccatg ccacgcaccg ccggctcggc cgcctttggg tagcggaagg 3005581 tgagcccgcg cacgcggatc acctcggttc ccacacgcgc cgatgccaca gcacggttga 3005641 gcgccgtcat gattggctcc gttccctttc gggcgagcgc ggtgcgccgg ctcatccaag 3005701 taaccagaaa gtcaccgcgc caatgctgat acctggttcc gaccagtctt cccggagcgc 3005761 caacccaaga ctactagctg cgctgctgta tacggagcaa cccacgacga ccacgggcga 3005821 gctggtcgag cagctgcatg acctctacac ctttcgggtc aacagcgcaa cgcactcgac 3005881 gtagtgagtc agcgggaacg cgtcgaacac cttgatcttc tccacggcgt aaccgtgacc 3005941 acggtagagg ccgatatcgc gcgcgaaaga cgccgcttcg caaccgatat gtatcaaccg 3006001 tggcaccccc gcaccggcca gcaagtcgac aacctcgcgc ccagcgcctg atcgcggtgg 3006061 atccagcacc gccagatccg cgccggcggg ttgcactgcc aacacccgcc gcaccgaacc 3006121 ggtgacgacc tccacctggg gcaaatcgac cagcgcggca cgtgcggccc cggatgccag 3006181 gcgcgaagtg tcgacggtca acacccgtcc ggactccccg accgcctcac ccagcaccgc 3006241 agcgaaaacc cccgcaccac cgtagagatc ccaggcggtc atgccggggg cgggctgagc 3006301 ccagtcagcg atcagatcgc tgtagaccgc cgccgcgtcg cgatgcgcct gccaaaaggc 3006361 cgttaccggc acccgccagc tgcgccggtg cacacgctgg tgggcgtggt aggcgccctc 3006421 caccacgttg gtcacggttc gggtcctatt ccgagggccc tgccgcacgg aacagaccac 3006481 atggcgctcg ccgtcgtcgt ccagagccac gtaaagctgg gcttccggcg gccagtcagc 3006541 cgctaccagg ccgtctagca tgccgacagg caactgcccg cagtccaggt cggttaccag 3006601 ctcgccactg tggtagcggt gaaaacctgg acgacggtct gcgcccacgt cgagccggac 3006661 tcgaatacgc caacccgtgg ggccggcatc cgacagcggt tgcgcctcgc cctgccagct 3006721 gtgccgcccg agccgttcca gctggttagc cacaacttgc gccttaagtg tgcgggccgc 3006781 ctccggagca gcaaacgcca gatcgcaaca cccggcgccg tcggccccgg cgatcgaaca 3006841 cagcgacccg atccggtcgg gcgacgggtc gatcacctcg aaagcctctg cgtgccagta 3006901 agagccacgt tgcgcggtca cccgcgcccg cactcgttca ccgggcaacg catagcggac 3006961 gaaaaccacc cggccctcgt ggtgcgccac gcagctaccg ccgttcgcgg gcgctccggt 3007021 gaccaacgtc agattcactg catcgtcgcc ggcgcgggtc actggcgccg ctcctcccca 3007081 tcgctttgct ctgcatcgtc gccggcgcgg gtcactggcg ccgctcctcc ccatcgcttt 3007141 gctctgcatc gtcgccggcg cgggtcactg gcgccgctcc tccccatcgc tttgctctgc 3007201 atcgtcgccg gcgcgggtca atcgaagatg ccccgtcacg tgtcaccggg agccgcgtgc 3007261 ggctgtaacg tcttgatccg ctccgacgac gtcagttgcc aaggcaccga agtcaccatc 3007321 acgccgggca tgaacagcaa ccggcccttg agccgcagcg cactctggtt gtgcagcagc 3007381 tgttcccacc agcgccccac gacatactcc ggaatgaata ccgtcaccac ggtccgtggc 3007441 gattccttgc tgacccgctt gacgtaatcg agcaccggcc gggtgatctc acggtacggc 3007501 gaggcgatga ccttgagtgg cacgctcaca tcgctgtcct gccactggcg caccagctcg 3007561 cgggtttccg catcgtcgac gttgaccgtc acggcttcca acacgtcggg ccgggtcgct 3007621 cgtgcgtagg tcaacgcgcg caacgtcggc aggtgcagct tcgacaccag cacgacggcg 3007681 tgattgcggc tgggcaacgt tatctcggct tcctcggcct gttccgccaa ctcccggttg 3007741 acggcgtcat agtgcctgtg gatgagcttc atcatcatga agaaccctcc catggcgacg 3007801 atcgcgatcc atgctccggc aaggaatttc gttaccagca cgatgagcag gacggtaccg 3007861 gtggacacga agccgaccgt gttaaccgcg cgggagcgca gcatcgcgcg acgggcgcgc 3007921 ggatcggtct cggcgctcag caaccgggtc cagtgccgga ccatgccgac ctgactcatg 3007981 gtgaacgaga tgaacacacc gacgatgtac agctggatca gcgcggtcaa ctcggcacga 3008041 aacgcgacca ccgccccgat cgccgccgcc gccaggaaca ggattccgtt ggagaacgcc 3008101 agccggtccc cacgggtgtg caactggcgc ggcagatagc tgtgctgcgc cagcaccgag 3008161 cccagcaccg ggaagccgtt gaaggcggtg ttagcggcca acaccaggat cagcgctgtc 3008221 accgcggcga tcagcaagaa ccccaggtaa aagcccccga acacggcctg cgccagttgt 3008281 gcgaccagcg tcttttgctg ataacccggc ggggcgcccg tcagctgggt gtccggatcg 3008341 tcgacgacct ggaccccggt ctctacggcc agcacgatca tgcccataaa catgctcacc 3008401 gcaatgatgc ccagcatcag cagcgtggtt gccgcgttac gcgacttggg cttttgaaac 3008461 gccggcaccc cgttgctgat cgcctcgaca cccgtcagcg ccgcacaccc cgacgaaaac 3008521 gagcgcgcca ccaagaacac cagcgcgaaa ccgacgatct ggccgtgctc tgcgtgcatt 3008581 tcaaaagccg cggactcggc ccgaaccgga ttgcccagca cgaaaatccg gaacaacccc 3008641 cacacgagca tggtgccgat tccggcgatg aacgcatagg tcgggatcgc gaacgccaac 3008701 ccggattccc gaaccccacg caagttcatc gccatgatca gcacgatcgc gccgacggca 3008761 aacaacacct tgtgctcgta cacgaacggg ctcacagagc cgatgttgga cgccgccgac 3008821 gatatcgaaa cagcaacggt gagaacgtaa tccaccatca gggcgctggc aaccacgaga 3008881 ccgccggtag cacccaggtt ggtggtgaca acctcgtagt cgcccccacc ggaggggtaa 3008941 gcgtgcacgt tctgccggta actagacacc accacgagca gaaccgcggc gaccgccagg 3009001 ccgatcaacg gcgccatcga ataggccgcc aggccggcca ccgagagcac cagaaatatc 3009061 tcctcggggg cgtaggctat cgacgacatc gcatccgagg cgaacaccgg caaggcgatc 3009121 cgcttgggca acaaggtgtg actgagccgg tcactgcgaa acggccggcc gatcagcaac 3009181 cgacgcgccg cggttgaaag tttggacacg agagccaagg gtaggcctat ccgagcgtgg 3009241 cggtagcgtt ccctagacga gaatgttcgc cgacgtaaat cggctggcca ccgcgggttg 3009301 ccgatcgcgt acggcgcacc ggacacagcc gagaggacct ctaatgcggg tggttgtgat 3009361 ggggtgcggc cgggtcgggg cttcggtggc cgacggactg tcccggatag gccatgaagt 3009421 cgcgatcatc gaccgtgaca gcgccgcctt caatcggctc agcccgcagt ttgccggcga 3009481 gcgggtgttg ggtcagggct tcgaccgaga tgtgctgctg cgtgcgggca tccagggggc 3009541 cgacgcattc gccgcggtgt cctccggcga caactccaac atcatctcgg cgcggttggc 3009601 ccgggaaacc ttcggtgtgc cgcgcgtcgt cgcgcggatc tatgatgcca agcgcgccga 3009661 ggtctatgag cgactcggca tccccaccat taccaccgtt ccctggacca ccgatcggct 3009721 gctcaacgcg ctaatgcagg acaccgaaac cgccaagtgg cgcgatccta ccggtaccgt 3009781 cgcggtcgcc gaggtcgtct tacacgaaga ctgggtgggc caccgggcga ccgatcttga 3009841 gcaggccacc ggcgctcgga ttgcgtttct gatccgattc ggaaccggtg tattgccgga 3009901 accgaagacg gtcctacagg ccggcgataa ggtctatatc gctgcgatat ccggccgggc 3009961 cgcagaggca gcggccatcg cagccttgcc acccagtgag gacttcgagt cgggggctcg 3010021 acgatgaaag tagctgtcgc cggagcgggt gcggtgggcc gctcggtcac ccgcgaactc 3010081 gtggaaaacg gacacgacat caccctgatc gagcgcaacc ccgaccacct cgacgccgcc 3010141 gccatcccgg aggcgcattg gcggcttggc gatgcctgcg aactgagcct gctggagtcg 3010201 attcacctcg aagagttcga cgtggtcgtc gccgccaccg gggacgacaa ggtcaacgtg 3010261 gtgctcagcc tgctagccaa gaccgaattc gcggtgccgc gggtggtggc ccgggtcaac 3010321 gatccccgca acgagtggct gttcaacgac gcctgggggg tcgacgtcgc ggtgtccaca 3010381 ccccgcatgc tggcgtcgct gatcgaagag gccgtcacga tcggcgactt ggtgcggctg 3010441 atggagttcc gcacgggtca ggccaatctg gtagagatca ccctgcccga caacacgccg 3010501 tggggcggca aaccggtgcg caaacttcag ctgccgcggg atgccgcgct ggtgacaatc 3010561 ctgcgcgggc cacgagtcat cgtgccggag gccgacgagc cgctggaagg cggcgacgag 3010621 ttgctcttcg tcgcagtcac cgaagccgag gaggagctga gcaggctgct gctgccgtcc 3010681 atgtaaccgg cgggctctac tcgcggccgg cgtcggcgtc gaattcagct gcacctccga 3010741 cggccgcggc gtcgtgagag gccaagatgg cgcgctgggc tgccttgatt gccgcgtagg 3010801 tggccagcgc ggcgagggcg gtcagcggcc aacccatccc gatcctggcc actcccagcc 3010861 aacccgtctt atcggcgtcg tagaggtgcc tttggacgat gaaccgggca gcaaaaacca 3010921 gcgtccaacc cagggtggcg acgtcaaacg caaagacagc gcgggacacg tcgcgccagg 3010981 cgcgatcgcg cccgctgagc cagctccaca agtagccgac tatcggccgc cggatcagga 3011041 tcgacagtgt gaagaccacc gcccacagca acgacatcca gatgcccagc aggaagtacc 3011101 ccttggactg tcccaccagg tacgcgatca gcgcgcacac ggctaccccg cagaatccgg 3011161 caaccaccgg ccgcgcagat tcccggcgca aaagccgcca cagcaggatc aaccccgcca 3011221 tgctcagggc gaacccaatc gcgggcagca agccggcggc gctggaagca accacaaaag 3011281 tcaccaccgg taatgacgaa tagaccaggc cgctcactcc gccggcctgc gccaacaggc 3011341 gctgggcgct agtgcggtta gcgttcacga gacaccggca attccgactg ccggatagtc 3011401 accgctgaat ttcgtaatgc gggttgtaga tagccttcgt cccattttcc agcttgccca 3011461 gccggccgcg cacccgcagg gtacggcccg tgtcgatgcc gggtatccgg cgttgaccca 3011521 accacaccag cgtgacggtg tcgctgccgt cgaacaattc ggcgcgaaca ccacccgagc 3011581 aacccttgcc attggtttcc acgctacgca gggtgccaac caccgtgacc tcctggccgc 3011641 gctggcagtc gatcgcacgc tgtgcgccgg cattgagcac ctcgtcggat aactcttcga 3011701 cgtcgcgttg ctccaggtcc tccgtcaacc gacgggtgag cctgcgcaga taaccctggg 3011761 cccccatggc ctctcctgac acgtcaccta cgttatggaa gtttcgtgca actgccggcg 3011821 tattccacct atgccaacgg ccaccgtaga cctgttggtt cccgggcgcc accgttggcc 3011881 ttggagcacc ccaaagtggc gggcactatc aagggatggc tgtcgatttg gatggggtca 3011941 caaccgtgtt gttgccggga accggatcgg acaacgacta cgtccggcga gcattttccg 3012001 cccccctgcg acgcgccggg gcggtgctgg tgacgcccgt tccgcatcct ggtcgcttga 3012061 tcgacggcta tcgcgccgcc ctggacgacg ccgcgcgcga cgggccggtt gtcgtcggcg 3012121 gcgtctcgct cggagccgca gtggcggcgg cgtgggcgct ggaacatccc gatcgcgcgg 3012181 tcgccgtcct ggccgccttg ccggcctgga ccggggaacc tgaattagca cctgccgcgc 3012241 aggcagcgcg gtatacggca gcgcggctgc gctgcgacgg tctggcggcg acaaccacac 3012301 gcatgcgtgc atctagcccc gtctggttgg ccgaggagct gacccgatcg tggcgagttc 3012361 agtggcccga gctgcccgat gctatggagg aggcggcggc ctatgtcgcc ccaagccgcg 3012421 ccgagctggc ccggctggtc gcgccgctgg ccgtggccgc ggcggtcgat gatccgatcc 3012481 acccgctgca ggtcgctgcc gactgggtgt ccgtagctcc gcatgcggcg ctacggacgg 3012541 tgacgctgga cgagatcggc gcggacgccg ccgcgctggg ctctgcctgc ctggccgctc 3012601 tcgccgaggt ctcgggcgct tgatcgcctg tttgtccgac ggcggagtgc gcgtaccgtt 3012661 tgggtcgccg agcctgtaat tttgcaggcc cccactcgca ctttgcctgc agagttacag 3012721 cctcagcgaa cagcgcgctc gtactgttga ggtcgtcgag ctagtcccga tcgcccgact 3012781 cctcctcacg ccgctacgcg gcgcgctcgt actgttgggg tcgtcgggct agccgcccgt 3012841 cgtgctgcgc aactgctgca tcgccgatcc ttgagcgccg cggcgtgcaa cccccgcggc 3012901 ggcttggcgt tgcgtgtcgg cctgtgccgc cgcagcctcc cttagttgcg ccgccatcgg 3012961 ctcgggcagg tgcaccggta acggcgtccg taccggcagc ggggtatcgc cccggcggac 3013021 cacggtgtcc gccaacgcct cacgggcctc ctcggtcagc gcatcgactg tctcttgtgg 3013081 gccgttgacg acgcatcgga tcatccagcg gtagccgtcg accccgatga agcgcaccac 3013141 accggcggcg atgccgatca cttcgcgacc ccacgggcca tccttgatcg aaactttggc 3013201 cgagtccttg cgcagcgagt cggcgagttc gccggccacc tcacgccaga gcccgccggt 3013261 cttaggtgcc gcgtaggccg caatgctgta gcgaccgttg ggtgtgatga cccacaccgc 3013321 gctgggaaca ccgctctcgg tcagctcgac ctgtacctga cccgcggccg gcatcggaat 3013381 cagcaccgag cccaagtcca gccgggccag caccgccacc gaagggtcat cgaagtcgtc 3013441 gatgtcgaat gggccctgaa gctcctcctg gtcttcgacg cctgacgcgg cggcagcgct 3013501 ggccaccacg gtgtcctccg gtcggacgtg ctcgtcggcc ggctggacgg gggcgtgtcc 3013561 ggccttgcgt ttgccaccgt ctttgcctgt gcgtctaccg aatgccatgg cgagcgccgc 3013621 tctcccccgt aagcgggtgg tacccccacc tcatcgcgcc ctcctttgca tcgtcgccgg 3013681 ggtcacaaac tcgcatgtcc gccggaggaa ccgtggccac cgtcgccgcg ggatgtcgag 3013741 gccagcccgg cctcgtcgaa cgacgagacc tcgaccagct cgaccaactc aacccgttgc 3013801 actagcaact gggcgattcg gtcaccgcga tgtaccacga tgggcgcggc tgggtccaag 3013861 ttgatcaggg ccaccttgat ctccccacga taacccgcgt cgatggtgcc cggactgttg 3013921 acgatcgaaa gccccacccg cgtggccaac ccggagcgcg gatggaccag cccgaccatg 3013981 ccgaacggga cggcgaccgc aacacccgtc cgtaccaggg cgcggcgccc aggtgccagc 3014041 tcgacgtctt cggcgctgta gagatcaacg ccggcgtcgc cgtcgtgagc gcggctgggc 3014101 agcgggagcc cggggtcgag gcggacgatc gccagagtgg tcgacacggg gccacagact 3014161 acccttgacc gcgtgtctgg gacgcgcctc gcgccgcaca gcgtgcgata ccgcgagcga 3014221 ttgtgggtgc cctggtggtg gtggccattg gctttcgcgc tagcggcgct tatcgcgttt 3014281 gaagtaaacc tgggcgttgc ggccctaccc gactgggtac cgttcgcaac gcttttcaca 3014341 gtcgcagccg ggacgctgct atggctcgga cgtgtcgaaa ttcgggtcac cgccggctca 3014401 gcggatggag ccggagtgaa gctatgggcc ggaccagcgc atctgccggt agccgtgatc 3014461 gcccgatcag ccgaaatccc ggccacggct aaatctgcgg cgctgggccg acaactcgat 3014521 ccggcagctt acgtcctgca tcgggcctgg gtggggccca tggttctggt tgtcctcgac 3014581 gaccccaacg atcccacgcc gtactggttg gtgagctgcc gccacccgga gcgggtgttg 3014641 tcggcgctgc gcagctgacc tatcaggcgg cgcagtcggt gcagatcatc acgccgttct 3014701 tctcgctggc caacctgctg cggtgttgca ccaaaaagca actcgagcag gtgaattcgt 3014761 cagcttgctt gggtacgacg cgcaccgaca gttcttcgcc ggacaggtcg gcgccaggca 3014821 gttcgaagga ctcggcggat tcggattcgt ccacgtcgac cacggccgac gccgcctcgt 3014881 tccgtcgtgc tttgagctct tcaagcgagt cctccgagac atcatcggtc tcggtacgcc 3014941 gcggagcgtc atagtcggta ggcattccta tcccctcaca tgcctcataa cttcaagcaa 3015001 cgctttgtac cagcgtcgaa cgcgtccacc aaacgattcg tgcccgtatc gtggcctatt 3015061 caagtgtgat ttacatcaca tattcatatt gcaccttgta cgcggcccta aacggtgcct 3015121 ttttgggtgc gaactacacc caatggtccg cctcctcacc gcgccgtgcc ggcacgcgtc 3015181 gtcagcggat taaagtgcac gtgtggtcgc acaaatcacc gagggtaccg ctttcgacaa 3015241 gcacggacgg ccctttcggc gacgcaaccc ccgacccgct atcgtcgtgg tggccttcct 3015301 cgtggtggtg acttgcgtga tgtggactct tgcactgacg cggcccccag atgtccgcga 3015361 ggccgcagtc tgcaacccgc ctccgcagcc ggcggggtca gcaccgacca accttggtga 3015421 acaggtgtcg cggacggaca tgaccgatgt cgcacccgcc aaactgagcg acaccaaagt 3015481 ccacgtcctc aacgccagcg gccggggcgg ccaagccgcc gatatcgctg gcgcactgca 3015541 agatctgggc ttcgcccagc cgaccgccgc caacgacccg atctatgccg gcacccggct 3015601 ggactgccaa ggccagatcc gcttcggtac ggcggggcaa gccaccgctg ccgcactatg 3015661 gctggtagcg ccgtgcaccg agctgtatca cgacagccgc gccgacgatt ccgtcgacct 3015721 tgcgctcggc accgacttca ccacgctggc acacaacgac gacatcgacg ccgtgcttgc 3015781 caacctgcgc cccggcgcca ccgagccctc agatcccgcg ctgctggcca agatccacgc 3015841 caacagctgc tgatcggccg gctcagtccg ggatcggctc taggccgttg aatcgctgta 3015901 gcgccgccaa cagctcgtcg gcgattccgg gcgcggcagc caccaccacc aaccccgcgc 3015961 cgcccgctct cggcgttgac aacaacacgc gggcccccgc ctcagcggcg atcaacgcac 3016021 ctgccgcaca gtcccacacc tgcaccccgt gctcgtagta ggcgtccagc cgacccgccg 3016081 ctaccatgca caagtccagc gccgcagaac cgatccgacg cacgtcgcgg accaacggca 3016141 caacatgagc cagcaattct gcctgcttct cgcggcaccg aaccgagtac ccgaagccgg 3016201 tacccagcaa cgccatcgac aactcgtcga caccggtgca ccgcaacaca tgtctccccc 3016261 gctcatcggt gagatgtgcg ccgaggcccg tcgccgccga atacaccgtg cgagcggcga 3016321 cgtcggcgac cgcgcccgcc accgtgatgc cgccaacctg tgccccaatc gacaccgcgt 3016381 acgccgggat gccgtagacg aaattcaccg tgccgtcgat ggggtcgagc acccaagtga 3016441 cccggtcgga gggtgtagcc gtcacgtcgg cgggaccacc accttcctcc ccgagaatcg 3016501 ggtcaccggg ccgaagttga gccaaccgat cacgcaagag ccgctccgtg tcggtgtcga 3016561 ccacggtcac cggatcggtc gggctgctct tcgcgcgcac cgcgccgtcg ccgtcgcccg 3016621 ccctggagat gccgaaaacc tcggcccgac gaccgcgaac gaaggccgcc gcctcggcag 3016681 caaggttttc ggccacagag cgcagccgcg cgggttcgtt gtcaggtcgt gtcaccggcc 3016741 tatcgcatca cagtcgccac ccgcatggtg gcgtggactc cagcggccat aacgccctcg 3016801 caactgccgg gccgcagttt aaggtgaggg tcatccacgt ctcgccgagg agattcgatg 3016861 accagcaccg gccccgagac gtccgaaaca ccgggtgcca cgacacagcg tcatggcttc 3016921 ggcatcgacg tcggcggcag cggcatcaag ggcggaatcg tcgacttgga caccggccag 3016981 ctgatcggcg accggatcaa gctgctgacc ccgcaaccgg ccactccgtt ggcggtcgcc 3017041 aaaaccatcg ccgaggtcgt caacggtttc ggctggcggg gtccgctggg ggtgacctat 3017101 cccggcgtcg tcactcacgg cgtcgtccgg accgcggcta acgtggacaa gtcctggata 3017161 gggaccaacg cacgcgacac tatcggcgcc gagctgggcg gtcagcaggt caccatcctc 3017221 aacgacgctg atgccgccgg gctggccgag acacgctacg gggccggcaa gaacaaccct 3017281 ggcttagtgg tactgctcac attcggaacc gggatcgggt ccgcggtcat ccacaacggg 3017341 acgttgatac ccaacaccga gttcggacat cttgaggtcg gcggcaagga agcggaggaa 3017401 agggccgcct cctcggtaaa ggaaaagaac gactggacct atccaaagtg ggccaagcag 3017461 gtgatacgcg tgctcatcgc catcgagaac gcgatctggc ctgacctgtt catcgccggc 3017521 ggcggcatca gccgcaaggc cgacaaatgg gtgccgctac tggaaaaccg cacaccagta 3017581 gtgcccgcgg ccctgcagaa caccgccgga attgtcggtg cggccatggc ctctgtcgca 3017641 gatacgacgc actgaaactt gcccgctcgg gctgtactcg tgcgcagtaa agttacaatg 3017701 gtcagcggcg gccgcccgac cgatagcgcg cgagtattca cgctgatatc aacgccgaca 3017761 ttcgacatag cagacacttt cggttacgca cgcccagacc caaccggaag tgagtaacga 3017821 ccgaaggggt gtatgtggca gcgaccaaag caagcacggc gaccgatgag ccggtaaaac 3017881 gcaccgccac caagtcgccc gcggcttccg cgtccggggc caagaccggc gccaagcgaa 3017941 cagcggcgaa gtccgctagt ggctccccac ccgcgaagcg ggctaccaag cccgcggccc 3018001 ggtccgtcaa gcccgcctcg gcaccccagg acactacgac cagcaccatc ccgaaaagga 3018061 agacccgcgc cgcggccaaa tccgccgccg cgaaggcacc gtcggcccgc ggccacgcga 3018121 ccaagccacg ggcgcccaag gatgcccagc acgaagccgc aacggatccc gaggacgccc 3018181 tggactccgt cgaggagctc gacgctgaac cagacctcga cgtcgagccc ggcgaggacc 3018241 tcgaccttga cgccgccgac ctcaacctcg atgacctcga ggacgacgtg gcgccggacg 3018301 ccgacgacga cctcgactcg ggcgacgacg aagaccacga agacctcgaa gctgaggcgg 3018361 ccgtcgcgcc cggccagacc gccgatgacg acgaggagat cgctgaaccc accgaaaagg 3018421 acaaggcctc cggtgatttc gtctgggatg aagacgagtc ggaggccctg cgtcaagcac 3018481 gcaaggacgc cgaactcacc gcatccgccg actcggttcg cgcctacctc aaacagatcg 3018541 gcaaggtagc gctgctcaac gccgaggaag aggtcgagct agccaagcgg atcgaggctg 3018601 gcctgtacgc cacgcagctg atgaccgagc ttagcgagcg cggcgaaaag ctgcctgccg 3018661 cccagcgccg cgacatgatg tggatctgcc gcgacggcga tcgcgcgaaa aaccatctgc 3018721 tggaagccaa cctgcgcctg gtggtttcgc tagccaagcg ctacaccggc cggggcatgg 3018781 cgtttctcga cctgatccag gaaggcaacc tggggctgat ccgcgcggtg gagaagttcg 3018841 actacaccaa ggggtacaag ttctccacct acgctacgtg gtggattcgc caggccatca 3018901 cccgcgccat ggccgaccag gcccgcacca tccgcatccc ggtgcacatg gtcgaggtga 3018961 tcaacaagct gggccgcatt caacgcgagc tgctgcagga cctgggccgc gagcccacgc 3019021 ccgaggagct ggccaaagag atggacatca ccccggagaa ggtgctggaa atccagcaat 3019081 acgcccgcga gccgatctcg ttggaccaga ccatcggcga cgagggcgac agccagcttg 3019141 gcgatttcat cgaagacagc gaggcggtgg tggccgtcga cgcggtgtcc ttcactttgc 3019201 tgcaggatca actgcagtcg gtgctggaca cgctctccga gcgtgaggcg ggcgtggtgc 3019261 ggctacgctt cggccttacc gacggccagc cgcgcaccct tgacgagatc ggccaggtct 3019321 acggcgtgac ccgggaacgc atccgccaga tcgaatccaa gactatgtcg aagttgcgcc 3019381 atccgagccg ctcacaggtc ctgcgcgact acctggactg agagcgcccg ccgaggcgac 3019441 caacgtagcg ggcccccatg tcagctagcc gcaccatggt ctcgtccgga tcggagttcg 3019501 aatcagccgt cggctactcg cgcgcggtac gcatcgggcc actcgtggtg gtggccggaa 3019561 cgaccggcag cggcgatgat atcgccgctc agacgcgaga cgctctgcgc cgcatcgaga 3019621 ttgcgctcgg acaggccggc gcaactctgg ccgacgtggt ccgtacccgc atctatgtga 3019681 ccgatatttc ccgctggcgc gaggtcggcg aagtgcatgc acaggctttc ggcaagatcc 3019741 gtccggtgac gagcatggtc gaggttaccg cgctgattgc gcccggcctg ctggtagaga 3019801 tcgaggccga cgcctacgta gggtcggcgg ttgcagaccg aaattcggga gccggcccga 3019861 aggacccgtc accagccggt gggtaggcgg cggccccaat cacagcgcgc accggcagtg 3019921 ggccgtagag atgcgggaaa agcatcgacc gcggatcagt aggcacgccc ggctcccaac 3019981 gcacgggtga gtcgagcgcc gccgggtcga tgtacagcag caccaggtca gcacggccac 3020041 ggtaaaggcg gttggcgggc aggtgaacct gctcgagtgt cgacaggtgg atataccccg 3020101 tcttgtcgga ctcgggatag atcccaccgc gttctcgggc atgcgaccac tcctgcaccc 3020161 cgcataggtg caccagcatg gcaggatcgg gcgtcattct caccaccctg cccgattggc 3020221 gggggcgaaa gtcgtgagaa atgacacacc cgacagcggc cggggaacac ggcgagaacc 3020281 ccgaacgtct gagaaggtga agatacccga gaacggagag ccatgaacgc aactctgacc 3020341 agtcctgagc tgactagagc agaccgctgc gaccgctgtg gcgctgcagc tcgggtgcgc 3020401 gccaagctgc cctccggagc cgagcttctt ttctgccagc atcacgccaa cgagcacgag 3020461 gcgaaactga ccgagatgtc cgccgtgctg gaggtcagcg ggagcgaata gaccgaactc 3020521 acccgtccac aatgccggta gcgcgcgcag ttttcggtaa tgctggactg gtatgagcga 3020581 ccaggtcccc aagccacacc gccaccacat ctggcgaatc acccgtagga ctttgtccaa 3020641 aagctgggac gactcgatct tctcggagtc agcgcaagcg gctttttggt cggccttgtc 3020701 tttgccgccg ctactgctgg gaatgctggg cagtctggcc tacgttgctc cgctattcgg 3020761 cccggacacc ttgcccgcga ttgaaaagag cgcgctttcg acggcccaca gctttttctc 3020821 ccccagtgtg gtcaacgaga tcatcgagcc caccatcggc gatatcacca acaacgcccg 3020881 cggtgaggtg gcgtcgctgg gcttcttgat ctcgctgtgg gcaggatcgt cggcaatctc 3020941 ggcgttcgtc gatgcagtgg tggaagcgca cgaccagaca ccgctacgcc acccggtccg 3021001 gcaacgcttc tttgcgctct tcctctacgt ggtgatgttg gtgttcctag tagcgaccgc 3021061 accggtaatg gtggtgggtc cacgcaaggt aagcgagcac atcccggaga gcttggccaa 3021121 cctgctgcgc tacggctact accccgcgct tattctcggt ctaaccgtcg gggtcatcct 3021181 gctataccgg gtggcactac cggtacccct gccgacgcat cggctggtcc taggcgcggt 3021241 gcttgcgata gcggtcttcc tgatcgccac cttgggcttg cgggtctacc tcgcgtggat 3021301 cacccgcact ggctacacct acggagcgct ggccacgccg atcgcgtttc tgttattcgc 3021361 cttctttggc ggctttgcga tcatgctcgg cgctgaactc aacgccgccg tccaggagga 3021421 atggccggcg ccggcgacgc atgcccaccg actgggcaat tggctaaagg cccgcatcgg 3021481 cgtcggcacg acgacgtatt cttcgacagc ccagcacagc gccgtcgctg ccgagccgcc 3021541 gagctagtca gcccttcttg agggtgtcgt aaatccgctt gcaatcggga cagaccggcg 3021601 agcccggctt gggcgcgcgg gtaacgggaa acacctcgcc acacaacgcc accacgtggc 3021661 tacccatgac cgcgctctca gcgatcttgt ctttcttgac gtagtggaag tatttcggtg 3021721 tgtcgctgcc ggtcccgtcg tcgacgcgtt cgtcggcgtc ggtacgttca atcgtctggg 3021781 tctgcatacc tgacattgtg cccttggcag gaaagctctc gaagccggag tgcactgcat 3021841 gtgggacagt agagtaatga agcacggctt gaggctgggt ttcaatggcc agttcgacga 3021901 cttcgacgac ttcgacgata agggccggcc ggtactgatt actgccgccg ctccctcgta 3021961 tgaggtggag catcgcacac gggtgcgtaa gtacctgacc ctgatggcat tccgggtccc 3022021 cgcgctcatt ctggccgcca tcgcctacgg cgcctggcac aacggactga tctcgctact 3022081 gatcgtggca gcctcggtgc cgttgccatg gatggccgtt ctgatcgcta acgaccgacc 3022141 gccgcgccgc gccgacgaac cccgccgctt cgacgtcgcc cgccggcgca tcccgctgtt 3022201 cccgaccgcc gaacggcccg cactcgagcc gcggcgacag ccggcagagc ggtcagcccc 3022261 gcggggattc gccgaccacg gttagccgtc tgttggccgg cgttccgggt tgtcggccac 3022321 tggccacact tctcaggact ttctcaggtc ttcggcagat tcctgcacgt cacagggcgt 3022381 cagatcactg ctgggtggga actcaaagtc cggctttgtc gttaaacccc atgacagtgc 3022441 aagccgatcg ggaggtcgct atggccgatg cacccacaag ggccaccaca agccgggttg 3022501 acagcgatct ggatgctcaa agccccgcgg cggacctcgt gcgcgtctat ctgaacggca 3022561 tcggcaagac ggcgttgctc aacgccgccg gtgaagtcga actggccaag cgcatagaag 3022621 ccgggttgta tgccgagcat ctgctggaaa cccggaagcg cctcggcgag aaccgaaaac 3022681 gcgacctggc ggccgtggtg cgtgatggcg aggcggcgcg ccgccacctg ctggaagcaa 3022741 acctgcggct ggtggtatcg ctggccaagc gctacacggg tcggggcatg ccgttgctgg 3022801 acctcatcca ggagggcaac ctgggtctga tccgagcgat ggagaagttc gactacacaa 3022861 agggattcaa gttctcaacg tatgccacgt ggtggatccg ccaggccatc acccgcggaa 3022921 tggccgacca gagccgcacc atccgcctgc ccgtacacct ggttgagcag gtcaacaagc 3022981 tggcgcggat caagcgggag atgcaccagc atctgggtcg cgaagccacc gatgaggagc 3023041 tcgccgccga atccggcatt ccaatcgaca agatcaacga cctgctggaa cacagtcgcg 3023101 acccggtgag tctggatatg ccggtcggct ccgaggagga ggcccctttg ggcgatttca 3023161 tcgaggacgc cgaagccatg tccgcggaga acgcggtcat cgccgaactg ttacacaccg 3023221 acatccgcag cgtgctggcc actctcgacg agcgtgagca ccaggtgatc cggctgcgct 3023281 tcggcctgga tgacggccaa ccacgcaccc tggatcaaat cggcaaacta ttcgggctgt 3023341 cccgtgagcg ggttcgtcag atcgagcgcg acgtgatgag taagctgcgg cacggtgagc 3023401 gggcggatcg gctgcggtcg tacgccagct gaagctggac atcctgagcc aggtagcaga 3023461 cggtatgccc gccgcgccag cggcgggcat accgctgcgg tggggcggcg ggcaaccatt 3023521 ttcgcagctg gccaagtaga ctcagctgca atggagggtg ctgaatgaac gagttggttg 3023581 ataccaccga gatgtacctg cggaccatct acgacctcga ggaagagggc gtgacgccac 3023641 tgcgtgcccg gatcgccgag cggctcgacc agagcgggcc gacggtcagc cagaccgtgt 3023701 cccggatgga gcgcgatggg ctacttcggg tggctggcga tcgccacctg gagctcaccg 3023761 aaaagggccg cgcgctggcc atcgccgtga tgcgcaagca ccgcctcgcc gaacggctcc 3023821 tcgtcgatgt catcgggttg ccgtgggaag aagttcacgc cgaggcatgc cggtgggagc 3023881 acgtgatgag cgaggacgtc gagcgacggc tggtcaaggt gctcaacaac ccgaccacgt 3023941 ccccgttcgg caacccgatc ccgggcctgg tggaacttgg cgtgggcccg gaaccgggcg 3024001 ccgacgacgc caacctggtc cggttgaccg agttgccggc cggctcgccg gtcgcagtcg 3024061 tcgtccgcca gcttaccgag cacgttcagg gcgacatcga cctgatcacg cggctaaaag 3024121 acgccggcgt ggtgcccaac gcacgagtaa ccgtcgaaac caccccaggc ggcggcgtga 3024181 ccatcgtcat cccgggccat gagaacgtca ccctgccaca cgagatggcc cacgcggtca 3024241 aggtcgagaa agtctgagct aacccgcacc taccctgcgc gttgaccgaa cgcacgtcga 3024301 ggcggcagtc gtattccgag ttgttcagcc cgttggtagc cggtgaccgc gatgtcacgg 3024361 atgtgctcag gtcgcagacc agactgcagt gccgtgtcca gcatgcccgc catccgatgg 3024421 cccggctcac agcacagcgc agcctgcagc gaaacaccgg ccagcggccc gtcaccgcgg 3024481 gcataggcgc tgaacgcgag caacaccagg gcctccaccc gccacggttc gggcagcacc 3024541 cgcgccagta acgcccacaa tgactcggcc gcaccagcat tctcgccgac ggcaagggca 3024601 tacagcatgt cgcggacccg cgcgtcgccc agtgcgcaac ccagccgcgc cagctccgtg 3024661 tcggacaagg actgaccgtc tgcgacccgg gccgcggcgg ccagcgcatt ttccacatcc 3024721 tggcggctgc agccgaccga atcagcacgg tgtgcgatct ctcggtcagc cgcttggtgt 3024781 cctagcgcaa cggcaagctc ggcggagcgc acagggtcgt ccacggcgat gacggcctgc 3024841 aggtcggagc gccgcgggta gagctgcctg ccgtccagca ccgccgccat cgccaacggc 3024901 gacgccgacg gatcgtcgat aacgccgctg cagccgcagc cgtccacaca atgccagcgc 3024961 ccgccagcgg ctacccggtc taccacgtgc gctgcccata gcacgatgtc gcgctgcgac 3025021 aacgccgccg cgagcgccgc gcacagctgc cggtactcct cattgcatcg cggacactgg 3025081 gctccgttcg cgtcaacgat caccgcgatc gcggccgccg ggttcgccgc ggcgacaagt 3025141 tctgcgagat ggccaacccg atcggcgagt tcatcacaga ggtcggcgcg catcaccgac 3025201 cctagttccc ccgctgccaa cgacaccaga accagcgatt tttccggcac gaagccgagg 3025261 atggccggta gcgcggcgat cagtgttgca gggcggttga gttcaaattg tcctcgatac 3025321 ttcgtcatga atgccacgct gactaccggc accgtcagcc ggtgcccacg tcacgcgatc 3025381 gagctgcctt cctgtggacg aaggcgtaac tgtgcgttct actgtcattt catggggtcg 3025441 atgcgtgaat acgacatcgt ggtgatcggg tcaggcccgg gcggacagaa agccgccatc 3025501 gcctcggcga agctgggcaa gtccgtggcc atcgtcgaac gcggccgaat gctcggcggc 3025561 gtctgcgtca acacaggcac gatcccatcc aaaacgttgc gtgaggctgt gctctacctc 3025621 accggcatga accaacgcga gctgtacggc gcaagctacc gcgtgaagga ccggatcacc 3025681 ccggccgacc tgttggcgcg gacccagcac gtgatcggca aggaagtcga cgtggtgcgc 3025741 aaccagctga tgcgtaaccg cgtcgatctg atcgtgggcc atggccggtt catcgacccg 3025801 cacaccatcc tcgtggagga ccaggcccgc agggaaaaga ccaccgtcac cggcgactac 3025861 atcatcatcg ccactggcac caggccggca cggccatccg gagtcgaatt tgacgaagaa 3025921 cgggtgctcg actccgacgg gatcctcgat ctcaaatcgc tgccatcctc gatggtcgtg 3025981 gtcggtgccg gcgtgatcgg catcgaatac gcctccatgt tcgctgcgtt gggcaccaaa 3026041 gtcaccgtcg tggagaagcg ggacaacatg ctggacttct gcgaccccga ggtcgtcgag 3026101 gcgctgaaat tccacctgcg cgacctggcg gtgacattcc ggttcggcga ggaagtgacc 3026161 gcggtcgatg tcggctctgc gggcaccgtg accaccctgg ccagcggcaa acagattcca 3026221 gccgagaccg taatgtactc ggcgggacgt cagggacaaa ccgaccacct cgacctgcac 3026281 aacgccggac tcgaggtgca gggccgcggg cggatcttcg tagacgaccg tttccagacc 3026341 aaggtagacc acatctacgc cgtcggcgac gtcattggct tccccgcctt ggccgcgacg 3026401 tcgatggagc aggggcggct ggccgcctac cacgccttcg gcgaaccaac cgacggaatc 3026461 accgaacttc agccgatcgg tatttattcg attcccgagg tgtcctacgt cggcgccacc 3026521 gaggtggaac tgaccaagag ctccatccca tacgaggtgg gagtggcccg ctaccgggag 3026581 ctggcccgcg gccaaatcgc cggcgactcc tacggcatgc tcaagctgct ggtttccacc 3026641 gaggatctca agctgctcgg cgtgcatatc ttcggcacca gcgccaccga gatggtgcac 3026701 atcgggcagg ccgtgatggg atgcgggggc agcgtcgagt acctggtcga cgcggtgttc 3026761 aactacccga ccttctcgga ggcctacaag aacgccgcac tggacgtgat gaacaagatg 3026821 cgcgcactca accagttccg ccgctgaggg tgccgagcgg atgtgaatcc gtctcggcgc 3026881 ccaagtaggc ttgccagcaa attcgccgcc gcccacgaac ggtcggcgtc gaacgtggcc 3026941 ccgcgctttt ggcgttgtgc agcacagcgg cagccagggt tggctgttca atcattgctg 3027001 tccgctgatt tgagggacac tggttacggc acctcggcga caaccccgag aggaggcaac 3027061 acccatggct cgcgatcaag gcgcagacga agcgcgagaa tatgagccgg ggcaacccgg 3027121 catgtacgag cttgagttcc cggcgcctca gctgtcgtcg tccgacggcc gtggtccggt 3027181 gttggtgcac gctttggaag gtttctccga cgccggccat gcgatccggc tggccgccgc 3027241 ccacctcaag gcggccctgg acacagagct ggtcgcgtcc ttcgcgatcg atgaactact 3027301 ggactaccgc tcgcggcggc cattaatgac tttcaagacc gatcatttca cccactccga 3027361 tgatcctgag ctaagcctgt atgcgctgcg cgacagcatc ggcaccccat ttctgctgct 3027421 ggcgggtttg gagccggacc tgaagtggga gcggttcatc accgccgtcc gattgctggc 3027481 cgagcgcctg ggtgtacggc agaccatcgg cctgggcacc gtcccgatgg ccgttccgca 3027541 cacacgaccg atcacgatga ccgctcattc caacaaccgg gagctgatct ccgattttca 3027601 accgtcgatc tccgaaatcc aggtcccggg tagcgcttcc aacctactgg aataccggat 3027661 ggcccagcac ggtcatgagg tcgtcgggtt caccgtgcac gtcccgcact atctcacgca 3027721 gaccgactat cccgcggccg cccaagcgct gctcgaacaa gtggccaaga ccggttctct 3027781 gcagctgccg ctggccgtgc tagccgaagc agccgcagag gtccaggcca agatcgacga 3027841 gcaggtccag gcaagcgccg aagtggctca agtggtggcg gcccttgagc gccagtacga 3027901 tgccttcatc gacgctcagg agaacaggtc gttgctaacg cgcgacgaag atctgccgag 3027961 cggcgacgag ctcggtgccg agtttgagcg gttcctggct cagcaggccg agaagaagtc 3028021 cgacgacgac ccgacctaac gccgcgaaag cggcccacaa aacggcccca gtcggcccga 3028081 caacaagatt ggcgaggatg accgagcgga agcgaaatct tcggccagtg cgcgacgtgg 3028141 caccgcctac gctgcagttc cgcaccgtcc acggttatcg gcgggcattc cggatcgccg 3028201 gttccgggcc ggcgattctg cttatccacg ggataggtga caattccacc acctggaatg 3028261 gggtgcacgc caagctcgcc caacgattca ccgtcatcgc tccggatcta ctgggccacg 3028321 ggcaatccga caagccgcgt gccgactatt cggttgcggc ttacgccaac ggcatgcggg 3028381 acctcctcag cgtgctcgac atcgagcggg tgaccatcgt gggccattcg ctcggcggcg 3028441 gggtagcaat gcaattcgcc taccagttcc ctcagctagt cgaccgactg atcctggtca 3028501 gcgcgggcgg tgtcaccaag gacgtcaaca tcgtcttccg gttggcctcg ttgcccatgg 3028561 gcagcgaggc tatggccttg ctacggttgc cgctggtgct gccggcagtg caaatcgccg 3028621 ggcggatcgt gggtaaggcc atcggtacca ccagcttggg gcacgacctg cccaatgtgc 3028681 tgcgcatttt ggacgacctg ccagagccga cggcttctgc ggcgttcggc cgcaccctgc 3028741 gggcagtggt ggactggcgg gggcagatgg tcaccatgct ggaccgatgc tatttgaccg 3028801 aagccatccc ggtacagatc atctggggca caaaggatgt cgtgctgcca gtccgtcacg 3028861 ctcacatggc gcatgccgcc atgccgggct cgcaattgga gattttcgag ggctcgggac 3028921 atttcccgtt tcacgacgac cctgcgcgct tcatcgacat cgtcgaacgc ttcatggaca 3028981 ccactgagcc cgccgaatac gaccaggccg cgctgcgcgc gttgcttcgc cggggtggcg 3029041 gcgaagcaac cgtcaccggc tcggcagaca cccgtgttgc agtactgaac gccatcgggt 3029101 ccaacgaacg cagcgctacc tgatcaccac cgggtctgtt agggctcttc cccaggtcgt 3029161 acagtcgggc catggccatt gaggtttcgg tgttgcgggt tttcaccgat tcagacggga 3029221 atttcggtaa tccgctgggg gtgatcaacg ccagcaaggt cgaacaccgc gacaggcagc 3029281 agctggcagc ccaatcgggc tacagcgaaa ccatattcgt cgatcttccc agccccggct 3029341 caaccaccgc acacgccacc atccatactc cccgcaccga aattccgttc gccggacacc 3029401 cgaccgtggg agcgtcctgg tggctgcgcg agagggggac gccaattaac acgctgcagg 3029461 tgccggccgg catcgtccag gtgagctacc acggtgatct caccgccatc agcgcccgct 3029521 cggaatgggc acccgagttc gccatccacg acctggattc acttgatgcg cttgccgccg 3029581 ccgaccccgc cgactttccg gacgacatcg cgcactacct ctggacctgg accgaccgct 3029641 ccgctggctc gctgcgcgcc cgcatgtttg ccgccaactt gggcgtcacc gaagacgaag 3029701 cgaccggtgc cgcggccatc cggattaccg attacctcag ccgtgacctc accatcaccc 3029761 agggcaaagg atcgttgatc cacaccacct ggagtcccga gggctgggtt cgggtagccg 3029821 gccgagttgt cagcgacggt gtggcacaac tcgactgacg tagagctcag cgctgccgat 3029881 gcaacacggc ggcaaggtga tcctgcaggg gttgcccgac cgcgcgcatc tgcaacgagt 3029941 acgaaagctc gtcgccgtcg atgcggtagg aacggtcaag ggcggtcacc tcttttgcgg 3030001 tcggggccaa tccgatcgac ccatccgcgc gtgtggacaa ttcgagttcg atgacgtcac 3030061 cggtcaccga ataggttcca acctcaattt cggtgatgcc gcttggatgg gcgagaacga 3030121 gttcaacgca gcccggtcgg caaacgcgga gataccccgt ctcggaatgc agcggcttcc 3030181 cgtcagctac cgccctggtc tgctgtgtgt acgtcagaaa cggtttaccc acatgggcga 3030241 atacgacttc ctcgaggtat tcgaacggcc ggatggtggg gtacttgccc gcaccgcgac 3030301 ccgcccaact ccccaggagg ggtgacagcg cctgcagggc aggggccaga tctcgggtca 3030361 tcgcccgctt gcgggggaca ggcatgcggg aagcctagcg ccgcgagatc ggtcagctgt 3030421 gggctgatag gttgcggtgc gcgcgaagcg cctcaatctc gcgcgcgaaa tcgtccgcgg 3030481 aagaaaacga ccggtagacc gacgcgaacc gtaggtaggc cacctcgtca agctcgcgca 3030541 acgggcccag gatagccagg ccgacatcgt gactcggaat ctccggcgac cccgcggcac 3030601 gcaccgaatc ctcgacttgc tgagccagca ggttcaacgc atcgtcgtcg acctggcgtc 3030661 cctggcacgc ccggcgcaca ccgctgatca ccttttccct gctgaagggt tcggtaacgc 3030721 cactgcgctt gactacggcc agcaccgcgg tctctacggt ggtgaatcgt cgtccacatt 3030781 cggggcacga cctccggcgc cggatcgcct ggccttcatc ggtttcccgg gaatcgatca 3030841 cccgcgaatc gggatgccgg cagaacgggc aatgcatggc cgctcctttg ccgtcttgac 3030901 atccgggtat cacagacgac tccgagcgta cctgtgtgct cccgcgggta gccactgcag 3030961 tcacgactga tgcgcatatt gcgtcgcggt cacccagtaa cgttgacaca gaacggtttt 3031021 cgcggacacc gggatggcct cagccaaccg gagcgatcag cgtctgaccc acggccaatg 3031081 ccggtgtctg caggccgttg agttcacgga tgcggtcggc aacctggcgg gtcggagcgt 3031141 tcggcgccac ccggaccgcc acgtcataca gggactcccc cgtttccacc cgtaccacgg 3031201 caagcctgtc gggcacccga ccggtcgaat cggccgaccc gtcggccgaa ccgccggtga 3031261 tcatctgccc gaactgcgcc accaaaccaa gccagagagt aatcgccgcg gcaagcagag 3031321 ccagccccac cgtcgtggcc ggcgggacgg gcctgctgcc atgcccagtc ctcgacatcc 3031381 cgaccccggt gcggtggtag cgcagcggcg caccccccgg cctcgatcgg ccgggcctgc 3031441 gcgattgcgc cggctcagcg cggcgccagc gaggtccatc gagcgggccc cgcagattga 3031501 gcggatcggg ggtatgcggt ggccggaccg gtgtcatgtt cgctcctcca actcagacgg 3031561 taatcgctcg cgtgttcgac actgtagtca ctcatgtgtt cgatatccga acatttgatc 3031621 gaagcgtgtc gcacgcgcaa aacggtagac cacaccaccg acacgtttcg gttggagccg 3031681 gacttccggc gcgaaggccc agccactcct cgtgccctcc cgcgaccgga acacgcctgt 3031741 cgaacacatg tttgattctt ggtgcgaatg cgactacatt cattgccatg aacgacagca 3031801 acgacacctc ggttgccggc ggagccgctg gtgcggacag ccgggtgctg tccgcagatt 3031861 cggcgctgac cgagcggcaa cgcactattc tcgacgtcat ccgcgcgtcg gtcactagcc 3031921 gcggatatcc gccgagcatc cgggaaatcg gcgacgccgt tggtctgacg tcgacgtctt 3031981 cggtggcgca ccagctgcgc accctggagc gcaagggcta cctacgccgt gacccgaacc 3032041 gcccccgcgc cgtcaatgtg cgcggtgccg acgacgccgc cctaccgccg gtgaccgaag 3032101 tggccggctc ggacgcctta ccggaaccca cctttgtccc tgtcctggga cgtatcgcgg 3032161 ccggcggccc gatccttgcc gaggaagccg ttgaagacgt cttcccgctg ccgcgtgagc 3032221 tggttggcga gggcaccctg ttcctgctca aggtgatcgg tgactcgatg gtcgaagccg 3032281 cgatctgcga cggtgactgg gtggtggtgc gacagcagaa cgtcgccgac aacggcgaca 3032341 tcgttgcggc catgatcgac ggtgaggcca ccgtcaagac gttcaaacgc gccggcggtc 3032401 aggtgtggtt gatgccgcac aacccggcct tcgatcccat cccgggcaac gacgcgacgg 3032461 tgctgggcaa ggtcgtcacg gtgatccgca aggtctgatg ctgatccgcg tgcaggctgt 3032521 caatccgccc taatgaagcc gttgacttgt gccacttctt cactggcgaa ccagagttcg 3032581 gccagcgtgt cgtggtatag cgcactgccg ggtgggtaat acaggccgaa gctcacactg 3032641 gctttgacgg gatagccgtt cggcatctgg tatgggtcct cgagcggcag atgaattgcc 3032701 gggcgcacgc ccgcactcgg cggttccggc ggttctgcgg ccgcgtgccg cccgcctctg 3032761 tcttcagctg cggatacagc cgccgccggc aacccagtgt cagcaagatc ggcagcgtgg 3032821 acatcgggcg gcaccgcttc aggcgccact gcttccggaa caaaggcttg cggaacgaaa 3032881 gtctccggaa ccactcgctc aggaacaatg agatcgggac cgacttcgga aaggtcggcc 3032941 tgcgacacaa cgggtgtcgg cgtggtgtcc accgcatcgg tgtcttcctc ctcgaccccg 3033001 acgttgctgg ggccggacag caagtcgctg ccgtagccct cctcacccgg aagatgctcg 3033061 gcatcaccga cagccgcccc ggcgccgcgc ggccagctga cccgcggtgt ggacccggcg 3033121 tccggcgcca caggttccgg cgggaactgg tcgccgaaac cgaaatgctc tgaaccgaag 3033181 tcctcgtcgg gcggccagtc tccatcagcg gcggtaccgt actcgacatc cccagcgcgg 3033241 tcatcgtcgt aagcagcggc gtcgtaccca cgtcgacgcc gacgcaatcc gaacaccacc 3033301 aacgccacca tcacgaccag gagcaccccc agggcggcgg cccctaacca ccaccaatgc 3033361 caggtgaact tcttgccggg tgggggcatc gccgaagtac ttggctggtt ctgccctgac 3033421 acctgcagac cggacagcaa gggcgctagg ttagcgggat ccgtagtgaa tgtgtttttt 3033481 gcacggttcc acgaaatcat gccgccggtg aatttctgtg agacaacatc gccgtcgacg 3033541 gtctggtcgc cgaccggggc gccgagcttg ccgttggggc cgcgcagctt gtcccacgcg 3033601 gccaccatgg ctccgcgcac gacgaacgcg ccgtggtccg gagtccagaa aatcaccggc 3033661 ttgtcggccg cggagaacct gacgatccgg ctggagggcc caaaaccacc atcagtttcg 3033721 ttggcgatgg ggaaacccaa gtcgctgctg accggtccgc ccagcgactc gtacttcgcc 3033781 aggatttcac cctcgacggc gtttgcaccg gttgccgggc tgaagaagac cttgccaccg 3033841 acgaagtcct gggcgatacc gtctccgccg atcgggtact gcccaccctt cttggcgccc 3033901 agcggacctg cggcaccacc tgctgcgcgc caggccatgt tgatcgccgc ggaaggatcg 3033961 atcgctacct gcaaaccctt cagctgctcg gccagcaccg ccggaacggt ggtgaactcc 3034021 ttggttgccc ggttccagga gacttcacca ccgctgaact tctgggcggt gacctcgccg 3034081 tcgtaggttt catccccgac cggggcaccc agcacgccac ccgagctgcc gagcttgtcc 3034141 cacgcggcat tcagcgcgcc gcgcacgacg aacgcaccgt gttcaggcgt ccagaaaatc 3034201 accgggttgt cggccgcgga gaacgtgctc acgcgactgt cgggtccggc aaggccgggc 3034261 acctcgttga tggtcgggaa tcccagatcg ctgtcggctg caccgcccag cgactcgtat 3034321 ttgtccagga gcgggccgta gaggtatttg gcaccggtgg ccggggtgaa aaacatcttg 3034381 ccgccggcga agtccagggc gaacccgtcg cctatcgggt aaacgtcacc tttccggaca 3034441 ccaagtgttg aagtgtcacc acccgccttc tcccacgcgg ccatcatggc gtcctcggca 3034501 tcgcccatcg gcgaagccgc caccgtgggc gccagcaaca cggcggtcac cgccgtggcc 3034561 gccaagccga gcagcgtacg cccgatcagc gtgctcaatt gacctctctg cccgttcacc 3034621 aagcctccca gccgatgccc tgcctagccc gccagccggt ggatctccca ccgtgggccg 3034681 gtccccgctg cggtccgtat tgtccccggg ctcgcataac attgctccag cgaacgacga 3034741 ttgcgaagtc caatcgcaaa tattacgaaa acggataccc agccgatgtc aaattgatgc 3034801 cggggcacgc tgctgtggtg agcaaccggg ctgcagcccg ggccgggttt gcgttaccgt 3034861 gccggaaacg acaaccggac tgatgcggtg agaggaatcc cggctgacat gggtgcttcc 3034921 ggcctggtct ggaccctcac catcgtcctg atcgccggct tgatgttggt cgactacgtc 3034981 ctccacgtac gcaagaccca tgtaccgacg ttacgtcagg ccgtcatcca gtcggcgacc 3035041 ttcgtgggga tagcgatcct gttcggcatc gcagtggtgg tgttcggcgg ctcagagctg 3035101 gcggtcgaat atttcgcctg ctacctgacc gacgaagccc tgtcggtcga caacctgttc 3035161 gtatttctgg tcatcatcag cagcttcggg gtgcctcgtc tcgcgcaaca aaaggtgctg 3035221 ttgttcggta tcgcgtttgc gctcgtcacg cgcaccggat tcatcttcgt cggcgccgcg 3035281 ctcatcgaga acttcaactc ggccttttac ctgttcggcc tggtcctact ggtcatggcg 3035341 ggcaacctcg ccagacccac cgggctagaa agccgcgacg ccgaaacgct caagaggtcc 3035401 gtcattatcc ggctagccga ccgcttcttg cggacctcac aggactacaa cggagaccgg 3035461 ttgttcacgg tctcgaacaa caagcgaatg atgaccccgt tgttgctggt catgatcgcc 3035521 gtgggtggca ctgacatact atttgcgttc gattcgattc cagcactttt cggcctgacc 3035581 caaaacgtct atctggtgtt cgccgccacc gcgttctcgc tgttgggcct gcgccagctg 3035641 tacttcttga tcgacggcct gctggatcgg ctagtctatc tgtcttacgg gttggccgtg 3035701 attcttggct tcatcggcgt caaactgatg ctggaagcat tgcacgacaa caagattccg 3035761 ttcatcaacg gcggcaagcc ggtcccgacc gtggaggtga gcaccaccca gtcgttgacg 3035821 gtgatcatca tcgtcctgct gatcacgacc gcggcgtcgt tctggtcggc gcgcggacgg 3035881 gcgcagaacg ccatggcgag ggcccggcgg tatgcaaccg catacctcga cctgcactat 3035941 gagaccgagt cggccgaacg cgacaagatc tttaccgcac tgctggccgc tgaacgccag 3036001 atcaacactc tcccaacgaa ataccgcatg cagcccggac aggacgacga cctgatgacg 3036061 ctgctgtgca gggcccatgc cgcgcgcgac gcgcacatgt gagcccgcgc tagctgaggg 3036121 ctagctgcgc ctaaacaccc aagccacgac cgatgatctc tttcatgatc tcggtcgtgc 3036181 caccgtaaat cgtctgtacc cgcgaatcga gataggcccg ggcgactggg tattcgcgca 3036241 tgtagccgta cccaccgtgc agctgcagac agcggtcgtt cagatacacc tgcttctcgg 3036301 tggcatacca cttggccatg gcggcctgct ctgccgtcaa cttccccgcc aggtgcagct 3036361 taatgaattc gtcgaccatg atgcgcacca cagtggcctc ggttgccagc tcggccagca 3036421 agaatcggct gttctggaag ctaccgatcg acctgccgaa cgccttgcgc tccttggcgt 3036481 actgcagtgt ctgctccagc acggattcca tccccgcggc cgccatgatg gcgatcgaga 3036541 tccgttcttg cggcaggttc tgcatcaagt agatgaaccc catcccctcc tggccgagca 3036601 ggttttcggc tggaaccgcc acgtcggtga aggacagctc ggcggtgtcc tgggcgtcca 3036661 acccgatctt gtccagctgg cggccgcgtt cgaatccagc catgccgcgt tcgacgacca 3036721 acaaactgaa cccttgcgca cccttttcgg gatccgtctg cgccaccacg atcactaggt 3036781 ctgaattgat cccgttggtg atgaacgtct ttgacccgtt tagcacgtaa tgatcaccgt 3036841 gtttgacggc acgggtggtg ataccttgca ggtcactacc ggttccgggc tcggtcatcg 3036901 cgatcgcggt gatcaattcc ccggtgcaga agttgggaaa ccagcgccgc ttctgctctt 3036961 cggtggccag cgccagcaag tacggcgcca cgatgtcgtt gtgcaggcca aaaccgatcc 3037021 cgctgtaccg tccggcgcag gtttcctcgg tgatgaccgt gttgtaccgg aagtccgcgt 3037081 tacccccacc gccatactcc tcgggcaccg ccatgcccag aaatccctgc ttgccggcct 3037141 ccagccacac gccgcggtcg acgatcttgg tcttttccca ttcatcgtga tagggcgcga 3037201 cgtggcgatc gaggaacgcc cggtaagact cgcgaaacaa ctcatgttcg ggttcgaaaa 3037261 gtgtgcgctg gtacttggtg gcactgccca tggatgccct ccggggaaga aaattctggt 3037321 gcccaacaat accaaccggg cggttggtcg gcaggtagcc ggggcgcgcc agccgctgcg 3037381 agcgtaacgc cacggcgagc ttgcgtgcac cgaattcgcc gtggcgttac gctcgcggcg 3037441 caaactcgcg caaggtggca gccagcgcct ccgggacacg ggccttgatc cgggtgccct 3037501 cgggcttgtg ctccgcctgc tgtatccgcc catcggcgtg cacacgggcc accaggtcgc 3037561 cgcggtcgta cgggatcacc acgtcgacgg cggtgtcggc gggcacaacc agctcggcca 3037621 tccgccgtcg gagcgcatcg ataccgtcgc cggtgcgggc ggaaacgaac accgcgccgg 3037681 gcagcccgtg ccgcagcttg gccagcatca ggtcgctagc gacgtcaacc ttgttcacta 3037741 ccagcagctc gggcggcgga tcgccgtcat ggtcggcgat cacctcggag atcacctgac 3037801 ggaccgcgtc gatctgggct agcgggtggc cgtcggatcc gtccacgacg tggaccaata 3037861 gatcggcgtg cacgacctcc tccagcgtgg agcgaaacgc ctcgaccaac tgggtgggca 3037921 ggtgccgcac aaagccgacg gtgtcggtga gcacgactgg cctaccgtca ccgaactccg 3037981 cgcgcctggt ggtgggttcc agggtggcaa acagcgcgtc ctgtaccagc accccggccc 3038041 cggtcagcgc gttgagcagg ctggacttac ccgcgttggt gtagccgaca atcgcgatcg 3038101 acggcacgtc actgtgccgg cgacggctgc gctgggtgtc gcggacctgt ttcatggccc 3038161 tgatgtcgcg ccgtaacttg gccatccgct cgcggatgcg gcgtcggtca gtctcgatct 3038221 tggtctcacc gggaccgcgc agacccaccc cgccaccact gccaccggcg cgaccgcccg 3038281 cctgccgtga catcgactca ccccagccgc gcagccgcgg cagcatgtac tccatctgag 3038341 ccagcgacac ctgggctttg ccctccctgc tggtggcatg ctgggcaaag atgtcgagga 3038401 tcagcgcggt gcggtcaata accttaacct gcacagcctt ttccaaggcg gtcaactgcg 3038461 ccggcgacag ttcgccgtcg cagatgacgg tgtcggcgcc ggtcgccacg atcacttcgc 3038521 ggagttcggc cgctttgccc gagccgatgt aggtcgacgg gtcgggcttg tcgcgacgct 3038581 ggatgagtcc ttcgagcacc tgggagccgg cggtttcggc caatgccgcc agctcggcca 3038641 ggcttgcccg gttgtcagcc gcgctgccct cggtccacac tcccaccaac accacccgct 3038701 ccaggcgcag ctggcggtac tccacctcgg agacgtcggc aagctcggtc gacaacccgg 3038761 caacccggcg cagcgccgat ctgtcctcga gtgcgagctc accgaggctc ggtgtgaagt 3038821 ccgaaaggcc cgtctggggc ggatctggat atgtcatagc cagtacccga tggtggcacg 3038881 tggcagctgg ccgcgcatct gaatttgccg gcataagccg ctgcctggga tcaccccatc 3038941 gcgttccacc aatcgtcagc gagatctccg cgggccacca acactgacgg cccacgcagg 3039001 aagctggtgg catcggtgac ggtaaccacg acctctccgc ccggcacgtg cacggtgagc 3039061 gttccggtcg gcgagcccac cgccgccaac gcggcgaccg cggccgcaac cgtcccggtg 3039121 ccacacgagc gggtttcccc cacgccgcgt tcgtgaaccc gcatccagac cgccccgtcg 3039181 accggcgcgg tgagtacctc gacattgacc ccgtcgggga actgcgcacc atcgaaactc 3039241 accggcgcac ccacgtccaa tgccgccagg ccgtcgacgg tcagctggga atccacgcac 3039301 gccagatgcg ggttacccac atcgacggcc aggccgtgaa accgcctgcc accaacaaca 3039361 gcctcccctg cgcccaatct gttggccttg cccatgtcga cggagacgtc ggcgtaggcc 3039421 gcctcgacgt ggtggcaggt gactggtcgc ggtccggcca gtgaccctac gacgaactcg 3039481 tcgcgaacct ccaggccact ggcacgcaag tagtgcgcga acactcgcac accgttgccg 3039541 cacatctggg ctgccgaccc gtcggcgttg cggtaatcca tgtaccagtc ggtcacgcgg 3039601 acaccctcgg gcaggctgtc cagcactcct accgcctggg cggctccggc ggtcgtaacc 3039661 cgcaacaccc cgtcggcgcc cagccccttc cgccggtcgc acaatgccgc cacccgggca 3039721 gcggtgagca ccaactcggc gtcgacgtca ggcagcaaca cgaagtcgtt ctgggtaccg 3039781 tggcccttcg cgaagatcat ctgcgccact cctcaatcac cagatcaggt tacgtgccgc 3039841 cataaccgca cggcgtcgtc gaccagtcgt gcgcggtccg gtgagctggc cacaccggcg 3039901 tcgagccagt gcacccggtg gtctcggcga aaccaggacc gctgccgtcg cacgtagcgg 3039961 cgggtgccca ggtacgtctg ctcccgcgcg gcgcgcatca tgtcagctcc agcaccggcg 3040021 tccagagcgg ctattacctg cgcgtagccc agcgcgcgtg acgcggtgac cccctcgcgc 3040081 agaccattgc ggagcagagt gcgtacctct tcaaccaggc cctgatcaaa catcaggtcg 3040141 gtgcgacggg ccaaccgctc gtcgagaatc gttgtctgac agtccaaccc gacgataacc 3040201 gtgtcccacc gcggcgcacc gatgcgtggc gcggacgcgg caaatggctg cccggtgagt 3040261 tcgaccacct cgagcgcccg caccgtgcgc cgggcatctg tgggcaggat tgccgcggct 3040321 gcagccgggt ctcggcgggc taactcggcg tgcagccgat ccaccccgac ctcggccaga 3040381 cgccgctccc atctcgcgcg tactgaagga tcggttgcgg gaaacgacca gtcgtcgagc 3040441 agggattgga catacagcat cgagccgccc accacgaccg gcaccgctcc ccgggctgcg 3040501 atcgcctcga tgtccgccgc ggcggcccgc tggtagcgcg ccacggtcgc ggtttcggtg 3040561 acatccagga catcgagttg atgatgcggg atgccacggc gctcgctgac gggcagcttc 3040621 gccgtcccga tgtccatgcc gcgatacagc tgcatcgcgt cggcgttcac gatctccacg 3040681 ctcaccctgg cgccgagccg cgcggcgacg tcgagcgcca actgggactt gccggcgccc 3040741 gtcggtccga taatcgccaa cggtctcacg gctgccagac accggcgaaa taccccacgc 3040801 cgtgtggcgc tcctcggtag aactccttgg ccgatcgcgg tcctggctcg gccaggccgg 3040861 ccagcacctg aaatgccacc cgcccaagca cctgtgcggg cagccgggtc aagacggcga 3040921 ggtcgccgct ggccaacgcg tcgtcgagag cccgctgcat accggcgccg tcggggtcat 3040981 agccgccggg agcgcggggc gtcagggtgt tcaggccgtc ggcgacgact agtaccccga 3041041 tcggatcggg ctcccggtcg atgtcggctc gcagttgcct gccacgtgcc accgcggcat 3041101 cggaaccgtg gtcgctggca tagacgtgga cctgtgccct ggcctcaggc cgggcctggc 3041161 cccgtaccca ggcggtaagt agcgcacaca ggggtaattc caccggaacg gcaactccgt 3041221 caccgtcctg cggcgcgagc ccgactcgca cgtcggcgcc gaagcccgca aaggtgccga 3041281 cgtcggtggg gcgcacgacg tcgtcggcgc gcccggttcc gacagcaatc cagcttttcg 3041341 gcaacaagga ggccgccgcg atcaccgcgg cccccaaatc ggccagctcg gcagcggcgg 3041401 ctccggccag ttcgggaacc aacaccggcg cggacggaac gatcccgatg gcgctcaaca 3041461 caacacaaag ctaacgcctt ggcgggcgga ttcggcctcg tggagcaacg gctgcaaaga 3041521 gaccgtgctg agaggccgac actgtcccgc gctcattggc cagagacggt caacgaaccg 3041581 caagctggcc cgcggccccc aattcgccgg cggaaaccgt catcatcgcg acctcgtcac 3041641 gggccaaggc tacggtcgcg acaaccacca ccactaccgc tgccaccagc gcgaccaggg 3041701 ccacccggcc ggtgtgcagc acctcatcga gcacggtgat ccccagcacc gaagcgatca 3041761 ccggcctcgc cacggtgatc gtcggcaacg aggcggttag cgcgcccacc cgcaacgacg 3041821 actgctgaag catcagcccg atcggtagaa ccaggatcca ggcatacaac gcgggggtcc 3041881 ggatcagtgt cgcgaacccc tcgccgagct ccgtcacgac ccctttggtc agcacggtga 3041941 ataccgccaa cgttgccgac gacgccaccg ccagcagcac cgcggacagc gaacccgagg 3042001 caatccgtgc accaaccaca caaagcacca ccgccggaac aaccacgaca gcaaccaccg 3042061 cccaggtcga gaagggggcc cgagtagtgc cggccgccgg gttgcccgac atgacgatga 3042121 cggccaccgc gccggccagc aataccgccc acatccactc cctgggagta cagcggtgat 3042181 gagtcaaccg agcatcgatc agcagcgcga acaacagtgc ggtggcctgc agcgactgca 3042241 ccaacaccac cgaacccatc gtcagcgcaa tggcctgcag ggtgaaactg gcgactgcgg 3042301 ccaggctgcc cagccaccac agagcgtgac gcaaagagag gtggaacaac gtgaaatggc 3042361 cgacatattc ttcagcggtg acctgtcgcg cggaccgctg aagtgtcaca tacccgatcc 3042421 cggccagcaa cgcggcgccc agcgccagaa tggtcgcgaa ttcgacgctg gccataggtg 3042481 acctcccacc gacattcggc ccggaagctg actgatacat ctcgatttaa gcagttgttc 3042541 aatgatgatg aactggcgcc agacaaatat cacaacaaaa cgttgcgcgc agacgcgtgc 3042601 ttcgtcatcg gcttcggaat tctgcggcat attcgctgcg ccgggattga tgaggaattg 3042661 ccatcatggt ggctcggccc caagtgcggt cggcgggtcg gccgtgcaat tgaccgttgc 3042721 ttacggccct cagcgcttcc acgggaggtg cgcgagtaac agctcggttc ggccccttac 3042781 taccggcggc agctggaccc cgacctcgat cagctctacg gatggcggaa aagcacaggg 3042841 ccatgacacc cacgatcggc aaatatcgcg acgaacagtc tgtcaagcgg cctcaatact 3042901 tgcttcaata ctgttggaaa cggtggcagg accgggtgag ggcatcgggc cgaccacatc 3042961 ggtgccgctt cgcgcggcag atgcgcggca tacgcgcgaa gggttgcaag gtaggtgaca 3043021 agcgcatgac ggccgacgag ccccgcagcg acgattcgtc cgggtcggcc ccccaaccgg 3043081 ctgccacgcc ggtgccccgc ccgggaccgc gtcccggccc ccggccggtg ccgcgaccca 3043141 cctcctaccc ggtgggtgcg caccctccca gcgacccgca ccgtttcggc cgtatcgacg 3043201 acgacggcac ggtgtggctg gtcagtgcga gcggcgagcg tatcgtcggc tcctggcagg 3043261 ccggcgatcc cgaagccgcg tttgcccatt tcggcaggcg attcgatgac ctgagcaccg 3043321 aaatcatgct gatggacgag cggttggcgt ccggcaccgg cgacgcacgc aagatcaaag 3043381 cccatgcgat cgcgctggcc gaaacgttgc cgacggcatg cgtgctgggc gatgtcgacg 3043441 cgctggcaga ccggttgaca agcattcgtg atcgcgcgga ggtcatcgct gccgccgacc 3043501 gctccagacg cgaggaacat cgagccgccc agaccgcccg taaagaggcg ctggccgccg 3043561 aagccgagga gctggccgcc aacgcgacac aatggaaggt cgccggtgac cggctgcggg 3043621 caatcctcga tgaatggaag acgattagcg gtgtggaccg caaggtcgat gacgcgctgt 3043681 ggaagcgcta ctcgacggcc cgcgatacgt tcaaccggcg gcgagggtcc cacttcgccg 3043741 aattggaccg tgagcgatcc ggcgtccggc aaagcaagga acggctttgt gaacgggccg 3043801 aggagttgtc cgagtcgacg gactggaccg ccaccagcgc ggagttccgc aagctgctcg 3043861 ccgactggaa agcggcggga cgcgcgagca aggatgtgga cgacgccctg tggcgtcgct 3043921 tcaaggccgc gcaggactcc ttcttcacgg ctcgcaatgc cgccaccgcc gagaaggagg 3043981 ccgagttgcg agccaatgcc gacgccaagg aggcgctgct ggccgaagcg gagcggctcg 3044041 acacgacaaa ccacgaggcc gctcgagcag cgctgcggtc gatcgccgag aagtgggacg 3044101 cgatcggcaa ggtgtcgcgg gagcgggccg cggagctgga gcggcgacta cgcgcggtcg 3044161 agaaaaaggt gcgagaagcc ggcgaagcgg attggtccga cccgcaggcg cgggcccgcg 3044221 ccgagcagtt ccgcgcccgg gccgagcagt ttgaacacca ggccgagaag gcagcagcgg 3044281 ccggtcgcac caaggaagcc gacgaggcga aggcgaacgc cgaacaatgg cggcagtggg 3044341 ccgaggcagc cgccgacgcg ttgacccgac gcccctaacg gtcggtgccg cggtcgggcg 3044401 ttgtcccggc ctcggagtcc gtttgcacgt ggtccagcag cgtcttgcac tgttgttgcg 3044461 cgacgacgcg gcgacgccgc tcctcggctg cgagctgcac gatggtgcgc gaccacacca 3044521 cctgggccca gtggaatgtc aacacgatcg cggtgatcca ggcgacgatg agcccgatac 3044581 cggggccggg atgaccggcg gcgaccgtct gacgcgacca tacggccagc agcccggtac 3044641 cgctggccat cgccgaaccc gccagcgcca cccaagccag cgcccaccgc cgggtgagca 3044701 acgccagcat cgagaagcca acgccgaaca ccagcgccaa ccaggcgaat acccgcgagg 3044761 gcagcgcgac ggcggccctg ccggcgccgt ggctgctgaa caacacatcc cagccgcgca 3044821 cgcttccggt atgcggcagg ataaacgacc ccaacagcac gaacaccagg attgcgacaa 3044881 ccaaagccct cgcgcctggc tcgatttcgc gcgcaacgcg gcgttctgcc gcctcgatct 3044941 cagcgcggag ggcgtcgaga tccccggcgt cgtgttcgtg gctcatcatc tgcatcctcc 3045001 gggcttggcc gcgctgaccg gcagcccgac cccaggcatg cccaggccga cggcgcgccc 3045061 cggctgcccg gcggtgtgcg cgtcgccggc gcgggtgcgg cggtgggtca ggacgccggc 3045121 gtcggcgatg aggtggtgcg gcgccgcttc ggtgaccttc gtggtgatga cgtcgccggg 3045181 acgcacgcgc ggctggccgg cggtgaagtg caccaggcgc ccgtcgcgcg cccgcccgct 3045241 catgcgcgcc gtgacggtgt ccttgcgccc ttccccggtg gccaccagca cctcgacggc 3045301 ctgcccgacc agggcgcggt tggcttccag cgagatttgc tcctgcagcg cgatcaggcg 3045361 ttcatagcgt tcctgcacaa cggctttcgg cagctgtccg tcgagttgcg cggccggtgt 3045421 cccgggccgc ttggagtatt ggaaggtaaa tgcggccgcg aagcgggccc ggcgcaccac 3045481 gtcgagcgtg gccgcgaagt cctcttcggt ctccccgggg aaaccgacga tcagatcggt 3045541 ggtaatcgcg gcatgcggga tggccgcccg cacgcgctcg atgatgccga ggtagcgctc 3045601 ggcacgatag gaccgccgca tcgcgcgcag gatccggtcg gatccggact gtagcggcat 3045661 gtgcagcgcg gggcagacgt tgcgcgtctg cgccatcgcc tcgatgacgt cgtcggtgaa 3045721 ttcggccggg tgtggggagg tgaaccggac ccgctccagc ccgtcgatgt ctccgcaggc 3045781 ccgcagcaac tcggcgaaag ctccccgatt acggggcaat gcggggtcgg cgaacgagac 3045841 gccgtaggcg ttgacgtttt ggccgagcag ggtgacttcg agcacaccgt cgttcaccaa 3045901 ggaccgcacc tcggccagga tgtctgccgg gctgcggtcg acctccctac cccgcagcga 3045961 cgggacgatg cagaacgtgc agctgttgtt gcagcccacc gagatggaaa cccacgcggc 3046021 ataggcagat tcgcgggagc tgggcagcga cgacgggaac tgttgcagcg cctcggcgat 3046081 ttcgacctgg gcgaccttgt tgtgccgggc gcgctccagc agcgtgggca aagacccgat 3046141 gttgtgggtg ccgaagacaa cgtctaccca cggcgccctg cgcagcacgg cgtcgcggtc 3046201 tttttgcgcc aggcagccac cgaccgcgat ttgcatgtcg ggattggcgc gcttgcgcgg 3046261 ggccagatgg ctgaggttgc cgtacagcct gttgtcggcg ttctcgcgga cggcgcaggt 3046321 gttgaacacc acgacgtcgg cctcggaacc gtcggtcgcc ctccggtagc cggccgcttc 3046381 cagcagaccc gccagccgct cggagtcgtg gacgttcatc tgacagccgt aggtgcggac 3046441 ctgataggtg cgcgctggcg ctcgccgcac gggcggcccg gcgccctcgc cggtcacccc 3046501 cgcggcggca tcgtgcgcca ccatcgaagt cacggggcca tggtacggcg gctgggcggc 3046561 tcgcggccca gcggatggtg tcgcctcgtc gcagcatcgg gctagcgggg acgcgctcga 3046621 cacggtggcc gatcacggct tcgctgcaca ccggctcgaa gaagtcggcc acgcgcatga 3046681 ggtagtcgcg tcgtcaccga cactatggct cgcttgcctc taaagcatcg cttatgccac 3046741 agaccagact tgtcggagcc gctgtctagc atcggggacc gggtgctcgg cgcggacaaa 3046801 cgtcatgaag ggaatcgata atgtcggatc gctcagcgat cgaatggacg ggggcaacct 3046861 ggaacccggt caccggatgc gaccgtgtat cgccgggatg tgaccactgc tacgcaatga 3046921 cgttagcgaa gcggctaaag gcgatgggct ccgacaagta tcaaaccgat ggtgacccca 3046981 gaacctccgg tccgggattt ggcgtcacca tccatccccg cagtcttgac gagccgttcc 3047041 ggtggcgaag cccccgcaca gtgttcgtga actcgatggc ggacctattt cacgccaggg 3047101 tggcgctctg gttcattagg gaagtgttcg aggtgatgcg agccacacca cagcacactt 3047161 accagatctt gaccaagcgc agcctgcgac tgcgtcgcct cgctcacaag ctggagtggc 3047221 cctcgaacgt ttggatgggg gtgtcggtgg aaaatgtcga cgccttccgc cgtatcgagg 3047281 acctacgaca ggtgcccgca gcagtaaggt tcctctcctg cgagccatta ctcgggcccc 3047341 tggacggaat aaatctaggt tcgattgatt gggttatcgc cggaggcgaa tctggtccaa 3047401 atttccgccc gatcgatcca caatgggttc gccatattcg cgatacctgt actgccgctg 3047461 atgtcccatt cttcttcaag caatggggcg gtagaacacc aaaggcattt ggacgtgaac 3047521 tcgacggacg ttgttgggat gaaatgccgc ttattgagat tagaaacccg gatcctcgga 3047581 ccaccagccg cgtgcacgcg gatcccatgt tggcgacggc gcccacagaa tctgcccagc 3047641 gttcgaatcc tggacagcta gttcgccaac gctgaataat cccatctcgc cacggtcctc 3047701 ggactccttc tgctgtttcg cgctcttggc ttgccgcatc atctctggct ccttttgagc 3047761 cgcccggttg tacaggtggc acatgatcgc atctccggcc cagtgatcgg tcgcgaacac 3047821 catgtcaaag attgtgacct tattgtgcat ctgcatggga atacgatgcg aatacttgta 3047881 tcccagctca tactccagct tgacgcgcat gagattaacc atctcggcac ggtaggcagg 3047941 cgcagttaga tggtggcgcc atcgcgctgc ctgtatccgc ttccaatccg cgtctccgta 3048001 catgcgggtg acctgctcga taaacagttc cgcgttcgtg cccttcacgc cccgcgcgat 3048061 catggtgggt gacatcaaca tccatagttc ggtcttgagg ttacgagggt tctggcgaaa 3048121 ggcggcgacc ttattgatcg tttcccaatg gacttcagcg gcctgttggt cgatgaaagc 3048181 gaaggtgggc gcccaccgcc aagggcctag ttcggcaagt gtttcatcga ttgttacgtt 3048241 ggaatcgccg gccacaacgc ggtacctacc gtcaccggga aagcgggtcc gaagggcgac 3048301 gtccaattca gaggcaagcg ggttaagctc gcaaaaccgg agccgcgtga aaggtggatc 3048361 ggctttcata gcgataagag aagagccatc aaatttctct cccatgtcgc ggtctatgtt 3048421 ctcgggctgg cccgccatca agtcgaggta aattcgttca cgagaagtct gactagccct 3048481 gttgaaggcc gggaggtacc cggcaagtat ctccagtttg tttcgcgtcc aatatgacca 3048541 ttctctagcc atcgaatccc tttagacgcg tcggcgctcc cgctcggcgg ccagctcggc 3048601 gataaccacc tcgcacgcca aggtctggcc gtacccacgg cgcgccaaca tcgccaccag 3048661 cctgcggctc acccgcgctt cgtcggtgcc gtcgtcgatc agcacctccc gccgcagcct 3048721 ggcccgtacc agcttttccg cccgcccccg ttcggcaccg gcgtcgatgc ccccgagcac 3048781 cgtggtgatc acgtcgtcgt cgacgccctt ggcgtgcagc tcggcagcca acgcgcgctt 3048841 gctctttgct gcgttcgccc gcctggactg aacccattgt tcggcgaagt cggtgtcatc 3048901 caccaggcca acggcggcca gccgatccaa tacccggttg ccgatgtctt cggggtagcc 3048961 gcgcttggcc agctggccgg ctaactcggc gcgggtgcgg gatcgcgcgg tgagcaggcg 3049021 caggcacagt gcccgcgcct gctcttcgcg ctcagaagtc gacgggggcg ggcaggacac 3049081 cgtcatttga gggatcatcg gtcaccacgg caccaatgcc aagcttttcc ttgatcttct 3049141 tctcgatctc gtcagccacg tcggcgttct ccaccaagaa gttgcgggca ttctccttgc 3049201 cctggccgag ctgctcgccc tcgtaggtga accaggcacc cgacttgcgg atgaggccct 3049261 gatccacacc catgtcgatc agcgagccct ccctgctgat tcccttgccg tagaggatgt 3049321 cgaactcggc ctgcttgaag gggggcgaac agttgtgcac gacaacccct tcggcgacga 3049381 gggtgtgcag ttcctcgacc tcgaggtcga acgttcgtgc ccgccgcgtt ggcagcactt 3049441 ctcggatcac ggaatagcgg agttcttccg ccagcatgtc gtgcaggaat ttgtcatcca 3049501 gggcatccgc gagcgcctgc acgcgatccc gacgaaggcg gctggcacct aagacctgct 3049561 tcattccacc gcgggggtcc ccggaagcta caccgatcat ggccgcggcc tcctgcgcgg 3049621 tcacgccgcg ctcgtccaga taattcagca cggcatcggt catctctgca gccagatatg 3049681 tcgcttgcga tccacgacgc cgcccctgcg tggcttctgg aatcgcctgg ataagcgcgg 3049741 caccgcgcgg cccccacatg ggaactgact ccgcgaatgc cgtgacgtta tccatacccg 3049801 agatccggac ctcgaacact tgacgtttgc tctggatccg tcgaccgttg acgatgctcg 3049861 gccgcttctg ggtcggatcg taatctcgaa cggtgctccc gacaccgaac cgcagcagca 3049921 gccaatgaat ctgatgcgcg agttgttcag aggtcgtcgt gtaaccgacc cgaagtgccc 3049981 cggtctgttc ccggctcacc cacccgtcgc tttcgaacag gccgaagagc agattgccga 3050041 caatgtcggc cgcgatgtcc ggctcgaaga accaattcgg aatcgtcttc tcccacgcga 3050101 gcttgccgta gataccggcc tgctgacaaa ggtctgccac accgttgcgc tcaccgggtc 3050161 gatgagcgat cgcgagtgag atacgcccct gcggatgggc cgcgcaaccg agcgtcgcag 3050221 cgattcgcgt cacgtcgtca atgagcgccc gctgaacatt gatgaagttg atcggagtct 3050281 tgccccccac ccaaccatcc ctgccatctc cgatcaggta gccaagcagc cgggcatgat 3050341 ccgccggaat cggcgcactg tcaccgaatc catcgaagcg tcgcggttgc gccaccctgt 3050401 ctcccttgcg gagttccccg gcggcacgcc agccgtactc tgtcagcacc ttgtgatcgg 3050461 gtgtcgccca cacgatggcg ccaccggcga tccgcaaccc gatcacatcc cgcgttccct 3050521 ggtcgaacca ggacaccacg ggccgcgcat gcagcgttcc gtccttggca gcagccacga 3050581 catgaatagg cttgcgccca tcgacaacat cctcgatgcg atgcgttgta ccggtgaccg 3050641 gatcgaagat ccgagtgccc tctgcgaggc acttgttctt gacgaccttg acccgggtgc 3050701 ggttgccgac cgcgttggta ccgtccttga gcgtctcgac tcgccgcacg tccatgcgca 3050761 ccgacgcgta gaacttcaac gcctttccgc ccgttgtcgt ctcgggcgac ccgaacatca 3050821 ctccgatctt gtcgcggagc tggttgatga agatcgccgt ggtgcccgaa ttattcagcg 3050881 cgccggtcat tttccgcagc gcctggctca tcagccgggc ctgcagcccg acgtggctgt 3050941 cgcccatctc gccttcgagc tccgcgcgcg gcaccagcgc cgccaccgag tcgatcacca 3051001 cgatgtcaag cgcacccgag cggatcagca tgtcggcgat ctcgagtgcc tgttccccgg 3051061 tgtccggctg gctgaccagc agcgaatcgg tgtcgacacc gagcttcttg gcatagtccg 3051121 gatccagcgc gtgctcggcg tcgatgaacg ccgcaacacc accggcggcc tgagcgttgg 3051181 ccaccgcgtg cagcgccacg gtggtcttac ccgacgactc cgggccgtat atctctatca 3051241 cccggccacg cggcaggccg ccaatgccca gggccacgtc tagtgcgatg gatccggtcg 3051301 gaatgaccga aatcggctga cgcgcctcgt cgccgaggcg catcaccgaa cctttgccgt 3051361 aactcttctc gatctgggcc actgccagct cgagcgcctt ttcccgatcg ggggtctgcg 3051421 tcatggtgcc tctcctgtgg tcggtgttcg attgaccggt atcggtcggt tggccgtgac 3051481 actagagaca gccactgaca agtcggctgc tccgaatgat caccacagta gccgaacacc 3051541 tgttcgattc aagtgtgaca cgccgcgtgt ggcaacatcg cgtccgcgct cgtcggcgcg 3051601 tcgaacgccc tggcggcggt gcggcccgac ttgcgtgcgc ggctggtccg gatcaccgac 3051661 gatctgctca acaccgctag cctggccgga tccggcgtgc tcaccggccc ggatctgacc 3051721 tttcggcgtc gcagctgctg cctgttctac cgggtacccg ccggaggcaa gtgcggcgat 3051781 tgcccgcttt gacgaatgtg caacctcacc accgatcgtg gggaacgtcg aagtcggcgc 3051841 acaatgcccg ccagacgtcg cggggctcga caccgtcctc gatggcctgg gcggcgctac 3051901 ggccgtcgaa gccggtcagc acgtgatcga gcagcaccga cgagccataa gccgccccga 3051961 aatgcagggc tacccgctcg tggaactccg tcagccgcac gccagccaac atacccagcg 3052021 cgctaccccg ccaacgcaag cgcgtcgtgg cagacccgca ccggatcggc cgcaccggcg 3052081 acgctggcgg ccgcccgccg cgcggcctcc cggaacctcg gcgacgacaa cacctcgttg 3052141 accgccgcca ccagcgcgtc ggcggtcaac ggccggatca gcaccgcgct accctgccgg 3052201 actacccggt tggcgatctc ccactgatcc ccgccaccgg gaaccaccac catgggcacc 3052261 ccggccagca gcgtcttggc caccatccca tgaccaccgc cgcagatcac cagatcggcc 3052321 cgcgtgagca gctcggcctg gctgcccagc ccggccaccg cccagggcgg caccgtcagg 3052381 tcggctccgc tcaaacgcga caccaccagg cgcgatcccg acggcaccgt ctcacccggc 3052441 gtcagagact gcaacgcgac ctccgtcaat ccggcggtcc cggtcaacgc ggtggacggc 3052501 gccacgacca ccaccggccc ggtgccggcg gggatggcca gcacccgatc ggtcggctcg 3052561 aaatgcagcg ggcccaccac gacggcctcg gccggccagt ccgggcgggg aacctcgagc 3052621 gcgggcagcg tggcgatcag ccggcgcagc ggcccgggat cgcgggccgg caatccgatc 3052681 tcgacccgaa cggcggcacg ctggcgcagc ccggcacgcc aggaccgccc cgtcagcgct 3052741 cgcatggtgg catcgcgcag ccggccgcgg ataccggtgc ctgcagccag tccgctgccg 3052801 atcggcggca gtcccttcga cggcaggtac agcggatgcg ggttgagttc cacccacggg 3052861 atccctagca gttcggctgc catgccgccg cacgccgtga tgacgtcgga caccaccagc 3052921 tccggttcca gagcccgcag ccgcggcacg ttgagcacgg ccatctgcgc cgctcgccga 3052981 tggatcctgg ccccggcgtc gagatcgcgg tcggtggccg ccagcccgtc cagctcgacg 3053041 gcgtcaatgc cagcggcgcg ggcggcttcc agccattcca ccccggtgaa cagggtgggc 3053101 gtgtcagcgg ctgcgcggaa acgctggcac agcgcgatcg ccggaaacga gtgcccggga 3053161 tccggcccgg cgaccacggc gacgcgcatc ggccctaccc tgccacagcg ccacagccgt 3053221 aggctgacag ccatggccga gctgaccgaa acatcgccgg aaacccccga aaccaccgag 3053281 gccattcgtg ccgtcgaggc gttcctcaac gccctgcaga acgaagactt cgacaccgtc 3053341 gacgccgcac tgggcgacga cctggtctat gagaacgtcg ggttttccag gatccgcggt 3053401 ggccgccgca cggcaacgct gcttcgccgc atgcagggcc gcgtcggctt cgaggtgaag 3053461 atccaccgca tcggcgccga cggcgccgcg gtgctcaccg aacgcaccga cgcgctaatc 3053521 atcggaccgc tgcgggtgca gttctgggtc tgcggcgtat tcgaggtgga cgatgggcgg 3053581 atcaccctgt ggcgggacta cttcgatgtc tacgacatgt tcaagggcct cttgcgaggc 3053641 ctggtggcgc tggtggtgcc atcgctgaag gcaacgctgt aggccgacct tccggatcaa 3053701 gcccaacgcg ctgtagaaca tcgggtagcg ctacagccag ccggctgccc gggcttatcg 3053761 ctactctgcg cggcgggcca gcaaagatgc gaagtgtggg cgaaaccgca aatgcatcgc 3053821 ctcggccgct atacgatccc catgcacagt cttgagggtg agctggcgat tttgggccga 3053881 cacgacgggc tgtggcgtgt ttggaggtct cagatgtcat ttgtgatcgc ggcaccggag 3053941 tttttaacgg cggcagcaat ggacttggcg agcatcggct cgacagtgag cgcggccagt 3054001 gccgccgcat cagcccccac ggtcgcgatc ctggccgcgg gcgccgatga ggtgtcgata 3054061 gccgtcgcgg cgctgttcgg aatgcatggc caggcatatc aggccctcag cgtgcaggca 3054121 tcggcgtttc atcagcaatt tgtgcaggcc ttgaccgcgg gcgcgtactc gtatgcctcc 3054181 gctgaagccg ccgccgtgac accgcttcag caactagtcg atgtgataaa tgcgcccttc 3054241 agaagcgcgc tcggccgccc cctgatcggc aacggcgcca acggtaaacc ggggaccgga 3054301 caagacggcg gggccggcgg actcttgtac ggcagcggcg gtaacggggg atcagggctg 3054361 gccggctccg gccagaaggg cggtaacgga ggagctgccg gattgtttgg caacggcggg 3054421 gccggcggtg ccggcgcgtc caaccaagcc ggcaacggcg gcgccggcgg aaacggcggc 3054481 gccggtgggc tgatctgggg caccgcgggg accggtggca acggcgggtt caccaccttt 3054541 cttgatgccg ctgggggtgc cggcggggcc ggcggcgccg gtgggctgtt cggcgcgggc 3054601 ggggccggcg gcgtaggcgg cgccgccctc ggcggcggcg cccaggccgc cggtggcaac 3054661 ggcggtgcgg gcggggtcgg tgggctgttc ggcgccggcg gtgccggcgg cgccggcggc 3054721 ttcagcgaca ccggtgggac cggcggggct ggcggggccg gcgggctgtt cggcccgggc 3054781 ggcggctcgg gcggcgtcgg tggcttcggc gacaccggtg ggaccggcgg cgacggcggc 3054841 agcggcgggc tgtttggcgt cggcggggcc ggcgggcacg gtggcttcgg cagtgctgcc 3054901 ggcggcgacg gcggcgcggg cggcgccggc ggcacggtct tcggctcggg cggggccggc 3054961 ggtgcaggcg gagtcgccac tgtcgctggc cacggtggtc acggcggtaa tgccggcctg 3055021 ctatacggca ccggtggggc cggcggagcc ggcgggttcg gcgggttcgg cggcgacggc 3055081 ggcgacggcg gtatcggcgg gttggtcggt tctggcggcg ccggcggcag cggcggcacc 3055141 ggtaccctaa gtggtggtcg cggcggggcc ggcggtaacg ccggcacgtt ctacggttcc 3055201 ggcggcgccg gcggcgccgg cggggagagc gacaacggcg acggcggaaa cggcggcgtg 3055261 ggcggcaagg ccgggttggt cggcgagggc ggcaacggcg gcgacggcgg tgccacgata 3055321 gcaggaaagg gtggtagcgg cggtaacggc ggcaacgcct ggctgacggg ccagggcggc 3055381 aacggcggca acgccgcatt tggcaaagcc gggactggca gcgtcggcgt cggtggcgcc 3055441 ggcgggctgc tggagggcca gaacggcgag aacggattgc tgcctagctg agccagctta 3055501 gccgcagctt ggcctcagcc accgggcgtg cggcggccca tcgaccgagg cacgtcgaaa 3055561 tcggtgcaca acgcccacca cgcgacgcgg ggctcaaccc cgtcctcgat cgcgcagatg 3055621 gcattgcgcc caccgaaacc ggtcagcaca tggtccaacc agcaccaaaa gcccggtacg 3055681 ccgcgccgaa tcgcgggctg accaactcgt ggaactccgt ccgccgtatg ccgccaaccg 3055741 cgtgcgtcag cgccttgtgg cacacccgca ccggatcggc tacctaaccc gcgccggccg 3055801 ctgccttggc gcggcacccg gtcggccatc caccgcagat ccccgccacc gaacgcaccg 3055861 aaacgccgac cttcggcccg cttcgtatcc ggttctgggc ctgcggcatt ttcgaggtac 3055921 aacgggcacg ctatggcatt accacttcgg cgtccaaggc gctggtgcgc ggtctgaccg 3055981 cgtcggcgtt ctcgtcgccg cgggctaccc tgtagcgaat gagcgacaac gcaatccgcc 3056041 cgcggcccaa cccgtggcag tacatccgct attgctacgg ggcgcggctg ccggactcga 3056101 tgcgagactg ggtgcgcaac gatctggccg gcaagggtgc ggccatccgg atgatgatcc 3056161 gcgtcgcggt tccggcggtg ctggtgctgg ccccgttctg gctgatcccg acgtcgctgg 3056221 acgtccactt gagcatgacg ttgccgattc tcatcccgtt cgtgtatttc tcgcatgcgc 3056281 tgaacaaggt atggcgccgg cacatgctgc gcgtgcacaa tcttgacccc gagctcgtcg 3056341 acgagcacgc ccgccaacgc gacgcccaca ttcaccgggc gtatatcgaa cgctacgggc 3056401 cacggccgga cccgaacgac taacgccggg gcaatccgcc gagctcgtca aacgcctgcg 3056461 cccaagcgac caggcgatcg gtggcgccgg ccaactcctc gcggtagcgc tgttgtccgg 3056521 gcccagcccc acccgcgccg ttcgccgagg aaaccaattg cgctgcggcg gtgaccattt 3056581 cgttgtactg acggacgccg gtgctcagct gcgcggtaaa cgcgttgatg gtcggcacca 3056641 gatacgaccg cgacgccgcc gagcattgca cggcccgctc catcgagacc acctcggccg 3056701 cggtcgccac catcgccgcc gaagtctggt tggccgcggc cgttaggtcg cggatctcgt 3056761 ccgccggcaa catggcgccc cgctccatga cacccaacag cgagaagaac ccgcgttcgg 3056821 aggcgcccag cgccgacatc gcgggtcgcg cggccgagcc cggtggtggc agccggcgca 3056881 cacttgcggg ccgccgcacc ggcagtggct ccgagcgcag ccagcggtag cgaagcagca 3056941 atagcgtcgc cggaatggcc tgcgtgaccg caatcgtgcc ggtaatcacc agcagcgacg 3057001 taaaccagcc ccaggccgcc aacagcgccg tcaccaaccc ccagagcaga cagcctgcgg 3057061 tgaataccag accccagcgc aatgcacggc ggcggcggcg cagcagccgg gcacgcggat 3057121 cgatggcgac gctgatcttt tgtgctacca ggtcggccaa atcaccggcg gtatccacgc 3057181 cgcgctgcag caacgaacgc cacggccggc gctgacccgc tttcactgcc atgccgaacc 3057241 gtctgcccaa ctactgaccg tagggctgct cggcaatagc cccgccagaa gtctcggtgg 3057301 ccggtctggg ggtagccgtg gtcccgccgg ccggcaacgc ttcaccgcgc atcgatgcgc 3057361 ggatctgttc caaccgtgaa tgaccggcca tctggatccc ggcctgctcc acctcgagca 3057421 tccggccctg caccgaactc tcggcaagtt cagccgaacc gatcgcgttg gcgtagcgac 3057481 gctcgatctt gtcgcgcacc tcgtcgaggc tcggcgtgtt gcctggcgcg gcgagctcac 3057541 tcatcgaccg caacgatgcg ctgacctgct cctgcatctt cgcctgctcg agctggctga 3057601 gcagcttggt tcgctcggcg atcttctgct gcagcaccat cgcatttcgt tcgacggcct 3057661 tcttggcctg agctgcggcg ctaagcgcct ggtcatgcag cgtcttgagg tcttcgacgc 3057721 tctgctcggc ggtcaccagc tgggctgcga acgcctcggc ggcgttgttg tattcggtgg 3057781 ccttggcagc gtctccggcg gcggtggcct ggtcggccag cgtcagggct tggcgcacat 3057841 tgacctgaag cttttcgatg tccgccagct gtcggttgag tcgcatctcc aattgacgct 3057901 ggttaccgat cacttgcgcc gcctgttgag tcagcgcttg gtgggtgcgc tgtgcttcct 3057961 caatggcctg ttgaatctgc accttggggt cggcatgctc gtcgatcttc gagctgaaca 3058021 gcgccatgag gtacttccag gctttaacga acggattggc catcagttag ctccgccttc 3058081 gcttcttgtg tgcgccagat ggtctcagcg ccctgtcgct caatttatcg ggtcagcgcg 3058141 cattgcccca cccatggcgc gcatcttgtc gacccggacc gaccggcgac ccttaggcca 3058201 ccgccagcga caccaccggc gcaatgacga ccttggtgct ggcgtcaatg gtggcgccgg 3058261 ttgctctgcc agccggggtg gcgcgggcaa ggcgctcttg acgcgccatc cgctcgcccg 3058321 catcgatgag caccaccgac aacgggagct gcagagccgt acaaatcgca ctgagcagct 3058381 cgctggaagg ctccttgcga ccgcgctcga tctccgacag atacccgagg ctcacccgcg 3058441 ccgaatcgga cacctcgcgc agcgtccgac cctgcgacat ccgcgctccg cgcagcacgt 3058501 caccaacgac ctcacgcacc aaagccgcca tcaaaaactc cttgtccacc tcgcaatcgt 3058561 catcaggtga acgccgccgg cggtggggtt ggttcccgca atcagctggc ggtctggcgg 3058621 atccccccga tgtcccgcag agccctggcc acgtaatcga caccagtgat cacggtgagc 3058681 aggatcgcgg cggccatcac taccaccgcc gcaacgtgca gcggacccga aagtggcaac 3058741 acgaataagc caattgccac cgcctggaca aaggtcttca gcttgccgcc ccagctcgcg 3058801 ggaatgacac cgcgcctaat aaccgccaac ctcaaaacgg tcactccgag ttcgcgggtc 3058861 aggattagca ccgtgaccca ccacggcaag tcgccgagca tcgacaatcc gatcagcgcc 3058921 gagccgatca gagtcttgtc cgcgatcgga tcgacaaacg caccgaattc ggttgccatc 3058981 ccgtaattgc gagccagcag gccgtcgaat cgatcggtaa tgcaggcggt tgcaaatatc 3059041 gcccacgcca ctacgcgggc cgcggagtgg tggccgccgc catagaacaa ggccagcagg 3059101 aagaccggga ccatcaccag ccgcaacagc gtcaggatat tggcgaggtt ggcaatgcgg 3059161 gcgcggcctg ctatctgacc cgtttcaggc tgcgccgaca cggcaacaga ataacgggtt 3059221 gacctgctca tgcgaccctt gatgtcgata ctgtttcaca cgtgaccgaa cgtccacggg 3059281 attgccggcc ggtggtccgg cgcgcgcgaa cctccgatgt gcccgcgatc aaacaactcg 3059341 tcgacaccta tgccggaaag atcttgctgg aaaagaatct cgtgacactc tatgaagcgg 3059401 ttcaggaatt ctgggtggcc gagcacccgg acctctatgg caaagtcgtc ggttgcggtg 3059461 cgttgcacgt gttgtggtcg gatctcggcg aaatccgcac cgtcgctgtc gacccggcca 3059521 tgaccggcca cggtatcggc cacgcaatcg tcgatcggct actgcaggtc gcccgcgatc 3059581 tgcagctgca gcgcgtgttc gtgttgacct ttgagaccga gttcttcgcc cggcacggat 3059641 tcaccgagat cgagggcacc ccggtcaccg ccgaggtgtt cgacgagatg tgccgctcct 3059701 atgacatcgg ggtcgccgaa ttcctggacc tgagctacgt caagcccaac atcctcggca 3059761 actcccggat gctgctggtg ctgtagcccg gcgagcagac gcaaaatcgc ctcatttcgg 3059821 cacgaaatgg gcgattttgc gtctgctcgg cgggctactc gccgccgtca ccccggatcg 3059881 cggccagtgt gcccgccaac tcgtcgggct tgaccagcac ctcacgggcc ttcgagcctt 3059941 cgctgggccc gacgatgccg cgggtctcca tcaggtccat caaacggccc gctttggcga 3060001 agccgacccg cagcttgcgc tgcagcatcg acgtcgaccc gaactggctg gacaccacca 3060061 gttccacggc ctgcaggaag acgtccatgt cgtcgccgat gtcggggtcg acgtcggtgc 3060121 gctccgcggt gggtttagcc gtggtgacgc cctcggtgta ttcgggttcg gcctgttcct 3060181 tgcaggcggt gacgacggcg tggatctctt cgtcggagac gtaagcgccc tgcagccgga 3060241 ggggtttgct cgcacccatc ggcaagaaca ggccgtcgcc catgccgatc agcttttccg 3060301 cgcccgcctg gtccaggatc acccggctgt cggtcagcga cgaggtggca aacgccagcc 3060361 gcgacggcac gttggtcttg atcagcccgg tgaccacgtc caccgacggg cgctgggtgg 3060421 ccagcaccag gtggatgccg gcggcgcggg ctttctgggt gatccgcacg atggcgtcct 3060481 cgacgtcacg cggcgcggtc atcatgaggt cggccaactc gtcgacgatg gccaccacgt 3060541 aggggtaggg ccgatactcg cgctggctgc ccagcggcgc ggtgatggcc ccggatcgca 3060601 ccttgtcgtt gaagtcgtcg atgtggcgca cccgggaggc ctgcatgtcc tggtagcgct 3060661 gctccatctc gtcgaccagc caggccagcg cggccgcggc cttcttcggc tgggtgatga 3060721 tcggcgtgat cagatgcgga atgccttcat acggcgtcag ttccaccatc ttcgggtcga 3060781 tcaggatcat cctgacctct tccggggtgg cccgggtcaa cagcgacacc agcatggagt 3060841 tgacgaagct ggactttccc gagcccgtcg agccggccac cagcaggtgc ggcatcttgg 3060901 ccaggttggc cgagatgaag tcgccttcga tgtccttgcc cagcccgatc accaacggat 3060961 gatggtcgcg acgggtctct cgtgcggtga gcacgtcggc caaccgcacc atttcccggt 3061021 cggtgttggg tacctcgatg ccgacggcgg acttgccggg gatcggtgcc agcatgcgca 3061081 cgctctcggt agccaccgcg taggcgatgt tgcgctgcag cgcggtgatc ttctcgacct 3061141 tgacgccggg ccccagttcg acctcgtagc gggtgacggt gggcccgcgg gtgcagcccg 3061201 tgacggccgc gtcgaccttg aactgggtca gcacctcacc gatggcgccg gccatgtggg 3061261 tgttggccgc actgcgtttc ttgggcggat caccggatat cagcaggtcc agcgacggca 3061321 gcgtgtaggg accctcgacg atccggtcca gcacttgggt atctttgcgg cggccgcgtc 3061381 ttccggagcc ccgaccggcg gaggcttccg gtatcgtcgc agtgtcatcc tgcggaacct 3061441 cggccgacgg ccaggccggt ggcccgtcgt cggagcacag gggcacctcg tcgtagtaac 3061501 cgtcggagaa gtcctggcgg gcgacttcga cggtgtccgc gtcgtcacca tcgaagtccg 3061561 cgaagtcctc gaagtcgtcg gcgtattccc gtggcaacag ccgggtgccg aacatggcgc 3061621 gcatggcatc tggcacctct cggatcgtga tcccggccag caggagcaat ccgaacagcg 3061681 cgccgatgaa taacagcggc gcggcgatcc aggcggtcaa cccgtccgag agcggcccgc 3061741 cgatcgcgaa accgatgaac cccgcggcgc gcaaacgcga ctccggggcc tcgggtgagc 3061801 ccgcccacag gtggcacaag ccgagaaacg acaagccgat caggctggcg ccgaggatca 3061861 gccgcggccg cgaatcgggg ttgggcgacg tacgcatcag caccacggcc acggcggcgg 3061921 caaccagcgg gagcatgacc actgccgacc cgatgaacgt ccgcaacaag gcgtcgaccc 3061981 acgcgccgag cggccgggcg gcgtcgaacc acgagctcgc ggcgactacc acggcaaggc 3062041 cgagcagcac cagcgcgatt ccgtcgcggc gatgcccggg ctcgatgtcg cgggctcgcc 3062101 cgatcgaccg cgccgcgccg ccggtgccct tggccgccat catccagacg gcacgcatgg 3062161 cccggccgca ggcgagtccg gtagacacca gcagcgaccg atggtgccgt cgggagggtc 3062221 tgccgacccc tttgacgggc ctcgacctct ttctgggcac ggccgatcgc gcacttcggg 3062281 acgcgccccg cgaagtggcc tttgacctgc tcgttcgagt gccggagcgg gcaacggtct 3062341 tgctagacat aacggcaagc ctagtcgcta tcacaccatc tacaccatcc gccacactgg 3062401 taacggcgat ctgctcgcct cgttgccagg gtctcctgag tagggtgaca agtgatcgtg 3062461 ccgcgtcacg ccgcccgacg cgcggagttc caggaggccc cagcatgccc gtcgtcgtcg 3062521 tcgccacgct gaccgccaag cctgaatcgg tcgacaccgt ccgcgacatc ctcacccgcg 3062581 cggtcgatga cgtgcaccgc gaacccggct gccagttgta cgcgctccac gaaaccggcg 3062641 agaccttcat cttcgttgag caatgggccg atgccgaggc gctcaaggcc catagcggcg 3062701 cccccgcggt tgccaccatg tttaccgcgg ccggcgagca cctggtcggg gcgccggaca 3062761 tcaaactgct gcagccggtt cccgccggcg acccgagcaa agggcagctg cgccggtgat 3062821 cgaccggcca ctcgaaggca aggtcgcctt catcaccggc gccgcgcgcg gcttgggccg 3062881 cgcacacgcg gttcgactgg cagccgacgg cgcgaacatc atcgcggttg acatctgcga 3062941 gcagatcgcc agcgtgcctt atccgttgag caccgccgac gacctggcgg ccaccgtcga 3063001 gctcgtcgag gacgccggcg gcgggatcgt ggccagacag ggcgacgttc gcgatcgcgc 3063061 atcactgtcg gtcgcattgc aggcgggcct tgacgagttc ggccggctcg acatcgtggt 3063121 ggccaatgcc ggtatcgcga tgatgcaggc cggcgacgac ggctggcgcg acgttatcga 3063181 cgtcaacctc accggcgtct tccacaccgt acaggtggcg atcccgaccc tgatcgagca 3063241 gggcaccggt gggtcgatcg tgttgatcag ctcggccgcg ggactggtcg gcatcggcag 3063301 cagtgatccc ggatcgcttg gctacgcggc cgccaagcac ggcgtcgtcg gcctgatgag 3063361 ggcgtacgcg aaccatctgg caccgcaaaa cattcgggtt aactcggtac atccttgcgg 3063421 ggtcgatacg ccgatgatca acaatgagtt cttccagcag tggctaacca ctgctgacat 3063481 ggacgcgccg cacaacctgg gtaacgcgct gcccgtcgag ctggtgcagc caaccgacat 3063541 cgccaacgcg gtggcatggc tggcgtccga ggaggcgcgc tatgtcaccg gcgtcacctt 3063601 gccggtcgac gcgggctttg tgaacaagag gtagctgatg gctcgaaatc ccgctgcgca 3063661 gaccgccttc ggcccgatgg tgttggcggc cgtggagcaa aacgaaccac ctggccgccg 3063721 cctggtggac gacgacctcg cggacttgtt cttgcccaga ccattgcgat ggctggccgg 3063781 tgcaacccgg tcggcggtgt tgcgtcgttt actcattagc gcctcggagt ggtccggccg 3063841 cgggttatgg gccaatctgg cctgccgtaa acgcttcatc ggagacaaac tcgacgaagc 3063901 gctcggcgac atcgacgcgg ttgtcatcct cggagccgga ttggacaccc gtgcctaccg 3063961 gttgacgcga cgagtgcgga tgccggtatt cgaggtcgac ctgccggtca acatcgcccg 3064021 caaggccaag acggtccgac gggtgctcgg tgaactgccg ctgtcggttc gcttggttgc 3064081 attggatttc gagcatgacg acctgctcac cgctctggcc gagcacggct accgtaccga 3064141 gtaccgggtg ttcttcgtct gcgaaggtgt gacccaatac ctcaccgagc gggccgtccg 3064201 gcggaccttg gagggcctac gcgcggccgc accgggcagt cgaatggtat tcacctacgt 3064261 ccgccgggac ttcattgacg gcaccaaccg ttacggtacc cggacgctat accacacggt 3064321 tcgccagcga cgtcaactgt ggcacttcgg cttagatccc gaggaagtag ccgggtttct 3064381 cgccgactac ggttggcggc tgaccgagca ggccgggccg gaggagcttg tccagcgcta 3064441 cgtcgagccc accggccgca acctcaacgc atcacaaatc gagtggtctg cctacgccga 3064501 gaagagtgag ccggttacac ctcgatgacc gtcggcacaa tcatcggctg gcggcgatag 3064561 gtttccccca cccacttgcc gaccgtgcgg cgcacccctt gagcgatccg gatcggatcg 3064621 gtgacgttgg cggccaccaa cgattccagc tctgcctcca ccttgcgcac ggcgggttcg 3064681 agcgccttgg gatcttcgga gaaaccccgc gagtgtagat gtggcgcagc caacggctgg 3064741 ccggtgccac gtctgaccac gacggtcacc gcgacaaagc ccgacgacaa aatgagccgc 3064801 tcgcccaggg tgatatcgcc gacgtcgccg gcgatcaagc cgtcgacgaa catcttgccc 3064861 accggcaccg caccggagat actggctttg ccggcaacca ggtcgacgct gacaccgttc 3064921 tcggccaaca gaattgactc ttgcggtacg ccggtactgg cggccagctt ggcattggcg 3064981 cgcagcatcc gccaggttcc gtgcaccggc atcacgttgc gcggccgcac cccgttgtag 3065041 aggaacagca gctcaccggc gtacgcgtgg ccggaaacat gcacccttgc ttgggcgttg 3065101 gtgacgactc tggcgccgat cttggacagt gcatcgatga ctccgaagac cgcctcctcg 3065161 ttgccgggga tcagcgacga cgacaacacg atgagatcac cagcagtcaa cgtgatgctg 3065221 cgatgctccc cacgcgacat tcgcgacaac gccgacatcg gctcgccttg ggtgccggtg 3065281 gtgatcaaca caacttggtc gggcgccatc gtttcggcgg cggcgatgtc gatgagatcg 3065341 gaatcagcca ctcgtaggaa gcccagttgc cttgcgacgc gcatgttgcg caccatcgat 3065401 cggccgacga acgacactcg ccggcccaat gccactgcgg catcgatgat ctgctgtacc 3065461 cgatccacgt tggaggcgaa acacgcaact atcacccgtc cgtcggcacc ccggatgagc 3065521 cggtgcagcg ttgggcccac ttcgctttcc gatggcccga caccggggat ctcggcgttc 3065581 gtcgagtcgc acagcaacag gtccacgccg gtgtcgccga gccgcgacat gcccggtaga 3065641 tcggtgggac ggccgtccgg tggcaattgg tcgaacttga tgtcgccggt gtgcaggatg 3065701 gttcccgcgc cggtatacac cgcgatggcc aacgcgtccg gagtggaatg gttgacggcg 3065761 aagtactcgc actcaaacac gccgtgccgg gtgctctggc cctcgcggac ctcgacgaac 3065821 accggtgtta tgcggtactc acgacatttc tctgcaacca gagccaaggt gaacttcgag 3065881 ccgacgaccg ggatgtcggg tcgcagcttg agcagaaacg gaatcgcccc gatgtggtcc 3065941 tcgtgcccgt gggtcaacac cagcgcctcg atgtcgtcaa gccggtcttc gacatggcgc 3066001 atgtccggca ggatcagatc gacaccgggc tcgtcgtggc caggaaacaa cacaccgcag 3066061 tcgataatca acagtcggcc caggtgttcg aaaaccgtca tgttgcggcc gatttcgttg 3066121 atgccgccca gcgcggtgac ccgcaacccg ccggaggtca ggggacctgg cgggggaagg 3066181 tctacatcca cttctgggcc accctttggc tcacctttag atcaccgaag caccgaggcc 3066241 gcgcgcatgt cggcggccaa cgcgtcgatc tgctccggtg tcgcggccac ctggggcagc 3066301 cggggatcac cgacgtcgat gccctgcagc cgcaagcccg ccttggacaa cgtcacccca 3066361 cccaggcggc tcatcgcgtt gcacagcggg gcgaccgcaa tgttgatctt gcgggcggtg 3066421 gcgatatccc cagaaccgaa ggcggacaac aactctcgaa gctgcccggc tgccaggtgg 3066481 gcaatcacgc tgatgaagcc cgtggcgccc atggccagcc agggcaggtt gagcgcgtcg 3066541 tcgccggaat agtaggccag tccggtgtcg gccatgattt gggcgccgct gtgcaggtcg 3066601 gctttggcgt ccttgactcc gacgatgttc ggatgcgacg ccaacgcgcg gatcgtgtcg 3066661 ggctcgatcg gcaccgccga ccgccccggg atgtcataga gcagcatcgg cagctcggtc 3066721 gcgtcggcga cggcggtgaa atgggcttgc agcccccgct gcggcggctt ggaatagtag 3066781 ggcgtgacca ccagcagccc gtgcgcaccc tcggccgcac aagccttggc cagccggatg 3066841 ctgtgcgcgg tgtcataggt gccggcaccg gcgataacac gggcccggtc ccccaccgct 3066901 tccaagacgg cccgcagcag ctcgattttc tccccgtcgg tggtggtcgg cgactcgccg 3066961 gtggtgcccg agaccaccag accgtcgcac ccctgatcga ccaggtggtt ggccagccgc 3067021 gccgcggtgg cggtgtccag ggagccatcg ccgctaaacg gtgtcaccat cgcggtcagc 3067081 agggttccta ggcgcgctgc gacgtcgaat ccgacggtgg tcacggctcc caaggttacc 3067141 tggcgcttta tcccggccgc gagcgcgcgt gtttgtccag cgacacgccg cctcaggctt 3067201 cggtcgccaa cgggctggtc gccacctcgg tgccgtcggc cagggtggtc acctcgaagt 3067261 cggcgaacac cgcgggggcc acggcggcga gctggcgcag gcattcgatg gccagtcgcc 3067321 ggatttccac gtcggcgtgc tcgctggccc gcattgcgat gaagtgccgc caggcccggt 3067381 agttgccggt caccacgatg cgggtttcgg tggcgttggg cagcaccgcg cgggcggctt 3067441 ggcgggcctg cttgcggcgc aggatcgcgt tgggttggtc ggcgaacttg gcttccagct 3067501 tggccagcag ctcgctgtag gtggcgcggg cggcgtcggc ggcctcggtc aggatgtggc 3067561 gcaggtcggc gtcgtcctcc atgccgggcg gcacgacgac ccgcgagtcc ttctcgggta 3067621 cgtagcgctg ggagagctgc gagtaggaga aatgccggtg gcggatcagc tcgtgggtgc 3067681 acgatcgcga gatcccggtg atgtagaacg acacgctggc atgctctagc accgagaaat 3067741 gtccgacgtc gatgatgtgc cggaggtagc cggcgttggt ggcggtcttg ggattgggct 3067801 tggaccagct ctgatagcag gcccggccgg cgaactcgac cagcgcgggt ccgccgtcgg 3067861 cgtcggtggt ccagggcacg tcgggtgggg ccaagaagtc ggtcttggcg atcagttgca 3067921 cgcgcagcgg cgcggtctcg gccacggcgc tcaccttagc gccggccgca actagacgaa 3067981 ctcggtgtgg caggtcagcc cgggctcccg gcgcagacgc gggtccgcgg tcagcagggg 3068041 gatgtcgagg tgactggcca gtgccacgta gagggcgtcg taaaacgtga agttgtgccg 3068101 cagggtccac gcccgtcgag cgtccgcgtt ggcgctggct cccagagttg aaccccacca 3068161 aatctgttgc ctgaagaagc cgatctacct aacggggatc gttgcccttg aagtcgcgaa 3068221 caaataggca agtgtccagc ggccagatcg gacccgcaac gaaagttgcg gtaccaatcg 3068281 ccgcaccgct cctgccgatg gctacaccgg gaccatcgta cgcagctgtg tcatgcatac 3068341 cggtcacccc gaatgacccg ataacaggta ccgttccaga tccccgcgac gccgcaggaa 3068401 gatcatgtcc tcgctgcagc tcaaggactt ccccgaaccg caaggttttc catccatcac 3068461 tcatctaagc cgccccagtt gctcccgcac gaccctttcc agccgcgccg actcatcgaa 3068521 cgcctccagc aacgccttcg acaaccgggc catcttctcg tcgatcggct ctccgtcgtc 3068581 ctcgaccgcg ggcgtaccca cataccgccc cggcgtgagc gcatagtcgg tcgccttgat 3068641 ctccgccaac gtcgccgact tacagaaccc cggaacatcc tcgtacataa tccctttgac 3068701 ggcagccgac ttcgacccgc gccacgcgtg gaaggtatcc ccgatgcgga cgatctcctc 3068761 gttggtcagc gcccgctcgg cccggtccac taggtcgccc agttcacgag cgtcgatgaa 3068821 cagcacctgc ccgcaccggt cgatagaccc ttgcttacct gccgccttgt ctttggcgaa 3068881 aaaccacagg cacaccggga ttccggtgct gcggaacagc tgggtgggta acgcgaccat 3068941 gcaggaaacc aaatccgcct ccacgatctg cgcgcgaata tccccctcgc cgttggagtt 3069001 cgacgacatc gacccgttgg ccatcaccac gcccgcccga cctcccggcg ccaacttgta 3069061 caggatgtgc tgaatccatg cgtagttggc gttattggcg ggcggaacac cgaagcgcca 3069121 gcgtgggtct tcctcgttgc gggcccagtc tttgatgttg aacggcagat tggccatcac 3069181 gtagtccatc tgcacgtccg ggtgctggtc gcgggcgaag gtatcactcc atcgggcgcc 3069241 gagccccttg ttgtcgatgc cgtggatggc gaggttcatc ttcgccatcc gccaggtctc 3069301 ctcaatgctt tcctggccat agatcgagac atccttcgga tcgccgtcgt gttcgtagat 3069361 gaacttctcg gtctgcacaa acatgcctcc ggaaccgcag cacgggtcat acacccgccc 3069421 actcgacggc tccagcacct ccacgatcac cttgaccacg ctgggcgggg taaagaactc 3069481 gccaccccgc ttcccttccg cgcgagcgaa attgccgagg aagtattcgt agacctcacc 3069541 catcagatcc cgggcgcggt gctcgccctg ccggctgaag cgcgcactgt taaataggtc 3069601 gatcagctca ccgagccggc gctggtcgat gttgtccttg ttatacagcc tcggcagcgt 3069661 cccaccgagt gttggattgg ccttcattac cgcgtccatc gcctcgtcga tcagctgacc 3069721 gatgttcttc gccggctcac caccaacggc tggcttgcct tttgtgttct ctgccaagaa 3069781 cttccagcgc gcactcaccg gcacgacgaa tacgccgtaa ccctggtact gctcgggatc 3069841 gtcgatcagg tcttctatct gagactcctc cattccttcg gccgccaact cggcacggat 3069901 tgcctcgcgc cgttcgtcat acgcgtcgga cacgtactta aggaacacca ggccgaggat 3069961 cacgtccttg tattggctgg ccgacagcga cccgcgcagc ttgtcggcgg ccttccagag 3070021 cgtgtctttg agctccttca tcgtcgacgg cgcctgcggc gcctgcttct tcctgggcgg 3070081 cattcccgtt tccttcctat cgatgcgccg cggcgatgcc gggcgtggtg ggccagctcc 3070141 tcgacaacac gaaggtcgca tcgggcgaat cacgctgtcc ctggggccac cacccattcc 3070201 acgggttgcc gtgtgatggc ggcgatgcgt tcgaagtctt ggtcgtagtg catgacgggt 3070261 atgccgtgat gctcggcgac cgccgcaatg atcaagtccg ggatcttgac cgagcggtga 3070321 aatcccttgt cggtcaatgc ttcttggatc tcccatgcac gaacccacac ggtgtcgggg 3070381 gtgttgacgt attcgagcgc gtcacgccgg taggtgccca gtgttcgatg gtcctcgcgg 3070441 gaacgcgccg agactccgaa ctcgagatcg gtaatgccgc accgggccag tagaccgcgt 3070501 tccatcaacg gttccaagcg atgtcggacc gcgggcaagt gcgcgcggta agccgctgat 3070561 ttgtcgagca aatagcgcgt ggtcatgccg tgttctctgg gtggccgtct cgccacattg 3070621 cgttgaccag agcttcgtcc tgggttccgg tggcgttctc ggccatccgg ttcatgagcg 3070681 agcgcgcggc actggctcgc aacgcggccc gcagcgcggc atgcacggtg tctttctttg 3070741 tcgtggtacc cagttccttg gcggcccgag cgagcaggtc gtcatcgatg tcgatcatgg 3070801 tgcgcgtcac acccggagag catactacta atgcatatcc gcgatgcata taacggatgt 3070861 atctcaggcg gggctcaggt gcacgcgggc cggatatcgg tatgcgtgaa gtcatcgcca 3070921 cgaaacagca gcggctcccc ggtgacctgg gccagggcgt agctgtaggt gtcgccgagg 3070981 ttgagacggg ccggatggcc gctgccgcgg ccgtagtcgc gatacgcctg cgcggccacg 3071041 cgggcttggt cggcgtcgac ggcttcgacc tggattccgt agtcgtccag caaacggtcc 3071101 accaatcgag agatctccgg ccggtcccgc cgctgcatga tcgcgcacag ttcgacgtag 3071161 ttgggcgcgg acattcggga gttcggtgac cgctccagcg cctccttgag cacctgcgcg 3071221 cccgattccc cgctcacgat ggcgacgatg gccgacgtat cgacgatcac cggggcagac 3071281 cgctgtcatc gtagaggtcg acctcgtgtc gccgaatcag gcgcttgtcg tcgtcgctga 3071341 gcagcttgtc gaggtcgcgc agggtctgtt cggcggcggc gcgccgggcc tccgcgcgtg 3071401 ccctgtcctc gcggtccaac tccgagaggc ggcgcgcgac ggcgtcctcg acagcagccg 3071461 tctggttggt gccggtgcgt gcggccagtt cccgcaccag cgccacggtg cgctggctct 3071521 tgatattgag gctcatggta gaaggctacc ggccagcggg tagaccatct atcccggaca 3071581 tcaacagcgg aagcagcgca tcgcggcagg atgccaggcg tgcggattca atccgccgct 3071641 cgttgcacag cgcacccagg ttcgcgattg cggccgcgtg tccgggagtc aaccggcgca 3071701 catcgcgcac ccaaacccgc aacagctggg tcggttggat tcgttgccgg cttcccgtca 3071761 tgcccccgac taactgccgc agttctgcca ggacatcggg ttgtcgcagc gccgcccaca 3071821 gggccgaagt gtcgacgccg actggccgca gcacgacgaa ctccgtactc gccagcgcca 3071881 tttccgacgg gaggctggtg atgttccaga ttcgcgggat tcttggattc agtttcggga 3071941 acaacacaca cggctgcgac acgacgagct ttgcgctcct gatcgttcgc ccaccgacgc 3072001 gactgggctg ggcgccgccg tcgaatgccg cgaaactgta atgggcgacg gtgctatcga 3072061 agtgctgcgc atcaagacat gcggttgacc tgctcgccag gctcgacaac ggcacgtatg 3072121 cagagagccg cccgacgatc gcaagcatca acgcctcggc ggcttcgatg acacggtcgt 3072181 tggcggcgat cttgtcgtcg aaggcgccta ggatctcgcc gattcgaggg cggtcgggcg 3072241 cggcgacggc cgataccgaa acgttccgca gaacaccctg actcagcagg ggctgtcccg 3072301 atccggcccg atatcggttg agcccgaaac ccagtagcgc gtaataccaa tatcgggttt 3072361 cctcgggctt cttggcccga cacgccagcg cgttgtcggt cacccacacg tcggaatcgc 3072421 aatagcgcag gctaccgcag tacgagccga cgcggccgac gacgatcagc gggccacgcg 3072481 cgttgtgttg ggcggaatat ccgataaccc cgtttgcacc atagacggga tagcggccgc 3072541 cgggctcgct cgctggcgac gtatggccag acgtatggcc attcgagaag tcgagatggt 3072601 cccctagcct taccttttcg actttctcga cgcggctcat ccgttagtcc gcttggtggc 3072661 cgcgcacagt tccccagcca gatcaccccg ggtggacacg gcgatccccc ccaatcccag 3072721 ccacgacgcc atcgacgcca gctcgccggc caacccctcg gcaaccgtga ccggcggtat 3072781 gtcgagttcg ccaagcacgc ccgccacggg caagctgtcc gcggcgcggt cggttttgcg 3072841 gtccacccgc gcaaccgggt gtccgtcaag gggccgcacg taatacaagt gccggcgttt 3072901 ggccgccact gccgccgaat ccagctgccg cacttggatt cgcgagatca gccgcttgcg 3072961 gtcggcaccg gcgatcgggc cggccgaccc gggttcggcg acgcctccta cgacgacggc 3073021 cgcccggcgg tcccacgcag cagtcgcggc gctcatgagc gcgcgccgcg acgatgcagt 3073081 gggggtacca cccgcttgcg ggggacgaag cgatgaggag aagcggcgct catgagcggt 3073141 ggtagctgta caaccggtac cgcaacccgg accggctgaa gcgccactcc cccgtctcgc 3073201 cccgccatgt ctcgtccagc acgggggcca gcgcgtcacc ggcttcgcgc ggcaggccga 3073261 tgtcgacctc ggtaacctca catctggtcg cgtacggcag cgccagcgca tagacttgtc 3073321 cgcctccgat cacccacgtc tccgggctgg tcagcgcctc ctcgagtgaa ccgacaacct 3073381 cagccccgct ggccataaag tcagcttggc ggctcagtac gacatttcgc cggccgggca 3073441 gcggccggac tttagccggc agcgaatccc atgtgcgccg gcccatcacg atcgtgtgcc 3073501 ccatggtgat ctcccggaaa tgcgcctggt cctcgggcaa gcgccagggg atgtcgccgc 3073561 cgcggccgat gacacccgat gtcgcttgag cccagatcag ccccaccatc gtcacacgcg 3073621 tcactccttg attccggctt gaaggctgtc cgagccgact tcattgtcgt cggcgcgcct 3073681 cataccgcga ctggagcttt gatcgccgga tgcggatcgt agttcttcac aacgatgtct 3073741 tcataggtgt actcgaagat tgaatcccgg tcggctagaa gtagtttcgg atatggccgc 3073801 ggctcgcggc tgagctgcag ccgtacttgc tcgacgtgat tgtcgtagat gtggcagtcg 3073861 ccaccggtcc agatgaactc gccgaccgac aagccggcct gggcggccat catgtgggtg 3073921 agcaacgcat agctggcgat gttgaacggc acacccagaa acaggtcggc gctgcgttgg 3073981 tagagctgac agctcagccg gccatcggcg acgtagaact ggaagaacgc atgacagggc 3074041 ggcagcgcca tccgctcgat ttcgccgacg ttccaggccg acacgatgat gcgccgggaa 3074101 tcgggatcgg tgcgcagcaa atccagcgcc gcgctgatct ggtcgatgtg ctcaccggat 3074161 ggagccggcc acgatcgcca ttgtacaccg tagatcggcc cgagttcgcc tgtatcactt 3074221 gcccattcgt cccagatggt gactccgtgc tcgtgcagcc aaccgatatt ggaatcgccg 3074281 cgcaaaaacc acagcagctc gtaggctacc gatttgaaat ggactttctt ggtagtgagc 3074341 agcgggaaac cggccgacaa atcatagcgc atctgctggc cgaacaggct gcgggttccg 3074401 gtgccggtgc ggtcggattt gggcgtaccc gtttcgagca cgaagcgcag caggtcctcg 3074461 tatggcgtca cgattgacac gcggtcagcc tagcggcgat cgcaagcgcg gcgaagccgc 3074521 cgcagcgact cgccgccaaa caaacccagc gggcgatcgc aagcgcggcg aagccgggca 3074581 cagcgagtcg acgggaatac acccagatcc gcgccacagg agtacaacgg aggccatgcc 3074641 gaaaaccacc gacaccgccg ctactcctga cggcacctgc gccgtgcgtc tgttcactcc 3074701 cgatggtccg ggccgctggc ccggtgtggt gatgtttcct gacgccggcg gcgttcggga 3074761 caccttcgac cggatggccg ccaagctagc cggattcggt tacgtggttc tgcttcccga 3074821 cgtgtactac cgcgaaggcg actgggctcc attcgatatg aagaccgcgt tcggcgatcc 3074881 gcaagaacgc gcacggatca tgtttatgat tggcacccta acgcccgacc gggtaacccg 3074941 tgatgccgat gcgcttctca actacctggc cagccgcccg gaggtgatcg gggaccgctt 3075001 cggtgtctgc ggctactgca tgggcgggcg aatgtcggtg gtggtggccg gccgcctgcc 3075061 ggatcgtgtc gccgccgcgg cagctttcca ccccggcggt ttggtggcca acagcccgga 3075121 cagcccgcac ttgctggccg accggatcag cgccaccgtc tacatcggcg gcgcggagaa 3075181 cgacccgtcg ttcaccgccg accacgccga gaaactcgac aaagcgttca gcgcggccgg 3075241 cgtgccgcac cgcatcgagt gctacccggc cgcccacggg ttcgcggtcc cggacaatcc 3075301 gtcttatgac gccgcagccg acgaacgcca ttgggcagca atgacagaga ccttcggcgc 3075361 agcgctcaac tagccccgcc aagcagacgc agaatcgcat taatcgcgcc cggtttgtgc 3075421 gattctgcgt ctgcttggca gcacctcagg cgccgcgacg tcgatcccga tgatgattca 3075481 gccgacgccg gtccgcggtg cgccccgcga gctacgcgtc gagttgcgtc cgcggcagtg 3075541 cgtggacgca ctttccacgg ggcaaaggcg cccctacacc ggcgcggtca atgctcagtg 3075601 ctgggtgcgg cccggaatcc cagcgcgttg ccgagcagta gaccgccgtc gatgatcatg 3075661 gtttcgccgg tgatccagct tgcggcatcc gaaaccagga acgcgaccgc gctcgctatg 3075721 tcggccggct ccccgattcg tccgagcgca atggtcgccg ccaacggatc ctcgtggtcc 3075781 ttccacagcg cctcggcaag cctggtgcga accaccccgg gacagatcgc attcacccgg 3075841 atgcgcggtg aaagctccag cgccagctgc ttggtgacgt ggatcagcgc ggctttggtc 3075901 gcgttgtaca tgcccatggc cggggactgg tgcatcccgc cgatggaggc ggtgttgacc 3075961 accgcgccgc cgtgctcgcc catccacgcc gtcacgacga gcgaggtcca catcagcggt 3076021 gcccacaggt tgacgtcgaa gatcttggcg aagcgggcgt ggtcctgctc gagcagcgga 3076081 ccgtaagccg ggttggttcc ggcgttgttg atcaggatgt caacgctgcc gaagcgctcg 3076141 agggtgaggt ccacacaacg ccgggcggca tcctcgtcga ccgcgtgtgc accaacgccc 3076201 agggcgcggt cgccgacctg tgcagcagcc tcgtcggcag cttcctgcct gcgtgcggtg 3076261 agcaccacat gggcgccggc agctgccagc tgttgggcga tggcaagccc gatgcctcgc 3076321 gatgcgccag taattatggc ggtgcggccg gtcagatcca gtgaggtcat ttggcttgcc 3076381 ttcggttgct gtggtggccg gactccgccg gcggggagcg tcggtagcgc ccccgcaccg 3076441 tatgcgacaa gaatgctagc gaaatcaaac cccacgaaac caccggtagt ggtggtgcta 3076501 tcgcgattgc cgtagcctgc acaacctcac gccagacttg agccactgcg accatctgcg 3076561 gcgtgtcgcg tgcgtggttt aagtgtcgcg aacggcgagg ccttacagcc tcatgattcc 3076621 gaatgattcc gaacggtatc cggcttgaac gtgccccagc tgtggcggat tctgacattt 3076681 ctcggccagc ccggccacgg gcaccctcgt aaccaaccat ttcgccgcta gcgagcccgg 3076741 cgggggcggc tgcgacgcca tggctccggc ggcttgattg acggtccggg cggcgtcggt 3076801 tgcggcccca ccgtcggttg ccgcaccggc cacgcctggc gggtcgctgt gcgggacata 3076861 gccggccggc ccggtcgatg ggccacaggc caatcagacg acgacctgtt tgggcatgac 3076921 gatgggcttg aacccgtacc gaggcccggc ataggcaccg gctcctttgg ccgccccaga 3076981 aatccccgcc atacccggca ttgcggcgac tggcccggct tcctcgggga ccgcccagcc 3077041 cgagccctcc agtgccgtgg taccagacgt catggccgga gcagcggtcg accaaccggc 3077101 cgggaccgac aggccaccga ccgaggacgc ctcgcccaga ctcgccgtca gcgaggcccc 3077161 accaacgccc actggcgtca ccgtgtgcgc caaaccagct gctgccgcgg cagctggaac 3077221 ggcatcggcg gctgcggtca cggttgccgg gtttagggca gcaaaggcgt gtccgaggaa 3077281 taccgcgttg gggatggtgg ccatgacgaa ccaggcggtg gtgttgaccg cgccgttgat 3077341 cgcgttctga acaaacgtga taccgagcag ctcctcaatg tcctgaatga ttccgcctaa 3077401 tcccgccgcg tcggccgccg atgtcagggg cgaagcgaac cccattaccg cgttcggcag 3077461 gttgctgatc agcgatccca gccccacctg ttggaccgtg ctggcggcag cggcatggct 3077521 gaccgcggcg gcctgaccgg ccagcccggc catgttggcg gtctgggagg gcgtgatcaa 3077581 cgggttcaac ctccccgccg ccgccgagga ggccgcgtaa ccgtacatcg ccagtgcgtc 3077641 ttgagcccac atttcgccgt agtgagcctc ggtcgccatg atcgccggtg tgttttgacc 3077701 caggacgttg gtcgccacca gtgccgcgag cagagccctg ttggcagcga cctccgccgg 3077761 cggaaccgtc atggcgaacg ccgcctcaaa ggcggccgcc gacgccatgg cctgtgcagc 3077821 cgcatgggcc gccgattcag cggtgtaggt caaccaggcc aaataaggct gggcagcaac 3077881 gaccatcgac atcgacgccg gacccagcca ctgttcggta gtcaactgca tgatcaccga 3077941 ctcgacggac gatgctgtag tgctcaactc gacggccagg ccgttccacg tcgccccggc 3078001 ggccatcagg ggtgctgcgc cggcaccggc gtacattcgt gtggagttga tttccggggg 3078061 taaagctcca aaatccattt tccctatccc tctattgatc tctattgatc gaaattcgct 3078121 acttctcaag tgcgggcaac cgcgtcgagg ccgcccccta taccgccggc ttgggcacga 3078181 cgatgggttt ggcgccgtag cgtggcgcac cgaagcccgc gctgctgcgc gtcgccgagg 3078241 ccaaccccgg catcccggga atgaccgtcc ccgcggcacc atgcggtgcc gcggtggtcc 3078301 agccagcgcc ctgcagtgtg ctggtgctcg ataccaggtt ggcctgtccc gcccagctgg 3078361 gcggcaccga caatgcgccg attgacgacg cccgactaag gccggcggct agcggagccg 3078421 cacccagacc ggccgcgatc ggcgcctcgc cgacggccgc ctccgccgcc cccagctccg 3078481 ataggcccgc gccctccaag ccctcctcga gggcggcttc ctcggcagcc ggaagaagac 3078541 caccgctggc cagccctagc aagtccgacg cggcggaggc ccagttccca gccccaatgt 3078601 tgaagatatt ggcaatatct gaaatccagg agggcacctt cccgggcgtg gaacccaaga 3078661 tgctcgcgat acccgacaac ggcgaagcgg ccgcggatga gttggcggcc tcggtggccg 3078721 cataggtgcc agcgctgacc cccagggtct tcacaaacag gtcgtatacc gcagctgctt 3078781 cagcactgac ctgctggtag agagtgccgt acgcggtgaa caacggcgcc tgtagcactg 3078841 atatctcatc agcggcggcg ggaatcacgc ccgtggtggt cggggcggcc gcggccgcgt 3078901 tctgggcgac catcgccgag ccgatggtct cgagcttgcc ggccgcagcc gccaactctt 3078961 caggctgtgt cgtcaggaat gacatcgatt gctcctcata tgactaagcc agcagggcta 3079021 gaaacctgtg aattatctga tcagtccctg ccgaatagct gatcaggtcc tgtgtttaga 3079081 taaggctaac gatccacacc tccgcaagcc cgatcaaaag gcgcaagcgc agaattcatt 3079141 tacggcttat ttacgccggc accggcagtc ttaacacgat ccttttgagc gtggcacctg 3079201 accgctcgcc gcagcagcga aatgaaacac gcgccgcggg agggttagcg caatgtggcc 3079261 gcggcggcgc gctggtcggc cgcgtgcgct tgtctcggtg tctccagatc agaagaggcc 3079321 gtgcttgggc ataacaatcg gcttgactcc gtaccgtggt ccggagtcgg caccaacact 3079381 gttggcggct acgaccattc caggggcagg cggcatcact gcgatcgggc cgtcctcctc 3079441 gggaactgcc cagcctgtgc catccaaggc cgcgccggct gccgtcgccg gcgctgcagt 3079501 agaccagctt gccggcaccg acaggcgacc aaccacggac gcattgccca aatcggcggt 3079561 cagcgctgtt ccgccgacgc ccgctggggc aaccgcgtgt gccaccgcgg ctgccgcgcc 3079621 gccacctgga gcggctccgc caacggttcc catagcatcg gcaagaagcg tcatattgcc 3079681 aatggcggcc gtggcaaagt ctgccacgcc acccaggccg tgaaacgcgg attctacgaa 3079741 cagcggaaca tcgagattga ggaactgcct gaccgcctcc aacccggtat cggccgcgga 3079801 catcaccggg gaggcgaagc tcaggacagc gtcggcgacg tcgctgatca ggtggctcag 3079861 acccacctgg cgcgcgaaag cggatgcgcc ggcttggcca acagcggcgg cttggtgtgc 3079921 gagcccggcc ggattggtga tgtgcgacgg cctggtcagc gggttcagtc ttgcggcgac 3079981 cgcggatgcg gccgcatagc cgtacatggc cgaagcgtct tgggcccaca tttcgccata 3080041 gcgtgcctcg gtagccgcga tggccgacac gttttgccca aggatgttgg tcgctgtcag 3080101 ttcagccaac agggctctgt tggcaaccac ctcggccggg ggcactgtca gcgcaaacgc 3080161 cgtttcaaag gcggccgcag acgccatggc ctgtgccgcc gcgagcgccg aggattcagc 3080221 ggtgcaggtc aaccagacca aatagggctg caccgcggcg gccatcgaca acgatgcggg 3080281 acccatccag tgctcggtgc tcagccgcgt gatgaccgac ccgacggagg acgcagctgt 3080341 gctcacctcg acagctatgc cgttccacgc agccgcagcc gccagcaggt ctgccgcgcc 3080401 cgcgccgcca tacatgcgcg cagaattgac ctccggaggt agagctccaa aatccactga 3080461 ggcgttccgt ttctggtcga gtgcagtggt ggccggtgct ccgtctgagg cagccattat 3080521 tccatcaagg tcagcgccag cgtaggcacc acgctcgcca cggcgtcgat ggcgcccaaa 3080581 tcatcccatt aactgcgcag cgacggttgc tccgaggttc cagcacgcct cgatatcggc 3080641 cttgctcggc ttgcccatca ccactacagt ctcagcggct tgcacccaac ccaggccggt 3080701 tgtgatggcg tcgacggctc gctcggctcc ctcggtgccc tcgttgccgt gaatgtacgc 3080761 gccgaacgaa cgcccacggg tggtgtccag gcagaggtaa tagcagacat cgaaggcatg 3080821 cttgagagca ccactgatgt accccagatt ggctggggta cccagcagat agccgtcagc 3080881 ctccagcatc tcgatcggcg aaaccgtcag ggcgggtcgt ctcaccacct cgacgccctc 3080941 aatctcggga tcggtcgcgc cggacaccac cgcctcaaac atctcctgca tgtgcggaga 3081001 cggcgtgtgg tgcacgatca gcaagcgccg caccgcagga ccctgtcact aaaagtgggg 3081061 taatcgacca aagcgtgcag aagcgctccg gacaggtagc ccaaggccgg caacgtggtc 3081121 atctggcccc cggcctagcg cgcccctcta gctgtagggc cgtcttcatc gcttcccgcg 3081181 cgcggcgccg atcccccgcg tagtcgtagg cgcgcgccag tcggtaccag cggcgccagt 3081241 cgtcggcgtc gtcttcgagc tcggtgcgca cggcagcgaa caacgcatcg gccgcgtctc 3081301 gctgaatgcg gccagaagcc cggcggggca gcgcgctggc gtcgatgtcc agtccgtctt 3081361 cggcgatcag acgggccagc cgctgatacg cgaatccggc ccgcagcgtg gcaatcatgg 3081421 cccacagccc aatgaccggc aggatcagca gcgccagccc cagcccggca gccgcggcgc 3081481 ggcccgaacc gatcattgcg acggcgacac gcccgagcat aaccaggtac gccaccatcg 3081541 ccacgcacat gaacgcgatt atcaactgga catacagggt gcgcctggtc atcacagtgt 3081601 cggtcagtgc agatcgagta ggggctcaag acctacggtg agaccagggc gttcggcgat 3081661 gcggcgcacc gccaacagca caccgggcac aaacgatgtg cgatcgaggc tatcgtggcg 3081721 gatggtcaga gtctccccct cggtcccgaa cagcacctcc tggtgggcga ccagtccggc 3081781 cagccgcacc gcgtgcaccg gtatgccgtc gacgtcggca ccacgcgcgc ccggcaggct 3081841 ggtactggtg gcatcgggat tgggcggcaa gccttttcgg gcctcggcga tcagcttcgc 3081901 ggtacgcgcg gccgtgcctg acggcgcgtc agccttgtgc ggatgatgca gctcaatgac 3081961 ctcggccgag tcgaaaaacc gtgcggcctg cttggcgaaa tgcatggaca gcaccgctcc 3082021 gatcgcgaag tttggcgcta tcaacaccga tgtgttgggt tttgcgacga gccacgattc 3082081 gacttgttga aaccgctcgg cggtgaaccc cgtggtaccg accacggcgt gaattccgtt 3082141 gtcgatgagg aactccagat tgcccatcac cacgtccggg tgggtgaagt cgatgacgac 3082201 ctcggtgtta ccgtccgtta gcaggctcag cggatcgccg gcatccagct cggcggatag 3082261 ggtcaggtcg tcggcggccg ccaccgcccg caccatcgtc gctccgacct tgcctttggc 3082321 tccaaggacg cctacccgca tggccttcac cctagaccgg gccgtcctcg aggccaacga 3082381 ccgcggctgc accaaacccg gcgtgcgccg tgaggcgctt gttgatcgag tggaggtgaa 3082441 agacctgcac ggtagttctg tcgcagctgt ctgaaccacc ccatcggcag attccgtgaa 3082501 gagccagata cggtgaaagt cgcacgtccg gttcgaaggg cggccacggg aaacggaccc 3082561 gcagcaacgc gggcaccgca cccatggtcg acccaactgc cacgcacccg gtgaccggtg 3082621 cgaagtccac catatcgacc agtgggcaac cggcacatcc caccacaggt tggtcggaaa 3082681 cggctggtgc acaacgaagc tccccaacgg ccaaaccgca gggatcccgc caccccacct 3082741 cgaccgcggt gcccacacca aacaactacg cgctgaccgc cgcgactgcg cccacgacct 3082801 atctaggctt taatgatccg aggcgtcagc agcgaaggtg ctcatgtgaa acccagcaat 3082861 atcaggattc gtgcagccaa accgatcgat ttcccgaagg tggcggcgat gcactatccg 3082921 gtttggcgac aatcctggac cggaatcctc gacccgtacc tactcgacat gatcggttcg 3082981 ccgaagctgt gggtcgagga gtcttacccg caaagcctga aacgcggcgg ctggagtatg 3083041 tggatcgccg agtctggcgg tcagccaata ggtatgacga tgttcgggcc cgacattgct 3083101 catcctgatc gcattcaaat cgacgctttg tatgtagccg agaacagtca acgtcacggc 3083161 attggcgggc gcctcctcaa cagggccctg cactcacatc cgtcagccga catgattttg 3083221 tggtgcgccg agaagaacag caaggcacgc ggcttctacg agaagaagga ctttcacatt 3083281 gacggccgca ctttcacgtg gaaaccactg tcaggtgtga acgtgcccca tgtgggctac 3083341 cggctttatc gatccgcccc gcccgggtaa gcatcaggcg tcgataacca cccgaccgct 3083401 cacggcccgc gacacacaga ccagcatctc gttatcgcct tcgatgatgc ggccgcggcg 3083461 gtcgacctgc ccggcaagga ctctcacctt gcaggtcccg cagaagccct gctggcagga 3083521 gtatgccgtc gtcgggtccc agtcgagcat gacgtccagc gccgaccggt tcgccggaac 3083581 tcggagcact cgcctcgacc gtgcgagctc cagctcgaac ggaactccgt cgacaaccgg 3083641 cggcgggctg aatcgctcgt aatgcagcgg cgcgtcggcg tgttgattgc gggccacgcg 3083701 caccgcttct aacatcccgg gcggcccgca cacgtaaacg gccgtcgtcg gccctgcgcc 3083761 ggccaacagt tcatcgacag acgcaaaacg accgtgctcg tcgtcggccc acaccgtgac 3083821 ccggccgggt gccaccgcca ctacctcgtc caggaacggc atgtactccc gaccgcgacc 3083881 ggcatagatt gcgcgccagt cgattccgcg ctgttcggcg gcccggatca tcggcaggat 3083941 gggcgtcacc ccgataccgc cgatcacgaa aagcacgtca cgctcggcca gaccgagatg 3084001 gaaggcgttg cggggacctt cgaactcgca cgtgtcacct acgtcgaagg cctcgtgcat 3084061 ctcgatcgaa ccgccgccgc cgtccgcgat tctgcgaatg gcgatccggt agtccgtacg 3084121 ccgtccgggc acaccgcaca acgagtactg tcggcgccgc cccgagggca gctgcacgtc 3084181 gatgtgccca ccgggcgacc aggccgggag caatccgcca ccggggtcag ccaacgtcaa 3084241 cgccaccacg tcgggagcga ccagctcgcg cttggtaacc accgcgggat tcgtgcgccg 3084301 caccggctgc acccgcgacg gttcccaccg cgaggccgcg cccaatcctc ccaataacgc 3084361 tcgtacaccc cacagcgctg tgaagaagcg gtcccggctg cggcgaccgt aaaggtcggc 3084421 gggcctactg gcccagctgg tctctggcac ggtgcgctcc gccattccta cggatcgtca 3084481 ccgatcagtg cgacgctcgc gcggcgggcg agacggccag gtagtccacg gccgccccca 3084541 gcccgcccag ctgggacggg tgaaaacccg gcttgtagta gtgccccacg acccgaagca 3084601 gccgcggcag cccgggcacc aaaccacggc gtgcggcctt gaaatagtcc cgccagcgcg 3084661 gctttgtccc cggtggcagg tacggatcca ccgaatacat gaaccgcact ccgcgaatcc 3084721 acagcagcaa catcaccggg gtaacggtca gctgggcacg cacctgccgc cagtaaccgg 3084781 cgcgcaagtg cttcatggtg tcgaaggcca cggctttgtg ctcgacttct tctgcaccgt 3084841 gccaccgcag catgtccagc atcacggggt ctgcaccgac ggcatcgagc tgcggggaat 3084901 tcaggatcca ctcgcccatg acggcggtgt agtgctcaat tgccgcgatg aacgaaacct 3084961 gctctagcaa ccagctgtac tgtcgtcgcg ggctccgccg aggactctcc cccagcagct 3085021 tttcgaacag ccacctgatc tggttggtaa acgctgtcac gtcgacaccc tgggcatcga 3085081 agtggtcaac cacgccggag tgcgcctggg aatgcatcgc ctcctgaccg atgaatcctt 3085141 gcacgtccag cctcagttga tcgtccttga tcagcggcag cgtcttcttg aagaccctga 3085201 cgaagaactc ctcgccggcc ggcagcagca tatgcagaac gttgagaacg tgggtggcca 3085261 tcggctcgtt gggcacatag tgaaatggca ggtttgtcca gtcgaattcg acatctcgcg 3085321 gctcgaggac gagacgttcg tggtcggcgg cgcgcgactc tgacgagtgc ggacccgtcg 3085381 cccggtcatc gacgctgacc attgctgccc cctcagaaaa cgtagccacg gcgtttacat 3085441 aaatgcccga catgtcgccc cagtagacat cacgtgttgg caagtatagt tgcgcgtacc 3085501 cgaggggtga agaacctgct cgccagcctg gcgccgaatg cacctcgacg ttcaccgcgc 3085561 ctcggcagcc gacgatgtcg gcttcaccgt gtcgaattcg tcgcctcgcc ctcgtcggcc 3085621 tactcgcaac tggtggcatc agtaatgcca ttgcgcagca acgcacttgc tacggcacgc 3085681 gactcgccat cagtgtcccg ccagcacttt cgctacccgg cctcagctcg gtccttgaga 3085741 cgctgcaggg tgcgtcgaat gtgctcggtg tttacgctcg cccgatcctt gacgccggtg 3085801 gccatccggg caactgcgcg gaaccagctg ggccgccggt cccaggtgct ctccgtaacc 3085861 cgacagccgt gttcggtagc gacgatgcca tattgccagc gtgaaatcgg aataatgccg 3085921 gaccgtacat cgaaagcgaa aacccgaccg ggatcggcgt cggtaacggt gcacgtcgtg 3085981 gtccagcgcc gtccaccgtt ttcgttgcga ccgacaaaca ccgctccctt gcgaacatcg 3086041 tcgcctttgc gcaactgcat cgccaccact tcctcggcca gcgaggccag tgtcggcaga 3086101 tcagtgatca gcccgtatac caggtcggga ttggcgtcga tctcaacggt gaccgtcaca 3086161 gaaggcccat cagggtctgg catcccgcga tcatagcccg ctgggcgggc cgctctagat 3086221 gggcgccgcc ccgcgcagat gctcgaagat cagggacgtc tgggtacctg cgacgtcggc 3086281 gtcggcattg aggttttcga ccacgaacga acgcaggtcc tcggtgtcgc gagcggcgac 3086341 gtgcaagatg aaatcgtcgg cgccggccag aaagtagaca tccatcacct gccgtttgcg 3086401 gcggatctgc tggatgaagc tgcggatttt cccgcgagcg gacgactgca agttgaccga 3086461 gatcatcgcc tgcaacggca aacccaccgc gaccgggtcg atgtcggtgt agaacccccg 3086521 gatcacgccg aggtccacca accgccgaac ccggccgtga cacgtcgacg gcgctatccc 3086581 gacagtgtcc gctaacgcgt tgttgggcat tctggcatcg ccatgcagca agctcaggat 3086641 tctgcggtcc acctcatcaa gttcagcggg tcgaacatcc ttcgacgagg cagcccggcg 3086701 agtcttgtgt tccgttgaat tatcacgcat atggcctcga aaaagaatta tcatcagcaa 3086761 tcttgcagat taatcgaact ttcttcatac tgaagcgtac agtatcgaga ggggtaatca 3086821 tgcgcgtcgg tattccgacc gagaccaaaa acaacgaatt ccgggtggcc atcaccccgg 3086881 ccggcgtcgc ggaactaacc cgtcgtggcc atgaggtgct catccaggca ggtgccggag 3086941 agggctcggc tatcaccgac gcggatttca aggcggcagg cgcgcaactg gtcggcaccg 3087001 ccgaccaggt gtgggccgac gctgatttat tgctcaaggt caaagaaccg atagcggcgg 3087061 aatacggccg cctgcgacac gggcagatct tgttcacgtt cttgcatttg gccgcgtcac 3087121 gtgcttgcac cgatgcgttg ttggattccg gcaccacgtc aattgcctac gagaccgtcc 3087181 agaccgccga cggcgcacta cccctgcttg ccccgatgag cgaagtcgcc ggtcgactcg 3087241 ccgcccaggt tggcgcttac cacctgatgc gaacccaagg gggccgcggt gtgctgatgg 3087301 gcggggtgcc cggcgtcgaa ccggccgacg tcgtggtgat cggcgccggc accgccggct 3087361 acaacgcagc ccgcatcgcc aacggcatgg gcgcgaccgt tacggttcta gacatcaaca 3087421 tcgacaaact tcggcaactc gacgccgagt tctgcggccg gatccacact cgctactcat 3087481 cggcctacga gctcgagggt gccgtcaaac gtgccgacct ggtgattggg gccgtcctgg 3087541 tgccaggcgc caaggcaccc aaattagtct cgaattcact tgtcgcgcat atgaaaccag 3087601 gtgcggtact ggtggatata gccatcgacc agggcggctg tttcgaaggc tcacgaccga 3087661 ccacctacga ccacccgacg ttcgccgtgc acgacacgct gttttactgc gtggcgaaca 3087721 tgcccgcctc ggtgccgaag acgtcgacct acgcgctgac caacgcgacg atgccgtatg 3087781 tgctcgagct tgccgaccat ggctggcggg cggcgtgccg gtcgaatccg gcactagcca 3087841 aaggtctttc gacgcacgaa ggggcgttac tgtccgaacg ggtggccacc gacctggggg 3087901 tgccgttcac cgagcccgcc agcgtgctgg cctgactctc ggccgctcgt tacgccgagc 3087961 acacgtcggg agtaagggaa gcgatgatgt cggccgcggg tcccggccgg gtcttccggt 3088021 gcgccgatcc cgcccaaagg tttgttccgt gcgggtcgtc cgcctgcacc gccgccgccc 3088081 gtatcggctt cgtcatctgg tggacctccg gataacccag cggcgccacg tggtcgagca 3088141 ggcgagtgaa gttgttggcc agaccgcgcg catacctacc cgagaacgcc cgagtgacca 3088201 gggtggcatc gaactctgga ttcttcagcg cggcacggtg tgcggcattg gtaccggctt 3088261 cgtcggccag cagcaatgcg gtaccaacct gcgcggcgat cgctccgcgg cgcagcacgg 3088321 cggccacgtc ctcagccgtg cccaggccac cggctgcaac cagcggcaca tcatgggcgc 3088381 tgccaatccg atcgaggagt tggtgcagcg actccgtacc gggttccatg tccggcgcga 3088441 acgttccgcg gtgcccgccg gcagccgggc cctggaccac caggctgtcc gcgcccgcgg 3088501 caatggccac accggcctcg tagaccgacg tcacggtgat cgagaccaac agtcccagcg 3088561 cgctcaaccg ctgcacgaca tccggcggcg gcgcgccgaa ggtgaacgac accacctccg 3088621 gacgaacatc ggctaccacc tcgagtttgc gcacccagtc gtcgtcgtca ccatagacgg 3088681 gctggcccac ctcggtgtgg tagtactcgg cgacctcttc gagctcgtcc gcgtaatact 3088741 ccagctgcgc ccagtcggcg acgctgggtt ggggcacaaa cagattggct ccgataggac 3088801 cggtagtggc ggcgcgcgca gcggcgatat cgtcggcgag ccggtccgcg ctcagatagc 3088861 cgccggcgac gaaaccaagc ccgccagcgt tggacaccgc cgcggccaac gccggggtgc 3088921 tcgggccgcc ggccatcggg gcgccgacga tcggcaccgc gatgtcccag aagcccaaca 3088981 ccatcgggct aattcgccga cggcgagcgc cggcacggcg cgagtgagga agcggacatt 3089041 tgagctaccc taccatcgct cgaagttgtt gcggcagtga tcgtttcgat ccgtgtgggc 3089101 caagaacggc agcaccgtag cgcctgctca gcaggtggcg ggccaccgcg ttgacctcct 3089161 ccacggtgac ctgctcgatt tgccgcaagg tgtgttcgat gctgcggtgc ttgccgtagt 3089221 tcaactcgct gcggccgagc cggctcatcc gggagctgga atcctccagc cctagcacca 3089281 gcccaccccg cagcgatccc ttggcgatgc cgcattccgc ctcggtgatg ccgtcgcgtg 3089341 ccacgctttc cagcacatcg gcggtcaccc gcatcacgtc ggcgaagcgt tcgggcaggc 3089401 aggccgcgta caccgaaagc gcgccgctgt cggcgaagag atccagcgcg gagtagaccg 3089461 agtaggccag cccgcgggtc tcgcggacct cctggaacag ccgggaactc aagccaccgc 3089521 ccagcgcggt gtgcagcacc gacagtgccc aacgatgctc ccagccgcgc ccgggtgtgc 3089581 ggatgcccag cgacacatgc gtctgttcgg cgtcgcggct aaccagtgtc aaccgggggc 3089641 tgccgttgac ccggccggta cccttgcgcg gcgcaactgg ccgtctcccc cggaccaacc 3089701 gggacccgaa gtgctcgcgg accaacgcaa ccagcccgtc gtgatccaca ttgccggcgg 3089761 ccgcgacgac catccgctcc ggggtatagc gccgcaggtg aaacgattgc agttgagccc 3089821 gcgtcatcac cgacacggat tgcgcgctgc cgatcaccgg gcgaccgacc gggtggtcgc 3089881 cgaacaacgc cgccaggaac atgtccgcca aggcgtcctc ggggtcgtcg tcgcgcatcg 3089941 cgatctcctc gaggacgacg tcacgttcca cctcgacatc gtcggcggca cagcggccgt 3090001 tgagcaccac atcggcgacc aggtcgacgg ccaacggcaa gtcgctgccg agcacgtggg 3090061 cgtagtagca ggtgtgctcc ttggcggtga atgcgttcag ttccccgccc accgcgtcca 3090121 tcgcctgcgc aatgtccacg gcagagcggg tgggcgtcga cttgaacagc aaatgctcaa 3090181 ggaagtgcgc cgccccggcc accgtggcgc cttcgtcgcg cgatccgacg ccgacccaca 3090241 ccccgaccga cgcggagtgc accgcgggca ggaattcggt gaccactcgc agcccgcccg 3090301 gcagggtggt gcgccgcggc gccagcgccg ccgcggggtc agctggtgac cgtcgcggca 3090361 tcggtagcgg cggcggtgct gtcctcgtcg gcgaccagga tcagggagat cttgccccgt 3090421 ttgtcgatgt cggcgatctc cacccgcagc ttgtcaccga cattgacaac gtcctcgacc 3090481 ttcgcgatgc gcttgccctt gccgagtttg gaaatgtgca ccagaccgtc gcggccaggc 3090541 agcaacgata caaaggcacc gaaatcggtg gtcttgacca cggttccgag gaaccgttcg 3090601 cccaccgtcg gcagctgcgg gttggcgatg gcgttgatct tgtcgatcgc ggcctgtgcc 3090661 gatggcccgt cggtggcgcc gacgaacacg gtgccgtcgt cttcgatgga gatctgcgcg 3090721 ccggtctcct cggtgatggc gttgatgacc ttgcccttgg gtccgatgac ctccccgatc 3090781 ttgtccaccg gaaccttgat ggtggtcacc cgcggggcgt agggactcat ttcgtcgggt 3090841 ctatcgatgg cctcagccat cacctccaag atcgtgaggc gggcgtcctt ggcctgctcg 3090901 agtgctccgg caagcacctg cgaagggatc ccgtcgagct tggtgtccag ctgcagcgcg 3090961 gtgacgaagt ccttggtccc ggcgaccttg aagtccatgt caccgaacgc gtcttcggcg 3091021 ccgaggatgt cggtgagggt gacgaagcga cgctccacaa cgccgtcgac cgccccttct 3091081 acttgaatgt cgtcggagac caggcccatc gcgatgccgg ccaccggcgc cttgagcggc 3091141 accccggcgt tgagcagcgc cagcgtcgac gcgcacaccg accccatcga ggtcgacccg 3091201 ttggagccca gagcctccga cacctggcga atggcatacg ggaattcctc gacgctcggc 3091261 aacaccggca ccagggcccg ctcggccagt gcgccgtgcc cgatctcacg ccgcttgggc 3091321 gaaccgaccc gaccggtctc gccggtggag aacggcggga agttgtagtg gtgcatgtac 3091381 cgcttcgatg tctccggccc caacgagtcg atctgctggg ccatcttgat catgtcgagt 3091441 gtggtcacac ccaggatctg ggtttcgccg cgttcgaaca gcgcgctgcc gtgcgcgcgc 3091501 ggaaccacgg ccacctcggc cgacaatgcg cgaatgtcgg tgatgccgcg gccgtcgata 3091561 cggaaatggt cggtgaggat gcgctgccga accagctttt tggtcagggc acgcaacgcg 3091621 gcgccgacct ccttttcgcg accctcgtag gtgtcggcga gccgctgcac aacctgggtc 3091681 ttgatttcgt cgatgcgctg gtcgcgctcg gctttaccgc cgatggtcaa cgcggcggcc 3091741 aactcgtcgg tggccaccga ggacaccgag tagtacacgt cttcgccgta gtcagggaac 3091801 accgggaagt cgacggtcgg tttgcccgac tttccagcgg catcggcaag ctcctgctgc 3091861 gcggtgcaca gcgcggcgat aaacggcttg gccgcctcca ggcccgcggc caccacgctt 3091921 tccgtcggcg cttgggcacc accttcgacg agctcgacga cgttttcggt ggcctcggct 3091981 tcgaccatca tgatggcaac atcaccctcg acgatccggc cggccacgac catgtcgaac 3092041 acggcgcgct cgatctggtc gacggtgggg aagccgaccc aggtgccgtc gatgagcgcc 3092101 acccgcacac cgccgatggg cccggagaac ggcagaccgc ccagctgggt ggacgccgac 3092161 gccgcgttga tcgccaatac gtcgtagaga tcgcccggat ccaggctgag aatcgtcacc 3092221 acgatttgga tctcgttgcg cagcccgtcg acaaacgacg ggcgcagcgg gcggtcgatg 3092281 agccggcagg tcaggatcgc gtcggtggag ggtcggccct cgcgacggaa gaacgaaccg 3092341 gggatgcggc cggccgcata catgcgctcc tcgacgtcga ccgtgagggg gaagaagtcg 3092401 aagtgttctt tggggttctt gctggcggtg gtcgccgaca gcagcatgtt gtcgtcgtcg 3092461 aggtaggcga ccaccgcgcc ggcggcctgc aaggccaatc ggccggtctc gaagcggatg 3092521 gtccgggtgc caaagctccc gttgtcgatg gtggcggtcg tctcgaacac gccttcgtca 3092581 atttcagcgg cagacatgac gtccgtgcgg cctctctgga ttattgagct gtttcgcgtc 3092641 gtcacgcgca atccagcggg ttcgccgaac cccgagagct tcccaggaga aaaggtctga 3092701 atgcggctac ggccatcgat cgaagcggcc gacctgcccc agatccggag agcccggcag 3092761 ccactaccga ggaccgcccg atacaggccg ggggtgctcc cttggatatg catagtgact 3092821 cgctggaacg gcacacgcgg ttctgcgcgt accgcaccat ttgctgggcc gaaccggccc 3092881 agaacgttct cactctacac gggcgaccgg cggcatttgc gtagaactcg ctttgccgag 3092941 ctaccccgcc tcagctccgc gggccgccgg tgacatcctc gacgcacacc gcgaaccgtc 3093001 gctgagtgta gacgtagccc acgccgctgg cgcactggtc gacgctgacg ggagagtcga 3093061 ggtctttcag gatctgggtg gcccgctgcc ggtgcggcac cgaggcgtcg tcgcagtcca 3093121 cccggaacgg gtcggtgttg tgggtagggt cgacgctcat acaaccgcca atcacccaat 3093181 cgatgtccag gcaaatggtg ttggttgagc cgttgaacgc attgcgcatc gaataggtgg 3093241 agtcgacgtc cgccgggcat tccgcgtggt cctcctgcac gacggcaacg accttgaagt 3093301 tggacgccgg gctcccgcac tccgccttag tggcctgcgg ccggtcgggc gtgccggcga 3093361 gtttgacgca gtcccccacc ttgagttcgg cgacgttggt cgctgacgaa caccccgtcg 3093421 ccacgacgaa caaggccgtg gtcgcggccg cgagccaggc gcgcatcgac gccgcgggtc 3093481 agcgacgcag gcccagccgc tcgatgagtg aacgataacg ctccacatcg atctgggaaa 3093541 tgtacttgat cagccggcgc cgccggccca ccagcaacag cagtcctcgc cgcgaatgat 3093601 ggtcgtgctt gtgcaccttg agatgctcgg tgaggtcggc gatgcgtttg gtcagcaacg 3093661 cgatctgtgc ttccggggat ccggtatcgg tctcatgcag gccgtaggag cgcagaatct 3093721 cctttttttg ctcggctgtc agcgccacga aatgtctcca tcaatgggtt cgcgatcatg 3093781 gatatcaggg cacggccacc gcgaaccgca gcacgcaccg atgtcgttgg acagtctagc 3093841 agcgggttga ccgccaaaca caaacgccgc aggtgccagc cgggggtcac gaccgcaaga 3093901 accgtcaacc cgtagacaac aggtcacgtg cccgctcggt atcggcaccc atcgcagcga 3093961 ccagctggcg caccgattcg aacttcttct ggccgcggat acgcccgacg aagtccaagg 3094021 ccacatgttg accgtagagg tcagcggtgg tgtccagcac gaacgcttcg acggtgcggg 3094081 tgcgtccgga gaaggtggga ttggtcccga ccgacaccgc ggcctggtag cgctcacccg 3094141 ggacgaccgt gccggtcacc ggcccatgcc cgagcaccgt gaaccaagcg gcgtacacgc 3094201 cgtcggccgg aatcgccgaa tacatcggcg gcgccacgtt cgcggtggga aagcccagct 3094261 ccgcgccccg cccctcaccg cgtaccacaa ccccctccac gcggtgcggt cggcccagag 3094321 cttccatggc cgccaccatg tcgccggcgt ccacgcagga ccggatgtag gtggaggaga 3094381 acgtcacggt ctcgttgctg tggtgctcgg acaccaacga catcgattcc accgcgaacc 3094441 cgaaccgctc gccagcccga cgcagcgtgt cgacattgcc ggcggccttt ttgccgaagg 3094501 tgaagttctc gccgacgacg acctccacca catgtaggtg ctcgacgagc agctcatgga 3094561 tgaagcgatc cggcgtgagc ttcatgaaat cggtggtgaa cggcatcacc aggaacactt 3094621 cgatgcccaa gtcttgaacg agctccgcgc gtcgggtcag ggtggtcagc tgcgccgggt 3094681 gactgcctgg atagaccacc tccatcgggt gcgggtcgaa cgtcatcagc acggccggta 3094741 caccgcgagc gcggccggcc ttgaccgcgt gcgcgatcag ttcggcgtgc ccgcggtgca 3094801 cgccgtcaaa taccccgatg gtgagcacgc atctgcccca atccgtcggg atctcgtcct 3094861 ggccacgcca gcgctgcacg atcgcaagcc tacggcgcac ggtggtcggc caggcgccag 3094921 attcaccggt gggctctggc cagcggccga tccgggaaca ccatgcacgc ggccgccgga 3094981 cacctggcgc agcacgtccg ggaacgccgg cggcaccgtg gccggtccca gagaattggc 3095041 gcgcgcatac cgaacgattg gtctcaagct ttacgccgac cattgatcag gtgatcaggg 3095101 agtgggtctg atgagtacgt ttagagaatg ccgcagcatg ttcgatgccg cggtgaagag 3095161 ctaccagtcc ggagacctgg ccaatgcccg agcggccttt ggccgcctca cagtcgaaaa 3095221 cccggacatg tccgatggct ggttggggct tctggcctgc ggcgaccatc atcttgatac 3095281 cttggccggt gcccatcaac actccgaagc actgtacagc gaaacccgcc gcgtcggcct 3095341 cacggacggc gaattgtccg ccgtggtcat ggccccgatg tatctggggt tgcgggtgtg 3095401 gtcgcgcgcc acgatcgggc tcgcgtacgc cagcgctcta atcatcgccg accgccacga 3095461 tgaagcggca gcaacgctgg acgacccggt catcacggag gacaccggcg ccgcccaata 3095521 ccgccagttc gtcatggcga cgctgttcca caaaactcgc tcctggtcca accttttgaa 3095581 ggtcaccgaa atttctccgc cgagcggggc caccgatgtc cgtgacgagg tggctgacgc 3095641 ggtggccgcg ctggcctcga ccgctgcggc gagtctgggc caattccagt tcgcgttgga 3095701 gctcgctgag caagtctcga caaccaatcc gcgggtgact gccgatgtga ccctcactag 3095761 ggcgtggtgc ctgcgcgaac tgggtgacga cgacgccgcc agagtggcac ttagcgccac 3095821 gaccaccggt gatgccccca ggacaaacac caccgcggaa caggctggta gcccccaacc 3095881 gaagtttcga catccttacg acgacggccg ggatctcctg gtggctcgcc gccgcccgcc 3095941 ggccggggac ggttggcgca aagcggtaac caaaatgact ttcgggcggg tgaatcccga 3096001 accgagcgcc aagcgcgagc aaaccgacga gctgattcag cgtatctgcg ctccactggc 3096061 cgatgtccat aagttggcgt tcgtctctgc caagggcggc gtaggtaaga ccacgatgac 3096121 ggtgctggtg ggcaacgccg tcgcccggct gcgcggcgat cgggtgatgg ctgtggacgt 3096181 cgatgccgac ctgggcgacc tgtcagcaag gttcagtgag cgcggtggcc cgcagaccaa 3096241 catcgagcat ttcgtgtcat cgcagcacac caagcgctac gcggacgtgc gtgtgcacac 3096301 ggtgatgaac aaagaccggc tggaaatgct tggtgcccag aatgatccgc gatcgacata 3096361 caagtttggc ccggaggact atggggccgc catgcagatc ctggaaaccc actgcaacgt 3096421 catactgctt gattgcggca caccggtcaa cgggccattg ttcagcaata tcctcaacga 3096481 cgtcactggt ctggttgtgg tggcatccga agacgtgcgc ggtgtcgagg gagcgttggt 3096541 cactctggac tggctggggg cgcatggctt tggccggttg cttcagcaca ctgtggttgt 3096601 tctcaacgca atccagaaaa cccggtcact tgtggattgc ggggccgccg aaaaccagtt 3096661 caggaagcgc gttccggatt tctttcggat tccctacgac ccgcatctgg ccacgggttt 3096721 ggcggtcgat ttcagctctc tcaagcgaag gacacgcaac gccgtgctgg atttggccgg 3096781 cggcctggca cagcactatc cggctagccg agtacggccc cgtggcgagg acagttggaa 3096841 aacctggatc gaaacgatgc gtcaggtcgg atgacggttt ggtcgagacc gagttggcgg 3096901 ccatttcccc gactgcgcac cgagcgcgcc gtcacgccgg tatctagact ctctggttgt 3096961 gagggctgac gaggagcctg gcgatcttag cgcggttgcg caggactatc tgaaggtcat 3097021 ctggaccgcc caggagtggt cgcaggacaa ggtcagcacc aagatgctgg ccgagaggat 3097081 cggggtgtcg gccagcacgg cctcggagtc cattcgcaag ctcgccgagc agggcttggt 3097141 cgaccacgag aagtacggcg cggtgacgtt gaccgattcg gggcgacgag ccgcgctggc 3097201 aatggtgcgc cggcaccggc tactggagac attcctggtc aacgagctcg gctaccgctg 3097261 ggacgaggtg cacgacgagg ccgaggtgct cgagcacgcg gtctcggatc gcttgatggc 3097321 ccgcatcgac gccaagctgg ggttcccgca gcgcgatccg cacggtgacc cgatcccggg 3097381 cgccgacggg caagtgccca cgccaccggc tcgtcagctg tgggcgtgcc gcgacggcga 3097441 cacagggacg gtggcccgta tctccgatgc cgacccgcag atgctgcgat actttgccag 3097501 catcgggatc agcctggact cgcggctgcg ggtgctggct cggcgcgagt tcgccggcat 3097561 gatctcggtg gcaatcgact cggccgacgg cgccaccgtc gacttgggga gcccggccgc 3097621 ccaggcaatc tgggtggtga gctgacggct ttggcccgcg agcgtaacgt ggctgcgatt 3097681 ttcggcacgg attttcgcag tccggttacg ctcgcgaagc cggttcgccc agcaggccct 3097741 tggcgatgtg ggttacctgg acctcgttgc tgccggcgta gatcatcagc gacttggcat 3097801 cgcgagccag ctgctccacc cgatattcgg ccatgtagcc gttgccgccg aacagctgga 3097861 cggcctccat cgcgacatcg gtggcggcct ccgaggaata cagcttgatc gccgaggcct 3097921 cggccagcgt cagctgtttg ccggctttga gccgctcgat ggcctgaaat accatgttct 3097981 gcacgttgat ccgcgcaact tccattttcg ccaacttcaa ctggatcagt tggaactgcc 3098041 cgatgttacg gccccacagc gtgcgggtct ttgcgtaatc cacacacagc cggtggcatt 3098101 cgttgatgat gcccaacgac atgagcgcca cgccgaggcg ttcgacggcg aaattggcgc 3098161 gggcgctgtc gcggccgtcc ccctcggcgc aaagcaggcg atccggggtc agccgcacgt 3098221 tgtcgaagaa caactcgccg gtcggcgaag acatcatgcc catcttcttg aacggcttgc 3098281 cctgcgtcag gcccggcatg ccggcatcga gcacaaagac cagcaccggg cggttacgcc 3098341 aatctgaggc gggctcaccg tcggcgagct tggcgtagac caccaggaca tcagcgtacg 3098401 gcccgttggt gatgaaggtc ttgtgcccgt tgaggatgta gtcttcaccg tcgcgggtca 3098461 cgtgagtctt catgccgccg aacgcatccg agccggagtc tggctcggta atggcccagg 3098521 ccgcgatctt ttccagcgtc accagcgtgg gcacccagcg ctcctgttgg gccagggtgc 3098581 cgcggctcat gatcgtcgcc gcgcccaacc cgaggctgac ggccaccgtg ctcagcaatc 3098641 cgatgctgac cccggccagt tcggacacca gcaccgcgac catcgaagcc tggtcagcca 3098701 gcccgaaact gcctgagctg tcccgctttt cccgcttagc ccgctcccca tccagcatct 3098761 ggttgaccga ctcggcaagc agcacgtcca gaccgaactg gctgaacagc ttgcgcgcga 3098821 tcggatacgg cgacagttca ccggtttcca atgcgtcttg gtgcgggcgg atctccttgt 3098881 cgatgaactg gcgaacggcg tcgcgcacca ttagatcggt gtcggaccac tcgaacatgg 3098941 cgtgctccct ccgatcgcgt ggctcaacgt tcggcccgtt ggtatgcggt gaccacggcg 3099001 gcgccgccca gcccgatgtt gtgttgcagc gcggcggtca cgttgtcgac ctggcgcgcc 3099061 tcggcggtgc cgcgcagctg ccaggtcagc tccgcgcact gcgccaaccc cgtcgcaccc 3099121 agcggatggc ccttggagat cagcccaccg gatgggttga cgacccagcg tccgccgtag 3099181 gtggtctggt tgtcgtcgat cagctcgggc gcctcgcccg gcccgcacag gccgagcgcc 3099241 tcgtagagca gtagctcgtt ggctgagaag cagtcgtgca gctcgatcac tccgaagtcc 3099301 ttcgggccga gtccggattg ctggtaaacc cgttgtgccg cttgcacagt catgtcgtag 3099361 ccgatgatat tgcgggcact gccatcaaag gtggaagcga agtcggtggt catcgcctgc 3099421 ccgacgattt ccacagcccg cccggcaagg ttgtggttgg ccaggtaatc ctcactggcc 3099481 agcaccaccg ccgccgaccc gtcggaggtg ggagagcact gcaatttggt cagcgggtcg 3099541 gaaatcatct ttgaggccaa gatgtcgtcc agggtgtatt cgtcctgaaa ctgtgcatac 3099601 gggttgttga ccgagtgctt gtggttcttg tagccgatct tcgcgaaatg ctccgcggtg 3099661 gtgccgtatt tcttcatgtg ttcgcggccg gccgccccga acatccacgg cgccaccgga 3099721 aagccgaact cgtcgatctc ggctaacgcc ttgacgtgcc tgcccagcgg cgactcccgg 3099781 tcgtcggcgc caccgcccag cgctccgggc tgcatcttct cgaagcccag cgccaacacg 3099841 caatcggcca gtccgccgcg gatggcctgc gcgccgaggt agagcgccgt ggatccggtc 3099901 gagcagttgt tgttgacgtt gacgatgggg atacccgtca tgccgagttc gtagagcgcc 3099961 cgctgacccg acgtcgattc tccgtagacg tagccgacgt agccctgttc aacttcgcgg 3100021 tagtcgatgc cggcgtcgcg cagcgctttg gtgcccgact ccctggccat gtccgggtag 3100081 tcccagcctt cgcgtcgccc gggcttttcg aacttcgtca tgcccacgcc aatgacgtaa 3100141 accttgttcg acgacccttg gttaggcatc gttgccgttg caagtgagtg atctttagtg 3100201 gtcacgcgac ttgcaccccg tctcggggtt gttcggcagc cttgcggctg cttcccttcc 3100261 gcgcttcacg gccaccagcc cggccaggcc gggtcttacg gtcggctcca cgcttgacgg 3100321 cggccccaac tgggccgacg acgctactgg tgtcctcgta gcgtgcgagg ttgatcgctg 3100381 cgcagtcatc acgctgatgc gatgccgagc acgaatcgca ttgccagtgc tcggcccatc 3100441 cgatctcttg gacatgcccg cagacgtggc aggttttcga cgatgggaac cagcggtcag 3100501 cgaccactag ttgtgacccg taccagcctg tcttgtagga caggtggcgg cgcggggtgc 3100561 ccagggccgc gtcggagagt ccgcgccggc gagcgcgggc acccgagagg ccctgttgcc 3100621 gcagcatccc tgccgcgtcc aggccttcga caacgatgcg gccgtgggtc ttagccaaat 3100681 gcgttgtcag acagtgcagg tggtgggtgc gaacatcgtt gacccggcgg tgcagccggg 3100741 atatttcggt ggtgcgctca cggtagcgac gtgagccttt cgtgcagcgc gaccgtgccc 3100801 ggcagacatg ccgtagctcg ttgagtgccg cgtcgagtgg ccgtggattc ggcactcgtt 3100861 cgagcaccgc gccgtcggcg gtggcgaccg tggccaggcg gcgcaccccg acatcaacgc 3100921 cgacccgtga accggggtcg gtcaccttcg gttgctgcgg gcgctgcacg aggacccgca 3100981 cactcgcatc gatccgggtc ccgttacggc gcaccgtgat cgcgagcacc cgcgaccggc 3101041 ctttggcgat gagccgctca acccggcgcg tgttctcgtg ggtgcggacg gtcccgatga 3101101 ccggcagcgt gaggtggcgc cggtcgggct cgacgcgcat cgctccggtc gtgaacgtca 3101161 cccggtctgg gtcgcgtccc ttcttcttaa accggggaaa gcccatcctt ttgccatcac 3101221 gtttgcctga tcgcgagttc tgccagttcc agtacgcgtc gaccgcgccg tcaataccgt 3101281 cggcgtaggc ctccttcgag cactccggcc accacacaac accggtctcg atgttgacgc 3101341 acacgtcgtt cttgacggtg ttccagcgct tccgcaacac ccgcagcgac ggctttgccg 3101401 tctggatccc ggtcgcctgc caggcgtcga tatcggcttt cagggtggcg acggtccagt 3101461 tgtaggcctt gcggcgggca ccgaaatgcc gtgccaacgc gcgggcctgc tcggcggtcg 3101521 gatcgagcgt gaaccggaaa gcctgaacca tccagccctc aggaatctcg aatttcgcca 3101581 tcaggcagcc tccgactctt cggcggcggc cgccaatgcg cgcttggccc ggttctgcgc 3101641 agcgcgcttg ccgtacagcc gggcgcacat cgaggtcaag atctcggtca tgtcccgtac 3101701 caggtcgtca tcaacctcgg ccgagtcgac cacgaccagc tcgcggcctt gggcggccag 3101761 cgccgcttcg acgtactcag agccgaaccg gcagaaccgg tctcggtgtt ccaccacgat 3101821 ccgcttcacc gatgggtcac gcagcagcgc aagaaacttt cggcggtgcc cgttcagcgc 3101881 cgaaccgacc tcggtcacga ccttgtcgac cgcgatctgc tcggtcgtgg cccaggcggt 3101941 cacccgcgcc acctgccgat ccaggtccgg cttctgatcc gctgacgaca ctcgcgcata 3102001 cacggccgtc cgcgcccggc gggatctatc ggccggctgg tcgtccacga gaatcagccg 3102061 cccggccttc cgcgccggca ccggcaacaa ccccgcatga aaccagcgat acgcagtcac 3102121 ccgcgcaaca ccgttgcgct cagcccacac cgccagattc atactgttgt tcctacagca 3102181 cgccactgac aactaccgac cactcagacc gcaacagctg acagcccctt ccgaattgaa 3102241 cagcggccca tcgccgtgcg acgtaggccg tgtagcccag tgtgccaccg ttgccgtccc 3102301 ggaccgcatc cccctacatt gaggccaggc tccaaccgaa tcgcccggct cctcctcacc 3102361 ccgctacccg gggtgcatcg tcgccgggcg gagcaccgcc accgacctgg tccgcgaacc 3102421 ctcgtcacgc agcagcgcga taacccggcc gtcggcgtca caggccgcgt acacgccgtc 3102481 gataccgacc gccggcaggg accggccgtt ggcggccgcg ctggcctccg cggcggtcag 3102541 gtcgcggcgc gcaaacatca gcaggcaggc ctcatcgagg ctcaggctca gcgcggggcg 3102601 ctccgcgaga tcgtcgagcg atctcgcctg gtccagctcg aagcggccga cgcgggtgcg 3102661 ccgcaacgcc gtcacatggc ctcccacccc aagcgcgtcg ccgaggtcgc gtgccaacgc 3102721 gcggatgtag gttcccgagg agcagtcgat ctccacatcg atatcgatga gctggtcgcg 3102781 ccggcgtgcg gccagcagct cgaaccggtc gatgcggatc ggccgggctt ccaattgcac 3102841 ggagcgcccc tggcgggcca accgataggc gcgtcggcca ccgaccttga tcgcgctgac 3102901 cgacgacggc acctgccgga tctcaccgcg cagccgctcc atcgcggcgt cgatcgcctc 3102961 gatggtcagg tgcttagccg gaaccgactg cagcacttga ccttcggcgt cctcggtgga 3103021 agtggtctga cccaagcgga tggtggcggc atacgacttg ggggccgccg tcagcagacc 3103081 gaggatcttg gtggcgcgtt cgatgccgat caccaacacc ccggtggcca tcgggtccag 3103141 ggtgcccgcg tggccgaccc gccgggtggc gaagatgcgg cggcaccgcc ccaccacgtc 3103201 atggctggtc attcccgcgg gcttgtcgat aaccacgatt ccggggccgg ttgcgctcat 3103261 agcacgatcg cggtcagcac cagtccgcgc tcaaccgacc agcgtccccg cagcgttgtc 3103321 agcggcggac ccgacagggt ggacccgtcg atgaggatac gggagacgaa gcgacccgtc 3103381 cagccggtgc tatcggtttc gaacgtgatg tgcgcgtcct cgaaacccag ccacctcttg 3103441 gtcagcggaa accacgcctt gtacgttgct tccttggcgc agaacaggat tcgatcccaa 3103501 tgcaacgccg ctggcatggt gcggggcatg tcggcgcgct cggccggcag gctgatcgca 3103561 tccagcacac cattgggcaa cacgtcgtgc ggttcggcgt cgatgcccac ggaacgcacc 3103621 gcatccctgc gtccgacaac cgcgccgcgg taaccggcgc agtgggtgag gctaccgacc 3103681 atgccgtcgg gccagcacgg ttcgcccttg tcgcccttga ggatcggcgc cggcggcaca 3103741 ccgagctggt ccagcgcgat gcgggcgcag tgacgcacgg tgatgaattc gttgcgccgc 3103801 ttggcaaccg atcgtgcgat caacggcgcc tcctcgggca gcggggtgag accgggtggg 3103861 tcggagtaca actcggcata cgccaaatcc tcgaacacgg tcgccggcaa caccgacgcc 3103921 accagcgtgc ctaccgtcat cgagactgcc gttgccgcaa tcgttcccgg aactgggcgg 3103981 cctgggttcg catctcgggc gtgatcacga agtgaccgcc gaagtcgttg aggtagccgg 3104041 gcgcgtattg gggatccggc agcacctgcc gcagccagga gtagggcttg cgccggcgcc 3104101 actcccgcgg gtaacccacc gacacctcct cgaaccgcac accgtcatac caggtggtgc 3104161 ggggaatgtg taagtgtccg tagaccgaac acacggcgtt gtagcgggtg tgccagtcgg 3104221 cggtcttggt ggttccgcac cacagcgaga attccgggta gaacagcgcg tcgcagggct 3104281 gtcgcagcag cggaaagtgg ttgaccagca cggtcggttg catccagtcg agctgttcga 3104341 gacgggcccg ggtggccgcg acccgctcgt ggcaccaggc gtcgcgggtg gggtacggct 3104401 cgggtgagag caggaactcg tcggtggcca cgacgttgcg ttccttcgcg atggccacac 3104461 cttcggcctt gctgtttgcc ccctccggca aaaagctgta gtcgtagagc agaaacatcg 3104521 gcacgatggt ggccgggccg cctcgttcgg tccataccgg gaacggatgc tcgggtgtga 3104581 cgacgcccat ctcgtcgcac atgttgacca gatagtcata gcgtgcgcgg ccgaagatct 3104641 gcatcgggtc gcggttggtg gtccacagct cgtggttgcc cggcacccag atcaccttcg 3104701 cgaaccgccg ccgcagcagg tccagcgacc agcggatctc gtcggtgcgt tcggcgacgt 3104761 cgccggcgac gatcagccag tcgtccggcg aggacgggta cagcgattcg gcgacgggtt 3104821 tgttgccgag gtgaccggtg tgcaggtcgg agatcgccca cagcgtcggc tcggcgccga 3104881 cggtctcctg ccccgatcct ttccaggtca cgacttacca ccctaacgac ccggcgaagt 3104941 gggaacgaaa tccagccagt tcgaccaacc gctacggcgt gagcagacgc aaaagccccc 3105001 atttcgggcc cgaaatgggg gcttttgcgt ctgctcggcc aacctagccc aactgctacg 3105061 gggtcggcga gggttttggg gtgtcggtcc ggctgatccg gcagtccgac tgtgcggtga 3105121 ggaccgccgc cttctgggtc gagcatttga actcgacgcc gccatgaccg gcgaagtaca 3105181 cgtcgtggtc ggccggcttg ttcttcatca ccccaaagcc ggtggcaccg aactgttcga 3105241 caccgtcctt gacgatctgc acggcctgca gccacttgtc gtcggggatc ggtccactga 3105301 aaacgatcat gaaatacgcc gctttggccc tggtccattc atactcacca ccgcagccgg 3105361 tccaggtgtc catatccgtt cgccacgtca ggccgggcac cagggccgtg atggcgttgg 3105421 ccagctgggt caccgccgcc cggtactggt ccttggcgtc ctccagcggg ggcttggcgc 3105481 gcaacgggtt ctccagctcg gcgaccttct ccgggctcaa cggcccctcc tcgccggccc 3105541 gcgtcccgtg gccactgggt ccgcacccag tcgccatcac acacaccaga gccagcagcc 3105601 acgccgtcgg ccaccgcatc aacgtccccc tctcagtgct gggccgggcg ctgccggcat 3105661 gccgccaccc agaattggcg gaagcagcgg cgggcccacc gtgttgtcgg gcagcccggc 3105721 ggcgatcgcc gccaggttat agccggacat ccgcagctgc ggctggccgg cggcatcgag 3105781 gaaggaccgc gggtagtccc cgtgggcata cactccgtca cgccagatcc cgcccggatc 3105841 aaaacccgcc tgtgacgaca gctccgtgaa cccgggggtc agataggggt ccaggcccca 3105901 tccgtgcagc ggcgccaacg gcgccaccag attggtgatg aggtcgtggg gggcctgcat 3105961 gacataagcg tgcccgtgat cgagcccgag ctgcgccggg ctgtacagct ccaagccggg 3106021 tgagccgtaa aacacgacgt cgttgaccgg atgggcgctc tgggcatcga ggtcctgcaa 3106081 cgccagcgac gccgtcagcg acccatacga gtgccccaac acggtcaggt ggccactggg 3106141 gttattggcg cgcacctgct gcaaataccg cgacagatcg gccgcgcccg cgtgtgcctg 3106201 cccatcggtc atggtctgcc acagatcgcc cgcactgccg gtgtcgagtg ggttcggggg 3106261 cgggtggtag cccatccagg cgatggtggc aaccgatgcg ggcttgccgg cagcattgag 3106321 ttgccggatt acctccgacc gcaggtcgcg ggcttcggtc accatgccgg gcagggcgcc 3106381 ccgggtggtg gacccgacgc cgggaaccgt caccgacaca ttggcggcgg tgtcgggatt 3106441 accgacggcc acggccgcca gcacctgctg atttgggtcc tcgggaatct gcagctgggt 3106501 caggtaggtc tcgggtgctc ggctcaacgc ctcgtcgacg gcatcgagct cacccagccg 3106561 gcccctggcg gcgctcagct cgtcggtaag cgctgccagt cggcccaccg cgtcaccgtc 3106621 gaggatgccg ttgtggtagt cacgggcggc ccgcacactc agttggtcat actccgcctg 3106681 taaccgctcg aggtgggcct gcaggcgggc gcgctcctcg cgcagccgct cctcgttggc 3106741 atcgctggcg agctgggtcg gggtcagccc ctcgggaccg accggcgggc cggaatcggc 3106801 cgggatgggc gcgtcaccgt cggccatatt gaccgctgag gccagctcct cgtcgacggc 3106861 attggcctcg gccataatcg catccagctc cgcctgcagc tccgtttgct tggccagcgt 3106921 ccgcgcccac tgcgcctcgg tggatcgcag cccggggatc ggcaccaccc ggttgatcag 3106981 cgcatcgatc gtcagctcgg cggccgcggc ggcatggcgt agtgcggcca gctcggactg 3107041 aaccttcaca atcccgtcgg cggccctgtc ggccgcccgg gcaaccgcca acgcctcgtt 3107101 gccgtgggcg tcgaggtctc ggcgaatgcc cgcgttgtgg tgtgccgccg cctcagcggt 3107161 cttgccaccc gagttcgcaa aaatcgacag cgcggccaac tgacgcgacg cctcgaacgt 3107221 cacctccgct cgggcactgg ccgcgtgaaa cacctcccgg accgcttgcg cgttccaccg 3107281 atcgatatcg gccacggtca gtggcacgaa tcacacccca cgcggaccag ctacgacgtc 3107341 ggcggaaaca cccacctggg cgagcgcctg cgcccgctcc gcctccgccg ccgcatgctg 3107401 gatagcggcc tcctgcagcc cgaatgcgtg atcaccgatc ctggtcagca gcgccctcga 3107461 cgcgtccaac cagtcgtcca tcttggcgtt gagcgccatc gccgaggcgc cctgccagcc 3107521 gaactgggcg gcctgcatcc gatagtccga cgacaaatgt ccgacggcca gaccctcacc 3107581 ctgcgtggtc acctgcgccg ccgagtgcat ccactgctcc ggactgatct gaaacacccg 3107641 ttgcttcctt gcgtccatcg aagtgcatca cattatgcgt cagcgggaac taccgcagaa 3107701 ttcaccgcat caaaggtggc ccgggttaga acaagttctc gtttgactgt gacgacgcgg 3107761 agccgacttg tacactcccg gcaagggacc gccgagggca gggggtgtcg tgttcaccag 3107821 ggtgcggctg atcggagggc tcggtgcgct gacggcagcg gtggtggtgg tgggcacggt 3107881 gggctggcag ggcatccccc cagcgccgac cggcggcgac gcggtccagc tgcgatcgac 3107941 cgcggcgccc atgtccacca cgatgaagag cccgatcgtg gcgaccaccg accccagccc 3108001 gtttgacccg tgccgagaca tcccgttcga cgtcatccag cggctcggat tggcctacac 3108061 gccaccggaa gccgaggagg ggctgcgctg ccacttcgac gcgggtaact atcagatggc 3108121 cgtcgagccg atcatctggc gcacctacgc ccagaccctg ccccccgacg cgatcgagac 3108181 cacgatcgcc ggccaccgcg ccgcgcagta ctgggtgcgg aagccgacgt atcacaacag 3108241 cttctggtac tcctcttgca tggtgacctt caagaccagc tacggggtga tccagcagtc 3108301 gctgttctac tcgaccgtct actccgagcc cgacgtggac tgcccgtcga ccaacctgca 3108361 gcgggcaaac gacctcgtcc cctactacag gttttaggtc cctaccctgg gcgtcgtgag 3108421 taccacctcc gctcggcccg agcggcccaa gctgcgcgcc ctgaccggac gagtcggtgg 3108481 gcaggccctg ggcggactgt tgggtctgcc ccgcgcaacc acccgctaca ccgtcggtca 3108541 cgtccgagtc ccgatgcgcg acggcgtcca gctggtggcc gaccactacg cacccgccac 3108601 gtcgcagccc gtcggcaccc tgctggtgcg tgggccatac gggcgccggt ttccgttttc 3108661 gctggtgttt gccaggattt acgccgcccg cggttatcac gtcgtgctgc agagcgtgcg 3108721 cgggacgttc gggtccggtg gcgtgttcga gcccatggtc aacgaggccg ccgacggcgc 3108781 cgatacggtg gcgtggctgc gtgaacagcc ctggttcacc ggccggttcg gcaccatcgg 3108841 cctgccctat ctgggtttca cccagtgggc gttgctgcac gatccgcccc cggagctggc 3108901 cgcggccgtg atcacggtgg ggccgcacga cttccgggcc tcggtgtggg gcaccggatc 3108961 gtttacggtc aacgacttcc tgggctggag cgatctggtt tcccaccagg aagaccccgg 3109021 tcgcatccgg gccggaatcc gccagctcac cgcgccgcga cgggtggcgc ggacggccgc 3109081 cacgttgccg ctgggtgagt cggcccggac gctgctcggc acgggtgcgc cgtggttcga 3109141 atcctgggtg gaacacaccg accgcgacga tccgttctgg gaccgactgc ggtttcccgc 3109201 cgcgttggac cgcgtccagg tcccggtgct gctcgtcggc ggctggcagg acatcttcct 3109261 gcggcagacg ctgcagcagt accggcacct gcgcgaccgg ggtgtgcacg tcgcgctgac 3109321 ggtcggtccc tggacacaca cccagatgct caccaagggg ctggccaccg gcgctcggga 3109381 atcgttggac tggttggacg cccacctcgg ccgggcgccg gcgctgcgcc ccagcccggt 3109441 gcgggtcttc gtcaccggcc agggctggcg gcacctgccg gactggcctc cggcgaccac 3109501 cgagcgggcg tggtacctgc agcccggtgg ccgcctgggt gagagcgctc cggcttccgg 3109561 cacgccaccg gcgacgtttc gctaccaccc cgccgacccg acaccgacca ccggtggtcc 3109621 gctactgtca tccaacggcg gttaccgcga cgacagccgg ctggccacgc gcgccgatgt 3109681 gctgtgcttc accggggcgc ccctcaccca cgacctctgc gtgcacggaa accccgtcgt 3109741 cgagctggtg cacagctcgg acaaccccta cgtcgacgtg ttcgttcggg tcagcgaggt 3109801 ggacgcgaag ggccggtccc gcaatgtcag cgacggctac cggcgccttg gtgacgcgcc 3109861 ggagctggtc cgcgtcgagc tggacgccat cgcccaccga ttccgcgccg actcccgcat 3109921 ccgggtgctg atcgccggta gttggtttcc ccgctatgcg cgaaacctcg gcaccccgga 3109981 accgatactc accggacggc agctcaagcc ggctacccac gcggtgcatt tcgggcgctc 3110041 ccggctgctg ctgcccgtcg gctaacggct ggtggtgcgg cggacccggg cggcgacccg 3110101 gccgataacc cgagcccgtc cagcggcgcg tgcccagtgg tctccccgac gcggaacctg 3110161 cgagagctac gaccataagt cgagatgcag tttcaaagcc tcatcgagct gggcaagttc 3110221 ggcggctgaa actcggccga ttggccggag caaccgctcg gtagcaatcg atctgatttg 3110281 ctcggcctgc gccttgcagt cgacctggag accagtagtg gtggccgaca acaacacctg 3110341 aaacggatag accttggcga tgttgctcgt caccggcacg acggtgatga cgccgcgccc 3110401 aagacgcgtg gcggtcgcgt tggcccggtc gttgctgacg acgacggcgg ggcgctggtt 3110461 gttcgcttcg ctacctcgag cggggtcgag atcgacctgc caaatctcac cgcggcgcat 3110521 caccgactcc gtcgccgacg gtctgctccc acgcgtccgt gtcgccggct gccgaccatt 3110581 cttgccatgc gttggcatag tcatcttcga gcgtggggta gcgaagcacg cggatcgcat 3110641 gctgcaggcc ggcggagcgg gatggtaatc ccgctcgttt cacatatgcg tccaggatcg 3110701 cgacgtcgtc atcggacagg ctcacgctca acttcacaac ctaagatgct accagggtcg 3110761 tacctaggta gtaataggtt cagcggctgg tcgcgcgcca gtcgcgcagc acttcctcga 3110821 cgtgctcacc cacccggtgc cgcgccgttt cccggtcgac acccgacatc agcagttcgt 3110881 cgaacgaagt gtcgatatgc cgcaccgacg ccgcgacggc cagcctcacc gcttcgggat 3110941 cgagcgcccg tccggccgcg ctacgtccga tcctgccgct gccgcgggtg gccgcgtggc 3111001 gggcgatcgc ctcggcccgg ccggccgggc agttcgggaa cagcgtgcga atcgcggcgc 3111061 cgaattcggc ttgcagacgc aggtcctcgt tggcccgtcg cgcctcgtcg cgctcccggc 3111121 ggcgggcgcg cacctccgca tcggcgaggc actcgttttc ggcgcgctcc agcgcctccg 3111181 cctcgaccag gatgccctga cgctcgtatc gcttacgcgc ccggctccac cgcaccacca 3111241 ccgccgaaag ccggctcgcc cgcttggccc ggcgggtaag cgcggcgtcc ccggacggca 3111301 agaagaccag atggccaagg tccgcgcagt ccaggcacaa cggccccgcg tcctcaagga 3111361 acatcaggtc accgctgccg ccacacgacg cgcatgacca gtcgttgacc ggcatgatca 3111421 cgaccaaatc ggggcgccgg ctctgccgcg cgaccgcacg ctccgagagc tccggcgaca 3111481 cccaatgcgt gcgatacgcg cgctcgatgg cgtcctcgcc ggtgacgctg aaccgcagcc 3111541 gacggcggtc ccgagtgcga gcgacgtaat cggtctccga cgggttgagc ccccggtcgc 3111601 gggcccagcg ccgcaacgcg gccatcacgg cggtgatctt gctgaggttg gcctgtacga 3111661 cttgctccag cgagtcgacg cggccctgcc gccactggtc gacatgcgag ggcgccagcc 3111721 agcccaggcc gagcagcaca tcgatcgcgc tgacgaaccg ctgtcgggcc agcgccgcct 3111781 gcgccgcccg ggccacccgc tgctccagag gttgacgtgc catgacctgc ccgagcctag 3111841 tcggactgcg caccgaggcc gcggaactga gttactccga ccagccggac gcgctcggag 3111901 tggcgatgcg cgaacgtcgg gaacaacaga acctcgttcg gccgccacgg agaaacgctt 3111961 ctcgccgcat caacaccgat cagacgtcga cgaagtacgt ctacattacg tacatgcccg 3112021 agactctgac tggtcgcctc aacttccgcc tgtctcctga acaggagcag gcccttcgcc 3112081 acgccgccgc gctcaccggc cagagcctgt cggggttcgt attgtccgcc gcggtcgacc 3112141 acgcccacga tctcttggcc cgggccaacc ggatcgagct gtccgaggcc gctttccgcc 3112201 gcttcgtcgc cgcgctcgac gagcccgacg aggcggctcc cgaattggtg cgcctcgcca 3112261 gacggaagag ccgcattccc ccccattgag cacccccgcg ctcggccccg tcgagctgtt 3112321 ggacccggac cggcacgaca cggcgcgctt ctccagcgat gttgaggttc tcgaccactg 3112381 gctgcgccga gtcgcgcccg tcgcggctgc cgccggcacg gccgctacgt gggtgctctg 3112441 tcgaggccgg cgggtagttg ggttctacgc gctcgccatg gggagcatcg agcggatccg 3112501 ggtgccatcg cggccgggcc ggggccaacc cgacccgacc cgatcccagt gctcgtcctc 3112561 gctcgcctgg cgctcgaccg gcaggagcaa ggcaccggtc tcggtggcga tcttctcctc 3112621 gatgccctca tccgatccgt ggccggtgcc cggcactacg gcgcccgcgc cctggtcgtc 3112681 gacgccatcg acgaccgcgc cgccgagttc tacggtcacc acggcttctt gcccctcgag 3112741 ggtcgacgcc tctaccggcg gatcagcgac atcgcgcggg cgctgggagt atgaagcgct 3112801 atcgtcgctt ggcgacgtgc tgccgatcga tcgcctcgaa tggcctcgtt gttgttgtcg 3112861 tcggtgatgg ggaggggcaa cggcaagatt ttggatccgg tggtggccac cacggggatg 3112921 ggccgctcga cggcgcggca gatgttgacc ggcccgaggt tgccgggccc ggccgagcag 3112981 gtcgacgggc gtagccttcg gcctcggggc ttcagcgacg aagccagggc gctgctggag 3113041 cacgtgtggg ccttgatggg catgccgtgc ggcaagtacc tggtggtcat gcatgacctg 3113101 tggttgccgc tgttgaccgc tgccggtgat cttgacaagc cgctcgtcac cgaggcgtcg 3113161 gtggccgagt tgaaggcgac agccctacca ggggcgaatc gcatgccgca ctgggccgca 3113221 gggacactcc ctgatggctt tccagcccgg gcggtgagga cgcgcacgtg aaaaccaacc 3113281 cccggtacgg cccggcgttc tactcagtga tgacggtgtt gttcctggcg ctgttcgtgc 3113341 taaatgtgtg cacccacggc tcgacgctgg gcctgatcag taccggaggc ctcgccgtgt 3113401 tgatgggcta catcggctac cggggctggt ccggcaagcg ccatatcaac cggcaatagc 3113461 gatcatcgac cggttccggc acacctgacc agcgccgtcg tcggccgcca accccacggc 3113521 tcgtgtgcca gccgacggtc accgtgtcgc ggcggcggga cacgaggaaa ctgcccacca 3113581 gccacaccta cttcgcgctc acttttaagt gaggcacttc ggcatcgaag gcggataaga 3113641 ccaagatcct ggatcgggtg gtgtccacca ccgggatggg tcgttcgacg gcccggcgga 3113701 tgctgaccgg cccggggctg ccggagccgg ccgagcaggt cgacgggcgc aggctgcggg 3113761 cgcggggctt cagtgacgac gccagggcgc ttttagagca cgtgtgggcc ttgatgggca 3113821 tgccgtgcgg caagtacctg gtggtgatgc tcgagctgtg gctgccgctt gaggccgccg 3113881 ccggtgatct tgacaagccg ttcgccaccg aagcggcggt ggcggagttg aaggcgatga 3113941 gcgcggccac cgtggaccgc tacctcaaac ccgcccgcga gcggatgcgc atcaaaggca 3114001 tctcgacaac caaaccctca ccattgctgc gtaattcgat caccatccac acctgttcgg 3114061 atgaggcgcc caaggtcccg ggggtgatcg aggccgacac tgtggcgcac tgcggcccga 3114121 gtctaatcgg cgagttcgcc cgcaccctga cgatgactga tctggtgacc ggctggaccg 3114181 agaacgcctc gatccgcaac aacgcggcca agtggatcct cgagggcatc aaggagtgcc 3114241 agcagcggtt cccattcccg atgacggttt tcgattcgga ctgcgggggc gagttcatca 3114301 atcacgacgt cgccggctgg ctgcaggccc gcgacatcgc ccagactcgc tcgcggccgt 3114361 accagaagaa cgaccaggcc catgtcgagt ccaagaacaa tcatgtggtg cgcaaacacg 3114421 cgttctactg gcgctatgac accggcgaag agctggagct gctcaaccgg ctatggccgt 3114481 tggtgtcgct gcggtgcaac ttcttcaccc cgaccaaaaa gcccgtcggc tacaccagca 3114541 ccgtcaacgg tcgccgcaag cgcatctatg acaagccggc caccccatgg cagcgcctgc 3114601 aggcatcggg cgtccttgat gcacagcaac tctcgaccgt ggccgcccga atcgaaggct 3114661 tcaacccggc cgatctgacc cgccagatca acgcgatcca aatgcagctg ctcgacctgg 3114721 ccaagaccaa gaccgaggcc ctggccaccg cccgccacat cgacctgcaa tcattgcaac 3114781 cgtcaatcaa ccgattggcc aaggcgaagt aatgcaagcc ccccacgcgc tcactatgcg 3114841 tgaggcacca gccacgcttc gcgctcactt ctacgtgagg cacctcggat gctgttgcga 3114901 atcctgttgg gccgccccag tttaaagtgg atgagcttgg tagaggcgct tacgtgtacg 3114961 ttgggaaaga cgcaacagtg gtcctaaaca aagatggcca agtggtaacc gcctgggcga 3115021 acagccgggc tggatggaga aatccgtgag caacgttctc gatgctattt caacggagca 3115081 ccgtcccgtg atcgagcaag aattagagaa tcgtaatccc gctctcttcg acgagcttcg 3115141 gcgcacagag aagccaacca acgaacagag cgacgctgtt atcgacgtgc tttccgacgc 3115201 cttgatgaag acctttggac ctgattgggt tccgaatgat tatgggttga aaatcgaacg 3115261 agcaattgac gcatacttag agacgtggcc gatataccga taatcgcttg acaccaacta 3115321 ttgccagcac caggcgccta ccgtgcatcg ggagcgcggc cgggctggta ttcgcgtggg 3115381 actgaaggag cttaggcagg aacgcacatg acgtacgcag ccagggacga tacgacgctc 3115441 cccaaactgc tcgcacagat gcggtgggtg gtgctggtgg acaagcgtca gctcgcggtg 3115501 ctgctgctag agaacgaggg accggtcgct tccgcgacgg acacgttgga tacgcgcggt 3115561 gatagcgact atgaaaacca gccggtcgac gcagtggagc ggctatgtcg gcgtttggct 3115621 gaccaggcgg tgcgtcagtg gggttttatg cagggcctca agcagaagct cggaccaggt 3115681 gtcgacgtgc ggatgaagct ggtggagtgg aaccgatgag ctttaatggc tcttccggaa 3115741 tcagagtgca tggatcagct gagccaggtt gccgcagtgc agtagtgaac ggattcggta 3115801 gtgggtgagg tttctgaatc cgagggcgtt acggcatagg gcttccagtc gtccgttgat 3115861 ggcttcggtg ggcccgttgg acgcgtggtg gtcgaagtag gccagcacat cgtggcggca 3115921 gcgccacagg gtgcggccta gtttggccag ttcctctagt acgacaggga caccggttcc 3115981 ggcgctgacg gtgagcagcg cgggggcttg ccgtacccgg gtttgttgtt tgtagtgccg 3116041 gggcggctcg gtgatcaggt cattggtagg cgacggccct cccgtcgtct cttgccggag 3116101 tgctacggga gggccgcctg tgtgcgcttg gaggcgcagt ggtcaccgta gaagcagatg 3116161 tcgatcaagt cgagcgtcgg ctggcggccg gtgagctgag ctgcccgtct tgcgggggtg 3116221 tgctggcggg ctggggccgg gctcggtcgc ggcagttacg cggcccggct ggtccggtgg 3116281 agttgtgccc gcgtcggtcg cggtgcaccg ggtgcggggt gacgcatgtg ttgttgccgg 3116341 tgagcgcgtt gctgcgccgc gccgacacgg cggcggtgat cgtgtcggcg ctggcggcga 3116401 aggccaccag ccgggtcggg ttccgccgga tcgccacgga tgtggctcgc ccggcggaga 3116461 cggtgcgggg ctggctgcgc cggtttgccg agcgtgtcga ggcggtgcgg tcggtgttca 3116521 cggtgtggct gtgcgcggtc gatgccgatc cggtgatgcc ggatgcaggt ggcggcgggt 3116581 tcgtcgatgc ggtggtggcg atcggcgcgc tcgcagctgc catcgggcgc cggttttcgc 3116641 tgcccacggt gtcgctggct gagaccgcgg tagcggtgtc aggtgggcgg ttgttggcgc 3116701 cgggctggcc cggcgagtgg gtgcaacacg agtcgaccct gccgtagccg tcgatcgggc 3116761 cgtaaacctg tgcgctgtcg tgtgttttga cagacagcaa atggaaagga gcggccggtg 3116821 gcggtcggcg atgacgagga gaaggtgcgc gcggagcgcg cgagggcgat cgggttgttt 3116881 cgctaccagt tgatttggga ggccgccgat gcggcgcatt ccaccaagca gcggggaaag 3116941 atggtgcgcg agttggcctc acgcgagcac accgatccgt tcgggcggcg ggtgcgcatc 3117001 agccgccaaa ccatcgaccg ctggatccgg ggctggcggg ccggcgggtt cgacgcgctg 3117061 gtgcccaacc cacgccagtg cacaccgcgt accccggccg aggtgctgga gctggcggtg 3117121 gcgctgcggc gggaaaaccc gcagcgcacg gcggcggcaa tccggcggat cctgcgtacc 3117181 cagttgggct gggcgcccga tgaacgcacc ctgcaacgca acttccaccg gctcgggctc 3117241 accggcgcca ccaccgggtc ggcgccggcg gtgttcggcc ggttcgaagc cgagcacccg 3117301 aacgccctgt ggaccgggga tgtgttgcac ggcatacgga ttgatctccg caagacctat 3117361 ctgttcgcgt tcttagacga ccattcccgg ttggtgcccg gctaccggtg gggccatgcc 3117421 gaggacacgg tgcggctggc cgccgcactg cgcccggcgc tggcctcccg cggcgtgccc 3117481 aacgcggtgt atgtcgataa cggctcgccc tatgtggatg cgtggttgtt gcgggcatgc 3117541 gcgaaactcg gtgtgcgcct tgttcattcc acgccaggtc ggccgcaagg caggggcaag 3117601 atagagaggt tcttccgcac cgtgcgcgag cagttcctgg tcgagatcac cggcgaaccc 3117661 gacgtcgtcg gccgacatta cgtcgctgat ctggccgagt tgaatcggct gtttacggcc 3117721 tgggtcgaaa cggtttatca ccgcagcgtg cattccgaaa ccgggcagac cccgctggcc 3117781 cgctggtcag ccggcggccc catcccgctg cccgcccccg agacgctcac cgaggccttc 3117841 ctgtgggagg agcaccgccg cgtgaccaag accgccaccg tctcgctgca cggcaaccgc 3117901 tacgagatcg acccggcgct ggtcggccgg aaagtggagt tggtgttcga cccgttcgat 3117961 ttgacccgca tcgaggtgcg gctggccggc gcgccgatga ggcgggccat tccgtatcac 3118021 atcgggcgcc attcacaccc gaaagccaaa cccgaaaccc ccaccgcacc gcccaaaccc 3118081 agcggcatcg actacgcgca gttaatcgag accgcgcacg cagccgaact cgcccgcggc 3118141 gtcaactaca ccgccctcac cggggctgcc gatcagatcc ccggccagct cgacctgctc 3118201 accggccagg aggcccaacc gaaatgatgc acaaactgat ctcgtattac ggtttttcgc 3118261 gcatgccatt cggccgcgat ctggcaccgg gcatgctgca tcgccacagc gcgcacaacg 3118321 aagcggtcgc ccgcatcggc tggtgcatcg ccgaccgccg catcggcgtc atcaccggcg 3118381 aagtcggcgc cggcaagacc gtcgccgtgc gcgccgcact agcgagcctg gatcgcagcc 3118441 gccacaccat catctacctg cccgacccca ccgtcggcgt ccagggcatc caccaccgca 3118501 tcgtcgcctc gctcggcgga caacccctca cccaccacgc caccctggcc ccacaggccg 3118561 ccgacgcgct agccgccgaa caagccgagc gcggacgcac ccccgtcgtg gtcgtcgagg 3118621 aagcgcacct gctcggctat gaccaactgg aggcgttgcg gctcttgaca aatcacgacc 3118681 tcgactcgtc aagcccgttc gcctgcctgc tcatcggcca acccaccctg cggcggcgga 3118741 tgaaactcgg cgtgctcgcc gcgcttgacc agcgcatcgg actccgatat gccatgccgc 3118801 ccatgaccga caccaacacc ggcagctacc tacgccacca cctcaagcta gccggacgcg 3118861 acgatgccct gttctccgac gacgccatcg ggttgatcca ccagaccagc cggggctacc 3118921 cccgcgcggt caacaacctc gccctgcaag ccctcgtcgc cgccttcgcc gccgacaagg 3118981 ccatcgtcga cgaatccacc acccgcaccg ccatcgccga agtcacggca gactgaacac 3119041 cacaccgaca ccccgaacac caccgacccc gccggacatc tcccggcggg gtcatttcat 3119101 gaccaaacgt cctcaccgtc aacgccgcca tcatgctcat cctgaatgcc ggtcaacaga 3119161 cgcggtggcg acccagtcgt cgtagtttcc gtcccctctc ggggttttgg gtctgacgac 3119221 tcgggcacgg ccgaaacacc gcgcgaaggg cggttcaagt ttccgtcccc tctcgtggtt 3119281 ttgggtctga cgactgggag gatgtcactc ggacatagct gtcatcggcg gtgtgtttcc 3119341 gtcccctctc ggggttttgg gtctgaggac atggagcagt agcgtggctg tggtgtggcg 3119401 ggcgatatgc gtttccgtcc cctctcgggg ttttgggtct gacgactgct gcacctcccg 3119461 cacccggtgc gattctgcgt ccagtttccg tcccctctcg gggttttggg tccgacgacc 3119521 ccgatagtcg cgctcgtcca tgtcccacca tgagggtttc cgtcccctct cggggttttg 3119581 ggtctgacga ctacctgata gaagccggaa agctccgtgc cgtcaggttt ccgtcccctc 3119641 tcggggtttt gggtctgacg acagggcact ggacctgtat gaggcacaga tggcgtacta 3119701 gtttccgtcc cctctcgggg ttttgggtct gacgacccgg atcggttacc cacgccgatt 3119761 tactggccat cgtcgggttt ccgtcccctc tcggggtttt gggtctgacg acacttgcgc 3119821 gcacaacgca tccgccatcc acggggcgtt tccgtcccct ctcggggttt tgggtctgac 3119881 gacctgaaag ggggactgtg gacgagttcg cgctcaaaat gtttccgtcc cctctcgggg 3119941 ttttgggtct gacgacttga acacgccgat acctatttgg tcgggagtga taaagtttcc 3120001 gtcccctctc ggggttttgg gtctgacgac cggacttgat cgacgcgaac ctgtctgacg 3120061 cgaacctgtt tccgtcccct ctcggggttt tgggtctgac gacggctgga aaagggcgcg 3120121 gggcaaccgc atcgtcaaga gtttccgtcc cctctcgggg ttttgggtct gacgacgcgt 3120181 tgtggtcgtg tcgtggagcc tgtatttcgc tggtttccgt cccctctcgg ggttttgggt 3120241 ctgacgacca ttagttggtg ttgtgatcgc taaacgccgg ggcagtttcc gtcccctctc 3120301 ggggttttgg gtctgacgac ctatccgcgg gaagagatca cgaatccggc gtcgaagggt 3120361 ttccgtcccc tctcggggtt ttgggtctga cgacatgctg agctgaggcg ccggatgatg 3120421 gtggtgctga aggtttccgt cccctctcgg ggttttgggt ctgacgactg acagggtgcg 3120481 gtggtcgctg atcggctccc cgagtttccg tcccctctcg gggtgaaccg ccccggtgag 3120541 tccggagact ctctgatctg agacctcagc cggcggctgg tctctggcgt tgagcgtagt 3120601 aggcagcctc gagttcgacc ggcgggacgt cgccgcagta ctggtagagg cggcgatggt 3120661 tgaaccagtc gacccagcgc gcggtggcca actcgacatc ctcgatggac cgccagggct 3120721 tgccgggttt gatcagctcg gtcttgtata ggccgttgat cgtctcggct agtgcattgt 3120781 cataggagct tccgaccgct ccgaccgacg gttggatgcc tgcctcggcg agccgctcgc 3120841 tgaaccggat cgatgtgtac tgagatcccc tatccgtatg gtggataacg tctttcaggt 3120901 cgagtacgcc ttcttgttgg cgggtccaga tggcttgctc gatcgcgtcg aggaccatgg 3120961 aggtggccat cgtggaagcg acccgccagc ccaggatcct gcgagcgtag gcgtcggtga 3121021 caaaggccac gtaggcgaac cctgcccagg tcgacacata ggtgaggtct gctacccaca 3121081 gccggttagg tgctggtggt ccgaagcggc gctggacgag atcggcggga cgggctgtgg 3121141 ccggatcagc gatcgtggtc ctgcgggctt tgccgcgggt ggtcccggac aggccgagtt 3121201 tggtcatcag ccgttcgacg gtgcatctgg ccacctcgat gccctcacgg ttcagggtta 3121261 gccacacttt gcgggcaccg taaacaccgt agttggcggc gtggacgcgg ctgatgtgct 3121321 ccttgagttc gccatcgcgc agctcgcggc ggctgggctc ccggttgatg tggtcgtagt 3121381 aggtcgatgg ggcgatcggc acacccagct cggtcagctg tgtgcagatc gactcgacac 3121441 cccaccgcaa accatcgggg ccctcgcggt ggccctgatg atcggcgatg aaccgggtaa 3121501 ttagcgtgct ggccggtcga gctcggccgc gaagaaagcc gacgcggtct ttaaaatcgc 3121561 gttcgccctt cgcaattcgg cgttgtcccg ccgcaagcgc ttcagctcag cggattcttc 3121621 ggtcgtggtc ccgggccgtg cgccggcatc gacctgcgcc tggcgcaccc acttacgcac 3121681 cgtctccgcg cagccaacac caagtagacg ggcgacctca ctgatcgctg cccactccga 3121741 atcgtgctga ccgcggatct ctgcgaccat ccgcaccgcc cgctcacgca gctccggcgg 3121801 gtacctcctc gatgaaccac ctgacatgac cccatccttt ccaagaactg gagtctccgg 3121861 acatgccggg gcggttcagg gttttgggtc tgacgactcg cggcgagcac gtctcaccca 3121921 gcaggcggtg aggttgggtt tccgtcccct ctcggggttt tgggtctgac gacacggacg 3121981 agctggaccg catcagcgat gctgagctga gggtttccgt cccctctcgg ggttttgggt 3122041 ctgacgactt gtctcaatcg tgccgtctgc ggtgacacgc tccaagtttc cgtcccctct 3122101 cggggttttg ggtctgacga ccaccaggat cagcgccaag ccagttagcg caatccagtt 3122161 tccgtcccct ctcggggttt tgggtctgac gacctcccgg accatctgca gctcgcccgg 3122221 gtccatgcgg tttccgtccc ctctcggggt tttgggtctg acgaccggag tcatccgcgc 3122281 gggccggcgc gattgttgcc gggtttccgt cccctctcgg ggttttgggt ctgacgactg 3122341 gcgatttacg acgctgacgg gaactcgtgc gaatgtttcc gtcccctctc ggggttttgg 3122401 gtctgatccg cgaaattcac tgcgcgttat tcaaggtttc cgtcccctct cggggttttg 3122461 ggtctgacga cccgagccga ccatccgcat cacaccgaaa gggttggcgc aagtttccgt 3122521 cccctctcgg ggttttgggt ctgacgacac gtggggagag ggaatggcaa tgatggtcga 3122581 cgaagtttcc gtcccctctc ggggttttgg gtctgacgac ctcggacagc atctccccgg 3122641 gcgggcagca gatatcccat gtttccgtcc cctctcgggg ttttgggtct gacgaccgac 3122701 ccgtggccgc caggttgccg ccgccgttgc tcacctggtt tccgtcccct ctcggggttt 3122761 tgggtctgac gacccggaag tcaactagag cgggtgtcga acgctgcccg gtttccgtcc 3122821 cctctcgggg ttttgggtct gacgacatgc gaatccgctg tcagcacatg ggattccgag 3122881 tgtttccgtc ccctctcggg gttttgggtc tgacgaccta ggcggccccg gcgaggctgg 3122941 gggcggtttc acgcgtttcc gtcccctctc ggggttttgg gtctgacgac cagcgcagac 3123001 ggcagccccg agtactcgct ctcctcaggt ttccgtcccc tctcggggtt ttgggtctga 3123061 cgacaggctg aaattgaagc cggaaatgac gacgcattgg tgtttccgtc ccctctcggg 3123121 gttttgggtc tgacgaccta agcccgctaa tcccgcacaa gtggtcagaa aagtttccgt 3123181 cccctctcgg ggttttgggt ctgacgacct gatgattggt cggcgtatga cgtgctactg 3123241 aggtgttgtt tccgtcccct ctcggggttt tgggtctgac gactagaagg cgatcactgg 3123301 aagcacggcg cttgcgagtt tccgtcccct ctcggggttt tgggtctgac gacttggtca 3123361 aaagctgtcg cccaagcatg aggcaaaaag tttccgtccc ctctcggggt tttgggtctg 3123421 acgacacgac taggggagcg tgatccagag ccggcgaccc tctatggttt ccgtcccctc 3123481 tcggggtttt gggtctgacg acgtgcaaga attccgggtt gcagtgcaac acggttttaa 3123541 gtttccgtcc cctctcgggg ttttgggtct gacgactcta tggacaattc gtccagcgtg 3123601 tggtaacaat gcctgctgat gatgtcaaaa gaacacaaac tcctctgcgc tgacaagccg 3123661 tccccttccg tagaacgtaa ctgccgcaac acctcttatc ttatagatcc ggatgttgtc 3123721 gcagtcgatg gcgaagcggt cgatacgtgc aactagtttc gcgagctggc ccttcgtcag 3123781 catcgcttcg aatgcggact cttggacgcg atagccaaac ccggccagga tcttcgcaag 3123841 tgaagcccgc cgccggttgt cgctgatgtc gtatattacg aggacgaaca tcttgcctat 3123901 agtgccgctg gactcgtcca ctttgagcgg gagattgaag tactcctcac ggctgcgagt 3123961 gggcatttag gctccggatg gctcggaggt gatatcgata tcgacgagcc gcgacgggtg 3124021 cccggcttcg ataacacgca cgaggctttg cagttgcaag tcgagggcgt actgaaaggt 3124081 gtatcggtga ggatcgcctt tgatgtaggt ggcggttcgt gcgattcgat taccaaaggc 3124141 gcgcgcgatg gatcgtgtgg cttcccgtgt cgcgaagacg gcccccgtgt cggagttctt 3124201 gctgaaagcc cgggtgtcga ccacaccgtc cgcgatcaat cgaagtacgg tgtcatcgat 3124261 gatcggcgcc cgccatacct ccatgaggtc gctcgccaac gttgcgtgcc ctcgtgaatc 3124321 ctggtgtagg aaaccgatat acgcgttcag gctgtgacgc tcgatcgccc ctatgatgtt 3124381 cttgtacagc agcgaatagc cgaggctgac catcgagttg aaggcgtcca acggcggccg 3124441 agtcgagcgg ccctggaatg cgaactcctg cgggacgaga tgccccagcg cggtgaagta 3124501 tgcctttgcg gcatttccct cgaacccgtt caactccgcc agggagcccg atcgatcgac 3124561 ccaggccagc gagtgcttca tcgtgcggat gctctcagca acgtcttgcc ccgacgtgtg 3124621 tgcccgaatc aaggcctgct gattcaggat cttcctcgac acgatccgct tgcttaacga 3124681 caggcagaac gcaggatcgt cggtgcggtg aacttgctga cggagccgcg gcgcgtatga 3124741 cacgtcgggt gttgagatcc ggccctggta gtggccgtcg gtcgtgaaga gctggatgtc 3124801 gcgctcacgc ttgagcatct caacgatgaa gggcgttgtc atcgtcggcc gcccaaacag 3124861 cgtgatgccg tccagcgtct cgatcggata ctggctctcg ccgagctcct cgctccacac 3124921 gatcacccgg ccgtcggcaa agctgatccg cgacacggag tccgagacat acagctgcac 3124981 catcttgcgc acctgttagc ccagcggtgc catatcaatc tgccggatga tctcgtcgtt 3125041 caaccggtca tagagcgtca aatcagcgcc tgtttcgcgg gcgaggatct tcagcagttg 3125101 ttctgggaga aggccgccat ccttcgtgat gcgatcctca ctgattgaga cgatctcgtg 3125161 tgctgcggtg ttgcggaccc ggctctcgaa ccttccgagt acttcaagag caccaactcg 3125221 atcgggtgcg aattggcgga gcagtgcgag ccagtccttg gtgtagaggt accactccgc 3125281 gtttggcgat ttcggagggt gcttgagcgc gcaccgtatc tccggctctc tttccagctt 3125341 tcggcggtcg acgcggccca tgtcgtcgag atagcggtcc tccggaaggt gttttgccac 3125401 agccgccctg agcacgatag tgattgccgg ggtagctgat cgtgcgaatt cagcccattg 3125461 ctcgcgcttt gccagcagcg caagagcact tatgtactca gcgaccttgt tcgcggggtc 3125521 atacgtgaac gcggtgtcct taaagaactt tggcgctacg aggtgttcca gcctcgagcg 3125581 gtgcatcgcg ccgcggatca gattgctcac ttgatcgggc aggcgcgagt ctgccgcgat 3125641 cgtcactgct gccgagtagt cgtacgacac gatcagctgc ttcaggttgg cccgctcaag 3125701 cagcgcgccg agcgcagcgg aagtcgcctc aaagcaacgg ttgggggctc caggctgatt 3125761 gtcgtcgttt gcgtcccaca ttagttcgag gtcgtaagcg tctggggatt cacgatcgcc 3125821 aggcttgctc aatgcccggg caggcgtgct tacttgcaca gcggtggtcc tgggaatgcc 3125881 aaacacattt atggccacca gcgccgcctg catcgcaggg gtgccggaac tggtattcag 3125941 cagaatggtt cgatcaggga actcagccga cagttcaacc aggtggttgc ggaaaaccgg 3126001 cacgaaaagg tcgaacctgt gcaccgacgg gttggtatag gtgactatgc gaacgtcggt 3126061 ctcaggcgcg agccgcgtga ttgccgcgga gtaccgccgg tccgcgttct caaaggcagc 3126121 tatctcggcg ctgaggaata gcacgacaac tattggtcga tagtggcgga cgatgtgtag 3126181 catcgggccg tcgccgagcg cggtgatcgg gtccgcagtt ccgataggcg agaacaggat 3126241 cattcggctc tcctgatcga cagctcgcac tgacccatct cgtagcatat gttgtcgatc 3126301 ttggttcgct tcaagacaag tggtgagacg cgtagttcgc gcgtcttgtc gacgtgcttg 3126361 actaccttcc cgaactgggc gtcgagcacc ttcgccatgt cgtcttggtc ggtgacaaag 3126421 gtcttgctcc gatagccggc tccgccgccc agatagacaa ttgggccaac tatcgcgttc 3126481 acgccagggt acatggctct gtactccgcg taacgcgcct gattcacgga cgcggctgtc 3126541 tcggccagcg tttcaaggaa ccgctcgccc tcacgccagc cgccgcgagc ggtgggactg 3126601 gtgtcgacca ccacgcggtg cgagattgag gttcccggcg ccaaacattc ccggaagagc 3126661 ggcaggccat caggcttgcc gtggacattc atgtccatct tctggcagat cagcagatcg 3126721 cttgttctca gtgcaggtga gtcggtgacc ctgatcgcct gaaacaggtc gttgaccgcg 3126781 tcttgcggac gggtgttggg gcgccccgat ttgcgcaact ccttccgctc aaacctttcg 3126841 ccgtactgcc ggtgctcccg cgtctggtgt cccggaacac gaacaggttg ggccgtccgc 3126901 ttatgcacaa gcgactgcag gtagatgctg cgaagcattc ccttgacagt cgaacccggc 3126961 acgtagggcc ttccaagagg gtctttgatg aaagcgtgaa tctcgttgag cgtaagcttc 3127021 tttcgagtca tgcgcccgcc tcgaccacga gatgcacgtc gcggttcgat cgacccgatc 3127081 ttcacctcgt aacctcgatg cttagcagga tccagcttga ccgcgtttgg ctctacccac 3127141 tctttgagtg gcgccgtcgc ctgtgcccca tcggtgttca tgacgaacgc ttcgaaagac 3127201 ttcctcttgt gagccggaat gtctgcgtaa agaagttcca tgtccgggaa gtagacccgg 3127261 tcgccctcca cgtggtactc cttcgaggtc cgcttctcgc cggatccgat aaacaccggc 3127321 cccaggcacc gcagcgtgag ttcgaacggc ttcaggtagg tgttcatgcg gcggactccg 3127381 ggagtgcgag aaatagcggt cgcgcgtagc tgtagaccgg atggtttccg cccaggctga 3127441 cgtcgaggat gcctccttgg aagggtcgcg agaagaccga gccggcggcg aatttgtaga 3127501 tgtcgcgttt gcgcaggggc atgtcagcgt atgtgctcga cgcgacgaat ccactgcgct 3127561 tgacgaggcg gtacgtcgcg ccggcgagtg cggcttcgag ctcgtcgtcc gtgggtaggg 3127621 atgtcgtgag cgtcatcaga ctggccgcgt cgactgtcgg cgtgagtgcg gcgggtgctt 3127681 ctgactcggt aaggttaaac gctccgaacc cgcttgtccg ttcgccgccc agcgcggaga 3127741 tccctttcaa cagcctggtg agtaggccga gctcggactc ggatccggtc gccagcaacc 3127801 acagacccgc gtccagctcg aaccggaagt agccgacacg gtacgggtcg gcgtctttct 3127861 ttccgttgtg gatcgctgcc ttcgctgaca cggcgtggac accgatcttg gtctgccgcg 3127921 ccgcgagttc tttcaggtcg gccgtgccat cgaggaagct gccaagctgg gcagcgggaa 3127981 gaaagccgat cttcttcgcc agcttcttct gcatacttga gccgtcggac cgaacgctgt 3128041 gcaggggctt gggaaccagg taatcgggcc ccacataggg cagcagatcg gtcaaccgca 3128101 gcgtcgagca cgcaacgagt tcgccaagca gctgctggcc acccatccgt agcgcttcaa 3128161 cgcaaagcgc agagtagagg gtgtccgcgg ggcagctaat cgtggacgac tcgaggccgt 3128221 ggtcgccgaa gtgtgtgcgg tcgaagtcga acctaaacag ccgcgagttc atggtttagc 3128281 ttctccagca gagaaccgtc gagggcgccg actgcggcgc gggctttcag gttgctgaac 3128341 ttgacctgcc cgtagccacg ggttccgctg ccgccgaggt agtcgagttc gagcaacttc 3128401 aggccgcgcg cgatggcgtt gaagtcctcg atgatctcat cggaggaagg cagagacgcc 3128461 ttctgttcct cgccgggggt gccgaaggag acctcgtaga caagtgagaa cgcgaactcg 3128521 ctgccgggga tcacgcgttc catctggcga aggtttgcct ttgcggtcac ccggttgatg 3128581 gcgttctcga atttcacctc ggtgagagtc ttagcgccgc gggcttcgag gtcgtctttg 3128641 ttggtgagct tcgtgtcgcg gaagacgagt cggcccgtca tgtactcctc ggtgtcgccg 3128701 aaaagccgac ggatatgggc gtggtcctca ttcggcttcc tgtaaaacgt ttctgtgtcg 3128761 gcgccgtatt ggcgggacag caaggtgcgg accttgccct tcaggctggt acccggaatc 3128821 atcggcagcc tgctcagcgg atcacgaacg acaggcttgt cgaccgcgcc gatggcggag 3128881 aagccatcgc cggccccgat ctgcaggccc gtcaggacgg tcagtgtccc ggttatctcg 3128941 atcttggcgt agctcgtagt cattgggttg tctcacttgt ccttcggatc gaggtacttc 3129001 ttgtatgcgg ctagggcttc catgtaccgg cagaatcgca gcagcccgtc gcggctatcg 3129061 cctatccctt ccagcgcttc taggagtttc gcgtttcgga cgaatgtctt aaccgcgtct 3129121 tcacgcccgg actggtagac gaaccggacc cgcaggtact ggaccttctc cttcagctga 3129181 cgcgggagcg tggggttggc gctctgctgc gcctcgtcga agagctgtgc ggtcaggctg 3129241 agtagcaccc gcagctgggt tgtggtcagc tcgaagccgt tctttttctt tggcaggccg 3129301 cgaattactt cggcctgttt cacatagtcg tcttggatga cgctcattcg gactcctcct 3129361 tgcgagtgcg atagatgtag aggtgcagcg cggtcttgag ttgcttggcg tctgtcggat 3129421 cttggaacca ttggtgtagc cggttagcaa actgctgaaa aggcgctgtg tcaccggtgg 3129481 ggttacgcat gcgcgtgagg aagtacaccc atctggcctt tgtgattcga tcgtcgcgtt 3129541 cggcgagtag ttcgagcagc ttgtagatga aggccatgcc gcgttcttcg ttgccactga 3129601 aatagtcggc gatgtgccgg tacttctcct cgatcacctt gctgagcagc tcatcccagc 3129661 cgaaggtgaa ctcgcgatcg aagagtgcaa ccccgttctt gccgggcagc gacttcgccg 3129721 cgtcttcgag atctccgact tcgcgggcca tcacggagat ggggtacttg tcggggaaca 3129781 tgccgatgcc agccgacacg gtgagtttgc cctgggtgaa ttcgtggaac cgctcccgaa 3129841 gctcgatccc gaactcgatg acgtcgtccc acgcgcccac gacgaagacg tcatcgccac 3129901 cggagtagat gatcgtggcc tcgcggggcc gcgccgggtc atcgccggtg atcgggcgca 3129961 gtttcgggcg tgccaacacg tagttgatgt gctgccggaa gaacaacgac agcatccggg 3130021 agaacgcggc cgtgcggcta atcgtgttga acttgccgtt gccttgctcc atgaagccgt 3130081 gcgtgaatgc ctggcccagg ttatcgacgt caaggcgcag aaccccgagg cgcgcgattc 3130141 cgctcgcacg cttcacgtag tcaccgaact ccatctgtgc gacgtagtcg cccacccaga 3130201 gcccggtgcc caaacactcg ccggcgaaga acttgttctt cgcgtaccgc cttcgggttt 3130261 ggggttgctg gagtgcctta tcggcgtcgg ctcggctaca gaacgtgagt gtggcgccga 3130321 acggcagggg cagacctttg gtggcgccgt cagagatgag taggaagcgg cgagactcgg 3130381 attgaatctg cgaagacgca gcggtcagcg cttggcacag gctgcacttt ggctcgtcgt 3130441 cggcgctgac cgtgcggttg accgtgtggc acacgctgca ttcccggtca cctttctgac 3130501 cgtcgtgatc gcgcgagttg agttcccgca gttggtcagc gctgtatcgg gcgagcttct 3130561 tcgcggaaag ttgctcgctc aactcacggt agagcccgct gtagcggagg gcgcggttac 3130621 ttgcctggct cgcactctcg ttcggccgac gcatcaggtc gttcgcggca agcggtacgc 3130681 tgcccgtggc gatgaagagc cgggttgcga agttttccag cagccagtcg ttggcctcac 3130741 gctcgaactg ttcgacggat ttccgcgcgg actccgtgtt gggcagcagc aggtacgcgt 3130801 gcccgccgcc ggagtagttg agattcgcgc ggctgagacc cacccgcgca agtagctcgt 3130861 cgatgagatg ctcggtcagc atctccaggt agaagctgcg ggcacgcagc atcttcgcgg 3130921 cacccgagga atggatcgtg tagatgaagt cctggatgcc tgagacgtcg aaagttgtga 3130981 gcaggaaggc tttttcgttg tagaaggtgt cctgcttgtc gaacagcgct gacttgaagt 3131041 cgctttgtcc ggtggcttgt aggtagtgcc agatgcaggc gccgagcgca cccgtcagct 3131101 tcaggtggtc gaagagtgag acgtcgacga cctcggacgc gtcggtcgag gacggcacga 3131161 acgacagcgt cgcctcgagg acgttgagga ggctggcgag gtaggtgtcg gaacgttcga 3131221 ggtcgaccag aatggcttta agtttgttga cgatggcggc gtagcggtcc ttgtcgaatt 3131281 cgatccggcg tggcgacggt atattgatcg gcttgcggtc gtcgagcatc tccggggcaa 3131341 atgccagatt cgctgtgccg gagccgaatc ggttgaacat cgaatacagg ggcgtgtccg 3131401 gatcccaagt gctcgcacca tggccgtcgt cggagtcggc cttgcggcgg tcggttccgg 3131461 ccgcgatatt gtaggcgatg taggccggcg catcggcggc aaggcggcca ttctcggccg 3131521 ccgtacgcag cgcagaactg tggtgatagc tgatcgcgtc gagaatgcgg cggtcggaga 3131581 ccccaatgtc agcctcatcc acctcgtcgg tgaactgcga cggattgcgg ctgtcgcgca 3131641 accacacctt cttcataaaa gcgcggccaa tcgcactgtg cctgcccggg tagccgagcg 3131701 ccgcgcgctg gaccggtttg ccaatgtcgt gcaagaggca gccgattatg gcctcgatga 3131761 gttgcgggtt catggcttcg gtacgcattt ttccctcggt gccagtggct ggacccggat 3131821 cgcgcccatc cccatggatg cctttattcc gcatcccgag aactccccga accacaacag 3131881 cgccgcgata tagctcgcaa aagtatccac accgcggacg gtgaacgtgg ccgagccggt 3131941 gaagccggga acacgcgccg cgcccaccgc gaacggggcc gacgccaccc ggaacgcgga 3132001 gaggcgaacc gactgaccga attcggcgat gaggccagga tcgggctctt cgccgtcgac 3132061 aattgcaccg tacttctgcg cgagactctg aaacacgagc cgcggatccg gccagaacac 3132121 gtactcgccg gattgcttga atgcggtagg cgtcaggaac tcgacccgga acttgcgcgt 3132181 ctcgggccgc gcgtagaaaa tgcgcgcgaa ttgacttagc gggttctgct ccagcgatcg 3132241 cgacgtgacc tgtgtcgcta tcccgctcgc acggagccga aaacccgcaa acgccgcgtc 3132301 gttgataggt ccgacgatct gctgccgcgc ctcgttcgtc agcgtgctga tcttccactc 3132361 caaagatgtg gtcgagcggg ccagcgcgta ctgactgtac gggttcaccg gcacggtgtg 3132421 gagggtctgc acataatcgg ccgggatcga ctccatgagg acgccatgaa gatgcggccc 3132481 cagggtcgcc accctcgcgc gttcgagcgg ggcatcaacc tctagagtca gcgtcaatcg 3132541 cgacaagtgt tccgtcatcc ggcgatctcc tcggtgagaa aagcccacca gaataagcgt 3132601 tggtgaaatc caggtcaagc ctgattccgc cgcactcgcg cgatgtcggc ctcggggctg 3132661 gccactccga cgtagcagat ccgtccttct gatgccgcct cggcgagcag ccaagggatg 3132721 gcttcctaac gagccggggt tgtagcggtg ggcgccggcc gcgcggagga tggcgccgcc 3132781 aatgctgctg ttgcccgcgc cgtcaattga cgccagcagg gaccgaaccg acagcagatc 3132841 actcgcagcg ccgactgggc gtacagctca gacccaagcg atgccgccca gtcaacccac 3132901 ggcctcgcgg acccgggcgg cgacctcggc cagcgcggcc tcgtcgtgca ccggcgccgc 3132961 caacgtcggc gtcaccggca gctgcaccca gctggtgcaa ccgccgtact cgggcctacg 3133021 cgccagccgg accggctcgg ccagcgggat cgccgagacc accaagacgg ccagtttgtg 3133081 cttgggccga aagtcgagcc ggtcggcgcg caccgactcg gcggtccaga tgtgcagatc 3133141 ctcgatggcg tccagaccct ctggccggtt aaccggcagt gcggcaacaa ctttcgctgc 3133201 ggcccgcagt agcacacact cgtcggtgct gtcggcggcc gccgggccca gcaggtcgcg 3133261 gtgctcgggg cgaacccgct cggcgtggct gtgcgcgacc gtcgggaaca acaagaactc 3133321 gtgggccgcc acctcgaagc gcttctcgcc gatcccgccc ttacgcagca gcaccgtctg 3133381 ccggccgtcc agcagcgcgt gcaccgccgc gctccactcc ttcagcgctg gcgtcaccac 3133441 gatcccgcga gccggaccga tgtccgaatg acgccagcac cgcagggttc cgaggggacg 3133501 ccgatcatct ccgagacgtt ttgccccggg cagtttcatt ggtcctgctg aatcaggccg 3133561 gtcatccagt gcatccaata gtgatgacag tactcgtgtc ttgctcacca ccacagccgg 3133621 attcgtgccc aactgctctc atctagtcga ttcagccgcg tccagccgca accgtgccag 3133681 cggaacggca cgatcgccgg caggttgatc aggaccgcag caccgccagc gcgttctcca 3133741 cttcgcggcg gtgccgttcg tcgcaggcgg cccaccgctg ctcgtcggcg tcgaggtcag 3133801 tgaggaacgc aaatcgcttc cgaacgcgag cttcccaggc agccatagcg acaggacggg 3133861 tcagcacgcc gatcgagtcg ggctggaagt cgtgctcgct gcgggcggcg aggacgtctt 3133921 cgacgcgtag tggccgggtg ccgcgccggt catcgacgac atcaccccac accttgagca 3133981 cccacagccg ccgcacgagc ggttcgtcaa tcgttcgcga ggcgaagtgg ttcaggtcgt 3134041 acaggtcccg tgccagcgca acgcggcggt accgcgcgag tttctctgcg caggcttccg 3134101 cttctgccac gaccggcagt gtcggcagcc caaaaccgta agccttatgg atcggcaact 3134161 ggatgaatgc gagcagctca gacggcaaag ccaacggccg ccgtgcgaac tcgacgctgg 3134221 cgacgatccg gggctcgccc aattccgtgt gccgcacccg caactgccaa tgccggccgt 3134281 cgcctcgtgt gctctgcacg ccgaattcga agccgccgac acgggcgccg tcgatcagct 3134341 cgcacacctc cagcacgacc tcatcgtcgg gcgcgctgaa gtccagatca gtggagaacc 3134401 gcccgacgtt gcccagccgg cacttccgta agctggtacc gcctttgaac accaggcggt 3134461 tatcgccgaa ctggacggtc tgcgacagca ggtacagcag gtggtcctgg gcgacgtcga 3134521 gcagagcggc gtcgtatgcc tcggcccgac caagagcgtg acgcgcaacg agcgcacggg 3134581 tcagaccggc cacagtcacg ccttgccgat cacgcgcagc agcggtacga cgagctcgtc 3134641 gacaagctga tactcgggag cccagacact ctcgccacgg tcgcggctgt gcgcggtggt 3134701 gaatcgagtc accggcatca cttcggtgtg ccgcttggcc agcagcgcct ggcctcgtgc 3134761 cggttcaccg cccgagtcca gcaggtagct tgcacgctgc caggccgatg tcggccggcc 3134821 cgacagtaga cgctccaggc gctcgtcact gcagtcggcg acgaggtcgt caaggtgggg 3134881 gacaaggtcg gcccacggcc cgaacgaggc cgggcgcgtg gcgatttgca caagtaatgc 3134941 ttctggtcct agcgccggta acccggtcgc ccacgcgacg aggtcgagcc gccgccggac 3135001 cagcaacgcg ggacgcggag ccagcagtgc ggtgtccgcc gcgttccagg ggatgcgcac 3135061 gacggacaca tacgatgcta ggccgtcggg cagccttttg gccggcggca gccagatcgg 3135121 gatgcggccg tcgggttggc ggtccaggta tccgaggtgc cacgctgcgg atgcaccggc 3135181 cagcatgaag cccgcgttct ggtcacgggc cagccacgag cgcagcggta gatacgggtc 3135241 cgagatggcg gcctcgccgg ggggaatgaa tgcccaggtg cctttcaccg gcagttggac 3135301 cagccaccca atgcggcgca gttcgcggat ggcggagtcg gggtcgcgtc cacacccagc 3135361 ctctgtaagc cgttgcgtca gatcctcttt cgtgacgact acgggccgat cgcgagcgag 3135421 gccggacacc acccgtgacg cccacgtggg gatgcgccga tcggcgccgg ctgggctcac 3135481 caccgaactt gaattcacac cggaaactat actatatctg tacgcaacaa tgttcaaact 3135541 caagaaatca cttgatttag gaacgggctt cggtcagtga cagtacgaaa cccgttccaa 3135601 actcaagtgc cctgtacggg ctggcggcga tgcggtgcaa cggcgagaga caaaacgcgc 3135661 ttcgcggacg accggccgac gcgccggaga gtcgccaaga acgtcacccc tgaaatcaag 3135721 tgggaccagg atgcactgac gcgttgctcg gaccagtcac ccaggcgatg cgcctcggct 3135781 caaaaactca acccacggcc tcgcggaccc gggcggcgac ctcggccagc gcggcctcgt 3135841 cgtgcaccgg cgccgccaac gtcggcgtca ccggcagctg cacccagctg gtgcaaccgc 3135901 cgtactcggg cgtacgcgcc agccggaccg gctcggccag cgggatcgcc gagaccacca 3135961 gcacggccag ccgatgcttg ggccgaaagt cgagccggtc ggcgcgcacc gactcggcgg 3136021 tccagatgtg cagatcctcg atggcgtcca gaccctctgg ccggttaacc ggcagtgcgg 3136081 caacaacttt cgctgcggcc cgcagtagca cacactcgtc ggtgctgtcg gcggccgccg 3136141 ggcccagcag gtcgcggtgc tcggggcgaa cccgctcggc gtggctgtgc gcgaccgtcg 3136201 ggaacaacaa gaactcgtgg gccgccacct cgaagcgctt ctcgccgatc ccgcccttac 3136261 gcagcagcac cgtctgccgg ccgtccagca gcgcgtgcac cgccgcgctc cactccttca 3136321 gcgctggcgt caccgcgatc ccgcgagccg ggcagccacg tcgggtcggc gcaacggcgg 3136381 gacggtcttc ggcggctgcc gccggggcgg cagggcgtcc agcaaccgcg tcgtcgtcgc 3136441 ggtcacctcg gcgacggcgg cctcaaacgc ctcggcggtg gccgccgacg ggtgcgtgat 3136501 gccactgacc ttgcgcacat actggcgcgc cgccgccgcg atctcgacgg gcgtggccgg 3136561 gggttgcagc ccgcgcagtt cggtgatgtt gcggcacatg ccctcaacga taggcgcggc 3136621 taccagacgg tgaccggtcg tgggtgccga tgactgcgta gccgccggtc cttggtcacc 3136681 agccgccagc cgtgttcgat cgcggtggcg tagatcaacc ggtcggccgg atcgccgggg 3136741 aacgacgagg gcagcgccac cgccgtggcg gcgaccgagg gcgtgatacc gacggtgcga 3136801 acgtgctcgg ccagctgctg aagccaggac agcaccggaa tcgccagttg gatgcgttcc 3136861 tgttcggcaa gccaagccag ctcgaaccac gaaatcgcgg cgacggcgag ctcgtcggcg 3136921 tgttcgatgg cctggctcgc cgccatgctg agacgctgcg gctcggccga ccaccagtag 3136981 gccacatgcg agtcgagcag caccgtcgtc atgaaacgtt ccacgaaacc ccggtggtga 3137041 agagttcgtc gtcatccgcg gccgccatcg ccacacccga gaatcgaccc ttcagcgcgt 3137101 gcggccccgt cgctgccacc agccgggcca cggtgcggcc gtgtttggtg atctcgatct 3137161 cctcgccctg ggccacttca tcaagcaagg agaggatctt cgccttcacc tccgtagcgg 3137221 tcatttttct ggtcattagg acagtctaac ggtcctgtta cggtgatcga atgaccgacg 3137281 acatcctgct gatcgacacc gacgaacggg tgcgaaccct caccctcaac cggccgcagt 3137341 cccgcaacgc gctctcggcg gcgctacggg atcggttttt cgcggcgttg gccgacgccg 3137401 aggccgacga cgacatcgac gtcgtcatcc tcaccggcgc cgatccggtg ttctgcgccg 3137461 gactggacct caaggagctg gccgggcaga ccgcgctgcc ggacatctca ccgcggtggc 3137521 cggccatgac caagccggtg atcggcgcga tcaacggcgc cgcggtcacc ggcgggctcg 3137581 aactggcgct gtactgcgac atcctgatcg cctccgagca cgcccgcttc gccgacaccc 3137641 acgcccgggt gggcctgctg cccacctggg gactcagcgt gcgcttgccg caaaaggtcg 3137701 gcatcggcct ggcccggcgg atgagcctga ccggcgacta cctgtccgcg accgacgcgt 3137761 tgcgggccgg cctggtcacc gaggtggtgg cccacgacca gctgctgccc accgcccgcc 3137821 gggtggcggc gtcgatcgtc ggcaacaacc agaacgcggt gcgggcattg ctggcgtcct 3137881 accaccgcat cgacgagtct cagaccgccg ccgggctgtg gctggaagcc tgcgcggcca 3137941 agcaatttcg cactagcggc gataccatcg ccgccaaccg cgaagccgtg ctgcagcgcg 3138001 gccgcgcgca ggtgcgttag cggcgatcgc aagcgcggcg aagccgggtg ctgggggtac 3138061 ctcccgcgtg cgggggacgg gtcgccgcca tcagcccttc agcgaagccg ggtctcggtg 3138121 cggctgttga agaggcgcac ctcctgcgag tgcggcacga tcgccaacga ctcacccacc 3138181 cgcaccgcgg tacgccggtc ggtgcggaac acgatgcgcg gtgcgcgtga cgaccagccc 3138241 cgctggtcga ccggcgttgc gtagacgaag gattcgaagc cgagctcctc caccaactcg 3138301 acgtgcacgg tcaacgatcc cggggtgccg atcgatgcca cgtcccagga ctccggccgc 3138361 acgccgacca gcacccgctc ggccgccggg tccggaaccg gtatcgccaa atccggtgcc 3138421 cgcaccacac cgtgggcgac ggcggcgtcg atgaggttca tcgccggcgc gccgatgaac 3138481 gtggcgacaa acgtgttgac cgggtcgtca tacagcgccc tcggcgtgtc aacctgttgc 3138541 agcacaccgt ctttgagcac cgccacccgg tcgcccatcg tcatcgcctc cacctgatcg 3138601 tgggtgacgt agacggtggt ggtgcccaac cgacgctgca atccggagat ctgtgagcgg 3138661 gtgctcaccc gcagcttggc gtccagattc gacagcggct cgtccatgca gaacacccgg 3138721 ggccggcgca cgatcgcccg gcccatcgcc acccgctgcc gctgcccgcc ggagagcttg 3138781 gcgggcttgc ggtccagcag atccgtcagc tccagcatgt cggcgacttc cagcacccgc 3138841 cggcgggtgt ccgcgcgcga catcccggcg tttcgcagcg cgaaccccat gttggcggcc 3138901 accgtcatgt tcgggtacag cgcgtagttc tggaacacca tcgccacgtc acgcgcccgc 3138961 ggcggcagat gcgtcacatc cacgtcgccg atgctgatgc gcccgctctc aatgggttcc 3139021 agcccggcca gcacgcgcag cgtggtggac ttgccgcaac cggacggacc gaccagaacc 3139081 agaaactccc cgtcggcgat gtcgaggtcc aagttgtcga cggtcggcgc gtcggcgccg 3139141 ggatagcgct gggtgacagc agagtactga acgttagcca tgccccgcca gcttccgcat 3139201 gatctgccga tccaggatga cctgcaaccg tttttggatg ttcgtgaagg tcttggtcac 3139261 gtcggctccg cgcagcccga tggattccag gccggcggag atgatccggt caccaccggg 3139321 caggaaaacc cgtgcgtagt cttgtgtccg ggtgtgtggc agctggtcga gcgccacccg 3139381 cgcacgggga ttgtccgcca gatagtgccg ttcgctggca tcgtcgacgg cggacttgcg 3139441 caccggcaga tagccggttt gctggctgaa gtaggcggtg ttcgtcgggt tggtgacgaa 3139501 tgcgatgaac ttgagcgcgt tgacttttcg ctcctcggag agcttggccg gtatcgccag 3139561 ccccgcaccg cccgtcggac aggcgggcgc tgcgtccggg cccgtgggca gcggtgcggc 3139621 gccgaagtcg aatcgggcag atgcggtgat gccggccagc gagccggtgg atgccacggc 3139681 cgaggccagg attccggtgg cgaactcgtt ggcaatatcg ttggcgaccg ccgcataacc 3139741 cttgccatgg atggagttcc gatagaagtt gccggccgcg atcgtggcgg gctcggtcaa 3139801 tgtcaatgtc cacttgtcgg agtaggcacc gccgaatgcc cagttcggtc cctgaaacgt 3139861 ccacgagatg aggtcggcgt tagcccagcc gtgcgccgat cgaccggcgc cgaccacgcg 3139921 ctgtaactcc ggaccccact cgtcgaactc tgaccaggat tgcggtccgc ggtcgggtag 3139981 gccggcctgt tgccacgccg ccttgttgta gtagaacagc ggcgtcgagc gagcatacgg 3140041 cacagcgtaa tggcggccgt tgaactcata gtcggccagc agcgaatcga cgtaatccgt 3140101 tgtgtccacc ccaacttggc cgaacaggtc gtcaagggca gtgagaacac cgctgagggc 3140161 gaaatggaac caccatcggt cgtcgagcaa aacgacgtcg ggcacgtcgg ttccgatgag 3140221 cgccgcattg aatttctgtg ccacctcgtc gtagtccttg ccggcgtcga tcagcttgac 3140281 cgacagagtg gggaatcggt cctggaaacg accgatcagc tcccgttccg ccgcgctgga 3140341 ttggccggga tgactggacc agaagtcgat tgggccggaa ccggacttca ccgaaccgcc 3140401 gccgcccatc ccggcgcagc cggcggtcac gccggcggcg gcagcggcca gcgcgaggaa 3140461 ttgtcggcgg ttcagcgggt ccatgcctat cccttgaccg cgcccgaggt gaggcccttt 3140521 atcatctgcc gctgcaaggc gatgaagacc agcaagatcg gcagcatcgc caacagcgtc 3140581 accgccatca ccgggcccca gttcgtcaca ccctcggcct gctgcagaaa cgtcagacct 3140641 atcggcagtg gtgccaccga ttcgtcgtcg gacatcagga acggccacag gtattcgttc 3140701 cattcgttga ccacggtgat gacaccgacg gcgaccatgg tgggccgcga catcggcaac 3140761 accacccgca gcagcagttg ccaccaccgc gcgccgtcca tccgggccgc ctcgatgatc 3140821 tcggcgggca gcgacagaaa gtggttgcgc atcaagaagg ttccaaacgc cacccccgcc 3140881 agaggcagga tgatgccggc aaaggtgttg cgcaggccca ggtgtgagat cagcgcgtag 3140941 ttggaaatca cggtgatctg gttgggcacc atcaacgcgg cgatgatcac caaaaacacc 3141001 gccgtgcggc ccgggaaccg gacaaacacc aagccaaagg cgctgagcac accgagcgtg 3141061 aacttcacca ccgccagcac cgacgtgatg atcagcgagt tgcgcagaaa cgtccagaac 3141121 ggaatctgct cggtggccgt gcggtagttc tgcgggtacc agcgcagcgg ccaccaactg 3141181 gtgggctgcg catagatgtc gggctgatcc ttgaacgagg tgaagaacac gaacagcaac 3141241 ggcccggcaa tcagcgtgac caccagcaac atggccgcgt agccaacgct gctacggagc 3141301 cgatccggcg tcactgccgc tgcccccgat ccatcacccg cacctggtag tacgtcacgg 3141361 ccagcagcac caggaacatg atcgtggcca ccgtggcgcc ataaccggcc cggaaattgc 3141421 ggaacgtctc cacatacacc tggtacacca tggtggtggt gccggtgccc tccggcccgc 3141481 cccgggtcat cacgttgatc acatcgaaca cctgcagcga gttgatcagc acggtgatcg 3141541 acaagaaaaa cgtggtcggc cgcagctgcg gcaacagcac tcgacggaac acggcccacc 3141601 ggctggcgcc gtcgatttcg gccgcctcca acagatctcg gcgtaccccc tgcaacgcgg 3141661 ccagatagat cacgaaggta tagccgaggt tcttccagac gtaggtgatg gtcaccatga 3141721 acaacgccca gcgcgcatcc tggtaaaagt cgggcacccc gaccccgatc cggcgcaaca 3141781 ggtcttgaat cagaccgaaa tgcgggtcga agacgaactg ggcggccagg ccgacagcgg 3141841 caccggagat cacgaacggc gcgaaaacag tggagcgcac caggtttcgt ccacgcaacg 3141901 gtcgatcgag cagcatcgcc agcgccaacc ccagcaccat cgagccgacc accgcggcac 3141961 cggtgaaaac cgccgtgttg aacacgatct ggcgggtgtc cgaccgggtg aaccactcgg 3142021 tgtagttgga taaccccaca aatcgggccg acggatcgga gacgttccag tcgaagaacg 3142081 acagccggat gttgtcggcc aacgggcgat agacgaacag cagcaatagc gccacattgg 3142141 ggccgaccaa cacgacgaac agcgcataat cgcgcacgcg ctctttcgat gaccgaagcc 3142201 gtgctcgttg cggcgccgcc atcggcgcag tgtagctccg tattctgtcg gcgagtttgc 3142261 cgccacggtc gatgaaccaa tcaccgccgt cacggcaatg tcggccagct acgccgcgcc 3142321 cgtcaccgcc caccgaccgc tgtacgcccg ccatccgacg aatatcagcc gcagcacgat 3142381 aaacgtgccc agtcccgacc agatacccgc cagcccccag ccatacgcca gcgacaacca 3142441 gacaagcggc aaaaagccca ccaacgcact cgccaccgtc gccgtccgca tgaacgcggc 3142501 gtcgcccgcg cccagcagca ccccgtcaac tgcgaaaaca attcccgcaa aaggcaattg 3142561 gactaccatg aaccaccacg gcaccccgat cgcggcgagt accgatcgat cgtcggtgaa 3142621 tagcccgggc agcaccgagg agcctagccc taacgccgct gccaaaattc ccgccgccaa 3142681 cagcgaaaac gccgtcaccc gccatgccac cgccttagcg tgcccggcat caccggcacc 3142741 caacgcggca ccgaccagcg actgcgccgc aatcgctagc gaatcaagaa ccagcgcaag 3142801 aagaccccac aactgcaaca cgacctggtg ggccgcgagc gcggcagcgc cgaacctcgc 3142861 ggccaccgcc gcagccgaga cataacaaac ttggaaggcc agggtccgca cgatcaggtc 3142921 ccgcgccatc atcagctggg cgcccagcac ggcgcggtcc ggccgcagcg acacccgctc 3142981 ggccagtaac gcaccggcaa acagcagcgc cgccagccac tgccccacca gattggccac 3143041 cgccgagccg gttaaccccc agcggggcaa ccccagccaa ccgtaaacca gcagcgggca 3143101 cagcagagcc gacgacccga agccggcgac cacataccgc agcggtcgca cggtgtcctg 3143161 cacgccgcgc agccagccgt tgccggcgag cgagaccagg atcgccggcg tgcccaggat 3143221 cgcgatccgc agccacggca aggccgccgc ggtgatgcca tcgccagaag cgatcgccga 3143281 caccagcggc gtcgcggtgg cttccaccac gacgacgacc aacgcgccca gacccaacgc 3143341 caaccaggtc gcctgtacac cttcggtgac cgcggccacc cggttgccgg caccgtaacg 3143401 acgcgccgcg cgcgctgtgg tgccgtagga caaaaacgtc gcctgggaac caaccaggcc 3143461 gagcaccaga ctgccgatag ccagacccgc cagcgatatc gcccccagcc ggcccaccac 3143521 ggcgatgtcg aacagcaggt acagcggctc ggcggccagc acgcccagcg cgggcaacgc 3143581 cagctgcgcg atctgacggc cgcccgcgcg gtgccccacc tggctcaacg gcggctaacc 3143641 aagcgccgcg cgcaacgacg ccacagcgtc gtcgatcgag ccggtggtcg tataccccgc 3143701 ggccagccgg tgaccaccgc caccgaaccc agaggcaacc gcggccaaat tcacggtctt 3143761 agcccgcatc gacaccgacc accgatgcgg ttcgacctcc ttgaacaccg ccgcgacctc 3143821 ggcttgttgc gtggtgcgga cgatgtcgac gatgctttcc acttcctccg agcgcgcagc 3143881 gacccactcc cggttgtcga cgacgacgta aaccagcccg cggccaccga ccgcctcgga 3143941 caccagctgc gccgaaccca acacccgcga tagcaacggc aaccaggtga agggatggct 3144001 gtccatcaag gtcctgctga cggtggcgtt gtccacaccg atctctacca gccgcgccgc 3144061 cagccgatac ccccgcacac tggcccagcg aaacgacccc gtgtcggtcg ccaacccggc 3144121 gtagatgcag tgcgcgacgc gcgggtctat cggtttcccc cacgcgtcga ggatctcggc 3144181 aaccatcgtc gtggtggaat ccgccgacgg gtcaatgaaa ttcgcggtgc cgaacaggtc 3144241 gttggaggcg tgatggtcga ttaccaggag ctcccgcccg gaatcagtta gatcgcccag 3144301 agcaccgagc cgatcaacac tcggaatgtc aacagtcaca accaaatcga catcgcggcg 3144361 catcacctca gggcggacca gcagatggca gcccggcagc gaacgcagcg actcgggcag 3144421 tgtcgccggc gcggcaaagc tgacctctac ccgcttgccg cacccgtcca acaccaatgc 3144481 caatgccaat ccggcgccga tggtgtcggc atcggggtgg acgtggcaga ctaccccgac 3144541 cctggcagcg gccgacaaca gcgcagcggc accgacggcg tccacgcggg cccccgcgcg 3144601 acgccgcccg tcgaccagct cactccttgg gtcgatcgtc gtcaccggtg tctcccccgc 3144661 aagtgagcgg tgcctccaca gcctcgggtc cgtcgctcgt tctgataccc agtcccccgg 3144721 gagccggtga ttgcgccacc gacccgttat cacggtacgg gtcggcctcc cccgccggtt 3144781 tggcgcccac ccggacccgc gccagatcgg catccgcggc gcgagcgcgg gccagcaact 3144841 cgtccatccg gtgcacactg tccgagatcg tgtcgagcgt gaacgtcaag gtgggagtga 3144901 accgaacgcc ggtgcccgcc ccgaccttgg tgcgcagcac ccctttggcc cgttccagcg 3144961 cggcggccgc gccggcgcag ttcggctcgt cgtgtagcgt gcgtcccatc accgtgtagt 3145021 acaccgtggc atcgtgcaag tcggcggtca ccttcgcatc ggtgatggtc accccggcca 3145081 atccaggatc cttgatctcg tactcgatcg ccgaggcgac gatcgcggcg atccgtttgg 3145141 ccagccgccg cgccctagca gcatcagcca tcaggcgcgt tccttctgga ccagctcgta 3145201 ggactcgatg acgtcgccct ccttgatgtc ggcgtaaccc agtgtcaggc cacactcgaa 3145261 gccgtcgcgc acctcggtca cgtcgtcctt ctcccggcgc agcgaagcga tcgaaaggtt 3145321 ctcggcgacc acgatgttgt cccgcaacag ccgcgccttg gcgttgcgcc gcatcacacc 3145381 cgaggtgacc aggcagccgg cgatgaggcc gaccttcgaa gaccggaaca acgcccggat 3145441 ctcagcccga cccagctggt tttcctcgta gatcggcttg agcaggccac gcagcgcctg 3145501 ctcgatctcg tcgatcgcct ggtagatgac cgagtagtag cggatctcca cgccttcgcg 3145561 gctggccagc tcggtcgcct tgccttcggc gcgcacattg aaaccgatga tcaccgcatc 3145621 ggaagccgac gccaggttga cgttggtttc ggtaatgccg ccgacaccgc ggtcgatcac 3145681 ccgcagcacc acctcgtcgt ccacctggat acccatcagg gcctcttcca gcgcctcgac 3145741 ggtaccggcg ttgtcgccct tgaggatcag gttcagctgg ctggtttcct tcagcgccga 3145801 gtccaggtcc tccaggctga tccgcttgcg tgagcgcgcc gccagggcgt tgcgcttgcg 3145861 agcgctacgc cggtcggcga tttggcgggc gatacggtcc tcgtcgacga cgaggaagtt 3145921 gtcgccggcg ccgggcaccg acgtgaagcc aatgacctgc acaggccgcg acggcagcgc 3145981 aacctcgacg tcttcgccgt gttcgtcgac catgcggcga acacggccat aggcgtcgcc 3146041 ggcgaccacc gagtcaccga cccgcagggt gccgcgctgc accagcacgg tagccactgg 3146101 gccgcgacca cggtccaagt gcgcctcgat cgccacaccc tgggcttcca tgtcggggtt 3146161 tgcccgcagg tccagcgcgg cgtcggcggt cagcaacacg gcctcctcca gcgcctcgat 3146221 attggtgccc tgcttggccg agatgtcgac gaacatcgtg tcaccgccga attcctctgg 3146281 cactaaacca tattcggtaa gctgcccgcg aatcttggcc gggtcggcac cctccttgtc 3146341 gatcttgttg accgccacca cgatcggcac gtcggcggcc tgcgcgtggt tgatggcctc 3146401 gaccgtctgc ggcatcactc catcgtcagc ggcgaccacc aaaatggcga tatcggtcgc 3146461 cttggcgcca cgggcacgca tggcggtgaa cgcctcgtgg cccggggtgt cgataaaggt 3146521 gatcagccgc tggctgccgt ccagatcgac ggccacctgg taggcaccga tgtgctgggt 3146581 gatgccgccg gcctcggcct cgcggacgtt ggccttgcgg atggtgtcca acagccgggt 3146641 cttgccgtgg tcgacgtgac ccatcaccgt caccaccggc gggcgaacct gaaggtcctc 3146701 ctcgccgccc tcgtcctcac cgtagctgag gtcgaaggat tccagcagct cgcggtcttc 3146761 gtcctccggg ctgacgacct gaacgttgta gttcatctcg ctgcccagca actccagcgt 3146821 ctcgtcgccg accgactggg tggccgtcac catctcgccg aggttgaaca gcgcctgcac 3146881 cagcgccgcg gggttggcgt cgattttgtc cgcgaagtcg ctgagcgacg cgccgcgtgc 3146941 gagccggatc gtttcgccgt tgccgtgcgg caaccgcacc ccgccgacga ccggagcctg 3147001 catcgagtcg tactcctggc gcttctgccg cttggacttg cggccgcgcc ggggcgcacc 3147061 acccgggcgg ccgaacgcgc cggcggcacc gccacgctgc ccgggacggc caccgccgcc 3147121 gcccccggga cggccgcgga agcccgttcc gggagcggca cccacgccgc caccccggta 3147181 gttgccgccg cccgcgtcgg aacggccagc gccgggcgca ccgggccggc ccccgggtcg 3147241 tggcgcacca ggacgtggtg gacgggcacc cccgacagct ccaccggggc gtggcggcat 3147301 gctgccgggc gaggcgcccg gacgtggaac cccgggccgg gcggtaccgg ggcggggagc 3147361 cggcggacgc gggatgggcc ggtcggcggg ttgcgccgac gagaacgggt tgttgccgac 3147421 gcgcggggtg cgaatccccg gcttcggcac cgggcccggc cgcgccccgg gagccatgcc 3147481 ggggtggggt gcctgagggc tgggcggcac tgcggtcgga ggctcgggtg cggcgggggt 3147541 agttggggag acgattgccg ccccgccgga atcggcggcc ttggcgggcg cggcagtcgc 3147601 cttgccgttg cctgcggcca tgtcgatcgc ggcgtccagc gccttgtcaa gggacttgtc 3147661 ggggcctttg ccgggggact tggcggtgcc tttcgccggg gcaggtttgc tgccaccgaa 3147721 cgattcacgc agccgacggg caaccggtgc ttccaccgtc gacgatgctg atttgacgaa 3147781 ttcgccctgc tcgctcagcc gggcgagaac ttccttgctg gttacaccga gttccttagc 3147841 caactcgtgt acgcgggcct tacctgctgc cactacatct cctgtccatg aggcgacagt 3147901 cgtgggccgc gcctcgggtt tagctatgac gcattgtcat cgggacttca cggtgtgctc 3147961 atgttctatt gctacctgtt ctgttgcccg gtggttcgag ctcgcctaga gactccaggt 3148021 actcgaccac tgcggatgtg tccggcgaac cggcgatgcg cagcgctctt gcgaaagccc 3148081 gccgccgaat cgcttgttgc gcgcactgcc gtagcggatg cagccacgca ccccgccccg 3148141 gcaggctggt cgctgtatca acgatcacgg cgtagttgcc gttcccggtc gacacagcca 3148201 ccactcgaag cagttcgacg gccaaccctc gctttcggca cccgacacac gtccgcaccg 3148261 gtccgcgggg attatccggg cgtcgatgcg ccgaggccga aggctcgcgc tggatcacgg 3148321 ctaagtgtag cgtcaccggg caagcccgat tgcccggcta tctgccgtgg gataacgcac 3148381 ggcgctagcg gtcgtgcgcc ataccgcggc tgactccggg ttcgggctga ccgggcgggg 3148441 gcggcggcgc atcgccgcga atatcgatac gccacccggt gagccgggca gccagccggg 3148501 cgttctgccc ttcctttccg attgccagcg acaattggaa atcgggcacc accacgcggg 3148561 cggcccgggc ggtctggtcg atcaccgaca ccgacaccac cttggccggc gacaacgcgt 3148621 tggcgacaaa acgcgccgga tcgtcgtcat agtcgatgat gtcgatcttc tccccggaca 3148681 gctcgctcat cacgttgcgg acccgttgcc ccatcggacc gatgcaagca cccttggcgt 3148741 tcaagccggc aacgttggac cgcacagcga tcttggagcg gtggccggcc tcccgggcca 3148801 ccgcgacgat ctccaccgat ccgtcggcga tctcggggac ttccagcgag aacagcttgc 3148861 gcaccagatt ggggtgcgtg cgcgacagcg taatcagcgg ctcgcgggca cctcgggtta 3148921 caccaactac gtagcagcgc agccggttgc catgttcata gctctccccc ggtacctgct 3148981 cagcggccgg gatcacaccc tcggaagcct tggtctcggt gccaatccgg acgacgacca 3149041 gaccgcgggc gttggcccgg ctatcgcgct ggatcactcc cgcaacgatc tcgccctcgc 3149101 gggtggagaa ctcgccgtag gtgcgctcgt tctcggcgtc gcggaatcgc tgcaacatca 3149161 cttggcgtgc cgtcgtggcg gcgatccggc cgaagccctc tggagtgtcg tcccactcgc 3149221 tgatgagatt gccagcctca tcggtctcac gggcgatcac ccgaacgaca ccggttttcc 3149281 ggtcgatctc gatgcgcgca tcggtctggt gaccttgggt gtgccggtag gcagtcaaca 3149341 gcgcggactt gatcgtttcg agcagttcat tgaccgagat accccggtcc acctcgatgg 3149401 catgcagagc agccatgtcg atgttcatgc tccggcctcc gtcccgcggg ccagccccat 3149461 ctcggaagac tgggccagtt ccaactccgc cggagccggt ggcgaaaact caacctggac 3149521 aacagctttc acaatctcag caagcgggat ctcacggact gcccagcccc ggtcttcccg 3149581 gatcaccaac gccaccgtgc cagcacgcat ctcgccgacc cggccggtca gtcgcgatcc 3149641 gtctgacaac accagctcaa ccttgcggcc tcgagcacgg cggaagtgct tttcgctggt 3149701 cagcgggcgt tccacaccgg gagagctgac ctcgagcagg tagcggcccc ggatcttgtt 3149761 cgcaccgtcc aggccgtcca gcaaagccga tgccctgcgc gacaatgcgg ctatcgtatc 3149821 caggtcgaga ggggcgtcac cgtcggcgat caccgctatc cgcggcgggc gggcccgcgc 3149881 atcgatgacc acgtcttcga tctcgtagcc ggcgcacgcg aaatctgcac cgagtagctc 3149941 gatcacctgc ctctgcgaag gtagcccggt ggtcacggcg agctcctcat cttgagttgt 3150001 ccggtcatct agcggaggcg ccgccagggc ggctcccagt gtcccgccgg cacgcagcag 3150061 ccggcgtagc taccaacgat acgccaggaa tcacgaatga cgccgtgatc acgccgttca 3150121 acgtctcgcc tgccttcgta gcggcgtctt tctgatggca ggatgttgct gtgcttagag 3150181 cagcaccagt catcaaccgg ctcacgaatc gacccatcag caggcggggt gtgctggccg 3150241 gtggcgccgc gctggccgca ctgggagtgg tgtccgcctg cggcgagtcc gcgcccaagg 3150301 cacccgcggt cgaagagctg cgctcgccgt tggaccaggc ccgacacgac ggtgcgctcg 3150361 cagctgccgc cgccacagcc atcgggatcc cgccgcaggt tgccgccgcg ctgaccgtcg 3150421 tcgccactca gcgaacctcg catgctcgag cgctggccac cgagatcgcc cgggccgcgg 3150481 gcaagctggt atccgctacg agcgaaacca gcagctccag tcccagccca accgatccgg 3150541 cggcaccgcc accagcggtg tccgacgtga tcgattcgct gcgcacgtca gcgggggaag 3150601 ccagtcgact agtggcgacg acatcgggct accgagcagg gttgctcgcc tccattgccg 3150661 cgtcctgcac cgcctcctat acggttgcgc tcgtgccttc aggcccgtcg atatgacctc 3150721 gtccgaaccc gcccacggtg ccacaccgaa gaggtccccc tccgagggga gcgccgacaa 3150781 cgcggcgctg tgcgatgcgc ttgccgtcga acacgccacc atttacggct acggcatcgt 3150841 ctccgcgctc tcgccccctg gtgtcaactt cttggtggcg gacgcgttga agcagcaccg 3150901 ccaccgccga gacgacgtga tcgtgatgct gtccgcgcgc ggagtcaccg ccccgatcgc 3150961 tgccgccggt taccagctgc ccatgcaggt cagcagcgcg gccgacgcgg cacgactagc 3151021 agtgcggatg gagaacgacg gggcaacggc ctggcgggcg gttgtcgagc atgccgagac 3151081 ggccgatgac cgggtgttcg cttcgacggc tctgaccgag agcgcggtga tggccacccg 3151141 ctggaacagg gtgctgggcg cctggcccat caccgcggcc tttccgggcg gggacgaata 3151201 gctacccggt gacggccgct gcgatatcgg tggccagcga ggcgccggca accagctcgc 3151261 gagtctgacc gctgaaccgg tcgcgcagct cgaccacgcc gtccgcccag ccgcgcccca 3151321 cgacaacgat ccagggcata cccaacagct cggcatcttt gaacttgacg ccgggcgatg 3151381 cctggcggtc gtccagcaac acctcaaccc ccagccgatc cagatcggcg gccagcgcgg 3151441 tcgccccggc gcgagcctgc gcgtccttgt tcgcgatcac caggtgaaca tcgaacggcg 3151501 cgaccgtcga cggccagcga aggcccagct cgtcgtggtg ctgctcggca acgacggcaa 3151561 ccaaccgaga cacaccgatg ccgtaggaac ccatggtcaa ccgcacaggc ttgccatcct 3151621 cgccgagcac gtcggcggtg aaggcgtcgg tgtatttgct ccccagctgg aagatgtgcc 3151681 caatttcgat accgcgcgcc atgaccagcg gaccggcgcc gtcgggagat ggatcgcctt 3151741 cgcgcacctc ggcggcctca atggtgccgt ctgcggtgaa gtcgcggccg gccaccaaac 3151801 cgacaacatg gcggccgggt tggtccgccc cggtgatcca gctggtgccg tcgactatcc 3151861 gcgggtcgac gagatagcgg acattgttct cccgcaacgc ctttggcccg atataaccct 3151921 taaccaggaa cgggtgcttg gcgaaatcat cgtcgtcgag caacgcgtag tcagccggtt 3151981 ccagcgctgc gcccaacctt ttgtcatcga cctcacggtc gccgggcacg ccgattgcca 3152041 gcagttcggt gtcccctccc ggctgtcgga ctttgattaa gacgttcttc agggtgtccg 3152101 cggcggtcac cgtgcggccg agatcggcct cgttggccca ggccaccagg ctggcgatgg 3152161 ttggggtgtc gccggtgtcg tggaccaccg cctcgggcag cccatcgatg ggcagggtgt 3152221 ccgggcgggc ggtgacaacc gcctcgacgt tggccgcata acccgactcg aggcaccgga 3152281 caaatgcgtc ctccccggac ggactctcag ccaagaactc ttcggacgca ctgccgccca 3152341 tcgccccgga cactgccgaa acgatgacat agcgcacctg aagtcggtca aatatgcgct 3152401 ggtaggcctc ccggtgagcg tggtaggccg ccttcagccc ggcggcgtcg atgtcaaagg 3152461 agtaggagtc cttcatgacg aactcccgag cgcgcaggat gccggcccgc ggccgcgcct 3152521 cgtcgcggta cttggtctgg atttggtaca gcgtgagcgg gaagtccttg taggagctgt 3152581 actcgccctt cacggtcagg gtgaacagct cttcgtgggt ggggcccagc aggtagtcgt 3152641 tgccgcggcg gtccttgagc cgaaacacgc tgtcgccgta ttgggtccac cggttggtcg 3152701 tctcgtacgg tgcccgcggc agcagggcag gaaataggat ctcctgtcca ccgatggcgt 3152761 tcatctcgtc gcggatgacc cgttctatgt tgcgcagcac tcgcaggccg agcggtaacc 3152821 agctgtacag cccgggcgcg acgggccgga tgtagccggc ccggatcagc agtttgtggc 3152881 tggccacttc ggcgtcggcg ggatcgtcgc gcagggtgcg caagaacaac tcggacatcc 3152941 gggtgatcac aggcggcaag cctaattcgc cgagcagacg caaaagcgcc caggtctgcc 3153001 cgaaaagggg agcttttatg actgctcggc gggaagggtt acagctcgcc ggcgtcgatc 3153061 gcttccttga cctcctgcgc atgggcaacc tgctgcggcg tatacccgat aaacagcgcc 3153121 ataccgccga cgatgatggc cgctccggcc acccacagca ggccgtaggt gtaggcgtgg 3153181 tcaagcgcgg ccaactgcac gtcgttcatg aacttcaccg gaccggtggt accgcccagg 3153241 tacagcgtgc gcgacgtgat cacagcctgg atgacggcga gcaccagcgg accgcccagg 3153301 ctctgcagca tcagcgcaat tgccgatacc ggaccgatct ggtcgaagcc gacgccagcg 3153361 atcgccgaca gagtcagcgg gacgacggcc atgccgatgc caatcccgcc gacgacgatc 3153421 ggcatgacca ggttggggaa gtagggcaca ccacggtgca tgaaaaatga gccgtacagc 3153481 atggcgccga atagcagata tccgccgccg atggtcaaca cccgtggcga aaaccgggac 3153541 accagctgcg aggacacacc taggccgatt cccatcgcga tgacgaacgg gatgaaacct 3153601 acgcccgcgc gtagcgcgct gtagcccaag atgtcctgca cgtacaggcc gatgcagacg 3153661 gtcaggctga acatgacgcc gccggccaac aggatcgcgc tgaacgtgac caaccggttg 3153721 cggtcgcgga acaagtggaa cggcacgacg gggttctcgg cagtgcgctc cacgatgaca 3153781 aacgcgacag cggccgccaa ggccaccagg cccgaaccga tggtaatgcc tgacatccag 3153841 cccttttcag gaccgatcga gaaggcgaaa accgccgcgg tgcatgccag cgtggccagt 3153901 atggccccgg tggcgtcgag cttcatccgt tctttgttgg tttcccgtag ggcggtgcgg 3153961 gccaggtaga tcatcaccag cccgatcggc acgttcacca ggaacgccca ccgccatgac 3154021 acctcggtca gtgctccgcc gaccaccagc cccatcaccg acccgatcgc ggtcatcgcg 3154081 gcgaacaccg ccgtcgcggc gttgcgggca ggtcccttgg ggaacgtggt cgccaccagc 3154141 gccagaccgg tcggagatgc gatggccgac cccacaccct gggacaaccg ggcgatcacc 3154201 aacgtcgcct cgtcccaggc gaccgcgcac agcaccgacg agatggtgaa tagcgcaacg 3154261 ccaacaatga aggtgcgttt gcgcccgatg gtgtcgccaa gccggccgcc gagcagcatc 3154321 agcccgccga aggtcagcac gtaggcggtg atcacccagc tgcggccggc atcagacaag 3154381 ctcagctcgt tttgaatctt aggtagcgcg acgatggcga cggtgctgtc catggtcgcc 3154441 agcagctgca tcccgccgat agcaataacc gcagcgataa agctgcgcga gggcagccaa 3154501 gtcgggtagt acctgctggg gcgctctgaa gcggtctcct ccgagcgcgg cgggcgcatc 3154561 ggggccggac ggtgtgggcg tccggctgtc cagttacgga ccgcccgctc tgtgtcgttg 3154621 agagccgtca tagcgggtta ccttacagta ttcttaagaa ttgtttaaac cccgaacgcc 3154681 gctcaggccg actacagccc cgatcacgat gatcgcggga ggtcggatcc ccgccgcgcg 3154741 gaccttctcc ggcgtgtcgg caagggtggc ccgcaacgtc tgttgagcgg cggtcgttcc 3154801 gtgttgaacc accagtaccg gcgtatccgc agttcggcca ccctttagca gaacgtcaac 3154861 gaaaagctcg atgcgttcga ccgccatcag caaaacgatg gtgcccgtca atgcagccaa 3154921 tgcatcccaa ttcactaacg attcgggatg accgggcgca agatggccac tgaccaccac 3154981 gaattcgtgg gtcatggccc ggtgagtgac tggaacgccc gccatagcgg gcacggctat 3155041 ggcactcgtc acacctggca ccacggtgac cgggattccg gcgtgggcac atgccagcac 3155101 ttcttcatag ccccgggcga acacgaaggg gtcgccccct ttgagacgga ccacaaagtt 3155161 gccggatctg gcccgttcga tcaggacagc gttgatcgcg tcctgggcca tggcccggcc 3155221 gtaagggatc ttggccgcgt cgatgacttc tacgtgcggc ggcagctcgg ccagcagttc 3155281 gggcggggcg agccggtcgg cgaccacgac atcggcctgg gcaagcagcc ggcgaccgcg 3155341 aaccgtgatc agttcgggat cgccgggacc gccgccgacc aacgccactc cgccgctgag 3155401 gacgtcggaa ctctgcgcag tgatgacgcc ctgctgcaac gcctcccgga ttgccgagcg 3155461 gatcgccgcc gaacggcggt gctcaccacc ggcgagcacc cccaccgaca ggcccgcata 3155521 gctgaatgac gccggggtca ccgccgtccc ctccaccgcg atatcggccc ggacgcaaaa 3155581 gatccgtcgg cgctccgcct cggcgacgac agccacgttc acccgcgcgt catcggtggc 3155641 cgcgatcgca taccaggcgc cgtcaaggtc gccgtcgcgg tagtcacgca ccgacaaggt 3155701 gatctggtcc atcgcctcga cggcgggggt gacgctgggg gcgatcacgt gcacgtccgc 3155761 gccactggcg atcagcaggg gtaaccggcg ctgggcgacc gtgcccccgc caaccacgac 3155821 gaccttcttg ccagccagcc gtaacccgac cagatagggg ttctcggtca cccgccaagc 3155881 ctagtggcga tcgcaagcgc ggggaccggg cgccgcgggt cgccaccatc agggccagtg 3155941 gcgatcgcaa gcgcggggac cgggcgccgc gggtcgccac catcagggcc agtggcgatc 3156001 gcaagcgcgg ggaccgggcg ccgcgggtcg ccaccatcag ggccagtggc gatcgcaagc 3156061 gcggggaccg ggcgccgcgg gtcgccaccc ctttggccgc gaatgtaacg ccactgcgaa 3156121 tttccggccc ggcttttcgc agtgccgtta cgctcgtgga gtattgcagg ccgcatgtgc 3156181 gacgaaacgc gccaccgcac cgggtgttgc ggccggatgg gtatgcaggt aggacgcgtg 3156241 cacgccgctg tgcaccgcgc cgtctcgcac gtcgtccacg tcttggccct ggtacaccca 3156301 cgcgggctga tagctatcgg cgaatgtgac tgcggttcgg tggaattcat gtccaaccac 3156361 gcgctcgccg acggagtaca gcgccgaatc aacaaccgcg acggcgtcgc gataacccag 3156421 cttgagatgc tgggtgaacc gcgccgatcc ggccaccaca ccgcacatcg ggtgtccgtc 3156481 gagttcagaa accagataga gcaggccggc acattcggca tgcaccgggg cgccggcagc 3156541 ggccagttcg ttgatctgcc gccggacggt gtcgttggcg gacaactcgg cggtgaactg 3156601 ctcggggaat ccgccgggca acaccaccgc gtccgtaccc tcgggcagag tttcgctgag 3156661 cgggtcgaac tcgaccactt cagccccggc ggcgcgcaac atctcggcgt gttcggcgta 3156721 gccgaaggta aacgcccttc cggccgcgat ggcaaccgtg gctggctggc gggcggtgtt 3156781 gccgacggca atcaccgggt cccatggcgg gtgggccgcc tggctcccgg cgcaggcgat 3156841 caccgcggcc agatcgacgt ggcgagcgac cacagcagtc atcgcctgca cggcgagccg 3156901 tgcgcgacgg ccgtactcga cggcggtaac cagacccaga taccttgtcg gcagctctag 3156961 ttcagctgtg cgtggaatgg cgcccaagac cgcgacaccg gcctggtcac acgcctgtcg 3157021 cagcacctgt tcatgtcggg ccgatccgac ccggttgagg atgacaccgg cgatccgagt 3157081 tgcggtgtcg aacgtggaaa agccgtgcag cagtgcggca acgctgtgac tctggccgcg 3157141 ggcatcgacc accaggatca ccggggcgcc aagcagagca gcgacgtgcg cggtggaccc 3157201 cgctgcgggc gcgcccccgg caggcccaat gcgcccgtcg aacagcccca gcaccccttc 3157261 gatcacggcg atgtccgcgc ccgcaactcc atgcgcgtac agggggccga taagccgctc 3157321 ccccaccagt accgggtcga gattgcggcc gggccgtccc gcggccaggg cgtgatagcc 3157381 ggggtcgata aaatccgggc ctaccttaaa cggcgcgacg gtgtgaccgg cctgccgcag 3157441 cgctccgatc aagcccgtcg cgatcgtggt cttaccgctg cccgacgcag gcgcggcgac 3157501 ggccaccgcg gatacccgca tcaccactcg atgcccttct gccccttgcg gcccgcatcc 3157561 atcgggtgct tcaccttggt catctcggtc accagatcgg cggccgcaac caaccgctgg 3157621 ggtgcgtctc gcccggtgat caccacatgc tgatggccag gccgggctcg caggacatcg 3157681 acgacttcgt cgacgtcgag ccaaccccac ttcagtgggt aggtgaactc gtccagcaga 3157741 tagaagtcgt gacgttgcgt ggccagccgg agcgcgatct cggcccaacc gtccgccgcc 3157801 gcggccgcac gatcgacgtc ggtgccggcc ttgcgagacg tacgtgtcca ggaccagccc 3157861 gcacccatct tgtgccactc caccgctccg ccgatcccgt gctggtcgtg cagccggccc 3157921 agttgacgaa acgccgcctc ctcacccact ttccacttag cgctcttgac aaactgaaac 3157981 accgcgatgt ccagaccagc gttccacgcc cgcaacgcca ttccgaacgc cgcggtcgat 3158041 tttcctttgc cttcaccggt gtgtaccgcc agtatcggca tgttgcgccg ggcccgggtg 3158101 gtcaggccat cgttgggcac tgcgagcgga ttgccctgcg gcatgtgtgg ttacctatcc 3158161 atcgtcaagc cacgccacgc acggcatgca ctagataatc cgcgtgcaac tgctccaacc 3158221 gaaccaccgg cgcacccagc tgacgagcca gttgcgctgc caaacccagc cgtacatacg 3158281 acgtttcgca gtccaccacc accgcggccg cgccctcggc gaccagcccg gcagccgcgg 3158341 ttcggctgcg gcccaacggg tccggcccgg cggtggcccg gccgtcggtc agcacgacca 3158401 ccagggggcg tcgggcgcgg tcgcgtacct tctcccggat gatcagcgca cgcgcggcca 3158461 gcagtccctc agccagcggg gtcttgccgc cggtgctgaa tcgggccagt cgccggccgg 3158521 cgatgtgcgc cgacgacgtc ggcgacagca acagcgttgc ctcgtgctgg cggaaggtga 3158581 tcaccgccac cttgtccctg cgctggtagg cgtcgcgcag cagcgacagg gtggcgccac 3158641 tgaccgcagc catccggtcc cgagcagcca tcgatccgga agcgtcgacg acgaagatca 3158701 ccagattgcc ttcgcgaccc tcgcggatgg cccggcgcac atcgtccggc cacgggcgca 3158761 acggcccggc tccgaacgca cgctcgccgg cggccagcag ggtagcgaac aggtgcagtc 3158821 catgtgcgtc ggggtcgctg acctcggcgg ccgccaccac actgcccgag gcgttgcggg 3158881 cccgagaccg tcgccccggc gcgcccgtgc cgacccccgg gacccgcagc gcgcgggtcc 3158941 ggaatatctt tgacggcggc gcgctcgggc gcggcgacga tcgcaagcgc ggcgaagccg 3159001 ggcgcggcgg gtcgtcgccc atcgagctcg gcgcaccagg ttctgtcgac ttcgagcgtg 3159061 agttcggttg tgaggcaggt tcattggctg actggccgcc cccgggcgga tcgggctcgg 3159121 gctctgggtc gacgctcgcc agcgccagcg cctcatccag ctggtcgcgg tcgatgccgt 3159181 gatcgtcgaa cgggtcgcga cgacgacgat gcggcaacgc cagttctgct gccgcccgga 3159241 tatcctgctc ctcaacggtg cggacaccac gccaggcggc gtgcgcggcg gcggtccggg 3159301 ccactaccag atcggcccgc atgccgtcca cgtcgaacgc cgcgcacaac gcagcgatgc 3159361 gccgcaactc gttgtcgccc aacaccacat cgtctaccgt ggcccgggcc gcggcaatcc 3159421 ggtgggccag ctccgcgtcg gcgtcggcat agcgtgcgac gaacgcatcc gggtcggctt 3159481 cgtaggccat ccgccggcgg atgacctgta cccgcacgtc gatgtcacgt gacgcctgca 3159541 cgtcgacggt cagcccgaac cggtccagca gctgcggacg cagttcgccc tcctccggat 3159601 tcatcgtgcc gatcagcacg aaacgggcct cgtgggaatg ggagatgccg tcgcgttcga 3159661 cgtgtacgcg tcccatggcg gcggcgtcga gcaggatgtc aaccaggtga tcatgcagca 3159721 gattgacctc gtcgacgtag agcacgccgc cgtgggcgcg agccagcagt cccggagaga 3159781 acgcgtgctc gccgtcgcgc atcacccgct gcagatccag cgagccaacc acccggtctt 3159841 cggtggcccc cagcggcagc tccacgaggc cggtctcggt gctcccggtc gcgaccgaca 3159901 acaacgcggc cagcccgcgc accgccgtcg atttcgccgt gcccttctcg ccacggatga 3159961 gcgccccacc gatctccggt cgcacggcac acaacaacaa cgcgagccgc agccgatcgt 3160021 gcccgacgat cgcgctgaac ggataaggct tcacggccgc tccacctgac cggagccggg 3160081 ccgcaacatg ggcacatgcg ggatgccgtc gtccaggaac tcgtcaccgt cgcggacgaa 3160141 gccgtgctgg gcatacatgg ccgtcaggta ggcctgtgca tcaatccgac aggggtagtc 3160201 gcccacctcg gccagtgccg cgcacagcag ccggttggag tggccctgtc cgcgggcgtc 3160261 gcgtttagtg cacagccggc cgatccggaa gaccttctca cccccggcgt gctcttccat 3160321 caggcgtagc gtgcacgtca cctctccgtc gggcgtttcc aaccagaaat gcctggtctc 3160381 ggcaagcagg tcacgcccgt ctagctccgg gtatgggcag gcctgttcga caacgaacac 3160441 ctccaccctc aacttgagca gctcgtaaag ggcccgggcg tcaaggtctt tggcccagac 3160501 gcggcgcagt gcttcggtca taagcgccgc tctcccccgc aagcgggcgg tacccccact 3160561 gtatcgtcgc cggcgcgggt catgcggcac ctaacttcag cgccttggtg ctccatgacc 3160621 acacctcgtc gaacagcgcg ggttcattcg acagctgcac ccccagcgac ggcaccattt 3160681 ctttgagcgt gggcagccag gattgatagc ggttggcaaa gcatttctgc agcacgtcca 3160741 gcatgatcgc caccgcggtc gaagcccctg gggagccgcc cagtagtccg gcaatactac 3160801 cgtcagcatc gccgatgacc gtcgtgccga actcgagcac cccgccgttg cgttcatctc 3160861 gccggatcac ctgtacccgc tgaccggcta tcgtcaactc ccagtccgaa tcgattgcgc 3160921 taggggcgaa ttcgcgcagc gcactgaccc gctcgggttc agagagacgc agctggctga 3160981 tcaagtagtt cagcagtctc cgctcggtga ggcccacgcc gagcacggac aacagattgt 3161041 ccggcctgat cgaccggggc aggtcgctga tctgcccgtg tttcaagaac ttcggcgacc 3161101 agccggcgta tggcccgaac accagccacg acttgccgtt gacaaaccgc agatccagat 3161161 gcaaggcgcc caacggcggg gcgcccggcg ccgggaagcc atataccttt gcccgatgcg 3161221 aggcggtgag cgccgggttc ccggcgcgca ggaaccgacc gccaatcggg aagccggcga 3161281 agcctttgac ctctttgatc ccggatttct gcagcaccgg caaggtgtca cccccggccc 3161341 cgacaaagac gaacttggtg ttcaacttgc gcttttcgcc ggtccggcgg ttgcacatgg 3161401 tgaccgtcca gctgccgtcg gattgccgcg agaggttgcg aacctcgtgc ccgaacaacg 3161461 cggtagtgcc attttgcacg caatagccga tgagttgttt ggcgagggca ccgaagtcga 3161521 cgtcggtgcc gtcggcggcc cagttgagcg ccaccggctc ggagaaggcc cgtttagcgg 3161581 ccatgaacgg cagccggcgg gcgaattcgt cgggactctc gatgaactcg gtgccggcga 3161641 acagcgggtt gccggccaac gccttttggc ggcgccgtag atactcgacg ccccgcgatc 3161701 catggacgaa actcacgtgc ggcacagggt tgaggaagct gcgcacgtcg gtgaggatgc 3161761 cgttttcggc cgcgtatgcc cagaactggc gggtgacctg gaattgctcg ttgacacgca 3161821 ccgctttggt gatgtcgatc gagccgtccg gcatttctgg ggtgtagttc atctcgcaca 3161881 gcgcggagtg cccggtgccg gcgttgttcc agggaccgct gctttcggcg gctaccgcgt 3161941 ccagccgttc gatcagggtg attgaccagt tcggttcgag ccgacgcagc agcaccccca 3162001 gcgtggcgct catgatgccc gcaccgatca gcacgacgtc ggttctggct aggtctgaca 3162061 ccggacggtt ggttccttcc ttggctgcgc cgctcccagg ttatcccgac gggtgttaac 3162121 acgatgacgt ccgcctcctg ggccagtaac cctgtgcagc gcggggcagc caacccaaga 3162181 caattacccc gaagcccaca atgtgcgtcc ctggccgcca tagaatccgc actatccgcc 3162241 cagtccggtt cttcttggga ggtaacgatg ttgtatgtag ttgcgtcacc cgacttgatg 3162301 accgcggcgg ctaccaatct ggcggagatt ggttcggcga tcagcacggc aaatggtgcg 3162361 gcggcactcc cgactgttga ggtggtggcc gcggccgccg acgaggtgtc cacgcagatc 3162421 gcggctctat tcggagcgca tgccaggagc taccaaaccc tcagcaccca ggcagcggcg 3162481 tttcatagtc ggtttgtgca ggcgttgacc acggccgcgg cttcctacgc cagcgtagag 3162541 gccgccaacg cgtcgccact tcaggttgcg ctagacgtga ttaatgcgcc cgcccagaca 3162601 ctgctcggac gtccgctaat tggtaacggc gccgacggat cgacaccggg gcaggccggc 3162661 gggcccggcg ggttgctgta cggcaacggc ggtaatggcg ccgccggtgg gcccaaccag 3162721 gccggcggcg ccggcggcaa cgccggcttg atcggcaacg gcggggcggg cggcgccggg 3162781 ggtgttggcg cggtcggcgg taaacgcggc acgggcggcc tgctattcgg caacggcggg 3162841 gccggcgggc aaggcgggct cggcctcgca ggtatcaacg gcggcagcgg cgggcaggga 3162901 ggccacggtg gcaacgccat cctgttcggc cagggcggtg ccggcgggcc aggtggcacc 3162961 ggcgccatgg gcgtcgccgg caccaatccc acccccatcg gcaccgcagc gcctggcagc 3163021 gacggcgtaa atcagattgg gaacggtggt aacacggacc tcaccggcgg cgccggtggc 3163081 gacggcaatg ccggcagcac caccgtgaac ggcggcaacg gcggtaccgg cggcgcagct 3163141 aggaactcat ctggtggtac cggtaactcc tttggtggtg ccggcggcgc cggaggcgac 3163201 ggcgccaacg gcggcgacgg tggcgctggc ggggaagccc tcaccgaagg cggtgccacc 3163261 gccgttagtg gtgctggtgg taagggaggt aacgccgagg cttccggcgg cgccggcggc 3163321 aacggcggca aaggtggctt tgctcaggcc accaccagcg tgaccggggg taacggcggt 3163381 aacggtggca atggccacga cagtaacgcg ccgggcggcg ctggcggcag cggtggcgtc 3163441 ggcggtgacg gcggccgtgg cggcctgctg gccggcaacg gcggcaccgg cggtgccggt 3163501 ggcaacggcg gtaccggtgg cgccggtgcc cccggcggtg ccggcggcgc cggcggcaaa 3163561 gccgacatcg ccaacagcct cggcgacaat gccaccgtaa ccgggggcaa tggcgggaca 3163621 ggcggagacg gcggcagcgc gctgggcacc gggggggctg ggggtgccgg aggtctaggt 3163681 ggtcacgggg gtgcaggcgg gctgctgatt ggcaacggcg gcgccggtgg cgctggcggc 3163741 ctcggcggtg cgggcggcgc cggcggtgcg ggcggtgagg gcggtgccgg cggcgccgga 3163801 ggcgaagcta ttcccggcgg ggcgtccacc aactccgccg gcggtgacgg aggggcgggc 3163861 ggtactggcg gcaatggcgg tgacggcggt gccggcggag cccccggcct cggtggcgcg 3163921 ggcggggccg gcggatggtt gatcggccag tcgggcagca ccggcggcgg tggcgccggc 3163981 ggtgccggtg gtgccggagg tgccggtggc gcgggcggca gcggcggtgc gggtggccat 3164041 ggcgacacta cctccggcaa gaacggttcg tctggcaccg cgggcttcga cggcaacccc 3164101 gggcagcccg gctgagcggc acaagatctg aacgcgctct aagctgaccc cgtgactggc 3164161 tgggtgcccg atgtgctgcc cggctattgg cagtgcacaa ttccgctcgg gccggatccc 3164221 gacgacgagg gcgacattgt cgcaaccctg gtcggccgcg gtccgcaaac agggaaagcc 3164281 cgcggagaca ccactggggc acaccacacg gtcctggcgg tgcacggcta caccgactac 3164341 ttcttccata ccgagctggc cgatcacttc gccaaccgtg gcttcgcgtt ctatgcactt 3164401 gacctgcgca aatgcggccg atcgcgagcg cccggccaga cgccgcactt catcaccgac 3164461 ctggcccgct atgacaccga actcgagcac tccctgtcca tcatcaacga gcagaaccgc 3164521 tcggcgaagg tcctggtata cggccactcc gccggcgggc tcatcgtgtc gctgtggctg 3164581 gaccggttgc gccagcgcgg cgagatcacc cgcgcggggg tcaccggcct ggtgctcaat 3164641 agcccgttcc tggatctgca aggcccggca atcctgcgcc tgccgctgac ctcggcgttc 3164701 ttcgccgcga tggcgcgaat gcgccccaag tgggtagccc ggccaccaaa agaaggcggt 3164761 tacggttgca cgctgcaccg ggactatgac ggagagttcg actacaacct gcaatggaaa 3164821 ccggtgggcg gtttcccggt caccttcggc tggattcatg ccagccgtcg tggccacgca 3164881 cggttacatc gcgggatcga cgtcggtgtg cccaacctga tcctgtgttc ggatcacacg 3164941 gtacgggaaa aggccgaccc ggcgaccctg caccgcggcg atgcggttct cgacgtcacc 3165001 catatcaccc gctgggccgg ctgcatcggc aaccgcagca ccgtcatcgc ggtggcggac 3165061 gccaaacacg atgtgttctt gtcgctgccg caaccgcgcc agatggctta tcgccgactg 3165121 gatctctggt tggacgacta cctcggcaca cacaacgaca ccgacgcttc ggcatcgtcg 3165181 gggaaagggt gatggcccct acaaatggaa acgtacgaca tcgcgatcat cggaaccggt 3165241 tcgggcaaca gcattctcga cgaacgctat gccagcaagc gggcggcgat ctgcgagcag 3165301 ggcaccttcg gcggcacctg cctcaatgtc gggtgcatcc ccacaaaaat gttcgtctac 3165361 gccgccgagg tggccaagac catccgaggc gcgtcgcgtt acggtatcga cgcgcacatc 3165421 gaccgggtgc gatgggacga cgtcgtctcg cgcgtcttcg ggcgcatcga tccgatcgcg 3165481 ctgagcggcg aggactatcg aaggtgtgcg cccaacatcg acgtgtaccg cacacacacc 3165541 cgtttcgggc cggttcaggc cgatggccgc tacctgttgc gcactgacgc gggtgaagag 3165601 ttcaccgccg agcaggtggt gatagccgcc ggatcgcggc cggtgattcc gccggccatc 3165661 ctcgcgtccg gcgtcgacta tcacaccagc gataccgtca tgcggatcgc cgagttgccg 3165721 gagcacatcg tgatcgtcgg aagcggcttc attgcagcgg aattcgcaca tgtgttttcc 3165781 gctctgggcg tacgggtcac cctggtgatc cggggcagct gcttactacg gcattgtgac 3165841 gacaccatct gcgaacggtt cacccgcatc gcatcgacca aatgggagct gcgcacccat 3165901 cgcaacgttg tggacggcca gcagcgcggc tcgggcgtcg cgctgcggct agacgatggt 3165961 tgcaccatca acgccgacct actgttggta gcgacaggcc gggtgtccaa cgccgacctg 3166021 ctggatgccg agcaggccgg tgtcgatgtc gaggacggcc gggtgatagt cgacgagtac 3166081 caacggactt cggcgcgtgg ggtttttgcg ctgggcgatg tctcgtcgcc gtacttgctc 3166141 aagcatgtcg ccaaccacga ggcccgcgtc gtgcagcaca atctgctctg cgactgggag 3166201 gacacccagt cgatgatcgt caccgaccac cgatacgtac cggctgcggt attcaccgat 3166261 cctcagatcg ctgccgtcgg actcactgaa aaccaagctg tggcaaaggg actcgatatt 3166321 tcggtcaaga tacaggacta tggtgacgtc gcgtacggct gggcgatgga ggacaccagt 3166381 ggaatcgtca agctcatcac cgagcgcggc tctgggcgct tactgggcgc acacatcatg 3166441 ggttaccagg catcctcgct catccaaccg ttgatccagg cgatgagctt tgggctgacc 3166501 gccgccgaaa tggcccgcgg ccagtactgg attcatccgg cgctgccgga ggtggtggaa 3166561 aacgcgctgc ttggcctgcg ttgaccgcaa cggcgagccg tcgtccggca agcgatttgc 3166621 atcccgtcag cgccttacct acagtcggga catcgcgttc tgccccgtgc tggaaggacc 3166681 gacatggcca gcagccagct cgacaggcag aggtcgcggt cggccaaaat gaaccgcgct 3166741 ctgacagcag cagaatggtg gcgtctgggc ctgatgttcg cggtgatcgt cgccttgcat 3166801 ctggttggct ggctcaccgt gacgctcttg gtggagcccg cgcggctcag cttgggcggc 3166861 aaggcattcg gcatcggcgt cgggctgacg gcgtacacgc tgggcttacg gcacgcgttc 3166921 gacgccgacc acatcgccgc catcgacaac accacccgca agctgatgag cgacggacac 3166981 cgaccccttg ccgtcgggtt cttcttttca ctgggccact ccacggtggt cttcgggctg 3167041 gcggtaatgc tggtgaccgg actcaaggct atcgtcggac cggtcgagaa cgactcctcg 3167101 acgctgcatc actacacagg cttgatcggt accagcattt ccggcgcgtt cctgtatttg 3167161 atcggcatcc tcaacgtcat cgtcctggtc ggcatcgtgc gtgtcttcgc ccacctgcgc 3167221 cgcggcgact acgacgaagc cgaactcgaa cagcagttgg acaaccgcgg actgctcatc 3167281 cggttcctcg gccgcttcac caagtcactc accaagtcct ggcatatgta cccggtcgga 3167341 tttttgttcg gtctcgggtt cgacaccgcc accgagatcg cgctgttggt gctggcggga 3167401 accagtgccg cggccggcct gccctggtat gccatcctgt gcctgcccgt cttgttcgcc 3167461 gccggcatgt gtctgctgga caccatcgac ggttcgttca tgaatttcgc gtacggctgg 3167521 gccttctcca gccccgtgcg caagatctac tacaacatca ccgtcaccgg actgtcggtg 3167581 gcagtcgcac tgttgattgg cagcgttgag ctgctgggcc tgatcgccaa ccagttgggt 3167641 tggcagggcc cgttctggga ctggcttggc ggcctcgacc tcaacaccgt cggcttcgtc 3167701 gtcgtcgcga tgttcgcgct cacctgggcc attgccctgc tggtctggca ctacggccgc 3167761 gttgaagagc ggtggacccc ggcgcccgac cgcacaactt gacctcgggc gatcaaccct 3167821 agggcggtgc cgccggaatc gagacggtag ccaagcgagc ggtcgacgtg ttggaaaaga 3167881 tcttcgccga gaacgatgtc cgcgcgaacg tcaaccgggc ggcgtttgag aacaacggga 3167941 tccgcgcgct ggacctgatg agctcaccgg ggtcggggaa gacgaccgtg ctgggcgccg 3168001 cgctcgacga gcacgccgac caattcgcaa tcggcgttat cgaaggcgac atcaccaccg 3168061 acctggacgc ggccaatggc cgcggcaccc aggtgtcgct gctgaacaac cagcatggct 3168121 tttgcgccga atgccacctc gacgcaccta tggtcaaccg cgccctagct ggtgcgcccg 3168181 acggagttcg acgtcggtaa gcgccaaggc gatggtctcc tcggtcaccg agggcaagga 3168241 caagccgctg atgtacccgg cgacgttccg ctcgagggat gtagtgctgc tcgacaagat 3168301 cgacttggtg ccctttctgg acgccgacgt ggacgcgtat atcgcgcatg tccgcgaggt 3168361 caacgcagcc gcgacgatcc tgccgaccag cacgcgcacc ggagccggca tggggtcctg 3168421 gtcatgagcc gccggaaacg gctcgtctca tcggctttca cggtgaggcc accgcagccg 3168481 aaatggacaa cgttgatcgt cttccgggcc tgacagcaat ccgactgtga aatgcactac 3168541 gcgacacgct aacccgttgc gcagttcaca ctcggggcgc gatcacagcg gagtgacata 3168601 ggccgagctg atcccaccgt cgaccaggaa cgtcgaagcg gtgatgaatg atgcgtcgtc 3168661 gctggctaaa aacgctaccg cagcagcaat ttcgtcgggc tcggcgaacc ggcccagcgg 3168721 cacatgcacc atgcggcgag cggcccgttc cgggttcttg gcgaaaagct cttgcagcag 3168781 tggggtgttc accggccccg ggcacaacgc gttgacccgg atgccctgcc gagcgaattg 3168841 cacgcccagt tcccgtgaca tagccagcac tccacccttg gaggcggtgt aggagatctg 3168901 cgacgttgcc gaacccatca ccgcaacgaa ggacgccgtg ttgacgatgg agcctttccc 3168961 agcaagcacc atgtggcgca gggccgcccg gcagcacaag tacaccgact tcaggttgac 3169021 gtcttgtacc cgttgccacg ccgcgagctc ggtgttttcg atcagattgt cctcgggtgg 3169081 tgagatgccg gcgttgttga acgcaatatc tatgcggccg taggtttcgg ctgctccgtc 3169141 gaacagcccg ttgacggcgt cctcatcgca aacgtcggtt ggcacaaaca agcctgatag 3169201 ttcgtcagcg gccgcaccac cggcctcgac gtcgacgtcg ccgaccacga tcgtggcgcc 3169261 ttccgcccgc atccgacggc cggcagccag gccaataccg ctgccaccgc ctgtgatcac 3169321 cgccacccgg ccggccagcc gttggctgag gtccatcaca tctcctcccc gacggcgatg 3169381 aacacatttt tggtttcggt gaactgcagc ggagcgtccg gccctagctc gcggcccaca 3169441 ccggactgct tgaaaccgcc aaacggggtg ttgaagcgca ccgacgagtg cgagtttacc 3169501 gacaggttgc cggattcgac cgcccgcgcc acccgcagcg cgcgggacag gtcatcggtc 3169561 cagatcgatc cggacagccc gtacgcggtg tcgttggcca ggctgatagc gtcggcctcg 3169621 tcgtcgaacg tcagcactac aaccaccggc ccgaagattt cgtcggtgac ggtgcggtcg 3169681 ccgcgtttgg gtgtgagaac ggttggtgga aaccaaaatc cgcgcccagc cggagccgta 3169741 ccccgaaacg ccaccggagc gtcgtcgggc acataaccgg cgaccttgtc acggtgtgcg 3169801 cgcgatacca gcggacccat ctcggtggcg cgtgatccgg ggtccccgac gacaatgctg 3169861 tgtaccgccg gctcgagcag ctccataaac cggtcgtaaa cgctgcgctg caccaggatt 3169921 cgacttcggg cacagcaatc ctgcccagcg ttgtcgaaga ccccggccgg cgcggtcgtc 3169981 gcggcgcgct ccaggtcgca gtcgtggaag acgatgttgg cgctcttgcc acccagttcc 3170041 aacgtcactc gtttgacttg agccgcggca ccggccatga cccgcttgcc gacttcggtg 3170101 gacccggtga acacgatctt gcgaatgtcg gggtgggtga cgaaccgctc cccgaccacc 3170161 gtgccctttc ccggcaacac ctgcagcagg tcttcgtcca gacccgcctc gacggccagc 3170221 tcaccgagcc gcatcgtggt cagcggcgtc agttcggcgg gtttgaccag caccgcgttg 3170281 ccggcggcca gcgccggcgc gatggcccag gacgcgatca ccatcgggaa attccatggc 3170341 gtgatcacac cgaccacgcc catcggttcg ttgaaagtga cgtccacccc gccggcaacg 3170401 ggaatctgcc tgccggacaa ccgttccggg ctggcggcat agaacgccaa cacgtcacgc 3170461 acgtggccgg cttcccactc ggccgacacg atcggatgtc cggaattggc tacctcgagc 3170521 gcggccagtt cgtcgaggtg ggcttgcacg gctgccgcga atgcgcgcag gccggccgcc 3170581 cgctgcgccg gtgccaaccg tgcccagcgc cgctgcgctg ctcgcgcgcg ttgcacggcg 3170641 tcgtccaccg cgttggcgtc ggtgtggtca actgaggcca gcacttcctc ggtggcggga 3170701 ttgatcagtt gcgtggtact catcgtggct ccgcttggct ctgccggccc gcgtatccgc 3170761 tggcggcgtc caccaacgcc ttaaacagcc gcagatcgtc caacgacttc tccggatgcc 3170821 actgcaccgc tagtacgaac gtgtccccag gtagctccag cgcctcgatt accccgtcga 3170881 catccaccgc actgaccacc aggccctcac cgacctggtc gatggcttgg tggtggtagc 3170941 acggcacgtc ggcggattcg ccgatcagct cggccaaccg ggtgcccgat gcggtgtgga 3171001 ccggcaacct ggtgaagacc ccgttgcccg cccgatgccc gctatggcca aggatgtcgg 3171061 gcaggtgctg gtgcagcgtg ccgccgagcg cgacgttgag cacctgggtg ccgcgacaga 3171121 tgcccaacac gggcatcccc cgctgaagcg cgccccgcaa tagcgcgaac tcccaagcgt 3171181 cgcggcccgg gcgagggtga tcggtggccg gatgcggctc ctggccataa gctgccgggt 3171241 ccaggtcgta gcccccggtg atcaccagag cgtgcaggct gtccagcacg cagccgacgc 3171301 tctcggggtc gaccggctgc ggcggcagca gtaccgcaac acccccggcc atggtgatgc 3171361 cttcgaagta atcggcgggc agataacccg caggaatatc ccaaaccccg gtgcgcacct 3171421 gctccagata agccgtcagg ccaaccaccg ggcgactcgc gcccagtggc gatcgcaagc 3171481 gcggcgaagc cgggcgcagc gggtcgccac catcggacac aggcgatcgc aagcgcggcg 3171541 aagccgggcg cagcgggtcg ccaccatcgg acacaggcga tcgcaagcgc ggcgaagccg 3171601 ggcgcagcgg gtcgccacca tcggacctag aggcgctcaa atccacgtat cctctcccaa 3171661 tcggtgaccg ccgcgttgaa cgccgccagc tccacacgcg cgttgttcag gtagtgcgcg 3171721 acaacatcct cgccgaacgc ctcgcgcacc agcgcagaat cctcgaacag caccgcggcg 3171781 tcggccagcg taaccggcag ccgttcgaca tcggcgcctt ggtaggcgtt gccgacacag 3171841 ggctcgggca gctgaaggcc ccgctcgata ccgtacaacc ctccagcaat gagagccgcc 3171901 accgccaggt actggttgac atcaccgccg ggaacccggc attcgacccg gatgttttgc 3171961 ccgtggccaa ccacccgcag ggcgcaggtg cgattgtcca gcccccaagc cagcgccgtc 3172021 ggcgcgaaac tgctatcggc aaatcgcttg taggagttaa tggtcggcgc atagcacagc 3172081 gtgaattcgc gcaacgtggc caactggccg gcgacgaagc tgcggaacat cgacgacatg 3172141 ccgtgcggcc cgttactgtc ggcaaacacc gcggagccat ccgtgccacg cagcgagaca 3172201 tggatgtgac agctattacc ttcgcgttca tcgtatttcg ccatgaacgt taggctcttg 3172261 ccgtgctggt cggcgatttc cttggcgccg ttcttgtaga tcgcatggtt gtcgcaggtg 3172321 accagcgcct cgtcgtaacg aaacccgatc tcctgctggc ccatgttgca ttcgcctttg 3172381 accgcctcga atcgcagacc cgcaccggcc atacccaacc ggatgtcgcg cagcaacggc 3172441 tccatccgcg aggatgccaa tatcgcgtag tcgatgttgt agtcgctggc cggggtcagc 3172501 ccgcgatacc cgctggccca tgcctggcga tacggctggt cgaacacgat gaactccagc 3172561 tcggtggcca catcggcgac cagtccgcgc gccttgagcc gatcgagctg acggcgcaga 3172621 atgctgcgcg gcgagacggc gacctcgctg ccgtcggccc agaccaggtc ggcgatcacc 3172681 agcgccgttc ccggtagcca aggaatcagc cgcagagtgg acaagtccgg cgtcatcacc 3172741 atatcgccgt agccggtgtc ccaactggcc atcgcatagc cgggcaccgt gttcaggtcg 3172801 acgtccacgg ccagcagata actgcagcac tcgacgccgc gggtggctat gtcgtcgacg 3172861 aaatgccggc ccgatatccg tttgccggcc agccggccct gcatgtcggt gaacgcgacg 3172921 atgacggtgt cgacgtcacc ggccgcgacc agtcgctcca actcggtcca cgccaacggc 3172981 ggcgaaccgg ggccggtcac cgcacttcct cccacaccat ggccgctagt caaccatcta 3173041 taggctccgg gcccacatgc tggctgtcgc gggcaccgcg aaccgccgga gccggcgagt 3173101 agacgcgaaa gaacatgatg ggcgctggtg cccatcatgt tcttttgcgc ctactcgcgc 3173161 tacagacagg tcaggatctc gacgccggta tcggtaacca gcagggtgtg ttcgaactgt 3173221 gcggtccact tgcggtcctt ggtgaccacc gtccaaccgt cgtcccagat ttcgtagtcc 3173281 agtgcgccca agttgatcat cggctcgatg gtgaaggtca tccccggctg catgatggtc 3173341 tcgacagcgg gctggtcgta gtgcaagacg accagcccgt tgtggaacgt cgtgccgatg 3173401 ccatgaccag tgaagtctcg aaccacgttg tacccgaacc gatttgcata cgactcgatg 3173461 acacgaccga taacggacaa cgcccgcccg ggcttgacgg tgttgatcgc acgcatggtc 3173521 gcttcgcggg tccggtcaac gagcaaccgg tgttcgtctg cgacatcgcc ggccggaaac 3173581 gtcgcgttgg tgtcaccgtg caccccaccg atgtaggcgg tgacgtcgat gttgacgatg 3173641 tcgccgtcgg tgatcaccgt cgagtcgggg attccatggc agatgacctc gttgagggac 3173701 gtgcagcacg acttcgggaa tcccttgtag cccagcgttg atgggtaggc gccgttgtcg 3173761 accaggtatt cgtgcgcgat ccggtcgagt tcgtcggtgg ttaccccggg cgcgaccgcc 3173821 ttgcccgcct cggccaacgc acctgcggcg atccggcctg ccacgcgcat cttctcgatg 3173881 acctcaggtg tctgcaccca cggctcgctg ccctcttggg cggccggttt gccgacgtat 3173941 tcggggcgcg cgatccagtt gggcaccggc cgtgtcgggg acagcacgcc gggggagagc 3174001 gcggtacgac taggcatccc gctagcttag ccgggcaaat tttggccgcg cccggctatc 3174061 agccccggtg tcggcgcagc agtgcgcgcc gcggtccctt gatgaccacc gacccgcaca 3174121 ccatccgacc ggtcaacacg acatgcggtg ttccttccgc cggtgcgtcc ttgcggcggt 3174181 cgctcgcgct acccacatag acctcgacgt cgtcgatcga cgcactggcg ccgttgggca 3174241 gccggacctc aagtgagccg aacatcatat cgagttcgat caccaccacc ggccccgcga 3174301 aacgggcctt gacgaggtcg agttcgattg accccagccg acgcaccagc gccagccggg 3174361 tgggcacgat ccattcgccg tggcgtttca gggagccggc ccagccgcgc agctccaccc 3174421 ggtcggccgc ggacgtgacg atcgcgccag gcctgggcag gtcaccgacc agcccatcca 3174481 gctcgcttcg cgtacacgcg aaggaaaccc gtgacgagcg ctgctcgaac tcgtcgatgt 3174541 tgataagccc gagcgccacg gcgttgtgca gtcgtcgcat tgtgccgttg cggtcggcgt 3174601 ccgagacccg caacgccacc atgtccccac cggtctccgt catggcccat tcccgagagt 3174661 tctggcacgg cttcaacggc gaacttcgcc taccccccgc aacttaccgc tgttgaaagg 3174721 ccgccgaaaa cctagcagtt taggtaatcc tttccgacga agagcgggag gcgttccggc 3174781 agcaagccgc agcccagcag atgtccctca gtaactggct gcgtcaagcg gggctcaggc 3174841 agctcgaggc acagcgacaa cgtcccctgc gcaccgccca ggaattgcgc gagttctttg 3174901 cgtcacggcc cgacgagaca ggggcagaac ctgattggca ggcgcatctg caggtgatgg 3174961 ctgaatcgcg ccgtcgcggc ctgccggcgc catgatcttc gtcgatacca acgtcttcat 3175021 gtatgcggtc ggtcgcgatc acccattgcg gatgcccgcc cgtgagttcc tcgagcacag 3175081 cctcgaacac caagaccgcc ttgtcacgtc agccgaggcc atgcaggaat tgctgaacgc 3175141 gtatgtgccc gtcgggcgga actcgacgct ggactcagca ttgaccttgg tgcgggcgct 3175201 gacggaaatc tggcccgtcg aggcggccga cgtcgcgcat gcgcgaaccc tgcaccaccg 3175261 ccaccccggt ctgggcgcgc gcgatctgct acacctggca tgctgccagc gtcgcggtgt 3175321 cacgcggatc aagacgttcg accacacact ggccagcgca ttccgatcat gacgcgtccg 3175381 tgtgggcgcg agcgtccgca gttgtacggc cctaacggcg tgtcgtcgta caaacgagga 3175441 ggggcgagcc gcgctacgcc aggtaccccg gcggcagcga ttcgaacatc accttggtca 3175501 tccgcaccgc gtattccgag ctaccgcccc cgacgatcag cgacgcaaat gccagatcgc 3175561 cacggtaccc ggcgaaccag gaatgcgatc cgcccgggaa ttcggcttcg ccggtcttac 3175621 cgaacacctc gccacagcca gcgatctcct tggcggtgcc attggtcacc accaaccgca 3175681 tcatgggccg cagcgcgtcg atcatcttct ggctgatcgg tgtggcatcg ccttcgacgg 3175741 ccgtcggccg gccggcgatc agctgtggaa ccggggtctt cccggcggct accgtcgccg 3175801 ccaccaaggc catgccgaac gggctggcca gcaccttgcc ctggccgaaa ccgtcctcgg 3175861 tgcgttcggc caggtccacc gtcggcggca ccgaaccggt caccgtggtg atgccgtcca 3175921 cctggtagtc aagcccgatc ccgtaccgcc gggccgcctg agtcagaccg cggggaggca 3175981 gcctgctgct cagctcggcg aaggtggtgt tgcaggaact ggcaaacgcg cgtgacatcg 3176041 gcaccacgcc cagatcaaag ccaccgtagt tgggaatggt gcgatgcccg atgtcgatct 3176101 ccccggggca acccagcagc gtctcagggg tagccaggtc acgctcgacg gccgcaccgg 3176161 cggtgatcat cttgaatgtc gacccgggtg gatatagacc ggtggtcgcg accggaccgt 3176221 ccgcatcggc cccggcgttc tgcgcgatcg ccaggatctc gccggtcgac ggcttgatca 3176281 cgacgatcat cgccttgccg ccccgggtgt tcaccgcgtg ttgcgcggcg ttttgcacga 3176341 cccgatccaa cgtgatcgaa accgacgacg caggtgatgg ggcgacctcg tgcagcaccg 3176401 agacgtcgac gccattttgg ttgacgctca ccacccgcca acccgccttg ccgtcgagtt 3176461 catcgacgac ggccttcttg acatcgttga ggaccgccgg cgcgaagtgc ttgtcggtcg 3176521 ggagcagctc ggcctgcggt gtgatcacca cgccaggcag ctgcccgatc gccgcggcca 3176581 cccggttgct gtcgtcggcg tgcaacgtga ccaggtccaa cggctgggtc gacgagctgg 3176641 cctgttcggc cagcagctgc ggatcattga gcgtgtcgtc gaaggggtgc agcgcgccca 3176701 ccaccgcgtg tgccgtgccg aagagctcgc ggccggcctg gccggcgtcc agcgagtagt 3176761 gatacagata gcccggcacc agcacatcgg tgccgccgac ttcgttcacc gaggcgcgcc 3176821 gcggcgggtc ggctcgtagc gcgaacgttt gatgttcgcc tagcttggga tgcaacccgc 3176881 tggtggtcca gcgaacgtgc caacgccctt cgtcgcgggc catcttcagc tggccgtcat 3176941 aggtccagat tcggtccttg ggcagatgcc agctgaagcg ataagcgacc gtaccggtgt 3177001 cctcggcgta cttggcgctg agaacctgcg catccaggtg ggcggcctgc agccccgccc 3177061 aggccgcgtt cagcgcttcg cgcgcctcgt tggggttgtc gctgagctgg gcggcggagg 3177121 cggtgtcacc gatggccagc gcggcgaaga acttttcggc cgccggaccg ggcccttggg 3177181 gacgcggggt gcagcccgac atggcgacga ccgcaagcag cagcaaacct gaggtggctg 3177241 aggctaatgt tgttttagtt accatcgttg ctgatgttaa gaactgtgac ggagacaccg 3177301 gccgcgacac accgagaccg aaccgttacg ccgagactag gtcgcgaatg gaacaccacc 3177361 gcgaaaatcg tggccagaaa tcgcaaccac gttacgctcg cgaccgctca atcgagcaag 3177421 gcgccgaccg caagcaccag caaacctgag acgccgcgca caaagtgcga aaccactgga 3177481 aggtgagccc taatttaggg ctgagcagga cctgtataac ggcctagtat ggcggtatgc 3177541 ggatactgcc gatttcgacg atcaagggca agctcaatga gttcgtcgac gcggtctcgt 3177601 cgacacagga ccagatcacc atcaccaaga acggtgcacc cgcagccgtt ctggtcggcg 3177661 ccgacgagtg ggaatcgttg caggagacgc tgtactggct ggcgcaaccc ggaatcaggg 3177721 agtcgatcgc tgaagccgac gccgacattg cctccggccg cacctacggc gaagacgaga 3177781 tccgcgccga attcggcgtc ccgcgacgcc cccactgagc ggtgccttac accgtgcggt 3177841 tcaccacaac cgcgcgtcga gacctccaca agctgccacc gcgcatcctc gcggcagtgg 3177901 tcgaattcgc gttcggcgat ctgtcgcgcg agcccctgcg ggtgggcaag ccccttcggc 3177961 gcgagttggc cggcacgttc agcgcgcgtc gcggaacgta ccgcctgctg taccggattg 3178021 acgacgagca cacaacggta gtgatcctgc gcgtcgatca ccgcgcggac atctaccgcc 3178081 gatagcaact caccgacggg cgctctgccg tccgacggca gccatgactg agatcggtcg 3178141 gccgggcggc tccgaaaaga cctgaacaga acctcaggat tcctatgctc ccaatgtggc 3178201 ggcaatcacg aagaagctaa tcctcggcca gatccgggaa gtggctgagg cgaacgacgg 3178261 ccgaccgccc ggctgtgagc gctttgccgc cgagaccgga attccagcaa gcgcgtggcg 3178321 tggacggtat tgctaaccca ttctttcaag accgacgatc ctgttggcat cgagaggtac 3178381 tggcagcgcc gacacttgcc agaccgcatc gccgtgcaca ggacgtcgtc agcgctgata 3178441 tgcccgcagc tcggcgctca gtccagcaac accgtcgcga acgtgccgat ctccttaaag 3178501 cccacccggg cgtaggcggc acgggccacc gtgttgaagc tgttcacata caggctggcg 3178561 atgcgcccgc tgccgacgat cactgcggcc aacgttgcgg taccagccgt gcccagaccg 3178621 ataccgcgcc actccggatg aacccagacc ccctggatct gcccgacggc cggagattgc 3178681 gatcccactt cggccttgaa gatcacttga ccgtgctcga atcgggccca cgcgcgtccg 3178741 gccgcgatga ggccggccac ccggcgacga tagccgcgac caccgtctcc gagccgaggg 3178801 tcgacgccga cttcgccgat gaacatgtcg acggcggcca ccaggtagga gtccagttcc 3178861 tcgggccgta cctggcgtac gccggtgtcg atagcgcagc tggggtgagt agccagggcc 3178921 atcagcggtt ggttgtcgcg gacatcccgc gccggacccc acaccggctc gagccgctgc 3178981 cacatcggca acaccaggtc ggccctgccg accagtgacg aacaccgtcg cggcgtgctc 3179041 atcgccacgt cggcgaacgc attcaggtcg atcggtccgc cgcgcagcgg gatgaggttg 3179101 gcaccggcga aacacaggga ttcgtgcgcg ccgcgtcggg tccacagctc cccgccaatc 3179161 gcattgggat cgatgccatg gtctgcgacc cgggcggcga ccatgcacga ttcgatcggg 3179221 tcgtcgtcga gtacccgcca cacggcggcg gcgtcacgca ccacggacac ttgccgctcg 3179281 ccgacaagcc gagagatggg cggagccgac atctgcgaac tccctttggt gggaactgac 3179341 ggccactgaa tgaaaagctg acccctatca gcttacggtc acaataggcg aaccgctcgg 3179401 tgtcgcgccc ggatcttgct cgcccatttc ggcggccagc cgcatcgcct cctcgatcag 3179461 cgtctcgacg atctgtgctt cgggcacggt cttgatcact tcgccccgta caaagatctg 3179521 acctttgccg ttgccggacg ccacgcccag gtcggcctca cgtgcttcac ccggaccatt 3179581 gacgacacac cccatcacgg ccacccgcaa cggcacatcg agaccatcca ggccggcggt 3179641 tacctcgttg gccagggtgt agacgtcgac ttgcgcgcga ccgcacgacg ggcaagacac 3179701 gatctcgagc gaacgcggcc gcaggttcaa cgactcgaga acctgattgc ccaccttgac 3179761 ttcctcgacc ggcggggccg acaacgacac ccggatggtg tcgcctatgc cccgcgacag 3179821 caacgcgccg aaggcaaccg cggacttgat ggtgccctgg aaagcagggc cggcctcggt 3179881 gacaccgagg tgcagtgggt agtcgcaccg tgcagcaagc agctcgtagg cggcgaccat 3179941 caccaccggg tcgttgtgct tgacgctgat cttgatgtca ccgaagccat gctcctcgaa 3180001 aagcgaagcc tcccacagcg ccgactcaac cagcgcctcg ggcgtggctt tgccatactt 3180061 ctccatgaac cgtttgtcca gcgaaccggc gttgacaccg attcggatcg ggatcccggc 3180121 cgcacccgcc gccttggcga cctcacccac ccggccgtca aactccttga tgttgcccgg 3180181 gttgacccgc accgcggcac atccagcgtc gatggcggcg aatatgtagc gcggctggaa 3180241 atgtatgtcc gcgactaccg ggatctggct gtgccgggcg atctcggcca gcgcgtcggc 3180301 gtcctcctgg cgcgggcagg ccacccgcac gatgtcgcat ccggccgcgg tcagctcggc 3180361 gatttgttgc aatgtcgagt tgacgtcgtg ggttttggtg gtgcacatcg attgcaccga 3180421 gaccggatgg tcactgccca cgccgacgtt gccgaccatc agctgacggg tggcgcgccg 3180481 gggagcgagc gtgggtgccg ggggctgcgg catgcccaag cctacagtca ctgaaaatcc 3180541 tttctaccta ctggaaaagc ctaatcgggt tgaccaggtc ggcggtgacg gtcaagagca 3180601 tgtacccgac gacaagaacc aagaccacat aggtcgccgg caagagtttg aggtaattca 3180661 ccggtgcggc cgccaccttg ccacgagccg accggaccat gttgcggatc ctctcgaaca 3180721 ccgcgacggc aatatggccg ccatcgaacg gcagcaacgg cagcaggttg atcgcagcca 3180781 ggatgaggtt cagctgggcc aagaagaacc agaacgccac ccacagccca tggtcgacgg 3180841 tgtcgccgcc gatgatgctg gcgcccacca cacttatcgg cgtctgcggg tcacgctgcc 3180901 cgccgccgat cgcccgcacc agcgcaccta ccttggtcgg gagggcggcc agcgccttgc 3180961 ccacctccac ggtcaggtcg ccggtgaccg cgaatgtggc cggcatggcg gagaacacgc 3181021 cgtagcgcac aggcccgacc cgggcggcgc ccaccccaat cgcaccgacc gttgccggct 3181081 ggagctcacc gccctgcccg ttagggatcc agcgttgggt ggattcgatg tccacgtagg 3181141 taacaatcgc ggtgccgtca cgctcgacaa cgatcgggac gctgccgtgt gacttgcgca 3181201 ccgcggcggc catctcgtcg aaactggaca ccggggtgtc accgaccttg accacgacgt 3181261 caccggagcg aattccggcc agcgccgccg gaccgggccc ggtgcactgc tcgagcttgc 3181321 cctggctcac ttcctgtgca acgcagccag tttcgccgat tacggccctg gttggcggat 3181381 gcaggttagg cagcccccag accagcgcga tggcatagat cagcaccagg cagatagcga 3181441 ggttcattcc gggcccggcg aataacactg cgacccgctt ccaggtggcc tgcttgtaca 3181501 tcgcacggtc acgttcgtcg gggtcgagtt cctcgaccgg ggtcatgccg gcgatgtcac 3181561 agaagccgcc cagcggaacg gctttgacac cgtattcggt ctcgccgcgc cgggtcgacc 3181621 acaacgtggg gccaaagccg acgaaatagc gacgtacctt catcccggtg cggcgcgcga 3181681 cccacatgtg accacattcg tgcagggcca ccgaaatcag gatcgcgagc gcgaacagca 3181741 caatgccggt aacaaacatc atcgaggtgt caggaccttt ctaacgtcga tgcgtgtcga 3181801 cccgctgcgc ccggcttcgc cgtgcttgcg atcgccaccg aagccatacc agataccgcg 3181861 cgctgcgctc gctcgcgggc ccagcgctgc gcgtcgagta cgtcatccac ggtagcgggt 3181921 tcgacggccc attggtcggc agcgtgcaac acgtcggcga tgatgccgac gatggccggg 3181981 aagccgatcc ggccagcaag gaacgccgct gctgcttctt cgttcgccgc attgtaaacc 3182041 gcggtcatgc agccaccggc tacgccggcc tgccgggcca actcgaccgc ggggaagacg 3182101 tcggtgtcca acggctcgaa ctcccagctc gacgcggtat ggaaatcaca ggcagcagcg 3182161 gcgccgctga cccgacgcgg ccagcccagc gctaacgaaa tcggtagctt catgtccggg 3182221 ggactggcct gggcgatcgt cgaaccgtcg atgaaggtga ccatcgaatg gatgatcgac 3182281 tgggggtgca ccacgacatc gatgcggtcg taggggatgc cgaacagcag gtgggtttcg 3182341 atgacctcaa gtcccttgtt gaccagcgac gccgaattca gcgtgttcat cgggcccatc 3182401 gaccacgtag gatgcgcgcc agcctgctcg ggggtgacat gctcgaggtc ggccgcggac 3182461 cagccccgaa acggccctcc cgaggccgtc agcaccagct tggcgacctc gtcgggagtg 3182521 ccgccgcgca ggcactgggc cagcgcggag tgttcggagt cgaccggcac gatctgaccg 3182581 ggccgcgccg cccgcagcac cagcgaacca ccggcgacca gcgattcctt gttggccagc 3182641 gccagccggg cacccgtctt gagcgcggcc aacgtcggtc gcaggcccaa cgcgccgacc 3182701 agcgcattga ggacgacgtc ggcctcggtc tgctcgacca gccgggtggc ggcgtcggat 3182761 ccgtggtagg ggatgtcgcc gacccgctgc gccgcgtgct cgtcagcgac ggcaatattg 3182821 gtcaccccgg tctgcgcacg ttgtcgcagc aacgtgtcca gatgggcgcc gccagcggcc 3182881 agcccgacta cctcgaaacg gtccggattg tcggcgatga cctgaagcgc ctgggtgccg 3182941 atcgagccgg tactgcccag caccaccacc cgcaaccggc cgtcagcgcg cccgtcggtc 3183001 gagttggtca cctcatcatt gtgcgccacc acctcgttgt caccgcgccg ccggatcacg 3183061 acgcgtccac cggtagccac acttccccgt ggaatgcaat cgtcttgatg cctgcgcttg 3183121 atgctaagat gccatgcgtg cgcacgacga tccgtatcga tgacgagctg taccgcgagg 3183181 tgaaagcaaa ggccgctcgt tccgggcgta ccgtggccgc ggttcttgaa gatgcggtgc 3183241 ggcgtggtct caacccgcct aagccgcagg ccgccggccg ttatcgagtc cagccgtcgg 3183301 gtaagggcgg cctgcggccc ggtgtcgatc tatcgtccaa cgccgcactt gccgaagcga 3183361 tgaacgacgg cgtgtcggtc gatgctgtgc gttgatgtca acgtgctcgt ttacgcgcat 3183421 cgggcagacc tacgggagca cgcggactat cggggtttgc ttgagcggct ggccaacgat 3183481 gacgagccgc tgggtctacc agatagcgtg ctcgccggct tcatccgggt ggttaccaac 3183541 cgccgcgtct tcaccgagcc gacgagccca caggacgcat ggcaggcagt cgacgcccta 3183601 ctcgcggcac ccgcagccat gcgacttcgg cctggcgagc gccactggat ggcctttcgg 3183661 cagttagcgt ccgatgttga tgcgaacggc aacgacattg cggacgcgca cctggccgcc 3183721 tacgcgctag agaacaacgc aacctggttg agcgccgacc gcggctttgc ccgtttccgt 3183781 cgactgcgct ggcgtcatcc gttggacggt cagacccatc tataaccggc cccactccga 3183841 atcactggtg tccacccagg aggacggcgt tcaacgccgc cgcagaagca aaggaatcga 3183901 agcgatgatc aacgttcagg ccaaaccggc cgcagcagcg agcctcgcag ccatcgcgat 3183961 tgcgttctta gcgggttgtt cgagcaccaa acccgtgtcg caagacacca gcccgaaacc 3184021 ggcgaccagc ccggcggcgc ccgttaccac ggcggcaatg gctgaccccg cagcggacct 3184081 gattggtcgt gggtgcgcgc aatacgcggc gcaaaatccc accggtcccg gatcggtggc 3184141 cggaatggcg caagacccgg tcgctaccgc ggcttccaac aacccgatgc tcagtaccct 3184201 gacctcggct ctgtcgggca agctgaaccc ggatgtgaat ctggtcgaca ccctcaacgg 3184261 cggcgagtac accgttttcg cccccaccaa cgccgcattc gacaagctgc cggcggccac 3184321 tatcgatcaa ctcaagactg acgccaagct gctcagcagc atcctgacct accacgtgat 3184381 agccggccag gcgagtccga gcaggatcga cggcacccat cagaccctgc aaggtgccga 3184441 cctgacggtg ataggcgccc gcgacgacct catggtcaac aacgccggtt tggtatgtgg 3184501 cggagttcac accgccaacg cgacggtgta catgatcgat acggtgctga tgcccccggc 3184561 acagtaacgt tcggcgcggt caaggcgagg cagcccgtgt aggcggtttg cctcgctcat 3184621 ccggcggctt cgtgccgata gatcacgtga tatcccaagc gcatgacggt gacaccgcgc 3184681 ccagcgcaag ccgatccccg cagcatgcct gctgaagtcg cgtctcgcga actgcgcaac 3184741 aacaccgccg ggctgctacg gcgcgtgcag gccggcgaag acatcaccat cactgccaac 3184801 ggcaaacccg ttgcgctgct gaccgcaggc agcccgcacg gcgccgatgg ttgagtcgag 3184861 acgagctgct gcggcggctt cggcatacgc aagcagatgc gggattgcac ccgcgacctc 3184921 gcaacgctca ctggcgacac caccgacgat ctcggtcccg tccggtgagg gccgctgccg 3184981 ttgccacgtc gcaaggggtg ccggtcgtga cccacgacgg cgacttcgac gccgtcgatg 3185041 gtgtggccga tgtggctatc attcgcatct gacgggtggc gagttcgacg tgaaccgact 3185101 ctgtcaacag cgctcgcgtg agcggtcctg ccaactcgtt gccgtcccgg cagatccaag 3185161 acctaaacgg caacgaataa ccgatgtgtt gaccctcgca ctagtcggct tcctcggcgg 3185221 cctcatcacc ggaatatcac catgcattct gccggtcctg ccagtaatct tcttctccgg 3185281 cgcgcagagc gtcgatgcag cgcaggtggc gaaacccgaa ggcgccgtag cagtccggcg 3185341 caaacgtgcg ctatcagcga cattgcggcc ctaccgggtg atcggtggtc tggtgctcag 3185401 tttcggcatg gtcaccctgc tcggctcggc attgctgtca gtgctgcatc taccgcagga 3185461 cgccatccgc tgggccgcac tggtcgcctt ggtggcaatc ggcgccggcc tcattttccc 3185521 gcggtttgaa caacttctgg aaaaaccgtt ctcccgtatt ccgcagaagc aaatcgtcac 3185581 tcgcagcaac ggtttcgggc tgggtctagc cctgggcgtg ttgtatgtcc cctgcgccgg 3185641 cccgattcta gctgcgatcg tcgtggccgg ggctactgcc accatcgggt tgggaaccgt 3185701 cgtgctcacc gcgacattcg cactcggagc cgcgttgccg ttgttgttct tcgccctcgc 3185761 cggccaacgg atagctgagc gggtgggcgc ttttcggcgc cgccagcgtg agatcaggat 3185821 cgccaccggt tccgtgacga tcctgctggc ggtggcgttg gtgttcgatc tgccggccgc 3185881 gctgcagcgg gctattcctg actacaccgc atcgctgcag cagcagatca gcaccggcac 3185941 ggagatacgg gaacaactga accttggcgg catcgtcaac gcccagaacg cacagctgtc 3186001 gaattgcagc gacggggccg cacaactcga aagctgcggc actgcaccag atctcaaagg 3186061 catcaccggc tggctcaaca cgcccggcaa caagccgatc gacctgaaat cattgcgtgg 3186121 caaggtggtg ctgattgact tttgggccta ctcctgcatt aactgccaac gggccatccc 3186181 ccacgtcgtc ggttggtatc aggcctacaa agacagtggt ttggcggtca tcggcgtgca 3186241 cacccccgag tacgctttcg agaaggtccc gggcaacgtc gccaaaggcg cggccaatct 3186301 gggcatcagc tatccgattg cgctcgacaa caactacgcc acttggacca actaccggaa 3186361 tcgctattgg cccgccgagt atctgatcga cgctaccggg acggtgcggc acatcaagtt 3186421 cggagaaggc gattacaacg tcaccgagac gttggtcagg cagttgctca acgatgccaa 3186481 gcccggcgtc aaactccccc agcccagcag caccaccacg cccgacctta ccccgcgggc 3186541 cgcacttact cccgagacgt acttcggagt cggcaaggtg gtcaactacg gcggcggcgg 3186601 cgcatatgac gaagggtcgg ccgtgtttga ctacccgccc agtttggcag ccaacagctt 3186661 tgcactgcgc ggccggtggg cgctggacta tcagggtgcc acgtccgacg gcaacgacgc 3186721 cgctatcaaa ttgaattacc acgccaaaga cgtctacatc gttgtcggtg gcaccggcac 3186781 cctcacggtc gtgagggacg gaaagccagc cacactaccg atcagcgggc cgccgaccac 3186841 ccatcaggtg gtcgccggct atcggctggc gtccgaaaca cttgaggtgc ggcccagcaa 3186901 ggggctacag gttttttcct tcacctacgg atgaatatcc atccaagacc cggacggctc 3186961 cgaagaaatc atgtcggggg tagcgagacg gcacaagccg ccgtctccgg cagcgaagga 3187021 gtgaacggca tgaaggtaaa gaacacaatt gcggcaacca gtttcgcggc ggccggcctg 3187081 gcggctctgg cggtggctgt ctcaccgccg gcggccgcag gcgatctggt gggcccgggc 3187141 tgcgcggaat acgcggcagc caatcccact gggccggcct cggtgcaggg aatgtcgcag 3187201 gacccggtcg cggtggcggc ctcgaacaat ccggagttga caacgctgac ggctgcactg 3187261 tcgggccagc tcaatccgca agtaaacctg gtggacaccc tcaacagcgg tcagtacacg 3187321 gtgttcgcac cgaccaacgc ggcatttagc aagctgccgg catccacgat cgacgagctc 3187381 aagaccaatt cgtcactgct gaccagcatc ctgacctacc acgtagtggc cggccaaacc 3187441 agcccggcca acgtcgtcgg cacccgtcag accctccagg gcgccagcgt gacggtgacc 3187501 ggtcagggta acagcctcaa ggtcggtaac gccgacgtcg tctgtggtgg ggtgtctacc 3187561 gccaacgcga cggtgtacat gattgacagc gtgctaatgc ctccggcgta atcgtccgcg 3187621 gaggccgccg acccgcccga gagcgactga gcatgtgcca gaatgttcgg gcagtgggag 3187681 ttcgacgtca gtccaaccgg aggaatcgcc gtggcaagta ccgaggtgga gcacttcgcc 3187741 ggctcgcaac atgaggtcga caccgccgag gttccatctg cagcgtgggg gtggagccgg 3187801 atcgatcacc gcacctggca catcgtcggc ctgtgcatct tcggcttcct gctggcgatg 3187861 ctgcggggca accacgtcgg ccacgtcgag gactggttcc tgatcacgtt tgccgcagtc 3187921 gtgctgttcg tcttggcgcg cgacttgtgg ggccgacgac gcggctggat cagatagcca 3187981 gcacaccgtt cggtgtgccc gacccggtca gcgccgcacc cgccgaaacc aggtaccggc 3188041 gaaggcaccg accaccagca caaccagcaa caccgcccaa ggccatgcac cgtgctggtt 3188101 aacccagcca gccagggcac cttgcaggcg gccggccgcg gcaatcaccg catcctgggg 3188161 attcgccccg acaccggcaa tcaggcgcag ctcgtagaga ccgtagtaac ccacgtacag 3188221 cccgaccacc accagcagcg cgccactgat ccggttgacg aacggcaaga ttcgccgtag 3188281 gcggtcggcc agcgccgagc tcgcggtcgc ggccgcgacg gcaagcacgc cgacaacgag 3188341 ggtcaggccc gcgacataag ccagatagat cgctacgctc ccgacgaccg aaccgccccg 3188401 caggcctgcc ccggtaaccg cgagaaacgg cccgatggtg catgacagcg aagcaaccgc 3188461 atagctgatg ccgtagccat acatggaacc cagccgtacc gttggagccc aacgcacgcc 3188521 gagggatcgg ggcgtcaacg ccgtcagccc tcgtcccaac agcagccacc cgccgagggc 3188581 gatgagcgcc agaccgatca gcaccgtggc atagggcagg tatcgctgca ccgccgtggc 3188641 cgcggaaatg gtcagggctc cgaagatgcc gaacaccgtc aagaagccca gcgccatccc 3188701 gaccgtggcg gctgccgctc ggcccactgc gctaagcggc cccgtccggc ccgccgaatc 3188761 ctgcccatac accaccaaca gcaggtaggc cggcaacatg gcaaacccgc atgggttcag 3188821 cgcagccacc aacccggcgg cgaacgccaa accgatcagc gcctcgttca ccgggtcagg 3188881 acgtcagcgc agccacccgg ccggacagct cgtcctgaga catggccgcg gtggggttgt 3188941 tgacgaacgt cgatgtgccg tccgcgcgat agaacacaaa tgccggttgc caaggcacgt 3189001 tgtagcgggc ccagatcaca ccatcggcgt cattgaggtt ggtgaaattc aggttgtact 3189061 tcgagacaaa gctctgcatc gccccgacgt cggcgcgggt ggcgattccg acgaaggtga 3189121 ccgccggatt agcggccgct acctggctga ggctgggggc ttctgcgttg cagaacgggc 3189181 accacggcgt ccagaaccac aacaccgccg gcttgccttg caggcttgcg ccatcgaagg 3189241 gagcaccgct gagcgtggtt gcggtgaact gcagacgttc atcggctgcc accgctcgcg 3189301 gtgtattggc cagaccgaac atcaggacaa ccgcgatagc aacggccaca atgccgtccg 3189361 caaacgcctt gatcggggac accaggcgaa gactcatgac agacctcact tgttcgtgtt 3189421 ttgacctaat gacgtaatac gctccgtgac ggttcagtac atcccggcgc cccctgcgct 3189481 cgcggccagc tgtccgcagc gctggctgat tcgcctgcgc tccagctacc cgcctacggc 3189541 ggccagctgt ccgcagcgct ggctgattcg cctgcgctct agctacccgc ctacggcggc 3189601 cagctgtccg caggcggcgc tgatctcccg cccacgggtg tctcgtaccg tgcaggaaac 3189661 tcctttcgcc cgaacccgtt tgacgaattc acgctcaacc ggcttggggc tggcatccca 3189721 atcactgccc ggagtcgggt tcagcgggat caggttcacg tgcgccaacg gcccgagaac 3189781 acgatgcagt cgctttccca gcaagtcggc ccgccacggt tggtcgttga catcacggat 3189841 cagcgcgtac tcaatagaca cccgtcgccc ggtcacattg gcgtagtacc gggccgcatc 3189901 gagcgcttcg ctgatcctcc accggttgtt gaccggaact agtgtatcgc gcaacccgtc 3189961 gtcgggggcg tgcagcgaca gcgccagggt cacgccgagc cgcgcgtcgg caaggttgcg 3190021 gatagcaggg gccagaccca ccgtcgacac cgtcaccgcg cgggccgaaa tcccgaaacc 3190081 ggacggcggc cgcgcggtaa tgcgctgaac tgcggccaac accctggcgt agttggccag 3190141 cggctccccc catacccatg aacaccacat tcgacaaccg atcgccgaag tcgtcgcgca 3190201 acgccgcggc gccggcacgc acctgctcga ggatctccgc cgtcgatagg ttgcgagtca 3190261 atccgccctg gccagtggca cagaacgggc aagccatgcc gcagccggcc tgcgaggaaa 3190321 tgcagaccgt gttgcgccgc ggatagcgca tcagcaccga ttcgaacatg gtaccgtcga 3190381 cggcccgcca caacgtcttt cgagtctggc cggcatcgca ggtgatgtcg gcggacgcgg 3190441 taagcaagtt cgggaacatc gctccggcga tccggtcgcg aacggccgcc ggaaggtcgg 3190501 tcatctgacg cggatcggcg atcagccgac cgtagtactg gtgtgcaagc tgcttggccc 3190561 gaaacgccgg cagccccagc tccgcgacgg cagacgctcg gcccgccgcg tcgagatcgg 3190621 ccaggtgccg cggcggccga cccggacgcg gctcatcgaa catcaactcg gggaccatga 3190681 cctgtccagt atcgccgttg tcagggcagc agtgtgagga ctatccaggc cgccaccgcg 3190741 gaaggcagta tgccgtcgag ccggtccatc agaccgccgt ggccgggtag caggcggccc 3190801 atgtctttga tgccgaggtc acgtttgacc tgcgactcca ccaggtcgcc cagcgcggtg 3190861 gtgagcacga aaagcacgcc gagcagtgca ccaatccacg gcgttttgcc gaccaggaaa 3190921 gtcgcggtga tgatcgttgc ggtgatcccg cacaccagcg aaccggcaaa gccctcccac 3190981 gacttcttcg ggctgatcgt cggaaccatc ggatgcttgc caaacagcac ccccacggcg 3191041 tagccgccga catcggaagc gatgaccgcg atcatcatgc agaacaccca tcccgagcca 3191101 ttttccgggt agaccagcat tgcgccgaaa gagcagaaca atgggaccca cacggccagg 3191161 aagaccgtgg ccgagacgtc ggacaagtag tttcccggcg acggtgcacc gccggtcgtc 3191221 gggcgcgtca cgctgtcctg catgaacagt cgccaaatca tgcagacaac gaccatgcca 3191281 ccaaagcccg ccaatgcgcc gaccgcgccg aacggccagg tcagccacac cgcggcctgc 3191341 ccgccaatca gcaacgggat aaccgggatg agatagcccg cttcccgcaa cctccgcacc 3191401 acctcatggg tagcgaccaa ggtggcgacg gccacgatgg caacccaaac gcgcggaacg 3191461 aacaccagca ccgcgatgag gactaggcct atggaaaggc ccaccacgat cgctgcgcgc 3191521 aaatcacggc cggcgcggga cgtttcggtc gccggctgct gtttagcacc acgcgccggc 3191581 tgctcggcgg ggtttccggt gccggcatcg ttggttgtca cggattttgt tgctgagcgg 3191641 ccgctagacc tccagcagct cgccttcttt gtgtttaacc agctcatcaa tttgggtgac 3191701 gtattggtgc gtggtcttgt cgagatcctt ttctgcgcga ccgacctcat cctcgccggc 3191761 ctcgccttcc ttacggatgc gatggagttc ctccatcgct ttgcgacgga tattacgcac 3191821 cgaaaccttg gcctcctccc ccttatgctt tgcctgtttg accagctctc gccgacgttc 3191881 ttcggtgagc tgcggtacgg ccacgcgaat aagggcgccg tcgttggtgg gattcactcc 3191941 aaggtcggag ttgcgaattg cagtctcgat agcgcgcaac tgattggctt catacggctt 3192001 tatcacgact agccgcgcct cggggacatt gatgctggcc agttgcgtga tcggggtggc 3192061 cgcaccgtag tagtcgatgg tgatccgaga gaacatgcca gggttggcgc ggccggtacg 3192121 gatagttgac aggtcgtcac gtgccaccgc cacagccttc tccattttct cttcggcgtc 3192181 gaagagagcc tcatcaatca tctgcgccgc tcctcctcat cgctgcgctc tgcatcgtcg 3192241 ccggcgccaa ccatctgcgc cgctcctcct catcgctgcg ctctgcatcg tcgccggcgc 3192301 caaccatctg cgccgctcct cctcatcgct gcgctctgca tcgtcgccgg cgcgaagcag 3192361 cgcgtagtcc ccttaggtgg tgaccagcgt tccgatcttc tcaccccgaa cagcacgggc 3192421 gatattgcca tcggtcagca ggttgaacac caggatcggc atgccattgt ccatgcaaag 3192481 gctgaacgcg gtggcgtcgg ctactcgcag cccgcggtcg aggacctcac gatgactgac 3192541 ggcggtgagc agttcggcct cggggttcac ccgcggatcc tcagcaaaca caccgtcgac 3192601 cgctttggcc atcaagacca cgtcggcacc gatctccagc gcacgctgcg ctgcggtggt 3192661 atccgtcgaa aagtacggca gccccatgcc ggcaccgaag atcaccaccc gtcccttctc 3192721 caggtggcgg acggcccgca acggcaggta cggttcggcc acctggccca tggtgatcgc 3192781 ggtctggact cgggtaacga tgccttcctt ctccaggaag tcttgcagtg caaggctgtt 3192841 catgacagtg ccgagcattc ccatatagtc cgacctggtg cgctccatac cgagctgctg 3192901 cagctgtgcg ccccggaaaa agttgccgcc gccgatcacg acggcgatct ggacgccgcc 3192961 gcgcaccaca tcggcgatct ggcgggccac ctgcgcgacg acatcgggat ccagcccgac 3193021 ctggcctccg ccgaacattt ccccgccgag cttgagcaac actcgcgagt acccggacag 3193081 ctgagccgcc gacgcggcgc cagtgctcgc aggctccggc ttcgaagccg gcgcgccggc 3193141 gacatcgggc tctgtcatct gactcctcgc acgacagtgc catcccggca ccaccaggac 3193201 ggcatctcac atcctgcctc aatagccgcg ctccggcgtg ggcggggtgc gttagtcacg 3193261 caacaacgag gggccggccg aggccaggcc cgtcgactat ctcaaggtgt gagcatcgct 3193321 cgagcaacaa agttggaata gttctgttct gaaccgggta cccaggggta ccggcagaca 3193381 tctccgcgag ggatgcctac gggccccacg acggggaagt ggcaccctca tgaagtttgg 3193441 agatatctct tggaagttct acttcttacc gatgaagccg atcttgaatc ggctctgccg 3193501 gagctggagt cgttcgcgca gtcggtgcag cgcgcaccgc tggacgaccc gggcgcggcc 3193561 aagggtgcgg acgccgatgt cgcgatcatt gacgcgcgcg ccgacttggc ggccgctcgc 3193621 cgggtgtgcc gccggctgac gactagcgca ccagcccttg ccgtggtggc tgttgttgcg 3193681 ccggccaact ttgtggcagt ggacggcgat tggatattcg atgacgtgct gttgaacgcg 3193741 gccggcgggg ccgagctgca ggcacggttg cggttggcga tcacacgtcg acggagcacg 3193801 ctagcgggca cactgcaatt cggggacctc gtccttcacc cagccagcta caccgcgtcg 3193861 ctgggcgacc gggacctggg gctgacgctc accgaattca aactcatgaa tttccttgtg 3193921 cagcatgccg gtcgggcgtt cacccggact cggctcatgc gtgaggtgtg gggctatgag 3193981 tgccatggtc gcattcgtac cgtcgatgtt cacgtacgac gactgcgcgc aaagctcgga 3194041 gccgagcacg aatcgatgat cgacaccgtt cgcggtgtgg gttatatggc ggtgacgcca 3194101 ccgcagccgc gctggatcat cagcgaatcg atactaaacc gttgcaagtg agtgatcttt 3194161 agtggtcact tgacttgcac cccgtctcgg ggttgttcgc cggccgggtg gccggttgcc 3194221 ttccgcgctt cacggccacc cgccgggcca ggcccggtct tacggtcggc tccacgcttg 3194281 acggcggccc caactgggcc gacgacgcta ggtggttcct cgtagcgtgc gaggttgatc 3194341 gcggcgttgt cgtcacgttg gtgcgtgatc gaacagccgt cgcattgcca tttttcgtcc 3194401 cagccgatgt cttgcacatg ccggcaggca tggcaggttt tcgacgatgg gaaccagcgg 3194461 tcggcgacca ccagactcga tccgtaccag cctgtcttgt aggacaggtg acggcgcggg 3194521 gttgccaggg ctgcatcaga cagtgcgcgc cgtctggcgc gcgcccccgg cagtcccttt 3194581 tgccgcagca ttcccgccgc atccagacct tcgacaacga tacggccgtg ggttttggcc 3194641 aatcgtgttg tcagcacgtg caggtggtgg gtacggacat cgttgacccg acggtgcagc 3194701 cgggacagtt cggtggtgcg ctcacagtag cggcgtgagc ctttcgtgca gcgtgagcgt 3194761 gcgcggctga cgcggcgcaa cccgcgcaac gcagcatcaa gcgggcgagg attcggcact 3194821 tgttcaagca ccgtgccctc agcgtctgca acagtggcca aacgccgcac accaacgtcg 3194881 acacccaccc gtgaatcagg aagcgccaca cgccgctgtt gggggcgttg gacgagcacc 3194941 cgcacgctcg catccaggcg ggtgccgttg cggcgcacgg tgatcgccag cacccgcgcc 3195001 cgacctttgg cgatgagccg ctcaacccgg cgggtgttct cgtacgtacg gatggtgccg 3195061 atcaccggca aggtgaggtg gcggcggtcg ggctccacac gcatcgcacc ggtcgtgaag 3195121 cacacgcgat cggcgtcgcg tcccttcttc ttaaaccggg gaacgcctac tgttttgcca 3195181 gcccgtttcc cggcacggca gctctgccag ttccaatacg catcgaccgc gcccgcgatg 3195241 ccatcggcat aggcctcttt cgagcattcc ggccaccaca cctgcccggt ctgcgcgttg 3195301 acacacacct ggtctttgac cgtgttccac cgtttgcgca acacccgcag cgacggcttc 3195361 gccgactcgg tgccatccgc gcgccacgcc ttgatgtcgg ctttaagcgc cgtgacggtc 3195421 cagttgaatg ccttacggcg agcaccaaaa tggcgcgcca agctggcagc ttgcgtctgg 3195481 gtcgggttca gcgtgaaccg aaacgcctgc acacaccacc cctcaggcac ctttaagcgc 3195541 gccatcacct agcctcgtgt cccccggcgc gtgccgccgc ggccacggca cgcgcagcac 3195601 ggttgccagc agcgcgtttg ccgtagagcc gcgcacatat cgacgtcaag atctcggtga 3195661 tatcgcccac aacgtcgtca tcgacatcgg ccgaatcgac cacaaccagc tcacggccgt 3195721 cagcggccag taccgcctgt acacactcaa agccgaaccg ccccaaccgg tcccgacgtt 3195781 tcatcacaat ccgcctcacc gtcggatcac ccagcagcgt aaggaacgtg cggcggcgcc 3195841 cgtacagcgc cgacccgact tcagtaacga ccttgccgac gggtatctgt tccgccgtgg 3195901 cccacgcggt cacgcctacc acttgccgat ccagatccac cttctgatca gccgacgaca 3195961 accgcgcaca caccgccgtc cgcccccacc gccccggctg cccggctggc tcgtcgacaa 3196021 gaatcactcg cccaaccctg cgggcgggaa ccggcaaccg cccgacacgc aaccagcgat 3196081 acgcaataac ccgcgccaca ccgttgccct cagcccacac caccagattc atacttccgt 3196141 tcctacaaca caccaccgac aaccaacgac cacccaaacg caacagctga cagccccttc 3196201 cgggcatcgg cagcaccggc cgaagactcc acagcgcgtt aatgcgccca ggtgtttgca 3196261 acggcggtgt cgaaggctgc cgagaacacg cccactgcgg caatgcgatg taggcttcac 3196321 gcccgtggct atggttcccg ctcaaacgac cggcggcact gcccacaagc gccgggagcg 3196381 cataggaacg atttaccgtt cggcccggca catgtgtcag tatccttgac atgggtctag 3196441 ccgatgacgc cccgctgggc tatctgctct accgggtggg agccgtactg cggccagagg 3196501 tttccgctgc gctcagtcca ctcggcctga cgctgcctga gttcgtctgc ctgagaatgc 3196561 tttcgcagtc accgggacta tccagcgccg aattggcccg gcacgcaagc gtcacaccgc 3196621 aggcgatgaa cacggtgttg cgcaagctgg aagatgccgg tgcggtggcc cggcccgcat 3196681 cggtgtcttc cgggcgttcg ctaccggcta cattgaccgc tcgaggccga gccctggcga 3196741 agcgcgccga ggccgtcgta cgcgccgccg atgcccgcgt cctggccagg ctgaccgcgc 3196801 ctcagcaacg cgagttcaaa cgaatgctgg agaagctcgg gtccgactag atccggacgc 3196861 gggctactcg gcgatatttg gggcgtggat ccgggcccag ggccgggcct cttcgagttc 3196921 gtaagccagc tccagcagca gcgcctcgcg cccggtatca gccgagagca tcatgcccac 3196981 gggcatgccg tccgcggatt gagccaacgg tagcgaaatc gccggcaccc ccgtgacgtt 3197041 ctgcactggc gtgaacacga cccagctgct cagccggtcg agcaccgtct gatagtcggt 3197101 aggcgcaagg tatccgacct gcggagtggc ctccgcgacc gttggcgtga gcaagacgtc 3197161 gtaggtaccg aagaaccgca cgctgcgccg ccgtagcatg cgcagacgca tgatcgccaa 3197221 cggcagccgg tgcaggttgc ggccggtatg gcgggccagc cccaaagtca gttcgtccag 3197281 ccgggtaggg tcgaacgtcc tgccgaatgt gcgccggccg ctgcgcactt gcgccagggc 3197341 caagaacccc caatagagca cgaaatcgtc cacgaaactg gccggtgccg gtgggtggtc 3197401 gacgtgttct acccggtgac ctagttcctc gagcagccct gccaacttca gcgtcagctg 3197461 ccgcacttcg gggctggcct cgcgcagaac cgagcgggtt actacggcaa tcctcagccg 3197521 ctgcttaacg gggcttgtga cgtccccgac cggcggcagc tggtggttac gccaaaggcg 3197581 ctcggcctcg cggtagaagg ctgcggtgtc gcgtaccgtg cgggtcagga cgccattggc 3197641 gacgatgccc accggcaacc tgcgatactc cggctccagc ggcaaccggc cgcgcgacgg 3197701 cttgagcccg accaacccgt tgcaggcggc cggaatacgg atcgagccgc cgccgtcgtt 3197761 ggcgtgcgcg atcggcacca cgccggctgc caccaaggcg cccgatcccg atgaggaggc 3197821 acccgctgtg tagtcggtat tccacggatt acggaccggt cccagccgag ggtgttcggc 3197881 cacggcgctg aagccgaatt ccgacaactg cgtcttgccc agggacacca gcccggtgcc 3197941 cagcaccacc cgggttatct cgctgtcggc gacggccgcg tatggttccc acgcgtcggt 3198001 gccatgcatc gacggctgtc cggcaacgtc gacgttgtcc ttgatgaagg tcggcactcc 3198061 actgaagaac gcttcctggc ccgtacccat cgcggccgcg tctcgcgcca cgtcgaaagc 3198121 cgcatacgcc aacgcgttca gtgccgggtt aacggcttcg gcgcgggcga tggcggcctc 3198181 gacgacgtct gcccgaccca ctcgacctga tcggatggcg tcggcgaggg cgaccgcgtc 3198241 gaggtcacca agggcatcgt caacgaaagc gtgtacgcgc gacatacccg gctaagcctg 3198301 gcccacctcg aagcggacga accgtgtcac cgtcacgccg gccacgtcga gcagggcctt 3198361 gacggtcttc ttattgtcgg acaccgacgc ctgctcaagc agcaccgcat ccttgaagaa 3198421 gccgttcagc cggccctcga caatcttggg cagcgcctgc tccggcttgc cctcggccct 3198481 tgccgtctcc tcggcgatgc ggcgttcgct ggccacgatg tcttcaggca cgtcgtcgcg 3198541 ggacaggtac cgcgcccgca gcgcggcgat ttgcaacgca acggcgtgcg cggcggccgc 3198601 gtcgtcgccg cggtactcga ccagtacacc caccgctggc ggcaggtcag cggaacgtcg 3198661 atgcaggtag gcttccacgg tcccgtcgaa aatcgccaca cgacgcagct cgagcttctc 3198721 gccgatcttg gccgacagct cggcgatcgc ctgctcgacg gtcttgtcgc cgatgctggc 3198781 acccttgagc gcgtcgacgt cggcgggctt agctgctgcc gccgccgcga ccacttggtc 3198841 ggccagcgtt tggaactccg cgttcttggc aacaaagtca gtctcgcagt tgagctcgat 3198901 cagcgcgccg tccttggccg ccaccaagcc ctcggccgta gcccgctcgg cacgcttgcc 3198961 gacatcctta gcgcccttga tccgcagcgc ctcgacggcc ttgtcgaagt ccccgtcggt 3199021 ttcggccagc gcgttcttac aggcgagcat gccggcgccg gtcagctccc tcagccgctt 3199081 gacgtcagcg gcagtgaagt tcgccatatc agcctttcct aggatgcatc tgtggttggt 3199141 tcggttgcgc ctgcgggggc gtcggtgagg gcagttgttg acgcggttgc tgatggcgtt 3199201 gccgaagctg tcgccgaggc cagcagctct tgctcccatt cggccagcgg ctcggcggct 3199261 tcggcctccg gcttgccgtc ggcgcgcccc agtccggcac gggcctgcag gccctcggcg 3199321 accgcggaag cgatcaccct agtcagcagc gcggccgagc ggatcgcgtc gtcgttgcct 3199381 gggattgggt agtcgacctc gtcggggtcg cagttcgtgt caaggatcgc gatgaccggg 3199441 atgcccagtt tgcgggcctc accgacggca atgtgctctt tgttcgtgtc gacgacccag 3199501 atcgccgacg gcaccttggc catgtcgcgg atgccgccga ggctgcgctc gagcttgttc 3199561 ttctcgcggg tcaatcccaa gatttccttc ttggtgcggc cctcgaagcc accggtctgc 3199621 tccatcgcct caagctcctt gaggcgttgc agccgcttat gcacggtgga gaagttggtg 3199681 agcatgcctc ccagccagcg ctggttcaca tacggcatgc cgacccgggt ggcttcggcg 3199741 gccaccgact cctgcgcctg cttctttgtg ccgacgaaga gcaccgaccc accgtgagcg 3199801 acggtctctt tcacgaactc gtacgcctta tcgatgaagg tcaacgtctg ctgcaggtcg 3199861 atgatgtaga tgccgttgcg gtcggtgaag atgaaacgct tcatcttggg attccagcga 3199921 cgggtctgat gcccgaagtg ggtgccgctg tcaagcagct gcttcatggt gactacggcc 3199981 atacctatgc cttactcatg tgtcggttgt tcgcccggca tcggctgaag ccgggccctg 3200041 gcgtctgccg cgatgccgga cccgggagga aatccccgaa gggaaccgcc gcgggaccgc 3200101 cccggcatgc tgttgcggat cccggaaagg cgggccgcgg tgcagacacg cgaagtcagc 3200161 ccgccgatgc gagctgcgcc gagtagttta caccgaccca gctggtgatt ttcccggcag 3200221 cggaatccac agcgacgaca ttgtccacaa aacgggcggc ggcgattggc caaatcgccc 3200281 gcgcggcgct gcactgcaaa ggtacggagg gttctgagcc gcagcgtact gatcctttgc 3200341 tggtcgctgc ttggtgcggc gccggcccat gccgacgact cccggctggg ctggccgctg 3200401 cggccgccgc cggcggtagt ccggcagttc gacgccgcat cgcccaattg gaatccgggg 3200461 caccgcggtg tcgacctggc cgggcgcccc ggtcagccgg tttacgcggc cggcagcgcg 3200521 acggtcgtat tcgccgggct gctcgcggga cggccggtgg tttcactggc ccacccgggt 3200581 gggctacgca ccagctacga gccggtagtc gcccaggtcc gggtcggtca gccggtgtcg 3200641 gcgcccaccg tgatcggcgc gctggcggcc gggcaccccg ggtgccaggc cgccgcctgt 3200701 ctgcactggg gggcgatgtg gggcccggct tcgggcgcca actatgtcga tccgctgggc 3200761 ctgctgaagt ccacaccgat acggctcaag ccgctatcca gcgaagggcg gacgctgcat 3200821 taccgccaag cggaacccgt atttgtgaac gaagccgccg ccggtgctct ggccggcgct 3200881 ggccatcgga aatccccgaa gcagggcgtt ttccgcggtg ccgcgcaggg cggtgacatc 3200941 gtcgcccggc aaccgccagg ccgctgggtt tgcccatcga gcgcgggcgg cccaatcggg 3201001 tggcaccgac aatgaaccag ccgagctccc cttccccaaa gcggccgata ccgatccgcc 3201061 aatgctttct cggtctagtg cccagtacca gtacggctgg ggcgtctgaa ccccgccaac 3201121 agcaccgccg cctgccacac ttgggctcgc ccgcgggccg gcgaagatgg ttggacccca 3201181 gctgtcaagc accgaggatc ccgagtcacc ggcgccgccc ggggtgcccg ggatcgcttg 3201241 ctgggcgagc gaagcctcga attgcagtag cccttcgtcg aagtagctga tgcccagcaa 3201301 ccgaacgatc gtcatcctgt tgtcaggcgt gagaccgaga agattctccg cgagccattg 3201361 ctgcaatgcc gagtaccacg ggatcagtga cgtcgacgag agttgctgca acagttgggg 3201421 caccgcggcg gtggtggcca gcggtgggac cgtcgacgat accgtcgccg ccgcctggcc 3201481 ggccagcgcg gcagggctgg tagtcactgg cgccgcggtg aacggggtca actccgtggc 3201541 gattgccgcg gagcccgcat aggcgtacat ggcggcagcg tcttgggccc acatctcggc 3201601 gtattgggcc tcggtggccg cgatcgccgg ggtgttctgc ccgaaaaagt tggtcgcgac 3201661 cagcgccacc aacagcgcgc ggttggcaac gaccaccggc gggggcaccg tcatcgcaaa 3201721 cgccagctca taggctgccg cggccgctct ggcctgcatg cccgcctgtt cagcctgacc 3201781 ggcggtggcg ctgagccacg ccacataagg cgtgaccgcg gccaccatcg acgccgctgc 3201841 gggccccgcc cagtacgcac cggtcagctc cgagatagcc aaccggtagc cgccggcggc 3201901 caagcccaat tcagccgcca aactatccca ggccgccgcg gcggccatca tgggccccga 3201961 tcccggacct gcgtacattc gaccggagtt gatctcgggc ggcaacaccc caaagtccaa 3202021 cgcccatccc tccctagccg gccgggatca cggcgtggtt acgcgcccca cccgaatagg 3202081 cagtggtacg tgatgcggtc acgaactggt cttgaatcgc cagcctcagg tcgctgatcg 3202141 ctgacagcgg ccccggtcgt cgaacaagcc agtccatcct gtgccctcat ccctgatagc 3202201 tggattttgg cggcttgaca tcggccgcac cagcgtttct gggtaagtgc ttacaaacga 3202261 gacgcatttg ctgtgaccgg agccgaatgt ttgattcccg gccagctacc gttcacctga 3202321 aggaagtcgg cgcgttaccc acagctcgat attcggggtc ctgccggccc gaaccgccac 3202381 cgcacaatcg atgccggctt cgcggctacc gtcgactcca tgaccgttgc cagcaccgct 3202441 caccatacac gtcggctacg tttcgggttg gcggcaccgt tgccccgcgc gggcacccag 3202501 atgcgcgcct tcgcgcaggc tgtcgaggcc gccgggttcg acgtgctggc cttcccggac 3202561 cacctggtgc cttcggtttc gccgttcgca ggcgcgaccg ccgcggcgat ggccacgcaa 3202621 cgactgcaca ccggcacatt ggtgctcaac aacgactttc gccatcccgt ggacaccgct 3202681 cgagaggcgg ccggtgtggc aaccctcgcc gaaggccgct tcgaactggg actgggcgcc 3202741 ggacaccgga ggtccgaata cgacgccgcc ggcattacct tcgattccgg ggcaacacgg 3202801 gtggcgcggc tcatcgaatc ggcgcacctg atccgtgcgc tgctggacgc ggagcccgtc 3202861 gacttcgacg ggcagcatta ccgggtgcac gccgaagcgg gctcactggt ggcaccgccg 3202921 aaggtccggg tccccctgct agtgggcggc aacgggaccg aggtgctgcg gctgggcgga 3202981 cgcatcgccg acattgtcgg cctggccggg atcagccaca accgcgacgc cacccaggtc 3203041 cggttcaccc acttcgacgc cgacggcctg gccgaccgga tcgccgtggt acgtcacgcg 3203101 gccggcgatc gcttcgaagc cattgagctc aacgcgctga tccaggcggt ggtctgcacc 3203161 aacgaccgaa acgcggcggc cgccgaactg gccgccacct tgggcgggat cacgcccgag 3203221 caggtcctcg agtcgccgtt tctgctgctc ggtacccacg agcagatggc cgaggctctc 3203281 gccgcgcggc agcggcggtt cggtgtcagc tattggacgg tgttcgacga gtgggctggc 3203341 cgcgcgtcgg caatgcgcga catcgccgag gtcatcgcgc tcctgcgcta cggctaggcc 3203401 cgcggatggg cccgctcgtg caccgcccgc aaccgggcga ccgcgacgtg ggtgtacagc 3203461 tgcgtggtcg ccaggctgga atgaccgagc agctcctgga ccacccgcag gtcggcgcca 3203521 ccttccagca ggtgggtcgc cgcgctgtgc cgcagcccgt gcggccccat atcgggtgcg 3203581 ccgtccaccg cggccacggt ctggtgcacc gcagtgcgtg cttgccgcac gtcaaggcgc 3203641 cggccccggg cacccagcag cagcgcgtgc ccggactccg cggtgaccag cgcgcgacgg 3203701 ccgtcgacca gccaggcgtg cagcgcatcg gcggctggct gcccgaacgg gacggtgcgc 3203761 tgcttgttgc ccttgccgag cacccgaacc aaccgatggc cggtgtcgat gtcgtcgacg 3203821 tccaggccgc acagctcgct gacccggata ccggtggcgt acaacagctc gacgatcaac 3203881 cggtcccgca gcgctagcgg atcaccttgc tctgcaccag attcggcagc cgccatggcg 3203941 cgcagcgcct gatcctgacg cagcaccgcc ggcaaggtgc gacgggcctt cggcacctgt 3204001 agccgggccg caggatcacc ggccagtagc ccgcgccgca ccgcccaggc ggtgaatgcc 3204061 ttaaccgccg aagtgcgccg cgccagcgtc gtgcgggcgg cgcccgctcc cgccgtcgcg 3204121 gccagccaag accgcaggac cgaaagggtt agtgcgtcca gactcgatcc gcgatcggcg 3204181 agaaacgcga agagcgatct tagatcgccc aggtaggcac gacgggtgtg caccgaccga 3204241 ccgcattgca gggcaaggta ttcgtcgaac tcgtcaagga tcgcctgcac tcccccacag 3204301 tcgcaggcat gacgtctcga gcccgagtcg acgcgccgca ccgtgtccgg ggtgagatct 3204361 ttcggcctgg gatagccgac ctagtgagtc ccggcctccg cctcagccag ttctttcttc 3204421 catttgcgga acatctcctc ggtgcgtccg cgccgccagt aaccggagat cgacgacgcc 3204481 catttggcat ccacaccgcg ctcgttgcga acgtatggcc gcaagttatg catgacggct 3204541 tgcgcctcac cgtgaataaa gacgtggacc tgtcccggca gccacgcggt ggtggtgacc 3204601 gcctcgatca gcggcgcgtg atcaccggcg cggtcctcgg gaaccagatc ggcgcgcccg 3204661 ccgcgataga cccagttcac ctcgacggca tccggcgcgg tcaggccgat ctcgtcgtcc 3204721 gggccggcaa cttcgatgaa tgccctaccg attgcgtcgg ggggcaacgc ttccagcgcg 3204781 gcggcgatgg cggggatcgc cgattcgtca cccgccagca aatgccagtc ggcggctggg 3204841 tcgggggcgt acgcgccgcc ggggcccatc aggtagatcg gttgcccacg ctgggcccca 3204901 gccgcccacg gaccggctac cccgtgctca ccgtgcagca cgatgtccac ggcgatctcg 3204961 cgggccgcgg cgtcgacatg acgaacggtc atggtgcgca ccggcggccg cttcgcggtg 3205021 ggcaggtcgg cgaagctgtc cagggtcagc ggccggggca accgcccgac atcgacatcg 3205081 tcgtcgacga acaccagctt gatgtaagag tcggtgaagt cgctggggac gaatgtgtcg 3205141 aagccgctgc cgccgagcac tacccggacc atgtgcggcg cgaggtgtcg ggtagcgaca 3205201 acctcaaagg cgtgcaatgg tcgacccgcc acatgtcctc ctgtccagac ccgacccgcg 3205261 tcgactatac gagccgggcc gctgcaccct tggccgcggc ctgaccggca ccggcgcgca 3205321 atattcgcca ccgcccgtcg cgacactcgg ccaacccggc gacctcgagg attgccagcg 3205381 gacctagcac ctgcgcgggc agcagcccgg agccgacagc gatctcatca atggtagcgg 3205441 cgccgcggcc cggcagggcc tcgtacactt ggcgttcggc ttcgcttagc acgtcgagcg 3205501 ctgcgccggg ccgcggttca tcaccggcca actcaccgat gtgaccgacg aactcgacga 3205561 tatcgtcggc ccgggtgacc aactccgcgc catggcgaag cagcgtatga cagcccgccg 3205621 atgccgagga tgtcaccggg ccgggcaccg ctgccaccac ccggcccaat gcccgcgccc 3205681 aggcagcggt gttggcggcg ccgctgcgca ggcccgcttc caccactacc gccgccctcg 3205741 cgaccgcggc caccaaccgg ttgcgggtta ggaaccggtg ccgggccgga cggacaccgg 3205801 gcgggtattc ggtgaacagc accccatgtt gggcaatgcg atgtagcaac gccgaatggc 3205861 ccgccggata cgggatgtca aatccgccgg ccagtacggc cacggtgatg ccctcggaat 3205921 ccagcgccgc gcggtgagcc gcaccgtcga tcccgtaggc gccaccggag acgaccgaga 3205981 cgtcgcgctc tgccaacccg gcggccagat cggccgcgac atgctcgccg taggccgtcg 3206041 cagcccgggt tccaacgacg gcggccgcac gtggtgccac ttcgtccagg cgcgcggggc 3206101 ccagggccca caacaccagc ggcgagtggc cgcacggcct tgcccgggct ccggcgccac 3206161 tgaaagcggc gaacgccagc accggccact cgtcgtcgtc gggagtgatc agacgcccac 3206221 cgcggcgcat gagtagctcg agatcgtctg cggcccggtc tatttcgcgt cgggcaccgg 3206281 tgtgctgcgc cagctcgtta ccgacctgcc cgcggcgcac ccggtcggcg gcctccacgg 3206341 ggcccacaca tcgcaccagc gcggccagct gggcgcacgg cggttcggcc acccgggaca 3206401 gataggccca cgcccgcgcc gtcggatcga tcatcgtcgt gctccggttt gccggaagct 3206461 cagggcggcg gcgacctcgt cgatgcctgg cgatgtgcga ccggccaagt cggccaaact 3206521 ccaggccacc cgcaaggtgc gatccacacc gcggatgctg agtagcccgc ggtccagcgc 3206581 ggtgcgcaac gggagcatcg cggcgctgct gggccgaaac ttgcggcgca acagcggccc 3206641 gctgacttcg gcgttggtcc ggaacccatg tggccgccat cgttgcgcgg ccgcctcccg 3206701 ggccagcgcc acccgctggc gaacctgcga cgtcgactcg ccgtccgcgg ccgagaacgc 3206761 cccggcccga agccgatgca tctgcacccg taggtccacc cgatccagca acggcccaga 3206821 cagtttgccc agataccgtc gtttggtagc cgccgcacag atgcaatcct gtggatcggc 3206881 gggcgcgcac gggcacgggt tggcggctag cacgagctga aaccgtgccg ggtagcacgc 3206941 caccccgtca cggcgcgcta ggcggatttc accgtcctcc aacggtgttc gcaatgcttc 3207001 cagcgcgcta aggctgatct cggcgcactc gtccaggaac aacaccccgc gatgcgccct 3207061 gctgaccgcc cctgggcgag ccatccccga tcccccgccg acaagcgccg caacgctgga 3207121 actgtggtgc ggcgccacga acggcggccg ggtaatcaac ggtgtgtccc ccgacagcag 3207181 gccagccacc gagtggatcg cggtcacctc caacgactcg ctgcccgaca gcgacggcaa 3207241 cagccccgga agacgttgcg ccagcattgt tttgccgaca cccggtggac cagtcagcat 3207301 gaggtgatgc gccccggcgg cggccacctc gacggcgaac cgtgcttggg actggcccac 3207361 cacatcggcg aggtccgccg cagactcggg ggtggtgtcg gccgtggtga tccgcccggc 3207421 caagccggtg gacccgcgta gccagctctg caactgcccc agcgtgcgaa caccccggac 3207481 gtcgattccg tccaccaggc tggcctcggg caggttgtcg gccggaacga cgacggccgg 3207541 ccaaccgtca cgtttggctg ccagcacggc gggcaacacc ccacgcaccg gacgcacccg 3207601 tccgtccagc gacaattcac ccagcagcag cgtgttctcc agacgttccc acggcttctt 3207661 ttgttgcgcc gacaacaccg ccgcggccag ggcgatgtcg tagaccgagc ccattttcgg 3207721 cagcgtcgcc ggcgacagcg cgagcgtgag cctggccatc ggccagctgt ttccgcaatt 3207781 ggtgaccgcc gcgcggaccc ggtcgcggga ctcctgcaat gcagcatcgg gcagacccac 3207841 cagatgcaca cccggcaacc ctgaggtgat gtcggcttcg atttccacga tctcgccgtc 3207901 cagcccccgc accgcgaccg agaacgcacg ccccagcgcc atcagccgat cccctgcagg 3207961 tgggtgagct ctggggtgcg gcctgaattc ttggggccga ctcgcacgcc gatcacatcg 3208021 atgcgcaccg cagcccagcg ctcttcctgg tcggccagcc acagcccggc caggcgacgc 3208081 aggcggcgaa ccttgcgctc ggtcaccgcg tgcgcgagcc ccccataacc gtcgccggtg 3208141 cgggtcttga cctcgacgaa caccaccgtg cgggtggcag cgtcgcaggc gatcacgtcc 3208201 agctcgccgt agcggcaacg ccagttgcgg ttcaagatcc gcaaccccat gctggtcagg 3208261 tagtccaccg ctagggcctc gcccatcgct cccagctgaa cccgagtcat cgtcttcagg 3208321 gttgtcatgc ggccaacctg cacgctggcc ccgacatcac ctgccacgaa tcgcgtctca 3208381 ccgatgccgc gacgaccagt tatccccagt cgcggccctg tccacagccc cagtactgcg 3208441 cgggatcacg acaccgcgtc cttgtcatca tcgtctccgt cacatagcaa cttctcgggt 3208501 cccggctact cgcaacgcac cgcaggcggc acacgccgat ccagcaacat catgttcggc 3208561 gccggaagag tcccgttagg tgattcggtc cgctctggtg tagacgttca tcgagtcccc 3208621 ccgcaggaaa gccaccagcg tgatcccgga cgcgtcggcc aacgaaaccg ccagcgacga 3208681 cggcgcggat accgcggcca gcaccggaat cccagccatc agcgcctttt gggtcaactc 3208741 gaacgacgcc cgcccgctga ccaacaacac cgaggcgcca agcggtattc ggtcacgctc 3208801 gaaagcccag ccgatgacct tgtcgaccgc attgtgccgg ccgatatcct cacgcacggc 3208861 aagcatggcg ccgtccaccc cgaatagtgc cgcagcgtgc agcccaccgg ttctcgcgaa 3208921 aaccttttgc gcgcgccgaa gttggtccgg catcgccttg agagtgtcgg cggcgacggt 3208981 agcgggatcg ccgcccggtg cgaatcggct gacctggctc accgcctgaa gcgacgcctt 3209041 accacagact ccgcacgacg aggtggtgta gaaggtgcgg gtgacatcga catcgggcgg 3209101 cttgacgccg ggcgccagag ccacatccaa aacgttgtac gtgctggccc ctgtggcatt 3209161 gccctcgacg cgcctgccac agtagctaac ggtcagcacg tcttcgcggt gcgcaaccac 3209221 cccttcggca agcagaaagc cttgcaccag ttcgaaatcc gatcctggcg tgcgcatggt 3209281 cacggtaacc ggcgtcccat tgacgcggat ctccagcggc tcctcgacgg ccaaggtttc 3209341 cggccgggtg atcacctgat cggcgctgag atgcctgacc cgccgatgcg ccgttgcgta 3209401 ccccactagg ccgttggctc caatcgcacg atgatcgcct tcgacaccgg ggtgttcgat 3209461 tgggccgcgg tatggtcgag cggaaccagc ggattggtct ccgggtagta ggccgcagca 3209521 ttgccgaccg gcgtcgaata tgccaccacc agaaagtctt ttgcccgccg ttcttgcaga 3209581 ccgccttggc cgtcggtcca ctccgacacc aggtcgacac ggtcacccgc cgtcaaaccg 3209641 aacgtttcga tgtcggccgg gttgatgaac accacccggc gtccgccctt cacgccgcga 3209701 tatcggtcgt cgagcccgta gatcgtggtg ttgtactggt catggctgcg tagggtctgt 3209761 agcaccagcc ggccgggcgg caccggcacc cactgcaacg gattgaccgc gaagttagct 3209821 ttgcctgtgc tggtacggaa ttcgcgcgca tcgcgcggcg ggtgcggcaa ttggaatccg 3209881 tcgggcacac gcaccttgtg gttgtagtcg tcacagccgg gcaccaccgc ggcgatggcg 3209941 tcacggatgg tgtcgtagtc atctgcgaac cgttcccatg gcaccggatg tccggggccg 3210001 aacaaggcgc gggccagctg gcagatgatc tgcacctcgc tgcgcacctg atcgctgggc 3210061 gggtgcaggc taccacgcga cagatgcacc atcgacatcg aatcctcaac cgacaccaat 3210121 tgtttgcgac cattgcgggt atcgcgatcg gtccgaccca gcgtcggcag gatcagcgcg 3210181 gtggcgccgt ggacaaggtg gctgcggttg agcttggtcg agacttgcac agtcagcgcg 3210241 cacctgcgca aggccgcctc ggtgacggcg gtgtcggggg tggccgacgc gaagtttccg 3210301 cccatgccca tgaagacgct gacccgaccg tcgcgcatgg cccggattgc ggccacggtg 3210361 tcaaagccgt gcgctcgggg gctggtaatg ccgaactcac gatccagcgc cgccaggaac 3210421 tgctcgggca tcttctccca gatccccatc gtgcggtccc cttgtacgtt ggaatgcccg 3210481 cgcaccgggc acacccccgc gccgggtttg ccgatcatgc cccgcagcag cagcacgttg 3210541 gtgacctcac cgatggtggc cacggcgtgg gcgtgttggg tcaagcccat agcccagcag 3210601 atgaccgtgc gctgcgacgc catcaacatc gcggcgaccc gctgaagttg cgcgagttcg 3210661 atgccggtgg cgtccatcac ggtgtccaag ccgacctgca gagtccggcg gcggtacccg 3210721 tcgaatccgg cacaatggtt gtcgacgaac gaccggtcga caacgctgcc ggggaccctc 3210781 tcctcggcct ccaacaacaa cctgcctaac ccggcgaaca atgccatgtc cccgccgagg 3210841 cggatctgca cgaactcgtc ggcgatcggg ataccatgtc ccacaacccc gttcaccttc 3210901 tgcggatctt tgaaccgaat caacccggcc tcgggcagcg ggttcacggc gatgatcttg 3210961 gcgccgttgg ccttcgcttt ccccagcacc gacagcatgc ggggatgatt ggtaccgggg 3211021 ttttgtccgg cgatcacgat caggtcggcg tgctcgacgt caccgatggt caccgagcct 3211081 tttccgattc cgatcgagtc ggtcagcgcc gcacccgagg actcgtggca catgttggag 3211141 cagtcgggca ggttgttggt gccgaaagag cgcacgagca gctggtaaca gaacgccgct 3211201 tcgttgctgg tgcgccccga tgtgtagaac acggcccggt cgggactgtc caacccgttg 3211261 agctgctcgg cgatcagctg ataagcggca tcccagctga tgggccggta gtggtcatca 3211321 ccggggcgca agaccatcgg gtgggcgagc cggccttgct gggacagcca atattcgggc 3211381 ttcgcggaca gctccgccac cgagtgccga gcgaagaact ccgcagtgac ggtacgcttg 3211441 gtggcctctt cggcgactgc cttggcgccg ttctcgcaga actcggccag cttgcgtccg 3211501 ccgggctcct ccggccacgc gcagcctggg cagtcgaagc cgttacgctg attcaaccga 3211561 gccagcgccg ccgcggtgcg cagcgcgccc atctgctgca tcccccgctg cagcgatacc 3211621 atcaccgccc gcacgcccgc ggcctcgcgt ttgcgcggcg ccaccgttac cgcctgctcg 3211681 tcatagtcgg cgaggacgtc gcgagacgcc gccgaccgct gccacctcac cgcctcaacg 3211741 tacatccacg accgaccgac tgccgcacac agccgattga cgtgtgacgg cgcttggggc 3211801 agctattccg gcaggcgcag ctcgggtttt tcgacttcct cgatgttgac gtccttgaac 3211861 gtgaccaccc gcacctgttt gacgaaccgt gccggccggt acatgtccca cacccaggcg 3211921 tcagccagcc gcagctcgaa gtacacctca ccgtcggtat tccgcggcac catctccaca 3211981 ctgtttgcca aatagaaacg tcgctcggtt tctacgacgt agctgaactg gccgacgatg 3212041 tccttgtatt cgcgatacag cgagagctcc atctcggttt catacttttc gagatcctct 3212101 gcactcatct gctcagacgt ccttctccct gccggttccc cggcttcccc gctcagtgcc 3212161 cctaagtgcc ctgagcgcga cccgtggccc gcattgtcgc tgggtgggaa ctcttgctcc 3212221 atcttccctc acccgtctgt gccgtcccgt cccgagggtc gggttggccg tcggcgacct 3212281 ctgcggtgtt cgacccactc gccacccggc gaacattgat gaacgagtaa cggtgctgcg 3212341 ggcagggtcc caatcgggcc agcgcccggc tgtgcgccgg ggtgctgtaa cccttgtgct 3212401 ccgcgaaacc gtacccgggg tgatcggcgt ccaacgcaac catcacgcgg tcccggctga 3212461 ccttggcgag cacgctagcc gcggcgatgc aggcggctgc cgcgtcgcca ccgatcaccg 3212521 gcaacgacgg catcggcagt cctggcacgc gaaagccgtc gctgagcaca taaccgggcc 3212581 gcaccgccag accggccacc gcgcgccgca taccttcgat attggccacg tgcacgccgc 3212641 ggcggtcgac ctcggccgac gggatgaaca ccacgtgata ggccaccgca taccggcaga 3212701 tcagcgggaa cagcttctcc cgcgcttgct cgctgagctt cttcgaatca tcaagggcgg 3212761 caagacttgc tatccgcccg gggccaagca cgcaggccgc gaccaccaac gggccagcgc 3212821 aggcgccgcg acccacttcg tcgaccccgg ccaccggccc cagaccacca cgatgcagcg 3212881 cggactccag ggtgcgcatt ccccgcaaac ccccagattt acggatcacc gtccgcggtg 3212941 gccaggtctt ggtcatattc cagccatggc taccgacctt gctggggatt caccgaacgc 3213001 acaacacccc aacgcgacgg cggccacacg atcaacctgg ccttaccgat gacgttggcc 3213061 accggcacgg tccccggtag cggatcgtca gtacatagca acgggcagtg agcgcgggaa 3213121 tccgccgaat gggtgcggtt gtcgcccatc acccagacac gcccgggcgg gacggtgacc 3213181 ggcccgaact cgctgcccag gcacgggtat atcgacgggt cggccatcat ggtggccgga 3213241 tccaggtatg gctccttcag tggcctgccg ttgaccgtca ggccggtgtc ggaccggcat 3213301 tgaaccgtct gtccgccgac cgcgatgaca cgcttgacca ggtcgttctc gtcgggaggc 3213361 acgaaaccga tgaacgacaa cgcgttctgc acccagcgca cggcgacgtt gtgcgaacgg 3213421 atcgacttgt aaccaacgtt ccacgacggc ggtcccctga agacgatgac gtcgccaggt 3213481 tgcggtgagc cgaagcggta gctgagtttg tccaccatga tgcggtcgcc gacgcacgtc 3213541 gaacacccgt gcaacgtggg ttccatcgat tccgacggaa tcagataagg gcgcgcgaca 3213601 aacgtcagca tgacgtagta gagcaccaca gcaatcaccg ccagcaccgc gaactcccgc 3213661 agcgttgatc gcttcgcggg ccgcggctcg tccgttttgg ccgccttgga gtcgccttcg 3213721 gagtccgcat ccggggctgc gtcgaacggg gctgcgtcga agacctggcc ggcaatgtcc 3213781 gggtcccggg aggagagctc cggctctgcc ggacccggct ggcgctccga tggggagtcc 3213841 gtggtttcgg tcacgagatc agcgtagcca gcgcaggtgg cggctttcga acatcgccga 3213901 gacgttcccg gtcagcgctt ctccttgatc ttggccttct ttccgcgcag ttcgcgcagg 3213961 tagtacagct tggcgcggcg aacatcgcca cgggtcacca cctcgatatg gtcgatgttc 3214021 ggcgagtgca cggggaaggt ccgttcgacg ccgacgccgt agctctcctt gcgcaccgtg 3214081 aacgtctcgc ggatgccccc gccctgccgg cggatcacca cgcccttgaa cacctggaga 3214141 cgttccttgg cgccctcgat caccttgaca tgcacgttga tggtgtcgcc cgggttgaac 3214201 gccgggatgt cgtcgcgcaa cgacggcttg tcgacgaagt ccagccggtt cattggaaat 3214261 gaccatcctt ggggtcgcgg cgtggttacc ccccacacgc agcgtgcggt ggtcaccaag 3214321 ccggtgggtt cgggcttatt ggtgcatctc gcagcaggcg gcacgcaacc cggccgacca 3214381 ccgcgacaga caactgctca attgtgccag acggtacgca tgcagtgaaa tcacaggaaa 3214441 tctccggtgg ttcgcggccg tcgaaaagcg cccgcaaatg gtacacatga cttacatatg 3214501 actagggtca aaccgcgcgt gtggaaaccc gaagcttggc gtgacaccca acagagggca 3214561 cttaagaggg caatgcggcc gcctacctgc acgttttcgc gatgtcagag gatgccgagg 3214621 gagaacaatg cgagcacggc cgctgacgtt gctcaccgct ttggcggcgg tgacattggt 3214681 ggtggttgcg ggctgcgagg cccgagtcga ggccgaagca tatagcgcgg ccgaccgcat 3214741 ttcgtctcga ccgcaagcgc gacctcagcc gcagccggtg gagctactgc tgcgcgccat 3214801 cacgccgcct agggctccgg cggcgtcgcc gaacgtcggg tttggcgaac tgcctacccg 3214861 ggtccggcag gcaaccgatg aggccgccgc catgggcgcc accctctcgg tggcggtgct 3214921 cgatcgcgct actggccagc tggtctccaa cggcaacacg cagattatcg ctaccgcgtc 3214981 ggtggccaag ctgttcatcg ccgacgatct gctgctggcc gaggccgagg gcaaagtcac 3215041 attgtcccca gaggaccatc atgcgttgga cgtcatgctg cagtcatccg acgatggtgc 3215101 ggccgagcga ttctggagtc aggacggcgg caatgccgtc gtcactcaag tcgcgcgccg 3215161 atatgggctc aggtcgaccg cgcctcccag cgacgggcgc tggtggaaca caatcagctc 3215221 cgcgccagac ctgatccgct actacgacat gctgctcgac gggtccggcg gcctaccact 3215281 ggatcgggcc gccgtcatca tcgccgacct ggcccagtcc acaccgaccg ggatcgacgg 3215341 ctacccgcag cggttcggca tccccgacgg tttgtacgcc gaaccggtcg cagtcaaaca 3215401 gggctggatg tgctgtatcg gcagcagctg gatgcatctg tccaccgggg tgatcggccc 3215461 ggaacgccgc tacatcatgg tgatcgagtc actgcagccc gccgacgacg ccaccgctcg 3215521 agcaaccatc acgcaagccg tcagaacgat gtttcccaac ggccggatct gacgctcgtc 3215581 cggtcgcctc accggcgcga gcagacgcaa aagccaccgc acgttcggcg tgtcggggga 3215641 tttcgcgtct gctcgccagc ggggctagtc ggggtgggac aggtcggggc gtcgttcgcg 3215701 ggtgcgctgc agcgagacct ctctgcgcca ggcggcaatt cgggcatggt cgccggagag 3215761 taggacctcg ggtacatcga ggccacgcca gctcgccggc cgggtgtagc tcggaccctc 3215821 aaggagcccg tccaggcccg ttgagtgcga atcatcttgg tgggaagcgg gattgccgag 3215881 aacaccggcc aacagtcgca gcacggcttc gaccatcacc acggccgccg actccccgcc 3215941 gggcaatacg tagtcgccga tcgagacttc ttcgacgcgc attcgccggg cggcatcctg 3216001 cacgacccgc tggtcgatgc cttcgtagcg gccgcaggcg aacaccagat ggctctcggt 3216061 ggtccagcgc tgggcggtgg cctgggtaaa caacacaccg gcgggcgtgg gaacaatcaa 3216121 caacgtttcg ctggaacaaa tttcgtcaag cgcttcaccc cacaccggcg ccttcatcac 3216181 cattcccggg ccgccgccgt agggtgcgtc gtccaccgag tgatgcacat cgtgggtcca 3216241 gcgccgcagg tcgtgcacgt taaggtcgac caggcccgat tcgatcgcct tgcccggcaa 3216301 cgactgtcgc aacgggtcca ggcaggcggg gaagatcgtc acgatatcga tgcgcacgcc 3216361 ttactccaga ttcagcaagc catggggcgg atcaatctca acgatgccgt cgtccaatga 3216421 caccgacgtg acgatggcac gcacaaacgg caccaaaacc tcatcggaat cacgcttgac 3216481 cgccagcaac tcaccagcgg cggtgtgcac cacttcggtg acgacaccaa caccctcccc 3216541 cgtcgccgtc tggaccataa gccccaccag ctggtgatcg taataggtgt ccggctcgtc 3216601 gatcgggggc aagtcatcgg cgtcgatcac gaacaagctg ccgcgcaacg catcggctgc 3216661 gtctcgatcg gccactccag cgagtcgcac caacaggcgg ccgccgtgct gccgcacact 3216721 ttcgatgacg taactcaccg cactgccctc ggcaccaccg tcaaaaggcc ccttagcgcg 3216781 caacctggta cccggcgcaa accggtcagc tgggtcgtcg gtgcggatct cgacgacgac 3216841 ctcgccggtg acaccgtgcg acttcaccac ccgcccgact accagctcca tgagcggggc 3216901 tccgctactg gtcggtgtcc accacgtcga cgcggatacc gcggccaccg ataccggcta 3216961 ccagagtgcg caatgcggta gcggtgcgtc ccccacgacc gatcaccttg cccaggtcgt 3217021 ctggatgaac gtggacttcg acggtgcgcc cccgccgact ggttatcagg tctacccgga 3217081 catcgtcagg attgtcgacg atcccacgga ccagatgctc aacagcgtca acgacgacgg 3217141 cgctcatttc cccgtcagct ttccgccgtc agctcggcct gctcgccacc cagcgccggc 3217201 gtgtccggct gctcaggctg cggcgctggc tcagcagcct tggcagcctt tttggccggc 3217261 gacttcttct tcggtttggt ggcctcggtg gtaggaccac cgtcggcggc ggccaacgcg 3217321 gcgttgaaca cctcgagctt gctgggcttg ggtgcggcga ccttcaaccg gccctgagcg 3217381 ccaggtaggc ccttaaactt ctgccaatcc ccggtgatct tcagcagctt gaggacgggc 3217441 tcggtgggct gagcacccac cgagagccag tactgggcac gctcggagtt gatctcgatg 3217501 agactcggct cttctttggg gtggtaccgg ccgattacct cgatcgctcg gccgtcgcgg 3217561 cgggtgcgcg catcggcgac ggcgacgcgg tactgaggat tgcggatctt gccaagccga 3217621 gtgagcttga tcttcacagc catgattgag cgctcctatt ggtgtcacgc tgcaattcag 3217681 cgacccgggc gggatgcccg gatccggttt tgcctcgcgt gtatgaccac cgggcggcaa 3217741 ccccgaacag gacaagtcgt cgcgcggtgg acagccgcca attgtgccag aacgtgatgc 3217801 tggggcagta attcgcccag cgggcttcac atcattttct ggaagcactt ggtttcgacc 3217861 cgcctgatga tccgccagcc atcgggggtg cgcacgaaat cgtcgtcgta ccacagtcca 3217921 cagaacagca cttgctgccg gtcgccggcg aacaccatcg ggttgaagca gatcacccgc 3217981 gacgacgcgg tatcgccgtc gacacggacc gagaagttgc ccaacatgtg cgcatatacc 3218041 gggaagtttc ccagcacctg cgacagccat tgcttgatct tcggatacct gccgtcgatg 3218101 ccacctagcg cgcgatagtc gatataggcg tcgggggtga acacccggtc aagatcgtcg 3218161 aatcggcgct ggtcaatcgc gctggagtag tccaccagca actgctggat ttccaaccgg 3218221 tcggaaattt cggccacgct caacatgctc cgatccaaca ccgcacacat cggccggaca 3218281 gcccccgacc agcccgagaa taggcctacc ggagccctgg aagttaaact ctgcgcccat 3218341 gcgaaagctc atgaccgcga ccgccgcgct ctgtgcctgc gcagtcaccg tcagtgcggg 3218401 tgccgcgtgg gccgatgccg acgtgcagcc ggccggctcc gtgccgatcc ccgatggccc 3218461 ggctcagacc tggatcgtgg ccgacctcga tagcggtcag gtgctagccg gccgcgacca 3218521 aaacgtggcc catccgcccg cgagcaccat caaggtgctg ttggcgctgg tggcactcga 3218581 cgagctggac ctgaactcca cggtcgtcgc cgacgtcgcc gacacacagg ccgagtgcaa 3218641 ctgcgtcggc gtcaaaccgg ggcgcagcta caccgcgcgc cagctgctcg acggcctgtt 3218701 gctggtgtcg ggcaacgacg ccgccaacac gttggcgcac atgctgggtg gccaagacgt 3218761 caccgtggcc aagatgaacg ccaaagccgc caccctaggt gcgacgtcca cccacgcgac 3218821 gacgccgtcc ggcctagacg gacccggcgg ctccggggcg tccaccgcgc acgacctggt 3218881 ggtcatcttc cgggccgcga tggccaatcc ggtgttcgcg cagatcaccg ccgagccctc 3218941 ggcgatgttc cccagcgata acggcgaaca gctgatcgtc aaccaggacg agctgctgca 3219001 gcggtacccg ggcgcgatcg gcggcaagac gggctacacc aacgccgctc gcaagacgtt 3219061 cgtgggtgcc gccgcccgcg gcggccgccg cctggtgatc gccatgatgt acgggctggt 3219121 caaagagggc ggaccgacgt attgggatca ggctgcgacc ctgttcgact ggggtttcgc 3219181 cctcaacccg caggccagcg tcggctcgct ctagcaccgc gagcagacgt gggcgctggt 3219241 gcgcccatca tgttcttttg cgtctgctgg cgctcatagg ccggcggtca gcagcgccgt 3219301 cagcatggga atccgctgtt cctcgagttc gggctgcggc agtacccctc gcacaatggc 3219361 ggcgccgtcg aagacgttgg tcatcaacgc cacgatgacc ggaaatgtct cctccggaaa 3219421 gctctcggca cccggcagag cacgcgcggc gtcgtggatc ttcgcgctgt actgccccag 3219481 cacattctgc agcgtctcct tgagcttctc gtcggtgcgc gcggcgacca taagctcgta 3219541 gagcaccgca ttcgtggagc cggccgtgat gtcccgcaaa atcgtcagcg ccgccggaag 3219601 cgccggccga tcggccggta tttcggcgac ttgcttggtg aacgtttcca gctgacggcg 3219661 caacacctcg tatgccgtgg ccgccatgaa atcacccatc gtttcgaagt gccggaacag 3219721 ggcgcctacc gacaccccag cccgcttggt gatcacggca gccgatgccc gcgcgtagcc 3219781 gacctcgatg atcgtgtcga tgctggcctg cagaagccgt gcaacggttt cttcgcggcg 3219841 ctgctgctgg gtcctggcca tgtcaggcag aacggctcag agcggcgccg agctcacccg 3219901 cccgcaggta gcgacccgac ttcacgttct gcccgtagcc gtcgcggaac tgacctccga 3219961 actggcctcc gcggaatacc acggtgccgc ccacgcccgt cgcaaccacg gtcgcatcgt 3220021 tgcggttgac catgcgtcgc agaccgccat agtagggcac cgcctcctcg tggtacccgt 3220081 ccaccgattc atctaggtgg gtagggtcaa tcaccgcgaa gtccgcacgg tcaccctggc 3220141 gcaacgtgcc cgcgcctata ccgaaccact cggccaactc accggtgagg cgatacactg 3220201 cccgctcgat ggacagaaac ggttgtccgg cccggtcggc gtctctggct cgtttgagca 3220261 gccgaagccc gaagttgtag aacgccatat tgcgcaggtg cgcgccggcg tcggagaagc 3220321 ccatgtggac actcggttcg gcggccagct tgttcagctg gttgggccgg tgattggcga 3220381 cgatggtggt ccatcggaca ttgcgctccc cgttgtccac cagcacatcg aggaacgcgt 3220441 ccagcgggtg cagcccgcgc tcgtcggcta ttgccccgaa actcttaccg atcaacgact 3220501 tatccgggca ttcgacgatc acggcgtcgt ggaagtcccg atgccacaac gaaggtccga 3220561 gcttgatgcg atcgaactcg cgccggaacg accggcggta agacctgtcg gccaggagct 3220621 cgttgcgctg cagttggtca cgcagatgaa gggccgccgt tccggcgccg aactcctcga 3220681 agaccggcag gtcgatgccg tcggagtaca gctcgaacgg gaccggcaga tgctggaatc 3220741 gcacctgaga gcctaagagc ttgttcagca cgcgggtgcc caacccgaac acgtgtaccg 3220801 ccagcggcat cgacttggcg tcggcggaca ccaacatgct cattcgaacg cccttgcgcc 3220861 ggttgaatat ccggctgctg gccaagaaaa acagcagcgc ggacaccggg ttgtcgacgt 3220921 cgggtgcgct ctgcagtatc cggccccggt ggcgcagcac cgagatcagc ttgcgacgct 3220981 cccgccaggt cgcgaaggtg gacggcagcg cacgcgagcg gaagcggtcg ccgtcgagct 3221041 tgtcgatagc ggcgtccatc ccggacatgc ccagcatccc ggcctcgagc gcctcatcga 3221101 gcagtttcgc catcttcgcc agctcggctt cggtgggccg gacggtgtcg tcggtggcac 3221161 gatcaaggcc cagtaccgcg gtccgcagat ccgaatggcc aagcagtgaa ctcacattcg 3221221 gcccgagggg cagggcgtcg atcgcttcga tgtactccgc gggcgtcgac cacgtctggt 3221281 tgtcccgcag ggcacccagg acaaattcgc ggggcaccgc ttcaacacgg ctgaacaggt 3221341 cggcggcatc ctcggagttg gcgtagaccg tcgacaacga gcagtttccc agcagcaccg 3221401 tggtgacacc gtggcgcacc gactcccgca aaccaggatc gagcaacacc tcggcgtcat 3221461 agtgggtgtg cacgtcgatg aagccaggca cgacccactt ccccgccgca tcaaccacct 3221521 ccgggcagcc ggtctcgtcc agtgcgccgg cagccaccgt ggccaccacg ccgtcgcgaa 3221581 tgcccagagt gcgagtcaat ggcgcattgc cggtgccgtc gaaccacagt ccgtcgcgaa 3221641 tgatcacgtc gtaggtcacc gtttcctcca gatcgttgag ttgccgccaa gctaacatag 3221701 atagcgatca ctcgcaatct ttttggctga cgccgcttcg ctgccgcggc gctggtcaag 3221761 tgggtgtcag cgaccgggcc ccggcgccgt tgtggtcggc ggcgtcaggg tggctgtgga 3221821 cgttgtatcg ggggtatcag gaatcgttgc tgggtccggc acggtgactg ccgggggaag 3221881 atcaccactg cggctggcca ccgcgggaat ccgaatgacc gctccttgtt gtccacattc 3221941 gttgctatgc acagtgacga ccatttcgcc gacgaggtcg ccctgcggtt gcggtcgcag 3222001 tgcgagcaac tgcgtcgtgg cctgtgtact cggcgagccg ttgggcccga cgcacgggaa 3222061 ctgcaccgtc tccggccgcg acttccactg gccctcgccg aactgcatga ggaacggcct 3222121 aacgggcgga gtcttggcct gggtgtggtc gttgtcgtcg agcatcgttg cggccgcgag 3222181 acattcggtc ggagtgcacg aagtgcggaa cgcccaccag gtgttcacgt ccggcggttg 3222241 cggcgtaggg gtgtagtcgt aggtctgctt tgagcgttgg atctcgatgc ggtatgtgcc 3222301 gtccagtggg accggcgcgg tgaccgcgac ggtggtcgtc ggggcgctgg gcacagccga 3222361 cccactggtc ggcgggcgcg cgacttcggt ggcggtcgtg ttcgtcttgc gcccaatcac 3222421 gatgccgacc gcgaacaggc cagccaatag caacaccgct accgcaccga ccaggatccg 3222481 gcgtggccgg cgcctggtgg ggctcgccgg agccttggtg gcggtggaga agttgtccag 3222541 gcggcgcgcc agcaccccgg ccgctgactg cagcatcgag ccgcgccgtt ggggggtcgg 3222601 ggccgccggc gccggtgccc gggcggatgg ttctttgcag tcgacggcct cgggccaacc 3222661 ataagccggg tagtcgacga catacgcttc ctcaccggcc gccgcggtga cctcagaagc 3222721 gtcgacaccc cccgagctct gatcagcgat cgcgacgccg gcctgttcgt tcatcgcgtc 3222781 ggcgaactcg cggcagctgc cgaaccggtc cgcgggcgct gtggcgagcg cacgcgagag 3222841 gacaccgtcg aggcgtgcca ggtccgggcg gaaggcggag agcttcggtg gctgcagcgg 3222901 tccggtgtgc gaacgatcaa ccggcggcgc accggcgaac aggtgtatgg cggtaagcgc 3222961 caacgcgtac tgatcggcac gcccgtcaac gtcggccccc gccgacagtt cgggcgccgg 3223021 atagctgggt tggctggcaa ttccgaagtc ggccaacagg atccgttggt cgccagcact 3223081 ctgactggtt agcacgacgt tggcggggtt gacgtcacga tgcagcaggc cgcgctggtg 3223141 ggcgtagtcg agagctccgg ctacggcagt gacgatggcg agtacctcac caaccggcaa 3223201 gaccgccgga aaccggtcgg ccatatgctg cgtggcgtcg atgccatcga cgtagtccat 3223261 cgcaatccac agctgcccgt cgaactcacc gcgatcatga acctccagga tgtgcgggtg 3223321 aaatagccgc gcggcaacct cggtctcccg ttgaaatcgg cggcgaaatt cgtcgtccgc 3223381 agccatcgcc ggcgaaagca ccttcagcgc ctgccagccg gggaatccgg gatgttgcac 3223441 gaggtagacc tcacccatcg cggaacaacc cagcatccgc acgacggtgt agccggcaaa 3223501 ggtcacgccg ctggccaacg ccattggccg atagtaaccg cgttcggcac ggcccgcgcg 3223561 gccaaagcta gggcccaaaa gtcctgccgc gcaaaatcac cagatcggga tgctgcagca 3223621 cacccggacc ctgccgcgga tcctgggcgt agcacaacag gtcggccgat gccctatcgt 3223681 ccagcccggg ccggcccagc cagcgtcgag catcccagca cgccgcgccc aacgcctcgt 3223741 gcgcggtcat cccaatcctc tgcagcgccg ctacctcgtc agcgatccgt ccgtgctcga 3223801 tcgtgctgcc cgcatcggtg cccgcgtata ccggcacccc cgcctcccgc gccgcagcga 3223861 cccgcccata gccgcgggca tacaggtcgc gcatgtgcgc ggcataggtt ggatagcgcc 3223921 ctgccgcatc ggcaatgccc ggaaagtttt ccaggttgat cagcgtgggg accaacgcgg 3223981 tgccgtgctc gagcatcaag gcgatggtgt cgtcggtgag gccggtgccg tgctcgatgc 3224041 agtcgatgcc ggcgttgatc aagccgggca gcgcgtcctc gctgaaaacg tgcgcggtga 3224101 cccgggcgcc ctgagcgtgt gccgtgtcga tggcggcttt gagcacgtca tcggaccaca 3224161 acggggcaag atcgccgatt tgacggtcga tccagtcacc gaccagcttg acccagccgt 3224221 caccgcggcg ggcctgctcg gctaccgctg ccggcagctg ggattcgtct tcgagctcga 3224281 ccgcgaagcc ggcgatgtaa cgcttgggtc tggccaggtg ccgtccggcg cggatgatgc 3224341 ggggcaggtc ttcgtggtcg tcaaggccgc gggtgtcggt cggcgagccg cagtcccgca 3224401 acagcagcgc gccgacgtca cgttcggtct cggcctgagc gatcgcctcg tcgagttcga 3224461 cgttgccgtg tttcccaagc ccgacatggc agtgcgcgtc gaccagcccg ggcaggatcc 3224521 agccgccgtc aaagacggtg tcggctcctg ccaccggttc ggtgctaatg cggccgtcga 3224581 cgatccacag ttggatcgcc gtctcgtcgg gcaggcccaa acctcgcacg tgcaggcgca 3224641 cggcgcggct acggggcctg atggtgtcga cccgcttcac ccggctccgc cgcactcgcg 3224701 atcgccacta cttcttgcct gggaacttca gcttggacag gtcgaagtcg gccaggccgg 3224761 gcggcagctc gtcgagacct ttgggcatct gtgagagatc agggagcccc ccaggtagcc 3224821 cagccaagcc cggcatcccg ggcacgccga acgggctctt gaccttcggc ggcgtcggac 3224881 cgcgcgtccc cttcttactc ttcttgccgg atttgccttt tgcgcccttg ctctttcgcg 3224941 tcgcggattt gcgccctatg cccggtatgc ccatgccccc gagcatggac gacatcatct 3225001 tgcgggcttc gaagaagcgc tcgaccagct ggttgacctc ggacaccgtg acgcccgagc 3225061 cgttggcgat gcgcagccgc cgcgaggcat tgatgatctt ggggtctgcc cgttcctgcg 3225121 gcgtcatgcc gcgaatgatg gcctggacac gatcgagttg tttgtcgtcg acctcggcca 3225181 acgcgtcctt catctgagcc gcgccgggca gcatgcccag caggttgccg atcgggccca 3225241 tcttgcgtac cgcgagcatc tgctcgagga agtcctccag ggtcagctcg ccggcgccga 3225301 tcttggctgc ggcctcctcg gcctgttgtg catcgaagac ctgctcggcc tgttcgatca 3225361 ggctcagcac atcgcccatg cccaagatgc gactggccat ccggtccggg tggaagacgt 3225421 cgaagtcctc cagcttctcc ccggtggagg cgaaaaggat tggaacaccg gtcacttcgc 3225481 gcaccgataa cgcggcacca ccgcgggcgt caccgtcgag cttggtcaag gccacaccgg 3225541 tgaacccgac gccctcgccg aacgccgcag cggtggtgac cgcgtcctgg ccgatcatcg 3225601 cgtccaggac gaacagcacc tcgtcggggt tgatggcgtc gcggatggcc gcggcctggg 3225661 ccatcagctc ctcgtcgatg cccagtcgtc cggcggtgtc gacgatgacg acgtcgaagt 3225721 gcttggcccg ggcctcggcc agcccggccg ccgccaccgc aaccgggtca ccggggccgg 3225781 actccggcga ggcacccgga tgcggcgcga acaccggcac tccggcacgc tcgccgacga 3225841 cctgcagctg gttcaccgcg gccggccgtt gcaggtcaca agcgaccagc agtggcgtgt 3225901 gtccttgtcc acgcaggcgg gcggccaatt tgccggccag tgtcgtcttc ccggagccct 3225961 gcaggccggc gagcatcacg acggtcggcg gggtcttcgc aaacgccaac tcgcgggttt 3226021 cgccgccgag gatgcttatc agttcctcgt tgacgatctt gacgacctgt tgagccgggt 3226081 tgagggcact tgacacctcg gccccgcggg cgcgttcttt gatccggtgg atgaatgccc 3226141 ggaccaccgg tagcgaaaca tcggcttcca gcagcgccaa acgaatttcg cgggtagtgg 3226201 catcgatatc ggcatcggtc agtcggccct tgccgcgcag cccctgcagg gcggcggtca 3226261 aacggtcaga cagcgattca aacacgcccg ccagcctaat ggtgatcgcg agcgccgcgc 3226321 agcggcaccg ttatccgttg actctgcgtc caccacgcaa aagtgcgagt aacccgcctg 3226381 gtggacgcag agtcaacacg atgcgacgtc ggacctgcgc cgaaaagcgt tgccatgcta 3226441 catttcaccg ccgccacctc acggttccgg ctggggaggg agcgggcaaa ttcggtccgt 3226501 agcgacgggg ggtggggagt cttgcagccg gtcagcgcga ccttcaaccc tccgttgcgg 3226561 ggttggcagc gccgggcgct ggtgcagtac ctgggcaccc agccgcggga tttcctcgcg 3226621 gtggccactc ccggatctgg caagacatcg ttcgcgctgc ggatcgcagc cgaactactc 3226681 cgttaccaca ctgtcgagca ggtcaccgtc gtcgtgccca cagagcacct caaggtgcag 3226741 tgggcgcatg ctgcggcagc acacggcctt tcccttgacc caaagttcgc caactccaat 3226801 ccgcagacct caccggagta tcacggcgta atggtcacct acgcccaggt cgcttcgcat 3226861 cccacgctgc accgagtgcg taccgaagcg cgcaagacgt tggtggtctt cgacgagatc 3226921 caccacggcg gcgacgccaa gacctgggga gacgccatcc gggaagcttt cggtgacgcc 3226981 acccgccgcc ttgccctgac gggtacaccg tttcgcagcg acgacagccc aatcccgttc 3227041 gtcagctacc agcccgacgc ggatggcgtg ctgcgttctc aggctgacca cacctacggc 3227101 tatgcggaag ccctcgctga cggtgtcgtc cggccggtgg tcttcctcgc ctattcgggg 3227161 caggcgcgct ggcgggacag cgccggcgag gagtacgagg cgcgactggg cgagccgctg 3227221 tctgccgagc agaccgcgcg ggcgtggcgc acagcgctcg acccggaagg cgagtggatg 3227281 ccggcggtga tcacggcggc cgatcgacgg ctccgacaac tgcgtgcgca cgtacccgac 3227341 gcgggcggca tgatcatcgc ctcggatcgc accacggccc gcgcttatgc ccgcctgctc 3227401 accacgatga cggccgaaga gcccacggtc gtgctctccg acgaccccgg atcgtcggcg 3227461 cgtatcacgg aatttgccca gggcaccagc cgttggctgg tcgcggtccg catggtctcc 3227521 gaaggtgtcg acgtgccccg gctttcggtc ggggtttacg ccaccaacgc ctccacgccg 3227581 ctgttcttcg cacaggccat cggtcggttc gtgaggtccc gccgaccggg tgaaaccgcg 3227641 agcatcttcg tgccgtcggt gcctaacctg ctgcagctgg ccagtgcgtt ggaggtgcag 3227701 cgtaaccacg tgctgggccg accgcaccgc gaatcggccc acgatcccct cgatggtgat 3227761 cccgccacca ggacgcaaac cgagcggggc ggcgcggagc ggggctttac cgcgttgggg 3227821 gccgatgcgg aactcgatca ggtcatcttc gacggttcct cgttcggcac cgccacccca 3227881 accgggagcg acgaggaggc cgactaccta ggcatccccg ggctgctcga tgccgagcag 3227941 atgcgcgccc tgctgcaccg ccgccaagac gagcagctga ggaaacgggc tcagcttcag 3228001 aaaggggcca cccagccagc aacgtcgggg gcttcggcat cggtgcatgg ccaactgcgc 3228061 gacctgcgcc gcgagctcca cacgctggtg tcgattgcgc accaccgcac cggcaaaccg 3228121 catggctgga tccacgacga acggcgccgc cgttgtggcg ggcctccgat cgccgctgcc 3228181 acccgcgctc agatcaaggc acgcatcgat gcgttgcgac agctcaactc cgagcggtca 3228241 tgagcgtgcg atcctaatcg ccgacgggtt cgtcgaccac aacgtcgacg ctggcgccca 3228301 acacctccag caggtgttgc tcgaccgcgg ctctcgcgtc caactccgcg gggaccgtca 3228361 cacagaacac atcggccgcg gtcgatccga acgtattgac cttcgcccag acaatgccgg 3228421 ctcccgcgcc ctccagcgcc ccggccagca acgcgagcaa acccgcccga tccatggccc 3228481 gaacttcgag gatcagcttg gccggcgcgg cggtgtcgag ccacaggatg cggggcggag 3228541 cggccgtacg agtcacgggc accccggcct gcacgtcccc ggcccgagcg gataccaagc 3228601 tggcggcatc gctgtcccgc ttctgcagca tgcccagcac gtcgacgtcg ccgttgaggg 3228661 caccgacaaa ctgctgacgc accaactccg ccgcgggcgg ggacccaaac agtggtgaca 3228721 ccacaaactc ggttatcgcg acaccctggt ggacgttgac cgacgccgaa tgtacgcgca 3228781 gcgagttcag cgccagcacc gcggcggctt tcgacaccag tccccgctcg tccggcgcca 3228841 ctattacggc gtcgatgcgt tcaccgtcgc gcggactaat ctccacatgc accccgtggt 3228901 cggccgccag cgaaagataa tggggtgcag tcggttcggc ttgaggcagc gactctccgg 3228961 ccatcaccat ccggcagcga cgcaccaggt catcgaccag tgacgccttc caatcgctcc 3229021 acaccccggg gccggtggcc ttcgagtccg cctccgacag ggcgtgcaaa acttcgagca 3229081 gttgcggatc cccacccagc gcctcggaca ccgcctcgat ggttttgggg tcgtttaagt 3229141 cacgtcgggt tgccgtaatc ggcagcagca ggtggtggcg gaccagcttg gagagcgtcc 3229201 gcacgtccgg cggcgacaac cccagcctgg tgcaaaccgg gattaccaat tcggccccga 3229261 gcacactgtg atcggtgccc cgtcccttgc cgatgtcgtg cagcagcgcg ccaagcgcaa 3229321 gcaggtcggg acgtgccacc cgggtggcca gtggcgccgc atgcaccgcg gtctcgacca 3229381 cgtgtcggtc aaccgtccac ttgtgggcga cgtcgcgcgg cggaaggtcg cgaatgggct 3229441 cccattccgg caacaaccgg ccccagagcc cggttcggtc gagcgcttcg atggtagcca 3229501 ccgtggtggg gccggcggag agcacaacta gtaagtcgtc caatgcctct tgcggccagg 3229561 gagtcggcag atccgggacg ctggcggcca accggctcag ggtggcggcg ccaatgggca 3229621 atccggtgtc ggccgacgcg gcggccactc ggagcaccag gccgggatcg tgttcgggtt 3229681 cggcgtcgcg ggcgagcacg atttcgccgg catactcgac gacaccctcg tcgagcggtc 3229741 gccgctttgg ccgccgcacc aaggccgaga tgccgcgccg cggcaatgca ttcgccgcag 3229801 tccgcagccc ggcttcggcg tggtaaccga tggtgcggcc agcactcgac agtgtgcgcg 3229861 ccaaatcgaa tcggtcaccg aaacccaacg cggcgctgat ctcgtcggcg aactgggcca 3229921 gcaggtggtc gcgtccgcgg cccgacaccc ggtgcagttc ggtgcgcaca tccagcaagg 3229981 tgcgatacgc accgtccagc gaacccgccg gcaggtccgt gtggccgata ccgtgccggt 3230041 cgatgagctg ggcgagagcc agcgcgtcta gcaactggac gtcccgaagg ccgccgcgac 3230101 ccaatttgag atcgggctct gcgcgctgcg cgatccggcc acagcgccgc caacgcgcat 3230161 atgtcatttc gacgagttcg cccatgcggg aacgaattcc gttgcgccac tggcgtcgca 3230221 cgccgtcgat caacgcgaac gagagctgct gatcgccggc gatgtggcgg gcttccagca 3230281 tgcctagagc ggccatcaga tcggaattgg cgatggtcaa tgcctcacta accgttcgca 3230341 cactgtgatc gagccgaatg ttggcatccc acaacggata ccacaacctg tcggcgacgg 3230401 gccgcaagat gtcagcaggc ttgccatcgt gcaacagcaa cacgtccagg tccgaatacg 3230461 gcagcagctc gcggcggccg agcccgccga ccccgacgat tgcaaaacca ctggcatcgg 3230521 cgatcccgat ctcgtcggcc ttgtcgatca gccaagactc atgcagatcc agccacgtct 3230581 gccgcagccc gaccggatcc agctcgcgat ggttgccgga cagcagctcg cgtcgggcga 3230641 cagctaaatc gcttgcggca caaggacttt ctgcctccat ctccctcgct agcgctaatt 3230701 ggtgcggccg ggttggttca gcacagtgcg gctagtttca taacgcgtcg tgtccgcgtt 3230761 caccggtgcg cacccgcacg atggtgtcta ccggactcac ccacaccttg ccgtcgccga 3230821 tcttgccggt gcgcgccgcc cggacaatgc tgtccacgac cttgtcgaca atggaatcgt 3230881 caacaacgac ctcgatccga accttcggta cgaaatccac cgagtattcg gccccgcggt 3230941 aaacctccgt gtggcccttc tgccgtccgt atccctggat ttcactgacc gtcatcccca 3231001 gcactcccgc gtcctcgagg ctcgtcttga cgtcgtcgag cgtgaacggc ttcacgatcg 3231061 cagtgatcag cttcatttcg gctccgcctc cactttctgg cctatacgct cctgaatgcc 3231121 gttgcggcta tcctccacgg tgacccgcgg ggggagaacc gagccgctgg cgacggcgaa 3231181 atcgtagccg ctttccgcgt gctcagcctc gtcgatgccg gtgctctctt gctccgcgtc 3231241 aagcctgagc ccgatggtga atttcaggat caatgccaag atcagggtga tgattccaga 3231301 gtagacgaga acactgcagg caccgagcgc ctgtcgttcc agctgggcga agcctccgcc 3231361 gtaaaacaac cccttcgata ccccggccac accattaatt gccggagcct ccggagctgc 3231421 cagcagaccc accagcagtg tgcccaccag accaccaacc aggtgcaccc cgaccacgtc 3231481 gagcgaatca tcgaagccca gtttgaattt cagccccacc gccagcgcgc acagcacccc 3231541 ggccgacacg cctaccgcca aggcacccag gacattaacc gacgagcagg acggcgtgat 3231601 ggcgaccagt ccggcgacga tgcccgacgc cgcgcccagc gtcgtagcct tgccatctcg 3231661 gacgcgctcc gtgagcagcc agccaagcat ggccgcggcc gtcgcaatcg tggtggtgac 3231721 aaacgtcgcc ccggcaacac cgttggcggt cgtcgccgat cctgcgttga acccgtacca 3231781 gccgaaccac agcagggcgg ccccgagcat cacaaacggc agattgtgcg gtcgaaacag 3231841 cgtcgccggc caaccgcgtc ttttgcccag cacgatcgcc agcatcaagg ccgccacacc 3231901 ggcgttgata tgaaccgcgg tgccgccggc gaagtcgatg gcgtgcagct tgttggcgat 3231961 ccagccgccg tgctcagcgg cgaaaccgtc aaatgcgaag acccagtgtg cgaccgggaa 3232021 atagacgaac gtcgcccaca aaccggcgaa caacagccag gcgccgaact tcaaccggtc 3232081 ggccaccgcc ccggagatca gcgcaaccgt gatgatcgcg aacatcagct ggaatgccac 3232141 aaacacggtc gccggcaggg tacccgccag cggaatattc accgcggcgg tctgcgtgct 3232201 cggatcggca gcaacagcat tgacgccgat gagacctttg agaccccagt attggctcgg 3232261 gttgccggcg atgttgccaa cgtcatcacc gaacgcaatc gagtagccgt aaagcgccca 3232321 gagcaccgtc acgacaccca tcgcgctgat gctcatcatg atcatgttca ggacgctctt 3232381 ggaacgcacc atgccgccgt agaaaaatgc cagacccggc gtcatcaaca gcacgagcgc 3232441 ggaactcacc agcatccagg cggtgtcgcc gccatccgga acgcccatga tggggaattg 3232501 gtccactcgc tatcacctcc agtcgagcgt tggcacggcc ccagccttac gactgacgac 3232561 ctgatccaga accatgcgca ctagttgttg cggcgatggt gccgccatgt ttcatcagga 3232621 ttaacgtaaa acttgctgtg aaagagcttt ccgtggcgat cgcaagcgcg gcgcagccgc 3232681 gcgcagcggg tcgccaccat cagaccccgt ggcgatcgca agcgcggcgc agccgcgcgc 3232741 agcgggtcgc caccatcaga ccccgtggcg atcgcaagcg cggcgcagcc gcgcgcagcg 3232801 ggtcgccacc atcaaacccc gtggcgatcg caagcgcggc gcagccgcgc gcagcgggtc 3232861 gccacctcgg ctagccgagc agggcgtcga cgaatgcggc gggttcgaaa ggcgccaggt 3232921 catcggggcc ttcaccaagc ccgaccagct tcaccggcac cccaagttcc tgttgaacgc 3232981 ggaacacaat gccgcccttg gccgttccgt ccagtttggt gagcaccgcg ccgctgatgt 3233041 cgacgacctc ggcgaacact ctggcctgcg ccaacccgtt ctgtccgatc gtggcatcga 3233101 gcaccagcaa cacctcgtca acggacgctc gccgagtcac cacgcgcttg accttgtcca 3233161 gctcgtccat caggccaacc ttggtgtgca gccgcccggc tgtatcgatg agcacgacgt 3233221 ctgcgccggc ggcgatgccc ttgtcgacgg cgtcgaacgc caccgatgcc gggtcggcgc 3233281 cttcgggccc gcgaaccacc gctgcgccaa cccgcgccgc ccaggtctgt agctgatcgg 3233341 cggcggccgc acggaaggtg tcagccgcac cgagtacgac ccgtcggccg tcggccacta 3233401 gtacccgcgc caacttgccg accgtggtgg tttttccggt gccgttgacg ccgacgacca 3233461 gcaacaccga aggatggccg gcgtgcggta gcgcgcggat cgagcggtcc atgccaggtt 3233521 gcagttcgtt gatcaggacg tcacgcaata ccgcccgggc gtcggcctcg gtacgcacgt 3233581 tgccgctggc caggcggctg cgcagctgcg acaccaccga cgcggtggcc gccggtccca 3233641 ggtcggcgac cagcagggtg tcctcgacgt cttgccagga gtcctcgtcc aggtcgccgc 3233701 cgccgatcag tcccaacagg ccgcgcccga gggcattctg cgatctggcg agccgtccgc 3233761 gcagtcgttc caatcgacct tcgggcggcg cgatggcgtc agcctcgggg acctctggag 3233821 cctggggttc tggctcaaac tcgggaaggt gtacgtcggc gatcgtgcgc ttgggcgcgt 3233881 cgcgagggac ggtcgcatcg tcgcccacgg cgggcagtcc gctcgtatcg atccgctcgg 3233941 ccggctgggt cgtcggcgtc tgactaaacg tgatgccaga cgatgcggtg taaccgcctg 3234001 agcggtcgac aacgccgcgc tcgggccgag gcgacagact gatgcgccgc cgacggtaga 3234061 gcaccagccc cagggtcagc gcagcgatga cgaccagggc ggcgatgacc gccgtggcga 3234121 tccacaaacc ttcccacacg ctgacaatcc ttccaggggt cgcttgcccc gatgcttagg 3234181 gacgaaccct acgaggaatt ggtaaccagc tgatccacct gctgaccgcg catgcgctgc 3234241 gagatgaccg cggtgatgcc gtcgttctgc atggttacgc cgtacagtgc gtccgcgacc 3234301 tccatcgtcg gcttctggtg ggtgatgatg atgatctgcg actgctctcg cagctgttcg 3234361 aacaggctga gcagtcggcg caggttcacg tcgtcgaggg cggcctccac ctcgtccatg 3234421 atgtagaacg gcgatggacg ggcacgaaag atcgcgacca gcatcgccac cgcggtcagc 3234481 gccttctcgc caccggagag caaagacagt cgggtaatct tcttgcccgg cgggcgggct 3234541 tcgacctcga tgccggtggt gagcatgtcg tcgggctcgg tcagccgcag ccgtccttca 3234601 ccaccgggga acaatgcggt gaacacgccg cgaaattcgc gttccacgtc tacgaacgcg 3234661 tcattgaaca cctgcaggat gcgggcgtca acatcggcga cgacgcccag cagatccttg 3234721 cgggcagcct tgacatcctc gagttgggtg gacaggaaat tgtagcgctc ctccaaggca 3234781 gcaaactctt cgagcgccag cgggttgacc ctgcccaact cggcaagcgc acgctcggcg 3234841 cgtttggccc ggcgctcctg ggtaacccgg tcgaacggca tgggggcggg cgcaatcacc 3234901 tgctcgccgc gttcgcgggc ttgctcgaac tcagccatct cgagctcggt cggtggtagc 3234961 gccacatgtg gaccgtattc ggtgatcaag tcggccggcg ccattccgaa ctgctctagc 3235021 accatctgct caagctgctc gatacgcagc gccgcctgcg cgttagccag ctcgtcgcgg 3235081 tgcagcgaat cggtgagttc ccccactcgg gcgctcagcg tgttcacctc gtcgcgcacc 3235141 gcggccatcg ccgctaaccg ctgctgacgt tgcgcggccg acgcgtcgcg cagttgcgac 3235201 gccccgtcca ccgcccggtg caaccgcccg gccagcagcc gtccgcagtc ggcgaccgct 3235261 gcggccaccg cggccgcatg cagtcttgcg gcgcgtgctt gctgagcccg cacccgcgcc 3235321 tcacgttccg ccgcagccgc acggcgcagc gaatcggccc gcccgcgaac cgcgttggcg 3235381 cgttcctcgg cggtgcgcac cgccagccgg gcttccactt cgacaccgcg ggcgcgatcg 3235441 gcagcggcac tgatcgcctg gcggtcgatc ggttgggcca cctgcacccg ttgggtctcc 3235501 tgggccttac gcagctgggt ctcaagttgt atgacgtcgt cgagagtctg tgtgcgcacg 3235561 gcttcctgtt ccgtacgctg ctgcagcaac cggttccact cttcttccgc cgcgcgggcc 3235621 tcctgcccga ggcggcccag ctgctcgtac atcgccgaga tggccgtgtc ggattcgtta 3235681 agcgcggcca aggcttgctc ggccgcgtcc tggcgggcgg actgctcggt cagcgcaccg 3235741 gccagggccg cattcaattg cgccgccagc gcctcggcag cggccagctc actcctggcc 3235801 ttgtcgatct cggaggtgac ctccaaggtg gacagcttgc ggtccgatcc gccgctgacc 3235861 cagccggcgc ccaccagatc accgtcaacg gtgaccgcgc gtagctccgg acgaatctcg 3235921 accaggccca ttgcctcagt caggtcgttg accaccgcga cacccgaaag catggcgatc 3235981 atcgcgccaa ccaactgcgg tggagactcg accaggtcta gggcccactg ggcgccgcta 3236041 ggcagcatct cccccgaggc ggattggggg gcttgcgggg ccggccagtc actcagcacg 3236101 aggaccgcgc gaccgccgtc ggcttgtttg agtgcgctga cggcactacc cgcggcagtc 3236161 aggccgtcca ccgcaagtgc gtcggccgcc ggcccgagcg ccgcggccag tgccgcttca 3236221 tagccggaac gtaccttcac caattgggcg atcgaaccga aaagccctgc gccactgcga 3236281 ttgtgcgcca gccacgccgc gccgtccttg cgctgtagcc ccactgcgag cgcatcgatg 3236341 cgagcccgta gcgatgccac ctggcgttcg gcggcgcgtt cggcggattg cagctcggcg 3236401 acgcgttcgt cggccaaccg caacgcggcc acagtacgct cgtggtgctc atccaggccg 3236461 acctcgcctt gatccagttc accgatgcgg ccctgcacgg tttcgaactc ggctcgggtc 3236521 tgctgggcgc gcattgcggc atcctcgatc cgctcggaca accgtgccac gctctcatcg 3236581 atcgattcga cacgcgcccg catggtctcc acctggccag ccagccgcgc cagtccctca 3236641 cggcggtccg cctcctcccg gaccgccgcc aggtgtgccc ggtcggcctc ggcggcgcgg 3236701 cgctcccggt cggccagctc tgcacgggca gcatcgagtc gggcacgcgc cgcgtccagc 3236761 tccgctaaca gttgttgctc ggcgacggcc acctgctggg cctcggcttc tagctcctcg 3236821 ggctttctgg ggtcggtgtc gctgaccgct accggctcga tatcgagatg atgggcgcgt 3236881 tcgctggcga tgcgcaccgt agcgtccacc cgttcggcca gcgcagacag cccgaaccaa 3236941 gtgtgctgga tcgactcggc ccgcgtcgag agttcggcga ccgcggactc atgcgcggcc 3237001 agctcctcgg atgccaccgc cagccgggcg gcggcctcgt catgctcgcg gcgcatcgca 3237061 gcctcggcct gaaagaccgc ttcccgttcg gctctgcggc ttaccaagtc gtcggccgcc 3237121 aggcgcagcc gggcgtcgcg cagatcggct tggatggccg cggcacgctg ggccgcctcg 3237181 gcctgccggc ccagcggttt gagttgacgc cggagctcgg tggtcagatc ggtgagccgg 3237241 gccaggttcg ccgccatcgt gtcgagtttg cgcagagctt tttccttgcg cttgcgatgc 3237301 ttgagcacac cggcggcttc ctcgatgaac gcccgccgat cctcaggccg cgactgcaag 3237361 atctcctcga gcttcccttg cccaacaatc acatgcatct cacggccgat gccggagtcg 3237421 ctcagcaact cctgcacatc catcaaacgg caactgctgc cgttgatttc gtattcgctg 3237481 gcaccgtcgc gaaacattct tcgggtgatc gacacctcgg tgtattcgat aggcagtgcg 3237541 ttgtcggagt tgtcgatgct aacggtgact tcggcgcggc ccagcggcgc acgcgacgag 3237601 gtgccggcga agatgacgtc ttccatcttg ccgccgcgca gcgtctttgc cccctgctcc 3237661 cccatcaccc acgccagggc atcgaccaca ttggatttgc cggagccgtt gggcccaacg 3237721 acggccgtaa tgcccggctc gaagcgtaaa gtcgtcggcg cggcgaagga cttgaagccc 3237781 ttcaacgtca gactcttgag gtacacgagg ggccagatta ccgctcgctg aacccggtga 3237841 tctgctccgt cgactgcgac cagtcggcga cgactttggc gacgcggccc ggtgtcgtgt 3237901 cgccctgcag cagctgcagc agcttctggc acgcagcgcg cggaccctgg gcgaccacca 3237961 gcacgcgtcc gtcggcgtgg ttggccgcgt aaccggtcag gccgagctcc aacgctcggc 3238021 agcgggtcca ccagcggaaa ccgactccct gcacccaccc gtgcacccag gcggtcagcc 3238081 gcacgtcagg cgccgacatc gacgacctcc aagttgaccg tggtgcccga cttgagggtg 3238141 cgcccgacgg tgcacgccag ctccaccgcg cggttgatga ccaccagcag acgctccttt 3238201 tcgtcctcgg tgaggcccga caagtcgagc tccatggtct cctcgatcag gggatagcgc 3238261 tcctggtcgc ggtcggccgc accggatacc ttgaccaccg cctggtagtc gtcgccgagc 3238321 cgccgggcca gcggctggtc actggccatc ccgctgcatg cggcgagtgc gatcttgagc 3238381 agctctccgg gggtgaatac cccgtcgacg tcctcggagc caaccagcac ctgcgccccc 3238441 cgcgtgctgc gtccgatgta acggcgcgtg ccggtgcgct cgacccacag ttgcgtcatg 3238501 gcttctttct acccgggggt ctttgcgtcg agatcgacgg cagcgccccc gcgagagaga 3238561 gcatcgcgct gacgtcgatc tcgatgcgtc aacacccgcc ctactttcgg ggccgcggct 3238621 ggcatcgcgg gcagtagaac gacgagcggt tcataaacct ctcccggcgt atcaccgcgc 3238681 cgcagcgccg acagttttcg ccttcgcggc cataagcgtc cagcgaccgc tcgaagtagc 3238741 ccgactcgcc gttgacgttg acatacaaag agtcgaacga ggtgccacct ttcgccagcg 3238801 cttcgcgcat cacgtcggcg gcggcatgca ggaccgctcc cagacgccgg caccttagtg 3238861 tggcggcgac gtgggcgccg ttcaccttgg cccgccacag cgcctcatcg gcatagatgt 3238921 tgccgattcc cgacaccacc cgctgatcca gcagctggcg cttgagttcg gaatgcttgc 3238981 gccgcaacac tttaactaca gcgtcacaat cgaaccgcgg gtcaagcggg tcgcgcgcca 3239041 ggtgggcgac cggcaccggt accacgctgc cgtccaccgt caccaggtcg gcaagcagcc 3239101 accctccgaa ggtccgttgg tcagcgaagc tcagcacggt cccgtcgtcg agcagcgcgg 3239161 aaatccggac gtgagcggca cacggcaccg ccccgagcag catctgccca ctcatgccca 3239221 ggtgcaccac gagtgcggtg tccgtcggcc tatggacccc agccgtattg agtgtcaacc 3239281 acaggtactt gccgcgccga tcggttccgt tgatccgcgc tccccgcagc cgcgccgtca 3239341 gatccgcggg cccggcatcg tggcggcgca cagcgcgggg gtggtgcacc cgaacctcgg 3239401 tgatggtccg gccggtcacg tgagcctgca agccgcgccg caccacctcg acttcgggca 3239461 gctcgggcat ccagtgatga tcgcaagcgc ggcgaagccg ggcgcagcgg gtcatcacca 3239521 tcgaaccagt gatgatcgca agcgcggcga agccgggcgc agcgggtcat caccatcgaa 3239581 ccagtgatga tcgcaagcgc ggcgaagccg ggcgcagtcc cccgcaagcg ggaggtgccc 3239641 ccaggtcatc accatcgaac cagtgatgat cgcaagcgcg gcgaagccgg gcgcagtccc 3239701 ccgcaagcgg gaggtgcccc caggtcatca ccatcgaacc agtgatgatc gcaagcgcgg 3239761 cgaagccggg cgcagtcccc cgcaagcgcg gcaaagccgg cgcccccagg tcatcaccat 3239821 caatccagtt aggcggaggt tttgcccggc atggcgttgt cgagcacttc cagggctttc 3239881 caagcggccg ccgcggcttt ttgctcggct tcttttttgg accggcccac tcctgaaccg 3239941 tattcgctgt ccatcacgac aaccaccgcg gtgaattcct tatcgtggtc cgggccggtg 3240001 gaggtgacca ggtatgacgg cgcacccagc cctcgcgctg cagtcagctc ctgcaagctg 3240061 gtcttccaat ccaatcccgc acccagggtc ggcgcggcgt ccagcaacgg gccaaacagc 3240121 cgcaggatca cctcacgggc cttctccata ccgtgttgca ggtagatcgc gcccagcagc 3240181 gattccatac cgtcggccag aatgctggac ttgtcggccc cgccggtgtt cgcctcgccg 3240241 cgacccaata gcacgtgaac accgaggcct tccgcacaga ggcggcgtgc gacgtcggcc 3240301 agggcctggg tgttgactac gctggcccgc agtttggcca gatccccctc cgaccgatca 3240361 ggatgacgat ggaacagcgc gtcggtgatg gtcagcccta gcacggcatc gccgagaaac 3240421 tccaaacgct cgttggtcgg cagcccgccg ttctcgtagg cgtagctgcg gtgggtcaac 3240481 gccagtgaga gcagctcgtc cgggaggtcc acaccgagtg cgtcgagcag gggttgtcgt 3240541 gaccggatca tcgctcacct cgtaatgtgt cggactccgg cccgagcatt tcgaccaact 3240601 tcgcccaccg cgggtcgatc tgttcatggc gatgacctgg ctcgctggcc agcgggacac 3240661 cgcactgcgg gcaaagaccc gggcagtccg gccggcacac cggcgaaaac ggcaattcca 3240721 gaccgaccgc atcgatgatc ggctgctcga gatcgatggt ttcgtcgacg acgcgtccga 3240781 cctcgtcttc ctcggtggtc tcgtcggtgg cgctatccgg ataggcaaac agttcggtca 3240841 gggctacctg aacgcgaccc cgcaccgggc tgaggcaacg agcacactcg ccgacggtcg 3240901 gggcggccac ggtcccggtc accaacacgc cttcggacac cgactcgacc cgcagatcca 3240961 ggtccagaag ggcgccctgg tcaatcgcga tcagctccag cccgatgcgt gcggggctgt 3241021 gcacggtgtc atgcagctcg aacatcgctc ccggtcgtcg ccccaaccgt gcgatgtcga 3241081 ccgtcatcgg cgacgccaca tgtcgctgcg cagtgggacc gtgctgcctg gccataagag 3241141 aaatcctacg gcgcacgcca cccagatcca cgccgcgttg ggcgttggcc ggcgcttgat 3241201 ccttgcgccg ggtgaatggt gttagcgcac cgcgtagtcg tgagtgccgg ccgctgtgcg 3241261 gagctggtgg cgaccgcggc caacggaccg cagggtgccg ttgaggaatt cctcgaattc 3241321 ggcgagcttg ttgtcgacgt agatatcgca ttctccgcgt agccggtccg cctcggcgtg 3241381 cgccgtgtcg acgaggcggg tcgattccgc gttggccgcc gcaaccacct cgttctgcga 3241441 taccaggcgc tgctgctctt tgatgccctc ctgcacggct ttctcgtagg agatgttgcc 3241501 gttttcgatc agccggtcgc attcggcctg ggcgcgactg acgctggcct cgtattcgcg 3241561 tttcgcggcg gtggcgatgc gaatcgcctc ctcgcgtgca tcggcgacca ttcgctcgct 3241621 gtgctggcgt gcctcgctga ccatccgatc agcctgcgcc ttcgcgtcag acaggatccg 3241681 gtcagcctcg gtgcgggcgt ggttgagtat cgactccgcc tcagtggtcg ccgaggacac 3241741 catagagtca gcgtgcgtct tagcgtcctg caacatcgaa tcacgtgcgt cgaggacgtc 3241801 ctgcgcgtca tccagctcac cggggatcgc atccttgatg tcgtcgatca actccagcac 3241861 atccccacgc gggacgacgc aacctgccgt catcggcacg cctcgggctt cttcgactat 3241921 ggcgctcaat tcgtccagcg cttcaaagac tcggtacacg gccacaccct cctggcatct 3241981 tgcaagatcc ctgttgttac cagtgtgcct ggtgtttcgt ctgtgactgc actggtggcg 3242041 ccggtgtgtc gggacacaat ttcatattcg acgagcccgg gcgaccactc agatcacgcg 3242101 gcctgctggg cgcgtgtcgt agaccgttcg gcggctgacg ggtgagccta cgtcgtctgg 3242161 gcgatcttgc ccgagcgtgc cgacaacgta ggtgtcgatg ctggcccgtc acggaccacg 3242221 ctatggtggc tcggtgaacg ggcactcaga cgacagtagc ggcgacgcga agcaagccgc 3242281 acccacgctg tatattttcc cgcatgccgg cggcaccgcg aaagactatg tcgcattttc 3242341 ccgagaattt tccgccgacg taaagcggat tgctgtccaa taccccggcc agcacgatcg 3242401 ttctggcctg ccaccgcttg agagtattcc caccctcgct gacgaaatct ttgcaatgat 3242461 gaaaccgtcg gctcggatcg acgatccggt ggcattcttt gggcacagta tgggcggaat 3242521 gctagccttc gaagtagcgt tgcgatacca atcggcgggc catcgagtcc tggcattctt 3242581 tgtgtcggcc tgctcagcac cgggtcatat cagatacaag cagctccaag atttatcaga 3242641 tcgcgagatg ttggacttgt tcacccgaat gacaggaatg aatccagatt tctttaccga 3242701 cgacgaattt ttcgttggag cgctacccac gttgcgagcg gtccgagcca tcgccggtta 3242761 ttcctgccca ccagagacga agctctcgtg tccgatttat gcctttatcg gagataaaga 3242821 ttggatcgca acgcaagacg acatggatcc gtggcgcgat cggacgacgg aagagttctc 3242881 tatccgtgta ttccctgggg atcacttcta cctcaacgac aatttgccag agctagtcag 3242941 cgacatagaa gacaaaacac tccaatggca tgatcgagct tagctatgct ccggatgtag 3243001 ctggccgaag atccaactgg ccgaagggct cgggggtcaa cacctggaca gccattcgct 3243061 ggacatttgc tgaagattca ccgtacgtcg gcaccggtct ggagcggatg gcttcagaca 3243121 cacacggggg cggtggcggc cgaccggtca ccccgccccc gcccggtatg caccatctcg 3243181 ggtgcagccg aggcgtgttg ttaatctcgt cacaacggga cgccggtcac aagacgtgcg 3243241 acccagccgc cggcggcact ctgacctcgg ttcttacctg actaccaatt cgtcaccggc 3243301 atcgcacacg tcacaccaac cacagcggac gcggcacggc acgcggaagg gacgttagac 3243361 tcggctagca ccaccaccgt gcccaggcaa cgacgccggc cgtcgctaag aaatttggtt 3243421 gacttcatga ataaggccgc gcccgccccg acaaatgatt accttacatt tgcgggctag 3243481 gcatagcgga gcaggggttt tagtctaggg ggagatcggc tggcgctgcg cagacatgct 3243541 gcggaagcag aactgcgtaa tcgtcaggtg gcttggtcag ttcagaccgg cacgtttcag 3243601 agcggtgggg atgtcccgac gtgcgatccg acaggggttc gcagggtccg caaaaaacat 3243661 agtgaacgcc agaaagccga atgggagtac aaggcgatgc cggtgaccga ccgttcagtg 3243721 ccctctttgc tgcaagagag ggccgaccag cagcctgaca gcactgcata tacgtacatc 3243781 gactacggat ccgaccccaa gggatttgct gacagcttga cttggtcgca ggtctacagt 3243841 cgtgcatgca tcattgctga agaactcaag ttatgcgggt tacccggaga tcgagtggcg 3243901 gttttagcgc cacaaggact ggaatatgtc cttgcattcc tgggcgcact tcaggctgga 3243961 tttatcgcgg ttccgctgtc aactccacag tatggcattc acgatgaccg cgtttctgcg 3244021 gtgttgcagg attccaagcc ggtagccatt ctcacgactt cgtccgtggt aggcgatgta 3244081 acgaaatacg cagccagcca cgacgggcag cctgccccgg tcgtagttga ggttgatctg 3244141 cttgatttgg actcgccgcg acagatgccg gctttctctc gtcagcacac cggggcggct 3244201 tatctccaat acacgtccgg atcgacgcgt acgccggccg gagtcattgt gtcgcacacg 3244261 aatgtcattg ccaatgtgac acaaagtatg tacggctatt tcggcgatcc cgcaaagatt 3244321 ccgaccggga ctgtggtgtc gtggctgcct ttgtatcacg atatgggcct gattctcgga 3244381 atttgcgcac cgctggtggc ccgacgccgc gcgatgttga tgagcccaat gtcatttttg 3244441 cgccgtccgg cccgctggat gcaactgctt gccaccagcg gccggtgctt ttctgcggca 3244501 ccgaatttcg ccttcgagct ggccgtgcgc agaacatctg accaggacat ggcggggctc 3244561 gacctgcgcg acgtggtcgg catcgtcagt ggcagtgagc gaatccatgt ggcaaccgtg 3244621 cggcggttca tcgagcggtt cgcgccgtac aatctcagcc ccaccgcgat acggccgtcg 3244681 tacgggctcg cggaagcgac cttatatgtg gcagctcccg aagccggcgc cgcgcccaag 3244741 acggtccgtt ttgactacga gcagctgacc gccgggcagg ctcggccctg cggaaccgat 3244801 gggtcggtcg gcaccgaact gatcagctac ggctcccccg acccatcgtc tgtgcgaatc 3244861 gtcaacccgg agaccatggt tgagaatccg cctggagtgg tcggtgagat ctgggtgcat 3244921 ggcgaccacg tgactatggg gtattggcag aagccgaagc agaccgcgca ggtcttcgac 3244981 gccaagctgg tcgatcccgc gccggcagcc ccggaggggc cgtggctgcg caccggcgac 3245041 ctgggcgtca tttccgatgg tgagctgttc atcatgggcc gcatcaaaga cctgctcatc 3245101 gtggacgggc gcaaccacta ccccgacgac atcgaggcaa cgatccagga gatcaccggt 3245161 ggacgggccg cggcgatcgc agtgcccgac gacatcaccg aacaactggt ggcgatcatc 3245221 gaattcaagc gacgcggtag taccgccgaa gaggtcatgc tcaagctccg ctcggtgaag 3245281 cgtgaggtca cctccgcgat atcgaagtca cacagcctgc gggtggccga tctcgttctg 3245341 gtgtcacctg gttcgattcc catcaccacc agcggcaaga tccggcggtc agcctgcgtc 3245401 gaacgctatc gcagcgacgg cttcaagcgg ctggacgtag ccgtatgacg ggaagcatca 3245461 gtggtgaagc cgaccttcgc cactggctaa tcgactacct agtaaccaat atcggctgca 3245521 cacctgacga ggtggacccc gatctgtcgc ttgccgacct cggcgtcagc tcccgcgacg 3245581 cggtcgtact gtccggcgaa ctgtcagagc tgctgggcag gaccgtatcg ccgattgact 3245641 tctgggagca cccgacgatc aacgcgctgg ccgcgtatct ggccgcaccc gagccgagcc 3245701 ccgactccga cgccgcagtc aagcgtggtg cccggaactc actcgacgag ccaatcgccg 3245761 tcgtcggcat gggatgtcgt ttccctggcg ggatttcgtg cccagaagca ttgtgggact 3245821 ttctctgtga acgccgttcc tcgatcagcc aggtgccgcc gcaacgatgg cagcccttcg 3245881 aaggcgggcc acccgaggta gccgcggcgc tagcgcgcac tacacggtgg ggctcatttt 3245941 tgcccgacat cgacgccttc gacgcggaat tcttcgagat ctcccccagc gaagccgaca 3246001 agatggaccc ccagcaacgc ctgctgctgg aagtggcctg ggaagcgttg gagcacgcgg 3246061 gaatcccgcc cggcacgctg cgccgctcgg caacaggagt gtttgccggg gcatgcctga 3246121 gcgaatacgg tgcgatggct tccgccgatc tgtcgcaggt cgatggttgg agcaatagcg 3246181 gtggcgcgat gagcatcatc gccaaccgcc tctcgtattt ccttgacctg cgcggcccgt 3246241 cggtggcggt agacaccgca tgctcgtcgt cgttggtagc gatccacctg gcctgccaga 3246301 gccttcggac ccaggactgt cacctggcaa tcgcagccgg cgtgaatttg ttgttgtccc 3246361 cggcggtatt tcgcggtttc gaccaagtcg gcgccttgtc cccgacaggt cagtgccgtg 3246421 cgttcgatgc gaccgccgac gggtttgtcc gcggcgaggg tgccggggta gtggtgctca 3246481 agcggttgac cgatgcacag cgcgacgggg atcgggtgct tgcggtgatc tgcggttctg 3246541 cggtcaacca ggacggccga tccaacgggc tgatggcccc caacccagcg gcccagatgg 3246601 cggtgctgcg tgccgcctac accaacgcgg ggatgcagcc cagcgaggtc gactacgtcg 3246661 aagcgcacgg aacagggacg ctgttgggcg acccgatcga agcccgcgct ctcggaacgg 3246721 tgctgggtcg cggccggccc gaggattctc cgttgctcat cggctctgtc aagaccaacc 3246781 tcggtcacac cgaggctgcg gctggaatcg cgggcttcat caagacggtg ctggctgtgc 3246841 agcatggcca gattccgcca aatcagcact tcgaaaccgc gaacccgcac attcccttta 3246901 ccgacttgcg gatgaaagtc gttgacacac aaactgaatg gccggcaacg ggccatcccc 3246961 gccgtgccgg tgtgtcgtcg ttcggcttcg gtggcacaaa cgcgcacgtg gtgatcgagc 3247021 agggccagga ggtgcgcccc gcgcctggac aaggcttaag tccggcggtg tcgaccctgg 3247081 tagtggccgg caagactatg cagcgggtgt ccgcgaccgc ggggatgcta gccgattgga 3247141 tggaagggcc cggcgctgac gtggccttgg ccgacgtggc ccacaccctc aatcaccacc 3247201 gatcgcggca acccaagttc ggcacggtgg tggcccgtga ccgtacccag gcgatagccg 3247261 gattgcgtgc gctggccgcc ggccaacacg cccccggcgt ggtcaaccct gccgacggct 3247321 cgccggggcc gggcaccgtg ttcgtctact ccggccgcgg ttcacagtgg gctggcatgg 3247381 gccgtcaatt gttggccgac gagccggctt tcgcggccgc ggtcgccgaa ttggaaccgg 3247441 tgtttgtcga gcaagccggc ttttcgttgc acgacgtgct ggctaacggc gaggaactgg 3247501 tcggtatcga gcagattcag ctcgggttga tcgggatgca gctggccctg accgaattat 3247561 ggtgttccta cggggtgcgg cccgacctgg tgatcggcca ctccatgggc gaggtggccg 3247621 ccgccgtggt cgccggggca ctgaccccgg ccgagggtct gcgggtgacc gccacccggt 3247681 cacggctgat ggcaccgttg tccggccagg gcggcatggc actgctggaa ctcgacgcgc 3247741 ccactaccga ggcgttgatt gccgacttcc cacaggtgac gctcggtatt tacaactcac 3247801 cacggcaaac ggtgatcgcc gggcccaccg agcagatcga tgagttgatc gcccgggtgc 3247861 gcgcgcaaaa ccggtttgcc agtcgggtca atatcgaagt ggccccgcac aatccggcca 3247921 tggatgcttt gcagccggcg atgcgttcgg agctggccga tctgacccca cggaccccca 3247981 ccatcggaat catctccacc acctacgcag acttgcacac ccaaccggtc ttcgacgccg 3248041 aacactgggc caccaacatg cgcaaccccg tgcgcttcca gcaggccatc gcttccgccg 3248101 gtagcggcgc cgacggcgcc taccacacct tcatcgaaat cagcgcacac ccgctgctga 3248161 cccaggccat catcgacact ctgcacagcg ctcaacccgg agccagatac accagcctcg 3248221 ggaccctgca acgcgacacc gacgacgtcg tgaccttccg gaccaacctc aacaaggccc 3248281 acaccatcca cccaccgcac accccccacc cccccgagcc acatccgccc atccccacca 3248341 ccccgtggca acacacccgt cactggatca ccaccaaata tccggccggc tctgttggat 3248401 cggccccccg agcgggcaca ctgctcggcc aacacaccac cgtcgccacg gtctcagcga 3248461 gtccgccctc ccacctctgg caagcaaggc tggctccgga cgccaagccg taccagggcg 3248521 gtcatcgatt ccaccaagtc gaggtggtcc cagcttctgt tgtgctgcac acaatccttt 3248581 ccgctgcaac agaattgggc tactccgcgt tgtccgaggt ccgattcgag caacccattt 3248641 tcgccgaccg gccacgtcta atccaggtcg tcgccgacaa ccgggcgatc agcctggcct 3248701 cgagtccggc tgccggaaca ccctcagacc ggtggacgcg gcatgttacc gcacaacttt 3248761 cctcgtcacc gtcggattcg gccagcagct tgaacgagca ccatcgcgcc aacgggcagc 3248821 cgcccgaacg tgctcaccgc gacctgattc ccgacctggc cgagctgctc gcaatgcgcg 3248881 gcatcgatgg cctgcctttc tcatggaccg tcgcgtcgtg gacacagcac tcgagcaacc 3248941 tcacggttgc gatcgatctc cccgaagctc tgcccgaagg gtcgactggg ccgctccttg 3249001 acgccgcggt gcacctcgcc gcgctatcgg acgtcgctga ttcgcggctc tacgtgccgg 3249061 caagcatcga gcagatatcg ctcggcgatg tcgtcaccgg gccgcgtagc tcggtgacgc 3249121 tgaaccgcac cgctcacgac gacgacggga tcaccgtcga tgtcaccgtt gcagcccacg 3249181 gcgaagtgcc gtccctgtcg atgaggtcgc ttcgataccg ggctctggac tttggcctag 3249241 acgttggtag ggcgcaaccg cccgcgtcga ccggtccggt cgaggcctac tgtgatgcca 3249301 ccaatttcgt acacacgatc gactggcaac cgcagaccgt tccggacgcg acgcacccag 3249361 gggccgaaca ggtaacccat ccaggacccg tcgcgataat cggcgatgac ggcgcagcgc 3249421 tgtgtgagac cctcgaaggg gcgggctacc agccggccgt gatgtccgat ggggtgtcgc 3249481 aggcccgcta cgtcgtttac gtcgcggatt ctgatccggc tggcgccgac gagaccgacg 3249541 tcgacttcgc cgtccggatc tgtaccgaaa tcaccggtct ggtgcggact ctcgcggaac 3249601 gcgatgcgga taagcccgcg gcgctatgga tcctcacccg cggagttcac gaatcggtcg 3249661 ccccgtccgc gctgcgccag agtttcctgt ggggccttgc cggtgtcatc gccgccgaac 3249721 atcccgagct gtggggcgga ctggtcgatc tcgcgatcaa cgacgactta ggcgaattcg 3249781 ggccggcact tgccgaactg cttgccaaac caagcaagtc gatcttggtg cgtcgtgacg 3249841 gcgtggtgct cgccccggcc ttggctcccg tccgtggcga gccggcgcgc aagtccttgc 3249901 agtgcaggcc cgacgcggcc tacctcatca ccggcggcct gggcgccctt ggcctgctga 3249961 tggccgattg gctcgccgac cgcggcgctc atcgattggt gttgaccggc cgcacgccat 3250021 tgccgccacg gcgggactgg caactcgaca ccctcgacac cgagctgcgc cggaggatcg 3250081 acgcgatccg cgccctggaa atgcgcgggg tgactgtcga agccgtcgcc gccgacgtcg 3250141 gctgccgcga agacgtgcag gccctgttgg ccgcgcgcga ccgtgacgga gcggcaccga 3250201 tccgcgggat catccacgcc gcgggcatta ccaacgatca attggtgacg agcatgaccg 3250261 gcgatgcggt gcgacaggtt atgtggccga agatcggcgg cagccaggtc ctacacgacg 3250321 catttccgcc cggcagcgtg gacttcttct acttgaccgc ctcggctgcc gggatattcg 3250381 gcattccagg gcagggttcc tacgccgccg ccaattccta cttggacgcg ctggcgcggg 3250441 cgcgccggca acagggctgc cacaccatga gcctcgactg ggtagcctgg cgggggctcg 3250501 gattggccgc ggacgcccag ctcgtcagcg aagagctagc gcgaatgggt tcgcgtgaca 3250561 tcacgccgtc ggaggcattc accgcttggg aattcgtcga tggctacgac gtcgcgcaag 3250621 cggtcgtggt gcccatgccc gctccggcgg gcgccgatgg atccggtgcg aacgcttacc 3250681 tattgccggc gcggaactgg tcggtgatgg cagcgaccga ggtgcgatcc gagctcgaac 3250741 aggggttacg ccgcatcatt gcagccgagc tgcgagtgcc tgagaaagag ctggacaccg 3250801 accgcccgtt cgccgagttg ggtctcaatt cccttatggc aatggcgatt cggcgcgagg 3250861 ccgagcagtt tgtcggcatc gagttgtctg ccaccatgtt gttcaaccac ccaacggtca 3250921 aatcactcgc cagctacctt gccaaacgtg tggcaccgca cgatgtgtca caagacaacc 3250981 agatttccgc gctatcctcg tcggccggaa gtgtgttgga cagtctattc gatcgcatcg 3251041 aatcggcgcc gcctgaggcc gagaggtcgg tgtgatgcga acggctttca gccggatttc 3251101 cggtatgacc gcgcaacagc gcacctccct agccgacgag ttcgacaggg tctctcgcat 3251161 cgccgtggcc gagccggttg cggtggttgg catcggctgc cgctttccgg gagatgtgga 3251221 tggaccagag agtttctggg actttctggt cgcgggcagg aatgcgatct cgacggtgcc 3251281 ggcagatcga tgggacgcag aagcgtttta ccaccccgac ccgctaacac cggggcggat 3251341 gacgacgaag tggggcggct tcgtccctga cgtcgcgggc ttcgacgccg aattcttcgg 3251401 tatcacaccg cgggaagccg cggcgatgga cccgcagcag cgaatgctgc tggaggttgc 3251461 ctgggaagca ctcgaacatg ccggcatacc accggattcc ctcggcggca cccgaaccgc 3251521 cgtcatgatg ggggtctatt tcaacgagta tcagtccatg ttggccgcca gtccgcagaa 3251581 cgtagacgcc tacagcggga ccggaaatgc acacagcatc acggtgggtc gcatctccta 3251641 cctgttggga ttacggggtc cggcggtcgc ggtggacacc gcctgctcgt cgtcgttggt 3251701 ggctgtgcac ctggcgtgtc agagtctgag gctgcgcgag accgatctgg ctctcgccgg 3251761 tggagtgagt atcacccttc gcccagagac ccaaatcgct atctctgcct ggggattgct 3251821 gtccccgcag ggccggtgtg ccgcattcga tgcggcggca gacggatttg tgcgcggtga 3251881 gggcgccgga gtggtagtgc tcaagcggtt gacggacgcg gtgcgcgacg gcgaccaggt 3251941 gctggcggtg gtgcgcggtt cggcagtcaa ccaggacggc aggtccaatg gcgtaacggc 3252001 gccgaatacg gcagcccagt gcgatgtgat cgccgatgcc ttgcgatccg gcgatgtggc 3252061 gcctgacagc gtgaattacg tagaggccca tggaaccggc acggtgctgg gcgacccgat 3252121 cgaattcgag gccctggccg ccacgtatgg ccacggcggg gacgcatgcg cgttgggtgc 3252181 ggtgaaaacc aacatcggtc atctggaggc ggccgccggg atcgcggggt tcatcaaggc 3252241 gacgctggcg gtacaacgcg cgacgatccc gccgaatctg catttctcgc aatggaatcc 3252301 agctatcgat gccgcgtcga ccaggttttt cgttcccacg cagaactccc cgtggccaac 3252361 cgcggagggg ccgcgccggg cggcggtgtc gtcgttcgga ttgggcggga cgaacgcaca 3252421 cgtgatcatc gagcaaggta gcgagctggc tccggtatcc gaaggcggcg aggacaccgg 3252481 ggtgtcgacg ttggtggtga cgggtaagac ggcccagcgg atggccgcga cggcgcaggt 3252541 gctggccgac tggatggaag gtccgggcgc cgaggtggcc gtagctgatg tcgcccacac 3252601 ggtcaaccat caccgggccc gccaagccac gttcggcacc gtcgtagccc gtgaccgcgc 3252661 ccaggcgata gccggactgc gcgcgctggc cgccggccaa cacgctcccg gagtggtgag 3252721 ccaccaggac ggttcgccgg ggccgggcac cgtattcgtc tactccggcc gcggctcgca 3252781 gtgggccggg atgggtcgcc aattgttggc cgacgagccg gctttcgccg ccgcggtcgc 3252841 cgagctggaa ccggtgtttg tcgagcaagc cggcttctcg ctgcgcgacg tgatcgccac 3252901 cggcaaggag ctagtcggta tcgagcagat ccagcttggc ctgatcggca tgcaactgac 3252961 attgactgag ctatggcgct cctacggggt gcagcccgac ctggtgatcg gccactccat 3253021 gggcgaggtg gccgccgccg tggtcgccgg agcgctgact ccggccgagg gtctgcgggt 3253081 gaccgccacc cgcgcacggt tgatggcgcc attgtccggc cagggcggca tggcactgct 3253141 gggactcgat gctgcggcca ccgaagcgtt aatcgcggac tacccgcagg tgacagtggg 3253201 gatctacaac tcgccgcggc agaccgtgat cgccgggccg accgaacaaa tcgatgagtt 3253261 gatcgcccgg gtgcgcgcgc aaaaccggtt tgccagtcgg gtcaatatcg aagtcgcccc 3253321 gcacaatccg gccatggatg cgctgcagcc ggcgatgcgt tcggagctgg ccgatctgac 3253381 cccacggacc cccaccatcg gaatcatctc caccacctac gcagacttgc acacccaacc 3253441 gatcttcgac gccgaacact gggccaccaa catgcgcaac cccgtgcgct tccagcaggc 3253501 catcgcttcc gccggtagcg gcgccgacgg cgcctaccac accttcatcg agatcagcgc 3253561 acacccgctg ctgacccagg cgattgccga caccttggaa gacgcgcacc gcccaaccaa 3253621 gtccgcagcg aaatacttga gcattggcac cttgcagcgt gatgccgatg acacggtcac 3253681 cttccgcacc aacctctaca ccgccgacat cgcccaccca ccgcatacct gtcacccgcc 3253741 cgagccgcac cccaccatcc ccaccacacc ctggcaacac acccaccact ggatcgccac 3253801 cacgcacccg agcacggcag cgccagaaga tccgggcagc aataaggttg tggtgaacgg 3253861 acaatcgaca tccgagagcc gtgcgctcga agactggtgc caccagctgg cctggccgat 3253921 ccgcccggca gtcagcgccg acccgcccag caccgccgcc tggctcgtgg tggcagacaa 3253981 cgaactctgc cacgagctgg cccgtgcggc cgattctcgg gtagacagcc tctcgccgcc 3254041 ggcgctcgca gcaggcagcg atccggccgc actgctcgac gcgctgcgcg gtgtggacaa 3254101 cgtgctctac gctccacccg tccccggtga actcctcgat attgaatcgg cctaccaggt 3254161 tttccacgca acgcgacggc tagccgccgc gatggtcgcc agcagcgcca cggctatttc 3254221 cccgccgaag ttgttcatca tgacccgcaa cgcccagccc atctcggaag gcgaccgagc 3254281 caaccctggc cacgctgtgc tgtggggtct cggccggtcg ctggcactag agcatcctga 3254341 aatctggggc ggcataatcg atcttgacga ttcgatgccc gcagagctgg ccgtgcggca 3254401 tgtgctgact gcagcccacg gtaccgacgg ggaggatcag gtcgtatacc ggtcgggcgc 3254461 acgccatgta ccccggctgc agaggcgaac tcttccgggg aaaccggtca cgttgaatgc 3254521 cgacgccagc cagctcgtca tcggtgcgac cggcaacatc ggaccgcatc tcatccgaca 3254581 gctcgcgcgg atgggggcta agacaatcgt cgcgatggct cgcaagcccg gcgcgctcga 3254641 cgagttgacc caatgtctcg ctgcgaccgg aacagatctc atcgcggtgg ccgccgatgc 3254701 gaccgatccc gccgccatgc aaaccctgtt cgaccgattc ggcacggagc taccgccact 3254761 ggagggaatc tatctggcgg cctttgcggg ccgcccagcg ctgctgagcg agatgaccga 3254821 cgacgacgtg accaccatgt ttcgtcccaa gttggacgcc ttggcgttgt tgcaccgacg 3254881 gtcactgaag agcccagtgc gccacttcgt tttgttctct tcggtgtcag gtctgctggg 3254941 ttctcgatgg ctcgcccatt acaccgcgac cagcgccttc ctggacagct tcgccggcgc 3255001 gcgtcgcacc atgggcctgc cggccaccgt cgtcgactgg ggactgtgga agtcgctggc 3255061 cgatgtgcaa aaagacgcga ctcaaatcag cgcggaatcc gggctgcaac ccatggctga 3255121 cgaggtggcc atcggcgcgc taccgctggt gatgaacccc gatgcggcag tcgcgaccgt 3255181 ggtggttgcc gcggactggc ccttgttggc cgcggcatat cgaacgcggg gagcccttcg 3255241 catagtcgac gacctgttgc cggcaccgga agacgtcggg aagggcgaaa gcgaattccg 3255301 cacatcgttg cgtagctgcc cggcggagaa acgacgggac atgttgttcg accatgtggg 3255361 cgccttggcc gccacggtga tgggaatgcc gcccacggag ccgctcgatc cgtcggccgg 3255421 cttcttccaa ctcggcatgg actcgctaat gagcgtgaca cttcagcggg cgttgtcgga 3255481 aagcctgggc gagttcttgc cggcgtccgt ggttttcgac tatccgaccg tttacagcct 3255541 caccgactac ctggccaccg tcctgcctga gctcctcgaa attggggcaa ccgcagtcgc 3255601 aacccagcaa gccaccgact cctaccacga actgaccgaa gccgagttgt tggaacaact 3255661 ttcggaacga ctaagaggaa cacaatgacc gcagcgacac cagatcgccg agcgatcatc 3255721 accgaggcgc tgcacaagat cgatgatctc acggcgcgcc tggaaatcgc cgaaaaatcc 3255781 agcagcgaac cgatcgcggt gatcggcatg ggttgccggt tcccgggcgg ggtcaacaac 3255841 cccgaacagt tctgggattt gttgtgcgcc ggccgaagcg gcatcgtccg ggttcccgcg 3255901 cagcggtggg acgccgacgc ctactactgt gatgatcaca ccgtgccggg gaccatctgc 3255961 agcaccgaag gcggttttct caccagctgg cagccagatg agttcgatgc ggagttcttc 3256021 tcaatctccc cgcgcgaagc ggcggcgatg gacccgcagc agcgattgtt gattgaagtt 3256081 gcgtgggaag cgctagaaga cgcgggcgtc ccgcaacaca ccattcgcgg tacgcaaacc 3256141 tcggtattcg tcggtgtcac cgcctacgac tacatgctca cgctggcggg ccggctacga 3256201 cctgttgacc tcgacgcgta catcccaacc gggaactcgg cgaacttcgc cgccggacgg 3256261 ctggcctaca tcctcggggc acgcggaccc gcggtggtca tcgacacggc ctgctcatcg 3256321 tcgttggtgg cggtgcacct ggcatgccag agcctgcgcg ggcgggaaag cgatatggcg 3256381 ttggtgggtg gaaccaacct tttgctgagc ccgggaccca gcatcgcttg ctcgcgatgg 3256441 gggatgctgt caccggaggg gcggtgcaag accttcgatg cgtccgccga tggatacgtg 3256501 cgcggcgagg gtgccgcggt ggtggtgctc aagcggctgg atgacgcggt gcgcgacggc 3256561 aaccgcattc ttgccgtggt acgcggttcg gcggtcaacc aggacggtgc cagcagcgga 3256621 gtgaccgttc ccaacgggcc agcgcaacag gcgttgctcg ccaaagcatt gacgtcgtcg 3256681 aagttgacag cggccgatat cgactacgtc gaggcccatg gaactggtac tccgctgggc 3256741 gacccgatcg aactcgattc actgagtaag gttttcagcg atcgagcggg ttcggatcag 3256801 ttggtgattg gatcggtgaa gaccaatctc ggtcacctgg aagcggcggc cggtgtcgcc 3256861 gggctgatga aagccgtgct cgcggtacac aacggctaca ttccgcggca tcttaacttc 3256921 caccagctga caccacatgc aagtgaggcc gcatctcggc tgaggatcgc cgccgatggt 3256981 attgactggc caaccaccgg tcgacctcgc cgggcggggg tgtcgtcgtt cggcgtcagt 3257041 gggacgaatg cacacgtggt gatcgagcag gcacccgatc cgatggccgc tgcgggaacg 3257101 gagccgcagc gcggccccgt tcccgcggtg tcgacgctgg tggtgttcgg caagaccgca 3257161 ccgcgggtgg ctgcgacggc atcggtgctg gcagattggc tggacggccc cggcgcggcg 3257221 gtgccgctgg ccgatgtcgc gcacaccctc aaccatcacc gggcccgtca gaccaggttc 3257281 ggcacggtag ccgctgtcga tcggcgccaa gcggtgatcg ggttacgcgc gctggccgcg 3257341 ggtcaatccg cccccggggt ggtggcaccc cgcgaaggct ccatcggagg cggcacggtg 3257401 ttcgtctact cgggacgagg atcgcagtgg gccggaatgg ggcgccaact gctggccgac 3257461 gagccggcat tcgccgctgc catcgccgaa ctggagccgg aattcgttgc tcaaggcggg 3257521 ttttcgctgc gcgacgtgat cgccggcgga aaagagttgg ttggcatcga acagatccag 3257581 ctgggactga tcgggatgca gctggcgctg accgcgttgt ggcgctcata cggcgtgaca 3257641 cccgatgcgg tgataggtca ctcgatgggc gaagtggccg ccgcggtggt ggccggggcg 3257701 ctgaccccgg cccagggatt acgggtgacc gcggtccggt cgaggctgat ggcgccgctg 3257761 tccgggcagg gcacgatggc gttgctggaa ctcgacgccg aagccactga ggcgctgatt 3257821 gccgactacc ccgaggtgag cctggggatc tatgcctccc cacgccaaac cgtgatttcc 3257881 gggccgccgc tattgatcga cgagctcatc gacaaggtgc gccaacagaa cggcttcgct 3257941 acccgagtca acatcgaggt ggccccccac aacccggcca tggatgcact gcaaccggcg 3258001 atgcgttcgg aattggccga tctcaccccg caaccgccga ccatcccgat catctccacc 3258061 acctacgccg acctcggcat ttccctgggt tccggcccca ggttcgacgc cgagcactgg 3258121 gcaaccaaca tgcgcaaccc ggtacggttc caccaggcca tcgctcatgc cggcgccgat 3258181 caccacacct tcatcgagat cagcgcccac ccgctgctga cccactcgat cagcgacacc 3258241 ctgcgcgcca gctacgatgt cgacaactat ctgagcatcg gcaccttgca acgcgacgct 3258301 cacgacaccc tcgagttcca cacgaacctc aacacgaccc acaccaccca tcccccccag 3258361 actccccacc cccccgaacc ccaccccgtg ctgcccacca ccccatggca gcacacccag 3258421 cactggatca ccgccacgtc ggccgcttac cacaggcccg acacccaccc gttgcttggc 3258481 gtcggtgtca ccgaccccac taacggcacc cgggtttggg aaagcgagct cgaccctgat 3258541 ctgctgtggc tcgccgatca cgtcatcgac gatctcgttg tgctgcccgg ggcggcctac 3258601 gctgagatcg cgctggcggc cgcgaccgac accttcgcag tcgagcaaga tcagccctgg 3258661 atgatcagcg agctcgacct tcggcagatg ctgcatgtga ccccaggcac cgtgttggtc 3258721 accacgctca ccggcgacga gcagcgatgc caggtcgaaa tacgcacccg cagcgggtct 3258781 tcgggatgga ccacccacgc caccgccacc gttgcccgcg ccgagccgtt agcaccgctg 3258841 gatcacgaag gacagcggcg cgaggtaacc actgccgacc tcgaggacca actggatccc 3258901 gacgacctgt atcagcgcct gcgcggcgcc ggccaacagc acggacccgc gtttcaaggc 3258961 atcgtggggc tggccgtcac gcaagctggc gtggcccgtg cgcaagtacg gctacccgca 3259021 tcggccagaa cgggttcccg tgagttcatg ctgcacccgg tgatgatgga tatcgcgttg 3259081 cagacactgg gagccacccg gacggcgacc gatctggccg gcggccagga cgcccggcag 3259141 ggcccatctt ccaactcggc cttggtggta ccggtgcgtt tcgccggtgt ccacgtgtac 3259201 ggcgatatca cccgcggggt tcgcgcggtc ggctctctgg ccgcagccgg tgaccggctg 3259261 gtcggcgagg tagtcctgac cgacgcgaat ggccaaccgc tgctggtcgt cgatgaagtc 3259321 gagatggcgg tgctcggatc cggcagtggc gcaacggaac tcaccaaccg cctattcatg 3259381 ttggagtggg agcccgcacc gctggaaaag accgccgagg ctacgggtgc cctgttgctg 3259441 atcggtgacc ccgccgcggg tgacccgctg ctgcccgcgc tgcagtcgtc gctgcgcgac 3259501 cgcatcaccg acctcgagct ggcatccgcg gccgacgaag ccacgctgcg cgcggcgatc 3259561 agccgaacct cctgggacgg gatcgttgtg gtctgtccgc cccgagcgaa cgacgaatcg 3259621 atgccggacg aggctcaact ggagttggca cgcacacgca cgctgctggt cgccagcgtg 3259681 gtcgagaccg tgacgcgaat gggtgcccgc aagagccccc gactgtggat cgtcacccgt 3259741 ggcgctgcac agttcgacgc aggcgagtcg gtcacgttgg cgcagaccgg cctacgtggc 3259801 atcgcacggg tgctgacatt tgagcattcg gagttgaata ccaccctcgt agatatcgaa 3259861 ccggacggca ccggctcgct ggccgccctg gccgaggagt tgcttgccgg ttccgaggcc 3259921 gacgaggtcg ccttgcgcga cggtcaacgc tatgtcaacc ggctggtgcc cgcacccacc 3259981 acgaccagtg gtgatctcgc cgccgaagct cgccaccagg tggtgaacct ggacagctcg 3260041 ggcgcttcca gggcagctgt ccgactgcag atcgatcaac ccggacggct ggacgcacta 3260101 aacgttcacg aggtgaaacg gggcagaccg caaggcgatc aagtcgaggt tcgcgtcgtc 3260161 gccgccggac tcaacttcag cgacgtgctc aaagcgatgg gcgtgtatcc gggactcgac 3260221 ggtgccgcgc cggtgatcgg cggcgaatgt gtcggctacg tgacggccat cggtgacgag 3260281 gttgacggcg tcgaggtcgg acagcgagtt atcgcattcg gccctggcac attcgggacc 3260341 catctgggga ccatcgccga tctcgtcgtc ccaattccgg acacgctagc cgacaacgag 3260401 gcggccacgt tcggcgtcgc ctatctcacc gcctggcact cgctgtgcga ggtcgggcgc 3260461 ctatcccccg gcgaacgcgt gctcatccat tccgccaccg gcggtgttgg aatggcggcg 3260521 gtctcgatcg cgaagatgat cggcgcccgc atctacacga cggccggttc ggacgccaaa 3260581 cgggaaatgc tttccaggct cggtgtcgag tacgtcggcg actcgcgaag cgtggatttc 3260641 gctgacgaga tcctcgagct gacagacggc tacggtgtgg acgtcgttct caattcgctg 3260701 gcgggcgagg cgattcaacg cggcgtgcag atccttgcgc ccggtggccg gttcatcgaa 3260761 ctgggcaaga aggacgtcta cgccgatgcc agcttgggct tggccgcgct agccaagagc 3260821 gcgtccttct ccgtggtcga cctcgacctg aatctcaagc tgcagccggc gcgctaccgc 3260881 caactcctgc aacacatcct gcagcacgtg gcggatggca aactcgaggt acttcccgtc 3260941 accgcattta gcctgcacga tgcggccgac gcattccggc ttatggcatc cggtaaacac 3261001 accggcaaga tcgtcatctc gataccccag cacggcagca tcgaggcgat cgctgccccg 3261061 ccaccacttc ctctggtcag ccgcgacggc ggctacctca tcgtcggcgg tatgggtggt 3261121 ctcggattcg tcgtcgcgcg ctggctggct gagcaaggtg cgggactgat tgtcctcaac 3261181 ggacgctcgg cccccagcga cgaggtggca gccgctatcg cggagctgaa cgcctccggt 3261241 agccggatcg aggtgatcac cggcgacatc accgagccag acaccgccga gcggctggtg 3261301 cgggcggtcg aagacgccgg gttccggctg gccggggtgg tgcacagcgc gatggttctc 3261361 gccgacgaga tcgtgttgaa catgaccgat tccgccgctc ggcgagtgtt cgccccgaag 3261421 gtcaccggca gctggcggct tcatgtggcc accgccgcgc gcgacgtcga ctggtggctg 3261481 accttctcct cggccgccgc gctgctgggc actcccgggc agggcgcgta cgccgccgcc 3261541 aactcgtggg tcgacggcct ggtcgcgcat cggcgctcgg ccggacttcc cgctgtcggg 3261601 atcaactggg gcccgtgggc cgacgttgga cgcgcgcagt tcttcaaaga cctcggggtg 3261661 gagatgatca acgccgagca ggggcttgcc gccatgcagg cggtactcac cgccgatcgc 3261721 gggcgcaccg gtgtgttcag cctcgacgcg cggcagtggt tccaatcgtt ccccgctgtg 3261781 gcggggtcct cgctgttcgc gaagctgcat gactcggcgg cccgcaaaag tgggcagcgg 3261841 cgcggcgggg gcgcgattcg cgctcagcta gacgccctcg acgcggccga acgcccaggc 3261901 cacctcgcgt ccgcgatcgc cgacgagatc cgtgcggtgc tgcgctcagg cgatcccatc 3261961 gatcaccacc gaccgctgga aaccctggga ctcgactcgc tgatgggcct ggaattgcgc 3262021 aatcggctgg aagcaagtct gggcatcacg ttgccggtcg cgttggtgtg ggcatacccg 3262081 acgatcagcg atctcgcgac cgccctgtgc gaacgaatgg actacgcgac acccgcggct 3262141 gcgcaggaga tttccgatac agaacccgaa ctgtccgacg aggagatgga tttgctcgcc 3262201 gatctggttg acgccagcga gctggaagct gcgacgcgag gcgagtcatg acaagtctgg 3262261 cggagcgcgc ggcgcaactg tcgccgaacg cgcgagcggc cctggcgcgc gagctcgtcc 3262321 gtgcgggtac gaccttcccg accgacatct gcgagccggt ggcggtggtg ggcatcggct 3262381 gtcgctttcc ggggaatgtg actgggccag agagcttttg gcagctactg gccgacggtg 3262441 tggacacaat cgagcaggtg ccgcctgatc ggtgggatgc ggacgcgttc tacgatcccg 3262501 atccttcggc gtcgggtcgg atgacgacga aatggggtgg tttcgtttcc gatgtcgacg 3262561 cgttcgacgc cgactttttc ggaatcactc ctcgggaagc cgtggcgatg gacccgcagc 3262621 atcggatgct gctcgaggtt gcctgggaag cgttggagca cgcgggtatt ccgccggatt 3262681 ccttgagcgg cactcgaacc ggcgtgatga tgggtctgtc gtcgtgggac tacacgatcg 3262741 tcaatatcga gcgcagagcc gacatcgacg cgtacctgag caccggaacc ccgcactgtg 3262801 ccgcggtggg gcggatcgcg tatctgttgg gattgcgtgg tccggccgtc gccgtagata 3262861 ccgcttgttc gtcgtcgctg gtggcaattc acttggcgtg tcagagcctt cgcctgcgtg 3262921 aaaccgacgt ggcattggcg ggcggggtgc agctcacctt gtcaccgttc accgccatcg 3262981 cgctgtccaa gtggtcggcg ctgtcaccga ccggccgatg caacagcttc gacgccaacg 3263041 cggatggatt cgtgcgcggc gagggctgcg gcgtggtggt gctcaagcgg ttggccgacg 3263101 cggtgcgcga ccaggaccgg gtgcttgcgg tggtccgcgg ttcggcaact aactccgatg 3263161 gtcggtccaa cggcatgacc gcaccgaacg cgctggcgca gcgtgacgtg atcacatccg 3263221 ccctcaagct tgcggatgtt acccctgaca gcgtgaacta tgtcgaaaca cacggcaccg 3263281 gaacggtgtt gggggacccc atcgagttcg agtcgctggc ggccacttat ggcctgggta 3263341 aaggccaggg cgagagcccg tgcgcattgg ggtcggtcaa gaccaacatc ggccacctgg 3263401 aggcggccgc cggtgtggct ggattcatca aggcggtgct ggcggtgcaa cgtgggcaca 3263461 ttccccgcaa cttgcacttc acccggtgga acccggccat cgacgcgtcg gcgacgcggc 3263521 tgttcgtgcc gaccgaaagc gccccgtggc cggcggctgc cggtccacgc agggctgcgg 3263581 tgtcatcgtt cggcctcagc gggaccaacg cgcacgtggt ggtcgagcag gcacccgaca 3263641 ccgcagtagc cgcagccggc ggcatgccgt atgtttcggc gctgaacgtc tccggcaaga 3263701 cggccgcgcg ggtggcgtcg gcggcggcgg tgctggccga ctggatgtcg gggccgggcg 3263761 cggcggcacc actggccgac gtggcacaca cgttgaaccg gcaccgggcc cggcacgcca 3263821 agttcgccac cgtcatcgcg cgtgaccgcg ccgaggcgat cgcggggttg cgagcgctgg 3263881 cggccggaca accacgcgtt ggggtggtgg attgcgacca gcatgccggt gggcctggcc 3263941 gggtttttgt gtattcgggt cagggctcgc agtgggcgtc gatgggccag cagttgctgg 3264001 ccaacgaacc ggcgttcgcc aaggcggtag ccgagctgga tccgatattc gttgaccagg 3264061 ttggcttttc gctgcagcaa acgcttatcg acggcgacga ggtggtgggc atcgaccgca 3264121 tccagccggt gctggtcggg atgcagttgg cgctgaccga gttatggcgg tcctatgggg 3264181 tgattccaga tgccgtgatc gggcactcga tgggtgaggt gtcggcggca gtggtggccg 3264241 gcgcgttgac gcccgagcag ggcttgcggg tcatcaccac ccggtcgcgg ttgatggcgc 3264301 ggctgtcggg gcagggagcg atggcgctgc tcgagctgga tgccgacgcc gccgaggcgc 3264361 tgattgccgg ctatccgcag gtgacgctgg cggtgcatgc gtcaccgcgc cagacggtga 3264421 tcgccgggcc gcccgagcag gtggacacgg tgatcgcggc ggtagcgacg caaaaccggt 3264481 tggcgcgccg cgtcgaagtc gacgtggcct cccatcaccc gatcatcgat cccatactgc 3264541 ccgagttgcg aagcgcgtta gcggatttga ctccgcagcc gccgagcatc ccgatcattt 3264601 ccactacgta cgaaagcgcg cagccggtgg cggatgccga ctattggtcg gccaacctgc 3264661 gcaacccggt gcgattccac caggccgtca ccgccgccgg tgtcgaccac aacaccttca 3264721 tcgaaatcag ccctcacccc gtgctcacgc acgcactcac cgacaccctg gatccggacg 3264781 gcagccatac agtcatgtcg acgatgaacc gcgaactgga ccagacgctg tatttccacg 3264841 cccaactcgc cgcggtcggt gtggctgcgt ccgagcacac caccggtcgc cttgtcgacc 3264901 tgccccccac accgtggcac catcagcgat tctgggtcac ggatcgttcg gcgatgtccg 3264961 agctggccgc gacccacccg ctcctgggcg cgcacatcga gatgccgcgc aacggagacc 3265021 atgtctggca gaccgatgtc ggcaccgagg tctgtccctg gttggcagac cacaaggtgt 3265081 tcggtcaacc catcatgccg gccgcggggt tcgccgagat cgccttggcg gcggccagcg 3265141 aagccctcgg cacagccgcc gacgccgtcg cacccaacat cgtgatcaac cagttcgagg 3265201 tggagcagat gctgcccctc gacggccaca cgccgctaac gacgcagtta attcgcggcg 3265261 gggacagcca gattcgggtc gagatctatt cccgcacgcg tggcggagag ttctgccgac 3265321 acgccacggc caaggttgaa caatcgccgc gcgaatgtgc gcacgcgcac ccggaagccc 3265381 aaggtcccgc caccgggaca acagtgtcgc cggccgattt ttatgccctg ctccgccaaa 3265441 ccggccaaca ccatggtccg gcgttcgcgg ccttaagccg gatcgtgcgc ctggccgatg 3265501 gttccgcgga aaccgagatc agcattcccg acgaggcgcc gcgccatccc gggtatcggc 3265561 tgcaccccgt ggtattggat gcggcattgc aaagcgtggg tgccgcgata cccgacggcg 3265621 agatcgcggg gtcggcggaa gccagctatc tgccagtgtc gttcgagacc atccgggtgt 3265681 accgcgacat cggtcggcac gtcaggtgtc gtgcccacct gacaaacctc gacggcggca 3265741 ccggaaagat gggcaggatc gtcctaatca acgacgccgg ccacatagcg gccgaagtgg 3265801 acggcatcta tctgcgtcgt gtcgaacgcc gtgcggtacc cctgccacta gagcagaaga 3265861 tcttcgatgc cgaatggacc gaaagcccga tcgcagccgt gccggctccg gagccagctg 3265921 ccgagacgac gcggggaagt tggctggtac tcgccgatgc aacggtggat gcgccaggca 3265981 aggcccaggc caagtcgatg gccgacgact tcgtgcagca gtggcgctca ccgatgcggc 3266041 gggtgcacac cgccgatatc cacgacgaat cggcggtgct ggccgcattt gcagaaacgg 3266101 caggcgatcc cgagcacccg ccggttggcg tggtggtgtt cgtcggcggt gcctcgagtc 3266161 gactggacga cgagctggcg gcggcgcgcg acacggtgtg gtcgatcacc acggtggttc 3266221 gtgcggtcgt cggcacgtgg cacggccgat caccgcggct atggctggtc accgggggcg 3266281 gactttccgt tgccgacgac gagccgggaa cacccgcggc ggcttccttg aaagggctgg 3266341 tgcgggtgct cgccttcgag cacccggaca tgcgcaccac cctggtcgat ctggacatca 3266401 cacaagaccc gctgaccgcg ctgagcgcgg aactgcggaa tgccgggagt gggtcgcgcc 3266461 atgatgacgt gatcgcgtgg cgcggcgagc gcaggttcgt cgaacggctg tcgcgcgcca 3266521 cgatcgatgt atccaaaggg catccggtgg tgcgccaggg agcgtcgtac gtcgtcaccg 3266581 gcggcctcgg cggtctcggc ctggtcgtcg ctcgttggct ggtggaccgc ggcgccggcc 3266641 gggtggtgct gggtggccgc agcgatccca ctgacgagca gtgcaacgtc ctggccgaac 3266701 tgcagacccg cgccgagatc gtggttgtcc gtggcgacgt ggcatcgccg ggggtggcag 3266761 aaaagctgat tgagacggcc cgacagtctg ggggccaatt gcgcggcgtc gtgcacgccg 3266821 ccgcggtcat cgaagacagc ctggtgttct ctatgagcag ggacaaccta gaacgggtgt 3266881 gggcacccaa ggccaccggt gcgctgcgca tgcacgaagc caccgctgac tgcgagctcg 3266941 actggtggct cggattctct tccgccgctt cgctattggg ttctcccggg caagcggcct 3267001 acgcgtgcgc cagcgcgtgg ctggacgcgc tggtcggatg gcgcagggca tccggcctgc 3267061 cggccgcggt gatcaactgg ggtccgtggt cggaggtagg cgtcgcccag gccttggtgg 3267121 gcagtgttct cgacacgatc agtgtcgcag aaggcatcga ggctctcgac tcattgcttg 3267181 ccgccgaccg gatccgcact ggagtggctc ggctgcgtgc cgatcgggcc ctggtcgcat 3267241 tcccggagat ccgcagcatc agctacttca cccaggtggt cgaggagctg gactcggcgg 3267301 gtgacctcgg cgactggggc gggcccgacg cgcttgccga cctcgacccg ggcgaggcgc 3267361 ggcgcgcggt gaccgagcgg atgtgtgcgc gcatcgctgc ggtgatgggc tacactgacc 3267421 agtcgactgt cgaacccgcc gtgcccttgg acaagcccct gaccgagctg gggctggatt 3267481 ctctgatggc ggtacgaata cgcaacggcg cgcgggcgga tttcggcgtg gaaccgccgg 3267541 tagcgctgat actgcaaggc gcgtccttgc atgacctgac ggcggactta atgcgccaac 3267601 tcgggctcaa tgatcccgat ccggcgctca acaacgctga cactattcgc gaccgggcgc 3267661 gccagcgcgc ggcagcgcga cacggagccg cgatgcggcg ccgacctaaa cctgaagtac 3267721 agggaggata agacctgtga gcatccccga gaacgcgatc gcggtggtcg gcatggccgg 3267781 ccgatttccg ggcgccaagg atgtttcggc gttctggagc aaccttcggc gcggtaagga 3267841 gtcgatcgtc accctgtccg aacaggagct gcgcgacgcc ggcgtcagcg acaagacgct 3267901 ggccgatccg gcgtatgtgc gtcgcgcccc gcttcttgac gggatcgacg agttcgacgc 3267961 cggcttcttc gggttcccgc cgctggccgc gcaggtgctg gatccccaac accggttgtt 3268021 cctgcagtgt gcatggcatg cgctcgagga cgcgggcgct gaccccgcac ggttcgacgg 3268081 ctcgatcggc gtatacggaa ccagctcccc cagcggctat ctgctgcaca acctgctgtc 3268141 gcatcgcgac ccgaacgctg tgttggccga gggactcaac ttcgaccagt tcagcctgtt 3268201 cttgcagaat gacaaggact ttctggcaac ccggatttcg cacgcgttca acctgcgcgg 3268261 gccgagcatc gcggtgcaaa ccgcgtgttc atcgtcgctg gtagcggtgc atctggcctg 3268321 cctgagcctg ctatccggcg aatgcgacat ggcgttggcc ggcgggtcgt cgctatgcat 3268381 cccgcaccgt gtcggctact tcacctcacc gggatcgatg gtgtcggcgg tgggccactg 3268441 tcggcccttc gacgtgcggg ccgacggcac ggtcttcggc agcggtgtcg ggttggtggt 3268501 gctcaagccg ctggcggccg ccatcgacgc cggagaccgg attcacgccg tcatccgcgg 3268561 atcggcgatc aacaacgacg gatcggcgaa gatggggtat gcggcgccca acccggccgc 3268621 tcaagccgat gtcatcgccg aagcccatgc ggtgtccggc atcgattcgt cgaccgtgag 3268681 ctatgtcgag tgccacggaa ccggcacccc gctcggtgat cctatcgaaa tccagggcct 3268741 gcgagcggcg ttcgaggtgt cgcagacgag ccgttcggcc ccttgtgttc tggggtcggt 3268801 caagtcgaac atcggccacc tggaagttgc tgccggcatc gcgggtctga tcaaaacgat 3268861 tctgtgccta aagaacaagg cactacccgc gacgctgcac tacaccagcc cgaacccgga 3268921 actgcgcttg gaccaaagtc cgttcgtcgt gcaaagcaag tacggcccct gggagtgcga 3268981 cggcgttcgt cgtgccgggg tgagttcgtt cggggtcggg ggtaccaacg cgcacgtcgt 3269041 cttggaggag gcgccagcag aagcatcgga ggtttcagcg cacgccgagc cggctggccc 3269101 tcaggtaatc ctgctctcgg cgcaaacggc cgcggcgctc ggcgagtcgc ggaccgccct 3269161 ggccgcggcg ctagaaacgc aagacggccc gcgcctgtcc gacgtggcct acacgctcgc 3269221 ccggcgccgc aagcacaacg tcacgatggc cgccgtcgtg cacgaccgcg agcacgcggc 3269281 caccgtgctg cgggcggccg agcacgacaa cgttttcgtt ggcgaagccg cccacgatgg 3269341 ggagcatggc gatcgcgccg acgccgcacc cacgtcggat cgcgtcgttt tcctgtttcc 3269401 cggacagggc gctcagcacg tcggaatggc aaaagggctc tatgacaccg agccggtctt 3269461 cgcccaacac ttcgacacct gcgccgccgg attccgcgac gagacaggca tcgacttgca 3269521 tgccgaagtg ttcgacggga ccgcaacaga tcttgagcgc attgaccgtt cgcaaccggc 3269581 attgttcacg gtggaatacg cgctcgcgaa gttggtcgac actttcggcg tgcgcgccgg 3269641 ggcgtacatc ggatacagca ccggcgaata catcgcggcc accctggccg gcgtattcga 3269701 cctgcagaca gcgatcaaaa cggtgtcgct gcgcgcccgc cttatgcatg agtcgccgcc 3269761 cggtgccatg gtcgcggtgg ctcttggccc cgatgacgtc acgcagtacc tgccaccgga 3269821 ggtcgagctg tccgcggtaa acgatcctgg taactgtgtg gtcgccgggc ccaaagacca 3269881 gatccgtgca ctgcgccaac gtcttaccga ggcagggatt cccgttcgcc gcgtccgggc 3269941 aacccacgcg ttccatacca gcgcgatgga tcccatgctg ggccaattcc aagaattcct 3270001 gtcccgtcaa cagctacgtc ctccgcgcac accgctgctg agcaacctca ccggtagctg 3270061 gatgtccgac cagcaagtag tcgatccggc cagctggacg cgtcaaatca gctcccccat 3270121 caggttcgcc gacgagctgg acgtggtgct ggcagctcca agtcgaatcc tggtcgaggt 3270181 tggtccgggc ggcagcctga ccggttcggc tatgcgccac ccgaagtggt cgaccacgca 3270241 ccgcaccgtt cggcttatgc gccacccact gcaagacgtc gacgaccgcg acacttttct 3270301 gcgcgcgctg ggcgaactct ggtctgccgg agtcgaggtc gactggacgc cgcggcgtcc 3270361 ggcggtgccg cacctcgttt ccctgccggg ttatccattt gcccgtcaac ggcattgggt 3270421 cgaacctaac cacacggttt gggcgcaggc tcccggcgca aacaacggct caccggccgg 3270481 cactgcggat ggttccacgg ccgccaccgt cgatgcagcc cgcaacggag agtcgcagac 3270541 cgaggttacg ctgcaacgca tctggtcaca gtgcctcggc gtcagctcgg tcgatcggaa 3270601 cgccaatttc ttcgacctcg gcggcgattc tttgatggcg atcagcatcg cgatggccgc 3270661 cgccaacgag ggtctgacca tcacgccgca ggatctctac gaatacccga ccctggcctc 3270721 gctgacggcc gccgtcgacg cgtcgttcgc gtccagcggg ttggcgaagc ccccggaggc 3270781 acaagcgaac ccggcggttc cacccaacgt cacgtacttc ctcgaccgcg gattgcgcga 3270841 caccggccgc tgtcgtgtcc cgctgatcct gcgcctggat cccaagatcg ggctaccgga 3270901 tattcgagcg gtgctgaccg cagtggtcaa ccaccacgac gcattgcgcc tgcacctggt 3270961 cggcaacgat gggatatggg agcagcacat cgcggcaccc gcagaattca ccgggctttc 3271021 caaccggtcg gtgcccaacg gcgtggctgc aggcagcccc gaggaacggg ccgcggtctt 3271081 gggcatcctg gccgaactcc ttgaggatca aacggatccg aacgcgccgc tggctgccgt 3271141 tcatatcgcc gccgcgcacg gcggtccgca ctatctgtgc cttgccatac atgcgatggt 3271201 caccgacgac tcatcgcgcc agatcctggc gaccgacatc gtcaccgcgt ttggacaacg 3271261 gctggcaggc gaggagatca cgctggaacc ggtcagcacg gggtggcggg aatggtcact 3271321 gcgttgcgcg gccctcgcga cgcatccggc ggcgctggac actcgctcgt actggatcga 3271381 gaattcgacc aaggcgactt tgtggctggc cgatgccctt cccaacgcgc ataccgccca 3271441 tccgccccgc gccgacgagc tcaccaagtt gtcgagcacg ctaagcgtcg agcagacatc 3271501 cgagctggac gacggccggc gcaggttccg ccggtcgatt cagacgatcc tgctggccgc 3271561 cctcggccgc acaatagctc agacggtagg tgagggtgtg gtcgccgtgg agctcgaagg 3271621 cgagggccgc tcggtgctgc ggccggatgt cgacctgcgc agaacggtcg gctggttcac 3271681 gacgtactac ccggtaccgc tggcatgcgc aacagggctg ggcgcgcttg cgcagctgga 3271741 cgcggtgcac aacactctta agtccgttcc gcactacgga attggatacg ggctgctgcg 3271801 ctacgtttac gccccgaccg gacgtgtcct gggcgctcag cgcacacccg acattcactt 3271861 ccggtatgcg ggcgtgatcc ccgagctacc gtccggcgat gctccagtac agttcgactc 3271921 ggacatgacg cttccggtgc gcgaaccgat cccagggatg ggccacgcca tcgaacttcg 3271981 ggtgtatcgg tttggtggct cactgcatct cgattggtgg tacgacaccc gccggatccc 3272041 ggcggcaacg gcagaagcgc tggagcggac cttcccgctg gccctcagcg cgctgatcca 3272101 ggaggccatc gcggccgagc acacagagca cgacgacagc gagatagtcg gggaacccga 3272161 ggcgggcgct ctggtggacc tgtcgagcat ggatgccggc tgaggaggat cggatgcgca 3272221 acgacgacat ggcggtggtg gttaacgggg ttcgcaagac ctacggcaag ggcaagattg 3272281 tggccctcga tgacgtgagt ttcaaggtgc gccgcggtga agtgatcggg ctgctgggcc 3272341 ccaacggggc cggcaagacg accatggtgg acatcttgtc gacgctgacc cgaccggatg 3272401 ccggctcggc gatcatcgct ggctacgatg ttgtttccga accggccggt gtacgccgct 3272461 cgatcatggt caccgggcag caggtggccg tcgacgacgc gctttccggt gagcagaacc 3272521 tggtgttgtt tggtcgtctg tggggactga gcaagtccgc ggcgcgcaaa cgcgccgccg 3272581 aactgctcga gcaattcagc ctcgtacatg ccggaaagag gcgggtgggc acctactccg 3272641 gcggaatgcg ccgacgaata gacatcgcgt gcggattggt ggtccaaccc caggtggcgt 3272701 tcttagacga gcccaccacc gggctcgatc ccaggagccg gcaagctatt tgggatctgg 3272761 tggccagctt caagaagctg ggcattgcca cgttgttgac cacgcagtat ctcgaggagg 3272821 cggatgcgct cagtgaccgc atcatcctga tcgatcacgg cataatcatc gccgaaggca 3272881 ccgcgaatga actcaagcac cgcgccggcg acaccttctg cgaaatagtg ccccgcgatc 3272941 tgaaggatct ggacgctatc gtcgcggcgc tcggttcgct gttgcccgag caccacaggg 3273001 cgatgctgac gcccgactca gaccgcatta cgatgccggc gcctgacggc atacgtatgc 3273061 tcgtcgaggc agcgcgccgg atcgacgagg cgaggatcga gctagccgat attgcgctgc 3273121 gccgaccgtc actcgatcac gtattcctgg ccatgacgac cgatcccacc gagtctctga 3273181 cccatctggt gtcggggtcc gcgcgatgag cggcccggcc atagatgcga gccccgccct 3273241 gaccttcaac cagtcaagcg cgagcattca gcagcgacgc ttatcgaccg ggcgacagat 3273301 gtgggtgctc tatcggcgtt tcgccgcgcc gagcctactc aacggtgaag tactcaccac 3273361 ggtgggcgcg ccgataattt tcatggtggg cttctatatc ccgttcgcca taccgtggaa 3273421 ccaatttgtg ggtggcgcca gctcgggcgt cgccagcaac ttagggcaat acatcacgcc 3273481 gttggtcaca ctgcaggcgg tctcgttcgc cgcgatcggg tcgggctttc gagccgcgac 3273541 cgattcgctg ctaggcgtca atcgtcggtt tcagtccatg ccgatggccc cgttgacgcc 3273601 actgcttgcc cgcgtgtggg tggctgtgga ccgatgcttc acgggtttgg tgatatcgct 3273661 agtttgcggc tacgtcatcg gattccgttt tcatcgcggg gccctctata tcgtcggttt 3273721 ttgcctactg gttatcgcga tcggggctgt gctgtcattc gccgctgacc tggttggcac 3273781 cgttaccagg aacccagacg cgatgctgcc gctgctgagc ttgcccattt tgatcttcgg 3273841 actgctgtcc attggtctca tgccgttaaa gctgtttccg cactggatcc atccatttgt 3273901 tcgcaaccag ccgatctccc agttcgtcgc ggcgctgcgg gcattggccg gagataccac 3273961 caagacagcc tcacaggtga gttggcctgt gatggctccg acgttgacgt ggttgttcgc 3274021 tttcgtggtg atcctggcgc tttcatccac cattgttttg gctaggcggc catgatcacg 3274081 acgacaagtc aggaaatcga gcttgcaccc acacgtttgc caggctcgca aaacgctgct 3274141 cggctgttcg ttgcgcagac ccttttgcag accaaccggt tgctaactcg atgggcacgt 3274201 gactatatca ccgttatcgg agcgatcgtg ttaccgattc tcttcatggt ggtgttgaac 3274261 attgtgctag gtaacctagc ttatgtcgta acccacgaca gcgggctcta cagcattgtt 3274321 ccgctgatcg cactcggcgc cgcgatcact gggtcaactt ttgtcgcgat cgacctgatg 3274381 cgcgagcgct ccttcggact gcttgcccga ctgtgggtgc tgcccgtgca ccgagcatcg 3274441 ggcctgatct ctcgaatcct ggcaaacgcg attcggactc tggtcaccac tttagtgatg 3274501 ctaggtactg gggtggtatt gggtttccgg tttcgacaag gcctgatccc gagcctcatg 3274561 tggattagtg tcccggtgat actgggcatc gcaatcgcgg ctatggtcac taccgtcgcg 3274621 ctttacacag cacaaaccgt tgttgtcgaa ggcgttgagc tggtgcaagc aatcgcgatc 3274681 ttcttctcca cgggtttggt gccgctcaac tcgtatccag gctggattca gccgttcgtc 3274741 gcccatcagc cggtgagcta cgccatcgcg gcgatgcgcg gttttgcaat gggtggtccg 3274801 gtcctctctc cgatgatcgg gatgctggtg tggaccgcgg gtatctgcgt cgtatgcgcc 3274861 gtacccttgg ccattggcta ccgacgggcc agcacgcatt gaccagcacc gctggcccgg 3274921 gatgccgtga cgagttggga gtgttgagat gtttcccgga tctgtgatcc gaaagctgtc 3274981 gcacagcgag gaagtcttcg cgcagtacga ggtttttact tccatgacaa tccagctgcg 3275041 cggtgttatc gatgtcgatg cgctgtcgga tgccttcgac gccctcttgg aaacccaccc 3275101 agtcctggcc agccaccttg agcaaagctc cgacggcggt tggaatctcg ttgccgacga 3275161 cctgctgcac tctggaatct gtgtcatcga cggcacggcc gccaccaacg ggtcaccgtc 3275221 gggaaacgcc gaactacggc tcgaccagag cgtgtcccta ttgcatctgc agctgatcct 3275281 ccgcgaagga ggagccgagc tgacgctata cctccatcac tgcatggccg atggtcatca 3275341 cggggccgtt ctcgtcgacg agctgttctc ccgctacacc gacgcggtca ctaccggtga 3275401 ccccggcccg ataaccccgc agcccacgcc gctgtcaatg gaggctgtgc tggcacagcg 3275461 gggtatcagg aagcaagggc tttcgggagc tgaacgtttt atgtcggtga tgtatgccta 3275521 tgagatccct gccaccgaga cgccggcggt cctcgcgcat cctgggctgc cccaagctgt 3275581 tccggtcacc cgactctggc tttccaagca gcagacatcg gacctcatgg cgttcggccg 3275641 cgagcatcgc ctcagcctta acgccgtggt cgcggcagcc atcctgctga ccgagtggca 3275701 gctgcgcaac accccgcacg tcccgattcc ctacgtttac cccgtcgacc tgcgatttgt 3275761 tctagctccc ccagtggccc cgacagaagc taccaatctc ctcggggcgg cgtcttacct 3275821 cgctgagatc gggccgaata ccgacatcgt ggatctggca agcgatatcg ttgccacact 3275881 tcgggctgac ttggccaatg gtgtgattca gcagtcgggg ctccacttcg gcacggcatt 3275941 cgaaggaact cctcccggcc taccaccact tgtcttctgc actgacgcca cttcatttcc 3276001 caccatgcgc acaccgccgg gcctggagat cgaagacatt aagggccaat tctattgttc 3276061 gatcagcgtc cccctcgatc tgtactcgtg tgccgtttac gcaggacaac tgatcatcga 3276121 gcatcatggg cacatcgcgg aaccggggaa gtccctcgag gcgatacgtt cactgctgtg 3276181 caccgttccc tcggagtatg gctggatcat ggagtgacct aacgaaccag cccgccgatc 3276241 gggcttcggc cagatcacgc actcgcgtcc cgaaccgatc atcatatccg ccccagctgc 3276301 ggtcgcggct gacaagcctt accccgcagc tcacctcatg atctcaccac gaggcttgcg 3276361 gcacaacaga attcgaccgc tatgatgccg ccggtgccgc cgcctgctcc tcggccagcg 3276421 tgtccgccaa gtactgggcc aaagcgcggg cggtgttgtt tgtggcgatg accttggggg 3276481 tcaggcgtat cccggtctcg gtttcaacgt gggtacgcat ctcgagcatg cccagcgaat 3276541 ccaggccgta ctcgatgaat gagcggtcag cgtcgatcgt gcgacgcagg atcacactgg 3276601 cctgctcaac cagcagacgc cgtagccggc cggcccattc atcttgcggc agcgaaagga 3276661 gctccatgcg gaatttgctt gggccccttg accgctgccc agtggatgcg aacatttcac 3276721 cccacgggct gcgtcggaca aggtcggcca gccatggcgc cccgaggatc ggaatgtaac 3276781 cgctgtaggc gcggtcgtgg cgcacgagcg tctcgaaggc atacgcacct tcctccgggg 3276841 tgatcatgat ttcgcccccc tcggccaaga acgtggcgcg gccgacctcg ccccacgcac 3276901 cccacgcaat cgcgctgacc ggcaggccct gggcgcggcg ccagtgcgcg aagacgtcga 3276961 cccagctgtt ggccgccgcg taggcgccct gacccggcga gccgagcaat gccgctcccg 3277021 aggagaacaa gcagaaccag tccagcggct gaccgagggt ggcgcggtgt aggttccagg 3277081 atccgaacac cttgggcgac cagtcgcgat cgatgagctc atcggtgatg ttggtcagcg 3277141 tggcatcctc gaccaccgcc gccgagtgca gcacaccgcg cagcggaagc ccggtagcgg 3277201 tcgccgcact caccagccgg tccgccgtgt cgggttcggc gatgttgcca cactccacca 3277261 cgatgtcggc cccagccgcg cgcaggcctt cgatggtctg ccgcgctttg gggttgggct 3277321 gggaacgtgc ggtcagcacg atccggccac agcccgccgc ggccagcttc gaggcgaaga 3277381 acaggccgag gccacccagg ccgccggtga tgatgtagga gccgtcgcgg cggtacagcg 3277441 gagcttgctc cggggtgacc gccacgcttc tacggccgct acgcggtacg tcgagcacga 3277501 gtttgccggt gtgctcggcg ttgctcattg cccggatggc gtcggccgcc tcggccaacg 3277561 ggtaatgagt gcattgcggt gcggtcagca ccccgtctgc ggtgagcttg aacaccgtgg 3277621 ccagcaactc acggacccgg tcgggctggg tgaccgacat cagcgcgagg tccaagtagt 3277681 agaaggtcag tccgcgacgg aacgggaaca gccccagccg ggtgttgccg taaacgtcgg 3277741 ccttgccgat ttcgacgaag cgtccgccga aggccaacaa ctccagcccc gcacgttggg 3277801 cggcgccggt cagcgagttc agcacgatat ccacgccgta cccgtcggtg tcgcgccgga 3277861 tctgctcggc gaactcgacg ctgcgcgaat cgtagacatg ctcgacgccc atgtcgcgca 3277921 gcatggctcg cttcgcggga ttgccggcgg tcgcgaaaat ctccgctccc ttggcgcggg 3277981 caatcgatat ggccgcctgc cccacaccgc cggtggcgga gtgaatcaac actttgtcac 3278041 cggccttgat ctgagccagg tcgttgagcc cataccaggc ggtggcatgc gcggtggccg 3278101 ccgtgatcgc ctgctcatcg gtcaagccgg gcggcagcgt gaccgcgagg ttggcgtcac 3278161 aggtgaggaa cgtccgccaa cagccacctt cggagaaacc gccaacacga tcaccgacct 3278221 ggtgaccggt gacaccttcc ccgaccgcag tcaccacacc gacgaaatcc atacccaact 3278281 gcggctcgcg gtcatcgata atggggaatc gtccaaacgc gatcaaaacg tcggcgaagt 3278341 tgatgctgga catgctgacc gcgacttcga tttgcccggg gccgggcgga actcggtcac 3278401 tcgcaacgaa ttccaacgtt tgcaagtctc ccggcctgcg gacctgcacc cgcataccgt 3278461 cgtggtcggg atccaagacc gcggtgcgcc gctcttcatg gcccagcgga ctgggggtca 3278521 agcgggccac ataccagtcg ccattccgcc aggccgtctc gtcctcttcc gatccgctca 3278581 gcagctgctg ggccacccgc tcaacgtccg tgtgttcgtc cacatcgatc aaggtggtgc 3278641 gcagcatcgg atgttcactg ctgatcaccc gtagcagacc acgcaggccg gcctgctcca 3278701 ggttggctct ttctcccgag tcgtgcggct tcactatctg ggcttgtctg gtcaccacga 3278761 acaagcgcgg cagctcgccc tcgaattcag ccagttcccg ggtgatccga accaggtgac 3278821 ggacctgttc acgaccggcc agcagactgt gctcatcggg gtcgccgacg cgaggcccat 3278881 acacgatcac cacaccatcg cggccacgca gctggctgcc cagcttttcg aggccagctt 3278941 gatcgttggg cggggtgtcc tggaccgacc aggacaggct ggcgcattcg gtgccttggg 3279001 ggccgtggga cttcagcgcg tccgtcaacg tggaagccaa catgtcgggg gtgtcgacgg 3279061 cgttggaagt gtcgatcaat agccacgatc cagcctcgcc gtcgccaacc tcgggcagcg 3279121 ctcgctgctg ccatccgagg gtcagtagcc gctcgctgac taggcggtca cgctcgtcgc 3279181 gttcggaggt cccggttccc atgcgtagcc cacgcacggc caacaggacg gtcccgtgct 3279241 cgtccagcac gtcgaggtcg gcctcaccac ctcgggtccc gtcgttgaag gccttggtca 3279301 accgcgtgta gcagtagcgg gcattgcggg taggcccgta ggcacgcagg ctgcgcacac 3279361 ccaacggcaa cagcaggcca ccagtggccg taccggcctg gacgcccgcg ccgaccgact 3279421 ggaaacaagc gtccagcagc gccgggtgga ttcggtaggc gccctgctgg aaccggatcg 3279481 acgcgggcag cgcgacctcg gccagcaccg tcgcggctcc cgcctcggcg gtatgcgcgg 3279541 tggtcagacc accgaacgcg gcgcccaaag taacaccacg ctcggcgaac gattcccgca 3279601 tggcggtccc gttcacggcg tgcggatgcg cctgcagcag agcggtgatg tcgtaccccg 3279661 gcggcgggca gtcatcttcg gcggcgcgca gcgccgcggt ggcatgccgg gtggtttcac 3279721 cgtcccggtt ggtctccacg gtgaagttga cgacaccagg cgcgtcgatc gatgcgacgg 3279781 cgtcgatcgg ggtctgctcg tcgagcaaca acatctgctc aaaggtgatg tcgcgaacct 3279841 cagccgcttc gccgaagacc tcagcggccg cagccaaagc catctcgcag taggcggcgc 3279901 cgggaagggc ggcaacgtta tgcacctgat gatcgctgag ccaggacagc accgaggtgc 3279961 caacgtcgcc ctgccagacg tggcgctcag gttcctcagt cagccgcaca tgcgagccaa 3280021 gcaacggatg cacggtgatg gtgcaggcac cttgtgcccg ctgttcttgc ccatcatcgt 3280081 cgatgaatag gcgggcgtgg gtccacgccg gcagcggcgc atccaccagc cgcccagcgg 3280141 gatacagcgc cgaatagtcc aaagcggcgc ccgcgcggtg cagctccgtc agcaagccgc 3280201 gcagaccatg cggcagaggc tgctctcgcc gcatgccggc cagggcggcg accgacatgt 3280261 cgaggcttcg gcccgtctgt tcgacggcgt gggtaagcag cgggtggggc gacagctccg 3280321 cgaagacccg gtagccgtcc tccatcgcag cctgcaccgc cgcggcgaac tgcaccgtgt 3280381 tgcgcagatt gtccacccag taagcgccat cgcacaccgg ctgctcgcgc gggtcgaaca 3280441 gggtcgccga gtagtacggc accttgggcg tcatcggagc aatgtccgcc agcgccgcgg 3280501 ccaaatcgtc gagtatcgga tcgacttgag gcgagtgcga cgccacgtcg acggccacct 3280561 cgcgcgccat cacgtcccgc tgctcccaac gggcgatgag gtcacgaacg gtgtcgctcg 3280621 taccgccgat caccgtggat tgcggggacg ccaccaccga gaccacaaca tcgtcgattc 3280681 cgcgtgccat cagctccgaa ttcacttgct tggcgggcaa ttccaccgag cccatggcac 3280741 cagcaccggc tatgcgggtc atcagcttcg agcggcggca aatgacgcgc gccgcgtcct 3280801 cgagcgacag tgcccccgcg acgacggccg cggccgactc acccatcgag tgtccgacga 3280861 ccgcgcccgg ccgcactccg taggtttgct ccatggtggc ggccaacgcg acctgaacgg 3280921 cgaacactgc cggctgcact ttgtcgattc cggtcacggt ctgctgcgcc gttatcgcct 3280981 cggtcaccga gaatcccgat tctgcggcga tcaccggctc cagcttggcg atggtggccg 3281041 cgaacactgg ttcgctggcg agcaattgcg tgcccatcgc cgcccactgc gacccttgcc 3281101 cggagaagac ccagaccggt cctcgatcac cgtgtcccac cgccgcgtca tagagggcgt 3281161 caccgtcggc cacctcgcgc aaaccctcga cgagctccgg caggttggcg gcaaccaccg 3281221 cggtgcgcac cggccggtgc gcgcggccac gcgccagcgt gtaggccaga tccgaggccg 3281281 ccacgcagtc ctggtgttct tccacccagg tggctagttg gcgggccgtc tggcgcagtg 3281341 cgtcgctgga cgtggacgac agcatgaata gccgcgggcc cacctcagcg tcgcccggtg 3281401 aactctcggg tgcggaagct tctgctgggg cctcttccac gatggcatgc acgttggtcc 3281461 cggacatccc gaacgaggac accgcgaccc gcttcggtgt gtgatcatta ccgttgggcc 3281521 acggcgtaac cgcttgcggc acaaagagcc cggtctcgac gtcggaaagc tcatcgggca 3281581 gccgattgaa atgcagcagc ggcggcacca ccccgtgccg cagtgacaga attgccttga 3281641 tcagcccgac ggtccccgcc gatgccgtgc tgtgccccat gttgctcttg gccgatccaa 3281701 gcgcgcaggg ggtgcccgcg ccatacaccc gcgccaggct gcggtactca atcgggtcgc 3281761 cgattggcgt accggtgccg tgcgcctcga ccacaccgac cgtttcgggc tgcacgcccg 3281821 ccgccgccaa cgccgcacgg tacacggcaa cctgggcgtc ctcggacggc atggtgagcg 3281881 tctccgtgcg gccgtcctga ttggtggccg tgccacgcac cacggcgaag atccgattac 3281941 cgtcgcgcag cgcatccggc agtcgcttca gcaacaccat cgcgcagccc tcggaacgca 3282001 caaacccatc cgcgtcagca tcgaatgaat ggcaccgacc ggttgacgac agcatgccct 3282061 gcgcagacgc cgccacactg gcatgcggct ccagcagcac cgcacaaccg cccgccaaag 3282121 cgaggtcagc ttcgccgtca tgcaggctgc ggcaggccag gtgcaccgcc atcagacccg 3282181 aagaacacgc ggtgtcaaac gtcatcgccg gaccatgtag acccaatgtg tgcgcgatcc 3282241 gccctgacgc cacactgttg ttgaggccgg taaccacata tggactggcc aaaccgcccg 3282301 ccgttgtggt gagtaccagg tagtcctcgt gggtcagccc agtaaaaacg gccgtcgagg 3282361 acccggccaa cgacgccgga tccagaccag catgctcgat cgcctcccac gacgtttcca 3282421 gcagtagccg ctgctgcgga tcgatcgagg tcgcttcccg ctcgctaatc ccgaagaact 3282481 cagcatcgaa accggcgacg tcgtcaagga acccacccca ccgggacacc gaccgcccgg 3282541 gaacccctgg ctcagggtcg taatagtcgt cggcgtccca gcggtcgggc ggaatctcgg 3282601 tgaccaagtc atcaccgcgc agcaacgact cccacagttt gtcgggcgag ttgatccccc 3282661 caggaagccg acatcccatc ccgatcaccg caacgggagt gacacgtgat tccatactct 3282721 tccaacctcg tctcagctca accggtgtta cccgacgaca tcagcgaatt ttcacaccgg 3282781 gaatgaaacg gccgcggtgc cgctctccca gctcttaagt aatccgagcc aacccggatc 3282841 ccgacaccaa agacaagtgt tacacgacgc caagaccccc cgcgggtagc gctggaatac 3282901 taacacgagc acatgtgctc gcgaccgagt ctcacctcgg acctgggcaa atgaccccat 3282961 gtcgcaggtg catggagttg ttcgggcagt ctcggcgagg ttgcagggct gttcgaccag 3283021 cggatttcga cactcggtaa cgcaagccag ttaggggcgg tcatcggtga tgctgcgcca 3283081 cgaagcacta catccgttgc accgcaatta ttttcggtgc ccgcatgacg ggcgcaatgc 3283141 cttaattgcg ttagccggcg acccgccgcg ggggcggcgc cacatcacat ccgaccgtgt 3283201 ccgatggtgg acccatggcg agccggcaaa cccctgctga gctggccaga tgcgacttgg 3283261 ctaagaccgc ggagcgcgag cacaccccga cggcgactgc gacaactcca agcgtggccg 3283321 gtaacgtgat gcccatgagt gtgcgttccc ttcccgctgc gttgcgcgcg tgtgcgcgtc 3283381 tgcaacccca tgacccggcc ttcacgttta tggattacga acaggactgg gacggcgttg 3283441 cgataaccct gacgtggtcg cagctgtatc ggcgaacgct gaatgtggca caggagctga 3283501 gccgttgtgg ttccacgggt gaccgcgtgg tgatctctgc tccgcaggga ctcgagtacg 3283561 tcgtcgcctt tctcggcgcg ttgcaggccg ggcgcatcgc cgtgccgctt tcggttccac 3283621 aaggcggcgt taccgatgaa cgttccgatt cggtactgag tgattcgtcg ccggtggcca 3283681 ttctcactac atcgtctgcc gtggacgacg tcgtgcaaca tgttgcgcgg cggcccgggg 3283741 aatccccgcc atcaattatc gaagttgatt tgctcgatct ggacgctccg aatgggtata 3283801 ccttcaaaga agacgagtat ccatctaccg cgtatttgca atacacctcc gggtccaccc 3283861 gcacgcccgc tggcgtggtg atgtcccatc agaacgttcg ggttaatttc gaacagctga 3283921 tgtctggcta ctttgcggat accgacggga ttccaccgcc aaattccgca ctcgtatcct 3283981 ggctaccctt ctaccacgac atgggtttgg taataggaat ttgcgcacca attctgggtg 3284041 gataccccgc ggtgctcacc agcccggtgt cgttcctgca gcgcccggcc cggtggatgc 3284101 acttgatggc cagcgatttt cacgcctttt cggcagcacc gaatttcgcc tttgaactag 3284161 cggcacgaag aacaaccgac gacgacatgg ccgggcgtga cctcggcaac atactgacca 3284221 tcctcagcgg tagcgagcgg gtacaggccg cgacgatcaa gcgcttcgcc gaccgctttg 3284281 ctcgcttcaa tctgcaggag agggtgatcc ggccttcata cgggctcgca gaagcaacgg 3284341 tgtacgtggc gacgagcaaa ccgggtcaac caccggagac cgtcgacttc gatactgaaa 3284401 gtttatccgc cggccatgcg aagccgtgcg caggcggcgg cgctacatcg ttgatcagct 3284461 acatgttgcc gcggtcaccg atcgtgcgga tcgtcgactc ggacacctgc atcgaatgtc 3284521 cggacggaac cgtcggcgag atctgggtgc acggcgacaa cgtcgctaat ggctattggc 3284581 aaaaacccga cgagagtgag cgcacgttcg gcggaaagat tgtcacccct tcgccgggca 3284641 cacccgaagg tccttggcta agaacgggcg actcaggttt cgtcaccgat ggcaaaatgt 3284701 tcatcatcgg tcggatcaaa gatctcctaa ttgtgtacgg acgcaaccac tcccccgacg 3284761 acatcgaggc aacgatccag gagatcaccc gcgggcgctg cgcggcgatc tcggttcccg 3284821 gtgaccgcag caccgaaaag ctggtcgcca ttatcgaact caagaagcgt ggcgactcag 3284881 atcaggacgc gatggctaga ctgggcgcta ttaaacgcga agtcacgtcg gctttatcga 3284941 gttcgcacgg tctcagcgtc gcggatctgg ttctggttgc gcctggctcg atccccatta 3285001 ccaccagcgg gaaggtcagg agaggggcgt gtgtcgagca atatcgacag gatcaattcg 3285061 cccgcttgga tgcctagtcc ggctggccgt ctacacagaa ttcggtatat ccgtttgaaa 3285121 aagtcctccc cggactgccg cgccaccatc accagcgggt cagccgacgg tcagcgaagg 3285181 tcaccccggc tcaccaacct gctcgtcgtc gccgcctggg ttgccgcggc ggtgatcgca 3285241 aatctgcttc tcacgttcac gcaagcagaa ccgcacgaca ccagcccggc gctgctgcca 3285301 caagatgcca agacagccgc cgccaccagc cggattgcgc aggctttccc cggcaccggt 3285361 agcaacgcta tcgcctatct cgtcgtggaa ggcggcagca cgcttgagcc gcaggaccag 3285421 ccttactacg acgccgccgt cggtgccctg cgcgccgaca cccgccacgt gggatccgtc 3285481 ctcgactggt ggtcagatcc cgtcaccgcc ccgctgggaa ccagccccga cggccgctcc 3285541 gctacggcca tggtgtggct gcggggcgag gcgggcacca cccaagctgc cgaatccctc 3285601 gatgccgtcc gatcggtgct gcgccagtta ccgcccagtg aggggcttcg cgccagcatc 3285661 gtggtcccgg caatcaccaa cgacatgccg atgcagataa ccgcctggca gagcgcgacg 3285721 atcgtgaccg ttgcggcggt gatcgccgtc ctactgctgc tgcgggcgcg cctgtcggtg 3285781 cgggccgcgg cgatcgtgct gctgaccgcg gacttgtcgc ttgcggtggc ctggccgctg 3285841 gccgcggtgg tgcggggaca cgattgggga accgattcgg tattttcttg gacgctggcc 3285901 gcggtcctga cgatcggaac catcaccgca gccaccatgc tggccgcgcg gctcgggtcc 3285961 gacgcaggtc attcggccgc gcccacatac cgcgacagcc tgcccgcgtt cgccctgccc 3286021 ggggcgtgtg tcgccatatt caccggcccg ctgctgctgg cccgaacccc agcgctgcac 3286081 ggagttggca ctgccgggct aggtgtcttt gtggcacttg cggcttcgtt gacggtgctg 3286141 cctgccctga tcgcgcttgc cggagcgtca cggcagttac cggcaccaac cacgggtgcc 3286201 ggctggacag gccggttgtc gctacccgtc tcttctgctt cggccctggg cacagcggca 3286261 gtgctggcga tctgcatgct acccatcatc gggatgcggt ggggtgtggc cgagaacccg 3286321 acaaggcaag gcggcgcaca agtccttccg gggaatgcgc ttcccgatgt ggtggtgatc 3286381 aaatccgctc gggacctgag ggacccagcc gcgctcatcg ccatcaacca ggtcagccac 3286441 cgtctggtgg aggttcccgg tgtgcgcaag gtggagtcgg cggcatggcc ggccggtgtc 3286501 ccgtggaccg acgcctcgct cagttccgcg gccggcaggc tcgccgacca gctgggtcag 3286561 caggccggat cgttcgtgcc ggcggtgact gcgatcaaat cgatgaagtc cataatcgaa 3286621 cagatgagcg gcgcggtcga ccaactggac agcaccgtga acgtgactct cgccggggca 3286681 aggcaagcac agcaatacct cgatcccatg ctcgccgccg cgcggaacct caaaaacaaa 3286741 accaccgaac tgtcggaata cctggaaacg atccacacct ggattgtcgg cttcacaaac 3286801 tgccccgacg acgtcctgtg cacggccatg cgcaaggtca ttgaacccta cgacatcgtg 3286861 gtcaccggca tgaacgagct gtccactggc gccgaccgca tctccgcgat atcgacacag 3286921 acaatgagcg cgttgtcctc ggcaccgcgg atggtggcgc agatgcggtc ggcgctagca 3286981 caggtgcgct cgttcgtacc caagctggaa acaaccatcc aggacgccat gccgcaaata 3287041 gcgcaggcgt cggcgatgct gaagaatctc agcgccgatt tcgccgatac cggtgagggc 3287101 ggcttccacc tgtccaggaa ggacctggcg gacccgtcgt accggcacgt acgggaatcg 3287161 atgttctcgt cagacggaac cgccacccgg ctgttcctct attctgacgg acaactggac 3287221 cttgctgcgg cagcacgcgc gcagcagctc gagatcgccg cgggcaaggc gatgaaatac 3287281 ggaagcctgg tcgacagcca ggtcacggtg ggtggggccg cgcaaatagc cgcggctgtc 3287341 cgcgatgccc tcatccacga tgctgtgcta ctggccgtta tcttgctcac ggtagtggct 3287401 ctggccagca tgtggcgcgg tgccgtccac ggtgctgcgg ttggcgtggg tgtgctggcc 3287461 tcttacctcg ccgccctggg ggtctcgatt gcactgtggc aacacctact ggatcgcgag 3287521 ctcaacgcct tggtcccgct ggtgtcgttc gccgtcctcg cttcgtgcgg cgtcccgtat 3287581 ctcgttgccg gcatcaaagc cggtcgtatc gccgacgagg caacgggtgc gcggtccaag 3287641 ggggcggtat ccgggcgggg agcggttgcg ccgcttgcgg cgctcggtgg cgtattcggc 3287701 gctggcctgg tgctggtgtc gggaggttcc ttcagcgtgc tcagtcagat tggcacggtt 3287761 gttgtgctcg gtctgggcgt gctgatcacg gtgcagcgag cgtggcttcc gaccacgcca 3287821 gggcggcgtt gaccgcctgt tcgagacccc atgccacgct cggctggccg acgacgatca 3287881 cccatcgcag acaccacact tggtaggggt tgccagttgt tggccgggtg agtggtcggc 3287941 gcgccgttgc ccggggtagg gttcgaggtc tttggatgat gggcgtttcc acgctgccca 3288001 aaggatgacc tcgacgtgtc cgagttcacg ttgaccgcgt gaagttaaac cggtgccgag 3288061 cgtgcactga gggcgaaatc cggcgccgat tttccgccct gagttcacgt tgggcgacgg 3288121 cgcccatgaa cgacgccaca tcgcacatgg cgctcaggcc aagcaccagc ccatctccgt 3288181 cgccggccac cgtcaccgat cgaacgacct cgacccccgc cctggcaaca acacgccgct 3288241 gccctctaca cctccgcgct gtcgaaaatt gtcacggagc cttgcggggg ctggtgcgac 3288301 tgatatgacg caccttccgc cagaggctag cccgacgttt actgacgtta ctgctgctta 3288361 ccgtttgtcg acggcacgtg aaaactgacc ccggcgcggc acccgaattt tgaccccctg 3288421 gtcgggtgga ctggctctac ccgagccagg aggaccgaag ggaatgttga ctgtggaaga 3288481 ttgggctgag attcgccgat tgcatcgcgc ggagggtttg ccgatcaaga tgatcgcccg 3288541 ggtgctgggg atttccaaga acacggtgaa gtcagcgttg gaatcaaacc agcagccgaa 3288601 atatgaacgg gcaccgcagg gttcgatcgt tgatgcggtt gagccgcgga tccgggagtt 3288661 gttgcaggcc tatccgacga tgccggcgac ggtgatcgcc gagcggatcg gctgggagcg 3288721 ctcgattcgg gtgctctcgg cgcgggtggc cgagctgcgc ccggtgtatc tgccgccgga 3288781 cccggcgtcg cgcaccacgt atgtggcagg cgaaattgcc cagtgcgact tctggtttcc 3288841 gccgatcgag ttgccggtag ggttcgggca gacccgcacg gccaaacagt tgccggtgct 3288901 gaccatggtg tgcgcctatt cgcgctggct gttggcgatg ctgctgccca gcaggtgtgc 3288961 cgaggacctg ttcgccggct ggtggcggct gatcgaggcg ttgggggcgg tgccgcgggt 3289021 gttggtgtgg gatggcgagg gcgcgatcgg gcgctggcgc ggcgggcggt cggagttgac 3289081 cactgagtgt caggcgttcc gcggcacgct ggcggccaag gtgctcatct gccggccggc 3289141 cgacccggag gccaagggcc tcattgaacg ggcccacgac tacctggagc gctcgttttt 3289201 gcccgggcgg gtgtttgcct cgccggccga tttcaacgcc caactgggcg cctggctggc 3289261 gctggtgaac acccgcaccc gccgggcgct gggttgtgcg cccaccgatc gcatcggcgc 3289321 ggatcgggcc gcgatgctga gcttgccgcc ggtggcgccg gccaccgggt ggtgcacctc 3289381 gctgcggctg ccccgggatc actatgtgcg ctgcgattcc aacgactact cggtgcaccc 3289441 gggtgtgatc gggcatcggg tgctggtgcg cgccgacctg gagcgggtgc atgtgttctg 3289501 cgacggtgag ctggtcgccg accacgagcg gatctgggcg gtccatcaga cggtctccga 3289561 tcccgcacat gtggaggcgg cgaaggtgtt gcgccgccgg cacttcagtg cagcatcacc 3289621 ggtagttgag ccgcaggtgc aggtccgctc actgagcgac tacgatgacg cgctgggagt 3289681 cgacatcgat ggcggggtgg cctgatgccc accaccaaag ccacccagcg ccgtgatgtt 3289741 tccaccgaga tcgcttacct gacaagagca ttgaaagctc ccaccctgcg tgagtcagtg 3289801 tcccggctgg ccgatcgcgc ccgcgccgag aactggagcc acgaagaata cctggccgcc 3289861 tgcctgcagc gggaagtgtc agcccgggag tcccatggtg gtgagggccg catccgcgcc 3289921 gcccgcttcc cggctcggaa gtcgttggaa gagttcgact ttgagcatgc tcgtggcctc 3289981 aaacgcgaca ccatcgcaca tctgggcacc ctggatttca tcaccgcccg cgataacgtc 3290041 gtgtttttgg gccccgcctg gcaccgggaa gactcatctt gcggtcggcc tggcgatacg 3290101 cgcgtgtcag gccggtcatc gggtgctgtt cgccaccgcc gccgaatggg tagcacggct 3290161 cgccgaggct caccacgccg ggcgcatcta cgccgaactc acccggcttt gccgctatcc 3290221 gctcctggtg gttgacgaag tcggctacat tccgtttgag cccgaggccg ccaacctctt 3290281 cttccagctg gtgtcctccc ggtatgagcg ggccagcttg atcgtcacgt ccaataaggc 3290341 cttcggccgg tggggcgagg ttttcggcgg cgacgacgtc gttgctgccg ccatgatcga 3290401 ccgcctcgtc caccatgctg aagtcgtcgc cctcaaaggc gacagctacc ggctcaaaga 3290461 ccgcgacctc ggccgcgtcc caccagccgg aaccaccgaa gaataaccac caaccgcccg 3290521 gtctaggggg tcaattttca gatgccgtca gggggtcagt tttcgggtgc cgttgacacc 3290581 gttcacaagg gcgtttcgag caacgcgtcg acgcaacttc ggcctagtcg acgttgacgg 3290641 gttcgttcca tttcgactgc gtgagctgaa tcgacccgga tccgaggtcg atgctcgctc 3290701 ggacgaggtg gtgcgagccg tcctgggcaa tccacacggt cgccggcctt gcactcttgg 3290761 cgccaggatc aagcatcttg acagagctcg cggggatggt cccggtgatt ttggtggtcg 3290821 aaattccgtc tatcacttcg gtaccttgcg cttggaggtt cgtgacaccg gacagcagct 3290881 gcgtcacccc agcggcagga tcgagcacgc gtgaagttga cagttcagaa atcgagccga 3290941 gattgctcca gtcgtcgaac agtttcaccg agatgttgtc gccttgtacc cgaaacggga 3291001 caccctgctc gtcgttgtag gtgcatacgc cctttgccgc gagcggattg gcccggacgt 3291061 cgacatcggc actggtaata cccagcaagc tgtcgacttt cccggttgtt cggaccgcta 3291121 cgtgcacgct ggtcaaccct tttgtcgcat caagcgactg cctgatctcg gcgaggagcg 3291181 cggggtcgga cgccgtcggg ctcacgggaa caccctgttc ctcggcatca ggtttcggcg 3291241 aagaacatcc tgatagccac aacgccaggc aggcacctag caccaccaga acagcggacg 3291301 tcaccgcccg ttttccatca ttcatttgcg ctcactacct cgattgtcaa atgggcccgc 3291361 aggccgaatg caggttgatt ggatcacgct gggcatgact gcccgcctcc tcactcgcgc 3291421 cattccggcg ctcgccgtcg ccgggccccg ccaaattgcc cgcctcctca ctcgcgccat 3291481 tccggcgctc gccgtcgccg ggctaggcat ggaccgatac ttccgcggcg gcgggttcga 3291541 caacctgcga cgtcggatca ccggattccg ttgggcggct gccagacatt tgctgggcga 3291601 catactcggc gaccgcagtg ggagtgggat gatcgaaaat cacggtaggt ggcagcgtca 3291661 gtccggtggc ggttttgagg cggttgcgta actccacagc cgttaatgag tcgaaaccga 3291721 ggtcgccgaa ttcggtgtcg gggtcgacgt cctcggcgga gggcctaccc agcactgccg 3291781 ctgcctgcag acacaccagc cccactagca gctcgagttg ttcgtccgcg gccagcccgt 3291841 gtaggcgttg agccagcgcc gacttcgacg aggtggcgtc accggtgtcg tcgatttggc 3291901 gtcggcgtgg gcggcgcgcg agcccgctga acagcgccgg caacgcaccg gcctgggccc 3291961 gggcgtctag tgcagcccgg tccaagagcg tggccaccgc cagagggtga tcgatggcca 3292021 gcgcagcgtc aaacaattcc accgcttcgg cagggctcat cggagccagc ccgctgcggc 3292081 tcatgcgggc cagatctcgg ctgctcaaat gcgcggtcat gccgccaggc tgttcccaca 3292141 aaccccacgc cagtgatatc ccggccaacc ctgcggcctg ccggtgagcg gccaacccgt 3292201 ccagaaacgc gtttgccgcc gagtagttgc cctgccccgg cgagccgacc gtggccgcga 3292261 tcgatgagca cagcgcaaac atcgacaaat ccaggtcact ggtggcctgg tgcaggttcc 3292321 acgccgcgtc caccttggcc cgcaacaccg tatcgatgcg gtccggtgtc aacgaggtga 3292381 tcactgcgtc atcgagcacg ccggcggcat gaatcacccc gcgcaccggc gggtactccc 3292441 gcgacagctg ggcaaacaac cccgctaccg cagcgcgatc ggccacgtca caggccacca 3292501 cctgcacctt ggcgccggcc tccgtcaagt cggcggccaa ttcggccgct ccctccgcgc 3292561 gatcgccccg ccgactggcc aacaccagat gacgcacccc ataggcgcca accaggtggc 3292621 gggccaacac cccaccaacc gccccggtgg caccggtgat caccaccgtg ccgtcggcaa 3292681 gccggtcggc caacgccgag ggcatggtta agacaacctt gccgatatgg cgggcctggc 3292741 tcatgaaccg gaaggccgcc ggggcgcagc gcacatccca cgtggtgacc ggtagccggt 3292801 gcagctcccg ggtgtcgaac agctcccgca cctcggccaa catctcctgc atgcgtgccg 3292861 ggccggcctc cgacaggtcg aacgcccgat actgcacgcc gggataatta gcggcgatct 3292921 cctgcgcatc gcggatatcc gtcttgccca tctcgaggaa acgcccaccg cggaccagta 3292981 agcgcagcga cgcatccacg aactcaccgg ccagcgagtc gagcaccaca tcaaccccgc 3293041 ggccctcggt gaccgccagg aacttctcct cgaactcgca tgtgcgggaa tcgccgatat 3293101 ggtcgtcgtc aaaccccatg gcgcgcagcg tgtcccactt gccacggctg gcggtgacga 3293161 aaacctccac gccccactgg cgagccagct gcacagccgc catgcccaca ccgccggtac 3293221 cggcatggat cagcaccgat tcgcccgcct tgatctcggc taaatcggcc aacccgtacc 3293281 aggccgtcaa gaacaccacc ggcacagcgg ctgcctgagc aaacgaccag ccttgcggca 3293341 cccgggtaac cagttgctga tccaccaccg ccagcggacc ggccccgccc aggaatccca 3293401 tcacggcgtc accgacggca agatcggtca cttcgggacc ggtctcaagc accaccccgg 3293461 cgccttcggc acccagcggt ggggcctggc cgggatacat ccctagggcg gccaccacat 3293521 cgcggaagtt gaccccgacg gccgccaccg ccacgcgcac ctgccccgcc tgtagcggtg 3293581 cctgtacctc cgggcagggc tggatcacca aatcctccag ggtcccgcca ccaccggcgg 3293641 ccaatcgcca cgccgactct gccgccggta acgctagcaa cgccggggcc ggggacagcc 3293701 ggggggcgtg cacagtgccg ccgcgcacca gcagctgggg ttccccgacg ccggctagca 3293761 ccgaggcatc caccgccgca tcggtgtcga tcaacacgat ccggccggga ttttcggcct 3293821 gcgcggaacg cgccatgccc cacaccgcgg cggcggccag gtcgctgatg tcctcgccag 3293881 ccagccccac gccaccatgg gtcaacacca ccaacgtggc cgcccgatcc gcgccgagcc 3293941 aggactgcaa cacctccagg gcggtgtggg tggccgcata caccgagccc accaccgagg 3294001 atgcttggcc accggcagac tcgagttccc acaccacgac actggcgtca ccatcactgc 3294061 cggcgcaaaa gtccgcccaa gacaccgggg caggtggggc ggacccgtta gcgccgccgc 3294121 tgaccaccga gatcggcgac cacaccactt ccagcggccc ctgatcggac gcaccgccgg 3294181 ccgcggtcac ggcggcgcgc agctgttctg cggttatcgg gcgagtaacc agcgagcgca 3294241 ccgtcaacac cggcagccca gtggcgtcgc agacgtccac ggaaatcgca tccgcgcccg 3294301 cggacgcgaa gcgggcccgc acccgtccag cgccgccggc atgcagcgac accccacgcc 3294361 agcaaaacgg cagtctcgtc tcggtgctcg cctgggtctt ctcgacggcc agcccgaggg 3294421 catgcagcac cgcgtccaac accgccggat gcatccccat tcggtcgacg gccacgccgg 3294481 cctcgccggg ggctacaact tcggcgaaca gctccgaccc ccgccgccag atcgccacca 3294541 gaccctgaaa cgcggggccg taggcataac cgcgctcggc caactgcgca tagccgtccg 3294601 agatatccac actctccgcg ccctcgggcg gccacacgga caaatccatc ggcgtctcag 3294661 cggcagccac ccccagcatg ccttcggcgt tcagcaacca accctgggat tgatcaccgc 3294721 gggaatacac cgacaccgca cggtgcccgg attcatcggc agccccgacg accacctgca 3294781 cctgaacccc gacacccggg tgcatcacca acggtgcggc cagcaccaac tcttcgatga 3294841 gcgcgcaccc gacctcatca ccggcgcgga tcaccaactc cacaaaaccc gccccgggga 3294901 acagcaccac cccgttcacc acgtggtcgg ccagccacgg ctgatccgca agcgacaacc 3294961 ggccggtcag caccacctcg tcagaatcgg gccgctcgac caccgcaccc aacaaggcat 3295021 gctcggtcgc gcccagaccc aacccggccg catcggcggg cccatccgcg cccggcgtct 3295081 cccaaaaccg ccgtcgctga aacgcatacg tgggcagctg cacccgccgt ccacccgagc 3295141 cggcgaacac cgccgaccac tgcaccggca caccggtggt gaacacctga ccggcagcac 3295201 cgagcgccga ggccagctcg ggccggtctt tgcccagcat cgacaccacc atcgcctcag 3295261 ccggggccaa ggactgctcg atcgagccag tcaaaccact tcccgggccg gcctcgatga 3295321 agtgggtcgc cccaagggtc tgcaaatgac gcgcactgtc cgcgaagcgc accggccgac 3295381 gaacgtggtc cacccagtac tgcgccgacc cgaaatcagg gccggccaac tcgcccgtca 3295441 cgttcgacac cagcccaagc tggggctcgc gtgcctgcac ccgggccgcg acacgcgcga 3295501 actcctcgag catcggctcc atcaacggcg aatgaaacgc atgcgagacc gccaactggt 3295561 gcacccgccg accctgcgcg gcgaaccgat ccgcaatcgc atttgccgcg gcctgcgcac 3295621 cggagatcac caccgattcg ggcgcgttga tcgcagcgat ccccacaccc tcacccagca 3295681 gcggctccac ctcgtcctca ctggcagcca ccgccaccat cgcaccgcct gccggcagcg 3295741 cctgcatcaa ccggccccgc gccaccacca gcatcgccgc gtccgccaac gtcaacacac 3295801 cggccgcgtg cgccgccgcc agctctccaa cggagtgacc catgacgaag tccggaagca 3295861 caccccaatc ccgcaacacc gcgaacgatg ccacctccac cgcgaacaac gcgggctgag 3295921 caaattcggt gctgtcaagc aaatccgcat cggcacccca aataacgtcg cgcagcggca 3295981 accgcagatg ccggtccaac tcgtcggcca ccgcatcgaa tgcctgcgca aacacgggca 3296041 actcgccgta caactcgcgg cccatcccga tgcgctgcgc gccctgccca ggaaacacga 3296101 ccaccgtctt gcccaccgac cctggctgac cgaccgccac gccggcaccc ggctcgcccg 3296161 ccgcgagccc agccagcccg gcaatcagtt gctcacggct tgcgccgacc accaccgctc 3296221 ggtgctcaaa caccgagcga ctggccaacg agcaccccac atcgatcgga tccagccctg 3296281 ggttggcctg cacgtgggcc ataagtcgac ccgcctgcgc cgtcaacgcc tcagccgatc 3296341 tcgccgaaat cacccacggc accatcgacg gccgcggccc ccggtgcttt cgctcgcctc 3296401 aaccggcgcc tctgcggggg ctggtacggg ggcctcttcc aagatcagat gcgcgttggt 3296461 gccgctgatc ccaaaggagg acaccgccgc ccggcgcgga cgcccgtcaa ccgaccactc 3296521 cctggcctcg gtcaacaccg acaccgcgcc gctggtccaa tccacccgcg gggaaggctc 3296581 atccacatgc aacgtcgccg gcatcacccc atgacgcatc gcctgcacca tcttgatcac 3296641 cccggcgacc cccgcggcgg cctgggtgtg gcccatgttc gacttgattg agcccaccca 3296701 cagcggctgc tccgctggac ctccctgccc gtaggtggac agcaatgcct gcgcttcgat 3296761 gggatcaccc aacgtggtgg cggtcccgtg tgcctccacc acgtctacgt ctgcggcgga 3296821 caacccggcg ttggccaacg ccacctggat cactcgctgc tgggcgagcc cattgggcgc 3296881 ggtcagccca ttggacgcac catcctggtt gaccgcgctc ccccgcacca ccgccagcac 3296941 cgaatgcccc aaccgccggg cgtccgatag ccgctccagc acaaccaccc cggcgccctc 3297001 gccccacccg gtgccgtcgg ccgcggccgc aaacgcctta catcgcccat cggcagccaa 3297061 cccccgctgc cgggaaaacc ccacaaaaat cgacggcagc cccatcaccg tcaccccacc 3297121 ggccaacgcc aaatcacact ccccggagcg caatgacgac atcgcccaat ggatcgccac 3297181 caacgacgac gaacaagcgg tatccactga caccgccggg ccctgcagcc ccaatacgta 3297241 cgacacacgt cccgaggcca cgctgattga cgtgccggtc aacccgtacc cttgcagccc 3297301 cccggtatcc ctattgccgt aactcgccgc gaaaatgccg gtgtacaccc cggtcgccga 3297361 accacgcaac gacaacgggt caatccccgc gtgctccaac gcctcccacg aaacctccag 3297421 catcaaccgc tgctgaggat ccatcgccaa cacttcacta ggagcgatgc cgaagaaccc 3297481 ggcgtcaaag ccggtggcgt cgtctagaaa tgccccccat cgcgtgtagg ttttgccctc 3297541 agcgtcggga tccggatcgt atagcccctc aacatcccag ccccgatcgg tcggaaactc 3297601 cgacaccacg tcgcgccccg ccgaaacgac atcccagagt ccgtccgggc catccacgcc 3297661 gcccggaaat cggcagccga ttcccaccac cgccaccggt tctgtcgcgc gttgctcata 3297721 ttcacgcagc cgagcgcgtg tctcatcgag ctcgacagca accttcttta ggtagtgaaa 3297781 aagcttttcg ctctgctggt cggcaccttc aacgctcatc gtccgttgct cctctatcac 3297841 ttcccaagtt cggaatcgat tagctggaaa atttcgtcag gagtcgaagc agcctggatc 3297901 agcttgccca ggcccgcctc gctgccggcg atggtgccca gcagggcacg caaacggtcg 3297961 gccacccgct gcttctcgcc gtcggcgatg acggccacca gctcttcgac cttgttcaac 3298021 tgctcttcga ttgcccaaag acccgtcgcg ccgctattca ccggaccggc tgatttcaat 3298081 cgaccatgcc cgccggccag ttcggcctcc aaatactggg ctaatcccga tatcgacccg 3298141 tagtcccacc caaccgtctc gggtaaccgc aggccggtaa ctgccgccaa tcgcttgcac 3298201 agcgtgactg tcatttgcga gtcaaaaccc agctccgaga aggcgagatc ctgatcgacc 3298261 gaccaaggat ctggctcacc taacatcttc gcggcctcgg cgcatacggc atccaccacc 3298321 agccgctgac gttcttgccg caaagcgacc aaccgctcgc gaagagtcgc cccgccgtcg 3298381 tttcctccgg cgatcgtcat gttggacgcc gacaggtcat cacgctgcgc ccgcacgccg 3298441 gacccaggtt cagtcaacga gagttcccaa atcggtttgg ttggactctg cttgcgcagg 3298501 gcgccacgca ccaatttccc gttcggggtt cgagggagtc gatcaacaac ggcaaaccta 3298561 tgcggcacct tgaacgcaga caatcggttg agcaatccgc ggtgaaggtc tcgcatgacc 3298621 gacccatcga tggtggcacc gctggtcgca accagaaaag cctgcagtgt cgacgcgccc 3298681 gtggactccc ttaccgcgac aaccgcggcc tcagccacgg cttcgtcctc gatgatgagt 3298741 cgctcgacct cacgcggatc aacgttgacc cctccgataa cctcggtgtc gtcggcgcgg 3298801 cagcggtagg taacccaccc gtcgctgtcg atacacaccc tgtcccgcgt gtcgagccaa 3298861 ccctcattcg cgacggggga atcaggccga ttccaatagc ccttagcgat cgccggtccg 3298921 cggacccata ggtcgccctc aaccccaggc ccggcagttg ttccatccgg cgctacaaca 3298981 cgaatctcgt agggcggcag cacccttccc agcgtcccca ggcgccattc gtcaacccga 3299041 ttcgatacga acgtctgccc gacctccgta gatccaatac cgtccagaat ggggatgccg 3299101 ccaaagaatt ccatgagccg ctcggcaaga cccagctcaa gggcctcccc ggctgacacc 3299161 acacatcgaa gcgaacggaa ggaatcagga gaacatgagt cgatgactct ggcaaagaaa 3299221 tttggcacac cgtagagcac cgatggccca aatcgcgcgc ttagaatggc cgctgcttct 3299281 ggagttaccg gcgccgaatt gatgaccgcg gaaccacctg tcgcgagtgg aaaccagacc 3299341 gaatttccta ggccgtaagc aaaatacatg cgtgcactac atagcccagt atcttcagga 3299401 gtgagccgca aggctttacg acacatagcg tccacgaacg tcaacgggtc ggcgtgccga 3299461 tgaatcgccg ccttcggcgg acccgtggta ccagacgtat acgtagcgta tgcgagtgcg 3299521 tcaccaccca tcggttcgta gcctccaggc gcgactcgag ccgcctcgga catgagttcc 3299581 gcggcttcgg ccacccgcga cggctgaaac cgatcgcgca gcgcatccga ggtgacgaca 3299641 agcgccggtt ccgtgttgcg tgcggccaac gcgtggtcgt cgcgatgcag ctccggattc 3299701 gctagaaacg ccataacccc acgagccagg cacgccagca atagctgcac caggtcgggc 3299761 gaatccggca ggcacaacag aacccgatca ccactggata gtccgcggtt tctcagcact 3299821 tccccaagac gtgcggcacc gtcgtggatt tgaccatgag tcaccacatc ggccgcatag 3299881 aaggccggcc ggtcgtacca tcccgcctcc gatgcctgct cagccaggag ccccgctaga 3299941 ttcccattcc gcatttattg gatgaccgcc ctagcgcgcc agagtgatgg catttgaaaa 3300001 ctgccagcga tcaggttctt catgcggagc atctcgaaat acgcttcgta gaaagtgctc 3300061 cgtcaccacc atgatcggct ggcctccgga aataacgcga taacggcggg caacggctct 3300121 ctttcgggaa ttctgatacc cgtggagcgc gagccaacct ggcaaatctc cgacccagac 3300181 tttagcctct tccttgaagg tctcgatgtg gctggctgcc ataacctcgc cgagaggatc 3300241 gtttgtctgc gtcaaccttg ttataattgc agccggcaac cgatcaatcg caatcaacga 3300301 ctccgcggct acaaataagt gttccgagtt ccgaccttta agtatgatgt accgctgcag 3300361 aacacggccg acccccacct gccctagctg ctcgaattcg gatagcttcg gtgaaacatc 3300421 gtgaatccgc tgcttgacga tttgcacgat cacctcatcg tcagcgacaa tgttgagaac 3300481 cctagtgagg gtgccattag ctgctatcag tattcgaagg tcacgattga gctttcggat 3300541 ctcttgatca gatagaaaac actcggtcat attcttcccc cacatacatg cgctgttatg 3300601 ccatagcatc taggcggctg aattcgtgat gtaggtaacg ctcaacgctg gccgaacgcc 3300661 gaaccttacc gctggtggtg accggaatag aacccggcgc caccataacg acatccgcga 3300721 cgcgcagacg atgtgacctg gatatcgcgg aggcgacttc acgtttgacg gtgcggagtc 3300781 gattcttttc ctcctcatct gtgcgacccc gcttcatgag ttcgataatg gttaccagct 3300841 tttcagtacg gtcatcgggc accgcaatcg ccacaacccg gccgccggtg atttcctgga 3300901 tcgtcgcctc gatgtcttcc ggatagtggt tggccccatc caccaccaac agctccttga 3300961 tgcgacccgt gatgaacagt tcgccctcga aaatgacgcc gaggtctccg gtccgcagcc 3301021 acggaccttc cgaagtcccg ggcgagggag tgacgagccg cgcgcggaac gtcgcctccg 3301081 tctgctgcgg gttgcgccag tagcccaagc cgacgttgtc tccctgcacc cagatttcgc 3301141 caaccgtccc cgcgggattc tccatcctgg tttcggggtc gacgatccgc acggttgacg 3301201 cccggggagc tccataactc accaggttgg ctccctcgct gccgttctcg gcacgcttcg 3301261 cctgaccgac cgacagctgc tggtagtcaa agcaaacact cttcggcgcg cgtcccggtc 3301321 cggcggtcgc cacgtacacc gtcgcctccg cgagcccata tgacggccgg attgccgtct 3301381 cgctgaggtt gaacggggcg aaccgctcgg tgaagcgccg cagcgtcgcg acgtttactc 3301441 gttcggcgcc ggtgacgatc gtccgcacat gcccgaggtc aagtccagcc atatcgtcgt 3301501 cggatgttct gcgtaccgcc aattcgaaac cgaaattcgg tgcgctggaa atctgtgcgc 3301561 ggtgtttggc taataattgc atccaacggg ccggccgctg caagaatgcc atgggactca 3301621 tcaacaccgc ggtgtcttga ttgatcatcg ggagaatgat gcccagcatc aaccccatgt 3301681 cgtgatagaa cggcagccac gatacgggag ttgacggaac cttttccgaa tccccgatgt 3301741 aatcggacat tagctgtacg cagttggtga tgacattctt gtgcgagagg acaacaccgg 3301801 ccggcgcgcg ggtcgaaccg gatgtgtact gtagatatgc tgtgctcgga cgctcgaacc 3301861 gagtcggatc gagcgctctg gatgagctca agtccagagc gtccacagcc acgacgatgg 3301921 gcgcggactg gccctgtgcg gcgcacgcat gtggcgcata tgtcgtgacc tcgtcaataa 3301981 ccgacgaggt cgtaagaata atggacggcg cagagtctcg taatgccgaa gatattcgtt 3302041 cgtcgtgaat gccgaattgt ggcaccggaa gaggaaccgc aatgagacca gcctgcagca 3302101 cacccataaa ggcgatgatg tattcaaggc cctgcggggc caatatcgcg acccgatcac 3302161 cgcttgacgc gtatatccag agctcctctg ccacgatcat cgctcgccgg tggacttgcc 3302221 accacgtcac ggtttcggtg aagccagccg gatccgtgtc atagtcaatg aacttgtacg 3302281 ccgcgcgatt ggggtactgg ctcgccgcct tctgtaggag atcagcgagc gacgactcgc 3302341 tcatagcgaa tcgcgatgtg ctcccgttca gcggttgtgc cgcttgctca ccggtgcccc 3302401 atgccggttg tgtcgccacc tcgcccgcgg catgaaatga cgagttggtt ttcatggtct 3302461 tccttcagct atggacggca gagagcagac ggctgcgctg ccgctttcat acgaatccga 3302521 gtcggcgcat agcgtctgta ccttgcccgg gctcgcgacg cgattcgtta aggtctcacg 3302581 accatagcag gtacgggcca cacccgaggg cctaatggga ttgacggaat cgtcagccgc 3302641 ggcgtcagcg ctggctgcag ccccattcgc gaaacacacc gtcgggctgg cttcgctaag 3302701 cctaatgagc accgtcttcg atgtcgacct gctgatcgtg ccggagcgaa ttcgccagcg 3302761 ccgtgcgatt cgttcgatcc ggtcgcgacg ggtggcaccc gcagagcgct gtcagacccc 3302821 acaggtcaca gttcagagac cgcaaccgac tgatcccgcc ggtacaccgt gcccccacca 3302881 acacgaatca cggcaagccc gttgcggcga ggccgaaccg agtaaccgct gatcaatggc 3302941 ctgacctcaa gtcagctgaa cgtgcgcacg gctgacctgt gggcactctg ggaaattcac 3303001 atcgagttcc aagctcgaca cgccgaaatc gcctgccgca cgccgcgatt ggcacgccag 3303061 ccgctgggcc ggcttacccc atcttcgcga gagtggcgca aatcatagct tcttgagccc 3303121 gcgcaaaacc ttggcgtgcg gcaggacagc cgtcaccgtc ttgcgcaggc tggggttaac 3303181 caaactgccg ttgatcagca caacgtagcg cagcccgtga tctcgccatt ccgctacttg 3303241 atcgatgact tcgtcagggg ttccactgaa gacgacttct ttcataagcg cagccgggac 3303301 cttggccgcg taggacaaaa ccgtctgttt gtccatggtt tgcgggatga tgtcctgcac 3303361 accggagaag tcggctccca ttggatgctc gacgccgtga cgcgcccagg cttccccagg 3303421 taccccgagc gcggtcatct tcacaacgac agattccagc gcctcttcca cgtcgtcgcg 3303481 attccgtcca gtgatgatgc cgcgcaccgc cgccggagta atcgacattg ggtcgcgtcc 3303541 ggcatcggac gccgcgctgc gcaccgcttc gagtgcgcga ctgtagtcgc tgggacgaac 3303601 cacaacaatg ggaatccagg catcggcgta acgtccggtg gcccgtaaca tccgcggccc 3303661 gtgggccgcg acccagattt cgggccattt cccacggtat ggcggaaggt cgaacaaggc 3303721 gttatgtaac ggaaagtatg gcgattcacg tgagataagc tccccgtttg aattccacaa 3303781 cgcgcgaatg gtggccaggg cttcttcgaa ccgcgccacc ggtttggtcc actccacacc 3303841 gtagggctcg ttgccttcac gttccccgac accgataccc aatatggctc ggcctcgggt 3303901 aagcaagtgc aaagtcgcgg cagcctgggc tgtgaccgct ggattgcgcc gacctgcatc 3303961 ggtcacgcac acgcccagtc gcagacggct gggcaacccg aaggcgaggt ttccaagcat 3304021 cgtccacggt tcgtaattgg catcgatctt gggcacgaat ttcgccgcaa ttccgagata 3304081 ttcggaagtc gcaatcgagc gcggcaccag cgcattcaga tggtcgccga cccaatacga 3304141 gtcggcgccc atcacggtgg cggccgccat gctagaccgt gccggcaggg tcggcggcaa 3304201 ccgcgagtgc acgagggcat caacaaaacc gaaacgaagt ccgcccacgc ctaccccttg 3304261 tctactacgc tgttgaccaa cgtcatggct aggaacgcta cctcagcgag tcatgtccgc 3304321 gcggtcgcgc gttgagcaac accggggtcg gatgtcatgg catcgaccgc gggctcggtg 3304381 ttgcgccaac ttctcttctg acgcatcgct cgtacatact gtctgccata ctccttgccc 3304441 atggccttca gtcgaaccca cagcctcctc gcccgcgcgg gcagtacctc gacctacaag 3304501 agagtttggc ggtactggta cccgttgatg acgcgcggac tcggtaacga cgaaatcgtg 3304561 ttcatcaact gggcctatga ggaagatccg ccgatggacc tgccactgga ggcatccgac 3304621 gagcccaacc gagcccacat caacctgtac caccgcaccg cgacccaggt cgatctgggc 3304681 ggcaagcagg tgctggaggt cagttgcgga cacggcggcg gagcctctta cctcacacgc 3304741 acgttgcacc cggcctccta caccggcctg gacttgaacc aggcgggaat caagttgtgc 3304801 aagaaacgac accggctgcc tggtttggac ttcgtgcgag gtgacgccga aaacctgccc 3304861 ttcgacgacg aatccttcga tgttgtgctc aatgtcgaag cctcgcactg ttacccgcac 3304921 tttcggcgtt tcctcgccga ggtggttcgc gtgctgcgcc caggagggta cttcccatac 3304981 gccgacctgc gccccaacaa tgagatcgcc gcatgggagg ccgacctcgc tgctaccccg 3305041 ctgcggcaac tgtcgcagcg gcaaatcaac gccgaagtgc tgcgcggcat cggaaacaat 3305101 tcacagaagt cacgggacct ggtcgaccgc catttgccgg ccttcctgcg tttcgcgggc 3305161 cgcgaattca tcggtgtgca gggcacgcag ctgtcccgct acctggaagg cggggaactc 3305221 tcgtaccgga tgtactgctt caccaaggac tgagccagtt tcgggtaatg tcgcccggat 3305281 gagcccagct gagcgcgagt tcgacatcgt tctatatggc gccaccggct tctccggcaa 3305341 gctgaccgcc gaacacctcg ctcacagcgg gtcaacagca cggatcgcat tggccggtcg 3305401 gtcaagcgaa cggctgcggg gcgtgcggat gatgttgggc ccgaacgcag cggactggcc 3305461 gctgatcctc gccgacgcat cccaaccctt gacgctcgag gcgatggccg cgcgggccca 3305521 ggtggtgctg accacggtcg gcccctacac gcgttacggc ctgccgctgg tggcggcctg 3305581 cgcgaaggcc ggaaccgact atgccgacct gactggcgag ttgatgttct gccgaaacag 3305641 catcgatctg taccacaaac aagccgccga cacgggcgcc cggataatcc tggcgtgcgg 3305701 attcgattcg atcccttcgg atttgaacgt gtatcagctg taccgtcggt ccgtcgagga 3305761 cggcaccggt gaactgtgtg acaccgacct cgtgctgcgt tcattctcgc aacgctgggt 3305821 ctccggcggc tcggtagcaa cgtattccga agcaatgcgc acggcatcca gcgaccccga 3305881 ggcccgtcgg ctcgtcaccg acccgtacac gctgaccacg gaccggggcg ccgaacccga 3305941 acttggtgcg cagccggatt ttcttcggcg tccaggacgt gatctggcgc ccgaacttgc 3306001 cggcttctgg accggcgggt ttgtgcaggc tccgtttaac actcgaatcg ttcggcgtag 3306061 caacgcatta caggagtggg cttatggccg gcggttccgc tactcggaaa caatgagtct 3306121 gggaaagtcg atggcggcgc cgattctcgc cgcagccgtc accggcactg tggcgggcac 3306181 catcgggttg gggaataagt atttcgaccg actaccccga cgattagtgg agcgcgtcac 3306241 gccaaagcca ggcaccggtc cgagccggaa aacgcaagag cggggccatt acaccttcga 3306301 gacgtacacc accacgacga ccggtgcccg ctacagggcg actttcgcgc acaacgtcga 3306361 cgcgtacaag tcgaccgcgg tgttgctcgc gcagagtggt ctggcgctgg cgctcgatcg 3306421 cgatcggctc gccgagctgc ggggggtgct cactcccgca gcggcgatgg gcgatgcgtt 3306481 gttggcgcgc ctcccgggcg ccggcgtggt catgggaacg accaggctga gctaacatct 3306541 ccaccccggc cgccagcaag attagctatg ccatgggcac attagcccaa tcctgttctc 3306601 ccagatctgg gcctttgccg ccgagaatca aactcctgac gacaacccac gttcacatgt 3306661 gggcttcagc accggcgctg caccatcgga agctcctcga ccagggtcgg caggttcagc 3306721 ggagcccgcg aagcgacaaa caccgcacga gccaaacctg tggaagctat cggtccgttc 3306781 gcccgccaat ccagtggaaa ctgccggtgc cggggctgcg tagcggtcac atatacgtgc 3306841 ggcatcttct ccctcagtcg gttcatcacc cacacgcgcg aaggccgaca ccccgtgccg 3306901 gttatcgcct ggctcggcga ggaagccctc tcactaacca gaaaaggctc gtcttcgccg 3306961 gaatagctca cgcatgtctc cagcagcaga aggtccactg cgcggtcaca catccacgcc 3307021 agcgcctcgg caggacggga gaggtggtag agcactccat agcagtacac cacgtcgtat 3307081 tggtgcgctt ctgctgggag atcgccgtcg agatctaggt ggtcgactgt gacattggga 3307141 ttggacccga agcgttggcg aatgacatcc agattctccc cccggggctc ggtgcagagc 3307201 accttgcacc cgcggtcgag gaagaactgc gtgtgatcgc cgatcccggc accaacctcc 3307261 agcacgctct tgttgccgag gtcgagcccc agcgtggcca ggtgctcctg acggcgggcg 3307321 ttgtgccgaa ggtaaaagat gctgtgaaaa tgccgttccg cagtcgggcg caacatgccg 3307381 gggagtcgca tcaggcgagg atagcgcacc tcgccgagga gaaacgctcg ccgatggcta 3307441 tcgagctgct gacacgtctg cggccccagc gtatcgacct cctcgagccc acgcccgctt 3307501 ggccctggac gccaccgcca cccagcagcg caaacgcccg gggcaagcac tcgaggtaga 3307561 gagcggcagc cggcgccagc tatcccttcc ggctcggaat aaagaagtag cagtaccggt 3307621 cgtcgcggtg ccgttggtag ggctgcagcc ccgcatcgtc ggcgtagacg aacggttcgt 3307681 aaccgtacgc gcggatgtcc gcaatggtcc gttccgggtc cggattagag gcggcgcccc 3307741 cgtagatctc caccagcagg accgggcgat cgcgccgcag aagctccgcg gcgcccgcga 3307801 tgaccgcgcg ctcgaggccc tcaacgtcga tcttcagcag acccaccggg aggggcagct 3307861 cggcggcgag cgcgtccagc gtggtacacg gcacccgtgt ccgctcgcga atccgaattc 3307921 gtcccgtgtc gtttagcgaa ctgaaggcgc tgtcggccgc cacgaaaaag tcgacctcgc 3307981 cgaccgcgtc cccggcggcc gtccgcagcg tgcggatgcg gtcttgcagg ccgttggcgg 3308041 ccacgttggc ctccaaccgc gaatgggtgc ccggcgccgg ctccagggct accaccgggg 3308101 ctaacctcgc ccaggccagg ctgtgtatgc cgacgttggc tccgacgtcg aggatgcagc 3308161 ggtctgggta gagcgcggaa tagagcgccg ccgcgatgtc gatctcggtc tcctcgaacc 3308221 cgccggtcaa ccgaacgatc cacgcgatgg ccgaccccgg ctcaagggtg acctggaggc 3308281 cgcgccaata ccagggggcc agccgccacc gatggggggg caagccaaac ggccgccagc 3308341 gttgcaggct tcgaacgagg cggtttggca tggcgcactc taacatccgg atcgcccgca 3308401 tccggtaggt cggccgttga gctccgaggt tctcgaaaca accagtggtg cccagatcca 3308461 aagggcgcca acgccgctgg cccttcgccg gcccaagccg tctgcacact accacccgca 3308521 tcaggcgcac atcttggaac tgcaccaggt ccaatcgtca gcagcgcctg gcgttgtgac 3308581 cgaacctcgg gtccgcagac ccactgcaat gttgcgcgac ccaaactatc ccccggggcg 3308641 gagtatttag cgtgttagtg ttgcacagtg aaatcgttga aactcgctcg tttcatcgcg 3308701 cgtagcgccg ccttcgaggt ttcgcgccgc tattctgagc gagacctgaa gcaccagttt 3308761 gtgaagcaac tcaaatcgcg tcgggtagat gtcgttttcg atgtcggcgc caactcagga 3308821 caatacgccg ccggcctccg ccgagcagca tataagggcc gcattgtctc gttcgaaccg 3308881 ctatccggac cgtttacgat cttggaaagc aaagcgtcaa cggatccact ttgggattgc 3308941 cggcagcatg cgttgggcga ttctgatgga acggttacga tcaatatcgc aggaaacgcc 3309001 ggtcagagca gttccgtctt gcccatgctg aaaagtcatc agaacgcttt tcccccggca 3309061 aactatgtcg gtacccaaga ggcgtccata catcgacttg attccgtggc gccagaattt 3309121 ctaggcatga acggtgtcgc ttttctcaag gtcgacgttc aaggctttga aaagcaggtg 3309181 ctcgccgggg gcaaatcaac catagatgac cattgcgtcg gcatgcaact cgaactgtcc 3309241 ttcctgccgt tgtacgaagg tggcatgctc attcctgaag ccctcgatct cgtgtattcc 3309301 ttgggcttca cgttgacggg attgctgcct tgtttcattg atgcaaataa tggtcgaatg 3309361 ttgcaggccg acggcatctt tttccgcgag gacgattgat tggaatcgct tcgcgaggcc 3309421 cggcaccaga ccgggcacca gaggtccgcg cagatcgcct gggtcgaaga tggtgcagac 3309481 gaaacgatac gccggcttga ccgcagctaa cacaaagaaa gtcgccatgg ccgcaccaat 3309541 gttttcgatc atcatcccca ccttgaacgt ggctgcggta ttgcctgcct gcctcgacag 3309601 catcgcccgt cagacctgcg gtgacttcga gctggtactg gtcgacggcg gctcgacgga 3309661 cgaaaccctc gacatcgcca acattttcgc ccccaacctc ggcgagcggt tgatcattca 3309721 tcgcgacacc gaccagggcg tctacgacgc catgaaccgc ggcgtggacc tggccaccgg 3309781 aacgtggttg ctctttctgg gcgcggacga cagcctgtac gaggctgaca ccctggcgcg 3309841 ggtggccgcc ttcattggcg aacacgagcc cagcgatctg gtatatggcg acgtgatcat 3309901 gcgctcaacc aatttccgct ggggtggcgc cttcgacctc gaccgtctgt tgttcaagcg 3309961 caacatctgc catcaggcga tcttctaccg ccgcggactc ttcggcacca tcggtcccta 3310021 caacctccgc taccgggtcc tggccgactg ggacttcaat attcgctgct tttccaaccc 3310081 agcgctcgtc acccgctaca tgcacgtggt cgttgcaagc tacaacgaat tcggcgggct 3310141 cagcaatacg atcgtcgaca aggagttttt gaagcggctg ccgatgtcca cgagactcgg 3310201 cataaggctg gtcatagttc tggtgcgcag gtggccaaag gtgatcagca gggccatggt 3310261 aatgcgcacc gtcatttctt ggcggcgccg acgttagcgc gataccaccg caacgttgac 3310321 tcgatgccct tgggcggcgt gatcttgggt ggccaacccg cctcttgcaa gaccgacacg 3310381 tctaacagct tgcgtggtgc ggcgcctgtc aagctctttg cgccagtgtc tcattatgtg 3310441 gacgctattt cggatctggg gtgggcgggt tgatccatgc cgcggtcgcc ggtttcgggg 3310501 gttgcggtga gacgccgaat ggattcgggt tggccgagta ggcgttggcc atggccgcgc 3310561 ccatgtggcc tggccaggtg cgggcgtgtt cgatcgaatc gaaccgttca ggagagtcgt 3310621 tgcggtactt cagcgttttg aactgcgaca cgctcccgaa tcaggtgctc gaccggacat 3310681 ccgttggcta gccggcgata tcgtgggcac cctttagcag acgagccgca gcgcactttc 3310741 gatgtgctgc gggaatccgg caaagtctgg tccgaaggct tcggcaagcc gccgggcggc 3310801 ttgtcggaac tcggccccac tgagcacctg ctttacggct gccgccacgc cttcagtgtt 3310861 gagccgctcg gttcgcagga gaacgccggc gccggcccgc tcaagggcct ccatgttcaa 3310921 gtgctggtcc atgttgctgg ggagcccgat caccggcacc ccggccgcca acgcctgctg 3310981 cgtcgtcggg ctgccgccgt tgcagagcac cacggcggag cgcgctgcag ccgcttcgcc 3311041 cggcaggtag tccgcgacga aggcgttggc cggcacgttc ttcaggtggt tccggccagc 3311101 ggtggccgcg atcaccgtga cgggtaaatc ggccagggcg ttcaaaacca cctgcaacag 3311161 gttctttccg ccggaactgc cgagggtcgc ataaataatc ggccggtctg tcggcagcga 3311221 gtgccaccaa gtcggcggtt ttacgtcggg cgaccacagg acgggtccga gatatcgatg 3311281 gttggccggc aggttgtatg tcggcaccag ctcgggtacg tcggcataca gggtgtagtc 3311341 accgtcggtg aaaatgcggc acaaatccca gcccagactc gacagcccgt gcttccggcg 3311401 gagccagttg agcgggagac aatagagggc aaagatcaac ggacggtaca ggcggtacag 3311461 gatgctgacc ggcctgaccc cgaagaagcg ggtccacggc acgtctggca gcggaaaccg 3311521 acggcgggcc tgaggactcc agtaggcgtt cgcgatggcg atgtacggaa tgccggctag 3311581 tcgggcgctg accgagagcg aaagacggtt gtcaccgacg actacgtccg gtgcgatctc 3311641 gttcaggatc ttcctgtcag ccgcgatgta tttgcgcaac gtccgcgtgt tgtagaagag 3311701 gcggccctga gcgattttaa ggagaacctc ctcgctgggg acggtgtgaa tcgggtgatg 3311761 tgggaacggg agcgggccca aaagcttatt gaaccgcggg tcgcaggcaa agtggacctc 3311821 ataacgactc gggtccagcg accgcgccaa cacgaacggc cggacgacgt gggccagggt 3311881 cgcggcctcc cctacaaaca ggatccgttg cctgcgagcg acaggctccg gtgcggcgtt 3311941 gggcgccgtg ctcgtcccag cgtccggtcc cgggtcgccg gcgacgcttg tttcctccat 3312001 actcgccccc taatctcgag gcagcccgta cccgcaggca acctcccaaa aatgcaatcc 3312061 cccaaaatgc aatgcgtcga gctatttctc acaccgaccg ctagttgcgg atcagaaatc 3312121 cgttgggcgc ggaagtccag ccgaatttgt tctcccgctc cgcatcatgc ttgtaatcgt 3312181 ttggaaattc atcctcatat gcctcgatcg cttcataggg tccaggccca aacccgggca 3312241 ggactgggtg gccgttgatg ttggaatcct cgactactag gtagtcaccg gcggagagta 3312301 gcggccgtag taatttcatc tcggccagca catgattcat cgagtggtcg ctatctaaga 3312361 tggcgaagat cttgccaggg tattcgtttt tgaggcgttg aatttgttcg gcaatcgccg 3312421 ggtcggtgga tgacgattca acgaacaaaa catctggttc gcgccgggct cttggatcga 3312481 gggctttgtg tgagttgtcc acggtaagta ccttgaatgg ctggccgatc tgcctcatga 3312541 tgttggcaaa atacaccgcc gagccgccgt agcgggtgcc gaactcgatg acgagggatg 3312601 gttgcaactc gctcaggatc tcctggtaat tccacatatc gctgacggat ttccagcaat 3312661 tgatccccat ataagtggtc ttcgtccaca ctaagttgcc gtagtaccac ttgtggtatt 3312721 cttccgccac tgcgtcgctc ggccggtaga ataactgggc cgcaaaactc gccactaacc 3312781 tgactagtcc gatcagttgc cctacaagac tagtccgact gcgccacact agccccattc 3312841 catcatctcc tcactgcgaa accgtagtca gtcgaatgtt ggtcatttag caagcctctt 3312901 taagagaact gatgaggtcg aagcggactc aatacatggc tgcggcaatt cgttagaccg 3312961 cgttcgcgcc cacgttgtga gctccgcgcg ccgcatcctt ggggctcggt gccgggcata 3313021 cgcgacccag cttgcggctg agcatcttct ggacaccgcc accgcacggc ggatggtagc 3313081 aacagattgg ggttaccctc aaaccgcggg ttatggactg ccaaaggtag ccagcttgtc 3313141 ctgctcgcgg tgacagcgca accacgggta gtgacactac cgccgtggcg ttcctcccca 3313201 cggcagaagg ccggggccgg tcgagttcgg gcacaagccc cagatcgtcg acaacgacga 3313261 tggcatcgtc ctggatcaca ccgtggagca cggcaatccg catgacgcgc cgcagctagc 3313321 gcccgcggtc gaacggatca ccacacgcgc cggacgcccg cccggcaccg tcaccgccga 3313381 ccgcggctac ggcgagaaac gcgtcgaaga tgacctgcac gacctcggtg tacgtacggt 3313441 cgcgataccg cgtaaaggca gaccctccca ggcccggcgc gccgaagaac aacggccatc 3313501 gttccgacga acagtcaagt ggcgcaccgg cagcgaaggc cgcatcagca ccctcaaacg 3313561 aaactacggt tggaaccgct cctgcatcga cggcaccgaa ggaacccgga tctggaccag 3313621 gcacggcatc ctcacccaca acctcatcaa gatcagcagc ctcgcagcat gacccggctc 3313681 ccagagcacg aagctctgcc ccaccaacag tccggcggca ttcgcccaca aacgactcac 3313741 ttagtcgccg tcactttttc aggtcgaagt aactagctgg ccaaccatgt ccggggccgg 3313801 ttctccggca tgaggcgcag agcattctcc acatgctgcg ggaatccaac gcggtctcgt 3313861 ccgaaggcat cggcgagtcg cgcggcggct tgtcggtact cggaccgact gatcacctgc 3313921 atcacggccc ctgccacccg ctgactcttc agccgctcag ttcgcagcag cacgcccgcc 3313981 ccggcccgct caacggcctc catattcaag tgctgatcga gattgcccgc gaccccgatc 3314041 accggcaccc cggccaccaa ggcctgctgg gtcgtcaaac tcccgccatt gcagaccacc 3314101 acggccgagc gagccgcagc ggcctcaccc ggcaggtagt ccgccacgaa ggcgttggcc 3314161 ggcacggtct tcaggtcact gcggcccgcg gtggccgcga tcaccgtcac cggcaactca 3314221 gccaacgcgt tcaacaccag ttgcaacaga tttctcccgc cggacgtgcc cagggttgcg 3314281 tacacgatcg gccggtcggt tggcagcgaa tcccaccatg tcggcggctt cccggcgggc 3314341 gaccacagga ccgggccaag gtactcgtgg ttggccggca agtcgtaggt gggcatcagc 3314401 tcgggcacgt cagcatacag ggtgtggtcc ccgtcggtga aaatgcggca caggttccac 3314461 cccagactcg acagcccgtg cctgcggcgg acccagttga gcggcatgca ctgcagggcg 3314521 aagagcaaag ggcgttccag gcggtagagg agcttgacca acctgacgcc gaacaagcgg 3314581 gtccatatca cgtcgggcag cggaaaacgc cgctgcgcgt acggactcca gtaggcattc 3314641 gcgatcgcga tgtaaggaat gccggccagt cgggcgctga ccgacagtga aatgcgaagg 3314701 tcaccgacga cgaggtccgg cgcgatctca tccaggaccc gcaggtccgc ctcaacgtac 3314761 ttccgcagcg tccgcatggc atagaaacga ccctgagtca gattgccgaa aaaccgctcg 3314821 ctggggatgg tgtgaatcgc atggtgacgg aaagggagcg gacctagaag ctggttgtag 3314881 cgcgggtcgc aggcgaagtg cacttcataa cgactagggt ccagcgactg cgcaagcgcg 3314941 aatggccgga cgacgtgagc cagggtcact gcttccgcga cgaaaaggat ccggcgcctg 3315001 cgtgcggcaa gcccaggtgc ggcgtccggt gtcgtgctga tggccgcgtc ccctctcacc 3315061 tcgctagcaa ccggtggccc gccccacctc gacgccgtag cgtacacgca cgacacgcgc 3315121 actcggggaa aacctcggca agagtggggc ggcgatacgt ttagcggcac cactgcgcgg 3315181 tcgttgccca ccccggtgac tatacccccg ggtggtatat ggtggagggc agagcgtgac 3315241 ctcaaccaaa gtggaggacc gagtgacggc agcagtgctg ggagcgatcg ggcacgcact 3315301 ggcgctgacc gcgtcgatga cctgggaaat cctgtgggcg ctgatcctgg gcttcgcgct 3315361 gtcggcggtg gttcaagccg tggtgcgccg ctccacgatc gtcacgctgc tcggcgacga 3315421 tcggccgcgc accctggtaa tcgccaccgg cctgggcgcg gcctcgtcgt cgtgctcgta 3315481 tgccgcggtg gctttggctc ggtcactatt ccgcaaaggg gccaacttca ctgccgctat 3315541 ggcgttcgag atcggttcca ccaacctcgt ggtggagttg ggcatcatcc tggccctgct 3315601 gatgggctgg cagttcaccg ccgccgagtt cgttggcggt ccaataatga tccttgtcct 3315661 ggccgtgttg ttccggttgt tcgtcggcgc ccggctcatc gacgccgccc gggaacaggc 3315721 cgaacgggga ctcgcaggct cgatggaagg ccatgccgcc atggacatgt ccatcaagcg 3315781 ggaaggctca ttttggcgac gactcctttc cccaccggga tttacctcca tcgcccatgt 3315841 gttcgtgatg gagtggttgg cgatcctgcg cgacctcatt ctcgggctgc tgatcgccgg 3315901 tgctatcgcg gcatgggtac ccgaatcgtt ctggcagagc ttctttttag ccaatcatcc 3315961 ggcctggtcg gcggtctggg gtccgatcat aggacccatc gtggccatcg tttcgtttgt 3316021 ttgctcgatc ggcaacgtgc cacttgccgc ggtgctgtgg aacggaggca tcagcttcgg 3316081 cggggtcatc gcgttcatct tcgccgacct actgatactg ccgatcctga atatctaccg 3316141 taaatactat ggcgccagga tgatgctggt gctgctcggc accttctacg catcgatggt 3316201 cgtcgctggc tatctcatcg aacttctctt cggtacaacg aatctcatcc cgagccagcg 3316261 cagcgctacg gtcatgaccg cagaaatatc gtggaactac accacctggc tcaacgtcat 3316321 ctttctggtg atcgcggcgg ccttggtggt ccgattcatc acatcgggcg gtctcccgat 3316381 gctacgcatg atgggcggct caccggatgc cccgcatgac caccatgacc gccacgacga 3316441 tcacctcggc cactagcgcc accacgccga tcagtcggcg ccgaaaaggc caccggcggc 3316501 ggtatcctgg cctgcgggta ttccacccat gggcaaaggg agcatgaccg cgcacgcaac 3316561 gccgaacgag ccggattatc cgccaccgcc tggcggtcca ccgccgccgg ccgatattgg 3316621 ccggttactg cttcggtgcc acgaccgccc tggaatcatc gccgcggtga gcaccttcct 3316681 ggcccgggcc ggcgccaaca tcatttctct ggaccagcac tccaccgcgc cggagggcgg 3316741 aacgttcttg cagcgcgcaa tctttcacct gcccggtctc acggccgccg tcgacgaact 3316801 gcagcgcgac ttcggcagca ctgtggcgga caagttcggc atcgactacc gatttgccga 3316861 agcagccaag cctaagcggg tcgcaatcat ggcatcgaca gaggaccact gcttgctgga 3316921 cttgttgtgg cgcaaccgtc gcggcgagct agaaatgtcg gttgtcatgg tgattgccaa 3316981 tcatcctgac ctggccgcgc acgtacgccc gttcggtgtg ccattcatac atattcccgc 3317041 cactcgcgac actcgtacgg aagccgaaca gcgtcagctt cagttgctaa gcggcaatgt 3317101 ggatttagta gtgctggcac gctacatgca gatactcagc ccggggttct tggaggcgat 3317161 cggctgcccg ctgatcaaca ttcaccattc gttccttcca gccttcaccg gcgcggcccc 3317221 gtaccagcgc gcacgagaac gcggcgtcaa actgatcggc gcgaccgccc actacgtgac 3317281 cgaagttctc gacgaggggc ccatcatcga acaagacgtc gttcgtgtcg accacaccca 3317341 caccgtcgat gatctggtgc gtgtcggcgc cgacgtcgaa cgcgcagtgc tttcccgcgc 3317401 cgtgctctgg cactgccaag accgcgtcat cgtgcatcac aaccagacca tcgtcttctg 3317461 acatgggtga ctgcgcgcgt tgcggtcaac ttcttggtgc ccatgatggt cacggcgtcg 3317521 actggccgtt tcggcgccgt cgcccagcgt gaactgaggg cggaaaatcg gctggcccga 3317581 atctcgcccc cagtgcacgc tcggcgccgt ttggcctcac ccggtcaacg tgaactgtcc 3317641 gggtgggcgc tgtcacgtag cgagcccacg tggggccggg gtcggcccgc caaaaacgcc 3317701 ccggcgcggc cagctcatga gcgagtacgc aagctcaagg gacacccgct ttgcactgtg 3317761 gaagaacccc gaagacctgg cctgcggcag gtgcggtcaa aggagcggag tgtagacagg 3317821 accggtgggt ctgctcagcg cggccccgaa ttaggacaat tttcgcacct agcgcatcca 3317881 atatcgcttt cgaagaacgt tcacgccagt cccactgggc cggtgcgaat ggtgcaacgc 3317941 gcctttcgtc gaaggaaacg ccgtccgcca ccgagcccgc gctaggcaag tcggtcccaa 3318001 gaacgtcgca aggatacgcc aagcggccgc ggtcaatctt gacttgtcgg ccaccgccgg 3318061 caaaccaaca ttcagccaca acgcgacaga gaggtaccca atgttcactg cccgtatccg 3318121 cgccctcgcc ggcatgtctc tgctagcctc ggcgatcgga ctggcggcct tcggagccgc 3318181 taccggcacc gccaatgccg ccccgaccca ccaacccgag tggggcacct acacctgcta 3318241 cgactacgca acccagacgt tctacgagtg ctttgacccc agctagtcgg cgaaggcctc 3318301 acacgatcgg acctagtccc gcaaaggagc taggtccgtt cggtgttgag cctgtcccgc 3318361 agccggcgat tcaccggttc gggcagcaac tcggacacgt caccgcccag catcgcgact 3318421 tctttggcca gtgaggacga cacgaacgaa taccgtggcg cggtcgcgac gaaaaaggtg 3318481 tccacaccgg caatgtgttt gttcatttgc gccatctgca gctcgtattc gaagtcggtg 3318541 ccggtgcgca gccccttcac gatcgcggtc atcccgcaag acctgacaaa gtcgaccacc 3318601 aagccatgcc cgacctgcac gcgcagattg ggcaggtgcg ttgtcgactc cttgaccatc 3318661 gcgatccgct cgtcgaggtc gaacatgccc gtctttgcag ggttgaccag gatggcaacc 3318721 accacctcgt cgaattgggc tgcggcgcgt tcgaaaatgt cgacgtggcc taacgtcacc 3318781 gggtcaaatg accctgggca taccgcgccc gtcatctgcg ccgctcctcc tcatcgctgc 3318841 gtcccccgca agcgggcacg gcccccaccg catcgtcgcc ggcggtcatg accgatgacg 3318901 ctacacgttg gcaaaaagcc gttcggccag ttccaaacgg gtgtcgccgt aaacacgctg 3318961 gggccatcgg cgccagccct ccggccacgt caacggcgcg cacgtggtcg cacgctccac 3319021 caccgctacg gttccctcgc gcgtccagcc gttggtgccc agtgcggcca ggatggcgtc 3319081 aacgtcggcg gagtcgacgt tgtagggcgg gtcggccaac accagatcca ccggggacgt 3319141 ggtcccggcc gccacgacgg ccgccaccgc gccccggcgc agcgtcgcac cggagagacc 3319201 tagggcctcg atgttgcgcg caatgacggc cgcgctgcgc tggtcggact ccacgaacag 3319261 cacggacgcc gctccccgcg acaacgcctc cagccccagg gcgccggaac ccgcatagag 3319321 gtccaacacc gccagaccgg tcagatcccg ccgcgcagtc acgatgttga atagcgactc 3319381 gcgcacccga tcggtggtag gtctggttcc gcgtggtggg acggcaatgc gccggcctcc 3319441 ggcgacaccg ccgatgatcc gggtcaagtg cgccgctctc cctcgcaagc gggcggtacc 3319501 cccacctcat cgcttcgtcc cccgcaagcg ggcggtaccc ccactgcatc gtcgccggcg 3319561 gtgctcatct gcgccgctcc tccgcaagcg ggcggtaccc ccacctcatc gcttcgtccc 3319621 ccgcaagcag gcggtacccc cactgcatcg tcgccggggc ggtcagctca ccaccaccaa 3319681 caggtctccg ccctccacct gggcggtgtc cgacaccgcc acccgctcca cggtgccggc 3319741 aaccggggcg gtgatcgggg cttccatctt catcgcctcg atggtggcga tggtttggcc 3319801 ggcgccgacc cgctcgccga cgcacacccc gaccgtgacg actccggcaa atggcgcggc 3319861 gatgtgtccg ggattgccgc ggtcggcctt ctcggcggcc ggaacggcac tggcaatgct 3319921 gcggtcgcgc actagcaccg gccgcagctg cccgttgagg atgcacatca ccgttcgcat 3319981 gccgcgttcg tcgggttcgg aaatggcctc cagcccgatc aacagctcca ccccacgctc 3320041 cagcttcacc cgatgctctt caccttggcg cagaccatag aagaactggt tggccgacaa 3320101 ttgcgacgtg tcgccgtagg cttcccggtg ctcattgaat tcctttgttg gactgggaaa 3320161 taacagcctg ttcagggtgg cctgacgctt ggctccgacc gacgataggg caatctcgtc 3320221 gtccgccgcc aattgcgcag tgggcctggc cgccccgcga ccggccagcg ccgcagtgcg 3320281 cagcggttcg ggccacccgc cgggcggatc acccagctcg ccccgcagaa atccgagtac 3320341 cgattccggg atgccaaatc gcgctggatc ggaggcgaat tcgtctgcac tgacaccggc 3320401 gccgaccagt gccagcgcca gatcgccgac caccttggac gttggcgtga ccttaaccag 3320461 cctgcccaac actcggtcgg cgcccgcgta ggcctcttcg atctcttcga atcgatctcc 3320521 cagaccaaga gcaattgctt gctggcgcag attggacagt tggccgcccg gaatctcgtg 3320581 gtgataaacc cgccccgtcg gccccggcaa cccagactcg aacggcgcat acacttttcg 3320641 taacgcctcc cagtacggct ccagggcgca caccgccgaa agcgacaggc cggtgtcgta 3320701 ctcggtgtgg gcagcggcag caacgatcga gctcagcgcg ggctggctgg tcgttcccgc 3320761 cagcggcgcg gcggcgccgt cgacggcatc ggccccggcg tgccaagcgg ccacatagct 3320821 ggcgagctgg ccacccggtg tgtcgtgggt gtgcaggtga acgggcaggt cgaagcgact 3320881 gcgcagggcg ctgaccaacc tttgagcggc cggcgggcgc aacagtccag ccatatcctt 3320941 gatcgccagc acatgggcgc cggcgtccac gatctgctca gccagtttca ggtagtagtc 3321001 cagcgtgtac agctgttcac ccggatcggt aaggtcgccc gtgtagcaca tcgcgacttc 3321061 tgctatcgca gaacctgttt cgcgtactgc gtcgatcgcc ggacgcatcg actcgatgtt 3321121 gttgagcgcg tcgaagatac gaaagatgtc gataccggtg gctgttgctt cttgcacaaa 3321181 cgccgacgtc acgatttccg ggtacggcgt gtagcccacg gtattgcggc cccgcaatag 3321241 catctgcaag cagatattgg gcattgctgc acgcagtgtg gccagccgtt cccagggatc 3321301 ctccttgaga aagcgcagcg ccacatcgta agtcgcaccg ccccaacact ccacggacaa 3321361 cagctgcggc atggtccgcg cgagatacgg tgccacccgc gacagtccgc tggtgcgtac 3321421 tcgggtagcc agtaacgact ggtgagcatc ccggaatgtg gtatcggtga ccccgaccgc 3321481 ggccgactcc cgcagccaac gagcaaatcc ttccggcccc aacttgacta gtcgctgctt 3321541 ggacccggcc ggtggtgcgg cccgcagatc aagatcgggc agcttgtcgt ccgggtagat 3321601 cgttgacgga cgcgagccat acgggttgtt gacggtgaca tcggccagga agttaaggat 3321661 cttggtgccg cggtcggccg aggcgcgcgc ggtcagcagc tgcggccgct catcaatgaa 3321721 ggacgtggtg acccggcccg ctcggaagtc cgggtcatcc aggaccgctt gcaggaacgg 3321781 aatattcgtc gataccccgc ggatccggaa ctccgcgatc gcccggcgcg cacggctcac 3321841 tgcggtaggg aggtcacggc cccgacaggt cagcttgacc agcatggagt cgaagtacgg 3321901 gctgatttct gcgcccaggt tggtgctgcc gtccaggcgg acaccggcac cgccggcggt 3321961 gcgcaacgcg ctgatccggc ccgtgtccgg ccggaagccg ttggccggat cctcggtggt 3322021 gatccggcac tgtagtgcgg cgccatgcgg tgcgatgtcc tcctgccgca ggcccaattg 3322081 ttcgagcgtc tccccggcgg caatgcgcag ctggctggcg accaggtcga cgtcggtaat 3322141 ctcctcggtc accgtgtgct ccacctgaac ccgcggattc atctcgatga agacatactc 3322201 ccctcgctcg tccagcagga actcgacggt gcccgcgcag ctgtacccga tatggcgggc 3322261 gaaggcgacc gcatcgacgc acatcttgta acgcaactcg gcgtccaggt gcggcgcggg 3322321 cgccagctcg atgaccttct gatggcgacg ctgcacactg cagtcacgct catagagatg 3322381 gatcacgtcg ccgaggttgt ccgccagaat ctgcacctcg atgtggcgtg gattgatcac 3322441 tgcctgctcg agatagaccg tcgggtcccc gaacgccgac tcggcttccc ggctggcggc 3322501 ttcgatcgcc tccggaagcg ccgcgatatc gccgacacga cgcatacccc ggcccccgcc 3322561 accggcaact gccttgacga acaacggaaa cggcatgccg gccgcaaccg acagcagttc 3322621 gtcgaccgag gccgacggcg ccgaggacat cagcacgggc aagccggctt cgcgggccgc 3322681 cgcgatggcg cgagacttat tcccagccag ctcaagcact tcggcgctgg gaccgacgaa 3322741 gctgatgccc gccgccgcgc atgccgcagc cagatccgga ttctccgata gaaacccgta 3322801 gccagggtag atagcgtcgg cacccgcccg acgggccgtc gcgacgatct cgtcgaccga 3322861 caggtatgca tgcaccgggt gaccgatgtc gccgatctgg taagactcgt ccgccttgag 3322921 acggtgctgc gaattgcggt cctcgtacgg ataaacggcc acggttccga cgcccagttc 3322981 gtaggcggca cgaaaggccc ggatcgcgat ctccccgcga ttggcgacga gcaccttgga 3323041 aaacacgtgt ggctccctta tccggatgtc tcagatcagc gtcgaccaat agtcccaaaa 3323101 gcggaccatg atcagcagga atactgtcgt gaaccagagc gtggccagcg accatcgcca 3323161 ttgatagagc agccgtgccc cgacgcgctc ctggcttccc cggttttccc gcatcggacc 3323221 gaaaacgatg gacgcgacca ccaccagcag tgtggcgatg accgcccaga ccaccatgca 3323281 gtatgggcac agggcaccga tacggtacag gctctggaat atcagccaat gcacgaacgc 3323341 cacaccaacc aggatcccga ccgccaggcc gatccaatac cacctgggca acggcacttt 3323401 cgccaccgcc agcaccccgg tgaccaccac cacggtgaag cccgcaatgc cgagaagcgg 3323461 gttgggaaag cccagcaacg acgcctgcgg tgtggtcatc accgagccgc acgacactat 3323521 cgggttgaca ttgcatgacg gcacatagat cggatcgagc agaatcctga ccttctccac 3323581 cgtgagcgtc atcgaagcga acagcccgat cacaccgccg atcagcaccc accacgcgct 3323641 aggcaccggc acccgcaccg cagccgggtc gccggatcgc tcggcaggtc gagctgccac 3323701 cacaatcgtc aggatgtcgc ggtagcagcg gccgagtcaa tgcccggcac atcacccaca 3323761 atttctttga tcttggcgac cagcgccgcc ggcgtcgacc actcgtactc tgtgccattg 3323821 acccggaccg tcggggtcgc gtgcacgttg accgccgccg ccagcccgtc gactttttcg 3323881 atgtacttgc cgctgttgat gcagtcgggc accttgccca cgacgccggc ttcgcgggca 3323941 agttcgatca accgcgcgtt gtcggggaaa tccttgccga gctcggcagg ctggatgtcc 3324001 ttgctgaaca aggcggcgtg gaagcggcgg aacgcctcga tcgattcgtc ggcaacgcaa 3324061 taagccgcag cagccgctcg cgacgaatag tgttgattgc tggcgctatc gagaatggcc 3324121 accatcgtgt aatcggccgc gacagcgccg atgtccacga gcttggacac ggttggcccg 3324181 aaaccgcgct cgaatatgcc gcacgccgga cacaggaaat cctcgtagaa ggacaccacg 3324241 gccttggggt tgctggttcc gggctgggtg accagcttgc tcgacgtcac ccgtactgca 3324301 tcgccggggc ccgcgacgcc gtccttcttg tcgtcgcgcg acgtcacgat gtagaagacc 3324361 aggacgacgg caaaaacgac gacgatggtg gtgccaccaa tctggacgag ccggccgaag 3324421 ctgccgtcgg cggacttcag atcgaatcgc ggggggcgtt tggatttgtc ggccacagtt 3324481 tcgctgatcc tcacgtgctc gatttgtcgg cttgtcgcgg ccgcggtcag gcgacggcgc 3324541 ctctagcgta ccggcggcaa gccagcctcg actcaaaccc ggctaaggtg cgcgcgcagc 3324601 gcggagatca gctcgttggt cccggcagca ctgcctcccc ccagctgaaa caggttgagg 3324661 aagccatgcg tcagcgaacc cagataccgc aagtccactg cagtcccggc agcccgcagc 3324721 gccttcgcat agctttctcc ttcgtcgcgc aatgggtcga agccggcgac cgcgatgaga 3324781 gcaggcgcca gcccggacag cgattcggcc aacaacggcg acaaccgcgg atccgccgga 3324841 tcgacatcgg aatccctgag gtattgcgtg tggaaccaat cgatgtcccg cttggtcagc 3324901 aggaagccat tgccgaacag gcccattgag cgagtctgtg cggtgaaatc ggtcctggga 3324961 tacagcagcc actgcagcac cggggtgggc ccaccctcgt agcgagcctt gtcgcgcgcc 3325021 aactgacaca ccacggccga caggttgccg cccgcactgt ccccgcccac cgcgacccgc 3325081 ccggggagcg caccgaactc atcggaagcg tgctcatggg cccatacaaa agccgcatag 3325141 gcatcttcaa ccgcggccgg cgccggatgc tcgggagcca accggtagtc gatcgacagt 3325201 acctggatgt cggcgtcgcg acaggtcaac cggcacagcg cgtcatgggt gtccaagtcc 3325261 ccgagcgtcc agccgccacc gtggtaaaag accagcagcg gcgtggcgcc accgccgctg 3325321 gggcggtagt gccgcgccgg gatctcaccg gctggtccgg gtattgacag gtcggtcacg 3325381 tcgacgtgga tctgcggacc gggcatcgcc tcgcatatcg cgcgcatgtg cgcgcgagag 3325441 gcgacgatgt cgtcgtctac ggccaggccg tcgacaccga agatccgcga agtcgacaac 3325501 atcagctgca gggtggggtc aagcgtattg ccatcgataa tgaccgatcg gccggccgac 3325561 aggatccgtt tggcaggcgt cgggatccac ggaaggacct tgactccgac gttgacgacg 3325621 gtgccctgca cacgccgtgt ccacatgcgc gggtggtttg ctccgagacg gaggtctgcc 3325681 acacctggca gactcttggt catgggctgc tccctacaaa actctgtcac gcgcagcaac 3325741 ggacactcga tccgcgccgt caggctggat gtctttcggg tcctgccggc cgacaccggg 3325801 caagcggtag gtgccgcgag tccggcgtgc ccaacggcca actctacgtg gtgaccaaag 3325861 tgttgaatgc cgaccagcac tattcgcggc ttacgccgcc gtcgccgaag gctgtggctc 3325921 agcacctgcc caggtgttga ttaggtggca tatccaactc ggtaatatcg tgatccccaa 3325981 gtcggtgaac ccaatgcgga ttgcgagcaa cttcgacgcg ttcgatttcc ctcgctcgat 3326041 gacggaaccc ggcttggtcc gaatccgaaa accttcaatt tcacaggcag gtgagatgac 3326101 gtgactggcg agtcgggcgc cgccgccgca ccctcgatta ccctcaacga cgagcatacg 3326161 atgccggtgc ttggcctcgg cgtcgcggaa ttgtcggacg acgagaccga acgtgcggtg 3326221 tccgcggcgc tggaaattgg ctgccggctg atcgacaccg cctacgccta tggcaacgag 3326281 gccgcggtcg gccgcgcaat tgcagcctcc ggcgttgccc gcgaagagct gttcgtcacc 3326341 accaagctag ccacccccga ccagggtttc acccgttccc aggaagcatg tagagccagt 3326401 ttggaccgcc tcggcctcga ctacgtcgac ctttacctaa ttcactggcc ggccccgccg 3326461 gtgggcaagt atgtggacgc ctggggaggc atgattcaat cccgcggaga gggccatgcc 3326521 cgatcgatcg gcgtgtccaa cttcaccgcg gagaacatcg aaaaccttat cgacctcaca 3326581 ttcgtcacgc cggcggtcaa ccagatcgag ctgcacccgc tgctcaacca ggacgaactg 3326641 cgcaaagcta acgcccagca caccgtcgtc acacagtcct actgccccct ggcactcggc 3326701 aggctgctgg acaacccaac cgtcacatca atcgccagcg aatacgtcaa gacgcccgca 3326761 caagtgctgc tgcggtggaa cctgcaattg ggcaatgcgg tggtcgtccg ctcggccaga 3326821 cccgagcgca tcgccagcaa cttcgacgtc ttcgacttcg agttggcggc cgaacacatg 3326881 gatgcattgg gcgggctcaa tgacggcacc cgggtgcgcg aggatccact gacctacgcc 3326941 ggcacctgat acgccgccga ctgtgaaccg cgcgacgtct cctcggcgtg tcacgtcgtg 3327001 agattcaccg tcggcgcgtg gactagcccg tcgggcaggt ggccgcggcc tgacgcagta 3327061 cgtcggacga tggctgatcc actggcagtg aatagccgcg cagcacggcg atgaattgca 3327121 tcgcgtactg acaggcgaag gccttgttgg gtggcatcca ttgggccggt ggcgaatcgc 3327181 ccttgtcctg atttgcctgc ccctgcacgg ccagcaggtt ggccggatcg ttggcgaagc 3327241 gcattcgctc ggagttcggc caccgatagg cgcccatgtc ccaggcatac gagagcggaa 3327301 cgatgtggtc gatctggacc gattggccaa cactggcgcc gcgttggaag gcaacggtgg 3327361 tgttggtgta cggatcgcgc agggtgccgg tggccaccgc attcggacac cgcttgatcg 3327421 acacatatgt cttgtcgacc agatcccggt cgaggatgtc gtcgcgggtg tcgcacccgt 3327481 tgtgccctcc cggcgcgtca ttgcgatcgt cccaggggtg accgaatgcg gacctgcggt 3327541 agtcgtagcg gtggatccgt ttgggtagca cggcgatgcc ggcgagcacg tcggcaccgg 3327601 gttgcacggt tggcacgcca gcgcgggcgg cgaactcgtc agcgtgcctg cccgccgatg 3327661 atcccagcgt ctgatacgcg accaccagcg ccagcgccgc gatcgccgac agccacagta 3327721 gcgttctgcg gttcatgact tatctaagta ttcgatgcgg tcggtgctgg tgaatcgcgc 3327781 ggccatcagc gccaatgcgg ggtctgtggg gttcttgtaa gcctcgatgc agaagtcccg 3327841 cgcggccact atgtattcct cgtgttcggc caatgacagc aaccgcagcg tgatggcctt 3327901 gccggattgg ttgcggccca gcacatctcc ctccttgcgc tccttcagat ccagatcggc 3327961 gagggcgaac ccgtccattg tcccggcgac cgcacgcagc cgctgacccg ccggcgtatc 3328021 cggcggcacc cagctggcca gcagacacac gctgggatgt tcgccgcgcc cgatgcggcc 3328081 gcgcagctgg tgcaattggc tgatgccgaa ccggtcggcg tccatcacca gcatgaccgt 3328141 agcgttgggg acatcgacgc caacctcaat gaccgtggtg cacaccagca catcgacctc 3328201 accggcccgg aaagccgcca tcgcagcgtc cttgtcgtcg gccgacaacc gtccatgcat 3328261 gagcgccaac cgcaactctg cgagctcggc ggaacgcaac cgggagaaca ggccttcggc 3328321 agtggccgat ggtcggacgc cgccttgaac gtcggtgtcg tcggactcat cgatgcgggg 3328381 cgccaccaca taggcctggc ggccggcggc agcctcttcg atgatgcgcc gccaggcgcg 3328441 gtcgagccag gcgggcttgt ccttgacaaa gatgacgttg gtggcaatcg gctggcgccc 3328501 gagcggaagt tcgcgcagcg tagaggtttc caggtcgcca tagacggtca gcgcgaccgt 3328561 gcgcggtatc ggcgtcgcgg tcatcaccag caggtgcggg gtaatgccgg cgggggcctt 3328621 ggcgcgcaac tgatctcgct gctcgacacc aaaccggtgt tgctcgtcga ccaccaccat 3328681 gcccaggttg tgaaagtcga cggcctcctg cagcagcgcg tgcgtgccga tgacgatgcc 3328741 gacctgaccg ctggcgattt cggcgcgaac ttgcttcttc tgccctgccg tcatcgaacc 3328801 ggtgagcagt gccacccggg tggcgttttc ggcgcctccc agttggccgc ccatggccag 3328861 cggccctagg acatcgcgga tcgatcgcaa gtgttgtgcg gcaaggactt ccgttggcgc 3328921 cagcagggca cactggtaac ccgcgtccac catctgcagc atcgccaaca ccgcaacgat 3328981 cgttttgccc gagcccactt cgccttgcag caggcgattc agcgggcggt tcgccgcgag 3329041 cccgtcggac aacacgtcga gcacctcacg ctgtcccgcc gtcagctcaa aaggcaaccg 3329101 ccgcagtagc tcagcggcaa gaccgttaga tttccaggcc gccgagggcc cggattccga 3329161 cagttcaccg tgccgtcggg ccaccagcgc ccactgcaga cccacggcct cgtcgaaggt 3329221 caggcgttcc cgggcgcgct cgcgtaacga ctggctttcg gcaaggtgaa tggcgcgcag 3329281 tgcctcgtcc tcggggatca ggccgtgctt ggcgcgtagt tccgcgggca acggatcatc 3329341 gacccggtcg agaacatcga gcacctgccg cacgcatttg aagatgtccc agctctgcac 3329401 ttttgtgctg gccggataga tcgggaagaa acgacgctcg aactcctcca cgaccaattc 3329461 accgctgatg gccttggagg catcagcgat acttttgagc gacctggtgc cgtggttctt 3329521 cccgtccggc gagtcgagga tgagaaacgc cggatgcgtg agctgcatcg cgcccttgta 3329581 gtagccgact tccccggaga gcatcacctt cgtgtgcttg gtgaggtccc gcatgatgta 3329641 gtccgcgttg aagaacgtgg ccgtcacctt gttgcggccg ccgccgacgg tgatgcgcag 3329701 acatttccga ttcggcttct ttttcatcgg aaacgaatac gtatcggtga tcacgtcgac 3329761 gatggtgatg tgctcgccag cttccggtcg cgcgtcaccg atacccaccc gcgccgcgcc 3329821 ctcgacgtag ctgcgcgggt agtggcggag caggtcgtcg acggtccgca tgccgaactg 3329881 ctcgtcgagg gcatcggctg ccgtggcgcc gaggacgcga tcgagccgat cgcttaacga 3329941 cgccaccgct actcgacccc gatcagcagc gcgtcgccgc ggtgtccggt gcggtaggag 3330001 accagctcgg tgcctggatg gtggtcgtgc acatgccgtt ccaggacgac agccacgtct 3330061 tcggttacgc cggcgccaat tagcaccgtc accagatcgc ctcccgatgc caacaacagg 3330121 tcgaccagac cgatggccgc cgcggcgaca tcgtcggcga cgatcagcac ctcgtcgccc 3330181 gcgataccca gaccgtcgcc cggcttgcag gtaccggccc aggtcagcgc cttttgggtg 3330241 gcaatgcgca ccgatccgtg ccgggaagca ccggcggcac gggccatgct gtagccgtcg 3330301 tcgacggcct ggcgggccgc gtcatgcacg gccagcgcgg ccaacccctg caccatcgat 3330361 ccggtcggca cgggtaccac gtcgacgccc cagccgatcg ccgcggtaca cccggccacc 3330421 agttcttcgg cggccacata gccattgggc agcaccatca cgtgcgcggc gccggtgtct 3330481 accacggccc gcaccagctg gtgggcactg atatcggcgg ccggtgtcac ggcgtctgga 3330541 cccggtcgca gcacgcaggc gccctccccg gcgaacagct cggcggcacc gtcgccgtcg 3330601 acgaccgcca gcacggcgcg gccccgcgtc cagccaccgg ccggcaatcc gctggtcccg 3330661 gaaccgagcg ccgagatcac gatccggcta actcgcccca ccgccaatcc ggcttccacg 3330721 gcggcaccgg cgtcgtcggt gtggacgtgt acggagtagc tgtcgggcgg agcagcggcg 3330781 atggccaccg actcacccaa ttccttgagt cgatcccgca actggtccgc cgctgcagca 3330841 tcacataccg ccaacagata catcacctcg aattgcgggg cggggcgttg ggtagccgtg 3330901 tcggtcggca acgcgcgcgg cgagggttcg tagaccgccc gggcaggtgc ctgcccgcag 3330961 atggtggagc gcaacgcgtc cagcagaacc agcaggcccc gtccgccggc gtccaccgcg 3331021 cccgcatcgg cgagcacgtc aagctgttcg ggggtctttt ccagcgcgat gaccgccgcg 3331081 tcaccggcgg cggtgaccgc accggccaac ccctcgtgcg cgcactggtc gacggctccg 3331141 gcggcggccc gcagcaccga gacgatagtt cccggcacct ccacgccacc catcgacgcg 3331201 acgaccaact cgacgccgcg ccacaacgcg gccccgaggg cgttggcgtc gaccgcccgc 3331261 aataccgcgc cagaggcggc ggccgcagtc gcggtcacct ctgcgatccc gcgcaggatc 3331321 tgggacagga tcacgccgga gttgccgcga gctccgttca acgcgcgccg gccgcgagag 3331381 cggccgcaac ccgcgccacg tcttcggcgt cagcctgcga attcgcgtgc aaatcagctt 3331441 ctacgaccgc ggcacgcatg gtgaacagca tgttgacgcc ggtatcggag tcagcgaccg 3331501 ggaacacatt gagccggttg atctcgtcga tgtggaggat cagatcgctg acgacggcgt 3331561 gtgcccagtc ccgcaaggcc gaggcgtcca acggccgatc cgccgtcccc actacaacac 3331621 acctcctccg caacacacct cctccgcgcc agcccgcgcc ccgagcctaa ccagacgtgg 3331681 tgacagcacg gtcacgacgc cgctctcccg gccaaggcgg gtgctgacat gtccgcgaag 3331741 ggctgatcgt tttggcgcta ccgcacaaca atggctatcc tgtgctagcc gcgggctaca 3331801 cgtaggcgtc ccggccaggt cgccggacct aagagatttg aggagcttga cgaatggccg 3331861 ctgtgtgcga tatctgcggg aaaggccccg gcttcggcaa gtcggtgtcg cactcccacc 3331921 gccgcaccag ccgccggtgg gatccgaaca tccagactgt gcacgccgtg acccgtcccg 3331981 gcggcaacaa gaagcgactc aacgtttgca catcctgcat caaggcgggc aagatcaccc 3332041 gcggctgacg cccggtaaca cctgcacgac tcagggcaac cgccaatcga tcggctcggc 3332101 acccatcccg acgagcagtt cgttggcgcg gctgaacgga cgcgagccga agaacccgcg 3332161 cgatgccgat agcggtgaag gatgcggcga ctcgatcgca acgcagttgc ccgcggccag 3332221 catcggcttc agagtcgacg cgtcacgacc ccacaggatc gccaccagcg gcgctgcgcg 3332281 cgccgccagg gcgcgaatcg cgcattccgt gaccgcttcc cagcccttgc cccggtgcga 3332341 cgccgggttg ctgggtcgca ccgtcagcac cctgttcaac agcaacacac cgcgttgcgc 3332401 ccagggcgtc agatcgccgt tcgagggcag cggatagccc aaatccgcgg tgtactcgtc 3332461 gaagatgttg gccagactgc gcggccacgg acgtacatca ggggccaccg agaagctaag 3332521 acccacagca tgtcctggag tcggataagg gtcttggcca acgataagga cacggacgtt 3332581 gtcgaacggg aaagtgaagg cgcgcaacac attcgatccg gcgggcaggt atctgcgccc 3332641 ggccgcgatc tcggcccgca agaactgccc catgtgggcc acctggtcgg ccaccggctc 3332701 gagcgcggcg gcccaccccc gctcgacgag ctcactcaac ggccgtgcgg tcactgcatc 3332761 cctttcgcgt acagacggtc accgcgtcac cctagcgaac cttgattgtc tggctcccca 3332821 aacgattgcc agcccgcgta tccagtccac tcctcgccgt cgaccagcac cctagccggc 3332881 ccgtcgagaa cccggccaat ggtgcgccac ccggccggca ccggaccgac gaaacaggcg 3332941 accagggcat gatcttcacc cccgcttagc acccacggcc aggggtcggt gcccagagcg 3333001 gttgcggccg cagtcaaagc gtcgcggtca gcggccaacg ccgcggcgga caggtcgatg 3333061 cgcacgccgg atgcctcggc gatgtgccgc agatcggcga gcagcccgtc ggagacatcg 3333121 atcatcgctt gagccccgac agccgcggcc gccgcgccgt ggccgtaggg cggctgcggc 3333181 accaaatggc ggcggcgcag ttcggcgaag tcttcaatcc cgttgcacca cagcgcatag 3333241 ccagcagccg agcggcccag ctcaccgacg acggccagca ccgagccggc cttcgccccg 3333301 gagcgcagca ccggggcacg accgtcaagg tcaccaatcg cggtgaccga caccacccac 3333361 tgccggcagc tgaccagatc gccgccgacg atgccggcac caatgcgccc cgcctcctcc 3333421 cacattccgt cgaccaacgc gctcgcctgc gccgccggcg tctcagcggg tgctccaaag 3333481 ccgaccacga acgcggtggc ccgcgccccc atcgcctcga tgtcggcggc attctgggcg 3333541 atcgccttgc ggccgacgtc ctgcggtgtc gaccagtcca gccggaagtg actatcttgc 3333601 accagcatgt ccgtcgacac cacagtgcga ccatcgccgg cagacaccag cgcggcatcg 3333661 tcgccgggcc cgagcagtac cgtggcgggt tgtcggcgcc cccgcaccag ccggtcgatc 3333721 acggcgaact cgccgagctg ctgcagcgtc ggggactccg ttgcaagtga gtgatcttta 3333781 gtggtcacgc gacttgcacc ccgtctcggg gttgttcggc agccttgggg ctgcttccct 3333841 tccgcgcttc acagccacct gccgggcgag gcccggtctt acggtcggct ccacgcttga 3333901 cggcggcccc aactgggccg acgatgctgg atgtttcctc gtagcgtgcg aggttgatgg 3333961 cagcgcagtc atcacgctga tggaccactg agcatcggtc gcattgccat tgttcgtccc 3334021 agccgatgtc ttgcacatgc cggcaggcgt ggcaggtttt cgacgacggg aaccagcggt 3334081 cggcgaccac cagcgccgac ccgtaccaga ctgtcttgta ggacaagtgc cgacgcggag 3334141 tgcccagggc cgcatccgac agtccgcgcc gacgagcgcg ggcacccggc aacccttttt 3334201 gccgcaacat ctctgtcgcg tccaagcctt cgacaacaat gcggccgtgg gtttgagcca 3334261 accgtgtcgt caggacgtgc aggtgatggg tgcggacatc gttgacccgg cgatgcaacc 3334321 gggaaatctg agtggtgcgc tcacggtagc gccgtgaacc tttcgtgcaa cgcgaacggg 3334381 cccggcacac gtggcgtagc tcgcgcagcg cggcgccgag cggtcgtggg ttctcaacct 3334441 gctcgatcgc cgtgccgtca gcggtggcga ccgtcgccag gcgccggacc ccgacatcga 3334501 caccaacccg cgaaccgggg tgcaccacct tcggctgctg cggacgctgg acaagcaccc 3334561 gcacactggc atccagacga gtgccgttgc ggcgcaccga gatcgccaat actcgcgccc 3334621 gaccggcctt gatcaggcgt tcgatacggc gggtgttctc gtgcgtgcgg acggtcccga 3334681 tgaccggcag ggtgaggtga cggcggtcgg gttccacacg catcgctccg gtcgtgaacg 3334741 acactcgatc ctggtcgcgg cctttgcgtt tgaaacgggg aaacccgacc cgtttaccgg 3334801 cgcgtttgcc ggcgcgggag gtctgccagt tccagtacgc ctcgaccgca cccgcgatgc 3334861 catcggcgta ggcctctttt gagcattcag gccaccacgc gacaccggtc tcggtgttga 3334921 cgcacacgtc gtccttgacg gtgttccagc gtttgcgcag cacgcgcagc gacggtttcg 3334981 ctgtcacggt cccgctggca tgccacgcct ggatgtcggc tttcagggtg gccacggtcc 3335041 agttgtatgc cttgcgacga gcaccgaaat gccgtgccag cgccttggcc tggtcctcgg 3335101 tcgggtccag cgtgaaccga aacgcttgga ccgtccagcc atcgggaacc tcgaacttgg 3335161 gcatcaggcg gcctcatggt cctcgccagc agcggccgcc aatgcgcgct tggttcgatt 3335221 ctcggcagcc cgtttgccat acagacgggc gcacatcgac gtcaggatct cggtcatatc 3335281 ccgcaccagg tcgtcatcga cctcggcaga gtcgactacc accagttcgc ggccctgcgc 3335341 cgcaaacgct gcctgcacgt acttcgagcc caaccggcag aaccgatccc ggtgctccac 3335401 cacgatccgg tggactgacg ggtcgcgcag cagtgaaagg aacttacggc ggtgctcgtt 3335461 gaacgcggaa ccgacctcgg tcacgacctt gtcgactggc atctgttggg ccgtggccca 3335521 cgcggtcacc cgcgcgacct gccgatccag atcggctttc tgatcggccg acgacacccg 3335581 tgcatacacc gcggtcggtg atcgcatgcc agcgtcccca gccggttcgt cgacgagaat 3335641 cagtcggccc actcgcctcg ccatcaccga caacagacca gcacgaaacc agcggtaggc 3335701 ggtccccgga gcaacaccgt tgcgctccgc ccacgtcgcc aggttcatat ctctgttcct 3335761 accgcacgcc actgacaact accgaccact caacccgcaa cagctggcac cccccgatgc 3335821 gtcgtcgccc acgccgcctc cttcggcccg ttctggccct gtggaccttc gaacacctcg 3335881 cccgacctgc ggtaagttga gtcactgccg gcgcgagcgg accgcgccag tgtatgagag 3335941 caaagaggtg gccgcgcagg tgacaggcga gtccgacggg ccgccgcgcg ccgtgctgat 3336001 cgccgcggcg gcgctggcgg cggcggtgat cggggtaatc ctggttgtcg cggcgaaccg 3336061 ccagccgccg gagcgaccgg ttgtcattcc ggccgtgccc gctccgcagg ccaccggtcc 3336121 cggctgcaaa gcactgctgg cggcgctgcc tcaacgactc ggcgagtatc ggcgcgcgcc 3336181 cgtcgcggag ccgaccactg cgggtgccac ggcctggcga acggggccaa acagcacacc 3336241 ggtgattttg cgctgtggac tcgaccgccc ggccgagttc gtggtgggtt cggccatcca 3336301 agtcgtcgat cgggtgcagt ggtttcaggt ggccgcgcaa aacccggacg agccaggccg 3336361 gtccacctgg tacaccgtgg accggccggt gtatgtggcg ctgacactcc cctcgggatc 3336421 ggggcccacc gcgatccagg aattgtcaga cgttatcgac cacaccatcc ccgcggtacc 3336481 catcgacccg gcgccggctc gctagtgccg atcgcaagcg cggcgcttgc gccgggcgcg 3336541 gcgggtcggc accatcgggc taagtgccga tcgcaagcgc ggcgcttgcg ccgggcgcgg 3336601 cgggtcggca ccatcgggct aagtgccgat cgcaagcgcg gcgcttgcgc cgggcgcggc 3336661 gggtcggcac catcgggcta agtgccgatc gcaagcgcgg cgctagcgcc gggcgcggcg 3336721 ggtcggcacc atcgggctaa gtgccgatcg caagcgcggc gctagcgccg ggcgcggcgg 3336781 gtcggcacca tcgggctagt gcaggcccac gccgcgggcc aatgtcgtct cgatcatcgt 3336841 cgccagcagg gtcggatagt cgacaccgct ggccgcccac atccgcgggt acatcgagat 3336901 cgtggtgaat cccggcatcg tgttgatctc gttgatcacc ggaccgtcgt cggtgaggaa 3336961 gaagtccacc ctggccagac cccggcagtc gatagccgcg aacgcccgga tcgccagctg 3337021 acgaatcgcc tctgcgacct ggtcatcgac cttggcgggc acgtccaatt cggctgcgtc 3337081 gtcgagatac ttggttgcga agtcgtagaa agagtcctcg cgtccccgca ccccggccac 3337141 ccggatctcc cccagcgtgc tggcttccag tgtgccgtcc ggcatttcga gcacaccgca 3337201 ttccagctcg cggccgctga tcgcggcctc gacgatgacc ttagggtcat gccggcgggc 3337261 ccgcgcgacc gcggcgggca gttgatccca actcgacacc cggctaacac cgatcgacga 3337321 gccgcctcgg gcgggtttga cgaacaccgg taagcccagc cgttcgcact cctggcggtg 3337381 cagtgtcgac cgcggcggac gcagcaccgc gtacgcaccc accggaagtc catcggcggc 3337441 gagcagcttc ttggtgaact ccttgtccat gccgacggca ctggccagca caccggcgcc 3337501 cacgtagggc accccggcga gttcgagcag tccctggatc gtgccgtcct cgccgtacgg 3337561 gccgtgcagt accgggaaca ccacgtcgac cgactccaga acctcgccgg ccccgggcgg 3337621 cagcgacacc aactggccac cacgccgcgg atcggccggc agcgccagct cggtgcccga 3337681 tcctgatttg acctgaggaa gctcccggtt ggtgatcgtc agggcgtcgg ggttggcgtc 3337741 ggtgagcacc cacgaacctg ccggggtgat acccaccgcg atcacgtcga accgccgcga 3337801 gtccaggttg cgcaggatgc tgccggcgga cacacacgag atggcgtgct cgttgctgcg 3337861 cccgccgaac acgacggcaa cgcggacacg ccgatcacgc cggtcgttag cactcacaac 3337921 ctgcagaggc taccgggtca ggcagacggg ctcccacgag ctgcagtttt cggtcgtgcc 3337981 ggcccgtgcg aggctcattc gggcttggtg cggcgaccca gcagcagcgt tatcgcctcg 3338041 tccaccgaca gccctttatg acagacccga tgcaccgcgt cggtgagtgg catttcgacg 3338101 tcgtagctgg acgccagcgc gagcacggat tcgcacgacg tcacgccttc gacgacatga 3338161 caagccttgc ccgccgactg caacgtttcg ccccggccca ggcgttcgcc aaacgatcgg 3338221 ttgcgcgaac gcggtgaggt gcaggtggcc accagatcac cgacccctgc cagaccggcc 3338281 aacgtcgcgc cgttggcgcc gagcgccgtc ccgagccgga tgatctccgc caggccccgg 3338341 gtgatgatcg cggccgcggt gttttcgccc agcccgatgc ccaccgccat tccgcacgca 3338401 agcgcgatga tgttcttgca cgccccgccg atctcggtgc cgacgacatc ggcgttggtg 3338461 taggggcgga agtacccgct gttcagcgcg cgctgcaagg caaccgcgcg gccggagtcg 3338521 ctgcacgcga cgacggtagc ggcgggctgg cattcggcga tctcgctggc caggttgggt 3338581 ccagagatca ccgcgacctg cggcggctcg gcaccggtca ccgagatgat gacctggctc 3338641 atccgcatca gggtgcccaa ctcgatgccc ttggccagac tgaccaaggt cgcaccctcg 3338701 ggcaacaggg gagcccaccg ctcgagattg gcccgcatgg tctgcgcggg cactcccaac 3338761 agcaccgtgg atgcgccccc aagtgcctcc tcggcatctg cggtggcatg aatgctcggt 3338821 ggtaacagcg caccgggcag atagtcgggg ttatatcggg tggtattgat ctgatcggcc 3338881 acctcagctc gccgcgccca cagcgtgacc tctccgcccg cgtcggccag caccttagcc 3338941 agggccgtgc cccatgcacc ggcgcccatc accgcgacgg tgcttgctat tccggccatc 3339001 cacacacact aatctgcgcc gcggttgccg tcgggaccgt gcctgggccc cggccacgac 3339061 cgtggcggca atgccgtcga agtgtgccgc gtggatcgac gctggcagga tgacttcatg 3339121 agcggcacac cggacgacgg cgatatcggc ttgatcatcg ccgtcaagcg cttggccgcg 3339181 gccaaaacca ggctggcccc ggtgttctcg gcgcagactc gcgagaacgt ggtgctggcc 3339241 atgctcgtcg acacgttgac cgccgcggcg ggtgtcggtt cactgcgctc gatcactgtt 3339301 atcacccccg acgaagccgc ggcggctgcg gcggccgggc tgggcgccga tgtactggcc 3339361 gacccgacac ccgaagacga tcccgaccca ctgaacaccg ccatcaccgc tgccgaacgc 3339421 gtggttgccg aaggggcctc caacatcgtt gtgctgcaag gcgatttgcc ggcattacag 3339481 acacaggaac tcgccgaggc aatctcggcc gcacgccacc atcggcgcag cttcgtcgcc 3339541 gaccggcttg ggaccggcac cgcggtactg tgtgcgttcg gcaccgcgct gcacccgcgg 3339601 ttcgggccgg attcgtccgc gcggcaccgc cgttcgggcg ctgtcgagct gacaggagcc 3339661 tggccgggcc tgcgctgcga tgtcgacacc cccgccgacc tgacggccgc acgccagctc 3339721 ggggtagggc ccgcgaccgc gcgagcggtc gcacatcgtt gaccgggacg gggcaacgcc 3339781 ggcgaggcat ccagggggtg aacggcagac caacggcgaa cggatgcctg ccgagtgctg 3339841 gcaaccccac ccaatgatga gcaatgatcg caaggtgacc gaaatcgaaa acagtcccgt 3339901 cacagaggtg cggccagagg agcatgcgtg gtatccagac gactcggcgc tggcggcacc 3339961 gcccgctgcc acccccgccg cgattagcga ccagctaccc tcggatcgct acctgaaccg 3340021 ggagctgagt tggctggact tcaacgcgcg cgtgcttgcc ctggccgccg ataagtcgat 3340081 gccattgctc gagcgcgcca agtttctggc aatcttcgcg tccaatctcg acgagttcta 3340141 catggtccgg gtggccggcc tcaaacgccg cgacgagatg gggttgtcgg tgcgctccgc 3340201 cgacggtcta acaccgcgcg aacaactagg ccggatcggc gagcagactc aacagctcgc 3340261 cagccggcat gcccgggtgt tcctcgattc ggtgctaccc gcgctcggcg aggaaggcat 3340321 ctacatcgtc acctgggccg atttggatca ggctgagcgc gaccgattgt cgacctattt 3340381 caacgaacag gtcttccccg tcctgacccc gctggccgtc gatcccgccc acccgttccc 3340441 gtttgtcagc gggttgagct tgaacctggc ggtcacggta cgccaacctg aagacggcac 3340501 ccagcatttc gcgagggtca aggtgcccga caacgtcgac cgcttcgtcg aactcgctgc 3340561 acgtgaggcc agcgaggaag ctgcggggac cgaaggccgg accgcgctgc ggttcctgcc 3340621 gatggaggag ctgatcgcgg ccttccttcc ggtgcttttc ccgggtatgg aaatcgtcga 3340681 gcaccacgca tttcgcatca ctcgcaacgc tgacttcgag gttgaagagg atcgcgacga 3340741 ggacctactg caggcgctcg agcgagaact ggcccgccgc cggttcggtt caccggtgcg 3340801 actcgagatc gcagacgaca tgaccgagag catgctggag ttgctgcttc gcgaactcga 3340861 cgtgcatccc ggtgatgtca tcgaagtgcc cgggctgctc gacctatcgt cgttgtggca 3340921 gatctacgcc gtggaccgcc cgacgcttaa ggatcggaca ttcgtcccag ctacccatcc 3340981 cgccttcgcc gagcgggaaa cacccaaaag catcttcgcg acgctgcgcg aaggcgatgt 3341041 gctggttcac catccgtatg actcgttctc caccagcgtg cagcgattca tcgaacaggc 3341101 cgcggccgac cccaacgtgc tggcgatcaa acagacgctg taccgcacct ccggcgactc 3341161 gccgatcgtc cgggcgctga tcgacgccgc cgaagccgga aagcaagtgg tggcactggt 3341221 cgagatcaag gcacgcttcg acgaacaggc caacatcgcc tgggcgcgcg cactagaaca 3341281 agccggcgtg catgtggcgt acgggctcgt cgggctcaag acgcactgca agaccgcctt 3341341 ggtggtgcgc cgcgaaggtc cgacaatccg gcggtactgc catgtcggca ccggcaatta 3341401 caacagcaag acagcacgac tctacgagga cgtcggactg ctgaccgctg cacccgatat 3341461 cggcgccgac ttgaccgact tgttcaattc gctcaccggc tactcacgca agttgtccta 3341521 ccgcaacttg ttggtggccc cgcacggaat ccgcgccggc atcattgacc gcgtcgagcg 3341581 ggaggtcgcg gcgcaccgtg cagagggtgc ccacaacggc aaaggccgca tccgactcaa 3341641 gatgaatgcc cttgttgatg agcaggtcat cgatgcgctg taccgcgcgt cgcgagccgg 3341701 tgtgcggatc gaggtggtgg tacgcggcat ctgcgcgctg cgtccaggtg cgcagggcat 3341761 ttcggaaaac atcatcgtgc gctcgattct cggccgcttc ctcgagcact cgcggatcct 3341821 ccatttccgt gccatcgacg agttctggat cggcagcgcc gacatgatgc accgcaacct 3341881 cgaccggcga gtcgaggtta tggctcaagt caaaaacccg aggctgaccg cgcagctgga 3341941 cgaattgttc gaatccgcac tggacccgtg cacccggtgc tgggagctcg ggcccgacgg 3342001 gcagtggacc gcgtcgccgc aagaaggcca tagcgtgcgc gaccatcagg aatcgctgat 3342061 ggaacggcac cgcagcccct gacactgcgt ggtgattccc gctgctgcac cgaccacatc 3342121 cacgaccgcg agcagcctgg ccgaattgac ctgcaggagt tgaggtgtcg atccagaact 3342181 cgtccgcccg ccggcgctcg gcgggccgga ttgtgtacgc cgccggtgcg gtgctctggc 3342241 gacccggcag tgccgattcg gaagggccgg tcgagatcgc tgtcattcac cgcccccgtt 3342301 acgacgactg gtcgctgccc aagggcaaag tggatccggg cgagaccgca ccggtggggg 3342361 cggtgcggga gatactcgag gagaccggtc accgcgccaa cctgggtagg cggctcctga 3342421 cggtgaccta cccgaccgac tccccttttc gaggcgtcaa gaaggtgcac tactgggcag 3342481 cgcgcagcac cggtggggaa ttcacccccg gcagtgaggt cgacgagctg atctggttac 3342541 cggttcccga cgcgatgaac aagcttgact acgcccagga tcgaaaagtc ctgtgccggt 3342601 tcgctaaaca cccggcggac actcagacgg tgctggtggt gcggcatggc accgcgggca 3342661 gcaaagcgca cttctccggg gacgacagca agcgaccgct agacaagagg ggtcgtgcgc 3342721 aggcagaagc gttggtacca cagctgctgg cgttcggcgc caccgatgtt tatgccgccg 3342781 accgggtgcg ctgccaccag acgatggagc cactcgccgc ggaactgaac gtgaccatac 3342841 acaacgagcc caccctgacc gaagagtcct acgccaacaa ccccaaacgc ggccgacacc 3342901 gagtgctgca gatcgtcgag caagtaggca cacccgtgat ctgcacgcag ggcaaggtca 3342961 ttcccgatct gatcacgtgg tggtgcgagc gcgacggtgt gcaccccgac aagtcccgca 3343021 atcgcaaagg cagcacgtgg gtgttgtcgt tgtcagccgg caggcttgtg acagccgacc 3343081 acatcggcgg tgcgctggcc gccaacgtgc gggcctaaca cacggatacc cttcgtcaca 3343141 ttgccaccgt gcaaagggta tccgtgtgtc ttgacctatt tgcgaccccg ccgagcggtt 3343201 gccttcttgg cgggagcctt ggtagccggc cgcttggccg ctgccttctt tgccggcgcc 3343261 ttggtcgccg ccttacgcac cgatgccttg accgcggtct tcttcaccgc cttggtcacc 3343321 ttcttggcgg gtgacttcgt ggccttgaca gctttcttgg cgggcgcctt ggtcgccgct 3343381 ttcttggcgg gcgccttggt cgccgccttc ctggcgggcg ccttggtcgc cgccttcttg 3343441 gcggcctttg tcgccttctt ggcaggtgcc ttcttcgcta ccttcttggc tgcactggcc 3343501 cccacaccac gcttaacagc gggtccttct gccgggagac gctgcgcgcc agacacaacc 3343561 gctttgaatt gcgcgcccgg gcggaacgcc ggcaccgacg tcggcttcac ctttactgtc 3343621 tcgccggtac gcggattgcg ggccactcga gccgcgcggc gacgctgttc gaacacaccg 3343681 aacccggtaa tggtgacgct gtcgcctttg tgtaccgcac gcacaatcgt gtcaacgaca 3343741 ttctcgacgg cggcggtcgc ctgccgacgg tccgagccca atttctgtgt gagcacgtca 3343801 atgagctctg ctttgttcat cccaaccctc cgaaaccagt ggtcctcgtt tggaaccgac 3343861 tagtggacac ggtaaaccct tacccggctg atttccaaga gccacgcgca atttcactga 3343921 gccaacgacc ggtttttcgc aatccggttg ccgcccttga ccggtggcgc ggccccaaaa 3343981 tggctcaggt tctgccggcg ggtcacgctg aaatttcgcc cggttctacg cctcaggggg 3344041 cgggtagagt gcgcggtttc cagtacgcgc acgcaccctc aaaggcctcg atctcgtcga 3344101 gtttccgcag cgtaagggct atatcgtcga gaccttcaag cagccgccac gccgagtggt 3344161 cgtcaatctt gaacggcagc accactgttg ctgcggtgat aattcgatct tgaagattgg 3344221 cagtgatttc caggcccgga ctctgctcaa tgagcttcca caggagttcc acatcgtctt 3344281 gggcaacctc ggccgccagc agcccggcct tgcccgcgtt gccgcggaaa atgtcaccaa 3344341 atcgggatga gataaccacc cggaatccgt agtccatgag cgcccagacc gcatgctctc 3344401 gcgaggatcc ggtgccgaaa tcgggcccgg caaccaggac cgaaccccgg tcaaagggac 3344461 tgaggtttag cacgaatgca ggatccgacc gccaacccgc gaacaagccg tcctcgaaac 3344521 cggttcgggt gacccgcttc agaaagaccg cgggaatgat ctgatcggtg tcgacattgg 3344581 accgccgcaa cggcacgcca ataccagagt gggtgtgaaa ggcttccatg ctgatcccct 3344641 agctgttctc agttcaattc aaatcggccg ggctggacag tgtgccgcga accgcggtgg 3344701 cggccgccac tgctggggac accaaatgtg tgcggccgcc cgcgccctgc cgcccttcga 3344761 agttgcggtt ggacgtcgcg gcgcagcgct ccccggacgc cagctgatcg ggattcatgc 3344821 ccagacacat cgagcatccc gcctgccgcc attgcgcgcc cgcgtcggtg aagatctcac 3344881 cgagcccttc ggcctcggcc tgcgcgcgta cccgcattga gcccggaacg atcagcatcc 3344941 gcacgccgtc ggccaccttg cggccacgca gcacttcggc gaccacccgc agatcttcaa 3345001 tgcgaccgtt ggtacacgac ccgacgaaca cggcgtcgac cgcgatgtcg cgcatcgcgg 3345061 ttccgggtcg aaggtccatg tacgccaatg ctttctcggc ggcctgccgc tcggcgtcgt 3345121 cggtcatcag ttgcggatct ggcaccgcgg ccgccagcgg taccccttgg cctgggttgg 3345181 tgccccaggt gacaaacggg ctcaacgacg cggcgtcgag atacacctcg gtgtcgaaaa 3345241 cggcgccgac gtcggtgcga agccgttgcc agtagacgag tgcggtgtcc cactgggcac 3345301 cggtgggtgc gtgcggacga ccacgcaaga acgcgtaggt ggtttcgtcc ggagccacca 3345361 tgcccgcacg agcgccggct tcgatgctca tgttgcagat cgtcatccgg ccttccatgg 3345421 acagcgattc gatggcgctg ccccggtatt cgatgacatg cccctggccg ccgccggtgc 3345481 cgatcttggc gatcaacgcc aggatgatgt ccttggccga cacaccgtcg ggcagccgcc 3345541 catcgacgtt gaccgccatg gtcttgaacg gccgcagcgg cagcgtctgg gtggccagca 3345601 cgtgctcgac ctccgacgta ccgatgccca tcgccaacgc gccgaatgcg ccgtgggttg 3345661 aggtgtggct atcgccacag acgatcgtca ttcccggctg ggtgagaccc aattgcggtc 3345721 cgacgacgtg cacgatgccc tgctcgatat cgcccattga atgcagccgg attccgaatt 3345781 cggcgcagtt tcggcgcaac gtctccacct gggtgcgtga caccgggtcg gcgatcggct 3345841 ggtcgatgtc gacggtgggc acgttgtgat cctcggtggc gagggtgagc tcgggccgcc 3345901 gcacccggcg cccggccagg cgcaggccgt cgaacgcctg cgggctggtg acctcatgca 3345961 ccagatgcag atcgatgtag atcaagtcgg gcgcacagcc cccgcctgat accacaatgt 3346021 ggtcgtccca aatcttctcg gccagtgtgc gtggctcgcc ggtctgcaag gccatctcga 3346081 agtgcctcta ttcattcgtt cgcgactcgc tggtcatctc aaaatacgag acgctatgat 3346141 ctctttgtga gacagcatag cggtatcggt gtcctcgaca aagccgttgg cgtgctgcac 3346201 gcggtcgcgg aatctccctg cggactggcc gaactctgcg atcgaaccga cctgcccagg 3346261 gccaccgcat accggctggc ggccgcgctg gaggtgcatc gcctgctggg gcgcggccag 3346321 gatggccact ggcggctcgg tccggccatc accgaactcg cgacccatgt cgacgatcca 3346381 ctgctggtgg cgtgcgcggc ggtactgcct cagctgcgcg acgccaccgg cgaaagcgtg 3346441 caggtatatc gccgcgaggg aacgtcgcgg gtctgcgtgg ccgcattgga accagctgcg 3346501 ggccttcgcg atacggtccc ggtcggggca cggttgccga tgaccgcggg ctcgggcgcc 3346561 aaagtgttgc tggcccacac cgacgccgcc acccaagcgg ccgtattgcc aaaggcggtg 3346621 ttcagcgccc gagcgctggc cgaggtgtgc cggcgcggct gggcgcaaag cgtggccgaa 3346681 cgcgagcctg gcgtggcgag cgtgtcggcg ccggtgcgcg acggccgggg cgtcgtgatc 3346741 gctgccatct cggtgtccgg cccgatcgac cggatgggcc gccgcccggg ggtccgatgg 3346801 gccgccgacc tgctgtccgc ggcggacgcg ctcacccgac ggctctagcc gcgttgtgct 3346861 acatcggttc gaccgcgatc acatagtcat tgccgtgcca cagaccgtct tgccgctcgt 3346921 tgagctgcaa tgcccgagcg cgcagttctt ccacataggc acgcattgcc atgccaagcc 3346981 cgttggagga gaatcgctcg attcgggcca agcacatgtt gagttggccg ttcacgtagc 3347041 gagctcggta gcggatgggg aagcggcgcg cttcaagaat gcgaaagccc gcaaggccca 3347101 gtcgccccag catccagtcc agcgggaact ctcggtacgg tcgttcgccg gcaagcaaca 3347161 ggcaggcgtc gcgcacgcga ccgatttccc agatgatttt gccactttcg gtttccggct 3347221 cgaattgcac gtagggctcc aagccgacta ggtaaagacg accatgatcg gcgagatgcg 3347281 ggcgcaaccg ctcgaacacg cggtcctgcc agtacggggc gaagccttcg atggccccga 3347341 ccaggtagtc gaccaagatg gtgtcgaacg tctcgccggc aagaaggctg tcgtctaccc 3347401 agttgccgac gagcaggcgg tcctgcgggc gcatggcgct acccaacgcg gcgcgggtct 3347461 tgtccgccag gctgcgggcg gccgtgaccg ccgtccagcg ctcggtcggc aaagtctgta 3347521 tccactgaag cgatttcaca ccggtaccgg catccaagac agtgccccag ggtctttcgc 3347581 cgtgcacgcc ttcgatgtag cggaacaagg atgagatccc ggccctcagt atgtacgagc 3347641 gaccgtggcg ggcgtgtagg tcttcgatgt ggcggatcag ggctgcgatc ttgggcattt 3347701 cggcccaggt cacacacatc gcagacgtcc atgcggccgg ttcggccgag cgcggtatcg 3347761 cggcgccggc ttcagaccct gccaaccgag cgatcgtcgt gggtgcttcc tcggagtaac 3347821 cactgtgatg tcttcctcac ggctgaagct ggcggactac cgatgaaccg acccaccgaa 3347881 actctatagc aaacgatatt cattttcaaa ctaggcaccg cgagcgtcac tggggtggcg 3347941 acgacgcgct accggcggag ccttgctgac acactgacgc catgggaacc aaacagcgcg 3348001 ccgacatcgt catgtccgag gctgaaatcg ccgacttcgt caactcgagc cgtaccggaa 3348061 cgctggccac catcggaccc gacggccagc cgcacttgac ggcgatgtgg tatgccgtga 3348121 tcgacggcga aatctggctg gagaccaagg ccaagtcgca gaaggccgtc aacctccgac 3348181 gggatccgcg ggtgagcttc ctgcttgaag acggcgacac ctacgacacg ctgcgcggcg 3348241 tgtcgttcga gggcgttgcc gagatcgtcg aggagcccga ggcgctgcac cgcgtcgggg 3348301 tcagcgtgtg ggaacgctac accggcccct acaccgacga gtgcaaaccg atggtcgacc 3348361 agatgatgaa caagcgggtc ggtgtgcgca tcgtggcccg tcggacccgc tcgtgggatc 3348421 accgcaagct ggggctgcca cacatgtcgg tgggtggctc gaccgccccg tagctgcccg 3348481 gcgagcagac gcaaaatcgc ccatttcgag acgaaattgg gcgattttgc gtctgctcgg 3348541 cagttgtagc cccgatggga ttcgaaccca cgctaccgcc gtgagagggc ggcgtcctag 3348601 gccgctagac gacggggccg gaaccgatcc gagctgccag catagctcac gccttgtgct 3348661 ggggtaccag gactcgaacc tagaatggct gaaccagaat cagctgtgtt gccaattaca 3348721 ccatacccca tgggctgcct aaaaccgctg ccgccagctg ttatgggccg acgtgcagac 3348781 taccaaagat tcgccacaca aggctcacgc gtgcccgacc agctggcgcg ccgcgcgcag 3348841 ccgctgcatg ctgcggtcac gaccgagcag ctccagcgat tcaaacaacg gcgggctgac 3348901 ggtcgtgccg gtggcggcca cccggatggg gctgaacgcc ttgcggggtt tgagcgccaa 3348961 accttcgatc aaggcgtcct taagggccgc ctcgatcagg ggtgccgtcc agtccgtcac 3349021 acttgtcagc gcggccaggg ccgcgtcgag caccgcggcc ccgtctgggc ctagctcctt 3349081 ggccgcggcc ttgggatcga tcacatactg atcgtcgttg aagaacttca acagctccca 3349141 cgcgtcaccg agcaccacga tgcgggtctg caccaactcg gcggcggcgg cgaatgccgc 3349201 ctcatccaac gcgatgtgat ggccgtgggt atccagatgg tcgcgcagcc tgaccgtgaa 3349261 gtcgcccacg tcgagcatcc ggatgtgctc ggcgttcagc gcgtcggcct tcttctggtc 3349321 gaaccgggcc gggctggagt tgacgtcggc aacgtcgaac gcggccacca tctcgtcgag 3349381 accgaacagg tcgtggtcgt cggctatgga ccagccgagc aacgcgaggt agttcagcag 3349441 gccttcgggg atgaacccgc ggtcgcggtg ggcaaacagg ttcgactgcg gatcgcgctt 3349501 cgagagcttc ttggtgccct cccccaagac cgttgggagg tgcgcgaatt tcggaatccg 3349561 ctcagctacc ccgatcctga tcaacgcctg atgtagcgcc agctggcgcg gcgtcgacgg 3349621 cagcaggtcc tcgccacgca acacatgggt gatcttcatc agcgcgtcgt cgcacgggtt 3349681 gaccaaggtg tataacggat caccgctggc tcgggtcaac gcgaagtcgg gtacggagcc 3349741 agccgcgaac gtcacgggcc cgcgcaccag gtcattccaa gcgaggtcgt catcgggcat 3349801 ccgcagccgc accaccggct ggcggccctc cgccaggtac gccgcacgct gcgcgtcggt 3349861 caagtgacga tcgaaattgt cgtaacccag cttgggattg cgcccggccg cgacatgacg 3349921 ggcctccact tcctcgggtg tggagaaagc gtggtaggcc tcgcccgcgg cgagcagtcg 3349981 ggcgagcacg tcacggtaga tttcggcgcg ctgcgactgc cggtacggcc cgtacggccc 3350041 acccacctcg ggcccctcat cccaatccag gccaagccag cgcagcgcgt ccagcagcgc 3350101 cagatagctt tcctcgctgt cgcgttgggc gtcggtgtcc tcgatgcgga acacgaaggt 3350161 gccaccggtg tgccgggcgt aggcccagtt gaacagcgcg gtgcggacca gaccgacgtg 3350221 cggagttccg gtgggtgaag ggcagaatcg gacccggact gtttccgtgg cggtcacggc 3350281 tttcctttgc ggactacggg attggtgagg gtgccgattc cctcgatggt gatcgagacg 3350341 gtgtcgccgt cctcgatggg accgactccc gcgggtgtgc cggtgaggat gagatcacct 3350401 ggcagcaagg tcattatcgc cgagatccat tccacgatgg cgccgatgtc atggatcatc 3350461 agcgaggtgc gggcgtgctg tttgacgtcg ccgttgacga cggtgcgcag ctcgagatcg 3350521 gccgggtcaa agggagcgag gtcggtgacg atccacggcc cgaccgggca gaaggtgtcg 3350581 tgccccttgg ctcgcgtcca ctgaccgtcg gattgctgct gatcgcgggc cgacacgtca 3350641 ttgccgatgg tgtagccgag gatattgtcg acggcctggg cggccgggac atccttgcac 3350701 gcccggccga tcacgatcgc cagctcaccc tcgaagtgca ccggtgatgc gttggcgggc 3350761 aatcgaattg gcgtattcgg accgatgatc gcggtgttgg gcttgaggaa tatcaccggg 3350821 tctgccggcg gccggccacc catttcggcg atgtgatcgg catagttctt cccgacacag 3350881 accaccttgc tcgccagtat cggagccagc aggcgaacgt cggccagcgg ccaggagcgt 3350941 ccggtgaagg tcggcgtacc gaacgggtgc tcggcgatct cgcgggccgt catctcactc 3351001 ggctcgccca gctcgccgtc gatgctggca aaagcgacac cgtccgggct ggcgattcga 3351061 ccgatacgca tttggatgag cttagccggg ccctgccggg cgacgattcg ggccggcacg 3351121 gcccgatgag gagcccggca atcagaccct gccgggcgac gattcgggcc ggcacggccc 3351181 gatgaggagc ccggcaatca gaccctgccg ggcgctgcgg gccctcacca tcgggccccg 3351241 tgccgggtga ctgtgccagc atgggtggat gtcgcgagat ccgactgggg tgggtgcgcg 3351301 ctgggcgatc atgatcgtct cgctgggggt gaccgcaagc tcgtttctct tcatcaacgg 3351361 tgtcgcgttc ttgatccccc ggctggaaaa tgcgcgcgga accccgctat ctcacgcggg 3351421 tctgttggcg tcgatgccca gctggggcct ggtggtcacg atgttcgcct ggggctatct 3351481 gctcgatcac gtcggcgaac ggatggtgat ggccgtgggc tcggcgctga ccgccgcggc 3351541 cgcctacgcc gcggcatcgg ttcattcgct gctgtggatc ggtgtcttcc tgtttctcgg 3351601 cggcatggcc gccggtggtt gcaacagcgc cggcgggcgg ctggtctcgg gttggttccc 3351661 gccccagcaa cgcggtctgg ccatgggaat ccgccagacc gcacaacctt tgggcatcgc 3351721 ctccggcgcg ttggtgatac ccgaactggc cgaacgcggg gtgcacgcag ggctgatgtt 3351781 tcccgccgtc gtgtgcacgt tggccgcggt ggccagcgtg ctcggtatcg tcgacccacc 3351841 gcgaaaatcc cgcacgaaag cctccgaaca ggagctggcc agcccttatc ggggatcgtc 3351901 gatcctgtgg cggatacacg cggcgtcggc gttgctgatg atgccgcaga cggtgaccgt 3351961 gacgttcatg ttggtctggc tgatcaacca ccacggctgg tcggtcgcgc aggccggtgt 3352021 cttggtgacc atatcgcagc tgctgggggc gctgggccgg gtcgcggtcg gccgctggtc 3352081 ggaccatgtc gggtcacgca tgcgtcccgt ccgcctgatc gccgctgccg ccgcggcgac 3352141 gttgtttctg ctcgcggcgg tcgataacga gggctcgaga tatgacgtgc tgctcatgat 3352201 cgccatctcg gtgatcgccg ttctggacaa cgggctagaa gccaccgcga tcaccgagta 3352261 cgccggaccg tactggagtg gccgggcgct gggtatccag aacactacgc agcggctgat 3352321 ggcggccgcc ggacccccac tgttcggtag tttgatcacc acggcggcct acccgacggc 3352381 atgggcctta tgcggtgtgt tcccgctggc cgcggtgccg ctggtgccgg ttcggctgct 3352441 cccacccggc ttggagacta gagcgcggcg gcaatccgtt cgccgacatc gctggtggca 3352501 agccgttcgc tgccacgcgt ggccaaatgg gcctcgacgg cccggtccac ccgggcagcc 3352561 gcgtcgtgtt cgccaaggtg ggacagcaat aacgccaccg acatgatcgc cgccgtcggg 3352621 tcggcgatgc cctgaccggc gatgtccggc gcgctgccat gcaccggctc gaacatcgac 3352681 gggttggccc gggtcgcgtc gatattccca ctggccgcca agccgatacc gccacatacc 3352741 gccgcggcca gatcggtgat gatgtcgccg aacaggttgt cggtgacgat cacgtcgaag 3352801 cgacccgggt cggtgatcat gtggatggtg gcggcgtcga cgtgctggta ggccacctcg 3352861 acgtccgggt agcattcgcc gacctcgtcg acggtccgca accacaatcc cccggcgaag 3352921 gtcaacacgt tcgttttgtg caccaatgtc agatgcttgc gacgccgtcg agcccgctcg 3352981 aacgcgtcgg caaccacacg ccgcacaccg aacgcggtgt tcacgctgac ttcggtggcc 3353041 acctcgttgg gcgtgccgac gcgaatcgcc ccgccgttgc cggtgtaggg tccctcggtg 3353101 ccctcgcgca ccaccacgaa gtcgatgccg ggattgccgg acagcgggct ggccaccccc 3353161 ggatacagcc gggccggacg caggttgatg tggtgatcca gctcgaagcg cagtcgcagc 3353221 aacagaccgc gctccaagac gccgcttggc accgacgggt caccgatcgc cccgagcagg 3353281 atcgcgtcgt ggttgcgcag ctcggccacc accgagtccg gcagcacctc gccggtggca 3353341 tgaaagcgcc gcgcacccag gtcatagctg gttttctgga cgcccggcac aaccgcgtcg 3353401 agcactttga ccgcctcggc ggttacctcg ggcccgatcc cgtcaccggc aatgatcgcg 3353461 agtttcatcg gcgtggaagg gctcacgaca gatcgacaac ctcgagcttg taggcgtcca 3353521 ccgccgccgc gatcgccgtc cgcacgtcgt cgggcacgtc ttggtccagc cgcagcagaa 3353581 tcgtcgcgcc cgggccttcg gcgtcttcgg agagctgcgc ggcctggata ttcaccccgg 3353641 ccgtccccag caacgtgccg atcttgccca gcgctcccgg ccggtcgacg tagtggatga 3353701 tcaggttgat cccctgggcg cgcagatcaa agtggcggcc gttgatctgc acgatcttct 3353761 gcgacagctg tgggccatac agcgtgcccg agacggtcac caccgaaccg tccgcgccga 3353821 ccgcgcgaac gtcgacgacg ctgcggtggt tggggctttc cgaggcctta cagatctcgg 3353881 cggtgacgcc acgttcggcg gccaatgccg gtgcgttgac aaatgtcacc gcatcctcga 3353941 tcaccgccga gaacaggccg cgcagcgccg aaaggcgcag cacctcaacc tcttcggcgg 3354001 ccagctcacc gcgcacctgc accgacaacg acaccggcag ttcgtcggac aacacacccg 3354061 ccagcacgcc gagcttacgc accagatcca gccagggcgc cacctcctcg ttgaccactc 3354121 cgccgccgac gttgaccgcg tcgggcacga attcccctgc cagggccagc cgcacgctct 3354181 cggcgacgtc ggtgcccgcc cggtcctgcg cctccgcggt ggacgcaccc agatgcggtg 3354241 tgaccaccac ctgtgccagc tcgaacagcg ggctgtcggt gcacggttcg gtggcgaaca 3354301 cgtccagacc ggccgcccgc acgtggccgc cggtgatcgc gtcggccagt gccgcctcgt 3354361 ccaccaggcc gccgcgcgcg gcgttgacga tgatgacgcc cggcttggtc ttcgccagcg 3354421 cctccttgtc gatcagtccc gccgtctccg gtgttttcgg taggtgcacc gagatgaaat 3354481 cggcgcgggc cagcaggtcg tccagggaca gcagttcgat gcccagctgc gccgcacggg 3354541 ccggcgaaac gtacgggtca taggcgacga cgtaagcgcc gaacgcagcg atccgctggg 3354601 cgaccaactg cccgatgcgg cccagaccca ccacgccgac ggttttgccg aagatctcgg 3354661 taccggaaaa cgacgaacgc ttccaggtgt gctcgcgcag cgacgcgtcg gccgccggaa 3354721 tctggcgtga ggcggccagc agcagcgcca gcgcatgctc cgcggcgctg tggatgttcg 3354781 acgtcggggc gttgaccacc agcacgccgc gggccgtcgc ggcgtccacg tcgacgttgt 3354841 ccagcccgac gccggcgcgc gcgacgatct tgagcttggg ggcggcggcc agcacctcgg 3354901 cgtcaaccgt ggtggccgat cgcaccagca gcgcgtccgc ttcgggcacc gcggccagca 3354961 gcttgtctcg gtccggaccg tcaacccagc gcacctcgac ctgatctccc aaggcggcaa 3355021 ccgttgatgg ggcaagtttg tcggcgatca acacaacagg caggctcacg ccgatagcgt 3355081 atcggctgta attgacgagt ggacgtcacc gtcgtcggca gcggacccaa cgggctcgcc 3355141 acggccgtca tctgcgcccg cgcgggcctg aacgtgcagg tcgtcgaggc ccaggcgacc 3355201 ttcggcggcg gcgcccgcag cgcggccgac ttcgaatttc ccgaagtttt acacgacgtg 3355261 tgctccgcgg tgcatccgct tgctttggcg tcgccgtttt tcgccgaatt cgacctaccc 3355321 gcgcgcggag tgacgctgac cgtgcccgac atcgcctacg ccaacccgct acccgggcgg 3355381 cccgcggcga tcgcctatca cgatctggcg cacacctgcg ccaagctgga cgacggcgcg 3355441 tcctggcggc gcctgctggg cccgttggtg gcgcactcgg agacggtcgt ggagttcatg 3355501 ctctccgaca agcggtcttt gcctactgca ctgggctcgg tcctgcgtct cgggctgcgg 3355561 atgctggccc agggcacccc tgcctggcgg tcgctggcgg gcgaggatgc ccgcgcgttg 3355621 ttcaccggcg ttgccgccca cgcgatttca ccgttgccgt cactggtgtc ggccggcgcc 3355681 ggactgatgc tggcaacgct ggcccattcg gtcggctggc cgattccggt gggcggcacc 3355741 caggcgatag ccgacgcgct gatcgccgat ctacgcgcgc atggtggtcg gctcgcggcc 3355801 ggtgtcgaga tcaccgaacc gcaaagaagt gtggtcgtct tcgacaccgc acccaccgcc 3355861 ctgctgcggg tttaccgcga caagcttcca catcggtatg ccaaagcatt gcgccgctat 3355921 cgatttcgcg ctggcatcgc caaggtggac ttcgtgctca gcgacgagat cccgtggtcg 3355981 gatccgcggc tgcggcgggc tgcgaccctg catctcggcg gcacccgtga ccagatggcg 3356041 cgcgccgagg cagacgtcgc ggcgggacgc cacgccgact ggccgatggt gctggccgcg 3356101 tgtccgcacg tcgccgaccc cggccgcatc gacgaaaccg gccgccgtcc gttctggacc 3356161 tatgcccacg tgccgtcggg gtccacgctc gacgcgaccg agaccgtaac cagcgtcctc 3356221 gagcggttcg cccccggctt ccgtgacatc gtggtggcgg cccgcgccgt gcccgccgcg 3356281 cggatggccg accacaacgc caactacgtc ggcggtgaca tcacggtcgg cgccaactcg 3356341 acctggcgcg cgatcgccgg ccccaccccg cggttgaatc cctggcgcac accgattccc 3356401 aaggtgtacc tgtgttctgc ggcgactccg cccggcgccg gcgtgcacgg catgtgcggc 3356461 tggtatgccg ctcgaacgct gttgcgcacc gagttcggca tcacccgcat gccccctttg 3356521 ggccatgagc tgaggccata acgaagcttg cgatcatcga ctattcggag gcgcgccagg 3356581 cggcagcggc gacaaccgga acgtcggcac ggtgctcaat cacgggtgca cggtgtgcat 3356641 cagaatggcg ggggttcgtt gtcgcggtga ggcgttcggc gaggaggtag tgtctacccc 3356701 ttgcccgcgg gttcgtgcgg actgaaggga tttcattggg aacccacggc tgcgtatcgc 3356761 agggcctcgg tgacgtctgc ttcctcaagc tcaggaagtt cggcgagaat ctcggtggat 3356821 gtcatttggt ccgcgaccat cgcgaccaca gtcgccactg ggatgcgcaa gccccggatg 3356881 catggcatgc ctcccatcac gtcggggtcg atggtgacgc gggtgactcg catgtctata 3356941 aggctagccg gtgacagcac gctggggcgg ttctccacca gccgtcttgg cttaagctca 3357001 gccaagagca agccggaggg agatttcggc accgcctgcg gcgcggtttc gggcggtgac 3357061 gctggcgtgg ttgcgttggc tgagggtgtc gacgatggcc agtccaagcc cggtgccgcc 3357121 ggtggtgcgc gcggtgtcgg tggtttccgc gagagccgag cggattgcgg cgagcagttc 3357181 ggggttgctt cgtggacgcc gcagggcgag ttcgagttcg gtggtcagga ggctaagggg 3357241 gtgcgaagtt cgtggcccgc atcgctgacg aattgacgtt ctcgctcgag cgcgtcttgc 3357301 agccgctgca gaaggtcgtt gaatgttgtt ccaagatacc ggatttcgtc tcgagccagt 3357361 ggcaatggca gacgggcatg tgggtcggtg gcgctgatgc ccgcggcgcg gatgcgcatg 3357421 cgttccacgg gtcgaagagc cgcggcggcc agcagatacg cgcccagggc gtcgatgagg 3357481 acgtctggcc gtgcccggat cggtgcgatc gagcttcatc gtggtttcat aatcacccga 3357541 tagaccatgg ctaaccgaac tgccatccac gccgtggatg caacggatgg agggaacgcg 3357601 catggcgggc gccaaacatg ctgggagaat cgtcgcgatc accaccgcgg cggcggtgat 3357661 actggcggcg tgcagttcgg gctccaaggg tggagcgggc agcggccacg ccggcaaagc 3357721 tcgttcggcg gtgaccacca ccgatgccga ctggaagccg gtggccgacg cgctgggacg 3357781 tagcggcaag ctcggagaca acaacaccgc gtatcggatc aacctgccgc gcaatgacct 3357841 tcacatcacg tcctacggtg tggacatcaa accggggctg tcgttgggcg ggtacgcggc 3357901 attcgcccga tacgacaaca acgaaacgct gctgatgggc gacctcgtga tcaccgagga 3357961 ggagttgccc aaggtcaccg atgcgttgca ggcgcatggt atcgcccaga ccgcactgca 3358021 caagcatctg ctgcagcaag acccgccggt gtggtggacc cacattcacg gcatgggtga 3358081 tgccgcccga ctggcccaag gactcaaggc ggcgttggat gccacaacga tcggcccgcc 3358141 taccccaccg ccggcacggc aaccaccggt cgacatcgac gtcgccggcg tcgaccaggc 3358201 gttgggccgc aagggaaccc aagatggtgg gctgatgaag tacagcatcc cccgcaaaga 3358261 caccatcatc gaggacgggc acgtgctgcc cgcagtgtcg ctgaacctga cgacggtgat 3358321 caattttcag ccggtgggcc gcggtcgcgc agcgatcaac ggcgatttca tcctgatcgc 3358381 ccccgaggtt caggaggtca tccgggcaat gcgtgccggc aacatcacga tcgtggaact 3358441 gcacaaccat gggctgaccg aagagccccg cctgttctac atgcattact gggccgtcga 3358501 cgacgcggtc accctggcgc gggcgctgcg cccggcgatg gatgccacca acctgcagtc 3358561 gtcataatcc cgatgcaacc gcataagggc tggtgtggct gatgcatcct gatggcggtg 3358621 catggtttcc tgctcgaacg ggtcagcgtg gtgcgcgacg aggcgacggt gctgcggcag 3358681 gtcagcgcgc attttcccgc tggccgctgc agtgcggtgc ggggcgccag tggatcggga 3358741 aagaccacgc tgctgcggtt gctgaaccgg ctcatcgatc cgacgtccgg aaaagtctgg 3358801 cttgacggtg tgccgctcac cgatctggat gtgctcgtgt tacgtcggcg ggtcggcctg 3358861 gttgcgcagg ctcccgtggt gcttaccgat gcggtgctca atgaggttcg cgtcggacgc 3358921 ccggacctgc cagaaggtcg agtgaccgag ctgctggcgc ggctgtgtct cggccagtcc 3358981 gcacgcgaag cgttcttgcc gcaccaacga tccgccttgc gcactgcgct gatacccgcg 3359041 atcgactcca cgaaagtcgt tgggctgatt agccttccgg gtgcgatgtc cggacttatc 3359101 ctggccgggg tcgacccgct gaccgcgatc cgctaccaaa tcgtggtgat gtacctgctg 3359161 ctcgccgcca ccgcggtggc agcgctgacc tgtgcacgcc tggctgaacg tgccttattc 3359221 gaccgcgcgc accggctcgt ttcgctgccc gcggcgactc gtcgggcatg agttcgcgac 3359281 tcgatcacag ccaatcgccg ctgtggcatg tggccgttgt cagtcgttat ccacggtctc 3359341 cgtgcccagg aagcgaaagc cctgatccag gtgcacttcc acctgagtac catcgggttt 3359401 ggtgacgagc accctcatcg cttcatccct tcttgtcgtc gtcgtggtta cgaaggcgac 3359461 gctaacggcg ccagatgaag ccccgatgaa ggcagcgacg ccggtgacac aacggggcgg 3359521 acctgccccg tggcacacgg cggttgccgg tcacgatcac tgcagtgtcg agacggccta 3359581 ggagctaggc cgtctcggtg atcgggcggt ccacccagct catcaggtcg cggagtttct 3359641 tgccgacgac ctcgatgggg tgctcggcgt tttgccggcg caactcttcg agctgtttgt 3359701 tgccgccctc gacgtcggcg accagcttgt ggacaaagct accgtcctgg atctcccgca 3359761 ggatgtcgcg catccgctcc ttggtgccgg catcgatgac gcgcgggcct gagaggtagc 3359821 cgccgaattc cgcggtgtcc gacaccgagt agtacatccg cgccaggcca ccctcgtaca 3359881 tcaagtcgac gatcagcttc agctcgtgca gcacctcgaa gtaggccaat tccgcggggt 3359941 agccggcttc gaccatgacc tcgaacccgg ccttgaccaa ttcctcggtg ccgccgcaca 3360001 acaccgtttg ctcaccgaac aggtcggttt cggtctcgtc tttgaacgtc gtcttgatga 3360061 cgccggcccg ggtgccgccg atcgctttgg catacgacag cgccagcgcc aagccgtcgc 3360121 ctcgcggatc ctgctctacc gcaaccaaac acggcacacc cttgccgtcg acgaactggc 3360181 ggcgcaccaa atgacccggt cccttcgggg cgaccatcgc gacggcgacg tcggcgggcg 3360241 gcttgatcaa gccgaagtga acgttgagtc cgtgaccgaa gaacagcgcg tcaccgggct 3360301 tgaggttggg ttcgatgtct cctgcgaaga tctcggcctg ggcggtgtcg ggggccaaca 3360361 ccatgaccac atcggcccat ttggcgacct cggcgggagt gtcgacgtcc aggccctgct 3360421 cttctacctt gggccgcgac cgcgaaccct gcttcagccc gacgcgcacc tgcacacccg 3360481 agtcgcgcag gcttagcgag tgcgcgtgcc cctggctgcc gtagccgatc acaccaacct 3360541 tgcggccctg aatgatcgac aggtctgcgt cgtcgtcgta gaacatctct agtgccaccg 3360601 ctgaatctct ccttacctgc tagctacttg gcggtgccga tgccgcgcgg accgcgggac 3360661 agcgacacca ttccggattg ggcgatttcg cgaataccga acggctccaa cacccgcagc 3360721 agggcctcta acttgccgcg gttaccggtg gcctcgacgg tcaatgactc cggggatacg 3360781 tcaatcacgt tggcgcgaaa cagattcacc gcttcgatca cttggctgcg gctgccggcg 3360841 tcggcttgga ccttgatgag cgccaattcc cgtgacaccg agtgctcgtc gtcctgctcg 3360901 acgatcttga tgacgttgat cagcttgttg agctgcttgg tgatctgctc gagcggagtg 3360961 tcctcggcgg agaccacgat ggtcatccgt gacctgtcct tgcactcggt ggcacccacc 3361021 gccaacgact cgatgttgaa accgcgccgg gagaacagcg ccgccacccg cgccagcacg 3361081 ccgggcttgt cttcgaccaa caccgacaac gtgtgcgtct tcgggctcat caggcgtggc 3361141 cttcggtgat gtcgtcgaac agggggcgaa tgccgcgggc ggcctggatc tcgtcattgc 3361201 tggtgcccgc ggccaccatc ggccacactt gcgcgtcggc accgacgatg aagtcgatca 3361261 ccaccgggca gtcgttgatc gcccgcgcct ggttgatgac gtcgacgacg tcctcttccc 3361321 gctcgcaccg caaccccaca caccccaagg cctcggccag tttcacgaag tcggggatgc 3361381 ggtgcgaatg agtggccagg tcggtctgcg agtaccgctc ggcatagaac aggctctgcc 3361441 actgccgcac catgcccagg ttgccgttgt tgatcagcgc caccttgacc ggtatgccct 3361501 cgaccgcgca ggtggccagc tcctggttgg tcatctggaa gcaaccgtcg ccgtcgatcg 3361561 cccagacctc ggtgccgggg agggcgatct tggcgcccat ggccgccggg atggcaaacc 3361621 ccatggtgcc cagaccgccg gagttcagcc agctgcgcgg cttttcgtat ctgatgaact 3361681 gcgcggccca catctggtgc tggccgacgc cggcgacgaa gacggcgtcc ggcccggcga 3361741 tctcgccgag cttttcgatc acgtattccg ggctcaggct gccgtcgctc tgcggcccat 3361801 agctcagcgg ataggtcttg cgcacaccgt tcaggtatgc ccaccagtcg gccatctcga 3361861 tggtgccggg aatgtggtgg tggcgcagca tcgcgatcag ttcggtgatg acggccttga 3361921 cgtcaccgac gatgggcacg tcggcgtggc ggttcttgcc gatctcggcc gggtcgatgt 3361981 cggcgtggat gaccttggct tccggcgcga acgagtcgag cttgccggtc acccggtcgt 3362041 cgaagcgggt acccagcgcg atcagcaggt cgctgcgctg cagcgccgcc acggcggcca 3362101 ccgtgccgtg catgccgggc atgccgaggt tttgccggtg gctgtcggga aacgcgccgc 3362161 gggccatcag cgtggtgacc accgggatgc cggtcagctc ggccagctcc cggagctgct 3362221 cggtggcctc accgcggatg acgccgccgc cgacatacag caccggcttg cgcgcggccg 3362281 cgatcagctt ggcggcctcg cggacctgcc ggctgtgcgg tttggtgttg ggcttgtagc 3362341 cgggcagctc catccgcggc ggccagctga acgtgcactg gccctgcagc acgtccttgg 3362401 ggatgtcgac cagcaccgcg cccggacggc cggaggccgc gatgtggaag gcctcggcca 3362461 gcacccgcgg aatgtcgtca ccggagcgga ccagaaagtt gtgcttggtg atcggcatcg 3362521 tgatgcccga gatgtcggcc tcctggaagg cgtcggtgcc gatcagcccc cgcccgacct 3362581 gaccggtgat agcgaccacc gggatcgagt ccatctgcgc gtcggccagc ggggtcacca 3362641 ggttggtcgc tccgggaccc gacgttgcca tgcacacgcc cacccggccg gtgacgtgcg 3362701 cgtagccgct ggcggcatgc ccggcgccct gttcgtggcg gaccagcacg tggcgcagct 3362761 ttttcgagtc gaacagcggg tcatacaccg gcagcaccgc accgcccgga atcccgaaaa 3362821 tgacgtcgac gccgagttcc tccagcgacc ggatgaccgc ctgtgcaccg gtaagctgct 3362881 gcagtgcaac atgtttcgga cgagccgccg ggtgctttgg ctcattcgcc gcgctgtgtg 3362941 gctctggctt gaatgtcggt gagtgtggct tggttggtgc gctcactgtt gtgtgatcct 3363001 ctattgctct ggaagtctcg ttggtggaca agaaaaaacc ctcgccagct cagctgctgc 3363061 acgagggtcg cgttggtgct cgcttgggct agtcaggcac caacgcgccg accaattact 3363121 acgagcatcc cgggctttcc ggccttgtcc atagtgtccg acggtagcct tcacacagct 3363181 cagcagtcaa atccgcggtg tcagtcttga tccgcgagcg tgacggcact gcgaaatccc 3363241 atgcgaattt tcgcggtggc gttacgctcg cgaactcgac gcccaccaag cggtgagatg 3363301 atgctggggt ggccaccaca tcgccggtcg tgatcaaagt gtcgccgatg gcgcacttcg 3363361 ccgtgggatt cctgaccctg ggtctgctgg tgccggtact gacctggccg gtgagcgccc 3363421 cgctgttagt cattccggtg gcgttgtcgg catcgatcat tcggctccgc acgctcgccg 3363481 acgagcgggg cgtgaccgtg cggacgctgg tcggcagccg cgcggtgcgc tgggacgaca 3363541 tcgacgggct gcggttccac cgcgggtcct gggcgcgcgc aacgctcaag gacggtaccg 3363601 agctgcgatt gcccgcggtg acctttgcga cgctgccgca cctgaccgaa gccagctcgg 3363661 gacgggtccc caacccgtac cgatgacagc gttcaggcca gcggatttgc cccgttgagc 3363721 agcacccata cggcaatacc cgccgcgatg ccacccaaca gcgctacaaa cgatccgata 3363781 aatggccgat gagcccagcc gcgggccgcg tccagaccgt agcggccggg accactcaag 3363841 ataacggcga ccgccatcac gaccagggtg atctggtatt catgcccgtc ctgcaggaag 3363901 tacgcgacgg gccgcgaatg ctgtgccgag atgccggcga gcaggccgtt gatcaagaag 3363961 gccagcgcgc ccgcggccgc cagcggagta aacaaaccca acaccagcag cactccggcg 3364021 acgatctcgc cgccagcgct cacataagcg aggatctcgg cgtgctggta accaatgtcg 3364081 gacagcgagt tctggaatcc ggccagaccc tggccgtccc accagccgaa caatttctgc 3364141 agcccatggg cgataaggac cgcgcccaga ccgacccgca atatcagcag cccgagattc 3364201 tgggtgccgc gccgacctgc ggcgcgtacc cgctcgtcgt cgtccatgtc gattccggca 3364261 gatcccgccg gtacctgccg gcccggctgc ggctggacgt agggcaacgg ctccgcagct 3364321 tcgatcaggc tgtacccgga gttgccgaca ccagagctag cggcatcata gggcgggata 3364381 acggtggtgg ttccactgcc aaagtccccg gcatatctgg ccggcgtcag gtcatcctcg 3364441 gggtcgacca ggcttgccga gacaggccgt ccaggcattg gcccaggcga atcatccggc 3364501 cgctgccaat gtgagtcatt cgaactggtc actcgtgtca gggtaaggcc atttagtgcc 3364561 gaattgggga tttgagcggc gctttcgcca gacaatccgc acattgaccc tgaccagccc 3364621 accaaaaggc cccaattggg ccgccatgcc gacagtgcgc accccggcag gtggcggcga 3364681 tgcccacaat gtccgtagcc tgtcggtcat gtggacaacg cggttggttc gatccggact 3364741 cgccgcgctg tgcgcggcag tgctggtatc gagcggctgc gcacggttca acgacgctca 3364801 atctcagccg ttcaccaccg aaccggagct gcggccccaa cccagctcga cacctccccc 3364861 cccgccgccg ctgccgccgg ttccctttcc caaggaatgt ccggcgccgg gcgtgatgca 3364921 aggctgcctt gagagcacca gcggcttgat catgggcatc gacagcaaga ccgcactggt 3364981 cgccgagcgc atcaccggtg ccgtcgagga gatctctatc agcgccgagc cgaaggtaaa 3365041 gacggtcatc cccgtggatc ctgccggtga cggtggcttg atggacattg tgctgtcgcc 3365101 cacctactcg caagaccggc tgatgtacgc ctacatcagc acgcccaccg acaaccgggt 3365161 ggtgcgagtg gccgacggcg acatccccaa ggacatcctg accggcatcc ccaaaggtgc 3365221 tgccggtaac accggggcgc tgatcttcac cagtcccacc acgctggtcg tgatgaccgg 3365281 ggatgctggc gacccggcgt tggccgccga tccccaatcg ttggccggta aggtcctgcg 3365341 tatcgaacag cccaccacca tcggccagac gccgccgacg acggcgctgt ctggcatcgg 3365401 ctccggcggc ggcttgtgca tcgatccggt cgacggctcg ctatatgtcg ccgaccgcac 3365461 gccaacggcg gaccgattgc agcgcatcac caagaactcg gaggtctcta cggtatggac 3365521 ctggccggac aagcccggcg tggccgggtg tgccgcgatg gacggcaccg tgctggtcaa 3365581 cctgattaat accaaactga cggtggcggt ccggctcgcg ccgtcgaccg gtgcggtcac 3365641 cggagaaccc gacgttgtcc gcaaagacac tcatgcgcat gcgtgggcat tacggatgtc 3365701 gccggacggc aacgtctggg gagccaccgt caacaagacc gccggcgacg ccgagaagct 3365761 cgacgatgtg gtgttcccgc tgttcccgca gggtggcggc ttcccgcgca acaacgacga 3365821 caagacctga cccggttagg gcacgtcgag cgtgaacctt acgacgccgt atcggcgtgt 3365881 ctcgtcgccc cgttcacgct cgtagaaccg gggtgaggct tccttgccag ggtcgatgtc 3365941 gtcgacatca aagtcgaggt cggagaggta gagcagatct tccgagcact ccggagccca 3366001 cacgctcacg ggctccaaca ggtaagccac ataatccccg acatcgctgc gactggccgt 3366061 cctaccgatg aaccaggcgg ccgcgtcgtc gagaatcggc attccacagg ggccagcgcg 3366121 ccacgagcaa cgggcgaact tgttgacctc ctcctccgtt tggctgccga acagttcggc 3366181 gagcacatgc tgccgctgcg aaagcacgtg cacggcgagg tgctcggatc ggctcgccac 3366241 ctcggaggtg ccggtgctcc tcggcaggcc gaccataaaa ctcgggggct gcacgctcgt 3366301 ttgggtagcg aagctgacca gacaacccgc ggggtgacca tcggcctggg ttgtcaccac 3366361 aaacaccggg tggtccagca tccccatcaa ctcgtcgaac gactcatcga tcacatcacc 3366421 atcatgaatc cgcgcaacgt cttctgacac tctttccgag cgttcagtcg gcgaatcgcc 3366481 gctaccgcca tcacgtcgac cggtgaggcc gccgtcacgg ccccaaatcg gcgacgatct 3366541 gggcacggaa tcagaacctg attgggtccc ggccagcctc gctggcgtgg gaagtcacca 3366601 cggtcgcgcc gcggcttctc aacccggccg acacgcgctc ccgatgctca ctgtcgtcgc 3366661 tgtcataggc atactcgaat gtggattagt gctacacatg cctgacaacg atttgtggta 3366721 ctgcgggcca tggacactat gggtgatggc cggtaggggt gttgcgtcgg gcgcgggagt 3366781 gtggcgaggt gatcgcgttg cgacgcccct tgcggtggcg attaccgcag ccggattggt 3366841 atcaggggcc cggataggac ccggtgcggc tgcgaaacgc gacccgcagc tcgcacagtg 3366901 gaacgagatt cgcagtcact accaagagat cgccgagtgg atcgaccacg acacagcaac 3366961 cgcacacccc gctgttgccg caacgcagat cagtgccgct ggctctttcg gccgcgccaa 3367021 tatggtcgac tacctggggc tcctggattc cagggccgac gaaacggtcc gacgcgacga 3367081 attttcgcgg tggctgtcgg ccaaacccga ctacttggtc accaccgagc aatctgtcga 3367141 cgccgccacg atagcccttc ctgaattccg ccatgcgtac gaccgcgcgg ccaccatcgg 3367201 gacactcaac gtgtatcgtc gcaactcccc tgacggtgat gaaccgctac ccgcggacgg 3367261 caactaaccc tgcccgcagg cctctagaac gagttcgcgc actcgggccg cgtcggcctg 3367321 tccgcgggtc gccttcatca ccgcaccgac aatcgcgccg gccgcggcca ccttgccgcc 3367381 gcgaatcttg tccgccacat caggatttgc ggccagggcc tcgtcgaccg cggcctgggt 3367441 caacgagtcg tcgcggacca acgccaaccc tctcgcagtc atcacctgtt cgggctcacc 3367501 ttcaccggcc agcacaccct ccacgacttg gcgggccaag ctgttggaca gcttgccctc 3367561 atcgaccaat gccaccacgg ctgcgacctg ggcaggagtg atggccagtt cgtccagccc 3367621 gatgccggcc tcgttggcct tttgcgccag gaagtttccc caccaggcgc gcgccgcctc 3367681 gctggacgcg ccgtgctcga cggtggcagc aaccaattcg acggcgccgg cgttgaccag 3367741 atcgcgcatc acctcgtcgg aaacgcccca ctcctgctga atcctcctgc ggctcaacca 3367801 cggcaattcg gggatcgtct ggcgtagtcg ctcgaccagc tcgcgactgg gcgcgacagg 3367861 ctccaaatcc ggctccggga agtaccgata gtcctcggcg gtctccttgg tgcggcccgc 3367921 gctggtgtaa ccggcctcgt gaaagtgtct ggtttcctgg gtgatccgac caccagacgc 3367981 caaaatagcg ccctggcgct gcatttcgta gcggacggcg acttcgacgc tcttcagcga 3368041 gttgacgttc ttggtctcgg tccgggtgcc gaattcggtc gtcccggccg gcttcagcga 3368101 cacgttggcg tcacagcgca tcgaaccctg gtccatccgg acatcagata catctaatgc 3368161 gcgcagcaga tcccgcaacg ccgtcacata ggaccgggcg atctgcggcg cccgggcacc 3368221 ggcgcccacg atgggtttgg tgacgatctc gatgagcggc acgccggcac ggttgtagtc 3368281 gatcagcgaa ccggtggcac cgtggatccg gcccgtctcg ctgccgatgt gggtgagctt 3368341 gccggtgtct tcttccatgt gagctcgctc aatctccacc cgccaagtgg tgccgtcttc 3368401 caaaggcgcg tccaggtagc cgttgatggc gatcggctcg tcgtactgtg agatctggta 3368461 gttcttgggc atgtcggggt agaagtagtt cttccgggcg aagcgacacc agggtacgat 3368521 ctcgcagttc agcgccagcc cgatgcggat cgccgactcc acggcggccc ggttgagcac 3368581 cggcagcgaa ccgggcaagc ccagacacac cggacacacc tgggtgtttg gctcgccgcc 3368641 gaatgtggtg gtgcagccac agaacatctt ggtcgcagtg gacagctcga cgtgcacctc 3368701 gaggccgagt accggctgga agcgcgcgac gacctcgtcg taatcgagca gttcagcccc 3368761 tgcggccttg gctgccccgg cagcaacagt catagccgcg atcctagttt gagcacccga 3368821 cgtcaaccga agaaggcggc ggcgtcgtcg taacggctct gcggcaccag tttgagtttg 3368881 cgaaccgcat ccgccagcgg aacccgaccg atgtcctggc cgcgcaacgt caccatctgg 3368941 ccgtactcgc ccgcatgcgc ggcgtcggcg gcgttcaccc cgaatcgggt ggccagcact 3369001 cggtcgtagg cggtcggagt accaccccgc tggatgtggc ccaacaccgt cacccggaca 3369061 tccttgttga tgcgcttctc gacctcgacc gccagctgcg ccgctacacc tgtgaaacgc 3369121 tcgtgcccga actcgtcgag accaccctcg cgcagcatga tcgtccccgg agccggtttg 3369181 gcgccttcgg cgaccacgca gatgaaatgc gagtccccgc gctggaaacg gcctttgacc 3369241 agtcggcaca cctcttcgat gtcgaacggc tgctcaggaa tcagggtcat gtgagcaccg 3369301 gaggccagcc cggcgttcag cgcgatccag ccggcatgcc tacccatcac ctccaccagc 3369361 atcacccgct cgtgggattc ggcggtgctg tgcagccggt cgatggcctc ggtggccacg 3369421 gtcaacgcgg tgtcgtggcc gaaggtcaca tcggtgcagt cgatgtcgtt gtcgatcgtc 3369481 tttggcaccc cgaccaccgg cacattctct tcggagagcc aactcgcggc ggtcagcgta 3369541 ccctcaccgc cgatcgggat caggacgtcg atcccgttgt cgtccaaggt ctgcatgatt 3369601 tggggcagcc ccgcccgcag tttgtcgggg tgcacccggg ccgtgcccag catcgtgccg 3369661 cccttggcca gcagccggtc attgcggtcg tcgttgtgca gttgaacacg gcggttctcc 3369721 agcagcccgc gaaagccgtt ctgaaatccg accaccgacg agccgtatcg ggcgtggcag 3369781 gtacgcacca ccgcacggat gacggcgtta aggccgggac agtcgccgcc tccggtaaga 3369841 actccaatcc gcataccctc atcttgccgc gcggccgccg acctggcgcg agcagacaca 3369901 gaatcgcacg ggcgaggggc gccggatgcg agtctgtgtc tgctcgccgc taaatggcgc 3369961 tcagtagcgg gccgcgggcg gcctcataag ccgcccccac ccggtagagc cggtcgtcgg 3370021 ccaatgccgg cgccatgatc tgtaggccaa ccggcaaccc gtcgtccggg gagagccccg 3370081 acggcacaga catgccgcag tggccggcca agttcagcgg cagcgtgcac aggtcgaaca 3370141 agtacatcgc cagcggatcg tccaccttct cacccatccg gaacgcggtg gtcggggtcg 3370201 tgggcgacac cagcacgtcg acggaccgat acgccgcgtc gaggtcgcgg gcgatcagcg 3370261 tgcgcacctt ctgcgcctgg ttgtaatagg cgtcgtagta gccggccgac aacgcgtagg 3370321 tgccgatcat gatgcgccgc ttgacctcgg gcccgaaacc ggcggcccgg gtcatcgcca 3370381 tcacctcctc ggcgctgcgg gtgccgtcgt cgccgacccg cagcccgtag cgcatcgcgt 3370441 cgaagcgcgc cagattgctc gacacctccg agggcagaat caggtaatag gcggccaggg 3370501 catggtcgaa gtgcgggcag tcgacctcgc tgacctcagc gcccagcgcg gttagctgct 3370561 ccacggcagc ctcgaaggag gccagcacgc ccggctggta gccctcgccg ccgtgcagct 3370621 gtcgaaccac gccgacccgc acgccacgca gatccccgac cgcgccggcc ctagcggcgc 3370681 ccaccacgtc gggcacctcg gcgtcgaccg acgtggagtc gcgcgggtcg tggccggcga 3370741 tcacctgatg caacagcgcg gtgtccaaga cggtgcgcgc acacgggccg ccctgatcca 3370801 gcgaggacgc gcaggccacc agcccatagc gcgacaccgt gccgtaggtg ggtttgacgc 3370861 cgacggtcgc ggtcagcgcg gccggctggc ggatcgaccc cccggtgtcg gatccgatgg 3370921 ccagcggcgc ctggaacgcg gccagcgccg ccgcgctgcc gccaccggaa ccgccgggta 3370981 cccggtcgag attccacggg ttgcgggtgg gaccgtaagc ggagttctcc gtcgacgagc 3371041 ccatcgcgaa ctcgtccatg ttggtcttgc ccaggatcgg gatccccgcg gcgcgcaacc 3371101 gcgcggtcag cgtggcgtcg tagggagatc gccatccctc caggattttt gacccgcagg 3371161 tggtgggcat gtcgctggtg gtgaagacgt ccttgagcgc cagcggcacc ccggccagcg 3371221 ccgacggcaa gggttctcca gcggccacct gcttgtcgat ggcggccgcc gccgccagcg 3371281 cctcatcggc cgccacatgc aggaaggcgt ggtacgtctc gtcggtcgcc tcgatctgat 3371341 ccaggcaggc ccgggtgatc tcggccgacg acacctcctt gatggcgatc ttggcggcca 3371401 gcgtcgcggc gtcggatcgg atgatgtccg tcactgttca tcccccagga tctgcgggac 3371461 ggcgaagcgg ccgtcgacgg catcgggcgc ctggtcgagc acctgacgct gggtcaggca 3371521 cggcacggtc tcgtccgggc gggtgacgtt gacgtccttg agcggattgt cggtggcctg 3371581 cacaccggtg acgtcgacgg cctggatctg gctgacgtgg gtcaggatgg cgtcgagttg 3371641 gccggcgaaa ctgtccagct cggtttcggt caatgccagc cgggcaagcc tggcgaggtg 3371701 ggcaacctcg tcgcgggaga tctgggacac gaccgcaaag cctaatgggt ggccggacgg 3371761 ccgacgccgg ctgccgaaac gccgtggata catcgttgtg ccacagtgtt ggccgtgcgt 3371821 tcgtatctat tgcgtatcga gctggccgac cggccgggca gccttgggtc gctggcggtc 3371881 gcgctcggct cggtgggcgc cgacatcctc tcgctcgacg tggtcgagcg cggcaacggc 3371941 tatgcgatcg acgacctggt ggtcgaactg cccccgggag cgatgcccga cacgctgatc 3372001 actgctgccg aggcgctgaa cggcgtccgg gtagacagcg tccgcccgca caccggcctg 3372061 ttggaagccc accgcgagct ggaactgctc gatcatgtgg ccgcggctga gggcgcgacc 3372121 gcacggctcc aggttctggt caacgaggcc ccccgggtgc tccgggtgag ctggtgcacg 3372181 gtgttgcgca gttccggcgg ggagctgcac cgtctggccg gcagcccagg tgcgccggag 3372241 acccgggcca attcggcgcc ctggctgccg atcgagcggg ccgcggcgct ggacggcggc 3372301 gccgactggg tgccgcaagc ctggcgcgac atggatacca ccatggtcgc ggctccattg 3372361 ggtgacacgc acaccgcggt ggtgctgggc aggccaggcc cggaatttcg cccgtcggag 3372421 gtggcgcggt tgggttatct agccggcatc gtggcgacga tgctgcgctg agcggttcgt 3372481 tggcaaccaa ggttcgccga gcgtaacgcc actgcgaaaa accgcgcgga gattcgcagt 3372541 gccgttacgt tcgtgacgcg ggtccgtcgg ccagcagtct ccggaaccca tcctcgtcca 3372601 gaatcggcac ccccaactcc accgccttgt cgtatttgga tcccggcgag tctccggcga 3372661 cgacatagtt ggtcttcttc gacaccgagc cggcggcctt gccgccgcgg gccacgatcg 3372721 cctccttggc gtcgtcgcgg gagaaaccgg tcagcgagcc ggtgaccacg atggtcagcc 3372781 cggccagcgt gcgtggcaca ctctcgtcac gctcgtcgac cattcgcacc ccggcggccc 3372841 gccacttgtc gacgatctcg cggtgccagt cgacggcgaa ccactcggtg accgcggcgg 3372901 caatggtcgg ccccaccccc tcgacggcgg ccagctggtc ggtggacgcc gcggcgatgg 3372961 cgtcaaggct gccgaactcg gtggccaggg cgcgggccgc cgtcggcccg acatggcgga 3373021 tggacagcgc caccagcacc cgccacagcg gtgccgcctt ggccttgtcg aggttgacca 3373081 gcagccgttt gccgttggcc gacagttcgc ctgccttggt tcggaacagg tcggtgcgca 3373141 gcaagtcccg ctcggtcagc gcgaacagct cgccctcgtc ggcgatcacc ttcgcctgca 3373201 agagcgccac acccgcctcg taaccgagca cctcgatgtc taggccgttg cggctggcga 3373261 cgtggaaaac ccgctcccgc agttgccccg ggcagccgcg ggcgttgggg caacggatgt 3373321 cggcgtcgcc ttccttctcc ggcgccaacg gcgaaccgca ctccgggcag gtggtgggca 3373381 tgatgaattc gcgttcggag ccatcgcgca gttcgacgac gggtcccagc acctcgggga 3373441 tcacgtcgcc ggccttgcgg atcaccacgg tgtcgccgat cagcacgccc ttgcgcttga 3373501 tctccgaggc gttgtgcagg gtggcctgtc ccaccgtcga cccggccacc ttcaccggcg 3373561 tcatgaacgc aaacggcgtg atccgcccgg tgcggccgac gttcacccgg atgtcgagca 3373621 gcttggtctg cgcttcctcg ggcgggtact tgtaggcgat ggcccagcgc ggcgcccgcg 3373681 acgtggaacc cagcctgcgc tgcaacgcca cctcgtcgac tttgaccacc acgccgtcga 3373741 tttcgtggtc cacctcgtgg cggtgctcgc cccagtagtc gatgcgctcg cgcacaccgg 3373801 ccaggtcggt tgccagggtg gtgtgttcgg aaaccggcag tccccatgcc cgcaacgcca 3373861 ggtatgcctg atgcagggtg gccgggcgaa agccctccac gtggcccagc ccgtggcaga 3373921 tcatccgcag ccggcggcgc gcggtgaccg ccgggtcttt ctggcgcagc gatcccgccg 3373981 cgctgttgcg ggggttggcg aacggcgcct tgccctcctc gacgaggctg gcgttgagcg 3374041 cctggaagtc gtccagccgg aagaagacct cgccgcggac ctcgaggacc tcgggcaccg 3374101 ggtagtcgtc gccgggggtg agccgttcgg gaacgtcggc gatggtccgg gcgttcaggg 3374161 tgacgtcctc gccggtgcgc ccgtcgccgc gggtggaggc ccgggtcagc cgtccctcgc 3374221 ggtagaccaa agacagcgcg acgccgtcga tcttgagctc acacaggtaa tgtgcggcgt 3374281 ctccgacctc ggcatggatg cggccggccc aggcggcgag ttcgtcggcg gtgaacgcgt 3374341 tgtcgaggct gagcattcgt tcgagatggt cgacgggctc gaaatccgtg gcgaagccgg 3374401 caccgccgac cagctgggtc ggcgaatcgg gcgtgcgcag ctcgggatgc tgctcctcga 3374461 gggcttccag acggcgcagc agctcgtcga attccgcgtc gctgatgatc ggcgcgtccc 3374521 gcacgtaata acggaactgg tgctcacgca cctcctcggc cagtgcctgc cactgccgca 3374581 acacctcggg agcggtctga tcggcgtctg gggagctcac tctggcaggc tagccgaggg 3374641 ggctcttccc tcagatggcc tctgggtccc gcgcgaacgc ctcagcgaca tcacgggcaa 3374701 gcccgaccgc ggtgcgggcc cactgccccg tcgcattggc cagaccacac gccgggctga 3374761 cgccgagtcg atcgcgtagc gccgagcgag gaacgccgag ccgatcggtg accgcgaccg 3374821 ccgcagcagc gacctcttcc atcgaaggtg ctcgctccgg ggcggtcacc gggaccaggc 3374881 ccagcacgac ggttcggccc gactcgacaa atgccgcgac agcatccaaa tccgcagcct 3374941 gcagtgtgct cgcatccacc gataccgcac taattctgct gcgctgcagc agatcccacg 3375001 gcaaatccgg actgcagctg tgtagcgcta cgtccgcgtc gacagccgcg atgcaagtgt 3375061 cgagcagcgc ttcggccacc gtctcgtcga gcggggcaac cgggctcaac gcggtcaccc 3375121 cggtcagccg gccgcccaac gccgccggca acgacggctc gtcgaactgc accaccaccg 3375181 gtgtgtcaag tcgacgcgcc agcgccgcgc gatgcgcggc aacgccttcg gccagcgagg 3375241 cggccaggtc acgcacggct ccggggtcgg tgatcgcccg gtgaccgttg gccagctcca 3375301 accccgcgac caatgtgact ggcccgggcg cctgcacctt caccgcccgc ccacagccac 3375361 gcaggcccgc ggtctcccag gcctcttcta aggcatccat atcctcgtcg aggaggctcg 3375421 cggcccgccg tgtcaccgcg ccgggtcgag cagcgatgcg gtagccacga ggcacggtgt 3375481 caatcgccac gtcgaccagc agtccgccgg ctcgccccag catgtcggcg ccgacgcccc 3375541 tggcgggcag ctcggtgaga taggccaatg cacccgccaa ctccccgacc acgacctgcg 3375601 cggcctctcg cgcggcggtg cccggccacg atccgatccc ggtggccgtt gcgaaaacac 3375661 tcacccggca accgtattcg acctcacatc gtcggctggc cgccaggggt gtctgctgca 3375721 ggttcgcccg ggtaccttcg aagcagaagg gtggcagatg gtgggattga cgcggccgct 3375781 gctgttatgt ggcgcgacac tactgattgc ggcgtgcacc cgggtggtgg gcggcacggc 3375841 ttcggcgact tttggcggtg accgacaggg catgcttgac gtcgctacga tcctgttgga 3375901 tcagtcacgg atgcaagcaa tcaccggctc cggcgatgac ctgacgatca tccccacgat 3375961 ggacacgacg tatcccgtcg acgtcgacga tttcgcccaa cccataccac gagaatgccg 3376021 gttcatctat gccgagacgg cagtctttgg ctctgagatc gaagcgtttc acaagaccac 3376081 cttccaggac cggccagatg gcagtctgat ctccgaggcg gccgccgcct atcgggatgc 3376141 cggcaccgcc cggcgtgcct tcgacaccct ggcggtcacc gtccacgact gcgcggcaag 3376201 tccggcaggc tggctgttcg tcagtaggtg gaccgccggc ggcaattccc tacacatccg 3376261 ggccggcgat tgcggtcgcg actaccgggt cctatcggcg gccctgttgg aagtgacctt 3376321 ctgcggcttc ccggaatcgg tctccgacat cgtgatgacg aacatcgccg ccaacgtgcc 3376381 gggttagcac ctcgagcccg cgttcaggat gccaggacgg atgtcaacgt ggtcagttgt 3376441 gcgttgcgct gcgcgacgac attggtgctg acatttccac cacgcgcgtt tagtctccgg 3376501 cgtcggcggc cgtggctgga cccgcatggc gcgggtccag ccaccgaccc cggaacgacc 3376561 ccaccctaat cgttccgcag tctgacgaat cgcctaccgg cctttccagc accccgatct 3376621 ggcgtagtgc tcgccggcac cgacggtagg cccgcgcgag agcctccatg gcctgattcc 3376681 actgggcctg ccagtcctga tgactcatcc cgagatcacc ttggcaagca cgcgacggcg 3376741 cggtgcgctc actggcgata tcggccccca agctctgcgt cgtgcccgta taaccggcca 3376801 tgtctccgac attggccgtc atcgccgggt agctgtacat actctgcgac accacgaatc 3376861 cctttcaaat attccgggca atgattttta gacactcttt cgatcgaaaa tttggtcgag 3376921 ttcacggccg tcagatcgtc aaactgacac caacccccca tcaccggcca caccgaccaa 3376981 atccggcccc cagctgcccg gcagcatcgg caccggcgca ccatcaccaa actcatcggc 3377041 caacaccgtc aaccccgccg gctgcccaac cgactccttg ccagccgtcc caacaaaccc 3377101 caacgcccca gcaccccgat cagaagccag caccgacacc gacgtgctcg cgggagccgg 3377161 ctccaccgcc gacaccaacc gcgcctgcgg cgacaccaca ccacccgaca ccggcgccac 3377221 actgcccacc aacgccgccg gcgcgccagc agccgcaccc accgccggca gagcggccaa 3377281 tcccgccacc ccggccaacc cagctacacc cggaaccaca gcagcggcca acgcccccaa 3377341 caacggcccc cccaaaaggg gaacaacacc aaacagattc ccaataaccc actcgacaac 3377401 tatgtctata acgcccgcaa tggcatacaa taatccgacc gcaacataag aagcattgat 3377461 agcgaactct gtcaatagtt gagcattgga tgccaacgta ataatgaacc caataatgtt 3377521 gaaacccagg atatccacaa agagctggaa ccaaacccaa gccaccgcag ggagttcgga 3377581 aagcaaagcg gacagatact ggtcatatgc tgcgaacgtt tcttctaaaa actgtacaat 3377641 ttcgtgccat gggaatgggg ttatggttgc ggcggccacg gcgttgctgg cttcattggc 3377701 gccgggtttg acgatgaccg gtgccgggcc ggtgtgtggt gtggccacca gcgcggcacc 3377761 caccaccgcc tcataggcgc tcatcacggt ggccgcctgg acccacatcc gcacatagtc 3377821 ggcctcgttg agcgcgatcg ggatcgtgtt gatcccaaag aaattcgtcg ccaccaacac 3377881 cgcatgcgtg aggtggttgg ccgccaactc cggcaacgtc ggcatctccg ccaacgcaca 3377941 aacatagcca gccgccgcgg cctcatgctc accggccgcc gccgcgctat ccgcactggc 3378001 ctgcaccaac cacgccacat acggcacata ggcggccaca aacaactcag cactgggacc 3378061 ctgccacacc ccggccccca ccgcggccac caccacgctc aactcttgcg ccacagcggc 3378121 gtactcggcg cttaacgcgc tccaccccgc cgcggccgcc tgcaacgaac ccggccccgg 3378181 accagcactt agcagcgccg aatgcacctc cggcggcgac gccaaccaca ccggcgccgt 3378241 cacaacgacc cacccgaaac cagatacgtg cccaggacac cgaactcgac cgtgcggtgc 3378301 gaggacaccg gatcacccgc tagcggaatc aatgtgcggc ggccagccgt gcggtcaacg 3378361 cctccaccgc cgcactggcg gccgccaaac cctcaggaac cacgctcagc gtcaccacgc 3378421 acactccttc cttaggcgcc tcccacaccc atctcccgga tttttgctct atcaactgtt 3378481 gtaaatagct acgattaccc aggcgtagac gacgacgccg cagattcctc acacccgcgc 3378541 ctgcgcaatt ggccacgcac caccgccggc agcgaggccg ccagccacac accaagctcc 3378601 tcgccgacca catcggctac cggatccacc aacagcgcaa cggcattcgg atgggggccc 3378661 atcaaaccca ccgatcatcc cggcgtcgcc gaccacgccc gcgccgtgtg ctagccgccc 3378721 cacttggcgg cttcggcccc atctcgagcc aacatcgcca tggtgttgga ctcatgggtg 3378781 ccagacatcg actgataggc ccgcaccaga tcctctaggg cctggttcca ctgggtctgc 3378841 cagccctgat acgtgatccc ggtatcaccc tgccaagcac tggacagcac ggcctgctca 3378901 ctggcgatat cggcccccaa gctctgcagc gtgcccgcat aaccggccat gtccccggca 3378961 tgagccatca tcgccggata gttgtacata atctgcgaca tcacaaaccc cttttcattc 3379021 cgagcagcga cttttttaaa acccggtgta gctggacgcg gcggcggcat cggcggccac 3379081 atacgtgccc gcggcctcac ccaaattggc ttgcgcgata tccagcaagg tattgacctt 3379141 ggcggccgcg gccacaaacc gggcatgcgc accctgaaac gccgccgcgg actctccctg 3379201 atgaaacgcc tgcgccgaca tcgcctgctg ctcggcctga ccgatcgtat gccgcatcaa 3379261 ccccgcctta gcggcaaacg ccgtatgcga agcgatcaac tgcggaatat gggcatccaa 3379321 caaactcatc acaattcctt ccaattcgaa tcaccaatta ctcgccgtca gatcgtcaaa 3379381 ctgacaccaa ccccccatca ccggccacac cgaccaaatc cggcccccag ctgcccggca 3379441 gcatcggcac cggcgcacca tcaccaaact catcggccaa caccgtcaac cccgccggct 3379501 gcccaaccga ctccttgcca gccgtcccaa caaaccccaa cgccccagca ccccgatcag 3379561 aagccagcac cgacaccgac gtgctcgcgg gagccggctc caccgccgac accaaccgcg 3379621 cctgcggcga caccacacca cccgacaccg gcgccacact gcccaccaac gccgccggcg 3379681 cgccagcagc cgcacccacc gccggcagag cggccaatcc cgccaccccg gccaacccag 3379741 ctacacccgg aaccacagca gcggccaacg cccccaacaa cggccccccc aaaaccggaa 3379801 tagcgccaaa aatattcgaa ataatccaac caatactgag tatcgccagt tccaagacaa 3379861 ccgcgaattc taatagcgga accagaacaa atacgcctaa cgcaaaacct accgtggcga 3379921 aaaacatatc gatcatcccg gtaaggacga gccaaggctc gaaatttacc agacccgtga 3379981 tcagctcgac gaatcccaca gcccaggctt cggcgctctt cattatcaat tcgccgacct 3380041 ctgtaaatgc ttgggcagcc atttccaaaa acttcgctaa ttctccgaat gggaatgggg 3380101 ttatggttgc ggcggccacg gcgttgctgg cttcattggc gccgggtttg acgatgaccg 3380161 gtgccgggcc ggtgtgtggt gtggccacca gcgcggcacc caccaccgcc tcataggcgc 3380221 tcatcacggt ggccgcctgg acccacatcc gcacatagtc ggcctcgttg agcgcgatcg 3380281 ggatcgtgtt gatcccaaag aaattcgtcg ccaccaacac cgcatgcgtg aggtggttgg 3380341 ccgccaactc cggcaacgtc ggcatctccg ccaacgcaca aacatagcca gccgccgcgg 3380401 cctcatgctc accggccgcc gccgcgctat ccgcactggc tgcaccaacc acgccacata 3380461 cggcacatag gcggccacaa acaactcagc actgggaccc tgccacaccc cggcccccac 3380521 cgcggccacc accacgctca actcttgcgc cacagcggcg tactcggcgc ttaacgcgct 3380581 ccaccccgcc gcggccgcct gcaacgaacc cggccccgga ccagcactta gcagcgccga 3380641 atgcacctcc ggcggcgacg ccaaccacac cggcgccgtc acaacgaccc acccgaaacc 3380701 agatacgtcg ccgccgccac cgcatcaccg gcggcataac cgatcccaga ctcacccaca 3380761 gcgaccccgg aacgacccag ctcctcgacc ccttcgcccg cgatcgccgc atgctcgcta 3380821 cctaaggcgc taaaccccac cgcactctgc aacgacaccg gatccgccgc cggcgccacc 3380881 accgccgtaa tcgccggcgc cgcgccagcg tgtgcggcgg ccagccgtgc ggtcaacgcc 3380941 tccaccgccg cactggcggc cgccaaaccc tcaggaacca ctctcagcgt caccacccac 3381001 actccttcct taggcgtcac acacccgcac gaccggttac cgtcaccagc ggagcgaatt 3381061 attgacacct gtcttgacgc ctgtcttgac atgcgtcagg caatattgat ctcacagatc 3381121 gttgcgtatg tcaactgtta ttgatagcta ctattacgta ggcgtaggtg acggctccgt 3381181 aggattcggg gactagcccg ttgcttgggc tgcccgaccc ccgccccgtc ccacgcaacc 3381241 cggctgcccg tcgtcgggcg acatcccggt ctctatcggc ggacccgagc agccgcccgg 3381301 ctagccagtc gcggccaagg ccagggacgt ggtgtacgag tgaaggttcc tcgcgtgatc 3381361 cttcgggtgg cagtctaggt ggtcagtgct ggggtgttgg tggtttgctg cttggcgggt 3381421 tcttcggtgc tggtcagtgc tgctcgggct cgggtgagga cctcgaggcc caggtagcgc 3381481 cgtccttcga tccattcgtc gtgttgttcg gcgaggacgg ctccgacgag gcggatgatc 3381541 gaggcgcggt cggggaagat gcccacgacg tcggttcggc gtcgtacctc tcggttgagg 3381601 cgttcctggg ggttgttgga ccagatttgg cgccagatct gcttggggaa ggcggtgaac 3381661 gccagcaggt cggtgcgggc ggtgtcgagg tgctcggcca ccgcggggag tttgtcggtc 3381721 agagcgtcga gtacccgatc atattgggca acaactgatt cggcgtcggg ctggtcgtag 3381781 atggagtgca gcagggtgcg cacccacggc caggagggct tcggggtggc tgccatcaga 3381841 ttggctgcgt agtgggttct gcagcgctgc caggccgctg cgggcagggt ggcgccgatc 3381901 gcggccacca ggccggcgtg ggcgtcgctg gtgaccagcg cgaccccgga caggccgcgg 3381961 gcgaccaggt cgcggaagaa cgccagccag ccggccccgt cctcggcgga ggtgacctgg 3382021 atgcccagga tctctcggta gccctcggcg ttgacgccgg tggcgatcaa ggtgtgcacc 3382081 ccgacgacgc ggcctgcctc gcgcaccttg agcaccaggg cgtcggcggc gaggaaggta 3382141 tacgggccgg catcgagcgg gcgggtccga aacgcctcta cggcttcgtc gagctctttg 3382201 gccatgatcg acacttgcga cttggaaagc tttgtcacac caagtgtttc gaccaggcgc 3382261 tccatccggc gagtggatac tcccagcagg tagcaggtcg ccaccacgct ggtcagtgcg 3382321 cgttcagctc gcttgcggcg ctgcagcagc cagtccggga aatagctgcc ctggcgcagc 3382381 ttggggatcg cgacgtcgat ggttgcggca cgggtgtcga aatcacggtg gcggtagccg 3382441 ttgcgctgat tggaccgctc atcgctgcgt tcgcggtagc ccgccccgca cagggcgtcg 3382501 gcttcagccc ccatcaaggc ggcgatgaac gtcgagagca gcccgcgcag cagatccggg 3382561 ctcgcctgtg cgagttggtc agccagaagc tgctcggcgt cgataagatg agaagaggtc 3382621 attgcgtcat ttccttcgat tgacttttgc tggtcgtttc gaaggatcac gcgatgaccg 3382681 cccactactg ggctacgaca cgcccaccgg ccttacctgc ccgtacacca cacccctgga 3382741 cgtaactcca gtcgccgggt ttctacgagt gatttggcgc cgagtcaagc cccggggttg 3382801 ccgccagtcg acaaccctga agcgccggcg atggtcgcgc tgccgagcac ctcgtcaccg 3382861 gctgggtcag gtcggtagag caccagcgtc tggccgcgcg ccacgccgcg cagcggggca 3382921 tgcaactgca cgaaaagcgc atcgccgatc aattccgcta ccgcactgac ggtttcaccg 3382981 tgcgcacgca cttggaccac gcagtcaacg ggtcctgacg gcgcggctcc ggcggtgaag 3383041 acgggagcgc gcccagtcag cgtttgcaca tcaaggtcgg tcacgtcacc tacgtgaacg 3383101 gtggcggtgt cggcgtcgat cgccgtgaca tagcgcggac gaccattcgg gcccggcccg 3383161 gcgatgccca ggcctctacg ctgcccgatg gtgaacccgt gcaccccatc atgggaagcc 3383221 agcaccacac catccgcgtc aaccaccaca ccacggcgaa ccccgatgcg ctcacccaaa 3383281 aaagccttgg tgttcccgga cggtatgaag cagatgtcgt ggctatccgg cttgttggcg 3383341 accgccaggc cgcggcgggc cgcctcggca cggatctgcc gcttcggcgt gtcgccgatc 3383401 gggaacgcgg cgtggcgcag ctgctgcgca gtgagcacgg caagcacata agactgatcc 3383461 ttgtcccggt cgacggcgcg gcgcagccgc ccacccgaca gccgggcgta gtggccggtg 3383521 gccaccgtat cgaaacccaa cgccacagcc ctggcggaca gagcagcgaa cttgatctgc 3383581 tgattgcacc gcacgcaagg gttcggagtt tccccgcggg catacgacga cacgaagtcg 3383641 ttgatcacgt cctctttgaa cttctctgcg aaatcccaaa cataaaacgg gattccgagc 3383701 acatcggcga cgcggcgcgc gtctgcagcg tcctctttgg aacaacagcc ccgcgagccg 3383761 gtgcgcagcg tgccgggcgc ggtcgatagc gccatgtgca ctccgaccac ctcgtgtccg 3383821 gcatcgacca tgcgggcggc agcaacagac gagtcgacgc caccgctcat cgcggcgaga 3383881 actttcatcg ggatgctccc gcggcggcta gggcggcccg ccgtgcacgt gccaccgccc 3383941 cgggaagcac ctccaacgcg gcatcgacat cagcctcaac actggtgtgc cccagcgaga 3384001 gacgcaatga tccgcgggcg ctggccgcgt cgacgcccat tgcaatcaac acatgcgagg 3384061 gctgcgctac acctgccgtg caggccgatc cggttgagca ctcgattccg ttagcgtcca 3384121 acaacatcaa cagcgcatcg ccttcgcagc cacggaaagt gaagtgcgcg ttacccgcta 3384181 gccgcatcgg gtcatcggcg ccgttaaggc aaacatcgtc aatctcagcc agcacaccct 3384241 cgaccagacg atcccgcagc agccgtaacc gcgcgctgtt ttcctcgagt ccgtccaccg 3384301 cgatctgcgc ggccgtcgcc attccaactg cactggcgac atcgggtgtg ccggaacgaa 3384361 tatcgcgctc ctgcccaccg ccgtgcataa ggggcacgca ggtgacgtcg cggcgcagca 3384421 gcaacgcacc cactcctggc gggccaccga atttgtgccc ggccacgctc atcgccgaca 3384481 gcccgctggc cccgaagtca agcgggagct gtcccaccgc ctgaatggca tcactgtgca 3384541 tcggcacgcc gaattccatg gcgacaactg acatttcggc gatcggtaga atagttccga 3384601 cctcgttgtt ggcccacatc accgatacca gcgcgacgtc gtcgtggctc tgcagtgcct 3384661 cgcgcagcgc agttgccgac accgagccgt cggcggcggt cggcagccag gtcacatggg 3384721 cgccttcgtg ttccacgagc cagttcaccg agtccagtac ggcgtggtgt tccacctcgg 3384781 tggtgacgat gcgacggcgg tgcggctccg catcgcggcg tgcccaatag atacctttga 3384841 cagccaggtt gtcgctttcg gtgccgcccg cggtgaagat cacctcggac ggacgagcgc 3384901 ctagcttgtc cgcgatcagc tcacgggcct cctcgatccg ccggcgcgcc gagcgcccgc 3384961 tggtgtgcag cgacgacgca ttgccgatgg tgcgctgcac ggccgccatc gcctcgatgg 3385021 cggcggggtg catcggggtg gtggcagcgt gatccaggta ggccatgacg cacctagaat 3385081 actggcccgg gcggcgacgc agaacgtgcg cgcaggccac ggccgcagca gcggctgggc 3385141 aatctggctg gggccagacc acttaggtcg ccggcacgtg ccggcggcct gggcgttgcc 3385201 ccgactgccc caaggctccc gcaagcaccg ctgactggca acggcgcgcg agattccgac 3385261 gatcggtacc tggcagctgc aaagactcga cgcgcaccca ggccagcgtg cggcgcacgg 3385321 tcagcagccg gcaaaccgac cgcaccaagg tgtcgtcgcc gacaaaggcc ggagcggtcg 3385381 agacggtgcc gtcgacgtgg tgatatgtca accggagtgg ctgcaccggg cggccggcat 3385441 cgattgcggc ctggaacatc gccggataga aagccccaca accgcgatgc gagcacccgg 3385501 ctcctgctcg ggccgctgga cggccggcat cgtcgcccgg ccgaccgcac caggtggtgc 3385561 cctcggggaa ggccaccacc gtctgaccgg cgcgcagccg acgcgcgatg gtatcgacaa 3385621 ccccgggaag ccgccgcagg ctggctcgct cgatcggaat gatcttcaga atgcgcgcca 3385681 cgatccctat agtccgtccg gtgaacatgt cggcgcgcgc gacgaacgac ccgggcaaca 3385741 ccgaaccgat gcagaagacg tccaaccagg acacgtgccc gctgaccacc aggactccgc 3385801 gcaggttccg aactggacta cccgacaccg tgatccggac accgaaaagg cgcagcacca 3385861 accggcagta gatgcgttgc acccgcgttc ggcccggcag tggcatcacc accagcggca 3385921 ctcccggtac caggagcaga gccaacatga cgcgaagcgc tacccgcagc accaccagcg 3385981 gccgccgcac ctgcgcagcg tcgccgacac tcacgcagct gacgccgcac gttgcgcggg 3386041 gcaaccagga gtgttcggtg actgcgggag cgctcatcgc gcgtcgttca ccatttccga 3386101 ggccgccgca accgaccgca gtcgtcgcag atatcgcgta tcggcgtggt ccttatccag 3386161 tagcaggcag aagtcgccca cgccaaagtc cgggtcgtgc gccggctccc cgcaggcccg 3386221 cgcgcccagt ctcaggtaac cgcgcatcag cgggggaact gctggccgtg gcggagggag 3386281 aatgtcgtcg agggacctcc cgtccacgcg caccggccgg taggggtaca cctggcactg 3386341 cggcggcgcg gcatgccggt tgaggatgaa gtcgcgcacc ccacgcagcc ggctgcccgg 3386401 cgtttcaccg tctcccccga ttggtactga cacacatccg gtcacatagt catagccgta 3386461 tcggtccagg taggccagga tgcccgccca catcaacaac accaccccac cgttgcggtg 3386521 accctcgcgc accacggcgc ggcccatctc caccaacgac ggccgcagcg gatcgaacgc 3386581 gcaaacgtcg aattccgttg cggtgtagag tcctccggcg gcgatggcac ccgccggtgc 3386641 cagcatccgg tagcaaccca ccagctcacc ggtgtcgtcg tcgcggacca gcaggtgatc 3386701 gcagtactcg tcgaaccggt cgccatcccg gcgcgtatcc gcggccgccg gcagtgcgaa 3386761 gcctggcgta gtgctgaaca cgtcatagcg gagccgctgc gccgcctcga ccatgctggg 3386821 atcggtggat agcaacaggg aatagcgcgg tccggttgac gatcctgtcg cgacgccatg 3386881 cggtttgtca ctgggtatca gcacagaagc gatgctcata gcaccaacgt ggcgcagccg 3386941 atcagctaat cggcatcaac gttgtgacgt gtcggtgcac gtcagatgac gaactgttgg 3387001 gctaggtgag caggcgccaa ggccccccac gcctcggcgt gtcggggtct tttgcgactg 3387061 ctcgcgcagg gaacctagcc cttgcgggcc ttgatggcct cggtcagctg cggagcgacc 3387121 ttgaacaggt ctcccaccac cccgtagtcg gcgatctcaa agatcggcgc ctcttcgtcc 3387181 ttgttgaccg cgacgatggt cttggacgtc tgcatgccag cgcggtgctg gatcgccccg 3387241 gagatgccca gggcaatgta gagctggggc gacaccgtct tgccggtctg gccgacctgg 3387301 aactggcccg ggtagtagcc ggagtcgact gcggcacgcg aggccccgac cgcggcgccc 3387361 agcgagtcgg ccagcgcctc gaccacgctg aagttctccg cgctgccgac accacggcca 3387421 ccggccacca caatggtcgc ctcggtcagc tccggccggt cgccggcgac cgccggttcg 3387481 cgcgcggtga tcctggcggc gttctccgcc gcagccggca cttccacgct gacctgctca 3387541 ccggcgccgg cggccggctc cgcctccacg gctcctgcgc gcacggtgat caccggggtg 3387601 tcgccgttgg cctgcgcttc gacggtgaac gccccaccga agatgctgtg gacacccact 3387661 ccaccttctc tcacgtcgac cacgtcgacc agcagacccg agccgatccg agccgcaagt 3387721 cggccggcga tctccttgcc gtccgcggtg gcggcgatta gtacgccggc aggggccgag 3387781 gactcggcca gcccggccag cacgtcgacc gccggggtga tcaggtattt gtcgacaagg 3387841 tcggactcgg cgacgtagat cttggcggca ccagccgcct taagcccgtc caccagcggc 3387901 gcggccgtcc ccggcacacc gacgacgacg gcggctggtt cgcccaaggc gcgggcggcg 3387961 gtgatcaatt cggcgctgac cttctttaac gcgccttcag cgtgctcaac gagcaccagt 3388021 acttcagcca tgggttatat cgctctcgtc tttgggaggt gcgtatgtct tagatgattt 3388081 tctgggcaac caggtactgc acgatctggt tgccgccttc accctcgtcg gtgaccttct 3388141 ccccggcagt cttggccggt ttgggcgtcg acgccagcac ggtggatccg gcgttggcca 3388201 gccccacctc gtcgctctcg acaccgatct cggccagggt cagcacggta acttccttct 3388261 tcttggcggc catgatgcct ttgaaggacg ggaagcgcgg ctcgttgatc ttctcgttca 3388321 cgctgatcac cgcgggcagc gtggcctcga gggtgaatac gccctcatcg gtctcacgct 3388381 cgccggtgat cttgccgccc tcgatcgaca ctttgcgcag gtgggtgagc tgcggcaggc 3388441 ccaggtactc ggcgatgatg gccggcaccg caccgcccac cccgtcggtc gattcgttgc 3388501 ctgcgatcac cagctcggtg ccctcgatgg tgcccaacgc gcgcgccaaa gcccacccgg 3388561 tttggatgac gtccgagccg tgcatgccgt cgtcctttag gtggacggcc ttgtcggcac 3388621 ccatcgacag cgccttgcgg atcgcctcgg tggcgcgctc ggggcccgcc gtcagcacgg 3388681 ttaccgaccc ttcgatgccg tcggcggcct ctttctcccg aatctgtagc gcttcctcca 3388741 cggcgcgctc gttgatctcg tccagcaccg cgtcggcggc ctcgcggtcc agcgtgaaat 3388801 cgccgtcggt cagcttgcgc tccgaccagg tatctgggac ctgcttgatc aggaccacga 3388861 tgttcgtcat gactgtggtt cgtcctcctc gaaggcggcc cgcagcgctc gactgcggaa 3388921 cctcggtcac acgttttgca accgcacagc gatattacta ttcggtaagt tcgcgtggtg 3388981 cgccctcaca ccatagcggg tggtagagca ggttcccacg cctgtgcctc gcccacgacc 3389041 ggcggatact cccggtgccc ggttcgcgaa tccgatgcca cgggttagcc tgccttaaca 3389101 atgtgcgcat tcgttcccca cgttccccgc catagccgag gcgacaaccc gccgtcggcc 3389161 tccacggcta gccctgcggt gttgacgctg accggcgagc gcaccatccc cgatctggac 3389221 atcgagaact actggtttcg ccgccaccag gtcgtctacc agcggctggc accccgctgc 3389281 acggcccgcg acgtgctgga agccggctgc ggcgagggat atggcgccga cctgatcgcc 3389341 tgcgtcgctc gccaggtcat cgcggtggac tacgacgaga ctgcggtggc ccatgtccgg 3389401 agccgctatc cccgagtgga ggtgatgcaa gcaaacctgg ccgagctgcc attgcccgac 3389461 gcgtcggtag acgtcgtggt caacttccag gtcatcgagc atctgtggga tcaagcccga 3389521 ttcgttcgcg agtgcgcccg ggtactgcgg ggctcgggac tgttgatggt gtccaccccc 3389581 aaccggatca ccttttcccc cggccgcgat accccgatca acccattcca cacccgcgag 3389641 ctcaatgccg acgagctcac ttcgctgttg atcgacgcgg gattcgtcga tgtggccatg 3389701 tgcgggttgt ttcatggccc acgcctgcgc gacatggacg cccgccacgg cggctccatc 3389761 atcgacgcac agatcatgcg ggcggtggcc ggcgcaccgt ggccacccga gctagccgca 3389821 gacgtcgcgg cggtcaccac cgccgacttc gagatggtgg cagcgggtca cgaccgtgac 3389881 atcgatgaca gcctggatct gatcgcgatc gcggtgcggc cttgaacacg tccgcaagcc 3389941 cggtgcccgg cctgttcacg cttgttctgc acactcacct gccctggctg gcccaccacg 3390001 ggcgctggcc ggtcggcgag gaatggctct atcagtcgtg ggcggcggcc tacctgccgc 3390061 tgctgcaggt gctggccgcg ctggccgacg agaaccggca ccggttgatc accctcggga 3390121 tgacgccggt ggtcaacgcc cagctcgacg acccatactg cctcaacggt gtgcatcact 3390181 ggctagccaa ctggcagctg cgcgccgaag aggccgccag cgtgcggtat gcccgtcagt 3390241 cgaagtcggc tgactatccg tcatgcacac cggaggcgtt gcgggccttt gggattcgcg 3390301 aatgtgccga tgcagctcgc gcgctcgaca acttcgccac gcggtggcgg cacggcggca 3390361 gcccactgct gcgcggcctg atcgacgccg gcacggtgga gctgctcggt ggcccacttg 3390421 cccacccgtt ccagccgctg ctggcaccgc ggctgcgcga gttcgcgctg cgcgaaggcc 3390481 tcgccgatgc tcagctgcgg ctggcgcacc gcccgaaagg gatctgggca cccgaatgcg 3390541 catacgcccc ggggatggag gtcgactacg ccaccgcggg ggtcagtcac ttcatggtcg 3390601 acggcccgtc gctgcacggc gacaccgcgc tgggccggcc ggtggggaaa accgatgtgg 3390661 tcgccttcgg tcgcgacttg caggtcagct accgggtgtg gtcaccgaaa tccggctacc 3390721 ccgggcacgc cgcctaccgc gacttccaca cctacgacca cctgaccgga ctcaaaccgg 3390781 ccagggtcac cgggcgtaac gtgccgtcgg agcaaaaggc accctacgat cccgagcgcg 3390841 ctgaccgcgc cgtcgacgtc catgttgccg atttcgtcga cgtggtgcgc aatcggctgc 3390901 tctccgagtc cgagcgcatc ggccggcccg cccacgtgat cgccgccttc gacaccgagt 3390961 tgttcggcca ctggtggtac gagggcccaa cctggctgca acgggtattg cgggctttac 3391021 ccgccgccgg tgtccgggtg ggcaccctga gcgatgcgat cgccgacgga ttcgtcggcg 3391081 acccggtcga attgccaccc agctcttggg gttccggcaa ggactggcag gtgtggagcg 3391141 gtgccaaggt ggccgatctg gtccagctca acagcgaagt ggtcgatacc gcgttgacca 3391201 ccatcgacaa ggcgctggcc cagacagcgt ccctggacgg accgctgcct cgcgatcacg 3391261 ttgctgatca gatcctgcgc gagaccctgc tcaccgtgtc cagcgactgg ccgttcatgg 3391321 tgagcaagga ctccgccgcc gactacgccc gctatcgtgc tcacctgcac gcacacgcca 3391381 cccgggagat cgccggcgcg ctggccgcgg gccgacgcga caccgcacgg cggctcgccg 3391441 aagggtggaa ccgcgccgac ggtctgttcg gcgccctgga cgctcggagg ctgcccaagt 3391501 gaacgcctcg cacaggcgga accggccgcg cgcatgagga tcctcatggt gtcgtgggag 3391561 tacccgccgg tggtgatcgg cggactcggc cgccacgtgc atcatctgtc gaccgcgcta 3391621 gccgcagccg gtcacgatgt cgtcgtgttg tcccggtgtc cgtcgggcac cgatcccagc 3391681 acacacccat cctccgatga ggtgaccgaa ggggtccggg tgattgcggc cgcgcaggac 3391741 ccgcacgagt tcacgtttgg caacgacatg atggcctgga ccctggcgat gggccacgcc 3391801 atgatccgcg ccgggctgcg cttgaagaaa cttggcaccg accgctcgtg gcgtcctgac 3391861 gtcgtgcacg cacacgactg gctggtggcc catccggcca tcgcccttgc ccagttctat 3391921 gacgtgccaa tggtttccac gattcatgca acggaggccg gtcgacattc cggctgggtc 3391981 tccggagctc tcagccgtca ggtgcacgcg gtcgagtcgt ggctggtgcg tgaatccgat 3392041 tcgctgatca catgctcggc gtcgatgaac gacgagatca ccgagctgtt cgggcccggg 3392101 ctggccgaga tcaccgtgat ccgtaacggc attgacgcgg cgcgctggcc gttcgcggcc 3392161 cgccgcccgc gcaccgggcc agccgaattg ctctatgtgg ggcggctgga gtacgagaag 3392221 ggcgtgcacg acgccatcgc cgcgctgccg cggctcaggc gcactcaccc aggcaccaca 3392281 ctgaccatcg ccggcgaagg cacccagcag gattggttga tcgatcaggc ccgcaaacac 3392341 cgggtgctca gagcaaccag gttcgtcgga cacctcgacc acaccgagct gctggcgttg 3392401 ctgcaccgag ccgacgccgc ggtgctgccc agccactacg aaccgtttgg gctggtggca 3392461 ctggaggccg ccgcggccgg caccccgctg gtgacgtcca acatcggcgg tctgggtgaa 3392521 gcggtcatca atggacagac cggggtgtcg tgtgcacccc gcgacgtagc ggggctggcc 3392581 gccgcggtgc gtagcgtgct cgacgatccg gccgccgcgc agcggcgcgc acgagccgcc 3392641 cggcaacggc tcacctccga cttcgactgg cagacggtgg ccaccgcgac cgcgcaggtg 3392701 tacctggcgg cgaagcgcgg tgaacggcag ccgcagcccc ggttgcccat cgtcgagcac 3392761 gctcttcccg atcggtagcc gtggcaggga cgtgatgatc ggagcaccgc agtgaaaccg 3392821 caggaccagg ggctccactt cccctatcgc tacgaccttc gactggcgcc tatgtggcta 3392881 ccgtttcgat ggccgggcag ccaaggcgtg accgtgaccg aggatggccg cttcgtcgca 3392941 cgctacgggc cgtttcgcgt cgaggcgcca ctgtctagcg tccgcgatgc gcacatcacc 3393001 ggcccatacc gatggtggac agcggtgggc ccccgactgt cgatggtcga cgacggactc 3393061 acgttcggaa ccaacgcagc tgccggtgtc tgcatccact tcgagccgcg gatccaccgc 3393121 gtgattggac tgcgggacca ttcggcgctg acagtgaccg ttgcggaccc cgaagggctg 3393181 gtcgccgcgc tcagcagcta gttcgccgag cgccccgtgc tgggcacaac ccgactcggc 3393241 ctggaggcgc ctgcatccaa gccgcaccgg cgcacaatta tctgccggag gtcaaccccc 3393301 tttatcgatt cggtatcgaa gacgccgttt gacatgccat gatcggcgaa ttcgcagttt 3393361 cagatgccag ggaggcgaca tggctcactc gatcgttcgc acgctgctgg cctcaggtgc 3393421 cgccacggcc ctgatcgcca ttcccacagc ctgctcgttt tcgatcggaa cgtcgcactc 3393481 gcactcggtg agcaaggccg aggtcgcccg gcagatcacc gccaagatga cagacgccgc 3393541 cggcaacaag cccgaatcgg tgacgtgccc aagcgatctc ccggcagagg tcggggccga 3393601 gctgaattgc gaaatgaaga tcaaggaccg cacgttcaac gtcaacgtca ccgtgaccag 3393661 tgtcgacggt agcgacgtca agttcgacat ggtggagacc gtcgacaaga accaggttgc 3393721 caacatcatc agcgacaaac tgttccagcg ggtgggcgcc aggcccgatt cggtgacctg 3393781 ccccgacaat ctaaagggcg tcgagggagc caaactgcgg tgtcgactga ccgacggcag 3393841 caaaacgtat ggcatctcgg tgattgtcac cagcgttgac gccggcgatg tcaacttcga 3393901 tttcaaggtc gatgaccacc ccgagtaggc tcaccgtgga atcggctgcc cggcagccaa 3393961 tttcgcgtac ccgatgtgga tggtcgccgg agcaccatcg gctttagggt gctcggggct 3394021 agcgggccgc cttcttgcgt tcgatgtcgg ccaaggcagc ggctagctca gcgcgctgcg 3394081 cggccgatgc ctcccaggac agctggcggt tcttgaccac cttggccggc gccccgaccg 3394141 cgatcgaata gtcgggaatt gcgccgcgga ccaccgcgtg cgagccgagc acgcagcccc 3394201 gtccgatggt ggtgccgcgc agcacgctca ccttcacgcc gatccaggtg tcgggcccga 3394261 tccgcaccgg actcttgatg atgccctggt ctttgatcgg cagcgtgatg tcgtccatcc 3394321 ggtggtcgaa atcgcagata tagcaccagt cggccattag caccgagtcc ccgatctcga 3394381 tgtcgagata ggtgttgatg acgttgtccc ggcccagcac caccttgtcg ccgaaccgca 3394441 gcgagccctc gtgggcacgg atcgtgttct tgtccccgat gtgcacccag cggccgatct 3394501 ccagttgcgc tagttccggt gtcgcgtgga tctccacacc cttgccgaga aacaccatgc 3394561 cgcgggtgat gatgtgcggg ttggccagct tgaacctcaa cagccgccag tagcgcacca 3394621 ggtaccacgg agtgtaggcg cggttggcaa gcacccattt cagcgatgcc agcgtgagga 3394681 acttggcctg acgtgggtcg cgcagccgcg atcctcgcca cctgcggtga agcggagcac 3394741 cccacatggt tgtcattggc gcagagctta gcttagctgt cggacctgtt tgggcgtatc 3394801 ggcgcatctg agaatgcgca tcggcgcgcg aggtgacgcc ggtggccgcc cccgcggggg 3394861 cggtgatcgg cacccggccc cacaccacac ccgatgacga gcccaaactg aggacgttca 3394921 cgacacttac accacgtaca cgacacgccc acggacaacc gggaaccgcc accggccaag 3394981 gacgcgagga accgaatctc gcccgccttg ccagaatgta cgtggtgacc cgagccgggc 3395041 aaccattaca cagttggcca gcactgacca acttcatttg tagcggttac cctcacctgt 3395101 actcattcgg ccgggcccgc cgatgagcga cccacgtagc ggaaggatct gggaacctgc 3395161 gaaaggataa ggcgcttgcg cgacgccttc cggcggccgt tgcggccgcc gtaatcgcgg 3395221 tcgagctggg cggttgcgga agtgccgact cgtgggtaga agcggccccc gcacaaggct 3395281 ggcccgcaca atacggcgac gccgccaaca gcagctacac cacgacgaat ggcgccacca 3395341 atctcacgct gcggtggacg cgttcggtca aaggaagctt ggctgccgga ccagccctga 3395401 gcgcacgcgg gtacctcgcg ttaaacgggc agaccccggc cgggtgttcg ctgatggagt 3395461 ggcagaacga caacaacggc cggcagcgct ggtgtgtgcg gctggtccag ggcggcggct 3395521 tcgccggccc gttgttcgac ggcttcgaca acctctacgt cggccagccg ggagcgataa 3395581 tctcctttcc gccgacccag tggacgcgct ggcgccagcc cgtgatcggg atgccgtcca 3395641 ccccgcggtt tctggggcat ggccgcctgc tcgtgagtac acacctgggg cagctgctgg 3395701 tattcgatac ccgccgcggc atggtggtcg gcagtccggt ggacctggtg gacggcatcg 3395761 atcccaccga tgcgacacgc ggactggccg actgcgcgcc agcccggccg ggctgcccgg 3395821 tcgcggccgc ccctgcgttc tcgtcggtca acggcacggt ggtggtcagc gtctggcagc 3395881 cgggcgaacc ggccgcgaag ctggtcgggc tgaaatacca cgctgagcaa ctcgtccgcg 3395941 agtggaccag tgacgctgtc agcgcgggcg tgctggccag cccagtgctc tccgccgacg 3396001 gatcgacggt ctacgtcaat gggcgcgacc accggctatg ggcactcaac gccgccgacg 3396061 ggaaagcgaa gtggtcagct cccctgggct ttctggcgca gacgccgccc gcactgaccc 3396121 cacatggact gatcgtgtcc ggcgggggcc ccgacaccgc gctggcggcg ttccgggatg 3396181 ccggtgatca cgccgagggg gcctggcgac gcgacgacgt tactgcgctg tcgaccgcga 3396241 gtctggccgg caccggcgtc ggctatacgg tcatcagcgg tccaaaccac gatggcacgc 3396301 ccggtttgtc gttgctggtc ttcgatccgg ccaacggcca cacggtcaac agctatccgc 3396361 tacccggagc gaccggatat cccgtcggtg tatcggtcgg caacgaccgc cgcgtggtga 3396421 ccgccaccag cgacggccag gtctacagct tcgcacctta gattgccagc ggcggaatgg 3396481 cgctgcgcgg cacctgggct tggcaagcgc cgacaaacga cggcagcagc tcaccctggg 3396541 cgaagtagaa aatcagactg tcgtcggtga tagcaaagtt ctggtagtga gccgggtcga 3396601 ggccggtcga aggcaatatc gcggcaccga aaccggtctg acgtgccagc tcgcgctgaa 3396661 cgatggggta gatgctgtcc agtggcgtgg tgccgggcac gaacaacgtg tcgaaggtga 3396721 tgggctgcga ggtcgcgagg ttgtagttga aggccttgta ccaggtggac ggatgtgccc 3396781 caccgaggtc ctggaagaat ttgagcacta cgctgcgggt ggcctgcggc ggctggccgg 3396841 agctgtgctg ttcgctggtg gcgtccattt ggtagggctg gtctcgcagc ggggacccct 3396901 gcgcgacgtt gacgaacccg tcgcggtttt gcgtgatgta gtcggtcagc gcctgctggt 3396961 cgggatagtc gacaggaaat gtcatatcca gcatgtactt agggcccgag gcgtgcacat 3397021 ggcagatctg gccggcctgc acagtgccgc ccaggccggc gcatgacggc ggcgcaccag 3397081 ccgccggcca gcccaccagg accacagcaa cgagcactgc ggtcgctatc agataacgca 3397141 tcgtctaatc gtcctcgcag accaaaggcg ttggcgcagc ttaggggtga ccgccgccag 3397201 cctacccgtg ccgctaccgg gacggccgac acacataggc ggtcacatgg ctcaaggacc 3397261 cggcaccaat gcgggtgatg accaccgcca gcggtctgct gccccgcagc cggagccgtc 3397321 gccgcagagc gtcgggatcg atcgcaacgc cgcgcaccag gatttcggct gccccgcaat 3397381 ccagcgctga cagcacctga cgcagccgac gctcgtcgaa ggccagctgc tcgagcacct 3397441 cgaacccgcg caacgcaggc ggcagccggt caccggacag gtaagcgatt tggggatcga 3397501 gctgccacag cccatgccgg gcgccgtagt tgcgtaccag gccggcacgg acgacggcgc 3397561 cgtcggggtc gacgatccat ttcccggcgg gccgcacacc gcagtcgtcg ggctcgtcgt 3397621 caccgatttg ttcaccggaa tcgaggatgc tggctcgacg gcggataccc gatccggcca 3397681 acccggccga ccaaagacat gcttctcgaa ccccaccgcg gtatgagatc acctcgatct 3397741 cgccctcgaa accgagccgg cccacctcct cgaaatctat tccgggagcg cacttgacga 3397801 ccacatcacg gccgcggtag cggtccagta gggggcccag gccgggctgg tagtcggcga 3397861 ggtggaagcg tcgccgcccg ttgctgcgac gcgccgggtc gatgacgacg accgcgtcgc 3397921 gggtcaccgg atgcagcaca tcggcgcggc acaggtcagc ttccattccc agggcggcca 3397981 ggttgtggcg cgccatggcc agccgcaccg ggtcgatatc gctgccgacc gcccggacag 3398041 ctagctcgcg cagcgcggcc agctcggtgc cgatggagca ggtcgcgtcg tgcactaccc 3398101 gaccggccag tcgcctggcc cggtgccggg ccacgggtgc tgcggtagcc tgctgcagcg 3398161 cctcatcggt gaatagccat tgcgacaccc caacgttcgg acacagctcg cccagtttgc 3398221 cggcggcgcg gcggcgcagc agcgtggtct ccaccagcca cggcgcccga tcgccaaacc 3398281 gggcgcgcac cgcggcggtg tcggcaatgc gagtggcagc ggtcagctcg agctctgcga 3398341 ccgcggccag cgcaaccgca cccgattctg accgcagata gctgacgtcg gcggtggtga 3398401 acgtgagacc agtgaagccg ctggtcagga cggcttaacc ccggtaatca tcacgttgta 3398461 gaaccagccc ttcggcacca catgccgcca gacgttggcg tccacccagc ccagagtctt 3398521 ccagctggtg aaggcgaagc gcgcccaacc ccagcccagc cgccctggcg gcaccgtgca 3398581 ctcaaacgtg cgcaacggcc aacccagcat cgccgcggtg aactcctcgg tggcggtttg 3398641 gacctcgact gcacccgcgt tgtgcgcgat ccgctgcagg tcctggggcg tgaatgtgtg 3398701 caggtcgacc agggcctcca gcgccgcggc gcgcgaggac tcatcgagct cgccttgtgg 3398761 tcggcgccag cctctcaggc cgggcagctt ggtagcgttg gtgacgacac gccaggtcag 3398821 cgtggacagt gtgcgagcgt agccgtcgcc gacggtggtc ggctcgccgg cgaacacgaa 3398881 gcgcccgccc ggcttgagta cccgaaccac ctcccgcaac gacagctcga cgtcgggaat 3398941 gtggtgcagc accgcatgcc cgaccacgag gtcgaaagcg tcgtcgtcgt acgggatgcc 3399001 ctcggcgtcg gcgacccggc cgtcgatgtc tagccccagc gcttgcccat tgcgggtggc 3399061 gaccttgacc atgccgggtg agaggtcggt gaccgatcca cgccgggcaa cgccagcctg 3399121 gatcaagttg agcaggaaga atccggttcc acagcccagt tccagtgcgc ggtcgtaggg 3399181 cagctgcgcg atgacctcat caggcacgat cgcgtcgaac cggccgcggg cgtagtcgac 3399241 gcaacgctgg tcataagaga tcgaccactt ctcgtcgtag ttctcggctt cccagtcgtg 3399301 gtagagcacc tgggcgagct tgctgtcgtg ccgagctgcg gccacctgct cggccgtggc 3399361 atgtggattg ggagtggcgt cggcggggat gtttgaactc ctcgtcatat aggcgagcct 3399421 aacggccgcc ccggtcaccc ttgctgccac caccttgacc agcggcgaac acctcgacat 3399481 agcgccgacg ctcagcggcg atccgctcgg ccggcgccag ctcgtagacg tcgctgatcc 3399541 cggctttggc cgcggccagc gcgtgcggcg ggccgtcaag aaagcgcctc gcccaggccg 3399601 ccgcggcgtc gtaaacgtcg tcgggggcca ccatgtcgtc gatcaggccc agcgccaagg 3399661 cctcctcggc gtcgaagaag cgcccgctga acaccagctc cttggctctg ctcggaccgg 3399721 ccgcacgggt cagccgggcc attccgtcgc cgctggggat caggccggcc aggatctcgg 3399781 tcgcgccgaa tttcacgttg tcaccgctga ctcgccaatc ggcggctagg gccagcgtaa 3399841 ggccggcacc caacgcgtat ccggtgatgg cggccacggt cggcttgggg atcgccgcaa 3399901 cggcgtcgac ggcctgctgc cgaatccggg cggcggtgtc ggcctcctgc gcgctcaatg 3399961 tccgcagttc gggcatgtcg tcgccggcgg agaagatttc gtggccgcca tacaggatca 3400021 ctgcggccac gtcgtcgcgt cgccccagct cgttggccgc ggcgaccact tcccggtaga 3400081 cctggcgggt catcgcgttg gtaggcggtc gcgataggag caacatggcc aggccggcat 3400141 cctgggagcc gtcactgacc acgacgttga cgaactcggg caccgttggc gtcacagcgg 3400201 ccacccggtc gatccgcccg atctccgggc ctggttatag cggtcggaat cgaagaactc 3400261 gatctcccag ttgtcgccgt tgcgggccag ctgtggctgc accgggacga tttggcgttc 3400321 gacggccagc acgtcggcaa cggtatgacc ggccagcgag tccagttgcg tccaggtcgg 3400381 cggcagcaag aagttgcggc cggcggcgaa gtcggcgata gcgtcggctg gcaacaccca 3400441 accagcccgg tcggattcgg tgttctcgcc gtcggcgcgc tgaccttcag gtagggcacc 3400501 cacaaagaag taggtgtcgt agcgccgggt cagttcggcc tccggggtga cccagttggc 3400561 ccagggccgt agcaggtcgg atcgcagcac cagcttttcc cgctgcagga agtccgcgaa 3400621 ggacagcgtc cggtcggcca gtgcgcgacg cgcgtcgccg tacaccgagg catccgagac 3400681 gatgctgttc ggtgccgaat ggtcctgatc gaccggcccg gcgaatagca cccccgactc 3400741 ctcgaacgtc tcgcgggccg ccgcgcagac caaggcttcg gcgagatcag gctcgatgcc 3400801 gaaccgctgc gcccaccact gcggcggcgg accggcccat gcccccagcc ggcccaagtc 3400861 ggcgtcgcgg tcgcggtcgt cgactccccc gccgggaaac accattaccc cggcggcgaa 3400921 atccatcgca gcgtgccgcc gcatcaagaa gacggccaga ccggacgctg atccggcgtc 3400981 cgggtcgcgg accaacatca cggtcgccgc cggcctcggt gtaggcgggg gtaccagtgg 3401041 ctcgcgaggt gaattcatga ctgtctccga tgggctgctc ggctgcgacg ccgtcgggcg 3401101 aaatatcgcc cgtcggccac ctccagcgtg atctcctggc caaacgcggt ggacaggttc 3401161 tcggcggtca gcgcgtcggg aagcaagccc gcggcaacca cccgggcctc cgacagcagc 3401221 aggcaatggc tgaagccggg cggaatctcc tcgacgtggt gggtgaccag aaccagcgcg 3401281 ggcgcgtcag ggtcggctgc caggtcggcc agccgggcga ccaattcctc tcggccacct 3401341 aagtccaggc cggcggcggg ttcgtcgagc agcagcagct ctggatctgt catcaaagcc 3401401 cgcgcaatca gcactcgctt gcgctcgccc tccgacagtg ttccgtatgt gcggttggcc 3401461 aaatgctcag cgcccaggct ctccagcatg tcgatcgcgc ggtggtagtc gacggcctcg 3401521 tagcgctcgc gccaccggcc caacactgca tagccggcgg agacgacaag atcgcggacg 3401581 cgttcgtcgc cgggcacccg ctccgccagc gccgaggaac tgagcccgac ccgagcacgc 3401641 agttccgaga cgtcaacccg gcctagccgc tcaccgagca caaaggccac ccccgacgac 3401701 ggatgctcag ccgcggcggc aatgcgcagc aatgacgtct tgccggcccc gttggggccg 3401761 acgatcaccc agcgttcgtc gagttcgacc gcccaatcca gcgggccgac cagcgtgcgc 3401821 ccattacggc gcagggacac gtttcggaag tcgatcagca ggtcggggtc agccgcatca 3401881 gggccgccgt tgtcgagcac ccgactatcg tgccgcatgc tccgcgagca acctagtcgg 3401941 ccgggatttc gacgcgacgc accccacagt cgccggcgtc ggcggcctcg atctcaccgc 3402001 gagtcacgcc caacaagaac aacaccgtgt ccaggtacgg atggctcaac gacgcatcgg 3402061 cgacctcacg caacgccggc ttggcgttga atgctattcc cagcccggcc gcgcccagca 3402121 tgtcgatatc gttggcgccg tcgccgaccg cgacggtctg ctccatcggc accccatact 3402181 ggctcgcgaa gtcccgaagc gccttggcct tgccgggccg gtcaacaatc ggccccacga 3402241 cccggccggt aagaatgcca tcgacgattt ccagctcgtt ggacgcaacg aaatccaaca 3402301 tcaactcgcg tgcgagcggc tcgatgatcc gccgaaagcc gccggaaacc acaccgcagc 3402361 gaaaacccag acgccgcaag gtccggatcg tggtccgagc accgggcatc agttcgagct 3402421 gctcggcgac gtcgtctatc accgtcgcgg gcagccccgc caaggtggca acacgacgct 3402481 gcagcgactc ggcgaagtcc agctcaccac gcatcgcggc ctcggtgatc gcggcgacct 3402541 gtccctgggc acccgcacgg gctgccagca tctcgatgac ctcgccttgg accagagtgg 3402601 agtcgacgtc gaagacgatc aggcgtttgg tgcgccaagc caagccgtag tcctcgacgg 3402661 ccacatcgac atgctcttcg gcggccacct tggtcagggc gatctgcagc ggacccacgc 3402721 atccaggcgg caccgagacc cgcaactcca ggccggtgac cgggtagtcg gaaatgccgc 3402781 ggatgaagtc gatgttgacg ccgagtgcgg ccactcccct ggccaccgcg ctgaacgctc 3402841 cggcggtaat cgggcgtccc agcacgaaaa tggtgtgggt ggacggttgc cgaatgattg 3402901 gcagatcgtc gctgcgctcg atggcgacgt ctagacccac cccgtggatg gcggccgcga 3402961 cgtcgtcgcg cagcgcggta ccgtcggcaa cgtccagcgg gcacgacacc agcacaccca 3403021 gcgtgagccg gccccggatc accacttgtt cgacgttgag cagctcgact ccgtgctgcg 3403081 cgagcacctc gaagagcgcg gatgtcacgc ctggctgatc catgccggtg accgtgatca 3403141 gcaccgacac cttggctggc atgctcacct tcagatgggg cccaaccggt acggccccat 3403201 cagctcgaag tcaccatggc gggctcgtca tgatgacgcc cgacgtgggc ctcggcgcga 3403261 aggcgttcca ccatatgcgg gtagtgcagt tcgaacgcgg gacgctcgga gcggatgcga 3403321 ggcagctcgg tgaagttgtg ccgcggcggc ggacaacttg tcgcccactc cagcgaattc 3403381 ccgtaccccc acgggtcgtc gacggtgacc acctcgccgt agcgccagct cttgaagacg 3403441 ttccacacga acgggaacat cgacgcaccc aggatgaagg ccccgatcgt cgagacgacg 3403501 ttgagaccct ggaagccgtc ggtgggtagg tagtcggcgt agcgacgcgg cataccctcg 3403561 tcgcccaacc agtgctgcac caggaacgtg gtgtgaaaac cgatgaacgt caaccagaag 3403621 tgcagtttgc ccaaccgctc gtcgagcagc cggccggtca tcttggggaa ccagaaatag 3403681 atgccggcga aggtggcgaa cacgatcgtg ccgaacagca cgtagtgaaa gtgtgcaacc 3403741 acgaaatagc tatcggtgac gtggaagtcc agcggcgggc tggccagcag cacaccggtg 3403801 agtccgccca gcaagaaggt gaccatgaag cccaccgaaa acaacatcgg ggtttcaaag 3403861 gtcaattgcc ccttccacat ggtgccgatc cagttgaaaa acttgatccc ggtcggcacc 3403921 gcgatcagat acgtcatgaa agagaagaag ggaagcagga cggctccggt cgcgaacatg 3403981 tggtgcgccc ataccgcgac cgacaacgcg gcgatcgaca gcgtcgcata aaccagcgtg 3404041 gtgtaaccga agatcggctt gcgggaaaac accgggaaga tctccgagac gatcccgaaa 3404101 aacggcagcg cgatgatgta gacctcgggg tggccgaaga accaaaacag gtgctgccac 3404161 agcaggactc cgccattggc ggcgtcatag atgtgagctc ccagatgccg gtcggcggcc 3404221 agcccgaaca atgccgccgt gagcagcggg aacgcaatca atatcaggat ggacgtcacc 3404281 atgatgttcc aggtgaagat cggcatccgg aacatcgtca tcccgggtgc gcgcatgcac 3404341 accacggtgg tgatcatgtt gaccgcgccc aggatcgtgc ccagacccgc aacgatcaaa 3404401 cccatgatcc acaggtcgcc cccggcgccg ggcgagtgaa tggcgtcggt cagcggcgtg 3404461 taggcggtcc acccgaagtc cgcggccccg cccggagtga tgaagccggc tgccccgatg 3404521 gtggcgccaa atacgaacag ccagaacgaa aaggcgttca gccgggggaa ggccacgtcg 3404581 ggtgcgccga tctgcagcgg cagcaccagg ttggcgaaac caaacacaat cggcgtggca 3404641 tagaacagca gcatgatcgt gccgtgcatg gtgaacaact ggttgaactg ctcattcgac 3404701 aagaactgca gaccgggtgc ggccagctcg gtccgcatca acaacgccag caggccaccg 3404761 atgaagaaaa agcttatgca cgcgacgcag tacatgatgc cgatcatctt gtgatcggtg 3404821 gtggtgatca gcttgtagac caggctcccc ttgggaccgg tgcgggccgg gtaaggacga 3404881 atggcttcga gttctcccag cgggggcgct tcggctgtca acgcactcct ccaaacatcc 3404941 agcccggacc gggccaaaac ccagtattga gaggcatctt agccctcgat caggctggcg 3405001 gcaggcctgg tcctacaaac cgtcgtaaat gccagactcc gccggcgggc cgttgcagac 3405061 caacgctttc cgcccgcgcg aatcggggtc gacggctggc cgagtgctac cgtcgaacgc 3405121 gtgctgtccg gcgggatgcg atccactgtt gctgtcgccg tagcggcagc cgtgatcgca 3405181 gcgtccagtg gttgcggctc cgatcaaccg gcccataagg cgtcacaatc gatgatcacg 3405241 cccaccaccc agatcgccgg cgccggggtg ctgggaaacg acagaaagcc ggatgagtcg 3405301 tgcgcgcgtg cggcggccgc ggccgatccg gggccaccga cccgaccagc gcacaatgcg 3405361 gcgggagtca gcccggagat ggtgcaggtg ccggcggagg cgcagcgcat cgtggtgctc 3405421 tccggtgacc agctcgacgc gctgtgcgcg ctgggcctgc aatcgcggat cgtcgccgcc 3405481 gcgttgccga acagctcctc aagtcaacct tcctatctgg gcacgaccgt gcatgatctg 3405541 cccggtgtcg gtactcgcag cgcccccgac ctgcgcgcca ttgcggcggc tcacccggat 3405601 ctgatcctgg gttcgcaggg tttgacgccg cagttgtatc cgcagctggc ggcgatcgcc 3405661 ccgacggtgt ttaccgcggc accgggcgcg gactgggaaa ataacctgcg tggtgtcggt 3405721 gccgccacgg cccgtatcgc cgcggtggac gcgctgatca ccgggttcgc cgaacacgcc 3405781 acccaggtcg ggaccaagca tgacgcgacc cacttccaag cgtcgatcgt gcagctgacc 3405841 gccaacacca tgcgggtata cggcgccaac aacttcccgg ccagcgtgct gagcgcggtc 3405901 ggcgtcgacc gaccgccgtc tcaacggttc accgacaagg cctacatcga gatcggcacc 3405961 acggccgccg acctggcgaa atcaccggac ttctcggcgg ccgacgccga tatcgtctac 3406021 ctgtcgtgcg cgtcggaagc agccgcggaa cgcgcggccg tcatcctgga tagcgaccca 3406081 tggcgcaagc tgtccgccaa ccgtgacaac cgggtcttcg tcgtcaacga ccaggtatgg 3406141 cagaccggcg agggtatggt cgctgcccgc ggcattgtcg atgatctgcg ctgggtcgac 3406201 gcgccgatca actagtgagg cgcagcgcta ggctttggga tacccacagc taaaaagtta 3406261 atcaaagaaa cgaagagggt tgccatgagc actgttgccg cctacgccgc catgtcggcg 3406321 accgaacccc tgaccaagac cacgatcacc cgtcgcgacc cgggcccgca cgacgtggcg 3406381 atcgacatca agttcgccgg aatctgtcac tcggacatcc ataccgtcaa agccgagtgg 3406441 ggccaaccga attaccctgt ggtccctggc cacgagatcg ccggcgtggt gaccgccgtg 3406501 ggctcggagg tgaccaagta ccggcagggc gaccgcgttg gggttggctg tttcgtggac 3406561 tcgtgccgcg agtgcaacag ttgcacgcgc ggcatcgaac agtactgcaa gccgggcgca 3406621 aacttcacct acaactcgat cggcaaagac ggccagccaa cccagggcgg ctacagcgaa 3406681 gcgatcgtcg tcgacgaaaa ctacgtgttg cgcatacccg acgtgctgcc cctggatgtg 3406741 gcggcgccgc tgttgtgcgc gggcatcacg ctgtactcgc cactgcgcca ctggaatgcc 3406801 ggggcgaaca cgcgggtggc gatcatcggc ctaggcggac tgggtcacat gggcgtcaag 3406861 ctgggcgccg cgatgggcgc cgacgtgacg gtgctgtccc aatcgctgaa gaaaatggag 3406921 gacggtctgc gcttgggggc caagagctac tacgcgaccg ccgacccgga caccttccgc 3406981 aagctgcgcg gcggcttcga cctgatcctg aacaccgtct cggctaactt ggacctcggc 3407041 cagtacctga acctgctgga cgtcgacggc acactcgtgg aactgggtat ccccgagcac 3407101 cccatggccg tgccggcgtt cgcgctagcg ctcatgcgac gcagcctggc cgggtccaac 3407161 atcggcggga tcgccgagac ccaggagatg ctcaatttct gtgccgagca cggcgtgaca 3407221 cccgaaatcg agctgattga accggactac atcaacgacg cctacgagcg cgtgctggcc 3407281 agcgacgtgc gctaccgctt cgtcatcgac atctcagccc tgtgaggccg gtgcgcgatc 3407341 acttccggat tcggactcgc cgacgtcgac gccggccagc ggccatccgg cggcggccag 3407401 gatgcctgcc acccgttgga tgttttccgg tccggcgtcg tgatgggtca cctcggagat 3407461 gaactcggcg atctcgtcgc ggtcgatcac acggtccgcc acggcgggcg acccgttctc 3407521 ggtgaagtgg cgcaccacct ccccgatctg ctcctcggtc agcggggtgc tgcgcaatag 3407581 cgacagcagc gccacccggt ccggcccggg gacgccctcg gggtagccaa cctgaagcca 3407641 gcgcagcacc gaacggaaga aatgcgggtg cgagaacgtt ttcgtcaccc caccagtctc 3407701 aaggtttcga catcactcgc gccagtgtgg tgcggcgcga ttcagacaat tcacgaggcg 3407761 ttcaccacga tcgcgagccc atggacccat gagcccgtga cattctgcag cgtcgtctag 3407821 cgggacggca acgacgaact gggttttcac cccgctcgat ttttcacccc gctcgattag 3407881 gtggcgtttg gcaagctggc tcgcgcgctg cggggcaagg ccatctggcg ttgcgctgtc 3407941 acgcgctgga gtgccctcgt gagaaatgac cggccccggg cagcggacgt cgacggtgtc 3408001 aggccggcca gtcgccgcgg gtcaaagagc ttggcgtgac gctccaccgg tagctatcgg 3408061 attccagaag cttgggcagc caattgtccc aggtgccagt cgcgccgcca gcggtatgca 3408121 ccgcggtacg cgcggcaaca aacgccttgt gacgagcgcg tccgagcggt catcggcctc 3408181 caccgtcatg cacagctcct tctccaggtc tacgccgacg tcgcggtcca cattggtgag 3408241 cttggcgaat gcctcggcaa cctcgtcgaa atgcgcctcc gcgtccgcat cgaacggtcc 3408301 gcccatgtca aagatcaact cgacgtagta gctagttacc gcatcaggtc agtgtttgct 3408361 ggcctcggag tccggccgaa caatggccca ttttcccgcg actctagaag tcccagtcat 3408421 cgtcctcggt gacgaccgcc ttgccgatca catagctcga cccggatccg gagaagaagt 3408481 catggttctc gtcggcgttg ggtgacagcg ccgacaggat ggccgggttc acgtcggtct 3408541 catcgcgggg gaacagcgcc tcatagccga ggttcatcag cgccttgttg gcgttgtagc 3408601 gcaagaactt cttgacgtcc tcggttagcc cgacctcgtc gtagaggtcc tgggtgtatt 3408661 ccacctcgtt gtcgtagagc tcgaacagta gctcgtaggt gtagtccttg agctcggcgc 3408721 gcgtgacgtc gtcaaccaac gccagaccac gctggaactt atagccgatg tagtaaccgt 3408781 gcacggcctc gtcgcggatg atcagccgga tcatgtcggc ggtgttggtc aacttggccc 3408841 gactcgacca gtacatcggc aggtagaacc cagagtagaa caggaagctc tccagcaggg 3408901 tggaggccac cttgcgcttg agcggctcgt cgccgcggta gtactgcagc acgatctcgg 3408961 ccttgcgctg cagattgcga ttttcctccg accagcggaa ggcgtcgtcg atctcggcgg 3409021 tggaacacag cgtggagaag atctggctgt agctcttggc gtgcaccgac tccataaacg 3409081 cgatgttggt caacaccgcc tcctcatgcg gagtcagcgc gtcgggaatc aggctgaccg 3409141 caccaacggt gccctggatg gtgtccagca tggtcaggcc ggtgaagacc cgcatggtta 3409201 gttgcttctc gccggcggtc agggtgcccc acgacgggat gtcattggac accggcacct 3409261 tctcgggcag ccagaagttt ccggtcagcc gatcccagac ctcggcgtcc ttctcatctt 3409321 gcagtcggtt ccagttgatc gctgagactc gatcaattag ctttgcgttt ccagtcacca 3409381 gaaccccact tcaccaggac aacaagctgc cttgctaggc ctcaaacact acccctgggg 3409441 tccgacaagg tactgcaaca caagaagttg tgtttgcgtg tcgcgaatcc gctcgcctgt 3409501 ttcgccggct agttcgccgc agcgaccgtc gcgcggtcgc tcgacaaacc gttgccgatc 3409561 ccgaagaacc ggtactcggc ggggttgacc gagcgggtgg tcagccagta ttgccaggtg 3409621 tagccgcacc agagcacggt gttcttgccg tgctcgtcga gataccagct gcggcagccg 3409681 ccactgttcc acaccgaccc agccagcctg cgctgcagct cctggttgaa ccggtcttgc 3409741 gcctcgcggg tgggggccag cgcttgcacg cccatccggt cgcatttcgc gatcgcatcg 3409801 gccacgtaat ggatctgcga ttcgatcatg aacaccacgg agttgtgtcc cagcccagtg 3409861 ttcggcccca gcaggaagaa caggttgggc atgttggcga cggtgatccc gcggtgtgca 3409921 ccgatgccct cacggttcca gcggtcgacc aggtcctcgc cgtgacgccc cttgatctgc 3409981 acataggtat aggagtcggt gacgtggaag ccggtggcgt acacgatcac atcggcttcc 3410041 cggaagacct cacggccagt gccgtcggcg gtgacgatcc cgtcgtgcgt gatccggtcg 3410101 atgcggtcgg tgatcagttc ggtcttcggg tccgccaccg cggggtaata ggtagaggag 3410161 ttcaggatcc gtttgcagcc gatgcgatac cgcggcgtca gcttgcgccg cagctcgcga 3410221 tccttcaccg atcgacgaat attgtatttg gcataggcct cgatgatctt caacgtgttg 3410281 ggccgcttgg tcatgccgta ggccagcgcc tcctgggccc agtagatgcc gaggcgcaac 3410341 agtgcccgta gcccggggac ggttcgcaac gcccggcgca gcgacaccgg cagctcttcg 3410401 ttggtgcgcg ggaccaccca cggcggggtg cgctgataga gctgaagttc ggcgacctgg 3410461 ccgacgatct cgggcacgat ctggatcgcg ctggcaccgg tcccgacgat cgccacccgc 3410521 ttgccggtca ggtcgatact gtggtcccac tgggcggaat ggaaagcggg gccggcgaat 3410581 tcgtcgcgac ctgcgatctc ggggaaggac gggatgtgca acgcaccggc cccggagatc 3410641 aggaactgcg cgacgtattc acgcccgtcg gcggtgaaca cgtgccagcg gcattcgtcg 3410701 tcgtcccagt agccgcgatc gacgagcgaa ttgaactcga tgtagcggcg caggccgtac 3410761 ttgtcggtga cccctttgag gtagcccaag atttcgtccc agtaggaaaa caggtgtttc 3410821 cagtccgcct tgggctcgaa cgagaaggag tacaggtgcg acgggatgtc gcacgcgcag 3410881 ccggggtagg tgttgtcgcg ccaggtgccg ccgacgtcgt cggctttctc caatatgacg 3410941 aagtccactc cttgcttttg cagtgcgatg gccatgccca aaccggagaa tccggttccg 3411001 atgatgacgg cgcgggtacg taccggcggc tggttggccg ggcttggcgt ggacggcttg 3411061 gcagccgtat cggcaatgct cacaatggat cggtcttcct gttcagcggc gagtttggcg 3411121 cttcagtcac cctcgccggc gcaaataccc agtacccagg gtatcggata tgacgagtgt 3411181 tgttgattgc cgcagcgatg tcaacggccg ccgcgctcaa cgcacggcgg gattgttggg 3411241 taccgcgtcg tggatcggtt ggtcagggtc gaccgcgatg cccagcgctt cggcggtgcc 3411301 gacgatcacg cccatcatga tggtggtcag atgcgccacg aactgctcac gcggcatgcg 3411361 gcgcgggctg tcgggttcgg ggcccaacca ccactcggtt gccgatgcgg ccgatccgaa 3411421 cgccgcgaat gcggcgagtt cgagcgcggc tcgattcagc tccatctcgc gcagctcgtt 3411481 gttgaacatc tcggccatgg ccagcgtgat ctcccggcct tcgttgaggg tgcgtaccgt 3411541 cgcctcggac tgctttgccg agcggccctg aatgaacacc cgcagcacgt tggggtgctg 3411601 gtcgacgagg ttgacgtact cctcgacgct gcgccggata acttcgcggg cagagtcggt 3411661 ggctaagtcg agcgacggga agatcgccgc ccacagcatg tcacgcagtc gcatcccgat 3411721 agcctcgagc aaatcggact tgtcggtgaa atgccgatag atcttgggct tggcggtgcc 3411781 ggcctcttcg gcgatttggc gcacactcag ctcggggccc agccggtcga tagcgcggaa 3411841 cgccgcgtcg acgatttcgt tgcgcacctt cttgcggtgc tcacgccacc gttcactgcg 3411901 ggcgtcgact ttcacccccg gctttgcact ggggtggggt cgggggattc tgaccacatc 3411961 aagcacctta ccgcgttgca agcgctgacc tgggcagact ggccacgcca ggcttggttg 3412021 aatgtgaggt tcacgacgcg acacgccgcg aagccgtcgc cactttcact ctggcgcgcc 3412081 ggtgctacag catgcaggac acgcaaccct cgacctcggt gccctccaac gccatctgcc 3412141 gcagccggat gtagtacagc gtcttgatcc ccttgcgcca ggcgtaaatc tgcgccttgt 3412201 tcacgtcgcg ggtggtggcg gtgtctttga agaacaacgt cagcgaaagc ccttgatcca 3412261 catgctgggt ggccgccgcg taggtgtcga tgatcttctc gtaaccgatc tcgtaggcgt 3412321 cttcgtagta ctccaggttg tcgttggtca tatacggcgc cgggtagtag acccgcccga 3412381 tcttgccttc cttgcggatc tcgaccttcg acacgatcgg gtgaatcgac gacgtcgaat 3412441 ggttgatgta ggaaatcgac ccggtcggcg gcaccgcctg caggttctgg ttgtagatgc 3412501 cgtgcgcttg caccgactcc ttgagccgac gccagtcgtc ctgcgttggg atgcggatgc 3412561 cggcgtcggc gaacagctgg cgtaccttct gggtcttcgg ctcccaaatc tggtcggtgt 3412621 acttgtcgaa gaattccccg gacgcgtact tggaccgctc gaaacccttg aagtgcgtgc 3412681 cgcgttcgat cgcgatgcgg ttggatgccc gcaacgcgtg atacagcacc gtatagaagt 3412741 agatgttggt gaagtcgatg ccttcgtcgg atccgtagaa gatgcgttcc cgggccaggt 3412801 agccgtgcag gttcatctgt cctagcccga tcgcgtggga gtcgttgttg ccctgctcga 3412861 ttgagggcac cgacttgata tgggtttggt cgctcaccgc ggtcaacgcg cggatcgcca 3412921 cctcgatcgt ctgcgcgaag tccggcgagt ccatcgtctt ggcgatgttc agcgacccca 3412981 ggttgcacga aatgtctttg cccactttgg catacgacaa gtcctcgttg aacaatgacg 3413041 gcgtagacac ttgcaggatc tccgagcaca ggttgctgtg cgtgatcttg ccatcaattg 3413101 gattagcgcg attgacggtg tcttcgaaca tgatataggg gtagccggac tcgaactgca 3413161 gctcggccag cgtctggaag aactcccgtg ccttgatctt ggtcttgcgg atgcgcgcgt 3413221 catcgaccat ttcgtagtac ttctcggtga ccgagatgtc agcgaacggc acaccgtaga 3413281 cccgctcgac atcgtagggc gagaacaggt acatgtcatc gttgcgcttg gccaactcga 3413341 aggtgatgtc ggggatcacc acccccagac tcagcgtctt gatccggatc ttctcgtcgg 3413401 cgttctcacg cttggtgtcc aggaatcggt agatgtcggg gtgatgggcg tgcaggtaca 3413461 ccgcgccggc accttgacga gcgcccagct ggttggcgta ggagaacgca tcctccagca 3413521 acttcatgat ggggatgacg cccgaggact ggttctcgat gttcttgatc ggcgcgccgt 3413581 gctcgcgaat gttggtcagc agcaacgcca ctcccccgcc acgcttggat agctgcagcg 3413641 cggagttgat cgaccgtccg atcgactcca tgttgtcttc gacgcgaagc aaaaaacagc 3413701 tcacgggctc cccgcgctgc ttcttgccag aattcaaaaa cgtcggtgtg gcgggctgga 3413761 agcggccgtc gatgatctcg tcgaccagca gctcggcaag tgcggtatcg ccggcggcca 3413821 acgttagcgc caccatgacc acgcggtcct cgaagcgctc cagatagcgc ttcccgtcaa 3413881 aggttttcag cgtgtaggag gtgtagtact tgaacgcacc caaaaacgtc ggaaaccgga 3413941 actttttggc gtaggcgcgg tctagcagcg tcttgacgaa gttgcgcgag tactggtcga 3414001 gaacctcacg ctcgtagtaa ttctcgcgga tcaggtagtc gagcttctcg tcctgattat 3414061 ggaagaagac cgtgttctga ttgacatgct gcaaaaagta ctggtgggct gcttcccgat 3414121 ccttgtcgaa ctggatcttg ccgtccgcgt cgtacaggtt cagcatcgcg ttcagcgcgt 3414181 gatagtccgt ttcgcccggc cccccagagt aagaggcgtg cgcgccggag gctacaggct 3414241 ctgcaatgac ggttggtggc acgtctgttc cttccagaat tcagcgagac cggtgcggac 3414301 ggcggcgacg tcgtcctcgg tgcccatcag ttcgaagcgg tataggtagg gaacgctaca 3414361 ttttcgggag acgacgtcgc cggcgtagca gaactcggca ccgaagttgg tattgccggc 3414421 agcgatgacc ccgcgcagct gcgctcgatt gtggtcgttg ttcaagaagg caatgacctg 3414481 tttggggacg tatccgccgg catcgagacc cgggttggcc cggccgccac cgtaggtggg 3414541 cagtatcagc acgtacggct cgtcgacctc gatccggcca tgcagcggta tccgcgtggc 3414601 gggaataccc agtttctgca caaagcggtg ggtgttctcc gacacgctgg agaaatagac 3414661 caggctgcgc cccgcgatat ccatggcacc gcaatcttcc ttatctatgt ctgccgcgct 3414721 aggcggtcag cgctgccccg gcgagcgcct tgatgcgatc ggggcggaaa cccgaccagt 3414781 ggtcgtttcc ggcgaccacg acgggtgctt gtaggtaacc cagcgccatc acgtagtccc 3414841 gcgcttcgga atccaggctg atatcaacct tctggtaggc gatgccctgc ttgtccagcg 3414901 ccttggaggt ggcactgcac tgtacgcacg cgggcttagt gtaaacggtc acggtcatgg 3414961 gcgtaccgct cctttgcgga aatcgggaat ctgacaggat ctggcaacga ctcaagtagt 3415021 gcatcttcga tatgttgagc ggcccgacaa ggctccagat tcccgtcata gcgcgaccac 3415081 gtccgtcgac ctggcgatgc cgatgccggg aagttcatcg cgccccgtgg atctggtgag 3415141 acctggtgaa cctgggatct gccggtactc gaaaacacta cacctagggg gtggcaccta 3415201 gggggtggca cggagaagag atacaagatg ttctgaataa cattttcgaa attccctggt 3415261 cgtaagcctg gctcgagcac cgcggcggcg tgtcgcagat cacctcagcc gccgccatac 3415321 ccgtctgacc caatatatca ccggccaccg acaacgtcgc cgctagcttc ccgccgaacc 3415381 gctaccagca aagccgatgg agccgctatt ggctgacccg cctcggccca gggatcagcc 3415441 gacctcagcg gccaagttgc cgacggcgtc gcgcacattc gccgccagcc cggcgtcgtc 3415501 cgcgaccgac ttgcccagag tttggaacgg caccgacagt ttgatcgcat cgaccacccg 3415561 cgtgccagcg atgctgaacg acttgcgagt ctcgtcgtgc gcccataccc cgccgtagcg 3415621 gcccatggag ccgccgatca cggccaacgg cttgtccttc aacgcgccat cgccgaatgg 3415681 cctggacagc cagtcgatcg cgttcttgat cacggccgga atgctgccgt tgtattccgg 3415741 cgtgaccacc aaggcagcgt gcgcgtcaga cgcggcctcc cgcaacgcgc tcaccggcgc 3415801 cggcacctcc gtcgctgtgt cgatgtcttc gttgtagaac ggcaggtccc ccagcccctc 3415861 gaacatggtg acggtgacgc cgtccggagc gaccttggca gccagctcgg cgatctggcg 3415921 gttgaacgac gccgcgcgca ggcttcccac taaggccaag attttgatgt cggacttggt 3415981 atctgacact gctacgttcc tttccgcttg ttggtccacg tccttgcacg agccaaccgg 3416041 accatggtcc gatttattcc gatcgcgtta cagtgcaaag gtgagcggcg ccgagcggtt 3416101 gggtgacttg cctgtgttcg cgaggcaaga gcccgtacca gagcggggcg acgcggcacg 3416161 caatcgtgca ctcctgttgg aggcggcgcg ccgcctgatc gcccgaagcg gtgcggacgc 3416221 aatcaccatg gacgacgttg ccgcggccgc tggcgtcggc aaaggcacct tgtttcgccg 3416281 cttcggcagc cgtgccggcc tgatgatggt gttgctcgat gaagacgagc gagccagtca 3416341 gcaggccttc ctgttcggcc cgccaccgct gggcccggat gctccgccgc tggaccgcct 3416401 gatcgcattc ggtcgggagc gaatgcgctt cgtccatgcc catcaccagc tgctgtcgga 3416461 agccaaccgg gatccacaaa cccgccacag cgcggcgcta tcggtactgc gcacccattt 3416521 gcgggtactg ctggcctcgg cgccgaccac cggcgacctg gatgcccaga ccgatgccct 3416581 gctagcgctg ctcgacgtcg actatgtcga gcaccaactc aacgccggcg gccataccct 3416641 gcaaaccctg ggcgacgcat gggagagcct ggcgcgaaaa ctgtgcggac gatgatcgat 3416701 cactatgccg acagcagcac cgcgatggat cctgcacgta gacctcgacc agtttttggc 3416761 gtcggtcgaa ctgctccgcc accccgaact ggcaggtttg ccggtcatcg tcggcggcaa 3416821 cggtgatccg accgaaccgc gaaaggtcgt cacctgtgcg tcgtatgagg cccgcgccta 3416881 cggtgtgcgc gccggcatgc cgttgcggac cgccgcccga cgatgccctg aggccacctt 3416941 cttgccgtcg aacccagccg cctacaacgc ggcgtccgag gaggtggtgg cgttattgcg 3417001 cgacctggga tacccggtcg aggtatgggg ctgggacgag gcttacctcg cggtggcgcc 3417061 cgggactccc gacgacccca tcgaagtcgc cgaagagatc cgaaaagtca tcttgtcgca 3417121 aaccgggctg tcttgctcga taggtatcag tgacaacaag cagcgcgcca agatcgctac 3417181 cgggttggcg aaaccagctg gcatctatca gctcaccgat gccaactgga tggccatcat 3417241 gggtgaccgt accgtcgaag cactgtgggg tgtggggccc aagactacga aaaggctggc 3417301 aaagcttggg atcaacaccg tttaccaact tgcacacacc gattccgggc tattgatgtc 3417361 cacgttcggt ccgcgaaccg cgctgtggct gctgctggcc aaaggcggag gcgataccga 3417421 agtcagtgcc caagcttggg ttccacgctc gcgcagccac gccgtcacct ttccacgaga 3417481 cctcacctgc cgatccgaaa tggaatcggc cgtgacggaa ttggcgcagc gaacactcaa 3417541 cgaggtggtg gcttcgtcgc gaaccgtcac ccgagtcgcg gtcaccgtgc gcacggcgac 3417601 gttctacacc cgcaccaaga tccgaaagct gcaagctccc agcaccgatc ccgacgtcat 3417661 caccgctgcc gcccggcacg ttcttgacct attcgagctg gatcggcccg tccggttgct 3417721 gggagtgcgg ttagaactgg cctagaaccg gcgggcacac cgcacctggg cggcgcgaag 3417781 tcttgaccgc accggccgct atggcccggg ccgaagcgcg cgcgtgaaga acacgttgac 3417841 tcgtcgcatc accagggtgt atggccacca cgcatatcgc ttgaacgcat acagcgcccg 3417901 gatgtccgcc gacgtataga ccaggtatct gttccttgtg accccggcca aaattttgtc 3417961 ggccgccttc tccggcgtca cggcgtgacc actgaaccgt tcgacccagc ggttgaccct 3418021 cgggtcgtcg cgatccactc cggcgatctc gaccgtattg accagcgggg tcttgacggc 3418081 gccaggcacc acgaccgaca ccccgatgcc gtgccgggcc agatcgaagc gcagcacctc 3418141 agaaagtccc cgcaacccgt acttgctagc gctataggcc gcatgccacg gcaagccaac 3418201 cagcccggcc gccgaggaca cattgaccag gtgcccgccc cgaccggcgg cgaccatcgg 3418261 tgggaccaag gtctcgatga cgtggattgg gcccatgaga ttgatcgcga ccatcctgct 3418321 ccactgatcg tgcgtgagct ggtcaacggt gccccaggcc gacacaccgg cgatgtttag 3418381 taccacgtcc atgctgggat gacgggcgtg gatatcggcc gcgaatgccg ccacgtcctg 3418441 gtagtcggag acgtccagaa ctcgatgctc gggcacctga gcgccgagtg cacgggcgtc 3418501 acacacggtt tgcgccaagc catcacggtc gcggtcggtc agatacagct cggcaccttg 3418561 cgccgcgagg cgcaacgcgg tcgcgcgacc gatgccactg gccgcgccgg tgacaaagca 3418621 ccgcttaccc gcgaaatact gtccggctcc cctctgcaac atggtcgtga cgataccggc 3418681 ggtaccgaca ccccctccgg taggacgatc gatgcgcccc gatagctatg gggccttgcc 3418741 gccaccccaa agcgcgttga gccacatctg ttcaagcacc cgaacccggc gcgctgcgtc 3418801 actgtcgggg ccgacgagca aagcgtcacc ggtcagcatc agcgcggtgg tagctgccag 3418861 ggtgcggacg agcgtcggga ggtcttcgct gatcggatgc gcagtgccgg ccttcacctc 3418921 agcctcgaag acgccgatgg tttcacgcaa cagcacttgg aactgccgct cgagaatatc 3418981 gcggatctcc atgtcgctct ggcgtgccgc attacaggcc cgcagcaccg ggtcgttgtt 3419041 cgcgtaaacg gcggcgacgc tgccgatcat ccggttgacg aactgctcgg gtgactcccc 3419101 tggctgacgg gcggagaaat gctggctggc ttcttcgagt tcttcggtgg cctcggccaa 3419161 gatctgggcg agcaccgagt atttggaatc gaagtagaag tagaaaccgg agcgggctac 3419221 ccctgcgcga aggctgatag cgcgcaccga caattccgcg aacggtgtct cctccagcag 3419281 ttcgcgtgcg gcccgcagaa tcgcctggcg atgcctgtca ccacgccgtc gcatcggcgg 3419341 cgcagcctgc ttctcgtctg cggcatgact ggtcaccttt tgatcacccc cttgaccttg 3419401 caccatggcg tctgaaaacg gaacatcggt agccgtcaaa ttgaccagaa ggatagattt 3419461 cagttacagc caccaccggt aaggagcgcc aatggcgacg atccaccccc cggcatacct 3419521 ccttgaccaa gccaagcgtc gcttcacgcc gtcgttcaac aactttcccg gcatgagtct 3419581 tgtcgaacac atgctgctga acaccaaatt cccggagaag aaactcgccg aaccgccgcc 3419641 aggcagcggg ctcaagccgg tcgtcggtga cgcggggctg ccgatccttg ggcacatgat 3419701 cgagatgttg cgcggcggac cggactatct gatgttcctg tacaagacga agggtccggt 3419761 cgtattcggc gactcagctg tgctgccggg tgtcgcagca ctgggccctg acgcggcgca 3419821 ggtcatctac tccaaccgca acaaggacta ctcgcagcag ggctgggtgc ccgtgatcgg 3419881 gcccttcttc caccgcggcc tgatgctgct cgacttcgaa gagcacatgt tccaccgacg 3419941 gatcatgcag gaggcgttcg tccggtccag gctcgccggc tacctcgagc agatggacag 3420001 ggtcgtctcg cgggtggtcg ccgacgactg ggtcgtcaac gacgcacgct tccttgtcta 3420061 tccggccatg aaggcgctca cgcttgacat cgcctcgatg gtcttcatgg gccacgaacc 3420121 cggcaccgat cacgaactgg tcaccaaggt gaacaaggcg ttcacgatta ccacccgtgc 3420181 cggcaacgcg gtgatccgca ccagcgtgcc accgttcacc tggtggcgag gactgcgagc 3420241 acgcgagctg ctggaaaact acttcaccgc ccgagtcaaa gagcgccgcg aagcgtcggg 3420301 caacgacctg ctgacggtgt tgtgccagac cgaagacgac gacggcaacc ggttctccga 3420361 cgccgacatc gtcaaccaca tgatcttctt gatgatggcc gcccacgata cctcgacgtc 3420421 aacggccacg acgatggcct accagctggc cgcccacccg gaatggcagc agcgctgccg 3420481 cgacgaatcg gaccggcatg gcgatgggcc gctcgacatc gaatccctag agcagctgga 3420541 atcgctcgac ctggtgatga acgagtcgat ccggttggtg acgccggtcc agtgggcgat 3420601 gcggcagacg gtgcgcgata ccgaactgct gggctactac ctacccaagg gcaccaacgt 3420661 gatcgcatac ccagggatga atcatcgcct gccggaaatc tggacagacc cgctgacatt 3420721 cgacccggaa cggtttaccg agccgcgcaa cgagcacaag cggcaccgct atgcgttcac 3420781 gccgttcggc ggcggcgtgc acaagtgcat cgggatggtg ttcgaccaat tggagataaa 3420841 gacgatcctg caccggctgc tgcgccgcta ccggctggag ctgtcccgtc ccgactacca 3420901 gccccgctgg gactacagtg ccatgccgat cccgatggac gggatgccga tcgtgctgcg 3420961 tcccaggtag gccctcttcg gcggattccg ccaatccacc ggtgccgcag atgaaagtgc 3421021 cagtgcgcag cccgcaccca ctttcgaccc gcggcgggag tcggtctgga tcagatcccg 3421081 ccgcgggtcg cgcgaatggt cagcgtcgct atcgtgcgcc gacggtgcaa gccctttcga 3421141 cttctatgac gaccgtttga atttggacgt cccctgttgc agaaaaccct cgctgcggtg 3421201 gaacctggcg atagcatctg atgacggtgt ggaaaccgcg gaatatgggt gtgctccagc 3421261 gacgaaaggc tcaatcgatg agcgcgacta aaagcaaggg tttgcgggcg tttcagacac 3421321 tggtcgcggc gctggctgcg gtagttgcag tactagcagc gggctgcgct acccagcgcg 3421381 ttcccacggt tctgccggaa tcggagttaa ttcctcaaag cctcggttag ctgctctgcg 3421441 acctcgccgg acgggtgcag cgcaaccacg cacatgcagg agcaagaagg gcgcgaccga 3421501 tgatcgcaaa aggcaacagg cggatccggg tagggcaatt gctgggcgca gcactggtcg 3421561 ctgcttttgc cctgacagcg gtgggatgca caatccagat gcctcagcca cctctcccgc 3421621 agcaggagtt aaggcggtag gtccggcctc agggtagctg ctaactaccc gatggggcag 3421681 tcacgtcgcc gtcgggcacg gtgcggacga gcgcggacac actcatccgt actttggtgc 3421741 ttagagccac caggaagcgg cagcgtccag atggcggcgg gtgcggcaac gggccaggct 3421801 gtcgtcacca gagccgatcg cttcgagaat ccgtacgtgg gcatgtccca cagcgaccac 3421861 gtcgctccat gtcggcagcg cctgctctgt gctcgacaag tgccggcgga acagctcgac 3421921 cagtatcagc aggaacagat ccagcatcgt attgccggcc gctcgcgcca gtccgacgtg 3421981 gaaccgaaac tcctcgacgg cggccgcgcg tacatcgtcg gtggggttat ccaacctcgg 3422041 ccgccccagc gtatcgagga aggacgcaac ctcaggttcg ctgcggcgct tgacaacttt 3422101 cgcgacattg tcgatctcga tggcatcccg gacgcaccgt aggtcttcgc ggctcggctt 3422161 gcggtactgc agatagagcg cgatggtgtc gatgctggct tgtggctggg gggtggtgac 3422221 gaccaacccg ccgccgggtc cgcggcgcat gtgcgcgatc gcgtgatatt ccagcagccg 3422281 caccgcttcc cgaagcaccg cgcggctcac ctggtagcgt tccaacagcg ctgtctcggt 3422341 cccgaagacc gacccgacct gccagccgct ggcggcgata tcgtcgccaa tggtggccgc 3422401 caacacctcg gccagcttgc cgcggggcgc gcccaggatc agctgctggg cccggcgcgg 3422461 ctcacgcgcc cggcccccgt tgcggacggc cgcatcgttg ccgcgctggt gctgctgcag 3422521 ccagccggct accgcctcaa cgtgtcgttc gcttaaggtt ttggcccacg ccgaatcacc 3422581 cgccgtgacg gccgcgacga tatcggaatg ttcgttgtgc acttgaccgg ccgcctcgac 3422641 ggcctcacct gcggattggg tacctgactt ctggacgtat cgcttggtca gccgcatcaa 3422701 gatgtcgata aacagctgta ggacagggtt tttcgattgc tccgcgagca cgcggtaaaa 3422761 ctgctcaggc ggcgggggca aaccgggccg ccaccgttcc tctgcgcgca agaccgctcg 3422821 cagcctttcg atgccgggtt cgtcgatatg ctctgcggca agagaggccg ccaagggttc 3422881 gagcaccaga cgcgcgccga gcaagtcacc gatggtggtg cccaggtatt cgagatagat 3422941 gaccacggcg cgggtagcgg gcccggcatt tggctcgcag atgaacagcc cgccgttcgg 3423001 tccacgacgc attcttgcca cctgatggtg ctcaaccaga cgcacagctt cgcgcagcac 3423061 cgatcgactc acgcaaaagc gttgctgcaa agcgctttcc gaacccaagg atgctccgat 3423121 cggccagccg cggcggacga tgtccgcctc gatgcggcgg gcgatcttcg acgctcgctt 3423181 gtccgtccag accgcgtccg gctcggtgct catttcaata gagtgtactg tattggctga 3423241 gtcaagggcg cgagctgggc cctagctaat caggggatca cgcggcatgc ccaggatccg 3423301 ctcggcaatc tgattgcggg tcacctccga cgtgccgccg gcgatcgcca tgccacgggc 3423361 gcccatcacc gttcggccaa tcaccctgcc ggggccgtcc agcaacgcaa tctcgggccc 3423421 ccatagcgcg gccgcgatgg cggcgccctc gatcatgtgc tctgccactt tgagcttggt 3423481 gatgttgccc tccggaccag ggccggctcc ttcgacgctg cgagcggcac ggcgcaggtt 3423541 cagcagccgc agtgcgtgat cctctgcgag gaaagcgccg actcgaattg gggcgcccgc 3423601 aaacgcatct gaccgccgct ggaccaattg caccagcttc gccgccattg cttcgtagta 3423661 cgagccactg ccgccgatgc tgacccgctc gttgcccagc gttgcccgcg ccaccgtcca 3423721 cccggagttc ggcgccccga caacgtcctc atcggggacg aagacatcgt tgaagaacac 3423781 ctcgttgaat tccgagtcgc cggtgatctg ccgcagcggc cgcacctcga caccgggggc 3423841 caacatgtcg atgatcaccg tggtgatgcc agcgtgtttg ggggcatccg gatcggtacg 3423901 cacggtagcc aggccacgcg cgcagtactg cgctccgctg gtccacacct tttgcccgtt 3423961 gatcttccag ccgccctcca cccgagttgc gcgggtcttg accgaggccg cgtcagaccc 3424021 cgcgtcaggt tcggagaaca gttggcacca tatctcctgc tggcgcagcg ctttctcgac 3424081 gaatctttca atctgccaag gcgttccgtg ctgaatcagc gtcaagatca cccacccggt 3424141 gatcgagtaa tccgggcgct cgatgcccgc cgcgctgaac tcttcctcga tcaccaactg 3424201 ctccaccgcg cccgcggcac gaccccacgg cctgggccaa tgcggcatca catagcccgt 3424261 ctcgatcagc ttgtcgcgct gtgcatcctt ttccagagca gcgatttcag cggcgtccga 3424321 acggatgcgg gcgcgcagct cctcggcctg tgccggcagg tccaagctga tcgcccgggt 3424381 aacgccagcc gcggtgcgct cgaaaacgtc tcggacgggc gcatcaccgc cgaacaatcc 3424441 cacggtcacc aacgcccggc gcagatgcag atgcgcgtca tgctcccagg taaagccaat 3424501 accgccgtgc acctggatgt tgagctcggc attgcgtgca taggccggaa acgccagggc 3424561 cgcagcgacc gcggcggcca gccgaaactg ctcctcatcc tctgctgccg cacgcgcggc 3424621 atcccagacc gcggcgatcg ccgactcggc ggccaccagc atgttcgcgc agtgatgctt 3424681 caccgcttga aacgtggcga tggtacggcc gaattgctgt cgcaccttgg cataggccac 3424741 ggcgctgtcc acgcagtcgg ccgccccacc gacggcctcg gcggccagca atgtgcgcgc 3424801 gcgggccaaa gccgattcat acgcaccaag caggatgtcg tcggtcgtga cgcgcacgtt 3424861 gtccaggcgc acgcggccac tccgccgggt cggatcaaag ttttccggca catcaaccga 3424921 gacgcccttg cggccgcgtt ccaacaccag cacgtcgtca ccggcggcaa ccaacagcag 3424981 ctcggcaagc ccggcgccca acacgattcc cgcctcaccg tcggcaacac cgtcggtaac 3425041 ctgcacctga ctatccagtc ccacacccgc cgtcagggtt ccgtcaatca gcgccggcaa 3425101 cagccgtgcc cgttggtcat cagtaccttc tttggcgacc accgctgagg cgatcacggt 3425161 cggcacaaac agccccggtg ccaccgcacg accgagctct tcgatcacca ccacaagctc 3425221 ggacaggcca tagccagagc caccgtgtcg ctcgtcgata tgcaggccga gccagcccag 3425281 ctcggcgagg ttctgccaga acggcgggcg ggcgtccccc gccgcgtcca gtgatgcacg 3425341 cgccgcccag cgcaccttct gcgaagtcaa gaacgcgcga gccaccccgg agagctcgcg 3425401 atggtcgtcg gtcaatgcaa tacccatcaa ggcctcctag cggcactacc ggacccacat 3425461 agcccccagg cggtattggt aaagagtata ctaattgtct gtcgcggccg cgagacacgg 3425521 cttgctcggg cacgccagcc ttgccctcgc caacgatgtc ggcgagacat gccaagctga 3425581 accgtgctcc ttcacgacgt ggccatcacc tcaatggacg tggccgccac ctcgtcgcgg 3425641 ctgaccaagg tcgcgcgcat cgccgccctg ttgcaccgcg ccgcgccaga cacacagctg 3425701 gtcacgatca tcgtgtcgtg gctctccggc gagctgccgc aacgccatat cggtgtcggg 3425761 tgggcggcat tgcggtccct accgccgccc gcgccgcaac cggcgttgac cgtcaccggt 3425821 gtcgacgcca ccctctctaa gatcggcact ctaccgggca aagggtctca ggcgcagcgc 3425881 gcggcactcg ttgcggaatt gttctccgcc gcaaccgaag ctgagcaaac ctttttgttg 3425941 cgactgctcg gcggtgaact gcgccagggc gcaaagggcg ggatcatggc cgatgcggtc 3426001 gcccaggccg ccgggctccc ggccgcgacg gtccaacgcg ccgcgatgct aggcggcgac 3426061 ctggcggcag cggcggcggc cggcctgtcc ggcgcggcgc tggacacctt caccctgcga 3426121 gtgggccgac cgataggccc gatgctggca cagaccgcga ccagcgtcca tgatgcactc 3426181 gaacgtcacg gcggcacaac cattttcgag gctaaactag acggcgcgcg agtgcagatc 3426241 caccgggcaa acgaccaggt caggatctac acccgaagcc tggacgacgt cactgcccgg 3426301 ctgcccgagg tggtggaggc aacactggca ctgccggtcc gggatctagt ggccgacggc 3426361 gaggcgatcg cgctgtgccc ggacaaccgg ccgcagcgtt tccaggtcac cgcatcacgg 3426421 ttcggccgat cggtcgatgt tgcggctgcc cgcgcgacgc agccactttc ggtgttcttc 3426481 ttcgacatcc tgcatcggga tggtaccgac ttgctcgaag cgccgaccac cgagcggctg 3426541 gccgccctgg acgcactggt gccggctcgg caccgcgtgg accggctgat cacgtccgat 3426601 ccaacggacg cggccaactt cctggatgcg acgctggccg ccggccacga gggggtgatg 3426661 gccaaggcac cggccgctcg ttaccttgcg ggtcgccgcg gagcgggctg gctgaaggtc 3426721 aagccggtgc acacactcga cttggtggtg ctcgcggtgg aatggggctc gggacgccgg 3426781 cgcggcaagc tctccaatat tcacctgggc gcacgcgatc cggctaccgg tggattcgtg 3426841 atggtgggca agaccttcaa aggaatgacc gacgccatgc tggactggca gaccaccagg 3426901 tttcacgaga tcgcggtggg tccgacagac ggctacgtcg tccaacttag gcccgagcag 3426961 gtggtcgagg tagccctcga cggcgtgcaa aggtcgtcgc gctacccggg cgggctggca 3427021 ttgcggtttg cccgcgtggt gcgctaccgc gccgacaagg acccggccga ggccgacacc 3427081 atcgatgccg tgcgcgcgct ctactgatcg cacggcgaga gtgactcctg cgacgggaca 3427141 cgccggctgg gcgtcgccag attcacgctc gtcgaccaag cgggcgggac aagcagctgc 3427201 aaggatcaac ggagatcgca cccgtgattg agggaggtga cggtggcagc gccgaccccg 3427261 tcgaatcgga tcgaagaacg ctccggacac gccagctgcg tccgcgccga tgccgacctg 3427321 ccacccgtgg ccatcctcgg tcgctccccc atcacgcttc ggcacaagat cttcttcgtg 3427381 gccgttgccg tgatcggcgc tctcgcctgg accgtcgtcg cgttcttccg caacgagccg 3427441 gtcaacgcgg tctggatcgt ggtcgcagcg ggctgcacct acatcatcgg gttccggttt 3427501 tatgcgcggc tgatcgaaat gaaagtcgtc cgtccccgcg acgatcacgc caccccggcc 3427561 gaaatcctcg acgacggcac cgactacgtg cccaccgacc ggcgggtggt attcggacac 3427621 cacttcgccg ccatcgccgg tgccgggccg cttgtcggac cagtactggc cacccagatg 3427681 ggttacttac ccagcagcat ctggattgtc gtcggcgcgg tgctggccgg atgtgtccag 3427741 gactacctgg tgttgtggat ctccgtgcgg cggcgtggcc gctccctggg tcagatggtt 3427801 cgcgacgaac tcggcgccac cgccggagtg gccgccctcg ttggaatccc ggtcattatc 3427861 accattgtga tcgcggtgct ggcgctggtg gtcgtgcggg ccctggccaa gagcccatgg 3427921 ggcgtcttct cgatcgccat gaccatcccc atcgccatct tcatgggctg ctacttgcgg 3427981 ttcctacgtc ccgggcgggt gtcggaagtt tcattgatcg ggatcggact gctgctgctc 3428041 gccgttgtct ccggtgattg ggttgcccat acctcctggg gcgcagcgtg gttcagcttg 3428101 tcaccggtga cactgtgttg gcttctcatc agctatggct tcgcagcttc ggtgctgccg 3428161 gtgtggctgc tgctcgcgcc acgcgactac ctgtcaacgt tcatgaaggt cggcaccatc 3428221 gcgcttctcg cgatcggtgt ttgtgcggct cacccgatca tcgaggcccc agcggtgtcg 3428281 aaattcgccg gtagcggcaa cggcccggtg ttcgccggct cactgtttcc attcctgttc 3428341 atcaccatcg cgtgcggggc gctgtctgga ttccacgcgc tcatctgctc gggcacgacg 3428401 ccgaagatgc tggagaagga aggccagatg cgcgtgatcg gctacggcgg catgatgacc 3428461 gagtccttcg tcgccgtcat agcactactc accgcggcga tcctcgacca gcacctatac 3428521 ttcaccctca acgcgccgtc cctgcatacc cacgacagcg cagccaccgc cgccaagtac 3428581 gtcaacgggc tcggtttgac gggctcaccg gtgaccccag accacatcag ccaggccgcc 3428641 gccagcgtcg gcgaacagac gatcgtgtcg cgcaccggcg gtgcgccgac gctggcgttc 3428701 ggcatggcgg agatgctgca tcgagtggtc ggcggtgtgg gcctcaaggc gttctggtat 3428761 cacttcgcga tcatgtttga ggctctgttc atcctcacca ccgtcgacgc cggcaccagg 3428821 gccgcgcgct tcatgatctc cgatgcgctg ggcaactttg gcggtgtgct gcgcaaactg 3428881 cagaatccga gctggcgtcc cggtgcgtgg gcttgccgtt tggtggtcgt cgcggcgtgg 3428941 ggcagcatcc tgctgctcgg tgtgaccgat ccgctgggcg gcatcaacac gctgttcccg 3429001 ctgttcggca ttgccaacca gttgcttgcc ggaattgcgc tgaccgtcat caccgtcgtc 3429061 gtcatcaaga aggggcgact gaagtgggct tggataccgg gtattccact gctgtgggat 3429121 ctggcggtca ccctgaccgc atcgtggcag aagatcttct ccgctgatcc ttctgtcggc 3429181 tactggactc agcatgctca ctacgcggca gcccagcacg caggcgagac cgcgttcggc 3429241 tcggccacca acgccgatga gatcaacgac gtcgtccgga acacattcgt ccagggcacc 3429301 ctgtcgatcg tcttcgtggt ggtcgtcgtg ctggttgttg tcgccggagt catagtggcg 3429361 ctgaagacaa ttcgcggccg cggcataccg ttggccgagg acgatccggc gccgtcgacg 3429421 ttgttcgcgc ccgctggcct gattcctaca gccgcagagc gaaagttgca acgacgtttg 3429481 ggcgcgccgg cctcggcttc cgtcgcggcg cccgactagc cctcccgctg cagtggtacc 3429541 ggcgccgcaa tcagacggcg agtaggcgtg ggtccaaccc gcgattcgcg gcagccggcg 3429601 gagagggcga ccaagagacg ttatcggttc gctcggggac tcatggccgg tctgctgggc 3429661 acgatggctc tcacgagcgg cggtggtgtc gctcgcgagg atccattgga acctgatccg 3429721 ctagccccga tcatcgacga ttccaggtaa acggattcga aggcacctat agggacgtgc 3429781 cctgacgccc cgccacaatg gacgcttggg tagcctgacc agccttatgc agtgacagtg 3429841 cgtcgagcat caattgagta gatcccacca ccggtgaaca ccagcaggaa gaagccgaag 3429901 cagaacagta tcgccggagt tccgccattg ccgtccggtg gaccgccgat cggccacagt 3429961 gcatacggtt gatgcatcca gaagtaggcg accgccattt cgcccgaggc aacgaacgcc 3430021 acagcgcggg taaacagccc ggttgcgatc agcagacctg ccaccaactc gatgaccccg 3430081 gcataccagc cgggccagga tccaaattcg acgggttgag ccgaggtgac gggccagccg 3430141 aaaaggatca tcgatccgta gccggcgaac agcagcccgt ataccaaccg aaagaggctc 3430201 agcacagccg gcaaacagcc ggcgagccga cggtcgagat ctttcaccat gacacgacgt 3430261 tacggggatc gaccgcgcga acgctgggcg gattttgtct cccaccggtg tgcctactca 3430321 cgtgtggacg cacgagcctc ctttgtgtac atttgtacat gtacaaatgt acacaaagga 3430381 ggggtcttga tctacctata cctcttgtgc gcgatcttcg cggaagtggt ggcaaccagc 3430441 ctgctcaaaa gcacggaagg gttcactcgg ttgtggccca cggtgggctg tctagtgggt 3430501 tatggcatcg ctttcgcgct gctggccttg tcgatctcgc acggcatgca gacggacgtc 3430561 gcctatgcgc tgtggtcggc aatcggtacg gccgccattg tgctggtcgc cgtactgttt 3430621 ctcggctcgc cgatatctgt gatgaaggtg gttggcgtcg gcctgattgt cgtcggcgtg 3430681 gtcacgttga acctggcggg tgcccattga ccgcaggctc cgaccgccgt ccacgcgacc 3430741 cagccggtcg ccggcaggcg atcgtcgagg cggccgagcg cgtgatcgct cgccagggcc 3430801 ttggcgggct gagccaccgc agggttgccg cggaggccaa tgtaccggtc gggtcgacga 3430861 cctactactt caatgacctc gacgcgctgc gggaagccgc gctcgcgcac gccgcaaacg 3430921 cctcggccga cctgttggcg cagtggcgca gcgacctcga caaggaccgc gacctggccg 3430981 cgaccctggc ccggctcacc accgtctacc tggccgacca ggaccgctat cgcacgctca 3431041 acgagttgta catggcggca gctcatcgac cggaactgca gcgcttggcc cggctgtggc 3431101 cagatggtct actcgcgctg ctcgaaccgc gcatcggtcg acgagccgcc aacgcggtca 3431161 ccgtgttttt cgacggcgct acgctgcacg cgcttatcac cggtaccccg ctgagcaccg 3431221 atgagctcac cgatgccatc gccaggctgg ttgcggacgg cccggaacag cgcgaagtgg 3431281 gacaatctgc ccatgcggga cgaacccccg actgacaccg cagcggctcc caccaccggt 3431341 gcggcacctg agattgacac cgcccgcgaa tacgaagtaa ccgccgaata ccagtcctgg 3431401 cgggtcgtct ggggaagcgc cgcagcattg ctgacggtcg gcgtcgggat aggcgcggcc 3431461 atcctcctcg ggtggttcac gttagcgcac cggcacccgg accagcctgg ggcggccgcg 3431521 acaccacccc ctgcggggct aacaacacgg tccgcgccca ccgccgcccc gccgtcaacg 3431581 ctgcaaagcc cagacctgga cagcgtcttt cttggcaacc tgcacgatcg cggcatctcg 3431641 ttcaccaacc ccgatgccgc cgtctacaac ggcaagatgg tctgcaccaa tctcggcggc 3431701 ggcatgaccg tgcagcaggt ggtcgaggca ttgcagagta gcagccctgc acttggcgac 3431761 cggacaaccg cttacgtggc cgtctcgatt cgcacgtatt gtccgaagta cgacgctgtg 3431821 ctgccaccgg gatcctgagt ggagctaagg ggactcgaac ccctgacccc cacactgcca 3431881 gtgtggtgcg ctaccagctg cgccatagcc ccatgaagtg atgcccatcg aagctacacc 3431941 accgccggaa agcgttcaaa gccccaggtc agcgagcctc acccgatgac ccgatcgacc 3432001 acttcgcggg cggtctgctg cacctcgacc agatgttgcg gtccacggaa ggactccgcg 3432061 tagatcttgt agacgtcctc ggtgcccgac ggacgcgcgg caaaccacgc attggccgtc 3432121 gtcaccttca atccgcccag cgcagcaccg ttgccgggcg cggtcgtcag ctttgcggtg 3432181 atcggctcac cggccaactc ggtggcgctc acctggtcgg ccgacagcct ggccaggcgg 3432241 gctttctgct cccgatcggc gggcgcgtcg atccgcgcat agcacggccc accgtactcg 3432301 ccggccagcg cgtgatatcg ctgcgacggc gtagccccgg tgaccgccag gatctcggcg 3432361 gccagcagcg ccatgatgat gccgtccttg tcggtggtcc ataccgatcc gtcccgtcgc 3432421 agaaatgatg cccccgccga ttcctcgccg ccgaagccca aggtggcgcc gatcagaccg 3432481 tcgacgaacc atttgaatcc gaccggtacc tcaacgagtt gacggccgat cccggcgacc 3432541 acccggtcga tgatcgacga gctgaccacc gtcttgccca cggcgatgcc ggccggccag 3432601 gacgggcggt gggtgtagag atattcgatg gccacggcca gatagtggtt aggattcagc 3432661 agcccttcgt caggggtgac tatgccgtgt cggtcggcgt cggcgtcgtt gccggtggcg 3432721 atctggtagc gctcccggtt gccgaacatc gttcggatga gcccagccat cgcatccggt 3432781 gaactgcagt ccatccggat cttcccgtcg gtgtccaggg tcatgaaccg ccaggttgcg 3432841 tcgaccagcg gattgaccac ggtcaggtct aggccatgcc ggtgggcgat ctcaccccag 3432901 taatccacgc tggccccgcc gagcgggtcg gcgccgatcc gcaccccggc ctcgcgaatg 3432961 gcggcgatat cgaccacgtt cggcaggtca tcgacatagt ggcccaggta gtcgtgtcgc 3433021 tgggcggtgc gtaacgcgcg ggccagcggc aaccgcttca ccatcgaccg agcgagcaga 3433081 atctcgttgg cacgcttggc tattgcggtg gtcgcagcgg tgtccgccgg gccaccgttg 3433141 ggtgggttgt acttgatgcc gccgtcggac ggcgggttgt gcgacggcgt cacaacgatc 3433201 ccgtcggcca gcgcttcggt ccggccgcgg ttgtaggtca agatggcgtg gctgattgcc 3433261 ggcgtcggcg tgtagcggtc gcgggagtcg acgacggcca ccacctgatt ggcggcgagt 3433321 acctccagcg ccgataccca tgccggttcc gaaaggccat gggtgtcacg gccgatgaac 3433381 agcggcccgg tggtcccctg ggcggcgcgg tattcgacga tagcctgggt gatggccaga 3433441 atatgtagtt cgttgaacgt tccggtcagg gctgagcccc ggtgccctga ggtgccgaaa 3433501 gcgacctgtt gagcgaggtc gtcgggatcg ggttcgatcg agtagtacgc agtcaccaga 3433561 tggggcaggt cgacgaggtc ttcgggctgg gccggttgac cggctcgtgg gttggccacc 3433621 atggctacca attctgccca caggccctac agtgcgaagc gcagcattag cacaccgaga 3433681 gggatcgacc agtgccaaac cacgattatc gcgagttggc tgcggttttc gccggcggag 3433741 cgttgggtgc gctggcccga gcagcgctga gcgcactcgc catccccgac ccagcccggt 3433801 ggccatggcc gacgttcacg gtcaacgtcg tcggcgcctt cctggtgggt tatttcacca 3433861 cccggctgct ggagcgattg cccctgtcga gttatcgacg cccattgctc ggcaccggat 3433921 tgtgcggcgg actgaccact ttctcgacga tgcaggtcga gacgatcagc atgatcgaac 3433981 acggtcattg gggtttggcc gctgcctact ccgtcgtcag catcaccctc ggattgctgg 3434041 cggtgcacct ggccacggtc ttggtacgcc gagtgcggat acgccgatga cggcctcgac 3434101 ggccctgacg gtggcaatct ggatcggcgt gatgctcatc ggcggtattg ggtccgtgtt 3434161 gcgttttctg gtcgatcgct cggtggcccg ccggctggcc cggacttttc cctacggcac 3434221 actgacggtg aacatcaccg gagccgcgct gctggggttt ctggccggcc tggcgttgcc 3434281 gaaagacgca gccttactgg ccggcacggg gttcgtcggc gcctacacca ccttttccac 3434341 ctggatgcta gaaacccaac ggttgggaga ggaccgccag atggtttcgg cattggccaa 3434401 tatcgtcgtc agcgttgtgc tcggtctagc cgcggcgcta ctcggtcagt ggatcgccca 3434461 gatatgaacg agcaatgcct gaagctgacc gcgtatttcg gcgagcggca acgcgctgtc 3434521 ggcggggcgg ggaggtttct ggccgatgcg atgctggatc tgttcggctc ccataacgtc 3434581 gcgaccagcg tgatgctgcg cggtaccacc agtttcgggc caaagcacga gtttcgctgc 3434641 gatcaatcgc tgagcctgtc cgaggacccg ccggtgaccg tcgccgccgt cgacatcgaa 3434701 tcgaaaatcc gctccctggt cgacgacgtc acagcgatga ccgaccgcgg cctggtgacc 3434761 ctggaacggg cgcgactggt cacccggcac agcggcgccg aggaattcgg cgacatcgac 3434821 agccgaaacg gagatgccgc caagctcacc atctacgccg gccgccaggt gcgggttgcc 3434881 ggggcgccgg cctactacac catctgcgag cttttgcatc gacatggatt cgcaggtgcc 3434941 acagtgctgc tcggcgtcga cggcacggca cacggtcggc gccgccgggc ccggttcttc 3435001 ggccgcaacg tcaatgttcc actgatgatc attgccgtcg gaacgcctgc acaggttgcc 3435061 gtggccgcaa tggaactcac cgcagcactg cctaacccgc tgctgaccat cgaacgggtg 3435121 cggctgtgca agcgcgacgg cgagttgttc gcccgccccc aacagctgcc gcagaccgat 3435181 gaccagggac gcaccctgtg gcaaaagctc atggttcaca ccgccgaagc aacccatcat 3435241 gaggggctgc cgatccaccg agcgcttgtc catcgactga tgcagtccga aacggcgcgg 3435301 ggcgctaccg cgctgcgcgg catctggggc ttttacggcg accataaacc ccatggggac 3435361 aagctatttc agctggtgcg tagggtgccg gtgaccacga tcatcgtcga cacaccccag 3435421 gctatcgcgc gcagcttcga catcgtcgat gagctgacga actggcacgg gctggtaacc 3435481 agtgagatgg tccctgcggc cgtgtcactc accgggtcac gggatggcac gcaaaagacc 3435541 ggtgaaaccc cactggcgcg ctacgactac tgagtgccag ccgccagatt ggtcagatcc 3435601 cacgtcgggg acgcttaccc aacccgcgat gcgaacatcc atttgtcggc cagcgccgat 3435661 acccagcccg ctacgacctc aggattatcc ggtggcacct cgaccaacac caactcgtca 3435721 acgcccagtt cgcgagcaca tcgacatcac caacctgagg attagccagg gccacggcca 3435781 gccgcagttc gccacgatca cgacccgact gttgccgtcg aacgatgcga cgtcgtcgcg 3435841 ccataatgtg cgcattgcag cgacgtattc ggcggtgcgc tctgcgcgcc gctcgaatgg 3435901 cactccgagc gcgtcgaact cctccttgga ccatccgacg ccacgcctag tgtcagccgc 3435961 ctaccactca accgatccag gctcgccgct tctttggcca ctatcaccgg gttgtgctca 3436021 ggcagcagta gcacgcccgt cgcgacgtcc acccgcgacg aggcggcagc ggcgaaactc 3436081 aacgcgatca tcgggtcaag ccaatccgcc tgtgccggaa ccgcgatgac gccgtcgcgg 3436141 gagtagggat aacgcgacgc gggccggtcc accatcacga catgttcgcc gacccacaag 3436201 gtggcgaagc cacagtcgtc cgccgcaacc gcgacggcat cgacgaccgc cgggtcggcg 3436261 ccggcaccta ttccagcgcg tgcagtccca gtcacatcgc acgagcgtct cacacaggcc 3436321 aattggcatt agcggccgtt gagcaactgc gccaagacgg ccgcatggct gcgtgccacg 3436381 tggcgcgtcg cggtcaccgg ggtcacaacg ctgcgcccgg tcagcttgcg cagctcggct 3436441 agtgccgcgc tgtcgtgcag ctcctcctgg tagcgcgacg cgaattcgtc aaaccgctcc 3436501 ggctggtggt ggtaccactc gcgcagctct ttggatggtg cgacgtcttt gcaccagatg 3436561 cccacccgct ggtcatcctt gcggattccg tgcggccaga tgcgatcgac caggacacgc 3436621 tggccgtcgt cgggatcgat gtcttcatag acgcgggcca cccgcacccg tgtctcgcgc 3436681 accattgtgc cagcgtatag ccgttaccgc gggggcttat ccacagccac cggcgccacc 3436741 agttgtcccg gtctgtgcag gctgctattc tcgaacacat gttcgagaca ttgaccgcga 3436801 tcgacccgga tgccgaggaa gcggcgttga tcgagcgaat cgccgagctg gagcggctta 3436861 agtcggcagc cgcggctggc caggcgcggg cggcggccgc tgtggacgcc gcccgcagag 3436921 ccgccgaagg agctgccggg gtgccggctg cgcgccgtgg acgtgggctg gccagtgaga 3436981 ttgccctggc tcgacgagat tcaccagccc ggggcagccg gcatctgggg tttgccaagg 3437041 ccttggttta cgagatgcca cacacgctgg ccgccctgga ctgcggcgcc ctctcggagt 3437101 ggcgggccac cctgatcgtg cgcgaaagcg catgtctgga tgtcgcggac cggcgcgcat 3437161 tagatgccga gttatgtggc gaccccggcg acttggaggg gatgggcgat gcgcgggtgg 3437221 tcgcggccgc cagggcgatc gcctatcggc tggacccgca ggccgtcgtc gaccgggcgg 3437281 ccaacgccga aaatgaccgt acggtcacca ttcggccggc accggacacc atgacgtatc 3437341 tgaccgccct gttgccagtc gcccaaggcg tgtcggtgta tgcggcgctg acccgagcgg 3437401 cagacacccg ctgcgacggg cgctcccgcg gccaagtcat ggccgacacc ctggtcgaac 3437461 gggtcaccgg ccgcgacgcg gcggtcccga ccccgatcgc ggtcaacctg gtcatgtcgg 3437521 atgaaacgct gctgggtgcg gccaacacac cggcgcagct gtgcggctac ggtcccattc 3437581 ctgcggccgt ggcacggacc atggtcgcta gcgccgtcac cgaccagaga tcgcgggcca 3437641 ccctgcgcag gctctacgct catcctcagg ccggggcgct ggtgtcgatg gaatcacggg 3437701 cgcggctgtt tccccgcggt ctggccgcct tcatcgagct gcgcgatcag cgttgccgca 3437761 ccccctactg tgacgcgccg atccgacacc gcgaccatgc ccacccctgg gccgacggcg 3437821 gcccgaccag cgcgcacaac gggcttggga cctgcgaacg ctgcaactac gccaaacaag 3437881 cccccggctg gcgggtcagc acaagtgtcg acgaaaatca cacgcacaca gccgaattca 3437941 ttaccccgac aggcagtcga caccggtccg gcgccccgcc gcacctgcct gcggtcaccg 3438001 tcagcgaact cgaggtccga atcggcatcg cgctcgctcg atacgccgcc tagtagtggt 3438061 aggtgtcagt cggagccggc atgtgaaccg gttcgtcctc gaagtcggac acttcgatgc 3438121 cgtaggcgcg ggccagatcg aggatcttgg tggcccgggc aatgcgcggc aggtcagacc 3438181 cgttgcggat ctcgcccccg tcgcgggcga actcagcgaa aaattccttc gcccagacga 3438241 tttcgtcctg cgacggggat agcccctcat tcaccaccgg acattggtcc ggcgaaaggc 3438301 agatcttgcc ggtcatgcca aactcggcgg agacggccgt ggcctcgatc agcttgagcg 3438361 cgttggagcc gatggtcggc ccgtcgatcg cgctgggcag accggcggcc cgggccgcga 3438421 tggtaaagcg cgaccgcgcg taggccaatg ttgccgggtc ttcgccaaag ccggtgtccc 3438481 ggcgaaagtc gccgataccg aaggcgagcc ggaaggtgcc cttggccgca gcaatctcgt 3438541 tgatgcgctc cagaccccgc gccgtttcga ccagtgcaac gatcggcacg ttaggtagtc 3438601 gtttcgcggt ctcggtgaca tggtccaccg attcgaccat cgccagcatc actccgccaa 3438661 cggggctatc ggccaacatc gctagatcgt ccgcccacca aggtgtgccg aagccgttga 3438721 tgcgcaccca gtcagcgttt ccgtcaccaa accaacgcac ggcgttgtcc cgggcggcat 3438781 gcttgtcttt gggagcgacc gcgtcctcga tatcgagcac gacgatgtcg gcgcgtgagt 3438841 gcgcggcgga ctcgaaccgg tcgccgtgcg cgccgttgac cagtaaccaa ctccgcgcga 3438901 gaaccggatc gatacgagac ccggccaccg gatccgccgt gttggtatcg acctgttcat 3438961 acattgaggt catctagtgt ctcttcgctc agtcgatgtc gacattgttc tccttaaacc 3439021 gtagcgacgt cgcaaatcgg attggcagga tgccccgcaa aacccacgtc catggtgttg 3439081 gatggcgtgg tgtccgacac tcgccgcagc cggacgatag cggcccggca gcaaaccatc 3439141 tgggacgtcc tggccgactt tggttccttg agttcatggg tcgagggcgt cgaccactcc 3439201 tgcgtcttga accacggtcc cgacggcgga gctctaggca gcacccgccg cgtgcaggtc 3439261 ggccgcaaca cgctggtgga gcgtgtcatc gagttcgacc cacccacgac actggcctac 3439321 cgcatcgagg gcctgcccgc ccggctgcgc aaagtcacca accgctggac actacggccg 3439381 gccgatcctg taggcgcggt gacggtggtc accttgacca gcacgatcga aatcggcggc 3439441 aacccgctgg cgcgtctggc cgaacttgtc gtcggccgcg ccatggccaa gcggtccaac 3439501 acgatgctcg ccgggctggc acaacgattg gaggacaaac atggctaacc gtcccgacat 3439561 catcatcgtg atgaccgacg aggaacgtgc ggtgccgccg tacgagtcgg ccgaggtgct 3439621 cgcctggcgt caacgcagct tgaccggccg ccgttggttc gacgagcacg ggatcagttt 3439681 cactcggcac tacaccggtt cgctggcgtg cgtgcccagc cgcccgacga ttttcaccgg 3439741 ccaatatccg gatctgcacg gcgtcaccca gaccgacggc atcggcaagc gattcgatga 3439801 ttcgcggctg cgctggctac gggccggcga ggtgccgacg ttgggtaact ggtttcgcgc 3439861 ggccgggtat gacactcact acgacggcaa gtggcacatc tcgcacgccg atctggaaga 3439921 ccccgcgacc ggtgcaccac tggccaccaa cgacaacgag ggcgtcgtcg actcggccgc 3439981 ggtgcggcgt tacctcgacg ccgacccgct cgggccatac ggcttctccg ggtgggtggg 3440041 ccccgagccc catggggcgg ggttggccaa cagcggtttt cgtcgcgacc cgctggtcgc 3440101 cgatcgtgtc gtcgcgtggc tgaccgagcg ctacgcccgg cggcgcgccg gtgacaccgc 3440161 cgcgatgcgc ccgttcttgc tggtggccag cttcgtcaac ccgcacgaca tcgtgctgtt 3440221 cccggcatgg gtgtggcgca gcccgctaaa gccctcccca ctggacccgc cacacgtacc 3440281 ggcggcgccg accgccgacg aggacctgtc gaccaagccg gccgcgcagg tcgcctaccg 3440341 ggaggcgtac tactccggat acggcctaac gcgtatggtc agccgcaact atgcccgcaa 3440401 cgcgcagcgc taccgggacc tctactaccg cctgcacgcc gaggtcgacg ggccgatcga 3440461 ccgtgtgggc cgcgcggtca ccgagggcgg atccgaggat gccatgctgg tgcgcacctc 3440521 cgaccatggc gatctgctcg gagcgcatgg cggactgcac cagaagtggt tcaacctcta 3440581 tgacgaggca accagggtgc cgttcgtcat tgcccgcatc ggcgagaagg caacccaacc 3440641 gcgcacggtc tcggcgccca cctcgcatgt cgacttggtg ccgacgctgc ttagcgcggc 3440701 cggcgtggac gtagacgtgg tggccgcggc cctggccgaa tcgttctccg aggtgcatcc 3440761 gctgcccggt cgtgacctga tgccggtcgt ggacggggct tcggccgacg agggtcgggc 3440821 catctacctg atgacgcgtg acaacgtgct cgaaggcgac accggcgcgt ccctgctgtc 3440881 gcggcaactg ggccgtatcg tgaatccgcc tgcaccgctg cgcatcaagg tgcccgccca 3440941 cgtcgccgcc aacttcgagg gattagtcgt acgggtcgat gacaccgacg ccgccggtgg 3441001 tgccgggcac ctgtggaaac tggtgcgtac cttcgacgac ccggccacct ggaccgaacc 3441061 cggtgtgcgt cacctggcca ccaacggcat gggcggcgac gcctatcgca ccgatccact 3441121 ggacgaccag tgggagctct acgacctgac cgccgatccc atcgaggcat acaaccggtg 3441181 gaccgaccca caactgcacg agctgcgaca gcatctgcgg atgctgctca aacagcaacg 3441241 tgcggtatcg gtaccggaac gcaaccaacc gtggccgtat gctcatcgac tgccgccgag 3441301 cggggcatcc aacggtttgg tgcggcgagt gttgggaagg ttcgtgcgct aattgcagaa 3441361 gctgctattc accatcgggt tggccctgtt cctgatcggc ctgcttaccg gattggtcat 3441421 cccggcactg aagaacccgc gcatggcgct gtcgagccac ctcgaggggg tcctcaacgg 3441481 gatgttcctc gtcgtgctcg gcctgctctg gccgcacatc gatctgcccg aggcatggca 3441541 ggttatcgcg gtggcgctga tcgtttactc cgcctacgcc aactggctgg cgaccctgct 3441601 cgcggcggcc tggggagcgg gccgtaaatt cgcgcccatc gcgaccggcg accacaaagc 3441661 cccggccgcc aaggagggat tcgtcagctt tctgttgttg tccctctcgg tggccatcgt 3441721 gatcggcgtg gtcatcgtca tcattggcct ctgacggcga cccgtccaac tacgccagcc 3441781 gcgctagctc ggcctgaagc ttgtccagat atcgaagcgt cgggtcgcga ggctcggtcg 3441841 gcagctccag caaaacccgc tccaccccta gatgccggta tccctcaagg tctttagccg 3441901 ccgcttcacc ccactggcac acggtcaccg gcacgtcgcc cccggccatg gcgcgcaacc 3441961 gctgaagcgg acccgacagc cgctgcggtg atggactgat cgcgatccac ccggcattga 3442021 gccgggctat ccgcgggaag ttcgccggtc ccccgcccac atacagcgga ggatagggct 3442081 ttgtcaccgg cttcggccag cagtagatcg gatcgaagtc cacatatgtc ccatggaatt 3442141 ccgcctgctc ctgcgtccag atctcgatta tcgcgcgcaa ccgctcatcg atcacacgtc 3442201 cgcgcaccgc agggtccaca ccatggttgg cgacttcttc gcgcaaccag cccacaccca 3442261 cgccgaagcg aaaccgtccc tgcgacacca gatccagcga ggcgacctcc ttggccgtga 3442321 cgatcggatc gcgttccggg atcagcgcga tgccggtgcc taacaccagt gactgggtgg 3442381 tagctgccgc ggccgccaac gccacaaagg gatccagggt gcggtaatac ttctccggaa 3442441 ttgggccacc gcccgggtag gggctctgcg tgttgacggg aatatgggtg tgctcggcga 3442501 ggaacagcga ctcaaacccg cggtgctcga gtgccgcacc cagctccgcc gggccgattc 3442561 cctcgtcggt gacgaacgtc aggacaccga attgcatgct tgctcccatc gtcttgtggc 3442621 tgcaagatct gcacgacgat acggccggcc gcgagttagg ccagtcccgc atcgaccagc 3442681 agacgtgaca gcccgagttc ggcgcacttc gtggctaccg gcgccagttc gtttcgcgca 3442741 tcggattccc gtccggtggc ggcaagcgtt tcgatatgaa gtatttgcgc ctgcagcgcc 3442801 gccagcggtc tgcgcgtacc gtcgatggcg gcggcgagag caccggcccg ttggcaggct 3442861 tggtcacgat cggcggagtc gccggcggac aacaggcgca ccgcggagtc ctcgtcgagt 3442921 tcggctgtca tggtggcgat tccattgtcg cgggggatgg tgcggggtgc cagcaaatcg 3442981 gcggccaccg ccgcaggtag cgcgatgccc agccggatcc gctcgttgtt gattcgggca 3443041 gccaggcgcg gcagccccag ctggacggca gtatcgcctc cggtggacag gcgatcagcc 3443101 gcaccctcat gatccccctg ggccgccttg acccgcgcgc cgatcacgta cctggcggcc 3443161 aggtagtcca ctgcaccccc ctcggaaccc agcagatagc tctcgtccat gagacgacca 3443221 gccccggcca gatcgccggt ctcgtagagc aattcggcga gcagcgaacc cgcaagccgc 3443281 gccgcgtgcg agtgggcccc cactgccgtg ccgacctcga acgccgttcg gaagttctgt 3443341 agcgcagcga caatgtcgag ccgattcctg gccgccatgc cgcgcaagca ctgcgcataa 3443401 acggtgccga acggtcccat catttcctgg tagggcgcgg cccagtccag cagtggatat 3443461 acctcggcga actcgaagcg gcagatcgcg gccaacgccg cggtgttgcc ggcggtcccg 3443521 gggactcgcg ggggcagggt gtccggtctc gacattgcct cggcgagaag gtcatccacg 3443581 cgctcgaccc ggtctgcgaa cacctcggcg accgcccgca acacgtctgc ctcggcccgc 3443641 agatccgcct gcgtcgcctc gggaagctcg gcccggccaa gggccgtttc gaaacgattc 3443701 agggcaccgg tggccggcgc cggccgttgc agcagaatgt tcgcccacgc gatggcgagt 3443761 tggagccggg cccgtgaaac caccatcgac gtcggcagtt tctgcacgat tgccagaagt 3443821 gtggtcatct ttgactgctc cggcaggttc gtttcatcct gctcgacaag atcgacggcg 3443881 cgcgcgggat cgcccgcggc cagtgcatgg tcgacggctt cgtgcaggta gccgttctcg 3443941 gcgaaccagg ccgatgccct gcggtgcagt tccgccaccc ggtgcgaccc gccacgttcg 3444001 aggcgacggt ggagaaagtc ggcgaacatt tggtggaagc gaaaccaatt cgggtcgtct 3444061 tcggtccgtt gcaggaacaa gccgcggtgc tcggcctctt ccagcatcgc ccgcccattg 3444121 gtgatcccgg ccagcgccga ggccagcccg ccgcacgtgc gttcggtgac cgatgccacc 3444181 agtaggaatt cgcgcagttc gggttccagg gtgtccagca cgttttcgct caggaattcg 3444241 tggatcacgt cactggcgcc ggaaagtccg cgcaggagtt gggtcgcgtc gcccccgccg 3444301 cgcagcgaca gcgcggccag ccgcagcgcc gcggcccacc cgtcggtaga ggtagtcagc 3444361 gcctgcacgt ctgcgcgcgg caatcgcaga ccaccagcat cgttcagcag cgcggcggcc 3444421 tcgtcggtat cgaagcgcaa agcagccgaa tcgatctcgg ctagttcgtc gccgatccgc 3444481 aacctgccca ccggcaaacc ggcgcgagac cagctggtca cgatgagctg caggtggtga 3444541 catccgttgt ccagcaggaa acccagggca gcttgggtgc ggctgtcgga cacccgatgc 3444601 cagtcgtcga tcaccaccgc gatccggtcg tcgttttcgt ggatttcgtc gatcagcgaa 3444661 gtcaacacgt agcggccggc gtcatcccca tgctcttcga gcacgtgccc caacgactcg 3444721 gccagcgtgg gccggacccg ccggatcgac tcgagcaggt gcgacaagaa ccacacctcg 3444781 ttgttgtcgt cgttgtcgat tgtcagccag gcgaccgcgg cgccgtcgcg cgagagctct 3444841 tcccgccatt gcgccgccag ggtgcttttg ccgaatcccg agggcgcgtg gatgaggatc 3444901 agccggcgcc gtccgccggc gcgcaggatg tcggtgagcc ggctgcgggt gaccagcgag 3444961 ccggtgggca ccgacggccg gtacttggtc gcgggtgtcg gaggcgtcgg gaccgtcggg 3445021 gtgccgccgc cggtatgccg atgcgccgcg tgcgcctcgg gcgagcgtcg gcgttccacg 3445081 cccagctcga cggggagggg catctcgtcg acgctgacgc cgttgcggcg ctgaacgtcg 3445141 cgaagctcct cgccaacgtc tgccgcggtc gcgggacgat ccgccggatg gcgggccatc 3445201 gcccgttcga tggcggcggc cacgtccgcg ggcagtccct gcttccgcag gtcggggatc 3445261 ggctgcgagg tgatccgcag gaactgggcg atcacccgct caccgctgcg gcgctcgtag 3445321 gcggcatggc cggtcagcgc acagaacaac gtcgcgccca gggagtacac gtcagaggcg 3445381 ggcgtcggcg atgctccttc gagaacttcc ggcgcggtga aagccgggga accggcaatc 3445441 accccggtcg ccgtctcgaa acccccggcg attctggcga ttccgaaatc ggtcagctgc 3445501 ggttccccgt agtcggtcag caggatattc cccggcttca cgtcacggtg cagggtgccg 3445561 acgcgatgcg cggcttccag cgctcccgcg agcttgacgc cgatcgacag cgtctcgcgc 3445621 cagtccagcg gcccgtgccg gcgaatcagc gtctccaacg aattcttggc gtggtagggc 3445681 atcacgatga agggccgccc acccgccaac acgcccacct gcaagacggt cacgatgtgc 3445741 gggtgcccgg aaaggcggcc catggcccgc tgctcgcgca ggaagcgctc gagattgtcc 3445801 cgatccaggt cggtgctcaa taccttgacg gcgacggcgc ggtccagcga gggctggacg 3445861 cagcggtaga cgacgccgaa tccgccgcgc ccgatctcct cgacattgtc gaatccagcc 3445921 tcaagcagtt ccgcgggaat attcgggacc aggtcccgcc gcgtcgcgtg cggatcaacg 3445981 tcggtcatcg acggtcacta tcctcggccg ggagggtatc accaccagtt tcatcgccgg 3446041 tgaccccaca ctatcgccaa gccgcggcgt cgcggctcga tacccaccgc acgcaaaagc 3446101 tccgttccca gaccaacgga gggaaggacc ggcaccagtt gacatacgag cagttcgctc 3446161 gtatgttgac gctgatgggg ccgagcgatc tgtggacggt ggaacgcgcg gcgcgccatt 3446221 ggggcgtgag cgcgtcgcgc gctcgcgcta tcctgtcgag ccgccacatt caccgggtca 3446281 gcggctaccc cgcgcaggcg atcaaggcgg tcaccctgcg ccagggtgcg cgcaccgacc 3446341 tcaaaaccgc caaccatctc gtgccggccg cacaagcgtt caccatggcc gagacgggtg 3446401 ccgcgatcgg agagaccgaa gatgagcggg cacgactgcg cattttcttc gagttcctcc 3446461 gcggcgccga tgagaccggg acatccgcgc tcgatctcat cgttgacgag cccgcgctga 3446521 tcggtgagca ccggttcgat gctttgttgg ccgcggctgc ggaatacatt tcggcgcgct 3446581 ggggccggcc tggacccttg tggtcggtga gtatcgaacg gtttctggac acggcctggt 3446641 gggtcagcga cctcccgtcg gcacgagcgt ttgccgccgt gtggacgccg gcgccgttcc 3446701 ggcgccgcgg catttaccta gatcgccacg acctcacgag cgatggagtg tgtgtcatgc 3446761 ccgaaccggt gttcaaccga accgagctcc agcgggcgtt cactgccctg gcggccaagc 3446821 tggaacgcag aggcgttgtc ggtcaggtgc acgttgtcgg cggggcggcg atgctactcg 3446881 cctacaactc ccgtgtcacc actcgcgata tcgacgcgtt gttctcaact gacgggccta 3446941 tgctcgaagc gattcgtgag gtcgctgacg aaatgggttg gccgcgaacg tggctcaaca 3447001 atcaggccag cggttacgtc tcccgcacac caggtgaagg cgcccccgtt ttcgatcacc 3447061 cattcctgca tgtcgtagcc acacccgcgc agcaccttct cgcgatgaaa gtcgttgcgg 3447121 cacgcggcgt gcgtgacggc gaagacattc gcctcctgct cgatcggctg cgaatcacca 3447181 gcgcggccgg cgtatgggag attgtcgcac gctactttcc cgccgaaacc atcaccgacc 3447241 ggtcgaggct cctcgtcgag gacctcctca accaatagca gaccactagc agtgaagccg 3447301 cggccgccgc gcgcagcacc ccagtgtcat ggattatcca tgattcgggc gtccccaatg 3447361 cgaaccgctt ctgtcagtcg gggctggggt ttcaccaccc gtttcaccga ccgctgaccc 3447421 caccataggc tcgatactgc cggggtgtca tcccaaacca gcgccggcac gaccggttga 3447481 gcgcgctctg ctcggaatag ccgagcagca ccgcgatttg gctcagatac aaccccggtt 3447541 gggcgaggta ccttgccgct tgcgcacggc gttcgcgctc gatgaggtca tggcaccgga 3447601 ggccctcggc agccaagcgc cgctgcagcg ttcgtgggtg catgtcgagt tggtcggcga 3447661 tggcctcggc gctgcattgg ccggtcggca gcaggcggcg ggccaacccg acgacccgct 3447721 cggagagcgt ggcatcgctc ggaaggtatt gggattccaa atatttcgtg gcgatgcgct 3447781 tggtttccgg atccgcatgg tcgatgggcc taccggcgag ccggtggtcc acctcgaacc 3447841 cgcaccatgt ccggccgaac cgaacggtac aacccaacgc ttcgcggtag gcggcgtcgg 3447901 tgcccagttg cgcatgtcgg aacgagaaaa cgcgcgcccg cgcctgcggt ccgcccagca 3447961 ggcggatcat ccgggcggcg ttggccatgc tcagctcgta tccctgcagc ggatagggaa 3448021 tccccggttc ggtcacctca tagccgaacc ggacgttgga ccgtgcggta gttgatgaaa 3448081 ccgtcagcgt cagggcgggc gaatggacgt agaggtagcg accgatcgcc tccagcccgc 3448141 cgaacaaggt ggcagcgttg cgcgcgatca ccgctaccgg gccgagaatg cccaggccct 3448201 gccagcgtgc aaggcgtagt ccgaagtccg ggcaatcgag ctcggcggcg ctggcctcca 3448261 gcatgcgcac gaacccggcc agcgacatga acgcgtcctc ttggtgttcg atgcccggcg 3448321 ggatgtcgaa gcgccgcaga aacggcagcg ggtccgcgcc gagctcgcgc atcaggtcgg 3448381 tgtaccccca caggttggtg gcgcggatga ggctgcccag ctccatcacc tcctgtcgga 3448441 aaatgataaa aggctgtcgc aaagtgtcaa tacgtggcgg gggtcctcca ccatgctgga 3448501 gccatgaacc agcatttcga cgtcctgatc atcggcgccg gcctatccgg catcgggacg 3448561 gcctgtcacg tgacggccga gttccccgac aagacaatcg ccctcctgga acgacgggag 3448621 cgcctgggcg gcacctggga cttgttccgc tacccgggag ttcgttcgga ctccgacatg 3448681 ttcaccttcg gctacaagtt ccgcccgtgg cgcgacgtga aggtgctcgc cgacggcgcg 3448741 tcgatccggc agtacatcgc cgacaccgcc acggagttcg gcgtcgacga gaagattcac 3448801 tacggcctga aggtcaacac cgccgagtgg tcgagccggc agtgccgttg gaccgtcgcg 3448861 ggcgtgcacg aggcgaccgg cgaaacccgg acctacacct gcgattacct catcagctgc 3448921 accggctact acaactacga cgcgggttat ctgccggact tccccggcgt gcaccggttc 3448981 ggcggccggt gcgtgcaccc gcagcactgg cccgaagacc tcgattattc cggcaagaag 3449041 gtcgtcgtca tcggcagcgg cgcaacggcg gtcactttgg ttccggcgat ggccggctcc 3449101 aaccccggca gtgccgcgca cgtgacgatg ctgcagcgat ccccgtcgta catcttctcg 3449161 ctgccggcgg tcgacaagat ctccgaagtc ctgggccgct tcctgccgga tcgctgggtc 3449221 tacgagtttg gccgcaggcg caacatcgcc atccagcgaa agctctacca ggcctgccgg 3449281 cgctggccca agctgatgcg gcgattgctg ctgtgggagg tacgacgccg cctcggccgc 3449341 tccgtggaca tgagcaactt caccccgaac tacctgccgt gggacgagcg gttgtgcgcc 3449401 gtgcccaacg gcgatctgtt taagacgctg gcctcgggcg cggcgtcggt ggtgaccgat 3449461 cagatcgaga ccttcaccga gaagggcatc ctgtgcaagt ccggccggga gatcgaggcc 3449521 gacatcatcg tcaccgcgac cggtctgaac atccagatgc tgggcgggat gcgactcatc 3449581 gtggacggcg ccgaatacca gctgccggag aagatgacct ataagggtgt gctgctggaa 3449641 aacgccccca atctggcctg gatcatcggc tacaccaacg cgtcatggac cctgaagtcc 3449701 gacatcgccg gcgcctacct gtgccggctg ctgcggcaca tggccgacaa cggctacacg 3449761 gtggcaacgc cgcgcgatgc gcaggactgc gcgctggacg ttggcatgtt cgaccagctg 3449821 aactccggct atgtgaagcg cggccaggac atcatgccgc gccagggctc caagcatccg 3449881 tggagggtgc tcatgcacta cgagaaggac gccaagatcc tgctcgaaga ccccatcgat 3449941 gacggcgtgc tgcacttcgc cgcagcggcc caagaccacg cggcggcctg agcatcatga 3450001 acctgcgcaa aaacgtcatc cggtccgtat tacgtggtgc ccggccactg ttcgcttccc 3450061 gccggctggg tattgccggc cgtcgagtcc tgctggcgac gctgacggcc ggcgcgcgcg 3450121 cccccaaggg cacccgcttt cagcgcgtca gcatcgccgg tgtcccggtc cagcgggtgc 3450181 aaccccccca tgcggcaacc agcgggacgc tgatctacct gcacggcggt gcctacgccc 3450241 tgggcagcgc ccggggctac cgcggcctgg ccgcccagct cgcggcggcg gccggaatga 3450301 cggcgctggt ccccgactac acccgcgcac cgcacgccca ctatccagtg gccctcgaag 3450361 agatggctgc ggtgtacacc cgcttgctcg acgacgggct cgacccgaaa acgaccgtca 3450421 tcgccggtga ttcggctggc ggagggttga ccctggcgct ggccatggcg ctgcgcgatc 3450481 gcggcatcca ggccccggcc gcactcggcc tgatctgccc gtgggccgat ctcgccgtcg 3450541 acatcgaagc gacgcgaccg gcgctgcgcg atccgctcat tcttccgtcg atgtgcaccg 3450601 aatgggcgcc gcgctacgta gggtcctccg atccgcggct gcccggtatc tccccggtct 3450661 acggcgacat gagcggcctg ccgcccatcg tcatgcagac cgcgggcgac gatccgatct 3450721 gcgttgacgc ggacaagatc gaaaccgcct gcgccgcttc gaaaacaagc atcgagcatc 3450781 gccggttcgc gggcatgtgg cacgacttcc atctgcaggt cagtctgctc cccgaagccc 3450841 gcgacgcgat cgccgacctc ggggcaaggc tgcgcggcca cctccaccaa tcgcagggac 3450901 aaccacgggg agtagtcaaa tgagctcatt cgaaggcaag gtcgccgtca tcaccggggc 3450961 cggctcgggc atcggcagag cgttggcact caacctctcc gagaagcgcg caaagcttgc 3451021 cctttccgat gtcgacaccg acgggctggc caaaaccgtg cgcctggctc aagcgctcgg 3451081 cgcgcaggtg aagtcggacc ggctcgacgt cgccgaacgc gaggcggtgc tggcccacgc 3451141 cgacgccgtc gtcgcacatt tcggcaccgt gcaccaggtc tacaacaacg ccggcatcgc 3451201 gtacaacggc aacgtcgaca agtcggagtt caaggacatc gagcgcatca tcgacgtcga 3451261 cttctggggc gtcgtcaacg gcaccaaagc ctttctgccg cacgtgattg cctccggcga 3451321 cggacacatc gtcaacatct ccagcctgtt cgggctgatc gcggtgcccg ggcaaagcgc 3451381 ctacaacgcg gccaagttcg cggtgcgcgg cttcaccgag gcgctgcgcc aggagatgct 3451441 ggtcgccagg catccggtca aggtgacgtg cgtgcatccc ggcggcatca aaaccgccgt 3451501 cgcgcgcaac gccaccgtgg ccgacggcga ggaccagcag acgttcgcgg agttcttcga 3451561 ccgccggctg gcgctgcatt cgccggagat ggccgccaaa accatcgtca acggagtcgc 3451621 caagggccag gcccgcgtcg tggtcggcct ggaggccaaa gccgtcgatg tgctcgcgcg 3451681 catcatgggc tcgtcgtatc agcggctggt tgccgccggc gtcgccaagt tcttcccctg 3451741 ggccaagtag gcccatagag ttctagaaag ggacaccacg atgaaaacca ccgcggcggt 3451801 actgttcgag gcgggcaaac cgttcgagct gatggagctc gatctcgacg ggccgggtcc 3451861 gggcgaggtg ttggtcaaat acaccgccgc cgggctgtgc cattccgacc tgcacctcac 3451921 cgatggtgat ttaccaccgc ggttcccgat cgtgggcggc cacgaagggt ccggggtcat 3451981 cgaggaggtg ggtgccggcg tcaccagggt caagcccgga gaccacgtgg tgtgcagctt 3452041 catcccgaac tgcgggactt gccgctactg ctgcaccggc cggcagaacc tgtgcgacat 3452101 gggggccacc atcctggagg gctgcatgcc ggacggcagt ttccgattcc attcccaggg 3452161 aacagatttc ggcgccatgt gcatgctggg cacgttcgcc gagcgggcca ccgtctcgca 3452221 gcattcggtg gtgaaggtgg acgactggct gccactggaa accgcggtgc tggtgggctg 3452281 cggcgtgccg tccggttggg gcaccgcggt caatgccgga aacctgcggg ccggcgacac 3452341 cgccgtcatc tacggcgtcg gcggcctggg catcaacgcg gtccagggcg cgaccgccgc 3452401 cggctgtaag tacgtcgtgg tggtggaccc ggtggctttc aagcgcgaga ccgcgctcaa 3452461 gttcggcgcc acccatgcct tcgccgacgc cgccagcgcg gcggccaagg tcgacgaact 3452521 cacctggggg cagggcgccg acgcggcgct gatcctggtg ggcaccgtcg acgacgaggt 3452581 ggtctcggcc gcgaccgcgg tgatcggcaa gggcggcacc gtcgtcatca ccgggctggc 3452641 ggacccggcc aaactcaccg tgcacgtctc cggaaccgat ttgacgctgc acgagaaaac 3452701 gatcaagggc tcgctgttcg gttcctgcaa tccgcaatac gacatcgtgc ggctgctgcg 3452761 cctctacgac gccggccagc tgatgctgga cgaactcgtg accaccacct acaacctcga 3452821 acaggtgaac cagggctacc aggatctgcg ggacggcaag aacattcggg gcgtgatcgt 3452881 gcactgacca gcttccacca accacgaatc cagagaggac gatgatgcgc aggctcaacg 3452941 gcgttgacgc gctgatgctg tatctcgacg gcggcagcgc ctacaaccac accctcaaga 3453001 tcagcgtgct cgacccgtcg accgacccgg acggctggtc gtggccgaag gcgcggcaga 3453061 tgttcgagga gcgcgcccac ctgcttccgg tcttccggct gcggtacctg cccacaccgc 3453121 tgggcctgca tcacccgatc tgggtcgagg atcccgaatt cgacctcgac gcgcacgtgc 3453181 gccgggtcgt ctgtcccgcc ccgggcggga tggcggaatt ctgcgcgctc gtcgagcaga 3453241 tctacgccca cccgctggat cgcgaccgcc cgctgtggca gacctgggtg gtcgagggcc 3453301 tcgacggcgg ccgcgtcgcc ctggtcacgc tgctgcacca cgcctactcc gacggcgtcg 3453361 gcgtgctgga catgctcgcc gcgttctaca acgacacgcc tgacgaggcc cccgtggttg 3453421 cgcccccgtg ggagccgccg ccgctgccgt ccacccggca acgcctcggt tgggccctgc 3453481 gggacctgcc ctccaggctc ggcaagatcg cgccgaccgt gcgggccgtt cgtgatcggg 3453541 tgcgcatcga acgggagttc gccaaagacg gcgaccggcg cgtcccgccc acgttcgacc 3453601 gctccgcacc gccgggcccg tttcagcgcg ggctgtcgcg cagccggcgg ttctcctgcg 3453661 aatcgttccc gctcgccgag gttcgcgagg tgagcaagac gctgggcgtc accatcaacg 3453721 acgtcttttt ggcgtgtgtg gccggtgccg ttcgtcgcta tctggagcgt tgcggctccc 3453781 ctcccaccga cgcgatggtg gccacgatgc cgctcgcggt caccccggcg gccgagcgcg 3453841 cccaccccgg caactactcg tcggtcgact acgtctggct acgcgccgac atcgccgacc 3453901 cgctcgagcg gctacacgcg acccacctcg ccgccgaggc caccaagcag cacttcgccc 3453961 agaccaagga cgccgacgtc ggcgcggtgg tcgagctgct gccggaacgc ctcatctcgg 3454021 gcctggcgcg tgccaacgcg cgcaccaagg gccgcttcga caccttcaag aacgtggtcg 3454081 tgtccaacgt gccggggccg cgtgagccgc ggtatctcgg ccgctggcgc gtcgaccagt 3454141 ggttttccac cgggcagatc tcccacggcg ccacgctcaa catgaccgtc tggagctatt 3454201 gcgaccagtt caacctgtgc gtaatggccg acgcagtcgc ggttcggaac acctgggaat 3454261 tgctcggcgg cttccgcgcc tcgcacgagg agctgctcgc ggcggcccgt gcccaagcca 3454321 cgcccaagga gatggccaca tgacccgcat caatccgatc gatctgtcct tcctgctgct 3454381 ggagcgggcc aaccggccca accacatggc cgcctacacg atcttcgaaa agccgaaagg 3454441 acagaaatcg tcgttcgggc cgcgcctgtt cgatgcctac cggcacagcc aggcggccaa 3454501 gcccttcaat cacaagctga aatggctggg cacagatgtt gcggcgtggg aaaccgtcga 3454561 gcccgacatg ggctatcaca ttcgacacct cgccctgccc gcaccgggtt ccatgcagca 3454621 gttccacgaa acggtctcgt tcctcaacac cggcctgctc gataggggcc acccgatgtg 3454681 ggagtgctac atcatcgacg gcatcgagcg cggccggatc gcgatcctgc tcaaggtgca 3454741 ccacgcgctc atcgacggtg aaggcggcct gcgcgcgatg cgcaacttcc tctccgattc 3454801 accggacgac acgacgctgg ccggtccctg gatgtcggcg cagggcgccg accggccacg 3454861 gcgcaccccc gccacggtgt cgcgcagggc gcaactgcaa ggacaactgc aaggaatgat 3454921 caaggggctg accaagctgc cgagcggcct gttcggcgtc agcgcggacg cggcggacct 3454981 tggtgcgcag gcactgagcc tcaaggcgcg caaggcgtcc ctgcccttca cggcgcgacg 3455041 cactctgttc aacaacacgg cgaaatcggc ggcgcgcgcg tacgggaacg tcgagttgcc 3455101 gctcgccgac gtcaaggccc tggccaaggc gaccggcacc tcggtcaacg acgtggtgat 3455161 gacggtcatc gacgacgcgc tgcaccacta cctcgccgaa caccaggcgt ccaccgaccg 3455221 gccgctggtg gcgttcatgc cgatgtcgct gcgtgagaag tcgggcgagg gcggtggcaa 3455281 ccgggtgagc gccgaactgg tcccgatggg tgcacccaag gcgagtcccg ttgagcgcct 3455341 taaggaaatc aacgcggcga ccacacgcgc gaaggacaaa gggcgcggca tgcaaacgac 3455401 gtcccgccag gcctacgcgc tgctactgct cggcagcctg acggtggcgg acgccctgcc 3455461 cctgctcggc aagttgccga gcgcgaatgt ggtgatatca aacatgaagg ggcccaccga 3455521 gcagctctac cttgccggtg cgccgctggt ggcgttcagt ggcctgccca tcgtgccgcc 3455581 gggcgccggg cttaacgtca ccttcgccag catcaacacc gcgctgtgca tcgccatcgg 3455641 cgcggcaccg gaagccgtgc acgaaccctc ccggctggcc gaactgatgc aacgggcatt 3455701 caccgagctc caaaccgaag ccggcacaac gagtcccaca acatcgaagt cgagaacccc 3455761 atgaagaaca ttggctggat gctcagacaa cgcgcgaccg tctcgccgcg gctgcaagcc 3455821 tacgtcgagc cgtccaccga cgtccggatg acctacgcgc agatgaacgc gctggcgaac 3455881 cggtgcgccg acgtgctcac cgcgctgggg atcgccaagg gcgaccgcgt ggcattgctg 3455941 atgcccaaca gcgtcgagtt ctgttgcctg ttctatggcg cggccaagct cggcgcggta 3456001 gcggtcccta tcaacacccg cctcgccgca cccgaggtga gtttcatcct gtccgacagc 3456061 ggcagcaagg tggtgatcta cggtgcgccg tcggcgccgg tgatcgacgc catcagggcg 3456121 caggccgacc ctccgggcac ggtcaccgac tggataggcg ccgactcgtt ggccgaacgc 3456181 ctgaggtcgg cggccgcaga cgagccggcg gtcgaatgcg gcggcgatga caacttgttc 3456241 atcatgtaca cctcgggcac caccggacat cccaagggag tggtgcatac ccacgaatcg 3456301 gtgcattcgg cggccagttc ctgggcctcg acgatcgacg tgcgctaccg cgaccgcctg 3456361 ctgctaccgc tgccgatgtt ccacgtggcg gcgttgacga cggtcatctt cagcgccatg 3456421 cgcggcgtca cgctgatctc gatgccgcag ttcgatgcga cgaaggtgtg gtcactgatc 3456481 gtcgaggagc gggtctgtat cggtggcgcc gtgccggcga tcctcaactt catgcgccag 3456541 gtgcccgagt tcgccgaact cgacgcgccc gacttccgct acttcatcac cggtggcgcg 3456601 cccatgccgg aggccctgat caagatctat gccgccaaga acatcgaggt cgtgcagggt 3456661 tacgcactca ccgaatcctg tggcggcggc accctgctgc tcagcgaaga cgcgctgcgc 3456721 aaagccggct cggccggacg cgccaccatg ttcaccgacg tggccgtgcg cggtgacgac 3456781 ggcgtgatcc gcgagcacgg cgaaggcgaa gtcgtgatca agtccgacat cctgctcaag 3456841 gaatactgga atcgcccgga ggccacccgc gacgctttcg acaacggttg gttccggacc 3456901 ggcgacatcg gcgaaatcga tgatgagggc tatctttaca tcaaggaccg gctgaaggac 3456961 atgatcattt ccggcggcga gaacgtctac ccggccgaga tcgaaagtgt gatcatcggc 3457021 gttcccgggg tcagcgaggt ggcggtcatc ggcttgcccg acgagaagtg gggcgagatc 3457081 gccgccgcca tcgtcgttgc cgaccagaac gaggtcagcg agcagcagat cgtcgagtac 3457141 tgcggaacca ggctcgcacg ctacaagctg cccaagaagg tgatcttcgc cgaggccatc 3457201 ccccgcaacc cgaccggcaa gatcctcaaa acggtgctgc gcgaacagta ttcggcgacg 3457261 gtgccgaagt gatgcacggc ccgagccgct aggacggcgc gagccgcacg atgccgggaa 3457321 cgaggtagcg cgcaacgtac gcacgcagcc cctcgtcgtc atcgagcggg atcgggccct 3457381 ccggcgcagc gaccggtaat ccgttgccgt tgagtgtgtt tgagttgccc gttcatgcgg 3457441 cggcgctcgt cgatctcctc ttgcaccagg gcctcgaccg ccagccgcga gccggtgatc 3457501 cggtcgtagc ttcgggtcca gcggccgccg cggcttgtcg ccccatgcgg tttggatcac 3457561 ccgccgcgtg ctgcggctgg tgtccagcca ggcgattgcc cgcgctcgca cccgctcggc 3457621 gcgcgggttg tcggcgaccc cgaagatgac ctcggcgacg atgtcgagcg cgatcgggcc 3457681 ggcgcggtct cggaaacgga cctcctcgcc cattggccag gtggcgagcg cctttcggtc 3457741 acgcgttcca tggcccgctc gtaggacctc agcgcctttc cgcggaaggg cgggctggcg 3457801 tagcgccggt cggcgcggtg ccttccatgc tgaccagcgt gtgctcgccg aagatcgcgt 3457861 ggtgagccgg tccatcgccg gggtcagctg aaggaccgag ttgctcgccg tgaagaccct 3457921 cttcacgtct tcgggattgg gtcacgcaca gcgcgtcgac ggctccaggc acgttgaaca 3457981 ggaagcgatc aaccgagttt tgtggtgcgc gcgtaaaacc gctgggggcc agccagtatt 3458041 cggctccaaa tgcgatcgag gataggcgca cccggggcct ccatcgggac tcttcgaact 3458101 accaccgctc accttgcagt gcgactacca agcccgccga cgtgtctgcg gcgcagtatt 3458161 cttcacgcac ctggcccgcg tactccccga cccagcaaag gagtccagga atgacatggc 3458221 agatcgtgtt cgtcgtgata tgcgtgatcg tcgccggcgt cgcggcattg ttctggcgac 3458281 tcccctccga tgacacgacg cgcagccggg ccaaaacagt gacaatagcc gccgtggcag 3458341 cggcggccgt gttcttcttc ttgggctgtt tcaccatcgt tggcacccgc cagttcgcga 3458401 ttatgaccac cttcggccgt cccaccggcg taagcctgaa caacggcttc cacggcaagt 3458461 ggccctggca gatgacccat cccatggatg gtgcggtgca gatcgacaag tacgtcaagg 3458521 aaggcaacac cgatcagcgc atcacggtgc ggctgggcaa tcaatccacc gcgctggcag 3458581 acgtcagcat ccgctggcaa ctcaagcagg ccgctgcccc ggaactgttc cagcagtaca 3458641 agaccttcga caacgtgcgc gtcaacctga tcgagcgcaa cctctcggtg gcgctcaacg 3458701 aggtgttcgc cggcttcaac ccgctggacc cgcgaaacct cgacgtgtcc ccgctgcctt 3458761 cgctggccaa gcgcgccgcc gacatcctgc gccaggacgt gggcgggcag gtcgacattt 3458821 tcgatgtcaa tgtgcccacc atccagtacg accagagcac cgaggacaag atcaaccagc 3458881 tcaaccagca gcgcgcgcag acctcgatcg ccctggaagc acagcgaact gccgaggccc 3458941 aggccaaggc caacgagatc ctgtcccgct cgatcagcga cgaccccaac gtggtggtgc 3459001 agaactgcat tacggccgcg atcaacaagg gaatcagccc gctgggttgc tggccgggaa 3459061 gctcagcgct acccaccatc gcagtgccgg gacggtaacc gcgaagattg accccatgcc 3459121 gatccccttt gccgatggga tgctcagccg gctgggtcgc cgcggggcag cgctcgacct 3459181 gatcgaggag ttcgaggacg agtccgggga gccccccgca tccctgagcc ccgccgacct 3459241 gctggccgcc gaaccggccc tgctgctgca gaagatggag aaccgcctcg tccggcacca 3459301 cctagccaat ccggacgtgt tgagcggcga acagctgcgc aagctgcgct acatcctcaa 3459361 tttcgccagg ctggccgact tcgaaccggg ggccgcgggg ccgggcggaa gccgcggtcg 3459421 cggggacatc tcggtgggcg gccaagtcgc gccttggcgg tcccgggtcg tcgacgcgtt 3459481 gtacgcaccg ctgcgcgagg agcccgatcc ggtcacggcg ctggagggcg cgaaagacgt 3459541 gctggcgacg ctggtcgacg accaggacga tcagcgtcga gtgctcatcg agcgccacgg 3459601 cagcgacttc tccgcgacgg aactcgacgc cgaggtcggc tacaagaagc tggtgaccgt 3459661 cctcggcggc ggcgggggcg cgggcttcgt ctacatcggc ggcatgcaac ggctgctggc 3459721 ggccggccag gtgcccgact acatgatcgg ctcgtcgttc gggtcgatca tcggcagcct 3459781 ggtggcccgt gaactgccgg tgccgatcga cgagtacgcc gagtgggcca aaacggtgtc 3459841 ctaccgcgcc atcctgggcc cggagcggcg gcgcagccgc cacgggttgg ccggaatgtt 3459901 caccctgcgc ttcgaccagt tcgcccatac cctgctcagc cgtgcggacg gcgaacggat 3459961 gcgcatgtcg gatctggcaa tcccgttcga tgtcgtcgtc gccggtgtgc gcaggcagcc 3460021 ttatgcggcg ctgccgtcca ggttccgcca tcgcgagcgg tctacactga cgttgcggtc 3460081 gctgccgttt ctgccgatcg gtatcggccc gtgggtggcg gcacgcatgt ggcaagtcgc 3460141 ggccttcatc gacttgcggg tggtcaagcc gatcgtcatc agcgccgacg gcgcgacacg 3460201 cgacgtcaac gtcgttgacg cggcgtcttt ctcgtcggcc atccccggtg tgctgcacca 3460261 cgaaaccagc gacccgcgga tgctgccaat cctcgacgag ttgtgcgccg accaggacgt 3460321 cgcggcgatg gtcgacggcg gcgcggccag caacgtcccg gtcgaattgg cgtgggagcg 3460381 ggtccgcgac gggcggctcg gcacccgcaa cgcgtgttat ctggcgttcg actgcttcca 3460441 tccgcactgg gacccccgac atctgtggct ggtaccgatc acccaggcgg tccagctgca 3460501 gatggtgcgc aacctgccct acgccgacca cctcgtccga ttcgagccga cgctgtcgcc 3460561 ggtgaacctg gcgccgtccg cggcggccat cgaccgggct tgccggtggg ggcgcgacag 3460621 cgtcgaaccg gcgattgcgg tgacatcggc gctgctggag ccgacgtggt gggaaggcga 3460681 caggcccccc gccgccgaac ccaaggaacg cacaaagtcg gcggcctcgt cgatgagcgc 3460741 cgtgatggcc gcgattcagg cgccgacggg ccggtttcgg cgatggcgaa gccgccacct 3460801 gacctagcga cggctacagg gaacgcgacc tcggcggtcg aaagcaaacc aggtgcacaa 3460861 gtgcaacaac aacgattccg atcaccaacc cagtcgccgc gcacgccgcg gtgctgacca 3460921 accaggtcag cgcgccacca gcagacccca ccaggtggtc atctaggtgg tgaaccaggc 3460981 ggtacggggc gtgccagccg aggtggtcgc tgcccacaag cactatgtgg ccgcccaccc 3461041 agagcatggc tcccatcccg accgctgaca gcgccgatag cagtttgggc atccccgcga 3461101 ccaggccccc gccgatccgc tgcccgaatc gggacgcggt ctgggtgagg cgcaggccga 3461161 cgtcgtccat ttggacgatg acggcgacga caccgtacac cgcggcggtg atgacgaggg 3461221 cgacgatgac gaggacgatg aggcgcggca cgaatggctg gtcggccacc tcgttgaggg 3461281 cgatcaccat gatctcggcg gataggatga agtcggtccg gatcgccccg gccaccagct 3461341 cgcgttcggc gacctgcggc gcggcgtcgt ggccacggcc gccgatgacg ccgcacacct 3461401 tttcggcgcc ctcgtagcac agatacgtgg cgcccaacat cagcagcggg gtcaacagcc 3461461 acggcacgag ctggctgagc agcaatgcac cgggaaggat gagcagcagc ttgttgcgca 3461521 ccgacccgat cgcgatgcgt ttgatgatcg gcagctcacg ctcagcggtg atccggtgga 3461581 cgtattgcgg cgtcaccgcc gtgtcgtcaa tgaccactcc cgcagccttt gccgtcgcac 3461641 gaccggcggc ggcgccgatg tcgtcaatcg aggcggcggc cagccgtgca agaaccgcga 3461701 catggtccag cagtccgaac agaccgccgc tcatcgcgac tccgccatca cgatcgaggt 3461761 taccgtctgc cgtcgttgtc gccagcggtg ccgtagagcc cgccgggtcg cagcgctcgc 3461821 agagccaccc ggccccccgg gtcttcagcg gtggcgggca cgaccgcgac gcaatcggca 3461881 ccggcatccg cgtaggcgcg gagccgggcc gccactcgat cggggctacc caacgcacac 3461941 acccggtcga gcagttcgct ggggacagcg accgccagtt cgcggcgagt agcccgggac 3462001 cgcgcgctac ggaccaggcc gtcgaaaccc agcgcgctga acatttcgcc atagccgggc 3462061 ggggcgaggt acaccgccag ctgagctgcc agctgggagt gcgcggccgc accggggttg 3462121 acggcgaccg gcacgcacac cgtgaggcgc ggcgcggcac ggccggccgc ggcggctgcg 3462181 ctgtcgatcg ccgcacgaac ccgcccgaca cggaacggcg atgccaggtt gagcacgacc 3462241 tcatcggcgt gctgcgcggc caggcgaatc atgccaggtc caaacgcccc caacgcaatt 3462301 cgcgtatcgg gcgccgcacc gcgcagccgg aatccgcggc tgttgacgtg acggccgctg 3462361 tattcgaccc gcgcaccggt aaatatcgac cgcaggcatt cgatggtttc gcgcatgacc 3462421 ggcacgtggt gcgcccaagg tcggccatgc cagccggcca cgatcgccgg actggaagct 3462481 cccagcgcga ggtcaacccg acagccggtg agagaagcga ccgaactgac ccctagcgcc 3462541 agccccaccg gaccgcgaac gccgacggct agcggtccga ccttcagcgt catgtttggc 3462601 gtgcggagcc cgatcgaggt cgcgagcgcg aacgcatcgt aggtcgccat ttcgccgatc 3462661 cacagcgcag cgaaacccgt gtcagcggcc gcgagcgcga catcggttgc ctcgtggtcg 3462721 gggcggtcaa gccagaacgg tagggcgact tcgatatcgg tcatagcatc gacacgtcgg 3462781 ccggctggtc gagcaggaca cgcccgggca gttcgcgtga tgcctcgttg acctggaaat 3462841 gggcggtagc ggtgaatgcg tcgcggaacc ggcgctgcag cggtgcgttg tcgtagatgg 3462901 cggtgccgcc cgccagatca tacatgctgc gcaccacgtc ggccgaggtc cgtaccgcgt 3462961 gcgtggccgc caaccgcagc cggttgcgca tcgtcaccgg taccgcctcg gcatcgtggc 3463021 tgacctgcca ggccgcctcg attacctcgt agaacagggc gcgggcggcg cccagcgccg 3463081 actcggcggt tgccgccgcg gcttgggtcg ccgaacgttc cgccaaggtc cgagtggacc 3463141 caagcccttt cttgccgccg gccagctcga ccagatcgtc aatcgcggcg cgcgcattgc 3463201 ccaacgcagc cgcgccaatc gacaacgcga aaaatccaaa caccggaaag cgatacagcg 3463261 gccggtccac gattggtccg tcaaacaccg agaacacgcg atcagcgggc acgaagacgt 3463321 cgtcggcaac gcagtcgtgg ctgccggtgc cacgcaaacc caatgtgtgc caagtgtcga 3463381 ggacctgcag ctcgtccttg ttcagcgcga cgaccgacgg cacttgccgg tcgtcgacga 3463441 agcagccggc gaacatgatg tccgcgtggt tgatcccgct gcaaaacggc cagcgtccgg 3463501 acaccacgac accgccgtcg acggaccggg ccgtgccacg tggcgcccac acccccgccg 3463561 cgacaccccg ccccccgccg aacatttcct cgcggctgcg cgccggcagg taggcgacca 3463621 gcagggcact ggtaatcgcg atcgacacac accatcccgc tgacgcgtca ccacgcgcca 3463681 ccgcctcggc gcaccgcagc gcccgcccgg gtgccagctc cggcgccgca acctcacgcg 3463741 gcatggtggc gcgcagcaag ccggcctcgc gcagccgggt caccagctcg tctggcagcc 3463801 gacgatcgcg ctcgatttcc gcggatcgcg ctcgggccca ccgcgcgatc ttctcggcga 3463861 ggatctcgat ctcggtttcg ctttggttca cgggcggctc ctgatgacgg tggcggttca 3463921 atgaagttac cacccttggt tcagtcattg aaccaggtac agttggtgga ccatggccgt 3463981 ttccgatcta tcccaccgct tcgaagggga gtcggtcggc cgggcgctcg agctagtcgg 3464041 tgaacgctgg acgctgctta tcctgcgtga ggcgttcttc ggggtgcggc ggttcggtca 3464101 gctcgcgcgg aaccttggca ttccgcggcc cacgctgtcc tcgcggctgc ggatgctcgt 3464161 cgaggtgggt ctttttgacc gggtgccata ttcctccgac cccgagcgac acgagtaccg 3464221 gctcaccgaa gcgggccgcg atctgttcgc cgcgatcgtc gtcctcatgc agtgggggga 3464281 tgagtacttg ccacgcccag aaggaccacc gatcaagctg cgccaccaca cctgcggcga 3464341 gcacgccgac ccacgcctga tctgtaccca ctgcggcgag gagatcaccg cgcgcaatgt 3464401 gacacctgaa ccggggccgg gctttaaagc caagctggcg tcctcataac gattcccaac 3464461 ctcaaattgt tgcgaatcga taatgcaagc cgaaccacgt cgccgaacaa ggccgtacac 3464521 cttggccggg aaactatcgt cattttgtgc accgtcgaac ggccctgaag ctcccgctgc 3464581 tgctggcggc aggcacggtg ctgggccaag cgccgcgggc cgccgccgaa gaaccaggcc 3464641 ggtggtcggc cgaccgcgca catcgctggt atcaagcgca cggctggctc gtcggtgcaa 3464701 actacatcac ctcgaacgcc atcaaccagc tcgagatgtt ccagccaggc acatacgatc 3464761 cccggcgcat cgacaacgag ctgggccttg cgcggtttca cgggttcaac accgtgcgag 3464821 tcttcctcca cgacctgctg tgggcccaag acgcgcccgg tttccaaacc cggctcgcgc 3464881 agttcgtcgc catcgcggcg cgataccaca tcaaaccgct ctttgtcctg ttcgactcct 3464941 gctgggaccc gctccccaga ccgggtcggc agcgggcgcc aagggctggg gtgcacaact 3465001 ccgggtgggt gcaaagtccg ggtgctgaac gcctcgatga ccgccgctat gccagcacgc 3465061 tgtacaacta cgtcacgggt gtgttgggcc aattccgcaa cgacgatcgc gtgttgggtt 3465121 gggacctgtg gaatgaaccc gacaatcccg cgcgcgtgta tcgcaaggtg gaaaggaaag 3465181 acaagctcga gcgcgtcgcg gagctcctcc cccaagtgtt ccgatgggcc cgcacggtcg 3465241 atccggttca accgctgacc agtggtgtct ggcaagggaa ttggggagat cccggacgcc 3465301 gcagcaccat cagcgccatt caactcgaca acgccgacgt gatcaccttc cacagttacg 3465361 ccgcgccggc cgaattcgag ggccgcatcg ctgagctcgc tccgttgcag cggccaatcc 3465421 tgtgcaccga gtacctggcg cggtcccaag gcagcactgt cgagggaatc ctgccgattg 3465481 ctaagcggca caacgttggt gcgttcaatt ggggtttggt ggcgggaaag actcagacct 3465541 atttgccgtg ggattcgtgg gatcacccct accgcgcgcc cccgaaggtg tggtttcacg 3465601 acctgctaca ccccaacggc cggccgtatc gggacggcga agttcaaacg attcggaagc 3465661 tgaacgggat gccgagccag gactaggctt tccccagccc gcattgggcg cggctcgccg 3465721 aatgcgagcc cgacacctac tgaaaaccat gtgcgcggtc ggcctggcgg aaccggatca 3465781 ggcggcgata ccgagttgct ggttaatctg cggccaggac agcaaacccc agggggtgag 3465841 cagtatccag tcgtggattt gccagggggc cagtacgaag ctgaacggcg ctccttggac 3465901 tacggctgtg tgctcgagga caaccgcttg ttgtgcgagc ggatcaagcg agcccgaata 3465961 gacatacgtc ggcggaagac cgttcagcga cccatacagc ggactgacca gcgggtcgtt 3466021 gaccgcaaga ttgcctgccc acgcctggct gatctgccag gtccccacat cgagccacgg 3466081 ggacagcaac accatggacg acggtactgg gttgccctgg ctcaccatgt attgggcggc 3466141 cgccagtgcg aggttgccgc ccgcggagtc cccgaccacg ctgacgttgg agaccccgtg 3466201 ttgcgcgatt tgcgtggaga tgagcccggc catcgccggt actaccgtcc cggcagtgcc 3466261 tccttcctgc accaacgggt aaatcggcac ttgcacggtc gcgccggtct ggtaagccgt 3466321 caccgagtag ttgagccagt ggaagattga cggcggcagg ataaacgcgc cgccgtgaat 3466381 ggcaaccacg tattcgccgg ttggatgagc cggcgtgatc tgcacgacgc tcatcccgtc 3466441 ataggtggtg tactggaccg tctgtcccag cagcgagttc agcaacggcg gtggggagtt 3466501 gccaagaaac cacgacagcg gcggtatgtc gctggcaatg agcgctaaaa gtggattgtt 3466561 tgggattgca aagtgagttt cgagcgcaga caaactcagc aggggtttca ccggccacag 3466621 cgaagcgatg tcgaatccgg cagcccctga cggcgtgccg gtgaagatcc ctgcctgcgc 3466681 cgccgcgaaa ggcggaacct gcgtgaatcc ggcggccagc gccgtggggg cccgctgaat 3466741 ttcctggtga atcgtggcaa aaccgttccc gataccgctg gcgaattcac tctgcaacag 3466801 cgaagcgttg gccagctcgg cggcggcata ccccttggcc gcgcctgtca atgcctgcac 3466861 aaaccgctca tgaaacaccg caagctgtgc gctaagagct tgatagtcct gaccatggcc 3466921 ggaaaacaaa gccgcgatcg cggctgacac ctcgtcctcg gcagcggcta ataccgtcgt 3466981 ggtggcaccc gcgacaccct ggctcgccgt cgcgaccacc gaaccaatcg aagccacgtc 3467041 tgtggccgcg gcggacatca cctccggcaa cgcaacaaca taagacacca cgccgctccc 3467101 gccacctcac ggcaacttcc ccagttgccc agccactacc gatcgccgag tagccggagc 3467161 ttatgcccac gccgagtagt cacgtgccag tttgcgcgaa ttcccaaagt tagaccggca 3467221 aacgtgacgg caccgatccg tgtggtgcag ccgccgggaa tcgaacactc tccgacgcaa 3467281 aacgacctgc gattacgcgc ggggcgttga tggcgtcaag aaggaatgag gcggcgaacg 3467341 cgggcgttgg ggtgccgcta tgcgttgaac aattgctata cgattgtgca acatcagcta 3467401 tcgtcgtact catgaccgcg accatcggct tccgacctac tgaaaaagac gagcagatca 3467461 tcaacgccgc aatgcgcagc ggcgagcgca agagcgacgt catccggcgg gcactgcagc 3467521 tgctcgaacg ggaagtgtgg atcaagcaag ctcgcaccga cgctgagcga cttcgagacg 3467581 aggatgtctc cactgaaccg gacgcgtggt gattcgggga gcggtctaca gggtcgactt 3467641 cggcgatgcg aagcgaggcc acgagcaacg cgggcggcgc tacgccgtgg tcatcagccc 3467701 cggctcgatg ccgtggagtg tagtaaccgt ggtgccgacg tcgacaagcg cccaacctgc 3467761 ggttttccga ccagagctgg aagtcatggg aacaaagaca cggttcctgg tggatcagat 3467821 ccggacgatc ggcatcgtct atgtgcacgg cgatccggtc gactatctgg accgtgacca 3467881 aatggccaag gtggaacacg ccgtggcacg ataccttggt ctgtgatggc cgtcgcatct 3467941 gcaaatgggc caccgacctg gcccttcggt ggagctgccg ggaatcgaac ccgggtccta 3468001 cggcattccc tcaaggcttc tccgtgcgca gttcgctatg cctctgctcg gatctcccgg 3468061 tcacgcgaac tagccgagat gacgatccca gtcgctgtgg ttgtcccgag gagtcccgcg 3468121 accggactca tcggtggatc cctctagctg atgccagggt ccgggccgag ggcgttcccg 3468181 gtctgacaga ctagccgtcg cttaggcagc gagagcgtag tcgcgctgat gtgaatcggc 3468241 gcttatttgg tcgcaacgac gcttacggtg gtctcttgcc tgcaccggca cgcttccctt 3468301 gattcgatgc gcgaagtcga aaccgttcag cccctcgcat ccctgccgac cttcggcagg 3468361 accatcaatc ctacgccgct ctcaacaacc ggcaacgcca ttaacttccc ggtcagatca 3468421 cgaagttcag gcgctcgagg atgtgaccgg ccagctcctt gtcgccgccg agttccacat 3468481 cctggctgcg cgccgggctc atcgggcgcc cgccggcgag cctggtgaac tgcagtccgt 3468541 ccaggcggat cgtcgccgtc ggcgccggcc caccgaagtc gtcgaccacc cgcgctcgac 3468601 cgtccacgga aacgcggatg ctgcgagaca gcgggccggt cagctccaac agcacgcggg 3468661 agccgtcggg cgctttggcc agcttgccga cgacgaaccc catggtggcc gctatctcat 3468721 cgaggaccag cggtgacgcc ggcccgccga gttcgtcgtc ggacgacggg cgctgcaccg 3468781 ccgcgcggat gtcctgttcg tgcatccagc agtcgaagat gcgtatccgc atgaaccgcc 3468841 cgtagctgtc ggggcccgag ggggtggtcg tcggcgcatt ccattcgtca tcggaaaggc 3468901 tcgctaagac cttgcggcgc tggctagtca ctgcgcgaaa ccgctccagc aagcccacac 3468961 ccgattctgt gcccagatga cgcacccagc actcgttcat cacgccgatg gggttgcgga 3469021 catgcgcaag cgcagagacg tctgtgtctg gttctggtgc ggcgatgccg agcagaaatg 3469081 actcggtgcc gatgatgtgc gacaccacgg ccttgacgtc ccaaccgggc agcggactcg 3469141 ttgcctgcca gtccgtctcg agcagtccat cgagcagcgc atccagggag tgccaaacgg 3469201 cgaacagccc ggccagcacg tcggacttgt ccagtgtggt aaggggacgg cccggtgtgg 3469261 tcacaaagtg atgctaaacc tcacattgcc cagttctcga tcaggtcatg cccttagcgc 3469321 gccgacccaa ctcgcggagc acttcacgct gggcatcacg acgggccatg tcctggcgtt 3469381 tgtcgcgggc ttgcttgcct cgggccagcg caagctcaac cttgaccttg ccttcggcga 3469441 aatacagcga caacggcacc agggcgaagt tgccttcgcg gatcttgccg accaaggtgt 3469501 cgatctggcg gcgatgcaac agcagtttgc ggttgcgtcg cggctcgtgg ttggtccagc 3469561 tgccgtgccg gtattccggg atgtgcgcgt tgcgcagcca cacttcgccg tcgtcgatgg 3469621 tggcgaacga atcggccagc gacgcctgcc cttcccgcag gctcttcacc tccgtgcctt 3469681 gcagcgcaac cccggcctcg aacacctcga tgatcgaata gttgtgccgg gctttgcgat 3469741 tgctggcaac gatctgccgg ccgccacgcg acgacttgga cacagctatc gccgcacgta 3469801 gaggcgcagc gttaagtaag ccgtcaaccc cgacatcgcc acgcccaaca gcagcagcca 3469861 cggcgtgatg aagaggatgt ccgcatagtc aaccttggca atgagattgg cttgataaaa 3469921 ctggttgagc gcattctcca ggaacaaagc ccgcaccacc atcaagcccg ctacggcgat 3469981 gccgacaccc atcgtcgcgg ccagcatcgc ctccactagg aacggcagct gggtgtacca 3470041 gcggctggca ccgaccaagc gcatgatgcc gatttcggtg cgccgcgtat aggcagccac 3470101 ttggaccatg ttggcgatca acagaatcgc cccgatggcc tgaaccagcg cgaccgcgaa 3470161 cgcggcattg ctcaaaccat caaggaccgc gaacagccgg tcaatcagct ccttttgatt 3470221 cagcacgtcc aagacgccgg gctgcccctt catagcggtg tcaaagtcct tgtgctgctc 3470281 ggggttctcc agcttgacaa tgaacgacgc cgggaacgaa tccttgcccg ccacgtcctt 3470341 gaactgggga aacttgcgga tggcatcgtc ataggcctgc tggcggttaa ggaaacgcac 3470401 cgctttgacg tcggatcgcg tttcgatctt ctcccgtaac gctttgcacg cagtggtatc 3470461 gcaggacgag tcgttggcgg aaacgtcttc ggtgagaaag acctgagatt ccacccggtc 3470521 gagatagatg gcccgggagc tgtcggccaa ccggaccacc aacataccgc cgccgaacaa 3470581 tccgaccgag atcgcggtcg tcaggatcat cgcgatcgtc atggtgacat tgcgacgaaa 3470641 gccggtcagg acctcattta gcaggaaacc gaaacgcact tagcgatcca tcccgtagac 3470701 gccacgctgt tcgtcgcgta ccagcctgcc cagggacaac tcaaccaccc gttggcgcat 3470761 cgagtcgacg atgtggtggt cgtgcgtggc catcagcacc gtcgtgccgg tgcggttgat 3470821 ccgctccaat aagtccatga tgtccctact ggtctccggg tcgaggtttc cggtgggctc 3470881 gtcggccagc agtaccagcg gccggttgac aaaggcgcgg gcgatcgcaa cgcgctgttg 3470941 ctcgccgccc gacagctcgt ctggcagccg attggccttg ccggacagac cgaccgtctc 3471001 gagcacttcg gggaccaccc ggttgatcgc gtcggtgcgt ttgccgatga cctccaatgc 3471061 gaaggcgacg ttgtcgtaca ccgtcttctg ctgcagcaac cgaaagtcct ggaagacgca 3471121 gccgatcacc tgacgcagct tcggtacgtg gcgaccgcgg agtttgttga catgaaactt 3471181 cgagacccgg acatcaccac tggtcggcgt ctccgctgcc agcagcagcc gcatgaaggt 3471241 tgacttgccc gaacccgacg ggccgatcag gaagacgaac tcacccttgt cgatcttgac 3471301 gttgatgtca tccaacgccg gacgcgccga cgatttgtac tgcttggtga catggtccag 3471361 ggtgatcatc acggcacgcc agtgtagcgg tgagattagc gggcaggcga aatcaacggg 3471421 tcggtggctc ggatttgggg taggtgccgg ccgtcggacc cggcccgggc tgcggtagcg 3471481 gtgccggtgg tgttggggtc gtggtgcccg ggccgaacgg cggcggcaac tcaaacggcg 3471541 gcgggacagc cgaatcggtc gtggtttcgg gcgggctgac cggcggtgtg ctcgacgtgg 3471601 tggtcggcgt cgccttgacg gtgggtggtt gcactctggt tcgcggcacc caggtgtagt 3471661 caggatcggg cacgaagccc ggcggcacca cctgggtcgg cggagagtca ccaggacctg 3471721 gtgcctgtgg cctataggtc tcgtaaatcc accacaccgc caggaacgcg gcgatcaaca 3471781 ccagggtcga cgtgcggatc cggccgaaca gatagcccgg ccagtgccgt ttctggttgc 3471841 tgagcttcac gctactgctc cggactttct gccaccgcgg cccgcgcatc ggccgcggtg 3471901 actatcccgg cgcgggtgag cgcgcggatc accagcaccc gcaactgccg gcccgcctcg 3471961 aactgcttgc cgggtagggt gcgggccacc agtcgcaggg tgacggtgtc cacttcgatg 3472021 cgctccacgc ccatgaccgt gggctcatcc aacaacagct ctcccagcag cgagtcgtgg 3472081 cgcgcgtgct cacactcctg atgcaagacc tcgttcacgc ggccgagatc ggcgctggtc 3472141 gggacgggga tgtccacgac cgcgcgggcc cagtccttgg acaggttgac cgacttgacg 3472201 atgttcccgt tgggaacggt gaacacctca ccctcgctgg aacgcagctt ggtcacccgc 3472261 agcgtgacgt cctccaccgt gccggccgcg ttctccggtg accccaccat gctgagttcg 3472321 accaaatcgc cgaacccgta ctgcttctcc acgatgatga agaacccggc gagtaggtcc 3472381 tgcaccaggc gttgggcacc gaagcccagc gcggcgccga gcaccgccgc cggccccacc 3472441 aacgcaccga ccggaaccgg caacacatcg atgacctcgt acacaacgac gacatagatg 3472501 aggacgatcg acacccacga gatcaccgac gctacggcct ggcggtgctt ggttgcctcc 3472561 gagcgcacca acgcgtcgct ttcggtaaac cccaggtcga ggcgccgggt cacccggttg 3472621 gcaagccaag tcacgaagcg ggccgccagc accgctgcga tcagcagcat gacgatgcgc 3472681 aggccccggt tgaggatcca gtcgccgatt tcaccgcgcc agaagttatg ccagtgctgt 3472741 gctatcgagg tggccagaac tgtgccgcta gtcgtcatta cgtcgattgc gccaccggat 3472801 cccggcttcc aggaatccgt cgaggtctcc atccagaacg gccgccggat tgccgacctc 3472861 gtactcggtg cgcagatcct tgaccatctg atatgggtgc agcacatagg aacgcatctg 3472921 gttaccccag gagctgccgc cgtcggcctt caacgcgtcg agctcggcgc gttcttctaa 3472981 gcgcttgcgt tccaacaact ttgcttgcag aacccgcatc gccgcgatct tgttctgcag 3473041 ttgggacttc tcgttctggc aggtgaccac gataccgctg ggaatgtggg tgagccgcac 3473101 cgctgagtct gtcgtgttca ccgattgccc gccgggcccg ctggagcgat agacgtcgac 3473161 gcggacatcg ccctcgggga tgtcaatgtg gtcggtggtc tccaccaccg gcagcacttc 3473221 gacttcggcg aacgacgtct gtcgccggct ctggttgtcg aacgggctga tccgcaccag 3473281 ccggtgggtg ccctgttcga ccgacaacgt gccgtaggcg aacggtgcgt gcacggcgaa 3473341 cgtggcgctt ttgatgccgg cttcttcggc ataggaggtg tcgaacacct cgacggggta 3473401 tttgtgctgc tcggcccagc ggatatacat ccgcatcagc atctcggccc agtctgcggc 3473461 gtccacccca cccgcgccgg accggatggt gaccagcgcc tcacgctcgt cgtattcccc 3473521 cgacagcagg gtgcgcacct cggtggcctc gatgtcggcg cgcaacgact tgagctccgc 3473581 gtcggcctcg gcgacggcat cggcggcggc cgcgcccgct tcctcggcgg ccagctcgta 3473641 gagcaccggc aggtcgtcca ggcggcgcct tagctcctcg acgcgccgca gctctccctg 3473701 ggtgtgcgac aactcgctgg tcacccgctg cgcccgggtc tggtcgtccc acaagtgcgg 3473761 atcagatgcc tcatgctcga gcttctcgat gcggctgcgc agaccctcga cgtcgagcac 3473821 ccgctccacc gtggtcaggg tgcagtccaa ggcggcgatg tcggcttgac ggtcggggtc 3473881 cacagcagcc aaggttaccg gcatcagcgt ctagcatcag atgaccgtca tgtgcaccgc 3473941 acgactgcgg cccagcccat tcgcagcccc ttgcgccgca gccgggcaca acacagaggc 3474001 tcgagtatgc gtccctatta catcgccatc gtgggctccg ggccgtcggc gttcttcgcc 3474061 gcggcatcct tgctgaaggc cgccgacacg accgaggacc tcgacatggc cgtcgacatg 3474121 ctggagatgt tgccgactcc ctgggggctg gtgcgctccg gggtcgcgcc ggatcacccc 3474181 aagatcaagt cgatcagcaa gcaattcgaa aagacggccg aggacccccg cttccgcttc 3474241 ttcggcaatg tggtcgtcgg cgaacacgtc cagcccggcg agctctccga gcgctacgac 3474301 gccgtgatct acgccgtcgg cgcgcagtcc gatcgcatgt tgaacatccc cggtgaggac 3474361 ctgccgggca gtatcgccgc cgtcgatttc gtcggctggt acaacgcaca tccacacttc 3474421 gagcaggtat cacccgatct gtcgggcgcc cgggccgtag ttatcggcaa tggaaacgtc 3474481 gcgctagacg tggcacggat tctgctcacc gatcccgacg tgttggcacg caccgatatc 3474541 gccgatcacg ctttggaatc gctacgccca cgcggtatcc aggaggtggt gatcgtcggg 3474601 cgccgaggtc cgctgcaggc cgcgttcacc acgttggagt tgcgcgagct ggccgacctc 3474661 gacggggttg acgtggtgat cgatccggcg gagctggacg gcattaccga cgaggacgcg 3474721 gccgcggtgg gcaaggtctg caagcagaac atcaaggtgc tgcgtggcta tgcggaccgc 3474781 gaaccccgcc cgggacaccg ccgcatggtg ttccggttct tgacctctcc gatcgagatc 3474841 aagggcaagc gcaaagtgga gcggatcgtg ctgggccgca acgagctggt ctccgacggc 3474901 agcgggcgag tggcggccaa ggacaccggc gagcgcgagg agctgccagc tcagctggtc 3474961 gtgcggtcgg tcggctaccg cggggtgccc acgcccgggc tgccgttcga cgaccagagc 3475021 gggaccatcc ccaacgtcgg cggccgaatc aacggcagcc ccaacgaata cgtcgtcggg 3475081 tggatcaagc gcgggccgac cggggtgatc gggaccaaca agaaggacgc ccaagacacc 3475141 gtcgacacct tgatcaagaa tcttggcaac gccaaggagg gcgccgagtg caagagcttt 3475201 ccggaagatc atgccgacca ggtggccgac tggctagcag cacgccagcc gaagctggtc 3475261 acgtcggccc actggcaggt gatcgacgct ttcgagcggg ccgccggcga gccgcacggg 3475321 cgtccccggg tcaagttggc cagcctggcc gagctgttgc ggattgggct cggctgatca 3475381 gcgaccgagc aacacccctg ggttgaggat cccggccggg tcgagtgcgg acttcgccgc 3475441 ccgcagggcc gccgcgaacg ggtcgggacg ctgccggtca taccaagcgc ggtggtcgcg 3475501 accgaccgca tggtggtggg tgatggtacc gccactggcg ctgatcgcct cggacacggc 3475561 agccttgatc tcgtcccact gcgcgtcgag cgacccccag cgcccgccgg catagatgcc 3475621 gtagtaagga gccgggccgt ccgggtagac atgggtgaat cgacaggtca ctactccggt 3475681 cccgcatacc ttccagatcg cggtccgagc ggcatcggtc accgcggcat gtagagtatc 3475741 gaatccgtcc caggtgcaag cggtttcgaa tgtttcggcg ataactccgc ggcgaaccag 3475801 cgcgtctcgt tgatacggca tgcgcagaaa cgccgagcgc cagttcgcgg ctgcgttgtg 3475861 ttccgttgcg tcgcttgtag ttccgcggct acgttgcgcg gtcaccgtgc cgccgtgttc 3475921 ggcggtgatc gccaccgccc ggtgcagcca cgggtctatc gggtggtcgg cagactcgaa 3475981 cgccaacacc aacagcccgc caccaacgga cgtgccggca ttcagcaacg cctcggccgg 3476041 atccaacagc cggcagttgg ccgggtacag ccccgcctga gcgatcgtcc gggtcgcggc 3476101 gaccgcggcg gcccagtcgt caaacaccac ggacaccgtg acctgccatc gcggacggtg 3476161 ttgcagccgc atccacgcct cggtgatgat gccaagcgtc ccctcggacc cgaggaacaa 3476221 ccggtccggg gatggtccgg caccgcttcc gggcagccgc cgggactcgc tgatccccac 3476281 cggggtgaca atccgcagcg attcggtcaa gtcgtcgata tgggtataga gcgtggcgaa 3476341 gtgtccgccg gagcgggtgg ccaaccagcc accgagagtc gagaagccga aggactgcgg 3476401 gaaatggcgc agtgtcaaat cgtgtgggcg aagctgatgc tcgatcgagg ggccgaacgc 3476461 acccgcctgg atgcgcgcgg cacggctgac acggtcaatc tcaagcaccg cgctcatggc 3476521 agtgacgtcg accgtgacca ccggctcatc gaagcgcggc tcgacaccgc caaccaccga 3476581 gctgccacca ccgtatggga tgaccgcaat cccctcgcgc gcacaccaat ccagcacgtc 3476641 gatcacgtcc tgctcgctgc ggggtcgggc gatgaggtcg ggcaggtggt cgagctggcc 3476701 ctgcaggttg cgtgcgatgt cgcgatacgc tttgccgcgc gcgtgtccgg cccgatcgac 3476761 gagatcgctt gagcagagcg cggccagcga tgccggcggg ctgacccgtg gggccgccaa 3476821 accgagcgcg gtcaggtccg gcggcgggtg gtcgctcagg tcatggccgg acaccagtgc 3476881 cgcgactcgc gactgtagcg cttgcgtctc ctgatcggag agcgcgtcct cgactgtgcc 3476941 ccaaccccac cacgaacgca tgctgatggt gtcagcgttt gaggacgatc atggctccgc 3477001 cgacgaccac cagcaccagg gccgcgacga tagcccatcc agcaccggct agccaccaca 3477061 tgacacccaa tgcggcgagt accggcgaca gcgcgaagaa caccattacc gggtgctgcc 3477121 taatcactgc gagggcactg gtcgcccgga ctcgatcgat ttccttgcct ggcatgccct 3477181 tcaggatgcc agctgactac cacaatgcaa gcagcgatga gccgacgaac cgtcatcctt 3477241 ggcctgctcc cgctcgctgt tgtcgtcacg aatggcgcac gatgcggcgc accaatgcct 3477301 gtgaccgaag gcggttcggg ctgtcattga caattcatga agatgcctgc cgcatcatat 3477361 ccgttgtgcc cgttgttcta gaagtccgac gtgctgagcc tgcccacccg gcgaccccat 3477421 atccggaacc cctcgcgcgc tgcagccgct cacctggtct gaacgaaagc tcgcacatga 3477481 gtggtcggat tccgccctaa caacgcgcca taaacgcagg ctcatgcgct gcgccacgat 3477541 gcgccgatgc atttcggtaa cgattgttag ttaacccttg tacgaaactc tcttgaggcg 3477601 ctctaaccga ctgcgtccaa agtggaggat cgaaaagatg ataggaaaat gagtacgcct 3477661 acgctgcctg atatggtagc tccatccccg agagtgcgag taaaagaccg ttgtcgccgg 3477721 atgatggggg acctacgcct ttccgttatc gatcagtgca atttgcgatg ccgttattgt 3477781 atgcccgaag agcactacac atggttgccg cggcaagatt tgctatccgt caaagaaatc 3477841 agcgccattg tagatgtttt cctttccgtt ggggtaagta aagttcgaat caccggtggc 3477901 gaaccgctga tccgcccaga tttgccggaa atagtgagga cattgagcgc aaaggtcggc 3477961 gaagattcag gtctgagaga cttagcgatc acgacgaacg gcgtccttct cgccgaccgc 3478021 gttgacggcc tgaaggctgc gggtatgaaa cgcatcactg tcagtcttga tacgttgcaa 3478081 cccgagcgct tcaaggcgat aagtcagcgt aatagccacg ataaggtcat cgcgggtatc 3478141 aaggctgtcg cagccgcggg atttacggac acaaaaatag acacaacggt gatgcgtggt 3478201 gccaatcacg atgagctggc tgatctgatc gaattcgctc ggactgttaa cgcggaagtc 3478261 aggttcattg agtacatgga cgtcggcggc gcaactcact gggcatggga gaaggtcttt 3478321 accaaagcga acatgctcga gtcccttgag aaacggtatg gacgtattga gcctttgccc 3478381 aaacatgata cggcgcccgc caatcgatat gcgcttccgg acggaactac cttcggaatt 3478441 atcgcgtcga caacggagcc attctgcgca acctgtgacc gttcacggtt gaccgccgat 3478501 ggcttatggc tgcattgctt gtacgcaata tcgggtatca acctaaggga gccgctgcgt 3478561 gcaggcgcga ctcacgatga cttggtggaa accgtgacaa ccggatggcg gcgacgaacg 3478621 gatcgcggag cagagcagcg tcttgcccaa cgcgagcgcg gagtgttcct gccattaagc 3478681 acgttaaagg ccgacccgca tctggagatg cacaccaggg gcgggtaagc cgaacgaaca 3478741 gtcgattgat caacgactcc acagttgagg aaggaaccat gacggtcagc acccctgagc 3478801 aacacgagca acgagcatcc cacgatgcat ccgagggaaa gcacaacgta tgtcagggga 3478861 ggctggccgc acttgccgac gcggccgtgt cagagaaact cggagcacta cctggctggc 3478921 agcttctcga catgcgactc agccgcgctt ttcagtgcac aaatttcgac caatccattg 3478981 acttcatgaa tagggtcgca tcaatagcaa acgatatcaa tcaccatccc gatatcgctg 3479041 tactggacaa gcgttcggtg cgcgtgacgg cgtggacgcg caagctgggc tatctgaccg 3479101 acatcgactt cgatcttgcg gcgtccgtcg aggcgatgta tgcgacagaa ttcgctgaca 3479161 ggccagcacg atgatcgacc atgcactcgc gctgacacat atcgatgagc gtggtgcggc 3479221 acgaatggtc gatgtgtccg agaaacccgt gactttgagg gttgccaaag cgtcagggct 3479281 cgtgatcatg aagccgtcta ccttgaggat gatttccgac ggtgccgctg ctaagggtga 3479341 cgtcatggcg gcggcccgga tagctggcat cgcggcggcg aaacgtacgg gtgatcttat 3479401 tccgctatgc cacccgttag ggctcgacgc tgtcagcgtc actatcacgc cgtgcgagcc 3479461 tgaccgggtg aagattctgg cgacaaccac cacgctgggg cgtaccggcg tggaaatgga 3479521 agcgttgacc gcagtttcag tcgccgcctt gactatctac gacatgtgca aagccgtcga 3479581 tcgagccatg gagatttctc agatcgtgct ccaagagaaa agcggcggcc ggtccggagt 3479641 ttatcgccga agtgcttctg atttggcctg tcagtcccga taagtaggtg agtgtctgaa 3479701 tgattaaagt gaatgttctt tacttcggtg ccgttcgtga ggcgtgtgac gaaacgcctc 3479761 gggaggaagt agaggttcag aacggtaccg atgtcgggaa tcttgttgat caactccagc 3479821 aaaaataccc tcgccttcgc gatcattgtc agcgagtaca gatggcggtc aaccaattca 3479881 tcgcgccgct gtcgaccgtt ctcggcgatg gtgatgaggt cgccttcatc ccgcaggtag 3479941 ccggaggctg aacaagggga tgaccggccg tgaatgcgct ctcatcgtcg ccgctgttcg 3480001 gcaacgtggg agttccagtg ccggcgtgca gaacgaccga aattcgccgc acccgaatag 3480061 tcgggtcgca tagatgacca gcagggatgg attcaccatc gtttgggatt ggaacgggac 3480121 gctgtgcgac gaccggacaa ttcttctcga cgcggttggg cagacgctgg tcaacgaggg 3480181 attcgagcct ctttcgcaac agcagctgat ccaacggttc gcacgcccac tacgaacgtt 3480241 tttcgagaat gcgtgcggtc gagatctctt gacgtccgag tgggaacgcg tccaatccac 3480301 ctttcgccga atctatcgat cgcgagaagc tgaagtcaca ctcgtcgaag atgcgtacga 3480361 cgttctggcg cagggaaacc gcagcgccgc tgggcagttc ttattatcgc tggcgcctca 3480421 cgacgagctt atgcacttcg tccaaaaata cgggattgcc aagtggttca acggaatccg 3480481 tggccggact cggcccgacc aagaaaaacc catgatgctg gcagaactga tcatgcagcg 3480541 ctctctgaat cccactcgcg tggtgcacat cggcgattcg cttgaggacg ccgctgctgc 3480601 cagcgcggtc ggagccattt ccgtcttggt caccggagct tcactgcagc cacccgaccg 3480661 agtcatgctc aaacagttgc agcccttcgt tgcgagttcg ctgaagcaag cactgcagta 3480721 cgcgggtggc gacggtgatt gacgacgaag gtacgcaggt ggtggcggcg cgcctgccgt 3480781 tcggatggtc agccgacagt ggggtgacag ccgacatcat cgaggcagcg atggaacttg 3480841 cgatcgacac agcgcgacat gccacggcac cgtttggcgc tgcgctgctt gatgttacga 3480901 cactccgagc attctcgggt ggcaacacct attttgaatc gggggatcgc ttcgctcacg 3480961 ccgaaaccaa cgttctacgg gccgcaatga gcacattgcc ggagctttca aatcacgtgc 3481021 tgatatccac cgccgagcca tgcccgatgt gcgcggcggc cagcgtgctc agcggagtga 3481081 gagccatcat cttcggcaca tcaatcgaga cccttatcca gtgcggttgg ttccaaatcc 3481141 gcatcagcgc ttcggatgtg gtggcggcct ccactcgtcc cacgcgtcca tcggtgtata 3481201 gcggtttcct cagccacaag acggacttgt tgtaccggaa ctccgaaaac cgacgagcaa 3481261 tgaacccctg gaccgatcca tcgcattgac tcggcttgcc gactacctca ctgacccagg 3481321 aggagagtta cgtccagggg tgtggtgtac gggcaggtaa ggccggtggg cgtgtcgtag 3481381 cccagtagtg ggcggtcatc gcgtgatcct tcgaaacgac cagcaaaagt caatcgaagg 3481441 aaatgacgca atgacctctt ctcatcttat cgacgccgag cagcttctgg ctgaccaact 3481501 cgcacaggcg agcccggatc tgctgcgcgg gctgctctcg acgttcatcg ccgccttgat 3481561 gggggctgaa gccgacgccc tgtgcggggc gggctaccgc gaacgcagcg atgagcggtc 3481621 caatcagcgc aacggctacc gccaccgtga tttcgacacc cgtgccgcaa ccatcgacgt 3481681 cgcgatcccc aagctgcgcc agggcagcta tttcccggac tggctgctgc agcgccgcaa 3481741 gcgagctgaa cgcgcactga ccagcgtggt ggcgacctgc tacctgctgg gagtatccac 3481801 tcgccggatg gagcgcctgg tcgaaacact tggtgtgaca aagctttcca agtcgcaagt 3481861 gtcgatcatg gccaaagagc tcgacgaagc cgtagaggcg tttcggaccc gcccgctcga 3481921 tgccggcccg tataccttcc tcgccgccga cgccctggtg ctcaaggtgc gcgaggcagg 3481981 ccgcgtcgtc ggggtgcaca ccttgatcgc caccggcgtc aacgccgagg gctaccgaga 3482041 gatcctgggc atccaggtca cctccgccga ggacggggcc ggctggctgg cgttcttccg 3482101 cgacctggtc gcccgcggcc tgtccggggt cgcgctggtc accagcgacg cccacgccgg 3482161 cctggtggcc gcgatcggcg ccaccctgcc cgcagcggcc tggcagcgct gcagaaccca 3482221 ctacgcagcc aatctgatgg cagccacccc gaagccctcc tggccgtggg tgcgcaccct 3482281 gctgcactcc atctacgacc agcccgacgc cgaatcagtt gttgcccaat atgatcgggt 3482341 actcgacgct ctgaccgaca aactccccgc ggtggccgag cacctcgaca ccgcccgcac 3482401 cgacctgctg gcgttcaccg ccttccccaa gcagatctgg cgccaaatct ggtccaacaa 3482461 cccccaggaa cgcctcaacc gagaggtacg acgccgaacc gacgtcgtgg gcatcttccc 3482521 cgaccgcgcc tcgatcatcc gcctcgtcgg agccgtcctc gccgaacaac acgacgaatg 3482581 gatcgaagga cggcgctacc tgggcctcga ggtcctcacc cgagcccgag cagcactgac 3482641 cagcaccgaa gaacccgcca agcagcaaac caccaacacc ccagcactga ccacctagac 3482701 tgccacccga aggatcacgc gaggaacctt cactcgtaca ccacgtccct ggccttggcc 3482761 aggaggagag caatcatgac tgaagccttg atcccggcac cgtcgcagat atcgctgacc 3482821 cgcgatgagg tgcgcaggta cagcaggcac ctcatcatcc cggatatcgg cgtcaacggc 3482881 caacagcggc tgaaggatgc gcgcgtattg tgtatcggcg ccggaggatt gggttcgcct 3482941 gctctcctgt atcttgcggc cgccggagtc ggtaccatcg gcatcatcga tggagaccac 3483001 gtggatgagt cgaatctgca acgccaaatc attcatggca catccgacgt gggtaggccg 3483061 aaagtagaat cagcagccga ggcggtggcg gaaatcaacc cgcacgtccg ggtgacgcaa 3483121 tatcgcgaaa tgctcaccca cgacaacgca ctggaaattt ttggcgatca cgacctcatt 3483181 gttgacggca cagacaactt cacgacgcgc tacctgatca atgatgccgc ggtcttggcc 3483241 ggcaaaccat atgtttgggg gtcgatctac cgattcaacg gccagaccag tgtgttttgg 3483301 cccggccggg ggccgtgtta tcgatgcctt catccagctc cgcccccgcc cggattggtg 3483361 ccgtcgtgcg ctgaaggcgg tgtactcggt gccatctgcg ccacgattgc gtcgatccag 3483421 gtaactgaag tgctgaagct ccttaccgga gtcggaactc ccctcgtcgg tcgcctgctc 3483481 atgtatgaag ctctcgacgc gacataccat caaatccgga tcgcgaagaa tcctgactgc 3483541 gccatttgcg gcgatgcgcc cacgatcacc gaattggtag atgacagcgt cagctgcgca 3483601 tcgacacaat cggtggatcc cgaactagtg atcagttgtg atgagttgcg aaccaaacag 3483661 cagtcggacc agaacttcct cttggtcgac gtgcgagagc ccgccgagtt cgacatcgcg 3483721 cacattccgg gcagcatctt gatacccaaa ggcgaaatcg gctcggcggc gggcctagcc 3483781 cagctaccgc tggacaagga aattgtcctg tactgcaaga gtggaatccg atcggcccag 3483841 gcgctaacca cgttgaaagc agccggactg cacaacgtga agcatctcga cggcggtatc 3483901 gcggagtgga cacgaaccat cgactcctcc ttgttggtgt actagcaccg aactatgcga 3483961 aaggattccc gccatggcac gctgcgatgt cctggtctcc gccgactggg ctgagagcaa 3484021 tctgcacgcg ccgaaggtcg ttttcgtcga agtggacgag gacaccagtg catatgaccg 3484081 tgaccatatt gccggcgcga tcaagttgga ctggcgcacc gacctgcagg atccggtcaa 3484141 acgtgacttc gtcgacgccc agcaattctc caagctgctg tccgagcgtg gcatcgccaa 3484201 cgaggacacg gtgatcctgt acggcggcaa caacaattgg ttcgccgcct acgcgtactg 3484261 gtatttcaag ctctacggcc atgagaaggt caagttgctc gacggcggcc gcaagaagtg 3484321 ggagctcgac ggacgcccgc tgtccagcga cccggtcagc cggccggtga cctcctacac 3484381 cgcctccccg ccggataaca cgattcgggc attccgcgac gaggtcctgg cggccatcaa 3484441 cgtcaagaac ctcatcgacg tgcgctctcc cgacgagttc tccggcaaga tcctggcccc 3484501 cgcgcacctg ccgcaggaac aaagccagcg gcccggacac attcctggtg ccatcaacgt 3484561 gccgtggagc agggccgcca acgaggacgg caccttcaag tccgatgagg agttggccaa 3484621 gctttacgcc gacgccggcc tagacaacag caaggaaacg attgcctact gccgaatcgg 3484681 ggaacggtcc tcgcacacct ggttcgtgtt gcgggaatta ctcggacacc aaaacgtcaa 3484741 gaactacgac ggcagttgga cagaatacgg ctccctggtg ggcgccccga tcgagttggg 3484801 aagctgatat gtgctctgga cccaagcaag gactgacatt gccggccagc gtcgacctgg 3484861 aaaaagaaac ggtgatcacc ggccgcgtag tggacggtga cggccaggcc gtgggcggcg 3484921 cgttcgtgcg gctgctggac tcctccgacg agttcaccgc ggaggtcgtc gcgtcggcca 3484981 ccggcgattt ccggttcttc gccgcgcccg gatcctggac gctgcgcgcg ctgtcggcgg 3485041 ccggcaacgg cgacgcggtg gtgcagccct cgggcgcggg catccacgag gtagacgtca 3485101 agatcacctg atagctagga aggatgtctg aatggccaat gtggtagctg aaggtgccta 3485161 cccttactgt cggctcactg atcagccgct gagtgtggac gaagtgctag ccgccgtctc 3485221 gggccccgaa caaggcggca ttgtcatatt tgtgggaaac gtgcgtgacc acaatgccgg 3485281 gcatgatgtc acgcggttgt tctacgaggc gtatccgccg atggtgattc ggacattgat 3485341 gtcgatcatc ggacggtgtg aagacaaggc cgagggtgtc cgcgttgctg tcgcgcaccg 3485401 gaccggtgaa ttgcaaatcg gtgatgccgc ggtcgttatt ggcgcgtcag ctccccaccg 3485461 tgcggaggca tttgacgccg cgcgtatgtg tatcgagttg cttaagcagg aagtgccgat 3485521 ttggaagaag gaattcagct cgaccggtgc tgaatgggtc ggcgatagac catgagtccg 3485581 tctccatcgg ccctgctcgc cgaccacccg gaccgcattc gttggaacgc gaaatacgag 3485641 tgcgctgacc ccacggaggc ggtatttgcg cccatatcct ggctcggcga cgtgctgcag 3485701 ttcggggtgc cagaagggcc ggttctggaa ctggcgtgcg gtcggtccgg caccgcgctg 3485761 gggctagccg cggcgggccg ctgcgtgact gcgatcgacg tttccgatac cgcgttggtt 3485821 cagctcgagc tcgaagcgac ccgacgggaa ttggccgatc gcctcacact ggtgcacgcc 3485881 gatctctgct cctggcagtc gggggatgga cgctttgctc tggtactttg ccgactattc 3485941 tggcatccgc ccacttttcg ccaggcttgc gaggctgtgg cgccgggcgg tgtagtggcg 3486001 tgggaggcat ggcggcggcc catcgatgtc gctcgggata cccgtcgagc cgaatggtgc 3486061 ttgaagccag gccagcccga gtctgaactt cccgccggct tcacggtgat tcgggtggtc 3486121 gacaccgatg gttcagagcc gtcgcggcgc atcatcgccc aacggtcact gtgaacggtc 3486181 cctggttgta tgcgcacgtc ctttgttgag aacccgtttc gcaccgctcc gataccgcca 3486241 gtctgatgca ccgaccgcgc cgcctcccac ccgcggaagc taacgaggtg tgcatgaaac 3486301 cggggcggtt cagcagcccg gttaattgac aatctgtgaa gaggttccca cgacaatggg 3486361 cacgttgggc tcgcgatgtc gcgcgattcg agcgaggttg ggtgacgttc ccgtttgagg 3486421 atctcgcccc agggcgatgg gttggcggga tgtcgatgta cccggaagag caaaacgtgg 3486481 catgcgataa cgatccgaga ggagtgcgat gacaagcacc tcgattccga cgttcccgtt 3486541 cgaccggccg gtcccgacgg agccgtcccc aatgctgtcg gaactgagaa acagctgtcc 3486601 ggtagccccg atagagttgc cctcggggca cacagcatgg ctcgtcactc gctttgacga 3486661 tgtaaaggga gtgctgtccg acaagcgttt cagctgcagg gcggcagcgc acccgtcgtc 3486721 gcccccgttc gtgccgttcg tgcagctttg ccccagcttg ttgagcatcg atgggcccca 3486781 acacaccgcg gcccgccgtc tgctcgcgca gggcctaaat cccggcttca tcgcacgcat 3486841 gcggcccgtt gtccaacaga tcgtcgacaa tgcgctcgac gatctggcag ccgcggaacc 3486901 accggtggac ttccaggaaa tagtaagtgt ccctatcgga gaacagctca tggccaagct 3486961 actcggggtc gagcccaaaa ccgtgcacga gctcgcggcg cacgtggatg cggcgatgtc 3487021 cgtgtgtgag atcggcgacg aggaggtgag ccggcggtgg tcagcactgt gcacgatggt 3487081 catcgacata ctgcaccgca agctcgccga accgggtgat gacctactta gcacgatcgc 3487141 ccaggcgaac cggcaacagt ccaccatgac cgacgagcag gttgtcggca tgctcctcac 3487201 cgtcgtgatc ggaggagtcg acacaccgat cgccgtgatc acaaacgggc tggcgagcct 3487261 gctgcaccac cgcgatcaat atgaacggct cgttgaagac ccaggccgtg tcgctcgtgc 3487321 ggttgaagaa atagtccggt ttaatccggc aactgaaatt gagcacttgc gagttgtcac 3487381 cgaggatgtc gtcattgccg gaaccgcgct atcggcgggg agcccagcat ttacctctat 3487441 cacttcggct aaccgcgact ccgaccaatt cctggacccc gatgagtttg atgtcgaacg 3487501 taatccgaac gaacacatag catttggata tggtccacat gcttgcccgg cctcagcgta 3487561 ttcacgcatg tgcttgacga cgttcttcac ctcgcttacc cagcgatttc cgcaacttca 3487621 actcgcaaga ccgtttgagg atttggaacg acggggtaag ggcctacatt cggtggggat 3487681 caaggaactc cttgttacct ggccgacgtg accccgcgtg ccagcaaggg actgttgact 3487741 tctccgacgg atgaaagccg ccctggaata tccaaccgct cctgctcctc ggtcaactca 3487801 agccgaaacc gccaacggtg gccacaaaat acgagttcgt ccacaacgtc ggcagccggg 3487861 accgcaacca cgcaaactcc tcacgcacta cccgcaaccg acggccccta attggggttg 3487921 ggcccatgat cggttggcgg ctcatcaggc ggtgcaggat cttggtgtgc ccgcctcggc 3487981 gcggcggagc cggggtcgag catctctttg cgagtgatga aggcacagcc ccggcgcggg 3488041 gtgggtgtgc aacacgaatg taggtagcgg gagttgaggc tgggcgcggt gtattctggt 3488101 tgttggataa acaaccagaa tggggagacg cgggtgggcg aggactcgct ggaggatctg 3488161 gagcagcggc gagcgcgact gtatgaccag ttggccgcga ccggcgattt ccggcgcggc 3488221 tcgatcagtg agaactatcg ccgctgcggc aagcccaatt gtgtgtgcgc gcaagagggt 3488281 caccccgggc atgggccgcg atatttgtgg acgcgcacgg tggccgggcg gggtaccaag 3488341 gggcggcagc tctcggtcga ggaggtggac aaggtgcgcg ccgagttggc caactatcac 3488401 cgtttcgcgc aggtcagtga gcagatcgtg gcggtcaacg aggcgatctg cgaggcccgc 3488461 ccaccgaacc cggcggccac ggcgcccccg gccggcacaa cggggcacaa aaaagggggc 3488521 tctgcgacca gatcgcggcg gagttcaccg ccgaggtaga gcggctggtt gcgctcgcgg 3488581 tcggtgcgct gggatcctcg gtgccgacct ggtcgcagtg gagttggcga tccgcactgc 3488641 gatgacccgg ctgggctcct cgctgctgga gcagctgctg ggcgccgaca ccgggcaccg 3488701 gggccagcgc atcgattgcg ggcaagggca ttgcgcgtgg ttcgtcggtt accgcgacaa 3488761 gaacctcgat accgtgctgg accgggtccg gttgctccgc gcctgctacc actgccgcac 3488821 ctgcgggcgt gggatggcgc cccctggatc tggaacctgg ccaccgcgat cctgcccgaa 3488881 gccaccccga tcgtggacct ctaccacgct cgccagcacg tccacgacct cgccggccag 3488941 ctcgcacccg ccctcggcga acaccacagt gactggctga ccgcccggct ggtcgacctc 3489001 gactccggcg acatcgaaac gctggttcaa caaccgatcg ggcagcacac cggtcacacg 3489061 taacgaagtg tgcatgaaac ccggagtggt tcaggggtcc gccgcgctcg tccgcgctgt 3489121 gagggtctcg gcactaccac gagatgagat cgaggcacca ggtgcattgt gcaccacatt 3489181 ctggcgatgt tggtgaggtt tgttcctgcg cccgtccgtg gcgcgttcgg gatcgttggg 3489241 gttggccggt tgcccacctc ggcggaagcg gacggtgagc gcggccgagt cgtcgacatt 3489301 tggcggtagg aggtttcgat gctgtttgtc agcgtggccc cggagtcggt aggggtggcg 3489361 gcggcgactc ttgttgggcc cccgttgatc ggcaacggcg ccgatcggcc cccggcaccg 3489421 gacaagccgg cgggatcttg tggggcaacg gccgttttcg cccaatcaca ggagtggagt 3489481 tttgaacgca acgacggcag gtgctgtgca attcaacgtc ttaggaccac tggaactaaa 3489541 cctccggggc accaaactgc cattgggaac gccgaaacaa cgtgccgtgc tcgccatgct 3489601 gttgctatcc cggaaccaag tcgtagcggc cgacgcactg gtccaggcaa tctgggagaa 3489661 gtcgccacct gcacgagccc gacgcaccgt ccacacgtac atttgcaacc ttcgccggac 3489721 cctgagcgat gcaggcgttg attcgcgcaa catcttggtt agtgagccgc cgggctatcg 3489781 ccttctcatt ggagatcgac agcaatgcga tctcgaccgt ttcgtggcag cgaaagaatc 3489841 gggactgcgc gcttctgcca aaggatattt tagcgaggcg atccgttatc tagattcggc 3489901 cttgcagaat tggcgcggtc cagtactggg ggacctacgc agctttatgt ttgtccaaat 3489961 gttcagcagg gcgttgaccg aagatgagct cctcgtccat acgaagctgg ccgaagctgc 3490021 aatcgcctgc ggacgcgccg acgtcgttat ccctaaattg gaaagactcg ttgcgatgca 3490081 tccttatcgc gagtcgttat ggaagcagtt aatgctcggc tactacgtga acgaatacca 3490141 gtccgcggca atcgacgcat atcatagact caagtccacg ctcgcagagg aactcggtgt 3490201 tgagccggca cccacgatac gtgcgctcta ccacaaaatt cttcgccaat tgcccatgga 3490261 cgatctcgtc ggccgagtca cgcgtggcag ggttgacttg cgtggcggca acggcgctaa 3490321 ggtagaggaa ctgaccgaga gcgataagga tctccttccc atcggtttgg cataactacg 3490381 cccctcaatg caagcgagct gattcgatgt tgtcgagccg gagcccgctc cgacctccgt 3490441 cacacagacc ggactacgaa tactgacccg cgctgctagc caaccccggt tcgtggaatc 3490501 acagtgagac gtgcctgcgt gacatgccaa cccgcaccat cacgatccat cagcccaccg 3490561 ggcataccag cgccggcacc gctaatactc attggcatca gcatcatcgg cataccacca 3490621 ccggcggccc cggccgcctg cgtcagcgcg actgggttag gcggcacacc caacccggac 3490681 atcgctgaag aagccatcga aatgggtatc gacccctgcc acgtcggcgg caccgacatc 3490741 gaccccacca actgagcctg ccctaagcca gcggacatcc ccgcacccaa ttcccgaccg 3490801 cctagcggct tgaacgccgg aatagccgca ccactcgggt tcggcgtatt cgcggcagcc 3490861 aacccgccag ccggcggacc taacctggtc gcgctttcac gcgccatact cgccaacgtc 3490921 gttatcggcg acgtcaggat gcggaccggg agaaacaaca tcgacgcatg ttgcaacggc 3490981 agctgcgaca ccagcgactg catcccgctc atcacggtcg gcaccgacgc catcgcaccc 3491041 tcgaccaccg gcgtcacggc agccgacgcc gtcgtcgcca tcccggcaac ctgggtaccc 3491101 acctgcgcgg ccaacccggc taagctcacc ggcggcagac taaatggcgc caacgtcgcc 3491161 gccacggact ttgccccagc gtgatagccc accatcgcag ccacatcctg agcccacatc 3491221 tccaggtaat cgaactccgt ggctgcgatc gccggggtgt tctgacccaa aacgttcgcc 3491281 gctatcaacg acgccagcga cacccgattc gccgtcaccg ccgtcggatg caccgtggcc 3491341 gccaacgcgg cctcaaacgc cgtcgctgcc gcgcgagcct gaatggccgc cagctgcgcc 3491401 tgcgacgcca ccgtgctcaa ccacccgaca tacggagacg cggcagcagc catcgacatc 3491461 gacgccggac cggtccacgg cccagtcgtc aacgcggcga gcaccgactc aaacgacgat 3491521 gccgatgccc acaaatccgc ggctagcccc tcccacgccg aggccgccgc aaacaacggc 3491581 cccgagccgg ctccggcaaa catgcgcgcc gagttgatct ccggcggcag ccacgaaaag 3491641 cccaaaacca tcgcaacccc agcccaatca gccgcccaga agggtctcgt acaagggtta 3491701 actaaacaat cgttaccgaa tgaatcgaca catcgtgacg caccgatggc tcagcacgcc 3491761 ggacttctag aacaacgagc acaacggata tgatgcggca ggcatcttca tggattgtca 3491821 atgacagccc aaaccgcctt cggccactgg cattggtgca ctgcaccgtg cgccattcgt 3491881 ggcgacaact gcgagcggga gcgggaccaa ggatgatggt cccggtcgcg acgggcgcga 3491941 tcccgctccg gagtggtcaa cgcatcaaac gacaaagcgc tcagctcatc gaccgcagca 3492001 tcgagccggt ccagcgccgc gaccaaacta gaattctcgc gcagacaccg ctgaaacgac 3492061 agtgacgcaa gggatttcat tgagaggacc aatgacccta tttgatcaaa ccggatgacc 3492121 ataccgtcaa cgttgtggac atacaggtgc tcaagaacgc agtcttgctg gcatgccggg 3492181 cgccgtcggt gcacaacagc cagccctggc gttgggtggc cgaaagcggc tccgagcaca 3492241 ctactgtgca cctgttcgtc aaccgccacc gaacggtgcc ggccaccgac cattccggcc 3492301 ggcaagcgat catcagttgc ggtgccgtac tcgatcacct tcgcatcgcc atgacggccg 3492361 cgcactggca ggcgaatatc actcgctttc cccagccgaa ccaacctgac cagttggcca 3492421 ccgtcgaatt cagtcccatc gatcacgtca cggcgggaca gcgaaaccgc gcccaggcga 3492481 ttctgcagcg ccgaaccgat cggcttccgt ttgacagccc gatgtactgg cacctgtttg 3492541 agcccgcgct gcgcgacgcc gtcgacaaag acgttgcgat gcttgatgtg gtatccgacg 3492601 accagcgaac acgactggtg gtagcgtcac aactcagcga agtcctgcgg cgggacgatc 3492661 cgtactatca cgccgaactc gaatggtgga cttcaccgtt cgtgctggcc catggtgtgc 3492721 cgccggatac gctggcatca gacgccgaac gcttgcgggt tgacctgggc cgtgacttcc 3492781 cggtccggag ctaccagaat cgccgtgccg agctagctga tgaccgatcg aaagtccttg 3492841 tgctgtcgac ccctagcgac acgcgagccg acgcactgag gtgtggcgaa gtgctgtcga 3492901 ccatcctact cgagtgcacc atggccggca tggctacctg cacgttgacc catctgatcg 3492961 aatccagtga cagtcgtgac atcgtgcggg gcctgacgag gcagcgaggc gagccgcaag 3493021 ccttgatccg ggtagggata gccccgccgt tggcagcagt tcccgccccc acaccacggc 3493081 ggccgctgga cagcgtcttg cagattcgcc agacgcccga gaaagggcgt aatgcctcag 3493141 atagaaatgc ccgtgaaacg ggttggttca gcccgccttg atcaggatgc ctttgtggat 3493201 gtcgggtagg gcggtgggga tgttagcgag gtagagctgc tcggttttct ccttggccaa 3493261 gatgaggagt cggttctgca ggtcggcgat tttgcggccg atctgggcgg ggttgaggct 3493321 gtctcggtag gtgatcaggt cggcctgctg ggccgcggag agcacccttg cggccagtgg 3493381 ccggtccagc ggcgtctgtg gggcatcgta gaggcgtcgg cggcggccgt cggcgctgct 3493441 ggcatacccg atcggtttga tggtcggggt gaggtagttg aggcggtcgt tgaccagctt 3493501 ccacatccgg ttgagcacgg cgcgttcctc ggcggtgtca tagcggtagt agaacgcgta 3493561 cttgcggacc aggtggttgt tcttggactc gatggtggcc tagtggtttt tcttgtacgg 3493621 gcgaaagcgg gtgaagtaga taccgttgtc gccggcccag ctgatgaccg gcttgttgag 3493681 aaacacggtg ccgttgtcga aatctaaacc cgttatccca tgcgggatct cggtgacaga 3493741 agctttgagc ccggcgagga tgtgggtacg ggcgttgttg cggacggtgc gggtgaacac 3493801 ccatccgatg tgcacgtcgg tcaagttcag ggtgtgggcg aactcgcctt tgagcgtcgg 3493861 accgcaatgg gcgacggtgt cgccctcgaa gaaccccggc tccgcctcga cctcatcgcc 3493921 ggccctgcga accttgatcg aattacgcag cagtggtgag ggtttcgtcg tcgacacacc 3493981 cgatatctgg tctttggcct tcgcggtctt cagataacga tcgatgctgg ccgcactcat 3494041 cgccaacagc tcctcacgca cctcggggcc atagcggtca cgcccaaact ccaacacacc 3494101 gtgacgttcc aacccatcaa gctgcagcac catcgaggcg gcaagatact tcccgcactg 3494161 cccacccgag gcggaccaca ccctctgcaa caccttcagc gcgtcatagg agtacttcag 3494221 cgaacgcggt ttgcgccgcc gcttggcaac actgcggccc agccccggcg atagcttggc 3494281 cgctgcgaca agccggcgcc gcgcgttatc acgtgactag cccgtcaggt caaccacctg 3494341 gtcgaaaatc cggccccggc tcttcttcaa agcctgcaca tacgccttgg cgtacctgct 3494401 ggtgacctcc gcgcgagatc tcatcgacaa cccacttccc atgcctcacg acggtcacca 3494461 tgtcgcgggc atatttacgt gaggcaccga gggtgtttcg cgggcattct tggtgagtca 3494521 agtcgaacgg ttgagccatg atcgacgatt ccgttaccgt gctgtcagaa gacgaaagtt 3494581 ggcaccggct gggcagcgtt gcactcggtc ggctagttac cacctttgct gatgagcctg 3494641 ggatcttcca gtcaatttcg tggtgcaagg ccgcaccgtg ctgtttcgta ccgcggaggg 3494701 cgccaaatta ttttcagccg tcgcgaagtg cgcggtggct ttcgaggcgg acgaccacaa 3494761 cgttgccgag ggctggagcg tgatcgtcaa ggttcgcgcc caggtgctga cgaccgacgc 3494821 gggggtccgc gaagccgaac gcgcccagtt actaccgtgg accgcgacgc tgaaacgtca 3494881 ctgtgtgcgg gtgatcccgt gggagatcac cggccgccac ttcaggttcg gtccggaacc 3494941 ggaccgcagc cagacctttg cctgcgaggc ctcgtcacac aaccagcgat agcgctccgc 3495001 gcctgcgagt caccttgcgc cgcttactga tcgccaccag ccgtgcgacg gcgtcttcaa 3495061 ttcctcgcgc cagctggccg gcatctgcta ccacgtcgta gtcggccagg atcccgaagt 3495121 acaggtcgtc ggcgtagctg agcatcgcga cactggtgcg cagttgcatc gcgatcggcg 3495181 aaaccgggta taggtcaagc acccgtctgc ccataatctg cagcggccgt cgtggacccg 3495241 gcacatttgt cgccacggtg acaacaccac gctgcggcag ccgcatcaac agcccgaccg 3495301 cccatgcggt catggggaac ggaaggcggt tggcaatcgc catcaaagta tttccgaatt 3495361 gtctctgtcc ccccgccttg gcccgagtca gccgcgagtg cacgatccgc agccgctgca 3495421 gcgggttctc ttgatccacc ggcaggttgg gcagcattaa cgaaacacgg ttatcggtct 3495481 tgctcaaagc gctgttggaa cgcgtcgaga ccggcactag cgtacgcagc gaatcaaacc 3495541 taggccgctc accccgctgg atgaggacgt tgcggtagct ttccgtaatc gcggcaagcg 3495601 caacatcatt gatggtgacg tcgaatttcc ggcacacctg ttcgacgtcg gcgagaggga 3495661 cctttgctgc gctgtagcga cgcaaatcac tgatcggccc gttcaacgac gacgcggcgg 3495721 gacttagcac gccggccgcg atctcactgg cacccttggc cgcgcgaacg atgcctgcca 3495781 tcacggcggt cgacgcggtc aacgcctcgc ttggattgac acggaatcca ccccgccgca 3495841 cagatgcgga ttgcgactgc atggtcgtgt ggatgttgct cgcgaagctg tcgctcatac 3495901 tttcatcgga gagcccagct agcaggtgag tcgccgcgat tccgtcggcc atgcagtggt 3495961 gcagtttggt caggatcgcc cacttgctgt ccgccaggcc ttcgatgacc cagacctccc 3496021 acagcggtcg accccggtcc aaacgacgcg ccatcagatc ggcgatcagc tcgaataact 3496081 ggtcttcgtt gccaggccgc ggcaaggcga tgcgccacac atgacggcca agatcgaagt 3496141 cgggatcgtc cacccatttg ggtgcaccga ggtcgaacgg gcgcaggcgt aaccgctgcc 3496201 cgaaccgggt acagggacgt aggcgttgag cgagcgacga taagaaggct tcctgatcgg 3496261 gagccggccc ctcgatgacc gccagagcgc cgattgccag actcacgtgc cgatccacgt 3496321 cttctgcctt gagaaacccg gcgtcaagtg tcgttaggtg attcatggtc agcgccttcc 3496381 ccggtgatcc ggattatctg caaccgtcag taccactctc cgctgcgagg agccgttgag 3496441 gcagggccaa aggtcctccg ctggcgagcc ttcgtgctct gccaccgcgg ctgtcgacgc 3496501 gcgatcctta atagatgacc gcagccgttg atgggaaagg cccggcagcc atgaacaccc 3496561 atttcccgga cgccgaaacc gtgcgaacgg ttctcaccct ggccgtccgg gccccctcca 3496621 tccacaacac gcagccgtgg cggtggcggg tatgcccgac gagtctggag ctgttctcta 3496681 gacccgatat gcagctgcgt agcaccgatc cggacgggcg tgagttgatc ctcagctgtg 3496741 gtgtggcatt gcaccactgc gtcgtcgctt tggcgtcgct gggctggcag gccaaggtaa 3496801 accgtttccc cgatcccaag gaccgctgcc atctggccac catcggggta caaccgcttg 3496861 ttcccgatca ggccgatgtc gccttggcgg cggccatacc gcggcgacgc accgatcggc 3496921 gcgcctacag ttgctggccg gtgccaggag gtgacatcgc gttgatggcc gcaagagcag 3496981 cccgtggcgg ggtcatgctg cggcaggtca gtgccctaga ccgaatgaaa gccattgtgg 3497041 cgcaggctgt cttggaccac gtgaccgacg aggaatatct gcgcgagctc accatttgga 3497101 gtgggcgcta cggttcagtg gccggggttc ccgcccgcaa cgagccgcca tcagacccca 3497161 gtgccccgat ccccggtcgc ctgttcgccg ggcccggtct gtctcagccg tccgacgtct 3497221 tacccgctga cgacggcgcc gcgatcctgg cactaggcac cgagacagac gaccggttgg 3497281 cccggctgcg cgccggcgag gccgccagca tcgtcttgtt gaccgcgacg gcaatggggc 3497341 tggcgtgctg cccgatcacc gaaccgctgg agatcgccaa gacccgcgac gcggtccgtg 3497401 ccgaggtgtt cggcgccggc ggctaccccc agatgctgct gcgagtgggt tgggcaccga 3497461 tcaatgccga cccgttgcca ccgacgccac ggcgcgaact gtcccaggtc gttgagtggc 3497521 cggaagagct actgcgacaa cggtgctgac catcgcagca ctgttccgct cgcgcccggt 3497581 acgctcgcga gggtgaattc gccgccggcc tgctctgccc gctgccgcag gttcgttaag 3497641 ccgcttccgg tgaactcgtc gggcagcccg cggccgttgt cggtcacctc gatgcacaag 3497701 tcgtcgtcga ctttgacccg gacggtcaac gtgctggcct tcgcatggcg aaccgcgttg 3497761 ctgaccgctt cccgaaccac cgcctcggcc tgatcggcga gcgcgctgtc gaccaccgac 3497821 aatggaccca cgaattgaac gctggtgcgc aaccccgagt cggcaaattg ggctacggcc 3497881 gcatcgattc gctgccggag ccgagtgata ccctgcgatg ctccgtgcag gtcataaatg 3497941 gtggtccgga tttcctgtat aacgtcttgc agatcgtcta ccacgtccga gagtcgttgc 3498001 tgcacttcag gattacgttc gtgcgggaca gcaccctgca aagccaggcc aatcgcgaag 3498061 agccgctgga tgacatggtc atggaggtca cgggcgatac gatcccggtc ggtcagtacg 3498121 tcgagttcgc gcatccgacg ttgcgaagtg gccaattgcc aagccagcgc ggcctggtcg 3498181 gcgaacgcgg ccatcatctc gagttgttcg tcggtgaaag cccctggacc gccttgactc 3498241 agcacaacaa cgacacccgc tacggtacct ctggcccgca gcggcaacag cagcgccgga 3498301 cctgcgtcgg ccagttcgtc caggccttcc aaatcgaccc ggtcgacccg tcgcggaatg 3498361 ccgttgacga agacctcccg cagcaccgcg cccgccaccg gaatcgttcg cccaacaatg 3498421 gaagccacag cgctgccgac tgtttcaatc accagcagct cccccacgtc agcggcaggc 3498481 atgtcctcgt cgacgggaac ggctaccagg gcagcgtcag ccgccgtcag cttgagcgcc 3498541 tccgcggcga caagccggaa caccgtcgcg ggttcggtgc cggacaacaa ctcggtggcg 3498601 atgtcacggg tggcctcgat ccacgactga cgcgccttag cctgctggta gagccgggca 3498661 ttcgcgactg cgatacccgc ggcggccgcc agcgcctgga ccagaacctc gtcgtcgtcg 3498721 ctgaacggtt gcccgttggt cttgtcagtc aggtacagag tgccgaacga ttcatcgcgc 3498781 acccgaaccg gtaccccgag gaaggtacgc atcggcggat gatacggcgg aaaaccaatc 3498841 gaggccgggt gcgcagaaac atcgtccagc cgtaacggtt tgggatcttc gatgagcagc 3498901 ccgatgacgc ctaggccttt cggtaggtgg ccgatccgcc gaacggtctc ctcgtcgatg 3498961 ccttcataga caaagtgcaa tacccgatgc tgccggtcgt gcacctccat agcgccatag 3499021 cgcgcatcga caaggctggt cgctgaatgc acgatagcgc gtagggttgc ctccaggtcc 3499081 aggcccgctg tgaccacgag catggcctcc accagaccat cgaggcggtc ccggccctcg 3499141 acgatctgct cgacccggtc ctgcacctcg accagcagct cgtgcaggcg tagttgggag 3499201 agcgtgtgac gcagtggacg cattgcggcg ccgtcgtttt cgtcgacgag gccccctgtt 3499261 gtcatggtcc atcaccgggt ggccgcgagc gcttcaactc cgtcgcgaat accgcggctt 3499321 gcgtccgacg ttccatgccc agcttggcca gcaaccgcga cacgtagttc ttcaccgtct 3499381 tttcggctag gaacattcgg tcggcgatct gcttgttggt caggccctcg ctaagcaggc 3499441 ccagtagcgt ccgctcctgg tcggtaaggc ctgatagcgg gtcctgcttc tcggcggcac 3499501 cgcgcagctt ggccatcagc gcggccgcgg cccgattgtc cagcagcgac cgtccagcgc 3499561 ccacatcttt gacggcgcgc gccaactcca ttcccttgat gtctttgacg acatatccgc 3499621 tggcaccggc gagaatcgca tctagcatgg cctcgtcaga ggtgtaggac gtgaggatca 3499681 gacagcgcag atcgggcatg cgggacaaca gatcgcggca cagttcaatg ccgttgccat 3499741 cgggcaaccg gacatccagc accgcgacat ctgggcgcgc ggcaggaacc ctggccatcg 3499801 cctcggcgac cgaacccgcc tcacctacga cgtcaagctc gggatcggcc ccaagcaagt 3499861 caaccagacc acgacgcacc acctcgtggt catcgaccaa gaagaccttt accaccaggg 3499921 caccactccc aagatccgct ccctacaagt tggcactgcg taccgtaagt acggcgcatc 3499981 cgggctggta tgcaccgcac aattcgtgcg cggagtgtga gtccgcgacg aacagctgac 3500041 ccggctttgc gttggcggcc agatgacggc acgcactgcc gccggcgatg gcccgatcca 3500101 cccgcacctc ggggtagagc cgggtccagt gggcgagccg acggctcagg tgtacatgcg 3500161 ccaaccggct gccctgttcg acgtcatcgg gtgtttcagc agcgtggaca gccacggccc 3500221 gcagcggaac tccgcgcagc ctggcctcct cgaatgcgtg ccgcagcacc acaccattgt 3500281 ccacctccgc gacaaccgcg ctgacctggg aggttgtcgc tggctcggcc ggcgacgggt 3500341 gaatcaccgc cacggggcat aaggccgacc cagccagggt cgccgcgacc gaaccccggc 3500401 gaccgcggac atgatcaagc cccaccgaac cgacgcacag catcgccgcg gacctggact 3500461 cctgcatcag cttggtgagc ggcctgccgc acagaacctc cgtttcgatc ttgaccggtt 3500521 gcccggtggc ctcgaccttc cgagaggcgt cgtgcagcgc cgctcgggcc gctgattgcc 3500581 caccgccctc gccggcggcg gacagttggg acggatcgat gacgtacacc agtcgcagcg 3500641 gaatgtctcg gttcaccgcc tcatcgaccg cccacaacgc cgcatgcgtt gccgcccttg 3500701 acccgtcgat accaacgacc actgcccgag ctggccgagg atcgctcatc gccgtctcct 3500761 tcgctggggc ggatacatcc cgtcggttca gcggtacgtt actggcgggg accgctatct 3500821 cccaggggcg ttggtcccca cctgagggcc gttagtcctt atcgaccgat gacagacgca 3500881 acccgtcagg gcgagaatga atctcaccta tcgcacgggt ggctcgtcca ggtccacaac 3500941 catcgcccag cttttcacag caaagtccca gaaatggctt acagttgccg acagctgccg 3501001 aaccagcggc cgtccatcgg ctgcatatcg cttgacccac agaatatttg ggcatagccg 3501061 cgctgtgaga gcgcatctcg atgcggccgg cacggcgtcg atcaatctcc gatccgccgt 3501121 cagtcgactg ccatacaacc tgcccgccca gttgtacact ggccgcggca cgagttgccg 3501181 cactggtcaa caagtatcag ccggcctgcg cccgagcgga gcccactcgg agccgctcgt 3501241 gaccatgggg ggagccactg ccgtctcccg catgcccaca ccgaggtccg aattgggctg 3501301 ggtgcgcaat cgacgttagg ggcctgcgga gtaatggact acgcgttctt accaccggag 3501361 atcaactccg cgcgtatgta cagcggtccc ggaccgaatt caatgttggt tgccgcggcc 3501421 agctgggatg cgctggccgc ggagttagca tccgcagcag agaactacgg ctcggtgatt 3501481 gcgcgtctga ccggtatgca ctggtggggc ccggcgtcca cgtcgatgct ggccatgtcg 3501541 gctccatacg tggaatggct ggagcggacc gccgcgcaga ccaagcagac cgctacccaa 3501601 gccagagcgg cggcggcggc attcgagcag gctcatgcga tgacggtgcc cccagcgttg 3501661 gtcacaggca tccggggtgc catcgtcgtc gaaacggcca gtgccagcaa caccgctggc 3501721 actccacctt gacccattca gttctcgacc agcacgacac cgtatccgca caaatgtaag 3501781 gagctgagac acaatggatt tcgcactgtt accaccggaa gtcaactccg cccggatgta 3501841 caccggccct ggggcaggat cgctgttggc tgccgcgggc ggctgggatt cgctggccgc 3501901 cgagttggcc accacagccg aggcatatgg atcggtgctg tccggactgg ccgccttgca 3501961 ttggcgtgga ccggcagcgg aatcgatggc ggtgacggcc gctccctata tcggttggct 3502021 gtacacgacc gccgaaaaga cacagcaaac agcgatccaa gccagggcgg cagcgctggc 3502081 cttcgagcaa gcatacgcaa tgaccctgcc gccaccggtg gtagcggcca accggataca 3502141 gctgctagca ctgatcgcga cgaacttctt cggccagaac actgcggcga tcgcggccac 3502201 cgaggcacag tacgccgaga tgtgggccca ggacgccgcc gcgatgtacg gttacgccac 3502261 cgcctcagcg gctgcggccc tgctgacacc gttctccccg ccgcggcaga ccaccaaccc 3502321 ggccggcctg accgctcagg ccgccgcggt cagccaggcc accgacccac tgtcgctgct 3502381 gattgagacg gtgacccaag cgctgcaagc gctgacgatt ccgagcttca tccctgagga 3502441 cttcaccttc cttgacgcca tattcgctgg atatgccacg gtaggtgtga cgcaggatgt 3502501 cgagtccttt gttgccggga ccatcggggc cgagagcaac ctaggccttt tgaacgtcgg 3502561 cgacgagaat cccgcggagg tgacaccggg cgactttggg atcggcgagt tggtttccgc 3502621 gaccagtccc ggcggtgggg tgtctgcgtc gggtgccggc ggtgcggcga gcgtcggcaa 3502681 cacggtgctc gcgagtgtcg gccgggcaaa ctcgattggg caactatcgg tcccaccgag 3502741 ctgggccgcg ccctcgacgc gccctgtctc ggcattgtcg cccgccggcc tgaccacact 3502801 cccggggacc gacgtggccg agcacgggat gccaggtgta ccgggggtgc cagtggcagc 3502861 agggcgagcc tccggcgtcc tacctcgata cggggttcgg ctcacggtga tggcccaccc 3502921 acccgcggca gggtaacccg gcgcctaacc gacaggcggc ccgttgggcg taaacgtcca 3502981 attgtcagga ttcttcggcg agtacaccac cggaagtatt tgaccgacgg tcggccactg 3503041 gtcgacgtcg acggccatgc gctgatacac ggcgtactca ttgaccgtgg gcccagtgat 3503101 gatcccggcg atggtgacat actgctggcc gcctgcgtcc ggtcgcgggc tgactccggt 3503161 caccaggagc gtgccgctgg ccagatctcc ccgcgggccg cgcgggataa gccgcggagc 3503221 aagaaatacc gctaggaccg cgatcagtat gagtagcacg ccaaactccc atcccacccg 3503281 gccatggtag gactgctggc atgagccgtt attacgccga gcgtgaactc agtgcaagaa 3503341 cgcacgcgaa aaatcgcact gggtacacgc tcggcgaaag gatggtgcac cagtgagcca 3503401 cgacgatcta atgcttgcgc tggctctggc cgaccgtgcg gacgaattga cgcgggtccg 3503461 gttcggggcg ctcgatctgc gcatcgacac caaaccggat ttgacgccgg tgaccgacgc 3503521 cgatcgggcg gtcgaatccg acgtgcgcca gacgctgggc cgcgaccggc ccggcgacgg 3503581 cgtcttgggc gaggagttcg gcggatcaac gaccttcacc ggacggcagt ggatcgtaga 3503641 cccgatcgac ggcaccaaaa actttgtgcg cggggtgccg gtgtgggcca gtttgatcgc 3503701 gctgcttgaa gatggcgtcc cgtcggtcgg tgtggtgagt gcgccggcgc tgcaacggcg 3503761 gtggtgggcg gcacgcggcc ggggcgcgtt cgcatccgtc gatggtgcgc gtccacaccg 3503821 gctgtcggtt tcctctgtgg cagagctgca ttcggcgagc ttgtcgtttt ccagtctgtc 3503881 cgggtgggcg cggccgggtc tacgtgaacg cttcatcggg ttgaccgata ccgtgtggcg 3503941 cgtgcgtgct tacggcgact ttctgtctta ctgcctggtg gccgagggcg ccgtcgatat 3504001 tgccgccgaa ccgcaagtgt cggtatggga tctggcggca ctggacatcg tggtgcgtga 3504061 ggcgggcggg cggctcacca gcctggacgg cgtcgccggc ccacacgggg gcagcgccgt 3504121 tgcaaccaac ggtctgttgc acgacgaggt gctgacacgg ctcaacgccg ggtaacctgg 3504181 cgctcgagag cgccatgagc gacccgttca ccatcgcaac caaacactgg caccgactgc 3504241 acgacagccg gatccagtgc gatgtatgtc cacgcgcatg caaacttcac gagggacagc 3504301 gtggcctgtg tttcgtccgc ggccgatttg acgatcaagt gaagctcacc agctacggac 3504361 gctctagcgg attctgtgtc gatccgatcg agaaaaagcc gctcaaccac ttcttgccag 3504421 gttcggcgac gctgtctttc ggcaccgccg ggtgcaacct ggcgtgcaag ttctgccaga 3504481 actgggatat ctccaagtcc cgcgagatcg acgtcctggc cagtcgggcg gccccggccg 3504541 acatcgcccg gaccgcacac gaattgggtt gccgcagcgt ggcattcacc tacaacgacc 3504601 caacgatctt ctgggagtat gccgccgatg tagccgacgc ctgccacgac cagggaatca 3504661 aagccgtcgc ggtgacggcc gggtacatgt gtcctgagcc ccgcgcggaa ttctaccggc 3504721 gtgtcgacgc cgccaacgtc gacctaaagg cattcaccga agacttttat cgcaaggttt 3504781 gcgtcagtca cctgcgcaac gtcctggaca ccctggccta cctgcggcac cagacgaatg 3504841 tgtggttgga gatcaccacc ctgctgattc ccggacgtaa cgacagcgac gcggaagtcg 3504901 ctgccgaatg cagatggatc cgcgaaaacc tgggcgtcga cgtgccggtg catttcaccg 3504961 cgttccatcc cgactacaag atgatggaca ccccggctac accaaccgcc acattgaccc 3505021 gagcccgcga gatcggcatt ggcgaaggcc tgcgcttcgt ctacaccgga aacgttcacg 3505081 atgccgtggg tggcagcacc tcgtgcccag gctgccgggc aacggtgatc gttcgcgact 3505141 ggtattcgat acgacattac gccctcaccg aggacggccg ctgccaagca tgcggctatc 3505201 agatgcctgg cgtgtacgac ggaccggccg gacactgggg ccagcgccgg ctgcccttgc 3505261 tgaccagctt gtcccggatg tgaacaactt aacaagcacc cctatcttac tccggagtaa 3505321 gatagggtgg tccgctatca ccccgatgac cgaggctgcc gtatgaccaa caccacctct 3505381 gctgcaaatg ctgcaaaacc ctccggcgca cgcaccgata gacgcggccg cacgaccggt 3505441 gtcggcctgg cgccccacaa acggaccggc atcgacgtcg cactggcgct gctaaccccg 3505501 attgtcggcc aggagttcct ggacaaatac cgcctgcgcg atccgctgaa ccgatcactg 3505561 cgctacggcg tgaagacgat gtttgccact gccggcgccg ccacccgtca gttccagcgg 3505621 gtgcaaggcc tgcggggcgg accgacccgg ctgaagtcca gcggccgaga ctacttcgat 3505681 ctgacgcccg atgacgacca gaagctgatc atcgagaccg tcgacgaatt cgccgaagag 3505741 gtactgcgac ccgccgcgca cgacgccgac gacgccgcga cctacccgtc cgacttgacc 3505801 gccaaggccg ccgagctggg cattaccgcg atcaacatcc ccgaggactt cgacggtatc 3505861 gccgaacacc gctccagcgt caccaacgtg ctggtggctg aggcactggc gtatggcgac 3505921 atgggcctgg cactgccgat cctggcgcct ggcggggtgg cgtccgcgct cacccattgg 3505981 ggcagcgccg atcagcaggc cacctatctc aaagagttcg ccggcgagaa cgttccgcag 3506041 gcctgcgtgg ccatcaccga accgcagcca ctattcgatc ccacccggct gaagaccacc 3506101 gcggtgcgca ccccgtccgg ttaccggctc gacggcgtga agtcgttgat cccggccgcc 3506161 gccgacgccg agctgtttat tgtcggcgcg cagctgggcg gcaagcccgc actgttcatt 3506221 gtcgagtccg cggccagcgg cctgaccgtc aaggcggatc cgagcatggg gattcgcggc 3506281 gcggcgttgg gccaggtcga actctgcggg gtgtcggtcc cgcttaacgc ccggctgggc 3506341 gaggacgaag ccagcgacaa cgactattcc gaggcgcttg cgctggcccg gttgggttgg 3506401 gcggcgctgg cggtcggtac ctctcacgcc gtgctcgact acgtcgtccc gtatgtgaaa 3506461 caacgccagg ctttcggcga gccgatcgct catcgccaag cggtggcgtt catgtgcgcc 3506521 aacatcgcga tcgagctcga cggcctgcgc ctgatcacct ggcgcggggc gtcccgtgcc 3506581 gagcagggtc tgccgttcgc aagggaagcg gcgctagcca agcggcttgg ctccgacaag 3506641 ggcatgcaga tcggcctgga cggggtgcaa ctgctgggcg gccacggcta caccaaggag 3506701 catccggttg agcgctggta ccgcgacctg cgagccatcg gcgtcgccga gggcgttgtt 3506761 gtcatctaga acgagctgaa agatcaatca tggcaataaa tctggaactg ccgcgcaagc 3506821 tgcaggcgat catcgtcaag acccatcagg gcgctgcgga gatgatgcgg ccgatagccc 3506881 gcaagtacga cctgaaggaa catgcctacc cggtcgaact cgacaccctg atcaatttgt 3506941 tcgagggcgc cgccgaatcg ttcaactttg ccggagccca ttcgcttcgc gacgaggacg 3507001 aaggcaagga cgaaaaccac aacggtgcca acatggccgc cgtggtacag acgatggagg 3507061 ccagctgggg cgacgtcgcg atgatgctgt cgctgcccta tcaggggctg ggtaacgcag 3507121 ccatctccgc ggtagccacc gacgagcagc tggagcggct gggcaaagtg tgggcagcga 3507181 tggccatcac cgaaccggaa ttcggatcgg actcggcggc agtgtcgacg accgccaccc 3507241 tcgacggcga cgagtacgtg atcaacggcg agaagatctt tgtcaccgcc ggttcccgcg 3507301 ccacccacat cgtggtctgg gccacgctgg acaaatcctt gggccgcccg gcgattaagt 3507361 cgttcatcgt gccccgtgag catcccggcg tgaccgtcga acgacttgaa cacaaactcg 3507421 gcatcaaggg ttctgatact gcggtgatcc ggttcgacaa cgcccgtatc cccaagggca 3507481 acctacttgg gaacccggaa atcgaggtcg gcaagggctt tgccggggtg atggagacct 3507541 tcgacaacac ccggccgatt gtggccgcca tggccgtcgg gatcggccgt gccgcactgg 3507601 aggaaatccg tagtgtcctc accggggccg gcgtggagat ctcctacgac aagccctcac 3507661 acacccagag cgccgcggcc gccgagttcc tgcggatgga ggccgactgg gaggccagct 3507721 acctactgtc cctgcgcgca gcctggcagg ccgacaacaa catccccaac tccaaagaag 3507781 cctcgatgag caaggccaag gcgggccgga tggccagcga cgtcacctgc aaaaccgtcg 3507841 aattggcagg aactaccggg tattccgagc aatcactgct ggagaagtgg gcccgcgact 3507901 ccaagatcct ggacatcttc gagggcaccc agcagatcca gcagctggtg gtcgcacgcc 3507961 gactgttggg cctgtcgtcg tccgagctca aatagcctcg gcgagcagac gtcaaagccc 3508021 ccgaatttca gtgaaatcgg gggcttttgc gtctgctggc gcccgtctgc acccccgcca 3508081 gtaggctggt cggcatgcgc gcggtacggg tgactcggct ggagggacca gatgcggtcg 3508141 aggtggccga ggtcgaggaa cccacgagcg ccggtgtggt catcgaggtg cacgctgccg 3508201 gcgtggcctt cccggacgca ctgctaaccc gtggccgtta ccagtaccgc ccggagccgc 3508261 cattcgtgct cggcgccgag atcgccggag tggttcgatc ggcgccggat aacagccaag 3508321 tgcgttccgg agacagggtt gtcggcctca cgatgctcac cggcggcatg gccgaagtcg 3508381 cggtattgtc gcccgagcgc gtgttcaagc tgccggacaa catgactttc gaggcgggcg 3508441 cgggcgtgct gttcaacgac ctgacggtgt acttcgcgct ggcggtccgg ggccggctgc 3508501 aggccggtga gacggtgctg gtgcacgggg cggcaggcgg gatcggcaca tcgacgttgc 3508561 gactagcgcc ggcgctcggg gcgtctcgca ccgtcgcggt ggtcagcacg caggagaagg 3508621 ccgagcttgc gacagtggcc ggggcgacag atgtggtgtt ggccgagggg ttcaaggacg 3508681 cggtacagga gctgacgaac ggccgtggtg tcgacatcgt cgtagacccg gtcggcggcg 3508741 accggttcac cgattcgctg cgctcgcttg ctgcgggagg acggctgttg gtcatcggct 3508801 tcactggcgg cgagattccc accgtgaagg taaaccgcct tctgctcaac aacattgacg 3508861 ttgtcggggt aggctggggc gcctggtcgc tgacccaccc cgatgcgctg gcccagcagt 3508921 ggtcacaact cgagcggctg ctacgctcgg gcaagctgcc tcctcccgaa ccagtggtct 3508981 acccactgga ccaagccgct gcggcgattg catcgctgga gaatcgcacc gccaagggga 3509041 aggtcgtact acgcgtgcgc gactaacgcc cctcccggga cgcgtcgccg gcgtgctctg 3509101 gccaatttgc cgcttcctca ctggtcgccg ttggcgtcgg ctacgtcatg ccgcacaact 3509161 cgcagcttgc ctggcgccag gcacgcggcg tatccgtggt atttgccata cagttcccat 3509221 gcggtgacgc gatcatcggg gtgcacgtcg atctgatgac cgtcggagaa ctcaagatgg 3509281 agatctccgg tgtcatacca gacgaaagct gtgcaggttg ccccggcgaa atcgaagagc 3509341 ggacgctcgt ggtcggctgg gtcgtttggg tcgatggcga ccacttctgc gggcgaggtt 3509401 tcgatggccg gcagagtcag ctgtagtggt accgagatga ccagctcgtt gtaatcgtcg 3509461 aagttcagca ccagaccgtc gcggaacata atccgctgaa ccgcacagcc ctctaaccac 3509521 tgctcggtca tttcctgttc ggtcatatat tcactctggc cttgttgtgc ccatatgtca 3509581 cgtacacaac cgccgaaatc tcgtgcggga ttacacccta ggcgtccgat ggacaccagt 3509641 accatctgac accgtgcccg actccagcac cgcattgcgg atcctcgtct acagcgacaa 3509701 cgtccagacc cgcgaacggg tgatgcgggc cctgggcaaa cggttgcacc cggatctgcc 3509761 cgatttgacc tacgtcgaag tggctaccgg tccgatggtg atacgccaga tggatcgggg 3509821 gggcatcgac ttggccatcc tcgacggtga ggcgacaccg accggaggca tgggaatcgc 3509881 caaacagctc aaagacgaac ttgccagttg cccgcccatc ctggtgctca ccggccgtcc 3509941 ggacgacacc tggctggcca gctggtcgcg ggccgaggcc gcagtgccgc atcccgtcga 3510001 ccccatcgtg ctgggccgca cggtgctctc actgttgcgc gcacccgccc actaaccgga 3510061 cgcggccggc attcgcggcg cgaacgttca gccgccccgc atttgaatct tcgggtcctt 3510121 tcttacccga ggtcgtaatt ggcccgctgc cgcttccggc cgcaacgacg gcgctgtctc 3510181 ctccgccgct gaagtctctg aagcctgctg accttgcgcg gtgcgtagtg tcgattccgg 3510241 aattccagaa cccgcggatt ggcctacccg cgttgtcgac agcggagcgg ccttggccgc 3510301 aactttcgga tccacagttg gcagcacccc cattgctgga acttcaagtt ctggaacttc 3510361 cacaacggct tccggtggcg cggaagccgc cggctctggc gctcgagctg actcggtggc 3510421 agttcccggg gcagagttag tgccgccacg tgccatctga cccagagcgg cgagcgcgag 3510481 cggggcaccg atgaagccgg ggctcacaac gccggcatgg gcactggtag cgctcgacgc 3510541 gacgacgtct ccggcgccaa catcgccacc gccgaaattc ccttggccta cactgccatg 3510601 ggccggatca ccggccgcta acccggcgct agccacgccg cttccgacgt agccgacgcc 3510661 cccgccgcta gcggttgcac ctccggtgcc gacgctctca ccgccgccgg tgccgacgct 3510721 ctcgccgcca gtagcgccgg tgccgacgct ctcaccgcca gtagcgccgg tggcgccggc 3510781 acccaaaccc ggaaatcgct gcagcaaact cgcccacggc accacttgcg cggcaatcgc 3510841 cgatgccccg gagtagtacc ccgacatggc ggccacatcc gcggcccaca tctcctcgta 3510901 cacaccctcg gcggcagcaa tcaacggcgc gttctgcccg aacaaattcg tcatcaccag 3510961 ctgcacgaat gcgtcgcggt tggcggccac cgccgccgga agcaccgtcg ccgcctgcgc 3511021 cgcctcgaag atgctggcca ctgcgcgcgc ctgtcccgcc gccccggccg actgagccgc 3511081 tgccgcggtc aaccaccccg cgtagggagc cgccgctgcc gccatcgcca aggctgccgg 3511141 accctgccac gcctgacccg ccagcccggc cgtgaccgac gcaaatgatt gcgccgcggt 3511201 ccccaactct tcggcaagcc cgtcccaggc tgccgccgcc gccagcatcg gcgcagtgcc 3511261 tgcaccgatg aacattcgca aggaattgat ctccggcggc agcacgacga aactcacagc 3511321 tcccgtcctt ccgcttcgct gctcgatgcc acgccgacct caatacggcc aacgattaac 3511381 cggcaaatgc cgagattaac aacaaatgct gcgcttatca gggggttaga ccaacattca 3511441 tacaattcgc cgggacgcgc aatccccagt tttgcttcgc agcgaccgac gccggaccca 3511501 gccacgggtt ctgcttcgac tcgcacaggt atgcaccagc ctgaccccgg gaatgtgggg 3511561 tggccgttgc gcgactatgt tgaaggtcac tgtgacggcc cgaagccccg gttcgtcacg 3511621 gcagcccggt caccgcccgg ccgccgcgct ggcggccccg tacgacggat catggagcga 3511681 gttgaacgtc tacataccca tcctggtact ggcggcgctg gccgccgcct tcgccgtggt 3511741 gtcggtggtg atcgcgagcc tggtcggccc gtcgcggttc aaccggtcaa agcaggccgc 3511801 ctacgaatgc gggatcgagc ccgctagcac tggagccaga acctccattg gccccggcgc 3511861 ggcgagcggg cagcggttcc ccatcaagta ctacctgacc gcgatgttgt tcatcgtctt 3511921 cgacatcgaa attgtgttcc tctacccgtg ggcggtcagc tacgactcgc tgggcacgtt 3511981 cgcgctggtc gagatggcga tattcatgct cacggtgttc gtggcctacg cgtatgtgtg 3512041 gcgccgcggg ggcctgacgt gggattgagg tagggcgtgg gactggaaga acagctgccc 3512101 ggcgggatcc tgctgtcgac cgtcgagaag gtggcgggct atgtccgcaa aaactccctg 3512161 tggccggcaa cattcggatt ggcgtgctgt gcgatcgaga tgatggcgac cgcgggacca 3512221 aggtttgaca ttgcgcggtt cgggatggaa cggttctcgg ccacgccgcg gcaggcagat 3512281 ctgatgatcg tggcgggccg ggtcagccag aagatggcgc cggtactgcg ccagatctat 3512341 gaccagatgg cggagccgaa atgggttctg gccatgggtg tgtgcgcctc gtcaggtggg 3512401 atgttcaaca actatgcgat cgtgcagggc gtggatcatg ttgttccggt cgacatctac 3512461 ctacccggct gcccgccgcg cccggagatg ctgctgcacg caatcctgaa gctgcacgaa 3512521 aagattcagc agatgccatt aggtatcaac cgggaacgcg ctatcgccga ggccgaagag 3512581 gcggcgttgt tggcccggcc caccatcgag atgcgcggac tgctgcgatg agcccgccga 3512641 accaagacgc ccaggaaggc cgcccggact cccccaccgc ggaggtggtc gacgttcgcc 3512701 gcggcatgtt cggcgtctcg ggcaccggtg acacctccgg ttacggacgg ttggtgcgcc 3512761 aagtcgtcct ccctggcagc agcccccggc cctacggcgg ctacttcgac gatatcgtcg 3512821 accggctggc cgaggcactg cggcacgagc gcgtcgaatt cgaggacgcc gtcgagaaag 3512881 tcgtggtcta ccgcgatgaa ctgaccctgc acgtccgccg ggatctactg ccgcgggtcg 3512941 cccagcggct gcgcgacgaa cccgaattgc gattcgagct gtgtcttggg gtgagcgggg 3513001 tgcactaccc gcacgagacg ggtcgggagc tgcatgccgt ctacccgctg cagtcgatca 3513061 cccacaaccg tcgcctccgg ttggaagtgt ctgcgccgga cagtgatccg cacatccctt 3513121 ccctgttcgc gatctatccg accaacgact ggcacgagcg ggaaacctac gacttcttcg 3513181 ggatcatctt cgacggccat ccggccctga cccggatcga gatgcccgat gactggcagg 3513241 ggcatccgca acgcaaggac taccctctcg gcggcatccc ggtcgaatac aagggcgcgc 3513301 agataccccc gcccgacgag cggaggggct acaactgatg acggcaatcg ccgactcggc 3513361 tggcggcgcc ggcgagaccg tcctggtcgc tggcgggcag gactggcagc aggtcgtgga 3513421 cgccgcgcgc agcgcggatc ccggtgaacg catcgtcgtc aacatggggc cccagcaccc 3513481 gtctacccac ggggtgttgc ggttaatcct ggagatcgag ggcgaaacag tcgtcgaagc 3513541 ccggtgcgga atcggctacc tgcacaccgg aatcgagaag aacctcgaat accggtactg 3513601 gacccagggc gtcaccttcg tgacccgaat ggattacctg tcaccgtttt tcaacgaaac 3513661 cgcctactgc ctcggcgtgg agaagctgct cggcatcacc gatgagatac ccgagcgggt 3513721 caacgtcatc cgcgtgctga tgatggagct caaccggatc tcgtcgcatt tggtcgcatt 3513781 ggcgaccggg ggcatggaat tgggcgccat gactccgatg ttcgtcggct tccgggcacg 3513841 cgagatcgtg ctcacgctgt tcgaaaagat caccggtttg cggatgaaca gcgcctacat 3513901 ccgacccggc ggcgtggcgc aggacttacc gcccaacgcg gccaccgaaa tcgcggaagc 3513961 actcaagcag ttgcgccaac cactgcgcga aatgggcgag ctgctcaacg aaaacgccat 3514021 ctggaaggcc cgcacccagg gcgtcggata cctggatctg accggatgca tggcactggg 3514081 catcaccggc ccgatactgc gttccactgg gttgccccac gacctgcgga aaagcgagcc 3514141 ctactgcgga taccagcact atgaattcga tgtgatcacc gacgacagct gtgatgccta 3514201 cgggcgctac atgattcgcg tcaaagagat gtgggagtcg atgaagatcg tggagcagtg 3514261 tctggacaag ttacgacccg gcccgaccat gatctccgat cgcaagctcg cctggccggc 3514321 cgacctgcag gtggggcccg acggcctggg caactcaccc aagcacatcg ccaaaatcat 3514381 gggctcctcg atggaagcgc tgatccacca cttcaaactg gtcaccgagg gcatccgggt 3514441 gccggcgggc caggtctacg tcgcggtgga gtccccccgt ggtgagctcg gcgtacacat 3514501 ggtcagcgac ggtggcaccc gcccctaccg ggtgcactac cgggatccct ccttcaccaa 3514561 cctgcagtcc gtcgccgcga tgtgcgaagg cgggatggtc gccgatttga tcgcggcggt 3514621 cgccagcatt gacccggtca tgggcggggt ggaccggtga cacagccacc cggtcagccg 3514681 gtgttcatcc ggctcggacc gccaccggac gaacccaacc agtttgtcgt cgagggcgct 3514741 ccgcggtcgt atccgccgga cgtactggcg cggctggagg tcgacgccaa ggagatcatc 3514801 ggccgctatc ccgacaggcg ctcggcgctg ttgccgttgc tgcacctggt gcagggcgag 3514861 gattcctacc tgacgccggc gggtttgcgg ttctgcgccg atcaactcgg gctgaccggg 3514921 gccgaggtgt cggcggtggc cagcttctac accatgtacc gccggcgccc caccggcgag 3514981 tacctggtgg gtgtgtgcac gaacacgctg tgcgccgtca tgggcggcga cgccatcttc 3515041 gaccgcctca aagagcatct cggcgtcggc cacgacgaaa ccacctccga cggtgtggtc 3515101 accttgcaac acatcgaatg caacgccgcc tgcgattacg caccggtggt gatggtcaac 3515161 tgggaattct tcgacaacca gacgccggag tccgcgcgcg aactcgtcga ctcgctgcgc 3515221 tccgacacac cgaaggcgcc cacccgcggc gcgccgctgt gcggcttccg gcaaacatcg 3515281 cgcatcctgg cgggtctacc cgaccagcgt cccgacgaag gccagggcgg tcccggcgcg 3515341 cccaccctgg ccgggctgca ggtggcaagg aagaacgaca tgcaggcgcc accaaccccc 3515401 ggagcggacg aatgaccacg caggccaccc cgttgacccc ggtgatcagc cgccactggg 3515461 acgacccgga gtcgtggacc ctggccactt atcaacgcca cgatcgctat cggggctatc 3515521 aggcgttgca gaaagccctg acgatgccgc ccgacgacgt gatcagcatc gtcaaggatt 3515581 ccgggttacg cggacgcggc ggcgcgggct ttgccaccgg gaccaagtgg tcgttcatcc 3515641 cgcagggcga caccggcgcc gcggccaagc cgcactacct ggtggtcaac gccgacgagt 3515701 ccgaacccgg tacgtgcaaa gacattccgt tgatgctggc gacgccacat gtgctcatcg 3515761 aaggcgtcat catcgccgcc tacgcgatcc gcgcccatca cgcgttcgtc tacgtacgcg 3515821 gtgaggtggt gccggtattg cgccggctgc acaacgcggt ggccgaggcc tatgccgccg 3515881 gcttcctagg ccgcaacatc ggaggttccg gattcgatct ggagctggtg gtacacgccg 3515941 gcgcgggcgc ctacatctgc ggcgaggaga ccgccctgct cgactcgctg gaaggccggc 3516001 gcggccagcc gcggctgcgg ccccccttcc ccgcggtggc cggtctgtat ggctgcccga 3516061 ccgtgatcaa caacgtcgaa acgatcgcca gtgtcccatc gatcatcctg ggcggcatcg 3516121 actggttccg gtcgatgggc agcgagaaat cgcctggctt caccctgtat tcgctgtccg 3516181 gccacgtcac ccgccccggc cagtacgagg cgccgctggg cattacgctg cgcgagttgc 3516241 tcgactacgc aggcggggtg cgcgccgggc accggctgaa gttctggaca ccgggcggct 3516301 cgtcgacccc gctgctcacc gacgagcatc tggatgtgcc gctggactac gagggtgtgg 3516361 gtgcggccgg ctcgatgctg gggaccaagg cgctggagat cttcgacgag accacctgcg 3516421 tggtgcgcgc ggtgcgccgc tggaccgagt tctacaagca cgaatcgtgt gggaaatgca 3516481 cgccgtgccg ggagggcacc ttctggctgg ataagatcta cgagcggctg gaaaccggcc 3516541 ggggtagcca tgaagacatt gacaaactgt tggacatttc cgattccatc ttgggaaagt 3516601 cgttctgcgc gttgggcgac ggtgccgcga gtccggtgat gtcgtcgatc aagcacttcc 3516661 gcgacgagta cctggcccac gtcgaaggag gcggttgccc attcgacccc cgagactcca 3516721 tgctcgtcgc gaacggagtg gacgcgtgac ccaggcggcc gacactgaca tccgggtagg 3516781 ccaaccggag atggtgacac tgaccatcga cggcgtcgaa atcagcgtcc ccaagggcac 3516841 gttggtgatt cgcgccgccg aactgatggg aatccagatc ccgcgattct gcgaccaccc 3516901 gctgctggag cccgtcggcg cctgccggca atgcctggtc gaggtcgaag ggcaacgcaa 3516961 gccgctggcg tcgtgcacca ccgtggccac cgacgacatg gtggtgcgca cccaactcac 3517021 ctccgagatt gccgacaagg cccagcacgg tgtgatggaa ctgctgctga tcaaccatcc 3517081 gctggattgc ccgatgtgcg acaagggcgg tgaatgcccg ctgcaaaacc aggcaatgtc 3517141 taacggccgc acggattctc gcttcaccga ggccaaacgt accttcgcca aaccgatcaa 3517201 catctccgcg caggtgctgc tggaccgcga acgttgcatc ctgtgcgccc gctgcacccg 3517261 gttctccgac cagatcgccg gcgatccgtt catcgatatg caggagcgcg gcgccctgca 3517321 gcaggtcggt atctacgccg atgaaccgtt cgagtcgtac ttctccggca acacggtgca 3517381 gatctgcccg gtgggggcgc taacggggac cgcctaccgg ttccgcgcgc gtccgttcga 3517441 tttggtctcc agccccagcg tctgcgagca ctgcgcgtcg ggctgcgcgc aacgcaccga 3517501 ccatcgccgc ggcaaggtgc tgcggcggct ggccggtgac gacccggaag tcaacgagga 3517561 gtggaactgc gacaagggcc ggtgggcctt cacgtacgcg acccagccgg acgtgatcac 3517621 cactcccctg atccgcgacg gtggggaccc caagggcgcg ctggtgccca cctcgtggtc 3517681 gcacgcaatg gcggtggccg cccagggact ggcggcagcg cggggccgca ccggggtgct 3517741 ggtcggcggc cgagtgacct gggaggacgc ctacgcgtac gccaagttcg cgcggatcac 3517801 gttgggcacc aacgacatcg acttccgcgc ccggccgcac tcggccgagg aggccgactt 3517861 cctggcggcc cgcatcgccg ggcggcatat ggcggtcagc tatgccgatt tggaatcggc 3517921 tccggtggtg ctgctggtgg gattcgagcc cgaagacgag tcgccgatcg tgtttctgcg 3517981 gttacgcaag gccgctcgca gacaccgcgt cccggtgtac acgatcgccc cctttgccac 3518041 tggtggcctg cacaaaatgt cgggccggct gatcaaaacc gttcctggtg gcgaacccgc 3518101 ggcgctggac gatctggcca ccggtgcagt gggcgacctg ctggccaccc cgggcgcggt 3518161 catcatagtc ggggagcgct tggccacggt accgggcgga ttgtcggcgg ccgctcggct 3518221 ggccgatacg accggcgccc gtttggcgtg ggtgccgcgg cgggcggggg aacgcggagc 3518281 gctggaagcc ggagcgttgc ccacgctgtt acccggtggc cgcccgctgg ccgacgaggt 3518341 cgcccgcgcg caggtgtgtg cggcgtggca tatcgccgaa ttgcctgccg cggctggacg 3518401 ggacgccgac ggcatcctgg ccgccgctgc cgacgagacg ttggctgcgc tgctggtcgg 3518461 gggtatcgaa cccgcggact tcgccgaccc ggacgccgtg ctggccgcgt tggacgccac 3518521 cggtttcgtg gtcagcctgg agctgcgaca cagtacggtc accgaacgcg ccgacgtggt 3518581 gttcccggtc gcgccgacga cccagaaagc cggcgcgttc gtcaactggg agggtcgcta 3518641 ccgtacattc gaacccgcgc tgcgcggcag cacactgcaa gctggccagt cggatcaccg 3518701 ggtgctggac gcgttggccg acgacatggg tgtccatctg ggcgtgccca ccgtggaggc 3518761 ggcccgcgag gagctggccg cgctcggtat ctgggacggc aaacacgctg ccggtcccca 3518821 catcgcggcc accgggccga cccaacccga agctggtgag gcgatcttga ccgggtggcg 3518881 gatgctcctc gacgagggcc gcctgcagga cggcgaacca tatctggccg gtaccgcgcg 3518941 cacacccgtg gtacggctgt cgccggatac ggcagccgag atcggcgccg ccgatggcga 3519001 ggcggtcacg gtcagcacgt cacgcggctc aatcaccttg ccgtgcagtg tcaccgacat 3519061 gcccgaccgc gtcgtgtggc ttccgctgaa ctcggcgggc tcgacggtgc accgacagct 3519121 gagggtgaca atcggcagca tcgtgaaaat cggagcgggc tcatgagcgt ctccccttgc 3519181 cgcgagcgcg cgtgttcccc cgcaagcggg aggtgccccc agtacgccga cacaccgatt 3519241 ttgatgtacc agtgcggacc ctcgcgcaag gagtggcggc catgaccacg ttcggccacg 3519301 acacctggtg gctggtggcg gccaaagcga tcgcggtatt cgtgttcctc atgctgacgg 3519361 tgctggtggc gatcctggcc gaacgcaagc tgctgggccg gatgcagttg cggcccggcc 3519421 ccaaccgggt tggcccaaaa ggagccctgc agagcctggc tgacggcatc aagctggcgc 3519481 tcaaagagag catcacaccc ggtggcatcg atcgattcgt atattttgtg gcgccgatca 3519541 tttcggtgat tccggcattc accgctttcg cgttcatccc gtttggtccc gaggtgtcgg 3519601 tgtttggcca ccggacaccg ttgcagataa ccgaccttcc cgtcgccgtg ctgttcatcc 3519661 tgggactgtc ggcgatcggg gtatacggca tcgtgctggg cggttgggcg tccgggtcca 3519721 cctacccgct gctgggcggg gtgcgctcca ccgcgcaggt catctcctac gaggtcgcga 3519781 tgggcctgtc gttcgcgacg gtgttcctta tggccggcac catgtcgacg tcgcagatcg 3519841 tggccgcaca agacggtgtc tggtatgcct tcctgttgtt gccgtcattc gtcatctatc 3519901 tcatttctat ggtgggtgaa accaaccggg cgccgttcga tttgcccgaa gccgagggcg 3519961 agctggtcgc gggattccac accgagtact cgtcgttgaa gttcgcgatg ttcatgctcg 3520021 ccgagtacgt caatatgact acggtttcgg cactggccgc gaccctattc ttcggtggct 3520081 ggcatgctcc ctggccgctg aacatgtggg cgagcgccaa caccggctgg tggccactga 3520141 tctggttcac cgctaaagtg tggggctttc tgttcatcta tttctggctg cgggctacgc 3520201 tgccgcggct gcgctacgac cagttcatgg cgctgggctg gaagttattg atccccgtct 3520261 cgctggtgtg ggtgatggtc gccgcgatca tccgctcact acgcaaccag ggctaccagt 3520321 actggacccc gactctggtg tttagcagca ttgtcgttgc cgctgccatg gtgctgttgt 3520381 tgcgaaagcc gttgagcgct cccggcgctc gcgcatcggc acggcaacgc ggggacgaag 3520441 gcaccagccc tgaaccggca tttccgacac caccgctgct agccggtgca accaaggaga 3520501 atgcaggtgg ctaacactga tcgtccggct ctcccccaca agcgggcggt acccccatct 3520561 cgggctgact ccggcccgcg tcgtcgccgg actaagttac tggacgccgt agccggattc 3520621 ggggtaacgc ttggttcgat gttcaaaaag acggtcaccg aggagtatcc ggaaaggccc 3520681 ggtccggtag cagcgcgcta ccacggccgt catcagctca accggtatcc ggacggcctg 3520741 gagaaatgca tcggctgcga gttgtgcgcc tgggcctgcc cggccgacgc aatctatgtc 3520801 gagggcgcgg acaataccga agaggagcgg ttttcgccgg gcgaacgcta cggccgggtg 3520861 taccagatta actatttgcg ttgcatcggt tgcggtttgt gcatcgaggc gtgcccgacg 3520921 cgggcgctga cgatgaccta tgattacgaa ctggccgacg acaaccgcgc cgacctgatc 3520981 tacgagaagg accggctgct ggccccgctg ctgcccgaga tggccgcgcc gccgcatccg 3521041 cggacgcccg gtgccaccga taaggactac tacctaggca atgtgaccgc cgagggcttg 3521101 cggggcgtgc gtgagagcca gaccaccgga gattcccgat gaccgcggtg ctggcttcag 3521161 atgtcatcgt ccgcacctcc accggggaag cggtgatgtt ctgggtgctc agtgcgttgg 3521221 cgctgctggg cgcggtcggg gttgtgctgg ccgtcaacgc cgtgtactca gcgatgtttc 3521281 tggcgatgac catgatcatc ctggcggtgt tctacatggc ccaggacgcg ctgtttttgg 3521341 gtgtcgtcca ggtggttgtc tacaccggcg cggtgatgat gctgttcctg ttcgtgctga 3521401 tgctgatcgg tgtggactcc gcggaatcac tgaaggagac gctgcgcggg cagcgggtcg 3521461 ccgcggtgct gaccggtgtc gggttcggcg ttctcctgat cagcaccatc ggccaggtgg 3521521 cgacccgagg ttttgccgga ctaaccgtcg ccaacgccaa cggcaacgtc gaaggcttgg 3521581 ccgcgctgat tttttcccgt tacctgtggg cgttcgagtt gaccagtgcg ctgttgatta 3521641 ccgccgccgt cggggcgatg gtgctagcgc accgggagcg tttcgagcgc cgcaagaccc 3521701 agcgcgaact ctcccaggaa cgcttccgtc ccggcgggca ccccaccccg ctgcccaacc 3521761 cgggtgtcta cgcgcgccac aacgcggtcg acgttgccgc cctgctcccc gacggttcct 3521821 attccgaatt gtcggtcccc cggatgctgc gcacccgcgg ggccgacggc ctgcaaacac 3521881 cctcgcccgg agccgtctcc ggctctttag aaggcggtgc atcatgaatc cggccaacta 3521941 cctttatctt tcggtgctgc tattcaccat cggagcctcc ggtgtgctgc tgcgacgcaa 3522001 cgcgatcgtg atgttcatgt gcgtcgagct catgctcaat gccgttaacc tggcgttcgt 3522061 caccttcgcg cgcatgcatg gccatctcga cgcccagatg atcgcgttct tcaccatggt 3522121 ggtggccgcc tgcgaagtgg tcgtcggcct ggccatcatc atgacgattt tccgtacccg 3522181 caaatcggcg tcggtcgacg acgcgaatct actcaaaggc tgacgacgcc accgtgacaa 3522241 cttccttggg gactcactac acctggctgc tggtggcact gccactggcg ggtgccgcaa 3522301 tcttgctgtt cggcggcaga cgcaccgatg cgtggggcca cctgctgggc tgtgccgcag 3522361 cgctggcggc attcggggtg ggcgcgatgc tgctggccga catgctcggt cgcgatgggc 3522421 tcgagcgcgc gatccatcag caggtgttca cctggatacc cgccggcgga ctccaagtcg 3522481 acttcgggct gcagatcgat cagttgtcca tgtgcttcgt gctgctgatc tccggggtcg 3522541 gatcgctgat tcacatctat tcggtcggct acatggccga ggacccggac cggcgcaggt 3522601 ttttcggcta tctcaacctg tttctggcct cgatgctgct gctggtggtc gccgacaact 3522661 atgtgttgct gtacgtcggc tgggagggtg tgggcctggc gtcgtatctg ttgatcggtt 3522721 tctggtacca caagccgtcg gcggccaccg cggccaaaaa ggcattcgtg atgaaccggg 3522781 ttggggacgc cggcctagcg gtgggtatgt tcttgacgtt tagcactttc ggcaccctgt 3522841 cgtatgccgg cgtgttcgcc ggcgtacccg ccgcaagtcg cgcagtgctg accgcgatcg 3522901 ggttgttgat gctgttgggg gcgtgcgcca agtccgcgca ggttccgctg caagcctggc 3522961 ttggcgacgc gatggagggc cccaccccgg tgtccgcgct gatccacgcc gccaccatgg 3523021 tgaccgccgg agtgtatttg attgtgcggt cgggcccgct gtacaacctg gcgcccaccg 3523081 cccaactggc ggtcgtcatc gtcggcgcgg tgacgctgct gtttggggcg atcatcggct 3523141 gcgccaagga cgacatcaaa cgtgcgctgg cagcctcgac cattagccag atcggctaca 3523201 tggtgctggc cgcgggcctg ggtccggccg gctacgcgtt tgcgatcatg catctgctca 3523261 ctcacggttt cttcaaggcc ggcctattcc ttgggtccgg cgcggtgatt cacgcgatgc 3523321 acgaagagca ggacatgcgc cgttacggtg gtctgcgcgc cgccctgccg gtcacgttcg 3523381 caaccttcgg cctggcgtat ctggcgatta tcggggtacc gccgttcgcg ggcttcttct 3523441 ccaaggatgc gatcatcgag gccgcattgg gcgccggcgg catccggggc tcgctgctgg 3523501 gcggtgccgc gctgctgggt gcgggcgtca ccgcgttcta catgacgcga gtgatgctga 3523561 tgaccttctt cggcgaaaag cgttggacgc caggcgccca tccgcacgag gcaccggccg 3523621 tgatgacctg gccgatgatc ttgctcgccg tcggctcggt gttctccggt ggcctgctcg 3523681 cggtgggtgg cacgttgcgg cattggctgc agccagttgt cggatctcat gaagaggcca 3523741 cccatgcgct gccgacctgg gtcgccacca ccctggcgct cggtgtggtc gccgtcggta 3523801 tcgcggtggc ctaccggatg tacggcaccg cgccgatccc gagggttgcc ccggttcggg 3523861 tgtcggcgct gaccgcggcc gcacgtgcgg acctgtacgg cgatgccttc aacgaggagg 3523921 tgttcatgcg ccctggtgcg caattgacca acgcggtggt cgcggtggac gacgcgggtg 3523981 tggacggctc ggttaacgcg ctggcgacgc tcgtgagcca gacttcgaat cgcctgcggc 3524041 aaatgcaaac cggcttcgcc cgtaactacg cgttatcgat gctggtagga gcggtgttag 3524101 tggcggcggc gctgctggtg gtgcagctgt ggtgaataac gtgccgtggc tgagcgtgct 3524161 ctggctggtg ccgctggcag gtgcggtgct gatcatcctg ctaccacccg gtcggcgccg 3524221 actcgccaag tgggccggta tggttgtcag cgtcctgacg ttggcggtgt cgatcgtcgt 3524281 cgcggccgaa ttcaagccca gcgccgagcc gtatcagttc gtcgaaaagc attcctggat 3524341 accggcgttc ggcgccggct atacccttgg tgtggacggc atcgcagtgg tgctggtgtt 3524401 gttgaccaca gtgctgattc cgttgctgct ggtggccggc tggaacgacg caaccgatgc 3524461 tgacgacctg tcccccgcaa gcgggaggta cccccagcgc ccggctccgc cgcgcttgcg 3524521 atcgtcaggt ggcgaacgca cccgaggcgt gcacgcctac gtggcattga cgctggccat 3524581 cgagtcgatg gtgctgatgt cggtgatcgc gctggacgtg ctgctgttct acgtgttctt 3524641 cgaggccatg ctgatcccga tgtacttcct catcggcggc ttcggccagg gggccggacg 3524701 ctcgcgtgcc gcggtgaagt tcttgctgta caacctgttt ggcgggttga tcatgctggc 3524761 ggcggtgatc gggctgtatg tggtgaccgc acagtacgat tcgggcacct tcgacttccg 3524821 tgagatcgtg gccggcgtgg cggcgggccg ctacggagcg gacccggcgg tgttcaaggc 3524881 gctgttcttg ggcttcatgt tcgcgttcgc gatcaaggct ccgctgtggc cgttccatcg 3524941 ctggctgccg gacgccgccg tcgagtccac cccagcgacc gcggtgctga tgatggcggt 3525001 gatggacaag gtcggcacct tcggcatgct gcgctactgc ctgcagctgt ttcctgaccc 3525061 gtcaacgtat ttccgtccgc tgatcgtgac gctggccatc atcggggtga tctacggcgc 3525121 gatcgtggcg atcggccaaa ccgacatgat gcggctgatc gcctacacct cgatctcgca 3525181 cttcgggttc atcatcgcag gcatcttcgt catgaccacc cagggccaga gcgggtcgac 3525241 gctgtacatg ctcaaccacg gcctgtccac ggcggcggtg ttcctgatcg ccggtttctt 3525301 gatagcgcgg cgcggcagcc gatcgatcgc cgactacggc ggtgtccaga aggtggcgcc 3525361 catcctggcc ggcacgttca tggtctcggc catggccacc gtatcgctgc ccggcctagc 3525421 cccgtttatc agcgaattcc tggttctgct gggcactttc agccgctact ggctggcggc 3525481 ggcgttcggc gttaccgcac tggtcctctc ggccgtttac atgctgtggc tctaccagcg 3525541 ggtgatgacc ggtccggtag ccgaaggcaa cgaacgcata ggggatctgg tgggccgcga 3525601 gatgatcgtg gtggcaccgt tgatcgcgct gttactcgtg cttggggtct accccaaacc 3525661 tgtgctcgac atcatcaatc cggcggtcga gaacaccatg accaccatcg gccagcatga 3525721 tcccgcgccc agcgtggcac acccggttcc ggccgtgggc gcctcccgga cagccgaagg 3525781 accgcaccca tgatcctgcc cgccccgcac gtcgagtact tcctgctcgc tccgatgctc 3525841 atcgtctttt cggttgcggt cgccggtgtg ctggccgagg ctttcctgcc gcgccggtgg 3525901 cgctatggcg cccaagtgac gctcgccctt ggcgggtcgg cagtggcact catcgcggtc 3525961 atcgtggtgg ccaggtcgat tcacgggtcg ggtcacgccg cggtgctggg ggccatagcc 3526021 gtggatcgag cgaccctgtt tctgcaaggc accgtactac tggtcacgat catggcagtc 3526081 gtcttcatgg ccgaacgcag cgcccgggtg agtccgcaac gccagaacac cctcgctgtg 3526141 gcgcggctcc ctggactcga ttcgtttacc ccgcaggctt ccgccgtgcc cggcagcgat 3526201 gctgagcgcc aagcggaacg ggcgggagcc acccagacgg aacttttccc gctggcgatg 3526261 ctgtccgtcg gcggcatgat ggtgtttccc gcgtccaacg acctgttgac gatgttcgtt 3526321 gcgctggagg tgctatcgct gccgctgtac ctgatgtgtg ggctggcccg gaatcgccgc 3526381 ctgctgtcgc aggaagccgc gatgaagtac ttcctgctgg gcgccttctc gtcggcgttc 3526441 ttcctctacg gcgtcgcgtt gctatacggc gcgaccggca cgctgacctt gccgggtatt 3526501 cgggatgcgt tggcagcgcg caccgacgac tcaatggcgt tggccggcgt cgcgctgctc 3526561 gcggtcggcc tactattcaa ggtcggcgcg gtgccattcc actcctggat tcccgatgtg 3526621 taccagggcg cacccacccc gatcaccggg ttcatggcgg ccgccaccaa ggtcgcggcg 3526681 ttcggtgcgc tgctccgggt ggtctatgtc gcgctgccgc cgctgcacga tcagtggcgc 3526741 ccggtgctgt gggcgattgc catcctcacc atgacggtgg gcaccgtcac cgcggtaaac 3526801 cagaccaacg tcaagcgtat gctggcctat tcatcggtcg cgcacgtcgg tttcatactt 3526861 accggcgtga tcgccgataa tccggcgggt ctttccgcga cgttgttcta tctggtcgcc 3526921 tacagcttca gcacgatggg tgcgtttgcc atcgtgggtc tggtccgagg cgccgacggc 3526981 tcagcaggtt cagaggatgc cgacctgtcc cactgggccg ggctgggaca gcgttcacct 3527041 atcgtgggcg tgatgctgtc gatgtttctg ctggccttcg ccggcatccc gttgaccagt 3527101 ggattcgtca gcaagttcgc ggtgtttagg gccgccgctt ccgccggcgc ggtgccgctg 3527161 gtaatcgtcg gcgtgatctc cagcggcgtc gccgcctact tctacgtgcg ggtgatcgtg 3527221 agcatgttct tcaccgaaga atccggtgac acaccacacg tggcggcacc cggcgtgctg 3527281 agcaaggccg ccattgcggt atgcacggta gtcaccgtgg tgctggggat cgccccgcag 3527341 ccggtgctcg acctggccga ccaggccgcc cagttgctgc gctgaatccg ttagggctga 3527401 ccgaagaagc ccgactggtc actgccctga ttgaagcccc ccgagctgtg gtcacccgtg 3527461 ttcgccacac ccgtgttgag ggtgcccgag ttcgcaatgc ctgtggtctg caggccagag 3527521 tttgcgatgc ccacggtgcc ggcacccgag ttatagaagc cgacgttgaa gccgccggag 3527581 ttggtgttat tgatgcccga ctgaacgtca ccgttgttcc catagccagc cgaaacattg 3527641 cccgtgttaa agaagcctga ggaattcatg ccggtgttgc cgaagcccga gctcgaaacg 3527701 gattggtcga ccgagcttcc aaacccggtg ttccggtcgc ccgagtcgaa accgcccgta 3527761 ttgatgctgc ccgagttcgc gaatcccgta ttgatactgc ccgcgtttgc gaagcccacg 3527821 tttagggtgc ccgcgttgcc aaagcccaca ctttggttgc ccgcattgcc aacgcccacg 3527881 ttaaaggaac cgccgttccc gacgcccatg tcttcgttgc ccgcgttccc gatgcccata 3527941 ttgaagaagc cggcgtttcc gaagcccgtg ttggtgtcgc cggcgtttcc gaagcccgtg 3528001 ttgatgtcgc ccgcgtttcc aaagccgaag ttgttgttgc ccgaattgaa gaagcccacg 3528061 ttgttgttgc cagagttgaa gaaaccgatg ttgttgttac ccgagttccc gaaacctaga 3528121 ttcccgatgc ccgagttcag cgcgccaatg cccaccaagt tgtcgccggt gagcccaaaa 3528181 ccgatgttgt tgttgccatt gttcccgagg ccgaggttat tgtcgccgtt gtttccgaaa 3528241 ccgatgttgg aggagccgat gtttccactg cccaagttga aggaaccgag atttccgccg 3528301 ccgaagttgg tacttccggt gtttccactg cccaggttcc cactgccaaa gtttccgttg 3528361 ccgaggtttc caaagcctcg gtttccgctg cccagattga cattgccaac gtttccgctg 3528421 ccgagattgg tgttgccgat atttccgctg cccaaattcg tggcaccgtc atttccgctg 3528481 cccacattgg cgttgccgga gtttccgcta cctacgttgg cgttgccgga atttccgctg 3528541 cccagattgt agtcaccggt gttcccgccg cccaggttcc cgacgccgat gttgccgagg 3528601 ccgatcgcgg cggccagcgc cgatggcgca gctggcaacg cctgctgcag accaattgac 3528661 cacgacgaca gctgcgccgc ggccgccgat gccccgccgt gatagcccac catcgcggcc 3528721 acatcggcgg cccacatctg ttcatacatc gcctcagcgg ccgcgatcgc cggcgcattc 3528781 tgcccaaaca gattcgacaa caccaactgc acaaacgcat tacggttggc cgccaccagc 3528841 atcggatgca ccgtcgccgc ccgcgccgcc tcaaacgcac tggccaccgc cttggcctga 3528901 gccgacgcgc cagcggcccg cgccgccgca gcagccaacc accccgcata cggcgccgcc 3528961 gccgcggcca tcgccgccgc cgccgcaccc tgccaggact gacccgccaa ccccgaagtc 3529021 accgacccaa acgaggacgc cgccaccgcc aactccgcgg ccaaacgatc ccaagccacc 3529081 gatgccgcaa gcatcggcgc agaccccgca ccggtaaaca tccgcaacga attaatctcc 3529141 ggcggcaaca ccgaataatt catcagccca gccccttccc ctacaggacg tcccggccaa 3529201 tgactcaggc aacggtgcac gtctctgtac tcgtagaaca aactgtagga aaacggcgcg 3529261 acgaataacg gcgatttcgt gaaaattctg gttcccgtca gaagcacgcc accctcggcc 3529321 acctcgtttg cgcacgccta gagcccgcgg tcggggggtg cggtctggat ctccaaagca 3529381 tctgctgctg cccggatctc ggctagccga tcagggtccg acaacagcgc cgtcatcgcg 3529441 aactcgacga tctcgtccgg gctgaccgcg aattccggca gcgccagagc gtcaaacaac 3529501 gcctgcacca gcctggcagc cgataacgga tgcattgcgc gcacatcacc ttcgccttgg 3529561 ccggtctcga tcaggccaac cagcgcgcgt tccatctccg cgactagctc ccgctccgcg 3529621 acgaaggatt cctgatgcag gtccggggtg atgaggatgg aaaccagcac atagggcgaa 3529681 gcatgcaggt ggtccaggga ttccgtcagc cagcggtgca gcttgaccac cgccggaacc 3529741 ggcatcgcgg tgatgtgacc gaacagctca agcggccact ccacggcgag ccgcaccagg 3529801 gccgcaagga tatcgcgttt ggccgagaag tgtttgtaga tggccggctg ctccaccccg 3529861 acggctgcgg caatgtctcg cgtcgaggtg gagctgtaac cccgcagcgc gatgagctcg 3529921 gcagcggccc ccaggatgcg gagtgccgtt gggctccagc ggccggcctg cctcggcatg 3529981 ccggcaaggc tagctggcac ctgggtggtc gccaaccagc gccatggcga ggttccggta 3530041 gaacgcgagc atgccgggcc attctttcga gctaaggtga ccccgttcgg cgaatcgcga 3530101 gcccgcccca acctgcacgg cctccagtcc gagccggtcc tcgtcattga tcatcgccat 3530161 cacgaactgc gacgtctgag ctgttgctgc cgcatcggcg gctaactcgg gggtggtgag 3530221 cacgccgccg agcacctgca cccggtcgat gctttgcgga ataaagccga accacaccac 3530281 ccgctcgccc gctatggcca gcgcgctgtt cggaaacgtc cacaacacga ccagattact 3530341 tttctgaacc tcgttgagct gcaacgactt cgcttctact ggaacggtga agggaaccct 3530401 gaggcgcaac gcccaccgcg aatactgccg aacgtccaga tcgcccccac caggaacgaa 3530461 cggctccagg gtttggcgat gcaggccgag cacgtggtag ttctcatgac cattttccgc 3530521 cgccaccttc caattagctc gccactcatg cgaccacgac tcgacctgca ccatctcacc 3530581 gagccgatag ccggcgaatt cgtcgtcagt caggtccaga tgcgccgcga ttggttcggc 3530641 atcggcatcc aggttgatcc acaccaatcc attccaggtg gccacggcga actgcggaag 3530701 ccggcactcc ctacggttga agtctaagtt ggcggccata tggggcgctc cgcgcaaccg 3530761 gccatccagc ccatagcgcc acaggtggta ttggcaggtc aacgtgtcga tgcgccccgc 3530821 accgggttcc accatcagca tcaaccggtg ccggcagatc ggcgaaagag cgtgcagctg 3530881 cccgtcgacg tcccgcacca ccatgaccgg ctcccctgcg acggacacgg tgacgtagtc 3530941 accggtcttg gcgacttggt cgacatgcgc gacaagcatc caggaccggt tgaagatccg 3531001 ttcccgctcc agctgccaca gctccgatga ggtgtaggcg gccggcggca ggcttagcgc 3531061 cggtggattg tcgtcgaggt aatccccgat gtcggtaagg atgtctccga gctcggctcg 3531121 gttatcagtt gataacatac cctccatgtt atcgactgat aaccgattgt caacagcgcg 3531181 caccggcccg accggccagc cggcggttca cctcgagaac ggacgggtgg ccagcacgta 3531241 ggtagccaac acggccaacg gtgccgccaa cggcagccat ggcacttgca gcgggaacga 3531301 cgtcgcagcc aacccagcga acgtgaaacc aacggcggca acggtcgtcg gccagctccc 3531361 ggcgacaaca ccggccccgt atcggcacac caggtagacg gcggcgcaaa accccgacaa 3531421 tgcggcaagc acatgcgtcg ggccggacac cacgatcatc accaccgaca acaccacggc 3531481 aagcgttgcc gccaggcgaa acaccgccgc cacccctacc gcaatcaccg cggcaagccc 3531541 cacgacaaca gccagcccgt gcgatcccac agcggccgac cccaccatca tcagtccgaa 3531601 caccgtggag agcccacgag tacccgggtg cgcaaacgag gtcatggcag cctcgcccgg 3531661 ctagctctgc cccgtccgcg acgacggcga ttgggcaacg cacccatcga ctgctgaagc 3531721 gagtgatccg ccggccagga cagcacgtcg accccgatgg tggccatgtc gcgatacatc 3531781 gcggagcgct gcagcgccca catccggacc accaggggat ccagttggtc ctggagcgga 3531841 cagctatcaa gaacgtcgac agcaaccacg acgtggccgc gtttacgcag gtcgatcaac 3531901 gccagcgcga actcggtatc cagcagcgtg gaaaacgcaa tgacaaccgc tcctgcggga 3531961 acagctgcgc gcggagccag cgtcccggtg gtgttttcga acccttcccc ggcgccgagc 3532021 acggtgtcga gcacccgata gaactggcgc tgcccgatgt cggcgcccag ccatcgcggc 3532081 cgattgccgc ccagcgcaac gatcccagca cggtcaccgt ttcgcagcgc ggtttgcacc 3532141 acctgagcag caccccgcac gactcgttcg gtggcctcgg tcgccggacc cgccggctgt 3532201 cgatacatgt cgatcaacac caccacgtca gcggcccggt cggtcaaccg ccttgtcacg 3532261 tgcagtcggc cacggcgcgc gcttaccacc cagttcacgg cacgtagctg gtcgcccggg 3532321 acatatgggc gaatgtcggc gtattcgaca cccggcccga cgtgccgggt gagatgagct 3532381 cccaggcggt cgagcaattc ggtctgcggc agtggcgtcg actgcggcgg tgtcagcgga 3532441 aacacgacga tttcggcggc gtcgacggtt ccggctccca tcaacaaccc accgcgtgcg 3532501 acgacggcga cccgggcccg gataggatag cgcccccagc gttgcgccac cgcggaaacc 3532561 gttgtcgtcc ggcgtgacac ggattccaga gcttcgaact gcattcccgc caacgccgat 3532621 accgtgagtt cgaccgcggc gtccacggat tccgttgtga cccacacggt cactcgcaca 3532681 tgttcgttct cgaaacatcg ctgcgaatcc gggtcaccgt gcacctggat caccgggacc 3532741 ggacgctgcc agctgatcga gcacaacacg ccgagcagcg gcgccgcgaa cgcaatcagc 3532801 tgccaacgac cagcgacgac cgctgcggct agcgcaactc cggcacaggt ggcaatcgcc 3532861 agcgtcagtt gtgatgcacg ccagcgcaac tcgacttcac acgtttggat cacatcgcgc 3532921 cgtagttcat ccagccaacc cgctacgttc cactaattcg gggaacaggc agacgccgca 3532981 acagctctga gaccacatca gcgcccgcaa tcttgcgcac ccacatctcc gggcgcaatg 3533041 tgatccgatg cgcgacggcc gcggtcgcaa gttccttgac atcttcgggt atgacgtagt 3533101 cccggccgag caacagagcg cgggcacggg agagctggac caggtcgagt tcggctcgcg 3533161 ggctggcgcc gacggccacc tgcggatggt gccgggtagc gttggccaac gacaccacat 3533221 agtgcaagac gtcctcgtgc acggtgacct gctcgaccga ttcacgcatg gccaacagat 3533281 cgtggcagtc caccacctga ttcaccgtcg gatccgcaga accgcgttcc aggcgacggc 3533341 gcagcatcga ggtctcgtct cgctcggaga ggtagcgcag ttccaaccgg atcgcgaacc 3533401 gatccagttg cgcctccggc agtggatatg tgccctcgta ttcgatcgga ttgtcggtcg 3533461 ccagaacgat gaatggcatt gccagtttat gggtttggcc atcgatgctc acctggccct 3533521 cggccattgc ctccaacagt gccgcttgcg tcttcggcgg cgtccggttg atctcgtcgg 3533581 cgagcaacag gttggtgaaa ataggcccgg cccggaattc gaaacgaccg gactgcatgt 3533641 catagatggt cgagccgagc agatcggccg gcagcaaatc aggcgtgaat tgcactcggg 3533701 tgaaatcgag ccccaacgcg gcggcgaagg atcgcgcgat cagcgtcttg ccgaggccgg 3533761 ggagatcttc gatgagcacg tggccacggg cgagcacggc ggtgaggatg agtgtcagtg 3533821 cagagcgctt ccccaccacc acacgttcga tttcgtcgag caccgcctcg cagtgggcgg 3533881 tggtcgtcgc ggccggcata atcatcgttg agtcatacct gttctaactt ctgcagaatt 3533941 tcttccagtg ccgcacggcc ggggcctggt tgacggtcgc cggtgtgcgt cacattgttc 3534001 gggttgaccc attcccacaa ttcgtcgccg aaaagcattc ggccggtggc agcaaaggca 3534061 accgggtctt tggcctgtct atggccggtg gcgatttcga accgtcgtgc gagcatcgga 3534121 cgcaaatgcc ggtcccagtc ggctcgagtg gactccgacc accggatcgt cgtctcggtg 3534181 ttggagagcc accggcgcaa cccctccccc agatcgtcgg agtccggcgc agccgtgagt 3534241 tcgtcccggt tgcccagcat ccggcggacg ttgagcagca ccagagccag ggcgagcccc 3534301 gacccggcga gcacgagccg acggtcgtgc agtatcagcg ccagcagctc aatccccacg 3534361 atgaggaaaa tccccagggc gataagcctt ttcatatagc ggtccgagtg ctcagttcgt 3534421 caagaaccag tcgaagcaaa cgcatcgcca cctcacggtg ctcctcgttc atcacgtgcg 3534481 ggctaaaacg cgcctcggcg aacaggctca ccaacgcggc ggcactagca ccatggagcg 3534541 cacggtgttc gacggctcgg gccagcacct cggtcggggt gtcgaagtcc tgaggggcaa 3534601 caccgggaac atgcgacagt tcacgctcca tcgccacgta acacgcaatt atcgcctccc 3534661 gtggttcgcg gcggaggtcg gccatctcgg ccagtccgat ctcggcggca cgcgccagtg 3534721 attccgaacg cgccgagggc gccggagact cgatgcgatc gccactgata cgagccggtg 3534781 ccgacttgcg ctgtcgtcgc gaggtaatca gcgaccccgc gacgaccatc aagaacaggc 3534841 cgattgtgct ggcaaagaga atgccgagca cgtcgtcatt gttgtcttgc ggcggttgcg 3534901 ggcgcgacgg cgtggtgctg gaagcatccg gcgtagcggt tgaatccggt atgggcgcag 3534961 caggaccgac atcatcgggc acgaacaacc gtgccagcag tatcgcaatc agcagccagg 3535021 ccaggattgt cccgagtccg agcaacagca cacgccagtt cggacgccct gctgcaccgc 3535081 caagcattgc cgagagctcc cccgcgctgg gcgccaccgg gagcggatgt cgcaaccggg 3535141 tgatgatggc gagcgctatc agcgcgagcg tcgcggcaag tgcggcgaca atgaacatca 3535201 gcgccgcccg gctgccgccg gccgccgcga gcggtgcacc gtcgtcggcc ggcaggtggc 3535261 cgcgcagggc agcgccagca agcatcaaga gcacgatcac gacgacgacg cgccctgtcg 3535321 gtttgtcact accgggctta gtaccgggca tacgcacacc actcgaccgg ttgcctgccg 3535381 ccgttgcggc ctgggggttg gttcaacctg gcttggttca tactggcacg tcagacgaca 3535441 ctgccgccag gagcggcgcg gtggacccct cgcacgacga tcgcggtggt ttggtccacc 3535501 cacgcgtcgt ccagcatgtc gtccgggtac agcagcatcc gcagcatggt ggcgcccccg 3535561 atcagctcga tcaaccggtc cgggtccacg tcgggatgcg cctcgccgcg gtcgacggcc 3535621 tcgcgcaggc gcatgcgcac cgcggcgaat aagtcggcaa aacgcgccag cacccgggcg 3535681 ttgagttcag cgtctgcggt catatcggct accagaccgg gtaacgcggc ccgcaccacc 3535741 ggggtggtga acacatcgcg ggtggccgcg atcatcattc ggatgtcggc ggcgatatca 3535801 ccggccgcag cctgcagcgc ggtgggcgcg gcgggaaacg cggcctcgtg cactagttcg 3535861 gccttgctcg accaccgccg gtacaacgcc gatttggtgg tgccggcgcg ttcggcgacc 3535921 gcggccaagc tgaggttcga atacccgatc tgcacaagca gttccgccgt cgccgacagg 3535981 atcgccgagt cgatgcgcgg atcacgcggc cgcccggcgc cgggggcctt gtcaagggag 3536041 ggcaggtctg ctttcataac gctacctaaa gtagcgtaat tgccgcacca gggaggcgct 3536101 tgtggccaac gaaccggcaa tcggagccat cgaccgactc cagcgctcga gccgcgacgt 3536161 gaccaccctg ccggcggtga tatcgcgctg gctgtcgagc gtgttgcccg gtggggcggc 3536221 acccgaggtg accgtggaaa gtggcgtgga ctccaccggc atgtcgtcgg aaaccatcat 3536281 cttgaccgcg cggtggcaac aagacgggcg atcgatccag cagaagctgg tggcgcgggt 3536341 ggcgccggcc gccgaggacg tgccggtgtt cccgacgtat cggcttgacc accaattcga 3536401 agtgatccgg ctggtcggag agctgaccga cgttcccgtc ccgcgggtgc gctggatcga 3536461 gaccaccggc gacgtgctgg gaactccgtt ctttctgatg gactacgtcg agggcgtggt 3536521 gccgcccgac gtcatgccgt acacgttcgg tgacaactgg ttcgccgacg cgcccgccga 3536581 gcgccagcgc caactgcagg acgccaccgt cgcagcgttg gccacactac attcaatccc 3536641 taacgcccag aacacgttta gcttcctcac ccagggccgc accagcgata ccacgctgca 3536701 ccggcacttc aactgggtac ggtcctggta cgacttcgcg gtggaaggca tcggtcgatc 3536761 cccactactg gaacggactt tcgagtggct gcaaagccac tggccggacg acgctgccgc 3536821 gcgcgagccg gtgttgctgt ggggggacgc gcgggtgggc aacgtcttgt accgagactt 3536881 tcagccggtg gcggtgctgg actgggaaat ggtggcgctg ggtccacggg aactcgacgt 3536941 cgcgtggatg atatttgcgc acagggtatt tcaggagctt gccggtttgg cgacgctgcc 3537001 gggtttgccg gaggtgatgc gtgaggacga tgtgcgcgcc acctaccagg cgcttaccgg 3537061 cgtggaactt ggtgacctgc actggtttta cgtgtactcc ggggtcatgt gggcatgcgt 3537121 gttcatgcgc accggtgcgc ggcgagtgca cttcggcgag atcgagaagc ccgacgatgt 3537181 ggagtcgctg ttctatcacg ccggcttgat gaagcatctt cttggagagg agcactaatg 3537241 ccgcaaatgc taggcccact cgacgagtac ccgctacatc agcttcccca gccgatcgcc 3537301 tggccgggct cctccgaccg caacttctac gaccgctcct acttcaacgc ccacgaccgc 3537361 accgggaaca tctttctgat caccggtatc ggctactacc ctaacctggg cgtgaaagac 3537421 gcgttcgtgc tgatcaggcg tgcggacata cagaccgcgg tgcatctttc ggatgccatc 3537481 gactccgacc ggctacacca gcacgtcaac ggttaccggg tggaggtcgt cgagccgctg 3537541 cgaaaactgc gtatcgtgct cgacgaaacc gaaggtgtgg cggccgatct cacctgggag 3537601 ggcctgttcg acgtcgtcca ggaacagccg cacgtcttgc gctccggcaa ccgggtgacc 3537661 ctggatgcgc agcgcttcgc gcagctgggc acctggagcg gccgcatcgt cgtcgacggc 3537721 gaacggatcg ccgtcgatcc ggcgacctgg ctcggcagcc gggaccggtc ctggggcatc 3537781 cggccggtgg gggaaccaga accggcgggc cggcccgccg acccaccctt cgagggcatg 3537841 tggtggctgt atgtgccgtt ggccttcgac gacttcgccg tcgtgctgat catccaggaa 3537901 gaacccgacg ggttccgctc gctcaacgac tgcacccgga tctggcgtga cggccacgtc 3537961 gagcagctgg gctggccgcg ggtgcggatc cactaccgct ccggcacccg catcccgacc 3538021 ggggcgacga tcgaggcaag cacccccgac ggcgcgccgg tgcacttcga cgtggagtcc 3538081 aaactggcgg tgccgaccca tgtcggtggc ggctacgggg gtgactcgga ctggtcacat 3538141 ggcatgtgga agggcgagaa gttcgtcgag cgaagaacct acgacatgac cgatccgacg 3538201 atcatcgcgc gggccggctt cggcgtcatc gaccacgtcg gtcgcgcgct atgccgcgac 3538261 ggcgacggga atccagtgca gggctggggt ctgtttgaac acggggcgct gggccgccac 3538321 gacccatcgg ggttcgccga ctggtctacg ctggcgccct aggcgcttca ggcttacttc 3538381 ggcaccggtg aggctatccg cattcgcgag tccagggttc ctgggcgccg gccgggaaac 3538441 ggcccgaaaa cgacggcagc cggaatagcc gaccggaacc gccgaaatgc ggttgactag 3538501 agcggtgaca aacccaccgt ggactgtcga tgttgtcgtg gtgggcgcgg gcttcgccgg 3538561 gctggccgcg gcccgcgagc tgacgcgaca gggtcacgag gtgctggtgt tcgaaggccg 3538621 cgatcgggtg ggcggccgct cgttaaccgg tcgcgtggca ggggtgcccg cggatatggg 3538681 cggctcgttc atcggcccga cccaagacgc cgtgctggcg ttggccaccg agctggggat 3538741 cccgacaacc ccgacccacc gcgacggccg aaacgtcatc cagtggcggg gatcggcacg 3538801 cagctatcgt ggcaccatcc ccaagctgtc gctgaccggg ctcatcgaca tcggccggtt 3538861 gcgttggcaa ttcgagcgaa ttgcccgcgg cgttccggtg gccgccccct gggatgcgcg 3538921 gcgcgcgcgt gaactcgacg acgtgtcgct cggggagtgg ttgcgcttgg tgcgcgccac 3538981 atcgtcctcg cggaacctga tggccatcat gacccgggtg acctggggtt gtgagcccga 3539041 cgatgtctcg atgctgcacg ccgcccgcta cgtacgcgcg gccggcggcc tggaccggct 3539101 gctcgacgtc aaaaatggtg cccagcagga ccgtgtgccg ggggggacac agcagatcgc 3539161 ccaggcggcc gccgcccaac tcggcgcacg cgtcctgctc aacgccgcgg tgcgtcgcat 3539221 cgaccggcac ggagcgggtg tgacggtcac gtccgatcag ggtcaggccg aggccgggtt 3539281 cgtcatcgtc gccattccac cggcccatcg cgtggccatc gagttcgatc ccccgctgcc 3539341 gccggaatat cagcagctcg cccaccattg gccgcagggc cggctgagca aggcctacgc 3539401 ggcctattcg acgccgttct ggcgggccag cgggtattcc ggccaggcgc tgtccgatga 3539461 ggcgccggtg ttcatcacct tcgacgtcag tccgcacgcc gacgggccag gcattctgat 3539521 ggggttcgtc gatgctcgcg ggttcgactc gctacccatc gaagagcgcc gccgcgatgc 3539581 attgcgctgc tttgcgtcgc tgttcggcga cgaagcgctc gacccccttg attatgttga 3539641 ctatcgttgg ggtacagagg aattcgcgcc gggtggtccg accgcggcgg taccgccggg 3539701 gtcgtggacg aaatacggtc actggttacg tgagccggtc ggtccgattc actgggcgag 3539761 cactgagacc gcggacgaat ggaccgggta tttcgacggc gccgtcagat ccggtcagcg 3539821 tgccgccgcc gaggtcgccg ccctgctatg agctgatccg ccggtcccgg acgtgccggg 3539881 tcaccgattc ggccagcgcc cgcaggtggc tgttcacctc ttggtgccgt tccagcatcg 3539941 agcagtggcc gccgggcagt tcaacgaggc cgacgacatt gggcgcggtg cgcgcaatcc 3540001 tgcgggactg gctgatcggc gttagtcgat cacgtacgcc gccgatcacc agggttggca 3540061 ccgtcagacc atccaggttg aggtgtgccg accctacttc ctcgacgagc atcttcgcgc 3540121 agccgccgcg ccccgcggca gacgtctggg tgaacaactc atagaccagt ctcgtggcgc 3540181 tggggtccgc gtcggcggcg accgccagcg tggagatcac gtgccggctt aaggccctgg 3540241 ccgcgccggg gagtggaaac ccgccgaacg tgttgaccag gctccggccg gccagcaccc 3540301 gaaccgggga caactcgcgt ggcaccgaca gcagtttcac cttgcgcacc aggtcgccgg 3540361 tggtggtgtt gatcagcgcg acggcgtccg tgcggcggcg gactttgtgg cggtagcggt 3540421 ccgaccaggc ggcaatggta atgccgccca tcgagtgccc agcgaccacc gcacgctcgc 3540481 gcggggccaa cgtagcgtcc aacaccgaat cgaggtcggc cgcaaggtga ttgaggctgt 3540541 aggcgccacg ccgtgggaca ccgcttcgac cgtggccgcg atggtcgaag gcgatcaccc 3540601 ggtagtcgcc ggccaggtcg gcgatttggt atgcccaggc ccggatggcg cagacgaaac 3540661 cgtgcgtcag cacaatcgga tagccgtgag gcggcccgaa cacctgggtg tgtaacgggg 3540721 tgccgtccgc cgcacggacg gtcaaggtgc ggctaggcgg taggacgtct ggaatctggg 3540781 tagccccgct gcttcgagtg ggtctccgag cactcatcgc cgctccccct tcgacgcggc 3540841 cccgttgccg ccttccggat gtcgcccact ctagcgtgca gttacttacg ggtagctgga 3540901 aatcgctgaa gcataggatc acagaataat aacgtcgcgg cccctgctct cagctggttt 3540961 cgcatcgcca gccgatcagt agtcgtctca gtaatcgtcg agggcggcca cgttgcgcca 3541021 actcggccac gtcgtctccc agatccggtg aattcggccg ttgcggtagg cggcaatgag 3541081 taccacctcg atgcgggtcg gctcctcgcc aggtcgcgac gtggtgatcc acacccgccc 3541141 ggcaaccttg tctgggcctc tacccatgcg tgctcgtcgt attcgaccgc gtagctgatc 3541201 gccgtggcgt agagcttgcg gtggctatcg cggaattttg cgaagctctg gctcagcccg 3541261 tcggagtaca tcaggaagtc tgggtcgtag tagtgctcga tcagctccgc gtttttggcg 3541321 acgaccatcc gatcgaacat ttcccgaagc agcgcaacgg acattcggcg atcctaaacc 3541381 ctggccgccg gccatctcac aacgtgagcg tggacgaatc cccatccatt gcgatgacga 3541441 gttcagaccg gacgggccgt tgcctgatca atcaggacct ccgctgccgc tcgggcgtgc 3541501 gcccaggggc cggcatcgtc gagggaggtg gacagtgcgg ccgcgccctc gaacagcacg 3541561 gcgagttgat tgcccaggct gcgcggatgc gctgcgccgg cttctcgggc cagccgggcg 3541621 aggcctttga tgtagtcgcg tttgtgcgag tggacgatcc gctcgactcc gggcatctcc 3541681 ccggccgcct cgaccgccgc gttgtggaat ggacaacctc gcatccgccc atcgcccctg 3541741 tttggacgat cgaacaatgc gagcagccgc tcgcgtggtg tcgcgttgga tgccttgggc 3541801 atcttgtcgg cctcgccggc ggcttgccgg agcccgcgca ggtactcctc caccaacgcg 3541861 gacttactcg gaaagtgttg gtagagagtc cgcttggata ccgaagcctt gttcgcaatc 3541921 agttcgaccc cggtggcgtt gatgccctcg cagtagaaca gctctgcagc cgccttcaag 3541981 atacgctgac gagcgccgcg gcccccgcgc ctggggggtt ccgttgttct ggtgaccggc 3542041 ggcatagtgc tgagtatacc gacctgttta caacacccct tagcgcgtgt accgtcaaag 3542101 cacaaagtac accaatcggt ttactgtagg aggtctcatg acttcactag ccgagcggac 3542161 cgtgctcgtc accggcgcca accgcggcat gggccgcgaa tacgtcgctc agcttctcgg 3542221 tcgcaaagtg gcaaaggtct atgccgctac ccgcaacccg ctggcaatcg acgttagcga 3542281 tccgcgcgtg attccgctcc aactcgacgt caccgacgcg gtgtcggtcg ccgaggcagc 3542341 cgacttagca accgatgtcg gcattctgat caacaatgcc ggcatctccc gggcgtcctc 3542401 ggtgctcgac aaggacacat ccgcgcttcg cggcgagctg gagacgaacc tgttcggacc 3542461 gctcgcgctg gcctccgcgt tcgccgaccg catcgccgag agatccggtg ccatcgtcaa 3542521 cgtttcctcg gtactcgcct ggcttcccct tggcatgagc tatggagtgt ccaaggcggc 3542581 gatgtggagc gcgacggagt cgatgcgtat cgagctggcg ccgcgcggtg tgcaggtggt 3542641 gggcgtctac gtggggctgg tcgacaccga catgggtcga ttcgccgacg cgccgaagtc 3542701 cgatcctgcc gatgtggtcc gccaggtgct cgacggaata gaggctggca aggaggacgt 3542761 gctggccgac gagatgagcc gtcaggtgcg cgcgtcgctg aatgtccctg cgcgggaacg 3542821 tatcgcgcgg ttgatgggta actgagtccg aaagtcgata tggccatgtc cgccaaggcc 3542881 tcagacgata ttgcctggct accggcgacc gctcaactcg cggtgctcgc cgccaagaag 3542941 gtgtccagcg cggagttagt cgagctgtat ctttcccgaa tcgacacgta caacgcgtcg 3543001 ctcaacgcga tcgtcaccgt tgaccccgac gccgcccgac gcgtcgccaa gcggtccgat 3543061 gcggcacgag cccgcggcga cgaactcggc ccgttgcatg ggttgccgat caccgtcaag 3543121 gacagctatg agacggccgg catgcgcacg acctgcggtc gccgcgacct tgccgactat 3543181 gtacccaccc aggacgccga ggcggtcgcc cggttgcgcc gggccggcgc gatcatcatg 3543241 ggcaagacaa acatgcccac cggcaaccag gacgtccagg ccagcaatcc ggtcttcggc 3543301 cgcaccaaca acccatggga cgccgcgcgc acgtccggcg gctcggccgg cggcggggcg 3543361 gccgccaccg cggccgggct gaccagcttc gactacggct cggagatcgg cggctctacc 3543421 aggatcccgg ctcattactg cggtctgtac ggccacaaat cgacctggcg ctcggttcct 3543481 ctggtcgggc acattcccag cgcaccaggt aatcccgggc gatgggggca agccgacatg 3543541 gcctgcgcgg gcgtgcaggt gcgcggtgcc cgcgacatca tccccgcact ggaggcgacc 3543601 gtcgggccga tgcgggcgga cggaggattc tcgtatgcgc tcgctccgcc acgagccggc 3543661 gcgctcaaag acttccgggt cgcggtctgg gccgaggacc cgcattgccc aattgacgcc 3543721 gacgtgcgtc gggccatgga tgatgctgtc gccgcgctgc gcgccgcggg cgcacacgtc 3543781 gttgagcagc ccgccaccat cccggtcgat atggcggtgt cgcacaacat cttccagagt 3543841 ctggtgttcg gcgccttcgc tgtcgaccgg tccaccctca gcccagcctc cgccgccgcg 3543901 ctcggattac gcgcggttcg gcatcctcgg ggcgaagccg ccaacgccct gggtgcgacg 3543961 ctacagagcc accgtgcgtg gttgttcgcc gatgcggcgc gccacgaaat gcgcgaccgg 3544021 tgggccggat tcttcaacga gttcgacgtg ctgctcctgc ccgtcacgcc cacccccgcg 3544081 ccgctccacc acaacaagga ccacgaccgg ttgggccgca ccatcgacgt cgacggcgtc 3544141 tcacgatcgt actgggacca actcaaatgg aacgcgctgg ccaacatcgc cggcaccccg 3544201 gccaccacca tgcccatcac caccacagct accggactcc cgatcggcat ccaggcgatg 3544261 gggcccgcgg gcggagaccg caccaccgta gagttcgccg ccctgctcac cgaagtccta 3544321 ggcggcttcc gcgttccccc tctttaggaa cgctcgggca gggccgcaat aacctcggcg 3544381 agccgatcgg gctgctccgc tgtcgtcagg tggccgcccg caagctcggt gatttccacc 3544441 gaatccgcaa gccgctctcg ggcgagccgc agttgctctc cctcgaatgg atcctcggcg 3544501 ctgcccacca caccaaaggc gacctcatcg cccagcgccg aaatgatccg cgccaggtcc 3544561 cagcgcgctg cgtgctcgcg atgctcgtcc acgaagcccg ccgtggcggg cagcacgcgc 3544621 acgccgtcgc gccggctgat cgcgtcgtgg agctccttca tctccgctgc gcttaatggg 3544681 tatccgcgcg agaagacggg gcgcaagaat ggggcgaaca tgcgccatga gcgctggccg 3544741 atcggcgtga tcgccgcgcc gagcggcgat gtgagcagcg gcgtcgtata ccaggcgtgg 3544801 gtgtggccgt cggcaaagat gccgccgttg gcgagcaggc aagccgtgat tcgggtccgc 3544861 tgatcgtttc ccgcccgctc gcgatcgatc cgccgcgcca gcagctcaag gctgacgatg 3544921 caggagtagt cgaaggcaac gacgacggtc tgcgctatcc cctcggcgtg ccagagggct 3544981 tcgacgagat ccgcgcgctc gaaggtcgag tacgggtaat cccggggttt gtcggagtcg 3545041 ccgtggccga tgtagtccag gtagatgcgg gggaagtgga atcgcgagct caagaaagct 3545101 tccaccttcg cccaaccgta ggaaccatcc ggccagccag gcaggaacgt tcgcgtgacc 3545161 cccgtcccag cagcgcgccg tatgaacgcg cgcagcggcg aacgtgggtt gatgcccggc 3545221 cgctcagcgt cgtagcccac cctctcccca gcggagaacc actcctgtgc gctgatgagc 3545281 gcgctcgccc ggtgcgtcat cgcgcgctcg ctagccgttg gcggaggttg tcgaggtcca 3545341 tgtcggtgca tctccgcaac caaagtacac cgataagttt acgtgtcgca ttaaccgatg 3545401 tacagtgtcg gttataagta caccgatcag tatacaagga gtcggcgtgc cccagagaca 3545461 ggccggcgac atcggcgcga cataccagga cgcgcccacg aagagcatca atgtgggcgg 3545521 aacgcgtttt gtctaccggc ggctcggtgc tgatgccggc gtgccggtga tctttctgca 3545581 ccacttgggc gcggtcttag acaactggga tccacgggtc gtcgacggca tcgccgccaa 3545641 gcatccagtg gtcactttcg acaaccgcgg tgtcggcgct tcggaaggcc agacgccgga 3545701 caccgtgacc accatggccg acgatgcgat cgcctttgtc cgtgccctgg ggttcgatca 3545761 ggttgatctc cttggattct cgttgggcgg cttcgtcgcg caggtgatcg cgcagcaaga 3545821 accgcagctc gttcgcaaga tcatcctcgc gggtaccgga ccggccggtg gtgtcggcat 3545881 cggcaaggtt actttcggga cgatccgcga gagcatcaag gccacactga ctttcaggga 3545941 tcccaaggag ttgcggttct tcacgcgaac cgacagcggc aaatcggcgg cgcgacagtt 3546001 cgtgaagcgg ctcaaggaac ggaaggacaa tcgcgacaaa tcgattacag tgcgcgcgtt 3546061 ccgctcccag ctcaaggcca tccatgcatg gggcacgcaa aagccttcgg acttgacgag 3546121 catcggccat ccggtcctga tcgcaaacgg tgacgacgac acgatggtgc ccaccagcaa 3546181 ctcgttggac ctcgctgacc ggctgcccga cgccacgctg cgcatctatc ccgacgccgg 3546241 ccacggcggg atattccagc accacgcaca gtttgtggac gatgccctgc agtttctcga 3546301 gtcgtgaagc gatttcgcat gaccaccaaa gccacgccca gaccagttgg attcgccgct 3546361 cctccccacc gtttcgcggt atcggcagag cgcacccatg gatctatcac cgcaccggcg 3546421 gacgagtcgg ctgcaagttg cgactcggcg ccggattccg caaaccggtg ccgacactgc 3546481 tactcgaaca ccggagccgc aagtccggca agaacttcgt cgcaccactg ctttacatca 3546541 ccgaccgtaa caatgtcatc gtcgttgcct ctgcccttgg gcaggcagaa aacccgcagt 3546601 ggtatcgcaa cctgccgccc aatcccgaca cccacattca gatcggatcc gatcgccgcc 3546661 cggtgagagc cgtcgtggcc agctcggacg agcgggcgcg cctatggccg cgcccagtag 3546721 acgcctacgc cgacttcgat tcttgccaaa gctggaccga gcgtgggatt ccggtgatca 3546781 tcttgcggcc acgctaatag gcgtcggcct gctccgcgtg gtcgagcgat cccggtgcgg 3546841 ttacccgcta cggggtgctt tcggcaccgc gatcggctag gccaccgagg gagcagacat 3546901 cgaatacagc ggccgaatca agtcgctgga cccggcaact cccacgggtg tcgtcaccgt 3546961 cgccgcgatg actggcggcc ggaagacctt tggccaggcg acgttgaacg tccgcttccg 3547021 ctgacccggc ggcctggtga cggcggccga ggacaaagaa gagcggcttc ggctgtccgg 3547081 aacccggatc gaactcgagg agctacttca gcttccggtc gatgttgcgt acgagggcct 3547141 gttgacggac gacgtttccg aatccgttcg caaaaagctc attacgctac gagccggtcc 3547201 ctcaagaacc gcctgctcga atctgcgcaa ccccgctggc gttggggcgg acgacggtgc 3547261 tcggcgtgat gtggtgcacc aaagggacat tgccgacgga actggcgttg agccagcaac 3547321 acaccgttga tcgcatgagt gatgtccacc caaccgcggt caccgacaac ggggatccag 3547381 tcgggatcat cgctggcata aggatatcgg cctgcaccgg cattgtgtgc tcacggccat 3547441 cgctgcctgg gaccaatcac cagcccctgg aaggtcgact acagccacaa gcccgacgat 3547501 ggtcgacaga tcaagatacg tctttcgaca aaacaagatc caatggtcga caaaacagga 3547561 caaactattc gacaaatcgg gatcagatgt acgacaaaac aggagtactt tgacgttgtg 3547621 gtgcatgatg aggctggtca cgagctgatc gagcggcaca tgctcgaaca gttgcgcgag 3547681 gttgcggagt acacccgtgt cgtgctgatc aatggtccac ggcaggctgg taagacgacg 3547741 ctgctccaac aattgcacgc cgagctaggc ggatggctgc gttcgttgga tgttgacgtc 3547801 gaacgcgcgt cggcgcgagc cgatcccgag gggtacatca tgtccgcgcc gcgcccgacg 3547861 ttcttggacg aggtccagtg cgccggggat ccgttgatcc tggcgatcaa gacggcaacc 3547921 gatcgtgacc gccggcccag acagttcttc ctgtcggggt cgacccgatt cctgacggtg 3547981 ccgacgctgt cggaatcact ggccggacgg gttgcgatcc tcgacctctg gccgctgtct 3548041 gtcgctgaac gatcgggtgt ccggccggag atcattgcgc aactgttcac tgaaccccaa 3548101 gtggtcctgg gcacggagcc cgccccggtc acgcgacatg agtatctgca gctggcctgc 3548161 gcgggtggct ttccggaagt tgtgcagcgc ccggcgggtc gcgcccgcag ccggtggttc 3548221 tcggactatc tgcgcacggt gacgcagcgc gacgtgcgcg agctgaagcg gatcgagcag 3548281 acggatcgcc tgccgcggtt catgcgctac ctggccgcta tcaccgcgca ggagctgaac 3548341 gtggccgaag cggcgcgggt catcggggtc gacgcgggga cgatccgttc ggatctggcg 3548401 ttgttcgaga cggtctatct ggtacatcgc ctgcccgcct ggtcgcggaa tctgaccgcg 3548461 aagatcaaga agcggtcaaa gatccacgtc gtcgacagtg gcttcgcggc ctggttgcgc 3548521 gggcaaagcg ccgactccct ggccaggcca accgcggagg gcgcgggccc gatcatggaa 3548581 acgttcgtga tcaacgagct gatgaagcta cgtgcggcga ccgaactcga ggttgacctg 3548641 tatcactttc gcgatcgaga cggacgggag atcgactgca ttcttcagac cccagacagt 3548701 cgcgtcgtcg gtgtcgaggt caaagcctcg gcgacagtga acgtccatga tttccgacac 3548761 ttgtcattcg cgcgtgaccg actcggcgac gaattcatca ccggagttct cttctacact 3548821 ggtgcccggg ctttgccgtt cggcgaccgg ttgatggctc tacccatcaa tctcctctgg 3548881 aacggacaat ccgtctccag cctgtaggcg cataccgatc gccatatttc aagagcaggt 3548941 tggagcttct gcccccaatc atcgtgcggc aacgatgggc ggctctagcg ctagtcgacg 3549001 cgctattcaa ccagctcaca ccgagctccc gcgcggccac atacccgcga ccgtgtgatg 3549061 caagcacccc accagctccg cgcatcacgc aacgaaccgg tcaaatcgta ggcttccaaa 3549121 atctccatga tctcctcggc agacttcacg tcaccccttt tcgggagctg aacaaccgac 3549181 gcggagccgt cggccgcgga tgccctgggg cggcggtccc caaacccgat atggctaacg 3549241 tcaagcggtc ggatcacggg tcgagttggg cgggggcgac tcggcacccg gcggcatggg 3549301 ctccggtgtg caggcgtcgg tcccaaacgg cgactaccag gccggggtcg ccgactgcca 3549361 atgcgctggc cagatgaacg gcgtcggctc cgcgtaaggc atgtgctcgg gcgaggtggc 3549421 cggcgtgctg ttcaaccgtc gcggtgagtt cgactgggcg ggtggcggcc cagaagtcct 3549481 cccagtcacg ctcggcgtcg gcgagctcgg attcggttag gtcgtgattg cgggccgctg 3549541 cagcgagtgc ggcgcggact tcggggtagg ccaggcggct ggacaatgcg gcgtcgcagc 3549601 cgtcccatag agcggacgcc agcgagctcc ctgtctcggt ggtgagaagt ttgacgaagg 3549661 cgctggcgtc gaagtagacg agcggcacgg tcagcgccgc tggtcgctga cccggtcaga 3549721 caccggccgc tgcggtcggg gcctgggccg tcccgcggct acgggccgct gcgcggtcgc 3549781 cttgccaatc acgccttcgg ccgtgagacg ctccaaggtg tctgtgctgt ccagcgcagc 3549841 gagtcgtgcg atcggaatcc cacgttcggt gatgacgacc tcgccaccgg cccgagctcg 3549901 atcgagccaa tcgctgaggt gcgcgcgcaa ctcggtcacg gatacatcca cactttgaac 3549961 tgtacactca ctgaaccgtg atttgtacat atcactctgc gtgcggcaac gacgacgtga 3550021 gagattgacc tgcgcaagcc ggaggcgagg tggcaacggc cggtacaccg attcgtccgc 3550081 ggtgctggcg acgccgaaac ggtcgatgtc gtggtgactg gtcaccttcc gtccaagctg 3550141 catccgaagg tgttgcaacg gaaggtgttt gccgtccgcg ctgggccttc ggcgcagctg 3550201 gcatttgtgg tcagctgcat ggcgacggca gcgcctcggt ggtgaacgcc gggtttagct 3550261 tgcagcggcc gagcaggctg cctcgttcct gctcggtgac agttggcccg acgatgaccg 3550321 cgcaccgccg ccaccacgag atataaccta gaggttatac tggtgcggaa gcgttggccg 3550381 tgatcctgct cccgcaggtc gaacggtggt tcttcgcgct caacagggat gcgatggcct 3550441 cggtcaccgg cgccatcgac ctgctcgaaa tggaggggcc gacgttgggc cgcccggtgg 3550501 tcgacaaagt gaacgactca acgtttcaca acatgaagga gctgcgcccc gccggcacca 3550561 gcatccggat cctgttcgcc ttcgacccgg cccggcaggc gatcctgctg ctgggcggtg 3550621 acaaggcagg caactggaaa cgctggtacg acaacaacat tccaatcgct gaccagcgct 3550681 ccgagaactg gctggcgagc gagcacggag gtggatgacc atggcccgca actggcgtga 3550741 cattcgcgcc gatgccgtcg cgcagggccg cgtggatctg cagcgggccg ccgtggcacg 3550801 cgaggagatg cgcgatgccg tcctggcgca ccgcctggcc gagatccgca aggcgctagg 3550861 ccacgcacgt caggccgacg tcgcggcgct gatgggggtc tctcaggccc gtgtctccaa 3550921 gctggagagc ggcgacctgt cccacaccga actcggcacc ctgcaggcct acgttgccgc 3550981 cctgggcggg cacctgcgca tcgtcgctga gttcggcgaa aatactgtcg agctgaccgc 3551041 ctgagctaac tcacgcccac acttccggcc ggtctcgatc tcccaagccc cagcacagct 3551101 cgtgttccca atctgttccc aaccagatcc ttagctatgc gcatgttccc aaaagtgttc 3551161 ccgcccatga aaacggcccc cggagtctcc tccgagggcc atttcgccgg tagcggggac 3551221 aggattcgat gaaccgcccc ggcatgtccg gagactccag ttcttggaaa ggatggggtc 3551281 atgtcaggtg gttcatcgag gaggtacccg ccggagctgc gtgagcgggc ggtgcggatg 3551341 gtcgcagaga tccgcggtca gcacgattcg gagtgggcag cgatcagtga ggtcgcccgt 3551401 ctacttggtg ttggctgcgc ggagacggtg cgtaagtggg tgcgccaggc gcaggtcgat 3551461 gccggcgcac ggcccgggac cacgaccgaa gaatccgctg agctgaagcg cttgcggcgg 3551521 gacaacgccg aattgcgaag ggcgaacgcg attttaaaga ccgcgtcggc tttcttcgcg 3551581 gccgagctcg accggccagc acgctaatta cccggttcat cgccgatcat cagggccacc 3551641 gcgagggccc cgatggtttg cggtggggtg tcgagtcgat ctgcacacag ctgaccgagc 3551701 tgggtgtgcc gatcgcccca tcgacctact acgaccacat caaccgggag cccagccgcc 3551761 gcgagctgcg cgatggcgaa ctcaaggagc acatcagccg cgtccacgcc gccaactacg 3551821 gtgtttacgg tgcccgcaaa gtgtggctaa ccctgaaccg tgagggcatc gaggtggcca 3551881 gatgcaccgt cgaacggctg atgaccaaac tcggcctgtc cgggaccacc cgcggcaaag 3551941 cccgcaggac cacgatcgct gatccggcca cagcccgtcc cgccgatctc gtccagcgcc 3552001 gcttcggacc accagcacct aaccggctgt gggtagcaga cctcacctat gtgtcgacct 3552061 gggcagggtt cgcctacgtg gcctttgtca ccgacgccta cgctcgcagg atcctgggct 3552121 ggcgggtcgc ttccacgatg gccacctcca tggtcctcga cgcgatcgag caagccatct 3552181 ggacccgcca acaagaaggc gtactcgacc tgaaagacgt tatccaccat acggataggg 3552241 gatctcagta cacatcgatc cggttcagcg agcggctcgc cgaggcaggc atccaaccgt 3552301 cggtcggagc ggtcggaagc tcctatgaca atgcactagc cgagacgatc aacggcctat 3552361 acaagaccga gctgatcaaa cccggcaagc cctggcggtc catcgaggat gtcgagttgg 3552421 ccaccgcgcg ctgggtcgac tggttcaacc atcgccgcct ctaccagtac tgcggcgacg 3552481 tcccgccggt cgaactcgag gctgcctact acgctcaacg ccagagacca gccgccggct 3552541 gaggtctcag atcagagagt ctccggactc accggggcgg ttcacgaacc tgcgacctct 3552601 gggttatgag ctaaccagtc gcaatctctc ccatcgcggt cggtctcata cgtccagatc 3552661 agcctctatt ccgccgtcca gcctgttccg ccgcgtcgcg gttgtacgga tttgaaccgc 3552721 cccggcatgt ccggagactc cagttcttgg aaaggatggg gtcatgtcag gtggttcatc 3552781 gaggaggtac ccgccggagc tgcgtgagcg ggcggtgcgg atggtcgcag agatccgcgg 3552841 tcagcacgat tcggagtggg cagcgatcag tgaggtcgcc cgtctacttg gtgttggctg 3552901 cgcggagacg gtgcgtaagt gggtgcgcca ggcgcaggtc gatgccggcg cacggcccgg 3552961 gaccacgacc gaagaatccg ctgagctgaa gcgcttgcgg cgggacaacg ccgaattgcg 3553021 aagggcgaac gcgattttaa agaccgcgtc ggctttcttc gcggccgagc tcgaccggcc 3553081 agcacgctaa ttacccggtt catcgccgat catcagggcc accgcgaggg ccccgatggt 3553141 ttgcggtggg gtgtcgagtc gatctgcaca cagctgaccg agctgggtgt gccgatcgcc 3553201 ccatcgacct actacgacca catcaaccgg gagcccagcc gccgcgagct gcgcgatggc 3553261 gaactcaagg agcacatcag ccgcgtccac gccgccaact acggtgttta cggtgcccgc 3553321 aaagtgtggc taaccctgaa ccgtgagggc atcgaggtgg ccagatgcac cgtcgaacgg 3553381 ctgatgacca aactcggcct gtccgggacc acccgcggca aagcccgcag gaccacgatc 3553441 gctgatccgg ccacagcccg tcccgccgat ctcgtccagc gccgcttcgg accaccagca 3553501 cctaaccggc tgtgggtagc agacctcacc tatgtgtcga cctgggcagg gttcgcctac 3553561 gtggcctttg tcaccgacgc ctacgctcgc aggatcctgg gctggcgggt cgcttccacg 3553621 atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg ccaacaagaa 3553681 ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca gtacacatcg 3553741 atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg agcggtcgga 3553801 agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac cgagctgatc 3553861 aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc gcgctgggtc 3553921 gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc ggtcgaactc 3553981 gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct cagatcagag 3554041 agtctccgga ctcaccgggg cggttcaatt cgtttcggcc tgttctgttc ccaaatccgt 3554101 tcccaacaca gcaatcagca gcaatcccag gccgaaatcg gtcagactct tggtggacct 3554161 acagcacctc gcctccatgt ggtcgcggag ctagtgaggg tccatcggca gcaccactta 3554221 gggcgcctcc gttgtcatca tggtcgataa gcggtagcgt ttacggtagt agaaccggaa 3554281 gttgcggagg aaccacgatg gcggtcaccc tggaccgggc ggtcgaggcc agcgagatcg 3554341 tcgatgccct gaaacccttc ggcgtcaccc aggtcgacgt cgccgcggtc atacaggtgt 3554401 ccgatcgggc ggtacgcggg tggcggaccg gcgacatccg ccctgagcgg tacgaccggc 3554461 tggcgcagct tcgtgacctc gtcctcctgc tctcggattc gcttaccccc cgaggtgtcg 3554521 gccagtggct gcacgccaaa aaccggctcc tcgacgggca gcgcccggtt gacctgctcg 3554581 ccaaggatcg ctacgaggat gtgcgaagcg cggcggagtc atttatcgac ggcgcctacg 3554641 tgtgaagctt gccgacgcga tcgccaccgc accgcggcga acgctcaaag gcacctactg 3554701 gcaccaaggc cccacacgtc accctgtgac ctcctgcgcc gaccccgccc gaggtcctgg 3554761 ccgttaccac cgaacgggcg agccgggagt ctggtacgca tcgaacaaag agcaaggtgc 3554821 atgggcggag ttgttccgcc acttcgtcga tgacggggtc gatccattcg aggtccgtcg 3554881 ccgcgtcggt cgagtggcgg tcacactcca ggtactcgac ctcacagacg agaggactcg 3554941 atcccatcta ggtgtggacg aaacagatct tctgtccgac gactacacca ccacccaggc 3555001 catcgccgcc gcccgcgatg ccaacttcga cgccgtactg gccccggcgg cggcgctccc 3555061 cggttgtcaa acacttgccg tgttcgttca cgcactgccc aacatcgagc ccgagcgatc 3555121 cgaggtccgt caaccgcctc cgcggctcgc caacctactc ccgctgatcc gtccgcacga 3555181 acacatgccc gactccgtgc gcagattgct tgcaacgctg acacgtgcag gagccgaagc 3555241 aatccggcgc cgacgacgtt aaaggcttcg agaccggacg ggctgtaggt tcctcaactg 3555301 tgtggcggat ggtctgagca cttaacgctt cgttgaccaa agccccactt gatgcgagga 3555361 cgcgatcaga caacggaatg gcctagccgc cgtcgcggtg gctttgcgcg actggggcgg 3555421 ctcacggaat ggtcgtcgtt ggcacctctg ctgtcgggcg taatgcaaag ggaatcaatg 3555481 tcaggtgaat ctcgcgttcg ggatcaccgt cggcgtgcat ggtgaactcg tactggtctg 3555541 caccggcccg atgtgcgggg cagcgcttat gattcgggtg ctctttgatc ttggcgatgg 3555601 cgttatcgat gaccgcggtc acgtctttgt tgcggataaa gagcaagatc gcggccttgg 3555661 tgtcgcgcca cacaaggtag ccgaatagct gcttcagcac atcgtccatg gttcttgggc 3555721 ccgaccacac tttgcattcg ccaatgaaga tgttgcggtc gtcgacgcga atgagaatgt 3555781 cggtcttgcc tgcgccgttg aagagttcgc ccccggcatc gccttcaaac tgtgcgttga 3555841 ggccgacgag cagcatgtct cggatttctt ccccgtcgag cttggcggcg acagatgggg 3555901 tgcgctccaa cgcgttccgc tggttacgga gcacccgaag tgcggactgg tagtcctcat 3555961 cctgcattgc aggctccggc ttgaatgctg ccctcgcgcc cgctgggcgg tgtggccgcg 3556021 gacgcacgct tttccgactg atcggagctg cgtatgtgtc ggcgtccttc ctgcggcgta 3556081 cagggaagcc gatctcggcc tggaggtttc gggtcgctaa gagctgctca cggcgcctcg 3556141 ccaccatgcc cggtagctcg ttgcgcagtc cttggttgtg caagtcgatc tgccggcgcg 3556201 accaaccgag gtacttctca atattcgcga tctgcttatg aaacgccgcg ttgatcgccg 3556261 cggcgtcatt cgacagattg tcgatcgcca ggtggatttc gtgaccttgt agccgcagta 3556321 cctgcggcgg catggtcgtg aactggtccg ggcgaaggtt aaagatgtcc ttatgcccct 3556381 cgaagggcac cacgagaacg agcctcgtca cgcgtcgggt gcgctgttcg ccccaatccc 3556441 ggtactgctg gtcgacctcg gtggctggca gcatgaaagc gtcgtcgacg cgcagatcgg 3556501 ggcattcgac cgaacccaat tcgacgagct gttcgacgac gtcatcaacg ggcgtgttca 3556561 gcaggtcgtc ggcgtcccag ctctgaagac gctgcgccgt ggcttggctc gcctttccga 3556621 gaaatccggc taaggagcca gcgagatcgt tgaggcgccc cttggaaaac agctgaacat 3556681 actccactta cccgaagata gtgctcatcc ccgacgcggc tacggaggcg tttcggcggc 3556741 gtgccgcgat gcaatgcagc cagcggagcc accgggccgt agccgacgtc gcgtcgtggg 3556801 tggcgacggg gttctccggg gtgccggaat ccttcgacga gcttgtcggg ggtcatgatt 3556861 actgttctcg atatgaacgg attcaaggat gcgaggcccg atcgtcttcc gctttcggca 3556921 tcggtttggg atatcgccca gcgatacaac aagggcggac ctaccgtcac tgaggcgcta 3556981 tacgaggcgc tgaaggaact cgaggcccaa gtcatcgctc tgcagcgaag cgagggtaag 3557041 ggcctgctca gccgcctgag ctgaacgact agaggattgg ggaaggggcc cccggggaat 3557101 ggatcatcct actgagcggg aatgggccag catcgccgaa catacacgcg cctccaactt 3557161 caccggcgac ctgttacgaa tgccgcctta cccgctgatc ctcaccctcc gaacgctggt 3557221 ggggtctgcc gaggtggtca ctgcatcaca taccctcttc ctgtcggcgg caactgaata 3557281 ctgaccagag cgcggcaagg tgggttctag tcaacgtcgc aacaattgat ggtctggtga 3557341 ggttagcagc gcggtgaaaa gttcagcggg actgcggtgc ccgaggactt ggcggggtcg 3557401 gttattgatc tcgtattcga cagcccgcag atggtcgggc gtgtaggtgc tgaggctggt 3557461 gccctttggg aagtattgcc gtagcagacc gttggagttc tcgttgctgg ctcgctgcca 3557521 cggtgagcgg gagtcgcaaa agtagaccgg cgcgcccagg tcggcggtga tgtcgatgtg 3557581 ccgggccatt tcgatgccct gatcccacgt gatggaccgg accagcgtca ccggcaagtc 3557641 gctcatggtc tcggtgatcg cgatgcgcag gcagtaagcg tcgtgggtcg gcaggtgcag 3557701 cagccgaatc agacgtgtct gtcgctcgac gagggtgcca atcgccgagc cctggttctt 3557761 accaacgatg agatctcctt cccagtggcc aggctcggag cggtcggcgg gatcgaacgg 3557821 ccgctggtga atcgacaaca tcggctgggc gaagcgcggg cggcgacggc caggacgcag 3557881 atgggcgcgg cgatgagttc gtcccgtgcg cagagggcca cggtgtggcg acttgacctg 3557941 cggcggccgg atcaatcgtg attgaggctg atagacggcc tgatagatgc tttcgtggca 3558001 caaccacatc gaccggtcat cggggtattt ccgtcgcaga tgccgggcga tctgttgcgg 3558061 gctccaccgc tgggccagca gctcggcgat cagctcacaa aggtcggggt ttttgtcgat 3558121 ccgacgccgg tgacggcgga ctcggcgttg aaccgcccag cgatgcgctt cgaacggccg 3558181 gtactggcca tcgcggcgac tgttgcggcg tagctcccgc gacaccgtcg agggtgcccg 3558241 tccgagctgg tcggcgatct tgcggatact taggcccgag cggcgcagat cggcgatgtt 3558301 gatccgctcc tcctcggaca gatagcgact actaatttgg cgcacagcca aacgatcgag 3558361 cgcgggcacg aatccgacgg cttcgccacg ccgataggtc ttgtatcccc gcgcccaatt 3558421 gtttgctgca gtccgggata ctccaacttc acgacccgct gccgagatgg accagccccg 3558481 agcccgcagc tccataaacc gttgacgctt ggccgactgt gggcgccggc ccggaccctt 3558541 tttcacgcga cgagacgatg acaacacaac ctccagaacc tagagatgtg ttgcgacacc 3558601 gcctagaaac caccttgccg acacctgatc agttttcggt tgccgctgac acaatgaaca 3558661 tggcccgctt cacccgttca gcgtcacgtg gataagcggc ccgtagcgcg tcccagtcgg 3558721 tttcggagta gtcgggccgt tgtacagggg catccggcgc ggccggtggc ggcatcttga 3558781 tgccgccacc ggccgcgtca cggttcgcgg ttggcgctcg cctgacgacg gtgctgctcc 3558841 cgttcctgag cacgctgctt tctagccttg cggtctccct gctttcccat ctcccggtcc 3558901 tcccggcggg tcacgatagc cgcgcactcc gacatacctg gcgcggcgcg gggcgctgcg 3558961 aaccggatgg gcgccaccac cgataaccat tgcgcgttgc ggcagccttc gcattagcaa 3559021 tgctggcgcg ccgctcgacg cctcggctat cacctcacct gaccaccgcg cgcatcaccg 3559081 acgagacctc atcatcgcgc ccgctctcgc aaacaccacg cccgccaaac ggggctggcc 3559141 cgagacgatt tcagaggccc ctacagaccg atccgcacgc ccgaaacccg ggttaccgct 3559201 aagcagccca ggacagcagc cgcagtcctg atcggcgaag actgacgttc agaccgcaag 3559261 caagctaaat agcaagccaa gcaattagca agactaatgt tcccaaatcc gttcccatcg 3559321 ggcatgaaaa tgaccccaga ggtcgcacct ctggggtcat ttccgctggt agcggggaca 3559381 ggattcgaac ctgcgacctc tgggttatga gcccagcgag ctaccgagct gctccacccc 3559441 gcgtcggtaa atgccaggct accgaacacg cacgaagctc gccaaatcgc gggtgccgga 3559501 gtacgaccgc ccagatcagc ggagctcggg catacagctg cgccgtacgc gtcgatgcga 3559561 tgatgattcc gcagccgctc agccagctcg gtgacctggc gcgtcgccca ggccgcaggg 3559621 ttctctgttc cccgaaaacg gccgcaccgt cgatctcaaa cgcaactgtc gcctcgccgg 3559681 ccgcgcccgg ccttgagctg tccaccggga tcgcgttggc gttcccgcgc ggtcccttcg 3559741 tcccggcagc cgcggcgtgg gagctccagg aagctaccag cgggaagttc cagctcggtc 3559801 tgggcacgca ggttcgcaag aatgtggtgc accgatacgg tatggccttc caccgtcccg 3559861 gtccgcggct gcgctacctg ctggccgtga aggcgtgctt cgccgttttc caaaccggga 3559921 caccggatca ccacggcgag ttcgacaatc ccgacttcat cactgcccaa tggagcccgg 3559981 cgcgcattga cccccccggt cccagccccg ctgggccgcg gtgaatccgt ggatgcggcg 3560041 aggtggccga cggggtgtgg ggcgaggccg ggttcgaggg gacgaccacg cggatccggg 3560101 agccgacgag cacccgtgag cagacgcaga agtccccgat ttccggtgaa atcggcgact 3560161 tctgcgtctg ctcgccgcga gcgccccgac tgactacccg gcgtcgttga acttggtgat 3560221 ggcctcatca agtcgctgca gcgccgaccc gtaggcggcg aagtcgccct tcttctgcgc 3560281 atcccgcgcc gcgccgatgg cagcctggat ctcctgcagc gcagcaactt tggccggcga 3560341 taaggtgacc gccccgacgg gaaccggggg cgccgcagtc accggcggcg gttggggtcc 3560401 actggcaggc ggcggtggat tcgcagcggg actcggtggt accgctgcct ccgtgggcgc 3560461 gatcccggta gccgtcgcac cggccccggg cccgaacaag ccggtgagcg catcccgcac 3560521 cgtggggccg tatcccacct tgtcgttgta catcatcgcc acccggatca gccgcgggta 3560581 ggacgaagca gcgtcgctgg ctcccgggga tgcatagacc ggttcgacgt agagcagtcc 3560641 gccccgggcc accgggagcg tgagcaagtt gccccagcgg atgcggtttt ggttgtcgcg 3560701 tccgatgaca ccgaggtcct gggacaccgc cggatcggtg gtgatcgcgt tgttggccaa 3560761 cttgggcccg ttgacctggc ctgggatggt caacaccgtg agattgccgt aggtcgcggg 3560821 atcggaactg gcgctgatgt aggcggccag atagtcacgc ttgaatctgt tcatcgcgct 3560881 gatcaactga tatgaggctg aattatcgtc cttagcaatg tttttcgcga cgatgtaata 3560941 cggcggctga taactgctgg cggtcggatt cgggtccagc ggcacgtccc agaaatccga 3561001 tgtggagaag aacgtcaccg gatcattgac gtggtatttg gccaacaaca tgcgctgcac 3561061 cttgaacagg tcctcgggat accgcaggtg ctcggcaagc tccggcgcaa tgtcgctctt 3561121 aggctttacc gtgccgggga agacctgcat ccaggccttg agcaccggat ccttttcgtc 3561181 ctgttggtac agcgtgaccg ttccgtcgta ggcatccaca gtggccttca ccgaattgcg 3561241 gatgtaggaa accttcttgt ccgggaccaa ccggttgaac gccacctcgt tggagtccgc 3561301 ggtcgccgag gacagcgagg tgagctcgga gtacgggtaa ttgtccaacg tggtgtagcc 3561361 gtcgacgatc cacaccagtc gcttgttgac gatcgcggga tacacagcgc tgtctgtcgt 3561421 cagccacggc gcgaccgcct ccacccgctg cgccggatcg cggttgaaca agatcttgct 3561481 gttggagcca atcacattgg agaacaaaaa gtttcgctcc gcgaacttcg cagcgaacac 3561541 gctacgggct aaccaaccac cgagcgggac tccaccgctt ccggtgtagg tgtatctctt 3561601 ggtgtcgatg ttagtttcgt agtcgtattc gcggtcgtcg ccattgcgtc caacgatcgc 3561661 atagtccgcg gacgtgttag agatcaccgg accgaagtag atccgcggct gatccagtgg 3561721 cgccggccca tcagacacca cggtgccatt ggccccgacg acgttgacca agaattcggg 3561781 gtaaccgcca ttttgattcg ggtcgttggc gataccgcgc acggtgttgg ccggtgaggc 3561841 gatgaacccg ttcccgtggg tgtacacggt atgccggttg atccagtccc gttggttgtc 3561901 gatcaaccgg tccgggttga gttcgcgggc cgcgacgacg tagtcgcgca ggttaccgtt 3561961 gcggtcgagg tagcggtcga tcgacagctg gtccgggaaa tagtagaagt tcttgccctg 3562021 ctggaactgg gtgaacgccg ggctaacgat tgtcgggtcg agtagccgga tgttcgaggt 3562081 agtcgcgcgg tcggcagcga cctgttgcgc ggtagccggg ctatcaccgc tgtaattgcg 3562141 ataggtcacc acatcagacg tcaggccata ggcttgccga gttgcggtga tacttcggct 3562201 gatatattcg ctctcttttt gcgcagcgtt gggtttgacg ctgatttgct cgacgatcaa 3562261 cggccagccg gcgccgacaa tcagcgacga cagcagcaac aacaccaggc cgatcgccgg 3562321 aatccgcaag tcccgcaggg cgatcgccga gaacactgcg gccgcgcaaa tcaacgcaat 3562381 cgccatcaga atcagcttcg ccggcaggac ggcgttgata tcggtgtacc cggcaccggt 3562441 gaacggcttg ccgccacgcg tgtgcgacag cagctcatac cgatccagcc aataagcaac 3562501 ggctttaagt aacaccagta ccccgaccag gctaaccaac tggacgcgcg ccgagcggct 3562561 cagcgcaccg gtgcgtccgg atagccgaat gccaccgaag atatagtgcg ccaccagatt 3562621 cgccacgaat gccagaaata ccgaaacgag catgtagctg agcatcagcc ggtagaacgg 3562681 caactcgaac gcgtagaagc cgaggtcccg cccgaactgc ggatccctaa ccccaaagtc 3562741 accgccgtgc aggaacagct ggatccgagc ccagtagctt tgggcgacga tgccggccag 3562801 caagccgatc gccgcgggga ttccgatgcc gactagccgc aggcgtgcca gcacgacggc 3562861 gcgataccgt gcaaccggat cgttgtcggc atccgggacg aacaccgggc gagtgcggta 3562921 ggccaaggcg agcccgccga acacgatgcc gccgaccacc accccggcaa ccaagcacac 3562981 cacgatgcgg gtagccagca tggtggtgaa cactgagcgg tagccaagct caccaaacca 3563041 cagccagtcg acgtaagcgt cgatcaaacg cgggccagcg agcagcagca cgatcacacc 3563101 cagtgcgatc atgatcagaa tccggctgcg ccgtgtcagt ttcggcatcc ttgcggcgga 3563161 ccgcattccc actagctacg ctccctgatc gttctggctg gttgagactt tctcgacggt 3563221 cataactcta cgcaccgcaa ccatccgcag cagccggcgc gagctagcag ctcggcgtcg 3563281 gcgagcccga cgtcatcgcg tgcagcgcgt ccaccgcctg gctaagcgtc tcgaccttca 3563341 ccaacttcaa accgggcggg ctgtcggaac ttgcctcgta gcagttcttc gcgggcacca 3563401 gaaacaccgt cgcgccggcc gctcgagcag cggccatctt gtgggtgatg ccaccgatct 3563461 ggcccacctt gccatcgacg gcgatcgtgc cggtgcctgc gacgaacgtc gacccaacca 3563521 ggtggccact ggtgagcttg tcgacgacgg ccagactgaa catcagtccg gccgaagggc 3563581 cgccgacgtt ggcgaggtgg aagtccacgg caaacggcgc ccacggcgcg tccaccacct 3563641 ctatgcccag gacgccttgg tcgcgatcct tattcttgcc cagcgtgatc tgcgcgatgc 3563701 cgggcggctc gttcttgcgg cggaagtcga tcgtcacctc ctggcccggt ttcgtgttct 3563761 tcaacagcgc ggtgaactgg tcgaggttgc ccaccggagt gccgtcgacg gcgtcgatgg 3563821 cgtcaccggc ctgcagcttg tccaccgatg gccctggatc catgaccgag gcgacggtga 3563881 ctgctttcgg atacttcagg taccccagag cggcgtactc agcggcggcc tcggagcgct 3563941 tgaaatcagc ggcgttgtca ttttcgatct cttcccgcga cttgcccgga gggtagacga 3564001 ggtcgcgtgg catcaactgt tcttgacccg aaagccacag ggccagggct tcacccaggg 3564061 ttagaccgtc gcgctgggag accgtcgtca tgttgaggtg acctgacgtc gggtaggtct 3564121 gggtgcccac gatctggacc acctgcttgc cgtctatctc gccgagcgtg tcgaacgttg 3564181 ggccgggtcc cagcgccaca aacggcacgg ttaccacggc gagcaacacg ccgaatacca 3564241 cgatcggcac cagcgcgacc atcaaggtca atatccgcct attcacgccg catacactag 3564301 acggacctgg ccgggctggt tcagctgcga gcgtgaccgc tgatcgcacc ttctgttccc 3564361 gcggtgagta ccggtgaggt catgggtgac ctgcctttcg gcttctcttc cggagacgac 3564421 cccccggaag atccgtctgg gcgcgataag cgcgggaagg acggtgccga ttccggatcg 3564481 ggcgccaatc cgttgggcgc gttcggcatc ggtggagaat tcaacatggc cgacctgggg 3564541 caaatcttca cccgcctagg agagatgttc ggcggcgtcg gcaccgcgat ggccgcgggc 3564601 aaaacctcag gaccggtcaa ctacgacttg gcccggcagg tcgcgtcgag ctcgatcggg 3564661 ttcatcgcgc ccatcccggc ggccacgaac tcggcgatcg ccgacgcggt gcatctggcc 3564721 gacacctggc ttgacggggc aacctcgcta cccgctggcg ccaccaaggc ggtgggttgg 3564781 agccccaccg actgggtcga caacaccttg gctacctgga aacggctgtg cgatcccatg 3564841 gcccagcaga tctccacggt ctgggcgtcg tcgctgccgg aagaggccaa gagcatggcc 3564901 ggcccgctgc tgtcgatcat gtcgcagatg ggcggcatag cgtttggttc gcaactgggc 3564961 caagcgctgg gccggctgtc ccgtgaggtg ctgacgtcta ccgacatcgg tctaccgctg 3565021 gggcccaagg gggtggccgc aatactgccc ggcgccgtcg aatcgtttgc cgccggactc 3565081 gagcaaccgc gcagcgagat tctgacgttc ctggccaccc gtgaggccgc acatcaccgc 3565141 ctgttcagcc acgttccctg gctggccagt caactgctcg gcgccgtcga ggcctacgcc 3565201 atgggcatga agatcgatat gaccggaatc gaggagctgg cccgcgatat caatccgacg 3565261 tcgctggccg atcccgccgc catggaacag ctgctgagcc agggagtatt cgagcccaag 3565321 gcaacgccgg cccagacgca ggcattggaa cgactcgaaa cactgctcgc cctgatcgaa 3565381 ggctgggtgc agaccgtggt gactgcggcg ctgggcgagc gaattccggg tgaggcagcg 3565441 ctcagcgaga cgctgcgccg acgccgagcc agtggcggcc ccgccgaaca gacctttgcg 3565501 acgttggtcg ggctggagct gcggccacgc aaactgcggg aggccggagc gctgtgggag 3565561 cgcctcaccc gggccgtcgg catggacgcc cgcgacgccg tctggcagca cccggacctg 3565621 ctgcccgcca ctgacgatct cgacgacccg gccgccttta tcgaccgtgt catcggcggc 3565681 gacaccagcg gtatcgacga agcgatcgcc gaactcgagc gggaccagca ggcccgcggc 3565741 gccgacgact ccggccacga tggcggtcct gtggataact gagcggtgtg tctgctcgca 3565801 gtgtggcacc gtctcaggtc atgcggcggg ctgcgtctgc tctgtattcg ttgaatcctg 3565861 cgatgccggt gctgctaaga cccgacggtg ccgtgcaagt gggctgggat cctcgtcggg 3565921 ctgtgctcgt ccgtccaccg cgtggattaa ccgcgacagg tttggccgcg ctgctgcggt 3565981 ccatgcgatc accgatacca atcaccgagt tgcagcgcca agccgccgag cgtggattgg 3566041 ttgacggtga cgccatggcg aaccttgtcg cgcaactggt tggcgcgggt gtagcgaccc 3566101 ccctagccaa ccccggaaac ctggattccc ggcgtcgcgc cgcgtccatc cgggtccacg 3566161 gtcgcgggcc gttgtcagac ctgctcgtcc aggcgctgcg ctgctccggt gcccggatca 3566221 ggcacagcag ccaaccacat gcggcggtga ctcccgcggg cgtggatctg gtggtgttgt 3566281 cggactatct ggtggccgat ccgcacatgg tgcgcgatct gcacaccgag agagttccgc 3566341 atcttcccgt tcgggttcgt gacggcaccg ggatggtcgg gcccctggtg gtccccggcg 3566401 tgaccagctg tctcggttgc gctgacctgc atcgcagcga ccgcgacgcc gcgtggccgg 3566461 ccatcgccgc ccaattgcgg gacaccgtcg gggtggccga ccgggccacg ttgttagcga 3566521 cggcggcgct ggcgctcagc caagtgaacc gggtgatcgc cgccgtgcgt ggacaggagg 3566581 cgacccctga gcccccgtcg gcgctgaaca ccaccttgga gttcgatctc aacgctggct 3566641 ctatcgtggc gcgacaatgg accaggcatc cgcggtgttt ttgttgacgt tacgtctaac 3566701 ccagtcgtcc ctgctccggc acgttggtcg agattgacgc ataggctctg gccaaggtgt 3566761 cgagcacgtc ctctgtcagg gtgcgctcgt tgcggtgctt gtccagcgtt tcgatgatcg 3566821 ctctgaacag ggcgtcggca gcgtcgtgct gcgttgatct tgctgacatg gtttcttgcg 3566881 gtccaccctc ctgcacattt cactgatgcg gccaacacca caacgcttgt cggcgcttgt 3566941 cgacgcttgt cgactcgggg caagctcaac cgtccgcacc caggcagttg ttaccagatc 3567001 aacaccccga ccggataacc gtcatggatg atgggagtgt gtcagatatc aaacggggcc 3567061 gcgccgcgcg caatgcgaag ctggccagca tcccggtcgg cttcgccggt cgggcggcgc 3567121 tcgggctcgg caagcgactg accggtaagt caaaagacga ggttaccgcc gagctgatgg 3567181 agaaggccgc caatcagttg tttaccgtcc tcggcgaact caagggtggc gcgatgaagg 3567241 tcggccaggc gctgtcggtg atggaggccg ccattcccga cgagttcggc gaaccctacc 3567301 gggaagcact gaccaagctg cagaaggacg ccccaccgct gcccgccagt aaggtgcacc 3567361 gggtactcga cggacagctg ggcaccaaat ggcgggagcg gttcagctcg ttcaacgaca 3567421 ccccagtggc atctgccagc atcggccagg tgcacaaagc aatctggtcg gacggccgag 3567481 aagtggccgt caagatccag tatcccggcg ccgacgaggc gctgcgcgcg gacctcaaga 3567541 ccatgcagcg catggtcggc gtgctcaaac agctctcacc cggcgccgac gtccaagggg 3567601 tggtcgacga actggttgaa cgcaccgaaa tggaactcga ctaccggctg gaggccgcca 3567661 accagcgcgc cttcgccaag gcgtaccacg accacccgcg cttccaggtg cctcacgtcg 3567721 tggcaagcgc accgaaggtg gtgatccagg agtggatcga aggtgtgccg atggcagaga 3567781 tcatccgtca cgggaccacc gagcagcgtg atctgatcgg tacgctgctc gccgagctca 3567841 ccttcgacgc accacggcgg ctggggttga tgcacggcga cgcccacccc ggtaatttca 3567901 tgctgctgcc cgacggccgg atgggcatca tcgacttcgg tgccgtggca ccgatgcccg 3567961 gcggcttccc gatagagctc gggatgacga ttcgactggc ccgcgagaag aactacgacc 3568021 tcctgttgcc gacgatggag aaggccgggt tgatccagcg aggacgacag gtgtcggttc 3568081 gcgagatcga cgagatgctg cgccaatacg tcgagcccat ccaggtcgag gtcttccact 3568141 acacccgcaa gtggttacag aaaatgaccg tcagtcagat cgaccgctcg gttgcgcaga 3568201 tcagaacggc gcgccagatg gacctgccgg ccaagctcgc gattccgatg cgggttatcg 3568261 catcggtggg cgcgatccta tgccagctgg acgcgcatgt gccgatcaag gccctgtcgg 3568321 aggagctgat cccgggtttc gccgagcccg acgcgatcgt cgtctgagcc ggctcgcgcc 3568381 ggcgggcgca ccatcgcggg ctatgcaaca gcatccttgc gcggacgtcc gcgcggacgc 3568441 ttgtgactca cgatcgagcc ttggtcgaat atctcaccac cccaaacgcc ccagggttca 3568501 gcccgctgaa gcgccgcggc caagcactgc cgcctgatcg ggcagctcac acacagtgtc 3568561 ttggctacct cgagaccggc cggggtatcg gcgaaccaca gatcgggatc accgacgtgg 3568621 cacggcaaaa ccggcaatct ttgtctgggg gtctgtctgg ggactgtcag taccgacacg 3568681 tcctgtttca cctgcttcct ggtctggtgg cggttcttcg aaagtgatcc ggaccaggga 3568741 tgctgcggtg ggcagatgtc ccgaaagttt ggccacggat cctgtgactt cgggtccgtg 3568801 gccatctggc gaaacggggc tgattacgta gcgcttacgt agagccccgc tccacggact 3568861 cgtcagtcgc ggcggcgaca cggttcttgc tatggggggt tcccgcggtt ggcaccgcgg 3568921 cagccgcgcc gacaccaaat gcgttgttgt caatcaccgc ggccgccctc ctctcgtgtc 3568981 gcgcgcggtt gccagccccc caatgccatc tccaggctgg cagcagaatg cgacctggag 3569041 gttaaccggt ggcagcagct gaccacaacc gattttctga cctgcgcgtt tgccggtaca 3569101 ggcccggttc aggtccgacc gcgaaccagc tgcagcacgt ccgatccgta ttgttccagc 3569161 ttgcgggcac cgatgccggg gatcgcgatc agcgccgcgt cgtcggtagg tagcagctcg 3569221 gcgatcgcga tcagggtgtt gtcggtgaaa acgacatagg cggggacgtt ctgttccttg 3569281 gcggtgctca gacgccagga cttgagctgc aacaacaact cctcgtcgac gtcggctgca 3569341 cacgtctcac accgccgcag catgacggcc gccgaagtgt tcagctcgtt gttacagatc 3569401 cggcagcgcg ctgcggcgcc ccggttgcgt cgggatgtgc ccggcaccgg atcggcgcgc 3569461 gtctgcggcg caatgccgtt gaggaaccgc gagggcttgc ggctctggcg cccgcccggg 3569521 gaccgtgata gcgcccagct gagcgccaaa tggactcggg cccgtgtgat tccgacgtag 3569581 agcagccgac gctcttcctc tacgggctcg ctattggggc cgtgtgccag cgcatgtgag 3569641 atgggcagcg tgccgtcagc caatccgacc aggaacaccg cgtcccattc cagtcccttg 3569701 gcggcgtgca gtgaggccag cgtgacgccc tgcaccaccg gtgggtgccg cgcctccgcc 3569761 cgccggcgta gctcggcaag caggcctggc agctgcagtg cgggacgctg cgccagctcg 3569821 tcgtcgacca gctcggccag cgcggtgagc gcttcccagc gttccctggc gcgggtgccg 3569881 accggcggtt gtgccgtcag ccccagtggt gcgagcaccg cgcgaaccac gtcggacaac 3569941 gcggcatcgg tatcacgttc ggacacacgc tgtaaggcaa gcaacgcctg cttgatttcc 3570001 tgacggttga aaaacccctc gccaccgcga acctgatagg cgatacccgc ctgggtcaac 3570061 gcctcttcat aaacctctga ctgcgcattg actcggtaga gaatggctac ctcggatggc 3570121 ggagtgcccg atgcgattaa ccgggcgatt gacgccgcca ccgtggcagc ctcggcgggc 3570181 tcgtcggaat gctcatggaa cgacgggacc ggacccggct cacgctggcc ggacaaccgt 3570241 agcttgctgc cggcaacacg gccccgggcg gcggcgatca cccggttagc caatgacacc 3570301 acctgcggag ttgaccggta atcacgctcc agccgcacca ccgcggcgtc cgggaaccgc 3570361 cgcgagaagt cgagtaggaa acgaggcgaa gccccggtaa acgagtagat ggtctggttg 3570421 gcgtcgccga cgacggtcag gtcgtcccga tcacccaacc aggccgagag cacccgctgc 3570481 tgcagggggg tgacgtcctg gtactcgtcc acgacgaaac accggtaccg gtcctggaac 3570541 tcctcggcca ccgcggcgtc gttttcaatc gcggccgcgg tgtgcagcaa caggtcgtcg 3570601 aagtcaagta aggtgacgcc gtcgccgcgg gccttgagcg cctcgtattc ggagtagaca 3570661 gccgcgattt gcgcggcgtc caacgggggg tctcggcgtg cggccgccac tgcggtcaca 3570721 tactcctcgg ggccgatcag ggacgccttg gcccactcga tctcgccggc caggtcacgc 3570781 acatcatcgg tgctggcgtg cagcctggtg cggctggcgg cgcgggccac cacggcgaac 3570841 ttgctgtcca gcagctgcca gccggtgtca gcgattacgc gcgaccagaa gtaccgcagc 3570901 tggcgatacg cggccgcgtg aaaggtcagc gcctgcacag cgccgacgcc cgaaccggtc 3570961 cgtgccgcgg cgtcgagtgc gcgcaaccgg ctgcgcattt cgcccgccgc gcgctgggtg 3571021 aatgtcacag ccagcacctg cccggcggcg acgtgaccgc tcgcgaccag cgaagcgatc 3571081 cggtgagtga tggtgcgggt cttgccggtt ccggcaccgg ccagcacgca caccggtcca 3571141 cgcggagcca gtacggcttc gcgctgctgg tcgtccagcc cggcaatcaa tgggtcgctg 3571201 gctatcgaca tgacgtccat cttggcagcg gtagatgaca gaccgggcgt gtcgccacgc 3571261 cgtggggcgt gcgacatgaa caactgccga gccgccacac cgcccgggtc gtcgccgcgc 3571321 taggttagcg tgtcatgatc accgctgcgc tcaccatcta tacgacatca tggtgtggct 3571381 attgccttcg actcaaaaca gcgctcacgg ccaaccgaat cgcttacgac gaggtcgaca 3571441 tcgaacacaa ccgtgcggcc gcggagttcg tcggctcggt caatggcggc aacagaactg 3571501 ttcccacggt gaagttcgcc gacgggtcga cgctgactaa cccgagcgcg gacgaggtca 3571561 aagcgaagct ggtaaagatc gcgggttaac gacgtggact ttcattcgca cgctgcccac 3571621 gattcgatga tcacgcgggc gatcgagatc gacccgggca gtagcagttt cgactccgac 3571681 gcactggacc aatcgccggc ggcaagcgct gcgcgcacct catcgcgggt gaaccacgcg 3571741 gcttcggcga tttcgccgtc gctgaacgag aactcctcat ccgggtcacc caaggcatga 3571801 aagccaacca ttaacgaccg cgggaacggc cactgctggc tgcccagata gcgcacatcg 3571861 cgaacggtca ggccgatttc ctcgcggatc tcccgggcga cgcagacttc gaacgactct 3571921 ccggcctcga caaagccagc caacagcgag aacatccgtt ccggccacgc cgcctggcga 3571981 gccaacacgg cacgatcagc gccgtcgtga accaggcaga tcaccgccgg gtcgatacgg 3572041 gggaactcct catgaccggt gatcgggttg acccgtgacc agccggccct ggccggtttc 3572101 gtcggcgcgc cgtctagggc gctgaatcgt gcgttgtcat gccagttcaa cagcgccgat 3572161 gccgacgaca ccagttggct gctggtgtcg tccatgattc ggccgagccc acgaaggtcc 3572221 accgcctcgg ctggtatgtc gggatcagcg atcggctgca gcgctgcccg caccgcccag 3572281 acgtggcggc cgccctcgac gcgacccagg aataccgcct ctggcggtgg cttgtcggcc 3572341 agctcgatgg ccgcgccaag caacacccgg ccgttggcga ccagcacgcg attgcgggaa 3572401 tccacccgca gcaatgccgc gcctggccat cccgcggcgg ccgcctccat gtcggtcctc 3572461 agccggtcgg cccggtcggc gccgacgcgc gaaagcaacg gaacgcttct cagctgaaaa 3572521 tccacgccgc ttacgttcgt cactggcgcc ccacctggtg gcgacccgcc gcgcccggct 3572581 ccgccgcgct tgcgatcgcc actagcgccc cacctggcga atatagagca gccggtcgct 3572641 ggcctcgatg gcgtccacct cgggcgcccc aatgcgcagc agctggccgt cacgtaccac 3572701 gccgagcacg atgtcgcgca ggtgccgcgg agacccgccc acctcggcct gctccacctc 3572761 acgttcggca acggccaggc cggcttccgg ggtcagcaga tcctcgatca tctccacgac 3572821 gctgggcgtc gtggtagcga tgccgagcag ccgcccggcg gtctcggagg agaccaccac 3572881 cgtgtccgca cccgactgcc gcaacaagtg ctggttttcg gcctcccgga tggacgccac 3572941 gatcttggct ttgggcgcaa tctcgcgcgc cgtcaacgtg acgagcacag cggtgtcgtc 3573001 gcgactggtg gcgacgatga tcgaagacgc atgctgagtg ccggccaacc tcagcacgtc 3573061 ggacttggtg gcatcaccat gcacggtgac cagaccggct gccgcggcac gttcgaggac 3573121 acccgaatcg gtgtcgacga ccacaatttc acccggaact aactcgtcac tgaccatcgc 3573181 ggccaccgcc gttttgccct tggtgccgta gccgatgacg acggtatggt tgcgcactct 3573241 gctcctccaa cgctggatct tgtacgcctg acgggatgtt tccgtgagga cttcgagagt 3573301 cgtgccgacc aacaagatca agaacgcaat ccgcagcggt gtgatgacga agatgttgat 3573361 cgctcgcgcg aattcggaaa tgggcgtgat gtcgccgtag ccggtcgtcg acagcgtcac 3573421 cgcagcgtag tagaggcaat ccagaaacgt cagccgatcg ccctgggcgt cgaggtagcc 3573481 gtcgcggtcg acgtagacga tcccggcggt gagcagcaac gccaccacag cgacgaccac 3573541 ccggcgtgaa ataacgcgag ctggactggc ccgcctttgg ggaatgcgca gcacgccgac 3573601 aagcgcgtaa ccaggctgcg cggtcagctt ctcgttgagc ccccgcaacc gccgccagct 3573661 accggccacc gaaatccgtc accggttagc cccaatgcac gccaaacgca cgacacaaat 3573721 ggtaaccacg tcaggtgtcc gaccgccgac cggcgcagtc ggtcagtagc atggccaact 3573781 cgccgggagc gggtaactcg tcggggacga ccgtgatgcc gctgcgcacg taatagaagg 3573841 cggtacgcac cgaggatgtc ggacatcccc gcaatgcggc ccaggccagt cgatagacag 3573901 cgagctggac agcggcctgc cgcatggctg ccggcccgtg cggcggcttg ccggtcttcc 3573961 agtccaccac ggtggcaccg ccgtcggggt cgacgaacac cgcgtcgatg cggccgcgca 3574021 ccacggtatc gccgatcggc atttcgaacg gcacttcgac cgccgccggg gtgcgagccg 3574081 cccacgatga tgcggtgaac gccctctgca acgcggccaa ctcctcagga tcgcccacct 3574141 cgcggtccgc tgcacctggc aggtcaccca ggtcaaacag cagttcagca ccgtaaaatt 3574201 gctgaaccca ggcgtgaaat gcatcgccca accacgcgtg cgggtccggg cgttttggca 3574261 gccgacacat cagccgctgc cgcgcaccga ccgggtcgcc gaccagctcc accaaactgc 3574321 tgaccgacaa atggttcggc agaccacggg caggtgctcc ccgcgccgcg tgcgcacgtt 3574381 cagccaacag tgcatcgacg tcagtggacc agggggcatc gcccgggcgc gggggatgat 3574441 cgatgtcggt ggtgcttccg ggcaagtcgg ccgacatggc cgccgccacc agcgccgcgc 3574501 cccgctccac atcgccgcga cgtgcggcca acggatcagc gggccaaacc gcctcgatag 3574561 cgttgtcaca caatgggttt cgctcatcgc cggcgggcgc cgacgcccac tgctcgacga 3574621 ctccgcaagg atcaccggca gcggccgaac ggtcaatgat gtccttgagt tcgcacagga 3574681 attccgatgg cccgcgcggc tttgtcccgg tgggccccca atggtggccg gacaccagca 3574741 gagtgtcctc agcccgggta acggccacgt acaacagtcg acgctcctcg tcaacgcgcc 3574801 gccgatcgag caggcgacga tgttcggaga tcttgtccga caactgtttt cggtcagcga 3574861 cagctgacgt gtccagtacg gggatgccgt gcgcgccggc cgaggcgcga tccccacgca 3574921 gcagcggcgg tagttcggcg gggtcggtaa gccagctgct gcgcgacacc gtcgacggaa 3574981 acactccgcg cgacaggtgt gccaccgcca ccacctgcca ttccaagccc ttggcggcgt 3575041 gcacggtcag cacctggacc cggtcgcagg cgacggtcaa ctcggcaggc ggcaaaccgt 3575101 tctcgaccac ctcggcgacg tccaaataag ccagcaggcc cgcaaccgac gcctcgctgg 3575161 acctagcgct ggcccgttcg gcgtaccccg cgaccacgtc ggcgaacgca tcaaggtgct 3575221 cgggtccggc ccagccacct gagaccgggg ccgaggcccg cacctcgcaa tcgacgccaa 3575281 gcacgcggcg cacctcggct actaggtcgg gcagggaatg accgaggcga ccgcgcagcg 3575341 cgctcagttc accggccaag gcgccgatgc gcccatatcc cgccaccgaa tacccctcgg 3575401 cggaacctgg atcgctgatg gcgtcggcca gacacggatt gtcggcgtcc gcgctggccg 3575461 ccatcgcgat cgattcgggc gacgccgttg acggtgattc gccactcagc gtcagcgcac 3575521 gccgccacag cgcggcgagg tcccgggcgc cgagccgcca ccgtgggcca gtcagcaccc 3575581 gcatcgcggc cgccccggcc gttgggtcgg caaccaggcg cagcatggcc accacctcgg 3575641 cgacctcggg gatggacagt aggccggcca gcccgacaac ttcagccggg attccgcggg 3575701 cccgcagggt atcagcgata gcggcggcgt cggcgttgcg gcgtaccagc accgccgcgg 3575761 tgggcggctt gacaccgtcc gcttctgccc gctggtaacg catccgcaag tggtcggcga 3575821 tccattcgcg ttcggcctgc acgtcgggaa gcaacgcgca gcggacggct ccaggcgggg 3575881 catccggacg cggccgcaac gcgcgcaccg caaccgagcg ccgccgcgcc tccgccgata 3575941 tgccattggc cacgcgcagc gcttgcggcg ggttgcgcca gctggtcagc agctccagca 3576001 ccggcgcggg ggtgccgtcc gataagggga agtcggtggt gaaccggggc aggttcgtcg 3576061 ccgaagcgcc gcgccacccg tagatcgact gaatcgggtc accgacagcc gtcagcgcca 3576121 acccgtcatc aacgccgccg ccaaacagcg acgacaacac aacgcgctgc gcgtgccccg 3576181 tgtcctggta ttcgtccagt aacaccaccc ggtagcgcct ccgcagatcc tggccaactt 3576241 ggggagaggt cgccgccaac cgtgcggccg aggccatctg catggcgaaa tccatcactt 3576301 tgccggcgtg catccgctca cccaacgcgt caagcaacgg caccaactcc gcgcgctggg 3576361 tctgggtggc cagcatccgc agcagccact ggctggggcc gcggtcacgc tgatagcggc 3576421 ccgccggcag agcgtggacc agccgttcca gctcgacgtg ggtgtcgcga agcgcgcggg 3576481 tgtcgaccag atgctcgcca agctggcccc ataaccgcac cacgatcgag gtgaccgccg 3576541 ccgggctctt gtcggtgcac agcacgccgt cgtacccgct gaccacatcg aatgccagct 3576601 gccacagctc ggtctcgctc agcaacctgg tatcgggttc cagcggtagc agcaggccgt 3576661 agtcgcgtag tagcgagccg gcaaaggcgt ggtaggtgct gactaccgga gcgcaggccg 3576721 ccgggtcgcc gcagccgagg ccgataccgg ccaacctggc cagacgggac cgaacgcggc 3576781 gcaacagctg gcccgcggcc ttgcgggtga acgtcaatcc cagcacctgg ccgggttccg 3576841 cgtagccgtt ggcaaccagc cacaccaccc gggcagccat cgtttcggtt tttccggcgc 3576901 cggctcccgc gatgacgacc agcgggccgg gaggtgcggc gattaccgcg gcctgctcag 3576961 cggtgggcgg gaaaagtcct agcgcgcagg ctagttcagc tggactgtag cgtgccggtg 3577021 ccgcggtttg ggtcatggcg ccgaccctcg gacgtgggcc ggacagcccg gccgcagcgg 3577081 gcagtgggtg cacccgtcgt tgcgccgagc gatgaactgg ggaccggctg tcgccgcggc 3577141 cagctgccgg acgaggttgc gccattcgtc gcgcgcggcc ggtgtgagtg gatcctgttt 3577201 gcgttcggcg acgccagcgg ccccgctttt gccgacatag accagccggg caccgccggg 3577261 ctcgtccccg gcgcgcacca agccttcggc caccgccagc tgatacatcg ccagctgggc 3577321 gtgctgctgg gcatcgtcct tgctgaccgg tgtcttgccg gttttgatgt cgacgatcac 3577381 caggcggccg gccgggtcgc gttccagccg atccgcccgg ccacgcaacc gaatttttct 3577441 ggcttgaccg ctaccgtcct cgagggcccc atcgatgtcg acctccacgc caacttcggt 3577501 cagctcggat cgactctgag ctcgccactg tacgaacgcc tggatcatcg cgcggtgccg 3577561 ggcaagctcg ttggccgaat accactgagc gccgaacggc agatggcccc acacccggtc 3577621 cagttcagcc agcagttggg attcgctcct gcccggctcg gcaaacagtg cgtgcaacac 3577681 cgatccgacg gcagacggca gctcgcgggt gtttgttccg ccgtgccgct cggccagcca 3577741 gcgcagtggg cagtcgttga gtgcctgcaa agtcgacggc gtcaacgtga cgagatcgtc 3577801 gctatcgcac aacggatcac tcgtgctgac cggggccagg ccatgccact cggacgggtc 3577861 ggcacctggc acaccggctt tggccaaccg ggccaattgc gttgccgcac aatcgcgatc 3577921 ggcgtcatct accgcgcagg caggcgcgca caccacaacg cgtaaccggc ctaccaccgc 3577981 cgcagccgac aacacgcgcg gcgccgagac cggctgcatc gcgacgggtt cgccatcgcc 3578041 gtcggcccac tgggcaatct cgaaaaagaa cgccgatggc agcaccgcct cgtgcccgcc 3578101 cccgcccgcg tcgctatcta cggcggtcac cagcaaccgc cgccgggccc gccccatcgc 3578161 ggtcaccagc agccggcgct cctcggccag caacggcgcg cgcatcgagg catccttcgt 3578221 gacaccgtcg agttcgtcca gcagccgctg ggtgccaagc acaccgccac gtggaaccgt 3578281 gttgggccac aagccgtcct gtaggccggc gataactacc agatcccatt cgtgtcccag 3578341 cgcggcatgt gcgctaagga ccatgacctg ctctgtcggg gctgccggtt cgggtcgcac 3578401 aaccggcagc tgcagcgcgg tgacgtgctc gacgagtccg cgcagggacg cacccgaggt 3578461 gcgggacacg taatggtcgg tgatgtcgaa caaggcggtc accgtttcca ggtcccgggt 3578521 ggcctggaca gccgccgcac caccatgctc gctggccgcc agccagcggc gttgcagacc 3578581 cgaccgttgc caggcagccc atagcgtgtg gcgcggatcc tggccaccca gacttcctga 3578641 gcggtggcag cgcgcggccg cggtcagcac ggcacgcacg cgccgcagtg cccgcgaccc 3578701 tggccccgat ggcggcgcgt cgccgccgag cacttccacc agcaggtcgc cgaacttcct 3578761 cgaagtctgg ccgggacgtg cgcgttgcag agtccggcgc agctggcgaa gtgataccgg 3578821 gtccacacca ccaatcggcc cggtgagcag gagcagcgcc tggtcgccgt cgagcccgtc 3578881 agccgtcgcc tcgagcaccg tgagcagcgc ccgtaccgcc ggctccgcgg acaacggccc 3578941 gccaactgca ggtggggcca ccggcacccc ggcggcggcc agagcgcgcg gcaaccgcac 3579001 agcgcgcggc accgacctga cgatcaccgc catctgcgac caaggcaccc catcgatcag 3579061 gtgcgcgcgt cgcagcgcgt cggcaatcat cgctgcctca gcgtgcgccg aaccggccag 3579121 gcgcaccgtc accgatccga cctcggtccc ggtgccctcg attcgccgac cgacgcttcg 3579181 acccggtagc cgtcgtgcga tgccggtgac ggcccgcgcc acggcgggtg cacaccgatg 3579241 agagaccgtc aacgtcaccg acggaatggg ggcaccacct gctggcggcg gatcgtcggc 3579301 cagcaggccg gtgggctcgc cgccgcggaa cccgaacacc gcttggttcg gatcaccggc 3579361 gatcagggcc agctcggtgc ccgccgccag catccggacc aggcgtgccg cctgcggatc 3579421 aagttgttgg gcgtcgtcga ccaaaagggt ccggacccgg gcgcgttcgg cggccagtaa 3579481 ctcaggatcg accgcgaagg cctccaaagc tgcccccacc agttcggcgg cactcagcgc 3579541 cggcgccgtg gcctgcggcg ccgccagccc caccgcaccc cgcaacaaca tcacctgctc 3579601 gtaccgctgg gcgaattgac cggcggcgat ccattccgga cggccgcggc gacggcccag 3579661 ttgctgcaac tccagcgggt ccaggccgcg ttcggcgcaa cgtgccaaca ggtttcgcag 3579721 ctcggtggcg aagccggcgg tagtcagcgc gggccgcaga tgcgcaggcc aggtggtggt 3579781 ggcggccggt ccgtcttcgg cgtccccggc cagcagttcc cgaatgatgg cgtcctgctc 3579841 ggcgctggta agcagccgcg gcaaggcgtc accggcgcgc tgtgcggcct tgcgcaagac 3579901 cgcataggcg tagctgtgca cggtgcgtac caccggttcg cggatcgccg cccggcaagg 3579961 gccgttggtg cgcgaccgca gcagcgccgt cgtcagcgca ctgcgggccc gcatgcccat 3580021 tcggccggaa ccggtcagca gcagaaccga ctccgggtcg gtgccggcgc cgatgtgagc 3580081 gaccgcggcc tcaaccaaca gtgtgctctt accggtgccc gggccgccca gcacaagcac 3580141 cggaccgcgc aaacccggcg cgagggccgc acccgcctcg acaccccaga tatgtgacat 3580201 agccgcatga catcacgagg gtctgacaag ctcggatact ggagctggca agaaaaccga 3580261 aaacgcgatg tgaggggtgg ctaccatggc ggcggtcgta ggcggcggtc cacaggacga 3580321 aatacccgaa gccgatgcgg tggagcaagg gcgtgctgtc gatttcgacg acgaagccgg 3580381 gttggacacc gcctacctca gcggcggcgc cggcgaccga gacgccagcg aagccgacgt 3580441 cgtcgaccaa gccttcgtcg ttccggtcgc cgacgacgaa gaaatcgacc ggtagcaggc 3580501 gtcgccgggc tggcatcatc gacgcgtgat catcgacctt cacgtacagc gctacggccc 3580561 gtcagggccc gcgcgggtgc tgaccatcca cggagtgacc gagcacgggc gcatctggca 3580621 ccggttagcc catcactttg cccgaaatcc ccatcgccgc acccgatctg ctgggccacg 3580681 gtaggtcacc atgggccgcg ccgtggacca tcgacgccaa cgtgtccgcc ctggcagcac 3580741 tcctcgacaa tcagggcgac ggtccggtag tggtggtcgg acactccttc ggcggcgctg 3580801 tcgctatgca cctggccgcg gcccgcccag accaggtcgc ggcgctggtg ttgctcgacc 3580861 cggcggtcgc tctggacggg tcccgggtac gcgaggtggt cgacgccatg ctggcctctc 3580921 ccgactacct ggaccccgcc gaggcccggg ccgagaaggc gaccggtgcc tgggcggacg 3580981 tggacccccc agtgctcgac gccgaactcg acgagcacct cgtcgcattg cccaacggtc 3581041 ggtacggttg gcgtatcagc ctgccggcga tggtgtgcta ctggagcgaa ctggcccgcg 3581101 acatcgtgct gccgccggtg ggaacggcaa ccacgctggt tcgggcggtc cgtgcgtcac 3581161 cggcgtacgt cagcgaccag ctgctcgcgg ccctggacaa acggctagga gccgattttg 3581221 agctactaga cttcgactgc gggcacatgg tgccccaagc caagcccact gaggtcgcgg 3581281 cggtgatccg cagtcgactg ggaccgcgct agccatggcg ccggtgaccg acgaacaggt 3581341 ggagctggtg cgctcactgg tcgcggccat cccactcggc cgggtgtcca cctacggcga 3581401 catcgcagct ctcacagggc tttccagtcc gcgtattgtc ggctggatta tgcggaccga 3581461 ttcctcggat ctgccctggc accgggtgat cagagcctcc gggcgcccag cacagcacct 3581521 ggccacccgg cagttggagt tgttgcgcgc agagggcgtt ctcagtgttg acggccgggt 3581581 ggcgctgagc gagatccgct atgagtttcc gccgggctga gtaggtttag agcactagcc 3581641 gcactagggc cgcggtgtgg gccaggccgg gaaacgcttc ggcggtggat cgtgggtgca 3581701 gcgcgtacac tgctaggcgg aacatcaacg cgcgcaacaa catctggggc cactccggca 3581761 gcgcgttcca ccgctcgatg agcccgtcgt cggccgcacc ccaggacagc gcgtcgacga 3581821 cggccacccc ggccgcccag gatgcgggcc gccagtaggg cgtgatgtcg gtgatccctg 3581881 gaggggcggt gcccgcgaaa agcactgtac cgtaaagatc tccgtgcacc agctggttcg 3581941 ggctcttggt cggcttacgc aacccggcaa gctgattgat cagatcgatc gatcgctggg 3582001 ggtccgctgc cgggggggcg gtcggcacgc ccggtgggac cgactgtaat ggccgctcct 3582061 cccacccagc tcggtctgcg gcgacgaaca catcgatctc ggcccagggc gccgcgggtc 3582121 cctgggtcaa gaatcggggg cgttccagtt ttccggtggc ctcatgcagc cgcaccgccg 3582181 ccgagacgac ctcatcatgc ctaggctccg gcgcgccggc gacgaacgtg tctgcccgcc 3582241 aaccagacac cacgtaccgg ccgtcggtcg atcggacggg ccgagccagg cgtacgccgt 3582301 cgacgaacaa cgtctcgcgc acccgggccg accaggccgc gcgggcgttg tcggccacca 3582361 tcgacaacac cacctcgccg catcgccagc caccttccca accggcaccc aacaggatgg 3582421 gttgcgcacc tgccaaaccg aacgccacca acacgtgctc gggcggcggc tcgacattca 3582481 caccggtcag cctagtagag cccatcgggg tgtattgggc ctgtatcggt cctagtacat 3582541 caccatgtcg ggctgcatct gcttggccca cgcgacgatc ccaccctgca ggtgtaccgc 3582601 gtcggagaaa ccggctttct tgaccgcagc caatgcctcg gccgagcgca cgcccgtctt 3582661 gcagtacagc acggcggtgc ggtcctgggg gagcttggcc agaccctcac ccgagttgat 3582721 caacgatttc ggaatcagtt gggctccgtc gatatgcacg atgtcccact ccacgggatc 3582781 gcgaacgtcg atcagtgcca gcttacggcc ggagtccagc cagtcgcgca gctcgcgcgg 3582841 cgtgatggtg gaacctttgg ccgcctgggc ggcatcgtca gcaaccacgc cgcagaactg 3582901 ttcgtagtcg accagctcgg tgatcttcgg tgtcgatggg tccttgcgga tggtgatcgt 3582961 gcgatagctc atctccagcg cgtcgtacac cagcaaccgg ccaagcagtg tttcacctat 3583021 cccggtgatc agcttgatcg cctcagtgcc catcaccgat gcgaccgagg cacagataat 3583081 gcccagcacc ccgccttcag cacaggacgg caccatgccc ggcggcggcg gctcgggata 3583141 caggtcgcgg tagttgacac ccaacccgtc gggggcgtcc tcccaaaaca ccgatgcctg 3583201 gccctcgaag cggtaaatcg acccccacac gtacggcttg ccagccagca ccgcggcgtc 3583261 gttgaccaga taccgggtgg cgaagttgtc ggtgccatcc aagatcaggt cgtactgctt 3583321 gaacaggtcg acggcgttgc tcggcgcaag ccgcagctcg tgtagtcgca cccggatcag 3583381 cgggttgatc gcgacaatcg aatcgcgcgc cgactgagcc ttggagcgcc cgacgtcagc 3583441 taccccatgg atgacctggc gctgcaggtt cgactcgtca accacatcga agtcgacgat 3583501 gccgatggtg ccgacgccgg cggcggccag atacaataac gtgggcgctc cgagcccgcc 3583561 ggcgccgatc accagtactc gcgcgttctt gagcctcttc tgcccgtcaa cacccaggtc 3583621 aggaatgatg agatggcggc tgtagcgagc tacctcttca cggctgagcg cggatgctgg 3583681 ctcaactagt ggcggcaagg atgtcgacac cgaatatctc ctcggttata tccgaaacgt 3583741 ctgctgcgcg tcgtcctgca aatacctcaa cgcccagctt gccacctttg cttccccggg 3583801 ttagggaatc gggtagggcc agggattgaa tcggcaggtc tttccatccg ccttaacgaa 3583861 gtcggggtca aacttggccg cgtcgtcatt ggaggtggaa aacgtctgct gcatcattac 3583921 cggagccaga ccgccttgtt ggtcgcacgg ctcgtggcgc aggtaaccga tggcatgacc 3583981 gacctcgtgg ttgatcacat attgccgata ggaacctacg tcaccttcga atggaacggc 3584041 tccgcgtacc cagcgcgcct cgttgatgaa cacccgcgat tggcgatcca tgccgccgaa 3584101 cgacgggttg tagcaggacg tctcgagccg gaattcgtag ccacaccccc cgcgcactgt 3584161 cgtcggcgac accagcgaaa tccggaagtc gggttttccg ctgtcgatcc gcacgaacgc 3584221 gaattgcgga ttgtgggtcc agcccttggg attggtcaac gtctggtcga ccatctgggc 3584281 gaatgcgttg tcaccgccgt acattgtggg atcaagaccg ttctcgatct cgacggtata 3584341 cctgaacact ttgacggtgc cttgaccgac ctggggagta gtgcccggaa cgacacgcca 3584401 ggtcttgtca ccagcctcgg tgaacgggcc gccatccggc agcgtcccgg ccggcagatt 3584461 ggcatcgaac actgcaagac cgcgaggcgg tgcgtcgagg atcgcggtcc ccaccacacc 3584521 aatggccggc gagtcccgga cggtctgggc cgccgcgggc cttggcgtgc tcgtcccggt 3584581 caccgtctgg tacaccacca ccgtggtcag caccatcaga accggcaggg cgtaggcgcg 3584641 ccagccgtac gtggacacga accgccccaa ccaggtttgt ttgcgccatt gacgcttccg 3584701 gtcgcggcgg gcccggaccc gtctgtcagt cgcggcgagc gggtcgcgca gggcccgcag 3584761 cggctcacgc cactcgtcac gcagcacggg tactcgactc gtgcttccgg cgggccacgg 3584821 agacgtcatt tcctcaggat gacacagctg gcccgggtcg cgaccctggc gcgcccgaat 3584881 gcaacaccca acaaactatc ccgccgctac cgatgccgca ggtagtaatg tcattccgac 3584941 agacgcgcgg cggtgggggt tggcacagtg gccctcgaat tagtgtgatc agattgagga 3585001 ctgatgagcg atctcgccaa gacagcgcag cgacgtgccc tcagatcgtc cggcagcgct 3585061 cggccagacg aagacgttcc ggccccgaac cggcgcggca accgactgcc tcgcgacgag 3585121 cgccgcggcc aattgcttgt cgttgccagt gacgtcttcg tcgatcgggg ttaccacgcg 3585181 gccggtatgg acgagatcgc ggatcgggcg ggagtcagta aacccgttct gtatcaacat 3585241 ttttcgagca agttagaact ttacctggct gtgcttcatc ggcacgtgga aaacctggtg 3585301 tccggcgtgc atcaggcgct gagcacgact accgacaacc ggcagcggtt gcacgtggcc 3585361 gtccaggcgt tcttcgactt catcgagcac gacagccagg gttaccggct gatcttcgag 3585421 aacgacttcg tcaccgagcc cgaggtcgcc gcacaggtgc gggtggccac cgaatcgtgc 3585481 atcgacgcag tgttcgcgct gatcagcgcc gattccggac tggacccgca ccgcgcccgg 3585541 atgatcgcgg tgggcttggt cggaatgagc gtcgactgcg ccagatactg gctggacgcc 3585601 gacaagccga tttccaagtc cgacgccgtc gagggcaccg tgcagttcgc ctggggcggg 3585661 ttgtcccacg tcccgcttac ccgctcgtag caacctttcc ggcggaccca gctgcggcgt 3585721 ccaccccgac gccgaagccc acccggcggg cgtctgcgac accgatctcg acataggcga 3585781 tcctggcggt gtgaattagg aagcgacggc cccgctcgtc ggtcagggtc agcaaaccag 3585841 agtcgtcgcg cagcgcgttg ctgacgagtt cttctacctc actgggcgtc tgcgcactgg 3585901 agaacaccag ctcgcgcgga ctgtccgtga taccgatctt gacctccacg gtggcccctt 3585961 ccattggcat tccgtcacag gcgtgtcacc agcaggctag tagacgcccc tggcccccat 3586021 aacggttagg tctaggccag cccgacacgc cgccagacac cccatccgcc ggcaggggct 3586081 cgataacatc agcaccatcg gtaacacagt taacgacctc tacgagtgcg ttcggaacgt 3586141 ccgggaagtc caggactacc cggacgacga gagctcgagc ggcttcgggg ctggccggac 3586201 ctgttcggaa ggcgagtttg cctgggcggc tgaccgccga tggcgcccgg tagctgcgat 3586261 cctcggcagt gtggtggcgc ttggcgcggt cgcgaccgca gtcattatca acagcggaga 3586321 tagcacgtcg accaaggcca ttgtcggggc accagccccg cgcacggtga tatccacctc 3586381 gccacgacca acggccccga ccagcacgtc accccaccct tcgcccagca ccttgcggcc 3586441 gcagctcccg ccggagacgg tcaccacggt ggcaccgccg ggcaccgggc ctactaccgt 3586501 gccgacgcga acccccaccg ccgcgccacc tcagactgct gtgccaccgc cggcgccgct 3586561 gaatccgcgc accgtcgtct accgcgtgac cggcaccaag cagctgttcg acctggtgaa 3586621 cgtcgtctac accgatgcgc ggggcttccc ggtgaccgac ttcaacgtgt cgctgccgtg 3586681 gacgaagatg gtcgttctga accccggcgt gcaaaccgaa tcggtcgtcg cgaccagcct 3586741 ttacagtcgt ctcaactgct cgatcgtcaa taccggcgct cagacggtgg tggcgtcaac 3586801 caacaatgcg atcatcgcga catgcactcg ctagatctgg gatctagctg agacccagtt 3586861 cccgcatgcg ttggtcgtgg gtctgctgca accggtcgaa gaaggcacca agctggctca 3586921 gtccaccact accggacacc accaggtcga ccagctcgtc gtggtcggcc aacaccagct 3586981 gggcctgcgt tatcgcctcg ccgagcagac gacgcgacca cagcgccagt cggctgcgct 3587041 gtttgccgct ggccgtcacc gctgcgcgca cttcggcgac gacgaactga gagtgcccgg 3587101 tctccgacaa cgccgcccgc accacgtcag caacctcgtc aggcagcccg tcggcgatct 3587161 ccagatacaa atcggcggcc aacgcatcgg caacataggt cttcaccagg gcttccagcc 3587221 atgtgctcgg cgtcgtcagc cggtggtagt tttctaacgc tgaggtgtac ttcgacatcg 3587281 ccgacaccac gtcgacgccg cgacgttcca acgcattgcg cagcagctcg tagtgcccca 3587341 tctcggcggc ggccatggat gccatcgaga tccttccccg cagatccggg gccatgcgcg 3587401 cctcatcggt caatcggtag aaggcggcaa cttcgccgta ggccagcaac gcgaacaatt 3587461 cgttgacgcc gggatgatcc gccggcagcc gtggcctggg tgaatcggcc acctgatcgg 3587521 cggatgaggg cgatggcatg gcaacactct agtaggcagg ctcagcggca aatgggaacc 3587581 tgctggccga ccagctatca tgctcgttag gtggcggcat tggttcgact gccgctaccg 3587641 gcgaaatgtg cgtgcatgga gtctgccccg cctggactgt gctaggggcc ggcgactcgg 3587701 cgacgtaatc ggagtcggaa ctcatgcgcg cgtgaaccgc gacagagaaa caccgacaca 3587761 cgaccgacac cgtcaccgaa aggccgctta ccctcgtatg accgcagtga aacacacaac 3587821 tgaatcaaca tttgccaaac ttggagtccg cgacgaaata gtccgcgcat taggggaaga 3587881 gggcatcaaa cggccctttg ctatccagga actcaccctg ccactcgcgc tcgacggcga 3587941 ggacgtgatc ggccaggccc gcaccggcat gggcaaaacg ttcgcttttg gcgtgccgct 3588001 gctgcagcgc atcacctccg gcgacggcac gagaccgctc actggcgctc cgcgggccct 3588061 ggtcgtagtc cccacccgcg agctgtgtct acaggtcacc gatgacctgg ccacggcggg 3588121 caagtacctg accgccggcc ccgacacaga cgacgctgcc gcggtacggc gccggctgtc 3588181 ggtggtgtcc atctacgggg gacggcccta cgagccgcag atcgaggcgc tacgcgccgg 3588241 cgccgacgtc gtggtcggca ccccgggtcg gctgctcgac ctgtgccagc agggccacct 3588301 gcagctgggc gggctatccg tgttggtgct cgacgaggcc gacgagatgc tcgacctggg 3588361 cttcctgccc gatatcgagc gaatcctgcg gcaaattccc gccgaccgac agtcgatgtt 3588421 gttttcggcg accatgccgg acccgatcat cacgctggcc cgaacgttca tggtccggcc 3588481 cacgcatatc cgggctgagg caccacattc ctcagcggtt cacgacgcga ccgagcagtt 3588541 cgtctaccgc gcccatgcgt tggacaaagt ggagttagtc agccgggtgc tgcaggctcg 3588601 tgaccgcggc gcgacgatga tcttcacccg caccaagcgg accgcccaga aggtcgccga 3588661 cgagttgacc gagcgcggtt tcgcagtcgg cgccgtgcac ggtgatctcg gacagctggc 3588721 acgcgagaag gcgctcaagg cgtttcgcac tggcggcatc gacgtattgg tggccaccga 3588781 cgtggccgcc cgcggcatcg acatcgacga cgttacccac gtgatcaact atcagtgccc 3588841 cgaagacgag aagatgtacg tccaccgcat cggtcgcacc ggccgtgccg gccgaaccgg 3588901 ggtcgcggtc accctggtgg actgggacga gctgccccgt tggagcatga tcgaccaagc 3588961 actgggcctg ggctcccccg atccggccga gacatactcc aactcgccgc atctgtatgc 3589021 cgagctggcc atcccggcca cggccggcgg taccgtcggc ccggcgcgca aatcgcaggg 3589081 caggcgacgt gacaccgact gcgacggcca gaaaacggca cagcacgccc gcaatacccc 3589141 caggcgtcgg cgcacccgcg gcggcaaacc cgtcaccgga caccccggca ccaacccaat 3589201 cagcagccca atcgtgggcg gcgacgccac ctcggagccg ggctccggca ccgcatcaga 3589261 ttccgggtcc gatgttgtgt ccggctcccg gtccggcaac ggcgaagctg cgcgacgccg 3589321 tcgtcgccgc cgccgacgcc cgacgcacgc ccaggacggc ttcgccgcgc gggctaactg 3589381 acccgcccac cgcatggtta aaccggagcg ccgcaccaag accgatatcg cggccgccgc 3589441 gacgatcgcg gtcgtggtgg ccgtggccgc gtcgttgatc tggtggacca gcgacgcccg 3589501 cgccaccatc agccggccgg cggcggttgc ggtgcccacc ccggccccgg ctcgcgaggt 3589561 cccgacctcg ctgaagcagc tgtggaccgc cgccagccca gccacccgcg ttcccgtggt 3589621 ggtgggcgga acagtggcta ctggcgacgg acgccaggtg gacgggcgcg acccagccac 3589681 cggtgagtcg ctctggagtt acgcccgaga caccgatctg tgtggggtga cctgggtcta 3589741 ccactacgcc gtcgcggtct atcggtacga ccggggttgc ggtcaggtca gcaccatcga 3589801 tggatccacc ggtcgccggg gagccgcccg cagcggctac gcggatccgc gggtgcgtct 3589861 tttttccgac ggcaccacgg tgttgtcggc cggggacacg cgcctggaac tgtggcgttc 3589921 agacatggtc cggatgctgg cctacggcga gatcgatgcc cgggtgaaac cgtcgaaccg 3589981 cggcctgcag tccgggtgca cgctggagtc ggcggcggcc agctcggcgg ccgtatcggt 3590041 gcttgaagcg tgtacgaacc aggctgacct gcggcttgtg ctgttacgcc cgggcaagga 3590101 ggacgacgag cccatccagc gcattgtccc ggaaccgggg gtccggccgg gttcgggcgc 3590161 ccgggtattg gtggtatcgc agaacaacac cgccgtgtac ctgcctgcaa gatcaggcgc 3590221 gcaaccgaga gtcgacgtga tcgacgagac cggcgccaca gtttcgagca cgctgctggc 3590281 caagccaccg tcaacttcgg ccgtggcgtc gcggaccggc aacctggtga cctggtggac 3590341 gggcgacgcg ttgttggtct tcgacgcggg caacctgacc cagcgctaca ccattgccgc 3590401 tggcgagacg actgcgccgg tggggccagg ggtgatgatg gcaggtcaac tcctggtgcc 3590461 ggtcaccggc gggatcggtg tctatgaccc ggtcagcggt gccaacaacc gttatatccc 3590521 ggtgacccgg ccgccaagca cgtcagcagt gatcccggca gtttctggat ccagggtcat 3590581 tgagcaacgt ggcgacacac tagtcgctct gggttgatcg cctatgttgg cgcgagcaga 3590641 cgcaaaatcg cccgaaaccg atggctttcg ggcgattttg cgtctgtcgc gctacaggtc 3590701 caccgtgaag gtgggcagcg gcctacctgt cttccagtgt ttgagcagcg cctgcgccag 3590761 ctcgcggtag gccaccgcgc ctttgttctt gcgcccagcc atcaccgacg agcccgaggc 3590821 gctggcctca gcgaagcgca cagtacgggg gatgggcgga gccagcacct gtaggtcgta 3590881 gcggtcggcg acatcgagca acacgtcacg ggtgtgggtg gttcgagagt cgtacagcgt 3590941 cggcagtgca cccaacaacc gcagattcgg attggtgatc tgctggacat cggcgaccgt 3591001 ccgcagaaac tggccgacac cccggtgcgc cagcatctcg cactgcagcg gcacgatggc 3591061 cttgtcggcg gccgtcagcc cgttgagggt gagcacaccc agcgacggcg gacagtcgat 3591121 gatgaccacg tcgaaccggt cggagaattt ggccaacgcg cgtttgagcg cgtactcacg 3591181 gcctgcccgc atcagcagca ttgcctcggc gcccgccaag tcaatgttgg ccggcagcaa 3591241 cgtcattccc tccatggtgg tgaccagcac ggcgttgggc tcgacttcac cgagcaacac 3591301 ctcgtgcaca gacaccggta gtttgtcggg atcttgacca agggagaagg tcagacaacc 3591361 ttgcggatcc agatcgacga gcagcacgcg ccgtcccttt tccaccatcg ccgcaccgag 3591421 cgaggcgacc gtagtcgtct tggccacccc gcccttctgg ttggccaccg ctagcacccg 3591481 ggtatcagtc ataggcgccg ctctcccccg caagcggcag ggacccccac ctcatcgtgc 3591541 tctcccttcg tcgtcgcccg cgcagtcaca gtgtcatcct ggcatgctgc tcgcacagtg 3591601 gttcgggcga caggcctagg atgtcgtcgg gcacaatctg tcggtatggg cgtgcgcaac 3591661 caccgattgc tactgctccg ccacggcgag accgcttggt cgacgctggg ccggcacacc 3591721 ggcggtaccg aggtcgagct gaccgatacc gggcgaacgc aggcagagct ggctggtcag 3591781 ctgctgggtg aactcgaact tgacgacccg attgtcatct gtagcccgcg tcgacggacg 3591841 ttggatactg ccaagttggc cggcctgacg gtgaatgagg taactgggct gctcgccgaa 3591901 tgggattacg gttcctatga gggccttacg acgccgcaga tccgggaatc cgaacccgat 3591961 tggctggtgt ggacgcacgg ctgcccagct ggagaaagcg tcgcacaggt aaacgatcgc 3592021 gctgacagcg ccgtcgcgct ggccctggag cacatgtcct cacgcgacgt gttgtttgtc 3592081 agccatggcc acttctcccg cgcggtgatc acgcgctggg tccagctacc gctcgccgaa 3592141 ggcagccgtt tcgcgatgcc caccgcctcg atcgggatct gcgggttcga gcacggcgtg 3592201 cgtcagctcg ccgtgctcgg gttgaccggt catccgcagc cgatcgcagc cgggtgagcg 3592261 cacacgtggc aaccttgcac ccagaaccac cgttcgcact gtgcggacca agaggcaccc 3592321 tgattgcccg cggggtgcgg acacgatact gcgacgtgcg ggccgcgcaa gcggcacttc 3592381 gctcaggtac agcaccaata ctgttgggcg cgttgccttt cgacgtgagc agacccgccg 3592441 cattgatggt gccggatggc gtgctgcggg cccggaagct gcctgactgg ccgaccggcc 3592501 cgctgcccaa ggtacgcgtc gccgccgccc ttccgccacc tgccgactac ctgacccgga 3592561 tcggccgcgc acgggatctg ctggccgcct tcgacggccc gttgcacaaa gtggtgctcg 3592621 cgcgcgccgt gcaactgacc gccgatgctc cgctggacgc gcgggtactg ttgcgcaggt 3592681 tggtcgtcgc cgacccgacc gcttacggct atctcgtcga cctcacctct gcgggcaacg 3592741 acgacaccgg ggcagccctg gtcggcgcca gcccagagct tctggtcgca cgatccggca 3592801 atcgcgtcat gtgcaagcca tttgccggct cagccccacg cgccgccgac cccaaactcg 3592861 acgccgccaa cgcggccgca ctagccagtt cggccaagaa ccgacacgaa caccaattgg 3592921 tcgtcgacac gatgcgggta gccctagagc cactatgcga ggacctgaca atcccagccc 3592981 agccccagtt gaaccgcacc gcagccgttt ggcatctgtg caccgcgatc accggccggc 3593041 tgcgcaacat ctcgacgacg gcaatcgatc tggctttggc gctacatccc accccggcgg 3593101 ttggtggggt cccgacaaaa gctgccaccg agctcatcgc cgaactcgag ggcgaccgtg 3593161 gcttctacgc cggcgcggtt ggttggtgcg acggccgggg cgacggccat tgggtggtgt 3593221 ctatccggtg cgcgcaactt tcggctgatc gacgcgcagc ccttgcgcac gctggcggtg 3593281 gcatcgtcgc cgaatcagac cccgatgacg aacttgaaga aaccacaacg aagttcgcca 3593341 cgatattgac cgcactggga gttgagcagt gaccgatacc atccgccgcg ctacaccggc 3593401 ggataccgcc gacatcgtgg ccatgattca cgcgctgggc ggaattcgag tatgccgccg 3593461 atcaatgcac tgtcaccgaa acacaaatac atacagcact tttcggagat ttcccgacga 3593521 tgcgaggcca cgtcgctgag gttaatggcg gagttgccgc gatggcgctg tggtttctga 3593581 acttttccac ctgggacggc gtcgcgggca tctatgtgga ggacttgttc gtctggccga 3593641 ggtttcgccg ccgcggcttg gcccgtggcc tgctgtcgac gctggccaga gaatgcgtcg 3593701 acaaccgcta cacgcggttg gcctggtcgg tgctgaactg gaattccgat gcaatcgcac 3593761 tgtatgaccg catcggcggg caaccgcagc acgagtggac tatctatcga ctgtcaggac 3593821 cgcggttggc tgcgctggcc gcaccacgct gatcacgccc ggcggcccag cggatcgaag 3593881 gcggactgaa cagcaatacc agcacgccaa gcgcgatgat tcccaccggg atcccgatcg 3593941 ccggctgatg cgaacccaca atcagatacc acgccaccgg cagcagcagc agctgggcga 3594001 acaccgccag cccgcgaccc caaagcttgc caaccgccag cctgcatccg gcggcgagca 3594061 ctgctccgcc gaccagtacg aaccaacctg cggtgcccag gccattgacg atgtgctggt 3594121 cggcgcccgc gagtccgcgc accagcaacg ccgcggccac caccagggcg gccccaccct 3594181 gcacggcgac gatcagtccg gcgccgcgca cggcggccgg ggctcgaaca ggcacagcat 3594241 cagcgtagtc acccggccgt gaccggcccg catcgtcaca ccacccaggc ccattgccgt 3594301 cctcctcaac gggccgaccc ggcccgcatc gtcacacggc ctaggcccat tgccgtcctc 3594361 ctcaacgggc cgacccggcc cgcatcgtca cacggcctaa gcccattgcc gtcctcctca 3594421 acgggccgac ccggcccgca tcgtcacacg gcctaagctc gtgcgtcatg cgtgcagtgc 3594481 tgatcgtcaa ccccactgcg accgccacca caccagccgg ccgcgacctg ctggcgcacg 3594541 ccctcgaaag ccgccttcag ctcacggttg agcacaccaa ccaccgcggt cacgggaccg 3594601 aactcggaca ggcggcggta gccgacgggg tggacctggt cgtggtgcat ggcggcgatg 3594661 gcacggtaag cgccgtagtc aacggcatgc tggggcgccc cggcacgacg ccggtccgac 3594721 cggtgccagc cgttgcggtt gtgcccggcg gctcggccaa cgtactagct cgcgcgctag 3594781 ggatttccgc ggacccgatc gctgccacca accaactcat ccagctgctc gacgactacg 3594841 gccgccacca gcagtggcgc cgcatcgggc tgatcgactg cggtgagcgg tgggcggtgt 3594901 tcaacgccgg catgggcgtc gacgccgagg tcgtggccgc ggtagaggcc gaacgcgaca 3594961 aaggcggcaa ggttacggcg tggcgctata ttcgcgctgc ggtgcgcgcg gtgctcgcct 3595021 gcactcgtcg cgaaccggct cttacgctgc aacttcccaa ccgcgatcca attaccggag 3595081 tgcactttgt gttcgtgtcc aactccagtc cgtggactta cgcaaacaac cggccggtat 3595141 ggaccaatcc cgactgcagg ttcgagtcgg ggctgggagt gttcgccacc accagcatga 3595201 aggtggtccc gaccctgagg gtggttcggc agatgttcgc aaaacagccc aagttcgagt 3595261 tcaaccacgt catcaacaac gacgacgtcg cgtgtctacg cgtcacctcc atggggcccc 3595321 cgatcgccag ccaattcgac ggggactacc tcggcgtgcg cgagacgatg acgttccgag 3595381 ctgttcccga cgccctcgcc gtagttgccc cgcccgcaag aaagcggatc tgagctgcag 3595441 aaacaaagat gtgatgggtg tgcgacacaa acgttgggcg aaactggcag cgtagtgtag 3595501 tacaactggg taagggctgt ggaacgagat cgccagagtg agatagccca cgcgcttacg 3595561 taacactatt gacatctgtt gagcctgtga aacgatcaaa aggttgcatg tagagaaatg 3595621 taggggtaca gaagcctttc ttgtgcaccc gttaccagcc aagaagaaac gcctgtgcgt 3595681 accgctgcgc acatagtgag gagtaacgac taatggattg gcgccacaag gcggtctgtc 3595741 gtgacgagga tccggaactg ttcttcccgg taggaaacag tggtccggca cttgcgcaga 3595801 tcgctgacgc gaaactggtc tgtaatcggt gcccggtcac cacagagtgc ctcagctggg 3595861 cactgaatac cggccaggac tcgggcgtct ggggaggcat gagcgaagac gagcggcgcg 3595921 cgctgaagcg tcgcaacgcc cgcacgaaag cccgtaccgg ggtctgacga ctcagttctg 3595981 cacagtgcgg ccccgacata cgtcggggcc gcactgttgc gtagcgcgct acagcatcaa 3596041 ccgtccccgg cgtccgaccg gtacccgtag caccacatcg gtgccacgtt cgcgggcgtc 3596101 ccgcatacct aacgagccgt ccaattccgc agagaccaag gtccgcacga tctgcaggcc 3596161 caggctgtcc gacttctcca ggctgaaacc ttgcggcaga ccaagcccgt cgtcgtgcac 3596221 gacgacatcg agccaacgcg cagagcgttc cgctcgaatc gtcacggacc cttccgccgc 3596281 cgccgggtcg aacgcatgct cgatcgcgtt ctgcaccagc tcggtgatca ccatgatcag 3596341 cgccgtggcg cggtcggagt cgagcacacc gaggtcgcca acccgattta tccggatcgg 3596401 cctgtccacc gatgccacat cgttcatgat cggcagaatc cggtcgatga cctcgtcaag 3596461 gttcacctgc tcgtccaccg acatcgacaa cgcatcgtgg accaaggcaa tcgacgacac 3596521 tcggcgcacc gactcgatca gcgcttcccg cccctcggcg ttggacgtcc ggcgagcctg 3596581 cagccgcaac agcgcggcca ccgtctgcag gttgttctta acccgatgat ggatttcccg 3596641 gatcgtggcg tccttggata tcagggctcg gtcgcgccgc ttcacctcgg tcacgtcgcg 3596701 gatcaatatc gcggcgccga cattgcgacc agctaccacc agcggcagag tccgcagcag 3596761 caccgtggcg ccgccggcgt cgacctccat ccgcataccc tttccatccc cggccagcaa 3596821 gtcctgcaca tgctcgtcta cctcgtgcgc ctcgaacggg tccgagatca gcgggcgcgt 3596881 cgcgtcaatg agattgacgc cctccaactc ggtggtcaaa cccattcggt ggtaagccga 3596941 tagggcattg gggctggcgt aagagaccac accgtcgaca tcgagacgga tgaagccgtc 3597001 acccgcgcgc gggctagatc gcgacatcgc cacgtcccct gcgtcgggaa aggtgccctc 3597061 cgccagcatc cggagaagat ctgtggcgca caaccgatag gcggtctcca ggtggccgga 3597121 tctacgtcgc gccgccagtt cgggttgatg ccgtgtcagc accgccacca cctgatcgcc 3597181 aaagcgcacc ggggagactt cgacactgtg gccgtcgtgt tgacatgaat tctgttggcc 3597241 gacagcgcct tcccgtcccg ggacaccacc ggagaaggtc gcggcgacca gcggcatgct 3597301 attggcggcg acgacggtgc ctaccgcgtc ggtatgcacc accgtcggcc cggtgttcgg 3597361 ccggcattgc gcaacgcaca ccaggacacc gtcgtcgcgg cgaacccaca tcaggtaatc 3597421 ggcaaacgac aagtcggcaa ggagctgcca ctccccgacc accgcatgca ggtggtccac 3597481 cgcgctgccc ggcagcaccg tgtgttcggc gagcagatca ccgagtgtgg acatgagtga 3597541 ctatcaacga ctagctgatc accgcgataa ggtcgccggc ctgaatgaca tcgcccaccg 3597601 ataccgccac cttgctgacc gttccggcag cttcggccag gacggggatc tccatcttca 3597661 tcgactccag cagcaccacg acgtcgccct tgtcgatctg atcgccttcg ttgacaacga 3597721 cttcgagaac gctggccacg atctcggcgc gaacatcctc ggccatcatc accccactct 3597781 tttcggccat gccgtatgct gactgctggt catcggactt ccatcaaact caggtatatc 3597841 gaaccataag aaccctgggg agcgcggcac gcgggctatt ggggtcgcgc gcgacgccgc 3597901 atgagaaact gggcaatgac cgggcggccg ctgcctgccc gcacctgagc aatgacggag 3597961 gttccgatgg ccaagcgtgg ccgtaagaag cgtgaccgca agtacagcaa ggccaaccac 3598021 ggcaagcggc ccaattccta acgcactgcg ctagggccct ccacggatga tggtggtccg 3598081 gcggatctct agccgaagac gctcccgcaa gccctcgggg gccctgtcgc ctcggcactt 3598141 ggtcccgatc aacgccttga tccgttcctc gagcccgtaa tgcctcaggc accccgggca 3598201 ggcctcgagg tgtcgccgca gcctctcgcg ggtttccggg gtgcattcac cgtcaagcag 3598261 ggtccacacc tcggcgatca cttccgcgca acccatgccg ccgtgggaat cgtcgtggtc 3598321 cgcgtgcgca tcggtcggac cgcaattttc gctcactggt gcaccatcct tgtgtcggtg 3598381 atctcggatg gattgccgat gtagaggcgc cgctgggtta gcgccccgcg cgcttgacag 3598441 ccgtgatgtc catcatgagt tttgcggagt ccggcggttg ccccggacgc gccgaccgtc 3598501 gacagggcca agcgccgacg agcgccgaac gactcgcccc gcacgccgac gcccagcccg 3598561 aattgctggc ctgcttggcc ggcgtcgccc gctccaccgg ctagtccgac aaagtcaccc 3598621 acgtcgggtt cggttgggcg gcagacaaac aactccgcaa cggtgtctgc gacttcgccg 3598681 gcgacagccg ccgagccaac ctctaggccg ccgacctgca caaccgcacc cgagcccgcg 3598741 gccacgacca ccggcacgcg gtacgcatcc tcgcccgcgc ctggctttac gtcatctgac 3598801 accgctggca agacggcatc gcttacgacc ccacccaaca ccgagccctg caggctctcc 3598861 ttgaccaagt tcgccaaacg gcggcttgac accgggctgc tcatgacgac accccctcgt 3598921 gcgcctgctc gcccctggca aacccccgat ccctggccac atcggctaaa agaccgcgca 3598981 actgacgtcg gccgcgatga agcctcgaca tcacggtgcc gatcggagta tccatgatct 3599041 cggcgatctc cttgtagggg aaaccttcga catcggcgta gtagaccgcc atccggaact 3599101 cttccggcaa tgcctgcagc gcctctttga tctcggtgtc cggcaacgct tctaacgctt 3599161 cgacttcagc cgagcgcagc ccggtcgagg aatgctcggc gttggacgcc agttgccaat 3599221 cggtgatctg ctcggtcgga tactccgccg gttgccgctg tttcttgcga tagctgttga 3599281 tgtaggtgtt ggtcagtatc cggtagagcc aggccttgag attggtaccg tgccggaacg 3599341 aacgaaatcc cgcataggcc ttcaccatcg tctcctggag caagtcctcg gcgtcggccg 3599401 gattgcgcgt catccgcagc gcaccgccgt acagctggtc caacagggga atcgcgtcgc 3599461 gctcgaaacg cgcggtcaac tcctcgtctg tctcctcaga cggcccaggc tgcagacccg 3599521 ccgaaccggt tacaccatcg atgtcggcca tcttgattaa ctgggtccct tcgtttgcgg 3599581 tgtcgccgga cagcaccggc gcggacaccg gacgtgcgag catgcgagcc aaccgcttct 3599641 cacccaacag gctcgtcgcc gttgacacca gactcccctc gtcccaatgt agaggccgcg 3599701 accgacactg tctgcaccgg tctggccagc cacgtggctg caggaaccga accaatcaac 3599761 cgtgttcgcc agcgggttat ttccagcgct gaatcgcatg cggcctgtcc cgcagtccgg 3599821 tggaatcgag cagggcgtta gggtgacgcc atgtcactca acggcaagac catgttcatc 3599881 tctggcgcca gtcgcggtat cggccttgcg atcgccaagc gggccgcgcg cgacggcgcc 3599941 aacattgcct tgatcgccaa gaccgccgag ccgcatccaa agctgccagg cacggtgttc 3600001 acggccgcca aggaactcga ggaagccggc ggccaggcac tgccgatcgt cggggatatc 3600061 cgcgacccgg atgcggtcgc gtccgcggtg gccaccaccg tggagcagtt cgggggcatc 3600121 gatatctgcg tcaacaatgc ctcggcgatc aacttagggt ccatcaccga ggtgccaatg 3600181 aagcgtttcg acctgatgaa cggcatccag gtgcgtggca cctacgcagt atcccaagcg 3600241 tgcattcccc atatgaaagg ccgtgagaac ccgcacatcc tgacgctgtc cccgccgatc 3600301 ctgctggaga agaagtggct gcggccgacg gcctacatga tggccaagta cggcatgacg 3600361 ctgtgcgcgc tgggaatcgc cgaggagatg cgcgccgacg gcatcgcgtc gaacacgttg 3600421 tggccacgca cgatggtggc caccgcggcg gtacagaacc tgctgggcgg cgacgaggcg 3600481 atggcgcggt cccgcaagcc cgaggtatac gccgacgcgg cctacgtcat cgtcaacaag 3600541 cccgccaccg aatacaccgg caagacgctg ctgtgcgagg acgtgctcgt cgaatccggc 3600601 gtcaccgact tgtcggtcta cgactgcgtc ccaggtgcga cgctcggcgt cgacctgtgg 3600661 gtggaagacg ccaacccgcc ggggtacctc ccggcctagc gacagcaaaa ccctgatcct 3600721 cgagttgccc gacgagcggg ccgtcgcgat cgtgccggtg ccgtcgaagt tgtcgctgaa 3600781 ggcggccggc ggccctaggg gtgcccaaag cggccatggc taaacccgct gccgccgaac 3600841 aagccaccgg ctacgtggtc ggcggcatct ccccgttcgg tcagcgcaag cggctgcgga 3600901 ccgtggtcga tgtgtcggcc ttgagctggg accgggtact gcggtgccgg caaacggcat 3600961 tgggccgtca cggtggcccc gccggacctg atcaccttga tcagcgcgat catcgctaac 3601021 atccgggcct agcgccgtac cggaaatcgg cgaggacttc accgatggcg tagcgcgcgc 3601081 tggccgccag cggcgggttg gtgtcttggt agtacgggag cgcgatcaag gcgatggcca 3601141 gagctctgcc gcgcccgcgc atccagtcgt cgtcggcggc gccgaccgcg acgcggaact 3601201 gagcacgggc gggcgccgac aggaggttcc acgcgatgat caagtcgacg ctggggtcac 3601261 cgacgcccat cagaccgaag tcaatgacgc ccgtcaagcg tccttgcgct gtcaggatgt 3601321 tgaaccggga caggtcaccg tggaaccaca tcggcggccc cgcatacgga ggaacgcgta 3601381 gggctgattc ccacgcggca gttgccgcgt ggacgtcgat gatcccgtcg agggccgcca 3601441 gcgctgcgcg tacctcggca tcctgctccc ccagcggcgc accccgcttg gcgggcggcc 3601501 cgcccatggg gtcggtggcc cgtaaggcgg tgatgaagtc agccaggtcc tcgacggccc 3601561 gattgggctc gacgaactcg gctgccgacg ggttctcacc cgcaacccag cggcacactg 3601621 accacggcca accgaacccc tcagccgggc tccccaaccc caccggaact gggctggcaa 3601681 cgcctagatg cgcagcgatc cgcggcagcc actgttgctc ggtccgaagg ctctcgatgg 3601741 cccagccaat gcgcgggatg cgcacggcca ggtcctcgcc tagccggtac attgcgttgt 3601801 ccgtgcccgc cgagcgcacc ggtgcaatgg gtagatccgc ccactgtggg aattgtgcac 3601861 gcagcagacg ccgcaccaga tcctcgtcga tatccacctc atcggcgtgc atctttgccc 3601921 ttaggacacg ttcgtaccgg tcgaagacgg ttccgtcctg ctcacagatc cgccgcacga 3601981 aagcaaagcc cgcccgcaac gccaccctcg ccgatgcgga gttctccggc tccaccttga 3602041 tcaccgcttc ggtcgcgccg tgttcggccg catactggca caccagatcg actgcgcgag 3602101 tggcgagtcc acgccctcgc cagctggggt agagcccata ggcaacgttg acctgcccgc 3602161 tagccagccc ctcgccgtcg aaacgcagat caatcgtacc cactattgtt tcggcaaccg 3602221 tcctgatgcc gaaagagcgc agcggcccgc cggtcaccca ttgctcgcgg cagtgccgga 3602281 tgtacgcttc gacgcttgct cgagtcgagg gcataccgct aagccaacgc actagccgtt 3602341 cgtccccccc agccagatgc gcatcgacat cgtccaggca cagtggcgat agagtgacga 3602401 tcccgtctga tagcccgtcg gacagcttcg caaagcgcac cccgcgattg tcggactcac 3602461 actggcttca ggcaaacctg ccgcgagcgc ccggcgagcg taatggcgcg gcaagaaatc 3602521 gcgcttggat tcgccgcagc gtcacacgcg tgggcacaga ccctcacagc agctggatct 3602581 gctcgggctg cgacctggcc ggctccaaca gctcaggccc gttgttgcgc acgttgttga 3602641 ccaacgtgga cacttggcgc agcgcgatgt cgcgcacatc cggcgggcgg gccagcagct 3602701 caggatccgg cggggcgtct ggattcagcc agtcgtccca gtcctcttcg gccagcagca 3602761 gcggcatccg gtcatggatc tcggccagct cgcccacggc atcggtggtg atcaccgtgc 3602821 agctcagcag cggtggggcg gacctgtaag acttccaaac cgaccacagc ccggccgtga 3602881 acaacagggc gccgtcgtgg cggtgcagga agaacggcgt cttggcgttc ggcctccccg 3602941 gggtggcgtc ggggtcgacg cgccattcgt accagccgtc catcggcacc aggcaacgct 3603001 tacttctgac cgcactccgg aacgccggcg acgtggcgac cttatcggcg cgggcgttga 3603061 tcagcggtgg gcctttggca tcgggtgcgc cgccgggccc ggccttgatc cacgacggaa 3603121 tcagtcccca gcgcatgagc cgcacccggc gggtgggctc gtcgtcgggc tcgctgtggc 3603181 gggacaccac tgtcgcgatc gtgtcggtgg gtgccacgtt gtagctcgtc ttcccgccac 3603241 cgcacccggt ggcctcgtct atggccgtga ttttctcggc cagctgggcc ggatcagtgg 3603301 tgaccgcaaa ccgtccgcac atgcttccta tggtgcctgg tacccacgac acccgccgac 3603361 acggcaggat gaagcggtga agacatggcc agccccaacg gcgccgacgc cggtgcgcgc 3603421 taccgtgacc gttccaggct cgaagtcgca gaccaaccgg gcgctggtgc tagcggcgct 3603481 ggcggccgca caaggccggg gcgcatcgac catctccggc gcgctgcgca gccgcgacac 3603541 cgaactgatg ctggacgcgc tgcagaccct gggcctgcgc gtcgacggtg tgggttcgga 3603601 actgacggtc agcggccgaa tcgaaccggg gcccggcgct cgggtggact gtggcttggc 3603661 gggcacggtg ttgcggtttg ttccgccgct ggcggcgctg ggctccgtcc cggtcacctt 3603721 cgacggcgat cagcaagccc ggggacggcc catcgcaccg ctgctggatg cgctgcgcga 3603781 gctcggcgtc gccgtcgacg gcaccggtct accgtttcgg gttcgcggca acgggtcgct 3603841 cgccggcggc accgtggcca tcgacgcgtc ggcgtcctca cagttcgtgt ccgggctgct 3603901 gctgtccgcg gcatcgttca ccgatggcct gaccgtccaa cacaccggtt cgtcgctgcc 3603961 gtctgcgccg cacatcgcga tgacggcggc gatgctgcgg caagccggag tcgacatcga 3604021 cgactcgaca ccgaaccgtt ggcaggtgcg ccccggtccg gtggcggcgc ggcgctggga 3604081 catcgaaccg gacctgacca acgcggtggc tttcctgtca gcggccgtgg tcagcggcgg 3604141 caccgtgcgc atcaccggct ggcctagagt cagcgtgcaa cccgccgacc acatcttggc 3604201 aattttgcgg cagctcaatg ccgttgtcat tcatgctgat tcatccctcg aggtgcgcgg 3604261 tccaacggga tacgacgggt ttgacgtcga cttgcgcgcc gtcggcgagc tgacgccatc 3604321 ggtcgcggcg ctggcggcgc tggcatcccc gggatcggtg tccagactaa gcggcattgc 3604381 ccatctgcgg ggccacgaaa ccgaccggct cgccgcgctg agcaccgaga tcaaccggtt 3604441 ggggggcacc tgccgggaaa cacccgacgg tctggtgatc accgcgacgc cgttgcggcc 3604501 cggcatctgg cgggcatacg cggaccatcg aatggcgatg gccggcgcga tcattgggct 3604561 gcgggtggcc ggagtcgagg tcgacgacat cgccgccacc accaagacgc tgccggagtt 3604621 tccgcggctg tgggccgaga tggtcggacc cggccagggg tgggggtacc cccagccgcg 3604681 cagcggccag cgggcgaggc gggcaaccgg gcaggggtcc ggcggttgag gcccggcgac 3604741 tacgacgagt ccgacgtcaa ggtgcgctcc ggcaggagtt cgcggccgcg gaccaagacc 3604801 cgtcccgagc acgccgacgc ggaggccgcc atggtggtca gcgtcgaccg cggccgctgg 3604861 gggtgtgtgc tgggcggccg ccccgatcgc cgaatcacgg cgatgcgcgc ccgcgagctc 3604921 ggccgcaccc cgatcgtggt cggcgacgac gtggacgtgg tcggtgacct gtccgggcgg 3604981 cccgacaccc tggcccgcat cgtgcggcga gcaccgcgac gaaccgtgtt gcgacgcacc 3605041 gccgatgaca ccgaccccac cgagcgggtg gtggtcgcca acgccgacca actgctgatc 3605101 gtggtcgcgc tggcagaccc gccgccacgc accggcctgg tcgaccgggc gctgatcgcc 3605161 gcctacgccg gcgggctgac cccgattctc tgcctgacca agaccgacct cgccccggcg 3605221 gaaccgttcg gcaagcagtt cgccgacctg gaattgaccg taaccgccgc aggcgtcgat 3605281 gatcctctgc tcgcggtggc ggacctgctg gccggcaaga tcaccgtcct gctcgggcat 3605341 tccggggtcg gcaagtcgac attggtgaat cgtcttgtac ccgaagctga tcgggcggtt 3605401 ggtgaggtca ccgagatcgg ccggggacgg cacacgtcga ctcggtcggt ggcgctgccg 3605461 ttgggagata cgctgtccgg ttccggctgg gtgattgaca ccccaggaat ccgctcattc 3605521 gggttggctc atatccagcc cgacaacgtg ctattggctt tctctgacct cgccgaggca 3605581 acccgcgagt gtccgcgcgg gtgcgggcac atgggaccgc cggccgatcc cgaatgcgcg 3605641 ttggatacct tgtccgggcc cgctgcccgc cgcgccgcgg ccgcccggcg actactggca 3605701 gtgctcagcc agacttgact agccgcatgc tcgtcgcgcg ccgagcaatc ttaggctgcc 3605761 agatcgtcgg gttcggtgac cgacttagcc atacgcttgc tgcgccgccg accccgcacg 3605821 gcggcaatcg cggtctttaa cccccgacga cgtccggtca ccggatcggc gcccgcgaaa 3605881 cccggcccca gaccagcgaa catccgctca ctgcgggtct cgggtgcatc gtcagcgttg 3605941 tcacgtaagt acttatccgg caacgacagc ttggcaaggg tgcgccaggt cttgccgtac 3606001 tgcaccaaga acgagcccgt ggtgtatggc aagtcgtatc tgtcgcagac ctcacgcacc 3606061 cgcaccgaaa tctcgtgaag ccggttgctc ggcaggtccg gatagaggtg atgctcgatt 3606121 tggtggcaca gattgccgct catgaaccgc agcgccggcc cagcgttgaa gtttgcgctg 3606181 cccagcatct gccgtaggta ccactggccc ttcggctcac cgatcatgtc cgtcttggtg 3606241 aatttctctg cgccatccgg gaaatggccg cagaagatca ccgcgttgga ccacacgttg 3606301 cggatcacgt tggccaccac gttggcggtc aaagtggacc gatacgtcgc ccccggggac 3606361 aacgaggtca gcgccgggaa cgcgacatag tccttgaaca cctggcggcc cgctttggct 3606421 gagaattcac gcaaccgggt tttagcggcc tcgcggtcgg cccgaccctt gaagatcttg 3606481 ccgatctcca agtgctgcag cgcaactccc cactcgaagc cgatcgcaag gatggtgttc 3606541 cacaccacgt tgaagatgtt gtagcgcttc cagcgctggt cacgggtgac gcgcagcatg 3606601 ccgtatccga cgtcgtcatc cataccgagg atgttggtgt atttgtggtg cacgaagttg 3606661 tgggtgtagc gccagtgctt ggacgatccg ctcatgtccc actcccacgt cgaggagtga 3606721 atctccgggt cgttcatcca gtcccactgg ccgtgcatga cgttgtggcc gatctccatg 3606781 ttttcgatga tcttggccac gccaagggtc agggcacctg tccaccaggc gaggcgtcgt 3606841 gagctgccag ccagcagtag ccgaccggac acctcgagcg cccgctgtgc ggcgatggtg 3606901 cggcggatgt agcgggcatc gcgttcgccg cgcgattctt caacgtctcg gcggatggca 3606961 tctagctcgg cggccaggtt ttcaatgtcg gcgtccgtca gatgcgcgaa tacgtcgacg 3607021 tcagtgatcg ccatcgtctt ctccctgcgt catacggccg atgacctacg ctatcgtaac 3607081 ttacgattcc gtaggttacc tatgagtaac actagatgtc cagcacgcaa tcacccgagg 3607141 cggccgacac gcaggtctgg acccgggttc cgggctcatg ccgctggccc gtgcgcagat 3607201 cccgaacatg gccttccacc aggtcgacca cacacgactg gcagatgccc atccggcagc 3607261 cgaagggtag ctgcacgccg gcgccctcac cggcgtccat caacgacgtg gcagcatcgg 3607321 cggctacgct cttgccactt cgggcgaacg tgacggtccc gcccgctcca gcgggcgccg 3607381 ttttggacac tgcgaaccgc tccaggtgca gtcggtcgct ggcacccgcc gatgaccaga 3607441 ccttgtcggc ctggttgagc acgccctccg gcccgcacgc ccaggtctgg cgttcacgcc 3607501 agtccggcac ctgctgaccg atccgggtca ggtccagccg gccctgggcg cgcgtctcgc 3607561 gcaccgacaa ccgataaccg ggatggtcgg ccgccagggc agccagctcg gcaccgaaca 3607621 tcacgtcagc tgcggtgggc gccgaatgca ggtgcactac gtcggtgatt tggttgcggc 3607681 gcaccaacgt tcgaagcatc gacattaccg gcgtaatccc cgacccggca gtcaaaaaca 3607741 gaatcaacgg gggcgccgga tccggtaata cgaaattgcc ctggggcgca gccagccgca 3607801 caatggtccc tggctttacc ccggccacca agtgggtgga caggaagccc tcgggcatcg 3607861 ccttcaccgt gacggtcacc atgcgcgcgg acccggatgc cgccggactc gacgtcagcg 3607921 aatacgaccg ccagcgccag cgcccgtcga ccagcagccc gatcccgatg tattggcccg 3607981 gctggtagtc gaaactgaag ccccagcccg gtttgatgaa cagggtcgcg gagtcttccg 3608041 tctctcggcg gacccctagg atgcgccccc gcaattcccg cgcggaccac agcggatttg 3608101 ccaggtgaag gtagtcgtcg ggcaacaatg gcgtcgtgat gcgcgcggca atcttgcgca 3608161 gcgcatgcca gcccggatgc cggtcggctc cggcgacggt ggggcgcctg gtgtcgatga 3608221 tgctggcgtt aagcgtcgtg tgtttcttgc tcataggaag ctcctgctcg gccttagctt 3608281 ccgcccaaca aagctacggt accgtaacct acggttccgt atctaggccc ggacgcgcag 3608341 actgcgtcac acccacggca tcgtcagagc aggtccagca gaaatggcag ctcttggttg 3608401 gcgtaccagg cgagatcgtg gtcctgggcg tcaccgacca ccagctcagc gtcctcgtcg 3608461 cccaggtcag cggcatcgat gaccgcaatc gccgccatca cggccggctc agcgccggca 3608521 ttgtcgacat atgcggcgac aacctgatcg atcgtgattg gccccgccag cctgacgacc 3608581 gcgtcatcaa gatcgggacg gtacgtggca tcgtcgacct cggcggccag caccgcgcgt 3608641 ctgggcggca gggcgtctgc ggtggcgccg atgtccgccg ctagcagacg caacgacgcc 3608701 aacgccgctt cgcgcagcgc cacctcggca agctcctcgt cgtcaccctc ggcgtacgac 3608761 tcacgcaacg tcggcgtcac tgcaaaagca gtgccgttga ccggccacaa cgcgccatcg 3608821 gcaacgagtc gctgcaacat ggccagggtg gccgggatgt agacctgcgt caccgggcga 3608881 tcaacgtggc cacatagtcg tcgacgtatg tcgacaactc gcggggcgga cgcctgtagt 3608941 tgccactcac aagcggccgt ggcggcagct tgacctttgg cttttccaca tctgcgtagt 3609001 caatcgtgga cagcaagtgg gccatcatgt tcagccgcgc gtgctttttg atatcagact 3609061 ccaccacgta ccaggggctg acgggggtgt cggtatgcac catcatctcg tcctttgcgc 3609121 gcgaatagtc ctcccaccga tacaccgatt ccaggtccat tgggctgagc ttccattgcc 3609181 ggaccgggtc attccgtcga gccttgaatc ggcgcaactg ttcggcgtct gagactgaaa 3609241 accagtattt gcgaagcaga atcccgtcat cgatcagcat ctgctcgaaa atcggggtct 3609301 gccgcaaaaa caacacatac tcctgcggcg tacagaaacc catgaccttc tccacaccgg 3609361 cgcggttgta ccaggaccga tcgaagagca ctatctcacc tttggcggga agatgggcaa 3609421 tataacgctg gtagtaccac tgaccccgct cgcgatccgt cggcgcgggc aatgccgcga 3609481 tacgagccac tcgcgggttg aggtactcgg tgatccgttt gatggcgcca cccttaccag 3609541 ctccgtcacg gccttcgaag atgaccacca gacgcgcacc cgaatgccgg gcccactctt 3609601 gcagcttcac gaattctgtt tgcagccgaa acaattcggc ttggtagacg gcatcggaga 3609661 tcttgcgccg gcccggcgca gctgatctgt gtcccttcgc tctcgacgac gcgccgtcgt 3609721 tggtcgcggt gctcacatca acggatggta tatccacaca tcaccatcga cccctaacaa 3609781 ctaccgcgaa gcctccagaa gctcgtccag tgcttggctc aacagccccg gcagcagatc 3609841 gacatcgctc atcgcgtcgc ggtcggcatt gatgccgaaa tacaacatcc cgttatacga 3609901 cgtcacgctg atggccagcg cctggttgtg cagtagcggc ggcacggagt aggtctccag 3609961 cagcttggta cccgcaatgt acatctgcga ctgggttccg ggggcattgg tgatcaacag 3610021 attgaacaac cgtgccgaaa agctagtggc gacccgcacc cccatggcgt gcaaagtggc 3610081 cggtgctaac cccgacaacg tgacgatagt cctggcatcg accaggctgg cggcggtcgg 3610141 gttggattcg gtggcgtgcg cgatctgcga caaccgcact acggcattgc cctcccccac 3610201 cgggaggtca accaagaacg gtgtcacctg gctgatcgcc tgaccagggc cggttgagtc 3610261 gagttggtcg tcggcataga ccgacagcgg cgccatcgcc cgaacagtcg cggtcggtgc 3610321 cacagcttca ccgcgtgaca tcagccagtt gcccaaggca ccggcaatca ccgtcagcac 3610381 cacgtcgtgg agtcacagtc gtagcgagcc cgcaccgtgc gatagtcatc aagacttgca 3610441 cgggcaaccg taaatcgccg attacgcgac acggtggcat tgagcgggct actgggcgcg 3610501 gtgccccgtg ccaccgtgcg ggcgatatcg agaaccttgc ggcccgtctc gacgagttgg 3610561 ccggaattcg ttaccaaccc ggcgaccgcg gatccgacgg cctgtagttg tgcgcccggc 3610621 cgcaccagcc agtccccgac cgcgcgcagc agcaaccgcg tggtgccggg gtcccgttcc 3610681 gggacccaga tgtcttccgg aaacgccggt ggacgccgcg tccggtcggc gatcacgtgg 3610741 cctatcgcca gcgcggtcac cccgttgatc agggcttggt gcgacttggt gtagagggca 3610801 atgcgattct tttccagacc ctcgacgaga tacatctccc acaatggccg cgatttgtcc 3610861 agcggccgag cggccagccg tgcgatcagc tcgtgcagtt gctcgtcact acccggcgac 3610921 ggcagggccg accgccggac gtggtaggtg atgtcgaagt cgcgatcgtc gatccacacc 3610981 ggcctggcca ggcccaattt cacttcctgg actttctgac gatagcgcgg tatctgcggc 3611041 agccgctgtt cgacggtttc cagcagtgcc tcgtagctca atccggcacg cggacggcgc 3611101 aggatcaaca gcaacccgac atacattggg gtggctgtgt tctccagctg atagaaggag 3611161 gcgtccgatg cagacaaccg ggtgaccact acggccctgt cctccttgtc aattcgtcgc 3611221 gacgagtcac gtcgtcgccc acgctaacgg ttagcccgac cacttcacgg cgcgggtaca 3611281 cgcaagcccg cattgtgcga tgatggccag caaccaaacc gctgcgcaac actcgtctgc 3611341 cactctccag caggctcctc gttcgatcga tgatgctgga gggtgcccct tgaccatcag 3611401 tcctatcgcg aactcaccgg gcgacacctt cgccgtcaca cccgtcgtcg agtacgagcc 3611461 gccgccgcga aacatcccgc cgtgcgggca atcatcgcac gcagcccggc ggccgcacac 3611521 cccgcagcta gctcgccgac aaccaatcag gccgagcggc cgggcaccgg cagcggtcac 3611581 ctccacggcc aagtcaccgc ggctgcgtca agcggggacc ttcgccgatg ccgcgctacg 3611641 ccgagtgctg gaggtcatcg accgccgccg cccggtgggc cagctgcgcc ccctgctggc 3611701 acccggcctc gtcgactccg tgctcgcggt gagccgcacg gcggccggac accaacaagg 3611761 cgcggccatg ctgcgccgca tccggctgac accggccgga cccgacaccg cggacaccgc 3611821 cgccgaggtc ttcggcacct acagtcgcgg ggaccggatc catgcgatcg cctgccgggt 3611881 ggaacaacgg cccgccggta acgaaacccg atggctgatg gtcgccctgc acatcgggtg 3611941 agatcgccgg cccacaccct agttcgaagc tactgcggcg gccggcagcc caccgccggt 3612001 gtagcgggcc agtatcggac cgacgatcgc catgacgaac acatacgccg tggccaaggc 3612061 ggcaaccccc gggatcgagg caccggccag cccgatgatg atcaaagaaa actccccccg 3612121 ggcaacgagc gcggtgccag cacgcagctg cccacgccgt gccactccct cccgccgggc 3612181 agcgaacatc ccggtggcca ccttggtcgc tgcggtgaca gcggccaggg ccagcgctac 3612241 cggaagcatt gaaacgagct ttcccgggtc aaccgacagg ccgattccca ggaagaagat 3612301 cgtggcgaac aagtcacgca gcggagtcag caccatgcgt gcccggtctg cggtctcccc 3612361 ggtaagcgtg aggcctacca gaaacgcacc cacagccgcc gacgcgtgca gcgactcggc 3612421 caccgccgcc acgatcaagg tgatgcccag cacccgcaac aacaattgtt cggaatcagg 3612481 atgagtcacc aaccggccga catgatgacc ccaacgatac gacgccgcga acgccccaag 3612541 caaagcggcg atcgccaccg tcatgcccac gaccgcctcg agccagctgc cgtctgtcgc 3612601 gagaaccgcg aacagcggca agtaggccgc catcgcgaag tcttcgagca ccagcaccga 3612661 cagcacagcc ggcgtttccc ggttgccgag ccgacgcagg tcctccaaca gccgcgcgat 3612721 cacacccgag gaggaaatgt aggtgacccc ggccagaccg aggatggcaa caccgtccaa 3612781 ccccaaaagc cagcccgcca ccgcaccggg cgtggcgttg aggacgatat cgacacccgc 3612841 cgacggcagg tggtggcgca gactgctggc gaactcggtc gcagaaaact ccagacccag 3612901 ggccaaaagc aacaacacga caccgatggg cgcaccggta gcgatgaact caccggcggc 3612961 ggccaccccc aagatgccgc cattgcctaa cgacaaaccc gccaacaaat acaccggaat 3613021 cggcgacaac gcgaatcgtc gtgccactgc acccagcacc gcaagcaccg ccaacaggac 3613081 gccgagctca aacaacagcg ccctcgaaac ctccaccggt tcagcccttt tcgacgatct 3613141 gttcgacccc ggcgatcccg tcctcggtgc cgatcacgat gaggacatct ccggctcgca 3613201 gcacatcagt cgggcccggc gaggccaaca catcctcgtc acgcacgatc gccacaatcg 3613261 acgcgccggt acgggtgcgc gcacgggtat cacccagcgg ccggtccaca aacaagctac 3613321 ccgcccggat gtgaatctga ccggccttaa gcccgggcac ctcacgcgtc agctcggtaa 3613381 atcgctcggc gatcctcggc gcacccagaa tctgagccac cgcctcggcc tcttcatcgg 3613441 tgagccgcaa aaccggtcgg gcttcgtccg gatcatcgcg gccatacagg acgacgtcga 3613501 aaccgccact gcgcctggca acgatgccga tccggtcacc gcgatagctg gtgaactcgt 3613561 atcgcaggcc cacccccggc agcagcacct ccttgacgtc cataggagtc aatccttgac 3613621 gaaatgcggc caagatagaa gcggtacggg caatctcgtt gactcaggta tgccggtgcg 3613681 gccacggcaa caacatcgac acctcgcggc ggtaatcgcg gtattggtcg cccagcgccg 3613741 cgagtaggtc gcgctcttcg aactgcaacg cgaccaagat gtagcccgtc gcgccgatcg 3613801 cgaaaagcaa gtgccccgcc gtcatcatgg gcgtcgccca gaacgcgacg acgaatccga 3613861 gcatgatcgg gtggcgtacc caccggtaga gcagatgagc ctgaaaaccg atctcggtgt 3613921 acggctttcc gcgccaagcc aaatacacct gccgtaggcc gaacaattcg aaatgattga 3613981 tcatgaaagt cgacgtcaac accgtggccc acccgagcca gaacaacgcc cacaacgcca 3614041 cccggccagc cggctgccgc acgtcccaga tgaccgccgg catcgttcgc cattgccagt 3614101 acagcaacaa cagcgcaacg ctggccagca gtacataggt gctgcgctcg atcgagggcg 3614161 gcacgaatcg agtccaccag cgtttgaaac cctgtcgtgc catcacgcta tgttggacgg 3614221 cgaacacgcc cagcagcacc aagttgacca cgaccgcctg gccgatcggc gccgcgatcg 3614281 cgtgatctac ggttcgtggc accactacgt cgccgacgaa accgatcgca tacccgaagg 3614341 caaccaggaa taccagatag ctcgcggccc cgtaaatgat cgtcaaataa cgcttcataa 3614401 cctgattctg ctccgcagga gtgtgcagct ggggcgttcg gcccgattgg cgccaatcag 3614461 cgattcaaca gtgccatgat gtgcggcatg gcctcgcggg ccgcaacgcg tcccgcctcg 3614521 cgggcggcgt cgatctggtg aaactccagc agcccaacag caccggtgtc gggtctgata 3614581 acgacctgcg caagactgag tgcggcatcc gccccacgct ggctgccgat tgtcatcgtg 3614641 cgcatcaagg tgtcgccgat tcctggcact tttggcgagc cgtcctgtcg agccgagccc 3614701 ggcccgccac cacctaagcc gatgctcacc gcgatcaatg ggccatcagg acttgcccgg 3614761 gtcgagaccg gaaggttgtc taacacaccg ccatccacat gcagtcgacc gttgtagacc 3614821 tggggcggat agatgcccgg cagccgaagg gaacacccaa tgacatcgac gagtcggcct 3614881 cggcggtgta cgaccggtcg gcgggcaagc aaatcgacgc taacgcaacg gaactccttt 3614941 ggcagctcct cgaccagtcg gtccccgaac gctgcttcta atagggtcag cgtccgtcga 3615001 ccacggacta gccccctgac cggaaacgcg tagtcactga gcggattgtg ccgaatgaag 3615061 tactcgtatg cgtaggcgtc cgctgttgcc gcgtccatac cgcacgctcc gaacaccgca 3615121 ataaccgccc ccatgctggt gccggcgaac cggtcgatgg tgaccccgac ccgctctagc 3615181 tcgtcaagaa ccccgaggtg cgcaaagccg cgcgcgccac cgccgccgag gactagaccg 3615241 atcgagcggc cggcgatgcg tgcggcgagc gggcgtacgt tttccaagat gcgtcggtaa 3615301 tgaaccacat gaaccgatcg cggcgtgatc aattcctccc actgacgccg gtgctcccgg 3615361 ctggcggccg gaccggccag cacgaggtcg gcaccccgcg cacgcgccgg cagccgcgcg 3615421 gcttgtgggt tgggatctcc cgcgaccagc actatccggt cggcgacgcg caggcagaag 3615481 tcccgccagc cggcatcctc gaccgcggca tgtagcacta ccttgtcggc gactcgctcc 3615541 gcgcgatcaa ggccgtcgcg gtcgacccgg ccggggtcaa cggcacgcaa ccgcgccgac 3615601 agcgcggtaa gcaggccagc ggccactgcc ggcacgggcg cgtcgccgct cactccgatc 3615661 accgaaacga ccacctcagg cgacgtcgag tcagtcgccg gtggcggtgc ctcccgcagc 3615721 cgcgttgcca gcacctttac caacgccgcc agcgcaccat ggtcggcgat ctcgtcgaac 3615781 tgtgccttgg tgagccgcac tagcttggtg tcgcgcaacg cccggaccgt cgcggaccgg 3615841 ggcgcgtcaa taagtagccc aagctccccg agaacctccc cgcgacccag ttctttgaga 3615901 acgatgctgt cctgcagcac ctgcacgcga cccgtgcgga tcacgtaaag cgaatcggac 3615961 gggtcacctt cgtggaagag atagcaaccc gcctccaact cgacgtcctc aacgtgctcc 3616021 ccgagctgtg ccaaggtggc cgcgtccagg ccggcaaata gcggcagatt ccccagcgga 3616081 tcggcgtcac cggccgccca atgctcaatc ggcgcggccg ccggctgggg aatcggtggc 3616141 tccaaccgcg gcgcgatcgc gggctccggc gccggcatct ggacggggtt gcggttggtt 3616201 ctacccagca ccgcggccgc gacagccacc gcgatgaaac agatggcagc catagcccat 3616261 ccgcgccgca acgcctcctc ggcagtaccg tgctccggct taccgatcaa gatcaccatc 3616321 accgcgacac cgagcaccgc accgagctgg cgagtggtgc taacgaccgc cgacgaggtg 3616381 gcatagctgc cgcccttggc gacctcggcc agcgctgcac tgctcaacac cggcaacgtc 3616441 gcgccgacac cgatgccctg cagcagttgg cccggcagcc acacgcggag gaaatccggc 3616501 tcggacccga cacgctgcaa ataccacacc aggctgccgg cccagaccag cgcaccaacg 3616561 aggacgatga cgcgatgccc atgccgaccg gcaacccgac ccagcgccgc cgccaccacg 3616621 gcagccacca ccgcagcggg cgcgatcgcg aaacccgcct tcagcagcga gtagtgccac 3616681 acatagttga ggtaaagcac atgggtaagg ccatagcagt aaaaacccgc tgcggcgacc 3616741 agcgtgagca ggttgcccgc cacgaacgac cggctacgca acagcgccgg ctcgaccagc 3616801 ggcgcggggt gcgaccgcga gctgtgcacg aacccaaccg aggtcaggac gctggccagg 3616861 aacgaaccga cggtggccac gctcaaccaa ccccagtccg gccccttgac caaaccgagg 3616921 gtaaccaacc cgagcgttac cgcaagcagc agcgcaccgc gcaagtcagg catgcggcgc 3616981 cggcccgagg cgcggctctc gacgagcatg cgcttggtgg cgatcgccgc gacgatgccc 3617041 agcggaacat tgaccagtaa cacccaccgc cagccggccc actccacgag gagcccgccg 3617101 atcggcgggc ccaggccagc cgcgatcgct gccgccgcac cccacaggcc gatagcgtgc 3617161 gcgcggcgcg ccgcgtcgaa gccctcaacg accagtgcga gcgaagcagg cacgagtatc 3617221 gcagccccga tgccctgcag cacccggaac gccaccaact gctcgacact gccggcgacg 3617281 gcgcacagcc cggacgcaat ggtgaacacc agcacaccgg acaggaatgt ccgtctgcgg 3617341 cccagcaaat cggccaacct gccggccgca accatgaagg cggcgaagac gatgttatag 3617401 ccgttcagaa tccaggacag gctcccgatg tcgtaggacg ggaaggaacg ctggatatcc 3617461 gggaacgcga tgttgacgat tgtcgagtcg agaaacgcca ggaaagcgcc gaaccccgct 3617521 accagcagaa ccgacgccga cgaaggtcgg cgacgacggg tgagattagc gaaccccttg 3617581 ccgccgtgca acgaaatgtg catgcgcgcc ggggcgcggg gtgtgccggg aagtgacttc 3617641 tgggaactga gaaaccgata cacccatctg caacctacgc gctaacgctt cttgaccgat 3617701 ttcggcggct tggcgccgcg gccttgtcgg cgggcggctt cgcgccgctc gcgccggcta 3617761 gcaccggccg gcactccggc cggcgtcttg tgggctccac cgccgttgcg ctgcacctga 3617821 gccgagccat cctccgcggg accggaatag gtcaaagcgg gcgactcgct ggcaacaccc 3617881 ttggcgcgta atgcacttgg agctctttcg cgcgcgccac catcgaccgc gctgcgttgc 3617941 tgcgcggcgg ctgcggccgc ggcggcgaat tcggcaagct ctgcgggttc ggcagccggg 3618001 gcaaccggcg gggcggggac cgcctccacg gtgacgttga acaggaagcc gaccgattcc 3618061 tctttcatgc cgtcgagcat ggccatgaac atgtcgtagc cctcacgctg gtactcgacc 3618121 aacggatcgc gctgcgccat cgcgcgcagc ccgataccct ccttgaggta gtccatctcg 3618181 tagaggtgtt cacgccactt acggtctatg acgttgagca gcacgttgcg ttccagctgg 3618241 cgcatcgcac cctcgccggc gatttcctcg agttcggctt cccgtgcggc ataggcacgt 3618301 tcggcgtcct tgagtagtgc ctccagcaac tcctcgcggg tgagatcgtc gcgctcgaat 3618361 tcgtggtcct tgcgggtcag cgagtcggcg gtgatcccca ccggatagag ggttttgagt 3618421 gccgtccaca acgcgtccag atcccaatct tcggcatagc cttcgccggt cgcgccgtcg 3618481 acgtaggcgg tgatgacatc gcggaccatg tccagcgcct ggtccttgag gttttcgcct 3618541 tcgaggatgc gccggcgctc ggcgtagatg accttgcgct gctggttcat cacctcgtcg 3618601 tatttgagga cgttcttgcg gacctcaaag ttctgctgct cgacctgggt ctgggcgctc 3618661 ttgatggccc gggtgaccat cttggcttcg atcggcacgt cgtcgggcag gttcagcctg 3618721 gtcaacaagg tctccaaggc cgcgccattg aagcggcgca tcagctcgtc acccagcgac 3618781 aaatagaagc gcgactcccc ggggtccccc tggcggccgg accggccacg caactggttg 3618841 tcgatccgcc gcgactcgtg gcgctcggtg cccagcacgt acaggccgcc ggcctcgatt 3618901 acttccttgg cctccttgct ggcttcctct ttgacgatgg gcagttcgga gtgccaggcc 3618961 gcctcgtact cctcgggcgt ctccaccgga tccaggccgc gttcgcgcag ccgctgatcg 3619021 gtgagaaagt cgacgttgcc gcccagcaca atgtcggtgc cgcgaccggc catgttggtg 3619081 gcgacggtga cgccgccgcg gcggcccgcc accgcgatga tggtcgcctc ttgctcgtgg 3619141 tacttggcgt tgagcacatt gtgcgggatg cgccgcttgg tgaactgccg cgacagatac 3619201 tccgagcgct ccacgctggt ggtgccgatc agcaccggct gtcccttcgc gtagcgctcg 3619261 gcgacgtcgt cgaccaccgc gatgtacttg gcctcctcgg tcttgtagat caggtcggac 3619321 tggtcttcac ggatcatcgg catgttggtc gggatgctga ccacgcccag cttgtagatc 3619381 tcgtgcagct cggccgcctc cgtctgggcg gtgccggtca tgccggcgag cttgtcgtag 3619441 agccggaagt agttctgcag cgtgatggtg gccagcgtct ggttctcggc cttgatctcg 3619501 acgtgctcct tggcctcgat ggcctggtgc atgccctcgt tgtagcggcg gccgatcagc 3619561 acccggccgg tgaactcgtc gacgatgagc acctcaccat cgcggacgat gtagtccttg 3619621 tcgcggctga acagctcttt ggccttcaga gcgttgttga gatagctgac caacggcgag 3619681 ttggcggcct cgtacaggtt gtcgatgccg agctggtctt cgacgaattc cacacccttc 3619741 tcgtgcacgc cgacggtgcg tttgcgtaga tcgacctcgt agtggacgtc cttttccatc 3619801 agcggcgcca accgggcgaa ctcggtgtac cagttggagg cgccgtcggc gggaccggag 3619861 atgatcagcg gggtgcgggc ctcgtcgatc aggatggaat cgacctcgtc gacaatggcg 3619921 taatggtgcc cgcgctgcac cagatcatcc agtgagtgcg ccatgttgtc gcgcaggtag 3619981 tcgaacccaa actcgttatt ggtgccgtag gtgatgtcgg cgttataggc cacccggcgt 3620041 tcatcgggtg tcatggtggc caaaatcacc ccgacctgaa gcccgaggaa gcggtgcacg 3620101 cggcccatcc actcactgtc gcgtttagcc aggtagtcgt tgacggtgac gatgtgcacg 3620161 ccgttgccgg ccagcgcatt gaggtaagcg ggcaacacac aggtcagggt cttgccttca 3620221 ccggtcttca tctcggcaac gttgcccagg tgcagggcgg ccgcacccat cacctgcacg 3620281 tcgaacggcc gctggtccag cacccgccag gcggcctcgc gggccacggc gaaggcctcg 3620341 ggcaacaggt cgtcgagggt ttctgggttt ttctggtcgg ccagccgccg cttgaactcg 3620401 tcggttttcg ccctcagctc ggcgtcggtg agtttctcga catcgtcgga caaagtgccg 3620461 acatagtcgg ccaccttctt gaggcgcttg accatgcgac cttcgccaag gcgcagcaac 3620521 ttcgacagca cagctatgtc cccgcatgtg taggagtctt tagataaggc gactcccatg 3620581 gtaggtgacg acgcggcgcg cgccgccgat cacgccagac ggatcaagcc gtagtcgtag 3620641 gcgtgccggc ggtagaccac cgacggccgt tcggtgtcct tgtcgtagaa caagaagaag 3620701 tcgtgtccaa ccagctccat ctggtagagc gcgtcatcga ccgacatcgg cttggccggg 3620761 tgttctttgg tgcgaacgat ccgcccaggc tcccgctcga cgacggcacc gtcgtgatcg 3620821 tgtgcctcgg ctggtctggt gttgaagccg ttctccggcg ctggcaccac cgcggtcgcc 3620881 tcggccagcg aaaccggggt tttgtcgccg tagtgcacct tgcggcgatc cttaccgcgg 3620941 cgcagccggc tctccagttt gacgaccgct gattcaagcg cggcatagaa gctgtcggcg 3621001 caggcctcac ctcgcaccac cggccctcgc ccacgcgcgg tgatctccac gcgctgacag 3621061 gacttgcgct ggcggcgatt acgttcgtgg tcgagttcga cgtcgaacag gtagatggtc 3621121 cggtcgaacc gctccaagcg ggcgagtttc tgcgaaacgt agatgcggaa gtggtcgggg 3621181 atctcgacat tacggccctt gaacacgatc tcagcgtttg atttcggttc ggccagaacc 3621241 tgacctgaat ccacggctag ccttgacata cgtgacaact cgtttctctt tccacgtcac 3621301 acgcgccctg cgtgcctggc cttcggggag acgcgccgac ggggtgggag cggttggaga 3621361 agttaccgcc gcaggctgcc cgccggagca agatgtcgat tgctcacctc ctatcgcggg 3621421 atactgattc aacctgggaa gcgcgagcgt gagtcgttaa aggttgatct cgacgttagc 3621481 ccgtgttcgg ctcaccgtgc caccaaattg accgacctgt ttcgagttct tcacgttgtc 3621541 ttggcaactg caccggctca ggcagatcct cacgcggccg cgaccgccaa cacggcaccc 3621601 acccgcacac cggcggcctg caagacccgg accgactcgc gcgccgtcgc cccggtggtg 3621661 atgatgtcgt cgacgagcac gacttcgttg cgcggccgct ggccccgcaa cagcacccga 3621721 cccgtgatgt tgcgctcgcg cgcggacgcc ccaagaccta ccgagtcccg ggctagcgct 3621781 cgcatccgca gcgccgggac gacggtgacg tcatggtggc gcccaagggt ggcacccgca 3621841 atccgcgcca tccggctgac ggggtcaccc ccacgccgtc gcgccgccca ccgtctcgtc 3621901 ggcgcaggca ccatcgtcag cgggttttcg agcatgcccc aggacaacag gtggtcgaca 3621961 ccgacaatca gcgcgcacgc cagtggcgcg acgaggtcgc gacggccgtg ctctttcata 3622021 gcgaggatcg cctgacgacg cacgcccgcg tagcggccga gcgcgaacac cggcacctgt 3622081 gggtcaacac gaggactcac cacgtgcggt tcaccggcag ccaccgacag ctcggcggca 3622141 caggcggcac accagcgggt cgccggcgca ccgcagccac cgcattccag cggcaggacg 3622201 aggtcaagca cacaccaagt gtcgcggtca ccggtgacag cagtgctgtc aatcggcgcc 3622261 gctgcgcagc ggcggccaga caaagctgag cgcaccctga ctcaattggg taatcacgct 3622321 ttccagataa cgcagcggca gctccgttcg ccactccgga agcaacgcag agagctgcaa 3622381 acaatcggct gcgaggctga cgtcggtgat gcgcagccca tgcggcagct caggcagtgg 3622441 aactcggtag gccggtgtcc gtgccggcag tgtccaccgc cgttgtccgg tgatcacggt 3622501 gcgcggtcgc agccacagtg tcgtctggga ggttgtgccc gccacgtcga cgtcgacctc 3622561 aagacctccc cagtcgggcc ggcgcgccca gcgcagccgg gccgctccgc tctcggacaa 3622621 ctcaccgcgc agctgcggcg tcgcttgacg gagcacatca tcgaaaatct cggtcggcag 3622681 ggcggacgac aactcgacgg gagcggcgat caccagcggc ggcacacccg ggcggatgtg 3622741 cacgttgcgc aggacggcaa cggcgctgtg caaatgatgc tgatcccagc tgatgccgcg 3622801 agcggccacc cgaacctcgc ccagctggcc gacggccagc ccctgcggtt ccagtgccga 3622861 gtccagctcg gtgacggtca gcaccacgtc atggtcccca atccgaaccg tgacttcctt 3622921 gccgatgagc agctgctgca aggtggtgaa caacgtccgg tagggcgccg caactgcctg 3622981 ggcggctccc gcgctgacca gcgacatccc ggtcgacgac cacagcgagg ccagcatgtc 3623041 caaggcacgg aagggatcat cccaacgcag ccggggaact cttggcgaca tcaacaagcg 3623101 cctcctcact gcgagggtag ccggtgtgct caggtcgcga aaaacgcagg cacagcactc 3623161 atccgggcaa taccggcgcc gcccccggca ccatcagccc cggtacgtcc gcccagcctg 3623221 gtcggctttc gacagacgcc gagtacatca acaccccttg cgggccggcg acatacacag 3623281 tcgacgggtt ggccgcgatc gccgtcagtg gagtttgcaa cccgcgggac ggcgcgtcgg 3623341 agttcacccc gtcgaggttt acataagaca ccggatgggc ggcgtcggtg cgtgtcacca 3623401 cgatgtcgtc accggttcgc caggacaacg acaccaccga ggaacccagc ccgaaaccca 3623461 gccgccgagg gtaggtcagg gcgaactggc cagcctgggt ctgctcgacg ccggcgagga 3623521 tcacctgccc accgatcacc atcgcggcgc gcgtcccgtc acgggacagt tgaagatcgt 3623581 tgatcgcccc cgggaagcgg ctggccaccg cggtcgaatc caccggaatc cgcgcgggtt 3623641 gccccgatgc cgggtcctgt atcgctcgca gcacgacgtt ggtatcgacc accacccaga 3623701 ccgcgtcgtc cagcgaccag ctgggccgcg acaggctgtg cccgtcggcg gactgcaccg 3623761 cctcgccgcc gaggtcgccg acccacaaag acgccgcctc atccggagcc ccgcgcccca 3623821 gcgtcaccac cgaggccacc tgacgcccgc tgcgtgatac ggcggccgcc gtctgctccg 3623881 gcatccgtcc gaaggccccg ggcacggggg tgactcgctg tgcgtccatc gccaccagtg 3623941 atccgttcac caaggcgtgc aaccccgcgg cggcaccgtc ggccaccccc gggtcggtgg 3624001 ccgcgacatc ggaagtggtc cacccctcgg caaacctgtc ttccagcggg gcgccgtcgg 3624061 cgttgatcac gtacggcccc ctgatgtcgg ccctggccaa ggtccagatg atctgtgcgg 3624121 caagtaattg cctgctgtgc ggatcggtgg tggacagctt ctccatgtcg actcgcgcgc 3624181 cgccgtaccc gcggccgatt ccgctctttc cgccgtcggc ccgagtcacc ggcccgcgca 3624241 gtcgtagcgg cggagcgagc agattacgca ccgtgcgcgc catctccggg cgtggacccg 3624301 ccagcagttt ggagacgagc tccgtggcca gctggtcgcg gtcggacacc gcgacgtagc 3624361 gcggatcggg aaccacggtc ttgccggtgg ggtcggcgaa gtacagggtg ttgcgcttgt 3624421 acgtttcttg gaactgctgc cagtccagga aaaccccgtt gggtaggcga tcgatgcgcc 3624481 aaccatcgga cgtcttgacc aactcgatcg ggcccggatc cggcagttga ccctcggcgg 3624541 tctcaaacac ccccacatcc gagagcgagc cgagaatgtc tgcccgcatg gtcaccgaaa 3624601 ccttctcggc gcttcgggtt tcgacgaaca ccacgtggtc gatcaacaac gcgctgccgg 3624661 cgtcgtccca ggcgttggaa gccgattcgg tgaggaactg acgcgccgcc aggtgccggt 3624721 tggccgggtc ggctgtggcc ttgaggaact cgcgtaacag cacgtcggga tccatacccg 3624781 ggctcggttt gggcagattc gacggcaccg gacgttcgac ggttccgatg gcttgcgggg 3624841 ccgacgtgct gggcacactg gcacagccgg ccagcactgc accaaggaac aacaaaattg 3624901 tcagccgcat caaccgctcc actccgcgtg ctcacgtggg cgctgacgtt ccttgtattc 3624961 cggtggcatc ggttgcggat tcggttgcgc gaccggttgc agaactggct gcgggatcgg 3625021 tttcatgggc agcgggctgg tggtgacctt gtggccgcgc accatcggaa gcgtcagccg 3625081 gaagcaggcg ccctcgccgg gttcgcccca cgcctcaagc cgaccctggt gcaatcgggc 3625141 atcctcgacg ctgatcgcca aacccagccc ggtgccgccg gaccgacgta cccgtgaggg 3625201 atccgagcgc cagaaccggc taaacaccag cttctcctca ccaggccgca gcccaacccc 3625261 gtagtcacgc acggtgacgg cgaccgtgtc ttcgtcggcg gccatccgga tccgcaccgg 3625321 tttgtgttcg gcgtggtcga tggcattggc aatcagattg cgcaggatcc gttctacccg 3625381 acgcgcatcg acctccgcga tcacctgctc ggcgggcaga tccaccagca actcgatacc 3625441 ggcctcctcg gccaggtggc ccacattgcc gagcgcgttg ttgaccgttg tgcgcaagtc 3625501 gaccgcctca accgacaact cggccacccc ggcgtcatgc cgcgagatct ccagcaggtc 3625561 gttgagcaac gtctcgaatc ggtccagctc gctaaccatc aactcggtgg accgccgcag 3625621 cgtggggtcg aggtcggcgc tgtggtcata gatcaagtcg gccgccatcc gcaccgtggt 3625681 cagcggcgta cgcagttcgt ggctgacgtc ggaggtgaac cggcgctgta ggttgccgaa 3625741 ctcctccagc tgggcgatct gtcgggacag gctctcggcc atgtcgttga acgacaccgc 3625801 cagcctggcc atgtcgtcct cgccgcgcac cggcatgcgt tcggacagat gtccctcggc 3625861 gaaacgttcg gcgatccgcg acgccgaccg caccggcacc accacctgac gcgacaccag 3625921 cagcgcaatg ccggcgagca ggactagcag taccaggccg ccggtggcca tcgtgccacg 3625981 caccagcgtg atcgtggctt gctcgctcgc cagcggaaag atcaggtata gctccaggtt 3626041 ggccacccgc gacaacgtcg gagtcccgat gatcagggcc ggcccggaga aaccttcggt 3626101 ctgcaccgtg gcgtactggt aggcggcctg cccggccttg acgaagccgc gcagcgcgtt 3626161 gggcacctga tcgacgggtc cggcagtaga ggcagcgcgc ggcccatcac ccggcaccat 3626221 cagcaccgca tcgaacgcac cggcgaggcc agcccccgaa gcggggtcgg ttttcgacgt 3626281 cagagtgttg cgcgcaagct gcaggctact gtccagtgag cgcgtctcct caccgttgac 3626341 gatcccgctg acggtggtgc gtgcccgctc gatctggtcg atcgccgccc tgaccttgat 3626401 gtcgaggaca cgattggtga cctggctggt cagcacaaag ccaagcgcca ggatgacggc 3626461 tagcgacagt ccaagggtca gcgccacgac ccgcagctgc agcgatcggc gccacgcgac 3626521 agctacggct cgactcaacg cactgaggcc ccgtgtcatc gggccagagc gaccccggcg 3626581 accccgaatg cgtcggcgcg agccgaagat catcggcgcc gctccttagc atcgctgcgc 3626641 tctgcatcgt cgccggcgcg gatcacggag gtccggcctt gtaccccact cctcgaacgg 3626701 tcagcaccac agtcgggttc tcgggatcct tttcgacctt ggcccgcaga cgctggacat 3626761 gcacgttcac cagcctggta tcggctgggt gccggtaacc ccatacctgt tcgagcagca 3626821 catcacgagt aaacacctgg cgcggcttgc gcgccaatgc gaccaacagg tcgaattcca 3626881 gcggtgtcaa cgagatctgc tcaccgttgc gagtgacctt gtgcgccggt acgtcgattt 3626941 ctacgtcggc gatggacagc atctcggcgg gttcgtcgtc gttgcggcgc agccgcgccc 3627001 gcacccgcgc aaccagctcc ttgggcttga acggcttcat gatgtagtcg tcggcgcccg 3627061 actccagacc cagcaccaca tccacggtgt cggtctttgc ggtgagcatc acgatcggaa 3627121 caccggaatc ggcgcgcaac acccggcaca cgtcgatgcc gttcataccg ggcagcatca 3627181 aatccaataa caccagatcg gggcgcagct cgcgcaccgc ggtcagagcc tgagtaccgt 3627241 cgccgatgac cgcggtgtcg aagccttccc cccgcagcac gatggtgagc atctcagcca 3627301 acgaagcgtc gtcgtcaacg accaaaatcc tttgcctcat ggtgtccatg gtgtcaccac 3627361 atcgggacaa aactggcgca ccacacgggc gtttcttgct tgattagggc aaataccctc 3627421 aacttggcac gtctggaggc gccaaagtcg ccgctagtcg gcccggatca acatcggcgc 3627481 cgacaaccag ccaccggccg ccccaccctt gggccgccaa ctcggcgtag accgcaccgg 3627541 tgcgctgctg aagttcagcg tcgcgttcgt aattgtcgcg cgcccgaccg gggtcacgct 3627601 gggcacggcc gcgggatcgt tccccggcga gctcggcaga gaccgcaagg agcacctgcc 3627661 agtcgggctt gggcaacccg agtcttgcaa attcgatccg ctgaacccag gccgctgcct 3627721 tcccggccgc gttttcatgt aggcgcgccg cgctgtaggc cgcgttggag gcgacgtagc 3627781 gatccaggat caccacgtcg tagccgcgac acagcccctg gatcgtgtgg accgcgccag 3627841 cgcggtcgag cgcgaacagc gtcgccatcg catacaccga cgatgcgagg tcaccgtgct 3627901 cgccgtgcag cgcctccgct gcgatgtcgg cggccaccga ctgtccgtag cgcgggaacg 3627961 ccagtgtggc caccgatctc ccggctgctc gaaaggcccc ggacagcttt tccaccaacg 3628021 tccgcttgcc agcgccgtca acgccctcaa tcgcgattag cacggcgcgg ccctgtcggt 3628081 ggcggcgcga gcagacgcaa aatcgccctt ttcgtcatga aaatgggcga ttttgcgtct 3628141 gctcgcgggt gggaggcact cagtagcggt agtggtccgg cttgtaggga ccctcgacgt 3628201 cgacgccgag gtattcggcc tgctccttgg tcagcttggt caggtgaccg ccaagggcct 3628261 cgacatggat tcgagccacc ttctcgtcga ggtgcttggg cagccggtac acctcgttgt 3628321 cgtactcgtc gttcttggtc cacagctcga tctgggcgat cgtctggtta gcgaagctgt 3628381 tgctcatcac gaacgagggg tgcccggtgg cattgcccag gttcagcagc cgcccctcgg 3628441 acagcacgat gatcgagcgg cccgtgtcgc caaaggtcca caggtcgacc tgaggcttga 3628501 cgttgacccg tgtcgccccg gagcgctcca gcccggccat gtcgatctcg ttgtcgaagt 3628561 ggccgatatt tcccaggatc gcgtggtcct tcatcgcctt aatgtgctcg agcatgatga 3628621 tgtctttgtt gccggtcgcg gttacgacga tgtcggcgtc cccgatggcc tcctcgacgg 3628681 tgaccacgtc gaagccctcc atcatggcct gcagcgcgtt gatcgggtcg atctcggtga 3628741 cggagacccg cgctccctgg cccttcatcg cctccgcaca gcccttaccg acgtcgccgt 3628801 agccgcagat gaggaccttc ttaccgccga tcagcgcgtc ggtgccgcgg ttgatgccgt 3628861 cgatcaggga gtgccgagtg ccgtacttgt tgtcgaattt ggacttggtc accgagtcgt 3628921 tgacgttgat cgccgggaag gccagatccc cggccgcggc gaattggtag agccgcagca 3628981 cgccggtggt ggtctcctcg gtgacgccct tgaccgactc ggctatcttg gtccacttgt 3629041 ccttgtcggt ctcgaagcgg gtccgtagca ggttcaggaa gaccttccac tcggcggggt 3629101 cgtcctcctc ggcgggcggc accacgccgg ccttctcata ctgcatgccg cgcagcacca 3629161 acatggtggc gtcaccgccg tcatcgagga tcatgttggc cggcttgtcg gggtccggcc 3629221 aggtgagcat ctgctcggcg gcccaccagt actcttcgag cgtctcgccc ttccacgcga 3629281 acaccgggac acccttgggc tcgtcggggg tgccgtgcgg gccgaccacg acggcggcgg 3629341 cggcgtgatc ctgggtggag aagatgttgc acgaggccca gcggacttcg gcgcccagcg 3629401 cggtgagggt ttcgatcaac accgcggtct gcaccgtcat gtgcagcgaa cccgagatcc 3629461 gggccccctt caggggttgc acctcggcat actcgcgccg cagcgacatc aggccgggca 3629521 tctcgtgctc ggcgatccgg agttctttgc ggccgaaatc cgctagtgac aggtcggcga 3629581 tcttaaagtc gatgccgtta cgaacgtcag gggtcagcga atttttggtc accaaatttc 3629641 cggtcatagg ggctttcatc cttctttggg ggctcacagg gatccgagcg ggctacttag 3629701 cctaggtacg ctcttgcagt cactgtagcc gccgtcggtc agccccgcag gtcaggggac 3629761 attgatcaca ccgtgacgct ccgcgaacgg cgttattagc cgtgctaggt ccgctgcgac 3629821 atcatggtcg gcctcgggcg gcatcgacac gtagctcaag cacagccgca cgatcgcacg 3629881 cgagagcaca ttggcgtcgt tatcggtggt ggccacccag gtatcggtga aggccggcgc 3629941 cagccgggcc gacgcgcggg tgatgatcgg cgcgctgtcg gtggtgatca gttgcagcag 3630001 atcgggcttg gcgacaccgg tcaacagcga gatgaccaac ggatctgccg ccgactcggc 3630061 gaagaacgac cgaaagccct gcaggaacgc ttcgtaaaag ttgccgacgt tggcgtccaa 3630121 cgatgcatgg acgttgtcca ctaatcggtc ggccaggcgc agcgcgtatc cctgcgccag 3630181 gccttgccgg gaaccgaatt cgttgtagat ggtctgccgg ctgatgcccg ccgcgcgggc 3630241 cacgtcggac agcgtgatgg cggaccagtc gcgggtcagc agcagatccc gcatcgcatc 3630301 cagcaccgaa tcccgcaaca gggcccgcga ggcctcggca tagggtatcc gcttcacagg 3630361 cgcgacagta gcgcttggag tgctcacgag cgagccacct ccaccatctc gaaatccgac 3630421 tttgccgcac cgcaatccgg gcaactccag tcatcgggga tgtcgtccca gcgggtgccg 3630481 gccgcgatgc cgtcctccgg ccaacccagc gcctcatcgt actcaaagcc gcattggata 3630541 cagcggaaca gtttgtagtc gttcacttag ttaccctcct atcttttcga aatcgacctt 3630601 ctcgcgcacc gcgcagtccg ggcagcacca gtcgtcggga atttgatccc agcctgtgcc 3630661 ggctgggaag ccttccctgg catcaccgtt ggcctcgtcg tagacgtagt cgcagaccgg 3630721 gcaccggtag gcggccatca tgccgaggct ccgtaacggg cgagtgcctt ctcccgcacg 3630781 cgcgggtgca ggttaacccg agtgatatcg ccgccgtagt gctccagcac ccggtgatcc 3630841 attaccttgc gccacaacgg cgggaagtag gtcagcgaga tcatcgatgc atacccactg 3630901 ggcaggttgg gcgcacccgc catgctccgc agtgtctgat agcggcgagt ggggttggcg 3630961 tggtgatcgc tgtgtcgctg caggtggtag aggaacaggt tggtgacgat gtggtcggag 3631021 ttccagctgt gcaccggggc gcagcgctcg tagcggccgt tggcgctctt ctgccgtagc 3631081 agtccgtagt gttcgaggta gttgacggcc tctaacaggc tgaagccgaa gactgcctgg 3631141 atgatgacga acgggatcag cgccgggccg aagaccgcga tcagcccacc ccacaacacc 3631201 accgacatca gccacgcgtt gagcacgtcg ttgcgcagat acgtcatggg attccagggg 3631261 ctgacgccga gccgacgcag ccgttgggcc tccaaatgaa cggccgagcg caagccgccg 3631321 ataacactgc ggggcaggaa ctcccacaac gtctcgccga accgcgccga cgccgggtcc 3631381 tccggtgtgg acacccggac gtgatggcca cggttgtgct cgatgtagaa gtgcccgtag 3631441 caggtctggg cgagggtgat cttggacagc caccgctcca gcgaatcctt cttgtgcccc 3631501 atttcgtggg cggtgttgat accgacgccg ccaagcacac cgaccgacag cgccacccca 3631561 agcttgcccg cccagctcaa ggcgccgtca aagccgagcc aactgaggtt tgcggcggtg 3631621 aacaggtatg cgcccagcac cacgctgagg tactggaacg ggatgtagat gtaggtgcag 3631681 tagcggtagt acttgtcatt ctccagccgg tcggtcacct cgtcgggcgg gttctgcccg 3631741 tcgggcccga agcgtaggtc aagaagcggc aacaagacgt agagcaggat cggtccgatc 3631801 cacagcggca cctgcgcggc ggcgtgccag ccgagctggt tcatccccca gatcagcggc 3631861 agcatcacca ccaaggccgt cggggcgatg aggcccataa gccacaggta acgcttcttg 3631921 tcccgccact cctcgacttc gggcggccgg ggggcttcgg gtccaccaga gccgatttgc 3631981 gtggtcatat gccaaacctc ctcatgagcc acaccacgtt gggatttgac aatagagcag 3632041 tttgcgtctt atgtctagac atataacgca atttgtaaat acgcggcgaa gctagttcaa 3632101 cacctccggg tcgcgctctc tcgagcttgc cgaaggccct gcgccgagtg ccggcgcccg 3632161 tagccgacat aaatcgcggt tccggccacc agccagatcc cgaaccggat ccaagtcaac 3632221 gcggtgaggt tcagcatcag ccacaggcac gcgcacactg cggcgatcgg aagtaacggc 3632281 acccacggag ctgtgaaccc ccgctgaagg tcgggtcggg tccggcgcag cacgaccact 3632341 ccggccgaga cgaggatgaa cgcgaacagt gtcccgacgt tgaccatctc ctcaagcttg 3632401 gtgatcggaa acaccgacgc cgtcgtggcc accaacaccg cgaccagcac cgtgacccgg 3632461 accggggtgc cgcgcgaacc ggtcttggcc aattgccgcg gcaccaagcc gtcgcgcgcc 3632521 atggcgaaca gcacgcggca ttgcccgagc atcaacacca tcaccaccgt ggtaagcccg 3632581 gccagcgcgc cgacggagat gatgccgctg gcccagtaca ccccgttggc ctggaacgcg 3632641 gtggccagat ttgccggccc gcggcccggt acggtccgca gttgggtgta tggaaccatg 3632701 cccgacagca ccaccgatac cgcgacgtag agaagggtca cgacccccag cgacgcgaga 3632761 atccctcgag ggacgtctcg ttgaggacgc ttggtctcct cggccatggt ggccacgatg 3632821 tcaaacccga taaacgcgaa gaacacgatc gatgccccgg ccagcacgcc gtaccatccg 3632881 tagtggctgc cttgggctcc ggtcagcaac gagaagacgg attgatcgag cccgccgccg 3632941 tggtgctgga cttcgggctc gggaatgaac ggcgagtagt tggcggccct gatgtagaag 3633001 gcaccgacga ccaccaccaa gacgaccacc gacaccttga ttgcggtgac caccgcggaa 3633061 aatctcgacg acaatttggt gcccaacgcg atcagggtcg ccaccaacgt gacgatcacg 3633121 agcgcacccc agtcgagctg cagcgatccg agatggcctg tgccattacc gaatccgaac 3633181 acggtgccca agtagctgga ccagcctttg gcgaccacgg ccgcacccat cgccagttcc 3633241 agcaccagat tccagccgat cacccaggcc aagaactccc cgaaggtggc ataagagaag 3633301 gtataggcgc tgccggccac cggcagcgtc gaggcgaact cggcgtagca cagcgcggcc 3633361 agcgcacagg tcgccgccgc gatcagaaac gatatccaga tggccgggcc ggtgatatcg 3633421 ccagcggtcg acgcggtaac cgtgaatatt ccggcgccaa tcaccaccga gacgccgaaa 3633481 acaaccaggt cccaccaggt gaggtccttg cgcagccgag tggtgggctc gtcggtgtcg 3633541 gcgattgact gttctaccga cttcatgcgc cgtcgaccgg ccatgcaccc gtcctctcgc 3633601 actcgttgtg accgcacagt actgggtact ctgcgaggat gacgggtcgc gtagggaacc 3633661 cgaaggacca cgccgtggtg atcggagcta gcatcgccgg gttgtgcgcc gcgcgggtgc 3633721 tctcggactt ctactccacg gtgacggttt tcgagcgcga cgagttgccg gaagcgccgg 3633781 cgaaccgggc cacggtccct caagaccgac acctgcacat gttgatggcc cgcggggcgc 3633841 aggaattcga cagcctgttc cccggcctgt tgcacgacat ggtggccgcg ggcgtgccca 3633901 tgcttgagaa ccggccggac tgtatctact tgggcgccgc cggccatgtc ctcgggacgg 3633961 ggcataccct gcgcaaggag ttcaccgcct acgtgcccag ccggccgcac ctggaatggc 3634021 agctgcggcg acgggtcctg cagctctcca acgtccagat tgtgcggcgc ctggtcaccg 3634081 agccacagtt cgagcgcagg cagcagcgag tggtcggcgt gctgctggat tcccctggta 3634141 gcggccaaga tcgggaacgc gaagagttca tagctgccga ccttgtcgtc gacgcagccg 3634201 gccggggtac ccgactgccg gtttggttga cgcagtgggg atatcggcgg ccggccgaag 3634261 acaccgtgga catcggcatc agctatgcca gccaccaatt tcgcattccc gacgggctga 3634321 tcgccgagaa ggtggtggtc gccggcgcct cacacgatca gtcgctgggg ctaggcatgc 3634381 tgtgctacga ggacggcacc tgggtcctca ccaccttcgg ggtggccgat gccaaaccgc 3634441 cgccgacttt cgacgagatg cgtgcactcg cggacaaact gctgccggcc cgcttcaccg 3634501 ccgcgctggc gcaagcccaa ccgatcggct gtccggcgtt tcatgctttc ccagccagca 3634561 gatggcgtcg ctacgacaag ctggaacgtt tcccgcgcgg aatcgtcccg ttcggcgatg 3634621 cggtggccag cttcaatccc accttcgggc agggcatgac gatgacctca ctgcaagccg 3634681 gccacctacg acgggcgctc aaagcccgca actcagctat gaaaggcgac ctggccgccg 3634741 aactcaatcg ggccaccgcc aagaccacct atccggtgtg gatgatgaac gcaatcggcg 3634801 acatcagttt ccaccacgcc accgctgagc cccttccccg atggtggcgc ccagccggtt 3634861 cgctgttcga ccaattcctc ggggccgcag aaaccgatcc tgttctcgcc gaatggtttc 3634921 tgcgacggtt ttcgctgctg gacagcctgt acatggtgcc gtcggtaccg atcatcggtc 3634981 gcgccattgc tcacaatctg cgattgtggc taaaagagca gcgtgagcgt cggcaacccg 3635041 tcacaacccg acggtcgccc tgaacagctt ggcgggttgg ccggcggtca gccggatcgg 3635101 gccgtcgtcg gccgccaccc aggcggccgt gccgcgctgt agcgtgagcg acccgcactt 3635161 cccgtgcacc gtcgccgaac cctcggtgca taacaagatc tgtggaccgt catggccgga 3635221 cgacgcgtcg acctcgtggc cgaggtgatc gccgtcgagc accagtagcg tggccgcgaa 3635281 ctcatcggtg ggcgtctcaa agaccagccc cagcccctcg cgccggatcg ggggccgcag 3635341 ccgagccttc ggcgtggggg cgaagtccag cacccgcaac aactcgggca catcgacgtg 3635401 cttaggggta agtccaccgc gtaacacgtt gtcggagttg gccatcactt ccacaccgaa 3635461 accacgcaca taggcgtgca ggttgccggc cggcaggaag atcgcctccc caggagccaa 3635521 gctgatgcgg ttgagcaaca acgccgccag cacaccggcg tcgccgggat aacgttcgcc 3635581 gagttccagc actgtcttgg cttcggcgcc aaattccgtt gcgccggagc tgacgtactg 3635641 gatagcgccg tccagcacgg caggcaccag cacgtcgatg tcgggctggg gtgcggtaat 3635701 ccaggtggtg aacagcgcac gcaaaccatc ggcatcggac ccctcgctca gcaagtcgat 3635761 gaacgggtcg aggtcggata cggccagcgc ccgcagcagc tcggtggtgc gagccgcctc 3635821 ccggaatccg gccagcgcct cgaacggctg cagcgccacc aataactctg gcttgtgact 3635881 ggtgtcgcgg tagttgcgga cgggtgagga caccggaatg cccattcgct cttcccgcag 3635941 gtagccctca accgcctgct cggcgctcgg atgggcctgc aacgatagtg gctcgtcggc 3636001 cgccaacacc ttgaccaaga acggcaacac atcgccgaat cgcgcgcgcg acgcggagcc 3636061 gagctgcccc tccggatccg cgaccaacgc ttcgagcaac gaggtttggc catgcggcgt 3636121 ctgcagccaa gccggatcac ccgggtgtgc accgaaccat agttcggcct cggggtgagc 3636181 ggccggcacc ggacgcccgg tgaattcggc gatagcggtg cgcgatcccc aagcgtaggt 3636241 gcgtaacgcg ccacgtagca gttccaccgg cgatctatcc tcgcaccagt cgcagataca 3636301 cggcggccat ctccagccga acggccaata ccgccccccc ggatcccacg ggggcgtcga 3636361 gcagctccgg cacatcctca gccgcgacca gataggcgtc atcgagcccg gcaacccgag 3636421 cggccaccac cgtccgctcg ccggccagcg ccagcgccaa cacccgcagc cgctgcggtg 3636481 ccggcccatc gatttcctcg tcatggaaca gcgcatccgg cggcgtcccg gcacgtagcg 3636541 ccacaaccgc atccgaaagc ctggtagcgg ccacaacctg gtttgcgatc cgcagcatga 3636601 ccgaactccc atgccgggcc agcgccagcg tcgcggcatt gtctccagcc agggccagct 3636661 ggcaaccgga aacgcgagcg gcaagtgcct tggccgggtt ggtgaacacc tctcggccgg 3636721 cgctgttgcg gagcgcctca gcatccagct cgtctgccag cgacgccaga tcgatgcgca 3636781 gcttgggatc cacggtttgc aaggccgcca gacccgcggc caggtaccgg gacaacccga 3636841 actcgtcagg aacccgcagc cgcggttcca gcaccgcgac gcgaccggcc gtgctgtccc 3636901 gcagcggacc ctcatacggt gccaccacga caacccgcgc gcccctgcgc accccgatcg 3636961 cggcggcccc gaccagcgcc gggtcgccgg ggtcgtcgcc ggcaacgatc agcacgtcaa 3637021 gcggcccgac ccagggcggc gccgcactgg cgagcacgat cggctcggcg gccccggcac 3637081 ctagcgtcga ggccaggatg gtcccggcgg tctcagcggt cccccggccg gtcacccaga 3637141 tcaccgagcg gggacggtca ctaccgcgca gcaagtccag ttcgccctcg tcggccgcgg 3637201 cagcgatggc acgcacctgt gcgccggcca tcgatgcggc ccgcagcagg gcaccccggt 3637261 cggcagcgat caggccttcg gtgtcctcga gatcgatcgc ccgggcgacg ttcacggtcc 3637321 ggccttcgca tgtgcgctct gggcagcgat ttcagcgctg acctgacgta ccaccgcgtc 3637381 aacgtccccg acgctgcggc cctccacatt gagccgcagc aacggctcgg tgtttgagct 3637441 gcgcaggttg aaccagctgt cgtcgcctaa gtcaacggtc acgccatcga ggtgatcaat 3637501 actgacaatc cggttgccga acgatttcaa cacggcctcc acacaggccg aagagtcgac 3637561 cacggtgaag ttgatctcgc cggaggattc atagcgttgg tagtccgcgg tcaactccga 3637621 cagcggtctg ctctgctcac cgagggcggc cagcacatgc agtgcggcca gcattccgga 3637681 atcggcaccc cagaagtcac ggaagtaata gtgcgccgaa tgttcaccac cgaaaatcgc 3637741 cccggtctcg gccatcagtg ccttgatata ggagtgccca acccgcgaac gcagcggcgt 3637801 accgccgcgc tcggcgacca gctcgggcac cgcgcgggag gtgatcacgt tgtggatgat 3637861 ggtggcgccg atctcccggt tgagttcccg cgcggccacc aatgcggtaa ccgtcgacgg 3637921 cgagaccggc tggccgcgtt cgtcgaccac gaagcagcgg tcggcgtcgc cgtcgaaagc 3637981 aagcccgata tcggcgccgg tgtcacgcac ataggcctgc agatccacca ggttcgccgg 3638041 gtccagcgga ttggcctcgt gattgggaaa cgatccgtcg agctcaaaat acgagggcaa 3638101 caaggtgatc gagtcgatca ccccaaggac cgccggcgcg gtgtgaccgg ccatgccgtt 3638161 gccggcgtcc acggccaccc gcaacggacg tagccccgag gtgtccacca gcgatcgcag 3638221 gaacgccccg tagtcgacca gcacgtcctg gtcggcaatg gttccgggcg tcccgtcgta 3638281 tcgtgcgacg ccggcgatca ggtcgtcacg gatggcggtc agcccggtat cggctccgac 3638341 tggtttggcg gcggcccgac acatcttgat gccgttgtat gccgccgggt tgtggctcgc 3638401 ggtgaacatc gctcccgggc agtccaacag ccccgaggcg aaataaagct gatcggtgga 3638461 cgccaaacca actcgcacca cgtcgaggcc ctgcccggtc accccggccg cgaacgcgtc 3638521 ggccagcgac ggcgaactgt cccgcatgtc gtgaccgatc accactggtc gcgcatcctc 3638581 ggtccgcatc aaccgcgcga atgcggcgcc gagatcggta accagcgact cgtcgatctc 3638641 ttcgccgacc agcccgcgta cgtcgtaagc cttgataacg cggtccacag ccgcggcggg 3638701 ccaagacatg cgcgggctcc tgacaaccta gattttctgc gactcttggc cgccagccta 3638761 tcggcccgcg aacgacgcgg gccgaatcgg tctcgaacag catgggaaga ctagtcggcg 3638821 gggtcgggca acacccgtag atgtccgcgc cggcgcccgg ccccaggctc gggcggcgca 3638881 agcacgccac ccccggtggg cgctccggtc gccgcagcgg gaaaatcgtc gaaaccatgc 3638941 agtggcgcgc catttccgcc tggatgatgc cggcgccccg cgctcgggcc accctcgcgc 3639001 accgcgtccg ccagggccac caggtcgtcc tcgtcggggt ggctgggcag cggcccggcg 3639061 tgacgcacga gttcccaccc gcgcggtgca gtgatgcgac cggcatggcc gacacacaga 3639121 tcccacgaat ggggctcccg cgcagtggca agcggaccga tcaccgccgt cgagtccgag 3639181 tagacgaacg tcaacgtcgc cactgcatag tgcggacacc cgggccggca gcagcgacgg 3639241 ggtacgttca cgaccgaaag gctatcgtgc accaacgccg ccgaagcgcc ggacacgcgc 3639301 atccgtccac gccgcgatgt ttaaccgtta ccatcggcgc gtgagcgatt cccgcagctc 3639361 ctcgtggagc cgtcggtcgc ggggcgggtc ggtagcgcgg cgagcaatcc ggcggggccg 3639421 cgagatgcgc gggccactgc tgccgccgac agtcccgggg tggcgcagcc gggccgagcg 3639481 gttcgacatg gcagtgctgg aagcctacga acccatcgag cgacgctggc aggagcgggt 3639541 gtcgcagctg gacatcgcgg tcgacgagat cccgaggatc gcagccaaag atcccgaaag 3639601 tgtgcagtgg ccgccggaag tcatcgccga cggaccgatc gcgctggccc ggctcatccc 3639661 ggccggcgtg gacgtccgcg gaaatgcgac gcgcgcgcga atcgtcttgt ttcgcaaacc 3639721 aattgaacga cgggccaagg acaccgagga acttggtgaa ttgctgcacg aaatcctggt 3639781 ggcccaggtg gccatctacc tggacgtcga cccatccgtc atcgacccga cgatcgacga 3639841 ctagttcgcg ccgccgactc cggcggccgg gtcagatgat cccgcgtttg aggcggcggc 3639901 gctcgcgttc ggaaagacca ccccagatgc cgaaccgctc gtcatgagcc agggcgtact 3639961 ccagacactc gtgccgcacc tcgcagccca tgcaaatctt cttggcctca cgcgtggagc 3640021 cgcccttctc cgggaagaac gcttcgggat ccgtttgcgc acatagcgca cggtcctgcc 3640081 attggtcggt ggcttccggc ggcagaggtt cctcgaatgg cgccggcgcc tcgggaacca 3640141 aactcagatg cggtcgcaaa actgccgttg ctgatgcggt agccgatccg gtagtggtat 3640201 gcggtgtgcc tcccattaca ccccgaaggt gttcatagga catgcctccg cctcctcact 3640261 cgatagatag tgaaatggtt tcccactgtt ttgatgtaca gttaacccaa ttcgaacaag 3640321 tgatcgaatc tcggtctgcg acaccgaaac cggccggcca accgcgaaat gacactgatg 3640381 tgattagaca caagttgggg acgcgggtca agtgtgccgg cgcatttcca tatcatctcg 3640441 taataaaatt tccgcggttc tgttgtggtt gggtcccggc gtgtcgagcg tgactcgtaa 3640501 ccaacgtttg gtgatgggcg ccgggaggta ctgtcctgcg atgtgaaggt caccgttctg 3640561 gccggtggag tcggcggcgc ccgcttcctg ctcggggtcc agcagctgct cggcctgggc 3640621 cagtttgctg ccaattctgc ccactcggac gccgaccacc aactgagcgc tgtcgtcaac 3640681 gtcggcgacg acgcctggat ccacgggctg cgtgtctgcc cggatctgga cacctgcatg 3640741 tataccctgg gcggcggggt ggacccccag cgcggctggg gccagcgtga cgaaacttgg 3640801 cacgccatgc aggaactggt gcgctatggc gtgcagcccg actggttcga gctcggggac 3640861 cgcgatctgg ccacccatct ggtgcgcacc cagatgctgc aggccggcta ccccctgtca 3640921 cagatcaccg aggccctatg cgatcgctgg caaccgggcg cccgcttgct gcctgccacc 3640981 gacgaccgtt gcgaaaccca tgtagtgatc accgacccgg tcgacgaaag ccgcaaggcg 3641041 atccattttc aggagtggtg ggtgcgctac cgtgcccagg tgccgacgca cagctttgct 3641101 tttgtcggcg ctgaaaagtc cagcgctgca accgaagcga tcgccgccct ggccgacgcc 3641161 gacatcatca tgctggcgcc gtctaatccg gtggtcagca tcggcgccat cctggccgtc 3641221 cccgggattc gcgcggcgtt gcgggaagca accgcaccga tcgtcggcta ctcgccgatc 3641281 atcggcgaaa agccgttgcg cggcatggcc gatacgtgcc tttcggttat cggggtggat 3641341 tccaccgcgg ccgctgtggg ccggcactac ggcgcgcggt gcgccaccgg gatactggac 3641401 tgctggctgg tgcacgacgg cgaccacgct gagattgacg gggtgacggt gcggtcggtg 3641461 ccgctgctga tgaccgaccc gaacgcgacg gctgagatgg ttcgcgccgg gtgcgacctt 3641521 gcgggagtgg tagcttgacc ggccccgaac atggctccgc ctcgaccatc gagatcctgc 3641581 ccgtcatcgg gctgcccgaa ttccgtcccg gcgacgatct gagcgccgcc gtcgccgcgg 3641641 cggcaccgtg gctacgcgac ggtgacgtcg tggtggttac cagcaaggtg gtgtccaaat 3641701 gcgagggccg gctggttccg gctcccgaag accccgagca aagagaccga ttgcgccgca 3641761 agctgatcga ggatgaggca gtgcgcgtgt tggcgcgcaa ggaccgcacg ttgatcaccg 3641821 agaatcgact cgggctggtt caggcggccg ccggcgtgga cggatccaac gtcggccggt 3641881 ccgagttagc gctgctgccg gtcgatcctg acgccagtgc cgcaaccttg cgcgccgggc 3641941 tgcgcgagcg gctcggcgtc accgtcgccg tggtcatcac cgacaccatg ggacgcgcct 3642001 ggcgcaacgg ccagaccgat gccgcagtcg gcgctgccgg tctggcggtg ctgcgcaact 3642061 atgccggtgt ccgcgaccca tacggcaatg agttggtggt caccgaggtc gcagtcgccg 3642121 acgagatcgc cgcggccgcc gacttggtca aaggcaaact gaccgcgacg ccggtggcgg 3642181 tggtgcgtgg gttcggcgtg tccgacgacg gctcgacagc ccggcaactg ctgcggccgg 3642241 gcgccaacga cctgttctgg ctcgggaccg ccgaagcgct cgagctgggt cgccagcaag 3642301 cccaactgtt gcgcaggtcc gttcgccggt ttagcaccga tccggtgccg ggcgacctcg 3642361 tcgaggctgc ggtcgccgag gccctcaccg cgccagcccc acatcacacc cggccgaccc 3642421 gattcgtgtg gctgcagaca ccggccatcc gcgcgcggct gctagatcgg atgaaagaca 3642481 agtggcggtc tgatctcacc agtgacggct tgcccgccga cgcgatagaa cgccgggtgg 3642541 cacgcggcca gatcctctat gacgcacccg aagtcgtcat accgatgctg gtgcccgacg 3642601 gagcacacag ctaccccgat gccgcccgca ccgacgccga gcacaccatg ttcacggtcg 3642661 ccgtcggagc ggccgtacaa gccttgctgg tcgcgctggc cgtgcgcggg ctgggcagtt 3642721 gctggatcgg ctcgacgatc tttgccgctg acctggtccg cgacgagctg gacctgccag 3642781 tcgactggga gccgttgggc gccatcgcga tcggatatgc cgacgagccg tccgggttgc 3642841 gcgacccggt gcctgccgcc gatttgctga tcctgaagtg acattcgctc tagcgacgat 3642901 aggctaccca gacatggcgg tcctgcagcc gatgccaacc atcaacctcc cgacggatca 3642961 attcaccgcg ttcggtcaaa agtggctcct cggctcgaaa ttctccaaga aggacgacag 3643021 gacttaggcg ccgtgataga tgccgctgtg ggcggcgcac tgtcggtgat gctcggcaac 3643081 atcccattgg tggttccgaa cgccaaccag ctgtaacctt cccaagcgcc gacgtgtacc 3643141 gctgctatcc ggcccgattc cagggacagc caccccatgc aacctagtca tccgacgcgc 3643201 cctggtgcgg tcatcagata tgtcggtagc tcccttgata cttgtcccat gacgacgttc 3643261 gccggcaaaa cggctgcgtc cgctgacaag gtgcgcgggg gctactacac gccgccggcg 3643321 gtggcccgat tccttgccca ctgggttcac caggcggggc cgaagatcct cgaaccatcc 3643381 tgcggcgatg gccgaatcct gcgcgaactc tccgccatca cagaccacgc gcacggtgtg 3643441 gaactcgttg cgcgcgaggc gaaaaagtcg cgggacttcg cgtccgtcga cactgagaac 3643501 ctttttacct ggctgcacaa gacccaactc ggcagctggg atggcgttgc cggcaacccg 3643561 ccctacatcc gcttcggaaa ctgggcatcc gaacaacggg atccggcact cgaattgatg 3643621 cggcgtgtgg gcctacgacc gaccaaactg accaatgcct gggtcccgtt tgtcgtggcg 3643681 agcacgacgc tagcgcgtga cggcggccga gtgggcctgg tggtcccggc ggaattgctt 3643741 caagtcacct acgcggcgca gctacgcgaa ttcctgctga gccgctatcg ggagatcacc 3643801 ctggttacct tcgagcggct ggtgttcgac ggaatcctgc aggaagttgt gctgttctgc 3643861 ggcgtcgtcg gtcccggtcc tgcacacata cgcaccgtca ggctcggcga tgcgaacgat 3643921 ctgaacgcgc tgggggacaa ggacttcacc aatgagtcag cgccggcgct tctccacgaa 3643981 aaggagaagt ggaccaagta cttcctcgac cccgctcaaa tccggctact gcgaggactc 3644041 aaacagtccg ccactatgat caggctcggc gaactggccg acgtggatgt gggcatcgtg 3644101 accggccgca acagcttctt cacgttcacc gatgccaagg cacaagcgct gggattgcga 3644161 gcgcactgcg ttcccctggt ctctcgcagc gcccaactca gcgggctgat ctatgacgag 3644221 gattgccggg catgcgatgt cgccggcaac caccgaacgt ggctactcga cgccgcggac 3644281 tatccaaccg atccagctct cgtcgctcac atcaccgcgg gtgaagcggc cggcgtccac 3644341 ctcggctaca agtgctcgat ccgcaagcca tggtggagca caccatcgct gtggatgccc 3644401 gacctcttta tgctgcgcca gatccacttc gccccgcggc tgaccgtcaa cgctgccgcg 3644461 gcgaccagca ccgataccgt gcaccgggtc cggctcgacc cgaacgtcga tccggcaact 3644521 cttgccgcgg tgttccacaa cagcgcgaca ttcgcgttcg ccgagatcat gggccgcagt 3644581 tatgggggcg gcatcttgga gttggagcct agggaagccg agcaactacc tatgccaccg 3644641 ccggcgtacg ggagcgcaga acttgcccag gatgttgatc tcctgctgaa agcaaacgag 3644701 atcgacaagg cgctcgacgt cgtggaccgt cacgttctga tcgacgggct cggcttgtcg 3644761 ccgcgcctgg tcgcaggttg ccgagcggca tggctcacgc tccgcgaccg caggaccaag 3644821 cgcggatctc ggcgataacc gcggcgggtg agcgcctcgc gtgcccggcc aacgatgtcg 3644881 atctcggcgc aagaagctca aacgtcggac gagtaacgga tcccgccgtc gggaagaaag 3644941 acaccgggcc atacccgggc accacttaac aactcgcagc gcgcgccgat gtcggccccg 3645001 tcaccgatca caccgtcgcg gatcaacgcc cgcggtccga tgcgagcacc gaagccgatg 3645061 atcgaacgct cgatcacgca cccggcctcc acccggacac catcgaagat gaccgcgccg 3645121 tccaatctgg tgccggggcc gatttcggca ccacgcccca cgacggtgcc gccaatcagc 3645181 aacgcaccgg gagataccgc cgcaccgtcg tgcaccaact gctcaccgcg gtgaccacgc 3645241 aaggccggag acggggcgat gccgcgcacc agatccgccg atccgcgaac gaagtcttcc 3645301 ggtgtgccca tgtcccgcca atagctggca tcgacatagc cgtagatctt gcagtcgccg 3645361 tcggcgagca aggccgggaa cacctcgcgt tccaccgaaa cctcccggcc ctgcggaatc 3645421 cggtcgatga cgttgcgttc gaagacatag cagccggcat tgatctggtc ggtcggcgga 3645481 tcctccgtct tctccagaaa ggcgactacg cggtcctcct cgtcggtggg tacgcagccg 3645541 aatgcccgcg ggtcgcccac ccgcaccagt tgcagcgtga catcggctcg attgcttcgg 3645601 tggaagtcca gcagttgggc cagatccgcg cccgagagca catcgccgtt aaacaccatc 3645661 gcggtgtcgt tgcgcagctt gccggcaacg ttggcgatgc cgccgccagt ccccaaggga 3645721 tgctcctcgg tcacgtattc gatctgtagg cccagtgcgg acccgtcgcc gaactccgct 3645781 tcgaagactg cgggtttgta ggacgtaccc aggatcacgt gctcgatgcc cgctgcggcg 3645841 atccgcgaca gcagatgggt gaggaacggc agtccggcgg taggcagcat tggcttgggc 3645901 gccgacagcg tcaacggccg cagtcgggta cccttgccac cgaccaggac caccgcatcg 3645961 acttggtgag ttgccaactc agtgccgccc ttctaccagc ttcagtttcc gtctgcggga 3646021 cctgcgcagt gaactgcgca ccatgaggtg ggaacgcagc gccagtgatc cccgcagggt 3646081 ccagcgcagc ggagcccgcc accaaccaga atgtcggtcg gctaagaaga tataggtgct 3646141 tttgtgatgg gcggccagat ggcttgccgg gtcgcgaccc gtcgaatgcg ccttgtggtg 3646201 cagaacctcg gctgacggca catacaccga cagccaaccg gctttgccaa gccggtcgcc 3646261 aaggtcgacg tcctccatgt acatgaagta acgttcgtcg aatccgccga cctggccaaa 3646321 cgccgaccgg cgcaccagta ggcaagaccc cgacaaccaa cccaccggcc gttcactggg 3646381 ctccagccgc tcctgccggt aggccgtcgt ccacggattg cgcggccaga acggcccgag 3646441 cactgcgtgc atgccgccgc ggatcaggct gggcatctgc cgcgccgacg ggtacaccga 3646501 cccgtcgggg tcccgaatca gcgggcccag cgcgcccgcg cggggccagc gggaggcggc 3646561 gtccagtagt gcatcgatac tgcccgggcc ccattgcacg tccgggttgg ccacgatcac 3646621 ccagtcatcg acccagggtt cgccggcatc gcccgccatt tcaccgagct gggcgatcgt 3646681 ccgattcacc gcggttccgt acccgaggtt ggcccctgtg ggcagcagcc gcacgttggg 3646741 gtagcgctgc accgcggcct gcggggtgcc gtcggtggag ccgttgtctg ccaacagcac 3646801 gctgaccggc cgctcggtgg ccagcgacaa cgacgccagg aaccgctcta gatggggccc 3646861 cggcgagtag gtcaccgcta ccaccggcag gacgtcagtc acgcgttgag ggtaaccgtc 3646921 gatcgatcga agttgagttc gcaggtgctg ccagcgccgt ggccagtgcg ctgcgccagt 3646981 gccgtagcgg cgtcaagccc gccagcgccc actgcctgct cgacagcgcg gaatagctcg 3647041 aacgcggcgc gggccgcgga aactgcgcgc tgctgaccgg acgcacccgc tgtgggtcgg 3647101 caccgcattc ttcgaacacc gcgcgggctt gaccgaaccg ggagaccacg ccctcgttag 3647161 cggcgtgcaa cacgcgtccg cgcacgcccg cgtcggccaa cgccagcagc gcctcggcca 3647221 ggtcggcgac gtaggtcggc gacccggtct ggtcgtcgac cacatccacc cgaccgtgtc 3647281 cggcggccag ccggcgcatg acggcgacga aatccttgcc ggtcccgccg gtgtagaccc 3647341 aggcggtccg taccacggca gcctccggga acgctgccag cacagcctgc tcgccggcga 3647401 gtttgctgcg ggcatacacg ccctgcggcg cggtttcatc ggtgggctcg tagggccggg 3647461 gctcggcgcc gccgaagtcg ccatcgaata cgtagtcggt ggagacgtgg attaaccgag 3647521 cacccacacg agcgcacgca cgggcgaggt gttgcgggcc agtggcattg accgcatagg 3647581 cgactgcctc attgctctcg gcgccgtcga cgtcggtgta ggcggcgcaa ttgatcacca 3647641 cgtcaccgtg tcggatgatc cgctcggccg cagcggggtc ggtgatatcc cactgcgagg 3647701 aagtcagcgc cagcatatcg cggccttccc gggcggcctg tgccgtcaga tggctgccca 3647761 gctgcccgcc cgcaccggtg atgactagcc tttctgacct gcccgccatg tgtttgagtc 3647821 tggcacgcct cgggcacgcc ggggttggct acccgacagg gcgccgttac acaagtagtc 3647881 tagtgtgatg tctgcgcaac gtgtggttcg tacggttcgt accgctcggg ctatttccac 3647941 ggcactggcc gtcgcgatcg tccttggcac cggggtggcg tggagcagtg tccggtcgtt 3648001 cgaagacggc atcttccaca tgtcggcgcc ctcgctgggg cacggcggcg acgacggcgc 3648061 gatcgacatt ttgctggtcg gcctggacag ccgtaccgac gcgcacggca acccgttgag 3648121 cgccgaggaa ttggcgacat tgcacgccgg cgacgaggaa gccaccaaca ccgacaccat 3648181 catcctgatc cgggtaccca acaacggaaa gtcggcgacc gcaatctcta taccgcggga 3648241 ctcctacgtc gcggctcccg gtctgggtaa gaccaagatc aacggcgtct acgggcaaac 3648301 cagagagacc aagcgggccg gcctggtcca agccggtgcc tcgccgaccg aagcggccgc 3648361 cgccggcacc gaggccgggc gtgaggcgtt gatcaagacg gtcgccgatc tgaccggcgt 3648421 caccgtcgac cactacgccg agatcgggct gctcggtttc gcgttgatcg ccgacgcact 3648481 cggcggcgtc gacgtctgcc tcaaagagcc tgtatacgaa ccactttcgg gtgccgattt 3648541 tccagccggg cggcaaaagc tcaacggtcc gcaagcgctc agcttcgttc gccagcggca 3648601 tgatctgccc cgcggcgacc tggaccgggt ggtacgtcag caggcggtga tggcggcgtt 3648661 ggcccaccgg gtcatctccg gacagacgct atccagcccc gccacgctga agcggttgga 3648721 gcaggccgtg cagcgctcgg tggtgctgtc ctccgggtgg gacatcatgg atttcgtccg 3648781 ccaattgcag aagctggccg gcggtaacgt tgccttcgcc accatcccgg tgctcgacgg 3648841 cgccggctgg agcgacgacg gcatgcaaag cgtggtgcgg gtggatccgc gtcaggtgca 3648901 ggactgggtc gtcggcctgc tgcacgagca ggaccagggc aagaccgacg agctggccta 3648961 cacacccgcc aagaccacgg ccaacgtggt caacgacacc gatatcaacg ggcttgcggc 3649021 agcggtgtca aaggtgttga gctccaaggg gtttaccacc ggatccgtcg gcaacaacga 3649081 cggcgaccac gtgcctggca gccaggtgcg ggccgcaaag gccgacgacc tgggcgcaca 3649141 gcaggtcgcc aaggaactgg gcgggttgcc ggtggtcgcc gatgcgtcaa tcgcgcctgg 3649201 gtcggtgcgg gtggtgctgg ccaacgacta cagcggtccg ggctccgggc tggggggtag 3649261 tgatccgaac ggcgtcgtat cgccggcccg cgcgttcaac ctcgggtccg ccgacgacac 3649321 gactcccccg ccgtcgccaa tccttaccgc cggctccgac gcgccggagt gcatcaactg 3649381 accacaccga ccaccctgag cggggcgatc ctggatccga tgctgcgcgc cgacccggtc 3649441 ggcccgcgca tcacctacta tgacgatgcc accggtgagc gcatcgagct atccgcggtg 3649501 acactggcta actgggccgc caagaccggc aacctgttgc gcgacgagct ggcggccgga 3649561 cccgccagcc gagtcgcgat cctattaccg gcccattggc agaccgcggc ggtgttgttc 3649621 ggcgtgtggt ggatcggtgc gcaagcgata ctcgacgatt ctcccgccga tgtggcactg 3649681 tgcaccgccg accgtctggc cgaagccgac gccgtcgtca acagcgcggc ggtagccggc 3649741 gaggtagccg tgctgtcgct ggatccattc ggtcgaccgg caaccggcct gccggtcggc 3649801 gtcaccgact atgcgaccgc ggtgcgggta cacggcgacc agatagttcc cgaacacaac 3649861 cccggtccgg tgcttgccgg tagatccgtc gagcagatcc tgcgcgactg cgcggcgtcc 3649921 gcggccgcca ggggtttgac ggcggcggat cgggtgctgt ccaccgcttc ctgggccgga 3649981 cccgatgagt tggtggacgg cctgctggcg atcctggccg ccggtgcgtc gttggtgcag 3650041 gtggccaatc ccgatccggc gatgctgcag cgcaggattg cgaccgaaaa ggtcacccgc 3650101 gtcctgtgac gcaggccgcg tccagcaggc gaaggcatca gagcaataca tattgatatc 3650161 gcgatatata gatgttaatg tcactgcaac gagctgccgc tgcaattaca gacccggaag 3650221 aaaggtacag gcaatggcga tacaagtgtt cttggcgaag gcgacaacga cggtgatcac 3650281 cggcttggcc ggcgtgaccg cctacgagat cttaaaaaag gccgcggcca aagcgccgct 3650341 tcgtcagacc gcggtatcgg cagcagcgct gggtctgcgc ggaacccgca aggccgagga 3650401 agccgcggaa tcggcccgcc taaaggtggc cgacgtgatg gccgaggctc gtgagcgcat 3650461 cggcgaggaa tcgcccactc cagcgatcag cgacctgcac gaccacgacc actgagcgcc 3650521 tcgccatgac cctggaagtg gtatcggacg cggccggacg catgcgggtc aaagtcgact 3650581 gggtccgttg cgattcccgg cgcgcggtcg cggtcgaaga ggccgttgcc aagcagaacg 3650641 gtgtgcgcgt cgtgcacgcc tacccgcgca ccgggtccgt ggtcgtgtgg tattcaccca 3650701 gacgcgccga ccgcgcggcg gtgctggcgg cgatcaaggg cgccgcgcac gtcgccgccg 3650761 aactgatccc cgcgcgtgcg ccgcactcgg ccgagatccg caacaccgac gtgctccgga 3650821 tggtcatcgg cggggtggca ctggccttgc tcggggtgcg ccgctacgtg ttcgcgcggc 3650881 caccgctgct cggaaccacc gggcggacgg tggccaccgg tgtcaccatt ttcaccgggt 3650941 atccgttcct gcgtggcgcg ctgcgctcgc tgcgctccgg aaaggccggc accgatgccc 3651001 tggtctccgc ggcgacggtg gcaagcctca tcctgcgcga gaacgtggtc gcactcaccg 3651061 tcctgtggtt gctcaacatc ggtgagtacc tgcaggatct gacgctgcgg cggacccggc 3651121 gggccatctc ggagctgctg cgcggcaacc aggacacggc ctgggtgcgc ctcaccgatc 3651181 cttctgcagg ctccgacgcg gccaccgaaa tccaggtccc gatcgacacc gtgcagatcg 3651241 gtgacgaggt ggtggtccac gagcacgtcg cgataccggt cgacggtgag gtggtcgacg 3651301 gcgaagcgat cgtcaatcag tccgcgatca ccggggaaaa cctgccggtc agcgtcgtgg 3651361 tcggaacgcg cgtgcacgcc ggttcggtcg tggtgcgcgg acgcgtggtg gtgcgcgccc 3651421 acgcggtagg caaccaaacc accatcggtc gcatcattag cagggtcgaa gaggctcagc 3651481 tcgaccgggc acccatccag acggtgggcg agaacttctc ccgccgcttc gttcccacct 3651541 cgttcatcgt ctcggccatc gcgttgctga tcaccggcga cgtgcggcgc gcgatgacca 3651601 tgttgttgat cgcatgcccg tgcgcggtgg gactgtccac cccgaccgcg atcagcgcag 3651661 cgatcggcaa cggcgcgcgc cgtggcatcc tgatcaaggg cggatcccac ctcgagcagg 3651721 cgggccgcgt cgacgccatc gtgttcgaca agaccgggac gttgaccgtg ggccgccccg 3651781 tggtcaccaa tatcgttgcc atgcataaag attgggagcc cgagcaagtg ctggcctatg 3651841 ccgccagctc ggagatccac tcacgtcatc cgctggccga ggcggtgatc cgctcgacgg 3651901 aggaacgccg catcagcatc ccaccacacg aggagtgcga ggtgctggtc ggcctgggca 3651961 tgcggacctg ggccgacggt cggaccctgc tgctgggcag tccgtcgttg ctgcgcgccg 3652021 aaaaagttcg ggtgtccaag aaggcgtcgg agtgggtcga caagctgcgc cgccaggcgg 3652081 agaccccgct gctgctcgcg gtggacggca cgctggtcgg cctgatcagc ctgcgcgacg 3652141 aggtgcgtcc ggaggcggcc caggtgctga cgaagctgcg ggccaatggg attcgccgga 3652201 tcgtcatgct caccggcgac cacccggaga tcgcccaggt tgtcgccgac gaactgggga 3652261 ttgatgagtg gcgcgccgag gtcatgccgg aggacaagct cgcggcggtg cgcgagctgc 3652321 aggacgacgg ctacgtcgtc gggatggtcg gcgacggcat caacgacgcc ccggcgctgg 3652381 ccgccgccga tatcgggatc gccatgggcc ttgccggaac cgacgtcgcc gtcgagaccg 3652441 ccgatgtcgc gctggccaac gacgacctgc accgcctgct cgacgttggg gacctgggcg 3652501 agcgggcagt ggatgtaatc cggcagaact acggcatgtc catcgccgtc aacgcggccg 3652561 ggctgctgat cggcgcgggc ggtgcgctct cgccggtgct ggcggcgatc ctgcacaacg 3652621 cgtcgtcggt ggcggtggtg gccaacagtt cccggttgat ccgctaccgc ctggaccgct 3652681 agcagccgca gccgtgacca cgccaggtgc ggatgccctg ccagaccgcg ataccggcga 3652741 tggccagccc gatcgcgggg tcaatccacc agccgttcga ccacacggca gtgatcgcca 3652801 gcccaagcag aaccgcggcg gcctgagcag cacacaggta gttctgggtg ccctcgcccg 3652861 cggtggcccc cgatcccagc cgctcaccca ctcggtggtt ggcccagccc aggaccggca 3652921 tcagcagcag ggcgatggcc gtcagtccga tgccgatcac cgaggtctcg gcacgatgct 3652981 cgccggctag gtggcggatg gattcggcaa cgaggtaggg ggccgtcagc caaaaagaca 3653041 ccgcaactcc acgctgtgcg cggtgctccg cggtcgcgga ccaagtgcgg tcgccggtga 3653101 accgccagag caccatcgcg ctggccaggc cctcggatcc gccacccagc gcccacccgg 3653161 tcaacgcgac ggatccgacc gcaataccct gccacagccc cacggcacct tcggtgagca 3653221 ataccgccag gctgacccac gccagccagc gggcccaccg aacgttccgc tgccattcgg 3653281 cctctcgcgc caccgacacg ggcgaatcca gcgtggattc atcgcggtgt tccgtcgtcg 3653341 tctccatccc gacgatggta gaggcaagac atgccgggcg gtcgccgcgg cgtcgcgaac 3653401 ccgtatggtt cagggaggat gccgcacgcc agggaaggtc accaccgatg ccgaccagca 3653461 accccgccaa accacttgac gggtttcggg tattggattt cacccagaac gtggccgggc 3653521 cgctggccgg gcaggtgctg gtcgacctgg gggctgaagt catcaaggtg gaggcgcccg 3653581 gcggtgaagc ggcccgtcag atcacctcgg tgttacccgg acgcccgccc ctggccacct 3653641 actttctgcc caacaatcgt ggcaagaagt cggtgacggt ggacctaacc accgagcagg 3653701 ccaagcagca gatgctgcgg ctcgcggaca ccgccgacgt tgtcttggag gcgtttcggc 3653761 ccggcaccat ggaaaagctg ggcctaggcc ctgatgactt gcgctctcgt aaccccaacc 3653821 tgatctacgc gcgcctaacc gcttacggcg gcaacggccc gcacggcagc cggccgggaa 3653881 tcgacctggt ggtggccgcc gaggccggca tgaccaccgg aatgcccacg cctgagggca 3653941 agccacagat catcccattt cagctcgtcg acaacgccag cggtcacgtg ctggcccagg 3654001 ccgtgctggc cgcgctgctg caccgcgagc ggaacggggt ggccgacgtc gtccaggtcg 3654061 cgatgtacga cgtcgcggtg ggactacaag ccaaccagct gatgatgcat ctcaatcggg 3654121 ccgctagcga ccagccgaag cctgaaccgg caccgaaggc caagcggcgc aagggagtcg 3654181 gcttcgctac ccagccatcg gacgcgtttc gcaccgccga tgggtacatc gtcatcagcg 3654241 catatgtgcc caaacactgg cagaagctgt gctacctcat cggccggcct gacctcgttg 3654301 aagatcaacg atttgccgaa caacgctccc ggtcgatcaa ctacgccgag ttgaccgccg 3654361 agttggaatt ggcactggcc agcaagaccg ccaccgaatg ggtccagttg ctgcaggcaa 3654421 acggcctcat ggcctgcctc gcccatacct ggaaacaggt cgtcgacacc ccccttttcg 3654481 ccgagaacga cctcaccctg gaagtcggtc gcggggcgga caccatcacg gtgatccgca 3654541 caccggcgcg ctacgccagc ttccgcgcgg tcgtcaccga tcccccgccc accgccggcg 3654601 aacacaatgc cgtgtttctg gcccggccct gacgctgtga ccattccgag gagtcaacac 3654661 atgagcaccg cagtcaacag ctgcaccgag gcgcccgcat cgcgatcaca gtggatgctg 3654721 gctaatctgc ggcacgatgt tcccgcatca cttgtcgtct tccttgttgc gttgccactt 3654781 tcgctgggga tcgcgatcgc ctccggggcc ccgataatcg ccggtgtgat cgccgccgtc 3654841 gtaggcggca ttgtcgccgg ggcggtcggt gggtcgccgg ttcaggtcag cggcccggcc 3654901 gcgggtctga ccgtggtggt cgccgagctg atcgatgagc tcggttggcc gatgctgtgt 3654961 ctgatgacga tcgccgcggg tgcactgcag atcgtgttcg gcctaagtcg gatggcgcgc 3655021 gccgcgctgg ccatcgcccc ggtcgtggtg cacgccatgc tggccggcat cggtatcacc 3655081 atcgcgctgc agcaaattca tgttctgctc ggtggtacgt cgcacagctc ggcgtggcgg 3655141 aacatcgtag cgttgccgga cggcatcctc catcacgaac tgcacgaagt gatcgtcggc 3655201 gggacggtta tcgcgatcct gttgatgtgg tcaaagctgc ccgccaaggt gcgtatcatt 3655261 cccggcccac tggtagccat cgcgggcgcg accgtgcttg cgttgctacc cgtgctacaa 3655321 accgaacgaa tcgacctgca gggcaacttc ttcgacgcga ttggcttgcc caaacttgcc 3655381 gaaatgtccc cgggaggaca gccgtggtct catgagatca gcgccatcgc gctcggtgtc 3655441 ctcaccattg cgctgatcgc aagcgtcgaa tcgctgctgt cggcggtcgg tgtcgacaag 3655501 ctgcatcacg gcccgcgcac cgacttcaac cgggagatgg tcgggcaggg cagcgcgaac 3655561 gtggtgtccg gattgctcgg cgggctgccc atcaccggtg tcatcgtgcg cagctcggcc 3655621 aacgtggccg ccggcgcccg aacccggatg tcgacgatcc tgcacggagt gtggatcctg 3655681 ctgtttgcgt cactgttcac caacctggtg gaactgattc ccaaggcggc gctggccggc 3655741 ctgctcatcg tgatcggtgc ccagctggtc aagctggcgc acatcaaact agcttggcgc 3655801 acaggaaatt tcgtaatcta cgccatcacc atcgtgtgtg tggtgttcct caatctgctg 3655861 gaaggcgtgg ccatcgggct ggtcgtggcg atcgtattcc tgttggtgcg ggtggtacgc 3655921 gcgcccgtcg aggtcaagcc ggtcggcggc gagcagtcca agcgatggcg ggtcgatatc 3655981 gacggcacgt tgagcttcct gctgctgccc cgcctgacca cggtgctctc gaagctgccg 3656041 gaagggtcgg aggtgacgtt aaacctgaac gcagactaca tcgacgactc cgtttccgag 3656101 gccatctccg attggcggcg cgcccacgag acgaggggcg gagtggtagc gatcgtggaa 3656161 acgtcgccgg ccaaactgca ccacgcacac gcccgaccac cgaagcgcca cttcgcgtct 3656221 gatccgattg gactggttcc gtggcgatca gcgcgcggca aagaccgcgg cagcgcttcg 3656281 gttctcgacc gcatcgacga gtatcaccgc aatggcgcgg ccgtgctgca cccgcatatc 3656341 gccgggctga ccgattcaca ggacccgtat gagctgttcc tcacctgtgc cgactcgcgg 3656401 attctgccga acgtcatcac cgccagcggc cccggcgacc tgtacaccgt ccgcaacctc 3656461 ggcaacctgg tgccgaccga tccggacgac cgatcggttg acgcggcact cgacttcgcc 3656521 gtcaaccagc tcggcgtcag ctcggttgtc gtctgcggac attcgtcgtg tgctgcgatg 3656581 acggcgctcc tggaagacga cccggccaac acgacgactc ccatgatgcg ttggctcgag 3656641 aatgcccacg acagcctggt ggtgttccgc aatcaccacc cggcacgccg cagcgccgaa 3656701 tccgccggtt accccgaagc cgaccagctg agcatcgtaa acgttgccgt tcaggtggaa 3656761 aggctgaccc gccacccgat cttggcgacc gcggtcgccg ctgctgatct acaggtcatc 3656821 ggcatattct tcgacatctc gaccgcccgg gtatacgagg tgggtccgaa cggcatcatc 3656881 tgcccggacg agccggccga ccgccccgtc gaccacgaat cagcgcagta gcgcccgcga 3656941 catcactacc cgctgaatct gattggtgcc ctcatagatc tgggtgatct tggcgtcgcg 3657001 cataaaccgc tcgaccggga agtcggtggt gtagccggcg ccgccgaaca gttgtacggc 3657061 atcggtggtg acctccatcg cgacgtcgga ggcgaagcac ttcgaggccg ccgaaatgaa 3657121 gcccagatcc ggctcaccgc gttcggcgcg ggcggcggcg gagtaaacca tcagccgagc 3657181 cgcctccacc ttcatcgcca tgtcggccag catgaactgc acggcctgaa acgtactgat 3657241 cgactcaccg aactgcttgc ggtccttggt gtaggcgatg gcagcatcca gcgcgccctg 3657301 ggcgataccc acggcctgcg cgccaatcgt gggacgggtg tggtccaacg tggccagcgc 3657361 ggtcttgaaa ccggtaccgg gctcaccgat gatgcgatcg ccggggatgc ggcagttctc 3657421 gaagtacagc tcggtggtcg gtgacccctt gatcccgagc ttgcgttctt tcggaccgac 3657481 ggtgaacccc tcgtcgtcct tgtgcaccat gaacgccgag atgccgttgg cgccccggtc 3657541 gggatcggtc accgccatca ccgtgtacca ggtcgacttg ccgccgttgg tgatccagca 3657601 cttggcgccg ttgagaatcc agtgatcccc atcggccttg gcccgcgtcc gcatggacgc 3657661 cgcgtcactg ccggcctcgc gttcactcaa tgcataggaa gccatcgccc cttcggcggc 3657721 caacgccggc agcacctgct tcttcagctc ctcggagccc cgcaggatca ggcccatggt 3657781 gcccagcttg ttgaccgcgg ggatcaacga cgcggacgcg tcgacgcggg ccacctcttc 3657841 gatcacgatg caggtagcta ccgagtcggc accctgaccg ccgtactcct ccggaatgtg 3657901 gacggcgttg aaaccggagg aattgagcgc cactagcgct tcttcgggga accgcgcctt 3657961 ctcgtccacc tcggcggcat gcggagcgat ctccttttcc gccaaagccc gtatcgccga 3658021 tcgcatttcg tcgtgttcct cgggcagctt gaacagatcg aacgacgggt ttccggccca 3658081 tccaaccatc ttggagccct cctaatctcc gtgctagtcg cgggttaact tacccgcaag 3658141 ccgctgcagt tccgcatcct tggccgccac gacgtcggcc agccggtcct ggaatgcgac 3658201 gatccgggcc ctcagctggg ggttggcggc tcccagcatc cgcaccgcca gcagtccggc 3658261 attaccggcg cccccgatgg acaccgtggc caccggaacc ccggccggca tttgcacgat 3658321 cgacagcagg gagtcaaggc cgtccagcct gcccagcggt accggcaccc cgatcaccgg 3658381 cagcggcgtc gcggcggcga ccataccggg caagtgcgcg gccccgcccg ctccggcgat 3658441 gatcacctcg agaccgcgct cggccgcgcc gcgcgcataa ctgaacatcg cctcaggggt 3658501 gcgatgggcc gaaacaaccc gaacctcggc cggaatgtcg aactcggcca gcgccgccgc 3658561 agcgtcggcc atcaccggcc agtcgctgtc gctgcccatg atcaccccga cccggggccg 3658621 ctcgccggca ggagtcatag gcgccgctcc tcctcatcgc ttcgtccccc gcacgcgggt 3658681 ggtaccccca ctgcatcgtc gctggcgcgg tgtgggtccc atccgtcagt ccaccgccca 3658741 tgggacaacc agtgtgccgc cagctcagcg cgttcacaca actgggcgac atcggagcca 3658801 aggaagttga tatgccccac cttgcgaccg ggtcgctcgg ccttgccgta gaggtgaacc 3658861 cgggcgtcgg gcattcgcgc aaacagatgg tgcagccgct cgtcgacgct catggccggc 3658921 ggctgcgcgg cgccgagcac attggccatc accgtcacgg gcaccacggc gtcgctgtcg 3658981 ccgagcgggt agtccaagac cgcgcgcaga tgctgctcga actggctggt gcgcgccccg 3659041 tcgatggtcc agtgcccgga attatgtggc cgcatcgcga gctcgttgac cagcaacgcc 3659101 ccgtcggtcg tctcgaacag ctcgacggcg agcacgccga ccacaccgag ttcgtcggcc 3659161 agctgcaacg ccaaccgttg cgccgcggtg gccaggtcgt cgggcagcgc cggcgccggc 3659221 gcgatcacca gcacacacgt gccgtcacgt tgcaccgtct ggaccaccgg ccacgccgca 3659281 ccctggccga acggcgaacg cgccaccagt gccgacagct cgcggcgcag gtccacccgt 3659341 tcctcgacca gcaccgccac gccgtcagcc aggcattcgc gagcgaaatc acgggcatcc 3659401 gccacatcac gtgccatccg aacgccccgg ccgtcgtaac ccccgcgcac tgccttgacc 3659461 acgatcgggg cgtcgacacg tgcggcgaag acgtcgattt cgtcggggtc tttgatgccc 3659521 gcgtagcggg gcacggcgac gcctgctgca gccagacgct gccgcatgac gagtttgtcc 3659581 tgggcgtgca ccagcgcctg cggcgacggt gcgacattga cgccatcggc gactagcttc 3659641 tccaacagct cgttcgggac gtgctcgtgg tcaaaggtca gcacgtcggc gccggccgca 3659701 acgcggcgca aggcggcaag atcggtgtgc gagccgatca ccacgttggg ggtgacctgc 3659761 gcggcagggt catctgccga ggtgaccaat acacggaggt tctgccccag cgcgatggca 3659821 gcctgatggg tcatccgggc cagctgaccg ccaccgacca tcgcaacgag gggggcaatg 3659881 aacgaggtga ccgccggggt gcgtgagctc gccacggcca tcatggtgtc acggcatctg 3659941 accggcgtac ttgccggcca cggcagccaa accgttacgt atcattttgc gtcgattttg 3660001 tgttcgtccg tacactcact tgttgtgtcc tttgccgatg ccaccatcgc gcgccttccc 3660061 ggggtggtcc agccctatgc gcagcgccac catgagctga tcaaatttgc catcgtcggc 3660121 ggcaccacat tcatcatcga cacagcaatt ttctacaccc tcaagctgac ggttctcgaa 3660181 cccaagccgg tgaccgcgaa ggtgatcgcc ggcatcgtcg ccgtcatcgc gtcctacgtg 3660241 ttgaacaggg agtggagctt ccgcgaccgc ggcggtcgcg agcgccacca tgaggcgctg 3660301 ctgttctttg cgttcagcgg cgtgggagtg ctgctgagca tggcgccgtt gtggttttcc 3660361 agctacatcc tgcagctacg ggtgccaacg gtgtcactga ccatggaaaa catcgccgac 3660421 ttcatctcgg cctacattat tggcaacttg ctgcaaatgg cgttccgctt ctgggcgttt 3660481 cggcgctggg tgttccccga cgagttcgcc cgcaaccccg acaaggccct ggaatccgcc 3660541 cttaccgcgg gcggcatcgc cgaagtcttc gaggacgtct tggagggcgg cttcgaggac 3660601 ggcaacgtca ccctgctgcg ggcctggcgt aaccgggcca accggttcgc tcagctgggc 3660661 gactcgtcgg agcccagggt gtcgaaaacc tcgtgataca gcaacgcatg cacctcccgc 3660721 aggcgcggaa tgttgtagaa ctcgagcgga tcttgtgacg cggactcgat aatcaacgtc 3660781 ccggtgcgaa aaatccgctc gaagatccgg tcccggaact ccacgctgtt gatccgtgct 3660841 agcggtatgt cgatcccgct gcgggtcagc acaccatgcc ggaacatcac ccgccggttg 3660901 gtcaccacga aatgtgtggt cagccagctc aggaatggcc acagcgtgag ccagccgacg 3660961 atcaccaacc agatccccca gatgaccgcg tgaatcacgt tcttagcgat ctgctgccaa 3661021 ggtgtcgagt tgacgaatcc ggacccgaac gccgccaacc cggtcagcaa gaccagcacc 3661081 acgacgggcc agattaagcg attccagtgc ggatggcggt gcagaacgac ctgctcgcca 3661141 gcggccagga cattctccgg atagctcatg cccgcgacct taatcttttg gggacgccag 3661201 ctccgcgcga gttaacgcaa atgcaccacg tcgcccgctg aaacaactac cgttcgaccg 3661261 ccgacgtcca gacacagccg accctggtca tcgatgtcac gcgcgatccc gacgacgtcc 3661321 tggccaccgg ggagctcgac gcgcacgcgc gacccaatgg tcaggctgcg agcacggtag 3661381 tcggccgcca gttgtgggtt ggcgttgcgc cactggatga tccgagcttc gagctcgcgc 3661441 aacagcctgc tggctatgcg gttgcggtcc ggtgccgcca ctccgaggtc cagcaatgag 3661501 gtcgcgtcgg gatcaacctc ttcgggggcc tgggtgacgt tgagtcccac accgagtacc 3661561 acaaacggct gcgcgacctc ggccaggatg ccggctaact tgccaccccg ggccagcacg 3661621 tcattgggcc acttgaggcc cgtttcggcc ggcgggactg caatcagggg ggccaccgaa 3661681 tcgagcaccg ccagacccgc ggccagtgac agccagcccc acgcttgcac cgggacgtcg 3661741 accacacgca caccgaccga caggatgatc tgcgctcggg cagtggccgc ccagccgcgg 3661801 ccatgacgcc cccgcccagc ggtctgatgc tcggcgatca acaccacccc gtcgatatcg 3661861 gccccggatg ccgcccgggc cagcaagtcg gcgttggtgg aaccggtttg ggccacgacg 3661921 tcaagttggc gccacccgga tccagcaccg atcagctggt cgcgcagtga gcgttcgtcc 3661981 aaaggcggcc tgagccgatc gcggtcggtc accgccccag cctaaggaag tagtgtgcgg 3662041 cagccgataa catcgactcc catgacaagc gttaccgacc gctcggctca ttccgcagag 3662101 cggtccaccg agcacaccat cgacatccac accaccgcgg gcaagctggc ggagctgcac 3662161 aaacgcaggg aagagtcgct gcaccccgtc ggtgaggatg ccgtcgaaaa agtacacgcc 3662221 aagggcaagc tgacggctcg cgagcgtatc tacgcgttgc tggatgagga ttcgttcgtc 3662281 gagctggacg cgctggccaa acaccgcagc accaacttca atctcggtga aaaacgcccg 3662341 ctcggcgacg gcgtggtcac cggctacggc accatcgacg ggcgcgacgt gtgcatcttc 3662401 agccaggacg ccacggtgtt tggcggcagc cttggcgagg tgtacggcga gaaaatcgtc 3662461 aaggtccagg aactggcgat caagaccggc cgtccgctca tcggcatcaa cgacggtgct 3662521 ggcgcgcgca tccaggaagg tgtcgtctcg ctgggcctgt acagccgtat ctttcgcaac 3662581 aacatcctgg cctccggcgt catcccgcaa atctcgttga tcatgggagc cgccgccggt 3662641 gggcacgtct actcccccgc cctgaccgac ttcgtgatca tggtcgatca gaccagccag 3662701 atgttcatca ccgggcccga cgtcatcaag accgtcaccg gcgaggaagt caccatggaa 3662761 gaactcggcg gcgcccacac ccacatggcc aagtcgggta cggcacacta cgccgcatcg 3662821 ggcgaacagg acgccttcga ctacgttcgc gagctgctga gctacctgcc gcccaacaac 3662881 tccaccgacg cgccccgata ccaagccgca gccccgacag ggcccatcga ggagaacctc 3662941 accgacgagg acctcgaatt ggatacgctg atcccggact cgcccaacca gccctatgac 3663001 atgcacgagg tgatcacccg gctcctcgac gacgaattcc tggagataca ggccggttac 3663061 gcccaaaaca tcgtggtggg gttcgggcgc atcgacggcc ggccagtcgg cattgtcgcc 3663121 aaccagccga cacacttcgc cggctgcctg gatatcaacg cctcggagaa agcggcccgg 3663181 tttgtgcgga cctgcgactg cttcaatatc cccatcgtca tgctggtgga cgtcccgggc 3663241 ttcctgccgg gcaccgacca ggaatacaac ggcatcatcc ggcgcggcgc caagctgctc 3663301 tacgcctacg gcgaggccac cgtgccaaag atcacggtca tcacccgcaa ggcctacggc 3663361 ggtgcgtact gcgttatggg ctccaaagac atgggctgcg acgtcaacct ggcgtggccg 3663421 accgcgcaga tcgcggtgat gggcgcctcc ggcgcagtgg gcttcgtgta ccgccagcag 3663481 ctggccgagg ccgccgccaa cggcgaggac atcgacaagc tgcggctgcg gctccagcag 3663541 gagtacgagg acacactggt caacccgtac gtggccgccg aacgcggata cgtcgacgcg 3663601 gtgatcccgc cgtcgcatac tcgcggctac atcgggaccg cgctgcggct gctggaacgc 3663661 aagatcgcgc agctgccgcc caaaaagcat gggaacgtgc ccctgtgagt cgagtgagcg 3663721 gaacgaacct gtgagtcgag tgagcggaac gaacgaagtg agtgacggga acgagacgaa 3663781 caatccggca gaagtgagtg acgggaacga gacgaacaat ccggcagaag tgagtgacgg 3663841 gaacgagacg aacaatccgg cccctgtgag tcgagtgagc ggaacgaacg aagtgagtga 3663901 cgggaacgag acgaacaatc cggcccctgt gagtcgagtg agcggaacga acgaagtgag 3663961 tgacgggaac gagacgaaca atccggcccc tgtgaccgag aagccgctgc atccgcacga 3664021 gccccacatc gagatactgc ggggacaacc caccgatcag gagctggccg cgttgatcgc 3664081 ggtgctgggc agtatcagcg gttcaacccc gcccgcgcaa cccgagccca cccggtgggg 3664141 gctgccggtc gaccagttgc ggtaccccgt cttcagttgg cagcgcatca cactgcaaga 3664201 aatgacgcac atgcgccgat gacccggctg gtgctcgggt ccgcctcccc tggccggctc 3664261 aaagtccttc gtgatgccgg cattgagccg ctggtcatcg cctcgcacgt cgacgaggat 3664321 gtcgtcatcg cggcgctggg gccggacgcg gtcccgagcg atgtggtgtg cgtactggcc 3664381 gcggcaaagg ccgcgcaggt cgcgaccacg ctgaccggaa cgcaacgcat tgtggccgcg 3664441 gattgcgttg tcgttgcctg tgattcgatg ctctacatcg aaggcaggct actcggcaag 3664501 ccagcgtcaa tcgacgaggc gcgcgagcag tggcggtcga tggcgggccg ggccggccaa 3664561 ctctatacgg gccacggtgt tatccggttg caggacaaca aaaccgtgta ccgtgctgct 3664621 gaaacagcaa taaccacagt atatttcgga acaccttcgg cctccgatct ggaggcttac 3664681 ctggccagtg gggagtcgct gcgggtcgcg ggtggattca ccctggacgg tctgggcggc 3664741 tggttcatcg acggcgtgca gggcaatccg tcgaatgtga tcggcttgag cctgccgttg 3664801 ctgcggtcgc tcgtgcagcg atgcgggctg tccgtcgccg cactgtgggc aggaaatgcg 3664861 ggcggcccag cgcacaagca gcagtagctt cggactgggc caggtcgcca gcggtaggct 3664921 cgatgatgtg ccgcttcccg cagaccctag ccccaccttg tcggcctacg cccatcccga 3664981 acggctcgtg accgccgact ggttgtcggc acacatgggc gcgccgggcc tggcgatcgt 3665041 cgaatccgac gaggacgtct tgctctacga cgtcggccat attcccggcg ccgtcaagat 3665101 cgactggcac accgacctca acgacccacg ggtgcgcgac tacatcaacg gcgagcagtt 3665161 cgccgaattg atggaccgca agggcatcgc ccgcgatgac accgtggtga tctatggcga 3665221 caagagcaat tggtgggccg cctatgcgtt gtgggtgttc acgctgttcg gtcacgccga 3665281 cgtgcgactc ctcaacggcg gccgtgacct ctggctcgcc gagcgccggg aaaccacctt 3665341 ggacgtcccg accaagacct gcaccggtta tcccgtcgtg cagcgcaacg atgcacccat 3665401 ccgcgcattc agagacgacg tgctggccat cctgggcgct cagccgctga tcgacgtacg 3665461 ctctcccgag gagtacaccg gcaagcgcac ccatatgccc gattaccccg aggaaggggc 3665521 gctgcgggcc ggtcacatcc ccacggcggt gcacattccg tgggggaagg ccgccgacga 3665581 aagtggacgg tttcgcagcc gcgaggaatt ggaacggctc tatgacttca taaacccgga 3665641 cgaccaaacc gtcgtctatt gccgcatcgg tgaacgctcc agccatacct ggttcgtgct 3665701 cacacacctg ctgggcaagg cagatgtacg gaactacgac ggctcgtgga ccgagtgggg 3665761 caacgccgtg cgagtgccga tcgtcgcggg cgaagaacca ggagtggtac ccgtcgtatg 3665821 accgcgcccg cgagcctgcc cgcgccgcta gcagaggtgg tatccgactt cgccgaagtc 3665881 cagggtcaag acaagctgag gctgttgctg gaattcgcca acgagctgcc ggcgcttccg 3665941 tcgcacctgg ccgagtccgc tatggagccg gtccccgagt gccagtctcc gctgtttttg 3666001 cacgtcgacg cgagtgaccc caaccgggtg cgcctgcatt tcagcgcgcc ggccgaagcg 3666061 ccaaccacgc gcgggttcgc ctcgatcctg gccgccggcc tagacgagca accggccgcc 3666121 gacatcttgg cggtgcccga ggatttctac accgagctgg gtctggctgc cttgatcagc 3666181 ccactgcggt tgcggggaat gtcggcgatg ctggcccgga tcaagcgccg gctgcgcgaa 3666241 gcggactgaa tcgaggaacc gcgtgagcgg gtcagcggcg cgacgcttaa acttcccccg 3666301 acaagacttg taagaaaatc tcttagagac gaagaatcag cccgacagga ggcgcagtgg 3666361 ctagtcacgc cggctcgagg atcgctcgga tctctaaggt tctcgtcgcc aatcgcggcg 3666421 agatcgcagt gcgggtgatc cgggcggccc gcgacgccgg cctgcccagc gtggcggtgt 3666481 acgccgaacc cgacgccgag tccccgcatg ttcggctggc cgacgaggcg ttcgcgctgg 3666541 gcggccagac ctcggcggag tcctatctgg acttcgccaa gatcctcgac gcggcagcca 3666601 agtccggggc caacgccatc caccccggct acggcttcct agcggaaaat gccgacttcg 3666661 cccaggcggt gatcgacgcc ggcctgatct ggatcggccc cagcccgcag tcgatccgcg 3666721 acctgggcga caaggtcacg gcccgtcaca tcgcggcccg cgctcaggcg cccctggtgc 3666781 cgggtacccc cgatccggtc aaaggcgccg acgaggtggt ggcattcgcc gaggagtacg 3666841 gcctgccgat cgcgatcaag gccgcccacg gcggcggcgg caagggcatg aaggtggccc 3666901 gcaccatcga cgagattccg gagctgtacg agtcggcggt gcgcgaggcc acggccgcgt 3666961 tcggccgcgg tgagtgctac gtggagcgct atctcgacaa gccgcgccac gtcgaagcac 3667021 aggtgatcgc cgaccagcac ggcaacgtcg tcgtcgccgg cacccgggac tgctcgctgc 3667081 agcgccgcta ccagaagctg gtcgaggagg cgcccgcacc gttcctgacc gactttcaac 3667141 gcaaagagat ccacgactcg gccaaacgga tttgcaaaga ggcccattac cacggcgccg 3667201 gcaccgtcga atacctggtc ggtcaggacg gcttgatctc gttcttggag gtcaacacgc 3667261 gccttcaggt agaacacccg gtcaccgagg aaaccgcggg catcgacttg gtgctgcagc 3667321 aattccggat cgccaacggc gaaaagctgg acatcaccga ggatcccacc ccgcgcgggc 3667381 acgccatcga attccggatc aacggcgagg acgcggggcg taacttccta ccggcgcccg 3667441 ggccggtgac aaagttccac ccgccgtccg gccccggtgt gcgggtggac tccggtgtcg 3667501 agaccggctc ggtgatcggc ggccagttcg actcgatgct ggccaagctg atcgtgcacg 3667561 gtgccgaccg cgccgaggcg ctggcgcggg cccggcgcgc gctgaacgag ttcggtgtcg 3667621 aaggcctggc gacggtcatc ccgtttcacc gcgccgtggt gtccgacccg gcattcatcg 3667681 gcgacgcgaa cggcttttcg gtacataccc gctggatcga gaccgagtgg aataacacca 3667741 tcgagccctt taccgacggc gaacctctcg acgaggacgc ccggccgcgt cagaaggtgg 3667801 tcgtcgaaat cgacggtcgc cgcgtcgaag tctcgctgcc ggctgatctc gcgctgtcca 3667861 atggcggcgg ttgcgacccg gtcggtgtca tccggcgcaa gcccaagccg cgcaagcggg 3667921 gtgcgcacac cggcgcggcg gcctccggtg acgcggtgac cgcgcctatg cagggcaccg 3667981 tagttaagtt cgcggtcgaa gaagggcaag aggtcgtggc cggcgaccta gtggtggtcc 3668041 tcgaggcgat gaagatggaa aacccggtca ccgcgcataa ggatggcacc atcaccgggc 3668101 tggcggtcga ggcgggcgcg gccatcaccc agggcacggt gctcgccgag atcaagtaag 3668161 cccggcggct actccaactg atcccgtagc cgtgccaatg acttggccag cagccgcgac 3668221 acgtgcatct gtgagatacc gacgcgctcg gcgatctgcg tttgggtcat cgagtcgaag 3668281 aacctgagca ccaagaccgt tcgttcccgc tcgggcaacg cctcgagcaa cggacgaagc 3668341 acctcccgat tctcgatctg gtcaagaccc gcatccacgt cgcccagggt gtctgtgatt 3668401 gcgcgggcat cgtcgtcgct gccgccaccg ctgtcgatgg acaaggtgtg gtaggaacta 3668461 cccgccagca aaccttcgat aacctcagcg cggtccatcc cgagctccgc ggcgagctcc 3668521 gatgccgacg gcgcccgccc gagccgctgc gacaaatcgg cggtggcggt acctagccgc 3668581 agatgcagtt ccttgagacg ccggggaacc ttgaccgacc agctgttgtc gcggaagtgt 3668641 cgtcggacct cgcccatgat ggtaggaacc gcgaaggaga cgaagtccga cccggtcttc 3668701 acgtcgaagc gaaccgcggc gttgaccagc ccgacccgcg cgacctgaat aaggtcgtca 3668761 cgcggttcgc cgcgaccctc gaaccgccgc gcgatgtgat cggccagcgg caagcaccgc 3668821 tgaacgatct tgtcccggtg ccgctggaat tccggtgagc cggcaggcaa accaaccagc 3668881 tcgcgaaaca tctccggaac gtcggcgtat tcgttagctc gcgatgcaga accgccggca 3668941 gcgcgcgccg tcacctgctg gatgccgccc gtcgggcggt caacgtgatg ccgaagacac 3669001 tgccggctac atcgggctgg cgaccgtcgt ggaaggtctg gacgtcgtcg gccagcgcgg 3669061 tcaggacatg ccagctaaag ctgcccggtg ccaccacgtc gtgggtgtcg caggcagcag 3669121 aagcctccac cacaacttcg tcttttcgcg gatcgaccac caggcgcagg gtggcatccg 3669181 gcaaggccga gcgaatcaac cgggtgcaca cctcgtccac cgccaacctc aggtcggcca 3669241 cggcgtcgaa atccaggtcc tcgaaggtgc cgatggcgcc gaccagggtg cgcagcagcg 3669301 ccaggttctc caggcgggca gcaacgttca gctcgacggc gcggacaccg cgttggcgcc 3669361 ccttggtggg taaatccgag tcggccatgc accctcccgg caagcttcga tcgacagtac 3669421 tcccgccttg ggtctggtct tcgagctggt cggtcatggt cggacctgct ggtagtgggg 3669481 atctaacgca acatggtcgg gattcatcat ggtgtacccg tgatacccat tcgcagctgc 3669541 cggtgaaacc ccgcgatgcc gggatttcca gccgcactag gatgtctagc cggccagccg 3669601 ctgccgccgg acttcgggat gttcggtata ccagcgatcg gcaatcttgc gtatccgccg 3669661 atgctcgaac gctagccacg ccaaaccaac cactgtgacg acaatcgcca ccacaccaaa 3669721 ggtcatgccc tcggcgtgat gtccggtgcc gaaagccgca agagctccga cgccgccgac 3669781 gacaccggcc acaatcaaca gatacccagg ccaatgcacc acgtcgatca gcgactcgcc 3669841 ggcaagcggc cgcgtcgtcc gcaagtggtc gacggggtca cgataggtgt cgcccatggc 3669901 ctcctccgtt tccgtcctat tccgccattt ctgcccatta ccaggcacta ccatcaacgg 3669961 tagaactcgt cgaacgggtt gtggagggat ctgacccatt tatttgttga ccgcggccga 3670021 cctggccgac ggctcacggc gccatgaccg ggccggcgat cggtgggacg cctatgcaga 3670081 gcgtcagcac catcagcgtc aacaaaaacc agccggcgcc gtgccatccc caccaggtac 3670141 cttccgcacg ccatacccgg taggtgcgca gaaacgccca cagcccggcc gcacacagga 3670201 tcagcgggcc ccccagcgcc agcaggatcc gctggggcgg gccgcaggcc gcggtgtcga 3670261 cgccgctgca cgtgctgacc aacaacgctc ccataatgag gaaaccgacc ccgacgacag 3670321 cggccacaac agcaaaccga atcgccgagt gcacctcgct gtcatcccgg cctagccgat 3670381 cgccgcgtga cggcccacct acttcgtgca tcggcgaatc tccatcccgc tcttggcggc 3670441 tgccttacgt caccaccggt aacgcgctgc gcaccgcggc tatcgcggcg tcgatctcgg 3670501 cggttgaaac cgtcagcggt ggacggaatc gcacggtgtc tgcaccggcc ggcaacacaa 3670561 tcaccgcacg ttgccacagc tggcggatca actcgtcacg gtcggcggtg gtcggcaggc 3670621 taaacgcaca catcagcccg cggccgcgcg gatcgagaac cactgccggg aagtccgcgg 3670681 cgagttcgtc aagccgggcg cgcagatact taccgtgctg caccgcccgc tcgaacaggc 3670741 cctcggcttc gatgacctcc aagatgcggc gggcgcgcac catgtcggta agattgccac 3670801 cccatgtcga gttgagccgt gatgggaccg cgaacacatt gtcggcgacc tcgtccaccc 3670861 gccgaccggc catcactccg catacctgcg tcttcttgcc gaacgccacg atgtcgggtg 3670921 cgacatccaa ctgctggtat gcccaggcgg ttccggtcaa cccgcagccg gtctgtactt 3670981 cgtcgaagat cagcagtgca tcaaactcgt cgcacagctc gcgcatcgca gcgaaaaact 3671041 ccgggcggaa atggcggtcg ccaccctcgc cctggatggg ttcggccaca aaacacgcga 3671101 tgtcgtgcgg gcgggtctcg aatgccgcgc gggcctggcg tagcgcctcg gcctctagcg 3671161 cggccatagc gggctcatcc aggccgggcc gcatgtacgg cgcatcgatg cgtggccagt 3671221 cgaatttcgg gaaccgggcg gtaatggtcg gcttggtgtt ggtcagcgac agggtatagc 3671281 cgctgcggcc gtgaaatgcc ccgcgcaggt ggagcacttg agtgcccagc gccgggtcga 3671341 tcccatgggc ttggttgtgc cgactcttcc agtcgaacgc ggctttgagc gcgttctcca 3671401 ccgccagggc gcccccttcg acgaagaaca gatgcggcag cgccgggtcg cccaagacac 3671461 gggcgaaggt ctcgacgaag cgggccatcg ccaccgagta cacgtcggaa ttgctgggct 3671521 tgttcagcgc ggcctgcatg agttcggcat ggaactcccg gtcgtccacc agcgccgggg 3671581 gattcatacc cagtgccgag gaggcaacga atgtgaacat gtccaggtag cgccgacccg 3671641 ttatagcgtc gaccagatat gaaccgcccg aacgggtcag atcgagcact atgtccagac 3671701 cgtcgaccag catgctgcgc cctagcacct catgaacccg gtctggtgtt gttggtctac 3671761 cggcaagagc gacggacttc acgacggcgg ccatgacgct atgatagcag gatttacgga 3671821 atattgatat ttatgctgga aaaattatgg tatatgctgc ctatcgctgt aaaaagtgtt 3671881 cagaatgatc gtgcttcgcg tccgcacgtt cgccgttgtc cggatccgtt gcaacaggtc 3671941 ctcgagcgcc cgtgcggacg cgacgcgcac cagcaagacg tagctctctt cgccggccac 3672001 cgagtaacag gactcgacct cctcgatatg ttctaggcgc gcgggggcat catctggttg 3672061 agacggatca agaggagtga tagccacgaa cgccgacaac aaatgcccaa ccgcctcggg 3672121 attgattcgc gccgaatatc cctggaccac accacgagac tccagccggc gcactcgcga 3672181 ttggaccgcc gagaccgaca gcccggctcg cgtggccaac tctgacagcg tcgcacgtcc 3672241 gtcggcggcc agttcgcgca ccaggatccg atcgatatcg tcgagcgcct cgttcatggc 3672301 cggagactat cgcaacggca gtgccgcatg agccgctcga aaagactgca gactggccag 3672361 ctgcgcgcgc gcttcgccgc cgggttgtca gccatgtacg ccgctgaggt gcccgcctac 3672421 ggcacgctgg tcgaggtatg cgcacaagtc aactccgatt acctgacccg gcatcggcga 3672481 gccgagcggc tggggtcgct tcagcgcgtc accgccgagc gccacggcgc catccgagtg 3672541 ggcaacccgg ccgaactcgc tgcggtcgcc gacctgttcg ccgcgttcgg gatgctgccg 3672601 gtcggctact acgatctgcg caccgctgag tcaccaattc cagtggtgtc caccgcattt 3672661 cgcccaatcg atgcgaacga gctggcacac aacccgtttc gggtgttcac ctcgatgctg 3672721 gccatcgagg atcggcggta cttcgatgcc gacctacgca cccgagtgca gaccttcctc 3672781 gcgcgccggc aactctttga ccccgcgttg ctcgcccagg cgcgggcaat cgcggctgac 3672841 ggcggctgcg atgccgacga cgcaccggct ttcgtcgccg cggcggtggc cgcgtttgcg 3672901 ctgtcgcggg aaccggtcga gaaatcctgg tacgacgagt tgtccagggt gtcggcggtg 3672961 gccgctgata tcgctggagt cggctccaca cacatcaacc atctgacgcc tcgggtgctc 3673021 gacatagacg atctgtaccg tcggatgacc gagcgcggca tcaccatgat cgacaccatc 3673081 caaggccctc cccgcaccga cggacccgat gtgttgttgc ggcaaacctc atttcgcgcg 3673141 ctggccgaac cacgcatgtt tcgcgacgag gacggtaccg tgacgccggg aatcctgcgg 3673201 gtgcggttcg gtgaggtcga ggcgcgcggt gtcgcgctga ccccgcgagg gcgcgaacgc 3673261 tacgaagccg cgatggcggc cgcagatccg gccgcggtct gggccactca ctttccctcg 3673321 acggatgcgg agatggccgc tcaaggcttg gcctactacc gaggtggtga cccgtcagcg 3673381 ccgatcgtct acgaagactt cctgcccgct tcggccgcgg gcatcttccg ctccaacctg 3673441 gatcgcgact cgcaaaccgg tgacggaccc gacgatgccg gctacaacgt cgattggttg 3673501 gccggggcaa tcggccgaca cattcacgac ccgtatgcgc tctatgacgc gctcgcccag 3673561 gaggagcggc gctgataacc actgacgcgt tacgagccca ggtgctcgaa gcctgccaag 3673621 cgatcggcgt aaccgccgcc cttggcgagc cgggcgaaca cagcctgccc gcgagcacac 3673681 cgatcaccgg cgacgtgctg ttcagcatcg caccgaccac cccggagcag gccgaccacg 3673741 cgatcgccgc ggcggccgca acatttacgg catggcgaag cacgccggcc ccggtgcgcg 3673801 gcgcgctcgt ggcccggctc ggcgagctgc tcaccgcaca ccagcaggac ctcgcgacac 3673861 tggtcacagt cgaagtaggc aagatcaccg ccgaggcgcg cggcgaagtg caggaaatga 3673921 tcgacgtctg ccagttctcg gtgggtctgt cacgccagct ctacggccgc accatcgcgt 3673981 cagagcgcgc tgggcaccgg ctcctggaaa cctggcatcc gctgggagtg gtgggcgtga 3674041 tcaccgcgtt caacttcccg gtcgcggtct gggcgtggaa caccgcggtg gcactggtct 3674101 gcggcgacac ggtggtgtgg aaaccctcgg agctgacgcc gttgacggcg ctggcctgcc 3674161 aggcgctgct cagtcgggcc gccgctgatg tcggcgcgcc ggccgcggtg ggcggcctgc 3674221 tgttgggcgg cgccgagcgt ggtgcgcaac tcgtcgacga cccgcgggtt gcgttgttgt 3674281 cggcgacggg ttcggtgcgg atgggccagc aggtcggtcc acgcgtcgcc cggcgcttcg 3674341 ggcgggtgct gctggagttg ggcggcaaca acgcggccat tgtggcgccg tcggccgacc 3674401 tggagctggc ggtgcgcggc atcgtgttcg ccgcggccgg caccgcaggt cagcgctgca 3674461 ccagcctgcg ccggctgatc gtgcaccgct cggtggctga cgatgtggtg gcacgcgtcg 3674521 tcggcgccta tcgccagctg gcgatcggtg acccgtcggc cccggacacg ctggtaggcc 3674581 cactcatcca cgaggccgcc taccgcgaca tggtggcagc gctcgagcgg gcacgcaccg 3674641 acggcggcga ggtcatcggc ggtgatcgtc gcgaggtggg ctcaccgggc gcctactatg 3674701 tcgcgcccgc tgtggtccga atgccgtccc agaccgccat cgtggcgacc gaaacgttcg 3674761 caccaatcct gtacgtgctc acctacgacg acctcgacga ggcgatagcc ctcaacaacg 3674821 cggtaccaca agggctttcg tcgtcgatct tcacgaccga cctgcgtgag gccgagcact 3674881 tcctcgacca gtccgactgc ggtatcgcca acgtcaacat cgggacgtcg ggagcggaga 3674941 tcggtggtgc cttcggcggc gagaagcaga ccggcggcgg ccgcgagtcc gggtccgacg 3675001 cgtggaaggc ctacatgcgc cgggccacca acaccgtcaa ctactcgagc gagctgccgc 3675061 tggcgcaggg cgtgaagttc gggtaaccat gcccgtgggt gcgtctgggc atcatcgacg 3675121 cgcgcttggg gttgggcggg gtggaattca tccatttcat tcagtgcccg ttgcgaatcc 3675181 ccaagctacc ccgacggcga ccagaggatg tcgatgggga cggcggcgag gcggtcgccg 3675241 aatggctggg cttgtgggcc ggtgtgcagg atcacgccgc cggcgaagcg tgcgccgact 3675301 ttgtcgcgga gtctgctgat cgagcgggtg tctctaccac ggagggttgc cgccgacttg 3675361 atttcgatcg cggcaatgag gccgtctgcg gtttccagta tgaggtctac ttcggcgccg 3675421 tctcgatcgc ggtagtggaa cagtcgaggt gcctgttgcg accatccgag ttgtcgccgg 3675481 agttctgcga tcacgaaagt ttcgatgatg gctccggccg cgttggggtt ggcatgtgga 3675541 ccggctccgg taggcgagac attgacgagg cgagcggcca gtccggagtc gagaaggagg 3675601 actttcggtc tatcgacgac ccgcttggaa aggttggtcg accacgcggg tatgcggtcg 3675661 atgagataca gggtctcgag gaggtcgagg tacggcggca gggtacgtac ggggatttcg 3675721 gcgtcggtag ctagggagct caggttaagt tcggacgcgc tgcgtgcggc tagaagtcgg 3675781 atgaggcgcg gcaggtcggc gatgcgttgg agattggaga cgtcggccgc gtcacgtttg 3675841 acgacgcggt cgacgttcct agctttcgcc gattcgcgac aaagccgtcg ccgatacgcg 3675901 gcactatctt cgccaattcg cggatatctc ctcaccgatt cgcgatatct ggcggagccg 3675961 gtggtgtcgc agcagggacg tcggggcaga cccaccccac cgaaagaacc accaccacct 3676021 gctcgcctag ccgaacgtgt ggtctacgtg agtaatatct gtcacatggc gacagccaga 3676081 aggcggttat ccccgcagga ccgccgcgct gaactgctcg ctctgggggc ggaggtcttt 3676141 gggaagcggc cttacgacga ggttcgcatc gatgagatcg ccgagcgcgc tggggtgtcg 3676201 cgggcactga tgtatcacta cttcccggac aagcgggcgt tcttcgccgc ggtcgtcaag 3676261 gacgaggccg accggctgta cgcggcgacc aacaaggcgc ccgcccctgg gatgacgatg 3676321 ttcgaagaga tacgaaccgg cgtgctggcc tatatggcct accaccaaca aaaccccgag 3676381 gcggcgtggg ccgcctacgt cggcctcggc cgatcggacc cggttctgct cggtatcgac 3676441 gacgaagcca agaaccgcca gatggaacac atcatgtccc gcatcgccga ggtcgtgagc 3676501 gggattgacc gcgataacac cctggaccca gaggtcgagc gcgacctgcg ggtgatcatc 3676561 cacggctggc tggcgttcac cttcgagctg tgtcgtcagc ggatcatgga cccgtcgacc 3676621 gacgctgaac ggctcgccga tgcttgcgca cacgcgctgc tggacgccat ctcccggctg 3676681 ccgcagatcc ctgccgaact ggctgacgcg atggcaaccg cgcgaatgtg agcggtaggc 3676741 ggtttttgtc ggtgcctgtt ggcacgatgg ctaggtgagg ttcgcgcagc cttcagcact 3676801 gagccgattc agcgcgctca cccgagactg gttcaccagc actttcgccg cgcccaccgc 3676861 cgcccaggcc agcgcctggg cggccatcgc agacggcgac aacacgctgg tcatcgctcc 3676921 caccggatcc gggaagaccc tggcggcgtt cctgtgggcc ctggatagct tggccggttc 3676981 ggaacctatg tccgagcggc cggcggccac ccgcgtgctg tatgtgtcgc cgctcaaagc 3677041 gttggccgtc gacgtcgagc gcaacctgcg cactccgctg gccggactga cccgactcgc 3677101 cgaacgccag ggtctgcccg cgccccagat cagggtgggc gtccgttcgg gcgacacccc 3677161 gcccgcactt cgccgccagc tcgtcagcca gccgcccgac gtgctgatca ccaccccgga 3677221 gtcattgttt ttgatgctca cttcggccgc acgccaaact ctgaccggtg tgcagaccgt 3677281 catcatcgac gaaattcatg ccatcgccgc caccaagcgc ggcgcacacc tggcactatc 3677341 cctagaacgg ctcgacgacc tgtctagccg gcgacgggcg cagcgcatcg ggctgtcggc 3677401 gaccgtacgt cctcccgagg aactcgcaag gttcctgtcc ggacagtccc cgacgaccat 3677461 tgtggcgccc ccggccgcca agaccgttga gctgtccgtg caggtgccgg tgcccgacat 3677521 ggccaacttg accgacaaca ccatctggcc ggatgtggag gctcggctgg tcgacctgat 3677581 cgaatcacac aactcgacca tcgtgttcgc caattcgcga cgattggccg agcgacttac 3677641 cgcacggctc aacgaaattc acgccgcgcg ctgcgggatt gagctcgcgc cagacaccaa 3677701 ccagcaggtt gccggcggcg ccccggcgca catcatgggc tcgggccaga cgttcggagc 3677761 gccgccggtg ctggcccgcg cccaccatgg ctcgatcagc aaggagcagc gcgccgttgt 3677821 cgaagaggac ctcaaacgcg ggcaactcaa agcggtggtg gcgacgtcca gcctggagct 3677881 gggcatcgac atgggcgcgg tcgatctggt gatccaagta caggcaccac catcggtggc 3677941 cagcgggctg cagcgcattg gccgggccgg tcatcaggtc ggcgagattt cgcggggggt 3678001 gctgtttccc aagcatcgca ccgacctact cggctgcgcg gtcagcgtgc agcgcatgct 3678061 tgccggtgag atcgagacca tgcgggtgcc ggccaaccca ctcgacattc tggcccagca 3678121 cacggtggcg gcggctgcgc tggaaccgtt ggatgccgac gcgtggttcg acaccgtgcg 3678181 gcgggccgcc ccgttcgcga ccctgccgcg tagcctgttc gaggccaccc tggacctgct 3678241 gtccggcaag tacccatcca ccgagttcgc tgagctgcgg ccgcggctgg tgtatgaccg 3678301 cgataccggc acgctgaccg cgcgacccgg agcccagcga ctggccgtca cctccggcgg 3678361 cgccattccc gatcgcgggt tgttcgccgt ctacctcgct accgagcggc cgtcgcgggt 3678421 aggcgaactc gacgaggaaa tggtttacga gtcccgcccc ggtgacgtga tctcgctggg 3678481 tgccaccagc tggcgaatca ccgagatcac ccacgaccgg gtgctggtga tccccgcgcc 3678541 gggccagccg gcccgattgc cgttctggcg cggagacgat gccggccgcc ccgccgagct 3678601 cggcgccgca ctcggcgccc tcaccggcga gctggccgcc ctggaccgta cggcattcgg 3678661 cacacgttgt gcgggtttgg gtttcgacga ctatgccacc gacaacctgt ggcgactgct 3678721 ggacgaccaa cgcaccgcta ccgcagtggt acccaccgac agcacattgt tggtcgagcg 3678781 gtttcgtgac gagctgggcg attggcgggt gatcttgcat tcgccgtatg ggctgcgggt 3678841 gcacggaccg ctcgcgctcg cagtcggccg gcggctgcgc gaccgctatg gcatcgacga 3678901 gaagccgacc gcctccgaca acggcatagt ggtgcgccta ccggacaccg tgtccgctgg 3678961 cgaagacagc ccgccgggtg ccgaactgtt cgttttcgac gccgacgaga tcgacccgat 3679021 cgtcaccacc gaagtggccg gttcggcgct gttcgcgtca cggttccggg aatcggcggc 3679081 ccgcgctctg ctgctgcccc gccggcaccc cggccgccgc tcgccgctgt ggcagcagcg 3679141 gcagcgcgcc gcccggctgt tggaagtggc ccgcaaatac cccgacttcc cgattgtgct 3679201 ggagacggtc cgcgagtgcc tgcaggacgt ctatgacgtc ccgatcttgg tcgagctgat 3679261 ggcgcggatc gcccagcggc gggtgcgtgt cgccgaagcc gagaccgcca aaccttcgcc 3679321 atttgcggca tcgctgttgt tcggctacgt cggcgccttc atgtacgagg gcgatacgcc 3679381 gctggccgaa cggcgcgccg ccgcgctcgc gctggacggc acgttgctgg ccgagctgct 3679441 aggccgggtg gagctgcgcg agctgctcga tcctgacgtc atcgccgcta ccagccgcca 3679501 gctccagcat ctggcggccg accgggtagc ccgtgacgcc gaaggggttg ccgatctgct 3679561 gcggctgctg ggtccgctca ccgaagacga gatcgctgcc cgggcgggcg cgcccgaggt 3679621 cagcggctgg ctggacggct tacgcgccgc caaacgcgcg ctcgtggtgt ccttcgccgg 3679681 ccgcagctgg tgggttgccg tcgaggacat gggccggctg cgcgacggcg ttggcgcggc 3679741 ggttccggtg gggctgccgg ccagcttcac cgaggcggta gccgacccgc tgggcgaact 3679801 actgggccgc tacgcacgca cccacacacc gttcaccacc gctgcggccg cagcccggtt 3679861 cggtcttggg ctgcgggtga ccgccgacgt gctgggccgg ctggccagcg atggccggct 3679921 ggtgcgcggc gaattcgtgg ccgcggccaa aggatccgcc ggcggcgagc agtggtgtga 3679981 cgccgaggtg ttgcgaattc tgcggcgccg ctcgctggcc gcactgaggg cgcaggcaga 3680041 gccggtcagc accgccgcct acggacgctt cctgccggcc tggcagcacg tttccgcggg 3680101 caactcgggc atcgacgggc tggccgcggt catcgatcag ctcgccggcg tccggatacc 3680161 ggcctcggcg atcgaaccgc tggtgcttgc cccacggatc cgcgattact cgccggcgat 3680221 gctcgacgag ctgctcgcga gcggggacgt cacctggtcg ggcgccgggt cgatctcagg 3680281 cagtgacggc tggatcgccc tgcaccccgc cgactcggcg cccatgacgc tggcggagcc 3680341 ggccgagatc gacttcaccg acgcccaccg ggcgatctta gccagcctgg gcactggcgg 3680401 cgcgtacttc ttccgccagt tgacccacga cggcctgacc gaggcggaac tcaaagccgc 3680461 tctgtgggaa ttgatttggg ccggacgagt gaccggcgac acgttcgcac cggtacgcgc 3680521 ggtactcggc ggggcgggca cccggaagcg tgctgctccc gcacacggcg ggcatcgacc 3680581 gccgcgcctg agccgatacc gcctcacgca cgcccaggcc cgcaacgctg acccgaccgt 3680641 cgccgggcgg tggtccgcgc tgccgcttcc cgaaccggac tccacgctgc gcgcccatta 3680701 ccaagccgag ctgctgttga accgccacgg cgtgttgacc aaagacgcag ttgctgccga 3680761 gggtgtggcg ggcgggttcg cgacgctcta caaggtgctc agtgcgttcg aggatgccgg 3680821 caggtgccag cgtggctact tcatcgagtc gttggggggc gctcagttcg ccgtcgcctc 3680881 gaccgtagac cggctgcgta gctacctcga cggtgtcgac cccgaacagc cggactacca 3680941 cgcggtggtg ctggccgctg ccgacccggc caacccgtat ggggcggcgt tgccctggcc 3681001 agcgtcgagc gctgacggta ccgcccggcc gggccgcaaa gccggcgcac tggtcgttct 3681061 ggtggacggc gagttggcct ggttcctcga gcgcggcggg cggtcgttgc tgacgttcac 3681121 cgatgatccc gaggccaacc acgcggcggc catcgggctg gccgacctgg tcaccgccgg 3681181 gcgcgtcgcg tcgattctgg tcgagcgggc cgacggcatg ccggtgctgc agcccggcgg 3681241 gcgggcgtcg gcggcactga cggcgctgct ggcagccggc ttcgtccgca cacctcgcgg 3681301 tctgcggcgg cggtaagcca tgcccgaggg cgacaccgtc tggcacaccg cggccacgtt 3681361 gcggcggcat ctggccggtc gcacgttgac acgttgcgac atccgagtgc cacggtttgc 3681421 cgccgtcgac ctcaccggcg aggtagtgga cgaggtgatc agtcggggca agcacctgtt 3681481 catccgaacc gggacagcca gcattcattc gcatctgcag atggacggca gctggcgggt 3681541 cggcaacagg ccggtgcggg tggatcatcg ggcgcgaatc attttggaag ccaaccagca 3681601 agaacaggcc atccgggtgg tcggcgtcga cctaggcctg ttggaggtca tcgaccggca 3681661 caacgacggc gccgtcgtcg cacacctagg acctgatctg ctggccgacg attgggaccc 3681721 gcagcgtgca gccgccaacc tgatcgttgc cccggaccgg cccatcgccg aggcactgct 3681781 cgaccagcgg gtgctcgccg ggatcggcaa cgtgtattgc aacgaactgt gcttcgtcag 3681841 cggagtattg ccgacggccc cggtgagcgc ggtcgccgac ccgcgccgcc tggtcacccg 3681901 cgcccgagac atgctgtggg tcaaccgctt ccgctggaat cggtgcacca ctggcgatac 3681961 ccgggccggc cggcgactgt gggtctacgg gcgggccggg cagggttgcc gccgctgcgg 3682021 cacgctcatc gcctacgaca ctaccgacga gcgggtgcgg tattggtgcc cggcctgcca 3682081 gcgctgaacc gggcgatcaa agccagcacc tagtcgcggc cgtgggtagc gaagaactgg 3682141 gcaatgactt gcgacccgtc gaacgcgcgc gtggtcgccc cgatgaccgc cttgggcaga 3682201 tattgcctgc cacccggcca ggtatgtccg ccattgtcga tctggtagga gatcacctcg 3682261 gtgccggccg cacatgagct ggaatcgaaa aggtgcacca ttgttccgtc cccgacgtca 3682321 ggcagctccg ccgccgacgg atcgccctga cacccatcga ccgcccgcca gcgatccacc 3682381 aagctcgcaa ccgagatgga atggctgagc ccgccgcgac cacgcaccgc cccgccgttg 3682441 aacggcacca gcgggtcggc ggtgccgtgt gcttcgagca ccgacaccgg ccgcgacgga 3682501 ttacatgtca cacccacacc cagcgtgccc gccaccggcg cgaccgcggc gaagatatcg 3682561 gcacggtcac acgccagccg gttggacatg aagccaccgt tggacatgcc ggtggcgaag 3682621 acgtgcccgg gagcgatgtc gaagtcgtgc accagctttg cggccagcgc gaccaagaac 3682681 ccaacgtcgt cgagatgacg gcgatccgcc ggcgacgccc ccctcccgtc ggcccagctt 3682741 ttgtcgtagc cgtcaggata gacaaccaac aagtcggcgg cgtcggcaac agcgtcgaaa 3682801 tcggtgagag cctcctgtcc ggctccggtg ccgccaccac cgtgcaggct gatcaccaac 3682861 ccggagggct cagcgggcgg cacgtgcaag cgataactgc gggtcaagcc cccgaactgg 3682921 aacgtcgcta ccgaactggc atgcctggcc agtagctgat caccgccaca cccggccagg 3682981 caaaccatga gaacgataag cgacagcatt cgcgcccacg gcatctcgtc aaggtaccga 3683041 tcgcgagcgc tcagcccgcg gcgccctgtc ccaccgcttg gaccgatgcg tgctcgtgca 3683101 acgccctggc ggcttcggga tgtacgggct tgaggtcgaa gatgacctcg gtgacggtcc 3683161 cggtgaacgc atagggcgcc ttgtcctcat agccgcggtc aacgaccagg ccgttgtcgc 3683221 ggccgatgtc catgccggca taggaggtaa aggccagcgg caccgtctgg ggcagctcac 3683281 cctctccgat caaccgatcg tcggcccaga gcgtcacccg accaccggag gcggcgacgg 3683341 gttgatggga atcgaacagc atccgcaccg tgacatcccc ggtggggagc ggctcgctgg 3683401 acacctgccg gtaggtttcg acgcccagga aggagtaggt gtggtgcagg tgccgctgtt 3683461 cgtcgaccca tagcgcgaac cctcccatga agtcggcgtt ggcgacgatc acaccctgcg 3683521 cgccgccgtc ggggatgtgc agccgtgcct cgatcgcgta agaacgaccg cagatacggg 3683581 ggaccatgcc gcgctgaatg ttctgcacgt cacctttgaa actgaaccgt gcggtggtgg 3683641 gcaggggcgg caggtcgccg aacattaccg cgagcccgcc cagcagcggc agcacccggt 3683701 ttcgttcggc ctcctgccac cacagctggg tgagctcggc gaccttgtcg ggatgctcgg 3683761 ctgccaggtt tttcgcctgg gagaagtcat ctggtaggta gtacagctcc cagacgtcct 3683821 ggtccgggtc gtaggtcccc ggcgcgaacc gtcgcatcgt ctccggtgac agatcccagg 3683881 gcgccttgtc caagcgagcg cacgcccacc agccgtcttt gtagatggca cggctgccga 3683941 agttttcgaa gtactgcacg gtgtggcggt cttcggcttc agcgtcgtcg aaggtccgca 3684001 cgaaactggt tccgtccatc ggttcctgct cgaagccgtc gacatgggtc ggctccggta 3684061 aaccgatggc cgccaacacg gtcggcgcga tgtcgatgca gtgggtgaac tggctacgaa 3684121 cacggccgtc tggccggatc cgggccggcc aagcgaccac caatggatcg cgcgtgccgc 3684181 ccaggtggct ggccatctgc ttgccccact gcaacggggt gttgctcgca tgcgcccacg 3684241 cgctggcgaa atgcggtgcg gtgaactcgt cgccgagtgc ggcgatgccg ccgtattgtt 3684301 cgatcagctc caattgccgc tcggcatcca gatccaggcc gttaaggaac gtcatctcat 3684361 tgaacgaacc ggtgttggtg ccctccatgc tggcgccatt gtcgccccag atgtagaaca 3684421 ccaacgtgtt gtcggactcg ccgagatcct cgatcgcgtc cagcagccgg ccaacattcc 3684481 agtccgcatt ttccgagaac ccggcgaaca cctccatctg gcgggcaaag agccgttttt 3684541 gcgcctccga catactgtcc cacgcgggga ataggtcggg ccgctcggtg agttcggcgt 3684601 cgggtggaat gatcccgagt cgcttttgcc gttcgaatgt cttctgccgg tacacatccc 3684661 agccatcatc gaactcacct cggtacttgt cggcccattc cttgaatacg tggtgtggcg 3684721 cgtgggtggc gccggtcgcg tagtacagca tccacggctt ggtggcattc tgggcccgca 3684781 cggtgtgcag ccactcgata gccttgtcgg tgaggtcgtc ggggaaatag tagggacggc 3684841 cgtcttcccc agaaccctcg ggtatgccta tgacggagtt gtcctgactg atgatcgggt 3684901 cgtactgacc cgcggcgccg ctcgggaagc cccagaaatg gtcgaatccc caacccagcg 3684961 gccagttgtc gaacggcccc gcggctccct ggacattgtc cggggtcaga tgccacttgc 3685021 cgaaagcgcc agtcacataa ccgttgtcgc gcagaatacg cggcagcgct gcgcaactgc 3685081 gtggcctgac cgccgaatac cccgggtacg ggccggggaa ctcgcagacc gacccgaagc 3685141 ccacccggtg atggttacgc ccggtcaaca gcgccgcacg ggtcggcgag cacaccgcgg 3685201 tcacatgaaa acggttgtag atcaacccat tctgggctag ccgggacagc gtcggggttc 3685261 ggatcgcgcc gccgaatgta tccggtccgc cgaacccagc gtcatcgatc aacacgatca 3685321 gcacattcgg tgcgtcgtcg ggcggaaagg gaccggggac aatcgaccag tcgccgaccg 3685381 actctgccat ggtgcggcca accacgccac caaagcggcg ctgcggtagc ggcagccggg 3685441 tgcggtctgg gttgaacttg cccatcgcct ctcgcaacgc cgcacccagg cttcgcaacg 3685501 tcgaacgact cagctccgca accgatttca ttggagagct agccaacgcc tgccccgctt 3685561 ccagtcggcc ttgtgcctcc gtcacggcga tgaccactgc tcggcccgcc gccagcgctt 3685621 ggccgatctt gtcggccagc ccggtcttga tccgatggtg ggcgaaggtg ccggccaatg 3685681 ctccggtcgc ggcgccgagc gccgccgagg ccaacagtgc cggcgagaac aggccgatcg 3685741 ccaggcccac cccggcgccc cacgcggcgc cgcgccggcc gagccgattt ccggtgtcga 3685801 ccaaaaccgg actgccctcg gcgtccttgc cgatcagcac cgcaccctgc agcggaatgc 3685861 ttttgtcctt ggcggcatcg acgagggttt gaaaatcgtg acgagccgaa tcgaggtcct 3685921 gatagccggc gacgagcacc agcgcgttgt cttcactcat cacgaaactc ccgatatgtg 3685981 tgtcacggcc ggcaatcggc cgcggctgac catgttggca acgtagcacc ggtcaacgtg 3686041 cgcgtgctgg cgaactcgcg gtgcgacccg gtcagcggat cgtcgaactc gatgcgctgc 3686101 gcgagcaact gcagcggtgt gctgaagtcg tgggcggcca cggatatcac gttggggtac 3686161 aacgggtcac ccatgatcgg tatccccagc gccgccatgt gcactcgcag ctggtgggtg 3686221 cgcccggtgg tcggtgtcag ccgatacaga ccgtcgcgcg ctatccgctc caccagcgtc 3686281 tccgcgttgg gaacgccggg ctcacagacc gcctgcagat ggccccggcg cttgacgatg 3686341 cgactgcgga ccaggcgcgg cagggccaga cccggggcaa cgggtgcgcg agccagatag 3686401 gtcttgcgca ccaaaccgcg ggcgaacatc gtctggtagc tgccgcgcac ctcgcgtcgg 3686461 gtggtgaaca acaacacccc ggcggtcagc cggtccagcc ggtgggccgg gctcagctcg 3686521 ggcaatccca gttcgcgacg cagccgcacc agcgcggtct gcgcgacgtg tcgcccccga 3686581 ggcatggtcg ccaagaaatg tggcttgtcg acgacgacga tgtcggcgtc ttgatgcagc 3686641 actgggacat cgaagggcac cggcacctcg tcgggcaggt cgcgatacag gtgcacaacc 3686701 gaaccgggcg gcagcaccgt gccactgtcg accaccgcac cgtcgtcgtc gaccacctcc 3686761 ccggccagca ccttcgcacg ggccgccacg ccaaaccgtg cggtcagctc ggctaacacc 3686821 gacccgccaa gcagtcgcac ccgcaccggc cccagcacgt cgtgcacgct aagcaaacga 3686881 tcctctggcc gcaacgccac acgagaccct ctcagtaagt ggaaatctcg tcctcggtcg 3686941 gtagcacccc ggtgaccatg aagatgacgc ggcggcccac ttccacagcg tggtcggcga 3687001 agcgctcaaa gaaacgaccc agcaacgccg tttccacacc gacgcgaacg ccgtgccgcc 3687061 attctcgatc tatcagcacg ctcagcaaat gcctatgcag gtcatccatc gcgtcgtcac 3687121 gatcgtgcag ttgcgcggct tcctgcgggt cacggttcac cagcacttgt cttgcactgt 3687181 cacccaacgc gattgccacc ttcgccatgt cggcgaagca gttgcgaact tcctcaggaa 3687241 gcacctggtt cggatactcg cgtcgggtga tcttggcaat atgcacagcc aacgcaccca 3687301 tgcgctcggt gtcggcgatg atctgcaccg cactgaagat ttcccgcagc tcgccggcca 3687361 ccggatgttg caacgccagc agcgcgaacg cttccttttc gacttgggct cgcatcgcca 3687421 cgatccgctc atggtcacgg attacttgtt cagcggcgcc aatgtcggcc tcgagcagag 3687481 cctgcgttgc gcgtttcatc gctatcccgg ccaggctgca catctctccc aatcgtccgg 3687541 ccaactcggt tagccgctgg tgatagaccg tccgcatggt gtcacgcctc tctgaccctg 3687601 agtcgtcgtg tggtgctgcc gcggatccac accgccatca tcgaccatgg cggcaccgcg 3687661 cgacataccc gcttggcgta gccttcaatc caaaggcacc ggctcgagga tctcggcacg 3687721 cgcctcgggt gcgctggccc gcaacatgtc cgccgaaacg tcgtcgggct gggcctggga 3687781 gagcacctcg gcctccacgc gcgccatata gttcgcgacc tcgcggtcga tgtctgcggc 3687841 ggtccacccg agcacgggcg cgaccacctc ggccacctcc cgggcgcagt cgacgccccg 3687901 gtgcgggtat tcgatggaaa tccgcatccg acgggccagg atgtcctcga gatgcagggc 3687961 gccctcggcg gcggcggcgt aagcggcttc caccttcaaa tagcccggtg cctccgttat 3688021 cgggctcaac aggctgggat cggaggccgc catcgctaga acgtcgctga tcagcgaacc 3688081 atagcggtcc agcagatggc gcacccggta cgggtgcagg ccctgcagcg cgccgacgtg 3688141 ttcggcctga ttgaccagtg caaagtaacc gtcggcgccc agcaggctga ccttctcggt 3688201 gatcgacggc gcaacgcggg cggggatgaa ctgcacagca gcgtcgatcg cgtcggccgc 3688261 cattactcgg taggtggtgt acttgccacc ggcgatggcc accaggcccg ccgccggcac 3688321 agccacggcg tgttcccggg acagcttgga ggtgtcgtcg ctttccccgg caagcagcgg 3688381 ccgcagcccg gcgtacactc cgtcaatgtc ggcgtgcgtc aacggggtcg ccaacacggc 3688441 gttgacagtg cccaggatgt agtcgatgtc ggccttggtg gccgcggggt gcgccaggtc 3688501 gaggttccag tcggtatcgg tggttccgat gatccagtga cttccccacg gaatgacaaa 3688561 catcaccgac ttctccgtgc gcaggatcat cgcgacgtca ctgacaatcc ggtcccgcgg 3688621 caccaccaca tgcacgccct tggatgcgcg cacctggaag cgcccgcgct gtttggacaa 3688681 cgcttgaatc tcatcggtcc agaccccggt cgcgttgacc acgacgtggc cgcgaacctc 3688741 ggcaaccgcg ccgttctcgg agtcgcggac gcccacgccg atcacccggt caccctctcg 3688801 caacaaggcc actacctggg tggagcagcg gacaaccgcg ccgtaatgcg ccgcggtgcg 3688861 cgcgaccgtc atggtgtgcc gggcgtcgtc gacgacggtg tcgtagtaac ggataccacc 3688921 gatcagcgag ctgcgcttca agccggggct cagtcgcagc gcaccggcgc gagtaaaatg 3688981 ccgttgcgcc ggaaccgatt tcgcgccacc cagccggtcg taaagaaaga tacccgcggc 3689041 gatgtaggga cgctcccacc agcgtttggt cagcgggaac aaaaacggca gcggcttgac 3689101 caaatgcggt gccagcgtgg tcagcgacag ttcacgttca tagagcgcct cacgcaccag 3689161 cccgaactcc agttgctcga ggtagcgcag cccgccgtgg aacatcttcg aggagcggct 3689221 cgacgtgccg gaggccaagt cccgcgcctc gaccaacgcc accttgagcc cacgggtggc 3689281 agcatccaaa gcgcatccgg agcccactac tccgccgccg atcaccacga cgtcgaattg 3689341 ctcggttccg agtcgcttcc aggcgaccgc gcgctgtgca ggtcccagcg ccgcggcggg 3689401 ccacccctgc ccgccgtccg gtgcctggat tgggttgctc acgaaaccgg ctcctgtcag 3689461 ttactcgtcg gtaggtggtg tggcaccaag gctagttgtt cagccgcgtc ttgagctgcc 3689521 gtgcagtcca gatcgtcgtg cgccatcagc cggcgggccg cctcggttat cgaacccgac 3689581 aacgatgggt aaacggccag tgtctgggcc agctcgttga cggtgatgcg gttctgaacg 3689641 gctacggcga tgggcaggat cagctccgat gcgatcggcg ccaccaccac gccgccgatc 3689701 acaacgccgg tggaccgccg gcagaagatc ttgacgaacc cgtgacgcat ctccgacatc 3689761 ttggcgcgcg cgttggttcg taacggcagc atgatggtcc gggcggccac cgaaccggcg 3689821 tcgatgaccg attgcggcac cccgaccgcg gcgatctcgg gcctggtgaa aaccgtcgcg 3689881 gccaccgtgc gtaaccggat cgggctgacg ccctccccca gcgcgtggta catcgcgatg 3689941 cggccctgca ttgcggcgac cgacgccagg ggcagcaaac ccgtgcagtc gcccgcggcg 3690001 tagatgccgg tcgccaacgt ccgcgacacc cggtccacgg tcaggtaatt gccccggcca 3690061 agctggatgc cgacccgttc caggcccagg ccgctggtgt tgggcaccga cccgatggtc 3690121 atcagggcgt ggctgccctc gacggtgcga ccgtcggtca tcgtgacgag caccccggcc 3690181 ccggtgcggg tgaccgatgc tgcccgggca tttttgaaca gccggactcc ccgttcggcg 3690241 aacgactctt ccaggaccag cgcagcgtca gcgtcctcat acggcagcac gtggtcctgg 3690301 ctggccacca ccgtgaccgg cacccccaat tcggtatagg cgtccacgaa ctcagcaccg 3690361 gtaaccccgg agcccaccac gatgaggtgg tcgggcaacg cgtccaagtc gtagagctgc 3690421 cgccaggtca gaatgcgctc accgtccggc tgggccgacg gcaggatccg cgggctggcg 3690481 ccggtggcga ccagcacgac gtcggcctca tgctcactgg tggagccgtc ggcggcggtc 3690541 gccttaatgc gatggcgcgc cagacccggt gtggagtcga tcaactcgcc ccggccggcg 3690601 atcacctgaa cccccatgct gagcagctgg gcggtgatgt cggccgactg tgcggcggcc 3690661 agcgtcttga cccgggcatg gatttgcggc aacgagatct tggcgtcgtc gaagtcgata 3690721 tgaaagccca ggtgcggcgc tcggcgcagt tcggtacgca gcccggtgga ggcgatgaac 3690781 gtcttcgacg gcacacagtc gtccagtacg gcagccccgc cgatgccgtc gcagtcaatc 3690841 acggtaactt gggttgtttc cgggtgtgag gtggcggcca ccagtgcggc ctcgtaaccg 3690901 gccgggccgc caccgaggat cacgatgcgg gtcaccacag cccataacct agctcggcga 3690961 cgatgcacgc cgcgcagcgg cgtgaggagg agccgagcag tccaacacag ctcggcgacg 3691021 atgcacgccg cgcagcggcg tgaggaggag ccgagcagtc aagcacagct tgacgatgac 3691081 ccgcaccgca gcgcggcgcg atgggtacca cccgagcccc cgccgtctaa gctttccccc 3691141 gtgccgctct acgccgccta cgggtcgaac atgcatcccg agcagatgct cgagcgcgca 3691201 ccccactcgc cgatggccgg aaccggctgg ttacccgggt ggcggctgac gttcggcggc 3691261 gaggacatcg gctgggaagg ggcgcttgcc accgtcgtcg aagacccaga ttcgaaggtg 3691321 ttcgtcgtgc tctacgacat gaccccggcg gacgagaaga accttgaccg gtgggaaggc 3691381 tccgagttcg gcatccacca gaagatccga tgccgcgtgg agcgcatttc ctcggacacc 3691441 acaacggatc ccgtcctcgc gtggttgtac gttttggacg cctgggaggg tggcctgccg 3691501 tcggcccgct atctaggtgt gatggccgat gccgctgaga tcgcgggcgc gccaagtgat 3691561 tacgtacatg acttgcgtac tcgcccggcc cgcaacatcg gcccgggaac tattgcctaa 3691621 ttatcgcgag cgcccaggct aatgcgcggc ggcctgctcg atgatgttga ccatcacccg 3691681 cagcccgatc gccagggctc gctcgtcgat gtcgaacgtc ggctgatgca ggtccaactg 3691741 cagtccgtca ccggaccaca cgcccagtcg agccatcgcg ccgggaacct cctccaaata 3691801 ccaggagaag tcctcaccac cgccggactg ccgggtatcg gccagcacac ctgggccaat 3691861 agcctcaata gcgtgggcga gaatgcgtgt cgagatttcc tcgttgacca ccggcggcac 3691921 cccccgacgg tattgcagcg tgtgctcgat cgccaacggt aatagcaacg ccgaaatggc 3691981 ttggcggaca agctcctcaa ggtcaaccca ggtctgccgg ctggccgtgc gaacagtgcc 3692041 ggacagaact ccggtttgcg gaatggcgtt ggcggccata cccgcgttga ccgcgcccca 3692101 caccagcacg gtgctgttac gtgggtcgat gcgacgcgac agcaccccgg gcagcccggt 3692161 gaccagcgtg ccgagcccgt agacgaggtc ggcggtcaag tgtggacgcg acgtgtgccc 3692221 gcccggcgaa tacagcgtga tttctatcga gtcggccgcc gacgtgatgg ggccttgccg 3692281 aacggcgacc ttgccgactt caagccgggg atcgcagtgc agggcgaaga tccgcgacac 3692341 cccggccaac gcgccggccg cgatcgcgtc gatggcacca ccgggcatca gttcctcggc 3692401 cgcctggaag atcaaccgca cccccaccgg cagctccggt accgaagcca atgccaatgc 3692461 ggcacccagc aggatcgcgg tgtgcgcatc atggccacaa gcatgcgcga cgttgggcat 3692521 ggtcgaggcg tagggcgcgc cggtccgctc ggccatcggc agcgcatcca tatcggcgcg 3692581 cagcgcgatc cgcggctgat gctgaggacc gaagtcgcag gtgagtcccg ttccaccggg 3692641 cagcaccttg gggttcagcc ccgcgtcggc taaccgctcg gcgacgaact gggtagtggc 3692701 gtattcctga cggcccaact ccggatagcg gtggatgtgc cggcgccagc cgaccaggtc 3692761 gtcgtggtgg gcggctagcc atgattcggc ggcgtcggcg aggctcatcg cgccgccctg 3692821 cgctgctgcg cggccagcac ccggtcacgc tcatcaggag tctgcgcgag acggacaacc 3692881 gtgcgtgcca acatgatcgc gccgtcaacc accgcgcggt cggcgctggc accagcggaa 3692941 gcgacggtga aggcccgttg gtgcaccgtc gccgcgccgg cgtccaggcc gatcaccgga 3693001 tggatcccgg gcagcacctg cgtcacgttg cccatgtcgg tgctacccag cggcagctct 3693061 gcctccaagg ctggcagcaa cggctcgcgc cccagccgct gcatctcctc ccggcacacg 3693121 tcagccagcc acgggtcggg tttgagctcc gcgtatgccg gtgcagcctc gtcgatttcg 3693181 tattcgcacc cggcggccag cgcgccggcc gcaaagcagg cgaacattct ggtctgcagc 3693241 tcgcgcagcg aatccgattc gaccgcacgc atcgcatact gcagcctcgc ctgcccgggg 3693301 atgacattga ccgcctgccc gccgtcggtc acaatgccgt gcaccatttg cccgggcgcc 3693361 aattgctgtc gaagtacccc aatagcgacc tgcgccacgg tcacggcgtc ggcggcgtta 3693421 acccctaggt gcggcgcgac ggccgcgtgc gattccttac cccgatagcg cacggtgacc 3693481 tcggacaggg ccagtgatcg tgcgccggcg atatcggtcg gcccgggatg gaccatcacg 3693541 gccaccgcaa cgtcatcgaa cgtcccggcc tgcagcatca gcgccttacc gccgccggac 3693601 tcctcggcag gggtccccag cagagccacg gtcaagccca ggtcgtccgc cacctcagcc 3693661 agtgccagcg cggtgcccac agcggaggcc gcaataatgt tgtgcccgca ggcgtgtccg 3693721 atcccgggaa gcgcgtcgta ctcggcgcac actccgacaa ccaacggtcc gctgccgtag 3693781 tcggcgcgaa acgccgtgtc caacccaccg gcggccgtgg tgatctcgaa accgcgttcg 3693841 gcgaccagcg cctgagcctt ggcgcagctg cgatgctcgg cgaacgccag ctcgggctcg 3693901 gcgtggatgg catgggacag ctcgaccagc tcgccaccac ggcgccgcac caattcctcg 3693961 acgcggtcgg atgcgctggc tgctggcatg ctcgcagtat ctcatcgacg agcacccgct 3694021 ccccggcgag cggctcagtt aagctcgccc agtgtggctg acccgcgccc cgatcccgac 3694081 gaactggccc ggcgggcggc gcaggtcatc gctgaccgca ccgggatcgg cgaacatgac 3694141 gtcgcggtcg tgctcgggtc gggatggtta ccggccgttg cggcgttggg ctccccgacc 3694201 accgtgctgc cgcaggccga actgcccggg tttgtgccgc caaccgcagc cgggcatgcg 3694261 ggcgagctac tgtccgtgcc catcggtgcg caccgggtgc tggtgctggc cggtcgcatc 3694321 cacgcctacg agggacacga cctgcgctac gtcgtgcatc cggttcgggc ggcccgtgcg 3694381 gcaggggcgc agattatggt gctcaccaac gccgccggtg ggctgcgggc ggaccttcag 3694441 gtcggccagc cggtgctgat cagcgatcac ctgaacctga ccgcacgttc gccactggtt 3694501 ggcggggagt tcgtcgacct gaccgacgcc tactcaccgc gactgcggga actcgcccgc 3694561 caatccgacc cgcagctggc cgaaggcgtc tacgccggcc tgccggggcc gcactacgag 3694621 acaccggcgg agatccggat gttgcagaca ctgggcgccg acctggtcgg catgtccacg 3694681 gtgcacgaga ccatcgcggc ccgggcggcg ggcgctgagg tactgggcgt atccctggtg 3694741 acaaatctgg cggccgggat caccggcgag ccgctgagcc acgccgaggt gctcgccgcc 3694801 ggagccgcat cggcgactcg gatgggcgcg ctgctagccg acgtgatcgc ccggttctaa 3694861 gccgtgacgc cagagaattg gatcgcccac gacccggacc cgcagacggc cgccgagctc 3694921 gccgcctgcg gccccgacga gctgaaagcg cggttcagcc gcccactggc gttcggcacc 3694981 gcggggttgc gcgggcacct gcggggcggg ccggacgcga tgaacctggc ggtggtgttg 3695041 cgcgccacct gggcggtggc acgggtgctc acggatcgag gtctggctgg ttcgccggtg 3695101 atcgtggggc gcgacgctcg gcacggctca ccggcgtttg ccgctgcggc cgccgaagtg 3695161 cttgccgccg caggtttttc cgtgctgctt ctgcccgatc ccgcacccac cccggtggtg 3695221 gcgttcgcgg tgcggcacac cggcgccgcc gctgggatac agatcacggc gtcacacaac 3695281 ccggcgaccg acaacggcta caaggtctat gtcgacggcg gccttcagct cctcgcccct 3695341 accgaccggc agatcgaagc cgcgatggcc accgcgcccc cggccgatca gatcgccagg 3695401 aagaccgtca accccagtga aaaccgcgcc tccgatctga tcgaccgtta tatccagcgt 3695461 gcggccgggg tccgaaggtg cgccggttcg gtccgggtgg ccctgacgcc gctgcacggg 3695521 gttggcgggg cgatggccgt cgagaccctt cggcgagccg gtttcaccga ggtgcatacc 3695581 gtggcgacgc aattcgcgcc gaatcccgac ttccccaccg tgacattgcc gaaccccgag 3695641 gagcccggag ccaccgacgc actgctcacc ctggctaccg acgtggacgc cgacgtcgcg 3695701 atcgcgctgg atcccgatgc ggatcgctgc gcggtcggga tacccacggt gtcgggatgg 3695761 cggatgctgt ccggtgacga aaccggttgg ctactaggtg attacatctt gtcgcaaacc 3695821 gacgaccggg cgtcgccgcc ggaaaccagg gtggtggcca gcaccgtggt gtcgtcgcgg 3695881 atgctggcgg cgatcgccgc gcatcacgct gccgtgcacg tggagaccct caccggcttt 3695941 aagtggctgg cgcgcgccga tgcgaacctg cccggcaccc tggtgtacgc ctacgaggaa 3696001 gcgatcgggc actgcgtcga ccccaccgcg gtgcgtgaca aagacggcat cagcgccgcg 3696061 gtgttggtgt gcgatctggt ggccgcgctc aaaggccagg gtcgttcggt gaccgacgcg 3696121 ctcgacgagc tcgcccgatg ctacggcgtg catgaggttg ccgccctgtc acgccccgtg 3696181 agcggcgccg tcgagaccac cgacctgatg cgacggctcc gcgaggaccc gccgcgtcgg 3696241 ctggccggtt tccccgccac ggtcaccgat atcggcgaca cgctgatcct caccggcggc 3696301 gacgacaaca tgttggtcag ggtggcggtg cggccttctg gaacagaacc gaagctgaag 3696361 tgctacttgg agattcgctg cgcggtgacc ggtgacctac cagctgcccg acagctggtg 3696421 cgggcgagga tcgatgagct gtcggctagc gtgcggcggt ggtggtgact cagcgcgggc 3696481 cgaactggcg atcgccggca tcgccgagac cgggcacaat gtaggcgacc tcgttaagcc 3696541 cttcgtcgat ggccgcagtg aacaaccgca cgtttggcgc agccttctgc agcgccgcga 3696601 ttccttctgg cgccgcaacc acacacagca ccgtgatatc cgctgcaccg cgcgagatca 3696661 gcagaccgag ggtgtgcgtc atcgacccgc cggtggccac catcgggtca agcaccatga 3696721 ccggtacatc cgtcaggtcg tcgggcagcg agtccagata cggcaccggc tggtgggttt 3696781 gctcgtcgcg ggcgacaccg acaaagccaa cgtgcgcctc cggcaaggcg gcatgcgcct 3696841 cgtcgaccat ccccaacccc gcccgcaaca caggaaccag caggggtggc ttggttagcc 3696901 gcgacccgac cgtctcggcc agcggcgtac ggatcgggac tggctcgcag ggcgcatcgc 3696961 gggtggcctc atagatcaac agcagcgtga gctcgcgcag cgctgcccgg aagccggcgt 3697021 tgtcggtgcg ttcgtcacgc agcgtggtca gtcgggccgc ggccagtggg tggtcaacga 3697081 catggacctg cacggcgttg aaccctatat aacaatcgtg gctcggtccc ctaaaagggg 3697141 gctgatacgg gtgcgtccat ccgcgcgacc ggtcaacccc gtccatatac tcccggcatg 3697201 ctccgcggaa tccaggctct cagccggccc ctgaccaggg tataccgtgc cttggcggtg 3697261 atcggtgtcc tggcagcatc gttgctggcc tcatgggtcg gcgctgtccc acaagtgggt 3697321 ctggcagcga gtgccctgcc gaccttcgcg cacgtggtca tcgtggtgga ggagaaccgc 3697381 tcgcaggccg ccatcatcgg taacaagtcg gctcccttca tcaattcgct ggccgccaac 3697441 ggcgcgatga tggcccaggc gttcgccgaa acacacccga gcgaaccgaa ctacctggca 3697501 ctgttcgctg gcaacacatt cgggttgacg aagaacacct gccccgtcaa cggcggcgcg 3697561 ctgcccaacc tgggttctga gttgctcagc gccggttaca cattcatggg gttcgccgaa 3697621 gacttgcctg cggtcggctc cacggtgtgc agtgcgggca aatacgcacg caaacacgtg 3697681 ccgtgggtca acttcagtaa cgtgccgacg acactgtcgg tgccgttttc ggcatttccg 3697741 aagccgcaga attaccccgg cctgccgacg gtgtcgtttg tcatccctaa cgccgacaac 3697801 gacatgcacg acggctcgat cgcccaaggc gacgcctggc tgaaccgcca cctgtcggca 3697861 tatgccaact gggccaagac aaacaacagc ctgctcgttg tgacctggga cgaagacgac 3697921 ggcagcagcc gcaatcagat cccgacggtg ttctacggcg cgcacgtgcg gcccggaact 3697981 tacaacgaga ccatcagcca ctacaacgtg ctgtccacat tggagcagat ctacggactg 3698041 cccaagacgg gttatgcgac caatgctccg ccaataaccg atatttgggg cgactagccg 3698101 ccgtcgctat tctgtgccgc atggttgctg acctcgtacc catccgcttg agcctgtccg 3698161 ctggtgaccg ctacacgctg tgggctcctc gctggcggga tgccggcgac gagtgggagg 3698221 cgttcctggg caaagacgac gacctgtatg gcttcgagag cgtctctgac ctggtcgcgt 3698281 tcgtgcgcac cgacaccgag aacgacctgg tcgaccaccc ggcatggcaa gacctgaccg 3698341 gagcccacgc gcacaacctc aatccggccg aagacaatca gttcgacctg gtcgtcgtcg 3698401 aggaactgct ggctgagaag ccgacggcgg agtcagtggc cgcgctggcc gcctcattgg 3698461 cgatcgtatc cgccatcgga tcggtgtgcg aactggcggc agtgtcgaag ttcttcaacg 3698521 gcaatcccat cctgggcacg gtttccggcg ggctcgaaca cttcaccgga aaagccggca 3698581 ataaacgctg gaattcgatt gccgaggtca tcggacgcag ctgggacgac gtgctcgcgg 3698641 ccatcgacga gatcatcagc acccccgagg tcgacgctga gctgtcggaa aaggtcgccg 3698701 aggagttggc ggaggagccc gagggcgccg aggaagtggc ggcggaggtg gaggccacgc 3698761 aggacacgca ggaggcggcc gagtccgacg acgaggaagc cgacgcaccc ggtgacagtg 3698821 tcgtactggg cggcgatcgg gacttctggt tgcaggtggg catcgacccg atccagatca 3698881 tgacgggcac cgccaccttc tacacgcttc gctgttacct ggatgatcga ccgatcttcc 3698941 tgggccgcaa tggtcggatc agtgtgtttg gctccgagcg ggcattggcc cgctatcttg 3699001 ccgatgagca cgaccacgac ttgtcggacc tgagcaccta cgacgacatc cgcacggccg 3699061 ccaccgacgg ctcgctggcg gttgccgtta ccgacgacaa cgtctatgtg ctcagtgggc 3699121 tggtcgacga ttttgccgac gggccggacg cggtggaccg tgagcagctc gacctggccg 3699181 tcgagctgct ccgcgatatc ggcgactact ccgaggacag cgcagtcgac aaggcactcg 3699241 agacaacccg cccgctgggc cagctggtgg cctatgtgtt ggacccccac tcggtcggca 3699301 aacccacggc cccgtatgcg gcggctgtcc gtgaatggga gaaattggaa aggttcgtgg 3699361 agtcgcggct caggcgcgaa taggcaccgt cagccggcga aggctagccg ccgcggcgct 3699421 tgccgatgtc cagggcacac gcggcgagga tcgcatccca gtcttcgatg ttgaaatggc 3699481 ccttgccgtg cgcccagtgc aaatcaacgt gcggaatcgc gcgctgcagg tattcgccca 3699541 tggcgcgtgg cacgaaggag tcacgatcac ccagccagat atgggtaggc acggccacct 3699601 cggcgaggtc gaaaccccac ggccgaaatt gcagaaatga ttcataggct gcgccgcggc 3699661 tgccctgtcg gaacgcttcg agctggatgg cgcgcaggtg gcggccgaag cgttcgtcgc 3699721 tcagcaggtg cttgtcggcc gcggggaccg cagccgccaa caacgtagaa aacagcccgg 3699781 gcgtgtattt cgcgcaccag ccgagcgggg caaacaacgc accgaatagc cgcggcccgc 3699841 ttcgcgccaa ccgcgcgtag caccgatcgg ccgcgttgag gctgcgcatg atatccggcg 3699901 tcgccagtgg accccatggt ccgagcgcgc cgacgaacgc tagtcgggtc cgcgggatga 3699961 cggcaccgca ggcgaatagg tgcggtcccg cgcccgaatg cccgaccacc ccgaactcct 3700021 ccagctcgaa cgcgtcagcc agggcacaca cgtccgcggg ccaatcgcga aaattgcgtc 3700081 ccgcttgaaa ggtggagcgc ccgtacccgg gccgatcaat cgctatcagt cggaagccgg 3700141 tgcgccgcgc ggcaccatcg gcgaaggccc cctcgagccg cgaacttggc gtgccgtgga 3700201 agtagaacgc tgggtagccg gtgctatcac cccattccag gtaggcaagc gcccgcccgt 3700261 cgggcagcat gagcacatcc gcctcgtcgg tgcgaatgcg ctcgggcagc gatggcggtg 3700321 gcccggtcaa gagcacacca gcgatggtat gccgatcaga gtcgattcag cgcgcgtgcc 3700381 atgcacgagt cctcgaggaa ccgatagcgc ctaggctggg actgccgcaa ccacagccga 3700441 tccagcgccg aacgcacgat ccggcgaacg ggtgtgcggg taacagcctt gtcgatgtcg 3700501 atggtggagg cgctgtcgcc gttcatgaca ggttcccttc aagcgtcctg caagcggttg 3700561 ccaaagccgt cgcctatttt ctgtcatcgg acggcgcgat ccatcggcac gggagcgtaa 3700621 atctgccccg ccgggggtcg tagcttgccg ggggcacgcc cgggtttata cgcgtattcg 3700681 ctgatgcggc ccggtcaacg agcgctatgc gccgccaccg gcagccgggg gcggcggcgc 3700741 agcaccggga tcgtcaagca cgggaccttc gaggatgggt ccggggtagt cgcggctgtg 3700801 gtcggggccg tcgctgtcgc ggtggaagtc gtcatggcag gtgtagggat cccagttggg 3700861 cccccatgcg gggtcgaaag gctgccccgg gcaccagtag tagtcgggca ccggcgcggt 3700921 ttgggctgcg gactgcgcgc cgaccccgag acccgccaca cccgtggcca ggatgcacgc 3700981 cgccagcatg agcgtgcggc acgcgaaccg gtacatgcga tgacggtacg aaagcgatct 3701041 ggcaagcaac tggacgctag gtgcgatata ccagagaact tgctgattac tcgctgtgac 3701101 ccatgagcgc cgcgaaccgc ggcttgatca cttcgtcgat tatcgccagc cgctggtcga 3701161 acggaatgaa cgcggatttc atcgcattga cggtgaagcg cgccaggtcg ctccagccat 3701221 aaccgaaagc ctctaccaaa cgatgcattt cgaggctcat cgaggtgtcg ctcatcagcc 3701281 ggttgtcggt attgacggtc acccggaacc gggcccgagc cagtaggtcg aacggatgct 3701341 cggcgatgct tgcgaccgcg ccggtctgca cgttggagct ggggcacagc tccagcggaa 3701401 ttcgcttgtc ccgcaggata gctgccagcc gacccaactg gaaaccgccg tcggcatcca 3701461 cgtcgatgtc gtcgacgatc cgcaccccgt gacccagccg gtcggcaccg cagaaggcga 3701521 tcgcctcgtg gatggacggc aacccgaacg cctcaccggc atgaatcgtg aagcgcgcgt 3701581 tgtgatcacg catgtactcg aatgcatcca agtgccgggt tggcgggtgg ccggcctccg 3701641 cgccggcgat gtcgaatccg acaactccct tgtcccggaa ccggatcgcc aactctgcga 3701701 tctcccggga cattgcggcg tgccgcatcg cggtgaccag acagcggacg gtgatgggtt 3701761 gaccatcggc ggcacacgcc ttctcgccgg cggcgaagcc cgtcagaacg gtgtcgacga 3701821 cgtcgtcgaa cgacagcccg cagctgatgt gcagctccgg cgcgaaccgc acctcggcat 3701881 agaccaccga atcggcggcc aggtcttgcg cgcattcgaa ggcgacccga tacaaggcct 3701941 cgggagtctg catcaccgcc accgtgtgcg aaaacggttc caggtagcgc tccagcgagc 3702001 cgctgtgcga ctgggtgcga aaccaacttg ccagcgcgtc gacgtcagtt gccggcaggt 3702061 cgtcgtatcc gacctgcccg gcaatgtcca gcacggtggc cggccgcagc ccgccgtcga 3702121 ggtgatcgtg cagcaacgcc ttgggggcta gcctgatcgt ctgcagggtc ggcgcagcgg 3702181 tcatcagacg atccgatcga cgattagcgg ccgcacctgc ggcggactgt cccggatact 3702241 ccaaccgccg gccagctcgg ctcgcgccgc accaaagcgc tcgggagcat tcgtgtagag 3702301 ggtgaacaac ggctcaccga ccacaaccgg ctcccccggg cggcgatgaa tccgcacccc 3702361 cgcaccgtgc tgtacgcgtg cgcccgggcg ggacctgccc gcaccgagtc gccatgccgc 3702421 taaccccact gccatcgcat cgatgtcgcc cattgtgccg ctcgcgcccg ccgtgacggt 3702481 ttccgaatgc gaaccgatcg gcaacggttt cgacaagtca cctccctgcg cggcaaccaa 3702541 ccggcgaaac cggtccattg cggtgccgtc ccgcagcgtc tgggccgggt cccggccgtg 3702601 gatcccggca agctcgagca tctcgccggc cagccgcaac gtcagctcca ccacgtcggg 3702661 cggtccgccg ccggccagca cctccagcgc ctcggccacc tcgagcgcat tgccgacggt 3702721 tcgacccagc gggcagttca tctccgtcag cagggcacgg gtgggcacgc catgcgccgc 3702781 gcccagttcg accatggtgt gcgcaagttc gcgcgcctgc actggcgacc tcatgaaggc 3702841 cccggaacca accttgacgt cgagcaccag tgcacccgca ccctcagcca gcttcttgct 3702901 cataatcgaa ctggcgatca acggcagcga ttcgacggtg ccggtaatgt cgcgcagcgc 3702961 atacagcttg gcatcggctg gcgccagctg gccggcggcg aagatcgcgg cgccgacgtc 3703021 gcaaagctgc tcgcgcaccc gctggttgga cagattcgcg gtgaacccgg tgatggattc 3703081 cagcttgtcc agggtgccgc cggtgtggcc gagtccgcgg cccgacgcct ggggcactgc 3703141 gccaccgcag gcggcgacga cgggcaccaa tggcagcgtg attttgtcac ctaccccgcc 3703201 ggtggaatgc ttgtccacgg tcgctagtgg cagatcggtg aaatccagcc gggcacccga 3703261 ggccagcatg gccgccgtcc atctggcgat ctcgccgcgg tccatgcccc gccaaacgat 3703321 cgccatcagc agcgccgaca tctgttcgtc ggcgacccgg ccgtcggtat aggccttgac 3703381 gacccagtcg atggcggcgt cggacaaccg gccgccgtca cgtttggtgc ggatgacggt 3703441 cggggcgtcg aatgcgaagt cggtcaccgg cgttcccggg ggaggtcgtc gaggccgaag 3703501 gcgtcgggca gcaggtcgcc gagccggcgg ggtcgcaccg gatggtcgat cagtagctcg 3703561 gaacccccgt gttcgagcag cacctgacgg catcgcccgc acggcatcag cacggatcca 3703621 tggccgtcga cgcaggccag cgcgagcagc cggccgccgc cggtcgaatg cagggcgcac 3703681 accaccgcac attcggcgca caaagtcaag ccatacgaga cgttttccac gttgcatccg 3703741 gtcaccacgc gaccatcgtc gaccagtgcg gccgcaccca ccgcaaaccg cgaatacggc 3703801 acataggctc cggctgctgc ctgggttgca ttgccccgca gcatattcca atcgacatca 3703861 ggcattcggc aaccccgctc gtcgatgggc cgactaagaa aagccagcct aaccccggat 3703921 ccacacacga tcccgatcgg actgttcgac accgcgggca acctggccaa gttaagctcg 3703981 attgcccggc tctagctgtt cgatagtgct tttaaggggt ttgccagcgg tgaatacaac 3704041 ggcgacaacc gtctcgcgcg ggcggcggcc acctcggacc ctgtatcggg gagatcccgg 3704101 tatgtggtcg tgggtatgcc atcgcatcag cggcgcgacg attttcttct tcctgtttgt 3704161 ccatgtcctg gacgccgcca tgctgcgggt gagcccgcag acctacaacg cggtgctggc 3704221 gacctacaag accccgatcg tcggcctgat ggagtacggc ctagtcgccg cggtcctttt 3704281 tcacgcactg aacgggattc gggtcatctt gatcgatttc tggtcggaag gcccgcgcta 3704341 tcagcggctg atgttgtgga tcatcggcag cgtcttcctc ttgctgatgg ttccggcagg 3704401 cgtggtggtg ggcatccaca tgtgggagca cttccgatga gcgccccggt cagacagcgc 3704461 agccatgacc gtccagccag cctggacaac ccacgatcac cacggcggcg tgccggcatg 3704521 cccaacttcg agaaattcgc ctggctgttc atgcggtttt ccggtgttgt gttggtgttc 3704581 ctggcgatcg ggcacgtgtt catcatgctg atgtgggaca acggcgtgta tcgcctggac 3704641 ttcaacttcg ttgcccaacg ctgggcgtcg ccgttctggc agacctggga tctgctgttg 3704701 ttgtggctgg cgcagctgca cggcggcaac ggtctgcgca ccatcattga cgactacagc 3704761 cgcaaagaca ccacccgatt ctggctgaac tcgttgctgg tgttgtccat gctgttcacc 3704821 ctgatgctgg gaacctacgt gatagtgaca ttcgacccga acatctcctg aaaggcccgg 3704881 aaggagcaca tgatcacgcc acctctcccc cgcaagcggg cggtaccccc acctcatcgc 3704941 tgcggccccc tcgtcgcttc gcggctgggg gtgcccccac tgcatcgtcg gcggcggcgt 3705001 tgatctgcca acaccgatac gacgtggtga tcgtcggcgc gggcggtgcc gggatgcgcg 3705061 ccgcggtcga ggcgggtccg cgggtgcgta ccgcggtgct gaccaagctg tatcccaccc 3705121 gcagccacac cggcgcggcc cagggcggca tgtgcgccgc gctggccaac gtcgaggacg 3705181 acaactggga gtggcacacg ttcgacaccg tcaagggcgg cgactatctc gccgaccagg 3705241 acgccgtgga gatcatgtgc aaggaagcca tcgacgcggt gctcgacctg gagaagatgg 3705301 ggatgccgtt caaccgcacc cccgagggcc gcatcgacca gcgccgcttc ggcgggcaca 3705361 cccgcgacca cggcaaggcc ccggtgcgcc gggcctgcta cgcggccgat cgcaccggcc 3705421 acatgattct gcagacgctg tatcagaact gcgtcaagca cgacgtcgag ttcttcaacg 3705481 agttttacgc gctggatttg gctttgactc aaacgccgtc gggcccggtg gccaccgggg 3705541 tgatcgccta cgagctagcg accggtgaca tccatgtctt tcacgccaag gccgtcgtga 3705601 tcgcgaccgg cggctcgggc cgcatgtata agaccacgtc caacgcacac accctgaccg 3705661 gcgacggcat cggcatcgtg ttccgcaagg gacttccctt ggaggacatg gagtttcacc 3705721 agtttcaccc taccggcctg gccggtctgg gcatcttaat ctccgaagcg gtgcgcggcg 3705781 aaggcggccg gctgctcaac ggggaaggtg agcgtttcat ggagcgctac gccccgacga 3705841 tcgtcgacct agcgccccgc gacatcgtcg cccgctcgat ggtgctggaa gtgctggagg 3705901 gacgcggcgc cggaccgctc aaggactacg tctacatcga cgtccgccac ctgggcgagg 3705961 aagtgctcga ggccaagctg cccgacatca ccgagttcgc ccgcacctac ctgggcgtgg 3706021 atccggtcac cgagctggtg ccggtctacc cgacgtgcca ctacctgatg ggcggcatcc 3706081 cgaccacagt caccgggcag gtgctgcggg acaacaccag cgttgtcccg ggcctgtatg 3706141 cggccggcga gtgcgcgtgc gtgtcggtgc atggcgccaa ccggctgggc accaactcgc 3706201 tgttggatat caacgtcttc ggtcgtcggg ccggcatcgc cgccgccagt tatgcgcagg 3706261 gtcacgactt tgtcgacatg ccgcccaacc cggaggccat ggtggtgggc tgggtcagcg 3706321 acatcctgtc cgaacacgga aacgagcggg tcgccgacat tcgcggggcg ctgcagcagt 3706381 cgatggacaa caacgccgcg gtgttccgca ccgaggagac cctgaagcag gcgctcaccg 3706441 acatccacgc gctcaaggag cgctactccc gaatcacggt gcacgacaag gggaaacgct 3706501 tcaacaccga cctgctggaa gccatcgagc tgggattttt actggagctg gccgaggtca 3706561 cggtggtcgg cgctttgaat cgcaaggagt cccgcggcgg tcacgcccgc gaggactatc 3706621 ccaaccgcga cgacgtcaac tacatgcgac acaccatggc ctacaaggaa attggggccg 3706681 ataaggaggg ccccgagctg cgcagcgatg tccgccttga tttcaaaccc gtcgtgcaga 3706741 cccgttacga acccaaggaa cggaagtact aatgagcgtc gagccggacg tcgaaacttt 3706801 ggatccgccc ctaccgccgg taccggacgg cgcggtgatg gtgaccgtca agatcgcccg 3706861 gttcaacccc gacgaccccg acgcgttcgc ggccaccggc ggctggcaga gcttccgggt 3706921 gccctgtttg cccagcgatc ggctgctcaa cctgctcatc tacatcaagg gctacctcga 3706981 cggcacgctc accttccggc gatcctgcgc ccatggggtg tgcggctctg atgccatgcg 3707041 catcaacggg gtgaaccggc tggcctgcaa ggtgctgatg cgtgacctgc tgccgaagaa 3707101 gaagggcaaa tcgttgaccg tcacggtcga gccgatccgc gggctgccgg tggaaaagga 3707161 cctggtggtc gacatggagc cgttcttcga cgcctaccgg gcgatcaaac cgtacctgat 3707221 caccagcggc aacccgccca cccgcgaacg gatccagagc ccgaccgacc gcgcccgcta 3707281 cgacgacacc accaagtgca tcctgtgcgc gtgctgcacc accagctgcc cggtgttctg 3707341 gcacgagggc agctacttcg gcccggcggc gatcgtcaac gcgcaccgct tcatcttcga 3707401 cagccgcgac gaggccgccg ccgagcgcct cgacatcctc aacgaggtcg acggggtgtg 3707461 gcgctgccgc accacgttca actgcaccga atcctgccca cggggcattg aggtgaccaa 3707521 ggcgatccag gaggtcaagc gcgcgctgat gttcacccgc tgagggcttg cgcgagcaga 3707581 cgcaaaatcg cccgaaaacc agtggttttg ggcgattttg cgtctgctcg cgcagccggg 3707641 tctacagcgt tgccaggtgc tgtttggttg cgccaggaac cgcagtcaac gcaatcgact 3707701 gatcgaaggt gacaaatcgg ccatcatgag cgaccgcgag ggccagcaag tacgcgtcgg 3707761 tgacctgttt ggggctgtgc aggcgggaac gatcgatgac ctttgagtcg agaatgctga 3707821 cggtgcagga ccagaactcg tgatagcgcg tgtgcgtcgc acgagccaac aagtcgatgg 3707881 catgggctac cgagattggg ctgggatagc gcggttggct gatgacgcgg acgaacccgt 3707941 tttgggtgat cgcacaggaa gcccatcccc gctcgatctg cccggtgatc cacgctcggg 3708001 cgcgctcgtg gtcgacgtga tcgcggtcca acagcgccag tagcacgttg acgtccaaca 3708061 gcgctcgcat cgatcacacg gcctcctcgt cacgaagccg atcgatcagc gcgttcgata 3708121 ccgctccacc gcgatgaggc aggggttcga agccatgaaa ggcgtcctcc tggctcgccg 3708181 caggctgggg attctggttg gttaacgctt gccgggccag atccgacagg atttcacccg 3708241 cggtgcgctt ctccctgcgt gcccgttcct tcacggccag caatacatcg tcgtcgatgg 3708301 acaacgtggt gcgcatgcat cagatgctat cgcaccaatc tgggcgcaac gcgtctacag 3708361 gatggccagc gctcgcggca ttgagaatct ccttcgtggg tgcactccca cgcgaggtag 3708421 gggccgacga ccaccatcta tgcccctggc aacggtgagc gccgcgcgat catgatccgc 3708481 gacggcgccg aatcgcagtt accctgcccc tcgtgtacaa cggtgaagtc ggcaggaagc 3708541 agacacgctg gctctcccgg cttgacacgt cgcttcgcgc tggctgtgcc cgcctcggcg 3708601 ccactgagag ccagcgactc ccatgccaat acgccgcctg gcatcaccgc ctcacaggcg 3708661 cggtgaaata tcgccgcatc ccaaaagagc ctgctgagca ccagcgcgaa acgcgtctcg 3708721 ccgggttccc agcagcccaa gtcggcctgc acgaggttga gccgatcggc cacgcctcga 3708781 cgcacggcct cgctgtccag ctgcagcagc gcgacatcgg acacatcgat tgcggtgacc 3708841 tggcggccgt gggcggccaa cgccagtgcg gtacccgatc gaccgctggc taactccaga 3708901 acgggaccgt ccggaacgcc tgctctgagg acatcggcga gccaaggcac cggggcaaac 3708961 ggcgcgtgcg ccgaacccgc gcgttcgtat cgcgcgttcc agtcgacgcg gttggggtgc 3709021 tcccgcagcg ccggatccgt ctgcacgctc atggccgatt ggccacccac tcaacaccgt 3709081 cgagtgcgaa ctccttcttc catatcggca catcctgttt gagccgctcg atgcacatgc 3709141 gagcggcgtc gaacgcggcc gcgcggtgag gagccgaagc accgatgaca accgccgcat 3709201 caccgatgcg caattcaccg gtccggtgtg ccacggcaac tcgcacaccg tcggcctgtc 3709261 gttcacactc ttcgatgatg tccatcagcg tgcggtgcac catggccgga taggcctcgt 3709321 agtacaactt ggtcacttcg tggccgttgt tgttgttacg cacggtaccc acgaagatga 3709381 cggcgccgcc ctgggaaggt ccagatatcg cgttgagcac ttcatcgacg ctcagcggct 3709441 catcggtgag ccggcagtag acatcggagc ccccggcaac ctgcggtatg aacgccaccg 3709501 tgtcgccatc gtcgagaatc gttgatgctg gcgctatgga ttcgttaacg gccatccgca 3709561 ctcgcttgcg aaaatcagca agtggcggat agtcgatttg caattggtcg actaagccgt 3709621 cgacggtggt gccgctttcg agtgagatct tctcgtgagc gaccttgcac gcttcgcgaa 3709681 ccgcgccaaa gtagagcaca ttgacagtaa tcattcaaca tccatcctcg gtggagccac 3709741 catcgctggg tttgacgtcc gcgtcgtgcc gccggtaatg acccgatcgg ccaccgcttt 3709801 tttcgtccaa tctgatatcc gtgatcgtca tggcacggtc gactgctttg cacatgtcgt 3709861 aaaccgtgag cgctgtcacc gtaacggcgg tcaacgcctc catctccaca cccgtacgtg 3709921 ccaccgtggt caccgtcgcc gcaatcgaga gccggtccgc gccctgcggc tcgagcgtga 3709981 cggtgaccgc ctcgatcccc agcgggtgac acagcgggat aagctcaccg gtccgtttgg 3710041 ccgccataat gccggctatc cgtgcggtcg ctatgacatc gccctttgcc gcggtgccgt 3710101 gacagatcat gtccagggtc gacggtttca tcaggacggc cccggatgcc cgcgctcgcc 3710161 gcaaggtcac cgccttcgcc gacacatcga ccattcgggc ggcgccttgt tcatcaaggt 3710221 gggtaagcac cccatcgtgg tcgttcaccg tgccacctgc tggctgcatt gctcatcgtg 3710281 cactgcgctg aaagcctcgg cgaggtcgaa gtcgacgcga gtcaaacagt gcatctggcg 3710341 cgtccaacaa gtcaaccgca ccgaccgctt gttatggaca ctgaaccgcc ccggcatgtc 3710401 cggagactcc agttcttgga aaggatgggg tcatgtcagg tggttcatcg aggaggtacc 3710461 cgccggagct gcgtgagcgg gcggtgcgga tggtcgcaga gatccgcggt cagcacgatt 3710521 cggagtgggc agcgatcagt gaggtcgccc gtctacttgg tgttggctgc gcggagacgg 3710581 tgcgtaagtg ggtgcgccag gcgcaggtcg atgccggcgc acggcccggg accacgaccg 3710641 aagaatccgc tgagctgaag cgcttgcggc gggacaacgc cgaattgcga agggcgaacg 3710701 cgattttaaa gaccgcgtcg gctttcttcg cggccgagct cgaccggcca gcacgctaat 3710761 tacccggttc atcgccgatc atcagggcca ccgcgagggc cccgatggtt tgcggtgggg 3710821 tgtcgagtcg atctgcacac agctgaccga gctgggtgtg ccgatcgccc catcgaccta 3710881 ctacgaccac atcaaccggg agcccagccg ccgcgagctg cgcgatggcg aactcaagga 3710941 gcacatcagc cgcgtccacg ccgccaacta cggtgtttac ggtgcccgca aagtgtggct 3711001 aaccctgaac cgtgagggca tcgaggtggc cagatgcacc gtcgaacggc tgatgaccaa 3711061 actcggcctg tccgggacca cccgcggcaa agcccgcagg accacgatcg ctgatccggc 3711121 cacagcccgt cccgccgatc tcgtccagcg ccgcttcgga ccaccagcac ctaaccggct 3711181 gtgggtagca gacctcacct atgtgtcgac ctgggcaggg ttcgcctacg tggcctttgt 3711241 caccgacgcc tacgctcgca ggatcctggg ctggcgggtc gcttccacga tggccacctc 3711301 catggtcctc gacgcgatcg agcaagccat ctggacccgc caacaagaag gcgtactcga 3711361 cctgaaagac gttatccacc atacggatag gggatctcag tacacatcga tccggttcag 3711421 cgagcggctc gccgaggcag gcatccaacc gtcggtcgga gcggtcggaa gctcctatga 3711481 caatgcacta gccgagacga tcaacggcct atacaagacc gagctgatca aacccggcaa 3711541 gccctggcgg tccatcgagg atgtcgagtt ggccaccgcg cgctgggtcg actggttcaa 3711601 ccatcgccgc ctctaccagt actgcggcga cgtcccgccg gtcgaactcg aggctgccta 3711661 ctacgctcaa cgccagagac cagccgccgg ctgaggtctc agatcagaga gtctccggac 3711721 tcaccggggc ggttcagagg caaccaccat ggttgttgtt ggaaccgatg cgcacaagta 3711781 cagccacacc tttgtggcca ccgacgaagt gggtcgccaa ctcggtgaga agaccgtcaa 3711841 ggccaccacg gccgggcacg ccacagccat catgtgggcc cgtgaacagt tcggcctcga 3711901 gctgatctgg ggcatcgagg actgccgcaa catgtcggcg cgtctggagc gtgacctact 3711961 ggcggccggc cagcaggtgg tgcgggtacc caccaagctg atggcccaga cccgcaagtc 3712021 ggcgcgcagt cggggcaagt cggatccgat cgatgcgctg gcggtggcgc gggcggtgct 3712081 gcgtgaaacc gacctacccc tggccaccca cgacgagacg tcgcgggagt tgaagttgtt 3712141 gactgaccgt cgagatgtcc ttgtggccca acgcacgtcg gcgatcaacc ggttgcgctg 3712201 gctcgtccat gaactcgatc ccgagcgggc accggcagca cgctcgctcg atgccgccaa 3712261 gcaccagcag gccctgcgga cctggctgga cacccagcca ggattggtcg ccgaactcgc 3712321 gcgcgccgag ctgaccgaca tcatccggct caccggcgag atcaacaccc tagcccagcg 3712381 catcagcgcc cgagtccacc aggtcgcccc cgcactgctg gaaatccctg gctgcgcgga 3712441 gctgactgca gccaaaatcg tcggcgaagc cgccggagtg acccggttca aaagcgaagc 3712501 cgccttcgcc tgccatgccg cagtggctcc catcccggtg tggtcgggca acaccgccgg 3712561 ccagatgcgg ctcagccgct cgggcaaccg ccagctcaac gccgccctac accgcatcgc 3712621 actgacccaa atccggatga ccgacagccg gggccaggcc tactaccaaa ggctgcaaga 3712681 cgccgggaaa accaaacgcg cagcactacg ctgcctcaaa cgccgcctag cccgcaccgt 3712741 cttccaggcc ctgcgcaccg tccaccagcc cagctccgaa cacacccaac ccgcggccgc 3712801 ttgccatagg agctattgct cgtcacacct cggcgagcca cctcgtctaa cggatatgac 3712861 acagaaaacc cgcatccagc ccctacctcc caagcgagcc ggcctgttga tccgcgcact 3712921 gtatcggatc gccaagcggc gcttcggcga agttcccgag ccgttcacgg tcaccgcaca 3712981 tcatcggcgg ctgctgatcg ccaatgtggt gcacgaagcc ctgctgcagc gagcgtcgcg 3713041 gaagctaccg cccagcgtcc gtgagctggc ggtgttttgg accgcccgca gcatcggctg 3713101 ctcgtggtgc gtggacttcg gagccatgct gcagcgcctg gacgggctgg acgtggacag 3713161 gctcacggac atcgacaatt acgccacctc atcgaaattc agcgacgacg aacgcgccgc 3713221 catcgcctac gccgaggcga tgaccgcaga cccgcattcg gtgaccgacg agcaggtggc 3713281 cgacctgcgg gcccgcttcg gcgaggccgg cgtgatcgag ctgacttacc agatcggcgt 3713341 ggagaacatg cgagcccgga tgaattcggc gctgggcatc accgagcaag gcttcaattc 3713401 cggtgatgcc tgccgcgtcc cgtgggctgc gcccgacgtt ccttcagcgg agagccggtg 3713461 aacttgtcgg gattggcgat atcccacagc gcgcacacct ttccgtcgcg cacggttatc 3713521 gcggtgatcc gcggcgccat cgcccgatac ccgtcgaccc cgggtaagcc cgccgtgtag 3713581 gcgccgagct ctccgttgac cagcgccagc tgattcgcgc cgaagagccc cgggccgtaa 3713641 cgctggacca gcccgagtat gaaccggacc accttgtcgg atccgcggac ggcccgtacc 3713701 gctgtgggcg ccttgccatt cgaatcgccg gtaaacgtca cgtcgggatg cagcagcgac 3713761 accaccgtgt ccaggtcacc agcggccatg gcggccatca gccggccgac cacctcgttg 3713821 tgggccggat ccggatcccc cgatatcagg gcgggctgcg ccgtgacggc cttgcgggcc 3713881 cgcgacgcca gctggcgcgc ggcggcctcg ctggttccca gcacctcggc cacttcggca 3713941 aacggcacgg cgaacccgtc gtgcagcacg aacgcgaccc gctgatcggg gcgcagccgc 3714001 tccagcacca ccatggccgc gaacctggcg tcctcggcgg ccaccacggc ggccaacgga 3714061 tcggtcgcgt ccaagccggt gaccaccggt tcgggcagcc aggtgccggt gtaggtctcc 3714121 cgccggtgcg ccgccgacct caacttgtcc agacccagcc ggctcaccac ggtggtcagc 3714181 caggcccgcg ggtcggcgat cacggtgtcc ggtgagtccc agcgcagcca ggcctcctgc 3714241 acgatgtcct cagcatcggc gaccgtgccg gtcagcctgt aggcgaccga catgagatgc 3714301 tgtcgcagtg cctcgaattc ggaaacctcc atcgaggtca ttgcccgagc ctagcgctgc 3714361 gctcgccaac acgacgacac gaaacctttg gttgcacttc gcccggcacg gtgccggcat 3714421 ccaacacccg gtcatcgtcc gcggcgacgg cgtcaccatc ttcgacgacc gcggcaagag 3714481 ctatctggac gccttgtccg ggctgttcgt ggtgcaggtc ggttacggcc gggccgaact 3714541 cgccgaggcg gccgcgcggc aagccggcac gctggggtat ttcccgctct gggggtatgc 3714601 caccccgccg gcgatcgagc tcgccgagcg cctggcccgc tacgcgcccg gggacctaaa 3714661 ccgggtgttt ttcaccagcg gcggcaccga ggccgtcgaa accgcctgga aggtggccaa 3714721 gcagtacttc aagctcaccg gcaaaccggg caaacaaaag gtcatttcac gctcgatcgc 3714781 ctaccacggc accacccagg gcgcgctggc gatcaccggc ctgccattgt tcaaggcgcc 3714841 attcgaaccg ctgacgccgg gcggcttccg ggtgcccaac accaatttct accgagcacc 3714901 gttgcacacc gacctcaaag agttcgggcg atgggctgct gaccggatcg ccgaggccat 3714961 cgagttcgaa ggccccgaca ccgtggccgc ggtgtttttg gagccggtgc agaacgcggg 3715021 cggctgcatc ccggcgccgc cgggttattt cgaacgggtc cgcgagatct gtgaccgcta 3715081 cgacgtgctg ctggtctccg acgaggtgat ctgtgcgttc ggccggatcg ggtcgatgtt 3715141 cgcctgtgaa gacctcggct acgtgcccga catgatcacc tgcgccaagg gcctgacgtc 3715201 gggctactcg ccgctgggcg cgatgatcgc cagcgaccgg ttgttcgaac cgttcaacga 3715261 cggcgagacg atgttcgcac acggctacac gtttggcggt catccggtgt cggcggccgt 3715321 cggcctggcc aacctcgaca tcttcgagcg cgagggtctc agcgatcacg tcaagcggaa 3715381 ttcccccgcg ctgcgggcca ccctggagaa actgtacgac ctgcccatcg tcggcgacat 3715441 ccgcggcgag gggtatttct tcggcatcga actggtcaaa gaccaggcga ccaagcaaac 3715501 cttcaccgat gacgaacgcg cacgactgct aggccaggta tccgcggcgc tctttgaggc 3715561 cgggctgtac tgccgcaccg acgaccgcgg ggaccccgtc gtccaggtgg ctcccccgct 3715621 gattagcgga cagcccgagt tcgacaccat cgaaaccatc ctgcgcagcg tgctcaccga 3715681 caccggacgc aaatatcttc atctgtaact ttcgtcccgc cagtcacagc gcggctcctc 3715741 gcggtcgggc cgccgatcac ctactctgca cagacgatgg ccttcttacg ttcggtatcg 3715801 tgcctggcag cagccgtgtt tgcggtaggc accggaattg gtctacctac cgcggccggc 3715861 gaacccaatg ccgcaccggc ggcgtgcccg tacaaggtgt ccaccccacc cgccgtggac 3715921 tcgtcggagg ttcccgcggc cggtgaaccc ccactgccgc tggtggtacc ccccaccccg 3715981 gtcggcggca acgcgctggg cggctgcggc atcatcaccg cccctggcag cgcgccagcg 3716041 cccggcgacg tctcagccga ggcctggctg gtggcggacc tggacagcgg cgcggtgatc 3716101 gccgcccggg atccgcacgg ccggcaccgc ccggccagcg tcatcaaggt gctggtggcg 3716161 atggcgtcca tcaacacgct caccctcaac aagtcggtcg ccggaaccgc cgacgacgcg 3716221 gcggtcgagg gcaccaaagt cggggtgaac accggtggca cctacaccgt caaccagctg 3716281 ctgcacgggc tgctgatgca ctccggcaac gacgctgcgt acgcgctggc caggcagctc 3716341 ggcggcatgc cggccgcgct ggagaaaatc aatctgctgg ccgccaagct gggcggccgg 3716401 gacacccgag tggccacgcc gtccggactg gacgggcccg gcatgagcac gtcggcctat 3716461 gacatcggcc tgttctaccg gtacgcgtgg cagaacccgg tcttcgccga catcgtcgcg 3716521 acccgcacct tcgacttccc ggggcacggc gaccatccag gctacgagtt ggagaacgac 3716581 aaccagctgc tctacaacta tccgggcgcg ctcggcggca agaccggcta taccgacgac 3716641 gcggggcaga ccttcgtggg cgcggccaac cgcgacggcc ggcggctgat gacggtgctg 3716701 ctgcacggga cccggcagcc gatcccgccg tgggagcagg cggcgcacct gctcgactac 3716761 gggttcaaca ccccggcagg cacccagatc gggacactga tcgaacccga cccgtcgctg 3716821 atgtccaccg accgcaatcc cgccgaccgg caacgagtcg acccccaggc cgcggcgcgg 3716881 atatcggccg ccgacgccct tccggtgcgg gttggcgtgg ccgtcatcgg cgccctgatc 3716941 gtgttcgggt tgatcatggt cgcgcgggcg atgaaccgcc ggccgcagca ctagctgctt 3717001 accccgatac cttcggcgtc gtttgcgggc gggcatccta gccggccttg gtcggcaccg 3717061 aaatcggggc ttgaccagcg gttgaccgcg tgacgacgct gtggcagcct catcgaaatg 3717121 actacagccc tataccagga cgcggggttc acgcccgccg gggcgcccga cgaccccgac 3717181 cgcgtggtgg acgtgctgag cgccccggta ccggtcaact gaccagatcg gggcgccggg 3717241 cgctcctcgt cgggctcacc gccgccagcg tcggcgtcct ctacgggtac gacctttccg 3717301 ccatcgcggg tgcgttgctg tctctcagcg aggaattcga actcaccact cgagaacagg 3717361 agttgctgac caccacggcg gtgctcggcc agatcgccgg ggcgcttggc ggcggcatcc 3717421 tcgccaacgc gatcggacgc aagaaatcgg tggtgctcat cgtcgccggc tacgcagtgt 3717481 tcgccctgct cggcgcgacc tcggtgtccg taccgatgct ggtggtggcg cgtctgctgc 3717541 tgggtgtgac aatcggcctg tcggtggtgg tggtgccggt gtatgtggcc gagtcggcgc 3717601 cggcggcggt gcgtgggtcg ttggtgaccg cgtatcagct ggcgacgctt agcggcatcg 3717661 tcgtcggtta cctggtcggc tacctgttgg ccggatcgca cggctggcgc gcgatgttcg 3717721 ggctggccgc cgcgccggcc acgctgctgt tgccgttgtt gtggcgcatg cccgataccg 3717781 cccgctggta tctgctcaag ggccggatcg ccgacgcgcg tagcgcgctg cggcggatcc 3717841 agccggaggc cgacatcgat gccgagctgg ccgatatggc ggccgcggtc gacgaacgcg 3717901 gcggcggtat cggcgaaatg gtgcggcggc cgtatctgcg ggccacgctg ttcgtcatcg 3717961 cgctcggctt cctcgtccag atcaccggga tcaacgcgat catctactac agtccgcgac 3718021 ttttcgccgc catgggcttc gcgggctatt tcgcgatgct tgccctgccc gcgatggtgc 3718081 aagtcgccgg cttggcggcg gtgtgtgcct cgctgtttct ggtcgatcgg ctgggccgtc 3718141 gcccgatcct gttgtccggc atcgcgacga tgatcaccgc agatgccgtg ctgatcaccg 3718201 tattcgccaa cgactccgat ggtggcacgg ggctggtgtt ggggttcgcc ggcgtgctgc 3718261 tgttcatcat cgggttcaac ttcggattcg gctcgctggt ctgggtgtac gccgcggaga 3718321 gcttcccgtc ccggctgcgg tcgatgggat cgagcccgat gctcacctcg acactgacgg 3718381 ccaacgcgat cgttgccgcc ttctcgctca ccatgctgcg tgtgctcggc ggcgcaggcg 3718441 ttttcgcggt cttcggcacg ttcgccgtcg tcgcgttcgt ggtcgtgtac cgctttgcgc 3718501 cggagaccaa gggccgcaaa ctcgaggaga tccggcactt ctgggagaac ggcggccgct 3718561 ggcccgccga gcggtcaccg gcggcggacg aaccgtgacc gtgctcggcg ccgacgccgt 3718621 cgtcatcgac ggccggatat gccggccagg gtgggtgcac accgccgatg gtcggattct 3718681 ctccggtggc gctggggcac cgcccatgcc ggccgacgcg gaattccccg atgcgatcgt 3718741 ggtgcccggc tttgtcgata tgcatgtgca cggcgggggc ggcgcgtcgt tcgccgacgg 3718801 caacgccgca gacatcgccc gtgcggccga gtttcacctg cggcacggca ccactaccac 3718861 gctggccagt ctggtcaccg cgggccccgc cgagttgctc tccgccgtgg gcgctttggc 3718921 cgaggcaact cgggacggcg tcgtcgcggg catccatctg gaggggccgt ggctgagccc 3718981 agcgcggtgc ggagcgcacg accacacccg gatgcgtgcc ccggatcccg ccgagatcga 3719041 gtcggtgctc gccgccgccg acggcgccgt ccggatggtc acgttggcac ccgagttgcc 3719101 cggaagcgat gcggcgatcc ggcgcttccg tgacgccgaa gtggttgtcg ccgtggggca 3719161 tacggatgcg acctacacac agacccgaca cgccatcgac ctgggcgcga cagtcggcac 3719221 ccacctgttc aacgcgatgc cgccgctgga ccatcgggcg cccggacccg tgctggcgtt 3719281 gctgtgcgac ccgcgggtga ccgtcgaaat catcgccgac ggcgtgcacg tgcaccccgc 3719341 ggtggtgcac gcggtgatcg aagccgtcgg tcccgatcgg gtcgccgtgg tcaccgacgc 3719401 gatcgccgcg gccggatgcg gcgatggcgc gttccggctc ggcacaatgc cgatcgaggt 3719461 cgagtcgagc gtggcacggg tggctggtgc gtcgacgctg gcgggcagca ccaccaccat 3719521 ggatcagctc ttccggacgg tggctgggct cggctcgaag tcggactcag ccggcgatgt 3719581 ggcgctggcc gccgcggtgc aggtgacctc ggcgacgccg gcccgcgctc tcgggctcac 3719641 cggggtgggc cggctggcgg cgggctatgc cgccaatctt gttgtgctgg accgtgatct 3719701 gcgggtgacg gccgtcatgg tcaacgatga ctggcgggtg ggctgagcgt ccgtggaggc 3719761 ccgtcacaat gcccaggctc gcaccgtgag tactcggtca acgttgacgg ttgccccggc 3719821 gacccggtca ctctggcgag ggctaccggc gccgcgcggc ttgtaccgca atcatccgat 3719881 cgccgcgaag cgctcggcag ccggcttggg cggtagccga cgacacgggt acggtctcac 3719941 ggcgcgagcc tgataaagcc cggcggcatg ggtcgtgcag gcgacggctc taccggtccg 3720001 tcaccaccgc cgccaccacc gctgccggcg ccgccactgc cggcagcgcc cccggactgc 3720061 ggaacaccag caggcggctc aacctctggc ggcgggggcg gcggctgttg cggcggcgct 3720121 ggtcgcggtg gcggcggtgc cacgatcggc gggggtggaa tcagggtctg cgccgccggc 3720181 ggcggtaccg gaatcggcgg cggattcggt atcaggggat cccccgcgcg aaccgctccg 3720241 agcaccgagg caagcatcgc acccgtcggt tcccgccatc ccggcgacat gatggtcatg 3720301 tccgacaccg acgcccgcag gtcgcttccc gagttgaccg cgctgcgcgt ggacgccgca 3720361 acgcgatgcg tcggttcatt cgatcccggc tcgaaattgg ccatggcgaa cgccatcttg 3720421 ctgtgatggt tcgggcagta gatctccact gccgcactga taaatcgggt catggtcgtc 3720481 gtgaggcgga cagggtagag gcgcatgacc gggtctatgt tgtaggcatc gttgcgtaac 3720541 ccgtccacaa tgtcgttcac cggcatgccg ccatcgagtt tgcgacacac tttgtgggcc 3720601 gcgtcgatga cgcgaggcac attcgcgacg gcggggattt cctttttctc gagcagcgcc 3720661 agaaaccgat cgtcttggtt tgggtcggcc gctgctgggc cgtcgtgcag aattgcggcg 3720721 ccgatcagca ccactaaggc ggcacccagg gcgccggcat ggctagcgat gccggtgaac 3720781 atgatggggt ttccgttctg ctaaaagccg ttacctggcg ggctttggat cgcgatccac 3720841 gccataggtg tggctgtctg gtcaggtttg accggcgcca tgatgtcgtt tcacagcgcc 3720901 gatgcagtct gggaggggac cagggcatgg gtgcattgag gagccagatc cagagaacca 3720961 caccggagcc gctggccgag gctcatccac aagccttcga tcccgctccc gttgtcggca 3721021 tgggcgcctg ccgacggaat cagcggatgg tcatagtggc gtcgggcgcc aggcctgcgc 3721081 gggcacacgc ggtgcggtgt cgatggttgt tctcatctgg taactccttt ccgcaggccg 3721141 caattcagcg gtatgggctc accgagatca ggctcgtcac gatcgcccgc actgctggcg 3721201 gctcacatgt acccagtgtt aaccttctag tgcactagaa ggtcaagggg agtcgcatga 3721261 agatcagcga ggtagccgcg ctcaccaaca ccagcaccaa gaccctccgc ttctacgaga 3721321 actcggggct gctgccgccg cctgcacgca cagcatcggg gtatcgcaac tatggacccg 3721381 agatcgtgga tcggctgcgg tttatccatc ggggccaagc ggccgggctg gcattacagg 3721441 aagtacgcca aatcctggcc atccacgacc gcggcgaggc gccgtgcgca cacgtccgcc 3721501 aactactgag cacccgcatc gacgaagtcc gcgcgcagat cgccgaactg attgccctcg 3721561 aaggccactt gcagaccctg cttgaccacg cttcatatgg cccgcccacc gaacacgacc 3721621 actccacggt gtgttggatc ctggaaagcg acctcgatga gcccaccgcc atcgaggtca 3721681 gcgacattca cgcctagagg tcgctgggta cgcgggctgg cccacgggtt ttacgccgaa 3721741 gccgtcgccg cccacgcggt ggcgaacagg atcagccacg cggtgacgaa cgcgaacacc 3721801 atcaagccca gcaccggccc gaacaccgcg cccgccgggc tgcgcaacac tatctgcagg 3721861 tagatcgccc ccacctgctt gaacagctcg aagccgaccg ccgccatcaa cccggcccgc 3721921 gccgcggtga ccaaaccgac cggctcccgc ggcagccggc caatcatcca ggtgaacagc 3721981 acccacgaca ccagcaccga taccagcacc gagatgcccc gaaagatctc gtcgaacact 3722041 gaaaactggg gtatttcaag ccatctcagt accgcagcca tcggcctggc atggccgagc 3722101 acggtgagcg cgatggtggc cacgatcacc acgaacgtcc ccaccatggc cgctagatcc 3722161 gacagtttgg tgcgcaagta gcccgccgga gcgactggat gtgcccacat ctggctcaac 3722221 gcttcccgca ggtgccacat ccagcccagg cccacccagg ccgcggtcgc cagaccgatc 3722281 accccgaccg acgcgcgtgc atcgatcgcc gaattcatca ggtcgaccag ctgctgtccc 3722341 accgcaccgg agaccgaggt gcggatgcgc tcctcgagcg tggtcagcag ctccggacga 3722401 cgcgacaacg cgaatccacc caccccgaaa ccgaccatca gcaaaggaaa tatcgcaaag 3722461 atcgtgtagt aggtgagtcc ggccgcaaaa agactgccgt tgcgatcgtt aaagcgcgtg 3722521 aacgcacgca cgacatggtc caaccacccg aaccgggccc gcagccggtc aagcacccct 3722581 ggctcggcga gctcgcccat gatcgactgc cctacccccg ttatagaagg aacccgagcc 3722641 gatcgtagac tcgctgaacc gttttgctgg ccacatcgtg ggcgcgctgc gccccggcgg 3722701 cgagcacggc ctccagctcc gcgggatctg cggtcaattc gtcaactctg gcttggatcg 3722761 ggttgacgaa ttcgacgacg gcctcggcgg tgtctttctt caaatcgccg tagccgtgtc 3722821 cggcatagcc gtcgacgaga acgtcgatgt cggtcccggt gaccgccgac tggatgttca 3722881 acaggttaga cacccctggc ttgacgtccg ggtcatagcg gatgtcacgt tcgctgtcgg 3722941 tcacggcgga gcgaatcttc ttggcggaca atgccggatc gtcgagcagg ttgatcaaac 3723001 cggcatcggt gcccgccgat ttgctcatct ttgacgtcgg gtcttgtaga tcgtagattt 3723061 tggcggtcat cttggggatg agcacgtcgg gaaccaccag ggtgccgggg aatcggctgt 3723121 tgaaccgttg cgcgacgtcg cgggccagct cgaggtgctg ccgctgatcc tccccgacgg 3723181 gcaccagctc ggtgtcgtag gccaacacgt ccgcggcctg cagtaccggg taggtgaaca 3723241 ggccgacggt ggtggcctcg ctgccctgac gcgccgactt gtctttgaac tgggtcatcc 3723301 gcgacgcctg gccaaagccg gtgaaacaac ccagcaccca cgccagctgg gtgtgagccg 3723361 gcacctgact ttgcacgaag atggtggcgc ggccgggatc gattcccaac gccaggtatt 3723421 gcgcggcggt aatcagggtc cggcgccgca gtgcctcggg atcctgaggg atggtgatcg 3723481 catgcaggtc gaccacgcag aagaacgcat cgtggtcatc ctgcaagcca acccattggg 3723541 cgacggcgcc caaggcatta ccgaggtgaa gcgagtcaga cgtgggctgc acgccggaga 3723601 agatccggcg ggacccggta ggggtgctca tgatgccccg atcctttcac gcggggtgcc 3723661 ctccccgtcg accaccggtc accacgctgc ttgcggtacc ggcggtaccg gctttagtgt 3723721 cggctctatg cgcagtccga tacgcgtggg ttcgggagag ccggtcctac tgctacaccc 3723781 gttcttgatg tcccaaacgg tgtgggagaa ggtcgcccag cagctggccg acaccggccg 3723841 cttcgaggta tttgccccca cgatggccgg ccacaacggc ggaccggcct cgggcacccg 3723901 gttttgtcct cggcggtgct ggccgaccac gtcgaacgcc agctcgacga actgggctgg 3723961 gaaaccagcc atatcgtcgg caactcgttg ggcggctggg tcgcgttcga actcgaacga 3724021 cgtggccggg cacgcagcgt gaccggtatc gccccggcgg gcggttggac ccgctggagt 3724081 ccggtcaagt tcgaagtgat cgctaagttc atcgcagggg cgccgatctt ggccgtcgcc 3724141 cacattcttg gccaacgggc gcttcggctg ccgttcagcc gcctgctggc caccctgccg 3724201 atcagcgcca caccggacgg cgtgagcgag cgcgagctgt ccggcatcat cgacgacgcc 3724261 gcgcactgcc cggcctattt tcagctgctg gtcaaggcgc tggtgctgcc cgggctgcag 3724321 gagttggaac acaccgccgt gccctcgcac gtggtgctgt gcgagcagga ccgggtggtc 3724381 cctcccagca ggttcagccg tcatttcacc gactcactgc cggcgggcca ccggctcacc 3724441 gtgctcgacg gcgtcggtca cgttccgatg ttcgaggctc cggggcgcat cactgagctg 3724501 atcaccagct tcatcgaaga gtgctgcccg catgtccggg ccagttagcg ggcgcgagca 3724561 gacgcaaaat cgcccatttc ggcacgaaat tgggcgattt tgcgtctgct cgccctaatt 3724621 ggccagctcc ttttccaggt tgtcggcgat cgcatcgagg aattcctcgc tattcagcca 3724681 gtcctgctcc ggaccgatga ggatcgcgag gtccttggtc atcttcccgc tctccaccgt 3724741 ggcgatgacg acggactcca gcttgtgggc gaagtcgatg acttcgggag tgccatccag 3724801 cttgccgcga tgctgtaatc cgcgggtcca ggcaaagatc gacgcgatcg ggtttgttga 3724861 ggtcggttta ccggcctgat actgccggta atgccgggtg acggtgccgt gggcggcttc 3724921 ggcctcgact gtcttgccgt cggccgtcat cagcaccgac gtcatcaggc ccagcgagcc 3724981 gtagccctgt gcgacggtgt ccgactgcac gtcgccgtcg tagttcttgc acgcccagac 3725041 gtaaccgcct tcccatttca ggcaggcggc gaccatgtcg tcgatcaacc gatgctcgta 3725101 ggtcagcccc gccgcttcga actgcgcctt gaattcctct tcgtagacgc gctcgaactc 3725161 gtctttgaac atcccgtcgt aggccttgag gatggtgttc ttggtggaca gatataccgg 3725221 ccatttcgcg ttgaggccgt aggagaacga cgcgcgcgcg aaatcccgga tggattcctt 3725281 gaagttgtac atccccagca cgacgccgcc gtcctcgggg atggacacca tttcgtgcac 3725341 gatcggcgcg ctgccgtcgg cgggcgtgaa agtcagtgtg acggtgcccg gttggtcgac 3725401 cttgaagttc gtcgcccgat attggtcacc aaaagcgtgc cggccgatga cgatcggctt 3725461 ggtccacccc ggaaccagtc gcggcacatt agaaatcacg ataggttcgc gaaagattgt 3725521 gccgcccaag atgttccgga ttgtcccatt gggcgacagc cacatcttct tcaggttgaa 3725581 ttcctcgaca cgggcctcgt cgggggtgat cgtcgcgcac tttacgccca caccgtgttt 3725641 cttgatcgca tacgccgcgt cgatcgtcac ctggtcgtcg gtggcgtcgc ggtgctcgat 3725701 gcccaagtcg taatagtcca agcggatgtc gagataggga aggataagca tgtccttgat 3725761 gagcttccag atgacacggg tcatctcgtc accgtcgagc tctacgaccg gaccgctgac 3725821 ttttatcttg ggtgcgttgg acatgggagt ccacatcaga ttactagcag cccgcgcggg 3725881 cccctagcgg ccggtaaagg gccagttgag accgccggag ttgtgctttg agttggcact 3725941 gagtagctgc catgcgctag gcttcgagtc ggtcatgagc gccagcgtca agccccggct 3726001 tgctggccgg caaccctcca accgcggtgg ggtgccccgg gtgatgacca ggttgagtag 3726061 ccatcgccgg ctgcgcggca agcgcgggtc cgccatgacg ggcccctgac cagacgggga 3726121 aagctcatga gcgccgacag caatagcacc gacgccgatc cgaccgcgca ttggtcgttc 3726181 gaaaccaaac agatacacgc tggtcagcac cctgatccga ccaccaacgc ccgggctctg 3726241 ccgatctatg cgaccacgtc gtacaccttc gacgacaccg cgcacgccgc cgccctgttc 3726301 ggactggaaa ttccgggcaa tatctacacc cggatcggca accccaccac cgacgtcgtc 3726361 gagcagcgca tcgccgcgct cgagggcggt gtggccgcgc tgttcctgtc gtcggggcag 3726421 gccgcggaga cgttcgccat cttgaacctg gccggcgcgg gcgatcacat cgtgtccagc 3726481 ccgcgcctgt acggcggcac ctacaacctg ttccactatt cgctggccaa gctcggcatc 3726541 gaggtcagct tcgtcgacga tccggacgat ctggacacct ggcaggcggc ggtacggccc 3726601 aacaccaagg cgttcttcgc cgagaccatc tccaacccgc agatcgacct gctggacacc 3726661 ccggcggttt ccgaggtcgc ccatcgcaac ggggtgccgt tgatcgtcga caacaccatc 3726721 gccacgccat acctgatcca accgttggcc cagggcgccg acatcgtcgt gcattcggcc 3726781 accaagtacc tgggcgggca cggtgccgcc atcgcgggtg tgatcgtcga cggcggcaac 3726841 ttcgattgga cccagggccg cttccccggc ttcaccaccc ccgaccccag ctaccacggc 3726901 gtggtgttcg ccgagctggg tccaccggcg tttgcgctca aagctcgagt gcagctgctc 3726961 cgtgactacg gctcggcggc ttcgccgttc aacgcgttct tggtggcgca gggtctggaa 3727021 acgctgagcc tgcggatcga gcggcacgtc gccaacgcgc agcgcgtcgc cgagttcctg 3727081 gccgcccgcg acgacgtgct ttcggtcaac tatgcggggc tgccctcctc gccctggcat 3727141 gagcgggcca agaggctggc gcccaaggga accggggccg tgctgtcctt cgagttggcc 3727201 ggcggcatcg aggccggcaa ggcattcgtg aacgcgttga agctgcacag ccacgtcgcc 3727261 aacatcggtg acgtgcgctc gctggtgatc cacccggcat cgaccactca tgcccagctg 3727321 agcccggccg agcagctggc gaccggggtc agcccgggcc tggtgcgttt ggctgtgggc 3727381 atcgaaggta tcgacgatat cctggccgac ctggagcttg gctttgccgc ggcccgcaga 3727441 ttcagcgccg acccgcagtc cgtggcggcg ttctgaggaa ttctgacatg acgatctccg 3727501 atgtacccac ccagacgctg cccgccgaag gcgaaatcgg cctgatagac gtcggctcgc 3727561 tgcaactgga aagcggggcg gtgatcgacg atgtctgtat cgccgtgcaa cgctggggca 3727621 aattgtcgcc cgcacgggac aacgtggtgg tggtcttgca cgcgctcacc ggcgactcgc 3727681 acatcactgg acccgccgga cccggccacc ccacccccgg ctggtgggac ggggtggccg 3727741 ggccgggtgc gccgattgac accacccgct ggtgcgcggt agctaccaat gtgctcggcg 3727801 gctgccgcgg ctccaccggg cccagctcgc ttgcccgcga cggaaagcct tggggctcaa 3727861 gatttccgct gatctcgata cgtgaccagg tgcaggcgga cgtcgcggcg ctggccgcgc 3727921 tgggcatcac cgaggtcgcc gccgtcgtcg gcggctccat gggcggcgcc cgggccctgg 3727981 aatgggtggt cggctacccg gatcgggtcc gagccggatt gctgctggcg gtcggtgcgc 3728041 gtgccaccgc agaccagatc ggcacgcaga caacgcaaat cgcggccatc aaagccgacc 3728101 cggactggca gagcggcgac taccacgaga cggggagggc accagacgcc gggctgcgac 3728161 tcgcccgccg cttcgcgcac ctcacctacc gcggcgagat cgagctcgac acccggttcg 3728221 ccaaccacaa ccagggcaac gaggatccga cggccggcgg gcgctacgcg gtgcaaagtt 3728281 atctggaaca ccaaggagac aaactgttat cccggttcga cgccggcagc tacgtgattc 3728341 tcaccgaggc gctcaacagc cacgacgtcg gccgcggccg cggcggggtc tccgcggctc 3728401 tgcgcgcctg cccggtgccg gtggtggtgg gcggcatcac ctccgaccgg ctctacccgc 3728461 tgcgcctgca gcaggagctg gccgacctgc tgccgggctg cgccgggctg cgagtcgtcg 3728521 agtcggtcta cggacacgac ggcttcctgg tggaaaccga ggccgtgggc gaattgatcc 3728581 gccagacact gggattggct gatcgtgaag gcgcgtgtcg gcggtgacgt gctcccgacg 3728641 cgacatgtcc ctgtcgtttg gctccgcggt cggcgcctac gagcgcgggc gcccctcgta 3728701 tccaccggaa gccatcgact ggctgctgcc ggccgccgcc cgccgcgtgc tcgacctggg 3728761 agcgggcacc ggcaagctga ccacccggct agtcgagcgc ggcctggacg tggttgccgt 3728821 cgacccgatc ccggagatgc tggacgtgct gcgtgctgcg ctgccgcaaa ccgtcgcgct 3728881 gctgggcacc gccgaagaga ttccgttgga cgacaacagc gttgacgcgg tgttggtggc 3728941 tcaggcgtgg cactgggtgg atcccgcccg ggcgattccg gaggtcgccc gggtgttgcg 3729001 tccgggcggg cggctcggcc tggtgtggaa cacccgcgac gaacggctgg gctgggtgcg 3729061 cgagctgggt gagatcatcg gtcgcgacgg cgatccggtg cgcgacaggg tgacgctgcc 3729121 cgagccgttc actacggtgc agcgccatca ggtcgagtgg acgaattacc tgacaccaca 3729181 agcccttatc gacctggtgg cttcgcgcag ctattgcatc acctcaccgg cgcaggtccg 3729241 caccaaaacg ctcgaccggg tgcggcagtt gctggccacc catccggcgc tggcgaatag 3729301 caacggcctg gcgctgccct acgtcacggt ctgtgtgcgg gcgactctgg cctgacgccg 3729361 cctttagggc ccggtgccgg tgtaaatcag gcccgccagt tgctggccga cgttgccgaa 3729421 gccggagacc agggccgagg tgatcaggcc cagcgcgccg gtgttgtaca cacccgagat 3729481 gtccgcgccg cggttgagga tgccggagag ttgggtgccg aagttggcga agcccgacgc 3729541 cgatccgagc agcggatccg agatcgcgtt gagcacgccc gacatgcccg cgccgaggtt 3729601 gtggaagccc gacaacccgc cgccaccgcc gatgttgaag aaccccgacg acgggaccgc 3729661 ggtggtgttg ccgaatcccg ggacgggcgg gatgaccaac ccggcgttga tggggccgag 3729721 cagcgcgttg acgtcgagaa ccactgggat tcggtcgatg gtgatctcca gagggaaggc 3729781 gaaggcgggg gtggcgccgg acaacgcgag gcccagcggg agttggggaa tggtgatttc 3729841 cgggctcacg aagggtccga tggtgacgga caggggcagc tcgacatgga ttggatcgac 3729901 gggtatgtgg aatcccggga tggtgatttc cggtgttaga tgggtcacgc caagcgaact 3729961 cagcagcacg gtgaatggca gaatctcgct gggcgccgtt tggatggcgg ggacattaac 3730021 gttgatgaac cccagcagcg taaggctgaa tggatcgatg atggagcctg agctgaatat 3730081 cgggcccacg gtgacaccgg ttgcggggtc gagtcccagg gcgggaatcg tgatgtcctg 3730141 gacggtgatg gggccgaggt cgaagactgg gtcgatgcga accgtgatcg gggaaatgga 3730201 caccggcggg atggtgaagc cgccgatgtg gccggttgcg ctgaggtcca agggaattgc 3730261 cggaaattgg atcgacggaa cgatgatggg tccggcgccg ccggacgcgt ggatgttcgc 3730321 gacagtgaat tcgggaatga tggtgctggt gtaggagaag ccgagcaggc cctggtagtc 3730381 gccccgccag aaggcgccgt tgctgtagtt gccggagatg aaggcgccgg tgttgacgtc 3730441 gccggagttg gccaccccgg tgttgatgtc accggtgttc aaccaacccg tgttgacact 3730501 gcccgggttg aaaccgcccg tattggcctg ccccgcgttg aaactgccgg tgttgtagct 3730561 acccgcattg accacacccg tgttgaaccc acccgcgttg aacaaccccg tgctggcaat 3730621 ccccgaatta ccgatcccgg tgttataact ccccgaattg aacacccccc agttcccggt 3730681 gccagagtta aagaacccca cattaccggt ccccgaatta aacaacccca cattcccgct 3730741 gccggtattg aaaccaccga acccggtcag attatcaccg gtcaacccaa taccgaaatt 3730801 cccactgccg gtgttagcga acccaatatt gcccacaccc atattcgcca aaccgaaatt 3730861 gtagctgccg gcattaccaa acccgatatt acccaaaccc atcagacccg gcgttaaccc 3730921 cgaattcccg agcccaaagt tgccccaccc gacattgccc aacccgacat tgttgccgcc 3730981 gatattgccg ccacccacat tgaacccacc gacgttgccc gcacccaggt taaagtcccc 3731041 gacattgccc aacccgacat tgcccaaccc cacatcggcc aacccgaaat tgaggaccag 3731101 accctgatgc agcgccgtcc cgctcgccaa caatcccgac aactgctgac cgacactacc 3731161 caaacccgac accaacgccg gcgcacccaa ccccaacacg ctggtgttga acagccccga 3731221 catgccagag ccgaaattca gcacacccga atgcagcgtg ccggcgttga aaacacccga 3731281 acccccaccc agcaacgccg acggagcctg attccagcca cccgacacca tcgcgccgac 3731341 attcccaaac cccgacaccc cacccgcacc ggagttgaag aaacccgacg acggagcacc 3731401 ggtcgtattc ccgaaccccg gcacggcggg aaggtcgatg aggatgtgaa cggggccgag 3731461 cgtgctgtgg gccacgaggt caaaggggat ttcgccgatg gtgattgccg gaatggtgac 3731521 ggcgccggtg ccaccggaca ggttgatgct cagcgggttc atcgcgggga tcgtgaggcc 3731581 gcccgggaag atgtcgacgg gctcgctgtg gccggtaatg ctggccagca gcgggatctc 3731641 gtcaatggtg acgacggggg tgctgaacgg caggttggcc aggaaagccg tgatggtccc 3731701 ttgcgacgag ctagcaccga tgactatctg gcttaacgcc aggggggtaa ggccgatggg 3731761 ggtgttgaag agtcccgtaa tcggaccgat tttcaggggc ccgccgggtt gtgagccaaa 3731821 caagtaattc agcgtgacgg gcacccgtgg aatatcgagg tgcgggacgg tgatggggcc 3731881 gaggccgacg ctgaccgtgg tggcggccag gtcgatctgg ggaatcggga tgctcggcac 3731941 agtgaagctg tcgatggcga cgttggcgct gaactcgggg cggatcgcgg gaatgtcgat 3732001 ggcggggata acgacggagc ccagtccgcc ggtgagggtg aggtccagga acggcgtttg 3732061 gggaagcacg gcggggcggt aggagaagcc gagcaggccc tggtagtcgc cccgccagaa 3732121 ggcgccgttg ctgtagttac cggagatgaa ggcgccggtg ttgacgtcgc cggagttggc 3732181 caccccggtg ttgatgtcac cggtgttcaa ccaacccgtg ttgacactgc ccgggttgaa 3732241 accgcccgta ttggcctgcc ccgcgttgaa actgccggtg ttgtagctac ccgcattgac 3732301 cacacccgtg ttgaacccac ccgcgttgaa caaccccgtg ctggcaatcc ccgaattacc 3732361 gatcccggta ttataactcc ccgaattgaa caccccccag ttcccggtgc cagagttaaa 3732421 gaaccccaca ttaccggtcc ccgaattaaa caaccccaca ttcccgctgc cggtattgaa 3732481 accaccgaac ccggtcagat tatcaccggt caacccaata ccgaaattcc cactgccggt 3732541 gttagcgaac ccaatattgc ccacacccat attcgccaaa ccgaaattgt agctgccggc 3732601 attaccaaac ccgatattac ccaaacccat cagacccggc gttaaccccg aattcgccaa 3732661 cccgacattg ccaaacccga cattgcccaa cccgacattg ttgccgccga tattgccgcc 3732721 acccacattg aacccaccga cgttgcccgc acccaggtta aagtccccga cattgcccaa 3732781 cccgacattg ccgccaccga ggttgctcaa ccccacgttc gggccgacga tcccgaccgc 3732841 ggaattgaag cccgagatca ggttgttggc gatgctcccg tcgaacaggc ccaacagtcc 3732901 cacacccagg cccgggacag ccaaaccgct gaagggatcc gacgtggtgg tggtggagtt 3732961 ccctgagccc ggctcggtga tgatcgggat gttgatgggg cccaccggga ttgtgacgtc 3733021 cacgttcagc ggaattgcgg gcagcacggt ggccgggatg aagacggcgt cctcgaggtt 3733081 gatggacacg tcgataggca ggatttcgtg cagaatcatt gactttacgg tggatgccgg 3733141 ggaaccgaaa gagaagttga gcggtatgga ttcactgaca gtgggcaacg ggatactgag 3733201 tcccgccatg gtgatgggaa tagaacttcc cggaattaca atcggattca gttcgatgcc 3733261 gtctctgaag tcaaacaaga aaagagtctg accgaccgac atgaacagct gggcgggctg 3733321 ggtctgtata ttcgtgattt ggattccgga gatatcgatg cttcccgtga tgcccaggcc 3733381 ggacagcagg gtagtggccg gggcgttaaa actcacattg acgtttccgt cgaggccaaa 3733441 attgatggcg gggatgggga tgtccgggac ggtaaagggg ccgacctcga ggtttcccgt 3733501 gacggtcagg aggggattta gcgcatccac aacggtggtg gtcgggatgc tgatggggcc 3733561 gatgccgccg ttgagggtga agtgaaatgg aaacagcccg ctggtgaggc caaagccgcc 3733621 tgggaccgcc ggaatggggc cgttggccgg ggttggcggg atgtagtccc accggaacgg 3733681 gaaagggcca atagaaaggg tggtgtgcag gtccaccggg atgcggtcaa ccgtgaaacc 3733741 ctgcgggaac acggtgaatc caccggtgcc gacggagaag ttggtgaggc tgaccacggg 3733801 gttttccggg aacgccaggc cgcccgggaa tagcgtgatg ctgtccaggc cgccggtcag 3733861 gttgacggtc accggtgttt ggtcgggaac ggtgaggccg gccgggaaca aggccaagga 3733921 cgatgtggac agattgaaag tcgcgccgaa cgggccgggg atcgtgcccg ggccgccgta 3733981 gctgccgatg atgggtccat tgatctgcag gtcgctgatg ctgaggtaga acgacccgga 3734041 ggggaatttc gcgccgggtg ggcctagcgg cgggccgtag tggtcgatcg tgatgaacgg 3734101 gtccggcaag acgaccgggt ccgcggtgat ttctgccatg gcggtttgcc cgaaaagaac 3734161 aaacgcggga ttcacgtgaa aaccctcgag gccgacggtt ccggtcacgt ggatcgggat 3734221 cgcgggaatg gtgatctccg ggagagtgaa ttcgcggatc ccgatgaatc ccccggtgat 3734281 ttgtatgtcg aatgccggaa tatcgatggg ctggacgtgg atgggaccga tcccgccaat 3734341 cacctgcagg tcaatgggga tttcggaaat ggtgaaaagg gtgccggggg tgaagggggc 3734401 caggacgttg atgttgttgc ccgttaagaa gaaaccggtg ttgtggcttc ccgaattgaa 3734461 tacgcccaaa ttcccggtgc cggagttgaa gaacccgaca ttgccggtac ccgaattgaa 3734521 caatcccaca ttctcgctgc ccgaattgaa accaccgaac ccagtcagat tgtccccgct 3734581 gagcccgata ccgatattcc cgttgccggt attggccaac ccgatgttgc cgatgcccat 3734641 gttcgccagg ccgaaattgc tgctgccggc attgccgaac ccgacgttgt cgaacccgat 3734701 attgcccaat ccgaagttgt tgccgcccag cgcgccgccc gacaacatcc ccgacaactg 3734761 agtacctaca ttgccgatac ccgacatcaa cgtgccggag ttgaaatagc ccgaaaccgt 3734821 tcccggcaac acctgcatgg cctgggtgga ctggttaaac cagcccgagg tgtgcgcgcc 3734881 gacgttcccg aatcccgaca ccccgccggc gccggtgtta aagaagcccg aggacggggc 3734941 ggtggtcgaa ttcccgaacc ccggcgacgc cggaacgttg ccgcccacga tgtcgacggg 3735001 cccgacgccg ccgatggcgt gcaggttcag ggggatgttg tcgatggtga ttgccggggt 3735061 gctcagggcg ttgatgtggc caatcacgtt gatcgccagc ggaagtggtt gctcgggaat 3735121 cgagaatccc ggaatggtga aggcctcggt gcctgccgtt acgccaagag tcagggtgag 3735181 cggccccccg gtgggaatgc tgaggccaac cgggaaaagg gtgagggctg gggtggaata 3735241 actgaaggtt actgggatgg aaaacccggt attgatatgt attgggccga tcaaggttgt 3735301 gggaatgggg gaagggctga gggcgacctg ttggatttgg ggaattgtta tggacgagac 3735361 gggccaggcc agcgtgatgg tttggttgaa gttttgtgcc ggccacaggg tgatgggatt 3735421 gattttgatg gggccgatcg aaatattggg tatgccgacg ccgagcgaga ttgccgggac 3735481 gttgatgggc gggacgacca agggtccgag gtagagggtt tcgttgatgt tgatcgggat 3735541 gtcgggaagt atgtggatgg gctcgatagt gatggcgccg acaccaccgt ttatgtccag 3735601 gctgagggga atgacaggaa gaacgttcgc tcccgaggag aagccgagca ggccctggta 3735661 gtcgccccgc cacaagacgc cgttgctgta gttaccggag atgaaggcac cggtgttgac 3735721 gtcgccggag ttggccaccc cggtgttgat gtcaccggtg ttcaaccaac ccgtgttgac 3735781 actgcccggg ttgaaaccgc ccgtattggc ctcccccgcg ttgaaactgc cggtgttgta 3735841 gctacccgca ttgaccacac ccgtattgaa cccacccgcg ttgaacaacc ccgtgctggc 3735901 aatccccgaa ttaccgatcc cggtgttata gctccccgaa ttgaacaccc cccagttccc 3735961 ggtgccggag ttaaagaacc ccacattacc ggtccccgaa ttaaacaacc ccacattccc 3736021 gctaccggta ttgaaaccac cgaacccggt cagattatca ccggtcaacc caataccgaa 3736081 attcccactg ccggtgttag cgaacccaat attgcccaca cccatattcg ccaaaccgaa 3736141 attgtagctg ccggcattac caaacccgat attacccaga cccatcagac ccggcgttaa 3736201 ccccgaattc ccgagcccaa agttgcccca cccgacattg cccaacccga cattgttgcc 3736261 accgatattg ccgccgccca cgttgtagct cccgacgttg ccggccccca cgttgtagct 3736321 gccgacgttg ccgcttcccg cgttgaagag gccaacgttg gccaaaccca gattgacggc 3736381 gagcgacttg gccggctcgg cggcggccgc caggcttgcc agcggcgagc caaacggcgc 3736441 caacgcctcg gccgccgccg aggcgccggt gtggtacccc agcatcgcgg ccacgtcctg 3736501 ggcccacatc agctcgtagt cgaactccgc ggccgcgatc gccggcgtgt tctggccgaa 3736561 cagattcgat aacgccagcg acactaacct cgaccgattg gccgcgatga cgaaggggtc 3736621 caccgtctcg gccaacgccg cctcgaacac acccaccacc gcccgggcct gcccggccgc 3736681 cgactcggcg gaggccgccg ccgcgctcaa ccaccccgca tacggggcgg ccgccgccgc 3736741 catcgcgacc gaggacggcc cctgccagat accaccgacc agccccgagg tcaccgaccc 3736801 gaaagccgcc gccgccgagc ccagctcggc ggccagctca tcccaggccg cggccgccgc 3736861 caacaggggg cccggacccg ccccggtata tatcagcagg gagttgatct ctggcggcat 3736921 tacgacaaaa ctcatgccgc cagccctttc ccgtgcgttc ccaacatcgc tgtcaaccgg 3736981 tgatcagggt gttgcgccgg cgccgccgag gccgccgtcg ccgccgaacc ctggctccgt 3737041 gcctgagttg ggctggccgg cctgcccttt gccgccggcg ccgccggcct tggcgccgct 3737101 gttgccgccg ttgccgccgt caccgccgtc accgccgtca ccgccgaggc cggtcgcgct 3737161 ctgagtgccg ccgccaatgc cgccctggcc acccttaccg ccgttgccac cgaagccgcc 3737221 gtccggggcg ttgcctccgc caccgcccgc gccgccaagg ccgccgttgc cgccggtgga 3737281 gccgccgcca ttgccgccct gcccaccgag gccgccctgg ccgccggcac cggcaaagac 3737341 gccgtcgccg ccccggccgc cgacaccgcc gttgccgccg ccaccggcca cggtgccgac 3737401 ggtaccgccg ccgttggggc cgccctgacc gccgtcgccg ccgaagccgc ccttgccgcc 3737461 gaaaaagccg ctgccgccgg cgccgccggc gccgccgcca ccgccgctgc cgccttgggt 3737521 gacggagctg ttgccgccga cgccgtcacc gccgtggcca ccgtcgccgc ccttgccgcc 3737581 ctcgccggag ctaaggctgc cgtttccgcc ggcgccgcca gcgccaccgg ccccaccgga 3737641 accgccgacg atgccgctgt tggcgccgat cgagcccccg ttgccgccgg caccgccgtt 3737701 gccgcccttg ccgccgtcgc cacctgagcc gttggggttg ctgccaccgg cgccgccctt 3737761 gccgccgttg ccgccggggg cgcccgtgac cccgatggag gcggggccgc tggtagcgcc 3737821 gaagctccca tcaccgccat tgccaccggc gccgcccttg ccgcctgagc cggtggcgtt 3737881 acccccggcg ccaccgttgc cgccggagcc gccggcgccg ccgcggctgc cgctgcccgg 3737941 gttggtggca ggcccaccgt ggtcaccgtt gcccccgtcg ccgcccttgc cgccaagcac 3738001 gacgccggtg ccgccggcgc cgccgttgcc gccgttgccg ccggcgccgc cgccaatgcc 3738061 gctgccgctg cccccggtgc caccgaaccc accctggcca cctgcgccgc cggcgccgcc 3738121 cgtgtcgccg ctgccgccgg cgccgccgtg gccgccgtta ccggcgttgc caccgcgagc 3738181 gttgccgttg ctggaaccgc cgttggcgcc agcgccgccc ttgccgcccg cgccgccggt 3738241 ggagccaggg ccgacaccgt cgccgccctt gccgccattg ccgcctgagc cggcgttgcc 3738301 ggcatcgcca ccgccaccgt tgccgccggc accgccgttg ccaccggcac caccggcgcc 3738361 gccgttgccg gccgagccag cgccgccgtt gccaccggca ccaccgctgc cgccgtgggc 3738421 cgccggactg gcctgtgctc aggctgcccc cgccagcacc ggcgccgccg ttgccgccgg 3738481 ccgcgccggc gccgcccgtg gtgccgctgc caccgctgcc gccgctgccg ccgtggccgg 3738541 cggcgctgga agtgccgccg ccgttgccgc cggcgccggc ggcaccaccg gccaagcccg 3738601 cgacgccggt gctgttgccg gagttgccgc cgttgccgcc gttgccgccg tcgccgccgg 3738661 tggcaccgcc gccgtggccg ccgttgccgg cgctgccgcc ggcaccgccc tggccgccgg 3738721 cgcccgcgga gccgttgccg ccgttgccgc cattgccgcc gttgccgccg tggccggcgg 3738781 tgacgttgac gacgcctgag ccgctggcgg caccgctgct gccgttgccg cccttgccgc 3738841 cggcgccgcc cgtcgtgccg tcgccgccgt ggccgccgtt gccgccgttg ccgccgtcgc 3738901 cgcccacagc gttgccgaag gacacgccgg cgacacccgc gttgccgccg gccccgccag 3738961 caccgcccgc gccgttgagg ccagtgcccc cattaccgcc ggcaccaccg gagccggcgt 3739021 tgccggtggt cgtgcttttg ctgctaccgc cgttaccgcc agcgccaccg gcccctccgg 3739081 caccgcccgc gtcggtgccg ataccgccat tgccgcccgc gccgccggag ccggcgtcac 3739141 cgcccaaacc gacgttcccg ccgtcgccgc cgttgccgcc cttgccgccg gcgccgccgt 3739201 cgccgcccgt ggtgctgacg ccgccgttgc cgccggcgcc gccgttgccg ccgaggccgc 3739261 cattgccttc ggggcctccc ggaccgccgt agccgccgtt gccgccggcg ccgccaaacc 3739321 cagtctcgga gacgccgccg ttgccgccga ggccgccgtt gccgcctaag gaaatgccgc 3739381 caccgccgtc gccgccgcta ccgccgttgc cgcctgtgcg cccttccccg ccgatgccgc 3739441 cctggccgcc gaagccgccg accccgccgg caccgccgtc cccgccggcg ccgccgacac 3739501 cgccaacacc gctagcaaag tcgcccgcgc cgccgggacc gccggcgccg cctgggccac 3739561 ccaacccggt gctagcgaag ccgccggcac cgccattgcc gccagcgccg cccgttgtcg 3739621 cggcgacgtc aacggcgccg ccaccgccgg cgccgccgaa gccgccgagg ccgccgttga 3739681 tcatgccggc accgccattg ccgccgttac cgcctttgcc gcccgtgccg aagaagccgg 3739741 cctggttcag cgccccaccg ccgttgccgc cgttgccggc gtcaccgccg ttgaggccgg 3739801 agccgccgtt gccgccgttg ccgccggccg cgccgctccc gttgccggcg gtgccgccct 3739861 tgccgccgtt gccgccattg ccgccgttac cgccgttggg ggtgatgccg tcggtgccgt 3739921 ccaagcccgt caaggagccg gtgccggcct tgcctccggt gccgccgacg ccggcgttgc 3739981 cgccgttgcc gccgttgccg ccggtaccgg ggtttcctac ggtgccgccg cccggcagca 3740041 tggccccgct gtttaggccg ttttcgccgg ccccgccgtc accggctttg ccgccatcgc 3740101 cgccgttgcc gccgtcgccg ccggtgcccg tggcgccgtc ggtgtacccg gccgcctgcg 3740161 ccttgccgcc cgcgccgcca ttgccgccgg cgccgccgtc gccaccgtta ccaccgctac 3740221 cgccgttctc gccgtttgcg ccgttagcat tggggccggc gccgtcggcg cctctctcgc 3740281 cggcgccgcc gatgccaccc tggccgccgt taccaccctt accaccgttg ccgccgtggc 3740341 cggccagtgt tccgccggcg ccgcccgccc cgccgttgcc gccagcccca ccgtcggtgc 3740401 ccgaggtgcc ggaatcaccg ctggtagggc ccggcgtacc ggcttggccg gccgcgccgt 3740461 tgccgccggc cccgccattg ccgccattgc cgacattccc gccgctgccg cccttgccgc 3740521 cgtcaccgcc gttgccgccc gcgacggtgg ggctggcgcc gttgccgccg ttgccgccgt 3740581 caccgccgct ggtgggtgcg gtgccatcgg cgccggtcgc acccttcatg gctggaatgg 3740641 cgcccttgcc gccggcccca ccctggccgg caacgcccac attgccgccg ttgccgccgg 3740701 caccgccgtt gccggcctta gcgaacgtgg cgaaggcgtc accacccttg ccgccgatgc 3740761 cgccgttgcc gccgttgccg ccctgtccgc cattcgcgcc attggcggac gcggagaagt 3740821 cttggccgtt ggctccggcg cccccgttgc cgcccttgcc gccgtccccg cccgtgccgg 3740881 ccgccgatcc gccgttgccg ccgatgccgc cgttgccgcc gttgccgccg ttgagggcaa 3740941 ggccggtgcc ggcgacgcca tttccgccgg caccacccgc accgccgtta ccgaccgacc 3741001 cgccatggcc gccgttacca ccggcgccgc cgttttctcc cgcgacggtg ggggtggcgc 3741061 cggcacctcc gttgccaccg ttgccgccgc tggtgggcgc ggtgccgttc gccccggccg 3741121 aaccgttcag ggccgggttc gcgctaacac cgccggcccc acccttgccg ccaacgccca 3741181 cttcaccgcc gttgccgccg tcaccgccgg caccctggtt gacggccaag gtcacatcac 3741241 cggcggcacc ggctccgcca tcaccggcct tgccgccgtc accgcccttg ccgccgttgc 3741301 cgcccatacc gccatcggca ccgggcgaac ccaaggtggc ggcgtcgaat ccgtttccgc 3741361 cggcgccgcc gctaccgccg gcaccgccct tgccgccgac gccgccgtcg ccgtgctggg 3741421 cgccgccatt tccgccatta ccgccgtggc ccccggcgcc gccattggtg ccgttaccgc 3741481 ccgtcggttg taaggcggta ccggtagcgc cggtggaacc cgcatgaccg gcaccgccgg 3741541 cgccgccggt gccgccgttg ccgaccaacc cgccatgacc gccattaccg ccggccccgc 3741601 cggcttgtag gggtgagttg gcggtggcgc cgatgccgcc atcgccgccg ttgccgccgc 3741661 tggtgggggt ggcgccggcg gcaccgtgcg cacccgccag caggccgccg gccccaccgg 3741721 ccccgcccac gccggggttg ccgccgtgac cgccgttacc gccggcaccg ttgttgacgg 3741781 cgaaactcgg atcgccagcg ccgcccttac caccgtcgcc gccgacgccg ccggccccgc 3741841 cggccccgcc gttgccaacc aataacccgc cgcgcccgcc gttgccgccg gttccgccgt 3741901 tgccgccgtc gctgccgtcg ccgccgttga ggccggcggc acccggcagg cccgcggccc 3741961 cggccccccc ggcgccgccg ttcccgaaca gcccggcgtc gccaccgttg ccgcctatac 3742021 ctccgatgcc gccgatcccg ccggcgccgc cgttgccgta gacaaatccg ccggacccgc 3742081 cgacgccacc attggtgccg gcgccgccgg acccgccggc cccgaacaac caggcgttgc 3742141 cgccggcacc accgttagcg ccggtcccgc cggccccgcc ggccccgccg ttgccgttca 3742201 accacccgcc ggatccgccg acaccgccgg cagcgccggc cccgccggac ccgccggacc 3742261 cgccgttgcc gaacaacccg gccgcgccgc cgggcccacc gacttgaccg gccgcccccg 3742321 aaccgccgtt accgccatta ccccacaaca accccccggc cccaccgggc tgcccggtcc 3742381 ccggcgcccc gtgaacgcca tcaccgatca gcgggcgccc caaccacagc tgtgtgggcg 3742441 cgttgatcgc acccaacact tgctgctcca gcgcctgcag cggtgatgca ttcgccgcct 3742501 cggcagtcgc atacgcgctg ccagccgcgg tcagcgagcg cacaaactgc tcatgaaacg 3742561 tcgccacccg ggcgctcaac gcctggtact cctgcgcgtg ggtaccaaac aacgccgcga 3742621 tcgccgccga cacctcatca ccggcggccg ccaacacctg cgtcgtcggg cccgctgccg 3742681 ccgcattcgc cgcgctgatg gcctgcccaa tcccggtcaa gtccgccgcg gccgccgcca 3742741 ccagctccgg cgccaccatc agcgacatga ccattcctcc aacaccaatg gcgcgtacag 3742801 ccggctcgcg cgagccttga ccgccggcgg caacccgagc gatcccatgg ccctaggcgg 3742861 ttctcgggcg aacgccacgt ttagcggatc gattcacccg gtcgttgcgt tgcggcgcag 3742921 caatagacat ctcgaagcac tccggctgcc aatctcgtcg cgtttattct gctcgtgacc 3742981 agcgcaggaa agggggggat tacgaaagtc ttcgggatct cagtgcacag tgcacacatg 3743041 tttaaccaat caccgtggca taacgcacac caaaggccga gagcgcggaa aacgcagaac 3743101 atcaattgga tcggttgcta gctttgccgc accgtggtca gccgcgccag gatcggtcgg 3743161 caatggcacc accggagcag gcgaaaggta cccggttcta gcccgtcccc aacgggtcaa 3743221 tggtggatgc gatatagacc atggccgccg cgaccgtcac ggtcgtcacg aaatcgatcc 3743281 ccttgctgcg caccaccaac aggccggccc gttcctcgga caacaccaac cgcagcaccg 3743341 ccgccacccc aacgccgata ccgatcagca gcgcaccacg gcgccagaag ttgacccccg 3743401 ccaggatcgg ccactgggcg ccaacagtgc gccgcaaaac ggccctcacg gtcatcgccg 3743461 ctcagccagc tccacgacac ttgtcagcaa ggacgcccgg ggcgaagggc gttcgccaag 3743521 tctgtagatg agctgcggga gatggccgac ggcgagggtt gagaagcgtc aacttcgatc 3743581 gtgatgcctg ggaggacttc ttatttcata cgcgatcggt gatgccgccc tgaagccgag 3743641 gtcgacggca gcgcggagac gttcgagaag acgtcgcggt gaggtcaatc ccggtgtgac 3743701 caacggccgg ttacggcccg gtgcccgcga acagcaggcc cgacagctgc tggccgacgt 3743761 tcataaagcc cgagacgaag gccgatgtga ccaggccaag cgtgcccgtg ttgtacacgc 3743821 ccgagatgcc cgcgccacgg ttgaggatgc cggagagctg ggtgccgaaa ttggcgaagc 3743881 ccgacgccga cccgagcagc ggatccgaga tcgcgttgag cacccccgac atgcccgacc 3743941 cggagttgga gaagccggac ccgccaccac cgccggtgtt gaagaagccc gacgacggcg 3744001 cggtggtgtc gttgccaaag cccggtgctc cgccgaaccc gaaaatcggg aggctgacgg 3744061 ggccgatggt ggtgctggcg tgtaactcca ccgggatccg gtcgataacg accgtcggga 3744121 gatcaaaggg tggggtgccg ccggacaaac cgaggcccag cgggagttgg ggaatcaggg 3744181 tgccgcccgg gatggtgaag cccggaatgg tcagcgacag cggcaggccg atgtggatgg 3744241 gtccggtggg aatggtgaat ccggggaagt gcagtgtcgt cgggttcaag ttgatgggtg 3744301 ccacggtgaa tggttgaagt atggagacct cgcccccggg catgccgtcg ggtccgaccg 3744361 cgaagaatga aaagctgggt ctgaccttga atccggagct gcttccggac gtcatcctga 3744421 tctccgagac ggcagcatcc aaacttaggc cagggatggt gagggtgatg gggtccacgg 3744481 tgatagggcc gacgtcgaag gtgggatcga tgcccaggtg gatcgagggg atggcgatgt 3744541 tcgggatgct gatcggcccg atgtggccga tcgcggcgaa gcccaacggg atggacggga 3744601 tgtggatggg cggaatgatg gtggcggggc cgatgtcgcc ggtgacgtcg gcgcccaccg 3744661 cggggaacag cggaatgggg tacccgaagg agaagccggc caagccctcg taattgcccc 3744721 gccataagat gccgttgcta aagttgcccg tgatgagggc gccggtgttg acattgcccg 3744781 cgttggcgac gccggtgttg gcgttaccgg tgttgaacca gccggtgttg gtgctgcctg 3744841 ggttgaagcc accggtgttg gtgtcaccag cattgaagct gcccgtgttg tacgacccgg 3744901 cgtttgccac accggtgttg aagccgccgg cgttgaccaa cccggtgctg gccaccccgg 3744961 agttgccgat accggtgttg tagctgccgg agttgaacaa cccgaagttg gcagtcccgg 3745021 agttgaagaa gccgatattg cctgtgccgg agttgaacag gccaatgttg ccagtgccgg 3745081 agttcaagcc gccgatgccg gactggttgt cgccggtgag cccgatcccg aggttgttgg 3745141 tgccggtgtt gccaaacccg atgttgccca ggcccatgtt ggcccagccg acgttgccgc 3745201 tgccggcgtt gcccagcccg atattgccca tgccggccag gcccgccgcc agacccgaat 3745261 tcccgaaccc gaagttggca tcgccgatat tgccgaaccc gacgttgccg ccgccgatgt 3745321 tgccgaagcc caggttcacg tcgccaatgt tgccgaatcc caggttcacg tcgccaatgt 3745381 tggccgcacc caggttgagg ttgccgatgt tgccgaggcc gacgttgccg ttgccgacgt 3745441 tagccaaccc gatgttgacg atggtgatgg ggttttgccc cacgttggag gccaacaagc 3745501 ccgacaggtg atcaccgacg ttgcccaggc ccgacaccaa cgccggcgtc ccaagcggca 3745561 gcgtgctggt gttgtagatc cccgacagcc ccgaaccgag gttgagcacg ccggagtgca 3745621 gtgtgccgac gttggcaata cccgaacccg cgcctgccaa agcggtgtgc gcctggttcc 3745681 accaccccga catgttcgcg ccgaagttgc cgaaacccga gcccccgccc gccccggtgt 3745741 tgaagaagcc cgacgacgga acggtggtgg tgttcccaat gcccggggtg ggcgggatgt 3745801 tgatcagcgg gatgtcgccg gcgatgacgt agagttcgcc gtcggcgttc gccgggatct 3745861 ccgggaacgt gatcgccgga atggtggcgc cgggggtgcc gacgaacaca tccaggttca 3745921 gcagcgagtt cgccgggaac gtcagaccac cggggaacag ggtgatcgcg tcgatgctgc 3745981 ccggcacctg gaaacccaac gggatctggt gaatattgag cgccggggtg ttgaacgcct 3746041 gagatgccgc attgaagacg gcatgcaccg ggccggtcgt gctgagcgtc gggattcccg 3746101 agatgatatt gccgccgacg aacaggtcac cggcgttgta gattctgccg accgagtacc 3746161 acgttgggcc gatcgcaccg gatgacgtcc agacgataaa cggctctatt tcgctggtcg 3746221 ccccgaccga cgcggccata tcgaggaccg ctcgtgcggc ggtcagggcg ggaatggtga 3746281 ccgaggggac cgcgatgggg ccgaagccga cgcttccggt gacgttcgga ttgagggcgg 3746341 gaatatcgat ttgcgggatg gtgaaggcgc ccatcgccgc gttgccggtc aggtgcgcgt 3746401 tgatcgccag aaccgggatg ggcgggacga ccaccgggcc gaaggccccg gtgaaatgcg 3746461 cgtccaggat ggtgatccgg ggaacgtcga ggctgtagga atagctgaat aggccttcgt 3746521 agttgccccg ccacaggatg ccgttgctga agttgcccga catgagggcg ccggtgtcga 3746581 cattgcccga gttcgcgatg ccggtgttgg cgttaccggt gttgaaccag ccggtgttga 3746641 tgctgcccgg gttgaagcca ccggtgttgg tgtcaccgac attgaagctg cccgtgttgt 3746701 acgacccggc gttggccaga cccgtagtga aaccaccggc attgaaaagc ccagtactgc 3746761 ccgttccgct attaccgatg ccggtgttga agctgcccga gttgaacaac ccccagtttc 3746821 cggtcccgga gttgaagaac ccgatgttgc cggtgccgga gttgaacagg ccaatgttgc 3746881 cggcaccgga gttcaagccg ccgatgccgg tctggttgtc gccggtcagc ccaatcccga 3746941 ggttgttggt gccggtgttg gcgaacccga tgttgcccac acccatgttg gccaggccaa 3747001 cgttggtgct gcccgcattg cccaacccga tattgccgat gccgagcgcc gcccccaggc 3747061 ccgaattgcc aaacccgacg ttgccgtggc cgatattgcc gaagccgacg ttggcgttcc 3747121 cgatattgcc caaccctagg ttgaggtcgc cgaggttggc cgcgcccagg ttgaagtccc 3747181 caacgttgcc caacccgagg ttgtagttgc cgacatcggc caacccgagg ttgatgatgg 3747241 ggctttgggt caacgccgtc ccggccgcca acacccccga cagctgctgg cccacgttgc 3747301 cggcacccga caccagcgcc ggcgtcccca aacccacgat agcggtgttg tacagccccg 3747361 atatccccga gccgacgttc agcacacccg agttcagcgt gccaacgttg agaacgcccg 3747421 agcccgcgcc cgccaacgcg gcatgcgcct ggttccacca gcctgagctg ccggccccga 3747481 agttgccgaa acccgacacc ccgcccgcgc cggagttgaa gaaacccgac gacggggtgg 3747541 cggtcgcgtt cccgaagccg ggcgtcggcg gaacgatgat gatcggaacg ctgctgtccg 3747601 gcacgctgat gttgagggcc aggctcagtg gcagcggatc gatcgtgaaa ccacccggga 3747661 atatcgtgat cggatccagc acgccggacg catcgatggt caacgggatc gcattttgcg 3747721 ggatgttgag gccaccgggg aacagcgtga aggccggaag accgcccgac acatcgatct 3747781 tgagcgggat aggcgatgtc gtgatcgttg ggatggtgac ggttgggagg gttagtgcga 3747841 ggctaccggt ggttgcgctg ctgggaccgg tatggatcag gatgccctga gtgggtgcgg 3747901 tgacaaagcc accactcatt ccggttgagt tggacgcccc aacgatccag ttgtcgccga 3747961 gcgcattcac gaacagcaac ggaagtctga agggcggcgg ggcgggggcc gggggcgtgt 3748021 cgagcggaat cgtgtaggtc tgaccgccga tcgtcatgct cggcaggaag acgatgggcg 3748081 ggatgaccat cgtttcgtgg atgtccagca ccactgcggg gacatcgatg ggctcgatcc 3748141 tgaagggccc gatgttgacg agttcgtgga tgtcgaacag cgacatgccg ggaatatcga 3748201 tctgatcgat gtggacggga ccgaggttga gggtttcgtt gatgtccacc agggtgctgc 3748261 cggtgatttc gatgctgtag gagaagccga ccagcccgtg gtgatcaccg gtccacagcg 3748321 cgccgttgtt gaagctgccg gagttgaacg cgccggtgtt gacattgccc gtgttgaagc 3748381 cgccggtgtt ggtgtggccg gcgttgaacc agccggtgtt gacattgcca gggttgaagc 3748441 cgccggtgtt ggtgttgccc gcgttgaggc tgccggtgtt gtaactaccg gcattggcca 3748501 gacccgtgtt gaaactcccg gcattgaaaa gcccggtact gcccgttccg ctgttaccga 3748561 tgccggtgtt gtagctgccc gagttgaaca acccccagtt tccggtcccg gtgttgaaga 3748621 acccgatgtt gccggtgccg gagttgaaca accccaggtt gccggcaccg gagttcaggc 3748681 cgccgaaccc ggtctggttg tcgccggtca gcccgatccc gaggttgttg gtgccggtat 3748741 tgccgaaccc gatgttgccc aggcccatgt tgccgaagcc gacgttgttg ctgccggcgt 3748801 tgcccaaccc gatgttgccg atccccggca gcgcccccag gcccgagttg ccgaacccga 3748861 cattgccgtg gccgaggttg ccgaacccga cgttgccgtc cccgaggttg cccaacccca 3748921 ggttctgccc gccgaggttg ccaccgccga ggttgaggtt gccgaggttg cccgcgccca 3748981 ggttgacgtc gccgacgttg gcgaagccga ggttgtagct gccgacgttg cccaggttga 3749041 cgatgttcag cggattcagg tgccgcagct cggcgatcgc cgcgtcgatg atgctcggct 3749101 gcccggagcc gcccgacccg ccgctggtca gcatcgccag caggccatcg atggacaccc 3749161 ccgacacgtg gttgcccagg ttgccgaaac ccgagatcac cgccggcgcg gagcccagcg 3749221 tgctcacgtt gaacatgccc gagatgtcga cgccggagtt cagcacaccg gatgccaggc 3749281 tgccggcatt gcccaggccg gagagcgtcc ccaccatcgg actcgaggcc tggttcagca 3749341 agccggacac ccccgcgccg aagttggcga tgcccgagcc gccaccgccg ccggtgttga 3749401 agaagcccga cgacggcagc tcggtcgagt tgccaaagcc cggcagcgcc ggaatgtcga 3749461 tgatcgagat gttgatgggt ccggcgctgc tgagaacgtc gaagttcagc ggaatcgggt 3749521 cgatcctggt gccggtgatg gtgaccgccg gaatgtcgac ggacacatcg atcggcacga 3749581 cctccgacat cgaaattccg ttgatagtgg aggccgggat gtcgatcggc ggaatgtcga 3749641 tgggtatgga ttggctgaac gagattgccg gcaattcgat ggcgtcgatg gtctgctgca 3749701 gcggcagggc caatccgccc agcgttgccg aagtaagggg tatggcgacc tgtatctgaa 3749761 ccgagattgt gggatcggga aattcatttg ggaacgcgtc gtggaggaac tgaagcttga 3749821 ggttaacgtt gaacggattg agctggacgt ttgagacggt gatcgggccg aacctgaatt 3749881 gtccggtaat gcccagcgca gaaagcaggg tggtggccgg ggcggtgaag ccggcgtcgg 3749941 cggcaccgtc gaagtcgatg tggattgccg gaatggggat gtccggcacg gcgaagccgt 3750001 agttcgcttg tcccgtgagg cccaggtgga tggggggaag gatcgtggtg tccgggatga 3750061 taatggggcc gatgccgccg gttgaagtcc agtggatcgg gaattcggga atcgtgatgc 3750121 cgacgttcag gccgaacagg ccctcgaagt tgcctcgcca caagatgccg ttgctgaagt 3750181 tgcccgacat gagggcgccg gtgtcgacat tgcccgaatt ggcgacgccg gtgttggcgt 3750241 tgccggtgtt gaaccagccg gtgttgatgc tgcccgggtt gaaaccaccg gtgttggtgt 3750301 cacccacatt gaagctgccc gtgttgtacg acccagggtt ggccacaccg gtattgaaat 3750361 taccggcatt gaaaagccca gtactgcccg ttccgccatt gccgatgccg gtgttgaagc 3750421 tgcccgagtt gaacaacccg aagttcccgg tcccggagtt gaacaacccg acgttgccgg 3750481 tgccggagtt gaacaacccg atgttgccgg caccggagtt caagccgccg atcccagtct 3750541 ggttgtcccc ggtcagccca atcccgaggt tgttggtgcc ggtgttaccg aacccgatgt 3750601 tgcccacacc catgttgccg aagccgacgt tgccgctgcc ggcattgccc aacccgatgt 3750661 tgcccacccc ggccaggccc gccgccagac ccgcattgcc caacccgaag ttggcatcgc 3750721 cgatattgcc gaacccgacg ttgccgccgc cgacattgcc caaacccacg ttcaagtcgc 3750781 cgatattggc cgcacccagg ttgaagtccc cgacgttgcc gaaaccgacg tttacgctgc 3750841 ccacatcggc caacccgaga ttgatgatga ggctctggtt gagtgccgtc cccgccgagg 3750901 acaaccccga cagctgctca ccaacattgc cgatgcccga gaccaccgcc ggggtccccg 3750961 gcggcaaccc gccggtgttg tacagccccg acacacccga gccgaagttc agcacacccg 3751021 atcccagcga accgaaattg gcgaaacccg aacccgcccc agccacctcg gtctgcgcct 3751081 ggttccacca acccgagctg cccgcaccga aattcccgaa gcccgacacc ccaccgtcgc 3751141 cggagttgaa gaaacccgac gacggagcgg tggtcgtgtt gccaaagccc ggggtcgccg 3751201 ggatattaac gccgttgatc aggatagggc cgacagtgac gctggcgccg aggttcagcg 3751261 ggatgcggtc gatcgtgatc ggcggggtgc tgaagccgtc aatctggccg tctatgtcga 3751321 tcgtcagcgg cagcggcgca gcgggaatgg tgaagcccgg gatcgtgaat cccagcgtgc 3751381 cgatcgacgc gctggccagc agcgccagtg gattgttggg aatactgatg ccattcggga 3751441 agatcgttac tgccggggta ctccagttga cggtcaccgg gaatgactgg ttaattctgg 3751501 tgtcgatatt aaggttacct aattggaggg tgacgttgcc ggcaagatct ttgatttcga 3751561 ttcctgaaat gttgacgacc cccaagccaa agaaggggcc gacggggaaa gtcgtgttga 3751621 agttctgagc cgggaacagg gtgatgggcg agatggtgat ggggccgacg ctgataggta 3751681 tggccgtacc gccaccaaaa gcggggatca cgatgtccgg aacgaccagc gggccgaggc 3751741 tgaaggtttg gtgaatgttg agcgggatgg tgggcaaaat ctggatcggc aacacggtga 3751801 tggggccgac gccgccgttg agctcgagac caatggggat cgccggaatg gtcgatccac 3751861 cggagagccc ccacaggccc tcgtagtcac cccgccacag cacaccgttg ctgaagttgc 3751921 ccgagatgaa cgcgccggtg ttgacattgc ccgagttggc gatgccggtg ttggtgttgc 3751981 cggtgttcag ccagccggtg ttgacgttgc ccgggttgaa gccacccgta ttggtgttgc 3752041 ccgcgttgaa gctgcccgtg ttatagctac ccacgttggc cacacccgtg ttgaacccac 3752101 caacgttgaa caacccggta ctggccgtcc ccgcattacc gacaccggtg ttgtagctgc 3752161 ccgagttgaa taccccgaag ttgccggtcc ccgaattgaa gaacccaatg ttgccggtgc 3752221 cggagttgaa caaccccaga ttaccggttc ctgaattcag gcccccaatg ccagtcaggt 3752281 tgtccccggt caacccgatc ccgatgttgt tgctacccgt gttggcaaaa ccgatgttgc 3752341 ccacacccag gtttgcgagg ccgtagttgc tgctgcccgc attgcccaac ccaatattgc 3752401 ccatgcccgg cggcaaccca agacccgagt tgccgaaccc gaagttggcg ttgccgatat 3752461 tgccgaaacc gaaattcccg ctaccggcgt tggcagcacc caaattctgc gcaccgacat 3752521 tggctgcgcc caggttgaat atcccgacat tgcccaaccc gacgttgtaa ttaccgacat 3752581 tgcccaagcc cgcgttaagc ctcaacatct tcgcgggtcc ggcaaataga gcattgagga 3752641 acgcgccgac accacccccc aacgcctgcg ccggtgggct gaacgccggc aacgccgcgg 3752701 cagcagccga cgcgccggaa tggtagccgg ccatcgccgc cacatcggcg gcccacatca 3752761 actcgtactc ggcctcgacg gccgcgatcg ctgcggcgtt ctgccccagc aggttcgaca 3752821 tcgccaacgc ccgcatcgcc atccggttga ccgccaccgc cgccggatcc accgtcgccg 3752881 ccaacgccgc ctcaaacgcc gccaccgcgg cccgcgcctg cccggccacc gccacggcct 3752941 gcgccgccac cgaacccaac caccccgcat acggggccgc cgcggccgcc atcgccgccg 3753001 ccgccgcacc ctgccacacc cccgccgtca ggcccgacgt cacctgccca aacgacaccg 3753061 ccgccgaccc caactcctca gccagcccat cccacgccgc ggccgccgcc agcaacgggc 3753121 tcgaccccgc acccgaatac atcagcacgg agttgatttc cggtggcaga actggaaaat 3753181 tcaaccgccc ctacctctgc cgctcacgat gcgttcacac ctcatcgtct caccacgacg 3753241 tggtgagcgc gggcacttcg acaaactaat ctgcaatatc ccgatcgcgt acaaacgtgc 3753301 cgacatttgc ggcgcattaa tgcccatatc ggcttgtatc tcttgtagtg ccgctttgac 3753361 ggggtggtgg tcaggtacgg tggcctcggg agaggctgga gggctcgacg ttttcggctg 3753421 agtgtctggg cccgtgaaag agatcgtctg ctccagcttt gtctcctgaa ctgacccggt 3753481 ttagggaatt ggtggccagg ttgcggaagt gcgcagcatc gacgtgtacc tgggtgaggc 3753541 atcgaatcat cgacaagcac cggagccgcg cgtgaactcc cgccgcgttg tggtcgggga 3753601 tgatgtggga gaccggccgg cagtgctgtg tacgaaggtt ctcccaccgc aacgagttca 3753661 cgcacgacgg tcggctgggt gggccctgga atacgtgaac tcttcatcaa cacaacatga 3753721 ttgacgatga aggggagaac ctccatgcac aacaacgcta acccgtgact gccgagaatc 3753781 caggacggag caggcggacg ctggtcggaa tcgacgcggc gatcacggcc tgtcaccaca 3753841 tcgcgatccg cgatgatgtc ggtgcgaggt cgattcgatt cagtgtcgaa cccacgctgg 3753901 ccggactgcg caccctcacc gacaagctca gcggttacga cgatatcgac gccaccgtgg 3753961 aaccgacctc gatgacgtgg ctgccgctca cgatcgctgt cgagaatgcc ggtgacacca 3754021 tgcacatggc cggcgcgcgg cattgcgccc ggctgcgggg tgcgatcgtg ggcaagagca 3754081 agtccgacgt catcgacgcc gaggttctca cccgcgccag cgaggtgttc gacctgacgc 3754141 cgctgacact gccgacgccc gcgcagttgg cgttacgtcg atcggtgatc cgacgtgccg 3754201 gcgcagtgat tgacgcgaac cggtcctggc gtcggttgat gtcgttggcg cggtaggcgt 3754261 tccccgatgt gtggaccgcg ttcgccgggt cgttaccgac cgcgacagcg gtgctggggc 3754321 gttggcccga catccgcttg ctggccggcg caccgacccg ccacgttgac cgccgtcatc 3754381 gccgcgcaca cccgcggtgt cgccgacacc ccggcccggc cgaggccatc aagaccgccg 3754441 caaccggctg ggccgcgttc tgggacgggc acctcgacct ggacgcactg gccgtcgatg 3754501 tcaccgagca tctcagcgac ctcaccgacg accgatgcgc gcgttggtga tgccggtgac 3754561 caagaaggtg ttgatcttgg gtgactagtc aatggtggtg gccagggtga gcagttcggg 3754621 gatctgcgag tcgatgcgcc aggcaggaag cggtgtaggt gatggcgcgc caggtggggg 3754681 tccccgccgg tgcgcacggt cgacagcagg gtgcgcagct cctctttggc gatccaggcc 3754741 gagagaatct gcgcgcgggg gtcgacggcg ttgatccgat tccgcatttt ggcgaagctt 3754801 ttgtccgaca agcgttcccg ggcggtcagc aagcgacgtc ggttggccca ctgcgggtcg 3754861 atcttgcggc cgcgccggtc gtggaacgcc caggtcaccc ggcggcgcac cgcggtcagc 3754921 gcgtcgttgg ccagcgtggt cacatggaag tggtcgacga cgagcttggc gttgggcagc 3754981 agcccgggcg tgcggatcgc cgaggcgtag gcagcggcgg ggtcgatggc caccgtactg 3755041 gatgctctcc cggaactgcg gtgtgcgcgc ttgcagccat gccagcaccg ccgcgccgcc 3755101 gcggccttca tgctgcccca taaacccctg atcaccggcc aggtcgacga acccggtatc 3755161 ccacgggtcg acccgtaccc accggccagt cttggcgcag cgctcccatc tgggttttcc 3755221 tcgccgtgtc tggtcaacgc ccagcaccgg ggtgggcaac ggctcggtca atacccgtct 3755281 cggcgtaggc aacaaacgcc cgatgtgccg tcggccacga cacggcgtca gcctgggcga 3755341 cctcggccca ccgagcgggc cgcatccccg atcgccttgg ccatctgccg acgcagccgc 3755401 agcgtgctgc ggacgcgggc aggtacctgg gtgatggcct cggtgaacgg ccccagcttg 3755461 cagtagtctt ctcggcatcg ccagcgaatt ttgttccagc gcaccatgat gcggtcttcg 3755521 ccataaggta gatctttcgg tgaggtaacc gcgtattcct tcactgatat cgagaccacc 3755581 cccgcacgac gggcacgccg ccgccgtcgg ctcatcggtg atcacatcga ccacccgggt 3755641 cccgtcactg cggcgctcga cacgctcaac ccgtgctcct ggcagcccga acaacactgt 3755701 cgtagcgtca gacacagccc ttggctcctt cctcggcctg aatgcttcgc aacacttaga 3755761 cttcagaagg ccaagggccc tcagccgcta aacacgccga ccaagatcaa cgagctacct 3755821 gcccggtcaa ggttgaagag cccccatatc agcaagggcc cggtgtcggc gcaaaattta 3755881 gcgtcgttgc gcccacacca gagttaccgc cgcacacacg gcgtgaccac cggcgtgcat 3755941 ttaagaatcc gttagggccc gacgccggtg aagagcaagc ccgacagttg ctggccgacg 3756001 ttgccgaaac ccgagacgac ggccgcggtg acaacaccca gcgcgccggt gttgtacacg 3756061 cccgagatgc cggcgccgcg gttgaggatg ccggagagct gggtgccgaa gttggcgaag 3756121 cccgacgccg acccgagcag cggatccgac atcgcgttga gcaatcccga catgcccgcg 3756181 ccggtgttgc taaagcccga accgccgcca gctccggtgt tgaagaagcc cgacgacggc 3756241 agcgtggtcg agttcccgaa acccggcgcc ccgccgaacc cggcgatcgg gacgttgatc 3756301 gggccgatag tggtgtcggc gtgcaggtcc agcaagatcc ggtcgagaac gatggccggg 3756361 atgtcgacgg gcgggatgcc attggacaac gcgaggccca gcgggagggt ggggatcagg 3756421 gtgccgcccg ggatggtgaa ccccgggatg gtcagcgaca ccggcaggcc gatgtcgatc 3756481 gggtcgaggg ggatggtgaa tcccgggaag gtcaccgtgc cggaggggat ggagatgggc 3756541 cccacaaagt atgccccttg cgtggacgtt gcacccccgc cgctagaggg cgcgatccgg 3756601 attccgggga agaagctggg cttgacccaa atctctgagg ttggtccgga cgtgctggtg 3756661 acggctcctt gggagtaact gacgagcacg ggcggggtcc tgacggtaat ggggttgacg 3756721 gtgatggagc cgacatggac ggcggggtcg aggcccaagt gaatggatgg aacagagatg 3756781 tccgggatgg cgatcgggcc gatgccaccg accgcggcga agccgaccgg aatgggcggg 3756841 atgtggatgg gcggcagcac ggtaatcggg ccgatcccgc cgctgacgtc ggcgcccacc 3756901 gcggggaaca gcgggagggt gtagcccacg gcgaagccgg ccaggccctg gtagtcgccg 3756961 cgccacagga tgccgttgct gaagttgccg gtgacgaagg cgccggtgtt gacattgccc 3757021 gcgttggcca ccccggtgtt ggcgttgccg gcgttgagcc agccggtgtt gatgctgccc 3757081 gggttgaagc ccccggtgtt ggtgtcaccg acattgaagc tgcccgtgtt gtagctgccg 3757141 gcgttggcca caccggtgtt gaaactgccg gcattgaaga gcccagtgct gcccgttccg 3757201 ctattgccga cgccggtgtt gaagctgccg gagttgaaca acccgaagtt gccggtcccg 3757261 gtgttgaaga acccgacgtt gccggcgccg gagttgaaca accccaggtt gccggcaccg 3757321 gaatttaggc cgccgatgcc ggtctggtag tcgccggtca gcccgatccc aatgttgccg 3757381 gtgccggtgt tggccaaccc gatattgccc acgcccacgt tggccaaccc ccagttgttg 3757441 ccgccggcat tgcccaaccc cacattgccc aggcccggca cgcccgcggt cagacccgag 3757501 ttgccgactc cgacattgcc gtggccaata ttgccgaacc ccaggttgcc ggcgccgata 3757561 tttccgaagc ccaggttgtg cgcgccgagg ttggccgcgc ccaggttgac ctccccgaca 3757621 ttgccgaaac cggcgttgtg gctgccgacg ttggccaacc cgatattcag aacggtcacc 3757681 gggttcaccg cggacccgcc ggaaagcagc cccgacagtt ggtggccgac gttgcccagg 3757741 cctgagacca gcgccggggt ccccaccccc agcgtgctgg tgttgtagat ccccgagaca 3757801 cccgagccca ggttgagcac accggaatgc agcgtgccaa cgttggcaaa acccgagccc 3757861 gcccccgcca gcgcggtgtg cgcctggttc caccagcccg aggtgcccgc gccgaagttg 3757921 ccgaagcccg atcccccgcc cgcgccggcg ttgaagaagc ccgacgacgg ggtgatggtg 3757981 ctgttcccaa tgcccggggt gggcgggatg ttgatcagcg ggatgctgct ggcgaggaca 3758041 tacaccgagc cgtcggcgct cgccgcgatc tcgggccagg tgatggccgg gatgtccacg 3758101 ccgccggcgc cggcggtcac gtccaggttc agcagcgagg tcgccgggaa cgtcaaacca 3758161 ccggggaaga gggtgatcgc gttgacgctg ccgggcacct ggaagcccaa cgtgatcggg 3758221 ccagtttcga gctgcggagt ggtaaacgcc ccgctggacg cggaaatggt gagatggctt 3758281 ccgtcgctcg tgccggcgcc gaaaacgagt gggccggtgg cgtagggcga accgtcggcc 3758341 gatccgaatg aatagaaggt tataccaagg ccattagtgc cttgagtcca catttcgaag 3758401 ggatctatcc tcatctccgc cccaaccgag gcgttgatta tttgctccac aatgacactc 3758461 accggcggaa tgcgcacgga ccccacaacg atgcggaagg cggcgcttcc ggtgatgttt 3758521 ggggtgagtg cggggatgtc gatctgcgga atggtgaatg cgcccatcgc gacgtttccg 3758581 gtcaggtgcg cattaacggc cggcaccggg atgggcggga ggaccacggg tccgaagccg 3758641 ccgtcgaggt gggcgtccac gatggtgatc cggggcacgt cgaggctgta gaacaggctg 3758701 aacaggccct cgtgatcacc ccgccacaac aggccgttgc tgaagttgcc cgacatgaac 3758761 gcgccggtgc cgacgttgcc cgagttggcg atgccggtat tggtgtggcc ggtgttgaac 3758821 cagccggtgt tgatggtgcc cgggttgaac cccccggtgt tggtgtcgcc ggcattgaag 3758881 ctgccggtgt tgtagctgcc ggcgttggcc acgccggtgt tgaagctgcc ggcattgaag 3758941 agcccagtgc tgcccgttcc gctattgccg atgccggtgt tgaagctgcc cgagttaccg 3759001 atgccgaagt tgccggtccc ggagttgaag aacccgatgt tgccggtgcc ggagttgaac 3759061 aagccgatat tgccggcacc ggagttcagg cccccgatgc cggtgaggtt gtcccccacc 3759121 agcccgatcc cgatgttgcc ggtgccggtg ttggccaacc cgatgttgcg cacgccctgg 3759181 ttggcgaaac catagttggc gctgccggca ttgccgaacc ccgtgttgcc caggccggcc 3759241 gcgccggcgg tcagacccga attgccgaaa ccgatattgc cgtggccgac gttggcgaag 3759301 ccgaggttgc cggtgccgac gttgcccagc cccaggtttt gcgcaccgag gttggccgcg 3759361 cccaggttaa cgtccccgac gttgccgaac ccgacgttga agttgcccac atccgccaac 3759421 ccgatgttga ggatggggat ctggttcaac gcggtcccgg ccgcagacac gcccgacagc 3759481 tgatggccga cgttgccgag gcccgacagc accgccggcg tcccgagcgg caacacgctg 3759541 gtgttgtaga tccccgagac acccgagccg acgttgagca cacccgagcc cagcgtgccg 3759601 acattcaaca cccccgatcc cgaccccgcc agcgcgctcg ccgcctggtt ccaccagccc 3759661 gacaggttcg acccgacgtt tccgaacccc gacaccccac cggcgccgga gttgaagaac 3759721 cccgacgacg gggtcgccgt ggtgttgccg aaccctggcg tcggcgggac atcgatgatc 3759781 gggatgctgc tgtcgggcac ggtgagattc agcgccaggt gcagcggcag cgggtcgatc 3759841 gtgtacccac ccgggaaaat cgtgatcgga tccagcgcgc cggacgcatc gatcgttaac 3759901 gggatggcgt tcgtggggat cgtcaggcca cccgcgaaca aggtgaaggc cggcagacca 3759961 ccgctgatgt tcacgtccaa caggaatctc gtggtagcga tttgcggaat ctcgaaaccc 3760021 ggaatagata tcttgagctc gccggtcgtt ccggggccag ggccggtgtg aatggtgatg 3760081 ccctgggtgg gcgccgggaa ggggtctccg aaattgggaa tcgccgcggt cgacccgagg 3760141 atccagtcct cgccttcgaa gcgcatgctg atgagcggaa gcgtcatggt tgacccgggt 3760201 gaggcgggga tgtccagcgg aatggttctc gtctgtgcgg gaattgtggt ggcgggcacc 3760261 aggacgatgg gatccatgtg gatcgattcg tggatctcta gcggtatcgc gggaacatcg 3760321 acctgcggga tggtgaaggg tccgatctcg acgatttcgt ggacgtcgaa cagcgacatg 3760381 ccggggatgt cgatctgctc gatgtggatg gggcccaggt tgagggtttc gttgaggtcc 3760441 agcagggtgc tgccggcgat gtcgatgctg aaggagaagc cgaccagccc gtggtagtca 3760501 ccggtccaca gcgccccgtt gttgaagctg ccggagttga acgcgccggt gttgacgttg 3760561 ccggtgttga acaggccggt gttggtgtgg ccggtgttga accagccggt gttgacggtg 3760621 cccgggttga cgccgccggt gttgaagctg cccacgttga ggctgccggt gttgtaggag 3760681 ccggcattgg ccagaccggt gttgaagttc cccgcgttga acaacccggt gctggccgtg 3760741 cccgcattcc ccacaccggt gctgtaactg cccgagttga acagcccgaa gttcccggtc 3760801 ccggtgttga agaacccgat gttgccggtg cccgagttga acaacccgag gttcccggtg 3760861 cccgagttca ggccgccgat cccggtccga tagtccccgg tcagcccgat cccgatgttg 3760921 ccggtgccgg tgttggccaa cccgatattg cccacaccca cgttggccaa cccccagctg 3760981 ccgctgccgg cgttacccaa ccccacattg cccaggcccc ccgcgcccgc ggtcaggccc 3761041 gcgttgccga atccgaaatt gccggcaccg atgttgccga acccgaggtt gccggtcccg 3761101 acgttgccca accccaagtt gctgccgccg aggttgccgg cgccgacgtt gatgttgccg 3761161 acgttgcccg cacccaggtt gaactcaccg acgttagcca aaccgaggtt caccccgccg 3761221 acattgccca aggccaaagc gttgccgatg tcgaggtgct gcagctcggc gatggccgcg 3761281 tcgatgatct gatcgaacac ggactcggca ggtgggaagg tgaggatcgc gatcaggcca 3761341 tcgatggaca cccccgacat atggtcgccg aggttgctga accccgagat caccgccggg 3761401 gtggtggcgt ccagcgtgct cacgttgaac agcccggaga tggcggtgcc ggagttcagc 3761461 acacccgagg ccagggtgcc ggcattgccc agccccgaga gtgtccccac cagtgacccc 3761521 gcgccggcct ggttgagcag gcccgacacg cccgcaccca agttgccgat gcccgatccg 3761581 ccgccggcac cggtgttgaa gaaccccgac gacggcatct gggtcgagtt cccgaagccc 3761641 ggcgccgccg ggatgtcgat gatcgggatg ttgaggggtc cggcactggt gcgaatgtcg 3761701 aagcccagcg ggatcgcgga aatggtggtg cctgtgatcg tgaccgccgg gatgtccacg 3761761 gacgcatcga tcggcaccac ttccgacatt gaaatcccat cgatgaccga ggccggaata 3761821 tcaacaggta tgcggatagg aatcgactca ctcaacgaaa tcgcatccag ggggatgggc 3761881 tcgatctcca ggggcacacc gatcccggcc accacgattg gctcaagatg aattggtccg 3761941 agttggcccg tgataggacc aagaacgggc aggcctaacg tgaaatccat gggcggaata 3762001 tcgatattcg agagcgtgat ggggccgaag ctgatgaagc taccgttatt cttcagggcg 3762061 gacagcaggg tggcttccgg ggcggtgaag ccgacggtga cgacgccatt gatgccgatg 3762121 tggatggcgg ggatggggat gtcgggcacg gtgaagctgt agtccgcgtc gccggtgatc 3762181 tgcaggtgca gcggcggaag gatcgtggtg tccgggatga cgatggggcc gataccgcca 3762241 gtcgtggtga tgcggatcgg gaattgcggg atcgtgatgc cataggacag gccgaacagg 3762301 ccctcgtggt cgccgcgcca cagcatgccg ttgctgtcgg tccccgacat gagggcgccg 3762361 gtgttgcggg tgcccgtatt cataatgccg gtgttgaacc agccggtgtt gatgtcgccc 3762421 gggttgaaac caccggtgtt ggtatcaccg acattgaagc tgcccgtgtt gtacgacccg 3762481 gggttggcga tgccggtgtt gaaattgccg gcattgaaga gcccagtact gccggttccg 3762541 ctattaccga tgccggtgtt gaaactaccg gagttgaaca gtccgaagtt gccggtgccg 3762601 gtgttgaaga acccgacatt gccggtgccg gaattgaaca atccgatatt gccactaccc 3762661 gagttgaggc cgccgatgcc ggtctggtag tcgccgacca gcccgatccc gatgttgccc 3762721 gtgccggtgt tggccaaccc gatgttgccc acacccaggt tggccagccc ccagttgttg 3762781 ctgccggcat tgctcaaccc cacgttgccc aggccggcca ggcccaccgc cggacccgag 3762841 ttggcgaacc cgacgttgcc ggcaccgatg ttgccgaacc cgacgttccc gctgccgaga 3762901 ttgcccaggc ccaggttctg cgcgccgatg ttggccgcac cccagttgag gtcccccaca 3762961 ttgcccaacc cggtgttgaa cgcgcccaca tcggcccacc cgatattgac aatggggctc 3763021 cggttgagca cggtcccatt tgccaagaac cccgacagct gctggccgag gttgccgatg 3763081 cccgagacca ccgccggggt gcccgctccc agggtgctgg tgttgtacca cccggagatc 3763141 cccgagccga cgttcagcac gcccgagctc agcgtgccgg cattggcaac tcccgagccc 3763201 gcccctgcta acacgtcgtg cccctggttc caccagcccg acgtgcccgc gccgacgttg 3763261 gcgaaccccg atccaccgcc gccgccggtg ttgaagaacc ccgacgatgg ggccgtggtg 3763321 gtgttgccga accccggcac cgccggcaca tcgatgatcg ggatcgggat atcgccgatg 3763381 aggatggtgc cgtcgaaggt cgccggcacg gtgtcgaggg tgaacccgtc gggcaacagc 3763441 gtgaacgcgt ccagccccac ggacagtccg gtgaccccgg cggaggcccg cggaaaggtc 3763501 agcccacccg ggaagaaggt gaacccgtcg ttggcgacct ccatacccac cgtcacgggg 3763561 gtttgcgcgg gaatggtgaa accattcggg aaaagcgtcc acggggtggt gtccaagttg 3763621 agggttaggg gaattggtgt cggggtgacc aatatctgac cgctaaccgt gaggccgggc 3763681 acaatgatgt tctctaggaa caagacaccg gcaacaactt ggaacgcatc aatggtgata 3763741 aatgggtcac tgaggcggaa cggctcgaga aaaagcccta tcgaaccggc gagcgggtca 3763801 agagcgcgaa tcggcgagat ggtgtttgcg gccaggtcca cgcttccggt gatgctggcg 3763861 atgggaagtg agggaatgct gatcggtggg acggtgaacg gacccaggcc gacggtggcg 3763921 tcggtgatct cgacgtgcac ggcgggtacc gggacgggcg ccacatgcag cgggcccacc 3763981 ccgccgatcg cgtgcacggt gaccgggaat tgggagatcg tgggcccgac gcggacgccg 3764041 accaggccct cgtagccgcc ccgccacaac aggccgttgc tgtagtcgcc cgtcatgaag 3764101 gcgccggtgc cgaaggtgcc cgcgttggcc aacccggtgt tggcatgccc ggtgttgaac 3764161 cagccggtgt tgatgccgcc cgggttgaag ccaccggtgt tggtgtcgcc ggcgttgaag 3764221 ctgcccgtgt tgtagtcacc agtgttggcg atgccggtgc tgaagctgcc ggcattgaag 3764281 agcccggtgc tggccgttcc gctattaccg atcccggtgt tgaagcggcc ggagttcccg 3764341 atgccgaagt tgccggtccc ggaattgaag aacccgacgt tgccggtgcc ggagttgaac 3764401 aacccgatat tgccgatgcc ggagttcaag cccccgatcc cggtccgatg gtccccgacc 3764461 agcccgatcc cgatgttgcc cgtgccggtg ttggcaaacc caatattgcc cacacccatg 3764521 ttcgccaagc catagttgtt gatgccggca ttgccaaaac caacattgcc cacccccgcc 3764581 gcgccggcgg tcaggcccaa gttggcaaac cccaggttgc catggccgat gttgcccaac 3764641 cccaggttgc cgtccccgac attgcccagg cccaggttgt gcccaccgat gttggccgca 3764701 cccaggttga cgtccccgac atttccgaac ccggtgttga agttgcccac attggccaac 3764761 ccgaggttgc cggcgagcat cgagcgcagc gtggttcccg ccgccgacac ccccgacagc 3764821 tgctggccca ggttgccgat gcccgacacc gccgccggtg tcccgaaagg caacacgctg 3764881 gtgttgtaga accccgagat ccctgagccc aggttgagca cacccgagcc cagggtgccc 3764941 acgttgccaa cacccgaacc ggcccccaac agcgcgctcg gcgcctggtt ccaccagccc 3765001 gagctgcccg cgccgacgtt gccgaaaccc gacaccccac ccgcaccgga gttgaagaat 3765061 cccgacgacg gggccgtggt ggtgttcccc actcccggcg ccgccgggat atgaaggccc 3765121 tggatcgtga tggggccgat cgtgaccccg ccccccacgg tcagggggat gcgatcgatc 3765181 gtgatcggcg gggtgctgaa cccgtcgatc tggccctcga tatcgatcga caacggcaac 3765241 ggctgcgcgg gaacactaaa tcccgggatg gtaaagcccg ggttactgat cgacacactc 3765301 accagcaacc ccaaaggatt atcgggagca ctgatgccat tcgggaacag cgtgatcgga 3765361 ggggtatccc atctgatcgt taaatcaatc tgtggattgg tgggtccggg aatggtggtg 3765421 tcgataacga tagggccgat aaagctgaca agctgaccgt tagaatcaaa ggtttggatt 3765481 tgtggaattg tgattttccc taaactgaag gtgggaaagg gcaattggtt gacaaatgtc 3765541 tgttgggcaa acagggtgat gggtgtgatg gtcagcgggc cgatgttgat gggtatgccg 3765601 ataccgccgc cgaaggcggg gatcacgatg tcgggaacca ccagcgggcc caagttgacg 3765661 gtttggtgaa tgctgagcgg gatggtgggc aggatcggga tgggctggat ggtgatcggg 3765721 ccgatgtcgc cgttgagcac caggccgatg ggaattgcgg ggatcgacga gccggcggag 3765781 acgccgaaca ggccctggta gtcacccacc cacagcacgc cgttgttgaa gttgcccgag 3765841 atgaacgcgc cggtgttgac gttgcccgag ttggcgatgc cggtgttggt gttgccggtg 3765901 ttcagccagc cggtgttcac accgccgggg ttgaagccac cggtgttggt gtcgccggcg 3765961 ttgaaactgc cggtgttgta actgcccacg ttcaccacgc cggtgttgaa attgccggca 3766021 ttgaacaacc ccgtgctggc cgtccccgca ttaccgacac cggtgttgta attacccgag 3766081 ttgaacaccc cgaagttccc ggtccccgaa ttgaagaacc ccacattccc ggtgccggag 3766141 ttgaacaacc cgatattccc ggtgcccgaa ttcaggcccc cgatacccgt caggtggttg 3766201 ccggtgagcc cgacgccgac gttgttggtg ccggtgttgc cgaaaccgat gttgcccaca 3766261 cccaggtttg cgaaaccata gttgctgctg cccgcattgc ccaacccgat attgcccaag 3766321 ccggccaggc ccgcccccag accggagttg ccgaacccga cattcccgtt accgaggttg 3766381 ccgaacccga cattggtgcc accggcattc ccgaaaccca gattctgccc acccacattg 3766441 cccgcgccca ggttgaacac cccgacattg cccaacccga cgttgtaatt gccgacattg 3766501 cccaaacccg cattcaggct cagcgccttc gcagggctgg cgaacagggc ggtaaggaac 3766561 gcgccgacac ctccccccag cgcctgcgcc ggtgggctga acgccggcaa cgccgcggca 3766621 gcagccgacg cgccggaatg gtagccggcc atcgccgcca catcggcggc ccacatcagc 3766681 tcgtactcgg cctcggcggc cgcgatcgcc ggcgtgttct gccccaacag attcgacacc 3766741 gccaacgcca ccagccgcgc ccggttggcc gccaccagcg ccggatccac cgtcgccgcc 3766801 aacgccgcct caaagacccc caccaccacc cgcgcctgcc cggccaccgc ctcggccgcg 3766861 gccgccaccg aacccaacca ccccgcatac ggcgccgccg cggccgccat cgccgccgcc 3766921 gccgcaccct gccacacccc cgccgtcagg cccgacgtca cctgcccaaa cgacaccgcc 3766981 gccgacccca actcctcagc cagcccatcc cacgccgcgg ccgccgccag caacgggctc 3767041 gaccccgcac ccgaatacat cagcacggag ttgatttccg gtggcaacac cggaaactcc 3767101 atcacccatt ccccttccca gcccgacacc aatccccacc gacacccccc acatgacgtg 3767161 tcgacgcccc gataattttg ctcgcattgc caacggccca agaacgattc cccgataatc 3767221 gcgggtactg ggtgcacttt gcacagacgc cgcagcaaaa tgcacatatg ccctgtccag 3767281 accggcgagc ggcagggcgt catctgccct gacacttcga ctgctggcgg agtccgcgag 3767341 catgctcacc gccgcggcgt gcgccgaacc ggcagcgccg gcaaatccat gaccccagcc 3767401 tgttcttggg tcactgcgac gttcactttt aagcgcgacc acgtaaggtt gggcaaagtt 3767461 cccaagcgtt tcacagtgtc agtgcacagt gcgcacctga ttaccaaaac cccgaacctc 3767521 actcgaaagc cgagagcggg taaaagtcgt tcagcgacct gtctggtaga gaaatccaga 3767581 cccgagtaca tgatccggtc gggatcgtac ttgcgccgca ctgtggtcag ccgcgacagg 3767641 ttcgcgccga agtattgtga cgccgcggcg ttggcctcca ggtagttgac atagccgccg 3767701 accgaaaagt gttgcaccgc gtggtgtgcg tcgctcagcc atttgttggc cgtcgccacc 3767761 tggccgtcgc tgggggtgtt gacataccac tgcaccacag cggactggcg gcaccaggga 3767821 aatgccgagc cctccgggtc catgtcgccc accgcgccgc ccagcgaatc gatcagagcc 3767881 gacgcgcggc ccgcagcggg tggccatgtt ccgatggcgg cgacgatggc ttgggccgcg 3767941 gccggattcg tcgtcccgat gacatcggat ccagccacga agccctccgg cggataggtc 3768001 gtatggccgc cggccagata cctcaccagg tccatacggc gcagcgtctt gtgctcaact 3768061 ccactgggtt gcactccaac cgcggacttg atcgcatccg cgacagccgc gccggaccgc 3768121 gccgggcagc tcgccagcac atgacaattg cctccggatg agctgaccgc ggggtcaacc 3768181 agaccccacg tggtgcggtc ggccccggcc agccacgtct gtcagccgac cagcacctgc 3768241 gcggccgcag acggcgcgaa atcgacacgg acgacatcgc agtccgcggt ggggaacctc 3768301 gcgaacgtca tcgatgtcgt caccccgaag ttgccgcccc cgccgccacg aagcgcccag 3768361 aacagctccg cgtggtcgtc ggcagacgcg ctcaccgcat caccgccggg caacaccacc 3768421 gtcgccgact tgagcgcatc gcaggtcaac cccgcatggc gagaatcggc gcctaacccg 3768481 ccgcccaggg tcaaacccgc cacacccacg gtcgggcagc tgccggtcgg aatcgcccgg 3768541 ctctcaccgg ccaacgcttg atggaccgca tagagatcgg tcgcggccga caccgtacgt 3768601 ttctcgtggc gctgtcgaaa tgcaccccgc ccggtaggcc cagcagatcg agcaccatgg 3768661 cgccattggc cgacgaggcg ccgatgtagg aatgtccgcc gccgcgcaca gcgatcttga 3768721 gcttgctggc cgccgctacg aaaccgcctt ccggacgtct gcctgcgagg cgaccgtcac 3768781 caccgcggcc ggattcaagc cgctgtagtt cgaattgaag atctgctttc cgctcgtgaa 3768841 cgccctgccg ttggccggca gcagcacctg cccgcctatc gatgaggcca gactggccca 3768901 cccatcaccc ggtgttgcgc gcgccaatat cgtcgggaag accgccgacg tcgccggcgc 3768961 tccgacggcg ccgcgaagaa acgtctggcg agacatcacg accgcgatcg tgtcgtatcg 3769021 agaaccccgg ccggtatcag aacgcgccag agcgcaaacc tttataactt cgtgtcccaa 3769081 atgtgacgac catggaccaa ggttcctgag atgaacctac ggcgccatca gaccctgacg 3769141 ctgcgactgc tggcggcatc cgcgggcatt ctcagcgccg cggccttcgc cgcgccagca 3769201 caggcaaacc ccgtcgacga cgcgttcatc gccgcgctga acaatgccgg cgtcaactac 3769261 ggcgatccgg tcgacgccaa agcgctgggt cagtccgtct gcccgatcct ggccgagccc 3769321 ggcgggtcgt ttaacaccgc ggtagccagc gttgtggcgc gcgcccaagg catgtcccag 3769381 gacatggcgc aaaccttcac cagtatcgcg atttcgatgt actgcccctc ggtgatggca 3769441 gacgtcgcca gcggcaacct gccggccctg ccagacatgc cggggctgcc cgggtcctag 3769501 gcgtgcgcgg ctcctagccg gtccctaacg gatcgatcgt ggatgcgatg tagaccatgg 3769561 ccgccgcgac cgtcacggtc gtcacgaaat cgatcccctt gctgcgcacc accaacaggc 3769621 cggcccgttc ctcggacaac accaaccgca gcaccgccgc caccccaacg ccgataccga 3769681 tcagcagcgc accacggcgc cagaagttag cccccgccag cacgaacccc accgcgaaga 3769741 tcgacccaac cagcaggatc ggccactggg cgccaacagt gcgccggaaa acggccctca 3769801 cggtcatcgc cgctcagcca gctccacgac attggtcaac aagaacgccc gggtcaacgg 3769861 gcccacgccg cccggattgg gtgacacgtg gccggcgagc tcccacacat cgggatgcac 3769921 gtcgccgacc agtccgtcat cagtgcggct gacgccgacg tcgattaccg cggcacccgg 3769981 gcgcaccatg tcagccgtca acaggtgcgc caccccgacc gcggccacga cgatgtcggc 3770041 ctgccgggtc aacgcgggca ggtcgcgggt accggtgtgg cacaacgtca ccgtggcatt 3770101 ctccgagcgc cgggtcagca acagccccag cggccggccc accgtcacac cacgaccgat 3770161 aacgaccaca tgcgcgccgg cgatcgagat gtcgtagcgc cgcagcaggt gcacaatgcc 3770221 gcgcggagta cacggcagcg gcgccggggt gcccagcacc agccggccca ggttggtcgg 3770281 gtgcaaccca tcggcgtcct tggccgggtc gacgcgctcc aacgccgcgt tctcgtcgag 3770341 atgcttgggc aacggcaact gcacgatgta gccggtgcag tcggggttgg cgttcagttc 3770401 gtcgatggtc tcattcagcg tggcggtgct gatgtcggcg ggcaggtcgc ggcgaatcga 3770461 cgtgatgccc accttggcgc aatcagcgtg cttaccgcgc acgtaggcct gcgaccccgg 3770521 gtcgtcaccg accaggatgg tgcccaagcc gggcgtgcgg cccgccgcgt ccaatgcggc 3770581 cacccgctgc ttgaggtcac cgaagatctc gtcgcgggta gccttgccgt ccagcatgat 3770641 cgcgcccacg ccagccagtc tggcatgcgt gtccgcggtg ccgatggcga cgacccgctc 3770701 acgcgcccac cgtacggaca acttgtacca ttgtggtaca gattatccgt acatctttct 3770761 aagagaggac gcatgagcat cagtgcgagc gaggcgaggc agcgcctgtt tccactcatc 3770821 gaacaggtca ataccgatca ccagccggtg cggatcacct cccgggccgg cgatgcggtg 3770881 ctgatgtccg ccgacgacta cgacgcgtgg caggaaacgg tctatctgct gcgctcaccg 3770941 gagaacgcca ggcggttgat ggaagcggtt gcccgggata aggctgggca ctcggctttc 3771001 accaagtctg tagatgagct gcgggagatg gccggcggcg aggagtgaga agcgtcaact 3771061 tcgatcccga tgcctgggag gacttcttgt tctggctggc cgctgatcgc aaaacggccc 3771121 gtcggatcac ccggttgatc ggagaaattc agcgtgatcc gttcagcggg atcggcaaac 3771181 ccgagccgct ccaaggtgag ttgtcgggat actggtcgcg ccggatcgac gacgaacacc 3771241 ggctggtgta tcgagcgggc gacgacgaag tcacgatgct gaaggcccga taccactact 3771301 gatttggggg ctggtggtat tccggcgggc ttaagctccc catgtggctc ccggcagctg 3771361 cgaagccccg gacgtgttca acccggccaa actcggtccg ctcacgctgc gtaaccgggt 3771421 catcaaggcc gccaccttcg aggcccgcac acctgacgcg ttggtgaccg atgacctgat 3771481 cgagtaccac cggctgccgg ccgcgggcgg ggtcgccatg accaccgtcg cctattgcgc 3771541 ggtctccccc ggcggacgca ccggcggcaa ccagatctgg atgcgcccgc atgcggtgcc 3771601 gggactgcgc cggctcaccg aggcgataca cgccgagggg gcggcgatca gcgcccagat 3771661 cggccacgcc ggcccggtgg ccgacgcccg ctccaaccag gcgaccgcgc tggctccggt 3771721 gcggttcttc aatccgatcg ctatgcggtt cgcccagaag gcgacccgcg aggacatcga 3771781 cgatgtgctg gccgcgcacg cccatgccgc ccggctggcc gtcgacgccg gcttcgacgc 3771841 cgtcgaaatc catttggggc ataactatct ggcgagcgcg tttctgtctc cgctgctcaa 3771901 ccggcgtgat gacgagttcg gcggttcgtt gcagaaccgg gcgaaggtag ctcgcggatt 3771961 ggtgatggcc gtgcgccgcg ccgtccggca gcaggtcgcg gtgaccgcca agctcaacat 3772021 gaccgatggc atccgcggcg gcatcacagt cgacgaggca ctgaccaccg ccaggtggct 3772081 gcaggacgac ggcgggctag acgcgatcga gctcaccgcg ggcagctcgc tggtcaaccc 3772141 gatgtatttg ttccgcggcg acgcgccggt taaggagttc gccgccgcgt tcaaaccacc 3772201 gctgcgctgg ggcatccgga tgaccggcca taggtttttc cgcgaatacc cctaccgcga 3772261 tgcctatctg ttacgcgagg ctcggttgtt tcgcgccgag ctgacaatcc cgctgattct 3772321 gctgggcggc atcaccaacc gaacgaccat ggacctggcg atggccgaag ggttcgagtt 3772381 cgtcgcgatg gctcgggcgc tgctcgccga gcccgacctg gtcaatcgga tcgcggccga 3772441 aggcagccag gtgcggtcgg cgtgcacaca ctgtaatcag tgcatggcca cgatttatcg 3772501 ccgcactcac tgtgtggtca ccggggctcc atagcgtcca gattgacgcc accgtgaaga 3772561 agtgcaaccc attgtgccgg aaatccggtt gacttccccg cgcgaatccg gctcaggcac 3772621 tattgaccgc gcgcagcata atttgaaccg atgagtcgac cccatccacc ggtgctgaca 3772681 gttcggtccg atcggtcgca gcaatgcttc gccgcgggcc gcgacgtggt tgtcgggagt 3772741 gatcttcgtg ccgacatgcg cgtggcgcac ccactgatcg cccgtgcgca cctgttgctg 3772801 cgcttcgatc ggggcaattg gatcgcgatc gacaacgatt cgcagagcgg gatgttcgtc 3772861 gacggccagc gggtgtcgga agtcgacatt tatgacggcc tgactatcaa catcgggaag 3772921 cccaccgggc cgtggatcac cttcgaggtc ggccatcacc agggcatcat cggacggctg 3772981 tcacgcaccc cgtcgtcgcg tcccggctca ccgatctagc cccctgccaa gcacagcccg 3773041 tgcgccgccg caaaggccac ggcttggtcg acgtcgacac gcgcacccac caacgacgcg 3773101 gtccgccaca ataccgggtc cacggtcgcg ccccgcaagt cggcgtcatc cagccgggcg 3773161 cccgtggtac gggcaccact gaggtcggcg ccgcgcagca cgcacttgcg caagtcggta 3773221 tccaccaggc tggtctctcg caaccggcag ccggtcaagt tgagaccacg cagatcattt 3773281 ccgccgagca cggcgagcgt gaaatccacg tcgtccaacg tcagcggccg cagccggcaa 3773341 gccacgaaga ccgagcccaa catgctgcac tgggcaaatg tgctgtgcca cagtgtcgtc 3773401 cgttcgaagg tgcaattacg aaacgccgac cctcggtgtt gtgactcggc cagattcacg 3773461 ccgctgaaat cgcattcgct gaacatcgcc cgttcggtgt gcaggcggct aaggtcctcg 3773521 tcgcggaagt ctcgaccggt gaattcgcaa tcaacccact gctgcaacgc ttttcaaccg 3773581 cccgcaggag acagggtggc cagcgcgtat tcgctcaccg cgatcagtgc atcggtcgcc 3773641 gacctgcgat tgcgggcgtc aacattgatc accggaatgt gtgcgggcag cgtcaacgcg 3773701 tcgcgcaccg cgctaaccgg ataccttggc gcgctgtcga actcgttgat ggcgatcaag 3773761 aacggcaggt tgcggtgttc gaagaagtcg accgccgcaa agctgtcctg cagacgccgg 3773821 cagtcgacca agacgatcgc cccgatggca ccacgcacca ggtcgtccca catgaaccag 3773881 aaccggcgct ggcccggggt accgaataga taaagcacca gatcctcgcc caaggtgatg 3773941 cggccgaagt ccatcgccac cgtggtgctc cgcttgtcgg gagtggcctc cagcatgtcg 3774001 acgccggcgg aggcatcggt gaccatcgct tcggtgcgca acggcatgat ctccgaaaca 3774061 gcgccgacga atgtggtctt gccggacccg aatccgcccg cgatgacgat cttcgtcgac 3774121 gcggtgccgg atgcctcaga gtgctttaag gccacgcagg gtccttccta tgagttcgtg 3774181 gcgttcgtcg cgggtcgatc ggtcggtcaa ggtcgcgtgc acccgaaggt aaccggacgt 3774241 gaccagatca ccgaccagca cacgcgccac acccaccggc aaatccagcc gagccgagat 3774301 ttccgcgacc gacggactgc caatgcacaa ttgcaagatc ctgcgtcgca tgtcgtaggc 3774361 cggccagcgg ccagccggtc ccgccggcag ggtctgcacc ggcgcctgaa gcggaaggtc 3774421 gacgtcggta ccggtacgtc cggcggtcag cgtgtagggg cggaccaggc ccgccttcgg 3774481 tctatcgccg gcaggattga acaacgccgc ccacccgctc gacaaggatg gccatctcat 3774541 aaccgatctg gccgatatcg catccggtcg cggccagcgc cgccagcgcc gacccgtctc 3774601 ccacctgcat caacagcagg tagccgttct gcatctcaac caccgactgc agcacctgcc 3774661 cgccgtcgaa cagttgcgcg gcgccgccgg ccaggctggc cagcccggac gtcaccgcgg 3774721 ccaactgatc ggcgcgttcg cgtggtagat gttcgctggc cgccacggga agcccgtcga 3774781 ccgacaccag caatgcatgg gccaccccgg gaacctcgcg ggcgaacttc gacaccagcc 3774841 agtcaagcgg gctgtccggc aagcgggctt tcattgctga ttgggtccct gactgctctc 3774901 gcgggcatgc gaccgcccgg tgcgcacgcc gccgaaatgg ctgctgatgg aggcacgaac 3774961 cgcgtcgggg tcgcgtaccg cagccgcgtg ccgcggcgct cggccgggat gaagtccgcc 3775021 gttggatgct agcgctgcac ccggatgctc ccgatcgggt ccctcaggca ccgccgcccc 3775081 cggcactaac cgggccccgg gttcgcgcac cggcaggccg tagtccgtgc gggactgcac 3775141 gggcttgtcc gcggcctcgg cggccgccga ccagccgtgg tcccacaccg acttccagtc 3775201 cagatcgggg ctgtgggcca gctcgtgcgg gtcacccacc atctcggaga gcatccgccg 3775261 gtagatgacg tcgtcatcaa ccgggcccgc cggtggcgcg ggtttggcgg gcggcggcgc 3775321 cggtcgcggt tctggtgcgg gcggttgttt gggctcctgt tgaaacctat cctcccacca 3775381 gggtgttttc agctcgcgcc gccgctgctg catcggctgg gccgggacgt cggcgatgcc 3775441 actggacccc ggggtacggc gcgggagcaa cgtgaccggt ggtagcggcc cgatggcggc 3775501 gggaacgtcc gtcggatcgg ccgccgcggg ttcaggacac ggcggcttga tcgcaaatac 3775561 ccgcggcttt ggcggctgcg ctggggccgt cccctcgagc acggctagcg gcaggtagac 3775621 ctcggcggtg gtgccggtgc cctgttcacc ggtcaccgga ccgcgcagcc cgactcggat 3775681 gccgtgccga ccggccagcc ggccgactac gaacagaccc atgtgccggg cactatccgg 3775741 ggtgacctca ccgccggccc gcagccgcat attggccatc cgccgatcgg catcggtcat 3775801 gcccaggccg gaatccgaga ttcgcagcag aacactgcct tcgctgccga ttgcggcggc 3775861 aacccgaacg ggtgtggtcg gtgacgagta gcgcaacgcg ttgtcgatca gctcggcaag 3775921 cagatgaatg acgccaccag ccgctgcgcc gactaccgca cagtcgggta ccctcgcgat 3775981 gtcgacgcgg cgatagtcct cgacctctga cacggcggcg ctgatcacgg ttgacagcgg 3776041 caccggctcg cggtggtcac gggtaatctg cgcaccggcc agcaccagca ggttggcgct 3776101 gttgcggcgc agccgggcgg ccaggtgatc gagccggaaa aggctgtcga gtcgggcggg 3776161 atcctcctcg ttgcgctcca gttggtcgat gaccgacagc tgctggtcga ccagggaacg 3776221 gctacgccgc gacatggtct caaacatctc gttgaccagc agtcgcaacc gcgtttcctc 3776281 gccggccagc aacagggccc gggtgtgcag ctcgtcgacc gcatgcgcga cctgaccgat 3776341 ttcctcggtg gtgtacaccg ccagtggctc ggggatcggc tcgtcgccgg cgcggaccgc 3776401 cgcgatctcg ccgtcgagat cggtatgagc aaccttgagc gccccatcac gcagtacccg 3776461 catcggcccg accagcgtgc gcgccaccac caacacgacg acgatcgcgg tcgcgatggc 3776521 ggccaacacc agcacggcgt cgcgaatcgc ggcatcccgc cggtcggtgg cctggctttg 3776581 caccgacttc gtcaccgcct cggtggtgtc ggtgatcacc tgctcggcaa tgtcgcgggt 3776641 gatctgtatc gagtgcagca gctctgggtt gttgaccagt gcaacggccg gatcggacat 3776701 gatcgccatc ctggtcacca tttgctgctg caggttcttg gtgtccggcg agcctgcacc 3776761 gagcgccgcg ctcatcccga acagcgtcga gggttcggtg ccggccaggg taaccatcgc 3776821 gctgcgcagt tgcggctcgg caaggtcggc gccgcgagtc accaggatct cctgcatcgt 3776881 catctgcccg cgggcgccaa cggctcggct caaaccctgc acctgggttc ggatttgctc 3776941 gctgtcaacc cgcaccgacg cgtcaatcac gttctgggcc gtcaacagca gcggcgcgta 3777001 ggcggtgacc cgatcccgca agccgatgct gtcggccagc accttatcca gcagcgcctg 3777061 accgccgttg agcagcgtgt tcactcccga ccgcacgtct gcgatgacgt cggtgtcggc 3777121 cagtcgcgtc tgcagctcgt acttgcgggc ggtgaagttt ttctgcgccc cctccacatc 3777181 gtgtccggtc gagctggcca gcacggcgac gtccagcgcc gacatgtatt tcgtgatcgc 3777241 gggtatcatt tcggcgcgcg cggcgaccag ccgcaggccg ctggtgctgg ccatcgcagc 3777301 ctcgacccgc aatcctgcta acaccatcgc cactaccagc ggcagaagcg cgatcgtgaa 3777361 cactttccat cggaccggcc agttgcgcgg cgaccaggac ggcgggcgtt gctgaggttt 3777421 gccgcgggcc ggttgagccg gggcggaaat atcagaagcg gccgccgcga ccgggatggt 3777481 cgggcgggcg aacatggtca cgtggccgcg gccgtgccac cggccgcacc cttatgcagc 3777541 gctcgaaaaa cggagagact catagacttc ctgctcatgc cttgatgccg tccgccccag 3777601 ccggccgggc gcggacgtaa acaactggca atccgacgag tatgacagcc cacggccgag 3777661 gtctccaccg ctgtcaccga gcatgtcacc ggacaggccg gcaaacgggc accgggcgct 3777721 ttgccatgat cggcggatgt tccggctgct gttcgtatct ccgcgtatcg cccccaacac 3777781 cggcaacgcc atccggacgt gcgccgcaac cggctgtgaa ctgcatctgg tcgagccgct 3777841 cggcttcgac ctgtccgaac ccaagctgcg acgggccggg ctggactacc acgacctggc 3777901 ctcggtcacc gttcatgcct cgctcgcgca cgcctgggag gcgctgtcgc cagcgcgggt 3777961 gttcgccttc acggcgcagg cgacgacgtt gttcaccaac gtcggctacc gggccggtga 3778021 cgtgttgatg ttcgggcccg aacccaccgg cctggacgag gccaccctgg ctgatacgca 3778081 catcaccggg caggtgcgca ttccgatgct ggcgggccgg cgctcgttga acctgtccaa 3778141 cgccgcagcc gtcgcggtct acgaggcctg gcgtcagcac ggctttgccg gggcggtcta 3778201 gtcgcgacca aggtgacacc gaaccagccg gtatgcgcac aacgaagctc atcggcgtcg 3778261 ggcgccggac aggagcaccc aaccggtgac agcacaccga acgcaacccg ggcgatcaca 3778321 tcggaccacg acatcccggg aaaatcgatg ccggtgagct tgcgcgtcca gctaccacca 3778381 ccgtcagcgg tgacaccttc accggcaaca acggcagcgc aggcgcagct gtcagcggcg 3778441 gcgcgcagcg aaggcgttgc ggtcaatgaa tctgccgcaa accccacgcc cgttggccca 3778501 tattgcgcta gcatccgggt gttgtgatct cgcaggttgc gtgctggcag cctgggggtg 3778561 ggttgtgatg tcgtttgtcg tagcagtccc ggaggcattg gcggcggccg cgtcggatgt 3778621 ggcgaacatc ggttctgcgc taagtgccgc gaatgcagcg gcagccgccg gcacaacggg 3778681 gctactggca gccggtgccg acgaggtctc ggccgccctg gcgtcgctgt tttccgggca 3778741 cgctgtgagc taccaacagg tcgcggccca ggcgacggcg ttacacgatc agtttgtcca 3778801 ggccttgacc ggtgccggcg gatcgtacgc cctcaccgag gccgccaacg tccagcagaa 3778861 tctgctgaac gcaattaacg cgcccactca ggcgctgttg gggcgcccgt taattggcga 3778921 cggggctgtc ggcaccgcca gcagccccga cgggcaagat ggcggtctgc tgttcggcaa 3778981 cgggggcgcc ggctacaaca gcgccgccac gcccggaatg gccggcggca acggcggcaa 3779041 cgccggattg atcggcaacg gcggtactgg cgggtcgggc ggtgccggcg cggccggtgg 3779101 cgccggcggc agcggcggct ggttgtacgg caacggcgga aacggcggca tcggcgggaa 3779161 tgcgatcgtc gcgggcggtg ccggcggcaa tgggggcgct ggcggcgccg ccggattgtg 3779221 gggcagtggc ggcagcggcg gccaaggcgg caacggtctg accggcaacg acggcgtgaa 3779281 tccggccccc gtcacaaacc ccgcgctaaa tggcgccgcc ggcgacagca atatcgagcc 3779341 gcaaaccagc gtcctgatcg gcacccaagg cggtgacggc acgcccgggg gtgctggcgt 3779401 caacggcggc aacggtggcg cgggcggaga cgccaatggc aaccccgcaa acacctcgat 3779461 cgccaacgca ggcgccggcg ggaacggcgc cgccggcggt gacggcggtg ccaatggcgg 3779521 tgcgggcggc gccggcgggc aggccgcgtc cgccggtagt tccgtcggcg gtgacggcgg 3779581 caacggcggt gccggcggta cgggcacgaa cgggcacgcc ggcggtgcgg gcggcgccgg 3779641 cggtgccggt ggtcgcggcg ggtggctggt cggcaacggt ggcaacggtg gcaacggtgc 3779701 cgccggcggc aacggcgcca tcggcggtac cggtggtgcc ggcggcgtcc ccgccaacca 3779761 gggcggtaac agcgccctag gcacccagcc ggtcggcggc gacggcggcg acggcggcaa 3779821 cgggggcacc ggaggcaccg gcgggcgtgg cggcgacggc ggatccggcg gcgcgggcgg 3779881 cgcgagcggt tggttgatgg gcaacggcgg caacggcggc aacggcggca ccggcggctc 3779941 aggcggtgtc ggcggcaatg gcggcatcgg cggtgacggc gccggcggcg gaaacgccac 3780001 gagcacgtcg agcatcccct tcgacgccca cgggggtaac ggcggcgctg gtggcgacgc 3780061 tggtcacggc ggaacgggcg gcgacggcgg tgacgggggg catgccggca ccggtggacg 3780121 tggcgggtta ctggccggcc agcacgccaa ctccggcaat ggcggtggcg gcggtaccgg 3780181 cggtgccggg ggcacccatg gcacccccgg cagcggcaac gcaggcggca ccggcaccgg 3780241 taacgctgac agcacaaacg gcgggccagg cagcgacggc ctcggcgggg acgcgtttaa 3780301 cggcagtcgc ggcaccgacg gcaaccccgg ctaattacca gccgttccag tgcgtcacgc 3780361 tctcggccgg cagccgcttg gccggccgga agtcgatgcc ttgtgtgtag gcgatcggaa 3780421 gcagcccgcc ttggctgtat tcgtcgtagg gaatgccgag cacgtcggcc accttgtgct 3780481 cgccgttgtc gagcaggtgc agcgtcgtcc agcacgaacc cagcccgcgg gagcgcagcg 3780541 ccaggcagaa gctccacacc gccgggaaca gtgaggccca aaacgacacg ccacccaccg 3780601 ccgactcgtc ttcccggcct ttcaggcagg ggatcagcag caccggcgcc cggtgcatgt 3780661 gttcggcgag ataggtcgcc gaatcgcgga cccgccccat ccgctcgccg cgggtgtcgc 3780721 cgtcggggta ctcgggcgcc ggcccgctga ggtagccccg ggcgttggcc aggtagacgt 3780781 cggcgatcgc ctttttcttg gcggcgtcct cgacgaacac ccactgccag ccttgggaat 3780841 tggaaccggt gggcgcctgc agcgccagct cgaggcattc catcagcacg tcgcgtggca 3780901 ccggcttgtc gaaatcgaga cgcttgcgca ccgagcgggt agtggtcagg acctcgtcga 3780961 cggacaggtt gagggtcatg tgggcaggct accgttgggc catgagcgtc gaactgacac 3781021 aagaggtttc tgccaggctc acgtccgacc tttacgggtg gttgaccacc gtcgcccgat 3781081 cggggcagcc ggttccgcgg ctggtgtggt tctacttcga cgggaccgac ctgacggtgt 3781141 actccatgcc tcaggcggcc aaggtcgccc acatcaccgc ccatccgcag gtcagcctga 3781201 acctggactc cgacggcaac ggcgccggga tcatcgtggt gggcgggacg gcggcggtgg 3781261 tggccaccga tgtcgactgc cgcgacgacg cgccgtattg ggccaagtac cgcgaggatg 3781321 ccgcgaagtt cgggctgacc gaggcgatcg ccgcctacag cacccggctg aagatcaccc 3781381 cgacccgggt gtggacgacg cccacgggct gagcgggctg gcccccgctc gccgccagag 3781441 tgaaatccac gacgcgtttg cggcgtgtcg cgtcgcccgt ttcactgtcg gcgcagaggt 3781501 tcaccggaag tcgcgcgagc gcgcgccgac cgccagggtg aggcggccca tccgttcggc 3781561 gacgacggtg attgcgccgc tggcgttttg gacctggccg cggatcagca gcgccggcgc 3781621 cgtgtgcgcg agcttgcggt gtcgcgccca caccccgggc gtgcagagca cgttgaccat 3781681 cccggtctcg tcttcgaggt tgatgaacgt caccccctgg gccgtggcgg gtcgctgccg 3781741 atgagtcacc gcgccggcga tcagcacgcg gtcgccgtcg gacaccgatc ccagcctctc 3781801 ggcgggcagc acccccatcg cgtccaggtc cgcccgcagg aactgggtcg gatagctgtc 3781861 cggggagacg ccggtggccc acacgtcggc ggcggccagc tccagctcgc tcatccccgg 3781921 cagcgccggg atgtgcgacg acgagcccac cccgggtaac cggtccggcc ggcccgtggc 3781981 cgcggccccg gccgcccaca gcgcctcccg ccgagacatg ccgaagcagc ccagcgcccc 3782041 ggccgtcgcc agcgcttcga cctgcggcac ggaaagctgc acccgcgacg tcaagtccgg 3782101 cagggaggtg aacgggccgt tggctgttcg ctccgcgacc agcttctcgg ccagctcggc 3782161 gccgaggtag cggacggcgc ccaagcccaa acgcacctcc gttccggcgt tctcacacgt 3782221 ggcgtgcgcc aggctggcat tgacacacgg gccgtgcacc gccacgccgt gccggcgggc 3782281 gtcggccacc agcgactgcg gcgaatagaa acccatcggc tgggcgcgca gcagcgccgc 3782341 acagaacgcc gccgggtggt gcagcttgaa ccacgccgag tagaacacca gcgacgcgaa 3782401 actcagtgcg tggctctcgg ggaagccgaa attggcaaac gcctccagct tttcgtagat 3782461 ccggtcgatc acctcgtcgg gggcgccgtg cagcgcgcgc atgccgtcgt agaaccggcc 3782521 gcgcagccgg cgcatgcgtt cggtggagcg tttggacccc atggcgcggc gcagctggtc 3782581 ggcctcggcg gcggaaaagc cggcgcagtc gaccgccaac tgcatcagct gctcctgaaa 3782641 cagcggcact cccagcgtct ttcgcaatgc cggcgccatc gacgggtgct cgtagatgac 3782701 cgggtcgacg ccgttgcgcc gccggatgta ggggtgcacc gatccgccct ggatgggccc 3782761 ggggcggatc agcgccacct ccaccaccag gtcgtagaac actcgcggct taaggcgcgg 3782821 cagggtggcc atctgcgcac gtgactccac ctggaacacg ccgacggaat cggcgcgggc 3782881 cagcatctca tacaccgccg gctcggagag gtcgaggcgg gccaggtcca cctcgatgcc 3782941 cttgtgctcg gccaccaggt ctttcgcata gtgcagcgcc gagagcatgc ccagcccgag 3783001 taggtcgaat ttcaccaagc cgattgccgc gcagtcgtct ttgtcccatt gcaggacgct 3783061 gcggttggcc atgcgcgccc attccaccgg gcacacgtcg gcgatcgggc ggtcgcagat 3783121 gaccatgccg ccggagtgga tgcctaggtg ccgcggcagg ttgcggatct gggtggccag 3783181 gtcgatcacc tgctcgggga tgccgtcaac gtcgtcggcc tgcccggtcc agtggctgac 3783241 ctgcttgctc cacgcgtcct gctggcccgg cgagaagccc agggcgcggg ccatgtcacg 3783301 caccgcgctg cgcccccggt aggtgatgac gttggcgacc tgggcggcgt agtcgcggcc 3783361 gtatttgtgg tagacgtact ggatgacctt ttcgcgctga tccgactcga tgtcgatgtc 3783421 gatgtcgggt ggcccgtcgc gggcgggcga taagaagcgc tcgaacaaca gctcgttggc 3783481 caccgggtcg acggcggtga cgcccagggc atagcagacc gcggagttgg ccgccgatcc 3783541 cctgccctga cacaggatgt cgttgtcccg gcaaaaccgg gtgatgtcgt gcaccaccag 3783601 gaagtagccc ggaaatctca gttgggcaat gactttcagc tcatgctcga tctgggagta 3783661 cgcccggggc gcgctcttgg gcggcccgta acgctcgcgg gcgcccgcca tgaccaacga 3783721 ccgcagccag ctgtcctcgg tgtgcccgtc gggaacatcg aacggcggca gccgcggcgc 3783781 gatgagctgt aggccaaagg cgcaccgctc gccgagctcg gcggccgcgg tcaccgcctc 3783841 ggggcaccac gcgaacaacc gggccatctc ctccccggac cgcaggtgcg ccccacccag 3783901 cggagccagc cacccggccg cggagtccag cgaccgccgg gcccggatgg ccgccatcgc 3783961 catcgccagc cgcccacgtg acggatccgc gaagtgcgcc ccggtggtgg cgacgatgcc 3784021 gacaccgaag cgcggcgcca gtccggccag cgcggcgttg cgttcgtcgt cgagcgggtg 3784081 accatgatgg gtcagctcga tgctgacccg gctgggggtg aaccggtcca ccagatcggc 3784141 cagcgcccgc tgcgccgcgg ccgggccacc ctgggaaagc gcttggcgca catggccttt 3784201 gcggcagcca gtcaggatgt gccagtgccc gccggcggcc tcggttagcg cgtcgaagtc 3784261 gtagcgcggc ttaccctttt cgccgccggc cagatgcgcc gccgccagtt gccgcgacaa 3784321 ccgccggtag ccttccgggc cgcgggccaa caccagcagg tgcgggccgg gcggatccgg 3784381 ccgctcggtg cgagccgtgg cgcccagtga cagctcggcg ccgaagaccg tgcgcacgtc 3784441 gagttccgcg gccgcttcgg cgaaccgcac cgccccgtac aggccgtcgt ggtcggtcag 3784501 cgccagggca cacaggccca gccgggcggc ctcctcgacc aactcctcgg gcgtgctggc 3784561 cccgtcgagg aagctgtacg ccgaatgcgc atgcagctcg gcatacgcga cggacgatcc 3784621 gacccgttcc cggcccggcg gctggtacgc cccgcgcttg cgggaccgtg ggacgtcccc 3784681 atccgcgtcg aacgccggca ccccggcatg gcgcggcttg ccgttaagca cccgttccat 3784741 ttccgcccag ctcggcggcc cgttgctcca ccccacattc cacagtatat cgaacaattg 3784801 ttcgatacag cgcagttgtt cagcacatct tcacctgcga aacatgttct taaccgtttg 3784861 ggccttctgc ttccggtgcg gtccggcgga cacttatacc tggggtcgca aaacgacggt 3784921 ggggacttgt catggcacaa ctgacggcac tggatgcggg ttttctcaag tcccgcgatc 3784981 cggagcggca cccgggcctg gcgatcggcg cagttgccgt cgtcaacggt gccgccccca 3785041 gctacgacca gctcaaaacg gttctcacag aacggattaa gtcgatacct cgatgtaccc 3785101 aggtgttggc gaccgagtgg atcgactatc cgggattcga cctcacccag cacgtgcgac 3785161 gggtggcgct tccccggccc ggcgacgaag ccgagctgtt ccgggccatc gcgctggcac 3785221 tggagcgtcc cctcgacccg gaccgcccgc tgtgggaatg ctggatcatc gaaggcctca 3785281 acggcaaccg ctgggcgatc ttgataaaaa tccaccattg catggccggc gccatgtcgg 3785341 cggcccacct gctggccagg ctctgcgacg atgccgacgg cagtgccttc gctaacaatg 3785401 ttgatatcaa acagattccg ccgtatggcg atgcgcggag ctgggccgaa acgctgtggc 3785461 gaatgtccgt cagcatcgct ggcgccgtct gcacggccgc ggcacgcgcc gtcagctggc 3785521 cggcagtgac gtcaccggcc ggcccggtca ccaccaggcg gcggtaccaa gcggtgcgcg 3785581 ttccccgcga cgccgtcgac gccgtgtgcc acaagttcgg ggtgaccgcc aacgacgtcg 3785641 cgctcgcggc catcaccgag ggcttccgaa cggttctgct gcaccgcggc cagcaaccgc 3785701 gcgccgactc actgcgtacc ctggagaaaa ccgatggcag ctcggccatg ctgccctatc 3785761 tccccgtcga gtacgacgac ccggtgcggc gattgcgcac cgtgcacaac cggtcacagc 3785821 agagcggccg tcgtcaaccc gacagtctgt cggactatac gcctctcatg ttgtgcgcca 3785881 agatgattca cgcgctagct cggttaccgc aacaaggcat cgtcaccctg gcgaccagtg 3785941 cacccaggcc acgccaccag ttacggctga tgggccagaa gatggaccag gtgctgccca 3786001 tcccgcccac cgcactgcag ctgagcaccg ggatcgcggt cctcagctac ggcgatgagc 3786061 tggtgttcgg catcaccgct gactatgacg ccgcgtccga aatgcagcag ctggtcaacg 3786121 gtatcgaact gggtgtggcg cgtctggtgg cgctcagcga cgattccgtg ctgctgttta 3786181 ccaaggatcg gcgtaagcgt tcatcccgcg cactccccag cgccgcgcgg cgggggcggc 3786241 cctctgtgcc gaccgcccga gcgcgtcact gacgccatct ccgtcggcgt tgacccccgt 3786301 gagagggtgg gtcgtgcgca agttgggccc ggtcaccatc gatccgcgcc gccatgacgc 3786361 ggtgctgttc gacaccacgt tggacgccac ccaggaactg gtccggcaac tccaggaagt 3786421 cggtgtgggc accggcgtct tcggtagtgg cctagacgtt ccgatcgtag cggccggccg 3786481 tctggcggtg cggccgggcc ggtgcgtggt cgtctcggcc cactcggcgg gcgtcacggc 3786541 cgcacgcgaa agcggatttg cgctgatcat cggtgtcgac cgcaccgggt gtcgggacgc 3786601 attgcgtcgc gacggcgccg acacggtggt caccgaccta agcgaggtca gcgtgcgcac 3786661 cggggaccga cgcatgtcgc agctgcccga cgcgttacag gcactcggcc tggccgacgg 3786721 cctggtcgcc cggcagcccg cggtgttctt cgacttcgac ggcacgctgt ccgacattgt 3786781 cgaggatccc gacgcggcct ggctcgcccc cggtgccttg gaggcactgc agaagttggc 3786841 cgcgcgctgt ccgatcgcgg tgctcagtgg ccgcgacctg gccgacgtga cacagcgggt 3786901 gggtctgccc ggcatctggt atgccggcag ccatggtttc gaattgaccg cacccgacgg 3786961 aacgcaccac cagaacgacg ccgcggcggc agccataccg gtgctgaaac aggcggctgc 3787021 cgagctgcgc cagcaacttg gacccttccc gggtgttgtg gtggagcaca agcggtttgg 3787081 cgtcgccgtg cactaccgca acgcggcccg ggaccgggtc ggcgaagtcg ccgcggcggt 3787141 gcgcacggcc gagcagcgtc atgcgctgcg ggtgacgacg ggccgcgaag tcatcgagtt 3787201 gcgtcccgat gtcgactggg acaaggggaa aacgctgctg tgggttcttg accatctgcc 3787261 gcattcgggc tcggctcccc tggtgccgat ctacctcggc gacgacatca ccgacgagga 3787321 cgctttcgat gtggtcggcc cccatggtgt tccaattgtg gtgcgccaca ccgacgacgg 3787381 tgaccgcgcc accgccgcac tgtttgcgct ggacagtccc gcacgggtcg cggagttcac 3787441 cgatcggctg gcgcgtcagc tccgtgaggc tcccctgcgg gcaacgtgag acgcggtgcc 3787501 gccgcgggcg atacgctccg accgtcaacg aggaggacgg ccatgtggtt tgcattggtg 3787561 aacccggaga tgctggccgc ggcggcgaca gacttgggcg gcatcaggtc agggatcagc 3787621 gccgcctatg cgcgtcctct gcggtgacct ggctggtagc ttaggcacgt ctttatcgac 3787681 accgggtgct gccagagaac tcgagacgcg gcacaggtcg gcaccatgag gcggcgtgca 3787741 atgacgaaga tggacgaggc tagcaatccg tgcggcgggg acatcgaagc tgagatgtgc 3787801 cagttgatgc gcgagcaacc acccgccgaa ggcgtcgtcg atcgtgtcgc gctgcaacgc 3787861 catcgaaacg ttgcgttgat cacgctgagc catccgcagg cgcagaacgc actcaacctg 3787921 gcgagctggc gtcggctgaa gcggctgctg gacgatctcg ccggcgaatc ggggctgcgg 3787981 gcggtggtgc tgcggggcgc cggtgacaag gcgttcgccg cgggtgccga catcaaggag 3788041 tttccgaaca cccgcatgag cgccgcggac gccgcggagt acaacgagag cctggccgtc 3788101 tgcctgaggg cgttgaccac gatgccgatc ccagtcatcg cggcggtccg ggggctcgcc 3788161 gtcggtggcg gctgtgagct ggcgacggcc tgcgatgtgt gcatcgcgac cgacgacgcg 3788221 cgcttcggca tcccgctggg caagctcggc gtcacgacgg gcttcaccga ggcggacacc 3788281 gtcgcgcgcc tcatcggtcc ggcggcgctg aagtatctgt tgttcagcgg agaactgatc 3788341 ggcattgagg aagccgcccg ctggtgattg gtgcaaaagg tcgtcgcacc acaggatttg 3788401 gcggccgcga cggccaaact ggtcggccag gtctgtcggc aatccgcggt gaccatgcgt 3788461 gcggcgaagg tggtcgccaa catgcacggc cgagcgctga ccggcgccga caccgatgcg 3788521 ctgatccggt tcggtgtcga agcctacgag ggggcggacc tacgcgaagg ggtggcggcc 3788581 ttcagccagg gacgcccacc caaatttgat gattagcgcc atgaccgatg ctgacagtgc 3788641 ggtccctccc cgactcgacg aggacgcgat ctcgaaactc gagctgaccg aggtcgccga 3788701 cctgatccgc acccggcaac tgacgtcggc agaagtgacc gagtcgacgc tgcggcgtat 3788761 cgaaaggctt gacccccagc tgaagagcta cgccttcgtc atgccggaaa ctgcgctagc 3788821 ggcggcacgt gccgccgacg ccgacatcgc gcgcggccac tacgagggtg tcctgcacgg 3788881 cgtaccgatc ggcgtgaagg atctctgcta cacggtcgac gccccgaccg cggccggcac 3788941 caccatcttt cgtgactttc gcccggcata cgacgcgacg gttgtcgcga ggttgcgcgc 3789001 ggccggcgcg gtgatcatcg gcaagctggc catgacggag ggggcctatc tcggctatca 3789061 ccccagtctg ccgaccccgg tcaatccctg ggacccgaca gcgtgggcgg gcgtgtcctc 3789121 gagcggctgc ggcgtggcca ccgcggcggg attgtgcttc ggctcgatcg ggtcggacac 3789181 cggggggtcg attcgctttc cgacgagcat gtgcggcgtc accgggatca aaccgacgtg 3789241 gggccgggtc agccgtcacg gcgtcgtcga acttgcggca agctacgacc acgtcgggcc 3789301 gatcacccgt agcgctcacg atgcggcggt attgctcagt gtcatagcgg gatccgatat 3789361 ccacgatccc tcgtgctcgg cggagcccgt tccggactat gccgccgacc tcgccttgac 3789421 acggattccg cgtgtcgggg tggactggtc gcagacgacg tcgtttgacg aggacaccac 3789481 ggcgatgctg gccgatgtcg tcaaaacgct cgacgacatc ggatggcccg tcatcgacgt 3789541 caagctgccc gcgcttgcgc cgatggtggc agcgttcgga aaaatgcgcg cggtcgaaac 3789601 ggcgatcgcg catgccgaca cctacccggc gcgcgccgac gagtacgggc cgatcatgcg 3789661 cgcaatgatc gacgccggac acaggctggc tgcggtggaa tatcagacgc tgaccgagcg 3789721 gcgtctggaa ttcacgcgat cgctgcgtcg cgtgttccac gacgtggaca tcctgctgat 3789781 gcccagcgcc ggaattgcct cgcccacact ggaaaccatg cgcgggctcg gacaagaccc 3789841 ggagctgacc gccagactgg cgatgccgac agcaccgttc aacgtcagcg gtaatcccgc 3789901 gatatgccta ccggcgggaa cgacggcgcg cggaacgccg ctcggcgtcc agttcatcgg 3789961 ccgtgaattc gacgagcact tgctcgtccg agccggccac gcatttcagc aagtcaccgg 3790021 gtatcatcgc cgacgcccgc cggtgtgaaa aaccctcggc cgcaaaaggc ttgcgaatgt 3790081 cgcaccgaag gtcgcggcga atcgccttac tggtatgttt acgaacacaa tctgtggcca 3790141 tcaagggagg acgcgttgag cattagcgcg gttgttttcg accgtgacgg tgtgctcacc 3790201 agctttgact ggacacgtgc cgaggaggat gtgcggcgaa tcacgggcct accattggag 3790261 gagatcgaac gccgctgggg tgggtggctc aacggattga ctatcgacga cgcgttcgtt 3790321 gaaacccagc caattagcga gttcctctcg agcctggcgc gcgagctcga gctcggttcg 3790381 aaggcaagag acgagctagt gcgcctcgac tacatggcgt tcgcccaggg atatccagac 3790441 gcgcgtccag cccttgaaga agcccggcgc cgtggcctca aggtcggtgt tctcacaaac 3790501 aacagcctgt tggtcagcgc ccgcagcctc cttcagtgcg ccgctctgca cgacctcgtc 3790561 gacgtcgtgc tgagttcgca gatgatcgga gctgccaagc ctgacccgcg ggcctatcaa 3790621 gcgatcgcgg aagccctcgg cgtctcgaca acgtcatgcc tgttcttcga cgacatcgcc 3790681 gactgggttg agggcgcacg gtgcgcgggc atgcgcgcgt acctcgtgga ccgttccgga 3790741 caaactcgcg acggcgtcgt tcgcgatttg tccagccttg gagcgatcct ggacggcgcg 3790801 ggaccatgac cgaacgtgac gagccggaca tcgccgacag ggacgcctca ttggttactc 3790861 tcatcgacca gccgcagtgc acttaggatg gcagccttaa ctaccgtcgc cgagcagtaa 3790921 agtgtcttgg caatccacaa cggcgcgtat ggcggttcgc agtgttgcga tagccaccca 3790981 cccgcgcgac tgatctgcgc cgacaaggat gtgccgctgt gcctctgcca atgcgccaga 3791041 gcttgaatgc aatatgctgt ctcttccgca gtcgcttggc cgtcgaaaaa tccccacgag 3791101 ccatcgggcc tctgcgtatt aagaatccac ccaatcgcat ctgagcatag ggcatcatca 3791161 tagttactgg cagcacatat cagatgcgca gtcgtataat atgccgatcg gtgccactta 3791221 tcccgccagc agaaccgtcc aggctccttg cttgatcgga tgaattccag aacctttcgt 3791281 actcgtggat gacatttgtc gtagcccgcc tgcttcaacg caccgagcac gtggacgttc 3791341 gtcgatatcg aggggccgac ttcgtgaaag taggtacgga accaatcggc gtcttcgaat 3791401 tgtaatacgg ctccgatatc cggcgaccgt ccaaacttcg acaaaacatc gtaggccaca 3791461 cttgtggtgt cacaatcttc caaggtggaa tttcctgtcc accccacacc tcgaccacgg 3791521 acccaatgtt gttcgacatg gtcaagatag ggtaggtacg tacgaacgat ctcaggatcg 3791581 gacaaatcaa tatccgtacg cgagagattc catagagacc aaacaatttc aaaaatctcg 3791641 gcttgataga aggccggcgc accgccatcg ccggcttgaa ttatcgatga gatgtacgcc 3791701 aaggcccgct tgtctcctgg tttaacatgt aacgcgaagt aggctgacgc tgatggcgaa 3791761 tacttgaccg atccatttgt ctcctgcaag ttatcgacat ccaacatacc gacaccgtct 3791821 tggccggcca gttctacgga gaaagctgcg gtgatatgtt tattgatttt gcttccgccg 3791881 agttttctca acttctgctc acgcactccg acaagctcgc cgaggatgga ttcctcgtgg 3791941 caaatggcaa ggccaagtcg cgccgcctca gccatcagcg taggtgcgat taactcaaac 3792001 ccgacggttg cgtcttttat atcaagttga gggccttcga aagcacccga ggtaaggttc 3792061 ttcagggcta gcaagccttt ttcaacttgc gctgcgcgcc tccgacgatg cttattcgac 3792121 gtgaggctga tcatggccgc caaagtggag agcagtcgat cttcgtagca gaaagggaac 3792181 tcggctcccc atgagccgtc aggaagctgg cgctcgcaaa gccagttgag ggcgaggtcg 3792241 cttagctcat catcgagctg gcccagcttc gcgacccacg cggtgtcata ggctgtgctc 3792301 gagatgccgt tgcctagtgc cgctttcgct agcagagtcc tgaaagtctc cataccatca 3792361 gccctccgcg aaccagattc catcatgaac acaacccaca ccgaaaactc tgtcaggctg 3792421 ggctcgatat ctgttgcgca gtacattgag ctgatcggcc gacatcgctg aatagtcagg 3792481 tttgggccgg aaatgacgga gatagatatg atcgtacaag atcctccgga gcgttgtttc 3792541 ggtcatatag tatgagggag caacggtgaa gtatagactc gtttttcctg agctgagcag 3792601 cggaaaatcg aaagtgctga aacgcccaaa tccgatgaac atgtcagcct tgtcaacata 3792661 ctcgccgtaa taaccttcga taatctctct ccgcgtggga ggtttaccat gcgtttcgtt 3792721 ccacgagata gaaaactgcg caacagactc agccgcatcg ttgccaaaaa caccgaagca 3792781 tagtctgtgt tcagtgttgg atgacgtaga gattgttagg tcatcgaatg acttgacaac 3792841 ggctgcccct tgcgcggtcg aaggcaatct tttcttataa tcaccataaa aaagaacgtg 3792901 gacctcgtgt tctttataga acgacagaat ttcctcgtcg ttggcaagca aggccatccc 3792961 ttcgagcgcc tgtacgatat atcgatcacc acgatccaga agatcgtcgc taaagattgg 3793021 cgagattact gtttcgatgc cgtgctcgaa gagcatcttc agaatacgaa ttgattgacg 3793081 caaggcggcc tgctgataat cgtcgtactg cggattacat tcgaggtgaa accagcggcg 3793141 tgtgccatcg aagggaaaga cggatacctt cggtccacgg caacgtacaa tctctgctac 3793201 ggatactaga ggaagatcca agaattcttt ttcgctaacc aagttcatgc ttcctcttaa 3793261 taactatcgc cggaatcagg atggtcttcg ggtccaggga cttcatgtag tgcgttaagt 3793321 agtgatttgc atcttatgcg gattgcgggg ccggtgagtc cgtggctgga aaggatgtgg 3793381 tcgcggctgg cgtggggaat gtaggccggt ggcagtccca gtgtgtaggt gcgcgtccgc 3793441 gggtgtgtcc gcccgatgtg gtggctaagg tgcgcgccga ttcccacgtc ggcaatcgca 3793501 tcttcgacac acacggtgat ccgatggcgg ccagccagct cggtcagtgc cgggctgatt 3793561 ggccagaccc attgtggatc aacgactgtc accccgatct gctcctcgct gaggcaccgg 3793621 gcggcgtcca tgcatggtcg actcatggca cccactgcga ccaagagcac gtcgggtcgc 3793681 caatgcggtg gtggtgtatg caagacgtcg aggccaccga tggtgtgttc ggccgtgatc 3793741 ggttcgcccg gcgccccttt ggggaaacgc acggcggtgg gagccgcggt cgcgatcgcg 3793801 gtacgcaact gttgtcgtag ccgaggcgcg tcgcgcggac aggcgatctg aaacccgggc 3793861 acgcaggcca gcagcgccag atcccacaaa ccgtgatggc tgggtccgtc gggcccggtt 3793921 accccagccc ggtccagcac cagcgtcacg ggtaaccggt gcagcccgat gtcgaacaga 3793981 agttggtcaa aggcgcggtg cagaaacgtc gagtacaccg cgacaacggg atgggttccc 3794041 gcggcagcta gcccggccgc gctggccaac aggtgttgtt cggcgatgcc cgaatcgaac 3794101 acccgatgcg ggtatcgcct cgacagcgcg cctagaccag tgggcagacg catcgccgcg 3794161 gtcagcccga cgacgtcgga tcggtcgtca gcaatgcgcg cgatttcgtc ctcgaacacg 3794221 tcggtccagc tccgctgact gggtgtgcta gcgaggccgg tggcaatgtc gaccaccccg 3794281 caggcgtgca tatggtccct ctcgtcagct tcggctggag gataaccccg gcccttacta 3794341 gtcactgcgt gaacaacaac gggcctagct gccgcggccg cttttcgtag aaccgcgcac 3794401 gtgtcgggga tgttgtgccc atcgaccgga ccgatgtagg taaatcccat gttctcaaag 3794461 aggttcggcc ctcggggtgt gccgacgcga agttcttcta ggtgtgccgc aagagcccca 3794521 gcggtggggt cgtaggagcg gccattgtca ttgagcacga cgatcacggg ccgggtagcg 3794581 gcaccgaggt tgttcaggcc ctcccatgcc acgcccccgg tgagggcgcc atcaccgatc 3794641 accgcgatga cacgtcggtc gcattgcccc tgcagggcca atgctttggc gatgccgtcc 3794701 acccaggcga ggctgaccga ggcatgggag ttctcgaccc agtcatgtgg cgattcatgg 3794761 cggttgggat accccgatag accatcggcc tggcgcagcg tggcgaagtc tttaccgcgg 3794821 ccggtgagca gcttgtgcgg ataggtttgg tgcccggtgt cgaacaccga tgtcgtgtgg 3794881 cgaggtgaac acccgatgca atgcgatggt cagctctacc atgccaagtc ccgcgccgag 3794941 atggccaccg gtagccgtca ctgtttctat gagccgccga cgcatctgca cggccagctc 3795001 tggcagctgg ctttcgggca atgcctgcac atcgcaaggt ccgccgatcg cggtaattga 3795061 accgccccgg tgagtccgga gactctctga tctgagacct cagccggcgg ctggtctctg 3795121 gcgttgagcg tagtaggcag cctcgagttc gaccggcggg acgtcgccgc agtactggta 3795181 gaggcggcga tggttgaacc agtcgaccca gcgcgcggtg gccaactcga catcctcgat 3795241 ggaccgccag ggcttgccgg gtttgatcag ctcggtcttg tataggccgt tgatcgtctc 3795301 ggctagtgca ttgtcatagg agcttccgac cgctccgacc gacggttgga tgcctgcctc 3795361 ggcgagccgc tcgctgaacc ggatcgatgt gtactgagat cccctatccg tatggtggat 3795421 aacgtctttc aggtcgagta cgccttcttg ttggcgggtc cagatggctt gctcgatcgc 3795481 gtcgaggacc atggaggtgg ccatcgtgga agcgacccgc cagcccagga tcctgcgagc 3795541 gtaggcgtcg gtgacaaagg ccacgtaggc gaaccctgcc caggtcgaca cataggtgag 3795601 gtctgctacc cacagccggt taggtgctgg tggtccgaag cggcgctgga cgagatcggc 3795661 gggacgggct gtggccggat cagcgatcgt ggtcctgcgg gctttgccgc gggtggtccc 3795721 ggacaggccg agtttggtca tcagccgttc gacggtgcat ctggccacct cgatgccctc 3795781 acggttcagg gttagccaca ctttgcgggc accgtaaaca ccgtagttgg cggcgtggac 3795841 gcggctgatg tgctccttga gttcgccatc gcgcagctcg cggcggctgg gctcccggtt 3795901 gatgtggtcg tagtaggtcg atggggcgat cggcacaccc agctcggtca gctgtgtgca 3795961 gatcgactcg acaccccacc gcaaaccatc ggggccctcg cggtggccct gatgatcggc 3796021 gatgaaccgg gtaattagcg tgctggccgg tcgagctcgg ccgcgaagaa agccgacgcg 3796081 gtctttaaaa tcgcgttcgc ccttcgcaat tcggcgttgt cccgccgcaa gcgcttcagc 3796141 tcagcggatt cttcggtcgt ggtcccgggc cgtgcgccgg catcgacctg cgcctggcgc 3796201 acccacttac gcaccgtctc cgcgcagcca acaccaagta gacgggcgac ctcactgatc 3796261 gctgcccact ccgaatcgtg ctgaccgcgg atctctgcga ccatccgcac cgcccgctca 3796321 cgcagctccg gcgggtacct cctcgatgaa ccacctgaca tgaccccatc ctttccaaga 3796381 actggagtct ccggacatgc cggggcggtt caaatcaagt ccccgcgtcc gttgcgaatc 3796441 gtggttgtca ttgcgcgcga acctgtttgg gaaggccgaa tcgcaccgtc tcggtcgcta 3796501 tcgagcgttc caccacggtg atcgaggcgt atccgcgaag tgcatcaatc acctgcccca 3796561 ccagtcgtgg cggcgcggag gctcccgcgg tgacaccgat cgtcgagacc gacgacagcc 3796621 attcgggctc aatgtcatca ggcccgtcaa tcaagtaggc cggcgtccca cttcgctgcg 3796681 ccaactcgac cagacgccgc gaattcgacg aattgcacga gccaatcacc aacacaacgt 3796741 cacattcacc gaccatcgat tgcagcgcac gctgtctgtt cgtggtggca tagcagatgt 3796801 cttcagaggg gggttggccc aacgtcggaa acctcgcgcg cagcgcatca atgacatcgg 3796861 cagtttcatc aagtgccagg gttgtctggg tcagatacga tagctgggta ccctcgggca 3796921 ggttcaacgc tgccacatca gcgggtgtct gcaccaataa tgttgaccgc ggagcgacgc 3796981 caagcgtgcc ttcggtctcc tcatgtccgg cgtgcccgat gaagaccacc gtgtcaccgc 3797041 gcgcggcaaa ccgtgcggct tcagcgtgga ctttcgccac cagtgggcag gtcgcgtcga 3797101 cgacctgcag tccccgctca tcagcgcccg cgcgcaccgc cggggaaacc ccatgcgcgg 3797161 agaacaccac gaccgccccc ggcggcggcg gatcgggaat ctcgtcgaga tcctcgacga 3797221 acactgctcc ccggtcccgc aactcggcaa ccacaacagt gttgtgcacg atttgcttgc 3797281 gcacatacac cgggccttcg gccacgtcaa gcactcgctt gaccgtctcg atagcacgct 3797341 ctacaccggc gcaaaacgac cgcggcgacg ccaacagcac cgtgacttca cccgaagcgt 3797401 atccctgtgc gaccggtccc acgaacacct cagccatcag cactcccggc gacatatcag 3797461 ttgcgacaac gcgatcaggt ctggggatcg caccgcatcg ggcagtgccg caatagcagc 3797521 ctggatgcgt tcatcggcgc atcgctgcgc cacatgacca cccccggcca ccttgacaag 3797581 cgcggtagcc cgctcgacat cgcttgctgt cattgcggca ggtgcttgat agagggccgc 3797641 caattcggtc gccgcttcgg atcgcgagtt cagggcggca acaactggca gtgtcgcctt 3797701 acgtcgggca aggtcgttgc cgaccggctt tcccgtcaca ccagggtcac cccagatgcc 3797761 gatcagatcg tcgacgcatt gaaacgcaag acccaactca tggccaaaac gctccaacgc 3797821 agcaatcgtc gcgtcgtctg cattggccac taaagctccc agagcgcaac aacaaccggt 3797881 cagggcggcc gtcttgcccg cggccatccg cagatagtca tcgactgtaa cttcgggctg 3797941 tccctccaat aaacaatcct caaactggcc gatacacaag tccaggcacg acatctgcaa 3798001 tcgccttatc gccctgaccg ccacacactc gtcggtcagg ccggtcagta tccgaacggc 3798061 cgtggcgtgc aacgcatctc ccaacaggat cgcgacgccc acaccccaca cactccatac 3798121 cgtcggccgt cccctgcgag tcgcatcccc atccatcaca tcgtcatgca acaacgtgaa 3798181 gttgtgcacc aactccacag ccgccgacac cggagtagca tcaccgacat caccaccgca 3798241 agccgcggcc gccgcgtaga caagggcggc gcgaaaatac ttgcccgacg atcctgccgc 3798301 tgtggatcga tcggcgttcc accagccaag gtgatatccc gccatcgtcg ccaacggctc 3798361 gcgcatcgac tcaatggccc gatgcagcac agggccacaa tccgctcgag cccgttctaa 3798421 caatgctttc ccaaggtcag cagggacact ccccagaaaa gccgcatcca gagtcaatac 3798481 gcctcccatt cttaacctca ccggagcaac agtgagtcgc tattttcagc gaacgagcaa 3798541 tcggcgatat tgcttcactt cggagatacc caaatatttc aaatatcaac gcaacatgta 3798601 cctatgcccg tcgaccaaca cgaccatcag ggttgttagc aatgatctcg gaattcgagt 3798661 tgtccagacg ccccgggtca tccactacag aaagacacgc ataccctgcg gcgacctata 3798721 cttcccatca cggcgggtag gttgccttcg acaatactgc aacattcaat tgcctggcct 3798781 ttctcggagt atcttgcgga cttgaagctc acacatcggc cggcgtcgaa cgcctcacgc 3798841 tgcagagcag tttagtggat ttcatcagca tcggatatgc ataattgaaa ccacagcact 3798901 ttcataaaca gtgtccagat gatttacacc taatttgggc ggcgaatgct acgcaatggt 3798961 ggtgcgcttc ccaagggagc acaacgcgaa gctaaagcag ttgcacgccg agaccgagcc 3799021 gaaaggtcgc cctgcgggga aggcggccac gggagaattg tgagctcggc ggtcgaccac 3799081 gacgtacccg ccacgccgta gtaatgggca tttgtacatg tacattcgca cacaaggaga 3799141 ggtcttgacg tatctattcc ctctctgcgc gatcgcggcg gaggcggcgg caaccagcct 3799201 gttcaagggc agtttcgggg actttcgcgt ctgctcgccg ggtcacgacg gggcgatcac 3799261 ggccatgccg agcgtcttgg cggcgtcgcg catccggtcg tcgtaggtgc acaaccggcc 3799321 cagatcgacg ccgagccgct gcgccgtcgc caagtggatg gcatcgagcg tgcgcagctc 3799381 gaatggcagc agcccaccag cgagatcgag gacgcgcttg tcgacgcgca gcagatcgag 3799441 atgagccagc gcccggcggc cggctttccg cgctgattca cccttgtcaa gcagggcccg 3799501 catgacctcc gcgcgcgcaa gggcactcga cactcgcggg tggcgggtgc gaaggtagcg 3799561 gcgcagcgcg tccgactctg gctcgcgaac cgcgagcttg acgatcgcgg acgagtcgag 3799621 atagatggcc gccatcaacg ctcgtgctca cgcaggcgcg caagcgtcac cgacggcagc 3799681 tcgacgcccg cgtcgaggtc gagcggttcg ggcagatcaa cgacgtcgag cgtggcacgc 3799741 tcgatctcgc cgcttgccag cagctgctcg tatggaccgc cctgcggcag cggcgagagc 3799801 agggcgacgg gccggccgcg gtcggtgatc tcgatcgtct cgccggcctc gactcggcgc 3799861 agcagctcgc tggcccgctg ccgcagcgca cgcaccccca ccgaggtcat tgtgctaact 3799921 gtagcacaag cggtcggcgt catgggccga cgttcgactc gcgcaggctt taagtaacgt 3799981 cggtgttaat tactaggacc tgaaaaagtc ggcgcgttgt tcctcggttg gttggcgctg 3800041 agctgggagg atggcctcaa tgcccttgtt gcggaaggga ttgaggccat cgtgtttcgt 3800101 actgtaggcg atcaggcatc gttgtgggaa tccgtgctgc ccgaggagtt gcggcggctg 3800161 cccgaagagc tggcccgggt ggatgcgctg ctcgatgatt cggcgttctt ctgcccgttt 3800221 gtgccgttct tcgacccgcg gatgggtcgg ccgtccatac cgatggagac ctatttgcgg 3800281 ttgatgttct tgaagttccg ttaccggttg ggctatgagt cgctgtgtcg ggaggtcacc 3800341 gattcgatca cctggcggcg gttctgccgt attccgttgg agggatcggt gccgcaccca 3800401 accacgttga tgaagctgac cacgcgctgc ggtgaggatg cggtggccgg gctcaatgag 3800461 gcgctgctgg ccaaggcggc cagcgaaaag ctgttgcgca ccaacaaggt ccgtgccgac 3800521 accaccgtgg tggagggcga tgtgggctat cccaccgaca ctggactgct cgccaaggcg 3800581 gtcggctcga tggcgcgcac cgtggcgcgg atcaaagccg cggacgcggg atcggcgccg 3800641 ctcggtgggt cgtcgggccc gcgcgatcgc ctccaagctg cggttacgcg gcgcgcagca 3800701 acgcgatcag gcgcaggcct tcgtgcgccg gatcaccggg gagctagccg ggatcgccga 3800761 gcaggcgctg accgaggctg ccgcggtggt acgtaacgcc caacgtgcgg tgcgccgcgc 3800821 cagtgggcgg cgcaaagcct ggctacgcca ggccatcaac catctcgaga agctgatcgg 3800881 acgcaccgag cgggtggtgg accaggcccg tagccggctg gccggggtaa tgcccgactc 3800941 aagcagccgc ctggtcagtc tccacgatgc cgacgctcgc ccgatccgca agggacgatt 3801001 gggcaagccg gtcgagttcg gctacaaggc ccaggtcgtc gacaacgccg acggtgtcat 3801061 cctggaccac agcgtcgagc tcggaaaccc cgcagatgca ccgcaattgg cacccgccat 3801121 cgaacggatc agccgccgca ccggacgccc accacgggca gtgaccgctg atcggggctg 3801181 cggagacgca tcggtcgaag atgatctcca ccagctcggg gtgcgcaacg tggccatccc 3801241 acgcaagagc aaacccagcg ccacccgccg cgcattcgaa caccgacggg cattccgcga 3801301 caagatcaaa tggcgaaccg gatccgaagg acgcatcaac cacctcaagc gcagctacgg 3801361 ctggaaccgc accgaactca ccggcatcac cggcgcccga acctggtgcg gacacggcgt 3801421 cttcgcccac aacctcgtca agatcagcac cctggcagcg tgacagacac ccgcgcccac 3801481 cccgaccacg ccacgcaggt cgcccagccc gccgccgtca atgcaaccgc gactttttca 3801541 ggtcttagta attagtggcc gccgctttgg gtccaccggg gccctgcggc gaaacaccag 3801601 acgtgatgcc gtgatcggcg atacccttcg acccattgaa gggagaacag ccatgtcgtt 3801661 tgtgatcgcg aaccccgaga tgctggcagc ggcggcgacc gatttggccg gcatccggtc 3801721 ggcgatcagc gccgcgaccg cggcggccgc ggccccgacg atccaggttg ccgcggccgg 3801781 cgccgacgag gtgtcgctgg ccatctcggc gctgtttggc cagcacgccc aggcctatca 3801841 ggcgctcagc gcccaggcga cgatctttca cgaccagttc gtgcaggccc tgacctccgg 3801901 cggcaacctg tatgcggccg ccgagagcca caccgtcgag cagatggtgc tcaacgcgat 3801961 caacgcgccc acccagacac tgttcggccg cccgctgatc ggcgacggcg ccaacgggac 3802021 cgcggagaac ccggacggcc aaaacggcgg cctgctgttc ggcaacggcg gcaacggctt 3802081 tacccagacg accgccgggg tggccggcgg caacggcggc agcgcggggt tgatcggcaa 3802141 cggcggggcc ggcggcggcg gcggggccgg cgccgccggc ggcctcggcg gcaacggcgg 3802201 gtggctgtac ggcaacggcg gggccggcgg catcgggggc gcgggcaccg gaaccggtgg 3802261 tcacggcggg gccggcgggg ccggcggccg ggcctggctg tggggcaccg gcggggccgg 3802321 cggagccggc ggtgacggcg gctggttgtt cggcgacggc ggggccggcg gcaccggcgg 3802381 caacggcggc agcggcttta acagcttgac ctcttcggtc ggcggcgccg gcggggccgg 3802441 tgggcacgcc gggctgttcg gcgccggcgg gaccggcggg accggcggca tcggcgggca 3802501 aaacaccgag accggcccgg ccgccagcaa cggcggcgcg ggcggcgccg gtggcggcgg 3802561 cgggtacctg gtcggcgatg gcggcgccgg cgggaccggc ggggccggcg ggaagaattc 3802621 cagcggtggc gccaccctca ccgggggcac cggagggacc ggcggggccg gcggggcggc 3802681 cgggtggctc tacggcagcg gcggcgccgg cggtgccggc ggcgccggcg ggctcaacaa 3802741 cgccggtggt gccaccggcg gcaccggcgg taccggcgga gccggcggct ctggagcgtg 3802801 gctgtacggc aacggcgggg ccgccggggc cggcggcaac ggcggcaaca ataccagcgc 3802861 cggcaccggt ggtgtcgggg ctagcggcgg gaccggcgga aacgccgggc tgatcggcgc 3802921 cggcggccac ggcggggccg gcggcgccgg cggaaaccaa accggtggcg tgggcaacgg 3802981 cggggccggc gggaacggcg gcgccggcgg ggccggtggt cagctgtacg gcaacggcgg 3803041 ggacggcggc aacggcgggg ccggcggggc caacatcgcc ggcggcaatg gcagcgacgg 3803101 cggcgccgcc ggccacggcg gggccggcgg gagcgcccgg ctgatcggag ccggcggcca 3803161 cggcggggac ggcggcgccg gcgggaacac cgccggcaga agggccgacg cgatcgccgg 3803221 caccggcggg gacggcggca acggcgggaa tggcggcttg ctaagcggca acgccggggc 3803281 cggcggccac ggcggggcgg gcgggagcag caccgcgacc accaccaccg gaacaccccc 3803341 aacgggtgca acgggcggca atggcggcaa cggcggggcc ggcggcacgg ccgggtttac 3803401 cggcagcggc ggcatcggcg gcaacggcgg ggccggcggc accggcggta acgccggtgt 3803461 cgccttgtcg gttggcagca cgggcggact gggcggtaac ggcggcagcg ggggcctcgg 3803521 cggcggcggc gggtcgctct tcggcaatgg cggggccggc ggtgtcggcg caaccggcgg 3803581 aaacggcgga agcggtatcg ggcccgccag cgtgggtggc aacggcggca agggcggcgt 3803641 tggtgcggcc ggcgggcttg ccgggcagat cggcaacggc ggtagtggtg ggtccggcgg 3803701 tgccgggggc aacggcggga ccggcgatac cgccggcaac ggtggcaatg gtggtgccgg 3803761 cgcggtcggc ggcaacgccc agctcatcgg caacggcggc aacggcggtg gcggcgggaa 3803821 cggcggaacc ggcgccgacg gcacctaagg cccgcgagca gacgcaaaat cgcccaattt 3803881 cgtgccgaat tgggcgattt tgcgtctgct cggcgcagct aacccgccac gtactccacc 3803941 gcgccgtcgt cgagcaccac ccgggcctcg gcgccgtcgg agccggccac ctcggtgcgg 3804001 aacaccgccc ggcccggctc ggtgcgccag atcaccgtcg acagcgtctc gccgggaaac 3804061 accggcttgg tgaaccgcgc ggcgatcgag gtgatgttgg ccgccacacc gccgccaagc 3804121 tcggccacca gcgcccggcc cgccaccccg taggtgcaca acccgtgcag gatcggcttg 3804181 ggaaacccgg ccagctgcgt ggcgaaccag gggtcgctgt gcagcgggtt gcggtcaccg 3804241 gagagccggt agatcagcgc ctggtcctca cgggtcggca tatcgattcg ggcgtcgggg 3804301 tggcggtccg gaaattccgg cgcggccggc cgctcacccc gcgctcctcc gaaacccccc 3804361 tgaccccgaa gcaccaacgt ggtaagcgtt tcggcaacca acgaacccga ttccgggtcg 3804421 caaccgcggc cgcgcagcac aacgatggcg ttcttgccct cccccttgtc ctggatgtcg 3804481 gcgacctcgg tgaccaccga cagttttccc gccgccggca gcggcgcatg cagccggatg 3804541 ccctgggagc cgtgtagcag cgccgccggg ttgaatgttc ccacctttgc ggccgcacca 3804601 aacgccggac agcaaatcac cgcatacgtc ggcaacactt gctggtcgat gccgtggctg 3804661 ttctccgtgg tgaacgccag atctccggtc ccggcgccca ccccgatcgc gtaaagcagc 3804721 gtgtcccggt cggtccactc gaacaacatc ggctcggtca ctgcacctat ggagttcgga 3804781 tcaatcgcca tgcaactctc ctcccggttg gaaaatcatc gcaagccctt cccccggacg 3804841 gtatcgacag ggcaggctat cgccatggcg aagcgcaccc cggtccggaa ggcctgcaca 3804901 gttctagccg tgctcgccgc gacgctactc ctcggcgcct gcggcggtcc cacgcagcca 3804961 cgcagcatca ccttgacctt tatccgcaac gcgcaatccc aggccaacgc cgacgggatc 3805021 atcgacaccg acatgcccgg ttccggcctc agcgccgacg gcaaagcaga ggcgcagcag 3805081 gtcgcgcacc aggtttcccg cagagatgtc gacagcatct attcctcccc catggcggcc 3805141 gaccagcaga ccgccgggcc gttggccggc gaacttggca agcaagtcga gattcttccg 3805201 ggcctgcaag cgatcaacgc cggctggttc aacggcaaac ccgaatcaat ggccaactca 3805261 acatatatgc tggcaccggc agactggctg gccggcgatg ttcacaacac tattccgggg 3805321 tcgatcagcg gcaccgaatt caattcccag ttcagcgccg ccgtccgcaa gatctacgac 3805381 agcggccaca atacgccggt cgtgttctcg cagggggtag cgatcatgat ctggacgctg 3805441 atgaacgcac gaaactctag ggacagcctg ctgaccaccc atccactgcc caacatcggc 3805501 cgcgtggtga tcaccggcaa cccagtgacc ggctggaggc tggtggaatg ggacggcatc 3805561 cgtaacttca cctgaccgcg cggttgacgc ttaccgccgc tgaccgccac gattgaccgc 3805621 atgcggtacg tcgttaccgg cggtaccggg tttatcgggc gccacgtggt atcccgtctc 3805681 ctggacggcc gacccgaggc acggctgtgg gcgctggttc gccgccagtc gttaagccgc 3805741 ttcgagcgcc tcgccggcca gtggggtgac cgggtaagac cgctggtcgg tgatctcacg 3805801 gagctcgaac tgtccgagcg gaccatcgcc gagctaggcg atatcgacca tgtgctgcac 3805861 tgtgcggcgg tacacgacac cacctgggcc gacgccaccc gcgccgtcat cgagctggcg 3805921 gcacgccttg acgccacgtt tcatcacgtg tcgtcgatcg cggtggccgg agacttcgcc 3805981 ggccactaca ccgaggccga cttcgacgtc ggccagcgcc taccgacccc gtatcatcgg 3806041 atgacattcg aggccgaacg gctggtgcgc tccacgcccg gcctgcgcta tcgcatctac 3806101 cgcccggcgg tggtggtggg tgattcgcgc accggcgaga tggacacgat cgacggaccc 3806161 tactacttgt tcggggtgct ggccaagctg gcggtgttgc cgtcgttcac cccgatgctg 3806221 ctgccggaca ttgggcgcac caacatcgtg ccggtcgact atgtggccga cgcgctggtg 3806281 gcgctcatgc acgccgacgg ccgggatggg cagacgtttc atttgaccgc gccgacagca 3806341 atcggactgc gcggcatcta ccgcgggatc gccggcgcgg ccggactgcc cccgctactc 3806401 gggacgctgc ccggctttgt ggccgcaccg gtgctcaacg cgcgcggccg cgccaaggtg 3806461 ctgcgcaaca tggcggccac ccaactggga attcccgccg agattttcga cgtcgtcggc 3806521 tgcgcgccca cgttcacgtc cgacacaacc cgggaagcgt tgcgcggcac cggcattcac 3806581 gtccccgaat tcgccaccta cgcgcccggg ctgtggcggt attgggccga gcacctcgac 3806641 cccgaccgcg cgcgtcgcaa cgatccgctg ctgggccgcc acgtcatcat caccggtgcg 3806701 tccagcggca tcgggagggc atcggcgatc gccgtcgcca aacggggtgc gacggtattc 3806761 gcgctggccc gcaacggcaa cgcgctagat gagctggtca ccgagatccg cgcccatggc 3806821 ggtcaggcgc acgcattcac ctgcgacgtc accgattccg cgtcggtgga gcacaccgtc 3806881 aaggacatcc tgggccgttt cgaccacgtg gactacctgg tgaacaacgc cggccggtcg 3806941 atacgccgct cggtggtcaa ctccaccgac cggctgcacg actacgagcg ggtgatggcg 3807001 gtcaactact tcggcgcggt gcgcatggtg ctggcgctgc tgccgcattg gcgcgagcgc 3807061 cggttcggcc acgtcgtcaa cgtctccagc gccggcgtgc aggcccgcaa tcccaagtac 3807121 agctcgtatc tgcccaccaa ggccgcgctg gacgcgttcg ccgacgtggt cgcctccgag 3807181 acgctgtccg accacatcac gttcaccaac atccatatgc cgctggtggc caccccgatg 3807241 atcgtgccgt cgcggcggct caacccggtg cgcgcgatca gcgccgaacg cgcggcggcg 3807301 atggtgatcc gcggactcgt ggaaaagccg gcgcgcatcg acactccgtt gggtacgctc 3807361 gccgaagccg gcaactacgt cgcgccacgg ctgtcgcgcc gaattctgca ccagctctat 3807421 ctgggctatc ccgattcagc tgcagcgcag gggatttcgc gtccagacgc ggaccgccca 3807481 ccggcgccgc ggcgtccccg gcgatccgcc cgcgcgggag tcccgaggcc gctcaggcgc 3807541 ttggggcgac tggtgcccgg tgtgcattgg tagtcacttc tggcaggtga actggttgac 3807601 gtcgatgtat ccgatgcgaa acatctcggc gcagccggtg aggtacttca tataccgctc 3807661 gtagacttcc tcggattgca gcgcgatggc ctggcccttg ttggcctgca acgccgcgga 3807721 ccagaggtcg agggttttcg catagtgcgg ctgcaacgat tgaactctgg tgacggtgaa 3807781 gccgtttgcg ctggcacact cctgcaccat cggtatcgag ggcagccgcc cacccggaaa 3807841 gatctcggtc acaatgaatt tcaggaaacg agcgaaggtg aacgacatgg gcaggccgcg 3807901 ttcgtggatc tctttcggat gcaacccggt gatggtgtgc agcagcatga ccccgtcagc 3807961 gggcagcagg cgatgcgcca ggctgaagaa cgcgtcgtag cgctcgtgac cgaaatgttc 3808021 gaaagcaccg atgctgacga tgcggtcgac gggctcgtca aactgttccc agccggccag 3808081 cagaacgcgt ttggagcgta gattttcgga gttggcgacc agctgctgaa cgtggttggc 3808141 ctggtttttg ctcagggtca gaccgacgac gttgacgtcg tatttttcca ccgcgcgcat 3808201 catggtggcg ccccagccgc agccgacgtc caacagtgtc atgcccggct gcaatccgag 3808261 tttgcccagc gcgagatcga tcttggcgat ctgcgcctct tgcagcgtca tgtcgtcgcg 3808321 ctcgaagtag gcgcagctgt aggtctgagt gggatcgagg aacagccgga agaagtcgtc 3808381 ggacaggtcg tagtgcgcct gcacgttggc gaagtgcggc ttcagctcgt cgggcattgg 3808441 gatagcgtat cgtcgtcgcg gtgagcgtcg tattcgccga cgtcgacacc ggcatcgacg 3808501 acgcgctggc cgtgatctat ctgctggcca gtcccgacgc cgatctggtc ggcatcgcct 3808561 cgaccggcgg aaacatcgcg gtaggtcaag tgtgcgcgaa caacctgagc ttgctcgaat 3808621 tgtgcggtgc cgcagacatc cccgtgtcca aaggcgccga tgagccgctc ggcggccggt 3808681 ggcccgatca cccaaagttt cacggcccca aggggatagg ctatgccgag ctgccggcca 3808741 gcaatcgccg gctcaccgat tatgacgcca cgacggcctg gatcgcggcg gcgcactccc 3808801 acgccggcga cctgatcggt ctggtcaccg gcccgctgac caacctggcg ctggcgctgc 3808861 gcgccgaacc cgcgctgccg aggctgctgc gccggctggt gatcatgggc ggcatgttcg 3808921 acggccagcc gatcaccgaa tggaacatcc gggtggatcc cgaggcggcc agcgaggtgt 3808981 tcaccgcgtg ggccggacaa cgacaactgc cgatcgtgtg cggtttggat ctcacccggc 3809041 gggtcgcgat gacaccggac attctcgccc ggctggcgtc cgtctgcggc tcgtctccgg 3809101 tgatgcgggt gatcgaggac gcgctgcggt tctacttcga gtctcatgag gcgcgcggac 3809161 atgggtacct ggcatatatg cacgacccgc tggccgccgc ggtcgcaatg gacccggaac 3809221 tcctgacgac ccggaccgcg acggtggatg tcgacccgac gggggcgacg gtcaccgact 3809281 ggtccgggaa gcgaaatccc aacgcgcgga tcggcatgag cgtcgatccg gcggtgttct 3809341 tcgaccggtt cgtcgaacgg atcggacgat tcgcgcgccg aacgtgaact gacggcggga 3809401 ttttcccgaa attctcgccc tgacgtcacg ttcggcgcaa gtcattcgta gcttccctcc 3809461 agataccacc gccgctgccg gtagcacagc agcaacgcgg tgccgggatc gccgtccagc 3809521 aatacctgag cgcgcgcggt gcggccactc gcccgatccg gatcccacca ccgctcgtcg 3809581 tccggccacg gtccggccca ccagcgcagc cgatcgtctc ggccacgaac cctcagccgc 3809641 gccgggtccg cggagaacat cccccggctg gtcacccgta tcgggtttcc ttgggcgtca 3809701 agcaagtcca ccggatcgtc gaacagcacc gccggcgacg ggtcgggcaa cctgccgggc 3809761 cacggctgac cggggtcggc ctgcggcacc ggctcagggg ctactaggcc cagcacggtc 3809821 aacgtgatgc gttcggccgg gccgtgtccg ccggatagca ccggcacccg cacggcctcc 3809881 ggaccgagca agccctgcac ccgcaccagc gcccgacggg cccgaagcct gtcctgttca 3809941 ccgagcccgc cccatagcgg caactgcaag ccttccgatg cggacaccgt ctccaccgcc 3810001 tgcagccgca gcagagtcac cgccgcggtg ggccggtcac gagcattccg gttgttcaac 3810061 cacccgtcca gttgccagcg cacccggtcg gcggtggcgt cctcggtcag cggctcggcg 3810121 caccgccaca cccggctgcg ctcttcgccg ttggcggtga cggcatgaat ggccagccgg 3810181 gtgcagccca ctccggcggc catcagcgcc cgatgcagct cggcggccag cgagcgcccg 3810241 gcgaacgccg cggcgtcgac ccggtcgatc ggcggatcgc atgccagctc ggcggccaga 3810301 tccggcggcg gctcccgccc gcagggcgcc cgttccggtt cgccgcgggc gaaccggtgc 3810361 gcggccaccg cgtcggcacc gaacctggac gccacgtcgg tacgagacag cgcggcgaac 3810421 tgtccgatgg tgcgaatccc catcctccac aacagatccg tcaggtcgtc ccggcccggc 3810481 ccggacaggc tcggctcggt ggcaagttgg cggatcgaca gcagcgacag aaaccgcgca 3810541 tcgcctcccg gctccacgat gcggccagca cgcgcggcga aaaccgcggt agacaaccgg 3810601 tcggcgattc cgacctgaca ctccgcgccg gccgcggcca ccgcgtcgat cagccgctcg 3810661 gccgccatct gctcggaccc gaaaaaacgg gccggcccgc gcaccggcaa caccaggagc 3810721 ccgggccgca gcagctcggc gcggggcacc agatcgtcta ccgccgcgat caccccttcg 3810781 aagagccggg cgtcgcggtc ggcgtcggca gtcgctataa acagttgcgg acaccgcgcc 3810841 gccgcctccc gacgccgcaa ccctcggcgc accccggccg cccgcgcggt cgccgagcag 3810901 gcgatcaccc ggtttgccaa cgtgaccgcg accggggccg tcgcggatag gcccgcggcc 3810961 gcggccgccg cgaccgcggg ccagtccata caccagatcg ccagcacgcg agcggaggcc 3811021 atcaccgtcc acgcccgttg atctgcagcc gcaccccact gatccgcccc aaccccgggg 3811081 tgggcacgcc cctgagggcc ggggtgatct catagccgca gacccgggcc gcaagccgcg 3811141 tcgacacgcc ttgccagtcg ccgtcggtga ccagcagggt gcagcctttt tgacgggcac 3811201 gggccaccac tgcccgcgcc cgcgcccgcg tcacccggcg ccctcccaga ccgagcacca 3811261 ccagatccat gccgtcgatc agcacagcgg ccacctcaac cggatcggtc ccgggatctg 3811321 gtatcaccgc gagccggctc agatccgccc ccatctccac cgcggccagc aacccgatat 3811381 ccggctggcc aacgatggcc gcgtttcccc cggccgccgt caccgatgcc accatgctca 3811441 gcagcagtga ccgcgcaccc gacagcactc ccaccgtccc cgggggcaac gacaccggtc 3811501 ccgccggcac caggtcgccc gaacggctgg gccccccgga caccttctcg gacagcaaag 3811561 ccatctgccg tcgtagtgat tcgagctgct cagcaccatt ttcaaggcgt tggtcggagg 3811621 cgaaggccgc agtcatgacc agcctcctgt tcgaaaatat gttcgaagtc agtaaacacc 3811681 cgtccttgga gtccgtcaag gtcatgagag gctgccttgt gcaatcgcgt aaaaccacct 3811741 cggtactggc ggctgccctg ctgttttgcg gcctgttagg cccagggacg gccccaccgg 3811801 ccaccggtgg cgggcctgcc tgccggccgg cagagctctt cgccaccgac aacaccaccg 3811861 atgggttcga gctaccggcc gttgcgacta tcgcactaac cggcacggtg gtgaccggat 3811921 cgaccctggt cgacggcgtg ttctggtcga atgagcgcca gcagatcggc tacgagcgct 3811981 cccgtgaatt tcatctgtgc gttgtcgacg cgcccacatt gcacaacgcc gccgaggcac 3812041 tgcaccgcca gttcaaccaa gaagcggtgc tgaccttcga ctacttgccg cagaatgcac 3812101 ccgaggcgga cgcgatcctc atcaccgtgc ccgacatcgg catcgcccgc ttccgcgatg 3812161 ccttcgcatc tgatttggct gcacaccacc gattacgggg cggatctgtc accacagccg 3812221 accacacctt aatcctggtc gccggcaacg gcgatctcga tgtcgcccgc cgactcgtcg 3812281 aggaggccgg cggggactgg aacgcaacca ccattgccca tggcaggcgt gaattcgtga 3812341 actagctgat caagggcgct ccgctggcca cccgagccgg gttggtcaca ttagttagtc 3812401 acagcaatct ctgggccggc gggcacaacg cgtattcatc ccgacagata ccaatgtgtc 3812461 gcctgtgaca aaagccgggc ctggctaatg ctggccgccg ctactcccac tcgatggtgg 3812521 cgggcggctt gctggtgatg tccagcacca cgcggttgac ctcggcgacc tcgttggtga 3812581 tccgggtcga gatgcgctcg agcacctcgt agggcacccg ggtccagtcg gcggtcatcg 3812641 cgtcttcact cgacaccgga cgcagcacaa tcgggtggcc ataggtgcga ccgtcaccct 3812701 gcacacccac cgagcggaca tcggccaaca gcaccaccgg acactgccag atctggttgt 3812761 ccaggcccgc cgcggtcagc tcctcacgca cgatcgaatc ggcgtgccgc agcgtatcca 3812821 accgcttggc ggtgacctcc ccgacgatcc gaatacccaa ccccggtccc ggaaacggct 3812881 ggcgcgccac gatctcctcc ggcagaccca actcccgccc gaccgcgcgc acctcgtctt 3812941 tgaacagcag ccgcagcggc tcaacgaggg tgaacttcag gtcgtcgggc aggccgccga 3813001 cattgtggtg gctcttgatg ttcgcggtgc cgctgccccc gccggactcc accacatccg 3813061 gatacagcgt gccctgcacc aggaactcag cagtcttacc gtccagcaca tcccgcaccg 3813121 cgccctcgaa cgcgcggatg aactgacggc cgatgatctt gcgtttgccc tcgggggcgc 3813181 tcacgcccga cagcgcctcg aggaaggtct cggccgcgtc gacggtgacc aggttggcgc 3813241 cggtggcggc cacgaaatcg cgttgcacct gcgcccgctc accggcgcgc aacagcccgt 3813301 ggtcgacgaa gacacaggtc aaccggtcgc cgatggcccg ctgcaccagg gccgcggcca 3813361 ccgcggaatc cacgccgccg gatagcccgc agatggcgtg gccgtcgccg atctgggtgc 3813421 gcacctgctc gatcagcgcg ttggcgatgt tggcgggcgt ccactgggcg ccgagcccgg 3813481 cgaagtcgtg caaaaaccgg ctgagcacct gttgcccgtg tggggtgtgc atcacctccg 3813541 ggtgatactg caccccggcc aggcgccggt cgaaggcctc gaaggcggcc accggggcac 3813601 cggcgctgct agccaccacg tcgaatccgt ccggcgcggc cgtgaccgcg tcaccgtgac 3813661 tcatccatac cggctgaacc tcgggaagat ccgaatgcag tttgccacca aggactttca 3813721 gttcagtccg accgtattcg cgagtgccgg tgtgggcgac gatccccccg agcgcctgcg 3813781 ccatggcctg aaacccgtag cagatgccaa gaaccggtac accgaggtcc agtagcgccg 3813841 gatcgagttt cggagcgccg tcggcgtaga cactggccgg tccaccggaa agcacgagcg 3813901 ccaccggctg acgggccctg atctcctcga tcgaggcggt gtgcggaatc acctcggaga 3813961 aaacccgtgc ttctcgaacc cgacgggcaa tcaactgggc atattgggca ccgaagtcga 3814021 ccaccaacac cggtcgagcc ggtgtctcag gcacgtcgat gtcagcaggc tgcaccacgg 3814081 ccagtcagtc tagtggctgg ggtgactccc gaggtcggcc ggtagcggtc catgggccgg 3814141 tccgcaggtt accgaagagg ccagtgctgc cgccgccact tgggccttct tcagtcccga 3814201 cagagagatt cgccgatcgt agacgaccgc cggcgatgct ctgatcaagg cgagctgacg 3814261 gcggtagatg ccagacatgg ccgcacagca ggcagcgctg cggcggtcga ggtgtggaat 3814321 cagccgcagt cccagcgaat accagtctgc ggcgcggtcg gcactgaacc gcagcagtgc 3814381 cgcgagccgt ccgtcggggt catcgagtgc cccggtgtcg tccaggcgga ggcgtacgcc 3814441 taatcggtcc agctcgtcgc gcggcaggta gatccgtcca ttcaaaaagt cctctcgaac 3814501 gtcgcgcaga atattggttt gctgcagagc gattcccaac tgctcggcgt atcgcgacgt 3814561 cgccgtgctg acgggtccaa agatggaaag acaaagcttt ccgatcgtgc cggccccccg 3814621 gcggcagtag acgatcagct cgtcgaaatc gcggcaacca gtccagtcga tttccatacg 3814681 ggcgccgtca atcaactctg cgaacatcgc gatcggcacc ggaaaccggc gagccgcgtc 3814741 agccagcgca accagcaccg gatcggatga atcatcaata ttatcaagtg atttcctgat 3814801 ggcatcgagc tcggtgatct tggtctcggg ggccagctcg ccgtcggcga cgtcgtcgat 3814861 ccggcggccg agcgcataga ccgcagatag tgccgctcgc ttttcgcgcg gcaagagtcg 3814921 gatgccgtag tagaagtttc tggcggccgt gcgcgtgatc gactcggtga ttcgatacgc 3814981 ctgttcgatc tcggtcatgc cgtcctccaa ctacggtgtt ggtcagtcac gcctgacgat 3815041 cgacgatgta gtgagccaaa tcctgaagct cagcggccgg gcgatcggga atgccgatgc 3815101 gcgccaccat gtcgatgcct tgcgttacgt gtcggcgggc ctccgcgctt gcccacctgc 3815161 gccccccacc gcactcgatc agttctgcga ccgctgcgag ctcatcatcg gacgctgtct 3815221 ggctgcccgt ctcgtccacc agccacgctg cgaggcggcg gccggccgaa ccgccgtgcg 3815281 ccacggtcca ggtaacgggc agagttttct tgcgggagcg aaggtccgag tacaccggct 3815341 tgccggtgat ctcaggacgg ccccaaatgc cgagcaggtc gtcgaccaat tggaaggcaa 3815401 gtccaatgtg acgaccgtag gcaaccaacg cttctcgcac cgaacgcggt gcgccagcga 3815461 gtaacgcgcc gacctcggcg ctggctgcca tcagtgctgc ggtcttgcct tcagccatct 3815521 tgagacactc atcgagtgcg acgtcggttc ggctttcgaa cgcggtgtcg gcggcctgcc 3815581 cacggatcaa ctcacgggtg gcttccgaaa tcgcgcgcag cgccgcaccg acgtgtggtg 3815641 aatcgcaatc cagcaggacc tcgtgcgcca gcgacagcat cgcatcaccg gccaatagcg 3815701 ccatcgcatc gccccacagt gcccacaccg tcggccggtg ccgacggtgc tcgtcgcggt 3815761 ccatgaggtc gtcatggacg agcgagaagt tgtgcaccag ttcaaccgag acggctccgg 3815821 gaatcgccga gtgggggtcg gcgccggcgg cttcggcggc gacaaacacc aaagcaggac 3815881 ggattgcctt gccgcagttg ttgttcactg gacggccgcg ttcatcagac cagccgaggt 3815941 ggtaggacac gacgggccgc atgtggggat cgaggcggtc agccatctgg cgcagcgtcg 3816001 gtgtgatgag ttcgtgtgcg agtcccaaaa cgggaagcgt gcgacgggtc atacggtcgc 3816061 tgtcgggttg cggtggcagt ccgtactttt cgtcggtacc gcgcattgcg tgaatctagc 3816121 attcgctcat ggcacggccc atgggcaagt tgcccagcaa tacgcgaaaa tgtgcacaat 3816181 gtgcaatggc ggaggcacta ttggagatcg ctggtcagac tattaatcaa aaggaccttg 3816241 gcaggagcgg acggatgacg cgtaccgaca atgacacttg ggatctggcc tccagcgtgg 3816301 gggcgaccgc cacaatgatc gccaccgccc gggcgttggc tagcagggcc gaaaaccctt 3816361 tgatcaatga tccattcgcc gagccgctgg tgcgcgccgt cggcatcgac ctgtttaccc 3816421 ggctggccag cggcgagttg aggcttgagg acatcggcga ccacgccacc gggggtcggt 3816481 ggatgatcga caacatcgcg attcggacca agttctacga tgactttttc ggtgacgcaa 3816541 ccacggcggg tattcggcag gtagtgattc tggcggctgg gctcgacacc cgcgcgtacc 3816601 gactgccctg gcccccgggc acggtggtct acgagatcga ccagcccgca gtcatcaagt 3816661 tcaagacacg ggccctcgcc aatctgaacg ccgaacccaa cgcagaacgg cacgccgtgg 3816721 ccgtcgatct gcgaaacgat tggccgacgg cgctgaagaa cgccggcttc gacccggcca 3816781 gaccgacagc cttcagcgcc gaggggttgc tgagctacct gcccccacag gggcaggacc 3816841 gcctgctcga tgcgattacc gcgctcagcg cccctgacag ccggttggcc acccagagcc 3816901 cactggtgct cgacctggcc gaggaagatg agaagaagat gcgcatgaaa tccgcggccg 3816961 aggcatggcg ggaacgcggc tttgatctgg acttgaccga gctgatctac ttcgatcaac 3817021 gcaacgacgt ggccgactac ctcgccggct ccggctggca ggtcaccacc agcaccggca 3817081 aggaactctt tgcggcccaa gggctgccgc ccttcgcgga cgaccacata actcggttcg 3817141 ccgaccgccg ctacatcagc gcggtgctga agtaggtggc cccggcacta tagccgggcc 3817201 taactcgtag gcttggtacg cgggcagagc cgccaggcat ggcgaactgg tatcgcccga 3817261 actatccgga agtgaggtcc cgcgtgctgg gtctgcccga gaaggtgcgt gcttgcctgt 3817321 tcgacctcga cggtgtgctc accgataccg cgagcctgca taccaaggcg tggaaggcca 3817381 tgtttgacgc ctacctagcc gagcgagccg agcgcaccgg cgaaaaattc gttcccttcg 3817441 accctgccgc ggactatcac acgtatgtgg acggcaagaa acgcgaagac ggcgttcgat 3817501 cgtttctgag cagccgcgcc atcgaaatac ccgacggttc cccggatgac ccgggcgccg 3817561 ccgagacggt gtatggcctg ggcaaccgca agaacgacat gttgcacaag ctgctgcgcg 3817621 acgatggggc ccaggtgttc gacgggtcgc ggcgctacct ggaggcggtc acggccgcgg 3817681 gtctcggtgt ggccgtggtg tcttcgagcg ccaacacccg cgacgtgctc gcgaccaccg 3817741 gtctggaccg gttcgtccag cagcgggtgg acggcgtgac gttgcgcgaa gagcacatcg 3817801 ccggcaagcc ggcccccgac tccttcctgc gcgcggcaga actgttgggg gttacccccg 3817861 acgcggcggc ggtgttcgag gacgccctgt ccggggtggc ggccggccgc gccggcaact 3817921 tcgccgtagt ggtgggcatc aaccgaacgg gccgggcggc tcaggccgcc cagttgcgcc 3817981 gccatggcgc cgacgtggtg gtaaccgatc tcgccgagct gctgtagggc atgatcgggc 3818041 gatgatcacc gaggacgcct tccccgtcga accgtggcag gtccgcgaga ccaagctcaa 3818101 cctgaacctg ctggcccagt ccgaatccct attcgccttg tccaacgggc acattggatt 3818161 acgcggcaac ctcgacgagg gcgaaccctt cggactgccg ggcacctacc tgaactcttt 3818221 ctacgaaatc cggccgctgc cgtacgccga ggccggttat ggatatccgg aggccggcca 3818281 gaccgttgtc gacgtcacca acggcaagat ctttcgcctg ttggtcggcg acgagccgtt 3818341 cgacgtccgg tatggcgaat tgatctccca cgaacggatc ctcgacctgc gcgccgggac 3818401 gctgacccgc cgcgcgcact ggcgctcacc ggcgggcaag caagtcaaag tgacgtccac 3818461 ccggctggtg tcgctggccc accgcagcgt cgcggcgatc gagtacgtcg tcgaggcaat 3818521 cgaggaattc gttcgcgtga ccgtgcagtc cgaactcgtc accaacgagg acgtaccgga 3818581 gacctcggcc gacccgcggg tgtcggccat cctggacagg ccgctacagg ccgtcgagca 3818641 cgaacgcacc gagcggggtg cacttctcat gcaccgcacc cgagccagcg cgctgatgat 3818701 ggccgcaggg atggaacacg aggtcgaggt tcccgggcgg gtcgagatca ccaccgacgc 3818761 ccgcccggac ctggcccgaa ccaccgtgat ctgcgggctg cgcccgggac agaagctgcg 3818821 catcgtcaaa tacctggcct atggctggtc cagcctgcgc tcccgcccgg cgctgcgcga 3818881 ccaggccgcc ggcgcgctgc acggtgcccg ctacagcggc tggcaggggc tgctggacgc 3818941 gcaacgcgcc tacctcgacg acttctggga cagcgcggac gtggaggtcg agggcgaccc 3819001 ggaatgtcag caagcggtgc gtttcgggtt atttcacctg ttgcaggcca gcgcgcgcgc 3819061 cgaacgccgc gcgatcccca gcaaggggct caccggaacc gggtatgacg gccacgcctt 3819121 ttgggacacc gaaggtttcg tgctaccggt gctcacctac accgcaccgc atgcggtcgc 3819181 cgacgcgctg cggtggcggg cgtcgacgtt ggacctggcc aaggagcggg cggccgagct 3819241 cggcctggaa ggtgccgcct ttccctggcg gaccatccgc ggacaggagt cctcggccta 3819301 ctggccggcc ggcacggcgg cctggcacat caacgccgac atcgcgatgg cgttcgagcg 3819361 gtaccgcatc gtcaccggcg acggttcgct ggaggaggaa tgcggccttg cggtgctgat 3819421 cgagaccgcc cggctgtggc tctcgctcgg gcaccacgac cgccacggcg tctggcacct 3819481 cgacggggtc accggtcccg acgagtacac ggcggtcgtc cgcgacaacg tgttcacgaa 3819541 tctgatggcg gcgcacaatc tgcacaccgc cgccgatgct tgcttgcgcc accccgaggc 3819601 ggcggaggcc atgggtgtca ccaccgagga gatggccgcc tggcgcgacg cggccgacgc 3819661 cgccaacatt ccctacgacg aggaactcgg tgtccaccag cagtgtgaag ggttcaccac 3819721 ccttgcggag tgggatttcg aagccaacac cacttatccg ttgctactgc acgaggccta 3819781 cgtgcgcttg tatcccgcac aggtgatcaa gcaggccgac ctggtgctgg cgatgcagtg 3819841 gcagagtcac gcgttcacgc ccgagcagaa ggcgcgcaac gtcgactact acgaacggcg 3819901 catggtgcgc gactcgtcgt tgtcggcctg cactcaggcg gtgatgtgcg ccgaggtcgg 3819961 ccatctcgag ttggcccacg actatgccta cgaagccgcc ctgatcgacc tgcgcgacct 3820021 gcaccgcaac acccgtgacg gcctacacat ggcttcgctg gccggagcct ggacggcgct 3820081 ggtcgtaggc ttcggcggcc tacgcgacga cgagggcatc ctgtccatcg atccgcagct 3820141 gcccgacggc atctcgcggc tgcggttccg gctgcgatgg cgcggcttcc ggctgatcgt 3820201 cgacgccaac cacaccgacg tcaccttcat ccttggcgac ggtcccggca cccagctgac 3820261 catgcgccac gccggccaag atctgacgct gcacacggac acaccgtcca ccatcgccgt 3820321 gcgcacccgt aagccgctgc tgccgccacc accgcagccg ccaggccgcg agccagtgca 3820381 ccgccgggct ttagcccggt gacgatacgg gccgcgtagc ggcccgagga ggagccgggc 3820441 aatcggctta gcccggtgac gatgcgggcc gcgtagcggc ccgaggagga gccgggcaat 3820501 ccagcctgag cccggtgacg atgcgggccg cgtagcggcc cgagaaggag ccgggcaatc 3820561 ggcttagccc ggtgacgatg cgggccgcgc tgggggcacc atccgcttgc ggggacgcgt 3820621 ctgcgtctac ctgggcggca ccggtgaacg tctcattcac cgcgcacctc cgcttcctgc 3820681 acggcggcga cgacccgggc aacgtcatcc ggggccatgt ggtcgtggac tggcagcgac 3820741 acgattcgcg agcaaatgtc cgccgtgacg gctagatcgg tcgactcgac taactcggca 3820801 ttcgtcacaa agtacggatg tcggtgctgc ggtgggttgt agtagtcgcg cgcctcgatc 3820861 gcgtgcctac gcaggctacc cagaaccgcg gccttgtggt cggcggacgt gcagcaagcg 3820921 ctcgcgaaac agagcgacgc aacattggcg ttgtcctgga aacgcacacc cgcgtcggcc 3820981 ataccggtgc gatagcactc gaggaccttg cggcgacttg ccaggcggcg atcaagcccg 3821041 actagttggc gtaggccaat agcggcgctg atctccgaca gcttgccgtt cattccgagc 3821101 tggatggact cgcgtgtttg caccaagccg aagttctgga acttgtatgc gtgctcgacg 3821161 agccgtggat cgcgagaaac cagagcgccg ccctcaccaa ccgcgaacgg cttggtcgca 3821221 tggaaggaga agatctcgca tgcaccgcgt ccaccgaggc gctcgccgtc ggcgtacgtg 3821281 gagccgaagc cggccgccga gtcgagcaca atcggtagct cccattcggc ggcgagctcc 3821341 tcccagacgc tgatctgggg attgccgacg ccgaacacat tggccagcag gatgccggcg 3821401 atccggtcgc ggaagcgttc gatgacggcg cgggcggagt ggacgcatgg ctgccatgtg 3821461 ttggcgtcga tgtcgatgaa ccagggacgg tacccagtcc atagcgcagc ctgagccacg 3821521 ccgacgaacg tgaacgacgg catcagcagg tagcggtccc gcgtaccggc gccgaaactg 3821581 acgtggagcg ccgcgaggag tgccagggtg ccgttggcga gggtagcaac gtgcagatga 3821641 ggtcccagat agtcgcgcag ggcgcgggca aaccgccgct cgttcggacc gaagttcgtg 3821701 taccagttag cctgggcgat ctgtacgaag tcctcggcga gctcggctgg cccgggaaag 3821761 ctcgggcgga tgaaggggat cttggggatc gtcgagccac ccggctcgag ttttaacatg 3821821 gacgtgcctg gggtcgcgcg tactgcggac ggcggctcca gcaccgagcc ggataacgtt 3821881 cggatcttca tatatgcaga gctcaaggtc cgtttgcagc gcgtcgggac agtttccgca 3821941 gcgcacttcg tcgcaccatc gttggcatcg gcgcctgaag cagttaccgc gagaaccgca 3822001 tcatgtcgaa cttgaggtta gccttacctc ttaagaatgt cacccccggg gagtgacccc 3822061 ggctgtccac gcgtgggcga cccggcacgc acggtgcacg ttccctggtg tctcacccgc 3822121 ctcattcgtc ccggcgccac agggctagcg atatggccgc ctcgcgtagt cggtccgggt 3822181 cggtacgcgt ggggcagatg aggaaggtgc ggcgcattga agcagcctgt caagcccggc 3822241 tcggtgaccg gcacggcgcg ttcagcatgg ggccagtgcg acgggctgga gcgaagcaag 3822301 caggctggcc gccagcgatt tcgccagcga ccggacgcgc ggtgcgctct cgacgtgcca 3822361 aaaacggatc ttgggagtga aattgccgcc gactaggggc ccgatgacgc aaaaacctgg 3822421 gctggcctcg aagtcgtcgt taaccagaag gccacggttg gtgcggttcg ggcggcacag 3822481 cccgttctgc atcgcgctga ccaggaacgg cgaggaacac gtgtccagct cctcgaaacc 3822541 gccacaattc accaccgcag cgaaggggac ggggtgggta tgctcggctc ccgcggctcg 3822601 gtaggtcatg gtggcgaacg gctggccgga cgcgcaggca tccacgcgca gtacttcgcc 3822661 ggcgagcagg ctcagcgtgc cgtccgcggc tagctcctcg gatgcctggc ggcaatcgcg 3822721 tcccgcacgc cgcaccaact tggtgaagtt catgccgtgc acgcagaaga actcttcctg 3822781 ctgcacgaga tccatcttgt gcagcgcctg cccaaacagg gcggcaacgg cgtcgtacaa 3822841 atcggccagg ttcaacgagc gttcttcggc cgtcgcgaga tcgtcgcgga tcgcggacat 3822901 gagatccgcc gcggcgatcg cttccgtaca gagcagcgtg cgcagccgcg ggaagtcaaa 3822961 ctccggcggc tgattgcaga tcatgtaggg cagcacgccg gagcgcgaga tgacggtgat 3823021 ggaccggacg cgtgcgcgga tgcgcgcgtc gtgacgcatt aggtagagcg cttccagcga 3823081 ggtggcgttg gaacccacga ccagtacgtt gcgcttctcc cacgactcga cgcggtcgag 3823141 cgaatcgcgc agtcgcgcaa cgttgctctc cccgccgggg gagtagaaat cgttgatata 3823201 ggtgaatgcg ggttcggaat cgctcgcaag gatggctttg gtcggggggc tgccaatggc 3823261 cacaaccact ttgcctgcag caattgccgt tggaccgttt ccagacgggc ggaggccgat 3823321 tcggtagtgg ccgtctgcgg agtgggcgct catggcctca gcgcggatgg tgacgatttc 3823381 ggccaggtca cgctcgccga gcgcggcgat ggcggcaatc atctgctccg acagaaatac 3823441 accgaagaga aaccgcggca ggtagagctc cccccactgg ttgccgtcca atgcgtcgcg 3823501 gttgtcgcag atccagcggg ccgcggccgc accgccctct gcctggaaga acgccagcca 3823561 gcgctgcttg ttctgctcca gccagatccg gtaggcggcc ttttccggct cgtcggcgaa 3823621 atcgtcgagc ttctgaatgg ccagcgatcc gatgctggag cgttggccat aggggattcc 3823681 gcaccagaac tgctcgtctc gctccaccac cgcgatgcgc aacttgggcg atgccgaggg 3823741 gctgctcagc agggcatcgg ccatttccag cagagtcata gagcacgcgg ccccgctgcc 3823801 gatgaacgca acgtcgaagg taggtggagt gatcatagtc atcaaataag ggaaggctaa 3823861 cataacctcg aggcggtggt taggcttccg cgggcttctc cggttcgagc acgacgcgga 3823921 caaacacctt gcggcctgac gcatcgacga accaagcgtt gcggaaatca tcatgggtca 3823981 acgcgcgcag gcgattcagg aaatgcccaa acgttccgcg ctcgttcagg tctagccgcc 3824041 ggagttgttc gaaatccttt ttcaggttga ggttgccctc ggtggccggc gatttagccg 3824101 tgtagctgcc gtcccggatg gcgtcgaaat gttccagcac caactcacgc tcgatgtcca 3824161 tcagccgggc gtagacactt cccgaggaat cccacgactc gatcgcgcat tcccgctggg 3824221 cgatgatcgg accatggtcc aactgatcgt cgatctcgtg gatcgtcacg ccgacttttt 3824281 gcccgtcgat gatcgagaag acctggggaa accagccgcg gttgtagggg ttgaaacccg 3824341 gatgaacatt cacacacctg accccatcga tcaaagcggc gggaaacctc tgtttacagt 3824401 ggaaggaaag gacgaggtca taccgctcca cgatttccgc gacgcgctct gcgacatcac 3824461 atcgcgggac acccggcagc tggccgatgg gggactgata gacgtccata tcgccatgcc 3824521 tggcctgcag atcgaccgcc agagcatggg cgtggacgtt gtcggtcagg atcaatatcg 3824581 tcacgactcg cccccgccag cctgcccagc gcccatcagc ggagccccca acaccagctc 3824641 accctacttc agggccgacg cataccggac ggccacgctg gggccagcgc agggacatca 3824701 gtcagtgcgg ttccaggatc cgggctaccg catcgttcac ggacagccgt agttcgtcac 3824761 cggtcagctc gtcgataccc gtcgccgagc gcagcatggg cgcaaagagc cgccaaccga 3824821 attgcagcgc aagggcgtgc gcgaccgcca gccgcgcgcc caagtcgctg tcgtagcgag 3824881 gccgtaccgc gtcgagcagc tccgcaacat tgggaaatcg ctgttgcagc tggcccacgg 3824941 gatatccgtc cagcagtgcc cgggctaaga cccgcccatg tcggtcgaga gcccgttcga 3825001 tgatgtcagc gggcgcctcg gagtgcaaca gtctggtcag cttcgtgccc aggtgatcga 3825061 gcacggcccc aaccagttgg tccttggtgc cgaagtgacg aaacaccagc ccgtggttga 3825121 ccttggatcg agcggcgatg tcgcgaatcg acgtcgcggc tggcccacgc tcggcgaaca 3825181 ggtcggtggc ggcctgcagg attgcggccg ctacctcttc ccgcccagtg ggcatcttgc 3825241 ggcggtcggt tgccggacgc gtagtcatcc ggctacagta accgatgtag tcatctgact 3825301 acactaacca ttcattgagg acgccagcaa tgacagatct gattaccgtg aagaagctgg 3825361 gcagccgtat cggcgcccaa atcgacgggg tgcgcctcgg aggcgatctg gaccccgccg 3825421 cagtcaacga gattcgcgcg gcactactgg cccacaaggt ggtcttcttc cgcggtcagc 3825481 accaactcga tgacgccgag cagctggcgt ttgccgggtt actgggcacc ccgatcggcc 3825541 acccggccgc gatcgccctc gccgacgatg caccgatcat cacgccgatc aactccgagt 3825601 tcggcaaggc gaaccgctgg cacaccgacg tcacgttcgc cgccaactat ccggccgcct 3825661 cggtactgcg cgcggtctcc ctgcccagct atggcgggtc gacgttgtgg gccaacaccg 3825721 ccgcggccta cgcggagctg cccgagccgc tcaagtgcct caccgaaaac ctgtgggcgc 3825781 tgcacaccaa ccgctatgac tacgtcacga ccaaaccgct gaccgcggcg cagcgggcct 3825841 tccgtcaggt gttcgagaag ccggacttcc gcaccgagca tcccgtggtg cgggtacacc 3825901 cggagaccgg tgagcgcacg ctgctagcgg gcgacttcgt gcgcagcttc gtcgggttgg 3825961 acagccacga atcaagggtg ttattcgaag tgctgcaacg gcgaatcacc atgcccgaaa 3826021 acaccatccg ctggaactgg gcgccgggcg acgtagccat ctgggacaac cgggccaccc 3826081 aacaccgggc gatcgacgac tacgacgacc agcaccggct gatgcaccgg gtcaccttga 3826141 tgggcgacgt gcccgtcgac gtgtacgggc aggctagccg ggtgatcagc ggggcgccga 3826201 tggagatcgc tggctgatca accagtaagc gcaacgcaat tatgtagcac catgcgtgct 3826261 accgttgggc ttgtggaggc aatcggaatc cgagaactaa gacagcacgc atcgcgatac 3826321 ctcgcccggg ttgaagccgg cgaggaactt ggcgtcacca acaaaggaag acttgtggcc 3826381 cgactcatcc cggtgcaggc cgcggagcgt tctcgcgaag ccctgattga atcaggtgtc 3826441 ctgattccgg ctcgtcgtcc acaaaacctt ctcgacgtca ccgccgaacc ggcgcgcggc 3826501 cgcaagcgca ccctgtccga tgttctcaac gaaatgcgcg acgagcagtg atctatatgg 3826561 acacctcggc cctgactaag ctgctcatct ccgagcccga gacgaccgaa ctgcggacat 3826621 ggctgaccgc gcaaagcggc cagggcgagg acgcggcgac aagcaccctt ggccgggtcg 3826681 agtcgatgag agtcgttgcc cgatacggac aaccaggcca aactgagcgt gcgcgttacc 3826741 tactcgacgg gctcgacatc ctcccgctca ccgaaccggt gatcggtcta gctgaaacga 3826801 tcggaccggc caccctacgt tctctcgacg cgattcacct cgcggccgca gcccagatca 3826861 agcgggaact gacagccttc gtcacctacg accaccgatt gttgagcgga tgccgtgagg 3826921 tcggcttcgt caccgcctca cccggcgcag tccggtgacc atatccaacg accgcacgct 3826981 tcctgatgcc tcagcccgcg ttgctgaccg gatcgatcgg caaccaccgc agcgcgcccg 3827041 gggcgtcggc aggcaccacc gggtgcgccg gttggatcgg cgccagccga cgatagggct 3827101 caccctgcgg tggccgccgg tcggtttcgc ccttgttcgg ccacagcgag gcggcccgct 3827161 cggcttgagc ggcgatggac agcgacgggt tgacacccag gttcgccgag atcgccgcac 3827221 cgtcaaccac gtacagcgtc ggatagccat agacccggtg ataggggtcg atgacgccgt 3827281 gctcggggtc gtcgccgatc accgcgccgc cgagaaagtg cgcggtgagc gggatgttga 3827341 acagctcacc ccaggtgccg ccggccacgc cgtcgatttt ggcggcgatg cgacgggtga 3827401 cctggttgcc gatcgggatc catgtagggt tcggctcgcc gtgtccctgc ttgctcgagt 3827461 accagcggat acccagcttc ccgcgcttgg tgaacgtggt gatcgagttg tccaggtgct 3827521 gcatgaccag cgcgatcacg gtgcgctcgc tccattgccg gggattgagc atccggatgg 3827581 tgccgcgcgg atcctgactg gcggtctgca gcaactgcct ccagcgcggc acatcggtgc 3827641 cctgcggacc ggagccgtcg gtcatcaagg tctgcagcag ccccatcgcg ttggagcctt 3827701 tgccgtagcg cacgggttcg atgtgggtgt cggccgtcgg gtgaatcgac gacgtgatcg 3827761 ccacgccgtg ggtcaggtcc aggtccggat tgaccttcaa ggtggcggcc ccgacgatcg 3827821 attctgagtt ggtgcgggta aggacaccca atcgcttcga gagaccaggg agccgacccc 3827881 tatcccgcat cttgaacagc agatgctggg tcccccaggt gcccgcggcc agcaccagct 3827941 gcgttgcggt gaaggtgcgc cgatcccggc gcagccaact gccggttcgc actgtgcgga 3828001 cctcccacaa cccgtcggac cgccgctcaa accccttcac cgtggtcatc ggaatcactt 3828061 gcgcgccagc tgattccgcg aggccaaggt agtttttcac cagggtgttc ttggcaccgt 3828121 ggcgacagcc cgtcatacag cagccgcatt ccaggcagcc ggtgcgcgcc ggcccggcac 3828181 cgccgaagta gggatcgggc acggtcttgc cgggcgtctt ggtgccgtcg gggccgaaga 3828241 acactccaac cggggtcggc acccaggtgt cgccaaaccc catctcgtcg gcgacctcct 3828301 tgacgatgcg gtcggcgtcg gtgaaggtcg ggttttgcac caccccgagc atccgctgcg 3828361 cctgctggta gtgcggcatc agctcgccac gccagtcggt gatgtgtgac cactgctggt 3828421 cggcgaagaa cggctccggc ggcacgtaca acgtgttggc gtagttgagg gagcccccgc 3828481 ccaccccggc gccggccagg atcatcacgt tgcgcagcgg gtggatacgt tgaatgccat 3828541 agcagcccaa cctcggcgcc cagagaaact tgcgcaggtc ccacgacgtc ttggcgaact 3828601 cctcgtcgga gaaccggcgg ccggcctcca gcacgccgac ccggtagccc ttttccgtca 3828661 gccgcagcgc ggtgacgctg cccccgaaac ccgatccaat aatcaggacg tcgtaatccg 3828721 gcttcatcgc tgcagtatga ccccctttac atcgggccag ttaatcagtc tctcaggtgg 3828781 cgtcagcccc caacggtcag gccgaccttc tggaactcct tgaggtcgca atacccggcc 3828841 ttggccatcg atcggcgtag cccaccgacc agattcaggc cgccgaacgg gtcgtccgac 3828901 ggcccgccca gcacccgcgc cagcggcggc cgctcgccga ccgcgatctg cagcaacgcc 3828961 ccccgcggca acgacgggtg cgccgccgcg gccggccaga accatccctc gccgagcgcc 3829021 tcggccgatt cggctaacgg ggtacccagc accaccgcgt cggcgccgca ggcgatggcc 3829081 ttggccaact cgccggaagt gtggatgtcg ccgtcggcca acacgtgcac gtagcggccg 3829141 cccgtctcgt cgaggtagtc gcgccgcgcg gcggcagcgt cggcgatcgc ggtcgccatc 3829201 ggcacgctga tgcccagcac ctcgtcggtc gtcgtcaccc cctgggtgga gccgtagcca 3829261 acgatgacgc cggcggcgcc ggtgcgcatc agatgcagcg cggtgcggtg gtcgagcacc 3829321 ccgccggcga cgaccggtat gtcgagctcg gagatgaagg tcttcaggtt gagcggctcg 3829381 ccgtcgctgg cgacgcgctc ggcggagacg atggtcccct ggatgaccag caagtcaata 3829441 ccggccgcaa ccagtaccgg tgtcagccac tgggcgtttt gcgggctcac ccgcaccgcg 3829501 gtggtcaccc cggcctcgcg gatgcgagcc accgcggcac ccaacaggtc gggatttagc 3829561 ggtgccgcgt gcagctcctg cagcaaccgg atcgccgtcg acggttcggg gtcggccgct 3829621 gcagcttcca agagttgggc gatttttgcc tcgacatcga ggtggcggcc gatcagcccc 3829681 tcgccgttga gcacgcccag cccgcccagc cggccgagct cgatcgcgaa ctccggggac 3829741 accagggcat cggtggggtg tgccaccact gggatctcga accggtaggc gtccagctgc 3829801 caggccgtgg agacgtcctt cgacgagcgg gtgcgccgcg acggcacgat gctaatctcg 3829861 ctgagttcat aggtgcggcg ggcggtgcgg cccatgccga tctcgaccat ctagatatcc 3829921 aggtcgccgt tagcgcgcgt agtagttggg cgcctcgacg gtcatcgcga cgtcgtgggg 3829981 atgactctcc ttgaggcccg cgggtgtgat ccggacgaat tgcgcctgct gtagcacctc 3830041 gatggtgggc gacccggtgt agcccatcgc ggcgcgcagg ccaccggtca actggtggat 3830101 caccgacgac agcggaccac ggaacggcac ccgcccctcg atcccttccg gcaccagttt 3830161 gtcttccgac agcgcgtcgt cggcgaagta gcgatccttg gaatacgacg tcgccccccc 3830221 acgccctcgc atggcaccca gtgatcccat gccgcgataa ctcttgtact gcttgccgtt 3830281 cacgaagatc agctcaccgg gcgcctcggc tgtgccggcc agcagcgagc ccagcatggc 3830341 cgtcgacgca ccggcggcca gcgccttggc gatgtcgccg gagtactgca gtccgccgtc 3830401 ggcgatcacc ggcacgccag caggacgaca agccgctaca gcttccaaga tcgccgtgat 3830461 ctgcggcgcg cccaccccgg ccaccaccct cgtcgtgcag atcgaccccg gccccacgcc 3830521 gactttcacc gcgtcggctc cggcgtcgac cagggccgcg gccgcggacc tggtggcgac 3830581 gttgccgcct accacctcaa cccggtcgcc gacttcggac ttgagtttgc ccaccatgtc 3830641 gagcaccaac cggttgtgcg cgtgcgcggt gtccacgacc agcacgtcga ccccagcgtc 3830701 gaccaacatc atggcgcgca cccaggcatc gccgccgacg ccgacggccg cccccaccag 3830761 cagccggccg tcgctgtcct tggtggccag cgggtgttgc tcggtcttga cgaagtcctt 3830821 gacggtgatc agcccggtca gccggccgcg gccgtcgacc acgggcagct tctcgatctt 3830881 gttgcggcgc aacaggccca gcgccgcgga cgcactgaca ccctcttgag cggtgatcag 3830941 cggggctttg gtcatcacct cggcgacctg cttggactgg tcgacctcaa accgcatgtc 3831001 acggttggtg atgatgccca ccagcgcacc gtcgtcgtcg accaccggca acccggagat 3831061 ccggaaccgg gcgcacagcg catcgacctg ggccaaggtg ttgtccggcc ggcaggtgac 3831121 gggatcggtg accatgccgg cctcggatcg cttcaccatc tcgacctggc cggcttgctc 3831181 ggcaacgggc aggttgcggt gcaacacccc catgccaccc gcccgtgcca tcgcgatggc 3831241 catacgcgac tcggtgacgg tgtccatcgc cgagctgacc agtggcacct tgagcctgat 3831301 cttcttggtg agctggctgg aggtatccgc ggtggcgggc accacgtcgg aagccgccgg 3831361 caacaacaag acgtcgtcga atgtcagccc cagcatcgcc accttgtgcg ggtcgtcgcc 3831421 gccggtgggc accgggtcag tagtcaggcc gcccatgcga acgtacgggc tgaccaccag 3831481 gtcggagctg tcttccaggc cggacatgcc acgggacatc ggtggggccc tccatacgca 3831541 tgttttcagt gagaagccca tcctatcggc tcgtaaccgc ccggtgacga tgcgcgccgc 3831601 agcgctggcc gagaagaacc ggacaatcac accgcgacga ggctgcgcca gcgtgtggtc 3831661 agcccgacac gaagcgagaa ctcaatttct ggcgttatca ccgcgtgctt gcgtagtgta 3831721 gaggggtgcg cgaccacctg ccgccgggtt tgccgcccga tccgtttgcc gacgacccct 3831781 gtgacccgtc ggccgcactg gaggcagtcg agcctggcca gcccctcgat caacaagagc 3831841 ggatggccgt cgaggccgac ttggccgatc tggccgtata cgaagctctg ttggcgcaca 3831901 agggaattcg tggacttgta gtgtgctgcg acgagtgcca gcaagaccac tatcacgact 3831961 gggacatgct gcgttccaat ctgttgcaac tgcttatcga cggcaccgtc cgcccgcacg 3832021 agccggccta cgatcccgaa ccggactcct acgtcacctg ggattactgc cggggatatg 3832081 ccgatgcttc gctcaacgag gcagcaccag acgcggacag gttccgccgc cgctgatcgc 3832141 gctcgctagt gcgtcggact caccggcgtt tccggtgctg gctgccccgc cggattcgtc 3832201 gcctcgtcgg cgggttccaa cgaggggtca attgagcccg gctcgggttt tgatgacggc 3832261 gtgctggggc tggctgccac cgtggacgtg gagttcggca tggggctctc cgagacgcct 3832321 gccgacatcg acggttcagc tgccgatgca ggcgtcgggg gggtcggcgg ctcgacgact 3832381 ggagccagcg gagtccacga gtttcccacc gaacccggag ccgcagggtt ggacggcgag 3832441 ccgggccgca gcgtggcgtt cgggtcgcgc gtctccacct tggtattcag caggttcacc 3832501 tcgttgatca ggtcctgccg gcggctaccg tcagtcacgg cctgcacggt gctgctgacc 3832561 tcagccagct catcctgcgc ctcggcccat tggccttggg caatcatttg ctcgaccttc 3832621 gccagattgg ccttggccga cagcacgatc tgatcgtcgc tgacccgcga tcggttgaac 3832681 atcatcgcgt gcaggccgta caacaggtcc ccggggcgag catcggccac cacggcgccg 3832741 aacccgctca gcaccaacag cgccgcggcc accgacccga cggccgccag gctgcgacga 3832801 gcccgtcgcc gttgcgctac cccggcgcgc aacgcggcga cggcctcgtc ctgtgaaacc 3832861 agggcactgg ccggcggcca cctcaagtcg tcgcgccact gtccgagcag ggcggccaac 3832921 gcgtcatcgc gaggatccgc gaagtcaacc tcctcccgtt cggcgagtgc gtcgagcagc 3832981 agatcggtgc gggccagctc atccaatggc ggccgatcgc caaggggatt accaaattca 3833041 cgcatagtca cctgccgcaa caatctcgtc cttcagccgc tgaagtgcac ggtgttgggc 3833101 cacccggacc gcccccgtgg tgctgccgac ggcggcggcg gtctcttccg cggacaggcc 3833161 gacgacaaca cgcagaatga ggatctcgcg ttgcttggcc ggcaagatct caagcaattc 3833221 gttcatccgg gtgaccgaat cggcctcgat ggccatctgc tccgggccgg cgtcggctga 3833281 ccagcgctca ggaagcgttt cggcgggata ggcccggtca cggccggctg cccgatgggc 3833341 gtcggcaacc ttgtgcgccg cgatgccgta cagaaacgcc aggaatggcc ggccgcggtc 3833401 ccgatagcgc ggcagcgccg ttatggtggc caagcacacc tcctgtgcca cgtcatctgc 3833461 tgacaggccg ctccgctcga ccgtgccgac tcgcgctcgg caatatcgca cgacgatcgg 3833521 gcggatggtc tccagcacct cccgaagcgc gttccggtct cctgccacgg cctccgcaac 3833581 cacagcgtcg agacgttccc cttgcattgt catcgacggc gatatctcca acgttacgaa 3833641 gcggacacat cccgggctaa ctcccggatc gaccataacg gcccaaccgc gttttaagcg 3833701 gtacgccagc atccaccggc gcgccgcacc tggcctgcgc aaatattgcg tattttggtg 3833761 agttcgcgca gctgttgtgc tgaaaacgtg acggtgccga tatcgatcag caagcatgcc 3833821 agcgcccacc gcagcggcaa gagcccaaac cgggcggtgg catcgagagc ctcttcaccg 3833881 acggcgcgtg ctcgcgcgac ggcgccagca ctgcacagcg cggcggccaa caccacgtcg 3833941 cttttgacgc ggtggcgcgc cgacgcgacg gccatggcct gcgtcagctc gaccgcttcc 3834001 tcggcatggc ggacagcagt tgcgccgtcg ccggtggcca tcgccaactc ggcggccacc 3834061 caccgccgac gcaccgccag gcggtccgcc acgagcgggg acaccaccaa cggatccgcg 3834121 cgatctaaca atgcccccgc ggcggcgaag cggccgacgc caagcgcatc ggccgccagc 3834181 ccgatcagtg catcggcacc agcttcccga tcggcgccgg ccaacgccaa ggcacgacca 3834241 tcccagccgc gcgccagcgt gtgccaacca agctgccgca acaacgatcc ctgcgtacta 3834301 tgcgccagcg atgccaacgg gcccgccggc accaggcgtc gcagcaccga caggtcgcca 3834361 taggcgtggg cgtaacgacc ctgcccacca gcggccacgg cgcgcaacca caagtggtgc 3834421 ggcgtgatcg ccgtcggtag cggccagctg cccggctggt ttccgaaggc ggcagcgacc 3834481 aacacttgct caaccaccgg agcgtgagga gtttcattca ccgtgatagc cgtgccttca 3834541 tcagtaaaaa gttggtggtt tcttcgttaa cggcatatta ctcacagctt tctttgcgct 3834601 aatttaggcg tactcacagc atgggatgac ctgggcaaat acctcatcta tccgcccggg 3834661 atagcatgcg gcgcaggcgg cgaatgcggc gcagatgaac gcagagttaa ttctcacgca 3834721 acggtccgat attgcacgcc aacggacgcc tattgacgga aattcggcag cgcccctagc 3834781 gtctatcctt gacggtagtc atcggtgacg ccactccact tcagttgcac aactcgcgcg 3834841 tccgcgaacc caacctccac ttgggcgtgt cgtgcagaga gggaatcagc aatgccacag 3834901 ccggagcagc taccgggacc caacgcagac atctggaact ggcaattgca aggcctgtgt 3834961 cgcggcatgg actcatcgat gttcttccat cccgacggcg agcgtggccg tgcccgaacg 3835021 cagcgcgaac aacgcgccaa ggaaatgtgt cggcgctgcc ccgtgatcga ggcgtgccga 3835081 tcccatgcgt tagaggtcgg tgagccctat ggcgtttggg gtggcctgtc cgaatccgag 3835141 cgcgacctac tcctcaaggg caccatggga cgcacccgcg gcatccgccg cacagcttaa 3835201 gccgcgcgag cagacgctaa agcccccgca cgctcggcgt gtcgggggct tttgcgtctg 3835261 ctgaccggag ttcagtgcgc gtgcccgtgg tgatggtcgt gatcttctgc cttggccggc 3835321 ttgtcgacca cgaccgtctc ggtggtgagt accatccggg caaccgatga cgcgttcaac 3835381 accgccgacc tagtcacctt gaccgggtcg atgacgccgt cagcggccaa gtcaccatag 3835441 ctcagggtgt tcacgttcag cccatgcccg gcgggtagct cgctgacctt gttgaccacc 3835501 accgagccgt ccaagccagc gttggcggcg atccagaaca acggcgcggc aagggcttcg 3835561 gagaacacgt cgacaccgag gacctcgtca ccggtcagcg acgcacgcag ttcggtcagc 3835621 gccttgcggg cctggtggat gagcgaggct cccccaccag ggacgatgcc ctcctcgacc 3835681 gcggccttgg cggccgcgac cgcatcctcg acgctttcct tgcgctcctt gagtgcggtc 3835741 tcggtggcgg cacccacctt gatgacagca accccgccgg ccagtttggc cagccgctcg 3835801 ccaagctttt cccgatccca atccgaatcg ctcttgtcga tctcggcacg caagtgcttc 3835861 gcccggttgg ccaccgcttc tgcggtgccg ccgccgtcga caatgaccgt gtcgtccttg 3835921 ctgaccacca cgcgtcgggc cgagcccagc acctccaagc ccacctcgcg cagcaccatg 3835981 ccggcgtcgg ggttgaccac ctggccaccc gtcaccaccg ccaggtcctc aaggaacgcc 3836041 ttacggcggt caccgaagta cggccccttg accgcgaccg ctttcaacgt cttgcgaatc 3836101 gcgttgacga ccagcgtcgc caacgcttcg ccctccacgt cttcagccac gatcagtagt 3836161 ggcttacccg ttcctgcaac cttttccagc aatggcaaca gatcgggaag cgagctgatc 3836221 ttgtcttggt gcagcaggat caacgcgtcc tcgagcaccg cctgctggtt atcgaagtcg 3836281 gtaacgaagt atgccgacaa gaagcccttg tcgaagccga taccctcggt gaactccaac 3836341 tcggtgccca gcgtcgagga ttcttcgacg ctgaccacgc cgtcgtggcc gaccttgctc 3836401 atcgcttcgc caaccaggtc accgatctgc tcgtcgcgcg aggacaccgt cgccacctgc 3836461 gcgatgccgg tcttgccgga caccggcgtg gccgatgcca gcagcgcctc ggataccgcg 3836521 tcggcggcct tgccgattcc cacgccgagc gcgatcgggt tgacgccggc ggccactagc 3836581 ctcaggccgc ccttgatcag tgcctgcgcc aagatggttg cggtggtggt gccgtcaccg 3836641 gccacatcgt tggtcttggt ggccaccgac ttcaccagct gggcgcccaa gtcttcaaac 3836701 ggatcttcca gctcgatctc acgtgccacc gtgacgccgt cgttggtaac cgtgggtccg 3836761 ccaaacgcct tggccagcac cacatgccgg ccgcgcggcc ccagcgtcac ccgcacggtg 3836821 tcggccagct tgtccatgcc gacctccatg gcgcgacgcg cggtttcgtc gtattcgatc 3836881 agcttgctca tcaggctcct ctacgcaggg ctagtccgct aacgcatgcc gccccggaaa 3836941 tcacccgtgg tgagcacggg gatcgccggg gcggaacacg ctctactact tggaaacgac 3837001 ggccagcacg tcgcgtgccg acaggatcag gtattcctcg ccgttgtact tgatctcggt 3837061 gccgccgtac ttgctgtaga tgacggtgtc accctccgca acgtccagcg ggatccgctt 3837121 ctcgccgtcc tcgtcccacc ggccagggcc gacggcaacg acggtgccct cctgcggctt 3837181 ctccttggcg gtgtcaggaa tgaccagacc ggacgcggtc gtggtctcgg cctcgttggc 3837241 ctgcacgaga atcttgtcct cgagtggctt gatgttcacc ttcgccacga ttggagccct 3837301 ccactatttg gatcagagcc cgggacgctc gcccggaccg gagttggcgg tcggtccggg 3837361 gcgtgccccg gaaccgtccg aattaccagg tgattcggca ttcgtccgcg ccctcgcgcc 3837421 gtcgtcgcgg gtgccgacgc aggggttagc cgattgccat ctagcactct atacatgaga 3837481 gtgctagcac tcaagggcgc ccccttgctt cctggttgcc agcgtgtccg ggtacgccag 3837541 gtgcaatgtc cgggtcaccg cacctgcccc tgcatcacgg gcagacccgg gtcactgggc 3837601 acgtccagcg gcgacggcgg cgctcccgcg gccaccagct gcgcggcgaa cgccgcgatc 3837661 atcgccccgt tgtcggtgca tagccgggga ctggggatcc gcaacgtccg gcccgcctcg 3837721 ccgcagcgct gtgtggccag ctctcgcagc cgggagttcg ccgccactcc ccccgcgatc 3837781 agcagcgttg agacgcctag cgcagtggcg gcccgtaccg ccttcatggt caacacgtcc 3837841 gcgacggcct cctggaatcc ggcggcaatg tcggcggtac ggaagcccgg gtcagccgcg 3837901 tggctttcca cataccgcgc gacggccgtc ttgagcccgg agaagctgaa cgcatagcgg 3837961 tcatcggccg ggccactcat gccgcgcggg aaaacgatgg cgtcccgatc accggtgcgc 3838021 gccaggtcgt cgagcgcctt gccacccgga tagcccaatc ccagcaaccg ggccaccttg 3838081 tcgtaggcct cgccggcggc gtcgtcgacg gtgctgccca gctcgatgat cggctcaccg 3838141 agcgagcgaa cgtgcaacag gtgggtatgt cctccggaca ccaacaacgc cacacactcg 3838201 ggcagcggcc cgtgttcgta gacgtcggcg gccaagtgcc cgcccagatg attcaccgca 3838261 tagaacggca ccccccaagc agccgaatat gccttggccg cagccactcc caccaacagg 3838321 gcgcccgcga gcccgggacc gatggtggcc gcgacaatgt ctggctgttt caagccggcg 3838381 gccgccagcg cgcggcgcat cgcgggaccc agtgcctcca ggtgcgcacg ggaggcaatc 3838441 tcggggacca cgccgccgaa ccgaacatgc tcgtcgacac tggaagccac ctcgtcggcc 3838501 aacaatgtca cggtgccatc gggatcgagc cgcgcgatgc cgacaccggt ttcatcgcag 3838561 gaggtttcga tgcccaagac tgtcgtcatg acgggtcccc cgaatcccta cgcatcgtgt 3838621 acgcgtcggc gccgctgacc cggtaatatc gccggcgcaa gccgacccgc tggaatccca 3838681 cgctgcgata cagcgcaaga gcggcgtcat tatcggtgcg gacctccagg tagaccacac 3838741 cacccctggc aaagtccagc agttcgcgca gcaaccgacg gccgatgccc cgcccctggt 3838801 aggccgggtc cacgccgatg gtgtgcacct cgtactcgaa cggcggtgtt cggcccaacc 3838861 gcgagattcc agcgtaaccg accagcgtgc caccgctgcg cgcacccaca tagtggttgt 3838921 gcgggctggc cagttcgcgg ttgaacgccg ccggcggcca gggatcgtca ccgacgaaca 3838981 gctgggcctc cagctcggcg caccgctggg cgtccgcgcg cgtcagcgcg ccgatggtga 3839041 cgggctcggt gtcggccgtc acgtgcaaac cgccagcggc ttggcatccg gccggcgaag 3839101 atacagcggc actaacggcg ccggcttgtc ggcccagttc accgcggcta ccagacccgc 3839161 cggcgacggg cggctgggct caacgcaggg gagcgcgaac agcgccgcgt gctccggcgc 3839221 accggcgacc gccaatgccg ggccgggatc gacgtcggcc gcggcattaa cggctggtcc 3839281 gaccgtacga atcccgtcgc agtagcgtgc ccagtagacc tcacgccggc gtgcatcggt 3839341 gaccaccagc gtgtcaccga tggtttgccc gccgatggcg tccaggctgc acacgccata 3839401 caccgggatg cccagtgcgt gcccgtacgc ggcggcggag gccatgcctg cgcgcagccc 3839461 ggtgaacggg cccggaccgc agcccaccac gacggcgtcc aggtcggcca ttgtgagcgc 3839521 ggcatcggca agcgcagcca gcacgttggg agtcagccgt tccgcgtgcg ctcgggcgtc 3839581 gacggtgacc ctctcgccca gcacaaccag atcatgacgc cgcacgatac ccgccgtgac 3839641 cgccggtgta gcggtgtcga tggccaagac ggtgcttatt tgcacgcggc tcatgaccgg 3839701 ccccacgacc aagtcgcgat cctggtgtcg gagtggctaa cccgctccag gcggacgtcg 3839761 aggtggcgct gcgagagccg ctcggccagg ccctcgcccc actccaccac gacgacggcg 3839821 tcttcaagat cggtgtcgag gtccagtgag tccagctcac tcagcaggtc ggcgctgttg 3839881 tggtccagca gtcggtagac gtcgacgtgg accatcgccg gcgtgcccgg ccgccgcggc 3839941 cggtgcattc gcgccagcac gaacgtcggc gatgtgatcg gcccctcgac atccatcgcc 3840001 atggcaatac ccttggccag caccgtcttt cccgcaccga gcggaccgga gagcaccacc 3840061 acgtcgccag cgcacagctg ctcacccagc cgggacccca gcgttagggt gtcctcgacg 3840121 cgcggcagcg tcgccgtgcc gccgcccgta agcccagccc tggctttcgg tcgtctgcgg 3840181 ataccctcac ggctcaacgg ttttcagcct cgcgataggt cctggtgata cgtcctcgcg 3840241 ggctggtgac cacttcgtag tggatggtgc cgacaagatc ggcccagtcc tgagccgtgg 3840301 gctcaccccg gatgcccggc ccgaacaaaa tcgcctcgtc gccttcggcc acatcaagcg 3840361 gcccggggcc caggtcgacc atgaactggt ccatgcagat ccgccccaca ccggggcatc 3840421 gtctgccgtt gatcagcacc tccagccgcc cgcccagcga ccggaacacg ccgtctgcgt 3840481 aaccgatcgg cagcagcgcc agattggtgt cgcgtggcgc gatccatgtg tgcccatacg 3840541 acacgccctc ccccgcacga atcgatttca ccagcgcaac agcacatttc acggtcatcg 3840601 ccggcaccag ccccatgtca ccgagggcgg gtaccgggct tagcccatac accgcgatgc 3840661 ccggccgcac caggtcgaac gtcaggtcgg ggcgcgccat agttgctgat gagttcgata 3840721 gatgcgccac ctcgaaccgc accccttgtt cgcgggcctg cgccagaaag gcggtaaacc 3840781 gttgggcctg aacatcgttg atggaatcgt caggcttgtc ggcgtaaacc atatgcgaca 3840841 tcagcccccg cagccggacg gcgtcctcgg ccatggcttg gcgtaacgcg gtcagcatgg 3840901 ccgggaattg tgccggtccc acgccattgc ggttcagccc ggtatccacc ttgacggtca 3840961 ccgtcgccgt ccggccggtc cggcgcaccg cgtgcaacag ttcgtcgagt tggcgcagcg 3841021 aggacaccgc gacctgcacg tcggccagca gcgcgggccc gaagtcgatg ccgggcggat 3841081 gcagccaggc cagcaccggt gcggtaatgc catcagcgcg cagcgctagc gcctcgtcga 3841141 cggtggcgac gccgagttcg gccgcaccgg ctcccagggc ggtttgggcg acgcgcgtag 3841201 caccgtgacc gtagccgtcg gccttgacca ccgccatcag ctgcgcgtgg ccggcgtgct 3841261 cacgcagcac ccgcacgttg tgttcaatag cgcccagatc caccatggcc tcggcgagga 3841321 ggccaggtgt ctgggatatc ggtgtcatgg ccaacgaagt cgtgccccgc ccatctgtcg 3841381 tgtcgtttgg ctttccgaca ttctcccaga accgtttcac tgagcagtat tccggcctgt 3841441 gcccgattgc cccgggtcgc ggtgctgggc tgcagccgtg tcggcgtgac tgtcctgtgg 3841501 ctcggtggtt ggttgccgat cacccggtgt ttggctcaga ttgccggtgc cgcatgatgg 3841561 ttggcgtcaa tagagtgcgg atcggccggc atgaattgac gggagcgtag cttgaccgcg 3841621 gcccatcacc cgtggcagga aacagttgca gtgtgtacta ttcgccctag actgccgcag 3841681 ttccggggga agtgaaccta ttgcgcccgt gcatcactgc acgggtatgg gctttggcgg 3841741 tcgcttcgca ccatcaacgc cgacagtgcg gacagcgcaa accgacggca caccccttgc 3841801 acggatgtgg ggtgtttttg agatggagcg aaagtaggcg tgtcttttat tttcacaacc 3841861 ccccaggcat tggacaacgc ggctaagtcc gtgtcgggga ttcacgattt gtggcgcaaa 3841921 ggacgctaag gcatcgatcc cggtggtcaa cgctatttga gccccccgct tccgacccgg 3841981 tgtcgaatag ggatgaggcc gctcctccgc cagcacatga ggcagtatca ccagatcagc 3842041 tttccggcca tagagcatcg tcaccgggtt aggcatggtt taggcagcgc ttagctgaga 3842101 acgccgaggc gtgtcggctc gccgaggccc aaaacagcac aaccttgcac tgatctagct 3842161 gaagaccaaa ccggcacagc agacattgcc atacgcgaca acagccgtca tcaaccgaaa 3842221 ggagcaaaga acaaacagat gcatccaatg ataccagcgg agtatatctc caacataata 3842281 tatgaaggcc cgggcgctga ctcattgttt ttcgcctccg ggcaattgcg agaattggct 3842341 tactcagttg aaacgacggc tgagtcgctc gaggacgagc tcgacgagct ggatgagaac 3842401 tggaaaggta gttcgtcgga cttgttggcc gacgcggttg agcggtatct ccaatggctg 3842461 tctaaacact ccagtcagct taagcatgcc gcctgggtga tcaacggcct cgcgaacgcc 3842521 tataacgaca cacgtcggaa ggtggtaccc ccggaggaga tcgccgccaa ccgcgaggag 3842581 aggcgcaggc tgatcgcgag caacgtggcc ggggtaaaca ctccagcaat cgcagacctc 3842641 gatgcacaat acgaccagta ccgggcccgc aatgtcgctg taatgaacgc ctatgtaagt 3842701 tggacccgat ctgcgctatc ggatctgccc cggtggcggg aaccgccgca gatctacagg 3842761 ggcgggtagg tccaagaggc cggcgcggtc ttgcaggcca gcaacaatgc cacggtcgac 3842821 caggcccatc gcttccgggc ccgcacgaca caccgcggtt tcagatgaat caggcgtttc 3842881 acaccatggt gaatatgctg ctgatccgtt tacacgtcag gttcgactga tctagcttca 3842941 ggttcgactg atctagctga aaaccaaacc ggcacagcga cattaccata cctgacaaca 3843001 gccgtcacca accgaaagga gcaaagaaca agcagatgca tctaatgata cccgcggagt 3843061 atatctccaa cgtaatatat gaaggtccgc gtgctgactc attgtatgcc gccgaccagc 3843121 gattgcgaca attagctgac tcagttagaa cgactgccga gtcgctcaac accacgctcg 3843181 acgagctgca cgagaactgg aaaggtagtt catcggaatg gatggccgac gcggctttgc 3843241 ggtatctcga ctggctgtct aaacactccc gtcagatttt gcgaaccgcc cgcgtgatcg 3843301 aatccctcgt aatggcctat gaggagacac ttctgagggt ggtacccccg gcgactatcg 3843361 ccaacaaccg cgaggaggtg cgcaggctga tcgcgagcaa cgtggccggg ggtaaacact 3843421 ccagcaatcg cagacctcga ggcacaatac gagcagtacc gggccgaaaa tatccaagca 3843481 atggaccgct atctaagttg gacccgattt gcgctatcga agctgccccg atggcgggag 3843541 ccgccgcaga tccacaggag cgggtaggtc caagaggccg gcgcggtctt gcaggccagc 3843601 aacaatgccg cggtcgacca ggcccatcgc ttcgctgctc gcacgacaca ccgcggtttc 3843661 agatgaatca ggcgtttcac accatggtga acatgttgct gacgtgtttt gcatgtcagg 3843721 agaaaccgag atgacgatca acaaccaggt gagcgacgct gacacccacg gcgccaccac 3843781 cggcgcccct gtcgaccgcc acgtaattcc ccaggggttg gcgtcacgta attccccagg 3843841 tgtgtgcctc agtggtaggt cttagcggcc cgtgtgggcg ttgtctagct ggtggtgcgg 3843901 ccgggtctct tgcggggtcg gtagctgggt ccgtccatga ggatttggtg gctggtgttg 3843961 atgagccgat ccaggagtga ttcggcgacg acggggttgg ggaacaggcc gtaccagtta 3844021 ttcggtgcgc ggttgctggt caagatcagc ggtttgccag tgatggcgcg gtcgctgatg 3844081 agctcgtaga ggtcatcagc gtgcatggcg gtgtgctcac gcatcgcgaa gtcgtccaga 3844141 atgagcacga gcggcttggt gtattcgcgg atgcgttggc cccaggatcg gtcggcgtgc 3844201 ccgccggcga ggtcggagag catgcgggag gttttggcga agcgcacgtc gccgccgcgg 3844261 cgggccacgg cgtggacaag tgcttgtgct acatgggttt ttccgacgcc gaccgggccg 3844321 tggaggatga ccgattcgcc ggcatccagc cagcgcagcg cggccagatc gcgcaacatc 3844381 gcaccgggca gtttcgggtt ggcagtgaag tcgaagtctt cgaaggtggc ttgggcttcg 3844441 aacttggcgc ggcgtaatcg tcgtgtcagg gcggcggact cgcggcgggc gatctcgtct 3844501 tcacgcaacg cttgcaggaa ttccagatgc cccaggtcgc cgttgcgggt ttgggccagg 3844561 cgggcgtcga gggtgtcgag catgccggac agtttcaggg tacgtagcgc attacgcagc 3844621 gccggatcac agatagacat ggatgcttct ccttgagaat agcgatgtgg attgtgtcgg 3844681 gatggttcgg gattccgctg tgagatcagg cgacggggtg tgtcggtgcc gaactgttca 3844741 gggccgcgca ggaacgcccc cagcggtgct tgccggacta ctggtggtcg gctcgttggc 3844801 ggcgtgttcg gtgccggcaa caaggatgcc cttgatggtg cgatagctcg ggtcgccgac 3844861 ctcgatggcg cgggcgcagg cggcctccag ccggtcgcag ccgtgtttgt cgcgtagccc 3844921 gagcacgcct tgggccgacc gtaggtggtg gatggcgttg tcgcgcatga attcggcgat 3844981 cacttgctgg ctggctgggc cgaccagttc ggcggtgtgt cgacaccagg tcggggtgcg 3845041 catgtggaag gcgatcttct ccggtgggta gtgggagaag tcggtggagc gcccgctggg 3845101 tcggcgcaca tgggtggcca ccacatcgtt gccggcgaag atctgcacca catcaccggc 3845161 ggtgcgcgcg tgcaggcgtt gcccgatcag ccgccacggc acggaataga gtgccttgcc 3845221 aactttgagg tgcgtgtcca ccccgacggt gccgatcgac cagctggtga gttcaaatgc 3845281 cctgggcggc aatgcgatca acgcttgttg ctccacagct tcgaacatcc gcaggggttg 3845341 ggcgccctcc aaggcacgta agtaccgaag cccggccact tcggtgctcc aggtgaccgc 3845401 cgcctgctgc atctgggcca gcgaatcgaa ctcgcggcct ttccaaaacg agtcccgcac 3845461 ataggtcatc ggccgctcca cgcggggttt atctttgggt tttctggcgc gggccgggtc 3845521 gaccagcgtg gcgtagtggc tggccagctc ggcgtaggag cggttgatct gcgggtcgta 3845581 caggtcgggc ttgtccaccc cggtcctgag gttgtcacac actagccgcg ccggcacccc 3845641 gtcgaagaat tcgaatgcgg cgacatggca agcacaccaa gcggtttggt ccatccggat 3845701 gaccggacgc acgaacaggt gtcgggagaa cgccagcacc atcacgaacg cccacaccgc 3845761 gacccggcgc gcggtggccg ggtcgaacca catgcccagc cgcccgtaat cgatctgcgc 3845821 ctcactaccc gcatcgaccg gtccgcgcgg caccgtgact ctctcgcggg ccacctcctc 3845881 ggcgaaatgc gttgcgatcc aacgcctcac cgacgactcc gacgccgcca ccccgtggtc 3845941 gtcacgcagc cgttgggcta tcgtggccac cgtgacatcg gcatccagcc agtccttgat 3846001 ccgatcatga tgcggcgcga tcagcggcca cgtcgacgcc cgcgccgccg gatcattcag 3846061 gaaaccaccc gccgatcaac tccgcccact gctcggcgct cagcggctcc ccaccgggct 3846121 cgataccggc ggcgatcgcc ggcgccgtat atttgcggac cgtcttgcga tcgatgccca 3846181 gcgactccga taaccggacc tgagagcggc ccgcgtgcca gtgggtcaac aactcgacca 3846241 aatcgagcat caagatactt ctcctcgcca tcagcgccct tccatccgtc agccgacgga 3846301 tgcagagcga accttgcagc aaggcccaca ccgggcagac acaccgccca tggtggggaa 3846361 ttacgtgaca gccggggtgg ggaattacgt gacggacaac ccctcaaacc tggggaaata 3846421 cgtgaccgct gacacacagg tattgacaat tgctcattca ggctcaaatt gcctgtgccg 3846481 catgatggtt ggcgttaaag cgtgaggacc tgccggatga attgacggga gcgtagctgt 3846541 gaccgcggtc ccgtcacccg tggcaggaaa cagttgcagt gtgtactatc cgccctagac 3846601 tgccgcagtt ccgggggaag tgaacctatt gcgcccgtgc atcactgcac gggtatgggc 3846661 tttggcagtc gcttcgcacc atcaacaccg acagtgcgac agcacaaacc gacggcacac 3846721 cccttgcacg gatgcggggt gtctttgaga gggagcgaag tagccgtgtc ttttatcttc 3846781 acaacccccc aggcactgga caacgcggct aagtccgtgt cggggattca cgatttgtgg 3846841 cgcaaaggac gctaaggcat cgatcccggt ggtcaacgct atttgagccc cccgcttccg 3846901 acccggtgtc gaatagggat gaggccgctc ctccgccagc acatgaggca gtatcaccag 3846961 atcagctttc cggccataga gcatcgtcac cgggttaggc atggtttagg cagcgcttag 3847021 ctgagaacgc cgaggcgtgt cggctcgccg aggcccaaaa cagcacaacc ttgcactgat 3847081 ctagctgaag accaaaccgg cacagcagac attgccatac gcgacaacag ccgtcatcaa 3847141 ccgaaaggag caaagaacaa acagatgcat ccaatgatac cagcggagta tatctccaac 3847201 ataatatatg aaggtccggg tgctgactca ttgtctgccg ccgccgagca attgcgacta 3847261 atgtataact cagctaacat gacggctaag tcgctcaccg acaggctcgg cgagctgcag 3847321 gagaactgga aaggtagttc gtcggacttg atggccgacg cggctgggcg gtatctcgac 3847381 tggctgacta aacactctcg tcaaattctg gaaaccgcct acgtgatcga cttcctcgca 3847441 tacgtctatg aggagacacg tcacaaggtg gtacccccgg cgactatcgc caacaaccgc 3847501 gaggaggtgc acaggctgat cgcgagcaac gtggccgggg taaacactcc agcaatcgca 3847561 ggactcgatg cacaatatca gcagtaccgg gcccaaaata tcgctgtcat gaacgactat 3847621 caaagtaccg cccggtttat cctagcgtat ctgccccgat ggcaggagcc gccgcagatc 3847681 tacgggggcg ggggcgggta ggtccagaag gccggggcgg aacctgtcaa catttctgag 3847741 acacgatttt cggggattta ttgagtcggc tggtcctcct tcggtggtgg gttgatcgcg 3847801 ctgaaggccg gtagcgcggg tggctcgggt ggtttgcgaa cgaatccgct cgaggtggtc 3847861 tcggtaggcg gtgtccagaa cggtggcgcg gtgccggcgg atctgatcgg cgcggccgta 3847921 gtgcacgtcg gcgggcgtgt gcagtccgat gccggaatgc ttgtgttcgt ggttgtacca 3847981 gccgaagaac cggtcgcagt gcacccgggc cgcctcgatc gactcgaacc gtttcgggaa 3848041 gtcgggccgg tacttgaggg tcttgaactg ggcctcagac aacgggttgt cgttgctggt 3848101 gtgcgggcgt gagtgcgact tggtgacacc gaggtcggcc agcagcagtg ccaccggttt 3848161 ggagctcatc gacgagccgc ggtcggcgtg cagggtcagc tggtcggcgc tgatgtgctg 3848221 ggcggcaagg gtttgcgcga tcagccgctc ggccaagacc ttcgactcac gcgaggccac 3848281 catccacccg accacgtagc gggagaagat gtcgaggatc acatacaggt agtaatagct 3848341 ccactttgct gggccacgca gcttggtgat atcccacgac cacaccgaat tcggctgatg 3848401 agcaaccaac tctggcttca ccgcagccgg gtgggtggcc tggcggcggc gatcaccggt 3848461 ctggccgcgc tcacgcagca gccgatacat cgtggactcg ctgcacaggt agatgccctc 3848521 gtcgagcagc gtggcatata ccaccgccgg cgccatgtca gcgaagcgct gcgagttcag 3848581 caccgccagt acgtgctcac gttcggccgc actcagcgcc cgcggctgcg cgctctcccg 3848641 cggtcccgac gggtcggtca ccgccgtgct ggtgaacgta tccgattgtg ccgacaaccg 3848701 tttcgagtgg gcccggtagt aggaggccgg cgcacgaccg gtcgccgcac acgcggcccg 3848761 aaccccgatc aacgggatca tctcctcgat ggccgtgtcg atcacgctca gcgctcactc 3848821 tcgcacatcg ccgcgctgtc ggctcagagc ctctccaaga gcgcggacag ttccccctgc 3848881 acacggatca cctcgcgtgc ggtgtcgagc tcggcgcgca gccacgcgat ctcggcgtca 3848941 gcggcattgg cgccggcctt gcccggcttg gggccccgcc gcgccgacag cgccgccaac 3849001 gccccccgat cacgctgatg gcgctattcg gtcagcaacg acgaatacag gttctcccgc 3849061 cgcaagatcg cacccctttc cgtgcgatcg gcgcggtcat actcatcaag gatcgccagc 3849121 ttgtacttca cggtgaacgt acgccgctgc gcccgctcag gcacctgagg atcaggcacc 3849181 tcgtccacgg tgaccgacga accccgtcgg ccagtaccag ccctattagt caacctcgtt 3849241 ctcttcgtac tcgccctcag gctcagtaaa catctccact cgcagtgtct cactcaaggt 3849301 tgacagagag ggtcggcgac gcggtcccac tgagcgccga cctcctcagg gtcggtgtgg 3849361 gcgaaaatcg tcttgaccgc cacggtcacc gccggggcgt gtttggccgc taccgcggtg 3849421 tacaggtttc gcatgaaatg cacccggcaa cgctgccacg acgccccact gaactgttgt 3849481 gccacagcgg ctttcagccc agcatgggca tcggagatca ccagatgcac cccggtcagc 3849541 ccacgcgctt tcagtgaggc caaaaactca cgccagaact cgtaagactc gctgtcaccc 3849601 acagcggtgc ccaacacttc gcgggtgccg tcgatggaca ccccggtggc caccaccaga 3849661 gcctgagaca ccacgtgcgc cccgacacgc accttgcaga aggtcgcatc gcagaacaca 3849721 tacgggaact cggtgtgggt caagctgcgg gtccgaaacg cctcgatctc ggtgtccaga 3849781 ccggcgcaga tgcgtgagac ctcggattta gacaccccgg cctgcacgcc catcgcggcc 3849841 accagatcat cgacactgcg cgtcgacacc ccgtgcacgt aggcctccat gatcaccgcg 3849901 tgcaacgctt tatcgatgcg gcggcgccgc tccaaaagcg acgggaagaa cgaaccggcc 3849961 cgcagcttgg ggatctgcac ctcgatatcg ccggccgtgg tcgacactgt cttgggccgg 3850021 tgcccattgc ggtgcacgat gcgcccatcg gagcgctcgt agcggcctgc accgatcgcc 3850081 tcggtggctt cggcctcgat caacgcctgc aacccggcac ggatcagctc ggcaaacacc 3850141 gccgaggcat cagcagcttc actcgcgtta cggaccgctc agttgcttcc cccaacgggg 3850201 ctttcgacgc tgggcttcga ccctgcccgt ttccaaacca agcggccagc ctgctaccgg 3850261 gcctcctgac agctacccgg accggactcc caccggcagg cgacgacgag ctttgatcag 3850321 gtcatgacct aagacatcac ctcctgatca ctgggcgcac cggctgcagt actagtgcgc 3850381 gaaatgctgt gcgtcgaagt ggccacccgg cttgaccttg tccagggcag ccaacgcggt 3850441 gaccgcgtcg tcgtgcaggg cccgcgccag gtcggcggag agtccttccc gaaccacgat 3850501 ccgcagcacc gccacgtcgg tggcgttgtc cggcatggtg taggcgggca cctgccaccc 3850561 gaaggtccgc agctcatggg agacgtcgaa ctccgtgtac ccgcggtcgc cggcgagccg 3850621 gaagctgacc accgggatcg ccgaaccatc cgagatcacc tcgcaatgat ccacctcgcg 3850681 cagctggtca cccagccacc gggcggtgtg cgacagcgcc tgcatcacct tggtatagcc 3850741 gtcgcgcccc agccgcagga agttgtagta ctggcccacc acctggttac cgggacggga 3850801 gaagttcagg gtgaaggtcg gcatgtcgcc gccgaggtag ttgacccgga aaaccagatc 3850861 ctccggcagg tgctcgggcc cgcgccacac gacaaacccg acgccgggat aggtcagccc 3850921 atacttgtgg ccgctgacgt tgatcgacac cacgcggggc agccgaaaat cccataccag 3850981 gtccggatgc aaaaacggca ccacaaagcc cccactggcc gcgtcgacgt gtaccgggac 3851041 gtccacaccc ccgccagccg ccagtttgtc cagcgcggcg cagatctcgg cgatgggttc 3851101 gagttcaccg gtataggtgg tgcccaagat cgccaccacg ccgatggtgt tctcgtcgac 3851161 ggcggcgagc acctgctcgg gggtgatgac gtagcggccc cgctccatcg gcaggtaacg 3851221 gggttcgacg tcgaagtagc ggcagaactt ctcccacacc acctggacgt tcgaacccat 3851281 caccagattg ggcatgcgcc ccttccaaga ccccacccgt tgccgccaac gccatttcag 3851341 ggccagccca cccagcatca ccgcctcgct ggagccgatg gtggacaccc cggtggcgct 3851401 ggtggggtcg tggtcgcgca gaccctcggc gtgaaacagg tcggcgacca tggacacaca 3851461 gcgcgcctcg atggccgcgg tcgccgggta ttcgtcctta tcgatcatgt tcttgtcgaa 3851521 cgtctcggcc atcagctttt cggcctccgg gtccatccag gtggtcacga aggtggccag 3851581 attcagccgc gagctaccgt cgagcatcag ctcgtcgtgg atgaagcgat aggccgcctc 3851641 gggatccatc gactcatcgg gcatccgcag cgccggcacc ggtgcggtga acatccgacc 3851701 ggtgtaggcc ggagcgatcg aatgcgcggg cacggacggg tgactgcgag acacggcgga 3851761 tcctttccgg gcttgttgcg gactggcagg actacagggc agccagagcg gcccgaatgt 3851821 ggccgctgat gcgcgacgcc gacgtgggcg catcgccggg gccgggatcg gcggccgcgg 3851881 ccgccgatgc ccgggcgtgc acgaacgccg cggccgcggc cgcctcccca gacggcaatc 3851941 ccgacgccag cagcgcaccg atcatcccgg acagcacgtc accggacccg gcggtggccg 3852001 cccaggactg gccggccgga ttgagataga ccgggccgcc gggatcggcg atgacggtga 3852061 cattgccctt gagcagcacg gtggcgccca gcgcgtcggc cagctggcgg caggccccca 3852121 cgcggtcgtc accgggcggc gccccggcca gccgggcgaa ctcaccggcg tgcggcgtca 3852181 agaccgtcgg ggcgttgcgg cccgccacca gatcggggtg gtccgccagc atggtcagcc 3852241 cgtcggcgtc gaccaacacc ggcaggtcgg tgtccagcgc gaaccacaac gcggcggccc 3852301 cggcttcgtc ggtgcccagg cccggcccga cgacccaggc ctgcacccgc ccggccgccg 3852361 ccggggtggg cgaggcgatg acctccggcc agtgcgcgag gacttccgca tgggcggtcc 3852421 cggcgtagcg gaccatgccg gaggtggcgg cgacggccgc cccggtgcac agcacggccg 3852481 cacccggata cgtcgacgac ccggccagca cgccggtcac gccctgggtg tatttgtcgt 3852541 cgcggggacc gggcaccggc cagcgcgcgg ccacgtcggt agcctcgaaa cccaacacgt 3852601 cggtgtgcgc caggtccagc ccgatatcga caaggacgac gcggccgcag tcggccagcg 3852661 cgtgcaccgg tttgagcccg ccaaaggtga cggtcagcgc ggcgtgcacg gcggggccgg 3852721 tgatcgcccc ggtcgccaca tcgatgccgc tggggatgtc gacggcgacc accggtatgg 3852781 cggcggcctg aaccgcggcg aacacctgcg cggccgccgg tcgcagcggc cccgagccgg 3852841 agatgccgac caccccgtcg atgacgagat cggtcgccgc cgagacactc tcgacgaggc 3852901 gacccccgga tttggtgaac gccgccagcg ccttgcgatg cgtgcggtcc gggttgagca 3852961 gcaccgcgtc ggcggcggcg ccgcggcgtc gcaggaacgt cgccgcccac agcgcgtcgc 3853021 caccgttgtc gccggatccg acgaccgcgc acacccggcg gccgaccacc ccacccgtgc 3853081 gagcggtcaa ctcacggccg atctcggtgg ccagcccgaa ggccgcgcgt cgcatcagcg 3853141 caccgtcggg caggctggcc aacaggggcg cctcagccgc gcggatggtg tcgacagagt 3853201 agtagtggcg catctcaggc ccgccgtcct cgggtgccgc gcctgtgcag cagacttttg 3853261 attctggccg gattccacag ccgaccgtcg cgttcccggg ccatcggata gaacagcaga 3853321 ccaatcagga tggacgcgaa gtgaccgacc gcggtgaagt ccagctcggc tttgtccatc 3853381 gcgatcagcg gaaaaccaaa gatgaccagc agcaccccga gatagcccca gcgccacggt 3853441 ttggcgatgt gataggtcaa taccgccatc acaccgacca ggaagtagct gaccccgata 3853501 tcacgagcgt gcaccatcct ttcggaggcg tctcggtgct ggatcgccag atagagcagg 3853561 ccttcgctca aataggtggc accgatgtga gcggtcaatc ccacggtgag ccaacgcaag 3853621 tggccgagcc aatgctcggc gggcgctagg aacagggtga acagcagcag gtacggttcc 3853681 aaattccggc cgtcgatcca caacaggctg gaaaacagca cctcgagcgg atcgcgcccc 3853741 aactcggcga tgttggtgga ccggtgcagg agcacgaaat gcagctggct cccggtgaga 3853801 ttgttctgga tgatcgtggt gatcaccaac acgaccagcc aggcataggt caacggggcg 3853861 ttgctgacga agtgccacac cgcgagcgcc cacgatcgca gccgtgccac caccgatgcg 3853921 tccgccacgg gtcaacactt acatggtttc gtcgacgtca ggcttcaggt gccaccacca 3853981 gcagaggata tacagcacca tcgaggtcac caatgcgacg accacccaga gtacgaccgt 3854041 gacgtcgatc caaaatccga ttgggggcgc gtcgggaagc gcattgcgca gcggtatcac 3854101 cgcaaaaagc attgccgcat accacgttgt catcggcggc tggaattgcc gccggccacg 3854161 tgcggtttga accgcaacga acaggcccac cccggccagc gcgatcaaca caccgacgat 3854221 gacggtgccg aatgccacgc tgctcggcga tcggtgcaac cccactcggt acggggcagg 3854281 cacattggcg tcgccgacac cggaaatgtc gacgttccag cccggaagcc ggtcgacgaa 3854341 tgtcaccgac acacgttccg gcgcgtgcgc ggctccgcgg tagagctgga ccgtgatcgg 3854401 ccccgaacgg tagtggtcga acggccaatt cgcgggatcc ccggagatgg tcagcgggac 3854461 gggaaagacg ccgggcagcg aaccactcga ccaggtgcgc ttggtaggcg ttaccacgga 3854521 tgtgaccgtg acggtgaggt cgtccttgag gccctgggtt tgcgaatcca gcagctcagt 3854581 cccaggtgac acggcgaggt tggcaaccag cacgcccttg atcgtctgaa gctgctcgac 3854641 gtgcagggtc accgtggtcc cgtcggccgt cggccgaccg tgggcgactt catgaggtcg 3854701 gccgaggccg gtgctgtgat acaacgcgat cacggtgacg taggccgcaa tcacgagcac 3854761 caaaccgaca acgactctca ggatgcgtcc caactcgcta cccgcccact tgtgcgttcc 3854821 ggcccggaaa ttgtaaccgc gggacccctc cgtcagcgga tgccaccgcc aggccacgtg 3854881 attgtgcgac agccgccatc ttcctgtggt aggtgatcat cgccgtcaac tccgcaccca 3854941 acgtctccgc ggtcgccacg tggatcgcgt cgagtgtctt gggccgccag gtcccgtacc 3855001 gtgagccacc tcgactattc gacggtgacg gacttggcca gattccgcgg cttgtcgaca 3855061 tcgtagccgc gggcgcgcgc caccgaagcc gcgaacacct gaagcggaat ggttgatagc 3855121 agcggctgca atagcgttga caccgctggg atttcgatca ggtgatcggc gtaggggcgc 3855181 accgtttcgt cgccctcctc ggcgatcacg atggtcaccg caccgcgggt ctggatttca 3855241 cggatgttgg acagcagctt ggcgtgcagc gtggccgacc ccttgggtga gggcatgacg 3855301 acgatgaccg gtaggccgtc ttcgatcagc gcgatcgggc cgtgcttgag ctcgccggcc 3855361 gcgaaaccct cggcgtgcat gtaggccaac tccttgagtt tgagtgcacc ctccagcgcc 3855421 accggatagc cgacatggcg acccaggaac agcacggtcg acgactgggc gaaccggtgg 3855481 gccagctcgg ccaccggtcc ggtcgccgcg atcacccggg ccaccaggtc cggcatcgct 3855541 tccagttcgt ggtactcgcg ctcgacctcg tcggggtatt tggtgccgcg ggcctgcgcc 3855601 aaggcaaggc cgagcagata gttggcagca atctgcgcca gaaacgtttt tgtggacgcc 3855661 acaccgatct ccgggccggc gcgggtgtag agcaccgcgt cgcactcgcg cgggatctgc 3855721 gagccgttgg tgttgcagat cgccagcacc ttggctttct gctccttggc gtgtcggacc 3855781 gcttccagcg tgtcggcggt ttccccggac tgcgagatcg ccaccaccaa ggtgctacgg 3855841 tccaacaccg gatcccgata ccgaaactcg ctggcgagtt ccacttccac gggcagccgc 3855901 gtccagtgct cgatcgcgta cttggccagc agcccggagt gatatgcggt accgcaggcc 3855961 accacgaaca ccttgtcgat ctcgcgcagt tcctggtcgc tcaaccgctg ctcgtcgagc 3856021 acgatccggc cacccacgaa gtgtccgagc aaggtgtcgg ccaccgcggc gggctgctcg 3856081 gcgatctcct tgagcatgaa gtactcgtag ccgccctttt cggcggcagc cagatcccag 3856141 tcgatgtgga aggggcggaa atcgcgccca gcttgtaggc catcgttgcc gtcgaaatcg 3856201 ctgatccggt agccgtcggc ggtgatcacc accgcctggt cctggccgag ctcgaccgct 3856261 tcccgggtgt gctcgataaa cgcggccacg tcggaaccga cgaacatctc gttgtcgccg 3856321 atgcccagca ccaggggcgt ggaacggcgg gccgccacga gggtgccggg gtcgtcggca 3856381 ttggcgaaca cgagcgtgaa atgcccctca agccggcgca gcacggcaag tacggagccg 3856441 acgaagtcat cggccgtctc gccgtgccga tacgcccgcg ccaccaggtg cgccgcgacc 3856501 tcggtatcgg tgtcgctggc aaactcgaca ccggcagtct ccagctcccg gcgcaagacg 3856561 gcgaagttct cgatgatgcc gttgtggacg acggcgatct tgccggcagc gtcgcggtgc 3856621 gggtgcgcgt tgcggtcggt gggacgaccg tgggtggccc agcgggtgtg gcccaggccg 3856681 gtagtaccgg acagcgccgt ggacggcatt tccgccacgg cttcctcgag gttggccagc 3856741 cggcccgcac gccggcgcac ggtgagtgtg ccaccgtcga ccagcgcgat gcccgacgag 3856801 tcgtagccgc ggtactccat ccggcgcagc gcgtccatga cgacgacgta ggcggggcgc 3856861 cgcccgacgt aaccgacaat tccgcacaca gcagaccagg gtagtgcagc atggtcggta 3856921 gggcagtccc gtcgcccaac cgacgctatc gtcgagtttg gccaccgcgc acgaaaggcc 3856981 aacacttgtc caacccatat gcccagcacc agctgaagct catcaggcac acgggtgcgc 3857041 tgatcctgtg gcagcaacgc acctacgtgg tctccgggac gcgcgagcaa tgcgaagcgg 3857101 cgtacaagtc ggcgcagacc tacaacctgc tcgttggttg gtggagtttg gtgtcgctcc 3857161 tcgcgatgaa ctggatcgcg ctgatttcca acttcaatgc gattcggcgg gtgcgagccg 3857221 ccgccgacgg ggcgtccgtt ccccacggcc cgcacgccat cgcccatcca gccgttcccc 3857281 ggggacccat accggcgggc tggtatccag acccgtccgg ggcgggactg cgttactggg 3857341 acggtgcgac gtggacccac tggacccatc cgccacgtca ccgctaacgt cgacgggtgc 3857401 cccggatccg caagctcgtc gccgccctgc accgccgggg accacaccgt gttttgcgcg 3857461 gtgacctggc ttttgccggc ctacccgggg tggtgtacac ccccgaggcg gggctgcacc 3857521 ttcccggtgt cgccttcggc cacgactggc tcaccggcac ctctcgctat tcgggtctat 3857581 tggagcattt ggcgtcatgg ggcatcgtgg ccgccgcccc cgacagcgag cgcggactgg 3857641 ccccatcggt cctgaatctg gccttcgatc tgggcgttgc cctcgacatc gtggccggtg 3857701 tccgccttgg gcctggaaaa atcagcgtgc accccgccaa gctcgggctg gtgggccatg 3857761 gtttcggtgg ctcggccgcc gtgttcgccg ccgccggctt gaccggcacg cacgtcaagt 3857821 ccgtggcggc gatattcccg acggtgacca atccggccgc ggagcagcca gccgcgaccc 3857881 tagacgttcc gggactgatt ctgaccgcac ctggcgatcc gaagacgctg acctccaacg 3857941 ccctcgggct atcccgggct tgggataagg ccaccctacg catcgtcagc aaagcccgag 3858001 ccggtggtct ggttgagggc agacgactga cgaaggtgtt ggggctccca ggcccacacc 3858061 gccggacgca gcgttcggtc cgggcgctgc tgaccgggta cctgttgtac acgctcggcg 3858121 gcgacaagac ttatcgcagg ttcgccgatc cagacctgca gctgcccaag acggacccga 3858181 tcgaccctga agcgccgccg atcaccccgg gggagaagat cgtgacgctg ttgaagtagc 3858241 gcgggacacc ccgacccgtc acggccccgc ctgcggaagc tcgtcggcgg cgatctcaca 3858301 gggggtggct ccctcggaca gcgcttccgg cgaaggccca ttcgccggtt ccggcgcacc 3858361 cggcggcgcc ggcgcagcga ccggaggcgg tggttcggcg accggaggcg cggcaggcgg 3858421 cggcggttcg gcgaccggcg gcagcgtggc ctcctcggcg acatcctggg cacgttcagc 3858481 gggcaccgat tcgtcggcgt cgtcgacctc gtctgcttcg tcgggctctg ttgcttcctt 3858541 cggctccgct gcctcgtcgg cctcttccgg gtgggcatcg tcgccgtcat cagcgttgtc 3858601 ggccgcgtct tcagcgaacg gatcgacggc acccggcgga ttgtcagctg ccaacggatc 3858661 ccccagctgt tcggccaccg aacccagcag gctatccacc gcatcgacga tccggttggc 3858721 aagcccggca agcccggcaa acccgcccag accgccggtg ccgccggcat cgccgaaacc 3858781 accggcgctg ccaacacccg ccggcgtcgc ggaaccatca cctggcgccg atccaaaatc 3858841 cgacggcgtc actggccgcg aggtcacggc cggcaccgga tccggcgggg gaagagcggc 3858901 cgcgggcgta atcgctgccg tcgcgctcgg ttgagccggc accgatgccg gagaaggttg 3858961 gcgaccgggc ccgagatcgt ccggaatctc gaagtgcgcg cgcggcgcgc tggccagctg 3859021 atcggtgacc gcatcatacg acgccgccac accggccgtt gtcgatcgca tcgtggtcag 3859081 ccagtcgttg cgaacatcgt cgtccacgta gggctgtatc tgttggcgaa ccacttcgac 3859141 ggccgtcggc cgatctgccc cctccgtcgt gagcgcttcg gccgcagcca accatgccgg 3859201 ccgctgcgcc agggcacgct cgtcgatcgc aatggccgtc gcgactttgg agtccaccag 3859261 ctgccagagg ttgtcgcgca gcgattcgca gcgttgggcc gcggcacgga cttcggtgac 3859321 caccgaattt ccagtctcac agtgacgctg cacaaagtgc accgccgcgt cggcccccga 3859381 tcccgtccat gccgctgcca agacggcgac ctggctacgc tccatccgca gcgcctccat 3859441 gagcacactg gcggcagccc gcagctgcgc gcagtcagcg tcgagcgcgt gcaggtcaag 3859501 tccgtcttcg ctgccgtacc agtcgtggat ctgggcaggg taggcggtca ggtcgggatg 3859561 ttggtagccc accaggtggc aagcccgcac gtagctttgc gtgtgctcgg ctgcgggcct 3859621 gccctcggcg agacgctcag cgacgttcaa ccggtcagcc accctcaccc gatccgcgcc 3859681 gccgcgcaca ggtcggcctc ggcgtagcgg ttggcgcccg cccgcaacgc aaacgcgatc 3859741 tgcacagccg cccgggacca cactgacaac tcgccggcca accggtctag cctgcagcgc 3859801 aacgcatcgc cacgcgaggc gtgcccccgg cccgcgcagg ctccgccaaa agccagcctc 3859861 gtcaggtgat tgccgatggc gtcatcgatg agctcggcgg cggcgctgaa ccggtcggca 3859921 accgcgtata ccgctgctat gtctatgccg gcgctgttta cgctatcggg tctcatgcct 3859981 attcggacgc cccgcgccgc gtcggggttc cagcatttcc ggttcagcgc gcggtgctca 3860041 ccgcgtcggc gaccgtggcc gccagccgct gggcgacgcc ctcgtcggct gcctccacca 3860101 tcacccgaat catcggctca gttccggacg ggcgcaagag gattcgaccc gtgtcaccca 3860161 gctcggccgc ggcctgctcg accgccgttc ggaccgaggg cgccgcggcg gcggtggcct 3860221 tgtcgacaac ctcgacgttg atcagcacct gcggcaacgt ccgcatcgcc gacgccaggt 3860281 cggacaacga cgagccggtc tgcaccatgc gggtcatcaa ccgcagcccg gtgacgatgc 3860341 cgtcaccggt ggagcccagc gccggcatga cgatgtggcc ggattgttcg cctccgaggc 3860401 tgtagtcacc ggcccgcagc tcttcgagga cgtagcggtc accgacggcg gttgtacgca 3860461 cggtgacgcc ggccgagcgc atggctaggt gcagcccgag gttactcatc acggtggcca 3860521 ccaatgtgtt gcaggccaac tcaccggcct ctttcattgc cagcgccagc accaccatga 3860581 tggcgtcacc gtcgacgagg tcaccgttgg cgtcgacggc caggcaccga tcggcgtctc 3860641 catcatgggc caggcccagg tcggcccgat gggcgagcac cgctgcccgc agcgggtcaa 3860701 ggtgagtcga tccacagccg tcgttgatgt tgcgtccgtt gggttcggcg ttgatcgcga 3860761 taacccgggc accggccgct cggtaggcgc gcggagccgc cgacgacgcg gccccatgag 3860821 cgcagtcgac caccacggcc aggtcatcga gccgggcggt ggcggccttg gccacgtggc 3860881 gcaggtagcg ttcggtcgca tcctcggcgt cgataacgcg gccaatcccc gcgccggccg 3860941 gccgcaaccc gggtccgcgg gagacgccga ggaccagatc ctcgatctga tcctcggtgt 3861001 cgtcatctaa tttgtggccg ccgggcccga agattttgat gccgttatcg ggcatcgggt 3861061 tatgcgacgc cgagatcatc accccgaagt cggcgtcgta ggcgccggtc agataggcca 3861121 ccgcgggggt cggcaacacc ccgacccgca gcgcgtcgac gccctcactg gtcaggccgg 3861181 cgatcacggc ggcctccagc atctcgccgc tggcccgcgg atcgcggcca agcaccgcga 3861241 ctcgccgacc cggtgcgccc gacctcgaca atcgtcgcgc cgccgcggcg cccagtgcca 3861301 gggccagttc cgcggtcaac tcgcgattgg cgacaccgcg cacaccatcg gtgccaaaca 3861361 gtcgacccat acggacaacc tttcacagtt gacggctgcg cacatatcca ctcttggcag 3861421 cgaatatgcc tgttggttca ccgacacgcc gacgagcgca cacaaacatg cacgcttgtc 3861481 gcccgaaagt gatgtcagcg cttgctgtac tggggcgcct tgcgggcctt cttcaggccg 3861541 tacttcttgc gctcggtggc gcgtggatca cgggtcaaga agccggcctt cttcagcgcg 3861601 ggccggtcct ccggcgatac cagaatcaat gcccgggcga tacccaggcg cagcgcgccg 3861661 gcctgacccg acgggccgcc gccgcccagg tgggcaaaga tgtcgaaact ttccacccga 3861721 tccacggtga ccaggggtgc cttgatcaac tgctggtgca ccttgtttgg gaagtagtcc 3861781 tccaagctgc ggccgttgag gtcgaacttg ccggtgccgg gcaccagccg cactcgtacc 3861841 acggcctcct tacggcgccc aacggtctgg atgggccgct ccaacacgaa cgattgtgcg 3861901 ggcccggccg gggccgccgg ggtttgcggg gctggggtgg tttcggtcat tgcgccacct 3861961 gcttgagctc gtacggaacc ggctgctgag cgctgtgcgg atgctccggg ccggcgtaga 3862021 cgcgaagctt gcgctggatc tggcggctga gcctgttctt gggcaacatg ccgaggatcg 3862081 ccttttccac cacgcggtcg gggtggcgtt gcattagctc accgatggtg cgcttgtgca 3862141 ggccgccggg ataccccgag tgccggtaaa ccatcttgtg ctgcagtttg tcgccgctga 3862201 tggcgacctt gtcggcgttg atcacgatga cgaagtcacc gccatcgaca ttgggggcga 3862261 acgtcggctt gtgcttgccg cgcagcaggt tggccgccgc gacggcaagg cggccaagca 3862321 ccacgtccgt ggcgtcgatg acgtaccacg atcgcgtggt gtcacccgcc ttgggcgcgt 3862381 acgtgggcac agcgcttacc ttcttttctc tcgggtggat cccggggtgc cccgggcgcc 3862441 ggtcaggcgt gaacggcggg ttggtctcgg cgaaccgaca ttgacccgag gtcccggcgt 3862501 accgcacgcc aaccgagcag cttaccgacg agcatccacg caggtcaaaa tgactgtgtg 3862561 gtcccgacgg ctctcccccg tcgggaccac acaggggtct gttgcgcgct ccggggcccg 3862621 gaactagcgt gcccaagctc cagccgcccg ccggtcggca tgcgccacgt cgtcggcacc 3862681 gtggcgaacc gcgtttccca agtcgatcag gatctcgttg agcgcgctgg ccgcctggtg 3862741 ccacttgagt tgctccgcgt ggtaggcggc ggccgcttcc cgtgtccaga gctgctgcaa 3862801 cggcgcgatc tgcgacctca gctcttgcag cgcagcgttg aaacgggccg cggtggtgtg 3862861 gatctcctga cgaacggagt attcgatggc gtcaaagttg tacgacaaca cggggtctgc 3862921 gttcatagtc gaggctgatc ctcggtctat aggtcgccgc cggcggcggc gatgtggcgg 3862981 gcatggattt ggccggcttc ccgcagcgcg gcctcgttgt ggcggatggt gtcggcgatc 3863041 gcgtgcagga cgtggtagag ccgcgtcgac tcggcgttcc agcgatccac cacatcctgg 3863101 aaccgagcgg ccgcgagccc accccacacc gacggcggca caccgctcat gcggccgatg 3863161 aatgcctgca gcatcgcacg gatttcctca ttgcgggcgt ccgtgatacc cgcaaccgaa 3863221 cgcatcaggt caaagtcggc gttcagcgtg ttcggtgtgc tcacatcaag taggaccgcc 3863281 gccaacctcg tctggttccc tccgatcctt cccggttcaa ccaacggcgt ggacggaccg 3863341 tacggcttgc gcacacacct ccctgaggag gtcttcatgg ccgggcccgc tctggcagcc 3863401 gacgctgatc cggaccgctc cgtcgagcag aatcgtccac cgcacctgat gcccggcgcg 3863461 gacctctcga taggtcaccg cgggccggcc ggctctgata tcggaggggt tgaagtcgac 3863521 gaataccccg gccggtgacg cgtcgatcgc ccgcttcaac cgctgcgcgg tgccaggcag 3863581 cgtctcaccg ggaaccggtg attgtgtgac gtgcaacgcc acctcgggat cggccggtga 3863641 agtgacctgt acccgcgccg aaccgggacc ggagaccacc cgctgcgtgg accagtccgc 3863701 cggaatcgtc agcgccaccc ggccctctac cagaagcgtc gtcggtggtc tttgcagggt 3863761 tgtcgcaccg tggcggacca cggcagccgg cgccagtaac gccaaggcga caccggcggc 3863821 cgcaacccgg gcaagtgtcg ggacccgaga gcgggtggca ggccgcgccg ccggatcggc 3863881 gggctcgtcg gaaggcggca gggcggccct ggccaaccgc gccagccgca cgccgtcgat 3863941 ctcgaccacg ctgctaccgg taccccgcac cgcaccggcg attgccgccg cgagcgctgc 3864001 cgccccggcg accgtactgg gcacgtcgat cagcaccacc gcggtaatac cccgcgtcat 3864061 ccgcgcaatg acactgccta cctggccggc aacggactcg gcgtccgtgc ggcgggccac 3864121 cgcggcgacc tcggcgccgg ccaccaacac cagtcgctcc gcgatctcca ccaccaccgt 3864181 tgcggccgaa acccccgagg acgcctgcct cagcagccac gaccgcgggt gcacgacgac 3864241 atcgcgggtc agcgtgcgtg cggctgcggt gaccacctcg acccgagccg ccgaccacca 3864301 cgacgggtgc acgacgaccg ggccgtcacg gtggtcgacg gccaccgatc gcagggcgtc 3864361 gaaccacagc gaatccacgg cgactggccg ttcgtccagc agcgctacct ggtcgtcgat 3864421 cgccgccagc gcggcggcag acactgcggt gtccgcgact acgtctgcgc cacaacacaa 3864481 tcggcggatg gcacccggac ccgcctcgat caccgcgcga tgtgggctca cgggggtggg 3864541 ctccaggcga cttgaaccag ttgctcgtca ccggcaccgg tgaccaggat gccccggccc 3864601 ggtggcagcg gcatcgggcg gctcgacccg aacagtgcgc cttcatccgg acgtccgctc 3864661 atcagcagtg cccggcagcc caggtcacgc aggctggcaa gcaccggctc gaacagcgcc 3864721 cgagcagcac ccccgctgcg ccgcgccacc accaggtgta aaccgagatc tcttgcgtgc 3864781 ggcaaatatt cgagcaagac catcagcggg ttgcccgatg agaccgcaac caggtcgtag 3864841 tcgtcgacca cgacatagat atccggaccc gaccaccagg acctggctcg cagctgcgcc 3864901 tggctcacat ccggggcggg catccgcgcc tggagcaggt cgaccagact cgacagcttg 3864961 gcacccagcg ccgccggcga gctgacgtag ccgctcatat gttccgactc gatgacgtcg 3865021 agcagggtgt gccggaagtc gacgatgaga agttgggctc gcgcggcggt atgggtccgg 3865081 acgatctcgc ggcacagggt ccgcaacgcg gccgtctttc cgcactcgtt gtcgcccagc 3865141 accagcaggt gcgggtggcg tccgaaatcg acggccaccg gctggcctcg acgttcctcg 3865201 aggccgagca agatgtgcgc accgagttcg tcgccggctc gggccacgac gctgtcgtag 3865261 tccacgcgcg cgggcagtag cggtatcggg ggcgccaccg gatcaccact tcggcgtcgt 3865321 agcgcaactc catccaggtc gggcagggcg atcaccatgt gcatcccgtc gcgggagagg 3865381 ccacggcccg gtctgtcgac cggcacccgt tgcgcctgcc tacggtccaa ttcggaatcc 3865441 gcgggatccg ccagccgtaa ctcgattcga ctgccgatct gatcccgcag cgacggcctg 3865501 atctccgccc accgtgctgc cgatagcgcc acatgtacgc cgaatgaaag cccttgagct 3865561 gccagggcaa cgatcgactc ctcaagggcc gcgaactcct ggcgtaagct tgcccagccg 3865621 tcgatgacaa gaaatatgtc cgcaaaagac tcagcggccg actttgctcg cagctggcgg 3865681 taccgcgcca ccgagtcgat gccgtggtcg cggaagaatg cctcccgaaa tcgcacggcc 3865741 gactccagtt cggcgagcat ccgcgatgcc agctgcggct gcgccctgcc ggccacggca 3865801 cccacatgcg gcagttcgtc cacctgggcc agcgccccgc cgccgaagtc caaacaatag 3865861 aactgcaccc ggcccgcatc gtgggtagca gccaacgcca tgatcagcgt ccgcagcgcg 3865921 gttgacttgc ccgtttgcgg tgcacctacg accgcgacat tgcctgcggc cccggacaag 3865981 tcgatcgtca gcggcacccg tgactgctcg aacggccgat cgacaatgcc gatgggtacg 3866041 gccagctcgg cctgcgccgg ctcagcgtca cgcagtaggg cgcccagcat cggtggctcg 3866101 tccagcggcg gtagccagac ttgatgcgca gccggtccat gaccgaccag ccggtcgagc 3866161 accgcatgca agacggtagg cgtgggcacc tcggctgtcc cgccgacggg accggctgtg 3866221 accggcgccg cagcgtgcgt ggtgaacggt cgcaccgacg gcggggctac cgggtggacc 3866281 gctgagggac tcgcccgtcg aagcggcccg gaaacgaacg cggtctgaaa tcggatcagc 3866341 tctccggttc ccgtttgcag caagcccgca ccgggggtgt tgggcagttg atatgcgtcc 3866401 tgcgtcccga gcacgttgcg tgattcactg gcggaccacg ttttcaggca cattcgatag 3866461 gacagatggg tttccagtcc acgcagtcgg ccctcgtcga gccgctgact ggccagcagc 3866521 aaatgcatgc ccagcgaccg gcccacccga ccgatcgcga ggaacacgtc gacgaattcg 3866581 ggatgttggc tcagcaattc ggaaaactcg tcgacgacga tgaacaggat cggcaggcag 3866641 ggaagttgcg cacccgtttg gcgtgcccgc tgatatgccg tgacactgac caagtggcct 3866701 gccatccgca gcagctgttg ccggcggctc atctcgccgg ccaatgcgtc ttgcatccgt 3866761 gcgaccagcg gtgcttcctc ggcaaggttg gtgatgaccg cggctacatg tggggctccc 3866821 gcgaggtcga gaaatgttgc accacccttg aagtcgacca gaaggaggtt gaggacttcg 3866881 ggcgaattgc gtgccatcat ccccagcgcg atggtacgca gcagctccga tttgcctgat 3866941 ccggtggcgc cgacgcacag cccgtgtgga cccatgccct gttccgcggc ttccttgatg 3867001 tctagctgca cggcggtacc gtcgggcgtg actccgatcg ggacacggag ccgatcatgt 3867061 tggtttacgt tgcgccacaa cgtgctcgga tcgaaagcgg ccacatcgcc gatgccgacc 3867121 agttccgccc aacccgagcc acggatgaac gtgcgacccg agtgcccgac ccggtgagcg 3867181 gccagccgac gggcgcatac cagcgcgtct tgaggctcca gctggtccgg gcacgctagc 3867241 gctgtcactt cgccggcaca tctgaccacc ggcggtgcac cgtctcgtct ggcgcccacc 3867301 tcgatcgtga tcacgccggt gatcgcgccg ttgccacgtt cggccgtgtc gacgatcgca 3867361 acaacgtggg ccaataccgt tgcggctagc gcattttgca tctctgccag ggtcgagtac 3867421 accatcgggg ctggccccaa ggcatcacag gcattcggat gttggttgtg cggcagccat 3867481 ttcagccaat cccagtgcgc gcggttgcgg tcactgacca cgccggcgat cagcaactcc 3867541 tccggtgagt gccatacggc cagctggcag atcatcgccc gcagcagccc gcggaccttg 3867601 gtcgggtcac cgtcgatggc gatcggaccg ccgacccgca aggggatcgc gatgggcgca 3867661 tccgcaatgg tcgcgtgtgc ggcaaggaaa cagcgcagcg cggcgcgggt gaccggatcc 3867721 gcacgctgcg ccggcggaag ctgcccgacc accaagcggg tggccagcgg tgcagatcca 3867781 actccgacac ggatgcgaca gaagtcggca gcacccggtc gacgctccca cattcgcgga 3867841 ccaccgatca atgtccacaa ggtggcagga tcgggatgcg tccagttcag tgatacgtgt 3867901 tgtgctgcag ccgtttgggt gacagatgtg cgcaagacac tcaggtaccc gaggtagtcg 3867961 acacggtcgt tgtggatacc ggagacatgc cgccggccgc gtccggttac cgcagtcacc 3868021 accaacgaga ccagcatcat cattgggaag gccagaaacg tggggtggcg cgtggccggc 3868081 gagcccggca agaacaccgt caccatgaca cccacggtcg ccaccgacat gacgaccggg 3868141 agcaggcgaa tcagcaggct ggacggttcc gaccgccgca actcgggcgg cggggcaacc 3868201 aggatgtccg cagtcgcgca cgccggccct gaattcatgc tgggcgacgg tatgcagcgc 3868261 gagaatccgc cgcaagtcgc ttgtggacaa ccgaataccg ggcgatcgag aaccggctac 3868321 cgttccggtg atccgagaat aaagggggag aatgcctacg tctgatccgg gactgcgccg 3868381 ggtcaccgta catgccggcg cccaggccgt cgacctgacc ttgcccgccg cggtgcccgt 3868441 cgcgactctg atcccgtcga tcgtcgacat cctgggtgac cgtggcgcca gcccggcgac 3868501 ggcggcgcgc taccagctgt ctgccctggg ggcgccagct ctgccaaacg caacgacatt 3868561 ggcgcaatgc ggtatccgcg acggcgccgt cctggtcttg cataagtcca gcgcccagcc 3868621 gcccaccccc cgctgtgacg atgtggccga agcggtggcg gcggcgcttg acaccacagc 3868681 ccggccccaa tgccagcgca cgacccggct cagcggtgcg ctggcggcaa gctgcatcac 3868741 cgccggcggc ggcctgatgc tggttcgaaa cgccctcggc accaacgtaa cccgctactc 3868801 cgacgccacg gccggagttg tagcggcggc cggcttggct gccttgctgt ttgcggtgat 3868861 tgcatgccgg acatatcggg acccgatcgc cggcctcacg ttgagcgtta tcgccaccat 3868921 attcggtgct gttgccggcc tactggcggt gcccggggtc cccggtgtcc atagcgtgct 3868981 agttgccgcg atggcggcgg ccgccacgtc ggtgctggca atgcgcataa cgggttgtgg 3869041 gggtatcacg ttgaccgcgg tggcgtgctg cgcggtagtc gtcgcggccg ctacgctggt 3869101 cggcgcgatc actgcggccc cggtgcctgc catcggttcg ctggccacgc tggcatcctt 3869161 tggtctgtta gaggtatccg cgcggatggc agtcctgttg gcggggttgt cgccacgatt 3869221 gccgcccgcg ctgaaccccg acgacgccga tgccctgccc accacggatc ggctgaccac 3869281 ccgagcgaac cgtgcagatg cttggttgac gagcctgctg gcggccttcg cggcctcggc 3869341 gaccatcggt gccatcggaa ccgccgtcgc aacccacggc atccacaggt ccagcatggg 3869401 cggtatcgcg ttggccgccg tcaccggtgc gctgctgctg ctacgagcac gttcagcaga 3869461 caccagaagg tcactggtgt ttgccatctg tggaatcacc accgttgcaa cggcatttac 3869521 cgtcgccgcg gatcgggctc tggaacacgg gccgtggatt gccgcgctga ccgccatgct 3869581 ggccgccgtg gcaatgtttt tgggcttcgt cgctcccgcg ttgtcgctct cgcccgtcac 3869641 gtaccgcacc atcgaattgc tggagtgtct ggcgctgatc gcaatggttc cattgaccgc 3869701 ttggctatgc ggcgcctaca gcgccgttcg ccacctcgac ctgacatgga catgaccacg 3869761 tcccgtaccc tgcgcctgct ggtggtatca gcgctcgcga cgctgtctgg gttgggaacg 3869821 ccggttgccc acgcggtttc gccgccgccg atcgacgaaa gatggctacc cgaatctgcg 3869881 ctgccggcgc cgccgcggcc gaccgtacaa cgtgaggtat gcaccgaggt caccgccgaa 3869941 tcgggacggg ctttcggccg ggctgagcgg tccgctcaac tcgccgacct cgaccaggtc 3870001 tggcgactca cccgcggcgc cggccaacgg gtcgcggtca tcgacaccgg cgttgcgcgc 3870061 catcgacggt tgcccaaggt ggttgccggc ggtgactatg tcttcaccgg ggacggcacc 3870121 gcggattgcg atgcacacgg cacgctggtg gccggaatta tcgcggccgc accggatgcg 3870181 caaagcgaca atttcagcgg ggtggcaccc gatgtcacct tgatcagcat tcgccagtcc 3870241 agcagcaagt tcgcaccggt cggcgacccg tccagcacag gtgttggtga cgtcgacacc 3870301 atggcgaagg ccgtgcggac ggccgccgac ctcggcgcgt cggtgatcaa catctcgtcg 3870361 attgcctgcg ttccggccgc ggctgcgccg gacgaccgcg cgctaggtgc cgctttggcc 3870421 tatgcggtcg atgtcaagaa cgccgtcatc gtggccgcgg ccggcaatac cggcggcgcc 3870481 gcgcagtgtc cgccgcaggc ccccggggta acccgggaca gcgtcacggt tgcggtgagt 3870541 ccggcctggt acgacgacta cgtgctgacc gtaggttcgg tgaacgccca aggcgaaccc 3870601 tcggcattca ctctcgccgg cccctgggtg gatgtcgccg ccaccggcga ggcggtgacc 3870661 tcgctcagcc cgttcggtga cgggaccgtg aacaggcttg gcggacagca tggttcgatt 3870721 ccgatatccg gaaccagtta tgcggcgccg gtcgtcagcg gcctggccgc cctgatccgg 3870781 gcccgctttc cgacgttgac cgcacggcag gtgatgcagc gcatcgaatc taccgcgcat 3870841 cacccacccg ccggatggga tccgctcgtc ggcaacggca cggtcgatgc cctggctgcg 3870901 gtcagcagcg actcgattcc gcaggccggc accgcaacga gcgaccccgc tccggtggcg 3870961 gtgccggtcc ctaggcggtc aacgcccggc ccatcggatc gccgcgccct acacaccgcc 3871021 tttgctggtg ccgcgatctg cctgctcgcg ctgatggcaa ccctggccac cgccagccgc 3871081 cggctacggc ccgggcgcaa cggtatcgcg ggcgactgac gcgttggctc tactcagctc 3871141 cggtccggac ggcagtgtcg ccaacaccgg ccacggcgcc gggatggcag ccgtcggcag 3871201 accgaggtcg tgtgccacgt cgtcgtcgtg gatcgcgaac cgcactccgg tgtcggtgac 3871261 caggtagcgc gtgccggtgc cgccgccgga caggctgcgc gcggctacgt aggcgctgcg 3871321 tcccggcggc aggtacaccg cgtccagtgc ggggccgcga ccgtcggctt gtgccagtgt 3871381 caccggaacc cctccgaggg gcaccggcgg gccgctgccc gccaagaacg cgacgcgagc 3871441 agcacccggc tgcgcgggcg tccaggtcac gcacaacgtg gtgaccgccc ttcccggcga 3871501 gccgtccacc ggtgttggcg gccggtcggg aaaggccgac accggcaagg tgttcacgat 3871561 cggagcgacg cgaatcacat cgggggccac cgtcgggacg ttgacgctgc cctgcgaatc 3871621 gccgaaccgc aacaaatccg cggcgacctg gccgatgcgc tgcacgccgt cctccagcac 3871681 cacgtaatac tcatcaccgc tcgcgcgagt gatgcgcacc acaccgccga ccagaaaccc 3871741 gggcagcccg accgaggccc gcccgccgcc acgaatccgg ggagccgtga tgcgcggtgc 3871801 ctccgggacg gcgttgagca acgattgcgc gaccacgtgc gggacccggc cctgcagccg 3871861 cagcgcccac accaccgccg ggtcggccag atccaccacg gcccgccgac cgccgtagag 3871921 caggtaggtg ggcgaacctg attcggtcgc caccaggatc atctgttcgg cggtcagcac 3871981 ctgcgccgac gagtcttcgg cgggcccgac gacgacagtc gttgatccgc cattgtcgct 3872041 atcgcagatc gcccacgccg attcggcgcc ggctagcggc tggtcaagca gctgcggcgc 3872101 acctggaata ccgagcagtg gaccgcgttt ggtgtggccc aattcggact cggacaccgg 3872161 ttgcgggttg gcgttcgtcg ccgcgatcaa ccgcgccgaa gccaggttca acaccggatg 3872221 ccagacatcg tccactcgca cgtagagtgc cccggattcc cgacccatca cgatcggcgc 3872281 ctgaccgagc gccgactgtg gccgcagcag cgcaacgaat gcgcatccca tcgcggcgac 3872341 gatcgccagc acgcacccga gggccagcga tgttgtgcgc gcgcgcagtg ctccggtcgc 3872401 tgcgcagaca tccccgaaca gcaacgcgca ctcgatgcgc cgcagcagaa atcggtaccc 3872461 gctgacgtgc agccaggtcg tcgctgggct cggcactggc tctcccacgg tggcgcgctg 3872521 atttctcccc acggtaggcg ttgcgacgca tgttcttcac cgtctatcca cagctaccga 3872581 catttgctcc ggctggatcg cgggtaaaat tccgtcgtga acaatcgacc catccgcctg 3872641 ctgacatccg gcagggctgg tttgggtgcg ggcgcattga tcaccgccgt cgtcctgctc 3872701 atcgccttgg gcgctgtttg gaccccggtt gccttcgccg atggatgccc ggacgccgaa 3872761 gtcacgttcg cccgcggcac cggcgagccg cccggaatcg ggcgcgttgg ccaggcgttc 3872821 gtcgactcgc tgcgccagca gactggcatg gagatcggag tatacccggt gaattacgcc 3872881 gccagccgcc tacagctgca cgggggagac ggcgccaacg acgccatatc gcacattaag 3872941 tccatggcct cgtcatgccc gaacaccaag ctggtcttgg gcggctattc gcagggcgca 3873001 accgtgatcg atatcgtggc cggggttccg ttgggcagca tcagctttgg cagtccgcta 3873061 cctgcggcat acgcagacaa cgtcgcagcg gtcgcggtct tcggcaatcc gtccaaccgc 3873121 gccggcggat cgctgtcgag cctgagcccg ctattcggtt ccaaggcgat tgacctgtgc 3873181 aatcccaccg atccgatctg ccatgtgggc cccggcaacg aattcagcgg acacatcgac 3873241 ggctacatac ccacctacac cacccaggcg gctagtttcg tcgtgcagag gctccgcgcc 3873301 gggtcggtgc cacatctgcc tggatccgtc ccgcagctgc ccgggtctgt ccttcagatg 3873361 cccggcactg ccgcaccggc tcccgaatcg ctgcacggtc gctgacgctt tgtcagtaag 3873421 cccataaaat cgcgtcatga ggttcatcgg ggtgatccca cgcccgcagc cgcattcggg 3873481 ccgctggcga gccggtgccg cacgccgcct caccagcctg gtggccgccg cctttgcggc 3873541 ggccacactg ttgcttaccc ccgcgctggc accaccggca tcggcgggct gcccggatgc 3873601 cgaggtggtg ttcgcccgcg gaaccggcga accacctggc ctcggtcggg taggccaagc 3873661 tttcgtcagt tcattgcgcc agcagaccaa caagagcatc gggacatacg gagtcaacta 3873721 cccggccaac ggtgatttct tggccgccgc tgacggcgcg aacgacgcca gcgaccacat 3873781 tcagcagatg gccagcgcgt gccgggccac gaggttggtg ctcggcggct actcccaggg 3873841 tgcggccgtg atcgacatcg tcaccgccgc accactgccc ggcctcgggt tcacgcagcc 3873901 gttgccgccc gcagcggacg atcacatcgc cgcgatcgcc ctgttcggga atccctcggg 3873961 ccgcgctggc gggctgatga gcgccctgac ccctcaattc gggtccaaga ccatcaacct 3874021 ctgcaacaac ggcgacccga tttgttcgga cggcaaccgg tggcgagcgc acctaggcta 3874081 cgtgcccggg atgaccaacc aggcggcgcg tttcgtcgcg agcaggatct aacgcgagcc 3874141 gccccataga ttccggctaa gcaacggctg cgccgccgcc cggccacgag tgaccgccgc 3874201 cgactggcac accgcttacc acggccttat gctggcgccg gaccccgccc gccaggcgcg 3874261 ccgcccgtca acgcagccga atgcgcattt gtccgccgaa tgcgccgcga tgaaccgcaa 3874321 tcatttcacc ggaagggaag tgtgcggaca cgctaaccgg acgctcgggc taacttcgac 3874381 cgctattgcg ctgaggaggg ttgatgccgg gcgtcataac aaacagtgaa agcccaaccg 3874441 cagccgacca cgacagaatt acggccacca gagagacgct ggaggattac acactgcggt 3874501 tggcgccgcg cagctatcgc aggtggcccc cggcggtggt gggcatctcc gctctcggcg 3874561 gcatcgccta cctggcggac ttcgcgatcg gcgccaatgt cggtatcacg tggggtaccg 3874621 cgaacgcgct gtgcggaatc gcaatcttcg cactggtggt cttcgtcacc ggcttgccgc 3874681 tggcctacta cgcggcgcgg tacaacatcg acctggatct gatttacccg cggtagcggt 3874741 ttcggctact acggctcggt ggtcaccaac gtcatctttg ccacgttcac gttcatcttc 3874801 tttgccctgg agggctcgat catggctcag ggccttaagc taggcctgca cattccgctg 3874861 tgggcgggtt acgcgtgctc gaccctgatc atcttcccgc tggtggtcta cgggatgaaa 3874921 gttttgtcac agctgcaact ttggaccacc ccgctctggc tgatcctgat ggcggcccca 3874981 tttggctacc tggtagtcag ccatcccgat tcgattggac agtttttctc ctacgccggc 3875041 aaggatggtc atggcggcct tagcttcggt tctgtcctgt tggcagcggg agtgtgcctg 3875101 tcactcatcg ctcagatcgc cgagcagatc gactacctgc gcttcatgcc gccacggacg 3875161 ccggagaacg cgaacaggtg gtggacgtgg acgctgctgg ccggtcccgg ctgggttgca 3875221 tttggggcga ccaaacagat catcggcctg ttcctggcgg tctatctgat ggccaacatc 3875281 cccggctcgt cgacaatcgc caaccagccg gtgcaccaat tcatgcagat ataccgcacc 3875341 ttcgtaccgg gctggctggc gttgacactc gccgtcatcc tggtggtctt gagccagatc 3875401 aagatcaacg tcacgaacgc gtattcgggc tcgctggcgt ggaccaattc attcacacgg 3875461 ctcaccaagc actatcccgg gcgggtcgtg tttcttgggg ttaacctcgc gattgcgttg 3875521 attctcatgg aagccaacat gtttgacttc ctgaacacaa tcctgggttg ctacgccaat 3875581 tgcggtatgg cctgggtggt ggcggtggcg tcggacatcg gcttcaacaa gtatctgctc 3875641 ggcctgtcgc cgaagactcc cgaattccgc cgcggcatgc tatacgccat caacccggtc 3875701 ggcttcgggt cgttgctgct ggccgcgggg ctgtcgatcg tcaccttctt cggcggtctg 3875761 ggtgcggcac tgcagcctta ttcaccattg gtggcaatcg tcaccgcgtt ggtaatgccg 3875821 cccattctgg cagccgcgac caaaggcaag tactaccttc gccgcacgca cgacggtatc 3875881 gatctgccca tgtacgacga gcacggcaat ccctcggccg cggtgttgac ttgccatgtc 3875941 tgccaccagg atttcgagcg gcccgacatg ctggcctgcc agacccatgg tgcgcatgtc 3876001 tgttcgctgt gcttgtccac ggacaagcag gccgagcatg tgcttcctgg gttagcccga 3876061 gcgcacatcc cgggtgacca agttccgtga cgcgagctgg tcatcgggcg gatagtccac 3876121 ctggatcaac gtcaacccgt gcgccggcgc gaccgcgaag tcgctggatc gtcctgtcgc 3876181 ggtgagcagc tcacgacacc aagttgtcgc gcgacggtgc tcgccgaccg ccagtagcgc 3876241 ccccaccaac gaccgcacca tcgaccaaca gaacgcgtcg gcggtgacgt gcgcggtgac 3876301 cagggtgccg gcacgcgacc agtccagccg ctgcagatca cgaatcgtgg tggcgccctc 3876361 gcgatgacgg cagaacgccg cgaagtcgtg cagccccatc aaatctcgcg acgcggccgt 3876421 catcgcatcc agatcaagct cgcgtggcca agcggtgatg tagcgcgcct gctgcggctc 3876481 gacaccgtag ggtgctgtcg acagccggta cacgtaatgc cgccgcagcg ccgagaatct 3876541 ggcgtcgaaa cccgctggtg cgcgcgtgat atcgaggatt cgaacgtcgg cgggcagaaa 3876601 tcgacccagc ctccgcaaca gcggcaggaa ttccggatca ccgacgtggc cggcgcgcgg 3876661 gtaagcgttc ggcaaggcat cggcgggcac gtcaacgtgg gcgacctggc cgctggcgtg 3876721 cacgcccgca tcagtgcgtc cggccgcccg cagccgcacc ggggtgcgga agatggtagt 3876781 cagcgccgca tcgagatcgc ccgcgaccgt gcgctgcccc acttgtgcag cccagcccgc 3876841 gaaatcggtt ccgtcgtagg cgatatcgag ccgaagacgg acaacgccgc taattctcgg 3876901 gggcctctgc ggggggctct tcgggggcct tcgcgtcagg ctcactggcg ccgaccacgt 3876961 caccctcctc agcaggcttg gcctcggact cctcggtcgg catggccgcc gccttcttgg 3877021 ccttcgcctg cgcggcagct acccggcgtg ctcgattggc ctccgaggtc accgtcttct 3877081 cccggaccag ttcgatcacg gccatcggag cgttgtcgcc cttacgtgcc tcgattttga 3877141 tgatacgggt gtagccacca tcgcggtcgg cgaagaacgg tccgatctcg gcgaacaagg 3877201 tatgcaccac atccttgtca cggagcttct tgagcacctc gcgccggttg tgcaacgcgc 3877261 cttttttggc atgcgtgatc agcttctccg cgtacggacg cagcgcccgg gccttcggct 3877321 cggtcgtcgt gatccgccca tgctcgaaca gggacgtggc gaggttggcc aagatcgcct 3877381 tctgatgtga agacgacccg ccgaggcgag ggcccttggt aggcttgggc atagctgacg 3877441 ctcctgtctg gattagaggc agtctaaagc tgttcggttt cggcgtagtc ctgctcgtcg 3877501 tacgcgccct cggtcgacca ggtgccggtg gcgacgtcgt agcccgcgac ctccgagggg 3877561 tcgaagctcg gcgggctgtc cttgagtgac aggcccagct ggtgcagctt gatcttcacc 3877621 tcgtcgatgg acttctgacc gaagttgcgg atgtcaagca ggtcggattc ggtgcgcgcc 3877681 accagttcgc ccacggtgtg caccccctcg cgcttgaggc agttgtagga ccgcaccgtc 3877741 agatccaggt cgtcgatcgg cagggcgaat gacgcaatgt gatcggcctc ggccggcgac 3877801 ggcccgatct cgatgccttc ggcctcgacg ttgagttccc gtgccaggcc gaacaactcg 3877861 accagcgtct tgccagccga cgccagcgcg tcgcgcgggc tgattgaatt cttggtctcc 3877921 acgtccagga tcagcttgtc gaagtcggtg cgctgctcga cccgggtggc gtccaccttg 3877981 taggtcactt tgagcaccgg tgagtagatg gaatcgactg gaatgcgccc aatttcggca 3878041 cccgaagccc ggttttgcac cgccgggaca tagccgcggc cacgctcgac gacgagctcg 3878101 acttccagct tgcccttatc gttcagcgtg gcgatgtgca tgccggggtt gtgcacggtg 3878161 acgccggccg gcggcacgat gtcgccggcg gtaacctcac ccggaccctg cttgcgtagg 3878221 tacatggtga ccggctcgtc ctcctccgag gacaccacca ggctcttgag attcaggatg 3878281 atctcggtga catcttcttt gaccccgggc accgtggtga attcgtgcag tacaccatcg 3878341 atgcgaatgc tggtgacggc cgctccggga atcgacgaca gcagggtgcg acgcagcgaa 3878401 ttgcccaggg tgtagccgaa tcccggctcc agcggttcga tcacgaactg ggatcggttg 3878461 tcggtgagga cgtcctcgga cagggtgggg cgctgtgaga tcagcatggt gtttcttctt 3878521 cctttcgacg tccgccatat gacgtctgtg ggggcactcg ggggcggcgc ccccgagggt 3878581 gggggtactc ggggggcggc gccccccgag ggttgggttg gggggtactc ggggggcggc 3878641 gccccccgag ggttgggttt actttgagta gtactcgacg atcagctgct cggtgagtgg 3878701 gacgtcgatc tgcgcgcgct cgggtagctg gtggatcagg acgcgttgcc gctcccccac 3878761 cacttgcagc cagctcggga tcggacgctc gcccgccgtc tcccgggcaa tctggaacgg 3878821 caccgtgttc agggacttgt cccgcacgtc gacgatgtcg tactgcgaca cccggtaact 3878881 ggggacgttg acgtgcacgc cgttgacgtt gaaatgcccg tggctgacca gctggcgagc 3878941 catccgccgg gtgcgcgcca gcccggcacg gtagatgacg ttgtccagcc ggctttcgag 3879001 gatcttcagc agttcttcac ccgtcttgcc gggctgccgc acggcctctt cgtagtagcg 3879061 gcggaactgc ttttccatta cgccgtatgt gaaacgggcc ttctgcttct cctgcagctg 3879121 aagcagatat tcgctttcct tgatccgcgc gcgaccgtgt tggccgggcg ggtagggacg 3879181 cttctcgaag gcctggtcgc caccgacgag gtcggtgcgc aaccgccgtg atttgcgggt 3879241 gacgggtccg gtgtaacgag ccatcttctc tcctagacgc gccggcgctt ggggggccgg 3879301 acaccgttat gcggctgggg ggtgacatcc gagatcgcgc ccacctccag gccggcggcc 3879361 tgcagcgacc ggatcgcggt ctcgcggccc gagcccgggc ccttgacgaa cacgtcgacc 3879421 ttgcgcaccc cgtggtcttg ggccttgcga gcggcgttct ccgcggccag ctgggccgca 3879481 aacggggtcg atttccggga acccttgaag ccgacgtgcc ccgacgatgc ccaggcaatg 3879541 acgttgcctt gcgggtcggt gatggtcacg atcgtgttgt tgaacgtgct cttgatgtgg 3879601 gcggcgccgt gcgggacgtt cttcttctcc cgccggcggg tcttctggcc cttcctagcc 3879661 gacgttgccg gccctttttt tgctggtggc atcggttacc tagccttctt cttgcctgcg 3879721 atggtgcgct tggggccttt gcgggtccgc gcgttggttt ttgtccgctg gccgcgtacc 3879781 ggcataccgc ggcggtgccg caacccctga tagcagccaa tctcgatctt gcgacggatg 3879841 tcggcctgta cctcgcggcg caggtcaccc tccaccttca ggttcgcttc gatgtagtcg 3879901 cgcaggtgga tcagctgttc ttcggtgaga tctctggtgc gcagatcccg gtcaatgccg 3879961 gtggccgcca ggatttcgtt cgagcgggta cggccgatgc caaagatgta ggtcagggcg 3880021 acctccatcc gcttatcgcg cggcaggtcg acgccgacga gtcgagccat aggtggcgtt 3880081 tcctcttcct ctgcggaggt atggtcccag tccgttccct gcccaaaaaa gatctttggg 3880141 tgtggggccc ggcctccgtc cgggcgtgaa tgagctggcc catctccatc gatgccagcc 3880201 gctcattggt gctgggggtc tgcatttagt tgtcgggccg tccggctcct cctcggacca 3880261 ctacgcggcc cgcatcgtcg ccgaactagc cctgcctttg tttgtgacgc ggatcggaac 3880321 agatcaccat aacccgcccg tgccgacgga tcagcctgca cttgtcacag atcggcttga 3880381 cgctcgggtt taccttcacg actgtctcgg tcctgttcta tgggtatgtc gctacttgta 3880441 ccggtacacg atgcggcccc gggacaggtc gtagggcgac aattccacca ccacccggtc 3880501 ctcgggcagg atgcgaatgt agtgctgacg catcttgccg ctgatgtggg cgagcacctt 3880561 gtggccgttc tccagctcaa tgcggaacat ggcattgggc aggggctcga ccacgcgacc 3880621 ctcgacctct atggcaccgt ccttcttggc cattactttc tggcgatcct tctcttcctt 3880681 gtcggtgcac ccgattccgg cgcagcacgt gctcggacta caaacgtgag ccggtggtgg 3880741 aaattccgcg aagggctccg agaaattttc aaaactgggc acgccaaacc ggcacgggac 3880801 accgcaccgc caacccacat tacccgcatc gccgtgctct gcgcaaaacg ccgtaggcca 3880861 cgcgctcacc ggaatagcac cggtgagccg agcggttaga gcaaccatga ccaattgtgc 3880921 cgccggcaaa cccagctcag gccctaacct cggccgattc ggatcgttcg gacgcggcgt 3880981 caccccccag caggccacag aaatcgaggc gctgggctac ggggcggtct gggtgggagg 3881041 ctcaccaccc gccgcactgt cctgggtgga accgattctg caagcgacca ccacattgtg 3881101 tgtggccacc ggcattgtca atatctggtc ggcaccggcc cagcgagtcg ccgaatcgtt 3881161 ccaccgcatc gaggcggcct acccgggccg ctttctgctg ggtatcggag tcgggcatgc 3881221 cgagatgatc agtgagtacc gcaagcccta caacgcgctg gtggaatacc tagaccggct 3881281 cgacgactat ggggtgcccg ccaaccgccg ggtggtggcc gcactgggcc cccgggtcct 3881341 gggcctgtcc gcacgccgca gcgccggggc gcacccgtac ctgaccacac ccgaacacac 3881401 ggcacgggcc cgtgagctga ttggtccgtc ggcgttcctg gcgcccgaac acaaggtggt 3881461 gctgaccacc gactcggcaa gggcccgtac ggtgggacgc caggcgctcg atatgtactt 3881521 caacctggct aactaccgca acaactggaa acggctgggc ttcaccgacg acgaagtctc 3881581 ccggccgggc agcgaccgcc tggttgacgc cgtggtcgcc tacggcactc cagacgcgat 3881641 cgcggcacgg ctgaacgaac acctgcttgc aggcgccgac catgtcccta ttcaggtcct 3881701 caccgaagat gacaacctgg tgtcggcgct gaccgaactc gcgaagccgc tccgactgac 3881761 ttgatcccga aacggagggt tgcgaaccca actggtcgcg gctccactcg gttaaggctc 3881821 ggttagggtt tgatccatgc ggttgctagt caccggtggc gcgggattca tcggcacgaa 3881881 tttcgtgcac agcgccgtac gtgagcatcc agacgatgcg gttaccgtac tcgacgccct 3881941 gacctacgcc ggccggcgcg agtcgctggc cgacgtggag gatgccatcc ggctggttca 3882001 gggcgatatc accgacgccg agctggtttc gcagctggtg gccgagtccg acgcggtggt 3882061 gcattttgcc gccgaatccc atgtcgacaa tgcactggac aatccggagc cgtttctgca 3882121 caccaacgtc atcgggacct tcaccatcct ggaagcggtg cgacgccacg gtgtgcgcct 3882181 gcaccacatc tccaccgacg aggtctacgg cgacttggag ctcgacgacc gggcgcggtt 3882241 caccgaatcg acgccctata acccgtccag cccttactcg gcgaccaagg cgggcgcaga 3882301 catgttggtc cgggcctggg ttcggtccta tggcgtacgc gcgacgatct ccaactgctc 3882361 caacaactac gggccgtatc agcacgtcga gaagttcatt ccgcgtcaga tcaccaatgt 3882421 gctcaccggg cggcggccca agctctacgg cgcgggcgcc aatgtccgtg actggatcca 3882481 cgtcgacgac cacaacagcg cggtgcggcg aatcctggac agaggccgca tcggccgaac 3882541 ctacctgatc agctccgagg gcgagcgtga caacctgacc gtgctgcgca cgctgctgcg 3882601 actgatggac cgcgatccgg acgacttcga ccacgtcacc gaccgcgtcg gccacgacct 3882661 gcgctatgcc atcgacccgt ccacgctcta cgacgaatta tgctgggcgc caaagcatac 3882721 cgatttcgag gagggcctgc ggaccacgat cgactggtac cgcgacaacg aatcgtggtg 3882781 gcgtccacta aaagacgcca cggaggcccg ctatcaagaa cgcggtcaat gagatgaaag 3882841 cacgcgaact cgacgtcccc ggcgcctggg agattacccc gaccatccat gtcgattccc 3882901 gcggactgtt cttcgaatgg cttaccgatc atgggttccg cgcattcgca ggtcacagtt 3882961 tggacgtccg gcaagtgaac tgctcggtgt catcggccgg tgtgctgcgc ggcctgcact 3883021 ttgcccagtt gccgccgagc caggccaagt atgtgacctg cgtttccggc tcggtgttcg 3883081 atgtcgtcgt cgacatccga gagggctcac cgacattcgg ccgatgggac tcggtgctgc 3883141 tcgacgacca agaccgtagg acgatctacg tctccgaagg cctagcgcac ggcttccttg 3883201 cactgcaaga caattcgacg gtgatgtact tgtgctcggc ggaatacaat ccgcagcgcg 3883261 agcacaccat ctgcgccaca gatccgacgt tggcggtcga ttggccgctg gtcgatggcg 3883321 ctgcccccag cctgtccgac cgtgatgccg ctgcgcccag cttcgaggat gtgcgcgcgt 3883381 ctggcctgct gcccaggtgg gaacagacgc agcggttcat tggggagatg cgcggcacct 3883441 agctcggtaa tcccttgtgt tgctttagct tcagcggtca cagcgcggcg attgttgtcg 3883501 gtggcccctc gtagaatttg gggtatgggt tcgggtagcc gcgaacggat tgtcgaggtc 3883561 tttgatgcgc tggatgccga gctggaccgc ttggacgagg tgtcttttga ggtgttgacc 3883621 accccagaac ggctgcggtc tctggaacgt ctggaatgct tggtgcgccg gctaccggcg 3883681 gtgggtcacg cgttgatcaa ccaacttgac gcccaagcca gcgaggaaga actgggcggc 3883741 acgctgtgct gcgcgctggc caaccggtta cgcatcacca agcccgacgc cgcccggcgc 3883801 atcgccgacg ccgccgatct cggacctcgt cgagcactca ccggtgaacc gctagcccca 3883861 cagttgaccg ccaccgccac cgcccaacgc cagggcctga tcggcgaggc gcacgtcaaa 3883921 gtgattcgcg ccctttttcg cccacctgcc cgccgcggtg gatgtgtcca cccgccaggc 3883981 cgccgaagcc gacctggccg gcaaagccgc tcaatatcgt cccgacgagc tggcccgcta 3884041 cgcccagcgg gtcatggact ggctacaccc cgacggcgac ctcaccgaca ccgaacgcgc 3884101 ccgcaaacgc ggcatcaccc tgagcaacca gcaatacgac ggcatgtcac ggctaagtgg 3884161 ctacctgacc ccccaagcgc gggccacctt tgaagccgtg ctagccaaac tggccgcccc 3884221 cggcgcgacc aaccccgacg accacacccc ggtcatcgac accacccccg atgcggccgc 3884281 catcgaccgc gacacccgca gccaagccca acgcaaccac gacgggctgc tggccgggct 3884341 gcgcgcgctg atcgcctccg ggaaactggg ccaacacaac ggtcttcccg tctcgatcgt 3884401 ggtcaccacc accctgaccg acctgcaaac cggcgccggc aagggcttca ccggcggcgg 3884461 caccctgcta cccatggccg atgtgatccg catgaccagc cacgcccacc actactcccc 3884521 cgcaagcggg aggtaccccc aggcgatctt cgaccacggc acacccctgg cgctgtatca 3884581 caccaaacgc ctagcctccc cggcccagcg gatcatgctg ttcgccaacg accgcggctg 3884641 caccaaaccc ggctgtgacg caccggccta ccacagccaa gcccaccacg tcaccgcctg 3884701 gaccagcacc ggacgcaccg acatcaccga gctgaccctg gcctgcggcc ccgacaaccg 3884761 actcgccgaa aaaggctgga ccacccacaa caacacccac ggccacaccg aatggctacc 3884821 accaccccac ctcgaccacg gccaaccccg caccaacacc ttccaccacc ccgaacgatt 3884881 cctccacaac caagacgacg acgacaaacc cgattgaccc ccagcagtca aagccacacg 3884941 ccacaacgcc gcacaaccat aaacaccgag tccgtcaggg cctggccgga gcaaacacgc 3885001 cacggtggta ggagctgtgg gcatatgcct tggagcccac cagttgtgac aacggcgtgt 3885061 gcaccgactg cccgcgtgcg agtctggcgg cgaccgcgtt taggtcgaac cgcggacgcc 3885121 agttcaagtc acggcgagcg cgggagttaa cgtacacgcg gtcgaggcgg tcggggaagc 3885181 gccaaccacg ctgggtccac acagccgcgg ccagcggtac ccgccgggcg aacaccgatg 3885241 ccgcgtcggt gcgcagctgc gtcaggtcat cacgggtaaa cggtgtggtc gccgacacca 3885301 gatagcgccc gaaccccagc tggggagctc gctgcgcggc gttgaggtgc gcatctaccg 3885361 cgtcttcgag cgcgacccgc cggcaggcat attcgttggc tttgatgttg tcctggctgc 3885421 gcccgtcata caggtcaggc atgtcatcgc cctcgacgaa gaatcgggca acacgcagca 3885481 cgacgcaggc caaaccgtcg ttgcgatgtg ccaactggca gaggtcctcg gagctagctt 3885541 tggtcacgcc gtagatgttc ttgggaatgg gcgtgacgga ttcgtcgatc cacgccgcgg 3885601 gctggtctgc cggcggtgtc agggcgtcgc cgaaaacggt cgtcgatgat gtcatgacga 3885661 aggcgcggac gttggcggcg accgcagcat ccagcacggt ctgggtaccg atgatgttcg 3885721 tgtccagaaa cgcctgacgc ggcaggaagg ccagttgcgg cttgtgatgg gcggccgcgt 3885781 ggaacaccac ctcaacgccg gccatcacgt ctcgcagcag tgctcgatca ctcacgcagc 3885841 caacgatatt cgtgtaccgc gacggtctgc tgtcgaggct gacgatgtcg gcgccccgtg 3885901 cacgcagagt gcgcaccagc gcctcgccca ggtgaccgga gctgccggta accagggtac 3885961 gcatcccgct ctcggcggcg gcagccgtcg gaggcgtgcc cgcgtgcaac aacagcggac 3886021 tggaccgcac gccggcgcgg actctcatgg tggctgcatg tgttcccacg actcacgccc 3886081 tcattcccac gaccactcga tcgatgtctt gcggggacaa ccactgcccc gcatgacttt 3886141 tcgcggtctg ccgaataacg tgggacacgg aaagccccgc ctgttgggcc gctgcgcgga 3886201 ggtgctcgac ctgcagcgaa tcgagtccgt ggaatccgaa cgagatctcc cacggatcga 3886261 tgccgtaact ctgcgggctc cggcccaaca tatccgatcg cgctgcgtcg aaaatccgat 3886321 ccaggtcgac tgccgccagg tgcccgcgcc gctgcaacgc ggcgacgagg cactcgatct 3886381 ggcaattccc ggccccgcga ccgaaaccca tcagcgttcc atccaggaaa tcggccccgg 3886441 cgtcgaatgc ctccaaggtg ttggcgacgg ccatggcgag gttgttgtgc ccgtggaagc 3886501 cgacggagac atcgctggca ccgcggagag cctcgacgta gcggcgcgcg tcctcgggca 3886561 ggaaggttcc cgtcgtatcc accacgtaaa cgatccggac gcccacatcg cgggcccgct 3886621 tcccggcagc agcaagcaca tcgggctcga agagatgcga cttcaccagc tggatcgaaa 3886681 cctccagacc ttttgactgc gcacgctcga cgaacggcat caccaactca aattcggtgg 3886741 cgatgacaca tatgcgcaga aagtccagat agtctccggc caaatcgacc gtctcgatgc 3886801 gggccagggc cggcacgatc acggcaccaa gtctcgcgtt tcgaaccacc gatcgggcgg 3886861 cgcggaaata ttcttcgtcg gtgtgagccg ccgggccctg cgccgcggcg gctccgatgg 3886921 tgacgccgtg accgatttca atgtagggaa ttcccgctgc gtcgagatcc ccgacaatcc 3886981 tgcggacatc gtcgtcggtg tactggaagt tcaccgcata gctgccgtca cggacggtcg 3887041 tgtccaggac aatcggctct ctgtgggtcg cagtcatgag catcagtgtc agcccgcacc 3887101 cttgccggat ccttgatgaa ttcttggacg cgcggctggt gtcctatcga cccagtccaa 3887161 tgtcgggttt gttgatctct ggatcgatcg cgatatcgag gacgcaggga ccggtggcgg 3887221 ccaacgcttt ttgcacaccg gcgcgcagct cgcagcgcgt atcgacccga atcccttccg 3887281 ctccaagggc gcgggccatc gccgccagat cgttcgcgcc gatgcgagcg accggcgacg 3887341 gatccatccg cccgctgacc gggccggcgc tggcactcat ttgtccgtcg ttgaggacag 3887401 cccaggtcac cctgatcccg tgcgcaaccg cagtggaaat ctccgtgcca tgcatcaaga 3887461 aagccccgtc cccggcgatg catatgacgt gttcttccgg tcgagccagg gccacgccaa 3887521 tggctccggc gatgccgcat cccatgggcg aaaagtcaac ggtggcaaag aatctgccgg 3887581 gccgccgcac cggtatccca cgaaacgtcc aagaaatgca ggtacccacg tcggcgcata 3887641 tcgtggcgtt gggtgcaagc tcgcggtcca gttcgtgcat cagctcaagc gggtgaatcg 3887701 attccccccg cgcttgcggg gtccccggca acgccgctgg cgccggcggc cgcacgccca 3887761 ccctccgaca aaagcgtggc ggccgcccgc agttcagggc attgacgaac gcgcgcccgg 3887821 acgtggtgat cccgagcgac gtagcgacga atcggccaac tgccgatgga tcgggatcga 3887881 catggacgac gtcggctttc agcccgcgcc agcggggcga aaaggagcgg gtaaccaacc 3887941 cgccgaagga aacaccgacc gcgatcaaca ggtcgcacgg tgtgtcgaag aggtactcgt 3888001 cggccctgcc gtcaccaaat atgccgagca cacccagaga cagcggatgg gtttccgcga 3888061 cgatcccccg cccgttcggt gtggtcgcaa aaggaagtcc cgccttctcg caaaacgcga 3888121 cgatctgctc gccgatgccg tccagccggc agccattccc cagcacgagc atgggggcac 3888181 gcgaccgatc cagcctaccg atcacctcgt cagcgacatc aggaccgcac ggcgccaggg 3888241 ttcttaggcc cccaagaccg gccgcggcag ttccaagttg gtgagccggc agccgctcgt 3888301 ccactagatc gcgcggcaga gcaatgtgca ccggtccgcg agggatgctc gccaaggccc 3888361 ggaacgccga atcgatcttg ctgcgcgcat tggcgatcga ttcgatggac accgaacagc 3888421 ggcagaaccg gcggaaggtt gcgcccaggc ccagtccgtc gtcgctcgta tcctgctgcg 3888481 agtgcaggcc gaattctccg accgccacct ccccggtcag gataagcatc ggaacctgat 3888541 tcaccgacgc attggccacg gcgctaatga cgttggtcgc cccaggtccc gccacaaaca 3888601 ccgcagcgga cttgccggac gcgcgggcga acccgtcggc caggtagccg gcgccgccct 3888661 cgtgccgggc caacacgatc tgaaagccgg catcgcggga cagacgcacc agcaacgaat 3888721 cgagccggga agtcggtagc ccgcatacga ccgaaatgcc ggctgcgcgc atcctggcga 3888781 cgagatgatc cccgacggtc acgggagtca cggccatgcc ccgatcacgg cggcctcgcc 3888841 catgcgctga tcgcgttccg gtaggtaggc cgggccgcag gcgcacaaga atgtcaacgg 3888901 aactgaacct agggcccgga ttttctgcgg gacaccagcc ggtatccaga ccgcatcgcc 3888961 gggcccgacc tcgccagatt cgtctccgac cgaaaccagc ccgcgccccg agagaacaaa 3889021 atagatctca tcggtggctt gcaatcggtg ccatacggtc tcggctcccg ccgccacggt 3889081 cgcatgggcc agactgaccg aggcgacgcc cacagtggcc cgatccacca ggacccgaat 3889141 ctcggacaag tccggcgcca cgaacggctc tgcctccctg gcgttgctga cgaacatggc 3889201 agcagcgtgt gcccgcgctc ttggcggatc cttgacgaat cctcggaacg cgggtttgtg 3889261 accggcggag agcgcgacgg ttgcctgcag cacagcgtct gtcgacgttg acgctcgctc 3889321 ccgttcgggc cgggttgaca tcccccacca ccggccacac aatgcgcccg gtggatgagc 3889381 agtggatcga gatactcagg atccaggcac tgtgtgctcg gtactgtttg acgatcgaca 3889441 cccaggatgg cgaaggctgg gcgggatgct ttaccgagga cggtgccttc gagttcgacg 3889501 gctgggtgat ccgggggcgg cccgcattac gcgaatacgc agatgcgcat gcccgcgtcg 3889561 tgcggggccg ccacttgacc acggatcttc tctacgaggt cgacggggac gtcgccaccg 3889621 ggcgcagcgc cagcgtggtc actctggcca ctgccgccgg ctacaagatc ctcggctcgg 3889681 gcgagtacca ggatcgcctc atcaagcagg acggccagtg gcgtatcgcg taccggcgat 3889741 tgcgcaacga tcggctggtg tcggatccca gcgtggcggt aaacgtcgcc gatgccgacg 3889801 tcgccgcggt cgtcggtcac cttctcgcgg ccgcgcgccg gctcggaacc cagatgagcg 3889861 acacgtaggg gcgacaagct agggccgacg tcggtgtacg gacacacgcg ctcgcgggtt 3889921 ggctgtgcag gaccttccct aaccccatca tcggacgccg acatgccgag cgagaaaatc 3889981 taggaccgcc cctgcgaaag cgtcgttgcg atcgccggcg accatatgtc cggcgccgcg 3890041 cacatcggtg aactcgactt gcggaaaccg cgagagaaat tggtcggcgc tttcttggcg 3890101 gacgatgtcg ctgacttggc cgcgcacgag aagcaccggc acttcgtcgc gcaggatcgt 3890161 cgcaacggct gcattcatgc ggtcgacgtc ggtgacctct acgggaggaa acgccgcgat 3890221 accaccgatg aactgcggat cccagtgcca ataccagcga tcaccgcggc ggcgcaggtt 3890281 ggccaccaag ccatccggat ccgaaggccg cggccgatgc gggttgtagt tggcgatgac 3890341 gtcagccacc tcgtccaacg agccgaaccc cgattccacc cgttcggcca tgaacgcgtg 3890401 gatcctgctc gccccggcca ggtccatatt cggcacgatg tccaccagca ccactgcgct 3890461 ggcaatgccc ggcgagagct cccccgccag cagcatcgcg gcaaacccac ccaaggaggc 3890521 gcccaccagc gccggctgcc caggcaggtt gcgcagcact tcctggatat cgccggcgaa 3890581 gctgaccaac cgatagtcgc cttcgctcga ccagtcggat tcgccatgcc cgcgcagatc 3890641 gatcgtgacc gcttgccagc cacgttcggc gacagcggct gcggcccgac cccatgagcg 3890701 tcgcgtctgt ccaccgccat gcaagaacac cacggcacgc gctcgcgggt ctcccaagcg 3890761 gtcggcgacg atacggactg aaccgccccg gcatgtccgg agactccagt tcttggaaag 3890821 gatggggtca tgtcaggtgg ttcatcgagg aggtacccgc cggagctgcg tgagcgggcg 3890881 gtgcggatgg tcgcagagat ccgcggtcag cacgattcgg agtgggcagc gatcagtgag 3890941 gtcgcccgtc tacttggtgt tggctgcgcg gagacggtgc gtaagtgggt gcgccaggcg 3891001 caggtcgatg ccggcgcacg gcccgggacc acgaccgaag aatccgctga gctgaagcgc 3891061 ttgcggcggg acaacgccga attgcgaagg gcgaacgcga ttttaaagac cgcgtcggct 3891121 ttcttcgcgg ccgagctcga ccggccagca cgctaattac ccggttcatc gccgatcatc 3891181 agggccaccg cgagggcccc gatggtttgc ggtggggtgt cgagtcgatc tgcacacagc 3891241 tgaccgagct gggtgtgccg atcgccccat cgacctacta cgaccacatc aaccgggagc 3891301 ccagccgccg cgagctgcgc gatggcgaac tcaaggagca catcagccgc gtccacgccg 3891361 ccaactacgg tgtttacggt gcccgcaaag tgtggctaac cctgaaccgt gagggcatcg 3891421 aggtggccag atgcaccgtc gaacggctga tgaccaaact cggcctgtcc gggaccaccc 3891481 gcggcaaagc ccgcaggacc acgatcgctg atccggccac agcccgtccc gccgatctcg 3891541 tccagcgccg cttcggacca ccagcaccta accggctgtg ggtagcagac ctcacctatg 3891601 tgtcgacctg ggcagggttc gcctacgtgg cctttgtcac cgacgcctac gctcgcagga 3891661 tcctgggctg gcgggtcgct tccacgatgg ccacctccat ggtcctcgac gcgatcgagc 3891721 aagccatctg gacccgccaa caagaaggcg tactcgacct gaaagacgtt atccaccata 3891781 cggatagggg atctcagtac acatcgatcc ggttcagcga gcggctcgcc gaggcaggca 3891841 tccaaccgtc ggtcggagcg gtcggaagct cctatgacaa tgcactagcc gagacgatca 3891901 acggcctata caagaccgag ctgatcaaac ccggcaagcc ctggcggtcc atcgaggatg 3891961 tcgagttggc caccgcgcgc tgggtcgact ggttcaacca tcgccgcctc taccagtact 3892021 gcggcgacgt cccgccggtc gaactcgagg ctgcctacta cgctcaacgc cagagaccag 3892081 ccgccggctg aggtctcaga tcagagagtc tccggactca ccggggcggt tcagacaccg 3892141 cccggcccgt ggaccgagaa cgattcagct gccattgata tcgggtccat caggggatcc 3892201 agaaccatcc gtttgcatgc cctaccacga tcctgtccta ccgagcggcc cgcagtcacc 3892261 ccagattcgg cgtcaatccg gcacccggtt cgtggtccat ccacggaacc caaggcgcca 3892321 ttttcgcagt gattgcacgc tcggcgaaag gtgttaccca gacgctacag ctatgcgtgc 3892381 ccgtagaatg caaatccctg ctcgcggtcg aggtaggtat cggccttgtt cttgatgaaa 3892441 aacacataga caatcagcga gaccgctatg cacgcggtca cgtaggcgat gaacatcggc 3892501 acctgatcgc gttccttaag agcctggtag atcagcggcg cggtgccgcc gaagaccgag 3892561 ttcgccagtg catagccgac tccgacacca agggcgcgca cgtgcgcggg gaacagttcg 3892621 gacttgacca gtgcattgat cgagcagtat ccggtcagaa tcacatagcc aacggccacc 3892681 aatagaaacg acattgtcgg cgaacgtgtt tcgggaagat aagtaacaag gacgtaggta 3892741 tagatgagtc cgccgacgcc gaaccacagc agcagtggct tgcggccgat cttgtcgctg 3892801 atcatgcccc cgatgggctg cagcatcatc aacagaatca gaccaaccag gttgatccaa 3892861 gtagcggtca tcgcctgcga accgtagaca ctcttgacga tcgcaggtgc attgacgctg 3892921 taggtataaa acgcgaccgt gccgcccaac gtgacgagga aacagagcag caatggcttc 3892981 caatagtggg tggccagttc acggagcgac ccggagtcgt ggtcccgccc ggccttgatc 3893041 gcagtcaggc gttcctgact gagcgattca tccatcgtgc gccgcaacca gaacaccacg 3893101 atcgcggcgc caccgcctac ggcgaagccg atgcgccagc cgaattcgtg aacctgctcg 3893161 cgggtgaaga ccgccaggat gactagcagg gtgaactggg caagcacgtg cccacccacc 3893221 agcgtcacat actgaaacga cgagaagtag ccgcgccgct cccgcgtcgc ggcctcagac 3893281 atgtacgtcg ccgacgtgcc gtactctccg ccggtcgcaa atccctggac gagccgacac 3893341 aaaataagca ggatcggcgc agcgacgcca atgctcgagc gagacggcac caacgccacg 3893401 atcagcgaac aggcggccat cagcgacaca ctgaacgtca gcgcggcccg gcggccgcgg 3893461 cggtcggcaa accgaccaag gaaccacgat ccgacgggcc gggtcacgaa ggtaacagcg 3893521 aagatcgcgt agacatagac cgtcgagttg cgatcggccc gatcaaagaa ttggtcctcg 3893581 aaatacgtag cgaacacggt gtagacgtag acgtcatacc actcgaccag attgcccgac 3893641 gatccccgga tcgtgttcca aatggcccga cgggtctcgg cctgactcgg gcgcgatgga 3893701 ggtgcaatgg aaacggtcat ggtgtcctcc atgcgattcg cattgtcgcg ccgtctgacg 3893761 gtcaccatag tgaccgacgt cagcacccgc cgtgcagggc tggagcgtgg tcggttttga 3893821 ctctgcggtc aaggtgacgt ccctcggcgt gtcgccggcg tggatgcaga ctcgatgccg 3893881 ctctttagtg caactaattt cgttgaagtg cctgcgaggt ataggacttc acgattggtt 3893941 aatgtagcgt tcaccccgtg ttggggtcga tttggccgga ccagtcgtca ccaacgcttg 3894001 gcgtgcgcgc caggcgggcg atcagatcgc ttgactacca atcaatcttg agctcccggg 3894061 ccgatgctcg ggctaaatga ggaggagcac gcgtgtcttt cactgcgcaa ccggagatgt 3894121 tggcggccgc ggctggcgaa cttcgttccc tgggggcaac gctgaaggct agcaatgccg 3894181 ccgcagccgt gccgacgact ggggtggtgc ccccggctgc cgacgaggtg tcgctgctgc 3894241 ttgccacaca attccgtacg catgcggcga cgtatcagac ggccagcgcc aaggccgcgg 3894301 tgatccatga gcagtttgtg accacgctgg ccaccagcgc tagttcatat gcggacaccg 3894361 aggccgccaa cgctgtggtc accggctagc tgacctgacg gtattcgagc ggaaggatta 3894421 tcgaagtggt ggatttcggg gcgttaccac cggagatcaa ctccgcgagg atgtacgccg 3894481 gcccgggttc ggcctcgctg gtggccgccg cgaagatgtg ggacagcgtg gcgagtgacc 3894541 tgttttcggc cgcgtcggcg tttcagtcgg tggtctgggg tctgacggtg gggtcgtgga 3894601 taggttcgtc ggcgggtctg atggcggcgg cggcctcgcc gtatgtggcg tggatgagcg 3894661 tcaccgcggg gcaggcccag ctgaccgccg cccaggtccg ggttgctgcg gcggcctacg 3894721 agacagcgta taggctgacg gtgcccccgc cggtgatcgc cgagaaccgt accgaactga 3894781 tgacgctgac cgcgaccaac ctcttggggc aaaacacgcc ggcgatcgag gccaatcagg 3894841 ccgcatacag ccagatgtgg ggccaagacg cggaggcgat gtatggctac gccgccacgg 3894901 cggcgacggc gaccgaggcg ttgctgccgt tcgaggacgc cccactgatc accaaccccg 3894961 gcgggctcct tgagcaggcc gtcgcggtcg aggaggccat cgacaccgcc gcggcgaacc 3895021 agttgatgaa caatgtgccc caagcgctgc aacagctggc ccagccagcg cagggcgtcg 3895081 taccttcttc caagctgggt gggctgtgga cggcggtctc gccgcatctg tcgccgctca 3895141 gcaacgtcag ttcgatagcc aacaaccaca tgtcgatgat gggcacgggt gtgtcgatga 3895201 ccaacacctt gcactcgatg ttgaagggct tagctccggc ggcggctcag gccgtggaaa 3895261 ccgcggcgga aaacggggtc tgggcgatga gctcgctggg cagccagctg ggttcgtcgc 3895321 tgggttcttc gggtctgggc gctggggtgg ccgccaactt gggtcgggcg gcctcggtcg 3895381 gttcgttgtc ggtgccgcca gcatgggccg cggccaacca ggcggtcacc ccggcggcgc 3895441 gggcgctgcc gctgaccagc ctgaccagcg ccgcccaaac cgcccccgga cacatgctgg 3895501 gcgggctacc gctggggcac tcggtcaacg ccggcagcgg tatcaacaat gcgctgcggg 3895561 tgccggcacg ggcctacgcg ataccccgca caccggccgc cggatagcac gaccggtttg 3895621 cgcggatgcg tcggcgttgt tccccgccgc ggttggcgtg ctctggcaat ctggtctaag 3895681 ggacccgacc ccaccgggcg gaccccacgg catcgagggg ctgtcgctgg cattcgaaaa 3895741 gccgtcaccg gtaacggcat tgacgcagga actacgattc gcgacgacca tgacgggcgg 3895801 cgtcagcctc gcgatctgga tggccggtgt tacgcgggag atcaacctgc tcgcgcaggc 3895861 ctcacaatgg cgcaggctgg ggggaacctt cccgaccaac agccaactca ccaacgagtc 3895921 agccgcttcc ctgcggctct acgctcaact aatcgacctc ctcgacatgg tcgtcgacgt 3895981 cgacatcttg tcgggaacaa gtgcgggcgg catcaacgcg gctttgcttg cgtcatcccg 3896041 agtcaccggg tctgacctgg gcgggatccg cgacctctgg ctcgatcttg gggccttgac 3896101 cgagcttctc cgagatccgc gggacaagaa aacaccgtcc ctcttgtacg gcgacgaacg 3896161 catattcgcc gctctggcca agcggcttcc caagctggcg accgggccgt tcccgcccac 3896221 gacctttccg gaggccgcgc gcaccccgtc caccaccctg tacatcacga cgacgctgct 3896281 agccggggaa acaagcagat tcaccgactc attcggcact ctcgtccagg atgtcgacct 3896341 ccgcggtctg ttcaccttca ccgaaaccga cctggcgcgg ccagacacgg cgccggcgct 3896401 ggcactagca gcgcgcagtt ccgcctcatt cccacttgcg ttcgaaccct cctttctgcc 3896461 gttcacgaag ggaaccgcca agaagggaga ggtgccggct cgaccggcga tggcgccgtt 3896521 caccagcctt acccgtccgc actgggttag cgatggtggc ttgctggaca accggccaat 3896581 tggcgttttg ttcaagcgca tcttcgaccg tccagcccga cggccggttc gccgggtgct 3896641 cctgttcgtc gtaccatcgt ccggacccgc acccgacccg atgcatgagc caccaccgga 3896701 caacgtcgac gagccactcg ggctcatcga cgggctgctg aagggcctgg ccgcggtcac 3896761 cacccagtcg atcgcggccg acctacgcgc gatccgcgcc catcaggact gcatggaagc 3896821 gcgcacagat gccaaactgc ggctcgcaga gctggcggca acgctgcgga acggcacacg 3896881 gttgctcacc ccgtccctgc tcacggatta ccggacccgc gaggcaacca agcaggccca 3896941 gaccctcacc agcgctctgc tgcgccggct ttccacctgt ccgccggagt cgggcccggc 3897001 aaccgaaagc cttcccaaga gctggtcagc cgaactcacc gtcggtggtg acgccgacaa 3897061 ggtgtgccgg cagcagatca ccgcgacgat cctgctttct tggtcgcagc cgaccgccca 3897121 gccgctccca cagagtccag ccgagctggc tcggttcggt cagccggcct acgaccttgc 3897181 aaaaggatgc gcgctcaccg tcatccgggc ggcattccag ctggcacgtt cggatgctga 3897241 catcgccgcg ttggcggaag tcaccgaagc aatccaccgg gcgtggcgac cgaccgcgtc 3897301 atccgatctc agtgtgctag tgcggacgat gtgtagcaga ccagcgatcc gacaagggtc 3897361 gctcgagaac gccgctgacc agctcgctgc cgactatctc caacaatcca cggtgcccgg 3897421 cgacgcttgg gagcggctcg gtgccgcctt ggtgaacgcc tacccgacct tgacgcaact 3897481 tgccgccagc gcttcagccg actcgggtgc cccgacagac tctctgctcg cccgggacca 3897541 tgttgcagcc ggtcagttgg aaacgtacct cagctatctg gggacctatc cagggcgtgc 3897601 cgacgactcg cgcgacgcac cgaccatggc atggaagcta ttcgatctcg ccacgacgca 3897661 gcgcgcgatg ctcccggccg acgcagagat cgagcaaggc ctcgaactcg tgcaggttag 3897721 cgccgacacc cgcagcctgc tcgcacctga ctggcagaca gcccagcaga agctcaccgg 3897781 catgcgcttg catcatttcg gtgcgttcta caagaggtca tggcgagcca atgactggat 3897841 gtggggccga ctcgacggag cgggatggct cgtccacgtg ctgctagacc cgcgccgggt 3897901 gcgctggatc gtcggggagc gcgccgatac caacgggccg cagagcggtg cacaatggtt 3897961 cctaggcaaa ctcaaagaac ttggggcacc tgactttccg agtccgggct acccgctgcc 3898021 ggcggtcggc ggcgggccgg cccaacatct gaccgaggac atgctgctcg atgagcttgg 3898081 cttcctggac gacccagcaa agccgctgcc ggccagcatt ccgtggaccg cgctgtggtt 3898141 gtcgcaggcg tggcaacaac gagtcctcga agaggaattg gacggactgg ccaacacggt 3898201 gctcgaccca cagcccggaa aattgccgga ctggagcccg acgagttcac gaacatgggc 3898261 gaccaaggta ttggccgctc accctggcga cgccaaatat gctctgctga acgaaaatcc 3898321 aatcgcaggc gaaacattcg ccagcgacaa gggctcacca ctgatggcgc acacggtcgc 3898381 caaagccgcc gcgactgcgg ccggagcagc cggctcggtc cggcagctgc ccagtgtatt 3898441 gaagccacca ctgatcacgt tgcggacact caccctcagt ggataccgag tggtctcgtt 3898501 gaccaaaggc attgccagat cgaccattat cgccggcgcg ctgctacttg tgctcggcgt 3898561 cgcggcggcg atccagtcgg tgaccgtgtt cggagtcact ggcctgatcg cggccgggac 3898621 tgggggcttg ctggtcgtcc taggcacttg gcaggtctcc ggcaggctcc tttttgcact 3898681 gctgtctttc tcggttgtcg gcgcggtact cgcgttggcg acgcccgtcg tacgcgaatg 3898741 gctgttcggc acccagcagc agcccggctg ggtaggcact cacgcgtatt ggcttggcgc 3898801 ccaatggtgg caccccctgg tcgtcgtcgg gctcatcgca ctggtggcca tcatgatcgc 3898861 agcggccacc ccaggacgac ggtgacgatg cgtgcggtga tccggaattc aggaaccgag 3898921 gcccgcggcg ccgtcagccg ccgccaactg atcgagcgct tcgccggtgt agacggcgag 3898981 ccgctgcagg tgtggcagtg tgtcacggca gccgatgaat ccaaagttca gcgtgccggc 3899041 gtaactctgc aaagtaacgt tgagagcctg gctgtgcgcc accagggaga ccggatagga 3899101 cgcctccatc cggctgcccc gcaggtagag cacgtcctcg ggccccggca cattgctgac 3899161 acacaggttg aacgtgtacg gccagggtgg cttcacccca ctgagcgtgc tggccaactg 3899221 caccccgtac ggcgccatca acgcggcgct ataggccagg atcgcgtcct tgtccatgga 3899281 cctcagctga gccttggccg cgcgggttga cgccgtgacc gccgccagcc gctgcaccgg 3899341 atcggcaacg tcggtaccca acgtcgccag gatggtcgcg accgcgttgc cgccgccctc 3899401 gtcgtccttg ggtcgcacgt tgaccggcaa gaccacgatc agcgacttgt tgggcagctc 3899461 acccagctcg tccagaaaac gtcgtaagcc gcctccgatg atcgccaacg cgacgtcgtt 3899521 gattgtggca tcatattgag ccccaatggc tttcagtcga tccagcggat attgctgggt 3899581 ggcgaagcgg cggttgcggc tgatgcgggt gttgagtatg cagtgcggcg cttgcaccga 3899641 gccgacgagg ttgcggtact cgtgatcact gcgcagctgg gcgttgacca gcgccttggt 3899701 gagctcgaac gtcgatcgtc ccgcaccggc caccgaacct aagacgctac cgaccccgct 3899761 gaccagcccg cccaaccccc gcaccacgtc gcccaggcca tcgagcacgt taccggcccc 3899821 agctatcaaa ccgccgccga cggagtcttg agtgtcggcg ggtgatcggc caggtgtggg 3899881 aatgttgaag aacaacgggt gggtggtgtc gtgcgggtcg gtggacaggc tgcgggccag 3899941 cattttctgg ccggtatagc cgtctatcaa cgagtggtgc atcttgatgt agatcgcgaa 3900001 ccggccacct tcgaggcctt cgatgaaatg cacttcccac ggcggacggc gtaggtccag 3900061 ggcgtgacta tgcaagcggg acaccgggat cccgagttca cgctcgtcgc cagggctggc 3900121 cagcgccgac cggcgcacgt ggtagtccag gtcgaagttg tcatcaacga cccaggactg 3900181 cgtgggatgg tatagcagct ccggatggct cagtcttagg ctccagggtt cgacgacctc 3900241 gctggccttg ctttcgtcga cgagttggcg cagcaagtcc ggcggcgcac ccgagggcgg 3900301 cgtgaacggc atcaacgcac caacgtgcat catcgtggtc gacgattcgg agtacaggaa 3900361 aaacatgtcc tgcggaccca accgccgggc cgtctggctc acgggccact ccttcgttgg 3900421 aggcattctc aggccgtcta gcgccgccag ataactaccg tagatgatcg cggccgtgcg 3900481 ttgtacgccg catcacatcg cgtgaactcc gttgtagagc agcagtaaac cgatcacgac 3900541 cagaatggcc gcgaccatcc cggcatggtt cttctccatc cagtctttaa gtcgttccag 3900601 cgaatcgtcg agtcggtcac cggcagccac gtaggccaat atcgggatcg cgaccgtgga 3900661 tgcagccaac atggcaaaga atgccgtgta aatccaggaa cccgcggcgc cgtggccgcc 3900721 gctgccgatg gccaatccgg ccgccgcgca aatgatcagc acctcgggtc tcaccaccac 3900781 cagcacggcc cctaccaatc cggcgcgtgc cggggtgaag ctggcgaatg cgcgcatcca 3900841 gcccggcatt tcggtgtggc gatgccgggt cagccaccga agcacgccga acacgatcag 3900901 tgccgacccg aggaccaccc gtagccagga tgcccaggcc ggcgatgttg tgctcaaacc 3900961 gccaagtgcg ccggaggccg caacaaagac ggcggtcacc acggccaagc ccaacagcca 3901021 gccgcccagg aaggccaggc tgctcggccg cggctgcggc gagtgtacga ccagtaccgc 3901081 tgggatcacc gacaacggcg agagcgcaat gaccaacgcc agcggcacga gcccggtgag 3901141 cacggagacc caatgacctg ccacgggcag caatcctcgc attgacaccg cctcggtgac 3901201 caccgagcgc cccaattcga cgaaattgcg cgcacccaac cgccgttcgg gctgttgata 3901261 gccatcgcga gacgtcgatc gccgaaacgt acgtgcaaag acggcggctt ggtgagccga 3901321 cgattaacga cgctgcccgg ccaaacgctc gccctgacag aattgcgacc cgaagtccac 3901381 cgtcacggtc gacggcgtgc ggaactgagc ggcgaaggtg agctggggat ccaccggcgc 3901441 ctgatcgagc cccagtcgct ttgtctcgag gttgatctcc acggtatcgg gggcagttct 3901501 gcgcgcattg gtgttccggt ccggccgcat gttcggctcg gtgcctctgc tttcgccggg 3901561 ctttctgatg gcaagttcat cggtgtcctg ctgcgggccc agctcggcga actccttgcc 3901621 attgttggcg atggtgtagg tcagcaggta accggcaaag cccgacgcaa acgagcccga 3901681 tggcgacggt ggcagcggct cggcgaatcg aactaccagc cgaaggaccg cgcctcgggg 3901741 atgcgagacg tccacggagg ccacggtaat ggtcgcgggc ggtgtcggcc cgggccgcaa 3901801 ctggcaactg agtcgcttcg gcagctgcga ccagtcgtct ggcagcggcc tgatccaggc 3901861 gtagaccgcg acacctacca tcgtcagcac tgccgcgacc ggaactgtca accgcaggcc 3901921 ggccggcatg cccagccagc ggctccggag accgtcgaca cgcagggcca gcggtgatcg 3901981 tggtgcagag gggttgggtc gacgatgtct ggtccagcgg tcaccgtccc aatatcgttg 3902041 ccccgctgaa ccgtcaggat cggtatacca tcctgccggc ggcgaagtcg ccacgtcgtg 3902101 ctccattcaa cagtcggtaa ggatcagctg cggtgccgct cctcgcggac tacggcggcg 3902161 catcgaacaa ctccggtagc gaatcgagga tctgcacgtg gtcgccgcgc cacgcaaacc 3902221 ccacaatcgt caagatgcca tcttggcagc cgtcacagct ttgacgtgtc cgatagctca 3902281 gcaccacgat gtcgtttgtg gatgccggac cgatcaaatt ggtgaacggg taggccctcg 3902341 gcgttgcggt tccgacgaac gttccccgat gaaacatcag cgcctggtcc ggggagctgt 3902401 tggtggcgtc ttgcaccgtc accagcaccg cggacaggtc cgcgcacggg tcgtagttgc 3902461 tgtcctccgg cgtactattc cacggcctgc cggttttgga atcgggggca agctgggcca 3902521 gcgcggcgcg cacggccgtt gcctcgtccg gcccacacgg gccaacctgg gatgccgggg 3902581 aagtggggcg cgccggtgcg gacgtcgttg ccggcgccgc ctggttggca cctggtcgga 3902641 cccgatgcat accggcgtac gcaacgaccg cggcagccac acaggccagg accacgagcg 3902701 ccaccagcca ggcggtgggc caagacccgc ctggggcggg cggagtggta tcgacgtcat 3902761 cggacggctg gtatgccggc gcaggccagt cggggtcaat ctcgtcagac acctaacccg 3902821 ctaaccctcc cggtacccgc ccgctggctg tgcgatactt gccgagcttg ccgaattgta 3902881 gccagaacgt gcaggtagcg gaaacaagcg ggccgtctcg aggggccccg ccggccggtg 3902941 aggctgacca catccagcat tctgatagct ggcttcacag caatctggcc ccatactaga 3903001 cgtcatgcag caagcgacgg caccgcaacc gctggcagcg cgccagttgg ttcgacggcg 3903061 cctggccgag gcatatgatg gcgcgttctg agggcaatcg cccacgccat cgcgctgtgc 3903121 ctcagccgtc gcggatccgc aagcggctgt cgcggggcgt tatgacgctc gtgtcggtgg 3903181 ttgccctgct gatgaccggc gcagggtatt gggtagccca cggcgcgctg ggcggcatca 3903241 ccatttcgca ggccctaacc cccgaggatc cccgttccag cggcaacaac atgaacatct 3903301 tgctcatcgg gctggactcg cgcaaagacc aggaaggcaa cgacctgccc tggtcggtct 3903361 tgaagcagct acacgcgggc gattccgacg acggcggcta caacacgaac acgctgatac 3903421 ttgtgcacgt cggtgccgat ggcaaagtgg tggccttctc gatcccccgc gacgactggg 3903481 tgcccttcac cggcgttccg ggatacaacc acatcaagat caaagaggcg tacgggctga 3903541 ccaagcaata cgtggcagaa cagctggcca accagggtgt gagcgaccgg aaagagctcg 3903601 agacccgggg ccgtgaagct gcccgggccg cgaccctgcg ggcggtgcga agcctgaccg 3903661 gcgtcccgat cgactacttc gccgagatca atttggccgg tttctacgat ttggcccaga 3903721 ccctcggcgg cgttgatgtg tgcctgaacc atgccgtcta cgactcgtac tccggagccg 3903781 acttccccgc cgggcgtcaa cggttgaatg ccgcgcaggc gctggcgttt gtccggcagc 3903841 gtcatggcct agacaacggg gacctggacc gcacccaccg ccagcaagca ttcctgtcgt 3903901 cggtcatgcg cgaacttcag gattcgggca ccttcaccaa cctggacagg ctcgacaacc 3903961 tgatggccgt ggcacgcaaa gatgtggtgc tgtcggccgg ctgggacgag gacctgttcc 3904021 gccggatggg cgacctggcg ggcggtaacg tcgaattccg gacgctgccc gtggtgcgct 3904081 acgacaacat cgacggccag gatgtcaaca ttatcgaccc gaccgcgatc cgggccgagg 3904141 tagcggcggc atttggcagc gcgccgccaa cgtcgcagac cgccgcggcc gccaaaccta 3904201 acccatccac cgtcgtcgat gtggtcaatg ccggcagcat cagcggactg gccagccagg 3904261 tctccggtgc gctgctgaag cgcggctaca ccgcgggtca ggtgcgtgac cgcgaatccg 3904321 gcgatccgtt caccaccgcc atcgagtacg gtgccggcgc ggaaacggac gcccagaacg 3904381 tggcagacct gctcggtatc gacgccccca accatcccga tcccgccgtc gcgcccggac 3904441 acatccgtgt gacggtggat accaacttct ccctaccggc acccgacgag gccaccgccg 3904501 ccgcgacgtc caccgaaacc agcacatatc cgctgtacgg cggcggcacc accaccgacc 3904561 cgacaccgga ccaaggggcg cccatcgatg gcggcggcgt gccctgcgtg aactaggtaa 3904621 gttatccgac cactccacgc agcccgtcgg cgccgaacac cggctccagc atgggcgaga 3904681 agtccgggcc ccttcgcagc atgtggccgc cgtcgacgtt gatgacctgt ccggtgatcc 3904741 aactggccgc gtcgctgagc aaaaacattg ctaggttcgc gacgtcttcg acctcaccca 3904801 cccgcggtaa tggcgtgcag acccggtagt ccgcgctcag ctccggcgac tctgtgacgg 3904861 gcacaaccag atctgtacgg atcaggcccg ggcggatgct gttgacccgt acccacgacg 3904921 ggccgagttc gtcagcggcc agtttcatca tgtggtcaac ggccgacttg gtgaccccgt 3904981 aggcgccgaa ccagcgatgg gtgttgctgg ccgcgatcga ggagatgccg acgaacgaac 3905041 cgccgccgcc gcgtaccaat tcccgcgcgg cgtgcttgag cacgtacatg gtgccattga 3905101 cattgaggtc cacggtgcgc cgccaggcct gcgagtcgat ctgggtgatt ggcccaatgg 3905161 tctgagaccc gcccgcgcaa tgcaccacac cgtgcagccg gccatgccac gcggttgccg 3905221 cgtccaccac acgcagggtc tgctcctcgt cggtgatgtc ggccggctca tagccgatcg 3905281 ctccggtctt gagcgcctcg atgtctttga cagccgccgc cagcttgtct ggatttcgtc 3905341 ccacgatcat gacggcggct ccagccgcga ccaacccggc ggccaccccc ttgccgattc 3905401 cgctgccacc tccggtgacc aggtaggtcc ggtcttggaa agaaagctgc acttgaggcc 3905461 cctcacgccg aaactgaaac aggttctcgc cattttggac catgcggccc gtcacttgcg 3905521 ccgaaggtga actcacggcg aggtttcgcg gcgctcgcga attcatgccc tcagttcacg 3905581 ttcgacgttc gtgatcaacg gtgccgccat cgtggaggga ttccataggt tgcggcttgt 3905641 tgccacattg cggccagtgt gcgccgccgg gtgcgcgtcc acggtaggct tcaaccacga 3905701 attatcgggc aacgatatcg gagtcggagt tggcaataac tggttcggcc gcaccgtcat 3905761 ggccgcgact attgcacgcc gagggccccc cttccgtcat ttgtatacgg ctgttggtgg 3905821 ggttggtgtt tctcagtgag ggaatccaga aattcatgta tccagatcag ctgggtccgg 3905881 gccgcttcga gcggatcggc atccccgccg ccacgttctt cgccgatctg gacggggtgg 3905941 tcgagattgt ctgcggcaca ctggtcctcc tcggcctgct gacccgggtc gcggcggtgc 3906001 cgttgctcat cgacatggtg ggagcgatcg tgctgaccaa actccgagca ctgcagccgg 3906061 gcgggtttct cggggtagag ggcttctggg gcatggccca cgctgcccgg accgacctgt 3906121 cgatgctgct cggattgatc ttcctgctgt ggtccggccc cggccggtgg tcactagata 3906181 ggcgactgtc caaacgcgcc acggcttgcg gcgcgaggtg aacccgcgac gtagcgcgac 3906241 cgatgcaccg gactcaacga cgagtcagcg gtggcgtcgc gaatgaactg cccgatctga 3906301 cgcaacgaac gggtcgcttc gggcaccagc ggtgtggcga gttggaaaag atgagcctga 3906361 ccgggccaaa cccgtacctc ggcacagacg cctgccgccg ccagcttgcc ggcgcccagc 3906421 tgcgcgtcgt gcagcagcac ttcggagccg gaaacgtgaa taagtgtcgg cggcaagctg 3906481 gattcgatat ggtcgagcgg ctcatagagg tcttcgggcc tgccgtcgac catgttcttg 3906541 gcagcggccg ccctgaccca tgccgccaag gcatcgaatg cccgcgccgg aaacatcgcg 3906601 tcggtcccga tgttgggatg gtcctgcttg ggccccttgg ccagctgcag caacggagag 3906661 atggccacta ttgccgccgg tttctcgtcg tcgcactgca gccgctgcgc aagcgcgagc 3906721 gcaaggtaac cacccgcgga atcaccggcc aacacgatct gttccggccg gtatccgcgc 3906781 gcccgcaacc attggtatgc atcgtggcag tcgtcgagcg ccatccccag cgaatgctta 3906841 gggatcagcc gatagtcgac tatcaacacg ggtgattcgg caaatcctga cagcgcgttg 3906901 acgatcctgc tgtgcgaatt cggcccgcac atgacaaacg cgccgccgtg caaatagagc 3906961 accacccgcc cagcgccgtc ggccgcccgc accccaggcg cacgcaccaa ctgggcggta 3907021 gcattcggca aatttatcgt tgttcggacc gtgccctgcc cggggcgcca aaccctgcat 3907081 gcgaagtcga cgaaccccaa cggcagaggc aggggcgata ggtaactgcc cacagtcata 3907141 agtggcttga tcgtcatgcg cgatgccagt gccgccaacc gacctgcaac actagggccg 3907201 ctttcggtga tctcgatggg agccccgtcc cagcacgaat cggaattcga gcatcccgac 3907261 gattgcaggg gccggcgtgc gtaatacgag gacattttca gcacgtttcg ccggaatgtg 3907321 gccggtggtt ggcgttagct gcacggaagc gcctgagctg gcccgccgtc accgcccgat 3907381 ttatcaatcg caaatctcgc acttcccgtt tacgtagttg ctccaaccag acgcagccca 3907441 attcgggctc ctccccccat caatcattcg gtggcgcgaa gttcaccaga gtcccggaca 3907501 cgctcacgcg aactacctgc atttagggga tcacaggcac cttgaaatgc atcggtgtat 3907561 gactgggagt ttgctgtacg tctattggta agtgcgaatt cgccgccggc tacccgcacc 3907621 ccgtagaatc gcaagccgat atcggcttgg tcacctgagg tgttctatgc gggagtttca 3907681 gcgggccgcg gtgcgcctgc acatcctgca ccacgctgcc gacaacgagg tgcacggcgc 3907741 gtggctgacc caagaactga gccggcacgg ctaccgggtc agccccggca cgttgtaccc 3907801 gaccctgcac cggctcgaag ccgacggcct gctggtgtcc gagcaacggg tcgtcgacgg 3907861 ccgcgcgcgc cgcgtctacc gggctacccc ggctggccgg gcagcactga ccgaggatcg 3907921 ccgggcactg gaagagctgg cccgcgaagt cctcggcggg caatcgcaca ccgctggtaa 3907981 cgggacctga accgcgtcga cggtacccat cgccggggcc aaaccgtgac gacgtctgca 3908041 gcgcaatgcg ggcttggctt acagttatgt aatgtctacc aaatctgacc acggcgaaat 3908101 cggtgacgtc gaaccgctgg cagacagcac cgcgagccag gccaggcgag tcgtcgccgc 3908161 atatgcgaac gacgccgacg agtgtcggat cttcctgtcc atgctcggta ttggaccggc 3908221 caaactcgag agctaatggc tccctcggga ggccaggagg cgcagatttg cgattcggag 3908281 accttcgggg actctgactt cgtggtggta gccaatcgac tgcccgtcga tctggagcgt 3908341 cttcccgacg gcagcacaac ctggaaacgc agccccggag gcttggtcac cgccttggag 3908401 ccggtgctgc ggcgtcggcg cggggcctgg gtcggctggc ccggcgttaa cgacgacggg 3908461 gccgaacccg acctccacgt gctggacggc cccatcatcc aagacgagct ggaacttcat 3908521 ccggtacggc tgagcaccac ggacatagct cagtactacg agggattctc caacgccaca 3908581 ctgtggccgc tgtaccacga cgtcatcgtc aagccgctct accaccgcga atggtgggat 3908641 cgctacgtcg acgtcaacca gcgctttgcc gaggccgcgt cgcgcgccgc cgcccacggc 3908701 gcaaccgtgt gggtacagga ctaccagctg cagctggtac cgaagatgct gcgcatgctg 3908761 cggcccgatc tgaccatcgg tttctttttg cacatcccgt tcccgccggt agagctgttt 3908821 atgcagatgc cgtggcgcac cgagatcatc cagggcctac tgggcgccga cctggtgggc 3908881 ttccatcttc cgggcggtgc ccagaatttc ctgatcctgt cccggcgtct ggtcggcacc 3908941 gacacttccc gcggaaccgt cggtgtgcgg tcgcggttcg gtgcggcggt gctcgggtcc 3909001 cgcaccatac gagttggcgc ctttcctatc tcggttgact ccggcgcgct cgaccacgct 3909061 gcccgcgacc gcaacatcag gcgccgggcc cgcgagattc gcaccgaact gggaaatccg 3909121 cgcaagatcc tgctcggtgt tgaccggctc gactacacca agggcatcga cgtacggctg 3909181 aaggcctttt ccgagctgct ggccgagggc cgcgtcaaac gcgacgacac cgtcgtggtc 3909241 cagctggcta ccccgagccg cgagcgggtg gagagctacc agacgctgcg caacgacatc 3909301 gaacgccagg tcggccacat taacggcgag tacggtgagg ttggccatcc ggtagtgcat 3909361 tacctgcatc gaccggctcc gcgcgacgag cttatcgctt tcttcgtggc cagcgacgtc 3909421 atgctggtca ccccactacg cgacgggatg aacctggtgg ccaaggagta cgtcgcttgc 3909481 cgcagcgatc ttggcggtgc cctggtgctc agcgaattca ccggggccgc agccgaactc 3909541 cggcacgcat acctggtcaa cccgcacgac ctggaaggcg tcaaggacgg gatagaggaa 3909601 gcgctcaacc agacggagga ggcgggccgg cggcgaatgc ggtcgctgcg acgccaagtg 3909661 ctcgcccacg acgtggaccg ctgggcacag tcgtttctcg acgctctcgc cggggcacac 3909721 ccgaggggcc aaggctaacg gtcaagccgc tcccgctcgc gagcagacgc agaatcgccc 3909781 atttcggcac gaaattgggc gattctgcgt ctgctcgcgc cctggaagct ggtgcggctg 3909841 cccaaaggct gtgatactcg atggagcgcg aaggcccgaa ggagggcatg tgaacatccg 3909901 ttgcggactg gccgctgggg ccgtcatctg ctcggccgtc gcactgggaa ttgcgctgca 3909961 ctccggtgac ccggcgcgtg cgctcggacc gccgccggat ggcagttact ccttcaacca 3910021 ggccggagtg tccggggtga cgtggacgat taccgcgctg tgcgatcagc cgtcgggaac 3910081 ccgtaacatg aacgactatt ctgaccccat cgtttgggcg ttcaactgcg ctctcaacgt 3910141 ggtgagtacg acgccccaac agatcacccg tacggaccgg ctgcagaact tcagcggcag 3910201 ggctcggatg agtagcatgc tgtggacctt ccaggtgaat caggcagacg gcgtggcgtg 3910261 tccggacggc agcacggcac cgtccagcga aacctatgcg ttcagcgacg agacgctgac 3910321 cgggacgcac accaccgtgc atggcgccgt gtgtggcctg cagccaaagt tgagcaaaca 3910381 accgttttca ctgcagctca tcggcccgcc acccagcccg gtccagcgtt atccgttgta 3910441 ctgcaacaac attgcgatgt gctattaaat cggcgtgatg taggcgatca gccatttgcc 3910501 gtcaatccgc tggaaatcca cccgcagtcg gctgccgtcg tagagcggct gtcgcgtctt 3910561 gtcggtcacg gtgcggttca aatagaccat caccgatgcg caatcgcgtt tggcatccat 3910621 gactcccaca ccgacgacat tggcctggac caccacttca cgcttcttcg cctccgggat 3910681 gatctgcgca ttggcgctct tctggaactc ctggcgatag tccggcgtca gcagcgggta 3910741 caccgcggtg aggctgcgct cgacagtttg gtagtcgtaa ccgaagactt gtgggatttc 3910801 ctgcatggcc agcttcggta acagtgcccg cgccgacgct tcgcccccgg tctgcacccg 3910861 gtcccagtag aaccagccac cggccgcgga caaacccacg atggtggcga ccatcagcgc 3910921 gtaggcgacg gaaatcaacc gtctcatcag ttgcccccgt ccgggtactt caggtcgtag 3910981 ccggtcatcc ggccgttctc gtcctcatgc acgatgaccc gaagacgata gggcatggac 3911041 ggcttgttga cgccgtcgat atcggcgacc gtcacccgca ccgacaccaa taccgatgcg 3911101 ttgtcgctga tttcgtcaat gccctccaac gcggcgccgt tgacgacggc ctccgatgtc 3911161 gcgttggtgg cccggaatag acccttgagg ttgtccacgt tgttgttggc gttcagcatg 3911221 ccgcgtagcg gcccactggt gccgttgacg aaccggttca cgctctcgtc gatggtgtcc 3911281 ggcgtgtagc tgaacatgtt gaccacggtc tgggtggcgg catcgacaaa acgctggttg 3911341 cgggcttgcc gggcgtccgc atcccggttc tgcataacca gtgcggtcac accccatgcc 3911401 agcgcggcaa tcgccaatag gcctgccgcc agcgaaagcc agccgaccag gacgcggtgt 3911461 gccggccggc gtggcggcgg tttgaccggc ctcagggcgg gcttggcggc tttcgccggc 3911521 ttcgattcgg tccgcgccgc ggccctcacc gtcgccgcac cctgggcggg acggctcgac 3911581 tcgccttccg ccggacccgc ggggcgggac gccttacgac gagcgcgccg cgtcgtcgac 3911641 tgctgtccgc cggctacacc ggtatctgcg gccactacag ctgcctcgga tcgcgcatga 3911701 gatccaccca attctcggcg ctggatgcgc ccgtcatccc gggcgcgaag ataccagtgc 3911761 cgccggccgg gtccgcgaag gctccgctga gttggtcata gatggtgtag gccggaccgc 3911821 tggcctgcgg ttgggggcca ggcgccggcc cgggcggagg cccggtgcct tccggtggtg 3911881 gcggcggcgg aatggtcgcc gggtatggca cctggggtgg ctccggcggg tacccgggcg 3911941 gcatccatga cgtgaacggt ggaggcggac cgttgtcgtt gggcggcggc gcaggctgcg 3912001 ccggctggtg cggcgccggc ccggggccgg ccacctggcc gggtggcggc ggtccgacga 3912061 tgggcacgcc cgggtcggga tccgcgcccg gcgggatgta agggaacttg ttgggcggca 3912121 gaatgtttcg cccatccgtc acctcggtgc cgtacgggat cggcggaccg cgccacgggt 3912181 tggttccaac tggcacatag ccacgcggat cccgacataa ctgcaccgtc ggtgcccgct 3912241 taccggggaa ttcctggcac gggtagttgc gagcgccgcg caccgtgctc gggtcgttct 3912301 gcgcggtctt gcagtacatg tcccggggaa tctcgcgtac cgactcgtcg gccggcgacc 3912361 ggaccagcgg cgggggcaag aacccggtca tgcagggcgg cgggtcgtgc aggtcgatct 3912421 tgaagtccag cttggcgccc tcgtcctggg gtacgccgcc cgccgaggtg atgatcgcgg 3912481 cgaacagcgc cgggaaaacc accaggagct gttcgatcga cttgtgatag atcacgccca 3912541 cccggcccag gttggccaga ctggccgcca gcgcgggaaa cgaaggacga atcccggaga 3912601 acgcggtgtt ggcctcgtcg atcgcatccg gggcgtcggc caacgtgtcg cgcagccgcg 3912661 ggtctgccgc acggagctgc caggtgaacc gcgccagccc atcggcgagt gacttgatgt 3912721 ccccgccggc gcggatctgg gcttgcagga acgggccggc ctgatcgatc aactgcgaaa 3912781 cctgtggata gttggcgttg gcctcatcca ccagcaaccg ggccgactcg atcagccggg 3912841 ccagttccgg accggcgcca ttggtcgcga tgaacgcctc gtgcagcagc tcccgcagcc 3912901 gggtgtcgcc aaggctgccg agcagcgtct cggcctgacg caacaggtcg gcgacgtctt 3912961 gcccgattcg ggtgttctgc cgctggatcc ggaagccgtt gcgcaacttg gtcgacgacg 3913021 ggttctccgg cggcactagg tcgatgtact gctcaccgat ggccgaaacg ctgcgtacgg 3913081 tggcggtgac gttcgacgga atggcggtgc cactgttcag tcgcatgtgc gcggtaacgc 3913141 cattgggatt tagccccacc gactccaccc gcccgaccgc gacaccgcgg taggtgacgt 3913201 tggcgttctt gtacaggcca ccgcccgcga cgaagtcggc actcacgccg taggttccga 3913261 tgccgaacgt ggcgggcaga cgcagataaa agatcgccat cacgctcagg gtgatgacgg 3913321 tgatcaccgc aaaaatggac aactggatct tggcgagtcg gtcgatcatg tccgggcccc 3913381 tactgtcccg acgccgtacc gggtggaatc ttaaatgggt cggccgcttg cccggacagg 3913441 ttggccagtt cgccaatcag gaagtcgggc gggttgagga tctcgtccat gtgcgccatg 3913501 ttcgggtcga agtacgccgt ggtgaagaac gtctcaccaa tccggcgcag ggtgaggtcg 3913561 aaggtggtga acacgttaag atagtcgccg cgcaccgcct gcttgatacc gaagttggga 3913621 aatgggaacg tcagcaacag ctgcagcgag gtgacgaaat cctttcggtc gtcgttgagg 3913681 gccttgacga tcgagtagag gtctttgagg tcttcaccga aatccacctt ggtctcggcc 3913741 agcacgtgcg acgtgaccat cgtcaacctt ttgagcgcgg cgaacgcgtc gacgatgtgg 3913801 tcccggttct ggttgagcac gcgaaccgcg tcgggcagcg tgtccagtgc tcggcccagg 3913861 ttgtccttgt cacgtgccag gatcgcggag actcggttca gcccatccaa cgcatcgatg 3913921 atgtcgtgaa cctgccggtt caggcccgcc gtcaactccg cgagcctggg gaccaggttg 3913981 acgaactggg cctgccgacc cgccaccgcc tggtgggtct cgtcaatgat ctcttccaac 3914041 gcaccgacgt tacccttgtt gaccaccacc cccagcgccg agaaaacctc ctcggtggtg 3914101 gggaatcggt cggtgttggc ctcggtgatt ctcgagccgt caaccaacct cccggtcggc 3914161 gggcggtccg tcggtggcgc cagctctaca tgtaacgaac ccagcagcga ggtctgggag 3914221 accttcgcca cggcgttggc cggcagcaac acattcttgt ccaggtccag cttcacggcg 3914281 gcataaaagg atccgtcggg tcgttggacc gcgacaatgc cggccacgct gccgacggtg 3914341 acgtcatcga ccatgaccgg tgagttctgc ggcaacgtcg ccacatcagc catttcgacg 3914401 gtgaccgagt aggcaccttc accgtgcccg gcggtgccag gcagcggcag cgagttcagc 3914461 ccgccaaact gacagccggc aagcagcgcg ctgctggccg tcaatatgat ggcgcgcaac 3914521 cagattcggt tcatccgccg cccccatgct cgcccggtcc tgcccccggc gccggcgggg 3914581 ccggtgccgg accgggcgcc ggtgggacga gtaggctctg cagatccgcg gggttgccca 3914641 caggcgctcc tcctcccgcg ggtacccaag tcaattccgg aaccggcgtc tccgacttgg 3914701 cctcggtggc cggggtgtcg tagatgatct ggcccttgta cgccgtgatc gtgttaagcg 3914761 ggtggaacat gatcggcggg taattcaccg tgagccggcg cagcaccggc cccagccgct 3914821 cacggcagat ctcggcgcgc cggtagtagt ccggcgccga cgggcccgcg gcggtatcga 3914881 aggaaccgcc gcagatgaac tgcaccgggt tagcgaagtt gggtatcgac aacagaccgt 3914941 tgagggtgcc ttgcgcaggg tcatagatgt tgtagaagtt ggtgatcccc ggcccagcca 3915001 cgtgcagcac ttgctcgatg ttctcgctct ggtcactcaa cgtctgcgca aagtcgttga 3915061 gctgattcac cgtttcgatc agcgtcgagt tgttctcgcg caagaacccc ctgatgtcgg 3915121 acagcgcctg gttgagcgtg cccagggtct ggtccagatt ggccgagctg tcggcgagca 3915181 cctgcgacac cgatgccacg tggccggcga actgcacaat ctgctcgtcg ctctccgata 3915241 gcgcgtcgac cagtacctgc aggttcttga cggtgccgaa gatgtcgccg cgcgaatccc 3915301 ccagccgccc ggcgacctgc gcaagctcgc gcaacgcgtt gtgtaacgag tctccgttgc 3915361 cgtcaagggt gtccgcggcc tggttgatcg ccgcgcccag cggcccctgc agctcgcccg 3915421 ccgccggact caggtcggcg gccaaccggg tgagcccctc tttcacctcg tcccattcca 3915481 ccggcaccgc ggtgcgatcc agatcgatcc gaccgttgtc gggcagtacc gccccgccgg 3915541 tatacaccgg ggtgagctga atgaagcgcg ccgccaccaa attcggcgac atgatcacgg 3915601 cctgcacgtc cacgggcacc ttgacgtcct tggacaccga catagtgatc ttgacgtcgg 3915661 acgaccgcgg ctcgatcatg tcgatctcac ccaccgggac gcccaggacg cggacctggt 3915721 caccgggata gagcccgaca gcagaggtga agtagcccac gatggtgcgc ttattaccgg 3915781 tggacgagag cacgtacacg ccgcccacca gcgcggccac cagcgcgatc accgtggcgt 3915841 agcgcaatcc ccggctcccc gtcaacatgg cgacccggcc catcacggcg acttcggtct 3915901 gatgatccag cgctcctgga taaacccgcg caagtaatcg gcgaggctat ccggcagctt 3915961 gcccggctga aagaccaggt cgaacacggt cgccaccagc ggcccgggca gcacgctgta 3916021 gacgttgaca ttgaatccgg gtccggatcc gaccacctcc cccagcgtgg tcgcgtacgt 3916081 gggcagccgc ttgagggcct cggtgatata gtcgcggcgc tcgttgaggt tggccagcac 3916141 caggttgagc ttgctcaaag ccgggccgaa ctccttacgg ttgtcggcga caaagccgga 3916201 aatctgcgct gcaacatcgt cgatcccaga gatcaacgcg ctgagcgcgg cccgccgggc 3916261 atcgagcgcc gcaaacaact ggttgccgtc ctcgaccagc ttgttgacct gttcggcgcg 3916321 ttcggacaac accgatgtca ccgacttggc gtgcgccagc aggccttgca gcgcttcgtc 3916381 gcgacgattc agggcgcgcg acagcgacgt cagcccgtcc acggcaccac gcacctgcgg 3916441 ggtggcgtca tgcaaggcct gggtgaacac gttcaaggcc tgctcgaact gcggcctatt 3916501 caggtcgttg gcgttgcggc ccagatcctg cagcaccccg ttgagcgtgt agggcgtggt 3916561 ggtccggctc aacggaatcg tggtcgactt gccggagcca gccggactga ccgcgatgga 3916621 gcgctcgccg aggatggtgt cggtgcggat cgcggccagg gactggtcgc cgacgacgat 3916681 gctgcggtcc acgctgaagg tgacctttgc actgtttccg gccagactca cggccgacac 3916741 cgcgcccacc ttgaggcccg agacataaac cgagttaccg ggggtgatcc caccggcgtc 3916801 ggtgaaatac gcgtcgtagg ttttgccctg tggccagaaa ggcaacccgc tgtagccgaa 3916861 tgcgatcagg acgacgcaga tcaccagcac caggccgaag atgccggtgc ggagcgggtc 3916921 gcgttcgtgt ttgctacttg gcttcctatt tagcaaaggc gcacctcccc ttgctgggat 3916981 ccggctggcc gccgatcggc agcaggatgt cgctgccggc cggtccgttg atcttgatcg 3917041 tcaccgagca gaagtagatg ttgaagaatg ctccgtaact gcccagcgcg gacaggcgca 3917101 ggtagtcctc gccgagctgc tcgatgtcgt tgttgacctc ggcctttcgg ttgtccagct 3917161 cggtagccag cggccgggcg ttttccagga tgccttgcag cggccggcgc gaattccgca 3917221 acagttccgt aagatccgtc gtcgtcgacg ccagcggcga aatggcgccc gcgatcggat 3917281 cccggttctt ggccaggccg ctgaccagct gctgcagctg gtcgacactg gccgaaaatt 3917341 gcgcgctctt tgcatcgacg gtcgccagca ccgcgttgag gttggtgatt acctcgccga 3917401 tcagctggtc gcgtgcgccc agcgccgccg agaaggcacc ggtgtcggcg agcacgttcg 3917461 ccaacggacc accctggccc tgcagcaact cgatgaccgc actggtgatg gtgttgatct 3917521 tgtcagcgtc aaagcctttc agcaccggcc gtagcccacc cagcaacgca tcgagatcca 3917581 gtgcgggctg ggtgtgggcc acgttgatgg tgccacccgg cggcagcttg cgcagttcac 3917641 ccggacccga cgtgatctcc aggaaccggt cgcccaccag gttttcgtac cggatcaccg 3917701 cacgcgtgga cgagtacagc gtgtagctgc ggtcgatcgc gaatgccacg tcgatgctgt 3917761 ggtctgggtt gagcttgacc gccttcactg aaccgaccgg cacaccggcg atgcgaacct 3917821 tctggcctgc cttcagccgc gacgcgtcgg tgaaggtggc gtggtagacg gttgtgggac 3917881 caaaccggaa gtccccgaag accaccacca gaccggcggc caccagcagc atgaccaccg 3917941 cgaagacgct gaccttgatc accatcgacc ggtgcgaggg aacgcccgag cccgccatca 3918001 gaagtcgtcc cgttccgcga acgcaccgtt gaacaggaac tgcagcgtcg acggcgcgtc 3918061 aacctgtaac tcggtgaacg gctggtatgg gatcaaagcg ttgtcggtga ccaggaacgg 3918121 cgcgcggtag aacgacccgc ccgtctgctt ggtcgggata tcgggcaacc ctcggcagtt 3918181 cggaccgccg gaggcgttga cgatcggcag gctctccgga taggtgtacg acggcgcacc 3918241 caacacgaag ctcgacgagg tgaacagccc agccttacgg acaccgatta gcggggcaaa 3918301 ctccttgaca ccgcgcgcga tgcccttgaa aaggcagccg aataccgggg agtagtcgga 3918361 ggtcactttg agcggggctc ggagccggtt gatggcgtcg atgaaattct gttcggcggg 3918421 cgccaacgtc tcataggcgt tattagacag accgatggtg gctagcagcg tgtcgttgag 3918481 gttgtccttc tggtcgacga tcgtcttgtt gatcgtcggc aggttatcga acacggtgtt 3918541 caggtccccg gcggcgtcag catagacgtt ggccaccacc gccgccttgc ggaaatcctc 3918601 ctgaagggcg ggtaactttg ggttcgcttg gcgggtcagc gtgttcagtc ccgacaacag 3918661 cgcacccagg tcatcgccgt ggccgcgcag gccttcggac agcgcgctca gcgtcgcgtt 3918721 cgtttcaagc ggatcgatct tgtgtagcag gtcgatgagc gattggaaca acgtgttgac 3918781 ctcaagctgt acctgagacg ccgccacgtg cgcattcgga cttagcggct tgggcgacgg 3918841 cgtctttggc ggaatgaatt ccaccgattt ggcgccgaag atggtgtttc cggcgatgcg 3918901 caccgtcgcg ttggagggga taaaacccat ctcgccgctg tcgatggcca gcttgagccg 3918961 tgcttggttg ccgctgtagc tgatatccgt gaccttgcct acctggatgc cacggtattt 3919021 gaccttggcg cccttctcca taaccaggcc ggccctcggc gacgataccg tgacggtgtc 3919081 cgtagacgtg aaagccgccg tatacgaaag ataagtcagc actgcggatc ccaccatcag 3919141 cccggccagc agcgccgctg ccaccctgac actggtgcgt cgagatccgc cgccggacat 3919201 gtttcctttc tgaaggtttt taccccgaga ggttgaagtt accggacgcg ccgtagacgg 3919261 cgagcgagat gaacaaggtg atgacaacaa ccacgatcag cgaggtccgt acggcctgcc 3919321 cgaccgcgac cccgacccca accgacccgc cgctggcgtt gtagccgtag taggtatgca 3919381 ccagcattac cgcgatcgac atagcgatgg cttgcataaa cgaccacaac aggtcggagg 3919441 ggatgaggaa ggtgttgaag taatggtcat aaaggcccgc ggactgccca ttgacgaaca 3919501 ccgtggtgaa acgagcggcg aagaacgcgg ccagcaccga caacgaatac aacggaatga 3919561 tcgccaccag gccggcgatc agccgggttg acaccaaata ggacaccgag tgcaccgcca 3919621 tgcattcgac ggcgtcgatc tcctcagaga cccgcatggc acccagctgc gcggtggctc 3919681 cggccccgat ggtggccgcc agcgcgatac ccgcgatcac cggcgcgaca acgcggacgt 3919741 tcaaaaacgc cgacaggaac ccggtcaacg cctcgatacc gatgtcgccc agcgacgaat 3919801 acccctgcac ggcgatcacg ccaccggacg ccagggtcaa aaaggccgcc accccgaccg 3919861 tgccgccgat catgaccagc gctccggcgc ccagcgtcat ctcggcgacc agccggaccg 3919921 tctccttccg gtagcgggtg atggcgttgg gcacatagcg catggtttcg ccgtagaaca 3919981 gcgcctgctc accgaagttg tcgaccggcc gctgcagccg cgaaaagaaa cggcgaaacc 3920041 ggatagtgac gtcgtagctc atcgcttcat caccatcgct cgctcaccgt cgtttgttac 3920101 tgcgccgaga ttcgcacacc tatagcggtc atgactacgt tgatcacgaa aaggcagatg 3920161 aacgcgtaga cgacggtctc gttgaccgca ttgcccaccc ccttgggccc acccttgacc 3920221 gtcagaccgc ggtaacaccc gaccagcccg gccatgaccc cgaacagtag cgccttgatc 3920281 tccgccagta tcaattcgcg cagtccggtg agcacggtca gaccgttgat aaacgcaccc 3920341 gggttgacgc cctgaagaaa gaccgagaac gcgtagccgc cggacaggcc aatggcgcac 3920401 accaagccgt tgagcagcag cgcaaccaat gtggacgcca acaccctggg gaccacgagc 3920461 cgttgaattg ggtcgatgcc cagcacccgc atcgcgtcga tttcctcacg gatggtgcga 3920521 gcgcccaggt cggcgcagat cgccgtggcg ccagcacctg ccaccaccag cacagtcacg 3920581 accgggccca gctgggtgat ggtgccgaac gccgttccgg cgccggacaa gtcggcggcc 3920641 ccaatttcac gcaacagaat gttgagggtg aacgccacca ggaccgtgaa cggaatggac 3920701 accagcaacg tcgggactag cgaaacgcgg gccaccatcc aggtctggtc caaaaactcg 3920761 cggaactgga acggccgccg gaaagcggca cgcgcggtgt ccatcgacat ttcgaagaac 3920821 ccgccgacgg cccgggccgg aaccgcaagt tgttggatca actggggtcc ccccgtctac 3920881 tgctcgcggc gaagtctgtg agtctcctga acgcgcttag ggcccgcacg ttgcacggtg 3920941 tgagccggcc catcctaacc cagaacgagt ttgcggtgtc aacgaaccgc acaccggatc 3921001 aactgggtca atttcgctgg ttaagcccta tgttggcgtg gtgattcgga caccgattcc 3921061 aataatcggc cgcctatatc cacgggtcac tgacgcatca gatcggtcgc cgaaaagctc 3921121 tgttccggat cccgaccagc aaagtagtcc cgcagcgtcg cggtgagctc ggtgggatcc 3921181 caggacgtgc cgtccgcgct gaaccggcgc tccatgtgcg gcggtgacac cagcgtcacc 3921241 tgcggaccgt agacgatgaa cacctgaccg ttgacttccg cggcagccgg ggacgccaga 3921301 aactggacca ggcttaccac atgctgcggc gacagcgggt cgatctggcc cgcttcgaca 3921361 tcgggtgcgg cgccgaagac atcggccgtc atcgcggtgc gcgcccgcgg acaaatcaca 3921421 ttggcgcaaa cgccgtagcg cccgagcgcc cgcgccgccg acagggttag cgcggtgatg 3921481 ccagccttgg cggcggcgta attcgcctgc cccaccgggc ccaccagacc cgcctccgac 3921541 gaggtgttga cgagccggcc gaagaccgat cccccttcgg catccttggc tttgtcccgc 3921601 cagtaggcag cggcgttgcg ggtgagcaga aaatggccgc gcaggtgcac cgcgatcacg 3921661 gcgtcccact cctcgtcgga catgttgaac agcatccggt cgcgggtgat gccggcattg 3921721 ttcaccacga tgtccagtcc gcccagcccg acggcgctgg cgagcagttc gtcggccgtc 3921781 gcgcgctggc tgatatcacc ggctaccgcg acggccttag caccagcatc ggcagcggcg 3921841 gcgccgatct cgtcgacgac gtcggaagca tccagggcgg aagcaacatc gttgacgacg 3921901 acggtggcgc ccaaccgggc caggccgagc gcttcggccc gacccaaacc cgcggccgcg 3921961 ccggtgacca ccgccacctt tccggacaga tcggtcgtgt tcgtggtacg cggcgagcga 3922021 ttggactcag tcaatttcaa tttatgaata cctctagttc cgtcctactc accacgcgac 3922081 aacgccgcac gcgggcattc cgcgatggcc tgctcggcca gatcctcctg atcaaccggg 3922141 atcggatcgg tcttgaccac ggcatagtcc tcgtcgtcca ggtcgaagat atccggtgcg 3922201 attcccaagc acaccgcgtt gccttcacat cggtctcggt ccacgatcac ccgcacggca 3922261 ccctccttac cctgaccatc cccccggtcg ctgctagttc caccataagg ccctgctaca 3922321 tccgaggaaa cggtcgctgg attcagagac tagaacgtgt tacaaccggg aagacggccg 3922381 ggttgccgtt ggcgttggtt gtcgacagct agtggacggc tgctgacggc cagtgataaa 3922441 gacgcgatca ttcaatcgga ggcagctgag atgcgcatca gttacacccc gcagcaggag 3922501 gagctgcgcc gcgagctgcg ctcgtacttt gccacgttga tgacgccgga acgccgggag 3922561 gcgctgagct cggtccaggg tgaatacggc gtcggcaatg tctaccggga gacgatcgcg 3922621 caaatgggcc gcgacgggtg gcttgcgctg ggctggccca aggaatacgg cggccagggc 3922681 cgctcggcga tggaccagct gatcttcacc gatgaagccg ccatcgccgg tgcaccggtg 3922741 ccgttcctga ccatcaacag cgtggcgccg acgatcatgg cctacggaac cgacgagcag 3922801 aagaggtttt tcctgccccg gatcgccgcc ggggatctgc acttctcgat cggctactcc 3922861 gagcccggcg ccggcaccga cctggccaac ctgcgcacca ccgcggttcg cgacggcgat 3922921 gactatgtgg tcaacggcca gaagatgtgg accagcctga ttcagtacgc cgactacgtc 3922981 tggttagcgg tacgcaccaa cccggagtct tctggggcca aaaaacaccg tggcatatcg 3923041 gtgttaatcg tgccgacgac cgctgagggc ttctcctgga ctccagtgca caccatggcc 3923101 ggtccggaca ccagcgccac ctactactcc gacgtgcggg taccggtggc caaccgggtc 3923161 ggtgaggaaa acgccggctg gaagctggtg accaaccagc tcaaccacga gcgggtcgcc 3923221 ctggtgtcgc cggcaccgat tttcggatgc ctgcgcgagg tccgcgaatg ggcacaaaac 3923281 accaaggacg ccggcggcac caggctgatc gactcggagt gggtgcagct caacctggcc 3923341 cgggtacacg ccaaggccga agtcctcaag ctgatcaact gggagctggc ttcctcgcaa 3923401 agtgggccga aggacgctgg accgtcaccg gccgatgcgt cggcggccaa ggtgttcggt 3923461 accgagctgg ccaccgaggc ctaccggctg ctgatggagg tgttgggcac tgcggcgacc 3923521 ctgcgccaga attcgccagg cgcgttgctg cgcggccgcg tcgaacggat gcaccgggcg 3923581 tgcctgatcc tgacgttcgg cggcggcacc aacgaagtcc agcgcgacat catcggcatg 3923641 gtcgcgctgg gactgccgcg agccaaccgc tgagcggacc tgagaggaca agacgtcatg 3923701 gatttcacga caaccgaagc cgcccaggat cttggtggtc tggtcgacac catcgtggac 3923761 gcggtgtgca cgccggagca tcaacgtgag ctggacaagc tcgagcagcg gttcgaccgc 3923821 gagctgtggc gcaagctgat agacgccggc atcctgtcca gtgcggcgcc ggagtcgctg 3923881 ggcggcgatg gcttcggcgt gctcgagcag gttgcggtgc tggtggcgtt ggggcatcaa 3923941 ctggccgcgg tgccgtacct ggagtcggtg gtgctcgccg ccggcgccct ggcccggttc 3924001 ggctcgccgg aactgcagca gggctggggg gtgtcggcgg tctccggcga tcggatcctc 3924061 accgtcgccc tcgacggtga gatgggcgag ggtccggtgc aggccgccgg caccggacat 3924121 ggctaccgcc tcaccggcac acgcacccag gtcgggtacg gcccggtggc cgacgcattt 3924181 ctggtacccg ccgaaaccga ttccggtgca gccgttttcc tggttgccgc cggcgaccca 3924241 ggggttgcgg tgaccgcact ggccaccacc ggactgggca gcgtcggaca cctcgagcta 3924301 aacggggcca aagtggacgc cgcccgcagg gtcggcggaa ccgatgtcgc ggtttggctc 3924361 ggcacgcttt ccaccctgag ccgcaccgct tttcagctcg gtgtgctcga gcgcggactg 3924421 caaatgacgg ccgaatatgc gcgcacccgt gaacaattcg accgcccgat cggcagcttc 3924481 caggcggtgg ggcaacggtt ggctgacggc tacatcgacg tcaagggatt gcgactgacg 3924541 cttacccagg cggcctggcg ggtggccgaa gattccctgg caagccggga gtgcccccag 3924601 ccagccgaca tcgacgtcgc caccgcgggg ttctgggccg ccgaagccgg gcatcgggtg 3924661 gcgcatacca tcgtgcatgt gcatggcggc gtcggcgtcg acaccgatca tcccgtacac 3924721 cggtatttcc tggccgccaa gcagaccgag ttcgcgttgg gcggcgccac cggtcagctc 3924781 cgccgaatcg gccgtgaact ggcggaaacc cctgcctagc cctgcctagc ccggcgacga 3924841 tgcggtccgc gcagcggacc gagaaggagc gggcgaatcg aacccaccga tgactcccac 3924901 tcacccgacc gtcaccgaac ttctgctgcc gctatccgaa atcgacgatc ggggcgtcta 3924961 tttcgaggac tcgttcacca gttggcgcga ccacatccgg cacggtgccg caatcgccgc 3925021 agcgctgcgg gaacgcctgg acccggcgcg gccgccacac gtcggtgtgt tactgcagaa 3925081 cacgccgttc ttctcggcga cactggtggc cggcgcgctg tcggggatcg tcccggtggg 3925141 cctcaacccg gtgcgccgcg gcgcggcact ggccggcgac atcgctaaag ccgactgcca 3925201 gttggtgctc accggctcgg gatcggcgga ggtaccggcc gatgtcgagc acatcaatgt 3925261 cgactccccc gaatggaccg acgaggtggc cgcacaccgg gataccgagg tgcgttttcg 3925321 atccgcggat ctcgcagacc ttttcatgct gatcttcacc tcgggcacca gcggcgaccc 3925381 gaaggcggtg aagtgcagcc accgcaaggt tgcgatcgcc ggcgtgacga tcacgcagcg 3925441 cttcagtctg ggccgcgacg acgtctgcta cgtctcgatg ccgttgttcc attccaacgc 3925501 ggtgctggtc ggctgggcgg tggctgcggc ctgccaaggc tcaatggcgt tgcgacgcaa 3925561 attttcggcg tcgcagttcc tggccgacgt ccgccgttat ggcgccactt acgccaacta 3925621 cgtgggcaag cctctttcgt atgtgcttgc gacaccggag cttcccgacg acgcggacaa 3925681 cccgctgcgg gcggtgtacg gcaacgaggg agtacccggt gacatcgacc gtttcgggcg 3925741 caggttcggc tgcgttgtca tggacggctt cggctcgact gaaggcgggg tggcgatcac 3925801 gcggacactc gacaccccgg cgggcgccct gggcccactg ccggggggaa tccaaatcgt 3925861 cgaccccgac accggcgaac cgtgcccgac aggagtggtc ggcgaactgg tcaacaccgc 3925921 cgggccgggc ggtttcgaag gctattacaa cgacgaggcc gccgaggccg agcggatggc 3925981 cggcggcgtc taccacagtg gcgacctcgc ctatcgcgac gacgccggct acgcctattt 3926041 cgccggtcgg ctcggcgact ggatgcgagt cgacggtgaa aatctaggca ccgcaccgat 3926101 cgagcgggtg ctgatgcgct acccggacgc caccgaggtc gctgtgtatc cggtacccga 3926161 tccggtggtg ggtgatcagg tgatggccgc gttagtgttg gcgcccggca ccaaattcga 3926221 tgccgacaag ttccgggcgt ttctgaccga gcagcccgac ctggggcaca agcagtggcc 3926281 gtcgtatgtg cgggtcagcg cggggctgcc gcgcaccatg accttcaagg tgatcaagcg 3926341 ccagttgtcg gccgaaggtg tcgcctgcgc cgatccggtg tggccgattc gccggtagcc 3926401 tcacggcgcg ccaccatgct caccgggatc tggccggatg gtggacccga ataatcgggt 3926461 agaaccgccg aatgagctgc ccggatcgcg atacgatcca ttcctagcaa ttgcaccgat 3926521 gatgcacggc cgcggccggg ttcggcttgg gctggtgcga ggtaccggat gtcgtttgtg 3926581 ttggtttcgc cggagaccgt ggcggcggtg gccacggatc tcaagcgcat cggcgcctcg 3926641 ctggcccacg aaaacgcgtc ggcggccgct tcgacgacgg cggtggtctc cgcggccgcc 3926701 gacgaggtat cgacggcggt cgccgctctg ttctcccaac acgcccaggg ctaccaagcg 3926761 gcggccgctc aggtagcagc gtttcatagc cggtttgtgc aagccctgac ggccggtgcc 3926821 ggggcgtacg catttgccga ggcggccaac gcgtcgccgc tacagtcagc catgggtgcg 3926881 gtaagcgcgt ctgcgcagac gctgttgtcg cgcccgttga tcggcaatgg cgccaatgcg 3926941 acgacgccgg gcggtaacgg cggcgacggc ggatggctat tcggcagcgg cggcaacggc 3927001 gcgcccggcg cggcgggcca gtccggcggt aacggcgggt cagccggact gtggggtaac 3927061 ggcggcgcgg gtggcgccgg cggcagcggc ggcgccgccg gcggcaacgg cggtaacggc 3927121 gggtggctgt tcggcgccgg cggcaccggc ggtatcggcg gcaccggtgc tcccggcgcc 3927181 atgggcggca ccggcggcaa cggcggcaac ggcgcgctgc tgatcggcgg cggcggcctc 3927241 ggcggcgccg gcggcatggg tggcaccggc ggcggcaccg gcggcaccgg cggcaacggc 3927301 ggcaacggcg cgctgctgat cggcgctggt ggtgtcggag gtgctggcgg gatcggtggc 3927361 cagggtaccg gcgccggcgg tgccgccggc gccggcggca ccgggggcaa cggcggcgcc 3927421 ggggggttgt tcatgaacgg cggcgacggc ggcgccggcg gtcaaggcgg cgacggtgcg 3927481 gccggcgacg cggctgccag cgccggcggc accggcggca aaggcggcca aggcggcgac 3927541 ggcggcaccg gaggggccgg cggcgcaggc ccagtgctgt tcggccacgg cggcgccggc 3927601 ggcatgggcg gccaaggcgg caccggtgga atgggcggcg ccggcggaga cggcaccacc 3927661 gtcatcgcgg ccggtaccgg gggggagggc ggcaccggcg gcgcggccgg cgccggcgga 3927721 gccgcaggcg ctcgcggggc tctcaccagc ggcggcctag ccggcggcgt cggggccggc 3927781 ggcaccggcg gcaccggcgg taccggcggc aacggcgctg acgccgctgc tgtggtgggc 3927841 ttcggcgcga acggcgaccc tggcttcgct ggcggcaaag gcggtaacgg cggaataggt 3927901 ggggccgcgg tgacaggcgg ggtcgccggc gacggcggca ccggcggcaa aggtggcacc 3927961 ggcggtgccg gcggcgccgg caacgacgcc ggcagcaccg gcaatcccgg cggtaagggc 3928021 ggcgacggcg ggatcggcgg tgccggcggg gccggcggcg cggccggcac cggcaacggc 3928081 ggccatgccg gcaacacagg tgacggcggc gacggcggga ccggcggtaa cggcggcaac 3928141 ggcaccggag gcgtgaacgg cgccgacaac accctcaacc ccgacacccc cggcggcgcc 3928201 ggggagcccg gcggggccgg cggggccggc ggggccggcg gggccgccgg cggcccgggc 3928261 ggtaccggcg gtaccggcgg taacggcggc aacggcggca acggcggcaa cggcggcaac 3928321 ggcggcaacg gcggcaacgg cggcaatgcc ggcaacaaca gcaccaatgc cccagtcggt 3928381 ggcgaaggcg gcgccggcgg cgacggcggc gccggcggcg caggcggggc cgccaacggc 3928441 ggcaccgcgg gcagccaggg cactgggggc gtcggcggcg acggcggcgc gggcggcaac 3928501 ggcggcggcg gcaaggctgg caccggcaac agcggcaact ttggggtgga cggcgaagcc 3928561 ggcttcagcg gcggcgccgg tggcaacggc ggcgtaggcg gggccgccgg cgccaatggc 3928621 ggaaccggcg gcagcggtgg taatggcggt gacggcggtg cgggaggcat tggcggggcc 3928681 ggcggcaacg gcataccggg cactggcaca gagcctgccg ggggcaccgg cgccaaaggt 3928741 ggagacggcg gcgacggtgg cgccggcggc gcaggcggca atgccggcgg ggccggcggc 3928801 cagggcggca atgccggcca gggtggcgcc ggcggtgcgg gcggcaacgc cgtgattccc 3928861 ggcgacggcg tcgggaaggc gccgcacggc gacgcgggcg gcagcggcgg agacggcggc 3928921 aaaggcggcc agggcggtag tggcggcacc ggcggatccg gtgccccgat cggtggcggc 3928981 gccggaggca ccggagggtc cggcggacac gccggcaagg gtggcgccgg cggcatcggc 3929041 gcacagggca ccaccatcac cgtgcccggg aacggcggca acgccggcga cggcggcaac 3929101 ggcggcaacg ccggcgccgg tggaaacggc ggctccggcg acttcggtgg caataccacc 3929161 agcggcgcct ccggcagcgg cggcaacggc ggcaacgccg gcaccgcggg tagcggcggt 3929221 gcgggcggaa ccggcggcac cggccttagc ggcggcaacg gtggcaacgg cggcaacggc 3929281 ggcaacggcg gtgacggcgg taacggcgcc cacggcaccg tcggcgccca gttcgtcccg 3929341 gccaccagct tgcccacacc caacggcggg gccggtggca acggtggcac cggaagcaac 3929401 ggcggcgcgc ccggccccgc cggggcgccc ggccccacta ccggcggtaa cgctggcagc 3929461 cagggcatcg gcggcgacgg gggcaacggc ggcgacggcg gtaaaggcgg tgacggcgcc 3929521 gacgctgtca acgtcgtatt catgccgact gagccacagg ccgcgaccgg cactgccggc 3929581 agcgccggtg accccaccgg cggtaacgga gggcccggca ctcccggcag ccccatggtt 3929641 gccccgcccc cgccaacgcc aatcactcaa gtccaacagg gcggtgacgg tggcgccggg 3929701 ggcaccggat ccaccaacgc caacgacggc acagccaccg gcggaaaggg cggagaaggc 3929761 ggagtcggca gcattctcgg cgggcccggc ggcaacggcg gaactggcgg caacgcctcg 3929821 gcaaccggca ccaacggggt ggccaacgcc gggaatggcg gcaagggtgg cgacggcggc 3929881 cagtttgggg ccggcggcaa cggtggtgcc ggcggcagcg taaccgacgg atccgccggc 3929941 agcaccgcag gcaacggcgg caacggcggc aacgcaacca acggcaccat cgcaggccaa 3930001 cccgccggcg gcaacggctc ggccggcggg aaaggcggcg acggcggcaa catcgccgcc 3930061 ggtgccaccg gcaccgccgg caacggcggg aacggcggca acggcaacga cggcgccgtc 3930121 aacgccggca ccggcggctc cggcgggaac ggcggtaacg ccggtggcgg cggcgccaat 3930181 ggcggcgacg gcggcgccgg cggcgccggc ggggccggcg ggcgtggcgg caagggcatc 3930241 gacggcgggt tcggcggtga cggcggcaac ggcggcagca acaacggcac cggcgccggt 3930301 ggcaacggcg gcaacggcgg caccggcggg gtcggctcgg ttggcgcggc tggtggcgat 3930361 ggcggcaacg gcggcaccgg cggcttcgcc ggtttcggcg gcaccgcagg caatggcggt 3930421 tccggcggca cgggcggggc cggcggcgac ggcggcaccg gcggggacgg cggcaacggc 3930481 gttatcgccg gcggcggggg gaccggcggc aacggcggcg ccagcggggc cggcggcgcc 3930541 ggcggcacgg gcgggttcgc cggcaacggc aatgccggcg gcaatggcgg caccggcggc 3930601 gcgagcgagg acggcgacaa cggcaacgct ggcagcggcg ccaccggcgg taccggcggc 3930661 aacggcggca ccggcggcga cggcggcgct gccgggctgg gcggcgtcgc gtgaggttga 3930721 ccggcgatca ccgtagccag cacggcccgt gacaccggtc cggcacgcca ccctcgtcgt 3930781 tcaggtggtg tcgccactcg cgctacacaa cgcttcacgg cactcgtcga gacttatgct 3930841 cgagttctga tacgtggagc aactgttttg gcgttcgacc cgtattgcgc aggtggcggt 3930901 actggaaaac gtagacgtgt tgggcgggtg acgaataaga tcctggccta actactgcgt 3930961 caattatgcc gcggtggccg cgccgtccgg ttgggagttc gcccatgtcg ttcgtgttga 3931021 tcgcaccgga attcgtgaca gcagccgcgg gggatctgac gaatctgggt tcgtcgatta 3931081 gcgcggccaa cgcgtcggca gccagtgcga ccacgcaggt gctggctgcg ggcgccgatg 3931141 aggtgtctgc ccgtattgcg gcgctgttcg gcgggtttgg cctggagtac caggcgatta 3931201 gtgcgcaggt ggcggcctac caccagcggt ttgtgcaggc cttgagtacc ggcgcgggcg 3931261 catatgcctc ggccgaggcc gccgccgctg agcagatcgt gctgggcgtg atcaatgcgc 3931321 ccacccaggc gctgctgggg cgcccgttga tcggtgacgg cgccaatgcg acgactcccg 3931381 gcggggccgg cggggccggc ggtctgctgt tcggcaacgg cggggccggg gcagccgggg 3931441 cgcccggcca ggccggcggg cctggcgggc ccgccggatt gtggggcaac ggcgggcccg 3931501 gcggggccgg cggcagcggt gggggcaccg gcggtgccgg cggcgccggt gggtggctgt 3931561 tcggggttgg cggcgccggc ggtgtcggtg gggccggtgg cggcaccggc ggggcgggcg 3931621 ggcccggtgg tttgatctgg ggcggcggcg gggccggcgg tgtcggtggg gccggtggcg 3931681 gcaccggcgg ggccggcggc cgcgccgagc tgctgttcgg cgccggcggt gcgggtgggg 3931741 cgggcaccga cggcgggccc ggtgctaccg gcgggaccgg cggacacggc ggagtcggcg 3931801 gcgacggcgg atggctggca cccggcgggg ccggcggggc cggcgggcaa ggcggggcag 3931861 gtggtgccgg cagcgatggt ggcgcgttgg gtggtaccgg cgggacgggc ggtaccggcg 3931921 gcgccggtgg cgccggcggt cgcggcgcac tgctgctggg cgctggcgga cagggcggcc 3931981 tcggcggcgc cggcggacaa ggcggcaccg gcggggccgg cggagatggc gttctggggg 3932041 gtgtcggtgg cactggtggt aagggcggtg tcggcggcgt ggctggcctc ggcggggccg 3932101 gtggtgccgc gggccagctc ttcagcgccg gaggcgcggc gggtgccgtt ggggttggcg 3932161 gcaccggcgg ccagggtggg gctggcggtg ccggagcggc cggcgccgac gcccccgcca 3932221 gcacaggtct aaccggtggt accgggttcg ctggcggggc cggcggcgtc ggcggccagg 3932281 gcggcaacgc cattgccggc ggcatcaacg gctccggtgg tgccggcggc accggcggcc 3932341 aaggcggcgc cggcggcatg ggtggctccg gtgctgataa tgccagcggg attggcgccg 3932401 acggcggcgc gggtgggact ggcggtaacg ccggcgccgg cggggccggc ggggccgccg 3932461 gcaccggagg aaccggcggg gttgtcggcg ccgcgggcaa ggccggtatc ggcggcaccg 3932521 gcggccaagg cggcgccggc ggcgcgggca gcgccggcac ggatgcgacc gctaccggtg 3932581 ccaccggcgg caccgggttt tccggtggag ccggcggggc cggcggggcc ggcggcaaca 3932641 ccggggttgg cggcaccaac ggctccggcg ggcaaggcgg caccggcggc gcgggcggcg 3932701 ccggtggtgc tggcggtgtc ggcgccgaca accccaccgg catcggcggc accggcggca 3932761 ccggcgggaa aggcggcgcc ggcggggccg gcgggcaggg cggtagcagc ggtgccggcg 3932821 gcaccaacgg ctctggtggc gctggcggca ccggcggaca aggcggcgcc gggggcgctg 3932881 gcggggccgg cgccgataac cccaccggca tcggcggcgc cggcggcacc ggcggcaccg 3932941 gcggagcggc cggagccggc ggggccggtg gcgccatcgg taccggcggc accggcggcg 3933001 cggtgggcag cgtcggtaac gccgggatcg gcggtaccgg cggtacgggt ggtgtcggtg 3933061 gtgctggtgg tgcaggtgcg gctgcggccg ctggcagcag cgctaccggt ggcgccgggt 3933121 tcgccggcgg cgccggcgga gaaggcggag cgggcggcaa cagcggtgtg ggcggcacca 3933181 acggctccgg cggcgccggc ggtgcaggcg gcaagggcgg caccggaggt gccggcgggt 3933241 ccggcgcgga caaccccacc ggtgctggtt tcgccggtgg cgccggcggc acaggtggcg 3933301 cggccggcgc cggcggggcc ggcggggcga ccggtaccgg cggcaccggc ggcgttgtcg 3933361 gcgccaccgg tagtgcaggc atcggcgggg ccggcggccg cggcggtgac ggcggcgatg 3933421 gggccagcgg tctcggcctg ggcctctccg gctttgacgg cggccaaggc ggccaaggcg 3933481 gggccggcgg cagcgccggc gccggcggca tcaacggggc cggcggggcc ggcggcaacg 3933541 gcggcgacgg cggggacggc gcaaccggtg ccgcaggtct cggcgacaac ggcggggtcg 3933601 gcggtgacgg tggggccggt ggcgccgccg gcaacggcgg caacgcgggc gtcggcctga 3933661 cagccaaggc cggcgacggc ggcgccgcgg gcaatggcgg caacgggggc gccggcggtg 3933721 ctggcggggc cggcgacaac aatttcaacg gcggccaggg tggtgccggc ggccaaggcg 3933781 gccaaggcgg cctgggcggg gcaagcacca cctcgatcaa cgccaacggc ggcgccggcg 3933841 gcaacggcgg caccggcggc aaaggcggcg ccggtggtgc gggaaccctg ggcgtcggcg 3933901 gctccggcgg caccggcggg gacggcggcg atgcgggctc tggtggtggc ggcggcttcg 3933961 gcggggccgc gggtaaggcc ggcggcggcg gaaacggcgg ccgcggcggt gacggcggcg 3934021 atggggccag cggtctcggc ctgggcctct ccggctttga cggcggccaa ggcggccaag 3934081 gcggggccgg cggcagcgcc ggcgccggcg gcatcaacgg ggccggcggg gccggcggca 3934141 acggcggcga cggcggggac ggcgcaaccg gtgccgcagg tctcggcgac aacggcgggg 3934201 tcggcggtga cggtggggcc ggtggcgccg ccggcaacgg cggcaacgcg ggcgtcggcc 3934261 tgacagccaa ggccggcgac ggcggcgccg cgggcaatgg cggcaacggg ggcgccggcg 3934321 gtgctggcgg ggccggcgac aacaatttca acggcggcca gggtggtgcc ggcggccaag 3934381 gcggccaagg cggcctgggc ggggcaagca ccacctcgat caacgccaac ggcggcgccg 3934441 gcggcaacgg cggcaccggc ggcaaaggcg gcgccggtgg tgcgggaacc ctgggcgtcg 3934501 gcggctccgg cggcaccggc ggggacggcg gcgatgcggg ctctggtggt ggcggcggct 3934561 tcggcggggc cgcgggtaag gccggcggcg gcggaaacgg cggtgttggc ggtgacggcg 3934621 gcgagggagc cagcggtctc ggcctgggcc tctccggctt tgacggcggc caaggcggcc 3934681 aaggcggggc cggcggcagc gccggcgccg gcggcatcaa cggggccggc ggggccggcg 3934741 gcaccggcgg ggccggtggt gacggcgccc cggcgaccct gatcggcgga cccgacggcg 3934801 gtgacggcgg ccaaggcggc atcggcgggg acggcggcaa cgccggattc ggcgccggtg 3934861 ttcccggcga cggcggggac ggcggcaacg ccggattcgg cgccggtgtt cccggcgacg 3934921 gcgggatcgg cggcaccggc ggggccgggg gcgccggcgg cgccggcgcc gacggggacc 3934981 ccagcattga cggcggccaa ggtggtgccg gcggccacgg cggccaaggc ggcaaaggcg 3935041 gcctgaacag caccgggcta gccagcgccg ccagcggtga cggcggcaac ggcggggccg 3935101 gcggggccgg cggcaacggc ggcgacggcg acggctttat cggcgggtcc ggcggcaccg 3935161 gcgggaccgg cggcgacgcc ggcgtcggcg gcctggccaa caccggcgga accgcgggca 3935221 acgccggtat cggcggggcc ggcggccgcg gcggcgacgg cggggccggc gacagcggcg 3935281 ccctctccca agacggcaac ggcttcgccg gcggccaagg cggccaaggc ggggtcggcg 3935341 gcaacgccgg cgccggcggc atcaacgggg ccggcggcac cggcggcacc ggcggggccg 3935401 gtggtgacgg ccagaacgga acgacaggcg tggcgagcga gggcggcgcc ggcggccaag 3935461 gcggtgacgg cggccaaggc ggcatcggcg gggccggcgg caacgccgga ttcggcgccg 3935521 gtgttcccgg cgacggcggg atcggcggca ccggcggggc cgggggcgcc ggcggcgccg 3935581 gcgccgacgg ggaccccagc attgacggcg gccaaggtgg tgccggcggc cacggcggcc 3935641 aaggcggcaa aggcggcctg aacagcaccg ggctagccag cgccgccagc ggtgacggcg 3935701 gcaacggcgg ggccggcggg gccggcggca acggcggcga cggcgacggc tttatcggcg 3935761 ggtccggcgg caccggcggg accggcggcg acgccggcgt cggcggcctg gccaacaccg 3935821 gcggaaccgc gggcaacgcc ggtatcggcg gggccggcgg ccgcggcggc gacggcgggg 3935881 ccggcgacag cggcgccctc tcccaagacg gcaacggctt cgccggcggc caaggcggcc 3935941 aaggcggggt cggcggcaac gccggcgccg gcggcatcaa cggggccggc ggcaccggcg 3936001 gcaccggcgg ggccggtggt gacggccaga acggaacgac aggcgtggcg agcgagggcg 3936061 gcgccggcgg ccaaggcggt gacggcggcc aaggcggcat cggcggggcc ggcggcaacg 3936121 ccggattcgg cgccggtgtt cccggcgacg gcgggatcgg cggcaccggc ggggccgggg 3936181 gcgccggcgg cgccggcgcc gacggggacc ccagcattga cggcggccaa ggtggtgccg 3936241 gcggccacgg cggccaaggc ggcaaaggcg gcctgaacag caccgggcta gccagcgccg 3936301 ccagcggtga cggcggcaac ggcggggccg gcggggccgg cggcaacggc ggagccggcg 3936361 ggctcggcgg gggcggtggc acaggcggca ccaacggcaa cggcggcctc ggcggaggcg 3936421 gcggcaacgg cggagccggc ggtgccgggg gaacgcccac cggcagtggc accgagggga 3936481 ccggcggcga cggtggagat gccggcgccg gcggcaacgg cggctctgcc accggcgtcg 3936541 gtaacggcgg taacggcggt gatggcggca acggcggcga cggcggcaac ggcgcacccg 3936601 gcggcttcgg tggcggcgct ggcgccggcg gcttgggcgg ctccggcgcc ggcggcggca 3936661 ccgacggcga cgacggcaac ggcggcagcc ccggcaccga cggcagctaa gctaacggca 3936721 gcccaaagcg ccagcagcca cccgacaacg ctgggcggct acccatggcc cgttggcagc 3936781 acaggctggc gatggccgtc cgaccgataa cacccgggcc atcgcatccc cagcacaacc 3936841 agctgtcctc gcgggcttat gcacgacggg ggagcactac cccacaagcg atggcaccac 3936901 tacatcgatc agatgcggcc cgggctcggc gaaggccgcg cgcagggcgt cggcgaattc 3936961 ctcgcaggtg gtgacacgac gtgcaggaac acccatacct tcggcgatct tgacgaaatc 3937021 cattgtggga cgcgatatat caaggagatc cagggccttc gggccaggat ccgaccccgc 3937081 gccgacacgt tgcagctcga tccgcagaat gtcgtaggcg ccgttgttgt agatgacggt 3937141 ggtgacgtcg aggttctccc gcgcttggct ccacaatcct gaaatcgtgt acattgccga 3937201 cccgtcggat tccaggcaca acaccgggcg gtcgggcgcg gcgaccgcgg caccgaccgc 3937261 agccgggatg ccgtaaccga ttgccccgcc ggtcagcgta agccagtcat gggccggggc 3937321 cccggcggtg gcctgcggca gcaggacacc acaagtattc gactcgtcga caacaatcgc 3937381 ccgttccggc agcaacgcac cgaccacatc ggccgccgac accgacgtca ggtcacccgt 3937441 cggcagctgc ggacgtgacg cgcccgccac cggggcaacc gtcccgggcg ctacctcgtc 3937501 ggccaacgcg gccagtgcgt cggccgcacc accgggttcg gcaagcacgt gcacctcaca 3937561 accggccggc accaggtcac tgggcatacc cgggtaggcg aaaaacgaca ccggcgacct 3937621 ggccccggcc agcacgagat gtttgacccc gtccagctgg gccgcggcac cttcagcgaa 3937681 ataggccagc cgttcgacgg cggggatacc ggcgccacgt tccaggcacg tcggaaacgt 3937741 ctcgcataac caacgggccc cggttgcctg cacgatccgc gcagccgcgg tcagccccgg 3937801 cccgcgggtg gcatccccac cgatcagcat catggcgggt tcccctgagc gcagcacccc 3937861 agccaccggc cccacgtcca ctggcgccgc cgccgcctga gccggcacgc ccgcggccgc 3937921 gtgggcaccg tcgctccaac acacatccgc gggcagaatc agcgtcgcga tctgtgaacc 3937981 tgaccggctg gccgcaatgg ccgcttcagc gtcggccccg acgtcggcgg cagcctccgt 3938041 ccggcgcacc catcccgaaa cggtgccagc gaccgcatcg atatcggatt ccagcggggc 3938101 gtcgtacttc ttgtggtaag tcgcgtggtc tccgacaacc accaccatcg gcacccgggc 3938161 acggcgcgcg ttgtgcaggt tggccaggcc gttgcccagt ccggggccca gatgcagcag 3938221 caccgccgcc ggccggccag caatgcgggc ataaccgtcg gcggccccgg tagccacgcc 3938281 ttcgaacagg gtcagcatgc cacgcatgcg cgggacggcg tctagcgccg ccacgaaatg 3938341 catttccgac gtgccagggt tggcgaagca cacatcgaca cccccgtcga ccagggtgtt 3938401 gatcagggcc tgagcaccgt tcacgtctgc acctttcctc gtgggtccag cttgaatacc 3938461 cgcacagcgt tgccgtgcag aaagtcgcga cgagcttcgt cgcttagccc cagttcgtca 3938521 agaccggtca gggcgtgcgt gtgggcgatc atcgggtaat tggtaccaaa cagcaccttg 3938581 cgctgtcccg tgtcggtttt catgaaccgc accagcttcc cgggcagccg cttgatggtg 3938641 taggccgagg tgtcgatgta gacattctcg tgtttgcggg cgaccgcgac catctcctcg 3938701 gtccacggat agccgacatg tccgcacacg atcaccagtt ccggaaagtc caacgccacc 3938761 tggtcgatgt agggaatggg gcgtccggtc tccgacggcc gcagcgggcc ggtgtgacca 3938821 acctgggtgc agaacggcac cgcggactgc acgcattcgg cgaacaacgg atagtagcgg 3938881 cggtcggtcg gcggggcgcc ccatagccaa ggcaccaccc gcaggccgac gaacccctca 3938941 ccgactcggc gcctcaactc ccggacggcc gccatcgggc gatccaggtc gaccgccgcc 3939001 agaccggcaa aacggttggg gtacaaccgg acccattccg caacagcgtc attagagatg 3939061 aggtcctggc cgttggggcc acgccaggcg ctgagcaaac ccagggtgac gccgccggcg 3939121 tccatcgagg agacggtcgc ttcgatcggg atgtcggtct ccgggataga cccaccggtc 3939181 caccggcgca gcgaggcgaa catatcgccg tgtaggaacc gttgcgtcgg atgctgcatc 3939241 cacacatcga tggtcatcgc gtttcagact gtagccgccc gggcggcgac tacccgcggc 3939301 gacgctgcag atcatcgccc ggccagggtg ctaccaggtt gctgccatcc ccgaatgttc 3939361 gcggtcggag ggcgacgcga cgtgttgaaa cgccgtacgt tcgggccttc ccgcgagaag 3939421 ccctagccgc ccgagattgt ccctcccggc gttcgtggcc acgcggtgct tcgccttttt 3939481 gcccatccca aattacacgg gtggtactca cgagaaagct tggacgtatt gggcgggtgc 3939541 tgaattatga tcccgacaca actgcatcaa tttagccgcg tcgtgatgct atccgccgac 3939601 ggtttggagc tggtccgtgt cgttcgtgtt gatctcaccc gaagttgtgt ccgccgccgc 3939661 cggggatcta gcgaacgtgg gatcgacaat cagcgccgcc aacaaggcgg cagcggctgc 3939721 gaccacgcag gtgctggccg cgggcgccga tgaggtgtca gcgcgcatcg cggcgctgtt 3939781 tggtatgtac ggcctggaat atcaggcgat cagtgcgcaa gttgccgcgt atcaccagca 3939841 gttcgtgcag acgttgcgca ccggagcggc ctcgtacatg ttggccgagg ccaccaacgt 3939901 cgagcaaaat ctactgaacc tcatcaacgc gccgacccag acgctgctcg ggcgcccgct 3939961 gatcggagac ggggccaacg cgacgacgcc gggcggggcc ggcggagacg gcgggctgct 3940021 gtttggcagc ggcggcaacg gcgcgcccgg tgcacccggc caggctggcg gtgccggtgg 3940081 gtctgccggg ctactgggca acggcgggag cggcggagcc ggcgggacgg gcgcgcccgg 3940141 cggaaacggc ggcaatgccg gttggctata cggccgcggc ggagtcggcg gcgccggggg 3940201 aatcggcggc ggaacaggcg gggccggcgg gcacgcgtgg ctgttcggcc acgggggaac 3940261 cggcggtatc ggtggcgggc ccggcggcaa cggcgggtgg ctgctcggca acggcggaca 3940321 tggcggcgct ggcggaatcg gtggcggcag cggcggcgct ggcgggaacg gcgggtggct 3940381 gctcggcaac ggcggtatcg gcggagcggg cggaaccggc ggcggagcgg gcggcaccgg 3940441 tggcaacgcc gcgtggctgc tcggcggtgg tggtaccggc ggcgccggcg gaatcggtgg 3940501 tggcaacggc gggcacggcg gcaacggcgg gtggctgctc ggcaacggcg gcaacggcgg 3940561 cctcggcggt gacggtgacg gcggtactgg cggcggccac ggcggcaacg gcgggaatcc 3940621 cgggtggctc ttgggcacag ccgggggtgg cggcaacggt ggcgccggca gcaccggtac 3940681 tgcaggtggc ggctctgggg gcaccggcgg cgacggcggg accggcgggc gtggcggcct 3940741 gttaatgggc gccggcgccg gcgggcacgg tggcactggc ggcgcgggcg gtgccggtgt 3940801 caacggtggc ggcgccggcg gggccggcgg ggccggcggc aacggcggcg ccgggggtca 3940861 agccgccctg ctgttcgggc gcggcggcac cggcggagcc ggcggctacg gcggcgatgg 3940921 cggtggcggc ggtgacggct tcgacggcac gatggccggc ctgggtggta ccggtggcag 3940981 cggcggcacc ggcggtgacg gcggcgcccc cggcaacggt ggcgccgggg gtgccggcca 3941041 gttgttgagc catagcggcg tggccggtgc tagcggcaaa ggtggtgccg gcggcaccgg 3941101 cggcaacggc ggggccggca gtgccggcgc cgacgccccc gcaggctccg gcgcgatggg 3941161 tagcactggc tttgctggcg gcgccggcgg tgacggcggt aacggcggcg ggagcggtgc 3941221 cagccaaggc aacggcggca acggcggcaa cggcggcacc ggcggcaaag gcggcaccgg 3941281 cggggccggc atgaacagcc tcgacccgct gctagccgcc caagacggcg gccaaggcgg 3941341 caccggcggc accggcggca acgccggcgc cggcggcacc ggcttcaccc aaggcgccga 3941401 cggcaacgcc ggcaacggcg gtgacggcgg ggtcggcggc aacggcggaa acggcgcaga 3941461 caacaccacc accgccgccg ccggcaccac aggcggggcc ggcggggccg gcggggccgg 3941521 cggaaccggc ggagccgccg gcaccggcac cggcggccaa caaggcaacg gcggcaacgg 3941581 cggcaacggc ggcaccggcg gcaaaggcgg caccggcggg gccggcatga acagcctcga 3941641 cccgctgcta gccgcccaag acggcggcca aggcggcacc ggcggcaccg gcggcaacgc 3941701 cggcgccggc ggcaccggct tcaccccaag gcgccgacgg caacgccggc aacggcggtg 3941761 acggcggggt cggcggcaac ggcggaaacg gcgcagacaa caccaccacc gccgccgccg 3941821 gcaccacagg cggggccggc ggggccggcg gggccggcgg aaccggcgga accggcggag 3941881 ccgccggcac cggcaccggc ggccaacaag gcaacggcgg caacggcggc aacggcggca 3941941 ccggcggcaa aggcggcacc ggcggcgacg gtgcactcgc aggcagcagc ggtggtgccg 3942001 gcggtaaagg cggcaacggc ggcgacgccg gcaaggccgg taccggctcc gctcctggca 3942061 cggcggggac cggcggcgat gggggtaagg gcggcaacgg cggcattggc gctgccggca 3942121 caaccggccc cgtaggcacc ggcgcgtccg gcggcaccgg tggtagtggt ggcgccggcg 3942181 gaaccggcgg tgacggcggc gccgccaacg gcggcaccgc cggggctggc ggggcgggcg 3942241 gcaatggcgg caaaggcggc gacggtggag caggcgtcac cagcagcacc gccggcaaca 3942301 gcggcggcgc gggcggcagc ggcggaaagg gcggagacgc gggcgcgggc ggcgccggtg 3942361 ccactccggg cgccaacggt atcgctggca atggcggcga cggcggagat ggcgcggctg 3942421 gtgccgtcgg catctccggc gcaaccggcg ctggcgacgg cgggcatggc ggaaccggcg 3942481 cggccggcgg caacggtgga accggcggtg ctggcggtag cggcatcgac ggcgtcggcg 3942541 gcgggaccgg aggtaccggc ggcaacggcg gcaacggcgc catcggcggc gctggcggag 3942601 acgccggtgg tagcggaaat agcggcggaa acggtgggat tggcggaaag ggcggaaacg 3942661 ccggtgccgg tggtgccgcg ggcagcaacg gcggtaccgt cggcgccaac ggtaccggcg 3942721 gcgacggcgg caacggcggc gctgccgggg ccgccacggc tggcagcaac ggtggggccg 3942781 gcaccggctc ggccggcggc aacggcggca ccggcggcag aggcggcagt ggtggcgccg 3942841 gcggcgacgg tatcggtggc gtcggcggcg gcaagggcgg caacggcgcg gacggcgaag 3942901 tcggcggtgc gggcggcgcc ggcggcagcg ggcccaacac cagtcccggc ggcaacggcg 3942961 ggcaaggagg tcaaggcggc agcggtggtg ccggtggggc ggccggggct ggcggcgccg 3943021 gtggcggcgc taacggcacc gctggcaacg gcggccaagg cggtgccggc ggcaccggcg 3943081 gcgccggcgc agcctcctca gctaccaacg gcggcagcgg cggcgccggc ggcaccggag 3943141 gcgacggcgg cagcggcggc gccggcggca ccggaggcgc cggcggcacc ggcggggcgg 3943201 ccggcgacgg cggacaaggt ggccagggcg gcgccggcgg cggtgccggt ggtcaaggtg 3943261 gtgccggcgg tgccggcggg accggcggca acggcggcaa tatcaccggc ggcaccgcgg 3943321 gcaccgcggg ggccgccggt aacggcggcg ccgccggaaa gggtggcgcc ggcggccaag 3943381 gcggcaccgg tggcgggacc gggggtcagg gtggcgccgg cggcgacggc ggtgccggcg 3943441 gcaccggcgg cgaccgcacc gtcggcggtg gcacggtccc cgccggctcc ggtggacaag 3943501 gcggtaacgc tggcggtggt ggggccggcg ggcagggtgg agccgacggc ggcagcggcg 3943561 gcgacggcgg cgacgccggc acaggtggca atggcggtaa cggcggcaac cgtaattccg 3943621 gcaatggcac cggcggcgct ggcggcaacg gtggtggtgg tgctaacggt ggcgccggcg 3943681 gcgctggggg cagcggcggc ggcaccggcg gcaacggcgg cgctggcggc gacgccggcg 3943741 acgccggcaa cggcggcaac ggcaacggca ccggcaacgg cggcaacggc ggcaacggcg 3943801 gcatcgccgg catgggcggc aacggcggtg ccgggacggg cagcggcaac ggcggcaacg 3943861 gcggcagcgg cggcaacggc ggcaacgccg gcatgggcgg caacagcggc accggcagcg 3943921 gcgacggcgg tgccggcggg aacggcggcg cggcgggcac gggcggcacc ggcggcgacg 3943981 gcggcctcac cggtactggc ggcaccggcg gcagcggtgg caccggcggt gacggcggta 3944041 acggcggcaa cggagcagat aacaccgcaa acatgactgc gcaggcgggc ggtgacggtg 3944101 gcaacggcgg cgacggtggc ttcggcggcg gggccggggc cggcggcggt ggcttgaccg 3944161 ctggcgccaa cggcaccggc gggcaaggcg gcgccggcgg cgatggcggc aacggggcca 3944221 tcggcggcca cggcccactc actgacgacc ccggcggcaa cgggggcacc ggcggcaacg 3944281 gcggcaccgg cggcaccggc ggcgcgggca tcggcagcct tggcggcggc actggcggcg 3944341 atggcggcaa cggcggcaac ggcggtaccg gcggcgaggg cggcgaggtc ggcggcgccg 3944401 gcggcaccgg cggtgcggcc ggcaatggcg gcgatggcgg caccggcggc accggcggcg 3944461 gggacggggg cgccggcggc accggcggca ccggcggcac cggcggcctc ggcgaccccc 3944521 gggtcggcgg atccggcggc gacggcggca ccggcggcag cggcggtgcg gccggcaatg 3944581 gcggcaacgg cggcaacgcc ggcgcgggag gcaatggcaa cggcggcacc ggtggggccg 3944641 gcggtatcgg cggcaccggc ggcaatggcg gcgacgccga gcccggagtg cccccgggag 3944701 ccggtggtgc tggcggcgcc ggcaccaccg gcggcaaggg tggcaccggc ggcaacggca 3944761 gtggcaccgg ctcgggcggc accggcggcg atggcggcac cggcggtggt ggtgggaacg 3944821 gcggcaccgg ctggaatggc ggcaagggag acaccggcag cggcggtggc gccggagacg 3944881 gtggtaaggc accagccggt ggcaccggcg gcgccggcgg cgacggcgga gcgggcggca 3944941 agggcggcag cggcggcgtc tagtcgcgat gggcccagcg gccgcgatgg tgcgccgggc 3945001 gtccgccggc gagtggtcca gccagatttg acgacaaacg gcgacccagc ggtatccccc 3945061 agccgcggcg ccatagccgc gacccgcgca atcaggaacc gctcgtcacg tgtcccgcat 3945121 gcacgtcatc ggctggccgc gcctcggtct gctccttggc ccagcggtag tccggcttac 3945181 cggcgggcga acgcttcacc tcgtcgacaa accacagact gcgcggcact ttgtagcccg 3945241 cgatctcgga gcgcacgaac gagtccaact cggccaacga cggccgacaa cccggccggg 3945301 cctgcaccac ggcggccacc tgctggccgt aacgcggatc gggcaccccg accaccagag 3945361 cgtcgaacac gtcgggatgc cccttcaaag cggcctcgac ctcttcgggg tagaccttct 3945421 cgccgccgct gttgatcgac accgagccac gacccagcat ggtgaccgtg ccgtcctcct 3945481 cgacttgggc gtagtccccc ggaatggcgt agcgcacacc gttaatcgtc cggaacgtct 3945541 cggccgtctt cttctcgtcc ttgtagtagc cgacgggaat gttgcccttc ttggcgagcg 3945601 tgcgcgcggg tcgcactgct tggcgcctgg tgcaccggtc gccgggcggc tcctccccag 3945661 ggcgctccag gttcgttgcg gcattaccag aaagccggca catattagat gagtggcaac 3945721 taaggttctc acttaaagat gccgccatat cggccgtggt tgcaccggcg caaagatggt 3945781 tgggagttcg cccatgtcgt tcgtgttgat cgcaccggaa ttcgtgacag cagccgcggg 3945841 ggatctgacg aatctgggtt cgtcgattag cgcggccaac gcgtcggcag ccagtgcgac 3945901 cacgcaggtg ctggctgcgg gcgccgatga ggtgtctgcc cgtattgcgg cgctgttcgg 3945961 cgggtttggc ctggagtacc aggcgattag tgcgcaggtg gcggcctacc accagcggtt 3946021 tgtgcaggcc ttgagtaccg gcgcgggcgc atatgcctcg gccgaggccg ccgccgctga 3946081 gcagatcgtg ctgggcgtga tcaatgcgcc cacccaggcg ctgctggggc gcccgttgat 3946141 cggtgacggc gccaatgcga cgactcccgg cggggccggc ggggccggcg gtctgctgtt 3946201 cggcaacggc ggggccgggg cagccggggc gcccggccag gccggcgggc ctggcgggcc 3946261 cgccggattg tggggcaacg gcgggcccgg cggggccggc ggcagcggtg ggggcaccgg 3946321 cggtgccggc ggcgccggtg ggtggctgtt cggggttggc ggcgccggcg gtgtcggtgg 3946381 ggccggtggc ggcaccggcg gggcgggtgg gcccggtggt ttgatctggg gcggcggcgg 3946441 ggccggcggt gtcggtgggg ccggtggcgg caccggcggg gccggcggcc gcgccgagct 3946501 gctgttcggc gccggcggtg cgggtggggc gggcaccgac ggcgggcccg gtgctaccgg 3946561 cgggaccggc ggacacggcg gagtcggcgg cgacggcgga tggctggcac ccggcggggc 3946621 cggcggggcc ggcgggcaag gcggggcagg tggtgccggc agcgatggtg gcgcgttggg 3946681 tggtaccggc gggacgggcg gtaccggcgg cgccggtggc gccggcggtc gcggcgcact 3946741 gctgctgggc gctggcggac agggcggcct cggcggcgcc ggcggacaag gcggcaccgg 3946801 cggggccggc ggagatggcg ttctgggggg tgtcggtggc actggtggta agggcggtgt 3946861 cggcggcgtg gctggcctcg gcggggccgg tggtgccgcg ggccagctct tcagcgccag 3946921 cggagcggcc ggtaacgccg gtgtcggcgg ggccggcggc caaggcggtg acggcggagc 3946981 cggcggggcc ggcgccgacg ccgaccagcc cggcgccacc ggcggcaccg ggttcgccgg 3947041 tggagccggc ggagccggcg gggccggcgg tagcagcggt gccggcggca ccaacggctc 3947101 cggcggcgcc ggcggacaag gcggcgccgg gggtgctggc ggggccggcg ccgataaccc 3947161 caccggcatc ggcggcaccg gcggtgacgg cggcaccggc ggagccgccg gagccggcgg 3947221 ggccggcgga gcggccggca ccggaggcac cggcggcatg atcggcacca caggcaacgc 3947281 cggtgtcggc ggggccggcg gccaaggcgg tgacggcgga gccggcgggg ccggcgccga 3947341 cgccgaccag cccggcgcca ccggcggcac cgggttcgcc ggtggagccg gcggggccgg 3947401 cggggccggc ggtagcagcg gtgccggcgg caccaacggc tccggcggcg ccggcggcac 3947461 cggcggacaa ggcggcgccg ggggtgctgg cggggccggc gccgataacc ccaccggcat 3947521 cggcggcacc ggcggtgacg gcggcaccgg cggagcggcc ggagccggcg gggccggcgg 3947581 agcggccggc accggaggca ccggcggcat gatcggcacc acaggcaacg ccggtgtcgg 3947641 cggggccggc ggccaaggcg gtgacggcgg agccggcggg gccggcgccg acgccgacca 3947701 gcccggcgcc accggcggca ccgggttcgc cggtggagcc ggcggggccg gcaaggccgg 3947761 cggtagcagc agtgccggcg gcaccaacag ctccggcagc gccggcggca ccggcagaca 3947821 aagcggcacc gggggtgctg gcggggccgg cgccgataac cccaccggca tcggcggcac 3947881 cggcggtgac ggcggcaccg gcggagcggc cggagccggc ggggccggcg gagcggccgg 3947941 caccggaggc accggcggca tgatcggcac cacaggcaac gccggtgtcg gcggggccgg 3948001 cggtagcagc ggtgccggcg gcaccaacgg ctccggcggc gccggcggca ccgacggaca 3948061 aggcggcgcc gggggtgctg gcggggccgg cgccgataac cccaccggca tcggcggcac 3948121 cggcggtgac ggcggcaccg gcggagcggc cggagccggc ggggccggcg gagcggccgg 3948181 caccggaggc accggcggca tgatcggcac cacaggcaac gccggtgtcg gcggggccgg 3948241 cggccaaggc ggtgacggcg gagccggcgg ggccggcgcc gacgccgacc agcccggcgc 3948301 caccggcggc accgggttcg ccggtggagc cggcggggcc ggcgggtccg gcggtagcag 3948361 ctgtgccggc ggcaccaacg gctccggcgg cgccggcggc acctgcggac aagtcgtcgc 3948421 cgggggtgct ggcatcagct tcagcaacgg cagcaacggc ggcaccggcg gcaccggggg 3948481 cgtgggcggc accgggggcg acggcggcaa cgcaggcacc ggcgccggcg accccggcaa 3948541 aggcggcacc ggcggcaccg gcggcaccgg cggcagcggc ggggccggcg gtagcggcgg 3948601 ggccaacttc aacggcggca ccggcggcac cggcggcacc ggcggcaaag gcggcctaaa 3948661 caccgacgga ctcagcagcg ccaccagcgg caccggcggc accggcggca ccggcggcaa 3948721 aggcggcacc ggcggggccg gcgacgactc cgccggcggg accggcggca caggcggggc 3948781 cggcggcaac gccggcgccg gcggcctagc caacaccggc ggcaccgcag gcaacgcggg 3948841 catcggcggt gacggcggcc aaggcggtaa cggcggccaa ggagacagcg gttccggatt 3948901 gggcggccag cccggctttg ccggcggggc cggcggcaaa ggcggggccg gcggtagcag 3948961 cggtgccggc ggcaccaacg gctccggcgg cgccggcggg gccggcggac aaggcggcgc 3949021 cgggggtgct ggcatcagct tcagcaacgg cagcaacggc ggcaccggcg gcaccggggg 3949081 cgtgggcggc accgggggcg acggcggcaa cgcaggcacc ggcgccggcg accccggcaa 3949141 aggcggcacc ggcggcaccg gcggcaccgg cggcagcggc ggggccggcg gtagcggcgg 3949201 ggccaacttc aacggcggca ccggcggcac cggcggcacc ggcggcaccg gcggcaaagg 3949261 cggcatgggc ggcatcgctg gcgacggcgg gcccggcggt gacggcggca acgccggggt 3949321 cggaggaaaa ggcggcacca acggcaacgg cggcagcggc gggaccggcg gcacaggcgg 3949381 ggccggcggc aacgccggcg ccggcggcct agccaacacc ggcggcaccg caggcaacgc 3949441 gggcatcggc ggtgacggcg gccaaggcgg taacggcggc caaggagaca gcggttccgg 3949501 attgggcggc cagcccggct ttgccggcgg ccccggcggc aaaggcgggg ccggcggcaa 3949561 cgccggcacc ggcggcacca acggctccgg cgccggcggg gccggcggac aaggcggcgc 3949621 cgggggtgct ggcatcagct tcagcaacgg cagcaacggc ggcaccggcg gcaccggggg 3949681 cgtgggcggc accgggggcg acggcggcaa cgcaggcacc ggcgccggcg accccggcaa 3949741 aggcggcacc ggcggcaccg gcggcaccgg cggcagcggc ggggccggcg gtagcggcgg 3949801 ggccaacttc aacggcggca ccggcggcac cggcggcacc ggcggcaccg gcggcaaagg 3949861 cggcatgggc ggcatcgctg gcgacggcgg gcccggcggt gacggcggca acgccggggt 3949921 cggaggaaaa ggcggcacca acggcaacgg cggcagcggc gggaccggcg gcacaggcgg 3949981 gcccggcggc agcggcggcg cgcccaccgg cagcggcacc ggcggcaaag gcggcgccgg 3950041 cggtgacggc ggcgatggcg ccgacggagg ggcagccacc ggcgtcggcg acggcggcga 3950101 cggtggtaac ggtggtaacg gtggtaacgg cggcacgggc gtcggctcgc ccggcggcct 3950161 cggcggggca ggaggcactg gaggcctcgg cggcgccggt gcaggcggcg gagccgacgg 3950221 cgatgatggc gacgacggcc aacccggcaa caacggcagc tgaagcacca cctgccacca 3950281 gacaacgccg tcgatgtggc gctccggcgt gcgcaaggca aatcggtgcg atcctgacca 3950341 gccaggtgat tacctggttc gactcatgcc gagcgaccgt cccagcgccg cagtggatgc 3950401 atacacggta ggttcgacgg acaccctggg ctggctgacc gaatggccgc cgcagctccc 3950461 cgaccgaacc gtcagcggca acatgtcacc tgcatcgtcg ccaagcccag gcgatcgccc 3950521 ggccccgcaa gcggatgtgt tctcctgccc tccgtgggca gcgcgcccga cacccgtaag 3950581 cggatgtccc cgacggactc cggccggcct agccgatggc taccccaggg agtgccgcac 3950641 gatggccgtc gatcaagtgc ggtccggctt cggcgaaccc tccgcagcta tttcgcgacg 3950701 cgcgagaaca cccgtgcctt acttccatcc acatcgatgt cggctcggcc cccgagaggc 3950761 acgacagccg acccatgtcg accttccgtg cggggtgtcc ggagccggtc gcagccgcac 3950821 ccatcaccca ccgctcgtca cgtgtcccgc atgcacgtca tcggctggcc gcgcctcggt 3950881 ctgctccttg gcccagcggt agtccggctt accggcgggc gaacgcttca cctcgtcgac 3950941 aaaccacaga ctgcgcggca ctttgtagcc cgcgatctcg gagcgcacga acgagtccaa 3951001 ctcggccaac gacggccgac aacccggccg ggcctgcacc acggcggcca cctgctggcc 3951061 gtaacgcgga tcgggcaccc cgaccaccag agcgtcgaac acgtcgggat gccccttcaa 3951121 agcggcctcg acctcttcgg ggtagacctt ctcgccgccg ctgttgatcg acaccgagcc 3951181 acgacccagc atggtgaccg tgccgtcctc ctcgacttgg gcgtagtccc ccggaatggc 3951241 gtagcgcaca ccgttaatcg tccggaacgt ctcggccgtc ttcttctcgt ccttgtagta 3951301 gccgacggga atgttgccct tcttggcgat gacgccccgc atccccgagc cgggcttgac 3951361 ttcgttgccg tcgtcatcga gcacgacggt gcgatggtcg atccgcaccc ggggcccgcc 3951421 gccatgcgcc tgcccggcag caacgacgct ggtaccgcca aaacccgtct ccgacgagcc 3951481 aattgagtcc gtgatcaccc gattcggcag cagctcaagg agtttctcct tgatgctcgg 3951541 cgagaacagc gccgcggtgc tggccaacag gaacaacgac gacaggtcgt agtcgttgcc 3951601 cttgaccagc gcgtcgacca gcgggcgggc catcgcatca ccggtgaaga acagcaggtt 3951661 caccttgtgt ttgtggatcg tgcgccacac ctcgtcggcg ttgaattccg gtgccagtac 3951721 cgtggtttgg cccgagaaga gcgccatcca ggtggccgac tgggtggcgc cgtggatcat 3951781 cggcgggatc gggtagcgga tcatcggtgg attcgccgcg gccgccttgg ccaggtcgta 3951841 ttcgtctttg acgaactctc ctgtcgcaaa gtcggttcca ccgaacagca cacgatagat 3951901 gtcctcgtga cgccacatca cacccttggg gaaaccggtg gtgccgccgg tgtagagcag 3951961 atagatggcg tcggcgctgc gttcgccgaa gtcacgctcc ggcgagcccg ccgcgatcgc 3952021 ggaatagaac tcgacgccgc cgtagcgccg atagtcctgg tccgagccgt cctcgacgac 3952081 caagatcgtc cttacatggg gcgtgtcggg gagaacgttg gcgacccggt cggcgtagcg 3952141 gcgttcgtgc accaacgcga ccatgtcgga gttgtcgaac aggtagcgaa gttcgccctc 3952201 cacgtaacgg aagttgacgt tcaccaagat ggcgcccgcc ttcacgatgc ccagcatcgc 3952261 gatcacgatc tcgatgcggt tgcggcagta caggccgacc ttgtcgtcct tttgcacgcc 3952321 ttgatcgatc aggtggtgcg cgaggcggtt ggccttatcc tccagctggg cgtaggtcaa 3952381 ctgctcatcg ccgcagataa cggcgacacg gtcaggcacg gcgtcgatgg cgtgctcggc 3952441 gagatcggca atattcaggg ccacggccac caaactagaa cgtgttacat ttcttgacaa 3952501 gctcacaccc gacgggcaga aagaggtggc ggccgtggca accgtggaat ccggacccga 3952561 cgcgctggtg gagcggcgcg gccacaccct gatcgtgacc atgaaccggc cggccgcccg 3952621 caacgcgctg agcaccgaaa tgatgcgaat catggtgcag gcctgggatc gcgtcgacaa 3952681 cgatcccgac atccgttgct gcatcctcac cggagccggt ggctactttt gcgccggcat 3952741 ggacctcaag gcggcaaccc agaaaccgcc gggcgactct ttcaaggacg gcagctacgg 3952801 cccgtcgcgc atcgatgccc tgctcaaagg gcgccgcttg accaaaccgc tgatcgccgc 3952861 cgtcgagggg cccgcgatcg ccggcggcac cgagatcctg cagggcaccg acatccgggt 3952921 cgccggtgaa agtgcgaagt tcggcatctc cgaggccaag tggagcctgt acccgatggg 3952981 cggctcggcc gtgcggctgg tccggcagat cccctacact ctggcctgcg acctgctgct 3953041 gaccggacgg cacattaccg ccgccgaggc caaggaaatg ggcctgatcg gccacgtggt 3953101 gcccgacggc caggcgctga ccaaggctct agaacttgcc gacgccatct cggctaacgg 3953161 acccctggcc gtgcaggcca tcctgcggtc catccgcgag accgagtgca tgcccgaaaa 3953221 cgaggcgttc aagatcgaca cccagatcgg catcaaggtc ttcctgtccg acgacgccaa 3953281 ggaaggcccg cgcgcgttcg ccgagaagcg cgcacccaac ttccagaacc gctaggcgcc 3953341 gagcgtgaac tgagggcgag atttcggccg attttccgcc ctcagttcac gttggacggc 3953401 ggtgtcggtg cacgacggca cactgcgatc gtgatcgaac cattcctcgg cagcgaagcg 3953461 attgcctccg gcgcgttgac gcggcaccgg ctgcgaagcg catacgccac gatccacccc 3953521 gacgtctatg tctcccccgg cgccgacctg accgcatgga gtcgcgctca ggccgcctgg 3953581 ctatggtcgc ggcggcgcgg cgtcatcgcc gggcagtcgg cggcggcgat gcacggcgcc 3953641 aaatgggtcg acgcgcgaca ggcggccgag ctgctctacg accaccgtcg cccgccggcc 3953701 ggcatccaca cctggtcgga ccgtgtcgcc gacgacgaga tccagccaat ctccggcatg 3953761 aatacgacca caccggcgcg caccgccctc gacctcgccc gccgctatcc ggtcggcaag 3953821 gccgtcgcgg ccatcgatgc gctcgcccgc gcgacggacc tcaagctggc cgatgtcgag 3953881 atgctcgccg aacgctaccg gggaagccgc ggcatccgaa atgctcgtat cgcattggat 3953941 ctggtggatc caggtgccga gtcacctcgc gagacgtggc tgcgtctgct actcatccga 3954001 gcgggctttc caagaccaca gacccagatc ccggtttacg acgagtacgg ccagctggtc 3954061 gcggttatcg atatgggttg ggcaggaatc aaggtcggcg tggattacga gggcgaccat 3954121 caccggaccg accgcagaac gttcaacaag gacatcaagc gtgccgaagc gttgaccgag 3954181 cttgggtgga ccgacgtacg cgtgacggtc gaggacaccg agggtggcat catctggcgg 3954241 gtgtcagcgg cctggcagcg ccgaacgtga actcacggcg gagattcggc cgatattccg 3954301 ccctcagttc acgttcggcg tggctcagcc cagcggcggg ctcggcgtga acaccaccgg 3954361 catggattcc aggccgctga caaagttcgc cggccgcagc ggcaacacgg agtcatcggc 3954421 gaccaaccgc aggtcgggta gccgccgcaa cacccgttcc gtcatcaacg acagctccaa 3954481 ccgggccagc tgattgccca ggcagaaatg cgtgccgaag ccaaacgcca agtggctgtt 3954541 tggatttcgc tgaacatcaa acttttccgg ttcacagaaa accgcctcgt cgaagttcgc 3954601 cgactcgaag agcagcatca tcttctcgcc ggcacacaac gccgtgccgt gaaactcggt 3954661 atccgcggtc aacacccggc acatgttctt taccggggcg gtccaacgta gcatctcctc 3954721 gatggccccg ggcagcaacg acgggtcgcg ctgcagcagg tcccactggt cacggttgcg 3954781 cagcagctgc tcggtaccac cgctcaaggt atgccgcgtg gtctcgtcgc cgccgatcag 3954841 gatcagcagc gtctccatga ccagctcgtc gtcgcttagc cgctcgccgt caacttcgga 3954901 actcaccagc acgctgacca ggtcgtcggt ggggtccgct cgccgtgccg caatggtggc 3954961 ccgggtgaag tcgttgtagg ccgcgaaggc gtccatggtg atctggaaat cctcttgaga 3955021 cacatgcgaa ctgaggaatg tcaccagatc gtcggaccac cgcaagaaca tgtcccgctg 3955081 ctctggacgc accccgagca tgtcgccgat caccgccatc ggtagcggcg cggccaggtc 3955141 ccgcacgaag tcacactcgc cgcgttcgca cacggcgtcg atcagggtgt cacacagcgc 3955201 ggcaatcgac gcctccttgt ccttcacccg cttgcgggtg aagccggcgt taaccagctt 3955261 gcgccgcaac agatgtgcgg gatcgtccat gtcgatcatc atcggcaggg cgggctggtc 3955321 ggggcggatg ccgccggcgt tggagaacag ctcgggttga cgttcggcgt cgatcaccgc 3955381 ctggtacgtc gacgcggccg ccaggccgtt gcgatcgcgg aacaccggtt ggttggcccg 3955441 catccaccgg tacgcggccc gcgcctcgcg gctggcgtag aagttgccgt cggccagatc 3955501 cacgtccgga gcttcagtca tcgcgatcct ccgcactaca gtgggcgata tgcccgtctc 3955561 gcaacacacc atcgccggca cggtgctcac catgccggtg cgcattcgca ccgccaacct 3955621 gcattccgcg atgttctcgg tgcccgccga cccagcgcag cgcctcatcg actacagcgg 3955681 gctgcgggtg tgcgaatacc tgcccggtaa ggcaatcgtg atgcagatgc tggtgcgcta 3955741 cgtcgacggg gatttggggc gataccacga gtacggcacc gcgatcatgg tgaacccgcc 3955801 cggcacccaa cgccgcgggc ccagagccct cacccgagcc gccgcgttca tccatcatct 3955861 gccggtagat caggtgttca cgcttgaggc cgggcgcacc atctggggct tcccgaagat 3955921 catggcggac ttcaacgtca ccgacggccg gaggttcggc ttcgacgtca gcgccgacgg 3955981 acggttgatc gccgggatcg agttcagcac cggcctgccg gtgccgaccc tcgggtggca 3956041 aatgttgaag acctactccc accatgacgg cgtaactcgc gagattccct gggaaatgaa 3956101 agtctcgggc ctgcgcgccc ggctcggcgg cgcccgactg cggttgggag accatcccta 3956161 cgccaaagaa ctggcatcgc tgggcctgcc gaagcgggct ctgttgtccc agtcggcggc 3956221 caacgtagaa atgaccttcg gcgacggtca cccgatctga accgcaagaa agcgaagcca 3956281 tcagcccaat ctagaacgcg ttctagcccg ctggcaagga tcgatcagac cagggcggca 3956341 aggtcgcgga cctgctctgc gctgccggcg gtcaccacca tcatggtgac cccggcggcc 3956401 tcccagacgg ccatctgctt acgcacgtgg tcgatgtcac cgacgatcac ggcgtcgtcg 3956461 acgagctcgt ccgggatgat ctcggcggcc tcgtccttgc ggccagaccg aaataacttg 3956521 gtgacctcat cgaccacttg cgtgtacccc atccggcgat agacgtcggc gtggaagttg 3956581 gtctcttcgg cgcccatccc gcccatgtag agcgccagga acggcttgat tccggcaaac 3956641 gcggccgccc gatcgtcggt gatgaccacc tgcgccgtcg cgcagatctc gaagtcctcg 3956701 cggctacgcc gggcgccggg ccgggcgaat ccttcgtcga gccattcgtt gtacatgccg 3956761 gccatgcgtg gcgaatagaa gatgggcagc cagccatcgc agatctcggc ggccagcgcg 3956821 acgttcttgg gcccctcggc ccccagcatg attggtatgt cggcgcgcag cggatgggtg 3956881 atgggtttga gcgctttgcc cagacctgtc gtgccctccc ccgtcagtgg cagccggtag 3956941 tgcggcccgg cgctggtcac cggcgattct cgggcccaca cctggcgcac gatgtcgatg 3957001 tattcgcggg tgcgagccag cggcttggga aaccgctgcc cgtaccaacc ctcgaccacc 3957061 tgcggaccgg acacgccgag cccgagaatg tgccggccac cggacagatg gtccagtgtc 3957121 agcgcggcca tcgcacaggc cgttggtgtg cgcgcggaca gctggatcac cgacgtaccc 3957181 agccgcaccc gttgcgtcga cgagccccac caggccagcg gcgtgtaggc gtcggacccc 3957241 cacgcctcgg cggtgaacac cgtgtcaaaa cccgcatcct cggccgcggc gacgagttcc 3957301 gcatggttct gcggcggctg cgcgccccaa taccccagct gtagtccgag cttcatccct 3957361 gcctccacga cgcccttcag gagggcaatg ttgaaaccgt tgttagaacc tgttctactc 3957421 gacaggcgtg acagccagct cgagcggccc ggcgctgatc gatcactctg agccgcccct 3957481 ttccgcgccc ctcacgttgt ccttcgacta cacccgttcg gtggggccca cgttaagcag 3957541 gtttttcacc gccttgcgtg cacgccgcat tgtcggggtg cgcggatccg acggccgagt 3957601 ccatgtgccg ccggtggaat atgacccggt tacctacgaa cccctgagcg aaatggtacc 3957661 ggtgtccagc gtcggcaccg tcgcgtcctg gacctggcaa cccgagccgc tagccggcca 3957721 gcccctggac cggccgttcg cctgggcgct gatcaagctc gacggcgccg acaccttgct 3957781 gatgcacgcc gttgatgtgg gaaccgccgg cccttccgcc atccacaccg gcgcccgggt 3957841 gcacgcgcat tgggccgacc aaccggtggg cgccatcacc gatatcgcct gctttgcgct 3957901 cggcgagacc gcagaaccgg tggcggctca caagaccgag gatgcgcggg acccggtcac 3957961 catgatcgtc acgccgatcc agctggaaat tcagcacacc gcctcgcacg aggagagtgc 3958021 gtatctgcgc gccatcgccc agggcaagct cgtgggcgcc agaaccggaa agaccggcaa 3958081 ggtatacttc ccgccgcatg gcgccgaccc ggccaccggg aaacccacct ccgagtttgt 3958141 cgagctgccc gacaagggca cggtgacgac gttcgcgatc gtcaacatcc cgttcctggg 3958201 ccagcgaatc aagccgccct atgtggcggc ctacgtgttg ctcgacggcg ccgacatccc 3958261 gtttttgcat ttggtttccg acgtcgacgc gcaccaggtg cggatgggca tgcgcgtcga 3958321 ggcggtgtgg aagccgcggg agcggtgggg actgggcatc gacaacatcg agtacttccg 3958381 ccccaccggc gaaccggatg ccaactacga cacctacaag caccacctgt aaagggccca 3958441 ccaaccaatg agcgttcgcg atattgccgt tgtcggcttc gcccacgccc cgcacgtgcg 3958501 ccgcaccgac ggcactacca acggcgtcga gatgctgatg ccgtgcttcg cccagctata 3958561 cgacgagctg ggcatcacca aggccgacat cggattctgg tgttcgggtt cgtcggatta 3958621 cctggctgga cgagcatttt cgttcatctc cgcgatcgac tccatcggag ccgtaccgcc 3958681 gatcaacgaa tcgcacgtcg agatggacgc cgcctgggca ctgtatgagg cctacatcaa 3958741 actgctgacc ggcgaggtcg acaccgcgct ggtgtacggc ttcgggaagt cctcggccgg 3958801 aacgctgcgc cgtgtgctgt cccgccagac cgacccgtac accgtcgcgc cgctgtggcc 3958861 ggattcggta tcgatggcgg gactacaggc gcggttgggg ctggactccg gcaagtggac 3958921 ccacgagcag atggcgcgag tggcgttcga ttccttcacc aacgctcgcc gggtggattc 3958981 cgtggagccg ccgatcaccg tcggggaact gctggcacgg ccgttttttg ccgatccgct 3959041 gcggcgccac gacattgcgc cgattaccga cggtgccgcc gcggtcgtgc tcgcggccga 3959101 caaccgcgcc cgagaactgc gcgaaaatcc ggcgtggatc accggaatcg aacatcgcat 3959161 cgagtctccg gcgctggggg cgcgcgacat caccgagtct ccgtcgacca aactggcggc 3959221 caagatagcc accggcggac acaccggcga catcgacgtg gcggagatcc atgggccctt 3959281 tacccaccag cacctgatcg tcgcggaggc catcaggatt ccgggtaaga cgaaagtgaa 3959341 tccgtccggc ggcccgttgg ccgccaaccc catgttcgcc gccggccttg agcgtatcgg 3959401 ctttgccgca caacatacct gggacggatc ggcgcggcgc gtgctggcgc acgccaccag 3959461 cggaccggcg ctgcagcaaa acctggtcgc ggtcatggaa ggacggggat agtggagggg 3959521 cagcgctgat ggccggaaag ctggccgccg tactcggcac cgggcagacc aagtatgtcg 3959581 ccaagcgcca agacgtttcg atgaacggtc tggtgcggga ggccatcgac cgagcgctgg 3959641 cggattccgg ttccaccttc gacgacatcg acgccgtcgt ggtcggcaag gcgcccgact 3959701 tcttcgaagg ggtgatgatg ccggagctat tcatggccga cgccatgggc gcgaccggca 3959761 agccgctgat ccgggtacac accgccggtt cggttggcgg atccaccggg gtagtggctg 3959821 ccagcctggt gcaatccggc aaataccgcc gggtcctggc attagcctgg gaaaagcagt 3959881 cggaatccaa tgccatgtgg gcgttgtcga ttcctgtgcc gttcaccaaa ccggtcggtg 3959941 ccggtgcggg gggatacttc gccccgcatg tccgggccta tatccgccgc tcgggcgcac 3960001 cggcacacat cggtgctatg gttgcggtca aggaccggct caacggcagc cgcaacccgt 3960061 tggcacatct gcagcagccc gacatcaccc tggagaaggt gatggcatct cagatgctct 3960121 gggatccaat acgtttcgat gagacgtgcc cgtcgtcaga cggtgcgtgc gcggttgtcg 3960181 tcggcgacga ggagatcgcc gacgcgcgac tggcgcaagg gcatccggtg gcctggattc 3960241 atggcaccgc attacgcacc gagccgctgg ctttcgccgg gcgcgaccag gtcaacccgc 3960301 aggccggccg cgacgcggcg gcggcgctgt ggaaggccgc gggcatcacc agccccatcg 3960361 acgaaatcga cgccgccgaa atttacgtcc cgttctcctg gttcgagccg atgtggttgg 3960421 agaatctggg atttgcccgc gagggcgagg gctggaagct caccgaggcc ggcgagactg 3960481 cgatcggcgg tcgactaccg gtgaaccctt ccggcggcgt gctgtccgcc aatccgatcg 3960541 gcgcatcggg cctgatccgc ttcgccgagg ccgcgatcca agtcatgggc aaggcggagg 3960601 cgcgtcaagt tccgggtgcg cgaaaggcct tggggcacgc ttacggtggc ggctcgcagt 3960661 acttctctat gtgggtggtc ggctgcgaga aacccaaaca ggcagccgca taatcgcccg 3960721 gcgcgatccg ggcgacgccg cagaccatcc gagcatggtg aagttcacac ccgatagcca 3960781 gacgtcagtt ctgcgcgcgg gcaagtgctc aggtactctt tctccgtcgc ggtcgcgatt 3960841 gcaaaggggg agctggccgg tggattccga acgccgacgc tacgggtggc cgcggaatcg 3960901 acgcacctta gccattactg gagctgcagt cgttgtcgtg gtgaccctcg cagccattgg 3960961 ttacctgatc tttgagccaa aaatttctgg gtcgtccacg tccaggcagg ccgcatcgcc 3961021 aaccactcct tccccgccca gccaggtcgt ggtgccgatc gacctttgga atcccgacgg 3961081 ggtgacggtg gacctggcgg acgccgttta cgtggccgac tccggtcaca agcgactgct 3961141 gaaactgccg gccggctcca acaccccgac cacgttgcca ttcaccgaca ccatcggtcc 3961201 aggcggcgtg gcggtaaaca gcaaccgcga cgtctatgtc atcgatgaag acagccacca 3961261 tgtgttgaaa ctcgcggccg gcatcgaacc cccggtcgag ctcccgttcg gcagccttgg 3961321 cgatgcgcat ggtttggcag tggaccgcag cgacagcgtc tatgtcgtcg actatgacaa 3961381 tgccaaagtg ttgaaactgc ccccaggcgc agatacccct accgaactgc cgttcgtcgg 3961441 gctcgaccac ccctatgatg tggcggtgga cggtgctggc accgtctacg tgaccgacag 3961501 cggccacaat cgcgtggtgg cgttgaccgc ggggtcggcc acgccggtgc acctcccatt 3961561 cgccgatctc agctttcccg ccggtgtgac ggtggaccgc gacgatagcg tctatgtggc 3961621 cgatctgaac aacaatcggg tgctgaagct ggcggccggc tcgaatgcgc agtcgcagct 3961681 gccgttcacc ggactcttct ccccaactga tgtggcggtg gacaacgacg gcgccgtcta 3961741 cgtgatcgac ttttacaacc ggatgttaaa actgccgacg gcttaacccg cagcgacgcc 3961801 tacatgggtt ccagtccggc cagatgccgt gcagccaggt cacgataggc ctgcggattg 3961861 acgttaaccc acatttccgc gcccgttcct tcaatcggtc ccttcacctt ggccggcgct 3961921 cccgttacca gcattccggc cggaatctga gtgccggcca ccaccagcgc tcccgcggcg 3961981 atcatgcagc gcgcgccgat taccgctccg tcgaggaccg tcgcgtggtt ggcgatcaga 3962041 gcctcagacc cgacgtggac gccgtggatc acacacaggt gcgccactgt cgcccccggg 3962101 ccgatgtcta ccgggatgcc gggcggtgcg tgtaataccg ccccgtcctg cacattggcc 3962161 ccctcgcgca cgacgacggg cgcatagtcg ccgcgcagca cggcattgaa ccagaccgac 3962221 gccccagcct cgatggtgac gtcgccgatc agggtggctg tcggggccac aaacgcggtg 3962281 ggatcgatcc ggggcgatcg gccctcgaaa gaaaacagcg gcatcgttta gatatacgcc 3962341 cgtcgtacat atgccgtggc cagactcgct gtcgttgcgc tcaccggaga gaaaactgta 3962401 acgtgttcta gttagcgata ccgatcggga ggtgacaggt gagtaccgac acgagtgggg 3962461 tcggtgttcg ggagatcgat gccggcgcct tgccgaccag gtatgcgcgt ggctggcatt 3962521 gcctgggcgt cgcgaaggac tatttggaag ggaagccaca cggggtagag gcgttcggca 3962581 ccaagctggt tgtgttcgct gattcccacg gggacctgaa agtcctcgac ggctactgcc 3962641 ggcacatggg cggcgacctg tccgagggca ccgtcaaagg cgacgaggtc gcttgcccgt 3962701 tccacgactg gcgctggggt ggcgacggcc gctgcaaatt ggtgccgtat gccaggcgca 3962761 cacccagaat ggcgcgcact cggtcgtgga cgaccgatgt gcgcagcggg ctgctgtttg 3962821 tctggcacga ccatgagggc aatccacccg accccgcggt ccggatcccc gagattcccg 3962881 aggcggccag cgacgagtgg accgactggc ggtggaaccg catcctcatc gaagggtcca 3962941 actgccgcga catcatcgac aacgtcaccg atatggcgca cttcttctac atccacttcg 3963001 gtttgccgac gtacttcaag aacgtcttcg agggccacat cgcctcgcag tatctgcaca 3963061 acgtgggccg gcccgatgtc gacgatctgg ggacgtctta cggtgaggcg catctggatt 3963121 cggaggcgtc ttacttcggg ccgtcgttca tgatcaactg gctgcacaac cgctacggca 3963181 actacaagtc cgagtcgatc ctgatcaact gccactaccc ggtgacccag aactcgttcg 3963241 tcctgcaatg gggcgtcatc gtcgaaaagc ccaagggtat gagcgaagag atgaccgaca 3963301 agttgtcgcg ggtgttcacc gagggcgtca gcaagggctt cttgcaggat gtcgagatct 3963361 ggaagcacaa gacccgcatc gacaacccgc tgctggttga agaggacggc gccgtgtatc 3963421 agctgcgccg ctggtatgaa cagttttatg tcgacgtagc cgacataaaa ccagagatgg 3963481 tggagcgctt cgagatcgag gtcgacacca agcgcgccaa cgagttctgg aatgccgagg 3963541 tagagaagaa cttgaaatcg agagaagttt ccgacgacgt gcccgccgag caacactgac 3963601 ggacatgcct gacgatcagc cggcggttcc cgacgtcgat cggctggccc ggtcgatgct 3963661 actgctgcac ggtgatcatc acgatcacaa cgattccccc gagcaacacc gcacatgtgg 3963721 atcctggtcg aagtcaaggg atttcgctga cgacccgcag cgtgctgccg cggtgcgcga 3963781 agccagccgc gccgagcgcg accgttatct gacctcaggc ctgcaaccgg tggattgccg 3963841 gttctgccat gtcacggtga ccgtaaagag gctggggccg ggtcataccg ctgtgcaatg 3963901 gaacaccgag gcgtcgcggc gctgcgcgta cttcaccgag ctgcgggcac gcggcgggga 3963961 ttccgcacgc accaggtcct gtccccggct gaccgacagc atcgaacacg cagtggccga 3964021 gggctacttg gagcaccacg acccaaaccg ataacgtcgc acacccgctt gccgcgggat 3964081 acggtgccgc atccggcacg gtgccaccga ggcgtacggt ttgtgacggc ggttccggga 3964141 ctgagcttcc tatgaagcct ctccggtgtg cgcgagtcga tcgaggcgca ccagagcatc 3964201 gtgttcgccg ccctggcggt cagaactgga ttgaacacca aacaggttgg agcatcaaga 3964261 aattcgttca aacactacgt cgctaccgca ccgtgacccc gcgccggcaa ccacaccctc 3964321 cgtgcgggac cttcagggtc cgatgcaaga gcaccggtct gttggatggg gatgcgactc 3964381 gtaggcgatg ccctgtccat tcactcgaga tccattcact cgaggtcgac ttcgcccggc 3964441 ccgccccgca tctcacaaaa cgagggttta ctgtgacctt attgtcgcgc aaaaagaaag 3964501 gcccgattct gaatattggg cagccagcca aatccgcggc aatcctcctt gtagagcagc 3964561 ttgaaaccga gttcagaagc ttttgattcg agatcagcgt cggtgatacc ccattgccag 3964621 atgtctggta tatcgcgcca cggcttgtca tgatccgggt gctttttatc gagtttttga 3964681 aagagatctc tataagcctt gttaagtttg ctgtgcggga cgttacgaaa gtagtgcttt 3964741 tcacccagat ccaacaatct aactgtggtt gttgaaccta tccattgttg attataaatc 3964801 agcaaacaac gtacattctt tgcgtacata tccaggatag tgtcccaatc aggcgacact 3964861 tggtgcagca gcacatcgaa taggaagagg gcgtcgacat taccgacttt atcggcaatc 3964921 tcctggtctc cgaagttccc ctcaataacg cgaagttgcg gatatgaatt tgcacgggcc 3964981 gcgactgttg gagttatgcg gccatcgacc aatactgcct cttttaccgg gtacttatcc 3965041 agggcgcgaa atgtataggc gccttccact ccccaaacgg caccgagatc cgcgaacgac 3965101 tctatgcgac atgatgtgaa agcccgatct atcaggttga ttttgcctct aacgagccaa 3965161 tagccaccct gtcgtaaccg atccaacatc atttcacctc aaatacgtgt gttcaactgg 3965221 cgtctcgctg gatggtagat caccgctagc tggtccagta tgccgcctgc catgagcttg 3965281 ggaccagtgt gatctgcttg tgccaggggg aggacgggac caccttgatt gccagtcacg 3965341 ggaccacggc gcgccgcgtc ggtggtcttt tcgcttattc gtgcgatcgt cgtgacagct 3965401 caagtcacgg gaggcggcgg gcatggcttt tcgggaggtc agtatgaaca agatcaggga 3965461 agtgctgcgg gtctggctgg gggtggccgg gttgccggcc ccggggtgcc gcacgatcgc 3965521 cgcgcattgc ggtatggacc gcaagacggt gcggcgctac gtggaggccc gcgcaggcac 3965581 ccggtctgcc ccgcgacgac gatgtcagcg ctatcgatga cgggttgatc ggggcggtcg 3965641 ccgacgcggt gcgtccggcc cggccggatg gtcatggtgc ggcgtgggag caactgctgg 3965701 gggtagcgaa ctgttcacta ccccctggca atcaggtggt cccgtctccc tggcaagcga 3965761 cagctcgggc aatccacctg gtaatcaaac aatttcgggc ggctggcgag ttgtcgcgct 3965821 gaggcgggca aattgcgtat ctgctcgacc aatgcgacgc ggctggccaa acagcacacc 3965881 gaatcacagc ccggcgaacc gctctttgac catttcgacc gtgagcccgt agtcagccaa 3965941 cgaataggaa tgctttgggg cccgggcacc gctctggctc tcggcgtgga cggttgtcat 3966001 tgcctgtcga gcctcgtcgg acagcgtcaa cccgaagtgc cggtagatat ctgccaccgt 3966061 acccagcgga tcggcaatca agtcgtggta gtccacgtcg tagaactggg ccgaatcata 3966121 tttggcccgt gcggcattga accgctccag cccacgcgac caggtgtcca tcgcgtccgc 3966181 accgatctgg gcgcccacaa acttcgtcga ccacccttct gtggtgtgct gcgccagcga 3966241 gcacatcgac gccatgatcg tctccaccgg ccggtgagtc tgcaccacca gggcatcggg 3966301 ataggtcgcc atcagcgcat ccagggcaaa tagatgactc ggattcttta gtacccaccg 3966361 cttttcggca tcgttgagcc caatcagctg caggttgcgg cggtgccggc aatacgacgg 3966421 cgtccagtcc tggcgtgaca accagtcggc atagctgggt acatgcgcca gcgcctcgta 3966481 cgacaccgaa tgcagcgact gccgcaacag ctgccaacac tcctccaact cgtaggccgc 3966541 catgaaatgc aagccggtgt atcccggatt ctcggcatga tgctgggtga actgtgcatc 3966601 gagctggcga tacaacgggt ttgactccca ggtctcgcgc ggggggcgcg gctgcgggta 3966661 ctcggccagc cacatgtgca ggccttggtg ggccgggtcg gcgcccagca gccggtgcag 3966721 cgcagtggtt ccggtgcgca ccaacccggt gacgaagata ggccgtttga tggcaacgtc 3966781 gacgtgctcc ggatactgct tccacgcgga ctgggacagt agcctggcca ccagcgcacc 3966841 gcgcaggaag aaccggttca tcttgctgcc caacacggtg aggccggctt cgccctggta 3966901 agcgtccagc aacacaccca gcgcctcacg gtagttgtcg tcgtcggtgc caaaatcgtc 3966961 gagacccacc agtttggtag ccgatgcgtg cagttcgtcg acggtggcca catctttccg 3967021 atcgggacgc cgagtcatta cgtgtggtac tccccgcaat tgacgtccag ggtctgcccg 3967081 gtgatgccgc tggccaggtc gctggccagg aaaagaatcg ctgaggccac ctcgtcttcg 3967141 gttggcagcc gtttgagatc ggagtttgcc gcggtcgcct gatagatctg atccacggta 3967201 gtgccgtatt tgccggcctg atggtcgaaa tagcttttca gcgtgtcacc ccagatatag 3967261 ccgggtgcaa cggaattgac gcgaattccc tgctcgccca gttccgtggc cagcgaatgc 3967321 gacatagcta gcagtacgga cttggccatc ttgtaggtgc cgtatttcgg ctgcgagtgc 3967381 cggatcacca tggagttgac gttgacgatc gcgccgtgag actgcgccag cgcgggcgta 3967441 aacgcctgga tgagtcgtag cgtccccagc gcgctgagct ctatcgcgtc acggatgtgc 3967501 tcaaatgtgg tgccggccaa tggtttcatc gatggcaccc ggaacgcgtt gttgatcagc 3967561 acgtcggcct tgccgtacgc cgccagcgtg gcctgcacaa ggttgcttac gtcgtcgtcg 3967621 tcggtgatgt cggtgcgcac cgccaccgcc cgtcgcccgg tgtcgatgat ctgcttggcg 3967681 acgtcgtcga gacgctccgc gctgcgcgca gccagcacca gatcggcgcc gtctcgcgca 3967741 catcggtgcg ccagcgtcgt gcccagcccc ggtccgacgc cactgacgac gatcaccttg 3967801 cgcttgagca tcccggtcat cccagcatcc tggtcgcgat ttgccgttga cgcaacgcaa 3967861 ttcgcgcccg ccaatcatcc tcggaaatct tgttgtgctg gtaatgcggc agtgccgccg 3967921 ggatggcgtc gaagtcgacc agttcgacgg tgggcccatc ggcctcggtg agctcacggg 3967981 atacccgctg ccagcggaac tgcagaaacc cgcgccgatg gccgagcgtt tccacccagt 3968041 tggtcacacc cggattctgc tcggcgacca cgatgcgcac cttgccatcc gggtccgctt 3968101 gggcctggct ggcattcaac gaggtctgat gattgatata gtccagcgag atgtaccaca 3968161 tgctgcccaa ctgaaaccct aagtaggggg cgtcgctcac cggcaccgtg atcaccagcg 3968221 cttgacccgg ccgcagctcg aaatgaccgg ccgacgagta ttgggtggcc aggccaccgg 3968281 gagtcaaccg aggcgccacc atggtgttaa ccgggatatt gaggtagaac cactggggaa 3968341 actgtaacca ggttttcacc cggttcacaa gctgggatcc cgctgtggca taacgctttt 3968401 ccatgagctc gcgagtcagc ggcggcggcg cggtgccgac ggtgtccagc ctggcgatgg 3968461 ccagcgtgcc gcgctgttgt gaccaatcgc cgtacacctc ccggatcact agttgcccgg 3968521 gagcgctggg ccgcaaccgc cattcgaagc tgccgtccgc ggcgatgtcg agctcacggt 3968581 cgtcgaacgc ggcctggctg gccggcacgt tatagtcggt gtactcgccg ccgagcagct 3968641 gaaagctcag gtcggtggtg gtgccgcgcc gtccgctgac cacatagtcg cggttggcct 3968701 gcagccgggt gccgaagtag agggtgtcgg ggttgtccag gcccatcttc gtgaacggcc 3968761 ctgttccgga ctgcaggaac gggtggtcac gctcgtagtc gaaggccagg tgcatgcagc 3968821 ccgcgatgca gccggccagg tattgcagcc cttcgagcag gtcggcttca gtctcgatgt 3968881 gcggggcggc ggctaccagc tgctcggctt ctgcgatcgc ctcgcgcagc gggtcggagt 3968941 acacgacttc gacactagaa cgtgttcctg ttttgcgtca atggcgaaca tctgcccccg 3969001 tcatttacgg caattgaaga caaagcccgc tcgcttccag agccctgcgc acgagctacc 3969061 cattattgat ctagcttatt gttgcgttat acgacagtct gagcagtata ttgtccgcta 3969121 tatgtgtatt cgtagcggcg tggattgacg cgaggctggt cgagacccgg tccgtgagat 3969181 gcgccggaaa gggtgtcgcg atgcggaact ctactgaccg tccagcggcg gctaacgaag 3969241 tctgcatccg cgacagccac ccaatgacgc gcctgccgtt gcgatctcag cactgacggc 3969301 agcccggcct atccgcgacc agctagggaa aggcattcgc agatgttcat ggatttcgcg 3969361 atgcttccgc cggaagtcaa ctcgacacgg atgtatagcg ggccgggagc gggctcgttg 3969421 tgggccgccg ccgccgcctg ggatcaggtg tcggcggaat tgcagtcggc ggcggagacc 3969481 taccgctcgg tgatcgccag cctcaccggc tggcaatggc tgggtccatc gtctgtgagg 3969541 atgggtgcgg cggtcacccc gtatgttgag tggctgacca ccaccgccgc gcaggcaagg 3969601 cagacggcca cccagatcac cgcggccgcg accggatttg agcaggcgtt cgccatgacg 3969661 gtgccgccac cggcaatcat ggccaaccgt gcacaggtgc tatcgctgat agcgaccaac 3969721 tttttcggcc agaacaccgc ggcgattgcg gccctggaga cccagtacgc cgagatgtgg 3969781 gaacaggacg ccaccgccat gtacgactac gcggccacct cggcggcagc gcggactttg 3969841 acaccattta cctccccgca gcaagacacc aactcagccg gtctgccggc gcaaagcgcc 3969901 gaagtcagcc gcgcgaccgc caacgccggc gccgccgacg gcaactggct gggaaacctc 3969961 ctggaagaaa tcggaatact gctgctgccg atcgcgcccg agctgacacc ctttttcctg 3970021 gaggcgggcg aaatcgtcaa tgcgatacct ttcccgagca tcgtcgggga cgagttctgt 3970081 ttgctcgacg gcctactggc ttggtacgca acgatcggct cgatcaacaa catcaattcg 3970141 atgggtaccg gcatcattgg ggccgagaag aatttgggga tcttgcccga gctagggagc 3970201 gcggctgcgg cggccgctcc cccaccagcc gacatcgccc cggcgttcct cgcgccgctg 3970261 accagcatgg ccaagtcact atcggacgga gcactacgcg gcccgggcga agtttcggcc 3970321 gcgatgcgcg gcgcgggtac catcgggcaa atgtcggtgc cgcccgcctg gaaggcgccc 3970381 gcggtcacca ccgtcagggc gttcgatgcc accccaatga ccacactgcc cggcggcgac 3970441 gcccccgccg ctggagtgcc tggactgccc gggatgccag cctcgggggc cggacgggct 3970501 ggcgtggtgc cccgatacgg cgtacggctg accgtgatga cacgtccact ctcgggcggg 3970561 tgacatcagt gcgtgatggc ggcgcacctt gaccgtcgcg cattgcgctt ccaacaccaa 3970621 cgaactggga ctgcagtagt agcgcaaccg cgcttggagc gggtccccac cggttatggc 3970681 attcgatacc gcaccaaagc gaaatcagtt cccgaacccc gaccgctggt tctcgctgtt 3970741 gaagccgccc gagttgtgga cgcccgagtt gaagaatccg gagtttaaga cgcccgagtt 3970801 tgcgataccc accgtttggg cgccggtgtt tccgaaaccc gccgcgatga cggtgccgac 3970861 cgcgttttgc aggcccgagt tgttgctgcc cccgttctgg aaccccgagc tgccaacgcc 3970921 cgtgttgaag aagcccgagt tgttcgtgcc ggcgttgccg aaacccgagt tcgggccgga 3970981 gccggtgccg acggcgccga acccggtgtt gagcgaaccc gcgttgaagg cgccggtgtt 3971041 agtgaggccc gagtttgacc agccggtatt tctgacgccg gcgttgaaat caccggtatt 3971101 gccttgtccg gaattgaagt tgcccgtgtt gatgctgccc gcgttgaagc tgcccgcatt 3971161 caactggccc gagttcccga agccgaggtt tattgcgccg ccgtttccga aaccggtatt 3971221 cacgcccccg gcgtttccca tgcccgtgtt gagcgaaccc gagttcccga aaccggtgtt 3971281 atggcccgcc acgaacgggc ccacggtggc ccccgagttt ccgatgccca cattcgcgtc 3971341 gctggagttg ccgataccca ggttgccgtt gccggagttg aagaaaccga cgttgttggt 3971401 gcccgagttg aacaagccga tgtttccgct gcccgagttc agcccaccga tgcccacctg 3971461 gttgttgccg gtgagcccga agccgatgtt gccgttgccg gtgttgccga aaccgatgtt 3971521 gccaatgccg ttgttgccga agccccagtt gctgctgccg gtgttcccgg cgccgatgtt 3971581 gccgctgccg gcgtttccgg tgccgacgtt gttgttgccg gtattcccga acccgacgtt 3971641 ggagttgccg gtattgccgc tgcccaggtt gctgttgccg aggttcccgt tgccgacgtt 3971701 cccactgcct ggaagtcccg ccctcccgtt gccgctgccg aagttgccgt tgccgacgtt 3971761 cccgccgccc acgttggagt tgccgatgtt gccgctgccc aggttcgcgt caccccggtt 3971821 cccgctgccc aggttggtgt taccgatgtt gccgttgccg gtgttcaggt cgccggtgtt 3971881 gccgccgccc aggttggcgt tgccgatatt gccgataccc aggttcggca gctgctgcag 3971941 cgcctgctga aagggcacca actgctcggc cgccgccgag gccccggagt ggtagcccac 3972001 catcgcggcc acatccgcgg cccacatctg ttcgtaggcg ccctcaacgg ccgcaatcag 3972061 cggcgcgttc agcccgaacc aattcgacat caccaactgc gcaaacgcat tgcggttggc 3972121 cgccaccagc aaagggtgca ccgtcgccgc ccgtgccgcc tcgaacgcgc tggccaccgc 3972181 cttggcctgg gccgccgccc ccgcggcccg ggccgccgca gcgctcaacc atcccgcata 3972241 cggtgccgcc gccgccgcca tcgccgccgc cgccggcccc tgccacgcct gacttgccaa 3972301 gtccgaggtc accgacccaa acgaggacgc cgcggacccc aactctgcgg ccagcccgtc 3972361 ccaggccacc gccgccgcca acatcggcgc cgaacctgca cccgtgaaca tccgcaacga 3972421 attaagctcc ggcggcaata ccgcatagtt catgaccccg tcccttcccg acctgacaat 3972481 cagtcagaac cgtaggacaa accgggtcgg accatctgcg tttccgtgaa atccgcgaac 3972541 cagcggtgtc gtcaatgcgt tacggccgca ccgctatcca gctcgcgttt gatttccagc 3972601 gcgatgtcga tgagctggtc ttcctggcca ccgatgagct tgcgctgacc ggcccggtgc 3972661 aacagcgccg acgccggcac gccgtagcgc tcggcctggc ggaccgcatg cttgaggaag 3972721 ctggagtaga ccccggaata ccccatgatc aacgcgttgc ggtcgagcag acattcggcc 3972781 ggcatggccg ggcgcaccac gtcctcggcg gcgtcggcaa tgtcgaagaa atcaatgccg 3972841 gtcttgacgc cgatcttgtc gaacaccccg atcagcgcct cgaccggcgc gttacccgcc 3972901 ccggcgccga aacgccggca ggacccgtcg atctgcttgg cgcccgcgcg caccgccgcc 3972961 accgaattgg ccaccccgag accgaggttc tcgtgcccat gaaagcccac ctgggcgtct 3973021 tcgccgagct cggcgaccag ggccgacacc cggtcggcca cgccgtcgag caccagggca 3973081 ccggcggagt cgacgacgta gacacactgg cagccggcgt cggccatgat gcgggcctgg 3973141 gcggccagtt tctccggcgc aatggtgtgg gccatcatca aaaacccgac ggtttccaga 3973201 cccagttcgc gggccagccc gaaatgctgg atcgacacgt cggcctcggt gcagtgggtg 3973261 gcgatccggc agatcgaccc gccgttgtcc cgcgcctctt tgatgtcgtc cttggtgccc 3973321 acaccgggca acatcaaaaa cgcgatccgg gcctctttcg cggtcgccgc ggccagcttg 3973381 atcagctcct gctcaggggt tttcgagaag ccatagttga acgatgagcc gcccaggccg 3973441 tcgccgtggg tcacctcgat caccggcacg ccagcggcgt ctagggcggc cacgatggca 3973501 ccgacctcgt ccttggtgaa ttggtggcgt ttgtggtgcg acccatcccg cagcgaggtg 3973561 tccgtgatgc ggacgtccca catatcggtc atcgcgctcc tcctacaacc agcgtctcct 3973621 tggcgatctc ctcgcccacc ttggtggccg ccgcggtcat gatgtccagg ttgcccgcat 3973681 agggcggcag gtaatccccg gcgccctcaa cctcgacgaa cgtggtgacc agcgcctgcc 3973741 cgcccgagtt gatcgacggc tcgtcgaact gcggttcgtt gagcagccgg tatccaggca 3973801 cgtaggtctg cacctctttg acgacgtcgt ggatggaggc ggcgatcgct tcgcggtcgg 3973861 cgtcggtggg gatggcgcaa aagatggtgt cgcgcatgat catcggcggg tcggcgggat 3973921 tcaagatgat gatcgccttg ccgcgggcgg ccccgccgat ggtctggacc ccacgggcgg 3973981 tggtcttggt gaactcgtcg atgttggcgc gcgtgcccgg tcccgctgaa acggaagcca 3974041 ccgacgccac gatctcggcg tagggcacct ccacgatccg agacaccgcg tacacgatcg 3974101 gaatggtcgc ctgtcccccg caggtgatca tgttgacgtt cggcgcgtcc aggtgctcgc 3974161 gcaggttcgc cggcgggatc accgccggac ccaccgccgc cggcgtcagg tcgatggccc 3974221 ggatcccggc ctcggcgtac ttgggcgccg cgtcccggtg cacgtaggca ctggttgcct 3974281 cgaacaccag gtcgggttta tcgggctgcg ccagcagcca gtccaccccc tcgtgggtgg 3974341 tctccaaacc cagcttggcc gcgcgcgcca ggccatcgct ctccgggtcg atgcccacca 3974401 tccagcgcgg ctccagccac tccgatcgca gcagcttgta cagcagatcg gtgctgatat 3974461 ttcccgaccc gacaatcgcc acttttgcct tggacggcat gttgctcccc ttattcgaac 3974521 gacaaccgga ccaaacccag cccggtgaag tcggcgacaa actcgtcgcc ggcccgcgcc 3974581 tcgaccgcga acgtgcatga cccgggtaac acgatgtcgc ctttgcgcag ccgcacgccg 3974641 aaactctcga ccttgccggc cagccaagcc accgcggtcg ccgggttacc caacaccgca 3974701 tcactgcggc cctcggccac cacctcgccg ttgcgggtca gcttcgcatc gatcgccctg 3974761 acgtcaagat cggccggcgg cacccgggcc gcgcccaaca cgaagcccgc cgccgaggcg 3974821 ttgtcggcga tggtgtcgca gatcttgatc tgccaatcct tgatcctggt gtcgatcagc 3974881 tcgatggcgg gcaccagggc ctcggtggcc gccagcacgt cgtcctcggt gcagcccgca 3974941 cccggtaggt cggcggccag gatgaagccc acctccacct caacccgcgg agacaggtac 3975001 cgggacgcct ggaccggcgt gtcttcgaac acctgcatgt cgtcgagcag gtgtccgtag 3975061 tctggttcgt caacccccat catctgctgc atgatcggcg acgacagccc gaccttatga 3975121 cccaccacgc gggcaccctc ggccacccgc tgccggatgt tgatcaactg gatctcgtag 3975181 gcgtcgacga catcgatctc gggatgggcg gcggtcagtt gaccgatcgg gtcgcggctt 3975241 cgctcggctt gtgctaggtc ggcggccagc tcatcacggg tggcatcacg gagcattcgg 3975301 cgaagtcccc tcgtaggcgt gaccgggcca gtagcgcccg acccgagcaa ttctataacg 3975361 tgttctacat gactgtgcag gagttcgacg tcgtggtggt cggcagcggc gccgccggca 3975421 tggttgctgc gctggtcgcc gctcaccgag gtctctcgac ggtagtcgtc gagaaggccc 3975481 cgcactacgg cggctccacc gcacgctcgg gcggcggcgt ctggatcccc aacaacgagg 3975541 tcctcaagcg ccgcggcgtt cgagatacac cggaggcggc acgcacctat ctgcacggca 3975601 tcgtcggcga aatcgtcgag ccggaacgca tcgatgctta cctcgaccgc gggcccgaga 3975661 tgctgtcgtt cgtgctgaag cacacgccgc tgaagatgtg ctgggtaccc ggctactccg 3975721 actactaccc cgaggctccg ggcggccgcc cgggcggacg ttcgatcgag ccgaaaccgt 3975781 tcaacgcgcg caagcttggt gccgacatgg ccgggctgga gcccgcgtat ggcaaggttc 3975841 cgctcaatgt ggttgtgatg cagcaggact acgttcgcct caatcagctc aaacgtcacc 3975901 cccgtggcgt gctgcgcagc atgaaggtcg gcgcccgcac gatgtgggcg aaggcaacag 3975961 gtaagaacct ggtcggcatg ggtcgagccc tcattgggcc gttgcggatc gggttgcagc 3976021 gcgccggagt gccggtcgaa ctcaacaccg ccttcaccga tcttttcgtc gaaaatggcg 3976081 tcgtgtccgg ggtatacgtc cgcgattccc acgaggcgga atccgctgag ccgcagctga 3976141 tccgggctcg ccgcggcgtg atcctggcct gtggtggttt cgagcataac gagcagatgc 3976201 gaatcaagta ccagcgggca cccatcacca ccgagtggac cgtgggcgcc agcgccaata 3976261 ccggtgacgg cattctcgcc gccgaaaagc tcggcgcagc actggatctg atggatgacg 3976321 cttggtgggg cccgacggta ccgctggtcg gcaaaccatg gttcgcgctc tcggagcgca 3976381 actctcccgg ttcgatcatc gtcaacatgt caggcaagcg attcatgaac gaatcgatgc 3976441 catacgtcga agcctgtcat catatgtacg gcggcgaaca cggccagggg cccggaccgg 3976501 gcgagaacat tccggcgtgg ctggtgttcg accagcgata ccgggaccgc tacatcttcg 3976561 cgggactaca accagggcaa cgcattccga gcaggtggct ggattccggc gtcatcgtcc 3976621 aggccgatac ccttgcggag ctggccggca aggccggtct acccgcggac gaactcactg 3976681 ccaccgtcca gcgtttcaac gcattcgccc ggtccggtgt cgacgaggac taccaccgcg 3976741 gggaaagtgc ctacgatcgc tactacggcg acccgagcaa caagcccaat ccgaacctcg 3976801 gcgaggtcgg ccacccgccc tattatggcg ccaagatggt tccgggcgac ctggggacca 3976861 agggcggtat ccgcaccgat gtcaacggac gtgctctgcg ggacgacggc agcatcatcg 3976921 acggccttta cgctgcaggc aatgtcagtg ccccagtgat gggacacacc taccccggtc 3976981 cgggcggcac gataggcccg gcgatgacgt tcgggtacct ggcggcgctg cacattgccg 3977041 atcaggcggg aaagcgctga tatgcccatc gacttggacg tcgcgctggg tgcacagcta 3977101 ccgcccgtcg aattctcttg gaccagtacc gatgtgcagc tctaccagct gggactgggc 3977161 gccggctctg atccgatgaa cccccgtgag ctgagttatc tggcggacga tacaccgcag 3977221 gtgttgccga cgttcggcaa cgtcgcggcc accttccacc tcaccacacc accgaccgtc 3977281 cagtttccgg gcatcgatat cgagctcagc aaggtgctgc acgccagcga gcgagtcgag 3977341 gttcccgccc cgctgccgcc gtcgggttcg gccagggcgg tcacccggtt caccgacatc 3977401 tgggacaagg gcaaagccgc ggtaatctgc agcgaaacga cggcgaccac accggacggc 3977461 ttgctgctgt ggacgcagaa gcggtcgatc tatgcccgtg gcgaaggcgg attcggcggc 3977521 aagcgcgggc cgtcgggatc agatgtcgcg ccggagcggg cgcccgatct gcaggtcgcg 3977581 atgccgattc tgccgcagca agcgctgctc taccggctct gcggcgaccg caacccgctg 3977641 cactcggatc ccgaattcgc cgctgccgca ggctttcccc ggcctattct gcatggcctg 3977701 tgcacctatg ggatgacctg caaggcgatc gtcgatgcat tgctggactc cgatgcgacg 3977761 gccgtggccg gctacggcgc acgctttgct ggcgtggcgt acccgggcga gacgctcacg 3977821 gtcaacgtgt ggaaggacgg ccgccgcctg gtggccagtg tcgtcgcacc cactcgtgac 3977881 aacgctgtgg tgctcagcgg agtggagctg gtgccggcat agcggtgcgg tcggcgctaa 3977941 aggtttggtg agactgcgga tttcgcagaa gtcgacatga cattgctgct atggtctgcg 3978001 gtgacggggc cgtcgcagtg gtggcgcggc ggttgggccg agccggcggg atgttgtcat 3978061 ggcggatttc ttgacgttgt caccagaggt gaattcggcc cggatgtacg cgggtggggg 3978121 gcccgggtcg ctatcggcgg ccgcggcggc ctgggatgag ttggccgccg aactgtggtt 3978181 ggcggcggcc tcgttcgagt cggtgtgctc cggcctggcg gaccgttggt ggcaagggcc 3978241 gtcgtctcgg atgatggcgg cgcaggccgc ccgccatacg gggtggctgg ccgcggcggc 3978301 cacccaggca gagggagcag ccagccaggc tcagacgatg gcgctggcct atgaagcggc 3978361 gttcgccgca accgtacacc cggcgctggt cgcggcgaac cgcgccctcg tggcctggtt 3978421 ggcggggtcg aatgtgttcg ggcagaacac cccggcgatt gcggccgccg aggccatcta 3978481 cgagcagatg tgggctcagg atgttgtcgc gatgttgaac taccatgcgg tggcctcggc 3978541 ggtcggggcg cggttgcggc cgtggcagca gttgctgcat gagctgccca ggcggttggg 3978601 cggcgaacac tccgacagca caaacacgga actcgctaac ccgagttcaa cgacgacacg 3978661 cattaccgtc cccggcgcat ctccggtgca tgcagcgacg ttactgccgt tcatcggaag 3978721 gctactggcg gcgcgttatg ccgagctgaa caccgcgatc ggcacgaact ggtttccggg 3978781 caccacgcca gaagtggtga gctatccggc caccatcggg gtccttagcg gctctcttgg 3978841 cgccgtcgat gccaaccagt ccatcgctat cggtcagcag atgttgcaca acgagatcct 3978901 ggccgccacg gcctccggtc agccggtgac ggtggccgga ctgtcgatgg gcagcatggt 3978961 catcgaccgc gaacttgcct atctggccat cgaccccaac gcgccaccct cgagcgcgct 3979021 cacattcgtc gagctcgccg gcccggaacg cggtcttgcc cagacctacc tgcccgttgg 3979081 caccaccatt ccaatcgcgg ggtacaccgt ggggaatgcg cccgagagcc agtacaacac 3979141 cagcgtggtt tatagccagt acgatatctg ggccgatccg cccgaccgtc cgtggaacct 3979201 gttggccggc gccaacgcac tgatgggcgc ggcttacttt cacgatctga ccgcctacgc 3979261 cgcaccacaa caggggatag agatcgccgc tgtcacgagt tcactgggcg gaaccacgac 3979321 aacgtacatg attccgtcgc ccacgctgcc gttgctgttg ccactgaagc agatcggtgt 3979381 cccagactgg atcgtcggcg ggctgaacaa cgtgctgaag ccgctcgtcg acgcgggcta 3979441 ctcacagtac gcccccaccg ccggccctta tttcagccac ggcaacctgg tgtggtagtt 3979501 aacccaggat cagcccggac gtaggcaccc cggtgcccgc ggtgacgagc acatgctcga 3979561 cgcccgccac cgggttcacc gaggtgccgc gcagctgccg caccccctcc gcgatgccgt 3979621 tcatgccatg gatgtaggct tcgccgagtt gaccgccgtg ggtgttgatg ggcagccgcc 3979681 cgcccacctc gatcgcgccg tcggcgatga agtctttcgc ttcgcccttg ccgcagaatc 3979741 ccaactcctc caactgaatc agggtaaacg gcgtgaagtg gtcgtagagg actgcggtct 3979801 ggacatcggc cggcgtcagc cccgactgcg cccatagctg ccggcccacc aggcccatct 3979861 cgggcaggcc gtcgagttcc ggccggtagt agctgaccat cgtgtactgg tctggactgc 3979921 agccctgcgc agccgcctca atgaccaccg ggcgctgctt gaggtcccgt gcgcgcgcag 3979981 ctgacgtcac cacgatcgcg accgcgccgt cggtctcctg gcagcagtcc agcagccgca 3980041 gcggctcggc gatccacctc gaattctggt ggtcctcaat ggttatcggc ttgccgtaga 3980101 agtacgcctt ggggttgttg gcggcatgct tgcggtcggc caccgagaca gcaccgaagt 3980161 cccggctggt cgcaccagac aggtgcatgt accggcgagc gatcatcgcc acttgcgcgg 3980221 cgggcgtgga gagcccgtgc ggatacgaaa acgaattgtc cacgccggtg gagtcggcat 3980281 tctcggtcaa acgagtttgc acctgaccga accgcatgcc ggatcgttcg ttgaatgccc 3980341 gatacgccac cacgacgtca gccaccccgg tggccactgc catagcggcg tgctgcacgg 3980401 tcgcacatgc ggcgccaccg ccgtagtgga tcttggagaa gaacgtcagc tcgccgatgc 3980461 cggccgcacg cgccacggcg atttcggtgt tggtgtccat cgtgaacgtg gtcagcccgt 3980521 cgacatcggt cgggctcagg cccgcatcgg ccaacgcatc caacaccgcc tcggccgcca 3980581 gccgcagctc acttcgaccg gagttcttcg aaaagtcggt ggcgccgata ccgacgatgg 3980641 ccgcctgacc cgataacact acgaatccct catcgaaagt tccaccgtcg cggtcacgtg 3980701 gtcgccaagg gtattgcggc ccaccacctt taccgtgatc aagccgtcgt tcaccgcggt 3980761 cacctcaccg gagaacgtca ccgtgtcgta ggcgtaccac ggcaccccca gccgcagccc 3980821 aatcgacttg atcagcgccg acgggcccgc ccagtcggtg acgtagcgtt gcaccagccc 3980881 ggtgtcggtg aggatgttga cgaaaatgtc tttcgacccc tgggcgacgg ccttgtctcg 3980941 atcatgatgc acatcctgga agtccctggt agccagcgcc gttgagacga tgaacgtcgg 3981001 gtctccgtag agcttcagct caggcagcac agcaccaaca accgtcattc gtcaggctcc 3981061 catgcgtaga ggctccagtc ggggaaatcg atataggtcg ctcgtaccgg cataccgatc 3981121 gcaacacgag caggatcggc cccccgcagc tcgcccagca tgcgtacccc ttcctcgagc 3981181 tccaccagcg cgatcacgaa gggcaccgtg cgacccggaa ctttcggcgc gtgatgcacc 3981241 acgaagctga acaccgtgcc gcgaccgctg gagacgacgt agttgatcgg caccgatttg 3981301 tcttgccaca ccgccggcac cggtgggtgc cgcaggctgc catcggcaag ccgctggatc 3981361 cgcaattcgt gggccttgac tccatcccag aaaaacgcgg tgtcccgcga cgacgaggga 3981421 cgcatcatag cgtcgggatc caaatcgtca ggcaccgagc tcggagaacc cgcgggcttg 3981481 aatttgagga tgcgccaatt catctctgcg acgtcctcgt ccccgacttg ccatacgatg 3981541 tgctggttga tgaaccagcc ctcgccgagc gcggtttgct tgggtccgac gacgtcaccg 3981601 agctcggcgc tgatgctgac ttgctccccg ggcaataggt agcggtggta ggtctgctcg 3981661 cagttggtgg caaccacacc gatgtagccg gcgtcgtcga acagcttgat gatgggtccc 3981721 agcggatcgt ccttcggacg cactccgccc agacccatca tggtccacac ctgaatcatg 3981781 gccggtggcg cgacgattcc ggggtggccg gcggcgcgag ccgccgcgtc gtccacatag 3981841 atggggttgc ggtcgccgat ggcctccacc cagttgttga tcatcggctg gttcaccggg 3981901 tcacgggcca ggcgcggctt gctgggcccg gccgccttga tctgggcaac cgcttcctga 3981961 atgtcgctca ccccggtcac ctgggcaccc ttggcacttt gaggccagac gcggcgatca 3982021 tctcgcgcat gacttcgttc acacccccgc cgaaggtgat caccaggttg cgcttggtct 3982081 gggcgtccag ccagcgcagt agctcggcgg tgtcgggttc ggcggggttg ccgtacttgc 3982141 caacgatttc ctcggcgagc cggccggcac gctgaacacg ctcggtgcca aagactttcg 3982201 tggccgcggc atcggccatg ttgatgtcct caccggcgga cgctacctgc cagttgagca 3982261 actcgttgat ccgccagatc gcacgaatct caccaagagc ccgcttgacg tcgtcgtggt 3982321 cgatcggcgt cacgccgttg ccacccggca cggacgccca cgcgtgcacc cggtcgtaga 3982381 tgctggcgaa ccgcccggcc gggccgagca ttacccgttc gttgttgagt tgggtggtga 3982441 tcagccgcca gccgtcgttc tcctttccga ccagcatgtc gaccggcacg cgcacgtcgt 3982501 tgtagtacgt ggcattggtg tggtgggcgc cgtcggccaa gatgatcggc gtccaggaat 3982561 agccgggatc cttggtgtcg acgattagaa tggaaatgcc tttgtgctta gcggcattcg 3982621 ggtcggtgcg gcaggccagc cagatgtagt cggcgtcgtg tgcgccggtg gtgaagacct 3982681 tctggccgtt gacgatgtag tggtcgccgt cgcgaacggc ggtggtgcgc aacgacgcca 3982741 ggtcggtgcc ggcttccggc tcggtgtagc cgatcgcgaa gtgcgcctca ccggccagga 3982801 tcgccggcag gaacttcttc ttctgcagct cgctgccgtg cgcctgcagc gtggggccga 3982861 cggtctgcag cgtcaccgcg ggcagcggca cgtcggcgcg atgggcctcg ttgacgaaga 3982921 tctgctgctc gatcggacca aaacccagac cgccgaactc tttcggccac ccaacaccga 3982981 gcctgccgtc ccggcccatg cgccgtatca ccgcacggta ggccgggccg tgccggtctt 3983041 tctccatctc cgtgcgctcg tcgggcgaga tgagattcga aaagtattgc cgtatctcgg 3983101 cttgcagctg gcgctgctcc ggcgtcaggt caatgaacat cgcgctccca ggagctcaag 3983161 gcgatgcgag ggcccgccca gcagccgggt gaggtccttg atcgtggagt agtagcggtg 3983221 catcggatac gtgacgtcca tccccatgcc gccgtgcagg tgatggcaga tttgcatcgc 3983281 cggcggcgcc tgcgatgtca cccagtaccc gaggacgccc agatcatctc ccgcatccag 3983341 atcctcggcc agtctccaga tcaccgactt ggccaccagg tcaatggtgc gcgaggcgat 3983401 gtaaacctcg gcgagctgcg cggccacggt ctggaaggtt gacagcggct taccgaactg 3983461 cttccggttc gccacgtagt cggcggtcag ccgcagcgcc ccggcgacca gcccgtcggc 3983521 gtatgcaccc atgacggcca gcgctagctg attgacccgg tgcgcggcta catccgccag 3983581 gatgtcacag tcggcaaccg ccacgccgtc catcgtcatc acatactcgt ctgaaccatt 3983641 cgatgtgggc gtacgaacca tgcgcacacc gtcggccgtc ggcgacacca ccacgacggc 3983701 gttgtcggcg gtcaccaaca tccagtccgc ctgttcggcg tagccaacac cgactttggt 3983761 gcccgacaac cgcccaccca caaagctagt ggcaggccga tccggcagcg ccgccccggg 3983821 ctcgttgagc gcggcggtca gtactcctcc cttggccacc ccggccagga agcggtcctg 3983881 ttgctcggcg gatgccagct cgagcagcgg caccacccca agacccagcg ttgccagcgc 3983941 cggcgtgacg gcgccgtggc gacccacctc ggtgagcagc gcgccgactt cgaataggcc 3984001 cacgccgtcg ccgccgagac gttccggcac cggcagcgcc gtcacaccac cgcagaccag 3984061 cgcctcccac gagatgtccc gctccaacac cgacgtgacc acgtcggcga cggcttgctg 3984121 ttccgcagtg ggatcgaaat ccattagtga gcaaccgggc atctaccggt gtagtcgacc 3984181 tgccagtgct taatgccgtt gagccagccg gaccgcagcc gctcgggcgc cgagatcggc 3984241 ttgaggtcgg gcatgtggtc ggctacggcg ttaaagatta ggttgatcgt catccgggcc 3984301 agattcgcac cgatgcagta gtgagcgccg gtgccgccga agccgacgtg cgggttgggg 3984361 ttgcgcagga tgttaaatgt gaacggatcc tggaaaacct cttcgtcgaa gttagccgac 3984421 cggtagaaca tcaccacccg ctgacccttc ttaatctgta cgccggacaa ctcgtagtcc 3984481 cgcagcgcgg tgcgctgaaa agcggtgacc ggggttgccc agcgcacgat ctcatcggcc 3984541 gcggtctccg gacgcacttt cttgtacagc tcccactggt cggggtgttc agcgaacgcc 3984601 atcatgccct gggtgatgga gttgcgggtg gtctcgttac cggccaccgc cagcatcacc 3984661 acgaagaagc cgaactcgtc gtcggagagc ttctcgccgt cgatatcggc ttggatcaac 3984721 tgagtcacga tgtcgtcggc ggggttcttc gccttctcct cggccatctt catcgcatag 3984781 ccgatcagct ccgccgagga cgccttcgga tcgatgtggg cgtattccgg atcctcgttg 3984841 ccggtcatct cgtttgacca gtggaacagc ttgccgcggt cctcctgcgg cacgcccagc 3984901 aagcccgcga tcgcctgcaa tggcagctca caggaaacct gctcgacaaa gtctccagaa 3984961 cccgcggcgg ccgcctccgc ggcgatcttc tgggcgcgct cctggagctc gtcatgcagg 3985021 cgtccgaccg cacgtggcgt gaagccgcga gagatgatct tgcgcagccg ggtgtggtgc 3985081 ggcgcgtcca tgttgagcat gacgaagcgc tgaacctcga tgtcctcacg cgcgatgtcg 3985141 ttcttgaatc gcgggatcac cccgttttcg tagctggaga acacgtcgct atgccgcgat 3985201 atctctttga cgtcgttgag tttggtgatc gcccagaaac cgccgtcgtg aaagccgccg 3985261 cccttgccag gatcctgccc gttccaccag atcggcgccg cggaccgcag ctcggcgaat 3985321 tcggcaaccg gcagccgttc ggcgtagatt gcggggtcgg tgaaatcgaa cccgggcggc 3985381 agattggggc tgggcacggt agttctcctt actgcaatct ccactgactg gtgattccac 3985441 gacactagct gtcctagtga ggaccttctg ccagtaaaac atgccttcac cgcagacaaa 3985501 aggcattgaa gcaaccttgc ttgtcatagt aatgaaacgt gttctagcct ggccccatgg 3985561 gttacccggt catcgttgaa gccacccgca gccccatcgg caaacgcaac ggatggctgt 3985621 cggggctgca tgccaccgag ttgttgggcg cggtgcaaaa ggcggtggtc gacaaggccg 3985681 gcatccagtc cggccttcac gccggtgacg tcgaacaggt catcggcggt tgcgtgaccc 3985741 agttcgggga gcaatccaac aacatcagcc gggtggcctg gctgacggcc ggtttgcccg 3985801 aacacgtcgg cgccaccacc gtcgactgcc agtgcggcag cggccagcag gccaaccatc 3985861 tgattgccgg gttgatcgcg gccggtgcca tcgatgtcgg catcgcctgc ggcatcgagg 3985921 cgatgagccg ggtcgggctg ggcgccaacg ccgggccgga ccgctcgctg atccgcgcgc 3985981 agtcatggga tatcgacctg ccgaaccagt tcgaggccgc cgagcggatc gccaagcggc 3986041 gcggcatcac ccgcgaggac gtggatgtct tcgggctcga gtcgcagcga cgcgcgcagc 3986101 gggcctgggc ggagggccgc tttgaccgcg agatctcgcc gatccaggcg ccggtgctcg 3986161 acgagcagaa tcagcccacc ggcgagcggc gcctggtctt tcgcgaccag ggcctgcgcg 3986221 agaccacgat ggcggggcta ggcgagctga aaccggtgct cgagggcggc atccacaccg 3986281 cgggcacgtc gtcgcagatc tccgacggcg cggcagccgt gttgtggatg gacgaagccg 3986341 tggcacgtgc gcacggcctg accccgcggg cccggatcgt cgcccaggca ctcgtcggcg 3986401 ccgagcccta ctaccacctg gacggcccgg tgcagtccac cgcgaaggtg ctggagaagg 3986461 ccggcatgaa gatcggcgac atcgacatcg tcgagatcaa cgaggcgttc gcgtccgtgg 3986521 tgctgtcctg ggcgcgggtg cacgagcccg acatggaccg ggtcaacgtc aacggcgggg 3986581 cgatcgcgct ggggcatccg gtgggctgca ccggcagccg gctgatcacc accgccctgc 3986641 acgagctcga gcgcaccgac cagagcctcg cgctgatcac catgtgcgcc ggcggggccc 3986701 tgtccaccgg caccatcatc gagcggattt aacctagctg cggcagggca ccgtgcggcg 3986761 tgactgcaac atgaagcgac cgatgattag atagcgaggc ggacgcgcgc ctttggcgac 3986821 ccttggtcgc taggatcagc gtcatgccga aatcaccgcc gcggtttctg aattcgccgc 3986881 tcagcgactt ctttatcaag tggatgtcac ggattaatac ctggatgtac cgccgcaacg 3986941 acggggaggg tctgggcggc accttccaga agattccggt cgcgctgctg accaccaccg 3987001 gccgcaagac cggccagccg cgggtcaacc cgctctactt cctgcgcgac ggtgggcggg 3987061 tcattgtcgc ggcctccaag ggcggcgcgg agaagaaccc gatgtggtac ctcaacctca 3987121 aggccaaccc caaggttcag gtacagatca aaaaggaagt gctggacctt accgcgcggg 3987181 acgcgaccga cgaggagcgc gccgaatatt ggccacagtt ggtcacgatg tacccaagtt 3987241 atcaggacta ccagtcctgg accgaccgca cgatcccgat cgtggtttgc gaaccctgac 3987301 cgttcccaac ttcgccgaac gtgaagccag ggcgagaaaa cggccgaaat ctcgccctga 3987361 gttcacgctc ggcgcagata actaggcccc atagaccgga accggcggcc gcgacttggc 3987421 caacaggtcg ctgacgacgg gccccagctc ggccggatcc catttcacgc ccttgtccac 3987481 ctgcgggcca tgcgcccagc cctcggcgac ccggatgatg ccgccctcga cctcgaatac 3987541 cttcccagtg acatcgcggg actccgcact gcccagccat accaccaagg gtgagacgtt 3987601 ctccggggcc atcgcgtcga acccctcctg cggcttggcc atcatctccg cgaacacagt 3987661 ctcggtcatg cgggtgcgcg ccgccggcgc gatcgcgttg acggtcacgc cgtaccgcct 3987721 catttcggcg gcgccgacga gcgtcagcgc cgcgattccg gccttggcgg cgctgtagtt 3987781 gccctgcccc acgctgccct gtaggcccgc gccagagctg gtgttgatga tccgcgcgtc 3987841 aatgtctttc ggggctttgc ccgccttgga cagtccccgc caatgggacg cggcgtgccg 3987901 catggtggcg aagtggccct tgaggtgcac cgcgatgaca gcgtcgaact cctcttcgct 3987961 ggtgttggcg atcatccggt cccgcacgat gccggcattg ttcaccagga cgtccacacc 3988021 accgtacgtc tcgacggcgg cctggatcag gttggccgcc tggtcccagt ccgagatgtc 3988081 cgacccgtcg gcgacggctt ggccaccggc cgcaaggatc tcgtcgacca cgtcttgggc 3988141 tgcgctgccg ccgcttgccg gcgaaccgtc caggcccaca ccgatatcgt tgaccaccac 3988201 gcgcgcaccc tcggccgcga aggccaacgc atgtgcgcgg ccgatgccgc cacccgctcc 3988261 ggtgacgatg accacccggc cgtcgaccaa gcccatgacc ccattgctcc tttgctcgtc 3988321 acttgttggc actcgaggcg cccaggtacg gcggcggctc accgccgccg tgcacctcga 3988381 gcgtcgcccc gctgatatat gacgccgcat cggacgccaa aaacgctgca gcccaaccaa 3988441 tgtcggcagg tcgtgccagc cggcccaacg gcaccgtggc ggcgacgcga gcgatcgact 3988501 cggcatcacc gtagaacagt tcggaccgtt cggtttccac catgccgacc accacggcgt 3988561 tgacccgaac cttgggtgcc cattccaccg ccagcgtggt ggtcaggttt tccaggcctg 3988621 ccttggccgc gccataggcc gccgtgccgg gagtgggacg gcgaccgctg acgctacaga 3988681 tgtttacgat cgacccaccg ttgggctgcg cttgcatcag cacgttggcg tgctgggaaa 3988741 ccagcagcgg tgcaagcaca ttgagctcga cgatctttcg gtggaagttg tgtgtcgcct 3988801 cggcggccag cgcgtatggc gagccgcccg cgttgttgac cagcatgtcg agtcggccgt 3988861 gccgctcccc gatctcaccg accaggcgct tgaccgagtc ctcgtcccgg atgtcgcagc 3988921 ggtggaactc atacggttgg ccgtcgaccg ctcgtcgcgc gcaggtgatc acggtcgcgc 3988981 cctgttcggc gaataccgag ctgatgcccg cgcctacccc gcggacaccg ccggtgacca 3989041 aaaccacccg cccggccagc ccgaaattga tggcgtcggc tgcctcggcg agagtcactg 3989101 tgctagcgta ccaagcaagt gcttgcttag gtagcgaacc cgcaggagtg caatgccgat 3989161 cacctccacc acgcccgaac cgggcatcgt cgcggtcacc gtcgactacc cgccggtcaa 3989221 cgccatcccg tcgaaagcgt ggttcgacct ggccgacgcg gtgacggccg cgggcgccaa 3989281 ctccgacacc cgcgcggtga tcctgcgggc cgaggggcgc ggcttcaacg ccggggtgga 3989341 catcaaagag atgcaacgaa ccgaaggttt cacggcgctg atcgacgcca accgcggctg 3989401 cttcgccgca ttccgcgccg tctacgagtg cgcggtgccg gtgatcgccg ccgtgaacgg 3989461 attctgcgtg ggcggcggca tcggcctggt cggcaactcc gacgtcatcg tggcctccga 3989521 ggacgccacc ttcggcctgc ccgaggtgga acggggcgcg ctgggcgcgg ccacgcacct 3989581 ctcgcggctg gtgccccagc acctgatgcg acggctgttc tttacggcgg ccaccgtgga 3989641 cgcggccacc ttgcagcact tcggctcggt gcacgaggtg gtgtcccgcg atcagctgga 3989701 cgaggccgct ttgcgggtgg cccgcgacat cgccgccaaa gacacccggg tcatccgcgc 3989761 cgccaaggag gcgctgaact tcatcgacgt gcaacgggtc aatgcgagtt accggatgga 3989821 gcaaggtttt accttcgagc tcaacctcgc cggagtcgcc gacgagcacc gcgacgcctt 3989881 tgtgaagaag tcatagtgcc cgataaacga accgctcttg acgacgccgt cgcgcaattg 3989941 cgcagcggca tgaccatcgg catcgccggc tggggctcgc ggcgcaagcc catggcgttc 3990001 gtgcgggcca tcctgcgctc ggatgtcacc gatttgacgg tggtcaccta cggcgggccg 3990061 gacctggggc tgctgtgctc ggcgggcaag gtcaagcggg tctactacgg gttcgtctcg 3990121 ctggactcgc cgccgttcta cgacccgtgg ttcgcgcacg cccgcaccag cggcgcgatc 3990181 gaggcccggg agatggacga gggcatgctg cgctgcggtt tgcaggccgc ggcacaacgg 3990241 ctgccgttcc tgcctattcg cgccgggctg ggcagctcgg taccacagtt ctgggcaggc 3990301 gagctgcaga cggtcacgtc gccgtatccg gcgcctggcg gcgggtacga gacactgatc 3990361 gccatgccgg cactgcgcct ggatgccgcc ttcgcccact tgaatctcgg tgacagccac 3990421 ggcaatgcgg cctacaccgg catcgacccc tacttcgacg atctcttctt gatggccgcc 3990481 gagcggcgct ttctgtcggt ggagcgcatc gtcgccaccg aggaactggt caaatcggtg 3990541 ccgccgcagg cgctgttggt caaccggatg atggtcgacg ccatcgtgga agcacccggc 3990601 ggcgcccact tcaccaccgc cgcaccggac tacgggcgcg acgagcagtt ccagcggcac 3990661 tacgccgaag cggcgtcgac acaggtgggt tggcagcagt tcgtgcacac ctacctatcc 3990721 ggcaccgaag cggactacca ggccgcggtg cacaactttg gagcatcacg gtgagcaccc 3990781 gagccgaagt gtgtgccgtc gcctgcgccg agttgttccg cgatgcaggc gaaatcatga 3990841 tcagccccat gaccaacatg gcctcggtag gggcgcggct ggcgcggctc accttcgcgc 3990901 cggacattct gctgaccgac ggcgaggctc agctgctcgc ggacacaccg gcattgggca 3990961 agacgggcgc cccaaacagg attgaggggt ggatgccgtt cggccgggtt ttcgaaaccc 3991021 tggcctgggg gcgccggcac gtggtgatgg gcgccaatca ggtcgaccgc tatggcaatc 3991081 agaacatctc ggcgttcggg ccgctgcagc ggccgacccg gcagatgttc ggcgtccgcg 3991141 gctcgccggg caacaccatc aaccacgcca ccagttactg ggtgggcaac cactgcaagc 3991201 gggtctttgt cgaggccgtc gatgtggtct ccggcatcgg ctacgacaag gtggatccgg 3991261 acaatccggc cttccggttc gtcaacgtct accgggtggt gtccaaccta ggcgtgttcg 3991321 acttcggcgg ccccgaccac tccatgcggg cggtatccct acaccccggg gtgacgcccg 3991381 gcgacgtccg cgacgccacc tcgttcgagg tgcatgacct cgacgcggcc gagcagacca 3991441 ggctgcccac cgacgacgaa ctgcacctga tccgcgcggt aatcgatccg aagtcgttgc 3991501 gggacaggga gatacgatca tgattgttcc gcctcctctc ccccgcaagc gggaggtgcg 3991561 cccacatcgc ttcgtcccct gcaagcgggt ggtaccccca ctgcattgtc ggcggtggct 3991621 atgaggctgc gtacgccgct gaccgagctc atcggcatcg agcacccggt ggtgcagacc 3991681 gggatgggct gggtggccgg tgcccggctg gtgtcggcca ccgccaacgc gggcgggctg 3991741 ggcatcttgg cctcggccac catgacgctg gacgagctgg cggcggcgat cacaaaggtc 3991801 aaggccgtca ccgacaagcc attcggggtg aacatccgcg ccgacgcagc cgacgcgggc 3991861 gaccgcgtcg agttgatgat ccgcgagggg gtgcgggtgg cctcgttcgc gttggcaccc 3991921 aaacagcagc tgatcgcccg gctcaaagaa gccggcgcgg tggtcatacc gtcgatcggc 3991981 gcggccaaac atgcgcgcaa ggtggcggcc tggggcgccg acgcgatgat cgtgcagggc 3992041 ggcgagggcg gcggccacac cgggccggtc gccaccacgc tgctgttgcc gtcggtgctg 3992101 gacgccgtgg cgggcaccgg catcccggtg atcgccgccg gcggcttctt cgacgggcgc 3992161 gggctagccg cggcgttgtg ctacggcgcc gccggggtgg ccatgggcac ccggtttctg 3992221 ctcacctcgg attccaccgt gcccgacgcg gtcaaacggc gttacctgca ggccggcttg 3992281 gacggcaccg tggtcaccac ccgcgtcgac gggatgccgc accgggtgct gcgcaccgag 3992341 ctggtcgaga agctggaaag cggctcgcgg gcacgaggtt tcgcggccgc gctgcgcaat 3992401 gccggcaagt ttagacggat gtcgcagatg acctggcggt cgatgatccg agacggcctg 3992461 accatgcgcc acggcaagga attgacctgg tcacaggtgc tgatggcggc aaacaccccg 3992521 atgctgctca aagccggcct ggtcgacggc aacaccgagg ccggggtgct ggcatcgggc 3992581 caggtagcgg gcattcttga cgacctaccg tcgtgcaaag agctgatcga gtcgatcgtg 3992641 cttgacgcca tcacacattt acaaaccgca tctgcgctgg tggagtgact gacgcgtgtc 3992701 aagcagagta cgctatcgca gctatgtcga ccgtcgagat ggaccaggcg gctccagagt 3992761 ccgccgcgca ccaccctctg ccggaccccg gtgagtcggt ccccagactc gcgctgccca 3992821 cgatcgggat cttcctggcc acgctcaccg cgttcgtcgg ttctacgacc gcttacatca 3992881 gcggatggat cccgttctgg gtgacgatcc ccgtcaacgc cgcggtcacg ttcgtgatgt 3992941 tcaccgtcgt gcatgacgca tcgcattacg cgatcagctc catccggtgg gtgaacgggc 3993001 tgttcgggcg gctggcgtgg cttttcgtcg ggccggtggt cgcgttcccg gccttcgggt 3993061 acatccacat ccagcaccac cgccattcca acgacgacga gcaagacccg gacaccttcg 3993121 cctcacacgg ctcgctgtgg gtgctgccgt tgcgctggtc gatggtcgag tacttctaca 3993181 tcaagtacta cctgcctcgc ggccgcagcc ggccggtcat cgaggtcgcc gagacgctgg 3993241 tgatgatgac cctgttcctg accggcctga tcgtcgccat cgtcaccggc aacttctgga 3993301 cgctggcgat cgtcttcctg atcccgcaac gtatcggcct taccgtgctg gcctggtggt 3993361 tcgactggct gccccaccac ggtctggagg acacccagcg cagcaaccgc taccgcgcga 3993421 cccgcaaccg ggttggcgcc gagtggctgt tcaccccggt gctgctgtcg cagaactacc 3993481 acttggtgca ccacctgcac ccgtcggtgc cgttctaccg gtacctgcgc acctggcggc 3993541 gcaacgagga ggcgtatctg gaacgcaacg ccgcgatctc cacggtcttt ggccagcaac 3993601 tgaatccgga cgagtaccgg cagtggaagg agctcaacgg ccggctcgcg cgactgctgc 3993661 cggtgcggat gccggcccgc tccagctcgc cgcacgcggt gctgcaccgc atcccggtcg 3993721 cgtcggtgga tcccatcacc gccgatgcca ccctggtgac tttcgcggtg ccggaagcat 3993781 tgcgggacgc gttccgattc gagccgggcc agcacgtgac ggtgcgcacc gacctgggcg 3993841 gccaaggcat ccggcgcaac tactcgatct gcgccccggc cacccgcgcc cagctgcgca 3993901 tcgccgtcaa acacattccc ggcggggcgt tttcgacgtt cgtggccaac gaactgaagg 3993961 ccggcgacgt gctcgagctg atgacaccga ccggccggtt cggcaccccg ctggatccgt 3994021 tgcaccgcaa gcactatgtg ggcctggtgg ccggcagcgg gatcaccccg gtgctgtcca 3994081 tcctggcgac cacgctggag atcgagaccg aaagccgatt cacgctgatc tacggcaacc 3994141 gcaccaagga atcgacgatg tttcgggccg agctggatcg tctggagtcg cgctatgccg 3994201 accggctgga aatcctgcac gtgctctcca gcgagccgct gcacaccccg gagctgcgcg 3994261 ggcgcatcga ccgagacaaa ctcaccaggt ggctgacgag taccctgcgg ccggccggtg 3994321 tggacgaatg gttcatctgc ggcccgctcg ccatggccac cgcggtgcgc gagaccctga 3994381 tcgagcacgg cgtggactcc gagcgcattc acctggagtt gttctacggg ttcgacacgc 3994441 ccccggcgac ccgtccctcc tatgcgggag ccaccgtcac cttcacgctg tccgggcagc 3994501 gggcgatatt cgatctggtg cccggcgact cgattctgga aggggcgctg gggctgcgca 3994561 gcgatgcgcc gtatgcgtgc atgggcggcg catgcggcac ctgccgagcc aaactgatcg 3994621 agggcaacgt cgagatggac cacaacttcg ccctccggaa ggcggagctg gatgccggct 3994681 acatcctgac ctgccagtca cacccgacga caccattcgt cgccgtcgac tacgacgcct 3994741 aggttcgtgg cgccgcccca tacttgcgcc gactgtgaat ctgacgacgc gacacgccga 3994801 ttcgccgtcg tgtggttcac tctcggcgct catgggcgcc atcccgccgc ccgcatcgcg 3994861 gcatcgacgc ggccaacgaa cgtgccccgg cggtaccaga gcagctcact ggtgaccctg 3994921 atgatcgtcc agcccagatc cagcaacgcg gtggaccgct cgatgtcccg agcccgctgc 3994981 gccgggtctg tccaatgctg tggcccgtca tactcgacac cgactcgcaa ttgctcgtag 3995041 cccaggtcga tgcgggcgac gaagtccccg tagtcgtcaa acactctgat ctgtgtttgc 3995101 ggcttcggca gaccggcatc gatcaacacc aatcgggtcc acgtctcctg tggggattcc 3995161 gcacccccgt cgatcagcgg cagcaccgca cggaggcgga ccaggccgcg cgcaccggta 3995221 tgttcggcaa tgacggcctg cacgtcggcg accttgacat cggtcgaatt cgccaacgcg 3995281 tccagccgtt gaacggcctg cagccgcgag ggtgtgcgcc gcccgatatc gaaggcggtg 3995341 cgcgccgggg tggttaccgc gacaccgtca accgcaaccg tctcgtgcgg cgccaatcga 3995401 tccgtgtgca cgacgatgcg cggcggaggc tttcgattgg cgtgcactaa ctctgcgtca 3995461 agcgctgggt ttacccactt cgcgccaagc agcgccgccg ccgaattgcc ggccacgacg 3995521 gcgcggcgcc gcgaccacag ccacgccgcg tgggcgcgct ggcgcgccgt cagctccaca 3995581 ccggccgggg cgtagacgcc cgggtagact ggctcgtaga gctgtctcat ggcccgctcc 3995641 ggaatggcct ttgcggccaa cacttccgag cccaggacgg gccatggaag ttcgtccatg 3995701 gccacatcct ggcatcaccc accgacaccc cgccgacagt gaatcgcacg acgcgacacg 3995761 ccgacgaccc gtcgtgagat tcaccctcgg cgccaacgaa ggcctacagc cgctcgataa 3995821 tggtgacgtt ggcggtgccg ccgccctcgc acatggtctg cagcccgtag cggccaccga 3995881 tgcgctccag ctcgcccagc atggtggtga acagtttggc gccggtggcg cctagcggat 3995941 gccccagcgc gatcgcgccg ccgttggggt tgaccttcgc cgggtcggcc ttgatttcct 3996001 tgagccaggc cataactacc ggcgcgaacg cctcgttgat ctcgacggtg tcgatgtcgt 3996061 cgatggcaag cccggtcttg tccagcgcgt accgggtggc ggggatgggt ccggtcagca 3996121 tgaataccgg gtcggcggcg cgcgcactga tgtggtggat gcgggcacgg ggcctaagtc 3996181 catggtcttt gacggcccgc tcggaggcca gcaacactgc actggcgccg tcggagatct 3996241 gactggccat cgccgccgtc agccggccgc cctcgaccag cggctgcaag ccggccatct 3996301 tctccagcga cgactcccgc gggccctcat caacccggaa cggcccggat tcggtttcca 3996361 cagtgatgat ttcgttttcg aagtggccgg cgcggatcgc cgcgaacgcg cgttcgtggc 3996421 tggtcagcga gtaccgctcc atctcttcac gggacaggtt ccacttctcg gcgatcagct 3996481 ccgagccacg gaactgtgaa atctcctggt cgccataccg gtgtaaccat tgcttggatt 3996541 cgttggtcgg cgaggtgaac ccgaactgtt cgcccacggt catcgccgac gagatcggga 3996601 tctggctcat gttctgcacg ccgccggcca cgatgacatc cgccgtgccg gacatgatcg 3996661 cctgcgcgcc aaaggaaatc gcctgctggc tggatccgca ctggcggtcc acggtgacac 3996721 cggggacctc ttcgggatag ccggcggcca gccacgacag tcgggcgatg ttgcccgcct 3996781 gtccgccgat ggcgtcgaca catccggcga tcacgtcgtc gacggcggcg gggtcgatgt 3996841 cggtccggtc cagcagtccg cgccaggcca gggcacccag gtcgacggga tggataccgg 3996901 ccagtgcgcc gccccgcttg ccgaccgcgg tccgtacggc gtcgatgacg tacgcctctg 3996961 tcataaccgc tcctctcccg ttgccagtga gtggtacccc caccgcatcg tcgtcgacac 3997021 ggggcatttc agactccctc tttggtgatc ccgccaagca cgatggctag gtattgctgg 3997081 cccacctgct gggcggtgag cggcccaccg ggtcgatacc agcgcaccga cacccaggtg 3997141 gtgtcacgga tgaatcggta gaccaggtcg acgtctaggt cgggccggaa gtagccctct 3997201 tcgatgccct ggttgagcac gtccacccac atcttgcgct gctgcttgtt acggtcctcg 3997261 atgtaggaaa acctgggttg cgacgccagc cgttgcgctt catcctggta gatcaccact 3997321 tgcgcgtgat gatgctcgat cgcctcaaac gacgccatga acaggccctg cagccgctcc 3997381 agcggattgg ccgtgctatc cacgatgtcg cggtaacggg cgaagagcca atcgaggaaa 3997441 ccgcgtaaca gctcatcgac catctcctct ttggaggcga aatggtgata caggctgccg 3997501 gataggatgc cggcgccgtc ggcgatatcg cgcacggtgg tggcgcgcag tccgcgctcg 3997561 gcgaacatcg ccgccgcgag ctccagcaac tcgcctcgcc ggctattgac ctgaccggcc 3997621 actcgatcca tccgaccaga ctatcaacca agcgcttgct cggccagctg cgacctcgat 3997681 ggggtgggaa tccgggaatt cggtacgagg gatgcgccct tcgctcaccg gggcattaga 3997741 tgcgacgttg ctggcgctgg atggacgcct tgcccgcaca gcccggccca ggtgcaggat 3997801 cgaggggctt ggtacctgat cacgggagac atctggggta tcggcggaga gtgcctagcg 3997861 ttctgggcat tctggcggat tgcgcatatt cttccgcgcg tcgtcatagc ctaatcggac 3997921 tacgcggatc gtgccgatca ccctggtgcg gcggcggcgc cagtaacgag gaggtcaaca 3997981 tggctcattt ttcggtgttg ccgccggaga tcaactcgtt gcggatgtac ctgggtgccg 3998041 gttcggcgcc gatgcttcag gcggcggcgg cctgggacgg gctggccgcg gagttgggaa 3998101 ccgccgcgtc gtcgttctcc tcggtgacca cggggttaac cgggcaggcg tggcagggcc 3998161 cggcgtcggc ggcgatggcc gccgcggcgg cgccgtatgc gggctttttg accacagcct 3998221 cggctcaagc ccagctggct gccgggcagg ctaaggcggt ggccagcgtg ttcgaggccg 3998281 ccaaggccgc gatcgtgcct ccggccgcgg tggcggccaa ccgtgaggcg ttcttggcgt 3998341 tgattcggtc gaattggctg gggctcaacg cgccgtggat cgccgccgtt gaaagccttt 3998401 acgaggaata ctgggccgct gatgtggcgg cgatgaccgg ctatcacgcc ggggcctcgc 3998461 aggccgccgc gcagttgccg ttgccggccg gcctgcaaca gttcctcaac accctgccca 3998521 atctgggcat cggcaaccag ggcaacgcca acctcggcgg cggcaacacc ggcagcggca 3998581 acatcggcaa cggaaacaaa ggcagctcca acctcggcgg cggcaacatc ggcaataaca 3998641 acatcggcag cggcaaccga ggcagcgaca acttcggcgc cggcaacgtc ggcaccggaa 3998701 acatcggctt cggcaaccag ggccccatag acgttaacct cttggcgacg ccgggccaga 3998761 acaacgtggg cctgggcaac atcggcaaca acaacatggg cttcggcaac accggcgacg 3998821 ccaacaccgg cggcggcaac accggcaacg gcaacatcgg tggcggcaac accggcaaca 3998881 acaacttcgg cttcggcaac accggcaaca acaacatcgg aatcgggctc accggcaaca 3998941 atcagatggg catcaacctg gccgggctgc tgaactccgg cagcggcaat atcggcatcg 3999001 gcaactccgg caccaacaac atcggcttgt tcaactccgg cagcggcaac atcggcgtct 3999061 tcaacaccgg agccaatacc ctggtgcctg gcgacctcaa caacctgggc gtcgggaatt 3999121 ccggcaacgc caacatcggc ttcgggaacg cgggcgttct caacaccggc ttcgggaacg 3999181 cgagcatcct caacaccggc ttggggaacg cgggtgaatt aaacaccggc ttcggaaacg 3999241 cgggcttcgt caacacgggg tttgacaact ccggcaacgt caacaccggc aatgggaact 3999301 cgggcaacat caacaccggc tcgtggaatg cgggcaatgt gaacaccggt ttcgggatca 3999361 ttaccgacag cggcctgacc aactcgggct tcggcaacac cggcaccgac gtctcgggct 3999421 tcttcaacac ccccaccggc cccttagccg tcgacgtctc cgggttcttc aacacggcca 3999481 gcgggggcac tgtcatcaac ggccagacct cgggcattgg caacatcggc gtcccgggca 3999541 ccctctttgg ctccgtccgg agcggcttga acacgggcct gtttaacatg ggcaccgcca 3999601 tatcggggtt gttcaacctg cgccagctgt tggggtagcg cgacactcac gggtgctggc 3999661 aggataccga aatcacctca ccagtcaggt aactcgagta gtcgctggcc agaaacgcga 3999721 tggtggccgc cacctcccag ggctcggcgg cccggccgaa cgcctcgccg gccgccagcc 3999781 ggtccagcag ctcggccgag gcggtcttgt ccaggaactt gtgccgggcg atgctgggcg 3999841 agacggcgtt gatccgcacc ccatactcgg cggcttcgat tgcgctgcac cgggtcaacg 3999901 ccatcacccc ggccttggcg gcggcatagt gcgactgcga atgctgggcc cgccagccca 3999961 gcacgctggc gttgttgacg atcaccccgc catgcggcgc gtcgcggaag tagcgcaatg 4000021 cggcccgggt ggcccggaac accgacgtca ggctcacgtc taacacgcgg tcccactcgt 4000081 cgtcggtcat gtcggccacc ggcgtctgcc cgcccagccc ggcgttgttg accagcacgt 4000141 cgagccggcc catccgggcg gtggtcgagt cgatcagcgc gtcgacctgg gcggtggacg 4000201 tcacgtcgca caccacatgc tccacccggc ccagccccag cgcagacaac tcggcggccg 4000261 tctcccccag ccgtcgttca tggtggtccg agatcaccac gtcggcgccc tccgccaagg 4000321 ctcgccgcgc ggtggccgaa ccgatgccgg tgcccgcagc cgccgtcacg acgaccacct 4000381 tgccatccag aagtccatgt ccggcaatct ctttcggcgc tacggacagg ttcatccctt 4000441 ggcctcccgg ggcagaccga gcacccgctc ggcgatgatg ttgcgctgga tctcgttgga 4000501 tcctccgtag atggtgtcgg cgcgggtgaa tagatatagc cgctgccact cgtcgaactc 4000561 gccgtcgggc atggtcattc cgggtttacc gatcacgtcc atggccagct cacccaggtt 4000621 gcgatgccag ttggcccaca acaactttga cacattgtcc tggccgggct gctcaacggc 4000681 tggcccttcc atggtggcca aagcatagga gcgcatggcg cgcagcccgg tccacgcccg 4000741 ggtcagccgc tcccggatca gcgggtcatc cgcggcggcg gtgcgccgcg ccagctcgac 4000801 cagattggaa agctcacggg cgtagacgat ctgctgaccc agcgtcgaga cgccgcgctc 4000861 gaaggtcagc gtcgccatcg cgacccgcca gccgtcgccc ggtgcgccga ccaccaggtc 4000921 ggcgtcggtg cgggcgtcgt cgaagaacac ctcgttgaac tccgcggtgc cggtgatctg 4000981 cacgatcggc cggatctgca cgccgggctg gtccagcggc accagcagat acgacaggcc 4001041 ggcgtggcgc tgcgagccct tctcggtgcg tgcgagcaca aagcaccatt gcgacaggtg 4001101 cgccagcgac gtccacacct tctggccgtt tatcacccac tggtcgccgt cgagttctgc 4001161 ggtggtcgca acgctggcca ggtcgctgcc agcgccgggc tccgaatatc cctgacacca 4001221 cagctcggtg acgtcgcgga tgcgcggcag gaagcgccgc tgctgctgcg gcgttccgaa 4001281 cgcgatcagc gtcggaccca gcagttcctc gccgaagtgg ttgaccttgt ccggcgcgtc 4001341 ggcgcgggcg tattcctcgt agaacgccac ccggtgcgcg gtcgagagcc cccgcccgcc 4001401 gtgttcttcc ggccagccca ggcaggtcag ccccgcggcg gccaggcgct gattccacgc 4001461 ccggcgttcc tcgaacgctt cgtgctcgcg ccccggcccg ccgaggccct taagtgccgc 4001521 gaattcgccg gccagattgt cggcgagcca accgcggacc tgcgcccgga actcctcgac 4001581 gtcctgcatg ccctgtaggc taacctacca agcacttgct ttgttaggag cgtccgttga 4001641 taaacgatct gcgcaccgtg cccgcggcgc tggatcgtct cgtgcgccag ctacccgacc 4001701 acacggcgtt gatcgccgag gaccggcgtt tcacgtcgac cgagctgcgc gacgcggtct 4001761 acggcgccgc ggcggcgctg atcgccctcg gtgtcgaacc cgcagaccgg gtggccatct 4001821 ggtcgccgaa cacctggcac tgggtggtgg cctgcctggc gatccaccac gccggcgccg 4001881 cggtggtgcc gttgaacacc cgctacaccg ccacagaagc caccgacatc ttggaccgag 4001941 ccggcgcgcc ggtgctgttc gcggcgggcc tcttcctggg cgccgaccgg gcggccggcc 4002001 tggaccgggc cgcgctgccc gcgttgcggc acgtcgtgcg ggtgccggtc gaagccgacg 4002061 acgggacctg ggacgagttc atcgccacgg gtgccggggc cctggatgcc gtcgcagccc 4002121 gtgccgccgc cgtcgcaccc caggacgtca gcgacatcct gttcacctcc ggcaccaccg 4002181 gccgcagcaa aggcgtgctg tgcgcgcacc ggcagtcgct gtcggcctcg gcatcctggg 4002241 ccgccaacgg gaagatcacc agcgacgacc gctacctgtg catcaacccg ttcttccaca 4002301 acttcggcta caaggccggc atcctggcct gcctgcagac cggtgccacg ctgatcccgc 4002361 acgtgacgtt cgatccgctg cacgcgctgc gggccatcga gcgccaccgc atcaccgtgt 4002421 tgccgggccc tccgaccatc taccagagcc tgctggatca cccggcccgc aaagacttcg 4002481 acctgagctc gctgcggttc gcggtcaccg gtgcggccac cgtgccggtg gtgctggtgg 4002541 agcgcatgca gtccgaactt gacatcgaca tcgtgctgac cgcctacggg ttgaccgagg 4002601 ccaacgggat ggggacgatg tgccgccccg aggacgacgc ggtgaccgtt gcgacgacgt 4002661 gcgggcggcc gttcgccgac tttgagttgc gcattgcgga cgacggggaa gtgttgctgc 4002721 gcgggccgaa cgtcatggtg ggctatctgg acgacacgga ggcgaccgcg gccgccatcg 4002781 acgccgacgg ctggctgcac accggcgaca tcggtgccgt cgaccaggcg ggcaacctgc 4002841 gcatcaccga ccgcctgaag gacatgtaca tctgcggcgg attcaacgtc tatcccgccg 4002901 aggtcgagca ggtgctggcc cggatggacg gcgtcgcgga cgccgcggtg atcggcgttc 4002961 ccgaccagcg gctgggcgag gtcggccggg cgttcgtggt ggcgcgcccc ggcacgggcc 4003021 tcgacgaggc atcggtgatc gcttacaccc gtgaacattt ggcgaacttc aagacacccc 4003081 ggtcggtgcg gttcgtcgac gtactgccgc gcaacgccgc cggtaaggtg agcaaaccac 4003141 aactgcgaga gctgggctag atggacctga atttcgacga cgagaccctg gcctttcagg 4003201 ccgaggtgcg cgagttcctc gccgccaatg ccgcatcgat cccgacgaag tcctacgaca 4003261 atgcggaagg ctttgcgcaa caccgttatt gggaccgagt actgttcgac gcgggcctgt 4003321 cggtgatcac ctggccggct aagtatggtg gccgggacgc gccgctgctg cactggatcg 4003381 tgttcgagga ggagtacttt cgcgccggcg ccccgggccg ggccagcgcc aacggcacct 4003441 cgatgctggc gccgacgctg ttcgcgcacg gcacagccga acagcttgac cggatcctgc 4003501 cgaaaatggc tagcggcgaa cagatctggg cgcaggcctg gtcggagccg gaatccggca 4003561 gcgacctggc gtcgctgcgc tccaccgcga gcaaggtcga cggcggctgg ctactcaacg 4003621 ggcagaagat ctggagctcg cgggcgccgt tcgccgacat gggttttggg ctgttccgct 4003681 ccgatcccgc ggtcgaacgg caccgcgggc tcacgtattt catgttcgac ctgaaagcca 4003741 agggtgttac cgtgcgccca atcgcccaac tgggcggcga caccggtttc ggtgagatct 4003801 ttctcgacga cgtgttcgtc cccgaccggg atgtgattgg ggcaccgaac gacggatggc 4003861 gcgcggccat gagcacgtca agcaacgagc gcggcatgtc gctgcgcagc ccagcccgct 4003921 tcctggcctc cgccgaacgg ctggtccagc tgtggaagga ccgcggctcg cccccggagt 4003981 tcgccgaccg ggtcgccgac gcctggatca aggcgcaggc ctaccggctg cagaccttcg 4004041 gcacggtgac caggctggcc gccggtggcg aactgggggc ggaatcgtcg gtgaccaagg 4004101 tgttctggtc cgagctggac gtgcacttgc atcagaccgc gctcgacctg cgcggcgccg 4004161 atggggagct ggccggcccg tggaccgagg ggttgctgtt cgccctgggc ggcccgatct 4004221 atgccgggac caacgaaatc cagcgcaaca tcattgccga acggctgctg ggcctgccac 4004281 gcgagaagac gtgaccatgg aattcgcact caacgaacag cagcgcgact tcgcggccag 4004341 catcgacgcg gcgctcggcg ccgccgacct gcccggcgtc gtccgtgctt gggctgccgg 4004401 tgatgtggcg cccggccgca aggtgtggca gcagttggcc aacctgggcg tcaccgcgtt 4004461 gggcgtagcg gagaagttcg acggactggg tgccagtccg gtcgatctgg ttgtcgcgct 4004521 cgaacgtctc gggcgctggt gcgtgcccgg cccggtcacc gaatccattg ccgtggcacc 4004581 gattctgctg gctcatgatg atcaggctga acgcagccat gggctagctt ccggtgagct 4004641 catcgccacc gtggccatgc cgccgcgggt tccgcgcgcc gtcgacgccg acaccgccgg 4004701 gctggtactg ctcgcgggcg atggcagcgt caccgaaggg acgccgggtg attgccaccg 4004761 gtccgtcgac cccagccggc ggctgtatga ggtggcggca tccggccagg cctggcgggc 4004821 cccgaaagac gtagtggcgc gcgcctatga gttcggggcg ctggccaccg ccgcacaact 4004881 ggtcggcgcc gggcaggcgc tgctggaggc cgccgtcaac tacgccaaac agcgcacgca 4004941 gttcggccgg gcgatcggct cgtatcaggc catcaagcac aaactcgccg acgtgcacat 4005001 tgcgatcgag ctggcctgcc ccctggttta cggcgcggcc gtgtcactcg agccgcgcga 4005061 tgtcagcgcc gccaaagccg ccgcgagcga ggcggctctg ctggcggcac gctgggcgtt 4005121 gcagacccac ggcgccatcg ggttcacctg cgagcatgac ctgtcgctgt ggttgttgcg 4005181 ggtgcaggcg ttgcactcgg cctggggtac gccgcaggag catcggcggc gtgtgctgga 4005241 ggcgctatga ccccccctga agaacggcag atgctacggg aaaccgtcgc ctccctggtg 4005301 gctaagcatg ccggcccggc ggcggtgcgc gcagcgatgg cctccgaccg cggctacgac 4005361 gaatcgctgt ggcggctgct atgtgagcag gtcggtgccg ccgcgctggt cattccggag 4005421 gagctgggcg gcgcgggcgg tgaactcgcc gatgccgcga tcgtcgtgca ggagctgggc 4005481 cgggcgctgg tgccttctcc gctgctgggc accacgctgg cggagctggc gctgctggcc 4005541 gcagctaagc cggatgcgca agcactcacg gagcttgccc aaggcagcgc gatcggcgcg 4005601 ctggtgttgg accccgacta cgtggtcaac ggcgacatcg ccgatatcgt cgtcgccgcc 4005661 accagcgggc agctgaccag gtggactcgc tttagcgcgc agcccgtcgc caccatggac 4005721 cccactcgcc ggctggcccg cctgcaatcc gaagagaccg agccgctgtg ccccgatccc 4005781 ggaatcgccg acaccgcagc aatcctgttg gcggccgagc agatcggcgc cgccgaacgc 4005841 tgcctgcagc tgaccgtcga atacgccaag agccgagtgc aattcggccg cccgatcggc 4005901 agtttccagg ccctcaagca tcggatggcc gacctgtatg tgaccatcgc cgcggcccgg 4005961 gccgtcgtcg ccgacgcctg ccacgcgccc acacccacca acgccgccac cgcgcggctg 4006021 gccgccagcg aggcgttgag caccgcggcg gccgagggca tccaactgca cggcggcatc 4006081 gcgatcacct gggaacacga catgcacctg tatttcaaac gagcgcacgg cagtgcacaa 4006141 ttgctcgagt cgccacgaga ggtgctgcgc cgtttggaat ctgaggtgtg ggagtcgccg 4006201 tgacggatcg tgtcgccctg cgtgccggcg ttcccccgtt ctacgtgatg gacgtctggt 4006261 tggcggccgc ggagcgccag cgcacccatg gggatctggt gaatctttcg gcgggccaac 4006321 ccagtgcggg cgctccggaa ccggtgcgtg cggccgcggc cgccgccctg catctcaacc 4006381 agttgggata ctcggtggcg ctgggtattc cggagctgcg cgacgctatc gccgcggatt 4006441 accaacgccg gcatggcatc accgtcgaac ccgatgcggt ggtgatcacc acgggctcct 4006501 cgggcggctt tctgctcgcg tttctggcgt gcttcgacgc cggtgatcgg gtcgcgatgg 4006561 ccagtcccgg ctacccgtgc taccggaata tcctgtcagc gctgggatgt gaggtcgtgg 4006621 agatcccgtg cggaccgcag acccgattcc aaccgaccgc gcagatgctg gccgagatcg 4006681 acccaccgct gcgcggtgtc gtcgtcgcca gcccggccaa cccgaccgga accgtcatcc 4006741 cgcccgaaga actggcggcc atcgcgtcgt ggtgtgacgc atcggatgtc cggttgatca 4006801 gtgatgaggt ctaccacggc ctggtgtacc agggggcacc gcaaaccagc tgcgcctggc 4006861 agacgtcgcg aaacgcggtg gtagtcaaca gcttttccaa gtattacgcg atgacgggct 4006921 ggcggctggg ctggctgctg gtgccgacgg tgctgcgccg cgcggtggac tgcctgaccg 4006981 gcaacttcac catctgcccg ccggtcttgt cgcagatcgc cgcggtgtcc gcgttcaccc 4007041 cggaggcgac cgccgaggcc gacggcaacc tggccagcta cgcgatcaac cgctcgctgt 4007101 tgctggacgg tctgcgtcgc atcggcatcg accggctggc acccaccgac ggcgcattct 4007161 acgtctacgc cgacgtctcg gacttcacca gcgattcgct ggccttctgc tcaaagttgc 4007221 tggccgacac cggtgttgcg atcgcacccg gaatcgattt cgacaccgca cgggggggtt 4007281 cgtttgttcg gatatcgttt gccgggccaa gcggcgacat cgaagaagcc ttacggcgca 4007341 tcggctcctg gctgccgagc caatagctcg tcgatgcgcg tctcgagcgc gccgcgctcg 4007401 ccgatatctg ccacgttgat cccgaaccgt tcgctcaggg tgtcgacaac cgctgccgca 4007461 tcggcaaggc ggatcttctc ggtaccaccg gcacggtgaa cggcaaggtc gcggccagat 4007521 aggttccacc gggcgtcgtc ggtgatcacc gcggcggtca gtcccgtgac gaacttcgat 4007581 gccgggtgtg ttgaggcgta ccagctggcc actttcagat cgatctgcgg gcgggtctgg 4007641 gtggtgaatt cgtacagtgt ctgccatgtg tcccggacca tcgcctgcaa gacaaagccg 4007701 tcgacgcggt cctcgagccg ataaggttcg tgcgttgtcg gctggacggc gccggtttcg 4007761 aggcgaagcg gtgaggtcgg tgtttggccg ccgaatccga cgtcgacgag atagcatccg 4007821 cccgagccgg ggaacgtgac ccccagcagg gtgtgcgtct gcggcggcag gggcgcgtcc 4007881 ggcgcgagct tccagacgac gcgggcggcg aatcggcgca cccgatagcc gagttcggcc 4007941 agcacataac ccatcagccc gttgtgctca aagcagtacc cgcctcggcg ccgaagtacc 4008001 agcttgtcgg ccagcgcctg tggactgagg tcgtcgaccg gcacccccag cagcgggtcg 4008061 aggttctcga acggaatcgt tcgactgtgc acggtcacca gatcctgcag aacatccagg 4008121 gttggatcgg tagcgccgcg atagttgatg cgatcgaagt acgcggtcag atccagtgcc 4008181 atgttgccat tctgacctcg tcgccgcgtc ggaccgaccg cagggtattc gggcgttcgt 4008241 cgcgcagccg gccaactatg tcgcaccgat tgtggtttgc cacatgagtt tctgggtcga 4008301 cggcaaacaa agtgccctcg cagcgacacg tgtcggcggc tacggcaaac tgcccgctga 4008361 cactcaccca tccggtggct gctcgcgcca tctggccgaa tgcccggcgg gtcggaggat 4008421 cggcgccgga cacaacaacg catcatgatg cctgttactg atgctattgc cgacccacgg 4008481 caccggaggc ttgcaggccg gtgtcgactt ggacgacaag gaagccctcg ccgaactgat 4008541 cggcgacaat gctgctcctt gacgtaagcg tctgcatatt cgccatccgc gaggacagct 4008601 gccccaacca cgcgacatac cggacgtggc tcaccagact gcttaccggc gacggcgagc 4008661 agacgcaaaa tcgcccaaca cgcccgcaaa atgggcgatt ttgcgtctgc tcgcgccact 4008721 agagccaggt gtcctgggtg gtggtgatga ggaaagcctc caggtcgtcg cgccagtgcg 4008781 ccggcgtggt cttttccggc tcgatgccgg tgtagtcgcc gcgatagaac agcagcggcc 4008841 gcggcttgac cgccgggacc tctgacagtg actcgacggc accgaacacc acgaagtgat 4008901 cgccgccgtc gtgcaccgac gccaccgtgc agtcaatgta ggccagcgat ccctcgatga 4008961 tcggtgagcc tagttccgaa gggcgccaat cgataccggc gaacttgtcc ggctccttcg 4009021 agccgaatcg cgccgagacg tctttctgct tttcggtcag tacgttgacg cagaaccggc 4009081 cgctggcctc gatggcctgc caggaccgcg acaccttagt ggggcagaac agcaccaacg 4009141 gcggttccaa cgacagcgcc gcgaacgact ggcacgcaaa cccgacgggc acgtcgtcgt 4009201 gcacagtggt gatgacagtg atccccgtac agaactgacc gagcacggag cggaacgtgc 4009261 gtggatcgat ctgagccgac atcgtttgct ttcgagctag ccgcgagcgc ctacggtgaa 4009321 atcgtgaccc cataggctga ccgcagtact ctcccgggcg atccagtccc gatcgtcgac 4009381 ttgcctgccc tcacaaccga attcgatgtc gaagccaccg ggcgtcttca tgtagaacga 4009441 cagcatcagg tcgttgacat gccggcccag ggtggccgac atcggcacct tgcgccgcaa 4009501 cgcccggtcc aggcacaggc ccacgtcgtc ggcctgctcg acctcgacca tcaggtgcac 4009561 gatgccgctg gacgtcggca tcggcaggaa ggccaacgag tggtgacgcg ggttacagcc 4009621 gaagaaacgc agccaggctg gcggcccgtc ggcgggccgc cctaccatct gtggcggtag 4009681 ccgcatcgag tcacgcagcc gaaagccgag cacgtctcgg tagaaatgca acgcctcagc 4009741 atcgtcgcgg gtggacagca ccacatggcc cataccctgc tcaccggtga cgaacctgtg 4009801 cccatacggg ctgaccactc ggcggtgttc cagcgcggta ccgtggaaga cctccaggca 4009861 attgccggaa gggtcggcaa accggatcat ctcgtccacc cggcgatcgg ccagctcggc 4009921 ggcggtggcc tctttgtacg gcgtgccctc caaatccagg cggttccgga tttcctgcag 4009981 gccttcggca ttcgcgcatt cccaaccggc ctccaacagc ctgtcgtgct caccgggcac 4010041 gaccaccagc cgggccggaa agtcatccat ccgcagatac agggcccctt ctggggcccc 4010101 tttgccctcg accatgccca ggaccttcag tccatactcc cgccaggcag ccatgtcagt 4010161 ggcctcgatg cgcagatagc ccagcgaccg gatgctcatc tgccacctcc cagaaattca 4010221 atcgtcagct tgttgaactc gtcgaacttc tccacctgca cccaatgccc acactgcccg 4010281 aatacgtgca gctgcgcacg cggaatcgtt ttcaacgcaa ccagcgcgcc gtccagcggg 4010341 ttgacccggt cctcacgacc ccagatcagc aacaccggct ggcgcagccg atacacctcg 4010401 cgccacatca tgccggcctc gaagtcggct ccggcgaacg actttcccat cgcccgtgtt 4010461 gccgtcaacg actccggggt gctggccagc gcaaaccgct gatccaccaa ctcgggggtg 4010521 atcaggttct tgtcgtagac catgacccgc aggaacgcct cgaggttctc ccgggtgggc 4010581 gcaacggaga acttcgacag ccgtttgact ccctcggtcg ggtcgggcgc aaacaggttg 4010641 atactcaggc cccccgggcc catcagcact aaccgtcctg cccgggccgg gtagtccagc 4010701 gcaaaccgga ccgcggttcc cccgcccaac gagttgccca ccagcggtac ccgccccagc 4010761 cccagctgat cgaagagccc cttcagcgcc atcgcggcat agcgattgaa ctggccgtgc 4010821 tcggcccgct tgtcggaatg gccgtaaccg ggctggtcga cggccagcac atgaaagtgc 4010881 cgcgccagca ccgcgatatt acgcgagaag ttcgtccagc tcgccgcgcc gggcccaccg 4010941 ccgtgcagta gcaccaccgt ctggtcgttg cccacgccgg cctcgtggta gtgcagtttc 4011001 agcggcccgt cgacgtccac ttccgcaaag cgcgaggtgg attcgaacgt caattcctcg 4011061 gtagctgtca tttcgcctag accagctaga ccatggtgtc gccgggcggc aacccgaact 4011121 cgtggtttcc aaagatcacg tatgcccgct cggggtcgtt ggcggcgtgc acccgaccgg 4011181 cgtgcgcgtc gcgccagaac cgttgaatcg gagcctcatt ggacaacgcg gtggcaccgg 4011241 acgcctcgaa cagccggtcg atcgaggcga ttgagcgacc ggtggcgcgc acctggtcgc 4011301 ggcgcgcacg ggcgcgcagt tcgaacggaa tctccttgcc ggcagccagc agcgcgtatt 4011361 cgtcgctcac attaccgatc agttggcgcc acgcggcgtc gatgtcgctg gccgcctcgg 4011421 cgatacggac cttggcaaac gggtcgtctt tggccttttc cccggcgaac gccgcgcgca 4011481 cccgcttgcc ctggtgctcg acgtgcgcgg cgtaggcacc gtaggccatg ccgacaatcg 4011541 gcgccgaaat cgtagtggga tgcattgtgc cccatggcat tttatagaca ggtgcgctgt 4011601 tggtcgccag ccctcccgcg gtgtggtcgt tcatcgcctt gtacgacaag aaccggtgcc 4011661 ggggcacaaa gacatccttg accaccaggg tgttgctgcc ggtaccacgt aagccgacca 4011721 cgtaccacac gtccttgatc tcgtattcgc tgcgcgggat caggaaactg ccgaagtcca 4011781 ccggccggcc gtccttgatg accgggccgc cgacgaacgt ccagctggca tggtcgcagc 4011841 ccgaggacca gttccacgac ccgttgacca ggtagccacc gtcgaccacc acgcccgccc 4011901 ccatcggtgc gtacgaggac gagatccgcg tactcgggtc ctcgccccag acctcctctt 4011961 gggcccgttg gtcgaacagc gccagatgcc agttgtgcac gccgacgatt gagctcaccc 4012021 acccggtgga accacacacg ctcgccagtc gacgcgtcgc ctcgaagaac agcgcagggt 4012081 cgcactgcag tccgccccac tgctgcggct gcaacagggt gaagaagccg acgtcgtcga 4012141 gcgccttgac ggtctcgtcg ggcagccgcc gcagatcctc cgtggcctgg gcgcgatccc 4012201 gaatctccgg cagcagatta tcgatggcag ccaagacaga ctgagcatca cgctgttgaa 4012261 tggacgtcac ttacttttgc ctctccgggt tgcgaactta gagaaagact agaacacgtt 4012321 ccgatttgtg tcgagctagg tattcctgcg gcaggtagcg ataccaaatg ggttttctgt 4012381 aacatgttct agttatgacg gaagagagga cgggtcttga ccgaggcaat tggagacgag 4012441 ccactcggcg accacgtcct tgaactgcag atcgccgagg tcgtcgacga aaccgacgag 4012501 gcgcgatcgc tggtcttcgc ggtgcccgac ggatcggacg acccggagat cccccctcgg 4012561 cgcctgcgtt acgcccccgg ccaattcttg acgctgcgcg tgcccagcga gcgtaccggt 4012621 tcggtggcgc gctgctactc gttgtgcagt tcgccctaca ccgacgacgc cttggcggtc 4012681 acggtcaaac gaaccgccga cgggtacgcc tccaactggt tgtgcgatca cgcgcaggtg 4012741 ggcatgcgca tccacgtgct ggccccgtcg ggcaacttcg tccccacaac cctcgacgcc 4012801 gatttcctcc tgctggcagc gggtagcggc atcaccccga tcatgtcgat ctgcaaatcg 4012861 gcgcttgccg agggcggtgg acaggtgacg ctgctctacg ccaaccgcga cgaccgctcg 4012921 gtcatcttcg gagacgcgct gcgcgagttg gcggcgaagt atcccgaccg gctcacggtg 4012981 ctgcactggc tagagtcgct gcaggggctg ccgagcgcga gcgcgctggc caagctcgtc 4013041 gcgccctaca ccgaccggcc ggtgttcatc tgtgggcccg gcccgttcat gcaggcggcc 4013101 cgggacgccc tggcggcgct gaaagtgccc gcccaacagg tgcacatcga ggtgttcaag 4013161 tcgctggaat cggatccgtt cgcggccgtc aaggtcgacg acagcggtga cgaggcgccg 4013221 gcgaccgcgg tggtggaact cgacggccaa acccacaccg tctcctggcc gcgcaccgcc 4013281 aagctgctcg acgtgctgct ggccgcgggc ctggacgcgc cgttctcctg ccgggaaggc 4013341 cactgcggtg cgtgtgcgtg caccctgcgc gccggcaaag tgaatatggg agtcaacgac 4013401 gtgctcgagc agcaggatct cgatgaggga ctgattttgg cctgtcaatc tcgcccggaa 4013461 tctgattcgg tggaagtgac ctacgacgag tagtcccgga agggagcgag atgacgcggc 4013521 tgataccggg ttgcacgctc gtcgggctga tgctgacgtt actgcccgcg cccacctcgg 4013581 cggccgggag caacaccgcc accaccctgt tcccggtcga cgaggtcacc cagctggaga 4013641 cgcacacctt cctcgattgc caccccaacg gcagctgcga cttcgtcgct ggagcaaatc 4013701 tgcgcacacc cgacggcccg acgggctttc cgcccgggct gtgggcgcgc caaaccaccg 4013761 agatccgttc gacgaaccgg ttggcctatc tggacgcgca cgccaccagc cagttcgaac 4013821 gggtaatgaa ggcgggcgga tccgacgtga tcaccaccgt ctacttcggc gagggtccgc 4013881 cggacaaata ccagaccacc ggggtcatcg actcgaccaa ttggtcgacc ggtcaaccga 4013941 tgaccgacgt caacgtcatc gtgtgtacac acatgcaggt ggtctacccg ggggtcaacc 4014001 tcacctcgcc cagcacctgc gcgcaagcca acttttccta gctaggactc gtcctggtac 4014061 tcgctgagcc ggtaaatcaa cgcggcagac ccagcagccg ttcggcggcc accgtcaaca 4014121 ggatctgctc ggtaccgccg gctatcgtca ggcaccgggt gttgaggaag tcgtacaccg 4014181 cgcggttctc gacgagcccg cccccgtcgg acacctccat caggtattcg gccagcgcct 4014241 gtcggtagcg cacgccgatc agtttgcgga cgctggattg cgcccccgga tcctggccgc 4014301 cgacggccaa ctcggcgatc cgccggtcca acagcgcacc ggcctgagcc agcaggatca 4014361 gcctgcccag ccgatcttgc tgcgcgacat cgagttccat gtcacccaag accttgagca 4014421 gctcttccat cgggttgccc agcgcggtcc cggtggccat cgcgacccgc tcgttggcta 4014481 gcgtggtgcg cgccagccgc cagccgtcgt tcacggcgcc gacgaccatc tcgtcgggga 4014541 cgaacacatt gtccaggaag acctcgttga acagcgagtc gccggtgatc tcgcgcagcg 4014601 gtcggatctc aattcccggt gtggtcatgt ccaccaggaa gtaggtaatg cccttgtgct 4014661 tcggagcatc cgggtcggtg cgcgccaggc acacccccca ccgagccttg tgagccgccg 4014721 acgtccacac cttctgtccg gtgagcagcc agccgccgtc agcccgcacc gccttggtac 4014781 gcagcgacgc caggtccgaa ccggcccccg gctcggaaaa tagctgacac caaaggaatt 4014841 caccgcgcat ggtggccggg acgaaacgtt cgatctgttc cggcgtgccg tgttcaagga 4014901 tggtcggcgc cgcccaccag ccgatcacca ggtccgggcg ctcaaccttg gccgcggcca 4014961 gttcctgatc gatcagcagt tgctcggccg gggacgcgcc gcgcccgtac ggcgccggcc 4015021 agtgcggcgc cagcaggccg gtgtccgcca gcgccacctg acgtttctcc tcgggcaacg 4015081 cggccacctc ggcgaccgcc gccgcgatct ccggtcgcag gccggccacc tcggccaggt 4015141 cgacgcccaa gcgacgacgg acaccggcct gggtcagcgc cgtaacccga cgcagccagc 4015201 gcccggatcc accgaggaac ccaccgattc cgtgggcccg gcgcagatac aaatgcgcgt 4015261 cgtgctccca ggtgcagccg ataccaccga gcacctggat acagtccttg gcgttggctt 4015321 tggcggcgtc gatgccgatg ctcgcggcca ccgccgcggc gatcgaaagt tgggtgccat 4015381 cggaatcggc tgcggcgcgg gccgcatcgg cggcggccac atcggcctgc tcggcacggc 4015441 acaacatctg agcacacagg tgcttgacag cctggaagct gccgatcggc ttgccgaatt 4015501 gctcccgcac cttggcgtag gcaaccgcgg tatcgagcgt ccatcgagcc accccggccg 4015561 cctcggccgc cagcacggta gcggccaggt cttccacccg ctcccccgac acctccagaa 4015621 cggtgaccgg tgccgatgtc agcaccatcc gggccagcgg cagcgaaaag tcggtggccc 4015681 gcagcggctc cactacgacc tcgtcgcaag cagtgtccac cagcagccaa ttcccgtcgg 4015741 ccggcaacag cacgacgccg ccgggcgcgc caccaagcac tcggccgacg gtgcccgacg 4015801 cggtcgacgt cttcgggtcg acctgcacgc caccgtcgat agccaccccg gcgaaccgtt 4015861 cacccgacgc tagcgcgctg cgcagcttgg gatcggagac aaccaaagtg gccaccgcgg 4015921 tggtcgcgac cggccccggt accaacgccc tggccgcctc gtcgaccatc gcacacaggt 4015981 cctcgatgct gccgccagct ccgccacaat cctctgggac ggcgacaccg aagaggccca 4016041 ggcccgccag cccggcgaac accggccgcc atgcgtccgc atttccttct tcgaagccgt 4016101 attccatgtc gcggaccgcc gcagtcgcgg ccgcacctga ggccgcggtg cgggcccagc 4016161 cgcgcaccaa ctcacgagcc gcggattgtt cgtcggtgac ggtcgctacc acctgcagac 4016221 ctccgcgtcg acaatttcac atagcaatgg agcgttcttg cccactagaa cgtgttctaa 4016281 tagtgctaac gatcaaccgt caagtcgaag gcaataactc cagcacatgt cgtcgtctcg 4016341 gctgtcggga ggtgggaaat ctacacacag catgcgtatc gtttgcaaac gaaccgcccg 4016401 gaagaggagc tgcccgctac atgtcgtcag cgaacacgaa caccagtagc gctcccgacg 4016461 caccacctcg cgcggtcatg aaagtggcgg tacttgccga gtccgagctc ggatcggagg 4016521 cacagcggga gcgccgcaag cgcatcttgg acgccaccat ggctatcgcg tccaaaggcg 4016581 gctatgaggc ggtgcagatg cgcgccgtcg ccgaccgcgc cgacgtcgcg gttggcacgc 4016641 tgtaccggta cttcccgtcg aaggtgcatc tgctggtgtc ggcgctgggt cgggaattca 4016701 gccgcatcga cgcgaaaacc gaccgctccg cggtcgccgg ggccaccccc ttccagcggc 4016761 tgaactttat ggtcggcaag ctcaaccgcg cgatgcaacg caatccgcta ctcaccgagg 4016821 ccatgacacg tgcctacgtg ttcgccgacg cctcggcggc cagcgaggtc gaccaggtcg 4016881 aaaagctcat cgacagtatg ttcgcgcgtg caatggccaa cggcgaacca accgaggacc 4016941 agtaccacat agcgcgggtg atctcggacg tgtggttgtc gaacctgctc gcgtggctta 4017001 cccgacgagc ctcggctacc gacgtcagca agcggctgga cctggccgtg cggctgctga 4017061 tcggcgatca agacagcgcc tagaagactt acgccggcgg acccgcggtg cggccccgga 4017121 ccagctcggt atcgagcacc tcgatgacgg gcagtccgga ccgcggcggc ttcagcaata 4017181 gctcgcccgc ccggtgcccc ttgtgcagac tcggctgcgc gaccgtggtc agcccccggc 4017241 tcagcgcctc tggcactccg tcaaaccctg tgacggtcat ctgcccgggc acgtaaatcc 4017301 cgtgcgcccg aaggtaatcc atagctgaga gcgccaagat gtccgctgtg cacatcagcg 4017361 cggtcagccg cggattggcc tgcagagcca ccttggcggc agtgccgccg gacgtcggca 4017421 aatgctcgta gctttccacc acggtcagcg agtccgggtc gacgccggcg gccgtcatcg 4017481 cctcccatac gccgacgatg cgttcgcgct gtacgtcgaa ggtcggcgac cgcagccgct 4017541 cggcgtccac caagtcttgc cgccgatccc gtcccagccg catggtcagc aggccgagct 4017601 cgcgatgccc caacccgagt acgtagccgg caagctcacg catcgccgcc cggtcgtcga 4017661 tgccgacccg ggacactccg gagaggtctt tgggctggtc gaccaccacc accggcagcc 4017721 gccgctgcag cacgacctgc aggtagggat cgtcgtcgcc taccgaatac accacgaagc 4017781 cgtccacccc agcgccgagc acggcagctg tgccgtccgc aaggctccga ctggagccga 4017841 cggaaaccag ctgcaggccc tgccccagct cttcgcacga ctgcgccact cccgcaacaa 4017901 aatcccgcgc ggccgggtcg ctgaagaaat aggtcagcgg ttcggccatc accaaaccga 4017961 ccgcaccggc tttgcgggtc cgcaacgatc gcgccaccgg atccggtccg gcatagccca 4018021 gtcgcttggc cgtggcaagc actcgttcac gtagatcggc ggagagctga tccggtcggt 4018081 taaaagcatt cgagacagtg gtgcgggaca ccttgagctc ggctgctaac gacgccagag 4018141 tcgcccgcct ccgcggtgtg ggactcacgt tcggtgaggg tacagcggac cctcgagcac 4018201 gcaatatcgt gggccggctg gcaaccgtcg gtttcgacgt tggtgacgac ccctcgttca 4018261 tgaatcgttc ttgagctccc cgttttgctg gatgcccagg caccgccggt actgctgcgc 4018321 ttaagcttgt cgcacatggt gccggcaggg aggaacagtg ggcaagcagc tagccgcgct 4018381 cgccgcgctg gtcggtgcgt gcatgctcgc agccggatgc accaacgtgg tcgacgggac 4018441 cgccgtggct gccgacaaat ccggaccact gcatcaggat ccgataccgg tttcagcgct 4018501 tgaagggctg cttctcgact tgagccagat caatgccgcg ctgggtgcga catcgatgaa 4018561 ggtgtggttc aacgccaagg caatgtggga ctggagcaag agcgtggccg acaagaattg 4018621 cctggctatc gacggtccag cacaggaaaa ggtctatgcc ggcaccgggt ggaccgctat 4018681 gcgcggccaa cggctggatg acagcatcga tgactccaag aaacgcgacc actacgccat 4018741 tcaagcggtc gtcggcttcc cgaccgcaca tgatgccgag gagttctaca gctcctcggt 4018801 gcaaagctgg agcagctgct cgaaccgccg gtttgtcgaa gtcacccccg gacaggacga 4018861 cgccgcctgg actgtggctg acgttgtcaa cgacaacggc atgctcagta gctcgcaggt 4018921 tcaggaaggc ggcgacggat ggacctgcca gcgtgccctg actgcgcgca acaacgtcac 4018981 tatcgacatt gtcacgtgcg cctatagcca accggatttg gtggcgattg gcatcgctaa 4019041 ccaaatcgcg gccaaggttg ctaagcagta ggcatggccg acggtcccct tgccatcacg 4019101 gcgaaatcgg tttacataca tggctattcg gtagatacgg cagagattcc aacagctgtg 4019161 cgtggccacc cgaatgccgc gggaaccgcg atcaaggacc gccgctgatg cggccgaaac 4019221 ttgggcgtcc caatatcgcg cggtattcca acaggtttag cgtgcctacc gccagatccg 4019281 atgctccgtt gtcggtgacc tggatgggcg ttgcgacgct gctggtcgac gacggatcgt 4019341 cggccctgat gactgatggc tacttttccc ggcccggcct ggcacgggtg gcggcgggta 4019401 aagtgtcgcc gtcagcggag cgggtcgacg gttgccttgc ccgggccaat gtctcccggc 4019461 tgacggccgt tatcccggtg cacactcaca tcgaccacgc gatggattcc gcgctggtcg 4019521 ccgaccgtac cggagcccag ctggtcgggg gggagtcggc ggccaatgtc gggcgcggat 4019581 acgggttgcc tgaggagtct cttgtcgtcg ccgtcccagg tgaaccaatc cagttgggcg 4019641 ccttcgacgt gacgttggtg gagtcgcatc actgcccacc cgaccggttt cccggtgtga 4019701 tcagcgcacc actgacaccg ccggtgaagg cgtcggccta ccgctgcggt gaggcgtggt 4019761 cgacgctggt gcaccaccgg ccatcggggc gccggctgtt aatccaggac agcgccggtt 4019821 tcgtcagcgg cgcactggcc ggttaccgcg ccgatgccgc ctacctcagt gtcggccagc 4019881 tcggcctgca accgccgtca tacctgctcg aatactggac cgagaccgtg cgcacggtgg 4019941 gcgtccgccg cgtgattctc atccactggg acgacttttt tcggccgctg tcaaagccgt 4020001 tgcgggcctt gccatatgcg gccgacgacc tagacctgtc gatccgcatc ctcgacgagc 4020061 tggccgccca ggacggcgtc gcgctgcaga tgccgacggt gtggcgccgc gaggatccct 4020121 ggatgtgaag cgctctagcc cttgacactt gctgttgcgc tgatactgct tgccgtggtc 4020181 ctggggttcg cggttgcccg cccacgcggc tggccggagg cagcggcggc ggttccggca 4020241 gcggtcatcc tgttagcgat cggggcgatc tcgccccagc aggcgatggc gcaggtgtcc 4020301 gggctggcgc gcgtggtcgc gtttctgggt gcggttctgg tgctggctaa gctgtgcgac 4020361 gacgaaggcc tgttcgaggc agccggcgcg gccatggctc gagcgagcgc ggagtcgcac 4020421 cgactgctac ggcaggtgtt cgccgtctcg gccgccatca ccgcggcgct ctgcctggac 4020481 gccaccgtgg tgctgttgac cccggtggtg ctggcgacgg tccgccggct gcggaccccg 4020541 gtgcgcccct atgcctacgc caccgcccac ctagccaacg ccgcttcgct gctgcttccg 4020601 gtgtcgaatc tgaccaacct gctcgcctac cacggtgccg gcatctcgtt caccaagttc 4020661 acgctgctga tggcattgcc ttggctgtcc gccgtggccg cggtctatgt ggtcttccgc 4020721 tggtttttcg cccgggatct acgcgtggtg ccggaccggc agcaactcaa gccggcgccg 4020781 cgcctgccaa tgttcgtgct ggtggtggtg gcgctgacac tcgggggctt cgccgtcgcc 4020841 gagtcggtgg gactggcccc aacgtgggcg gcgctggctg gcgccgcagt gttggcgctg 4020901 cgaagtctgc ggcgtggaca cacttcggtg ctgcggatcg cgcgcgccgt caacgtgtcg 4020961 ttcctggtct ttgtgttggc cctgggtgtc gtggtgcacg cggtcatgct caacggcatg 4021021 gccgccagga tgtccgccgt gctgccgacc gggtccgggt tgcccgcgct gctcggcatc 4021081 gccgcgctgg ccgccgtgct ggccaacgtg gtcaacaacc tgcccgcgac tctggtgtta 4021141 gtgccgctgg tggcggccgg cgggccggcg gccgtgctgg ccgtgctact cggggtcaac 4021201 atcggaccca acctgaccta tgccggttcg ctgtctaacc tgctgtggcg gggcgtgctg 4021261 cgccggcaca acgtcgacgc cagcgtcggc gagtacaccc gactgggact gtgcaccgtg 4021321 cctgcggccc tggcgatggc ggtgctcgcg ctgtgggcca gcgcccaggt tctggggatc 4021381 tagccgcaag ggcgcgagca gacgcagaat cgcatgattt gagctcaaat catgcgattc 4021441 tgcgtctgct cgcgaggctc gcgtggccgc cggcgctggc gggcgatctc ggcgagcacc 4021501 accccagcgg ccaccgaggc gttcagtgat tcggcctgag cggccatcgg gatggacacc 4021561 acctcgtcac agttctgcct taccaaccgg gacaacccct tgccttccga cccaacgacc 4021621 accaccaacg agtcagtgcc atctacatcg tcgagcgcgg tgccgccacc ggcgtccagt 4021681 ccgatcaccc gcactccacg atcggcccag cccttcagcg tcctggtgag attggtggcc 4021741 cgggccaccg gaatccgggc cgccgccccg gcgctggtgc gccacgccac cgcggtcacc 4021801 gacgcagaac ggcgttgcgg aatcagcacc ccatggccac cgaacgcggc caccgaccgc 4021861 acgatcgcac cgaggttgcg cgggtcggaa aggttgtcca aagcgaccag cagcgcaggc 4021921 ggttggtcga gggcggcggc cagcaggtca tcgggatggg cgtagttgta cggtggcacc 4021981 tgtagcgcga tgccttgatg gaggtggttg gcggtcatcc gatccaggtc ggcacgtagc 4022041 agctcgacga tcgcaatccc tgaatcagcc gcccgcgcaa cgcattcagt cagtcgctcg 4022101 tcggcctcgg taccaagggc gacgtatagc gcggtggccg gaacacccgc gcgcaggcat 4022161 tccagcactg ggttgcgacc caacaccgtc tcggtctcgt ccgcgcgctt gaccgggcgg 4022221 cgtggctgtg cacgtgcccg cttggcggcg ggatggtggg gacgcaggtg cgccggcggg 4022281 gtaggcccgc gcccttccag cccacggcgt cgctgaccgc ccgagccgac gcctgcgcct 4022341 ttcttggtac cggatttgcg gaccgcaccc cgccgccgag agttaccggg catctacttg 4022401 gtgtcaccac ccagcagcga ccactgtggc ccgtcggcgg tgtcggtgac ctcgatgccg 4022461 gctctcttca gccgaccccg gatctcgtcg gcgagcgccc agttgcgctg ctcgcgggcc 4022521 ttttcccgat tctgtagttc agcctggacc agcacatcga cggcggccag cgctgccgag 4022581 gtttcgtctc gggattccca gcgctggtcg agcgggtcac agcccaggat gcccatcatc 4022641 gcccgaatcg cgctagcgct tcgcaaggcc ccgtcgtggt cgccggcatc gagtgcccgg 4022701 ttgccttccg cccgcacgtg gtgaatctcg gcgagcgcga tcggaacgga caggtcgtcg 4022761 tcgagcgctt cggcgaaccg tggggtcgga tcgccggggc agacggcgcc cacccgggtg 4022821 cgaacgcggt gcaggaagtc ctctagcccg acataggctt tcaccgcatc ctgcatagcg 4022881 gtctcggaga actcgagcat cgaccggtag tgcgcgctgc ccaggtaata acgcagctca 4022941 gccggccgca cccgctgcaa catcgccggc atggacaaca cgttgcccag cgacttgctc 4023001 atcttctccc cgcccatcgt cacccagcca ttgtgcagcc agtagcgggc gaacccatca 4023061 ccggcggcgc ggctctgggc gatttcgttc tcatgatgcg ggaagactaa atccattcca 4023121 ccgcaatgga tatcgaattc cggcccgaga tagctgcgag ccattgccga gcattccaga 4023181 tgccagcccg gacgcccgcg gccccacggc gtcggccacg acggttcacc cggcttttcg 4023241 cccttccaca aagtgaagtc gcgctggtcc cgcttgccgg cagccacacc ttcgccctga 4023301 tggacgtcat cgatcttgtg accggataac tggccgtact ccgggtagct cagaacgtcg 4023361 aagtaaacgt caccgccacc ggtatacgcg tggccggcct ggatcaggcg ctcgatcatc 4023421 tcgatcatct gggtgatatg cccggtggcg cgcggctccg cggacggcgg caagacgtcc 4023481 agagcgtcgt aggccgcggt gaaggcacgc tcgtgggtag ccgcccactc ccaccacggc 4023541 cggcccgccg cggcggcctt ggccaggatc ttgtcttcga tgtcggtcac gttgcggata 4023601 aacgcgacgt cgtagccacg cgcgagcaac catcggcgca ggatgtcgaa ggcgaccccg 4023661 ctgcggacat gcccgatatg cggtaggccc tgcaccgtgg caccgcacag gtagatcgag 4023721 acgtgtccag gtcgcaacgg gacgaaatcc cgcacgacac cggcggcagt gtcgtgtagc 4023781 cgcaagcgag cccgatcggt cacgacgtgc cagcttacct gcccaattgc tgcaacctgc 4023841 ggcgcgcgcg tccggaccag gagtgcgcta ccgcaacgaa accaccaatg ccgtagcgat 4023901 tgcggccaag ccctcgccgc ggccagtgag gcccagcccg tcggtggtgg tagccgacac 4023961 cgacaccggc gcgttgagca gacgtgacag caccgcctgc gcctcgagcc ggcgccaacc 4024021 gatcttcggt cggttgccga tcacctgcac cacagcgttg ccgacccgat agccatgctg 4024081 ggtgatcagg acgacgacat ggcgcaacat gtcggcacca ctgacaccct gccaacgggg 4024141 atcgtcgacg ccgaacacct cgccaatgtc gcctagcccc gcggccgaca gcaccgcgtc 4024201 gcacagcgca tgaacggcca cgtcaccgtc ggagtggccc gcgcaaccgt cggcgctcgg 4024261 gaacaacaac cctaccagcc agcacggacg tccgggttcg atcggatgca catcggtccc 4024321 caaaccaacg cggggcagct gattcacccg cgcactatag cttgggccag caacagatcc 4024381 agtttggtgg tgatcttgaa cgccagcgga tcgccgtcga ccacctgcac ctggccgccg 4024441 atatgctcga ccagcgacgc gtcatcggtg tactcggcgg ctggaaggtc tagggagccg 4024501 cgctgatatg accgcagcag caggtcggta gtgaaccctt gtggggtctg cacggcccgc 4024561 agcccggctc gttccggcgt gcccaggacc accccgttgg catccacggc cttgatggtg 4024621 tcagaaagcg gcagtacggg aacgacggcg gcataaccgt cccgcaacgc ctcgaccacc 4024681 cgggcgacca gggccggtgg tgtcagtgcc cgcgcggcat catgcacaag cacaaactcc 4024741 ggctccgcgg tcccggacag cactgtcagc gccaggttca cggtgtcagt gcgattcgac 4024801 ccacccgcca caatcatcgc cctgtggccg aggatctgcc tcgcctcgtc cgtacggtcg 4024861 gcgggcacgg ccacaacaac ggtgtcaact acccccgaat ccagcaggcc atcgacggcc 4024921 cgctcaatga gagtctgccc gtcgagctgg taaaacgcct tgggcacacc gacggccaac 4024981 cgctcccccg accccgcagc cgggacgatc gcaactactt cgcccgcttc cctgaccact 4025041 agagcctcag ggcggtcaag acgcggcggc taaaacctcg tcaaggatgg tctcggcttt 4025101 ggcgtcatcg gtgctctcag ccaacgccaa ctcgccgacc agaatctgcc gggccttggc 4025161 cagcatgcgc ttctcaccgg ccgacaagcc acgctcctgg tcgcgacgcc acaaatcgcg 4025221 cactacctcg gccaccttgt tcacatcgcc ggatgcgagt ttctcgaggt tcgccttgta 4025281 acgacgtgac cagttcgtcg gctcctcggt gtgcggggca cgcaacacct ggaaaacctt 4025341 gtccaggcct tcctgcccga cgacatcgcg aacaccgacg tattcggcgt tttcagcggg 4025401 aactcgtact gtcaggtcgc cctgcgcaac tttcaagacg agatactctt tttgttcccc 4025461 tttgatggtc cgggtttcga tcgcctcgac taacgcagca ccgtggtgtg gatagacaac 4025521 ggtgtctccg accttgaaaa tcatctgatt tgagcccctt tcgttactcc atgctaacac 4025581 ggggccctaa cgggcgccga acaacggtgc aggtcagggg catagcgcgg gaagattggg 4025641 ggttgacaga cgggcctaga agtgcatcgc cgaatctggg acgcccctga gaacggggtg 4025701 cccgggctac cgcgccggtc cggtcgacgc cgcggtcccc accgctaccg tcggcggcac 4025761 ctaactacta ctgtgcatag tcgagccgca ggcaccatgc cgcgccaagg ccgagcagga 4025821 ggcatccgag tgaaccgctg caacatccgc ctgcgtcttg ccgggatgac cacctgggtg 4025881 gcgagcatcg ccctgctggc cgccgcactg agcggttgcg gggccggtca gatctcccag 4025941 acagcgaacc agaagccggc cgtcaacggc aatcggctca ccatcaacaa cgtgttgctg 4026001 cgcgacatcc gcatccaggc cgtccaaacc agcgatttca tccagccagg caaagcggtg 4026061 gatctggtgc tggtagccgt caaccaatca cccgacgttt cggaccggct ggtgggcatc 4026121 accagtgata tcggctcggt gacggtggcc ggcgacgctc gactgcccgc atccgggatg 4026181 ctttttgtcg ggacgccgga cggccagatc gtggcgccgg ggcccttgcc atccaatcaa 4026241 gcggccaagg cgaccgttaa cttgaccaag ccgatcgcaa acggcctcac ctacaacttc 4026301 accttcaagt tcgagaaggc cggtcagggc agcgtaatgg tgccgatctc ggccggattg 4026361 gctacgccgc acgaataggc gccgcatcgt cgccagacga gcgactcgct cgggttgtca 4026421 cacccccccg atacggtcac ggcgtggcca acgctcgttc gcagtaccgc tgttcggaat 4026481 gccgccatgt cagcgcgaag tgggtgggac gctgcctgga gtgcggccgc tggggcaccg 4026541 tagacgaggt ggcggtgctc agtgccgtcg gtggcaccag gcgccgttcg gtggcgccgg 4026601 cgtcgggcgc cgttccgatc agtgccgtcg acgcgcatcg gacccgaccc tgcccaaccg 4026661 gcatcgacga actggaccgg gtgctaggtg gcggtatcgt tcccggttcg gtgacactgc 4026721 tggccggcga tcccggagtg ggtaagtcga cgctgttgct cgaggtcgcg caccgctggg 4026781 cccagtccgg acggcgcgcg ctctatgtct ctggtgagga atccgccggt cagatccggc 4026841 tgcgtgccga ccggatcggc tgcggcacgg aggtcgagga gatctacctc gccgcacagt 4026901 ccgacgtgca caccgtgctc gaccagatcg agacggtgca gccggcactg gtcatcgtcg 4026961 actcggtgca gaccatgtcc accagcgagg ccgacggcgt caccggcggg gtcacgcagg 4027021 tccgtgcggt tacggctgcc ctgaccgctg ccgccaaggc caacgaggtc gcattgattc 4027081 tcgtcggcca cgtcacgaag gacggggcca tagccggacc gcgttcgcta gagcacctcg 4027141 tcgacgttgt gctgcatttt gaaggggacc gcaacggtgc gctgcggatg gtccgcgggg 4027201 tcaagaaccg attcggcgcc gccgatgaag tcggatgttt cctcctgcac gacaacggaa 4027261 ttgacggtat cgtcgacccg tcgaacctgt tcctggacca gcggccgaca cccgtcgccg 4027321 gtaccgcgat caccgtgacg ctggacggaa aacggccgct cgtcggggaa gtccaggcat 4027381 tgctggccac accgtgcggc ggctcgccga ggcgggccgt cagcgggatc caccaggccc 4027441 gcgctgcgat gatcgctgct gtgctggaaa agcacgcacg gctggcgatc gccgttaacg 4027501 acatctacct gtccaccgtg ggcggcatgc ggttgaccga gccgtcggcg gatctggcgg 4027561 tcgccatcgc gctcgcctcg gcctatgcaa atctgccgct gcccaccact gccgtcatga 4027621 tcggcgaggt aggtctggcc ggcgacatcc ggcgggtcaa cgggatggcg cggcgcctta 4027681 gcgaagccgc ccgccaaggg ttcaccatcg ccttggtccc gcccagtgac gatccggtgc 4027741 cgcccggtat gcacgcgctg cgcgcatcca ccatcgtcgc ggcgctgcag tacatggtcg 4027801 acattgccga ccaccgcggc accaccctcg caaccccgcc ctcacattcc gggactggac 4027861 acgtcccact agggcgcggt acatagcaga atgcacgctg tgactcgtcc gaccctgcgt 4027921 gaggctgtcg cccgcctagc cccgggcact gggctgcggg acggcctgga gcgtatcctg 4027981 cgcggccgca ctggtgccct gatcgtgctg ggccatgacg agaatgtcga ggccatctgc 4028041 gatggtggct tctccctcga tgtccgctat gcagcaaccc ggctacgcga gctgtgcaag 4028101 atggacggcg ccgtggtgct gtccaccgac ggcagccgca tcgtgcgggc caacgtgcaa 4028161 ctggtaccgg atccgtcgat ccccaccgac gaatcgggga cccggcaccg ctcggccgag 4028221 cgggccgcga tccagaccgg ttacccggtg atctcagtga gccactcgat gaacatcgtg 4028281 accgtctacg tccgcgggga acgtcacgta ttgaccgact cggcaaccat cctgtcgcgg 4028341 gccaaccagg ccatcgcaac cctggagcgg tacaaaacca ggctcgacga ggtcagccgg 4028401 caactgtcca gggcagaaat cgaggacttc gtcacgctgc gcgatgtgat gacggtggtg 4028461 caacgcctcg agctggtccg gcgaatcggg ctggtgatcg actacgacgt ggtcgaactc 4028521 ggcactgatg gtcgtcagct gcggctgcag ctcgacgagt tgctcggcgg caacgacacc 4028581 gcccgggaat tgatcgtgcg cgattaccac gccaacccgg aaccaccgtc cacggggcaa 4028641 atcaatgcca ccctggacga actggacgcc ctgtcggacg gcgacctcct cgatttcacc 4028701 gcgctggcaa aggttttcgg atatccgacg accacggaag cgcaggattc gacgctgagc 4028761 ccgcgtggct accgcgcgat ggccggtatc ccccggctcc agttcgccca tgccgacctg 4028821 ctggtccggg cgttcggaac gttgcagggt ctgctggcgg ccagcgccgg cgatctgcaa 4028881 tcagtggacg gcatcggcgc catgtgggcc cgtcatgtgc gcgaggggtt gtcacagctg 4028941 gcggaatcga ccatcagcga tcaataatta tccgccttgc gcgggagact ccggcggagg 4029001 cgcctgcgct ggacccggag cgggtaccgg cccgggcggc ggcggcggct gattcaggat 4029061 gaacggaacc ggcagcgagc gcagattgcc cagttgtacc acgagattgt aggtgcccgg 4029121 cccgatcgcc ggccgcggca atgggcagcg cggcgccgat cccatcccgg tccaggtcac 4029181 cgcggtcgtt acctgctcac cgggggaaaa cgtcttgacc agcgtctcat tcgagggcgc 4029241 gcagtccagg ttggaccaca accgcttgtt gtccagcgag taaacgtagg cggccaacac 4029301 cgcggcccca acgtcgcgtt tacaggacac caggccgatg ttggtgacca ccatggtgaa 4029361 cttcggctgg tcgccgacgt agtactgcgg cgcgttggtc aaacctttga cggccagcgt 4029421 cgaatcgggg caatcgtccc cttccttgag caccggcggc ggctgcaccg cggcggtggg 4029481 cgtgggtgtc tcggggtttt ggccctgcgg cggggccgcg gcggcgttac cttcggtttg 4029541 cccggccggc tggggtgctt ggggtgccgg cgagcccgga tggctctggg cggaggccgg 4029601 cttgtcggcg ctgaccggtt tggcaccggc gctgctgtcg acgaaggcga tgacgatggc 4029661 caccgcgatc ccgactacga cgaccgcgat gcccagggcc agccccctgc gccgccagta 4029721 gatctcggta ggtagcgggc cacgcggttc cagatccagc acgattacac cgtagggcca 4029781 ggtcacgcaa acgcgcttga cccgcctcgg cgtgtcgccg gcttcgctgg ccgacgccgt 4029841 gttaacggtg gcctgttatc gggcggtaac tcagacctcc tcgccgatgt tgccgatgtg 4029901 gtcgcgcagt acagcccgcc catcgtcgag ttgataggtg acgcccacga tcgccaggct 4029961 gccccctgcg attcgttctg agatggccga tgaacgcgcc atgaggatcg ccaccgtctc 4030021 gtgtacatgt cgttgctcga actcgtcgac acgactcaga ccgtcacggc ggccgagcag 4030081 gaccgacggc gcaacccttt ccacgacgtc tcgcacgtag ccgcctggca gggtgccgtc 4030141 gttgatcgcg gccaaagcgg cgttcacggc gccgcagctg tcgtggccga ggacgacgat 4030201 gagcggcaca ttgagcacgg tcaccgcgta ctctatggag cccagcacgg ccgagtcgat 4030261 gacatgcccg gcggtgcgga ccacgaacat gtcgcccagg ccttggtcga agatgatctc 4030321 agcggccact cggctgtccg cgcagccgaa gatcaccgcc gtgggcttct gcccggcggc 4030381 caagccggct cggtggtcga cgctctgact gggatgctgg ggccggccgg cgacgaatcg 4030441 ctcgttaccc tctttgagtg ctttccacgc ggctaccgga ttggtgttgg gcatgcctca 4030501 catactgccg gaaccgtcgg tgaccggccc gcgacacata tcagatacca atcttctcgc 4030561 ttggtatcag cgatcgcacc gggatctgcc ctggcgagag cccggtgtca gcccgtggca 4030621 gatcctggtc agcgagttca tgctgcagca gacgccggcc gcccgggtgc tggcgatctg 4030681 gccggactgg gtgcggcggt ggcccacgcc gtcggccacc gccacggcca gcaccgccga 4030741 tgtgttacgc gcctggggca agctgggcta tcccaggcga gccaagcgct tacacgagtg 4030801 cgccaccgtc atcgcccgcg accacaatga cgtggtgccc gacgatatcg agatcctggt 4030861 caccctgccg ggcgtcggga gctacaccgc gcgcgcggtg gcgtgtttcg cttaccgcca 4030921 gcgggtgccg gtggtggaca ccaatgtgcg gcgcgtggtg gcccgcgccg ttcacggccg 4030981 cgccgacgcc ggtgcgccat cggtgccgcg cgaccacgcc gacgtcttgg cgctgttgcc 4031041 gcaccgcgag acggcgcctg aattttcggt cgcgctgatg gagttgggtg cgacggtgtg 4031101 caccgcccgc acaccccggt gcgggttatg cccgctggac tggtgcgcat ggcggcatgc 4031161 cggttatccg ccgtcggacg gtccgccgcg ccgggggcag gcctacaccg gaaccgaccg 4031221 ccaagtccgc ggacggttac tggatgtgtt gcgcgccgcg gagtttcccg tcacccgggc 4031281 cgagttggac gtggcgtggc tgaccgatac cgcacagcgt gaccgggcgc tggagtcgct 4031341 gctggccgat gcgctggtga cccggacggt cgatggccgg ttcgcgttgc ccggcgaagg 4031401 gttttagccg ggtaggccgt ccgcaccggc ggcgccgaaa ccgccgggat caccggggtt 4031461 gcccgcgacg actgtcccag ctcccgcggc gccacccgcg ccgccagcgc cgccggcacc 4031521 tccctggccc ccggtaccgc ccgcaccgtg gacacctggc tggctgaaca ttccggcacc 4031581 tccgccggca cctccggcac cgcccttgcc gccgttgccg ccggcgccgc cggcaccacc 4031641 gttgccgccg tcaccgccga ccaggccaga gccgcccttg cctccggcgc cgcccgaggc 4031701 acccgtgccg ccgatgccgc cggcaccgcc ggcgcccccg ttaccgccgt cgccgaacag 4031761 cagcccgccc tgaccgcccg cacccccgac accgccgaca cccccggtgc cggcggtgtt 4031821 ggcgccagcc ccgccggggc cgccgtcgcc tccgctacca aaaaaggtca gcgtgccggt 4031881 ggcgccgccg ccaatgccac cattgccgcc cgcagccccg gtgccgcccc ggcccccggc 4031941 gccaccgttg ccgccgatcc cgttgccacc gtttagcgct aggccgttgc ccccgttgcc 4032001 cccgtcgccg ccccgggcgc cggcgccacc gtcaccgcca ttgcccccgt ttcccccgta 4032061 ggcccagcca gtaccggtat tgacaccgat gccgccgggt gcgccgttgc cgccgggcgc 4032121 gccgggaccg ccgtcgccgc cattgcctcc gttgcccccc gtcacagggc cttcactcgt 4032181 atcgctgccg ctgccgccta aaccgccagc gccgccagcc cgccctggta cggcacccgg 4032241 gttgccgggc agccctgcgc caccgctacc ggcgccgttg ttggcgccgg ggcttccgtt 4032301 tgccgcctgg ctggtctggt tcggcggcgg gttcatcccg ttggttccgg gggcacccac 4032361 cccgccgacg ccgccgtcgc cgccggcgcc gatcagcccc gcgttgccgc cggcaccccc 4032421 attgccgggc aacccgccga tgaccgcggc cccgcccgcc ccgcccacac cgccattgcc 4032481 gaacgcgccg gcggcgccgc cggctcctcc gttgccactg acggtcgttc ccaccccgcc 4032541 gaacccgccg gcaccgccgt tgccgaccag ccagcccgcc ccaccggctc cgcccacacc 4032601 gccggcgccg gcggcgttgc tcccgccggc cccgccattg ccgccgtgcc cgaacagccc 4032661 ggcggcgccg ccgctaccac cgggcccgcc cacaccggcc acacccgacc cgccgttgcc 4032721 gccattaccc cacagcagcc cgccgggccc gcccggctgc ccggttcccg gcgccccgtc 4032781 ggcaccgttg ccgatcaacg ggcgccccag tagcgtctgc gcgggcccgt tgatcaagcc 4032841 gagcacctgc tgctccacgg cttgcagcgg cgacgcgttg gcggcctcgg cgctggcata 4032901 ggagttcact cccgcgttta acgcctgcac gaactgggca tgaaaccccg ccgcctgggc 4032961 gctgagcctc tggtactcct gggcgtgcgc gccgaacagc gccgccatcg ccgccgacac 4033021 ctcgtccgcg gcggccgcta gcacacccgt ggtcggggcg gccgcggcgg cgttggcggc 4033081 attgagcgcc gaaccgatac cggccacctc tgaagccacc gacatcagcg cttccggcgc 4033141 cacgatcaca aacgacatct gacacccctt tccgcggcgc ggcctgacgg cccgatcgta 4033201 gcgcgatcac gggccgacaa aacccgttat ggccaggctt ttcgccacat tgcccgcgcc 4033261 gcgtgggctc acggggtaag ccccgccagg aacgactcca ccgcccgccg gtaaacctgt 4033321 ggagcctcgt catgaaccag atgaccggcg tcgggaacac gcaaatacgc tgtcggataa 4033381 tctctttcag ccatcgcgcg catctggccc gggggagtta ccccatcgcc ggcctcgatg 4033441 agcagcgccg gcgaccgtac ggcccgccac tgcgcccagt agtcacgggt gccccattcg 4033501 gcggcgatct cgatccatcg tgcggtgcgc ccgtgtagcc gccacccggt ggccgtgcgg 4033561 tcgaatgcgt ccaggaagta ccggccggcg acgggcccga actcggcgaa tacctgttcg 4033621 gcagagtcga attcgaccgg aagggcgcgc agccacggct cccatgggcc ggtggtccta 4033681 ccacggaagt ccggcgccat gtcctcgacc accagcgccg aaaccagttc cgggcgctcg 4033741 gcagccagac accacgaatg caaggctccc atcgaatgtc cgaccatcct ggtcggcgcg 4033801 cccagcgccg aaaccgcgtc gcccagatcg gccacgaagc gttcggtgct gatcgggtgt 4033861 ggatcggcga cgtcacgccc gcggtgccag ggcgcgtcgt aggtgtacac ggcgcctaac 4033921 agcgtcagcc acggaagctg acgggcccag gtggaacccc tacccatcaa gccgtgcacc 4033981 aggaccaacg gctcgccccg tccgccgcga tgggttaaca gattcgctgg catgcggggc 4034041 acggtagcct agcggcatgc cagtggtgaa gatcaacgca atcgaggtgc ccgccggcgc 4034101 tggccccgag ctggagaagc ggttcgctca ccgcgcgcac gcggtcgaga actccccggg 4034161 tttcctcggc tttcagctgt tacgtccggt caagggtgaa gaacgctact tcgtggtgac 4034221 acactgggag tccgatgaag cattccaggc gtgggcaaac gggcccgcca tcgcagccca 4034281 tgccggacac cgggccaacc ccgtggcgac cggtgcttcg ctgctggaat tcgaggtcgt 4034341 gcttgacgtc ggtgggaccg gcaagactgc ataaccggcg cgcggggcgc cggatgctgg 4034401 cgttaagcgc cgcggcggca ttgattgtgg cgctggcgtc gggttgctcc tcagctccga 4034461 cgccgtccgc gaacgcggca aatcacgggc accggatcga caccagaact ccgcctggtc 4034521 tgcgggcgca acagaccatg gacatgctca actcggactg gccgatcggc gagatcggcg 4034581 ttggcactct cgccgcgccc gggcaggtcg acacggtcaa gaccaccatg gaagcgctct 4034641 ggtgggatcg cccgttcgcg ctggccggcg tcgatatcgg cgccagtgtg gccgcgttgc 4034701 acctcatctc ctcttacggc gcgcaacaag acatccgcat tcataccgac gacgacggct 4034761 gggttgaccg attcgacgtc gaaacgcagg cgccgtcgat cgcttcgtgg cgcgacgtcg 4034821 acgcggcgct gagcaagacc ggcgcccgct actcatttca ggtggcaaag gtcgacaacg 4034881 gtcgctgcga cccggtggcg ggcaccaaca ccggcgaatc cctgccgctg gcatcgatct 4034941 tcaagttgta cgtgttacat gcgctggccg gtgcggtcca gcacaacacg gtgtcctggg 4035001 atgatctgct gacggtcacc gccaaaagca aagccgtggg ctcttccggc ctggaactgc 4035061 ctgtgggggc acgtgtttcg gttcgcacag ccgccgagaa gatgatcgcc accagtgaca 4035121 acatggccac cgacttgctg atcgaaaggc tgggcacccg cgccatcgag gaagcgctgg 4035181 ccagcgccgg ccatcacgat ccggccagca tgaccccctt ccccacgatg tacgagctgt 4035241 tctccgtcgg ctggggcaag ccagatctgc gtgaccagtg gaagcatgcg acccaacagg 4035301 tccgtgccca gatactgcgg caaaccaatt ccacgcccta ccaacccgac ccaacgcgcg 4035361 ctcacactcc ggcgtcaaac tacggtgcgg aatggtacgg cagcgccgaa gatatctgcc 4035421 gtgtgcacgc ggcactgcga gccgacgcgg tcggcccggc ctcgcccgtc cgacagatca 4035481 tgtccgccgt cccgggtatc cagctggacc gcagcgtgtg gccctatatc ggcgcgaaag 4035541 caggtggcct gccaggcgat ctgacgttca gctggtacgc cgtcgacaag accggccaac 4035601 catgggtggt gagctttcag ctgaactggc cccgcgatca cggaccgacg gtgaccggct 4035661 ggatgctgca ggtcgccagg caagtctttg cgttgatagc gccacaatag atcgctacag 4035721 cccaggcatc cggaggtatc cgcggctcgc ttccgtaacg accggccggt cgtgctcgac 4035781 gtgaacaacg agacacttcc cgcgccggtg cgttcgacgg ccgattcgct ccggctcacc 4035841 gataggaggc gccaccgtgg gatggatcgg cgatccgatt tggctcgagg aggtgctacg 4035901 gccggcactc ggcgagcgcc tgcgggtgct cgacggctgg cgggaacgcg gacacggcga 4035961 ctttcgcgat atccgcggtg tgatgtggca ccacaccggc aactcacgtg agaccgccaa 4036021 aagcattgcc cgcggccggc ccgacttacc cggcccgctg gccaatctgc acatcgcgca 4036081 cagcggggtc gtaacgatcg tcgcggtagg cgtgtgctgg cacgccggcc gcggcagcta 4036141 cccgtggctg ccaaccgaca acgccaactg gcacatgatt ggcgtcgagt gcgcgtggcc 4036201 gaccatccgg cgtgacggct cctacgacgc cggtgagcgc tggcctgacg cgcagatcgt 4036261 gagcatgcga gacgtcgccg cggcgctcac gctcaagctc ggctacgggc ccgaacgcaa 4036321 tattgggcac aaagagtatg ccggggcggc tcaaggcaaa tgggacccgg gaaacctgtc 4036381 gatggactgg tttcgcgccg aggtggcaaa ggacacgcgg ggcgagttcg accaccccct 4036441 caccccgccg ccggcggtga ttgcccgccc accgattctg cccaagccgc gcaacccgcg 4036501 tgacgatcgc atcctgctcg aggaggtgtg ggaccagcta cgcggcatcg agggccgcgg 4036561 ctggccggta ctcggcgaca agacgatcgt cgactaccta gccgagctcg gcaataaggt 4036621 cgacgccctg gccgcaaaac tcgacgcgcg cgagggcctc gaccggccca gtgacactcg 4036681 gtagctgctc cagcaggcgg cggggtgctg acggacccgc tgcaacgatg tcaaccgggc 4036741 tggcccggct ggccgggctg gccgggtgca ccttcagggc cgaactggcc gaggttgccg 4036801 tcgccgccac ggcccgcagg cccaacgccc ggggagccgg ccacaccggg ctcaccaccg 4036861 cccccgccat tacccccgct accggcggca ccgcccagcc cggagctacc ggagacgccg 4036921 aacaggccgc ccgcgccgcc cgcgccgccc gcgccgccgt ctccgccggc gccgccgtca 4036981 ccgccgatcc cgccgttacc cccgtcacca gcgtcacccc caacgcctgg ttgcccggcg 4037041 ccgcccattc cgccctgacc gccggtgccg ccccgggcgc cgatgccgcc ttcgccccct 4037101 gtgcccccga tgccacctgc gccgccgtcg ccgaccagga gcccaccccg accgccagag 4037161 ccgccggccc cgccggtccc cccggtgccg ccggtcccgc cggtcccgcc aacggacagg 4037221 ccagtaccgc cagtgccacc ggtgccaccg gtttgcccga actcggtgcc tggctggccg 4037281 ggtccgcccg gttcaccgtt gttacccatg ctgccctggc ccgccgggac ggtcaggggg 4037341 ttgaccccgg cggcacccgc cgcgccggcc gcgccgagcc ccccggcccc accgttgccg 4037401 aatagccacg cgttgccgcc ggcaccaccg gcgccgccgt tgccgccggg gatagtcgcc 4037461 gccccgccgt tgccgcccgc gccgccgttg ccgtacagca gcccgccttg tccgccggac 4037521 ccgccggccg caccggctcc gccggcacca ccggatccgc cgttgccgat caacccggcc 4037581 gccccgccgc tgccgccggc aatccccgcg ttcaccccgg ccgcgccatt accgccattg 4037641 ccatacagca gcccgcccgc cccgccgttt tgcccgggcg ccgtcccgtc ggcaccatca 4037701 ccgatcaacg gacgtcccaa cagtgcctgg gtgggcgcat tgatcgcgtt cagcaaactc 4037761 tgctgaacgt tggcggcctc ggcgttggca tacgcccccg cacccgcact caaggtctgc 4037821 acgaactggg catgaaacgt cgccgcctgc gcgctgatcg tctgatacgc ctgggcgtgc 4037881 gcgccgaaga gcgccgcaat cgccgccgat acgtcatctg cgcccgcggc cagcacgccg 4037941 gtggtcggga tgctggcagc cgcgttagcc gcgctgatcg tcgaaccgag attggccaaa 4038001 tccgtggccg ccgccgacag gaactccgga acggcaatca caaacgacat tggccacctc 4038061 cgaacagctt ccggacaaac cgacgtcagc agagtctatt gtcacagcgg atcggcggtc 4038121 gcggttttcg cctaatacgg ccgatggacc tagaccgcta ccgcgcggcc ggctccgggc 4038181 cgcccgcgct gtgcgctcca gccttggcca gatccggctc ggccggcggc ttgcgggtac 4038241 cggtgaaggt gaacaccgcg tcctcgccgg gaccttcacc gtcccagttg tccacgtcga 4038301 cggtgaccac ctgacccggc ccgacctcct cgaagaggat cttctccgag agctgatctt 4038361 cgatctcacg ctggatggtg cgccgcaacg ggcgggcccc caacaccggg tcgaagccac 4038421 gcttggccag cagcgccttg gccgcatcgg tcagcaccag cgccatgtcc ttgctcttga 4038481 gctggccggc gacccggctg atcatcaggt cgaccatccg gatgatctcc tcgcgggtca 4038541 gctggtggaa gacgatgatg tcgtcgatgc ggttgaggaa ctccgggcgg aagtgtttct 4038601 tcagctcgtc gttgaccttc tgtttcatcc gctcgtagtc gttctcaccg ccgcccttgg 4038661 aaaagcccag accgaccggc ttagagatgt cggaggtgcc cagattggac gtaaagatca 4038721 gcacggtgtt cttgaagtcc accgtgcggc cctgcccgtc ggtgagccgg ccatcctcga 4038781 gcacctgcag caggctgttg tagatctcct gatgcgcctt ctcgatctcg tcgaacagca 4038841 ccaccgagaa cggcttgcgc cgcaccttct cggtgagttg gccgccctcc tcgtagccga 4038901 cgtatccggg cggcgcgccg aatagccgcg acgcggtgaa ccggtcgtgg aattcaccca 4038961 tgtcaatctg aataagcgcg tcgtcgtcac cgaacaagaa gttggccagc gccttggaca 4039021 gttcggtctt accgacaccg gacgggccgg cgaagatgaa cgagcccgac gggcgcttgg 4039081 ggtctttcag cccggcccgg gtacgccgga tggccttgga aacggccttg acggcgtcct 4039141 cttgcccgat gatccgcttg tgcagctctt cttccatccg caacagccgg gtggtctcgg 4039201 cctcggtgag cttgaacacc gggataccgg tccagttgcc cagcacctcg gcgatctgct 4039261 cgtcgtcgac ctccgcgacc acgtcaagat cgcctgaacg ccactgcttt tcgcgctcag 4039321 cacgctgtgc gaccagtgtc ttctcccggt cgcgcaggct ggcggccttc tcgaagtcct 4039381 gggcgtcgat agccgattcc ttctcccgac gagcctcggc gatcttctca tcgaactcgc 4039441 gtaggtctgg cggtgcggtc atgcgacgaa tccgcatccg agcacccgcc tcgtcgatca 4039501 ggtcgatcgc cttgtcgggc aggaaccggt cgttgatgta gcggtcggcc agggtcgcgg 4039561 cggccaccat cgccgcatcg gtgatcgaca cccggtggtg cgcctcgtac cggtcccgca 4039621 ggcccttgag gatctcgatg gtgtgctcca ccgtcggctc acccacctgc accggctgga 4039681 agcggcgctc cagcgcggcg tccttctcga tgtacttgcg gtattcgtcg agcgtggtgg 4039741 cgccgatcgt ttgcagttca ccgcgagcga gcttcggttt caggatcgag gcggcgtcga 4039801 tcgcgccctc ggcggctcca gcaccgacca aggtgtgcag ctcgtcgata aacaggatga 4039861 tgtcaccgcg ggtgttgatc tccttgagca ccttcttgag gcgttcctcg aagtcaccgc 4039921 ggtagcggct acccgccacc agcgatccca gatccagcgt gtagagctgc ttgtccttga 4039981 gcgtctcggg cacctcgccg tgcacgatgg cctgcgccag tccttcgacg accgcggtct 4040041 tgccgacgcc gggctcgccg atcagcaccg ggttgttctt ggtgcgccga gagagcacct 4040101 gcatgacccg ctcgatttcc ttctcgcggc cgatgaccgg gtccagtttg ccttccatcg 4040161 ccgccgccgt gaggttgcgg ccgaactggt cgagcaccaa ggacgtagac ggagagccgg 4040221 actctccccc gcggccgccg gtgccggctt cggcggcctc cttgccttgg taaccggaga 4040281 gcagctggat cacctgctgg cgcacccggg tcagctcggc gcccagcttg accagcacct 4040341 gggcggccac gccttcaccc tctcggatga ggcccagcaa aatgtgttcg gtcccgatgt 4040401 agttgtggcc aagctgcagc gcttcacgca agctcagctc gaggaccttt ttggcgcggg 4040461 gggtaaacgg aatgtgccca gacggcgcct gctggccctg gccgatgatc tcctcgacct 4040521 gactgcgcac accttccagc gagatcccca acgactccag tgacttggcg gcaacgcctt 4040581 ccccttcatg gatcaggcct aaaagaatgt gctcggtgcc gatgtagttg tggttgagca 4040641 tcctggcctc ttcctgagcc aggacgacga ccctgcgggc acggtcggta aatcgttcga 4040701 acatcggtgg ctacctgctc tccctcacca tcggatacag cggtcgacac cgcgtacctg 4040761 ccgtccactg taatggtcgg cctgccaggg ttcctaacct tgcggtgcct ggtcggttcc 4040821 ggggcgcagc gccccaagtc gccgttgaac agaaccgcat aggagataaa cgagaaaacc 4040881 acccaagcgt ttccggcgcc gagcggccat cggttcgccg ccagcgaacg cggcaaagta 4040941 ccggcgccca ggctttcgcc tgggcgccgg tagccaaatg tcaggtcgcc gcgtggtatg 4041001 cgtcgatgac gtcggccggg atccggcctc gcgtcgacac attgtgcccg ttacgacgag 4041061 cccattcgcg gatcgccgcg ctctgctcgc ggtcgatcgc gccacgtcca cggccggatc 4041121 cggaacggcc gcgccggcgc ccaccgacgc gacggcccgc cgccacccat tgcttcaggt 4041181 cgccacgcag tttcgtggca ttcttagtgg aaaggtcgat ctcataggtc accccgtcaa 4041241 gcccgaattc gaccgtttcg tcggcggcgc ccgaaccgtc gaaatcgtcg accaaggtga 4041301 cggttacttt cttcgccatt ggcttaccct cgcgtttctt cctgtgcagt acggatagac 4041361 tccccggtca ccaatctgcc ataagaacgc agaatactca atccagacac aacacccaca 4041421 gttcagttgg agtgtggtcg aacaatcggg aacaaaactg tctccctaat tgacaaccca 4041481 gtcaaagaca tcaacaaccg atcgataccc attccggttc cggtgcacgg tggcatgccg 4041541 tactccagag cggccagaaa atcctcgtca agcaccatcg cttcgtcatc gccagcggcc 4041601 gcggcacggg cctggtcggc gaatctctcc cgctggacta ccgggtcgct taattccgag 4041661 tagccggtgg caagttcgat tccgcgcaga tagaggtccc acttctcggt tacgccgggg 4041721 atactgcggt gctgacgggt caaaggcgtt gtctgaaccg gaaaatcctt gacaaatgtg 4041781 ggtgcgctca agctcttgcc cactgtgcgc tcccagagtt cctcgatgag tttgccgtgg 4041841 ccgaagccac ggttgtcatg aatcgctggg tctttctcca ggccaaggct atcggcgatc 4041901 ccacgtaagc gatcgaccgt cgtctgcggt gtgatctctt caccgagcgc cacagacagc 4041961 gacgggtaca tttgtatagt cgcccattct ccgtcgatgt catagacact gccgtcgggc 4042021 aacggcagtt gtctggttcc gatcgcctca tcggccacct cttgaataag ctcccgggtg 4042081 acgactgccg aatcgtcata ggttccgtag gtctggtagg tctccagcat ggagaattcc 4042141 ggagaatgcg tggaatcggc tccttcgttt cggaacactc gattaagttc gaagaccttg 4042201 tcgaaaccac ccacgatgca gcgcttgagg aacagttccg gcgcgatccg caggtacaga 4042261 tcgatgtcta gggcattgga atgagtggcg aacggacggg ccgccgcacc accggctaac 4042321 gtctgcaaga cgggcgtctc gacttccagg aacccacgac gttgaagcgc cgtccggatc 4042381 gcgcggacga cggcgatccg tagtcgagcc accgcgcgcg cttccggtcg aactatgagg 4042441 tcaacatagc gctgacgaac ccgcgactct tcactcatct ctttgtgcgc gacgggaagc 4042501 ggccgcagcg acttggcggc gatccgccag caatccgcca ggacggacag ctcgccgcgg 4042561 cgcgaactga tcaccgcgcc atgcacgtag acgatgtcgc ccaggtcgac atcggctttc 4042621 catgcgtcga gagcagcctg gccgaccttg tcgaggctga tcatcacttg cagctgggta 4042681 ccatcgccgt cctgaagtgt cgcaaagcat agctttcccg agttgcgcgc aaagatcact 4042741 cggcccgcga cgccgacgat gtcttcggtc gcggtatcga tcggcaagtc agggtgggcg 4042801 gcgcgaacct cggccaacgt gtgagtgcgc ggcaccgcga cgggataggg atcgcgcccc 4042861 tgggccagca agcgagcgcg cttgtcccgg cgaatccgga actgctcagg aaggtcttct 4042921 gctgtgtcag cggcactcac gacgtgccag cttaaatgac ctcacgccga cgctcgtggg 4042981 tggcgtcgag cctgtcggcg gcgggcgacc cggtacccag actcgatgcc ggcatcgacg 4043041 tcagcgcgcc gtcttgagcc ggccgcgctg gacttcgagg ttacgctcga acaccagccg 4043101 cagaccctgc aaggtcaggt gctggtcgta atggtcgacg gtgtgcaatt ccggcagcag 4043161 caggggcgcg gtatgcccgg tagccacgat cgcgacatcg tggtcgacgg agaaaccgga 4043221 cacgtcctcg cggatgcggc ctaccaaccc gtctaccagc ccggcgaagc cgaacaccgc 4043281 accggcttgc atgcattcga cggtgttctt gccaaccacc gaacgtgggc gggcaagttc 4043341 aacgcggcgc aatgccgccg agcgggccgc cgcggcatcg gaagacacct gcaccccggg 4043401 cgcgatggcg ccgccaagaa attcaccctt ggccgataca acatcaacac agatcgagga 4043461 tccaaagtca acgacgatgg cggccttccg gaaccggtca taggcggcca aacagttcac 4043521 gatgcggtct gcgcccactt ccttcgggtt gtcgacgagc aaagggatcc cggtgcgtac 4043581 tccgggctcg atcagcacgt gcggcaccga cggccagtac tggtcgagca ttatccgcac 4043641 ctcgtgcagc acggacggga ccgtggacaa ggcggcggta ccggtgagcc gctcggaatc 4043701 ctcgccgatc agcccgtcga tcgtcagtgc cagttcgtcg gcggtgactt cggattcggt 4043761 gcgtatccgc cactgctgca cgacctttgc gtgctctttc attccggaca gcaggcccac 4043821 aacggtgtgg gtgttgcgga cgtcaatcgc cagcagcacg gctatcccac accgagccgg 4043881 gggtctagca gctcgcccgc gttttcgggc acaaatgccg gatcgtggcc catgtcgatc 4043941 ggtttgttgt aagcgtcgac aaacacgatc cgcggctggt atgtgcgggc ccgggcgtcg 4044001 tccatcgtcg cgtacgcaat cagaatcacc agatcccccg gatgcaccaa gtgcgcggcg 4044061 gcaccgttga tgccaatcac accactgccg cgttcgccgg tgatcgcgta ggtgaccagt 4044121 cgagcaccgt tgtcgatatc gacgatggtt acctgttcgc cttccagcag gtcggcggcg 4044181 tccatcaagt cggcatcgat ggtcaccgag ccgacgtagt gcaggtcggc gcaggtcacc 4044241 gtggcgcggt ggatcttcga cttcagcatc gtccgtaaca tcagtttctc caatgtgatt 4044301 cgaggattgc ccggtatccg tccgggcggt cggtgccggc gaaagttccg atttcaatcg 4044361 caatgttgtc cagcagcctg gtggtgccaa gccgggcagc aaccagcagc cgaccggaac 4044421 cgttgagcgg catcgggcca agcccgatat cgcgcagctc caggtagtcg accgccacgc 4044481 cgggtgcagc gtcgagcacc gcacgggcgg catccagcgc ggcctgcgcg ccagccgttg 4044541 ccgcatgcgc tgcggccgtt agcgccgccg agagcgcgac ggccgccgca cgctgggccg 4044601 ggtccaggta gcggttgcgc gacgacatcg ccagcccgtc ggcttcgcgc acggtcggca 4044661 cgccgaccac cgcgacatcg aggttgaagt ccgcgaccag ctgccggatc agcaccagct 4044721 gctggtagtc cttctcaccg aagaacaccc gatccgggcg cacgatctgc agcagcttta 4044781 gcacgaccgt cagcacgccg gcgaaatggg ttggccgcgg gccgccctcg agttcggcgg 4044841 ccaacggacc gggttgcacg gtggtgcgca ggccgtcggg atacatcgcc gcggtagttg 4044901 gcgtgaaagc gatttccacg ccttcggccc gcagttgcgc caggtcgtcg tccggggtgc 4044961 ggggataggc gtcgagatct tccccggcac cgaattgcat cgggttgacg aagatcgaca 4045021 cgacgacgac cgatccgggc acccgcttgg ccgcacgcac caacgcgagg tggccttcgt 4045081 gcagcgcacc catagtaggc accaacatca ctcgccggcc ggtgagtcgc agtgcgcgac 4045141 tgacatcggc gacatccccc ggtgccgagt acacattgag ttcaccggga tggaacgcag 4045201 gaatcgtcat gccgtcaaaa cctcgacgac atccgcgggg gcgtgtgcgc gctgcgcggt 4045261 ccgcagcgcg tttatccggt atgcctgggc cagcgctgcg tcgacgtccg cgagggccgc 4045321 cagatgatcc gcgaccgctg ccgcatcgcc gcgggcgacc ggtccggtga gcgcggcctg 4045381 tccccgctgc agcgtgttct ccagcgccgc tctggccagc ggcccgacga tgcgctccac 4045441 gatcccgccc ggctggtcgt cgacggtttg ttggccgagc agttcccccc cgctcagggc 4045501 ggcccgcaac gcctcgagcg catcggccag cacggtgacg atgtggttgc tcgcatgggc 4045561 cagcgccgcg tggtagagga tgcgggcgtc ttcgcgcaca caaaacggct ccccgcccat 4045621 ctcaagaacc agtgactgtc cgatcgcata cccgacgtcg tcggccgcgg tgatcccgaa 4045681 gcaggtatcc ggcagccggc tgatgtcctc gtcggagccg gtgaaggtca tcgccgggtg 4045741 aatcgccaat ggtatgcagc cctgttgggc tagcggcgcc agaatgccaa tcccgttagc 4045801 tccggaggtg tgcgccacaa tcgtttgtgg ccgcaccgcc gaggtggctg ccaggccgga 4045861 taccaggccg gcgagttcgc tgtcggtgac cgccaatagc agcagctcag cgctggccgc 4045921 gacgtccagc ggtggcagca ccggggtatc aggcagccgg cgctgcgcgc gccgccggga 4045981 cgcatgagag atggcgctgc acgccaccac aacatggtcg gcgcgctgca gcgcgacccc 4046041 tagcgcggtg ccgacccggc cagccgagat gatccccacc ttgagcctgg ccggacgcaa 4046101 accgtcgaac cgctccatag cagacggcct cacaggtttc ttggttcgtt ccagtcccat 4046161 gcccgggtac cggacggtca ccaagactgt agtcgatttg cacgtcaaga cccacccggg 4046221 gcactgctga tttggtcact acaccaacag tgtcggttgc cggcggcaat cgggcgggta 4046281 caccctggca caagcggcgc cgctattcac cgcggcggcg acgccggccg cctccggtcg 4046341 actcgacctg aagcctcgcc ataaggtcgg cgaccgactg gccgccggtt agcgggtccc 4046401 gcgcatgcaa accaccggag tcatccggcg gcgtgtccgc ggtgcggtgc cggcgtgtgg 4046461 gctcagcagg cggcggcggg gccattggag gtgacggcgg acgctcaccc gtcccggcac 4046521 cggagccgcc gatgtcatgg tcgcggtgct ccgccgaatg acgcgaccgg cgaccggatt 4046581 caccgtattg tgccgcgagc tcgacgtagg cgggcggatt gtaggcctgg tctgctgggc 4046641 tggcatgccg ggcgcggcgc cggcgccccg gcggcggggc cgccggcgtg gtttcgggtt 4046701 cgacggatgc ccactggctg ccaggtgttt cggcaggcag ccactgcccg tggctggtga 4046761 ccggctgcca gactggtcgc tcctgttgcg gcgggagcgg cgggggccgg tggcgcggct 4046821 cgaataacgg ctcaggttgc ggcggtggcg gagcctcata atgccgcggt cctccgctca 4046881 ccggtggtac ccccacctcc ggcacatcga tgatcgaggc ctcgtcggtg cggctggcgc 4046941 catcaccccc gcggaccgcc ataacccgat cgctggatac ccagtccgcg ggagggctct 4047001 cgccatcgag ggcacgcgcg gccctggcct ctttctccac ggtccccagc gccggacggt 4047061 gctcgaggtc ggcgtcgaac aaaatctcca ggctggttcg cagcgcggcc agttcggccc 4047121 gcagggctgc tacctcgtcg gcggccggag cgcgcaactc cgaggccagc tcgcggcgca 4047181 gctgagattc cagggtcagc tcgtactccc ggcgcgccga aatctcgcga tccaactgaa 4047241 ggtcatagac cagcttcagg tcacgcaccc gggcctgatc cacgtcgctt tgccggcggt 4047301 aaagcaccga cacaaacgca cccgcgaccg ccgcccacag cgccagcaga acagcgagct 4047361 tgagaagttc cacgcgatcg gtgaaaacca atgcggaact ggccccaatc gccaggacca 4047421 gcaacgccgt caaaagcacc caacccggcc tgcggccgcc gcgccggacc cgggcgccgc 4047481 gggacagaac ggtcatggcc tgactgtacc cgggcgaggt caatccgcgt gtcgcgccgg 4047541 tccggcgatt cccgcatggc ttagccgggt aggcagttcg gccaaattcg ccgcgtagac 4047601 aaccccgcat tccgggtcgg ccgcccggcc agcaacgtcg acgaccgacg cccatcgccg 4047661 gttcggccat ggccgacgca gcgcgcggca ccgtcgggtg tgggctagct ttccgcgccg 4047721 tcggcgtgct cggtcggatc ctgcggagac ttgcagcaat gttgcagcca aagcgcggca 4047781 accaccaacg ctagcgcgct gcccgccgcc accaccgtgc cagtggtgtc ctcggcggcc 4047841 gcccgcagcc atgaccgccg cggcaggaag tacgccagca ccccgatcca ccaccccgtc 4047901 accagcgcac ccacccaggc cgaggccttg gctaccatca agctgcgcgc caccacaagc 4047961 gggtgcagcc agccgggccc gtctccgatc tcgccatcgc tgatcttgac ccgcacgtag 4048021 cgagcccaca acgcctcggc gaccgcgacc gcgagcaagg acaagcccgt ccacaccgtg 4048081 atcggcggaa accaccggta aagcaccgcc accaacagat atcccaccgc cgcggcgccg 4048141 accaccgcgg cggtcagatc acgttttcgg gtcggtccca tcagctttcc ggtgcccgac 4048201 tgacggggtg tctgctattc agatcgaacg acggcctaaa caaccgcaca ctgtcgcggt 4048261 cggcgggctc cagctcggcc agcagtcgcg tgacgggccg cgggcacccg gcaaccgtca 4048321 gctgcgccgt tgggtcgacg gcaatccacg ggatcaacac aaaggcccgc agatgcgcca 4048381 gtgggtgcgg cagcgtgagg tggttctccc gcgcggtcac ttcgaccaga gcctcggtgg 4048441 ccgaggtctg gtagcaggcg atcaggtcga cgtcgagatt tcgtggaccc cagcgctggc 4048501 cacgcaccct gcccgcagcg cgctcgaact cctgcgcccg ccgcagccac tcccgcggtt 4048561 cgcaggtagg atcgtcggcg atcagcaccg cattgaggaa ctgcccctgc tccaccccac 4048621 cccaggggtc ggcctcatat atcggggaag ccgcaatcaa cgcatcgccg agaccgtcgg 4048681 cgaccgaccg caatcgtgcc aggcggtcac ccaggttgga gccaaccgag agcactaccc 4048741 gcgtcatacc gcgccgcccg ccgggactac ccaaccgcgg ccgccgcgcc gtgagcgtcg 4048801 gatcaccacc gccacatcgt cgaacgtctg cggaatgggc gcctgcggct tgtgtaccgc 4048861 cacctcaacg gcatgcactc gctggtcgtc catcacgtga tcagcgatct cggccccgac 4048921 cgtttcgatc agcttccgcg ggggtccggc gacgatctcg gccgcccgcg aagccagccg 4048981 cacgtagtca taggtgtcgg ccaagtcgtc gctgttggcg gcctcggcca ggtctatcca 4049041 cacggtgaca tcgatgacaa accgctgccc ggccactcgc tcgtggtcgt agaccccgtg 4049101 ccgaccatgc acggtcaggc cgcgcagttc gattcggtca gccatcgcgt tctatccttt 4049161 ccgctcccat ccacgcttcg accaccttga tggcatcgac cgaggcccgc acatcatgca 4049221 cccgcacacc ccaggccccg tgcagtgcgg ccagcgcgga aatcaccgcc gtcgcggtgt 4049281 cacgcccatc ggttggccgc atcacgccgt cgggcccggc caacaacgca ccgaggaagc 4049341 gcttgcgcga agcacccacc agcactggga ttccggtcgc gaccagttcc ggaagggcat 4049401 gcaagatcgc ccaattatgt tgcgccgtct tggcgaatcc aagcccggga tcgagcacca 4049461 gccttgccgg gtcgacgcct gcggccaccg cgtcggcgac gctggccagc aggtcggcac 4049521 ggacctcggc caccacgttg ccgtagcgca caggcacatg cggggtatcg gccgataccg 4049581 cccgccagtg catcaacacc cacggcacat cggcctcggc caacagcggc cccatcgccg 4049641 gatcggcccg cccacccgac acgtcgttga ccatctgggc accgttctgc aacgccgccc 4049701 gagcgacatc cgcgcgcatg gtatcgatgc tgacggtgat gccttgtgct gcaagctctt 4049761 tgacgacggg tatgacacga gacgtctcca ccgccgggtc aacccgagtg gcaccgggcc 4049821 ggctcgactc accaccgacg tcgacgatgc ccgcacctgc ggctgccatc gccagaccgt 4049881 gcttcaccgc atcgtcgaga tcgagataac acccgccgtc cgagaaagag tcgtccgtga 4049941 cgtttagaac ccccatcacc tgcacgggcg ccggactcac ttccgcaaaa tgaggtcgag 4050001 cgcttcggct cgagaagcgg cattggtttt gaacagtccg cgcaccgccg acgtagtggt 4050061 gaccgagccg ggcttgcgaa ccccgcgcat cgccatgcac agatgctcag cctcgatcac 4050121 cacgattacc ccgcgtggat cgagtttttt catcagggca tcggcgatct gactggtgag 4050181 ccgctcctgg acctgaggtc gcttggcgta cagatcgacc agtcgcgcga tctttgacaa 4050241 gccggtcacc ctgccgtcgt cgcccgggat gtagccgacg tgggccacac cgtggaacgc 4050301 caccaggtgg tgttcgcagg tggagtacat agggatttcc ttgaccaaca ccagctcgtc 4050361 gtggtcttcg tcgaacatgg tgttcaacac cgagtcgggg tcggtgtaga gcccggcgaa 4050421 catttcgcgg tatgaccggg caacccggga cggggtggct accaagccgt ccctatccgg 4050481 atcctcgccg atcgcgtaca gcaattcgcg caccgcggcc tcggcacgtt gctggtcgaa 4050541 cacacggata cgagcagatg cgctgcgcga atccagctgc gacatcgaat gctccgttcg 4050601 tcagccgtgg gccggcttgg tccgactgac ctcgtcatcc tgctccgccg aggactcatc 4050661 ggaacccgga tcggcttgac cggtcgggta gggctgaccc ggatacgtcg gtgccggttc 4050721 accgctatag ctgggccgat gagatgacct tgggggccat cccggcgcat gccagcccgc 4050781 cggggcaccg tagtcaggct gggtggagcc gtactggcgg tcaccggacc ggtgggtgcc 4050841 ggcgggcgaa ccgttggcgc cgtgcccggt ttggccggcg tcggaccggg cggcctcagc 4050901 ggcttgggta gcctgcgcaa tcgccgcctt gaacgccggc tcggggaccg gctggggcca 4050961 aggttcgccg cgttcgatcg cgagctcgcc gggtgtcttg atgggcggtt tgtccgacgg 4051021 gatccggcca ccgaagtcgt cgaacatggt gagccgcggc cgcttttcga cgtcagcgaa 4051081 gatgctttcc agctcgggtc ggtgcagggt ctccttttcc agcagctcgc cggccaaagt 4051141 gtccagcacg tcgcggtatt cggtcaggat ttcccacgct tcggtatgcg ccgcctcgat 4051201 aagcttgcgg acctcttcgt cgatctcgcg ggcgacctcg tgggagtagt ccggctgggt 4051261 gcccatggta cgtccgagga acgggtcgcc gtgttcggag ccgtatttga ccgcgcccag 4051321 cttggagctc attccaaatt cggtgaccat tgagcgcgct atcttggtgg cctgctcgat 4051381 gtcggacacc gcgccggtgg tcggctcacg aaacaccagt tcttcggcgg cgcgcccacc 4051441 catcgcgaac accagttgcg cgatcatttc cgagcgggtc cgcaggccct tgtcttcttc 4051501 cggcaccgcc accgcgtgcc cgccggtacg cccgcgcgcc aggatcgtca ccttataaat 4051561 cggctcgata tcgggcatcg cccaagcggc cagggtgtgc ccgccctcgt gataggcggt 4051621 gatcttcttc tcctgctcgc tgatgatccg gcctttgcgg cgcgggccgc cgatcacccg 4051681 gtccaccgct tcctcgaggg cgggaccggt gatgacggtg ccgttctccc gggcggtcag 4051741 cagcgccgcc tcgttgatga cgttggccag gtcggctccg gtcatgccga cggtccgctt 4051801 ggccagtccg tcgaggtcgg cgtccgcggc catcggcttg cccttggagt gcacgcgcag 4051861 caccgcccgc cgacccgcca gatcggggtt ggataccggg atctggcggt cgaagcggcc 4051921 cggccgcaac agcgccgggt ccaggatgtc gggccggttg gtggccgcga tcaggatgac 4051981 gccggcgcga tcgccaaaac cgtccatttc gactagcaac tggttgaggg tctgctcacg 4052041 ctcgtcgtga ccgccgccca gcccggcgcc tctttgtcgg ccgacggcgt cgatctcgtc 4052101 gacgaagatg atgcacgggc tgttctgctt ggcctgctcg aacaggtctc tgacacggga 4052161 tgcgccgacg ccgacgaaca tttcgacgaa gtcggagccg gagatggtga agaacggcac 4052221 tccggcttcg ccggccaccg cacgagccag caacgtctta ccggttcccg gcggcccgta 4052281 gagcagcacg cctttgggga tcttggcgcc cagcgcttgg tacctgctgg ggttctgcag 4052341 gaagtccttg atctcgtaga gctcctcgac cgcctcgtcg acacctgcga cgtcggcgaa 4052401 ggtggtcttg ggcatgtcct tgctcagttg cttggcgcgt gacttgccga acccgaagcc 4052461 catccgggcg ccgccttgca tgcgggagaa catcacgaac agccccacca gcaacagcag 4052521 cggcagcacg tagaccagca gctcgcccag gatgctgccc tggttgacga ccgtgctgac 4052581 cttcgcgttt ttggcgctga gcgcgttgaa caggtcgacg gcgtacccgg tggggtactt 4052641 ggtgatgacc ttctcggacc cgtcggtctc gttgttaccc ttcttcagga tcagccgcag 4052701 ctgttgctcg cgatcgtcga tctgtgcgct cttgacgttg tcgccgttga tctgtgttat 4052761 cgccaccgag gtatcaacgg gcttgtagcc gcgggtgtcg tcgctgaagt aaaagaacga 4052821 ccagccgagc agcaccacga cggcgatcgc tgttatggtg cgagtcacgt ttttccggtt 4052881 catcgatcat cggccgtgcc ggccaggtcc ttcccgatac acgcagctgg aaagtccagg 4052941 ttaccgctcg tggcgatcgc aaacccggcg gagccgggtg cagcgggtcg ccaccatcag 4053001 ccccgtggcg atcgcaaacc ccgcgcctgg cgacaatgcg gcccgcaaaa cgggccgagg 4053061 aggagccagg caatcacccc agagccgggt gcagcgggtc gccaccatca gccccgtggc 4053121 gatcgcaaac cccgcgcctg gcgacaatgc ggcccgcaaa acgggccgag gaggagccag 4053181 gcaatcaccc cagagccggg tgcagcgggt cgccaccatc agccccgtgg cgatcgcaaa 4053241 ccccgcgcct ggcgacaatg cggcccgcaa aacgggccga ggaggagcca ggcaatcacc 4053301 ccagagccgg gtgcagcggg tcgccaccat cagccccgtg gcgatcgcaa accccgcgcc 4053361 tggcgacaat gcggcccgca aaacgggccg aggaggagcc aggcaatcac cccagagccg 4053421 ggtgcagcgg gtcgccacca tcagccccgt ggcgatcgca aaccccgcgc ctggcgacaa 4053481 tgcggcccgc aaaacgggcc gaggaggagc caggcaatca ccccagagcc gggtgcagcg 4053541 ggtcgccact ggctagacca acgaccggta gttcccgacg gcgtcggaaa atccgacagc 4053601 tgagcgttcg ggtcaaacac gcggtgcacc ggacctgatt tggctcgaat tggtgcgcac 4053661 cgagggtcgg gcacatcgct ccggtcgcat gtgtcactgc accgggcgac acccgatctg 4053721 cccagctctc agcgacagct gcctgacctg cggttttgtt cacaagttgg ttgcggctgt 4053781 gcgggattgt aggcggcgtt gaccggcaga aaccgagttg tcgcgcatag gtgagcacag 4053841 cgaccatcgc ccccggtgga gtccagtgtt gcggacgtga ctaaagagca gcacgggcag 4053901 cgggagcaga actcgggtca attgagtcat ccagcgcgcg aacgtggttc ggcgcagccc 4053961 cggttggctg tctgggcgtg aaggtgctcc cgagcggccg gcccgccatg aaggcgcgcc 4054021 aaagctttgg cattgtgcac attttccacc cgtgctctat taatgctgag ccgcgaattg 4054081 tgagcccagt cgggaaacac gcggagcacc agagtcaccg cagcggccgg ggcggttcaa 4054141 ctcaccatgg atcgctctcg tcgtctggtg ctggacaatc gtcgctgtag cgcgtcgcga 4054201 acacctcagc ttctgctgcc gcggcttctt ccggcgatgg taacccccag gtttcgccca 4054261 cggtcttacg tagcagtgcg acgcggtgtt catctgcatc gacctgttga ctcatcctgt 4054321 caaggatgaa ggcgtactgg gccgactgcg ccttctgccg cgccaggtcg gcaatcacca 4054381 ggatctcaga agcgagctgc gactcactca tccaggccac cctggccgac agctcgacat 4054441 ggtcaatccg gccgtccatc agcgtcgata ccgacaccgt gcgtggggga ttcgtcacgg 4054501 taaaaagcgc gatctcttgt tcggtgtccg tctccgcctg accgtgggca ttgtccaggt 4054561 cgggtccggt gtccggggtc gccgccgacc cgacgccaat aatcggatcc gcagtccagc 4054621 cctccgcgcc gtcggcaccc cagagatcca cggcgtcgaa atcgttgctg tcaaagtcat 4054681 ttccgggcaa gtccaccgtc ccttcggaat tcattgccac ccgggaaggg tcggcctggg 4054741 cagctggcgt ggtcagtccg aacaggtcgt tgggaagacg ctgtggcctg cactgcgggc 4054801 agcaaacgtg gtcaggtaaa caacccgtcg atagccttgc gccacgcttc gtcggcctcg 4054861 ctatatatct tcgccgcaat tcgaagactt ttggcgagat cgacaccggc cgtatgcaag 4054921 gacgagccca gggcattgtg ggcagtcaag tacacattta acgtgtcgtt gaactgtgag 4054981 cagtacggac cgtgagtgat cgccacagat tcgcctaggc cagcggcagc ttcgacgccc 4055041 gaggaggcat cgaccgccgc gttgtcatgg tgcgacgcca gtacaccgag acgctcgggc 4055101 tggacggtca agttttccgt cattgatcgt gtcccttccg tttagcattg cgcgttgtta 4055161 ggcgctggct agcaatggat ttggctcgcc atgccgttag acgacgtttc gtaccagcac 4055221 cttttgccca ccgcccgcgt cagcttcgac tggcgcgcgc tcggcgtctt cagtgcccgc 4055281 cgccgcgcct tccgagtact tcttcgtcgt cgtccctttc gacgcccccg aagaggggtg 4055341 catgccgccc atgcctacgg gtccgcccat accttgggaa ccctgcgcgg agaccagctg 4055401 cgactgcccg ccgacctgct cggcagcggc gccgaccggg ccatcagctc ggggccgtag 4055461 cgcctgccga gttgaggcgg catggacctg agccaggctc ggcaagcccc caaaaccgga 4055521 cccgccccca atgccggcca gggcgggcaa gctggctgag ctcgccaggc tatccgcgtg 4055581 agccaagccc gacgatgcgg acagaccggc cgcaccgaac aagccagtca cttgcgacaa 4055641 gccgctggtc gcgccggtca agccggggac gcccgcaaag aaggactcca ggttcgacca 4055701 ccctcgagag aacagtccgg tcacccaccc cgtgagcttg tcccaaagct ctttcaggcc 4055761 gttgagcgcg tttgtgatga actcccacac ttctccgagg gtgcccttga tgatgtccgc 4055821 cacatccgaa atgatgtccg caatggcggc cgcgaccaac tccgccaatt tggcaagcaa 4055881 tttgaggagt tgagtcgcgt tgatcagcgt tttcacgacc aagtaggcaa gcgcgccgcc 4055941 cactacggcc atcgcgcccg cgcaaaacgg cgcctggaag gcggccgata gggcgtgccc 4056001 gacgaccggg atgtaggtca ggtccacagc caccgggcgc acgaactcga gacctttctt 4056061 ggcgccctcc aggatgtcgc gggtcgtctg gaccgcgttg gcctggtcgt ggatcaggct 4056121 gatgagctga cgatcgaggt ctgccagttc ctggaaaaaa ttcacgtggt tgcggttttt 4056181 gccggcgtat ttgtccgcgg ccgaacctaa ccagccatca cccggaaacg ctgctgccag 4056241 ctcctccagg gctttttcga agtactctag tgaggagtaa aggatacccc cttggttggg 4056301 tattccaatc cccagaaggt cgtacaagcc gtcaatggca ctgatcgttg gatcgatgat 4056361 gaacgctctg ctcatgcctg ccgcctatct caacggtcgt cgattccatg catagccttg 4056421 gttctgcatt gcacgcgtag ggcctacagt ctggctgtca tgcttggccg atgtcaacag 4056481 tttttttcat gctaagcaga tcgtcagttt tgagttcgtg aagacggcat gttcacttgt 4056541 tgtcgactac atcgtctgcg cacatttgcc ctcctgcaac tgcgctgcga caatgcgcca 4056601 accgccgtgt aggcggcgcg atcccaaggc agtgtctccg acgtcgatgc ctgcgcttcg 4056661 ccttcgatcg gtatgagatc tgttgcagga gagtctatat agtgtgctca tggggctagc 4056721 cggcggcggc ctcgtggcgg gcacaatcac ctcgccggtg gcgcaatcag ggctgtgcta 4056781 acccaccatc actcacccga ttcggcgtcg aagcggggcg ctctcatggt tgcgaggcaa 4056841 agcaaatctc ggttgtccta aatcgcgtcc gctaaacacc tagctaggcc gatctgtcat 4056901 tatctccgat catgtttgat aaggcgacga aaaccgacga tggaaatccg ttgcgctcgg 4056961 caagatcggc gaagtattgc ggcggcctta tctaaaccac tgaagtttta gtaattatcc 4057021 gtccgagata tccgaatata gcgaacaccg gtaccttgcg aagaaaagcc tgaatctgat 4057081 aacgccgata tccactcggg agttatcggg caacggaaag cgaaacggcc tccgtcggag 4057141 agcgactggg atagccctgg ttccgggtgg tttgctatcc cgggataacg gcagtgctac 4057201 atgctcggac cgatttgcga tgcagcccca ccaatgcggt gtctcgcctt agtagacacc 4057261 tgccgaggat gggttacatg gtggtcagct actgagccaa ccggtcgcac ggcgagccgt 4057321 atcaagatca cgccaagaca gcggttaatt ctatcagcaa atgtttctat aggactctat 4057381 agcccgcctg agctattccg gtgctgtcgg ctaagcctgt gaccggtgtc actgcagcaa 4057441 gccatttcac cgattggctc acgtttggga ccctcgactg actgcggttg gttgacctgc 4057501 tgcttttgtc cgcgaattca ccggaatttg aactggacct ggccggcaat cgtggggcag 4057561 tcactgtgag ctgtagccat gccagctgca caggaagtgc gatccggacg tcaagggagg 4057621 cccgactggt ccggccggcc gatcaatgat gcgcggcagc acccgcgaca atcgcctctg 4057681 gctgctcccc aagcccttct caggccggtg cccggtgtga tttggtgaga cgatgggcgc 4057741 acctaccgaa cggttagttg ataccaacgg cgtgcgactg cgagtggtcg aggccggtga 4057801 gcccggcgca cccgtggtga tactggccca cggctttccc gaactggcct attcatggag 4057861 acaccagatt cctgcgcttg ccgacgccgg ctaccacgtg ttggctcccg atcagcgcgg 4057921 ttacggcgga tcgtctcgcc cagaggcgat cgaggcctac gacattcacc ggttgaccgc 4057981 tgacctagtg ggcctactag atgatgtcgg tgccgagcgg gcggtctggg ttggtcatga 4058041 ctggggtgcc gtggtggtgt ggaacgcgcc actgctgcac gctgaccgag tcgccgccgt 4058101 tgccgcgttg agcgtccccg cgctgccccg ggcacaggtg ccgccgacgc aagcgttccg 4058161 cagcaggttt ggggagaact tcttctacat cctttatttc caggagcccg gcatcgccga 4058221 cgccgaactc aatggcgacc cggcccgcac gatgcgccga atgatcggcg gtctgcgccc 4058281 tccgggcgat cagagcgcgg caatgcgtat gctggcgccc ggccccgacg gctttatcga 4058341 tcggcttccg gagccggccg ggttgccggc ctggattagt caggaggaac tcgaccacta 4058401 catcggcgag ttcacccgca ccggtttcac cggcggcctg aactggtacc gcaacttcga 4058461 ccgcaactgg gagaccacgg ccgacctcgc cggcaagacg atctccgtgc cctcgttgtt 4058521 cattgcgggc acagccgatc ccgtcttgac gttcacccgc accgaccgcg ctgcggaggt 4058581 gatctccggc ccgtatcgcg aggtgctgat cgacggggcc ggtcactggc tgcagcagga 4058641 acgtcccggt gaggtgaccg cggccctgct ggagttcctg acggggttgg agttgcgatg 4058701 aaggcaccgt tgcgttttgg cgttttcatc acgccattcc atccgaccgg tcaatccccg 4058761 accgtggcgt tgcaatacga catggagcgc gtcgttgcgc tggaccggct cggctacgac 4058821 gaggcgtggt ttggcgaaca ccactccggt ggctacgagc tgatcgcttg cccggaggtg 4058881 tttatcgcgg ccgcagcgga acggaccacc cacatccggc taggtaccgg agtggtttcg 4058941 ctgccctacc atcatccgct aatggtggcc gaccgttggg tgctgctgga tcacctgacc 4059001 cgtgggcggg tcatgttcgg caccggcccc ggcgcgctgc cgtcggacgc ctacatgatg 4059061 ggcatcgatc cggtcgagca gcgacgaatg atgcaggagt ccctcgaggc gattctcgcg 4059121 ctgttccgtg ccgcacctga cgagcgaatc gaccgccact ccgactggtt caccctgcgt 4059181 gaagcgcaat tgcacatccg cccctacacc tggccgtacc ccgaaatcgc taccgcagcc 4059241 atgatttcgc catcgggtcc gcgactggcc ggtgcgctgg gcacgtcgct gttatcactg 4059301 tcgatgtcag tgcccggcgg ctacgctgcg ctggaaacag cgtggggcgt ggtgcgggag 4059361 caggccgcca aagctgggcg gggcgagccg gatcgcgccg attggcgggt gttgagcatc 4059421 atgcacttgt cggacagccg cgaccaggcg atcgacgact gcacttacgg gttacccgac 4059481 ttctcgaggt acttcggcgc ggcagggttt gtcccgttgg cgaacaccgt ggaaggcacc 4059541 cagtcgtctc gggaattcgt cgagcaatac gcggccaagg gaaattgctg catcggcacg 4059601 cccgatgacg cgatcgccca cattgaagac ttgctgcacc ggtcgggtgg cttcggaacg 4059661 ttgctactgc tcggccacga ctgggccccg ccaccggcaa cctttcactc ctatgagctg 4059721 ttcgcccgtg ctgtgattcc ttatttcaag ggacaactcg cggcgccgcg ggcgtcgcac 4059781 gaatgggcta gaggcaagcg cgaccaattg attggccgcg ccggcgaagc ggtcgtcaaa 4059841 gccatcaccg agcacgtcgc cgaacaaggg gaagcgggca gctgacgcgg gcgcagtgtt 4059901 cccaacgacg acatgcccgt gtatcgggcg ccaaagtcga cgctgatcgg cccgccctgc 4059961 gcggacccaa cttaggaccc gggttaggcc cagctggagc cgacggcgct gtcggtttgt 4060021 gccatgttgt tgccggcagc ctgcaccttc tgcccgtggg cgttggcctg ctcgtagatc 4060081 acctggaagt tacggcccag ctgggtaatg aacccctggc aggccgccga accggcgccg 4060141 ccccaaaagt cactcgcggt caacacatca gaaatgatgg cctgatgctc ggcctccagc 4060201 gacccggcct gagcgcggat catggcgccg tgagcgtcga cgtccccgaa ttgatagttg 4060261 atggtcatgt gtcctcctga gtcgtcgggc cgggtcagct gctgaggatc tgctgggagg 4060321 cctgctcttg ctgttcgtag ttgttggcgt cgcgaaccag cccgtcacgc accccgtgca 4060381 gcatgttcac gatgttgcga aacgcctgat tcatctgggt catggtgtct agcgaggtcg 4060441 cctcggccat gccactccag cccgcgccgg aaatgttttg cgcggacgcc cacatccggc 4060501 gagcctcgtc ctccaccgtc tgggcgtgca cctcaaaacg gcccgccatg tcccgcatcg 4060561 cgtgcggatc cgtcataaaa cgcgaggtca tatgaattcc tccctttgaa tcgtcgaatt 4060621 cgatcctcga tcaacgaacg tagttggtca tccgccagcc ggcggttggg cgatcacggt 4060681 cggcttgaat ccgtatcgag gggcagcgaa gttgttaaac gcacgtccgg caccgctacc 4060741 catgagcggc atcccgccaa acgcgtgtgt cgaaccttca gcggcggccg cggctccgag 4060801 gccgttggac gccgccagca ccgcggggct cgccgccggg gtcgtggcgg tccaaacggc 4060861 cggaaccttc aatcccccga ccgacgccgc ctgaccgacg gcgcccgcaa cgccgctcag 4060921 gccagcactc ggaatggccg gaacggcggc cggcaacgcc ttggcggcct caccggcggc 4060981 tttggcgcct tcactcgccc acttcggcag gtcgtgcgcc aggccaaagt agtccttgaa 4061041 ttgggtgacc atgagccgag cgggcgagac ccatttgccg aacgtgtcca tggccacgct 4061101 ggcatcaagt tcggccgacc cggtcacacc ctgcacaaag tcgccaagca ctccgcccac 4061161 gatgagcccg ctaccgtccg aggaccaggt gtgcccggtc aaaccgagcg ccttgccaag 4061221 gtcggtgagc caaggcggtt cattggtgaa gattccgcta agcccaaaca acgctttagg 4061281 aatgtcggtg agtgcttgcg catttgcggc cccgctgaca gcttgtccga cagatgcggc 4061341 ctggctggcc agcccggccg ggttgatggt ctgcgccgcc ggattgaatg gcgacaactg 4061401 cgtcgccgcc gccgacgcgc cagcatagcc gtacatcgcg gccgcatcct gggcccacat 4061461 ctcggcgtat tgcgcctcgg tggccgcgat cgccgccgtg ttctggccaa ggaagttcgt 4061521 cgccagcaac gccatcaaca acgccctgtt ggccgcgatc tccgggggcg gcacggtcgc 4061581 gaaaaacgcc gcctcataag cactcgccgc tgccaccgct tggctgccgg cttgctcggc 4061641 ctgcccggcg gtgctcctca accacgccac ctggggcgtg gcggcagcca ccatggacgc 4061701 cgcggaggac ccctgccatg gcccgtcggc caggccagtg atcagagcgt cgtaggtgga 4061761 cgccgtggtt tgcaactcgg cggccagcgc ctcccaggcc gccgcggcag ccagcatcgg 4061821 tcccgaaccg ggtccggcgt acatcagcgc ggagttgacc tccggcggta actgagcaaa 4061881 gtccagcatc ggcctctcct aagcgatcgt ggcggcgttg gcagcctcgg tggccgcata 4061941 tgaaccggcg ctgatgccca gcgtggtcgc caactgctcc tggaccgcca tcgcctcggc 4062001 actaatcgcc tggtacagct gtgcatgcgc ggcaaactgg gaggcggtta gcagggacac 4062061 caaatcagcg gcggccggaa ccacacccgt cgtcgggccc gccaccgctg catttccggc 4062121 ccgcgcaacg gcgttgatcg actgcagttc ccccgcggtc gcagccagca tctctggctc 4062181 ggcgtgcatg atcgacatgg tattttcctc cctctaatgc acgttgcatc aatagctttc 4062241 ggcgttcccc gctgaagacc acacgacagg ctagccatcc ttataggaac atcacagact 4062301 tcacacaggt tgttcacggc taagtcaata acaattcatt tacttcaagg gcatttccgt 4062361 agcttttgaa attcccctga aattcattgg taacaagtaa tttgagtttg gtatgaattt 4062421 cggggtactg gcatcgacgg gccctacaat gcgcaacttg cgcacaccac gccacgctga 4062481 agcaaccgtc gaccgattca acggcgcagg cgctagggtc ccaggcatga tccgattggt 4062541 ccgtcattcg atcgccctgg tggccgccgg ccttgccgcc gcattgtcgg ggtgcgattc 4062601 ccacaactcg ggatcgctcg gtgccgatcc gcggcaggtg accgtgttcg gatccgggca 4062661 agtgcagggt gtgccggaca cgttgatcgc tgacgtcggc attcaggtca ccgcggccga 4062721 cgtcaccagc gcgatgaacc agaccaatga tcgccagcaa gcggtgatcg atgcactggt 4062781 gggtgccggc ctggaccgca aggacatccg caccaccagg gtcaccgtgg caccgcagta 4062841 cagcaatccg gagccggccg gaaccgccac catcaccggg tatcgggcag acaacgacat 4062901 cgaggtgaag atccacccga ccgacgccgc gtcgcggctg ctggccctcg tcgtcagcac 4062961 cggcggtgac gccacccgga tcagctcggt cagctactcg attggcgacg actcgcagct 4063021 ggtgaaggat gcccgggcgc gcgccttcca agacgccaag aaccgtgcgg accagtacgc 4063081 acaactgtcg gggctgcggc taggcaaggt gatctcgatc tccgaggcat ctggcgccgc 4063141 gcccacgcac gaggcgccgg cgccgccgcg cggcctatcc gcggtgccgc tggaacccgg 4063201 ccagcagacg gtgggcttct cggtcacggt ggtctgggaa ctgacctagc cgcctactga 4063261 tagaccctgg ggtccagcgt cccgatgtat gacaggtcac ggtagcgttc gtcgtagtcc 4063321 aggccgtagc ccacgacgaa gtcgttggga atgtcgaaac ccacgtacgc gatttcgacg 4063381 ttggcgtgca ccgcatcggg cttgcgcagc agcgtgcaca cccgcaatga ccgcggattc 4063441 cggctcgtca ggttccgcga caaccacgaa agcgtaaggc cggagtcgac gacgtcctcg 4063501 acgatcagca cgtcgcggcc gtggatgtcg cggtcgaggt ccttgaggat ccgcaccacg 4063561 cccgacgagg atgtcgatga cccatacgaa ctcaccgcca tgaactcgaa ctgggtcggc 4063621 acgggaatcg ctcgcgccag gtcggtgacg aagagcaccg cgcccttcag cacggtgatc 4063681 agcagcagat cctggccggt ggtagcggac agctcgcggt agtcgttgcc gatctgctcg 4063741 ccgagctcgg cgatgcgggc ctgaatctgc tcggccgtga gcagcaccga cttgatgtcc 4063801 cccggataaa gctccgccgt ctgcccgggg gtgatcgccg aggagctctg ggtcacgtgc 4063861 acagcgtgcc acgccgcggg accaacgacc aacgcgggcg tcaaacgggc tcgcgccgca 4063921 acacaagtac gccgtcgcgc cgcccggcga ccagtcgctg accgcgcaac gtggacccaa 4063981 ccgctacccc gccctgaccg cgccacgcgg tgaccagccg gtccactccg cggatctgcc 4064041 tgtcggtcag tccggtcgcg ccgccggcca gcagccagcc ccgaatcacc cggcgccgca 4064101 ccgcatccgg cagcgcggtc aaggcgctgg tactcaactc ctgtccccgt gagccagcaa 4064161 cagcggctcc gggcagcgcc tgcgcagcga tcgtgtcgat gaggtcagtg tcctcgcgca 4064221 acgctgtcgc ggtgcgagcc agcgcttcgg ccacacctcc gcccagcacg tcctccagca 4064281 gtggcagcac ttcggtgcgc aatcgggttc gggtgaagcg gcggtcggtg ttgtgcggat 4064341 cctgccaggc ggtcaggccc agctcccggc aggccgcatg tgtcacgctg cggcgcaccc 4064401 ccagcagcgg ccggcaccag ggcggatcgt acggacgcat gccggcgatc gaccgggccc 4064461 ccgaaccacg gccaagcccc aacaacactg tctcggcctg atcatcgagc gtatgggcca 4064521 acagcaccgg gccatcgcgg tgctcctcca atgccgagta gcgggcgctg cgcgccgccg 4064581 cctcccggcc gccggccgcg cccacctgaa cgcaaagcac ccgcgcgtcc acacatccca 4064641 gcgaaatcgc ttgtatgcga gctgtttccg cgaccgtggc cgagccgggc tgcagaccgt 4064701 ggtccacgat cagtgcggtg gtgggccaca gccgtgcggc tacagcggtg agcgccaacg 4064761 agtccgggcc gccggagagc cccacgctcc aacggtcgca ggcgtcgaga tggacccgag 4064821 cgaactgctc cgcagccgca cgcagctgcg ctacagcact ctgtcgatcc atcgctgcgg 4064881 gttttcgatc tcggcaggca acggcagcgt ctcggggccc gaccagatcg tgttgaacag 4064941 cttcattccc gcccggtcga ccacatggtc gacgaatgcc ttgcctcggg tgtactggct 4065001 gagcttggcg tcgaagccca gcagagctcg caccagccgc tgcagcggcg gctgtttgtg 4065061 atgacgacgg tcgtcgaagc ggcggcggat ggtggccacc gagggcacca ccatcggccc 4065121 gaccgcatcc atcacatgct cggcatggcc ttccagcagc gtgccaagta ccagcagctg 4065181 gtctaaggcc ttacgttgcg gctcggattg cacggctcgc accaggccca gaatgcccga 4065241 cgggttgacc tcggaatcgt cggtaccgtg tccacggctg cggatgaagt ccgccagccg 4065301 gctcaccacc cgcccgatgt cgtcaacggg ttcgaaggtc aacaggttta gcgcctgcga 4065361 catgtagccg gacagccagg ggttggcggt gaactggact cggtgggtga cctcgtgcag 4065421 gcacacccac aaccggaaat cggacggctc gacccgcagt tgacgctcga cggcgatcac 4065481 attgggatat accagcagca agcagccttc tccggcggct ccgaacgggt cgtactggcc 4065541 gaggatgccc gaggccacaa acgccagcac ggcaccggtc tgcgcaccgg tgatccgacc 4065601 ggtgagaaac ccccgcggtt tggcgcttcc gtgcgtcatc gcccgcatcg attcggcggc 4065661 cgagcgaatc cacgccggcc ggtcgacgac acgggccggc ggcaccacac cgtcggcgat 4065721 cagaccggtg acgtcgcgca ccggcggttc ggccttctcc gccgcgacgg tcagctcgtc 4065781 gatcacctgg cgacgggtgt attcggtgga cggcggagcg ggccgggcca gccgctcccc 4065841 gacgctggcc gcaaattccc aatcgaccgt gttccccagt gtcagctcgg acgctccggt 4065901 cacgtcgtgc acccgcagaa ccacaactta gtggccagag cgtccatcgc gttgcgaccg 4065961 ttgggaccgg cttcgttgga gatgaacgcg aaggtgagca ctcggccgct acggtcggtg 4066021 agcaccccga ctagcgagtt gatcgcggtc agcgagccgg tcttggcccg caaccacccg 4066081 gccggaccct ggtcggtggc cgcgtcgagg aagcgctcgc ccagcgtgcc actgccaccg 4066141 gcgatcggta gcagatccag cagcggccgc aacgcgggct ggtcgggtcc agccgcggcc 4066201 tgcatcgttg catcgagcgt ccgagcggtc aggcggttgt cgagcgacaa tccactagaa 4066261 tccaccagcg cagcgccggc ggtgtcgatg tgtgcggtgt tcaatcggct ggtcaccgcg 4066321 tcgaccgcgc cactaaagct ctgcggccgg ttgatcgcga ccgctacctc gcggccgatg 4066381 cactcggcca tcacattgtc ggaggcgttc atcatctgag acagtcgctg gatcaacggc 4066441 gccgactgca ccacggccag ctgccgcgcg ccggccggag ccgatgcgat cgtcaccgcc 4066501 gcggggtcca ggccaagggc tttggccaac tcccgaccgg catccagcgc cggggtgcgg 4066561 gaccgtctcg aattgacggt ggtcggctgg atacgcccgg cgtcgatcat cgccgcttcg 4066621 atcggcgcga tgtcaccgtt gtcgatatcg gccggatccc aacccggcgc catcgtcgga 4066681 ccgctaaacg ccgaagcgtc cacctgcacg gcggtgggcg tcacaccgct gcggcgaatt 4066741 tgttcgacga ggtcaccgat gcgagccgcg ccgtgatacc aggtgtcctg accgggcggc 4066801 gctgccgaca gcgtcggatc gcccgcgccc accaacacga caggtccctg ggggttctgg 4066861 ccgccggcca ccacccgcgt gctgatccgg gcctgtcggt ccagtgtcag cagagccgcc 4066921 gccgccgtca ggattttgtt ggtcgaagcc ggcaccaagg gcacgtcgtc tagccgctgc 4066981 caaagttctt gtccggtcag ggcatcggtg atccgacctg ctaacttgcc cagatcagga 4067041 tcggccgcca ccaccgcaag cgccgcggtc acgccagcgg cactcggtgt cgcagcggtg 4067101 tccgccacag ggaccactcc cgccttgact gtgggtggcc gcggtggagg cgcaggtgcg 4067161 cgcacgccag cccggtgacc accagtagtg accagcgctg cggccgccac cacaacggcg 4067221 acaaacgcca gcacggccgc gccgacgacc acgtgcgtgg atttccgcca gcgtgtggga 4067281 cccatgagct ctcctgcctt tccggtccca ttctgccgaa ccggccgggc gacgctgcca 4067341 cggtaccggc tcgactaggg tgtccacgga cgcattggac ctgcccgttg tcccatgcac 4067401 tctgatctga aggagccgac gcgtgcaatt cgacgtgacc atcgaaattc ccaagggcca 4067461 gcgcaacaaa tacgaggtcg accatgagac ggggcgggtt cgtctggacc ggtacctgta 4067521 caccccgatg gcctacccga ccgactacgg cttcatcgag gacaccctag gtgacgatgg 4067581 cgacccgctg gacgcgctgg tgctgctacc gcagccggtc ttccccgggg tgctggtggc 4067641 ggcgcggccg gtggggatgt tccggatggt cgacgagcac ggcggcgacg acaaagtgct 4067701 gtgcgtccca gccggtgacc cccggtggga ccacgtccaa gacatcgggg acgttccggc 4067761 tttcgagctg gatgcgatca agcatttctt tgtgcactac aaggacctgg aaccaggtaa 4067821 gttcgtcaag gcggccgact gggtcgaccg cgccgaagcc gaggcagagg tgcagcgttc 4067881 agtggagcgc ttcaaggccg gtacacactg atttgggctt agggcgcccg ccccgcgcct 4067941 tggcaccctc cgccggtcat gatccgaact tcgtggggga cctgactgtt aggcgattgc 4068001 gccgcacact ctcggtgaac gccgccccga taaaaaccac ccccaccgaa gcggtgaccc 4068061 actcggggac ggcgaatcgg tggtcgatgg acaacagcaa gattatggcg agcgcgccaa 4068121 tcgcccagtg tgcgccgtgt tccaggtaca cgtaccggtc cagtgtgtcc tgtcgcacca 4068181 gatagatcgt gatcgaccgg acaaacatcg cacccaccac accaaggccg agcgcgatga 4068241 tgatcgggtc cgtagtgatc gcaaaggccc cggtgacgcc gtcgaaagag aaggcggcgt 4068301 cgagcacctc cagatacagg aacaacgcgc aaccagcctt tccggccgcc tgcctcgcct 4068361 gcacgcccgg cgtggcttca cccaaccccg ccggccggaa cgcccggctg atcccgttga 4068421 cgacaagata ggtcaccatg cccaaaaggc cggcgatcag caccgtaccc cgctgatcgc 4068481 tggagtgtgt caacagcgcg ccggcaagga ccaacccaac actggccact atcaccggga 4068541 cctgaccgag tcgaccgatg cgggcaaagg ggacctcaat ccacttcagc catttgatat 4068601 cgcggtcgtg aacgacgaag tccaggaaaa gcatcagcag gaacatgccg ccgaacgccg 4068661 cgatctgcgg atgcgcagcg gtgatcagtt tttcatagct gggcgatccg tccgcaaatt 4068721 ccagcgcgcc atgggccggt ggacgaagcg ccagctccat tgcgcggacg gggtccaggc 4068781 ccgcggtggt ccagatgatg gccagcggga acaccagccg catcccgaac accgcaataa 4068841 gaatcccgat ggtcaggaac atccgctgcc aaaacgggct catccgctgc agaatcgcgg 4068901 cgttgatgat ggcgttgtcg aacgacagcg atacctcaag gagcgccaga accgccagca 4068961 agaacagggc ggtcggcccg ccgtgcaaat atccggtaac caacgccacc accgtcatca 4069021 gcagcgagaa gccgaagatg cggaacgttg acatggatcc ttccgaggaa aaaccccaca 4069081 atagcgacga accgacatca attggtcagg ctcgcgccgc gcagcgcggc caaccggccc 4069141 gcctactatt ttcagtcgtg acgatccatg tcggttggcc gttggcgccg ccgcggtgac 4069201 cgaagtcggc gatacggcat ctcctgttgg ctcctcgggc gcctctggcg gagctatcgc 4069261 aagcggcagc gtagcccggg tcggcacggc ggccgcggtt accgcgctgt gcggctacgc 4069321 ggtgatttat ctggcggccc gcaacctggc tcccaacggc ttctcggtat tcggggtgtt 4069381 ctggggcgca ttcggactgg tcaccggggc cgccaacggc ctgctgcaag aaaccacccg 4069441 cgaggtccgc tcgctggggt acttggacgt ctctgcagac ggccgccgta cccatccgct 4069501 gcgggtctcc gggatggtcg gcctcggctc gttggtcgtg atcgccggta gctcaccgtt 4069561 gtggagcggg cgggtattcg ccgaggcgcg ctggctatcg gtcgcattgc tcagcatcgg 4069621 gctggctggg ttttgcctac acgccaccct gctgggcatg ctggccggca ccaaccggtg 4069681 gacccagtac ggcgcgctga tggtggccga cgcggtcatc cgggtggtgg tcgccgcggc 4069741 cacgttcgtg atcggatggc agctggtcgg gttcatctgg gcaaccgtgg cgggttcggt 4069801 tgcctggctg atcatgttga tgacctcacc cccgacacgc gcggccgccc gcttgatgac 4069861 gcccggcgct actgcgacat tcctgagggg cgccgcccat tcgatcatcg cggccggtgc 4069921 cagcgcgata ttggtgatgg ggtttccggt cttgctgaag ctaacctcca atgaactggg 4069981 cgcgcaggga ggcgttgtca tccttgcggt gacgttaacc cgggcgccac tgctggtgcc 4070041 actgaccgcc atgcaaggca acctcatcgc gcatttcgtc gatgaacgca ccgagcggat 4070101 tcgggcgcta atcgcgccgg cggcgctcat cggcggcgtt ggcgcagtcg ggatgctggc 4070161 ggccggcgtc gtaggtccat ggattatgcg cgtcgcgttc gggtcggaat accagtccag 4070221 cagcgcattg ctggcctggt tgacggcggc cgcggtggcg atcgcaatgc tgacactcac 4070281 cggtgccgcc gcggtcgcgg ccgcactgca ccgggcgtat tcgctgggct gggttggtgc 4070341 gacggttggg tcgggcttgt tgctgctgct gccgctgtcc ttggagaccc gcaccgtggt 4070401 cgcgttgtta tgcggtccgc tggtgggaat cggcgtccat ttggtggcgc tggcgcggac 4070461 ggacgagtaa gcggccgatc agccccggac caacgtgtaa cttgtgggct taaatggcct 4070521 cgaaaatgga cactgaaacg cactactcgg acgtctgggt cgtcattccc gccttcaacg 4070581 aagccgccgt gatcggcaag gtcgtcaccg atgtgcggtc agtcttcgac cacgtcgtct 4070641 gcgtggacga cggcagcacc gacggcaccg gcgacatcgc ccggcggtcc ggtgctcacc 4070701 tcgtacgcca tccgatcaac ctgggccagg gggcggccat tcagaccgga atcgagtacg 4070761 cccgcaagca gccgggcgcc caggtctttg ccacctttga cggcgacggc cagcaccgcg 4070821 tcaaagacgt ggccgcaatg gtcgaccggc tcggcgcagg tgacgtcgat gtggtgatcg 4070881 gaacgcggtt cggccggccc gtgggcaaag cttcggccag ccgaccgcca ctgatgaagc 4070941 ggatcgtgct gcagacagga gcgcggttga gccgtcgagg ccgccgactt ggcttgaccg 4071001 acaccaacaa tggcctgagg gtgttcaaca agaccgtggc cgacgggctg aacatcacca 4071061 tgagcggcat gagccacgcc accgagttca tcatgttgat cgccgaaaac cattggcggg 4071121 tagcggaaga accggtcgag gtgctctaca ccgagtattc gaagtcgaaa ggccaaccgc 4071181 tgctcaacgg cgtcaacatc attttcgacg ggtttctgcg agggaggatg ccacgatgaa 4071241 ctggatccag gtgctgttga tcgcgtcgat catcgggttg ctgttctacc tgttgcggtc 4071301 gcgccgaagc gcgcggtcgc gtgcctgggt caaggtgggc tatgtcttgt tcgtgctcgc 4071361 cggcatctat gccgtgctga gaccggacga caccacagtg gtcgcaaact ggtttggggt 4071421 gcgccgcggc accgacctga tgctctacgc actggtgatg gcgttcagtt tcaccacact 4071481 gagcacctac atgcggttca aggacctcga gttacgctac gcgcgcatcg cccgggctct 4071541 ggcacttgag ggcgcacagg cgcccgaaca gtgccggtaa gacccagcca cttgagggcg 4071601 cacaggcgcc cgaattaagc cgcgattcga tctgcgcaga ccgtagccag gaaggacccg 4071661 gcggcctaca gttcttagag ttactgcatc tctgaccagc aggaggcgat atgtccgacc 4071721 ctgacgacgt caccacatca tctgacgacc gcgacgaggg cgaaccggaa atagacctgc 4071781 tgccggcctg atgactcaga gctcatcggt cgaacgcctg gtcggcgaga tcgacgagtt 4071841 cggttacacc gtagtcgagg atgtcctcga cgccgattcg gttgccgcat acctagcgga 4071901 tacccgtcgg ctggaacggg agctaccgac cgtcatcgcc aactccacaa ccgtcgtcaa 4071961 gggcctggcg cggcccggcc atgtcccggt cgaccgggtc gaccacgact gggtgcgcat 4072021 cgacaacttg ttgctgcacg gcacccgcta cgaggcgctg ccggtacacc ccaagctgct 4072081 gccggtcatc gagggtgtgc ttggccgcga ctgcctgttg tcgtggtgta tgacgagcaa 4072141 ccagctgccg ggcgcggtgg ctcagcgctt gcactgcgac gacgaaatgt atccgctgcc 4072201 gcggccgcat caaccgctgc tgtgcaacgc gttgatcgcg ctgtgcgatt tcaccgccga 4072261 caacggcgcc acccaagtgg tgcccggttc acatcgctgg cccgagcggc cgtcgccgcc 4072321 atacccggag ggcaagccgg tcgagatcaa tgcgggcgac gcgttgatct ggaatggcag 4072381 cctgtggcat accgccgcag cgaaccgcac cgatgccccg cggccggcat tgaccatcaa 4072441 cttctgcgtg gggttcgtgc gccagcaggt caatcaacag ctgtccatcc cgcgagagtt 4072501 ggtgcgctgc tttgaacctc ggctacagga actgatcggc tacgggctat acgccggaaa 4072561 gatgggccga atcgactggc gaccgccggc cgactatctc gacgccgacc ggcatccgtt 4072621 cttggacgcc gtagcggacc gtctgcagac ttcggtcagg ctctgatcaa tcagtgtgct 4072681 tgtgccggaa gtactcgacc gtgcgacgca cgccgtcggc caactcgatc tgcggacgcc 4072741 agcccaaaac ccgttcggct aagccgatgt caaggcagga ccgcttaaga tcgcctagcc 4072801 gcggcgggtg gaactcaggg tcgtcgggcc cgccgacagc cgcggccacc gccgaatgca 4072861 gttggcggtc cgacgtttcc ttaccggtgc cgatgttgaa gcgcagccca ccgccgacgt 4072921 ccgcggacac ccggacaaac gcgtcgacca cgtcgtcgac aaacacatag tcgcgcgtat 4072981 tggtgccgtc gccgaacacc ctggtgggtt tgcccgagag cagcgcctgc gcgaagatcg 4073041 ctaccacacc cgcttcaccg tgtgggtcct ggcgaggacc gtagacgtta gccggtgcga 4073101 tatgcgagca gtccaggccg tagagatgtc gaaaggtgtt caggtagatt tcgccggcca 4073161 ctttgcccgc ggcatacggc gaggccggat cggtgggcgc tgtctcaggg gttggatact 4073221 ccggcggggt gccatagatc gatcctcccg aggaggtgtg cacgatcttg cggacaccgg 4073281 tctgccgcgc ggcctcggct aggcgcaccg tgccgatgac attgaccgcg gcgtcgaatt 4073341 gcgggtcagc caccgaacgg cggacatcga tctgggccgc caggtgaaat accacctcgg 4073401 gccggtgctg ctcgaggatg gcgtgtagat cggcggtcac aatgtcggct tcgacgaaga 4073461 cgtgtgcgga gttgtcggcc agatgctcga ggttggtcgc ccggccggtc gcgaagttgt 4073521 ccaatcccac caccgaatga ccatctgcca gcaaccggtc gactaacgtc gagccgatga 4073581 atccggccgc cccagtgacc agtgcgcgca ccggcccacc ataccggcgg cccatgccag 4073641 cgccccgtat gcctcgggtc gccctggtcg ccgtattgct gatcacggtg cagctggtgg 4073701 ttcgcgtggt gctggcattt gggggctatt tctattggga cgacttgatc ctcgtcggca 4073761 gggccggcac tgggggcctg ttgtcgccgt cgtacctgtt cgacgaccac gacggccacg 4073821 tgatgcccgg tgccttcctg gttgcgggcg ccattatccg ggtggcaccc ctggtgtgga 4073881 ccggaccagc gatcagcctg gtggtgctgc agctgctgga gtcgctggcg ttgctgcgcg 4073941 cgttgtatgt gatatcgagc tggcggccgg tactcctgat cccattgacg ttcgcgctgt 4074001 tcacaccgct agcggtgccg gggttcgcgt ggtgggcggc tgcgctcaac tcgctgccga 4074061 tgctggccgc gctggcgtgg gtgtgcgccg atgccatcct gctggtgcgg accggcaacc 4074121 accgctacgc cgtcaccggt gtcctggttt acctcggtgg cctgctgttc ttcgagaagg 4074181 ccgcggtgat cccgttcgtc tccttcgcgg tggccgcgct gcagtgccat gtgcgcggcg 4074241 accggtcagc tttggcgacg gtgtggcggg ccggtgtccg gttgtggacg ccgtcgctgg 4074301 cactgaccgt cggctgggta gccctttatc tggcggtggt ggatcaacgg cgatggagtt 4074361 ccgatctgtc gatgacgtgg gatctgctgt gccgttcggt cacccacggc atagtgccgg 4074421 cactggccgg cgggccgtgg gactgggcgc gctgggctcc ggcatccccg tgggccactc 4074481 ccccggcggt ggtgatggtg ctcggctggc tggtgttgat cgcagtgctt gcgctgtcac 4074541 tggtccgcaa gcgacgcatc ggcccggtgt ggctgaccgc ggccggctac gcggtggcct 4074601 gccaggtgcc gatctttctg atgcgctcgt cgccgttcac cgcgctcgag ttggcccaga 4074661 ccctccggta cttcccggat cttgtcgtcg tgctggcgct gctagccgcc gtcgcgctgc 4074721 aggcacccaa tcgcgccggc acccgctggc tggacgcctc gccggcccga gccgttgcga 4074781 cagtcgcttc ggccgtgttg tttttgacca gcagcctgta ttcgaccgcg acgtttctgg 4074841 ccagttggcg tgacaacccc accgagggat acctgaagaa cgcccaggca agtctggccg 4074901 cggccgcgtc aggtgcgccg ctactggatc aggaagtcga tccgctggtg ttgcaacgag 4074961 tggcctggcc ggagaacttg gccagccaca tgttcgccct gctgcgcgtc cgaccggaat 4075021 tcgctacgac aacaacacaa ttgagaatgt tcaccagcac aggtcggctg gtcgacgcga 4075081 aagtgacctg ggtccggacg atcatcgcgg ggccggtgcc gcagtgcggc tacttcgtcc 4075141 agccggaccg gccggaacgt ctgatcctcg acggcccctt gctgcccggc gactggaccg 4075201 tcgaactcaa ctacctggcc aacagcgacg gctcgatggc gctggcactt tctgacggac 4075261 ctgagcggaa ggttccggtg catccgggtc tcaatcgggt gtacgcccgg ctaccagggg 4075321 ccggcgacgc aatcacggtg cgagccaaca ccaccgcgct ttcgctgtgc atcggagcgg 4075381 cgccggtggg atttctggca ccggcctgac ctcaacgccg gtcgccacag ccgctcaaac 4075441 gtggcggccg cgcgtattcg accgtccgta gtggttcgtt aaagcgttgc agtacaacgc 4075501 atacaacaat caatcggcca ttgagttcgc acgctcatgc agttgcgaat ggtcggtgga 4075561 tgctcgaagc caatgcagaa agcgaccggc tcgatgagct gcaccagcag tatcaccgag 4075621 atgatcttgg cggtaatcag gcttgtatct cttgtagtgt ggcggcggca actgaatact 4075681 gaccagagcg cggcaactga aaattgacca gcttcctgga gagccttggc tatgggccaa 4075741 ggaggaagcg agtgttgagc gtggaggatt gggccgagat ccggcggttg cgccggtcgg 4075801 agcggttgcc gatttcggag atcgcgcggg tgttgaagat ttcgcggaac acggtgaagt 4075861 cggcgttggc ctccgatggg ccgccgaagt accagcgtgc ggcgaagggc tcggttgcag 4075921 atgaggccga gccgcggatc cgggagttgt tggcagccta tccgcggatg cctgcgacgg 4075981 tgatcgccga gcggatcggt tggtggtatt cgatccggac gctcagcggg cgagtacgcg 4076041 agttgcggcc gctgtatctg ccgccggatc cggcgtcgcg cgacatatgt ggccggtgag 4076101 atcgggcagt gcgacttctg gttccccgat gtcgttgtgc cggtggggta cggccaggtc 4076161 cgcaccgcca cggcgttacc tgtgctgacc atggtgtgtg ggtattcgcg gtgggcctcg 4076221 gcgctgttga tcccgacacg caccgccgaa gacttgtatg ccgggtggtg gcagcatctt 4076281 tcgacgttgg gcgccgttcc aagggtgttg gtgtgggacg gcgagggcgc ggtcgggcgg 4076341 tggtgggcgc gccaacctga actgactgcg gcatgccatg ccttccgcgg caccctggcc 4076401 gccaaagtgt ggatctgtaa accggtgatc ccgaagccaa ggggctggtc gaacgtttcc 4076461 acgactacct ggagcgggcg ttcttgccgg gtcgggtctt tgcctctccg gcggatttca 4076521 atacccagtt gcaggcctgg ctggtgcggg ccaatcaccg ccagcaccga gtgctgggat 4076581 gtcgaccggc agatcgcatc gaggccgata ccgcagcgat gctgacattg ccgccggtcg 4076641 ggcccagcat cgggtggcga acctcgacac ggctgccgcg cgatcattac gtgcgcctcg 4076701 acggcaacga ctactcggtg catccggtcg cgatcggccg gcgcatcgag atcaccgcag 4076761 atctgagccg ggtccgggtc tggtgtggcg gcaccctggt cgccgatcat gaccgcatct 4076821 gggccaaaca ccagacgatc agcgatcccg agcatgtcgt ggccgccaaa ctgctgcgac 4076881 gcaaacggtt cgacatcgtc ggtccacccc accacgttga ggtcgaacaa cgtctcctga 4076941 ccacctacga caccgtgttg ggccttgacg ggccggtggc ctgatggcag ccaagaccgc 4077001 taccaacagc cgcgatgtgg ccgccgagct ggcgtatctg acccgggcgc tgaaagcccc 4077061 caccctgcgc ggggccatcg agcagctcgc tgaccgcgcc cgcaccaaga cttggagcta 4077121 tgaggagttc ctcgcagcgt gtctgcaacg cgaggtgtcg gcccgcgaat cccacggcgg 4077181 cgaaggacgc atcagggccg cccgcttccc atcgcgcaag tcgttggagg agttcgactt 4077241 cgaccacgcc cgcggtctca aacgcgacac catagcgcat ctgggcaccc tggacttcgt 4077301 caccctagca atcgggatcg cgatccgcgc ctgccaggcc ggccaccgcg tcctattcgc 4077361 caccgcctcg caatgggttg atcgtctggc cgccgcccac cacagcggca ccctgcaatc 4077421 tgaactgatt cggctggccc gatacccgct gctggtcgtc gacgaagtgg gctacatccc 4077481 cttcgaaccc gaagccgcca acctgttctt ccaattggtg tcgtcccgct acgaacgggc 4077541 cagcctcatc gtcacgtcaa ataagccctt cgggcgctgg ggcgaagtat tcggcgacga 4077601 cgtcgtagcc gcggccatga tcgaccgact cgtgcaccac gccgaagtca tcgcactcaa 4077661 aggagacagc taccgcatca aagaccgaga cctcggccgc gtccccaccg tcacggccga 4077721 cgaccaatga aaccaagctg gtcaattttc gattgccgac acctgatcag ttttcggttg 4077781 ccgttgacat agtgcccaaa acacgcaccc acatcagatg cagaacccct tgacaaccaa 4077841 tagggaatct cttcgcatga tggaggttgc tggcaccaat ccatcaggaa ggcccttgtt 4077901 gaccggcact gggttggggg tccaccgcga tgggtgagta tggcaagtgc ggcacgtatg 4077961 cacccgtctt ggtgcacgcg gccaagggca gcccgttagc gccgtcgccc agcgtgaact 4078021 gagggcggag aatcggccgg aatctcgccc tcagtgcacg ctcggcgccg tttggcctca 4078081 cccggtcaac gtgaactgtc cggggcgggc actgtcgcgt agcgagccca cgtggggccg 4078141 gggtcggccc gccaaaaacg ccccggcgcg gccagctcat gagcgggtac gcaagctcaa 4078201 gcagatctcc gtagccgtga cggagtgctt catcgatgtc cgcagcgatg gcagcggcca 4078261 gtgcgtgcct aaacccgtct tgcgcagagt ctttcgcagc gggcgggtag ttgcacgtcg 4078321 tcgccgaagt gctgacgatc ccgttgcggt cggagaccgc gagtagccag cgcgcgtccg 4078381 gggcagcatc tcgcgcagca cgctgaagtg tcgcggcacc ggaagccggg ggcgtgaaga 4078441 gacccgccat gacaccggct ggacggcgcg gggcagagtc ccgcggagtg gtgggcttcg 4078501 acgttgagtt cgtcggtgcc tactggccgc cgctgattgc ggcgaccaca gcattatcgc 4078561 tatcggggta gagcagcgcc atagaggcct cggagaggta gcggcgctcg ctggcctgcc 4078621 attcgtcgtg catgtcggcc aggacggcgc ccacaaggcg gatcacggct gcaggattcg 4078681 ggaagatccc cacgacgcgg gagcgtcgct tgatctcctt attgatgcgc tccaatggat 4078741 tggtcgacca gatcttttgc cagtgcgcct tgggaaatgc ggtgaacgcc aatacttctg 4078801 ccctggcgtc gtccatcagc gggccgatct tgggaaacga cgcggcgagg cgatcacgga 4078861 ccccctccca ggtcgcgtgc accgcctcgg cgtcgggtgc cgagaaaatc attcgaaaca 4078921 tgctggcgac catgtcggcc ttgtccttgg gcacgtgggc gagcagattg cgcgcgaagt 4078981 gcacccgaca gcgctgatgc ccagcgccct ggaaacagcg cttcaacgcc ttcaccagcc 4079041 cggcgtgctg gtcactgatc accagccgga caccaccgag gccgcgcccc ttgagcgagg 4079101 tcaggaaccc gcgccagaag gtctcatcct cgctgtcgcc gacgtcgagg ccgaggatct 4079161 cgcgtgaccc gtcggcggcg atgccgctgg caacgatgac ggccatcgac accacctggc 4079221 cagtaccgtt gcgcacgttg agataggtgg cgtcgaggta gacgtagggg aactcgatgt 4079281 gcccgagcgt gcgggtgcgg aacgcgccga cgatctcgtc gagtccggca cagatccgcg 4079341 acacctcgga tttggagatg ccggtctcca cacccatcgc ctcgaccagg tcgtcgaccg 4079401 cacgggtaga gataccgtgc acgtaggcct ccatcaccac cgcgtacaag gcctgatcga 4079461 tccgccggcg cggctcgagg atcgccggga agaaagagcc cttgcgcagc ttagggattc 4079521 gcagttccac gtcaccggcc tgcgtggaca gcacccgcga tcgggcaccg ttgcgatcgg 4079581 tcacccgagt gtcgctgcgt tcataacggg cagcgccgat ccgttcagtg gcttcgagct 4079641 cgctgagttc ctgcaacacc agacggacgg catcacggat caagtcgacg ccatcaccag 4079701 tgcggaacgc gtcgagcaac tcggacaggg cagactgtgg caaggccatc ggcgggatct 4079761 ccttcggtgc gtgcttggcg gtacacaccg acgatctcgc cgacggcccc tacctcatcg 4079821 gagccactcc gcaacaaccc ctaaacccac cacgctgcgg gacgcttacc ggcggcgtgg 4079881 cacaacgttc ggtatcgctg atcggcatca ggaggttagt gcgatcagaa gtcgtaagtg 4079941 ggctcggcgt cgaggatccc cttgaacatc gcgaccaggc ccgtgagatc agagttggcg 4080001 cgcgccacgt gacaagcgcc gtgcaactct tccaggtcgg tcttccccca gtcgaggcca 4080061 gaaccgcgtt cggacaacag gagatcgaag aactcgcggg tcgagcggcc gttgccctcg 4080121 cggaacgggt gggcatagtt cacgtagtcg taccggtatg cgacctggcc agcgagatca 4080181 ccttcgccga ccgctctgag ccggtcgagc tggtagatct ccgcagccac atgctccatg 4080241 ggccgactga tgccgcccgg cgcgcagaaa gactcgtcct ccttctcgat gccgactgtc 4080301 cgcagatctc ccgcccagac gtaaatgtcc tggaacagct ggcggtgaat cgcccgcagg 4080361 tatgcgagat ctgtgcggtc gcccagcaga ttgggatcct cgcggagttc gatcacccgg 4080421 gcctcaacga ggtcgttctc ggcatcacgc agttcggcat gcgttcgagc gccgacccgg 4080481 ttcctcaaga cggacatagc ggggatgaag tagccctgcc aattccgttc gtgatcgccg 4080541 gtgtcccatg gatgcggcac tccaccccgg ttactggatg ttgtaccggc ggcggacgcg 4080601 ctcacccaac tcggctgccg tgatcttgcc gcgggcgtag tcgttctgat cggcacgggt 4080661 ggcggcggtg ctgcgggtgc cctccagctc ggtgttgcgg cgagttgccc tgacattcct 4080721 gaagcgccgc ttcaccttct gcaactcggt cgcctggaca aacacttcat ctcatttggt 4080781 ggtcctgacc aggatagtcg acagcgctga cattgcagga agttgaccgt caagcacagc 4080841 acggttctcc accgctgatg tacgaccatc atgtctcgtt ggtcctgtaa tcgacggcgt 4080901 cccaccggct cgacaagaaa tcccaccagg tgactggacg caaggccggt ggggccccct 4080961 acaccgtcac catcccggag ttcggagccg cagctttgcg cgagcagcgg gcactggtca 4081021 tcccgttcga cccggtgttt ccggcccggc gcggcacccg ctagtccgag gccaacgtcg 4081081 cacccactgg cgggcgatcc gcggagagga cttcaaatgg gttgtcccgc actcgatccg 4081141 caagtccgtc gtcaccgcgg tggaacgctc gatagggctg gaagccgcgg cccagcaggc 4081201 cgggcacagc ggcagcgaga tcacccggcg gcactacgtc gagcggtccg tgacggtgcc 4081261 cgactacacc gccgccctgg acgagtattc gcgccctatc cgcgccttca ggccattaaa 4081321 gagcaacagg ccgggtgata taccgacctg acctgcaaag atggagccgc ctaggagaat 4081381 cgaactcctg acctattcat tacgagtgaa tcgctctacc gactgagcta aggcggcttt 4081441 tcccctgggt gcccgcttgc cgggcggcac gagtctacgg caggcgggcc ggcccgccca 4081501 agtttgcggc ggtcgctacc gcagttcctg gccgatggtg gcgaccatgg catcgacggc 4081561 gaacttcggt ttgacgttga ccgctagcgc ttccctgcac gccaacaccg cttcgatgca 4081621 gcgcagcagc cgctccggcg gggcgtgggc ggccagcgca gcaacccggt cggccatatc 4081681 cgggtggttg gcccgcaccc cacccgcgtg ggctgcgacc aacagtgcat cccggaagta 4081741 ggtcgccaga tcgatcagtg cccggtccag cgcatcgcgc gaggcccgcg tctgccggga 4081801 tttctgccgt cgttcaagat ccttcatcgc gccggtggca ccacgcaacg ccgcgccggt 4081861 gcctttaccg gtacctccgg ctcccagcgc cgtccgcagt tcttcggtct cggcctcgat 4081921 acgctgcgcg gtcaacgcta aggcctcggc ctcggcgccg gccaccaact cctcggcggc 4081981 tgcgtaggca cgcgagggtg tcgcggcgtc acgtgccagc cccaaagccc gctcgcgtcg 4082041 ctgccgggcc tgcggatcgg tggccagccg gcgcgctcgt ccgacatggc caccactgac 4082101 cgacgccgcc caattggccg tgtcggggtc caacccgtcg ccgtcgctca gcacctgcgc 4082161 gatcgcgtgg gtcgacggag tcaccaacgc gacatgccta caccgggatc gcagcgtgac 4082221 cgcaatgtcc tcgggatcca ccgacggcgc gcacagcagg aacaccgtcg acggcggcgg 4082281 ctcctcgaca accttgagca acgcgttggc ggcgccttcg gtcaaccgat cggcgtcctc 4082341 aatcaccacg atctgccagt gcccggtagt cggccggcgc gcggcgattt gcacgatggc 4082401 ccgcatttcg tccacaccga tcgacagacc ttcgggaatc acccggcgta cgtcggcgtg 4082461 ggtgcccgcc agcgtggtcg tacacgcccg gcagcgcccg cacccgggct ccccgcccga 4082521 cgtacattgc aaagccgccg cgaagcacag cgcggcaacc gagcgcccag aaccgggcgg 4082581 accggtgagc agccacgcgt gtgtcatagt cccgccgcca cccgcgctgt gagccgaatc 4082641 acgacgggcc gccttggccg tggcaagcag ctcggcttcc accgcttgct ggcctaccag 4082701 ccgcgtaaac accccggaca tcatcggcaa cagtagctat ccgcgccgac agataccgat 4082761 cagcgttcgt ttcgcgacaa ttccgtgatc tttcgtcgcc atttggatgg atgccgaggc 4082821 gttcgtcggt ttccggcaag tccccgccgc ccgatacggt gggctaatgg caaccacggc 4082881 ggcgctaccc agacggatcc atgcattcgt ccggtgggta gtgcgcactc cgtggccgct 4082941 gttctcgctg agcatgctgc agtccgacat catcggcgca ttgttcgtgc tcggattcct 4083001 gcgctacggc ctgccgcctc aggacaatat ccaactgcag gatctgccac cggtcaacct 4083061 actgatcttc gtcagcacgg taatcatctt gttcctcgcc ggggccgtgg tgaacctgaa 4083121 gctgctgatg ccggtctttc gatggcagcg ccgcgacaac ctgctcaccg agcctgatcc 4083181 ggccgccacc gagctggccc gcagccgcgc attgcgcatg ccgttgtacc gcactctgat 4083241 cagcctggcg gtctgggcta ccggcggcgg ggtgttcatc ctcgccagct ggtcggtggc 4083301 caagcatgcg gcccccgtcg tggcggtggc caccgcgctg ggtgccaccg ccaccgccat 4083361 catcggctac ctgcagtctg aacgggtgtt acggccggtg gccgtcgcgg cgctgcgcag 4083421 cggtgtgccg gaaaacgtca acgcacccgg cgtcatactg cgactgatgc tggcgtggat 4083481 tccgtccacc ggcgtaccac tcctggcgat cgtgctggcc gtagcggcgg acaagattgc 4083541 cttgctgcac gccacaccag aggcgctgtt caatcccatc ctgatgatgg cactggccgc 4083601 gctgggcatc ggatccgtca gcaccctgtt ggtggccatg tcgatcgccg acccgttacg 4083661 ccagttgcgc tgggcgctaa gcgaggtgca gcgcggcaac tacaacgccc acatgcagat 4083721 ttacgacgcc agcgaactgg gcctgctaca agccggcttc aacgacatgg tccgcgagct 4083781 gtccgagcgg cagcggttgc gtgacttgtt cggtcgctac gtcggcgaag acgtggcccg 4083841 gcgggccctg gagcgcggca ccgagttggg cggtcaggaa cgcgacgtcg cggtgctgtt 4083901 cgtggatctg gtcggctcca cgcaactggc cgcgacacga ccgcccgccg aggtggtcca 4083961 gctgctcaac gagttcttcc gggtggtggt cgaaaccgtc gcccggcacg gtgggttcgt 4084021 caacaagttc caaggcgacg ccgcgctggc catcttcggt gcacccatcg aacaccccga 4084081 cggtgctggt gccgcgctat cggcagcacg tgagctccac gacgaactca tcccagtgct 4084141 gggttccgcg gagttcggca tcggcgtgtc ggccggaagg gccatcgccg gccacatcgg 4084201 cgctcaagcc cgcttcgagt acaccgtcat cggcgacccg gtcaacgagg ccgcccggct 4084261 caccgaactg gccaaactcg aggatggcca cgttctggcg tcggcgatcg cggtcagtgg 4084321 cgccctggac gccgaagcat tgtgttggga tgttggcgag gtggttgagc tccgcggacg 4084381 tgctgcaccc acccaactag ccaggccaat gaatctggct gcacccgaag aggtttccag 4084441 cgaagtacgc ggctagtcgc gcttggctgc cttcttcgcc ggcaccttcc gggcagcttt 4084501 cctggctggc cgttttgccg gaccccgggc tcggcgatcg gccaacagct cggcggcgcg 4084561 ctcgtcggtt atggaagcca cgtcgtcgcc cttacgcagg ctggcattgg tctcaccgtc 4084621 ggtgacgtac ggcccgaatc ggccgtcctt gatgaccatt ggcttgcccg acgccggatc 4084681 tgttcccagc tcgcgcagcg gcggagccga agcgctttgc cggccacgac gtttcggctc 4084741 tgcgtagatc ttcagggctt cgtcgagcgt gatggtgaat atctggtctt cggtgaccag 4084801 tgatcgagaa tcgttgccgc gctttagata cggtccgtag cgcccgttct gcgcggtgat 4084861 ctcctcaccc gaggcggggt ccactccgac cacgcgcggc agtgacagca gcctcagcgc 4084921 gtcttcgagg gtgaccgtct gtaggtccat gctccgcagc aacgaaccgg tgcgcggttt 4084981 gggcccggcg gccttctggc gtttcttgac tccctgagcg gccgcggccg catcagccgc 4085041 aggctccggc aggatctcgg tcacatacgg cccaaaccgg ccttccctgg ccacgatctc 4085101 gtggccggtt tctgggtcca agcccaaagt ccgtccctgt tgcggtgtgg caaagagctc 4085161 ttcggccacc tgtagagtca gctcgtccgg ggtaatcgag tcgctgaggt tggcccgctg 4085221 cggcgtgggc tcaccggtgt cgccggccac caaacgttcc aggtagggac cgttcttgcc 4085281 cacccgaaca tatatggggc gtccgtgggt gtcgtcaaaa agcttgatag agtttacttc 4085341 tcgtgcgtcg atgccctcga gattgatccc gacaagcttc ttgaggccac ccgatcgggc 4085401 taccgaatcg ggcacaccgt gatcgccacc aaagtagaag ttgttgagcc agttggtgcg 4085461 gcgctcgttg ccggcggcga tctcgtcgag ctcgtcttcc atcgccgcgg tgaagtcgta 4085521 gtcgacgagc cgaccgaaat gctgctcgag cagaccggtt accgcgaacg ccacccatga 4085581 cggcaccagt gcactgccct tcttgtgcac gtagccgcga tcctggatgg tcttgatgat 4085641 cgacgagtag gtcgacgggc ggccgatgcc cagctcctcg agcgctttga ccagcgacgc 4085701 ctcggtgtag cgggccggcg ggttggtggc atggccgtct ggggtcaact cgacgatgtc 4085761 caaccgttga cccggggtca gatggggcag tcgccgctcg gcatcgtcag cctcgccgcc 4085821 gaccagctcg tccacggtct ccacgtaggc cttgaggaag cccgggaacg tcaaggtgcg 4085881 tccggtcgcg gagaacacca cctcctggtg ccccgacatg ccagtgatcc gcaggctcag 4085941 cgtcatgccc cgcgcatcgg ccatctgcga ggctacggtg cgttgccaaa tcagctcata 4086001 gagccggaaa tcatcaatgt tgggaccgtc gagttcgcga cgcaccgcgt ccggggtggc 4086061 aaacgtttca ccggcgggcc ggatagcctc gtgcgcttcc tgggcgttct tcaccttgcg 4086121 ggtgtattgg cgcggcgccg gcgcgacgta ctcgtcgccg tagagctggc gcgcctgggt 4086181 acgtgcggcg ttgatcgccg actccgacag cgtggtggag tcggtacgca tataggtgat 4086241 gtagccgttt tcgtacagcc gctgggcgat gctcatcgtc cgctcggcgg agaaccgcag 4086301 cttgcggctg gcctcttgct gcagcgtgga ggtcatgaac ggcgggtacg ggcgccgggc 4086361 gtagggcttc tcctcggccg aggccacggt cagctgcgtg ccatccaggc ccgcggccaa 4086421 cgcggtcgcg ctcccctcgt cgagcacaat gacttcgtcg cctttgcgca gcgtgcccag 4086481 cgagtcgaaa tcgcggccag tggccacccg ccggccagcc acggccgtca gccgggcgct 4086541 gaaggtgggc ggcgcggcgt ccgggtcgga cacgctggca tccagcttgg caaggatgtc 4086601 ccagtaggcc gcgctgcgga acgccatgcg gtcgcgttcg cgcgccacga tgatgcgggt 4086661 ggccaccgac tgcacccggc ccgccgacaa cttgggggcg accttcttcc acagcactgg 4086721 gctgacttcg tagccgtaca gccggtccag gatgcgccgg gtctcctgcg cgtcgaccag 4086781 gtcgatgtct aggtcgcggg ggtgctcggc ggcggcgcgg atcgccggtt cggtgatctc 4086841 gtggaagacc atccgcttta ccggtatgcg cggtttgagg gtttccagca gatgccaggc 4086901 aatagcttcg ccctcacggt ccccatccgt ggccagatac agctcgtcca cgtctttgag 4086961 caggcccctg agctcgctga cggtgctccg tttctccggg ctgatgatgt agagcggttc 4087021 gaagtcggcg tcgacgttga ccccgagccg cgcccacggc tgcgacttgt actttgcggg 4087081 tacatccgac gcggcccgcg gcaagtcacg gatgtgcccc cgggaggact cgacgatgta 4087141 gccagagccc aggtaggagg ccagcttgcg cgccttggtg ggcgactcga cgatgaccag 4087201 tcgccggccg ctgccattgc cgccgctgcc acggcccttc gttttcgggt cagccaactg 4087261 cgcccacgct ccatctctta tcccggcccc tatcgagacc gccccggtag gtagaggacg 4087321 cggccgactg ccgaatccca ggtgaattcc ggtacgccgg cgttccctcg cctgtgggca 4087381 actgacaatc tcgcactcta gggcgggcct gcgcaaaccg gctgcaaaca gattacccac 4087441 accaaaggct caaacgggcc gctcaggacg ctcggagatc cgcatcgtcg ccgaactagg 4087501 tccgactgcc cggctcctca gcggacccca gcgggaccgc atcgtcgccg agctaggtcc 4087561 gactgcccgg ctcctcagcg gaccccagcg ggaccgcatc gtcgccgagc taggtccgag 4087621 gccactgtac ccatgcctcg gccccgtctg ggggttcccc cacgttctcc accaggcgcg 4087681 ataacctgcg tcgcccgcta atccgcagcg ctggtcgggt cccccgggta ccgatcaggg 4087741 tgggcgcaat cccgaccctc atcagcgccg acgccagcgg agaatgggtg tccggcgcgt 4087801 gtggatccag gcccagcaaa tagcggtcgg cctccgggct gcctgccgcc agagtccaag 4087861 ccctcagctc gcgcggcccg ggcagccatc gcgggggcac ggtcttgacc gcaccgcgcg 4087921 tccactcggc ggcgatgccg cacaacagcg ggtcgacggc cgtccgcacc agcggggtgt 4087981 tttcgtcggt acgggcgacc tcgggtacca aaccagcctc ctggatcatc tcggccaggg 4088041 ccgatgcgcg ccaggactcg gcgacgacta ccgacagccg agcgccgcaa ccaaccagca 4088101 cgatctggcc cgggcccgcc agcaccccgg aaagatccgc gaccgcggga ggtactgact 4088161 ccgcggcgaa gaaggaaagc tggctcacct caccgacagt aagccagcga gcgggtcgct 4088221 ggctttaggc atccggcgcg gcggcagcgc gccatgtggc gagcagacgt aaagccccca 4088281 aaacggaacc gttttggggg ctttttgcgt ctgctcgcgg gggtaactca gagcgagcgg 4088341 actccggtgg cctgggggcc cttagggctg tggccgatct cgaactcgac cttctggttt 4088401 tcttcaaggg tgcggaagcc cgttccctgg atctccgtgt agtggacaaa tacatccgcg 4088461 gaaccgtctt cgggggcgat aaagccgaac cccttctccg cgttgaacca cttcacagtt 4088521 ccctgtggca tttctcgatc tttccttttc ttctgggtgc ggtgcaccgc ctttcggtgc 4088581 cccgggccag ctgcggccgc catacctcgc cgagtcgccg gaacttcacc cgaccgataa 4088641 cctcgcagga accgcggccg caacgtcgat cctgcgaaag tttgacacga acacagaagc 4088701 tgcgaccgcc aatcagtcaa tcatgttcat cgcgtcggca acagcctctg ggtgtggacg 4088761 gagctacgaa gggtccgcaa atggcgagtt tcggcagcca cctgctggcc gcagcggtcg 4088821 ccgggacccc gccgggcgag cgtccgctgc gccacgtcgc cgagctgcca ccgcaggccg 4088881 gccggccgcg cggttggccg gagtgggccg agcccgacgt ggtggatgcg tttgccgacc 4088941 gcggcatcag ctcgccgtgg tcacaccagg ctgaggccgc cgagttggcg tacgccggcc 4089001 gccacgtggt gataggcacc ggcccggcgt ctggaaagtc gttggcctat caacttctcg 4089061 tgctcaacgc gctggcaacc gactcccggg cgcgtgcgct gtatctgtcg ccgacgaagg 4089121 cgctcggcca cgaccagttg cgcgccgcac atgcgctggc ggccgcggtg ccacggctgg 4089181 ctgacgtcgc gccgacggcc tatgacggcg acagtcccga cgaggtgcgc cgctttgccc 4089241 gcgagcgctc ccggtggctg ttctccaacc cggagatgac acacctatcg gtgcttcgaa 4089301 accatgcgcg ctgggctgtg ctgttgcgga atctccgctt tgtgatcgtc gacgaatgcc 4089361 attactaccg tggtgttttc ggctcgaatg tggcgatggt actgcgccgt ttactacggc 4089421 tgtgcgcgcg ctactctgcg cacccgacgg tgatcttcgc cagcgcgaca acggcctcgc 4089481 cgggcgcgac ggctgccgac ctgatcggcc agccggtcgt ggaggtcacc gaggacggct 4089541 caccccgggg ggctcgcacg gtggcattgt gggagcccgc gctgcggtcg gatgtgatcg 4089601 gcgagcacgg cgccccggtg cgacgctccg ccggtgccga ggcggcccgg gtgatggccg 4089661 acctgatcgt cgagggagcg cagaccttga cgttcgtccg atcgcggcgc gcggcggaac 4089721 tgactgcact gggtgcccgg gcgcgactgg tcgacattgc cccggaactg tcggacacgg 4089781 tggcgtcgta tcgggccggt tatcttgccg aggaccgtag cgcgctgcac caggccctgg 4089841 ccgagggcca gctgcgcggg ctggctacca ccaacgcttt ggagttgggc gttgatatcg 4089901 ccggactgga tgcggtggtg ctggctggtt ttcccgggac ggtggcctcg ttctggcagc 4089961 aggcgggccg gtcgggccgg cgcggccagg gcgcgctggt ggtgttgatt gcccgtgacg 4090021 atccgctgga cacgtatttg gtccaccatc ccgcagcatt gttggacaaa ccggtcgagc 4090081 gcgtggtgat cgatccggtt aacccgcacc tgctgggtcc ccaattgctt tgtgcagcaa 4090141 cagaactgcc tttagacgac gccgaggtcc ggtcctgggg cgccgttgag gtggcggaga 4090201 gtctggttga cgacgggctg ttgcggcgcc ggaacggcag gtactttccg gcgcccgggg 4090261 tgaaaccgca tgccgccgtg gatgtccggg gggctatcgg tggccagatc gtcatcgtgg 4090321 aggccggaac cgggcggctc ttgggcagcg tgggcgtcgg tcaggccccg gccgcagcgc 4090381 acccaggcgc ggtgtacctg caccagggcg agacctacgt cgttgactcg ctggatttcc 4090441 aggacggaat cgccttcgtg cacgccgagg atcccggcta tgccacgttc gcgcgagagg 4090501 tcaccgacat cgcggtcacc ggcaccggcg agcggttggt cttcgggccc gttgctttgg 4090561 gtttggtgcc ggtgactgtc accaatcacg tcgtcggcta cctgcgccgc cagctgtccg 4090621 gggaggtgct ggacttcgtg gagctggaca tgccggaaca caccttgccc acaaccgcgg 4090681 tcatgtacac aatcacttcg gatgcattgg tccgcagcgg tattgaggcc acacggattc 4090741 ccgggtcgtt gcacgccgcc gaacacgcgg ccatcgggct gctgccgctg gtggccagct 4090801 gcgaccgcgg cgatatcggc ggcatgtcca cagcgaccgg gcccgagggg ctgcccagtg 4090861 tctttgtcta cgacggctat ccgggtggag ccggattcgc cgaacgcggc tttcgccggg 4090921 cccgcacctg gctgggcgcc accgcggagg ccatcgaagc ctgcgaatgc cccagtgggt 4090981 gtccatcgtg tgtgcaatcc cccaagtgcg gcaatggcaa cgacccgtta gacaaggcgg 4091041 gcgcggtgcg ggtgctgcgg ctggtgctcg ccgagttaag tgaggaatca ccgtgagcag 4091101 cccagcgttc cggcgttgtc gggcaaagcg gggtcgtcgt cttagccgat gtgatgcact 4091161 tgacatcagt gtcttcggcc tatcacgtag tggtcgtggg cgccggccga agatccgggc 4091221 gggaggtgac acgtgtcgtt tgtgatcgcg gcgccggagg cgttggactc ggcagcaacg 4091281 gacctcgtgg tcctgggctc gacgttaggc gcggccactg cggccgcggc ggcccagacg 4091341 acgggtatcg tggccgcggc ccacgacgag gtgtcggcgg cgatcgcagc cctgttttcc 4091401 gcccacggcc aggcctatca ggccgccagc gcgcaggccg cggcgtttca cacccggttc 4091461 atccgtgcgc gctcccgaca tccgcagcag gaaacgacct gtcgccgtgt gcgataggca 4091521 aatcaccagg caacacgccg gcagctccgg taaggccaac atcgaccacc tacccagggc 4091581 attcccatgc acgtcaccgc cgcatagcaa gttgcggatg ctgagtggtc cgctaccacc 4091641 cggtatggca acgccggtgg tcatggcacc acctcgggtc tgatctgcct cggaggccgg 4091701 ccgctggcac gaaggcaacg acggttcggg cgggttggcc tagcgatacc acacgcatgc 4091761 gctgtcctgc aagggaattc cctcggcgac caccggtacc ccaccgagtc aacggcgcac 4091821 cgcgtccgta gactgctcgc atgacccacg actggctgct cgtggagacg ctgggggacg 4091881 aaccggccgt ggtagcacgg gggcgtgagc tgaagaagct cgtcccgatc accacgttcc 4091941 tgcgtcgcag tccctatttg gcggcggtcc gcacagctat cgccgagacg ctgcagaccg 4092001 gccaaagcct gaccagcatc actcccaagc acgatcgcgt catccgcacc gaacctgtaa 4092061 taatgaccga cggccgcatg cacggcgtgc aggtgtggag tggccccaca gacgccgaac 4092121 cgcccgaccg gccgatccca ggcccgctga agtgggacct gacccgtggt gtggccaccg 4092181 acaccccgga gtcactgacc aacagcggca agaatcccga ggtcgagatc acctacggcc 4092241 gagccttcgc cgaagacctg ccggcgcgcg agctcaatcc gaacgaaacc caggtgcttg 4092301 ccatggcagt taaagccaag cccggcaaaa cactatgcag catttgggat ctcactgatt 4092361 ggcaaggaac acccatccgg atcggcttcg tggcgcgaag cgctctggag ccgggaccaa 4092421 acggccgcga tcacctggtc gcccgggcaa tgaattggcg tgctgagacc aaggcccctg 4092481 cagtgcccgt cgacgacttg gctcagcgga tccttatcgg actggcgcag gccggagtcc 4092541 accgggcact ggtcgatctc aaaacctgga ccctgctgaa atggctcgac caaccctgct 4092601 ctttctacga ctggcggcgt agcgcggccg atgggcctcg tctacatccc gacgaccagc 4092661 acgtgatcga cgccatgaca agagacctcg ccaacggatc ggccagtcat gtgctgcgct 4092721 tgcctgggca cgacgtcgat tgggtgccgg tccatgtcac cgtcaaccgg atagagctcg 4092781 aaccggatac cttcgctgga ctggtcgctc tgcgactgcc caccgacgaa gaacttgccg 4092841 acgccggact gccgaaagcc accgacgtca ccacctgaca accagtcctt tcgactcagc 4092901 aacggcagct gccgatccgc ggctaccgtt gcttgtcgtg aacggtttga cggtgatccg 4092961 gactgcgcgc tcgctgagcg gcctacgccc acgctgtcgg tcagattgcg tcgatgaatc 4093021 ctatgcgctc tgaactgaac tgggctgaat gcgcgagccg ccgacgtagg gaatcggcaa 4093081 cgcccgtcgg acgaccccgc cgatctcgtc gtcgacatcc agtggcgccg gcatcagcag 4093141 ggtggtgacg attgcccgtt cagacagtcg ccgcaaggcc ccgggcctgc taggaggtcg 4093201 ggttccccgg gacgtcgacc acaccctggt cgcaatgtcc aacgtaagca acaggtttga 4093261 gtatgaggtg ccggtagcga ggatgaattc gccagtcctg gtacacgcgc acggacatcg 4093321 caggtgccgc gatgcggccg gcctctggcc accgccgaat cggcgtagcc gtcgggcact 4093381 ttcaagatcg ggtcagcgcg cctgatgcgc accgggccgc cacctcagcg ccatggtgtt 4093441 tcggacatcc tccaatcgcc gccgatcccc gaggaacacc aggtcgcccg cgtgcgggcg 4093501 aaaggcagcg aggacttttg ggaaacccac gcacatgctt cccggatagc gataagctgc 4093561 gctccagcag attgtccgcc ggtgaccggg cggcccttcg atcggcatcg cgcggtggtc 4093621 ggaggtgtcc gatgtcatat gtgatcgcgg cgccggaggc gctggtggcg gcggccacgg 4093681 atttggctac tctcggctcg acgatcggcg ccgccaacgc ggccgctgcg ggctcgacaa 4093741 cggcgttgct gaccgccggc gccgacgaag tgtcggcggc gatagcggcc tattcggaat 4093801 gcacggccag acctatcagg cactcagtgc gcgggcggcg gcgttccatg agcggttcgt 4093861 gcaggccttg gccacaggtg ggggcgccta tgcggccgcc gaggccgcca gcgtctcgcc 4093921 gctgcagagc gcgctcgatt tgctgaatgc gcccactcag gcgctgttgg ggcgtccgtt 4093981 ggtgggcaat ggcgccaatg gggccccggg gactggggca aacggcggcg atggcgggat 4094041 tttgttcggg tccggggggg ccggcgggtc cggagcggcc ggcatggcgg gtggcaacgg 4094101 cggggccgcc gggctgttcg gcaacggcgg agccggcgga gccggcggca gcgcgacggc 4094161 cggtgcggcc ggggcgggcg ggaacggcgg ggccggcggg ctgctgttcg gtaccgccgg 4094221 ggccggcggc aacggcgggt taagcctcgg tttgggcgtc gccggcggcg ccggcggcgc 4094281 cggcgggtcg ggcggtagtg acaccgccgg acacgggggg accggtggtg ccggcggcct 4094341 gctattcggc gccggcgagg acggcacaac gcccggtggc aacggtgggg cgggcggtgt 4094401 cgccgggctg ttcggcgacg gcggcaacgg tggtaacgcc ggagttggca cgcccgcggg 4094461 caacgtcggc gccggcggca ccggcggcct gctgctcggc caggacggca tgaccgggtt 4094521 gacgtagccg cgtggcgggg ccgcgccttg cttccgggac taccacccgc aggtcgctgg 4094581 ccgtagttgg ttctccccgc tagcccacca ctagcttcgc ttgccgatag cttcgcttgc 4094641 cgatagaact agatcgtcgt caacccggtg tcgtgggcac cttggccggc cccgcccgcg 4094701 cggtggcggt cgccacaccc gcgaacgcga cagccacctc gacggtgacg accacgtcga 4094761 ggtccaccac cctgcactgc gcgtgctcga cgcgcatcgc acgggccacc agcgtcgcac 4094821 gcgcgcaggc cgccgccagt ccggacggca gccgggcggc agcggctaac gaagccagat 4094881 cagccgccgc ctgtgcgcgg tgacgagcca ccaccgccga ccctagatat gcacccgcac 4094941 cggtgacgca cagcagcacc gcgaccatcg cgacggcaag cacggtggcc gagccgcggt 4095001 cgaccccggc tcggccaccg aaattgccct agcagcaatg tccaacgtag gcaacaggtt 4095061 tgagtgtgct gtgacagtgg cgaccacaaa ctcgccgtcc cggtgcacct ggaccagcgc 4095121 cgcacgcggg gcgatgctgc gggcgacgtc ggtcgccgag cgtacgtcac cgcgcgcggc 4095181 caatcgagcg gcctcgcggg ccgcgtcgat acagcgcacc tgcattgata ccgcggtgac 4095241 gcccgccagg cacagcacca gcaccagcac cagggtggcg atcgccaacg ccgcttccac 4095301 ggtgctcgca cccgcacacg acgctaaacc ttggtgctga gcgcgcgacc gatgatgcgg 4095361 ttgagcgccg acacaatgga atccccggtg acgaccgtgt agaggatcgc accgaaggca 4095421 gccgccgcga tggtaccgat ggcgtattcc acggtggaca tgcccgactc gtcgaccgcc 4095481 agcgccgtca tccgcgccac gagtacacga aacatggtga tcaccaacat attcctttct 4095541 cataccaggc caaactgcaa gacatcaccg gccagcccga ctactagcgg gacaatgccc 4095601 acacacagaa acgccggtaa gaagcacagt cccagcgggc cggcgatcag cacaccggcc 4095661 cgctcggcgg ccgccgcggc cgcctgtgcg gcgtcgtgcc gaacctggac ggccagttcg 4095721 acaatgccat cggcgagcgc cgcgcccgaa gccgccgaac gccgtgccaa ccgcagtacc 4095781 gcatcggtct gcgcatcgtg ggtgcccggc ggcaaatccg gcggcctcga ccaggcgatg 4095841 ttggggtcgg cacccaatgc cagcaggtcg gcggcccggc gcaacacgcg cgccagccgc 4095901 ggcggcgcga ccgcagcggt ggcggccgcg gccgtcgaca ccgccatccc cgcagccaga 4095961 cacacggcca gcacgtcaag gctggctgcg acggctagcg ggtccgcgac atccgtccgc 4096021 cctagcagca gcccctggtg tggccgatgc gcgcggggcg gcctcccggc tcgcgcccgt 4096081 accaccgacg ggccggcacc gagccacaac gccatggcca gcaacaccgc cgccgcactc 4096141 acaacactgg ccgatcggtg atccggtccg accacagcag cccggcgcag gccagtgtca 4096201 gcccgaccac cagcagccat ccgcccacgc gtcccgtcag cagaaagctc agcggccggg 4096261 cgccgatcag ttgaccaagc agcaccccga gcagcggcag gattgccaat atggccgcac 4096321 tggcccgggc accggccatc cccgctgaca cccgcgcgga gaaccgttgc cgctcagcga 4096381 catcacgttg ggcggcacgc atcaaactgg ctatcgccaa gccgtgatca ctgcccagtt 4096441 gccagcagac cgcgagccgc tcccagtacg cgggcagcgc cgaggatcgg gccgcagcga 4096501 gcaggccagc cgtgacgtcg gcacccaatc gtgcccgcgc cgcgaccgcg cgcaaggcaa 4096561 cggcaaccgg gccgccggtc tcgtcggccg cgatgctgaa tgcgcggact ggatgggcgc 4096621 ccgcgcgcag ttcacccacc accagctcaa gcgcggcctc cagcgcctgc ccctcgcggc 4096681 tgcggcgcag gtagcggcga cgccggcggt agcgcaggcc gagtgttgcg cccagcaccg 4096741 cgacagccac aacggtcggt aacggtagca aggctgccac accaaccgcg acacagccaa 4096801 caccccaggc aacccgccgg gcgccgacca gaagcacccg ccggccggtg tcgtctggag 4096861 taaggcggca ccgcggcgac ccgggcaaca ccacgagcgc aagcgacaaa atcagggcag 4096921 cggacgctat accgctcatg ccgatgcccg gcttctcagc aaatcgtgca gggcggccgc 4096981 gtcgtcactc atcccacggt ccgcgtgcca caccgtcacc gcctggaccc gcccttcagc 4097041 ttggcgcagc acggcgatct cggcgagccg gcgacggcct gcccgatcgc gcgcgacgtg 4097101 cagcaggact tggactgccg cggcgagctg gctgtgcaga gcagcgcggt caaggccgcc 4097161 gagcgccccc aacgcttcca tgcgtgcagg gacctcaccc gggttgttgg cgtgtacggt 4097221 gcccgcgccg ccctcgtgac cggtattgag cgccgccaac agatccacca cctcggctcc 4097281 cctaacctca ccgaccacga tgcggtcggg ccgcatccgc agcgcctgtc ggacgagttg 4097341 acgcacggtt acctcaccga ttccttcgac gttcgcacgc cgcgcaacca gcttgaccag 4097401 atgtggatgc cgaggggcca gctcggcggc atcctcgacg cacacgatcc gctcatcggg 4097461 cgacacggcg cccaacatcg ctgccagcaa cgttgtcttc ccggcaccgg ttccgccgca 4097521 cacgaggaat gccagccggg cggtgacgat gtcggcgacc agcgcggcgg ccgcggggtc 4097581 gatcgcgccc gccgcagcca acgcggccag atcctgagtc gcgggacgca acacccgcaa 4097641 cgacaagcaa gtgccctggg tcgccacggg cggcaacacc gcatgcagcc gcaccgcgaa 4097701 ccctccgacg ccgatcccgg ttagttgacc gtccacccag ggttgcgcgt cgtcgagccg 4097761 acggccggcc gccaaagcca gccgttgtgc caaccttcgc accgctgact cgtcagcaaa 4097821 ccgaatctgg ctgcgtcgca atccgtttcc gtcgtccacc cacaccgagt cgggcgcggt 4097881 gaccagaacg tcggtggtgc cgtctgcgga tagcagcggt tcgaggatgc cagcgccggt 4097941 cagttctgtc tgcagcacac gaagattcgc cagcacttcg gtgtcgccga gcatcccccc 4098001 ggactcggcc cggatcgcgg cggccaccac actgggccgc agcgggccgg attcggatgc 4098061 cagccgttcg cggacgcgtt cgatcaggga gccggtcatg ccgccctacc gtgtcgccct 4098121 gacccagcac gtggcagcac accaagtacc cgtcgggcag ccgatgccag caccgatcgc 4098181 cgtcgcagtc gaagaccccc gtgttccagc tgttcggcta gccgcggctg ggccctcatg 4098241 gatgccagta gcggcacccc ggcgacgtcc gcgacctctg ccgcccgcaa tccccccggg 4098301 gagggccccc gcaccaccag acccaggttg gggttgatcg cggtcagcac aggcgccatc 4098361 gtcgcggcgg ccgcacatgc ccgcacatcg catgggctga ccaggacgac gagatcggcg 4098421 gcatccagcg ctgcttgggt ggcatcggtc agacgacgtg gaagatcgca gaccacggtg 4098481 actcccccac gtcggccggc gtcgatcacg gcgtccaccg gcccggcgtc taactcgtag 4098541 ccgcgccgag ttcccgagag cacgctgatc ccccgcggtc gcggcaatgc cgcacgcacc 4098601 gccgaccaat tcagccgtcc accctgtagc gccaggtcgg gccaacgcag accgggggcg 4098661 gtttcgccgc ccaccagaag atcgatgccg ccggcccacg gatcgagatc gaccaacagc 4098721 gcatcagcgg cggcctgcgc cagggcaacc gcaaacaacg atgccccagc gccaccgcga 4098781 cccccgatga ccgcgaccac cgccccgcag atcccgtcat cgcgtgccga ttcagcagct 4098841 tcggcgagct cgcggaccag ttcaccctcc tgctcgggca tcctcagcac gtgctgggcc 4098901 ccgacggtta tggcagccgc ccaggtcgcc gtcgcggctt cggttccggt caacacgctg 4098961 acgtgggtgc gccggggtag cgcgagccgc ccacaccggt ccgccgccgc gtggtcgagc 4099021 accacagccg ccgccgccga ccacgtcttt ctgctcaccg gatggcggcc gccgagatga 4099081 acaacgcgaa ccccgacggc tgcggcgact cggtccagct cgtcgcgcaa ccccggatcg 4099141 gtcagcatcg ccaacacgcc cgagcccacc gggtggctac cagacgggcc accagggcct 4099201 gagaagactg tcacccaccc accgtgcggg gtccatggtg tgggacacca gtcccaaagg 4099261 cgcaattggg gacagacgtg caactgtgca caaacgcccc tgagggggtc cgggcaacac 4099321 gattcccgca acgcccagaa agctgggcta agcaccgggc tgacgacgtt tgcgtggctg 4099381 ccaaaaggga cgacccccgc caggggggga ggaggcgagg gtcgtcgtgc atcagccccg 4099441 gggggtcgga ctgatacacc ctcggctatg gccgagtaat gcttactata cacatgacag 4099501 tgcgcagtca cgcaagtacc ggacgcaatg gaaagcacag cttgagccgt gtaaatgctc 4099561 ttgacttctc gacaacatcg gtagtcaatt gacctgttcg ggaacaaggt cgccggccgg 4099621 tccaactgcc gacctatgct gggtcggtga ccgtctccga ctcgcccgcc cagcggcaaa 4099681 ccccaccgca aacaccggga ggcaccgctc cgcgagcccg caccgcggcc tttttcgacc 4099741 tggacaagac catcattgcc aagtccagca cactggcgtt cagcaaacct ttcttcgctc 4099801 agggactgct caaccgccgc gccgtgctga agtccagcta cgcgcagttc atctttctgc 4099861 tgtccggtgc tgaccatgac cagatggacc ggatgcgcac ccacctgacc aacatgtgcg 4099921 ccggttggga cgtagcccag gtgcggtcga tagtcaacga aaccctgcac gacatcgtga 4099981 ccccactggt gttcgccgag gccgcggacc tcatcgccgc ccacaagctg tgcggccgcg 4100041 acgtcgtggt ggtctcggct tcgggcgagg agatcgtcgg cccgatcgcc cgcgcgctgg 4100101 gcgcgaccca tgcgatggcg acccggatga tcgtcgagga cggcaagtac acaggcgagg 4100161 tcgcgttcta ctgctacggc gaaggtaagg cgcaagccat ccgtgagctg gctgccagtg 4100221 agggctaccc gctggaacac tgctacgcgt actccgactc gatcaccgat ctgccgatgc 4100281 ttgaggcggt tgggcatgcc tcggtggtca accctgatcg cggcttacga aaggaagcca 4100341 gcgtgcgcgg ttggcccgtg ttgtcgttct ctcggccggt gtcgctgcgc gaccggatcc 4100401 cggcaccgtc agccgcggcg atcgccacga ctgcggcggt gggtatcagc gccctagccg 4100461 ccggcgcggt cacctacgcg ctactacgcc gcttcgcgtt tcagccctag cgacgatgcg 4100521 ggccacacag tggcccgagg aggaacgggg ccacgaagca ggccgccgga tcgcgcccga 4100581 gcgggcgggc agcaaacgtc tagcccacgc aatccaaagc cgcttcgtaa ctttcgcaga 4100641 attgggcctt gctgtgttaa aggtctagta gtacaaagga accacggaag cccggtgagg 4100701 ccaaggctcg atccagaaga gaaggttcgg tctcccgacc cgggcgccca gcatggttcc 4100761 cggcacccac gcggagtcat agccacgata acggcagaag tgttgcgggt ctgcgtaatt 4100821 gcgaacagca gatggcatcg acggcccttt gggtggggct acagctagaa gcgtcgcaag 4100881 atcgccgagg ccacccacgc aaccccagga gtgcacgctt ggtaaccgag aaccgtgttg 4100941 gtgggcggcg attcgagttc ttcgggtcgc cgcctgcttt ttgttttctg gatcaagtat 4101001 tacggccatt cgaggcccgc cggttagccg ctcggctatc taggcgcgta attcagtgac 4101061 cgtttggccg ggctgtctcg cggctgtgcc agatcacagc ggcgaagtgc cgcagccgtg 4101121 acccgctcgg ggtagccggg ctgtttgagc aaccagacac gccgaacgtg caaccacggc 4101181 ggctccaccc ggcggggcgt gtccccgcca ccaatgcacg ttcggcgcag ccggcgcacc 4101241 ctcggcgcgg agtttaggaa ctactcatcc aggtgacaac gactcggcaa tcgacaaagc 4101301 ctcccgcgcg ccgtcgagca tcgcgccgca acacagcaac agccagcccg ccaccccatc 4101361 aggtgtgccc ccggcgaacc tgcgggcagc gtcgtggtat tcggcgggtt ggcgcatcca 4101421 aatcacttcg ggaacaccca gcccgtgcgg atccagtccg gtggcgattg tcaccagccg 4101481 cgacaccgcg cgggccacca caccgtcggc acagccaaac ggcctcagcg tcaagagctc 4101541 cccgtgtgcg accgcagcaa ccaccggcgc cgatgccagg gtggggtggg ttaccacatc 4101601 cgcgagcaac tccaaacgcg ggccaacgtc ggcatcggac cgcggacgcc caagccgatc 4101661 gtcatcgacc tggtcggcgg ccgccagcat gtgtaggcgg gccagcgcct gcaacggtgc 4101721 ccgccgccac accccgacca ccggacccgc gccgccttcc agcgcctgcc ccacccgaag 4101781 cgctcccgcg aacaccggat cgctgagcgc cggcttgccc gaggtgggcg cccccgcgtc 4101841 gtgcagccgc gcaggaccac cgtcgagcac cgaggaggcc cgcgccgccc gcaacgaggc 4101901 ctcggcggcg gccaccggcc agccccgcag gttggcccgg tgccggtgca cgcggctcag 4101961 cgcgtcgcgc acccggtcgc tggccgcagc aacgcccggg agctccatta gcggagccag 4102021 cgggtcgacc gtcacaggtt gccaaccttt cggggagctg agggggcacc gggaatggcc 4102081 tgaagcaact ggcgggtgta ctcgtggcgg ggccggctga acacctcctc ggtagaggcg 4102141 tgctccacca cccggccggc ccgcatgacc aggacgtcgt cggcaatctg ccggatcacc 4102201 gccagatcat ggctgatgaa caaatacgtc aaacccaggt cggcctgcag atcggccagc 4102261 agatccagga tctgtgcctg caccaatacg tcgagcgccg acaccgcttc gtcgcacacc 4102321 aatacctccg ggcgcagcgc cagcgcacgc gcgatcgcta cccgctgccg ctgaccaccc 4102381 gacagctcac ggggccgccg gcccagtatc gacgacggca gcgccacctg atcgaccagc 4102441 tcacgcaccg ccctttgccg ctgccggcgg tcaccgacgt gatggacgcg taacggttcc 4102501 tcgatggcgc gaaacaccga gtacatggga tccaggctgc tgtatgggtt ttggaacacc 4102561 ggctggaccc ggcggcgaaa ggccagcacc tggtcccggg ccagcgcgcc gacgtcgtag 4102621 gtgccgtcga aaacgaccgt gcccgaggta ggttggagca gcccaagcac catccgcgct 4102681 agcgtcgact tgcctgaccc ggattcgccg acgattgcca gggtgctcgc ccgcggtagc 4102741 cggaatgaca ctccgtcgac ggcgcgagac tccacccgcc gccacggtgc gccgcgggac 4102801 tcccggtaaa tcttggtcag ctccgagacg acgagaatgt cgccggcctg cgtggttgcc 4102861 cgtgaccggg attccggcgg acgtctgctg cgcgccgtca gcgatggagc cgcggccacc 4102921 aggcgccggg tgtactcgtg ctgagggctt tgcaggattg actgcgccgc accggattcc 4102981 accaccactc cacgacggac gacgacgaca gcctcggccc gctgcgcggc caacgccaga 4103041 tcgtgggtga tcagtagcag cgcggtgcct agttcgtcgg tgagtccctg aagatgatcg 4103101 agcacctgcc gctgcacggt gacatccaac gcggacgtcg gctcatcggc gatcagcagc 4103161 cgcggcctgc ccgccaagcc gatcgcaatc aacgcccgct ggcacatgcc gccggacagc 4103221 tgatgcgggt agcgtccggc ttgcttcgcc ggatccggca ggcccgcctc agcgagtagc 4103281 tccaccgccc gtcgtcgtgc tgcgcgaccg tcggtattgg cccgcaacgc ttctgtgacc 4103341 tgaaagccga ccttccaaac cggattgagg ttggtcatcg gatcctgggg aacatagccg 4103401 atctcccgtc cccttatcga ccgtagccgc ttggcatcgg ccccggtgat gtcgcgcccg 4103461 tcgaacacaa cgcgtccagc ggtgatccgt ccaccagccg gaagcaaccc aagaatcgcc 4103521 gcggccgtcg tggatttgcc cgacccggac tcacccacca cggcgacggt ttgaccgctc 4103581 cggacggcca gatccacccc acacacggcg ggagcatcgg tgccgaacgt aacttccagg 4103641 ccctccaccg acaacagcgg cgctgctggg acgctcatgc ccgccatgcc cgcgaagccg 4103701 gatccagcgc gtcgcgcaaa gcgtcgccca tcatcatgaa cgccagcacc gtaatcgcca 4103761 gcgcgcccgc aggatagaac aaaattggcg agcccgaccg tagccgggtc tgcgcgacat 4103821 tgatgtcgcc accccaggac accaccgacg tcggcaatcc gaccccgagg taggacagcg 4103881 tggcctcggt gacgatgaag atccccagag cgacggtagc caccgcgatc accgggccca 4103941 cggcgttggg cagcgcgtgc cgaagcagaa tctgaaacct attcaacccc aatgccttag 4104001 ctgcaaggac gtaatcgctg gcacgcacct cgagcaccgc accgcgcgcg atcctggcca 4104061 cttgcggcca gccgaacaat gccaagatgg cgatcaccgt ccacaccgtg cggtgatgca 4104121 tgacttgcat gagcacgatg gcggccaaca gcaacggcaa gccgagaaac acatcggtga 4104181 cccgcgaaac caccgcatcg atccagctcc cgtaaaaacc ggccaatgcg cctaacgccc 4104241 cgcccacgac gaacacggcc agcgttgccc ccaacccgac cgtgaccgaa gcccgcgcac 4104301 catacaccgt gcgcgaatag atgtcgtggc cctgcaggtc ggtgccgaac cagtgcgcgg 4104361 ccgatggcgc aagcatgctt tggctgggat cggcataggt gggatcggct gcggtaaaca 4104421 acgacggaaa cgccgccacg acaagaatca gcaggatcag cgccgcggcg atcacgaatt 4104481 taggacgccg gcgcaacccg cgccaggcat cgagccagaa ccccgtgtgc tcagccatag 4104541 cggatccgcg ggtccagggc cgcatacagc agatccacca acagattggt gatcaggtag 4104601 atcagcacca gcaccgtcac gatcgacacc accgtcggcg tctcctgacg cgtgaccgct 4104661 tgatacagca cgcccccgac gccgtggatg ttgaagattc cttcggtcac aatcgctccg 4104721 cccatcagcg cgcccagatc cgcgcccagg aaggtcacca ccggaatcag cgaattgcgc 4104781 agaatgtgca ccgtcaccac ccggggccgc gacaacccct tggcggtggc ggtgcggaca 4104841 tagtcagcgt gtgcgttggc cgccaccgcc gagcgggtca atcgcaccac gtaggcgaat 4104901 gacatggcgc ccagcacgat cccgggtagc agcaggcggc cgacgctcgc ccgttcgccc 4104961 accgtgaccg gcgcgatttc gagctggacc ccgaataaga actgcgccag aaagcccagc 4105021 acgaagatgg ggatcgcaat aatgacaagt ccggtaacca gcaccgcgga atcgaagatt 4105081 ccaccctgac gtaggccggc gatcacgccg aatccgattc cgagcactgc ctccaccgcc 4105141 agggcgatca aggccagcct gatggtgacc ggaaacgcat gcgccagaac ggcactgacc 4105201 ggcagcccag aatacgcacg acccaagtca ccgtgcagaa ttccgcccag atagcgcaag 4105261 tattgcacga ggaacggatc gtcgaggtgg taatgcgaac gcagctgcgc ggccaccgcg 4105321 ggagtcaacg gacggtcgcc cgccagcgcg gcaactgggt caccgggcag cagaaagacc 4105381 atgccgtaga tcagcagtgt cgcgcccagg aaaaccggca ccatcacggc gactcggcgc 4105441 gcaacatacc agcccatgtc aggccttgac gatgttctcg tagtcgggca gaccattcca 4105501 ggtgacggtg acgttgctga cttgcgacga ccatccgacg acactgatgt aatcccagag 4105561 cggcacaact ggcatgtcgt gaaacaggat tcgctgcgcg tcgttgacca gctcgtggga 4105621 ttcggttaac gtgggggcgg cttcggcggc ggccagcgcc gcgtcgaatt ccgggttgat 4105681 gtagccgacg tcgttggatc cggcgccggc ggtgaacagc ggagcgagaa actcgatcat 4105741 cgacgggtag tcgccccgcc atccagcgcg aaatgcactg tcgatggcgc ggttggtgat 4105801 ctgggtgcga aatccggcga aggtgggctg cggcgcggcc accgcatcga tgcccaacac 4105861 gttcttgatg ctgttggcca ccgcgtccac ccaatcccga tggccagcgt cagcgttata 4105921 ggcgatcgcg taccggccgc tccacggtga gatcgcatcg gcctgcgccc agagccgccg 4105981 agcccgctgc gggtcgtagt ccagcacctc gttgcccggc aggttgggat cgaagcccgg 4106041 caacgaccgg gcggtgaaat cgcgggccgg actgcgggtt ccggcgaaga tctgctggca 4106101 gatttgcggc cggttgatgg cggccgacag cgccaaccgg cgcagccgcc cctcctcgcc 4106161 accgaaatgc ggcagccgca acggagtgtc gagggtctga ttgatcgctg cgggcccgct 4106221 ggtagcgtgg tcgcccaggt cgcgctggta gaccgtcaac gcgctcggcg gaatcgtgtc 4106281 caggacatcg agattgccgg acagcaagtc ggcataggcg gtgtccagat tggcgtagaa 4106341 ctcgaatcgc aaacctttgt tacggggctt gcggttgccg tggtagtcgg ggttgggcac 4106401 caggtcgatt ctgacgttgt gttcccaggc cggcccggct gggccgtcgg cgagtttgta 4106461 cgggccgttg ccgatcgggt tgcggccgaa cgcggccatg tcccgaaatg cggagtccgg 4106521 cagcggataa aacgagctgt ggccaaggcg caacgtgaag tcgatggtcg gcgccttaag 4106581 ccgcacggtg aactccaggt cgttgaccac gcgcaacccg gacatggtgg tccggctctt 4106641 atcccctggc gcgccggcca cgtcatcgaa cccttcgatc gggctgaaaa agtgctgctg 4106701 cagttgggca ttggtgctca gggctccgta gttccacgcg tcgacgaacg agtgggccgt 4106761 caccggcgag ccgtcggtga acttccagcc gggtttgaca gtgatccggt agttgacgtt 4106821 atcggcgctc tcgattgact gcgcgacctc cagcgacggc ttgccaacgg cgtcatagga 4106881 catcaggccg gcgaacaacc gatcgatgat gcgcccaccg ttgctgtcgt tggtgccggt 4106941 cgggatcagc gggttgggcg gttccccgcc gttgaccagc accacgtcag ggctcaggac 4107001 accgccgcca caaccggcca ctggcgcaag caccagcaat ccggtggcaa gggctgccag 4107061 ggccgcccgc atctgacgca ccatgacagc gaccctaaag ccttcttgtg cagtccggct 4107121 ccccagccgg tgaagtgcgg cctggccagc gcagccgaca cactcgccgg tgaccgttag 4107181 ctaccacgcc acccagagtg ccggcgaacc ggtgggacga tgttttggga acgctcacac 4107241 cgtcgttcgc gatccggtgt tggctaccca ccgcgactgc gcttcccaag ggaagacctc 4107301 gcccgaccgg gcgctgttgg cgtgcggcat cctcgaggag gaccggtggt gtcggcgctg 4107361 tggcgaggaa ggcagcccgc gcgacaccgt gaccaggagg ttgactcact ggtgtgggct 4107421 gcacccgggt gtgagcgtag atcactcatg tcttagccga tgctgccgct tggattgccg 4107481 ccgtcgtggc ccagcggtgc cccaacgcga tccgccgcgc cgataaagct aaccggtgcc 4107541 aacgaacgac gccacatcgc acatgtcgct cacgccagcc gatctccgtt gccggccacc 4107601 gtaaccgtca gcacgactcg gcacaatgcc agccgcacgc tgcaaggccg accaacgtgt 4107661 gatgtgtagc ctgcaagaca ccggctttct tggctatgac tgcatcctgg tcagcgattg 4107721 cactgtgacg actttgccca gctcaacctc tgccatgccg gctgtatcgt cgcgcggtta 4107781 ggctcacatc cgtgagtgag tccacccccg aagtctcctc gtcatacccg ccgccagcgc 4107841 acttcgccga gcacgcgaac gcccgcgccg agctttaccg cgaggccgag gaagaccggc 4107901 tggctttttg ggccaagcag gccaaccgac tgtcctggac gacgccgttc accgaggtgt 4107961 tggactggtc gggggcgccg ttcgccaagt ggttcgtggg cggcgagctc aacgtcgcct 4108021 acaactgtgt ggatcgtcac gtcgaggccg gccatggaga tcgggtcgcc atccactggg 4108081 aaggcgagcc ggtcggcgac cggcgcacgc tgacctattc cgatctgctt gccgaggtat 4108141 ccaaagccgc gaacgcgctc accgacctcg gtctggtggc cggtgaccgc gtcgccatct 4108201 acctgccgtt gatccctgag gccgtgatcg ccatgctggc ctgtgcccgg ctaggcatca 4108261 tgcatagcgt tgttttcggc gggttcaccg ctgcggcctt gcaggcccgg atcgtcgacg 4108321 cccaagccaa gctgctgatc accgcggacg ggcagtttcg gcgcggcaag ccatcgcccc 4108381 tcaaggcggc cgctgacgag gcccttgcag cgatccccga ctgctcggtc gagcacgttc 4108441 tggtggtgcg gcgcacggga attgagatgg cctggagcga gggccgcgac ctgtggtggc 4108501 accatgtcgt cggctcagct tcaccggcac acaccccgga gcctttcgat tccgagcacc 4108561 cgctgttcct gctgtacacg tcaggcacca ccggcaagcc caaaggcatt atgcacacca 4108621 gcggcggcta tctcactcag tgttgctaca cgatgcgcac cattttcgat gtcaagccgg 4108681 acagcgacgt gttctggtgc accgccgaca tcggctgggt caccggccac acctacggcg 4108741 tctacggccc gctgtgcaac ggagtcaccg aggttctcta cgagggcacg ccggataccc 4108801 ccgaccgaca ccggcatttc cagatcatcg aaaaatacgg cgtgacaatc tattacaccg 4108861 cccccaccct catccggatg tttatgaagt ggggccgtga gatccccgac agccacgacc 4108921 tgtccagcct gcggctgctg gggtcggtcg gcgaaccgat caaccccgag gcttggcgtt 4108981 ggtaccgcga tgtcatcggc ggcggacgca ccccgctggt agacacctgg tggcagaccg 4109041 agaccggctc cgcgatgatc tccccgctgc ccggaatcgc tgcggccaaa ccgggttcag 4109101 cgatgacgcc gctgcccggg atctcggcca agatcgtcga cgatcacggt gatccgttgc 4109161 caccgcacac cgagggcgcc cagcatgtta ccgggtacct cgtcctagac cagccgtggc 4109221 cgtcgatgtt gcgcggcatc tggggcgacc ccgcgcggta ttggcactct tactggtcca 4109281 aattttccga caagggctac tacttcgccg gggacggcgc tcgcatagac cccgacggcg 4109341 cgatctgggt actaggccgc atcgacgacg tgatgaacgt gtccgggcac cggatctcga 4109401 ccgccgaggt ggaatcggcg ctggtcgctc actctggcgt ggccgaggcg gcggtggtcg 4109461 gggttaccga cgagaccacg acccaggcca tctgtgcgtt cgtcgtgcta cgcgccaact 4109521 acgcccccca tgaccgcaca gccgaagagt tgcgcaccga agtggctcga gtgatctcgc 4109581 ccatcgcacg gccacgcgac gtccacgtag tgcccgaact acccaagact cgtagcggca 4109641 aaatcatgcg tcgactgctg cgcgacgtcg cggaaaaccg tgagcttggc gacacgtcga 4109701 cgctgctcga tcccaccgta ttcgacgcga tccgggccgc caagtaggtc gcggcacgat 4109761 caaccgggtc agcccagcca actcaggccg gtaccgggac gaatcccgcg cccggccggt 4109821 tcttggcgtt gatgtcggcc aggtcggcgt tgatcgacat caccaccgcc ggggtgtgca 4109881 gcgggatgta tttggtgatg caactcggca gattgtcgct gaatgcgccg tggatcatcc 4109941 cgaccagcag attgtcgacg gtcaccggcg caccggagtc gcccggtccg ccgcagacct 4110001 gcatcacaag ggtgcccgga ctctcccctg gcccccaggt aaccccgcac gagttaccgg 4110061 tggtgcggcc ctgcttgcag gcgatctggc cgaacgacgg gtccgggcca atgccgttga 4110121 tcgcaaaccc gttgaagacg gccaccgggg tcaccttggc cgggtcgaac ttgatcaccg 4110181 cgtagtccag gccgtcgttg ccggcgacca tgatgcctac cgggcccgcg ttctcggcac 4110241 cctcagcggc gatctgcgcg cccgggcccc cacagtgggc ggaagtgaag ccgatgaggt 4110301 caccgttctt gtcatggccg atggtggtta gggtgcacat ggtgtccccg ttgacgacga 4110361 tgcccgcacc accgcccagc ggtagcttgt cgtcggctgc cgcggtgttc gcaggtaggc 4110421 acacaacggc caaaagcacg gccgcgaatg ccgcggcaaa gcgcctgtgc gccgtctgca 4110481 acgcaatgct cccgtcatat cgtcagacac ttgagaacag atccgccagt ttagacgatc 4110541 gcaccgcaac atcggcctct gttcaaacgg ccgcacacgt caagacgtgg ctaactctgt 4110601 cccgccgccc ttggtgttgg ctggcctcgt atggcaccgc accgcatggc aacatgaacc 4110661 gcgatgccag ccgaaccgct cggcgacgat gcgggccgga tgacggcccg aggaggagcc 4110721 gagcaatcga accgagctcg gcgacgatgc gggccggatg acggcctagg gtggggtacc 4110781 gccgctggcg agggcgagcc gagcaatcga atcgagagga ccgtctgtga gcaagatcga 4110841 tcgcaagaac ggtgtgccca gcacgctgac cacgattccg ttggccgacc cgcacgccgg 4110901 acctgctgag ccgtcgatcg gtgacctgat caaagacgcg acaacgcaga tgtcgacgct 4110961 ggtccgagcc gaggtcgagc tggcccgcgc cgagatcacc cgggacgtca agaagggact 4111021 gaccggcagt gttttcttca tctcctcgct ggtggtcggg ttctactcca ccttcttttt 4111081 cttctttttc gtcgccgaac tgctcgatac ctggatctgg cgctgggtgg ctttcttgct 4111141 cgtgttcgcc ataatggtcg tggtcaccgc cgtgttggcc ctcttgggtt tcctgaaagt 4111201 ccggcgcatc cggggaccgc ggcagaccat tgcgtcggtc aaagagacgc gcaccgcact 4111261 taccccgggc catgacaaaa cccctgtgac accaaaaccc gtgacatctg atcgcgcgac 4111321 gccggttgac ccctcgggtt ggtagatggc ggcaccagat ccgtcgatga cccgcatcgc 4111381 cgggccatgg cgtcatctgg acgtgcacgc caacggcatc cgattccacg tcgtcgaggc 4111441 tgtgccgtcc ggccagccgg agggcccgga tgcggctacg ccccccatgc agccggccct 4111501 ggcgaggccg ctggtcatac tgctccatgg tttcggctcg ttctggtggt cctggcgtca 4111561 tcagttgtgc ggcctgaccg gggcgcgggt ggtcgcggtc gatctgcgcg gctacggcgg 4111621 cagcgacaaa ccgccccgcg ggtacgacgg ctggacgctg gccggcgata cggccggtct 4111681 catccgtgcg ctcgggcacc catcggcgac gctggtcggc cacgccgatg gcggactggc 4111741 ctgctggacc accgcgctgc tgcattcgcg gctggtgcgc gccatagcgc tgatcagctc 4111801 accgcacccc gccgcgctac ggcgatccac gctgacccgg cgtgatcagc ggcacgcact 4111861 gttaccgaca ttgctgcgtt accagctgcc gatctggccg gagcgcttgc tgacccgcaa 4111921 caacgcagcg gagatcgagc gcctcgtgcg cgcccgtggc tgcgccaaat ggcttgcatc 4111981 cgaggacttc tcgcaagcaa tcgaccacct tcgacaggcg atccagatcc cggcggcggc 4112041 gcattgcgca ctcgagtacc agcgctgggc ggtgcgcagc cagctgcgca gcgaagggcg 4112101 gcgattcatc agggcgatga cacagcaact ggggatgccg ctgctgcact tacgaggcga 4112161 cgccgaccct tacgtgctgg ccgacccggt agagcgcacc cagcgctacg caccacacgg 4112221 gcggtacata tccattgccg gcgcaggaca tttcagtcac gaagaggcgc cggaggaagt 4112281 caaccgacat ctgatgcgtt tcctcgagca ggtgcaccag ctcagctgac gcaggccccg 4112341 gtgccgaccg gttgggtagc accgattttg gcaagctgcc ccgccacctc gccggccgtc 4112401 agcacaaacc cagtttcggc gtcgtcgatg gctgcgccga acaccacacc gagcacctga 4112461 ccgttgaggt cgatcagggg cccacccgaa tcaccttgct ccacatcggc tctgatggtg 4112521 tacacgtcgc gggtaaccgg ctccgggtcc ccgtaaatat cggggccact gagtctgatg 4112581 gcctcgcgaa tcctggcggg tgtggcagtg aaattgccgc cgccgggata acccagcacc 4112641 acaacgtcgg caccggtttt cgccggctcc gcagcgaaga ccagcggcgg cggcggcaag 4112701 tgcggaacgg ccaggatcgc tacgtcgacc gacgggtcgt aggacaccac cgtggcctcg 4112761 aagggcttgt cgccggcata caccgtgacg ttgttggatc cggccaccac gtgcgcgttg 4112821 gtcatcaccc gatcgggtga gatcacgaag ccggtgccct ccaacacttt ctggcatctg 4112881 ggtgccaggc tgcggatttt gacgacactt ggctcggtgg ccgccaccac cggattgttg 4112941 accagcgctg ggtcgggtga ggccactgga atgaccggcg tgcggctgaa cggctccaaa 4113001 accgcgggca ggccggaggt gttcagcagg gccgacagcc gcttgggcac cgtcttcagc 4113061 caggtgggtg ccgcctcgtt gacccgggcg agcacccgcg aacccttcac cgcggcagcc 4113121 agctcgggct gctctttcga ctgtgtcagc ggcatcgcca acaaccacgc cgcggtgagc 4113181 accacgacca gctgcacccc taccccaatg accgagtcga tcaaccggat cggccggtta 4113241 cggatcgccc cgcggacggc gcggcccagc accacaccag cgacctcgcc gactacgacc 4113301 agtgccagga tcaggaacag cgcggcaaac agtttggccc gcggagcgct gatttgactg 4113361 acgatatgcg gcgccagcag cacgccggct gtcgcgccca gcagcacccc gccaaacgac 4113421 agcattgagc ccagcgcacc ggcacgccag ccggagatgg ctgcaataaa tgcgaccgcc 4113481 aagacggcga tatccagcca ctgcgacggg gtcatcgaat tcatcgcggg tcactctcgt 4113541 cgtcgatcag caccattgcc gcgtccaact cgcggatgtc accggtgtcc cagggttgtg 4113601 cccagcccgc gacatcgagc accgcggaaa tcacctggcc agtgaagccc cataccagca 4113661 tctggtttaa caggaacgcc ggcccggccc agcgacgagt gtgcgggcgg cggtacacca 4113721 tgagccgatt ggccggattg atgaaggcgc gcaccggtac ccgcgcgacg atcgccgttt 4113781 cggcctcgtt gacgacggcc accggcccgg gatccggcga gtacgccagc accgggacaa 4113841 catggaaccg cgacggcgca atgaacgtcc gctccatggt ggccagcgga tgcagcctgg 4113901 acgggtcaat cccggtttct tcgttcgcct cacgcaaggc ggtggccacc ggcccgtcgt 4113961 cggcggggtc gaccacaccg ccgggaaaag ccgcctggcc ggcatggtgg cgcaatgtcg 4114021 aggcccgcac ggtcagcagt aggtcggcgt cgtctgggac accaccgtcg cctggcccgg 4114081 cctccgggcc agaaaacagc accagaacgg ccgcctcgcg gtgatcccgg cgcgacgatg 4114141 tcattgccga cacagccccg gcggccgtca ccatcgctag cacatcggcg ggcaaccgac 4114201 gccggtaggc gtcgggtatc tggccaacgt tgtcgaccag tggacgcagc caggacgggc 4114261 cggcatcagg ccgcagggca accgtccccc gtgaaccggt gggggtcgct cccgcttgca 4114321 ggggggtacc cccagcactc atcggcgcct cctttgggtc caaagttgcc cagctcctct 4114381 tcaagccgct aacccggccc acatcaccgc cgagtggagc ccacctgctc agagcaggcc 4114441 ggaccggcta cgcggcccgc accgcaaccg tactcatccc gcgtcgttcc cgaccgcagc 4114501 cacgatctcg tcggcactgc cgaaagcccg cggcagggtc tgggcaacgc taccgtccgg 4114561 ccgcagaacc accgtcgcgg gcatcacatt tgcgacccgc agcgcggccg ccaccctgcg 4114621 gcggtcatcc tgcagcgtcg gcaaccggac gccgagatcg gccagccgcg acagcgcggc 4114681 cgcctcgttc tggccctgat gcaccgtcac gaccagcacg gcgggcccga cccgtcgttg 4114741 atattcggcc atcacgggca gctcggtcat gcacggcgcg caccaatgcg cccacagatt 4114801 gatgaccacc cgacgtccgg ccagcgcgcg ggcgacgtcg acggccgaac cgtcgcccgc 4114861 acacaccacc acaacaccgc gtagtgccgc cgcgcccgga ccgttacctg ccgcgggaca 4114921 gggcggcagg tttgcgcgct gccgggacca agccaatgct tccggggtat cgccgtcgcg 4114981 atgttcgcgc ggggcgggcc gctggctgat cgtgctcgag gcggaatagt catgcagttg 4115041 ggcaaccagc gccgccatca gcgctgccac caccgccagg atcgcgatgg tccagcgggt 4115101 ctttccggtt aacgtcgtca ttgcggtctc agcgggggtt gttggcaggc ttggcattac 4115161 agtccagcca gggccagcag gtgatcggtc tcggggccct ggaccagggg cgccgcgagc 4115221 agcggttcag tggggccaag cccgaaggag gggcagtctt tggcaagcac acaaacacca 4115281 cacgccggtc tgcgggcgtg gcacacccgc cgtccgtgaa agatcactcg gtggctgagc 4115341 aaggtccact ccttgcgttc gatcagctca ccgaccgcct gctccacctt gaccgggtcc 4115401 tctgcggtgg tccagcgcca ccggcgcacc aatcgtccga aatgagtatc caccgtgatt 4115461 ccggggatac cgaatgcgtt acccaggatg acattggcgg ttttgcgccc caccccgggc 4115521 agcgtcacca acttgtccat ggtggccggc acctcaccgc caaaccgctc aactagggcc 4115581 tgccccaggc cgatgagaga ggccgctttg ttgcggtaga agccggtggg gcggatgagg 4115641 ctctcgagct cggtgcgatc cgcctgggcg tagtcccgtg ccgtccgata ccgcgcgaac 4115701 aaggctggcg tcgtcaaatt cacccgtttg tcggtgctct gcgccgaaag tatggttgcc 4115761 acggctagct cgagcggcgt ggtgaagtcc agctcgcagt atacgtgcgg aaatgcctgt 4115821 gccaaagcgc gattcattcg ccgcgcccgt cgcaccaagg cgagccgggt ttctgcagac 4115881 cagcgcccgg gcacgtcggc ggcacgcgcc gctggcttcg atctggatga cttcgccgct 4115941 gtcacctacg acagagtact gatttcgtga tctcactgag acctcgtgtt gattcgaagc 4116001 catgtttact ctccttgtgt catggttgct cgtggcctgc gttcctgggt tgttgatgct 4116061 ggcgaccctc gggttgggac ggctggaaag gtttctggcc cgagacacgg tcacggcgac 4116121 cgacgtcgcg gagtttctcg agcaggccga ggccgtggat gtgcatacgc tcgctcggaa 4116181 tggaatgccg gaggcgctgg attacctgca tcgacgtcaa gcccggcgaa tcaccgattc 4116241 accgccgctt gggtctggcg ctgggccacg gtatgccggg ccgctgtttg tcaccgatct 4116301 cgatagcccc gtcgagccac cccggcatgg ccagcccaat ccgcagttta gaacggctcg 4116361 acacgcaaat cacgtgtagc gttggcacgg cgaaccggtt ggcctacctc tagactcttc 4116421 tcgttggcaa acggttagtg tgcccgtatc acttcgtcgg aaagttgaag aggcaacgtg 4116481 gacgagatcc tggccagggc aggaatcttc caaggcgtgg agcccagcgc aatcgccgca 4116541 ctgacgaaac agctgcagcc cgtcgacttc ccccgtggac acacggtctt cgcggaaggg 4116601 gagccgggcg atcggctgta catcatcatc tcggggaagg tcaagatcgg tcgccgggca 4116661 ccagacggcc gagaaaacct gttaaccatc atgggcccgt cggacatgtt cggcgagttg 4116721 tcgatcttcg acccgggtcc gcgcacgtcc agcgcgacca cgatcaccga ggtgcgggcg 4116781 gtgtcgatgg accgcgacgc gctgcggtca tggatcgccg atcgtcccga aatctccgaa 4116841 cagctgctgc gggtgctggc ccgccggctg cgccgcacca acaacaacct ggccgacctc 4116901 atcttcaccg atgtgcccgg tcgggtggcc aagcagctgt tgcagctcgc ccagcgtttc 4116961 ggcacccagg aaggtggcgc attgcgggtc acccacgacc tgacacagga agaaatcgcc 4117021 cagctggtcg gggcctcacg cgagacggtg aacaaggcac tggctgattt cgctcaccgc 4117081 ggctggatcc gccttgaggg caagagtgtg ctgatctctg actccgaaag actggcccgc 4117141 cgagcgaggt aagcgcgcgc cgcgcgggcg caaccgagcg agctagcttc ctcacgccca 4117201 gcagacacag agtcgcacgc aaacgacgga ttttgtgcga ttgtgcggct gctcgcgcta 4117261 ccgagtccgc agatagtcca gttgtgcctg caccgaccat tcggccgcat tccaaagctt 4117321 ttcgtcaacg tcgaggtaga cgtgttcgac gacctcgcgg accgtggcgt cgtcaccgag 4117381 atcccgcaac gcggcgcgta tctgctccag acgttcgtgc cggtgcagca ggtatcccga 4117441 tgcaatcgct tccaggtcga gcaagtccgg cccgtgcccc ggcagcacgg tccgccggcc 4117501 caggccacgc agccggtgca gcgattccaa gtagtcggct aggctgccgt cttccttgtc 4117561 gatgacggtg gtcccgcaac ccaacacggt gtcggcggtc aacacggcgt cgtcgaggac 4117621 aaatgacagc gaatctgcgg tgtggccagg ggtggccaac acggtaatgg ttaacccggc 4117681 aacgtcgatc acttccccgt cggtcagcgt ctccccatca cgtcgcaaga actgcggatc 4117741 cgcggcccgt accggcgccc cggtcagcgc gaccagtttg tcgatgccgc tggtgtggtc 4117801 gccatgacga tgactgatca gtaccaacgc gatgcggcca agcgcggcaa cccgtgccag 4117861 gtgctcgtcg tcgtccgggc ctggatcgac aacgaccagc tcgtcactga gcgggccgcg 4117921 cagcacccag gtgttggtgc cgtccaacgt cagcaaaccg gggttgtcgg ccaacaggac 4117981 cgacgcggtg tcggtgaccg cgcgcagctg gccgtaggcg ggatgggtca gcgactcagc 4118041 tgtcttcgac atcggccgct agccgacctc cacgatcaac tcgacttcca ccggcgcatc 4118101 caacggtagc tcggatacgc cgaccgccga acgcgcatgc gcgccgctat cgccgaacac 4118161 ctcggccagc agatcggagg ccccgttgat cacgctcggc tggccgtgaa accccggtgc 4118221 cgaagcgaca aacccgacga ctttgaccac ccgggtcacc gcgtcgagat ccaccagcga 4118281 atcaacggct gccagcgcat tgagcgcgca gatccgcgcg agcgtcttgc cctcctccgg 4118341 gttgacgtcg gcgccgagct tgccggtccg caccagcttg cctgcctcca acggcagctg 4118401 gcccgcggtg tagaccaggt tgccggtgcg cacagctgga acgtaggccg ccagcggcgc 4118461 cgccacttgc ggtagcgtga caccgagttg ccctaatcgg gctttagcgc tcattaaccc 4118521 cgatacctcc tacttcgggc gcttcaggta agcgacgtgc tgctcaccgg tgggcccggg 4118581 cagcaccgcc accagctccc agccatcggc tccccactgg tcgaggatct gtttggtggc 4118641 gtgcgtcaac agcgggaccg tggcgtactc ccatgcggtg ggttgggtca tgacgcgagc 4118701 ttatcggtcg gactggaccc gctccgctca gcccggtagc ccggaaagat cgccaggcca 4118761 tcgggctagc atgccatggt ggcaaccaca tctagcggcg gtagttccgt cggctggccg 4118821 tcacgcttgt cgggggtccg actgcacctt gtcaccggca aaggcggtac cgggaagtcg 4118881 acgatcgcgg ccgcgctcgc gctgacgctg gcagcgggcg gccgcaaagt cctactcgtc 4118941 gaagtcgagg ggcgccaggg gattgcgcaa ctcttcgacg tcccgccact gccctaccag 4119001 gaacttaaga tcgcgaccgc cgagcgcggc ggccaggtca acgccttggc aatcgacatc 4119061 gaggccgcct tcctggaata cctcgacatg ttttacaacc tcggtatcgc aggccgggcc 4119121 atgcgccgta tcggcgcggt cgagttcgcg acgacgatcg cgcccggtct gcgcgacgtg 4119181 ctgctcaccg gcaagatcaa ggagacggtg gtgcgcctcg acaagaacaa gctgccggtc 4119241 tatgatgcaa tcgtcgtcga tgcgcctccg accgggcgca tcgcgcgctt cctggatgtc 4119301 accaaggcgg tgtccgatct ggccaagggc ggaccggtgc atgcgcaaag cgaaggcgtg 4119361 gtgaagttac tgcactccaa ccagaccgcc atccatttgg tcactctgtt agaagcgctg 4119421 ccggtgcagg agacactgga agccatcgag gagcttgcgc agatggaact gccgatcggc 4119481 agtgtgatcg tgaaccgcaa catccccgcc catttggagc ctcaggactt ggcgaaggcc 4119541 gccgagggcg aggtcgatgc agactcggtg cgggccgggt tgttgacggc cggggtcaag 4119601 cttcccgacg ccgatttcgc cggcctgctt accgagacca tccagcatgc cacccgaatc 4119661 accgcacgcg ccgaaatcgc acaacagctt gacgccttgc aggttccgcg attggaattg 4119721 ccgacggtct ctgacggcgt cgaccttggc agcctctacg agctctcgga atcacttgcc 4119781 cagcaggggg ttcgatgagt gtcacaccga agaccctcga tatgggcgca atcctggccg 4119841 acacatccaa ccgggtggtt gtgtgctgcg gcgccggtgg ggtcggcaag accactaccg 4119901 cggccgcgct ggcgttgcgc gcggccgagt atggccgcac tgtggtcgtt ttgacgattg 4119961 acccagccaa gcgattggca caagcactgg ggatcaacga tcttggcaac acaccacaac 4120021 gcgtgccatt ggcacccgag gttcccggcg agctacacgc gatgatgctc gacatgcgcc 4120081 gcacgtttga cgaaatggtt atgcaatact ctggacccga acgggcgcaa tcgattctgg 4120141 acaaccagtt ctatcagacc gtcgccacat cgcttgccgg cacccaagag tacatggcta 4120201 tggagaagct gggccaactg ctaagccagg accgctggga cctgattgtg gtagacactc 4120261 cgccgtcgcg taacgcgctg gacttcttag acgcgccaaa gcgactgggc agcttcatgg 4120321 atagtcggct gtggaggctg ttactcgctc ccggccgggg catcgggcgg ctgatcaccg 4120381 gcgtgatggg attggccatg aaggcgttgt ccaccgtgct cggttcccag atgctggccg 4120441 acgcagcagc gttcgttcaa tcgctggacg ccacgttcgg tggtttccgc gagaaggcag 4120501 accgcactta cgcgttgttg aaacggcgcg gcacccagtt cgtggtggtg tcggcggccg 4120561 aacccgacgc actgcgcgag gcgtccttct tcgtcgaccg gctatcgcag gagagcatgc 4120621 cgctagcggg gctggtcttc aaccgcacgc acccgatgct gtgcgcattg ccgatcgagc 4120681 gggcaatcga cgccgccgaa acgttggatg ccgagaccac cgactccgac gccacatcgc 4120741 tggccgcagc ggtgctgcgt atccatgccg agcgcgggca gacagccaaa cgggagatcc 4120801 ggctgctgtc ccggttcacc ggagccaacc ccaccgtgcc ggtcgttggg gtaccgtcgc 4120861 tcccgtttga cgtctctgac ctggaagcgc tgcgggcgct cgccgaccag ctcaccacgg 4120921 tcggcaacga tgcgggccgc gcagcgggcc gctgaggaac cggcccatca gtgacggtcg 4120981 gcaacgatgc gggccgcgca gcgggccgct gaggaaccgg cccatcagtg acggtcggcg 4121041 acgatgcggg ccgtacaaca tctgaccggg atccggctat tgggcacaag ccagttccta 4121101 ttgggcacaa gccaattaga aatgaatggc ttttgctgta accaaaccgt aatcagaagc 4121161 gacgggaccg cggcacctat ccgcagtccc tgagtggcta tccggcggtg ccggtgcggc 4121221 gcttgcgctt ctcaaggtag tccgaccacg aaaccacctc gggatgttgc ttgagcagag 4121281 ccctgcgctg gcgctcggtc atgccacccc aaacaccgaa ctcgaccttg ttgtccagcg 4121341 catctgccgc acactcttgc attaccggac agtgacggca gatcaccgcg gccttgcgtt 4121401 gtgcggctcc tcgaacaaag agttcgtcag ggtcggtagt ccggcacagc gccttggata 4121461 cccacgcgat ccgctcttcc gcgtctacgc tgcgtaccac gttctgtgca gccgtgaggt 4121521 tagtccttcg agcggctgga cgggttcctg acacgagctg atcccttcct cccggccgcc 4121581 gtgtgcgacc gccctcctcg gaaacagccg atgctgcgag cgacgccaca ccatgcacat 4121641 cggtgttacc tgtatctcac tgatctgtat aagtcaggtg gtcgtgtgcc aattgcgcaa 4121701 cagtacgata acgctttttt gggacgagcg tgccgtcttg tctggatcgg ccgggggaaa 4121761 tgccgccgct tcggtcccgt ttacggggtc tgaccagtga cgcagccgca aatatcgcgc 4121821 ccgccccgat cccgcagtga ctcacccgcc cgcggaaaga ttctattgga ccgagcggca 4121881 cggtggagtg acaggaggtc gctactgtag tacgcatgcc cgagcgcctc ccggccgcga 4121941 tcaccgttct gaagctggct gggtgctgtc tgttggccag tgtcgtcgcc actgcgctga 4122001 cgttcccgtt cgcaggcggg ctagggctga tgtccaatcg tgcctctgag gtcgttgcca 4122061 acggctcggc ccagctgctc gaggggcaag tgcctgcggt atcgacgatg gtcgacgcga 4122121 agggcaacac gatcgcgtgg ctgtactcgc agcgccggtt cgaggtgccc tcggacaaga 4122181 tcgccaacac gatgaagctg gcgatcgtct cgattgaaga taagcggttc gccgaccaca 4122241 gcggcgtgga ctggaagggc accctgaccg gcctggcggg ctacgcgtcc ggcgacctcg 4122301 acacgcgcgg cggctcgacg ctcgaacaac agtacgtgaa gaactaccaa ctgctggtga 4122361 cagcccaaac cgatgccgag aagcgagcgg ccgtcgaaac cactccggcc cgcaagcttc 4122421 gcgagatccg gatggcactc acgctggaca agaccttcac aaaatctgaa atcctgaccc 4122481 gatacttgaa cctggtctcg ttcggcaata actcgttcgg cgtgcaggac gcggcgcaaa 4122541 cgtacttcgg catcaacgcg tccgacctga attggcagca agcggcgctg ctggccggca 4122601 tggtgcaatc gaccagcacg ctcaacccgt acaccaaccc cgacggcgcg ctggcccggc 4122661 ggaacgtggt cctcgacacc atgatcgaga accttcccgg ggaggcggag gcgttgcgtg 4122721 ccgccaaggc cgagccgctg ggggtactgc cgcagcccaa tgagttgccg cgcggctgca 4122781 tcgcggccgg cgaccgcgca ttcttctgcg actacgtcca ggagtacctg tctcgggccg 4122841 ggatcagcaa ggagcaggtc gccacgggcg ggtacctgat ccgcaccacc ctggacccag 4122901 aggtgcaggc accggtcaag gccgccatcg acaagtacgc cagcccgaac ctggccggta 4122961 tttccagcgt gatgagcgtg atcaaaccgg gtaaggatgc gcacaaggtg ttggccatgg 4123021 ccagtaaccg caaatacggg ctggatctag aagccggcga aaccatgcgg ccgcagccat 4123081 tctccctggt tggcgacggc gccgggtcta tcttcaagat cttcaccacg gccgctgctc 4123141 tggacatggg catgggtatt aacgcccaac tcgacgtgcc gccccgattc caggccaaag 4123201 gtctgggaag tggcggggca aaggggtgcc ccaaagagac ctggtgtgtg gtgaacgccg 4123261 gcaactaccg cggctcgatg aatgtcaccg acgcgctggc aacctcgcca aacaccgcgt 4123321 tcgccaagct gatctcgcag gtcggggtgg ggcgtgcggt cgatatggcc atcaaactcg 4123381 ggctgaggtc ttatgcgaat cccggcaccg cacgcgacta caaccccgac agcaatgaga 4123441 gcttggctga cttcgtcaaa cgacagaacc tgggttcgtt caccctcggc cccatcgagt 4123501 taaacgcgct ggagctgtcc aacgtggcgg ccacgttggc atccggcggc gtgtggtgcc 4123561 cccccaaccc aatcgaccag ctcatcgacc gcaacggcaa cgaagtcgcg gtcaccaccg 4123621 agacgtgcga ccaggtggtg cccgcagggc tggcgaacac cctcgccaac gcgatgagca 4123681 aggacgccgt gggcagcggc acggcggccg gttcggccgg cgcggcgggc tgggatctgc 4123741 cgatgtccgg caaaaccggc accaccgagg cgcaccggtc ggccggcttc gtgggcttca 4123801 ccaaccgcta cgcggcggcg aactacatct acgacgactc cagctcgccg acagatctgt 4123861 gttccggccc gctgcgccat tgcggcagcg gcgacttgta cggcggcaac gagccatccc 4123921 gcacctggtt cgccgcgatg aagccgatcg ccaacaactt cggcgaagtg cagctaccac 4123981 cgaccgatcc acgctatgtc gacggcgcac caggctcacg ggtaccaagc gtggccggtc 4124041 tggatgtcga cgccgcacgc cagcgcctca aggacgcggg cttccaggtc gccgaccaaa 4124101 ccaactcggt caacagctcc gccaagtatg gtgaggtggt cggaacgtcg cccagcggtc 4124161 aaacaattcc gggttcgatc gtcacgatcc agatcagcaa cggcatcccg ccggctccgc 4124221 ctccgccacc gctgcctgag gatggtgggc cgccaccgcc ggtcggatcg caggtggtgg 4124281 agattccggg gctgccgccg atcaccattc cgctgctggc gccaccaccc ccagcgcctc 4124341 ccccgtaggc cctcccaatc ggcctcgtgc cgctgcagac gcgcgatcag acctcgaccg 4124401 gcagtaggct gcgtgcatgg ctgctgtctt gcccaccttg atccgcaccg gcgccgtggc 4124461 gttgggctcg gccatcgccg ggattggtta cgctgcgctg gtcgagcgca atgcattcgt 4124521 cctgcgcgag gtgaccatgc cagtcttgac tccgggctcc acaccgctgc gggtgctgca 4124581 catcagcgat ctgcatatgc tgcccaacca gcaccgcaaa caggcctggc tgcgcgagct 4124641 cgccagctgg gagccggatc tggtcgtcaa caccggtgac aacctggctc accccaaggc 4124701 ggtgcccgcc gtcgtccaaa ccctgagcga tctgctgtcc cggccgggtg tcttcgtgtt 4124761 cggcagcaac gactactttg ggccgcgcct gaagaaccca atgaactatc tgaccagccc 4124821 ggatcaccgc gtccgcggag cagcgctgcc ctggcaggat ctgcgggcgg cgttcaccga 4124881 acgtgggtgg ctcgacctaa cccatacccg ccgcgagttc gaagttgccg gtctgcacat 4124941 cgccgctgcg ggcgtcgacg acccgcatat cgaccgagac cgctacgaca ccatcgccgg 4125001 cccggccagc ccggccgcca acctgcggct ggggctcacc cattcaccgg agccgcgggt 4125061 gttggaccgc ttcgccgccg atggttacca gttggtgctg gccggccaca cccacggcgg 4125121 gcagctgtgc ctgccgttgt acggggcgct ggtcactaac tgcggtctgg accgctcccg 4125181 ggccaaagga gcgtcacact ggggtgcaaa catgcggctg cacgtctccg ccgggatcgg 4125241 cacttcgccg tttgcgccgg tgagattctg ctgccggccc gaagcaaccc tgctgacgtt 4125301 gatcgcgacc ccaatgggcg ggcgcgattc gagcagcaac ctgggccgct cacagccgac 4125361 agtgtcggtg cgttgagcgg cggggcctgt atcgcggtcc gcagcctatc ccggagctgg 4125421 acggacaacg cgatccggtt gatcgaggcg gacgcccgcc gtagcgccga cacccacctg 4125481 ctgcgctacc cactgcccgc tgcctggtgc acggatgtcg acgtcgagct gtacctcaag 4125541 gacgagacga cccatatcac cggcagtctc aaacaccggt tggcacgttc gttgttcctc 4125601 tatgcgctat gcaacggctg gatcaacgag aacaccacgg tggtggaggc atcgtcgggt 4125661 tcaacggcgg tgtccgaggc ctatttcgcg gcgctgctgg gtctgccgtt catcgccgtg 4125721 atgccggccg cgaccagcgc ttccaaaatc gcgttgatcg aatcacaagg tggccgttgt 4125781 catttcgtcc agaattcaag tcaagtgtac gccgaggcgg agcgcgtcgc caaggaaacc 4125841 ggcggccact atctggacca gttcaccaac gcggagcgcg caaccgactg gcgcggcaac 4125901 aacaacatcg ccgagtcgat ctacgtgcaa atgcgcgaag agaagcaccc caccccggaa 4125961 tggatcgtcg tgggtgcggg caccggcgga accagcgcga cgatcggccg ctacatccgc 4126021 taccgacggc acgcgacccg gctgtgcgtc gtcgatccgg agaattccgc gttcttcccc 4126081 gcgtactccg aaggccggta cgacatcgtc atgcccacat cgtcccgtat cgagggcatc 4126141 ggccggccgc gggtcgagcc gtcgtttctg cccggtgtgg tcgaccgcat ggtggcggtc 4126201 cccgacgcgg cgtcgatcgc tgccgcccgg catgtcagcg ccgttctggg gcgccgagtg 4126261 ggaccgtcta ccggcaccaa cctctggggc gcgttcggac tgctcgccga gatggtcaag 4126321 cagggccgca gcggctcggt ggtcacactg ctcgccgaca gcggcgatcg ctacgccgac 4126381 acctactttt ccgacgagtg ggtcagtgcc caggggctcg atccggccgg gccggctgcg 4126441 gcgctggtgg aattcgagcg ctcctgtcga tggacgtgac ggtcggacct gcggtttggc 4126501 tagtcaacgg tccggtgcga taggctgtcg tggcttcaag cggggtgtgg cgcagcttgg 4126561 tagcgcgctt cgttcgggac gaagaggccg tgggttcaaa tcccgccacc ccgaccgaga 4126621 gatcgctgac gacagcctta cccggcgcag cgtggtagct tgctgcagtc tgctcgggcg 4126681 gcagcgccac cctgacggtg ctggttgacc atgccggaca gcacgtcaac gcacaggcat 4126741 ttccaacgga agttgtaggt taccggccgc cctaaaacac ggtgcacttt tcgttaaagg 4126801 ttgtgggtgt ggatccaacg aaattcgttg ccccggcgtg ggcagcgccg tgtccacagg 4126861 gggacccgcc gcgcattacg cctatgggcc cacccccgta ccgcgggagt tggctctgca 4126921 ccccgagcca atcatgcttc tctcggagtc cgacgcggga ctgggacgac tcgcatgagc 4126981 cggacgcctc ctgcctgacc cccacctgct aggaacgtaa accgggagag tttcgtcgga 4127041 gccagaattg gatttcctcc ccgagcaatc ggcccgaaac cgcggggttg tttccgccga 4127101 ccgtcgacaa catgtggcgt gcgttggatg actgggaaat gtatctccac gacgcagcgc 4127161 cacaactgcc gctcttgatc cgttgcgccc tggtgcatta ccaattcgag gcgatcgggc 4127221 catttctcga cggcaacgca cgactcgggc gtctgttcat catcctttgc cttgttgcat 4127281 tgggacggtt gccgctaacg ggcggggcga aaccgcaccc gagtgccgcg gcggggcaca 4127341 agcatgatgg agcgccgcac gatccgctct ggctcgtcgt cgacggcggt gaactcgccc 4127401 tcgcgcagca gcacgtgcaa tacggtgatc aactctcgca tcgaaaagtt cgcacccaga 4127461 cagcgtttca cgccgccgcc gaacggaacc caggcatagg tttgcggccg cgtaccgagg 4127521 aaccgctcgg ggcggaactc gtgtgggtgc tcatacacct cggcgctgcg gttgatcgcg 4127581 atgatgtgga ccacgattcg tgtgccagcc tccacacggt aaccgccgat ggttagtggt 4127641 tgcgcggcga cacgagccgt caacggcgcg ggcggacgca cccgcaacgt ctcgttgatc 4127701 accgccgtcg tgaaggcttc cccaccgcca acggcctccg ctcgcacgcg ccgcaacgcg 4127761 tccggatggt gcagcagcaa gtcgaacgcc cacgccaacg tggtcgccgt ggtttcatgc 4127821 cccgccagca cgagggtgat cagatcgtcg cggatctcgc tgtctgacaa ctgttccccg 4127881 gactctccgc gcgcgctcac gagcaacgac aggacgtcgt gtcgctcgcc caggcgtgga 4127941 tcggcgcgcc gctgcgcaat gagcgccatg acgacgtcgt cgatctcggt gttggcgcgg 4128001 gcgcgtgcag gccagactcg tagtgcgccc aaccgacgca gtgcgtagcg cacggtcaac 4128061 tgctctgaaa caccaagatt caacagccgc tcgaacggcc ggcccaagcg ccggacctcc 4128121 tcggggtcgt cgaccccgaa tatgaccttg acgatcacat ccagcatcag cgaccgcgcc 4128181 accgtcaaca tcgcaaacgg acggtcaacc ggccatgtat gcatcgccgc gcgagtggag 4128241 ttctcgataa tcggaacgta acgatccagc gcagcgccat gtaatggcgg cgtcaagagt 4128301 tttcgacgtc gaagatgctc cggctcctcc tggacaaaca tcgaccccga cccatagatc 4128361 gccgctgccg gccccacccc ctcgcccccg agcaggacgt cggtgggagc ggtgaaaacc 4128421 tccttggcca gcgccgagtc ggacacgatc gcaacgtcac ccaggctgag aatgggcatc 4128481 gtcatgatcg gtccgtaccg acggatcagt cgcagcatcc ggcgctcgcc acccgccagg 4128541 taggcaaccg cgtaggcggc cgcgaaggcc gcgcgaaatc cacggggcgc cggcaagccc 4128601 ggtgggccgc ccaaagcatc cggcgcgtgc tcccggcgaa ccgcaaacgc tgccacgcct 4128661 acgacagaag cacagcgttt cgggtcggtc aacgcagcag ggctagcaag cgacctcagc 4128721 accatcggtt cccgaaggtg cggtccggcg ctaccgcgtc gaaaatcgca gaccgcgcca 4128781 gccggttggg aatgaggccg tttcaccggc gggcgtcccg cgcagcgttt cgccgcagac 4128841 cctatgttgg ccatgcgcga tataggccac ccggcaccaa ggtgccatga ccgccacaac 4128901 cagggccgcg gcggcaaccg ccaggtgtcc gatcgtcagc gcaactaaac ccgcaaccag 4128961 cccgacagcc cacacggcag ccaccaccag ccccggcaca ttgacactcg cggggacgct 4129021 gccgctaccg cttggctgcg gcgcggatgc atgatcaccg gcgtcactcc cggtgtagac 4129081 catgaccact cccagcgata aaaggttgcc gatcaaggta acccatacag gccgtcggca 4129141 gccaccggcg aacagctctt cgaggatgcc gtcaggacat tgacagctac cagacaccat 4129201 ttccacaccg tcaaaatgtg gcgcgtgaca cgcacggcgg cacgctggca acgtggcgtg 4129261 cgccgcaggc ctcgactatc tggtgccgat cacagcatgc atgctcgtcg gtttgtacgg 4129321 cgttaccggt cgatcctgcc gccccgtacc ccagtcaagg catcgtggag cgtggaaaac 4129381 aaccgaaagg tcttgtccag acccatcaga tgaatcggcc ttctggttac cgaaccccgt 4129441 gcaaccaccc cgaacttgac ggattggcct atcttttcgg atgttgccgc caaaatcttc 4129501 agccccaccg accccagaaa ctccaccgcg gaaaggtcga tgactagcgc cgtcggattg 4129561 tcggccacaa cttcgccgat ggcctcttca agtgccgcag cggtgatcaa atcaatctca 4129621 ccaccgatgc tgagcacggc gaccccgtta tggtcggcaa ccgtgacggt gatcgagtcg 4129681 ggagctgaca atggcgatcc tcttgtccga gccgtccgtg tggtgaaagc ctagcccgcc 4129741 tgcgaactgc ggcggcggcc catcagcgta ggatttgccg gctgcacacg acctgtgtgc 4129801 gggccgcaat cgcggcgaag gcgctggggc gtgggtgaat tgcctaacaa ccctcgagtg 4129861 cggacacgca tatagcctcc gtcgaaattg gcctataggc gttccttgac cgccgccgac 4129921 aagcgtgcgc cgtcggcttt cccggcggcg atcacggtgg ccgccttcat taccagaccc 4129981 atctgtttca tgctgggccg atgcccgagt tcttcggcca cctcggctat ggcggtgtcg 4130041 gcgacatcgg ccagttcccc ctcggtgagc ggcgtcggga ggtactcgtc aatgatccgt 4130101 gcctcggcat gctcggtggc ggcgagctca ccgcggccgt tttgggtgta gatctccgcc 4130161 gcctcaccac gcttgcgcga ttccctggcc aacaccttga tcacctcgtc gtcggagagc 4130221 tctcttgcct gcttgccaga gacctcctcg gtctggatcg cggccagcag catgcgtatg 4130281 gtcgcggtcc gcagcttgtc ctgcgtcttc atcgcttggg tcaggtctga ccgaagctgg 4130341 gatttaagtt ccgccattgc acaaacgcta cgcgccgcaa cgcccgaaac ccgacactga 4130401 gacctacatt gagaaatgca ccgaccgccg acaggatgga ggccatgacg aacgacgaca 4130461 gctgctgcgt ccggtgagca tgctgccgcc tggctacccg gttgaaccac cgcccgtggc 4130521 gccgggatat gcgccggccg gatatccgcc ctaccccgct acaccacccg ggtacggccc 4130581 gccgggttat ggtgcgccgc ccagctatgg ccccccgcct ggctatggtc cacccctcgg 4130641 ctaccccgcc gcaccgcccg gctgcggccc accgcccggc tatggcccac cgctcggcta 4130701 tggcccaccg gtcgccccgg gcgcggtcaa accaggaata atcccgctgc ggccgttgac 4130761 cttgagcgat atcttcaacg gcgcggtcgg ctacatccgc gctaacccga aggcgacgct 4130821 gggattgacc gccatggtcg tggtgaccct gcaaatcatc tcactggtgg ccctatttgg 4130881 ccccatgacc gccttcggtg acatcgtgac cggggagccc gacgagctga ccggcgcggt 4130941 ggtgggcggt tggtcagcgt cattcggcgc cagtctcctg gtcagctggc tagcgggtgt 4131001 gctgctcagc ggcatgctca ccgtcatcgt cgggcgggcc gtgttcggtt cgccgatcac 4131061 cgtcggcgag gcgtgggcca aggttcgcgg tcgcctgctc gcgttgttcg gcctggcact 4131121 gctggaagca gccggcgtgg tggcggtgct cgggctggcg gtcgtcatac tttccggggt 4131181 cgcggcggcg gccaacgagg cagcggcggc cctcctcggc ttcccgctgc tgctcgtggt 4131241 tggggtgtcg ctggcctatt tgtatgtcgt cctgctgttc gcacccgtgc tgatcgtgct 4131301 ggagaggctg cccatcgtcg aggcgatcac cagatccttt gcgctcgtgc gtcatggctt 4131361 ctggcgggtc ctgggcatcc gcctgctgac ggtgctggtg gtgggcgtag ttggtaatgc 4131421 gatcgcggct cctttcatga tcgtcggcga gatagtgacg gccgtcacag cgtccgacgg 4131481 gtcagtcacc atgcggctcg tcggcgctac gctctcggcc atcggagtga cgatcggcca 4131541 gattgtcacc gcgccgttca gcgccggagt tgtcgtgctg ttatacaccg accgccgtat 4131601 ccgtgccgag gccttcgacc tggtattgca gaccggctta gaagccggcc ccgccggcgg 4131661 gcccgccccg gtggagtcca ccgacaacct atggctcacg cggcctttct aaagggagtt 4131721 agtgaggaca ggctgacagt gccctccatc gacatcgacc gcgaagccgc acaccaagcc 4131781 gcacaacgcg agctcgacaa accgatctac cccaaagact ccctgaccaa ggaactcacc 4131841 gactggatcg acgagcagct gtaccggatt ttggagaagg gatcctcgat acctggcggt 4131901 tggttcacca tcaccgtgct gctcatcttg ctgatgatcg cggtgaccgc cgccgtccag 4131961 atcgcacggc gcaccatgcg caccaaccgc ggcggtgact accagttgtt cgacgccggc 4132021 caattgaccg cagcccagca tcgctccacg gctgaaagct atgccgccga gggtaattgg 4132081 gctgcggcga tccgccaccg gctacaagcc gtggctcgcg agttggagga gaccggcatg 4132141 ctcaacccgg ctgccgggcg caccgccaac gagctggcca gcgatgcggg cgaggtttta 4132201 ccgcatctgg caggggaatt gacgcaggcg gcaaccgctt tcaacgacgt cacctacggc 4132261 gagcggcccg gaacccaagg cgcctaccaa atgatcgccg acctcgatga ccatctgcgg 4132321 tcccgttcac cggccgtcgt atctgcagtg cagcacccgg ccgtgttcga ctcgtgggcg 4132381 caggtccggt gattcccaca cgtctcgcaa ccgtgcgccg ccgacggccg tggcgcgggg 4132441 tgttgctcac gctggccgca gtcgccgtcg tggcctcgat cggcacctat ttgacggcgc 4132501 cacggcctgg aggcgccatg gcccccgcgt ccaccagctc gacggggggc cacgcgctgg 4132561 cgacgctgct tggcaaccac ggcgtcgagg ttgtcgtggc cgactccatc gccgatgtcg 4132621 aagccgcggc acgccccgac tcgctgctgt tggtggcgca gacgcagtat ctagtcgaca 4132681 acgcactgct ggatcggctg gcgaaagccc ccggtgacct gttgctggtg gcacccacct 4132741 cacgaactcg tacggcgctg acgccgcaac tgcgcatcgc ggccgccagc ccattcaaca 4132801 gtcagccgaa ttgtacgctg cgggaagcta atcgggcagg atcggtgcag tgggggccca 4132861 gtgacaccta ccaggccacc ggcgacctgg tgttgaccag ctgttacggc ggggcattgg 4132921 tccgctttcg tgctgagggc cgaaccatca cggtggttgg cagcagcaac ttcatgacca 4132981 acggcggcct gctgccggcc ggcaatgccg cactggccat gaacctcgcg ggcaaccggc 4133041 ctcgtctcgt ctggtacgcg cccgaccaca ttgaggggga aatgtcttct ccgtcatctc 4133101 tttccgacct gattccggag aacgtgcact ggaccatctg gcaattgtgg ctggtggtgc 4133161 tcttggtggc actctggaaa ggccggcgga tcggtccact ggtggccgag gagttacccg 4133221 ttgtgatccg cgcgtcggag actgtcgagg gtcgcggtcg gttgtaccga tcccgtcggg 4133281 cgcgtgatcg cgccgcggac gcactacgca ccgcgacgct gcaacgcctg cggccccgac 4133341 ttggggtggg cgcaggcgcg ccggcgccag cagtggtgac aaccatagcg cagcgcagca 4133401 aagctgaccc gccgtttgtt gcctaccatt tattcggccc ggcaccggcc accgacaatg 4133461 acctgttaca acttgcccgt gcgctcgacg acatcgaaag gcaggtcacc cactcgtgac 4133521 acagtccgcg tccaacccgc aagctcctcc cacccaaacc cctggcgctg aattgcccgg 4133581 ctatcccccg caagcgggtg gtgcccctac agcggcccct tccgggccgc atcctcaccg 4133641 ggctgaagca gaatcggcac gtgatgcatt gctggcatta cgcgccgagg tcgccaaggc 4133701 cgtcgtcgga caggacgggg tgatcagcgg cctggtgatc gctctgttgt gccgtgggca 4133761 cgtgctcctg gaaggtgttc caggagtggc gaagacgctg attgtccgcg ctatgtccgc 4133821 cgctttgcaa ctggagttca agcgggtgca gttcacccct gacctgatgc caggcgacgt 4133881 caccggttca ctggtctacg atgcccgcac cgccgagttc gtgttccggc cgggcccggt 4133941 gttcaccaat ttgctgctgg ccgatgagat caaccgcacc ccacccaaga cgcaggccgc 4134001 gctgctcgag gcgatggaag agcgtcaagt cagtgtggag ggtgagccta agccgctgcc 4134061 caacccgttc atcgtcgccg cgacgcagaa cccgatcgaa tacgagggca cctatcagtt 4134121 gcccgaagcc caactggatc gtttcctgct gaaactgaat gtgacactgc cggcacgcga 4134181 ttccgagatc gccatccttg accggcacgc gcacgggttc gacccgcgcg atctatccgc 4134241 gatcaatccg gtggccgggc cggccgagct ggcggctggc cgcgaggcgg tgcgccacgt 4134301 gctggtcgct aatgaggtgc tgggctacat cgtcgacatc gtcggggcca cccgctcctc 4134361 gcccgcacta cagctcggtg tgtcgccgcg tggggcaacc gccctgctgg gcaccgcccg 4134421 gtcctgggcg tggctgtccg ggcgcgatta cgtcaccccc gacgacgtga aggcgatggc 4134481 ccgaccgacg ctacgccacc gggtgatgct acgcccggaa gccgagctgg aaggcgccac 4134541 acccgacggc gttctcgacg gaattctggc ctcggttccg gtgccccgct agtgatccgt 4134601 gtgatcggcg ccggcgacga tgcagtgggg gcaccacccg cttgcggggg acgaagcgat 4134661 ggggtggggg tacgccccca caagtgggag gtacccccac ccgcttgcgg gggagagcgg 4134721 cgcagatgat cctaaccgga cgcaccggct tgctggccct gatctgcgtc ctgccgatag 4134781 cgctgtcccc ttggccggca agggctttcg tgatgttgct ggtggcgctt gcggtagcgg 4134841 tgaccgtgga caccctgcta gcggccagca cccgtaagtt gcgctttacc cgctcgccgt 4134901 atacctccgc ccggctcggg cagcccgtgg acgcgagcct gctgctctgc aatgggggcc 4134961 gccgccggtt ccgcggccag gttcgtgacg cctggccgcc cagtgcccgt gcgcagccgc 4135021 acacccacga tgtcgacgtg gctgccgggc agcgccagca ggtgcacacc gcactgcggc 4135081 cagttcggcg tggggaccag cgcgcagcaa tggtcacggc ccgttcgatc ggaccactgg 4135141 ggttggcggg acggcagagt tcacagtcgg tgcccggctt ggtccgggtg ctgccgccgt 4135201 tcctgtctcg caagcacctg ccgtcgaggc tggccaagct gcgggagatc gacgggctgt 4135261 tacccacgtt gatacgcggc caaggcaccg aattcgattc gctgcgcgag tatgtcgtcg 4135321 gcgacgacgt ccgctcgatc gattggcgcg cgagcgcacg ccgcgccgat gtcatggtcc 4135381 gcacctggcg gcccgaacgg gaccgccgag tcgtcatcgt gctcgacacc ggacgcatgg 4135441 cggcggggcg ggtcggtgtc gacccgaccg ccgccgatcc cgccgggtgg ccgcggctgg 4135501 actggtccat ggatgccgca ctgctgttgg cggcactggc gtcacgagcc ggcgaccatg 4135561 tcgacttcct ggcccacgac cggatcagcc gcgccggcgt gtttggcgcc tcgcgtagcg 4135621 aactgcttgc ccaactggtc gatgccatgg ccccgctgcg accggcgctt atcgaatccg 4135681 actggcatgc aatgattgcc accatcttgc ggcgcacccg gaggcgatcg ctggtggtgc 4135741 tgctgaccga cctcaacgcg accgctctcg acgagggcct gttgccggtg ctgccgcagt 4135801 tgtcggcccg acaccatgtg ctggtcgccg cggttgccga cccgcgcgtc gatcaactgg 4135861 ccgccgggcg gtccgacgcg gcagcggtgt acgacgctgc ggctgcggag cgcgcccgca 4135921 acgaccggcg tgcgatcgcg tcacaactgc gccgaggcgg ggtagatgtc atcgacgctc 4135981 ctcccgccga aatcgcaccc ggacttgcgg atcgctacct ggcgatgaaa gcgaccggcc 4136041 gcctctaatt tccgacctcc attgtgaaat gtgcgacgcc agcgcggcgt gtcgtgtcgc 4136101 gagtttcact ctcgggggag ttcagccggt cgggaccacg tcgggcgcgt cctccatgtc 4136161 gccggtctcc ccggcttgcg cggcacgacg accgaagtag ccgatgtagg acagaaacac 4136221 cgcctcggcg atgatcccga cggcgatccg aacaaacgtc ggcaacggcg acggtgtcac 4136281 caccgcctcg atcagacctg cgaccagaaa cacacccacc aagcccaccg cgaccgacac 4136341 gacaccacgt ccttgctcgg cgaggacctg tccgcgcggg cggttgcctg cagatatcac 4136401 cgaccacccc agccgcatcc caatcgccgc ggcgagaaag acggccgtca gctccagcag 4136461 cccgtgcgga agaagcaggc ccagcaggaa atcgcccttc cccgcctgga acatcagccc 4136521 ggcgatcagt ccgacgttgg cggcgttatc gaagagcacc agcggtatcg gcagccccag 4136581 cacaacagac atcgcgatgc acgtggtagc cacccaggag ttgttcaccc agacctgcag 4136641 agcgaacgac gcggccgggt gctcgctgta ataggactgg acgtcatggc tgaccaattc 4136701 gtctatctca gtgggcgtcc cgatcgcgga ctgcacctcg tgactgccgg ccacccagaa 4136761 cccgatcagc accacgacgg cgaaaaacgc caccgcagtc gccagccacc accgccaggt 4136821 acggtaggcc acgaccggga acgacactgt ccagaaccga atgaacgtac gggtcagcgg 4136881 tgcgtgcgcg cctgtgaccg cggaccgagc ccgcgcgact agactcgaca gccgaccggt 4136941 catcaactgg tccgacgaag ccgatctgag catcgacaga tgcgtggaca cacgctgata 4137001 tagctcgacg agttcgtcga tttcggctcc gctcagtgaa tggcgcttct tgatcaagtg 4137061 gtcgagccgg tcccacgtgc cgcggttggt cagcaagaac gcgtcgacgt ccaccctgcg 4137121 cagcctacct aagccgccga gcgtgagcgg tggccaatgc cgagtgcagc agagcaccgc 4137181 accaaagcct gtagcgtttg ttggtatgtc ggaggtggtg accggcgacg ccgtggtgct 4137241 cgacgtacag atcgcccagt tgccggtgcg cgcggtcagc gcggtcatcg atatcaccat 4137301 aatattcatc ggctacatcc tcggtctgat gctgtgggcg accgccctga cccagttcga 4137361 cgaagccttg accaccgcat tcctgatcat cttcacggtg ctggcgctgg tcggctatcc 4137421 cctggtctgg gaaaccgcaa cgcggggccg atcagtgggg aagatcgtga tgggtctgcg 4137481 ggtggtgtca gacgacggtg gcccggagcg gttccggcag gcgctgtttc gcgcgttagc 4137541 gtcggtggtg gagatctgga tgctgctcgg gagccccgcc gtgatctgca gcatgttgtc 4137601 gccaaaagcc aagcgagtcg gcgacgtctt cgcgggcacg gtcgttgtca gcgaacgtgg 4137661 tccgcggttg gggccgccgc cggtgatgcc accgtcgctg gcctggtggg cgtcgtcgct 4137721 gcaattgtct gggcttaccg ccggccaagc cgaggttgca cgtcaatttc tggtgcgggc 4137781 accgcaactc gatcctgcgc tacgcgagca gatggcctac cggatcgccg gtgatgtggt 4137841 tgcccgcatc gctccgccgc cgccacccgg agttccacca cagttggtcc tggccgccgt 4137901 cctcgccgaa cgacaccggc gtgaactgtt gcgactgcgt cccacgctgc ctcccgcagg 4137961 acaggcgcca tgggcccaaa tggcgcctca tcggggttgg ccgcccggtt tgtccggcgc 4138021 cacgccgtgg tctcctcagc agccggtgat cccctggccg gagccagatc cgccaccgca 4138081 agccgctccc tggccgcagc aggcgccgga cggcccggga ttctcgccgc cgggctagca 4138141 gctagtcttc gctgcgccgg atcccccgag cgtgcggaca tgttcaggcg cacagcgaaa 4138201 gctaggacac gtcaacccaa tccagggtcc gctgcaccgc cttgcgccag ccggcataac 4138261 ccgcggcacg ctcgtcgtcg tcccacgtcg gtgtccaccg cttgtcctct cgccagttgg 4138321 cccgcagatc ggacggagcc gcccagaacc cgaccgccaa gcccgccgcg taggccacac 4138381 ctagtgcggt ggtctcggcg accaccggcc gcaccacatc cacacccaac acgtcggcct 4138441 ggatctgcat acacaggtcg ttgccggtga tcccgccatc caccttcaac acctgcaggc 4138501 gaacaccgga gtctgcttcc atggcgtcca ccacatcgcg gctctggtag cagatcgcct 4138561 ccagcgttgc gcgcgccagg tgcgcgttgg tgttgaaccg cgacaacccg acgatcgcgc 4138621 cgcgcgcatc ggaccgccag tatggcgcga acagcccgga aaacgccggc acgaaataca 4138681 tgccgccgtt gtcggggacc tggcgggcca gcgcctcact ctgtgcggcg ccgctgatga 4138741 tgcccagctg atcgcgtagc cactgcaccg ccgagccggt caccgcgatc gaaccttcaa 4138801 gcgcgtacac gggtttagcg ttcccgaatt ggtagcacac cgtggttagc aggccgttat 4138861 tcgatcgcac gatcgtttca ccggtgttca gcagcagaaa attgccggtc ccataggtgt 4138921 ttttcgcctc ccctggggcc agacagactt gaccgaccat ggccgcatgc tgatcaccga 4138981 gaactccggt gatcggcacc tcaccgccga caggcccggt cgccagcgtg acaccgtaag 4139041 gctccgacgg cgccgacgat gcgatctcgg gcagcatggc ccgaggtatc gaaaacaacg 4139101 acaacagctc gtcgtcccag tccagcgtct ctagatccat caacatggtc cggctggcgt 4139161 tggttacatc ggtgacatgc acaccccccc gcggcccgcc ggtcagattc cacaacaccc 4139221 aggtgtccgg tgtgccgaac aatgcgtcgc cgttctcggc ggccgcgcgg actccatcga 4139281 cattttccag gatccactgc agcttgccgc cagagaaata agttgccggc ggcaggcccg 4139341 ccttgcggcg gatcaggttt ccacgaccgt ctcgatccag cgccgacgcg atgcggtcgg 4139401 tgcgggtatc ctgccataca atcgcgttgt agtagggccg tccggtgtgc cgattccata 4139461 ccagcgtcgt ctcacgttgg ttggtaatcc ccaacgcggc aatatctttc ggcgataggt 4139521 tggtggcgtt gagcaccgag atcaacaccg acgcggtgcg ctcccagatc tcgaccgggt 4139581 tgtgctccac ccagccggcc cggggcagga tctgctcgtg ctcgagctgg tggcgggcca 4139641 cctcggcacc gtggtgatcg aagatcatgc agcgggtgct ggtggtgccc tggtcgatgg 4139701 cggctatgaa atccgaggac tcggccaatt gctctcctag gatggcgtcg gacactgcat 4139761 gtaatcgtcc atgatggtcc accgcagcgg cgggtccgac gccgtcagcc ggagaagggg 4139821 tcgcgaattc taatgccctc gaacttgcgg aagtcgcggt cgtgactcca gatcgtggcg 4139881 atgccgtgat ggcgcatgag cgcgacgagg tgggcgtcgg gaaccagatt gcctcgcggc 4139941 ttgaccgggt cggctactcg ccgatagacg ggccagaatc cgttggcctc gccgacctgc 4140001 cgcacgtgcg gtcgtgaggt gaattgctcg atgttttcga cggcgacctc aggcgccagc 4140061 ggcgcaccca acaacgtcgg atgggtgaca acccgtagat aacccagcgc gacgggccac 4140121 aatagatata ccagccctgg cccagccagg aatcgctcaa cgagcgtctt cgccttatcg 4140181 tgaaacgggc tggctcggtg cgtcgcatgg accagaacat cgacgtcaaa ggtttcgctc 4140241 acccacggtc caaaatcgcc caaacagcgt ccttgtcgtc aagatccaca cggggccgca 4140301 agtcggcagt cgaccagcgg atgtcaacgt ttggaggagg ctcggccgcc agagcttgcg 4140361 caagcaattc ggaggcgagc tgccctaacg ttttgcgctc ctcgcgctgg cgtcgtttca 4140421 acgcccgcag tatgtcgtca tcgaggtcga tcgtagtgcg catacatcag atgctaactc 4140481 gatatgcatc tgatgcgaac gatctcaccc ttcttgcgct gccggcacga aacctgttgc 4140541 atcagcaatg tgggcgaaga ggtaacgcgc accacatata gccgcgaaca tcagcgcgag 4140601 taccggcgca aggtgcggct gtgcttggac gtcttcgaga ccatgcttgc gcagaccagg 4140661 ttcgaggccg accggccact caccggcatg gagatcgaat gcaacctcgt cgacgccgac 4140721 taccagccgg ccatgtcgaa ccgctatgtg ctggatgcca tcgccgaccc ggcgtaccag 4140781 accgaattag gcgcttacaa catcgaattc aatgttccgc ctcgcccgct accgggacgc 4140841 acttgcctag agctggagga cgaagtccgc gccagcctca acgatgccga gaccaaggcc 4140901 agctgcagcg gagctcacat cgtgatgatc ggcatcttgc ccacactgat gccagagcat 4140961 ctgaccgacg gctggatgag cgcatcagcg cgttatgcgg ctctcaacga gtcgattttc 4141021 aaggcccgcg gcgaggatat ccccatcaac atcgccggcc cggaaccgct gagctgccat 4141081 gccggatcca tcgcacccga atccgcttgc accagtgtgc aattacattt gcagctagca 4141141 ccggcggatt ttccggctaa ctggaatgcg gctcaggtac tggccggacc gcagttagca 4141201 ctaggtgcca actcgcccta tttcttcggc caccagctgt ggtcggaaac ccgcatcgag 4141261 ctgttcacac agtccactga tgcccgtccc gaggagctga aatcgcgagg ggtgcgcccc 4141321 cgggtatggt ttggcgaacg ctggatcacc tccgtcctcg acttgtttca ggaaaacatc 4141381 cgctacttcc ccaccctgct acccgaggtg tccgacgagg accccctcgc agagctttcg 4141441 gctggacgca tcccacacct gtccgaattg cggctgcata acggcacggt gtaccggtgg 4141501 aaccggccgg tgtacgacgt ggtcgacggg cgcccgcatc tgcggctgga gaaccgggtg 4141561 ctacccgccg ggccgacggt cgttgacatg ctggcgaatc atgccttcta ctacggcgca 4141621 ctacgcggtc tgtccgaggc cgacccccca ttgtggacgc agatgaattt cgctgcggca 4141681 caagcgaatt tcctggcagc cgccaggtac ggcatggacg cccagttgga ttggccgggc 4141741 ttgggcgagg tgacgacgcg ggagttggtg ttgggcacgt tgttgccaat ggcacacgag 4141801 ggactgcggc ggtggggtgt cgacgcggag gtacgcgacc ggttcctggg tgtcatcggc 4141861 ggtcgcgccc agaccggccg caacggcgcg cgctggcagg tcgccaccgt ggcggcccta 4141921 caagacggcg ggctgacccg gcccgcggca ctggctgaga tgctgcgccg gtactgcgag 4141981 cacatgcaca gcaacgaacc cgtgcatacc tgggacacgt agtccacgag taggttggga 4142041 gccatgaccg acgaggtaat ggactgggac agcgcctacc gtgagcaagg cgccttcgag 4142101 gggccgccgc cgtggaacat cggtgaaccc cagcctgagc tggcaacgct gatcgcggcc 4142161 ggcaaggtcc gcagtgacgt gctagacgcc ggatgcggat acgccgaact gtcattggcc 4142221 cttgccgccg acggctacac cgtggtcggc atcgacctca cgcccaccgc cgtcgcggct 4142281 gccaccaagg ccgctgagga gcgcggtttg accacggcca gcttcgtgca ggccgacatc 4142341 acggagttcg cggcttatcc agccggctcc gccggccgct tttccacggt gatcgacagc 4142401 accctgtttc attcgctgcc ggtggacagc cgcgaccgct atctgagctc ggtgcaccgc 4142461 gcggcggccc cgggcgccag ctattacgtg ctggtcttcg ccaagggcgc cttccccgcc 4142521 gagctggaag tcaagccaaa cgaagtcgac gaggacgagt tgcgtgccgc ggtgagcaaa 4142581 tactggaaga tcgacgaaat ccggcccgcc ttcattcatg tcaatccggt cacgattccg 4142641 ccccagctgg ccggagcgcc agtcgaattc ccgccatacg atcacgacga gaagggtcgg 4142701 gtgaagttcc ccgcctatct actcaccgcc cacaaggccg gctgaggcta acgttcgccg 4142761 ctggtcgccg cggtcgccgc gaccaacgcc tcggcgaagg cgtccaggtc atcggcggtg 4142821 ttgtccacgt gcggcgagat ccgcagcacc ggcgccggca gttccagcgg tgcccgctcc 4142881 actccggcgt aggtggtcac gatccgccgc tgcgagagca accaggcccg caccgctgcc 4142941 gggtcggcgc cgtcgatcgg cgccagggtg gtgatcgcgc taggctcgtc gaccgcttcg 4143001 accacccgcc aaccggacac atcggcgagt acggtcctgg cgatgtcgcc cagctcagcc 4143061 aagcgtgccc gaatagcctg cggcccgcac gccagatgct caccgagtgc gaccgaaaac 4143121 cccactcgcg cagctacatt ggcttcgcca aatccgagtt gttgggccac tgtcagcggc 4143181 ggcatccagt ctggcgcggg cagcctcgca cgtaaccgct ccatcagctc aggacgaacc 4143241 gccagcaccc caactccgcg gggcccggcg atccacttgc gcgacgaggc atacgtgacg 4143301 tcggcaccca ccgcacaatc cacgtggccc aggccctgcg cggcatccac gaccagcggc 4143361 agtttcagct cggtgcacag ttgcgccacc atcgccagcg gctgtgcgac gccacggtgg 4143421 ctggccacca cggtcaggtg cactaggtcg ggcgggtcgt cggccaacat gaaggccgcg 4143481 tcgtcgagcg ctaccctgcc gtcctgcaga gttggtaacg gacgcacgtc gaagccatgg 4143541 gcggccatca cagccaggtt cggcccgtat tcgccgggca agcaagccag cgtccggttc 4143601 tccccaggcc agctgcccag cagcagatcc aacgcgtgca gcgagccggt ggtgaacacc 4143661 acctcggcgt cgggcaggcc gctcagtgcg gcgaccgccg cacgtccggc gtcgagcacg 4143721 gcggcggcgg cctcagccgc gacataaccg ccaacctcgg cctcgtgccg cgcgtgctgg 4143781 gctgcggcgt cgagtgcggc gaaactctgg cgcgaacagg ccgcgctgtc caggtgtagc 4143841 cccgcgacgg gcgggcgcgc tgcccgccat cggtcggcca gcgaatcgcc ggcggggctg 4143901 tttgcgccgc ttctcctcat cgcttcgtcc tgcatcgtcg ccggcgcggc tcacttggcg 4143961 gccagcgaca ggccaaagtc accggcttca tcggtccacc atcggatgcg atgcagtccg 4144021 gccgcggcca actcggcacc gaccgcttgc ggccggaact tgcacgagac ctcggtcaac 4144081 atctcctccc cggcgtcgaa gtcgacggtc aggtccagtg caccgacccg tacccgctgg 4144141 cgaccgtcgg cacgcaacca catctcaatc cgctcttctg cgctgttcca acgggcgacg 4144201 tgctggaagg catcgacgtc gaaatccgct tcgagttccc ggttgatcac ggcaagcacg 4144261 ttgcgattga actgagccgt caccccgcca ggatcgtcgt aggcgcgcac cagccgggcc 4144321 gcgtccttga ccaggtcggt gcccagcagc aggctatcgc ccggccgcat taccccggcc 4144381 agggccgtca ggaactgcgc gcgcggcccg ggcgtgaggt tgccgatcgt ggaccccaag 4144441 aacacaaaca ggcgccgtcc tcccctggga atctcggtta aatgctcctc gaaatcccca 4144501 caaacagcgt tgatttcgac accactgtat tcacgctgaa ttgcggtcgc agttgccgac 4144561 agcacgctgg cgtcgacgtc gaacgggacg aatctgcgca gcgatccccg gtggcgcaac 4144621 gcatccagca gcatccgggt cttctccgag gtgccgctac ccaactcgac caaagtatcg 4144681 gcccggcagg cggaagccac ttcggccgat ctggcccgta ggatttcggc ctcggctcgg 4144741 gtcgggtagt actccggcaa ccgggtgatc tgatcgaaca gttcactacc caccgtgtcg 4144801 taaaaccact tgggcggtaa cgatttcggt gtcttctgca ggccagagta cacatcgcgg 4144861 cgcaacgcca gatgccccgc atcctcgccc agatggttgg caaccgacac tctcatcgag 4144921 gtcctttcgc gcggtccaat gcggtcagcg tgacaccctt ttgggttacc tccaccaggt 4144981 ggcggtccgg cacgtcgccc caaccggagt cgtcgtcgta tggttcgctg gccagcacca 4145041 ccccgtcggc gcgccgcagg atggacagcg tgtctcccca ggtggtcgcg atgagccggg 4145101 aaccgttggc cgccaagatg tttagtcggg catttgggtc ggccgcgccg accttgacaa 4145161 tggtgtctcc cagagcgtcc agaccgtgag cgaagatggt ggccgcgagt atcgcgctgt 4145221 cacagaccga ttcggccgcc gggcccgccg gcaacacggc acgatcaacc acaccgttgt 4145281 gcgctagcaa ccagtgccca tcggtgaacg gcggggtcgc gctgacttcg atcggcatac 4145341 cgacagtcgc cgagcgcacc gcggcgagga tgcagtgact acgcagcgcc ggcgccaccg 4145401 agtgaaacga cgtgtccccc cacagcggag ccgggctgcg ccaacgccgg ggaatggcac 4145461 cgtcgaagaa gccgacaccc caaccgtcgg cgttcatcag cccgtgcttt tgccgacgcg 4145521 gcgcatatga ctgcacccgc agaccctgcg gcgggtccag caccaacgaa gaaaccgcga 4145581 cctgtgcccc gagccacccc aggtgacgac acatcagatg tcccacgcca accggacacc 4145641 ggcaaagatc tggcggcgat acgggtgatc ccagttgcgg aagctgggcc gcaggatggc 4145701 cggctccacc gcccacgagc cgccgcgtag cacgcgatag tcgccgccga agaacggctg 4145761 tgagtaccgc tcatagacca tcgggacgaa ccccggccag ggccgcaacg gcgaggtggt 4145821 ccactcccag acatcgccca gcatctgctc ggccccgcac gccgatgccc cggccgggta 4145881 ggcacccacc ggcgcggggc gcagcgtttg accgcccagg ttggcatagg tgtctgtggg 4145941 ctcctcggtt ccccacgggt agcggcggcg ggaaccagtc gccggatccc acgcgcaagc 4146001 cttctcccac tccacctcgg tgggcaaccg cgcgcctgcc caggcggcgt acgcctcggc 4146061 ctcaaagtag ctgacatgct gcaccggctc atcggcggga atgtcctcga cgtgcccgaa 4146121 ccgggtccgc gtccgcccgc ccgacctcca gaattgcgga gcggtcagcc ccgcgcgctg 4146181 gcggtgctgc cagccacgtt ccgaccacca ccgcgactgg gtgtaaccgc cgtcgtcgat 4146241 gaagtcttgc cattcaccgt tggtgaccgg aacccggccg atccggaatg cgggcacgtc 4146301 gacgacgtga gccggacgtt cgttgtccaa tgagcacggt tcgtccgcgg cgtccacgcc 4146361 cagcacgaac gggccgccgg ctaccagcac cgacgttccg gccatcctcg gccgtccggc 4146421 gggcagggcg gaagtcgcgg ccaacagtgg cgagccggtc cgtaggttca aggcctgcag 4146481 catggtttcg tcgtgctggt tttcgtggct gatcaccatc gcgaacacga agctgtcgcc 4146541 gtcttcaggt agagcggcaa gggcatccag cgcagcggag cgcaccgttg cgcagtagga 4146601 ccgcgcccgc gccggggaca gcaacggcag ttccacgcga ctggcgcggg aatgctcgaa 4146661 ggcgtcgtag agaccctcga ccgccggcgg caaaagcccg ggctggcctg ggtcgccgcc 4146721 gcgtagcagc cacaactcct cctgctgacc gatgtgtgcc aggtcccaca ccagcgggct 4146781 catcaacggg tcatactggc agcaaagctc ggcatcgtcg aagtcgacca gccgcaacgt 4146841 ccgcgcccgc gcccgcgcca gatgacaagc cagctgctcg ggtgaagtca cgacgccccg 4146901 tgcatcatcc cggtgacggc tgacgcgatg ccgcccgcga tcacccggtc ggagaaatcg 4146961 tctgccgggc aaacacccct gtcgacgtgg tccaccaacc gctgcatcgc gccgatgagt 4147021 tcagtcggta cccgccgcgc ggcgatggcc aggcatctgt tggctgccaa gtagagccgc 4147081 cggtcggcca ggccgatccg ggccgcggtg tcccaggccg tggccaccgg ttcgaccgcg 4147141 tcgaccgcca aatctgccgc caccgggtcg tcgagcagcg tcaccaaggt gaacaccacc 4147201 gcgggccaca cctcgtcggg cacgctgtcg aggtagcgaa tttccagcca ttgccgagga 4147261 cgcaccggcg ggaacaacgt tgtcaggtgg taaaccaggt cggcgacggt agcgcggcga 4147321 ccgtccagca gcacccgacc gtcaacccag tcggtgaagg gcacgtagtc cgtcaccgca 4147381 cgggtgtctt gagtgtccgg gcttcgcacc atcatcaccg gcgccttcaa ggcatactta 4147441 gcccagtcga tgccggggtg gtcgccactg gcaccaagaa tggggccgca gcgcgcggag 4147501 tccatctggc cccacacccg ctgccgggtg gactgccagc cggaaaaccg gccgcccagc 4147561 atcggggagt tggcggcaat cgcgatcatc gtcggcccca aggcgtgcgc caggcggact 4147621 cgctcagccc atccttcctg cggtccggca tccagattga cctggatcgc ggctgtcgag 4147681 gtcatcatcg ccgcacccgg cactccgcta tggctggcgg cgaaaaactg ctccatggcc 4147741 cgatagcgtg cgcccggatt gacccgcacc ggcgaccgca gcgggtctgc acccaggaag 4147801 accaaaccca gcccggcatt ggcaagcgcc gaccgtagca ccgcctgatc gcgcgtcatg 4147861 gcaccgatgg ctgccagcac gccgtcggcg ggcggtccgg acagttcgac ggcaccaccg 4147921 ggctccacgc tgaccacgct gccgcccggc agcggactga gccattcgag aacctcggtg 4147981 atctcttccc agctgggccg gcgaaacgga tcggccgggt cgaagcagtg cgcctccatc 4148041 tccagaccga cgcgtcccaa cggaccatcg acgaggcagc cgtccgcgat gtattccgcg 4148101 gcggccgatg aatcggtgat ctcgacgtcg tccggggcag cgttatccag ctgcgaggcc 4148161 gcggcggtca tggcggcaag cgtcatatca cgatccctcc gggcccggcg catgcctaaa 4148221 acatgccctg cggaccgttg gttgcgcagc taccagaacg atagccgcca ccggtttatc 4148281 ctgccgaccg ccccgccgcg cgaatgaact ggccaactca gcccagtgtg ttctgcattg 4148341 ccccggccag cacgttgacc gcgggaccgg cgttgccgga ctgacagacc ttggcctgca 4148401 gcaatacgtt ttcgcgcagc ctggtttgaa caaagcatcg tcgatcggtg ccggcctctt 4148461 gtttggtcca ggcctcgtcg gtgccggtcg acggcccacc ggcgaacgac cacacctgcg 4148521 tcgtcccgtc gtccaggtgg atcgcggtgg tctgccccga gcagcccacg gttcggtcga 4148581 caacgcggtg aaacgcccgg tctgcggcat cgttgctggc gaatacccca accgcctgct 4148641 tgaccaggtg ggtttggtcg gtggcggacg tctgcgtggt agcgccgttg aacgacgcca 4148701 ggtcgggatc gtcgtacacc tcgggcagcc cgatgtccac ccagttgttg cacgccggta 4148761 gttcgaccca aaacgcctgg aacggcctgg tgaacaccgc ctcccacccc attggggcgc 4148821 cgacgatgtt gccgaccgac ccctttccga gcaccgcgta ggacacaacc ccgggctccg 4148881 acgggtgtgc gtcggcaaca ggtaccgcga accctgctat gacggctaga ccgatgctca 4148941 ccaccgcggc ggcgattcgc atggagctag accttggccg agatgtcgac gtcgatgttg 4149001 cccatgggtc acgatcatgc caacatggcc gaccaaacag aaggcccttt tcttgaacga 4149061 gtcaagaaaa ggggctggtg cgcccggccg ctaggggcgc gccggcgcgg tggtgggacc 4149121 cggaccagtc ggaccaacag ccgggccacc cgggcctcca gggaaaccgg ggcccatgcc 4149181 tgggggaggg ggtccgccgg ggaacccgaa cacccatccc atccctggac cgggggcaac 4149241 gggaccgacg gggcggaaca tgccgtggtg gtaatagcgg tggtaagggc atttcccctg 4149301 cccgagaacc aacgcgccag agaagaagat gactgcgacg gtgaaaacaa ttccggccac 4149361 gatcaccacc cacgctgcgg cccggtagag cctgggcggc ttttcctgct gcggcgatgg 4149421 cggcggtgaa gttgtggcgg cggacggcgg tggcgctgcg ggttggggtg tctcggtcat 4149481 cttttgaata tcgctcggcc ggcaaccacc gtaaagggtt gcggttacaa acgtgccatg 4149541 aactggtgca gcccggtgcc ttgggtggac accgggctgc tggttgtcgg ttacggagcg 4149601 ggtgtcgcgg gaggactgac ggacgacggc acctggccgg gcccgcccgg gcctgggccg 4149661 ggacgcactg ccgcgggccc accgtgcgga ctgcccggtc gcagcatcat cgccgggtgc 4149721 tggtggtgtt gccggtggtg gaagccgccg tgaccggcat gcttgccgag gatatagccg 4149781 gtgaagaaga tgaccgccac gatgaacacg gttccggcgg caatggctac ccacgccgcg 4149841 gctttgaaca ccttgggggt ctggtgaggc ggtggtgttg gggtttcaga tgtttcactc 4149901 atgtgtcgca tgatgccttg gcaaacagta acgcgactat gcgtccctta tgtagcagct 4149961 gtgagcgcgc gggctgggta tcggcccggg acaccaccat ggctgcgtct cggtgtcaga 4150021 gcaccagagc tacgggtctg accagggctt gaacgggttg accgcgaact gaatcacccg 4150081 gtacggccca ttctgccggg cccgagtgtc ccactggctg acgaagatcc gcagttcgtc 4150141 gatggtggac ccgggcgaga tgtagccgcc atacggttgt gcgagtcgat tgtcgtaggg 4150201 cggtggcagg ctttccgccg gctccggcca ctcgtcgtgg cgcaccaccg tggtcaccgg 4150261 ggcggcgccc agcgacgtcg ggtggtgtgc cacccgaacc tccatgttgc cggtgctggc 4150321 gttgaaatac gacagcaccg tctggccgtc gatctgacgg atgctcatct cgccgagctg 4150381 gtcgggccag agcggagtcg gcggcttgtt ccaaccgccg tcggggccgc ccgcccagcc 4150441 ctgccagcgg gaccggtcgg tgaacgattc cggggtggcc cgatacagca ccgccggctc 4150501 cccacgggtg aagctgtcgg ccacgatgta gacccaccca gttggcgaat cgggcgtggg 4150561 aaccgggtcg tagtatccgc tgatctgtgt ctgccggccg tcctggtagg cggcgttgcg 4150621 cctggacccc gacacggtct gccagccgcc gcgcgccgcc tcggcccgca ccaggcggga 4150681 attctgcggc tgcaggtcct tggtggtggt caccatcagg tagttgcggc ggttgatctg 4150741 caccacaccg gcgggcagct gtgagtctcc aggcggcgtg ggatcggcca gcagcggcgt 4150801 gccgacgccg gtgacaccgg tgtagcgcac cccggccgga tcgtcgatcg actcggtgtc 4150861 gacgtgcagc gcgaccggcg cataccagcc accgaacccg acaccctgac cggcgaagct 4150921 gtccccgcac acctgcagca gttgactggg gaattccacg aactcgcaca ggtcggtggc 4150981 accgatgccg tagtccccgg tgggggttcc ggtaccggcc gtcggaccga ttcgcagcac 4151041 ttgaccgggc gccagcggcg gcaggatggg ccggggcgcc ggcgccggcg gatcggcgcg 4151101 tgcataccaa acacattgcg ggacaaggaa agacactacc agcgagcacc gcacgaccca 4151161 ggcggagcac acccgcatat cacaagtcgg cggtcagcag ctcggcgatc tggatggtgt 4151221 tcagcgccgc ccccttgcgc aggttatccc ccgacacgaa cagcgccaga ccacgcccgt 4151281 cgggcacccc cgggtcgcgc cggatccggc cgaccagaga ttcgtcgaca ccggcggcgg 4151341 ccagcggcgt cggcacgtcg accagctgca cgcccgtagc accgtcgagc agctcgcgcg 4151401 cccgctccgg cgagagcggc tgcgcgaact cggcgttgat cgacaaagag tgtccggtga 4151461 acaccggaac ccgcacacag gtgccgctga ccaacaggtc ggggatgcca aggatcttgc 4151521 ggctctcgaa gcgcaacttt tgatcctcgt ctgtctcgcc ggagccgtcg tccaccaggg 4151581 atccggccag cggcaccacg ttgaacgcga tcggggcgac gtaggtgttc ggcggcggga 4151641 actcgagcgc gccgccgtca tacaccagct gctcggcccc accgatgacc gcacgcgcct 4151701 gctcggccag ctcggccacc ccggccaggc cgctaccgga caccgcctga tacgacgaga 4151761 ccaccaaccg caccagtcgg gcttcgtcgt gcagcacctt gagcaccggc atcgcggcca 4151821 tggtggtgca gttcgggttg gcgatgatgc ccttaggccg gcggtgcgcg tcgcgttcaa 4151881 agttcacctc ggacaccacc aacggcacgt cggggtcctt acgccacgcc gacgagttgt 4151941 cgatcaccgt gactccggcc gccgcaaagc ggggcgcctg caccttcgac atggccgagc 4152001 cggcggagaa caacgcgata tccagcccgc tcgggtcggc cgtctcggcg tcttccactt 4152061 cgatctcctg gccgcggaag gccagcttgc ggccctgcga tcgggccgac gcgaagaacc 4152121 gcaccgcgct cgccgggaaa tcccgctcgt cgagcaacgt gcgcatgacc tgacccacct 4152181 gaccggtggc ccccacgatc cctattgaca ggcccatcta ccgtcccgtc cccgcgtaca 4152241 ccgtggcctc ctcgtcgccg ccgagcccga acgcttcatg cagcgcgacc acggccttgt 4152301 ccagttcggt gtcgcggcac aacaccgaga tcctgatctc cgaggtggag atcagctcga 4152361 tgttgacccc caccgccgcc agcgcctcac agaacgtcgc ggtgaccccg gggtggctgc 4152421 gcatgccggc accgatcagc gataccttgc cgatgtggtc gtcgtacagc agctgtgaga 4152481 agccgatctc gtttctgagc gagtccagtt tttccacggc ggcgggcccg acgtcgcggg 4152541 agcaggtgaa ggtgatgtcg gtcttgccgt cctcgacctt ggagacgttc tgcagcacca 4152601 tgtcgatgtt gacgtcggcg tcggccaccg ccctaaacac cttggccgca tacccgggga 4152661 tgtcgggcag cccgacgatg gtcaccttgg cctcgctgcg gtcgtgcgcg actccggtca 4152721 ggatggggtc ttccatgggt acgtccttga tcgatccgac aacgacggtg cccggtctgt 4152781 ccgagtacga cgaccggacg tgcaccggaa tattatggcg gcgagcgtat tccacgcagc 4152841 gcagcatcag caccttggcg ccgcaggccg ccatctcgag catttcctcg aaggtcacgg 4152901 tgtcgagctt tcgggcgttg cgcacgatgc gcgggtcggc gctgaagatg ccgtccacgt 4152961 cggtgtagat ctcacagaca tcggcaccca gcgcggcggc catggcgacg gcggtggtgt 4153021 ccgagccgcc gcggcccaac gtcgtgacat ccttggtgtc ctggctgacc ccttggaatc 4153081 cggccaccaa aacgacccgc ccctcctcaa gggcggtttg cagccgcccc ggcgtgacgt 4153141 cgatgatctt ggcgttgccg tgggtgccgg tggtgatcac cccggcctgc gaaccggtga 4153201 acgaccgggc atgcgcgccg agcgactcga tggccatggc caccaacgca ttcgagatgc 4153261 gttcaccggc ggtaagcagc atgtccagct cccgaggcgg cggcgccggg cacacctgct 4153321 gagccagatc cagcaggtcg tcggtggtat cccccatggc agagacgacg acgacgacgt 4153381 cattgccttg cttcttggtg gcgacgatgc gttcggcgac gcggcgaatc cgttcggcgt 4153441 cggccaccga ggatccgccg tacttctgca cgacgagcgc cactgtttcc ctttccgggg 4153501 aagattggag acaggtccag aatagggggc gcgccggcct gcgctgactc tgcgtccacc 4153561 acgggaatgt gcgagtagcc cacacggtgg acgcagagtc aacgtgtaaa gtgcttcatg 4153621 tgcagcgggt gctcctcctc ggacgccgcg acggggtctg atccagaccg gcttcccgtc 4153681 gcgggacgtt cgcgatgcgc cggtctgagg ttccttctca ccatcccgga gcaactaccg 4153741 tgacaacttc tgaatcgccc gacgcctata ccgagtcgtt tggggcccac accatcgtga 4153801 aacccgccgg cccacctcgc gtcggtcagc cctcgtggaa tccgcagcga gcctcgtcga 4153861 tgccggtcaa ccgctaccgg ccgttcgccg aggaggtcga gcccatccgg ctgagaaacc 4153921 gcacgtggcc tgatcgcgtc atcgatcgtg cgccgctgtg gtgcgcggtc gacttacgcg 4153981 atggcaacca ggcgctgatc gacccgatga gcccggcccg caagcgccgc atgttcgacc 4154041 tgctggtccg gatgggctac aaggagattg aggtggggtt cccctcggcc agccagaccg 4154101 acttcgactt cgtcagagag atcatcgagc agggcgccat tcccgacgac gtcaccatcc 4154161 aggtgctcac ccaatgccgt cccgagctga tcgagcgcac cttccaggcg tgttcgggcg 4154221 caccccgggc catcgtgcac ttctacaact cgacgtcaat cctgcagcgc cgcgtggtct 4154281 ttcgcgccaa ccgggctgag gtgcaggcca tcgcgacaga tggggcgcgc aagtgcgtcg 4154341 agcaggccgc caaatacccg ggcacgcagt ggcgattcga gtactccccg gagtcctaca 4154401 ccggcaccga actggaatac gccaaacagg tgtgcgacgc cgtcggcgag gtcattgcgc 4154461 cgacgccgga gcgcccgatc atcttcaacc tgcccgccac ggtggagatg acgacgccca 4154521 atgtctacgc cgactcgatc gagtggatga gccgcaacct agccaaccgg gagtcggtca 4154581 tcctgagcct gcacccgcac aatgaccgcg gaaccgccgt cgccgcagcg gaattgggtt 4154641 tcgcggccgg ggctgatcgg atcgagggct gcctgttcgg caacggcgag cgcaccggca 4154701 acgtgtgcct ggtcacgctg ggactcaacc tgttctcccg aggtgtggac ccgcagatcg 4154761 acttctccaa tattgacgag atccggcgca cggtggagta ctgcaaccag ctgccggtgc 4154821 acgaacgtca cccctatggc ggcgacctgg tgtacaccgc gttctccggt agccaccagg 4154881 acgccatcaa caagggccta gacgcgatga agctggatgc ggatgccgcc gactgtgacg 4154941 tcgacgacat gctgtggcag gtgccgtatc tgcccatcga cccgcgcgat gtcgggcgca 4155001 cctacgaggc ggtgatccgg gtcaactcgc agtccggcaa gggcggcgtg gcctacatca 4155061 tgaagaccga ccacggcctt tccctgccgc ggcggctgca gatcgagttt tcccaggtaa 4155121 tccagaagat cgcagagggt acagcaggcg agggtggcga ggtctcgccc aaggagatgt 4155181 gggatgcgtt cgccgaggag tatctggccc cggtgcggcc tttggagcgg ataaggcaac 4155241 atgtggacgc tgccgacgac gacggcggca cgaccagcat cacggcgacc gtcaagatca 4155301 acggcgtgga gaccgagatc agcgggtccg gtaacggtcc gttggccgcg ttcgtccatg 4155361 cgctggccga tgtcgggttt gacgtggccg tgctggacta ctacgagcac gcgatgagcg 4155421 ccggcgacga cgctcaggcc gccgcgtatg tggaggcctc cgtgacgatc gcgagcccgg 4155481 cgcagccggg cgaagcgggt cggcacgcat cggaccccgt gacgatcgcg agcccggcgc 4155541 agccgggcga agcgggtcgg cacgcatcgg accccgtgac gagtaagacg gtgtggggtg 4155601 tcggtatcgc accgtcaatc accaccgcgt cgctgcgcgc cgtggtgtcg gcggtcaacc 4155661 gggcggcacg ctaggacggc gctgaactag ggtcggggtc cgcggcatga tttttcgcag 4155721 tgacgttccg ctcgccgttt cagaacaacg ctaactgctt ttcgacggga gcgacgtcgg 4155781 tgaagtcctc cacgctggcg cccccgacga cggcaccgat gcactccatg aatcgcgctt 4155841 caggcatcac cggaaccccc agctgcaggg cgtgatagcc cttgccgtgt tcgggggcgg 4155901 tcgcgttgca gaccaccagt gaggtatccc ggtctacgac gtcgctgtag gccagcccgg 4155961 cgtgcagaat ccgttcgacg agttcctcgt gggtccgttt tacctcggcc gccagcccca 4156021 cccgcatgcc ctggaccagc gggcggccct ggacataccg gcccgggttg aggtaggggc 4156081 aggccatccg ggctgccacg gccttcagcg gtcgcagctc gtcgtgagtc acccggccgt 4156141 tgggccaccg gcgccgtgtc accgggtgca ccggcagcca gacgtcgagt tcgcgcgcac 4156201 tctctagggc agctgccagt atcccggtca atacccggac gtcgtcgaat gcatcgtgcg 4156261 gccgttgctg gggcacaccc caatgcgcgg caagtgtctc cagccgcaga ttgtcgacgc 4156321 caagctgcag ccggcgggcc agctcgaccg tgcacatgac gaagtcaacc gggagttcgg 4156381 cctcggcgat ctcggcctcc gcagcgagaa acgcatagtc gaacgcgaca ttgtgcgcga 4156441 ccagagtgcg cccgcgcagc acgtcgacaa cctcaccggc gatatcggcg aactgtggct 4156501 ggccatcgag catggcggcg gtcaggccgt gcacgtgggt ggggcccggg tccaccttgg 4156561 gatttagcag gctgaccacg gattgctcta gtcggccggc ggcgtccagg ccgagcaccg 4156621 caaggctgat gatccgggcc tggcccggcc gaaagcccga ggtctcgacg tcgatgacgg 4156681 cccaaccccg atcctggtgg ctggctggcc gtccccaggt gtggctcaca agacgaggat 4156741 gacacgtccg agcgacatca cctggtcgct acgcatcgtg tcggcccgta aaacccggac 4156801 gcgggcgacc cgccgcaccc ggcgacaagc gccgagcttg cgatcgccct gaatccaacg 4156861 cgggcgaccc gccgcacccg gcgacaagcg ccgagcttgc gatcgccctg aatccaacgc 4156921 gggcgacccg ccgcacccgg cgacaagcgc cgagcttgcg atcgcccgta aactgcccgg 4156981 gtggtaacca cccgggcacg cctggcccta gccgccggcg cgggcgcacg ctgggcgtcg 4157041 cgggtcaccg gtcgcggcgc cggagcgatg atcggcggtc tggtcgccat gaccctggac 4157101 cgctcgatcc tgcgccaact cgggatgggc cggcgcaccg tcgtcgtcac cggcaccaac 4157161 ggcaagtcga ccaccacacg gatgaccgcg gccgcgctgg gcacgttggg agccgtggcc 4157221 accaacgccg agggcgccaa catggacgcc ggcctggtgg ccgcgctcgc cgctcaccgc 4157281 gacgccgagc tggcggtgct ggaagtcgac gagatgcacg taccgcacat ctccgatgcc 4157341 gtcgatcccg ccgtcgtcgt cttgctcaac ctctcccgag accagctgga ccgggtcggc 4157401 gagatcaacg tcatcgaacg cacactgcgg gccgggctgg cccggcaccc cgacgctgtc 4157461 gtggtcgcca actgcgacga cgtgctgatg acctcggccg cctacgacag ccccaacgtc 4157521 gtttgggtgg ctgccggcgg cgcgtggtca aacgattcgg tcagctgccc gcgcagcggc 4157581 gaggtcatcg ttcgcaaggc cccctctcag gaagaccact ggtactccac cggcgccgac 4157641 ttcaagcggc ccgccccgca ctggtggttc gacgacgcca cgctgtatgg gcccgacggg 4157701 ctggcgctgc cgatgcggct ggcactgcca ggctcggtga atcgcggcaa cgccgcccaa 4157761 gccgtggccg ccgcagtcgc cctcggcgcc gatccggctg tggccgtcgc cgccgtctgc 4157821 caggtcgacg aggtcgccgg acgctaccgg accgttcgta tcggcgcgca ccaagcccgg 4157881 atcctgctgg ccaaaaaccc ggccggctgg caggaagcgc tggcgatggt cgacaagcat 4157941 gcagacgggg tggtcatcgc ggtcaacggg cgggttcctg acggcgagga cctgtcctgg 4158001 ttgtgggacg tgcgcttcga gcacttcgag aagacccgag tggtagccgc tggggagcgc 4158061 ggcaccgatt tggcggttcg cctcggatat gcaggcgtcg agcacaccct ggtgcacgac 4158121 accgtggccg ccatcgcctc atgcccaccc gggcgggtgg aggtcgtcgc caactacacc 4158181 gcgttcctgc agctgcaacg agcattggcg cgtcgtggct gattctgtgg tgcggatcgg 4158241 gctcgtgctg cccgacgtga tgggcaccta cggcgacggc ggcaacgccg tggtgctacg 4158301 acagcggctg ctgctgcgcg gcatcgccgc cgagatcgtc gagatcacgc tggccgatcc 4158361 agtgccggat tcgctggacc tctacacgct gggcggagcg gaggactacg cgcagcggct 4158421 ggccacccgg cacctacgtc gatatccggg cctgcaacgc gcggcgggcc ggggtgctcc 4158481 agtattggcg atctgcgcgg ccatccaggt gcttgggcac tggtacgaga cgtcgtcggg 4158541 agaccgggtc gacggcgtgg ggttgctgga tgtgaccacg tcaccgcagg atgcgcgcac 4158601 catcggcgag ttggtcagca agccgttgct ggccggtttg acccaaccct tgaccggttt 4158661 tgagaaccac cgcggcggca ccgtcctcgg gcccggaacg tcgcccttgg gcgcggtggt 4158721 caagggagcc ggcaaccggg ccggcgacgg ttttgatggc gcggttgcgg gcagcgtggt 4158781 cgcgacctac atgcacgggc cgtgcctggc ccgcaacccg gagcttgccg acctgctgct 4158841 gagcaaggtg gttggtgagc tggcgccgct ggatttgccc gaggtggacc tgctgcgccg 4158901 cgaacggcta tccgcgcgtt aggtggggcg ttagggccgc catcccctgg ccagcagagc 4158961 ggcacgcacg cggttcacca cgtcgtcggg gttgtcctcg gcgatcacgc gaatgacgat 4159021 ccagcccaac tcggccagct tgcggagccg ccgctggtct ttcacgtagc gaccgcggtc 4159081 gctgcgatgc tgatcaccgt cgtactcggc ggccaccatg tatttctccc agcccatgtc 4159141 gagcacgcca acgttgcgcc agcggtggac caccggaatt tgcgtcgtgg ggactggcag 4159201 gccggcgtcg atcaacaaca gccgcagcca ggtctccttg ggcgacgcgg cgccgccatc 4159261 aacaaggggc agcacgtcac gcaaccggcg gacacctcgg gcgcccgcgt gacgcttggc 4159321 caatagaagc acgtcgtcgc gggaaaacgg ggtggcacgc atgagggcat cgagacgagc 4159381 cacggcttcg ccgcgggaca gatggcggcc gaggtcgtat gccgtccgcg ccagtgtggt 4159441 gaccggcagg cccaccaccc tggtgatctc gtcgtcgcac aaggtctcac gacgtatgac 4159501 aagaccgtgc tgcgggcggg tagtgggaga aatcagctcg atggccacgt cgacgtccac 4159561 ccactgagca ccatgcagcg cagaggccgc attaccagct atgacgccat ggcgcctcgt 4159621 ggctagccag gcgccaaccg tgcgatccca aagtgtgggc actgagcgcc tcgagacgta 4159681 cacaccgcgg aacatcggct gataccaacg ttgcagctcg tgcctggtca ggcgaccagc 4159741 ggtgatggcc tcgctgccga tgaagacgtc acccatgacg gacatgctgg cactccgcac 4159801 cgacatccgt gagatcaaca ttttgcaggc aaggtgcgag tagcggcctg cagaacgttg 4159861 atctcggcga aagtcggatg tcggcgaatc aggcgagcac gcggcggccg gcgagcgctc 4159921 ggcccagggt gagctcgtcg gcgaattcca ggtcaccgcc catcggcagc ccggacgcga 4159981 tccgtgtgac ggtcaggccg gggatgtcgc gcagcattcg caccaggtag gtggccgtcg 4160041 cctcgccctc ggtgttgggg tcggtggcga tgatgacctc ggtgacgtcg acgtcgtcga 4160101 cccgttcccc gatgcggctc agcagttcgc ggatccgcag ctgatccggc ccaattccgg 4160161 acagcgggtc aagcgccccg cccaggacgt gatagcgacc ccggaactcg cgggtgcgct 4160221 cgacggcctg gatgtctttg ggttcctcga caatgcacac cacggacgca tcgcgacgga 4160281 tatcagagca gattctgcaa cgctcgttgt cagagacatt cccacacacc gcgcagaatc 4160341 gcacgccgtc ccgaaccttc gccagcacac cggtcagccg gtcgatgtcc gacggttcta 4160401 ccgacaacag gtggaaggcg attcgctgcg cactcttggg tccgatcccc ggcaacttgc 4160461 cgagttcgtc aatcaggtcc tggacgggtc cctcaaacat gtcggtgcag gtcagatccc 4160521 tggtacaggt ggtgcgcccg gcgcacccgg catacccggc atacccggca tacctggcgc 4160581 tcccggcggc gcagccggtg gtgccggcgg gcgcatcgcg ccggccaatg cacccagccg 4160641 ttcctgcgcc atcttcgtca cctgctggga cgcgtcgcgc atcgcaccga cgatcaggtc 4160701 ctgcaaggtc tcgatgtcgt cgggatcgac gaccttgggg tcgatcgtca cgccgatcac 4160761 ctccccgctg cctttgacga cgaccttgac caggccccca ccggcttgac cgtgcacctc 4160821 agagttcgcc agctgttgct gggcctccag gagcttttgc tgcatctgct gcgcctgagc 4160881 gagcagcgcc gacatgtcgc ctccgggttg catgacagtc ccctagcatc ttggtctcga 4160941 gttggtttcg cctgtggttg tcgggcgatt cggaacattc agcctagacc gcgccgcgtt 4161001 acctttgcgc cgtggaccta cgagttggcc cgcgtgtcgg gttcgccatg atagtcgggg 4161061 tactcgtcgc agcagcgacg ccgatcatct cgtccgcgag cgcaaccccc gccaacatcg 4161121 ccggcatggt cgtcttcatc gaccccggac acaacggagc caacgacgca tcgatcggcc 4161181 gccaggtacc caccggtcgc ggcggcacca agaactgcca ggccagcgga acgtcaacca 4161241 acagcggcta cccggagcac accttcacct gggaaaccgg gctgcggctg cgggccgcgt 4161301 tgaacgcatt gggggttcgg accgccctgt cacgtggcaa cgacaacgcg ctcggaccgt 4161361 gtgtcgatga gcgcgccaat atggccaacg cgttgcgccc caacgcgatc gtgagcctgc 4161421 acgccgacgg cggaccggcg tctggccgcg gattccacgt caactactcg gccccgccgc 4161481 tcaacgcgat acaggccggt ccctcggttc agttcgctcg aatcatgcgc gaccagctgc 4161541 aggcctcggg cattccgaag gcgaactaca tcggccagga cggcctgtac ggacgttcgg 4161601 acttggccgg cctgaaccta gcccaatatc cgtcgatcct ggtcgagttg ggcaacatga 4161661 agaaccccgc ggactcggcg ctgatggagt ccgccgaggg caggcaaaaa tacgccaacg 4161721 ccctggttcg cggcgtcgcc ggcttcctgg ccacccaggg ccaggcgcgt tagccccgca 4161781 cacaggcggc acccccaccg cgcccgcatc gtcgtcaggc gtcaccctcg agttcggtct 4161841 tgaggttgga cagcacctcg gcctggatct tcttcagccc tagcggcgca aaggtcttct 4161901 cgaagaaacc cttgaccccg cccgcgccgg tccaggtggt cttcaccgtg acgctggaac 4161961 cgggtccggc gggagcgacc gtccagttgg tgaccatgga cgaattcatg tccttctcga 4162021 tgacggtgtg cccggcaacg tccacgttca cctgcacatc gcgaacacgc gactgcgtcg 4162081 cctgcagccg ccacttggcg actgtgcccc gccccttgcc gccctcgagc acctggtact 4162141 cgctgtagtg cggggacagg attttaggac ggacggtctc atagtcggcc agcgcgtcga 4162201 gtgtggccgt gggctcagca ttgatcaaga tcgtgctggc tgcgctcacc tgtcccatca 4162261 gggccggact ccttcgtttg tgattgctgc accgcccgca cccggatgca ggggcagttg 4162321 tcgaggacta gggtatatac ggtgcctgtc cctggatctg cacagtcggc ttacgcctgc 4162381 ggcgtcgagc ggttgctggc gagctatcga tccatccccg cgactgcatc catccggctt 4162441 gccaagccca cctcaaatct gttccgcgcc cgcgtcaaac acgatgcacg cggcctggac 4162501 gcatcgggac tgaccggtgt catcggtatc gatcccgagg cccgcaccgc cgacgtggcc 4162561 ggcatgtgca catacgagga cctaatcgcc gcgacactgc actacggtct gtcaccattg 4162621 gtggttccgc agctgaggac gatcacattg ggcggagcgg tcaccggctt gggtatcgag 4162681 tcggcgtcgt tccgcaacgg cctgccccac gagtcggtgc tggagatgga tatcctcacc 4162741 ggcgcaggag aacttctcac cgtctcgccc ggacagcact ccgacttgta ccgtgcattc 4162801 cctaactcgt atgggacact gggctattca acccggcttc gaatccagct ggagccggtc 4162861 cggccgtttg tcgcgctgcg gcacatccga tttagctcgt tgacggcgat ggtggccgca 4162921 atggagcgca tcatcgacac cggcggactg gacggcgaat cggtggacta tctcgacggg 4162981 gtggttttca gcgctgacga aagctacctg tgcatcggca tgcagacgag cgtaccgggc 4163041 ccggtcagcg actacaccgg acaagacatc tactaccggt cgatccaaca cgaggcgggg 4163101 atcaaggaag accggttgac catccacgat tacttctggc gctgggacac cgattggttc 4163161 tggtgctcac gatcgtttgg tgcccaaaac ccgcggctgc gccgctggtg gccgcggcgc 4163221 taccggcgta gcagtgtcta ctggaggttg atggcgctcg atcagcgctt cgggatcgcc 4163281 gaccggttcg agaacagcag gggtcgtccc gcgcgtgaac gggtggtgca ggatatcgaa 4163341 gtgccgatcg aacggacctg cgagtttctg gagtggttcg gggaaaacgt gcccatttcg 4163401 ccaatctggt tgtgcccgtt gcggctacgc gatcacgccg gctggccgct gtacccgatc 4163461 cggcctgacc gtagctatgt caacatcggg ttctggtcgt cggtgccggt tggcgccacc 4163521 gagggcgcca ccaaccgcaa gatcgagaac aaggtgagtg cgctcgacgg gcacaagtcg 4163581 ctctactccg actccttcta tacccgcgag gagttcgacg agctctacgg cggcgagact 4163641 tacaacactg tgaagaaagc ctacgatccc gattcgcgtc tcctcgatct ttacgcaaag 4163701 gcggtgcaac gacgatgaca acgggcagac tcagcatggc cgagatcctg gagatcttca 4163761 ccgcgaccgg gcaacacccg ctgaagttca ccgcgtatga cggcagcacc gcgggacaag 4163821 acgacgccac actgggcctg gatcttcgga cgccccgcgg cgccacctac ttagctaccg 4163881 ctcccggcga actcggcctg gcccgcgctt atgtgtcggg tgacctacag gcacacggag 4163941 tacatcccgg cgatccgtac gaactgctca aaacgctgac cgaaagggtc gacttcaaac 4164001 ggccgtcggc gcgggtgctg gctaatgtgg tgcgctcgat cggcgttgag cacatactgc 4164061 ccatcgcgcc gccaccccag gaggcgcgac cccggtggcg tcgaatggct aatggcttgc 4164121 tgcacagcaa gacccgtgac gccgaggcta tccatcacca ctacgacgtc tccaacaact 4164181 tctacgagtg ggtgctcggg ccatcgatga cctacacgtg cgcggtgttt ccgaacgctg 4164241 aggcttcgct ggagcaggcc caagagaaca aataccgact cattttcgaa aagctacggc 4164301 tagagccggg tgaccggcta ctcgacgtcg gctgcggctg gggcggcatg gtgcgctacg 4164361 ccgcccgacg cggtgtccgg gtgatcggcg ccacgctctc ggccgagcag gccaagtggg 4164421 gccagaaagc agtcgaggac gagggattga gcgacctcgc gcaggtgcgg cattccgact 4164481 accgcgacgt agccgagacc ggtttcgacg ccgtttcttc gatcgggcta accgagcaca 4164541 tcggcgtcaa gaattacccg ttctacttcg ggtttctcaa gtcgaagttg cgcaccggcg 4164601 gcttgctgct caatcactgc atcacccgcc acgacaacag gtcgacgtcc tttgccggcg 4164661 ggttcaccga ccgttacgtt ttccccgacg gggagctgac gggctcggga cgtattacca 4164721 ccgagatcca gcaggtcggc ttggaagtgc tgcacgagga gaacttccgc catcactacg 4164781 cgatgacgct gcgcgactgg tgcggcaacc tcgtcgaaca ctgggacgac gcggtcgccg 4164841 aggtcggtct gccgaccgcc aaggtgtggg gcctgtacat ggcggcttcg cgggtggcct 4164901 tcgaacgaaa caacctgcag ctacatcacg tattggcgac caaggtggac ccccggggcg 4164961 acgacagctt gccactgcgg ccctggtggc agccctaggc gttgtctatc cggcgcgcgc 4165021 ccagctcgtt ctgcagcagc tcgagtgcaa cctcttccgg gtcgcgacgc ggcgacgggt 4165081 cgccacggcc ggcttcggcg agcatgtgct cctcttcgtc gcgctgagtg gaattcgctg 4165141 tgggggcagg gtttacggcc ttggcggtcg ccacgttcgc tcccccgccg acgggtgatg 4165201 ccgccgcagc cggttcaccg gtctcacacc gcacccgcca gttgactccc agcgcgtctt 4165261 taagcgcctc ggcgaggaca tcggcgttgc gctgttcgga cagccgccgc gccagcggcg 4165321 ccgattcgtg ggtcagcacc agcgtgttgt cctctagcgc acggacggtg gcacccgcca 4165381 gcatcacctc ggtggtacgg ctgcgcaggc gcaccttgtc gcgcaccgtc ggccacatgg 4165441 accgaaccgc ggccacggtg ggttcgctcg aggccggtgt gggggccagc accggtctcg 4165501 gttcacgcgc gggctggtgt ttcggctcgg cagccgcagc cgacgggcgt ggtacggctt 4165561 gcggcgccgg gatcgacatg tccaaccggg tctcgatccg ttcgacccgc tgcaacagtg 4165621 ccgattcggc gtcgctcgcc gagggcagca gcagtcgcgc gcaaaccact tccagcagca 4165681 gacgcggcgc ggtcgcaccg cgcatctcgc ctagcccggc ctgcaccacc tcggcatatc 4165741 gggtcagggt cgcccgcccg atccgggcgg cttgctcgcg catccgatcc agcgcgtctt 4165801 cgggcgcatc caccaccccg cgagatgccg cgtcgggaac cgattgcagc acaatcaggt 4165861 cgcggaatcg ctccagcaga tcggtagcga aacgccgagg gtcatgtccg ccatcgatca 4165921 ccgattcgat cgccccgaac aatgcggccg catcgcaagc ggccagtgcg tcgaccgcgt 4165981 cgtcgatcag ggcgacgtcg gtgacaccca gcagccccag cgcccgggtg taggtcacgt 4166041 gggtgtccgc ggccccagcc agcaattggt ccagcaccga gagcgtatcc cgtggggaac 4166101 ctccgccggc ccggatcacc aacgggtaca ccgcatcgtc gacgacgacg ccctcctgct 4166161 cgcagatccg cgcgagcaac gcccgcatag tgcgcggcgg cagcagccgg aacgggtagt 4166221 gatgagtgcg cgaccgaatc gtcggcagta ccttctccgg ttcggtggtg gcgaatatga 4166281 agatcaggtg ttcgggcggt tcctccacga tcttgagcag cgcgttgaat cccgcggtgg 4166341 tcaccatgtg cgcctcgtcg acgataaata cccggtaccg tgactggacc ggcgcataga 4166401 acgcgcggtc ccgcagctcg cgggtgtcgt ccacgccgcc gtggctggcg gcatccagct 4166461 ctaccacgtc gatgctgccg ggggcgttgg gcgccaacga aacgcaggat tcgcagaccc 4166521 cgcacgggtt ggcggtaggg ccctgcgcac agttcaacga ccgcgccagg atacgcgctg 4166581 acgacgtctt tccgcagcca cgcggcccag agaacaggta cgcgtggttg atccggccgg 4166641 catccagcgc caccgacagc ggcgcggtga cgtgctcctg ccccaccacc tccgcgaagc 4166701 ttgccggtcg gtacttgcgg tagagagcca cgtcagcagg ctaccgaccc taggcgacga 4166761 gtgtgttcgc agcgtcgaat gtgaacgttc ggcgtgattt cggcgcgcgg gttcccgctc 4166821 tcagcgcacg ttcggcgccg aggaggctag tccctggtta agcaatgtct cggtcgccgc 4166881 cagcagcgcg caggtcgcca acccgtcaac cgcgttgcgc aggtccggta ccgacggaaa 4166941 cgacggcgcg atccggatgt tcttgtcgtc cggatccttt cgatacggga acgacgcccc 4167001 cgcctcggtc accgcgatac caacgtcctt agccaaggct acggtccggc gcgcggtccc 4167061 gggcaacacg tcgaggctga tgaagtagcc acccttgggc tcggtccagg aggcgatctt 4167121 ggactcgctt agccgctgat ccagaacttc ggccaccaac gcgaatttcg gcgccagtat 4167181 ctgctggtga cgcaacatgt gtagacgtac cccatcggcg tcgccgaaga agcgtagatg 4167241 ccgcagctgg ttgaccttgt ccgggccgat cgacttcttc ccggcgtact gcagatacca 4167301 ggcgatgttg cctaacgatc caccgaagaa gctgacaccg ccgccggcga aggtgatctt 4167361 cgaggtggac gcgaagacgt aggggcggtt ggggttgccg gccttggcgg ccagcccgag 4167421 cacgtcgacc tggcgcggga aatccagcgt cagggtatgc accgcatacg cgttgtccca 4167481 gaacaagcgg aagtcaggtg ccgccgtccg catctggacg agtcggcgaa ccgtttccca 4167541 ggaataggtg acgcccgaag ggttgccgaa gaccggtacc gtccacatcc ccttgatggc 4167601 tgggtcgacg gcaaccagtt cttcgatcag atcgacgtcg ggcccatcct gcagcatggg 4167661 tatcgggatc atctcgatgc ccatggtctc ggtgatggca aagtgccggt catagccggg 4167721 gaccgggcac aggaatttga tgccgtcctg ctcctgaatc caaggccgcg gcgagtccac 4167781 gccgccatac aacatggaga aggcgacgat gtcgtgcatc aattccaggc tggagttgtt 4167841 gcccgcgatc aggttgggca ctgcgatgcc gagcagttcg gcgaagatag cccgcaggcc 4167901 cggcaggccg tgctggccac catagttgcg ggtgtcggtg ccctccgggt cgcggtagtc 4167961 gtctccgggc aagctcagca gctggttcga caggtcgagc tgctctgcgg atggtttgcc 4168021 gcgggtgaga tccagagcca gcttcatgcc ctgaagcgcc gcataatcct gctgatggcg 4168081 tgcgtgtagt gccgctagct cttgggggct aagagagtcg aacgacaccg tgggcccttt 4168141 cgccgagtcg aaaaccgtgg gtataccgag gtccagtcag tgccccggct gaaggggacc 4168201 ccgcgcaccc gacagagccc gttgaccctt gctgccttcc agccctgggg gagttcacag 4168261 gatagacgcc gcgcggggtc caccgtgagt ctaatacctg ggctggaacg cccgggacgg 4168321 actcagcggg ctaccatatg ctgcggagga ttcgcctagt ggcctatggc gctcgcctgg 4168381 aacgcgggtt gggttaacag ccctcgcggg ttcaaatccc gcatcctccg ccaggtggtc 4168441 cgcagcgcgg acgggaacgc ggacgggaac gcggacggga acaatgtggg ctggtcggct 4168501 tctcaccggc tcggttcacc agcctaagga ggggtatggg gcgcaaggtc gccgtgctgt 4168561 ggcacgcgtc gttttcgatt ggcgccggcg tcctctactt ctatttcgta ttgccccgtt 4168621 ggcctgagct gatgggtgac accggacact cgctggggac tgggctccgg attgccacgg 4168681 gcgcgttggt cggtctggcc gcactgccgg tggtattcac tttgctgcgc acccgcaagc 4168741 cggagctggg caccccgcag ctggcgctgt caatgcgaat ctggtcgatc atggctcacg 4168801 tgctggccgg cgcgctgatc gtcggcaccg cgattagcga ggtctggctc agcctggatg 4168861 ccgccgggca gtggttgttc gggatctacg gagctgccgc cgcgatcgcg gtgctcgggt 4168921 tcttcgggtt ctacctgtcg tttgtcgccg agctgccgcc gccaccgccg aagccgctca 4168981 agccgaagaa acccaagcag cgacgccttc gccgcaagaa gacggccaag ggcgacgagg 4169041 ctgagccgga agccgccgaa gaagccgaga acacggagct ggcggcgcag gaggacgagg 4169101 aggccgtcga agctcccccg gaaagcatag aaagcccggg aggtgaaccc gagtcggcga 4169161 cccgggaagc tccggcagca gagaccgcca ccgccgagga gccccggggc gggttacgga 4169221 atcgccgccc caccggcaaa acctcacatc gacgccggcg cactcgcagc ggtgtccagg 4169281 tcgccaaggt cgacgaatag ccgcggtcag gtgctgtagc ggcggctgtg aaccctgcga 4169341 cgcaatgtcg gcgtgtcacg ttgtcggatt cactgtcgcc ggctagcgct ttcccgtcag 4169401 aagacgagaa gcctccccga tctccaacta gcatcgagat cgggcttgcg aaggttgggt 4169461 tgcaaaatgg atgtcatcag atgggctcgc cggcttgcgg tggtggcggg cacagcagcg 4169521 gcagtgacca ctcctgggct actgagtgcg cacgttccga tggtctccgc cgaaccgtgt 4169581 cccgacgtcg aggtggtgtt tgcccgtggc accggggagc cacctggtat tggcagcgtc 4169641 ggaggactgt tcgtcgacgc actgcgtttc ccaggttggc gccaagtcac tcggggtcta 4169701 cgccgttaac taccccgcca gtaacgactt tgccagcagc gacttcccta agacggtcat 4169761 cgacggaatt cgcgacgcgg gctctcatat ccagtcaatg gcgatgagct gtccccagac 4169821 caggcaagtg ctcggtggat actcccaagg tgcggccgtg gccggttatg tcacctcggc 4169881 tgtggtaccg ccggctgtac ccgtgcaggc ggtaccggca ccgatggccc cggaggtagc 4169941 aaaccacgtc gccgcggtca ctctgttcgg cgcaccgtcg gctcaattcc tgggccagta 4170001 cggcgcgccg ccgatagcca tcggtcccct gtaccagccg aaaacgcttc agttgtgtgc 4170061 cgatggcgac tcgatttgtg gcgacggcaa cagcccggtc gcgcatggcc tgtacgcggt 4170121 gaacggcatg gtaggccagg gcgcgaattt cgccgccagc cgcctgtagc cagaactgcg 4170181 ctgccacccc agcgagagct gggcggtgat ccaatgcaga atgccaccat gcgcgttctg 4170241 gtcaccggcg gtacgggatt tgtgggcggg tggactgcca aagccatcgc tgacgcgggc 4170301 cactccgtcc ggttcctggt gcgaaatccc gcacggctga agacgtctgt cgcgaaactg 4170361 ggcgtcgacg tgtcggactt tgcggttgca gacatatccg accgcgattc ggtacgggag 4170421 gcgttgaacg gatgcgacgc cgtcgtgcac agcgccgcgc tggtggcaac cgacccgcgt 4170481 gagacttcgc ggatgctgag tacgaacatg gcgggcgccc aaaatgttct cggtcaagcc 4170541 gtcgagctcg gaatggatcc gatcgtgcat gtgtcgagct tcacggcgct gtttcgtccc 4170601 aacttggcga cgctgagcgc tgatctgccg gttgccggtg ggacggatgg atacggacaa 4170661 tccaaagcgc agatcgaaat ctatgcgcgc ggtcttcagg acgccggcgc accggtgaac 4170721 atcacttatc ctggcatggt cctcggcccg ccggtgggcg atcaattcgg tgaagccggg 4170781 gagggtgtcc ggtccgcatt gtggatgcat gtcattcccg ggcgcggcgc ggcgtggttg 4170841 atcgtcgacg tccgagatgt ggcggcactg cacgcggcgt tgttggaatc cgggcgtggg 4170901 ccgcgccgct acactgcggg aggtcatcgg attccggtgc ccgagctcgc gaaaattctg 4170961 ggcgggtcgc cggcaccacg atgctggccg tcccggtgcc cgattccgcg ctgcgtgtcg 4171021 cgggatcggt gctggatcaa gccgggccct atctgccttt caatactccg ttcaccgcgg 4171081 caggtatgca gtactacaca cagatgccgg agtccgacga ttcgccgagc gaaaaagaac 4171141 taggcatcac ctaccgcgat ccgcgcgaca ccgtggccga caccgtcacg gccctgcgcg 4171201 gcctgggcag ctaactgccg tcgggaggtt ccgccggttc cgcgtcgggg cgcgaattct 4171261 tcaaccactg cttcagccgg agcagttcgt tgacgacgat gccgacgccc aggaggatga 4171321 ccagcgtcac cacaatagcg gtggccacgt agtccatggt gacagccccg ccacggcgca 4171381 cgttcaggcc gcttgctgtc ggatcgagag gacctacgcg atgaaggcgg tgacctgcac 4171441 caacgcaaag ctcgaggtag tcgaccggcc gtccccggcg ccggccaagg gtcaactgtt 4171501 gctcgatgtg ctgcggtgcg gtatctgcgg atcggacctg catgcccgct tgcactgtga 4171561 tgaactggcc gacgtgatgg ccgaatctgg ctaccacgcc ttcatgcgat cgaatcagca 4171621 ggtggtgttc ggacacgagt tctgtggcga ggtggtcgat tacggtcccg gcacccgcag 4171681 gacccctagg cgcggcaccc cggtcgtcgc catgccgctg ctgcggcgtg gcaacaaaga 4171741 ggtgcacggg atcgggcttt cgacaatggc gccgggcgcc tacgccgagc ggctcgtcgt 4171801 cgagcagtcg ctgacgtttc ctgtcccgaa cgggctggcg cccgagatag ccgcgctgac 4171861 cgagcccatg gccgtcggat ggcacgccgt ccggcgcggc gaggtgggca agggcgacgt 4171921 cgcgatcgtg atcgggtgcg gtccgatcgg cctcgcggtg atctgcatgc tgaagtcgcg 4171981 cggggtacac acggtgatcg caagcgactt ttcacccggc cgtcgtgccc tcgcaaccgc 4172041 ctgtggcgct gattccgtag tcgatcccgt acaggactca ccgtatgcgg tagccgccgg 4172101 ccttggacag ggaaacagac acctgcaaag catcctcgac gcgttcgacc tcgcagtcgg 4172161 cacggtcgaa agactgcagc ggctgcggct gccgtggtgg cacctttggc gggctgccga 4172221 agcagctggc gccgcaacgc caaagcgtcc agtcatcttc gaatgtgttg gcgttccggg 4172281 aattatcgat ggcatcatcg ccagcgcacc gctgttctcg cgcgtcgtcg tggtcggcgt 4172341 ctgcatgggc tcagaccaca tccggccggc gatggcgatc aacaaagaga tcaacctgcg 4172401 gttcgtcctc ggctacacac cgttagagtt ccgcgacacg ttgcacatgc tggccgacgg 4172461 caaggtcaac gccgcgccgc tgatcaccgg gacggtcggt ttacccggcg tggcggcagc 4172521 attcgatgcg ctcggcgatc ccgaggcgca cgcaaaaatc atgatcgacc ccaagagcaa 4172581 cgccgcgagt ccccaaccat tccgcgtgga gtgaatgatg cgggatagcc gcacggcgtt 4172641 ggatccaccc gggacgacag cttgaattca ggcggcctct gctttaaagc gcacactacc 4172701 gcgcctgctg cggcatggat ccaaatatcc gccaaagtac gtatggacat ccgatagccc 4172761 ggcgcaccta cgacccgccg cgcagacaca tttacgcgtt cgcaccgatg gctgcggacc 4172821 cagcaaatgg cagagttaga gcgtcggccg tgtcttgagt caatgcttcc aggccggcac 4172881 cttttctccc gtggaccgca tgtgcccacg gtcgcgtcag taccgcccga atcattcctt 4172941 gaggcctatt gcagatgaaa ccgtcgcctg ccgataccca cgtcgtgatt gccggtgctg 4173001 gcatcgcggg attggctgcc gccatgatcc tggccgaagc cggggtgcga gtcacattgt 4173061 gcgaagctgc atccgaagct gggggcaagg ccaagagttt acgtctcgcg gacggccacc 4173121 cgaccgagca cagtttgcgg gtttacaccg atacttacca aaccctgctg acgctgttct 4173181 cgcgtatacc caccgaacat gacaggaccg tgctagacaa cctggtcggc gtcagcatgg 4173241 tttcggctac cgcgcaaggc gtgattggcc gaatcgctgc gccagttgcc ttgcaacgcc 4173301 ggcggccaac cttcgcgcgg atcataggca aggtagtcga accgccgcgg caacttgtcc 4173361 ggatcttgtt gcgcggccca atggtaatcg ttggtctggc ccaacgaggt gtgccggcca 4173421 ccgacgtcct ccattacctc tacgcccatc tacggctgct gtggatgtgc cgagagcgac 4173481 tcttggcgga gctgggcgat atctcgtatg cggattatct gcagctcggc tgcaagtctg 4173541 cccaggcgca ggaattcttt tctgctgtgc cgcgcattta cgtcgcggcg cgcaccagtg 4173601 ccgaagcggc ggccattgcg cccatcgttc tcaaggggct gtttcgcctg aaaagtaatt 4173661 gtccatcagc cctcaacgac gcaaagctgc ccgcgatcat gatgatggat ggaccgacca 4173721 gcgagcgcat ggtcgatccc tggattcgcc acctgacaag gctcggcgtg gacatccact 4173781 tcaacacgcg tgtcggcgat ctcgagttcg acgacggtcg cgtcaccgca ttgatatcgt 4173841 ccgatggccg ccggtttgcc tgcgactatg ccctgctcgc ggtgccctat ctgacgctgc 4173901 gagagctggc caaatcagct catgtcaagc gatatctccc tcagctcaca cagcagcacg 4173961 cccttgcgct tgaggcatcg aacggaatcc agtgttttct gcgcgacctc cctgcgacgt 4174021 ggcctccgtt catccgccct ggagtcgtca ctacgcatct gcaaagccag tggtcgctgg 4174081 tctgcgttct gcagggagaa ggtttctgga aaaacgtccg cctgccggaa ggaacccgct 4174141 acgttctgtc aataacctgg agtgatgtgg aaacgcccgg acctgttttt gatcggccat 4174201 tgagtgaatg tacgccagat gagatcttga ccgagtgcct gacgcagtgc ggcctcgata 4174261 aatcgaacgt cttgggctgg cggatcgatc acgagctgaa gcacttagac gaggccgaat 4174321 acgaaaaggt ggcgagcgag ctgcctcctc atcttgtctc ggcgcctgcg cgcgggcagc 4174381 gcatggtgaa tttctcgccg cttaccgtat tgatgccggg cgcgcgccac cgctccccgg 4174441 gtatttgcac ctcagtgcct aaccttttgc tagccggtga ggtgatctat tcacccgacc 4174501 tgaccttgtt tgttccgacc atggagaagg cggcatgctc cggctatctg gccgcccgcc 4174561 aaatcatgaa catggttgct tcgcacgccg caccgctgcg gatcgacttc cgggatcccg 4174621 ccccatttgc ggttctgcgg cgggtggacc gatggttttg gagccgccgc cgacgaccgc 4174681 cagaccggtc gacatttgca accccaccaa ccgccatgcc ggcgccgagc cacctgaccg 4174741 acgtggatcg ctctgcaagt tagccgccgg taacccacca agcctcgtca cgctacaagt 4174801 ccaccgttga accgacggcg ttgacgcgtc acatatccct gatccttcaa gaacgtggag 4174861 tttcccttga ctgtgcacac cgtcgccacc aacaatgctg cgcccgtcat agccgccggt 4174921 cccgtcggcc ctagcagacg acgccgtcgc gtgcacgccc cacttacgcg acgccgccaa 4174981 ccctcctcct cggcggtgct gctggtggcg gctttcggcg ccttcctcgc tttccttgac 4175041 tccacgatcg tcaacgtcgc gttccccgat atccagcggc acttccacag cgacatcagt 4175101 gacctgtcct ggatgctcaa cgcctacaac attgttttcg cggcgttcct ggtggccgcc 4175161 ggcaggctgg ccgacctgat ggggcgcaag cgggtgttca tcttgggggt ggcgttgttc 4175221 accgtcgcgt ccgggctgtg cgcgatcgcc gaaagcgtcg gggaactggt tgcgttccgt 4175281 gtgctgcaag gcatcggcgc agcggttctg gtaccggctt cgctggggct ggtcgtcgag 4175341 gccttcccgg ccgagcggcg cgcgcacggg gtcaacctgt ggggtgcggc gggggccatc 4175401 gccgcgggcc tcggcccgcc gatcggtggc gccctcatcg aggcggatgg ctggcggtgg 4175461 gtgttcctgg tgaaccttcc gctgggggta ttcgctgtgc tggccgctcg gcgggcactg 4175521 gtggagaacc gggccgccgg acgtcggcgt gtgcccgacg tgcgcggcgc ggtgctgctg 4175581 gctttcgcgc tgggcctttt gacgctggga ttgatcaagg gcccggattg gggttgggcc 4175641 agcctgccga ccagcgggtc attgctggcc gcggcggtcg cgatggttgg gtttgtgatg 4175701 agctcacgac accacccggc accgatggtc gagcccacgc tgttgcgcat ccagtcgttc 4175761 gtggccggca ccgggctgac cgccgtggcc agcgccggct tctacgccta tctgctgacg 4175821 cacgtgctgt tcctcaacta cgtctggggt tacacgctgc tggaggctgg catggccgtc 4175881 gcccccgccg cgctggtcgc cgccgtcgtc gcggcggtgc ttggccgcgt cgccgaccgg 4175941 cacggttacc gcttcatcgt cggcatcggc gcgttgatct gggctgccag cctgctgtgg 4176001 tatctcaagg ttgtcgggtc ccagcccgat ttcctcggtg aatggctgcc cggccagata 4176061 ctgcagggaa tcggggtggg cgctaccttc ccgctgctcg gcagtgccgc cttggcccgg 4176121 ctggccaagg gcggcagcta cgccaccgct tcggcggtga ccggcaccat ccgccaggtt 4176181 ggcgccgtca tcggcgtcgc ggtgctggtg atcctggtcg gcacaccggc accgggcgca 4176241 gccgaagagg cgttgcgtca cgggtgggcg ttggccgcga tctgtttcgt ggcggtgggg 4176301 atcggggcgc tgtcgctggg tcgcatccgc ccagtcccag ctgcggttga acccccgccg 4176361 gggccgccgg tggctccgtt gggagcgcgg cggccgccga gacccgcacc ggtggcctca 4176421 cccgccgcgg cagtggcccc gacccccaag acttcccgcg aagtcaacct gctggaggct 4176481 ctgcggtttg ccaggccgga cacgcaacag attgagctgc aagcaggctc gtatttgttc 4176541 cacgcgggcg atgtgtccga tgcgctctac gtggtgcgca gcggccgcct gcaagtcctc 4176601 gccggcgacg gcgcaaagga cgaagtggtg gccgagctgg gccgtggtca ggtggtcggg 4176661 gagctcgggg tgctgctcga tgcgccgcgg tccgcgtcgg ttcgtgcggt acgcgactcg 4176721 tccctgatgc gagtgaccaa ggccgaattc gcgaagatcg ccgatgccgg ggtgcttggg 4176781 gcgctggcgg gggtactggc caaacgacag caccagacac gcgtggcctc tcagcggaca 4176841 acgccggagg tcgttgtcgc ggtcgtcggt gtcgacgcca atgcaccggt cgcaatggtg 4176901 gccaccgaat tgtgcagggc actgtcgaca cggctacgtg ctgtcgcccc cggccgggtc 4176961 gactgcgacg ggttggaacg tgccgagcag accgccgacc gggtggtgct gcatgcggcc 4177021 gtcggcgacg cgcggtggcg ggaattctgt ttgcgtgtcg ccgatcgcgt ggtgctggtg 4177081 gccagcaacc cggccgtgcc tgtggccccg ctgccgaccc gagcgaccgg cgccgacctg 4177141 gtgctggccg gacggcccgc cggccgggag caccgacgtg cctgggagca gttgatcacg 4177201 ccgcggtcga tgcatgtggt ccgacgcgaa tttgtcgccg acgacctgcg ggtgctcgcc 4177261 acgcgtatcg cgggccgttc cgtggggcta gtcctcagcg gtggggcagc gagggcgtgt 4177321 gcccacttgg gcgtgctgga ggaactggag gccgccgggg tcaccgtcga ccgctttgcc 4177381 ggcaccagca tgggcgcaat catcgcggct ctggcggcca gcggtttgga tgctgccggg 4177441 gtggatgcgc aaatctacga gcacttcgtg cgcaagagcc acggcgacta caccctgccg 4177501 agcaaggggc tgatccgcgg gaaacgcacc cagtccacgc tacgcacgat cttcggagac 4177561 catttggtgg aggagctgcc gaaacatttc cgctgcgtca gtgtcgacct attggcccgg 4177621 cgtcccgtcg tgcaccgcca aggcccgctc gccgacgtcg tcggctgctc gatgcggctg 4177681 ccttttctgt atgcgccact gccctacggc ggcaccctgc acgtcgacgg cggtgtgctg 4177741 gacaacgtgc ccgtcaccac gctggtgggc aaggacggcc cactgattgc ggtaaacgtg 4177801 gcctctggcg gaaatccaag ccccgcgtcc ggcggccatc gccgcggcaa accacgggtg 4177861 cccggcctaa ccgacaccct gctgcgcacc atgacaatca gcagcgcgat ggcatcggaa 4177921 aaagtgttgg cccaggccga cctggtgatc aagcccaacc cgatcggcgt cggactcatg 4177981 gagtaccacc agatcgaccg cgcccgtgaa gcgggccgga tcgcggcccg tgaagcgttg 4178041 ccacaaatca tggagctggt gcacggctga acctgggcag ggccgctaag atactgtgac 4178101 cacggccacg ctatcggcgg cctggccagc tttccgggcc gctacccgat gggagtcctc 4178161 acccacgccg ccggcggacc caaccccgat tgttcgaccg cagacactga tctatcgcgc 4178221 aggcgttgcc gcatggtgga ctagcccaat gacgcgggct gacggcaagc gcgaccgtga 4178281 cgagatgttc gtcgaataca ccaagagcat ctgccccgtc tgcaaggtcg tggtcgacgc 4178341 ccaggtcaat atccgccacg acaaggtgta tttgcgtaag cgctgccgcg agcacggaag 4178401 tttcgaggcc ctggtgtacg gggatgccca gatgtatttg gaatcagcac gattcaacaa 4178461 accgggcacc tttccgctgc ggtttcagac cgaggtgcgc gacggctgtc ccagtgactg 4178521 cgggctgtgc ccggaccaca agcaacacgc ctgcctgggg ttgatcgagg tcaacacaca 4178581 ctgcaacctg gactgcccga tctgtttcgc cgactctggc caccaacccg acggctacgc 4178641 catcaccgcg gcgcagtgtg aacggatgct cgacacgctc gttgccgccg agggtgaacc 4178701 cgaagtggtg atgttctccg gtggcgaacc gaccatccac aaacaactcc tcgagttcgt 4178761 cgacgccgcc caggcccgcc cggtcaagac cgtcatcatc aacaccaacg gcatccggct 4178821 ggcctccgac cggcgattcg tcgaccagct cgccacccgc aaccgtcccg gccaccccgt 4178881 gcacatctac ctgcagttcg acggcctgga cgaggcaaca catcgtcgaa tccggggcca 4178941 cgatctgcgg gacgtaaagc agcgggccct ggacaactgc gccgcggcgg gcctgaccgt 4179001 cagcctggtg gccgcggtgg aacgcggcct caacgagcac gagctcggcg cggtcatccg 4179061 ccacggcatg gcgcagcccg gagtgcaacc ggtggtattt cagccggtca cccacgccgg 4179121 ccggcatgtg cagttcgacc cgctgacccg actgaccaac tccgacatca tcgcctgcat 4179181 caccgcgcaa ctgcccgaat ggttcaggcc cggtgacttc tttccggtgc catgctgctt 4179241 ccccagctgc cgatcgatca cctacctgct caccgacggg gagcatgtgg tcccgattcc 4179301 gcggctgctc aatgtcgagg actacctcga ctacgtctcc aaccgggtga tccctgacct 4179361 ggcgatccgc gaagccttgg agaacttgtg gtcggcgtcg gcggtgccag gcaccgacac 4179421 catgaccgca cagctacagc gggctaccgc cgccctgaac tgcgccgagg gctgcgggat 4179481 caacctgccc gaggccctca cgcacctcac cgaccgggtc ttcgccatcg tcatccaaga 4179541 cttccaggat ccctacaccc tcaacgtcaa acagctgatg aaatgctgcg tgcaacagat 4179601 caccccggac ggacggctga tcccgttctg cgcctacaac tcggtcggct atcgagagca 4179661 ggtgcgtgaa cagctcaccg gggtaccggt acccgacatt gtgcccaatg ccatcccact 4179721 cgccgggttg ctggcggacg caccacacgg atcaaaacag gccaataccg gtgggagtat 4179781 cgccaggctc gcggggccaa cccgaggtgc gccgatggca ctgccaccac agcagatcaa 4179841 agcgtgttgc gccgacgcct attcccgcga catcgtcgcc ttgctactcg gtgactcctt 4179901 tcacccgggc ggcgcgacat tgacccgtag gttggctgac caactcgggc tgaggtcgac 4179961 aggcgacccg cggcgggtcg ccgacatcgc cgccgggccc ggcgcctccg cacggctgct 4180021 ggccagcgac tacggtgtgg ctgtcgacgg ggtcgacatc agcgagatca acgtgaagcg 4180081 cgcccaagcc gccgtcgcgc aaaccggcct gaccgagcgg gtgcgcttcc acctgggcga 4180141 cgccgaatca gtcccgttgc ccgacgacac attcgacgcg ctggtgtgcg agtgcgcgtt 4180201 ctgcacattc ccggacaaga acgccgccgc ccagcagttc gctcggattc tgcgtcctgg 4180261 tggcctggcc ggcatcaccg atgtcactgt cggggacggc ggcctgccgg cggagctgac 4180321 cccattggcc gcgtgggtcg cctgcatcgc cgacgcccga accgtcaccg actacaccga 4180381 catcctcgaa ggggccggat tgcgcacccg ccacatcgag tctcatgacg agagcctgct 4180441 ggacatgatc gaccgcatcg acgcgcggat caccgccttg cacgtcgccg caccggagat 4180501 cctcgccgac aacggcattc gccacgactc ggtgcgcgat ttcacagcgc tcgcacgcgc 4180561 cgcggtacaa accggacgaa tcggatacac gttgatgatc gcggaaaagc cgtgataatc 4180621 caggaaatgt gggacagacc aatcgcattt cccgcatctg aggagcgagc cgcaccgcgt 4180681 tacttcgacg tgtttccccc cttcaagtcg gtatcccggc tcggctgcac ccgcttgggt 4180741 tcgcccggca tcttcggata gttcggcgga tacggcatgt caccgagccc gcgctcctcg 4180801 tcggcggcgg ccaagtccag caatggtgca atcgactggg ccacgtcgtc catgccggcc 4180861 caggggtcgt cgcggatctt caccagctcg ggcaccgtgg tcatggtgta gtcgtcggga 4180921 tccgcgccgg ccagctcttc ccaggtcaac ggcatcgata ccgtcgcgat cggggtagga 4180981 cgcaccgaat aggccgacgc catggtgcgg tcgcgggcgt tttggttgaa gtcgatgaag 4181041 atacgcgcgc cccgttcttc cttccaccac gacgtcgtca ccgcatccgg tgcgcggcgc 4181101 tcgacttccc gggccaacgc aatgcccgcc cgacgcacct cgacgaagtc ccagtcggtg 4181161 gcgatgcgca ggaatacgtg aatccctcta cccccggatg tcttcggata accgaccaga 4181221 ccgaggtcgt ccagcacgga ccggagcaca tcgacggcga ccgtacgcgc ctccacgaag 4181281 ccggtgcccg gttgcggatc tagatcgatg cgcaattcgt cggggtgctc ggtgtcgggg 4181341 cagcgcactt gccacgggtg cagggtgatt gtgcccatct gcgccgccca tacgatcgcc 4181401 gccgggtggg tcaccttcag cgcgtcagcc atccgccccg acggaaacgt cacccggcac 4181461 gtctgcaggt agtcagggcg gtgccgcggg atccgctttt ggtagatctg ctcgccgtcg 4181521 acgccgtccg ggaagcgctg caagtgcgtc ggccggtcac gcagcgccgt cagcatcgga 4181581 cccccggcca cggcgaagta gtactcaacg aggcggcgct tggtgccgtg cgaccccagc 4181641 ttcgggaaat acatcctgtc cgggctagtc aaccgcaccg cgatgccgtc gacgtcgagt 4181701 tcctcagctg ccgccgccat atcggaattc cagcatgccg cacgcaagaa tgagcacatg 4181761 cagttacccg tcatgccgcc ggtgtcgccg atgctggcca aatcggtcac cgcaatcccg 4181821 ccggacgcgt cgtatgaacc caaatgggac ggattccgct ccatctgctt tcgcgacggt 4181881 gatcaggtcg aactgggtag ccgcaacgag cggccgatga cccgctactt ccccgagctg 4181941 gtcgccgcga tcagggccga gctgccgcat cgctgtgtga tcgacgggga gatcatcatc 4182001 gccaccgacc acggcttgga cttcgaggcg ctgcaacagc gcatccatcc tgccgagtcg 4182061 agggtgcgaa tgcttgccga ccgcacacca gcctccttca tcgcattcga cctgctggcc 4182121 ctcggcgacg acgactacac cgggcgaccg ttcagcgaaa gacgagccgc tctggtcgat 4182181 gccgtaactg gttcgggggc cgacgctgac ctgtcgatcc acgtcacccc ggcaaccacc 4182241 gacatggcga ccgcacaacg atggttctcc gagttcgagg gggccggtct agacggtgtc 4182301 atcgccaaac cgccgcacat cacctatcaa ccggacaaac gcgttatgtt caagatcaaa 4182361 cacctgcgga ccgccgattg cgtggtggcc ggctaccggg tgcacaagtc cggcagtgac 4182421 gcgatcggct cactgctgct agggctttac caggaggacg gccaactcgc gtcggtcggc 4182481 gtgatcggcg cgttccccat ggccgaacga cgccggctat taaccgagct gcagccgctg 4182541 gtcaccagct tcgacgacca cccatggaac tgggccgccc acgttgccgg ccagcgcacc 4182601 ccacgtaaga acgagttctc ccgctggaat gtcggcaaag acctgtcgtt cgtgccgctg 4182661 cgacccgagc gggtggtcga ggtccgctac gaccgcatgg aaggcgcgcg gttccgccac 4182721 accgcacagt tcaaccggtg gcgccccgac cgcgacccac gctcatgcag ctatgcccag 4182781 ctcgaacgcc cgctcaccgt cagcctctcc gacattgtgc cgggcctacg ctaaggtgcg 4182841 accctcttcg gtcagttgat ccccggtggg ccgatcggct cgggcgccac atccgggtcg 4182901 gttcgttgcg ttcggccgcg taacatctgc ggcatggcgg tgctgcccgc gtgccggttg 4182961 ggacttgtcg tctgtgtggc gaccgcagtg atcacagcaa ccatggtgtt ggctacgccg 4183021 agctatgcat gcgcctgcgg tgccgcggtc acagcacatg gctcccaagc aactttgaat 4183081 catgaagtcg cgctgcttca ttgggacggg acgaccgaga cgatcgtcat gcagctggca 4183141 atgaacgccg ataccgacaa cgttgccttg gtagtgccca ccccgacgcc ggcgatagtt 4183201 acaaccgcgg accagtccac gttcggcgag ctggacacgc tcagtgcgcc gttgatcgag 4183261 catcagcgac attggagctt aaggcgcggt gtcggtgcct ccggtcccca ggaggccgcc 4183321 gcccgggccc cgcatgtgct caaccaggtt cgccttggcc cgctggaggc caccaccttg 4183381 accggcgggg atctgagcgg cctgcagact tggttgtctg acaacggcta tgcgattcga 4183441 ccggcggtgt cagcggcgct ggatccctac gtgcgtgacg gatgggcgtt cgtggcgatc 4183501 cggctgacca gcaccgacct gatagtgggc gggctcgatc cggtgcggat gaccttccga 4183561 tcgtcgcggt tggtgtatcc catgcggcta tcggtcgccg cccaggagcc gcaacatgtc 4183621 accatcttca ccctgtccga tcaccggcag cagcgcaccg acgccgacgc tgccacacag 4183681 acaacccacg tccggttcgc gggcgacatg tccactgcgg ttcgtgaccc tctgttgcgc 4183741 gagctgatcg gcaaccacgg ctcatatctg accaaggtcg aggtggacat ctatcagaca 4183801 tcgcgaatct cttcggattt cacgttcggc aacgcaccaa acgacgatcc gtaccggcag 4183861 gtggtcaccg tttacgacga tgtcgcactc cccccgctgc tgctggtggt cgtgtcggcg 4183921 atcgcggtgg gcgcggcggg cggggccgtt gtggtggttc tgcggcgacg gcggcgcgcc 4183981 cacactgggt agtccgccac ggtgagggcg ctcagcgagg cagggattct ggtccttcag 4184041 acaaacccgc cacggccggg tgcgccatca accggtcgag aaaaccccgc tgccccttga 4184101 gcagtttggt gcgtgcccgc gctaccggaa accagctcac ccggtcgacc tcggggaact 4184161 tacgcatctt gcccgagccc ttcggccagt ccaattcgaa ggtgctgctt cgtgcgtcgg 4184221 tgatgtccag atccgcccgg acaccgaaca cggtcaccac cttgccgccg gactgtttca 4184281 gcgacccgaa gtcgattcgc ggcccgtcag gcacgcacaa cccgatctcc tcggagaact 4184341 cgcgccgggc ggccagccac ggatcttcgc cgccggtgta ttcgcccttc gggatcgacc 4184401 aagcgccgtc gtcctttccc gcccaaaacg ggccgcccgg atgcgccaga aggacgtcga 4184461 cgacaccggc gcgcgcccga tacagcagca cacccgcgct gagcttgggc atgagtacgg 4184521 gttctttaga tcccgacggc ctgttccaga tccttcagcg acgattccag gtgcgcaagc 4184581 agccgctgca agtggggcac actgcgacgg cacccgacca gtccgaagtc gagattccca 4184641 gcattgttca ccagggtgat gttcaacgct tgaccgtccg ggatgttcga caatgggtaa 4184701 ctaccgtcaa gccgggccgt gccgtagtag agcgggtcta ccggccccgg cacattcgag 4184761 atgacgatgt tgaacggtgg cggcactgcc gacaagaaac ccggtacacc cgccaacgtc 4184821 agcggcgcca tattcaatgc cgacaatgca agcacctgca gctgcggcaa ttcggagagc 4184881 actttcttgt tgccgtccat ggacgcgctg atggtctgaa tccgttgcgc tgggtcgtcg 4184941 acatgggtgg cgagattgca caggacgctg ccgaccaagt tgccgccggc gtcagcgtcc 4185001 tctttggagc gtaggctcac cggaaccatc gcgatcagcg gtctgtccgg cagcgcattc 4185061 cgctctatca ggtagtagcg caacgcaccg gcacacatcg ccaggacggc gtcgttgacg 4185121 gtcacaccgg cggcctgctt gacgctcttg atccggtcca gcgaccagga ctgcgcagcg 4185181 caccggcggg ctcccccgac cttgacgttg aacatgctgt gtggcgccgc gaacggcagc 4185241 gtcaactgct gctcgagtag cgccgcacga gccagcttca gcgtcgacgg tgcaagtccg 4185301 acaacggatc ccgccatctt gaacagcgca tccaacagtg acgagccgtc cgatggcggg 4185361 cgcgtacgtg ggcgcggagg caggttccag atggcgcgca cctcggcgtc gtccgggtca 4185421 gccgacagcg tgcgctgcgc cagcttcatc gccgaaacac cgtcgatcag ggcgtggtgc 4185481 attttggtgt acatagcaaa ccggccgtcg ttcagcccct ccaccacgtg cagctcccac 4185541 agcgggcggt ggcgatcgag caggctggta tgcagccttg aggtcagctc gagcagatcg 4185601 cggactcgtc ctggcgaggg cagcgccgag cggcgaacgt ggtaatcgat gtcgatgtcg 4185661 tcgtcataag cccatgccac acgggcgatt ccacccccga tcgtcgcagg gtgctttcgg 4185721 aacatgggct ggaattcgtc gttggcaacc aaacgctcgg tgaactcacg gacgaactca 4185781 ggaccagctc cctgcggtgg ctcgaacaac gacaagccac ccacatgcat ggggtgttca 4185841 cgagattcaa tgaaaagaaa catcgagtcg ttgggcatca tcagatccat gcacccatta 4185901 cacccattac cgagtgatcc gggaaggctt ctgtggtgcc cgaggttcgg caagtcgcaa 4185961 gaacatcgcc gcccagctga cttcgggatg acaacgcatg tagtccggag cggcttgagg 4186021 ttgcaacgtc gggtgggcga agtagtccgg ctgagaggta ttggtggcag catgggtttg 4186081 tgacctcaat gtcgttggcc tgggatgtgg tgtcggtcga caagccggac gatgtcaacg 4186141 tcgtgatcgg ccaggcgcac ttcatcaaag cggtcgaaga cctgcacgag gccatggtcg 4186201 gcgtgagccc atcgctacgg ttcgggctcg ccttttgcga ggcttccggg ccccggttgg 4186261 ttcgacatac cggcaacgat ggcgatttgg tcgaactcgc gacccgcact gcgctggcca 4186321 tcgcggccgg gcatagcttc gtgatcttct tacgtgaggg gtttcccatc aacatcctca 4186381 acccggtgca ggcggtgccc gaggtctgca cgatctactg cgccacagcc aatccggtcg 4186441 acgttgtcgt cgcggtgacc ccgcatggtc gcggcatcgt gggtgttgtc gacgggcaga 4186501 cccctctggg agtggagacc gatcgcgaca ttgcgcagcg gcgtgacctg ttgcgcgcca 4186561 tcggttacaa gctctgatac gggccgccgg tccgcccttg acagcgggac gtccgccgca 4186621 gagggtcgac ggcatgtccg tggtgcgcgg gaccgctctg gctaactacc cgagcctggt 4186681 tgccgggttg ggcggtgacc cggccactct gctacgggcc gcgggtgttc gggatcagga 4186741 tgtcggcaac tatgacgcgt tcatttcgat ccgggcagcg attcgggcaa tcgaatcggc 4186801 cgcagcggtc accgccacaa tggatttcgg gagacgattg gcacagcggc aagggattga 4186861 gatcctggga ccggtcggtg tggcggcccg cacggccgcc acggtcggtg acgctctggc 4186921 gatcttcaac accttcatgg cggcctacag cccagttatc gccatccgga tcacgccgct 4186981 ggccggacag cggtcattta ttgcactcga gttcctgctc gacgagccgg cgtcgtatcc 4187041 gcagaccatg gagctggcgc tcggggtggc gctcggggtg atccggttgt tgttgggcgc 4187101 tgactacgcc ccactggccg tgcacttacc ccacgaccca ctcacacccg aagccttcta 4187161 cctgcagtac ttcggctgcc ggccttactt cgccgaacgt gttggtggtt tcaccatgcg 4187221 caccgcggac ctgagccgtc ccctcaaccg cgacgatgtc gcccaccggg tggtcgtcga 4187281 ctacctgagc agcatcacgc cgctgggcga ggggatcgtg gaatcggtgc gcaccatcgt 4187341 gcgccagctg ctgcccaccg gagcggcgac gctcaacgtg gtcgccgagc agttccacct 4187401 gcacccgaaa acgctgcaac gtcgacttgc ggaggagaac accacattcg ttattctggt 4187461 cgatcgggtc cgcaaggatg tcgctgatcg ctacctaagg accaccggga tcggccttac 4187521 ccatttggca cgtgaactgg gctacgccga acaaagcgtg ttgacccgct cgtgcaaacg 4187581 ctggttcgga accggaccgg ccgcctaccg caaccaggcc aggttacaga caaccgtgag 4187641 cgcacctggc agcgggcgtg gtccgaatcc aggtaacgtc tcagtatcct gctgaccgat 4187701 ggatcaagat cgatcggaca acacggcatt gcgccgtggt ctgcgaattg ccctgcgcgg 4187761 gcgccgcgat ccgctgcccg tggcgggccg gcggagccgg acctccggcg gaatcgatga 4187821 cctgcacacc cggaaggtgc ttgacctgac catccggctc gccgaggtga tgttgtcgtc 4187881 cggctctggc accgcggatg tcgtcgccac agcccaggac gtggctcagg cctaccagct 4187941 caccgattgc gttgtcgaca tcaccgttac caccatcatc gtgtccgcgc tagcgaccac 4188001 agacactccg ccggtcacca tcatgcggtc ggtccggacc cggtccactg actacagccg 4188061 gctggccgaa ctcgatcgac tcgttcagcg gataacctcc ggtggcgtcg cagtcgacca 4188121 ggctcacgag gctatggacg agttgaccga acggccccac ccctacccgc gctggctcgc 4188181 gaccgcgggg gcggcgggct tcgcactcgg cgtcgccatg ttgctcggcg gaacctggct 4188241 gacctgcgtc ttggctgccg tgacgtctgg cgtgatcgac cgactgggcc ggctgctgaa 4188301 ccggatcggg accccgttgt tcttccagcg cgtgttcggc gcggggatcg cgaccctggt 4188361 cgcggtggcg gcttacctga tcgccggcca ggatccgacc gcgctggtgg ccaccggaat 4188421 cgttgtgctg ctgtctggga tgaccttggt gggttcgatg caggacgcgg tcaccgggta 4188481 catgctcacc gcactcgccc ggcttggcga cgccctgttc ctgaccgcag ggatcgtcgt 4188541 cggcatcctc atctcgttgc ggggcgtcac caatgccggc atccagatcg aactgcatgt 4188601 cgacgcaacc acgacgctcg ccaccccggg catgccgcta ccgattctcg tcgcggtaag 4188661 cggtgcggcg ctgtccggcg tgtgcctgac gatcgcgagc tatgcgccgc tacgttctgt 4188721 ggccaccgcc ggactctcgg ccggactcgc cgaactggtg ctcatcggac tcggcgcggc 4188781 cgggttcggc cgagtggtcg ccacctggac cgccgcgatc ggcgtcggct tcttggccac 4188841 cctgatctca atccgtcggc aggctcccgc cttggtgacg gccaccgccg gcatcatgcc 4188901 gatgctgccg ggccttgcgg tcttccgtgc cgtgttcgcg ttcgccgtca atgacacacc 4188961 cgacggcggt ctgacccagc tgctggaagc ggccgcgact gcactcgcgc ttggcagcgg 4189021 ggtggtgttg ggcgagttcc tcgcctcacc attgcggtac ggcgccggcc ggatcggcga 4189081 cctctttcgg atcgagggtc cacccgggct ccggcgggcg gtcggccgtg tggtgcgcct 4189141 acagccggcc aagagccagc agccgaccgg caccggtggc caacggtggc gaagcgtcgc 4189201 gctggagccg acgacggccg acgacgtgga cgccggctat cgcggcgatt ggcccgctac 4189261 ctgcaccagc gcgaccgagg tgcgctagcc agcctcgcca gcgccgacca actgctccca 4189321 gctagcgggc accatcggca ccgacggact acccccgaac tcaccaccgg ccaacgtggt 4189381 caaccccgca ggacgcccaa cggactcctt gccagcggtc ccggcaaacc ccaacacgcc 4189441 ggcaccccga tccgaagcca gcaccgacgc cgacgtgccc gcgggagctg gcgccaccgc 4189501 cgacaccagc ctggcctccg gtcgcaccgc ggcagactca cgcaacggcg gcgccgctgc 4189561 aggtcgtacg gcaacgggcg ccacgtccgc cgtctttagg cccacttcga tagcctgcgc 4189621 gtccgcactc gccaggtccg cgaggtactg gccaacgccg ataggcaccg cagtcgacaa 4189681 cgaagtcgac aacgaggtcg gcaccgcaat cacggagccg gtgagcacaa atggcgatgc 4189741 gatcacaagg agcggcgggc cgaagatgat cgccaggacc gcaaagacga tcgcgtatgc 4189801 gaagatcacg agcggcaaga tgagcacgat gatgatcgtg tatgcgacga ttgcaaacag 4189861 gatctcgagc gatatcagaa agagctgaat caatatctcg atgatgatcc cgatgatgct 4189921 ggcggggtcc aacgttgcgg cggagatcgc cggcagggcg ctggcaacgc cagcaccgcc 4189981 gttgaacagt accggagccg gtgtggtttg cggtgccgac gccagcgccg catcggaggt 4190041 gccctcatag atactcatcg tggtggccgc ctgaatccac atccgcgcat agtcggcctc 4190101 attgagcgcg atcgggatcg tattgattcc aaagaaattc gttcccagca acaccgcatg 4190161 gctggtgtga ttagcggcca actcggtcag cgtcggcatc gccgccagcg cgctggcata 4190221 tgccgtggtc ataacctcat gctgggtggc cagccgcgca ctgtcggcac tggatttagt 4190281 tagctaggct agataaggta ggtgggcggc cacataagct tcggcgctcg gaccctccca 4190341 cgccccgccc tgtaccgctg ccagcaccgc agtgagctct tgggctgccg aagcatactc 4190401 cgcactcagc gatgtccatt ccgcggcagc cgcctgcaac gaagccgggc ccggaccggc 4190461 gctgagcaac gccgaatgca cctccggcgg cgaggcaaac cagatgggcg ccgtcatagt 4190521 gagccccctg aaaccgaatc caccgccggc gacagcgcgg ccgccgccaa accatcagga 4190581 actaccccct gcgtcactcc tcgtactcct tcggtcatct caccgaccaa ccgcagccga 4190641 aagccggtca gtcaactgac tgggccaagt cgcacacatg acggactata gatctcacgc 4190701 aatatagcga taatcgatca tttccacgag ctacgatgct cgagttgccc agccagcaag 4190761 atacgtccct ttacaccagg cagataaact gggctggctt tggtcaaacc cagcgcgacc 4190821 cgcagcactt cctcatagcc cgaccgcgcg ctccagttcc ttcaacgagg tctccagatg 4190881 gctgagtacc cgctgcacgt gtggaacgct gcggcggcaa cccacgactc cgaagtcgag 4190941 actatcggcg gtgctggtca gggtgatgtt gagcgcttgt ccgtcgagca ccaacgacat 4191001 tggatagttg ccgaccatcc tggcgccgtt gaagtacagc ggttcgcgcg caccgggcac 4191061 gttcgagatg cacacattaa acggcggtgg cgttgccttg gccaagcccg gcagggtgtt 4191121 cagcgcagct gggctcaaca gcagcagtga caccgccaac gcctgggcgc ggggcagctg 4191181 cgatagtacg ttcttattac cgcgcatcga agcgtggatg gcgttcagcc ggtcggctgg 4191241 atcatcaagg tgggtggcca gattacacaa caccgccccg accatgttgc cgccgaccga 4191301 gtcgcggtcg gtgcgcaggc tcaccggaac catcgcaacc agcggcgtgt ccggcagcgc 4191361 gtcgttgtcg tccagatatt cgcgaagtgc gccggcgcac atcgccagca ccacgtcgtt 4191421 gaggctgacc ccggccgcgt ctttcaccgc cttgacccgg tccaacggcc aggactgcgc 4191481 ggcgcagcgc cgcgctcccc cgacggcgac attgagcatg gtgtgcgggg ccccgaaggg 4191541 cagtgtcaac tgttgttcga tcaacgcgga acgcgccagt cgcaacgttg agggagcgag 4191601 cccggcaacc gatcccagca tgccccccag ctgttgcagg cggccgcgcc gtcgcttgat 4191661 ggcggtgtgc tgcgtcgccg gtgaccaggc ggtgcgcaac ttgccctcga tggggtcggt 4191721 ggtcatcggc tggcgcatca gcgtaagtcc ggacaccccg tcgaccaggg cgtggtgcat 4191781 cttcgaatag atcgcaaagc gtccatcccg gaggccctcg atcacgtgtg tttcccagag 4191841 cgggcggtgc cggtcgagca gattggagtg taaccgtgac gtcagttcca gcagctcacg 4191901 cacccggccc ggcgccggca gggcagaccg ccgcgcgtgg tagccgaggt cgacgtcagc 4191961 gtcggtcgac cagccgaggt tgatgagtgc accgtgaagc gacgtggggc gcttgcgaaa 4192021 tagcggtgct atctcgcggc actgaagcat cgcctgatag gtttcccgca caaacccacg 4192081 tcccgccccc gcgggtggct cgaacagttg cagcgcgccg acatgcagcg gatgctctcg 4192141 cgactcggct gataagaaca gcgcatcgat cggtgacatc agttccatgg cgtgctcctg 4192201 gtgatgcgct tcaccgtcag ccggctcgcc gaagccgacg tcgtaaagcg caggtgatcg 4192261 tcgtcgaccg ggccctcgcg caacaccttg aggtccgcca gggggcttcg gcgccctgca 4192321 gcggccgggt cggcatggct gcgggcagct gcggacagca gatggtgtgc ccgggtgcgt 4192381 ccatgtgggc cggcgaagtt ggacacgtcg ctgagcagga aacccttgaa tgccaccttc 4192441 tcggcaacgt ccacggcgac tccgtcgacc gagaggttga tgccaccgaa ggccagcaga 4192501 ttgaggccgg tggcagtgat gctgatgtcg gccgccaatt cgcgtccgga ttgcagccgg 4192561 attccgtttt cggtaaaagt atcgatcgcc tcggtgacca ccgaggcccg gccgtcgcgg 4192621 atggccttga acatgtcggc atctggcacc gcgcacaggc gttggtccca tgggttgtag 4192681 accggcttga agtgctcgtc ggccggatat ccggcggcca gctgcttggc gttgagatga 4192741 cggatcagtc gccgggcggc tctcggatac cgttggcata accgccacac caaccgttgc 4192801 ttggcgatgt ctttgcgccg ggtgacggcg taggcccgat cgcggcctat catttgggca 4192861 tggttaccgc gccggcggtc tgggccatgg ccggcaccag cgtgaccgcg gtcgcgccgc 4192921 tgccgatgat caccatccgc agctcatggt gacgtgttcg ccggtgtcga agcgttcgat 4192981 ctccaccagc cagcgagcgt cctcggtgga ccatgatggc gtccgcgctg gcggtcgcct 4193041 tctcgtgctg ccacggcttg aactcatagc tgaacgtgtg caggtcggag tcggatcgaa 4193101 ttgctggata ccgggcttca acgatcgcga atgtcttggc cggctgcatt gtctttaggt 4193161 agtaggcggc gccagtgccg gagatgccgg cgccaacgat cagcacgtcg acgtgttcga 4193221 tgctggctga ctgctcggag tgcacggcgt acttcctgtt cgggcgaagg ctgacccgcg 4193281 acttcgttgt caaccggggg tggtgtgcgt caccgaactc actgtgcacc agcactcggc 4193341 cttgagtctt gacactagaa gacaacaatt tgacttttca agacacagcg tcacctgtgc 4193401 gcggtgccag cggcgcggcg ccaggccgtg tggcgcagta ggcgcagccc attgagtccg 4193461 acgatgatgg tggaaccttc gtgtcgggcg acgcccagtg gcaatggcaa cgtgaaggcc 4193521 aggtcccaca caacgagccc ggcgatgaat gtcacggcca cgatgaggtt ggcgaccacg 4193581 atgcggcggg ctcgccgcga catggcgata acggtgggaa tggtggtcag gtcatcgcgg 4193641 acgacgacgg cgtcggcggt ctgcagggtg agttccgatc gggcgctgcc catggcgatg 4193701 ccgacatgcg cggccgctaa ggccggagcg tcgttgatac cgtcaccgac cacggtcaat 4193761 ctggcacctc cagcttgcag ctgccgcacg gctgcgacct tgtcgtcggg cagtagcccg 4193821 gcccgtacgt cgtcgatgcc aacctgtaca ccgagccgat cggcggtggc ccggttgtcg 4193881 ccggtaagca ataccggttt ggccccggtc agtttggtcg cagcggaaat cgccgcggcg 4193941 gcttcggggc gaagctgatc ggtgatggcg agtagcccga cgggatggct atcgcatacc 4194001 acgacgacga cggtgtagcc ctcgccttgc agaaagtcga ccgccgtgat catggaagct 4194061 tcgagcgcgg cggcgccggc agtgcccagc agtgccgtcg ccgatccgac cgcaatgacg 4194121 tggccatcga cgcgggcggt gacacggcaa cctgggtgtg cggtgaactc gccgacggtc 4194181 ggcagccgga tgcggcgaga ctgggcggct ttcacgatgg ccgcacccag tgggtgctca 4194241 ctgggatact ccgctgcagc cgcaagccgc agcagttcat catcggtgaa tcgtcgttcg 4194301 tacacccaga tgccggcgag ttcgggggta ccgcgggtaa gggtgccggt cttgtcgaac 4194361 gcgatccgtg tggtggttcc aagttgttcc atcacgatcg cggacttggc gagcaccccg 4194421 tggcggccgg cgttggcgat tgcggccaat agtggcggca tggtggccag cacgaccgca 4194481 cacggcgacg cgacgatcat gaacgtcatg gctcgcagca acgcccgctg cagggtctcc 4194541 ccccatagcg ggggcaccgc gaatacggcg agggtcacgg cgaccatgcc gatcgagtag 4194601 cgttgttcga ctttctcgat gaacagctgg gtgcgcgcct tggtctggct ggcctgttca 4194661 accagggtgg caatgcgagc gacgacggaa tcccgcgcga gccggtcgac ccggatccgc 4194721 agggcgccgg tgccgttgac agtgccggcg aacacctgat cgccgattga cttgtcgacg 4194781 ggcagcggct ctccggtgac ggtggcctga tcgacttcgc tgccgccggc aagcacggtt 4194841 gcgtccgccg agatgcgctc accgggccgt accagcacga tgtccccaat ccttaggtcg 4194901 gcggcgttga ccgtttcctc accaccgccg gcgcccacgc gggtcgcggt gcccggcgcg 4194961 aggcccatta gcccacgcac cgagtccgcg gtgcgggccg ttaccagtgc ttccagagca 4195021 ccggaggttg cgaagatgac aatgagcaga gcgccctcgg cgatctgccc gatggcggcc 4195081 gcgccgatcg ccgcgaccac catcagcaga tcgacatcta gggtccttcg ctgtagcgcc 4195141 tgtagcccgg ccagccctgg ctcccaaccg ccggtcgcgt agcacgccag aaacagcgcc 4195201 caccgcaccc attgcggtgc tccgcacagc tgtgtcagta gtcccgctga aaacaggccc 4195261 aacgccagcg cggcccaacg catctccgac aacgcgaaca gcttggttcg gcgcgctagg 4195321 accaacggcg acgctgaggt gcaccgggcg ggagagagtt cacgaacagc cacccggcca 4195381 acatatcaga atatatgatc atatgttcat ttatttcttt ggggataggc tgcctaacca 4195441 tggggcacgg ggtcgaaggc aggaatcgtc cgtcagcgcc gttggattcc caggccgccg 4195501 cgcaggtcgc gtccacactg caggcgttgg cgactccgag ccggctgatg atcctcaccc 4195561 agctacggaa cggcccgctt ccggtaaccg acctcgccga ggctattgga atggaacagt 4195621 ccgccgtctc gcatcaactt cgagtgttgc ggaatctcgg cttggtcgtg ggcgaccggg 4195681 caggccgtag catcgtctac agcctctacg acacgcatgt ggcgcagctt cttgacgaag 4195741 ccatttacca cagcgagcac ttgcaccttg gtctctccga ccggcacccc agcgcgggct 4195801 aagcggtcag gctcataagc tcgcgggtca ctttcaccca tgaccggcga gctttacaga 4195861 ccccagcgcc tcaaggggca ccacctcaag ggcgcagcca ccgtggcggg cgcgcaatcg 4195921 acaggtcgtt gccgaccgag cgctggtgtg ccaggaattc ggtggtcatg acggcgcaga 4195981 tggtgtgcca accgaggtcc tcgggtccgg tcgcacagca gccgtcacga tagaagccgg 4196041 taagcggatc ggtgccaccc tgttccaggg cgccgcccag cacattgcaa tcggacatgg 4196101 acctaagtgt ctaagctgcg ccagccacgc cgtcggacct atcagctaat tcggcgcgcg 4196161 tcgcggcgca ctattcccgc gcgagggtct ggccgggtcg cggaattgct tcgagcaagc 4196221 aggcggccgc cctgacgtcg gcgtccgaat acatccgggc gatcgcggta aacacctcgc 4196281 ccgcctttct cagctcttcc tgcgccgctt gattcagcgc cagcagcccg gtggccgccg 4196341 tcgtgaacgc cgttaccgcc cacgccgaca cctcctcggc cccggcgggc aatagcgagc 4196401 tcagcgagac ccacgccacc gcaccggcct gtagcccttg gaatgcgttg ttgacgacct 4196461 gcgatccgat gtcggcaacg gccggatcga atgacatgga ctgcatgtgt ctctccctag 4196521 attgcgcggg ctcgggcccc aacgacgaga tctaagcgag gaattcagtt gtcggtagcg 4196581 atagtagtaa taggatatag tccgcgctga cgaaatagaa gacgagatat gccgtcgcac 4196641 tgaataattt gtcaccaagg gcgctgccgc cccgtgctac ccctgggcat gttgtccacc 4196701 tgcggcgcgg taggttcagc ggcgtgatac ttacgggtgc gttcttggcc gatgccgccg 4196761 cagcggtgga caacaaactc aatgtgcaag gcggcgtgct gtccagattt gcggtcggtc 4196821 ctgaccggct ggcccgattt gtgttggtgg tgttgacgca ggcggagcct gacagttcgg 4196881 accgcgacat tacggtcgag atgaggccgc cgaccgatga cgaaccgata cgcctgaatt 4196941 tcgaggcgcc cgaagcggcc gttgccgagt tccccggatt cgcattcttc gaaatccaac 4197001 tgcgcctgcc ggttaacggc cgttgggtgc tggtggtgac tggcggcacc ggagcgatat 4197061 cgcttccggt gctggtgagc gacatgcctg cgacgatagg tttttgacgc gccggtcttg 4197121 agcgacgacc cccggggctt gcagaaaggt tgtcccgtgc accagcagca tccctacaac 4197181 gcagctggat tcggctgacc gtgctgacac ccaacccagc ggtaggttcg gcagcgtgat 4197241 agtcggggcc ttcctcgccg aagcggcctc ggtggtggac aacaagctca atgtctccgg 4197301 cggcgtgctg taccgatttg cggtggatcc ggaccggtcg gcccagtttc tgctggtggt 4197361 gttgacccag gccgagaccg atgatccgga tcggcgggtc gacgtagagg tttggcctcc 4197421 gacgggcgac gacgcgcacc acatcgagtt cgagctaccc gaggccgccg tcgccgccga 4197481 ggtcggattc gccatcttcc ggatcgaggt aaacctgccc gtcgacggcc gttgggtgct 4197541 ggtggtaacc ggcggcgccg gaacgatctc gctgccgctg atcgtgacgg ggtgaggcgt 4197601 aggcccctgc cgacggagct gccagcccta ttgatcgaat gggagcagga cgccgaggcc 4197661 gaatggcgat ccggacggga acagacgccg tgccttagcg gcgaactgtg ggacctgctc 4197721 gcccagcgca tctagcaggc tgtgcggcgt gaacggcggg tcgatgtagg cgtcgaccat 4197781 cccgaggatc actgctttcg ttgcctcttc gtacaaatcc aactgatcga ggagaaagtc 4197841 gtcgggatgc aaagctttga tctgataggg ctttagcgcg tcatcaggga agtgcttgag 4197901 gtttgtcgtg actatcacct ccgcgcgctc tcggaccgct gcagctagca catgtcgatc 4197961 tttgtaatgg ttgttcatgg cggcgatgag gtcgttgtac ccgaaagcga atgcggtagt 4198021 cagcccgttc ggtgctgatg ttgaggcggt cgaccatggt tcgccgagtc tcggccagga 4198081 tgtcctccga ccacagaggc cgataggtgc cctcgtcagc gaaccgcaac agggcatcaa 4198141 ccagcgggtg tggcacgagc acgcacgcgt ccagtactac ggggaacggc atgctcggcc 4198201 tcctctactt cttttctgca agcgccgcct ggagctcacc aagggcgtcg cggctcaact 4198261 cgcctagtgc tgcacggcga ttcgaccggg tttcttgctg atattcgagc agcgcgtcaa 4198321 ggctcactcg gcggtggcgg cccggcttct caaatgggat tcgaccatcc tccaagagcc 4198381 gaacgagggt cgggcgtgag atgttcaata ggtcggcggc ttcttgggtg gttagtttga 4198441 ggtggcgtgg caccaatgaa atgcctttgc cttgcgacaa ggccagcacg acgttgtaca 4198501 gcgcatctct gactggttca ggaagcgtca tcggttgtcc ggcgttgcca cacacggaaa 4198561 cttcaggcgc gccaagcacc tccagcaagg aggtcatgtc ctgcgggtcg cgggggtgga 4198621 agtactgtcc gttcctggac tgcggctgtc atgcagctta gcgtaattcg aacaaaacga 4198681 aacgtcgagt ctctgaccag gcatttacgc aagctactgc gccgctaacc gcgccgggtc 4198741 gcgcacttgg ccgcctcaaa cgccgcctag cacggtgacg tcgagcccgg cggagcgcac 4198801 cagctgatca gagctggaaa ccggcgcgcg tctgccgcgg ccgaagccta cacgcgggcg 4198861 gatctgcggc gcggtgaagc gcgcaaaggt ccagcagatc actccgcacg atttgcggca 4198921 caccgcggcc agcttggcgg tgtcggccgg cgtcaacgtt ttggcgctgc aacggattct 4198981 cgggcacaag tccgcgaagg tcaccctgga cacgtatgcg gatctcttcg atgccgatct 4199041 tgatgcagtc gccgtcactc tcgggaaaga tgccgaccag caaacctgaa aataccctgc 4199101 tgaactgcac taacagtcaa agggatttgg cggtggcgga gggatttgaa ccctcggacg 4199161 gtgttagccg tcacacgctt tcgaggcgtg ctccttaggc cgctcggaca cgccaccgcg 4199221 gtgaagctta ccgaatcggc gcaccctcac cccaatcgct ggcgggcgaa gaaggcctcc 4199281 agcggcgcgg cgcactcccg cgcgagcaca ccgccgcgta cctccgggcg gtgattgagc 4199341 cgacgatcac ggaccacgtc ccacaacgag ccgaccgccc cggtcttggg ctcccaggca 4199401 ccgaagacca gccgcgcgac gcgggccagc accagggcac cggcacacat agtgcacggt 4199461 tcgacggtga ccgccaaggt ggtcccctcc agccgccacc cgtcgccgag cacaccggcc 4199521 gccaaccgca tcgccaggat ttccgcgtgc gcggtgggat cgccgagcgc ctcgcgggca 4199581 ttcaccgccc gggcgagttc ggttccgtcg gcgccgacga ccaccgcgcc caccggcacg 4199641 tcgcgcggac ccgccgtcgc cgcgaccgcc aacgccgcac ggatcagatc ttcgtcagtg 4199701 gtcaccgccc gcgcttgcgg tcaccgacct aggcggtcga tcaccgccga cagctggtca 4199761 gcgaagccca tttcgcgggc gatgcggccc agctgttcgt cggcgtaaag gtcggtctcg 4199821 tcgaggatga ctcccagaac cgcctcgggc aggccgatgt cggacagcag gcccaggtcg 4199881 ccttcctcga acggatcggc atcctcgagg tcttcgggat cgatctcggc gtccagattg 4199941 tccaggacct ccgcggcgat gtcgtagtcc agcgcggcgg tggcgtcgga cagcaacagc 4200001 cgagttcccg agggcgccgg gcgcacaatg acgaaaaatt cgtcgtcgac gtcgagtagc 4200061 ccgaagacgg ctcccgcgct acgcagctca cgcagttccg tctcggcagc ccgcagactg 4200121 gtcaacgctt tggggcccat cggagagcag cgccagcggc cctcttcacg cacaaccgca 4200181 acaccgaaac cgtccggtgt gtccgcggcc ggtctttgca tggaggcccg ttgtgctccc 4200241 atgggcgcct acggtagtcg ctgaccaggc ctcctgacca gatggtgctc agacagcgga 4200301 gatctggtcg cccctcaggg cgccgccacg ggctacctat gccaaccttg gactgtgact 4200361 cggactgtcg cggcgccacc ggtgtgcgtg cttgggctgg gactcatcgg cggttccatc 4200421 atgcgggccg ccgcagcggc gggccgtgaa gtctttggct acaaccggtc ggtggagggt 4200481 gcccacggcg cccgctccga cgggtttgat gccataaccg atctcaacca aacgctaacc 4200541 cgggccgccg ctaccgaggc gttgatcgtg ctggccgttc cgatgccggc cttgccaggc 4200601 atgctcgccc atattcgcaa atcggcacct ggctgtccgt tgaccgacgt caccagcgtc 4200661 aaatgcgcgg ttctcgacga ggtcacggcg gctggtctgc aggcgcgcta cgtcggcggt 4200721 cacccgatga cgggcaccgc gcactcgggt tggaccgccg gtcacggcgg cttgttcaac 4200781 agagccccct gggtggtcag cgtcgatgac catgtcgacc ccacggtgtg gtcgatggtg 4200841 atgacgctgg cgctggactg cggggcgatg gtggtgcccg ccaaatccga cgagcacgac 4200901 gccgccgctg ctgccgtctc gcacctgcca cacctgctcg ctgaggcgct cgccgtcact 4200961 gcggccgagg taccacttgc cttcgcgttg gctgcagggt ctttccgcga tgccacccgg 4201021 gtggcagcca ccgctcctga cctagtgcgg gcaatgtgtg aagctaacac cggccaactg 4201081 gcgccggccg cggaccggat catcgacctg ctgagccgtg cgcgtgattc gctgcaatcc 4201141 cacggttcga tagccgacct cgccgacgcg ggccacgccg cacgcacacg ctatgacagc 4201201 ttcccgcgct ccgacatcgt caccgtcgtt attggcgcgg acaaatggcg cgagcaactg 4201261 gccgccgcgg ggcgggcggg cggggtgatt acatccgctc tgccaagcct ggatagtcca 4201321 caatgaaccc gtcggagtcg acggtcacgg tggtgtcagc caccggtgag cgcagcttga 4201381 tcccatccag acgtccttcg ctggtatagc tcacggtggc cgcatcgacg ctcatctcgg 4201441 gcacgtttac atagaccacc ggcagcgcga tcgattccgc tcgttcgtgc agcccaaggc 4201501 gacgaatcgg caacgcattg aagaatggac tgaacaccaa atcgatgtcc aatgcaccgt 4201561 tgtatgctgc gcgccgttca ccctggtggt cagtcaccaa ccacatgttc tcctcgtcgc 4201621 gggcgatggc gagctggcgt tcccgctcgg ctagtgtgac cgtcagcccg aaccgtttgg 4201681 tggcaccggt ttcgtcggtc tgcagatcgt agtgcgcgcc aaacgccgga ttattcgcgg 4201741 tagccgcggc cacaatgcgg ccgttcgccc taatccgctt gccggacaac tggactcgta 4201801 ccgattccat gcgcgagatg tcctgcgcac gccaggtcaa catggccggc cagacgcgcg 4201861 gagtcagatc agaggggact gcgttcacac tgtctaccgt agggcgtgtc caccgcctgc 4201921 ggcaggtttg tcgacaaccg cggcgagctt gcgcatcctc ccggtgcccg gcaccgatac 4201981 ccaccccgcc aacgccagca gtccgtcgag ggtcaacgcc aacgcggcca ccatcatcgc 4202041 accgaccaga gcgatgtgga atcgacgctc cttgatcccg tcgatcaagt agccacccag 4202101 ccccccgaga ctggcgtagg cggccaccgt cgcggtggcg accacttgca gcgtcgcgct 4202161 gcgtagtccg ccgagcatca gcggtagtgc attgggtacc tcgacgcgca gcagcacctg 4202221 ggactcggtc atgcccatcg cccgggcggc atcgaccacc agcggatcaa cactggcaat 4202281 gccggcgtac gtgctggcca gcaaagacgg gatacccaac agcatcagcg ccaccagcgg 4202341 cggccccaat cccagcccga atagcagcac ccctagcagc agaacaccca acgtgggcaa 4202401 agcgcgcaaa ccattgaccg cacccaccac cagcagcgtc ccgcgaccgg tgtgcccgat 4202461 aagcagcccg actggcacgg cgatcagtgc tgaagcggcc accgccaccg cggtgtattc 4202521 caggtgctca cacgtgcgga ctgccaagcc gactggaccg gtccagttac tggcggttag 4202581 caggtaggac agcgcctgct gcaggaaatt catcgcgctc cgcccgtgat cggggccgcg 4202641 acctggcggc gccgacgggc tgcccgcggc gcccgttccc atggcgtggc cagccgaccg 4202701 gcgaggttga tcaccacgtc gacgacaatc gccagcagga acatcgctac gacgccggca 4202761 acgatctggt cactcttgtt ggtctgatac cccgcggtga accaggttcc caggcccccg 4202821 attcctatca ccgaacccac ggacaccatc gcgatgttgg taaccgcgac cacccgcagc 4202881 ccggctacca gcacggggat agacagcggc agttcgactt tcaacatctg agcgatccgc 4202941 gaatagccga tggcggtggc cgcgtcatgc acctgcgccg gcaccgcgtc cagcgcttcg 4203001 agcaccgccc gcaccagcag ggccgtggtg taggccgcca acgccacaat gacattggcc 4203061 tcgtcgagga tccgggttcc gatgatcagc ggcaacacca cgaatagcgc tagcgacggg 4203121 atggtgaata taacgctggc ggtcgccgtc gtcagccggc gaagcagcgg cgcgcgctgc 4203181 accagcaggc ccaacggcac cgcgctcatc agcccgatca gcaccggcag caacgagagg 4203241 cgcagatgga cgacggtcag cgcccaggcc gctcccgggt gggtcatcag gtagtgcatg 4203301 gcttagctcc gccgccggcc ttcttgcctt tttggaactc ggccagcacg tcggcggcca 4203361 gtatcccgcc gatgaccttg ccaccgccgt caacggcgac accgaccccc gacggcgagg 4203421 acaaggcggc gtccagcgcc tggctgaggt taccgttcgg gcggaacacc gaaccgccga 4203481 cggtcatggc atccgacaat gccgcgccgc cgcggtgacg ccgccggcca tcggcgtcga 4203541 tccagcccaa cggcgcaccc gcaccgtcga ccaccagcac ccagccgtca cgaacttgcc 4203601 tgtcccgggc atcggaaagg ccgttcaccg agacttgctc gatgtcgcgc acaggtagtc 4203661 cggccgcgtc gaacagctgc agccaccgat agccgcgacc gagaccgatg aacttcgaca 4203721 cgaagtcatt cgccggactg gataacagcc gggcagtttc gtcgtactgc gcaagcgcgc 4203781 cgcccggggc gaacaccgcc accagatcgg cgagcttcaa cgcctcgtcg atgtcgtgcg 4203841 tcacgaagac aatggtcttg tgcaactcgg cttgcagacg aagtatttcg ttctgtagct 4203901 cgtggcgaac caccgggtcg acggccgaga acggctcgtc catcaacaag atcggcggat 4203961 cggccgcgag tgcccgtgcc acgccgaccc gttgctgttc gccgcccgag agctgggccg 4204021 ggtagcgggt ggcgaccttg gggtccagcc cgacacgctc aagcacctca taaccggctt 4204081 tgcgggctgc ccggcgcggc tgacccttca gcaccggcac cgttgcgacg ttgtcgatga 4204141 cccgttgatg aggcatcagc cccgcgttct ggatgacata gccaattccc aggcgcagct 4204201 tcaccgcatt gaccgtcgac acgtcggtac cgtcgacagt gatggtgccc gaggtcggat 4204261 ccaccattcg gttgatcatt cgcagcgccg tcgtcttgcc gcagccggag gggccgacga 4204321 agacggtcag catgccgtta gggacttcca gcgtcagccg gtctacggcg gtggcaccgt 4204381 gtgcgtacac cttgctgaca tcgtcaaagc agatcaacgt ggtgcctact gccgcactgg 4204441 atgatcgaaa ccgttgtccc gcacccattt ccgcgcggcc tggtcggggt ccaccccgga 4204501 gttgccggac accgctgcat tgagctcggc caggccggca gtggtcagct ttgccgacac 4204561 cgcgtccagc acatctttga ggtgatccga cttctttcgc gaattcacaa gcggcacaat 4204621 gtttccggct aggaagttat gttcgggatc ttccagcacc accaggtggt tttgcgggat 4204681 agccgcagag gtgctgaaga ggttggcggc tgtggccgtt ccctccacca gtgctcgcac 4204741 ggtcaccgca ccgccgccgt cgttgatggt cacgaagttg cccggcgcga tgtcgagtga 4204801 gtatttgtgc cgcagcccgg gcaacccgga cggccgggtc tgaaaggccg acggcgccgc 4204861 gaacttcaca tccgcggaat gcggggccag gtcggcgatc gttttcaggt tccaccgggc 4204921 ggcggtagcg gcggtgacgg tgacggtgtc agtgtcagag gccggcgacg gcgtcaggat 4204981 cgacagatcg ccgggaagtc gcttgtagag ctccaactca acggcatcga gcatggtcac 4205041 cgtggcgtcg ggttgaaagt acagcagcaa gttgccgata tactccggca ccaggtcgat 4205101 ggaatgatct ttgagcgcca ggatatacgt ctctcgactg ccaattccca accgccgccc 4205161 cacgtcgaaa ccgttggcct gcaacacttg tgcgtagatt tcggcgatca cctgcgattc 4205221 cggaaaatca ccggacccga cgacgatgga cttcacactg ccggtcgctg acccgagcgg 4205281 atcagcattg gcgcaggacg caaccaggca caccgtcgcg agccacacag ccgcagcgac 4205341 agttgcgcga cgtaggcgtc gcagcatcct catgcagttg acactatcgt cagcggcggc 4205401 gccgtgcttc cacaactcgg catgtactgg gatttttccg gcgtggtttg gtttcattct 4205461 gtgtgggata ggacaaaaat ggtgtcatga ccagcaatcc ctcttcctcg gctgatcaac 4205521 cactcagcgg tacaacggtg cctggctcgg tgcccggtaa ggcaccggaa gagccacccg 4205581 tcaagttcac ccgcgccgcc gccgtatggt cggcgctgat cgtcggcttt ctgatcctca 4205641 tcctgttgct gatattcatc gcccagaaca ccgcctcggc ccaatttgcg ttcttcggct 4205701 ggcgctggag cctgccacta ggggtggcta tcttgctggc ggccgtgggc ggcgggctga 4205761 tcaccgtctt cgccggcacc gcgcggatcc ttcagttgcg acgtgcggcc aaaaagaccc 4205821 acgcggccgc ccttcgctaa ctgggcatcc ccgacgcggg attacccgct cttcttggca 4205881 atctctgcca gaccgcgagc gatcagcggc gcaacaacgt caggcaccga ctcagccgcg 4205941 gtgtccttcc cctcggcctg ctcggacatg cgtcggcggt agtcgatgcc ggcggcgatg 4206001 atggcgagct tgaaataggc caaggccatg tagaactccc agtggcctag cggctgcccg 4206061 gagacgagtg aataccgatc ggccagctcg tcggctgctg gcagcagcgg cgaagtccac 4206121 gctgcctgcg catgcacaat taagtccagc gcggggtcgc ggtatacgca catcagggcc 4206181 gcgtcggaca gcggatcccc cagggtggag agctcccagt ccaccaccgc gcgaacatgg 4206241 catgggtcat cggtgtccaa gatcgtgttg tcgatccggt agtcgccgtg cacgatcgat 4206301 gtgcggctct gttgtggaat ggcttgctgc agggctaaat gcagtcgcga aatgtcggcg 4206361 tcgcggtggt cgtcgggcag ccgcaccagc tcccattgtg acccccaccg gcgcacctgc 4206421 cgttccagat agccgtcggg tttgccgaaa tcgctcagtc cgacggcctt cgggtcgatg 4206481 ctatgcaagt cgacgagtac ccggatcaag gcgtcgacac agccctcgat gaccgaacgg 4206541 ctgccgagcg cttcgagttc ggcgcgccgg cgcaccactt gcccggcaac gaattcgaca 4206601 acctggaacg gcgcgcccag caccgagtcg tcctggcaca gcgagatcgt gcgcgccacc 4206661 ggaaccggtg tgtctcccag cgcggcgacc accctgtact cgcgggccat gtcgtgcgcc 4206721 gacggtgtca gcccgtgcag gggcggacgg cgcaccaacc agctcgacgc gtcatcatag 4206781 acccggaagg tcagattgga gcgtccaccg gagatcagct cgccacgcaa ctcgccgtcg 4206841 cgcccgatcc ccagcgaacg cagataccgg tccagcgcgc ccagatcgag cccgtcgagt 4206901 cggtcaaccg aagtcaccga acttgtttac cactcgcgca atgcccggct ttagctcagg 4206961 ccgccttcga ctcggcgccg agcggtaccg ccgaactacg gcgtcacgat gttgaaggcc 4207021 gaatcgggcc ggtcgaggac gctcaagaat gtctgcagca ccgtccggtc gccgaacacc 4207081 tcgaaaccgg gtgagctgat atcgcccagc gccgcggcga ccaaccgaac cttgtcgccc 4207141 accgtcaccg tcgcgttcgc cgtcgccgga tcggcgggaa gcttgcgatg tatcaacacg 4207201 ccgttgcgca gcgtgagccg atagttgaca tccggctcgg tgaaggtgaa atcgatggcc 4207261 aggtcgaggt cccatgcgcg tgggccattg atgctgatcg ccaggacgtc aaagatttgg 4207321 tccggcgtca gctgggcgaa aaacgtgggc gccgggactt gcccggagct gcccgggttc 4207381 ccgtcgcgca gctcggcggc cccggtcaga aagaaattgc gccaggtcgc acactccgcg 4207441 ccgtaggcca gctgctccag ggtgtcggca tagagcccgc gggccgcagc gtgctcgctg 4207501 tcggcgaaca ccgcatggtc gagaagcgtt gccgcccaac ggaaatcacc tgcgtcgaag 4207561 gcttcgcggg ccagctccag cactcggtcg atgccaccca acgcgtcgac ataacgcggc 4207621 gccagcgcct cgggcggatg cggccacaac cagcccgggt taccgtcaaa ccagcccatg 4207681 taacgctgat agatcgcctt cacgttatgg ctgaccgacc cgtagtagcc gtgggtgtgc 4207741 catgcccgct gcagcgccgg tggcagctgg aacatctcgg cgatctccac accggtgtag 4207801 ccctggttca gcagccgcag cgtctgatcg tgcagatatg aatacatgtc gcgctgttgc 4207861 gacaagaact cgacgatctt ctcgcgtccc cacgtcggcc agtggtgcga ggcgaacacc 4207921 acgtcggttc ggtcggcaaa ggtgtcaatc gcctcggtga gatagcccga ccaggcgcgc 4207981 ggatcgcgca ccaaggcgcc gcgcagggtc agcaggttgt gcaggttatg cgtggcgttt 4208041 tcggccatgc acaacgcgcg gaagcgcggg aaatagaagt gcatctccgc aggggcctcg 4208101 gtgcccgggg ccatctggaa ctcgatctcc accccgtcga tggtgtgggt ctccccggtc 4208161 tcggtgatgt cgaccgtcgg cacgacgagc gaaacctcac cggtcgacag tgtctgcccg 4208221 aggccgcagc cgacgtgccc ccggagaccg cgcgccaaca cggtgccgta catgtagccc 4208281 gcacggcgca tcatcgccga gccggcgtag atgttttcct gcacggcgtg cgcggtgaac 4208341 ccctccggcg ccagcaccgc cacctttccc gcgtccacgt cggcctgggt ggtgacgccg 4208401 agcaccccac cgaaatgatc gacatggctg tgggtgtaga tgaccgcgac cacggggcgg 4208461 tcggctccgc ggtgggcgcg atacaagtcc agcgcggcgg cggccacctc ggtggacacc 4208521 aacgggtcga tgacgatcag cccagtgtca ccctcaacga agctgatatt ggagatatcg 4208581 aatccgcgga cctgatagat gcccggcacc acctggtaga ggccctgttt cgcggtcagc 4208641 tgggattgcc gccacaggct gggatgcacc gatgtcggcg cggcaccgtc gagaaacgag 4208701 tacgcgtcgt tgtcccacac cacgcgacca tcggcagcct tgatcacaca cggggacagc 4208761 gcggcaatga atccgcgatc ggcgtcgtcg aaatccgttg tgtcatgcaa cggtaacgag 4208821 tgttcaccgt gtgccgcctg gatgacggca gtgggaggtt tgtgttccat cggcactaca 4208881 ttgccactac tacggtgcac gccggtagat gccgttggcg aaccacgcta ccgaccagaa 4208941 agagagaatt ttccgccgca cctagacctc gggccctgct aacgcgcata ctgccgaagc 4209001 ggtcctcaat gccgatggac cgctacgaca ggcaaaggag cacagggtga agcgtggact 4209061 gacggtcgcg gtagccggag ccgccattct ggtcgcaggt ctttccggat gttcaagcaa 4209121 caagtcgact acaggaagcg gtgagaccac gaccgcggca ggcacgacgg caagccccgg 4209181 cgccgcctcc gggccgaagg tcgtcatcga cggtaaggac cagaacgtca ccggctccgt 4209241 ggtgtgcaca accgcggccg gcaatgtcaa catcgcgatc ggcggggcgg cgaccggcat 4209301 tgccgccgtg ctcaccgacg gcaaccctcc ggaggtgaag tccgttgggc tcggtaacgt 4209361 caacggcgtc acgctgggat acacgtcggg caccggacag ggtaacgcct cggcaaccaa 4209421 ggacggcagc cactacaaga tcactgggac cgctaccggg gtcgacatgg ccaacccgat 4209481 gtcaccggtg aacaagtcgt tcgaaatcga ggtgacctgt tcctaaccta aagcgtgtcg 4209541 atgcgggctg tgaacagcgc gtcggagccg ggcagtcagg cctagcgcgg cgacgattcg 4209601 agcggttgcc atccgtcaag tggcaaccgc accgcaaact cggtatatcc gggtgagcta 4209661 ctcacggtga tcgttccgtt gtgcgccttg accacagcgg agacgatcgc caggccgagc 4209721 ccggtgctac cggcttggcg ggaccgtgac gtatcgccgc gggcgaaccg ctcgaaaacc 4209781 tcggactgca gcgcggccgg aatacccggc ccattgtcga tcacctgcag cacgacgtgc 4209841 gtcggcccgg tgctcaagcg cgtcgtcacg atcgtgccgg gaccggtgtg cacgcgggcg 4209901 ttggccagca ggttggtcac cacctggtgc aaccgtgccg catcacccgg gatgaccacc 4209961 ggttcggggg gcaggtcgag cgcccactgg tgatctggtc cggcaacatg agcgtcgctg 4210021 accgcgtcaa ccgcaagccg cgacatgtcc accggtccgc gttccagcgg ccgccccgag 4210081 tccagacgcg ccagcagcag caggtcctcg acgagacgtg ttatccgctc ggtctccgat 4210141 gccacccggc tcatcgcgtg tgcgacggcc tcgggatcgt cccctatccg ctgcgtcaat 4210201 tccgtgtaac cacggatcgc cgcaagggga gttcgcagtt catgactggc atcggcaacg 4210261 aactggcgca cacaggtttc actggcctgc cgcgccgaca gtgcggcagc gatgtggtcg 4210321 agcatccggt tgagcgccga cccgagttgc cccacctcgg tggaggggtt tgcgtcaggt 4210381 tcgggcaccc ggaccggtag cttgacctcg ccgcgatcca acggtaggtc gacgacttcg 4210441 ctcgcggttt gcgcgacgcg ccgcaacggc gccagcgccc gcttgatgat gacgattccg 4210501 gcggtcgtcg cggcgaccaa cgcaatcacc gtgacgattc cgaaaatgat cagcatctgc 4210561 aacatcgtgg cgtcgacgtt gcccatcgac aggccggtga cgatgacgtc gtgcccgttt 4210621 cggctcggag cggccagcac acggtaccgg cccagaccgt cgagatccag ggtcagcggt 4210681 gtgcggctgc cggcgatccg ttccagctgg gaccggccgg ttgacgtcaa cgccgcccgc 4210741 gaaccactgc cggtcagata tccggcggcg accgtcgtgc cgtcgctgac caccgccgcc 4210801 accatcccgg ccggctggcc cggagcatcg agaaacctcg gaccggggcc cgaccggatg 4210861 tagttgtgcg tctcgtgccg ccagggcgga cggggcattt tctccggata catcaacacc 4210921 gagcggtacg acgttccgcc gagttggttg tcaagttgtg ccaccagatg acgacgcagc 4210981 gccatttcgg ttgccgcggt gattcccaca cacaccacgg cgaggacgac aacctgtccg 4211041 accaggagcc gcagccgaag cgaccaaatt cgcggactgc tagcgggccg gcttgagcac 4211101 atagccggcg ccgcgcagcg tgtgaatcat gggttcgcga ccgttgtcga tctttttgcg 4211161 caggtacgag atgtacagct ccacgatatt ggaccggccg ccgaagtcgt aactccagac 4211221 gcggtccaga atctgggctt tgctcagcac ccgcttggag ttgtgcatca tgaaccgcag 4211281 cagctcgaac tcggtggacg tcaacgacac cggttcgccg gcgcgcatca cctcgtggct 4211341 gtcttcgtcc agcaccaagt ctccgaccac tagctgggca ccgctgtcga ctgtcgtcac 4211401 ccccgtgcga cgcagtaacg cccgcagccg aagcacgacc tcctcgatgc taaacggctt 4211461 ggtgacgtag tcgtcgcccc ccgcggtcaa cccagctata cgatcttcca ccgcgtcctt 4211521 ggccgtcagc agtagaaccg gcaggcctgg attctcgctg cgcaacttgt gcagcacgtc 4211581 aagaccgctc atgtcaggca acatcacgtc gagcacaacc acatcgggcc gctggcggcg 4211641 ggccgccgca atcgccgacg atccgtcacc ggcggtggtg atgttccaac cttcataccg 4211701 caatgccatg gacaccatct cggccagaac gggttcgtcg tcgaccacca gcacagtgac 4211761 cggttggcca tcggcgcgcc gcattacgac acgctcaacc gagatgcggt gctgcgtcac 4211821 agcgtcaagt atccgcacac ggctgagcag acgccatgcg gatcctatgt gcgcgctatg 4211881 aaacccgatt tggggcacgt tcggagcctg ccagcgggcc ggatccgggc ggtaccccac 4211941 tcacgtcggc gcgcatgttg gtaccagtag cggctgctgg cgaccgggct gctgaagcaa 4212001 atcccgctgc cacgcttgag gcagcgtccc ggaccaacgc caattggtcg ctctccgtcg 4212061 ccgttgtgga agtcgccgac ccggacagtt cgatcagaca tagccaagga tcggtagcat 4212121 gacgatacgc attccgatag cggggaattg aggtgccgtg acagacactt tgttcgcaga 4212181 tgtctccgaa tatcaagtgc ccgtgaataa ctcgtatccc taccgagtgc tgtcgatccg 4212241 cgtctgcgac ggcacctatc gggatcgtaa tttcgcgcac aactaccgat ggatgcgctc 4212301 ggcattcgac agcgggcgac tcacattcgg aatcgtctac acctacgccc gtccgaattg 4212361 gtgggccaat gccaacaccg tgcgctcgat gatcgacgca gcgggcggct tgcatccccg 4212421 ggtcgcgctg atgctggatg tcgaatcagg cgggaacccg cccggtgacg ggtcgagctg 4212481 gatcaaccgg ctgtactgga acctggcaga ctacgccggc tcgcccgtgc gaatcatcgg 4212541 ttatgccaac gcctacgact tcttcaacat gtggcgtgtt cgcccggcgg gcctgcgcgt 4212601 cattggcgcg ggttatggtt ccaatccgaa ccttcccgga caagtggcgc accagtacac 4212661 cgacggcagt gggtatagcc ccaatcttcc acagggcgct ccaccgttcg gtcgatgcga 4212721 tatgaactct gccaacggac taacaccgca acagtttgcc gccgcatgcg gcgtcacaac 4212781 gaccggagga ccgctgatgg cactcaccga cgaagaacaa accgaactac tgaccaaagt 4212841 ccgcgagata tgggaccaac tgcgcgggcc caacggcgcc gggtggcctc agctcggaca 4212901 gaacgaacag ggccaggacc tcactccggt tgacgcgata gcggtgatca agaacgacgt 4212961 ggcggccatg ctcgcggaat agcccgcgat ctccgtcagc tcgtggcccg ctgcgcggat 4213021 acgaaaaggt ttggcgggat tgagtcttcg ccactgtgag ggatgctgcg gccataccga 4213081 gccagcagct cgggcaacgt tgccgtcgac acgtcccagc cacgttcacg cagccactcg 4213141 gcgaccgcgg tgcgctgctc tgcataccag aggtcatcga catctgatat ctcagtttcg 4213201 accagcttgg ctgccgcggc ccgcatccgc cgcatgtccg cacgctggcg tcgcattcgc 4213261 tcagggtcga gaaaaccggc gccggggacg ttggacgcca accaactgcc cggcctgctg 4213321 agcgcatcga tacgctcgaa caacagatcc tgagcccgcg ccggcaggta ccgcaccaac 4213381 ccttcggcta accacgcaca cggcttcgat gggtcaaatc cggctttctg cagtgccttt 4213441 ggccagtcct gacgaaggtc tatgggaacg ttcaccagct gcgaagccgg ctgcgcgcca 4213501 tgctggcgca acgtggctga tttgaattcc agcaccttgg gctggtccag ctcgtacacc 4213561 acggtgccgt ccggccaggg cagccgccag gcacgcgagt ccaggcccga ggcgaggatc 4213621 actacttgcc tcaccccagc gtcggcggta gccaggaaat actcgtcgaa aaacgcggtc 4213681 cgggcggcca tgaaatcgat catctgctgt atcggcgccc gcaggtccgg gtcgaggtcg 4213741 gtcgcaccgg ccagcaacgt gcgattcgtg tacatgctcc atatcccgtc gccggccgcg 4213801 tccacaaaga tccgcgcgaa cggatcgttg atcaatgggt tgtcgctctc ggtctcggcc 4213861 gcacgcgccg ccgccacacc cagtgcggtg gcgcccacgc tctcggtaat ggcccaggaa 4213921 tcgttgtcgg tccgcggcac agttaatcct cccccaggcc ggaaacgtca gttttgcaaa 4213981 ctattcttcc agccgccgag gggcccgcgc gctcgtcaag agtgtcctac gctttctccc 4214041 agatggtcta caggttgcag aggagcgcga tggggtccac gccgccacgt acgccgcagg 4214101 aggtattcgc ccaccacggc caggcgctcg ccgcgggcga cctcgatgag atcgtcgccg 4214161 actacgccga cgactccttt gtcatcactc cggccggtat cgcgcgcggc aaggaaggta 4214221 ttcgccaact gttcgtcaag ttgctcgacg acataccaaa cgcactgtgg gacttaaaga 4214281 cccaaatctt cgagggcgac atactgttcc tggagtggac cgcgaattcc gcggtcagcc 4214341 gagtcgacga cggagtcgat actttcgtat tccgagacgg cacgatctgg gcgcataccg 4214401 tccggtacac cccgcacccc aagacctgac gtttcgagca ggtggcggat gtggacctcg 4214461 aggcggtcgc ctattaccga tcagaccgag gcactgttgt ctgacgcggg cggatacccc 4214521 cagggggcgc gttcctcgcc gcgcacgaag tcggtaggtt gcagccgcac tttgcggagg 4214581 aaccgcctgc tgatctgccg gataggatga gcccgtgacg acgctgaagg agcttggagc 4214641 acgggtcgcc gctctggaag cgaaccaggc cgactatcga gccgtcctcg cggccgtcaa 4214701 cccgccgggc gccaaccagc gagaaatcgc gacgaccgtc cgggaacaca ccggacgact 4214761 ggaccgcgtg acgaccaaag tcggccagct cgcggccaag tccgacgaca ccaatgcgcg 4214821 ggtgcggtct ctggaagagg gacaggccga gatcaaggac cttctgctcc gcgccctcga 4214881 caagtgattc tccgaatggc tgcgcgattt tttgagcccg gcatcgaacg gtgatctgtg 4214941 gtcggtgaat ccgcgacacg ccgtggtttc gggtcgtgcc ggatggcgtc aaatggccag 4215001 ctcagaacac ctttcgagac cacgattttc gagaccacga tcaggtgctg ttgcaggctc 4215061 tcctaaagcc gtagggcgtg tttgaaccgc accatgatgg ggtgcgcgga catcggttgg 4215121 cgatacgggc tcgaggttgc agatcctgtc cgcgctcgtg gccggcaccc gagcgaccct 4215181 gtcgaagacc gcgccttgat tacctggcgg tgagcgcgag ccgcctgacc agggccgcga 4215241 agcacagcgg cgccagcagc caggtcagct gcatggccgc tgtcaccggg tgaccacgaa 4215301 accagaacat caccgggttg aaccacaggt cgtcgacgta ggaccaggtc ccggcgagga 4215361 tcgccgacaa ctccacggcc agggccgtga gctgtcccca gagcacctgg acggtcaggc 4215421 cgagggcgcc cgcgggctcg cccgtgagcg ctcgcgccag cagccagccc atggtcagga 4215481 ggggaccatc ccacacggag tgggcgagca ggaacaccac ggtggggagc gggagcggcg 4215541 tggcccactc gatgatcggg gtgttggtcc aagcgctgag cccgaacacc ggcagctccc 4215601 agaccagacc gatgagcgtg ccgagcaaca gcatccgcgc gagctcgggt cgagtccttc 4215661 gagcgcgcag catgagcacc acgaccgcga gcgcgacgag caggtcggct acgtagtagc 4215721 catgggcgag gggatcgtta tccataagcg tgttctgttg tatgccacta agcatcgtat 4215781 ttgcctccgc gaaccttggt gagcaacagt gacgaacagt gacggcgagc cgccagttga 4215841 cccgcacgtg ggcacaacgg cgagcttccc gcaccgatgg ctacgaaccc cggccacgca 4215901 acgctatgcg gtcgccagcc agctgggcgc gcaggatccg ttggatcgcc ccagcggtac 4215961 ggtccgggtt ctcggggcgc agcgccaccg ccaactccag cacctcgacc ggggtgcgca 4216021 gcgtgcactg gcacgggttt gggcaccaag gcgtcgaacc caccggcgcg ccagtcccta 4216081 attcctaatc cagcggtcga tggtatgccg gctgatccga accttccgcc cgaacgggtc 4216141 ggtgtgctca cgggaggcca gctcgcgcac catctttccc cgctccttgg tggaatgcgc 4216201 tgcatcggcg gcctcccgga tcaactgata ccgaaacaat ccgatcgccc tcgcccgctc 4216261 cgcgcgcacc tcgccttatc atcgccgacc gccaccggcc gctcctttcc gtttggtgtc 4216321 ccgtgaacac acgacagcgc acaggattac ggcccaatcg gcggttaggg cagggtcgac 4216381 tcgtgttgca cccactcgcc gggtcacccc ggcgccacca gccgcccacc cgataccgcc 4216441 accgccgtct cggccagcga caccgtggac agcgcgaact ggcactcgat cacggtcacg 4216501 accgcggcga tcaccgtcac cgcataggcg aacaccccga cggccgcatc cggcatcacc 4216561 ggatccggat ccaccgcgcg caacatgacg gtgaacaccg accgaaccgc ctcggcacgc 4216621 tcggcaaagc gacgcagcca accccgcacc gtctcggccg ggcgagccaa atccgcggcg 4216681 atgcggcgga acccgacctg gctcaaggcc ttctccgccg gcgcgggcac agctccacca 4216741 cgtatgtgcg ctcgcagccc gacatgctgg ccgacggcgc gcagagttgg gcacgagttg 4216801 tgacaatccg tgacagcttt cccggcgcct gataccaacg gaacgcgttt gcgctagtaa 4216861 agagcgcgcc cgaagagatt cgaactccca accttctgat ccgtagtcag atgctctatc 4216921 cgttgagcta cgggcgcttg tcttcagttg tgtcccctaa aggactgcgg aggcgagagg 4216981 atttgaacct ccggtcccct tgaaggggga caactcatta gcagtgagcc ccattcggcc 4217041 gctctggcac gcctccatgg acttcccgag agtacccgga ctccccgagc cgccggaggc 4217101 ctagcgtaca cagccgccac atatgctgtc gacgtgaccg cccgcctgcg acccgagctg 4217161 gctgggctgc cggtttatgt gcccggcaaa acggtgccgg gcgccatcaa gctggccagc 4217221 aacgaaaccg tgttcggccc gctgcccagc gtccgtgccg ccatcgaccg ggctaccgac 4217281 acggtcaacc gctaccccga caacggctgc gtgcagctca aggccgcgct ggcccggcat 4217341 cttggcccgg acttcgctcc cgagcacgtc gccgtcggtt gcggctcggt cagcctctgc 4217401 cagcaactcg ttcaggtcac cgcctcggtt ggtgacgaag tggtcttcgg ctggcgcagc 4217461 tttgagctct atccaccaca ggtccgggtc gccggcgcta tccccatcca ggtgccgttg 4217521 accgaccaca cgttcgacct ctacgccatg ctcgccacgg tcaccgaccg cacccggctg 4217581 atcttcgtgt gcaaccccaa caatccgacc tccaccgtcg tcggtccgga cgcgctggcc 4217641 cgcttcgtcg aggcggttcc ggcgcacatc ctgatcgcca tcgacgaggc gtatgtggag 4217701 tacatccggg acggcatgcg gcccgacagc ttaggcctgg ttcgcgcaca caacaatgtc 4217761 gttgtgctgc gtacgttttc gaaagcgtac ggcctggcgg ggttgcggat cggctacgcg 4217821 atcggccacc ccgacgtcat aaccgcgctg gacaaggtct acgtgccatt taccgtgtcg 4217881 agtatcgggc aggccgcggc catcgcgtcc ctggacgccg ccgacgagct gctggcccgt 4217941 accgacaccg tggttgccga gcgcgcccgc gtcagcgccg agttgcgtgc tgccgggttc 4218001 acgctgccgc catcgcaggc caactttgtc tggcttccgc tgggatcccg cacccaagac 4218061 ttcgtggagc aggccgccga tgcacgcatc gtggtccgcc cgtacggcac ggatggcgtt 4218121 cgggtcaccg tcgccgcacc agaggagaac gacgcgttcc tgcggttcgc ccgccgctgg 4218181 cggagcgacc aatgagcgtg gcccgtaaga aaattcgacg cccacgctcg agcgtcacgg 4218241 ctatctggcc gggttgcggc cggtgaacgc gatcagccgc tccagcgccc cgccatcttc 4218301 cggcacgtcg accggttcat tgaaaccggc cacactacgt tcctccggct tgatgagctt 4218361 tcgtgccagc tctaggacgt attcggccaa cgaatcggca gccttcagct cactcccgac 4218421 ggcgaccgcg taatcccagg cgtgcaccag aaattcgacc gagaagaccg agacggcaac 4218481 cttggccgac atcgagccgg gacccagcga tacgtctcct tccagaccgt gacggtgcca 4218541 ggcgtccagg gccgaacggg cggcgccgct caccaggcgc tccacagagt caatgtccgc 4218601 acgcagtgag aattccgcgc cgaccatgcc gccgaggacc atgattgagt tgagcaaatg 4218661 ctcggttagt tttttcacgt cgtaccccgg gcacggtgtc tgcttggcct tgtcctggcg 4218721 gccgatggtg tgcagcactt gctgcagcac ctgcagcgcg gcttccgcgc acgccagctc 4218781 gtcggtcggt ggggaatctg gtccgggtcg cgattcaggc ggcatactgg ccacgctacg 4218841 gtctgggcat gggcgaaacc tacgaatccg tcaccgtcga aaccaaggac caggtcgcgc 4218901 aggtgacgct gatcgggccg ggcaagggca acgcgatggg gcccgcattc tggtcggaga 4218961 tgcccgaggt gttccatgcc ctggacgccg accgtgaggt gcgggccatc gtcatcaccg 4219021 gatcgggcaa gaacttcagc tacggcctgg acgtaccggc catgggcgga atgttcgccc 4219081 cgttgatcgc cgacggcgcg ctggcccgcc cacgcacgga cttccacacc gaaatactgc 4219141 gcatgcagaa ggcgatcaac gccgtcgccg actgccgcac ccccacgatc gcggccgtcc 4219201 agggttggtg catcggcggc gccgtcgacc tgatctccgc ggtcgacatc cggtatgcca 4219261 gcgccgacgc gaagttctcg gtgcgcgagg tcaagctagc gattgttgcc gacatgggca 4219321 gcctggcgcg ccttccacta atcctgagcg acggccatct acgagaactc gcgctgaccg 4219381 gcaaaaatat cgacgcggcc cgcgccgaga agatcggcct ggtcaacgac gtctacgatg 4219441 acgccgacca gacgctggcc gcggcccacg cgactgccgc cgagatcgcc gccaacccac 4219501 ctttggcggt ctacggcatc aaggacgttc tcgaccaaca acgcacgtcc gccgtctcgg 4219561 agaacctgcg ctatgtcgcc gcctggaacg ccgcgtttct gccgtccaag gacctcaccg 4219621 aaggtatttc cgcgacgttc gccaagcgcc cgccccagtt caccggcgag tagacccggc 4219681 gaccatgcgc gctggcgacg gcaagatccg tgtcccggcc gacctagacg ccgtcacggc 4219741 aaccggcgaa gaggaccact ccgaaatcga cggtgcggcc gtcgaccgga tctggcgggc 4219801 cgcacgccat tggtatcggg ccggtatgca tcccgcgatc cagttgtgca ttcggcacca 4219861 tgggcgggtc gtgctcaacc gcgcgatcgg gcacggctgg ggcaacgccc ccaccgatga 4219921 ggccgatgcc gagaagatcc cggtgacgac tgacaccccg ttctgcgtgt actcggcggc 4219981 caaggcgatc acggcgaccg ttgtacacat gctcgtcgag cgcggacact tcgcgctcga 4220041 cgaccgcgtc tgcgagtacc tgccctccta caccagtcat ggcaagcacc gcaccacgat 4220101 ccggcacgtg ctgacccaca gcgcaggcgt cccgtttccc accgggcccc gacccgacgt 4220161 cagacgcgcg gacgaccatg aatacgcggt ggaaaggctc ggcgaactac ggccgctata 4220221 tcggcccgga ctggtacaca tctaccacgc gctgacctgg ggtccgttga tgcgtgagat 4220281 cgtctacgcg gccaccggca aggaaatccg cgagatcctg gccaccgaga tcctcgaccc 4220341 gctgggcttt cggtggacca acttcggcgt cgccgagcgc gatgtgccgc tggtcgcgcc 4220401 cagtcacgcc accgggcggc agctgccgcc ggtgatcgcc gcggtgttcc gcaaggcgat 4220461 cggcggaacc gtgcacgaga tcatccccta tacgaacacc ccgttcttcc tcagcaccat 4220521 cctcccgtcg tccaacactg tgtcaacggc caacgagctg tcccgcttta tggaaatcct 4220581 gcgccgcggt ggcgaactcg acggtgttcg tgtactgagt cccgagacgc tgcgcggcgc 4220641 ggtgacggaa tgccggcgct tgcgaccgga cttcgccacc gggctgatgc cgcttcgctg 4220701 gggcaccggg ttcatgctgg ggtccgccaa gtacgggccg ttcgggcgca acgcgccggc 4220761 ggcattcggc catctcggtc tggtcaacat tgcggtttgg gccgaccccg aacgagctct 4220821 gtcgggcggt ttgatcagta gcggcaaacc cggtagggac cccgaggctg ggcgctacgg 4220881 cgccctgctg aacgccatta ccgccgaaat accacgggca tcgtcgggct gatctgccca 4220941 cgagcacgcc acgccgccct aaccgagccg gacggctttg tcgtgccggt cacatgtcgg 4221001 cctgttgcct tatgtcaaga tgcgccgccg tacgcgcgca ttatcaacga gtcaacgtgg 4221061 tcggtgcaga cctgctatac tcgaacgtat gttcgagata tcgttgtcgg acccggtgga 4221121 gctgcgcgat gccgacgatg ccgcgctgct tgccgcaatc gaggactgcg cgcgtgccga 4221181 ggtggccgcc ggcgcccgcc gcctgtcagc gatcgccgaa ctcaccagcc ggcgcaccgg 4221241 caatgaccag cgggccgact gggcgtgcga cggctgggac tgcgcggccg ccgaggtggc 4221301 cgccgcactg accgtaagcc accgtaaggc ctccgggcag atgcatctga gcctcaccct 4221361 aaaccgactg ccccaggtgg cggcgttgtt tttggccggg cagctcagcg cgcggctggt 4221421 gtcgatcatc gcctggcgca cctacctggt tcgcgacccc gaagcgctga gtctgctcga 4221481 tgccgccctc gccaaacacg ccacagcgtg gggtccgctg tcggccccca aactggaaaa 4221541 ggctatcgac tcctggattg atcggtacga tcccgccgca ctgcgacgca cccgtatctc 4221601 ggcccgcagc cgcgacctgt gcatcggtga tcccgacgaa gatgccggca ccgccgcact 4221661 atggggccgg ttgtttgcca ccgacgccgc catgctggat aagcgcctca cccagctggc 4221721 ccacggcgtc tgcgacgacg atccccgaac catcgcccag cggcgcgccg atgcgctggg 4221781 cgcgctggcc gccggcgctg atcggcttac ctgcggctgc ggtaattccg actgcccatc 4221841 cagtgccggc aaccaccggc aggcaaccgg tgtggtcatc cacgtcgtcg ccgacgcggc 4221901 agcactaggc gctgcacctg acccacgcct atccggcccg gaacccgcgt tggcacccga 4221961 agcacccgcc accccggcgg tcaagccgcc ggccgcgctg atcagcggcg ggggtgtggt 4222021 gcccgcgcca ctgctggccg agctgatccg cggtggggcc gccctcagcc gcatgcgcca 4222081 tcccggcgat ctgcgatcgg agccgcacta ccggccgtcg gccaagctgg ccgaattcgt 4222141 ccggatccga gacatgacct gccgattccc cggctgcgac cagcccaccg aattctgcga 4222201 catcgaccac acactgccct acccactcgg gcccacccac ccgtccaacc tgaaatgcct 4222261 ctgccgcaaa caccaccttc tcaagacctt ctggaccggc tggcgtgatg tgcaactgcc 4222321 cgacggcacc atcatctgga ccgcgcccaa cggccacacc tacaccactc atcccgacag 4222381 ccgaatcttc ttacctagct ggcacaccac caccgccgca ctacccccag caccatcccc 4222441 gccagccatt ggtcccactc acaccctgct gatgccacga cggcgccgga cccgagcggc 4222501 cgagctggcc caccgcatta aacgcgaacg cgcccacgtc acccaacgca acaagccacc 4222561 cccaagcggc ggggatacag cggtggcgga gggatttgaa cccccggacg gtgttagccg 4222621 tctctcgctt tcaaggcgag tgcattaggc cgctctgcca cgccaccgct gataagggta 4222681 acgagccggt agcgtgacca tcatgcgtgc cgtcgtcgcc gaatcctcag atcgactggt 4222741 atggcaggaa gtccccgacg tgtcggctgg gccgggcgaa gtgctcatca aggttgccgc 4222801 ttccggtgtc aaccgcgccg acgtgctaca ggccgccggc aaatatccgc cgcccccggg 4222861 agtaagcgac atcatcggcc tagaggtgag cggcatcgtc gctgcggtcg gtcccggggt 4222921 taccgaatgg tctgccggac aagaggtttg cgccttgctt gccggcggcg gctatgccga 4222981 atacgttgcc gttccggccg accaggtgct gccgattccg ccgagcgtca acctggtcga 4223041 ctcagccgcc ctgcccgaag tggcgtgcac ggtgtggtcg aacctggtga tgaccgctca 4223101 tctgcggccg ggtcagctgg tgctgattca cggcggggcc agcggcatcg gcagccacgc 4223161 gatccaggtg gtccgcgccc tggcagcacg ggtggcgatc accgccggct caccggagaa 4223221 actggagctc tgtcgcgacc tgggcgccca aatcaccatc aactaccgcg acgaggattt 4223281 cgtcgcgcgg ctgaagcaag agaccgatgg tagcggcgct gacatcatcc tcgacatcat 4223341 gggagcgtcc tacctggacc gcaatatcga cgcgctggcc accgacggcc agctgatagt 4223401 cattggcatg cagggcgggg tgaaggccga gctcaacctg ggcaagctgc tcaccaagcg 4223461 ggcgcgcgtc atcggtacca cgctgcgggc ccggccggtc agcggcccgc acggcaaggc 4223521 ggccatcgcc caggcggtgg cggcctcggt ctggccgatg atcgccgcga accgggtccg 4223581 gcccgtcatc ggcacccggc tgcccatcca acaggcggca caagcgcatg aactgatgtt 4223641 gtcgggcaag acgttcggaa agattctgct gacggtatag gcgaacctcg cggccggatc 4223701 aacctagcga cgccagcgcg cgcaccagct ggtcgacttc ggccatcgtc gagtaatgcg 4223761 ccagcccgac ggtgaccgcg ccgccgacgt cgttgacgcc cagcacgtcg agcacgcgtg 4223821 agccggtgtt ggcgatcgcg agaattccgt tgtccgccag ccgctgcacc acgcggtcag 4223881 ccggcacctt gtggaccgcg aagctgacca ccggtatctg tgcttccggg cgaccgatca 4223941 gcatcaccaa tggcagcgag cgcaacgaca ccatcagata gtcgaagacc cggttcaggt 4224001 acgcgtcagc agattgcatc gacaccgcta gtcgttcgcg tctgctgccg cgagccgact 4224061 cgtcgagcgc cgccaggtac tcaatgctgg cgaccacacc agccagcaga ccaaactggt 4224121 gcacgccgat ctccaggcgc gccggcccgg tggcatacgg attggtcgaa accgatccga 4224181 aggaattcat cactgacggg tcacggaaaa ccatcgcccc aatcggcgga ccaccccagg 4224241 catgcgcatt caccgtcacc acgtcggcgt cggtttctct gatatcgagc aaccgatacg 4224301 gcgcggccgc ggaatggtcg accaccacca gtgcccccac gtcgtgcacc agtttggtca 4224361 tcgcccgcag atcggtgacc ccgcccagcg ttccggatgc ggagttgacg gcgaccagcc 4224421 tggttgactt gctgatcagg ctctcccact gccacgtcgg cagctcgccg gtctcgatgt 4224481 cgacctcggc ccacttaacc ttggcgccgt agcggtgcgc cgcccgcagc cacggagcga 4224541 tgttggcctc gtcgtcaaga cgactgacga tcacttcgta tcccagcccg gcgcgtgagg 4224601 acgacgcttc ggccagcaac gacagcagca ccgcccggtc ggcgcccagc accacgccgc 4224661 ccgggtcagc gttgaccaga tcggccaccg cttcacgggc ggcgtcgagt accgccgcgc 4224721 tacgccgcgc cgacgggtga gcacccactg tgctagcgcc cgaccggcgg aaggccgtcg 4224781 acacggtggt cgcgacggaa tcgggaatca gcattccggc cggtgcatcg aagtgcaccc 4224841 atccgtcacc cagcgatggg tgcaatccgc gcacccgggc gacgtcgtat gccatgccag 4224901 ccaccttaga actcgggtgt cctagacgtc ccagcccgcc cgggcttccc tgagccatgt 4224961 cacccggcca gccatactaa tcgagtgggc ctgtggttcg gtacgctaat cgctttgatt 4225021 ttgctgatag cgccgggggc aatggttgct cgcatcgccc agctgaggtg gccggtcgcc 4225081 atcgcggttg gcccggcgct gacatacggc gtggtggcac tcgcgatcat cccctatggc 4225141 gcgctcggaa ttccctggaa cggttggacc gcgctggccg ccttggcggt gacgtgcgct 4225201 gtagcgaccg gtttgcagct actgcttgcc cgttttcggg acctcgacgc cgaggcactt 4225261 gcggttagcc gctggcccgc ggttacggtc gccgccgggg tgctgctggg cgccctgttg 4225321 atcggatggg ccgcatatcg cggcataccg cactggcagt ccatccccag cacctgggac 4225381 gcggtctggc acgccaacac cgtacgtttc atcctggaca ccggccaggc gtcctcgact 4225441 cacatggggg agcttcgcaa cgtcgagacc catgccccgt tgtactaccc gtcggtgttc 4225501 cacgggctgg tcgcggtgtt ctgccagtta accggcgcgg cacccaccac cggctacaca 4225561 ctgagttcgc tggccgcctc ggtctggctg tttccggtca gtgcagccgt tctcacctgg 4225621 cgcgcggtgc gctcacaccc gggcgcgctg tggtcggcct cctgcgcctc ggcagagtgg 4225681 cgcgccgccg gagcggcggg caccgccgcg gcactctcgg cgtcgttcac cgcggtgccc 4225741 tacgtcgagt tcgataccgc cgctatgccc aacctggcgg cctacggcat cgcggtgccg 4225801 acgatggtgc tgatcacctc gacattgcgg caccgcgacc gcatcccggt ggccgtgcta 4225861 gcgctggtcg gcgtcttctc actgcacatt accggcggta tcgtcgtagc gctgttggtg 4225921 tcggcctggt ggcttttcga ggcactgcgg catcctgtgc gatcaaggct ggccgacctg 4225981 ttgacgctgg ccggcgtggc agcgatggcc gggttggtca tgttgccgca gttcttgagc 4226041 gtcaggcagc aggaagacat catcgccgga cacgcttttc ccacctatct cagcaagaag 4226101 cgtgggctgt tcgacgctgt tttccagcac tcccgccatc tcaacgactt cccggtccag 4226161 tacgcgctca ttgtgttggc cgccatcggc gggctcattc tgctggtcaa gaagatctgg 4226221 tggccgctgg cggtttggct gctgttgatt gtgatgaacg tcgacgcggg aacaccgttg 4226281 ggcggaccta tcggaggggt ggccggcgca ctcggcgagt tcttctatca cgatccgcgc 4226341 cgcatcgcgg cggccacaac cctgctgttg atgctgatgg caggtgtggc gctgttcgcg 4226401 acagtcatgt tgctagtggc cgcggcgaaa cgactgaccg accgtttcag accccagccg 4226461 gtgtctgtct gggcatcggc gaccgcgaca ctactgatcg gagccactct ggtcagtgcg 4226521 tggcattact ttccccggca ccgatttctg ttcggcgaca agtacgactc ggtgatgatc 4226581 gaccagaaag atctcgacgc catggcatac ctggcgagtt tgcccggcgc acgcgacacg 4226641 ttgattggca acgccaacac ggacggcacc gcgtggatgt atgccgtggc cggcctacac 4226701 ccgctgtgga cccactacga ctacccgctg caacagggcc cgggctatca ccggttcatc 4226761 ttctgggcct atggccgcaa cggggagagc gatcctcggg tactcgaggc catccaagtc 4226821 ctccgtatcc gctatatcct gaccagcact ccgacggtgc gggggtttgc cgtgccggac 4226881 ggactagtgt cgttagagac atcgaggtcg tgggcgaaga tctacgacaa cggcgaggcc 4226941 cgaatctacg aatggcgcgg cactgccgca gcaacacact cctagaaggt gcgtaagagg 4227001 atggtgattg gattgagtac cggcagcgac gacgacgacg tcgaggtcat cggcggcgtc 4227061 gacccgcggc tgatagcggt gcaggagaac gactccgacg agtcgtcgct gaccgacctg 4227121 gtcgagcagc ccgccaaggt gatgcgcatc ggcaccatga tcaagcaact gctcgaggag 4227181 gttcgcgccg ccccactcga cgaagccagc cgcaatcggc tacgcgatat ccacgccacc 4227241 agcatccgcg aactcgaaga tggtctggcc ccggaactgc gcgaggagct cgaccggctt 4227301 accctgccgt tcaacgagga cgccgtgccc tcggacgccg agttgcgcat tgcccaggca 4227361 cagctggtcg gctggctgga agggctgttc cacggcatcc aaaccgcgct atttgctcag 4227421 caaatggcgg cgcgcgcgca gctgcaacaa atgcgccagg gtgcgctgcc gcccggggtc 4227481 ggcaagtcgg gccagcacgg ccacggcacc ggacaatacc tgtaagccgt gtcggatccg 4227541 caccatcccc atatccagac gcacaacgcg tgggtggagt tccctatctt cgacgccaag 4227601 tcacgttcgc tgaagaaggc ggtcctgggt aaagcgggcg gcaccatcgg gcgcaacaac 4227661 tccaacgtcg tcgtcatcga agcgttgcgc gacatcacca tggagctgaa cctgggtgac 4227721 cgggtcggtc tggtcggaca caacggagcc ggcaaatcga cgctgctacg cctgctttcg 4227781 ggcatctacg agcccacccg cggctgggcg aaggtcaccg gaagggtggc gccggtcttc 4227841 gatctgggca tcggcatgga ccccgagatc tccggctacg agaacatcat cattcgtggg 4227901 ctgtttctgg gacagacccg caaacagatg caggcgaaag tggatgagat cgccgaattc 4227961 accgaattgg gcgagtacct ttcgatgccg ctgcgcacct attccaccgg gatgcgagtc 4228021 cgcctggcga tgggcgtggt caccagcatc gacccagaga tcctgttgct cgacgaaggc 4228081 atcggcgccg tggacgccga cttcctgagg aaggcccagt cccggctgca gaatttggtc 4228141 gaacgttccg ggatcctggt tttcgcaagc cattccaacg agtttttggc tcgactatgc 4228201 aagaccgcga tatggattga ccatggcgtc atcaggctcg ccggtggtat cgaagaggtg 4228261 gtacgggcct acgagggtga ggacgccgcc cggcacgtgc gcgaagtact ggccgagacc 4228321 caggccgaca gacagaacgt ccagggatga ctgaatcggt cttcgccgtt gtggtaaccc 4228381 accggcgccc cgacgagctg gccaagtcgc tggatgtgct gaccgcccag acccggttac 4228441 cggaccacct gatcgtggtc gataacgacg gttgcggcga cagcccggtc cgcgagcttg 4228501 tcgcgggaca accgatcgcc accacgtatt tggggtcacg ccgaaacctg ggcggtgccg 4228561 gcggtttcgc gctgggcatg ctgcacgcgc tggcacaggg cgccgattgg gtgtggctgg 4228621 ccgacgacga cgggcacgcg caagatgcta gggtactggc aaccctgctg gcgtgcgccg 4228681 agaagtacag cctcgccgag gtgtcaccga tggtgtgcaa catagacgac ccgacgcggc 4228741 tggcgtttcc gttgcggcgt ggcctggtat ggcgcaggcg cgcaagtgaa ttgcgcaccg 4228801 aggcgggcca agagctgctg cctgggatcg catcactgtt caacggcgca ctgtttcggg 4228861 catccaccct agcggcgatc ggcgtgcctg acctgcggct gttcatccgc ggcgacgagg 4228921 tggagatgca ccgccggctg atccggtccg gtctaccgtt cggaacctgt ctggacgcgg 4228981 cctacctgca cccctgcgga tcagacgaat tcaagccgat cctttgtggc cgcatgcacg 4229041 cccaatatcc cgacgatccc gggaagcggt ttttcaccta ccgcaaccgt ggctatgtat 4229101 tgtcgcaacc cggcctgcgc aaactattgg cccaggaatg gctgcggttc ggctggttct 4229161 tcctggtgac ccgccgcgac cctaaaggcc tgtgggagtg gattcggttg cgccgcctgg 4229221 gccgtcggga gaagtttggc aagcctggag gatctgcatg acattcatgg atgctcaagc 4229281 tagcttccag acacagtcgc ggacactggc ccgcgtccga ggcgatctgg tcgacgggtt 4229341 ccgccgccac gagctgtggc tgcacctggg ctggcaggac atcaagcagc ggtaccgccg 4229401 ctcggtgctg gggccgttct ggatcaccat cgccaccgga acgaccgccg tcgcgatggg 4229461 cggcctgtat tccaagctgt ttcggctcga gctgtctgag cacctgccct acgtcacgct 4229521 cgggctgatc gtctggaacc tgatcaacgc cgccatcctg gacggcgcag aggttttcgt 4229581 cgccaacgaa ggtctgatca aacagctgcc ggcaccgttg agcgtgcacg tctatcggtt 4229641 ggtgtggcgg cagatgatct tcttcgccca caacatcgtc atctacttcg tcatcgcgat 4229701 catctttcct aagccgtggt cgtgggcgga tctgtcgttt cttccggcgc tggcgctcat 4229761 tttcctcaat tgcgtttggg tgtcactgtg tttcggcatc ctggcgaccc gctaccgcga 4229821 catcggcccg ctgctgtttt ccgttgtgca gttgttgttc ttcatgacgc cgatcatctg 4229881 gaacgacgag accctgcgtc ggcagggcgc gggccgctgg tcgagcatcg tcgagctcaa 4229941 cccgctgctg cactatctgg acatcgtgcg ggcgccactg ttgggcgctc accaggagct 4230001 gcggcactgg ctggtggtgc tggtgttgac cgtcgtcggc tggatgctgg cggcgttcgc 4230061 gatgcggcag tatcgcgcgc gggtgcccta ctgggtgtag ggactattcc ggcggctata 4230121 gccgaccggc ttctttcacg cggcttgcgc gtgacgggcc gccgttgatc tcaagatcgg 4230181 ctggcaacgg ccgcgtacca gcggcagcat ggattaggtt caccgtttgc cgatgaggct 4230241 cagagggcgg gacggatgga aatacttgtc accgggggcg cgggcttcca gggaagccat 4230301 ctgaccgagt cactgctggc caatgggcat tgggtcactg tcctcgacaa gtcttcgagg 4230361 aatgcggttc gtaacatgca gggatttcgt tcgcatgacc gcgccgcgtt catatccggt 4230421 tcggtaaccg acggccagac gatcgaccgc gcggtgcggg accatcacgt cgtatttcac 4230481 ctggccgcgc atgtcaacgt ggaccagtcc ttgggcgacc cggagagctt tctcgaaacc 4230541 aatgtcatgg gaacctaccg cgtcctggaa gccgtccggc gctacaggaa ccgcttgata 4230601 tacgtatcga cgtgcgaagt ctacggcgac ggacacaatc tcaaggaagg cgaacgactt 4230661 gacgaacacg cggagctgaa gccgaacagt ccatatggcg cttccaaggc ggcggccgac 4230721 cgcttgtgct actcgtactt tcgctcctac ggactcgacg tcacgatcgt ccgtccgttc 4230781 aacatcttcg gcgtccgcca aaaggctggg cgattcggcg cgctgattcc gcggctggtc 4230841 cgccagggca tcaacggtga aggcctgaca atcttcggcg caggtagcgc aacccgggat 4230901 tacctgtatg tcagtgacat cgtgggcgcg tacaacctgg tattacgaac tccaaccctg 4230961 cgtggtcagg ccatcaattt tgccagcggg aaagataccc gggtgaggga catcgtcgag 4231021 tatgttgcgg acaagttcgg tgccaggatc gagcaccgcg acgctcgccc cggagaggtc 4231081 cagcgctttc ccgctgacat ttcgcttgcc aaaagcatcg ggttccagcc gcaagtcgaa 4231141 atttgggacg gcatcgatcg ctatatcaat tgggccaagg atcagcccca atacccatat 4231201 gagcaggacg ggtttagcgg ttccagcgtt ctctaataca cccgtcgccg ccatcgtctg 4231261 ccggtaaagt gggccgaaat ggcgcggaac taccagctgg aaggattacc tcccattcga 4231321 tggtgaccgt agcacgccga ccggtgtgcc cggtgacgct gacaccgggt gacccggcgc 4231381 tagcgtcggt gcgcgacctg gtcgacgcgt ggagcgcgca tgatgcgctg gcagagctgg 4231441 tcacgatgtt cggcggcgcg tttccgcaga cggaccatct ggaagcgcgg ctggcgagcc 4231501 tggacaagtt cagcacggca tgggactacc gggcgcgcgc acgtgcagca cgagcgctcc 4231561 acggcgaacc ggtgcggtgc caggactccg gcggtggggc gcgatggctg atcccccgcc 4231621 tggacttgcc ggccaagaag cgggacgcga tcgtcgggtt ggcgcagcag ctggggctca 4231681 ccttggaatc gaccccgcag ggaacaacct tcgaccacgt tctagtcatc ggcaccggac 4231741 gtcattccaa cctgatccgg gcccgctggg cccgggaatt ggcaaagggt cgccaggttg 4231801 gtcacatcgt gctcgccgcc gcatcgcgtc gattgctgcc ctccgaggat gacgcggtcg 4231861 cggtctgtgc gccgggcgca cgcaccgaat tcgagctatt agcggccgcg gcaagggacg 4231921 cattcggcct ggacgtccac ccagcggtgc ggtatgtgcg ccagcgggac gacaacccgc 4231981 accgggacag catggtgtgg cgcttcgccg ccgacaccaa tgacctaggc gttccgatca 4232041 ccctgctgga ggcgccatcg ccggagcccg acagcagccg cgccacctcg gccgacacct 4232101 tcacgtttac cgcacacacg ctgggtatgc aggactcaac gtgtctgttg gtgaccgggc 4232161 aaccgttcgt gccctaccag aacttcgacg cactgcgaac tctggcgctg cccttcggga 4232221 tacaggtgga gacagtgggc ttcggcatcg accgctacga cgggctgggt gagttggacc 4232281 aacaacaccc tgccaagctg ctgcaggagg tccgctcgac gatccgagcg gcccgagccc 4232341 tgctggaacg gatcgaggcc ggcgagcgca tggctaccga tcctcggcgg tgatggtgca 4232401 tggcgtggcc ggcgggtagc tgcccgatac ggctcgcaac cgtcccggtg gcggccacgg 4232461 ccgtagtccc atgttggcta ggtaccgcac cggattgaca tgcccgtcct gcgtgcggac 4232521 ctcgaaatgc agataaccat ctgccgattc gccttgcgca ccgatggtgc ccagttgcgc 4232581 tcccgcggcg attcgatcac caaggacaag gcggccctcg tccccgggcc gaaatacata 4232641 gacaacgtcg agctcgcagc gtgcgatcgt cagcgacacc aggccatcga cctcgtcgat 4232701 cgcgctgacg gccccggaag cgaccgcgta gacgggtgtt cccggatcgg tggcgaagtc 4232761 gacaccggga tggaaaccac ccgcgtgcgg accgtacccg cggccgatcg cgcgcggctc 4232821 ccggtcgatc ggcagccgcc cgcccggctc gagcggatcg aagtcgccgc ggatgcgccg 4232881 ccggtagtcg gctttgagca ggtcgacctc gtcgagtgcg tagccaaaca acaaagatcg 4232941 gtcgtaggcc accccgaagt tgaaccgata gtccggatcg agccgcagat agtgctccac 4233001 cctcaagatc cgttccgcca gcgtcgacca gccgctatgc accagccgaa cccccggaag 4233061 ctgtccgatg cgaccgtgat cggtgatgtt ggccggccaa tgcgggttgt gcatcagctt 4233121 gccgcctgcc cgcagacccg ggtaccagcg ccacaacggt ccacgtagcg cttcggcggt 4233181 tcccatcacc ggaatcaggt cgggatactc cggatcatcc cagcgtgaca ccatcggaca 4233241 catcagcgcc acgatgtcgt ccggtgtgcg ggctaacacc gcccgaagat cgatgtcggt 4233301 ctcgaccaac caatcggcat cgaccatcat cacccagtcc gggcggcaga agtccgccat 4233361 ccgatacagc agttccagcc cggcggactc aggaatcagc catggcgtgg gcggcagatc 4233421 tggtcgggcc cgcaccacgt tcgtcaccgc aggatggttc gccaggatct cggcggtgtc 4233481 atcggtgctg cggtcgtcga tcacgtagat gtcgtcgctg aacacggcca acgagtccaa 4233541 cgttgcggct agtgtccgcc cggcgttgtg cgcacgcgtc atcgccagaa tccgcatgcc 4233601 gcctctctat caccccagaa cacaggtcca gtagttgggt ctgtccgcca atccagcggg 4233661 aagggcgggc gccgcgggca gatcgtgggc ggccagcagc tgcgcggtgg tggtccccac 4233721 cgagcgccat ccgtggttgt ccaggtagcc ggacacctcg tggcgggggc cggcatagtt 4233781 gagtgcccag atgtccagat gaaagccatg ctctcgccag cctcgggtcg cggtgcggat 4233841 catctcttcc acccgagcgg aatcccgatc cgcagaaccg aggaaggcct cgagggccag 4233901 ccggctcccg ggcgcgctca agtcggtgac gtggtccagc agacgattct gcgcgtccgg 4233961 gggaaggtat ccgaacaaac cctcggcgat ccacgcggcc ggctcggccg catcgaagcc 4234021 gccgcggcgc agcgcatcgg gccaatcgtg acgcaggtcg gccggcacca tccgcagatc 4234081 cgcggtcggc tgggcaccca agccggcgag cgtttgagcc ttgaactcga gcacccgagg 4234141 ctgatcgacc tcgaacaccg tcgtatccgc cggccatggc agccggtacc cgcgtgcgtc 4234201 gagccccgac gccaggatca ccgcttgccg aacgccggcg gcggccgcgt ccaagaagaa 4234261 ctgatcgaag tagcgggtgc gcaccaccaa ctcggtcgtc attcgctgca agccccaggc 4234321 cgcgtcgggg tcgtccacat cggcagcatc cagttctccg gttgcccatc gggtgaggaa 4234381 ctcgacaccc acggcacgaa ccaacggttc ggcgaacggg tcgtcgatga ggggctgggc 4234441 cgccctggcc gccctggccc ttcccgcggc gaccagcgtg gcggtcgcgc cgacaccggt 4234501 ggctaggtcc cagctatcgt cgtcggtacg cgccacggat ccatcttcgg cccggtccgg 4234561 ccgccaacgc tccgctgtcg acccgaacaa ccggttacaa ctgcgtgacg aatatcgatg 4234621 acggctgcac cttaagggtg taacactgaa gcgccacgaa tccgatttat cgtcctgtgg 4234681 tgatcggtga aacggcaccc acagcacgct attaggtaaa cagctatccg ggcgcaggcg 4234741 acaacgcagt caccgaagcg ccgcgaaagg tcggcggacg tgagcgagaa agtcgagtca 4234801 aaggggctag cggatgcggc acgcgatcac ctcgcggctg agttggcccg gctgcggcag 4234861 cgacgcgatc ggctggaggt cgaggtcaag aacgaccggg gcatgatcgg cgatcacggc 4234921 gacgcggccg aggcgataca acgtgccgac gaactggcca tcctcggtga ccggatcaat 4234981 gaactggacc ggcggctgcg caccgggccc accccctgga gcgggtcgga aacgctgccc 4235041 ggcggcaccg aggtgacctt gcggttccct gacggtgaag tcgtcacgat gcatgtaatc 4235101 tccgtcgtcg aagagacgcc ggtgggccga gaagccgaaa ccctgacggc gcgcagccca 4235161 ctaggtcagg ccctggccgg tcaccaaccc ggcgacacgg tgacctactc gaccccgcag 4235221 ggtcctaatc aggtccagct gcttgctgtc aagctgccct cataattcgc acaccgcacc 4235281 aggctcgccg cccccattag acttcccccg atgatccgat cggagtctgg tgccgcgccg 4235341 ccacgccaac acctgcacct gtcggcacag gtaatgcggt tcgttgtcac cggcggcctc 4235401 gctgggatag ttgactttgg cctctacgtc gtgctgtaca aggtggcggg cctacaggtc 4235461 gacctgtcca aggccatcag cttcatcgtc ggcaccatca ccgcgtacct gatcaaccgc 4235521 cggtggacat tccaggccga gcccagcacg gcccgattcg tcgcggtcat gctcctctac 4235581 ggaatcacct tcgccgtgca ggtcggactc aaccacctct gcctcgcact cttgcactac 4235641 cgggcgtggg ccatccccgt cgcgtttgtg atcgcgcagg gcaccgccac ggtaatcaac 4235701 ttcatcgtgc agcgagccgt gatcttccgg atccgctgag ccggtcaggg tcgaatcggg 4235761 cgggtaccct ctttgacgat gttgagcgtg ggagctacca ctaccgccac ccggctgacc 4235821 gggtggggcc gcacagcgcc gtcggtggcg aatgtgcttc gcaccccaga tgccgagatg 4235881 atcgtcaagg cggtggctcg ggtcgccgag tcggggggcg gccggggtgc tatcgcgcgc 4235941 gggctgggcc gctcctatgg ggacaacgcc caaaacggcg gtgggttggt gatcgacatg 4236001 acgccgctga acactatcca ctccattgac gccgacacca agctggtcga catcgacgcc 4236061 ggggtcaacc tcgaccaact gatgaaagcc gccctgccgt tcgggctgtg ggtcccggtg 4236121 ctgccgggaa cccggcaggt caccgtcggc ggggcgatcg cctgcgatat ccacggcaag 4236181 aaccatcaca gcgctggcag cttcggtaac cacgtgcgca gcatggacct gctgaccgcc 4236241 gacggcgaga tccgtcatct cactccgacc ggcgaggacg ccgaactgtt ctgggccacc 4236301 gtcgggggca acggtctcac cggcatcatc atgcgggcca ccatcgagat gacgcccact 4236361 tcgacggcgt acttcatcgc cgacggcgac gtcaccgcca gcctcgacga gaccatcgcc 4236421 ctgcacagcg acggcagcga agcgcgctac acctattcca gtgcctggtt cgacgcgatc 4236481 agcgctcccc cgaagctggg ccgcgcggcg gtatcgcgtg gccgcctggc caccgtcgag 4236541 caattgcctg cgaaactgcg gagcgaacct ttgaaattcg atgcgccaca gctacttacg 4236601 ttgcccgacg tgtttcccaa cgggctggcc aacaaatata ccttcggccc gatcggcgaa 4236661 ctgtggtacc gcaaatccgg cacctatcgc ggcaaggtcc agaacctcac gcagttctac 4236721 catccgctgg acatgttcgg cgaatggaac cgcgcctacg gcccagcggg cttcctgcaa 4236781 tatcagttcg tgatccccac agaggcggtt gatgagttca agaagatcat cggcgttatt 4236841 caagcctcgg gtcactactc gtttctcaac gtgttcaagc tgttcggccc ccgcaaccag 4236901 gcgccgctca gcttccccat cccgggctgg aacatctgcg tcgacttccc catcaaggac 4236961 gggctgggga agttcgtcag cgaactcgac cgccgggtac tggaattcgg cggccggctc 4237021 tacaccgcca aagactcccg taccaccgcc gaaacctttc atgccatgta tccgcgcgtc 4237081 gacgaatgga tctccgtgcg ccgcaaggtc gatccgctgc gcgtattcgc ctccgacatg 4237141 gcccgacgct tggagctgct gtagatggtt cttgatgccg taggaaaccc ccagacggtg 4237201 ctgctgctcg gtggcacctc cgagatcggg ctcgccatct gcgagcgcta cctgcacaat 4237261 tcggcggccc gcatcgtgct ggcctgcctg cccgacgacc cacggcggga ggacgcggcc 4237321 gctgcgatga agcaggccgg cgcgcggtcg gtggagctga tcgactttga cgccctggat 4237381 accgacagcc acccgaagat gatcgaggcg gccttctccg gcggtgatgt ggacgtggct 4237441 atcgtcgcgt tcggcttgct cggcgacgcc gaagagctgt ggcagaacca gcgcaaggcg 4237501 gtgcagatcg ccgaaatcaa ctacaccgca gcggtttcgg tgggcgtgct gctggctgag 4237561 aagatgcgcg ctcagggctt cggtcagatc atcgcgatga gctcggccgc cggtgagcgg 4237621 gtgcgacggg cgaacttcgt ctacggctcc accaaggccg gtctggacgg gttttacctg 4237681 gggttgtcag aagcgctgcg cgagtacggt gttcgtgtgc tggtgatccg gcccggccag 4237741 gtgcgtaccc ggatgagcgc gcacctcaag gaagctccat tgaccgtcga caaggagtac 4237801 gtcgccaacc tcgcggtgac cgcgtccgca aaaggtaagg aattggtttg ggcgccagca 4237861 gcgttccgct acgtcatgat ggtgttgcgt cacatcccgc ggagcatctt ccgcaagctg 4237921 cccatctgag tatgccgagc agacgcaaaa gcccccaatt cgggcacgaa atgggggctt 4237981 ttacgtctgc tcgcgcccgg gaggtgctgg tcgctcttgg ccagctggca gcggcggtgg 4238041 tagtggccgt cggtgtcgcg gtggtgtccc tgctcgccat tgcgcgggtg gagtggcccg 4238101 ccttcccgtc gtccaaccag ctgcatgcgc tgaccaccgt cggccaggtc ggctgcctgg 4238161 ccgggctggt cggcatcggc tggttgtggc ggcacggtcg attccggcga ctggcccggc 4238221 tgggcgggct ggttttggta tccgcgttta ccgtcgtgac gctgggcatg ccgctgggcg 4238281 ccaccaagct gtatctgttc ggcatctctg tcgaccagca gttccgcacc gaatacctca 4238341 cccggctcac cgacaccgcc gccctgcgcg acatgaccta catcggactg ccaccgtttt 4238401 acccaccggg ctggttctgg atcggcggac gcgcggcggc gctgaccggg acgccggcct 4238461 gggagatgtt caagccgtgg gcgatcacct cgatggccat tgcggtggcc gtcgcgctgg 4238521 tgctgtggtg gcggatgatc cgcttcgaat acgccttgct ggtcaccgtc gccacagcgg 4238581 cggtgatgct ggcctacagc tcgccggagc cctacgccgc gatgatcacg gtgttgttgc 4238641 cgccgatgct cgtactgacc tggtcgggcc tgggcgcgcg cgaccgtcag ggctgggccg 4238701 cggtggtcgg tgccggcgtc ttcctgggct tcgcggccac ctggtacacc ctgttggtcg 4238761 cctacggcgc gttcacggtg gtgctgatgg cgctgctgct ggccgggtcg cggctgcaat 4238821 ccggaatcaa ggcggcggta gacccgctgt gccggcttgc cgtcgtcggc gcgatcgcgg 4238881 ccgccatcgg atccaccacc tggctgccct acctgctgcg ggcggcccgc gacccggtca 4238941 gcgacaccgg cagcgcccag cactacctac ccgcagacgg cgccgcactg accttcccca 4239001 tgctgcagtt ctccctgctg ggcgcgatct gtctgctggg cacgctgtgg ctggtgatgc 4239061 gcgcgcgatc atcggcgcca gccggcgccc tggccatcgg cgtgctggcc gtctacctgt 4239121 ggtccctgct gtcgatgctg gccacattgg cgcgcaccac actgctgtcg tttcgcctgc 4239181 agccgacgct gagcgtgctg ctggtggcgg ccggtgcgtt cggcttcgtc gaagcggtcc 4239241 aagcccttgg caaacggggt cgcggtgtca ttccgatggc cgccgccatc gggttggccg 4239301 gcgcgatcgc gttcagccag gacatccccg acgtgttgcg gccggacctg accatcgcct 4239361 acaccgacac cgacggctac ggccagcgcg gcgaccggcg accgcccggc tccgagaagt 4239421 actacccagc catcgatgcc gccatccggc gcgtcaccgg caagcgccgc gatcggaccg 4239481 tcgtgttgac cgccgactac agcttcctgt cgtactaccc ctactggggc tttcaggggt 4239541 tgacgccgca ctacgccaac ccgctggcac agttcgacaa gcgcgccaca cagatcgaca 4239601 gctggtcggg actctccacc gccgacgagt tcatcgccgc gctggacaag ctgccctggc 4239661 agccgccgac cgtcttcctc atgcgccacg gcgcacataa cagctacacc ctgcggctgg 4239721 cccaggacgt ctaccccaac cagcccaatg ttcgccgcta cacggtggac ctacggaccg 4239781 ccctcttcgc cgacccgcgt ttcgtcgtcg aggacattgg cccgttcgtg ctggccatcc 4239841 gcaagccgca ggagagcgcg tgatggctac cgaagccgcc ccaccccgta tcgccgtccg 4239901 gctaccatct acctccgtgc gcgacgcggg agcaaactac cggatcgccc ggtacgtcgc 4239961 tgtggtggcg ggtctgctag gcgctgtgct ggccatcgcc accccactgc tgccggtcaa 4240021 ccagaccacc gcgcaattga actggcccca aaacggcacg ttcgccagtg tcgaggcacc 4240081 gctgattggc tacgtggcca ccgacttgaa catcaccgtc ccctgccagg ccgccgccgg 4240141 actggccgga tcgcagaaca ccggcaagac ggtgttgttg tcaacggtgc ccaagcaggc 4240201 gcctaaggcc gtcgatcgcg ggctgctgct gcaacgggcc aacgacgacc tggtgcttgt 4240261 ggtgcgtaat gtcccgttgg tcaccgcccc gctgagtcag gtgctcggcc cgacctgtca 4240321 gcggttgaca ttcaccgcgc acgccgatcg ggtcgccgcc gaattcgtcg gactggtgca 4240381 gggacccaat gctgagcacc ccggtgcacc gctgcgcggt gagcgcagcg gctacgactt 4240441 ccgcccgcag atcgtcgggg tgttcaccga cctggccggg ccggcgccac cgggtctgag 4240501 cttctcggcg agcgtggata cccgctacag cagcagcccc acgccgctga agatggccgc 4240561 catgatcctc ggggtagcgc tcaccggcgc cgccctggtg gcgctgcaca tcctggacac 4240621 cgccgacggc atgcggcacc ggcggttcct gcccgcgcgc tggtggtcga ccggcggtct 4240681 ggacaccctg gttatcgccg tgctggtgtg gtggcatttc gtcggggcca acacctccga 4240741 cgacggctac atcctgacca tggcccgggt gtccgagcat gcgggctata tggccaacta 4240801 ctaccgctgg ttcggcacac ccgaggcgcc tttcggctgg tactacgacc tgctggcgct 4240861 gtgggctcat gtcagcacgg ccagtatctg gatgcgccta cccaccctgg cgatggcgct 4240921 cacctgctgg tgggtaatca gccgtgaggt cattccccgg ctggggcacg ccgtcaagac 4240981 gagccgggca gcggcgtgga cggcggcggg catgtttctg gctgtctggc tgccgctgga 4241041 caacggcctt cggcccgagc cgatcatcgc cctgggcatc ctgctgacct ggtgctcggt 4241101 ggagcgggcg gtggccacca gccggctgct gccggtggca atcgcctgca tcatcggtgc 4241161 cttgaccctg ttctccgggc cgacgggcat cgcctcgatc ggtgcgctgc tggtcgcgat 4241221 cgggccgcta cggaccatcc tgcaccggcg ttccaggcgg ttcggcgtgc taccactggt 4241281 ggcgccgatc ctggccgcgg ccaccgtcac cgcgatcccg atctttcgtg atcagacctt 4241341 cgcgggcgag atccaggcca acctcctcaa gcgtgccgta gggcccagcc tgaagtggtt 4241401 cgacgaacac atccgctacg agcggctgtt catggccagc cccgacggct cgatcgcccg 4241461 ccgcttcgcc gtgctggcct tggtgctggc gctcgcggta tcggtggcaa tgtcgttacg 4241521 taagggccgc attccaggta ccgctgctgg accgagccgc cgcatcatcg gcatcacgat 4241581 catttccttc ctcgcgatga tgttcacccc gacaaagtgg acccatcact tcggggtgtt 4241641 cgcggggttg gccgggtcgc tgggggcgct tgccgcggtc gcggtgacgg gcgctgcgat 4241701 gcgctcgcgg cggaaccgga ccgtgttcgc cgccgtggtg gtcttcgtgt tggccctgtc 4241761 gttcgccagt gtcaacggct ggtggtacgt gtccaacttc ggtgtgccat ggtcgaactc 4241821 gtttccgaag tggcgatggt cgcttaccac cgcactcctc gagctgacgg tgctggtgct 4241881 gctgctagcg gcatggttcc acttcgtcgc caacggtgac gggcgccgaa cagccaggcc 4241941 aacccggttt agggcacgac tagccggaat tgtccagtcc ccgttggcaa ttgccacgtg 4242001 gttgctggtg cttttcgagg tggtatcgct gacccaggcg atgatttccc agtacccggc 4242061 gtggtcggtt ggccggtcta acctacaggc tttggccggc aagacctgcg ggctggccga 4242121 agacgtgctg gtggagctgg atcccaacgc aggcatgctg gcgccggtga ccgcgccgtt 4242181 ggccgacgcc ctgggagccg gcctgtctga agccttcaca cccaacggca ttcccgccga 4242241 cgtcaccgcc gacccggtga tggaacgtcc aggggatcgc agtttcctca acgacgacgg 4242301 gctgatcacc ggcagcgaac ccggcaccga agggggcacc acggccgcac cgggaatcaa 4242361 cggctcccgc gcccggctgc cctacaacct ggacccggcc cgtacaccgg tgctgggcag 4242421 ctggcgagcc ggcgtgcagg tgcccgccat gctgcggtcg ggctggtacc ggctgcccac 4242481 caacgagcag cgggacaggg cgccgctgct ggtggtgacg gcggccgggc gattcgactc 4242541 ccgcgaggtc cggttgcagt gggccaccga cgagcaagcg gccgccggac accacggtgg 4242601 gtcgatggaa ttcgccgacg tcggtgccgc gccggcctgg cgcaacctgc gcgcaccact 4242661 gtccgccatc ccgagcaccg ccacccaggt ccggttggtc gccgacgacc aggatctggc 4242721 gccgcagcac tggatcgccc tcacaccacc gcggattccg cgggtgcgca cgctgcagaa 4242781 cgtggtgggc gcagcggatc cggtgttcct ggactggctg gtggggctgg cattcccctg 4242841 ccaacgcccg ttcggccacc aatacggcgt cgacgagaca cccaagtggc ggatcctgcc 4242901 ggaccggttc ggcgccgaag ccaactcacc ggtgatggat cacaatggcg gtggcccgct 4242961 gggcatcacc gagctgctga tgcgcgcaac cacggtggcc agctacctca aagacgactg 4243021 gtttagggac tggggcgcgt tacagcggtt gacgccttac taccccgacg cccagcccgc 4243081 tgatctgaac ctaggaacgg tgactcgcag cgggctgtgg agtccggcgc cgttgcgccg 4243141 cggctagaag tgccgtggcc accgactcgg cgacaacctc cgcggccccg catcctcacc 4243201 gcccttaacc gcgtcgccta ccatcgagcc tcgtgcccca cgacggtaat gagcgatctc 4243261 accggatcgc acgcctagca gccgtcgtct cgggaatcgc gggtctgctg ctgtgcggca 4243321 tcgttccgct gcttccggtg aaccaaacca ccgcgaccat cttctggccg cagggcagca 4243381 ccgccgacgg caacatcacc cagatcaccg cccctctggt atccggggcg ccacgcgcgc 4243441 tggacatctc gatcccctgc tcggccatcg ccacgctgcc cgccaacggc ggcctggtgc 4243501 tgtccacact gccggccggt ggcgtggata ccggtaaggc cgggctgttc gtccgcgcca 4243561 accaggacac ggtcgtcgtg gcgttccgcg actcggtggc cgcggtggcg gcccgctcca 4243621 cgatcgcagc gggaggctgt agcgcgctgc atatctgggc cgataccggc ggcgcgggcg 4243681 ctgattttat gggtataccc ggcggcgccg ggaccctgcc gccggagaag aagccacagg 4243741 ttggcggcat cttcaccgac ctgaaggtcg gagcgcagcc cgggctgtcg gcccgcgtcg 4243801 acatcgacac tcggtttatc acgacgcccg gcgcgctcaa gaaggccgtg atgctcctcg 4243861 gcgtgctggc ggtcctggta gccatggtgg ggctggccgc gctggaccgg ctcagcaggg 4243921 gccgcaccct gcgcgactgg ctgacccgat atcgcccgcg ggtgcgggtc ggattcgcca 4243981 gccggctcgc tgacgcagcg gtgatcgcga ccttgttgct ctggcatgtc atcggcgcca 4244041 cctcgtccga tgacggctac cttctgaccg tcgcccgggt cgccccgaag gccggctatg 4244101 tagccaacta ctaccggtat ttcggcacga cggaggcgcc gttcgactgg tatacatcgg 4244161 tgcttgccca gctggcggcg gtgagcaccg ccggcgtctg gatgcgcctg cccgccaccc 4244221 tggccggaat cgcctgctgg ctgatcgtca gccgtttcgt gctgcggcgg ctgggaccgg 4244281 gcccgggcgg gctggcgtcc aaccgggtcg ctgtgttcac cgctggtgcg gtgttcctgt 4244341 ccgcctggct gccgttcaac aacggcctgc gtcccgagcc gctgatcgcg ctgggtgtgc 4244401 tggtcacgtg ggtgttggtg gaacggtcga tcgcgctcgg acggctggcc ccggccgcgg 4244461 tagccatcat cgtggcgacg cttaccgcga cgctggcacc gcaggggttg atcgcgctgg 4244521 ccccgctgct gactggtgcg cgcgccatcg cccagaggat ccggcgccgc cgggcgaccg 4244581 atggactgct ggcgccgctg gcggtgctgg ccgcggcgtt gtcgctgatc accgtggtgg 4244641 tgtttcggga ccagacgctg gccacggtgg ccgaatcggc acgcatcaag tacaaggtcg 4244701 gcccgaccat cgcctggtac caggacttcc tgcgctacta cttccttacc gtggagagca 4244761 acgttgaggg gtcgatgtcc cgccggttcg cggtgctggt gttgctgttc tgcctgttcg 4244821 gggtgctgtt cgtgctgctg cggcgcggcc gggtggcggg gctggccagc ggcccggcct 4244881 ggcgactgat cggcactacg gcggtcggcc tgctgctgct cacgttcacg ccaaccaagt 4244941 gggccgtgca gttcggcgca ttcgccgggc tggccggggt gttgggtgcg gtcaccgcgt 4245001 tcacctttgc ccgcatcggt ctacatagtc gacgcaacct cacgctgtac gtgaccgcgt 4245061 tgctgttcgt gctggcgtgg gcaacctcgg gcatcaacgg gtggttctac gtcggcaact 4245121 acggggtgcc gtggtatgac atccagcccg tcatcgccag ccacccggtg acgtcgatgt 4245181 ttctgacgct gtcgatcctc accggattgc tggcagcctg gtatcacttc cggatggact 4245241 acgccgggca caccgaagtc aaagacaacc ggcgcaaccg catcttggcc tctacgccac 4245301 tgctggtggt cgcggtgatc atggtcgcag gcgaagtcgg ctcgatggcc aaggccgcgg 4245361 tgttccgtta cccgctttac accaccgcca aggccaacct gaccgcgctc agcaccgggc 4245421 tgtccagctg tgcgatggcc gacgacgtgc tggccgagcc cgaccccaat gccggcatgc 4245481 tgcaaccggt tccgggccag gcgttcggac cggacggacc gctgggcggt atcagtcccg 4245541 tcggcttcaa acccgagggc gtgggcgagg acctcaagtc cgacccggtg gtctccaaac 4245601 ccgggctggt caactccgat gcgtcgccca acaaacccaa cgccgccatc accgactccg 4245661 cgggcaccgc cggagggaag ggcccggtcg ggatcaacgg gtcgcacgcg gcgctgccgt 4245721 tcggattgga cccggcacgt accccggtga tgggcagcta cggggagaac aacctggccg 4245781 ccacggccac ctcggcctgg taccagttac cgccccgcag cccggaccgg ccgctggtgg 4245841 tggtttccgc ggccggcgcc atctggtcct acaaggagga cggcgatttc atctacggcc 4245901 agtccctgaa actgcagtgg ggcgtcaccg gcccggacgg ccgcatccag ccactggggc 4245961 aggtatttcc gatcgacatc ggaccgcaac ccgcgtggcg caatctgcgg tttccgctgg 4246021 cctgggcgcc gccggaggcc gacgtggcgc gcattgtcgc ctatgacccg aacctgagcc 4246081 ctgagcaatg gttcgccttc accccgcccc gggttccggt gctggaatct ctgcagcggt 4246141 tgatcgggtc agcgacaccg gtgttgatgg acatcgcgac cgcagccaac ttcccctgcc 4246201 agcgaccgtt ttccgagcat ctcggcattg ccgagcttcc gcagtaccgg atcctgccgg 4246261 accacaagca gacggcggcg tcgtcgaacc tatggcagtc cagctcgacc ggcggtccgt 4246321 tcctgttcac ccaggcgctg ctgcgcacct cgacgatcgc cacgtacctg cgtggggact 4246381 ggtatcgcga ctggggatcg gtggagcagt accaccggct ggtgccggcc gatcaggctc 4246441 cagacgccgt tgtcgaggag ggcgtgatca ctgtgcccgg ctggggtcgg ccaggaccga 4246501 tcagggcgct gccatgacac agtgcgcgag cagacgcaaa agcaccccaa atcgggcgat 4246561 tttgggggct tttgcgtctg ctcgcgggac gcgctgggtg gccaccatcg ccgggctgat 4246621 tggctttgtg ttgtcggtgg cgacgccgct gctgcccgtc gtgcagacca ccgcgatgct 4246681 cgactggcca cagcgggggc aactgggcag cgtgaccgcc ccgctgatct cgctgacgcc 4246741 ggtcgacttt accgccaccg tgccgtgcga cgtggtgcgc gccatgccac ccgcgggcgg 4246801 ggtggtgctg ggcaccgcac ccaagcaagg caaggacgcc aatttgcagg cgttgttcgt 4246861 cgtcgtcagc gcccagcgcg tggacgtcac cgaccgcaac gtggtgatct tgtccgtgcc 4246921 gcgcgagcag gtgacgtccc cgcagtgtca acgcatcgag gtcacctcta cccacgccgg 4246981 caccttcgcc aacttcgtcg ggctcaagga cccgtcgggc gcgccgctgc gcagcggctt 4247041 ccccgacccc aacctgcgcc cgcagattgt cggggtgttc accgacctga ccgggcccgc 4247101 gccgcccggg ctggcggtct cggcgaccat cgacacccgg ttctccaccc ggccgaccac 4247161 gctgaaactg ctggcgatca tcggggcgat cgtggccacc gtcgtcgcac tgatcgcgtt 4247221 gtggcgcctg gaccagttgg acgggcgggg ctcaattgcc cagctcctcc tcaggccgtt 4247281 ccggcctgca tcgtcgccgg gcggcatgcg ccggctgatt ccggcaagct ggcgcacctt 4247341 caccctgacc gacgccgtgg tgatattcgg cttcctgctc tggcatgtca tcggcgcgaa 4247401 ttcgtcggac gacggctaca tcctgggcat ggcccgagtc gccgaccacg ccggctacat 4247461 gtccaactat ttccgctggt tcggcagccc ggaggatccc ttcggctggt attacaacct 4247521 gctggcgctg atgacccatg tcagcgacgc cagtctgtgg atgcgcctgc cagacctggc 4247581 cgccgggcta gtgtgctggc tgctgctgtc gcgtgaggtg ctgccccgcc tcgggccggc 4247641 ggtggaggcc agcaaacccg cctactgggc ggcggccatg gtcttgctga ccgcgtggat 4247701 gccgttcaac aacggcctgc ggccggaggg catcatcgcg ctcggctcgc tggtcaccta 4247761 tgtgctgatc gagcggtcca tgcggtacag ccggctcaca ccggcggcgc tggccgtcgt 4247821 taccgccgca ttcacactgg gtgtgcagcc caccggcctg atcgcggtgg ccgcgctggt 4247881 ggccggcggc cgcccgatgc tgcggatctt ggtgcgccgt catcgcctgg tcggcacgtt 4247941 gccgttggtg tcgccgatgc tggccgccgg caccgtcatc ctgaccgtgg tgttcgccga 4248001 ccagaccctg tcaacggtgt tggaagccac cagggttcgc gccaaaatcg ggccgagcca 4248061 ggcgtggtat accgagaacc tgcgttacta ctacctcatc ctgcccaccg tcgacggttc 4248121 gctgtcgcgg cgcttcggct ttttgatcac cgcgctatgc ctgttcaccg cggtgttcat 4248181 catgttgcgg cgcaagcgaa ttcccagcgt ggcccgcgga ccggcgtggc ggctgatggg 4248241 cgtcatcttc ggcaccatgt tcttcctgat gttcacgccc accaagtggg tgcaccactt 4248301 cgggctgttc gccgccgtag gggcggcgat ggccgcgctg acgacggtgt tggtatcccc 4248361 atcggtgctg cgctggtcgc gcaaccggat ggcgttcctg gcggcgttat tcttcctgct 4248421 ggcgttgtgt tgggccacca ccaacggctg gtggtatgtc tccagctacg gtgtgccgtt 4248481 caacagcgcg atgccgaaga tcgacgggat cacagtcagc acaatctttt tcgccctgtt 4248541 tgcgatcgcc gccggctatg cggcctggct gcacttcgcg ccccgcggcg ccggcgaagg 4248601 gcggctgatc cgcgcgctga cgacagcccc ggtaccgatc gtggccggtt tcatggcggc 4248661 ggtgttcgtc gcgtccatgg tggccgggat cgtgcgacag tacccgacct actccaacgg 4248721 ctggtccaac gtgcgggcgt ttgtcggcgg ctgcggactg gccgacgacg tactcgtcga 4248781 gcctgatacc aatgcgggtt tcatgaagcc gctggacggc gattcgggtt cttggggccc 4248841 cttgggcccg ctgggtggag tcaacccggt cggcttcacg cccaacggcg taccggaaca 4248901 cacggtggcc gaggcgatcg tgatgaaacc caaccagccc ggcaccgact acgactggga 4248961 tgcgccgacc aagctgacga gtcctggcat caatggttct acggtgccgc tgccctatgg 4249021 gctcgatccc gcccgggtac cgttggcagg cacctacacc accggcgcac agcaacagag 4249081 cacactcgtc tcggcgtggt atctcctgcc taagccggac gacgggcatc cgctggtcgt 4249141 ggtgaccgcc gcgggcaaga tcgccggcaa cagcgtgctg cacgggtaca cccccgggca 4249201 gactgtggtg ctcgaatacg ccatgccggg acccggagcg ctggtacccg ccgggcggat 4249261 ggtgcccgac gacctatacg gagagcagcc caaggcgtgg cgcaacctgc gcttcgcccg 4249321 agcaaagatg cccgccgatg ccgtcgcggt ccgggtggtg gccgaggatc tgtcgctgac 4249381 accggaggac tggatcgcgg tgaccccgcc gcgggtaccg gacctgcgct cactgcagga 4249441 atatgtgggc tcgacgcagc cggtgctgct ggactgggcg gtcggtttgg ccttcccgtg 4249501 ccagcagccg atgctgcacg ccaatggcat cgccgaaatc ccgaagttcc gcatcacacc 4249561 ggactactcg gctaagaagc tggacaccga cacgtgggaa gacggcacta acggcggcct 4249621 gctcgggatc accgacctgt tgctgcgggc ccacgtcatg gccacctacc tgtcccgcga 4249681 ctgggcccgc gattggggtt ccctgcgcaa gttcgacacc ctggtcgatg cccctcccgc 4249741 ccagctcgag ttgggcaccg cgacccgcag cggcctgtgg tcaccgggca agatccgaat 4249801 tggtccatag cgtcaggctc cgcagtcgat agcggcacga tgttcgtcat tagacggccc 4249861 catcagttag gcctcctatg ctgctcggta tgcaccaggc cggccatgtt ggcacacacg 4249921 aacggcgcgc agccgcaacg aggcggtccg ccctgactgc ggcagggtta gccgtcgtcg 4249981 gcgcaggggt gttgggcgcg tcggcgtgca gtccacaaaa gtctcctcag ccatcatcac 4250041 cccggttgcc cgacaatgcg ctgatcacgc tcggggtggc cgccggcccg ccgcctacgc 4250101 ccagcagagt aggaatctcg tcggtgctga aaattggccg cgatctgtac gtgatcgatt 4250161 gcggcctggg ctcgctgaac gcattcacca acgcgggcct gcaattcgac gatctcaaag 4250221 ccatgtttat cacccacttg cacaccgacc acatcgtcga ctactacaac ttctttctct 4250281 ccggtggctt ccttgcccca cccggtcgag cgccggtcct ggtctatggt ccgggcccag 4250341 ctgggggttt gccgccaagt gaagtcggca acccgaatcc agccaccgtc aaccccgcca 4250401 acccgacacc gggccttgcc gcggccaccg aagcgctgca tcgagcgttc gcttacacca 4250461 gcaacatctt catccgcgac tacggcattg acaacgttgc ggacctggtt aaagtcacgg 4250521 agatcgggct accaccagga tcggactacc gcaacagagc gccaaagatg agcccgttct 4250581 cggtcgcatc ggacgacaac gtttccgtca ccgcaacgct ggtctcccac tacgacgtct 4250641 acccagcgtt cggattccgc ttcgatctga agaaatcggg tgtgtccgtt accttctcgg 4250701 gtgacaccac taagtccgac aacctgatta ccctcgctca aggcactgac attctggtcc 4250761 acgaggcggt gttcagcctc gatacggctt actttggcaa cgctttcccc ccgaactatc 4250821 tggtgaactc acacacctcc gcagagcagg tgggggaggt ggccgcagcg gccaagccca 4250881 aacaattgat cctgagccac tacgcccctg acgacctacc cgactcgcag tggctcgaca 4250941 agatcaagaa gaattactcg ggcatgacca ccatcgcgcg ggacggccag gtcttcgccc 4251001 tctgatccgt tagcggtagc gccccgttcg acgatcgctg cctagagcta gacatatata 4251061 aaacctatgc aatagggtcg cggcatgccc gagtacgacc tagaggccgt ggacaagctg 4251121 cccttctcga cccctgaaaa ggcgcagcgc taccaaacgg aaaactatcg cggggccatg 4251181 ggcctcaact ggtacctcac ggatccgacc ctgcagttca tcatggccta ttacctacga 4251241 cccgatgaat tggcgttcgc agaaccccat ctgacccgca ttggtgagct gacggggggg 4251301 ccagtgacgc gttgggccga ggaaaccgac cgcaaccccc cgcggctcga acgctacgac 4251361 cggtgggggc atgacatcag ccgggtagtg ctgccggaat cgttcatcca atccaagcgc 4251421 gccgtcatcg aggcgcgaca agccgtgcgc gacgacgcgg cacgggccgg cgtcaagccg 4251481 tcgctggcac tcttcgccgc cgactatctg ctcaaccagg ccgatatcgg tatggcttgc 4251541 gcgctcgcca ctggcggcaa catggtccgg tcgctggtga ctgcctacgc gccacccgat 4251601 gtgcgcgaat tcgtcctagg caaactcaat tccggcgagt gggacggcga ggccgcgcag 4251661 ctgctgacgg agcgtgcggg cggctccgat ctgggagctc tggagacgac ggccacccgc 4251721 agcggcgacg tgtggctgct gaacggcttc aagtggtttg cgtccaactg cgccggggag 4251781 gcgttcgtgg tgttggccaa gcccgagggg gcgcctgact cgactcgagg tgtggccacc 4251841 ttcctcgtgc tacggacgcg ccgtgacggt tcccgcaacg gcgtgcgtat ccgtcggctg 4251901 aaggacaagc tcggcacccg ctctgtcgcc tccggtgaaa tcgagttcgt cgacgccgaa 4251961 gcctttctgt tgtccggcga accgagcgct gacgcgggcc cgtccgacgg caagggactc 4252021 acccgcatga tggagctgac caacagattg cggttgggca ccgcctcgtt cgccctcggc 4252081 aacgcgcgcc gcgcgctggt cgaatcgctg tgctacgccg ggcagcggcg ggcattcggt 4252141 ggggcgctca tcgacaagcc gctgatgcgc cgcaagctgg ccgaaatggt cgttgatgtg 4252201 gaagccgcgc tggcgatggt gttcgacggc ttcggagcgg cgaaccaccg ccagcccaga 4252261 tgcctgccgc aacgtatcgc ggtgccggtc accaagctta agacttgccg gctcgggatc 4252321 accgtggcat cggatgcgat cgagatccac ggcggcaatg gctacatcga gacctggccg 4252381 gtggcccggt tgctgcgtga cgcgcaagtc aacacgatct gggagggccc cgacaacatc 4252441 ctgtgtctgg atgtgcggcg cgggatcgag cagacgcgcg ctcacgagac actgttggcg 4252501 cggctgcgcg atgcggtgtc ggtgtccgac gatgacgaca ccacgcggct ggtctcgcgc 4252561 cgcattgagg acctcgacgc ggcgatcacc gcttggacca aactcgacag gcagctggcc 4252621 gaggcgcggc tgttcccgct ggcccaattc atgggcgacg tctacgccgg cgcgttgctc 4252681 accgagcagg ccgcctggga acgggcaacc cgcggcaccg accgcaaggc actcgtcgcc 4252741 cgcctgtacg cgcgccggta tctcgccgac caaggcccgc tgcgcggtat cgacgcagat 4252801 tgcgatgagg cgctgcagcg tttcgacgaa ctcgtggcgg gcgcgttcac tgccgagcag 4252861 acgtaaaagc ccccaattcg tggctcttct gacacttccg tgggtgagtt tgtgtcctga 4252921 gtaggcgcac gtcgttgtgg cttaaggttt ctggcttgtc aaggatcaga aacacaagga 4252981 gccgacaacg acgtgcgcaa tgtgaggcta tttcgtgcgc tgctgggtgt cgacaagcgc 4253041 accgtgattg aggacatcga attcgaggag gatgacgccg gagacggtgc gcgggtgatc 4253101 gcccgggtgc ggccacgaag tgcagtgttg cgccgctgtg gtcgctgcgg tcgcaaggcg 4253161 tcctggtatg accgcggtgc gggcctgcgc caatggcgca gtctggattg gggcaccgtc 4253221 gaggtgttct tggaggccga ggcgccgcgg gtgaactgcc ccacccatgg gccgacggtg 4253281 gtggcggtgc cgtgggcgcg tcatcatgcc gggcacacgt atgctttcga tgacacggtg 4253341 gcctggctgg cggtggcgtg ttcgaagacc gcggtgtgcg agttgatgcg gatcgcctgg 4253401 cgcaccgtcg gggcgatcgt ggcccgggtc tgggccgaca ccgaaaagcg cattgaccgg 4253461 ttcgcgaact tgcgccgcat cggtatcgat gagatctcct acaagcgcca ccaccggtac 4253521 ctgacggtgg tcgtcgatca cgacagcggc cggttggtgt gggccgcccc gggccacgac 4253581 aaggccaccc tgggcttgtt cttcgatgcc ctgggcgctg agcgggccgc ccagattact 4253641 cacgtttcgg ccgatgccgc ggactggatc gctgacgtgg tcaccgagcg ctgcccggat 4253701 gcgattcaat gcgccgatcc gtttcatgtg gtggcctggg ccaccgaggc gctcgacgtc 4253761 gagcggcgcc gagcctggaa cgacgcacgg gcgatcgcgc gcaccgaacc caagtggggc 4253821 cggggccggc ccggtaagaa cgccgcacca cgtccgggcc gcgagcgggc acggcggctc 4253881 aagggcgccc gctacgcgct gtggaagaac cccgaggacc tcaccgaacg ccaaagcgcc 4253941 aaactggcct ggatcgccaa gaccgatccc cgtctgtatc gcgcctacct gctcaaagag 4254001 agcctgcggc atgtgttttc ggtcaagggc gaggaaggta aacaggccct ggaccggtgg 4254061 atctcctggg cccagcgctg tcgcatcccg gtattcgtcg agcttgccgc ccgcatcaaa 4254121 cgccaccggg tggccatcga cgccgccctc gaccacggcc tatcccaagg cctgatcgaa 4254181 tccaccaaca ccaagatccg cctactgacc cggatcgcgt tcggattccg ctcaccacaa 4254241 gccctcatcg ccctagccat gctcaccctc gccggccacc gccccaccct gccaggccga 4254301 cacaaccacc cacagatcag tcagtagagc ccaattcgta ccgaatttgg gggcttttac 4254361 gtctgctcgc gctacccagc tagaccggga tcaggccgtg cttgcggccc acccgccacc 4254421 acagctgctt gtcccgcagc aggtgcatcg acttgcgcaa cagcagccgg gtctcatgcg 4254481 ggtcgatgac ggcatcgatg aacccgcgct cggcggcgat ccacgggatc gccatgttga 4254541 ggttgtaatt ctcgacgaag ctcttccgga tcgcttgcgc ctccggcgca ttcgggtccg 4254601 ggaaacgctt catcagcaac tgcgcggccc cgtcggcgcc gatcaccgcg atgcgcgcgg 4254661 tgggccaggc gaagttcagg tcggcggtca gctgcttgga ccccatcacc gcgtaggcac 4254721 cgccgtagga cttgcggatg gtgatcgtca ccttcggcac atcagcctcg accaccgcgt 4254781 acaagaacct cccaccgcgc ttgatgatcc cgttcttttc ctgttccacc ccgggcaaaa 4254841 accccggtgt gtccacgacg aacaccagcg ggatgtcgaa cgcgtcgcta aaccggatga 4254901 accgtgcggc cttgtcggac gcctcgttgt cgatcgcccc cgacatgtgc atgggctggt 4254961 tggccaccac accaacggtc cgcccgtcca cccgcgcgta gccggtgatg atcgcctgcc 4255021 cggcctgggc agcgacgtcg aggaagtcgc cgtcgtcgaa gatccgcagc aggacctcgt 4255081 gcatgtcgta ggccatgttg tccgagtccg gcacgatcga gtcgagttcc agatcgtggc 4255141 cggtgatttc gggttccagc ccggggttga cgaccggcgg tttgtcgaag cagttggacg 4255201 gcagaaacga cagaaagtcc cgcacgtact ggtatgcggc ggcctcggac tccaccacct 4255261 gatggatgtt gccgtagctc gcctggtggt cggcgccccc cagctcgtcg aggctgacgt 4255321 cctcaccggt gacgtccttg atgacgtcgg ggccggtgac gaacatgtaa ccctggtcgc 4255381 gcaccgccac caccagatcg gtctggatcg gcgaatacac cgctccccca gcgcatttgc 4255441 ccaaaatgat ggagatctgc ggcaccagcc cactgagcag ttcgtggcgg cgccccagct 4255501 cggcgtacca ggccagcgag gtgacggcgt cttggatgcg ggcgccgccg gagtcgttga 4255561 tgccgacgat cgggcagccg accatcgcgc accactccat cagccgggcc accttgcggc 4255621 caaacatctc cccgacggtg ccgccgaaca cggtttggtc gtgcgagaac acgccgaccg 4255681 gccggccgtt gatgaggcca tgtccggtga ccacgccgtc cccgtagagc gcgttggggt 4255741 caccgggggt gcggcacagc gctccgatct ccatgaagct acccggatcg accagctcgt 4255801 agatgcgggc gcgggcactc gggatgccct tcttgtcgcg cttggcggcg gccttctcac 4255861 cgccgggttc cttggccaac tccaggcgtt cgcgcagctc cgccagcttc tcggcggtgg 4255921 tatgcagaac cggctcggtg acggtcactg cttgcctacc tcacttgttc gatcggcctc 4255981 gatctgcccc aacgcgcggc tcatgtgttc gcccaccttg gcgatgatcg gctcgtcgat 4256041 ggcctgaatg tgctcgccac cgatcggcac cacctcgagg tcggaaacgt actcgcccca 4256101 cccgccgtcc ggctggcgca cggcgtagcg gggctcgaac atgatcgcgt cgtcatggta 4256161 gcgatcggcc atgtagaggg tgacatgccc gtcgtacggc tggatctggg cggtgtcgat 4256221 cgcccggttg tccagatacg acgtgcgttg gtgttcgatg atcccggccg ggatctgcac 4256281 accggactgg ctgacggcgt ccagcacgaa ccggacctgg ccctcgtcgt cgagctcctc 4256341 gagctgctcg tacgggatcg ccgggatggt cacgttgaac gtcttctcgg cgaaggcggc 4256401 gtagcggtcc cagcgcttgc ggatctcctc cttggtctgc gggatctcct caccggcgcg 4256461 caccgcgtcg atcagcccga cgaaccgcac gtccttgccc agccgccgca aaccgatcgc 4256521 gcacgcgtag gccagcacac cgcccagcga ccaacccacc aggacatagg gcccgtcgcc 4256581 ctgcatctcg atcagcttcg gcacgtactg ctgtgcacgc tcttcgatcg acccctcgac 4256641 ccgttcgaag ccatacattg gggtgtccgc cggcagccgg cccagcagcg gctcgtacac 4256701 caccgtcgag ccgccggccg gatgaaacac gaacaccggc accttcccgc ctgcttcggg 4256761 ccgcgcccgc agggtgcgga cgaacccatc gatctgcccg gcctccaaat acgtgcgcac 4256821 cttgtcggcc agcgcctcga tgttcgacga cgtcagcacg tcctcggcgg tgatcgggcc 4256881 ttcggcgcgc tcggaaagcc gctgcgcaat cttggccgcg gcctcgtcgt ccagcctggg 4256941 cagctcgttg aagatgccgc ccggggactt gccggtgacg atcgcccagg tggcgaaggt 4257001 gacccgctcg gcagcgtccc gcggcggcac gtcgacgttg agcgcgggcc ctgtcgggtt 4257061 tggctgctcg ccgttttgcg gcgacgggag cgcaaccccg gcttccgagt cgaccggctc 4257121 ggtcttgccc accttgccat gcagcaattc ggcctgggcc cgcgcgatct cctcagcggt 4257181 ctgggttttc tggtgctcgt gcagctgctg cacctcgtca cggtgctcga ccgcgtattc 4257241 gatcagcttc tccacgttgt agaggttggc gtcgcgcacc gcggtcagct ggatcggtgg 4257301 caggtcgaag tcgtactcga cgcggttttt gatgcgcacc gccatcagcg agtccaggcc 4257361 aagctcgatc agcggcacct cccacggcag gtcctcgggc tcatagccca tcgcagaccc 4257421 gacaatcagg cccagccgct cggcgatggt ctcaccggaa tcaggcgacc atcgggtcat 4257481 gccggacggc atgtaacggg tggtcaggct gtccgaaagc gtctcggcgt ccgcgtcttc 4257541 ggcgggcgtt tccggcgcga caggcgcccc gtccgcaacc gcgatcgccg tcgccgcacc 4257601 caccgcggtg ggcaacaccg attcggaccc cgctcgggac accagggcgt cgtagaccag 4257661 cgtgaaggac tcgtcgatgc gggcgtgcac ctgcaccgag gcgccgccgg ggtgacgggt 4257721 catcgtcgtc accagccggg cgccgtcgcc gggcaccgcg cgctgctcgg cggcggtcag 4257781 ttgcgcgtcc ggaagcacgt gggcggcggc ggccctgacc aacgcggcca agtccacatt 4257841 gccgtcccgc ggcgcgtact cccagacgtg ccgcccatcc ggcagggcga catgggtgcc 4257901 cggcatgtac gtcgagccgt cgccggagaa gtgcgcgggc agccagtgct ccttgcgctt 4257961 gaaccgggtc ggcggaatgt tcgcgtaatc ctgcggccca ctggcgcggc taaacagcgt 4258021 gcgtatgtcc aggtcgtggc cgtacacata cagctgcgcc atggtcgaga ccatcgagga 4258081 gacctcgtct tgcttgcggg ccagcgtcgg gatcaactgg gcgtcatgca gcccggcatc 4258141 ggcggtggtc agggcgacct gcatcagcgc caccggattg ggtgccagct ccaggaaggt 4258201 ggtgtgcccg ctgtcgacgg cgttgcggat gccgtgggtg aagtagacgg aatgccgcag 4258261 ccccttcttc cagtattcga cgtcgtggat gggttcgccg ccgggtttga tgtagcggcc 4258321 ctcgtgcacc gtcgagaaga tcccacacgt cgggctcgtc ggcttgatgc cttgcagctc 4258381 cgcggtgagc tcgcccagca gcgggtccat ctgcgaggtg tggctggcgc ccttggtcgc 4258441 gaatttgcgg gcgaacttgc cctcggcctc ggcgcgggca aggatcgcgt ccacctgctc 4258501 gggggggccg ccgatgaccg tctgggtggg cgcggcgtag acacacacct ccagatcggg 4258561 gaagtcggag aacacttctc tgatttcgtc ggcggagtat tccaccagcg ccatcaaccg 4258621 gatgtactcg ccgaacagca tcgcctcacc ctcgcccatc aggtgcgagc gcgagcagat 4258681 cgcccgggtg gcatcccgca gcgacagccc gccggcgaag taggccgacg cggcctcacc 4258741 cagcgactgg ccgatgaccg cggccggttt ggcgccgtga tggcgcagca gctcacccag 4258801 cgcgatctgg atcgcgaaga tggtgacctg ggtggtctcg atgccgtagt cctgcgcgtc 4258861 gtccaggatc agctccagca ccgagtagcc cagctcgtct tggaccaggg cgtcgacctt 4258921 ctcgatccac gccgcgaaca cctcgttgcg caggtacagg ctcttgccca tcttgcgatg 4258981 ctgggcgccg aatccggcga gcacccagac cgggccggtg gtcaccggcc cgtcgacgct 4259041 gaacacgttc ggcgcctgct tgcccgcggc gaccgcgcgc aggcccttga tggcctcgtc 4259101 gtggtcgtgg gccaacacca ccgcgcggga acggccgtgg ttgcgccgcg acaacgacct 4259161 gccgatcgat tccagcgagg aggcctggcc ttccgggctt tgcatccagt ccgccaactc 4259221 ggcggccgcc gccttcttgc gggacgtcag aaacgccgac accgccaacg ggaccaatgg 4259281 tgccgtaacc tcttgggccg caagctcttc caacgcggct tccttgagcc gcagcgcctc 4259341 ctcggtgact ccgggcagtt cgggctccgg ctcttcggcg accgccgagt cggtgatgat 4259401 gttgccgaac tcgtcgaacc gcagcgcgtg gcctgccaac gtgggcgcct cggcgggttc 4259461 ggcggccgcc ttgggttccg gctcgggttc cggttccttt tccaccacgt cacgcggcag 4259521 gacctcgcgc accaccacgt gcgcgttggc gccgccgaag ccgaagctgg acaccccggc 4259581 cagcgcgtag ccgccgtatc gcggccagtc ggtgggcgtg gtgatcatct tcaaccgcat 4259641 cgcgtcgaag tcgatgtagg ggctggggcc ggcgaagttg atcgacggcg gcagtttgtc 4259701 gtgctgcagc gccagcacca ccttggccat gctggccgcg ccggccgccg attccaggtg 4259761 cccgacgttg gttttcaccg cacccagcag cgccggccga tcggccggac ggcccctacc 4259821 gaccacccgg cccagcgcct cggcctcgat tgggtcgccg aggatggtgc cggtgccgtg 4259881 cgcctcgatg tagtcgacgg tgcgcggatc gatgccggcg tccttgtagg cccggcgcag 4259941 cacgtcggcc tgcgcgtcct ggttgggtgc gatcaggccg ttggaccggc cgtcgtggtt 4260001 gaccgcgctg ccggcgatca cggccaggat cgcgtcgccg tcgcggcggg cgtcgtcgac 4260061 ccgcttgagc accagcatgc cgccgccttc ggagcgggtg tagccgtcgg cgtcggctga 4260121 gaacgacttg atccggccgt cgggcgccag caccgcaccg atctcgtcga aacccagggt 4260181 gaccatcggt gtgatcaacg cgttcacccc gccggcgacc actacgtcgg cctcgccgtt 4260241 gcgcagcgcc tgcaccccct ggtggatggc caccagcgaa ctcgagcacg cggtgtcaat 4260301 ggtgaccgac ggtccgtgga agtcgtagaa gtaggacacc cggttggcga tgatcgagct 4260361 gctggtgccg gtgatcgcat acgggtgcgc gaccgtcggg tccgacaccg ccaggaagct 4260421 gtagtcgttg gtggagctgc cgatgtacac accgacggcc tggccgcgca ggctcgacgc 4260481 cgggatgcgg gcgtgctcga gcgcctccca ggtcagctcc agcgccatcc gctgctgcgg 4260541 gtcgatgttg tcggcttcgg tcttggccac cgcgaagaac tccgaatcga agcccttgat 4260601 gtccttcagg tagccgcccc gggtgcgggc cccggcgacc cgcgcggcca gccgcggctc 4260661 ttcgaggaat tccgaccagc gcccgtcggg caggtcggtg atcccgtcgc ggccttccag 4260721 cagcgcctgc caggtctgct cgggggtgtt catctcgccc gggaagcggg tggacaagcc 4260781 cacgatcgcg atgtcgacgc gctcggccgg gccggtgcgc gaccagtctt cggcgtcatc 4260841 gcccgctagg tcggtctccg gctcgccctc gatgatccgg gtggccagcg attcgatggt 4260901 cggatgcgcg aacgccaccg cgaccgacag cgtgaccccg gtcaggtctt ctatgtcggc 4260961 ggccatcgcg acggcatcgc gcgacgacag acccagctcc accatgggca ccgattcgtc 4261021 gatcgagtcc ggtgcctttc cgacggcctt acccacccag ttgcgcagcc actggcgcat 4261081 ctcggggacc gttagctcgg ccctttcggc gggggcgttc tcctgggatt ccgctacgtc 4261141 agccatgggt cctcagtccg aagtggcgaa gaccgtcggg gaacccacgc cactgcgcag 4261201 gctgccgtcg aggtaggccg cacggcaggc gcggcggccg atcttgccgc tggaggttcg 4261261 cggaatcgtg ccggccgaca ccagcaggac gtcacgcacg gtcaccccat gcccgacggc 4261321 gatggccgcc cggatgtcat cgacgatggg ctggtggtcg agcttatgcg tgccggccgc 4261381 ccgttcgccg acgatcacca gctgctcgga ggtgtcctcg gggtcgaatt tcagcccggc 4261441 gtgcgagtcg tcgaacactg tctgaggaag ctggttggcc ggaaccgaga aggccgccgc 4261501 gtagccaacc cgcaacgcct tggtcgactc ctgcgccgtg cactcgagat cctgtgggta 4261561 gtgattgcgg ccgtcgatga tgacgaggtc cttgatccgg ccggctatgt agaggtggtc 4261621 cttgaagtag gtgccgtagt cgccggtacg cacccacagc gcgtcgtctg gggcgccctc 4261681 ggcgcgcgac tcgctgatcc gcgatttgag gatgttcttg aaggtctggg cggactcttc 4261741 ttctttgccc caataaccgg tacccaagtt gttgccgtgc agccagatct caccgatctg 4261801 tccgtccggc agttcgctgg ccgtgtcggc gtcgacgatg accgcccatt cgctgacccc 4261861 gaccttgccc gcagagacct gggcgacggc gttgggtgca tcggcggcca cctcaacgaa 4261921 ccgctggttg ttcagctcgt cgcggtccac gtggatcacg gtgggcacct cgtccatcgg 4261981 cgtggtcgag acgaacagcg tggcctccgc tagcccatag gacggcttga cggcggtctg 4262041 cttcaaaccg tacggcgcaa atgcttcgaa gaacttgcgc atcgacgccg gcgacaccgg 4262101 ctcgctgccg ttgaggatgc ccttgacgtt gctcaggtcc agcggcggct cgtcgtctcg 4262161 aggcacaccg cgcaccgcgg cgtgttcgaa tgcgaagttc ggcgccgcag agaaggtgcc 4262221 accggtttct ccgggcttgc gggcgagctc gcggatccag cgaccgggcc gccgcacgaa 4262281 cgccgcgggc gtcataaagg tgaagctgtg gcctagcacc gacgccagca gcaccgtgat 4262341 cagacccatg tcgtggaaga acgggagcca gctgaccccg cggtcgcctt cctgtccttc 4262401 cagggcattg agcacctgca ccacattggt gggcaggttc agatgggtga tctgcacgcc 4262461 gctcggtatg cgggtggaac ccgacgtgta ctgcaagtac gcgacggttt cctcgttggc 4262521 ctcgggctgc tgccaggtgg cggcgacttc ggtgggcacc gcgtcgacgg caatgacgcg 4262581 cgggcgctcc ttggccgatc gggcccggat gaacttgcgg accccttcgg cggagtcggt 4262641 ggtggtcagg atcgtcgacg gggcacagtc gtcgagcacc gcgtgtaacc gaccgacgtg 4262701 ccccggctcg gccgggtcga acaacggcac cgcaatgcgg ccggagtaga gggcgccgaa 4262761 gaaggagatg aggtagtcca ggttctgcgg gcacaggatg gcgacgcggt cacccggctg 4262821 ggtgacttgc tgcaggcggg ctcccaccgc acggttgcgc gcgctgaagt cagaccacaa 4262881 gatgtcgcgc gcgacaccgt ctcgttcggt ggaaaagtcc aggaaccggt aggccagctt 4262941 gtcgccacga accttcgccc acttttcgac gtgacgaacc aggttggtgt tggctgggaa 4263001 cctgatcttt ccattcacga tgaacgggtt gtggtacgcc atcccactct ctcctgtcac 4263061 aaacatctcg gccggctctg ccggcggcca ccgggtgtcg gctccgccaa cgggttaccc 4263121 gcgcacatca acccctaccg cgctcacgtc ggcgaacgca gtttgcagcc agctttgacc 4263181 cgactgggtc ctgcacatgc tcttagtttt ctcttaatgt taagggccgg tgcctgacag 4263241 accaaatcac aaggtaccgc tgttcgaggc cgccatcaac gtacgcgggg cggtgtcgag 4263301 tcgcccggtt catcggtgga ccgccgccta gcgtccatcg tcctcgggga aatatcacct 4263361 atgtttgggg tggggcgcat tttcgataag ttgatgcgcc cagttcaacg tccactcggt 4263421 cgccggttct ccatcggaat tccagaattc gggtgtcgca tacatagcat ggaccggctg 4263481 gccggcgccg ccggccaggg tgttcagcgt agtcggcaag ttggcgggac tgaacgcctg 4263541 tgccggggcc gcacagatca ggtcgccctg ggcgcagatc tcgttggtcc ggccgtcgag 4263601 cgcaccaaaa ccgcccggcc gcgggccggt catagtcaaa ccaagcccgg acaacactgg 4263661 gacttcgtgc agggtgatct cggcgccttc gccgcgcggg ctaggcggga cctgattacc 4263721 caccccctgc tgacgacgac cgtcggcgat cagcgtcacg cctagtacta ggtcctcgtc 4263781 cacgggtccc cggccgttgc cgatatcgct agccacgtcg cccgcgatca ccgcgccctg 4263841 cgaaaacccg atcagcacat agctggtcaa cgggcacctg ttgttcatat cggtcatcgc 4263901 tgccaccatc gcgcgggtgc cctctgcccg gctgtcgttg tacgacatct gattatccgt 4263961 ggtcagcgga ttgtggaatt gggccgtgta ggcaactgtg taggtctgca cccgggcggg 4264021 tgcgaattgc tgggcgatcg gcccagttac cttgagcagc aacgccttcg gaaactgcac 4264081 cggattcagt gggttctgct gcggcgatga ctcccaggtt ccgggaaccg agatcatctg 4264141 cacgtcgggg caggacgcat cctggaaggc cggtcggggt ttgtgcggat gtgctggggt 4264201 gggccccggt ggtaaaactc ctggcggcac cgcgctgggc ggcgattcgg cgccgcgcag 4264261 catgatcacc acggccacga tgaccagcgc tacgacggac gccatcgcgc ccgccgctat 4264321 ccaggcaagg attcggtggc gcttacgccg agagttcttg gccatgttct cctgctaaca 4264381 gagtcggtag cgcacgcgaa aggggtgcac ccgcgccgcg cgatagcgcg gccatcccgc 4264441 ccgttgccgc actccctcta cggtaccggc ccgctacgcg gcttcgcccg agtcgcgatg 4264501 tcgtgcacgt ctgccgcaag gatcatccga tagcggccag gcagctcgca tcggcacctg 4264561 gcttagcgga tcgcaccgac gatatcgccc gacatagcgc ccagctgggg cgcccacgag 4264621 ccccagccgt tgtcaccgct ggctgggaag tcgaagtgtc cgttgtgccc gccgacgctg 4264681 cgatactggt tgtagaacat gcggctgtta cccatcgcct cggcggcttg gccgatcatg 4264741 gcggcgggat cgctggctcc cgggttggtc gggctccaca cccacacccg ggtgttgttt 4264801 tgcgccagca ggctggcatg cacccacggg tcgtgccact tccaccgacc cagctgtggt 4264861 gctccccaca ttccgttggt gtccacaccg ccgaattgct gcatgcccgc cgcgatcgca 4264921 ccgttggtgg tggtgttcga cgggtacaaa aagcccgaca tcgagccagc gaagccgaag 4264981 cggtcggggt ggaaggccgc cagcgccatc gccccgtaac cgccctgagc ggcgccaacg 4265041 gccgcatggc caccgggggc caagccccgg ttagcggcca gccagtcggg cagctcagcg 4265101 gacaagaagg tgtcccactg cttgctgcca tcctgctccc agttggtgta catgctgtac 4265161 gcaccaccgg ccggtgccac caccgaaatc cccttgcccg ccaacgtgtt catcgcgtta 4265221 cccgcggtga cccagttact gacatccggg ccggcgttga aggcgtccag cagatacacc 4265281 gcgtgcggcc caccggctag gaaggccacc gggatgtccc ggcccatcga gggcgacggc 4265341 accatcaggt tctcgtatgg ggcggccttg gcggtgggtt ccgcggctac cgcgacaccg 4265401 cccaacccga atgacagtgc ggcaatccag agcgcccgca gcagcgccga ccgacccttc 4265461 atgtgtccac ctccgtcgtg taaggctgtg tgcacccggc gtcagaccgc cccggccaac 4265521 ccctagcccg tcaggtagct aaccacacgg cccgcggcgg gagctaggga cgggatttag 4265581 gaaacatcta gcggcggcga ccacaagggt caccgccgct agatgttgtg tctgttcgga 4265641 gctaggcgcc ctggggcgcg ggcccggtgt tgggcgtggc acccagtgcc cgttgcaggt 4265701 cgggcttcat agcgttgagc tgcgcgcccc agtactccca gctgtgcgta ccgctgtccg 4265761 ggaagtcgaa cacgccgttg tggccgccac cggcgttgta ggcgtcttgg aacttgatgt 4265821 tgctggtccg cacgaagccc tcgaggaact tggccggcag gttgttgcca cccagatccg 4265881 acggcttgcc gttgccgcag tacacccaga cgcgggtgtt gttggcgatc agcttcccga 4265941 cgttcaacag cgggtcgttg cgctgccacg ccgggtcctc cttcgggccc cacatgtcgg 4266001 aggccttgta gccgccagcg tcacccatcg ccaggccgat cagggtggga cccatcgcct 4266061 gggaggggtc caacaggccc gacatcgctc ccgcgtagac gaactgctgg gggtgataga 4266121 tcgccagcgt cagcgccgaa gaagcagcca tcgaaagacc gacgacggcg cttccggtgg 4266181 gcttgacgtg cctgttggcc tgcagccacc ccggcagctc gctggtcagg aaggtctccc 4266241 acttgtaagt ctggcaaccg gccttgccgc aggcgggctg gtaccagtcg gagtagaagc 4266301 ttgactggcc acccaccggc atgaccaccg acaggcccga ctggtcgtac cactcgaacg 4266361 ccggggtgtt gatgtcccag ccgctgaagt cgtcctgcgc gcgcaggccg tcgagcaggt 4266421 acagggcggg cgagttggca ccaccacttt ggaattggac cttgatgtca cggcccatcg 4266481 acggcgacgg cacctgcagg tactccaccg gcaagcccgg ccgggaaaat gcccccgcgg 4266541 tcgccgtgcc accgacggcg ccgaccagac ccgacactag ggccgcgccg acggccccga 4266601 ccacgagtcg acgcgacata cccgtgacgg cgccacgaac cctgtcaaca agctgcattc 4266661 ttgcttccct catcctcatc tcaacgcatc catgcatgtt tgggcgcatc ctgaattagg 4266721 tcagactgca ggcgctgggc ccggcagtgc tcgtgtagtc aaccacaact tcgggcgtcc 4266781 acccgcatca agcgcaccgc cgaaaccctt atccggcggt cgttcacggc caattcggga 4266841 ccgacgcgac ggcctgaagg tggcatttcc gcagtgtctg ggcatgtgtc gaccgctagt 4266901 gccggctcaa ttgtgatctt gctgtcagta ttgcccccgc gctcattgcc cctcactccc 4266961 gcggtggcgg gccgggcccg tcgggaacat cgagcccaca ccggaccaat tcatagcgcg 4267021 gaacgcggtc gatgcggtaa cgggtgaact cgtaggaatg caacacgttg gagaggaatc 4267081 ggtgcagggt gatcggggcg cgcacagaat taagcaccgc ccgagtcgcg gggcattgca 4267141 gggccgcttc ggcttgcgtg acccactgct ggtcgatgta gccgggaata cccgggtacc 4267201 acttcaccca gggtccgtcg gcgatcaccc agtccgggaa cagattcttg tcatggccga 4267261 tacgggcatg cttcagccgc tcggtgtgcg cggccaatgg gtttaccagc ccgatttggt 4267321 cgatcacccg gacatcgagc ccgacgttca tgcctagcat gcccatgttg gtgaaaaaca 4267381 ctgcgtgctg cggtttcggc gccggcttgc cacccggcgc ggtccccgac gagggccgga 4267441 tcatcggcac caggtcccac tggttgtagt tgcccgacgg caatagcaac gccccttccg 4267501 gggtgttgtt gagcgctgta agcacggcag ccattcgcgg gtaatcgagg tagtccgcgg 4267561 cggtcagcgg atgcgcgtgc ccggtggcct gggcgtagaa gcggcgctcg tcgacgatgc 4267621 ccgaataggt gacccgggtg gcgtcgtcac ccatgcccgg cgagtttgcc gcccacagcg 4267681 accaacccgc gatccccagc cagagcccgc tgagcgcgcc gactagccag cgaccggtct 4267741 cccgcgaaaa gtccttaccg tcgggcagca aaataggaat gacccccacc ggggccagca 4267801 aacaaaacag cggcgccagc aacacccggc cgtgcataaa gtcgccgcct tgccgaatcc 4267861 agtacagcgc ctgcagcacg ccgctgccga cgatgaaagc caccacggcc ggcggacttt 4267921 gcaccgcccg ggccacccga ccgtagtcgg gtgccagcac gggacgcagg aacgacggcc 4267981 ggcggcgcgc cgtcatcaac agcaatccca gcggcaccga cagcaccaac ggcacccaca 4268041 gtgcgtacgg ccggttgaag ttcgacacgt agatcatgcc ttgcgaccac ttgtcgcccg 4268101 cggcatcctt ggccagcgcg gtactcggaa ccagcagtcc gtaatagccc atccggaaga 4268161 tctggtaggc caccggcaag aatccgccgg ccagcacgat cagcacgcgg cgacgccagg 4268221 tccgcgcggc gatcaacatc atgatcagcg ccagcccgcc gatcagcgcg aattccggcc 4268281 gcactagcac gctgcatccg gcgacgaagg ccaacgcgcc gaggaacatc tggctgtccg 4268341 ggcgggcccg cagcggctgt gaccagcaga ccatcatcca ccacaacagc cccagatagg 4268401 ccaacaccag cccgctctcc aggccggagg tggcgaagtc gcgggccggt ggcaccgcga 4268461 tatataccag cgccccggcc ggaagcatga tcgcccgacg gccccgcagg ctgggtgcgt 4268521 acaaccggcc ggtccccagc atgagcagca ccattcccag cagcgaaagc accatggcca 4268581 gggccaacgc cacgtactcc aggcgcatcg gcccgcccac ccagccgccc acatacagca 4268641 gatacgtcca cgctgtcgag gtgttcgctt cgactcgctc gccctggttg aagaccggtc 4268701 cgttgccggc caataggttg cgtaccgtcc gcaggacgat cagtccgtcg tcagcgatcc 4268761 agcgacgttg ccagctcccc cagccgaaca gcacggcgac cgccgtcacc gacagccaca 4268821 agctgacccg gaccatgggc tcatacggaa acaccggccg accgacccgc ccgaccaccg 4268881 gccggcgggg cagcaccccg actgggagga cgttgagctt gaggctagcc gaaggcaaca 4268941 gcggccccaa ccgttgctat ccacgccagc gccagcagct gcaatacccg gtcacgcagc 4269001 gcgatatctt ccggctcccc ggccaggccg ccatcgacgt ccaccgcgta gcgcaggatc 4269061 gcgatggtga acggaatcat cgacaccgcg aaccaggacc cgctgtagcc gtcgcgctcg 4269121 aaagcccaca gcccgtagca caagaccacc gcggtggccg acaacgtcca gacgaaccgc 4269181 agataggtgc tggtgtagct ttccagcgac ttgcggatcg cagcgccggt gcgttcggcc 4269241 agatgcagct cggcgtagcg cttgccggcc accatgaaca gcgaaccgaa tgccatgatc 4269301 agcaaaaacc acttggacag cgggattttg gtggccacgc ccccggcgat ggcgcggatc 4269361 aaatacgccg acgacacgac gcagatttcc accaccgctt gatgcttgag accaaagcaa 4269421 tacgccaact gcatggcgag gtagacgacc attaccagcg ccaggttcgg ggtcagcatc 4269481 caggcaccgg ccagcgatgt cactcccagt accaccgcca cggtgtacgc cagccactcg 4269541 ggcaccacgc cggcggcgat cggccggaac cttttggtgg ggtgctcccg gtctgcctcg 4269601 acgtcacgca catcgttgac gaggtacacc gccgaggcgg ccaggctgaa caccacgaag 4269661 gccatcgaca ccttgctgag cacctcgacg tagtcgtagc ggacaccgcc gcccaacgcg 4269721 gccagcggcg cggccagcac cagcacgttt ttcacccact ggcgcgggcg gatcgccttg 4269781 accaccccgg cgaccaggtt tgccggaggt tgagtcacca catcttcact catccgagct 4269841 catctcttcc gggccctttg ccggcccccg ccgacgctgt ccacgatggc cccgacggtg 4269901 gcgcccagag caacacccac ggccacatca ctggggtagt ggacccccag cagtattcgc 4269961 gacagcgcca tcggcggcac cagcacaacc ggtagcggca gcccggtggc tctgcccatg 4270021 agcagggccg cggccgtggt cgaggtggcg tgtgccgacg gaaagctcag ttgacttggc 4270081 gtgtccacgt tgaccgcgat ggccggatga tccggccgct gacgccgcac cagccgcttg 4270141 atcagcacgg cgatggcatg ggcgacgaac gcgcccgccc ccgccacaag ccattcccgg 4270201 cggcgccgtg gcagggctat cgcgcccagc agcgccagga tcagccaacc gatgcagtgc 4270261 tcgccgaagt gggagagtcc gcgcgcagtg gccagcatcc ccggacggtc gaccagcgcc 4270321 gactgcacgg ccaccatcac ggcgacttcg ccgcgtggcg cccgttcagc catgctcggg 4270381 ctcttggttt gccgccggca gcagcgccgt ctcccacttc tgcttgctgg acagcgtcgg 4270441 caacgcgtcg cgataaatcc ggcgcatctc ctcgaaccgt ttcagcaact ggcgctgacg 4270501 gcgcaacgac tgccacagca acgcgaacat cttggcccgg tcgcgctgcc ggtagaccac 4270561 gccgcatccg tcggccgtgg tgacggtggc cccgtcgaca gtgcacagca ggaaccagcg 4270621 cgcatcctgg gtcggaacgt tgaactccgg gcgacggtgg tgttgggggt tggcggcggt 4270681 caggttgtgc atgatcccgc gggccagccg gtagccgatg accaacgggt tcaccggcgg 4270741 cttcattgcc ttgttcttgt gcaacggcgg cggcaactca ctggccgccg gcagcaccac 4270801 cgcgtccgga tagctcttgc ggatgcggtg cacttgcggc agcgccgatt ccaggatcga 4270861 aaagatgtgc tcggggccgg cgagaaagtc gtcgatggcc ttgttctgga ttgccaccgt 4270921 cgaatattcc aggcaggcaa ggtgtttcag ggttgccttg agatggctgc ggaccaggcc 4270981 gatgacttgc gcctttgggc cgtcccagtg catggcggcc accaccagcc ggttgcgcag 4271041 atggaaatag gcctgccagt cgatggcgtc atccttatcg ctccaggcca tgtgccagat 4271101 cgccgcaccg ggcagcgtga cggtcggata cccgtgctcg gcggcccgca ggccgtaatc 4271161 ggcgtcgtcc catttgatga acaacggcag cggctgtcct agctcttcgg cgacctggcg 4271221 tgggatcatg cacgtccacc agccgttgta gtcgacatcg atacgccggt gcagcaactt 4271281 gctacgggag ttgttgtcgt tcaacgggta ttcggcgaag tcgtggtcat actcggcatg 4271341 cggcgcggcg gtccacatga atatcgaccg gtctacgact tcgcccatga tgtgcaggtg 4271401 cgacggctcc tgcaggttga gcatctgacc acccaccagc atcggcgcct tggcgaaccg 4271461 gtgcatggcc agcacccgca gaatcgagtc cggctcgagg cggatgtcgt cgtccatgaa 4271521 taggatctgc tgacagtcgg tgtttttcag tgcctcatac atcacccggc tgtagccgcc 4271581 ggaaccgccc aggttgggct ggtcgtggat ggagagccga ctacccaatc tcgcagccgc 4271641 ggcggggaaa tccgggtggt cgcgcacctt gcgctcaccc tgatcaggca cgatcaccgc 4271701 cccgatcacc tggtccacca gcggatcggc ggtgagttct cgcagcgcgt tgacgcagtc 4271761 tgcggggcgg ttgaacgtcg ggatgccgac cgcgatgttg gccgtccccg gagcggggct 4271821 ggtggcatac cagccaccac tgtgcagggt gaccgcggtg tcggtggtga tgtcgaacca 4271881 gacccacccg ccgtcttcga aaggctgcag caccacttcg gtctccacgg cggctggctg 4271941 atcctcggtg ccggtgaagt cgtggccctc aacgaagatc cgggcaccgg tggccttggt 4272001 ccggtagacg tctacccgcc cggcgccggt cacctgcacg cgcaacacca ccgatttgca 4272061 cgtcgtccaa cgtcgccaat agctagccgg gaaagcgttg aagtaggtgg cgaacgacac 4272121 ctcggactcc gcgccaatct gtagcgaggt ccgggttggc gcatgcgcgc gccgggcgtt 4272181 ggtcgttgac tcctcgaggt acagcttgcg cacgtcaagg ggttcacctg ggcgcggcag 4272241 gatgacccga gacagcaggc tcgcggcgag ttcactcatg cgccgtcctg aagcagtggg 4272301 acgccgtcgc gcagatgcgg cgcgaggacg ttgtcgtaca tgttcaaggc gctggcaatg 4272361 gccatatgca tatccagata ttggtaggtg cccaaccggc cgccgaacag taccttcgat 4272421 gacgcggtct cggacttcgc cctggcccga taggtggcca acagggcgcg gtcagcctcg 4272481 gtgttgatcg gatagtatgg ctcgtcgtcg tcctcggcga accgggagta ttcccgcatg 4272541 atcaccgttt tgtccgttgg gtagtcacgc tcggggtgga agtggcggaa ctcgtggatg 4272601 cgcgtgtagg ggacgtcgag atcgttgtag ttcatcaccg cggtgccctg aaagtccccg 4272661 atcggtagca cttccacctc gaagtccaag gtgcgccagc ccaatcggcc ttcggcgtag 4272721 tcgaagtagc ggtccagcgg gccggtgtaa acgaccgggg ccgccgggct gccggggcgc 4272781 agctggccgc gcacgtcgaa ccagtcggtg ttcagcctga cctcgatgcg gtggtcagcg 4272841 gccatgtttt gcaaccacgc cgtgtacccg tcggtcggca aaccctcgta agtatcgctg 4272901 aaataccggt tgtcgaaggt gtagcgcacg ggaagccgcg tgatgttggc ggccggaagt 4272961 tctttggggt cagtctgcca ttgcttggcc gtgtacccct tgacgaacgc ttcgtagagc 4273021 ggccggccga tcagcgagat ggccttctcc tcgaggttct gcgcgtcggc ggtgtcgatc 4273081 tcggcggcct gctcggcgat cagctggcgg gcttgctcgg gcgtgaagta cttgccgaag 4273141 aactgcgata ccaggccgag ccccatcgga aactgatatg cctgcccgtt gtgcatcgcg 4273201 aagacccggt gccggtagtc ggtgaagtcg gtgaactgcc gcacgtagtc ccacactctc 4273261 ttattagagg tgtgaaacag gtgcgcaccg tacttgtgga cctcgatgcc ggtctgtggc 4273321 tcggcttcgg aataggcatt gcccccgatg tgcgggcgcc gctcgaggac gagcacgcgc 4273381 ttgtcgagtt gggtggccac gcgctcggca atcgtcaggc cgaagaatcc tgagccgacg 4273441 acgaaaaggt caaaacgagc ggtcatcggt tgcatagggt aaccgacctt gctggcaaaa 4273501 cccgatttgg cagctcgtgg cggtcatggc ccgaacgggt ttcaccgcag gtgcgcatgg 4273561 ccgaccagtg tggttggccg gaggtcgttt ggtcgcgatt gcctcacgat tcgatataac 4273621 cactctagtc acatcaacca cactcgtacc atcgagcgtg tgggttcatg ccatgcactc 4273681 gcgaccgcgg gagccggcga acccggcgcc acacataatc cagattgagg agacttccgt 4273741 gccgaaccga cgccgacgca agctctcgac agccatgagc gcggtcgccg ccctggcagt 4273801 tgcaagtcct tgtgcatatt ttcttgtcta cgaatcaacc gaaacgaccg agcggcccga 4273861 gcaccatgaa ttcaagcagg cggcggtgtt gaccgacctg cccggcgagc tgatgtccgc 4273921 gctatcgcag gggttgtccc agttcgggat caacataccg ccggtgccca gcctgaccgg 4273981 gagcggcgat gccagcacgg gtctaaccgg tcctggcctg actagtccgg gattgaccag 4274041 cccgggattg accagcccgg gcctcaccga ccctgccctt accagtccgg gcctgacgcc 4274101 aaccctgccc ggatcactcg ccgcgcccgg caccaccctg gcgccaacgc ccggcgtggg 4274161 ggccaatccg gcgctcacca accccgcgct gaccagcccg accggggcga cgccgggatt 4274221 gaccagcccg acgggtttgg atcccgcgct gggcggcgcc aacgaaatcc cgattacgac 4274281 gccggtcgga ttggatcccg gggctgacgg cacctatccg atcctcggtg atccaacact 4274341 ggggaccata ccgagcagcc ccgccaccac ctccaccggc ggcggcggtc tcgtcaacga 4274401 cgtgatgcag gtggccaacg agttgggcgc cagtcaggct atcgacctgc taaaaggtgt 4274461 gctaatgccg tcgatcatgc aggccgtcca gaatggcggc gcggccgcgc cggcagccag 4274521 cccgccggtc ccgcccatcc ccgcggccgc ggcggtgcca ccgacggacc caatcaccgt 4274581 gccggtcgcc taagccccgg gtcggccgaa aacgcacccg cggccaaggc gtcggtcatt 4274641 gcttcggccc gtcacaatta ctcgcctaag ggtcgctagg tgttctcgag agttttatcg 4274701 caccgattcc gtgtcgtctc attaatacca atagaaacac acgtaacatc agctggtgcc 4274761 gtcccgcacc cgcgcgccga cgacgctgct caccgcgatg gcagcgaccg tcgtcatcgt 4274821 cgcgtggata gcgaatcgtc cacccgccag ctcccatgaa ccatcgccga cgcccaacac 4274881 ccagctcgcc gagcagccac tgatcgggct cggcggcggc gtcacggtac gcgaactcac 4274941 ccaggacaca ccgttttcat tggtggcgtt gactggcgac ctggccggta cctccgctcg 4275001 tgtgcgcgcc aagcgcccgg acggtgactg ggggccgtgg tatcagaccg agtatgaaac 4275061 cgaaccacgc gatccggcgg gcaccgacgg gtccgtggaa cttggaggac tcaatccggg 4275121 tccccgtagc accgatccgg tgttcgtggg caccaccacc accgtgcagg tcgcggtgac 4275181 tcgcccgatc gacgcaccga taactcaacc gccggcgggg cggccgccca acgacttgct 4275241 cgacagcggt ttgggatacc gtccagccac caaggaacag ccattcgggc agaacatctc 4275301 cgcgatcctg atctcgccgc cgcaagcgcc gcccggaacg cagtggacgc caccaaccgc 4275361 agtcaccatg gcaggccagc cgccggccat catcagccgg gcggaatggg gcgcagacga 4275421 gtcactgcga tgcgaaacac cggagtacga caggggggtt cgtgccgcgg tggtccacca 4275481 caccgcgggg agcaacgact actctccgct ggagtccgcc ggcatagtca aagccatcta 4275541 cacttaccac agcaagaccc tgggctggtg tgacatcgcg tacaacgccc tcgtcgacaa 4275601 gtacggccag gtgttcgagg gtagcgccgg cggcctcacc aagccggtcg aagggttcca 4275661 caccggcgga ttcaaccgca acacctgggg ggttgccatg atcggcaact tcgacgatgt 4275721 ggcccccacg ccgatccaga tccgaaccgt cggccggctg ctcggctggc ggctgggcat 4275781 ggacgacgtc gatcccagga gcatggtgga tctgcagtca gcgggtagct cgtacaccac 4275841 gtttccgggt ggcgccatag cgcgattgcc cgccatcttc acccatcgcg acgtcggcaa 4275901 caccgactgt ccgggcaacg ccgcctacgc tgtgatggac gagatccggg acatcgcagc 4275961 acatttcaac gacccgccgg aggagctgat caaggcgctg gaaggcggcg cgatctatca 4276021 gcgctggcag gcgttgggcg gcatgaacag cgcgctgggt gcaccgacct cgccggaggc 4276081 cgacgccgcg gatggggcgc ggtatgcaac cttcgctaag ggcgccatgt attggtcgcc 4276141 ggtgaccgac gctcagccga tcacgggggc aatctatgag gcctgggctt cgcagagcta 4276201 cgaacgcggc ccgctgggac tgccgaccag cgcggagatc caggagccgc tgcagatcac 4276261 gcagaacttt caacacggaa ccttgaactt cgagcgcctc accggcaatg tcaccgaagt 4276321 cgtcgacggg atcacgacgc cactggcgac gcggcccccg agcggcccga cggtgccgcc 4276381 cgaacacttc acgctgccaa cgcatccgat cacctgagtc gcgggtgtgc actattcaca 4276441 ttatgtgtgt gcacttttca cattctggct tttgcggcgc ggaatcgccg gcgcatagac 4276501 accctgtgcc attaggctcc atttgccggg ctgatcaccg ggtcgccgca ggccagtcga 4276561 gaggaacaac gtgtcgttcg tggtcacagt gccggaggcc gtggcggctg cggcggggga 4276621 tttggcggcc atcggctcga cgcttcggga agcgaccgct gcggcggcgg gccccacgac 4276681 cgggctggcg gccgcggccg ccgacgacgt gtcgatcgct gtctcgcagc tgttcggcag 4276741 gtacggccag gaatttcaaa ccgtgagcaa ccaactggcc gcgtttcata ccgagttcgt 4276801 acgcacgttg aaccgcggcg cggcggcgta tctcaacacc gaaagcgcta acggcgggca 4276861 gctgttcggt cagatcgagg cgggacagcg cgccgtttcc gcggccgcgg ccgccgctcc 4276921 gggcggcgca tacggccaac tcgttgccaa cacggccacc aacctggaat ccctctacgg 4276981 cgcatggtcg gccaacccgt tcccattcct ccgccagatc atcgccaacc agcaggttta 4277041 ctggcagcag atcgccgcgg cgctcgccaa cgccgtccag aacttccccg ccctggtggc 4277101 gaatttgcca gcggccatcg acgcggccgt ccagcaattc ctggccttca acgcggcgta 4277161 ctacatccaa cagattatta gctcgcagat cggcttcgcc cagctattcg ccacgacggt 4277221 cggtcagggg gtcaccagcg tcattgccgg gtggcccaac cttgcggcgg agcttcagct 4277281 agcgtttcaa cagcttctgg tgggtgacta caacgccgcg gtggcgaacc tgggtaaggc 4277341 catgacaaac cttctggtca ccgggttcga caccagcgac gtgacgatcg gcacaatggg 4277401 caccaccatt agtgtcaccg cgaaacccaa gctgctgggc ccgctgggag atctgttcac 4277461 catcatgacc atcccggcac aagaggcgca gtacttcacc aacctgatgc ccccctccat 4277521 cctgcgagac atgtcgcaga acttcaccaa cgtgctcacg acgctctcca acccgaacat 4277581 ccaggcggtc gcttcgttcg atatcgcaac caccgccggg actttgagca ccttcttcgg 4277641 ggtgccattg gtgctcactt acgccacatt gggtgcgccg ttcgcgtcac tgaacgcgat 4277701 tgcgacgagc gcggaaacca tcgagcaggc cctgttggcc ggcaactacc taggggcggt 4277761 gggtgcgctt atcgacgccc cggcccacgc gttagacggc ttcctcaaca gcgcaaccgt 4277821 gttggatacg ccgatcctgg tgcccacggg gctcccgtcc cctctgcccc cgacggtcgg 4277881 gatcacgctg cacttgcctt tcgacgggat tctcgtgccg ccgcatcccg tcaccgcgac 4277941 gatcagcttc ccgggtgctc cggttcctat tcccggtttc ccaaccaccg taaccgtttt 4278001 cggcacaccc ttcatgggaa tggctccgct gctgatcaac tacattcccc aacagctcgc 4278061 cctggcaatc aaaccggcgg cttagcgcgg cgtggcccgt tggttggtgt cgtaggttgc 4278121 catgccaagc tccaaccatg cggttagcag ccgctgatct gccgccgcgg ccacaacctc 4278181 gtcgtcatcg agttgctcgg ccgatgcgca gtgcaccgcg tcgtagccac gcatgggcca 4278241 ggtcagccgc gcgggtcacg acctgctcat ccacctcgat ggcgtccatc tcggaccaca 4278301 tctggtcacg gttcgcccgt cgcgaatctg cgcgaggggc cggctcagtc acgcactccc 4278361 gagccacaaa ggcgccgggt cacgtgggcc atgctaggac caccagcgct ccagcacccg 4278421 cgcgacgccg tcctcgctat tgggtgcagt gacctcgtcg gcgacggcca gcgcgtcggg 4278481 atgcgcgtta cccatcgcca cacccaaacc ggcccgcagc agcatcggca cgtcgttggg 4278541 catgtcgccg aacgccacca cctccgcgtc ggaaattcca agcggccggg caatctcgtc 4278601 gacaccggtg gccttgctga taccgagcgg cacgatctcc accagcccgt tattggtcga 4278661 gtaggtgata tcgccctcga aaccgacatg cttagccagt tcggcggcca tgtcggcact 4278721 ggcagcaccg gctttacgga tcagcagttt gatcgccggc gcgctgagca ggtggtcgat 4278781 cgacacttcg gtgttgtccg gattcagcca cgcatgctcg tagcccggcg agctgacgaa 4278841 ctggggggtc gccgtgtcgt gtgcgcgctc gccgatccgc tcgaccgcca gtcccgcacc 4278901 cggtatgacg cgggtcgcaa cttcggccaa cgttgccagg gcgtcgacgg gcagggtgcg 4278961 caccgacatc acccgatcgg tcccggggtc gtagatgacg gcgccgttgg cgcacaccgc 4279021 catcggcgcg aagccgaggg catcgacgat gggtcgcacc cagcgcggcg gccggccggt 4279081 ggccaggatg aagtgcgtgc cggcgtctac cgcggcatgc accgcgtcgc gagtgcgttt 4279141 ggtgacggtt tctccgtcat cgagcagggt tccgtcgacg tcacacgcga cgagcgccgg 4279201 cacagtcggt ttcaaagttg gctggcttgt cagtgcgggc cgacttggct gcgccgtgat 4279261 gaggtcacgc cgtcgtatcc gcgcttttgc cgccgcttcg ccaattcagc gattctgagc 4279321 tgcctggact cctccaccgt cggcgcgccg ccgcccagcc gccgcggcac ccagtgctcc 4279381 cccttgggat gtggatactc ctcctgtacg cggtagagaa tcgcattcat cgcttggcgc 4279441 agcacggcat tgagctgctc ggcattgccc tccggccgca ccggcgatcc gatcgccgcg 4279501 acgatcggaa tcttgttgcg gaacaggttc tttggatgat ccttgggcca gatccggtgc 4279561 gcgccccaga cgatcatggg aataatcggc acctgcgcct ccagcgccat ccgggccgct 4279621 ccggtcttga actcgcgcag ttcgaggctg cggctgatag tcgcctccgg gtgtaaccca 4279681 acgagttccc cggcccgcaa ccgctgcact gccaccgcgt acgcatcggc ccccacactg 4279741 cgatccaccg ggatgagctg ggcatgcttg atcacgtagt tgaccgcccg tacgtcttgc 4279801 atctcggcct tgatcatgaa ccgcagccgc cgccgccgat ggtgggcggc gatcgatgcc 4279861 ggaacccagt ccacgtagct cgtgtgattg agtgcgatca acgcgccgcc acgttcgggg 4279921 atgttctcca ggccttcgaa tgtgatcttg tttccgttgg ccgcgacgat cgacggaaca 4279981 agaatctcca tcatccggaa gaacggctca gccatgtatt ctccttcacc tcttaccgcg 4280041 attcatgcgg tgtccggcta gcggcccttg ccgccgcctc gtcagcctcc atccgtgccg 4280101 cctcggccag cgttggcgcg ccgccgccga gtcggcgggg cacccagtac gccccagccg 4280161 gatgcggata ccgctcctgc gcttgccaca gcagcgcggt catcgactca cgcagcgccg 4280221 cgttggtctg ttcgatgcct gccgcggccc gcagcggccg acccacctgt accgtgaccg 4280281 gcaccttggc gcgtcctatc tgcctgggat ggtccttggt ccagatccgc tgagcacccc 4280341 agacaacgac gggcacaatc gggacatccg cttccgcggc cattcgggcg gcccccgtct 4280401 tgaacccttt gagctcgaag ctacggctga tggtggcctc cgggtagacc ccgaccagtt 4280461 ccccttcgcg cagccgctgc accgccaccg cataggcgct accgccggcg ccccggtcca 4280521 ccggaatggt ccgggtgtgc ctgatcagga agttgaccaa ccgcacccgt tgcatctcgg 4280581 ccttgatcat gaacctcatc cggcgacgcc gacgatgcat ggccaacgcg gccggcagcc 4280641 aatcgacata gctggtgtga ttgatagcga ccacggcgcc gccttggtcg ggcacattct 4280701 cctcgccgac gtaggtgatc cgggttccgg tggccagcac cagcaactgg gccaggatct 4280761 ctaagacgcg ataggtcggc tccgccatcg gtcactgctc cggcgccccg gcgggatggg 4280821 ctcgctgagc gcggcgcgca gccctaaccg ccgcctcctg cgcgtccaac cgggccgcct 4280881 cggcaagcga cggggcgccg ccgcccagcc ggtgcggcac ccagaactcg ccggccggat 4280941 gcggtccgta cagttcttgg gcccgctcca gcaaatgttg catccgggag tgcagcaggc 4281001 cgttcagttc agcggtgggc agcgtcggtt cgatccgttc accgacgaca atcgtgaccg 4281061 gcaccttcgg gcgaaacagc tttttgggac ggtccttagt ccagatccgc tgcgcacccc 4281121 aaacaatatg cggaacgatc ggcaccccgg cctcgatcgc cattcgggcc gcccccgtct 4281181 tgaattcctt gatctcgaag ctgcggctga tggtcgcctc ggggtacacg ccgacgagtt 4281241 cgccggcctt cagcatcctg acggcggcgt cgtaggacgc ggacccgtcc tgccgatcca 4281301 ccgggatgtg gcgcaggctg cgcataatgg gaccggtgat cttgtgatcg aacacctcct 4281361 gcttggccat gaaccgcacc ttgcgcccga ggccctgttg gtaggcgggc aaacccgcaa 4281421 aggtgaagtc gaggtagctg gtgtggttga tcgcgacgac ggcgccgccg ctggtcggta 4281481 ggttatccac acccgtgacg gtgatcttca gaccctgtat gcgccaggac aagcgagcaa 4281541 gccgaatgac ggtgccgtat accggttcca cagcagttca gcctagtggt cccggctgca 4281601 agccgcccaa agtggcgaaa acccaaattg acgaaagagg tgagccgtgt ccttcccctc 4281661 atcgccaccc gcgctgcccg cgatcgttgc ccggtttgcc gtcggcaggc cggtgcgcgc 4281721 ggtgtgggtc aacgaactgg gcggcgtcac cttccgggtg gactccggca tgggcgccgg 4281781 ctgcgagttc atcaaggtcg ccaggagggg taccgccgac ttcgctaatg aggcgcggcg 4281841 gctgcgctgg gccgcgccgt acctggcggt gccgcgggta ctgggtgtcg gggtcgacgg 4281901 cgattgggcc tggttgcaca ccgatgcgct gcccggcttg tccgcggtgc acccgcgctg 4281961 gcgggcgtcc ccgcaggtcg cggtcccggc gctgggtgcg gggctgcgca ccctgcacga 4282021 cagcttgccg gtgcactcat gtccgttcga ctggtcgacg gccagccggc tggccaagct 4282081 ggccccggcg cgacgcgcgg aactgggtga ctcaccgccg gttgatcggt tggtcgtctg 4282141 tcacggcgac gcgtgctcac ccaacaccat cctcgatgac accggccgct gttgcggaca 4282201 cgtcgacttc ggcaatctcg gtgtggccga tcggtgggcc gacctcgcgg tcgcgacgct 4282261 gtcgttgcaa tggaactttc ccgactaccc gggccaggtc agagatgacg agttcttcgc 4282321 cgcctacggt gtggcgccgg acccggctcg catcgactac taccgccggc tgtggcaggc 4282381 cgaagacgac agctcacgct aagctcgagg ctgcgctttg cgctcgtaag ctcttccgaa 4282441 aggtagctgt gcaggtcaca agcgttggtc acgccggctt tctgatccag acccaggccg 4282501 gcagcatcct gtgcgaccct tgggtcaatc cggcctactt tgcgtcttgg tttccgttcc 4282561 ccgacaacag cgggctggac tggggcgctt tgggtgagtg cgattatctg tatgtctcgc 4282621 acctacataa ggaccacttc gacgcggaaa atctacgagc gcacgtcaac aaggacgccg 4282681 tcgtgctgct gcccgacttt ccggtacccg acctgcgaaa tgagttgcag aagttaggat 4282741 ttcatcggtt cttcgaaacc accgactcgg tcaaacaccg cctgagggga cccaacggcg 4282801 atctcgacgt gatgatcatc gcactgcggg cccccgccga cggtccgatc ggcgactcgg 4282861 cgctagtcgt tgccgacggc gaaacaacgg ctttcaacat gaacgacgcc cgcccggtcg 4282921 atttggacgt gctggcatcg gagttcggtc acatcgacgt gcatatgctg cagtactcgg 4282981 gcgcgatctg gtacccgatg gtctacgaca tgccggcgcg cgcgaaggat gcgttcggcg 4283041 cccaaaagcg gcaacggcag atggaccgtg ctcgccagta catcgcgcag gtgggagcga 4283101 cgtgggtggt gccgtcggcg gggccgccat gctttttagc ccccgagctg cgccacctca 4283161 acgacgacgg tagcgatccg gccaatatct tccccgacca gatggtgttc ctggatcaga 4283221 tgcgggcgca cggccaggac ggcgggctgc tgatgatccc cggctcgact gcggatttca 4283281 ctggtacaac cctgaattca ttgcgccatc cactgcccgc cgaacaggtc gaggccatct 4283341 ttaccaccga caaagccgca tacatcgctg actatgccga ccggatggcg ccggtgctcg 4283401 ccgcgcaaaa ggctggctgg gccgccgccg ccggcgagcc actgctgcag ccgctgcgca 4283461 ccctgttcga gccgatcatg ctgcaaagca acgagatctg cgacggcatc ggatacccgg 4283521 tcgagctcgc catcggtccc gaaaccattg ttttggactt tccgaaaaga gctgtacgag 4283581 aaccgattcc cgacgagagg ttccgctacg ggttcgcgat cgcgccggag ctggtgcgca 4283641 cggtgctgcg cgacaacgaa cccgactggg tcaacaccat cttcttatcc acccgatttc 4283701 gggcatggcg ggttggtggc tacaacgaat acctttacac gttcttcaag tgtctgaccg 4283761 acgaacgcat cgcctacgcc gacggctggt tcgccgaggc ccacgatgac tcctcatcga 4283821 tcaccctgaa cggttgggag atccagcgcc gctgccccca tctcaaagcc gacctatcga 4283881 aattcggtgt ggtggaaggc aacacgctca cttgtaacct gcacggctgg cagtggcgtc 4283941 tggacgacgg tcgctgcctc accgcccggg gccatcaact acgcagttca cggccatgat 4284001 gcagttctac gacgacggcg ttgtacagct ggatcgtgct gcactcacgc tgcgccgcta 4284061 tcattttcct tcgggcacgg ccaaggtcat cccactggac cagatccgcg gatatcaggc 4284121 tgaatcgctg ggctttttaa tggcccggtt caatatctgg ggcaggccag accttcgccg 4284181 ctggctgcca ctggacgtgt accggccgct gaagtcgacg ttggtcaccc tcgacgtacc 4284241 ggggatgcgg ccgaaaccag cctgcacgcc cacgcgcccc aaagaattca tcgcactgct 4284301 ggacgagttg ctcgccctcc accgaacgtg aacccacggt ttcgcgcgcg attttcgcac 4284361 tgccctgggg cacagcctca ctccagactt aagccacagc gacgatccaa gcgacgtgtc 4284421 atgtgcctgg tttaagtgtc gcgagcgtgc cgtcggcggt gcggatatag atggatttca 4284481 tggccgcgat gtaattggcg acggattcgc ttgcgatcgg gttgtccggg aataataccg 4284541 tcactgtggt ctgatgctga taccgattga cccacatcga gacctgatga gaaaccctac 4284601 cttcgtcgta aatcctaaaa ttcagatcgg aattagcgac cgtagaaaga ggcgcaatgc 4284661 tggcatccag aaaggacatc acgaaattgc ccggccgggg cggcctcagc cccgtttcgg 4284721 ggcgtgccag ctccaatacg cggtcgaatg gtacggtcgc caggtcctta cccgaatcga 4284781 aggagatctg cgcgacacgg gcggcgctat cgaaaagtcc tgaggcgacc ggcacggtga 4284841 tcggcaccaa cccggtaaac cagcccgtcg ttctgagttc tgtcggcgtc ctacgtgtat 4284901 cagtcgtcgt taccacgtca aacgtttcac agttggtcaa ctcgcgctca gcgagggcgg 4284961 cgcaggcgaa aacgccaccg ctaaaacggg cgcccgcagc gacgcaggcg gcttcgaatc 4285021 gctcgccctg ttgctcgtcc atcagcgttt cggtaagcag ctttccggta tggggcaccg 4285081 atagatcgcc gagcggcaac gggaagtgcg gcagggttcc gtcgttgttg gcagcgaatt 4285141 cgacccaacg gcgcacccgg gcggagtcca acgtcaaggc ggccgtgtcg gcgtactgtc 4285201 ggacacagtg gtcgtcgtag cggcccgccg gcgggagctc gatcggcggg tcgcctccca 4285261 ccaatgcgga gtacatcata tggatctcga tgaaaaggac gcccacaatc atcggatcga 4285321 cacagagatg agcgatactc gcatagaagg tgaagtgatc gtcactctga ataatcccga 4285381 acaagaagca gtcccactgc aacggctgcg gcgttgcaat gtggtggcgc agctccgccg 4285441 acgtcatgtt ctgatgctca gcttggacga cttcgatatc tgcagggtca gcgatggtat 4285501 gccgaacgat gtgttcggca ttgtcgaact caaaccaact gtggtaggtg tcgtggcggc 4285561 gaaggtgtgc gttgatcgca taattcatgg cgcggatgtt gcaccggcca ggtagatccc 4285621 aggtgaagat catcaggcgc gacatatcga gaccgcgcgc tacatgatcg cgataacgtc 4285681 gaaggtgttg agcttgttga tagctgggcg gcacctcact tatcggcgct tgccgggctt 4285741 tcgccttcgc cgtcggtgat gcgtgccaac agataatcga acctgggtcc ggcgtccagt 4285801 cgcggagcgt tgtaatgcta aacactcatt cctcctgcac tcggaccgag ccccgccagg 4285861 gcacgcaagt aagctacggc cagacggtgt gacactcaaa ccggcgggcg taatttcctc 4285921 cgacgacgct ccgcagacca caatcgtcag cggcggagta cggttgctca ccatgtggtc 4285981 caccgtgctg gtcttggcgc tctcggtgat ctgcgagccg gtacggatcg gtttggtggt 4286041 cctcatgctc aacaggcgcc gcccgctgct ccatttgctc acattcttgt gcggtggtta 4286101 cacgatggct ggtggcgtgg ccatggtgac gcttgtggtc ctcggggcca ctccgttggc 4286161 cggacatttc agtgtggccg aggtacagat cgggaccggg ctgattgcct tgcttatcgc 4286221 gtttgcgctg accacaaatg tcataggcaa gcatgtccgg cgagctaccc acgcccgcgt 4286281 cggagacgac ggtggcaggg tcctacggga gtcggtaccg ccaagtggtg cgcataagct 4286341 ggctgtgcgt gcacgttgtt ttctgcaggg cgattcgctg tatgtcgccg gggtgagtgg 4286401 cctaggagcc gcactgcctt cggccaacta catgggcgcg atggccgcca ttcttgcctc 4286461 cggcgctacg ccggcaacac aggcactggc tgtcgttacg ttcaacgtgg tggcattcac 4286521 agtggccgaa gtccccctcg tcagctacct ggcagcaccg cgtaagaccc gcgcgttcat 4286581 ggctgcgctg caatcatggc tgcggtcccg tagccgccgc gacgccgcgt tgctggtggc 4286641 cgccggaggt tgcctgatgc tcacgctagg cctgagcaac ctgtaggcgg cggcgggctt 4286701 gcctaacgca gagctctcac atgaaatgtc caggcgtctc cgactgcgtt gcgaccgtaa 4286761 ggcacgataa cgtgtttgct attgctgctg gtttgcgttg gtcggccgct gtaccgccgc 4286821 tacacaaagg ggacgctgtg accaaactgc tcgtcggggc catcgcgggc ggaatgctag 4286881 cttgcgcagc tatattgggc gacggaatcg cttcggccga tactgcgttg atagtacccg 4286941 gtaccgcacc gtccccgtac gggccactca ggtcgctcta tcatttcaat cccgcgatgc 4287001 agcctcagat cggcgcgaat tactacaacc ccaccgctac ccgccacgtc gtttcatatc 4287061 caggcagctt ttggcctgtc acaggcttga attcgcccac cgtcggcagt tctgtcagtg 4287121 ccgggacgaa caatctcgat gcggcgatcc gcagcactga cggaccaatc ttcgtggccg 4287181 ggttatcaca gggcacgctc gtgcttgacc gcgagcaggc acggttagcg aatgacccga 4287241 cggctcctcc ccctgggcaa ctcacattca tcaaggccgg cgaccctaac aatcttcttt 4287301 ggcgggcgtt taggccggga acccacgtgc cgatcatcga ctacaccgtt ccggccccag 4287361 cggaaagcca gtacgacaca atcaatatcg tgggccagta cgacattttt tctgacccgc 4287421 ctaatcgtcc gggcaaccta ctcgctgacc tcaatgcgat tgccgcgggc ggatactacg 4287481 gccacagcgc caccgcattc tcggacccag ctcgcgttgc gcctagggac attacgacga 4287541 caacgaacag tttgggtgcg acgaccacga cctacttcat ccggaccgat cagctacctc 4287601 tggtgcgggc gctggtggac atggcgggcc tgcccccgca ggcggcggga acagttgatg 4287661 ccgcactgcg gcccataatt gacagggctt atcagcccgg accagcaccc gctgtgaacc 4287721 cgcgtgattt ggtccagggc atccgcggta tccccgccat cgcccctgcc atcgccatcc 4287781 ctatcggcag caccaccggg gccagtgccg ccaccagcac cgctgccgcc acggcagcag 4287841 caacaaatgc gctccgcggg gccaacgtgg gcccgggcgc caacaaggcg ttgtcgatgg 4287901 tccggggttt gctacccaaa gggaagaagc actagccata aagtccacga cctacggtgg 4287961 cgtttcgcag ttgggggtgt aaagggggtt gaggtcttcg acgatggcgg ttgctgctgg 4288021 cccaccaatc cgttgctgct gacgccaatc catcgggaag gccctgggtg gcgtcttggt 4288081 gcgcccggag gggcagcccg ttggcgcccg tcgtcgagcg tgaactgagg gcggacctcg 4288141 ggcagacacg ccgaggtctt ccttttgggc agcgtggaac cgcccatcat cgaaagacct 4288201 cgacccctac cccggcaacg acgcgccgac tacctcacac cctcaactgc gaagagatcc 4288261 taaagcctga gcccgtcgtg taaccaaaga ccgatcagat cgtcgtcgtc gggcggtgat 4288321 tgctcttctt cttccttggg caacaacggc ttacgtttgg ttcgttgggc acggccacgg 4288381 cgtcggccaa ggggccacca ggttgccggc cgccagcttg acggcaacca ccagttcgcc 4288441 tgcccaacca acacggcaat ggcgggcacg gtaacggtac gcaccaagaa ggtatccagc 4288501 aaaagcccgg tccctaggac gaacgcacct tgaaccacgc tacccaagct ggcgaatacc 4288561 agaccgtaca tcgaggcagc catgatcaaa cccgccgcag tgatcacacc acctgttgag 4288621 gccacggtcc ggatgacacc ggaacgcacc cccaagacgg cctcttcacg cagcctagaa 4288681 ataagcagca tattgtaatc tgcgcccacc gcgaccaata taacgaaggt caatcccgga 4288741 atgctccaat gcatttcctg accgagtaaa aattggaaca cgataacgcc aataccgagc 4288801 gccgccaggt acgatacgat aaccgagccg atcagataca gcggtgccac aatcgcacgc 4288861 agcaaaacga tcaatatgag cagaacgatg cagacggtca tggcgatgat caatcggagg 4288921 tcgtgatcgg agtagtcgcg cgtgtccttg agaacgacgg gcaatccgac gacagacacc 4288981 ttggcatcgg ccagtgcggt atttggttgc gcccctcgag cggccgccgt gatcgcgtca 4289041 atttggtcca tggcagcagt gctgaatgga ttcaggtcgg tttgtatcaa ataccgtatt 4289101 gagtggccgt cgggtgaaat gaaggccgcc gcgacttttt tgagttggtc tacattcagc 4289161 ccgcctagca gatcccgata ctctgacggc atcgtctcgg ctttgacgct ctcaccggtg 4289221 gcatacgaca acaactccgg gggaatatag aaccccgcca tcgccggcgt ggtcgcggtg 4289281 tccttcattg ccaataggaa cgccgaggcc tcgcccaacc cgaaacccat cttcttcacc 4289341 tggtcgacca acaactgcac gccctcagcc agttgccggc tcccgtcggc gagatcattg 4289401 acccccttgt tcaccaggtt gatcttggat cgcacaccac caggactgct catccccagt 4289461 gaacccatcg ccctgatgac ggtggccagc gccccgcgta atccggacac ggtggctgcc 4289521 agggtctgca ctgcgcgcgt ggcctgcagc tgtcgagcca actcagatat ctttgccagc 4289581 gttccgtcgt cgcgcgctgt gaccaaacgc tgcagttcgg tgcgcgcact ggcacaagcc 4289641 ggatcggcag tgcacatcgg gctgctatcc agcgccccca gcaccgggct tgcccactcg 4289701 gtgttgttcg ctacaaagct cgcatccgcg tcaatggtgt caccgagtgc ccgcatgctg 4289761 ccgatcagct tctccgcgcc ttccagttcg ccgagaaccc tgttgccccc gagcaggtcc 4289821 tgaaggtacg ccagcgcgtc gatgaggccg ccgaccgtgg atatggcccg gttaacttgg 4289881 gcccgtacgt cgccgagttt gctcgccatc aggttggctc caccggccag tttgtcgatg 4289941 tcgccggtgt gcacagcgat ctgcttggaa ccctcatcca gcttgctgcc gacttcgcca 4290001 gcctgccagg acgtccgggc ctgctccagc gaccgtccag cgggtcgggt aatgcccctg 4290061 accatcgcga cacccggcac ttggctcacc cgctgcacca tctgctctag gtcggcgaga 4290121 gccttcggcg tgcgcagatc cgtcgaggat tggatgaaca ggtactcggg aatgatcagg 4290181 ttagacggga aatgcttgtc caacgcggca tacccgatcg aactctcgac ggaagccgga 4290241 agcgtcttgc gatcgtcgta gttgtaccgg gccagtcctg cgcagccggc cagaataacc 4290301 agcaccagcg cgctggcgag cagatgagtc ttgggccgac gcacgatgtg cacccccgaa 4290361 ctccgccaaa agcgccgggt gaggtcacgg cgcggcgcga tccaaccgcg acgcccggtc 4290421 agcaccatca gggcgggtag cagtgtgaca gctgcgaaga agaccacggc taccgagatt 4290481 cccaacatcg gaccaaccgt tttgagaatt cccagttggg taaacaccat cccgagaaag 4290541 gtgattgcta cggtagccgc ggaggcggcg atcaccttac cgatggatgt caatgccttc 4290601 ttgacggctt gatccgaatc cgcgccctgc cgtaaatagt cgtgatatcg actaatcaga 4290661 aataccgcgt aatccgttcc cgcaccgacc atcatcccgc tcataaaaat aatgctctgg 4290721 ttagcaatac cgaggcccgc caagccggct attgcaacga ggcgctgtgc aaccaccacg 4290781 gacatgccaa ttgttatcaa tggcaacacc atggtgatcg gattcccgta gatgatcagc 4290841 aaaatgacca acaacaggat cgtgatcgca aactcgatgc gactgcggtc ccgttgcccg 4290901 gtgaggttca gatcggcgac ggtggccgcg ggcccggtca ggttagccgt cagtgtcgag 4290961 cctgcgacct ggtgttcgac gatgtcagcg acgcgggcgt acgcctgctt ggactgggtc 4291021 gaacccaggt cgccgggaag gccgaccggc aggatccagg cctgattgtc tttgctggtc 4291081 atgagctccc gcaggggcgg tgtggtgacg aagtcctgga gcatcacgac gtctcgagta 4291141 tcgcgtcgca gggcgtcaac cagctctttg tagctgcgtt catcggccgc gccgagccct 4291201 ttggcatcgc tgagcaccac caccgcaacg ctctgcaacc cggcttcacg aaatgccgcg 4291261 gtcatctgcc gggtcgagac caacaccggg gcgtccgatg gcagaatcgc cactggatgc 4291321 cgctgggaga tcgcgtccag ggacggcacc gtcggcgcaa gcagacccgc aagcgcgacc 4291381 cagaaggcga tcaccaccca cggccttcgg acgataaggc gccctagccg cggaaagaca 4291441 cccccgtcac cggttggcct aagcggtttc gatcgtaagt tcgtcgaggg tctcggtgtt 4291501 ctgacaggct gcatcaagac gtcgcacatt cctcatctgc tccgcacgtg cccgccttga 4291561 gcgccagccg tggtggtcgc tgtgaggcga gtgagacagc aggggatcgg tcacctgacg 4291621 aatttacgtg cgcaaccact aagcttctct atctaccgtc acattcgcaa cctttagatt 4291681 gcagatatcg ataaaatcac ccgcgcgaca agaccgccat gtcatccttt cgatgttatt 4291741 tcgccggcct ggggaaagcg caacgacgtt gcctacacgt tccgccgtcc caccgttggc 4291801 aatgcgcata cacaccgatc taattgccct cagatatgcg gtaacggatt cgcgagcgac 4291861 cggattatct gggaatagca cgctcgccgc ggtctcgtcg aaacgaccga ccatcgtact 4291921 tagcggatag gtgaccctcc cgtcgctgta ggtaccaacg ttgaggccct cgaacagttt 4291981 cgtcaccgcc gagagcggtc ccacttgtgc gtcgaaaaag ttcaccaggg aaaaaagcgg 4292041 ttggggcctg cgcagcgacg gcgacaattc gacgacccgt tcgaacggca ctttcgccag 4292101 atccgcacca gtatcgaagg aggtctgcgc gattcgtgca atctcgttaa aggacaatcc 4292161 ggcgactgga acggtcaccg ggatctgccc ggtgaaccac ccctgcgtca taaggtcggc 4292221 tggtgtgcgg atatctttgg gagtaattcc aaaataggta tcggcgccgg tcaactcgtg 4292281 tatcgcgatg gcgatgcaag ccagcatgcc accaatgaaa cgagcgttcg ccgccatgca 4292341 ggcggattcg aatcgctgtg tttgctgctc gtccattagc atcatgctga gcaggtcgcc 4292401 gccgcagcgt acagacggat ctccgagggg cagcggaaat tccgggaaag ttccgttatt 4292461 gatttcggcg aagtcgatcc acgcgcgcac ctccggggaa tcgacggtca acgccgaggt 4292521 gtactcgtgc tgcctgacgc agaagtccac atagctgcca gcctccgata acccaatcgg 4292581 tggctcaccc attatcagcg cggtgtacat cgactggaac tccatgagtc cgactcctac 4292641 gaactgaccg tccgcatgca gatgatcgat gctggcatag aacgtgaagg agtctgctcg 4292701 ctgaatgact ccgaagctga agcagtccca atgaagcgaa tccggtgtcg ccacgatgtg 4292761 ctgtcgcagg tccgcgctcg tcatctcgcc atgtgtggtc ggaacaaatt cgatatccgc 4292821 cggatcggcg atgctgtgcc gaacgatgtg gtcggtatct cgaagctcga accagctgcg 4292881 gtatgtatcg tgccgacgaa ggtgcgcatt gatgacatag gtcatggcgc gcagatcgca 4292941 gtgaccaaac acctcaacgg acgcaatgag cagccgcgag tgatcgagcc cccgggcagc 4293001 ctgctcagaa aagctccgaa tttgtctggc ttgtacataa ctgggaggca cagcactcac 4293061 cggcgctgca agggctttcg cgcacgaggc aggtgttggg tgccacgaaa ctaacacgcc 4293121 gggcgctggg tcccagtctt tgaccgctga caactctact ggtcctattc gcactaatag 4293181 ctcctatttc agcgcgtgcg gaatacgtat gcggcgaaac gttcttactg tgacgacagc 4293241 gcggcagcag gagcgtcgtc gggcgccagc tgttcataca agtgatccgc taagccccgc 4293301 accgtggcgc tgacgttctt gggtgccaac cggattccgg tctcggtctc gatccgagtg 4293361 cgcagctcta gtgcgcccaa cgaatcaagt ccatactcgg gtagcgggcg gtcagggtcg 4293421 acggtgcgcc gcagaatcag gctgacctgc tcggcgacca gctgccgaag ccgcgccggc 4293481 cactcgtcgc gtggcagctc gttcagctcg acgcggaatt tgcttgtgcc cgaaccgttg 4293541 ctgctggaga acacttcgaa aaaccggctg cgctctgcga aggcgaccag ccacggggct 4293601 ccgatgaccg gggcatagcc ggtatagacg cggttgtggc gcaatagcgc ctcgaacgcg 4293661 taagcacctt cgtcgggagt gatcgccgtg tagttgcttt cctccaatgc cgaagcccgc 4293721 gcgggcgatg ccgaccacca ccccaactgg ccgatatccg accaggctcc ccacgcgatc 4293781 gcggtagccg gcaggccctg agcttgccgc caatgcgcga aggcgtccag ccagctgttg 4293841 gccgctgagt aggcactctg tcccggcgag ccggtgagag ctgccgccga cgaaaacaag 4293901 cagaaccagt caagcggctg tccgctggtt gcttcatgca actcccaggc accgtgaacc 4293961 tttggcgccc agtcgcgcgc cagcaactcg tcggtgatat tggccaaggt ggcgtcctcg 4294021 accaccgcgg ccgcgtgtag cacgcctcgt accggaagcc cggtggccac agcggtcgcc 4294081 accaaccgct ccgcggtacc cggttgggcg atgtcaccgc attccaccac gacttcagag 4294141 cccatcgccg cgatggcctc gatcgtttcc ctcatctttt gcgtcggctg ggtgcgggaa 4294201 ttcagcacga tccggccgca accggccgcg gccatcttct cggccaggaa cagccctagc 4294261 ccaccgaggc cgccggtgat gatgtaggag ccgtcgggac ggaacacctg agcttgttcc 4294321 ggaggcaggg taacgaggct ttttccggtc tgtgggatgt ggaggacgag tttgccggtg 4294381 tgctcggcgt tgcccatcac acggatggcg gtggccgcct cgacgagggg gtaatgggtg 4294441 ctctgcggca tcggcaactc gccggctgcg gtcaagcgat agaccgtgcc gagcaggtcg 4294501 cgcagctctt ctgggtgtgt cgcagacagc aaccccaggt ctacggcgta gaaggacagg 4294561 ttgcgccgga agggaaagag ccccagcttg gtgtcaccat agatgtcgcg cttgccaatc 4294621 tcgacgaacc gtccccggaa ggcgagcagt ttcagcccgg caagttgcgc ggcgccggtc 4294681 accgagttga gcacgacatc gacaccccgg ccgttagtgt cccgccgaat ctgctcggcg 4294741 aactcgatgc tgcgcgagtc atagacatgc tcaataccca tgttgcgcaa tagctctcga 4294801 cgctgtgggg taccggcggt ggcgaagatc tcagcgcccg ccgcgcgggc tatagcgatc 4294861 gccgcttgtc cgaccccgcc ggtgccggag tgaattagca ccgtgtcacc cgccctaatc 4294921 cgggcgagct catgcagtcc gtaccaggcg gtggcgtgcg cggtggtcac cgcagcggcc 4294981 tgtgcgtcac ccaggcccgg tggcagcgtc gcggccagcc gagcgtcaca cgtgacgaat 4295041 gtgccccagc agccgttagg cgacatgcca ccaacatggt caccaacctt gtggtcagtg 4295101 acgcctggtc cgaccgcggt caccacgccg gcgaaatccg tgcccagctg gggcaggtgt 4295161 ccctcgaagc tggggtagcg accgaaagcg atgagtacat cggcaaagtt gacgctggac 4295221 gcacggaccg caacctcgat ctgtcctggt cctggtggaa cgcggtgaaa cgcggccagc 4295281 tctatcgttt gcatatcgcc gggggtacgg atctgcaggc gcatgccgct ctgctgatga 4295341 tccgcgacga tggtgcgccg ctcctgagga cgcaacgggg tcggacacaa gcgcgccacg 4295401 taccactcgt tgtctcgcca ggcggtctcg tcttcttccg acgtggccag caattggcgt 4295461 gccagctgct cgacaccggt ctgttcgtcc acgtcgatct gggtggcacg caggtgaggg 4295521 tgctcggcgc cgatcgtccg cagtagacca cgcagcccgc cctgctcaag attgacgcag 4295581 tcgtcggcca gcacccgctg ggcaccccgc gtcacgacgt acatgcgcgg caccgccccg 4295641 ggaaggtctg acaattcgcg agcgataccc accagccggc gaacgtactc agcgccgcga 4295701 tccgcgctcc cctgatgcgg cgtaccggtg ttcgacccgg tgagcacgac cacgccgcta 4295761 aactcgtcgc taccaacttg atcgcgtagc tggtcggcgg cggccaactg gtcgtcgtgc 4295821 agtggccacc gcatcgtcgt gcacgccgcg ctgtgttccc taaacgcgtc cgctagccgg 4295881 gtagcggtca catcagaggc agcgcagtca ctgatcagca gccattttcc agcgccagag 4295941 gggtccatct cgggcagctc acgctggtgc cattcgatgg tgagtaagcg ctcattcagc 4296001 acccgattgt gtttgtcgcg ctcggacact cccgtaccga ttcgcagtcc gcacacggcc 4296061 agcaacaccg tgccgtgcgc gtccagcacg tcgatatcgg cctcgacgcc gaccaactcg 4296121 actttggtca cccgcgtgta gcaatagcga gcggtacgca ccggagcata ggcacggact 4296181 cggcgcaccc ccaacggcac caataggccg ctacctaccg actggctatc gggatgcgcg 4296241 ccgaccgact ggaaacaggc atccaggagg gccgggtgga ttgcgtacag gccctgctgc 4296301 gaacgaatcg agccgggcag cgcgacttcg gccagcattg tggcggtcgc atcctccgcg 4296361 acataggcca cggccaggcc ggtgaaggcc ggaccatatt gcacaccgtg cttgtcgaat 4296421 tgccggcgca gatcctcacc gtccacgcgg caagggtggg cttccaataa ggaggccatg 4296481 tcgtacgccg gcggctcgca ttcgccggat acctgctgca gcaccgccga cgcacgccgc 4296541 aagtgatgcc caacgccttc ctgcaaggcc tcgacggcga agtcgacgac accgggcgag 4296601 gtcaccgttg ccacggtgga caccggggtc tggtcatcca gcagcagcat cgcctcaaag 4296661 cgcatgtcgc gtacttcgga ctgctcgccg aggacggcac gggccgcaga caacgccatc 4296721 tcgcagtagg cggcccctgg aagagcagcc acgttgtgta tccggtgatc gcccaaccag 4296781 ggcaaggttg cggtaccaac atcggcctgc caggcgtggc gttccggctc ttcgggcaat 4296841 cgcacgtgtg cgcccaacaa cgggtgcacg gctaccgtgg agccacccgg cgaccgattg 4296901 tcaacgcctt cgcggtcata gaacaggaac cggtgcgacc acgccggcag cggagcatcg 4296961 accaagcggc cttggggaca gagcaccgag aagtccactg ccgcaccagc gttgtgcaga 4297021 tccgtcagca ggcgacggag ccccagcggc aatggctgct cccgccgcat accggccagc 4297081 gcggcaaccg gcatgcctac actgccggca atctgatcga ccgcgtgggt cagcagcggg 4297141 tgcggcgaaa gctcggcgaa gactcggtac ccgtcgtcga gcgccgagcg caccgcagcg 4297201 gagaaccgca cggtgtggcg caaattgtcg gcccagtaac gcgcgtcgca cgccggcgct 4297261 tcgcgcgggt cgaaaagcgt cgccgaatag tagggaatct caggagcttt cggattcagg 4297321 tcggccagcg cagctatcaa ctcgtcgagg atcggatcca cctgcggcga atgcgaagcc 4297381 acgtcgacgg ccaccgcccg cgccagcacg tctcgccgct cccatatgtc gaccagcttg 4297441 cgcaccgact cggtgcctcc ggcgatcacg gtggactgcg gcgcggtcac cacggcgacc 4297501 accacatcgt cgatgcctag agcggtcaat tccgactgca cagctaaggc aggcaactcc 4297561 accgacgcca tcgccgcgga accggcgatc gtcgccatca gttttgatcg tcggcagatg 4297621 acgcgtaccc catcttcggc tgacagcact cctgcgacca cagccgcggc cgactcaccc 4297681 attgagtggc cgatcacggc gcccgggcgc actccgtatg ccgccatcgt ggctgccaac 4297741 gcgacctgca tcgcgaagat ggtcggctga actctgtcga tgccagtcac ggtctcgggc 4297801 gccgtcatcg cctcggtgac cgagaacccg gactccgcgg cgatcaatgg ctctagctcc 4297861 gcaacggtcg cggcgaacac cgattcgttc gtcagcagat cggcgcccat cgctgcccac 4297921 tgcgaccctt gcccggagaa taaccagacc ggcccgcggt catcctgccc caccgcgggc 4297981 tggtaaacgg tgtcaccgtc ggcgacctcg cccaagccgg caatcagctc gtcgacgctg 4298041 ctcgcgatga ccgccgtgcg caccgaccgg tgcgtacgcc gccgcgccag cgtgtacgca 4298101 agatccgaga gcaccaggga gtcggcgtgc tgctgtatcc agtcggtcaa ccgctgagca 4298161 gtctgccgca gcgcgtcggc cgaggaagcg gacagcgtga acaaggcagg ggtgccggtc 4298221 gggggggtgc tcgccgcgtg gggctgggct tcggtttgcg gagcttgctc cacaacagcg 4298281 tgcacgttcg ttcccgagaa cccataagac gacactgccg cccgccgggg cacctgacga 4298341 ccgttggtgg gccacggtgt ggtcacctcg ggcacgaaga ggttggtggt gatgccagca 4298401 atctcatcgg gcagccgagt gaagtgcaga ttacgtggaa ccacaccatg tttcagagcg 4298461 agaaccacct tgattagccc tagcaccccg gcggtcgact gggtgtgtcc gaagttggtc 4298521 ttcaccgatg cgagtgcgca cgggccgtcg accccataca cctcggagac acttgcatat 4298581 tcaatggggt caccgatcgg ggtgccgggg ccgtgcgctt cgaccatgcc gaccgtcgcg 4298641 gcgtccacgc caccggcagc caacgccgct cgataagccg caacctgtgc gggctgcgaa 4298701 ggcgtcgcga tattgaccgt gtggccatcc tgatttgcgg acgtgccacg aattaccgcc 4298761 aggatccggt caccgtcggc caatgcatcc ggcaaccgct tgagcaccac cacggcacaa 4298821 ccctcgcctg acacgaaccc gtcagccgcg acatcgaacg cgcgacaacg tccggtcggg 4298881 gacaacatgc ccaaagcgga tccagcagcg gccttgcgtg gctccagcat caaggcgaca 4298941 ccccccgcca aggcaacgtc gctttcaccc tcgtgcaggc tgcgacacgc catgtgcacg 4299001 gccgtcaggc cggacgagca tgcggtatca acggttattg ccggaccgtg cagtcgcatc 4299061 gcgtaggcga cccggcccga cgccatgctg aagctgttgc ccagatatcc gtacggctcc 4299121 tccaattgtt tggcgtcggc cgccaccatc gtgtagtcac catgggtgac acccgcgaac 4299181 acgccggtcg ccgagcctgc cagcgtttgc tgagtaagac cggcgtgctc catggcctcc 4299241 caggacgtct ccagcaacag acgttgctgc ggatcgatcg caatcgcctc ccgctcgccg 4299301 atgccaaaga actcgcaatc gaaatccgcg gggttatcca ggaaaccgcc ccacttgcac 4299361 accgtccgac cgggcacgcc cggctgcggg tcgtagaact cgtcgcaatc ccaccggtcc 4299421 ggcggcacct cggtgatcag gtcgtcgcct cgtaacaacg ccttccacaa caactcgggg 4299481 gaatcgatcc cgccgggcag ccggcaagcc atgccgataa cagcaaccgg agtcacacgt 4299541 ggttcagcca acgtccatgc acccctatct gcaccagtgc ctgacgccgc cgaccccaag 4299601 cccaatgccg gaggcgatac gtagcctaac tagcaatcct tcgatgtagc tgtgtctttg 4299661 gtggctcttt agttctaagc ggctgtgcta ctggggcact gggccctact tcggtttgtc 4299721 gtggcatggg cagcccgcgg tctgccgcag tctgaagttc gcggcctgag cgcgcgctat 4299781 cttccacgcc gggccggtag tctgacgctt catggtttcg ctttccatcc cctcgatgtt 4299841 gcgccagtgc gtcaacctgc acccggacgg cacggcattc acttacatcg attacgaacg 4299901 ggattcggag ggcataagtg aaagcctgac gtggtcgcag gtgtatcggc gaaccctaaa 4299961 cgttgcagca gaagtccgcc gccatgccgc aattggtgac cgtgcagtga tattggcccc 4300021 acaaggactc gattatattg ttgcttttct gggcgcttta caggccggtc ttattgcggt 4300081 tccactttcg gctccgctcg gcggcgccag cgatgaacgt gttgacgcgg tagtgcgtga 4300141 cgcgaaaccc aatgtcgttc tgacaacatc cgcgataatg ggcgatgtcg tcccgcgcgt 4300201 tacgccaccg cccggtattg ccagcccgcc aacggttgcg gtcgatcaac tagatctgga 4300261 ctcgccgata cgatctaata ttgtggacga ttctctccaa acaaccgcat atttgcagta 4300321 tacgtcggga tcgacccgca cacctgccgg tgtaatgatt acctacaaga atatattggc 4300381 aaatttccag cagatgattt ccgcctattt cgccgacacc ggagccgtac cgccattgga 4300441 ccttttcatt atgtcgtggc taccgttcta tcatgacatg ggtttggttc tgggagtttg 4300501 tgcgccgatt atcgtaggat gcggcgctgt gctcacaagc ccggtggcgt ttctgcagcg 4300561 accagcccgg tggctgcaat tgatggcacg cgagggccag gcgttttcgg cggcaccgaa 4300621 cttcgccttc gaactgacgg cagcaaaagc aatagatgac gacttggccg ggctcgacct 4300681 tggacggatc aaaaccatcc tctgcggcag tgaaagggtg catccggcga ccctcaagcg 4300741 ctttgtcgac cggtttagcc gtttcaatct tcgagaattc gcaattcggc ccgcgtacgg 4300801 actcgcggaa gccacggtgt atgtggcgac cagccaagcc ggccaacccc cagaaatccg 4300861 ttacttcgaa ccccacgaac tttccgctgg gcaggccaag ccgtgcgcaa ccggggcggg 4300921 cacagctctg gtcagttacc cgctgccgca atcacccatt gttcggatcg tcgatcccaa 4300981 caccaatacc gagtgcccac ccggaacaat cggtgagatc tgggtacacg gcgacaatgt 4301041 cgccggcggc tattgggaaa agcctgacga gactgaacgc accttcggag gagcactggt 4301101 cgctccctcg gccggcacac ccgtagggcc ttggctacga actggcgact cgggcttcgt 4301161 gtctgaggac aagtttttca tcatcggcag aataaaggat ctgttgattg tttacggccg 4301221 caatcattct cccgacgaca tcgaggcaac gatccaggag atcactcggg gccgctgtgc 4301281 ggcgatagcg gttccgagca atggcgtgga gaagctcgtt gccatcgtcg aactcaacaa 4301341 ccgcggcaac ttggacacag agaggctgag cttcgtcacg cgtgaagtca cctcggcgat 4301401 atccacctcg catggattga gcgtgtcgga tctggttctg gtggcgcccg gctcgattcc 4301461 gatcaccacg agcggcaagg tcagacgtgc cgagtgtgtg aagctgtatc gacacaacga 4301521 gttcacccgg ttggacgcta agccgttgca agcgagcgat ctttagtggt cacgcgactt 4301581 gcaccccgtc tcggggttgt tcggcagcca tgcggctgcc tcccttccgc gcttcacagc 4301641 caccagccgg gcaaggcccg gtcttacggt cggctccacg cttaacgacg ggaaccagcg 4301701 gtcggcgacc accagcgccg acccgtacca gcccgtcttg taggacaagt gccggcgcgg 4301761 agtgcccagg gccgagtccg acagtccgcg ccggcgggcg cgggcgccgg gaagcccctt 4301821 ttgccgcagc atccccgcag cgtccaaacc ttcaacaacg atgtggccgt gggtttgagc 4301881 caatcgtgtt gtcaggacat gcaggtgatg agtgcggaca tcgttgaccc ggcgatgcag 4301941 ccgggaaatc tcggtggtgc gctcgcggta gcgccgtgag cctttcgtgc accgcgaccg 4302001 cgcacggctg gcgtaccgta gctctttgag tgccgtgtcg agtggccgtg gattgggcac 4302061 ttcttcgagc actgcgcccg cctcgttggc gaccgtggcc agccggcgca ccccgacgtc 4302121 aacgccaacc cgtgaaccgg gctgtgccac gttgggctgc tgcgggcgtt gcacgaggac 4302181 ccgcacactg gcgtcgagcc gggtgccgtt acggcgcacc gagattgcca gcacccgcgc 4302241 ccggcctgtg gcgatgagcc gttcaatccg gcgtgtgttc tcgtgcgtac ggacggtccc 4302301 gacgaccgga agtgtgagat gacggcgatc aggttcgacg cgcatcgctc cggtcgtgaa 4302361 tgtcacgcgg tcctgatcgc ggcctttctt cttgaaccgg gggaagccca ttgtcttgcc 4302421 ctcacgttta ccggatcggg agttctgcca gttccagtac gcatcgacag cgccgccaat 4302481 gccgtcggcg taagcctctt tcgagcactc cggccaccac accgccccgg tctcggcgtt 4302541 gacacacacc tcgtccttga cggtgttcca ccgtttacga agcacccgca gcgacggctt 4302601 gacagtcccg ataccagtaa cgcgccacgc ctcgatatcg gctttcaaag tagcgaccgc 4302661 ccagttgtag gccttgcggc gagcgccgaa atgccgcgcc agcgcgcggg cctggtcctc 4302721 ggttgggtcc agcgtgaacc ggaacgcctg cacacaccag ccttctggca cctcgaatct 4302781 ggccatcaag ctgcctccgc gtccccgacc gcagcagcaa gggcacgctt ggccccgttc 4302841 tgtgcagcgc gttcaccata gagccgagca cacatcgagg tcaggatctc ggtcatatcg 4302901 cccaccaggt cgtcatcaac ctcagccaag tcgaccacca ccaattcccg gccctgggcg 4302961 acaagagcgg cctcgacgta ctcagagcca aaccagcaga accgatcccg gtgctccacc 4303021 acgatccgcg tcaccaccgg atcacccagc agcgcaaaaa acttacggcg atgtccattc 4303081 aacgcccaac caccctcggc caccaccttg tcgacagaga gatgttgcga tgtggcccac 4303141 gcggtcaccc gcgcgacccg ccgatccaga tcggacctct gatccgctga cgatacccgc 4303201 gcgtacacca acgtccgccc gcgcccagac tcctcgactg ccggatcgtt caccagaatg 4303261 agccgaccca ctcgctgcgc cggaaccggc aacagcccgg ctcgaaacca gcgatacgcg 4303321 atcacccacg caacaccgtt gcgctccgcc cacaccgcca aattcatcca tctgttccta 4303381 cagcacacca ccgacaacta ccgaccactc aaaacgcaac agttggcagc cctacgatcg 4303441 gccagcgcct gacgggcggc gttatatcca gggatgaacg tgattcccgg cccaccgtga 4303501 caaccggcac tgcccaggta caacccggct atcgggatcg gctggccgat aaagcctttc 4303561 gggccaggcc tgttggggcc gatctggtcc gagtgcagca gggcatggca gtagtcccca 4303621 cccggggcac cgaacatcac acccatgtgt ttgggggtaa aggtggtgta ccggagaatg 4303681 ctgcctttga agttcggtgc caacctagtg atcttgtcga tcacgttctg ccccatttcg 4303741 acctttgccc ggccgtaccc tccgtatttt gagccaccct cgatcgggaa ccacattgcg 4303801 aacgccgacg cggcctgctt acccgccggg gccaggctgg gatcatgcag cgacgggatc 4303861 tgcaacacca cggtcggatc ggccgggacg atcccacgcc ggcaatcctc ccactgctgc 4303921 tgaacctgct ccggtgtaca gaaaatgccc atcgatgcct gcatgctcgg atcgttgagt 4303981 gcctggtagg gcgccgcgaa ggccggtggc tgcgcgagcg caaaatgcat ctgcagatag 4304041 ctgccgcggt ggtcgatgcg caaatagcga tcgcggattt ccgacggcaa cactgccgga 4304101 tcgatcagct cgttgatggt gacgtcgggt gctatggcgg agaccacgat cggggaggtc 4304161 aaggtgtccc ccgccgcggt gcgcacgccc cgcacgcggg ctgacgaccg actattgtca 4304221 accacgatct cggtcacctt ggaacgtaac cggacctcgc cgccggtgcg ttccagcaat 4304281 tgcgacagat gggtggtaag cgcgccgatg ccaccgcgca atttcttcca ccgcacgaag 4304341 tcgccctccg ggacacccaa tccgaaggcg agcgcggcag cgctgcccgg tgtggccggc 4304401 ccgcgataga gcgtgttcac ggccagcacg gtcatcgacc cgcgcagggc gccgtgcttc 4304461 tcgcggtccg ggaaatggcg gtccaacacg tcggtgaccg atccgaacag catgtcatcg 4304521 atcgctgacc gttcgaattc atttgtggca caggcataca tctcgtcgaa gctcttgggc 4304581 agagttccgg cttcgaaacg ccccagcgcc cgggtcggcg cctggctcca cgccagcagg 4304641 cccgccatcc cggtgacggc gtctgccccg tgcacccgat ggaggtgggt aagcatcttc 4304701 gtcgggtcgg tgaattggac caccggatcg tccccgacac cgcgcaacgc taccgacatc 4304761 acctccagat cgaccgtcgg caagctgtcc aggcctaact cgctgctgac cgccgaggag 4304821 gtcgggaact gcaccgatcc ggcgatctcg aaccggtacc cgtcgaacag ctccaccgtg 4304881 gaggccatcc cgccggcgta gcgcttagcg tccagacacg cggtccgcag tccggctcgc 4304941 tgcagcagca ctgccgcggt cagcccgttg tgcccggcgc cgataactat cgcgtcataa 4305001 ccagtcatac gcgtctccag caatgcaggc tcgcacgcgc tcgatgtttt gtcaattatg 4305061 acgaaactgt gagggtggtc caggtgtcgg agatgccgac gcgcagcgac tccagtgcga 4305121 cgtggcagac ccgcgccagc tccccgagcg accggtcact cccaagcatc caggcttcca 4305181 tcgcgccgaa caccgccgcg gcgacgcatc gtgcggtgac ggcgatgtgc aatcgggcat 4305241 cgggtgcacc cgcgatatcg cagttacgtc gccgcaattg ggcctggatg gcatcggcga 4305301 agtcggcttc cacctcgcgc atatggcgga cgatccggct cggctccaac tcgccgcgcc 4305361 gcaacgacgc aatcttcgtc actgcgtcaa cgtcataagg aaacgagaag atagccgctt 4305421 gcacggaatc gatgatcgat tcgtcggccg gtctagcatc cagcgccgcg cgaaaccagt 4305481 gcagtccggc gtcgtagtcg gcaaacagca aatcgtgctt ggatctgaag tggcgataga 4305541 aagtacgcag cgacaccccg gcgtcctccg caatctgctc ggctgaggta gcctcgacgc 4305601 cctgggccag aaatcgcacc agggcggcct ggcgcagtgc ctcgcgagtg cgttcgctgc 4305661 gcgccgtctg cgggggccgg accatgactg caagctatcg tcaattttcg ttctgtcaac 4305721 attgacaaaa ctgttggcca cggcgagact gcgcgcatgg tgtcgcttct tgttcacgct 4305781 gcgctgggag tagtcgtcat cggctggatc gtctcgtcga acccgaaggt tttcaccagg 4305841 ccggccggcg gatcgtggtt ctcgctgccg gagtgtgtgt actacgtcgt cggtattgcc 4305901 tcgatcgcgc tggggtggta cttcaacatt cgttttgtgc agcagtacgc gcacggagcc 4305961 gccaaccctc tctggggtcc cggcagctgg gcggagtacg tccggctgat gttcaccaac 4306021 ccggcggcca gttcggccgg ccaggactac accattgcca acgtgatcct gctgccgctg 4306081 ttttccacca ccgacggcta ccgacgtggt ctgcggcggc cctggctgta tttcgtgagc 4306141 agcctgttca ccagctttgc attcgcgttc gcgttctact tcgccaccat cgaacgtcag 4306201 caccgacacg aacgttcccg tgcgacggtc ggcgcctagg cggcgactgg cttggtggcc 4306261 cgccacctca ggcgagcgcc cgcgacatcg acgtggatat cagtgaatcc cacagctcgc 4306321 agccgaccgg ggaggtccgc cggggcgatc ggagtgtagg tgtcggcgat gtgtattagg 4306381 cgaaacggca gcgacggcac accgtcgctg ccggcaaaga cgccacctgg ttgcagcacc 4306441 cggtacgcct cagcgaatag ctggtcctgc agttgggcgc tggcaacatg gtgcagcatc 4306501 gtgaaacaca ccacggacgt gaagtgatca tcgggcagcc cggtctgggt gccatcgccg 4306561 cggatgatgc gcgcccgctg gccgtagcgg cggttcaggc gctcgaccat cgagttgtcg 4306621 acttcaacgg cggtgagcga ggcggtcagg ccaaggagcg cttgcagtgt cgccccataa 4306681 ccggggccga tctccagcgt ccgggggccg agttcgacgt gctgcaacgc ccagggcagg 4306741 agctgattgg ccaccgcttt ttcccagcct gccgagctgc aatgacgccg atgtagaaga 4306801 ttcatggcca tggcccagaa cactagttag ccaccggccg gcagtcttcc gatattctgc 4306861 cttaatatgt cggaaaacag ccaccacagg ctggccacaa cctcgttgac gctcccgccg 4306921 ggagcgcgga tcgaacgcca ccgccatccg tcacaccaga tcgtctatcc gtccgcaggg 4306981 gcggtctcgg tcaccactca cgcgggaacc tggattacgc cggtaaatcg ggcaatctgg 4307041 ataccggcgg gctgttggca ccaacacaag ttccacggcc acacgcaatt tcacggcgta 4307101 gcgctggatc cgcagcgcta tcgcggcggc ccggcaaccc cgacggtgct cgcggtcaat 4307161 ccgttgatgc gcgaactcgt catcgcgtgt tcgcaggccg accgaaccga caccgacgag 4307221 caccaccgga tgttggccgt actgcaggat caactgccaa caacgagcat ccgcgagcca 4307281 ctgtgggttc cctcaccaac cgatcgccgg ttgcggcacg cgtgcgcgtt gatcgccgac 4307341 aacctgaccc agcccttgac gctgcagcag atcggcggcc ggatcggtgt cagccagcgc 4307401 acgctgagcc gtctgttcag cgacgagctg ggtatgacgt tcccgcaatg gcgcacccag 4307461 ctgcgcctgc aacatgcgct cgtgttgctc gccgagcgcc acgacgtcac gtccgtggcg 4307521 tccgaatgcg gttgggccac accaagcgcg ttcattgaca cctaccgaca agccttcgga 4307581 cacactcccg gccaagccgc taagccaatg gcggcgaccc gcctcacccg gctccgccgc 4307641 gctcgcgatc gccgctaagc gaccggctcc agcacttcga cacccacgaa cggaaccagt 4307701 gcgtccggga ctctaacgct gccgtcgggc cgctggtggt tctccaggat cgcaaccagc 4307761 caccgggtgg tggccagcgt tccgttgagg gtggccgcga tctgcggctt gccgctggca 4307821 tcccggtagc gggtcgccaa ccggcgcgcc tgaaaggtgg tgcagttcga cgtcgacgtc 4307881 agctcgcgat aggccccctg cgtcggaatc cacgcctcgc agtcgaactt gcgggcggcc 4307941 gacgagccga gatcacccgc ggccacgtcg atgacccgat acggcacctc gatgcgtgcc 4308001 agcatctggc gctgccagcc cagcagccgc tcatgttcgt gctccgcgtc ggccggtgtg 4308061 cagtagacga agccctcgac tttgtcgaac tggtgcaccc ggatgatgcc gcgcgtgtcc 4308121 ttgccatggc tgccggcctc acgtcggaaa cacgacgacc agcccgcata ccgcagcggc 4308181 ccgcgggaaa ggtccagaat ctcgccggag tgataccccg ccagcggtac ctcggaggtg 4308241 cccacaaggt agaggccgtc gccctctacc cggtacacct cctcggcgtg ggcgcctaga 4308301 aatcccgtgc ctaccatcac ttccgggcgc accagcaccg gcgggatcgt agggacaaag 4308361 ccgttgtcga cggctagctt cagcgccagc tgcagcaatc caagctgcag tagggcaccc 4308421 cgaccggtca ggaagtagaa ccgtgaaccc gacaccttgg cgccgcgctg catgtcgatc 4308481 aggcccagcg actcgccgag ctccaggtgg tccttggggt tctcgaggta gctgggctcg 4308541 ccgacgacgt cgagcaccgc gtagtcgtcc tccccgccgg cgggtacccc gtccacgatg 4308601 acattcgaga tcgccaggtg cgccgcggtg aacgccgcct ccgcttcgac ctcgtcggcc 4308661 tcagcggctt tgacctgctc ggcgagttcc ttcgcgcgcc gcagcagcgg cgggcgctct 4308721 tcgggagacg cgccacccac gcttttgctg gcggctttct gctcggcccg taacgaatcg 4308781 gcggtcgaga tcacggcccg gcgggcggcg tcggccgtca gcagggcatc taccagcgcc 4308841 gggtcctcgc cgcggctgag ttgtgagcgg cgtaccgcgt cggggttttc acgaagcagc 4308901 ttcaggtcga tcacggccgc aagactactt ttgacgccca gtcagggtgg cggcagagga 4308961 ccatccaccc gcgatgaagc gatcccgcaa gctgacaact gcaacattgg tcatgcggcc 4309021 ccgccgaccc tgtcagaatg gagcggatgt tggacgcgcc cgagcaggac cccgtcgatc 4309081 ccggcgaccc ggccagcccc ccgcacgggg aggcggaaca gccgctgccc gggcctcggt 4309141 ggccacgcgc cctgcgcgcg tcggcgaccc ggcgagcgct actcctcacc gctttgggtg 4309201 gcctgctgat tgccgggctg gtcaccgcga ttcccgccgt cggccgcgcg ccggagcggc 4309261 tggccggcta catcgccagc aatccggtgc ccagcactgg cgccaagatc aacgcttcgt 4309321 tcaaccgcgt cgccagtggt gactgcttga tgtggccgga cggcacgccg gagtctgccg 4309381 ccatcgtcag ctgtgccgac gagcaccggt tcgaagtcgc cgagtccatt gacatgcgga 4309441 cattccccgg catggagtac gggcaaaacg ctgctccccc gtcgcccgcc cgcattcagc 4309501 agatcagcga ggagcagtgc gaagctgctg tgcgccgcta cctcggcacg aagttcgatc 4309561 ccaacagcaa gttcaccatc agcatgctgt ggcccggcga ccgggcgtgg cggcaggccg 4309621 gtgagcgccg catgctctgt ggcttgcagt cgcccggtcc gaacaaccag cagctcgcct 4309681 tcaagggcaa ggtcgccgac atcgaccagt ccaaggtctg gccggccggt acctgcctgg 4309741 gcatcgatgc caccaccaac cagccgatcg acgtgccggt ggactgcgcg gcaccgcacg 4309801 cgatggaggt atccggcacg gtcaacctgg ccgagaggtt tcccgacgcg ctgccgagcg 4309861 aacccgagca ggacgggttc atcaaggacg cgtgcacccg gatgacggac gcctacctcg 4309921 cacccctcaa gttgcgtacc accaccctga cgctgatcta ccccacgctg acgctgccca 4309981 gctggtcggc gggtagccgc gtggtcgcat gcagtatcgg cgcgaccctg ggcaacgggg 4310041 ggtgggcaac cctggtgaac agcgctaagg gggcgctgct gatcaacggc cagccgccgg 4310101 tacccccacc cgacattccc gaggagcggc tcaacctgcc gccgattccg cttcagctgc 4310161 caacgcctcg gcccgccccc ccggctcagc agctgccaag taccccacca ggcactcagc 4310221 acctccctgc ccaacagcca gtggttacgc ccacccggcc acccgaatcg catgcgccag 4310281 cgtcggcagc accggccgag acccagccac cgccaccaga cgccggagcg ccgccggcga 4310341 cccaatcacc agaggccaca ccgcctggcc ccgccgagcc cgcaccggca ggctagccgg 4310401 gtgacagtac ggatggaccc gcagcggttc gacgaactgg tgtccgacgc actcgacctc 4310461 attccgcccg aactggcgga cgccatggac aacgtcgtcg tgttagtcgc caatcgccac 4310521 ccccagcacg aaaatctgct cggccagtac gaaggggtcg cgttaaccga gcgcggctcc 4310581 gactacgccg gatcgctgcc tgatgccatc acgatctacc gcgaggcgct gctggacgcc 4310641 tgcgactctg aggatgaggt cgtcgaccag gtcgccatca cggtgatcca tgaggtcgcc 4310701 catcacttcg gcatcgacga cgagcgcttg gaccaactgg gctggcgtga cgaaccagcg 4310761 cccgggcgcg gcaacccgga tttgtcggca cccgatgcta tgaacggccc atgagcacgg 4310821 actgccgcga ctgccgggcg ggcttggatc actgccacgg caccgtcatt cgtcatccct 4310881 tggcacggcc ggaatgcacc gagccggact gtgtcagccc cgagctgcaa ccccatatct 4310941 tcgtcctaga ctgcaatgcc gtcagctgcg aatgcactga atcggccacg gcgcccgggt 4311001 ccttcagatc agcccatcgg gtcggtgctt gacgtcaccg cgtgtgtgac cgggctggct 4311061 gcggcttcag cggggtccgg acaaaacggc ggcttccgga ggccccactg cacacaactc 4311121 catcgcccat cggttatcgg ggccagcacc accgactcga cgttttccaa gtggttgtcc 4311181 aacacaaagt tgccgtccac cccggcaagc accgcggcgg ccagccggat cgccgcgctg 4311241 tggctcacga cgacgatgtc gccgtcccag tcaccgtcgt cgaggtaacg catgcgcagg 4311301 tcggcgagca ccggcagata acgatccagg acgtcgttgg cggtctcgcc accgggcagc 4311361 ggcacatcca actccccgcg atgccagcgg ctgtaggtgg cgttgaactc ggcgaccgcc 4311421 tcgtcgtcgt tgcggttttc cagctcccct acctgtacct cgtgaatgcc ggcaacctcg 4311481 tgggccacca tgtcgagttc ggcagcgacc accgcggccg tctggtaggc ccggatagcc 4311541 accgagtgtg cgagcagtgc cggccggcga caaccgctgc gcgcgaacgc cctggcctga 4311601 tcacgaccca gcggtgtcag cgccgttccc ggcggcaggg tatccaacct gcgctcgacg 4311661 ttgccatagg actggccgtg ccgcagcagc accaaacgac cgctcatgct tgcgccccct 4311721 ggtcgtccgg gcgaaccagg gtctgctcgg gtttgcccgc gcggagccgc gctaaccagc 4311781 gtgatgcttc gtctaccagg ggcggctgcg cccctgccgc tggccccgtc ggccaggacc 4311841 ccaggtatcg cacatcagca caacgtcggt gcaccgcctt gagtgcctcg gcgacggcct 4311901 cgtcgtcgat gtggccgacg caatccacga agaacagata ggtgccaagt tcggtacggg 4311961 tgggccggga ttcaatccga gtgagatcga tgccgcggat gccgaactcg gccagcgcag 4312021 ctaccagcgc accgggctgg ttgtcgatgc gcagcactgc agacgtgcga tcggctccgg 4312081 tgcgcgccgg aggcggcccg ggccgaccaa ccaggacgaa gcgggtgcgg gcattggatt 4312141 cgtcaacgac accgtcggcc agggccgcca atccccaacg agcggccgcc agcggcgagg 4312201 tcaccgcggc gtcaaccaag ccgtcagcca cctgccgggc cgcgtccgcg ttggaataag 4312261 ccggccgcag gtcggcggcg ggaagatggg ccgccaacca ctgccgcacc tgtgcagccg 4312321 ccaccggaaa ggccgccagg gtccgcacgt ccgcggcgtt gcgcccgggt ttgaccacga 4312381 tgctgaacgt cacgtccagc gttgtctcgg cgaacacctg caggcgcaca ccgatggcca 4312441 ggctatccaa agtaggcagc acggaaccgt cgatcgagtt ctcgatcggc acgcacgcat 4312501 aatccgcacc gccgtcgcgg accgcagcca gtgctgcggg cgcgctctcg accggcatcc 4312561 gctgcagtgc atcgggcccg gtctcgggaa ctaggccggc ggccaccatc cggaccaggg 4312621 ctgcctcggt gaatgtccct tccggaccga ggtaagcgat acgcaccacg ctcacaaccc 4312681 taacgacgca aagccgaccg ccaactcttg cgaccagacc gtgcattagt taacttaggc 4312741 ttacctaaac acaggaggtc gtggatgccg ccgctcacca gtctcgcgcc gactactgcc 4312801 gagcgaattc gcagcgcctg cgcgcgggcc gggggcgcct tgctggtggt tgagcgggag 4312861 gatccggtcc ccgtgcccat acaccatttg ttgtacgacg ggtccttcgc cgtggcggtt 4312921 ccggtcgatc gtggcgaggt gtccggttcg caagcgctgc tggagttgac tgactatgcg 4312981 ccgctgccgg tgcgtgaacc cgtccgttcg ctggtgtgga tccgcggctg cctccaccag 4313041 atcccgcccg cagagctggt tgagaccctg gacctgatcg ccaccgataa tccgaatccg 4313101 gccctgctac aagtcgagac cccgaggccc gggccggccg atgcggcgga gacccggtat 4313161 accatgcagc ggctggagat cgaatccgta gtggtgaccg acgccaccgg cgccgaaccc 4313221 gttaccgtgg cggacctgct cgcggcccga cccgatccgt tttgtgaaat cgaatcaacc 4313281 ttgctctggc acctagccac cgcccatgac gatgtggtcg cgcggctggt atccaggctg 4313341 ccggcaccgc tacgacgcgg acagatccgc cccctcggtc tcgatcggta cggcgtccgg 4313401 tttcgcattg aagctcgcga cggagaccgc gacatccgac tgccgttcca taagccggtg 4313461 gacgacatga ccgggctaag ccaggccatc cgggtgctca tgggttgccc gttccgcaac 4313521 gggctgcgcg cccgcaggta gcaggcacag ccgccgctcg gccgcgttgg ccggctgcat 4313581 ccaaaggttc agccacgtac gttgtctagg tccggggttg gcatccgaca acccgacgac 4313641 actgatatcg atcccgcgtg actcttatgt accgatccct ggccacggcc gggacaaaat 4313701 caacgccgcg ttcgcgctgg gcggggggcg gctgctgacc caaacggtcg agttggctac 4313761 tggcctgcac ctggatcact atgccgaggt cggattcagc gagttcgccg acctcgtcga 4313821 cgccttcgat ccgttggccg gcgtcgatct accggcaggc tgccaaacac ttgacggacg 4313881 tgcagcgctg ggctacgtcc ggactcgggc cacaccacgg gccgatctag agggctccga 4313941 cgtgccggtg ccagccgccg cgttcgaaac acagccctaa cgacacgctg ccgaatatga 4314001 cccgtgtcgg aaattagggc gacaagagta atgcggctca acatagcctt gctttactta 4314061 ggcaaacctg ccttcaacca ggaggttatt atcatcctgt ggtaactagg aaagcctttc 4314121 ctgagtaagt attgccttcg ttgcataccg ccctttacct gcgttaatct gcattttatg 4314181 acagaatacg aagggcctaa gacaaaattc cacgcgttaa tgcaggaaca gattcataac 4314241 gaattcacag cggcacaaca atatgtcgcg atcgcggttt atttcgacag cgaagacctg 4314301 ccgcagttgg cgaagcattt ttacagccaa gcggtcgagg aacgaaacca tgcaatgatg 4314361 ctcgtgcaac acctgctcga ccgcgacctt cgtgtcgaaa ttcccggcgt agacacggtg 4314421 cgaaaccagt tcgacagacc ccgcgaggca ctggcgctgg cgctcgatca ggaacgcaca 4314481 gtcaccgacc aggtcggtcg gctgacagcg gtggcccgcg acgagggcga tttcctcggc 4314541 gagcagttca tgcagtggtt cttgcaggaa cagatcgaag aggtggcctt gatggcaacc 4314601 ctggtgcggg ttgccgatcg ggccggggcc aacctgttcg agctagagaa cttcgtcgca 4314661 cgtgaagtgg atgtggcgcc ggccgcatca ggcgccccgc acgctgccgg gggccgcctc 4314721 tagatccctg gcggggatca gcgagtggtc ccgttcgccc gcccgtcttc cagccaggcc 4314781 ttggtgcggc cggggtggtg agtaccaatc caggccaccc cgacctcccg gcaaaagtcg 4314841 atgtcctcgt actcatcgac gttccagcag tacaccgccc ggccctgagc tgccgagcgg 4314901 tcaacgagtt gcggatattc ctttaacgca ggcagtgagg gtcccacggc ggttgccccg 4314961 accgccgtgg ccgcactgct ggtcaggtat cggggggtct tgccgagcaa caccgtcggc 4315021 agcagcggtg cagcccgccg gatccgccag accgcggcgg ccgaaaacga catcaccacc 4315081 gcacgggatc gatctgcgga ggcgggtgcg gcaataccga accggtgtag cagcgccagc 4315141 agcttgtttt ccaccagcga gccgtatcgg acgggatgct tggtctcgac gaagatcttc 4315201 accggccggt gccagtccaa aaccagcgaa acaagcgcgt ccagggtcag cagactggtg 4315261 tcgccgtgcg aaccgtcggg gcgccagctg tcgtgccacg cgccgtactc cagctcgcgt 4315321 agctgggcca gcgtcatcgt gctgaccaag ccggctcccg tcgaggttcg gtccaggcgg 4315381 cggtcatgca cacagaccag atgcccgtcc cgggtcaacc gcacatcaca ttccacgccg 4315441 tcggcgccct ctttgagcgc caggtcgtag gcggcaaggg tatgctccgg ccgagccgcc 4315501 gacgcaccac ggtgagcaac cacaaaggga tgtccggcga gcacctcgtc ggcccatgtc 4315561 atgtccacta tgctgccggt tcctgcccgt ccaactcaac cgcaacagaa gatgccggcg 4315621 cggaacgccc gtctgtgttc accaccaccc agcgatgcgc tgggcgttcg accggctttt 4315681 gctcgaaccc ctcgaagacc cgcgcagcag cggccaccgc ggccgccgca cacagatacg 4315741 ccagcaccat catggtggtg ttgttggcga tgccctgagc gtcggtgacc cagctggtgg 4315801 cgaacgcgaa catcgatatc gcgttgctga cgatccacac gatccaccac accacgatcg 4315861 gcctgcgcag ccgcgtgtag cggtcctcga ccagcgccaa ctcgatgacg tacagcggag 4315921 cccacagcag attgaccatc ggcaataggc agccggccca taactcacgg gcggaacgcc 4315981 gctccggcaa gccttgatgc ataaacgcgg cggcccgacg ggcgaccagc caccggacca 4316041 acaggacaat ggtagtgccg gccgccgcaa tcgccgccaa gctgaccaaa acccccagcc 4316101 agaccgaggc gctggccacc accgagttca acaatgtgtt tcggttgatg accagcaaca 4316161 cataccgcac cacaaacacc acgaccgcga tgctgaacac cagcaggctc accaacagcg 4316221 tggtgcgcac cgccgccggc gatggccctg ctttcgccga ggccggcacg ggagcctggt 4316281 cgacatggtc ggttagcccc caccgcggta tcccggcgta gcggggagta ggcccacgta 4316341 accgtgggcc gtgccgtggc ggcggtgccg ccccgggtcg caccgctatc caccgaaaac 4316401 ctgggggaag ccgcggcggt gtgcgccgcg tgtcggaggc cgtcggcacc tgcgggcgcg 4316461 ccggtgtacg ccagcgcgcc tcggccggca tatccgccaa cggcgccagc aacatccccc 4316521 gacagcgtgg acaccacacg cgttgccgct cacggacgtt ccagccagtt ccgcactggg 4316581 agcacacttg gatcaccaga ccagcctagt gacttctccg ccccgcaccg gtacggcatt 4316641 gtccgcgccg tcaacaggcg ttgaggcagg cttccgcgct ggattgggcg cgcccggtcg 4316701 cggcacgtcc agcacgacac agctacctac gactatccac agtttccaca gctttatcca 4316761 cagcggtaag aatccgacga atggcgttaa caccggctcc atccgtcagc caggcccaca 4316821 actgtggata acagcgcccg tcaatgcgtt ctcatcgaca gcctggcagg tacccgagcg 4316881 aaatggattg tcgcactaag catccacatc tgccccggct gcacctagca gcctgcccgc 4316941 ccgggcccgg cctgctcctg cgatcgtcaa accacacatt tcgcggcgct gccggcgcag 4317001 tatccggacg tcttgtggcg ctgcgaggta ccaatttttc cccaccattc accaggagtt 4317061 attatcgcgt gcacgacact tcgttgtgac ttacctcacc gtcgtgaggt gagcatgcag 4317121 gtgaaaggcg actgatggcc acacactcgt acggcccagg gtctacaacg ccgccgaact 4317181 ctggctcgcc ggagtggaac cacgcctacg gcattgccgc tttgcgggcc gccctgatcg 4317241 ctctggcgtt actggcgatt ctggccgtca tcgctttggt ttgagtcccc ggccactcgg 4317301 gtggcaccga gtcggtccgg acgccctggt cagaaccggt tctcggattt gggtaacccc 4317361 ccttgtgtca ctgccgtttc ggtggtcaca gcacggcaat tgttgtgggt ggcctttcat 4317421 agaactgcga catggattac cgcggtcgtg aggaaatcgt cgaggctggt tgcaccccca 4317481 cggagccagc cagaaattct gtagatcaga gttggcttga ttatgaatca tgctctagca 4317541 cagggcaact cgtgagtgtg ttgaacacta ccgtcctgtt ctgcgttccg gcactcgaat 4317601 aacctcccgt cccactcgaa atattgcgca gcctaagata aatcagcttc atagccgaat 4317661 ccttgcctgg caaaaggacc gcggttattg attaacttgc gcagctcgat cggatagtcc 4317721 aggaatggca cgaattccgg ctacgcatgc gcccagacaa aattccttga gcgcgagctc 4317781 ggcggcctcc acggtgacgg cgccatgaat cccggcatcg acgagacgcc ccgggctgtc 4317841 ttggatcggc cggcccgagg cgtctttgcg cccgtcaagg tccaccctga tagccaaatg 4317901 cgccagctgg cggcaaccac cccgttgtct tcgatccgca gccgtaaacc gtcgttcgtc 4317961 ggcgcccgtc gcccaacgtg aactgagggc ggagaatcgg ccggaatctc gccctcagtt 4318021 cacgctcggc gccgtttggc ctcacccagt caatgtgatc tgtgcgggcg ggcgttggcg 4318081 cgtagcgaac cccagtggcg ccggcccgcc aagcacgccc cggcgcggcc agctcatcag 4318141 cggctacgca agcgcaacgg cgcccgcgat gggctgtgga agaacccgga ggatctcacc 4318201 gaacaccaga atgccaagct gtcgcgctca tctactcaaa gaaggcctac ggcacctgtt 4318261 ttcggtcaaa ggcgaagaga gtaagcaggc actggaccgg ttgatcttct aggcgcggcc 4318321 ccgagtgagc atactttggt ggcttgtatc tcttgtagtg ccgctttgac ggggtggtgg 4318381 tcaggtacgg tggcctcggg agaggctgga gggctcgacg ttttcggctg agtgtctggg 4318441 cccgtgaaag agatcgtctg ctccagcttt gtctcctgaa ctgacccggt ttagggaatt 4318501 ggtggccagg ttgcggaagt gcgcagcatc gacgtgtacc tgggtgaggc atcgaatcat 4318561 cgacaagcac cggagccgcg cgtgaactcc cgccgcgttg tggtcgggga tgatgtggga 4318621 gaccggccgg cagtgctgtg tacgaaggtt ctcccaccgc aacgagttca cgcacgacgg 4318681 tcggctgggt gggccctgga atacgtgaac tcttcatcaa cacaacatga ttgacgatga 4318741 aggggagaac ctccatgcac aacaacgcta acccgtgact gccgagaatc caggacggag 4318801 caggcggacg ctggtcggaa tcgacgcggc gatcacggcc tgtcaccaca tcgcgatccg 4318861 cgatgatgtc ggtgcgaggt cgattcgatt cagtgtcgaa cccacgctgg ccggactgcg 4318921 caccctcacc gacaagctca gcggttacga cgatatcgac gccaccgtgg aaccgacctc 4318981 gatgacgtgg ctgccgctca cgatcgctgt cgagaatgcc ggtgacacca tgcacatggc 4319041 cggcgcgcgg cattgcgccc ggctgcgggg tgcgatcgtg ggcaagagca agtccgacgt 4319101 catcgacgcc gaggttctca cccgcgccag cgaggtgttc gacctgacgc cgctgacact 4319161 gccgacgccc gcgcagttgg cgttacgtcg atcggtgatc cgacgtgccg gcgcagtgat 4319221 tgacgcgaac cggtcctggc gtcggttgat gtcgttggcg cggtaggcgt tccccgatgt 4319281 gtggaccgcg ttcgccgggt cgttaccgac cgcgacagcg gtgctggggc gttggcccga 4319341 catccgcttg ctggccggcg caccgacccg caactggcgg cgttctacca ccggctgatg 4319401 accacccaga ggcattgcca cacccaggcc accatcgccg tagcccgcaa gctggccgaa 4319461 cgcacccggg tgacgatcac caccggccgc ccctaccagc tgcgcgacac caacggcgac 4319521 cctgtcaccg cccgcggcgc gaaagaactg atcgacgccc actaccacgt cgacaccagg 4319581 acccacccac acaaccgcgc ccacactgac accatgcaga actcgaaacc ggcacgctga 4319641 acaccactgt cggcagggga tccggttgca cacgcaacgg tcacttgagg cgatcgtctc 4319701 cattcctggc tccttgccgc ccattgttgt cggcgagcaa ggagtcacag tggagtcccc 4319761 gcagcgtagc gaggaaaacc gaccttgacg cccgacgagc ggcaacgaga accggcaacg 4319821 aggaatggtc ttcgacaagc ccaccgtgag ttgtctatcg gtttctcatt ttcagcgtct 4319881 tttcagagtc gcgcaacaca atccgatgcc cgtcgagatc cgtcgcgact acacacacac 4319941 ccagcatctc gaccatcgcg actccggccg acgacggcta acgagcagct tcgccccacc 4320001 cgcccccgca gcaacaacac aacggcacgg cagcagctga tcactgccca aaacacgcac 4320061 ccacatcaga tgcagaaccc cttgacaacc aatagggaat ctcttcacga atgagggggc 4320121 agttggggtt tgaatccgcc ggtttccagt aggtatctgt cggcttagtt ggtgagattg 4320181 cgaaagccga gggtcgatcc ccggaggtgc tcgacgcggc cgctgatcgc ttcggtcggc 4320241 gggttgaccg tggtcactgt tttgggcgtc gatccactgc gggaattccc actaccacgt 4320301 ccggccggat caccggcgac tcgcggtgca cggcccgctc cagcacctcc ttggtcaatt 4320361 cgttagccgt ccccgccaac tgcccagccg tcgacttctt cttgcccacc caccccatag 4320421 accttcgcca cacagcgcct tccgtccacc caacagcggt ccgatgacgg acccccgacg 4320481 gggacttcag cgaccaggaa cgcgcccata gacgtggtat cagcctgggg gcgtcctggt 4320541 agcctatgcc gtccgccctg gggcatcgac cccaaggtcg ttgttgcgac gcgagcggtc 4320601 atggagcagg gttgacttgt caagctagag ccagcccatc gcgtgggagg cacccgcgcg 4320661 aaaagaaaca tcggacgatc atttcatcga aggaaggaat gccgtggccg aatacacctt 4320721 gccagacctg gactgggact acggagcact ggaaccgcac atctcgggtc agatcaacga 4320781 gcttcaccac agcaagcacc acgccaccta cgtaaagggc gccaatgacg ccgtcgccaa 4320841 actcgaagag gcgcgcgcca aggaagatca ctcagcgatc ttgctgaacg aaaagaatct 4320901 agctttcaac ctcgccggcc acgtcaatca caccatctgg tggaagaacc tgtcgcctaa 4320961 cggtggtgac aagcccaccg gcgaactcgc cgcagccatc gccgacgcgt tcggttcgtt 4321021 cgacaagttc cgtgcgcagt tccacgcggc cgctaccacc gtgcaggggt cgggctgggc 4321081 ggcactgggc tgggacacac tcggcaacaa gctgctgata ttccaggttt acgaccacca 4321141 gacgaacttc ccgctaggca ttgttccgct gctgctgctc gacatgtggg aacacgcctt 4321201 ctacctgcag tacaagaacg tcaaagtcga ctttgccaag gcgttttgga acgtcgtgaa 4321261 ctgggccgat gtgcagtcac ggtatgcggc cgcgacctcg cagaccaagg ggttgatatt 4321321 cggctgaccc cgctgccgca agcgtcgggc tcagtattcc ggagtcgcgc atcaccatcg 4321381 cccttatcct ggccttatat tgcagctttg tgaacacggc cgcggtggcc gtgtcgagtt 4321441 gcagggcgcg taaaccacgc gcatgcttgg ttactcgagc taccatttat ttcgagctac 4321501 cagcgtggtt aggacggagg cgtcgcggag gggcgagatg ggtaccgggt caggtgggcc 4321561 tattggggtt tctcccttcc attcgcgtgg tgccctgaaa gggttcgtga tctctggacg 4321621 ttggcctgat tcgaccaaag agtgggccca gctgctgatg gtcgcagttc gggtcgcgtc 4321681 gttgcccggc ttgctctcca ccacaacggt gtttggtgcc cgcgaagagt tgcccgacga 4321741 acccgagccg gggaccgtcg gtctggtgct ggccgagggc accgtcttcg gtgaatcagc 4321801 aattcagcca ggatatttcg ctgatcatca accccctgca ttgctgatgc tgcatccacc 4321861 ctcggagacc acgccgtcgc tgccggaatg caccggggcg gcgtcagggt gcgtgctgct 4321921 gccgggatta ccgtatctgg gattggaaca tcgtgcggct tgggtggagg ctgaagccga 4321981 cggcaccatc acatctatgg tgagccgggt gggcgtcgac ccgataagcc atcccgacac 4322041 cgcaattctg gcaatgctgc ttgcagcata aggaaattcg aaggagtctg ttcgggcggc 4322101 gaatcgccaa atacgggtgg ccgaacttgt ccgacatcct ggtgcacacc aaatatgacc 4322161 gctagcctgg ggacgttagc gaaggggagt agtcccgaat cgtcgagtcg acatactggc 4322221 gaaaagcccg gctggcgaac cgtttgatac caacggtggg cgagaccttc gaccgatgtt 4322281 cgatgaccga ctggtcgtcg acaacgcgtc gaaaggtcgc ctgccatgct cgccgccaca 4322341 ctgctaagtc tgggagccgt tttccttgct gagctcggcg acagatccca gctcatcacg 4322401 atgacctaca cacttcgcta ccgctggtgg gtggtgctga ccggggtggc gatcgcagcg 4322461 ttcacggtgc acggggtagc ggtggcgatc ggccactttt tgggctcgac cgtgccggcc 4322521 cggccggccg cctgcgtatc ggcgatcgca ttcctgatct ttgccgtgtg ggtctggcgg 4322581 gaggacacgg ccagcgacag cgaaacctcg ccaaccgctg ccgaaccccg actcgcgctg 4322641 ttcaccgtgg tctcgtcgtt cgcactggct gagctgggtg acaagacaac gttggcgacg 4322701 gtgaccttgg ccagcgatca ccactgggcc ggcgtatgga tcggcaccac cctgggcatg 4322761 atcctggccg acggcctggc gatcggcgca gggctgctgc tgcaccggcg ccttccggag 4322821 cggttgctgc aggtcctgac tggcctgctg ttcctgctgt tcggactgtg gttgctgttc 4322881 gacgacgcgt tgggcttcag atcggttgcc atcgccgtga cagcggcggt ggtgctggcc 4322941 gcggcaacta cggcggtatc ggtgcgggtg gcgcaaactc gtcggcggcg gccaaccgct 4323001 gctgcgacac cagaagatga ctcgacacgc cccgagcggt cgtcggtcgc gccgggccat 4323061 cccgggagca tcttgctacc gcttccggaa gtgtctttgc gggggcgccg accgccctca 4323121 gggtcgcctg acgagcgctg tgcggaccca ggcagcaaag gaggctctcg gcgaatctcc 4323181 gttggctgct ggttgcccgg agtcggccgc atccgcccga cacggtcatc ctgatctgct 4323241 cgccgaacac gtgggcgacg gaccaacgcg cgtgttttca tcggatattc tgcggataac 4323301 ctgtgaaatc cgttcgtcgt gtggacacat caccgaatcg gttggaccct catcgggggg 4323361 gtcttcgttg acccctcaca acgtcagcac ccaatccgct caggtttgca cttggttgtg 4323421 gacacaactg tcgctaccat gatcagcaaa tacatacaga taaccgtttg ctcttggagc 4323481 ccggtggagg tcacatcgat gagcacgacg ttcgctgccc gcctgaaccg cctgttcgac 4323541 acggtttatc cgcccggacg cgggccacat acctccgcgg aggtgatcgc ggcgctcaag 4323601 gcagagggca tcacgatgtc ggctccctac ctatcacagc tacgctcagg aaaccgtacg 4323661 aacccatcgg gggcgaccat ggccgccctg gccaacttct tccgcatcaa ggcggcctac 4323721 ttcaccgacg acgagtacta cgaaaagctc gacaaggaat tgcagtggct gtgcacgatg 4323781 cgcgacgacg gcgtgcgccg gatcgcgcag cgggcccacg ggttgccctc cgcggcgcag 4323841 cagaaggtgt tggaccggat cgacgagctg cggcgtgccg aagggatcga cgcttagtcc 4323901 ctgataccga ccgcccgctc cacccgacct ggcgggttgg ggttggtctg ccccgattag 4323961 ggttgcccca gcgatcaccg cgatagtcca cgagataccg ggaggcggcc gggaatgggc 4324021 ctgttcggca agcgaaagag ccgcgcgacc cgtcgcgcgg aagcccgcgc gatcaaagcc 4324081 cgcgccaagc tcgaggccaa gctgtcggcc aagaacgagg cgcgccgcat caaggccgcc 4324141 cagcgcgcgg aatcaaaggc gctcaaggcg cagctgaagg cccggcggga cagcgaccgg 4324201 gcggcgctca aggtcgccga agccgagctc aaggtagcac gcgaaggcaa gttgctgtca 4324261 ccgacgcgga ttcgccggtt gctgacggtt tctcggctcc tggccccgat actgacgccg 4324321 gtgatatacc gggccgcgat ggctgcccgc gggttgatcg accagcggcg cgccgatcag 4324381 ctcggggtcc cgctggcaca gatcggccgg ttctccggtc atggcgcccg gttgtcggcg 4324441 cgggttgggg gagccgagcg atcgttgcgg atggtgcagg aaaagaagcc gaaggacgta 4324501 gaaaccaaac agttcgtgtc ggcggtgacc aatcggctca ccgatctgtc ggcggccgtc 4324561 gcggccgcgg agcacatgcc cgcaaagcgg cgccggacgg cccactcggc gatctcgtcg 4324621 cagctggatg gcatcgaggc ggacctgatg gcccggctcg ggttgaccta accggcggcc 4324681 cgatgaccgc aattggcatg tcacatccgc ctcgcgtgca tcggcgggtc ggcgggcagc 4324741 gcactgcact gaccgcgggc atcggcctct tgctggccgc cttggtgctg accaccatcg 4324801 cgaacccacc tgcggcgttt gcgcacaccg cgcagctgtc caccgctacg cccgcacccg 4324861 cagtcgccgc caccgacgcg aacgacgtcc cgacgtggcc attcgtcgta gggaccgtgg 4324921 cggcggttgc cgtggctgca ttgtgggccg ttcggcgcgg gcgctaacca atcaaccccg 4324981 gtagcccgga aggtgcggca ccgtgtcctg gcatgatggg accgagcgtt tgcgatctag 4325041 tgagcgacga caatgctgca aaggagcggc cacatgccag acccgcagga tcgacccgac 4325101 agcgagccga gcgacgcatc gacgccgcca gctaagaagc tgccggccaa gaaggccgcc 4325161 aagaaagcac cagcaagaaa gacgccggcg aagaaggcac ccgccaaaaa aacacccgcc 4325221 aagggtgcta agtccgcgcc accaaagcct gccgaggcgc ccgtcagttt gcagcagcgg 4325281 atcgaaacca acggccagct tgcagctgct gctaaggatg cagcggcaca agcaaagtcg 4325341 acagtggaag gcgccaacga cgccctggcg cgcaacgcat cagtgccggc gccgagtcac 4325401 tcgcccgtgc cgctgatcgt tgccgtcacg cttagcctgc tggcgctgct gctgatccgg 4325461 caactgcgcc gccgctgaac gcgctggcac catagtggcc atctcatttc gcccaaccgc 4325521 tgacctcgtc gacgacatcg ggcccgacgt gcgcagctgt gacctacagt tccgccaatt 4325581 cggcggccga tcgcagttcg ccggaccgat cagcaccgtg cggtgttttc aggacaatgc 4325641 gttgctgaag tcggtgctct cgcagccaag tgcgggcggt gtgctggtca tcgacggcgc 4325701 cgggtccctg cacaccgcgt tggtcggtga tgtcatcgcc gagttggccc gctctaccgg 4325761 ctggaccggg ttgatcgtcc acggcgcggt gcgagatgcc gccgcgctgc gcggcatcga 4325821 catcggcatc aaagcgctgg gcaccaatcc ccgcaagagc accaagaccg gtgccggaga 4325881 acgcgacgtt gaaatcacgc tgggcggggt gacattcgtt ccgggcgata tcgcctacag 4325941 cgacgacgac ggcatcatcg tcgtctgact atggcctaaa ccggcgctaa accgtcgcta 4326001 aagctaaacc cccaccgggg caggcctttt ggcgaaccgc agaccctcgt cgtcgatctt 4326061 gccgcgccgg atgagccgga tgtcacgtag gtagttctga ttcaggcgcc acggtgtacg 4326121 cgaaccctgc ttgggcagct cgtccagcga gcgcagcacg taacctgggg tgaactccat 4326181 gaagggccgc tcttcgacat ctgagcccgg tcgctcgacg accacggtgt caaaaccgtt 4326241 gtcgtccatg taattcaaca agcgacagac aaactccgac accaggtcgg ccttcagcgt 4326301 ccaggaggca ttggtgtagc caaccgtgta ggccatgttg gggatgccgg aaagcatcat 4326361 gcccttgtag gccatcgtcg tggtgatgtc cacttgttgt ccgtcgatag tcgccgtcgc 4326421 cccaccaaaa agctgcaggt tcaaccccgt tgcggtaatg atgatgtcag ccggcagttc 4326481 gcgacctgag ttcagccgga ttccggtcgc ggtgaaccgt tcaatggtgt cggtcaccac 4326541 ctcgaccttc ccgtgacgaa tggcccggaa caggtcgccg ttgggcacca agcacaatcg 4326601 ctggtcccag gggttgtagt gcgggccgaa gtgctttcgc acgtcgtacc cctcgggtag 4326661 ctggcgctgg atcaggctca ggaacatctt ccgcatgcgc cgtggccact tctggcaggc 4326721 gctgtacacg gccgcctggc gcagcacgtt cttccaccgt accgcggtgt aggccatggt 4326781 ctccggcagc cagcggttga gcttctcggc gatgccgtcc cggtctggct gcgacacgat 4326841 gtaggtgggt gagcgctgca gcatcgtgac gtgcttggcg cccgagtccg ccagcgccgg 4326901 cacgagcgtg accgccgttg cgccactgcc gatcacgacg atgttcttag cgtcgtagtc 4326961 gaggtcctcg ggccagtgct gcggatggat gatcggcccg acgaaatcct ccgagccggc 4327021 gaatctcggc gagtagccct cgtcgtagtt gtagtagccg ctgcacagaa agaggaattc 4327081 gcaggtgagg gcgctgagcg tgccgtggct ttggatgtga acggtccagc ggttttccgc 4327141 ggtcgaccaa tcggcactga tcaccttgtg gtggaaccgg atatgcctgt cgattccata 4327201 catggccgcg gtgctcttga cgtactcgag gatgggcttg ccgtcggcga tcgcctgccg 4327261 tccggtccag ggacggaatc ggaaacctag cgtgtacatg tcggagtcgg agcgaattcc 4327321 gggataacgg aacaaatccc aggtgccgcc catggattcc cgcttttcca ggatggcgta 4327381 gctcttggtc gggcaacggt cctgcaggtg ccaggccgcg ctgacaccgg agattccagc 4327441 gcccacgatg acaacgtcga ggtgctcggt catggatcca cgctatcaac gtaatgtcga 4327501 ggccgtcaac gagatgtcga cactatcgac acgtagtaag ctgccagggt gaccacctcc 4327561 gcggccagtc aggcttcgct gcctaggggc cggcgcaccg cgcggccgtc cggcgacgat 4327621 cgtgaactgg cgatcctcgc caccgccgag aaccttctcg aggaccgtcc gctggccgat 4327681 atctcggtcg acgatctggc caagggcgcc ggtatctcga ggccgacgtt ctacttctat 4327741 ttcccatcca aggaagcggt gctgctgacc ctgctggacc gggtggtcaa tcaagccgac 4327801 atggccctac agacccttgc cgagaatccc gccgacaccg accgcgagaa catgtggcgc 4327861 accgggatca acgtgttctt cgagacattc gggtcgcaca aggcggtaac ccgagccggt 4327921 caggccgcca gggcaaccag tgtcgaagtc gccgaactgt ggtcgacgtt tatgcagaag 4327981 tggatcgcct acacggccgc cgtgatcgac gccgaacgcg accgaggcgc ggcgccgcgc 4328041 accctgccgg cccatgaact ggccacagcg ctcaacctga tgaacgagcg gacgctgttc 4328101 gcgtcattcg ccggcgaaca gccctcggtg ccggaagccc gcgtgctgga tacgctggtg 4328161 cacatctggg tgaccagcat ttacggcgag aaccgctaag ccgcactcgg tcgggggtgc 4328221 tcggtcgatg ctcagtgcca aagcggcatg cagatctcac ggaggtccgg tggacgatct 4328281 ggcagccgaa gtggcgcctt gggtaggcaa tggcgtgcgg tcatatagga gcgggtgcat 4328341 tcgcatgtcg gacacgtggc gttgccgcct ggtaccgcgg tgttcgtggc cgacagcggg 4328401 ctaatgcgac ccggtccacg ccaggagcgt gtcggccggc caggtgttga cgatccggtc 4328461 ggcgggcacc tccgcgtcca aggcgcgctg ggcgccgtag ccgaggaagt ccagctggcc 4328521 gggtgcgtgc gcgtcggtgt cgatgctgaa cacgcagccg atgtcgcgcg ctaggtgcaa 4328581 caggcgcgtc ggtgggtctc ggcgttccgg acgggagttg atctccacgg cggtgccgtg 4328641 ctcacggcag gcggtgaaca ccgcctctgc atcgaacttc gattctggcc ggatgccacg 4328701 attgccggcg atcagccggc cggtgcagtg gcccagcacg tcggtgtgac cgttggccac 4328761 ggcgcgcacc atccgtcgcg tcatcgctgc cgaatccatc gacagcttgg agtgcacgct 4328821 ggccaccacg atgtcgaggc ggtccagcat ctcgggttcc tggtccaagc tcccgtcttc 4328881 gaggatgtcg acctcgatcc cggtcaggat gcgcagcggc gcgaacttct cgcgcagctc 4328941 gtcgatcacg tccagctgct tgcgcaaccg gtccggagac aggccgttgg cgatcgtcaa 4329001 ccgcggtgag tgatcggtca atgcgcagta ctggtgacct agcgccgccg cggtggccat 4329061 catctcctcg atcggcgcgg acccgtccga ccagttcgaa tgcagatgca gatccccgcg 4329121 caatgcggca cggatcgccc ctccaccgag atcctcagcg tcagcgcgta attcagccag 4329181 caggtccggc tcgcggccag accaggcctg ggcgatgact ttcgcggttt tgggaccgat 4329241 acccgccagc gactgccagc tgttggcctg gccgtgccgc tgccgcgccg cgtcgtcaag 4329301 gccctcgata atgtcggcgg cattgcgata ggccatcacc cgcctcgggt cgtggcggtt 4329361 ccggtccttg taataggcga tctgccgcag cgctgttacc gggtccatta tcgggctcac 4329421 accagttgcc cgaagacgac cccggtgaca accaccgcga agccggccat ttcgccgagg 4329481 atgagcaacg ccattaacac ccccgcaccc tttgcgggac gctcgaattg gttcgcggtg 4329541 gcacggcgcg cgccatgggt gacataactc gccaacagga tgggtttcgt atcaaatccg 4329601 agggcacagt tcatcgcttc actgagttta gttgggacct aggcccagat gccgtcgcgg 4329661 cctggggcgc cattgcccta gataacaatc tgataaagcg gagcaaacaa gctgtggtgc 4329721 acactcgggc acgtatcagg ttggctacac agcgaagcgc aacagctctt cagtggttat 4329781 cgggcgctcg ttcttggcgg ggaactcgtg gcttttgacc gggtggcgaa accatgacca 4329841 ggcgattcgc cccatccgtg accggggtac tgggttggta cgcacagcga cactcctgcg 4329901 atcggacaac tcgactggca cctcacatta aacctctatg tgacgaagcc cacatcgact 4329961 cattagacac ctcggagctg gcaaacagtg aacggcgcgc cgagcaatta tcaaatgttt 4330021 ctgatgtgac tctagtgatt attgaagcgg tgcagcggtc ggcttaacag gcgccggcag 4330081 ggcactggaa cccatcaagt accggtctac ggccgcggca gcggcccggc cctcggcaat 4330141 cgcccagacg atcaatgact ggccccggcc catgtcaccg gctacgaaca caccaggaac 4330201 cgaggtgtcg aagtcgtcgc cacgggccac gttcccacgc tcggtgaact tcactccgag 4330261 gtcggtcaac aggcccgccc gttccgggcc gacgaaaccc atcgccagca acaccaggtc 4330321 ggcttcgagc tcgaagtcgg agccctcaac cttgacgaac ttgccatcca gcatggtcac 4330381 ttcgtgtgcc cgcagcgcgc tcacgcgccc gtccgtgccg acgaacgcct cggtgttgac 4330441 cgagaacacc cgctcgccac cctcctcatg cgcggccgat acccgataca tcagcgggta 4330501 agtcggccat ggggtggatt cggcgcgggc gtccggtgga cgcggcatga tctcgaactg 4330561 gtgcacggcg atcgcgccct ggcggtgcac ggtacccagg cagtccgccc cggtgtcgcc 4330621 gccaccgatg atgacgacct tcttgccctt tgcggtgatc ggcggctgcc cgtcctcatc 4330681 gaggacgtca tctccttctt gcacccggtt ggcccacggc agaaactcca tcgcctgatg 4330741 gacgccctcc agctcgcggc cgggaatcgg cagctcgcgc caagcggttg cgccaccggc 4330801 caatacgacc gcatcgaaat cagcgcgcag cttttcggcg ctaatgtcga ccccgacgtt 4330861 gacgcccggc cggaattcgg ttccttcgga gcgcatttgg tccaaacgcc gatcaagatg 4330921 ccgcttttcc atcttgaatt ccgggatgcc gtaacgcagc agcccgccga tgcggtcttc 4330981 gcgctcgaaa acggtgacgg tgtgacccgc ccgggtgagt tgctgggcgg cggccaaacc 4331041 cgccggcccc gaacccacca cagcaaccgt ttgcccggtc agcttccgcg gcggacgtgg 4331101 ttgcacccat ccttcgtcga aggccttgtc gatgatctcc agctcgatct gcttgatcgt 4331161 caccggatcc tggttgatgc ccagcacaca cgccggctcg cacggagccg ggcacaaccg 4331221 gccggtgaag tcggggaagt tgttggtggc gtgcagccgt tcgattgcgt cgcgccagcg 4331281 gccccggcgg accagatcgt tccattccgg gatcaagtta cccagcggac atccgttgtg 4331341 acagaacgga atgccgcaat ccatgcagcg ggtcgcctgt tggcgcaggc tctcgttgtc 4331401 gaattcctcg tagacttccc gccagtctcg cagccgcagc gggaccggcc gtcgcttcgg 4331461 caatttccgg tgggtgtatt tgaggaagcc gcccggatca gccatgcgca gccgccatga 4331521 tcgccttgtc gacatcaacg ccgtcacgtt cagccagggc gatcgcctgc aggacccgtt 4331581 tgtagtcacg cggcatcacc ttgacgaagt ggcgctgctg tcccgaccag tcggacagaa 4331641 tccgctggcc gacagcggaa tcggtagcgt cgacgtgcac ttgtatggtg ccgtgcagcc 4331701 agtccgcgtc atcctcgtcg agggtctcga gttcgaccat ctccgagttg aggttggccg 4331761 gcagttcacc gtcgggatcg taaacatagg ccacaccgcc ggacataccc gccgcaaagt 4331821 tacggccggt gcggcccaga atgacaaccc tgccgccggt catgtactcg cagccgtgat 4331881 cgccgacacc ctctaccacg gcgtgggccc cggaattgcg caccgcgaac cgttcgccta 4331941 ccacaccgcg caggtaaacc tcgccactgg ttgcgccgaa cagaatcaca ttgcccccga 4332001 tgatgttgtc ctcggcgaca taatcctgcg gcgcgtcatc cgacggccgc accacaatcc 4332061 ggccaccgga tagccctttg ccgacgtagt cattggcgtc gccatacacc cgcaaggtaa 4332121 ttcccttggg cacgaaggct ccgaagctgt ttcccgcgga tccgtcgaac gtgatatcga 4332181 tggttccgtc cggcaagcct tggccgccat aggccttcgt cagctcgtgg ccgagcatgg 4332241 tgcccaccgt gcggttgaca ttgcctatgg tggtggagaa gcggaccggc ttgccggaat 4332301 ccagtgcttc cctgctcatc acgatcagct gctgatcgag cgccttgtct agaccgtgat 4332361 cctggcgcga actgcagtac agatcctgat tcatgaaggc cgactccggc tcgtggagca 4332421 ccggcgccag atccagctta tgcgccttcc agtgcgcgcg tgccagcgtg gtgtccagcg 4332481 cacctgcctg tccaaccgcc tcgttcacag tgcggaagcc caactgcgcc aaatattccc 4332541 ggacttcctc ggcgatgaac atgaagaagt tctccacgaa ctcgggcttc ccggtgaacc 4332601 gctcccggag caacggattc tgggtggcca caccaaccgg gcacgtgtcc aggtggcaca 4332661 cccgcatcat gatgcagccg gccactacca acggcgcggt cgcgaatccg aactcttctg 4332721 ccccgagcag cgtagcgatc atcacatcgc gacccgtctt gagctgaccg tccacctgga 4332781 ccacaattcg atcacgtaac ccgttgagca gcaacgtctg ctgtgtctca gccagaccca 4332841 actcccaggg tgctccggcg tgcttcatcg atgtcagcgg ggtcgcgccg gtgccaccat 4332901 cgtgccctga gatcaagacc acgtcggcgt gggctttgga aacgccagcc gcaaccgtcc 4332961 ctaccccgtt ttcggagacc agcttgacgt gtacccgcgc ggatggattg gcgttcttta 4333021 ggtcgtggat cagctgcgcc agatcctcaa tggagtagat gtcgtggtgg ggcggcggtg 4333081 agatcagacc gacaccgggc gtggagtgcc ggacctcggc cacccaaggg tacaccttgt 4333141 gccccggaag ctgacctccc tcaccaggtt tcgcgccctg cgccatcttg atctggaggt 4333201 cggtgcagtt ggtcaggtaa tgcgaggtga cgccaaaccg ggcggaggct acctgcttaa 4333261 tggcgcttcg gcgccaatcc ccgttggggt cgcggtcaaa tcgcttgacg tcctcgccgc 4333321 cttcaccaca gtttgaccgg gcaccaagcc ggttcattgc gatggccagc gtctcgtgcg 4333381 cttcagcgga aatcgagccg tagctcatcg cccccgttga gaagcgcttg acgatttcgc 4333441 tggccggctc gacctcgtcc agcgggactg gaggacgaac cccggtacgg aacttgagca 4333501 gaccacgcag cgatgccatc cgctcgctct ggtcgtcgac cagacgggtg tactccttga 4333561 agatcttgta ctggccggtt cgcgtggagt gctgcagctt gaacacagtc tccgggttga 4333621 acaggtggta ctcgccctcg cggcgccact ggtattcccc acccacctcg agttcgcggt 4333681 gagcgcgttc gtccggccgg tccagatagg ccagccggtg ccgggctgcg acatcggccg 4333741 cgatgtcatc cagggtgatc ccgccggtgg ggcaggtaag cccggtgaag tattcgtcga 4333801 gcacttgctc ggagatgccg acagcctgga acagttgcgc accggtgtag gaggccagcg 4333861 tcgagatgcc catcttcgac atcactttca gcacaccctt acctgcggct ttgatgtagt 4333921 tgttcagcgc cgccgtacgg tcgatgccct cgataacacc gcggtcgagc atgtcctcga 4333981 tcgactcgaa caccaggtag gggttgatcg cggccgcgcc gaatccgacc agcgcggcca 4334041 tgtggtgcac ctcgcgggca tcaccggact cgaccaccag acccacttgg gtgcgggtcc 4334101 gttcccgaac caggtggtgg tgcactcccg caacggcgag cagcgacggt atcggagcca 4334161 tttcctcgtc ggactcgcgg tcggacaaga tgatgatccg agcgccgtcg gcgattgccg 4334221 ccgccgccgc gccacgtacc tcttccagcg cggcagccag cccagcacct ccctcggaga 4334281 cccggtacag acagcgaatc accttggacc gcaatccgtg tgggcgccca ttgaccttgt 4334341 cgttgggatc gaggctgacc agcttggcga gctcgtggtt acgcagaatc ggctggggca 4334401 gcacgatctg gtggcaggag ttctggtccg ggttgagcaa gtcacgttcg ccgccggtgg 4334461 tgccctgcag gctggtcacc acctcctcgc ggatggcgtc caacggcggg ttggtcacct 4334521 gggcgaacag ctgatggaag tagtcgtaga gcatgcgcgg acgctgcgac aacaccgcaa 4334581 ctggagtgtc ggtgcccatc gacccgattg gctcggcacc gagccgagcc atcggcgcta 4334641 ccagcaggtt gagctcctcg taggtatagc cgaatgccaa ctgccgcatg acgattcgat 4334701 ggtggggcat ccgcacgtct ttgccctccg gcaattcgtc gagcggaact agtccgttgt 4334761 caagccactc ctgatacgga tgctcggccg ccaggtcggc cttgatctcc tcatcggaga 4334821 cgatgcggcc ctgcgcggtg tccaccaaga acatccggcc cggctgcagc cgcatccggc 4334881 gcaccaccgt cgacggatgc aggtccaaca caccggcctc ggaagccatc accaccaaac 4334941 cgtcgtcggt gacccagatt cgcgacgggc gtaggccatt gcggtccagc acggcgccca 4335001 cgacggtgcc gtcggtgaac gtcatcgacg ccgggccgtc ccacggctcc atcaacgagg 4335061 cgtgatactg gtaaaacgcc cgccgcgcgg ggtccatcga ctcgtggcgc tcccaggcct 4335121 cagggatcat catcagcacc gcgtgggcca ggctgcgtcc gcccaggtgc agcagttcga 4335181 gcacctcgtc gaagcgcgcg gtgtccgagg cacccggggt acagatcggg aacagctttt 4335241 cgacatcggc cgccgaccca aagatgtcgg tcttgatcag cgcctcgcgg gcccgcatcc 4335301 agttctcgtt accggtgacg gtgttgatct ccccgttgtg cgcgatccgc cggaatggat 4335361 gcgccagcgg ccaggacggg aaagtgttcg tggagaaccg cgagtgcacg atgcctagcg 4335421 cgctggtcag tcgctcgtcc tgcaaatcga ggtagaaggc cttgagctgc ggggtggtca 4335481 gcatgccctt gtagacgagc gtctggccgg acaggctcgg gaagtacacg gtttcccggc 4335541 ccggcccgtc ttgacccgga cccttggtgc cgagttcatg ctcggcccgc ttgcggacca 4335601 catagcagcg ccgctccaac gccatgccgg acgcgccagc caagaacacc tgccggaagg 4335661 tgggcatggc atcacgggac agcgcgccca gcgatgagtc gtcggtgggg acgctgcgcc 4335721 aacccaggac ttgcagcccc tcggcctcgg cgattttctg tacggcggcg caggccgcgg 4335781 cggcgtcttt agatgactgc ggcaagaacg cgatacccgt ggcatagctg cctggggcag 4335841 gcaactcgaa atccacggct tcgcgaagga attcgtccgg aacctgaatc aggatgcccg 4335901 cgccgtcacc gctgcggggt tcggcgcctt gcgcgccccg atgctcgagg ttgagcaggg 4335961 cggtgatcgc cttgtccacg atgtcgcggc tacgacggcc gtgcatgtcc acaaccatgg 4336021 caaccccgca cgaatcgtgt tcgaacgcgg ggttatacaa cccgacgcgc ttaggcgtca 4336081 tacccaccta acccttcagc agactttctg cgcggccgcc tttgcggatt cgacggggcc 4336141 gcacccggag gtagcgggca agaccccttc ggtcttgtcg ataggctgtc cgtcaagcgg 4336201 gcgtgatccg gtcggggctt cgtccgtgca gcagtgaacg cttggccctg gaatcggact 4336261 cgacaagtcg taaaacgata tgacaaaacc cgcttgacat gccaactttc ccaatactaa 4336321 ctcgtcagcc ggcggcaccg tagctgccgc gtggccagca accgaccgta tcgtcacatg 4336381 catttttcct cgtccaaatc cggctgcgct agctgcgtgg cggtctgatc gccagccaca 4336441 ggaaatgctt agatacgttt gctgtgaaat ccggagcacc gctgtttcgc cacttgcgcc 4336501 ggtgggaaca accgccggaa cggcgggtat ctgtgttgtt gcatggcgat gccgccgcga 4336561 cgactaccca gcgcaacccc ccagagtttg cgcgatccta aaaggggtct aaaaagggcg 4336621 tctagacagc cagcagtcag tccagggagc tagccgatac gggacgatat tggtcggcgt 4336681 ccggcatggg cgatcttacc gtggggctca tcagccgcga gctcgcctca gccggccacc 4336741 ggcgcgacaa tcgatcgcct gtcacctgag gagcttatgt acgagcgtga cgaattcctg 4336801 cgcgatcgga tccgaccaca ccagcccggc accccgcggg gatactcgcc ccgtccgccg 4336861 tccggagatc gctgccccgc gccaccgcct ggccggcacg ctgctgccgc tacgccacca 4336921 gggccgccgc gcctgccttc agctccactg cgtccattgc cggacccggc ttggccacgc 4336981 cagccggagg ccccgccacc gagcacctgg gccgaccccg ccctggcgcc gatacgcagt 4337041 cggacgcgac ccggcgagcg tggttggcga cgcatggtgc ggctggtcac ctttggcctt 4337101 gtcggcctgg gccggtcggg catgcagcgc caggaggccc aattcgaagc aacgatacga 4337161 accgtcctgc atggcaacca caaggtcgcc gtgctgggca aaggaggtgt gggaaagacg 4337221 tcggttgcgg cgtgcgtcgg atcgatcctt gccgaactgc gccagcagga ccgtatcgtc 4337281 gggatcgacg ccgacaccgc cttcggcagg ctgagcagcc gaatcgatcc tcgagcagct 4337341 ggttcgttct gggagctgac caccgacacg aatctgcggt ccttcaccga tatcaccgcg 4337401 cgcctgggcc gaaattccgc gggactgtac gtcctggcag gccagccggc atccggtccg 4337461 cgccgggtgc tcgatccggc catctaccgc gaagccgccc taaggttgga tcaccatttc 4337521 gcaatctcgg tgatcgactg cggttcctcc atggaggcgg cggtcaccca ggaagtattg 4337581 cgcgatgtgg atgctctgat cgtggtgtcc tcgccctggg cggatggtgc ctccgctgcc 4337641 gccaacacca tcgaatggct gtcggattat ggcctgacag gtttgttgcg acgcagcatc 4337701 gtggtgctca acgattcgga cggacacgcc gacaagcgca ccaagtcatt gctggcccag 4337761 gaattcatcg accacgggca gcctgtggtc gaggtgccct tcgatcccca tttgcggccc 4337821 gggggggtca tcgatatgag ccacgaaatg gccccgacga cgcggctgaa aatcctgcag 4337881 gtcgccgcga cggtgacggc gtacttcgcg tcgcgacccg ccgacgcaca cggcagcccg 4337941 ccccggtgac ctggctggct gacccggtcg gcaacagcag gatcgcccga gcgcaggcct 4338001 gcaaaacgtc aatctcggcg cccatcgtcg aatcctggcg ggcgcaacgc ggcgcgcaat 4338061 gtggacagcg cgagaaatct tgtcgatgtt ctcgcgctgt ccacatccag ggcatctcac 4338121 cgccactgtt ccgcagaccc ctcgaaccag cggtccaggc ggcggttgcg tcatgccgat 4338181 tgggcagaca cccggtggtc gcgcaccggg taaccgttgc gctcggccag ggatcgcagc 4338241 tggcccaacg cgaatgcccg cgcccggcct gattcgggaa ttacgacccc tgcccacagc 4338301 ccttccgcac ccgcggactc gacggcgtcg cgtgcacaca gccaccggcg cgggcaagcc 4338361 cggcacaggg tcttggcctc gtcgtcggga gtcgtcgtcc aacgatcggg atcttgcgtg 4338421 caaacgccga gcgggacctc atacagggcg gttactgtca tgtctacgtt cctccagaaa 4338481 gcgttgcagg ttgtagcctc tgccgcgaaa gcgtatcgca ttaaccatag cgatgcaaca 4338541 gtttcctcct ctgcctgcct agcggtgctg cggctccggt tcggcgagct ccgagctcta 4338601 gtgcgcgcac cgccgagtac cagggcatag atcctgttaa tcagctgtgt atctggcctc 4338661 gccggcgcgt atccgacccc ttcgggcaga tcttccagga aaagtgttct gacatgcgac 4338721 agttcaggtg tgaagtgaac tgtagcggca gttcggtttg gctaggaaac tatttccata 4338781 gcgggccgtc gcgtcgctag atccaaaatg tagcgaagtc atagcagtag aagggtgcaa 4338841 cggttaggat ggcgggcgag cggaaagtct gcccaccgtc ccggctagta cccgcgaata 4338901 agggatcaac gcagatgtct aaagcagggt cgactgtcgg accggcgccg ctggtcgcgt 4338961 gcagcggcgg cacatcagac gtgattgagc cccgtcgcgg tgtcgcgatc attggccact 4339021 cgtgccgagt cggcacccag atcgacgatt ctcgaatctc tcagacacat ctgcgagcgg 4339081 tatccgatga tggacggtgg cggatcgtcg gcaacatccc gagaggtatg ttcgtcggcg 4339141 gacgacgcgg cagctcggtg accgtcagcg ataagaccct aatccgattc ggcgatcccc 4339201 ctggaggcaa ggcgttgacg ttcgaagtcg tcaggccgtc ggattccgct gcacagcacg 4339261 gccgcgtaca accatcagcg gacctgtcgg acgacccggc gcacaacgct gcgccggtcg 4339321 caccggaccc cggcgtggtt cgcgcagggg cggccgcggc tgcgcgccgt cgtgaacttg 4339381 acatcagcca acgcagcttg gcggccgacg ggatcatcaa cgcgggcgcg ctcatcgcgt 4339441 tcgagaaagg ccgtagttgg ccccgggaac ggacccgggc aaaactcgaa gaagtgctgc 4339501 agtggcccgc tggaaccatc gcgcgaatcc gtcggggcga gcccaccgag cccgcaacaa 4339561 accccgacgc gtcccccgga ctccggcctg ccgacggccc ggcgtccttg atcgcgcagg 4339621 ctgtcaccgc cgccgtagac ggctgcagtc tggctatcgc agcgttgccg gcgaccgagg 4339681 accccgagtt caccgaacgt gccgcgccga tccttgctga tttgcgccag ctcgaggcga 4339741 ttgccgtcca agcaacccgc atcagccgga ttaccccgga attgatcaag gcgttgggcg 4339801 cggtacgtcg ccaccacgac gaattaatga ggctgggagc aaccgcccct ggtgccacac 4339861 tggcgcagcg cttatatgcc gcacggcggc gcgcgaacct ttccaccctg gagactgccc 4339921 aagcggccgg cgtcgcagaa gaaatgatcg tcggcgccga agccgaggaa gagttgccag 4339981 ccgaggccac cgaagcgatc gaagcactga tccgtcagat caattgaggt cggctccgag 4340041 cgtcccacaa gtacaggcac gccgtaacgc tcaagttcaa cggtccgggg aacgcgcgcg 4340101 ttctccggcg tttgacggtg cgttccatcg tgccgcgaac ttgaaaacgc cagcgtcacc 4340161 aaaaaattcg tgcaccaacc cccctccgag cgctgctaag ctcaatgtgc agtgcaaagg 4340221 tgcagataat gatggcgcac cggaacggcg agcgtaagga aacacataaa tggcatcggg 4340281 tagcggtctt tgcaagacga cgagtaactt tatttggggc cagttactct tgcttggaga 4340341 gggaatcccc gacccaggcg acattttcaa caccggttcg tcgctgttca aacaaatcag 4340401 cgacaaaatg ggactcgcca ttccgggcac caactggatc ggccaagcgg cggaagctta 4340461 cctaaaccag aacatcgcgc aacaacttcg cgcacaggtg atgggcgatc tcgacaaatt 4340521 aaccggcaac atgatctcga atcaggccaa atacgtctcc gatacgcgcg acgtcctgcg 4340581 ggccatgaag aagatgattg acggtgtcta caaggtttgt aagggcctcg aaaagattcc 4340641 gctgctcggc cacttgtggt cgtgggagct cgcaatccct atgtccggca tcgcgatggc 4340701 cgttgtcggc ggcgcattgc tctatctaac gattatgacg ctgatgaatg cgaccaacct 4340761 gaggggaatt ctcggcaggc tgatcgagat gttgacgacc ttgccaaagt tccccggcct 4340821 gcccgggttg cccagcctgc ccgacatcat cgacggcctc tggccgccga agttgcccga 4340881 cattccgatc cccggcctgc ccgacatccc gggcctaccc gacttcaaat ggccgcccac 4340941 ccccggcagc ccgttgttcc ccgacctccc gtcgttccca gggttccccg ggttcccgga 4341001 gttccccgcc atccccgggt tccccgcact gcccgggttg cccagcattc ccaacttgtt 4341061 ccccggcttg ccgggtctgg gcgacctgct gcccggcgta ggcgatttgg gcaagttacc 4341121 cacctggact gagctggccg ctttgcctga cttcttgggc ggcttcgccg gcctgcccag 4341181 cttgggtttt ggcaatctgc tcagctttgc cagtttgccc accgtgggtc aggtgaccgc 4341241 caccatgggt cagctgcaac agctcgtggc ggccggcggt ggccccagcc aactggccag 4341301 catgggcagc caacaagcgc aactgatctc gtcgcaggcc cagcaaggag gccagcagca 4341361 cgccaccctc gtgagcgaca agaaggaaga cgaggaaggc gtggccgagg cggagcgtgc 4341421 acccatcgac gctggcaccg cggccagcca acgggggcag gaggggaccg tcctttgatc 4341481 ggacaccgag tcgccagcag gtctgtgcca tagcgagtcg aagccatagc gagtagaaag 4341541 ttaaacgtag aggagggttc aacccatgac cggatttctc ggtgtcgtgc cttcgttcct 4341601 gaaggtgctg gcgggcatgc acaacgagat cgtgggtgat atcaaaaggg cgaccgatac 4341661 ggtcgccggg attagcggac gagttcagct tacccatggt tcgttcacgt cgaaattcaa 4341721 tgacacgctg caagagtttg agaccacccg tagcagcacg ggcacgggtt tgcagggagt 4341781 caccagcgga ctggccaata atctgctcgc agccgccggc gcctacctca aggccgacga 4341841 tggcctagcc ggtgttatcg acaagatttt cggttgatca tgacgggtcc gtccgctgca 4341901 ggccgcgcgg gcaccgccga caacgtggtc ggcgtcgagg taaccatcga cggcatgttg 4341961 gtgatcgccg atcggttaca cctggttgat ttccctgtca cgcttgggat tcggccgaat 4342021 atcccgcaag aggatctgcg agacatcgtc tgggaacagg tgcagcgtga cctcacagcg 4342081 caaggggtgc tcgacctcca cggggagccc caaccgacgg tcgcggagat ggtcgaaacc 4342141 ctgggcaggc cagatcggac cttggagggt cgctggtggc ggcgcgacat tggcggcgtc 4342201 atggtgcgct tcgtcgtgtg ccgcaggggc gaccgccatg tgatcgcggc gcgcgacggc 4342261 gacatgctgg tgctgcagtt ggtggcgccg caggtcggct tggcgggcat ggtgacagcg 4342321 gtgctggggc ccgccgaacc cgccaacgtc gaacccctga cgggtgtggc aaccgagcta 4342381 gccgaatgca caaccgcgtc ccaattgacg caatacggta tcgcaccggc ctcggcccgc 4342441 gtctatgccg agatcgtggg taacccgacc ggctgggtgg agatcgttgc cagccaacgc 4342501 caccccggcg gcaccacgac gcagaccgac gccgccgctg gcgtcctgga ctccaagctc 4342561 ggtaggctgg tgtcgcttcc ccgccgtgtt ggaggcgacc tgtacggaag cttcctgccc 4342621 ggcactcagc agaacttgga gcgtgcgctg gacggcttgc tagagctgct ccctgcgggc 4342681 gcttggctag atcacacctc agatcacgca caagcctcct cccgaggctg acccctcaca 4342741 tctccgctac gacttcagaa agggacgcca tggtggaccc gccgggcaac gacgacgacc 4342801 acggtgatct cgacgccctc gatttctccg ccgcccacac caacgaggcg tcgccgctgg 4342861 acgccttaga cgactatgcg ccggtgcaga ccgatgacgc cgaaggcgac ctggacgccc 4342921 tccatgcgct caccgaacgc gacgaggagc cggagctgga gttgttcacg gtgaccaacc 4342981 ctcaagggtc ggtgtcggtc tcaaccctga tggacggcag aatccagcac gtcgagctga 4343041 cggacaaggc gaccagcatg tccgaagcgc agctggccga cgagatcttc gttattgccg 4343101 atctggcccg ccaaaaggcg cgggcgtcgc agtacacgtt catggtggag aacatcggtg 4343161 aactgaccga cgaagacgca gaaggcagcg ccctgctgcg ggaattcgtg gggatgaccc 4343221 tgaatctgcc gacgccggaa gaggctgccg cagccgaagc cgaagtgttc gccacccgct 4343281 acgatgtcga ctacacctcc cggtacaagg ccgatgactg atcgcttggc cagtctgttc 4343341 gaaagcgccg tcagcatgtt gccgatgtcg gaggcgcggt cgctagatct gttcaccgag 4343401 atcaccaact acgacgaatc cgcttgcgac gcatggatcg gccggatccg gtgtggggac 4343461 accgaccggg tgacgctgtt tcgcgcctgg tattcgcgcc gcaatttcgg acagttgtcg 4343521 ggatcggtcc agatctcgat gagcacgtta aacgccagga ttgccatcgg ggggctgtac 4343581 ggcgatatca cctacccggt cacctcgccg ctagcgatca ccatgggctt tgccgcatgc 4343641 gaggcagcgc aaggcaatta cgccgacgcc atggaggcct tagaggccgc cccggtcgcg 4343701 ggttccgagc acctggtggc gtggatgaag gcggttgtct acggcgcggc cgaacgctgg 4343761 accgacgtga tcgaccaggt caagagtgct gggaaatggc cggacaagtt tttggccggc 4343821 gcggccggtg tggcgcacgg ggttgccgcg gcaaacctgg ccttgttcac cgaagccgaa 4343881 cgccgactca ccgaggccaa cgactcgccc gccggtgagg cgtgtgcgcg cgccatcgcc 4343941 tggtatctgg cgatggcacg gcgcagccag ggcaacgaaa gcgccgcggt ggcgctgctg 4344001 gaatggttac agaccactca ccccgagccc aaagtggctg cggcgctgaa ggatccctcc 4344061 taccggctga agacgaccac cgccgaacag atcgcatccc gcgccgatcc ctgggatccg 4344121 ggcagtgtcg tgaccgacaa ctccggccgg gagcggctgc tcgccgaggc ccaagccgaa 4344181 ctcgaccgcc aaattgggct cacccgggtt aaaaatcaga ttgaacgcta ccgcgcggcg 4344241 acgctgatgg cccgggtccg cgccgccaag ggtatgaagg tcgcccagcc cagcaagcac 4344301 atgatcttca ccggaccgcc cggtaccggc aagaccacga tcgcgcgggt ggtggccaat 4344361 atcctggccg gcttaggcgt cattgccgaa cccaaactcg tcgagacgtc gcgcaaggac 4344421 ttcgtcgccg agtacgaggg gcaatcggcg gtcaagaccg ctaagacgat cgatcaggcg 4344481 ctgggcgggg tgcttttcat cgacgaggct tatgcgctgg tgcaggaaag agacggccgc 4344541 accgatccgt tcggtcaaga ggcgctggac acgctgctgg cgcggatgga gaacgaccgg 4344601 gaccggctgg tggtgatcat cgccgggtac agctccgaca tagatcggct gctggaaacc 4344661 aacgagggtc tgcggtcgcg gttcgccact cgcatcgagt tcgacaccta ttcccccgag 4344721 gaactcctcg agatcgccaa cgtcattgcc gctgctgatg attcggcgtt gaccgcagag 4344781 gcggccgaga actttcttca ggccgccaag cagttggagc agcgcatgtt gcgcggccgg 4344841 cgcgccctgg acgtcgccgg caacggtcgg tatgcgcgcc agctggtgga ggccagcgag 4344901 caatgccggg acatgcgtct agcccaggtc ctcgatatcg acaccctcga cgaagaccgg 4344961 cttcgcgaga tcaacggctc agatatggcg gaggctatcg ccgcggtgca cgcacacctc 4345021 aacatgagag aatgaactat ggggcttcgc ctcaccacca aggttcaggt tagcggctgg 4345081 cgttttctgc tgcgccggct cgaacacgcc atcgtgcgcc gggacacccg gatgtttgac 4345141 gacccgctgc agttctacag ccgctcgatc gctcttggca tcgtcgtcgc ggtcctgatt 4345201 ctggcgggtg ccgcgctgct ggcgtacttc aaaccacaag gcaaactcgg cggcaccagc 4345261 ctgttcaccg accgcgcgac caaccagctt tacgtgctgc tgtccggaca gttgcatccg 4345321 gtctacaacc tgacttcggc gcggctggtg ctgggcaatc cggccaaccc ggccaccgtg 4345381 aagtcctccg aactgagcaa gctgccgatg ggccagaccg ttggaatccc cggcgccccc 4345441 tacgccacgc ctgtttcggc gggcagcacc tcgatctgga ccctatgcga caccgtcgcc 4345501 cgagccgact ccacttcccc ggtagtgcag accgcggtca tcgcgatgcc gttggagatc 4345561 gatgcttcga tcgatccgct ccagtcacac gaagcggtgc tggtgtccta ccagggcgaa 4345621 acctggatcg tcacaactaa gggacgccac gccatagatc tgaccgaccg cgccctcacc 4345681 tcgtcgatgg ggataccggt gacggccagg ccaaccccga tctcggaggg catgttcaac 4345741 gcgctgcctg atatggggcc ctggcagctg ccgccgatac cggcggcggg cgcgcccaat 4345801 tcgcttggcc tacctgatga tctagtgatc ggatcggtct tccagatcca caccgacaag 4345861 ggcccgcaat actatgtggt gctgcccgac ggcatcgcgc aggtcaacgc gacaaccgct 4345921 gcggcgctgc gcgccaccca ggcgcacggg ctggtcgcgc caccggcaat ggtgcccagt 4345981 ctggtcgtca gaatcgccga acgggtatac ccctcaccgc tacccgatga accgctcaag 4346041 atcgtgtccc ggccgcagga tcccgcgctg tgctggtcat ggcaacgcag cgccggcgac 4346101 cagtcgccgc agtcaacggt gctgtccggc cggcatctgc cgatatcgcc ctcagcgatg 4346161 aacatgggga tcaagcagat ccacgggacg gcgaccgttt acctcgacgg cggaaaattc 4346221 gtggcactgc aatcccccga tcctcgatac accgaatcga tgtactacat cgatccacag 4346281 ggcgtgcgtt atggggtgcc taacgcggag acagccaagt cgctgggcct gagttcaccc 4346341 caaaacgcgc cctgggagat cgttcgtctc ctggtcgacg gtccggtgct gtcgaaagat 4346401 gccgcactgc tcgagcacga cacgctgccc gctgacccta gcccccgaaa agttcccgcc 4346461 ggagcctccg gagccccctg atgacgacca agaagttcac tcccaccatt acccgtggcc 4346521 cccggttgac cccgggcgag atcagcctca cgccgcccga tgacctgggc atcgacatcc 4346581 caccgtcggg cgtccaaaag atccttccct acgtgatggg tggcgccatg ctcggcatga 4346641 tcgccatcat ggtggccggc ggcaccaggc agctgtcgcc gtacatgttg atgatgccgc 4346701 tgatgatgat cgtgatgatg gtcggcggtc tggccggtag caccggtggt ggcggcaaga 4346761 aggtgcccga aatcaacgcc gaccgcaagg agtacctgcg gtatttggca ggactacgca 4346821 cccgagtgac gtcctcggcc acctctcagg tggcgttctt ctcctaccac gcaccgcatc 4346881 ccgaggatct gttgtcgatc gtcggcaccc aacggcagtg gtcccggccg gccaacgccg 4346941 acttctatgc ggccacccga atcggtatcg gtgaccagcc ggcggtggat cgattattga 4347001 agccggccgt cggcggggag ttggccgccg ccagcgcagc acctcagccg ttcctggagc 4347061 cggtcagtca tatgtgggtg gtcaagtttc tacgaaccca tggattgatc catgactgcc 4347121 cgaaactgct gcaactccgt acctttccga ctatcgcgat cggcggggac ttggcggggg 4347181 cagccggcct gatgacggcg atgatctgtc acctagccgt gttccaccca ccggacctgc 4347241 tgcagatccg ggtgctcacc gaggaacccg acgaccccga ctggtcctgg ctcaaatggc 4347301 ttccgcacgt acagcaccag accgaaaccg atgcggccgg gtccacccgg ctgatcttca 4347361 cgcgccagga aggtctgtcg gacctggccg cgcgcgggcc acacgcaccc gattcgcttc 4347421 ccggcggccc ctacgtagtc gtcgtcgacc tgaccggcgg caaggctgga ttcccgcccg 4347481 acggtagggc cggtgtcacg gtgatcacgt tgggcaacca tcgcggctcg gcctaccgca 4347541 tcagggtgca cgaggatggg acggctgatg accggctccc taaccaatcg tttcgccagg 4347601 tgacatcggt caccgatcgg atgtcgccgc agcaagccag ccgtatcgcg cgaaagttgg 4347661 ccggatggtc catcacgggc accatcctcg acaagacgtc gcgggtccag aagaaggtgg 4347721 ccaccgactg gcaccagctg gtcggtgcgc aaagtgtcga ggagataaca ccttcccgct 4347781 ggaggatgta caccgacacc gaccgtgacc ggctaaagat cccgtttggt catgaactaa 4347841 agaccggcaa cgtcatgtac ctggacatca aagagggcgc ggaattcggc gccggaccgc 4347901 acggcatgct catcgggacc acggggtctg ggaagtccga attcctgcgc accctgatcc 4347961 tgtcgctggt ggcaatgact catccagatc aggtgaatct cctgctcacc gacttcaaag 4348021 gtggttcaac cttcctggga atggaaaagc ttccgcacac tgccgctgtc gtcaccaaca 4348081 tggccgagga agccgagctc gtcagccgga tgggcgaggt gttgaccgga gaactcgatc 4348141 ggcgccagtc gatcctccga caggccggga tgaaagtcgg cgcggccgga gccctgtccg 4348201 gcgtggccga atacgagaag taccgcgaac gcggtgccga cctacccccg ctgccaacgc 4348261 ttttcgtcgt cgtcgacgag ttcgccgagc tgttgcagag tcacccggac ttcatcgggc 4348321 tgttcgaccg gatctgccgc gtcgggcggt cgctgagggt ccatctgctg ctggctaccc 4348381 agtcgctgca gaccggcggt gttcgcatcg acaaactgga gccaaacctg acatatcgaa 4348441 tcgcattgcg caccaccagc tctcatgaat ccaaggcggt aatcggcaca ccggaggcgc 4348501 agtacatcac caacaaggag agcggtgtcg ggtttctccg ggtcggcatg gaagacccgg 4348561 tcaagttcag caccttctac atcagtgggc catacatgcc gccggcggca ggcgtcgaaa 4348621 ccaatggtga agccggaggg cccggtcaac agaccactag acaagccgcg cgcattcaca 4348681 ggttcaccgc ggcaccggtt ctcgaggagg cgccgacacc gtgacccgcg ccggcgacga 4348741 tgcaaagcgc agcgatgagg aggagcggcg ccaacggccc gcgccggcga cgatgcaaag 4348801 cgcagcgatg aggaggagcg gcgcgcatga ctgctgaacc ggaagtacgg acgctgcgcg 4348861 aggttgtgct ggaccagctc ggcactgctg aatcgcgtgc gtacaagatg tggctgccgc 4348921 cgttgaccaa tccggtcccg ctcaacgagc tcatcgcccg tgatcggcga caacccctgc 4348981 gatttgccct ggggatcatg gatgaaccgc gccgccatct acaggatgtg tggggcgtag 4349041 acgtttccgg ggccggcggc aacatcggta ttgggggcgc acctcaaacc gggaagtcga 4349101 cgctactgca gacgatggtg atgtcggccg ccgccacaca ctcaccgcgc aacgttcagt 4349161 tctattgcat cgacctaggt ggcggcgggc tgatctatct cgaaaacctt ccacacgtcg 4349221 gtggggtagc caatcggtcc gagcccgaca aggtcaaccg ggtggtcgca gagatgcaag 4349281 ccgtcatgcg gcaacgggaa accaccttca aggaacaccg agtgggctcg atcgggatgt 4349341 accggcagct gcgtgacgat ccaagtcaac ccgttgcgtc cgatccatac ggcgacgtct 4349401 ttctgatcat cgacggatgg cccggttttg tcggcgagtt ccccgacctt gaggggcagg 4349461 ttcaagatct ggccgcccag gggctggcgt tcggcgtcca cgtcatcatc tccacgccac 4349521 gctggacaga gctgaagtcg cgtgttcgcg actacctcgg caccaagatc gagttccggc 4349581 ttggtgacgt caatgaaacc cagatcgacc ggattacccg cgagatcccg gcgaatcgtc 4349641 cgggtcgggc agtgtcgatg gaaaagcacc atctgatgat cggcgtgccc aggttcgacg 4349701 gcgtgcacag cgccgataac ctggtggagg cgatcaccgc gggggtgacg cagatcgctt 4349761 cccagcacac cgaacaggca cctccggtgc gggtcctgcc ggagcgtatc cacctgcacg 4349821 aactcgaccc gaacccgccg ggaccagagt ccgactaccg cactcgctgg gagattccga 4349881 tcggcttgcg cgagacggac ctgacgccgg ctcactgcca catgcacacg aacccgcacc 4349941 tactgatctt cggtgcggcc aaatcgggca agacgaccat tgcccacgcg atcgcgcgcg 4350001 ccatttgtgc ccgaaacagt ccccagcagg tgcggttcat gctcgcggac taccgctcgg 4350061 gcctgctgga cgcggtgccg gacacccatc tgctgggcgc cggcgcgatc aaccgcaaca 4350121 gcgcgtcgct agacgaggcc gttcaagcac tggcggtcaa cctgaagaag cggttgccgc 4350181 cgaccgacct gacgacggcg cagctacgct cgcgttcgtg gtggagcgga tttgacgtcg 4350241 tgcttctggt cgacgattgg cacatgatcg tgggtgccgc cggggggatg ccgccgatgg 4350301 caccgctggc cccgttattg ccggcggcgg cagatatcgg gttgcacatc attgtcacct 4350361 gtcagatgag ccaggcttac aaggcaacca tggacaagtt cgtcggcgcc gcattcgggt 4350421 cgggcgctcc gacaatgttc ctttcgggcg agaagcagga attcccatcc agtgagttca 4350481 aggtcaagcg gcgcccccct ggccaggcat ttctcgtctc gccagacggc aaagaggtca 4350541 tccaggcccc ctacatcgag cctccagaag aagtgttcgc agcaccccca agcgccggtt 4350601 aagattattt cattgccggt gtagcaggac ccgagctcag cccggtaatc gagttcgggc 4350661 aatgctgacc atcgggtttg tttccggcta taaccgaacg gtttgtgtac gggatacaaa 4350721 tacagggagg gaagaagtag gcaaatggaa aaaatgtcac atgatccgat cgctgccgac 4350781 attggcacgc aagtgagcga caacgctctg cacggcgtga cggccggctc gacggcgctg 4350841 acgtcggtga ccgggctggt tcccgcgggg gccgatgagg tctccgccca agcggcgacg 4350901 gcgttcacat cggagggcat ccaattgctg gcttccaatg catcggccca agaccagctc 4350961 caccgtgcgg gcgaagcggt ccaggacgtc gcccgcacct attcgcaaat cgacgacggc 4351021 gccgccggcg tcttcgccga ataggccccc aacacatcgg agggagtgat caccatgctg 4351081 tggcacgcaa tgccaccgga gctaaatacc gcacggctga tggccggcgc gggtccggct 4351141 ccaatgcttg cggcggccgc gggatggcag acgctttcgg cggctctgga cgctcaggcc 4351201 gtcgagttga ccgcgcgcct gaactctctg ggagaagcct ggactggagg tggcagcgac 4351261 aaggcgcttg cggctgcaac gccgatggtg gtctggctac aaaccgcgtc aacacaggcc 4351321 aagacccgtg cgatgcaggc gacggcgcaa gccgcggcat acacccaggc catggccacg 4351381 acgccgtcgc tgccggagat cgccgccaac cacatcaccc aggccgtcct tacggccacc 4351441 aacttcttcg gtatcaacac gatcccgatc gcgttgaccg agatggatta tttcatccgt 4351501 atgtggaacc aggcagccct ggcaatggag gtctaccagg ccgagaccgc ggttaacacg 4351561 cttttcgaga agctcgagcc gatggcgtcg atccttgatc ccggcgcgag ccagagcacg 4351621 acgaacccga tcttcggaat gccctcccct ggcagctcaa caccggttgg ccagttgccg 4351681 ccggcggcta cccagaccct cggccaactg ggtgagatga gcggcccgat gcagcagctg 4351741 acccagccgc tgcagcaggt gacgtcgttg ttcagccagg tgggcggcac cggcggcggc 4351801 aacccagccg acgaggaagc cgcgcagatg ggcctgctcg gcaccagtcc gctgtcgaac 4351861 catccgctgg ctggtggatc aggccccagc gcgggcgcgg gcctgctgcg cgcggagtcg 4351921 ctacctggcg caggtgggtc gttgacccgc acgccgctga tgtctcagct gatcgaaaag 4351981 ccggttgccc cctcggtgat gccggcggct gctgccggat cgtcggcgac gggtggcgcc 4352041 gctccggtgg gtgcgggagc gatgggccag ggtgcgcaat ccggcggctc caccaggccg 4352101 ggtctggtcg cgccggcacc gctcgcgcag gagcgtgaag aagacgacga ggacgactgg 4352161 gacgaagagg acgactggtg agctcccgta atgacaacag acttcccggc cacccgggcc 4352221 ggaagacttg ccaacatttt ggcgaggaag gtaaagagag aaagtagtcc agcatggcag 4352281 agatgaagac cgatgccgct accctcgcgc aggaggcagg taatttcgag cggatctccg 4352341 gcgacctgaa aacccagatc gaccaggtgg agtcgacggc aggttcgttg cagggccagt 4352401 ggcgcggcgc ggcggggacg gccgcccagg ccgcggtggt gcgcttccaa gaagcagcca 4352461 ataagcagaa gcaggaactc gacgagatct cgacgaatat tcgtcaggcc ggcgtccaat 4352521 actcgagggc cgacgaggag cagcagcagg cgctgtcctc gcaaatgggc ttctgacccg 4352581 ctaatacgaa aagaaacgga gcaaaaacat gacagagcag cagtggaatt tcgcgggtat 4352641 cgaggccgcg gcaagcgcaa tccagggaaa tgtcacgtcc attcattccc tccttgacga 4352701 ggggaagcag tccctgacca agctcgcagc ggcctggggc ggtagcggtt cggaggcgta 4352761 ccagggtgtc cagcaaaaat gggacgccac ggctaccgag ctgaacaacg cgctgcagaa 4352821 cctggcgcgg acgatcagcg aagccggtca ggcaatggct tcgaccgaag gcaacgtcac 4352881 tgggatgttc gcatagggca acgccgagtt cgcgtagaat agcgaaacac gggatcgggc 4352941 gagttcgacc ttccgtcggt ctcgcccttt ctcgtgttta tacgtttgag cgcactctga 4353001 gaggttgtca tggcggccga ctacgacaag ctcttccggc cgcacgaagg tatggaagct 4353061 ccggacgata tggcagcgca gccgttcttc gaccccagtg cttcgtttcc gccggcgccc 4353121 gcatcggcaa acctaccgaa gcccaacggc cagactccgc ccccgacgtc cgacgacctg 4353181 tcggagcggt tcgtgtcggc cccgccgccg ccacccccac ccccacctcc gcctccgcca 4353241 actccgatgc cgatcgccgc aggagagccg ccctcgccgg aaccggccgc atctaaacca 4353301 cccacacccc ccatgcccat cgccggaccc gaaccggccc cacccaaacc acccacaccc 4353361 cccatgccca tcgccggacc cgaaccggcc ccacccaaac cacccacacc tccgatgccc 4353421 atcgccggac ctgcacccac cccaaccgaa tcccagttgg cgccccccag accaccgaca 4353481 ccacaaacgc caaccggagc gccgcagcaa ccggaatcac cggcgcccca cgtaccctcg 4353541 cacgggccac atcaaccccg gcgcaccgca ccagcaccgc cctgggcaaa gatgccaatc 4353601 ggcgaacccc cgcccgctcc gtccagaccg tctgcgtccc cggccgaacc accgacccgg 4353661 cctgcccccc aacactcccg acgtgcgcgc cggggtcacc gctatcgcac agacaccgaa 4353721 cgaaacgtcg ggaaggtagc aactggtcca tccatccagg cgcggctgcg ggcagaggaa 4353781 gcatccggcg cgcagctcgc ccccggaacg gagccctcgc cagcgccgtt gggccaaccg 4353841 agatcgtatc tggctccgcc cacccgcccc gcgccgacag aacctccccc cagcccctcg 4353901 ccgcagcgca actccggtcg gcgtgccgag cgacgcgtcc accccgattt agccgcccaa 4353961 catgccgcgg cgcaacctga ttcaattacg gccgcaacca ctggcggtcg tcgccgcaag 4354021 cgtgcagcgc cggatctcga cgcgacacag aaatccttaa ggccggcggc caaggggccg 4354081 aaggtgaaga aggtgaagcc ccagaaaccg aaggccacga agccgcccaa agtggtgtcg 4354141 cagcgcggct ggcgacattg ggtgcatgcg ttgacgcgaa tcaacctggg cctgtcaccc 4354201 gacgagaagt acgagctgga cctgcacgct cgagtccgcc gcaatccccg cgggtcgtat 4354261 cagatcgccg tcgtcggtct caaaggtggg gctggcaaaa ccacgctgac agcagcgttg 4354321 gggtcgacgt tggctcaggt gcgggccgac cggatcctgg ctctagacgc ggatccaggc 4354381 gccggaaacc tcgccgatcg ggtagggcga caatcgggcg cgaccatcgc tgatgtgctt 4354441 gcagaaaaag agctgtcgca ctacaacgac atccgcgcac acactagcgt caatgcggtc 4354501 aatctggaag tgctgccggc accggaatac agctcggcgc agcgcgcgct cagcgacgcc 4354561 gactggcatt tcatcgccga tcctgcgtcg aggttttaca acctcgtctt ggctgattgt 4354621 ggggccggct tcttcgaccc gctgacccgc ggcgtgctgt ccacggtgtc cggtgtcgtg 4354681 gtcgtggcaa gtgtctcaat cgacggcgca caacaggcgt cggtcgcgtt ggactggttg 4354741 cgcaacaacg gttaccaaga tttggcgagc cgcgcatgcg tggtcatcaa tcacatcatg 4354801 ccgggagaac ccaatgtcgc agttaaagac ctggtgcggc atttcgaaca gcaagttcaa 4354861 cccggccggg tcgtggtcat gccgtgggac aggcacattg cggccggaac cgagatttca 4354921 ctcgacttgc tcgaccctat ctacaagcgc aaggtcctcg aattggccgc agcgctatcc 4354981 gacgatttcg agagggctgg acgtcgttga gcgcacctgc tgttgctgct ggtcctaccg 4355041 ccgcgggggc aaccgctgcg cggcctgcca ccacccgggt gacgatcctg accggcagac 4355101 ggatgaccga tttggtactg ccagcggcgg tgccgatgga aacttatatt gacgacaccg 4355161 tcgcggtgct ttccgaggtg ttggaagaca cgccggctga tgtactcggc ggcttcgact 4355221 ttaccgcgca aggcgtgtgg gcgttcgctc gtcccggatc gccgccgctg aagctcgacc 4355281 agtcactcga tgacgccggg gtggtcgacg ggtcactgct gactctggtg tcagtcagtc 4355341 gcaccgagcg ctaccgaccg ttggtcgagg atgtcatcga cgcgatcgcc gtgcttgacg 4355401 agtcacctga gttcgaccgc acggcattga atcgctttgt gggggcggcg atcccgcttt 4355461 tgaccgcgcc cgtcatcggg atggcgatgc gggcgtggtg ggaaactggg cgtagcttgt 4355521 ggtggccgtt ggcgattggc atcctgggga tcgctgtgct ggtaggcagc ttcgtcgcga 4355581 acaggttcta ccagagcggc cacctggccg agtgcctact ggtcacgacg tatctgctga 4355641 tcgcaaccgc cgcagcgctg gccgtgccgt tgccgcgcgg ggtcaactcg ttgggggcgc 4355701 cacaagttgc cggcgccgct acggccgtgc tgtttttgac cttgatgacg cggggcggcc 4355761 ctcggaagcg tcatgagttg gcgtcgtttg ccgtgatcac cgctatcgcg gtcatcgcgg 4355821 ccgccgctgc cttcggctat ggataccagg actgggtccc cgcggggggg atcgcattcg 4355881 ggctgttcat tgtgacgaat gcggccaagc tgaccgtcgc ggtcgcgcgg atcgcgctgc 4355941 cgccgattcc ggtacccggc gaaaccgtgg acaacgagga gttgctcgat cccgtcgcga 4356001 ccccggaggc taccagcgaa gaaaccccga cctggcaggc catcatcgcg tcggtgcccg 4356061 cgtccgcggt ccggctcacc gagcgcagca aactggccaa gcaacttctg atcggatacg 4356121 tcacgtcggg caccctgatt ctggctgccg gtgccatcgc ggtcgtggtg cgcgggcact 4356181 tctttgtaca cagcctggtg gtcgcgggtt tgatcacgac cgtctgcgga tttcgctcgc 4356241 ggctttacgc cgagcgctgg tgtgcgtggg cgttgctggc ggcgacggtc gcgattccga 4356301 cgggtctgac ggccaaactc atcatctggt acccgcacta tgcctggctg ttgttgagcg 4356361 tctacctcac ggtagccctg gttgcgctcg tggtggtcgg gtcgatggct cacgtccggc 4356421 gcgtttcacc ggtcgtaaaa cgaactctgg aattgatcga cggcgccatg atcgctgcca 4356481 tcattcccat gctgctgtgg atcaccgggg tgtacgacac ggtccgcaat atccggttct 4356541 gagccggatc ggctgattgg cggttcctga cagaacatcg aggacacggc gcaggtttgc 4356601 ataccttcgg cgcccgacaa attgctgcga ttgagcgtgt ggcgcgtccg gtaaaatttg 4356661 ctcgatgggg aacacgtata ggagatccgg caatggctga accgttggcc gtcgatccca 4356721 ccggcttgag cgcagcggcc gcgaaattgg ccggcctcgt ttttccgcag cctccggcgc 4356781 cgatcgcggt cagcggaacg gattcggtgg tagcagcaat caacgagacc atgccaagca 4356841 tcgaatcgct ggtcagtgac gggctgcccg gcgtgaaagc cgccctgact cgaacagcat 4356901 ccaacatgaa cgcggcggcg gacgtctatg cgaagaccga tcagtcactg ggaaccagtt 4356961 tgagccagta tgcattcggc tcgtcgggcg aaggcctggc tggcgtcgcc tcggtcggtg 4357021 gtcagccaag tcaggctacc cagctgctga gcacacccgt gtcacaggtc acgacccagc 4357081 tcggcgagac ggccgctgag ctggcacccc gtgttgttgc gacggtgccg caactcgttc 4357141 agctggctcc gcacgccgtt cagatgtcgc aaaacgcatc ccccatcgct cagacgatca 4357201 gtcaaaccgc ccaacaggcc gcccagagcg cgcagggcgg cagcggccca atgcccgcac 4357261 agcttgccag cgctgaaaaa ccggccaccg agcaagcgga gccggtccac gaagtgacaa 4357321 acgacgatca gggcgaccag ggcgacgtgc agccggccga ggtcgttgcc gcggcacgtg 4357381 acgaaggcgc cggcgcatca ccgggccagc agcccggcgg gggcgttccc gcgcaagcca 4357441 tggataccgg agccggtgcc cgcccagcgg cgagtccgct ggcggccccc gtcgatccgt 4357501 cgactccggc accctcaaca accacaacgt tgtagaccgg gcctgccagc ggctccgtct 4357561 cgcacgcagc gcctgttgct gtcctggcct cgtcagcatg cggcggccag ggcccggtcg 4357621 agcaacccgg tgacgtattg ccagtacagc cagtccgcga cggccacacg ctggacggcc 4357681 gcgtcagtcg cagtgtgcgc ttggtgcagg gcaatctcct gtgagtgggc agcgtaggcc 4357741 cggaacgccc gcagatgagc ggcctcgcgg ccggtagcgg tgctggtcat gggcttcatc 4357801 agctcgaacc acagcatgtg ccgctcatcg cccggtggat tgacatccac cggcgccggc 4357861 ggcaacaagt cgagcaaacg ctgatcggta gtgtcggcca gctgagccgc cgccgagggg 4357921 tcgacgacct ccagccgcga ccggcccgtc attttgccgc tctccggaat gtcatctggc 4357981 tccagcacaa tcttggccac accgggatcc gaactggcca actgctccgc ggtaccgatc 4358041 accgcccgca gcgtcatgtc gtggaaagcc gcccaggctt gcacggccaa aaccgggtag 4358101 gtggcacagc gtgcaatttc gtcaaccggg attgcgtgat ccgcgctggc caagtacacc 4358161 ttattcggca attccatccc gtcgggtatg taggccagcc catagctgtt ggccacgacg 4358221 atggaaccgt cggtggtcac cgcggtgatc cagaagaacc cgtagtcgcc cgcgttgttg 4358281 tcggacgcgt tgagcgccgc cgcgatgcgt cgcgccaacc gcagcgcatc accgcggcca 4358341 cgctggcggg cgctggcagc tgcagtggcg gcgtcgcgtg ccgcccgagc cgccgacacc 4358401 gggatcatcg acaccggcgt accgtcatct gcagactcgc tgcgatcggg tttgtcgatg 4358461 tgatcggtcg acggcgggcg ggcaggaggt gccgtccgcg ccgaggccgc ccgcgtgctc 4358521 ggtgccgccg ccttgtccga ggtagccacc ggcgcccgcc cagtggcagc atgcgacccc 4358581 gcgcccgagg ccgcggccgt acccacgctc gaacgcgcgc ccgctcccac ggcggtaccg 4358641 ctcggcgcgg cggccgccgc ccgtgcgccc gggacaccgg acgccgcagc cggcgtcacc 4358701 gacgcggcgg attcgtccgc atgggcaggc cccgactgcg tccccccgcc cgcatgctgg 4358761 cccggcacac caggttgctc cgccaacgcc gcgggtttga cgtgcggcgc cggctcgccc 4358821 cctggggtgc ccggtgttgc tggaccagac ggaccgggag tggccggtgt aaccggctgg 4358881 ggcccaggcg atggcgccgg tgccggagcc ggctgcgggt gtggagcggg agctggggta 4358941 acgggcgtgg ccggggttgc cggtgtggcc ggggcgaccg ggggggtgac cggcgtgatc 4359001 ggggttggct cgcctggtgt gcccggtttg accggggtca ccggggtgac cggcttgccc 4359061 ggggtcaccg gcgtgacggg agtgccgggc gttggtgtga tcggagttac cggcgctccc 4359121 gggatgggtg tgattggggt tcccggggtg atcggggttc ccggggtgat cggggttccc 4359181 ggtgtgcccg gtgtgcccgg ggatggcacg accagggtag gcacgtctgg gggtggcggc 4359241 gacttctgct gaagcaaatc ctcgagtgcg ttcttcggag gtttccaatt cttggattcc 4359301 agcacccgct cagcggtctc ggcgaccaga ctgacattgg ccccatgcgt cgccgtgacc 4359361 aatgaattga tggcggtatg gcgctcatca gcatccaggc tagggtcatt ctccaggata 4359421 tcgatctccc gttgagcgcc atccacatta ttgccgatat cggatttagc ttgctcaatc 4359481 aacccggcaa tatgcctgtg ccaggtaatc accgtggcga gataatcctg cagcgtcatc 4359541 aattgattga tgtttgcacc cagggcgccg ttggcagcat tggcggcgcc gccggaccat 4359601 aggccgcctt cgaagacgtg gcctttctgc tggcggcagg tgtccaatac atcggtgacc 4359661 ctttgcaaaa cctggctata ttcctgggcc cggtcataga aagtgtcttc atcggcttcc 4359721 acccagccgc ccggatccag catctgtctg gcatagctgc ccgtcggcct ggtaatactc 4359781 atcccctact gccctcccca aaccgccaga tcgcctcgcg gatcaccgtc cggttggcct 4359841 ccggcatttc acgccggctc ggccgctgga tccaccccgc gccggtattc gcagtaaccc 4359901 gttgaatccg cgcgcatgat gcaccgcttg ggcgatcagc cgggtggtca cctcgcttgc 4359961 gctggccgcg ctgtcgcacg gggcgctcgg tggtaacgga cgtcataatt aaccagcgta 4360021 accgaaccta agaccagcta gctgcggcaa tattggcgac caggactatg gcgccctccg 4360081 aacccggccg atccatgtca aaacattgac aatgcgtact cacgccgtgt cgggcgcgct 4360141 gaatgaccgc attgcggcgc tcattcggtg cgtagtcgct accaccgcaa caatgggctt 4360201 aggccattcc ttcgttcatc gcgcgggaca tggccgataa cgcagcggtc agctgctcgc 4360261 ccgccgcgtc gttatacgcg gacgccgcgg cctgcgcatt gtgcagcgcc tcgttgaccc 4360321 gctgagccac cgcctcggca cccagcttct tcagcaaacc atcttcgatg cgcaggccgg 4360381 tgagccactg gtgcccattg atcgtcactt cgacggtctc ggcttcgtcg gtggcgcgga 4360441 aggatccgtt gttcatctga ttgagcgtcc cgtctagggc cgactgaaac cgcgccgcca 4360501 gcgtcaacgc ccgggcgaca tgcgggtcca attcgtccat gctcacttcg actccttact 4360561 gtcctggcgc cgacggttac caatgacggc ctcggtccat gcccgatcct cggtgtagag 4360621 cgcctcgtct tcctgctgag aacccttgga cttggcgccc ccttgtccct gatgcgcggc 4360681 acccatcggc attcccatgc caccgccgcc cagcgcggcg ccgccgccgg cccttccctg 4360741 gcctaagccg gcaatgtcac cagcgccagc gggccgcacc gattcggcgc ccccgatcgc 4360801 ggatcccaac ggcgccgacg gcaccccgcc gcctccaccg ccaccgagcg atgccgcttt 4360861 gaccgccacg tcgcccgaca gcgctgcggc ttcccgccca gccgacgtca gctgcgccgc 4360921 cgtgtcagcc gggaggccac cacccggcga tccggtaggc ggaaccatcg gtgcggctgg 4360981 catcccggta ccgggagtca caccggagcc gtcagacggc ggcatcagga agccagggat 4361041 caatccctgc tcttgcggag gcgggggcgg gtcgatcttg atggcggggg gaggcttcgg 4361101 cgggtttacc ggttccaggg ctgccttgtt gttgtattcg gtcagcacct tctccgacct 4361161 ctgctgatac tccgcgtaca ccgggagaat ttggtcgcgg gccgaagggt tttccgcgta 4361221 aagccgttcg agcccgacta tgtcttcata agtcggatgt tcccgcctag cccacacgtg 4361281 cagctgcgcg acatattgag cctgcttggc catcgcagcg ctcaatttgg ccatgtggag 4361341 tatccattgc cgttgttgat cgagcgaagc ctcgcaagcg gtagccgcat cgccttccca 4361401 gttgtcaaac ccccggaacc gcttgacgtc gccttgcagc gtcaggttga aagtgttcca 4361461 cccatccgca aagtgcgcga gcgatgcgcc ttggtcgccc gtttcgagct tccttgccgc 4361521 ttctttgaga tccatgaagt tgggttcacc ggccgtggcc accctcggcg tatcggttag 4361581 ttcggccgaa ctgtcccctc cgacggcccc ggccgattct gcctgcacag ttccttcgcc 4361641 gtcgttgtcc agcgcggtcg cagcctcctc atcaacctcg ccatacgcct tggccgcgtt 4361701 gcgcagcgag gtcgccagac gctgccgctc tttggcaccg gccgccaggt attcccgcat 4361761 gttgtcggcg gacaatacca gctgttgggc ggcgttttta gccgccgtga gttcgcacgg 4361821 tgtgatgggg acatcagtcg gtgggtccgc catcggggcc tccacctcgt tggccctgtt 4361881 caaaatctct tgctgatcca ccgtcacggt ctgcgactgc gtcatatcgg atcatcctcc 4361941 ttagtgctat agccattatc gtcgctaaac tgaaaggttc ctgcactaat ttgatgccgc 4362001 ccgttcatgc cggcatcgcg aacggatcgc cctacttcgg cagcgccatc tggtagcggc 4362061 tttcctcggg tggggaaacc cggcgaatcg gcagctgccg atgccgcggg gtaccgatca 4362121 cattgtgccg cagaatcacc cggtcaatac cgggatgcgg gccgagatag gtcgtcgcat 4362181 tcggccacgc cacctttacc tcctgcccga tgtgtgcgcc gatcaaccgg gcaaattcct 4362241 cgaactgtgg cccgactgtg accatcgcac ctgccgccgc cgcacgcacc acgaactggg 4362301 tgaatgtctg agcgtcaccc aggttgaggg cgatgtcgac atcgtcgaag ggcatgtaga 4362361 ccgggcatcg gttcaccgtc tcgccgacca gtaccccagc tgacccgatc ggcagctggc 4362421 agtggcggtt ggccaccaga tgctggcctt gcagcgcggg ccgctgcccg ccaaataggc 4362481 gggcgaagcc cctgggtgtc ttgggcttgt ccgccgtggt cagcaacacc gtggactgcg 4362541 gggccatccc cggcgcgacc cggactctgg tgatggtgtg gtccgcgcgc gccgaccacc 4362601 atacatccgg acctccgggc gccgcgtagg cggcagtgta ggcatcgcgc cccttgatca 4362661 tcgaccattt ctcccgcaca aagccgatgt cggtggcgtg gtcgtagtca tcgaagctgc 4362721 ggccacacac cgcgtcgaca ccatggctag ccagtcgatc ggcaatgcgc gtcgcggacg 4362781 ccaccaaata ccgggccagt cctgcgacgc cttcatcgcg gcgctgcgcc gatttgcggg 4362841 tgcgttccgg gtcggcgcgc agcacgatcc aggtccggcg gttcgccggc gccgggtctg 4362901 tcccgatcac ctgctgatac agactcacca cgtccggcgc tgcggtattg ccgacgcggt 4362961 agccggctga gacgatatcg gcctccaagt cgggacagtg caccgacagg agctcctcca 4363021 ccagtccggt gtccagcatg tcgtcggtgt gggcttgccc gtcgacgatg accgtcggcg 4363081 tgaatggtcg gggaatgagc tcgattacgg cgaccagaaa ctcgccttgc cagcgcaccg 4363141 caacgtgatc tcctggcttc acggtggccc cgaccacagg ttctgacgag gaatccgggg 4363201 gccgtcggcg ccgccgcaac cacgcgtaca ccgccgccac ccagccggtg atccggcggc 4363261 cgtagaaagt gaccgtggcc acgatgacgc ccaacgaggc cagcgcaatc cccgcccacc 4363321 agtagcgcgt ctccaagaat gcgatgatgc atggcggggc caacgcggag gcaagcaagg 4363381 cgtgcccggt gctgaaccgc agccctaaag gatttctcat cggcggctca gcgcccgtct 4363441 agccagcgcg cccaggccca gggccaacgt aaggccgacg gccaccaacg ccacagccgt 4363501 aatcgggcga cgatcgggac ccggctccac caccgggggt ggaagtcgtc tgacgttgta 4363561 tggcgccgaa gcagggccgg gcggaatgtc ccacgtcagc gcggccaccg catcgatgac 4363621 gccggcgccg accaggtcgt cgaccccgcc cccggggtgt ctcgcggtgg cggtgatccg 4363681 gtggatgatc tgcgccggcg tcaggtcggg gaaccgctgc cgaagcaggg ccgccagacc 4363741 cgacacatat gccgcggcaa acgaggtgcc ggcgatgggt accggcccct cccggccttg 4363801 cagcgcattc accggttcac cggtgtcgcc gagcgcgacg atgttttctg cgggcgcggc 4363861 cacgtccacc cacggtccgt gcatcgagaa cgagctgggc atcccggtct ggccgatacc 4363921 gccgacgctt aacaccagcg gtgcgtacca cgccggggtg acaacggtct gcacattgtt 4363981 ccagccgcgt gggtcgccgg gtgtggacgg gtccggcgcc ggattctgta cgcaatcgcc 4364041 accggtgttg ccggccgcga ccaccaccac cacgcctttg acgttgaccg catagtcgat 4364101 ggatgcaccc agtgaggttt catcgatcgg cctgctcacc ttgtagcagg cggcttcact 4364161 gatgttgatc acacccacgc cgaggttggc ggcgtgcacc acggcgcggg caagactgcg 4364221 gatggaaccg gcggccgggg tggcgttggg gtcattcggg ttggcttgtg agccgaccgg 4364281 ttcgaaggcc tcagacgtct gacgtagcga gagcagtcga gcgtcgggcg cgacgccgac 4364341 gaacccgtcg gtgggcgcgg gccggcccgc gatgatggat gctgtgagag tcccatgggc 4364401 atcacagtca gacaggccgt taccggcctg gtcgacgaaa tcgccgccag gttccgccgg 4364461 gacccgtggc gaagcgtcga caccggtgtc gatcaccgcc accgtcaccc cggccccggt 4364521 cgcgaacttg tgggcatcgg ccacgcccag atacgtgttg ctccacggcg gatcgtggaa 4364581 cccggacccc ggcagcgtgg tgggcgacgc gcacaaaacg cgctgttcgg taggctgatc 4364641 cgggcccgtc acgtcgggcg gcaacgcgcc cggatcgatc ggcggtggcg tgatggccga 4364701 tgcgggcgac gcggtgagca acgccagcgc caccgtgatc agaaagatac ggtgcactcc 4364761 cagaacactc cattcgttga gattcattgc gattcattga gctgcgttgc taccttgggc 4364821 cacttgacgg acctgtgtgc attttagacg taacggctgg gcaaacaacg ctgtcacgcc 4364881 tgggctggtc cgccgcgccg accagggcgc gtaggcgctg tacctggacc acgccgggac 4364941 tcaacggttt tgctaccgca ctagccgata tgcggctgct accaaacgat cgcggccatg 4365001 tctcggttgt ctgagcacac gctgcgtatc gcggcatcga tgtcggtggc ggtgatgatc 4365061 tgcagatcct gaaccgatac cggttggccc gcacgttttt gcgcaaccac ccgggtgtcc 4365121 cggaaccctt cggcgcgttc gatcacgttg cgggcgaacc gaccgttttg catagcgtcg 4365181 ataccgtgct gcccactagg ggtggtgtag ttacggatgg tggtgaccgc gtcgaggaat 4365241 acctcccgtg cggcgtcatc gagctggctg gcgcgcggtg tagcgtagcg gtgtccaatc 4365301 tcgacgatct ccaccggcga ataagactcg aaccgcagct ttcggttgaa ccggccagcc 4365361 aaacccgggt tcacggtgag gaattcatcc acctgatcct catagccggc cccgatgaaa 4365421 cagaagtcga atcggtgtgt ttccaattga accaggagtt gattgaccgc ctccatgccg 4365481 atcatgtccg gtgttccgtc ttgatgacgt tcgatcagcg agtagaactc gtccatgaaa 4365541 atgattcgcc cgagtgactt ttcgatcagc tcgttcgtct tgggtcctga ctccccgatg 4365601 tagtgcccac agaagtccga tcggcgaact tctcgaattt cggggtgacg cacgatcccc 4365661 atgccggcgt agatcttgcc gagcgcttca gcggtggttg tcttacctgt gcctggtggc 4365721 cccaccagca acatgtggtt ggtctgcccc tccaccggta ggccgtgctc taggcgcatc 4365781 atgcgcacct cgagttggtc ttccagcgcc gataccgctt gcttgaccgc cgccaggccc 4365841 acctgtttgg ccagcagttc ccggccctcg gctagcagct cgccgcgccg ctgcgctgca 4365901 ttgtcgtcat cgagctggtc gcggcttttc gccgtcgaag catcccaacg gtcggagcgg 4365961 ctggcgatgg ttcgttcatc ggtaacaatc aagcgcaggt tcgggtccgc cagggcttct 4366021 ttggcggcgt cggtgagcac cccgttgatg gtggccttcg acagccagat ctgggccttg 4366081 tcctcctcat gcagttgccg gtacaccatc ccccgcacat acgccaagtc ggcgaccagc 4366141 agcggaatat cggccggtcc gatcgccgcg gtgagcacgt cggcgccgaa ccgctccgat 4366201 gacctgctgt gtccgatcac gtccacccgg tccagccagt ccagggccac tcgcccctgc 4366261 ccgagatggg cggcggcgtg ggctgccagc gcacaaatcg acgcggtcac cgccggcatg 4366321 acgatcgcct gtggcggcag atcctcggcg gccgtcgaca acacgtcggg ccatcgctgc 4366381 gtgacgtaca tcaggaacgc ccgagccagc tgatgccact ggtagttgcg ccacgaatcc 4366441 aatagctcgc ggtttgctaa cagggcatcg gccttcgcat actcccccgc gatcgtcaac 4366501 gccgacgaca gcgccagccc cacctgagat gcgtcggtca ccgtgatccc gatggatggt 4366561 cccagctgga cctcagcggc caacgtccgg ccgatccgcg tggtctcgcg gtgcagccac 4366621 tcgctatggg cgttgagctg cttaagcgag gccagatcgc ggtcaccgca ggcgatacga 4366681 cccagccacg cgtcggccat cgacggatcg gcctcggtgg cagccacaaa ctcaggcaac 4366741 gccgccacgc atccctggcc attcttgatc gtcatcgccc gatcgaaatg ccggcgcgca 4366801 gtgagtaaat cacccatcgt gtccaccatt ctcgacatcg ccgccgctgt caccgcggtt 4366861 gcaacgtgtg tctgtcactc tgtgcctcaa attccgttgg caacgttcta ccggcctatc 4366921 gacatcgtga ccggctcaag gctgacatag cggttctccg cacggaacat ttccatctca 4366981 accagccagt tttgtcctgc cgcaccgact ttcaccgttg cccgatcgat ttgttcgatg 4367041 gtcacctcga agccatgccg atcgctctcg gacagcgagg taccgggtcg ggcaatggtg 4367101 atgacactgg ctggccgtgg cgtgggcgaa atcgcgacat cgacaccgct gccttcagat 4367161 ttgccgtcat cgccgttctt gcgccgccgc acgtactcca cgacgccgac agtggtgcgc 4367221 ggcgcgggcc gtggtgtgcc gacgatgctc aactgcggca tgcgtacgct ggcccaacgc 4367281 tcttggtcgc gagtgtgcac acacacccgc tcaccggcac cgacgacgcg aatcacgatc 4367341 ctcttggcga tcgtgtcgtc cgcggccacg aagacgcgcg acagctcacc ggcgtcggta 4367401 acgggaatca tcagccggtc cccgttgctc agcttgccaa tcaacacccc cgacggtccg 4367461 atctcggtga ctagctgcgc cggcaacggg cagcgccgct gtccgcgtag gtgtggacgt 4367521 ggcccgcaca tgttggccgc agccgcggcg gcttgctcac cattgagccg acgcaagatc 4367581 acactgggcg gggtaggcgc cggcgtcggt gtgcgcacgg tgatggtcgc ggtgcacgtc 4367641 gcgtccggat acaccgttac gttctggatg acctcatcgg cacgcagcgt ccaggcttgc 4367701 gagagaaccc gcgacgaaat cgcctcagcc gggtacgcat acgtcgtcat ccacccggct 4367761 tcaccgcgga tagctttcca gcgctgcgca ctcccggcta ccgcgtccga ccccagccgg 4367821 cgatcaagct cagccaagtc tgttgcggtg gccagtttgg cgcgcaagcc ctgacagcgc 4367881 agggagctgg caacgcgttg ggcgaccgaa atggcagcgg ccccaacgct ggtacgccag 4367941 cgtaaagctt gggtgttgcc gatcaccgga agccgcatga tcagccacgt ttcgcgccgc 4368001 ccggcatacg gcggcgtacc gatctccgcg tcatacaccc gcgggtaatc gccgacggtg 4368061 ccggttcgcg agccgaaggt gacgacgctg attgaatcga gttccaggtc cagcgggtgg 4368121 cgcagcaacg gcgcgagctc aacgacgtca atcacgttgt cgctttctac ggtcaccgac 4368181 ccggtgaccg tagtcgcccg gtgcgctcgg ccgagaagtt gcaccgccac caccgcgaca 4368241 ccgtcttgca cgcggacgcc acccccggat cggttgttgg ccaaggtaat tgggtcattc 4368301 catttgacgg gacgccgacc ccgcagcccc agtaccgccc acgaccacgc cggctgaccc 4368361 caccactgta cgaacaccaa ggcgacgccg accacgacag ccatgaccgc acctagctgg 4368421 ccgcccagcg cccagcccgc cgacgcgagc acgaacactg tccacacccc ggcgacccgc 4368481 ctcgcactgc gcgggctgaa cccggtcagc ttggacgtca acgcgccctc cgtagccgag 4368541 ccccgattgc cattgccagc acaccggtgg ccactgcgcc gacgaacccg atagcgatat 4368601 tgcgcgcccg gtgatcgggc ggagggggtg gcgcggcggg cgtgatcacc cggctctgtg 4368661 cacccggggc catccgatca ccggatggga tgttaaacgt caatgcggcg accggatcca 4368721 ccagcccgta ccccagtttg ttgtccacgc ccgcaggcgg attgtgcgcc gactgcacga 4368781 tccggttgat cacttggtag gcagtcaact cggggaattt ggcccgcacc agtgccgcga 4368841 cgccgctgac gtaggccgcc gaaaagctgg tgccccagaa cggcatattc ttctcgcctg 4368901 gccgcgacgg cgggtaggca ttgaccggtc cgccgccttg tggcgataga cccatgatgt 4368961 gggttcccgg tgccgcgaca ccgacccacg gacccgacat gctcttgtcc agtgcggcgc 4369021 cgtaggcatc gacggcacct accgacagga cgtaatcaga gaaccatgac ggtgacgaca 4369081 caaccgtgac ctgatgccag tcccggggat ctgacgggtc cagcgggtca tacatcgggt 4369141 tgttgccgca gccggcctcc ccgtcgttgc cggctgctgc cacgatcacc gcatccttga 4369201 cggtggccgc ataccacagc gcggcgccca gcacccgctg gtcgcccgga gccgccgcag 4369261 gcagacatgc ggtcaccgaa atgttgatca ctttcgcccc catgttcgcc gcgtgtacca 4369321 cggcacgcgc caccgagtcg agggtgcccg ctttgacttt ctcatcggag ttgggacccg 4369381 ccgacgacgg gttgaccggc tcgaaggccc gcgaggactg ccgaatcgag atgatggtcg 4369441 catgcggggc cacccccacc accccgtccg gggcgcccgg cgggggtggg ggcaccgcgg 4369501 gttcgtcttc ggtttgcgga tccggcggtc cattggacgg cgccatggcg cccgcatcct 4369561 cgggtggtgg tggcggtggc gcaacggttt gggtgatcgt caccggcggc ggcgggggca 4369621 tcgggggcgg gacttctacc ggcggcgccg gcgcggcggt gaccggcggc ggcccggccg 4369681 gcggtgggaa cgccgcggtg gccggcatgg cccttggcat cggtaaaatc ccaagcggtg 4369741 cagcggcaat gatcgaactc accaccgtgc cgtgcgcgtc gcaatccgat aggccgtcct 4369801 cccccatgat gtagtcgcca ccgggcacca ccggcagccg cgggttggga ctgacgccgg 4369861 tgtcgatgac tgccacgggc acaccgttgc cggtgctgta ctgccacgcc ttgctgatgt 4369921 tgaccaggtt gaagcccggt gctagctgcg ccacgtcggg atttcttacg gtgatcggtg 4369981 tggagcagct gttggagcgg cgcatgggct gatcaggtcc aggccgcgcg tctgcaggca 4370041 ccatcgccgg atctaccgac ggcggtggga tagcctgtgc cgcaggaaca ttagctgaca 4370101 aagcaacgag ggtgagggcg gcgctcgcgg ccgcggcccg caggccaggt cggtttagtg 4370161 gcgaagccat gcaaacagcc cccctagggc cgcagccgct ggtaacaacg cgatcatggc 4370221 cagcacttct agccattcca cggtcaaccg gatgatcggc ctaaaccgcg tcgccggtac 4370281 cacgagggcc acggccaaac ccaatgcggc gaaagccgcg acgaagatcg caggccaaag 4370341 cagcccggtc tgaacacctt tcggggtgtc gagggcgtac ttaagcaccc cggcacacac 4370401 cgcggcggac gccccgcaca ccaatgcgac cgcttggtat ttggcggcga acccgcggcc 4370461 ctgggtgatg aagaggccca ccgtcaagcc ggcaaccaac aacgccaacc aggcccacgg 4370521 ttgacgtggc gtcagcaccc cccataccgc ggcgggcagt acgagcgaca ccccgacgca 4370581 catacccacc tgtaccgcgt taaccagccg cgccgacgcg gcgatcgcgg tgccgcgggc 4370641 ggtgatgtcg gtcagttcat tgtcctcatc gtcggcgtcg gcttcgctga ccggagccac 4370701 cgtatcgacg ggcattcccg cacggcgcgc gaacagatcc cggccggtga tcgatccgaa 4370761 gtgcgggggt cgtacccgtg ccacccacaa cgcaacggtc ggagtcatcc tgatcaggac 4370821 aagcagccct accagcacgc aaatcgccag cacctgcatc gaaaccggcc taaacattcg 4370881 gacggcggcg acagcggcaa ggatcccgca caccgttacc accgcggtga ccactgcggt 4370941 ctgccaccgc ttgcgggtcg ccacgccgat cgtgatcgca cccagaacca ccaccacgag 4371001 cccgatcagc gcatgagccg ccccgagcgc gcccggcggc gcgcacgcgg cggccacggc 4371061 aagcaacacc accgccagcc acccgaaccc actgaacagg tcacggcgct cccgccaacc 4371121 ccaccacacc accaatgccc cgatcaccag gagcacacca atcccgccag ccatcgcagc 4371181 tgggaccggg ctgtcggtga ttgtgcgtgt ccgcaacgtc agggccagca ccactccgac 4371241 cgccatggcg ataatcgcca tggcggtgtg ggcggcagtc agcgaggtta ccggcgcaaa 4371301 catccgatcc ccgccgtcac gccccagcca cttgcccatg gccgccagcc cggtggatag 4371361 cgattcgtac tgtggctcaa acgactcgcc agcaacccgg ggtactagca ccagcgtgtc 4371421 accgtcttga acgcccagct cgtcgaggct cttgttgatg tccagccgca ccccgttgat 4371481 cttgtgtagc tcatagctac ccgccggcag cgcaaccccg tcgaaacctt tgcgcttcag 4371541 atcggcatcg aacaactcca ccattccttc gaagaatccc tctactggaa ttccggcggg 4371601 gaatacctgg gagcatagat gcttgtcgta gcaaatgttg accgcacaac gtgccgggaa 4371661 agcaacctta tgcggcgcag tcactgcgcc gcccgttcgg catccggaac gtatttgtcg 4371721 gccaatccgg cggtgatttc gaaaagccgc aaccgcgact tcttatttaa ttcatgcacc 4371781 gtatcaatga tcccgccttt ggccaggtgc ggatcgaacg gcattgcttc cacgattgca 4371841 ccgaccttgg taaaacgttc ggtcaggtag gccagcgcat ccttgtcggt aatgctgtcg 4371901 gtgtggttga ggatcacggt gctgcgcgag accagctcgt gataaccctg cgccctgagg 4371961 tagtccaccg cccgcagcac cggccgggac cggtccgcgg tgattcccga gacgaacacc 4372021 agggtgtcgg tgctctgcag cactgccttc atcacgtcgt gctctaggtc gggcgaggtg 4372081 tcgatgacaa tgacggtatg agttcgccgc agccgagaca acactgcgga gaacatcgcc 4372141 gggacgagcg gcctgggctg gtctgatgtc cgatttccgg ccagtacgtc gagcccgacc 4372201 gtgttttgcc ccaggtgttc gcgaatgtct gcgtagccct ggacatcggt gtcgttgata 4372261 atggcggcgt aatcccccgg cggcgactcg tcgatgcggt cggccagggt accgaaactc 4372321 ggaaccgcgt cgatcgcaat cacgttctcc gggcggcatt cccgaaacac gccgccgatg 4372381 cacgcggcca tcgtggtgac ccccacgccg cccttgccgg acacaaccgt gatgacatat 4372441 tgccgacgga tatgccgacg gatacgtccc tgtaaattgc ggtagtgccg ttcccggggc 4372501 gattcacctg gattaatttt gtgaaatgaa acggaataga cgaatttccg ccaaccggtt 4372561 cccgggggaa tctttctagg ggcagccaga tcggtaatac gcatggtgtc ggataccgaa 4372621 tcccgaaagt gatgccgcac cgacggatcg ccgcgtccga tcgcgccgtc gtctaacata 4372681 ttcgggtcat tccacgggtt cgtcacgacc gcgatgctaa catgattcga gattccttgt 4372741 ttactgcgcg tgagcggctc tttgagtgca ttagtttgct attcgccaga caatgtcatt 4372801 cacaccacac gccggtatga gtaccattcg tcaccagcgg gcaagcggcg gatgagccgt 4372861 tgcaccgccc caccgatatc agagcgcgat ccaggcgaga ggacctggta acgtcgctgt 4372921 cccgaggtca cgctttcgac gcagatgcgc ccggcggcag tgtccacgat ggccaccgtt 4372981 gaatcgccga ccaggatgcg cgccgacttt tccggcccga cgcctgcctg cagcgccacc 4373041 agggtggcgt gcgcggagcg tgtgggatcg gcggccatgg ttaccatctg tagctggtcg 4373101 acgtccaggc gctgactgag cagatacgac cgcaaggtgc cggcgtcgcg tacggcgtgt 4373161 agtagttcgt cggcgtcgac ggtgaccggc cgtagcgggg cggcctcagc gacaccgcac 4373221 aaccgctcga cctgacccac cacgagttcg ccggccccgg cctcgtcact ggcggtgccg 4373281 gccgggtaga gccgcaccag gttgccgtgg cgttccaaga ccacccacca ggtggcaaat 4373341 cggcaaatgg ccgcgcgcgt cggttcgccg cccggcaccc cgatcgttac cagtaacccc 4373401 aggtcgcgtc gcaacagcac ggtcagccat tcacggacca tggggtcggc attaccggcc 4373461 tggtccagag cccccaccgc catcaactcg gcggccaccg ggtggcgaag tgcccgctcg 4373521 gcggtgtcca accgcggcaa caatggccgt aatcccagtt ccggacaggt ttgttccacc 4373581 ccggttaccg cttgtagtac ccacaggcca tcgaccgtcg tcgtcagcat gtcacgttca 4373641 tctcaaccag ctagcagcaa gcagaaggtg gggcagacgc gcggtccgcg catgtacccc 4373701 accttcactc ggcccacggc cggctttaga acaagcccgc gatggcctgg tcggttccga 4373761 tcgcgttgtc cagcacgtgg ccggtggtag tcccatgctg acccaccgtc tcaatgagcc 4373821 cctgcagccc cgacagcatc tgcgcctggg cgtcgaaaaa cccttgcgcg ccgtggcccg 4373881 cgaaaaactc ttgcagcgca tttgttttgc tggcggtgtc ttcgtaaatc atgtggagct 4373941 ggccggcgcg cgagcccacg tcggaagcga agtcggatac ggctcccggg ttatacgtga 4374001 tttgatctga catgtgaaat tcctttccga ggcgtgaaac gagttgggtc aggatccgtg 4374061 gctagcgccg aacagcgcct gaaacgctgt ctgcgagtcc gcctcgtgtc cctccatcag 4374121 ggctgcggcc tgcacgaggc cctcggccag gcgcgtgccc ccggtaagga ccttgttcaa 4374181 ttcattggtg atctcggtgg ctgtcatatg cgaagcaacg acgccggtac cagaccaggt 4374241 ggcggggttc atgacgtttt cctggttggc taggtagccc ttggcgattc ccatggcttg 4374301 ctccatattc gcctggatat cgttggcggt gctgcgcagc atctgcggtg ttacctgaat 4374361 tgtgtctgcc acgggccctt ctcctttact gccgttagcc gttccccctc aaatatcggg 4374421 gcatgacgcg aagtgtatgg ctgctctgcg gacctgtcga ttcaccctgt gcccgagcta 4374481 gatctaccgc cggtcatcga caacacgcac cgtcgcggcc tgctcggact tgccatggga 4374541 tccgcggtga ccccccgccg catgtccgac cggcatgcct ccgatagggg tgccacccac 4374601 cgtggtcgtc ggcgcgcgca cgacgtcggc gcccaaagcc ccgctgggcc gcaacccgac 4374661 cggcctgccg ctggtgcccg attcgaaagc gctgacggga cgtgtgaagc tcgtcgctgg 4374721 cataccgccg ccgcccaggg cggcgcctcc gccgcccgca ccgacctcgg tggccgcggc 4374781 cgatatccca ccggccgccg aagccgccga ggctcccggc gccgccccgc ccatgcccag 4374841 tgcgcccggg ttggcgaaca tgcccaccat cgattgcagc ggctgcatcg cgctcatcgg 4374901 tgcctgcatc agtcccgacg gcgcctgcag cgcctgcggg gccgcttgca tcaccgcctg 4374961 catcggctgc atgaacgtgc tgagctgatt gccgaagttc tcacccgccg acgttgactg 4375021 acccgcgccg gttgaccctg cctgcacgcc ctggtaggcg gaacgcatcc cgtcaccggc 4375081 cgcggcctcg gcggccgcct ggccaaccgc cgccgcagcc tgtgccggag cggccggaga 4375141 tgcacccatg gtcgcgaccg gcggcggaat tgccagactc tcggccagcg cggcgagaac 4375201 ccctccgtag gtggcgccca ccgcggcgtt attcggccac atcaccccga aatactcgac 4375261 gtccaaagag acgattcgag gcgttagtgt ccacagcacg ctggggttga tggcgttgtc 4375321 gacgccccat tcgtcgcggt tctccatgca ctcgggggca gggcgcatgg ccgcgttggc 4375381 ggtctcaaac gccgcgatcg cggtcgatac cacggccggc ttcacgtcga cccagccggc 4375441 cagtccgtgc agcgtggcgt tgagcatggt gacgttaagc gccgaggccg ccgacccgac 4375501 acccaaccag ctcgccgcgg tggcggcggt gttgatcgcc gacgcgacac ccgaggcgtg 4375561 gtggctggcg cccagtgtgg tccacgccgt ttgattggcc agatgggtgc ccacgccggt 4375621 gcccgccgtg agcagcaggt cgttggcctc aggtgtccgc gcagcccatc ctggatcggg 4375681 catgcctact tacccttgca gcgccgatgc cgccgcgcgc gccgcttctg tggtcacgta 4375741 cacgcccgac gcgaggccct gctaacccgc gaacaggccg cgctgactgg cgtgttcggc 4375801 aacgacaccc aggtagctgg caccgcacgc gttgagcgct gcggagaaca tcgcggagtc 4375861 gggatcacca cccatcggcg tggtgctaag cagggctggc gccgctccgg cggccgctgc 4375921 ctcggtttcc gcactgatcg ccgactcggc agctgccgac gccagcactg cttctggttg 4375981 cacagaccaa accatgttcg cccctccgat tgcttctgca atgcgtgatg gtcgctgagt 4376041 gtaatgcgag tcggccgatc gcgtatgcgc aaatcagtcg tctgcaccga tgccgacgtc 4376101 gaactggtcc acgccgcccc atgtgttcca gcatgtcagc ggtacgtgtg gggcggatgt 4376161 gaaatctgcg acgcctggag gatacgcgcg ggtgtcactg aacccgtaca gcgacatggt 4376221 caggcagaaa gtagccatgc gcgctatctt gcgatccggc cctactgctc gccgggcacc 4376281 gacggatacc ccaccaaaat cccctcgacg tcgccgtcgg caccgaccaa cagacctcgt 4376341 ccaggcggca acgtttgggc tcgcaccgat cgattgattc ggttttgcgg atcgttatcc 4376401 atatacaact gggccacttt cgccgaggtc tgggatttca cccaggggtc catcggcatc 4376461 gtggcccagt tcgcgctgtt gcgcgtgctg aatacgtgca aaccgacctg gcgggcgcgt 4376521 tccatcaact tccacagcgc cgcacccacc ggcggcttct gtgggtagct ctgagccggc 4376581 cgcaggtcct gcacgtcgtc gatgagcaca aagtgccgcg gtccttccca cggcttgagt 4376641 gcgcgcaact cctcctggct caaacccttg ggcggcaacc gcggcagcaa gatctgctgg 4376701 gccaactcgg tgatcacctc gtcgatttca tcttggtcgt aggcatacgc gcgcacatac 4376761 ccaggggcgt gcagatctcg cagaccgtgc ggagccgttt tagggtcgat cagcgtgagc 4376821 tgcgcctgct gcgggctgaa ccggttcatc accgcctcgc cgatggccac cagcgccgtg 4376881 gtcttgccgc agccttgccg acctaagatc atcaaccctg ggctctcgcg cagcttgatc 4376941 ggcaccggac ccagctcgtg gcgctctccg atcgcaaacg cgatcgacag atcgtcaccg 4377001 ccctggtgga cggcctcgtg ctcgacaatc gcggacagtt ccacccgctg tggcagccgc 4377061 tgcagacttg cgtgcttggt caccccggcc acgtcggcga ttcgcgcccc gacatcggtg 4377121 atgcccacca gctcgccggt accggggtcg gccagggccg gaacaccgat tcgcagctcg 4377181 tgcaggcttt ccgtcaaacc aaatcctggg cggttcaacg tccgccgcgc cgcctcccgc 4377241 gattcgatcg acaaatgccc catctggctc tcaccgggat cggccagccg caactgaatt 4377301 cgcgccgtga cattctgcag caggctctgc cgctgcccat gaatccagcc gccggcactg 4377361 cacatcaggt gcaccccgta ttcgggaccg cggctgctca acgagatgat gcggtccccc 4377421 aacagggtgt ccttggcgta caggtcgtcg tagtcgtcga gcaccacaaa gacatcgccg 4377481 aacgcgtcgg tgggatcggt gccacccacc ccgtcgccgc cgatcccgaa ccggcgctcg 4377541 cggaacccgt ccatgtcgat cttggctcgc cgaaacgcct cttcccgcgc atcgatcagc 4377601 gcatccatgg tgctcaagat gcgttcgatg ccctcggcat ccttgggcga cacgatatcg 4377661 gtaacgtgtg gaagcgaccc aatctgggcc atggtcgccc cgccgatgca aaagaacgtc 4377721 actcgctccg gggtgtacat cgttgccgcc gaacacatca gcgccatcaa ggttgtggtc 4377781 ttgccgcgct gcttggcgcc caccacgatg atgttgctgc gtagcgcgtc gacggcgtgt 4377841 accacttgct gggattcttc ggggatgtcc atcactccca ccgggaacat cagtcccggg 4377901 ttttgaccgt agtcgacatg ccagggtttg ccacgatacg cagccaccag cctatcgacc 4377961 ggctcggggt cttccagcgg cgccaaccac ggccggcgcg gcgatcggtg cggcacgttg 4378021 tatagcgact cccgcagcac gtcgacgatc ttcttcttct tgaaaccgtc gtcgtaatag 4378081 aggaattcgt cgggttccgc atcggcggcc gcggcggtcg ccaatgcctc ggcgtcggcg 4378141 gcatccagcg gttggtactg ccagtcgtac agccggggtt gggtcaacgt catgtcgatg 4378201 gttcgggcca cctctttctt cttcggcacc acaaacggcg cagagaggta aaagcagcgg 4378261 aacggttcca gatcccgcgg ccccaccttg agcagcgcga aaccgttctc cttcgacggc 4378321 agatggtagg cggcgtcgct gccgatcact tcgcggctgt catcaccgga ttcagcgcgc 4378381 agcgcaatcc gaaacgcgat gttggacttg accttttgca gcgacgacag gtccagccgt 4378441 tgaccgccta gcatgaagaa gacgttggcg ccgcgaccct cctgaccgat gtggatgatc 4378501 agatcaatcc actttttgtg gttggcgaac agctccaggt attcgtcgac gatcaccagc 4378561 agcaccggca ccggcggcag atcgcgtccg gcgaggcgaa tctcttcgta gtcgttggcg 4378621 tcgcgcgcac ctaccgattt gaacagttcg tagcgctgtt tgatctcgcc gtcgataact 4378681 ctgcgcatcc gctcggccag atgccgctcg tctttgccga ggttggatag cgcggccacc 4378741 acgtgcggga tgcccaggat gtcctgggca gccgattcga atttcatgtc gacgaagatg 4378801 acgttgaatg tttccggtga gtgcgtcagc gcgatcccat agaccaacga caagaagagc 4378861 tccgacttgc ccgagccgct ggttccgatg accactgagt gaaacccgaa gccgccaaag 4378921 tccttggcgc gcaggatgat gttctgcagc tcgccgttcg gtttggcgcc caccggaatc 4378981 tcacaccacc gatcgtcgcc gcgaccgcgc cgctcggccc acaaccgatc gacatccaat 4379041 tcccgggggt cgctaatgcc gagcgaacgc agcagctcgg ccgcgccgct ggtggaatcg 4379101 gtgacctcgc tgcgactggt cggtgaccac cgcgccatcg cccgcgcata tcggtaggcc 4379161 cggtggatgg acagctggtc ggcatgcgcg aagaacgtgc cgcgcgcccg caacagcggc 4379221 gccgggcgct ggtcgtcatc ggcgtctgcg ccatcgcgac cggccttgac cgcggttgcc 4379281 gccccatgtc gttgggccat ctcgaagacc tggtcctcgg cgaaccccac accggtgccc 4379341 acccgggacg cgatgcgcag caccgtaagc ccggccttgc cgacctgccc gaccacgctc 4379401 tcccacgcat ccgggctgcc ggtgttgtcg tcgacgatca ccaggtgcgg ccccaaatcc 4379461 acgccgacct gcccggtttc cagcgccgag cccatcgcgg ttgggctggc caccgtcggc 4379521 ggggtccatg cgcctcgctt gcccttcata tgcagctcgg ctcccagcgc cgcctccagt 4379581 tcctcgggtg tggcaaagat cagccgccgc cagccgcagg catcgaacag ctcgtcgtgc 4379641 aggttgtggg ggagccacac catccacgcc cacacctcgc ggttgcgcgt caccaccatc 4379701 agcttgacgt cacgcgggtt gtgaaacacc gccagcgagc acaacaccga ccgcatcagc 4379761 gaccgcaccc ggtccaggtc ctcgctcacg aagctgaagc ctggtgccga ccgtaggttc 4379821 accaccttgg cgatatcgcg aatcttgcgc tgctccaaga tgaaatcgcg cagcgcctgc 4379881 ccggtcacgg gctctagctc ctcatcggag gaaatgtccg gccaggtcac cgacaacacc 4379941 gaatctggtg cgtgctgcac acccgtgccc acccgcacct ctaagaagtc gacgtcgccg 4380001 cggccacgct cccacatccg cggaccgcca atgatggcgc ccagtccggg tgggtccgaa 4380061 tgcacggcgt tctgccattc acgttgcgca cacaccgccg tctggatttc gtcgcggttg 4380121 gtgtccaggt cacgaagata tcgacgacgc cccttctcca actcacccca ggtgatcttg 4380181 cgggctcgac cgaatcgtcc ggagaacgcc agcatgctga acgcgccgat gcccatcagc 4380241 gggaagaacc ccgtggccaa gctgcgcacg cccgacacgt acagcatgac gatggtgccg 4380301 atcagcgcca cgatcaacgc gggaacgccg atcatcaccc agatgttgcg cggctcgcgc 4380361 tccggcagag ctatcggcgg attcggagcc acccgaacgg gtttcggcgg gtcgatgttg 4380421 acgcggttga tgggaaacgc tttcttggac atctaggcgc ccgccttcgc cgttgtcgtc 4380481 acaatggcca cctggccgag ggtgggcaca gtatcgcgag ccaagagtgc cgcatcccgc 4380541 gacagagccg gtcccgcagc aaaagtccgc agcaacggcc acggcgcctg cacggccgca 4380601 cccggatcca ggcccagcgc ccgcagcgtc gcctcgtcgt tggcgatccc gaatcgcacc 4380661 ccattgccgg acacccagaa caacgattcg cgcgactcgg cggtgatcac accgctggtc 4380721 gatgtcacga agttggccgc gccgggcaac accagcacct gggtggccac caccgacgcc 4380781 ggggcgcggt catcgcgtac cagccgcacg atccggctgt ccatcgacgg gggcaccgga 4380841 agcccccgcc cgttgtagac cgcgacccgg gcctgtggat ccgtcgacgc cttctcccac 4380901 gacacgcagg tggtcggatc cgccgcggtg tcaacgaaat tcagccgccc ggccgggtag 4380961 tactccaccg gcagcgaggt cacctgcggt gtgtggacca gcacatcggg ggtcaccacc 4381021 cgcggcgccg ccgccccgta ggagttcgcg ctgcgcagca gatcggccac gaagctgctg 4381081 atcttttgca ccccgtcggg cagcagcaca tagaactggc tgcccccgcc ggcggtttgg 4381141 gcctgcaaca ccgatcccac ccgagcgccc ggcacccacg tcgacggggt gcccgcctcg 4381201 ggcaccgctg gcacccgcag cggctcggtc gcgggcagcc cgtcgaagag cgcccgtgag 4381261 atctgtattg gtgatgtcac gccggggtcg agccccaagc tcaaggtgac cgccctgttg 4381321 gtcggatcga tctgtgagcg tttgccaccc cagatcacgt aggtgctgcc gtcgaaagtc 4381381 accagcagcc cggcgtcgtc gcgcaggtgt gtggcgcggc caccgccggt gatcgggccc 4381441 gcgatcgagg tgaccaccgg cttgtccgcg ctgcgcgggc gtcccgccgt gtcgcacacc 4381501 gcccacgccg agaccgcgcc ccggttcacc ggcatggccg cgggtgcgcc cgggatgccg 4381561 accagcggcc cggtcggata cttggcgatc tcggcgggct tgacccatgt cggctgcccc 4381621 gccgtgccgg tggccagccg cgcggacgtc aagttcagcg ccggatacaa ccggccgtcg 4381681 atgcgcgcgt agagtgcccc ggagtcgcgg tccccgatga tcgccgagtc acccacaatg 4381741 ccggtgggct tgagcacgtt gagcagcatc atccatccgg cggcaatggc caccaacacc 4381801 atcgacaacg ccagcgcggc ggtctgcttg cggtcgtcgt gtttcatgcg caccgagaac 4381861 cgggtggtcg ccgcccgcag ccgccggttg tagaacagat gaccggaatt ttggtcgcgg 4381921 ttggacaaac tcagcggcat tctcaatacc ccctggggct gcggcgagga tcggcctgct 4381981 ggatcagatc ggccagattc gatgcgtcgg cggccacccc gtagtggcct ctcgcgtagt 4382041 tgatgaacgc gcatgcctgg gcgaccaaat cgtggatgtt ggtcgacgtg cccggctcgt 4382101 gataggccgc gaacgtcgga gcgatgaact gccagacgcc cctgctgggg gtgccccgcg 4382161 cggcgttgga atcccagtgg tttatggcgt tggcgttgta gttcgattcg cgacgggcca 4382221 ccaggtccat gccgcgagtc cagcgtgccc gcgcggctgg atcgtgaacg ccttggatat 4382281 ccaacgcttt ttggatcgcc gccaacactt gggcgcgtcc accgggtgtg gttacctggg 4382341 gtcgtcttgc cgcggcggtg cgcaggtagc gcagccggcg cagccgcagc cccaatagcc 4382401 gcgcccgcga cctacatcgc gcgatgtgcc gatgctgagc ccgcagccgg gcggccatcc 4382461 gggccatcgc ctcccgccgg cccagcggtg tgtcggtcaa ggccatggca tcggtcttgg 4382521 cggcttccag gagtgcacgc gtcgcggtcc tggcgtgcgc atgatcgatc tgggctgccg 4382581 ccatgatctg ggccagcgcc tcatcggtgt tggccaatcg acgcagcgct cttgcggcgc 4382641 cgcgccagcg gtacgccgct gcggtgggaa cggcgttggc cacccaggat atcgcgttcg 4382701 catactgctg gatctgcggc gcgtcgatgt cggcgccgga aacgccgccc gcgaacaggc 4382761 cgtggccccg ggacagcgcc gctatcgcct gcgtggtcag gggatcagtc aagggttcgc 4382821 cttcggtgcc aatcctgtgc catgtgctca catccgttgc cgggtgccat cacctcggcc 4382881 gtaccgacca gaccacgacc ttgtcgattg cggccggtgc ccctgcgcca cgaccatact 4382941 gccgttgacg cgacccgact actcagtcgt ggcgcgaagg ccgacctcgc cccagggcga 4383001 ctattcctta accttgtcgt cgttcggcac aacaaggatc cgtcgcgtcg acttggtgac 4383061 caccggcttg tcgtcggcag atttcaccgg aacactcggc ggtactgtta agcggccctt 4383121 gaccggttga ccattcggca cagcccgtca cccgcttctc gaccggcttg tccttattgg 4383181 ctccttccgc gcccgcaccc aacgcgcccg gcggcaccat cggcatgccg gtcatgccgg 4383241 ccggccccga cgcccgcggg gtgccactaa ccgggtccgg cgtcaccgac ttggccggcg 4383301 ccccggctgg agtcgtcggt ggcgacgacg tcggcacggg tgggggaccc agatagcccg 4383361 tcggggtggt gccaccaccc ccgccgccgg cgccgacgtc accagcgccc ggctcgccgc 4383421 cgaggccggg ctcaccttcg atgctgtcca ccagccgcgc cccgtccgcg acgtccagtc 4383481 cctccgcgcc ataggtctgt tgaagcgcac tcatcagcgg ctgcattgct ccctgccccg 4383541 cttgcatggc ctgctgggga agctgcgtga gtgggcccat gacgccgccg accgcgccgc 4383601 cgagcgcgcc ggtaatgccc gacaccgctt gttgcatcat ctgcgtcgcc ccctaggcct 4383661 ccgcctgagc gcccaccccc tggaattgtt gggccgcatc ggcctcattc gccgagaact 4383721 tttgcacggc atcggccgca tgcgcccgcc gatctaggtc ctccaggccg ctggactcga 4383781 cgtcgccggg cacaccggaa ttaccggcgg cgaaaagtgc cccattagcg atgtcggcag 4383841 gtgcgggtag gtctacgggg acagcgggaa acggcgccgg tccggacgct ggcggcgtcg 4383901 tcaaaacctg caataagatt tccggcgtca ccttgatcgg aactcccgga gccgggccgg 4383961 gtgccggatt ctgatctccg gtcatgatca cacctcgaac ttcatccgta gcgccccttc 4384021 ggacgctctt tcgtgtgctt gtcgacattg gccgcagcat cgccattttg tcacgccgcg 4384081 cgtcgaccgg tattcagctc acggtgtcgg gcctcgtatg gtgatcaggg agtttcgggc 4384141 agcggttcag gcagcgaacc ctcgtgagcc gccacgcctg gtggttacgg cataccaggc 4384201 caggtgatag ttggcgaggt agtcctgctc gtcgatcagt gcctcgatcg ccgccagcag 4384261 catccaatcc ccgacagcgg tgagctcgtg actgggatag gccttgagca ccgactcctt 4384321 gaccgcggtg atgcagccgt gcagcagctc ggcttcgttt tccagcacgc cggttttgcg 4384381 taccgccggc agcgcgatcg cctgcgcgat ccgcggcagg ctatcgcggc ggcgcacagc 4384441 ttcgaccaag gtcggcccga actcgtccac cttgggtatc gccgagcgcg ctgaccgatc 4384501 accggtcagc gcaggcgcat ctggccccgg ctcggcaacg taggtgttgg actcgtgggc 4384561 tgccacggcg acgacggcgc ccagcaagtc gatcacgtcg gcatcacggc gtcgcgcggt 4384621 tggctccagc agcgtcacgt tcgcgggcag ccggacgtgg ggcggaatcc acccgccggc 4384681 caaatcggtg accagcaggg tggtggtgcc gtcgtcgcgc agcccggccg cccatgagat 4384741 tcgcggctcc tggcgcgcca cggcatccac gattcgctgt aggcgttgct gctcagccgc 4384801 ccgagccgat accgcgcccg ccgtcgcgcc ggcggtggcc gacagtgccg aggcgccggc 4384861 cattgtcgac gagctcgcac cagcctgtcc agccacagct ttcgaggctg cgcgctccac 4384921 cggagaaacc agcgccccac ccgccgatgg ggccgatgac gccgagggcg ccaccggcgc 4384981 gccggatacg ggcgccgtag gaaccgaggg cacggcgggg gctgccacga cggggggccg 4385041 tagatcagag ccgtaagccg gcagcggtcc cgcggggaca gccgagccac caaccaccgg 4385101 cgcagcgggt gccgccaccg gcccggcggt caccacggtc ggcgcgaccg gcccggtagt 4385161 accggtcgac gccggtggag cgcccgaggt gttcgccggc gtgtcaactg gcccgtgtgt 4385221 ggcttcgatg cccgcagcca tggtcggcgc agacaccacg ggcggtgtcg ttatcggagg 4385281 agtggcctgc ggcgggggaa ccgaccccga ctgcatcgcc gtcattgccc cttccgacag 4385341 cgaatgcgcg ccagccgcgg ccggttgccc cgtcaccatc ccggtcgcaa acgattgccc 4385401 aatcgacgta ggcgatacgc cctgtccgag ggccgccggc gacagcgacc cgccgggcat 4385461 cgccaccggg ggccagtggc gatgactggc ggtgtagcag cggcgggtgt tgtgaccacc 4385521 ggggcaggtg gtacaggtcc tgcgctggcg tgccgactcg aagcgcctat tgctcgcggc 4385581 gccggttgtg agcaggcggc ctggacacca ctgccagaaa agccgccagc acccaccgat 4385641 tgtggcgatg ccaggtctcc ggcgccctca acactcccaa agctaccgcc acgcgcgccg 4385701 ggaccggtca gcgctgccag atcgttctcc ctgatcaggc ggggtggtgg tgcgtcgtcg 4385761 acgttgaaac cattggcccg tgcccacgtc cggggatcgt caccgatatc ttcggcttcg 4385821 aggatctcct gcatggccgt catgaccttg tcgacggcgt cccgggatgc gttcgccgca 4385881 tcggcattgc acctggtttg gatcgcctgg atttccgcca actgctccgg caacggcttt 4385941 ttcgacgcaa gaacgtcgtc gatttcctta tttccttccc ctgcaatgcc ggtcaaccgg 4386001 ctccgcaaat agtcgatggc gtcagcggcg gtattgaagg cgcccttctt tatttcgtac 4386061 ttctctgcct tagtgacctc ggatttcgct ccccgaaggt accggccaat caggtcttcg 4386121 gccgttctac cctgattccg caacaaaaga tcatgttggc tgatcagatt ccttgcgagc 4386181 tcttgctttt gcatggccca agtggcccag tgttgcgcgg cggcacgtag ggccgccgac 4386241 ggggccggcc accacggccc caccagcacc gcgctccacc taccgggcgg aagatcagcc 4386301 gccaccacat acctgcttca tagcagcatc tttcacgttg ccgtcgtcaa gtgcagcctg 4386361 ccactcagct tgagtgccac cactcgccat tacgatcgtt gtgcggtaag cggtcgctag 4386421 cgcgcgcggc gtcgcggtgt ttggcatcta gggcgggatc ggctgctgca ttgtcgagga 4386481 ttgccgcagc atttgtgagc gtgatgcggg ccagtgcctt atcgctcccg ttcgtgtcaa 4386541 ctggtacagc atgggccacc agcttgtatg tgtcgcacaa ttgccgctga gccgcggcag 4386601 tctgggcagc ggtgtaggta ggcaccgagg tcgtagccgg tgtagccgcg ggcctggcgt 4386661 ttgtcagggc cacgatcagc gcagcgaccg ccaccacagc agcgatcgcg gccaccacga 4386721 tggcgggcca actacgtgtg cgtggtatgg gcaagggtgc tggcgcggtc acgccgcaga 4386781 tggtatccgc tgaccgcctg tttgccgctt gcaccagacc acaccaaccc ggacacgccg 4386841 cggcggatgc gttacgtcac cggtgaccac gcggtgcagg tgttccaact gaccagcacc 4386901 gttatcgatc tcaccaccaa gcgcaaacac accacggtcg tgtacgcggc cacctccatg 4386961 tcgggaacgc cacccctgca caggtagcct gctggttgct gggtcattgc gccatgcctt 4387021 cgagaacaaa ttgcatcgga tgcgcgacgt cacctacgca aaacccctcc caagtccgcg 4387081 ctggtcaggg ccccaaggtt agggcacccg cgcaacagcg ccgccggccc gctccgtatc 4387141 gacggccacg acaacatcgc gtccacgcta cgccgcaatg acgcggccct cgccagcccg 4387201 ttaaaccatc acagtcctgt tgaaacgcca ttttgccgag gccttgggcg catcgccggt 4387261 aagcgctgct gaccgcccgg ctgaaatcga tgagcatcac tatcttatct actgttttag 4387321 tatgcggatt gtcgcgacaa tggcatcgca cgagaaacgt caacctaacc cttatagtcc 4387381 ttccaaaggg tgaataaggg cttaccttcg ctatccagga aagaatcctt tatcacgttg 4387441 acagatacgt ctaggtaatg tgacaattca accagtcgat cagccgcggt aattgcaaca 4387501 accgttccac tggaatcgat aagggtatcc cgttgccgac cggcaaacgt catcgtcccg 4387561 atgctatatt ccggcatcag ctcttcgggc tgaaaaggtg cccggatcgc cggcagttca 4387621 cgctcgcttc ggacggagcc tccgaaatac ccgtaaagat atttttcgat cacggacatt 4387681 gatgcagcgg cgaattcata gccttcccgg ctcatgcgat cggatgacgt aataacatac 4387741 catccggcga gccggtcgat aaagtagcgg acttcaccgc ccttgttcca aaggatagtc 4387801 cggccgtcat tcgtttccga cccttggatc atgttcatgc cagataagcg gatccagtcc 4387861 tgcaaatccg ttgacaggtc cacacctatt gtcactgtcg caacaccccg cgccttatta 4387921 actcttccac tttgcgcatc tcgttctgat gatcgaatat ccgcacttgg atggatccgc 4387981 ccggctggcc gcaccccggc gcgacctcag atacttcgat gaaccatccc tcaggcaacc 4388041 aatcaatggt atacgcgtgg taggggtcgc gtaacgacgt cacgtgcagg gcacgttgtt 4388101 cccatgatgc cgggcgccca tgttccatga tcgccaggta cttgccctga tcgccgccta 4388161 tacgatctag ctgggggccg tagtcactaa gaaatttttc gagattagtg taggcgatcc 4388221 ttgtccctgg aaccgcacca ttgttaggcg gaaaattaga gtactgctgg ccccatgggc 4388281 ctacactatt aaatcgctct tgataccgtt cttgggtata gggctgtccc tgaggatcgc 4388341 ggccgaatgg ggcgttgggg tcctccatga gctgggccac aaccgggttt atccgactgc 4388401 gatcggccgg attgtctgta aagtcccagt ggcgcgataa tggctcgcca tactgcgggt 4388461 caaccgcttc gtcggataac cgatgccaac cctctccagc tggttcgttc gaatgcattg 4388521 caagctgctg ctcgcgatgc ggcgcatgca gcccaggtgg ctcgctaccg ctaccgtggc 4388581 ctgacccgtg agacgcaccg tcgtgggtcg gctcggatcc gtgtgcgctc agtgatcggc 4388641 cgtgaggccc cgattcggtt gagtgaacgc ctccgggcgt ggaatgcggc gccgccgcgg 4388701 gtgctgctgg tgtggtcgcc cactggggtt ggtgcgcggt agcgggcgct gactcgacag 4388761 gaggtccgcc aagcaacgtc gtcgccggtg gtgcttgcgc cgggacatgt tcacccggtt 4388821 gcggcaggcc atgcggcaca tgtgtgccgg gcgtggtggc tgcggatacc cggggctggc 4388881 ctgccgacgc cgacgacggc gccaccggtt cagccggtct gtcgacgggc ggcggtttgg 4388941 attcggtggg gctgtgcggc agtggaccgt tggcgggcac gggcgccggt ttcgccgcgg 4389001 gcgcgggtgc cgggtggccc gattctggtg gttcgatccg tggtggttgc ggtcctggcc 4389061 gcggcggcgt tgctgggggc tcaaggtgcg gtgtcgtcgg ctcaagccgc tccttgaggc 4389121 ctcgcacgcc cgcgagaatg tcgcggccct tgctgccaag tttcgacagc ggcccgcccg 4389181 gcaaagctag cgtcgcggcg tcgaatacgg tcttgcctag cgcctcatta gggttggtcg 4389241 tccactcatc ccaatggatg aggcttttgc cgaactgctt ccacgactcc acaacgccgg 4389301 gagcgttctc gccgcccagg cccgccagcg gcgccatccc agtcagcatc tcctcccagg 4389361 agcgatacca cccgaacggg tctatcgagg cgcgcagtgg ccctaggtcc caggagtcct 4389421 tggccatccc gaaggcctcc tcgccgaagc ctttgagctg ctgcccggtg ccatcgatga 4389481 ccacacccac cgggttgctg tgcaagaacc gatcccattg tttgcctgcg tggtctgcca 4389541 tcgcggtgat caccgcctcg gcgtgcgaca ccaccgcggt gatctccgca gccaacgcgt 4389601 ccacttcccc gctgaactgg tcgaccacca ccgcgatgtc atgggcgatg cgctggatct 4389661 cgtcttcgtc ctggtcggtc agaaactccc acacctcttt gatcccggtc agcggatcgc 4389721 agatgcgggc caacaaatcc aggaccgccg catgcaccgc gtcgatgcgg gcggcatagg 4389781 cgtctagctg ggccgccagc tggtggcatt ggcccacgac agcggtggtg ctggcgtacg 4389841 cgtcagcaaa cgccgactcg atcagccccg cctccgggag ctgctgggcg cgaataacgc 4389901 ccatcggccc cgccgtcgac tgaatctcag tcagcgcgaa ctgcgtgccc gcgctgcgcc 4389961 acgccacagc cgccgcacgt agctttgtcg aatccccgtt cggccagatc atcccgatat 4390021 acggggccac ccacccccag cccttcgggg cgccaccgcc gccaccgacc gccgacggcg 4390081 gcgcacccac gccgacacag ccgctcggcg gcggcgccgg caacggcgcc gcccgcccag 4390141 cgacatccga catcgcctcg gccaacgagt agttgtgcgc gctcatgcgc accccatcgc 4390201 cgaggttgca caatccgttg cgcgccaccg acatcgcctg caccagcgcg gccgccgaac 4390261 cgtcatagga gcgcccgaac accgccccag ccggatcatc accggccatc cccgcacacc 4390321 cggccagcgc cgcggtcagc gacgagatca ccgcacccaa acccgcaccc gcagccacca 4390381 ccgcgccgcc cgcgctatca agggccgcgg gatcgaccgc caacggcgcc atcagctcac 4390441 gaccacatac ccaaattcgt ggccatcgcg ccggtgtagt tggcgtgcgc gctctgcccc 4390501 gcggccgtga gctgggccaa cgcctggcgc atcatcgcct caccggcagc ccaatgtcgt 4390561 tgcgcctcag catgagccgc cgcgccctcc cccgtccacg tcacatgcag ccgggtaacc 4390621 aaggactcaa tctcggcgac cagctcctcg acgtggcgac cgaattcggc catccgcgcc 4390681 accgcatcag ccaacacggt cggatccacc cgaaacggct cagccaccgc ccacctcacg 4390741 aagcacctgc gccgacgcgg tctcgttgtg ttgataaccc gcaccggcgt gagctatcgc 4390801 cgccgccagc atcgacaatc ccagctgcac ctcaccggcc ccgcgatgcc atagctccca 4390861 cgccgagcca tacgcactgc ccgacgcccc gcgccacccg cccaacatct gcccgacctg 4390921 agcgtccagc tcggccagtt gaaccgcgag atgctcggcc gctccatcca acgacgcggc 4390981 gaaaccctgc atcaccgcag gctctacgcg cagcgtgtcg tcggcaccca tggccgcaac 4391041 ctaacaatgc ccaggcaccg ccacaattca gccgcccggg cgcacccgcc gcagccctaa 4391101 aggctgctgg cgccgtcggc ggtgccgtcg ccgtcggtgt cggtcagccg tacgtcccag 4391161 cggccgtcgc catcggtgtc gacgtatccg gtcacacgct gctcaccagc acacagcacc 4391221 cgatcggcca gcccgtcacc gtcggtatcg agtagccggt cgtctaaacc accgaacccg 4391281 tcgaagtcaa ccagtggacc accggtgtgc tcgacgccgt cgagcccata ccagcgcagt 4391341 tgtccgccgc ggtcgacggc gaccgcccag gtccccgatc cgtcgtcgat gaagtagctt 4391401 tccggggtgc cgtcgttgtc gacgtcgaat acggcgtggt cggcaacgtc gtcgccgtcg 4391461 aagtcggcca gcgcgtcatc gcgcagaccg tcgccgtcga gatccaggcc aatcgcgtcc 4391521 agccggccgt caccgtcgag gtcgacgtcg aacgggcggt tccagatccc ggcgctgccg 4391581 tcgtcgccgg ctatgcagta ctccacaacc gttctgacgc gactcccaag ctagcggttc 4391641 ccccgtgatt tccaccagga cagcagctcg gttgtcgcct cctcggtgga caacgggccg 4391701 cgctctagcc gcagctcctt caagtagcgc cacgcctcgc cgacttgcgg gcccgccgga 4391761 atgtcgagca ccgccatgat ctggttgccg tccaggtcgg ggcgcacccg atccagatcc 4391821 tcctgggcgg ccagctccgc gatccgctct tccagccggt cgtaactggc ctgcaaccgc 4391881 gcggcccggc gcttgttgcg ggtcgtgcag tcggcgcgca ccagcttgtg cagccgtggc 4391941 agtagggccc cggcgtcggt gacatagcgg cgcaccgcag agtcggtcca tttcccatcg 4392001 ccgtagccgt gaaaccgcag atgcaggtag accagctgcg agatgtcgtc gatcatctgc 4392061 ttggaatact tcagcgcccg catccgcttg cgcaccatct tggcgccgac cacttcgtgg 4392121 tgatggaagc tcaccccacc gtcgggttcg tgacggcggg tggcgggctt gccgatgtcg 4392181 tgcagcagcg ccgcccagcg caacaccaga tccgggccgt cgtcctccag cgcgatcgcc 4392241 tgccgcagca cggtcaagga atgctgatag acgtccttgt gctggtgatg ttcgtcgatc 4392301 gccatccgca tcccaccgat ttcaggcaag accacagcac ccataccgct ctgcaccatc 4392361 aggtcgatac ccgcggccgg atcctcaccg accagcagct tgtccagctc ggcggccacc 4392421 cgttcggcgc tgattcgggc caactgcggc gccatctctt cgatcgccgc gcgcacccgc 4392481 ggcgccaccg cgaatccaag ttgcgagacg aaccgcgcgg cgcgcagcat ccgcaacgga 4392541 tcgtcgccaa aggaccccga cggcgccgcc ggggtgtcta acaccttggc ccgcagcgcc 4392601 gccaagccac caagcggatc caggaattcg cccggcccag tggcggtgac gcgcacagcc 4392661 attgcgttcg tggtgaagtc gcggcggacc agatcgccct cgaggcaatc gccgaaacgt 4392721 acctctggat gacgcgaaac ccggtcgtag ctgtcggcac ggaatgtggt gatctccatg 4392781 cggtggtcgc tcttacccac gccgacggtg ccgaattcga ttccggtatc ccacaccgca 4392841 tcggcccacg gccgcacgat ctcctgcacc cgctcgggac gggcgtcggt ggtgaagtcc 4392901 aggtcggggc tcaaccggcc caacagtgca tctcgcaccg aaccgccgac cagatacaac 4392961 tcgtgtcccg cggcggcgaa caccgacccg agttcccgca ataaggcagc atgcctgttc 4393021 aaggcaaccg cagcggcggt tagcagatcg gcttcctgga cggcttccgg cacgttcgat 4393081 cagcctaatg gcagtcgaag tgggccggga cggtcggtgg aggaaccggc aaccctcgtt 4393141 gccgcacccg tcgcattggc cggtgtcggg acgaggtatc gtcgtgccca tctccgcgcg 4393201 acaaacagcc ggcgacaata ttaagaatcc ttgggtgcgg tcgcgtcttg tcgctcgaag 4393261 gtgggcaaat cgtgcgcccc cgacacagcg acttctgtga tagatgtgac tggcgcgact 4393321 caattggtca gcgcgggtcg cctgcaccgc cccgctccct cgcccaacga ataagtcctg 4393381 gccgacgatg ggcgctcaga cggcgagtac atcgggaaca cccgcccgta ccagctacta 4393441 tcgctggggt gtccgacggc gaacaagcca aatcacgtcg acgccggggg cggcgccgcg 4393501 ggcggcgcgc tgcggctaca gccgagaatc acatggacgc ccaaccggcc ggcgacgcca 4393561 ccccgacccc ggcaacggcg aagcggtccc ggtcccgctc acctcgtcgc gggtcgactc 4393621 ggatgcgcac cgtgcacgaa acatcggctg gagggttggt cattgacggt atcgacggtc 4393681 cacgagacgc gcaggtcgcg gctctgatcg gccgcgtcga ccggcgcggc cggctgctgt 4393741 ggtcgctacc caaggggcac atcgagttgg gcgagaccgc cgagcagacc gccatccgcg 4393801 aggtcgccga ggagaccggc atccgcggca gtgtgctcgc cgcgctgggg cgcatcgact 4393861 actggttcgt caccgacggc cggcgggtgc acaagaccgt ccaccattat ttgatgcggt 4393921 ttttaggcgg agagctgtcc gacgaagacc tcgaggtagc cgaggtagcc tgggtgccga 4393981 tccgggaact gccgtctcga ctggcctacg ccgacgaacg tcgactagcc gaggtggccg 4394041 acgaactgat cgacaagctg cagagcgacg gccccgccgc gcttccgccg ctaccaccca 4394101 gctcgcctcg tcgacggccg caaacgcatt cacgcgctcg tcatgccgat gactcagcac 4394161 cgggtcagca caacggtccc gggccggggc cgtgaccgca ctgcaactcg gctgggccgc 4394221 tttggcgcgc gtcacctcag cgatcggcgt cgtggccggc ctcgggatgg cgctcacggt 4394281 accgtcggcg gcaccgcacg cgctcgcagg cgagcccagc ccgacgcctt ttgtccaggt 4394341 ccgcatcgat caggtgaccc cggacgtggt gaccacttcc agcgaacccc atgtcaccgt 4394401 cagcggaacg gtgaccaata ccggtgaccg cccagtccgc gatgtgatgg tccggcttga 4394461 gcacgccgcc gcggtcacgt cgtcaacggc gttacgcacc tcgctcgacg gcggcaccga 4394521 ccagtaccag ccggccgcgg acttcctcac ggtcgccccc gaactagacc gcgggcaaga 4394581 ggccggcttt accctctcgg ccccgctgcg ctcgctgacc aggccgtcgt tggccgtcaa 4394641 ccagcccggg atctacccgg tcctggtcaa cgtcaatggg acacccgact acggtgcgcc 4394701 tgcgcggctc gacaatgcgc ggttcctgtt gcccgtggtc ggagtgccac ccgaccaggc 4394761 caccgacttc ggctccgctg ttgcaccaga aacgacggcg ccggtctgga tcaccatgct 4394821 gtggccgctg gccgaccggc cccggttggc ccccggggca cccggtggca ccgttcccgt 4394881 ccggctggtc gacgacgacc tggcaaactc gctggccaac ggcggccggc tggacatcct 4394941 cctgtcggcg gccgagttcg ccaccaaccg ggaagtcgac cccgacggcg ccgtcggccg 4395001 agcgctgtgc ctggccatcg acccagatct actcatcacc gtcaatgcga tgaccggcgg 4395061 ctacgtcgtg tccgactcgc ccgacggggc cgctcaacta ccgggcaccc cgacccaccc 4395121 gggcaccggc caggccgccg catccagctg gctggatcga ttgcggacgc tagtccaccg 4395181 gacatgcgtg acgccgctgc cttttgccca agccgacctg gatgctttgc agcgggttaa 4395241 tgatccgagg ctgagcgcga tcgcaaccat cagccccgcc gacatcgtcg accgcatcct 4395301 ggatgtcagc tccacccgcg gcgcaaccgt gctgcccgac ggcccgttga ccggccgggc 4395361 gatcaacttg ctcagcaccc acggcaacac ggttgccgtc gcggccgccg attttagccc 4395421 cgaggaacag cagggttcgt cccagatcgg ctccgcgctc ttacccgcta ccgcgccccg 4395481 gcggttgtcc ccgcgggtgg tagcggcgcc gtttgatccc gcggtcgggg ccgcgctggc 4395541 cgccgcggga acaaacccga ccgttcctac ctatctagat ccctcgttgt tcgttcggat 4395601 cgcgcatgaa tcgatcaccg cgcgccgcca ggacgccttg ggcgcaatgc tgtggcgcag 4395661 cttggagccg aatgccgcgc cccgtaccca aatcctggtg ccgccggcgt cgtggagcct 4395721 ggccagcgac gacgcgcagg tcatcctgac cgcgctggcc accgccatcc ggtctggcct 4395781 ggccgtgccg cgaccactac cggcggtgat cgctgacgcc gcggcccgca ccgagccacc 4395841 ggaacccccg ggcgcttaca gcgccgctcg cggccggttc aatgacgaca tcaccacgca 4395901 gatcggcggg caggttgccc ggctatggaa gctgacctcg gcgttgacca tcgatgaccg 4395961 caccgggctg accggcgtgc agtacaccgc accactacgc gaggacatgt tgcgcgcgct 4396021 gagccaatcg ctaccacccg atacccgcaa cgggctggcc cagcagcggc tggccgtcgt 4396081 tggaaagacg atcgacgatc ttttcggcgc ggtgaccatc gtcaacccgg gcggctccta 4396141 cactctggcc accgagcaca gtccgctgcc gttggcgctg cataatggcc tcgccgtgcc 4396201 aatccgggtc cggctacagg tcgatgctcc gcccgggatg acggtggccg atgtcggtca 4396261 gatcgagcta ccgcccgggt acctgccgct acgagtacca atcgaggtga acttcacaca 4396321 gcgggttgcc gtcgacgtgt cgctgcggac ccccgacggc gtcgcgctgg gtgaaccggt 4396381 gcggttgtcg gtgcactcca acgcctacgg caaggtgttg ttcgcgatca cgctatccgc 4396441 tgcggccgtg ctggtaacgc tggcgggccg gcgcctttgg caccggttcc gtggccagcc 4396501 tgatcgcgcc gacctggatc gccccgacct gcctaccggc aaacacgccc cgcagcgccg 4396561 tgccgtagcc agtcgggatg acgaaaagca ccgggtatga gaccctcccc tggagaggtg 4396621 cccacggcat cgcagaggca gcccgagctg tccgacgcgg cgctggtatc gcactcctgg 4396681 gcaatggcat tcgcgacgct gatcagccgg atcaccggct ttgcccggat cgtgctgctg 4396741 gccgcgatct taggtgcggc gctggccagc tcgttctcgg tggccaacca gctgccgaac 4396801 ctggtcgccg cactcgtgct ggaggccacc ttcaccgcca tcttcgtacc ggtgctggcc 4396861 cgcgccgagc aggacgaccc ggacggcggc gcggcgttcg tgcgccgttt ggtcacgttg 4396921 gcaaccaccc tgctgctggg cgccaccacg ctgtcggtgc tggccgcgcc actgcttgtg 4396981 cggttgatgc tgggcacaaa cccacaggtt aacgagccgc tgaccacggc gttcgcttac 4397041 ctgctgctac cgcaagtcct cgtctacggc ctctcgtcgg tattcatggc gatcctgaac 4397101 acccgcaatg tgttcgggcc gccggcctgg gcgcccgtcg tcaacaatgt cgtcgccatc 4397161 gcgaccctag cggtgtatct ggcggtcccc ggcgagcttt cagtcgatcc ggttcggatg 4397221 ggcaacgcca agctgctggt gctcggcatc ggcaccaccg caggcgtgtt tgcacagacc 4397281 gcggtgctgc tggtggccat ccggcgcgag cacatcagcc tgcgccccct gtggggaatc 4397341 gatcagcggc tcaagcgctt tggcgcgatg gccgccgcga tggtgctcta tgtgctgatc 4397401 agccagctcg gcctggtggt cggtaaccgg atcgccagca cggcagcggc ttccggcccc 4397461 gcgatctaca actacacctg gctagtgctg atgttgccat tcggcatgat cggcgtgacg 4397521 gtgctgaccg tggtgatgcc gcggctgagc cgcaatgccg cggccgacga taccccggcc 4397581 gtgctcgccg acctgtcgct agccaccagg ctgaccatga tcacgctgat cccaacggtg 4397641 gcgttcatga cggtcggcgg tccggcgatc ggtagcgcgc tttttgcata cggcaacttc 4397701 ggcgacgttg atgccgggta cctgggggcg gcgatcgcat tgtcggcgtt cacgttgatc 4397761 ccctatgcgt tagtgctgtt gcagctacgc gtgttctacg cccgcgagca gccgtggaca 4397821 ccaatcacga tcatcgtggt catcaccggc gtcaagatcc tcggctcgct gctggcgccg 4397881 catattaccg gtgatcccca gctggtcgcg gcctatctcg ggctggctaa cggactcgga 4397941 tttctcgccg gcacgatcgt cggctactac atactgcgtc gggccctgcg gcccgacggc 4398001 ggccagctga tcggcgtcgg cgaggcgcga accgtcctgg tgaccgtcgc cgcgtcgttg 4398061 cttgccggac tgctggcaca cgtggccgat cggttactag ggctaagcga gctgacggcc 4398121 cacgcgggca gcgtcggttc gctgctgcgg ctgtcggtgc tggctctcat catgctgcca 4398181 attctggctg cggtcaccct ctgcgcacgg gtgcccgagg cgcgggcggc gctggatgcc 4398241 gtgcgagccc gaatcaggag ccggcgcttg aagaccgggc ctcagaccca gaatgtcttg 4398301 gatcaatcgt ctcgccccgg accggtcacg taccctgagc ggaggcgttt ggccccgccg 4398361 cgggggaaaa gtgtggtcca cgagccgatc cggcgcaggc ctccggagca ggtagccaga 4398421 gccgggagag cgaaaggacc ggaggtgatc gaccgcccat cggagaacgc ctcgtttggt 4398481 gccgcgtcgg gtgccgagct gccgcggccc gtcgccgacg agcttcagct cgacgcgcca 4398541 gccggccgtg accccggccc cgtttcccgg ccgcacccat ccgacctgca aaacggcgat 4398601 ctgcccgccg atgcggcccg tgggccgatt gcgttcgacg cgctccgcga accggaccga 4398661 gaatcgtcgg cccccccaga tgatgtgcag ctggttcccg gcgcccgcat cgctaacggc 4398721 cgctaccgcc tgctgatctt ccacgggggt gtaccacccc tgcagttctg gcaggcgctt 4398781 gacacagcgc tggaccgcca ggtggcgctg accttcgtcg acccgcaggg cgtcctgccc 4398841 gacgacgtcc tccaggagac cttgtcccgt acgttgcggc tcagccggat cgacaagccc 4398901 ggtgtcgccc gagtgcttga cgtcgtgcac acccgggccg gtggtctggt agtcgcggag 4398961 tggatccgcg gcggttcgtt acaggaagtc gccgacacct caccgtcgcc ggttggcgcc 4399021 atccgggcga tgcagtccct ggccgcggcc gcagatgctg cccaccgcgc cggtgttgcg 4399081 ctgtcgatcg accatcccag ccgggtgcgc gtgagcatcg acggcgacgt cgtgctggcc 4399141 tacccggcga ccatgccgga cgccaacccg caagacgaca tccgcggcat cggcgcctcc 4399201 ctgtacgccc tgctggtcaa ccggtggccg ctgccggagg ccggcgtgcg cagcgggttg 4399261 gcacccgccg agcgcgacac cgctggccag cccatcgaac ccgccgacat cgaccgtgac 4399321 atccccttcc agatttccgc ggtggcggcc cggtcggttc aaggagacgg cgggatacgc 4399381 agcgcgtcaa cgctgttgaa tctaatgcag caggcgaccg cggtggccga tcgcaccgag 4399441 gtgctgggac cgatcgacga agcaccggtc tccgcggccc cgcgcacatc cgcgcccaac 4399501 agcgaaacct acacccgccg ccgtcgcaac ctgctgatcg gcatcggcgc gggtgctgcc 4399561 gtcctcatgg tggccctgct ggtcttggct tcggtgttga gccggatatt cggcgatgtc 4399621 agcggcggcc tcaacaagga cgaactgggc ctcaacgcac ccaccgcgtc gacctcggcg 4399681 gccagttcgg cgccgcccgg cagcgtcgtc aaacccacca aggtcacggt cttctccccc 4399741 gacggcggcg ccgacaaccc cggggaggct gatttggcca tcgacggcaa tccggccact 4399801 tcctggaaga ccgacatcta taccgacccc gtcccgttcc ctagcttcaa gaacggagtc 4399861 ggtttgatgt tgcagctgcc ccaggccacg gtggtcggca ccgtcgccat cgacgtggcc 4399921 agcaccggca ccaaggtgga gatccgctcg gcatccacgc cgacgccggc aacgctggag 4399981 gataccgccg tgttgacttc ggccaccgcg ctgcggcccg gccacaacac catctcggtc 4400041 gaggcggccg cgcccacctc gaatctgctg gtgtggatct ctaccttggg aaccaccgac 4400101 ggaaagagtc aagccgacat ctcggagatc acgatttacg ccgcgtcctg accgggccgg 4400161 gcacggccag ccagggtgaa gtgctatgcc gccaccgatt ggttactgtc cggccgtggg 4400221 tttcgggggc cgtcacgagc gcagcgacgc cgagctgctg gccgcccatg tcgccggcga 4400281 ccggtacgcc ttcgatcagt tgttccgccg tcatcaccgc cagctacacc ggctcgcgcg 4400341 gctcaccagc cggacctccg aggacgccga cgatgcgctg caagacgcga tgctgtcagc 4400401 gcaccgcggc gccggctcgt tccggtacga tgccgccgtc agcagttggt tgcaccgcat 4400461 cgtggtcaac gcttgcctgg accggctgcg tcgggccaaa gcccatccga ccgcccctct 4400521 agaagatgtc tatccggtcg cggaccggac cgcgcaggtc gagaccgcga tcgcggtgca 4400581 gcgggcactg atgcggctgc ccgtcgagca gcgggccgcg gtggtcgccg tggacatgca 4400641 gggctattcg atcgccgaca cccgcccgga tgctgggcgt ggccgagggc accgtcaaga 4400701 gccgctgcgc ccgggcgcgg gcccgcctag cgcggctgct gggctatctc aacaccgggg 4400761 tgaacatccg gcgctgaccc cgttgccggt ccgtcgtagc atcgatccac gggctcgccg 4400821 ctaccccaca tctggctatt gccaccgggc atgacggaca ctggggccga tgagtgcagc 4400881 cgacaaggat ccagacaaac atagcgccga tgcggacccg ccgctgaccg ttgagctgct 4400941 ggccgacctg caagcaggtc tgctggacga cgcaaccgcc gcccgcatcc gcagccgggt 4401001 ccgctcagac ccgcaggctc agcaaatcct gcgcgcgttg aaccgggtac gccgcgatgt 4401061 cgccgcgatg ggtgccgacc ccgcttgggg gccagctgct cgcccagcgg tcgtcgacag 4401121 catttcggcg gccttacggt cggcgcgccc gaacagctca cccggcgccg ctcacgccgc 4401181 ccgtccgcac gtccaccccg tccgaatgat cgccggcgcg gccggattgt gcgccgtggc 4401241 cacagcgatc ggtgtcggcg ccgtggtcga tgcaccgcca cccgcaccga gtgcaccgac 4401301 aaccgcgcag cacatcacgg tgtcaaaacc tgccccggtg attccgctgt ctcggccgca 4401361 ggttctcgac ctgcttcacc acaccccgga ctatggccca cccggaggcc cgctgggcga 4401421 tccgtcccgg cgtacgtcct gcctgagcgg cctcggctat ccggcgtcca cgccggtgct 4401481 gggcgcgcag ccgatcgata tcgacgctcg gcccgccgta ctgctggtga tacccgcgga 4401541 cacgcccgac aaactggccg tttttgcggt cgcgccgcac tgcagcgccg ccgataccgg 4401601 gttgttggct agcaccgtgg tcccccgcgc atgatgggtc tgggtgctgt cgctcgcctg 4401661 cgggaacagc agtgcctacg ctggcgttcg ttgtctcaag atctgccctc gcactcgaaa 4401721 ggctcgcatg accgccccgc ctgtccatga ccgcgcacac caccccgttc gcgacgtgat 4401781 cgttatcggc tccggtcccg cggggtacac tgcggcgctc tacgccgccc gtgcccagct 4401841 ggcgccgctg gtcttcgagg gcacgtcttt cggcggcgcg ctgatgacca ccaccgacgt 4401901 ggagaactac ccgggatttc gcaacggcat caccggtcca gagttgatgg atgagatgcg 4401961 ggaacaggcg ctgcgattcg gcgcggacct gcgtatggaa gacgtcgagt cggtatcact 4402021 tcacgggccg ctgaaatcgg tcgtcaccgc cgacggacag acccaccggg cccgagccgt 4402081 gatcctggca atgggcgcag cggcacgcta tctgcaggtg cccggcgaac aggaattgct 4402141 cgggcgcggg gtgagctcgt gcgccacctg cgacggattc ttcttccgcg atcaggacat 4402201 cgccgtcatc ggcggcggtg actcggcaat ggaggaagct accttcctga cccgattcgc 4402261 tcgcagtgtg acgctggtgc atcgccgcga cgagttccgg gcttccaaaa tcatgctcga 4402321 tcgcgcccgc aacaacgaca agatacggtt cctcaccaac cacaccgtgg tcgcggtgga 4402381 cggggacacc acagtgaccg gcttgcgggt acgcgacacc aacaccggtg ccgaaaccac 4402441 cctgccggta accggtgttt tcgtcgcgat cggccacgag ccgcggtcgg gcttggtgcg 4402501 cgaggccatc gacgtcgacc cggacggcta cgtgttggtg caggggcgta ccaccagcac 4402561 ctcactgccg ggcgtgttcg ctgccggcga cctggtggat cgcacctatc gccaggcggt 4402621 taccgcagcg ggcagtggct gcgccgcggc tatcgacgcc gagcgctggc tcgccgagca 4402681 cgcagcaacc ggagaagctg acagtaccga cgcattgata ggagcacaac gatgaccgat 4402741 tccgagaagt ccgccaccat caaagttacc gacgcatcct ttgccaccga cgtgctatcc 4402801 agcaacaagc ctgtgctggt tgacttttgg gcgacatggt gtggaccttg caagatggta 4402861 gcgcccgttc tcgaggaaat cgccaccgag cgcgcaacag acctcaccgt cgccaagctc 4402921 gacgtggaca ccaacccgga gaccgcccgc aacttccagg tcgtctcgat ccctaccctg 4402981 atcttgttca aggacggcca gccggtgaaa cgaatcgttg gcgccaaggg taaggctgcg 4403041 ttgctgcgcg agctctcaga cgtggttccc aacctcaact agcccccgcg gttagcctgg 4403101 ggttttcccg aaatcggcaa ggatctgcga caataccggt tggctggtcc gcattgtcaa 4403161 cgatgtgagc taatcccgga gggcccttgg tatgccgagt ccgcgccgcg aagacggcga 4403221 tgcgctgcgc tgtggcgacc gcagtgcggc cgtcaccgag atccgggctg cgctgaccgc 4403281 gttagggatg ctggatcatc aggaagaaga cctgacgacg ggccgtaacg tcgcccttga 4403341 gttgttcgac gcgcagctcg accaggcggt ccgtgccttc caacagcatc gcggcctgct 4403401 ggtggacggc atcgtcggtg aggccaccta ccgcgcgttg aaagaagcct cctaccggct 4403461 cggggcccgc acgctgtacc accaattcgg cgccccgctc tacggggacg acgtcgctac 4403521 actgcaggcc cggctgcagg atcttggttt ctacaccggg ctggtcgacg gtcatttcgg 4403581 gttgcagacc cacaatgcgt tgatgtccta tcagcgtgag tacggacttg ccgcagacgg 4403641 tatctgcggc ccagaaacgt tgcgctcctt gtactttcta agttcgcgag tcagcggtgg 4403701 ctcgccacat gcgattcgcg aagaagagct ggtccgcagc tcggggccga agctgtctgg 4403761 caaacggatc atcattgatc ccggtcgcgg cggcgtggac cacggactta tcgcgcaagg 4403821 tccggctggg cccatcagcg aagcagactt gttgtgggac ttggcaagtc ggctcgaagg 4403881 acggatggca gctatcggta tggagaccca cctgtcccgt ccgaccaacc gtagtccgtc 4403941 cgacgcagag cgtgccgcca ccgccaacgc cgttggcgca gacctgatga tcagcctgcg 4404001 ctgcgagacc cagaccagtc tcgcggccaa cggcgtggct tcctttcact tcggcaactc 4404061 gcacggctcg gtgtctacca tcggccgcaa tcttgccgat ttcattcaac gagaagtggt 4404121 ggcgcgcacc ggtttacggg attgccgtgt gcatggtcga acgtgggatc tgttgcggct 4404181 gaccaggatg ccgaccgttc aggtcgatat cggctacatc accaaccccc acgatcgtgg 4404241 gatgctggtc tcaacgcaga cgcgcgatgc catcgccgaa ggcattctcg ccgcggtcaa 4404301 acggctgtat ctgttaggca agaacgatcg gcccaccggc acattcactt tcgccgagtt 4404361 gctggcccac gaactgtctg tcgagcgagc gggtagactc ggcggttctt aagcccagtg 4404421 gccgcgtggg gtttacgacg tgttgccggc cgtcgacccc gctgctatcg gctcttgcag 4404481 tcgagcattc tccagcaagc gttcaagagc ggcctcgact tcggctttcc accccagccc 4404541 tttgtccagt tcgaggcgta gcctcggaaa gtacgggtgc ggtgccacca cgacgaaacc 4404601 cacgtccatc aagaagttcg cgtcgatgat gcagtgttcg acacagcagt cgccgagggc 4404661 ctccaacacc ggccgcacat caggtgttac cgcgcccggg ttttgcaaat cggtggccgc 4404721 tggtgtccgg ccgaaagctt ccagcgcccg gacgccgcgc cgaaccaact cttcaatcac 4404781 ccgggcaatc agactgtgcg gtaagtcgtc atctgcttgc ccgcgctcga tgcccatcga 4404841 cgtaagcagc accgcgtccg ccgacaccgg cgcggtagga aaccgctggg ctcgcggcac 4404901 cgcactgggc ggagcgtaga gcacataccc gaggcagggt ggttcggcgt ggctgcgctc 4404961 atccgggact gccgttgcga cctgaccgca cgaaccccac tccagcatca ccatcgacaa 4405021 ccaggcttcc ttttcgaatt cggggtcggc gaggtggtcg tctttgccga gaatcgcggg 4405081 gtcgacctcc cagaaaacgc agcgtcgcgc atgcttgggg agctgctcga aggcttcgag 4405141 tcgtaacgct gtgatacgag cggacactag tctcctggcc tccgtgcggc attgcaaccg 4405201 atggccctac acctccgcgg gccaatgtgc accagcaacc cttctagaat aagagagtcg 4405261 atcgctatcg ggccagtatt cgcgatgcca ctccagccga cttgcaccgc atcgtgtccg 4405321 gccggtgaca attgtccggt ccattgcccc gtccaatctc gaatccgctt gccgcacacc 4405381 gcgtctccgt tgattcccgc tccccgcagc gggttggctt aggcgccgga accggcgcgt 4405441 tgtcacagtg acgtaattac agagcgtccc tgtgcaggcc tttatctcgg ccatcagtgg 4405501 tcatcaaacc gactatgcgc gctaaatcat cgaccgagcc gaactccacc acaatcttac 4405561 ccttgcgttt gcccagactg acggtcaccc gcgtgtcaaa ggtggtcgat agacgctcag 4405621 caacatcttg gaggccaggc atctgaatcg gcttacgccg cggcggcgcg ggtgtagtcg 4405681 cgtcgctgtg atgggcttgg cgattggcct cgtgattggc cagcgtgacc gtctcctcgg 4405741 tggctcgcac cgacaggccc tccgcgacga tccggctcgc cagctcctct tgcgcctccg 4405801 gtccggcctc gagcgacagc agggcgcgag catgcccggc cgacagcacg ccggcggcca 4405861 ctcgccgctg taccgggatg gggagtttga gcaatcggat catgttggtg atcaagggcc 4405921 gcgagcggcc gatgcgcgcc gccagttcat cgtgggtgac cccgaattcg tcgagcaatt 4405981 gctggtatgc cgccgcttct tctaacggat tcagctgtac tcgatgaata ttttccagga 4406041 gggcgtcgcg cagcagatta tcgtcgccgg tctcacgcac gatggccggg atggtggcca 4406101 agcccgcctc ttgggcagcc cgccagcgcc gctcccccat cactatctgg tagcgcacgc 4406161 cggtttggga tccagccaat gaccgcacca cgatcggctg caggagaccg aattcgcgga 4406221 tggagtgcac caactcggcc agtgcctctt cgtcgaacac ctgacgcggc tgacggggat 4406281 tagcctcgat ggcgctcggt gggatttccc gatagatggc gcccatcacg gaagtgtccg 4406341 ggaccggtcc gccgattacg acatctgccg tggcagatcc catccgggga cccaaggtcg 4406401 gtggccccga ttctccgtct gccgggccag tcgggatcag cgcagccagg ccacggccga 4406461 ggccaccctt tctgcgtgac ggctgggtca tggtcgtccc ttcgcggatg gtggtcggtc 4406521 acgctcggca agttcgcggc tcgcgtcgag gtaactcatc gcgccgcgcg aaccgggatc 4406581 gtaatcgatg atggtcatgc tgtagcccgg cgcttcggaa accttgacgc tgcgtggaat 4406641 caccgtccgc aacactttgc ttccgaaata ctgacggacc tcgtcggcta cttgatcggc 4406701 gagctttgtc cggccgtcat acatggtaag gatcacggtg gtgacctcga gttgggggtt 4406761 gaggtgggcc ttcaccatct cgatgttgcg cataagctgc gacacaccct ccaacgcgta 4406821 gtactcgcat tggatcggga tcatcacctc cggtgccgcg acgagtgcgt tgatggtcag 4406881 cagccccagc gagggcgggc aatcgacgaa aacgtagtcg aagtcgaagt tgtcgagtgc 4406941 ggccagggcg gtgcgcaacc ggttctcgcg cgccaccatg ctcaccaatt cgatttcggc 4407001 gccggccaga tcgatcgtcg ccgggatgca gaacagccgc tcgctgtgcg ggctgcgccg 4407061 tagcgccgtg tgcaacgaaa cctcgccgat aagcatctcg taggacgagg gtgtgccgga 4407121 ttgccggtcg gtgataccca atgcggtgct cgcgttgccc tggggatcga gatcgatcac 4407181 gagtgtcttg aggccctgca cagcaagcgc ggcagcgata ttgacggcgg tggtcgtctt 4407241 accgaccccg cccttctgat tcgcgatggt gagcacccgg cgtcgacccg gccgctgcag 4407301 cggctcgtgg gtggtgtgca ggacccgcat cgcacgttct gctgcagcgc cgatgggggt 4407361 gtcgaattct gtcgatgttt cacgtgaaac attcatcgtc ggattgtgcg cggcctcagg 4407421 cgtcggtgtc ggtggtgtca tttcccgctg gaatggttcg atagttgaag cctggcccga 4407481 ccttacgagc gcggacggtc cagcggccac cgggccccac ggagcactca cgccgtccct 4407541 ccactcgcca tccgtgccga ccctcgggcg atctgctttc cacgtcgtgc gaacaccacg 4407601 gtcgcgggcg gacgcaaata gttcgcgcca catgtcacca ccctgacatc aaccgcgccc 4407661 gatgcgatca tcacacgccg gtgctcccgt acttcgtcgt gagcccgctc gcctttgatg 4407721 gcgagcattc gcccgttcgg ccgtatcaac ggcatgctcc atttcgtcaa cttgtccaac 4407781 gcggccaccg cccgtgacac cgcagcgtcg ctgccgccca attggtcctg cacccaggac 4407841 tcctcggcgc gcccccgcac gatctcaacg gccacgccca gatctgtcac catctctcga 4407901 agagactcgg tgcggcgcag tagcggttct aggagaacta cctggaggtc cggccgcgct 4407961 atcgccaatg gcacgcccgg caacccggct ccgctaccga tatccacgac ccggtcaccg 4408021 cgttcgagga gctcaccgat cacggcgcag ttcagtagat gccggtccca tagcctaccg 4408081 acttcgcggg gtcccaccag cccccgctcc acaccgggtc ccgccaacgc ttcggcgtac 4408141 cgccgagcaa ggccaagccg cggtccgaag atcgcagacg ccgcgggctc gatcggagac 4408201 attacgcact ccgccggctc gtgaggtctg tgtcatgttt cacgtgaaac attctccgct 4408261 ctcgagacgc tggcccagcc gctcggccac gcatcgctta ctgcggcgtc ggtcggagcc 4408321 gctggctcgc gagctagtcg cggagcacaa cgactcggcg ttctggctcc acgccttcgc 4408381 tttcgctgtg cacacctggc accgctgcaa ccgcatcgtg gacgatcttc cgttcgaacg 4408441 gcgtcattgg aacgagttcc tcgcggtcac cggtttcggc cactcgccgc gccacctcgt 4408501 cggccagcgc cgccaattcc tcccggcgcc gccgtcgcca cctcgcgatg tctagcatca 4408561 accggctccg cacaccggtc ttctgatgca ccgccaaccg ggtgagttcc tgcagagcgt 4408621 cgagcacctc gcccccgcgc ccgaccaact tgttcaggtc gtcactgccg tcgatgctca 4408681 ccaccgcacg attgccttcg acatcgaggt cgatgtcgcc atcgaagtcc aacacgtcca 4408741 ataactcttc caggtagtcg cctgcaatct cgccctcggc gaccaatctc tcttcttgat 4408801 cgtcggcctc gtcagcatcc gtcgccgtgt cctcccggac gcctccaccc ggtgcttctg 4408861 cgtcgacgtc gaagtcggtg gtgtcagcgt cggccatggc ttgctctccc ctcgtctgca 4408921 ggcgggttgt gtttgtggga ccgcctgccc ggctgcccgg aaggattgtc aacgtttgcg 4408981 ttttttcggt cgcaccccgg gccgcggcgt acgggcgctc gggccgctgt tgcgtctggc 4409041 cggattggac gtgtcggctg gtcgctcagt gctggcgtcc gactctgccc cgtcatcggt 4409101 gtccccggct tctgttgggg ctgccgcatt ggtcgctgga gcggtcttcg ggctccgctt 4409161 gggcttagct cccggggccg gcgcgttggc cgcccggcgc cggaccgcct cctgcttttt 4409221 ggcctcctcc tccttttcga tcatgccgaa gacgtaatgc tgctgcccga acgtccagat 4409281 attgttcgag aaccaataca agatgatcgc cagtggcagg aacggtccgc cgacgactac 4409341 gccgagcgga aatacgtaca gcgccagctt gttcatcatc gcggtctgtg gattcgcagc 4409401 cgcctcggcg ctctgccgcg cgatagacgc gcgactgttg aagtacgtcg cgatgccggc 4409461 caagatcatc accggcacac ccaccgcgat caacgcaggg cgactgaaat cgacgaacgc 4409521 atccaacccg gaccgttgcg tcatgtacgc cccgatcgga gcgccgaaca agttggcatc 4409581 taggaagtgg ccgacgtcga ccgggctaaa gacgtagttc ccagtcagtc ggttctcgat 4409641 caccgacaag tgtggttgac caaagccccc ggtcgtacgg ttaaacgagc gcaacacatg 4409701 atagagccca agaaacaccg gaatctgcgc cagcatcggc aaacatccga gaatggggtt 4409761 gaagccgtgc tcgcgttgca gcttttgcat ttcgagcgcc atccgctgac gatccttgcc 4409821 gtatttcttt tgcaaggcct tgatctgtgg ttgcagttcc tgcatctgcc tggtggtgcg 4409881 aatctggcgc acgaacggct tgtacagcag cgcacgcagc gtgaagacca ggaacatcac 4409941 cgacaacgcc caggcgaaga agttggatgg tcctagcaca aacgcgaaca gccggtacca 4410001 aacccacatg atccacgaca ccgggtagta gatgaagtcg agactgaaga aatcaaacaa 4410061 aagactcacg ctcccctcgc tttgacgcag ggttccagtc gtcgttcgcg ccgtcgacgt 4410121 ctgtctggca gctccggcct gtcgttaagc cttccggtat cggatcccat cctccccgat 4410181 gccatggtcc gcactttgcg agcctgatca tggtcaacca gcttccccgc aacaggccat 4410241 actcggtgag cgcatcgacg gcgtactgac tacaggtagg gacaaagcgg cacgacgccg 4410301 gtcgtagcgg cgaaagcatg tgccgataga cctggataac gaaaatcaac ccccgcgctg 4410361 atgctctacc ggtaacccgg accacgcgcc cacagctttg cctagacaga ctcaccgatc 4410421 actacctgcc agttcgacag ccctccgcaa gccgcatcgc agttgctgct ccaaccgagc 4410481 cgaggagaca tgccggctgc tcggcagcgc gcggatcacc acatgatcgg acgggtggag 4410541 ttctttgacg atcgacccag ccacgtgccg cagccgacgt gccacgcggt ggcgttccac 4410601 ggccgacccc accgacttgg cgataatcag tccgacgcgc ggcccaccgc cactcccacg 4410661 ccaccaataa acgaccatgt cagaccgcac ggtacgcatc ccgtgcttca ccgttgtttc 4410721 aaaatccgct gaccgcctca tgcggttgcg tgcacgaagc accgcaaata agcccggtgt 4410781 tgcaatcaag cactgagcgt gcgccgaccc ttgcgtcgcc ggctggacac aattgacctc 4410841 ccggcgcggg tacgcatccg taagcggaaa ccgtgaacac gagctcgccg ccggttgttc 4410901 ggctggaagg tccttttgcc cttggtcacg ggcgtctcct cgctatgtct ggcaacatca 4410961 ccatccggcc actcactgcc ttccaactcg attggcccgc gggacagtcg gaggtggttt 4411021 tcgctgctgg ccggcgcggt ccctggacta atccaggtcg cagccgcatc gccgactttc 4411081 gggcgactgt tcgagggtac ttacgcgcct tcgcctggtc aaacctcgcc cacccggcaa 4411141 ccgcttcagg gcatcctgcc cgctaagctg ctcaccatcc gtacacccga gaccgccaca 4411201 ctcacaaaga acccaccaca acgcaaaaca acggttggca gccgtacgga aaactgttag 4411261 cttcgggcgg tgtagttatc acgccgtttc agcgtggaaa cggcactcga caatcaagcg 4411321 aggatggcgg atcgactagc ggcccggaca acttgaaccg ggtgttttca acacgaggat 4411381 cgcgagccgt tgccggtagg ttgcggctgg ttatcgacgg tactgtccac atttgtggat 4411441 agccatgtgg acagttcacc tgcccacaac aacggttgta gctcgacccg gaaccaagac 4411501 ccggaactaa cgagaaccag ggagatacgt cg //